RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy16189
(262 letters)
>gnl|CDD|238093 cd00162, RING, RING-finger (Really Interesting New Gene) domain, a
specialized type of Zn-finger of 40 to 60 residues that
binds two atoms of zinc; defined by the 'cross-brace'
motif C-X2-C-X(9-39)-C-X(1-3)-
H-X(2-3)-(N/C/H)-X2-C-X(4-48)C-X2-C; probably involved
in mediating protein-protein interactions; identified in
a proteins with a wide range of functions such as viral
replication, signal transduction, and development; has
two variants, the C3HC4-type and a C3H2C3-type (RING-H2
finger), which have different cysteine/histidine
pattern; a subset of RINGs are associated with B-Boxes
(C-X2-H-X7-C-X7-C-X2-C-H-X2-H).
Length = 45
Score = 50.5 bits (121), Expect = 5e-09
Identities = 15/45 (33%), Positives = 20/45 (44%), Gaps = 1/45 (2%)
Query: 207 TCPSCK-VKRKDAVLTKCFHVFCWDCLRTRYETRQRKCPKCNAAF 250
CP C R+ VL C HVFC C+ ++ + CP C
Sbjct: 1 ECPICLEEFREPVVLLPCGHVFCRSCIDKWLKSGKNTCPLCRTPI 45
>gnl|CDD|214546 smart00184, RING, Ring finger. E3 ubiquitin-protein ligase
activity is intrinsic to the RING domain of c-Cbl and is
likely to be a general function of this domain; Various
RING fingers exhibit binding activity towards E2
ubiquitin-conjugating enzymes (Ubc' s).
Length = 40
Score = 49.0 bits (117), Expect = 2e-08
Identities = 16/40 (40%), Positives = 19/40 (47%), Gaps = 1/40 (2%)
Query: 208 CPSCK-VKRKDAVLTKCFHVFCWDCLRTRYETRQRKCPKC 246
CP C KD V+ C H FC C+R E+ CP C
Sbjct: 1 CPICLEEYLKDPVILPCGHTFCRSCIRKWLESGNNTCPIC 40
>gnl|CDD|215715 pfam00097, zf-C3HC4, Zinc finger, C3HC4 type (RING finger). The
C3HC4 type zinc-finger (RING finger) is a cysteine-rich
domain of 40 to 60 residues that coordinates two zinc
ions, and has the consensus sequence:
C-X2-C-X(9-39)-C-X(1-3)-H-X(2-3)-C-X2-C-X(4-48)-C-X2-C
where X is any amino acid. Many proteins containing a
RING finger play a key role in the ubiquitination
pathway.
Length = 40
Score = 47.5 bits (113), Expect = 6e-08
Identities = 15/40 (37%), Positives = 21/40 (52%), Gaps = 1/40 (2%)
Query: 208 CPSCKVKRKDAV-LTKCFHVFCWDCLRTRYETRQRKCPKC 246
CP C + KD V + C H+FC C+ + E+ CP C
Sbjct: 1 CPICLEEPKDPVTILPCGHLFCSKCILSWLESGNVTCPLC 40
>gnl|CDD|206094 pfam13923, zf-C3HC4_2, Zinc finger, C3HC4 type (RING finger).
Length = 45
Score = 46.0 bits (109), Expect = 2e-07
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 206 LTCPSCKVKRKDA-VLTKCFHVFCWDCLRTRYETRQRKCPKCNA 248
L CP C +D VLT C HVFC +C+ RY ++ KCP C
Sbjct: 3 LECPICLDLLRDPVVLTPCGHVFCRECIL-RYLKKKSKCPICRT 45
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
chromosome partitioning].
Length = 1163
Score = 46.2 bits (110), Expect = 1e-05
Identities = 38/208 (18%), Positives = 93/208 (44%), Gaps = 7/208 (3%)
Query: 1 MSLGQVSGNEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLH 60
++ G + A E++ + +++ Q +L ++L+ + L + +
Sbjct: 652 ITGGSRNKRSSLAQKRELKELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQL 711
Query: 61 KLAREEKDIL-------HEQVNALKLQIDAMHTAIRKLEEKERYLQTVLTNVEKELHLRN 113
+ + + L E++ L+ +++ + + +LEE+ LQ L +E+EL
Sbjct: 712 EELERQLEELKRELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLE 771
Query: 114 QAMDLNKRKAIESAQSAADLKLHLEKYHAQLKEAQQVVTEKTSSLEAEAYKTKRLQEEIA 173
+A+ K + E + L+ LE+ +L+EA++ + LE+ + +RL++EI
Sbjct: 772 EALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDALERELESLEQRRERLEQEIE 831
Query: 174 QLRRKTERMKKIEQAGTLDEVMMEEIRE 201
+L + E +++ + +E+ E
Sbjct: 832 ELEEEIEELEEKLDELEEELEELEKELE 859
Score = 40.1 bits (94), Expect = 0.001
Identities = 39/193 (20%), Positives = 90/193 (46%), Gaps = 3/193 (1%)
Query: 9 NEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKD 68
E E L +E + E ++E ++L +++ E ++ L E + + + A D
Sbjct: 751 EELEELQERLEELEEELESLEEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLD 810
Query: 69 ILHEQVNALKLQIDAMHTAIRKLEEKERYLQTVLTNVEKELHLRNQAMDLNKRKAIESAQ 128
L ++ +L+ + + + I +LEE+ L+ L +E+EL + ++ K + E
Sbjct: 811 ALERELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEELEKELEELKEELEELEA 870
Query: 129 SAADLKLHLEKYHAQLKEAQQVVTEKTSSLEAEAYKTKRLQEEIAQLRRKTERMKKIEQA 188
+L+ L++ + +E ++ + E S L + ++L+E + +L +++++E
Sbjct: 871 EKEELEDELKELEEEKEELEEELRELESELAELKEEIEKLRERLEELE---AKLERLEVE 927
Query: 189 GTLDEVMMEEIRE 201
E +EE E
Sbjct: 928 LPELEEELEEEYE 940
Score = 36.6 bits (85), Expect = 0.011
Identities = 34/180 (18%), Positives = 80/180 (44%), Gaps = 3/180 (1%)
Query: 9 NEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKD 68
+ EAL E+E E++++ + L + E ++ L+ E + L + REE
Sbjct: 330 EKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLEELEE---LFEALREELA 386
Query: 69 ILHEQVNALKLQIDAMHTAIRKLEEKERYLQTVLTNVEKELHLRNQAMDLNKRKAIESAQ 128
L ++ ++ +++ + I LEE+ L L ++++EL ++ + + E +
Sbjct: 387 ELEAELAEIRNELEELKREIESLEERLERLSERLEDLKEELKELEAELEELQTELEELNE 446
Query: 129 SAADLKLHLEKYHAQLKEAQQVVTEKTSSLEAEAYKTKRLQEEIAQLRRKTERMKKIEQA 188
+L+ LE+ +LKE ++ + E L+ + L+ + +L + + +
Sbjct: 447 ELEELEEQLEELRDRLKELERELAELQEELQRLEKELSSLEARLDRLEAEQRASQGVRAV 506
Score = 34.3 bits (79), Expect = 0.057
Identities = 23/124 (18%), Positives = 56/124 (45%)
Query: 61 KLAREEKDILHEQVNALKLQIDAMHTAIRKLEEKERYLQTVLTNVEKELHLRNQAMDLNK 120
EE + L E++ + +I+ + + + +L E+ LQ L +++E+ + L +
Sbjct: 249 SRLEEELEELQEELEEAEKEIEELKSELEELREELEELQEELLELKEEIEELEGEISLLR 308
Query: 121 RKAIESAQSAADLKLHLEKYHAQLKEAQQVVTEKTSSLEAEAYKTKRLQEEIAQLRRKTE 180
+ E +L+ LE+ +++ ++ + E+ + LE L+E +L K
Sbjct: 309 ERLEELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLS 368
Query: 181 RMKK 184
+ +
Sbjct: 369 ALLE 372
Score = 32.0 bits (73), Expect = 0.31
Identities = 31/135 (22%), Positives = 68/135 (50%), Gaps = 10/135 (7%)
Query: 73 QVNALKLQIDAMHTAIRKLEEKERYLQTVLTNVEKELHLRNQAMDLNKRKAIESAQSAAD 132
++ L+ ++ + + KLEE+ + L+ L ++E L + ++ +R+ E + A
Sbjct: 668 ELKELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELERQLEELKRELAA 727
Query: 133 LKLHLEKYHAQLKEAQQVVTEKTSSLEAEAYKTKRLQEEIAQLRRKTERMKKIEQAGTLD 192
L+ LE+ ++L+E ++ + E LE LQE + +L E ++ +E+A
Sbjct: 728 LEEELEQLQSRLEELEEELEELEEELE-------ELQERLEELE---EELESLEEALAKL 777
Query: 193 EVMMEEIREYKETLT 207
+ +EE+ E ++ L
Sbjct: 778 KEEIEELEEKRQALQ 792
Score = 30.8 bits (70), Expect = 0.85
Identities = 28/141 (19%), Positives = 65/141 (46%)
Query: 66 EKDILHEQVNALKLQIDAMHTAIRKLEEKERYLQTVLTNVEKELHLRNQAMDLNKRKAIE 125
E +L ++ L+ +++ + + +LEE+ LQ L EKE+ ++ + + E
Sbjct: 226 ELALLLAKLKELRKELEELEEELSRLEEELEELQEELEEAEKEIEELKSELEELREELEE 285
Query: 126 SAQSAADLKLHLEKYHAQLKEAQQVVTEKTSSLEAEAYKTKRLQEEIAQLRRKTERMKKI 185
+ +LK +E+ ++ ++ + E + LE + + L+E+I L+ + E + +
Sbjct: 286 LQEELLELKEEIEELEGEISLLRERLEELENELEELEERLEELKEKIEALKEELEERETL 345
Query: 186 EQAGTLDEVMMEEIREYKETL 206
+ +EE +E E
Sbjct: 346 LEELEQLLAELEEAKEELEEK 366
Score = 28.9 bits (65), Expect = 3.2
Identities = 21/125 (16%), Positives = 53/125 (42%), Gaps = 1/125 (0%)
Query: 8 GNEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEK 67
E L E + +++ + + + +L E L + ++ + +EE
Sbjct: 368 SALLEELEELFEALREELAELEAELAEIRNELEELKREIESLEERLERLSERLEDLKEEL 427
Query: 68 DILHEQVNALKLQIDAMHTAIRKLEEKERYLQTVLTNVEKELHLRNQAMDLNKRKAIESA 127
L ++ L+ +++ ++ + +LEE+ L+ L +E+EL + + K + S
Sbjct: 428 KELEAELEELQTELEELNEELEELEEQLEELRDRLKELERELAELQEELQ-RLEKELSSL 486
Query: 128 QSAAD 132
++ D
Sbjct: 487 EARLD 491
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
primarily archaeal type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. It is found
in a single copy and is homodimeric in prokaryotes, but
six paralogs (excluded from this family) are found in
eukarotes, where SMC proteins are heterodimeric. This
family represents the SMC protein of archaea and a few
bacteria (Aquifex, Synechocystis, etc); the SMC of other
bacteria is described by TIGR02168. The N- and
C-terminal domains of this protein are well conserved,
but the central hinge region is skewed in composition
and highly divergent [Cellular processes, Cell division,
DNA metabolism, Chromosome-associated proteins].
Length = 1164
Score = 44.3 bits (105), Expect = 4e-05
Identities = 40/185 (21%), Positives = 94/185 (50%), Gaps = 4/185 (2%)
Query: 26 EDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDILHEQVNALKLQIDAMH 85
++E+ SR+ +LRE + +L E+ + + +E++ L EQ+ +++ +I+ ++
Sbjct: 801 SKLEEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLN 860
Query: 86 TAIRKLEEKERYLQTVLTNVEKEL-HLRNQAMDL-NKRKAIESAQSAADLKLHLEKYH-A 142
+LEE+ L+ L ++E L L+ + +L + + +E + ++ ++ +
Sbjct: 861 GKKEELEEELEELEAALRDLESRLGDLKKERDELEAQLRELERKIEELEAQIEKKRKRLS 920
Query: 143 QLKEAQQVVTEKTSSLEAEAYKTKRLQEEIAQLRRKTERMKKIEQA-GTLDEVMMEEIRE 201
+LK + + E+ S +E + + + EE L ++++E+ L+ V M I+E
Sbjct: 921 ELKAKLEALEEELSEIEDPKGEDEEIPEEELSLEDVQAELQRVEEEIRALEPVNMLAIQE 980
Query: 202 YKETL 206
Y+E L
Sbjct: 981 YEEVL 985
Score = 38.1 bits (89), Expect = 0.004
Identities = 51/203 (25%), Positives = 93/203 (45%), Gaps = 13/203 (6%)
Query: 10 EEEALLNEMEVTGQAFEDMQEQNSRLLQQL----REKDDANFKLMTERIKSNQLHKLARE 65
++E L E+E + E + QQL RE++ A ER ++ K E
Sbjct: 171 KKEKALEELEEVEENIERLDLIIDEKRQQLERLRREREKA------ERYQALLKEKREYE 224
Query: 66 EKDILHEQVNALKLQIDAMHTAIRKLEEKERYLQTVLTNVEKELHLRNQAM-DLNKRKAI 124
++L E AL+ Q +A+ + LEE+ L ++ +EK L Q + +LNK+
Sbjct: 225 GYELLKE-KEALERQKEAIERQLASLEEELEKLTEEISELEKRLEEIEQLLEELNKKIKD 283
Query: 125 ESAQSAADLKLHLEKYHAQLKEAQQVVTEKTSSLEAEAYKTKRLQEEIAQLRRKTERMKK 184
+ +K + + A++ ++ + EK LE + +L+ EI +L + E +++
Sbjct: 284 LGEEEQLRVKEKIGELEAEIASLERSIAEKERELEDAEERLAKLEAEIDKLLAEIEELER 343
Query: 185 -IEQAGTLDEVMMEEIREYKETL 206
IE+ + + EE E KE L
Sbjct: 344 EIEEERKRRDKLTEEYAELKEEL 366
Score = 37.0 bits (86), Expect = 0.009
Identities = 36/191 (18%), Positives = 78/191 (40%), Gaps = 13/191 (6%)
Query: 26 EDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDILHEQVNALKLQIDAMH 85
E ++ + S L +LR ++ +L E +++ +E + L ++ LK +++ +
Sbjct: 684 EGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEELE 743
Query: 86 TAIRKLEEKERYLQTVLTNVEKEL--------HLRNQAMDLNKRKAIESAQSAADLKLHL 137
+ LE++ +++ L +E + L DL R + L
Sbjct: 744 EDLSSLEQEIENVKSELKELEARIEELEEDLHKLEEALNDLEARLSHSRIPEIQAELSKL 803
Query: 138 EKYHAQLKEAQQVVTEKTSSLEAE-AYKTKRLQEEIAQLR----RKTERMKKIEQAGTLD 192
E+ ++++ + + +K + L E Y K +QE Q + K+IE
Sbjct: 804 EEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKK 863
Query: 193 EVMMEEIREYK 203
E + EE+ E +
Sbjct: 864 EELEEELEELE 874
Score = 35.8 bits (83), Expect = 0.023
Identities = 43/214 (20%), Positives = 91/214 (42%), Gaps = 17/214 (7%)
Query: 9 NEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERI--KSNQLHKLAREE 66
E E L E E + E+++E S L +Q E + K + RI LHKL
Sbjct: 723 KEIEQLEQEEEKLKERLEELEEDLSSL-EQEIENVKSELKELEARIEELEEDLHKLEEAL 781
Query: 67 KDI----LHEQVNALKLQIDAMHTAIRKLEEKERYLQTVLTNVEKEL--------HLRNQ 114
D+ H ++ ++ ++ + + ++E + R ++ L + E L+ Q
Sbjct: 782 NDLEARLSHSRIPEIQAELSKLEEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQELQEQ 841
Query: 115 AMDLNKRKAIESAQSAADLKLHLEKYHAQLKEAQQVVTEKTSSLEAEAYKTKRLQEEIAQ 174
+DL ++ + +L E+ +L+E + + + S L + L+ ++ +
Sbjct: 842 RIDLKEQIK-SIEKEIENLNGKKEELEEELEELEAALRDLESRLGDLKKERDELEAQLRE 900
Query: 175 LRRKTERMK-KIEQAGTLDEVMMEEIREYKETLT 207
L RK E ++ +IE+ + ++ +E L+
Sbjct: 901 LERKIEELEAQIEKKRKRLSELKAKLEALEEELS 934
Score = 28.9 bits (65), Expect = 3.6
Identities = 31/167 (18%), Positives = 69/167 (41%), Gaps = 21/167 (12%)
Query: 22 GQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDILHEQVNALKLQI 81
+ + +++E+ L +L E D + E K RE+ + L ++N LK ++
Sbjct: 356 TEEYAELKEELEDLRAELEEVDKEFAETRDE-------LKDYREKLEKLKREINELKREL 408
Query: 82 DAMHTAIRKLEEKERYLQTVLTNVEKELHLRNQAMDLNKRKAIESAQSAADLKLHLEKYH 141
D R EE +R E+ L + + E + D L ++K
Sbjct: 409 D------RLQEELQRL-------SEELADLNAAIAGIEAKIN-ELEEEKEDKALEIKKQE 454
Query: 142 AQLKEAQQVVTEKTSSLEAEAYKTKRLQEEIAQLRRKTERMKKIEQA 188
+L++ +++ L + R+++E+++L+R+ + +A
Sbjct: 455 WKLEQLAADLSKYEQELYDLKEEYDRVEKELSKLQRELAEAEAQARA 501
>gnl|CDD|222279 pfam13639, zf-RING_2, Ring finger domain.
Length = 46
Score = 39.7 bits (93), Expect = 5e-05
Identities = 14/45 (31%), Positives = 20/45 (44%), Gaps = 4/45 (8%)
Query: 207 TCPSCK---VKRKDAVLTKCFHVFCWDCLRTRYETRQRKCPKCNA 248
CP C ++ V+ C HVF +CL ++ CP C A
Sbjct: 2 ECPICLDEFEPGEEVVVLPCGHVFHKECLD-KWLRSSNTCPLCRA 45
>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
Length = 880
Score = 43.1 bits (102), Expect = 8e-05
Identities = 38/201 (18%), Positives = 93/201 (46%), Gaps = 11/201 (5%)
Query: 9 NEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKS-NQLHKLAREEK 67
+ E L ++ + ++++E+ + LL++L E + + + ER+K + E K
Sbjct: 549 EKLEELKKKLAELEKKLDELEEELAELLKELEELGFESVEELEERLKELEPFYNEYLELK 608
Query: 68 DILHEQVNALKLQIDAMHTAIRKLEEKERYLQTVLTNVEKELHLRNQAMDLNKRKAIESA 127
D E L+ + + +L++ L +E+ LR + +L K+ + E
Sbjct: 609 DAEKE----LEREEKELKKLEEELDKAFEELAETEKRLEE---LRKELEELEKKYSEEEY 661
Query: 128 QSAADLKLHLEKYHAQLKEAQQVVTEKTSSLEAEAYKTKRLQEEIAQLRRKTERMKKIEQ 187
+ + L L + A L+ + + ++ ++ ++L+EE+ + + + ++K+E+
Sbjct: 662 EELREEYLELSRELAGLRAELEELEKRREEIKKTL---EKLKEELEEREKAKKELEKLEK 718
Query: 188 AGTLDEVMMEEIREYKETLTC 208
A E + E++++YK L
Sbjct: 719 ALERVEELREKVKKYKALLKE 739
Score = 38.9 bits (91), Expect = 0.002
Identities = 47/209 (22%), Positives = 102/209 (48%), Gaps = 10/209 (4%)
Query: 9 NEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKD 68
E L ++ T E ++E+ L + LRE ++ + +L R + +L K +E +
Sbjct: 176 RRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKE-VKELE 234
Query: 69 ILHEQVNALKLQIDAMHTAIRKLEEKERYLQTVLTNVEKELH-LRNQAMDLNK------- 120
L E++ L+ +++++ + RKLEEK R L+ + ++KE+ L + +L +
Sbjct: 235 ELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVKELKELKEKAEE 294
Query: 121 -RKAIESAQSAADLKLHLEKYHAQLKEAQQVVTEKTSSLEAEAYKTKRLQEEIAQLRRKT 179
K E + D +EK ++L+E + E+ LE + + + L++++ +L ++
Sbjct: 295 YIKLSEFYEEYLDELREIEKRLSRLEEEINGIEERIKELEEKEERLEELKKKLKELEKRL 354
Query: 180 ERMKKIEQAGTLDEVMMEEIREYKETLTC 208
E +++ + + EE+ K+ LT
Sbjct: 355 EELEERHELYEEAKAKKEELERLKKRLTG 383
>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
bacterial type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. This family
represents the SMC protein of most bacteria. The smc gene
is often associated with scpB (TIGR00281) and scpA genes,
where scp stands for segregation and condensation
protein. SMC was shown (in Caulobacter crescentus) to be
induced early in S phase but present and bound to DNA
throughout the cell cycle [Cellular processes, Cell
division, DNA metabolism, Chromosome-associated
proteins].
Length = 1179
Score = 41.6 bits (98), Expect = 3e-04
Identities = 41/198 (20%), Positives = 86/198 (43%), Gaps = 23/198 (11%)
Query: 9 NEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKD 68
E+L + T + ED++EQ L + + +L + + E+
Sbjct: 824 ERLESLERRIAATERRLEDLEEQIEELSEDIESLAAEIEELEELIEELESELEALLNERA 883
Query: 69 ILHEQVNALKLQIDAMHTAIRKLEEKERYLQTVLTNVEKELHLRNQAMDLNKRKAIESAQ 128
L E + L+ +++ + +R+LE K L+ R+ E +
Sbjct: 884 SLEEALALLRSELEELSEELRELESKRSELR---------------------RELEELRE 922
Query: 129 SAADLKLHLEKYHAQLKEAQQVVTEKTS-SLEAEAYKTKRLQEEIAQLRRKTERMK-KIE 186
A L+L LE ++ Q+ ++E+ S +LE +++++ + RR+ +R++ KI+
Sbjct: 923 KLAQLELRLEGLEVRIDNLQERLSEEYSLTLEEAEALENKIEDDEEEARRRLKRLENKIK 982
Query: 187 QAGTLDEVMMEEIREYKE 204
+ G ++ +EE E KE
Sbjct: 983 ELGPVNLAAIEEYEELKE 1000
Score = 37.0 bits (86), Expect = 0.010
Identities = 25/169 (14%), Positives = 66/169 (39%), Gaps = 9/169 (5%)
Query: 29 QEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDILHEQVNALKLQIDAMHTAI 88
+E+ + L +QL E + +L ++ + + E+ + L E++ +L+ +++ + +
Sbjct: 308 RERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAEL 367
Query: 89 RKLEEKERYLQTVLTNVEKELHLRNQAMDLNKRKAIESAQSAADLKLHLEKYHAQLKEAQ 148
+LE + L+ L + ++ + + L+ E+ +++E
Sbjct: 368 EELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQEIEELL 427
Query: 149 Q---------VVTEKTSSLEAEAYKTKRLQEEIAQLRRKTERMKKIEQA 188
+ + E E + L+ L E +++ EQA
Sbjct: 428 KKLEEAELKELQAELEELEEELEELQEELERLEEALEELREELEEAEQA 476
Score = 37.0 bits (86), Expect = 0.010
Identities = 29/142 (20%), Positives = 60/142 (42%), Gaps = 2/142 (1%)
Query: 64 REEKDILHEQVNALKLQIDAMHTAIRKLEEKERYLQTVLTNVEKELHLRNQAMDLNKRKA 123
R E + L E++ L+ +I + A+ +L ++ L+ L + KEL ++ + ++
Sbjct: 676 RREIEELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDL 735
Query: 124 IESAQSAADLKLHLEKYHAQLKEAQQVVTEKTSSLEAEAYKTKRLQEEIAQLRRKTERMK 183
L+ + + +L E + + E LE + + EI +L + E++K
Sbjct: 736 ARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLK 795
Query: 184 KIEQAGTLDEVMMEEIREYKET 205
E+ L E + E E
Sbjct: 796 --EELKALREALDELRAELTLL 815
Score = 36.2 bits (84), Expect = 0.018
Identities = 35/196 (17%), Positives = 78/196 (39%), Gaps = 15/196 (7%)
Query: 13 ALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDILHE 72
AL ++ E ++E+ ++L ++L E + +L ++ + A E + L
Sbjct: 730 ALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEA 789
Query: 73 QVNALKLQIDAMHTAIRKLEEKERYLQTVLTNVEKELHLRNQAMDLNKRKAIESAQSAAD 132
Q+ LK ++ A+ A+ +L EL L N+ + + + A
Sbjct: 790 QIEQLKEELKALREALDELR--------------AELTLLNEEAANLRERLESLERRIAA 835
Query: 133 LKLHLEKYHAQLKEAQQVVTEKTSSLEAEAYKTKRLQEEIAQL-RRKTERMKKIEQAGTL 191
+ LE Q++E + + + +E + L+ E+ L + + + +
Sbjct: 836 TERRLEDLEEQIEELSEDIESLAAEIEELEELIEELESELEALLNERASLEEALALLRSE 895
Query: 192 DEVMMEEIREYKETLT 207
E + EE+RE + +
Sbjct: 896 LEELSEELRELESKRS 911
Score = 35.0 bits (81), Expect = 0.038
Identities = 39/174 (22%), Positives = 83/174 (47%), Gaps = 14/174 (8%)
Query: 5 QVSGNEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAR 64
+ E L E+E + ++ +E+ L +L+E ++ KL R++ ++L +
Sbjct: 228 ALLVLRLEELREELEELQEELKEAEEELEELTAELQELEE---KLEELRLEVSELEEEIE 284
Query: 65 EEKDILHEQVNALKLQIDAMHTAIRKLEEKERYLQTVLTNVEKELHLRNQAMDLNKRKAI 124
E + L+ N I +LE++++ L+ L N+E++L ++ + K
Sbjct: 285 ELQKELYALANE-----------ISRLEQQKQILRERLANLERQLEELEAQLEELESKLD 333
Query: 125 ESAQSAADLKLHLEKYHAQLKEAQQVVTEKTSSLEAEAYKTKRLQEEIAQLRRK 178
E A+ A+L+ LE+ +L+ + + E + LE + + L+E++ LR K
Sbjct: 334 ELAEELAELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQLETLRSK 387
Score = 35.0 bits (81), Expect = 0.040
Identities = 23/167 (13%), Positives = 62/167 (37%)
Query: 9 NEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKD 68
+ L + + E+++E+ +L ++L E L + + + E
Sbjct: 691 EKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIA 750
Query: 69 ILHEQVNALKLQIDAMHTAIRKLEEKERYLQTVLTNVEKELHLRNQAMDLNKRKAIESAQ 128
L +++ L+ +I+ + + + EE+ + + +E ++ + + + E
Sbjct: 751 QLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDELRA 810
Query: 129 SAADLKLHLEKYHAQLKEAQQVVTEKTSSLEAEAYKTKRLQEEIAQL 175
L +L+ ++ + LE + + L E+I L
Sbjct: 811 ELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIESL 857
Score = 35.0 bits (81), Expect = 0.044
Identities = 25/138 (18%), Positives = 62/138 (44%), Gaps = 1/138 (0%)
Query: 47 FKLMTERIKSNQLHKLAREEKDILHEQVNALKLQIDAMHTAIRKLEEKERYLQTVLTNVE 106
+K + ++ +L L ++ L E++ L+ ++ + +L + + L+ L +
Sbjct: 215 YKELKAELRELELALLVLRLEE-LREELEELQEELKEAEEELEELTAELQELEEKLEELR 273
Query: 107 KELHLRNQAMDLNKRKAIESAQSAADLKLHLEKYHAQLKEAQQVVTEKTSSLEAEAYKTK 166
E+ + ++ +++ A + L+ + +L ++ + E + LE K
Sbjct: 274 LEVSELEEEIEELQKELYALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLD 333
Query: 167 RLQEEIAQLRRKTERMKK 184
L EE+A+L K E +K+
Sbjct: 334 ELAEELAELEEKLEELKE 351
Score = 34.3 bits (79), Expect = 0.067
Identities = 35/173 (20%), Positives = 78/173 (45%), Gaps = 5/173 (2%)
Query: 9 NEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKD 68
+E L E+E + E+++E+ L +L E + +L + + + + R +
Sbjct: 333 DELAEELAELE---EKLEELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKVA 389
Query: 69 ILHEQVNALKLQIDAMHTAIRKLEEKERYLQTVLTNVEKELHLRNQAMDLNKRKAIESAQ 128
L Q+ +L +I+ + + +LE++ LQ + + K+L + + + E +
Sbjct: 390 QLELQIASLNNEIERLEARLERLEDRRERLQQEIEELLKKLE--EAELKELQAELEELEE 447
Query: 129 SAADLKLHLEKYHAQLKEAQQVVTEKTSSLEAEAYKTKRLQEEIAQLRRKTER 181
+L+ LE+ L+E ++ + E +L+A + +LQ + L R E
Sbjct: 448 ELEELQEELERLEEALEELREELEEAEQALDAAERELAQLQARLDSLERLQEN 500
Score = 29.3 bits (66), Expect = 2.6
Identities = 27/165 (16%), Positives = 66/165 (40%), Gaps = 18/165 (10%)
Query: 61 KLAREEKDILHEQVNALKLQIDAMHTAIRKLEEKERYLQTVLTNVEKEL-------HLRN 113
+ E+ L + + L+ +++ + + +L ++ L ++ + K+L
Sbjct: 687 EELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLE 746
Query: 114 QAMDLNKRKAIESAQSAADLKLHLEKYHAQLKEA-------QQVVTEKTSSLEAEAYKTK 166
+ + ++ E +L+ LE+ +L EA + + + L+A
Sbjct: 747 ERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALD 806
Query: 167 RLQEEIAQLRR----KTERMKKIEQAGTLDEVMMEEIREYKETLT 207
L+ E+ L ER++ +E+ E +E++ E E L+
Sbjct: 807 ELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELS 851
Score = 28.9 bits (65), Expect = 3.7
Identities = 36/188 (19%), Positives = 88/188 (46%)
Query: 10 EEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDI 69
E E L E++ + E+++ + S L +++ E + L E + Q ++ RE
Sbjct: 254 ELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALANEISRLEQQKQILRERLAN 313
Query: 70 LHEQVNALKLQIDAMHTAIRKLEEKERYLQTVLTNVEKELHLRNQAMDLNKRKAIESAQS 129
L Q+ L+ Q++ + + + +L E+ L+ L +++EL ++ + + E
Sbjct: 314 LERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEELESR 373
Query: 130 AADLKLHLEKYHAQLKEAQQVVTEKTSSLEAEAYKTKRLQEEIAQLRRKTERMKKIEQAG 189
+L+ LE +++ + + + + +E + +RL++ +L+++ E + K +
Sbjct: 374 LEELEEQLETLRSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQEIEELLKKLEEA 433
Query: 190 TLDEVMME 197
L E+ E
Sbjct: 434 ELKELQAE 441
>gnl|CDD|193335 pfam12861, zf-Apc11, Anaphase-promoting complex subunit 11 RING-H2
finger. Apc11 is one of the subunits of the
anaphase-promoting complex or cyclosome. The APC
subunits are cullin family proteins with ubiquitin
ligase activity. Polyubiquitination marks proteins for
degradation by the 26S proteasome and is carried out by
a cascade of enzymes that includes ubiquitin-activating
enzymes (E1s), ubiquitin-conjugating enzymes (E2s), and
ubiquitin ligases (E3s). Apc11 acts as an E3 enzyme and
is responsible for recruiting E2s to the APC and for
mediating the subsequent transfer of ubiquitin to APC
substrates in vivo. In Saccharomyces cerevisiae this
RING-H2 finger protein defines the minimal ubiquitin
ligase activity of the APC, and the integrity of the
RING-H2 finger is essential for budding yeast cell
viability.
Length = 85
Score = 37.1 bits (86), Expect = 8e-04
Identities = 18/48 (37%), Positives = 19/48 (39%), Gaps = 4/48 (8%)
Query: 207 TCPSCKVKRKDA--VLTKCFHVFCWDCLRTRYETRQRK--CPKCNAAF 250
TCP CK D V KC H F C+ T K CP C F
Sbjct: 34 TCPQCKFPGDDCPLVWGKCKHNFHMHCILKWLATETSKGLCPMCRQTF 81
>gnl|CDD|226809 COG4372, COG4372, Uncharacterized protein conserved in bacteria
with the myosin-like domain [Function unknown].
Length = 499
Score = 40.0 bits (93), Expect = 8e-04
Identities = 33/185 (17%), Positives = 78/185 (42%), Gaps = 4/185 (2%)
Query: 26 EDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDILHEQVNALKLQIDAMH 85
+D++ Q L +L TER + + AR+E++ + +++ A + +
Sbjct: 77 DDIRPQLRALRTELGTAQGEKRAAETEREAARSELQKARQEREAVRQELAAARQNLAKAQ 136
Query: 86 TAIRKLEEKERYLQTVLTNVEKELHLRNQAMD--LNKRKAIESAQSAADLK-LHLEKYHA 142
+ +L ++ + LQT L + ++ +K ++++ + + L L+ A
Sbjct: 137 QELARLTKQAQDLQTRLKTLAEQRRQLEAQAQSLQASQKQLQASATQLKSQVLDLKLRSA 196
Query: 143 QLKEAQQVVTEKTSSLEAEAYKTKRLQEEIAQLRRK-TERMKKIEQAGTLDEVMMEEIRE 201
Q+++ Q + + ++ +A + R Q + +R +I Q E+IRE
Sbjct: 197 QIEQEAQNLATRANAAQARTEELARRAAAAQQTAQAIQQRDAQISQKAQQIAARAEQIRE 256
Query: 202 YKETL 206
+ L
Sbjct: 257 RERQL 261
Score = 33.5 bits (76), Expect = 0.12
Identities = 32/184 (17%), Positives = 72/184 (39%), Gaps = 11/184 (5%)
Query: 5 QVSGNEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAR 64
Q + E EA+ E+ Q Q++ +RL +Q ++ L +R + + +
Sbjct: 112 QKARQEREAVRQELAAARQNLAKAQQELARLTKQAQDLQTRLKTLAEQRRQLEAQAQSLQ 171
Query: 65 EEKDILHEQVNALKLQIDAMHTAIRKLEEKERYLQTVLTNVEKELHLRNQAMDLNKRKAI 124
+ L LK Q+ + ++E++ + L T R A +
Sbjct: 172 ASQKQLQASATQLKSQVLDLKLRSAQIEQEAQNLAT-----------RANAAQARTEELA 220
Query: 125 ESAQSAADLKLHLEKYHAQLKEAQQVVTEKTSSLEAEAYKTKRLQEEIAQLRRKTERMKK 184
A +A +++ AQ+ + Q + + + + +RL+ A+L ++ +++
Sbjct: 221 RRAAAAQQTAQAIQQRDAQISQKAQQIAARAEQIRERERQLQRLETAQARLEQEVAQLEA 280
Query: 185 IEQA 188
QA
Sbjct: 281 YYQA 284
Score = 31.5 bits (71), Expect = 0.40
Identities = 31/198 (15%), Positives = 75/198 (37%), Gaps = 7/198 (3%)
Query: 12 EALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDILH 71
AL E+ + + +L++ + E + Q A++E L
Sbjct: 84 RALRTELGTAQGEKRAAETEREAARSELQKARQEREAVRQELAAARQNLAKAQQELARLT 143
Query: 72 EQVNALKLQIDAMHTAIRKLEEKERYL---QTVLTNVEKELHLRNQAMDLNKRKAIESAQ 128
+Q L+ ++ + R+LE + + L Q L +L + + L + + AQ
Sbjct: 144 KQAQDLQTRLKTLAEQRRQLEAQAQSLQASQKQLQASATQLKSQVLDLKLRSAQIEQEAQ 203
Query: 129 SAADLKLHLEKYHAQLKEAQQVVTEKTSSLEAEAYKTKRLQEEIA----QLRRKTERMKK 184
+ A + +L + +++ + + ++IA Q+R + ++++
Sbjct: 204 NLATRANAAQARTEELARRAAAAQQTAQAIQQRDAQISQKAQQIAARAEQIRERERQLQR 263
Query: 185 IEQAGTLDEVMMEEIREY 202
+E A E + ++ Y
Sbjct: 264 LETAQARLEQEVAQLEAY 281
>gnl|CDD|222454 pfam13920, zf-C3HC4_3, Zinc finger, C3HC4 type (RING finger).
Length = 49
Score = 35.0 bits (81), Expect = 0.002
Identities = 12/46 (26%), Positives = 19/46 (41%), Gaps = 3/46 (6%)
Query: 204 ETLTCPSCKVKRKDAVLTKCFH-VFCWDCLRTRYETRQRKCPKCNA 248
E C C + ++ V C H C +C + ++KCP C
Sbjct: 1 EDDLCVICLERPRNVVFLPCGHLCLCEECAKRLR--SKKKCPICRQ 44
>gnl|CDD|233043 TIGR00599, rad18, DNA repair protein rad18. All proteins in this
family for which functions are known are involved in
nucleotide excision repair.This family is based on the
phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
Stanford University) [DNA metabolism, DNA replication,
recombination, and repair].
Length = 397
Score = 38.4 bits (89), Expect = 0.003
Identities = 20/45 (44%), Positives = 22/45 (48%), Gaps = 1/45 (2%)
Query: 206 LTCPSCKVKRKDAVLTKCFHVFCWDCLRTRYETRQRKCPKCNAAF 250
L C CK VLT C H FC C+R R + Q KCP C A
Sbjct: 27 LRCHICKDFFDVPVLTSCSHTFCSLCIR-RCLSNQPKCPLCRAED 70
>gnl|CDD|221952 pfam13166, AAA_13, AAA domain. This family of domains contain a
P-loop motif that is characteristic of the AAA
superfamily. Many of the proteins in this family are
conjugative transfer proteins. This family includes the
PrrC protein that is thought to be the active component
of the anticodon nuclease.
Length = 713
Score = 37.7 bits (88), Expect = 0.006
Identities = 25/177 (14%), Positives = 73/177 (41%), Gaps = 24/177 (13%)
Query: 24 AFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDILHEQVNALKLQIDA 83
++++ EQ L+ + + + + + + + + + + + L + AL+ ++
Sbjct: 279 EYQELIEQLEELIDKYESHIEKALEELESILDTEKENSEFKLDVEELKALLEALEEILE- 337
Query: 84 MHTAIRKLEEKERYLQTVLT---------NVEKELHLRNQAMDLNKRKA--IESAQSAA- 131
++KLEEK + T + ++ + N+ + + K ++ ++ A
Sbjct: 338 --KNLQKLEEKLKDPSTSIELESITDLIESINDIIDAINELIREHNEKIDNLKKEKNKAK 395
Query: 132 -DLKLH--------LEKYHAQLKEAQQVVTEKTSSLEAEAYKTKRLQEEIAQLRRKT 179
L LH ++ Y + K ++ + ++ + K L++EI +L ++
Sbjct: 396 KKLWLHLVAELKEDIDAYQKEKKGLEKAINSLEKEIKQLEAEIKALEKEIKELEKQL 452
>gnl|CDD|227861 COG5574, PEX10, RING-finger-containing E3 ubiquitin ligase
[Posttranslational modification, protein turnover,
chaperones].
Length = 271
Score = 36.8 bits (85), Expect = 0.007
Identities = 25/107 (23%), Positives = 38/107 (35%), Gaps = 9/107 (8%)
Query: 147 AQQVVTEKTSSLEAEA--YKTKRLQEEIAQLRRKTERMKKI--EQAGTLDEVMMEEIREY 202
+ E S L A A RLQ + + I E + + + +Y
Sbjct: 157 LAMNIPEVISDLTAVALSLDESRLQPILQPSNNLHTLFQVITKENLSKKNGLPFIPLADY 216
Query: 203 KETLTCPSCKVKRKDAVLTKCFHVFCWDCLRTRYETRQ-RKCPKCNA 248
K C C + + T C H+FC CL + ++ CP C A
Sbjct: 217 K----CFLCLEEPEVPSCTPCGHLFCLSCLLISWTKKKYEFCPLCRA 259
>gnl|CDD|224495 COG1579, COG1579, Zn-ribbon protein, possibly nucleic acid-binding
[General function prediction only].
Length = 239
Score = 36.6 bits (85), Expect = 0.007
Identities = 35/155 (22%), Positives = 65/155 (41%), Gaps = 13/155 (8%)
Query: 9 NEEEALLNEMEVTGQAFEDMQEQNSRL---LQQLREKDDANFKLMTERIKSNQLHKLARE 65
E EAL +E ED++ Q S+L +Q++RE+ + ++ +L L E
Sbjct: 38 AELEALNKALEALEIELEDLENQVSQLESEIQEIRERIKRAEEKLSAVKDERELRALNIE 97
Query: 66 EKDILHEQVNALKLQIDAMHTAIRKLEEKERYLQTVLTNVEKELHLRNQAMDLNKRKAIE 125
I E++N+L+ ++ + I KLE++ L+ L +EK L ++ + E
Sbjct: 98 I-QIAKERINSLEDELAELMEEIEKLEKEIEDLKERLERLEKNLAEAEARLEEEVAEIRE 156
Query: 126 SAQSAADLKLHL---------EKYHAQLKEAQQVV 151
Q + + L +Y K + V
Sbjct: 157 EGQELSSKREELKEKLDPELLSEYERIRKNKKGVG 191
>gnl|CDD|222135 pfam13445, zf-RING_LisH, RING-type zinc-finger, LisH dimerisation
motif. This zinc-finger is the dimerisation motif for
LisH proteins, and is also a typical RING-type of plant
ubiquitin ligases.
Length = 55
Score = 33.1 bits (76), Expect = 0.013
Identities = 12/54 (22%), Positives = 17/54 (31%), Gaps = 5/54 (9%)
Query: 196 MEEIREYKETLTCPSCKVKRKDA----VLTKCFHVFCWDCLRT-RYETRQRKCP 244
+ + CP K D V+ C HV+ L + KCP
Sbjct: 1 LPYAHHFHSIFVCPISKEVMTDEENPPVMLPCGHVYSRKALEKLAKNGGKFKCP 54
>gnl|CDD|192111 pfam08647, BRE1, BRE1 E3 ubiquitin ligase. BRE1 is an E3 ubiquitin
ligase that has been shown to act as a transcriptional
activator through direct activator interactions.
Length = 96
Score = 33.8 bits (78), Expect = 0.016
Identities = 22/96 (22%), Positives = 46/96 (47%)
Query: 14 LLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDILHEQ 73
L E+ +AF+++ + + +L + +L E+ K++Q + A KD L +
Sbjct: 1 LQKELVSLEKAFKELSSLALKKVAELTILEQKKLRLTAEKAKADQKYFAAMRSKDALDAE 60
Query: 74 VNALKLQIDAMHTAIRKLEEKERYLQTVLTNVEKEL 109
L Q++ I +L++ E L+ + +EK+L
Sbjct: 61 NKKLNTQLNKSSELIEQLKDLENELRRRIKELEKQL 96
>gnl|CDD|178747 PLN03208, PLN03208, E3 ubiquitin-protein ligase RMA2; Provisional.
Length = 193
Score = 35.4 bits (81), Expect = 0.016
Identities = 18/56 (32%), Positives = 25/56 (44%), Gaps = 15/56 (26%)
Query: 208 CPSCKVKRKDAVLTKCFHVFCWDCLR----TRYETRQR-----------KCPKCNA 248
C C + +D V+T C H+FCW C+ +RQR KCP C +
Sbjct: 21 CNICLDQVRDPVVTLCGHLFCWPCIHKWTYASNNSRQRVDQYDHKREPPKCPVCKS 76
>gnl|CDD|227547 COG5222, COG5222, Uncharacterized conserved protein, contains RING
Zn-finger [General function prediction only].
Length = 427
Score = 35.9 bits (82), Expect = 0.017
Identities = 15/43 (34%), Positives = 21/43 (48%), Gaps = 1/43 (2%)
Query: 205 TLTCPSCKVKRKDAVLTK-CFHVFCWDCLRTRYETRQRKCPKC 246
+L CP C ++ + T C H FC +C+ T KCP C
Sbjct: 274 SLKCPLCHCLLRNPMKTPCCGHTFCDECIGTALLDSDFKCPNC 316
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 35.5 bits (81), Expect = 0.030
Identities = 50/199 (25%), Positives = 87/199 (43%), Gaps = 11/199 (5%)
Query: 11 EEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDIL 70
EEA + E+ + + M+ + ++ ++ + K + K E+ K QL K EEK
Sbjct: 1590 EEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKA 1649
Query: 71 HEQVNALKLQIDAMHTAIRKLEEKERYLQTVLTNVEKELHLRNQAMDLNKRKAIESAQSA 130
E A + +K EE ++ + +K +A + K++A E A+ A
Sbjct: 1650 EELKKAEEENKIKAAEEAKKAEEDKKKAE----EAKKAEEDEKKAAEALKKEA-EEAKKA 1704
Query: 131 ADLKLHLE---KYHAQLKEAQQVVTEKTSSLEAEAYKTKRLQEEIAQLRRKTERMKKIEQ 187
+LK K +LK+A++ K + EA + K+ EE ++ E KKI
Sbjct: 1705 EELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEE---AKKDEEEKKKIAH 1761
Query: 188 AGTLDEVMMEEIREYKETL 206
+E EEIR+ KE +
Sbjct: 1762 LKKEEEKKAEEIRKEKEAV 1780
Score = 30.1 bits (67), Expect = 1.4
Identities = 37/177 (20%), Positives = 80/177 (45%), Gaps = 11/177 (6%)
Query: 37 QQLREKDDANFKLMTERIKSNQLHKLAREEKDILHEQVNALKLQIDAMHTAIRKLEEKER 96
++ ++ D+A K ++ K+++L K A +K ++ + A +K EE ++
Sbjct: 1388 EEKKKADEAKKKAEEDKKKADELKKAAAAKKKA--DEAKKKAEEKKKADEAKKKAEEAKK 1445
Query: 97 Y--LQTVLTNVEKELHLRNQAMDLNK----RKAIESAQSAADLKLHLEKYHAQLKEAQQV 150
+ +K + +A + K +K E A+ A + K E+ + EA++
Sbjct: 1446 ADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKA 1505
Query: 151 VTEKTSSLEAEAYKTKRLQEEI--AQLRRKTERMKKIEQAGTLDEV-MMEEIREYKE 204
K + EA+ + + +E A+ +K + KK E+ DE+ EE+++ +E
Sbjct: 1506 AEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEE 1562
>gnl|CDD|227250 COG4913, COG4913, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 1104
Score = 35.4 bits (81), Expect = 0.032
Identities = 37/184 (20%), Positives = 59/184 (32%), Gaps = 15/184 (8%)
Query: 15 LNEMEVTGQAFEDM----QEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAR--EEKD 68
L E + ED + +L+E+ +A + S L R E
Sbjct: 621 LRETVKAMLSREDFYMIKIMRQQGEYIKLQEQANALAHIQALNFASIDLPSAQRQIAELQ 680
Query: 69 ILHEQVNA-------LKLQIDAMHTAIRKLEEKERYLQTVLTNVEKELHLRNQAMDLNKR 121
E++ K +DA T + LE + Y Q V + L+ AM K
Sbjct: 681 ARLERLTHTQSDIAIAKAALDAAQTRQKVLERQ--YQQEVTECAGLKKDLKRAAMLSRKV 738
Query: 122 KAIESAQSAADLKLHLEKYHAQLKEAQQVVTEKTSSLEAEAYKTKRLQEEIAQLRRKTER 181
+I L+ + Q+ Q +E KR+ A+LRR E
Sbjct: 739 HSIAKQGMTGALQALGAAHFPQVAPEQHDDIVDIERIEHRRQLQKRIDAVNARLRRLREE 798
Query: 182 MKKI 185
+
Sbjct: 799 IIGR 802
>gnl|CDD|224241 COG1322, COG1322, Predicted nuclease of restriction
endonuclease-like fold, RmuC family [General function
prediction only].
Length = 448
Score = 34.7 bits (80), Expect = 0.045
Identities = 22/99 (22%), Positives = 44/99 (44%), Gaps = 15/99 (15%)
Query: 9 NEEEALLNEM-EVTGQAFEDMQEQNS-RLLQQLREKDDANFKLMTERIKSNQLHKLAREE 66
+E + L NE+ E + ++ +QN +LL+ LRE + + + +RI +
Sbjct: 105 SEFQELANEIFEELNRRLAELNQQNLKQLLKPLREVLEKFREQLEQRIHESA-------- 156
Query: 67 KDILHEQVNALKLQIDAMHTAIRKLEEKERYLQTVLTNV 105
E+ + L +ID + I++L ++ L L
Sbjct: 157 -----EERSTLLEEIDRLLGEIQQLAQEAGNLTAALKGN 190
>gnl|CDD|223447 COG0370, FeoB, Fe2+ transport system protein B [Inorganic ion
transport and metabolism].
Length = 653
Score = 34.6 bits (80), Expect = 0.046
Identities = 33/180 (18%), Positives = 68/180 (37%), Gaps = 18/180 (10%)
Query: 30 EQNSRLLQQLREK-----------DDANFKLMTERIKSNQLHKLAREEKDILHEQVNALK 78
E+N L QL E D+A I+ + KL++ + V
Sbjct: 95 ERNLYLTLQLLELGIPMILALNMIDEAK----KRGIRIDI-EKLSKLLGVPVVPTVAKRG 149
Query: 79 LQIDAMHTAIRKLEEKE--RYLQTVLTNVEKELHLRNQAMDLNKRKAIESAQSAADLKLH 136
++ + AI +L E + +E+E+ + + AI+ + ++
Sbjct: 150 EGLEELKRAIIELAESKTTPREVDYGEEIEEEIKELEALSEDPRWLAIKLLEDDELVEAV 209
Query: 137 LEKYHAQLKEAQQVVTEKTSSLEAEAYKTKRLQEEIAQLRRKTERMKKIEQAGTLDEVMM 196
L++ +++E + ++E+ L A L E I + K E +K +D V++
Sbjct: 210 LKEPEKRVEELLEELSEEEGHLLLIADARYALIERILRSVVKQEEEEKSSLTDKIDRVLL 269
>gnl|CDD|227827 COG5540, COG5540, RING-finger-containing ubiquitin ligase
[Posttranslational modification, protein turnover,
chaperones].
Length = 374
Score = 33.8 bits (77), Expect = 0.072
Identities = 17/73 (23%), Positives = 26/73 (35%), Gaps = 3/73 (4%)
Query: 180 ERMKKIEQAGTLDEVMMEEIREYKETLTCPSCK---VKRKDAVLTKCFHVFCWDCLRTRY 236
E + G+L + +E E + + C C +K + C H F C+
Sbjct: 298 EAIPTTTTKGSLKPLSIERAVEADKGVECAICMSNFIKNDRLRVLPCDHRFHVGCVDKWL 357
Query: 237 ETRQRKCPKCNAA 249
KCP C A
Sbjct: 358 LGYSNKCPVCRTA 370
>gnl|CDD|130902 TIGR01843, type_I_hlyD, type I secretion membrane fusion protein,
HlyD family. Type I secretion is an ABC transport
process that exports proteins, without cleavage of any
signal sequence, from the cytosol to extracellular
medium across both inner and outer membranes. The
secretion signal is found in the C-terminus of the
transported protein. This model represents the adaptor
protein between the ATP-binding cassette (ABC) protein
of the inner membrane and the outer membrane protein,
and is called the membrane fusion protein. This model
selects a subfamily closely related to HlyD; it is
defined narrowly and excludes, for example, colicin V
secretion protein CvaA and multidrug efflux proteins
[Protein fate, Protein and peptide secretion and
trafficking].
Length = 423
Score = 33.8 bits (78), Expect = 0.082
Identities = 29/146 (19%), Positives = 59/146 (40%), Gaps = 17/146 (11%)
Query: 58 QLHKLAREEKDILHEQVNALKLQIDAMHTAIRKLEEKERYLQTVLTNVEKELHLRNQAMD 117
L K L Q+ + QI + + L+ + + L+ L + +EL R +
Sbjct: 130 GQQSLFESRKSTLRAQLELILAQIKQLEAELAGLQAQLQALRQQLEVISEELEARRK--- 186
Query: 118 LNKRKAIESAQSAADLKLHLEKYHAQLKEAQQVVTEKTSSLEAEAYKTKRLQEEIAQL-- 175
L ++ + +L L + + EAQ + + LE + LQ E Q+
Sbjct: 187 LKEKGLVS--------RLELLELERERAEAQGELGRLEAELEVLKRQIDELQLERQQIEQ 238
Query: 176 RRKTERMKKIEQAGTLDEVMMEEIRE 201
+ E ++++ +A + + E+RE
Sbjct: 239 TFREEVLEELTEA----QARLAELRE 260
>gnl|CDD|180240 PRK05759, PRK05759, F0F1 ATP synthase subunit B; Validated.
Length = 156
Score = 32.1 bits (74), Expect = 0.12
Identities = 17/68 (25%), Positives = 31/68 (45%), Gaps = 5/68 (7%)
Query: 122 KAIESA-QSAADLKLHLEKYHAQLKEAQQVVTEKTSSLEAEAYKTKRLQEEIAQLRRKTE 180
+ +A ++ +L+L KY AQL EA+ + + + +A K E A+ + E
Sbjct: 42 DGLAAAERAKKELELAQAKYEAQLAEARA----EAAEIIEQAKKRAAQIIEEAKAEAEAE 97
Query: 181 RMKKIEQA 188
+ QA
Sbjct: 98 AARIKAQA 105
>gnl|CDD|227719 COG5432, RAD18, RING-finger-containing E3 ubiquitin ligase [Signal
transduction mechanisms].
Length = 391
Score = 33.1 bits (75), Expect = 0.12
Identities = 15/45 (33%), Positives = 18/45 (40%), Gaps = 1/45 (2%)
Query: 206 LTCPSCKVKRKDAVLTKCFHVFCWDCLRTRYETRQRKCPKCNAAF 250
L C C + T C H FC C+R R+ Q CP C
Sbjct: 26 LRCRICDCRISIPCETTCGHTFCSLCIR-RHLGTQPFCPVCREDP 69
>gnl|CDD|222944 PHA02929, PHA02929, N1R/p28-like protein; Provisional.
Length = 238
Score = 32.8 bits (75), Expect = 0.14
Identities = 11/32 (34%), Positives = 18/32 (56%), Gaps = 1/32 (3%)
Query: 219 VLTKCFHVFCWDCLRTRYETRQRKCPKCNAAF 250
+L+ C HVFC +C+ ++ + CP C F
Sbjct: 196 ILSNCNHVFCIECI-DIWKKEKNTCPVCRTPF 226
>gnl|CDD|218661 pfam05622, HOOK, HOOK protein. This family consists of several
HOOK1, 2 and 3 proteins from different eukaryotic
organisms. The different members of the human gene
family are HOOK1, HOOK2 and HOOK3. Different domains
have been identified in the three human HOOK proteins,
and it was demonstrated that the highly conserved
NH2-domain mediates attachment to microtubules, whereas
the central coiled-coil motif mediates homodimerisation
and the more divergent C-terminal domains are involved
in binding to specific organelles (organelle-binding
domains). It has been demonstrated that endogenous HOOK3
binds to Golgi membranes, whereas both HOOK1 and HOOK2
are localised to discrete but unidentified cellular
structures. In mice the Hook1 gene is predominantly
expressed in the testis. Hook1 function is necessary for
the correct positioning of microtubular structures
within the haploid germ cell. Disruption of Hook1
function in mice causes abnormal sperm head shape and
fragile attachment of the flagellum to the sperm head.
Length = 713
Score = 32.8 bits (75), Expect = 0.20
Identities = 50/199 (25%), Positives = 91/199 (45%), Gaps = 26/199 (13%)
Query: 3 LGQVSGNEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMT----ERIKSNQ 58
L + E + L ++ + + +Q Q L+QL+E+ NF+L RIK +
Sbjct: 219 LQERLAQLEGSSLGPNQLGSKKYNLLQSQ----LEQLQEE---NFRLEAARDDYRIKCEE 271
Query: 59 LHKLARE------EKDILHEQVNALKLQIDAMHTAIRKLEEKERYLQTVLTNVEKELHLR 112
L K E E L + ALK +ID + + K ++ E ++T +E LR
Sbjct: 272 LEKELAELQHRNDELTSLAAESQALKDEIDVLRESSDKAKKLEAQVETYKKKLEDLNDLR 331
Query: 113 NQAMDLNKRKAI---------ESAQSAADLKLHLEKYHAQLKEAQQVVTEKTSSLEAEAY 163
Q L +R A+ E + A + LE Y Q++E ++E++ + +
Sbjct: 332 RQVKLLEERNAMYMQNTVQLEEELKKANAARGQLETYKRQVQELHAKLSEESKKADKLEF 391
Query: 164 KTKRLQEEIAQLRRKTERM 182
+ KRL+E++ L+++ ER+
Sbjct: 392 EYKRLEEKLEALQKEKERL 410
>gnl|CDD|236292 PRK08566, PRK08566, DNA-directed RNA polymerase subunit A';
Validated.
Length = 882
Score = 32.5 bits (75), Expect = 0.22
Identities = 15/54 (27%), Positives = 26/54 (48%), Gaps = 2/54 (3%)
Query: 169 QEEIAQLRRKTERMKKIEQAGTLDEVMMEEIREYKETLTCPSCKVKRKDAVLTK 222
+EEI + K ER+K+ D+++ E +E + + CP C K+ K
Sbjct: 111 EEEIEEYLEKLERLKEWGSL--ADDLIKEVKKEAAKRMVCPHCGEKQYKIKFEK 162
>gnl|CDD|232965 TIGR00414, serS, seryl-tRNA synthetase. This model represents the
seryl-tRNA synthetase found in most organisms. This
protein is a class II tRNA synthetase, and is recognized
by the pfam model tRNA-synt_2b. The seryl-tRNA
synthetases of two archaeal species, Methanococcus
jannaschii and Methanobacterium thermoautotrophicum,
differ considerably and are included in a different
model [Protein synthesis, tRNA aminoacylation].
Length = 418
Score = 32.3 bits (74), Expect = 0.22
Identities = 19/102 (18%), Positives = 46/102 (45%), Gaps = 4/102 (3%)
Query: 61 KLAREEKDILHEQVNALKLQIDAMHTAIRKLEEKERYLQTVLTNVEKELHLRNQAMDLNK 120
KL R D++ E + A L +D + L+++ + L + + ++ + + ++ + K
Sbjct: 5 KLLRNNPDLVKESLKARGLSVDIDLEKLIALDDERKKLLSEIEELQAKRNELSKQIGKAK 64
Query: 121 RKAIESAQSAA----DLKLHLEKYHAQLKEAQQVVTEKTSSL 158
+ + + +LK L + A LK + + +K S+
Sbjct: 65 GQKKDKIEEIKKELKELKEELTELSAALKALEAELQDKLLSI 106
>gnl|CDD|233843 TIGR02390, RNA_pol_rpoA1, DNA-directed RNA polymerase subunit A'.
This family consists of the archaeal A' subunit of the
DNA-directed RNA polymerase. The example from
Methanocaldococcus jannaschii contains an intein.
Length = 867
Score = 32.4 bits (74), Expect = 0.23
Identities = 16/58 (27%), Positives = 29/58 (50%), Gaps = 3/58 (5%)
Query: 169 QEEIAQLRRKTERMKKIEQAGTLDEVMMEEIREY-KETLTCPSCKVKRKDAVLTKCFH 225
+EEI Q K ++K E+ G L ++E+I + + + CP C ++K K +
Sbjct: 106 EEEIEQYLEKINKLK--EEGGDLASTLIEKIVKEAAKRMKCPHCGEEQKKIKFEKPTY 161
>gnl|CDD|222290 pfam13654, AAA_32, AAA domain. This family includes a wide variety
of AAA domains including some that have lost essential
nucleotide binding residues in the P-loop.
Length = 509
Score = 32.1 bits (74), Expect = 0.27
Identities = 22/98 (22%), Positives = 45/98 (45%), Gaps = 17/98 (17%)
Query: 26 EDMQEQNSRLLQQLREK-DDANFKL-MT------------ERIKSNQLHKLAREEKDILH 71
E+ QE+ ++L E+ + F L T E + + L EE++ L
Sbjct: 122 EEFQEKREEAFEELEEEAKEKGFALVRTPGGFVFAPLKDGEPLTEEEFEALPEEEREELE 181
Query: 72 EQVNALKLQIDAMHTAIRKLEEKERYLQTVLTNVEKEL 109
E+++ L+ + + +R+L E ER + L +++E+
Sbjct: 182 EKIDELE---EELQEILRQLRELEREAREKLRELDREV 216
>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
recombination, and repair].
Length = 908
Score = 31.7 bits (72), Expect = 0.39
Identities = 34/176 (19%), Positives = 77/176 (43%), Gaps = 10/176 (5%)
Query: 10 EEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDI 69
EE LL + + + ++ + L++L E+ +L+ S ++L E++
Sbjct: 576 EELRLLRTRKEELEELRERLKELKKKLKELEERLSQLEELLQSLELSEAENELEEAEEE- 634
Query: 70 LHEQVNALKLQIDAMHTAIRKLEEKERYLQTVLTNVEKELHLRNQAMDLNKRKAIESAQS 129
L ++ L LQ + LEE E ++ + + +EL N+ + E +
Sbjct: 635 LESELEKLNLQAELEELLQAALEELEEKVEELEAEIRRELQRI-----ENEEQLEEKLEE 689
Query: 130 AADLKLHLEKYHAQLKEAQQVVTEKTSSLEAEAYKTKRLQEEIAQLRRKTERMKKI 185
L+ LE+ +L+E +K +E + + + E+ +L+++ E+++K
Sbjct: 690 LEQLEEELEQLREELEELL----KKLGEIEQLIEELESRKAELEELKKELEKLEKA 741
Score = 29.3 bits (66), Expect = 2.1
Identities = 38/199 (19%), Positives = 79/199 (39%)
Query: 9 NEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKD 68
E EAL + + E ++E +RLL+ + +A E + +L + E+ +
Sbjct: 232 QEIEALEERLAELEEEKERLEELKARLLEIESLELEALKIREEELRELERLLEELEEKIE 291
Query: 69 ILHEQVNALKLQIDAMHTAIRKLEEKERYLQTVLTNVEKELHLRNQAMDLNKRKAIESAQ 128
L E ++ + + LEE E L+ + + E+ L + L + +
Sbjct: 292 RLEELEREIEELEEELEGLRALLEELEELLEKLKSLEERLEKLEEKLEKLESELEELAEE 351
Query: 129 SAADLKLHLEKYHAQLKEAQQVVTEKTSSLEAEAYKTKRLQEEIAQLRRKTERMKKIEQA 188
KL E+ + +++ E +LE + +QE +L + +++I++
Sbjct: 352 KNELAKLLEERLKELEERLEELEKELEKALERLKQLEEAIQELKEELAELSAALEEIQEE 411
Query: 189 GTLDEVMMEEIREYKETLT 207
E +EE+ E L
Sbjct: 412 LEELEKELEELERELEELE 430
Score = 28.6 bits (64), Expect = 3.8
Identities = 27/144 (18%), Positives = 60/144 (41%), Gaps = 20/144 (13%)
Query: 63 AREEKDILHEQVNALKLQIDAMHTAIRKLEEKERYLQTVLTNVEKELHLRNQAMDLNKRK 122
+E ++ ++ L+ ++ +L E+ L+ L +E+EL + + K +
Sbjct: 472 EKELLELYELELEELEEELSREKE-EAELREEIEELEKELRELEEELIELLELEEALKEE 530
Query: 123 AIESAQSAADLKLHLEKYHAQLKEAQQVVTEKTSSLEAEAYKTKRLQEEIAQLRRKTERM 182
E + +L LE+ +L+ ++L+EE+ QL + + +
Sbjct: 531 LEEKLEKLENLLEELEELKEKLQ-------------------LQQLKEELRQLEDRLQEL 571
Query: 183 KKIEQAGTLDEVMMEEIREYKETL 206
K++ + L EE+ E +E L
Sbjct: 572 KELLEELRLLRTRKEELEELRERL 595
>gnl|CDD|219791 pfam08317, Spc7, Spc7 kinetochore protein. This domain is found in
cell division proteins which are required for
kinetochore-spindle association.
Length = 321
Score = 31.5 bits (72), Expect = 0.39
Identities = 32/146 (21%), Positives = 66/146 (45%), Gaps = 25/146 (17%)
Query: 64 REEKDILHEQVNALKLQIDAMHTAIRKLEEKERYLQTVLTNVEKELHLRNQAMDLNKRKA 123
+ ++++L++ +N + ++ KL +K + L+ + + LR A +LN
Sbjct: 160 KRDEELLNKDLNLI-------NSIKPKLRKKLQALKEEIAS------LRQLADELNLCDP 206
Query: 124 IESAQSAADLKLH---LEKYHAQLKEAQQVVTEKTSSLEAEAYKTKRLQEEIAQLRRKTE 180
+E ++ +L+ + + QL+E QQ + E T ++EA K L EEIA+ + E
Sbjct: 207 LELEKARQELRSLSVKISEKRKQLEELQQELQELTIAIEALTNKKSELLEEIAEAEKIRE 266
Query: 181 RMKKIEQAGTLDEVMMEEIREYKETL 206
+ +EI + K +
Sbjct: 267 ECRGWS---------AKEISKLKAKV 283
>gnl|CDD|234501 TIGR04211, SH3_and_anchor, SH3 domain protein. Members of this
protein family have a signal peptide, a strongly
conserved SH3 domain, a variable region, and then a
C-terminal hydrophobic transmembrane alpha helix region.
Length = 198
Score = 30.7 bits (70), Expect = 0.45
Identities = 23/91 (25%), Positives = 41/91 (45%), Gaps = 3/91 (3%)
Query: 9 NEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLA---RE 65
E L E+ + ++Q++N L Q+L + +L E + QL A E
Sbjct: 73 QELAELQEELAELQEQLAELQQENQELKQELSTLEAELERLQKELARIKQLSANAIELDE 132
Query: 66 EKDILHEQVNALKLQIDAMHTAIRKLEEKER 96
E L E++ LK + +A+ +L+E E+
Sbjct: 133 ENRELREELAELKQENEALEAENERLQENEQ 163
Score = 30.4 bits (69), Expect = 0.68
Identities = 18/91 (19%), Positives = 37/91 (40%), Gaps = 3/91 (3%)
Query: 12 EALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIK-SNQLHKLAREEKDI- 69
L E+ + ++QEQ + L Q+ +E L E + +L ++ + +
Sbjct: 69 PELQQELAELQEELAELQEQLAELQQENQELKQELSTLEAELERLQKELARIKQLSANAI 128
Query: 70 -LHEQVNALKLQIDAMHTAIRKLEEKERYLQ 99
L E+ L+ ++ + LE + LQ
Sbjct: 129 ELDEENRELREELAELKQENEALEAENERLQ 159
>gnl|CDD|220784 pfam10497, zf-4CXXC_R1, Zinc-finger domain of monoamine-oxidase A
repressor R1. R1 is a transcription factor repressor
that inhibits monoamine oxidase A gene expression. This
domain is a four-CXXC zinc finger putative DNA-binding
domain found at the C-terminal end of R1. The domain
carries 12 cysteines of which four pairs are of the CXXC
type.
Length = 105
Score = 29.7 bits (67), Expect = 0.46
Identities = 16/45 (35%), Positives = 18/45 (40%), Gaps = 8/45 (17%)
Query: 210 SCKVKRKDAVLTKCFHVFCWDCLRTRY-----ETRQRK---CPKC 246
SCK K+K FC CLR RY E + CP C
Sbjct: 23 SCKDKKKGKECHGVRGQFCGKCLRNRYGENADEAAKDPDWICPPC 67
>gnl|CDD|182330 PRK10246, PRK10246, exonuclease subunit SbcC; Provisional.
Length = 1047
Score = 31.3 bits (71), Expect = 0.52
Identities = 30/127 (23%), Positives = 55/127 (43%), Gaps = 20/127 (15%)
Query: 70 LHEQVNALKLQIDAMHTAIRKLEEKERYLQTVLTNVEKELHLRNQAMDLNKRKAIESAQS 129
H + L+ ++ A+H I +++ LQ + NV +E RN A++ +++ E Q
Sbjct: 417 QHAEQRPLRQRLVALHGQIVPQQKRLAQLQVAIQNVTQEQTQRNAALNEMRQRYKEKTQQ 476
Query: 130 AADLKL---------HLEKYHAQLKEAQ---------QVVTEKTSSLEAEAYKTKR--LQ 169
AD+K LE AQL+ Q E +LE +++ L+
Sbjct: 477 LADVKTICEQEARIKDLEAQRAQLQAGQPCPLCGSTSHPAVEAYQALEPGVNQSRLDALE 536
Query: 170 EEIAQLR 176
+E+ +L
Sbjct: 537 KEVKKLG 543
>gnl|CDD|146145 pfam03357, Snf7, Snf7. This family of proteins are involved in
protein sorting and transport from the endosome to the
vacuole/lysosome in eukaryotic cells. Vacuoles/lysosomes
play an important role in the degradation of both lipids
and cellular proteins. In order to perform this
degradative function, vacuoles/lysosomes contain
numerous hydrolases which have been transported in the
form of inactive precursors via the biosynthetic pathway
and are proteolytically activated upon delivery to the
vacuole/lysosome. The delivery of transmembrane
proteins, such as activated cell surface receptors to
the lumen of the vacuole/lysosome, either for
degradation/downregulation, or in the case of
hydrolases, for proper localisation, requires the
formation of multivesicular bodies (MVBs). These late
endosomal structures are formed by invaginating and
budding of the limiting membrane into the lumen of the
compartment. During this process, a subset of the
endosomal membrane proteins is sorted into the forming
vesicles. Mature MVBs fuse with the vacuole/lysosome,
thereby releasing cargo containing vesicles into its
hydrolytic lumen for degradation. Endosomal proteins
that are not sorted into the intralumenal MVB vesicles
are either recycled back to the plasma membrane or Golgi
complex, or remain in the limiting membrane of the MVB
and are thereby transported to the limiting membrane of
the vacuole/lysosome as a consequence of fusion.
Therefore, the MVB sorting pathway plays a critical role
in the decision between recycling and degradation of
membrane proteins. A few archaeal sequences are also
present within this family.
Length = 169
Score = 30.3 bits (69), Expect = 0.53
Identities = 27/129 (20%), Positives = 59/129 (45%), Gaps = 18/129 (13%)
Query: 81 IDAMHTAIRKLEEKERYLQTVLTNVEKELHLRNQAMDLNKRKAIESAQSAADLKLHLEKY 140
I ++ AIR+L++K+ L+ + +E E ++ A NK A + L++
Sbjct: 3 ILSLRKAIRELDKKQESLEKKIKKLEAE--IKKLAKKGNKDAA----------LILLKQK 50
Query: 141 HAQLKEAQQVVTEKTSSLEAEAYKTKRL---QEEIAQLRRKTERMKKIEQAGTLDEV--M 195
K+ Q+ + ++LE + QE + ++ + MK + + +D++ +
Sbjct: 51 KRYEKQLDQLDGQL-ANLEQVRMAIENAKTNQEVLNAMKGGAKAMKAMNKNMDIDKIDDL 109
Query: 196 MEEIREYKE 204
M+EI + E
Sbjct: 110 MDEIEDQME 118
>gnl|CDD|223092 COG0013, AlaS, Alanyl-tRNA synthetase [Translation, ribosomal
structure and biogenesis].
Length = 879
Score = 31.5 bits (72), Expect = 0.55
Identities = 17/62 (27%), Positives = 31/62 (50%), Gaps = 4/62 (6%)
Query: 136 HLEKYHAQLKEAQQVVTEKTSSLEAEAYKTKRLQEEIAQLRRKTERMKKIEQAGTLDEVM 195
+L++ LKE ++ K K +RL EE+ +L ++ ER+KK A L +++
Sbjct: 709 YLQEQEDLLKEIASLL--KVP--PELLPKVERLLEELKELEKELERLKKKLAAAELADLL 764
Query: 196 ME 197
Sbjct: 765 SN 766
>gnl|CDD|227568 COG5243, HRD1, HRD ubiquitin ligase complex, ER membrane component
[Posttranslational modification, protein turnover,
chaperones].
Length = 491
Score = 31.1 bits (70), Expect = 0.56
Identities = 20/82 (24%), Positives = 28/82 (34%), Gaps = 7/82 (8%)
Query: 171 EIAQLRRKTERMKK------IEQAGTLDEVMMEEIREYKETLTCPSCKVKRKDAVLTKCF 224
E A+ RR T+ + EQ D + E P + C
Sbjct: 260 EHARFRRATKDLNAMYPTATEEQLTNSDRTCTICMDEMFHPDHEPLPRGLDMTPKRLPCG 319
Query: 225 HVFCWDCLRTRYETRQRKCPKC 246
H+ CL+ E RQ+ CP C
Sbjct: 320 HILHLHCLKNWLE-RQQTCPIC 340
>gnl|CDD|218160 pfam04587, ADP_PFK_GK, ADP-specific Phosphofructokinase/Glucokinase
conserved region. In archaea a novel type of glycolytic
pathway exists that is deviant from the classical
Embden-Meyerhof pathway. This pathway utilises two novel
proteins: an ADP-dependent Glucokinase and an
ADP-dependent Phosphofructokinase. This conserved region
is present at the C-terminal of both these proteins.
Interestingly this family contains sequences from higher
eukaryotes..
Length = 444
Score = 30.9 bits (70), Expect = 0.64
Identities = 14/54 (25%), Positives = 25/54 (46%)
Query: 149 QVVTEKTSSLEAEAYKTKRLQEEIAQLRRKTERMKKIEQAGTLDEVMMEEIREY 202
Q++T S + Y KR++E+I L++ +E A D + +EI
Sbjct: 219 QMLTLFYSDGKDAEYYLKRVKEDIKLLKKPKGIKVHLEFASIQDRELRKEIVHN 272
>gnl|CDD|220664 pfam10267, Tmemb_cc2, Predicted transmembrane and coiled-coil 2
protein. This family of transmembrane coiled-coil
containing proteins is conserved from worms to humans.
Its function is unknown.
Length = 387
Score = 30.9 bits (70), Expect = 0.65
Identities = 28/82 (34%), Positives = 43/82 (52%), Gaps = 11/82 (13%)
Query: 78 KLQIDAMHTAIRKLEEKERYLQT-VLTNVEKELHLRNQAMDLNKRKAIE---------SA 127
K ID++ I KL+E+ R QT NV + L L ++A D + I+ SA
Sbjct: 3 KAAIDSLKQKILKLKEQLRVEQTARDENVAEYLKLVSKA-DKQQVGRIKQVFEKKNQKSA 61
Query: 128 QSAADLKLHLEKYHAQLKEAQQ 149
+ A L+ LE+YH +LKE ++
Sbjct: 62 HTIAQLQKKLEQYHQKLKELEE 83
>gnl|CDD|226419 COG3904, COG3904, Predicted periplasmic protein [Function unknown].
Length = 245
Score = 30.7 bits (69), Expect = 0.65
Identities = 13/47 (27%), Positives = 20/47 (42%)
Query: 13 ALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQL 59
A L EM + + M ++QL K+ KL+T R + L
Sbjct: 197 AYLREMGIGPGLLQMMLATPPSDIRQLGLKEMTAMKLVTSRDAVDLL 243
>gnl|CDD|238862 cd01824, Phospholipase_B_like, Phospholipase-B_like. This subgroup
of the SGNH-family of lipolytic enzymes may have both
esterase and phospholipase-A/lysophospholipase activity.
It's members may be involved in the conversion of
phosphatidylcholine to fatty acids and
glycerophosphocholine, perhaps in the context of dietary
lipid uptake. Members may be membrane proteins. The
tertiary fold of the SGNH-hydrolases is substantially
different from that of the alpha/beta hydrolase family
and unique among all known hydrolases; Its active site
closely resembles two of the three components of typical
Ser-His-Asp(Glu) triad from other serine hydrolases.
Length = 288
Score = 30.8 bits (70), Expect = 0.68
Identities = 26/124 (20%), Positives = 48/124 (38%), Gaps = 22/124 (17%)
Query: 2 SLGQVSGNEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANF----KLMTERIKSN 57
S+G ++ N + G ED+ +Q L++++++ +F KL+T I N
Sbjct: 72 SVGTGDETLPDSGFN-VAEPGAKSEDLPQQARLLVRRMKKDPRVDFKNDWKLITIFIGGN 130
Query: 58 ---------QLHKLAREEK------DILHEQV-NALKLQIDAMHTAI-RKLEEKERYLQT 100
K DIL ++V A + ++ A R L +K +T
Sbjct: 131 DLCSLCEDANPGSPQTFVKNLRKALDILRDEVPRAFVNLVGLLNVASLRSLTKKPLQCET 190
Query: 101 VLTN 104
+L
Sbjct: 191 LLAP 194
>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
Length = 880
Score = 30.8 bits (70), Expect = 0.74
Identities = 39/221 (17%), Positives = 89/221 (40%), Gaps = 32/221 (14%)
Query: 9 NEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKD 68
E L +E+E +A ED +E+ L +++ E + + + + REE+D
Sbjct: 363 EEAAELESELEEAREAVEDRREEIEELEEEIEELRERFGDAPVDLGNAEDFLEELREERD 422
Query: 69 ILHEQVNALKLQIDAMHTAIRK-------------------------LEEKERYLQTVLT 103
L E+ L+ + + + +EE ++ +
Sbjct: 423 ELREREAELEATLRTARERVEEAEALLEAGKCPECGQPVEGSPHVETIEEDRERVEELEA 482
Query: 104 NVEKELHLRNQAMDLNKRKAIESAQSAADLKLHLEKYHAQLKEAQQVVTEKTSSLEAEAY 163
+E L + ++ +R +E A+ + + +E+ + ++ ++++ E+ ++E +
Sbjct: 483 ELED---LEEEVEEVEER--LERAEDLVEAEDRIERLEERREDLEELIAERRETIEEKRE 537
Query: 164 KTKRLQEEIAQLRRKTERMKKIEQAGTLDEVMMEEIREYKE 204
+ + L+E A+L E +K E A +E E E E
Sbjct: 538 RAEELRERAAELE--AEAEEKREAAAEAEEEAEEAREEVAE 576
>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
unknown].
Length = 652
Score = 30.8 bits (70), Expect = 0.82
Identities = 22/117 (18%), Positives = 48/117 (41%), Gaps = 19/117 (16%)
Query: 80 QIDAMHTAIRKLEEKERYLQTVLTNVEKELHLRNQAMDLNKRK-----------AIESAQ 128
+ DA+ A + + L VE++L D+ + K A+ +
Sbjct: 343 ERDALAAAYKAYLA----YKPKLEKVERKLPELGIWKDVERIKALVIRGYPLAEALSKVK 398
Query: 129 SAADLKLHLEKYHAQLKEAQQVVTEKTSSLEAEAYKTKRLQEEIAQLRRKTERMKKI 185
+ EK + +E +++ + + E +RL+EE ++L+R+ E +K+
Sbjct: 399 EEERPR---EKEGTEEEERREITVYEKRIKKLEE-TVERLEEENSELKRELEELKRE 451
>gnl|CDD|235549 PRK05658, PRK05658, RNA polymerase sigma factor RpoD; Validated.
Length = 619
Score = 30.5 bits (70), Expect = 0.86
Identities = 35/182 (19%), Positives = 78/182 (42%), Gaps = 34/182 (18%)
Query: 27 DMQEQNSRLLQQLREKDDA---NFKLMTERIK-----SNQLHKLAREEKDILHEQVNALK 78
+ S L +++ EK A +K + + + HK + ++ L E++ +L+
Sbjct: 208 SLAADESELPEKVLEKFKALAKQYKKLRKAQEKKVEGRLAQHKKYAKLREKLKEELKSLR 267
Query: 79 L---QIDA----MHTAIRKLEEKERYLQTVLTN--VEKELHLRN-QAMDLN---KRKAIE 125
L QID + +++ +ER L ++ + ++ L+ Q +L+ K I
Sbjct: 268 LTSKQIDELVEQLRDINKRVRGQERELLRLVERLKMPRKDFLKLFQGNELDITWLEKEIA 327
Query: 126 SAQSAADLKLHLEKYHAQLKEAQQVVTEKTSSLEAEAYKTKRLQEEIAQLRRKTERMKKI 185
S + ++ L + + ++K+ QQ + ++E E T I +L+ ++ K
Sbjct: 328 SGKPWSEF---LVRVYDEIKKLQQ----ELEAIEEETGLT------IEELKEINRQISKG 374
Query: 186 EQ 187
E
Sbjct: 375 EA 376
>gnl|CDD|217834 pfam03998, Utp11, Utp11 protein. This protein is found to be part
of a large ribonucleoprotein complex containing the U3
snoRNA. Depletion of the Utp proteins impedes production
of the 18S rRNA, indicating that they are part of the
active pre-rRNA processing complex. This large RNP
complex has been termed the small subunit (SSU)
processome.
Length = 239
Score = 30.0 bits (68), Expect = 0.89
Identities = 35/184 (19%), Positives = 74/184 (40%), Gaps = 16/184 (8%)
Query: 34 RLLQQLREK------DDANFKLMTERIKSNQLHKLAREEKDILHEQVNALKLQ------- 80
+ L+ L+EK D+ F +++ + E++ + EQV LK Q
Sbjct: 37 KTLKALKEKAAERNPDEFYFGMISSKTDDGVHILKREEKEVLSKEQVKLLKTQDLNYVRT 96
Query: 81 -IDAMHTAIRKLEEKERYLQTVLTNVEKELHLRNQAMDLNKRKAIESAQSAADLKLHLEK 139
A I KL+E+ + + + + ++ + A + L +
Sbjct: 97 KRQAEAKKIEKLKEELHLTDSGSAGPKHIVFVDDEEEQKSFDPAEYFDTTPELLDRRENR 156
Query: 140 YHAQLKEAQQVVTEKTSSLEAEAYKTKRLQEEIAQLRRKTERMKKIEQAGTLDEVMMEEI 199
E +V EK A+ K ++L +E+ + + + +++KK+EQ L +M++
Sbjct: 157 PRISQLEKTSLVDEKQKKKSAK--KKRKLYKELKERKEREKKLKKVEQRLELQRELMKKG 214
Query: 200 REYK 203
+ K
Sbjct: 215 KGKK 218
>gnl|CDD|179749 PRK04147, PRK04147, N-acetylneuraminate lyase; Provisional.
Length = 293
Score = 30.3 bits (69), Expect = 0.89
Identities = 13/44 (29%), Positives = 23/44 (52%)
Query: 91 LEEKERYLQTVLTNVEKELHLRNQAMDLNKRKAIESAQSAADLK 134
EEK++ L+ V + ++ L Q +N +A E A+ A +L
Sbjct: 56 TEEKKQVLEIVAEEAKGKVKLIAQVGSVNTAEAQELAKYATELG 99
>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein. This entry is a highly
conserved protein present in eukaryotes.
Length = 680
Score = 30.7 bits (69), Expect = 0.95
Identities = 45/187 (24%), Positives = 89/187 (47%), Gaps = 21/187 (11%)
Query: 21 TGQAFEDMQEQNSRL---LQQLREKDD---ANFKLMT--ERIKSNQLHKLAREEKDILHE 72
T A E +++ +L LQQ R+ + L+T ER + L +L ++E D+L
Sbjct: 404 TPSAIERLEQDIKKLQAELQQARQNESELRNQISLLTSLERSLKSDLGQL-KKENDMLQT 462
Query: 73 QVNALKLQIDAMHTAIRKLEEKERYLQTVLTNVEKELHLRNQAMDLNKRKAIE---SAQS 129
++N++ +++ +E++ + N EK+L + KRK E +A++
Sbjct: 463 KLNSMVSAKQKDKQSMQSMEKRLKSEADSRVNAEKQLA------EEKKRKKEEEETAARA 516
Query: 130 AADLKLHLEKYHAQLKEAQQVVTEKTSSLEAEAYKTKRLQEEIAQLRRKTERMKKIEQAG 189
AA E+ LK+A+Q + + LE + K +EE L ++ + ++K +++
Sbjct: 517 AAQAAASREECAESLKQAKQDLEMEIKKLE---HDLKLKEEECRMLEKEAQELRKYQESE 573
Query: 190 TLDEVMM 196
EV+M
Sbjct: 574 KETEVLM 580
>gnl|CDD|217933 pfam04156, IncA, IncA protein. Chlamydia trachomatis is an
obligate intracellular bacterium that develops within a
parasitophorous vacuole termed an inclusion. The
inclusion is non-fusogenic with lysosomes but intercepts
lipids from a host cell exocytic pathway. Initiation of
chlamydial development is concurrent with modification
of the inclusion membrane by a set of C.
trachomatis-encoded proteins collectively designated
Incs. One of these Incs, IncA, is functionally
associated with the homotypic fusion of inclusions. This
family probably includes members of the wider Inc family
rather than just IncA.
Length = 186
Score = 29.7 bits (67), Expect = 1.1
Identities = 24/129 (18%), Positives = 61/129 (47%), Gaps = 3/129 (2%)
Query: 54 IKSNQLHKLAREEKDILHEQVNALKLQIDAMHTAIRKLEEKERYLQTVLTNVEKELHLRN 113
+ + + ++ + L +++ LK Q+ + + LEE+ L++ L +++++L L
Sbjct: 61 LLKAPVQSVRPQKLEELQGELSELKQQLSELQEELEDLEERIAELESELEDLKEDLQLLR 120
Query: 114 QAMDLNKRKAIESAQSAADLKLHLEKYHAQLKEAQQVVTEKTSSLEAEAYKTKRLQEEIA 173
+ + + + +S +L ++ ++ ++ V E LE +RLQE I
Sbjct: 121 ELLKSLEERLESLEESIKEL---AKELRELRQDLREEVEELREELERLQENLQRLQEAIQ 177
Query: 174 QLRRKTERM 182
+L+ E++
Sbjct: 178 ELQSLLEQL 186
Score = 27.8 bits (62), Expect = 5.2
Identities = 20/105 (19%), Positives = 47/105 (44%), Gaps = 7/105 (6%)
Query: 10 EEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQ-LHKLAREEKD 68
E + L+E++ + ED++E+ + L +L + + L + L L K+
Sbjct: 83 ELKQQLSELQ---EELEDLEERIAELESELEDLKEDLQLLRELLKSLEERLESLEESIKE 139
Query: 69 I---LHEQVNALKLQIDAMHTAIRKLEEKERYLQTVLTNVEKELH 110
+ L E L+ +++ + + +L+E + LQ + ++ L
Sbjct: 140 LAKELRELRQDLREEVEELREELERLQENLQRLQEAIQELQSLLE 184
>gnl|CDD|219176 pfam06785, UPF0242, Uncharacterized protein family (UPF0242).
Length = 401
Score = 30.3 bits (68), Expect = 1.2
Identities = 28/123 (22%), Positives = 51/123 (41%), Gaps = 20/123 (16%)
Query: 84 MHTAIRKLEEKERYLQTVLTNVEKELHLRNQAMDLNKRKAIESAQSAADLKLHLEKYHAQ 143
+ T IRKL E++ L+ + ++E E + + LKL +K Q
Sbjct: 73 LQTKIRKLTEQDEGLRNIRESIE------------------ERQKESDQLKLQNQKLVNQ 114
Query: 144 LKEAQQVVTEKTSSLEAEAYKTKRLQEEIAQLRRKTERMKKIEQAGTLDEVMMEEIREYK 203
L A+ V + + ++ L+EE L+ + + + +++ E E RE
Sbjct: 115 LGHARGVFIKTKGRYQKLEELSRHLKEENQCLQIQLDAL--VQECNEKIEENQELNRELA 172
Query: 204 ETL 206
ETL
Sbjct: 173 ETL 175
>gnl|CDD|116500 pfam07888, CALCOCO1, Calcium binding and coiled-coil domain
(CALCOCO1) like. Proteins found in this family are
similar to the coiled-coil transcriptional coactivator
protein coexpressed by Mus musculus (CoCoA/CALCOCO1).
This protein binds to a highly conserved N-terminal
domain of p160 coactivators, such as GRIP1, and thus
enhances transcriptional activation by a number of
nuclear receptors. CALCOCO1 has a central coiled-coil
region with three leucine zipper motifs, which is
required for its interaction with GRIP1 and may regulate
the autonomous transcriptional activation activity of
the C-terminal region.
Length = 546
Score = 30.0 bits (67), Expect = 1.2
Identities = 35/156 (22%), Positives = 63/156 (40%), Gaps = 14/156 (8%)
Query: 27 DMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDILHEQVNALKLQIDAMHT 86
DM+ + +L LR + +L + ++ + E++ L Q + +I +
Sbjct: 168 DMRSRIEQLETALRHSTEKTEELEEQHKEAQSSSESMSAERNALLAQRAENQQRILELEQ 227
Query: 87 AIRKLEEKERYLQTVLTN-VEKELHLRNQAMDLNKRKAIESAQ------SAADLKLHLEK 139
I+ L +K++ VL + E L +L +R + Q DL+L E+
Sbjct: 228 DIQTLTQKKQENDRVLEGTQDIEAELERMKGELKQRLKKMTIQRRDEETERIDLQLENEQ 287
Query: 140 YHAQLKEAQQVVTEKTSSLEAEAYKTKRLQEEIAQL 175
H L+ Q+ LE+ K L EE+A L
Sbjct: 288 LHEDLRTLQER-------LESSQQKAGLLGEELASL 316
>gnl|CDD|100796 PRK01156, PRK01156, chromosome segregation protein; Provisional.
Length = 895
Score = 30.3 bits (68), Expect = 1.2
Identities = 19/111 (17%), Positives = 50/111 (45%), Gaps = 8/111 (7%)
Query: 88 IRKLEEKERYLQTVLTNVEKELHLRNQAMDLN---KRKAIESAQSAADLKLHLEKYHAQL 144
+++E + ++ + ++ + + Q +++ +++ D++ +L+K L
Sbjct: 635 YNEIQENKILIEKLRGKID---NYKKQIAEIDSIIPDLKEITSR-INDIEDNLKKSRKAL 690
Query: 145 KEAQQVVTEKTSSLEAEAYKTKRLQEEIAQLRRKTERMKKIEQA-GTLDEV 194
+A+ S++E + L + I + E MKKI++A G L +
Sbjct: 691 DDAKANRARLESTIEILRTRINELSDRINDINETLESMKKIKKAIGDLKRL 741
>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division
and chromosome partitioning].
Length = 420
Score = 30.1 bits (68), Expect = 1.3
Identities = 37/206 (17%), Positives = 75/206 (36%), Gaps = 27/206 (13%)
Query: 5 QVSGNEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKS-------- 56
+ + + E+ + + Q+Q ++L +QL+ + L + I++
Sbjct: 34 AADDKQLKQIQKEIAALEKKIREQQDQRAKLEKQLKSLETEIASLEAQLIETADDLKKLR 93
Query: 57 ----------NQLHKLAREEKDILHEQVNALKLQIDAMHTAIRKLEE---KERYLQTVLT 103
N L RE++ L EQ+ AL+ A+ E + L
Sbjct: 94 KQIADLNARLNALEVQEREQRRRLAEQLAALQRSGRNPPPALLVSPEDAQRSVRLAIYYG 153
Query: 104 NVEKEL-----HLRNQAMDLNKRKAIESAQSAADLKLHLEKYHAQLKEAQQVVTEKTSSL 158
+ L+ L +A +A+ A L E+ AQ + Q++ E+ +L
Sbjct: 154 ALNPARAERIDALKATLKQLAAVRAEIAAEQAELTTLLSEQ-RAQQAKLAQLLEERKKTL 212
Query: 159 EAEAYKTKRLQEEIAQLRRKTERMKK 184
+ Q+++ +LR R+K
Sbjct: 213 AQLNSELSADQKKLEELRANESRLKN 238
>gnl|CDD|227481 COG5152, COG5152, Uncharacterized conserved protein, contains RING
and CCCH-type Zn-fingers [General function prediction
only].
Length = 259
Score = 29.7 bits (66), Expect = 1.3
Identities = 16/61 (26%), Positives = 24/61 (39%), Gaps = 4/61 (6%)
Query: 186 EQAGTLDEVMMEEIREYKETLTCPSCKVKRKDAVLTKCFHVFCWDCLRTRYETRQRKCPK 245
E +E + K C CK + V+T+C H FC C +Y +K +
Sbjct: 177 EWNAEYEEAPVISGPGEKIPFLCGICKKDYESPVVTECGHSFCSLCAIRKY----QKGDE 232
Query: 246 C 246
C
Sbjct: 233 C 233
>gnl|CDD|235461 PRK05431, PRK05431, seryl-tRNA synthetase; Provisional.
Length = 425
Score = 30.0 bits (69), Expect = 1.3
Identities = 20/81 (24%), Positives = 37/81 (45%), Gaps = 10/81 (12%)
Query: 35 LLQQLREKDDANFKLMTE----RIKSNQL----HKLAREEKDI--LHEQVNALKLQIDAM 84
+ +L E D+ +L TE + + N L + R+ +D L +V LK +I A+
Sbjct: 26 DVDELLELDEERRELQTELEELQAERNALSKEIGQAKRKGEDAEALIAEVKELKEEIKAL 85
Query: 85 HTAIRKLEEKERYLQTVLTNV 105
+ +LE + L + N+
Sbjct: 86 EAELDELEAELEELLLRIPNL 106
>gnl|CDD|165237 PHA02926, PHA02926, zinc finger-like protein; Provisional.
Length = 242
Score = 29.6 bits (66), Expect = 1.4
Identities = 16/41 (39%), Positives = 19/41 (46%), Gaps = 5/41 (12%)
Query: 215 RKDAVLTKCFHVFCWDCL----RTRYET-RQRKCPKCNAAF 250
R +L C H+FC C+ RTR ET CP C F
Sbjct: 189 RYFGLLDSCNHIFCITCINIWHRTRRETGASDNCPICRTRF 229
>gnl|CDD|232931 TIGR00344, alaS, alanine--tRNA ligase. The model describes
alanine--tRNA ligase. This enzyme catalyzes the reaction
(tRNAala + L-alanine + ATP = L-alanyl-tRNAala +
pyrophosphate + AMP) [Protein synthesis, tRNA
aminoacylation].
Length = 851
Score = 30.0 bits (68), Expect = 1.6
Identities = 16/66 (24%), Positives = 26/66 (39%), Gaps = 6/66 (9%)
Query: 144 LKEAQQVVTEKTSSLEAE----AYKTKRLQEEIAQLRRKTERMKKIEQAGTLDEVM--ME 197
L E + + E L+ K +R EE L+++ E +K +DE+ E
Sbjct: 703 LNEQEDKLKELADILKVTPNELPKKIERFFEEWKALQKELESLKLKIADQKIDELKDKAE 762
Query: 198 EIREYK 203
I K
Sbjct: 763 TINGVK 768
Score = 28.5 bits (64), Expect = 4.9
Identities = 18/91 (19%), Positives = 42/91 (46%), Gaps = 1/91 (1%)
Query: 68 DILHEQVNALKLQIDAMHTAIRKLEEKERYLQTVLTNVEKEL-HLRNQAMDLNKRKAIES 126
+ L+EQ + LK D + +L +K ++KEL L+ + D + +
Sbjct: 701 EYLNEQEDKLKELADILKVTPNELPKKIERFFEEWKALQKELESLKLKIADQKIDELKDK 760
Query: 127 AQSAADLKLHLEKYHAQLKEAQQVVTEKTSS 157
A++ +K+ +E A+ ++ + + ++ S
Sbjct: 761 AETINGVKVLVEVVDAKDMKSLKTMADRLKS 791
>gnl|CDD|220191 pfam09344, Cas_CT1975, CT1975-like protein. CRISPR is a term for
Clustered, Regularly Interspaced Short Palidromic
Repeats. A number of protein families appear only in
association with these repeats and are designated Cas
(CRISPR-Associated) proteins. This family is represented
by CT1975 of Chlorobium tepidum.
Length = 353
Score = 29.6 bits (67), Expect = 1.7
Identities = 18/54 (33%), Positives = 26/54 (48%), Gaps = 2/54 (3%)
Query: 144 LKEAQQVVTEKTSSLEAEAYKTKRLQEEIAQLRRKTERMKKIEQAGTLDEVMME 197
LK A + E L E +TKRL EE+A +R + + E A L E ++
Sbjct: 40 LKRAIRTSEEFAQLLAEEGVRTKRLAEEVA--KRLEQAGVEEELAKALAEKILG 91
>gnl|CDD|99801 cd06204, CYPOR, NADPH cytochrome p450 reductase (CYPOR) serves as
an electron donor in several oxygenase systems and is a
component of nitric oxide synthases and methionine
synthase reductases. CYPOR transfers two electrons from
NADPH to the heme of cytochrome p450 via FAD and FMN.
Ferredoxin-NADP+ (oxido)reductase is an FAD-containing
enzyme that catalyzes the reversible electron transfer
between NADP(H) and electron carrier proteins such as
ferredoxin and flavodoxin. Isoforms of these
flavoproteins (i.e. having a non-covalently bound FAD as
a prosthetic group) are present in chloroplasts,
mitochondria, and bacteria in which they participate in
a wide variety of redox metabolic pathways. The
C-terminal domain contains most of the NADP(H) binding
residues and the N-terminal domain interacts
non-covalently with the isoalloxazine rings of the
flavin molecule which lies largely in a large gap
betweed the two domains. Ferredoxin-NADP+ reductase
first accepts one electron from reduced ferredoxin to
form a flavin semiquinone intermediate. The enzyme then
accepts a second electron to form FADH2 which then
transfers two electrons and a proton to NADP+ to form
NADPH.
Length = 416
Score = 29.5 bits (67), Expect = 2.0
Identities = 7/45 (15%), Positives = 17/45 (37%), Gaps = 10/45 (22%)
Query: 57 NQLHKLAREEKDILHEQVNALKLQIDAMHTAIRKLEEKERYLQTV 101
L ++ E+ + + ++KL+ + RY + V
Sbjct: 381 KTLLEILAEQGGMTETEAEEY----------VKKLKTRGRYQEDV 415
>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein. This family
consists of several Borrelia P83/P100 antigen proteins.
Length = 489
Score = 29.6 bits (66), Expect = 2.1
Identities = 25/124 (20%), Positives = 51/124 (41%)
Query: 95 ERYLQTVLTNVEKELHLRNQAMDLNKRKAIESAQSAADLKLHLEKYHAQLKEAQQVVTEK 154
++ ++ + + EK ++ R DL +R++ E A+ A LK L+K +AQQ
Sbjct: 180 KKVVEALREDNEKGVNFRRDMTDLKERESQEDAKRAQQLKEELDKKQIDADKAQQKADFA 239
Query: 155 TSSLEAEAYKTKRLQEEIAQLRRKTERMKKIEQAGTLDEVMMEEIREYKETLTCPSCKVK 214
+ + + + ++ Q+E L + + E + E + E +K
Sbjct: 240 QDNADKQRDEVRQKQQEAKNLPKPADTSSPKEDKQVAENQKREIEKAQIEIKKNDEEALK 299
Query: 215 RKDA 218
KD
Sbjct: 300 AKDH 303
>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584). This
protein is found in bacteria and eukaryotes. Proteins in
this family are typically between 943 to 1234 amino
acids in length. This family contains a P-loop motif
suggesting it is a nucleotide binding protein. It may be
involved in replication.
Length = 1198
Score = 29.7 bits (67), Expect = 2.2
Identities = 40/216 (18%), Positives = 82/216 (37%), Gaps = 36/216 (16%)
Query: 9 NEEE--ALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREE 66
NE E L + E Q+ Q+Q L Q + + + E K AR +
Sbjct: 601 NETELRERLQQAEEALQSAVAKQKQAEEQLVQANAELEEQKRAEAEA---RTALKQARLD 657
Query: 67 KDILHEQVNALKLQIDAMHTAIRKLEEKERYLQTVLTNVEKEL------------HLRNQ 114
L + +LK +++ + E+++ +T L ++ +L L++
Sbjct: 658 LQRLQNEQQSLKDKLELA------IAERKQQAETQLRQLDAQLKQLLEQQQAFLEALKDD 711
Query: 115 AMDLNKRKAIESAQSAADLKLHLEKYHAQLKEAQQVVTEKTSSLEAEAYKTKRLQ----- 169
+L + + +L L + A ++ A+ + L+ + Y L
Sbjct: 712 FRELRTERLAKWQVVEGELDNQLAQLSAAIEAARTQAKARLKELK-KQYDR-ELASLDVD 769
Query: 170 -EEIAQLRRKTERMK-KIEQAGTLDEVMMEEIREYK 203
+ +L+R+ E ++ IE+ V E+REY+
Sbjct: 770 PNTVKELKRQIEELETTIERI----AVRRPEVREYR 801
Score = 28.5 bits (64), Expect = 4.8
Identities = 19/139 (13%), Positives = 50/139 (35%), Gaps = 10/139 (7%)
Query: 73 QVNALKLQIDAMHTAIRKLEEKERYLQTVLTNVEKELHLRNQAMDLNKRKAIESAQSAAD 132
V + +++ EE + E++L N ++ KR E+ +
Sbjct: 594 DVPDYAANETELRERLQQAEEALQSAVAKQKQAEEQLVQANAELEEQKRAEAEARTALKQ 653
Query: 133 LKLHLEKYHAQLKEAQQ-VVTEKTSSLEAEAYKTKR----LQEEIAQLRRKTERMKKIEQ 187
+L L+ +L+ QQ + + ++ + + L ++ QL + + +
Sbjct: 654 ARLDLQ----RLQNEQQSLKDKLELAIAERKQQAETQLRQLDAQLKQLLEQQQAFLE-AL 708
Query: 188 AGTLDEVMMEEIREYKETL 206
E+ E + +++
Sbjct: 709 KDDFRELRTERLAKWQVVE 727
>gnl|CDD|178657 PLN03110, PLN03110, Rab GTPase; Provisional.
Length = 216
Score = 28.7 bits (64), Expect = 2.3
Identities = 24/85 (28%), Positives = 41/85 (48%), Gaps = 11/85 (12%)
Query: 23 QAFEDMQEQNSRLLQQLREKDDANFKLMTERIKS--NQLHKLAREEKDILHEQVNALKLQ 80
Q F+++Q R L++LR+ D+N +M KS N L +A E+ L E K
Sbjct: 98 QTFDNVQ----RWLRELRDHADSNIVIMMAGNKSDLNHLRSVAEEDGQALAE-----KEG 148
Query: 81 IDAMHTAIRKLEEKERYLQTVLTNV 105
+ + T+ + E+ QT+L +
Sbjct: 149 LSFLETSALEATNVEKAFQTILLEI 173
>gnl|CDD|220369 pfam09731, Mitofilin, Mitochondrial inner membrane protein.
Mitofilin controls mitochondrial cristae morphology.
Mitofilin is enriched in the narrow space between the
inner boundary and the outer membranes, where it forms a
homotypic interaction and assembles into a large
multimeric protein complex. The first 78 amino acids
contain a typical amino-terminal-cleavable mitochondrial
presequence rich in positive-charged and hydroxylated
residues and a membrane anchor domain. In addition, it
has three centrally located coiled coil domains.
Length = 493
Score = 29.2 bits (66), Expect = 2.3
Identities = 44/196 (22%), Positives = 80/196 (40%), Gaps = 29/196 (14%)
Query: 11 EEALLNEMEVTGQAF-EDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLA------ 63
EE LL +E A + ++ + R ++LR+K + + ER KL
Sbjct: 206 EEELLARLESKEAALEKQLRLEFEREKEELRKKYEEKLRQELERQAEAHEQKLKNELALQ 265
Query: 64 -----REEKDILHEQV----NALKLQIDAMHTAIRKLEEK-ERYLQTVLTNVE-KELHLR 112
RE + E+V N ++ +++ ++ LE+ + + N + ++L L
Sbjct: 266 AIELQREFNKEIKEKVEEERNGRLAKLAELNSRLKGLEKALDSRSEAEDENHKVQQLWLA 325
Query: 113 NQAMDLNKRKAIESAQSAADLKLHLEKYHAQLKEA---QQVVTEKTSSLEAEAYKTKRLQ 169
+A+ + A++S A L K LKE ++V +SL EA + L
Sbjct: 326 VEAL----KSALKSGS--AGSPRPLVKELDALKELAKDDELVDAALASLPPEASQRGILS 379
Query: 170 EEIAQLRRKTERMKKI 185
EE QLR + +
Sbjct: 380 EE--QLRNRFNLLAPE 393
>gnl|CDD|222878 PHA02562, 46, endonuclease subunit; Provisional.
Length = 562
Score = 29.2 bits (66), Expect = 2.5
Identities = 21/131 (16%), Positives = 49/131 (37%), Gaps = 20/131 (15%)
Query: 91 LEEKERYLQTVLTNVEKELHLRNQAMDLNKRKAIES-AQSAADLKLHLEKYHAQLKEAQQ 149
++K R L + ++ ++ Q + + E ++ ++ KY ++EA+
Sbjct: 172 NKDKIRELNQQIQTLDMKIDHIQQQIKTYNKNIEEQRKKNGENIARKQNKYDELVEEAKT 231
Query: 150 VVTEKTSSLEAEAYKTKRLQEEIAQLRRK----TERMKKIEQAGTLDEVMME----EIRE 201
+ E L +E+ L + + K+ A + +E I+
Sbjct: 232 IKAEIE-----------ELTDELLNLVMDIEDPSAALNKLNTAAAKIKSKIEQFQKVIKM 280
Query: 202 YKETLTCPSCK 212
Y++ CP+C
Sbjct: 281 YEKGGVCPTCT 291
>gnl|CDD|220735 pfam10408, Ufd2P_core, Ubiquitin elongating factor core. This is
the most conserved part of the core region of Ufd2P
ubiquitin elongating factor or E4, running from helix
alpha-11 to alpha-38. It consists of 31 helices of
variable length connected by loops of variable size
forming a compact unit; the helical packing pattern of
the compact unit consists of five structural repeats
that resemble tandem Armadillo (ARM) repeats. This
domain is involved in ubiquitination as it binds Cdc48p
and escorts ubiquitinated proteins from Cdc48p to the
proteasome for degradation. The core is structurally
similar to the nuclear transporter protein
importin-alpha. The core is associated with the U-box at
the C-terminus, pfam04564, which has ligase activity.
Length = 625
Score = 29.2 bits (66), Expect = 2.7
Identities = 14/54 (25%), Positives = 28/54 (51%), Gaps = 1/54 (1%)
Query: 28 MQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDILHEQVNALKLQI 81
+ E+ RLL++L+ D +L R QL +L ++ +IL + +L+ +
Sbjct: 186 LIEKYKRLLRELKRLQDELEELEQSRSNWAQLKRL-EKQLEILSAEKLSLEGFL 238
>gnl|CDD|235600 PRK05771, PRK05771, V-type ATP synthase subunit I; Validated.
Length = 646
Score = 29.1 bits (66), Expect = 2.9
Identities = 19/96 (19%), Positives = 39/96 (40%), Gaps = 21/96 (21%)
Query: 87 AIRKLEEKERYLQTVLTNVEKELHLRNQAMDLNKRKAIESAQSAADLKLHLEKYHAQLKE 146
IR+++E+ ++ ++ +EL E A+ + L L +Y E
Sbjct: 216 LIREIKEELEEIEKERESLLEEL--------------KELAKKYLEELLALYEYLEIELE 261
Query: 147 AQQV-----VTEKTSSLEA--EAYKTKRLQEEIAQL 175
+ T+KT ++E + K+L+E I +
Sbjct: 262 RAEALSKFLKTDKTFAIEGWVPEDRVKKLKELIDKA 297
>gnl|CDD|188641 cd00954, NAL, N-Acetylneuraminic acid aldolase, also called
N-acetylneuraminate lyase (NAL). N-Acetylneuraminic
acid aldolase, also called N-acetylneuraminate lyase
(NAL), which catalyses the reversible aldol reaction of
N-acetyl-D-mannosamine and pyruvate to give
N-acetyl-D-neuraminic acid (D-sialic acid). It has a
widespread application as biocatalyst for the synthesis
of sialic acid and its derivatives. This enzyme has been
shown to be quite specific for pyruvate as the donor,
but flexible to a variety of D- and, to some extent,
L-hexoses and pentoses as acceptor substrates. NAL is
member of dihydrodipicolinate synthase family that
comprises several pyruvate-dependent class I aldolases.
Length = 288
Score = 28.4 bits (64), Expect = 3.2
Identities = 10/43 (23%), Positives = 23/43 (53%)
Query: 91 LEEKERYLQTVLTNVEKELHLRNQAMDLNKRKAIESAQSAADL 133
+EE+++ + V + ++ L LN +++ E A+ A +L
Sbjct: 53 VEERKQIAEIVAEAAKGKVTLIAHVGSLNLKESQELAKHAEEL 95
>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein. Function of MutS2 is
unknown. It should not be considered a DNA mismatch
repair protein. It is likely a DNA mismatch binding
protein of unknown cellular function [DNA metabolism,
Other].
Length = 771
Score = 29.0 bits (65), Expect = 3.3
Identities = 27/130 (20%), Positives = 55/130 (42%), Gaps = 11/130 (8%)
Query: 60 HKLAREEKDILHEQVNALKLQIDAMHTAIRKLEEKERYLQTVLTNVEKELH-LRNQAMDL 118
H + + K E + + I+ + ++LE+K +L+ +L EK L + +L
Sbjct: 496 HFIIEQAKTFYGEFKEEINVLIEKLSALEKELEQKNEHLEKLLKEQEKLKKELEQEMEEL 555
Query: 119 NKRKAIESAQSAADLKLHLEKYHAQ-LKEAQQVVTEK-TSSLEAEAYKTKRLQEEIAQLR 176
+R+ + KL LEK + LK ++ V E + +K K ++ ++
Sbjct: 556 KERE--------RNKKLELEKEAQEALKALKKEVESIIRELKEKKIHKAKEIKSIEDLVK 607
Query: 177 RKTERMKKIE 186
K + K +
Sbjct: 608 LKETKQKIPQ 617
>gnl|CDD|149785 pfam08838, DUF1811, Protein of unknown function (DUF1811). This is
a bacterial family of uncharacterized proteins. Some of
the proteins are annotated as being transcriptional
regulators. The structure of one of the proteins in this
family has revealed a beta-barrel like structure with
helix-turn-helix like motif.
Length = 102
Score = 27.3 bits (61), Expect = 3.4
Identities = 11/27 (40%), Positives = 17/27 (62%), Gaps = 3/27 (11%)
Query: 168 LQEEIAQLRRKTERMKKIEQAGTLDEV 194
L++EIA L E+ +K EQ G ++E
Sbjct: 12 LRQEIALL---KEKARKAEQLGIVNEF 35
>gnl|CDD|222465 pfam13942, Lipoprotein_20, YfhG lipoprotein. This family includes
the YfhG protein from E. coli. Members of this family
have an N-terminal lipoprotein attachment site. The
members of this family are functionally uncharacterized.
Length = 180
Score = 28.0 bits (63), Expect = 3.5
Identities = 26/81 (32%), Positives = 39/81 (48%), Gaps = 11/81 (13%)
Query: 35 LLQQLREKDDANFKLMTERIKSNQLHKLAREEKDILHEQVNALKLQIDAMHTAIRKLEEK 94
LLQ RE +L ER + +L + + E D L EQ L+ +++ T RKLE
Sbjct: 100 LLQLWREGQVLQLQLAEERARYQRLQQSSDSELDRLREQQARLQYELE---TTTRKLEN- 155
Query: 95 ERYLQTVLTNVEKELHLRNQA 115
LT++E++L R Q
Sbjct: 156 -------LTDIERQLSSRKQG 169
>gnl|CDD|112645 pfam03841, SelA, L-seryl-tRNA selenium transferase.
Length = 367
Score = 28.4 bits (64), Expect = 3.7
Identities = 33/135 (24%), Positives = 55/135 (40%), Gaps = 21/135 (15%)
Query: 48 KLMTERIKSNQLHKLAREEKDILHEQVNALKLQIDAMHTAIRKLEEKERYLQTVLTNVEK 107
K + ER+K N L + R +K L L+L ++ KL EK L+ +LT K
Sbjct: 227 KELIERLKKNPLKRALRVDKLTLAALEATLRLYLNP-----EKLYEKIPTLR-LLTQPLK 280
Query: 108 ELHLRNQAMDLNKRKAIESAQSAADLKLHLEKYHAQLKEAQQVVTE----------KTSS 157
EL +A L K A+ + ++ +E +Q+ V E + +
Sbjct: 281 ELRA--KAERLQKLL---KARLGSGAEVSVEPSFSQIGGGSLPVKELPSAAVTIRPENMA 335
Query: 158 LEAEAYKTKRLQEEI 172
L+A + + L I
Sbjct: 336 LDALKLRLRLLDPPI 350
>gnl|CDD|118696 pfam10168, Nup88, Nuclear pore component. Nup88 can be divided
into two structural domains; the N-terminal two-thirds
of the protein has no obvious structural motifs but is
the region for binding to Nup98, one of the components
of the nuclear pore. the C-terminal end is a predicted
coiled-coil domain. Nup88 is overexpressed in tumour
cells.
Length = 717
Score = 28.7 bits (64), Expect = 3.8
Identities = 35/162 (21%), Positives = 63/162 (38%), Gaps = 31/162 (19%)
Query: 27 DMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDILHEQ---VNALKLQIDA 83
+E+ +Q RE+ K ++ER KLA + ++ + Q VN K + +
Sbjct: 572 LQKEKQLEDIQDCREER----KSLSERA-----EKLAEKFEEAKYNQELLVNRCKRLLQS 622
Query: 84 MHTAIRKLEEKERYLQTVLTNVEKEL-HLRNQAMDLNKRKAIESAQSAADLKLHLEKYHA 142
++ + L + ER + L + K+L HL N + K+K + A+
Sbjct: 623 ANSQLPVLSDSERDMSKELQRINKQLQHLANGIKQVKKKKNYQRYHMASQESP------- 675
Query: 143 QLKEAQQVVTEKTSSLEAEAYKTKRLQEEIAQLRRKTERMKK 184
K SS + K + E + +L +RM K
Sbjct: 676 -----------KKSSYTLPEKQHKTITEILKELGEHIDRMIK 706
>gnl|CDD|226107 COG3579, PepC, Aminopeptidase C [Amino acid transport and
metabolism].
Length = 444
Score = 28.6 bits (64), Expect = 3.9
Identities = 15/63 (23%), Positives = 25/63 (39%)
Query: 137 LEKYHAQLKEAQQVVTEKTSSLEAEAYKTKRLQEEIAQLRRKTERMKKIEQAGTLDEVMM 196
EKY K ++S E A K L+++ LR + + L E ++
Sbjct: 148 FEKYGVVPKSVYPESFSSSNSRELNALLNKLLRQDAQILRDALKEGADDDTVEALKEELL 207
Query: 197 EEI 199
+EI
Sbjct: 208 QEI 210
>gnl|CDD|232933 TIGR00348, hsdR, type I site-specific deoxyribonuclease, HsdR
family. This gene is part of the type I restriction and
modification system which is composed of three
polypeptides R (restriction endonuclease), M
(modification) and S (specificity). This group of
enzymes recognize specific short DNA sequences and have
an absolute requirement for ATP (or dATP) and
S-adenosyl-L-methionine. They also catalyse the
reactions of EC 2.1.1.72 and EC 2.1.1.73, with similar
site specificity.(J. Mol. Biol. 271 (3), 342-348
(1997)). Members of this family are assumed to differ
from each other in DNA site specificity [DNA metabolism,
Restriction/modification].
Length = 667
Score = 28.5 bits (64), Expect = 4.1
Identities = 16/112 (14%), Positives = 36/112 (32%), Gaps = 2/112 (1%)
Query: 52 ERIKSNQLHKLAREEKDILHEQVNA--LKLQIDAMHTAIRKLEEKERYLQTVLTNVEKEL 109
E + +E L + K + +K+ E L+++ ++ +
Sbjct: 447 EDHLDKKKLDAFFDEIFELLPERIREITKESLKEKLQKTKKILFNEDRLESIAKDIAEHY 506
Query: 110 HLRNQAMDLNKRKAIESAQSAADLKLHLEKYHAQLKEAQQVVTEKTSSLEAE 161
+ S + + K L++ + EA +V S +AE
Sbjct: 507 AKFKELFKFKAMVVAISRYACVEEKNALDEELNEKFEASAIVMTGKESDDAE 558
>gnl|CDD|216560 pfam01544, CorA, CorA-like Mg2+ transporter protein. The CorA
transport system is the primary Mg2+ influx system of
Salmonella typhimurium and Escherichia coli. CorA is
virtually ubiquitous in the Bacteria and Archaea. There
are also eukaryotic relatives of this protein. The
family includes the MRS2 protein from yeast that is
thought to be an RNA splicing protein. However its
membership of this family suggests that its effect on
splicing is due to altered magnesium levels in the cell.
Length = 291
Score = 28.0 bits (63), Expect = 4.2
Identities = 26/156 (16%), Positives = 54/156 (34%), Gaps = 33/156 (21%)
Query: 39 LREKDDANFKLMTERIKSNQLHKLAREEKDILHEQVNALKLQIDAMHTAIRKLEEKERYL 98
+R + + +R+++ L D+L+ ++++ +D + KLEE+
Sbjct: 83 VRREPLPAVDEIRKRLENLGLGPK--SPGDLLYLLLDSI---VDRYFEILEKLEEE---- 133
Query: 99 QTVLTNVEKELHLRNQAMDLNKRKAIESAQSAADLKLHLEKYHAQLKEAQQVVT------ 152
L +E EL L + L+ L + L ++V+
Sbjct: 134 ---LDELEDELEDSTTNELL---------RELLRLRRSLVRLRRSLAPLREVLNRLLSDD 181
Query: 153 ------EKTSSLEAEAYKTKRLQEEIAQLRRKTERM 182
E+ L + +RL E LR + +
Sbjct: 182 GPLIDEEEKEYLRDLLDELERLLEMAEILRERLRSL 217
>gnl|CDD|218186 pfam04636, PA26, PA26 p53-induced protein (sestrin). PA26 is a
p53-inducible protein. Its function is unknown. It has
similarity to pfam04636 in its N-terminus.
Length = 450
Score = 28.1 bits (63), Expect = 4.5
Identities = 14/36 (38%), Positives = 20/36 (55%), Gaps = 5/36 (13%)
Query: 180 ERMKKIEQAGTLDEVMMEEI-----REYKETLTCPS 210
ERMKK+++ DE EE+ +E KE+L S
Sbjct: 234 ERMKKLQEERDDDEASQEEMTTRFEKEKKESLLVES 269
>gnl|CDD|234701 PRK00252, alaS, alanyl-tRNA synthetase; Reviewed.
Length = 865
Score = 28.5 bits (65), Expect = 4.5
Identities = 26/113 (23%), Positives = 48/113 (42%), Gaps = 15/113 (13%)
Query: 20 VTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDILHEQVNALKL 79
VTG+A + + LL++L A + ER+++ EE L +++ LK
Sbjct: 687 VTGEAALEYLNEQEALLKELAALLKAKPSELPERVEALL------EELKELEKELEQLKA 740
Query: 80 QIDAMHTAIRKLEEKE-----RYLQTVLTNVEKELHLRNQAMDLNKRKAIESA 127
++ A A L + + + L + V+ + LR A DL + + S
Sbjct: 741 KL-AAAAAGDLLAQAKEVNGVKVLAAQVEGVDAK-ALRTLADDL--KDKLGSG 789
Score = 27.7 bits (63), Expect = 8.4
Identities = 17/86 (19%), Positives = 36/86 (41%), Gaps = 23/86 (26%)
Query: 136 HLEKYHAQLKEAQQV-------VTEKTSSLEAEAYKTKRLQEEIAQLRRK------TERM 182
+L + A LKE + + E+ +L E K L++E+ QL+ K + +
Sbjct: 695 YLNEQEALLKELAALLKAKPSELPERVEALLEE---LKELEKELEQLKAKLAAAAAGDLL 751
Query: 183 KKIEQ-------AGTLDEVMMEEIRE 201
+ ++ A ++ V + +R
Sbjct: 752 AQAKEVNGVKVLAAQVEGVDAKALRT 777
>gnl|CDD|234173 TIGR03346, chaperone_ClpB, ATP-dependent chaperone ClpB. Members
of this protein family are the bacterial ATP-dependent
chaperone ClpB. This protein belongs to the AAA family,
ATPases associated with various cellular activities
(pfam00004). This molecular chaperone does not act as a
protease, but rather serves to disaggregate misfolded
and aggregated proteins [Protein fate, Protein folding
and stabilization].
Length = 852
Score = 28.4 bits (64), Expect = 4.7
Identities = 27/113 (23%), Positives = 52/113 (46%), Gaps = 17/113 (15%)
Query: 68 DILHEQVNALKLQIDAMHTAIRKLEEKERYLQTVLTNVEKELHLRNQAMDLNKRKAIESA 127
D++ E ++++ID+ + +L+ R +Q L + L K K S
Sbjct: 387 DLIDEAAARIRMEIDSKPEELDELDR--RIIQ-----------LEIEREALKKEKDEASK 433
Query: 128 QSAADLKLHLEKYHAQLKEAQQVVTEKTSSLEAEAYKTKRLQEEIAQLRRKTE 180
+ DL EK A+L+E + E+ + +A ++++EEI Q+R + E
Sbjct: 434 ERLEDL----EKELAELEEEYADLEEQWKAEKAAIQGIQQIKEEIEQVRLELE 482
>gnl|CDD|114855 pfam06160, EzrA, Septation ring formation regulator, EzrA. During
the bacterial cell cycle, the tubulin-like cell-division
protein FtsZ polymerises into a ring structure that
establishes the location of the nascent division site.
EzrA modulates the frequency and position of FtsZ ring
formation.
Length = 559
Score = 28.3 bits (64), Expect = 4.8
Identities = 37/178 (20%), Positives = 75/178 (42%), Gaps = 29/178 (16%)
Query: 12 EALLNEMEVTGQAFEDMQE-----QNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREE 66
E L+E+E + F ++ E + +L +L E+ DA + M E I
Sbjct: 167 EKQLDELEEEFEQFVELTESGDYLEAREVLLKLEEETDALEQKM-EEIP----------- 214
Query: 67 KDILHEQVNALKLQIDAMHTAIRKLEEKERYLQTVLTNVEKELHLRNQAMDLNKRKAIES 126
+L E N Q++ + R++ E+ + + ++EKEL + +D N +E
Sbjct: 215 -PLLKELQNEFPDQLEELKAGYREMTEEGYHFDHL--DIEKELQDLKEQIDQNL-ALLE- 269
Query: 127 AQSAADLKLHLEKYHAQLKEAQQVVTEKTSSLEAEAYKTKRLQEEIAQLRRKTERMKK 184
+L L++ + +E ++ + LE E K +++ I +L E ++
Sbjct: 270 -------ELDLDEAEEENEEIEERIDTLYDILEKEVKAKKFVEKNIDKLTDFLEHARE 320
>gnl|CDD|205692 pfam13514, AAA_27, AAA domain. This domain is found in a number of
double-strand DNA break proteins. This domain contains a
P-loop motif.
Length = 1118
Score = 28.2 bits (63), Expect = 5.3
Identities = 32/192 (16%), Positives = 63/192 (32%), Gaps = 36/192 (18%)
Query: 61 KLAREEKDILHEQVNALKLQIDAMHTAIRKLEEKERY-----LQTVLTNVEKELHLRNQA 115
+ A +E ++ + ++ A+ A R +E T ++ L
Sbjct: 818 EEAEKEVSEAAAALDEAEARLTALLRAARCTTIEELLAAVERSDTYRELRKRIAALERTL 877
Query: 116 MDLNKRKAIESAQSAAD-------------LKLHLEKYHAQLKEAQQVVTEKTSSLEAEA 162
+ ++E+ + A L +E+ +L E Q V L
Sbjct: 878 VRAGGGLSLEALVAEAAALDPDELPARLEELARDIEELEEELNELAQEVGAAKQELARMD 937
Query: 163 YKTK------RLQEEIAQLRRKTERMKKIEQAGTLDEVMMEEIREYKETLTCPSCKVKRK 216
+ + +AQLR ER ++ A L + + I Y+E +R+
Sbjct: 938 GGSTAAELEAERESLLAQLRDLAERYLELALASRL---LRKAIERYRE---------QRQ 985
Query: 217 DAVLTKCFHVFC 228
VL + +F
Sbjct: 986 PPVLNRAGEIFA 997
>gnl|CDD|225177 COG2268, COG2268, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 548
Score = 28.3 bits (63), Expect = 5.3
Identities = 37/195 (18%), Positives = 66/195 (33%), Gaps = 19/195 (9%)
Query: 10 EEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDI 69
+EE E EV + +E L+++ +E + N + A+EE
Sbjct: 301 QEEKAQAEQEVQHAKALEAREMRVGLIERQKETELEP---QERSYFINAAQRQAQEEAKA 357
Query: 70 LHEQVNALKLQIDAMHTAIRKLEEKERYLQTVLTNVEKELHLRNQAMDLNKRKAIESAQS 129
A+ Q +A R+ EE ER Q L ++ I
Sbjct: 358 AANIAEAIGAQAEAAVETARETEEAERAEQAALVAA--------AEAAEQEQVEIAVRAE 409
Query: 130 AADLKLHLEKYHAQLKEAQQVVTEKTSSLEAEAYKTKRLQEEIAQLRRKTERMKK---IE 186
AA + + A++K + + EK +AEA + L E I L ++
Sbjct: 410 AAKAEAEAQA--AEIKAEAEAIREKG---KAEAEAKRALAEAIQVLGDAAAAELFKALVQ 464
Query: 187 QAGTLDEVMMEEIRE 201
+ E + ++
Sbjct: 465 ALPEVAEEAAQPMKN 479
>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y. Members of this family are
RNase Y, an endoribonuclease. The member from Bacillus
subtilis, YmdA, has been shown to be involved in
turnover of yitJ riboswitch [Transcription, Degradation
of RNA].
Length = 514
Score = 28.0 bits (63), Expect = 5.8
Identities = 28/137 (20%), Positives = 60/137 (43%), Gaps = 24/137 (17%)
Query: 72 EQVNALKLQIDAMHTAIR-KLEEKERYLQTVLTNVEKELHLRNQAMDLNKRKAIESAQSA 130
E+V+ L+ +++ R +L+ ER L ++E L + L+K++
Sbjct: 58 EEVHKLRAELERELKERRNELQRLERRLL------QREETLDRKMESLDKKEE------- 104
Query: 131 ADLKLHLEKYHAQLKEAQQVVTEKTSSLEAEAYKTKRLQEEIAQLRR---KTERMKKIEQ 187
+LEK +L ++ + EK LE + + E I+ L + K ++++E+
Sbjct: 105 -----NLEKKEKELSNKEKNLDEKEEELEELIAEQREELERISGLTQEEAKEILLEEVEE 159
Query: 188 AGTLDEVMMEEIREYKE 204
+ + I+E +E
Sbjct: 160 EARHEAAKL--IKEIEE 174
>gnl|CDD|223783 COG0711, AtpF, F0F1-type ATP synthase, subunit b [Energy production
and conversion].
Length = 161
Score = 27.3 bits (61), Expect = 5.8
Identities = 15/64 (23%), Positives = 33/64 (51%), Gaps = 2/64 (3%)
Query: 125 ESAQSAADLKLHLEKYHAQLKEAQQVVTEKTSSLEAEAYKTKRLQEEIAQLRRKTERMKK 184
E+ + + + L +Y +L+EA++ +E + EA + +E A+ + ER+K+
Sbjct: 48 EAERLKEEAQALLAEYEQELEEAREQASEIIEQAKKEAEQIA--EEIKAEAEEELERIKE 105
Query: 185 IEQA 188
+A
Sbjct: 106 AAEA 109
>gnl|CDD|226306 COG3783, CybC, Soluble cytochrome b562 [Energy production and
conversion].
Length = 100
Score = 26.8 bits (59), Expect = 5.8
Identities = 21/73 (28%), Positives = 33/73 (45%)
Query: 122 KAIESAQSAADLKLHLEKYHAQLKEAQQVVTEKTSSLEAEAYKTKRLQEEIAQLRRKTER 181
K +E A AA +K LEK A +AQ+ K ++ + K + L + ++
Sbjct: 9 KVVEKADKAAVIKAALEKMRAAALDAQKATPPKLEGKYPDSPEMKDYRHGFDILIGQIDK 68
Query: 182 MKKIEQAGTLDEV 194
K+ Q G LDE
Sbjct: 69 ADKLAQEGNLDEA 81
>gnl|CDD|184546 PRK14163, PRK14163, heat shock protein GrpE; Provisional.
Length = 214
Score = 27.8 bits (61), Expect = 5.9
Identities = 19/58 (32%), Positives = 30/58 (51%), Gaps = 11/58 (18%)
Query: 142 AQLKEAQQVVTEKTSSLEAEAYKTKRLQEEIAQLRRKTER----MKKIEQAGTLDEVM 195
AQL + + + E+T+ L+ RLQ E RR+ ER +K+I A L E++
Sbjct: 47 AQLDQVRTALGERTADLQ-------RLQAEYQNYRRRVERDRVTVKEIAVANLLSELL 97
>gnl|CDD|113545 pfam04778, LMP, LMP repeated region. This family consists of a
repeated sequence element found in the LMP group of
surface-located membrane proteins of Mycoplasma hominis.
The the number of repeats in the protein affects the
tendency of cells to spontaneously aggregate.
Agglutination may be an important factor in
colonisation. Non-agglutinating microorganisms might
easily be distributed whereas aggregation might provide
a better chance to avoid an antibody response since some
of the epitopes may be buried.
Length = 157
Score = 27.2 bits (60), Expect = 6.0
Identities = 16/50 (32%), Positives = 30/50 (60%)
Query: 86 TAIRKLEEKERYLQTVLTNVEKELHLRNQAMDLNKRKAIESAQSAADLKL 135
T + KL + + +Q T ++KE+ NQA+ N +++SA+S+ D K+
Sbjct: 4 TLLAKLTDNDNTIQKAKTELDKEIQKANQAVASNNTASMQSAKSSLDAKV 53
>gnl|CDD|234734 PRK00349, uvrA, excinuclease ABC subunit A; Reviewed.
Length = 943
Score = 28.1 bits (64), Expect = 6.3
Identities = 11/23 (47%), Positives = 14/23 (60%)
Query: 193 EVMMEEIREYKETLTCPSCKVKR 215
E + EE+ +Y CPSCK KR
Sbjct: 394 EYVREELEKYMSERPCPSCKGKR 416
>gnl|CDD|113156 pfam04375, HemX, HemX. This family consists of several bacterial
HemX proteins. The hemX gene is not essential for haem
synthesis in B. subtilis. HemX is a polytopic membrane
protein which by an unknown mechanism down-regulates the
level of HemA.
Length = 372
Score = 27.6 bits (61), Expect = 7.0
Identities = 19/85 (22%), Positives = 35/85 (41%), Gaps = 8/85 (9%)
Query: 23 QAFEDMQEQNSRL---LQQLREKDDANFKLMTERIKSNQLHKLAREEKDI--LHEQVNAL 77
Q +Q + L L QL + +A + + LA+ EK + L + + L
Sbjct: 53 QQVAGLQIKLEALAQELTQLAQALEAQLAQEVRVLIRDNQEALAQLEKVLGQLDKSLAEL 112
Query: 78 KLQIDAMHTAIRK---LEEKERYLQ 99
+ ++ A+ A R L E + L+
Sbjct: 113 QDRVAAISGANRNDWLLAEADFLLK 137
>gnl|CDD|224337 COG1419, FlhF, Flagellar GTP-binding protein [Cell motility and
secretion].
Length = 407
Score = 27.7 bits (62), Expect = 7.4
Identities = 21/94 (22%), Positives = 40/94 (42%), Gaps = 3/94 (3%)
Query: 111 LRNQAMDLNKRKAIESAQSAAD-LKLHLEKYHAQLKEAQQVVTEKTSSLEAEAYKTKRLQ 169
+ ++ R+A + + A LK EK + K A+++ S +E + + K
Sbjct: 57 VDSELRKFQTREAAKVKDAEAQLLKDPAEKKREERKAAKKIERSTPSLIERKTQEVKDSG 116
Query: 170 EEIAQLRR--KTERMKKIEQAGTLDEVMMEEIRE 201
EEIA++ R K + E + +EI+
Sbjct: 117 EEIAEMMRDEKVPIRELEEIPPEFVALYKQEIQS 150
>gnl|CDD|224212 COG1293, COG1293, Predicted RNA-binding protein homologous to
eukaryotic snRNP [Transcription].
Length = 564
Score = 27.7 bits (62), Expect = 7.7
Identities = 19/113 (16%), Positives = 36/113 (31%), Gaps = 11/113 (9%)
Query: 12 EALLNEMEVTGQAFEDMQEQNSRLLQQL-----------REKDDANFKLMTERIKSNQLH 60
E L + E + E E+ + + L + A+F E
Sbjct: 299 ENKLEKQEDELEELEKAAEELRQKGELLYANLQLIEEGLKSVRLADFYGNEEIKIELDKS 358
Query: 61 KLAREEKDILHEQVNALKLQIDAMHTAIRKLEEKERYLQTVLTNVEKELHLRN 113
K E ++ LK + + +L+E Y ++ T +EK +
Sbjct: 359 KTPSENAQRYFKKYKKLKGAKVNLDRQLSELKEAIAYYESAKTALEKAEGKKA 411
>gnl|CDD|179699 PRK03992, PRK03992, proteasome-activating nucleotidase;
Provisional.
Length = 389
Score = 27.5 bits (62), Expect = 7.8
Identities = 10/36 (27%), Positives = 17/36 (47%)
Query: 64 REEKDILHEQVNALKLQIDAMHTAIRKLEEKERYLQ 99
E L EQ+ L+L++ + KLE + L+
Sbjct: 7 EERNSELEEQIRQLELKLRDLEAENEKLERELERLK 42
>gnl|CDD|234162 TIGR03299, LGT_TIGR03299, phage/plasmid-like protein TIGR03299.
Members of this uncharacterized protein family are found
in various Mycobacterium phage genomes, in Streptomyces
coelicolor plasmid SCP1, and in bacterial genomes near
various markers that suggest lateral gene transfer. The
function is unknown [Mobile and extrachromosomal element
functions, Other].
Length = 309
Score = 27.4 bits (61), Expect = 8.1
Identities = 18/60 (30%), Positives = 29/60 (48%), Gaps = 9/60 (15%)
Query: 136 HLEKYHAQLKEAQQVVTEKTSSLEAEAYKTKRLQEEIAQLRRKTERMKKIEQAGTLDEVM 195
H K+ A+L EA+Q + + A + KRL E R ++R + A LD+V+
Sbjct: 177 HTAKFDARLDEAKQALGIVVKQFDQFAAEAKRLAE-----RPVSDR----QFAAFLDDVL 227
>gnl|CDD|144972 pfam01576, Myosin_tail_1, Myosin tail. The myosin molecule is a
multi-subunit complex made up of two heavy chains and
four light chains it is a fundamental contractile
protein found in all eukaryote cell types. This family
consists of the coiled-coil myosin heavy chain tail
region. The coiled-coil is composed of the tail from two
molecules of myosin. These can then assemble into the
macromolecular thick filament. The coiled-coil region
provides the structural backbone the thick filament.
Length = 859
Score = 27.7 bits (62), Expect = 8.6
Identities = 48/257 (18%), Positives = 98/257 (38%), Gaps = 64/257 (24%)
Query: 12 EALLNEMEVT----GQAFEDMQEQNSRLLQQLRE-----------KDDA--NFKLMTERI 54
E +NE+E+ +A + Q+ + QQ++E ++DA + R
Sbjct: 552 EGDINELEIALDHANKANAEAQKNVKKYQQQVKELQTQVEEEQRAREDAREQLAVAERRA 611
Query: 55 K---------------SNQLHKLAREEKDILHEQVNALKLQIDAMHTAIRKLEEKERYLQ 99
+ + K A E E+VN L Q ++ RKLE + L
Sbjct: 612 TALEAELEELRSALEQAERARKQAETELAEASERVNELTAQNSSLIAQKRKLEGE---LA 668
Query: 100 TVLTNVEKELHLRNQAMDLNKRKAIESAQSAADLK------LHLEKYHAQLKEAQQVVTE 153
+ +++++ ++ A + K+ ++A+ A +L+ HLE+ QL+ + +
Sbjct: 669 ALQSDLDEAVNELKAAEERAKKAQADAARLAEELRQEQEHSQHLERLRKQLESQVKELQV 728
Query: 154 KTSSLEAEAYK-----------------------TKRLQEEIAQLRRKTERMKKIEQAGT 190
+ EA A K +R E LR+ R+K+++
Sbjct: 729 RLDEAEAAALKGGKKMIQKLEARVRELEAELDGEQRRHAETQKNLRKMERRVKELQFQVE 788
Query: 191 LDEVMMEEIREYKETLT 207
D+ +E +++ + L
Sbjct: 789 EDKKNLERLQDLVDKLQ 805
>gnl|CDD|219342 pfam07227, DUF1423, Protein of unknown function (DUF1423). This
family represents a conserved region approximately 500
residues long within a number of Arabidopsis thaliana
proteins of unknown function.
Length = 446
Score = 27.4 bits (61), Expect = 8.7
Identities = 39/146 (26%), Positives = 58/146 (39%), Gaps = 29/146 (19%)
Query: 72 EQVNALKLQIDAMHTAIRKLEEKERYLQTVLTNVEKELHLRNQAMDLNKRKAIESAQSAA 131
E++ LK A+ R+LEEK R + EL + Q K+ I+ +S
Sbjct: 312 EKMRMLKKARSALDACDRELEEKAREVS--------ELKMERQR----KKPQIDELESIV 359
Query: 132 DLKLHLEKYHAQLK------EA---QQVVTEKTSSLE---AEAYKTKRLQEEIAQLRRKT 179
LK E QLK EA Q++ KT E A Y RL E + +
Sbjct: 360 RLK-QAEADMFQLKADEARREAERLQRIALAKTEKSEEEYASKYLKLRLSEAEEERQYLF 418
Query: 180 ERMKKIEQAG----TLDEVMMEEIRE 201
E +K E++ ++ M +IR
Sbjct: 419 EELKLQEESHRDSDSMKMRMESKIRG 444
>gnl|CDD|216531 pfam01496, V_ATPase_I, V-type ATPase 116kDa subunit family. This
family consists of the 116kDa V-type ATPase (vacuolar
(H+)-ATPases) subunits, as well as V-type ATP synthase
subunit i. The V-type ATPases family are proton pumps
that acidify intracellular compartments in eukaryotic
cells for example yeast central vacuoles,
clathrin-coated and synaptic vesicles. They have
important roles in membrane trafficking processes. The
116kDa subunit (subunit a) in the V-type ATPase is part
of the V0 functional domain responsible for proton
transport. The a subunit is a transmembrane glycoprotein
with multiple putative transmembrane helices it has a
hydrophilic amino terminal and a hydrophobic carboxy
terminal. It has roles in proton transport and assembly
of the V-type ATPase complex. This subunit is encoded by
two homologous gene in yeast VPH1 and STV1.
Length = 707
Score = 27.3 bits (61), Expect = 9.0
Identities = 33/171 (19%), Positives = 62/171 (36%), Gaps = 14/171 (8%)
Query: 25 FEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDILHEQVNALKLQIDAM 84
ED+ + + ++R D+ KL K KL KD + + +
Sbjct: 16 NEDVSAFQRKFVNEVRRCDEVERKLRKLESKIK---KLGIPLKDTGGKPDVPPSKEFLDL 72
Query: 85 HTAIRKLEEKERYLQTVLTNVEKELH-LRNQAMDLNKRK-----AIESAQSAADLKLHLE 138
I LE + + ++ L ++EKE++ L L++ K +E ++L + +
Sbjct: 73 EEEILDLEAEIKEVEENLESLEKEINELEEWLNVLDEEKSFLDENLEELSELSNLDIDFK 132
Query: 139 KYHAQLKEAQQVVT-----EKTSSLEAEAYKTKRLQEEIAQLRRKTERMKK 184
V EK + E E ++ R A++ E KK
Sbjct: 133 YLRGAEGLKLGFVAGVINREKLEAFERELWRACRGYIRQAEIEEPLEDPKK 183
>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
envelope biogenesis, outer membrane].
Length = 387
Score = 27.2 bits (60), Expect = 9.2
Identities = 30/160 (18%), Positives = 60/160 (37%), Gaps = 6/160 (3%)
Query: 29 QEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDILHEQVNALKLQIDAMHTAI 88
Q+ QQ K + E + +L E++ L + L+ +
Sbjct: 62 QQYGRIQSQQSSAKKGEQQRKKKEEQVAEELKPKQAAEQERLKQ----LEKERLKAQEQQ 117
Query: 89 RKLEEKERYLQTVLTNVEKELHLRNQAMDLNKRKAIESAQSAADLKLHLEKYHAQLKEAQ 148
++ EE E+ Q L ++E R A + K+ A++AA+ A+ K +
Sbjct: 118 KQAEEAEK--QAQLEQKQQEEQARKAAAEQKKKAEAAKAKAAAEAAKLKAAAEAKKKAEE 175
Query: 149 QVVTEKTSSLEAEAYKTKRLQEEIAQLRRKTERMKKIEQA 188
+ + +AEA K+ E A+ + + + +A
Sbjct: 176 AAKAAEEAKAKAEAAAAKKKAEAEAKAAAEKAKAEAEAKA 215
>gnl|CDD|214763 smart00647, IBR, In Between Ring fingers. the domains occurs
between pairs og RING fingers.
Length = 64
Score = 25.1 bits (55), Expect = 9.3
Identities = 5/13 (38%), Positives = 6/13 (46%)
Query: 222 KCFHVFCWDCLRT 234
KC FC+ C
Sbjct: 45 KCGFSFCFRCKVP 57
>gnl|CDD|153280 cd07596, BAR_SNX, The Bin/Amphiphysin/Rvs (BAR) domain of Sorting
Nexins. BAR domains are dimerization, lipid binding and
curvature sensing modules found in many different
proteins with diverse functions. Sorting nexins (SNXs)
are Phox homology (PX) domain containing proteins that
are involved in regulating membrane traffic and protein
sorting in the endosomal system. SNXs differ from each
other in their lipid-binding specificity, subcellular
localization and specific function in the endocytic
pathway. A subset of SNXs also contain BAR domains. The
PX-BAR structural unit determines the specific membrane
targeting of SNXs. BAR domains form dimers that bind to
membranes, induce membrane bending and curvature, and
may also be involved in protein-protein interactions.
Length = 218
Score = 26.9 bits (60), Expect = 9.3
Identities = 28/160 (17%), Positives = 54/160 (33%), Gaps = 20/160 (12%)
Query: 9 NEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKD 68
L + + G+A + + + +L E K E A +E
Sbjct: 32 KRRRELGSALGEFGKALIKLAKCEEEVGGELGEALSKLGKAAEELSSL--SEAQANQELV 89
Query: 69 ILHEQVN-------ALKLQIDAMHTAIRKLEEKERYLQTVLTNVEKELHLRNQAMDLNKR 121
L E + A+K +D A+ L+ ++ L + +EK
Sbjct: 90 KLLEPLKEYLRYCQAVKETLDDRADALLTLQSLKKDLASKKAQLEKLKAAPGIKPA---- 145
Query: 122 KAIESAQSAADLKLHLEKYHAQLKEAQQVVTEKTSSLEAE 161
+L+ LE+ + L+EA++ E + L+ E
Sbjct: 146 -------KVEELEEELEEAESALEEARKRYEEISERLKEE 178
>gnl|CDD|185747 cd09234, V_HD-PTP_like, Protein-interacting V-domain of mammalian
His-Domain type N23 protein tyrosine phosphatase and
related domains. This family contains the V-shaped (V)
domain of mammalian His-Domain type N23 protein tyrosine
phosphatase (HD-PTP, also known as PTPN23) and related
domains. It belongs to the V_Alix_like superfamily which
includes the V domains of Bro1 and Rim20 (also known as
PalA) from Saccharomyces cerevisiae, mammalian Alix
(apoptosis-linked gene-2 interacting protein X/ also
known as apoptosis-linked gene-2 interacting protein 1,
AIP1), and related domains. HD_PTP interacts with the
ESCRT (Endosomal Sorting Complexes Required for
Transport) system, and participates in cell migration
and endosomal trafficking. The related Alix V-domain
(belonging to a different family in this superfamily)
contains a binding site, partially conserved in the
superfamily, for the retroviral late assembly (L) domain
YPXnL motif. The Alix V-domain is also a dimerization
domain. In addition to the V-domain, HD_PTP also has an
N-terminal Bro1-like domain, a proline-rich region
(PRR), a catalytically inactive tyrosine phosphatase
domain, and a region containing a PEST motif. Bro1-like
domains bind components of the ESCRT-III complex,
specifically to CHMP4 in the case of HD-PTP. The
Bro1-like domain of HD-PTP can also bind human
immunodeficiency virus type 1 (HIV-1) nucleocapsid.
HD-PTP is encoded by the PTPN23 gene, a tumor suppressor
gene candidate frequently absent in human kidney,
breast, lung, and cervical tumors. This family also
contains Drosophila Myopic, which promotes epidermal
growth factor receptor (EGFR) signaling, and
Caenorhabditis elegans (enhancer of glp-1) EGO-2 which
promotes Notch signaling.
Length = 337
Score = 27.3 bits (61), Expect = 9.4
Identities = 16/46 (34%), Positives = 26/46 (56%), Gaps = 3/46 (6%)
Query: 9 NEEEALLNEMEVTGQAFEDMQEQNSRLLQQLRE---KDDANFKLMT 51
E+EA+ E++ +M++Q L QQLR+ +DD KL+T
Sbjct: 184 PEDEAIEKELKRILNKVNEMRKQRRSLEQQLRDAIHEDDITSKLVT 229
>gnl|CDD|116585 pfam07975, C1_4, TFIIH C1-like domain. The carboxyl-terminal
region of TFIIH is essential for transcription activity.
This regions binds three zinc atoms through two
independent domain. The first contains a C4 zinc finger
motif, whereas the second is characterized by a
CX(2)CX(2-4)FCADCD motif. The solution structure of the
second C-terminal domain revealed homology with the
regulatory domain of protein kinase C (pfam00130).
Length = 48
Score = 24.6 bits (54), Expect = 9.5
Identities = 11/33 (33%), Positives = 14/33 (42%), Gaps = 8/33 (24%)
Query: 207 TCPSCKVKRKDAVL--------TKCFHVFCWDC 231
C C+ K D + KC +VFC DC
Sbjct: 1 FCYGCQQKLPDGINKTSSVYRCPKCKNVFCLDC 33
>gnl|CDD|233062 TIGR00630, uvra, excinuclease ABC, A subunit. This family is a
member of the ABC transporter superfamily of proteins of
which all members for which functions are known except
the UvrA proteins are involved in the transport of
material through membranes. UvrA orthologs are involved
in the recognition of DNA damage as a step in nucleotide
excision repair. This family is based on the
phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
Stanford University) [DNA metabolism, DNA replication,
recombination, and repair].
Length = 925
Score = 27.3 bits (61), Expect = 9.7
Identities = 9/26 (34%), Positives = 12/26 (46%)
Query: 190 TLDEVMMEEIREYKETLTCPSCKVKR 215
T E M E + ++ CPSC R
Sbjct: 390 TESESMREYLEKFMSERPCPSCGGTR 415
>gnl|CDD|222709 pfam14362, DUF4407, Domain of unknown function (DUF4407). This
family of proteins is found in bacteria. Proteins in
this family are typically between 366 and 597 amino
acids in length. There is a single completely conserved
residue R that may be functionally important.
Length = 297
Score = 27.2 bits (61), Expect = 10.0
Identities = 14/58 (24%), Positives = 21/58 (36%), Gaps = 4/58 (6%)
Query: 137 LEKYHAQLKEAQ-QVVTEKTSSLEAEAYKTKRLQEEIAQLRRKTERMKKI---EQAGT 190
LE + AQ QV + A + LQ EI + + + + E GT
Sbjct: 115 LEIQQEEQAAAQAQVAAGFRPKIAALTAEIAALQAEIDEAQAEVNAAYQEAQCEAEGT 172
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.315 0.127 0.348
Gapped
Lambda K H
0.267 0.0677 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 12,850,290
Number of extensions: 1222711
Number of successful extensions: 3351
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2924
Number of HSP's successfully gapped: 620
Length of query: 262
Length of database: 10,937,602
Length adjustment: 95
Effective length of query: 167
Effective length of database: 6,723,972
Effective search space: 1122903324
Effective search space used: 1122903324
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 58 (26.2 bits)