RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy6276
(271 letters)
>gnl|CDD|240217 cd05791, S1_CSL4, S1_CSL4: CSL4, S1-like RNA-binding domain.
S1-like RNA-binding domains are found in a wide variety
of RNA-associated proteins. ScCSL4 protein is a subunit
of the exosome complex. The exosome plays a central role
in 3' to 5' RNA processing and degradation in eukarytes
and archaea. Its functions include the removal of
incorrectly processed RNA and the maintenance of proper
levels of mRNA, rRNA and a number of small RNA species.
In S. cerevisiae, the exosome includes nine core
components, six of which are homologous to bacterial
RNase PH. These form a hexameric ring structure. The
other three subunits (RrP4, Rrp40, and Csl4) contain an
S1 RNA binding domain and are part of the "S1 pore
structure".
Length = 92
Score = 118 bits (299), Expect = 3e-34
Identities = 43/92 (46%), Positives = 64/92 (69%), Gaps = 1/92 (1%)
Query: 65 VIVPTPGDVVTARIMAVQQHLCKAHVISVGNTALSRTFRALLRRENVRATEIDRIEMYKC 124
+++P G +V AR+ + K ++ VG L +FR ++R+E++RATE D++EMYKC
Sbjct: 1 LVLPKVGSIVIARVTRINPRFAKVDILCVGGRPLKESFRGVIRKEDIRATEKDKVEMYKC 60
Query: 125 YRPGDIILAR-LPLKELHSYQLSTAENELGVV 155
+RPGDI+ A+ + L + SY LSTAENELGVV
Sbjct: 61 FRPGDIVRAKVISLGDASSYYLSTAENELGVV 92
>gnl|CDD|192591 pfam10447, EXOSC1, Exosome component EXOSC1/CSL4. This family of
proteins are components of the exosome 3'->5'
exoribonuclease complex. The exosome mediates
degradation of unstable mRNAs that contain AU-rich
elements (AREs) within their 3' untranslated regions.
Length = 72
Score = 96.5 bits (241), Expect = 8e-26
Identities = 30/68 (44%), Positives = 49/68 (72%)
Query: 67 VPTPGDVVTARIMAVQQHLCKAHVISVGNTALSRTFRALLRRENVRATEIDRIEMYKCYR 126
+P GD+V AR+ V +++VG TAL TF+ ++R+++VRATE DR+++Y+C+R
Sbjct: 1 LPKVGDIVLARVTRVTPRQAYCEILAVGGTALKETFKGIIRKQDVRATEKDRVKIYECFR 60
Query: 127 PGDIILAR 134
PGDI+ A+
Sbjct: 61 PGDIVRAK 68
>gnl|CDD|224021 COG1096, COG1096, Predicted RNA-binding protein (consists of S1
domain and a Zn-ribbon domain) [Translation, ribosomal
structure and biogenesis].
Length = 188
Score = 79.3 bits (196), Expect = 4e-18
Identities = 38/139 (27%), Positives = 63/139 (45%), Gaps = 11/139 (7%)
Query: 23 GRGTYVRQGYLYSTLAGTVSINKHVEDNKEVSVIEVSSYKKEVIVPTPGDVVTARIMAVQ 82
G GTY G + + G V + N+ +SV KK +P GD+V R+ V+
Sbjct: 23 GEGTYEEGGEIRAAATGVVRRDD---KNRVISVK---PGKKTPPLPKGGDIVYGRVTDVR 76
Query: 83 QHLCKAHVISV--GNTALSRTFRALLRRENVRATEIDRIEMYKCYRPGDIILARLPLKEL 140
+ ++ V L+ + A + VR ++++ +R GDI+ AR+ +
Sbjct: 77 EQRALVRIVGVEGKERELATSGAADIHVSQVRDGYVEKLS--DAFRIGDIVRARV-ISTG 133
Query: 141 HSYQLSTAENELGVVIALS 159
QLST N+LGV+ A
Sbjct: 134 DPIQLSTKGNDLGVIYARC 152
>gnl|CDD|236547 PRK09521, PRK09521, exosome complex RNA-binding protein Csl4;
Provisional.
Length = 189
Score = 67.3 bits (165), Expect = 1e-13
Identities = 38/152 (25%), Positives = 75/152 (49%), Gaps = 12/152 (7%)
Query: 9 PGQRICQLDSDIVGGRGTYVRQGYLYSTLAGTVSINKHVEDNKEVSVIEVSSYKKEVIVP 68
PG + ++ + + G GTY G +Y+++ G V I+ N+++SVI +KK +
Sbjct: 10 PGDYLAVIE-EYLPGEGTYEDNGEVYASVVGKVFIDD---INRKISVI---PFKKTPPLL 62
Query: 69 TPGDVVTARIMAVQQHLCKAHVISV--GNTALSRTFRALLRRENVRATEIDRIEMYKCYR 126
GD+V R++ V++ ++S+ L+ + A + V ++ + ++
Sbjct: 63 KKGDIVYGRVVDVKEQRALVRIVSIEGSERELATSKLAYIHISQVSDGYVE--SLTDAFK 120
Query: 127 PGDIILARLPLKELHSYQLSTAENELGVVIAL 158
GDI+ A+ + QLST +LGV+ A+
Sbjct: 121 IGDIVRAK-VISYTDPLQLSTKGKDLGVIYAM 151
>gnl|CDD|206550 pfam14382, ECR1_N, Exosome complex exonuclease RRP4 N-terminal
region. ECR1_N is an N-terminal region of the exosome
complex exonuclease RRP proteins. It is a G-rich domain
which structurally is a rudimentary single hybrid fold
with a permuted topology.
Length = 39
Score = 55.9 bits (136), Expect = 5e-11
Identities = 17/40 (42%), Positives = 26/40 (65%), Gaps = 1/40 (2%)
Query: 6 ICIPGQRICQLDSDIVGGRGTYVRQGYLYSTLAGTVSINK 45
I +PG+R+ S+ + G GTYV G +Y+++AG VSI
Sbjct: 1 IVVPGERLGT-ASEYMPGHGTYVEDGNIYASVAGVVSIVN 39
>gnl|CDD|239901 cd04454, S1_Rrp4_like, S1_Rrp4_like: Rrp4-like, S1-like RNA-binding
domain. S1-like RNA-binding domains are found in a wide
variety of RNA-associated proteins. Rrp4 protein, and
Rrp40 and Csl4 proteins, also represented in this group,
are subunits of the exosome complex. The exosome plays a
central role in 3' to 5' RNA processing and degradation
in eukarytes and archaea. Its functions include the
removal of incorrectly processed RNA and the maintenance
of proper levels of mRNA, rRNA and a number of small RNA
species. In Saccharomyces cerevisiae, the exosome
includes nine core components, six of which are
homologous to bacterial RNase PH. These form a hexameric
ring structure. The other three subunits (RrP4, Rrp40,
and Csl4) contain an S1 RNA binding domain and are part
of the "S1 pore structure".
Length = 82
Score = 52.9 bits (127), Expect = 2e-09
Identities = 31/90 (34%), Positives = 48/90 (53%), Gaps = 11/90 (12%)
Query: 67 VPTPGDVVTARIMAVQQHLCKAHVISVGNTALSRTFRALLRRENVRATEIDRIEMYKCYR 126
+P GD+V + V K ++S G L E+ ATE D+ E+ K +
Sbjct: 3 LPDVGDIVIGIVTEVNSRFWKVDILSRGTARL----------EDSSATEKDKKEIRKSLQ 52
Query: 127 PGDIILAR-LPLKELHSYQLSTAENELGVV 155
PGD+ILA+ + L + + L+TA+NELGV+
Sbjct: 53 PGDLILAKVISLGDDMNVLLTTADNELGVI 82
>gnl|CDD|224022 COG1097, RRP4, RNA-binding protein Rrp4 and related proteins
(contain S1 domain and KH domain) [Translation,
ribosomal structure and biogenesis].
Length = 239
Score = 49.6 bits (119), Expect = 3e-07
Identities = 33/136 (24%), Positives = 55/136 (40%), Gaps = 22/136 (16%)
Query: 2 SNPPICIPGQRICQLDSDIVGGRGTYVRQGYLYSTLAGTVSINKHVEDNKEVSVIEVSS- 60
S I +PG + + G GTY G +YS++ G + + K V VI +
Sbjct: 6 SMRKIVLPGDLVLA-EGSYKLGHGTYFEGGKIYSSVVGLLDV-----KGKLVRVIPLEGR 59
Query: 61 YKKEVIVPTPGDVVTARIMAVQQHLCKAHVISVGNTALSRTFRALLRRENV--RATEIDR 118
Y +P GDVV +I+ V + +G+ + ALL + R E
Sbjct: 60 Y-----IPEVGDVVIGKIIEVGP---SGWKVDIGS-----PYPALLSLSDFLRRKFENAE 106
Query: 119 IEMYKCYRPGDIILAR 134
++ GD++ A+
Sbjct: 107 KDLRPFLNVGDLVYAK 122
>gnl|CDD|235233 PRK04163, PRK04163, exosome complex RNA-binding protein Rrp4;
Provisional.
Length = 235
Score = 41.0 bits (97), Expect = 3e-04
Identities = 33/135 (24%), Positives = 54/135 (40%), Gaps = 29/135 (21%)
Query: 6 ICIPGQRICQLDSDIVGGRGTYVRQGYLYSTLAGTVSINKHVEDNKEVSVIEVSSYKKEV 65
I +PG + + + GRGTY G +YST+ G V I + +V VI +
Sbjct: 10 IVVPGDLLAE--GEFKAGRGTYKENGKIYSTVVGLVDI-----KDDKVRVIPLEG----K 58
Query: 66 IVPTPGDVVTARIMAVQQHLCKAHVISVGNTALS----RTFRALLRRENV--RATEIDRI 119
+P GD+V + V V + ++A L V R ++
Sbjct: 59 YIPKVGDLVIGK------------VTDVTFSGWEVDINSPYKAYLPVSEVLGRPVNVEGT 106
Query: 120 EMYKCYRPGDIILAR 134
++ K GD I+A+
Sbjct: 107 DLRKYLDIGDYIIAK 121
>gnl|CDD|234501 TIGR04211, SH3_and_anchor, SH3 domain protein. Members of this
protein family have a signal peptide, a strongly
conserved SH3 domain, a variable region, and then a
C-terminal hydrophobic transmembrane alpha helix region.
Length = 198
Score = 39.6 bits (93), Expect = 6e-04
Identities = 27/93 (29%), Positives = 50/93 (53%)
Query: 178 LKQEKLKLYEELSSSQKENEILRQTTGDLEVRLMQMNEVKGEMEFKIKELNKKNQSESKL 237
L+QE +L EEL+ Q++ L+Q +L+ L + ++ ++ + + + + +L
Sbjct: 71 LQQELAELQEELAELQEQLAELQQENQELKQELSTLEAELERLQKELARIKQLSANAIEL 130
Query: 238 SDTIRELNEENIGLKQKVELQEAENQLIQTTKN 270
+ REL EE LKQ+ E EAEN+ +Q +
Sbjct: 131 DEENRELREELAELKQENEALEAENERLQENEQ 163
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
chromosome partitioning].
Length = 1163
Score = 38.5 bits (90), Expect = 0.004
Identities = 35/130 (26%), Positives = 67/130 (51%), Gaps = 3/130 (2%)
Query: 138 KELHSYQLSTAENELGVVIALSEAASLFRSSEKLTNDIEFLKQEKLKLYEELSSSQKENE 197
+EL + E E + E SL + E+L +IE L++E +L E+L ++E E
Sbjct: 793 EELEELEEELEEAERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELE 852
Query: 198 ILRQTTGDLEVRLMQMNEVKGEMEFKIKELNKKNQSESKLSDTIRELNEENIGLKQKVEL 257
L + +L+ L ++ K E+E ++KEL ++ + +L + +REL E LK+++E
Sbjct: 853 ELEKELEELKEELEELEAEKEELEDELKELEEE---KEELEEELRELESELAELKEEIEK 909
Query: 258 QEAENQLIQT 267
+ ++
Sbjct: 910 LRERLEELEA 919
Score = 34.7 bits (80), Expect = 0.047
Identities = 27/97 (27%), Positives = 52/97 (53%), Gaps = 4/97 (4%)
Query: 169 EKLTNDIEFLKQEKLKLYEELSSSQKENEILRQTTGDLEVRLMQMNEVKGEMEFKIKELN 228
E+L ++E L++ KL EE+ +++ + L++ +LE ++ E + ++ +EL
Sbjct: 761 EELEEELESLEEALAKLKEEIEELEEKRQALQEELEELE---EELEEAERRLDALERELE 817
Query: 229 KKNQSESKLSDTIRELNEENIGLKQKV-ELQEAENQL 264
Q +L I EL EE L++K+ EL+E +L
Sbjct: 818 SLEQRRERLEQEIEELEEEIEELEEKLDELEEELEEL 854
Score = 31.2 bits (71), Expect = 0.63
Identities = 28/114 (24%), Positives = 56/114 (49%), Gaps = 11/114 (9%)
Query: 169 EKLTNDIEFLKQEKLKLYEELSSSQKENEILRQTTGDLEVRLMQMNEVKGEMEFKIKELN 228
E+L ++ L++E +L EEL ++KE E L+ +L L ++ E E++ +I+EL
Sbjct: 242 EELEEELSRLEEELEELQEELEEAEKEIEELKSELEELREELEELQEELLELKEEIEELE 301
Query: 229 KK-----------NQSESKLSDTIRELNEENIGLKQKVELQEAENQLIQTTKNE 271
+ +L + + EL E+ LK+++E +E + ++ E
Sbjct: 302 GEISLLRERLEELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLAE 355
Score = 30.8 bits (70), Expect = 0.79
Identities = 32/146 (21%), Positives = 66/146 (45%), Gaps = 7/146 (4%)
Query: 130 IILARLPLKELHSYQLSTAENELGVVIALSEAASLFRSSEKLTNDIEFLKQEKLKLYEEL 189
+ + LKEL +L+ E +L E SL L + +E L+++ +L +L
Sbjct: 662 SLAQKRELKEL-EEELAELEAQLE--KLEEELKSLKNELRSLEDLLEELRRQLEELERQL 718
Query: 190 SSSQKENEILRQTTGDLEVRL----MQMNEVKGEMEFKIKELNKKNQSESKLSDTIRELN 245
++E L + L+ RL ++ E++ E+E + L + + L + + +L
Sbjct: 719 EELKRELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLEEALAKLK 778
Query: 246 EENIGLKQKVELQEAENQLIQTTKNE 271
EE L++K + + E + ++ E
Sbjct: 779 EEIEELEEKRQALQEELEELEEELEE 804
Score = 29.7 bits (67), Expect = 2.1
Identities = 26/95 (27%), Positives = 51/95 (53%), Gaps = 3/95 (3%)
Query: 174 DIEFLKQEKLKLYEELSSSQKENEILRQTTGDLEVRLMQMNEVKGEMEFKIKELNKKNQS 233
++ L++E +L EELS ++E L + +LE ++ E+K E+E +EL + +
Sbjct: 233 KLKELRKELEELEEELSRLEEE---LEELQEELEEAEKEIEELKSELEELREELEELQEE 289
Query: 234 ESKLSDTIRELNEENIGLKQKVELQEAENQLIQTT 268
+L + I EL E L++++E E E + ++
Sbjct: 290 LLELKEEIEELEGEISLLRERLEELENELEELEER 324
Score = 28.5 bits (64), Expect = 5.2
Identities = 29/121 (23%), Positives = 53/121 (43%), Gaps = 8/121 (6%)
Query: 137 LKELHSYQLSTAENELGVVIALSEAASLFRSSEKLTNDIEFLKQ-------EKLKLYEEL 189
+ + E E + + L E+L ++E LK+ EK +L +EL
Sbjct: 820 EQRRERLEQEIEELEEEIEELEEKLDELEEELEELEKELEELKEELEELEAEKEELEDEL 879
Query: 190 SSSQKENEILRQTTGDLEVRLMQMNEVKGEMEFKIKELNKKNQS-ESKLSDTIRELNEEN 248
++E E L + +LE L ++ E ++ +++EL K + E +L + EL EE
Sbjct: 880 KELEEEKEELEEELRELESELAELKEEIEKLRERLEELEAKLERLEVELPELEEELEEEY 939
Query: 249 I 249
Sbjct: 940 E 940
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
primarily archaeal type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. It is found
in a single copy and is homodimeric in prokaryotes, but
six paralogs (excluded from this family) are found in
eukarotes, where SMC proteins are heterodimeric. This
family represents the SMC protein of archaea and a few
bacteria (Aquifex, Synechocystis, etc); the SMC of other
bacteria is described by TIGR02168. The N- and
C-terminal domains of this protein are well conserved,
but the central hinge region is skewed in composition
and highly divergent [Cellular processes, Cell division,
DNA metabolism, Chromosome-associated proteins].
Length = 1164
Score = 36.2 bits (84), Expect = 0.017
Identities = 28/131 (21%), Positives = 62/131 (47%), Gaps = 7/131 (5%)
Query: 144 QLSTAENELGVVIALSEAASLFRSSEKLTNDIEFLKQEKLKLYEELSSSQKENEILRQTT 203
+L AE L +E L E+L +IE ++ + KL EE + ++E E LR
Sbjct: 316 ELEDAEERL--AKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLRAEL 373
Query: 204 GDLEVRLM----QMNEVKGEMEFKIKELNKKNQSESKLSDTIRELNEENIGLKQKVE-LQ 258
+++ ++ + + ++E +E+N+ + +L + ++ L+EE L + ++
Sbjct: 374 EEVDKEFAETRDELKDYREKLEKLKREINELKRELDRLQEELQRLSEELADLNAAIAGIE 433
Query: 259 EAENQLIQTTK 269
N+L + +
Sbjct: 434 AKINELEEEKE 444
Score = 32.7 bits (75), Expect = 0.23
Identities = 26/143 (18%), Positives = 64/143 (44%), Gaps = 4/143 (2%)
Query: 133 ARLPLKELHSYQLSTAENELGVVIALSEAASLFRSSEKLTNDIEFLKQEKLKLYEELSSS 192
AR+ E ++L A N+L ++ S + KL ++ ++ ++ ++L+
Sbjct: 765 ARIEELEEDLHKLEEALNDLEARLSHSRIPEIQAELSKLEEEVSRIEARLREIEQKLNRL 824
Query: 193 QKENEILRQTTGDLEVRLM----QMNEVKGEMEFKIKELNKKNQSESKLSDTIRELNEEN 248
E E L + +L+ + + Q+ ++ E+E + + + +L +R+L
Sbjct: 825 TLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRL 884
Query: 249 IGLKQKVELQEAENQLIQTTKNE 271
LK++ + EA+ + ++ E
Sbjct: 885 GDLKKERDELEAQLRELERKIEE 907
Score = 31.6 bits (72), Expect = 0.57
Identities = 27/120 (22%), Positives = 50/120 (41%), Gaps = 11/120 (9%)
Query: 160 EAASLFRSSEKLTNDIEFLKQEKLKLYEELSSSQKENEILRQTTGDLEVRLMQMNEVKGE 219
E EKL +I LK+E +L EEL +E L +E ++ ++ E K +
Sbjct: 386 ELKDYREKLEKLKREINELKRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEKED 445
Query: 220 MEFKIKE-----------LNKKNQSESKLSDTIRELNEENIGLKQKVELQEAENQLIQTT 268
+IK+ L+K Q L + + +E L++++ EA+ + +
Sbjct: 446 KALEIKKQEWKLEQLAADLSKYEQELYDLKEEYDRVEKELSKLQRELAEAEAQARASEER 505
Score = 30.0 bits (68), Expect = 1.5
Identities = 28/106 (26%), Positives = 53/106 (50%), Gaps = 2/106 (1%)
Query: 157 ALSEAASLFRSSEKLTNDIEFLKQEKLKLYEELSSSQKENEILRQTTGDLEVRLMQMNEV 216
E + R ++ +IE L+QE+ KL E L +++ L Q +++ L ++
Sbjct: 707 LSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEELEEDLSSLEQEIENVKSELKELEAR 766
Query: 217 KGEMEFKIKELNKK-NQSESKLSDT-IRELNEENIGLKQKVELQEA 260
E+E + +L + N E++LS + I E+ E L+++V EA
Sbjct: 767 IEELEEDLHKLEEALNDLEARLSHSRIPEIQAELSKLEEEVSRIEA 812
Score = 28.9 bits (65), Expect = 3.9
Identities = 28/120 (23%), Positives = 58/120 (48%), Gaps = 8/120 (6%)
Query: 153 GVVIALSEAASLFRSSEK---LTNDIEFLKQEKLKLYEELSSSQKENEILRQTTGDLEVR 209
G++ + SE A L R E+ L ++ L+ E ++ L +E + G++E
Sbjct: 665 GILFSRSEPAELQRLRERLEGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKE 724
Query: 210 LMQMNEVKGEMEFKIKELNKKNQS-ESKLSDTIREL--NEENIGLKQ--KVELQEAENQL 264
+ Q+ + + +++ +++EL + S E ++ + EL E I + +L+EA N L
Sbjct: 725 IEQLEQEEEKLKERLEELEEDLSSLEQEIENVKSELKELEARIEELEEDLHKLEEALNDL 784
Score = 28.1 bits (63), Expect = 5.7
Identities = 30/125 (24%), Positives = 47/125 (37%), Gaps = 22/125 (17%)
Query: 169 EKLTNDIEFLKQEKLKLYEELSSSQKENEILRQTTGDLEVRLMQ----MNEVKGEMEFKI 224
+KLT + LK+E L EL KE R D +L + +NE+K E++
Sbjct: 353 DKLTEEYAELKEELEDLRAELEEVDKEFAETRDELKDYREKLEKLKREINELKRELDRLQ 412
Query: 225 KELN------------------KKNQSESKLSDTIRELNEENIGLKQKVELQEAENQLIQ 266
+EL K N+ E + D E+ ++ L+Q Q +
Sbjct: 413 EELQRLSEELADLNAAIAGIEAKINELEEEKEDKALEIKKQEWKLEQLAADLSKYEQELY 472
Query: 267 TTKNE 271
K E
Sbjct: 473 DLKEE 477
>gnl|CDD|227141 COG4804, COG4804, Predicted nuclease of restriction
endonuclease-like fold [General function prediction
only].
Length = 159
Score = 33.6 bits (77), Expect = 0.049
Identities = 22/109 (20%), Positives = 39/109 (35%), Gaps = 10/109 (9%)
Query: 163 SLFRSSEKLTNDIEFLKQEKLKLYEELSSSQKENEILRQTTGDLEVRLMQMNEVKGEMEF 222
S ++ + N + L E L S Q E RQ L R + +
Sbjct: 2 SHYKLLLSVKNLDKRLFYEIETLRNRWSVRQLE----RQIASQLYERTL-----LSHDKE 52
Query: 223 KIKELNKKNQSESKLSDTIRE-LNEENIGLKQKVELQEAENQLIQTTKN 270
K+K+ + + + + I + E + LK+ + E LIQ +
Sbjct: 53 KVKQQSLQIEDDLTPEQIIMDPYVLEFLSLKRNYLEFDLEYALIQILQQ 101
>gnl|CDD|219947 pfam08639, SLD3, DNA replication regulator SLD3. The SLD3 DNA
replication regulator is required for loading and
maintenance of Cdc45 on chromatin during DNA
replication.
Length = 437
Score = 32.9 bits (75), Expect = 0.16
Identities = 25/111 (22%), Positives = 49/111 (44%), Gaps = 2/111 (1%)
Query: 159 SEAASLFRSSEKLTNDIEFLKQEKLKLYEELSSSQKENEILRQTTGDLEVRLMQMNEVKG 218
S S ++ IEFLK+E + SS+ +++ L+ R + +++
Sbjct: 289 SRRTPSLLRSRTNSSLIEFLKRESSENLLPSLSSRTSSDL--LKNKRLQKRQVDLSDSSR 346
Query: 219 EMEFKIKELNKKNQSESKLSDTIRELNEENIGLKQKVELQEAENQLIQTTK 269
+ E K+K+ N+ + +L I L + N L K ++ AE + Q +
Sbjct: 347 QHEEKLKKKQMLNEQKKELKRAISALKKSNRELSSKDIVETAEKRSSQFGQ 397
>gnl|CDD|237970 PRK15453, PRK15453, phosphoribulokinase; Provisional.
Length = 290
Score = 32.5 bits (75), Expect = 0.18
Identities = 14/46 (30%), Positives = 20/46 (43%), Gaps = 14/46 (30%)
Query: 90 VISV------GNTALSRTFRALLRRENVRATEI--------DRIEM 121
+I+V G T + R F + RREN+ A + R EM
Sbjct: 7 IIAVTGSSGAGTTTVKRAFEKIFRRENINAAVVEGDSFHRYTRPEM 52
>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
bacterial type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. This family
represents the SMC protein of most bacteria. The smc
gene is often associated with scpB (TIGR00281) and scpA
genes, where scp stands for segregation and condensation
protein. SMC was shown (in Caulobacter crescentus) to be
induced early in S phase but present and bound to DNA
throughout the cell cycle [Cellular processes, Cell
division, DNA metabolism, Chromosome-associated
proteins].
Length = 1179
Score = 33.1 bits (76), Expect = 0.19
Identities = 31/134 (23%), Positives = 62/134 (46%), Gaps = 7/134 (5%)
Query: 138 KELHSYQLSTAENELGVVIALSEAASLFRSSEKLTNDIEFLKQEKLKLYEELSSSQKENE 197
KEL++ + E I A+L R E+L +E L+ + +L EEL+ +++ E
Sbjct: 288 KELYALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLE 347
Query: 198 ILRQTTGDLEVRLMQMNEVKGEMEFKIKELNKK-NQSESKLSDTIRELNEENIGLKQKVE 256
L++ LE L ++ E+E +++EL ++ SK++ ++ N E
Sbjct: 348 ELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNN------E 401
Query: 257 LQEAENQLIQTTKN 270
++ E +L +
Sbjct: 402 IERLEARLERLEDR 415
Score = 30.4 bits (69), Expect = 1.0
Identities = 30/110 (27%), Positives = 50/110 (45%), Gaps = 11/110 (10%)
Query: 159 SEAASLFRSSEKLTNDIEFLKQEKLKLYEELSSSQKENEILRQTTGDLEVRLM--QMNEV 216
S+ A L L N+IE L+ L + E L+Q +L +L ++ E+
Sbjct: 386 SKVAQLELQIASLNNEIERLE-------ARLERLEDRRERLQQEIEELLKKLEEAELKEL 438
Query: 217 KGEMEFKIKELNKKNQSESKLSDTIRELNEENIGLKQKVELQEAENQLIQ 266
+ E+E +EL + + +L + + EL EE +Q L AE +L Q
Sbjct: 439 QAELEELEEELEELQEELERLEEALEELREELEEAEQ--ALDAAERELAQ 486
Score = 30.0 bits (68), Expect = 1.6
Identities = 34/128 (26%), Positives = 55/128 (42%), Gaps = 14/128 (10%)
Query: 138 KELHSYQLSTAENELGVVIALSEAASLFRSSEKLTNDIEFLKQEKLKLYEELSSSQKENE 197
+EL Q E E E L ++L +E L+ E +L EE+ QKE
Sbjct: 239 EELEELQEELKEAE-------EELEELTAELQELEEKLEELRLEVSELEEEIEELQKELY 291
Query: 198 ILRQTTGDLEVRLMQMNEVKGEMEFKIKELNKK-NQSESKLSDTIRELNEENIGLKQKVE 256
L LE + + E +E +++EL + + ESKL + EL E + +
Sbjct: 292 ALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAE------LEEK 345
Query: 257 LQEAENQL 264
L+E + +L
Sbjct: 346 LEELKEEL 353
Score = 29.6 bits (67), Expect = 1.9
Identities = 28/105 (26%), Positives = 56/105 (53%), Gaps = 4/105 (3%)
Query: 171 LTNDIEFLKQEKLKLYEELSSSQKENEI----LRQTTGDLEVRLMQMNEVKGEMEFKIKE 226
L +E L++E +L EEL +++E E L++ LE ++++E++ E+E KE
Sbjct: 230 LVLRLEELREELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKE 289
Query: 227 LNKKNQSESKLSDTIRELNEENIGLKQKVELQEAENQLIQTTKNE 271
L S+L + L E L++++E EA+ + +++ +E
Sbjct: 290 LYALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDE 334
Score = 29.3 bits (66), Expect = 2.8
Identities = 28/133 (21%), Positives = 49/133 (36%), Gaps = 6/133 (4%)
Query: 138 KELHSYQLSTAENELGVVIALSEAASLFRSSEKLTNDIEFLKQEKLKLYEELSSSQKENE 197
EL A + A+ R E L IE L ++ L E+ ++ E
Sbjct: 810 AELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIESLAAEIEELEELIE 869
Query: 198 ILRQTTGDLEVRLMQMNE----VKGEMEFKIKELNKKNQSESKLSDTIRELNEENIGLKQ 253
L L + E ++ E+E +EL + S+L + EL E+ L+
Sbjct: 870 ELESELEALLNERASLEEALALLRSELEELSEELRELESKRSELRRELEELREKLAQLEL 929
Query: 254 KVELQEAENQLIQ 266
+ L+ E ++
Sbjct: 930 R--LEGLEVRIDN 940
Score = 28.9 bits (65), Expect = 3.4
Identities = 23/106 (21%), Positives = 49/106 (46%), Gaps = 3/106 (2%)
Query: 166 RSSEKLTNDIEFLKQEKLKLYEELSSSQKENEILRQTTGDLEVRLMQMNEVKGEMEFKIK 225
R E+L IE L+++ +L + L+ +KE E L + L L +++ + +
Sbjct: 677 REIEELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLA 736
Query: 226 ELNKKNQSESKLSDTIRELNEENIGLKQKVELQEAENQLIQTTKNE 271
L + + +L + I +L++E L+ ++E E + + E
Sbjct: 737 RLEAEVE---QLEERIAQLSKELTELEAEIEELEERLEEAEEELAE 779
Score = 28.1 bits (63), Expect = 6.2
Identities = 22/115 (19%), Positives = 50/115 (43%), Gaps = 3/115 (2%)
Query: 157 ALSEAASLFRSSEKLTNDIEFLKQEKLKLYEELSSSQKENEILRQTTGDLEVRLMQMNEV 216
E +L + ++L ++ L +E L E L S ++ + DLE ++ +++E
Sbjct: 794 LKEELKALREALDELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSE- 852
Query: 217 KGEMEFKIKELNKKNQSESKLSDTIRELNEENIGLKQKVELQEAENQLIQTTKNE 271
++E E+ + + +L + L E L++ + L +E + + E
Sbjct: 853 --DIESLAAEIEELEELIEELESELEALLNERASLEEALALLRSELEELSEELRE 905
>gnl|CDD|219934 pfam08614, ATG16, Autophagy protein 16 (ATG16). Autophagy is a
ubiquitous intracellular degradation system for
eukaryotic cells. During autophagy, cytoplasmic
components are enclosed in autophagosomes and delivered
to lysosomes/vacuoles. ATG16 (also known as Apg16) has
been shown to be bind to Apg5 and is required for the
function of the Apg12p-Apg5p conjugate in the yeast
autophagy pathway.
Length = 194
Score = 32.1 bits (73), Expect = 0.19
Identities = 29/114 (25%), Positives = 45/114 (39%), Gaps = 3/114 (2%)
Query: 157 ALSEAASLFRSSEKLTNDIEFLKQEKLKLYEELSSSQKENEILRQTTGDLEVRLMQMNEV 216
S ++S I ++Q+ KL EEL+ K+ L Q L L Q+
Sbjct: 51 QSSHSSSPSADGPGSDAAIAEMEQKLAKLREELTELHKKRGELAQRLLLLNDELEQLRRE 110
Query: 217 KGEMEFKIKELNKKNQSESKLSDTIRELNEENIGLKQKVELQEAENQLIQTTKN 270
++E I EL + S L IR+L EE ++ E + E + N
Sbjct: 111 IQQLEKTIAELRSEITS---LETEIRDLREELQEKEKDNETLQDELISLNIELN 161
Score = 29.4 bits (66), Expect = 1.4
Identities = 26/104 (25%), Positives = 42/104 (40%), Gaps = 10/104 (9%)
Query: 160 EAASLFRSSEKLTNDIEFLKQEKLKLYEELSSSQKENEILRQTTGDLEVRLMQMNEVKGE 219
E L + +L + L E +L E+ +K LR LE + + E
Sbjct: 82 ELTELHKKRGELAQRLLLLNDELEQLRREIQQLEKTIAELRSEITSLETEIRDLREE--- 138
Query: 220 MEFKIKELNKKNQSESKLSDTIRELNEENIGLKQKVELQEAENQ 263
++E K N+ L D + LN E L++K+ + ENQ
Sbjct: 139 ----LQEKEKDNE---TLQDELISLNIELNALEEKLRKLQKENQ 175
>gnl|CDD|227352 COG5019, CDC3, Septin family protein [Cell division and chromosome
partitioning / Cytoskeleton].
Length = 373
Score = 31.9 bits (73), Expect = 0.32
Identities = 19/101 (18%), Positives = 45/101 (44%), Gaps = 8/101 (7%)
Query: 172 TNDIEFLKQEKLKLYEELSSSQKENEILRQTTGDLEVRLMQMNEVKGEMEFKIKELNKKN 231
T ++ + KL +S + + + + + E R ++ + KI+E K+
Sbjct: 279 TENLLYENYRTEKLSGLKNSGEPSLKEIHEARLNEEERELKKKFTE-----KIREKEKRL 333
Query: 232 -QSESKLSDTIRELNEENIGLKQKVELQEAENQLIQTTKNE 271
+ E L + +ELN + + + +L++ E +L + N+
Sbjct: 334 EELEQNLIEERKELNSKL--EEIQKKLEDLEKRLEKLKSNK 372
>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y. Members of this family are
RNase Y, an endoribonuclease. The member from Bacillus
subtilis, YmdA, has been shown to be involved in
turnover of yitJ riboswitch [Transcription, Degradation
of RNA].
Length = 514
Score = 31.8 bits (73), Expect = 0.33
Identities = 30/107 (28%), Positives = 50/107 (46%), Gaps = 17/107 (15%)
Query: 179 KQEKLKLYEELSSSQKENEILRQTTGDLEVRLMQMNEV-KGEMEFKIKELNKKNQSESKL 237
K+E KL EL ++E + R LE RL+Q E +ME K+ + E +L
Sbjct: 57 KEEVHKLRAEL---ERELKERRNELQRLERRLLQREETLDRKMESLDKKEENLEKKEKEL 113
Query: 238 SDTIRELNE-----ENIGLKQKVELQ--------EAENQLIQTTKNE 271
S+ + L+E E + +Q+ EL+ EA+ L++ + E
Sbjct: 114 SNKEKNLDEKEEELEELIAEQREELERISGLTQEEAKEILLEEVEEE 160
>gnl|CDD|240215 cd05789, S1_Rrp4, S1_Rrp4: Rrp4 S1-like RNA-binding domain. S1-like
RNA-binding domains are found in a wide variety of
RNA-associated proteins. Rrp4 protein is a subunit of
the exosome complex. The exosome plays a central role in
3' to 5' RNA processing and degradation in eukarytes and
archaea. Its functions include the removal of
incorrectly processed RNA and the maintenance of proper
levels of mRNA, rRNA and a number of small RNA species.
In Saccharomyces cerevisiae, the exosome includes nine
core components, six of which are homologous to
bacterial RNase PH. These form a hexameric ring
structure. The other three subunits (RrP4, Rrp40, and
Csl4) contain an S1 RNA binding domain and are part of
the "S1 pore structure".
Length = 86
Score = 29.8 bits (68), Expect = 0.38
Identities = 18/70 (25%), Positives = 28/70 (40%), Gaps = 10/70 (14%)
Query: 67 VPTPGDVVTARIMAVQQHLCKAHVISVGNTALSRTFRALLRRENV--RATEIDRIEMYKC 124
+P GDVV R+ V K + + + A+L V T+ D + M
Sbjct: 3 IPEVGDVVIGRVTEVGF---KRWKVDINSPYD-----AVLPLSEVNLPRTDEDELNMRSY 54
Query: 125 YRPGDIILAR 134
GD+I+A
Sbjct: 55 LDEGDLIVAE 64
>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
Length = 880
Score = 31.6 bits (72), Expect = 0.45
Identities = 27/94 (28%), Positives = 48/94 (51%)
Query: 163 SLFRSSEKLTNDIEFLKQEKLKLYEELSSSQKENEILRQTTGDLEVRLMQMNEVKGEMEF 222
+ +E + I+ ++E ++ E++ E LR+ LE + ++ E+K E+E
Sbjct: 183 KFIKRTENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKEVKELEELKEEIEE 242
Query: 223 KIKELNKKNQSESKLSDTIRELNEENIGLKQKVE 256
KEL S+ KL + IREL E LK+++E
Sbjct: 243 LEKELESLEGSKRKLEEKIRELEERIEELKKEIE 276
Score = 28.9 bits (65), Expect = 3.7
Identities = 22/101 (21%), Positives = 51/101 (50%), Gaps = 1/101 (0%)
Query: 169 EKLTNDIEFLKQEKLKLYEELSSSQKENEILRQTTGDLEVRLMQMNEVKGEMEFKIKELN 228
E+L +IE L+ EK+K +EL +E L + + L ++ + +E +I +
Sbjct: 269 EELKKEIEELE-EKVKELKELKEKAEEYIKLSEFYEEYLDELREIEKRLSRLEEEINGIE 327
Query: 229 KKNQSESKLSDTIRELNEENIGLKQKVELQEAENQLIQTTK 269
++ + + + + EL ++ L++++E E ++L + K
Sbjct: 328 ERIKELEEKEERLEELKKKLKELEKRLEELEERHELYEEAK 368
Score = 27.7 bits (62), Expect = 7.1
Identities = 30/125 (24%), Positives = 61/125 (48%), Gaps = 9/125 (7%)
Query: 144 QLSTAENELGVVIA-LSEAASLFRSSEKLTNDIEFLKQEKLKLYEELSSSQKENEILRQT 202
+ E EL V+ ++E +S + +E +E +L EE+ +KE E L +
Sbjct: 194 LIKEKEKELEEVLREINEISSELPELREELEKLEKEVKELEELKEEIEELEKELESLEGS 253
Query: 203 TGDLEVRLMQ----MNEVKG---EMEFKIKELNKKNQSESKLSDTIRELNEENIGLKQKV 255
LE ++ + + E+K E+E K+KEL K+ + +++ + E EE + +++
Sbjct: 254 KRKLEEKIRELEERIEELKKEIEELEEKVKEL-KELKEKAEEYIKLSEFYEEYLDELREI 312
Query: 256 ELQEA 260
E + +
Sbjct: 313 EKRLS 317
>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
Length = 520
Score = 30.9 bits (71), Expect = 0.81
Identities = 35/156 (22%), Positives = 69/156 (44%), Gaps = 21/156 (13%)
Query: 130 IILARLPLKELHSYQLSTAENELGVVIALSEAASLFRSSEKLTNDIEFLKQEKLKLYEEL 189
++ K++ ++ AE E ++ ++ + E L E + + + + +EL
Sbjct: 18 AVIGYFVRKKIAEAKIKEAEEEAKRILEEAKKEAEAIKKEALLEAKEEIHKLRNEFEKEL 77
Query: 190 SSSQKENEI------LRQTTGDLEVRLMQMNEVKGEMEFKIKELNKKNQSESKLSDTIRE 243
++ NE+ L Q +L+ +L + + + E+E K KEL +K Q K + + E
Sbjct: 78 R--ERRNELQKLEKRLLQKEENLDRKLELLEKREEELEKKEKELEQKQQELEKKEEELEE 135
Query: 244 LNEENIGLKQKVELQ--------EAENQLIQTTKNE 271
L EE Q EL+ EA+ L++ + E
Sbjct: 136 LIEE-----QLQELERISGLTAEEAKEILLEKVEEE 166
>gnl|CDD|217902 pfam04111, APG6, Autophagy protein Apg6. In yeast, 15 Apg proteins
coordinate the formation of autophagosomes. Autophagy is
a bulk degradation process induced by starvation in
eukaryotic cells. Apg6/Vps30p has two distinct functions
in the autophagic process, either associated with the
membrane or in a retrieval step of the carboxypeptidase
Y sorting pathway.
Length = 356
Score = 30.6 bits (69), Expect = 0.81
Identities = 22/97 (22%), Positives = 44/97 (45%), Gaps = 12/97 (12%)
Query: 169 EKLTNDIEFLKQEK--LKLYEELSSSQKENEILRQTTGDLEVRLMQMNEVKGEMEFKIKE 226
E+L +++ ++E+ K Y SQ + E +++E+K E E + E
Sbjct: 32 ERLDSELRDAEKERDTYKQYLSKLESQNVEI------SNYEALDSELDELKKEEERLLDE 85
Query: 227 LNKKNQSESKLSDTIRELNEENIGLKQKVELQEAENQ 263
L + + + L + EL EE K+++E +E +
Sbjct: 86 LEELEKEDDDLDGELVELQEE----KEQLENEELQYL 118
>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota). DNA
Topoisomerase I (eukaryota), DNA topoisomerase V,
Vaccina virus topoisomerase, Variola virus
topoisomerase, Shope fibroma virus topoisomeras.
Length = 391
Score = 30.8 bits (70), Expect = 0.85
Identities = 20/98 (20%), Positives = 40/98 (40%), Gaps = 11/98 (11%)
Query: 166 RSSEKLTNDIEFLKQEKLKLYEELSSSQKENEILRQTTGDLEVRLMQMNEVKGEMEFKIK 225
+S EKL I+ LK + +L + + + +++ R+ E +++ ++K
Sbjct: 277 KSMEKLQEKIKALKYQLKRLKKMILLFEMISDLKRKLKSKFE-------RDNEKLDAEVK 329
Query: 226 ELNKKNQSESKLSDTIRELNEE----NIGLKQKVELQE 259
E K+ + E K I L E + K E +
Sbjct: 330 EKKKEKKKEEKKKKQIERLEERIEKLEVQATDKEENKT 367
>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein. Function of MutS2 is
unknown. It should not be considered a DNA mismatch
repair protein. It is likely a DNA mismatch binding
protein of unknown cellular function [DNA metabolism,
Other].
Length = 771
Score = 30.9 bits (70), Expect = 0.88
Identities = 29/119 (24%), Positives = 49/119 (41%), Gaps = 13/119 (10%)
Query: 150 NELGVVIALSEAA-----SLFRSSEKLTNDIEFLKQEKLKLYEELSSSQKENEILRQTTG 204
E+ V+I A EKL + E LK+E + EEL ++ +
Sbjct: 511 EEINVLIEKLSALEKELEQKNEHLEKLLKEQEKLKKELEQEMEELKERERNKK----LEL 566
Query: 205 DLEVRLMQMNEVKGEMEFKIKELNKKN---QSESKLSDTIRELNEENIGLKQKVELQEA 260
+ E + + +K E+E I+EL +K E K + + +L E + QK +A
Sbjct: 567 EKEAQ-EALKALKKEVESIIRELKEKKIHKAKEIKSIEDLVKLKETKQKIPQKPTNFQA 624
>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
recombination, and repair].
Length = 908
Score = 30.5 bits (69), Expect = 1.1
Identities = 37/126 (29%), Positives = 59/126 (46%), Gaps = 7/126 (5%)
Query: 144 QLSTAENELGVVIALSEAA--SLFRSSEKLTNDIEFLKQEKLKL-YEELSSSQKENEILR 200
+L E EL ++ L EA L EKL N +E L++ K KL ++L ++ E
Sbjct: 509 ELRELEEELIELLELEEALKEELEEKLEKLENLLEELEELKEKLQLQQLKEELRQLEDRL 568
Query: 201 QTTGDLEVRLMQMNEVKGEMEFKIKELNKKNQSESKLSDTIRELNEENIGLKQKVELQEA 260
Q +L L + K E ++EL ++ + K + E + L Q +EL EA
Sbjct: 569 QELKELLEELRLLRTRKEE----LEELRERLKELKKKLKELEERLSQLEELLQSLELSEA 624
Query: 261 ENQLIQ 266
EN+L +
Sbjct: 625 ENELEE 630
>gnl|CDD|220625 pfam10191, COG7, Golgi complex component 7 (COG7). COG7 is a
component of the conserved oligomeric Golgi complex
which is required for normal Golgi morphology and
localisation. Mutation in COG7 causes a congenital
disorder of glycosylation.
Length = 768
Score = 30.5 bits (69), Expect = 1.1
Identities = 18/79 (22%), Positives = 37/79 (46%), Gaps = 1/79 (1%)
Query: 170 KLTNDIEFLKQEKLKLYEELSSSQKENEILRQTTGDLEVRLMQMNEVKGEMEFKIKELNK 229
++ D+E L+QE L E++ +++ + Q TG L +++ VK M+ + L +
Sbjct: 67 RVLRDVEALRQEASFLKEQMILVKQDIKKFEQDTGQSMQVLERLDTVKQRMQAAKESL-Q 125
Query: 230 KNQSESKLSDTIRELNEEN 248
+ LS + + E
Sbjct: 126 EADGWGTLSAELEDTFETQ 144
>gnl|CDD|215061 PLN00113, PLN00113, leucine-rich repeat receptor-like protein
kinase; Provisional.
Length = 968
Score = 30.2 bits (68), Expect = 1.3
Identities = 20/62 (32%), Positives = 32/62 (51%), Gaps = 6/62 (9%)
Query: 189 LSSSQKENEILRQTTGDLEVRLMQMNEVKGEMEFKIKELNKKNQ-SESKLSDTIRELNEE 247
LSS ++EN I R G + +K M+F +KE+N N S+++D + +L
Sbjct: 689 LSSLKEENVISRGKKG----ASYKGKSIKNGMQFVVKEINDVNSIPSSEIAD-MGKLQHP 743
Query: 248 NI 249
NI
Sbjct: 744 NI 745
>gnl|CDD|218661 pfam05622, HOOK, HOOK protein. This family consists of several
HOOK1, 2 and 3 proteins from different eukaryotic
organisms. The different members of the human gene
family are HOOK1, HOOK2 and HOOK3. Different domains
have been identified in the three human HOOK proteins,
and it was demonstrated that the highly conserved
NH2-domain mediates attachment to microtubules, whereas
the central coiled-coil motif mediates homodimerisation
and the more divergent C-terminal domains are involved
in binding to specific organelles (organelle-binding
domains). It has been demonstrated that endogenous HOOK3
binds to Golgi membranes, whereas both HOOK1 and HOOK2
are localised to discrete but unidentified cellular
structures. In mice the Hook1 gene is predominantly
expressed in the testis. Hook1 function is necessary for
the correct positioning of microtubular structures
within the haploid germ cell. Disruption of Hook1
function in mice causes abnormal sperm head shape and
fragile attachment of the flagellum to the sperm head.
Length = 713
Score = 30.2 bits (68), Expect = 1.3
Identities = 28/137 (20%), Positives = 66/137 (48%), Gaps = 3/137 (2%)
Query: 138 KELHSYQLSTAENELGVVI--ALSEAASLFRSSEKLTNDIEFLKQEKLKLYEELSSSQKE 195
K+ + Q+ T E + V+ A+ E S + S L Q+ K E+L +Q+E
Sbjct: 128 KQEYIQQIMTLEESVQHVVMTAIQELMSKEQGSSPSRESAGNLDQQLKKALEDLKEAQEE 187
Query: 196 NEILRQTTGDLEVRLMQMNEVKGEMEFKIKELNKK-NQSESKLSDTIRELNEENIGLKQK 254
+ L Q +L+ +++ + E K ++ + ++L ++ Q E + +++ L+ +
Sbjct: 188 KDELAQRCHELDKQVLLLQEEKNSLQQENEKLQERLAQLEGSSLGPNQLGSKKYNLLQSQ 247
Query: 255 VELQEAENQLIQTTKNE 271
+E + EN ++ +++
Sbjct: 248 LEQLQEENFRLEAARDD 264
>gnl|CDD|238571 cd01166, KdgK, 2-keto-3-deoxygluconate kinase (KdgK) phosphorylates
2-keto-3-deoxygluconate (KDG) to form
2-keto-3-deoxy-6-phosphogluconate (KDGP). KDG is the
common intermediate product, that allows organisms to
channel D-glucuronate and/or D-galacturinate into the
glycolysis and therefore use polymers, like pectin and
xylan as carbon sources.
Length = 294
Score = 29.9 bits (68), Expect = 1.3
Identities = 11/47 (23%), Positives = 20/47 (42%), Gaps = 3/47 (6%)
Query: 75 TARIMAVQQHLCKAHVIS-VGNTALSRTFRALLRRENVRATEIDRIE 120
A +A + + +++ VG+ R A LRRE V + +
Sbjct: 37 VAVGLA--RLGHRVALVTAVGDDPFGRFILAELRREGVDTSHVRVDP 81
>gnl|CDD|185235 PRK15335, PRK15335, type III secretion system protein SpaM;
Provisional.
Length = 147
Score = 29.3 bits (65), Expect = 1.4
Identities = 19/81 (23%), Positives = 49/81 (60%), Gaps = 2/81 (2%)
Query: 187 EELSSSQKENEILRQTTGDLEVRLMQMNEVKGEMEFKIKELNKKNQSESKLSDTIRE--L 244
EE+ + ++ I+R+ DLE++++Q+ E + E+E K +E KK++ + + +
Sbjct: 66 EEIYTLLRKQSIVRRQIKDLELQIIQIQEKRSELEKKREEFQKKSKYWLRKEGNYQRWII 125
Query: 245 NEENIGLKQKVELQEAENQLI 265
++ ++++++ +EAE++ I
Sbjct: 126 RQKRFYIQREIQQEEAESEEI 146
>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain. This domain is
found at the N terminus of SMC proteins. The SMC
(structural maintenance of chromosomes) superfamily
proteins have ATP-binding domains at the N- and
C-termini, and two extended coiled-coil domains
separated by a hinge in the middle. The eukaryotic SMC
proteins form two kind of heterodimers: the SMC1/SMC3
and the SMC2/SMC4 types. These heterodimers constitute
an essential part of higher order complexes, which are
involved in chromatin and DNA dynamics. This family also
includes the RecF and RecN proteins that are involved in
DNA metabolism and recombination.
Length = 1162
Score = 29.9 bits (67), Expect = 1.6
Identities = 24/134 (17%), Positives = 51/134 (38%), Gaps = 2/134 (1%)
Query: 138 KELHSYQLSTAENELGVVIALSEAASLFRSSEKLTNDIEFLKQEKLKLYEELSSSQKENE 197
K L YQL + + L L ++ +QE+++ ++ +KE E
Sbjct: 209 KALEYYQLKEKLELEEENLLYLDYLKLNEERIDLLQELLRDEQEEIESSKQEL--EKEEE 266
Query: 198 ILRQTTGDLEVRLMQMNEVKGEMEFKIKELNKKNQSESKLSDTIRELNEENIGLKQKVEL 257
IL Q + + + + E++ KE + KL + E+ +++++
Sbjct: 267 ILAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKESEKELKK 326
Query: 258 QEAENQLIQTTKNE 271
E E + + E
Sbjct: 327 LEKELKKEKEEIEE 340
>gnl|CDD|197874 smart00787, Spc7, Spc7 kinetochore protein. This domain is found
in cell division proteins which are required for
kinetochore-spindle association.
Length = 312
Score = 29.6 bits (67), Expect = 1.7
Identities = 26/113 (23%), Positives = 49/113 (43%), Gaps = 10/113 (8%)
Query: 162 ASLFRSSEKLTNDIEFLKQEKLKLYEELSSSQKENEILRQTTGDLEV---RLMQ-----M 213
L + L ++E L K KL + + ++E L+Q +LE + +
Sbjct: 154 EGLKEDYKLLMKELELLNSIKPKLRDRKDALEEELRQLKQLEDELEDCDPTELDRAKEKL 213
Query: 214 NEVKGEMEFKIKELNKKNQSESKLSDTIRELNEENIGLKQKVELQEAENQLIQ 266
++ E+ K+K+L + + +L I +L + L E+ EAE +L Q
Sbjct: 214 KKLLQEIMIKVKKLEELEEELQELESKIEDLTNKKSELNT--EIAEAEKKLEQ 264
>gnl|CDD|226513 COG4026, COG4026, Uncharacterized protein containing TOPRIM domain,
potential nuclease [General function prediction only].
Length = 290
Score = 29.5 bits (66), Expect = 1.8
Identities = 24/84 (28%), Positives = 40/84 (47%), Gaps = 3/84 (3%)
Query: 178 LKQEKLKLYEELSSSQKENEILRQTTGDLEVRLMQMNEVKGEMEFKIKELNKKNQSESKL 237
LK++ +L E+L QKE E L + +LE ++ E +E E ++ + KL
Sbjct: 133 LKEDYEELKEKLEELQKEKEELLKELEELEAEYEEVQERLKRLE---VENSRLEEMLKKL 189
Query: 238 SDTIRELNEENIGLKQKVELQEAE 261
+ +L + L+ VEL E E
Sbjct: 190 PGEVYDLKKRWDELEPGVELPEEE 213
Score = 28.7 bits (64), Expect = 3.1
Identities = 26/101 (25%), Positives = 47/101 (46%), Gaps = 3/101 (2%)
Query: 160 EAASLFRSSEKLTNDIEFLKQEKLKLYEELSSSQKENEILRQTTGDLEVRLMQMNEVKGE 219
E L E+L +E L++EK +L +EL + E E +++ LEV ++ E+ +
Sbjct: 129 EYMDLKEDYEELKEKLEELQKEKEELLKELEELEAEYEEVQERLKRLEVENSRLEEMLKK 188
Query: 220 MEFKIKELNKKNQSESKLSDTIRELNEENIGLKQKVELQEA 260
+ ++ +L K+ +L + EE I K L A
Sbjct: 189 LPGEVYDLKKR---WDELEPGVELPEEELISDLVKETLNLA 226
>gnl|CDD|113356 pfam04582, Reo_sigmaC, Reovirus sigma C capsid protein.
Length = 326
Score = 29.5 bits (66), Expect = 1.8
Identities = 19/91 (20%), Positives = 34/91 (37%), Gaps = 10/91 (10%)
Query: 185 LYEELSSSQKENEILRQTTG-------DLEVRLMQMNEVKGEMEFKIKELNKKNQSESKL 237
+YE L++ + L ++ DL L M + + + L Q+ L
Sbjct: 33 IYERLTNLESATASLSESVSTISSRLSDLSADLQDMTTSLDDAKATLNGLITTVQA---L 89
Query: 238 SDTIRELNEENIGLKQKVELQEAENQLIQTT 268
++ L+ E L V +A +QTT
Sbjct: 90 QTSVDSLSSELADLSLTVSAHDAAISDLQTT 120
>gnl|CDD|237258 PRK12903, secA, preprotein translocase subunit SecA; Reviewed.
Length = 925
Score = 29.6 bits (67), Expect = 2.0
Identities = 19/91 (20%), Positives = 39/91 (42%), Gaps = 6/91 (6%)
Query: 181 EKLKLYEELSSSQKENEILRQTTGDLEVRLMQMNEVKGEMEFKIKELNKKNQSESKLSDT 240
EK+ + E+ S N + +L ++++ E + + +E E +L
Sbjct: 782 EKILIITEILSDGINNSDINDRPQELIDQIIESEEERLKALRIQREEMLMRPEELEL--- 838
Query: 241 IRELNEENIGLKQKVELQEAENQLIQTTKNE 271
+NEE LKQ+++L+ +E Q +
Sbjct: 839 ---INEEQKNLKQEIKLELSEIQEAEEEIQN 866
>gnl|CDD|227306 COG4972, PilM, Tfp pilus assembly protein, ATPase PilM [Cell
motility and secretion / Intracellular trafficking and
secretion].
Length = 354
Score = 29.3 bits (66), Expect = 2.2
Identities = 14/51 (27%), Positives = 24/51 (47%)
Query: 65 VIVPTPGDVVTARIMAVQQHLCKAHVISVGNTALSRTFRALLRRENVRATE 115
++V T +VV +RI A + + V+ V + AL R +R L +
Sbjct: 142 LLVATRKEVVESRIDAFELAGLEPKVLDVESFALLRAYRLLASQFGPEEAA 192
>gnl|CDD|111032 pfam02090, SPAM, Salmonella surface presentation of antigen gene
type M protein.
Length = 147
Score = 28.5 bits (63), Expect = 2.5
Identities = 15/46 (32%), Positives = 30/46 (65%)
Query: 187 EELSSSQKENEILRQTTGDLEVRLMQMNEVKGEMEFKIKELNKKNQ 232
EE+ S ++ I+R+ DLE+++ Q+ E + E+E K +E +K++
Sbjct: 66 EEIYSLLRKQSIVRRQIKDLELQITQIQEKRCELEKKREEFQEKSK 111
>gnl|CDD|191851 pfam07798, DUF1640, Protein of unknown function (DUF1640). This
family consists of sequences derived from hypothetical
eukaryotic proteins. A region approximately 100 residues
in length is featured.
Length = 177
Score = 28.7 bits (65), Expect = 2.5
Identities = 31/108 (28%), Positives = 51/108 (47%), Gaps = 24/108 (22%)
Query: 159 SEAASLFRSSEKLTNDIEFLKQEKLKLYEELSSSQKENEILRQTTGDLEVRLMQMNEVKG 218
SE A+L +EKL ND+E LK +L +E++ VRL +N KG
Sbjct: 73 SEFANLQAENEKLKNDLEKLKN---RLRDEITKVTA------------GVRL-DLNLEKG 116
Query: 219 EMEFKIKELNKKNQSESKLSDTIRELNEENIGLKQKVELQEAENQLIQ 266
+I+E + E K+ +T +++ E GLK ++E + I+
Sbjct: 117 ----RIREE--SAEQELKIKETDTKIDTEIAGLKTQLE--SVKLDTIK 156
>gnl|CDD|238987 cd02029, PRK_like, Phosphoribulokinase-like (PRK-like) is a family
of proteins similar to phosphoribulokinase (PRK), the
enzyme involved in the Benson-Calvin cycle in
chloroplasts or photosynthetic prokaryotes. PRK
catalyzes the phosphorylation of D-ribulose 5-phosphate
to form D-ribulose 1, 5-biphosphate, using ATP and NADPH
produced by the primary reactions of photosynthesis.
Length = 277
Score = 29.0 bits (65), Expect = 2.6
Identities = 9/44 (20%), Positives = 15/44 (34%), Gaps = 8/44 (18%)
Query: 94 GNTALSRTFRALLRRENVRATEI--------DRIEMYKCYRPGD 129
G T + R F + RE + + +R+EM
Sbjct: 11 GTTTVKRAFEHIFAREGIHPAVVEGDSFHRYERMEMKMAIAEAL 54
>gnl|CDD|234309 TIGR03683, A-tRNA_syn_arch, alanyl-tRNA synthetase. This family of
alanyl-tRNA synthetases is limited to the archaea, and
is a subset of those sequences identified by the model
pfam07973 covering the second additional domain (SAD) of
alanyl and threonyl tRNA synthetases.
Length = 902
Score = 29.2 bits (66), Expect = 3.0
Identities = 16/81 (19%), Positives = 38/81 (46%), Gaps = 1/81 (1%)
Query: 164 LFRSSEKLTNDIEFLKQEKLKLYEELSSSQKENEILRQTTGDLEVRLMQMNEVK-GEMEF 222
L SS+ L E L + + +EE +KE E L++ +L++ + + G +E
Sbjct: 749 LRESSDILKVPPEQLPETVKRFFEEWKEQRKEIERLKKKLAELKIYELISEAERIGGVEV 808
Query: 223 KIKELNKKNQSESKLSDTIRE 243
++ ++ ++++ + E
Sbjct: 809 IVELVDADMDELQEIAEKLAE 829
>gnl|CDD|202833 pfam03962, Mnd1, Mnd1 family. This family of proteins includes
MND1 from S. cerevisiae. The mnd1 protein forms a
complex with hop2 to promote homologous chromosome
pairing and meiotic double-strand break repair.
Length = 188
Score = 28.4 bits (64), Expect = 3.3
Identities = 21/88 (23%), Positives = 42/88 (47%), Gaps = 7/88 (7%)
Query: 169 EKLTNDIEFLKQEKLKLYEELSSSQKENEILRQTTGDLEVRLMQMNEVKGEMEFKIKELN 228
EKL ++E LKQ +L ++ +K R+ T + L ++ +++ E++ EL
Sbjct: 72 EKLKKELEELKQRIAELQAQIEKLKKG----REETEERTELLEELKQLEKELKKLKAELE 127
Query: 229 KKNQSESKLSDTIRELNEENIGLKQKVE 256
K +++ + I +L EE K+
Sbjct: 128 KYEKNDPE---RIEKLKEETKVAKEAAN 152
>gnl|CDD|237178 PRK12705, PRK12705, hypothetical protein; Provisional.
Length = 508
Score = 28.9 bits (65), Expect = 3.4
Identities = 17/106 (16%), Positives = 38/106 (35%), Gaps = 13/106 (12%)
Query: 176 EFLKQEKLKLYEELSSSQKENEILRQTTGDLEVRLMQMNEVKGEMEFKIKELNKKNQSES 235
L + K L E + ++E R+ E RL+Q E + ++L+
Sbjct: 52 AALLEAKELLLRERNQQRQEARREREELQREEERLVQKEEQLDA---RAEKLDNLENQLE 108
Query: 236 KLSDTIRELNEENIGLKQKVELQ----------EAENQLIQTTKNE 271
+ + E L+++++ + +A L++ E
Sbjct: 109 EREKALSARELELEELEKQLDNELYRVAGLTPEQARKLLLKLLDAE 154
>gnl|CDD|238159 cd00256, VATPase_H, VATPase_H, regulatory vacuolar ATP synthase
subunit H (Vma13p); activation component of the
peripheral V1 complex of V-ATPase, a heteromultimeric
enzyme which uses ATP to actively transport protons
into organelles and extracellular compartments. The
topology is that of a superhelical spiral, in part the
geometry is similar to superhelices composed of
armadillo repeat motifs, as found in importins for
example.
Length = 429
Score = 28.5 bits (64), Expect = 3.6
Identities = 23/89 (25%), Positives = 41/89 (46%), Gaps = 12/89 (13%)
Query: 167 SSEKLTNDIEFLKQEKLKLYEELSSSQK-ENEILRQTTGDLEVRLMQMNEVKGEMEF--- 222
E LT+D++FL +E ++LSS + ++E+ +G L + +E F
Sbjct: 291 DDEDLTDDLKFLTEELKNSVQDLSSFDEYKSELR---SGRLHWSPVHKSEK-----FWRE 342
Query: 223 KIKELNKKNQSESKLSDTIRELNEENIGL 251
LN+KN K+ + E + + I L
Sbjct: 343 NADRLNEKNYELLKILIHLLETSVDPIIL 371
>gnl|CDD|222878 PHA02562, 46, endonuclease subunit; Provisional.
Length = 562
Score = 28.8 bits (65), Expect = 3.7
Identities = 18/121 (14%), Positives = 47/121 (38%), Gaps = 12/121 (9%)
Query: 160 EAASLFRSSEKLTNDIEFLKQEKLKLYEELSSSQKENEILR---------QTTGDLEVRL 210
E +L E + + L K+ ++ QK ++ Q + R+
Sbjct: 242 ELLNLVMDIEDPSAALNKLNTAAAKIKSKIEQFQKVIKMYEKGGVCPTCTQQISEGPDRI 301
Query: 211 MQMNEVKGEMEFKIKELNKKNQSESKLSDTIRELNEENIGLKQKVELQEAENQLIQTTKN 270
++ + E++ +++L+ ++ D E +++ + LK K+ Q + T +
Sbjct: 302 TKIKDKLKELQHSLEKLDTAIDELEEIMDEFNEQSKKLLELKNKI---STNKQSLITLVD 358
Query: 271 E 271
+
Sbjct: 359 K 359
>gnl|CDD|215507 PLN02939, PLN02939, transferase, transferring glycosyl groups.
Length = 977
Score = 28.7 bits (64), Expect = 3.8
Identities = 31/133 (23%), Positives = 56/133 (42%), Gaps = 21/133 (15%)
Query: 156 IALSEAASLFRSSEKLTNDIEFLKQEKLKLYEELSSSQKENEILRQTTGDLEVRLMQMN- 214
L + + +++EK +I L Q +L+ E+L E E L+ LE+RL + +
Sbjct: 128 FQLEDLVGMIQNAEK---NILLLNQARLQALEDLEKILTEKEALQGKINILEMRLSETDA 184
Query: 215 ------EVKGEMEFKIKELNK-----------KNQSESKLSDTIRELNEENIGLKQKVEL 257
+ K +E ++L K + LS + L EEN+ LK ++
Sbjct: 185 RIKLAAQEKIHVEILEEQLEKLRNELLIRGATEGLCVHSLSKELDVLKEENMLLKDDIQF 244
Query: 258 QEAENQLIQTTKN 270
+AE + T+
Sbjct: 245 LKAELIEVAETEE 257
>gnl|CDD|226470 COG3961, COG3961, Pyruvate decarboxylase and related thiamine
pyrophosphate-requiring enzymes [Carbohydrate transport
and metabolism / Coenzyme metabolism / General function
prediction only].
Length = 557
Score = 28.8 bits (65), Expect = 3.8
Identities = 23/80 (28%), Positives = 30/80 (37%), Gaps = 12/80 (15%)
Query: 103 RALLRRENVRATEIDRI--EMYKCYRPGDIIL----ARL----PLKELHSYQLSTAENEL 152
+A+L N EIDR+ K RP I L A L PL L ++ L
Sbjct: 136 QAMLTDINTAPREIDRVIRTALKQRRPVYIGLPADVADLPIEAPLTPLDLQLKTSDPEAL 195
Query: 153 GVVIALSEAASLFRSSEKLT 172
VI A L ++K
Sbjct: 196 SEVI--DTIAELINKAKKPV 213
>gnl|CDD|177089 CHL00189, infB, translation initiation factor 2; Provisional.
Length = 742
Score = 28.6 bits (64), Expect = 4.2
Identities = 16/73 (21%), Positives = 33/73 (45%), Gaps = 1/73 (1%)
Query: 178 LKQEKLKLYEELSSSQKENEILRQTTGDLEVRLMQMNEVKGEMEFKIKELNKK-NQSESK 236
++KL + ++ + +Q G L + LM+ + K E K +NK N+ + K
Sbjct: 82 KIKKKLHIDDDYDNFFDSKNNSKQFAGPLAISLMRKPKPKTEKLKKKITVNKSTNKKKKK 141
Query: 237 LSDTIRELNEENI 249
+ + EL + +
Sbjct: 142 VLSSKDELIKYDN 154
>gnl|CDD|220368 pfam09730, BicD, Microtubule-associated protein Bicaudal-D. BicD
proteins consist of three coiled-coiled domains and are
involved in dynein-mediated minus end-directed transport
from the Golgi apparatus to the endoplasmic reticulum
(ER). For full functioning they bind with GSK-3beta
pfam05350 to maintain the anchoring of microtubules to
the centromere. It appears that amino-acid residues
437-617 of BicD and the kinase activity of GSK-3 are
necessary for the formation of a complex between BicD
and GSK-3beta in intact cells.
Length = 711
Score = 28.6 bits (64), Expect = 4.4
Identities = 29/82 (35%), Positives = 46/82 (56%), Gaps = 5/82 (6%)
Query: 175 IEFLKQEKLKLYEELSSSQKENEILRQTTGDLEVRLMQMNEV-KGEMEFKIKELNKKNQS 233
I L+ E +L ELS+ Q ENE L + +L+ +M E+ +G + +IKE +
Sbjct: 36 ILELQAELKQLRAELSNVQAENERLSSLSQELK-EENEMLELQRGRLRDEIKEYKFR--- 91
Query: 234 ESKLSDTIRELNEENIGLKQKV 255
E++L EL EENI L+++V
Sbjct: 92 EARLLQDYSELEEENISLQKQV 113
>gnl|CDD|218806 pfam05911, DUF869, Plant protein of unknown function (DUF869).
This family consists of a number of sequences found in
Arabidopsis thaliana, Oryza sativa and Lycopersicon
esculentum (Tomato). The function of this family is
unknown.
Length = 767
Score = 28.4 bits (63), Expect = 4.4
Identities = 24/103 (23%), Positives = 42/103 (40%), Gaps = 12/103 (11%)
Query: 167 SSEKLTNDIEFLKQEKLKLYEELSSSQKENEILR-------QTTGDLEVRLMQMNEVKGE 219
++E LK EK L ELS + + E + + +L+ L E
Sbjct: 588 VCISTLEELETLKSEKENLDGELSKCKDDLEESKNKLQETEKKLEELKSELDASQESNSL 647
Query: 220 MEFKIKELNKKNQS-ESKLSDTIRELNEENIGLKQKVELQEAE 261
E ++K + + +S E + D L E+ L++K+ E E
Sbjct: 648 AETQLKCMVESYESLELRAKD----LEAEHKSLQEKISSLEDE 686
>gnl|CDD|226463 COG3954, PrkB, Phosphoribulokinase [Energy production and
conversion].
Length = 289
Score = 28.3 bits (63), Expect = 4.5
Identities = 14/47 (29%), Positives = 22/47 (46%), Gaps = 6/47 (12%)
Query: 87 KAHVISV------GNTALSRTFRALLRRENVRATEIDRIEMYKCYRP 127
K VI+V G T S FR + + N+ A E++ ++ RP
Sbjct: 4 KHPVIAVTGSSGAGTTTTSLAFRKIFAQLNIHAAEVEGDSFHRYTRP 50
>gnl|CDD|222160 pfam13476, AAA_23, AAA domain.
Length = 204
Score = 27.9 bits (62), Expect = 4.9
Identities = 24/121 (19%), Positives = 49/121 (40%), Gaps = 1/121 (0%)
Query: 147 TAENELGVVIALSEAASLFRSSEKLTNDIEFLKQEKLKLYEELSSS-QKENEILRQTTGD 205
+ + E+ L + K E +L E + + + EIL
Sbjct: 81 ENNKGKLKLRLIEESRELTKKKGKKVKKSILEIVEIDELQEFIDELLKSDKEILPLLLYL 140
Query: 206 LEVRLMQMNEVKGEMEFKIKELNKKNQSESKLSDTIRELNEENIGLKQKVELQEAENQLI 265
+ RL ++ + E + +++EL K+ + D + +L EE K+ EL+E +L+
Sbjct: 141 GQERLEELKFKRKEKKERLEELEKELEELEDEKDLLEKLLEEKEKKKELEELKEELEELL 200
Query: 266 Q 266
+
Sbjct: 201 E 201
>gnl|CDD|217933 pfam04156, IncA, IncA protein. Chlamydia trachomatis is an
obligate intracellular bacterium that develops within a
parasitophorous vacuole termed an inclusion. The
inclusion is non-fusogenic with lysosomes but intercepts
lipids from a host cell exocytic pathway. Initiation of
chlamydial development is concurrent with modification
of the inclusion membrane by a set of C.
trachomatis-encoded proteins collectively designated
Incs. One of these Incs, IncA, is functionally
associated with the homotypic fusion of inclusions. This
family probably includes members of the wider Inc family
rather than just IncA.
Length = 186
Score = 27.4 bits (61), Expect = 5.5
Identities = 31/137 (22%), Positives = 53/137 (38%), Gaps = 5/137 (3%)
Query: 130 IILARLPLKELHSYQLSTAENELGVVIALSEAASLFRSSEKLTNDIEFLKQEKLKLYEEL 189
+ L L L Q + + LSE ++ D+E E E+L
Sbjct: 53 LALGLLCLLLKAPVQSVRPQKLEELQGELSELKQQLSELQEELEDLEERIAELESELEDL 112
Query: 190 SSSQKENEILRQTTGDLEVRLMQMNEVKGEMEFKIKELNKKNQSESK-LSDTIRELNEEN 248
+ L + LE RL + E E+ +++EL + + E + L + + L E
Sbjct: 113 KEDLQLLRELLK---SLEERLESLEESIKELAKELRELRQDLREEVEELREELERLQENL 169
Query: 249 IGLKQKV-ELQEAENQL 264
L++ + ELQ QL
Sbjct: 170 QRLQEAIQELQSLLEQL 186
>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
Length = 1388
Score = 28.1 bits (63), Expect = 5.6
Identities = 25/94 (26%), Positives = 44/94 (46%), Gaps = 11/94 (11%)
Query: 175 IEFLKQEKL-KLYEELSSSQKENEILRQTT------GDLEVRLMQMNEVKGEMEFKIKEL 227
I L +EK+ KL EL +KE E L+ TT DL+ + E + E KE+
Sbjct: 1096 IWSLTKEKVEKLNAELEKKEKELEKLKNTTPKDMWLEDLDKFEEALEEQEEVEE---KEI 1152
Query: 228 NKKNQSESKLSDTIRELNEENIGLKQKVELQEAE 261
K+ + +SK +L + K++ + +++
Sbjct: 1153 AKEQRLKSKTKGKASKLR-KPKLKKKEKKKKKSS 1185
>gnl|CDD|227355 COG5022, COG5022, Myosin heavy chain [Cytoskeleton].
Length = 1463
Score = 28.1 bits (63), Expect = 5.8
Identities = 26/147 (17%), Positives = 48/147 (32%), Gaps = 16/147 (10%)
Query: 141 HSYQLSTAENELGVVIALSEAASLFRSSEKLTNDIEFLKQEKLK------------LYEE 188
++ + + S ++ L + E T I LK+ E
Sbjct: 896 SLKLVNLELESEIIELKKSLSSDLIENLEFKTELIARLKKLLNNIDLEEGPSIEYVKLPE 955
Query: 189 LSSSQKENEILRQTTGDLEVRLMQMNEVKGEMEFKIKELNKKNQSESKLSDTIRELNEEN 248
L+ + L++T+ + E L + + E EL + ++LS L E
Sbjct: 956 LNKLHEVESKLKETSEEYEDLLKKSTILVREGNKANSELKNFKKELAELSKQYGALQEST 1015
Query: 249 IGLKQK----VELQEAENQLIQTTKNE 271
LK+ ELQ A + +
Sbjct: 1016 KQLKELPVEVAELQSASKIISSESTEL 1042
>gnl|CDD|179699 PRK03992, PRK03992, proteasome-activating nucleotidase;
Provisional.
Length = 389
Score = 27.9 bits (63), Expect = 6.0
Identities = 10/33 (30%), Positives = 15/33 (45%)
Query: 169 EKLTNDIEFLKQEKLKLYEELSSSQKENEILRQ 201
+L + L+ E KL EL + E E L+
Sbjct: 18 RQLELKLRDLEAENEKLERELERLKSELEKLKS 50
>gnl|CDD|232931 TIGR00344, alaS, alanine--tRNA ligase. The model describes
alanine--tRNA ligase. This enzyme catalyzes the reaction
(tRNAala + L-alanine + ATP = L-alanyl-tRNAala +
pyrophosphate + AMP) [Protein synthesis, tRNA
aminoacylation].
Length = 851
Score = 28.1 bits (63), Expect = 6.6
Identities = 20/101 (19%), Positives = 41/101 (40%), Gaps = 18/101 (17%)
Query: 158 LSEAASLFRS-SEKLTNDIEFLKQEKLKLYEELSSSQKENEILRQTTGDLEV-RLMQMNE 215
L E A + + +L IE + +EE + QKE E L+ D ++ L E
Sbjct: 710 LKELADILKVTPNELPKKIE-------RFFEEWKALQKELESLKLKIADQKIDELKDKAE 762
Query: 216 VKGEMEFKIKELNKKNQSESKLSDTIRELNEENIGLKQKVE 256
++ ++ ++ K+ ++ + + LK K+
Sbjct: 763 TINGVKVLVEVVDAKDMKS------LKTMADR---LKSKLG 794
>gnl|CDD|150430 pfam09755, DUF2046, Uncharacterized conserved protein H4 (DUF2046).
This is the conserved N-terminal 350 residues of a
family of proteins of unknown function possibly
containing a coiled-coil domain.
Length = 308
Score = 27.6 bits (61), Expect = 6.7
Identities = 36/118 (30%), Positives = 55/118 (46%), Gaps = 21/118 (17%)
Query: 167 SSEKLTNDIEFLKQEKLKLYEEL-------SSSQKENEILRQTTGDLEVRLMQMNE-VKG 218
S E+L I LKQE L EL S Q+EN LRQ + ++ + Q E +
Sbjct: 21 SRERLRGRIHSLKQENKVLKMELDTFKLKCKSLQEENRALRQASVSIQAKAEQEEEFISN 80
Query: 219 EMEFKIKELNKKNQS------------ESKLSDTIRELNEENIGLKQKVELQEAENQL 264
+ KI+ L K+ ++ + LS + +L +E I L+Q +E QE E Q+
Sbjct: 81 TLLKKIQALKKEKETLAMNYEQEEEFLTNDLSRKLTQLRQEKIQLEQTLE-QEQEYQV 137
>gnl|CDD|233459 TIGR01541, tape_meas_lam_C, phage tail tape measure protein, lambda
family. This model represents a relatively
well-conserved region near the C-terminus of the tape
measure protein of a lambda and related phage. This
protein, which controls phage tail length, is typically
about 1000 residues in length. Both low-complexity
sequence and insertion/deletion events appear common in
this family. Mutational studies suggest a ruler or
template role in the determination of phage tail length.
Similar behavior is attributed to proteins from
distantly related or unrelated families in other phage
[Mobile and extrachromosomal element functions, Prophage
functions].
Length = 332
Score = 27.9 bits (62), Expect = 6.7
Identities = 21/94 (22%), Positives = 37/94 (39%), Gaps = 7/94 (7%)
Query: 170 KLTNDIEFLKQEKLKLYEELSSSQKENEILRQTTGDLEVRLMQMNEVKGEMEFKIKELNK 229
LT I K +KL +E S + +EI+ L + E + + +K+L +
Sbjct: 5 LLTQQIADRKLKKLNTADEKSLQSRSDEIIA-----LIKLEKLLEEAERKALEALKKLAE 59
Query: 230 KNQSESKLSDTIRELNEENIGLKQKVELQEAENQ 263
S + R+L+ +G KQ+ L
Sbjct: 60 ATASIR--AQNKRQLDRFGLGDKQRERLDARLQI 91
>gnl|CDD|219816 pfam08388, GIIM, Group II intron, maturase-specific domain. This
region is found mainly in various bacterial and archaeal
species, but a few members of this family are expressed
by fungal and chlamydomonal species. It has been
implicated in the binding of intron RNA during reverse
transcription and splicing.
Length = 80
Score = 26.0 bits (58), Expect = 7.5
Identities = 7/23 (30%), Positives = 14/23 (60%)
Query: 223 KIKELNKKNQSESKLSDTIRELN 245
KI+EL +++ + I++LN
Sbjct: 7 KIRELTRRSNKGLSQEELIKKLN 29
>gnl|CDD|234342 TIGR03752, conj_TIGR03752, integrating conjugative element protein,
PFL_4705 family. Members of this protein family are
found occasionally on plasmids such as the Pseudomonas
putida toluene catabolic TOL plasmid pWWO_p085. Usually,
however, they are found on the bacterial main chromosome
in regions flanked by markers of conjugative transfer
and/or transposition [Mobile and extrachromosomal
element functions, Plasmid functions].
Length = 472
Score = 27.7 bits (62), Expect = 8.0
Identities = 17/75 (22%), Positives = 34/75 (45%), Gaps = 2/75 (2%)
Query: 178 LKQEKLKLYEELSSSQKENEILRQTTGDLEVRLMQMNEVKGEMEFKIKELNKKNQSESKL 237
L++ KL E + + ENE L++ ++ ++ Q V+ E + KE+ + +L
Sbjct: 71 LRKRLAKLISENEALKAENERLQKREQSIDQQIQQ--AVQSETQELTKEIEQLKSERQQL 128
Query: 238 SDTIRELNEENIGLK 252
I +L G+
Sbjct: 129 QGLIDQLQRRLAGVL 143
>gnl|CDD|227569 COG5244, NIP100, Dynactin complex subunit involved in mitotic
spindle partitioning in anaphase B [Cell division and
chromosome partitioning].
Length = 669
Score = 27.7 bits (61), Expect = 8.1
Identities = 27/111 (24%), Positives = 45/111 (40%), Gaps = 13/111 (11%)
Query: 166 RSSEKLTNDIEFLKQEKLKLYEELSSSQKENEILRQTTGDLEVRLMQMNEVKGEMEFKIK 225
+KL E + ++ +L+ S + IL D Q E + I
Sbjct: 443 ARIDKL-IGKEEISKQDNRLFLYPSCDITLSSILTILFSDKLEVFFQGIE---SLLENIT 498
Query: 226 ELNKKNQSESKLSDTIRE-------LNEENIGLKQKVELQEAENQLIQTTK 269
++ ++ S+ I+E LNEENI LK+ + + EN L + TK
Sbjct: 499 IFPEQPSQQTSDSENIKENSLLSDRLNEENIRLKEVLV--QKENMLTEETK 547
>gnl|CDD|233119 TIGR00763, lon, ATP-dependent protease La. This protein is induced
by heat shock and other stresses in E. coli, B.
subtilis, and other species. The yeast member,
designated PIM1, is located in the mitochondrial matrix,
required for mitochondrial function, and also induced by
heat shock [Protein fate, Degradation of proteins,
peptides, and glycopeptides].
Length = 775
Score = 27.6 bits (62), Expect = 8.2
Identities = 34/151 (22%), Positives = 66/151 (43%), Gaps = 23/151 (15%)
Query: 123 KCYRPGDIILARLPLKELHSYQLSTAENELGVVIALSEAASLFRSSEKLTNDIEFLK-QE 181
+ R D + A L LKE Q + L ++ E L ++E LK Q
Sbjct: 158 EPGRLADFVAASLQLKEKDELQEVLETVNIEK--------RLKKALELLKKELELLKLQN 209
Query: 182 KL--KLYEELSSSQKE---NEILRQTTGDLEVRLMQMNEVKGEMEFKIKELNKKNQSESK 236
K+ K+ E++ +Q+E E L+ ++ L + K E+E K+KE ++ K
Sbjct: 210 KITKKVEEKMEKTQREYYLREQLKA----IKKELGIEKDDKDELE-KLKEKLEEL----K 260
Query: 237 LSDTIRELNEENIGLKQKVELQEAENQLIQT 267
L + ++++ E+ + +E +E + +
Sbjct: 261 LPEEVKKVIEKELTKLSLLEPSSSEFTVTRN 291
>gnl|CDD|235456 PRK05425, PRK05425, asparagine synthetase AsnA; Provisional.
Length = 327
Score = 27.5 bits (62), Expect = 8.4
Identities = 9/34 (26%), Positives = 17/34 (50%), Gaps = 1/34 (2%)
Query: 168 SEKLTNDIEFL-KQEKLKLYEELSSSQKENEILR 200
L +I F+ QE Y +L+ ++E+ I +
Sbjct: 160 LPFLPEEITFITSQELEDRYPDLTPKEREDAIAK 193
>gnl|CDD|227512 COG5185, HEC1, Protein involved in chromosome segregation,
interacts with SMC proteins [Cell division and
chromosome partitioning].
Length = 622
Score = 27.6 bits (61), Expect = 8.5
Identities = 25/98 (25%), Positives = 43/98 (43%), Gaps = 8/98 (8%)
Query: 171 LTNDIEFLKQEKLKLYEELSSSQKENEILRQTTGDLEVRLMQMNEVKGEMEFKIKELNKK 230
L +DI L Q KL ELS + + E+ ++ + E+E KELN
Sbjct: 492 LKHDINELTQILEKLELELSEANSKFELSKEENEREL------VAQRIEIEKLEKELNDL 545
Query: 231 N-QSESKLSDTIRELNEENIGLKQ-KVELQEAENQLIQ 266
N S++ + D + + I L + KV+L ++ +
Sbjct: 546 NLLSKTSILDAEQLVQSTEIKLDELKVDLNRKRYKIHK 583
>gnl|CDD|218636 pfam05557, MAD, Mitotic checkpoint protein. This family consists
of several eukaryotic mitotic checkpoint (Mitotic arrest
deficient or MAD) proteins. The mitotic spindle
checkpoint monitors proper attachment of the bipolar
spindle to the kinetochores of aligned sister chromatids
and causes a cell cycle arrest in prometaphase when
failures occur. Multiple components of the mitotic
spindle checkpoint have been identified in yeast and
higher eukaryotes. In S.cerevisiae, the existence of a
Mad1-dependent complex containing Mad2, Mad3, Bub3 and
Cdc20 has been demonstrated.
Length = 722
Score = 27.6 bits (61), Expect = 9.1
Identities = 27/138 (19%), Positives = 51/138 (36%), Gaps = 18/138 (13%)
Query: 144 QLSTAENELGVVIALSEAASLFRSSEKLTNDIEFLKQEKLKLYEELSSSQKENEILRQTT 203
Q A+ E + + + L +I+ LK++ + LS ++ LR
Sbjct: 453 QSHLAKMENQLSELEEDVGQQKDRNNTLETEIKLLKEQLVSNERLLSFVKEATNALRLKI 512
Query: 204 GDLEVRLMQMNEVKGEMEFKIKELNKK---NQSESKL---------------SDTIRELN 245
LE ++ + K +E K++ L + N S +K+ TI L
Sbjct: 513 ETLERERDRLRQEKSLLEMKLEHLCLQGDYNASRTKVLHMSLNPASEAEQIAKQTIEALQ 572
Query: 246 EENIGLKQKVELQEAENQ 263
E LK++++ E
Sbjct: 573 AECEKLKERLQALEEGKS 590
>gnl|CDD|219791 pfam08317, Spc7, Spc7 kinetochore protein. This domain is found in
cell division proteins which are required for
kinetochore-spindle association.
Length = 321
Score = 27.3 bits (61), Expect = 9.2
Identities = 20/75 (26%), Positives = 38/75 (50%), Gaps = 4/75 (5%)
Query: 160 EAASLFRSSEKLTN----DIEFLKQEKLKLYEELSSSQKENEILRQTTGDLEVRLMQMNE 215
E ASL + +++L ++E +QE L ++S +K+ E L+Q +L + + +
Sbjct: 190 EIASLRQLADELNLCDPLELEKARQELRSLSVKISEKRKQLEELQQELQELTIAIEALTN 249
Query: 216 VKGEMEFKIKELNKK 230
K E+ +I E K
Sbjct: 250 KKSELLEEIAEAEKI 264
>gnl|CDD|192987 pfam12325, TMF_TATA_bd, TATA element modulatory factor 1 TATA
binding. This is the C-terminal conserved coiled coil
region of a family of TATA element modulatory factor 1
proteins conserved in eukaryotes. The proteins bind to
the TATA element of some RNA polymerase II promoters and
repress their activity. by competing with the binding of
TATA binding protein. TMF1_TATA_bd is the most conserved
part of the TMFs. TMFs are evolutionarily conserved
golgins that bind Rab6, a ubiquitous ras-like
GTP-binding Golgi protein, and contribute to Golgi
organisation in animal and plant cells. The Rab6-binding
domain appears to be the same region as this C-terminal
family.
Length = 121
Score = 26.4 bits (59), Expect = 9.9
Identities = 26/91 (28%), Positives = 46/91 (50%), Gaps = 10/91 (10%)
Query: 169 EKLTNDIEFLKQEKLKLYEELSSSQKENEILRQTTGDLEVRLMQMNEVKGEMEFKIKELN 228
E+L++ + L+ E L +EL+ + E + RQ V+L + NE E++ +I+EL
Sbjct: 20 ERLSSTLRRLEGELASLKDELARLEAERDEARQEI----VKLTEENEELRELKKEIEELE 75
Query: 229 KKNQSESKLSDTIRELNEENIGLKQKVELQE 259
K+ + + +T EL L +K E E
Sbjct: 76 KELEDLEQRYETTLEL------LGEKSERVE 100
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.313 0.130 0.347
Gapped
Lambda K H
0.267 0.0722 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 13,356,386
Number of extensions: 1287642
Number of successful extensions: 1792
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1641
Number of HSP's successfully gapped: 301
Length of query: 271
Length of database: 10,937,602
Length adjustment: 95
Effective length of query: 176
Effective length of database: 6,723,972
Effective search space: 1183419072
Effective search space used: 1183419072
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 58 (26.1 bits)