RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy5601
(270 letters)
>gnl|CDD|241242 cd01206, EVH1_Homer_Vesl, Homer/Vesl family proteins EVH1 domain.
Homer/Vesl proteins are synaptic scaffolding proteins,
required for long-term potentiation, a form of synaptic
plasticity thought to underlie memory formation. They
contains an N-terminal EVH1 domain and bind to both
neurotransmitter receptors, such as the metabotropic
group 1 glutamate receptor (mGluR) and to other
scaffolding proteins via PPXXF motifs, in order to
target them to the synaptic junction. These mGluRs
possess a long C-terminal intracellular tail that may
be important for subcellular localization of the
receptor. The C-terminus is also the site of binding by
the immediate early gene (IEG), Homer 1a. In contrast
to Homer 1a, other Homer members additionally encode a
C-terminal coiled-coil (CC) domain and form multivalent
complexes that bind group 1 mGluRs. Homer 1a competes
with constitutively expressed CC-Homers to modify the
association of group 1 mGluRs with CC-Homer complexes.
Since Homer proteins are strikingly enriched at the
postsynaptic density (PSD), these observations suggest
a role for the Homer family in regulating synaptic
metabotropic receptor function. PSD-Zip45 (also named
Homer 1c/Vesl-1L) has an EVH1 domain with a longer
alpha-helix and its linking part included in the
conserved region of Homer 1 (CRH1) interacts with the
EVH1 domain of the neighbour CRH1 molecule in the
crystal, suggesting that the EVH1 domain recognizes the
PPXXF motif found in the binding partners, and the
SPLTP sequence (P-motif) in the linking region of the
CRH1. The two types of binding are partly overlapped in
the EVH1 domain, implying a mechanism to regulate
multimerization of Homer 1 family proteins. Homer 2
and Homer 3 are negative regulators of T cell
activation. They bind the nuclear factor of activated T
cells (NFAT) and compete with calcineurin binding. NFAT
plays a critical role in calcium-dependent signaling in
other cell types, including muscle and neurons.
Homer-NFAT binding is also antagonized by active
serine-threonine kinase AKT, enhancing TCR signaling
via calcineurin-dependent dephosphorylation of NFAT
resulting in changes in cytokine expression and an
increase in effector-memory T cell populations in
Homer-deficient mice. The EVH1 domains are part of the
PH domain superamily. There are 5 EVH1 subfamilies:
Enables/VASP, Homer/Vesl, WASP, Dcp1, and Spred.
Ligands are known for three of the EVH1 subfamilies,
all of which bind proline-rich sequences: the
Enabled/VASP family binds to FPPPP peptides, the
Homer/Vesl family binds PPxxF peptides, and the WASP
family binds LPPPEP peptides. EVH1 has a PH-like fold,
despite having minimal sequence similarity to PH or PTB
domains.
Length = 109
Score = 57.7 bits (140), Expect = 7e-11
Identities = 19/21 (90%), Positives = 20/21 (95%)
Query: 1 MTFTKTSQKFGQWSDIRANTV 21
MTFTKTSQKFGQW+D RANTV
Sbjct: 63 MTFTKTSQKFGQWADSRANTV 83
>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
bacterial type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. This family
represents the SMC protein of most bacteria. The smc
gene is often associated with scpB (TIGR00281) and scpA
genes, where scp stands for segregation and condensation
protein. SMC was shown (in Caulobacter crescentus) to be
induced early in S phase but present and bound to DNA
throughout the cell cycle [Cellular processes, Cell
division, DNA metabolism, Chromosome-associated
proteins].
Length = 1179
Score = 48.1 bits (115), Expect = 3e-06
Identities = 53/246 (21%), Positives = 100/246 (40%), Gaps = 9/246 (3%)
Query: 25 QAQLDTLLASQDAQRRIIDTLNHQLAGRVADSANAKKWEIELANLKSNNLRLTSALQEST 84
+ +L+ L ++ L +L + E++ L+ L L
Sbjct: 238 REELEELQEELKEAEEELEELTAELQELEEKLEELRL---EVSELEEEIEELQKELYALA 294
Query: 85 ANVEEWKKQLQAYKEENQKLKTKYIELEATKGVSEAA-VELRKELTSVRSKNETLDSELK 143
+ ++Q Q +E L+ + ELEA E+ EL +EL + K E L +
Sbjct: 295 NEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEEL---KE 351
Query: 144 CRDEEVKKLTNAKHAVEDKCKAAVELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHA 203
+ +L + +E+ EL ++L ++RSK L+ ++ + E+++L
Sbjct: 352 ELESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARLER 411
Query: 204 VEDKCKLLSQENAELQAAISLVQAQLDTLLASQDAQRRIIDTLNHQLAGRVAELATIHRE 263
+ED+ + L QE EL L +A+L L A + ++ L +L L + E
Sbjct: 412 LEDRRERLQQEIEELLK--KLEEAELKELQAELEELEEELEELQEELERLEEALEELREE 469
Query: 264 INTALQ 269
+ A Q
Sbjct: 470 LEEAEQ 475
Score = 40.0 bits (94), Expect = 0.001
Identities = 51/194 (26%), Positives = 93/194 (47%), Gaps = 6/194 (3%)
Query: 72 NNLRLTSA-LQESTANVEEWKKQLQAYKEENQKLKTKYIELEATKGVSEAAVELRKELTS 130
L T L + E ++QL++ + + +K + +Y EL+A E A+ L L
Sbjct: 179 RKLERTRENLDRLEDILNELERQLKSLERQAEKAE-RYKELKAELRELELAL-LVLRLEE 236
Query: 131 VRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCKAAVELRKELTSVRSKNETLDSELKCR 190
+R + E L ELK +EE+++LT +E+K ELR E++ + + E L EL
Sbjct: 237 LREELEELQEELKEAEEELEELTAELQELEEK---LEELRLEVSELEEEIEELQKELYAL 293
Query: 191 DEEVKKLTNAKHAVEDKCKLLSQENAELQAAISLVQAQLDTLLASQDAQRRIIDTLNHQL 250
E+ +L K + ++ L ++ EL+A + ++++LD L ++ L +L
Sbjct: 294 ANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEEL 353
Query: 251 AGRVAELATIHREI 264
AEL + E+
Sbjct: 354 ESLEAELEELEAEL 367
Score = 39.3 bits (92), Expect = 0.002
Identities = 43/242 (17%), Positives = 101/242 (41%), Gaps = 9/242 (3%)
Query: 37 AQRRIIDTLNHQLAGRVADSANAKK----WEIELANLKSNNLRLTSALQESTANVEEWKK 92
+RR I+ L ++ A +K EL L+ +L L+E + + +K
Sbjct: 674 ERRREIEELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRK 733
Query: 93 QLQAYKEENQKLKTKYIELEATKGVSEA-AVELRKELTSVRSKNETLDSELKCRDEEVKK 151
L + E ++L+ + +L EA EL + L + ++E++ + ++++
Sbjct: 734 DLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQ 793
Query: 152 LTNAKHAVEDKCKAA----VELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDK 207
L A+ + L +E ++R + E+L+ + + ++ L + +
Sbjct: 794 LKEELKALREALDELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSED 853
Query: 208 CKLLSQENAELQAAISLVQAQLDTLLASQDAQRRIIDTLNHQLAGRVAELATIHREINTA 267
+ L+ E EL+ I ++++L+ LL + + + L +L EL + + +
Sbjct: 854 IESLAAEIEELEELIEELESELEALLNERASLEEALALLRSELEELSEELRELESKRSEL 913
Query: 268 LQ 269
+
Sbjct: 914 RR 915
Score = 35.4 bits (82), Expect = 0.033
Identities = 42/230 (18%), Positives = 102/230 (44%), Gaps = 22/230 (9%)
Query: 42 IDTLNHQLAGRVADSANAKKWEIELANLKSNNLR--LTSALQESTANVEEWKKQLQAYKE 99
+ L L + + E++ ++ LT+ LQE +EE + ++ +E
Sbjct: 222 LRELELALLVLRLEELREELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEE 281
Query: 100 ENQKLKTKYIELEATKGVSEAAVELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAV 159
E ++L+ + L + +E +K++ R N L+ +L+ + ++++L +
Sbjct: 282 EIEELQKELYALANEI----SRLEQQKQILRERLAN--LERQLEELEAQLEELESKL--- 332
Query: 160 EDKCKAAVELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCKLLSQENAELQ 219
++ + EL ++L ++ + E+L++EL+ E +L + +E+ + L +++
Sbjct: 333 DELAEELAELEEKLEELKEELESLEAELE---ELEAELEELESRLEELEEQLETLRSKV- 388
Query: 220 AAISLVQAQLDTLLASQDAQRRIIDTLNHQLAGRVAELATIHREINTALQ 269
AQL+ +AS + + ++ +L R L E+ L+
Sbjct: 389 -------AQLELQIASLNNEIERLEARLERLEDRRERLQQEIEELLKKLE 431
Score = 34.6 bits (80), Expect = 0.057
Identities = 46/224 (20%), Positives = 97/224 (43%), Gaps = 19/224 (8%)
Query: 27 QLDTLLASQDAQRRIIDTLNHQLAGRVAD-SANAKKWEIELANLKSNNLRLTSALQESTA 85
+L +++ ++ QL R+A S + E E+ L+ L E+ A
Sbjct: 723 ELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEA 782
Query: 86 NVEEWKKQLQAYKEENQKLKTKYIELEATKG--------VSEAAVELRKELTSVRSKNET 137
+EE + Q++ KEE + L+ EL A + E L + + + + E
Sbjct: 783 EIEELEAQIEQLKEELKALREALDELRAELTLLNEEAANLRERLESLERRIAATERRLED 842
Query: 138 LDSELKCRDEEVKKLTNAKHAVEDKCKAAVELRKELT-------SVRSKNETLDSELKCR 190
L+ +++ E+++ L +E+ + EL EL S+ L SEL+
Sbjct: 843 LEEQIEELSEDIESLA---AEIEELEELIEELESELEALLNERASLEEALALLRSELEEL 899
Query: 191 DEEVKKLTNAKHAVEDKCKLLSQENAELQAAISLVQAQLDTLLA 234
EE+++L + + + + + L ++ A+L+ + ++ ++D L
Sbjct: 900 SEELRELESKRSELRRELEELREKLAQLELRLEGLEVRIDNLQE 943
Score = 31.6 bits (72), Expect = 0.54
Identities = 27/163 (16%), Positives = 64/163 (39%), Gaps = 4/163 (2%)
Query: 112 EATKGVSEAAVELRKELTSVRSKNETLDSELKCRD----EEVKKLTNAKHAVEDKCKAAV 167
E + + E EL K L +R + E L+ EL+ E ++++ + +
Sbjct: 684 EKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLEAEVE 743
Query: 168 ELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCKLLSQENAELQAAISLVQA 227
+L + + + + L++E++ +E +++ E + + L + +L+ + ++
Sbjct: 744 QLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALRE 803
Query: 228 QLDTLLASQDAQRRIIDTLNHQLAGRVAELATIHREINTALQT 270
LD L A L +L +A R + +
Sbjct: 804 ALDELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQ 846
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
primarily archaeal type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. It is found
in a single copy and is homodimeric in prokaryotes, but
six paralogs (excluded from this family) are found in
eukarotes, where SMC proteins are heterodimeric. This
family represents the SMC protein of archaea and a few
bacteria (Aquifex, Synechocystis, etc); the SMC of other
bacteria is described by TIGR02168. The N- and
C-terminal domains of this protein are well conserved,
but the central hinge region is skewed in composition
and highly divergent [Cellular processes, Cell division,
DNA metabolism, Chromosome-associated proteins].
Length = 1164
Score = 48.1 bits (115), Expect = 3e-06
Identities = 49/226 (21%), Positives = 88/226 (38%), Gaps = 21/226 (9%)
Query: 24 VQAQLDTLLASQDAQRRIIDTLNHQLAGRVADSANAKKWEIELANLKSNNLRLTSALQES 83
V+ ++ L A + R I +L + E LA L++ +L + ++E
Sbjct: 292 VKEKIGELEAEIASLERSIAEKEREL----------EDAEERLAKLEAEIDKLLAEIEEL 341
Query: 84 TANVEEWKKQLQAYKEENQKLKTKYIELEATKG-VSEAAVELRKELTSVRSKNETLDSEL 142
+EE +K+ EE +LK + +L A V + E R EL R K E L E+
Sbjct: 342 EREIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAETRDELKDYREKLEKLKREI 401
Query: 143 KCRDEEVKKLTNAKHAVEDKCKAAVELRKELTSVRSKNETLDSELKCRDEEVKKLT-NAK 201
E+ +L + + +L + + +K L+ E + + E+KK +
Sbjct: 402 NELKRELDRLQEEL---QRLSEELADLNAAIAGIEAKINELEEEKEDKALEIKKQEWKLE 458
Query: 202 HAVEDKCKL------LSQENAELQAAISLVQAQLDTLLASQDAQRR 241
D K L +E ++ +S +Q +L A A
Sbjct: 459 QLAADLSKYEQELYDLKEEYDRVEKELSKLQRELAEAEAQARASEE 504
Score = 41.6 bits (98), Expect = 4e-04
Identities = 54/255 (21%), Positives = 122/255 (47%), Gaps = 18/255 (7%)
Query: 28 LDTLLASQDAQRRIIDTLNHQLAGRVAD-SANAKKWEIELANLKSNNLRLTSALQESTAN 86
L L+S ++ R I+ +L+ ++D S + E E+ L+ +L L+E +
Sbjct: 686 LKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEELEED 745
Query: 87 VEEWKKQLQAYKEENQKLKTKYIELEATKGVSEAAV-------------ELRKELTSVRS 133
+ +++++ K E ++L+ + ELE E A+ E++ EL+ +
Sbjct: 746 LSSLEQEIENVKSELKELEARIEELEEDLHKLEEALNDLEARLSHSRIPEIQAELSKLEE 805
Query: 134 KNETLDSELKCRDEEVKKLTNAKHAVEDKCKAAVELRKELT----SVRSKNETLDSELKC 189
+ +++ L+ ++++ +LT K +E + + E R +L S+ + E L+ + +
Sbjct: 806 EVSRIEARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEE 865
Query: 190 RDEEVKKLTNAKHAVEDKCKLLSQENAELQAAISLVQAQLDTLLASQDAQRRIIDTLNHQ 249
+EE+++L A +E + L +E EL+A + ++ +++ L A + +R+ + L +
Sbjct: 866 LEEELEELEAALRDLESRLGDLKKERDELEAQLRELERKIEELEAQIEKKRKRLSELKAK 925
Query: 250 LAGRVAELATIHREI 264
L EL+ I
Sbjct: 926 LEALEEELSEIEDPK 940
Score = 40.1 bits (94), Expect = 0.001
Identities = 54/246 (21%), Positives = 103/246 (41%), Gaps = 21/246 (8%)
Query: 24 VQAQLDTLLASQDAQRRIIDTLNHQLAGRVADSANA-----KKWEIELANLKSNNLRLTS 78
++ QL +L + I L +L E E +K L +
Sbjct: 242 IERQLASLEEELEKLTEEISELEKRLEEIEQLLEELNKKIKDLGEEEQLRVKEKIGELEA 301
Query: 79 ALQESTANVEEWKKQLQAYKEENQKLKTKYIELEATKGVSEAAVELRKELTSVRSKNETL 138
+ ++ E +++L+ +E KL+ + +L A EL +E+ R + + L
Sbjct: 302 EIASLERSIAEKERELEDAEERLAKLEAEIDKLLAEIE------ELEREIEEERKRRDKL 355
Query: 139 DSELKCRDEEVKKLTNAKHAVEDKCKAAVELRKELTSVRSKNETLDSELKCRDEEVKKLT 198
E EE++ L +E+ K E R EL R K E L E+ E+ +L
Sbjct: 356 TEEYAELKEELEDLRA---ELEEVDKEFAETRDELKDYREKLEKLKREINELKRELDRL- 411
Query: 199 NAKHAVEDKCKLLSQENAELQAAISLVQAQLDTLLASQDAQRRIIDTLNHQLAGRVAELA 258
+++ + LS+E A+L AAI+ ++A+++ L ++ + I +L A+L+
Sbjct: 412 ------QEELQRLSEELADLNAAIAGIEAKINELEEEKEDKALEIKKQEWKLEQLAADLS 465
Query: 259 TIHREI 264
+E+
Sbjct: 466 KYEQEL 471
Score = 36.6 bits (85), Expect = 0.013
Identities = 36/162 (22%), Positives = 64/162 (39%), Gaps = 5/162 (3%)
Query: 113 ATKGVSEAAVELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCKAA----VE 168
A +G + EL +R + E L EL E++++ N + + A E
Sbjct: 661 APRGGILFSRSEPAELQRLRERLEGLKRELSSLQSELRRIENRLDELSQELSDASRKIGE 720
Query: 169 LRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCKLLSQENAELQAAISLVQAQ 228
+ KE+ + + E L L+ +E++ L V+ + K L EL+ + ++
Sbjct: 721 IEKEIEQLEQEEEKLKERLEELEEDLSSLEQEIENVKSELKELEARIEELEEDLHKLEEA 780
Query: 229 LDTLLASQDAQR-RIIDTLNHQLAGRVAELATIHREINTALQ 269
L+ L A R I +L V+ + REI L
Sbjct: 781 LNDLEARLSHSRIPEIQAELSKLEEEVSRIEARLREIEQKLN 822
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
chromosome partitioning].
Length = 1163
Score = 41.6 bits (98), Expect = 3e-04
Identities = 49/203 (24%), Positives = 91/203 (44%), Gaps = 15/203 (7%)
Query: 63 EIELANLKSNNLRLTSALQESTANVEEWKKQLQAYKEENQKLKTKYIELEATKGVSEAAV 122
E E++ L+ L + L+E +EE K++++A KEE ++ +T ELE E A
Sbjct: 301 EGEISLLRERLEELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAK 360
Query: 123 E-----LRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCKAAVELRKELTSVR 177
E L L + E L EL + E+ ++ N EL++E+ S+
Sbjct: 361 EELEEKLSALLEELEELFEALREELAELEAELAEIRNELE----------ELKREIESLE 410
Query: 178 SKNETLDSELKCRDEEVKKLTNAKHAVEDKCKLLSQENAELQAAISLVQAQLDTLLASQD 237
+ E L L+ EE+K+L ++ + + L++E EL+ + ++ +L L
Sbjct: 411 ERLERLSERLEDLKEELKELEAELEELQTELEELNEELEELEEQLEELRDRLKELERELA 470
Query: 238 AQRRIIDTLNHQLAGRVAELATI 260
+ + L +L+ A L +
Sbjct: 471 ELQEELQRLEKELSSLEARLDRL 493
Score = 39.7 bits (93), Expect = 0.001
Identities = 46/180 (25%), Positives = 86/180 (47%), Gaps = 7/180 (3%)
Query: 87 VEEWKKQLQAYKEENQKLKTKYIELEATKGVSEAAVE-LRKELTSVRSKNETLDSELKCR 145
++E +K+L+ +EE +L+ + EL+ +E +E L+ EL +R + E L EL
Sbjct: 234 LKELRKELEELEEELSRLEEELEELQEELEEAEKEIEELKSELEELREELEELQEELLEL 293
Query: 146 DEEVKKLTNAKHAVEDKCKAAVELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVE 205
EE+++L + ++ + EL EL + + E L +++ EE+++ +E
Sbjct: 294 KEEIEELEGEISLLRERLE---ELENELEELEERLEELKEKIEALKEELEERETLLEELE 350
Query: 206 DKCKLLSQENAELQAAISLVQAQLDTLLASQDAQRRIIDTLNHQLAGRVAELATIHREIN 265
L + EL+ +S + +L+ L A R + L +LA EL + REI
Sbjct: 351 QLLAELEEAKEELEEKLSALLEELEELFE---ALREELAELEAELAEIRNELEELKREIE 407
Score = 35.5 bits (82), Expect = 0.027
Identities = 46/194 (23%), Positives = 91/194 (46%), Gaps = 8/194 (4%)
Query: 80 LQESTANVEEWKKQLQAYKEENQKLKTKYIELEATKGVSEAAVELRKELTSVRSKNETLD 139
L+ +EE +KQL+ + + +K + +Y EL+A E A+ L +L +R + E L+
Sbjct: 188 LERLEDLLEELEKQLEKLERQAEKAE-RYQELKAELRELELAL-LLAKLKELRKELEELE 245
Query: 140 SELKCRDEEVKKLTNAKHAVEDKCKAAVELRKELTSVRSKNETLDSELKCRDEEVKKLTN 199
EL +EE+++L E + + EL+ EL +R + E L EL EE+++L
Sbjct: 246 EELSRLEEELEELQEELEEAEKEIE---ELKSELEELREELEELQEELLELKEEIEELEG 302
Query: 200 AKHAVEDKCKLLSQENAELQAAISLVQAQLD---TLLASQDAQRRIIDTLNHQLAGRVAE 256
+ ++ + L E EL+ + ++ +++ L ++ ++ L +L E
Sbjct: 303 EISLLRERLEELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEE 362
Query: 257 LATIHREINTALQT 270
L + L+
Sbjct: 363 LEEKLSALLEELEE 376
Score = 34.7 bits (80), Expect = 0.053
Identities = 33/156 (21%), Positives = 80/156 (51%), Gaps = 3/156 (1%)
Query: 114 TKGVSEAAVELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCKAAVELRKEL 173
K + E EL +L + + ++L +EL+ ++ +++L +E + EL++EL
Sbjct: 669 LKELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELERQ---LEELKREL 725
Query: 174 TSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCKLLSQENAELQAAISLVQAQLDTLL 233
++ + E L S L+ +EE+++L ++++ + L +E L+ A++ ++ +++ L
Sbjct: 726 AALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLEEALAKLKEEIEELE 785
Query: 234 ASQDAQRRIIDTLNHQLAGRVAELATIHREINTALQ 269
+ A + ++ L +L L + RE+ + Q
Sbjct: 786 EKRQALQEELEELEEELEEAERRLDALERELESLEQ 821
Score = 33.1 bits (76), Expect = 0.15
Identities = 35/170 (20%), Positives = 77/170 (45%), Gaps = 1/170 (0%)
Query: 98 KEENQKLKTKYIELEATKGVSEAAVELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKH 157
E +L+ EL++ K + +L +EL + E ELK +++
Sbjct: 677 AELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELERQLEELKRELAALEEELEQLQ 736
Query: 158 A-VEDKCKAAVELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCKLLSQENA 216
+ +E+ + EL +EL ++ + E L+ EL+ +E + KL +E+K + L +E
Sbjct: 737 SRLEELEEELEELEEELEELQERLEELEEELESLEEALAKLKEEIEELEEKRQALQEELE 796
Query: 217 ELQAAISLVQAQLDTLLASQDAQRRIIDTLNHQLAGRVAELATIHREINT 266
EL+ + + +LD L ++ + + L ++ E+ + +++
Sbjct: 797 ELEEELEEAERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDE 846
Score = 29.7 bits (67), Expect = 2.1
Identities = 38/172 (22%), Positives = 71/172 (41%), Gaps = 8/172 (4%)
Query: 27 QLDTLLASQDAQRRIIDTLNHQLAGRVADSANAK-KWEIELANLKSNNLRLTSALQESTA 85
L L + + + L + + + E E+ L+ L L E
Sbjct: 790 ALQEELEELEEELEEAERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDELEE 849
Query: 86 NVEEWKKQLQAYKEENQKLKTKYIELEATKGVSEAAVELRKELTSVRSKNETLDSELKCR 145
+EE +K+L+ KEE ++L+ + ELE + EL +E + + L+SEL
Sbjct: 850 ELEELEKELEELKEELEELEAEKEELE------DELKELEEEKEELEEELRELESELAEL 903
Query: 146 DEEVKKLTNAKHAVEDKCKAAVELRKELTSVRSKNETLDSELKCRDEEVKKL 197
EE++KL +E K + +E+ + E D+ + E+++L
Sbjct: 904 KEEIEKLRERLEELEAKLE-RLEVELPELEEELEEEYEDTLETELEREIERL 954
>gnl|CDD|218661 pfam05622, HOOK, HOOK protein. This family consists of several
HOOK1, 2 and 3 proteins from different eukaryotic
organisms. The different members of the human gene
family are HOOK1, HOOK2 and HOOK3. Different domains
have been identified in the three human HOOK proteins,
and it was demonstrated that the highly conserved
NH2-domain mediates attachment to microtubules, whereas
the central coiled-coil motif mediates homodimerisation
and the more divergent C-terminal domains are involved
in binding to specific organelles (organelle-binding
domains). It has been demonstrated that endogenous HOOK3
binds to Golgi membranes, whereas both HOOK1 and HOOK2
are localised to discrete but unidentified cellular
structures. In mice the Hook1 gene is predominantly
expressed in the testis. Hook1 function is necessary for
the correct positioning of microtubular structures
within the haploid germ cell. Disruption of Hook1
function in mice causes abnormal sperm head shape and
fragile attachment of the flagellum to the sperm head.
Length = 713
Score = 39.8 bits (93), Expect = 0.001
Identities = 50/188 (26%), Positives = 82/188 (43%), Gaps = 8/188 (4%)
Query: 63 EIELANLKSNNLRLTSALQESTANVEEWKKQLQAYKEENQKLKTKYIELEATKGVSEAAV 122
E ELA L+ N LTS ES A K ++ +E + K K ++E K E
Sbjct: 273 EKELAELQHRNDELTSLAAESQA----LKDEIDVLRESSDKAKKLEAQVETYKKKLEDLN 328
Query: 123 ELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCKAAVELRKELTSVRSKNET 182
+LR+++ + +N +EE+KK A+ +E + EL +L+ K +
Sbjct: 329 DLRRQVKLLEERNAMYMQNTVQLEEELKKANAARGQLETYKRQVQELHAKLSEESKKADK 388
Query: 183 LDSELKCRDEEVKKLTNAKHAVEDKCKLLSQENAELQAAISLVQAQLDTLLASQDAQRRI 242
L+ E K +E+++ L K + + L + N EL+ QAQ D L +
Sbjct: 389 LEFEYKRLEEKLEALQKEKERLLAERDSLRETNEELRCG----QAQQDQLTQADAGTSPS 444
Query: 243 IDTLNHQL 250
D L +L
Sbjct: 445 GDNLAAEL 452
Score = 28.6 bits (64), Expect = 4.0
Identities = 47/223 (21%), Positives = 84/223 (37%), Gaps = 30/223 (13%)
Query: 54 ADSANAKKWEIELANLKSNNLRLTSALQESTANVEEWKKQLQAYKEENQKLKTKYIE--- 110
+S A K E E L+ L E + + +E N++L+ +
Sbjct: 381 EESKKADKLEFEYKRLEEKLEALQKE-------KERLLAERDSLRETNEELRCGQAQQDQ 433
Query: 111 -LEATKGVSEAAVELRKELTS--VRSKNETLDSE---LKCRDE--EVKKLTNAKHAVEDK 162
+A G S + L EL R K L E L+ E E +++T + +ED
Sbjct: 434 LTQADAGTSPSGDNLAAELLPSEYREKLIRLQHENKMLRLGQEGSENERITELQQLLEDA 493
Query: 163 CKAAVELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCKLLSQENA------ 216
+ EL +L R N+ + EL+ + E+++K + + + LL +
Sbjct: 494 NRRNNELETQL---RLANQRI-LELQQQVEDLQKALQEQGSKSEDSSLLKSKLEEHLEQL 549
Query: 217 -ELQAAISLVQAQLDTLLASQDAQR-RIIDTLNHQLAGRVAEL 257
E + + Q++ L QD R I L L + ++
Sbjct: 550 HEANEELQKKREQIEELEPDQDQNLSRKIAELEAALQKKDEDM 592
>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein. This entry is a highly
conserved protein present in eukaryotes.
Length = 680
Score = 39.1 bits (91), Expect = 0.002
Identities = 42/161 (26%), Positives = 71/161 (44%), Gaps = 10/161 (6%)
Query: 60 KKWEIELANLKSNNLRLTSALQESTANVEEWKKQLQAYKEENQKLKTKYIELEATKGVSE 119
KK + EL + N L + + T+ K L K+EN L+TK + + K
Sbjct: 416 KKLQAELQQARQNESELRNQISLLTSLERSLKSDLGQLKKENDMLQTKLNSMVSAK---- 471
Query: 120 AAVELRKELTSVRSKNETLDSELKCRDEEVKKLTNA-KHAVEDKCKAAVELRKELTSVRS 178
+K+ S++S + L SE R K+L K E++ AA + S
Sbjct: 472 -----QKDKQSMQSMEKRLKSEADSRVNAEKQLAEEKKRKKEEEETAARAAAQAAASREE 526
Query: 179 KNETLDSELKCRDEEVKKLTNAKHAVEDKCKLLSQENAELQ 219
E+L + + E+KKL + E++C++L +E EL+
Sbjct: 527 CAESLKQAKQDLEMEIKKLEHDLKLKEEECRMLEKEAQELR 567
>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584). This
protein is found in bacteria and eukaryotes. Proteins in
this family are typically between 943 to 1234 amino
acids in length. This family contains a P-loop motif
suggesting it is a nucleotide binding protein. It may be
involved in replication.
Length = 1198
Score = 39.3 bits (92), Expect = 0.002
Identities = 36/184 (19%), Positives = 74/184 (40%), Gaps = 15/184 (8%)
Query: 87 VEEWKKQLQAYKEENQKLKTKYIELEAT-KGVSEAAVELRKELTSVRSKNETLDSELKCR 145
V ++ +E Q+ + A K E V+ EL + + LK
Sbjct: 595 VPDYAANETELRERLQQAEEALQSAVAKQKQAEEQLVQANAELEEQKRAEAEARTALKQA 654
Query: 146 DEEVKKLTNAKHAVEDKCKAAVELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVE 205
++++L N + +++DK + A+ RK+ ++ LD++LK E+ + + +
Sbjct: 655 RLDLQRLQNEQQSLKDKLELAIAERKQ--QAETQLRQLDAQLKQLLEQQQAF--LEALKD 710
Query: 206 DKCKLLSQENAELQAAISLVQAQLDTLLASQDAQRRIIDTLNHQLAGRVAELATIHREIN 265
D +L ++ A+ Q + QL L A+ +A R A L + ++ +
Sbjct: 711 DFRELRTERLAKWQVVEGELDNQLAQLSAAIEAAR----------TQAKARLKELKKQYD 760
Query: 266 TALQ 269
L
Sbjct: 761 RELA 764
Score = 36.6 bits (85), Expect = 0.012
Identities = 48/244 (19%), Positives = 94/244 (38%), Gaps = 17/244 (6%)
Query: 27 QLDTLLASQDAQRRIIDTLNHQLAGRVADSANAKKWEIELANLKSNNLRLTSALQESTAN 86
+ L QD + I D L + +K + + L S LRL E A+
Sbjct: 218 PPKSRLNPQDVEHWIADI--QALRAIQKVAPEIEKLQEDFEQLLSLELRLQHLHGELVAD 275
Query: 87 VEEWKKQLQAYKEENQKLKTKYIELEATKGVSEAAVELRKELTSVRSKNETLDSELKCRD 146
E ++ + +E +L+ + LE + EA EL +EL++ +K SEL+ +
Sbjct: 276 EERLAEEQEERQEAKNRLRQQLRTLE--DQLKEARDELNQELSAANAKLAADRSELELLE 333
Query: 147 EEVKKLTNAKHAVEDKCKAAVELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVED 206
++ D + +L +L S+RS+ E +++ L + + + ++
Sbjct: 334 DQKG-----AFEDADIEQLQADL-DQLPSIRSELEEVEARLDALTGKHQDVQRKYERLKQ 387
Query: 207 KCKLLSQENAELQAAISLVQAQLDTLLASQDAQR-RIIDTLNHQLAGRVAELATIHREIN 265
K K +L+ + +L + +D Q+ I + L + +L E N
Sbjct: 388 KIKE------QLERDLEKNNERLAAIREEKDRQKAAIEEDLQALESQLRQQLEAGKLEFN 441
Query: 266 TALQ 269
Sbjct: 442 EEEY 445
Score = 30.4 bits (69), Expect = 1.2
Identities = 38/207 (18%), Positives = 72/207 (34%), Gaps = 26/207 (12%)
Query: 8 QKFGQWSDIRANTVSLVQAQLDTLLASQDAQRRIIDTL--NHQLAGRVADSANAKKWEIE 65
+ G + D + QL ++ + + +D L HQ R K+E
Sbjct: 334 DQKGAFEDADIEQLQADLDQLPSIRSELEEVEARLDALTGKHQDVQR--------KYERL 385
Query: 66 LANLKSNNLRLTSALQESTANVEEWKKQLQAYKEE-----NQKLKTKYIELEATKGVSEA 120
+K R E A + E K + +A EE +L+ + + E
Sbjct: 386 KQKIKEQLERDLEKNNERLAAIREEKDRQKAAIEEDLQALESQLRQQLEAGKLEFNEEEY 445
Query: 121 AVELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCKAAVE-LRKELTSVRSK 179
+ELR R + T E + E + +++ +A VE L+ EL +R +
Sbjct: 446 ELELRLGRLKQRLDSATATPEELEQLEINDEALEKAQEEQEQAEANVEQLQSELRQLRKR 505
Query: 180 NETLD----------SELKCRDEEVKK 196
+ +L+ +E++
Sbjct: 506 RDEALEALQRAERRLLQLRQALDELEL 532
>gnl|CDD|234501 TIGR04211, SH3_and_anchor, SH3 domain protein. Members of this
protein family have a signal peptide, a strongly
conserved SH3 domain, a variable region, and then a
C-terminal hydrophobic transmembrane alpha helix region.
Length = 198
Score = 37.7 bits (88), Expect = 0.002
Identities = 23/78 (29%), Positives = 36/78 (46%), Gaps = 3/78 (3%)
Query: 65 ELANLKSNNLRLTSALQESTANVEEWKKQLQAYKEENQKLKTKYIELEATKGVSEAAVEL 124
ELA L+ L L E +E K++L + E ++L+ EL K +S A+EL
Sbjct: 74 ELAELQEELAELQEQLAELQQENQELKQELSTLEAELERLQK---ELARIKQLSANAIEL 130
Query: 125 RKELTSVRSKNETLDSEL 142
+E +R + L E
Sbjct: 131 DEENRELREELAELKQEN 148
Score = 33.4 bits (77), Expect = 0.077
Identities = 16/92 (17%), Positives = 45/92 (48%), Gaps = 4/92 (4%)
Query: 150 KKLTNAKHAVEDKCKAAVELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCK 209
++L + + + + EL++E ++ + TL++EL+ +E+ ++ +
Sbjct: 73 QELAELQEELAELQEQLAELQQENQELKQELSTLEAELERLQKELARIKQLSANAIE--- 129
Query: 210 LLSQENAELQAAISLVQAQLDTLLASQDAQRR 241
L +EN EL+ ++ ++ + + L A + +
Sbjct: 130 -LDEENRELREELAELKQENEALEAENERLQE 160
Score = 29.6 bits (67), Expect = 1.2
Identities = 26/101 (25%), Positives = 56/101 (55%), Gaps = 13/101 (12%)
Query: 87 VEEWKKQLQAYKEENQKLKTKYIELEATKGVSEAAVELRKELTSVRSKNETLDSELKCRD 146
+ E +++L +EE +L+ + EL+ EL++EL+++ ++ E L EL
Sbjct: 68 LPELQQELAELQEELAELQEQLAELQQENQ------ELKQELSTLEAELERLQKEL---- 117
Query: 147 EEVKKLTNAKHAVEDKCKAAVELRKELTSVRSKNETLDSEL 187
+K+L+ ++++ + ELR+EL ++ +NE L++E
Sbjct: 118 ARIKQLSANAIELDEENR---ELREELAELKQENEALEAEN 155
Score = 29.2 bits (66), Expect = 1.7
Identities = 17/80 (21%), Positives = 29/80 (36%)
Query: 24 VQAQLDTLLASQDAQRRIIDTLNHQLAGRVADSANAKKWEIELANLKSNNLRLTSALQES 83
+Q QL L ++ + TL +L + A K+ L N L L E
Sbjct: 85 LQEQLAELQQENQELKQELSTLEAELERLQKELARIKQLSANAIELDEENRELREELAEL 144
Query: 84 TANVEEWKKQLQAYKEENQK 103
E + + + +E Q+
Sbjct: 145 KQENEALEAENERLQENEQR 164
Score = 28.4 bits (64), Expect = 2.8
Identities = 14/89 (15%), Positives = 34/89 (38%)
Query: 24 VQAQLDTLLASQDAQRRIIDTLNHQLAGRVADSANAKKWEIELANLKSNNLRLTSALQES 83
+Q +L L + + L + + + + L + +L++ E
Sbjct: 71 LQQELAELQEELAELQEQLAELQQENQELKQELSTLEAELERLQKELARIKQLSANAIEL 130
Query: 84 TANVEEWKKQLQAYKEENQKLKTKYIELE 112
E +++L K+EN+ L+ + L+
Sbjct: 131 DEENRELREELAELKQENEALEAENERLQ 159
>gnl|CDD|227355 COG5022, COG5022, Myosin heavy chain [Cytoskeleton].
Length = 1463
Score = 37.0 bits (86), Expect = 0.010
Identities = 40/186 (21%), Positives = 65/186 (34%), Gaps = 13/186 (6%)
Query: 38 QRRIIDTLNHQLAGRVADSANAKKWEIELANLKSNNLRLTSALQESTANVEEWKKQLQAY 97
+ + + A+ K A + + L+ + +S VE ++QLQ
Sbjct: 828 REKKLRETEEVEFSLKAEVLIQKFGRSLKAKKRFSLLKKETIYLQSAQRVELAERQLQEL 887
Query: 98 KEEN---QKLKTKYIELEATKGVSEAAVELRKELTSVRSKNETLDSELKCRDEEVKKLTN 154
K + LK +ELE +EL+K L+S +N +EL +KKL N
Sbjct: 888 KIDVKSISSLKLVNLELE------SEIIELKKSLSSDLIENLEFKTEL---IARLKKLLN 938
Query: 155 AKHAVEDKCKAAVELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCKLLSQE 214
E V+L EL + L + ++ +KK T L
Sbjct: 939 NIDLEEGPSIEYVKL-PELNKLHEVESKLKETSEEYEDLLKKSTILVREGNKANSELKNF 997
Query: 215 NAELQA 220
EL
Sbjct: 998 KKELAE 1003
>gnl|CDD|220623 pfam10186, Atg14, UV radiation resistance protein and
autophagy-related subunit 14. The Atg14 or Apg14
proteins are hydrophilic proteins with a predicted
molecular mass of 40.5 kDa, and have a coiled-coil motif
at the N terminus region. Yeast cells with mutant Atg14
are defective not only in autophagy but also in sorting
of carboxypeptidase Y (CPY), a vacuolar-soluble
hydrolase, to the vacuole. Subcellular fractionation
indicate that Apg14p and Apg6p are peripherally
associated with a membrane structure(s). Apg14p was
co-immunoprecipitated with Apg6p, suggesting that they
form a stable protein complex. These results imply that
Apg6/Vps30p has two distinct functions: in the
autophagic process and in the vacuolar protein sorting
pathway. Apg14p may be a component specifically required
for the function of Apg6/Vps30p through the autophagic
pathway. There are 17 auto-phagosomal component proteins
which are categorized into six functional units, one of
which is the AS-PI3K complex (Vps30/Atg6 and Atg14). The
AS-PI3K complex and the Atg2-Atg18 complex are essential
for nucleation, and the specific function of the AS-PI3K
apparently is to produce phosphatidylinositol
3-phosphate (PtdIns(3)P) at the pre-autophagosomal
structure (PAS). The localisation of this complex at the
PAS is controlled by Atg14. Autophagy mediates the
cellular response to nutrient deprivation, protein
aggregation, and pathogen invasion in humans, and
malfunction of autophagy has been implicated in multiple
human diseases including cancer. This effect seems to be
mediated through direct interaction of the human Atg14
with Beclin 1 in the human phosphatidylinositol 3-kinase
class III complex.
Length = 307
Score = 35.0 bits (81), Expect = 0.031
Identities = 16/94 (17%), Positives = 39/94 (41%), Gaps = 2/94 (2%)
Query: 168 ELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCKLLSQE--NAELQAAISLV 225
ELR +L + +NE L +++ E A K ++ ++ +++A IS +
Sbjct: 24 ELRLDLARLLLENEELKQKVEEALEGATNEDGKLAADLLKLEVARKKERLNQIRARISQL 83
Query: 226 QAQLDTLLASQDAQRRIIDTLNHQLAGRVAELAT 259
+ +++ + +R + L+ +L
Sbjct: 84 KEEIEQKRERIEELKRALAQRRSDLSSASYQLEK 117
Score = 27.7 bits (62), Expect = 6.8
Identities = 26/128 (20%), Positives = 55/128 (42%), Gaps = 10/128 (7%)
Query: 123 ELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCKAAVELRKELTSVRSKNET 182
ELR +L + +NE L +++ E + K A + ++ L +R++
Sbjct: 24 ELRLDLARLLLENEELKQKVEEALEGATN-EDGKLAADLLKLEVARKKERLNQIRARISQ 82
Query: 183 LDSELKCRDEEVKKL--------TNAKHAVEDKCKLLSQENAELQAAISLVQAQLDTLLA 234
L E++ + E +++L ++ A K + + +LQ I +++L L
Sbjct: 83 LKEEIEQKRERIEELKRALAQRRSDLSSASYQLEKRRASQLEKLQDEIKRTRSKL-NALH 141
Query: 235 SQDAQRRI 242
S A++R
Sbjct: 142 SLLAEKRS 149
Score = 27.7 bits (62), Expect = 6.9
Identities = 23/157 (14%), Positives = 60/157 (38%), Gaps = 15/157 (9%)
Query: 14 SDIRANTVSLVQAQLDTLLASQDAQRRIIDTLNHQLAGRVADSANAKKWEIELANLKSNN 73
R + L LL +++ ++++ + L +A+ K E+ + N
Sbjct: 16 HCARNSLYELRLDLARLLLENEELKQKVEEALEGATNEDGKLAADLLKLEVARKKERLNQ 75
Query: 74 LRLTSALQESTANVEEWKKQLQAYKEENQKLKTKYIELEATKGVSEAAVELRKELTSVRS 133
+R + + + +E+ +++++ K + ++ +S A+ +L K S
Sbjct: 76 IR--ARISQLKEEIEQKRERIEELKRALAQRRSD---------LSSASYQLEKRRASQLE 124
Query: 134 KNETLDSELKCRDEEVKKLTN-AKHAVEDKCKAAVEL 169
K + E+K ++ L + C+ +L
Sbjct: 125 KLQ---DEIKRTRSKLNALHSLLAEKRSFLCRELAKL 158
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 35.1 bits (80), Expect = 0.037
Identities = 39/149 (26%), Positives = 66/149 (44%), Gaps = 2/149 (1%)
Query: 81 QESTANVEEWKKQLQAYKEENQKLKTKYIELEATKGVSEAAVELRKELTSVRSKNETLDS 140
+ +E KK+ + K+ ++ K K E + + A E +K + + E +
Sbjct: 1414 AAAKKKADEAKKKAEEKKKADE-AKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKA 1472
Query: 141 -ELKCRDEEVKKLTNAKHAVEDKCKAAVELRKELTSVRSKNETLDSELKCRDEEVKKLTN 199
E K + EE KK AK E+ K A E +K + + +E +E + +E KK
Sbjct: 1473 DEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEE 1532
Query: 200 AKHAVEDKCKLLSQENAELQAAISLVQAQ 228
AK A E K ++ EL+ A L +A+
Sbjct: 1533 AKKADEAKKAEEKKKADELKKAEELKKAE 1561
Score = 34.0 bits (77), Expect = 0.092
Identities = 36/159 (22%), Positives = 67/159 (42%), Gaps = 4/159 (2%)
Query: 54 ADSANAKKWEIELANLKSNNLRLTSALQESTANVEEWKKQLQAYKEENQKLKTKYIELEA 113
A A+ K + E A K++ + + + A + + + A + E + K + E +
Sbjct: 1314 AKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKK- 1372
Query: 114 TKGVSEAAVELRKELTSVRSKNETLD--SELKCRDEEVKKLTNAKHAVEDKCKAAVELRK 171
+ + A +K+ + +E E K + +E+KK AK ++ K A E +K
Sbjct: 1373 KEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKK 1432
Query: 172 ELTSVRSKNETLDS-ELKCRDEEVKKLTNAKHAVEDKCK 209
+ + E + E K + EE KK AK E+ K
Sbjct: 1433 ADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKK 1471
Score = 33.6 bits (76), Expect = 0.13
Identities = 50/170 (29%), Positives = 71/170 (41%), Gaps = 13/170 (7%)
Query: 50 AGRVADSANAKKWEIELANLKSNNLRLTSALQESTANVEEWKKQLQAYKEENQKLKTKYI 109
A + AD A K E + A+ E+ EE KK +A K+ + K
Sbjct: 1416 AKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKAD-- 1473
Query: 110 ELEATKGVSEA--AVELRKELTSVRSKNETLD--SELKCRDEEVKKLTNAKHAVE----D 161
EA K EA A E +K+ + K + +E K + +E KK AK A E +
Sbjct: 1474 --EAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAE 1531
Query: 162 KCKAAVELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCKLL 211
+ K A E +K + +E +E + EE KK AK A EDK L
Sbjct: 1532 EAKKADEAKKA-EEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMAL 1580
Score = 32.8 bits (74), Expect = 0.23
Identities = 48/195 (24%), Positives = 71/195 (36%), Gaps = 17/195 (8%)
Query: 33 ASQDAQRRIIDTLNHQLAGRVADSANAKKWE---IELANLKSNNLRLTSALQESTANVEE 89
A + + RI + + + + AKK E I+ LK Q EE
Sbjct: 1586 AKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEE 1645
Query: 90 WKKQLQAYKEENQKLKTKYIELEATKGVSEAAVELRKELTSVRSKNETLDSEL------- 142
KK + K E + E + + + A E +K + E L E
Sbjct: 1646 KKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAE 1705
Query: 143 ---KCRDEEVKKLTNAKHAVEDKCKAAVELRKELTSVRSKNETLDSELKCRDEEVKKLTN 199
K EE KK K A E+ A E +KE + K E E K +EE KK+ +
Sbjct: 1706 ELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAE----EAKKDEEEKKKIAH 1761
Query: 200 AKHAVEDKCKLLSQE 214
K E K + + +E
Sbjct: 1762 LKKEEEKKAEEIRKE 1776
Score = 30.9 bits (69), Expect = 0.76
Identities = 30/127 (23%), Positives = 50/127 (39%), Gaps = 1/127 (0%)
Query: 82 ESTANVEEWKKQLQAYKEENQKLKTKYIELEATKGVS-EAAVELRKELTSVRSKNETLDS 140
E N EE +K +A + + EA K + A E +K + +++ +
Sbjct: 1246 EEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKADEAKKAEEKKKAD 1305
Query: 141 ELKCRDEEVKKLTNAKHAVEDKCKAAVELRKELTSVRSKNETLDSELKCRDEEVKKLTNA 200
E K + EE KK AK E+ K A +K+ + E +E + +E +
Sbjct: 1306 EAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEK 1365
Query: 201 KHAVEDK 207
A E K
Sbjct: 1366 AEAAEKK 1372
Score = 30.9 bits (69), Expect = 0.94
Identities = 41/159 (25%), Positives = 63/159 (39%), Gaps = 11/159 (6%)
Query: 58 NAKKWEIELANLKSNNLRLTSALQESTANVEEWKKQLQAYK-EENQKLKTKYIELEATKG 116
N + + E A + R + E +E KK + K +E +K + K EA K
Sbjct: 1251 NEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKADEAKKAEEKKKADEAKKK 1310
Query: 117 VSEA--AVELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVE----DKCKAAVELR 170
EA A E +K+ + K + K + EE KK A A D+ +AA E
Sbjct: 1311 AEEAKKADEAKKKAEEAKKKAD----AAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKA 1366
Query: 171 KELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCK 209
+ + + + K + EE KK AK E+ K
Sbjct: 1367 EAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKK 1405
Score = 30.5 bits (68), Expect = 1.0
Identities = 38/132 (28%), Positives = 57/132 (43%), Gaps = 6/132 (4%)
Query: 81 QESTANVEEWKKQLQAYK--EENQKLKTKYIELEATKGVSEAAVELRKELTSVRSKNETL 138
+E+ +E KK +A K +E +K + EA K + K+ + +E
Sbjct: 1493 EEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELK 1552
Query: 139 DSELKCRDEEVKKLTNAKHAVEDKCKA---AVELRK-ELTSVRSKNETLDSELKCRDEEV 194
+E + EE KK AK A EDK A A E +K E + + + E K + EE
Sbjct: 1553 KAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEA 1612
Query: 195 KKLTNAKHAVED 206
KK AK E+
Sbjct: 1613 KKAEEAKIKAEE 1624
Score = 30.5 bits (68), Expect = 1.1
Identities = 42/150 (28%), Positives = 63/150 (42%), Gaps = 13/150 (8%)
Query: 82 ESTANVEEWKKQLQAYKEE---NQKLKTKYIELEATKGVSEAAVELRKELTSVRSKNETL 138
E EE KK +A K E N L+ +A + E ++L +E ++++
Sbjct: 1555 EELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKK 1614
Query: 139 DSELKCRDEEVKKLTNAKHAVE-------DKCKAAVELRKELTSVRSKNETLDSELKCRD 191
E K + EE+KK K VE ++ K A EL+K + K E K +
Sbjct: 1615 AEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAA---EEAKKAE 1671
Query: 192 EEVKKLTNAKHAVEDKCKLLSQENAELQAA 221
E+ KK AK A ED+ K E + A
Sbjct: 1672 EDKKKAEEAKKAEEDEKKAAEALKKEAEEA 1701
Score = 30.5 bits (68), Expect = 1.1
Identities = 27/129 (20%), Positives = 49/129 (37%), Gaps = 5/129 (3%)
Query: 81 QESTANVEEWKKQLQAYKEENQKLKTKYIELEATKGVSEAAVELRKELTSVRSKNETLDS 140
+E+ + EE KK + E + + + + E ++ ++ E +
Sbjct: 1233 EEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKA 1292
Query: 141 ELKCRDEEVKKLTNAKHAVEDKCKAAVELRKELTSVRSKNETLDSELKCRDEEVKKLTNA 200
+ + EE KK AK E+ KA E +K+ + K + K + EE KK A
Sbjct: 1293 DEAKKAEEKKKADEAKKKAEEAKKAD-EAKKKAEEAKKKAD----AAKKKAEEAKKAAEA 1347
Query: 201 KHAVEDKCK 209
A +
Sbjct: 1348 AKAEAEAAA 1356
Score = 29.0 bits (64), Expect = 4.0
Identities = 36/152 (23%), Positives = 54/152 (35%), Gaps = 29/152 (19%)
Query: 82 ESTANVEEWKKQLQAYK------------------------EENQKLKTKYIELEATKGV 117
E +E KK+ + K E + + E EA
Sbjct: 1299 EEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADE 1358
Query: 118 SEAAVELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCKAAVELRKELTSVR 177
+EAA E + + + + K + EE KK AK E+ K A EL+K + +
Sbjct: 1359 AEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKK 1418
Query: 178 SKNETLDSELKCRDEEVKKLTNAKHAVEDKCK 209
+ E K + EE KK AK E+ K
Sbjct: 1419 KAD-----EAKKKAEEKKKADEAKKKAEEAKK 1445
>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
recombination, and repair].
Length = 908
Score = 34.4 bits (79), Expect = 0.060
Identities = 41/193 (21%), Positives = 88/193 (45%), Gaps = 1/193 (0%)
Query: 60 KKWEIELANLKSNNLRLTSALQESTANVEEWKKQLQAYKEENQKLKTKYIELEATKGVSE 119
+ E E L+ RL ++ +++L+ + ++L+ K LE + E
Sbjct: 242 AELEEEKERLEELKARLLEIESLELEALKIREEELRELERLLEELEEKIERLEELEREIE 301
Query: 120 AAVELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCKAAVELRKELTSVRSK 179
E + L ++ + E L +LK +E ++KL +E + + E + EL + +
Sbjct: 302 ELEEELEGLRALLEELEELLEKLKSLEERLEKLEEKLEKLESELEELAEEKNELAKLLEE 361
Query: 180 N-ETLDSELKCRDEEVKKLTNAKHAVEDKCKLLSQENAELQAAISLVQAQLDTLLASQDA 238
+ L+ L+ ++E++K +E+ + L +E AEL AA+ +Q +L+ L +
Sbjct: 362 RLKELEERLEELEKELEKALERLKQLEEAIQELKEELAELSAALEEIQEELEELEKELEE 421
Query: 239 QRRIIDTLNHQLA 251
R ++ L ++
Sbjct: 422 LERELEELEEEIK 434
Score = 30.5 bits (69), Expect = 1.0
Identities = 44/208 (21%), Positives = 90/208 (43%), Gaps = 8/208 (3%)
Query: 63 EIELANLKSNNLRLTSALQESTANVEEWKKQLQAYKEENQKLKT-KYIELEATKGVSEAA 121
++E + L ++ + E +++ + +E +L + +ELEA K E
Sbjct: 217 KLEEIQEEQEEEELEQEIEALEERLAELEEEKERLEELKARLLEIESLELEALKIREEEL 276
Query: 122 VELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCKAAVELRKELTSVRSKNE 181
EL + L + K E L+ + E+++L + + EL ++L S+ + E
Sbjct: 277 RELERLLEELEEKIE----RLEELEREIEELEEELEGLRALLEELEELLEKLKSLEERLE 332
Query: 182 TLDSELKCRDEEVKKLTNAKHAVEDKCKLLSQENAELQAAISLVQAQLDTLLASQDAQRR 241
L+ +L+ + E+++L K+ + KLL + EL+ + ++ +L+ L
Sbjct: 333 KLEEKLEKLESELEELAEEKNELA---KLLEERLKELEERLEELEKELEKALERLKQLEE 389
Query: 242 IIDTLNHQLAGRVAELATIHREINTALQ 269
I L +LA A L I E+ +
Sbjct: 390 AIQELKEELAELSAALEEIQEELEELEK 417
Score = 28.6 bits (64), Expect = 4.1
Identities = 22/101 (21%), Positives = 48/101 (47%)
Query: 85 ANVEEWKKQLQAYKEENQKLKTKYIELEATKGVSEAAVELRKELTSVRSKNETLDSELKC 144
A +EE +++++ + E ++ + E + E +L +EL +R + E L +L
Sbjct: 654 AALEELEEKVEELEAEIRRELQRIENEEQLEEKLEELEQLEEELEQLREELEELLKKLGE 713
Query: 145 RDEEVKKLTNAKHAVEDKCKAAVELRKELTSVRSKNETLDS 185
++ +++L + K +E+ K +L K L + E L
Sbjct: 714 IEQLIEELESRKAELEELKKELEKLEKALELLEELREKLGK 754
Score = 27.4 bits (61), Expect = 9.6
Identities = 32/183 (17%), Positives = 79/183 (43%), Gaps = 2/183 (1%)
Query: 87 VEEWKKQLQAYKEENQKLKTKYIELEATKGVSEAAVELRKELTSVRSKNETLDSELKCRD 146
++E K +++ + + +L +L ++ +E+ + E L+ E++ +
Sbjct: 180 IKEAKAKIEELEGQLSELLEDIEDLLEALEEELKELKKLEEIQE-EQEEEELEQEIEALE 238
Query: 147 EEVKKLTNAKHAVEDKCKAAVELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVED 206
E + +L K +E+ +E+ E EL+ EE+++ +E
Sbjct: 239 ERLAELEEEKERLEELKARLLEIESLELEALKIREEELRELERLLEELEEKIERLEELER 298
Query: 207 KCKLLSQENAELQAAISLVQAQLDTLLASQDAQRRIIDTLNHQLAGRVAELATIHREINT 266
+ + L +E L+A + ++ L+ L + ++ ++ + L +L + ELA E+
Sbjct: 299 EIEELEEELEGLRALLEELEELLEKLKSLEERLEKLEEKLE-KLESELEELAEEKNELAK 357
Query: 267 ALQ 269
L+
Sbjct: 358 LLE 360
>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
unknown].
Length = 652
Score = 33.5 bits (77), Expect = 0.11
Identities = 28/110 (25%), Positives = 49/110 (44%), Gaps = 12/110 (10%)
Query: 92 KQLQAYKEENQKLKTKYIELEATKGVSEAAVELRKELTSVR---SKNETLDSELKCRDEE 148
+ ++ +EEN +LK + EL+ +L EL R D E++ RD
Sbjct: 429 ETVERLEEENSELKRELEELKREI------EKLESELERFRREVRDKVRKDREIRARDRR 482
Query: 149 VKKLTNAKHAVEDKCKAAVELRKELTSVRSKNETLDSELKCRDEEVKKLT 198
+++L +E+K K EL ++L +R + S + V+KLT
Sbjct: 483 IERLEK---ELEEKKKRVEELERKLAELRKMRKLELSGKGTPVKVVEKLT 529
Score = 30.4 bits (69), Expect = 0.96
Identities = 16/75 (21%), Positives = 37/75 (49%)
Query: 123 ELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCKAAVELRKELTSVRSKNET 182
+L + + + +N L EL+ E++KL + + + V +E+ + + E
Sbjct: 426 KLEETVERLEEENSELKRELEELKREIEKLESELERFRREVRDKVRKDREIRARDRRIER 485
Query: 183 LDSELKCRDEEVKKL 197
L+ EL+ + + V++L
Sbjct: 486 LEKELEEKKKRVEEL 500
>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
protein; Reviewed.
Length = 782
Score = 33.3 bits (77), Expect = 0.14
Identities = 27/109 (24%), Positives = 49/109 (44%), Gaps = 3/109 (2%)
Query: 68 NLKSNNLRLTSALQESTANVEEWKKQLQAYKEENQKLKTKYIELEATKGVSEAAVELRKE 127
L+ + L+E+ EE +++ + +EE KL + E EA + + EA E +
Sbjct: 531 ELEQKAEEAEALLKEAEKLKEELEEKKEKLQEEEDKLL-EEAEKEAQQAIKEAKKEADEI 589
Query: 128 LTSVRSKNETLDSELKCRD--EEVKKLTNAKHAVEDKCKAAVELRKELT 174
+ +R + + +K + E K+L A E K K E ++EL
Sbjct: 590 IKELRQLQKGGYASVKAHELIEARKRLNKANEKKEKKKKKQKEKQEELK 638
>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
Length = 880
Score = 32.7 bits (75), Expect = 0.20
Identities = 34/135 (25%), Positives = 64/135 (47%), Gaps = 4/135 (2%)
Query: 59 AKKWEIELANLKSNNLRLTSALQESTANVEEWKKQLQAYKEENQKLKTKYIELEATKGVS 118
K E EL + +L L ++ + E +K+L+ ++E ++L+ KY E E + +
Sbjct: 607 LKDAEKELEREEKELKKLEEELDKAFEELAETEKRLEELRKELEELEKKYSE-EEYEELR 665
Query: 119 EAAVELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCKAAVELRK---ELTS 175
E +EL +EL +R++ E L+ + + ++KL E K +L K +
Sbjct: 666 EEYLELSRELAGLRAELEELEKRREEIKKTLEKLKEELEEREKAKKELEKLEKALERVEE 725
Query: 176 VRSKNETLDSELKCR 190
+R K + + LK R
Sbjct: 726 LREKVKKYKALLKER 740
Score = 32.0 bits (73), Expect = 0.36
Identities = 28/158 (17%), Positives = 67/158 (42%), Gaps = 1/158 (0%)
Query: 65 ELANLKSNNLRLTSALQESTANVEEWKKQLQAYKEENQKLKTKYIELEATKGVSEAAVEL 124
E+ L+ L + ++ + E +++++ K+E ++L+ K EL+ K +E ++L
Sbjct: 239 EIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVKELKELKEKAEEYIKL 298
Query: 125 RKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCKAAVELRKELTSVRSKNETLD 184
+ + ++ L +EE+ + +E+K + EL+K+L + + E L+
Sbjct: 299 SEFYEEYLDELREIEKRLSRLEEEINGIEERIKELEEKEERLEELKKKLKELEKRLEELE 358
Query: 185 SELKCRDEEVKKLTNA-KHAVEDKCKLLSQENAELQAA 221
+ +E K + + EL+
Sbjct: 359 ERHELYEEAKAKKEELERLKKRLTGLTPEKLEKELEEL 396
Score = 31.6 bits (72), Expect = 0.43
Identities = 34/126 (26%), Positives = 59/126 (46%), Gaps = 9/126 (7%)
Query: 78 SALQESTANVEEWKKQLQAYKEENQKLKTKYIELEATKGVSEAAVELRKELTSVRSKNET 137
S L+E +EE K+L+ +E ++LK K ELE E EL +E + + + E
Sbjct: 317 SRLEEEINGIEERIKELEEKEERLEELKKKLKELEKRLEELEERHELYEEAKAKKEELER 376
Query: 138 LDSELKCRDEE--VKKLTNAKHAVEDKCKAAVELRKELTSVRSKNETLDSELKCRDEEVK 195
L L E K+L + KA E+ +E++ + ++ L E+K + ++
Sbjct: 377 LKKRLTGLTPEKLEKELEELE-------KAKEEIEEEISKITARIGELKKEIKELKKAIE 429
Query: 196 KLTNAK 201
+L AK
Sbjct: 430 ELKKAK 435
Score = 31.2 bits (71), Expect = 0.60
Identities = 40/179 (22%), Positives = 78/179 (43%), Gaps = 17/179 (9%)
Query: 60 KKWEIELANLKSNNLRLTSALQESTANVEEWKKQLQAYKE-----------ENQKLKTKY 108
+++ EL ++ + ++ + E +K L+ E + + K K
Sbjct: 455 EEYTAELKRIEKELKEIEEKERKLRKELRELEKVLKKESELIKLKELAEQLKELEEKLKK 514
Query: 109 IELEATKGVSEAAVELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCKAAVE 168
LE + +E +L+++L ++ + ++L EL+ +E KKL + +++ + E
Sbjct: 515 YNLEELEKKAEEYEKLKEKLIKLKGEIKSLKKELEKLEELKKKLAELEKKLDELEEELAE 574
Query: 169 LRKEL-----TSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCKLLSQENAELQAAI 222
L KEL SV E L EL+ E +L +A+ +E + K L + EL A
Sbjct: 575 LLKELEELGFESVEELEERL-KELEPFYNEYLELKDAEKELEREEKELKKLEEELDKAF 632
Score = 30.4 bits (69), Expect = 1.3
Identities = 30/120 (25%), Positives = 58/120 (48%)
Query: 90 WKKQLQAYKEENQKLKTKYIELEATKGVSEAAVELRKELTSVRSKNETLDSELKCRDEEV 149
+K + KE ++++ ++ T+ + E E KEL V + + SEL EE+
Sbjct: 164 YKNLGEVIKEIKRRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELREEL 223
Query: 150 KKLTNAKHAVEDKCKAAVELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCK 209
+KL +E+ + EL KEL S+ L+ +++ +E +++L +E+K K
Sbjct: 224 EKLEKEVKELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVK 283
Score = 28.5 bits (64), Expect = 4.3
Identities = 39/195 (20%), Positives = 84/195 (43%), Gaps = 17/195 (8%)
Query: 88 EEWKKQLQAYKEENQKLKTKYIELEATKGVSEAAVELRKELTSVRSKNETLDSELKCRDE 147
E +K ++ + + +K K ELE V E+ EL +R + E L+ E+K +E
Sbjct: 179 ERLEKFIKRTENIEELIKEKEKELEE---VLREINEISSELPELREELEKLEKEVKELEE 235
Query: 148 EVKKLTNAKHAVEDKCKAAVELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDK 207
+++ + +E + +L +++ + + E L E++ +E+VK+L K E+
Sbjct: 236 LKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVKELKELKEKAEEY 295
Query: 208 CKL-------------LSQENAELQAAISLVQAQLDTLLASQDAQRRIIDTLNHQLAGRV 254
KL + + + L+ I+ ++ ++ L ++ + + +L R+
Sbjct: 296 IKLSEFYEEYLDELREIEKRLSRLEEEINGIEERIKE-LEEKEERLEELKKKLKELEKRL 354
Query: 255 AELATIHREINTALQ 269
EL H A
Sbjct: 355 EELEERHELYEEAKA 369
Score = 28.1 bits (63), Expect = 6.7
Identities = 39/139 (28%), Positives = 74/139 (53%), Gaps = 17/139 (12%)
Query: 80 LQESTANVEEWKKQLQAYKEENQKLKTKYIELEATKGVSEAAVELRKELTSVRSKNETLD 139
L+E TA ++ +K+L+ +E+ +KL+ + ELE L+KE S K + L
Sbjct: 454 LEEYTAELKRIEKELKEIEEKERKLRKELRELEK---------VLKKE--SELIKLKELA 502
Query: 140 SELKCRDEEVKKLTNAKHAVEDKCKAAVELRKELTSVRSKNETLDSELKCRDEEVKKLTN 199
+LK +E++KK + +E K + +L+++L ++ + ++L EL E++++L
Sbjct: 503 EQLKELEEKLKKYNLEE--LEKKAEEYEKLKEKLIKLKGEIKSLKKEL----EKLEELKK 556
Query: 200 AKHAVEDKCKLLSQENAEL 218
+E K L +E AEL
Sbjct: 557 KLAELEKKLDELEEELAEL 575
>gnl|CDD|226513 COG4026, COG4026, Uncharacterized protein containing TOPRIM domain,
potential nuclease [General function prediction only].
Length = 290
Score = 32.2 bits (73), Expect = 0.20
Identities = 25/104 (24%), Positives = 45/104 (43%), Gaps = 8/104 (7%)
Query: 67 ANLKSNNLRLTSALQESTANVEEWKKQLQAYKEENQKLKTKYIELEATKGVSEAAVELRK 126
A LK+ L+ + + EE K++L+ ++E ++L + ELEA E+++
Sbjct: 119 AGLKT--LQRVPEYMDLKEDYEELKEKLEELQKEKEELLKELEELEAEYE------EVQE 170
Query: 127 ELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCKAAVELR 170
L + +N L+ LK EV L +E + E
Sbjct: 171 RLKRLEVENSRLEEMLKKLPGEVYDLKKRWDELEPGVELPEEEL 214
Score = 31.4 bits (71), Expect = 0.46
Identities = 18/92 (19%), Positives = 35/92 (38%), Gaps = 3/92 (3%)
Query: 126 KELTSVRSKNETLDSELKCR---DEEVKKLTNAKHAVEDKCKAAVELRKELTSVRSKNET 182
K L V + + + + +E K+ +E+ E+++ L + +N
Sbjct: 122 KTLQRVPEYMDLKEDYEELKEKLEELQKEKEELLKELEELEAEYEEVQERLKRLEVENSR 181
Query: 183 LDSELKCRDEEVKKLTNAKHAVEDKCKLLSQE 214
L+ LK EV L +E +L +E
Sbjct: 182 LEEMLKKLPGEVYDLKKRWDELEPGVELPEEE 213
>gnl|CDD|241234 cd00837, EVH1_family, EVH1 (Drosophila Enabled
(Ena)/Vasodilator-stimulated phosphoprotein (VASP)
homology 1) domain. The EVH1 domains are part of the
PH domain superfamily. EVH1 subfamilies include
Enables/VASP, Homer/Vesl, WASP, and Spred. Ligands
are known for three of the EVH1 subfamilies, all of
which bind proline-rich sequences: the Enabled/VASP
family binds to FPPPP peptides, the Homer/Vesl family
binds PPxxF peptides, and the WASP family binds LPPPEP
peptides. EVH1 has a PH-like fold, despite having
minimal sequence similarity to PH or PTB domains.
Length = 102
Score = 30.9 bits (70), Expect = 0.22
Identities = 7/17 (41%), Positives = 11/17 (64%)
Query: 1 MTFTKTSQKFGQWSDIR 17
+ + K +Q F QW+D R
Sbjct: 62 LVYNKATQTFHQWADDR 78
>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y. Members of this family are
RNase Y, an endoribonuclease. The member from Bacillus
subtilis, YmdA, has been shown to be involved in
turnover of yitJ riboswitch [Transcription, Degradation
of RNA].
Length = 514
Score = 32.2 bits (74), Expect = 0.25
Identities = 38/177 (21%), Positives = 83/177 (46%), Gaps = 30/177 (16%)
Query: 90 WKKQLQAYKEENQKLKTKYIELEATKGVSEAAVELRKELTSVRSKNETLDSELKCRDEEV 149
KK A + + ++ E E K EA +E ++E+ +R++ L+ ELK R E+
Sbjct: 24 EKKLGSAEELAKRIIEEAKKEAETLK--KEALLEAKEEVHKLRAE---LERELKERRNEL 78
Query: 150 KKLTNAKHAVEDKCKAAVELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCK 209
++L E + + + ETLD +++ D++ + L + + +K K
Sbjct: 79 QRL-------ERR-------------LLQREETLDRKMESLDKKEENLEKKEKELSNKEK 118
Query: 210 LLSQENAELQAAISLVQAQLDTL--LASQDAQRRIIDTLNHQLAGRVAELATIHREI 264
L ++ EL+ I+ + +L+ + L ++A+ +++ + + E A + +EI
Sbjct: 119 NLDEKEEELEELIAEQREELERISGLTQEEAKEILLEEVEEEAR---HEAAKLIKEI 172
>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
Length = 880
Score = 32.3 bits (74), Expect = 0.26
Identities = 49/256 (19%), Positives = 98/256 (38%), Gaps = 37/256 (14%)
Query: 38 QRRIIDTLNHQLAGRVA----DSANAKKWEIELANLKSNNLRLTSALQESTANVEEWKKQ 93
R ++ L + +A D A+A+ E L+ + L L+E + ++
Sbjct: 284 LRERLEELEEERDDLLAEAGLDDADAEAVEARREELEDRDEELRDRLEECRVAAQAHNEE 343
Query: 94 LQAYKEENQKLKTKYIELEATKGVSEAAVELRKELTSVRSKNETLDSELKCRDEEV---- 149
++ +E+ L+ + EL E A EL EL R E E++ +EE+
Sbjct: 344 AESLREDADDLEERAEELR------EEAAELESELEEAREAVEDRREEIEELEEEIEELR 397
Query: 150 KKLTNAKHAVEDKCKAAVELRKELTSVRSKNETLDSELKCRDEEVKK------------- 196
++ +A + + ELR+E +R + L++ L+ E V++
Sbjct: 398 ERFGDAPVDLGNAEDFLEELREERDELREREAELEATLRTARERVEEAEALLEAGKCPEC 457
Query: 197 ---LTNAKHA-----VEDKCKLLSQENAELQAAISLVQAQLDTLLASQDAQRRIIDTLNH 248
+ + H ++ + L E +L+ + V+ +L+ +A+ RI
Sbjct: 458 GQPVEGSPHVETIEEDRERVEELEAELEDLEEEVEEVEERLERAEDLVEAEDRIERLEER 517
Query: 249 QLAGRVAELATIHREI 264
+ EL RE
Sbjct: 518 --REDLEELIAERRET 531
Score = 30.4 bits (69), Expect = 1.3
Identities = 51/240 (21%), Positives = 95/240 (39%), Gaps = 36/240 (15%)
Query: 49 LAGRVADSANAKKWEIELANLKSNNLRLTSALQESTANVEEWKKQLQAYKEENQKLKTKY 108
+ R+ + + + E + L+ L + E +EE +++ + +E +L+ +
Sbjct: 494 VEERLERAEDLVEAEDRIERLEERREDLEELIAERRETIEEKRERAEELRERAAELEAEA 553
Query: 109 IELEATKGVSEAAVELRKELTSVRSKNETLDS---ELKCRDEEVKKLTNAKHAVEDKCKA 165
E EAA E +E R + L+S ELK R E ++++ A+ D
Sbjct: 554 EEKR------EAAAEAEEEAEEAREEVAELNSKLAELKERIESLERIRTLLAAIADAEDE 607
Query: 166 AVELRKELTSVRSKNETLDSELKCRDEEVKKL----------------TNAKHA---VED 206
LR++ ++ N+ L + E ++L A+ VE+
Sbjct: 608 IERLREKREALAELNDERRERLAEKRERKRELEAEFDEARIEEAREDKERAEEYLEQVEE 667
Query: 207 KCKLLSQENAELQAAISLVQAQLDTLLASQDAQRRIIDTLNHQLAGRVAELATIHREINT 266
K L +E +LQA I V+ +L+ L ++ RR L RV L ++ E
Sbjct: 668 KLDELREERDDLQAEIGAVENELEELEELRE--RR------EALENRVEALEALYDEAEE 719
Score = 29.6 bits (67), Expect = 2.3
Identities = 36/177 (20%), Positives = 75/177 (42%), Gaps = 19/177 (10%)
Query: 87 VEEWKKQLQAYKEENQKLKTKYIELEATKGVSEAAVELRKELTSVRSKNETLDSELKCRD 146
+EE +++++ + E + L+ + E+E +E VE + + + E L+
Sbjct: 470 IEEDRERVEELEAELEDLEEEVEEVEERLERAEDLVEAEDRIERLEERREDLE------- 522
Query: 147 EEVKKLTNAKHAVEDKCKAAVELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVED 206
+ + + +E+K + A ELR+ L++E + + E + +
Sbjct: 523 ---ELIAERRETIEEKRERAEELRER-------AAELEAEAEEKREAAAEAEEEAEEARE 572
Query: 207 KCKLLSQENAELQAAISLVQAQLDTLLASQDAQRRIIDTLNHQLAGRVAELATIHRE 263
+ L+ + AEL+ I + ++ TLLA+ I+ L + AEL RE
Sbjct: 573 EVAELNSKLAELKERIESL-ERIRTLLAAIADAEDEIERLREKREAL-AELNDERRE 627
Score = 28.9 bits (65), Expect = 3.7
Identities = 37/149 (24%), Positives = 65/149 (43%), Gaps = 10/149 (6%)
Query: 125 RKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCKAAVELRKELTSVRSKNETLD 184
KE + + L+SEL DEE+++ E++ + A E R E V ++E
Sbjct: 198 EKEEKDLHERLNGLESELAELDEEIERY-------EEQREQARETRDEADEVLEEHEERR 250
Query: 185 SELKCRDEEVKKLTNAKHAVEDKCKLLSQENAELQAAISLVQAQLDTLLAS---QDAQRR 241
EL+ + E++ L E + + L++E +L+ + ++ + D LLA DA
Sbjct: 251 EELETLEAEIEDLRETIAETEREREELAEEVRDLRERLEELEEERDDLLAEAGLDDADAE 310
Query: 242 IIDTLNHQLAGRVAELATIHREINTALQT 270
++ +L R EL E A Q
Sbjct: 311 AVEARREELEDRDEELRDRLEECRVAAQA 339
Score = 28.1 bits (63), Expect = 6.2
Identities = 36/167 (21%), Positives = 63/167 (37%), Gaps = 3/167 (1%)
Query: 101 NQKLKTKYIELEATKGVSEAAVELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVE 160
+L E+E + E A E R E V ++E EL+ + E++ L E
Sbjct: 212 ESELAELDEEIERYEEQREQARETRDEADEVLEEHEERREELETLEAEIEDLRETIAETE 271
Query: 161 DKCKAAVELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCKLLSQENAELQA 220
+ + EL +E+ +R + E L+ E E AVE + + L + EL+
Sbjct: 272 RERE---ELAEEVRDLRERLEELEEERDDLLAEAGLDDADAEAVEARREELEDRDEELRD 328
Query: 221 AISLVQAQLDTLLASQDAQRRIIDTLNHQLAGRVAELATIHREINTA 267
+ + ++ R D L + E A + E+ A
Sbjct: 329 RLEECRVAAQAHNEEAESLREDADDLEERAEELREEAAELESELEEA 375
Score = 27.7 bits (62), Expect = 8.5
Identities = 48/206 (23%), Positives = 80/206 (38%), Gaps = 15/206 (7%)
Query: 65 ELANLKSNNLRLTSALQESTANVEEWKKQLQAYKEENQKLKTKYIELEATKGVSEA---A 121
EL L++ L + E+ EE ++++ +E ++L+ + +L A G+ +A A
Sbjct: 252 ELETLEAEIEDLRETIAETEREREELAEEVRDLRERLEELEEERDDLLAEAGLDDADAEA 311
Query: 122 VELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCKAAVELRKELTSVRSKNE 181
VE R+E R E D +CR A+ ED A +L + +R +
Sbjct: 312 VEARREELEDR-DEELRDRLEECRVAAQAHNEEAESLRED----ADDLEERAEELREEAA 366
Query: 182 TLDSELKCRDEEVKKLTNAKHAVEDKCKLLSQENAELQAAISLVQAQLDTLLASQDAQRR 241
L+SEL+ E V+ +E+ E EL+ L + R
Sbjct: 367 ELESELEEAREAVEDRREEIEELEE-------EIEELRERFGDAPVDLGNAEDFLEELRE 419
Query: 242 IIDTLNHQLAGRVAELATIHREINTA 267
D L + A A L T + A
Sbjct: 420 ERDELREREAELEATLRTARERVEEA 445
>gnl|CDD|144235 pfam00568, WH1, WH1 domain. WASp Homology domain 1 (WH1) domain.
WASP is the protein that is defective in
Wiskott-Aldrich syndrome (WAS). The majority of point
mutations occur within the amino- terminal WH1 domain.
The metabotropic glutamate receptors mGluR1alpha and
mGluR5 bind a protein called homer, which is a WH1
domain homologue. A subset of WH1 domains has been
termed a "EVH1" domain and appear to bind a polyproline
motif.
Length = 111
Score = 30.5 bits (69), Expect = 0.33
Identities = 4/18 (22%), Positives = 8/18 (44%)
Query: 1 MTFTKTSQKFGQWSDIRA 18
M + + F ++D R
Sbjct: 71 MEYNQARPFFHTFADSRC 88
>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
Length = 520
Score = 31.7 bits (73), Expect = 0.42
Identities = 40/159 (25%), Positives = 71/159 (44%), Gaps = 18/159 (11%)
Query: 70 KSNNLRLTSALQESTANVEEWKKQLQAYKEENQKLKTKYIELEATKGVSEAAVELRKELT 129
K ++ A +E+ +EE KK+ +A K+E LEA + + + E KEL
Sbjct: 27 KIAEAKIKEAEEEAKRILEEAKKEAEAIKKE--------ALLEAKEEIHKLRNEFEKELR 78
Query: 130 SVRSKNETLDSELKCRDEEVKK----LTNAKHAVEDKCKAAVELRKELTSVRSKNETLDS 185
R++ + L+ L ++E + + L + +E K K + ++EL + E L
Sbjct: 79 ERRNELQKLEKRLLQKEENLDRKLELLEKREEELEKKEKELEQKQQELEKKEEELEELIE 138
Query: 186 ELKCRDEEVKKLTN--AKHAVEDKCKLLSQENAELQAAI 222
E E + LT AK + +K + E A +AA+
Sbjct: 139 EQLQELERISGLTAEEAKEILLEKVE----EEARHEAAV 173
Score = 29.7 bits (68), Expect = 1.9
Identities = 38/176 (21%), Positives = 80/176 (45%), Gaps = 30/176 (17%)
Query: 91 KKQLQAYKEENQKLKTKYIELEATKGVSEAAVELRKELTSVRSKNETLDSELKCRDEEVK 150
K +A +E + L+ E EA K EA +E ++E+ +R++ + EL+ R E++
Sbjct: 31 AKIKEAEEEAKRILEEAKKEAEAIK--KEALLEAKEEIHKLRNE---FEKELRERRNELQ 85
Query: 151 KLTNAKHAVEDKCKAAVELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCKL 210
KL + K E LD +L+ ++ ++L + +E K +
Sbjct: 86 KLEK--------------------RLLQKEENLDRKLELLEKREEELEKKEKELEQKQQE 125
Query: 211 LSQENAELQAAISLVQAQLDTL--LASQDAQRRIIDTLNHQLAGRVAELATIHREI 264
L ++ EL+ I +L+ + L +++A+ +++ + + E A + +EI
Sbjct: 126 LEKKEEELEELIEEQLQELERISGLTAEEAKEILLEKVEEEA---RHEAAVLIKEI 178
>gnl|CDD|227512 COG5185, HEC1, Protein involved in chromosome segregation,
interacts with SMC proteins [Cell division and
chromosome partitioning].
Length = 622
Score = 31.1 bits (70), Expect = 0.62
Identities = 39/217 (17%), Positives = 93/217 (42%), Gaps = 13/217 (5%)
Query: 61 KWEIELANLKSNNLRLTSALQESTANVEEWKKQLQAYKEENQKLK----TKYIELEATKG 116
K+E + +K + L++ + +E +++++A + +L + I E +
Sbjct: 313 KYENYVNAMKQKSQEWPGKLEKLKSEIELKEEEIKALQSNIDELHKQLRKQGISTEQFEL 372
Query: 117 VSEAAVELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCKAAVELRKELTSV 176
+++ +L +EL + +++ L +K R E + + ++E + L + +T
Sbjct: 373 MNQEREKLTRELDKINIQSDKLTKSVKSRKLEAQGIFK---SLEKTLRQYDSLIQNITRS 429
Query: 177 RSK--NETLDSELKCRDEEV-KKLTNAKHAVEDKCKLLSQENAELQAAISLVQAQLDTLL 233
RS+ + DS LK E++ K + +++ K + + N E+Q I + + TL
Sbjct: 430 RSQIGHNVNDSSLKINIEQLFPKGSGINESIK---KSILELNDEIQERIKTEENKSITLE 486
Query: 234 ASQDAQRRIIDTLNHQLAGRVAELATIHREINTALQT 270
+ I+ L L EL+ + + + +
Sbjct: 487 EDIKNLKHDINELTQILEKLELELSEANSKFELSKEE 523
>gnl|CDD|113356 pfam04582, Reo_sigmaC, Reovirus sigma C capsid protein.
Length = 326
Score = 31.0 bits (70), Expect = 0.64
Identities = 17/120 (14%), Positives = 46/120 (38%), Gaps = 4/120 (3%)
Query: 124 LRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCKAAVELRKELTSVRSKNETL 183
+ + LT++ S +L + +L++ ++D + + + L + + + L
Sbjct: 33 IYERLTNLESATASLSESV---STISSRLSDLSADLQDMTTSLDDAKATLNGLITTVQAL 89
Query: 184 DSELKCRDEEVKKLTNAKHAVEDKCKLLSQENAELQAAISLVQAQLDTL-LASQDAQRRI 242
+ + E+ L+ A + L IS +++ + L D ++R+
Sbjct: 90 QTSVDSLSSELADLSLTVSAHDAAISDLQTTVHANSTDISNLKSSVSANGLNITDLEQRV 149
>gnl|CDD|232931 TIGR00344, alaS, alanine--tRNA ligase. The model describes
alanine--tRNA ligase. This enzyme catalyzes the reaction
(tRNAala + L-alanine + ATP = L-alanyl-tRNAala +
pyrophosphate + AMP) [Protein synthesis, tRNA
aminoacylation].
Length = 851
Score = 30.8 bits (70), Expect = 0.81
Identities = 23/108 (21%), Positives = 42/108 (38%), Gaps = 6/108 (5%)
Query: 154 NAKHAVEDKCK-AAVELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCKLLS 212
+ EDK K A L+ + K E E K +E++ L K A + +L
Sbjct: 701 EYLNEQEDKLKELADILKVTPNELPKKIERFFEEWKALQKELESL-KLKIADQKIDELKD 759
Query: 213 QENAELQAAISLVQAQLDTLLASQDAQRRIIDTLNHQLAGRVAELATI 260
+ AE + ++ +D + + + D L +L + LA+
Sbjct: 760 K--AETINGVKVLVEVVDA--KDMKSLKTMADRLKSKLGSAIYVLASF 803
>gnl|CDD|237560 PRK13922, PRK13922, rod shape-determining protein MreC;
Provisional.
Length = 276
Score = 30.3 bits (69), Expect = 0.91
Identities = 21/91 (23%), Positives = 40/91 (43%), Gaps = 10/91 (10%)
Query: 66 LANLKSNNLRLTSALQESTANVEEWKKQLQAYKEENQKLKTKYIELEATKGVSEAAVELR 125
+ ++ S R+ +A +E + V E L +EEN++LK + +ELE+ + +L
Sbjct: 43 VGDVVSPVQRVVNAPREFVSGVFESLASLFDLREENEELKKELLELES---RLQELEQLE 99
Query: 126 KELTSVRSKNETLDSELKCRDEEVKKLTNAK 156
E N L L ++ + A+
Sbjct: 100 AE-------NARLRELLNLKESLDYQFITAR 123
>gnl|CDD|234309 TIGR03683, A-tRNA_syn_arch, alanyl-tRNA synthetase. This family of
alanyl-tRNA synthetases is limited to the archaea, and
is a subset of those sequences identified by the model
pfam07973 covering the second additional domain (SAD) of
alanyl and threonyl tRNA synthetases.
Length = 902
Score = 30.7 bits (70), Expect = 0.97
Identities = 14/34 (41%), Positives = 22/34 (64%), Gaps = 4/34 (11%)
Query: 87 VEEWKKQLQAYKEENQKLKTKYIELEATKGVSEA 120
EEWK+Q ++E ++LK K EL+ + +SEA
Sbjct: 771 FEEWKEQ----RKEIERLKKKLAELKIYELISEA 800
>gnl|CDD|222417 pfam13851, GAS, Growth-arrest specific micro-tubule binding. This
family is the highly conserved central region of a
number of metazoan proteins referred to as growth-arrest
proteins. In mouse, Gas8 is predominantly a testicular
protein, whose expression is developmentally regulated
during puberty and spermatogenesis. In humans, it is
absent in infertile males who lack the ability to
generate gametes. The localisation of Gas8 in the
motility apparatus of post-meiotic gametocytes and
mature spermatozoa, together with the detection of Gas8
also in cilia at the apical surfaces of epithelial cells
lining the pulmonary bronchi and Fallopian tubes
suggests that the Gas8 protein may have a role in the
functioning of motile cellular appendages. Gas8 is a
microtubule-binding protein localised to regions of
dynein regulation in mammalian cells.
Length = 201
Score = 29.9 bits (68), Expect = 1.1
Identities = 19/59 (32%), Positives = 31/59 (52%), Gaps = 3/59 (5%)
Query: 57 ANAKKWEIELANLKSNNLRLTSALQESTANVEEWKKQLQAY---KEENQKLKTKYIELE 112
N + E +A + N RL L+++ VEE +K+L+ Y K+ + LK + ELE
Sbjct: 41 KNEEHNEKLMAEIAQENKRLVEPLKKAEEEVEELRKKLKDYEKDKQSLKNLKARLKELE 99
>gnl|CDD|234342 TIGR03752, conj_TIGR03752, integrating conjugative element protein,
PFL_4705 family. Members of this protein family are
found occasionally on plasmids such as the Pseudomonas
putida toluene catabolic TOL plasmid pWWO_p085. Usually,
however, they are found on the bacterial main chromosome
in regions flanked by markers of conjugative transfer
and/or transposition [Mobile and extrachromosomal
element functions, Plasmid functions].
Length = 472
Score = 30.4 bits (69), Expect = 1.2
Identities = 21/85 (24%), Positives = 38/85 (44%), Gaps = 13/85 (15%)
Query: 168 ELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCKLLSQENAELQAAISLVQA 227
ELRK L + S+NE LK +E ++K + Q +Q+ +
Sbjct: 70 ELRKRLAKLISENE----ALKAENERLQKREQ---------SIDQQIQQAVQSETQELTK 116
Query: 228 QLDTLLASQDAQRRIIDTLNHQLAG 252
+++ L + + + +ID L +LAG
Sbjct: 117 EIEQLKSERQQLQGLIDQLQRRLAG 141
>gnl|CDD|237110 PRK12472, PRK12472, hypothetical protein; Provisional.
Length = 508
Score = 30.2 bits (68), Expect = 1.3
Identities = 33/143 (23%), Positives = 53/143 (37%), Gaps = 27/143 (18%)
Query: 75 RLTSALQESTANVEEWKKQLQAYKEENQKLKTKYIELEATKGVSEAAVELRKELTSVRSK 134
L +++ +E K A E LK +LE K ++A ++ R + +K
Sbjct: 194 TLAREAEDAARAADEAKTAAAAAAREAAPLKASLRKLERAKARADAELK-RADKALAAAK 252
Query: 135 NETLDSELKCRDEEVK------------KLTNAKHAVEDKCKAAVELRKELTSVRSKNET 182
+ E K R EE + +L AK E K AA ++ + +K
Sbjct: 253 TD----EAKARAEERQQKAAQQAAEAATQLDTAKADAEAKRAAAAATKEAAKAAAAKKA- 307
Query: 183 LDSELKCRDEEVKKLTNAKHAVE 205
E K T+AK A+E
Sbjct: 308 ---------ETAKAATDAKLALE 321
>gnl|CDD|222790 PHA00430, PHA00430, tail fiber protein.
Length = 568
Score = 30.2 bits (68), Expect = 1.3
Identities = 23/129 (17%), Positives = 42/129 (32%), Gaps = 8/129 (6%)
Query: 54 ADSANAKKWEIELANLKSNNLRLTSALQESTANVEEWKKQLQAYKEENQKLKTKYIELEA 113
+ AN + E + A ++ R + S N ++W+ + E + K A
Sbjct: 165 RNEANRSRNEADRARNQAE--RFNNESGASATNTKQWRSEADGSNSEANRFKG-----YA 217
Query: 114 TKGVSEAAVELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCKAAVEL-RKE 172
S + +S + N D K + +A AA R +
Sbjct: 218 DSMTSSVEAAKGQAESSSKEANTAGDYATKAAASASAAHASEVNAANSATAAATSANRAK 277
Query: 173 LTSVRSKNE 181
+ R+K E
Sbjct: 278 QQADRAKTE 286
>gnl|CDD|192987 pfam12325, TMF_TATA_bd, TATA element modulatory factor 1 TATA
binding. This is the C-terminal conserved coiled coil
region of a family of TATA element modulatory factor 1
proteins conserved in eukaryotes. The proteins bind to
the TATA element of some RNA polymerase II promoters and
repress their activity. by competing with the binding of
TATA binding protein. TMF1_TATA_bd is the most conserved
part of the TMFs. TMFs are evolutionarily conserved
golgins that bind Rab6, a ubiquitous ras-like
GTP-binding Golgi protein, and contribute to Golgi
organisation in animal and plant cells. The Rab6-binding
domain appears to be the same region as this C-terminal
family.
Length = 121
Score = 28.7 bits (65), Expect = 1.3
Identities = 21/73 (28%), Positives = 35/73 (47%)
Query: 127 ELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCKAAVELRKELTSVRSKNETLDSE 186
EL S++ + L++E +E+ KLT + + K EL KEL + + ET
Sbjct: 32 ELASLKDELARLEAERDEARQEIVKLTEENEELRELKKEIEELEKELEDLEQRYETTLEL 91
Query: 187 LKCRDEEVKKLTN 199
L + E V++L
Sbjct: 92 LGEKSERVEELRA 104
>gnl|CDD|221489 pfam12252, SidE, Dot/Icm substrate protein. This family of proteins
is found in bacteria. Proteins in this family are
typically between 397 and 1543 amino acids in length.
This family is the SidE protein in the Dot/Icm pathway of
Legionella pneumophila bacteria. There is little
literature describing the family.
Length = 1443
Score = 30.2 bits (68), Expect = 1.4
Identities = 22/71 (30%), Positives = 36/71 (50%), Gaps = 8/71 (11%)
Query: 85 ANVEEWKKQLQAYKEENQKLKTKYIELEATKGVSEAAVELRKELTSVRSK----NETLDS 140
+++E+ +KQLQ + KL Y+ LE TK +S E K + ++SK +T
Sbjct: 1164 SDIEKLEKQLQ---VIDTKLADAYL-LEVTKQISALEKEKPKNQSELKSKIAKFFDTTAD 1219
Query: 141 ELKCRDEEVKK 151
R+E +KK
Sbjct: 1220 IEVLRNERIKK 1230
>gnl|CDD|237551 PRK13902, alaS, alanyl-tRNA synthetase; Provisional.
Length = 900
Score = 30.2 bits (69), Expect = 1.4
Identities = 14/34 (41%), Positives = 23/34 (67%), Gaps = 4/34 (11%)
Query: 87 VEEWKKQLQAYKEENQKLKTKYIELEATKGVSEA 120
EEWK+Q K+E +KL+ + EL A++ +S+A
Sbjct: 767 FEEWKEQ----KKEIEKLRKELAELLASELLSKA 796
>gnl|CDD|183218 PRK11590, PRK11590, hypothetical protein; Provisional.
Length = 211
Score = 29.6 bits (67), Expect = 1.5
Identities = 10/31 (32%), Positives = 13/31 (41%)
Query: 8 QKFGQWSDIRANTVSLVQAQLDTLLASQDAQ 38
F +W +VQ +L T L S DA
Sbjct: 84 ADFVRWFRDNVTAFPVVQERLTTYLLSSDAD 114
>gnl|CDD|129694 TIGR00606, rad50, rad50. All proteins in this family for which
functions are known are involvedin recombination,
recombinational repair, and/or non-homologous end
joining.They are components of an exonuclease complex
with MRE11 homologs. This family is distantly related to
the SbcC family of bacterial proteins.This family is
based on the phylogenomic analysis of JA Eisen (1999,
Ph.D. Thesis, Stanford University).
Length = 1311
Score = 30.0 bits (67), Expect = 1.6
Identities = 27/103 (26%), Positives = 54/103 (52%), Gaps = 3/103 (2%)
Query: 95 QAYKEENQKLKTKYIELEATKGVSEAAVELRKELTSVRSKNETLDSELKCRDEEVKKLTN 154
Q + + QK++ +EL+ K E A E+R ++TS ++ E+ +K + E+ L N
Sbjct: 193 QVRQTQGQKVQEHQMELKYLKQYKEKACEIRDQITSKEAQLESSREIVKSYENELDPLKN 252
Query: 155 AKHAVEDKCKAAVELRKELTSVRSKN---ETLDSELKCRDEEV 194
+E ++L E+ +++S+ E +SEL+ + E+V
Sbjct: 253 RLKEIEHNLSKIMKLDNEIKALKSRKKQMEKDNSELELKMEKV 295
>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein. Function of MutS2 is
unknown. It should not be considered a DNA mismatch
repair protein. It is likely a DNA mismatch binding
protein of unknown cellular function [DNA metabolism,
Other].
Length = 771
Score = 29.8 bits (67), Expect = 1.7
Identities = 21/127 (16%), Positives = 49/127 (38%), Gaps = 8/127 (6%)
Query: 56 SANAKKWEIELANLKSNNLRLTSALQESTANVEEWKKQLQAYKEENQKLKTKYIELEATK 115
SA K+ E + +L+ L+E +E +++++ KE + K +E EA +
Sbjct: 521 SALEKELEQKNEHLEK-------LLKEQEKLKKELEQEMEELKERERNKKL-ELEKEAQE 572
Query: 116 GVSEAAVELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCKAAVELRKELTS 175
+ E+ + ++ K E+K ++ VK + + + +
Sbjct: 573 ALKALKKEVESIIRELKEKKIHKAKEIKSIEDLVKLKETKQKIPQKPTNFQADKIGDKVR 632
Query: 176 VRSKNET 182
+R +
Sbjct: 633 IRYFGQK 639
>gnl|CDD|226809 COG4372, COG4372, Uncharacterized protein conserved in bacteria
with the myosin-like domain [Function unknown].
Length = 499
Score = 29.6 bits (66), Expect = 2.0
Identities = 41/232 (17%), Positives = 80/232 (34%), Gaps = 20/232 (8%)
Query: 26 AQLDTLLASQDAQRRIIDTLNHQL------AGRVADSANAKKWEIELANLKSNNLR--LT 77
QLD + A R + T + + E E + R L
Sbjct: 74 FQLDDIRPQLRALRTELGTAQGEKRAAETEREAARSELQKARQEREAVRQELAAARQNLA 133
Query: 78 SALQESTANVEEWKKQLQAYKEENQKLKTKYIELEATK-GVSEAAVELRKELTSVRSKNE 136
A QE + KQ Q + + L + +LEA + + +L+ T ++S+
Sbjct: 134 KAQQE----LARLTKQAQDLQTRLKTLAEQRRQLEAQAQSLQASQKQLQASATQLKSQVL 189
Query: 137 TLDSELKCRDEEVKKLTNAKHAVEDKCK-------AAVELRKELTSVRSKNETLDSELKC 189
L ++E + L +A + + + AA + + + ++ ++
Sbjct: 190 DLKLRSAQIEQEAQNLATRANAAQARTEELARRAAAAQQTAQAIQQRDAQISQKAQQIAA 249
Query: 190 RDEEVKKLTNAKHAVEDKCKLLSQENAELQAAISLVQAQLDTLLASQDAQRR 241
R E++++ +E L QE A+L+A A+Q Q
Sbjct: 250 RAEQIRERERQLQRLETAQARLEQEVAQLEAYYQAYVRLRQQAAATQRGQVL 301
>gnl|CDD|235600 PRK05771, PRK05771, V-type ATP synthase subunit I; Validated.
Length = 646
Score = 29.5 bits (67), Expect = 2.1
Identities = 28/144 (19%), Positives = 54/144 (37%), Gaps = 18/144 (12%)
Query: 63 EIELANLKSNNLRLTSALQESTANVEEWKKQLQAYKEENQKLKTKYIEL------EATKG 116
IE + +N RL T + E +L++Y + L+ + ++ E K
Sbjct: 32 HIEDLKEELSNERLRKLRSLLTK-LSEALDKLRSYLPKLNPLREEKKKVSVKSLEELIKD 90
Query: 117 VSEAAVELRKELTSVRSKNETLDSELK-CRDE--EVKKLTNAKHAVEDKCKAAVELRKEL 173
V E ++ KE+ + + L++E+K E ++ N ++ L +
Sbjct: 91 VEEELEKIEKEIKELEEEISELENEIKELEQEIERLEPWGNFD--LDLSL----LLGFKY 144
Query: 174 TSVR--SKNETLDSELKCRDEEVK 195
SV + E ELK +
Sbjct: 145 VSVFVGTVPEDKLEELKLESDVEN 168
Score = 28.0 bits (63), Expect = 6.8
Identities = 20/90 (22%), Positives = 36/90 (40%), Gaps = 19/90 (21%)
Query: 76 LTSALQESTANVEEWKKQLQAYKEENQKLKTKYIELEATKGVSEAAVELRKELTSVRSKN 135
+ ++E +EE +K+ ++ EE ++L KY+E + L + L +
Sbjct: 213 PSELIREIKEELEEIEKERESLLEELKELAKKYLEEL---------LALYEYLEIELERA 263
Query: 136 ETLDSELKCR----------DEEVKKLTNA 155
E L LK ++ VKKL
Sbjct: 264 EALSKFLKTDKTFAIEGWVPEDRVKKLKEL 293
Score = 28.0 bits (63), Expect = 7.3
Identities = 27/102 (26%), Positives = 39/102 (38%), Gaps = 23/102 (22%)
Query: 109 IELEATKGVSEAAVELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCKAAVE 168
+ELE SE E+++EL + + E+L ELK E KK ++ A E
Sbjct: 205 LELEEEGTPSELIREIKEELEEIEKERESLLEELK---ELAKKY-------LEELLALYE 254
Query: 169 LRKELTSVRSKNETLDSELKCR----------DEEVKKLTNA 200
L + E L LK ++ VKKL
Sbjct: 255 Y---LEIELERAEALSKFLKTDKTFAIEGWVPEDRVKKLKEL 293
>gnl|CDD|224114 COG1193, COG1193, Mismatch repair ATPase (MutS family) [DNA
replication, recombination, and repair].
Length = 753
Score = 29.6 bits (67), Expect = 2.2
Identities = 31/140 (22%), Positives = 50/140 (35%), Gaps = 15/140 (10%)
Query: 65 ELANLKSNNLRLTSALQESTANVEEWKKQLQAYKEENQKLKTKYIELEATKGVSEAAVEL 124
L L + L+E VE+ +++ N KT +EL+A + V A + L
Sbjct: 505 LLEELIEKLEEVRKELEEELEEVEKLLDEVELLTGANSGGKTSLLELKA-QIVVLAHMGL 563
Query: 125 RKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCKAAVELRKELTSVRSKNETLD 184
+ + LK KK A E L+ + V + +
Sbjct: 564 PVPAEEAKVEAVDEVKFLK-----KKKGILDAGAFEST------LKDKKNKVLPEAKK-- 610
Query: 185 SELKCRDEEVKKLTNAKHAV 204
+LK D EV+ +T AV
Sbjct: 611 RKLKLGD-EVEVITGEPGAV 629
>gnl|CDD|220717 pfam10360, DUF2433, Protein of unknown function (DUF2433). This
is a conserved 120 residue region of a family of
proteins found in fungi. The function is not known.
Length = 132
Score = 28.3 bits (63), Expect = 2.3
Identities = 17/76 (22%), Positives = 34/76 (44%), Gaps = 6/76 (7%)
Query: 8 QKFGQWSDIRANTVSLVQAQLDTLLASQDAQRRIID-TLNHQLAGRVADSAN---AKKWE 63
K + I V+ +++TLL ++QR +++ L+ L +V S N K
Sbjct: 19 SKLSRAKAIFGEVWDTVKDEVETLLTPNESQRNLLENALS--LFEKVPTSINGGIPKIGP 76
Query: 64 IELANLKSNNLRLTSA 79
+ L + + +R +A
Sbjct: 77 LSLQSAAAEAMRDETA 92
>gnl|CDD|181619 PRK09039, PRK09039, hypothetical protein; Validated.
Length = 343
Score = 29.2 bits (66), Expect = 2.4
Identities = 19/53 (35%), Positives = 30/53 (56%), Gaps = 4/53 (7%)
Query: 210 LLSQENAELQAAISLVQAQLDTLLA-SQDAQRRIID---TLNHQLAGRVAELA 258
LL+Q+ A L+ ++ ++A LD +++Q +I D LN LA RV EL
Sbjct: 141 LLNQQIAALRRQLAALEAALDASEKRDRESQAKIADLGRRLNVALAQRVQELN 193
>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota). DNA
Topoisomerase I (eukaryota), DNA topoisomerase V,
Vaccina virus topoisomerase, Variola virus
topoisomerase, Shope fibroma virus topoisomeras.
Length = 391
Score = 29.2 bits (66), Expect = 2.5
Identities = 22/109 (20%), Positives = 51/109 (46%), Gaps = 20/109 (18%)
Query: 84 TANVEEWKKQLQAYKEENQKLKTKYIELEATKGVSEAAVELRKELTSVRSKNETLDSELK 143
++E+ +++++A K + ++LK + E +L+++L +SK E + +L
Sbjct: 276 EKSMEKLQEKIKALKYQLKRLKKMILLFEMIS-------DLKRKL---KSKFERDNEKLD 325
Query: 144 CRDEEVKKLTNAKHAVEDKCKAAVELRKELTSVRSKNETLDSELKCRDE 192
+E KK E+K K K++ + + E L+ + ++E
Sbjct: 326 AEVKEKKKEKK----KEEKKK------KQIERLEERIEKLEVQATDKEE 364
>gnl|CDD|179699 PRK03992, PRK03992, proteasome-activating nucleotidase;
Provisional.
Length = 389
Score = 29.0 bits (66), Expect = 2.8
Identities = 13/48 (27%), Positives = 27/48 (56%), Gaps = 1/48 (2%)
Query: 92 KQLQAYKEENQKLKTKYIELEATK-GVSEAAVELRKELTSVRSKNETL 138
++L+A +E N +L+ + +LE + +L +EL ++S+ E L
Sbjct: 1 ERLEALEERNSELEEQIRQLELKLRDLEAENEKLERELERLKSELEKL 48
Score = 28.6 bits (65), Expect = 3.3
Identities = 11/42 (26%), Positives = 23/42 (54%)
Query: 65 ELANLKSNNLRLTSALQESTANVEEWKKQLQAYKEENQKLKT 106
+ L+ +L L++ A E+ +++L+ K E +KLK+
Sbjct: 9 RNSELEEQIRQLELKLRDLEAENEKLERELERLKSELEKLKS 50
>gnl|CDD|220671 pfam10275, Peptidase_C65, Peptidase C65 Otubain. This family of
proteins conserved from plants to humans is a highly
specific ubiquitin iso-peptidase that removes ubiquitin
from proteins. The modification of cellular proteins by
ubiquitin (Ub) is an important event that underlies
protein stability and function in eukaryote being a
dynamic and reversible process. Otubain carries several
key conserved domains: (i) the OTU (ovarian tumour
domain) in which there is an active cysteine protease
triad (ii) a nuclear localisation signal, (iii) a Ub
interaction motif (UIM)-like motif phi-xx-A-xxxs-xx-Ac
(where phi indicates an aromatic amino acid, x indicates
any amino acid and Ac indicates an acidic amino acid),
(iv) a Ub-associated (UBA)-like domain and (v) the LxxLL
motif.
Length = 237
Score = 28.8 bits (65), Expect = 2.8
Identities = 21/131 (16%), Positives = 38/131 (29%), Gaps = 32/131 (24%)
Query: 75 RLTSALQESTANVEEWKKQLQAYKEENQKLKTKYIELEATKG------------------ 116
L + +A +E+ Y ++ + L KY + +G
Sbjct: 7 PLVGEKEPLSALTKEYALADPIYLKKIKDLSKKYSGIRRVRGDGNCFYRAFGFSYLEILL 66
Query: 117 VSEAAVELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCKAAVELRKELTSV 176
+ E+ + + + S E L +E ED +EL L +
Sbjct: 67 RKKDPAEINRFIARIESLKEKLV---ALGFDE--------FTFEDFLDEFIEL---LEKL 112
Query: 177 RSKNETLDSEL 187
T EL
Sbjct: 113 EDGGSTSREEL 123
>gnl|CDD|182809 PRK10884, PRK10884, SH3 domain-containing protein; Provisional.
Length = 206
Score = 28.5 bits (64), Expect = 3.0
Identities = 29/121 (23%), Positives = 49/121 (40%), Gaps = 19/121 (15%)
Query: 4 TKTSQKFGQWSDIRANTVSLVQAQLDTLLASQDAQRRIIDTLNHQLAGRVADSANAKKWE 63
+ + Q D + T + QL T + R + L +Q+ K
Sbjct: 61 VNANTNYAQIRDSKGRTAWIPLKQLST----TPSLRTRVPDLENQV----------KTLT 106
Query: 64 IELANLKSNNLRLTSALQESTANVEEWKKQLQAYKEENQKLKTKYIELEATKGVSEAAVE 123
+L N+ + + T+ +Q+ A + + KEENQKLK + I A K V A ++
Sbjct: 107 DKLNNIDNTWNQRTAEMQQKVAQSD---SVINGLKEENQKLKNQLI--VAQKKVDAANLQ 161
Query: 124 L 124
L
Sbjct: 162 L 162
>gnl|CDD|178768 PLN03229, PLN03229, acetyl-coenzyme A carboxylase carboxyl
transferase subunit alpha; Provisional.
Length = 762
Score = 29.1 bits (65), Expect = 3.3
Identities = 38/164 (23%), Positives = 70/164 (42%), Gaps = 32/164 (19%)
Query: 60 KKWEIELAN-LKSNNLRLTSALQESTANVEEW-----KKQLQAYKEE-NQKL-------- 104
K+ E+ELA LKS L + +++ E+ ++++++ EE N+K+
Sbjct: 611 KEIELELAGVLKSMGLEVIGVTKKNKDTAEQTPPPNLQEKIESLNEEINKKIERVIRSSD 670
Query: 105 ---KTKYIELEATKGVSEAAVELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVED 161
K + ++LE K V ++ K E L+ ++K +K+ A ++ E
Sbjct: 671 LKSKIELLKLEVAKASKTPDVTEKE-------KIEALEQQIK------QKIAEALNSSEL 717
Query: 162 KCKAAVELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVE 205
K K EL EL + R + LK D++ + VE
Sbjct: 718 KEKFE-ELEAELAAARETAAESNGSLKNDDDKEEDSKEDGSRVE 760
>gnl|CDD|217933 pfam04156, IncA, IncA protein. Chlamydia trachomatis is an
obligate intracellular bacterium that develops within a
parasitophorous vacuole termed an inclusion. The
inclusion is non-fusogenic with lysosomes but intercepts
lipids from a host cell exocytic pathway. Initiation of
chlamydial development is concurrent with modification
of the inclusion membrane by a set of C.
trachomatis-encoded proteins collectively designated
Incs. One of these Incs, IncA, is functionally
associated with the homotypic fusion of inclusions. This
family probably includes members of the wider Inc family
rather than just IncA.
Length = 186
Score = 28.2 bits (63), Expect = 3.6
Identities = 18/97 (18%), Positives = 45/97 (46%)
Query: 168 ELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCKLLSQENAELQAAISLVQA 227
EL++EL + + L+SEL+ E+++ L ++E++ + L + EL + ++
Sbjct: 90 ELQEELEDLEERIAELESELEDLKEDLQLLRELLKSLEERLESLEESIKELAKELRELRQ 149
Query: 228 QLDTLLASQDAQRRIIDTLNHQLAGRVAELATIHREI 264
L + + + +L + EL ++ ++
Sbjct: 150 DLREEVEELREELERLQENLQRLQEAIQELQSLLEQL 186
Score = 27.0 bits (60), Expect = 8.2
Identities = 19/97 (19%), Positives = 46/97 (47%), Gaps = 2/97 (2%)
Query: 123 ELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCKAAVELRKELTSVRSKNET 182
EL++EL + + L+SEL+ E+++ L ++E++ ++ E KEL E
Sbjct: 90 ELQEELEDLEERIAELESELEDLKEDLQLLRELLKSLEERLESLEESIKELAKELR--EL 147
Query: 183 LDSELKCRDEEVKKLTNAKHAVEDKCKLLSQENAELQ 219
+ +E ++L + ++ + + + + L+
Sbjct: 148 RQDLREEVEELREELERLQENLQRLQEAIQELQSLLE 184
>gnl|CDD|236076 PRK07710, PRK07710, acetolactate synthase catalytic subunit;
Reviewed.
Length = 571
Score = 28.6 bits (64), Expect = 4.0
Identities = 16/53 (30%), Positives = 23/53 (43%), Gaps = 13/53 (24%)
Query: 81 QESTANVEEWKKQLQA--------YKEENQKLKTKY-IEL--EATKGVSEAAV 122
+ N EW L+ YK ++ +K + IE+ E TKG EA V
Sbjct: 344 EGKKENHHEWLSLLKNWKEKYPLSYKRNSESIKPQKAIEMLYEITKG--EAIV 394
>gnl|CDD|234172 TIGR03345, VI_ClpV1, type VI secretion ATPase, ClpV1 family.
Members of this protein family are homologs of ClpB, an
ATPase associated with chaperone-related functions.
These ClpB homologs, designated ClpV1, are a key
component of the bacterial pathogenicity-associated type
VI secretion system [Protein fate, Protein and peptide
secretion and trafficking, Cellular processes,
Pathogenesis].
Length = 852
Score = 28.4 bits (64), Expect = 4.7
Identities = 30/162 (18%), Positives = 56/162 (34%), Gaps = 32/162 (19%)
Query: 124 LRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCKAAVELRKELTSVRSKNETL 183
L +E +E L EL+ E+ L A+E + + EL + + ++R++ E
Sbjct: 438 LEREAALGADHDERLA-ELR---AELAALEAELAALEARWQQEKELVEAILALRAELE-- 491
Query: 184 DSELKCRDEEVKKLTNAKHAVEDKCKLLSQENAELQAAISLVQAQLD------------- 230
+ A+ + L A Q LV ++D
Sbjct: 492 --ADADAPAD------DDAALRAQLAELEAALASAQGEEPLVFPEVDAQAVAEVVADWTG 543
Query: 231 ----TLLASQ-DAQRRIIDTLNHQLAGRVAELATIHREINTA 267
++ + +A + D L ++ G+ L I I TA
Sbjct: 544 IPVGRMVRDEIEAVLSLPDRLAERVIGQDHALEAIAERIRTA 585
>gnl|CDD|224483 COG1567, COG1567, CRISPR system related protein, RAMP superfamily
[Defense mechanisms].
Length = 313
Score = 28.2 bits (63), Expect = 4.8
Identities = 18/124 (14%), Positives = 40/124 (32%), Gaps = 24/124 (19%)
Query: 60 KKWEIELANLKSNNLRLTSALQESTANVEEWKKQLQAYKEEN-------QKLKTKYIELE 112
++ + + N + LR++S L + K A E + K +I +
Sbjct: 46 REIDDLIENFAKSPLRISSLLPYIGNDYLVPKPLHNALILEGNDEIKKKKLKKISFIPI- 104
Query: 113 ATKGVSEAAVELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCKAAVE-LRK 171
+L S D + + ++ ++ ++ K K A+ R
Sbjct: 105 -------------DQLESFLD--GEADEKYQVKELPKRQTEIGVDSLSKKAKKAIHNGRI 149
Query: 172 ELTS 175
+ S
Sbjct: 150 DADS 153
>gnl|CDD|240419 PTZ00440, PTZ00440, reticulocyte binding protein 2-like protein;
Provisional.
Length = 2722
Score = 28.6 bits (64), Expect = 5.0
Identities = 28/152 (18%), Positives = 64/152 (42%), Gaps = 12/152 (7%)
Query: 54 ADSANAKKWEIELANLKSNNLR--LTSALQESTANVEEWKKQLQAYKEENQKLKTKYIEL 111
+ N + + + NN+ ++++L + T + K ++ YKEE +KL+ ++
Sbjct: 695 KELQNLLSLKENIIKKQLNNIEQDISNSLNQYTIKYNDLKSSIEEYKEEEEKLEVYKHQI 754
Query: 112 EATKGVSEAAVELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCKAAVELRK 171
K +E + L + + T + L+ +D + N ++ + + E +K
Sbjct: 755 INRK--NEFILHLYENDKDLPDGKNTYEEFLQYKDT----ILNKENKISNDINILKENKK 808
Query: 172 ELTSVRSKNETLDSELKC----RDEEVKKLTN 199
+ + L +L+ DEE+K+L
Sbjct: 809 NNQDLLNSYNILIQKLEAHTEKNDEELKQLLQ 840
>gnl|CDD|100796 PRK01156, PRK01156, chromosome segregation protein; Provisional.
Length = 895
Score = 28.3 bits (63), Expect = 5.5
Identities = 34/214 (15%), Positives = 86/214 (40%), Gaps = 27/214 (12%)
Query: 15 DIRANTVSLVQAQLDTLLASQDAQRR-IIDTLNHQLAGRVADSANAKKWEIELANLKSNN 73
D+ N++ + Q ++D+L++ AQR+ I+D + +E+ +L+ N
Sbjct: 127 DVFLNSIFVGQGEMDSLISGDPAQRKKILDEI------------------LEINSLERNY 168
Query: 74 LRLTSALQESTANVEEW---KKQLQAYKEENQKLKTKYIELEATKGVSEAAVE-LRKELT 129
+L + A + +++L++ E + +K + + E + ++ +E L E
Sbjct: 169 DKLKDVIDMLRAEISNIDYLEEKLKSSNLELENIKKQIADDEKSHSITLKEIERLSIEYN 228
Query: 130 SVRSKNETLDSELK---CRDEEVKKLTNAKHAVEDKCKAAVELRKELTSVRSK-NETLDS 185
+ L S L ++ + + E +E + + + ++
Sbjct: 229 NAMDDYNNLKSALNELSSLEDMKNRYESEIKTAESDLSMELEKNNYYKELEERHMKIIND 288
Query: 186 ELKCRDEEVKKLTNAKHAVEDKCKLLSQENAELQ 219
+ + K+ +E+K ++LS +AE+
Sbjct: 289 PVYKNRNYINDYFKYKNDIENKKQILSNIDAEIN 322
>gnl|CDD|221952 pfam13166, AAA_13, AAA domain. This family of domains contain a
P-loop motif that is characteristic of the AAA
superfamily. Many of the proteins in this family are
conjugative transfer proteins. This family includes the
PrrC protein that is thought to be the active component
of the anticodon nuclease.
Length = 713
Score = 28.1 bits (63), Expect = 5.5
Identities = 31/137 (22%), Positives = 59/137 (43%), Gaps = 11/137 (8%)
Query: 123 ELRKELTSVRSKNETLDSELKCRDEEVKKLTNA---KHAVEDKCKAAVELRKELTSVRSK 179
EL+KEL + K E L++E++ ++EE++K N K + K L + L + K
Sbjct: 99 ELKKELKKLEEKIEQLEAEIEKKEEELEKAKNKFLDKAWKKLAKKYDSNLSEALKGLNYK 158
Query: 180 NETLDSELKCRDEEVKKLTNAKH-----AVEDKCKLLSQENAELQAAISLVQAQLDTLLA 234
+ K E + NA ++ K K L N A ++L D +
Sbjct: 159 KNFKE---KLLKELKSVILNASSLLSLEELKAKIKTLFSSNKPELALLTLSVIDFDEIEQ 215
Query: 235 SQDAQRRIIDTLNHQLA 251
++ ++ II + + ++
Sbjct: 216 AEILEKSIIGSSDVPIS 232
Score = 28.1 bits (63), Expect = 6.3
Identities = 38/169 (22%), Positives = 67/169 (39%), Gaps = 26/169 (15%)
Query: 60 KKWEIELANLKSNNLRLTSALQESTANVEEWKKQLQAYKEE----NQKLKTKY------I 109
+ + + L E +VEE K L+A +E QKL+ K I
Sbjct: 295 ESHIEKALEELESILDTEKENSEFKLDVEELKALLEALEEILEKNLQKLEEKLKDPSTSI 354
Query: 110 ELEATKGVSEAAVELRKEL-TSVRSKNE---TLDSEL-KCRD--------EEVKKLTNAK 156
ELE+ + E+ ++ + +R NE L E K + E + + +
Sbjct: 355 ELESITDLIESINDIIDAINELIREHNEKIDNLKKEKNKAKKKLWLHLVAELKEDIDAYQ 414
Query: 157 HAVEDKCKAAVELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVE 205
+ KA L KE+ + ++ + L+ E+K E K+LTN + +
Sbjct: 415 KEKKGLEKAINSLEKEIKQLEAEIKALEKEIK---ELEKQLTNIEPTAD 460
>gnl|CDD|235399 PRK05303, flgI, flagellar basal body P-ring protein; Provisional.
Length = 367
Score = 27.8 bits (63), Expect = 5.6
Identities = 7/24 (29%), Positives = 12/24 (50%)
Query: 245 TLNHQLAGRVAELATIHREINTAL 268
T+N GR+ A + RE+ +
Sbjct: 160 TVNVPTVGRIPNGAIVEREVPSDF 183
>gnl|CDD|233513 TIGR01657, P-ATPase-V, P-type ATPase of unknown pump specificity
(type V). These P-type ATPases form a distinct clade
but the substrate of their pumping activity has yet to
be determined. This clade has been designated type V in.
Length = 1054
Score = 28.1 bits (63), Expect = 5.9
Identities = 17/77 (22%), Positives = 31/77 (40%), Gaps = 3/77 (3%)
Query: 89 EWKKQLQAYKEENQKLKTKYIELEATKGVSEAAVEL--RKELTSVRSKNETLDSELKCRD 146
EWK +L+ N+ +T I S+ VEL + ++++N E +
Sbjct: 39 EWKVKLRYVPVSNEDAETVVIVDPTPNSGSDYIVELSNKSLSNDLQTENAVEGGE-EPIY 97
Query: 147 EEVKKLTNAKHAVEDKC 163
+ +K + H E K
Sbjct: 98 FDFRKQRFSYHEKELKI 114
>gnl|CDD|219176 pfam06785, UPF0242, Uncharacterized protein family (UPF0242).
Length = 401
Score = 27.9 bits (62), Expect = 6.7
Identities = 24/103 (23%), Positives = 51/103 (49%), Gaps = 1/103 (0%)
Query: 148 EVKKLTNAKHAVEDKCKAAVELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDK 207
+++KLT + + ++ E +KE ++ +N+ L ++L K +E+
Sbjct: 76 KIRKLTEQDEGLRNIRESIEERQKESDQLKLQNQKLVNQLGHARGVFIKTKGRYQKLEEL 135
Query: 208 CKLLSQENAELQAAI-SLVQAQLDTLLASQDAQRRIIDTLNHQ 249
+ L +EN LQ + +LVQ + + +Q+ R + +TL +Q
Sbjct: 136 SRHLKEENQCLQIQLDALVQECNEKIEENQELNRELAETLAYQ 178
>gnl|CDD|187543 cd05232, UDP_G4E_4_SDR_e, UDP-glucose 4 epimerase, subgroup 4,
extended (e) SDRs. UDP-glucose 4 epimerase (aka
UDP-galactose-4-epimerase), is a homodimeric extended
SDR. It catalyzes the NAD-dependent conversion of
UDP-galactose to UDP-glucose, the final step in Leloir
galactose synthesis. This subgroup is comprised of
bacterial proteins, and includes the Staphylococcus
aureus capsular polysaccharide Cap5N, which may have a
role in the synthesis of UDP-N-acetyl-d-fucosamine. This
subgroup has the characteristic active site tetrad and
NAD-binding motif of the extended SDRs. Extended SDRs
are distinct from classical SDRs. In addition to the
Rossmann fold (alpha/beta folding pattern with a central
beta-sheet) core region typical of all SDRs, extended
SDRs have a less conserved C-terminal extension of
approximately 100 amino acids. Extended SDRs are a
diverse collection of proteins, and include isomerases,
epimerases, oxidoreductases, and lyases; they typically
have a TGXXGXXG cofactor binding motif. SDRs are a
functionally diverse family of oxidoreductases that have
a single domain with a structurally conserved Rossmann
fold, an NAD(P)(H)-binding region, and a structurally
diverse C-terminal region. Sequence identity between
different SDR enzymes is typically in the 15-30% range;
they catalyze a wide range of activities including the
metabolism of steroids, cofactors, carbohydrates,
lipids, aromatic compounds, and amino acids, and act in
redox sensing. Classical SDRs have an TGXXX[AG]XG
cofactor binding motif and a YXXXK active site motif,
with the Tyr residue of the active site motif serving as
a critical catalytic residue (Tyr-151, human
15-hydroxyprostaglandin dehydrogenase numbering). In
addition to the Tyr and Lys, there is often an upstream
Ser and/or an Asn, contributing to the active site;
while substrate binding is in the C-terminal region,
which determines specificity. The standard reaction
mechanism is a 4-pro-S hydride transfer and proton relay
involving the conserved Tyr and Lys, a water molecule
stabilized by Asn, and nicotinamide. Atypical SDRs
generally lack the catalytic residues characteristic of
the SDRs, and their glycine-rich NAD(P)-binding motif is
often different from the forms normally seen in
classical or extended SDRs. Complex (multidomain) SDRs
such as ketoreductase domains of fatty acid synthase
have a GGXGXXG NAD(P)-binding motif and an altered
active site motif (YXXXN). Fungal type ketoacyl
reductases have a TGXXXGX(1-2)G NAD(P)-binding motif.
Length = 303
Score = 27.7 bits (62), Expect = 6.8
Identities = 14/74 (18%), Positives = 29/74 (39%), Gaps = 13/74 (17%)
Query: 208 CKLLSQENAELQAAI--------SLVQAQLDTLLASQDAQRRIID-----TLNHQLAGRV 254
L E++ A+ S+V A+L + + D + H + +
Sbjct: 16 VDKLLSRGEEVRIAVRNAENAEPSVVLAELPDIDSFTDLFLGVDAVVHLAARVHVMNDQG 75
Query: 255 AELATIHREINTAL 268
A+ + +R++NT L
Sbjct: 76 ADPLSDYRKVNTEL 89
>gnl|CDD|237746 PRK14529, PRK14529, adenylate kinase; Provisional.
Length = 223
Score = 27.4 bits (61), Expect = 6.9
Identities = 12/32 (37%), Positives = 17/32 (53%)
Query: 97 YKEENQKLKTKYIELEATKGVSEAAVELRKEL 128
+K+ K TKYIEL+ + E L K+L
Sbjct: 191 FKDLAAKGSTKYIELDGEGSIDEIKETLLKQL 222
>gnl|CDD|224495 COG1579, COG1579, Zn-ribbon protein, possibly nucleic acid-binding
[General function prediction only].
Length = 239
Score = 27.3 bits (61), Expect = 7.0
Identities = 29/108 (26%), Positives = 50/108 (46%), Gaps = 4/108 (3%)
Query: 35 QDAQRRIIDTLNHQLAGRVADSANAKKWEIELANLKSNNLRLTSALQESTANVEEWKKQL 94
Q+ + RI A + A EI++A + N+L L E +E+ +K++
Sbjct: 69 QEIRERIKRAEEKLSAVKDERELRALNIEIQIAKERINSLE--DELAELMEEIEKLEKEI 126
Query: 95 QAYKEENQKLKTKYIELEATKGVSEAAVELRKELTSVRSKNETLDSEL 142
+ KE ++L+ E EA + E E+R+E + SK E L +L
Sbjct: 127 EDLKERLERLEKNLAEAEAR--LEEEVAEIREEGQELSSKREELKEKL 172
>gnl|CDD|223020 PHA03246, PHA03246, large tegument protein UL36; Provisional.
Length = 3095
Score = 28.0 bits (62), Expect = 7.0
Identities = 14/49 (28%), Positives = 24/49 (48%)
Query: 147 EEVKKLTNAKHAVEDKCKAAVELRKELTSVRSKNETLDSELKCRDEEVK 195
+ KLT + + ++R+++ RS+ ETL + L R EVK
Sbjct: 1032 NSIVKLTKGLDGAKLASSLSSDIREKIRQKRSETETLIARLSARYAEVK 1080
>gnl|CDD|236892 PRK11281, PRK11281, hypothetical protein; Provisional.
Length = 1113
Score = 28.0 bits (63), Expect = 7.1
Identities = 19/60 (31%), Positives = 25/60 (41%), Gaps = 4/60 (6%)
Query: 207 KCKLLSQENAELQAAISLVQAQLDTLLASQDAQRRIIDTLNHQLAGRVAELATIHREINT 266
K KLL E+ +Q L Q LL D Q+ + L QLA A+L E+
Sbjct: 50 KQKLLEAEDKLVQQ--DLEQTLA--LLDKIDRQKEETEQLKQQLAQAPAKLRQAQAELEA 105
>gnl|CDD|219186 pfam06810, Phage_GP20, Phage minor structural protein GP20. This
family consists of several phage minor structural
protein GP20 sequences of around 180 residues in length.
The function of this family is unknown.
Length = 156
Score = 26.9 bits (60), Expect = 7.9
Identities = 15/70 (21%), Positives = 34/70 (48%), Gaps = 7/70 (10%)
Query: 118 SEAAVELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCKAAVELRKELTSVR 177
+ + +L + ++L +LK RD++++ L + K K EL+K++ ++
Sbjct: 12 GKDIQNPKSKLDKANEERDSLKKQLKNRDKQIEDL-------KKKVKDNEELQKKIEKLK 64
Query: 178 SKNETLDSEL 187
+N+T E
Sbjct: 65 QQNKTAKEEY 74
>gnl|CDD|218806 pfam05911, DUF869, Plant protein of unknown function (DUF869).
This family consists of a number of sequences found in
Arabidopsis thaliana, Oryza sativa and Lycopersicon
esculentum (Tomato). The function of this family is
unknown.
Length = 767
Score = 27.6 bits (61), Expect = 8.0
Identities = 31/152 (20%), Positives = 65/152 (42%), Gaps = 11/152 (7%)
Query: 75 RLTSALQESTANVEEWKKQLQAYKEENQKLKTKYIELEATKGVSEAAVELRKELTSVRSK 134
L+ + E +L K++ ++ K K E E + EL+ EL + +
Sbjct: 591 STLEELETLKSEKENLDGELSKCKDDLEESKNKLQETE------KKLEELKSELDASQES 644
Query: 135 NETLDSELKCRDEEVKKLTNAKHAVEDKCKAAVE----LRKELTSVRSKNETLDSELKCR 190
N +++LKC E + L +E + K+ E L EL R ++ L ++ +
Sbjct: 645 NSLAETQLKCMVESYESLELRAKDLEAEHKSLQEKISSLEDELEKERQNHQELIAKCREL 704
Query: 191 DEEVKKLTNAK-HAVEDKCKLLSQENAELQAA 221
+E++++ + D+ + ++ E+ AA
Sbjct: 705 EEKIERAEQEENMQKLDEEEQKIKQEKEIAAA 736
>gnl|CDD|223294 COG0216, PrfA, Protein chain release factor A [Translation,
ribosomal structure and biogenesis].
Length = 363
Score = 27.6 bits (62), Expect = 8.1
Identities = 23/100 (23%), Positives = 44/100 (44%), Gaps = 4/100 (4%)
Query: 141 ELKCRDEEVKKLTNAKHAVEDKCKAAVELRKELTSVRSKNETLDSELKCRD--EEVKKLT 198
L R EE++ L + + D +L KE + E K ++ E+ K++
Sbjct: 11 SLLERYEELEALLSDPEVISDP-DEYRKLSKEYAELEPIVEKYREYKKAQEDLEDAKEML 69
Query: 199 NAKHAVEDKCKLLSQENAELQAAISLVQAQLDTLLASQDA 238
+ E + ++ +E EL+A I ++ +L LL +D
Sbjct: 70 AEEKDPEMR-EMAEEEIKELEAKIEELEEELKILLLPKDP 108
>gnl|CDD|227519 COG5192, BMS1, GTP-binding protein required for 40S ribosome
biogenesis [Translation, ribosomal structure and
biogenesis].
Length = 1077
Score = 27.8 bits (61), Expect = 8.5
Identities = 34/154 (22%), Positives = 61/154 (39%), Gaps = 9/154 (5%)
Query: 59 AKKWEIELANLKSNNLRLTSALQESTANVEEWKKQLQAYKEEN--QKLKTKYIELEATKG 116
KKW LA LKS A +S EE + + E+ + + E+E ++G
Sbjct: 567 KKKWS-SLAQLKSR--FQKDATLDSIEGEEELIQDDEKGNFEDLEDEENSSDNEMEESRG 623
Query: 117 VSEAA--VELRKELT--SVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCKAAVELRKE 172
S A E E+ + R +N EL+ E ++ K V+ + ++ ++
Sbjct: 624 SSVTAENEESADEVDYETEREENARKKEELRGNFELEERGDPEKKDVDWYTEEKRKIEEQ 683
Query: 173 LTSVRSKNETLDSELKCRDEEVKKLTNAKHAVED 206
L RS+ ET+ E + E + + +
Sbjct: 684 LKINRSEFETMVPESRVVIEGYRAGRYVRIVLSH 717
>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division
and chromosome partitioning].
Length = 420
Score = 27.4 bits (61), Expect = 8.5
Identities = 15/65 (23%), Positives = 27/65 (41%)
Query: 200 AKHAVEDKCKLLSQENAELQAAISLVQAQLDTLLASQDAQRRIIDTLNHQLAGRVAELAT 259
+ A + + K + +E A L+ I Q Q L + I +L QL +L
Sbjct: 32 SAAADDKQLKQIQKEIAALEKKIREQQDQRAKLEKQLKSLETEIASLEAQLIETADDLKK 91
Query: 260 IHREI 264
+ ++I
Sbjct: 92 LRKQI 96
>gnl|CDD|217511 pfam03358, FMN_red, NADPH-dependent FMN reductase.
Length = 151
Score = 26.8 bits (60), Expect = 8.8
Identities = 12/54 (22%), Positives = 22/54 (40%), Gaps = 3/54 (5%)
Query: 46 NHQLAGRVADSANAKKWEIELANLKSNNLRLTSA---LQESTANVEEWKKQLQA 96
+LA A+ E+EL +L L L +V+E ++++ A
Sbjct: 16 TRKLAEWAAELLEEAGAEVELIDLADLPLPLYDEDLEACGDPDDVQELREKIAA 69
>gnl|CDD|216884 pfam02119, FlgI, Flagellar P-ring protein.
Length = 342
Score = 27.5 bits (62), Expect = 8.8
Identities = 7/26 (26%), Positives = 11/26 (42%)
Query: 245 TLNHQLAGRVAELATIHREINTALQT 270
T+N GR+ A + RE+
Sbjct: 135 TVNVPTVGRIPNGAIVEREVPFDFGQ 160
>gnl|CDD|223092 COG0013, AlaS, Alanyl-tRNA synthetase [Translation, ribosomal
structure and biogenesis].
Length = 879
Score = 27.6 bits (62), Expect = 10.0
Identities = 24/100 (24%), Positives = 38/100 (38%), Gaps = 5/100 (5%)
Query: 85 ANVEEWKKQLQAYKEENQKLKTKYIELEATKGVSEAAVELRKELTSVRSKN---ETLDSE 141
A +E ++Q KE LK L + + E EL KEL ++ K E D
Sbjct: 705 AALEYLQEQEDLLKEIASLLKVPPELLPKVERLLEELKELEKELERLKKKLAAAELADLL 764
Query: 142 LKCRDEEVKKLTNAKHAVEDKCKAAVELRKELTSVRSKNE 181
+ EE+ + V+ ELR+ ++ K
Sbjct: 765 SNAKAEEIGGVKVLAKEVDG--ADMKELREIADDLKKKLG 802
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.309 0.121 0.317
Gapped
Lambda K H
0.267 0.0722 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 12,471,420
Number of extensions: 1155802
Number of successful extensions: 2479
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2034
Number of HSP's successfully gapped: 514
Length of query: 270
Length of database: 10,937,602
Length adjustment: 95
Effective length of query: 175
Effective length of database: 6,723,972
Effective search space: 1176695100
Effective search space used: 1176695100
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 58 (26.1 bits)