Query psy10209
Match_columns 106
No_of_seqs 58 out of 60
Neff 2.8
Searched_HMMs 46136
Date Fri Aug 16 20:37:40 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy10209.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/10209hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF05044 HPD: Homeo-prospero d 100.0 1.7E-63 3.6E-68 376.0 6.3 101 1-106 58-158 (158)
2 KOG3779|consensus 100.0 6.4E-54 1.4E-58 370.9 7.8 101 1-106 635-735 (737)
3 PF04891 NifQ: NifQ; InterPro 75.0 4.9 0.00011 30.8 3.9 46 56-103 113-158 (167)
4 PHA00455 hypothetical protein 70.1 1.3 2.7E-05 31.5 -0.3 11 31-41 1-11 (85)
5 PF01342 SAND: SAND domain; I 53.5 10 0.00022 25.7 1.8 22 59-82 45-66 (82)
6 PF04542 Sigma70_r2: Sigma-70 42.4 69 0.0015 18.6 5.3 36 51-86 22-58 (71)
7 KOG3059|consensus 40.5 16 0.00036 30.7 1.5 31 70-105 5-35 (292)
8 PF13911 AhpC-TSA_2: AhpC/TSA 37.7 89 0.0019 20.6 4.4 26 16-41 37-62 (115)
9 PF09580 Spore_YhcN_YlaJ: Spor 35.8 79 0.0017 22.7 4.2 36 2-37 119-154 (177)
10 PF11892 DUF3412: Domain of un 35.4 15 0.00033 27.5 0.5 38 22-73 33-70 (123)
11 COG1656 Uncharacterized conser 34.2 53 0.0012 25.6 3.3 27 16-42 47-77 (165)
12 COG5079 SAC3 Nuclear protein e 33.1 64 0.0014 29.9 4.0 49 34-91 204-253 (646)
13 COG3963 Phospholipid N-methylt 31.9 1.1E+02 0.0024 24.6 4.8 38 5-42 63-100 (194)
14 PRK12322 NADH dehydrogenase su 31.3 20 0.00042 30.1 0.5 97 10-106 79-200 (366)
15 PF13089 PP_kinase_N: Polyphos 31.1 47 0.001 23.2 2.3 16 50-65 31-46 (109)
16 PF12870 Lumazine_bd: Lumazine 30.7 45 0.00098 20.8 2.0 16 56-71 9-24 (111)
17 TIGR02736 cbb3_Q_epsi cytochro 29.3 26 0.00057 23.2 0.8 15 2-16 34-48 (56)
18 PF10173 Mit_KHE1: Mitochondri 28.7 33 0.00071 26.3 1.3 23 72-94 54-76 (187)
19 PF04097 Nic96: Nup93/Nic96; 27.8 69 0.0015 28.3 3.3 37 61-98 173-212 (613)
20 KOG2154|consensus 25.9 76 0.0016 28.8 3.2 36 45-80 45-80 (505)
21 PF11830 DUF3350: Domain of un 25.3 54 0.0012 21.7 1.7 13 76-88 34-46 (56)
22 PF07417 Crl: Transcriptional 25.1 78 0.0017 23.8 2.7 32 35-68 74-105 (125)
23 COG0649 NuoD NADH:ubiquinone o 24.9 86 0.0019 27.6 3.3 82 24-105 109-207 (398)
24 PRK10984 DNA-binding transcrip 24.3 1.1E+02 0.0024 23.0 3.4 34 34-69 75-109 (127)
25 PF01832 Glucosaminidase: Mann 23.5 95 0.0021 21.1 2.7 33 57-89 102-135 (136)
26 PF13376 OmdA: Bacteriocin-pro 22.0 55 0.0012 20.4 1.2 23 47-69 5-29 (63)
27 TIGR00651 pta phosphate acetyl 21.1 2E+02 0.0043 23.5 4.6 52 3-57 13-65 (303)
28 smart00222 Sec7 Sec7 domain. D 20.9 1.1E+02 0.0024 22.8 2.9 31 57-87 151-184 (187)
29 PF10296 DUF2404: Putative int 20.7 79 0.0017 20.9 1.8 29 73-105 13-41 (91)
30 smart00258 SAND SAND domain. 20.5 51 0.0011 22.5 0.9 22 59-82 36-57 (73)
No 1
>PF05044 HPD: Homeo-prospero domain; InterPro: IPR007738 The homeobox gene Prox1 is expressed in a subpopulation of endothelial cells that, after budding from veins, gives rise to the mammalian lymphatic system []. Prox1 has been found to be an early specific marker for the developing liver and pancreas in the mammalian foregut endoderm []. This family contains an atypical homeobox domain.; GO: 0003677 DNA binding, 0006355 regulation of transcription, DNA-dependent, 0007275 multicellular organismal development, 0005634 nucleus; PDB: 2LMD_A 1XPX_A 1MIJ_A.
Probab=100.00 E-value=1.7e-63 Score=375.97 Aligned_cols=101 Identities=68% Similarity=1.154 Sum_probs=89.8
Q ss_pred ChHHHHHHHHhcCCCCCceeeccchhHHHHHHhhhccCCCccccCCCCCccHHHHHHHHHHHHHHHHhcCCCCCcchhHH
Q psy10209 1 MEKYARQAVSEGVLHAEDLHVSGECEIYRVLNLHYNRNNHIEVCGPQVPTNFRFVIEQTLREFFKAIQDGKDQEPSWKKS 80 (106)
Q Consensus 1 MEK~ARQAi~eGv~~~~~l~V~rdsELfr~LN~HYNk~Ndfe~~~~~vP~~Fl~v~~~tlrEFf~ai~~gkD~~psWKK~ 80 (106)
||||||||++|||+++++|+|+|||||||+|||||||+|||| ||++|++|+++||||||+||++|||+||||||+
T Consensus 58 MEK~ARqa~~eGv~~~~~l~V~rdsELfr~LN~HYNk~N~~e-----vP~~Fl~v~~~tLrEFf~AI~~gkD~dpsWKK~ 132 (158)
T PF05044_consen 58 MEKFARQAVSEGVKNADDLRVTRDSELFRVLNMHYNKNNDFE-----VPDRFLEVVQITLREFFNAIQAGKDSDPSWKKP 132 (158)
T ss_dssp HHHHHHHHHHHT-S-GGGSSSS-TTCHHHHHHHHHHTT-STT-------HHHHHHHHHHHHHHHHHHHCTCCCSTCCCHH
T ss_pred HHHHHHHHHHccCCcHHHeeeccchHHHHHHHhhcccCCCcc-----CchhHHHHHHHHHHHHHHHHHcCCCCCccccHH
Confidence 899999999999999999999999999999999999999999 999999999999999999999999999999999
Q ss_pred HHHHHHhhCCCCCccccCCCchhccC
Q psy10209 81 IYKIISRMDDPVPEYFKSPNFLEQLE 106 (106)
Q Consensus 81 IYKiI~kLD~~vPe~fks~~~l~~le 106 (106)
|||||||||++|||+|||||||++||
T Consensus 133 IYKvI~kLD~~vPe~FK~pn~l~~Le 158 (158)
T PF05044_consen 133 IYKVISKLDDPVPEFFKSPNFLEELE 158 (158)
T ss_dssp HHHHHHCC-SS--GGCC-HHHCGGG-
T ss_pred HHHHHHHccCcccHhhcCCchhhhcC
Confidence 99999999999999999999999998
No 2
>KOG3779|consensus
Probab=100.00 E-value=6.4e-54 Score=370.90 Aligned_cols=101 Identities=57% Similarity=1.011 Sum_probs=100.3
Q ss_pred ChHHHHHHHHhcCCCCCceeeccchhHHHHHHhhhccCCCccccCCCCCccHHHHHHHHHHHHHHHHhcCCCCCcchhHH
Q psy10209 1 MEKYARQAVSEGVLHAEDLHVSGECEIYRVLNLHYNRNNHIEVCGPQVPTNFRFVIEQTLREFFKAIQDGKDQEPSWKKS 80 (106)
Q Consensus 1 MEK~ARQAi~eGv~~~~~l~V~rdsELfr~LN~HYNk~Ndfe~~~~~vP~~Fl~v~~~tlrEFf~ai~~gkD~~psWKK~ 80 (106)
|||||||||+||||+.++|.|+|||||||+|||||||+|||| ||++|++|+|+||||||+||.+|||.||||||+
T Consensus 635 MEK~ARQAi~dGvT~~~ei~itrD~EL~r~LN~HYNk~N~~~-----~PeR~~~V~Q~TLREF~~AI~~GKDvdPSWKK~ 709 (737)
T KOG3779|consen 635 MEKYARQAINDGVTSTEELSITRDCELYRALNMHYNKANDFE-----VPERFLEVAQITLREFFNAIIAGKDVDPSWKKA 709 (737)
T ss_pred HHHHHHHHHhcccCchhheeeeccHHHHHHHHhhhccccccc-----ccHHHHHHHHHHHHHHHHHHhcCCcCCchHHHH
Confidence 899999999999999999999999999999999999999999 999999999999999999999999999999999
Q ss_pred HHHHHHhhCCCCCccccCCCchhccC
Q psy10209 81 IYKIISRMDDPVPEYFKSPNFLEQLE 106 (106)
Q Consensus 81 IYKiI~kLD~~vPe~fks~~~l~~le 106 (106)
|||||||||++|||+||+||||++||
T Consensus 710 IYKVI~kLd~evPe~FK~PN~Lq~L~ 735 (737)
T KOG3779|consen 710 IYKVICKLDSEVPEIFKSPNCLQELL 735 (737)
T ss_pred HHHHHHhhhhhhhHhhcCccHHHHhh
Confidence 99999999999999999999999996
No 3
>PF04891 NifQ: NifQ; InterPro: IPR006975 NifQ is involved in early stages of the biosynthesis of the iron-molybdenum cofactor (FeMo-co) [], which is an integral part of the active site of dinitrogenase []. The conserved C-terminal cysteine residues may be involved in metal binding [].; GO: 0030151 molybdenum ion binding, 0009399 nitrogen fixation
Probab=75.05 E-value=4.9 Score=30.81 Aligned_cols=46 Identities=17% Similarity=0.403 Sum_probs=36.6
Q ss_pred HHHHHHHHHHHHhcCCCCCcchhHHHHHHHHhhCCCCCccccCCCchh
Q psy10209 56 IEQTLREFFKAIQDGKDQEPSWKKSIYKIISRMDDPVPEYFKSPNFLE 103 (106)
Q Consensus 56 ~~~tlrEFf~ai~~gkD~~psWKK~IYKiI~kLD~~vPe~fks~~~l~ 103 (106)
...=++..|-++.+....+--|||=.||-+|.-+.-++ =++|+|-+
T Consensus 113 Ls~Lm~r~Fp~Laa~N~~~MrWKKFfYrqlCe~eG~~~--C~aPsC~~ 158 (167)
T PF04891_consen 113 LSALMRRHFPPLAARNTRNMRWKKFFYRQLCEREGLYL--CRAPSCEE 158 (167)
T ss_pred HHHHHHHHhHHHHHhccCCCcHHHHHHHHHHHHcCCCc--CCCCCCCC
Confidence 34457889999999999999999999999997666554 36777643
No 4
>PHA00455 hypothetical protein
Probab=70.09 E-value=1.3 Score=31.53 Aligned_cols=11 Identities=45% Similarity=0.969 Sum_probs=8.9
Q ss_pred HHhhhccCCCc
Q psy10209 31 LNLHYNRNNHI 41 (106)
Q Consensus 31 LN~HYNk~Ndf 41 (106)
+++||||+|-+
T Consensus 1 ~rLhfNksng~ 11 (85)
T PHA00455 1 MRLHFNKSNGI 11 (85)
T ss_pred CceeEeccCCe
Confidence 57999998864
No 5
>PF01342 SAND: SAND domain; InterPro: IPR000770 The SAND domain (named after Sp100, AIRE-1, NucP41/75, DEAF-1) is a conserved ~80 residue region found in a number of nuclear proteins, many of which function in chromatin-dependent transcriptional control. These include proteins linked to various human diseases, such as the Sp100 (Speckled protein 100 kDa), NUDR (Nuclear DEAF-1 related), GMEB (Glucocorticoid Modulatory Element Binding) proteins and AIRE-1 (Autoimmune regulator 1) proteins. Proteins containing the SAND domain have a modular structure; the SAND domain can be associated with a number of other modules, including the bromodomain, the PHD finger and the MYND finger. Because no SAND domain has been found in yeast, it is thought that the SAND domain could be restricted to animal phyla. Many SAND domain-containing proteins, including NUDR, DEAF-1 (Deformed epidermal autoregulatory factor-1) and GMEB, have been shown to bind DNA sequences specifically. The SAND domain has been proposed to mediate the DNA binding activity of these proteins [, ]. The resolution of the 3D structure of the SAND domain from Sp100b has revealed that it consists of a novel alpha/beta fold. The SAND domain adopts a compact fold consisting of a strongly twisted, five-stranded antiparallel beta-sheet with four alpha-helices packing against one side of the beta-sheet. The opposite side of the beta-sheet is solvent exposed. The beta-sheet and alpha-helical parts of the structure form two distinct regions. Multiple hydrophobic residues pack between these regions to form a structural core. A conserved KDWK sequence motif is found within the alpha-helical, positively charged surface patch. The DNA binding surface has been mapped to the alpha-helical region encompassing the KDWK motif [].; GO: 0003677 DNA binding, 0005634 nucleus; PDB: 1OQJ_B 1UFN_A 1H5P_A.
Probab=53.54 E-value=10 Score=25.71 Aligned_cols=22 Identities=41% Similarity=0.670 Sum_probs=15.5
Q ss_pred HHHHHHHHHhcCCCCCcchhHHHH
Q psy10209 59 TLREFFKAIQDGKDQEPSWKKSIY 82 (106)
Q Consensus 59 tlrEFf~ai~~gkD~~psWKK~IY 82 (106)
|..|| -+.+|+.+...||.+|+
T Consensus 45 TP~eF--E~~~G~~~sK~WK~SIr 66 (82)
T PF01342_consen 45 TPSEF--ERHGGKGSSKDWKRSIR 66 (82)
T ss_dssp -HHHH--HHHHTTCTCS-HHHHSE
T ss_pred CHHHH--HhhcCcccCCCCCccEE
Confidence 56666 34579999999999984
No 6
>PF04542 Sigma70_r2: Sigma-70 region 2 ; InterPro: IPR007627 The bacterial core RNA polymerase complex, which consists of five subunits, is sufficient for transcription elongation and termination but is unable to initiate transcription. Transcription initiation from promoter elements requires a sixth, dissociable subunit called a sigma factor, which reversibly associates with the core RNA polymerase complex to form a holoenzyme []. RNA polymerase recruits alternative sigma factors as a means of switching on specific regulons. Most bacteria express a multiplicity of sigma factors. Two of these factors, sigma-70 (gene rpoD), generally known as the major or primary sigma factor, and sigma-54 (gene rpoN or ntrA) direct the transcription of a wide variety of genes. The other sigma factors, known as alternative sigma factors, are required for the transcription of specific subsets of genes. With regard to sequence similarity, sigma factors can be grouped into two classes, the sigma-54 and sigma-70 families. Sequence alignments of the sigma70 family members reveal four conserved regions that can be further divided into subregions eg. sub-region 2.2, which may be involved in the binding of the sigma factor to the core RNA polymerase; and sub-region 4.2, which seems to harbor a DNA-binding 'helix-turn-helix' motif involved in binding the conserved -35 region of promoters recognised by the major sigma factors [, ]. Region 2 of sigma-70 is the most conserved region of the entire protein. All members of this class of sigma-factor contain region 2. The high conservation is due to region 2 containing both the -10 promoter recognition helix and the primary core RNA polymerase binding determinant. The core-binding helix, interacts with the clamp domain of the largest polymerase subunit, beta prime [, ]. The aromatic residues of the recognition helix, found at the C terminus of this domain are thought to mediate strand separation, thereby allowing transcription initiation [, ].; GO: 0003677 DNA binding, 0003700 sequence-specific DNA binding transcription factor activity, 0016987 sigma factor activity, 0006352 transcription initiation, DNA-dependent, 0006355 regulation of transcription, DNA-dependent; PDB: 1OR7_B 1H3L_B 2Z2S_C 2Q1Z_C 2O7G_B 1SMY_F 1IW7_P 2BE5_F 2A6E_F 2CW0_F ....
Probab=42.36 E-value=69 Score=18.58 Aligned_cols=36 Identities=11% Similarity=0.333 Sum_probs=27.2
Q ss_pred cHHHHHHHHHHHHHHHHhcCCC-CCcchhHHHHHHHH
Q psy10209 51 NFRFVIEQTLREFFKAIQDGKD-QEPSWKKSIYKIIS 86 (106)
Q Consensus 51 ~Fl~v~~~tlrEFf~ai~~gkD-~~psWKK~IYKiI~ 86 (106)
..-+++|.++-..++++..... .+.+.+.=+++++.
T Consensus 22 ~~eD~~qe~~~~l~~~~~~~~~~~~~~~~~~l~~~~~ 58 (71)
T PF04542_consen 22 DAEDLVQEAFIKLWRAIDSYDPDRGDSFRAWLFRIAR 58 (71)
T ss_dssp HHHHHHHHHHHHHHHHHHHTSTTSSSHHHHHHHHHHH
T ss_pred hHHHHhhHHHHHHHhhhhcccccccCCHHHHHHHHHH
Confidence 3459999999999999998873 44367776776665
No 7
>KOG3059|consensus
Probab=40.53 E-value=16 Score=30.72 Aligned_cols=31 Identities=29% Similarity=0.616 Sum_probs=27.1
Q ss_pred CCCCCcchhHHHHHHHHhhCCCCCccccCCCchhcc
Q psy10209 70 GKDQEPSWKKSIYKIISRMDDPVPEYFKSPNFLEQL 105 (106)
Q Consensus 70 gkD~~psWKK~IYKiI~kLD~~vPe~fks~~~l~~l 105 (106)
+-..++-|||-+|- .++-|+-+.++.|+++|
T Consensus 5 ~~~~~~pWkKvLy~-----kQ~YPDNYtDesFle~l 35 (292)
T KOG3059|consen 5 DMMTRMPWKKVLYL-----KQEYPDNYTDESFLEEL 35 (292)
T ss_pred cccCCCchHHHHhh-----cCCCCCcccCHHHHHHH
Confidence 45668899999996 89999999999999876
No 8
>PF13911 AhpC-TSA_2: AhpC/TSA antioxidant enzyme
Probab=37.73 E-value=89 Score=20.62 Aligned_cols=26 Identities=23% Similarity=0.271 Sum_probs=20.7
Q ss_pred CCceeeccchhHHHHHHhhhccCCCc
Q psy10209 16 AEDLHVSGECEIYRVLNLHYNRNNHI 41 (106)
Q Consensus 16 ~~~l~V~rdsELfr~LN~HYNk~Ndf 41 (106)
+-+|-++++.+||++|.|.......+
T Consensus 37 p~~ly~D~~~~lY~~lg~~~~~~~~~ 62 (115)
T PF13911_consen 37 PFPLYVDPERKLYKALGLKRGLKWSL 62 (115)
T ss_pred CCcEEEeCcHHHHHHhCCccccccCC
Confidence 44599999999999999986555444
No 9
>PF09580 Spore_YhcN_YlaJ: Sporulation lipoprotein YhcN/YlaJ (Spore_YhcN_YlaJ); InterPro: IPR019076 This entry contains YhcN and YlaJ, which are predicted lipoproteins that have been detected as spore proteins but not vegetative proteins in Bacillus subtilis. Both appear to be expressed under control of the RNA polymerase sigma-G factor. The YlaJ-like members of this family have a low-complexity, strongly acidic, 40-residue C-terminal domain.
Probab=35.78 E-value=79 Score=22.69 Aligned_cols=36 Identities=22% Similarity=0.212 Sum_probs=27.0
Q ss_pred hHHHHHHHHhcCCCCCceeeccchhHHHHHHhhhcc
Q psy10209 2 EKYARQAVSEGVLHAEDLHVSGECEIYRVLNLHYNR 37 (106)
Q Consensus 2 EK~ARQAi~eGv~~~~~l~V~rdsELfr~LN~HYNk 37 (106)
++=+++++.+-...-..+.||-|..+|.-|.-.-++
T Consensus 119 ~~~V~~~v~~~~p~~~~V~Vs~D~~~~~ri~~~~~~ 154 (177)
T PF09580_consen 119 KKKVEKAVKSADPRIYNVYVSTDPDIFDRIRNLANR 154 (177)
T ss_pred HHHHHHHHHHhCCCccEEEEEcCHHHHHHHHHHHHH
Confidence 455677777777777899999999999877654443
No 10
>PF11892 DUF3412: Domain of unknown function (DUF3412); InterPro: IPR021826 This presumed domain is functionally uncharacterised. This domain is found in bacteria. This domain is about 120 amino acids in length. This domain is found associated with PF03641 from PFAM. ; PDB: 3BQ9_A 2PMB_D 3GH1_D.
Probab=35.42 E-value=15 Score=27.54 Aligned_cols=38 Identities=26% Similarity=0.322 Sum_probs=24.7
Q ss_pred ccchhHHHHHHhhhccCCCccccCCCCCccHHHHHHHHHHHHHHHHhcCCCC
Q psy10209 22 SGECEIYRVLNLHYNRNNHIEVCGPQVPTNFRFVIEQTLREFFKAIQDGKDQ 73 (106)
Q Consensus 22 ~rdsELfr~LN~HYNk~Ndfe~~~~~vP~~Fl~v~~~tlrEFf~ai~~gkD~ 73 (106)
.+.+|-..+||||.|..- ......||..|+.|.+|.--
T Consensus 33 ~PTHenMa~L~Lh~~qp~--------------~~LAanLRrAFSGIVAGNVK 70 (123)
T PF11892_consen 33 EPTHENMANLNLHRDQPP--------------HELAANLRRAFSGIVAGNVK 70 (123)
T ss_dssp --SHHHHHT--B-TTS-H--------------HHHHHHHHHHHHHHHHHHH-
T ss_pred CCCHHHHhhCcCCCCCCH--------------HHHHHHHHHHhccccccccC
Confidence 356888999999987431 34567899999999999644
No 11
>COG1656 Uncharacterized conserved protein [Function unknown]
Probab=34.18 E-value=53 Score=25.56 Aligned_cols=27 Identities=19% Similarity=0.373 Sum_probs=21.6
Q ss_pred CCceeeccchhHHHH----HHhhhccCCCcc
Q psy10209 16 AEDLHVSGECEIYRV----LNLHYNRNNHIE 42 (106)
Q Consensus 16 ~~~l~V~rdsELfr~----LN~HYNk~Ndfe 42 (106)
.+-|.+|||.|||+- ..--|=+++++|
T Consensus 47 e~rIllTRDr~L~~r~k~g~~~i~i~~~s~~ 77 (165)
T COG1656 47 EGRILLTRDRELYKRAKLGIKAILIRSDSIE 77 (165)
T ss_pred CCeEEEeccHHHHHHhhccCceEEEeCCCHH
Confidence 568999999999998 444577777777
No 12
>COG5079 SAC3 Nuclear protein export factor [Intracellular trafficking and secretion / Cell division and chromosome partitioning]
Probab=33.08 E-value=64 Score=29.95 Aligned_cols=49 Identities=22% Similarity=0.432 Sum_probs=39.6
Q ss_pred hhccCCCccccCCCCCccHHHHHHHHHHHHHHHHhcCCCCCcchhH-HHHHHHHhhCCC
Q psy10209 34 HYNRNNHIEVCGPQVPTNFRFVIEQTLREFFKAIQDGKDQEPSWKK-SIYKIISRMDDP 91 (106)
Q Consensus 34 HYNk~Ndfe~~~~~vP~~Fl~v~~~tlrEFf~ai~~gkD~~psWKK-~IYKiI~kLD~~ 91 (106)
|||+..++|+ |.++-.||-|||.-..+||-.-|-=.. --|-|+++|.+|
T Consensus 204 ~Fs~qqeleQ---------L~ksL~sL~elYdd~r~~~~~cpneaEFraYaiL~slgDp 253 (646)
T COG5079 204 HFSKQQELEQ---------LKKSLASLIELYDDGRAGKKECPNEAEFRAYAILASLGDP 253 (646)
T ss_pred cccHHhHHHH---------HHHHHHHHHHHHHHHHhhcCCCCCHHHHHHHHHHHHhCCc
Confidence 6889888883 677789999999999999876554444 579999999864
No 13
>COG3963 Phospholipid N-methyltransferase [Lipid metabolism]
Probab=31.91 E-value=1.1e+02 Score=24.62 Aligned_cols=38 Identities=21% Similarity=0.149 Sum_probs=33.8
Q ss_pred HHHHHHhcCCCCCceeeccchhHHHHHHhhhccCCCcc
Q psy10209 5 ARQAVSEGVLHAEDLHVSGECEIYRVLNLHYNRNNHIE 42 (106)
Q Consensus 5 ARQAi~eGv~~~~~l~V~rdsELfr~LN~HYNk~Ndfe 42 (106)
-|.-++-||.+..=-.+.-+.+.|+.||-+|--.|=||
T Consensus 63 TkaIL~~gv~~~~L~~iE~~~dF~~~L~~~~p~~~ii~ 100 (194)
T COG3963 63 TKAILSRGVRPESLTAIEYSPDFVCHLNQLYPGVNIIN 100 (194)
T ss_pred HHHHHhcCCCccceEEEEeCHHHHHHHHHhCCCccccc
Confidence 35557899999998899999999999999999999887
No 14
>PRK12322 NADH dehydrogenase subunit D; Provisional
Probab=31.34 E-value=20 Score=30.06 Aligned_cols=97 Identities=18% Similarity=0.365 Sum_probs=71.1
Q ss_pred HhcCCCCCceeeccchhHHHHHHhhhcc-CCCccc-------cCCCCCccHHHHHHHHHHHHHHHHhcCC----------
Q psy10209 10 SEGVLHAEDLHVSGECEIYRVLNLHYNR-NNHIEV-------CGPQVPTNFRFVIEQTLREFFKAIQDGK---------- 71 (106)
Q Consensus 10 ~eGv~~~~~l~V~rdsELfr~LN~HYNk-~Ndfe~-------~~~~vP~~Fl~v~~~tlrEFf~ai~~gk---------- 71 (106)
+..|.++-.+.|.......|++-+--+| .||+-. .|-.+|..+..-.+..+...+..+.-.+
T Consensus 79 ~~AvE~~~gievp~ra~~iR~i~~EleRi~sHl~~~g~~~~diG~~t~~~~~~~~RE~i~~~~e~~tG~R~~~~~~~~GG 158 (366)
T PRK12322 79 CHAVETMMGLEVPERAEYLRVIAMELGRIASHLVWWGTYLLDIGAVSPFLYAFREREMIINLLNELCGARLTFNYMRIGG 158 (366)
T ss_pred HHHHHHHHCCCCCccHHHHHHHHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHHHHHHHHhhCccccccccccCC
Confidence 4455556668888888888888777776 555432 2445777777777888888888887666
Q ss_pred ---CCCcchhHHHHHHHHhhCCCCCcc----ccCCCchhccC
Q psy10209 72 ---DQEPSWKKSIYKIISRMDDPVPEY----FKSPNFLEQLE 106 (106)
Q Consensus 72 ---D~~psWKK~IYKiI~kLD~~vPe~----fks~~~l~~le 106 (106)
|..+.|...|.+.+..+...++++ ..+|.|++|++
T Consensus 159 v~~dl~~~~~~~i~~~l~~~~~~~~~~~~~~~~n~~~~~R~~ 200 (366)
T PRK12322 159 VKWDAPDGWIEKVKEFVPYMREQLAGYHDLVTGNEIFLNRVK 200 (366)
T ss_pred ccccCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCchHHHhcC
Confidence 667889999999998888887765 56777777663
No 15
>PF13089 PP_kinase_N: Polyphosphate kinase N-terminal domain; PDB: 2O8R_A 1XDP_A 1XDO_B.
Probab=31.06 E-value=47 Score=23.15 Aligned_cols=16 Identities=31% Similarity=0.561 Sum_probs=14.5
Q ss_pred ccHHHHHHHHHHHHHH
Q psy10209 50 TNFRFVIEQTLREFFK 65 (106)
Q Consensus 50 ~~Fl~v~~~tlrEFf~ 65 (106)
=+||.+...-|-|||+
T Consensus 31 l~Fl~I~ssNlDEFFm 46 (109)
T PF13089_consen 31 LKFLAIFSSNLDEFFM 46 (109)
T ss_dssp HHHHHHHHHHHHHHHH
T ss_pred HHHHhhhhcchhHHHH
Confidence 3799999999999997
No 16
>PF12870 Lumazine_bd: Lumazine-binding domain; InterPro: IPR024267 This entry represents a lumazine-binding domain found in a family of putative lipoproteins from bacteria. Lumazine is a fluorescent accessory protein having 6,7-dimethyl-8-(1'-D-ribityl) lumazine (DMRL) as its authentic chromophore; it modulates the emission of bacterial luciferase to shorter wavelengths with increasing luminous strength.; PDB: 3K7C_C.
Probab=30.67 E-value=45 Score=20.82 Aligned_cols=16 Identities=31% Similarity=0.804 Sum_probs=13.0
Q ss_pred HHHHHHHHHHHHhcCC
Q psy10209 56 IEQTLREFFKAIQDGK 71 (106)
Q Consensus 56 ~~~tlrEFf~ai~~gk 71 (106)
.+.+++.||.|+.+|.
T Consensus 9 P~~~v~~f~~al~~gd 24 (111)
T PF12870_consen 9 PEEVVKNFFDALKNGD 24 (111)
T ss_dssp HHHHHHHHHHHHCTT-
T ss_pred HHHHHHHHHHHHHcCC
Confidence 4678999999999985
No 17
>TIGR02736 cbb3_Q_epsi cytochrome c oxidase, cbb3-type, CcoQ subunit, epsilon-Proteobacterial. Members of this protein family are restricted to the epsilon branch of the Proteobacteria. All members are found in operons containing the other three structural subunits of the cbb3 type of cytochrome c oxidase. These small proteins show remote sequence similarity to the CcoQ subunit in other cytochrome c oxidase systems, so this family is assumed to represent the epsilonproteobacterial variant of CcoQ.
Probab=29.33 E-value=26 Score=23.18 Aligned_cols=15 Identities=33% Similarity=0.510 Sum_probs=13.3
Q ss_pred hHHHHHHHHhcCCCC
Q psy10209 2 EKYARQAVSEGVLHA 16 (106)
Q Consensus 2 EK~ARQAi~eGv~~~ 16 (106)
||||+-|+.|++.+.
T Consensus 34 EKY~~LAL~D~ldd~ 48 (56)
T TIGR02736 34 EKYANLALNDDLDDS 48 (56)
T ss_pred HHHhhhhccccccch
Confidence 899999999998764
No 18
>PF10173 Mit_KHE1: Mitochondrial K+-H+ exchange-related; InterPro: IPR018786 This entry represents a family of proteins conserved from plants to humans. Their function is not known.
Probab=28.65 E-value=33 Score=26.32 Aligned_cols=23 Identities=26% Similarity=0.491 Sum_probs=19.1
Q ss_pred CCCcchhHHHHHHHHhhCCCCCc
Q psy10209 72 DQEPSWKKSIYKIISRMDDPVPE 94 (106)
Q Consensus 72 D~~psWKK~IYKiI~kLD~~vPe 94 (106)
.++.+||+.||+...+|-+.||-
T Consensus 54 ~a~~g~k~ki~~~g~~ll~rIp~ 76 (187)
T PF10173_consen 54 KAPKGWKRKIVSYGNRLLDRIPY 76 (187)
T ss_pred cCchhHHHHHHHHHHHHHHhCCH
Confidence 34556999999999999998873
No 19
>PF04097 Nic96: Nup93/Nic96; InterPro: IPR007231 Nup93/Nic96 is a component of the nuclear pore complex. It is required for the correct assembly of the nuclear pore complex []. In Saccharomyces cerevisiae, Nic96 has been shown to be involved in the distribution and cellular concentration of the GTPase Gsp1 []. The structure of Nic96 has revealed a mostly alpha helical structure [].; GO: 0006810 transport, 0005643 nuclear pore; PDB: 2QX5_B 2RFO_A.
Probab=27.76 E-value=69 Score=28.26 Aligned_cols=37 Identities=32% Similarity=0.686 Sum_probs=21.8
Q ss_pred HHHHHHHhcCCCCCcchhHHHHHHHHhhCC---CCCccccC
Q psy10209 61 REFFKAIQDGKDQEPSWKKSIYKIISRMDD---PVPEYFKS 98 (106)
Q Consensus 61 rEFf~ai~~gkD~~psWKK~IYKiI~kLD~---~vPe~fks 98 (106)
.||=..+....+.|| .|++.||||.+-|- .+|+.+.+
T Consensus 173 ~ey~~~~r~~~~~Dp-yK~AvY~ilg~cD~~~~~~~~V~~t 212 (613)
T PF04097_consen 173 LEYNQRIRNSTDGDP-YKRAVYKILGRCDLSRRHLPEVART 212 (613)
T ss_dssp HHHHHHTTT-TTS-H-HHHHHHHHHHT--CCC-S-TTC--S
T ss_pred HHHHHHhcCCCCCCh-HHHHHHHHHhcCCccccchHHHhCc
Confidence 455556766666665 69999999998876 45666544
No 20
>KOG2154|consensus
Probab=25.92 E-value=76 Score=28.76 Aligned_cols=36 Identities=31% Similarity=0.492 Sum_probs=31.5
Q ss_pred CCCCCccHHHHHHHHHHHHHHHHhcCCCCCcchhHH
Q psy10209 45 GPQVPTNFRFVIEQTLREFFKAIQDGKDQEPSWKKS 80 (106)
Q Consensus 45 ~~~vP~~Fl~v~~~tlrEFf~ai~~gkD~~psWKK~ 80 (106)
-+..|++|.+.+...+|+|+.-++.|---.||=+|+
T Consensus 45 ~~E~p~~~~~~~~~~~~~f~~ll~~gll~~Ps~~ks 80 (505)
T KOG2154|consen 45 EDETPKNFVVAVALSVQSFFKLLLRGLLPRPSSTKS 80 (505)
T ss_pred cccCchhHHHHHHHHHHHHHHHHHcCCCCCCchhcc
Confidence 356999999999999999999999998888876663
No 21
>PF11830 DUF3350: Domain of unknown function (DUF3350); InterPro: IPR021785 This domain is functionally uncharacterised. This domain is found in eukaryotes. This presumed domain is typically between 50 to 64 amino acids in length.
Probab=25.25 E-value=54 Score=21.72 Aligned_cols=13 Identities=31% Similarity=0.590 Sum_probs=10.3
Q ss_pred chhHHHHHHHHhh
Q psy10209 76 SWKKSIYKIISRM 88 (106)
Q Consensus 76 sWKK~IYKiI~kL 88 (106)
=|||+|...|-.+
T Consensus 34 LWrkAI~QqIlL~ 46 (56)
T PF11830_consen 34 LWRKAIHQQILLL 46 (56)
T ss_pred HHHHHHHHHHHHH
Confidence 3999999988644
No 22
>PF07417 Crl: Transcriptional regulator Crl; InterPro: IPR009986 This family contains the bacterial transcriptional regulator Crl (approximately 130 residues long). This is a transcriptional regulator of the csgA curlin subunit gene for curli fibres that are found on the surface of certain bacteria [].These proteins bind to the sigma-S subunit of RNA polymerase, activating expression of sigma-S-regulated genes. They also stimulate RNA polymerase holoenzyme formation and may bind to several other sigma factors, such as sigma-70 and sigma-32.; GO: 0016987 sigma factor activity, 0045893 positive regulation of transcription, DNA-dependent, 0005737 cytoplasm; PDB: 3RPJ_A.
Probab=25.08 E-value=78 Score=23.81 Aligned_cols=32 Identities=19% Similarity=0.494 Sum_probs=21.9
Q ss_pred hccCCCccccCCCCCccHHHHHHHHHHHHHHHHh
Q psy10209 35 YNRNNHIEVCGPQVPTNFRFVIEQTLREFFKAIQ 68 (106)
Q Consensus 35 YNk~Ndfe~~~~~vP~~Fl~v~~~tlrEFf~ai~ 68 (106)
|||..+.. .-.+|..=.+-++.||+.|...+.
T Consensus 74 ydk~G~W~--~~~i~~~~~~~v~~Tl~~Fh~kL~ 105 (125)
T PF07417_consen 74 YDKEGNWQ--AEKIKKEVQEEVERTLRDFHQKLV 105 (125)
T ss_dssp E-TTS-EE--S----HHHHHHHHHHHHHHHHHHH
T ss_pred ccCCCCee--ecCCCHHHHHHHHHHHHHHHHHHH
Confidence 88888887 344778888889999999988765
No 23
>COG0649 NuoD NADH:ubiquinone oxidoreductase 49 kD subunit 7 [Energy production and conversion]
Probab=24.89 E-value=86 Score=27.58 Aligned_cols=82 Identities=18% Similarity=0.293 Sum_probs=65.2
Q ss_pred chhHHHHHHhhhccCCCccccCCCCCccHHHHHHHHHHHHHHHHhcCC-------------CCCcchhHHHHHHHHhhCC
Q psy10209 24 ECEIYRVLNLHYNRNNHIEVCGPQVPTNFRFVIEQTLREFFKAIQDGK-------------DQEPSWKKSIYKIISRMDD 90 (106)
Q Consensus 24 dsELfr~LN~HYNk~Ndfe~~~~~vP~~Fl~v~~~tlrEFf~ai~~gk-------------D~~psWKK~IYKiI~kLD~ 90 (106)
=+||.|.+|==|+=+-.-.-.|-..|.-+-.--+.-+..+|+++...+ |.-+.|.+-|.+.+..+-.
T Consensus 109 ~~EL~RI~sHLl~lg~~~~dlGa~T~f~yaf~eRE~i~~l~E~~tGaRm~~~y~rpGGV~~DlP~~~~e~i~~f~d~~~~ 188 (398)
T COG0649 109 LSELNRIASHLLWLGTFALDLGAMTPFLYAFREREKIMDLFEAITGARMHHAYFRPGGVRRDLPEGWLELIREFLDYFPK 188 (398)
T ss_pred HHHHHHHHHHHHHHHHhhhhhccccHhhhhhHHHHHHHHHHHHHhcccccccccccCcccccCCHHHHHHHHHHHHHHHH
Confidence 378888888778877777667778888888888888999999998654 6688999999999998888
Q ss_pred CCCcc----ccCCCchhcc
Q psy10209 91 PVPEY----FKSPNFLEQL 105 (106)
Q Consensus 91 ~vPe~----fks~~~l~~l 105 (106)
.++|+ ++++-|..|+
T Consensus 189 ~l~eye~l~~~N~I~~~R~ 207 (398)
T COG0649 189 RLDEYEKLLTKNRIWRARL 207 (398)
T ss_pred hHHHHHHHHhcChHHHHhc
Confidence 88875 4555555554
No 24
>PRK10984 DNA-binding transcriptional regulator Crl; Provisional
Probab=24.30 E-value=1.1e+02 Score=22.98 Aligned_cols=34 Identities=18% Similarity=0.388 Sum_probs=26.0
Q ss_pred hhccCCCccccCCCCCc-cHHHHHHHHHHHHHHHHhc
Q psy10209 34 HYNRNNHIEVCGPQVPT-NFRFVIEQTLREFFKAIQD 69 (106)
Q Consensus 34 HYNk~Ndfe~~~~~vP~-~Fl~v~~~tlrEFf~ai~~ 69 (106)
-|||..+.. .-.+|+ .-.+-++.||++|...++.
T Consensus 75 lydk~G~W~--~~~i~~~~~~e~v~~Tl~~Fh~kL~~ 109 (127)
T PRK10984 75 LFDKEGNWV--AEPIKDKEVVERLEHTLREFHEKLRE 109 (127)
T ss_pred cccCCCCee--eccCCcHHHHHHHHHHHHHHHHHHHH
Confidence 489998887 344777 4478899999999887753
No 25
>PF01832 Glucosaminidase: Mannosyl-glycoprotein endo-beta-N-acetylglucosaminidase; InterPro: IPR002901 This domain is found in many different proteins including mannosyl-glycoprotein endo-beta-N-acetylglucosamidase (3.2.1.96 from EC).It is also found in flagellar protein J (P75942 from SWISSPROT), which has been shown to hydrolyse peptidoglycan [].; GO: 0004040 amidase activity, 0009253 peptidoglycan catabolic process; PDB: 3FI7_A 2ZYC_A 3K3T_A.
Probab=23.50 E-value=95 Score=21.08 Aligned_cols=33 Identities=21% Similarity=0.442 Sum_probs=25.7
Q ss_pred HHHHHHHHHHHhcCCCC-CcchhHHHHHHHHhhC
Q psy10209 57 EQTLREFFKAIQDGKDQ-EPSWKKSIYKIISRMD 89 (106)
Q Consensus 57 ~~tlrEFf~ai~~gkD~-~psWKK~IYKiI~kLD 89 (106)
..+..+|-.+|+.++.+ ||.|.+.|=+||.+++
T Consensus 102 ~~~~~~~~~~l~~~gYatd~~Y~~kl~~i~~~~~ 135 (136)
T PF01832_consen 102 AKDYKAFAKALQKAGYATDPNYAEKLASIIKSYN 135 (136)
T ss_dssp -SSHHHHHHHHCCCTSSS-TTHHHHHHHHHHCCT
T ss_pred ccCHHHHHHHHhcCcCCCCHHHHHHHHHHHHHcC
Confidence 56778888889888644 8999999999988765
No 26
>PF13376 OmdA: Bacteriocin-protection, YdeI or OmpD-Associated
Probab=21.98 E-value=55 Score=20.45 Aligned_cols=23 Identities=26% Similarity=0.647 Sum_probs=18.7
Q ss_pred CCCccHHHHHHH--HHHHHHHHHhc
Q psy10209 47 QVPTNFRFVIEQ--TLREFFKAIQD 69 (106)
Q Consensus 47 ~vP~~Fl~v~~~--tlrEFf~ai~~ 69 (106)
.+|++|....+. ...+||.++..
T Consensus 5 ~vP~dl~~aL~~~p~a~~~f~~l~~ 29 (63)
T PF13376_consen 5 EVPEDLEAALEANPEAKEFFESLTP 29 (63)
T ss_pred CCCHHHHHHHHCCHHHHHHHHHCCH
Confidence 399999998876 78899988653
No 27
>TIGR00651 pta phosphate acetyltransferase. Model contains a gene from E.coli coding for ethanolamine utilization protein (euti) and also contains similarity to malate oxidoreductases
Probab=21.10 E-value=2e+02 Score=23.47 Aligned_cols=52 Identities=17% Similarity=0.202 Sum_probs=36.4
Q ss_pred HHHHHHHHhcCCCCCceeeccchhHHHHH-HhhhccCCCccccCCCCCccHHHHHH
Q psy10209 3 KYARQAVSEGVLHAEDLHVSGECEIYRVL-NLHYNRNNHIEVCGPQVPTNFRFVIE 57 (106)
Q Consensus 3 K~ARQAi~eGv~~~~~l~V~rdsELfr~L-N~HYNk~Ndfe~~~~~vP~~Fl~v~~ 57 (106)
+-|.+|+.+|+- +-+-|++...+-+.+ .+.++= ++||+.-|+-+.++-+.++
T Consensus 13 ~Aa~~~~~egi~--~pILvG~~~~I~~~~~~~gl~l-~~~eIid~~~~~~~~~~~~ 65 (303)
T TIGR00651 13 KAAALLAERGIA--TPVVLGNPEEIVPSAAGCNLDL-GHVVIIDPDVSPDRESYAE 65 (303)
T ss_pred HHHHHHHHcCCe--eEEEECCHHHHHHHHHHcCCCc-CCcEEECCCCchHHHHHHH
Confidence 467889999995 458888887776652 122332 5899988888877766655
No 28
>smart00222 Sec7 Sec7 domain. Domain named after the S. cerevisiae SEC7 gene product, which is required for proper protein transport through the Golgi. The domain facilitates guanine nucleotide exchange on the small GTPases, ARFs (ADP ribosylation factors).
Probab=20.90 E-value=1.1e+02 Score=22.82 Aligned_cols=31 Identities=26% Similarity=0.557 Sum_probs=26.7
Q ss_pred HHHHHHHHHHHhc---CCCCCcchhHHHHHHHHh
Q psy10209 57 EQTLREFFKAIQD---GKDQEPSWKKSIYKIISR 87 (106)
Q Consensus 57 ~~tlrEFf~ai~~---gkD~~psWKK~IYKiI~k 87 (106)
..|..+|.+.+.. |+|....|-..||.-|..
T Consensus 151 kmt~~~Fi~~~~~~~~~~~~~~~~L~~iY~~I~~ 184 (187)
T smart00222 151 KMTLEDFIKNVRGSNDGEDLPREFLEELYDSIKN 184 (187)
T ss_pred CCCHHHHHHHHhccCCCCCCCHHHHHHHHHHHHh
Confidence 4789999988876 799999999999998864
No 29
>PF10296 DUF2404: Putative integral membrane protein conserved region (DUF2404); InterPro: IPR019411 This is entry represents a domain of unknown function found in mitochondrial distribution and morphology proteins Mdm12 and Mdm34, and in maintenance of mitochondrial morphology protein Mmm1. These proteins are components of the ERMES/MDM complex, which serves as a molecular tether to connect the endoplasmic reticulum and mitochondria [].
Probab=20.73 E-value=79 Score=20.92 Aligned_cols=29 Identities=14% Similarity=0.448 Sum_probs=19.8
Q ss_pred CCcchhHHHHHHHHhhCCCCCccccCCCchhcc
Q psy10209 73 QEPSWKKSIYKIISRMDDPVPEYFKSPNFLEQL 105 (106)
Q Consensus 73 ~~psWKK~IYKiI~kLD~~vPe~fks~~~l~~l 105 (106)
.++.|+..|.+.|.+.=+.++ .|.|++.+
T Consensus 13 ~t~~~~~~i~~~L~~kL~~i~----~P~fl~~i 41 (91)
T PF10296_consen 13 RTEAFRDKIKEKLQKKLNKIK----LPSFLDEI 41 (91)
T ss_pred cCHHHHHHHHHHHHHHHcccc----CCCccCcE
Confidence 356888888887776655555 66666654
No 30
>smart00258 SAND SAND domain.
Probab=20.49 E-value=51 Score=22.47 Aligned_cols=22 Identities=45% Similarity=0.838 Sum_probs=15.8
Q ss_pred HHHHHHHHHhcCCCCCcchhHHHH
Q psy10209 59 TLREFFKAIQDGKDQEPSWKKSIY 82 (106)
Q Consensus 59 tlrEFf~ai~~gkD~~psWKK~IY 82 (106)
|.-|| -+.+|+...--||.+|.
T Consensus 36 TP~eF--e~~~g~~~~K~WK~sIR 57 (73)
T smart00258 36 TPKEF--EIEGGKGKSKDWKRSIR 57 (73)
T ss_pred ChHHH--HhhcCCcccCCcchhee
Confidence 34444 34589999999999884
Done!