Query psy3378
Match_columns 239
No_of_seqs 178 out of 667
Neff 4.2
Searched_HMMs 46136
Date Fri Aug 16 23:29:43 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy3378.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/3378hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG3737|consensus 100.0 7E-39 1.5E-43 303.8 8.5 127 107-238 73-202 (603)
2 KOG3736|consensus 100.0 1E-32 2.2E-37 270.8 8.1 109 126-239 82-190 (578)
3 KOG3738|consensus 100.0 5.6E-30 1.2E-34 243.4 8.9 93 145-238 79-171 (559)
4 TIGR03469 HonB hopene-associat 98.0 1.1E-05 2.4E-10 75.1 5.9 48 187-236 36-83 (384)
5 cd06439 CESA_like_1 CESA_like_ 97.8 3.6E-05 7.7E-10 65.2 5.7 49 187-236 25-73 (251)
6 TIGR03111 glyc2_xrt_Gpos1 puta 97.8 4.2E-05 9.1E-10 72.7 5.8 48 188-236 46-93 (439)
7 PRK11204 N-glycosyltransferase 97.6 0.00014 2.9E-09 67.5 5.8 46 188-236 51-96 (420)
8 PRK14583 hmsR N-glycosyltransf 97.5 0.00018 4E-09 68.2 6.0 45 189-236 73-117 (444)
9 TIGR03472 HpnI hopanoid biosyn 97.4 0.00027 5.9E-09 65.5 5.8 45 189-236 39-83 (373)
10 PRK13915 putative glucosyl-3-p 97.4 0.00021 4.5E-09 65.6 4.8 47 189-236 29-75 (306)
11 TIGR03030 CelA cellulose synth 97.1 0.00066 1.4E-08 68.9 5.6 50 187-236 127-176 (713)
12 COG1215 Glycosyltransferases, 97.0 0.0009 2E-08 61.4 4.9 45 190-236 53-97 (439)
13 PRK11498 bcsA cellulose syntha 97.0 0.0011 2.3E-08 69.5 5.3 49 188-236 257-305 (852)
14 PTZ00260 dolichyl-phosphate be 96.9 0.0012 2.5E-08 61.1 4.8 48 188-236 67-120 (333)
15 PRK14716 bacteriophage N4 adso 94.7 0.054 1.2E-06 53.7 5.5 47 187-236 62-109 (504)
16 PRK05454 glucosyltransferase M 92.1 0.26 5.7E-06 50.7 5.6 47 188-236 121-172 (691)
17 PRK11234 nfrB bacteriophage N4 82.9 2.5 5.5E-05 43.9 5.9 48 185-235 57-105 (727)
18 KOG3916|consensus 81.0 40 0.00087 32.9 12.7 54 173-235 140-195 (372)
19 KOG2977|consensus 77.2 3.5 7.6E-05 39.3 4.4 57 170-236 55-117 (323)
20 PRK15489 nfrB bacteriophage N4 67.2 7.5 0.00016 40.6 4.4 46 187-235 67-116 (703)
21 PF13733 Glyco_transf_7N: N-te 58.2 16 0.00034 31.0 4.1 52 173-233 36-89 (136)
22 PF11119 DUF2633: Protein of u 52.8 18 0.00039 26.8 3.1 21 30-50 8-28 (59)
23 PF13937 DUF4212: Domain of un 31.5 58 0.0013 25.2 3.1 26 30-55 9-37 (81)
24 COG4327 Predicted membrane pro 27.6 65 0.0014 26.1 2.8 25 32-56 18-44 (101)
25 PF13704 Glyco_tranf_2_4: Glyc 25.1 1.2E+02 0.0027 22.1 3.9 31 201-236 2-32 (97)
26 PHA03164 hypothetical protein; 23.7 83 0.0018 24.8 2.7 34 18-51 44-80 (88)
27 TIGR03647 Na_symport_sm probab 21.8 1E+02 0.0022 23.7 2.8 24 31-54 6-32 (77)
28 PF02532 PsbI: Photosystem II 20.1 2.2E+02 0.0048 19.2 3.8 21 30-50 2-22 (36)
No 1
>KOG3737|consensus
Probab=100.00 E-value=7e-39 Score=303.79 Aligned_cols=127 Identities=43% Similarity=0.684 Sum_probs=119.9
Q ss_pred ccccccCCCCCCCccc---cCcCCCCCCCceeecCCCCCHHHHHHHHhcccccccchhhcccccCCCCCCCCCCccccCc
Q psy3378 107 DKMVLHAPEKGEFDED---KNKMQYGEWGKPVILPQNLSSDIKKLVEEGWQRNAFNQYVSDLISVKRKLPDPRDEWCKVP 183 (239)
Q Consensus 107 ~~~~~~~p~~g~~~~d---~~~~~pGE~G~~v~l~~~~~~e~k~~~e~~~~~~~FN~~~SD~Isl~RslpD~R~~~Ck~~ 183 (239)
-.|++.+..+||||++ ...++|||.|+|+.+ ++|.+.+.+..-+++|||+++||+||++|+++|+|.++|+.
T Consensus 73 ~~p~l~~G~LGNfEPKepe~P~~gPGE~gkp~~l----~pe~k~a~~AsekEfGfN~~~SDmISm~R~v~D~R~EECkh- 147 (603)
T KOG3737|consen 73 HDPVLRPGILGNFEPKEPEPPVGGPGEKGKPLVL----GPEFKQAIQASEKEFGFNMVASDMISMDRNVNDLRQEECKH- 147 (603)
T ss_pred CCCcccccccCCCCCCCCCCCCCCCCcCCccccc----ChhHHHHHHHHHHhhCcceeehhhhhcccCccccCHhhccc-
Confidence 4578888899999998 367799999999977 67788888999999999999999999999999999999999
Q ss_pred cccCCCCCceeEEEEeecChhHHHHHHHHHHhhcCcccccceEEEEeCCCCCCCC
Q psy3378 184 GRYLKNLPATSVIICFHNEAWSVLLRTVHSVLDRSPAHLLKEIILVDDYSDMRKY 238 (239)
Q Consensus 184 ~~Y~~~LP~~SVII~FhNEa~StLLRTVhSVl~rTP~~LLkEIILVDD~S~~~~L 238 (239)
..|+.+||++||||+||||.||||+|||||||.|||.++|+|||||||+||++||
T Consensus 148 WdYpe~Lpt~SVviVFHNEGws~LmRTVHSVi~RsP~~~l~eivlvDDfSdKehL 202 (603)
T KOG3737|consen 148 WDYPENLPTSSVVIVFHNEGWSTLMRTVHSVIKRSPRKYLAEIVLVDDFSDKEHL 202 (603)
T ss_pred cCCcccCCcceEEEEEecCccHHHHHHHHHHHhcCcHHhhheEEEeccCCccHHH
Confidence 9999999999999999999999999999999999999999999999999999987
No 2
>KOG3736|consensus
Probab=99.97 E-value=1e-32 Score=270.80 Aligned_cols=109 Identities=56% Similarity=0.813 Sum_probs=95.2
Q ss_pred CCCCCCCceeecCCCCCHHHHHHHHhcccccccchhhcccccCCCCCCCCCCccccCccccCCCCCceeEEEEeecChhH
Q psy3378 126 MQYGEWGKPVILPQNLSSDIKKLVEEGWQRNAFNQYVSDLISVKRKLPDPRDEWCKVPGRYLKNLPATSVIICFHNEAWS 205 (239)
Q Consensus 126 ~~pGE~G~~v~l~~~~~~e~k~~~e~~~~~~~FN~~~SD~Isl~RslpD~R~~~Ck~~~~Y~~~LP~~SVII~FhNEa~S 205 (239)
.++|+.|+ +.+| +.++++.+.+|++++||+++||+|+++|++||+|++.|+....|..+||++||||+|||||||
T Consensus 82 ~~~~~~g~-~~~~----~~~~~~~~~~~~~~~~N~~~Sd~I~l~Rsl~D~r~~~C~~~~~~~~~Lp~~Svii~f~nE~~s 156 (578)
T KOG3736|consen 82 GGGGGGGK-VKLP----EAEEDLEDEGYEKNAFNALLSDRISLNRSLPDLRHPECKKLKYYSDKLPTTSVIIIFHNEAWS 156 (578)
T ss_pred CCccccCc-ccCC----cchHHHHHHHhhcccchhhhhhcccccCCCcchhchhhhhccccccccCCCceEEEEecCCCc
Confidence 44555556 4443 346667788899999999999999999999999999997634567789999999999999999
Q ss_pred HHHHHHHHHhhcCcccccceEEEEeCCCCCCCCC
Q psy3378 206 VLLRTVHSVLDRSPAHLLKEIILVDDYSDMRKYL 239 (239)
Q Consensus 206 tLLRTVhSVl~rTP~~LLkEIILVDD~S~~~~Ll 239 (239)
|||||||||++|||.+||+|||||||+|+.++|.
T Consensus 157 ~llRtv~Svi~rtp~~lLkEIiLVdD~S~~~~l~ 190 (578)
T KOG3736|consen 157 TLLRTVHSVINRTPPYLLKEIILVDDFSDRDHLK 190 (578)
T ss_pred chhheEEeehccCChhHeEEEEEeecCcchhhhh
Confidence 9999999999999999999999999999998863
No 3
>KOG3738|consensus
Probab=99.96 E-value=5.6e-30 Score=243.36 Aligned_cols=93 Identities=52% Similarity=0.708 Sum_probs=87.4
Q ss_pred HHHHHHhcccccccchhhcccccCCCCCCCCCCccccCccccCCCCCceeEEEEeecChhHHHHHHHHHHhhcCcccccc
Q psy3378 145 IKKLVEEGWQRNAFNQYVSDLISVKRKLPDPRDEWCKVPGRYLKNLPATSVIICFHNEAWSVLLRTVHSVLDRSPAHLLK 224 (239)
Q Consensus 145 ~k~~~e~~~~~~~FN~~~SD~Isl~RslpD~R~~~Ck~~~~Y~~~LP~~SVII~FhNEa~StLLRTVhSVl~rTP~~LLk 224 (239)
.-+..++.|+.|.||+..||.+..+|.+||+||++|.. ..|..+||.|||||+|||||.|+|||||.|||||||++||+
T Consensus 79 ~~~~Ged~y~~~~fnq~esd~l~~~r~i~dtRh~qC~~-~~y~~dlp~TsviITfHNEARS~LLRTv~SvlnrsP~~li~ 157 (559)
T KOG3738|consen 79 KWHQGEDPYKANSFNQEESDKLNPTRKIPDTRHPQCRD-VDYKVDLPPTSVIITFHNEARSTLLRTVVSVLNRSPEHLIH 157 (559)
T ss_pred CccCCCchhhcCccchhhhhhhCCCCcccccccccccc-ceeecCCCCceEEEEeccHHHHHHHHHHHHHHcCChHHhhh
Confidence 34456778999999999999999999999999999999 79999999999999999999999999999999999999999
Q ss_pred eEEEEeCCCCCCCC
Q psy3378 225 EIILVDDYSDMRKY 238 (239)
Q Consensus 225 EIILVDD~S~~~~L 238 (239)
|||||||+|+++++
T Consensus 158 EiILVDD~S~Dped 171 (559)
T KOG3738|consen 158 EIILVDDFSQDPED 171 (559)
T ss_pred eeEEecCCCCChHH
Confidence 99999999997653
No 4
>TIGR03469 HonB hopene-associated glycosyltransferase HpnB. This family of genes include a glycosyl transferase, group 2 domain (pfam00535) which are responsible, generally for the transfer of nucleotide-diphosphate sugars to substrates such as polysaccharides and lipids. The genes of this family are often found in the same genetic locus with squalene-hopene cyclase genes, and are never associated with genes for the metabolism of phytoene. Indeed, the members of this family appear to never be found in a genome lacking squalene-hopene cyclase (SHC), although not all genomes encoding SHC have this glycosyl transferase. In the organism Zymomonas mobilis the linkage of this gene to hopanoid biosynthesis has been noted and the gene named HpnB. Hopanoids are known to feature polar glycosyl head groups in many organisms.
Probab=97.98 E-value=1.1e-05 Score=75.15 Aligned_cols=48 Identities=31% Similarity=0.335 Sum_probs=41.1
Q ss_pred CCCCCceeEEEEeecChhHHHHHHHHHHhhcCcccccceEEEEeCCCCCC
Q psy3378 187 LKNLPATSVIICFHNEAWSVLLRTVHSVLDRSPAHLLKEIILVDDYSDMR 236 (239)
Q Consensus 187 ~~~LP~~SVII~FhNEa~StLLRTVhSVl~rTP~~LLkEIILVDD~S~~~ 236 (239)
++..|++||||+.|||+ ..|.+++.|+++++-+.- -|||+|||.|+++
T Consensus 36 ~~~~p~VSVIIpa~Ne~-~~L~~~L~sL~~q~yp~~-~eIIVVDd~StD~ 83 (384)
T TIGR03469 36 PEAWPAVVAVVPARNEA-DVIGECVTSLLEQDYPGK-LHVILVDDHSTDG 83 (384)
T ss_pred CCCCCCEEEEEecCCcH-hHHHHHHHHHHhCCCCCc-eEEEEEeCCCCCc
Confidence 35689999999999998 689999999999876531 3999999999864
No 5
>cd06439 CESA_like_1 CESA_like_1 is a member of the cellulose synthase (CESA) superfamily. This is a subfamily of cellulose synthase (CESA) superfamily. CESA superfamily includes a wide variety of glycosyltransferase family 2 enzymes that share the common characteristic of catalyzing the elongation of polysaccharide chains. The members of the superfamily include cellulose synthase catalytic subunit, chitin synthase, glucan biosynthesis protein and other families of CESA-like proteins.
Probab=97.82 E-value=3.6e-05 Score=65.17 Aligned_cols=49 Identities=29% Similarity=0.254 Sum_probs=41.2
Q ss_pred CCCCCceeEEEEeecChhHHHHHHHHHHhhcCcccccceEEEEeCCCCCC
Q psy3378 187 LKNLPATSVIICFHNEAWSVLLRTVHSVLDRSPAHLLKEIILVDDYSDMR 236 (239)
Q Consensus 187 ~~~LP~~SVII~FhNEa~StLLRTVhSVl~rTP~~LLkEIILVDD~S~~~ 236 (239)
....|.+||||+.|||. +.|.+++.|++.++.+.-.-|||+|||+|+.+
T Consensus 25 ~~~~~~isVvip~~n~~-~~l~~~l~si~~q~~~~~~~eiivvdd~s~d~ 73 (251)
T cd06439 25 PAYLPTVTIIIPAYNEE-AVIEAKLENLLALDYPRDRLEIIVVSDGSTDG 73 (251)
T ss_pred CCCCCEEEEEEecCCcH-HHHHHHHHHHHhCcCCCCcEEEEEEECCCCcc
Confidence 34688999999999997 78999999999987655334999999999864
No 6
>TIGR03111 glyc2_xrt_Gpos1 putative glycosyltransferase TIGR03111. Members of this protein family probable glycosyltransferases of family 2, whose genes are near those for Gram-positive proteins (TIGR03110) related to the proposed exosortase (TIGR02602).
Probab=97.76 E-value=4.2e-05 Score=72.67 Aligned_cols=48 Identities=25% Similarity=0.366 Sum_probs=41.7
Q ss_pred CCCCceeEEEEeecChhHHHHHHHHHHhhcCcccccceEEEEeCCCCCC
Q psy3378 188 KNLPATSVIICFHNEAWSVLLRTVHSVLDRSPAHLLKEIILVDDYSDMR 236 (239)
Q Consensus 188 ~~LP~~SVII~FhNEa~StLLRTVhSVl~rTP~~LLkEIILVDD~S~~~ 236 (239)
..+|++||||+.|||+ .+|-+++.|+++++-+.-.-|||+|||+|+++
T Consensus 46 ~~~P~vsVIIP~yNe~-~~l~~~l~sl~~q~yp~~~~eIiVVDd~StD~ 93 (439)
T TIGR03111 46 GKLPDITIIIPVYNSE-DTLFNCIESIYNQTYPIELIDIILANNQSTDD 93 (439)
T ss_pred CCCCCEEEEEEeCCCh-HHHHHHHHHHHhcCCCCCCeEEEEEECCCChh
Confidence 4589999999999998 89999999999998665333999999999865
No 7
>PRK11204 N-glycosyltransferase; Provisional
Probab=97.55 E-value=0.00014 Score=67.54 Aligned_cols=46 Identities=24% Similarity=0.326 Sum_probs=40.2
Q ss_pred CCCCceeEEEEeecChhHHHHHHHHHHhhcCcccccceEEEEeCCCCCC
Q psy3378 188 KNLPATSVIICFHNEAWSVLLRTVHSVLDRSPAHLLKEIILVDDYSDMR 236 (239)
Q Consensus 188 ~~LP~~SVII~FhNEa~StLLRTVhSVl~rTP~~LLkEIILVDD~S~~~ 236 (239)
...|.+||||+.|||. .++-+++.|+++++-+.+ |||+|||+|+++
T Consensus 51 ~~~p~vsViIp~yne~-~~i~~~l~sl~~q~yp~~--eiiVvdD~s~d~ 96 (420)
T PRK11204 51 KEYPGVSILVPCYNEG-ENVEETISHLLALRYPNY--EVIAINDGSSDN 96 (420)
T ss_pred CCCCCEEEEEecCCCH-HHHHHHHHHHHhCCCCCe--EEEEEECCCCcc
Confidence 3578999999999996 689999999999886654 999999999865
No 8
>PRK14583 hmsR N-glycosyltransferase; Provisional
Probab=97.49 E-value=0.00018 Score=68.25 Aligned_cols=45 Identities=24% Similarity=0.372 Sum_probs=39.8
Q ss_pred CCCceeEEEEeecChhHHHHHHHHHHhhcCcccccceEEEEeCCCCCC
Q psy3378 189 NLPATSVIICFHNEAWSVLLRTVHSVLDRSPAHLLKEIILVDDYSDMR 236 (239)
Q Consensus 189 ~LP~~SVII~FhNEa~StLLRTVhSVl~rTP~~LLkEIILVDD~S~~~ 236 (239)
..|.+||||+.|||+- .+.+|+.|+++++-+.+ |||+|||+|+++
T Consensus 73 ~~p~vsViIP~yNE~~-~i~~~l~sll~q~yp~~--eIivVdDgs~D~ 117 (444)
T PRK14583 73 GHPLVSILVPCFNEGL-NARETIHAALAQTYTNI--EVIAINDGSSDD 117 (444)
T ss_pred CCCcEEEEEEeCCCHH-HHHHHHHHHHcCCCCCe--EEEEEECCCCcc
Confidence 4799999999999985 68999999999987664 999999999864
No 9
>TIGR03472 HpnI hopanoid biosynthesis associated glycosyl transferase protein HpnI. This family of genes include a glycosyl transferase, group 2 domain (pfam00535) which are responsible, generally for the transfer of nucleotide-diphosphate sugars to substrates such as polysaccharides and lipids. The member of this clade from Acidithiobacillus ferrooxidans ATCC 23270 (AFE_0974) is found in the same locus as squalene-hopene cyclase (SHC, TIGR01507) and other genes associated with the biosynthesis of hopanoid natural products. Similarly, in Ralstonia eutropha JMP134 (Reut_B4902) this gene is adjacent to HpnAB, IspH and HpnH (TIGR03470), although SHC itself is elsewhere in the genome. Notably, this gene (here named HpnI) and three others form a conserved set (HpnIJKL) which occur in a subset of all genomes containing the SHC enzyme. This relationship was discerned using the method of partial phylogenetic profiling. This group includes Zymomonas mobilis, the organism where the initial hopano
Probab=97.40 E-value=0.00027 Score=65.51 Aligned_cols=45 Identities=16% Similarity=0.169 Sum_probs=40.1
Q ss_pred CCCceeEEEEeecChhHHHHHHHHHHhhcCcccccceEEEEeCCCCCC
Q psy3378 189 NLPATSVIICFHNEAWSVLLRTVHSVLDRSPAHLLKEIILVDDYSDMR 236 (239)
Q Consensus 189 ~LP~~SVII~FhNEa~StLLRTVhSVl~rTP~~LLkEIILVDD~S~~~ 236 (239)
..|.+||||+-|||.- .|.+++.|+++++.+.+ |||+|||.|++.
T Consensus 39 ~~p~VSViiP~~nee~-~l~~~L~Sl~~q~Yp~~--EIivvdd~s~D~ 83 (373)
T TIGR03472 39 AWPPVSVLKPLHGDEP-ELYENLASFCRQDYPGF--QMLFGVQDPDDP 83 (373)
T ss_pred CCCCeEEEEECCCCCh-hHHHHHHHHHhcCCCCe--EEEEEeCCCCCc
Confidence 4799999999999975 78999999999997774 999999999864
No 10
>PRK13915 putative glucosyl-3-phosphoglycerate synthase; Provisional
Probab=97.39 E-value=0.00021 Score=65.55 Aligned_cols=47 Identities=23% Similarity=0.159 Sum_probs=38.6
Q ss_pred CCCceeEEEEeecChhHHHHHHHHHHhhcCcccccceEEEEeCCCCCC
Q psy3378 189 NLPATSVIICFHNEAWSVLLRTVHSVLDRSPAHLLKEIILVDDYSDMR 236 (239)
Q Consensus 189 ~LP~~SVII~FhNEa~StLLRTVhSVl~rTP~~LLkEIILVDD~S~~~ 236 (239)
..|.+||||+.|||+- +|-+++.|+.+.+......|||+|||+|++.
T Consensus 29 ~~~~vSVVIPayNee~-~I~~~l~sl~~~~~~~~~~EIIVVDDgStD~ 75 (306)
T PRK13915 29 AGRTVSVVLPALNEEE-TVGKVVDSIRPLLMEPLVDELIVIDSGSTDA 75 (306)
T ss_pred CCCCEEEEEecCCcHH-HHHHHHHHHHHHhccCCCcEEEEEeCCCccH
Confidence 4689999999999974 7899999998876423345999999999864
No 11
>TIGR03030 CelA cellulose synthase catalytic subunit (UDP-forming). Cellulose synthase catalyzes the beta-1,4 polymerization of glucose residues in the formation of cellulose. In bacteria, the substrate is UDP-glucose. The synthase consists of two subunits (or domains in the frequent cases where it is encoded as a single polypeptide), the catalytic domain modelled here and the regulatory domain (pfam03170). The regulatory domain binds the allosteric activator cyclic di-GMP. The protein is membrane-associated and probably assembles into multimers such that the individual cellulose strands can self-assemble into multi-strand fibrils.
Probab=97.12 E-value=0.00066 Score=68.92 Aligned_cols=50 Identities=18% Similarity=0.139 Sum_probs=41.2
Q ss_pred CCCCCceeEEEEeecChhHHHHHHHHHHhhcCcccccceEEEEeCCCCCC
Q psy3378 187 LKNLPATSVIICFHNEAWSVLLRTVHSVLDRSPAHLLKEIILVDDYSDMR 236 (239)
Q Consensus 187 ~~~LP~~SVII~FhNEa~StLLRTVhSVl~rTP~~LLkEIILVDD~S~~~ 236 (239)
.+.+|++||||+.|||....+-+|+.|+++.+-+.=.-|||+|||.|++.
T Consensus 127 ~~~~P~VsViIP~yNE~~~iv~~tl~s~~~~dYP~~~~eIiVvDDgStD~ 176 (713)
T TIGR03030 127 PEEWPTVDVFIPTYNEDLEIVATTVLAAKNMDYPADKFRVWILDDGGTDQ 176 (713)
T ss_pred cccCCeeEEEEcCCCCCHHHHHHHHHHHHhCCCCccceEEEEEECcCCcc
Confidence 45689999999999999877778999999887553223999999999753
No 12
>COG1215 Glycosyltransferases, probably involved in cell wall biogenesis [Cell envelope biogenesis, outer membrane]
Probab=97.01 E-value=0.0009 Score=61.36 Aligned_cols=45 Identities=42% Similarity=0.420 Sum_probs=41.5
Q ss_pred CCceeEEEEeecChhHHHHHHHHHHhhcCcccccceEEEEeCCCCCC
Q psy3378 190 LPATSVIICFHNEAWSVLLRTVHSVLDRSPAHLLKEIILVDDYSDMR 236 (239)
Q Consensus 190 LP~~SVII~FhNEa~StLLRTVhSVl~rTP~~LLkEIILVDD~S~~~ 236 (239)
+|++||+|+.|||.-.++-+|+.|+++...+. .|||+|||.|+++
T Consensus 53 ~p~vsviiP~ynE~~~~~~~~l~s~~~~dyp~--~evivv~d~~~d~ 97 (439)
T COG1215 53 LPKVSVIIPAYNEEPEVLEETLESLLSQDYPR--YEVIVVDDGSTDE 97 (439)
T ss_pred CCceEEEEecCCCchhhHHHHHHHHHhCCCCC--ceEEEECCCCChh
Confidence 69999999999999999999999999999777 4999999988754
No 13
>PRK11498 bcsA cellulose synthase catalytic subunit; Provisional
Probab=96.95 E-value=0.0011 Score=69.45 Aligned_cols=49 Identities=18% Similarity=0.142 Sum_probs=41.0
Q ss_pred CCCCceeEEEEeecChhHHHHHHHHHHhhcCcccccceEEEEeCCCCCC
Q psy3378 188 KNLPATSVIICFHNEAWSVLLRTVHSVLDRSPAHLLKEIILVDDYSDMR 236 (239)
Q Consensus 188 ~~LP~~SVII~FhNEa~StLLRTVhSVl~rTP~~LLkEIILVDD~S~~~ 236 (239)
+..|++||||+.|||.-..+-+|+.++++.+-+.=--|||+|||+|+.+
T Consensus 257 ~~~P~VsViIPtYNE~~~vv~~tI~a~l~~dYP~~k~EViVVDDgS~D~ 305 (852)
T PRK11498 257 SLWPTVDIFVPTYNEDLNVVKNTIYASLGIDWPKDKLNIWILDDGGREE 305 (852)
T ss_pred CCCCcEEEEEecCCCcHHHHHHHHHHHHhccCCCCceEEEEEeCCCChH
Confidence 3579999999999999888889999999887664223999999999754
No 14
>PTZ00260 dolichyl-phosphate beta-glucosyltransferase; Provisional
Probab=96.92 E-value=0.0012 Score=61.14 Aligned_cols=48 Identities=23% Similarity=0.189 Sum_probs=36.8
Q ss_pred CCCCceeEEEEeecChhHHHHHHHHHHhhcCcc------cccceEEEEeCCCCCC
Q psy3378 188 KNLPATSVIICFHNEAWSVLLRTVHSVLDRSPA------HLLKEIILVDDYSDMR 236 (239)
Q Consensus 188 ~~LP~~SVII~FhNEa~StLLRTVhSVl~rTP~------~LLkEIILVDD~S~~~ 236 (239)
...|.+||||+.|||+- +|.++++++++..-. ..--|||+|||+|++.
T Consensus 67 ~~~~~isVVIP~yNe~~-~i~~~L~~l~~~~~~~~~~~~~~~~EIIVVDDgStD~ 120 (333)
T PTZ00260 67 DSDVDLSIVIPAYNEED-RLPKMLKETIKYLESRSRKDPKFKYEIIIVNDGSKDK 120 (333)
T ss_pred CCCeEEEEEEeeCCCHH-HHHHHHHHHHHHHHhhhccCCCCCEEEEEEeCCCCCc
Confidence 46789999999999975 788999988764321 1123999999999764
No 15
>PRK14716 bacteriophage N4 adsorption protein B; Provisional
Probab=94.68 E-value=0.054 Score=53.70 Aligned_cols=47 Identities=15% Similarity=0.041 Sum_probs=38.7
Q ss_pred CCCCCceeEEEEeecChhHHHHHHHHHHhhcC-cccccceEEEEeCCCCCC
Q psy3378 187 LKNLPATSVIICFHNEAWSVLLRTVHSVLDRS-PAHLLKEIILVDDYSDMR 236 (239)
Q Consensus 187 ~~~LP~~SVII~FhNEa~StLLRTVhSVl~rT-P~~LLkEIILVDD~S~~~ 236 (239)
..+.|.++|+|+-|||+ .++-++|.|.+.+. .+.+ |||++||.||.+
T Consensus 62 ~~~~p~vaIlIPA~NE~-~vI~~~l~s~L~~ldY~~~--eIiVv~d~ndd~ 109 (504)
T PRK14716 62 SVPEKRIAIFVPAWREA-DVIGRMLEHNLATLDYENY--RIFVGTYPNDPA 109 (504)
T ss_pred cCCCCceEEEEeccCch-hHHHHHHHHHHHcCCCCCe--EEEEEECCCChh
Confidence 34589999999999997 48999999988765 3444 999999988764
No 16
>PRK05454 glucosyltransferase MdoH; Provisional
Probab=92.09 E-value=0.26 Score=50.74 Aligned_cols=47 Identities=19% Similarity=0.210 Sum_probs=35.5
Q ss_pred CCCCceeEEEEeecChhH----HHHHHHHHHhhcCc-ccccceEEEEeCCCCCC
Q psy3378 188 KNLPATSVIICFHNEAWS----VLLRTVHSVLDRSP-AHLLKEIILVDDYSDMR 236 (239)
Q Consensus 188 ~~LP~~SVII~FhNEa~S----tLLRTVhSVl~rTP-~~LLkEIILVDD~S~~~ 236 (239)
...|.|+|+|+-|||... +|-.++.|+..... .++ ||+++||.+|.+
T Consensus 121 ~~~~~VaVliP~yNEd~~~v~~~L~a~~~Sl~~~~~~~~~--e~~vLdD~~d~~ 172 (691)
T PRK05454 121 PPEARTAILMPIYNEDPARVFAGLRAMYESLAATGHGAHF--DFFILSDTRDPD 172 (691)
T ss_pred CCCCceEEEEeCCCCChHHHHHHHHHHHHHHHhcCCCCCE--EEEEEECCCChh
Confidence 457899999999999975 45555667766544 344 999999999864
No 17
>PRK11234 nfrB bacteriophage N4 adsorption protein B; Provisional
Probab=82.92 E-value=2.5 Score=43.94 Aligned_cols=48 Identities=13% Similarity=0.035 Sum_probs=37.5
Q ss_pred ccCCCCCceeEEEEeecChhHHHHHHHHHHhh-cCcccccceEEEEeCCCCC
Q psy3378 185 RYLKNLPATSVIICFHNEAWSVLLRTVHSVLD-RSPAHLLKEIILVDDYSDM 235 (239)
Q Consensus 185 ~Y~~~LP~~SVII~FhNEa~StLLRTVhSVl~-rTP~~LLkEIILVDD~S~~ 235 (239)
.+....|.+||+|+-|||+- ++-.|+.+.+. ..-+.+ |||+++|.+|.
T Consensus 57 l~~~~~~~vsIlVPa~nE~~-vi~~~i~~ll~~ldYP~~--eI~vi~~~nD~ 105 (727)
T PRK11234 57 LYKPDEKPLAIMVPAWNETG-VIGNMAELAATTLDYENY--HIFVGTYPNDP 105 (727)
T ss_pred cccCCCCCEEEEEecCcchh-hHHHHHHHHHHhCCCCCe--EEEEEecCCCh
Confidence 34456799999999999975 88999999874 344554 99999876654
No 18
>KOG3916|consensus
Probab=80.99 E-value=40 Score=32.94 Aligned_cols=54 Identities=28% Similarity=0.432 Sum_probs=43.1
Q ss_pred CCCCCccccCccccCCCCCceeEEEEeec--ChhHHHHHHHHHHhhcCcccccceEEEEeCCCCC
Q psy3378 173 PDPRDEWCKVPGRYLKNLPATSVIICFHN--EAWSVLLRTVHSVLDRSPAHLLKEIILVDDYSDM 235 (239)
Q Consensus 173 pD~R~~~Ck~~~~Y~~~LP~~SVII~FhN--Ea~StLLRTVhSVl~rTP~~LLkEIILVDD~S~~ 235 (239)
...|++.|.. .+ ++.|||+|-| |+...+|+.+|.+|.|.--++ -|++|+-+-+.
T Consensus 140 G~~~P~dC~a--r~-----kvAIIIPfR~Re~HL~~~l~~LhP~LqrQrL~y--~iyVieQ~g~~ 195 (372)
T KOG3916|consen 140 GRYRPEDCQA--RH-----KVAIIIPFRNREEHLRYLLHHLHPFLQRQRLDY--RIYVIEQAGNK 195 (372)
T ss_pred CCcCCCCCCc--cc-----eeEEEeecccHHHHHHHHHHHhhHHHHhhhhce--eEEEEEecCCC
Confidence 5688899998 33 8999999975 678999999999999986665 78888765443
No 19
>KOG2977|consensus
Probab=77.19 E-value=3.5 Score=39.26 Aligned_cols=57 Identities=35% Similarity=0.340 Sum_probs=35.8
Q ss_pred CCCCCCCCccccCccccCCCCCceeEEEEeecChh---HHHHHHHHHHhhc---CcccccceEEEEeCCCCCC
Q psy3378 170 RKLPDPRDEWCKVPGRYLKNLPATSVIICFHNEAW---SVLLRTVHSVLDR---SPAHLLKEIILVDDYSDMR 236 (239)
Q Consensus 170 RslpD~R~~~Ck~~~~Y~~~LP~~SVII~FhNEa~---StLLRTVhSVl~r---TP~~LLkEIILVDD~S~~~ 236 (239)
|.+|..|.+. ..| ..||||+-|||.- ..|=-++.+.=.| +|. .=-|||+|||+|.+.
T Consensus 55 ~~lp~~~d~~----~~~-----~lsVIVpaynE~~ri~~mldeav~~le~ry~~~~~-F~~eiiVvddgs~d~ 117 (323)
T KOG2977|consen 55 RTLPNIRDSP----EKM-----YLSVIVPAYNEEGRIGAMLDEAVDYLEKRYLSDKS-FTYEIIVVDDGSTDS 117 (323)
T ss_pred eeCcccccCh----hhc-----eeEEEEecCCcccchHHHHHHHHHHHHHHhccCCC-CceeEEEeCCCCchh
Confidence 5666666542 121 7899999999964 3333344443333 333 445999999999864
No 20
>PRK15489 nfrB bacteriophage N4 adsorption protein B; Provisional
Probab=67.15 E-value=7.5 Score=40.58 Aligned_cols=46 Identities=11% Similarity=0.211 Sum_probs=35.4
Q ss_pred CCCCCceeEEEEeecChhHHHHHHHHHHhhc-CcccccceEEE---EeCCCCC
Q psy3378 187 LKNLPATSVIICFHNEAWSVLLRTVHSVLDR-SPAHLLKEIIL---VDDYSDM 235 (239)
Q Consensus 187 ~~~LP~~SVII~FhNEa~StLLRTVhSVl~r-TP~~LLkEIIL---VDD~S~~ 235 (239)
..+.|..||+|+-|||+ .++-++|.+++.+ .-+.+ |||+ +||..+.
T Consensus 67 ~~~~~~vsIlVPa~nE~-~VI~~~v~~ll~~ldYp~~--~I~v~~~~nD~~T~ 116 (703)
T PRK15489 67 ERDEQPLAIMVPAWKEY-DVIAKMIENMLATLDYRRY--VIFVGTYPNDAETI 116 (703)
T ss_pred ccCCCceEEEEeCCCcH-HHHHHHHHHHHhcCCCCCe--EEEEEecCCCccHH
Confidence 34568999999999997 5999999999855 43444 8888 5777653
No 21
>PF13733 Glyco_transf_7N: N-terminal region of glycosyl transferase group 7; PDB: 2AGD_B 3EE5_A 2AE7_B 2AEC_A 2FYA_A 2AES_B 2AH9_A 2FYB_A 2FY7_A 3LW6_A ....
Probab=58.23 E-value=16 Score=31.02 Aligned_cols=52 Identities=19% Similarity=0.382 Sum_probs=35.7
Q ss_pred CCCCCccccCccccCCCCCceeEEEEeec--ChhHHHHHHHHHHhhcCcccccceEEEEeCCC
Q psy3378 173 PDPRDEWCKVPGRYLKNLPATSVIICFHN--EAWSVLLRTVHSVLDRSPAHLLKEIILVDDYS 233 (239)
Q Consensus 173 pD~R~~~Ck~~~~Y~~~LP~~SVII~FhN--Ea~StLLRTVhSVl~rTP~~LLkEIILVDD~S 233 (239)
.-.++..|+. .. .+.|||+|.| +++..+|..+|-+|.|.--.. .|++|+=.-
T Consensus 36 G~~~p~~C~~-~~------kvAiIIPyRdR~~hL~~fl~~l~~~L~rQ~~~y--~I~vieQ~~ 89 (136)
T PF13733_consen 36 GHWKPPDCKP-RH------KVAIIIPYRDREEHLRIFLPHLHPFLQRQQLDY--RIFVIEQVD 89 (136)
T ss_dssp TEE--SSSB--S-------EEEEEEEESS-HHHHHHHHHHHHHHHHHTT-EE--EEEEEEE-S
T ss_pred ceecCCcccc-cc------ceEEEEEeCCHHHHHHHHHHHHHHHHhhCcceE--EEEEEeecc
Confidence 3456778987 22 8999999986 456788888999999975444 888887543
No 22
>PF11119 DUF2633: Protein of unknown function (DUF2633); InterPro: IPR022576 This family is conserved largely in Proteobacteria. Several members are named as YfgG. The function is not known.
Probab=52.78 E-value=18 Score=26.78 Aligned_cols=21 Identities=19% Similarity=0.303 Sum_probs=17.4
Q ss_pred hHHHHHHHHHHHHHHHHHHhh
Q psy3378 30 VVLKLILLVCLIWLTVAALLF 50 (239)
Q Consensus 30 ~~~Klill~~~vWf~v~~~~y 50 (239)
..-|+|+|+|.+-|+.-|.||
T Consensus 8 ~mtriVLLISfiIlfgRl~Y~ 28 (59)
T PF11119_consen 8 RMTRIVLLISFIILFGRLIYS 28 (59)
T ss_pred hHHHHHHHHHHHHHHHHHHHH
Confidence 788999999998888766665
No 23
>PF13937 DUF4212: Domain of unknown function (DUF4212)
Probab=31.45 E-value=58 Score=25.20 Aligned_cols=26 Identities=35% Similarity=0.692 Sum_probs=19.3
Q ss_pred hHHHHHHHHHHHHHHHHH---HhhhcccC
Q psy3378 30 VVLKLILLVCLIWLTVAA---LLFMEDRN 55 (239)
Q Consensus 30 ~~~Klill~~~vWf~v~~---~~ysDd~~ 55 (239)
..+++++.+.+|||.|.| .++-|+..
T Consensus 9 ~n~rl~~~lL~iW~vvsfg~~~lfa~~Ln 37 (81)
T PF13937_consen 9 KNLRLIAILLAIWFVVSFGVGILFADELN 37 (81)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHc
Confidence 478999999999999933 33555544
No 24
>COG4327 Predicted membrane protein [Function unknown]
Probab=27.60 E-value=65 Score=26.15 Aligned_cols=25 Identities=24% Similarity=0.449 Sum_probs=18.5
Q ss_pred HHHHHHHHHHHHHH--HHHhhhcccCC
Q psy3378 32 LKLILLVCLIWLTV--AALLFMEDRNR 56 (239)
Q Consensus 32 ~Klill~~~vWf~v--~~~~ysDd~~~ 56 (239)
..||..+.+|||+| ++.++-++...
T Consensus 18 ttli~~lL~vwflVSfvvi~fa~alst 44 (101)
T COG4327 18 TTLIAALLGVWFLVSFVVILFARALST 44 (101)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHhhcc
Confidence 45778888999999 55667776653
No 25
>PF13704 Glyco_tranf_2_4: Glycosyl transferase family 2
Probab=25.13 E-value=1.2e+02 Score=22.14 Aligned_cols=31 Identities=29% Similarity=0.154 Sum_probs=20.8
Q ss_pred cChhHHHHHHHHHHhhcCcccccceEEEEeCCCCCC
Q psy3378 201 NEAWSVLLRTVHSVLDRSPAHLLKEIILVDDYSDMR 236 (239)
Q Consensus 201 NEa~StLLRTVhSVl~rTP~~LLkEIILVDD~S~~~ 236 (239)
||+. .|...+..-+.-. ..+|+++||.|+.+
T Consensus 2 ne~~-~L~~wl~~~~~lG----~d~i~i~d~~s~D~ 32 (97)
T PF13704_consen 2 NEAD-YLPEWLAHHLALG----VDHIYIYDDGSTDG 32 (97)
T ss_pred ChHH-HHHHHHHHHHHcC----CCEEEEEECCCCcc
Confidence 6654 6666666554332 46799999999854
No 26
>PHA03164 hypothetical protein; Provisional
Probab=23.72 E-value=83 Score=24.75 Aligned_cols=34 Identities=24% Similarity=0.207 Sum_probs=18.0
Q ss_pred cceeeeeehhhh---hHHHHHHHHHHHHHHHHHHhhh
Q psy3378 18 SRFYIYFLRRRF---VVLKLILLVCLIWLTVAALLFM 51 (239)
Q Consensus 18 ~~~~~~~~RrRs---~~~Klill~~~vWf~v~~~~ys 51 (239)
||-..=.|+-|. +++-|.-|..++-+|++|.+|-
T Consensus 44 srtawnlwnnrRktftFlvLtgLaIamILfiifvlyv 80 (88)
T PHA03164 44 SRTAWNLWNNRRKTFTFLVLTGLAIAMILFIIFVLYV 80 (88)
T ss_pred CchhHHHHHhhhheeehHHHHHHHHHHHHHHHHHHHh
Confidence 444444454333 4444444455556666777773
No 27
>TIGR03647 Na_symport_sm probable solute:sodium symporter small subunit. Members of this family are highly hydrophobic bacterial proteins of about 90 amino acids in length. Members usually are found immediately upstream (sometimes fused to) a member of the solute:sodium symporter family, and therefore are a putative sodium:solute symporter small subunit. Members tend to be found in aquatic species, especially those from marine or other high salt environments.
Probab=21.83 E-value=1e+02 Score=23.68 Aligned_cols=24 Identities=33% Similarity=0.607 Sum_probs=18.2
Q ss_pred HHHHHHHHHHHHHHHHHH---hhhccc
Q psy3378 31 VLKLILLVCLIWLTVAAL---LFMEDR 54 (239)
Q Consensus 31 ~~Klill~~~vWf~v~~~---~ysDd~ 54 (239)
.+|++..+.++||.|.|. ++.|..
T Consensus 6 n~~li~~lL~iWf~vsfg~~~lf~~~L 32 (77)
T TIGR03647 6 NLRLIAVLLAIWFVVSFGAGILFADEL 32 (77)
T ss_pred HHHHHHHHHHHHHHHHHhHHHHHHHHH
Confidence 578999999999998442 456654
No 28
>PF02532 PsbI: Photosystem II reaction centre I protein (PSII 4.8 kDa protein); InterPro: IPR003686 Oxygenic photosynthesis uses two multi-subunit photosystems (I and II) located in the cell membranes of cyanobacteria and in the thylakoid membranes of chloroplasts in plants and algae. Photosystem II (PSII) has a P680 reaction centre containing chlorophyll 'a' that uses light energy to carry out the oxidation (splitting) of water molecules, and to produce ATP via a proton pump. Photosystem I (PSI) has a P700 reaction centre containing chlorophyll that takes the electron and associated hydrogen donated from PSII to reduce NADP+ to NADPH. Both ATP and NADPH are subsequently used in the light-independent reactions to convert carbon dioxide to glucose using the hydrogen atom extracted from water by PSII, releasing oxygen as a by-product. PSII is a multisubunit protein-pigment complex containing polypeptides both intrinsic and extrinsic to the photosynthetic membrane [, ]. Within the core of the complex, the chlorophyll and beta-carotene pigments are mainly bound to the antenna proteins CP43 (PsbC) and CP47 (PsbB), which pass the excitation energy on to the reaction centre proteins D1 (Qb, PsbA) and D2 (Qa, PsbD) that bind all the redox-active cofactors involved in the energy conversion process. The PSII oxygen-evolving complex (OEC) oxidises water to provide protons for use by PSI, and consists of OEE1 (PsbO), OEE2 (PsbP) and OEE3 (PsbQ). The remaining subunits in PSII are of low molecular weight (less than 10 kDa), and are involved in PSII assembly, stabilisation, dimerisation, and photo-protection []. This family represents the low molecular weight transmembrane protein PsbI, which is tightly associated with the D1/D2 heterodimer in PSII. The function of PsbI is unknown, but it may be involved in the assembly, dimerisation or stabilisation of PSII dimers [].; GO: 0015979 photosynthesis, 0009523 photosystem II, 0009539 photosystem II reaction center, 0016020 membrane; PDB: 3A0H_i 3ARC_I 3A0B_i 3BZ2_I 3PRQ_I 3KZI_I 3PRR_I 2AXT_i 4FBY_I 1S5L_i ....
Probab=20.11 E-value=2.2e+02 Score=19.24 Aligned_cols=21 Identities=14% Similarity=0.336 Sum_probs=16.0
Q ss_pred hHHHHHHHHHHHHHHHHHHhh
Q psy3378 30 VVLKLILLVCLIWLTVAALLF 50 (239)
Q Consensus 30 ~~~Klill~~~vWf~v~~~~y 50 (239)
.++|++.-+.++.|..+|.+.
T Consensus 2 ~~LK~~Vy~vV~ffv~LFifG 22 (36)
T PF02532_consen 2 LTLKIFVYTVVIFFVSLFIFG 22 (36)
T ss_dssp HHHHHHHHHHHHHHHHHHHHH
T ss_pred eEEEEeehhhHHHHHHHHhcc
Confidence 478988888888777766665
Done!