Query psy13894
Match_columns 191
No_of_seqs 118 out of 1157
Neff 7.9
Searched_HMMs 46136
Date Fri Aug 16 23:08:46 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy13894.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/13894hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PTZ00101 rhomboid-1 protease; 100.0 2.7E-30 5.8E-35 213.7 15.4 155 8-177 88-245 (278)
2 KOG2289|consensus 99.9 1.4E-28 3E-33 205.3 6.2 143 17-172 112-254 (316)
3 KOG2290|consensus 99.9 9.1E-28 2E-32 205.1 7.5 158 2-173 429-586 (652)
4 PRK10907 intramembrane serine 99.9 3.6E-26 7.8E-31 189.3 14.9 144 20-176 131-274 (276)
5 COG0705 Membrane associated se 99.9 3E-25 6.4E-30 179.5 14.2 157 10-177 52-216 (228)
6 PF01694 Rhomboid: Rhomboid fa 99.9 3E-25 6.6E-30 166.9 5.6 142 20-174 2-144 (145)
7 KOG2632|consensus 99.7 2.5E-17 5.5E-22 133.2 11.9 138 19-169 48-195 (258)
8 PF08551 DUF1751: Eukaryotic i 98.5 2.9E-07 6.2E-12 65.3 6.7 59 23-81 7-65 (99)
9 KOG2980|consensus 98.4 2.4E-08 5.3E-13 82.7 -2.9 144 23-173 154-302 (310)
10 PF04511 DER1: Der1-like famil 98.2 5.9E-06 1.3E-10 65.6 8.3 137 19-169 37-179 (197)
11 KOG2890|consensus 98.0 2.9E-05 6.4E-10 64.6 8.3 141 22-174 65-217 (326)
12 KOG0858|consensus 97.6 0.00019 4.1E-09 58.0 6.5 61 19-79 48-112 (239)
13 COG5291 Predicted membrane pro 95.7 0.0047 1E-07 50.3 1.1 43 19-61 55-99 (313)
14 KOG4463|consensus 95.2 0.035 7.5E-07 45.8 4.6 61 20-81 47-107 (323)
15 PF12732 YtxH: YtxH-like prote 66.0 6 0.00013 26.0 2.4 29 154-182 3-31 (74)
16 COG0705 Membrane associated se 49.8 31 0.00067 27.4 4.4 72 19-104 135-206 (228)
17 PF06946 Phage_holin_5: Phage 47.9 81 0.0018 22.0 5.6 17 157-173 65-81 (93)
18 PF09527 ATPase_gene1: Putativ 46.7 66 0.0014 19.6 6.6 41 38-78 8-49 (55)
19 PF03419 Peptidase_U4: Sporula 46.4 1.7E+02 0.0037 24.3 9.9 37 38-78 11-47 (293)
20 COG3105 Uncharacterized protei 39.4 25 0.00055 26.0 2.1 22 152-173 7-28 (138)
21 COG4769 Predicted membrane pro 37.1 65 0.0014 25.0 4.1 28 148-175 139-166 (181)
22 PF13829 DUF4191: Domain of un 33.4 66 0.0014 26.2 3.8 56 120-176 21-76 (224)
23 COG5336 Uncharacterized protei 30.4 1.2E+02 0.0026 21.9 4.3 26 148-173 69-94 (116)
24 PF04246 RseC_MucC: Positive r 30.2 1.3E+02 0.0028 21.9 4.8 23 151-173 98-120 (135)
25 PRK10720 uracil transporter; P 29.7 1.9E+02 0.0041 25.6 6.5 31 155-185 390-420 (428)
26 COG1296 AzlC Predicted branche 29.3 3.2E+02 0.0069 22.4 9.1 22 152-173 210-231 (238)
27 PF01306 LacY_symp: LacY proto 28.9 3.8E+02 0.0081 23.8 8.1 40 124-163 141-181 (412)
28 PF04892 VanZ: VanZ like famil 27.7 49 0.0011 23.7 2.1 29 148-176 104-132 (133)
29 PF14241 DUF4341: Domain of un 26.4 1.4E+02 0.0031 18.8 3.8 34 67-100 6-39 (62)
30 PRK10862 SoxR reducing system 25.6 2.2E+02 0.0048 21.5 5.4 16 157-172 107-122 (154)
31 TIGR02854 spore_II_GA sigma-E 24.5 4.2E+02 0.0091 22.1 10.4 42 38-83 11-53 (288)
32 PF04971 Lysis_S: Lysis protei 22.1 2.4E+02 0.0052 18.5 4.6 28 150-177 32-62 (68)
33 PF04632 FUSC: Fusaric acid re 21.7 6.3E+02 0.014 23.2 12.8 29 152-180 471-499 (650)
34 COG4980 GvpP Gas vesicle prote 21.1 92 0.002 22.6 2.4 27 153-179 8-34 (115)
35 PF14851 FAM176: FAM176 family 20.8 3E+02 0.0066 20.9 5.3 18 63-80 21-38 (153)
36 COG3086 RseC Positive regulato 20.5 2.8E+02 0.0061 21.0 4.9 25 149-173 103-127 (150)
37 PF02960 K1: K1 glycoprotein; 20.5 1E+02 0.0022 22.3 2.4 35 151-185 77-111 (130)
No 1
>PTZ00101 rhomboid-1 protease; Provisional
Probab=99.97 E-value=2.7e-30 Score=213.67 Aligned_cols=155 Identities=23% Similarity=0.316 Sum_probs=122.7
Q ss_pred cccccccccCCCCCCCchhhhhhhhcccCHHHHHHHHHHHHHHHHHHHHHhChhHHHHHHHHHHHHHHHHHHHHhcCCce
Q psy13894 8 NVCGMIPFYNVDSPDQFYRLWTSLFLHAGVIHLVISVIVQFILMRDLEKLTGSFRIAIIYFGSGIGGNLASAIFVPYRAD 87 (191)
Q Consensus 8 ~~~gl~p~~~~~~~~q~wrl~T~~f~H~~~~Hl~~N~~~l~~~g~~lE~~~G~~~~~~lyl~s~i~g~l~~~l~~~~~~~ 87 (191)
+.+++.|... .++||||++|++|+|.|+.|+++||+.++.+|..+|+.+|++|+..+|+++|+.|++++..+.+....
T Consensus 88 ~~Ga~~~~~i--~~gq~WRLiT~~FlH~~~~HLl~Nm~~l~~~G~~lE~~~G~~r~~ilYl~sGi~G~l~s~~~~~~~~s 165 (278)
T PTZ00101 88 TLGANVASRI--KQGEIHRLILPIFLHANIFHTFFNVFFQLRMGFTLEKNYGIVKIIILYFLTGIYGNILSSSVTYCPIK 165 (278)
T ss_pred HHhCcchhhh--hcCCCHHHHHHHHHccCHHHHHHHHHHHHHHHHHHHHHHChHHHHHHHHHHHHHHHHHHHHHccCCcE
Confidence 3445556433 47999999999999999999999999999999999999999999999999999999999998887789
Q ss_pred echhHHHHHHHHHHhhhCcchhHHHHHHhhhhcCCCchhhHHHHHHHHHHHHHH--HHHhc-cCchhhHHHHHHHHHHHH
Q psy13894 88 VSCLDNVCGMIPFYNVDSPDQFYRLWTSLFLHAGARPFFTYWITTVQIIILCLS--IFAYG-LGPFGFNLAHNSGLVVQL 164 (191)
Q Consensus 88 vGaSg~v~gl~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~--~~~~~-~~~~i~~~aHl~G~l~G~ 164 (191)
+||||++||++|+++.. ....|...+ .+.+ . +..+...... .+... ..|++|+.||++|+++|+
T Consensus 166 vGASgAifGLiGa~~~~----lil~w~~~~-----~~~~-~---~~~~i~~~li~~~l~~~~~g~~Id~~aHlGG~i~G~ 232 (278)
T PTZ00101 166 VGASTSGMGLLGIVTSE----LILLWHVIR-----HRER-V---VFNIIFFSLISFFYYFTFNGSNIDHVGHLGGLLSGI 232 (278)
T ss_pred EehhHHHHHHHHHHHHH----HHHHHHhhc-----cHHH-H---HHHHHHHHHHHHHHHHhcCCCCccHHHHHHHHHHHH
Confidence 99999999999999887 677776655 3322 1 1111111111 11122 258899999999999999
Q ss_pred HHhhhhcCCCccc
Q psy13894 165 EAGTTKARDDGNR 177 (191)
Q Consensus 165 l~g~~~~~~~~~~ 177 (191)
++|..+.++.+++
T Consensus 233 llg~~~~~~l~~~ 245 (278)
T PTZ00101 233 SMGILYNSQMENK 245 (278)
T ss_pred HHHHHHHhhhhhc
Confidence 9999987764433
No 2
>KOG2289|consensus
Probab=99.95 E-value=1.4e-28 Score=205.28 Aligned_cols=143 Identities=26% Similarity=0.478 Sum_probs=130.0
Q ss_pred CCCCCCCchhhhhhhhcccCHHHHHHHHHHHHHHHHHHHHHhChhHHHHHHHHHHHHHHHHHHHHhcCCceechhHHHHH
Q psy13894 17 NVDSPDQFYRLWTSLFLHAGVIHLVISVIVQFILMRDLEKLTGSFRIAIIYFGSGIGGNLASAIFVPYRADVSCLDNVCG 96 (191)
Q Consensus 17 ~~~~~~q~wrl~T~~f~H~~~~Hl~~N~~~l~~~g~~lE~~~G~~~~~~lyl~s~i~g~l~~~l~~~~~~~vGaSg~v~g 96 (191)
.+..++|+||++||+++|+|+.|+..||+.+.++|..+|..+|.+|+..+|++++++|++++++++++...|||||++||
T Consensus 112 ~~~~r~E~WRllTym~LHaGi~HL~~N~~~ql~iGi~LE~~~G~~RiglIYl~gg~aGSlls~l~d~~~~sVGASggvfa 191 (316)
T KOG2289|consen 112 KPVHRGELWRLLTYMWLHAGIFHLLLNMLSQLFIGIPLEQVHGFLRIGLIYLAGGVAGSLLSSLFDPNSISVGASGGVFA 191 (316)
T ss_pred ChhhhchhHHHHHHHHHhcCHHHHHHHHHHHHhccccHHhhcCceEEeeehhhhhhhhHHHHHHhccCCceecccHHHHH
Confidence 45678999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred HHHHHhhhCcchhHHHHHHhhhhcCCCchhhHHHHHHHHHHHHHHHHHhccCchhhHHHHHHHHHHHHHHhhhhcC
Q psy13894 97 MIPFYNVDSPDQFYRLWTSLFLHAGARPFFTYWITTVQIIILCLSIFAYGLGPFGFNLAHNSGLVVQLEAGTTKAR 172 (191)
Q Consensus 97 l~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~i~~~aHl~G~l~G~l~g~~~~~ 172 (191)
|+|+.+.. ..+||..++ .+.. .+..++.++...+.+|+.|.+++++|++|++.|..++++...
T Consensus 192 LlgA~Ls~----l~~Nw~~m~-----~~~~----~l~~ll~Ii~i~l~~G~~~~~~~~~h~gg~~~G~~~~fil~~ 254 (316)
T KOG2289|consen 192 LLGAHLSN----LLTNWTIMK-----NKFA----ALRTLLIIIFINLDLGFAPYVDNFAHIGGLLAGFLLGFVLHI 254 (316)
T ss_pred HHHHHHHH----HHhhHHHhc-----chHH----HHHHHHHHHHHHHhhccccceeccccccccCCCcchhHHhhh
Confidence 99999999 999999988 6655 455556556667778889999999999999999999988644
No 3
>KOG2290|consensus
Probab=99.94 E-value=9.1e-28 Score=205.07 Aligned_cols=158 Identities=49% Similarity=0.803 Sum_probs=145.4
Q ss_pred ccceeccccccccccCCCCCCCchhhhhhhhcccCHHHHHHHHHHHHHHHHHHHHHhChhHHHHHHHHHHHHHHHHHHHH
Q psy13894 2 VVSCLDNVCGMIPFYNVDSPDQFYRLWTSLFLHAGVIHLVISVIVQFILMRDLEKLTGSFRIAIIYFGSGIGGNLASAIF 81 (191)
Q Consensus 2 ~~~~l~~~~gl~p~~~~~~~~q~wrl~T~~f~H~~~~Hl~~N~~~l~~~g~~lE~~~G~~~~~~lyl~s~i~g~l~~~l~ 81 (191)
||+|+++.||++|+.++.++.|+||++||.|+|+|+.|++..+..++.+.+.+|+..|+.|..++|++||+.||+++.++
T Consensus 429 QVhC~d~VCGllPFln~e~PdQfYRL~~SLFlHagviH~~vSi~FQm~vmrdlEkL~g~~riAIiy~~SGitGNLASAIF 508 (652)
T KOG2290|consen 429 QVHCFDGVCGLLPFLNPEVPDQFYRLWLSLFLHAGVIHLLVSICFQMTVMRDLEKLAGWHRIAIIYFLSGITGNLASAIF 508 (652)
T ss_pred hhhhhhcccccccccCCCChhHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHhhcchhhheeeecccccccchheee
Confidence 89999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred hcCCceechhHHHHHHHHHHhhhCcchhHHHHHHhhhhcCCCchhhHHHHHHHHHHHHHHHHHhccCchhhHHHHHHHHH
Q psy13894 82 VPYRADVSCLDNVCGMIPFYNVDSPDQFYRLWTSLFLHAGARPFFTYWITTVQIIILCLSIFAYGLGPFGFNLAHNSGLV 161 (191)
Q Consensus 82 ~~~~~~vGaSg~v~gl~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~i~~~aHl~G~l 161 (191)
.|..+.||.||+-+|++++.++. ....|+.+. +|.+ ++..++..++. +..|+.|.+|+++|+.|++
T Consensus 509 lpY~~eVgPa~sQ~Gila~l~vE----l~qs~~il~-----~~w~----a~~~Lia~~L~-L~iGliPWiDN~aHlfG~i 574 (652)
T KOG2290|consen 509 LPYRAEVGPAGSQFGILACLFVE----LFQSWQILE-----RPWR----AFFHLIATLLV-LCIGLIPWIDNWAHLFGTI 574 (652)
T ss_pred eccccccCCcccccchHHHHHHH----HHhhhHhhh-----hHHH----HHHHHHHHHHH-HHhccccchhhHHHHHHHH
Confidence 99999999999999999999999 899999888 6554 55555554444 4449999999999999999
Q ss_pred HHHHHhhhhcCC
Q psy13894 162 VQLEAGTTKARD 173 (191)
Q Consensus 162 ~G~l~g~~~~~~ 173 (191)
.|++..+++.+.
T Consensus 575 ~GLl~s~~~~PY 586 (652)
T KOG2290|consen 575 FGLLTSIIFLPY 586 (652)
T ss_pred HHHHHHHHhhcc
Confidence 999999998886
No 4
>PRK10907 intramembrane serine protease GlpG; Provisional
Probab=99.94 E-value=3.6e-26 Score=189.30 Aligned_cols=144 Identities=17% Similarity=0.118 Sum_probs=108.5
Q ss_pred CCCCchhhhhhhhcccCHHHHHHHHHHHHHHHHHHHHHhChhHHHHHHHHHHHHHHHHHHHHhcCCceechhHHHHHHHH
Q psy13894 20 SPDQFYRLWTSLFLHAGVIHLVISVIVQFILMRDLEKLTGSFRIAIIYFGSGIGGNLASAIFVPYRADVSCLDNVCGMIP 99 (191)
Q Consensus 20 ~~~q~wrl~T~~f~H~~~~Hl~~N~~~l~~~g~~lE~~~G~~~~~~lyl~s~i~g~l~~~l~~~~~~~vGaSg~v~gl~~ 99 (191)
.++||||++|++|+|.|+.|+++||+.++.+|..+|+.+|+++++.+|+++++.|++.+.++.+ ...+|+||++||++|
T Consensus 131 ~~~q~WRl~T~~flH~~~~Hl~fNml~l~~lG~~iE~~~G~~~~l~l~l~s~i~~~~~~~~~~~-~~~gGaSGvVygL~g 209 (276)
T PRK10907 131 LKFELWRYFTHALLHFSLLHILFNLLWWWYLGGAVEKRLGSGKLIVITLISALLSGWVQSKFSG-PWFGGLSGVVYALMG 209 (276)
T ss_pred ccCCcHHHHhHHHHhCCHHHHHHHHHHHHHHHHHHHHHHChHHHHHHHHHHHHHHHHHHHHHcc-chhhHHHHHHHHHHH
Confidence 4799999999999999999999999999999999999999999999999999999999888764 468899999999999
Q ss_pred HHhhhCcchhHHHHHHhhhhcCCCchhhHHHHHHHHHHHHHHHHHhccCchhhHHHHHHHHHHHHHHhhhhcCCCcc
Q psy13894 100 FYNVDSPDQFYRLWTSLFLHAGARPFFTYWITTVQIIILCLSIFAYGLGPFGFNLAHNSGLVVQLEAGTTKARDDGN 176 (191)
Q Consensus 100 ~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~i~~~aHl~G~l~G~l~g~~~~~~~~~ 176 (191)
+..... .+ .+... ...|.. + +..++++....+...+.+++++.||++|+++|+++|+...+++++
T Consensus 210 ~~~~~~----~~-~p~~~---~~lp~~-~---~~f~llwl~~g~~~~~g~~Ian~AHlgGli~Gll~g~~~~~~~~~ 274 (276)
T PRK10907 210 YVWLRG----ER-DPQSG---IYLPRG-L---IAFALLWLVAGYFDLFGMSIANAAHVAGLAVGLAMAFWDTRNARK 274 (276)
T ss_pred HHHHHh----cc-ccccc---hhhhHH-H---HHHHHHHHHHHHHHccCcccHHHHHHHHHHHHHHHHHHhhhhhhc
Confidence 765541 11 00000 001111 1 112222222222222357899999999999999999988765443
No 5
>COG0705 Membrane associated serine protease [Amino acid transport and metabolism]
Probab=99.93 E-value=3e-25 Score=179.55 Aligned_cols=157 Identities=28% Similarity=0.414 Sum_probs=122.2
Q ss_pred cccccccCCCCCC--CchhhhhhhhcccCHHHHHHHHHHHHHHHHHHHHHhChhHHHHHHHHHHHHHHHHHHHHhcCC--
Q psy13894 10 CGMIPFYNVDSPD--QFYRLWTSLFLHAGVIHLVISVIVQFILMRDLEKLTGSFRIAIIYFGSGIGGNLASAIFVPYR-- 85 (191)
Q Consensus 10 ~gl~p~~~~~~~~--q~wrl~T~~f~H~~~~Hl~~N~~~l~~~g~~lE~~~G~~~~~~lyl~s~i~g~l~~~l~~~~~-- 85 (191)
.++.|........ |+||++|++|+|.|+.|+++||+.++.+|..+|+..|+.+++.+|+++|+.+++.+..+.|..
T Consensus 52 ~~~~~~~~~~~~~~~~~w~lit~~FlH~~~~Hll~N~~~l~~fg~~le~~~G~~~f~~~yl~~gl~~~~~~~~~~~~~~~ 131 (228)
T COG0705 52 FGLYPLNLLGALARDQLWRLITAIFLHAGFLHLLFNMLALWVFGSNLERRLGTLRFLLFYLLSGLLAGLAQVLFGPKGGA 131 (228)
T ss_pred HhhcchhhhccccccchHHHHHHHHHHhhHHHHHHHHHHHHHhhHHHHHHhchhHHHHHHHHHHHHHHHHHHHHcccccC
Confidence 4555643322222 899999999999999999999999999999999999999999999999999999999998765
Q ss_pred ceechhHHHHHHHHHHhhhCcchhHHHHHH-hhhhcCCCchhhHHHHHHHHHHHHHHHHHhcc---CchhhHHHHHHHHH
Q psy13894 86 ADVSCLDNVCGMIPFYNVDSPDQFYRLWTS-LFLHAGARPFFTYWITTVQIIILCLSIFAYGL---GPFGFNLAHNSGLV 161 (191)
Q Consensus 86 ~~vGaSg~v~gl~~~~~~~~p~~~~~~~~~-~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~---~~~i~~~aHl~G~l 161 (191)
+.+||||+++|++++++...|......... .+ ++. ...+.......++.+. .+++++.+|++|++
T Consensus 132 ~~~GASG~i~gllga~~~~~~~~~~~~~~~~~~-----~~~------~~~i~~~~~~~~~~~~~~~~~~va~~aHl~G~i 200 (228)
T COG0705 132 PSLGASGAIFGLLGAYFLLFPFARILLLFLSLP-----RPA------LILILIWLLYSLFSGAGSFGPSVAWSAHLGGLI 200 (228)
T ss_pred cccchhHHHHHHHHHHHHHccccchhhhhccCc-----hhH------HHHHHHHHHHHHHHHhcCCchHHHHHHHHHHHH
Confidence 799999999999999999988764443322 22 121 1223333333333333 23799999999999
Q ss_pred HHHHHhhhhcCCCccc
Q psy13894 162 VQLEAGTTKARDDGNR 177 (191)
Q Consensus 162 ~G~l~g~~~~~~~~~~ 177 (191)
.|.+++....++.++|
T Consensus 201 ~G~l~~~~~~~~~~~~ 216 (228)
T COG0705 201 GGLLLAALLSRKLRKR 216 (228)
T ss_pred HHHHHHHHHhhhhhhh
Confidence 9999999988875544
No 6
>PF01694 Rhomboid: Rhomboid family; InterPro: IPR022764 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This group of proteins contain serine peptidases belonging to the MEROPS peptidase family S54 (Rhomboid, clan ST). They are integral membrane proteins related to the Drosophila melanogaster (Fruit fly) rhomboid protein P20350 from SWISSPROT. Members of this family are found in archaea, bacteria and eukaryotes. The D. melanogaster rhomboid protease cleaves type-1 transmembrane domains using a catalytic triad composed of serine, histidine and asparagine contributed by different transmembrane domains. It cleaves the transmembrane proteins Spitz, Gurken and Keren within their transmembrane domains to release a soluble TGFalpha-like growth factor. Cleavage occurs in the Golgi, following translocation of the substrates from the endoplasmic reticulum membrane by Star, another transmembrane protein. The growth factors are then able to activate the epidermal growth factor receptor [, ]. Few substrates of mammalian rhomboid homologues have been determined, but rhomboid-like protein 2 (MEROPS S54.002) has been shown to cleave ephrin B3 []. Parasite-encoded rhomboid enzymes are also important for invasion of host cells by Toxoplasma and the malaria parasite. In Saccharomyces cerevisiae (Baker's yeast) the Pcp1 (MDM37) protein (MEROPS S54.007) is a mitochondrial endopeptidase required for the activation of cytochrome c peroxidase and for the processing of the mitochondrial dynamin-like protein Mgm1 [, ]. Mutations in Pcp1 result in cells have fragmented mitochondria, which have very few short tubulues []. This entry represents the 6 transmembrane helix rhomboid domain.; GO: 0004252 serine-type endopeptidase activity, 0016021 integral to membrane; PDB: 3UBB_A 3B45_A 3B44_A 2NRF_A 3TXT_A 2O7L_A 2XTU_A 2IRV_A 2XOW_A 2XTV_A ....
Probab=99.91 E-value=3e-25 Score=166.87 Aligned_cols=142 Identities=28% Similarity=0.382 Sum_probs=104.3
Q ss_pred CCCCchhhhhhhhcccCHHHHHHHHHHHHHHHHHHHHHhChhHHHHHHHHHHHHHHHHHHHHhcCC-ceechhHHHHHHH
Q psy13894 20 SPDQFYRLWTSLFLHAGVIHLVISVIVQFILMRDLEKLTGSFRIAIIYFGSGIGGNLASAIFVPYR-ADVSCLDNVCGMI 98 (191)
Q Consensus 20 ~~~q~wrl~T~~f~H~~~~Hl~~N~~~l~~~g~~lE~~~G~~~~~~lyl~s~i~g~l~~~l~~~~~-~~vGaSg~v~gl~ 98 (191)
+++|+||++|++|+|.|+.|++.|++.++.+|..+|+.+|++++..+|+.+++.+++...++.+.. ..+|+||+++|++
T Consensus 2 ~~~~~wrl~T~~f~h~~~~hl~~n~~~l~~~g~~lE~~~G~~~~~~~~l~~~~~~~l~~~~~~~~~~~~~G~Sg~~~~l~ 81 (145)
T PF01694_consen 2 QNGQWWRLFTSPFVHANFLHLLFNLLALWFFGSLLERRLGSRRFLALYLLSGLLGSLLSLLFSPPNQPYVGASGAVFGLL 81 (145)
T ss_dssp GCC-TTHHHHGGG--SSHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-S-----SSHHHHHHHH
T ss_pred CCCcchhhhHHHHHccCHHHHHHHHHHHHHhhhhHhhhccchHHHHHHHHHHHhhhhccccccccccccCCCcccchHHH
Confidence 478999999999999999999999999999999999999999999999999999999999988766 8999999999999
Q ss_pred HHHhhhCcchhHHHHHHhhhhcCCCchhhHHHHHHHHHHHHHHHHHhccCchhhHHHHHHHHHHHHHHhhhhcCCC
Q psy13894 99 PFYNVDSPDQFYRLWTSLFLHAGARPFFTYWITTVQIIILCLSIFAYGLGPFGFNLAHNSGLVVQLEAGTTKARDD 174 (191)
Q Consensus 99 ~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~i~~~aHl~G~l~G~l~g~~~~~~~ 174 (191)
++.+...|+.......... ... .........+..+..+++++.+|++|+++|++++..++|++
T Consensus 82 ~~~~~~~~~~~~~~~~~~~------~~~-------~~~~~~~~~~~~~~~~~~~~~~hl~G~~~G~~~~~~~~~~~ 144 (145)
T PF01694_consen 82 GAFLFLYPQNKKRLRFIYL------ALV-------VPIIVLVIILLLGFIPNISFLGHLGGFLAGLLYGFLILRRP 144 (145)
T ss_dssp HHHHHHHHCCCCCS---HC------CCC-------CCCCCCCHHHCTSSSSTTTHHHHHHHHHHHHHHHHHHCH--
T ss_pred HHHHHHHhhccchhhcchH------HHH-------HHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHcc
Confidence 9998885543111110000 000 00001112233344789999999999999999999988753
No 7
>KOG2632|consensus
Probab=99.74 E-value=2.5e-17 Score=133.22 Aligned_cols=138 Identities=20% Similarity=0.214 Sum_probs=109.2
Q ss_pred CCCCCchhhhhhhhcccCHHHHHHHHHHHHHHHHHHHHHhC-hhHHHHHHHHHHHHHHHHHHHHhc---------CCcee
Q psy13894 19 DSPDQFYRLWTSLFLHAGVIHLVISVIVQFILMRDLEKLTG-SFRIAIIYFGSGIGGNLASAIFVP---------YRADV 88 (191)
Q Consensus 19 ~~~~q~wrl~T~~f~H~~~~Hl~~N~~~l~~~g~~lE~~~G-~~~~~~lyl~s~i~g~l~~~l~~~---------~~~~v 88 (191)
..+.|.||++||+++|.+..|+++||++++.+|..+|+.+| +.+++....+.++..+++.+++.. +...+
T Consensus 48 l~~~ql~RL~Ty~l~H~s~~hllfnmlaL~~~g~~fE~~~G~t~~~l~~~~llalf~gIl~ll~~~~~~~~d~~~~~~a~ 127 (258)
T KOG2632|consen 48 LINWQLYRLITYALVHLSLPHLLFNMLALWPLGSQFERTHGTTVRILMFTVLLALFSGILYLLAYHVFLLSDLVYVEGAI 127 (258)
T ss_pred hhhHHHHHHHHHHHHhccHHHHHHHHHHHHhchhHHHhhccceehHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhcccc
Confidence 46899999999999999999999999999999999999999 899999999999999998877642 13579
Q ss_pred chhHHHHHHHHHHhhhCcchhHHHHHHhhhhcCCCchhhHHHHHHHHHHHHHHHHHhccCchhhHHHHHHHHHHHHHHhh
Q psy13894 89 SCLDNVCGMIPFYNVDSPDQFYRLWTSLFLHAGARPFFTYWITTVQIIILCLSIFAYGLGPFGFNLAHNSGLVVQLEAGT 168 (191)
Q Consensus 89 GaSg~v~gl~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~i~~~aHl~G~l~G~l~g~ 168 (191)
|.|+..|++++...+..|......+..+. .|.+ +..+..+..+.-+.|+.|+.+|++|+++|+.+++
T Consensus 128 G~s~v~Fam~~~~~~~sp~r~~~~fg~~s-----iP~~--------l~Pw~lLi~~~~lvp~aSFlghl~GllvG~ay~~ 194 (258)
T KOG2632|consen 128 GFSGVLFAMMAVLEVQSPVRSRSVFGLFS-----IPIV--------LAPWALLIATQILVPQASFLGHLCGLLVGYAYAF 194 (258)
T ss_pred cccHHHHHHHHHHhhcCcccchhhccccc-----ccHH--------HHHHHHHHHHHHHccCchHHHHHHHHHHHHHHHH
Confidence 99999999999988887765433333333 3433 1111122222223699999999999999999999
Q ss_pred h
Q psy13894 169 T 169 (191)
Q Consensus 169 ~ 169 (191)
.
T Consensus 195 ~ 195 (258)
T KOG2632|consen 195 S 195 (258)
T ss_pred H
Confidence 3
No 8
>PF08551 DUF1751: Eukaryotic integral membrane protein (DUF1751); InterPro: IPR013861 This entry is found in eukaryotic integral membrane proteins. Q12239 from SWISSPROT, a Saccharomyces cerevisiae (Baker's yeast) protein, has been shown to localise COP II vesicles [].
Probab=98.54 E-value=2.9e-07 Score=65.34 Aligned_cols=59 Identities=24% Similarity=0.283 Sum_probs=55.3
Q ss_pred CchhhhhhhhcccCHHHHHHHHHHHHHHHHHHHHHhChhHHHHHHHHHHHHHHHHHHHH
Q psy13894 23 QFYRLWTSLFLHAGVIHLVISVIVQFILMRDLEKLTGSFRIAIIYFGSGIGGNLASAIF 81 (191)
Q Consensus 23 q~wrl~T~~f~H~~~~Hl~~N~~~l~~~g~~lE~~~G~~~~~~lyl~s~i~g~l~~~l~ 81 (191)
++||++|+.|++.++..++.|.+.++..|+.+|+.+|++.++....+..+++|+...+.
T Consensus 7 ~pWtl~T~~fve~~i~~~l~~~~~l~~~g~~lE~~WGs~E~lkFi~vv~~~tnl~~~~~ 65 (99)
T PF08551_consen 7 YPWTLFTAGFVETNIIGLLFSLLTLFYGGRYLEPIWGSREFLKFILVVNVITNLLTFLL 65 (99)
T ss_pred ehHHHHHHHHHHhHHHHHHHHHHHHHHhhHHHHHhcChHHHHHHHHHHHHHhHHHHHHH
Confidence 79999999999999999999999999999999999999999999999999988877554
No 9
>KOG2980|consensus
Probab=98.36 E-value=2.4e-08 Score=82.70 Aligned_cols=144 Identities=17% Similarity=0.138 Sum_probs=98.0
Q ss_pred CchhhhhhhhcccCHHHHHHHHHHHHHHHH-HHHHHhChhHHHHHHHHHHHHHHHHHHHH-hc---CCceechhHHHHHH
Q psy13894 23 QFYRLWTSLFLHAGVIHLVISVIVQFILMR-DLEKLTGSFRIAIIYFGSGIGGNLASAIF-VP---YRADVSCLDNVCGM 97 (191)
Q Consensus 23 q~wrl~T~~f~H~~~~Hl~~N~~~l~~~g~-~lE~~~G~~~~~~lyl~s~i~g~l~~~l~-~~---~~~~vGaSg~v~gl 97 (191)
--|.+++|.|.|.+..|+-.||+.+..+.. .+....|...+..+|+.++..|..+...- .+ ..+..||||+++++
T Consensus 154 ~~w~i~~s~Fsh~~a~h~g~~~~~~~~y~~~a~~~~~~~~~~~AlylSa~~~~~~i~~~~~v~~~~~gp~LGAsGav~ai 233 (310)
T KOG2980|consen 154 GCWKIILSTFSHYSALHLGPNMLVLKSYLAGALKGSLGFSSFFALYLSAGVKGLFISVKDKVPTSWAGPSLGASGAVYAI 233 (310)
T ss_pred cceeEEeehhcchhHhhhcHHHHHHHHHhcccccCCcchhhcccceeccccccceeEeeccccccccccccccchHHHHH
Confidence 346699999999999999999999888887 77888999999999995555555443221 12 34679999999999
Q ss_pred HHHHhhhCcchhHHHHHHhhhhcCCCchhhHHHHHHHHHHHHHHHHHhccCchhhHHHHHHHHHHHHHHhhhhcCC
Q psy13894 98 IPFYNVDSPDQFYRLWTSLFLHAGARPFFTYWITTVQIIILCLSIFAYGLGPFGFNLAHNSGLVVQLEAGTTKARD 173 (191)
Q Consensus 98 ~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~i~~~aHl~G~l~G~l~g~~~~~~ 173 (191)
.++....+|+.........+ .+.- .+..+-.+....+..+.++. -..++.+|++|-+.|..++...-+.
T Consensus 234 ~a~~~~lfP~~~~~i~f~~~-----v~~g-a~~~~~~i~~~~~a~~~l~~-~~~n~~Ah~~gsl~Gv~va~~~~~r 302 (310)
T KOG2980|consen 234 LALDCTLFPKTTLYILFVFP-----VPAG-AGLAFKAIAAYDFAGLILGW-GFFNHAAHLSGSLFGVVVATYLWAR 302 (310)
T ss_pred HHHHhhcCcCcceeEEEeec-----cccc-chhHHHHHHHhhhcceeecc-ccchhHhhhcchHHHHHHHHHHHHH
Confidence 99999999976554433334 2221 11111111111111222222 2456679999999999999887443
No 10
>PF04511 DER1: Der1-like family; InterPro: IPR007599 The endoplasmic reticulum (ER) of the yeast Saccharomyces cerevisiae (Baker's yeast) contains a proteolytic system able to selectively degrade misfolded lumenal secretory proteins. For examination of the components involved in this degradation process, mutants were isolated. They could be divided into four complementation groups. The mutations led to stabilisation of two different substrates for this process, and the classes were called der for degradation in the ER. DER1 was cloned by complementation of the der1-2 mutation. The DER1 gene codes for a novel, hydrophobic protein that is localized to the ER. Deletion of DER1 abolished degradation of the substrate proteins, suggesting that the function of the Der1 protein may be specifically required for the degradation process associated with the ER []. Interestingly this family seems distantly related to the Rhomboid family of membrane peptidases. This family may also mediate degradation of misfolded proteins.
Probab=98.24 E-value=5.9e-06 Score=65.58 Aligned_cols=137 Identities=11% Similarity=0.086 Sum_probs=84.2
Q ss_pred CCCCCchhhhhhhhccc-CHHHHHHHHHHHHHHHHHHHHH-hCh--hHHHHHHHHHHHHHHHHHHHHhcC--CceechhH
Q psy13894 19 DSPDQFYRLWTSLFLHA-GVIHLVISVIVQFILMRDLEKL-TGS--FRIAIIYFGSGIGGNLASAIFVPY--RADVSCLD 92 (191)
Q Consensus 19 ~~~~q~wrl~T~~f~H~-~~~Hl~~N~~~l~~~g~~lE~~-~G~--~~~~~lyl~s~i~g~l~~~l~~~~--~~~vGaSg 92 (191)
.++.|+||++|+.|.-. .-.+.++|++.++..++.+|+. +.. ..++...+.++..-.+++.+..+. +...-.+.
T Consensus 37 ~~~~q~WRl~Tsff~~g~~~~~~l~~~~~l~~~s~~LE~~~f~~~~ady~~~ll~~~~~i~~~~~~~~~~~~~~~~l~~~ 116 (197)
T PF04511_consen 37 FKKFQIWRLFTSFFYFGPFSLNFLFNLYFLYQYSSSLEEGHFQGRSADYLWFLLFGASLILILSLLIGPYFFNIPFLGSS 116 (197)
T ss_pred hhhcCceeeEEEEEEEcCCCHHHHHHHHHHHHHhhHhccCCCCCCHHHHHHHHHHHHHHHHHHHHhhccchhHHHHHHHH
Confidence 35789999999999864 4479999999999999999998 332 467666565555555555443321 11222344
Q ss_pred HHHHHHHHHhhhCcchhHHHHHHhhhhcCCCchhhHHHHHHHHHHHHHHHHHhccCchhhHHHHHHHHHHHHHHhhh
Q psy13894 93 NVCGMIPFYNVDSPDQFYRLWTSLFLHAGARPFFTYWITTVQIIILCLSIFAYGLGPFGFNLAHNSGLVVQLEAGTT 169 (191)
Q Consensus 93 ~v~gl~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~i~~~aHl~G~l~G~l~g~~ 169 (191)
-...+.=.+.-..|++.+..+...+ .+.+ |+ ..+.. ....+ . .+.+...++-|+++|-++-+.
T Consensus 117 l~~~l~Y~wsr~np~~~v~~~g~~~-----i~a~--yl--P~~~~-~~~~l-~---~~~~~~~~l~Gi~~Ghly~fl 179 (197)
T PF04511_consen 117 LSFALTYIWSRKNPNAQVSFFGLFT-----IKAK--YL--PWVLL-AFSLL-F---GGSSPIPDLLGILVGHLYYFL 179 (197)
T ss_pred HHHHHHHHHHHhCcccceeeEEEEE-----EChh--hH--HHHHH-HHHHH-h---CCCcHHHHHHHHHHHHHHHHH
Confidence 5555555666666766665544333 2322 22 11111 11111 1 223566999999999998877
No 11
>KOG2890|consensus
Probab=98.02 E-value=2.9e-05 Score=64.59 Aligned_cols=141 Identities=14% Similarity=0.019 Sum_probs=96.2
Q ss_pred CCchhhhhhhhcccCHHHHHHHHHHHHHHHHHHHHHhChhHHHHHHHHHHHHHHHHHHHH-------hcC-----Cceec
Q psy13894 22 DQFYRLWTSLFLHAGVIHLVISVIVQFILMRDLEKLTGSFRIAIIYFGSGIGGNLASAIF-------VPY-----RADVS 89 (191)
Q Consensus 22 ~q~wrl~T~~f~H~~~~Hl~~N~~~l~~~g~~lE~~~G~~~~~~lyl~s~i~g~l~~~l~-------~~~-----~~~vG 89 (191)
-..|+++|+.|+-.+++-.+.|.+.+.+-|..+|+.+|+..++..|.+.-...+++.... ..+ .+..|
T Consensus 65 ~~~WtliTs~fie~~vw~V~~sv~~L~v~G~~lEp~Wg~~e~lkff~ivn~~~~l~v~v~~~l~Y~it~n~v~L~~~i~G 144 (326)
T KOG2890|consen 65 FFPWTLITSGFIELNVWDVLVSVLTLSVGGKFLEPNWGSLELLKFFAIVNGSTTLVVLVPALLLYMITDNHVYLYIPIHG 144 (326)
T ss_pred hhhHHHHhcchhhhhHHHHHHHHHheeecceeeccCCCCHHHHHHHHHhhchhHHHHHHHHHHHHHHhcCceEEEEEecc
Confidence 379999999999999999999999999999999999999999988877655544433221 111 25679
Q ss_pred hhHHHHHHHHHHhhhCcchhHHHHHHhhhhcCCCchhhHHHHHHHHHHHHHHHHHhccCchhhHHHHHHHHHHHHHHhhh
Q psy13894 90 CLDNVCGMIPFYNVDSPDQFYRLWTSLFLHAGARPFFTYWITTVQIIILCLSIFAYGLGPFGFNLAHNSGLVVQLEAGTT 169 (191)
Q Consensus 90 aSg~v~gl~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~i~~~aHl~G~l~G~l~g~~ 169 (191)
..|...|++.++=-..||..+..-+.-+...+.-|. ..++...++.+ -.....+.+..+..|...+|.
T Consensus 145 ~~gilaGilVa~kQllpd~~il~~~~~r~~~~~lP~-------~~l~~~~il~i-----~~f~~f~~l~s~~~g~~~sWt 212 (326)
T KOG2890|consen 145 TTGILAGILVAWKQLLPDTIILELKSGRFLYAHLPL-------LVLFLSLILSI-----ITFLVFASLPSITFGVLVSWT 212 (326)
T ss_pred chHHHHHHHHHHHHHcCceeEEeccchhhhhhhCCH-------HHHHHHHHHHH-----HHHHHhhhhHHHHHhhhhhhh
Confidence 999999999999888898855443222211111121 12222222222 123446666777778888888
Q ss_pred hcCCC
Q psy13894 170 KARDD 174 (191)
Q Consensus 170 ~~~~~ 174 (191)
+.|.-
T Consensus 213 YLRfy 217 (326)
T KOG2890|consen 213 YLRFY 217 (326)
T ss_pred hheec
Confidence 77753
No 12
>KOG0858|consensus
Probab=97.60 E-value=0.00019 Score=58.02 Aligned_cols=61 Identities=13% Similarity=0.097 Sum_probs=51.3
Q ss_pred CCCCCchhhhhhhhcccC-HHHHHHHHHHHHHHHHHHHHHh---ChhHHHHHHHHHHHHHHHHHH
Q psy13894 19 DSPDQFYRLWTSLFLHAG-VIHLVISVIVQFILMRDLEKLT---GSFRIAIIYFGSGIGGNLASA 79 (191)
Q Consensus 19 ~~~~q~wrl~T~~f~H~~-~~Hl~~N~~~l~~~g~~lE~~~---G~~~~~~lyl~s~i~g~l~~~ 79 (191)
.++.|+||++|+.+.-.. -.|.++||+.++--.+.+|+-. .+.+|+.+.+.+++.-.+.+.
T Consensus 48 ~kk~QiWRliTs~lyfg~~gf~fl~n~~FlyrY~~~LE~g~f~~rtadf~~mllf~~~l~~~~~~ 112 (239)
T KOG0858|consen 48 FKKFQIWRLITSFLYFGPFGFDFLMNLYFLYRYSSMLEEGSFRGRTADFLYMLLFGAVLLTLTGL 112 (239)
T ss_pred HhHhHHHHhhhhhheeccccHHHHHHHHHHHHHHHHHhcCCCCCchhHHHHHHHHHHHHHHHHHH
Confidence 568999999999999866 7999999999999999999943 347888888888887776554
No 13
>COG5291 Predicted membrane protein [Function unknown]
Probab=95.66 E-value=0.0047 Score=50.34 Aligned_cols=43 Identities=21% Similarity=0.277 Sum_probs=35.4
Q ss_pred CCCCCchhhhhhhhccc-CHHHHHHHHHHHHHHHHHHHH-HhChh
Q psy13894 19 DSPDQFYRLWTSLFLHA-GVIHLVISVIVQFILMRDLEK-LTGSF 61 (191)
Q Consensus 19 ~~~~q~wrl~T~~f~H~-~~~Hl~~N~~~l~~~g~~lE~-~~G~~ 61 (191)
.++.||||++|+...-. --+..++|++.++--.+.+|+ .+++.
T Consensus 55 ~k~~qiwRlfTs~~~~~~~~~d~~M~vyf~Y~yS~~LE~g~f~~~ 99 (313)
T COG5291 55 LKRLQIWRLFTSFLYFGKPTLDMFMHVYFLYRYSRMLEEGCFNTS 99 (313)
T ss_pred HHHHHHHHHHHHHHhhcCcchhHHHHHHHHHHHHHHHhccccCcc
Confidence 46899999999777664 568899999999999999998 34544
No 14
>KOG4463|consensus
Probab=95.15 E-value=0.035 Score=45.80 Aligned_cols=61 Identities=25% Similarity=0.371 Sum_probs=50.7
Q ss_pred CCCCchhhhhhhhcccCHHHHHHHHHHHHHHHHHHHHHhChhHHHHHHHHHHHHHHHHHHHH
Q psy13894 20 SPDQFYRLWTSLFLHAGVIHLVISVIVQFILMRDLEKLTGSFRIAIIYFGSGIGGNLASAIF 81 (191)
Q Consensus 20 ~~~q~wrl~T~~f~H~~~~Hl~~N~~~l~~~g~~lE~~~G~~~~~~lyl~s~i~g~l~~~l~ 81 (191)
+..||||++.+.|...+--.+..-++.++.+ +.+||..|+.|+..+-+.++..+.+....+
T Consensus 47 ~y~qywrlL~~qF~~~n~~e~~~~l~I~Y~f-R~~ERlLGShky~~fiv~s~~~~~l~~~il 107 (323)
T KOG4463|consen 47 KYFQYWRLLMSQFAFSNTPELMFGLYILYYF-RVFERLLGSHKYSVFIVFSGTVSLLLEVIL 107 (323)
T ss_pred HHHHHHHHHHHHHHhcCChHHHHHHHHHHHH-HHHHHHhccccceeehhHHHHHHHHHHHHH
Confidence 4579999999999999888888877666655 779999999999998888888887766544
No 15
>PF12732 YtxH: YtxH-like protein; InterPro: IPR024623 This family of uncharacterised proteins is found in bacteria. Proteins in this family are typically between 100 and 143 amino acids in length. The N-terminal region is the most conserved.
Probab=65.99 E-value=6 Score=26.04 Aligned_cols=29 Identities=24% Similarity=0.341 Sum_probs=22.7
Q ss_pred HHHHHHHHHHHHHhhhhcCCCccchhccc
Q psy13894 154 LAHNSGLVVQLEAGTTKARDDGNRTKQDM 182 (191)
Q Consensus 154 ~aHl~G~l~G~l~g~~~~~~~~~~~~~~~ 182 (191)
.+-+.|.++|.+.|+++-|...+..|+++
T Consensus 3 ~g~l~Ga~~Ga~~glL~aP~sG~e~R~~l 31 (74)
T PF12732_consen 3 LGFLAGAAAGAAAGLLFAPKSGKETREKL 31 (74)
T ss_pred HHHHHHHHHHHHHHHHhCCCCcHHHHHHH
Confidence 45678999999999999888776666654
No 16
>COG0705 Membrane associated serine protease [Amino acid transport and metabolism]
Probab=49.80 E-value=31 Score=27.40 Aligned_cols=72 Identities=15% Similarity=0.055 Sum_probs=52.2
Q ss_pred CCCCCchhhhhhhhcccCHHHHHHHHHHHHHHHHHHHHHhChhHHHHHHHHHHHHHHHHHHHHhcCCceechhHHHHHHH
Q psy13894 19 DSPDQFYRLWTSLFLHAGVIHLVISVIVQFILMRDLEKLTGSFRIAIIYFGSGIGGNLASAIFVPYRADVSCLDNVCGMI 98 (191)
Q Consensus 19 ~~~~q~wrl~T~~f~H~~~~Hl~~N~~~l~~~g~~lE~~~G~~~~~~lyl~s~i~g~l~~~l~~~~~~~vGaSg~v~gl~ 98 (191)
.-++++|+++++.++|....|...+... ..+...+++...+..+++....++. ..++.++-+.|++
T Consensus 135 GASG~i~gllga~~~~~~~~~~~~~~~~-------------~~~~~~~~i~~~~~~~~~~~~~~~~-~~va~~aHl~G~i 200 (228)
T COG0705 135 GASGAIFGLLGAYFLLFPFARILLLFLS-------------LPRPALILILIWLLYSLFSGAGSFG-PSVAWSAHLGGLI 200 (228)
T ss_pred chhHHHHHHHHHHHHHccccchhhhhcc-------------CchhHHHHHHHHHHHHHHHHhcCCc-hHHHHHHHHHHHH
Confidence 4578899999999999888877776644 4445567777777777777665543 5788889999998
Q ss_pred HHHhhh
Q psy13894 99 PFYNVD 104 (191)
Q Consensus 99 ~~~~~~ 104 (191)
+.....
T Consensus 201 ~G~l~~ 206 (228)
T COG0705 201 GGLLLA 206 (228)
T ss_pred HHHHHH
Confidence 655444
No 17
>PF06946 Phage_holin_5: Phage holin; InterPro: IPR009708 This entry represents the Bacteriophage A118, holin protein. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This protein family represent one of a large number of mutually dissimilar families of phage holins. It is thought that the temporal precision of holin-mediated lysis may occur through the build-up of a holin oligomer which causes the lysis [].
Probab=47.92 E-value=81 Score=21.98 Aligned_cols=17 Identities=18% Similarity=-0.052 Sum_probs=11.5
Q ss_pred HHHHHHHHHHhhhhcCC
Q psy13894 157 NSGLVVQLEAGTTKARD 173 (191)
Q Consensus 157 l~G~l~G~l~g~~~~~~ 173 (191)
.+|.++|+....++...
T Consensus 65 ~aG~laGlAaTGL~e~~ 81 (93)
T PF06946_consen 65 WAGGLAGLAATGLFEQF 81 (93)
T ss_pred HHHHHhhhhhhhHHHHH
Confidence 56888888777665443
No 18
>PF09527 ATPase_gene1: Putative F0F1-ATPase subunit (ATPase_gene1)
Probab=46.74 E-value=66 Score=19.58 Aligned_cols=41 Identities=22% Similarity=0.289 Sum_probs=30.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHhCh-hHHHHHHHHHHHHHHHHH
Q psy13894 38 IHLVISVIVQFILMRDLEKLTGS-FRIAIIYFGSGIGGNLAS 78 (191)
Q Consensus 38 ~Hl~~N~~~l~~~g~~lE~~~G~-~~~~~lyl~s~i~g~l~~ 78 (191)
..+..+++.-..+|..+++.+++ ..+..+.++-|+.+++-.
T Consensus 8 ~~~~~~i~~g~~~G~~lD~~~~t~p~~~~~g~llG~~~g~~~ 49 (55)
T PF09527_consen 8 FTMAAPILVGFFLGYWLDKWFGTSPWFTLIGLLLGIAAGFYN 49 (55)
T ss_pred HHHHHHHHHHHHHHHHHHHHcCCChHHHHHHHHHHHHHHHHH
Confidence 35677888888999999999998 555566666666666544
No 19
>PF03419 Peptidase_U4: Sporulation factor SpoIIGA This family belongs to family U4 of the peptidase classification.; InterPro: IPR005081 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. The peptidases families associated with clan U- have an unknown catalytic mechanism as the protein fold of the active site domain and the active site residues have not been reported. This group of peptidases belong to the MEROPS peptidase family U4 (SpoIIGA peptidase family, clan U-). Sporulation in bacteria such as Bacillus subtilis involves the formation of a polar septum, which divides the sporangium into a mother cell and a forespore. The sigma E factor, which is encoded within the spoIIG operon, is a cell-specific regulatory protein that directs gene transcription in the mother cell. Sigma E is synthesised as an inactive proprotein pro-sigma E, which is converted to the mature factor by the putative processing enzyme SpoIIGA []. ; GO: 0004190 aspartic-type endopeptidase activity, 0006508 proteolysis, 0030436 asexual sporulation
Probab=46.42 E-value=1.7e+02 Score=24.32 Aligned_cols=37 Identities=16% Similarity=0.310 Sum_probs=28.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHhChhHHHHHHHHHHHHHHHHH
Q psy13894 38 IHLVISVIVQFILMRDLEKLTGSFRIAIIYFGSGIGGNLAS 78 (191)
Q Consensus 38 ~Hl~~N~~~l~~~g~~lE~~~G~~~~~~lyl~s~i~g~l~~ 78 (191)
..+++|.+.++..+..+.+....+|.+ +++.+|++.+
T Consensus 11 ~N~~md~~lL~~t~~~~~~~~~~~Rll----~~A~~Gal~~ 47 (293)
T PF03419_consen 11 VNFLMDYFLLWLTARLLKRRASRWRLL----LGAAIGALYS 47 (293)
T ss_pred HHHHHHHHHHHHHHHHhcCCCcHHHHH----HHHHHHHHHH
Confidence 567899999999999998888888874 4555555544
No 20
>COG3105 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=39.37 E-value=25 Score=26.05 Aligned_cols=22 Identities=32% Similarity=0.096 Sum_probs=12.7
Q ss_pred hHHHHHHHHHHHHHHhhhhcCC
Q psy13894 152 FNLAHNSGLVVQLEAGTTKARD 173 (191)
Q Consensus 152 ~~~aHl~G~l~G~l~g~~~~~~ 173 (191)
.|..-+-|+++|+++|+++.|-
T Consensus 7 ~W~~a~igLvvGi~IG~li~Rl 28 (138)
T COG3105 7 TWEYALIGLVVGIIIGALIARL 28 (138)
T ss_pred HHHHHHHHHHHHHHHHHHHHHH
Confidence 3445555666666666665554
No 21
>COG4769 Predicted membrane protein [Function unknown]
Probab=37.12 E-value=65 Score=25.05 Aligned_cols=28 Identities=25% Similarity=0.036 Sum_probs=19.9
Q ss_pred CchhhHHHHHHHHHHHHHHhhhhcCCCc
Q psy13894 148 GPFGFNLAHNSGLVVQLEAGTTKARDDG 175 (191)
Q Consensus 148 ~~~i~~~aHl~G~l~G~l~g~~~~~~~~ 175 (191)
.|-....+-+.|...|.+........+.
T Consensus 139 lPll~flGivsG~~vg~~~~~~i~~v~~ 166 (181)
T COG4769 139 LPLLIFLGIVSGTAVGILANTLIITVKI 166 (181)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4666777888888888888777655433
No 22
>PF13829 DUF4191: Domain of unknown function (DUF4191)
Probab=33.37 E-value=66 Score=26.16 Aligned_cols=56 Identities=11% Similarity=-0.008 Sum_probs=31.9
Q ss_pred cCCCchhhHHHHHHHHHHHHHHHHHhccCchhhHHHHHHHHHHHHHHhhhhcCCCcc
Q psy13894 120 AGARPFFTYWITTVQIIILCLSIFAYGLGPFGFNLAHNSGLVVQLEAGTTKARDDGN 176 (191)
Q Consensus 120 ~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~i~~~aHl~G~l~G~l~g~~~~~~~~~ 176 (191)
.+..|...-|+ +..++..++..+..++.-+.-+.--+.|++.|++.+.+..-++-+
T Consensus 21 rk~dp~l~~~m-l~a~l~~~~v~v~ig~l~~~~~~~~i~gi~~g~l~am~vl~rra~ 76 (224)
T PF13829_consen 21 RKEDPKLPWLM-LGAFLGPIAVFVLIGLLFGSWWYWLIIGILLGLLAAMIVLSRRAQ 76 (224)
T ss_pred HHHCcchHHHH-HHHHHHHHHHHHHHHHHHccHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33355542232 333444444444455543455677788999999998887664433
No 23
>COG5336 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=30.36 E-value=1.2e+02 Score=21.87 Aligned_cols=26 Identities=12% Similarity=-0.152 Sum_probs=19.6
Q ss_pred CchhhHHHHHHHHHHHHHHhhhhcCC
Q psy13894 148 GPFGFNLAHNSGLVVQLEAGTTKARD 173 (191)
Q Consensus 148 ~~~i~~~aHl~G~l~G~l~g~~~~~~ 173 (191)
.-+.+-++.+...+.|+..|+.-.++
T Consensus 69 ~agTsPwglIv~lllGf~AG~lnv~R 94 (116)
T COG5336 69 FAGTSPWGLIVFLLLGFGAGVLNVLR 94 (116)
T ss_pred hcCCCcHHHHHHHHHHHHHHHHHHHH
Confidence 34566688888999999998886554
No 24
>PF04246 RseC_MucC: Positive regulator of sigma(E), RseC/MucC; InterPro: IPR007359 This bacterial family of integral membrane proteins represents a positive regulator of the sigma(E) transcription factor, namely RseC/MucC. The sigma(E) transcription factor is up-regulated by cell envelope protein misfolding, and regulates the expression of genes that are collectively termed ECF (devoted to Extra-Cellular Functions) []. In Pseudomonas aeruginosa, derepression of sigma(E) is associated with the alginate-overproducing phenotype characteristic of chronic respiratory tract colonization in cystic fibrosis patients. The mechanism by which RseC/MucC positively regulates the sigma(E) transcription factor is unknown. RseC is also thought to have a role in thiamine biosynthesis in Salmonella typhimurium []. In addition, this family also includes an N-terminal part of RnfF, a Rhodobacter capsulatus protein, of unknown function, that is essential for nitrogen fixation. This protein also contains a domain found in ApbE protein IPR003374 from INTERPRO, which is itself involved in thiamine biosynthesis.
Probab=30.23 E-value=1.3e+02 Score=21.89 Aligned_cols=23 Identities=17% Similarity=-0.089 Sum_probs=12.3
Q ss_pred hhHHHHHHHHHHHHHHhhhhcCC
Q psy13894 151 GFNLAHNSGLVVQLEAGTTKARD 173 (191)
Q Consensus 151 i~~~aHl~G~l~G~l~g~~~~~~ 173 (191)
.+..+-+.|++.|+++...+.++
T Consensus 98 ~~~l~~l~~l~~~~~~~~~~~~~ 120 (135)
T PF04246_consen 98 WAILGGLLGLALGFLILRLFDRR 120 (135)
T ss_pred HHHHHHHHHHHHHHHHHHHHHhh
Confidence 44455555555555555555444
No 25
>PRK10720 uracil transporter; Provisional
Probab=29.74 E-value=1.9e+02 Score=25.64 Aligned_cols=31 Identities=6% Similarity=-0.048 Sum_probs=17.9
Q ss_pred HHHHHHHHHHHHhhhhcCCCccchhcccCCC
Q psy13894 155 AHNSGLVVQLEAGTTKARDDGNRTKQDMGSD 185 (191)
Q Consensus 155 aHl~G~l~G~l~g~~~~~~~~~~~~~~~~~~ 185 (191)
+-.-|.++|.++-.++..++|.|.-++.++.
T Consensus 390 gi~~g~~~ai~Lnlll~~~~~~~~~~~~~~~ 420 (428)
T PRK10720 390 GMALATIVGIGLSLIFKLISKLRPEEEVLDA 420 (428)
T ss_pred cHHHHHHHHHHHHHHhcccccccCCcccccc
Confidence 4445677777777776655555544444433
No 26
>COG1296 AzlC Predicted branched-chain amino acid permease (azaleucine resistance) [Amino acid transport and metabolism]
Probab=29.25 E-value=3.2e+02 Score=22.36 Aligned_cols=22 Identities=23% Similarity=0.058 Sum_probs=14.5
Q ss_pred hHHHHHHHHHHHHHHhhhhcCC
Q psy13894 152 FNLAHNSGLVVQLEAGTTKARD 173 (191)
Q Consensus 152 ~~~aHl~G~l~G~l~g~~~~~~ 173 (191)
..+.-+.|.++|.+...+..+.
T Consensus 210 ~~~~v~~~~la~l~~~~l~~~~ 231 (238)
T COG1296 210 GPWAVLAGILAGLLAALLLAAK 231 (238)
T ss_pred HHHHHHHHHHHHHHHHHHhccc
Confidence 4455666778887777775543
No 27
>PF01306 LacY_symp: LacY proton/sugar symporter; InterPro: IPR022814 In bacteria there are a number of families of transport proteins, including symporters and antiporters, that mediate the intake of a variety of sugars with the concomitant uptake of hydrogen ions (proton symporters) []. The lacY family of Escherichia coli and Klebsiella pneumoniae are proton/beta-galactoside symporters, which, like most sugar transporters, are integral membrane proteins with 12 predicted transmembrane (TM) regions. Also similar to the lacY family are the rafinose (rafB) and sucrose (cscB) permeases from E. coli []. This entry also includes bacterial phenylproprionate permease.; PDB: 1PV7_B 1PV6_A 2Y5Y_B 2CFQ_A 2V8N_B 2CFP_A.
Probab=28.86 E-value=3.8e+02 Score=23.83 Aligned_cols=40 Identities=13% Similarity=0.101 Sum_probs=20.6
Q ss_pred chhhHHHHHHHHHHHHHHHHHhccCchhhHH-HHHHHHHHH
Q psy13894 124 PFFTYWITTVQIIILCLSIFAYGLGPFGFNL-AHNSGLVVQ 163 (191)
Q Consensus 124 ~~~~~~~~l~~~~~~~~~~~~~~~~~~i~~~-aHl~G~l~G 163 (191)
-..|.|=++...+...+........|+...+ +...+.+..
T Consensus 141 G~~R~wGSig~ai~s~~~G~L~~i~p~~~fwi~s~~~~il~ 181 (412)
T PF01306_consen 141 GRARMWGSIGFAIASLLAGILFNINPNIIFWIASAAAIILL 181 (412)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred chHHHHhhHHHHHHHHHhheeeeeCccHHHHHHHHHHHHHH
Confidence 3445665555555444444444445665544 555555553
No 28
>PF04892 VanZ: VanZ like family ; InterPro: IPR006976 This entry represents a conserved sequence region found in the VanZ protein and also several phosphotransbutyrylases. VanZ confers low-level resistance to the glycopeptide antibiotic teicoplanin (Te). Analysis of cytoplasmic peptidoglycan precursors, accumulated in the presence of ramoplanin, showed that VanZ-mediated Te resistance does not involve incorporation of a substituent of D-alanine into the peptidoglycan precursors [].
Probab=27.67 E-value=49 Score=23.67 Aligned_cols=29 Identities=10% Similarity=-0.149 Sum_probs=22.9
Q ss_pred CchhhHHHHHHHHHHHHHHhhhhcCCCcc
Q psy13894 148 GPFGFNLAHNSGLVVQLEAGTTKARDDGN 176 (191)
Q Consensus 148 ~~~i~~~aHl~G~l~G~l~g~~~~~~~~~ 176 (191)
.+-.|...+..|.++|+++.....|...+
T Consensus 104 ~d~~Dv~~n~~G~~lG~~l~~~~~~~~~r 132 (133)
T PF04892_consen 104 FDIDDVLANTLGALLGYLLYRLIRKRWQR 132 (133)
T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHhcc
Confidence 34578899999999999998887765443
No 29
>PF14241 DUF4341: Domain of unknown function (DUF4341)
Probab=26.36 E-value=1.4e+02 Score=18.84 Aligned_cols=34 Identities=15% Similarity=0.105 Sum_probs=21.8
Q ss_pred HHHHHHHHHHHHHHHhcCCceechhHHHHHHHHH
Q psy13894 67 YFGSGIGGNLASAIFVPYRADVSCLDNVCGMIPF 100 (191)
Q Consensus 67 yl~s~i~g~l~~~l~~~~~~~vGaSg~v~gl~~~ 100 (191)
|+.+.++|-....++.-+....|+||.+.+++..
T Consensus 6 l~GG~lIGla~~~ll~~~Gri~GiSGil~~~~~~ 39 (62)
T PF14241_consen 6 LIGGLLIGLAASLLLLLNGRIAGISGILSGLLSP 39 (62)
T ss_pred HHHHHHHHHHHHHHHHHcCcceehHHHHHHHhCC
Confidence 3444455544455555566789999988887554
No 30
>PRK10862 SoxR reducing system protein RseC; Provisional
Probab=25.55 E-value=2.2e+02 Score=21.45 Aligned_cols=16 Identities=25% Similarity=0.156 Sum_probs=8.0
Q ss_pred HHHHHHHHHHhhhhcC
Q psy13894 157 NSGLVVQLEAGTTKAR 172 (191)
Q Consensus 157 l~G~l~G~l~g~~~~~ 172 (191)
+.+.+.|+.+++.+.|
T Consensus 107 ~~~~~~g~~~g~~~~r 122 (154)
T PRK10862 107 LCGALLGGVGGFLLAR 122 (154)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 3444445555555544
No 31
>TIGR02854 spore_II_GA sigma-E processing peptidase SpoIIGA. Members of this protein family are the stage II sporulation protein SpoIIGA. This protein acts as an activating protease for Sigma-E, one of several specialized sigma factors of the sporulation process in Bacillus subtilis and related endospore-forming bacteria.
Probab=24.48 E-value=4.2e+02 Score=22.12 Aligned_cols=42 Identities=19% Similarity=0.407 Sum_probs=30.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHhChhHHHHHHHHHHHHHHHHHHH-Hhc
Q psy13894 38 IHLVISVIVQFILMRDLEKLTGSFRIAIIYFGSGIGGNLASAI-FVP 83 (191)
Q Consensus 38 ~Hl~~N~~~l~~~g~~lE~~~G~~~~~~lyl~s~i~g~l~~~l-~~~ 83 (191)
..+++|.+.|+..+..+.+....+|.+ +++.+|++.+++ +.|
T Consensus 11 ~Nf~~d~~LL~~t~~~lk~~~~~~Rll----~ga~iGa~~~~~~~~p 53 (288)
T TIGR02854 11 ENFIIDYFLLYLTARTLKDKVSQWRLL----LAALIGSLYVLFMFTP 53 (288)
T ss_pred HHHHHHHHHHHHHHHHhhccchHHHHH----HHHHHHHHHHHHHHhc
Confidence 568899999999999998888888874 455555554433 344
No 32
>PF04971 Lysis_S: Lysis protein S ; InterPro: IPR007054 The lysis S protein is a cytotoxic protein forming holes in membranes causing cell lysis. The action of Lysis S is independent of the proportion of acidic phospholipids in the membrane [].
Probab=22.07 E-value=2.4e+02 Score=18.46 Aligned_cols=28 Identities=11% Similarity=-0.003 Sum_probs=14.4
Q ss_pred hhhHHHHHHHHHHHHHH---hhhhcCCCccc
Q psy13894 150 FGFNLAHNSGLVVQLEA---GTTKARDDGNR 177 (191)
Q Consensus 150 ~i~~~aHl~G~l~G~l~---g~~~~~~~~~~ 177 (191)
+-+..+-++|+++|++- -+.++.++++|
T Consensus 32 qW~aIGvi~gi~~~~lt~ltN~YFK~k~drr 62 (68)
T PF04971_consen 32 QWAAIGVIGGIFFGLLTYLTNLYFKIKEDRR 62 (68)
T ss_pred cchhHHHHHHHHHHHHHHHhHhhhhhhHhhh
Confidence 34445667776655543 44444443333
No 33
>PF04632 FUSC: Fusaric acid resistance protein family; InterPro: IPR006726 This entry represents the p-hydroxybenzoic acid efflux pump subunit AaeB (pHBA efflux pump protein B) whose substrates are p-hydroxybenzoic acid (pHBA), 6-hydroxy-2-naphthoic and 2-hydroxycinnamate. It could function as a metabolic relief valve, allowing to eliminate certain compounds when they accumulate to high levels in the cell []. This family also includes fusaric acid resistance proteins [], which are likely to be membrane transporter proteins, and uncharacterised transporter YdhK.; GO: 0006810 transport, 0005886 plasma membrane
Probab=21.65 E-value=6.3e+02 Score=23.19 Aligned_cols=29 Identities=14% Similarity=-0.057 Sum_probs=17.8
Q ss_pred hHHHHHHHHHHHHHHhhhhcCCCccchhc
Q psy13894 152 FNLAHNSGLVVQLEAGTTKARDDGNRTKQ 180 (191)
Q Consensus 152 ~~~aHl~G~l~G~l~g~~~~~~~~~~~~~ 180 (191)
...+.+.|.++..+.-.++.+.+..+..+
T Consensus 471 ~~la~l~G~~~a~l~~~li~p~~~~~~~r 499 (650)
T PF04632_consen 471 RALAILLGIVIAALVFRLIRPFSPEWRRR 499 (650)
T ss_pred HHHHHHHHHHHHHHHHHHHCCCChhHHHH
Confidence 34677777777777766666665444433
No 34
>COG4980 GvpP Gas vesicle protein [General function prediction only]
Probab=21.14 E-value=92 Score=22.59 Aligned_cols=27 Identities=15% Similarity=0.094 Sum_probs=19.8
Q ss_pred HHHHHHHHHHHHHHhhhhcCCCccchh
Q psy13894 153 NLAHNSGLVVQLEAGTTKARDDGNRTK 179 (191)
Q Consensus 153 ~~aHl~G~l~G~l~g~~~~~~~~~~~~ 179 (191)
..+-+-|.++|.+.+.++.|...+..|
T Consensus 8 l~G~liGgiiGa~aaLL~AP~sGkelR 34 (115)
T COG4980 8 LFGILIGGIIGAAAALLFAPKSGKELR 34 (115)
T ss_pred HHHHHHHHHHHHHHHHHhCCcccHHHH
Confidence 456777888899999998887554333
No 35
>PF14851 FAM176: FAM176 family
Probab=20.83 E-value=3e+02 Score=20.94 Aligned_cols=18 Identities=28% Similarity=0.440 Sum_probs=13.2
Q ss_pred HHHHHHHHHHHHHHHHHH
Q psy13894 63 IAIIYFGSGIGGNLASAI 80 (191)
Q Consensus 63 ~~~lyl~s~i~g~l~~~l 80 (191)
-+.+||++|++.+++..+
T Consensus 21 ~~aLYFv~gVC~GLlLtL 38 (153)
T PF14851_consen 21 RFALYFVSGVCAGLLLTL 38 (153)
T ss_pred HHHHHHHHHHHHHHHHHH
Confidence 346888888888876644
No 36
>COG3086 RseC Positive regulator of sigma E activity [Signal transduction mechanisms]
Probab=20.53 E-value=2.8e+02 Score=21.03 Aligned_cols=25 Identities=12% Similarity=0.064 Sum_probs=12.0
Q ss_pred chhhHHHHHHHHHHHHHHhhhhcCC
Q psy13894 149 PFGFNLAHNSGLVVQLEAGTTKARD 173 (191)
Q Consensus 149 ~~i~~~aHl~G~l~G~l~g~~~~~~ 173 (191)
.-....+-+.|...|+++..-+.|+
T Consensus 103 e~~~~~~~~lg~~l~fl~~r~ysRk 127 (150)
T COG3086 103 ELIVIFGAFLGLALGFLLARRYSRK 127 (150)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3344444445555555555444443
No 37
>PF02960 K1: K1 glycoprotein; InterPro: IPR004121 Current genotyping systems for Human herpesvirus 8 (HHV-8) are based on the highly variable gene encoding the K1 glycoprotein []. This entry represents the C-terminal region of the K1 glycoprotein.
Probab=20.50 E-value=1e+02 Score=22.33 Aligned_cols=35 Identities=14% Similarity=0.118 Sum_probs=26.9
Q ss_pred hhHHHHHHHHHHHHHHhhhhcCCCccchhcccCCC
Q psy13894 151 GFNLAHNSGLVVQLEAGTTKARDDGNRTKQDMGSD 185 (191)
Q Consensus 151 i~~~aHl~G~l~G~l~g~~~~~~~~~~~~~~~~~~ 185 (191)
++..+-.+|++.-.+++-..++++.+++.++..+|
T Consensus 77 VaLIgTMCgILgTiIfahcqkq~dSnkTvpqql~d 111 (130)
T PF02960_consen 77 VALIGTMCGILGTIIFAHCQKQRDSNKTVPQQLRD 111 (130)
T ss_pred HHHHHHHHHHHHHHHHHHhcccccccccchHHHHh
Confidence 46678888999888889888888777776665544
Done!