Query 017729
Match_columns 367
No_of_seqs 155 out of 271
Neff 4.0
Searched_HMMs 46136
Date Fri Mar 29 02:55:18 2013
Command hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/017729.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/017729hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG2433 Uncharacterized conser 100.0 1.4E-91 3E-96 694.0 25.5 331 18-367 214-577 (577)
2 PF07910 Peptidase_C78: Peptid 100.0 2.1E-63 4.5E-68 462.3 12.8 173 183-359 15-218 (218)
3 KOG4696 Uncharacterized conser 100.0 3.8E-32 8.3E-37 262.8 10.8 192 164-361 86-370 (393)
4 KOG4696 Uncharacterized conser 98.3 1.5E-07 3.2E-12 92.9 0.9 61 186-252 191-252 (393)
5 PF03416 Peptidase_C54: Peptid 98.0 2E-05 4.4E-10 76.3 8.4 132 188-325 32-226 (278)
6 PF13529 Peptidase_C39_2: Pept 96.0 0.043 9.2E-07 44.7 8.4 111 189-320 11-144 (144)
7 KOG2674 Cysteine protease requ 95.7 0.09 1.9E-06 54.3 10.6 131 188-325 92-300 (409)
8 cd02549 Peptidase_C39A A sub-f 91.5 1.5 3.2E-05 36.4 8.6 101 193-321 6-115 (141)
9 PF03412 Peptidase_C39: Peptid 86.5 1.3 2.9E-05 36.6 4.8 97 192-325 11-108 (131)
10 cd02418 Peptidase_C39B A sub-f 69.6 61 0.0013 26.6 10.0 95 191-319 9-108 (136)
11 cd02259 Peptidase_C39_like Pep 65.9 50 0.0011 26.4 8.2 92 193-320 6-98 (122)
12 PF09778 Guanylate_cyc_2: Guan 65.7 63 0.0014 31.2 10.0 64 189-268 7-70 (212)
13 cd02424 Peptidase_C39E A sub-f 57.7 96 0.0021 25.8 8.8 92 193-319 11-106 (129)
14 KOG1089 Myotubularin-related p 54.8 1.2 2.6E-05 48.2 -3.9 105 214-338 284-390 (573)
15 cd02425 Peptidase_C39F A sub-f 53.1 1E+02 0.0022 24.9 8.0 94 193-321 11-105 (126)
16 smart00230 CysPc Calpain-like 48.0 1.1E+02 0.0024 30.3 8.8 94 224-321 143-254 (318)
17 cd02419 Peptidase_C39C A sub-f 45.9 1E+02 0.0022 25.0 7.0 93 193-321 11-104 (127)
18 PF14399 Transpep_BrtH: NlpC/p 43.0 1E+02 0.0022 29.5 7.4 62 254-323 60-137 (317)
19 cd02417 Peptidase_C39_likeA A 40.7 1.9E+02 0.0042 23.2 9.5 96 193-324 6-102 (121)
20 cd02423 Peptidase_C39G A sub-f 39.3 2.1E+02 0.0045 23.2 8.6 96 190-321 8-108 (129)
21 cd02420 Peptidase_C39D A sub-f 35.8 1.9E+02 0.0042 23.4 7.1 93 193-321 11-104 (125)
22 KOG2947 Carbohydrate kinase [C 34.4 1.1E+02 0.0023 30.9 6.2 77 215-301 23-111 (308)
23 PF14229 DUF4332: Domain of un 31.5 44 0.00095 28.9 2.7 37 193-229 71-119 (122)
24 cd02421 Peptidase_C39_likeD A 30.8 2.9E+02 0.0063 22.3 8.5 54 252-324 48-103 (124)
25 cd00044 CysPc Calpains, domain 28.0 3.7E+02 0.0081 26.3 8.8 96 223-322 150-263 (315)
26 PRK13977 myosin-cross-reactive 22.1 2.3E+02 0.0049 31.2 6.6 118 237-355 186-334 (576)
27 PF15256 SPATIAL: SPATIAL 20.3 41 0.00089 32.1 0.5 19 216-234 166-184 (196)
28 TIGR03796 NHPM_micro_ABC1 NHPM 20.0 5.1E+02 0.011 28.1 8.7 93 193-321 12-105 (710)
No 1
>KOG2433 consensus Uncharacterized conserved protein [Function unknown]
Probab=100.00 E-value=1.4e-91 Score=694.01 Aligned_cols=331 Identities=41% Similarity=0.694 Sum_probs=295.9
Q ss_pred cceeEEEEEeecCCCCCC--CCCCeeeeecCcccceeEEEeeeeeE-EEEecccCcHHHHHHHhhHHHHHHHHHH-HHHH
Q 017729 18 GDKIQVSVLLNTSQKPTK--STAPIAEYYPALEDARLLVVDWKLDV-LCYATKRLPLIYALSKLVVPGLVDQLNT-MKKA 93 (367)
Q Consensus 18 ~~~~~~~~~~~~~~~~~~--~~~p~~~y~~~~~~~~~~~~~~~ld~-l~~~~~~~~~~~~~~~l~~~~l~~ql~~-~~~~ 93 (367)
-|+|.+.++...|..++. +-.|+++--..-+...-+++ ++|+ .-.+..+-.++ +-++++++++++|+. |.++
T Consensus 214 ~~vi~id~M~s~srd~ts~~~~~P~v~v~~~n~h~~r~~~--p~evv~~~~~~~t~l~--lyk~l~eai~r~l~~~m~~~ 289 (577)
T KOG2433|consen 214 KDVIEIDAMQSLSRDTTSDQKLVPTVKVTKDNKHFTRLVT--PGEVVFPAFFGDTSLD--LYKRLREAINRRLNNTMMVT 289 (577)
T ss_pred hheeeeHHHHhhccCCcCCCCCCceEEEeeCCceeEEEee--ehheeEeeccccchhH--HHHHHHHHHHHHhhHHHHHH
Confidence 589999999888876653 34477775444333333334 4554 44556666676 569999999999987 9999
Q ss_pred hhchhhhc-CCCccceeecCCCCCCCeEEEecCCCCcchhhhhhhcccccchhhhhhhhhhc--------ccccccccc-
Q 017729 94 IMPYLLTQ-HPQLRPFHFSPPGVLQPITVIYELSYGETEMKQADLTTFSRCVSNLRSLRCQL--------IVEALSCIV- 163 (367)
Q Consensus 94 ~~~~~~~~-~~~~~~~hf~p~~~~~~~t~~y~~~~~~~~~~~~~~r~~~~~~~~lh~~~l~l--------~~n~~~~~~- 163 (367)
|...+.+. ...+.++||+|||+.|+|++.||++.+|++ +..+|+ +|| +.|.| |+|++.|.+
T Consensus 290 I~~~~~G~gv~vp~s~hflppg~~~lv~~~yp~g~~D~~--~~~yRk------rLH-~lFnLP~srPyfrrsna~~f~~E 360 (577)
T KOG2433|consen 290 INGIRAGRGVTVPTSAHFLPPGWVSLVHLQYPTGWTDNE--QRNYRK------RLH-KLFNLPSSRPYFRRSNALAFHSE 360 (577)
T ss_pred HHhhhcCCceeccCcceecCcCcceEEEEecCCCCCcHH--HHHHHH------HHH-HhhCCCCCchhhhhhhhhhcCCc
Confidence 98888764 447889999999999999999999999874 456899 899 67888 789999964
Q ss_pred ----cccccccccccc---------eeeeecC------CCCCCCCCCcccchhhHHHHHHHhhccCCCCCCCCChHHHHH
Q 017729 164 ----HLALVKHKNLKI---------MSALSWM------LKWSTFSLGWGCAYRSLQTIISWFRLQHYASVDVPSHREIQQ 224 (367)
Q Consensus 164 ----~~~lL~n~H~~l---------~~lv~G~------~qd~~~D~GWGCGYRnLQml~Sw~~~q~y~~~~vPSI~eIQq 224 (367)
...++||+|.++ .++|+|. |||+.+|+||||||||+|||||||.+|||++++||||+||||
T Consensus 361 ~~~~~~~~irnpH~~l~ps~~~~G~iy~VnG~Y~YhHYmQd~idD~GWGCAYRSlQTIcSWFilqGYT~~pIPtHrEiQq 440 (577)
T KOG2433|consen 361 SARLTKKLIRNPHLSLTPSYQPVGEIYTVNGPYNYHHYMQDGIDDSGWGCAYRSLQTICSWFILQGYTDKPIPTHREIQQ 440 (577)
T ss_pred hhhcccccccCCccccCCCCCccceEEEecCcchhHHHHHhccccCCcchhhHhHHHHHHHHHHcCccCCCCCcHHHHHH
Confidence 467999999999 8999998 999999999999999999999999999999999999999999
Q ss_pred HHHhcCCCCCCccccccccchhHHHHHHHHhhCCcEEEEEecCCCChhHHHHHHHHHhccCCCceEecCCcceeEEEEEE
Q 017729 225 ALVDIGDKDPSFVGSREWIGAIELSFVLDKLLGVSCKVLNVRSGAELPEKCRELALHFESQGTPIMIGGGVLAYTLLGVD 304 (367)
Q Consensus 225 ~Le~awDK~~~fvGSrkWIGT~Ev~~vL~~~lGI~ckIv~f~sg~e~~~l~~~l~~hF~~~gtPImigg~ghS~TIvGVe 304 (367)
+|+++.|||+.|||||+|||++|+++||+.+++++|||+++++|+|+.+..++|++||+++||||||||+++||||+||+
T Consensus 441 aLvdi~DKpA~FVGSrQWIGStEis~vLn~ll~~~skil~v~sGaEva~~~rELA~HFqt~GTPVMIGGgvLAHTIlGVd 520 (577)
T KOG2433|consen 441 ALVDIQDKPAKFVGSRQWIGSTEISFVLNELLKLESKILAVNSGAEVAERVRELARHFQTSGTPVMIGGGVLAHTILGVD 520 (577)
T ss_pred HHHhccCcccceecccceecchhHHHHHHHHhccceEEEEeccccHHHHHHHHHHHHhhccCCcEEEccceeeeeEeeee
Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred EeCCCCceEEEEeCCCCCCchhhhhhhcCCeEeEEecccCCCccccccCCeeeeccCCCCCCC
Q 017729 305 YNEASGDCAFLILDPHYTGNDEHKKIVNGGWCGWKKAVDSKGKNFFLHDKFYNLLLPQRPSMV 367 (367)
Q Consensus 305 ~~~~~G~~~LLIlDPhytg~~~lk~l~~kGw~gWKk~~~~~g~~~f~~~~fYNLClPq~p~~v 367 (367)
++..+|+++||||||||||+||++.|++||||||| |++||.|++||||||||||+++
T Consensus 521 ~n~~TGq~KFLILDPHYTGaeDl~tI~~KGWCgWK------g~dFW~Kd~yYNKOGPQrP~~i 577 (577)
T KOG2433|consen 521 FNDTTGQTKFLILDPHYTGAEDLKTITSKGWCGWK------GADFWSKDHYYNLCLPQRPDAI 577 (577)
T ss_pred eecccCceEEEEeCCCcCChhhHHHHhhccccccc------CcccccccceeeeccCCCCCCC
Confidence 99999999999999999999999999999999999 7799999999999999999875
No 2
>PF07910 Peptidase_C78: Peptidase family C78; InterPro: IPR012462 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. 
Cysteine proteases are divided into clans (proteins which are evolutionarily related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad. This entry contains UfSP1 and UfSP2, which are cysteine peptidases required for the processing and activation of Ubiquitin fold modifier 1 (Ufm1, IPR005375 from INTERPRO) and for its release from conjugated cellular proteins. UfSP1 and UfSP2 are 217 aa and 461 aa respectively. The peptidases belong to MEROPS peptidase family C78, clan CA. The UfSP2 family has an N-terminal extension with one or more zinc finger domains of the C2H2 type (IPR007087 from INTERPRO), which have been shown to be involved in protein:protein interaction. UfSP2 is present in most, if not all, multi-cellular organisms including plants, nematodes, flies, and mammals, whereas UfSP1 is not present in plants and nematodes.; PDB: 3OQC_B 2Z84_A.
Probab=100.00 E-value=2.1e-63 Score=462.31 Aligned_cols=173 Identities=42% Similarity=0.770 Sum_probs=128.6
Q ss_pred CCCCCCCCCcccchhhHHHHHHHhhc-------cCCCCCCCCChHHHHHHHHhcCCC--CC-------Cccccccccchh
Q 017729 183 LKWSTFSLGWGCAYRSLQTIISWFRL-------QHYASVDVPSHREIQQALVDIGDK--DP-------SFVGSREWIGAI 246 (367)
Q Consensus 183 ~qd~~~D~GWGCGYRnLQml~Sw~~~-------q~y~~~~vPSI~eIQq~Le~awDK--~~-------~fvGSrkWIGT~ 246 (367)
.| +++|+|||||||||||||||++. +.+....||||++||++||+|||| +. +|+||||||||+
T Consensus 15 ~~-~~~D~GWGCGYRniQml~S~l~~~~~~~~~~~~~~~~vPsi~~iQ~~le~awdkG~d~~G~~~~~~~~GsrkWIGt~ 93 (218)
T PF07910_consen 15 SQ-GFDDEGWGCGYRNIQMLCSWLLHQDQPGYEQFFGGSRVPSIREIQQWLEEAWDKGFDPQGAQLTGGFVGSRKWIGTT 93 (218)
T ss_dssp TT-T---TTT-HHHHHHHHHHCCCCC-------TTS--TT---HHHHHHHHHHCTSS---C-------CGTT------HH
T ss_pred ee-cCCCCCccchhhHHHHHHHHHHhhhccccccccCCCCCCCHHHHHHHHHHHHhhcCCcccccccccccccccEEcHH
Confidence 45 99999999999999999999988 345557999999999999999999 66 999999999999
Q ss_pred HHHHHHHHhhCCcEEEEEec-CCCCh---hHHHHHHHHHhccC-C----------CceEecCCcceeEEEEEEEeCCCCc
Q 017729 247 ELSFVLDKLLGVSCKVLNVR-SGAEL---PEKCRELALHFESQ-G----------TPIMIGGGVLAYTLLGVDYNEASGD 311 (367)
Q Consensus 247 Ev~~vL~~~lGI~ckIv~f~-sg~e~---~~l~~~l~~hF~~~-g----------tPImigg~ghS~TIvGVe~~~~~G~ 311 (367)
|++++|+++ ||+|+|++|+ ++++. +.+++||++||++. + +||||||+|||+||||||.+ .+|+
T Consensus 94 E~~~~l~~~-gi~~~i~~f~~~~~~~~~~~~l~~~v~~yF~~~~~~~~~~~~t~~~Piylqh~ghS~TIvGie~~-~~g~ 171 (218)
T PF07910_consen 94 EASALLRSL-GIPCKIVDFPKSGSEIRAHPRLLDWVWNYFESGCGSPSQSRQTNKPPIYLQHDGHSRTIVGIERN-KDGE 171 (218)
T ss_dssp HHHHHHHHC--SEEEEEEES-SGCCC---CCGHHHHHHHHCCT--------------EEEEETTEEEEEEEEEE--TT--
T ss_pred HHHHHHhhC-CceEEEEEEECCCcccccHHHHHHHHHHHhhcCCCccccccccCCCCeEeCccccceEEEEEEEC-CCCC
Confidence 999999985 9999999999 76654 89999999999987 7 89999999999999999998 6899
Q ss_pred eEEEEeCCCCCCchhhhhhhcCCeEeEEecccCCCccccccCCeeeec
Q 017729 312 CAFLILDPHYTGNDEHKKIVNGGWCGWKKAVDSKGKNFFLHDKFYNLL 359 (367)
Q Consensus 312 ~~LLIlDPhytg~~~lk~l~~kGw~gWKk~~~~~g~~~f~~~~fYNLC 359 (367)
++||||||||++++..+.|.++||++|.++.. ||.++|++.+|||||
T Consensus 172 ~~LLVlDP~~~~~~~~~~l~~~~~~~w~~~~r-r~~~~l~~~~~Ynl~ 218 (218)
T PF07910_consen 172 VNLLVLDPHYTGSDIKKLLGEKGWVSWQKLYR-RGPSFLKKYSFYNLC 218 (218)
T ss_dssp EEEEEE-TT--S-S-CHHHHHTTSEEE----E-EHCCCS-TTS-EEEE
T ss_pred EEEEEECCCCCCHHHHHHHHhCCccccccccc-cChhhcccCCEeeeC
Confidence 99999999999998889999999999943322 377999999999998
No 3
>KOG4696 consensus Uncharacterized conserved protein [Function unknown]
Probab=99.97 E-value=3.8e-32 Score=262.83 Aligned_cols=192 Identities=20% Similarity=0.262 Sum_probs=153.8
Q ss_pred cccccccccccc-----------eeeeecC----CCCCCCCCCcccchhhHHHHHHHhhccC-------CC-CCCCCChH
Q 017729 164 HLALVKHKNLKI-----------MSALSWM----LKWSTFSLGWGCAYRSLQTIISWFRLQH-------YA-SVDVPSHR 220 (367)
Q Consensus 164 ~~~lL~n~H~~l-----------~~lv~G~----~qd~~~D~GWGCGYRnLQml~Sw~~~q~-------y~-~~~vPSI~ 220 (367)
...+|+||.+.+ +++++|+ ++..-.|+||||||||+||.|||++.++ +. ...||.|.
T Consensus 86 Ll~il~~Clq~l~~~~~~~lic~~sll~g~VD~hf~~~~~d~Gwgcgw~niqmq~shll~~~e~~krr~f~s~n~i~ei~ 165 (393)
T KOG4696|consen 86 LLDILSKCLQQLKRQLQHFLICGCSLLDGDVDYHFTVTGIDRGWGCGWRNIQMQISHLLYTNENWKRRNFSSGNEIYEIN 165 (393)
T ss_pred HHHHHHHHHHHHHhhhcccceeeehhccccchhheeecccccccCccccchHHHHHHHHhhChhhhhhccccCccccchH
Confidence 466788888877 8899998 8888899999999999999999986653 22 35799999
Q ss_pred HHHHHHHhcCCCC----------CCccccccccchhHHHHHHHHhhC---------------------------------
Q 017729 221 EIQQALVDIGDKD----------PSFVGSREWIGAIELSFVLDKLLG--------------------------------- 257 (367)
Q Consensus 221 eIQq~Le~awDK~----------~~fvGSrkWIGT~Ev~~vL~~~lG--------------------------------- 257 (367)
.+|++||.||.|+ .+..|+|.|||++|...+|++ .|
T Consensus 166 sLQr~le~awnkGFDi~~ALH~D~R~~G~K~W~~~~~~~qml~s-~gl~~~~~d~~P~K~qSM~l~~~~e~~~Pq~~SiG 244 (393)
T KOG4696|consen 166 SLQRLLESAWNKGFDIIEALHTDVRSLGDKGWGCGYRNFQMLDS-EGLAQLGDDLIPPKIQSMILHGKWEGFDPQGASIG 244 (393)
T ss_pred HHHHHHHHHHhcccchhhhhcccchhcccccccccchhHHHHHH-HHHHhhccccCchhhhhhhhcccccccCccccccc
Confidence 9999999999997 578999999999999999987 34
Q ss_pred ------------CcEEEEEecCCC----ChhHHHHHHHHHhccC-----------CCceEecCCcceeEEEEEEEeCCCC
Q 017729 258 ------------VSCKVLNVRSGA----ELPEKCRELALHFESQ-----------GTPIMIGGGVLAYTLLGVDYNEASG 310 (367)
Q Consensus 258 ------------I~ckIv~f~sg~----e~~~l~~~l~~hF~~~-----------gtPImigg~ghS~TIvGVe~~~~~G 310 (367)
+.|+++||...+ ..+.++.||++||++. ++|+|+||+|||||||||+... .-
T Consensus 245 a~evY~l~tgl~vk~~~VDfh~s~~~~s~~~~Lfewv~nyfss~~e~S~~v~~tsk~P~YlQhqGHSrtiiG~~~~l-~~ 323 (393)
T KOG4696|consen 245 ATEVYSLFTGLFVKVALVDFHFSSEPASASNALFEWVKNYFSSSGEGSPNVTSTSKSPCYLQHQGHSRTIIGFCSSL-ER 323 (393)
T ss_pred ceeEEEEeecceeeEEEEeccccCCccccchHHHHHHHHHhccccCCCCCceecCCCCeEEEecCceeEEEEeeecc-cc
Confidence 456777775432 3579999999999864 5799999999999999999985 46
Q ss_pred ceEEEEeCCCCCCchhhhhhhcCCeEeEEecccCCCccccccCCeeeeccC
Q 017729 311 DCAFLILDPHYTGNDEHKKIVNGGWCGWKKAVDSKGKNFFLHDKFYNLLLP 361 (367)
Q Consensus 311 ~~~LLIlDPhytg~~~lk~l~~kGw~gWKk~~~~~g~~~f~~~~fYNLClP 361 (367)
..+||||||.-...+--|+++++ .+|-+... ||+..+ |-+.|++..-
T Consensus 324 t~~LlILDP~d~~r~vqk~l~~~--a~~~~~l~-r~~~~L-K~~qyQ~l~v 370 (393)
T KOG4696|consen 324 TLTLLILDPGDRYRSVQKKLVNI--ADFNHCLM-RKKRSL-KFSQYQLLHV 370 (393)
T ss_pred ceeEEEeCCCCchHHHHHHHHHH--hhhHHHHH-hhccCc-CCcceEEEEe
Confidence 89999999987777766666554 33433322 355666 7788888653
No 4
>KOG4696 consensus Uncharacterized conserved protein [Function unknown]
Probab=98.32 E-value=1.5e-07 Score=92.89 Aligned_cols=61 Identities=30% Similarity=0.317 Sum_probs=42.1
Q ss_pred CCCCCCcccchhhHHHHHHHhhcc-CCCCCCCCChHHHHHHHHhcCCCCCCccccccccchhHHHHHH
Q 017729 186 STFSLGWGCAYRSLQTIISWFRLQ-HYASVDVPSHREIQQALVDIGDKDPSFVGSREWIGAIELSFVL 252 (367)
Q Consensus 186 ~~~D~GWGCGYRnLQml~Sw~~~q-~y~~~~vPSI~eIQq~Le~awDK~~~fvGSrkWIGT~Ev~~vL 252 (367)
...|+||||||||.||+.|....+ +|. .+| ++||.+++.+|-.+-.-.| +=||++|++.++
T Consensus 191 ~~G~K~W~~~~~~~qml~s~gl~~~~~d--~~P--~K~qSM~l~~~~e~~~Pq~--~SiGa~evY~l~ 252 (393)
T KOG4696|consen 191 SLGDKGWGCGYRNFQMLDSEGLAQLGDD--LIP--PKIQSMILHGKWEGFDPQG--ASIGATEVYSLF 252 (393)
T ss_pred hcccccccccchhHHHHHHHHHHhhccc--cCc--hhhhhhhhcccccccCccc--ccccceeEEEEe
Confidence 347999999999999999986544 443 677 8999999999843211111 116666665543
No 5
>PF03416 Peptidase_C54: Peptidase family C54 This family belongs to family C54 of the peptidase classification.; InterPro: IPR005078 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. 
These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionarily related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad. This is a group of cysteine peptidases which constitute MEROPS peptidase family C54 (Aut2 peptidase family, clan CA), which are a group of proteins of unknown function.; PDB: 2CY7_A 2ZZP_A 2D1I_B 2Z0E_A 2Z0D_A 2P82_D.
Probab=98.00 E-value=2e-05 Score=76.29 Aligned_cols=132 Identities=24% Similarity=0.398 Sum_probs=67.5
Q ss_pred CCCCcccchhhHHHHHHH-hhcc----CCC-CCCCCChHHHHHHHHhcCCCCC-------------Cccc--cccccchh
Q 017729 188 FSLGWGCAYRSLQTIISW-FRLQ----HYA-SVDVPSHREIQQALVDIGDKDP-------------SFVG--SREWIGAI 246 (367)
Q Consensus 188 ~D~GWGCGYRnLQml~Sw-~~~q----~y~-~~~vPSI~eIQq~Le~awDK~~-------------~fvG--SrkWIGT~ 246 (367)
-|.||||-.|+-|||+.- ++.. ++. ....+...+..++|.--.|++. ...| =.+|-|.+
T Consensus 32 SD~GWGCmlRs~QMlLAqaL~~~~lgr~~~~~~~~~~~~~~~~il~~F~D~~~apfSIh~i~~~g~~~~g~~~G~W~gPs 111 (278)
T PF03416_consen 32 SDCGWGCMLRSGQMLLAQALLRHHLGRDWRWPDNSDNNEEYRRILSLFQDSPSAPFSIHNIVQEGKSEFGKKPGEWFGPS 111 (278)
T ss_dssp B-TTT-HHHHHHHHHHHHHHHHHHC-TT--TTTTSS--HHHHHHHHTTSSSTTSTTSHHHHHHHHHTT-T--TTS-B-HH
T ss_pred cCCCcccccchhHHHHHHHHHHHhhcccccccccccCcHHHHHHHHhcCCCCCCcchHHHHHHHHHHHcCCCCcccCCHH
Confidence 599999999999999886 4332 222 1111444555555555555541 1112 35899999
Q ss_pred HHHHHHHHhhCC----cEEEEEecCCCCh-------------------------------------hHHHHHHHHHhccC
Q 017729 247 ELSFVLDKLLGV----SCKVLNVRSGAEL-------------------------------------PEKCRELALHFESQ 285 (367)
Q Consensus 247 Ev~~vL~~~lGI----~ckIv~f~sg~e~-------------------------------------~~l~~~l~~hF~~~ 285 (367)
.++.++..+..- .-++.-..++.-. +...+.|...|+-.
T Consensus 112 ~~~~~l~~l~~~~~~~~l~v~v~~d~~i~~~d~~~~~~~~~~~~~~~~~~~vLlliplrLGl~~in~~Y~~~l~~~l~~p 191 (278)
T PF03416_consen 112 TIAQALKKLVNEADLSGLRVYVSSDGTIYYDDVEELCSNSNPTKQSSWWKPVLLLIPLRLGLDKINPKYIPSLKSLLSLP 191 (278)
T ss_dssp HHHHHHHHHHCC-TTT--EEEE-BTTEEEHHHHHHHHCCS-S-----CE--EEEEEEEE-SSSS--GGGHHHHHHHCCST
T ss_pred HHHHHHHHHHHhccccCceEEEeeccccchhHHHHHHhhhccccccccCceEEEEEEeecCCCCCCHHHHHHHHHHhCCc
Confidence 999999987643 1222222222111 12233333333321
Q ss_pred CCceEecC-CcceeEEEEEEEeCCCCceEEEEeCCCCCCch
Q 017729 286 GTPIMIGG-GVLAYTLLGVDYNEASGDCAFLILDPHYTGND 325 (367)
Q Consensus 286 gtPImigg-~ghS~TIvGVe~~~~~G~~~LLIlDPhytg~~ 325 (367)
-+==|+|| ..+|+=++|+.-+ +|+-|||||+...
T Consensus 192 q~vGiiGG~p~~a~YfvG~~~d------~liYLDPH~~Q~a 226 (278)
T PF03416_consen 192 QSVGIIGGRPNSALYFVGFQGD------QLIYLDPHYVQPA 226 (278)
T ss_dssp TEEEEEEEETTEEEEEEEEETT------EEEEE---SEEE-
T ss_pred ccceeeccCCCceEEEEEEccC------eEEEECCCCCeeC
Confidence 11124555 6799999998754 4999999998654
No 6
>PF13529 Peptidase_C39_2: Peptidase_C39 like family; PDB: 3ERV_A.
Probab=96.05 E-value=0.043 Score=44.72 Aligned_cols=111 Identities=21% Similarity=0.403 Sum_probs=59.2
Q ss_pred CCCcccchhhHHHHHHHhhccCCCCCCCCChHHHHHHHHhc--CCCCCCcccc-----ccccchhHHHHHHHHhhCCcEE
Q 017729 189 SLGWGCAYRSLQTIISWFRLQHYASVDVPSHREIQQALVDI--GDKDPSFVGS-----REWIGAIELSFVLDKLLGVSCK 261 (367)
Q Consensus 189 D~GWGCGYRnLQml~Sw~~~q~y~~~~vPSI~eIQq~Le~a--wDK~~~fvGS-----rkWIGT~Ev~~vL~~~lGI~ck 261 (367)
....|||=-+++|+++++ +. -++..+|-+.+... +|....++|. ...+...++...+.. +|....
T Consensus 11 ~~~~~Cg~as~~mvl~~~---g~----~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~ 82 (144)
T PF13529_consen 11 ETSYGCGPASAAMVLNYY---GK----NISQEDLADEAGTNPDGDPNTGFVGNPYYDSGYGTSPDDLARYLEK-YGYKAT 82 (144)
T ss_dssp T-TT-HHHHHHHHHHHHT---T--------HHHHHHHS-EE-E--TTTSEEB-SSTS-B----HHHHHHHHHH-H-TTEE
T ss_pred CCCCcCHHHHHHHHHHHc---CC----CCCHHHHHHHhhhccCCCCCcccccCccccCCCccccHHHHHHHHH-cCccee
Confidence 356779999999999998 22 46777777777654 3555555532 334555566666776 565211
Q ss_pred EEEecCCCChhHHHHHHHHHhccCCCceEecC----------------CcceeEEEEEEEeCCCCceEEEEeCCC
Q 017729 262 VLNVRSGAELPEKCRELALHFESQGTPIMIGG----------------GVLAYTLLGVDYNEASGDCAFLILDPH 320 (367)
Q Consensus 262 Iv~f~sg~e~~~l~~~l~~hF~~~gtPImigg----------------~ghS~TIvGVe~~~~~G~~~LLIlDPh 320 (367)
. ...... +.|.++.+ .|.||++.. ++|..+|+|++.+ + .+.|.||.
T Consensus 83 --~-~~~~~~----~~i~~~i~-~G~Pvi~~~~~~~~~~~~~~~~~~~~~H~vvi~Gy~~~---~--~~~v~DP~ 144 (144)
T PF13529_consen 83 --D-TSDASF----DDIKQEID-AGRPVIVSVNSGWRPPNGDGYDGTYGGHYVVIIGYDED---G--YVYVNDPW 144 (144)
T ss_dssp --E--TTS-H----HHHHHHHH-TT--EEEEEETTSS--TTEEEEE-TTEEEEEEEEE-SS---E---EEEE-TT
T ss_pred --e-ccCCcH----HHHHHHHH-CCCcEEEEEEcccccCCCCCcCCCcCCEEEEEEEEeCC---C--EEEEeCCC
Confidence 1 223344 44444444 377877655 6799999999874 2 89999994
No 7
>KOG2674 consensus Cysteine protease required for autophagy - Apg4p/Aut2p [Cytoskeleton; Intracellular trafficking, secretion, and vesicular transport]
Probab=95.66 E-value=0.09 Score=54.34 Aligned_cols=131 Identities=25% Similarity=0.406 Sum_probs=78.5
Q ss_pred CCCCcccchhhHHHHHHH-hhccC----CC-CCCCCChHHHHHHHHhcCCCCCCc--------------cccccccchhH
Q 017729 188 FSLGWGCAYRSLQTIISW-FRLQH----YA-SVDVPSHREIQQALVDIGDKDPSF--------------VGSREWIGAIE 247 (367)
Q Consensus 188 ~D~GWGCGYRnLQml~Sw-~~~q~----y~-~~~vPSI~eIQq~Le~awDK~~~f--------------vGSrkWIGT~E 247 (367)
-|.||||=-|+=||++.- +..++ ++ ...-+...+-+++|+.-.|.+.++ .-=-+|-|-..
T Consensus 92 tD~GWGCMlR~gQMllaqaL~~~~lGRdw~w~~~~~~~~~y~~il~~F~D~~~a~~SiHq~~~~G~~~~~~~g~WfGP~~ 171 (409)
T KOG2674|consen 92 TDCGWGCMLRCGQMLLAQALICRHLGRDWRWTDEKRLEEEYLKILNLFEDEPDAPFSIHQIVQMGVGEGKAVGSWFGPNT 171 (409)
T ss_pred cCcceeeEEehhHHHHHHHHHHhhcccccccccccccchHHHHHHHhhcCCCccccCHHHHHHHHhhccCCCccccCCcH
Confidence 599999999999999886 32222 22 222333333333555444543110 11247999999
Q ss_pred HHHHHHHhhCCc------EEEEEe-------------cCC-------------------------------------CCh
Q 017729 248 LSFVLDKLLGVS------CKVLNV-------------RSG-------------------------------------AEL 271 (367)
Q Consensus 248 v~~vL~~~lGI~------ckIv~f-------------~sg-------------------------------------~e~ 271 (367)
++-++.+ ++.. ...+.+ ..+ +++
T Consensus 172 ~a~~~~~-L~~~~~~~~~~~~v~~~~~vv~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ewkpllLLVPvRLG~~~i 250 (409)
T KOG2674|consen 172 VAQVLKK-LARFDPWSSLAVYVAMDNAVIIRDIVEKCRRGPLPALTIEDATKQSLEFSNGITEWKPLLLLIPLRLGITSI 250 (409)
T ss_pred HHHHHHH-hhccCCCCCccEEEecccceEEeeeehhcccCCcccceecccchhhcccCCCCCCCcceEEEEEeeeccccc
Confidence 9888876 4321 111110 001 011
Q ss_pred -hHHHHHHHHHhccCCCceEecC-CcceeEEEEEEEeCCCCceEEEEeCCCCCCch
Q 017729 272 -PEKCRELALHFESQGTPIMIGG-GVLAYTLLGVDYNEASGDCAFLILDPHYTGND 325 (367)
Q Consensus 272 -~~l~~~l~~hF~~~gtPImigg-~ghS~TIvGVe~~~~~G~~~LLIlDPhytg~~ 325 (367)
+..+..|.+-|+-.-+==++|| .+||+-++|+.-++ |+-|||||+-+.
T Consensus 251 Np~Yvp~lk~~f~~~q~lGI~GGkP~~S~YFvGyq~d~------l~YLDPH~~Q~~ 300 (409)
T KOG2674|consen 251 NPSYVPALKECFEMPQSVGIIGGRPNHSLYFVGYQGDE------LFYLDPHYTQPA 300 (409)
T ss_pred ChHHHHHHHHHhcchhhceeccCCCCcceEEEEEecce------EEEeCCccCccc
Confidence 3566777777764333335666 78999999998775 999999999874
No 8
>cd02549 Peptidase_C39A A sub-family of peptidase family C39. Peptidase family C39 mostly contains bacteriocin-processing endopeptidases from bacteria. The cysteine peptidases in family C39 cleave the "double-glycine" leader peptides from the precursors of various bacteriocins (mostly non-lantibiotic). The cleavage is mediated by the transporter as part of the secretion process. Bacteriocins are antibiotic proteins secreted by some species of bacteria that inhibit the growth of other bacterial species. The bacteriocin is synthesized as a precursor with an N-terminal leader peptide, and processing involves removal of the leader peptide by cleavage at a Gly-Gly bond, followed by translocation of the mature bacteriocin across the cytoplasmic membrane. Most endopeptidases of family C39 are N-terminal domains in larger proteins (ABC transporters) that serve both functions. The proposed protease active site is conserved in this sub-family of proteins with a single peptidase domain, which are
Probab=91.46 E-value=1.5 Score=36.42 Aligned_cols=101 Identities=18% Similarity=0.241 Sum_probs=56.5
Q ss_pred ccchhhHHHHHHHhhccCCCCCCCCChHHHHHHHHhcCCCCCCccccccccchhHHHHH-HHHhhCCcEEEEEecCCCCh
Q 017729 193 GCAYRSLQTIISWFRLQHYASVDVPSHREIQQALVDIGDKDPSFVGSREWIGAIELSFV-LDKLLGVSCKVLNVRSGAEL 271 (367)
Q Consensus 193 GCGYRnLQml~Sw~~~q~y~~~~vPSI~eIQq~Le~awDK~~~fvGSrkWIGT~Ev~~v-L~~~lGI~ckIv~f~sg~e~ 271 (367)
+||=.++.|++.|+-.+- +..++... .+...... ++.-.-..++... ++. +|+.++.+....
T Consensus 6 ~C~~~slamvl~~~g~~~-------~~~~l~~~---~~~~~~~~--~~~g~~~~~l~~~~a~~-~G~~~~~~~~~~---- 68 (141)
T cd02549 6 GCGPTSLAMVLSYLGVKV-------TKPQLAAE---GNTYDFAK--DGYGTYPKPIVSAAARK-YGLVVRPLTGLL---- 68 (141)
T ss_pred ccHHHHHHHHHHhcCCCC-------CHHHHHhh---ccccccCC--CCCCcCHHHHHHHHHhh-CCCcEEECCCHH----
Confidence 699999999999974321 22333221 11111111 1111223344444 554 799888653211
Q ss_pred hHHHHHHHHHhccCCCceEec--------CCcceeEEEEEEEeCCCCceEEEEeCCCC
Q 017729 272 PEKCRELALHFESQGTPIMIG--------GGVLAYTLLGVDYNEASGDCAFLILDPHY 321 (367)
Q Consensus 272 ~~l~~~l~~hF~~~gtPImig--------g~ghS~TIvGVe~~~~~G~~~LLIlDPhy 321 (367)
. +. ...+.+-|+++. .++|...|.|++ .. + .++|.||..
T Consensus 69 ~-----~~-~~l~~~~Pvi~~~~~~~~~~~~gH~vVv~g~~-~~--~--~~~i~DP~~ 115 (141)
T cd02549 69 A-----LL-RQLAAGHPVIVSVNLGVSITPSGHAMVVIGYD-RK--G--NVYVNDPGG 115 (141)
T ss_pred H-----HH-HHHHCCCeEEEEEecCcccCCCCeEEEEEEEc-CC--C--CEEEECCCC
Confidence 1 11 223458899883 378999999998 11 1 288999964
No 9
>PF03412 Peptidase_C39: Peptidase C39 family This is family C39 in the peptidase classification. ; InterPro: IPR005074 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. 
These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionarily related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad. The group of sequences defined by this cysteine peptidase domain belongs to the MEROPS peptidase family C39 (clan CA). It is found in a wide range of ABC transporters, which are maturation proteases for peptide bacteriocins, the proteolytic domain residing in the N-terminal region of the protein. A number of the proteins are classified as non-peptidase homologues as they either have been found experimentally to be without peptidase activity, or lack amino acid residues that are believed to be essential for the catalytic activity. Lantibiotic and non-lantibiotic bacteriocins are synthesised as precursor peptides containing N-terminal extensions (leader peptides) which are cleaved off during maturation. Most non-lantibiotics and also some lantibiotics have leader peptides of the so-called double-glycine type. These leader peptides share consensus sequences and also a common processing site with two conserved glycine residues in positions -1 and -2. The double-glycine-type leader peptides are unrelated to the N-terminal signal sequences which direct proteins across the cytoplasmic membrane via the sec pathway. Their processing sites are also different from typical signal peptidase cleavage sites, suggesting that a different processing enzyme is involved.; GO: 0005524 ATP binding, 0008233 peptidase activity, 0006508 proteolysis, 0016021 integral to membrane; PDB: 3K8U_A 3B79_A.
Probab=86.50 E-value=1.3 Score=36.58 Aligned_cols=97 Identities=19% Similarity=0.276 Sum_probs=58.9
Q ss_pred cccchhhHHHHHHHhhccCCCCCCCCChHHHHHHHHhcCCCCCCccccccccchhHHHHHHHHhhCCcEEEEEecCCCCh
Q 017729 192 WGCAYRSLQTIISWFRLQHYASVDVPSHREIQQALVDIGDKDPSFVGSREWIGAIELSFVLDKLLGVSCKVLNVRSGAEL 271 (367)
Q Consensus 192 WGCGYRnLQml~Sw~~~q~y~~~~vPSI~eIQq~Le~awDK~~~fvGSrkWIGT~Ev~~vL~~~lGI~ckIv~f~sg~e~ 271 (367)
-.||-..+.|++.++... -|..+|.+.+. .++.-+.-.++.-+++. +|++++.+.++.. +
T Consensus 11 ~dcg~acl~~l~~~~g~~-------~s~~~l~~~~~----------~~~~g~s~~~L~~~~~~-~gl~~~~~~~~~~-~- 70 (131)
T PF03412_consen 11 NDCGLACLAMLLKYYGIP-------VSEEELRRQLG----------TSEEGTSLADLKRAARK-YGLKAKAVKLNFE-K- 70 (131)
T ss_dssp T-HHHHHHHHHHHHTT-----------HHHHHCCTT-----------BTTB--CCCHHHHHHH-TTEEEEEEE--GG-G-
T ss_pred CCHHHHHHHHHHHHhCCC-------chHHHHHHHhc----------CCccCCCHHHHHHHHHh-cccceeeeecchh-h-
Confidence 469999999999995322 13334433221 12222334455566776 7999998876543 1
Q ss_pred hHHHHHHHHHhccCCCceEec-CCcceeEEEEEEEeCCCCceEEEEeCCCCCCch
Q 017729 272 PEKCRELALHFESQGTPIMIG-GGVLAYTLLGVDYNEASGDCAFLILDPHYTGND 325 (367)
Q Consensus 272 ~~l~~~l~~hF~~~gtPImig-g~ghS~TIvGVe~~~~~G~~~LLIlDPhytg~~ 325 (367)
+.+...|+++. .++|--.|.|++. -+++|+|| ..|..
T Consensus 71 ----------l~~~~~P~I~~~~~~h~vVi~~~~~------~~~~i~dP-~~g~~ 108 (131)
T PF03412_consen 71 ----------LKRLPLPAIAHLKDGHFVVIYKIDD------GRVLIYDP-KKGKI 108 (131)
T ss_dssp ----------CTCGGSSEEEEECCCEEEEEEEECC------CEEEECCT-TTCEE
T ss_pred ----------hhhccccEEEEecCcceEEEEeEcC------cEEEEEeC-CCCeE
Confidence 24567899998 7888888888832 34999999 55544
No 10
>cd02418 Peptidase_C39B A sub-family of peptidase family C39. Peptidase family C39 mostly contains bacteriocin-processing endopeptidases from bacteria. The cysteine peptidases in family C39 cleave the "double-glycine" leader peptides from the precursors of various bacteriocins (mostly non-lantibiotic). The cleavage is mediated by the transporter as part of the secretion process. Bacteriocins are antibiotic proteins secreted by some species of bacteria that inhibit the growth of other bacterial species. The bacteriocin is synthesized as a precursor with an N-terminal leader peptide, and processing involves removal of the leader peptide by cleavage at a Gly-Gly bond, followed by translocation of the mature bacteriocin across the cytoplasmic membrane. Most endopeptidases of family C39 are N-terminal domains in larger proteins (ABC transporters) that serve both functions. The proposed protease active site is conserved in this sub-family.
Probab=69.56 E-value=61 Score=26.55 Aligned_cols=95 Identities=16% Similarity=0.216 Sum_probs=58.2
Q ss_pred CcccchhhHHHHHHHhhccCCCCCCCCChHHHHHHHHhcCCCCCCccccccccchhHHHHHHHHhhCCcEEEEEecCCCC
Q 017729 191 GWGCAYRSLQTIISWFRLQHYASVDVPSHREIQQALVDIGDKDPSFVGSREWIGAIELSFVLDKLLGVSCKVLNVRSGAE 270 (367)
Q Consensus 191 GWGCGYRnLQml~Sw~~~q~y~~~~vPSI~eIQq~Le~awDK~~~fvGSrkWIGT~Ev~~vL~~~lGI~ckIv~f~sg~e 270 (367)
...||=..+.|++.++...- +..+|.+.+. +++++ .....+.-.++. +|+.++....+..+
T Consensus 9 ~~~~gl~~l~~~~~~~g~~~-------~~~~l~~~~~--~~~~~--------~~~~~l~~~a~~-~Gl~~~~~~~~~~~- 69 (136)
T cd02418 9 EMDCGAACLAMIAKYYGKNY-------SLAKLRELAG--TDREG--------TSLLGLVKAAEK-LGFETRAVKADMDL- 69 (136)
T ss_pred cccHHHHHHHHHHHHhCCCC-------CHHHHHHHcC--CCCCC--------cCHHHHHHHHHH-CCCeeEEEEcccch-
Confidence 34799999999998864321 3333433221 12111 233344445665 79999998865431
Q ss_pred hhHHHHHHHHHhccCCCceEec-----CCcceeEEEEEEEeCCCCceEEEEeCC
Q 017729 271 LPEKCRELALHFESQGTPIMIG-----GGVLAYTLLGVDYNEASGDCAFLILDP 319 (367)
Q Consensus 271 ~~~l~~~l~~hF~~~gtPImig-----g~ghS~TIvGVe~~~~~G~~~LLIlDP 319 (367)
.. +.+...|+++. .++|...|.|++. + .++|.||
T Consensus 70 -~~--------l~~~~~P~I~~~~~~~~~~~~~Vl~~~~~----~--~~~i~dp 108 (136)
T cd02418 70 -FE--------LKDIPLPFIAHVIKEWKLNHYVVVYKIKK----K--KILIADP 108 (136)
T ss_pred -hh--------HhcCCCCEEEEEccCCCCCeEEEEEEEcC----C--EEEEECC
Confidence 00 13457799984 5789999998762 2 3889999
No 11
>cd02259 Peptidase_C39_like Peptidase family C39 mostly contains bacteriocin-processing endopeptidases from bacteria. The cysteine peptidases in family C39 cleave the "double-glycine" leader peptides from the precursors of various bacteriocins (mostly non-lantibiotic). The cleavage is mediated by the transporter as part of the secretion process. Bacteriocins are antibiotic proteins secreted by some species of bacteria that inhibit the growth of other bacterial species. The bacteriocin is synthesized as a precursor with an N-terminal leader peptide, and processing involves removal of the leader peptide by cleavage at a Gly-Gly bond, followed by translocation of the mature bacteriocin across the cytoplasmic membrane. Most endopeptidases of family C39 are N-terminal domains in larger proteins (ABC transporters) that serve both functions. The proposed protease active site is not conserved in all sub-families.
Probab=65.91 E-value=50 Score=26.38 Aligned_cols=92 Identities=20% Similarity=0.327 Sum_probs=54.6
Q ss_pred ccchhhHHHHHHHhhccCCCCCCCCChHHHHHHHHhcCCCCCCccccccccchhHHHHHHHHhhCCcEEEEEecCCCChh
Q 017729 193 GCAYRSLQTIISWFRLQHYASVDVPSHREIQQALVDIGDKDPSFVGSREWIGAIELSFVLDKLLGVSCKVLNVRSGAELP 272 (367)
Q Consensus 193 GCGYRnLQml~Sw~~~q~y~~~~vPSI~eIQq~Le~awDK~~~fvGSrkWIGT~Ev~~vL~~~lGI~ckIv~f~sg~e~~ 272 (367)
.||-..+.|++.++..+- +..+|.+.+. +.+++ .-..++.-+.+. +|++|+....+-
T Consensus 6 ~~gl~~l~~i~~~~g~~~-------~~~~l~~~~~--~~~~~--------~~~~~l~~~a~~-~gl~~~~~~~~~----- 62 (122)
T cd02259 6 DCGLACLQMLLRYFGIPV-------RRDVLLNAQQ--RRQQG--------LSLADLVSLANK-LGLTAQGVKLPL----- 62 (122)
T ss_pred chHHHHHHHHHHHcCCCC-------CHHHHHHHHh--hccCC--------CCHHHHHHHHHH-cCCeeeEEEcCH-----
Confidence 588889999988874331 2333322221 11110 112233444554 799999876432
Q ss_pred HHHHHHHHHhccCCCceEe-cCCcceeEEEEEEEeCCCCceEEEEeCCC
Q 017729 273 EKCRELALHFESQGTPIMI-GGGVLAYTLLGVDYNEASGDCAFLILDPH 320 (367)
Q Consensus 273 ~l~~~l~~hF~~~gtPImi-gg~ghS~TIvGVe~~~~~G~~~LLIlDPh 320 (367)
+.+.+...|+++ ..++|-..|.|++ + + .++|.||.
T Consensus 63 -------~~l~~~~~P~i~~~~~~~~~Vl~~~~-~---~--~~~i~dp~ 98 (122)
T cd02259 63 -------AALSRLQLPALLLWKQGHFVILYGAD-K---G--QVLIADPL 98 (122)
T ss_pred -------HHhccCCCCEEEEcCCCcEEEEEEEc-C---C--EEEEECCc
Confidence 124456789998 5678877888765 2 2 38999997
No 12
>PF09778 Guanylate_cyc_2: Guanylylate cyclase; InterPro: IPR018616 Members of this family of proteins catalyse the conversion of guanosine triphosphate (GTP) to 3',5'-cyclic guanosine monophosphate (cGMP) and pyrophosphate.
Probab=65.66 E-value=63 Score=31.15 Aligned_cols=64 Identities=16% Similarity=0.510 Sum_probs=47.0
Q ss_pred CCCcccchhhHHHHHHHhhccCCCCCCCCChHHHHHHHHhcCCCCCCccccccccchhHHHHHHHHhhCCcEEEEEecCC
Q 017729 189 SLGWGCAYRSLQTIISWFRLQHYASVDVPSHREIQQALVDIGDKDPSFVGSREWIGAIELSFVLDKLLGVSCKVLNVRSG 268 (367)
Q Consensus 189 D~GWGCGYRnLQml~Sw~~~q~y~~~~vPSI~eIQq~Le~awDK~~~fvGSrkWIGT~Ev~~vL~~~lGI~ckIv~f~sg 268 (367)
..-|-||---+.|++++...+.+. .+++++..+. + .++--| |++.+++|.. +||++.-....-|
T Consensus 7 ~~~WDCGlACv~MvL~~~~~~~~~-------~~~~~~c~~~-----~-~t~SiW--TIDLayLL~~-f~v~~~f~T~TlG 70 (212)
T PF09778_consen 7 RYNWDCGLACVLMVLRYLGRNNFL-------ANFEEICQEE-----G-FTTSIW--TIDLAYLLRR-FGVRHSFYTVTLG 70 (212)
T ss_pred eccccccHHHHHHHHHHcCccchH-------HHHHHHHHHc-----c-CCccee--hhHHHHHHHH-cCCCeeEecCccc
Confidence 457999999999999998665542 5666665543 1 344444 9999999998 7999887776543
No 13
>cd02424 Peptidase_C39E A sub-family of peptidase family C39. Peptidase family C39 mostly contains bacteriocin-processing endopeptidases from bacteria. The cysteine peptidases in family C39 cleave the "double-glycine" leader peptides from the precursors of various bacteriocins (mostly non-lantibiotic). The cleavage is mediated by the transporter as part of the secretion process. Bacteriocins are antibiotic proteins secreted by some species of bacteria that inhibit the growth of other bacterial species. The bacteriocin is synthesized as a precursor with an N-terminal leader peptide, and processing involves removal of the leader peptide by cleavage at a Gly-Gly bond, followed by translocation of the mature bacteriocin across the cytoplasmic membrane. Most endopeptidases of family C39 are N-terminal domains in larger proteins (ABC transporters) that serve both functions. The proposed protease active site is conserved in this sub-family, which contains Colicin V processing peptidase.
Probab=57.74 E-value=96 Score=25.79 Aligned_cols=92 Identities=16% Similarity=0.291 Sum_probs=48.3
Q ss_pred ccchhhHHHHHHH-hhccCCCCCCCCChHHHHHHHHhcCCCCCCccccccccchhHHHHHHHHhhCCcEEEEEecCCCCh
Q 017729 193 GCAYRSLQTIISW-FRLQHYASVDVPSHREIQQALVDIGDKDPSFVGSREWIGAIELSFVLDKLLGVSCKVLNVRSGAEL 271 (367)
Q Consensus 193 GCGYRnLQml~Sw-~~~q~y~~~~vPSI~eIQq~Le~awDK~~~fvGSrkWIGT~Ev~~vL~~~lGI~ckIv~f~sg~e~ 271 (367)
-||-..+.|++.+ +...- ++.++...+. .. ... ....++.-+++. +|+++|.+..+.
T Consensus 11 dcgla~l~~i~~~~~g~~~-------~~~~l~~~~~---~~------~~g-~s~~~l~~~a~~-~Gl~~k~~~~~~---- 68 (129)
T cd02424 11 DCGIAVIQMLYNHYYKKKY-------DLNELKIKAN---LK------KNG-LSIYDLENLAKK-FGLETESYQGSF---- 68 (129)
T ss_pred chHHHHHHHHHHHhcCCCc-------cHHHHHHHhC---CC------CCC-ccHHHHHHHHHH-cCCceeEEEcCH----
Confidence 5999999999998 44321 2222222110 00 000 112233333554 799999987642
Q ss_pred hHHHHHHHHHhccC--CCceEec-CCcceeEEEEEEEeCCCCceEEEEeCC
Q 017729 272 PEKCRELALHFESQ--GTPIMIG-GGVLAYTLLGVDYNEASGDCAFLILDP 319 (367)
Q Consensus 272 ~~l~~~l~~hF~~~--gtPImig-g~ghS~TIvGVe~~~~~G~~~LLIlDP 319 (367)
+.+.+. --|+++- +++..+.|+.-..+ -.++|.||
T Consensus 69 --------~~l~~~~~p~P~i~~~~~~~hfvVl~~~~~-----~~v~I~DP 106 (129)
T cd02424 69 --------LEFLELKNKFIILLKSNGLNHFVIVKKIKK-----NKFIVLDP 106 (129)
T ss_pred --------HHHhhccCCEEEEEecCCCCeEEEEEEEEC-----CEEEEECC
Confidence 112233 4578874 55555666643212 12899999
No 14
>KOG1089 consensus Myotubularin-related phosphatidylinositol 3-phosphate 3-phosphatase MTM6 [General function prediction only]
Probab=54.79 E-value=1.2 Score=48.15 Aligned_cols=105 Identities=16% Similarity=0.278 Sum_probs=69.7
Q ss_pred CCCCChHHHHHHHHhcCCCCC-CccccccccchhHHHHHHHHhhCCcEEEEEecCCCChhHHHHHHHHHhccCCCceEec
Q 017729 214 VDVPSHREIQQALVDIGDKDP-SFVGSREWIGAIELSFVLDKLLGVSCKVLNVRSGAELPEKCRELALHFESQGTPIMIG 292 (367)
Q Consensus 214 ~~vPSI~eIQq~Le~awDK~~-~fvGSrkWIGT~Ev~~vL~~~lGI~ckIv~f~sg~e~~~l~~~l~~hF~~~gtPImig 292 (367)
..||.|..++..+.++=+--. ...-.-+|+++.|.+-=|.+. ...-+...++++.-+..|.+|.++
T Consensus 284 ~~i~nIh~v~~s~~kl~e~c~~~~~~~~~~ls~LE~SgWL~~i-------------~~~L~~a~~ia~~l~~~~~sVlvh 350 (573)
T KOG1089|consen 284 LGIENIHVVRSSLQKLLEVCNNFLPTMDKWLSLLESSGWLKHI-------------RAILKAAAEIAKCLSSEGASVLVH 350 (573)
T ss_pred cCcchHHHHHHHHHHHHHHHhccCccHHHHHHHhhhccHHHHH-------------HHHHHHHHHHHHHHHhCCCeEEEE
Confidence 568888887776665432222 223347899999988777652 122345667788888888888877
Q ss_pred C-CcceeEEEEEEEeCCCCceEEEEeCCCCCCchhhhhhhcCCeEeE
Q 017729 293 G-GVLAYTLLGVDYNEASGDCAFLILDPHYTGNDEHKKIVNGGWCGW 338 (367)
Q Consensus 293 g-~ghS~TIvGVe~~~~~G~~~LLIlDPhytg~~~lk~l~~kGw~gW 338 (367)
. +|-=+|-.= -.+.=|+|||||++-.--.+|++|-|++-
T Consensus 351 csdGwDrT~qV-------~SLaQllLDP~yRTi~GFqsLIeKeWi~~ 390 (573)
T KOG1089|consen 351 CSDGWDRTCQV-------SSLAQLLLDPYYRTIKGFQSLIEKEWISF 390 (573)
T ss_pred ccCCcchhHHH-------HHHHHHHhCchhhhHHHHHHHHHHHHHHc
Confidence 6 343332210 12334789999999988999999998753
No 15
>cd02425 Peptidase_C39F A sub-family of peptidase family C39. Peptidase family C39 mostly contains bacteriocin-processing endopeptidases from bacteria. The cysteine peptidases in family C39 cleave the "double-glycine" leader peptides from the precursors of various bacteriocins (mostly non-lantibiotic). The cleavage is mediated by the transporter as part of the secretion process. Bacteriocins are antibiotic proteins secreted by some species of bacteria that inhibit the growth of other bacterial species. The bacteriocin is synthesized as a precursor with an N-terminal leader peptide, and processing involves removal of the leader peptide by cleavage at a Gly-Gly bond, followed by translocation of the mature bacteriocin across the cytoplasmic membrane. Most endopeptidases of family C39 are N-terminal domains in larger proteins (ABC transporters) that serve both functions. The proposed protease active site is conserved in this sub-family.
Probab=53.09 E-value=1e+02 Score=24.90 Aligned_cols=94 Identities=19% Similarity=0.292 Sum_probs=56.6
Q ss_pred ccchhhHHHHHHHhhccCCCCCCCCChHHHHHHHHhcCCCCCCccccccccchhHHHHHHHHhhCCcEEEEEecCCCChh
Q 017729 193 GCAYRSLQTIISWFRLQHYASVDVPSHREIQQALVDIGDKDPSFVGSREWIGAIELSFVLDKLLGVSCKVLNVRSGAELP 272 (367)
Q Consensus 193 GCGYRnLQml~Sw~~~q~y~~~~vPSI~eIQq~Le~awDK~~~fvGSrkWIGT~Ev~~vL~~~lGI~ckIv~f~sg~e~~ 272 (367)
.||=..+.|++.++...- +..+|.+.+ .+++++ +...++.-+++. +|++++++..+.-
T Consensus 11 ~~~l~~l~~~~~~~~~~~-------~~~~l~~~~--~~~~~~--------~~~~~l~~~a~~-~gl~~~~~~~~~~---- 68 (126)
T cd02425 11 ECGLACYAMILNYFGYKV-------SLNELREKY--ELGRDG--------LSLSYLKQLLEE-YGFKCKVYKISFK---- 68 (126)
T ss_pred cHHHHHHHHHHHHhCCCC-------CHHHHHHhc--cCCCCC--------cCHHHHHHHHHH-CCCcceEEEEchH----
Confidence 599999999988864331 222333221 122111 222344455665 7999999876431
Q ss_pred HHHHHHHHHhccCCCceEecC-CcceeEEEEEEEeCCCCceEEEEeCCCC
Q 017729 273 EKCRELALHFESQGTPIMIGG-GVLAYTLLGVDYNEASGDCAFLILDPHY 321 (367)
Q Consensus 273 ~l~~~l~~hF~~~gtPImigg-~ghS~TIvGVe~~~~~G~~~LLIlDPhy 321 (367)
+.+.+...|+++.. ++|...|.+++. + .++|+||..
T Consensus 69 -------~~l~~~~lP~I~~~~~~~~~Vl~~~~~----~--~~~i~dp~~ 105 (126)
T cd02425 69 -------KNLYPLKLPVIIFWNNNHFVVLEKIKK----N--KVTIVDPAI 105 (126)
T ss_pred -------HHHhhCCCCEEEEEcCCcEEEEEEEEC----C--EEEEEcCCC
Confidence 12334577998865 678888888742 2 388999955
No 16
>smart00230 CysPc Calpain-like thiol protease family. Calpain-like thiol protease family (peptidase family C2). Calcium activated neutral protease (large subunit).
Probab=48.04 E-value=1.1e+02 Score=30.29 Aligned_cols=94 Identities=16% Similarity=0.141 Sum_probs=57.4
Q ss_pred HHHHhcCCCCCCccccccccchhHHHHHHHHhhCCcEEEEEecCCC-ChhHHHHHHHHHhccCCCceEe-----------
Q 017729 224 QALVDIGDKDPSFVGSREWIGAIELSFVLDKLLGVSCKVLNVRSGA-ELPEKCRELALHFESQGTPIMI----------- 291 (367)
Q Consensus 224 q~Le~awDK~~~fvGSrkWIGT~Ev~~vL~~~lGI~ckIv~f~sg~-e~~~l~~~l~~hF~~~gtPImi----------- 291 (367)
..||+|.-| +.||=.=|+.--+.-.|..+.|-++..+++.... +..++...|.++++.. ..|-.
T Consensus 143 ~LLEKAyAK---~~GsY~~i~gg~~~~al~~LTG~~~~~i~l~~~~~~~~~~w~~l~~~~~~g-~lv~~~t~~~~~~~~~ 218 (318)
T smart00230 143 ALLEKAYAK---LNGCYEALKGGSTTEALEDLTGGVAESIDLKEASKDPDNLFEDLFKAFERG-SLMGCSIGAGTAVEEE 218 (318)
T ss_pred HHHHHHHHH---HcCCCcccCCCCHHHHHHHhcCCCeEEEEcccccCCHHHHHHHHHHHHhCC-CeEEEEcCCCCcchhh
Confidence 356776655 2334334443334444666789999999987654 4566778888888753 11111
Q ss_pred -----c-CCcceeEEEEEEEeCCCCceEEEEeCCCC
Q 017729 292 -----G-GGVLAYTLLGVDYNEASGDCAFLILDPHY 321 (367)
Q Consensus 292 -----g-g~ghS~TIvGVe~~~~~G~~~LLIlDPhy 321 (367)
| ..+|||+|+++..-...+..-+.+-+|.-
T Consensus 219 ~~~~~GLv~~HaYsVl~v~~~~~~~~~Ll~lrNPWg 254 (318)
T smart00230 219 EQKDCGLVKGHAYSVTDVREVQGRRQELLRLRNPWG 254 (318)
T ss_pred hhhhcCcccCccEEEEEEEEEecCCeEEEEEECCCC
Confidence 1 14899999999876422222456778864
No 17
>cd02419 Peptidase_C39C A sub-family of peptidase family C39. Peptidase family C39 mostly contains bacteriocin-processing endopeptidases from bacteria. The cysteine peptidases in family C39 cleave the "double-glycine" leader peptides from the precursors of various bacteriocins (mostly non-lantibiotic). The cleavage is mediated by the transporter as part of the secretion process. Bacteriocins are antibiotic proteins secreted by some species of bacteria that inhibit the growth of other bacterial species. The bacteriocin is synthesized as a precursor with an N-terminal leader peptide, and processing involves removal of the leader peptide by cleavage at a Gly-Gly bond, followed by translocation of the mature bacteriocin across the cytoplasmic membrane. Most endopeptidases of family C39 are N-terminal domains in larger proteins (ABC transporters) that serve both functions. The proposed protease active site is conserved in this sub-family.
Probab=45.87 E-value=1e+02 Score=25.04 Aligned_cols=93 Identities=18% Similarity=0.189 Sum_probs=52.5
Q ss_pred ccchhhHHHHHHHhhccCCCCCCCCChHHHHHHHHhcCCCCCCccccccccchhHHHHHHHHhhCCcEEEEEecCCCChh
Q 017729 193 GCAYRSLQTIISWFRLQHYASVDVPSHREIQQALVDIGDKDPSFVGSREWIGAIELSFVLDKLLGVSCKVLNVRSGAELP 272 (367)
Q Consensus 193 GCGYRnLQml~Sw~~~q~y~~~~vPSI~eIQq~Le~awDK~~~fvGSrkWIGT~Ev~~vL~~~lGI~ckIv~f~sg~e~~ 272 (367)
-||=..+.|++.++...- +..+|.+.+ .|+.++ .-..++.-+++. +|++++.+..+..
T Consensus 11 ~~~l~~l~~~~~~~g~~~-------~~~~l~~~~--~~~~~~--------~~~~~l~~~a~~-~Gl~~~~~~~~~~---- 68 (127)
T cd02419 11 ECGLACLAMIASYHGHHV-------DLASLRQRF--PVSLKG--------ATLADLIDIAQQ-LGLSTRALRLDLE---- 68 (127)
T ss_pred cHHHHHHHHHHHHcCCCC-------CHHHHHHHc--CCCCCC--------cCHHHHHHHHHH-CCCceeEEEccHH----
Confidence 599999999998864432 222222211 011110 111223333554 7999988875421
Q ss_pred HHHHHHHHHhccCCCceEecC-CcceeEEEEEEEeCCCCceEEEEeCCCC
Q 017729 273 EKCRELALHFESQGTPIMIGG-GVLAYTLLGVDYNEASGDCAFLILDPHY 321 (367)
Q Consensus 273 ~l~~~l~~hF~~~gtPImigg-~ghS~TIvGVe~~~~~G~~~LLIlDPhy 321 (367)
. +.+...|+++.. +||...|.|++. + .++|+||..
T Consensus 69 ~--------l~~~~lP~i~~~~~g~~~Vl~~~~~----~--~~~i~dp~~ 104 (127)
T cd02419 69 E--------LGQLKLPCILHWDMNHFVVLKKVSR----R--RIVIHDPAL 104 (127)
T ss_pred H--------HhhCCCCEEEEECCCEEEEEEEEcC----C--EEEEECCcc
Confidence 1 223456888754 678888888632 1 389999975
No 18
>PF14399 Transpep_BrtH: NlpC/p60-like transpeptidase
Probab=42.95 E-value=1e+02 Score=29.45 Aligned_cols=62 Identities=23% Similarity=0.302 Sum_probs=37.9
Q ss_pred HhhCCcEEEEEecCCCChhHHHHHHHHHhccCCCceEec----------------CCcceeEEEEEEEeCCCCceEEEEe
Q 017729 254 KLLGVSCKVLNVRSGAELPEKCRELALHFESQGTPIMIG----------------GGVLAYTLLGVDYNEASGDCAFLIL 317 (367)
Q Consensus 254 ~~lGI~ckIv~f~sg~e~~~l~~~l~~hF~~~gtPImig----------------g~ghS~TIvGVe~~~~~G~~~LLIl 317 (367)
..+|++++...+++.+ ...+.|.+.-. .|.||+++ |.+|.-.|+|+|..+ -.++|.
T Consensus 60 ~~lG~~~~~~~~~~~~---~~~~~l~~~l~-~g~pv~~~~D~~~lpy~~~~~~~~~~~H~i~v~G~d~~~----~~~~v~ 131 (317)
T PF14399_consen 60 ERLGIKYEWREFSSPD---EAWEELKEALD-AGRPVIVWVDMYYLPYRPNYYKKHHADHYIVVYGYDEEE----DVFYVS 131 (317)
T ss_pred HHCCceEEEEecCCHH---HHHHHHHHHHh-CCCceEEEeccccCCCCccccccccCCcEEEEEEEeCCC----CEEEEE
Confidence 3589999977765544 33444444433 34555543 356788888888543 238889
Q ss_pred CCCCCC
Q 017729 318 DPHYTG 323 (367)
Q Consensus 318 DPhytg 323 (367)
||....
T Consensus 132 D~~~~~ 137 (317)
T PF14399_consen 132 DPPSYE 137 (317)
T ss_pred cCCCCc
Confidence 994433
No 19
>cd02417 Peptidase_C39_likeA A sub-family of peptidase C39 which contains Cyclolysin and Hemolysin processing peptidases. Peptidase family C39 mostly contains bacteriocin-processing endopeptidases from bacteria. The cysteine peptidases in family C39 cleave the "double-glycine" leader peptides from the precursors of various bacteriocins (mostly non-lantibiotic). The cleavage is mediated by the transporter as part of the secretion process. Bacteriocins are antibiotic proteins secreted by some species of bacteria that inhibit the growth of other bacterial species. The bacteriocin is synthesized as a precursor with an N-terminal leader peptide, and processing involves removal of the leader peptide by cleavage at a Gly-Gly bond, followed by translocation of the mature bacteriocin across the cytoplasmic membrane. Most endopeptidases of family C39 are N-terminal domains in larger proteins (ABC transporters) that serve both functions. The proposed protease active site is not conserved in this
Probab=40.71 E-value=1.9e+02 Score=23.22 Aligned_cols=96 Identities=15% Similarity=0.204 Sum_probs=55.2
Q ss_pred ccchhhHHHHHHHhhccCCCCCCCCChHHHHHHHHhcCCCCCCccccccccchhHHHHHHHHhhCCcEEEEEecCCCChh
Q 017729 193 GCAYRSLQTIISWFRLQHYASVDVPSHREIQQALVDIGDKDPSFVGSREWIGAIELSFVLDKLLGVSCKVLNVRSGAELP 272 (367)
Q Consensus 193 GCGYRnLQml~Sw~~~q~y~~~~vPSI~eIQq~Le~awDK~~~fvGSrkWIGT~Ev~~vL~~~lGI~ckIv~f~sg~e~~ 272 (367)
-||-..+.|++.++...- +...|++.+. |++++ .....+.-+++. +|+.++.+.++-
T Consensus 6 ~~~l~~l~~i~~~~g~~~-------~~~~l~~~~~--~~~~~--------~~~~~l~~~a~~-~Gl~~~~~~~~~----- 62 (121)
T cd02417 6 DSGLLALVLLARYHGIAA-------DPEQLRHEFG--LAGEP--------FNSTELLLAAKS-LGLKAKAVRQPV----- 62 (121)
T ss_pred ccHHHHHHHHHHHcCCCC-------CHHHHHHHhc--CCCCC--------CCHHHHHHHHHH-cCCeeEEEecCH-----
Confidence 588888898888864432 2233333221 11110 122334444564 799999987642
Q ss_pred HHHHHHHHHhccCCCceEec-CCcceeEEEEEEEeCCCCceEEEEeCCCCCCc
Q 017729 273 EKCRELALHFESQGTPIMIG-GGVLAYTLLGVDYNEASGDCAFLILDPHYTGN 324 (367)
Q Consensus 273 ~l~~~l~~hF~~~gtPImig-g~ghS~TIvGVe~~~~~G~~~LLIlDPhytg~ 324 (367)
+.+.+.--|+++. .+|+...|.+++. + .++|.||...+.
T Consensus 63 -------~~l~~~~lP~I~~~~~g~~~Vl~~~~~----~--~~~i~dp~~~~~ 102 (121)
T cd02417 63 -------ERLARLPLPALAWDDDGGHFILAKLDG----Q--KYLIQDPISQRP 102 (121)
T ss_pred -------HHhccCCCCEEEEccCCCEEEEEEEcC----C--CEEEECCCcCCC
Confidence 1233445699884 4778777777552 1 299999966433
No 20
>cd02423 Peptidase_C39G A sub-family of peptidase family C39. Peptidase family C39 mostly contains bacteriocin-processing endopeptidases from bacteria. The cysteine peptidases in family C39 cleave the "double-glycine" leader peptides from the precursors of various bacteriocins (mostly non-lantibiotic). The cleavage is mediated by the transporter as part of the secretion process. Bacteriocins are antibiotic proteins secreted by some species of bacteria that inhibit the growth of other bacterial species. The bacteriocin is synthesized as a precursor with an N-terminal leader peptide, and processing involves removal of the leader peptide by cleavage at a Gly-Gly bond, followed by translocation of the mature bacteriocin across the cytoplasmic membrane. Most endopeptidases of family C39 are N-terminal domains in larger proteins (ABC transporters) that serve both functions. The proposed protease active site is conserved in this sub-family of proteins with a single peptidase domain, which are
Probab=39.33 E-value=2.1e+02 Score=23.18 Aligned_cols=96 Identities=14% Similarity=0.295 Sum_probs=54.8
Q ss_pred CCcccchhhHHHHHHHhh-ccCCCCCCCCChHHHHHHHHhcCCCCCCccccccccchhHHHHHHHHhhCCcEEEEEecCC
Q 017729 190 LGWGCAYRSLQTIISWFR-LQHYASVDVPSHREIQQALVDIGDKDPSFVGSREWIGAIELSFVLDKLLGVSCKVLNVRSG 268 (367)
Q Consensus 190 ~GWGCGYRnLQml~Sw~~-~q~y~~~~vPSI~eIQq~Le~awDK~~~fvGSrkWIGT~Ev~~vL~~~lGI~ckIv~f~sg 268 (367)
..+-||=..+.|++.++. .+- +..++.+.+. ++.++ .-..++.-+++. +|+.++++..+.
T Consensus 8 ~~~~~~l~~l~~~~~~~g~~~~-------~~~~l~~~~~--~~~~~--------~s~~~l~~~a~~-~Gl~~~~~~~~~- 68 (129)
T cd02423 8 YDFSCGPAALATLLRYYGGINI-------TEQEVLKLML--IRSEG--------FSMLDLKRYAEA-LGLKANGYRLNL- 68 (129)
T ss_pred CCCChHHHHHHHHHHhcCCCCC-------CHHHHHHHhC--cccCC--------cCHHHHHHHHHH-CCCcceEEEcCH-
Confidence 344799999999999876 332 2333333221 11110 112233444665 799999987643
Q ss_pred CChhHHHHHHHHHhccCCCceEecC----CcceeEEEEEEEeCCCCceEEEEeCCCC
Q 017729 269 AELPEKCRELALHFESQGTPIMIGG----GVLAYTLLGVDYNEASGDCAFLILDPHY 321 (367)
Q Consensus 269 ~e~~~l~~~l~~hF~~~gtPImigg----~ghS~TIvGVe~~~~~G~~~LLIlDPhy 321 (367)
.. ..+...|+++.- ++|.-.|.+++. + +++|.||..
T Consensus 69 ---~~--------L~~~~lP~i~~~~~~~~~~~vvl~~~~~----~--~~~i~dp~~ 108 (129)
T cd02423 69 ---DK--------LNALQIPVIVLVNNGGYGHFVVIKGIDG----D--RVLVGDPAL 108 (129)
T ss_pred ---HH--------HhhCCCCEEEEEecCCCceEEEEEEEeC----C--EEEEECCCC
Confidence 11 123466998853 446655555652 1 389999965
No 21
>cd02420 Peptidase_C39D A sub-family of peptidase family C39. Peptidase family C39 mostly contains bacteriocin-processing endopeptidases from bacteria. The cysteine peptidases in family C39 cleave the "double-glycine" leader peptides from the precursors of various bacteriocins (mostly non-lantibiotic). The cleavage is mediated by the transporter as part of the secretion process. Bacteriocins are antibiotic proteins secreted by some species of bacteria that inhibit the growth of other bacterial species. The bacteriocin is synthesized as a precursor with an N-terminal leader peptide, and processing involves removal of the leader peptide by cleavage at a Gly-Gly bond, followed by translocation of the mature bacteriocin across the cytoplasmic membrane. Most endopeptidases of family C39 are N-terminal domains in larger proteins (ABC transporters) that serve both functions. The proposed protease active site is conserved in this sub-family.
Probab=35.78 E-value=1.9e+02 Score=23.43 Aligned_cols=93 Identities=17% Similarity=0.171 Sum_probs=52.7
Q ss_pred ccchhhHHHHHHHhhccCCCCCCCCChHHHHHHHHhcCCCCCCccccccccchhHHHHHHHHhhCCcEEEEEecCCCChh
Q 017729 193 GCAYRSLQTIISWFRLQHYASVDVPSHREIQQALVDIGDKDPSFVGSREWIGAIELSFVLDKLLGVSCKVLNVRSGAELP 272 (367)
Q Consensus 193 GCGYRnLQml~Sw~~~q~y~~~~vPSI~eIQq~Le~awDK~~~fvGSrkWIGT~Ev~~vL~~~lGI~ckIv~f~sg~e~~ 272 (367)
-||=..+.+++.++...- +..+|...+ .+++++ . -..++.-+++. +|++++.+..+..
T Consensus 11 ~~gl~~l~~i~~~~g~~~-------~~~~l~~~~--~~~~~~-~-------~~~~l~~~a~~-~Gl~~~~~~~~~~---- 68 (125)
T cd02420 11 ECGAASLAIILAYYGRYV-------PLSELRIAC--GVSRDG-S-------NASNLLKAARE-YGLTAKGYKKDLE---- 68 (125)
T ss_pred CHHHHHHHHHHHHcCCCC-------CHHHHHHHc--CCCCCC-C-------CHHHHHHHHHH-cCcccceEecCHH----
Confidence 599999999888865432 222222211 111111 0 11122333454 6888888765321
Q ss_pred HHHHHHHHHhccCCCceEec-CCcceeEEEEEEEeCCCCceEEEEeCCCC
Q 017729 273 EKCRELALHFESQGTPIMIG-GGVLAYTLLGVDYNEASGDCAFLILDPHY 321 (367)
Q Consensus 273 ~l~~~l~~hF~~~gtPImig-g~ghS~TIvGVe~~~~~G~~~LLIlDPhy 321 (367)
. +.+..-|+++. .+||...|.|++.+ .++|.||..
T Consensus 69 ~--------L~~~~lP~I~~~~~g~~~Vl~~~~~~------~~~i~dp~~ 104 (125)
T cd02420 69 A--------LREVSLPAIVFWNFNHFLVVEGFDKR------KVFLNDPAT 104 (125)
T ss_pred H--------HhcCCCCEEEEeCCCEEEEEEEEeCC------EEEEECCCc
Confidence 1 22345688875 57899999987622 489999965
No 22
>KOG2947 consensus Carbohydrate kinase [Carbohydrate transport and metabolism]
Probab=34.36 E-value=1.1e+02 Score=30.95 Aligned_cols=77 Identities=23% Similarity=0.316 Sum_probs=49.2
Q ss_pred CCCChHHHHHHHHhcCCCCCCccccccccchhHHHHHHHHhhCCcEEEEEec-CCCChhHHHHHHHHH--------hccC
Q 017729 215 DVPSHREIQQALVDIGDKDPSFVGSREWIGAIELSFVLDKLLGVSCKVLNVR-SGAELPEKCRELALH--------FESQ 285 (367)
Q Consensus 215 ~vPSI~eIQq~Le~awDK~~~fvGSrkWIGT~Ev~~vL~~~lGI~ckIv~f~-sg~e~~~l~~~l~~h--------F~~~ 285 (367)
..|=-.+-|+.++..|.+++. +.-++++|+ +||.+|..+-+= ++..+..+++-+..+ |++.
T Consensus 23 ~~~fe~~~~r~~~g~wqRgG~---------asNvcTvlr-lLG~~cef~Gvlsr~~~f~~lLddl~~rgIdishcpftd~ 92 (308)
T KOG2947|consen 23 KYPFEDSEIRCLSGRWQRGGN---------ASNVCTVLR-LLGAPCEFFGVLSRGHVFRFLLDDLRRRGIDISHCPFTDH 92 (308)
T ss_pred CCCCCccceehhhhhhhcCCC---------cchHHHHHH-HhCCchheeeecccchhHHHHHHHHHhcCCCcccCccccC
Confidence 445555678899999998864 345788898 589999998864 455455555555433 2333
Q ss_pred CCc---eEecCCcceeEEE
Q 017729 286 GTP---IMIGGGVLAYTLL 301 (367)
Q Consensus 286 gtP---Imigg~ghS~TIv 301 (367)
.+| |+++...-++||+
T Consensus 93 ~pp~ssiI~~r~s~trTil 111 (308)
T KOG2947|consen 93 SPPFSSIIINRNSGTRTIL 111 (308)
T ss_pred CCCcceEEEecCCCceEEE
Confidence 333 4555555556665
No 23
>PF14229 DUF4332: Domain of unknown function (DUF4332)
Probab=31.45 E-value=44 Score=28.87 Aligned_cols=37 Identities=11% Similarity=0.167 Sum_probs=25.8
Q ss_pred ccchhhHHHHHHH----hhc-------cCCCCCCC-CChHHHHHHHHhc
Q 017729 193 GCAYRSLQTIISW----FRL-------QHYASVDV-PSHREIQQALVDI 229 (367)
Q Consensus 193 GCGYRnLQml~Sw----~~~-------q~y~~~~v-PSI~eIQq~Le~a 229 (367)
.|||+|...+... +.. .......+ ||..++++||+.|
T Consensus 71 ~AGv~Tv~~LA~~~p~~L~~~l~~~n~~~~~~r~~~p~~~~v~~WI~~A 119 (122)
T PF14229_consen 71 HAGVDTVEELAQRNPQNLHQKLGRLNRKLKLRRQLCPSLEEVQEWIEQA 119 (122)
T ss_pred HhCcCcHHHHHhCCHHHHHHHHHHHHHHhcCCcCCCCCHHHHHHHHHHH
Confidence 6899998888664 110 11223445 9999999999987
No 24
>cd02421 Peptidase_C39_likeD A sub-family of peptidase family C39. Peptidase family C39 mostly contains bacteriocin-processing endopeptidases from bacteria. The cysteine peptidases in family C39 cleave the "double-glycine" leader peptides from the precursors of various bacteriocins (mostly non-lantibiotic). The cleavage is mediated by the transporter as part of the secretion process. Bacteriocins are antibiotic proteins secreted by some species of bacteria that inhibit the growth of other bacterial species. The bacteriocin is synthesized as a precursor with an N-terminal leader peptide, and processing involves removal of the leader peptide by cleavage at a Gly-Gly bond, followed by translocation of the mature bacteriocin across the cytoplasmic membrane. Most endopeptidases of family C39 are N-terminal domains in larger proteins (ABC transporters) that serve both functions. The proposed protease active site is not conserved in this sub-family.
Probab=30.81 E-value=2.9e+02 Score=22.34 Aligned_cols=54 Identities=28% Similarity=0.316 Sum_probs=36.3
Q ss_pred HHHhhCCcEEEEEecCCCChhHHHHHHHHHhccCCCceEec-CCcceeEEEEEEEeCCCCceEEEEeCCC-CCCc
Q 017729 252 LDKLLGVSCKVLNVRSGAELPEKCRELALHFESQGTPIMIG-GGVLAYTLLGVDYNEASGDCAFLILDPH-YTGN 324 (367)
Q Consensus 252 L~~~lGI~ckIv~f~sg~e~~~l~~~l~~hF~~~gtPImig-g~ghS~TIvGVe~~~~~G~~~LLIlDPh-ytg~ 324 (367)
++. +|.++|.+..+.. -+.+.-.|.++. .+|+...|.+++.+ . ++|+||. -.++
T Consensus 48 a~~-~Gl~~~~~~~~~~------------~l~~~~lP~i~~~~~g~~~Vl~~~~~~----~--~~i~dp~~~~~~ 103 (124)
T cd02421 48 AAR-AGLSARVVRRPLD------------AIPTLLLPAILLLKNGRACVLLGVDDG----H--ARILDPESGGGE 103 (124)
T ss_pred HHH-CCCcceeeeCCHH------------HCCcccCCEEEEEcCCCEEEEEEecCC----e--EEEEccCCCCCc
Confidence 554 7888888876421 123446799985 47788888887642 1 9999997 4444
No 25
>cd00044 CysPc Calpains, domains IIa, IIb; calcium-dependent cytoplasmic cysteine proteinases, papain-like. Functions in cytoskeletal remodeling processes, cell differentiation, apoptosis and signal transduction.
Probab=28.02 E-value=3.7e+02 Score=26.31 Aligned_cols=96 Identities=20% Similarity=0.145 Sum_probs=55.2
Q ss_pred HHHHHhcCCCCCCccccccccchhHHHHHHHHhhCCcEEEEEecCCC---ChhHHHHHHHHHhccCCCceEe--------
Q 017729 223 QQALVDIGDKDPSFVGSREWIGAIELSFVLDKLLGVSCKVLNVRSGA---ELPEKCRELALHFESQGTPIMI-------- 291 (367)
Q Consensus 223 Qq~Le~awDK~~~fvGSrkWIGT~Ev~~vL~~~lGI~ckIv~f~sg~---e~~~l~~~l~~hF~~~gtPImi-------- 291 (367)
-..||+|.-| +.||-.=|-.--....|..+.|-++..++..... ....+.+.+..+.+. +.+|..
T Consensus 150 ~~LlEKAyAK---~~GsY~~i~gg~~~~al~~LTG~~~~~i~~~~~~~~~~~~~~~~~l~~~~~~-~~lv~~~t~~~~~~ 225 (315)
T cd00044 150 VALLEKAYAK---LHGSYEALVGGNTAEALEDLTGGPTERIDLKSADASSGDNDLFALLLSFLQG-GSLIGCSTGSRSEE 225 (315)
T ss_pred HHHHHHHHHh---hcCCccccCCCCHHHHHHHhhCCCcEEEEccccccccCHHHHHHHHHHHhhC-CCEEEEEcCCCCcc
Confidence 4467777655 2222222111112233556779999998887653 245566667776653 211111
Q ss_pred ------c-CCcceeEEEEEEEeCCCCceEEEEeCCCCC
Q 017729 292 ------G-GGVLAYTLLGVDYNEASGDCAFLILDPHYT 322 (367)
Q Consensus 292 ------g-g~ghS~TIvGVe~~~~~G~~~LLIlDPhyt 322 (367)
| ..+|||+|+++......|.--+.+-+|.=.
T Consensus 226 ~~~~~~Gl~~~HaY~Vl~~~~~~~~~~~lv~lrNPWg~ 263 (315)
T cd00044 226 EARTANGLVKGHAYSVLDVREVQEEGLRLLRLRNPWGV 263 (315)
T ss_pred hhhccCCcccCcceEEeEEEEEccCceEEEEecCCccC
Confidence 1 158999999998764324555678899643
No 26
>PRK13977 myosin-cross-reactive antigen; Provisional
Probab=22.09 E-value=2.3e+02 Score=31.18 Aligned_cols=118 Identities=16% Similarity=0.250 Sum_probs=74.7
Q ss_pred cccccccchhHHHHHHHHh----hCC-cEEEEEecCCCChhHHHHHHHHHhccCCCceEecCC---------cceeEEEE
Q 017729 237 VGSREWIGAIELSFVLDKL----LGV-SCKVLNVRSGAELPEKCRELALHFESQGTPIMIGGG---------VLAYTLLG 302 (367)
Q Consensus 237 vGSrkWIGT~Ev~~vL~~~----lGI-~ckIv~f~sg~e~~~l~~~l~~hF~~~gtPImigg~---------ghS~TIvG 302 (367)
.+-..|+.+.|.-..+..| .++ ...-+.|...+..+.+++-|.+|.+..|-.+.++.. +.-.++.|
T Consensus 186 FaF~~whSA~E~rry~~rf~~~~~~l~~~s~l~ft~ynqyeSLV~PL~~~Le~~GV~f~~~t~VtdL~~~~d~~~~~Vtg 265 (576)
T PRK13977 186 FAFEKWHSALEMRRYMHRFIHHIGGLPDLSGLKFTKYNQYESLVLPLIKYLEDHGVDFQYGTKVTDIDFDITGGKKTATA 265 (576)
T ss_pred HCCchhhHHHHHHHHHHHHHHhhccCCccccccCCCCCchhHHHHHHHHHHHhCCCEEEeCCEEEEEEEcCCCCceEEEE
Confidence 4445999999999988766 343 455667777888899999999999998877776652 22378999
Q ss_pred EEEeCCC-------CceEEEEeCCC-CCCch---------hhhhhhcCCeEeEEecccCCCccccccCCe
Q 017729 303 VDYNEAS-------GDCAFLILDPH-YTGND---------EHKKIVNGGWCGWKKAVDSKGKNFFLHDKF 355 (367)
Q Consensus 303 Ve~~~~~-------G~~~LLIlDPh-ytg~~---------~lk~l~~kGw~gWKk~~~~~g~~~f~~~~f 355 (367)
|+...+. ++-.+.|+=-. ++.+. ....-...+|.=|+++.. +.+.|=....|
T Consensus 266 I~~~~~~~~~~I~l~~~DlVivTnGs~t~ns~~G~~~~p~~~~~~~~~~w~LW~~la~-~~~~fG~P~~F 334 (576)
T PRK13977 266 IHLTRNGKEETIDLTEDDLVFVTNGSITESSTYGDMDTPAPLNRELGGSWTLWKNIAA-QSPEFGNPDKF 334 (576)
T ss_pred EEEEeCCceeEEEecCCCEEEEeCCcCccccccCCCCCCCCCCCCCCccHHHHHHHHh-cCccCCChhhh
Confidence 9886411 12234444332 22111 112223578999998843 33444334444
No 27
>PF15256 SPATIAL: SPATIAL
Probab=20.34 E-value=41 Score=32.07 Aligned_cols=19 Identities=32% Similarity=0.356 Sum_probs=16.8
Q ss_pred CCChHHHHHHHHhcCCCCC
Q 017729 216 VPSHREIQQALVDIGDKDP 234 (367)
Q Consensus 216 vPSI~eIQq~Le~awDK~~ 234 (367)
.=|+.+||+||..|.+|.+
T Consensus 166 TDSL~~vq~WLl~A~~kEK 184 (196)
T PF15256_consen 166 TDSLSAVQQWLLSASDKEK 184 (196)
T ss_pred cCCHHHHHHHHHhCChhhH
Confidence 4589999999999999976
No 28
>TIGR03796 NHPM_micro_ABC1 NHPM bacteriocin system ABC transporter, peptidase/ATP-binding protein. This protein describes a multidomain ABC transporter subunit that is one of three protein families associated with some regularity with a distinctive family of putative bacteriocins. It includes a bacteriocin-processing peptidase domain at the N-terminus. Model TIGR03793 describes a conserved propeptide region for this bacteriocin family, unusual because it shows obvious homology to a region of the enzyme nitrile hydratase up to the classic Gly-Gly cleavage motif. This family is therefore predicted to be a subunit of a bacteriocin processing and export system characteristic to this system that we designate NHPM, Nitrile Hydratase Propeptide Microcin.
Probab=20.02 E-value=5.1e+02 Score=28.13 Aligned_cols=93 Identities=17% Similarity=0.195 Sum_probs=53.6
Q ss_pred ccchhhHHHHHHHhhccCCCCCCCCChHHHHHHHHhcCCCCCCccccccccchhHHHHHHHHhhCCcEEEEEecCCCChh
Q 017729 193 GCAYRSLQTIISWFRLQHYASVDVPSHREIQQALVDIGDKDPSFVGSREWIGAIELSFVLDKLLGVSCKVLNVRSGAELP 272 (367)
Q Consensus 193 GCGYRnLQml~Sw~~~q~y~~~~vPSI~eIQq~Le~awDK~~~fvGSrkWIGT~Ev~~vL~~~lGI~ckIv~f~sg~e~~ 272 (367)
-||--.++|++.|+-..- ++.++.+...- +++ | ..-.++.-+++. +|..++.+..+ .+
T Consensus 12 dCg~acl~mi~~~~g~~~-------~~~~lr~~~~~--~~~----g----~s~~~l~~~~~~-~g~~~~~~~~~----~~ 69 (710)
T TIGR03796 12 ECGAASLAMILAYYGRYV-------PLEELREECGV--SRD----G----SKASNLLKAARS-YGLEAKGFRKE----LD 69 (710)
T ss_pred cHHHHHHHHHHHHcCCCC-------CHHHHHHHcCC--CCC----C----CCHHHHHHHHHH-CCCEeEEEecC----HH
Confidence 499999999999975431 12222211000 000 0 122233333555 79999988753 11
Q ss_pred HHHHHHHHHhccCCCceEec-CCcceeEEEEEEEeCCCCceEEEEeCCCC
Q 017729 273 EKCRELALHFESQGTPIMIG-GGVLAYTLLGVDYNEASGDCAFLILDPHY 321 (367)
Q Consensus 273 ~l~~~l~~hF~~~gtPImig-g~ghS~TIvGVe~~~~~G~~~LLIlDPhy 321 (367)
+ +.+...|.++. +++|--.+.+++.+ ++.|.||..
T Consensus 70 ~--------l~~~~lP~i~~~~~~h~vvl~~~~~~------~~~i~dP~~ 105 (710)
T TIGR03796 70 A--------LAELPLPYIVFWNFNHFVVVEGFRGG------RVYLNDPAL 105 (710)
T ss_pred H--------hccCCCCEEEEEcCCcEEEEEEEeCC------EEEEECCCC
Confidence 1 13345688876 56777777776433 499999987
Done!