Query psy8067
Match_columns 239
No_of_seqs 188 out of 1394
Neff 8.9
Searched_HMMs 46136
Date Fri Aug 16 17:31:36 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy8067.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/8067hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 TIGR02110 PQQ_syn_pqqF coenzym 100.0 9.9E-29 2.1E-33 229.3 21.6 172 66-239 2-177 (696)
2 KOG0960|consensus 100.0 1.7E-28 3.7E-33 207.4 16.1 175 61-238 31-205 (467)
3 COG0612 PqqL Predicted Zn-depe 100.0 6.6E-28 1.4E-32 216.3 18.5 176 61-239 14-190 (438)
4 PRK15101 protease3; Provisiona 99.9 6.9E-27 1.5E-31 227.6 19.8 177 61-239 41-222 (961)
5 KOG2067|consensus 99.9 1.6E-26 3.5E-31 195.6 14.7 176 60-238 21-196 (472)
6 PF00675 Peptidase_M16: Insuli 99.9 5.9E-25 1.3E-29 169.3 15.7 147 74-223 1-148 (149)
7 COG1025 Ptr Secreted/periplasm 99.9 2.4E-21 5.2E-26 180.3 16.8 179 59-239 19-202 (937)
8 KOG0959|consensus 99.9 9.6E-21 2.1E-25 178.7 16.7 177 61-239 25-207 (974)
9 KOG2583|consensus 99.8 4.2E-18 9.1E-23 144.9 16.0 168 65-238 24-192 (429)
10 PTZ00432 falcilysin; Provision 99.8 3.3E-18 7.1E-23 167.8 15.3 168 64-239 92-296 (1119)
11 PRK15101 protease3; Provisiona 99.4 2.5E-11 5.5E-16 118.9 16.7 165 63-239 522-691 (961)
12 COG1026 Predicted Zn-dependent 99.3 1.5E-11 3.2E-16 116.0 10.1 166 68-239 26-207 (978)
13 KOG2019|consensus 98.9 2.6E-09 5.6E-14 97.2 5.8 164 70-239 60-243 (998)
14 KOG0961|consensus 98.9 7.8E-09 1.7E-13 94.1 8.8 162 71-239 28-202 (1022)
15 PF08367 M16C_assoc: Peptidase 98.1 0.00014 3E-09 60.6 14.5 118 70-189 76-206 (248)
16 COG1026 Predicted Zn-dependent 97.2 0.019 4E-07 55.7 16.1 127 67-195 530-669 (978)
17 PTZ00432 falcilysin; Provision 97.1 0.016 3.5E-07 58.2 15.3 125 68-194 664-806 (1119)
18 KOG2019|consensus 97.1 0.015 3.3E-07 54.2 13.1 128 66-195 563-702 (998)
19 KOG0959|consensus 96.3 0.17 3.8E-06 49.6 15.1 158 71-238 517-677 (974)
20 COG1025 Ptr Secreted/periplasm 96.1 0.31 6.7E-06 47.4 15.2 161 68-238 507-670 (937)
21 PF03410 Peptidase_M44: Protei 95.2 0.038 8.2E-07 49.5 5.3 104 67-185 2-111 (590)
22 KOG2067|consensus 94.9 0.46 1E-05 41.9 10.9 147 83-238 263-427 (472)
23 PHA03081 putative metalloprote 94.4 0.065 1.4E-06 48.1 4.6 91 67-172 2-93 (595)
24 KOG0960|consensus 92.4 2 4.4E-05 38.0 10.4 159 72-238 258-431 (467)
25 PF05193 Peptidase_M16_C: Pept 89.1 5.5 0.00012 30.1 9.6 108 72-185 67-184 (184)
26 COG0612 PqqL Predicted Zn-depe 71.6 63 0.0014 29.0 10.9 110 125-238 291-413 (438)
27 KOG0961|consensus 67.1 20 0.00044 34.3 6.7 135 81-218 559-706 (1022)
28 PF08288 PIGA: PIGA (GPI ancho 48.2 16 0.00034 25.4 2.0 15 66-80 9-23 (90)
29 PRK11840 bifunctional sulfur c 38.2 29 0.00063 30.1 2.7 56 82-138 54-111 (326)
30 cd04914 ACT_AKi-DapG-BS_1 ACT 37.8 93 0.002 19.8 4.5 48 124-172 17-64 (67)
31 cd04922 ACT_AKi-HSDH-ThrA_2 AC 36.9 1E+02 0.0022 18.9 4.9 46 124-169 19-65 (66)
32 PF09186 DUF1949: Domain of un 34.9 1E+02 0.0022 18.4 7.1 49 120-168 5-53 (56)
33 PHA02541 23 major capsid prote 34.3 68 0.0015 29.4 4.4 115 15-143 368-485 (518)
34 PF06135 DUF965: Bacterial pro 34.3 1.1E+02 0.0023 20.7 4.3 58 175-237 14-71 (79)
35 PHA02548 24 capsid vertex prot 34.3 63 0.0014 28.7 4.1 63 15-95 262-325 (412)
36 cd04919 ACT_AK-Hom3_2 ACT doma 32.9 1.2E+02 0.0026 18.7 4.9 46 124-169 19-65 (66)
37 PTZ00115 40S ribosomal protein 32.6 3E+02 0.0065 23.1 7.6 44 60-107 144-191 (290)
38 PF13840 ACT_7: ACT domain ; P 32.3 1.3E+02 0.0029 19.0 4.9 40 125-166 26-65 (65)
39 PF12122 DUF3582: Protein of u 31.9 1.6E+02 0.0036 20.8 5.3 48 123-171 13-61 (101)
40 cd04936 ACT_AKii-LysC-BS-like_ 31.0 1.2E+02 0.0027 18.2 4.7 44 124-168 18-61 (63)
41 PRK05473 hypothetical protein; 30.1 1.7E+02 0.0036 20.1 4.7 58 175-237 17-74 (86)
42 cd04923 ACT_AK-LysC-DapG-like_ 29.7 1.3E+02 0.0028 18.0 4.7 44 124-168 18-61 (63)
43 cd04916 ACT_AKiii-YclM-BS_2 AC 29.6 1.4E+02 0.003 18.3 5.0 46 124-169 19-65 (66)
44 COG4565 CitB Response regulato 26.9 2.4E+02 0.0053 23.1 6.0 82 152-238 81-166 (224)
45 cd04924 ACT_AK-Arch_2 ACT doma 26.6 1.6E+02 0.0034 18.0 4.7 45 124-168 19-64 (66)
46 PF15061 DUF4538: Domain of un 22.6 1.1E+02 0.0023 19.4 2.6 35 162-196 20-54 (58)
47 PRK06063 DNA polymerase III su 21.7 2.3E+02 0.0051 24.4 5.4 69 110-184 236-307 (313)
48 PF09851 SHOCT: Short C-termin 21.5 1.5E+02 0.0032 15.9 4.1 28 159-186 3-30 (31)
No 1
>TIGR02110 PQQ_syn_pqqF coenzyme PQQ biosynthesis probable peptidase PqqF. In a subset of species that make coenzyme PQQ (pyrrolo-quinoline-quinone), this probable peptidase is found in the PQQ biosynthesis region and is thought to act as a protease on PqqA (TIGR02107), a probable peptide precursor of the coenzyme. PQQ is required for some glucose dehydrogenases and alcohol dehydrogenases.
Probab=99.97 E-value=9.9e-29 Score=229.27 Aligned_cols=172 Identities=22% Similarity=0.280 Sum_probs=162.3
Q ss_pred EEEcCCCCEEEEeeCC-CCeEEEEEEEecCCCCCCCCCCcHHHHHHHHHhcccCCCCCHHHHHHHHHHhCCeeeeeeccc
Q psy8067 66 VTTLPNGLRVASENRF-GSFCTIGVIIDSGSRYEQPYPSGVSHFLEKLAFNSTSEYVNKDAIAQSLEKIGGICDCLSSRD 144 (239)
Q Consensus 66 ~~~L~NG~~v~~~~~~-~~~~~i~~~~~~Gs~~e~~~~~g~a~l~~~~~~~G~~~~~s~~~l~~~l~~~g~~~~~~~~~~ 144 (239)
.++|+||++|++.+.+ .+.+.+.+++++|+.+|+.+..|++|+++||+|+||.+++...++.+.++..|+++|++++.|
T Consensus 2 ~~tL~NGLrVllv~~p~~p~vav~l~v~aGS~~Ep~~~~GLAHfLEHMLFkGT~~~~~~~~i~~~le~lGG~lNA~Ts~d 81 (696)
T TIGR02110 2 RITLPNGLRVHLYHQPDAKRAAALLRVAAGSHDEPSAWPGLAHFLEHLLFLGGERFQGDDRLMPWVQRQGGQVNATTLER 81 (696)
T ss_pred eEEcCCCCEEEEEECCCCCEEEEEEEEeeccCCCCCCCCcHHHHHHHHHhcCCCCCCcHHHHHHHHHHhCCeEEEEEcCC
Confidence 4789999999998866 589999999999999999999999999999999999999433479999999999999999999
Q ss_pred eeEEEEEecCCCHHHHHHHHHHhhhCCCCCHHHHHHHHHHHHHHHHHHhcCCchHHHHHHHHHHHhcCCCCCCCCCCCCh
Q psy8067 145 TFIYAASAATSGLDTVMKILGDVVLRPKFSKEEINMARQTIQFELENYQMRPEQETLLMDMIHAAAYKDNTLGLPKLCPP 224 (239)
Q Consensus 145 ~~~~~~~~~~~~le~~l~ll~~~~~~P~f~~~~~~~~k~~~~~~l~~~~~~p~~~~~~~~~l~~~~~~~~p~~~~~~g~~ 224 (239)
++.|.+++++++++.+|+++.+++++|.|++++++++|+.++++++...++| .....+.+.+.+|++|||+++.+|+.
T Consensus 82 ~T~y~~~v~~~~l~~aL~lLaD~l~~P~f~eeeierEr~vvl~Ei~~~~ddp--~~~~~~~l~~~l~~~HPy~~~~iGt~ 159 (696)
T TIGR02110 82 TTAFFFELPAAALAAGLARLCDMLARPLLTAEDQQREREVLEAEYIAWQNDA--DTLREAALLDALQAGHPLRRFHAGSR 159 (696)
T ss_pred eEEEEEEecHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHHHHHHHHhcCH--HHHHHHHHHHHcCCCCCCCCCCCCCH
Confidence 9999999999999999999999999999999999999999999999998888 88899999999999999999999999
Q ss_pred hhhcCCC---HHHHhccC
Q psy8067 225 ASIGVIN---NNTLFTYL 239 (239)
Q Consensus 225 ~~l~~i~---~e~l~~~y 239 (239)
++|+.++ .+++++||
T Consensus 160 esL~~it~~t~edL~~F~ 177 (696)
T TIGR02110 160 DSLALPNTAFQQALRDFH 177 (696)
T ss_pred HHHhCcccchHHHHHHHH
Confidence 9999876 99999987
No 2
>KOG0960|consensus
Probab=99.96 E-value=1.7e-28 Score=207.36 Aligned_cols=175 Identities=35% Similarity=0.558 Sum_probs=168.9
Q ss_pred CCCcEEEEcCCCCEEEEeeCCCCeEEEEEEEecCCCCCCCCCCcHHHHHHHHHhcccCCCCCHHHHHHHHHHhCCeeeee
Q psy8067 61 DRNTQVTTLPNGLRVASENRFGSFCTIGVIIDSGSRYEQPYPSGVSHFLEKLAFNSTSEYVNKDAIAQSLEKIGGICDCL 140 (239)
Q Consensus 61 ~~~~~~~~L~NG~~v~~~~~~~~~~~i~~~~~~Gs~~e~~~~~g~a~l~~~~~~~G~~~~~s~~~l~~~l~~~g~~~~~~ 140 (239)
.|+++..+|+||++|.+..+.....++++++++||++|.+...|.+||++||.|+|+.+. +...|...++..|+.+|++
T Consensus 31 ~P~t~vttL~NGlrVaTE~~~a~TATVGVwidaGSR~EnekNNG~ahFLEhlaFKGT~~R-s~~alElEieniGahLNAy 109 (467)
T KOG0960|consen 31 VPETEVTTLPNGLRVATEHNSASTATVGVWIDAGSRFENEKNNGTAHFLEHLAFKGTKNR-SQAALELEIENIGAHLNAY 109 (467)
T ss_pred CCcceEEEcCCCcEEEeccCCCcceEEEEEeccCccccccccccHHHHHHHHHhcCCCcc-hhHHHHHHHHHHHHHhccc
Confidence 577899999999999999887799999999999999999999999999999999999996 8899999999999999999
Q ss_pred eccceeEEEEEecCCCHHHHHHHHHHhhhCCCCCHHHHHHHHHHHHHHHHHHhcCCchHHHHHHHHHHHhcCCCCCCCCC
Q psy8067 141 SSRDTFIYAASAATSGLDTVMKILGDVVLRPKFSKEEINMARQTIQFELENYQMRPEQETLLMDMIHAAAYKDNTLGLPK 220 (239)
Q Consensus 141 ~~~~~~~~~~~~~~~~le~~l~ll~~~~~~P~f~~~~~~~~k~~~~~~l~~~~~~p~~~~~~~~~l~~~~~~~~p~~~~~ 220 (239)
++++.+.|...+.+++++.++++|++++++..+.+.+++++|..+..|++..+.+- +.+.+|.||...|.++|++++.
T Consensus 110 tSReqT~yyakal~~dv~kavdiLaDIlqns~L~~s~IerER~vILrEmqevd~~~--~eVVfdhLHatafQgtPL~~ti 187 (467)
T KOG0960|consen 110 TSREQTVYYAKALSKDVPKAVDILADILQNSKLEESAIERERDVILREMQEVDKNH--QEVVFDHLHATAFQGTPLGRTI 187 (467)
T ss_pred ccccceeeehhhccccchHHHHHHHHHHHhCccchhHHHHHHHHHHHHHHHHHhhh--hHHHHHHHHHHHhcCCcccccc
Confidence 99999999999999999999999999999999999999999999999999998876 8899999999999999999999
Q ss_pred CCChhhhcCCCHHHHhcc
Q psy8067 221 LCPPASIGVINNNTLFTY 238 (239)
Q Consensus 221 ~g~~~~l~~i~~e~l~~~ 238 (239)
.|+.++|++|+.+||++|
T Consensus 188 lGp~enI~si~r~DL~~y 205 (467)
T KOG0960|consen 188 LGPSENIKSISRADLKDY 205 (467)
T ss_pred cChhhhhhhhhHHHHHHH
Confidence 999999999999999987
No 3
>COG0612 PqqL Predicted Zn-dependent peptidases [General function prediction only]
Probab=99.96 E-value=6.6e-28 Score=216.30 Aligned_cols=176 Identities=32% Similarity=0.500 Sum_probs=168.4
Q ss_pred CCCcEEEEcCCCCEEEEeeCC-CCeEEEEEEEecCCCCCCCCCCcHHHHHHHHHhcccCCCCCHHHHHHHHHHhCCeeee
Q psy8067 61 DRNTQVTTLPNGLRVASENRF-GSFCTIGVIIDSGSRYEQPYPSGVSHFLEKLAFNSTSEYVNKDAIAQSLEKIGGICDC 139 (239)
Q Consensus 61 ~~~~~~~~L~NG~~v~~~~~~-~~~~~i~~~~~~Gs~~e~~~~~g~a~l~~~~~~~G~~~~~s~~~l~~~l~~~g~~~~~ 139 (239)
..+++..+|+||+++++.+.+ .+.+.+.+++++|+..++.+..|++|+++||+++|+.++ ...++.+.++..|+..++
T Consensus 14 ~~~~~~~~L~nGl~~~~~~~~~~~~vs~~~~v~~Gs~~e~~~~~G~AH~lehm~fkgt~~~-~~~~i~~~~~~~G~~~na 92 (438)
T COG0612 14 LPGLQVFTLPNGLRVITYPNPTAPTVSLDVWVKAGSRAEPAGKAGIAHFLEHMAFKGTTGL-PSAELAEAFEKLGGQLNA 92 (438)
T ss_pred cccceEEEcCCCCEEEEEeCCCCCEEEEEEEEeecccCCCCCcccHHHHHHHHHccCCCCC-ChHHHHHHHHHhcCeeec
Confidence 345899999999999999877 699999999999999999999999999999999999887 445999999999999999
Q ss_pred eeccceeEEEEEecCCCHHHHHHHHHHhhhCCCCCHHHHHHHHHHHHHHHHHHhcCCchHHHHHHHHHHHhcCCCCCCCC
Q psy8067 140 LSSRDTFIYAASAATSGLDTVMKILGDVVLRPKFSKEEINMARQTIQFELENYQMRPEQETLLMDMIHAAAYKDNTLGLP 219 (239)
Q Consensus 140 ~~~~~~~~~~~~~~~~~le~~l~ll~~~~~~P~f~~~~~~~~k~~~~~~l~~~~~~p~~~~~~~~~l~~~~~~~~p~~~~ 219 (239)
+++.+++.|.+++.+++++.+++++++++.+|.|++++|+++|+.+.++++...++| ..++.+.+...+|++||++++
T Consensus 93 ~ts~d~t~y~~~~l~~~~~~~l~llad~l~~p~f~~~~~e~Ek~vil~ei~~~~d~p--~~~~~~~l~~~~~~~~p~~~~ 170 (438)
T COG0612 93 FTSFDYTVYYLSVLPDNLDKALDLLADILLNPTFDEEEVEREKGVILEEIRMRQDDP--DDLAFERLLEALYGNHPLGRP 170 (438)
T ss_pred cccchhhhhhhhhchhhhHHHHHHHHHHHhCCCCCHHHHHHHHHHHHHHHHhhccCc--hHHHHHHHHHHhhccCCCCCC
Confidence 999999999999999999999999999999999999999999999999999999998 899999999999999999999
Q ss_pred CCCChhhhcCCCHHHHhccC
Q psy8067 220 KLCPPASIGVINNNTLFTYL 239 (239)
Q Consensus 220 ~~g~~~~l~~i~~e~l~~~y 239 (239)
..|+.++|++++.++|++||
T Consensus 171 ~~G~~e~I~~it~~dl~~f~ 190 (438)
T COG0612 171 ILGTEESIEAITREDLKDFY 190 (438)
T ss_pred CCCCHHHHHhCCHHHHHHHH
Confidence 99999999999999999997
No 4
>PRK15101 protease3; Provisional
Probab=99.95 E-value=6.9e-27 Score=227.63 Aligned_cols=177 Identities=16% Similarity=0.208 Sum_probs=166.4
Q ss_pred CCCcEEEEcCCCCEEEEeeCC-CCeEEEEEEEecCCCCCCCCCCcHHHHHHHHHhcccCCCCCHHHHHHHHHHhCCeeee
Q psy8067 61 DRNTQVTTLPNGLRVASENRF-GSFCTIGVIIDSGSRYEQPYPSGVSHFLEKLAFNSTSEYVNKDAIAQSLEKIGGICDC 139 (239)
Q Consensus 61 ~~~~~~~~L~NG~~v~~~~~~-~~~~~i~~~~~~Gs~~e~~~~~g~a~l~~~~~~~G~~~~~s~~~l~~~l~~~g~~~~~ 139 (239)
...++.++|+||++|++.+.+ .+.+.+++.+++|+..|+.+..|++|+++||++.|+.+|++..++.+.++..|+++|+
T Consensus 41 ~~~~~~~~L~NGL~v~l~~~~~~~~~~~~l~v~~Gs~~ep~~~~GlAHflEHmlf~GT~~~p~~~~~~~~l~~~Gg~~NA 120 (961)
T PRK15101 41 PRQYQAIRLDNGMTVLLVSDPQAVKSLAALALPVGSLEDPDAQQGLAHYLEHMVLMGSKKYPQPDSLAEFLKKHGGSHNA 120 (961)
T ss_pred ccceEEEEeCCCCEEEEEeCCCCcceeEEEEeCcCCCCCCCCCCchHHHHHHHHhcCCccCCCcchHHHHHHHhCCCccc
Confidence 467788999999999998765 5899999999999999999999999999999999999996567999999999999999
Q ss_pred eeccceeEEEEEecCCCHHHHHHHHHHhhhCCCCCHHHHHHHHHHHHHHHHHHhcCCchHHHHHHHHHHHhcCCCCCCCC
Q psy8067 140 LSSRDTFIYAASAATSGLDTVMKILGDVVLRPKFSKEEINMARQTIQFELENYQMRPEQETLLMDMIHAAAYKDNTLGLP 219 (239)
Q Consensus 140 ~~~~~~~~~~~~~~~~~le~~l~ll~~~~~~P~f~~~~~~~~k~~~~~~l~~~~~~p~~~~~~~~~l~~~~~~~~p~~~~ 219 (239)
+++.+++.|.+++++++++.+|.++++++.+|.|++++++++|+.+.++++...++| ...+.+.+.+.+|++|||+++
T Consensus 121 ~T~~d~T~y~~~~~~~~l~~aL~~~ad~~~~P~f~~~~~erE~~~v~~E~~~~~~~~--~~~~~~~~~~~~~~~hp~~~~ 198 (961)
T PRK15101 121 STASYRTAFYLEVENDALPPAVDRLADAIAEPLLDPKNADRERNAVNAELTMARSRD--GMRMAQVSAETINPAHPGSRF 198 (961)
T ss_pred eECCCceEEEEEcCHHHHHHHHHHHHHHHhccCCCHHHHHHHHHHHHHHHHHhcCCH--HHHHHHHHHhhCCCCCCcccC
Confidence 999999999999999999999999999999999999999999999999999887777 788889999999999999999
Q ss_pred CCCChhhhcCC----CHHHHhccC
Q psy8067 220 KLCPPASIGVI----NNNTLFTYL 239 (239)
Q Consensus 220 ~~g~~~~l~~i----~~e~l~~~y 239 (239)
..|+.++|.++ +.++|++||
T Consensus 199 ~~G~~etl~~~~~~~~~~~L~~f~ 222 (961)
T PRK15101 199 SGGNLETLSDKPGSKLQDALVDFY 222 (961)
T ss_pred CCCCHHHhhcCCchHHHHHHHHHH
Confidence 99999999998 699999987
No 5
>KOG2067|consensus
Probab=99.94 E-value=1.6e-26 Score=195.63 Aligned_cols=176 Identities=56% Similarity=0.960 Sum_probs=169.2
Q ss_pred CCCCcEEEEcCCCCEEEEeeCCCCeEEEEEEEecCCCCCCCCCCcHHHHHHHHHhcccCCCCCHHHHHHHHHHhCCeeee
Q psy8067 60 NDRNTQVTTLPNGLRVASENRFGSFCTIGVIIDSGSRYEQPYPSGVSHFLEKLAFNSTSEYVNKDAIAQSLEKIGGICDC 139 (239)
Q Consensus 60 ~~~~~~~~~L~NG~~v~~~~~~~~~~~i~~~~~~Gs~~e~~~~~g~a~l~~~~~~~G~~~~~s~~~l~~~l~~~g~~~~~ 139 (239)
+...++..+|+||+||...+++++...++++++.|+++|.....|++|+++.++|+++.++++ .++.+.|+..|+.++|
T Consensus 21 ~~~~~kvttL~NGlkvase~~pg~f~~vGlyIdsGsrYE~~~~~GisH~lerLAF~ST~~~~~-~ei~~~LE~~GGn~~c 99 (472)
T KOG2067|consen 21 EPSNTKVTTLPNGLKVASENTPGQFCTVGLYIDSGSRYEAKYFSGISHFLERLAFKSTERFSS-KEILAELEKLGGNCDC 99 (472)
T ss_pred ccccceeeecCCccEEeccCCCCCceEEEEEEecCccccCcCcccHHHHHHHHhhccccCCcH-HHHHHHHHHhCCcccc
Confidence 456788899999999999999999999999999999999999999999999999999999954 5999999999999999
Q ss_pred eeccceeEEEEEecCCCHHHHHHHHHHhhhCCCCCHHHHHHHHHHHHHHHHHHhcCCchHHHHHHHHHHHhcCCCCCCCC
Q psy8067 140 LSSRDTFIYAASAATSGLDTVMKILGDVVLRPKFSKEEINMARQTIQFELENYQMRPEQETLLMDMIHAAAYKDNTLGLP 219 (239)
Q Consensus 140 ~~~~~~~~~~~~~~~~~le~~l~ll~~~~~~P~f~~~~~~~~k~~~~~~l~~~~~~p~~~~~~~~~l~~~~~~~~p~~~~ 219 (239)
++++|...|.+++.++++++++++|++.+.+|+|++++++.++..++-|++..-.+| +.++.+.+|.++|.+++.+.+
T Consensus 100 qsSRetm~Yaas~~~~~v~sm~~lLadtV~~P~~~d~ev~~~~~~v~~E~~el~~~P--e~lL~e~iH~Aay~~ntlg~p 177 (472)
T KOG2067|consen 100 QSSRETMMYAASADSDGVDSMVELLADTVLNPKFTDQEVEEARRAVKYEIEELWMRP--EPLLTEMIHSAAYSGNTLGLP 177 (472)
T ss_pred cccHhhhHHHHHhhhcccHHHHHHHHHHHhcccccHHHHHHHHHhhhheccccccCc--hhhHHHHHHHHHhccCccccc
Confidence 999999999999999999999999999999999999999999999999999999999 899999999999999999999
Q ss_pred CCCChhhhcCCCHHHHhcc
Q psy8067 220 KLCPPASIGVINNNTLFTY 238 (239)
Q Consensus 220 ~~g~~~~l~~i~~e~l~~~ 238 (239)
.+++.+.|++|+.+.+..|
T Consensus 178 l~cp~~~i~~I~~~~l~~y 196 (472)
T KOG2067|consen 178 LLCPEENIDKINREVLEEY 196 (472)
T ss_pred ccCChhhhhhhhHHHHHHH
Confidence 9999999999999999876
No 6
>PF00675 Peptidase_M16: Insulinase (Peptidase family M16) This is family M16 in the peptidase classification. ; InterPro: IPR011765 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. The majority of the sequences in this entry are metallopeptidases and non-peptidase homologs belong to MEROPS peptidase family M16 (clan ME), subfamilies M16A, M16B and M16C; they include: Insulinase, insulin-degrading enzyme (3.4.24.56 from EC) Mitochondrial processing peptidase alpha subunit, (Alpha-MPP, 3.4.24.64 from EC) Pitrlysin, Protease III precursor (3.4.24.55 from EC) Nardilysin, (3.4.24.61 from EC) Ubiquinol-cytochrome C reductase complex core protein I,mitochondrial precursor (1.10.2.2 from EC) Coenzyme PQQ synthesis protein F (3.4.99 from EC) These proteins do not share many regions of sequence similarity; the most noticeable is in the N-terminal section. This region includes a conserved histidine followed, two residues later by a glutamate and another histidine. In pitrilysin, it has been shown [] that this H-x-x-E-H motif is involved in enzymatic activity; the two histidines bind zinc and the glutamate is necessary for catalytic activity. The proteins classified as non-peptidase homologues either have been found experimentally to be without peptidase activity, or lack amino acid residues that are believed to be essential for the catalytic activity. ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 3P7L_A 3P7O_A 3TUV_A 3GO9_A 1BE3_B 1PP9_B 2A06_B 1SQB_B 1SQP_B 1L0N_B ....
Probab=99.93 E-value=5.9e-25 Score=169.26 Aligned_cols=147 Identities=30% Similarity=0.596 Sum_probs=138.7
Q ss_pred EEEEeeC-CCCeEEEEEEEecCCCCCCCCCCcHHHHHHHHHhcccCCCCCHHHHHHHHHHhCCeeeeeeccceeEEEEEe
Q psy8067 74 RVASENR-FGSFCTIGVIIDSGSRYEQPYPSGVSHFLEKLAFNSTSEYVNKDAIAQSLEKIGGICDCLSSRDTFIYAASA 152 (239)
Q Consensus 74 ~v~~~~~-~~~~~~i~~~~~~Gs~~e~~~~~g~a~l~~~~~~~G~~~~~s~~~l~~~l~~~g~~~~~~~~~~~~~~~~~~ 152 (239)
||++..+ ..+.+.+++++++|+.+|+.+..|++|++++|+++|+.++ +..++.+.++..|++++++++.+++.|++++
T Consensus 1 ~V~~~~~~~~~~~~~~l~~~~Gs~~e~~~~~G~a~ll~~l~~~gs~~~-~~~~l~~~l~~~G~~~~~~t~~d~t~~~~~~ 79 (149)
T PF00675_consen 1 KVVLVEDPGSPVVSVSLVFKAGSRYEPPGKPGLAHLLEHLLFRGSKKY-SSDELQEELESLGASFNASTSRDSTSYSASV 79 (149)
T ss_dssp EEEEEESTTSSEEEEEEEES-SGGGSCTTTTTHHHHHHHHTTSBBSSS-BHHHHHHHHHHTTCEEEEEEESSEEEEEEEE
T ss_pred CEEEEEcCCCCEEEEEEEEeeccCCCCCCCCchhhhhhhhcccccchh-hhhhhHHHhhhhccccceEecccceEEEEEE
Confidence 6788776 5799999999999999999999999999999999999998 6799999999999999999999999999999
Q ss_pred cCCCHHHHHHHHHHhhhCCCCCHHHHHHHHHHHHHHHHHHhcCCchHHHHHHHHHHHhcCCCCCCCCCCCC
Q psy8067 153 ATSGLDTVMKILGDVVLRPKFSKEEINMARQTIQFELENYQMRPEQETLLMDMIHAAAYKDNTLGLPKLCP 223 (239)
Q Consensus 153 ~~~~le~~l~ll~~~~~~P~f~~~~~~~~k~~~~~~l~~~~~~p~~~~~~~~~l~~~~~~~~p~~~~~~g~ 223 (239)
++++++.+|+++++++.+|.|++++|+++|..+..+++....+| ..++.+.+++.+|.++||+++..|+
T Consensus 80 ~~~~~~~~l~~l~~~~~~P~f~~~~~~~~r~~~~~ei~~~~~~~--~~~~~~~l~~~~f~~~p~~~~~~~~ 148 (149)
T PF00675_consen 80 LSEDLEKALELLADMLFNPSFDEEEFEREREQILQEIEEIKENP--QELAFEKLHSAAFRGHPYGNPLLGP 148 (149)
T ss_dssp EGGGHHHHHHHHHHHHHSBGGCHHHHHHHHHHHHHHHHHHTTHH--HHHHHHHHHHHHHTTSGGGSHSS-T
T ss_pred ecccchhHHHHHHHHHhCCCCCHHHHHHHHHHHHHHHHHHHCCH--HHHHHHHHHHHHhccCCCCCCCCCC
Confidence 99999999999999999999999999999999999999998887 8999999999999999999998875
No 7
>COG1025 Ptr Secreted/periplasmic Zn-dependent peptidases, insulinase-like [Posttranslational modification, protein turnover, chaperones]
Probab=99.87 E-value=2.4e-21 Score=180.31 Aligned_cols=179 Identities=18% Similarity=0.279 Sum_probs=164.7
Q ss_pred CCCCCcEEEEcCCCCEEEEeeCC-CCeEEEEEEEecCCCCCCCCCCcHHHHHHHHHhcccCCCCCHHHHHHHHHHhCCee
Q psy8067 59 DNDRNTQVTTLPNGLRVASENRF-GSFCTIGVIIDSGSRYEQPYPSGVSHFLEKLAFNSTSEYVNKDAIAQSLEKIGGIC 137 (239)
Q Consensus 59 ~~~~~~~~~~L~NG~~v~~~~~~-~~~~~i~~~~~~Gs~~e~~~~~g~a~l~~~~~~~G~~~~~s~~~l~~~l~~~g~~~ 137 (239)
..+..++..+|+||+++.+...+ .+....++.|+.|+..||.+.+|+||++|||+|.|+.+||....+..+|...|++.
T Consensus 19 ~d~r~y~~I~LpNGl~~LlisDP~a~ks~aAL~V~vGs~~DP~e~~GLAHflEHmlfmGseKYP~~~~f~~fLskhgGs~ 98 (937)
T COG1025 19 LDDRKYRAIKLPNGLRALLVSDPQADKSSAALVVPVGSFDDPEEYPGLAHFLEHMLFMGSEKYPDEGGFSEFLSKHGGSH 98 (937)
T ss_pred ccCcceeEEECCCCceEEEecCCCCCccceeEEeecCCCCChhhcccHHHHHHHHHHhcCccCCCccchHHHHHHcCCcc
Confidence 35678899999999999987654 68999999999999999999999999999999999999999999999999999999
Q ss_pred eeeeccceeEEEEEecCCCHHHHHHHHHHhhhCCCCCHHHHHHHHHHHHHHHHHHhcCCchHHHHHHHHHHHhcCCCCCC
Q psy8067 138 DCLSSRDTFIYAASAATSGLDTVMKILGDVVLRPKFSKEEINMARQTIQFELENYQMRPEQETLLMDMIHAAAYKDNTLG 217 (239)
Q Consensus 138 ~~~~~~~~~~~~~~~~~~~le~~l~ll~~~~~~P~f~~~~~~~~k~~~~~~l~~~~~~p~~~~~~~~~l~~~~~~~~p~~ 217 (239)
|++|..+.|.|.+....+.|+.++..++++|.+|.|+++..++++.++.+|......+- ...+.+.....+-++||++
T Consensus 99 NA~T~~~~T~fyFeV~~~al~~ALDrFa~ff~~PLf~~e~~dRE~~AV~sE~~~~~~~D--~~R~~~~~~~~~np~HP~s 176 (937)
T COG1025 99 NASTAGERTAFYFEVENDALEGALDRFADFFIEPLFNKEALDRERNAVNSEFTMNLTSD--GWRMYQVQALTANPGHPLS 176 (937)
T ss_pred ccccCCCceeEEEEecHHHHHHHHHHHHHHHhccccChHHHHHHHHHHHHHHhcCcCch--HHHHHHHHHhhcCCCCCcc
Confidence 99999999999999999999999999999999999999999999999999999877764 7778888888788899999
Q ss_pred CCCCCChhhhcC----CCHHHHhccC
Q psy8067 218 LPKLCPPASIGV----INNNTLFTYL 239 (239)
Q Consensus 218 ~~~~g~~~~l~~----i~~e~l~~~y 239 (239)
+...|..+++.. ...+++.+||
T Consensus 177 rFs~GN~~TL~~~p~~~v~~el~ef~ 202 (937)
T COG1025 177 KFSTGNLETLSDKPGLVVQQELKEFH 202 (937)
T ss_pred ccCCCChhhhccCCCchHHHHHHHHH
Confidence 999999999988 5567777775
No 8
>KOG0959|consensus
Probab=99.86 E-value=9.6e-21 Score=178.69 Aligned_cols=177 Identities=15% Similarity=0.186 Sum_probs=161.4
Q ss_pred CCCcEEEEcCCCCEEEEeeCC-CCeEEEEEEEecCCCCCCCCCCcHHHHHHHHHhcccCCCCCHHHHHHHHHHhCCeeee
Q psy8067 61 DRNTQVTTLPNGLRVASENRF-GSFCTIGVIIDSGSRYEQPYPSGVSHFLEKLAFNSTSEYVNKDAIAQSLEKIGGICDC 139 (239)
Q Consensus 61 ~~~~~~~~L~NG~~v~~~~~~-~~~~~i~~~~~~Gs~~e~~~~~g~a~l~~~~~~~G~~~~~s~~~l~~~l~~~g~~~~~ 139 (239)
...++..+|+||+++.+.+.+ .+..++++.|+.||..||.+.+|+||+++||+|.|+.+||+.....+.|++.|++-|+
T Consensus 25 ~r~yr~~~L~Ngl~alLisDp~tD~ssaal~V~vGS~~DP~dl~GLAHF~EHMlFmGS~KYP~En~y~~~lsk~gGssNA 104 (974)
T KOG0959|consen 25 TREYRGIELTNGLRALLISDPKTDKSSAALDVKVGSFSDPEDLQGLAHFCEHMLFMGSEKYPDENEYSKFLSKNGGSSNA 104 (974)
T ss_pred ccceeEEEecCCceEEEecCCCCCccceeeeeeccccCCccccccHHHHHHHHHhhccccCCCcchhHHHHHhcCCcccc
Confidence 457899999999999987544 4668888899999999999999999999999999999999999999999999999999
Q ss_pred eeccceeEEEEEecCCCHHHHHHHHHHhhhCCCCCHHHHHHHHHHHHHHHHHHhcCCchHHHHHHHHHHHhcCCCCCCCC
Q psy8067 140 LSSRDTFIYAASAATSGLDTVMKILGDVVLRPKFSKEEINMARQTIQFELENYQMRPEQETLLMDMIHAAAYKDNTLGLP 219 (239)
Q Consensus 140 ~~~~~~~~~~~~~~~~~le~~l~ll~~~~~~P~f~~~~~~~~k~~~~~~l~~~~~~p~~~~~~~~~l~~~~~~~~p~~~~ 219 (239)
+|..++|.|.+.+..+.|+.+|..++++|.+|.|.+++.++++.++..|.++..++- ...+.+......-++||+++.
T Consensus 105 ~T~~e~T~y~F~V~~~~l~~ALDrFaqFf~~Plf~~~a~eREv~AVdSE~~~nl~~D--~wr~~ql~~~l~~~~hp~~kF 182 (974)
T KOG0959|consen 105 YTDSEHTNYYFDVQHDHLEGALDRFAQFFSDPLFNKSATEREVGAVDSEHEKNLNSD--GWRFDQLLRSLSNPGHPYSKF 182 (974)
T ss_pred ccccccceEEEecchHHHHHHHHHHHHHhhCcccChHHHHHHHHHHHHHHHhccCcc--hhHHHHHHHHhcCCCCcchhc
Confidence 999999999999999999999999999999999999999999999999999887764 566666666666679999999
Q ss_pred CCCChhhhcCCC-----HHHHhccC
Q psy8067 220 KLCPPASIGVIN-----NNTLFTYL 239 (239)
Q Consensus 220 ~~g~~~~l~~i~-----~e~l~~~y 239 (239)
..|..+.|.... ++.|.+||
T Consensus 183 ~tGN~~tL~~~p~~~~~r~~L~kF~ 207 (974)
T KOG0959|consen 183 STGNKKTLLEGPREIDLRDELLKFY 207 (974)
T ss_pred cccchhhhhhccccchHHHHHHHHH
Confidence 999999999988 88888876
No 9
>KOG2583|consensus
Probab=99.79 E-value=4.2e-18 Score=144.85 Aligned_cols=168 Identities=27% Similarity=0.435 Sum_probs=154.7
Q ss_pred EEEEcCCCCEEEEeeCCCCeEEEEEEEecCCCCCCCCCCcHHHHHHHHHhcccCCCCCHHHHHHHHHHhCCeeeeeeccc
Q psy8067 65 QVTTLPNGLRVASENRFGSFCTIGVIIDSGSRYEQPYPSGVSHFLEKLAFNSTSEYVNKDAIAQSLEKIGGICDCLSSRD 144 (239)
Q Consensus 65 ~~~~L~NG~~v~~~~~~~~~~~i~~~~~~Gs~~e~~~~~g~a~l~~~~~~~G~~~~~s~~~l~~~l~~~g~~~~~~~~~~ 144 (239)
...+|.||++|.....+.++..+.+.+++||++++.++.|++|++....++.+.++ +...|.+..+..|+.++.++++|
T Consensus 24 ~~~kl~ngL~Vas~e~~~~is~l~l~~~AGSRYe~~~~~G~sHllr~f~g~~Tq~~-sal~ivr~se~~GG~Lss~~tRe 102 (429)
T KOG2583|consen 24 KTTKLVNGLTVASREAPTAISSLSLAFRAGSRYEPADQQGLSHLLRNFVGRDTQER-SALKIVRESEQLGGTLSSTATRE 102 (429)
T ss_pred hhhccccceEEEeccCCCcceEEEEEEecCccCCccccccHHHHHHHhcccCcccc-chhhhhhhhHhhCceeeeeeecc
Confidence 34589999999999999999999999999999999999999999999999999998 78999999999999999999999
Q ss_pred eeEEEEEecCCCHHHHHHHHHHhhhCCCCCHHHHHHHH-HHHHHHHHHHhcCCchHHHHHHHHHHHhcCCCCCCCCCCCC
Q psy8067 145 TFIYAASAATSGLDTVMKILGDVVLRPKFSKEEINMAR-QTIQFELENYQMRPEQETLLMDMIHAAAYKDNTLGLPKLCP 223 (239)
Q Consensus 145 ~~~~~~~~~~~~le~~l~ll~~~~~~P~f~~~~~~~~k-~~~~~~l~~~~~~p~~~~~~~~~l~~~~~~~~p~~~~~~g~ 223 (239)
..+++.++++++++..+.+|.+....|.|-+|+++... ..+..++. ..+| ...+.+.+|+++|.+ .++++.+.+
T Consensus 103 ~~~~tvt~lrd~~~~~l~~L~~V~~~paFkPwEl~D~~~~ti~~~l~--~~t~--~~~a~e~lH~aAfRn-gLgnslY~p 177 (429)
T KOG2583|consen 103 LIGLTVTFLRDDLEYYLSLLGDVLDAPAFKPWELEDVVLATIDADLA--YQTP--YTIAIEQLHAAAFRN-GLGNSLYSP 177 (429)
T ss_pred eEEEEEEEecccHHHHHHHHHHhhcccCcCchhhhhhhhhhhHHHhh--hcCh--HHHHHHHHHHHHHhc-ccCCcccCC
Confidence 99999999999999999999999999999999999998 66665554 4567 899999999999986 888888876
Q ss_pred hhhhcCCCHHHHhcc
Q psy8067 224 PASIGVINNNTLFTY 238 (239)
Q Consensus 224 ~~~l~~i~~e~l~~~ 238 (239)
--.+.+++.++|.+|
T Consensus 178 ~~~vg~vss~eL~~F 192 (429)
T KOG2583|consen 178 GYQVGSVSSSELKDF 192 (429)
T ss_pred cccccCccHHHHHHH
Confidence 678999999999887
No 10
>PTZ00432 falcilysin; Provisional
Probab=99.78 E-value=3.3e-18 Score=167.75 Aligned_cols=168 Identities=15% Similarity=0.195 Sum_probs=145.8
Q ss_pred cEEEEcCCCCEEEEeeCCC---CeEEEEEEEecCCCCCCCCCCcHHHHHHHHHhcccCCCCCHHHHHHHHHHhC--Ceee
Q psy8067 64 TQVTTLPNGLRVASENRFG---SFCTIGVIIDSGSRYEQPYPSGVSHFLEKLAFNSTSEYVNKDAIAQSLEKIG--GICD 138 (239)
Q Consensus 64 ~~~~~L~NG~~v~~~~~~~---~~~~i~~~~~~Gs~~e~~~~~g~a~l~~~~~~~G~~~~~s~~~l~~~l~~~g--~~~~ 138 (239)
+.....+||++|+..++++ +.+.++++++.|+ .+..|++|+++||+++|+.+| +..++...++..| +.+|
T Consensus 92 ~~~~H~~nGl~vl~~~~~d~~~~~~~f~i~f~T~~----~d~~G~aH~LEH~~f~GS~k~-p~~~~~~~l~~~gl~~~lN 166 (1119)
T PTZ00432 92 TVYSHKKTGLQVISLKTNDSSGKEMCFDFYVPTPP----HNDKGIPHILEHSVLSGSKKY-NYKDSFSLLVQGGFNSFLN 166 (1119)
T ss_pred EEEEEcCCCCEEEEEecCCCccceeEEEEEecCCC----CCCcchhHHHHHHHhCCCCCC-CcccHHHHHHhcCcCCCcc
Confidence 3444567999999877654 4788999999987 345799999999999999999 5568888887765 7899
Q ss_pred eeeccceeEEEEEecCC-CHHHHHHHHHHhhhCCCCCHHHH--H---------HH--------------------HHHHH
Q psy8067 139 CLSSRDTFIYAASAATS-GLDTVMKILGDVVLRPKFSKEEI--N---------MA--------------------RQTIQ 186 (239)
Q Consensus 139 ~~~~~~~~~~~~~~~~~-~le~~l~ll~~~~~~P~f~~~~~--~---------~~--------------------k~~~~ 186 (239)
++++.|++.|.+.+..+ ++..++.++.+.+.+|.|+++++ . .+ +..+.
T Consensus 167 A~T~~D~T~Y~~~~~~e~d~~~~ldv~~d~v~~P~~~~~~~~f~qEgwh~E~~~~~~~~~~~~e~~~~~~~~l~~kgVV~ 246 (1119)
T PTZ00432 167 AYTFKDRTSYLFASTNEKDFYNTADVYMDSVFQPNILEDKDIFKQEGWHYKVTKLKDDEKNADELGNVHDRHVSYSGIVY 246 (1119)
T ss_pred ccCCCCceEEEeccCCHHHHHHHHHHHHHHHhCcCcccccchhhhhhhhccccccccccccccccccccccccchhhHHH
Confidence 99999999999999875 79999999999999999988863 2 21 66788
Q ss_pred HHHHHHhcCCchHHHHHHHHHHHhcCCCCCCCCCCCChhhhcCCCHHHHhccC
Q psy8067 187 FELENYQMRPEQETLLMDMIHAAAYKDNTLGLPKLCPPASIGVINNNTLFTYL 239 (239)
Q Consensus 187 ~~l~~~~~~p~~~~~~~~~l~~~~~~~~p~~~~~~g~~~~l~~i~~e~l~~~y 239 (239)
+|++...++| ..++.+.+.+.+| +|||+++..|++++|..++++++++||
T Consensus 247 ~Emk~~~~~p--~~~~~~~~~~~lf-~~pY~~~~~G~~~~I~~lt~e~l~~Fh 296 (1119)
T PTZ00432 247 SEMKKRFSDP--LSFGYSVIYQNLF-SNVYKYDSGGDPKDIVELTYEELVEFY 296 (1119)
T ss_pred HHHHHhhCCH--HHHHHHHHHHHHh-CCCCCCCCCCChHhhccCCHHHHHHHH
Confidence 9999998888 8999999999999 999999999999999999999999997
No 11
>PRK15101 protease3; Provisional
Probab=99.36 E-value=2.5e-11 Score=118.89 Aligned_cols=165 Identities=12% Similarity=0.044 Sum_probs=135.1
Q ss_pred CcEEEEcCCCCEEEEeeCC----CCeEEEEEEEecCCCCCCCCCCcHHHHHHHHHhcccCCCCCHHHHHHHHHHhCCeee
Q psy8067 63 NTQVTTLPNGLRVASENRF----GSFCTIGVIIDSGSRYEQPYPSGVSHFLEKLAFNSTSEYVNKDAIAQSLEKIGGICD 138 (239)
Q Consensus 63 ~~~~~~L~NG~~v~~~~~~----~~~~~i~~~~~~Gs~~e~~~~~g~a~l~~~~~~~G~~~~~s~~~l~~~l~~~g~~~~ 138 (239)
..+.+.++||++|++++.+ .|.+.+.+.+.+|...++....+++.++..++.. +..++....+-.|.+++
T Consensus 522 ~p~~i~~~~g~~vw~~~d~~f~~~Pk~~i~~~~~~~~~~~~~~~~~l~~L~~~ll~~------~l~e~~y~a~~aG~~~~ 595 (961)
T PRK15101 522 HPELIVDEPGLRVVYMPSQYFADEPKADISLVLRNPKAMDSARNQVLFALNDYLAGL------ALDQLSNQASVGGISFS 595 (961)
T ss_pred CCeEEEcCCCeEEEEeCCCccccCCCEEEEEEEeCCCccCCHHHHHHHHHHHHHHHH------HHHHHhchHHhcCcEEE
Confidence 3588999999999998754 4899999999999988888788898888777621 33455555566688888
Q ss_pred eeeccceeEEEEEecCCCHHHHHHHHHHhhhCCCCCHHHHHHHHHHHHHHHHHHhcCCchHHHHHHHHHH-HhcCCCCCC
Q psy8067 139 CLSSRDTFIYAASAATSGLDTVMKILGDVVLRPKFSKEEINMARQTIQFELENYQMRPEQETLLMDMIHA-AAYKDNTLG 217 (239)
Q Consensus 139 ~~~~~~~~~~~~~~~~~~le~~l~ll~~~~~~P~f~~~~~~~~k~~~~~~l~~~~~~p~~~~~~~~~l~~-~~~~~~p~~ 217 (239)
.. +.+.+.+++++.+++++.+|+++.+.+.+|.|++++|+++|+.+.+++++...+. .+...+.. ..+.+|||+
T Consensus 596 ~~-~~~g~~i~v~g~s~~l~~ll~~l~d~l~~~~~~~~~fe~~k~~~~~~l~~~~~~~----~~~~~~~~~~~~~~~py~ 670 (961)
T PRK15101 596 TN-ANNGLMVNANGYTQRLPQLLQALLEGYFSFTPTEEQLAQAKSWYREQLDSAEKGK----AYEQAIMPAQMLSQVPYF 670 (961)
T ss_pred Ec-cCCCEEEEEEecChhHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHhhhcccC----cHHHHHHHHHHHhcCCCC
Confidence 88 7899999999999999999999999999999999999999999999998765432 24444432 356789998
Q ss_pred CCCCCChhhhcCCCHHHHhccC
Q psy8067 218 LPKLCPPASIGVINNNTLFTYL 239 (239)
Q Consensus 218 ~~~~g~~~~l~~i~~e~l~~~y 239 (239)
.+ .++.+.|++++.+++++||
T Consensus 671 ~~-~~~~~~l~~it~edl~~f~ 691 (961)
T PRK15101 671 ER-DERRKLLPSITLKDVLAYR 691 (961)
T ss_pred CH-HHHHHHHhcCCHHHHHHHH
Confidence 64 5688999999999999986
No 12
>COG1026 Predicted Zn-dependent peptidases, insulinase-like [General function prediction only]
Probab=99.29 E-value=1.5e-11 Score=116.05 Aligned_cols=166 Identities=17% Similarity=0.272 Sum_probs=142.2
Q ss_pred EcCCCCEEEEeeCCCCeEEEEEEEecCCCCCCCCCCcHHHHHHHHHhcccCCCCCHHHHHHHHHHh-CCeeeeeecccee
Q psy8067 68 TLPNGLRVASENRFGSFCTIGVIIDSGSRYEQPYPSGVSHFLEKLAFNSTSEYVNKDAIAQSLEKI-GGICDCLSSRDTF 146 (239)
Q Consensus 68 ~L~NG~~v~~~~~~~~~~~i~~~~~~Gs~~e~~~~~g~a~l~~~~~~~G~~~~~s~~~l~~~l~~~-g~~~~~~~~~~~~ 146 (239)
.-++|++++...++++....++.++. ++.+..|++|.++|+.++|+.+||=++-+.+.+.+- +.-+|+.|..|.|
T Consensus 26 H~~TGa~l~hi~~~d~~~vFsi~F~T----~p~dstGVaHiLEHtvlcGS~kYPvkdPF~~ml~rSLntF~NA~T~~D~T 101 (978)
T COG1026 26 HEKTGAELAHIKNEDPNNVFSIAFKT----EPHDSTGVAHILEHTVLCGSKKYPVKDPFFKMLKRSLNTFLNAFTFPDKT 101 (978)
T ss_pred eccCCceEEEecCCCcCceEEEEeec----CCCCCCCcchHHHHHhhhCCCCCCCCChHHHHHHHhHHHHHhhccCCCcc
Confidence 34589999987776666666666654 667778999999999999999998777777777554 5458999999999
Q ss_pred EEEEEecC-CCHHHHHHHHHHhhhCCCCCHHHHHHH--------------HHHHHHHHHHHhcCCchHHHHHHHHHHHhc
Q psy8067 147 IYAASAAT-SGLDTVMKILGDVVLRPKFSKEEINMA--------------RQTIQFELENYQMRPEQETLLMDMIHAAAY 211 (239)
Q Consensus 147 ~~~~~~~~-~~le~~l~ll~~~~~~P~f~~~~~~~~--------------k~~~~~~l~~~~~~p~~~~~~~~~l~~~~~ 211 (239)
.|-+++.. +|+-.++.+..|.+.+|.+.++.|.++ +..+-+|++....++ ..++++.+.+.+|
T Consensus 102 ~YP~sS~~~~Df~NLl~VYlDavf~PlL~~e~F~QEgwr~e~~~~~~l~~~GVVyNEMKGa~ss~--~~~~~~~~~~slf 179 (978)
T COG1026 102 VYPASSANEKDFYNLLSVYLDAVFHPLLTKESFLQEGWRIEFKDESNLKYKGVVYNEMKGAYSSG--ESVLSRAMQQSLF 179 (978)
T ss_pred eeeccccCcchHHHHHHHHHHhhhCcccchHHHhhhhhccccCCCccceeeeEEeehhcccccCc--hhHHHHHHHHhhC
Confidence 99988655 799999999999999999999999877 344556788888888 9999999999999
Q ss_pred CCCCCCCCCCCChhhhcCCCHHHHhccC
Q psy8067 212 KDNTLGLPKLCPPASIGVINNNTLFTYL 239 (239)
Q Consensus 212 ~~~p~~~~~~g~~~~l~~i~~e~l~~~y 239 (239)
++..|+....|.+..|..+++|++++||
T Consensus 180 p~~ty~~~SGG~P~~I~~LtyE~~r~FH 207 (978)
T COG1026 180 PGTTYGVNSGGDPKNIPDLTYEEFRAFH 207 (978)
T ss_pred CCccccccCCCCcccccccCHHHHHHHH
Confidence 9999999899999999999999999997
No 13
>KOG2019|consensus
Probab=98.88 E-value=2.6e-09 Score=97.22 Aligned_cols=164 Identities=16% Similarity=0.263 Sum_probs=138.0
Q ss_pred CCCCEEEEeeCCCCeEEEEEEEecCCCCCCCCCCcHHHHHHHHHhcccCCCCCHHHHHHHHHHh-CCeeeeeeccceeEE
Q psy8067 70 PNGLRVASENRFGSFCTIGVIIDSGSRYEQPYPSGVSHFLEKLAFNSTSEYVNKDAIAQSLEKI-GGICDCLSSRDTFIY 148 (239)
Q Consensus 70 ~NG~~v~~~~~~~~~~~i~~~~~~Gs~~e~~~~~g~a~l~~~~~~~G~~~~~s~~~l~~~l~~~-g~~~~~~~~~~~~~~ 148 (239)
.-|+++.-.+++++-...++.++ ..+.+..|+.|+++|..++|+.+||=++-+-+.|.+- ..-+|+++..|++.|
T Consensus 60 ~Tgae~lhl~reD~N~vFsI~Fr----Tpp~dstGiPHILEHtvLCGS~KYPvrdPFfkmLnrSLatFmNAfT~pD~T~y 135 (998)
T KOG2019|consen 60 KTGAEVLHLDREDENNVFSIVFR----TPPKDSTGIPHILEHTVLCGSRKYPVRDPFFKMLNRSLATFMNAFTAPDYTFY 135 (998)
T ss_pred CCCceeEeeccCCCCceeEEEee----cCCCccCCCchhhhhheeeccCcCcccChHHHHHHHHHHHHHhhccCCCccee
Confidence 57999988776644333344443 3667778999999999999999998888888888554 666899999999999
Q ss_pred EEEe-cCCCHHHHHHHHHHhhhCCCCCHHHHHHH------------------HHHHHHHHHHHhcCCchHHHHHHHHHHH
Q psy8067 149 AASA-ATSGLDTVMKILGDVVLRPKFSKEEINMA------------------RQTIQFELENYQMRPEQETLLMDMIHAA 209 (239)
Q Consensus 149 ~~~~-~~~~le~~l~ll~~~~~~P~f~~~~~~~~------------------k~~~~~~l~~~~~~p~~~~~~~~~l~~~ 209 (239)
-+++ .++|+.-+..+..+....|...+.+|.++ +..+-+|++....+| .+++...+.+.
T Consensus 136 PfattN~kDf~NL~dVYLDAtffPklr~~dF~QEGWr~Eh~dpsd~~SpivfkGVVfNEMKG~~S~~--~~if~~~~Qq~ 213 (998)
T KOG2019|consen 136 PFATTNTKDFYNLRDVYLDATFFPKLRKLDFQQEGWRLEHNDPSDPISPIVFKGVVFNEMKGQYSDP--DYIFGMLFQQA 213 (998)
T ss_pred ecccCChHHHHHHHHHhhhcccchHHHhhhhhhhcceeecCCCCCCcccceeeeeeeecccccccCh--hHHHHHHHHHh
Confidence 9875 55899999999999999999888888776 566777888888888 88999999999
Q ss_pred hcCCCCCCCCCCCChhhhcCCCHHHHhccC
Q psy8067 210 AYKDNTLGLPKLCPPASIGVINNNTLFTYL 239 (239)
Q Consensus 210 ~~~~~p~~~~~~g~~~~l~~i~~e~l~~~y 239 (239)
+|+++.||....|++-+|..++++++++||
T Consensus 214 L~p~~tYgv~SGGDPl~IpdLt~eelk~FH 243 (998)
T KOG2019|consen 214 LFPENTYGVNSGGDPLDIPDLTYEELKEFH 243 (998)
T ss_pred hCccccccccCCCCcccCccccHHHHHHHH
Confidence 999999999999999999999999999987
No 14
>KOG0961|consensus
Probab=98.88 E-value=7.8e-09 Score=94.09 Aligned_cols=162 Identities=17% Similarity=0.201 Sum_probs=130.7
Q ss_pred CCCEEEEeeCCCCeEEEEEEEecCCCCCCCCCCcHHHHHHHHHhcccCCCCCHHHHHHHHHHh-CCeeeeeeccceeEEE
Q psy8067 71 NGLRVASENRFGSFCTIGVIIDSGSRYEQPYPSGVSHFLEKLAFNSTSEYVNKDAIAQSLEKI-GGICDCLSSRDTFIYA 149 (239)
Q Consensus 71 NG~~v~~~~~~~~~~~i~~~~~~Gs~~e~~~~~g~a~l~~~~~~~G~~~~~s~~~l~~~l~~~-g~~~~~~~~~~~~~~~ 149 (239)
-|++|.....+...+.-.+.+.. |..+..|+.|-++|+.|+|+.+||=. -+-...++. =++.|+.++.|++.|.
T Consensus 28 Tkl~va~~~~pts~vhG~f~v~T----Ea~~d~G~PHTLEHL~FMGSKkYP~k-GvLd~~anr~l~dtNAwTDtD~T~Yt 102 (1022)
T KOG0961|consen 28 TKLRVAIGEVPTSMVHGAFSVVT----EADSDDGLPHTLEHLVFMGSKKYPFK-GVLDVIANRCLADTNAWTDTDHTAYT 102 (1022)
T ss_pred cceEEEEeecCCcceeeeEEeee----eecCCCCCchhHHHHhhhccccCCcc-cHHHHhhcchhcccccccccCcceEE
Confidence 46777776665555554444432 44445699999999999999999654 444444443 6679999999999999
Q ss_pred EEecC-CCHHHHHHHHHHhhhCCCCCHHHHHHHH----------HHHHHHHHHHhcCCchHHHHHHHHHHHhcC-CCCCC
Q psy8067 150 ASAAT-SGLDTVMKILGDVVLRPKFSKEEINMAR----------QTIQFELENYQMRPEQETLLMDMIHAAAYK-DNTLG 217 (239)
Q Consensus 150 ~~~~~-~~le~~l~ll~~~~~~P~f~~~~~~~~k----------~~~~~~l~~~~~~p~~~~~~~~~l~~~~~~-~~p~~ 217 (239)
+++.- +.+-.+|.++.+.+..|-+++++|..+. ..+-.++++.+... ..++.+..++.+|+ .++|.
T Consensus 103 LStag~dGFlklLPvy~dHiL~P~Ltdeaf~TEVyHI~geg~d~GVVySEMq~~es~~--~~im~~~~~~~~yP~~sgY~ 180 (1022)
T KOG0961|consen 103 LSTAGSDGFLKLLPVYIDHILTPMLTDEAFATEVYHITGEGNDAGVVYSEMQDHESEM--ESIMDRKTKEVIYPPFSGYA 180 (1022)
T ss_pred eecccccchHHHhHHHHHhhcCcccchhhhhhheeeecCCCCccceeehhhhhhhccc--chhhhhhhheeecCCCCCce
Confidence 99766 5588999999999999999999998874 56667888887776 78899999999998 78999
Q ss_pred CCCCCChhhhcCCCHHHHhccC
Q psy8067 218 LPKLCPPASIGVINNNTLFTYL 239 (239)
Q Consensus 218 ~~~~g~~~~l~~i~~e~l~~~y 239 (239)
....|-+.++.+++.|.+++||
T Consensus 181 ~eTGG~~knLR~lt~ekIR~yH 202 (1022)
T KOG0961|consen 181 VETGGRLKNLRELTLEKIRDYH 202 (1022)
T ss_pred eccCCChhhHHHhhHHHHHHHH
Confidence 8888999999999999999986
No 15
>PF08367 M16C_assoc: Peptidase M16C associated; InterPro: IPR013578 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This domain appears in eukaryotes as well as bacteria and tends to be found near the C terminus of metalloproteases and related sequences belonging to MEROPS peptidase family M16 (subfamily M16C, clan ME). These include: eupitrilysin, falcilysin, PreP peptidase, CYM1 peptidase and subfamily M16C non-peptidase homologues.; GO: 0008237 metallopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis; PDB: 2FGE_B 3S5I_A 3S5H_A 3S5M_A 3S5K_A.
Probab=98.12 E-value=0.00014 Score=60.63 Aligned_cols=118 Identities=11% Similarity=0.179 Sum_probs=86.6
Q ss_pred CCCCEEEEeeCC-CCeEEEEEEEecCCCCCCCCCCcHHHHHHHHHhcccCCCCCHHHHHHHHHHhCCeeeeeecc-----
Q psy8067 70 PNGLRVASENRF-GSFCTIGVIIDSGSRYEQPYPSGVSHFLEKLAFNSTSEYVNKDAIAQSLEKIGGICDCLSSR----- 143 (239)
Q Consensus 70 ~NG~~v~~~~~~-~~~~~i~~~~~~Gs~~e~~~~~g~a~l~~~~~~~G~~~~~s~~~l~~~l~~~g~~~~~~~~~----- 143 (239)
.+|++|+..+.+ ++++.+.+.+..+... .++.+-+..+...+...|++++ +..++...+....+.+++.+.-
T Consensus 76 ~~~~~v~~~~~~TnGI~Y~~l~fdl~~l~-~e~l~yl~Ll~~ll~~lgT~~~-sy~el~~~i~~~tGGis~~~~~~~~~~ 153 (248)
T PF08367_consen 76 LGGIPVLFHEQPTNGIVYVRLYFDLSDLP-EEDLPYLPLLTDLLGELGTKNY-SYEELSNEIDLYTGGISFSIEVYTDYD 153 (248)
T ss_dssp CTTCEEEEEE---TTEEEEEEEEE-TTS--CCCHCCHHHHHHHCCCS-BSSS--HHHHHHHHHHHSSEEEEEEEEEEEEC
T ss_pred cCCccEEEEEcCCCCeEEEEEEecCCCCC-HHHHHhHHHHHHHHHhCCCCCC-CHHHHHHHHHHhCCCeEEEeeeccCCC
Confidence 379999987754 7899999999987543 3456788888888888899998 8999999999997767666631
Q ss_pred ------ceeEEEEEecCCCHHHHHHHHHHhhhCCCCCHHH-HHHHHHHHHHHH
Q psy8067 144 ------DTFIYAASAATSGLDTVMKILGDVVLRPKFSKEE-INMARQTIQFEL 189 (239)
Q Consensus 144 ------~~~~~~~~~~~~~le~~l~ll~~~~~~P~f~~~~-~~~~k~~~~~~l 189 (239)
-...+++.+..++++.+++++.+++++|+|+... +.....+.+..+
T Consensus 154 ~~~~~~~~l~is~k~L~~~~~~~~~ll~eil~~~~f~d~~rl~~ll~~~~s~~ 206 (248)
T PF08367_consen 154 DDDKYRPYLVISAKCLDEKLDEAFELLSEILTETDFDDKERLKELLKELKSDM 206 (248)
T ss_dssp TECCCEEEEEEEEEEEGGGHHHHHHHHHHHHHCB-TT-HHHHHHHHHHHHHHH
T ss_pred CccceeEEEEEEEEeHhhhHHHHHHHHHHHHhccCCCcHHHHHHHHHHHHHHH
Confidence 2567888999999999999999999999997774 333344444333
No 16
>COG1026 Predicted Zn-dependent peptidases, insulinase-like [General function prediction only]
Probab=97.25 E-value=0.019 Score=55.71 Aligned_cols=127 Identities=13% Similarity=0.178 Sum_probs=100.5
Q ss_pred EEcCCCCEEEEeeC-CCCeEEEEEEEecCCCCCCCCCCcHHHHHHHHHhcccCCCCCHHHHHHHHHHhCCeeeeeec--c
Q psy8067 67 TTLPNGLRVASENR-FGSFCTIGVIIDSGSRYEQPYPSGVSHFLEKLAFNSTSEYVNKDAIAQSLEKIGGICDCLSS--R 143 (239)
Q Consensus 67 ~~L~NG~~v~~~~~-~~~~~~i~~~~~~Gs~~e~~~~~g~a~l~~~~~~~G~~~~~s~~~l~~~l~~~g~~~~~~~~--~ 143 (239)
..-.|.++|+..+. +++++.+.+++.. +..+..+.+-++.+...+...|+.++ +..++.+.++...+.++...+ .
T Consensus 530 ~~~~~~~~v~~~~~~tn~i~yl~~~~~~-~~l~~~llpyL~L~~~~l~~lgt~~~-~y~e~~~~i~~~TGgis~~~~~~~ 607 (978)
T COG1026 530 TEVSNEAKVLHHDLFTNGITYLRLYFDL-DMLPSELLPYLPLFAFALTNLGTETY-SYKELLNQIERHTGGISVSLSVDT 607 (978)
T ss_pred eeccCCcceEEeecCCCCeEEEEEEeec-CCCChhhhhhHHHHHHHHHhcCCCCc-CHHHHHHHHHHHhCCceeeEeecc
Confidence 45578888888774 4789999999999 44556667889999999999999998 888999988888444444333 2
Q ss_pred ---------ceeEEEEEecCCCHHHHHHHHHHhhhCCCC-CHHHHHHHHHHHHHHHHHHhcC
Q psy8067 144 ---------DTFIYAASAATSGLDTVMKILGDVVLRPKF-SKEEINMARQTIQFELENYQMR 195 (239)
Q Consensus 144 ---------~~~~~~~~~~~~~le~~l~ll~~~~~~P~f-~~~~~~~~k~~~~~~l~~~~~~ 195 (239)
....+++.+..+..+.+++++.+++.++.| |.+.+.....++.+.+.....+
T Consensus 608 ~~~~~~~~~~~~~i~~K~l~~k~~~~~~~i~~~l~~~~F~D~~Rlkell~q~~~~l~~~vr~ 669 (978)
T COG1026 608 DPGDDGEYRPSFSISGKALRSKVEKLFELIREILANTDFHDRERLKELLEQYLSDLTSSVRN 669 (978)
T ss_pred CCCccccccceEEEEEEehhhhhhHHHHHHHHHHhcCCcCcHHHHHHHHHHHHhhhHHhhhc
Confidence 356788899999999999999999999999 7777777777777776665544
No 17
>PTZ00432 falcilysin; Provisional
Probab=97.14 E-value=0.016 Score=58.18 Aligned_cols=125 Identities=10% Similarity=-0.021 Sum_probs=95.3
Q ss_pred EcCCCCEEEEeeCC-CCeEEEEEEEecCCCCCCCCCCcHHHHHHHHHhcccCCCCCHHHHHHHHHHhCCeeeeeec----
Q psy8067 68 TLPNGLRVASENRF-GSFCTIGVIIDSGSRYEQPYPSGVSHFLEKLAFNSTSEYVNKDAIAQSLEKIGGICDCLSS---- 142 (239)
Q Consensus 68 ~L~NG~~v~~~~~~-~~~~~i~~~~~~Gs~~e~~~~~g~a~l~~~~~~~G~~~~~s~~~l~~~l~~~g~~~~~~~~---- 142 (239)
...+|++|...+.+ ++++.+.+.+.-..+.+ ++.+-+..+...+...|+.++ +..++...+....+.+++.+.
T Consensus 664 ~~~~~~~~~~~~~~TnGi~y~~~~fdl~~l~~-e~~~yl~L~~~~l~~~gT~~~-s~~el~~~i~~~tGg~~~~~~~~~~ 741 (1119)
T PTZ00432 664 SDGGSVTVLVHPIESRGILYLDFAFSLDSLTV-DELKYLNLFKALLKENGTDKL-SSEEFTYKREKNLGGLSASTAFYSE 741 (1119)
T ss_pred ccCCCcceEEEecCCCCeEEEEEEecCCCCCH-HHHhhHHHHHHHHHhcCCCCC-CHHHHHHHHHHhCCCeEEEEEEecc
Confidence 34578999887654 57999999998766433 345567777677777799998 899999999999666665421
Q ss_pred ------------cceeEEEEEecCCCHHHHHHHHHHhhhCCCCCHHH-HHHHHHHHHHHHHHHhc
Q psy8067 143 ------------RDTFIYAASAATSGLDTVMKILGDVVLRPKFSKEE-INMARQTIQFELENYQM 194 (239)
Q Consensus 143 ------------~~~~~~~~~~~~~~le~~l~ll~~~~~~P~f~~~~-~~~~k~~~~~~l~~~~~ 194 (239)
.....+++.+..++++.+++++.+++.++.|+... +....++.+..+++...
T Consensus 742 ~~~~~~~~~~~~~~~~~v~~k~l~~~~~~~~~l~~eil~~~~f~d~~rl~~il~~~~~~~~~~~~ 806 (1119)
T PTZ00432 742 TNNLTYDDPYNGVGYLNVRAKVLKHKVNEMVDIVLEALKDADFSNSKKGVEILKRKINGMKTVFS 806 (1119)
T ss_pred ccccccCcccccceEEEEEEEEhhhhHHHHHHHHHHHHhcCCCCcHHHHHHHHHHHHHHHHHhhh
Confidence 22578889999999999999999999999998765 66666666666665443
No 18
>KOG2019|consensus
Probab=97.08 E-value=0.015 Score=54.25 Aligned_cols=128 Identities=16% Similarity=0.186 Sum_probs=98.4
Q ss_pred EEEcCCCCEEEEeeCC-CCeEEEEEEEecCCCCCCCCCCcHHHHHHHHHhcccCCCCCHHHHHHHHHHh--CCeeeeeec
Q psy8067 66 VTTLPNGLRVASENRF-GSFCTIGVIIDSGSRYEQPYPSGVSHFLEKLAFNSTSEYVNKDAIAQSLEKI--GGICDCLSS 142 (239)
Q Consensus 66 ~~~L~NG~~v~~~~~~-~~~~~i~~~~~~Gs~~e~~~~~g~a~l~~~~~~~G~~~~~s~~~l~~~l~~~--g~~~~~~~~ 142 (239)
....-||++|-..... ++++.+.+.+.-++.-+. -.+-+..+...++..|++.. +-.++.+.+..+ |.++++.+.
T Consensus 563 ~v~dingvkv~~~dl~tngi~Y~r~~~~l~~~p~e-L~PylPlfc~sll~lGt~~l-sf~el~qqI~rkTGGiS~~p~~~ 640 (998)
T KOG2019|consen 563 EVGDINGVKVQRCDLFTNGITYTRVVFDLNSLPEE-LLPYLPLFCQSLLNLGTGDL-SFVELEQQIGRKTGGISVSPLVS 640 (998)
T ss_pred eeeeccCceeEEeeccCCceEEEEEeeccccCcHH-hhcchHHHHHHHHhcCCCcc-cHHHHHHHhhhhcCceeecceec
Confidence 3566799999887755 679999999988884433 35678899999999999997 899999999888 455555554
Q ss_pred cc--------eeEEEEEecCCCHHHHHHHHHHhhhCCCCC-HHHHHHHHHHHHHHHHHHhcC
Q psy8067 143 RD--------TFIYAASAATSGLDTVMKILGDVVLRPKFS-KEEINMARQTIQFELENYQMR 195 (239)
Q Consensus 143 ~~--------~~~~~~~~~~~~le~~l~ll~~~~~~P~f~-~~~~~~~k~~~~~~l~~~~~~ 195 (239)
.+ ...+++.+...+.+.+|+|+...+.++.|. ++.|+...+...+++.+.-.+
T Consensus 641 s~~~~d~p~~~i~~~~~~l~rn~~dlfel~n~il~e~~f~n~dkfkvlvk~s~s~~~n~i~d 702 (998)
T KOG2019|consen 641 SDDGMDEPELGIVFSGSMLDRNADDLFELWNKILQETCFTNQDKFKVLVKQSASRMTNGIAD 702 (998)
T ss_pred cCCCCCccceeEEechhhhcCChhHHHHHHHHHhcccCcccHHHHHHHHHHHHHHhhccCCc
Confidence 32 356777888899999999999999999995 455777666666666655443
No 19
>KOG0959|consensus
Probab=96.32 E-value=0.17 Score=49.62 Aligned_cols=158 Identities=9% Similarity=0.008 Sum_probs=112.1
Q ss_pred CCCEEEEeeCC---CCeEEEEEEEecCCCCCCCCCCcHHHHHHHHHhcccCCCCCHHHHHHHHHHhCCeeeeeeccceeE
Q psy8067 71 NGLRVASENRF---GSFCTIGVIIDSGSRYEQPYPSGVSHFLEKLAFNSTSEYVNKDAIAQSLEKIGGICDCLSSRDTFI 147 (239)
Q Consensus 71 NG~~v~~~~~~---~~~~~i~~~~~~Gs~~e~~~~~g~a~l~~~~~~~G~~~~~s~~~l~~~l~~~g~~~~~~~~~~~~~ 147 (239)
-..++.+++.+ .|...+.+.+..--....+....++.+...++..-... ..--....|..++...+.....
T Consensus 517 ~~~~lw~k~dd~f~~Pka~~~~~~~~p~~~~~~~~~~l~~l~~~~l~d~l~E------~~Y~A~~aGl~~~~~~s~~G~~ 590 (974)
T KOG0959|consen 517 PFSELWYKQDDKFNVPKAYTKFDFICPGATQSPLNSVLSTLYVRLLKDQLNE------YLYPALLAGLTYSLSSSSKGVE 590 (974)
T ss_pred CcceeEEecccccccchhheeeeecCcccccCHHHHHHHHHHHHHHHHHHhH------HHHHHHhccceEEeeecCCceE
Confidence 45577776533 36666777666554555554556666666665433222 2233455588888888899999
Q ss_pred EEEEecCCCHHHHHHHHHHhhhCCCCCHHHHHHHHHHHHHHHHHHhcCCchHHHHHHHHHHHhcCCCCCCCCCCCChhhh
Q psy8067 148 YAASAATSGLDTVMKILGDVVLRPKFSKEEINMARQTIQFELENYQMRPEQETLLMDMIHAAAYKDNTLGLPKLCPPASI 227 (239)
Q Consensus 148 ~~~~~~~~~le~~l~ll~~~~~~P~f~~~~~~~~k~~~~~~l~~~~~~p~~~~~~~~~l~~~~~~~~p~~~~~~g~~~~l 227 (239)
+++.+-.+.+..+++.+.+++..-..+++.|+..++.+..++++...++ -......+...+..+..|...- -.+.+
T Consensus 591 ~~v~Gfnekl~~ll~~~~~~~~~f~~~~~rf~iike~~~~~~~n~~~~~--p~~~a~~~~~lll~~~~W~~~e--~~~al 666 (974)
T KOG0959|consen 591 LRVSGFNEKLPLLLEKVVQMMANFELDEDRFEIIKELLKRELRNHAFDN--PYQLANDYLLLLLEESIWSKEE--LLEAL 666 (974)
T ss_pred EEEeccCcccHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHhhhhhcc--HHHHHHHHHHHHhhccccchHH--HHHHh
Confidence 9999999999999999999998888999999999999999999954443 3455555556666655555432 35677
Q ss_pred cCCCHHHHhcc
Q psy8067 228 GVINNNTLFTY 238 (239)
Q Consensus 228 ~~i~~e~l~~~ 238 (239)
+.++++++..|
T Consensus 667 ~~~~le~~~~F 677 (974)
T KOG0959|consen 667 DDVTLEDLESF 677 (974)
T ss_pred hcccHHHHHHH
Confidence 78888888765
No 20
>COG1025 Ptr Secreted/periplasmic Zn-dependent peptidases, insulinase-like [Posttranslational modification, protein turnover, chaperones]
Probab=96.09 E-value=0.31 Score=47.44 Aligned_cols=161 Identities=11% Similarity=0.072 Sum_probs=107.0
Q ss_pred EcCCCCEEEEeeCC---CCeEEEEEEEecCCCCCCCCCCcHHHHHHHHHhcccCCCCCHHHHHHHHHHhCCeeeeeeccc
Q psy8067 68 TLPNGLRVASENRF---GSFCTIGVIIDSGSRYEQPYPSGVSHFLEKLAFNSTSEYVNKDAIAQSLEKIGGICDCLSSRD 144 (239)
Q Consensus 68 ~L~NG~~v~~~~~~---~~~~~i~~~~~~Gs~~e~~~~~g~a~l~~~~~~~G~~~~~s~~~l~~~l~~~g~~~~~~~~~~ 144 (239)
-.+|+..++.+... .|...+.+.+.+--....+...-+..+...++..-..++ .--...-|.+++...+.+
T Consensus 507 ~~~~~~~wy~~~d~F~~~PK~~v~~~irsp~~~~s~r~~Vl~~l~~~la~dal~~~------~y~A~~aG~sfs~~~~~~ 580 (937)
T COG1025 507 EDPNLRLWYLKEDYFAVEPKASVSLAIRSPHASRSPRNQVLTELYAYLANDALDKL------SYQASLAGLSFSLAANSN 580 (937)
T ss_pred cCCCceEEEecCCccccCCcceeEEEEeCcccccCHHHHHHHHHHHHHHHHHHHhh------hhHHHhcceEEEeecCCC
Confidence 44555555555432 377777777775444333322223333344433222222 223344488888888889
Q ss_pred eeEEEEEecCCCHHHHHHHHHHhhhCCCCCHHHHHHHHHHHHHHHHHHhcCCchHHHHHHHHHHHhcCCCCCCCCCCCCh
Q psy8067 145 TFIYAASAATSGLDTVMKILGDVVLRPKFSKEEINMARQTIQFELENYQMRPEQETLLMDMIHAAAYKDNTLGLPKLCPP 224 (239)
Q Consensus 145 ~~~~~~~~~~~~le~~l~ll~~~~~~P~f~~~~~~~~k~~~~~~l~~~~~~p~~~~~~~~~l~~~~~~~~p~~~~~~g~~ 224 (239)
...+++++.++.++.++..+.+.+..-.++++.|+..|+.+.+++++..... +-..+.+.+...+-.. ..+..-..
T Consensus 581 Gl~ltisGft~~lp~L~~~~l~~l~~~~~~~~~f~~~K~~~~~~~~~a~~~~-p~~~~~~~l~~l~~~~---~~s~~e~~ 656 (937)
T COG1025 581 GLDLTISGFTQRLPQLLRAFLDGLFSLPVDEDRFEQAKSQLSEELKNALTGK-PYRQALDGLTGLLQVP---YWSREERR 656 (937)
T ss_pred ceEEEeeccccchHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHhhhhcC-CHHHHHHHhhhhhCCC---CcCHHHHH
Confidence 9999999999999999999999999988999999999999999999876542 2555666665555432 22221235
Q ss_pred hhhcCCCHHHHhcc
Q psy8067 225 ASIGVINNNTLFTY 238 (239)
Q Consensus 225 ~~l~~i~~e~l~~~ 238 (239)
+.++.++.+++..|
T Consensus 657 ~~l~~v~~~e~~~f 670 (937)
T COG1025 657 NALESVSVEEFAAF 670 (937)
T ss_pred HHhhhccHHHHHHH
Confidence 67888888888765
No 21
>PF03410 Peptidase_M44: Protein G1; InterPro: IPR005072 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M44 (clan ME). The active site residues for members of this family and family M16 occur in the motif HXXEHProtein. The type example is the vaccinia virus-type metalloendopeptidase G1 from vaccinia virus, it is a metalloendopeptidase expressed by many Poxviridae which appears to play a role in the maturation of viral proteins.; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0019067 viral assembly, maturation, egress, and release
Probab=95.19 E-value=0.038 Score=49.46 Aligned_cols=104 Identities=14% Similarity=0.265 Sum_probs=66.5
Q ss_pred EEcCCCCEEEEeeCCCCeEEEEEEEecCCCCCCCCCCcHHHHHHHHHhcccCCCCCHHHHHHHHHHhCCeeeeeecccee
Q psy8067 67 TTLPNGLRVASENRFGSFCTIGVIIDSGSRYEQPYPSGVSHFLEKLAFNSTSEYVNKDAIAQSLEKIGGICDCLSSRDTF 146 (239)
Q Consensus 67 ~~L~NG~~v~~~~~~~~~~~i~~~~~~Gs~~e~~~~~g~a~l~~~~~~~G~~~~~s~~~l~~~l~~~g~~~~~~~~~~~~ 146 (239)
..|+||+||.+.+.-..-+.+++. +.|--.|-.+.-|+|||+||++.+ | +. .++ -.|+++++.+.
T Consensus 2 IvL~NGVRiFin~~M~KDIYlGIs-~FGFe~DI~~iLGiAHLLEHILIs----F-D~---~~F------~ANASTaRsYM 66 (590)
T PF03410_consen 2 IVLSNGVRIFINPSMKKDIYLGIS-NFGFENDIGEILGIAHLLEHILIS----F-DS---SKF------LANASTARSYM 66 (590)
T ss_pred eEecCceEEEecCccccceEEeec-ccccccchHHHHhHHHHHHHHeee----c-ch---HHh------hcccchhhhhh
Confidence 479999999998866556666664 233333446678999999999763 3 22 222 24678888888
Q ss_pred EEEEEecCCC-HHHHHHHHHHhhhCC-----CCCHHHHHHHHHHH
Q psy8067 147 IYAASAATSG-LDTVMKILGDVVLRP-----KFSKEEINMARQTI 185 (239)
Q Consensus 147 ~~~~~~~~~~-le~~l~ll~~~~~~P-----~f~~~~~~~~k~~~ 185 (239)
+|...+.... -..++.-+..+|.+- .|+...++...+++
T Consensus 67 SFWC~si~g~~~~DAvrtliSWFF~~g~Lk~~F~~~~i~~hikEL 111 (590)
T PF03410_consen 67 SFWCKSIRGRTYIDAVRTLISWFFDNGKLKDNFSRSKIKNHIKEL 111 (590)
T ss_pred hhhhhhccCCChhHHHHHHHHHhhcCCcccccccHhHHHHHHHHH
Confidence 8888776643 344555565665442 35555554444444
No 22
>KOG2067|consensus
Probab=94.89 E-value=0.46 Score=41.90 Aligned_cols=147 Identities=11% Similarity=0.098 Sum_probs=94.7
Q ss_pred CeEEEEEEEecCCCCCCCCCCcHHHHHHHHHhcccCCCCCHH--------HH-------HHHHHHhCCeeeeeeccceeE
Q psy8067 83 SFCTIGVIIDSGSRYEQPYPSGVSHFLEKLAFNSTSEYVNKD--------AI-------AQSLEKIGGICDCLSSRDTFI 147 (239)
Q Consensus 83 ~~~~i~~~~~~Gs~~e~~~~~g~a~l~~~~~~~G~~~~~s~~--------~l-------~~~l~~~g~~~~~~~~~~~~~ 147 (239)
+...+.+.+.+-+..++ | -++..+-.|+..|.|.| |+. .| ..+++...+--..+.+..-.+
T Consensus 263 EltHv~lg~Eg~~~~de-D--~v~~avLq~lmGGGGSF-SAGGPGKGMySrLY~~vLNry~wv~sctAfnhsy~DtGlfg 338 (472)
T KOG2067|consen 263 ELTHVVLGFEGCSWNDE-D--FVALAVLQMLMGGGGSF-SAGGPGKGMYSRLYLNVLNRYHWVYSCTAFNHSYSDTGLFG 338 (472)
T ss_pred ceeeeeEeeccCCCCCh-h--HHHHHHHHHHhcCCccc-CCCCCCcchHHHHHHHHHhhhHHHHHhhhhhccccCCceeE
Confidence 45556666666666555 2 25555556677777776 421 11 134444455556666667889
Q ss_pred EEEEecCCCHHHHHHHHHHhhhC--CCCCHHHHHHHHHHHHHHHHH-HhcCCchHHHHHHHHHHHhcCCCCCCCCCCCCh
Q psy8067 148 YAASAATSGLDTVMKILGDVVLR--PKFSKEEINMARQTIQFELEN-YQMRPEQETLLMDMIHAAAYKDNTLGLPKLCPP 224 (239)
Q Consensus 148 ~~~~~~~~~le~~l~ll~~~~~~--P~f~~~~~~~~k~~~~~~l~~-~~~~p~~~~~~~~~l~~~~~~~~p~~~~~~g~~ 224 (239)
+.++++++....+++++..-+.+ -..+++++++.|.++++.+-. ++..| -.+.|.-++.+-.+. ..++.--.
T Consensus 339 i~~s~~P~~a~~aveli~~e~~~~~~~v~~~el~RAK~qlkS~LlMNLESR~---V~~EDvGRQVL~~g~--rk~p~e~~ 413 (472)
T KOG2067|consen 339 IYASAPPQAANDAVELIAKEMINMAGGVTQEELERAKTQLKSMLLMNLESRP---VAFEDVGRQVLTTGE--RKPPDEFI 413 (472)
T ss_pred EeccCCHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHhcccccc---hhHHHHhHHHHhccC--cCCHHHHH
Confidence 99999999999999999765533 448999999999999987754 33443 345555555544322 23332235
Q ss_pred hhhcCCCHHHHhcc
Q psy8067 225 ASIGVINNNTLFTY 238 (239)
Q Consensus 225 ~~l~~i~~e~l~~~ 238 (239)
+.|++++.+|+.++
T Consensus 414 ~~Ie~lt~~DI~rv 427 (472)
T KOG2067|consen 414 KKIEQLTPSDISRV 427 (472)
T ss_pred HHHHhcCHHHHHHH
Confidence 78899999998764
No 23
>PHA03081 putative metalloprotease; Provisional
Probab=94.36 E-value=0.065 Score=48.08 Aligned_cols=91 Identities=13% Similarity=0.243 Sum_probs=60.9
Q ss_pred EEcCCCCEEEEeeCCCCeEEEEEEEecCCCCCCCCCCcHHHHHHHHHhcccCCCCCHHHHHHHHHHhCCeeeeeecccee
Q psy8067 67 TTLPNGLRVASENRFGSFCTIGVIIDSGSRYEQPYPSGVSHFLEKLAFNSTSEYVNKDAIAQSLEKIGGICDCLSSRDTF 146 (239)
Q Consensus 67 ~~L~NG~~v~~~~~~~~~~~i~~~~~~Gs~~e~~~~~g~a~l~~~~~~~G~~~~~s~~~l~~~l~~~g~~~~~~~~~~~~ 146 (239)
.+|+||++|.+.+.-..-+.+++. +.|--.+-.+.-|++|+++|++.+ | +. .++ -.++++++.+.
T Consensus 2 i~~~ngvr~f~~~~m~kdiy~gi~-~fgfe~di~~~lg~ahllehili~----f-d~---~~f------~anast~r~ym 66 (595)
T PHA03081 2 IVLSNGVRIFINPSMKKDIYLGIS-NFGFENDIGEILGIAHLLEHILIS----F-DS---SKF------VANASTARSYM 66 (595)
T ss_pred eEecCceEEEecCccccceEEeec-ccccccchHHHHhHHHHHHHHeee----c-ch---HHh------cccchhhhhhH
Confidence 479999999998866556666653 233334555678999999999763 3 22 122 23567778777
Q ss_pred EEEEEecCC-CHHHHHHHHHHhhhCCC
Q psy8067 147 IYAASAATS-GLDTVMKILGDVVLRPK 172 (239)
Q Consensus 147 ~~~~~~~~~-~le~~l~ll~~~~~~P~ 172 (239)
+|...+... .-..++.-+..+|.++.
T Consensus 67 sfwc~sirg~~y~DAvrtliSWFF~~~ 93 (595)
T PHA03081 67 SFWCKSIRGRSYIDAIRTLISWFFDNG 93 (595)
T ss_pred hHhhHhhcCCchHHHHHHHHHHhccCC
Confidence 777766553 33567777778887765
No 24
>KOG0960|consensus
Probab=92.37 E-value=2 Score=37.98 Aligned_cols=159 Identities=10% Similarity=0.068 Sum_probs=96.9
Q ss_pred CCEEEEeeCCCCeEEEEEEEecCCCCCCCCCCcHHHHHHHHH------hcccCCCCCHHHHHHHHHHhC---Ceeeeeec
Q psy8067 72 GLRVASENRFGSFCTIGVIIDSGSRYEQPYPSGVSHFLEKLA------FNSTSEYVNKDAIAQSLEKIG---GICDCLSS 142 (239)
Q Consensus 72 G~~v~~~~~~~~~~~i~~~~~~Gs~~e~~~~~g~a~l~~~~~------~~G~~~~~s~~~l~~~l~~~g---~~~~~~~~ 142 (239)
|-.|-+.+.+.|...+++.+.|-+...|+. ++..+...+ ..|.+.. ....|.+...... .-.++.++
T Consensus 258 gsEvR~rdd~lP~a~~AiAVEG~~w~~pD~---~~l~van~iiG~wdr~~g~g~~-~~s~La~~~~~~~l~~sfqsFnt~ 333 (467)
T KOG0960|consen 258 GSEVRVRDDDLPLAHIAIAVEGVSWAHPDY---FALMVANTIIGNWDRTEGGGRN-LSSRLAQKIQQDQLCHSFQSFNTS 333 (467)
T ss_pred CceeeecCCCCchhheeeeEecCCcCCccH---HHHHHHHHHhhhhhcccCCccC-CccHHHHHHHHHHHHHHHhhhhcc
Confidence 666667777779999999999988777752 444433222 3455543 3456666554431 11233333
Q ss_pred cce---eEEEEEe-cCCCHHHHHHHHHHhhh--CCCCCHHHHHHHHHHHHHHHHHHhcCCchHHHHHHHHHHHhcCCCCC
Q psy8067 143 RDT---FIYAASA-ATSGLDTVMKILGDVVL--RPKFSKEEINMARQTIQFELENYQMRPEQETLLMDMIHAAAYKDNTL 216 (239)
Q Consensus 143 ~~~---~~~~~~~-~~~~le~~l~ll~~~~~--~P~f~~~~~~~~k~~~~~~l~~~~~~p~~~~~~~~~l~~~~~~~~p~ 216 (239)
.-- .++++-+ ....++.++..+..-+. ....++.++++.|.+++..+-...+.. ..++.+.-++.++.+..
T Consensus 334 YkDTGLwG~y~V~~~~~~iddl~~~vl~eW~rL~~~vteaEV~RAKn~Lkt~Lll~ldgt--tpi~ediGrqlL~~Grr- 410 (467)
T KOG0960|consen 334 YKDTGLWGIYFVTDNLTMIDDLIHSVLKEWMRLATSVTEAEVERAKNQLKTNLLLSLDGT--TPIAEDIGRQLLTYGRR- 410 (467)
T ss_pred cccccceeEEEEecChhhHHHHHHHHHHHHHHHHhhccHHHHHHHHHHHHHHHHHHhcCC--CchHHHHHHHHhhcCCc-
Confidence 333 3444445 34445555544432221 136899999999999997776655544 45777877787776543
Q ss_pred CCCCCCChhhhcCCCHHHHhcc
Q psy8067 217 GLPKLCPPASIGVINNNTLFTY 238 (239)
Q Consensus 217 ~~~~~g~~~~l~~i~~e~l~~~ 238 (239)
.|+.--...|++|+.++++++
T Consensus 411 -i~l~El~~rId~vt~~~Vr~v 431 (467)
T KOG0960|consen 411 -IPLAELEARIDAVTAKDVREV 431 (467)
T ss_pred -CChHHHHHHHhhccHHHHHHH
Confidence 222123468999999999875
No 25
>PF05193 Peptidase_M16_C: Peptidase M16 inactive domain; InterPro: IPR007863 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. These metallopeptidases belong to MEROPS peptidase family M16 (clan ME). They include proteins, which are classified as non-peptidase homologues either have been found experimentally to be without peptidase activity, or lack amino acid residues that are believed to be essential for the catalytic activity. The peptidases in this group of sequences include: Insulinase, insulin-degrading enzyme (3.4.24.56 from EC) Mitochondrial processing peptidase alpha subunit, (Alpha-MPP, 3.4.24.64 from EC) Pitrlysin, Protease III precursor (3.4.24.55 from EC) Nardilysin, (3.4.24.61 from EC) Ubiquinol-cytochrome C reductase complex core protein I,mitochondrial precursor (1.10.2.2 from EC) Coenzyme PQQ synthesis protein F (3.4.99 from EC) These proteins do not share many regions of sequence similarity; the most noticeable is in the N-terminal section. This region includes a conserved histidine followed, two residues later by a glutamate and another histidine. In pitrilysin, it has been shown [] that this H-x-x-E-H motif is involved in enzymatic activity; the two histidines bind zinc and the glutamate is necessary for catalytic activity. The mitochondrial processing peptidase consists of two structurally related domains. One is the active peptidase whereas the other, the C-terminal region, is inactive. The two domains hold the substrate like a clamp [].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis; PDB: 1BE3_B 1PP9_B 2A06_B 1SQB_B 1SQP_B 1L0N_B 1SQX_B 1NU1_B 1L0L_B 2FYU_B ....
Probab=89.15 E-value=5.5 Score=30.11 Aligned_cols=108 Identities=19% Similarity=0.187 Sum_probs=60.6
Q ss_pred CCEEEEeeCCCCeEEEEEEEecCCCCCCCCCCcHHHHHHHHHhcccCCCCCHHHHHHHHH-HhCC--eeeeeec--c--c
Q psy8067 72 GLRVASENRFGSFCTIGVIIDSGSRYEQPYPSGVSHFLEKLAFNSTSEYVNKDAIAQSLE-KIGG--ICDCLSS--R--D 144 (239)
Q Consensus 72 G~~v~~~~~~~~~~~i~~~~~~Gs~~e~~~~~g~a~l~~~~~~~G~~~~~s~~~l~~~l~-~~g~--~~~~~~~--~--~ 144 (239)
+..++....+.+...+.+.+.+....... ......++..++..+.. ..|.+.|. ..+. +++++.. . .
T Consensus 67 ~~~~~~~~~~~~~~~v~~~~~~~~~~~~~-~~~~~~~l~~~l~~~~~-----s~l~~~lr~~~~l~y~v~~~~~~~~~~~ 140 (184)
T PF05193_consen 67 GKEIVIPSKDESQSIVSIAFPGPPIKDSK-DYFALNLLSSLLGNGMS-----SRLFQELREKQGLAYSVSASNSSYRDSG 140 (184)
T ss_dssp EEEEEEEESSSSSEEEEEEEEEEETGTST-THHHHHHHHHHHHCSTT-----SHHHHHHHTTTTSESEEEEEEEEESSEE
T ss_pred ccccccccccccccccccccccccccccc-hhhHHHHHHHHHhcCcc-----chhHHHHHhccccceEEEeeeeccccce
Confidence 44455555443455555555544442222 23555666777665522 25666666 4443 3333322 2 3
Q ss_pred eeEEEEEecCCCHHHHHHHHHHhhhC---CCCCHHHHHHHHHHH
Q psy8067 145 TFIYAASAATSGLDTVMKILGDVVLR---PKFSKEEINMARQTI 185 (239)
Q Consensus 145 ~~~~~~~~~~~~le~~l~ll~~~~~~---P~f~~~~~~~~k~~~ 185 (239)
...+.+.+.+++++.+++.+...+.. -.+++++|++.|+++
T Consensus 141 ~~~i~~~~~~~~~~~~~~~~~~~l~~l~~~~~s~~el~~~k~~L 184 (184)
T PF05193_consen 141 LFSISFQVTPENLDEAIEAILQELKRLREGGISEEELERAKNQL 184 (184)
T ss_dssp EEEEEEEEEGGGHHHHHHHHHHHHHHHHHHCS-HHHHHHHHHHH
T ss_pred EEEEEEEcCcccHHHHHHHHHHHHHHHHHcCCCHHHHHHHHhcC
Confidence 45666777777888877777666543 249999999988764
No 26
>COG0612 PqqL Predicted Zn-dependent peptidases [General function prediction only]
Probab=71.58 E-value=63 Score=28.95 Aligned_cols=110 Identities=20% Similarity=0.120 Sum_probs=64.6
Q ss_pred HHHHHH-HHhCCeeeeee--cc----ceeEEEEEecCCCHH----HHHHHHHHhhhCCC--CCHHHHHHHHHHHHHHHHH
Q psy8067 125 AIAQSL-EKIGGICDCLS--SR----DTFIYAASAATSGLD----TVMKILGDVVLRPK--FSKEEINMARQTIQFELEN 191 (239)
Q Consensus 125 ~l~~~l-~~~g~~~~~~~--~~----~~~~~~~~~~~~~le----~~l~ll~~~~~~P~--f~~~~~~~~k~~~~~~l~~ 191 (239)
.|...+ +..|...+.+. +. ....+...+...+.+ .+-+++........ +++++++..+..+...+-.
T Consensus 291 rLf~~~re~~glay~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~t~~~~~~~k~~~~~~~~~ 370 (438)
T COG0612 291 RLFQELREKRGLAYSVSSFSDFLSDSGLFSIYAGTAPENPEKTAELVEEILKALKKGLKGPFTEEELDAAKQLLIGLLLL 370 (438)
T ss_pred HHHHHHHHhcCceeeeccccccccccCCceEEEEecCCChhhHHHHHHHHHHHHHHHhccCCCHHHHHHHHHHHHHHhhh
Confidence 455444 44477666653 22 333444555544433 33333333334432 8899999999888877776
Q ss_pred HhcCCchHHHHHHHHHHHhcCCCCCCCCCCCChhhhcCCCHHHHhcc
Q psy8067 192 YQMRPEQETLLMDMIHAAAYKDNTLGLPKLCPPASIGVINNNTLFTY 238 (239)
Q Consensus 192 ~~~~p~~~~~~~~~l~~~~~~~~p~~~~~~g~~~~l~~i~~e~l~~~ 238 (239)
..++| ...+........++........ ..+.|+.++.+++.++
T Consensus 371 ~~~s~--~~~~~~~~~~~~~~~~~~~~~~--~~~~i~~vt~~dv~~~ 413 (438)
T COG0612 371 SLDSP--SSIAELLGQYLLLGGSLITLEE--LLERIEAVTLEDVNAV 413 (438)
T ss_pred ccCCH--HHHHHHHHHHHHhcCCccCHHH--HHHHHHhcCHHHHHHH
Confidence 66666 6666655555565544332222 4689999999997653
No 27
>KOG0961|consensus
Probab=67.06 E-value=20 Score=34.31 Aligned_cols=135 Identities=13% Similarity=0.097 Sum_probs=96.3
Q ss_pred CCCeEEEEEEEecCCCCCCCCCCcHHHHHHHH-----HhcccC---CCCCHHHHHHHHHHhCCeeeeeec-----cceeE
Q psy8067 81 FGSFCTIGVIIDSGSRYEQPYPSGVSHFLEKL-----AFNSTS---EYVNKDAIAQSLEKIGGICDCLSS-----RDTFI 147 (239)
Q Consensus 81 ~~~~~~i~~~~~~Gs~~e~~~~~g~a~l~~~~-----~~~G~~---~~~s~~~l~~~l~~~g~~~~~~~~-----~~~~~ 147 (239)
+...+.+.+.+...+.. ....+-+..+...+ +..|+- -+.+..++.+.+.+..++.+..++ ++-..
T Consensus 559 ps~Fvel~fl~dss~i~-~sl~pYl~~f~~l~~~~pa~ldgtiptp~~~s~~~v~~~~~s~~id~si~~g~~G~~~~lvn 637 (1022)
T KOG0961|consen 559 PSKFVELFFLLDSSNIS-ISLRPYLFLFTDLLFESPAMLDGTIPTPVLTSADDVAKHFTSDLIDHSIQVGVSGLYDRLVN 637 (1022)
T ss_pred chHHHhHhhhhccccCc-hhhhhHHHHHHHHHhcCHHHhcCCCCcchhhhHHHHHHHHHhhhhhhhhcccccccchhhee
Confidence 34555566666555443 22334444443322 345532 223677888888777665555544 47788
Q ss_pred EEEEecCCCHHHHHHHHHHhhhCCCCCHHHHHHHHHHHHHHHHHHhcCCchHHHHHHHHHHHhcCCCCCCC
Q psy8067 148 YAASAATSGLDTVMKILGDVVLRPKFSKEEINMARQTIQFELENYQMRPEQETLLMDMIHAAAYKDNTLGL 218 (239)
Q Consensus 148 ~~~~~~~~~le~~l~ll~~~~~~P~f~~~~~~~~k~~~~~~l~~~~~~p~~~~~~~~~l~~~~~~~~p~~~ 218 (239)
+.+.+..+.-+...+.+..++..--||++.+-...+++..++...+.+- ..++.......+|+++..-.
T Consensus 638 ~~Ikv~a~~Y~~~v~Wi~~~l~~~VfD~~Ri~~~~~~~l~~i~~~KRdg--~~vlss~~~~~lY~~~slk~ 706 (1022)
T KOG0961|consen 638 LRIKVGADKYPLLVKWIQIFLQGVVFDPSRIHQCAQKLLGEIRDRKRDG--CTVLSSAVASMLYGKNSLKI 706 (1022)
T ss_pred EEEEEccCCcchhHHHHHHHhhhhccCHHHHHHHHHHHHhhhhhhhcCc--cEehHHHHHHHHhcccchhh
Confidence 9999999999999999999999999999999999999999998888776 67788888888998765543
No 28
>PF08288 PIGA: PIGA (GPI anchor biosynthesis); InterPro: IPR013234 This domain is found on phosphatidylinositol N-acetylglucosaminyltransferase proteins. These proteins are involved in GPI anchor biosynthesis and are associated with the disease paroxysmal nocturnal haemoglobinuria [].; GO: 0006506 GPI anchor biosynthetic process
Probab=48.20 E-value=16 Score=25.36 Aligned_cols=15 Identities=40% Similarity=0.355 Sum_probs=12.8
Q ss_pred EEEcCCCCEEEEeeC
Q psy8067 66 VTTLPNGLRVASENR 80 (239)
Q Consensus 66 ~~~L~NG~~v~~~~~ 80 (239)
++-|+||+|||..|-
T Consensus 9 vRyltngLKVYYlP~ 23 (90)
T PF08288_consen 9 VRYLTNGLKVYYLPL 23 (90)
T ss_pred eEEcCCCeEEEeecc
Confidence 468999999999884
No 29
>PRK11840 bifunctional sulfur carrier protein/thiazole synthase protein; Provisional
Probab=38.17 E-value=29 Score=30.10 Aligned_cols=56 Identities=18% Similarity=0.146 Sum_probs=37.3
Q ss_pred CCeEEEEEEEecCCCCCCCCCC--cHHHHHHHHHhcccCCCCCHHHHHHHHHHhCCeee
Q psy8067 82 GSFCTIGVIIDSGSRYEQPYPS--GVSHFLEKLAFNSTSEYVNKDAIAQSLEKIGGICD 138 (239)
Q Consensus 82 ~~~~~i~~~~~~Gs~~e~~~~~--g~a~l~~~~~~~G~~~~~s~~~l~~~l~~~g~~~~ 138 (239)
++.+.+--.++|||.....+.. +-..| ..-++-||++|+|...+.+.+..-|+.+-
T Consensus 54 GD~IEII~~VgGGs~~~~~~~~~i~~~~~-~sRl~~Gtg~y~s~~~~~~a~~asg~e~v 111 (326)
T PRK11840 54 GDELEIVHFVGGGSDLVADDSWTVAGKTF-SSRLLVGTGKYKDFEETAAAVEASGAEIV 111 (326)
T ss_pred CCEEEEEEEecCCCCCCCCCCeEECCEEE-ecceeEecCCCCCHHHHHHHHHHhCCCEE
Confidence 4566666678888863333221 11222 33356699999999999999999888763
No 30
>cd04914 ACT_AKi-DapG-BS_1 ACT domains of the diaminopimelate-sensitive aspartokinase (AK) isoenzyme AKI. This CD includes the N-terminal of the two ACT domains of the diaminopimelate-sensitive aspartokinase (AK) isoenzyme AKI, a monofunctional class enzyme found in Bacilli (Bacillus subtilis (BS) strain 168), Clostridia, and Actinobacteria, bacterial species. In B. subtilis, the regulation of the diaminopimelate-lysine biosynthetic pathway involves dual control by diaminopimelate and lysine, effected through separate diaminopimelate- and lysine-sensitive aspartokinase isoenzymes. AKI activity is invariant during the exponential and stationary phases of growth and is not altered by addition of amino acids to the growth medium. The role of this isoenzyme is most likely to provide a constant level of aspartyl-beta-phosphate for the biosynthesis of diaminopimelate for peptidoglycan synthesis and dipicolinate during sporulation. The B. subtilis AKI is tetrameric consisting of two alpha and
Probab=37.76 E-value=93 Score=19.85 Aligned_cols=48 Identities=15% Similarity=0.148 Sum_probs=35.3
Q ss_pred HHHHHHHHHhCCeeeeeeccceeEEEEEecCCCHHHHHHHHHHhhhCCC
Q psy8067 124 DAIAQSLEKIGGICDCLSSRDTFIYAASAATSGLDTVMKILGDVVLRPK 172 (239)
Q Consensus 124 ~~l~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~le~~l~ll~~~~~~P~ 172 (239)
.++-+.|++.|+++.... .....+++.....+++.+.+++..+=.+|.
T Consensus 17 a~if~~La~~~InvDmI~-~~~~~isFtv~~~d~~~~~~il~~~~~~~~ 64 (67)
T cd04914 17 QRVFKALANAGISVDLIN-VSPEEVIFTVDGEVAEKAVDILEKMGLDPS 64 (67)
T ss_pred HHHHHHHHHcCCcEEEEE-ecCCCEEEEEchhhHHHHHHHHHHcCCceE
Confidence 467788888888888873 222358888888999988888777665554
No 31
>cd04922 ACT_AKi-HSDH-ThrA_2 ACT domains of the bifunctional enzyme aspartokinase (AK) - homoserine dehydrogenase (HSDH). This CD includes the second of two ACT domains of the bifunctional enzyme aspartokinase (AK) - homoserine dehydrogenase (HSDH). The ACT domains are positioned between the N-terminal catalytic domain of AK and the C-terminal HSDH domain found in bacteria (Escherichia coli (EC) ThrA) and higher plants (Zea mays AK-HSDH). AK and HSDH are the first and third enzymes in the biosynthetic pathway of the aspartate family of amino acids. AK catalyzes the phosphorylation of Asp to P-aspartyl phosphate. HSDH catalyzes the NADPH-dependent conversion of Asp 3-semialdehyde to homoserine. HSDH is the first committed reaction in the branch of the pathway that leads to Thr and Met. In E. coli, ThrA is subject to allosteric regulation by the end product L-threonine and the native enzyme is reported to be tetrameric. As with bacteria, plant AK and HSDH are feedback inhibited by pathwa
Probab=36.95 E-value=1e+02 Score=18.93 Aligned_cols=46 Identities=7% Similarity=0.049 Sum_probs=34.2
Q ss_pred HHHHHHHHHhCCeeeeeec-cceeEEEEEecCCCHHHHHHHHHHhhh
Q psy8067 124 DAIAQSLEKIGGICDCLSS-RDTFIYAASAATSGLDTVMKILGDVVL 169 (239)
Q Consensus 124 ~~l~~~l~~~g~~~~~~~~-~~~~~~~~~~~~~~le~~l~ll~~~~~ 169 (239)
.++-+.|.+.|.++..... .....+++....++.+.+++.|++.|.
T Consensus 19 ~~i~~~l~~~~I~v~~i~~~~s~~~is~~v~~~~~~~~~~~lh~~~~ 65 (66)
T cd04922 19 ATFFSALAKANVNIRAIAQGSSERNISAVIDEDDATKALRAVHERFF 65 (66)
T ss_pred HHHHHHHHHCCCCEEEEEecCcccEEEEEEeHHHHHHHHHHHHHHHh
Confidence 4666778888888865532 234788888888999999999888775
No 32
>PF09186 DUF1949: Domain of unknown function (DUF1949); InterPro: IPR015269 Members of this entry are a set of functionally uncharacterised hypothetical bacterial proteins. They adopt a ferredoxin-like fold, with a beta-alpha-beta-beta-alpha-beta arrangement []. This entry contains the protein Impact, which is a translational regulator that ensures constant high levels of translation under amino acid starvation. It acts by interacting with Gcn1/Gcn1L1, thereby preventing activation of Gcn2 protein kinases (EIF2AK1 to 4) and subsequent down-regulation of protein synthesis. It is evolutionary conserved from eukaryotes to archaea []. ; PDB: 2CVE_A 1VI7_A.
Probab=34.94 E-value=1e+02 Score=18.40 Aligned_cols=49 Identities=14% Similarity=0.122 Sum_probs=40.7
Q ss_pred CCCHHHHHHHHHHhCCeeeeeeccceeEEEEEecCCCHHHHHHHHHHhh
Q psy8067 120 YVNKDAIAQSLEKIGGICDCLSSRDTFIYAASAATSGLDTVMKILGDVV 168 (239)
Q Consensus 120 ~~s~~~l~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~le~~l~ll~~~~ 168 (239)
|+....+..+++..++.+.-..-.+...+.+..+.++.+...+.|.+..
T Consensus 5 Y~~~~~v~~~l~~~~~~i~~~~y~~~V~~~v~v~~~~~~~f~~~l~~~t 53 (56)
T PF09186_consen 5 YSQYGKVERLLEQNGIEIVDEDYTDDVTLTVAVPEEEVEEFKAQLTDLT 53 (56)
T ss_dssp CCCHHHHHHHHHHTTTEEEEEEECTTEEEEEEEECCCHHHHHHHHHHHT
T ss_pred hhhHHHHHHHHHHCCCEEEcceecceEEEEEEECHHHHHHHHHHHHHHc
Confidence 4456788899999999997766677799999999999999888887753
No 33
>PHA02541 23 major capsid protein; Provisional
Probab=34.30 E-value=68 Score=29.40 Aligned_cols=115 Identities=13% Similarity=0.092 Sum_probs=58.1
Q ss_pred ccccCceEEeeccccCCCCCCCCCCCCCCCCCCCCCCCCCcCCCCCCCCcEEEEcCCCCEEEEeeCC-CCeEEEEEEEec
Q psy8067 15 SIKGNHIQCSHFCSQAKPPPPSFQQPPLSTPLPGLPKVHYSCVKDNDRNTQVTTLPNGLRVASENRF-GSFCTIGVIIDS 93 (239)
Q Consensus 15 ~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~p~~i~~~~~~~~~~~~~~~L~NG~~v~~~~~~-~~~~~i~~~~~~ 93 (239)
|.+||.|-||+.|..+-...+--...+. +- ... .. ..+....+...+|.||++||+-+-. .+.+.++.- |
T Consensus 368 rg~gnfvi~S~~Vas~La~~g~l~~~~a---~~-~~~-~~--~~D~t~~~~aG~L~~~~~VYiD~ya~~dy~~vGyK--G 438 (518)
T PHA02541 368 RGAGNFIIASRNVVSALAMADTGISPAA---QG-LAG-GL--NVDTTKSVFAGVLGGKYKVYIDQYARQDYFTVGYK--G 438 (518)
T ss_pred cCcceEEEEchhHHHHHhhCCccccccc---cc-ccc-cc--CcCcccceeeeeecCceEEEecCCCcccEEEEEec--c
Confidence 6899999999999987666543221111 00 000 00 0111123566899999999997743 455555443 3
Q ss_pred CCCCCCCCCCcHHHH--HHHHHhcccCCCCCHHHHHHHHHHhCCeeeeeecc
Q psy8067 94 GSRYEQPYPSGVSHF--LEKLAFNSTSEYVNKDAIAQSLEKIGGICDCLSSR 143 (239)
Q Consensus 94 Gs~~e~~~~~g~a~l--~~~~~~~G~~~~~s~~~l~~~l~~~g~~~~~~~~~ 143 (239)
.+..+. |+-+. ++....+++.. .+-+--.-++...|..+|+++..
T Consensus 439 ~~e~da----glfyaPYv~l~~~r~~DP-~sfqP~~gfktRYgl~~NPf~~~ 485 (518)
T PHA02541 439 SNEMDA----GIYYCPYVALTPLRGSDP-KNFQPVMGFKTRYGIGINPFADS 485 (518)
T ss_pred Cccccc----eeEecccccccccCCCCC-cccCcceeeeeeeeeccCCcccc
Confidence 222211 11111 11122333322 12223334556667777777753
No 34
>PF06135 DUF965: Bacterial protein of unknown function (DUF965); InterPro: IPR009309 This family consists of several hypothetical bacterial proteins. The function of the family is unknown.
Probab=34.30 E-value=1.1e+02 Score=20.75 Aligned_cols=58 Identities=16% Similarity=0.137 Sum_probs=37.9
Q ss_pred HHHHHHHHHHHHHHHHHHhcCCchHHHHHHHHHHHhcCCCCCCCCCCCChhhhcCCCHHHHhc
Q psy8067 175 KEEINMARQTIQFELENYQMRPEQETLLMDMIHAAAYKDNTLGLPKLCPPASIGVINNNTLFT 237 (239)
Q Consensus 175 ~~~~~~~k~~~~~~l~~~~~~p~~~~~~~~~l~~~~~~~~p~~~~~~g~~~~l~~i~~e~l~~ 237 (239)
+..+......+-..|+..--+| .++.+--++-|+..|-....+-..-|.++.++++.+
T Consensus 14 ~~~~~~iL~~Vy~AL~EKGYnP-----inQivGYllSGDPaYItsh~nAR~lIr~~eRDellE 71 (79)
T PF06135_consen 14 EKEIREILKQVYAALEEKGYNP-----INQIVGYLLSGDPAYITSHNNARNLIRKIERDELLE 71 (79)
T ss_pred hhhHHHHHHHHHHHHHHcCCCh-----HHHHHhheecCCCccccCcccHHHHHHHHhHHHHHH
Confidence 4555556666667777776677 666666667776555433334467888899888764
No 35
>PHA02548 24 capsid vertex protein; Provisional
Probab=34.25 E-value=63 Score=28.67 Aligned_cols=63 Identities=21% Similarity=0.211 Sum_probs=42.6
Q ss_pred ccccCceEEeeccccCCCCCCCCCCCCCCCCCCCCCCCCCcCCCCCCCCcEEEEcCCCCEEEEeeCC-CCeEEEEEEEec
Q psy8067 15 SIKGNHIQCSHFCSQAKPPPPSFQQPPLSTPLPGLPKVHYSCVKDNDRNTQVTTLPNGLRVASENRF-GSFCTIGVIIDS 93 (239)
Q Consensus 15 ~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~p~~i~~~~~~~~~~~~~~~L~NG~~v~~~~~~-~~~~~i~~~~~~ 93 (239)
|.+||.+-||+.|..+-....- ... ......+...+|.||++|+.-+.. .+.+.++.-...
T Consensus 262 rg~gnfvv~S~~Vas~La~sG~-~~~-----------------~D~t~~~~aG~L~ng~~VY~D~ya~~DY~~vGyK~~~ 323 (412)
T PHA02548 262 SFEATYVVASSRVAAILAASGW-MEE-----------------KDEDDPLAYGYLRNGLPVYADTNSPFDYVIVGCKHDI 323 (412)
T ss_pred cCceEEEEEchhHHHHHhhcCe-eec-----------------cCCCCceeeeeecCccEEEecCCCcccEEEEEEeccC
Confidence 7889999999999876554311 110 112334567899999999998754 477777776555
Q ss_pred CC
Q psy8067 94 GS 95 (239)
Q Consensus 94 Gs 95 (239)
|.
T Consensus 324 G~ 325 (412)
T PHA02548 324 GE 325 (412)
T ss_pred CC
Confidence 54
No 36
>cd04919 ACT_AK-Hom3_2 ACT domains located C-terminal to the catalytic domain of the aspartokinase (AK) HOM3. This CD includes the second of two ACT domains located C-terminal to the catalytic domain of the aspartokinase (AK) HOM3, a monofunctional class enzyme found in Saccharomyces cerevisiae, and other related ACT domains. AK is the first enzyme in the aspartate metabolic pathway, catalyzes the conversion of aspartate and ATP to aspartylphosphate and ADP, and in fungi, is responsible for the production of threonine, isoleucine and methionine. S. cerevisiae has a single AK, which is regulated by feedback, allosteric inhibition by L-threonine. Recent studies shown that the allosteric transition triggered by binding of threonine to AK involves a large change in the conformation of the native hexameric enzyme that is converted to an inactive one of different shape and substantially smaller hydrodynamic size. Members of this CD belong to the superfamily of ACT regulatory domains.
Probab=32.93 E-value=1.2e+02 Score=18.66 Aligned_cols=46 Identities=9% Similarity=0.051 Sum_probs=33.8
Q ss_pred HHHHHHHHHhCCeeeeeec-cceeEEEEEecCCCHHHHHHHHHHhhh
Q psy8067 124 DAIAQSLEKIGGICDCLSS-RDTFIYAASAATSGLDTVMKILGDVVL 169 (239)
Q Consensus 124 ~~l~~~l~~~g~~~~~~~~-~~~~~~~~~~~~~~le~~l~ll~~~~~ 169 (239)
.++-+.|+..|+++..-.. .....+++....++.+.+++.++..|.
T Consensus 19 ~~if~~L~~~~I~v~~i~q~~s~~~isf~v~~~~~~~a~~~lh~~~~ 65 (66)
T cd04919 19 GRMFTTLADHRINIEMISQGASEINISCVIDEKDAVKALNIIHTNLL 65 (66)
T ss_pred HHHHHHHHHCCCCEEEEEecCccceEEEEEeHHHHHHHHHHHHHHHh
Confidence 4667778888888754431 234778888888899999999888764
No 37
>PTZ00115 40S ribosomal protein S12; Provisional
Probab=32.61 E-value=3e+02 Score=23.09 Aligned_cols=44 Identities=18% Similarity=0.219 Sum_probs=26.3
Q ss_pred CCCCcEEEEcCCCCEEEE-eeCCC---CeEEEEEEEecCCCCCCCCCCcHHH
Q psy8067 60 NDRNTQVTTLPNGLRVAS-ENRFG---SFCTIGVIIDSGSRYEQPYPSGVSH 107 (239)
Q Consensus 60 ~~~~~~~~~L~NG~~v~~-~~~~~---~~~~i~~~~~~Gs~~e~~~~~g~a~ 107 (239)
....+...+|+||-+|.. .|..+ +.-. .+.+.||...|- +|+.+
T Consensus 144 A~RKvarVrLsNGk~VtAyIPGeGHnLQEHs-~VLVRGGrvkDL---PGVrY 191 (290)
T PTZ00115 144 GLRKVARVRLSTGRTVTVYIPGIGHNLNTHS-VVLVRGGRCKDV---PGCNY 191 (290)
T ss_pred cccceEEEEecCCCEEEEEcCCCCcccccCC-EEEEeCCCcCCC---CCceE
Confidence 355667789999999975 57543 1111 223467775444 56544
No 38
>PF13840 ACT_7: ACT domain ; PDB: 3S1T_A 1ZHV_A 3AB4_K 3AB2_O 2DTJ_A 3AAW_A 2RE1_B 3MAH_A 1ZVP_D.
Probab=32.33 E-value=1.3e+02 Score=18.97 Aligned_cols=40 Identities=15% Similarity=0.135 Sum_probs=29.1
Q ss_pred HHHHHHHHhCCeeeeeeccceeEEEEEecCCCHHHHHHHHHH
Q psy8067 125 AIAQSLEKIGGICDCLSSRDTFIYAASAATSGLDTVMKILGD 166 (239)
Q Consensus 125 ~l~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~le~~l~ll~~ 166 (239)
.+.+.|+..|.++-... -+...++-...++++.+++.|.+
T Consensus 26 ~i~~~La~~~I~i~~is--S~~~~~ilV~~~~~~~A~~~L~~ 65 (65)
T PF13840_consen 26 KIFSALAEAGINIFMIS--SEISISILVKEEDLEKAVEALHE 65 (65)
T ss_dssp HHHHHHHHTTS-ECEEE--ESSEEEEEEEGGGHHHHHHHHHH
T ss_pred HHHHHHHHCCCCEEEEE--EeeeEEEEEeHHHHHHHHHHhcC
Confidence 44556777788886665 56667777888999999998863
No 39
>PF12122 DUF3582: Protein of unknown function (DUF3582); InterPro: IPR022732 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This entry represents the N-terminal domain of membrane-bound serine endopeptidases belonging to MEROPS peptidase family S54 (rhomboid-1, clan ST). This domain contains a conserved ASW sequence motif and a single completely conserved residue F that may be functionally important. The tertiary structure of the GlpG protein from Escherichia coli has been determined []. The GlpG protein has six transmembrane domains (other members of the family are predicted to have seven), with the N- and C-terminal ends anchored in the cytoplasm. One transmembrane domain is shorter than the rest, creating an internal, aqueous cavity just below the membrane surface and it is here were proteolysis occurs. There is also a membrane-embedded loop between the first and second transmembrane domains which is postulated to act as a gate controlling substrate access to the active site. No other family of serine peptidases is known to have active site residues within transmembrane domains (although transmembrane active sites are known for aspartic peptidase and metallopeptidases), and the GlpG protein has the type structure for clan ST.; GO: 0004252 serine-type endopeptidase activity, 0016021 integral to membrane; PDB: 3UBB_A 3B45_A 3B44_A 2NRF_A 3TXT_A 2O7L_A 2XTU_A 2IRV_A 2XOW_A 2XTV_A ....
Probab=31.88 E-value=1.6e+02 Score=20.77 Aligned_cols=48 Identities=15% Similarity=0.113 Sum_probs=30.8
Q ss_pred HHHHHHHHHHhCCeeeeeeccce-eEEEEEecCCCHHHHHHHHHHhhhCC
Q psy8067 123 KDAIAQSLEKIGGICDCLSSRDT-FIYAASAATSGLDTVMKILGDVVLRP 171 (239)
Q Consensus 123 ~~~l~~~l~~~g~~~~~~~~~~~-~~~~~~~~~~~le~~l~ll~~~~~~P 171 (239)
+..+.+.|...|+.+........ ..+.+. ..++++.+-+.+..++.+|
T Consensus 13 AqaF~DYl~sqgI~~~i~~~~~~~~~lwl~-de~~~~~a~~el~~Fl~nP 61 (101)
T PF12122_consen 13 AQAFIDYLASQGIELQIEPEGQGQFALWLH-DEEHLEQAEQELEEFLQNP 61 (101)
T ss_dssp HHHHHHHHHHTT--EEEE-SSSE--EEEES--GGGHHHHHHHHHHHHHS-
T ss_pred HHHHHHHHHHCCCeEEEEECCCCceEEEEe-CHHHHHHHHHHHHHHHHCC
Confidence 45778899999976666653333 344444 6678999999998998888
No 40
>cd04936 ACT_AKii-LysC-BS-like_2 ACT domains of the lysine-sensitive, aspartokinase (AK) isoenzyme AKII of Bacillus subtilis (BS) strain 168 and related domains. This CD includes the C-terminal of the two ACT domains of the lysine-sensitive, aspartokinase (AK) isoenzyme AKII of Bacillus subtilis (BS) strain 168, and the lysine plus threonine-sensitive aspartokinase of Corynebacterium glutamicum, and related sequences. In B. subtilis strain 168, the regulation of the diaminopimelate (Dap)-lysine biosynthetic pathway involves dual control by Dap and lysine, effected through separate Dap- and lysine-sensitive AK isoenzymes. The B. subtilis strain 168 AKII is induced by methionine and repressed and inhibited by lysine. Although C. glutamicum is known to contain a single AK, both the succinylase and dehydrogenase variant pathways of DAP-lysine synthesis operate simultaneously in this organism. In corynebacteria and other various Gram-positive bacteria, the DAP-lysine pathway is feedback regu
Probab=30.96 E-value=1.2e+02 Score=18.15 Aligned_cols=44 Identities=9% Similarity=0.159 Sum_probs=33.8
Q ss_pred HHHHHHHHHhCCeeeeeeccceeEEEEEecCCCHHHHHHHHHHhh
Q psy8067 124 DAIAQSLEKIGGICDCLSSRDTFIYAASAATSGLDTVMKILGDVV 168 (239)
Q Consensus 124 ~~l~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~le~~l~ll~~~~ 168 (239)
.++-+.|+..++++...... ...+++....++.+.+.+.+...|
T Consensus 18 ~~i~~~L~~~~i~v~~i~~s-~~~is~~v~~~d~~~~~~~l~~~~ 61 (63)
T cd04936 18 AKMFEALAEAGINIEMISTS-EIKISCLIDEDDAEKAVRALHEAF 61 (63)
T ss_pred HHHHHHHHHCCCcEEEEEcc-CceEEEEEeHHHHHHHHHHHHHHh
Confidence 46677788888888777643 467788888888988888887765
No 41
>PRK05473 hypothetical protein; Provisional
Probab=30.07 E-value=1.7e+02 Score=20.15 Aligned_cols=58 Identities=12% Similarity=0.096 Sum_probs=35.6
Q ss_pred HHHHHHHHHHHHHHHHHHhcCCchHHHHHHHHHHHhcCCCCCCCCCCCChhhhcCCCHHHHhc
Q psy8067 175 KEEINMARQTIQFELENYQMRPEQETLLMDMIHAAAYKDNTLGLPKLCPPASIGVINNNTLFT 237 (239)
Q Consensus 175 ~~~~~~~k~~~~~~l~~~~~~p~~~~~~~~~l~~~~~~~~p~~~~~~g~~~~l~~i~~e~l~~ 237 (239)
+..+......+-..|+..--+| .++.+--++.|+..|-.....-..-|.++.+|++.+
T Consensus 17 ~~~v~eiL~~Vy~AL~EKGYNP-----inQiVGYllSGDPaYItsh~nAR~lIrkiERDEilE 74 (86)
T PRK05473 17 KKDVREILTTVYDALEEKGYNP-----INQIVGYLLSGDPAYIPRHNDARNLIRKLERDEILE 74 (86)
T ss_pred HHHHHHHHHHHHHHHHHcCCCh-----HHHHHhhhccCCCCccCCcccHHHHHHHHhHHHHHH
Confidence 3445555666666666666666 566666666676544333223456788888888754
No 42
>cd04923 ACT_AK-LysC-DapG-like_2 ACT domains of the lysine-sensitive aspartokinase isoenzyme AKII of Bacillus subtilis (BS) strain 168 and related domains. This CD includes the C-terminal of the two ACT domains of the lysine-sensitive aspartokinase isoenzyme AKII of Bacillus subtilis (BS) strain 168, and the lysine plus threonine-sensitive aspartokinase of Corynebacterium glutamicum, as well as, the second and fourth, of four, ACT domains present in cyanobacteria AK. Also included are the C-terminal of the two ACT domains of the diaminopimelate-sensitive aspartokinase isoenzyme AKI found in Bacilli (B. subtilis strain 168), Clostridia, and Actinobacteria bacterial species. Members of this CD belong to the superfamily of ACT regulatory domains.
Probab=29.68 E-value=1.3e+02 Score=18.03 Aligned_cols=44 Identities=9% Similarity=0.164 Sum_probs=33.6
Q ss_pred HHHHHHHHHhCCeeeeeeccceeEEEEEecCCCHHHHHHHHHHhh
Q psy8067 124 DAIAQSLEKIGGICDCLSSRDTFIYAASAATSGLDTVMKILGDVV 168 (239)
Q Consensus 124 ~~l~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~le~~l~ll~~~~ 168 (239)
.++-+.|+..++++...... ...+++....++.+.+.+.+...|
T Consensus 18 ~~i~~~L~~~~i~v~~i~~s-~~~is~~v~~~~~~~~~~~l~~~l 61 (63)
T cd04923 18 AKMFKALAEAGINIEMISTS-EIKISCLVDEDDAEKAVRALHEAF 61 (63)
T ss_pred HHHHHHHHHCCCCEEEEEcc-CCeEEEEEeHHHHHHHHHHHHHHh
Confidence 46777788888888777643 466778888888888888887765
No 43
>cd04916 ACT_AKiii-YclM-BS_2 ACT domains located C-terminal to the catalytic domain of the lysine plus threonine-sensitive aspartokinase isoenzyme AKIII. This CD includes the second of two ACT domains located C-terminal to the catalytic domain of the lysine plus threonine-sensitive aspartokinase isoenzyme AKIII, a monofunctional class enzyme found in Bacilli (Bacillus subtilis (BS) YclM) and Clostridia species. Aspartokinase is the first enzyme in the aspartate metabolic pathway and catalyzes the conversion of aspartate and ATP to aspartylphosphate and ADP. B. subtilis YclM is reported to be a single polypeptide of 50 kD. AKIII from B. subtilis strain 168 is induced by lysine and repressed by threonine and it is synergistically inhibited by lysine and threonine. Members of this CD belong to the superfamily of ACT regulatory domains.
Probab=29.57 E-value=1.4e+02 Score=18.26 Aligned_cols=46 Identities=11% Similarity=0.105 Sum_probs=33.1
Q ss_pred HHHHHHHHHhCCeeeeeec-cceeEEEEEecCCCHHHHHHHHHHhhh
Q psy8067 124 DAIAQSLEKIGGICDCLSS-RDTFIYAASAATSGLDTVMKILGDVVL 169 (239)
Q Consensus 124 ~~l~~~l~~~g~~~~~~~~-~~~~~~~~~~~~~~le~~l~ll~~~~~ 169 (239)
.++-+.|++.+.++..... .....+++....++.+.+.+.+...|.
T Consensus 19 ~~i~~~L~~~~i~v~~i~~~~s~~~isf~v~~~d~~~~~~~lh~~~~ 65 (66)
T cd04916 19 ARATAALAKAGINIRMINQGSSEISIMIGVHNEDADKAVKAIYEEFF 65 (66)
T ss_pred HHHHHHHHHCCCCEEEEEecCcccEEEEEEeHHHHHHHHHHHHHHHh
Confidence 3566667777777765542 234678888888999999999888765
No 44
>COG4565 CitB Response regulator of citrate/malate metabolism [Transcription / Signal transduction mechanisms]
Probab=26.86 E-value=2.4e+02 Score=23.07 Aligned_cols=82 Identities=15% Similarity=0.186 Sum_probs=44.9
Q ss_pred ecCCCHHHHHHHH----HHhhhCCCCCHHHHHHHHHHHHHHHHHHhcCCchHHHHHHHHHHHhcCCCCCCCCCCCChhhh
Q psy8067 152 AATSGLDTVMKIL----GDVVLRPKFSKEEINMARQTIQFELENYQMRPEQETLLMDMIHAAAYKDNTLGLPKLCPPASI 227 (239)
Q Consensus 152 ~~~~~le~~l~ll----~~~~~~P~f~~~~~~~~k~~~~~~l~~~~~~p~~~~~~~~~l~~~~~~~~p~~~~~~g~~~~l 227 (239)
+...|.+++=+.+ .+|+..| |.-+.|+....+.......+.+. ..+-++++.+....... ..+...-+.-|
T Consensus 81 TAA~d~~tI~~alr~Gv~DYLiKP-f~~eRl~~aL~~y~~~r~~l~~~---~~~sQ~~lD~l~~~~~k-~~~~~~LPkGi 155 (224)
T COG4565 81 TAASDMETIKEALRYGVVDYLIKP-FTFERLQQALTRYRQKRHALESH---QQLSQKELDQLFNIQSK-EQPPDDLPKGL 155 (224)
T ss_pred eccchHHHHHHHHhcCchhheecc-eeHHHHHHHHHHHHHHHHHHhhh---cccCHHHHHHHHhcccc-ccCcccCCCCc
Confidence 3344555555544 5788888 77777777777776666655554 23445555555443322 22222233555
Q ss_pred cCCCHHHHhcc
Q psy8067 228 GVINNNTLFTY 238 (239)
Q Consensus 228 ~~i~~e~l~~~ 238 (239)
+..|...+.++
T Consensus 156 ~~~Tl~~i~~~ 166 (224)
T COG4565 156 DELTLQKVREA 166 (224)
T ss_pred CHHHHHHHHHH
Confidence 55666555543
No 45
>cd04924 ACT_AK-Arch_2 ACT domains of a monofunctional aspartokinase found mostly in Archaea species (ACT_AK-Arch_2). Included in this CD is the second of two ACT domains of a monofunctional aspartokinase found mostly in Archaea species (ACT_AK-Arch_2). The first or N-terminal ACT domain of these proteins cluster with the ThrA-like ACT 1 domains (ACT_AKi-HSDH-ThrA-like_1) which includes the threonine-sensitive archaeal Methanococcus jannaschii aspartokinase ACT 1 domain. Members of this CD belong to the superfamily of ACT regulatory domains.
Probab=26.65 E-value=1.6e+02 Score=17.95 Aligned_cols=45 Identities=16% Similarity=0.053 Sum_probs=32.8
Q ss_pred HHHHHHHHHhCCeeeeeec-cceeEEEEEecCCCHHHHHHHHHHhh
Q psy8067 124 DAIAQSLEKIGGICDCLSS-RDTFIYAASAATSGLDTVMKILGDVV 168 (239)
Q Consensus 124 ~~l~~~l~~~g~~~~~~~~-~~~~~~~~~~~~~~le~~l~ll~~~~ 168 (239)
.++-+.|++.+.++..... .....+++....++.+.+.+.++..|
T Consensus 19 ~~i~~~L~~~~I~v~~i~q~~s~~~isf~i~~~~~~~~~~~Lh~~~ 64 (66)
T cd04924 19 GRVFGALGKAGINVIMISQGSSEYNISFVVAEDDGWAAVKAVHDEF 64 (66)
T ss_pred HHHHHHHHHCCCCEEEEEecCccceEEEEEeHHHHHHHHHHHHHHh
Confidence 3566677787877765542 13367888888899999999988766
No 46
>PF15061 DUF4538: Domain of unknown function (DUF4538)
Probab=22.62 E-value=1.1e+02 Score=19.39 Aligned_cols=35 Identities=17% Similarity=0.310 Sum_probs=25.3
Q ss_pred HHHHHhhhCCCCCHHHHHHHHHHHHHHHHHHhcCC
Q psy8067 162 KILGDVVLRPKFSKEEINMARQTIQFELENYQMRP 196 (239)
Q Consensus 162 ~ll~~~~~~P~f~~~~~~~~k~~~~~~l~~~~~~p 196 (239)
..+|-.+..|-++++++.++.+.-++.+...+-+|
T Consensus 20 ~a~Ypi~~~Pmm~~eeYk~~Q~~nR~gI~qedvQP 54 (58)
T PF15061_consen 20 AALYPIYFRPMMNPEEYKKEQKINRAGIKQEDVQP 54 (58)
T ss_pred HHHhhhhcccccChHHHHHHHHHHHhcccHhhcCC
Confidence 44677788999999999887776665565554444
No 47
>PRK06063 DNA polymerase III subunit epsilon; Provisional
Probab=21.71 E-value=2.3e+02 Score=24.37 Aligned_cols=69 Identities=9% Similarity=0.075 Sum_probs=47.1
Q ss_pred HHHHhcccCCCCCHHHHHHHHHHhCCeeeeeeccceeEEEEEecC---CCHHHHHHHHHHhhhCCCCCHHHHHHHHHH
Q psy8067 110 EKLAFNSTSEYVNKDAIAQSLEKIGGICDCLSSRDTFIYAASAAT---SGLDTVMKILGDVVLRPKFSKEEINMARQT 184 (239)
Q Consensus 110 ~~~~~~G~~~~~s~~~l~~~l~~~g~~~~~~~~~~~~~~~~~~~~---~~le~~l~ll~~~~~~P~f~~~~~~~~k~~ 184 (239)
..+.++|+-.. ++.++.+.+...|+.+..+++.....+-..-.. ..+..+-++ -=|-+++++|.+..+.
T Consensus 236 ~~~v~TG~l~~-~R~e~~~~~~~~G~~v~~sVs~~t~~lv~g~~~~~ssK~~kA~~~-----gi~ii~e~~f~~ll~~ 307 (313)
T PRK06063 236 MRVALSAEVSR-THEELVERILHAGLAYSDSVDRDTSLVVCNDPAPEQGKGYHARQL-----GVPVLDEAAFLELLRA 307 (313)
T ss_pred CEEEEecCCCC-CHHHHHHHHHHcCCEecCccccCccEEEECCCCCcccHHHHHHHc-----CCccccHHHHHHHHHh
Confidence 35578888776 899999999999999999999866555433211 234444442 3366777777665543
No 48
>PF09851 SHOCT: Short C-terminal domain; InterPro: IPR018649 This family of hypothetical prokaryotic proteins has no known function.
Probab=21.53 E-value=1.5e+02 Score=15.88 Aligned_cols=28 Identities=14% Similarity=0.221 Sum_probs=22.1
Q ss_pred HHHHHHHHhhhCCCCCHHHHHHHHHHHH
Q psy8067 159 TVMKILGDVVLRPKFSKEEINMARQTIQ 186 (239)
Q Consensus 159 ~~l~ll~~~~~~P~f~~~~~~~~k~~~~ 186 (239)
.-++.|...+.+--+++++|++.|+.+.
T Consensus 3 ~~L~~L~~l~~~G~IseeEy~~~k~~ll 30 (31)
T PF09851_consen 3 DRLEKLKELYDKGEISEEEYEQKKARLL 30 (31)
T ss_pred HHHHHHHHHHHcCCCCHHHHHHHHHHHh
Confidence 3466677778888899999999888763
Done!