Query 007713
Match_columns 592
No_of_seqs 158 out of 194
Neff 6.4
Searched_HMMs 46136
Date Thu Mar 28 14:20:08 2013
Command hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/007713.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/007713hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF13320 DUF4091: Domain of un 99.9 1.5E-25 3.2E-30 182.8 7.6 68 476-549 1-68 (68)
2 PF10633 NPCBM_assoc: NPCBM-as 95.4 0.07 1.5E-06 44.5 7.5 32 97-128 45-76 (78)
3 COG1470 Predicted membrane pro 88.4 5 0.00011 44.4 11.7 36 94-129 325-360 (513)
4 PF01229 Glyco_hydro_39: Glyco 87.5 2 4.3E-05 48.4 8.4 108 257-381 83-202 (486)
5 PF06030 DUF916: Bacterial pro 84.8 30 0.00065 31.6 13.2 109 4-129 3-120 (121)
6 COG1470 Predicted membrane pro 84.0 9 0.00019 42.5 10.9 33 97-129 437-469 (513)
7 PF02221 E1_DerP2_DerF2: ML do 74.5 6.5 0.00014 35.6 5.4 36 95-130 85-120 (134)
8 PF15418 DUF4625: Domain of un 73.6 3.9 8.4E-05 38.0 3.7 88 19-131 28-119 (132)
9 PF14352 DUF4402: Domain of un 68.7 5.2 0.00011 36.6 3.4 34 96-129 93-128 (130)
10 PF02449 Glyco_hydro_42: Beta- 67.5 65 0.0014 34.8 12.1 87 437-543 286-373 (374)
11 PF06280 DUF1034: Fn3-like dom 61.0 7.7 0.00017 34.4 2.9 36 94-129 62-100 (112)
12 PF01835 A2M_N: MG2 domain; I 60.8 41 0.00088 28.8 7.4 26 103-128 61-86 (99)
13 PF13731 WxL: WxL domain surfa 59.8 29 0.00062 34.6 7.0 79 49-128 105-210 (215)
14 smart00633 Glyco_10 Glycosyl h 58.6 31 0.00067 35.2 7.2 99 253-381 10-124 (254)
15 KOG1579 Homocysteine S-methylt 51.8 93 0.002 33.1 9.4 125 309-449 131-259 (317)
16 PF09608 Alph_Pro_TM: Putative 51.3 38 0.00082 34.6 6.3 53 94-163 147-199 (236)
17 cd00917 PG-PI_TP The phosphati 50.1 27 0.0006 31.6 4.7 34 95-129 76-109 (122)
18 COG5520 O-Glycosyl hydrolase [ 48.8 1.9E+02 0.0041 31.5 11.1 146 324-487 150-310 (433)
19 PF04234 CopC: CopC domain; I 44.9 37 0.00081 29.4 4.5 25 108-133 61-85 (97)
20 PF00150 Cellulase: Cellulase 43.8 2.2E+02 0.0048 28.4 10.8 102 255-382 59-172 (281)
21 PF12245 Big_3_2: Bacterial Ig 41.6 61 0.0013 25.7 4.9 47 105-167 10-56 (60)
22 TIGR02186 alph_Pro_TM conserve 41.5 35 0.00075 35.4 4.3 50 95-161 173-222 (261)
23 PF09087 Cyc-maltodext_N: Cycl 38.2 1.9E+02 0.004 25.1 7.6 20 106-126 51-70 (88)
24 PRK10301 hypothetical protein; 37.4 60 0.0013 29.7 4.8 25 108-133 88-112 (124)
25 smart00737 ML Domain involved 34.8 71 0.0015 28.3 4.8 35 95-129 71-105 (118)
26 PLN00180 NDF6 (NDH-dependent f 34.2 36 0.00077 32.4 2.8 79 457-552 90-169 (180)
27 PF13204 DUF4038: Protein of u 32.4 4.5E+02 0.0098 27.4 11.1 197 248-489 78-286 (289)
28 PF09099 Qn_am_d_aIII: Quinohe 26.9 62 0.0013 27.6 2.8 21 105-125 49-69 (81)
29 PF08428 Rib: Rib/alpha-like r 26.0 1.4E+02 0.0031 24.0 4.7 33 97-133 20-52 (65)
30 PF00868 Transglut_N: Transglu 25.5 5E+02 0.011 23.4 9.3 29 100-128 89-117 (118)
31 PF05345 He_PIG: Putative Ig d 23.0 1.3E+02 0.0028 22.9 3.7 19 111-129 28-46 (49)
32 PF14734 DUF4469: Domain of un 22.4 89 0.0019 27.8 3.1 23 107-129 65-87 (102)
33 TIGR03769 P_ac_wall_RPT actino 22.3 1E+02 0.0022 22.8 2.8 14 116-129 10-23 (41)
34 PRK09778 putative antitoxin of 20.9 1.9E+02 0.004 25.5 4.6 25 529-553 44-68 (97)
35 PF07495 Y_Y_Y: Y_Y_Y domain; 20.8 1.4E+02 0.0031 23.2 3.7 14 116-129 36-49 (66)
No 1
>PF13320 DUF4091: Domain of unknown function (DUF4091)
Probab=99.92 E-value=1.5e-25 Score=182.76 Aligned_cols=68 Identities=47% Similarity=0.862 Sum_probs=63.6
Q ss_pred cCCCEEEEeecccccCCCCCCccccccCCCCCCceEEEccCccCCCCCCceechhHHHHHHHHHHHHHHHHHHh
Q 007713 476 EGGTGFLYWGANCYEKATVPSAEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQDIEYLNLYAS 549 (592)
Q Consensus 476 ~g~~GfL~W~~n~w~~~~~P~~~~~~~~~~~~GD~~LvYPG~~~~~~~~Pv~SiRle~lreGieDye~L~lL~~ 549 (592)
||++|||||+||+|+++ |+.+++++. |++||++|||||++ .++|++|||||+||+||||||||++|++
T Consensus 1 y~~~G~L~W~~~~w~~d--P~~d~~~~~-~~~GD~~lvYPg~~---~~~p~~SiRle~lr~G~qD~e~l~~l~~ 68 (68)
T PF13320_consen 1 YGFDGFLRWAYNFWNED--PWEDTRFRG-FPAGDGFLVYPGED---TGGPVSSIRLEVLREGIQDYEYLRLLEK 68 (68)
T ss_pred CCCCeEEEecccccccC--cccccCcCc-CCCCCeEEEecCCC---CCCcccCHHHHHHHHHHHHHHHHHHHhC
Confidence 68999999999999887 999999996 99999999999983 3899999999999999999999999985
No 2
>PF10633 NPCBM_assoc: NPCBM-associated, NEW3 domain of alpha-galactosidase; InterPro: IPR018905 This domain has been named NEW3, but its function is not known. It is found on proteins which are bacterial galactosidases [].; PDB: 1EUT_A 2BZD_A 1WCQ_C 2BER_A 1W8O_A 1EUU_A 1W8N_A.
Probab=95.43 E-value=0.07 Score=44.50 Aligned_cols=32 Identities=31% Similarity=0.543 Sum_probs=25.8
Q ss_pred eecCCCeeEEEEEEEcCCCCCCceeEEEEEEE
Q 007713 97 SLIPGETTAVWVSIDAPYAQPPGLYEGEIIIT 128 (592)
Q Consensus 97 ~v~~~~~q~lWv~v~VP~~a~pG~Y~g~i~v~ 128 (592)
.|++|+.+.+=++|.+|+++.||.|+.+++++
T Consensus 45 ~l~pG~s~~~~~~V~vp~~a~~G~y~v~~~a~ 76 (78)
T PF10633_consen 45 SLPPGESVTVTFTVTVPADAAPGTYTVTVTAR 76 (78)
T ss_dssp -B-TTSEEEEEEEEEE-TT--SEEEEEEEEEE
T ss_pred cCCCCCEEEEEEEEECCCCCCCceEEEEEEEE
Confidence 78899999999999999999999999999886
No 3
>COG1470 Predicted membrane protein [Function unknown]
Probab=88.37 E-value=5 Score=44.42 Aligned_cols=36 Identities=28% Similarity=0.458 Sum_probs=34.3
Q ss_pred cceeecCCCeeEEEEEEEcCCCCCCceeEEEEEEEe
Q 007713 94 CQISLIPGETTAVWVSIDAPYAQPPGLYEGEIIITS 129 (592)
Q Consensus 94 ~~~~v~~~~~q~lWv~v~VP~~a~pG~Y~g~i~v~~ 129 (592)
..+.+.+|+...+-++|+.|++|.||.|..+|+++.
T Consensus 325 t~vkL~~gE~kdvtleV~ps~na~pG~Ynv~I~A~s 360 (513)
T COG1470 325 TSVKLKPGEEKDVTLEVYPSLNATPGTYNVTITASS 360 (513)
T ss_pred EEEEecCCCceEEEEEEecCCCCCCCceeEEEEEec
Confidence 579999999999999999999999999999999985
No 4
>PF01229 Glyco_hydro_39: Glycosyl hydrolases family 39; InterPro: IPR000514 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 39 GH39 from CAZY comprises enzymes with several known activities; alpha-L-iduronidase (3.2.1.76 from EC); beta-xylosidase (3.2.1.37 from EC). The most highly conserved regions in these enzymes are located in their N-terminal sections. These contain a glutamic acid residue which, on the basis of similarities with other families of glycosyl hydrolases [], probably acts as the proton donor in their catalytic mechanism.; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds, 0005975 carbohydrate metabolic process; PDB: 2BS9_D 2BFG_E 1W91_B 1UHV_D 1PX8_A.
Probab=87.49 E-value=2 Score=48.36 Aligned_cols=108 Identities=24% Similarity=0.318 Sum_probs=62.1
Q ss_pred HHHHHHHHHHHHhCCCCccccccCCccccccccCCCCCCCCCCcccccCcccccccccccCCCCCchhHHHHHHHHHHHH
Q 007713 257 YEALDQHFKWLLQYRISPFFCRWGESMRVLTYTCPWPADHPKSDEYFSDPRLAAYAVPYSPVLSSNDGAKDYVRKEIELL 336 (592)
Q Consensus 257 f~~ldr~~~~~~~~ris~~f~~wg~~~~i~~y~~pw~~~~~~~~~y~~~~~~~~y~~~~~~~l~g~~~~~~~L~~~~~hl 336 (592)
|+.+|+-++++++++|.|+. . ++++..+-+. +.... |. |.....|. ...+.|.+++++|++|+
T Consensus 83 f~~lD~i~D~l~~~g~~P~v-e-------l~f~p~~~~~-~~~~~-~~------~~~~~~pp-~~~~~W~~lv~~~~~h~ 145 (486)
T PF01229_consen 83 FTYLDQILDFLLENGLKPFV-E-------LGFMPMALAS-GYQTV-FW------YKGNISPP-KDYEKWRDLVRAFARHY 145 (486)
T ss_dssp -HHHHHHHHHHHHCT-EEEE-E-------E-SB-GGGBS-S--EE-TT------TTEE-S-B-S-HHHHHHHHHHHHHHH
T ss_pred hHHHHHHHHHHHHcCCEEEE-E-------EEechhhhcC-CCCcc-cc------ccCCcCCc-ccHHHHHHHHHHHHHHH
Confidence 99999999999999999853 1 1211000000 11001 10 00011111 23358999999999999
Q ss_pred HHc-cc--ccceeeeecCCCCCc---------cchHHHHHHHHHHHHhCCCCeEEEe
Q 007713 337 RTK-AH--WKKAYFYLWDEPLNM---------EHYSSVRNMASELHAYAPDARVLTT 381 (592)
Q Consensus 337 ~~k-g~--~~~~y~~i~DEP~~~---------~~~~~~~~~~~~i~~~~P~~ki~~t 381 (592)
..+ |. ....+|=++.||... +=++.|+..++.||++.|++||---
T Consensus 146 ~~RYG~~ev~~W~fEiWNEPd~~~f~~~~~~~ey~~ly~~~~~~iK~~~p~~~vGGp 202 (486)
T PF01229_consen 146 IDRYGIEEVSTWYFEIWNEPDLKDFWWDGTPEEYFELYDATARAIKAVDPELKVGGP 202 (486)
T ss_dssp HHHHHHHHHTTSEEEESS-TTSTTTSGGG-HHHHHHHHHHHHHHHHHH-TTSEEEEE
T ss_pred HhhcCCccccceeEEeCcCCCcccccCCCCHHHHHHHHHHHHHHHHHhCCCCcccCc
Confidence 753 42 235567689999632 2244678889999999999998654
No 5
>PF06030 DUF916: Bacterial protein of unknown function (DUF916); InterPro: IPR010317 This family consists of putative cell surface proteins, from Firmicutes, of unknown function.
Probab=84.80 E-value=30 Score=31.58 Aligned_cols=109 Identities=17% Similarity=0.226 Sum_probs=69.8
Q ss_pred ccccCCCCCCC-CCCceEEEeecCceeEEEEEEecCcccCCCCCCcceEEEEcc-ccCCCCCccccccceEEEEeeecCC
Q 007713 4 TANVGPQEMPR-PLEPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSD-LCSASGDRLVVGQSLMLRRVVPMLG 81 (592)
Q Consensus 4 ~~~v~p~~~p~-~~~~~~l~aarnE~~sfQi~l~~~~~~~~~~~~~~v~v~~sd-l~~~~G~~~i~~~~i~~~~v~~vpg 81 (592)
..-|.|+.+-. ....+.|.+.-|+...+|+-+.-. +...-.|.|++.+ .++.+|. +.|.+-
T Consensus 3 V~p~~p~~Q~~~~~~YFdL~~~P~q~~~l~v~i~N~-----s~~~~tv~v~~~~A~Tn~nG~------------I~Y~~~ 65 (121)
T PF06030_consen 3 VTPVLPENQIDKNVSYFDLKVKPGQKQTLEVRITNN-----SDKEITVKVSANTATTNDNGV------------IDYSQN 65 (121)
T ss_pred EeecCCccccCCCCCeEEEEeCCCCEEEEEEEEEeC-----CCCCEEEEEEEeeeEecCCEE------------EEECCC
Confidence 34577877654 457899999999999999999653 1222234443221 1122221 222210
Q ss_pred --CCC--ccccCC---CCccceeecCCCeeEEEEEEEcCCCCCCceeEEEEEEEe
Q 007713 82 --VPD--ALVPLD---LPVCQISLIPGETTAVWVSIDAPYAQPPGLYEGEIIITS 129 (592)
Q Consensus 82 --~PD--~L~p~~---~~~~~~~v~~~~~q~lWv~v~VP~~a~pG~Y~g~i~v~~ 129 (592)
-.| +-.++. .....++|+|++.+-+=++|.+|+..-.|+.-|.|.|+.
T Consensus 66 ~~~~d~sl~~~~~~~v~~~~~Vtl~~~~sk~V~~~i~~P~~~f~G~ilGGi~~~e 120 (121)
T PF06030_consen 66 NPKKDKSLKYPFSDLVKIPKEVTLPPNESKTVTFTIKMPKKAFDGIILGGIYFSE 120 (121)
T ss_pred CcccCcccCcchHHhccCCcEEEECCCCEEEEEEEEEcCCCCcCCEEEeeEEEEe
Confidence 111 111111 011349999999999999999999999999999999983
No 6
>COG1470 Predicted membrane protein [Function unknown]
Probab=84.03 E-value=9 Score=42.53 Aligned_cols=33 Identities=36% Similarity=0.457 Sum_probs=30.3
Q ss_pred eecCCCeeEEEEEEEcCCCCCCceeEEEEEEEe
Q 007713 97 SLIPGETTAVWVSIDAPYAQPPGLYEGEIIITS 129 (592)
Q Consensus 97 ~v~~~~~q~lWv~v~VP~~a~pG~Y~g~i~v~~ 129 (592)
.+.||+...+=++|.||++|.+|.|..+|+.++
T Consensus 437 sL~pge~~tV~ltI~vP~~a~aGdY~i~i~~ks 469 (513)
T COG1470 437 SLEPGESKTVSLTITVPEDAGAGDYRITITAKS 469 (513)
T ss_pred ccCCCCcceEEEEEEcCCCCCCCcEEEEEEEee
Confidence 457899999999999999999999999999985
No 7
>PF02221 E1_DerP2_DerF2: ML domain; InterPro: IPR003172 The MD-2-related lipid-recognition (ML) domain is implicated in lipid recognition, particularly in the recognition of pathogen related products. It has an immunoglobulin-like beta-sandwich fold similar to that of E-set Ig domains. This domain is present in the following proteins: Epididymal secretory protein E1 (also known as Niemann-Pick C2 protein), which is known to bind cholesterol. Niemann-Pick disease type C2 is a fatal hereditary disease characterised by accumulation of low-density lipoprotein-derived cholesterol in lysosomes []. House-dust mite allergen proteins such as Der f 2 from Dermatophagoides farinae and Der p 2 from Dermatophagoides pteronyssinus []. ; PDB: 2AG9_B 1G13_B 2AG2_B 2AG4_A 1TJJ_C 1PU5_C 1PUB_A 2AF9_A 3T6Q_D 3M7O_B ....
Probab=74.47 E-value=6.5 Score=35.58 Aligned_cols=36 Identities=25% Similarity=0.342 Sum_probs=33.2
Q ss_pred ceeecCCCeeEEEEEEEcCCCCCCceeEEEEEEEec
Q 007713 95 QISLIPGETTAVWVSIDAPYAQPPGLYEGEIIITSK 130 (592)
Q Consensus 95 ~~~v~~~~~q~lWv~v~VP~~a~pG~Y~g~i~v~~~ 130 (592)
...+.+|+....-+++.||...++|.|+++++++..
T Consensus 85 ~CPi~~G~~~~~~~~~~i~~~~p~~~~~i~~~l~d~ 120 (134)
T PF02221_consen 85 SCPIKAGEYYTYTYTIPIPKIYPPGKYTIQWKLTDQ 120 (134)
T ss_dssp TSTBTTTEEEEEEEEEEESTTSSSEEEEEEEEEEET
T ss_pred cCccCCCcEEEEEEEEEcccceeeEEEEEEEEEEeC
Confidence 577889999999999999999999999999999963
No 8
>PF15418 DUF4625: Domain of unknown function (DUF4625)
Probab=73.55 E-value=3.9 Score=38.02 Aligned_cols=88 Identities=19% Similarity=0.264 Sum_probs=52.4
Q ss_pred eEEEeecCceeEEEEEEecCcccCCCCCCcceEEEEcc-c--cCCCCCccccccceEEEEeeecCCCCCccccCCCCccc
Q 007713 19 INLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSD-L--CSASGDRLVVGQSLMLRRVVPMLGVPDALVPLDLPVCQ 95 (592)
Q Consensus 19 ~~l~aarnE~~sfQi~l~~~~~~~~~~~~~~v~v~~sd-l--~~~~G~~~i~~~~i~~~~v~~vpg~PD~L~p~~~~~~~ 95 (592)
-.-.+-||+...|..-+.+. ..++.++|++-. + -+-++. .+ +...|+.- ...
T Consensus 28 ~~~~~~~G~~ihfe~~i~d~------~~i~si~VeIH~nfd~H~h~~~---~~---------------~~~~~~~~-~~~ 82 (132)
T PF15418_consen 28 NCKVATRGDDIHFEADISDN------SAIKSIKVEIHNNFDHHTHSTE---AG---------------ECEKPWVF-EQD 82 (132)
T ss_pred CCeEEecCCcEEEEEEEEcc------cceeEEEEEEecCcCccccccc---cc---------------ccccCcEE-EEE
Confidence 34566889999999888874 467788887620 0 011000 00 00112110 011
Q ss_pred eeecCC-CeeEEEEEEEcCCCCCCceeEEEEEEEecC
Q 007713 96 ISLIPG-ETTAVWVSIDAPYAQPPGLYEGEIIITSKA 131 (592)
Q Consensus 96 ~~v~~~-~~q~lWv~v~VP~~a~pG~Y~g~i~v~~~~ 131 (592)
+.+..| ...-+=..|.||++++||.|...|+|+.++
T Consensus 83 ~~~~~g~~~~~~h~~i~IPa~a~~G~YH~~i~VtD~~ 119 (132)
T PF15418_consen 83 YDIYGGKKNYDFHEHIDIPADAPAGDYHFMITVTDAA 119 (132)
T ss_pred EcccCCcccEeEEEeeeCCCCCCCcceEEEEEEEECC
Confidence 222222 245677899999999999999999999743
No 9
>PF14352 DUF4402: Domain of unknown function (DUF4402)
Probab=68.68 E-value=5.2 Score=36.55 Aligned_cols=34 Identities=26% Similarity=0.485 Sum_probs=26.7
Q ss_pred eeecCCCeeEEEE--EEEcCCCCCCceeEEEEEEEe
Q 007713 96 ISLIPGETTAVWV--SIDAPYAQPPGLYEGEIIITS 129 (592)
Q Consensus 96 ~~v~~~~~q~lWv--~v~VP~~a~pG~Y~g~i~v~~ 129 (592)
..+..+....+.| +++|+.++++|.|+|+++|+.
T Consensus 93 ~~~~~~g~~~~~VGGtL~v~~~~~~G~YsGt~~VtV 128 (130)
T PF14352_consen 93 TTLDTGGSATFNVGGTLNVPANQAAGTYSGTFTVTV 128 (130)
T ss_pred eEecCCCcEEEEEEEEEEcCCCCCCeEEEEEEEEEE
Confidence 3344555666776 589999999999999999984
No 10
>PF02449 Glyco_hydro_42: Beta-galactosidase; InterPro: IPR013529 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. This group of beta-galactosidase enzymes (3.2.1.23 from EC) belong to the glycosyl hydrolase 42 family GH42 from CAZY. The enzyme catalyses the hydrolysis of terminal, non-reducing terminal beta-D-galactosidase residues.; GO: 0004565 beta-galactosidase activity, 0005975 carbohydrate metabolic process, 0009341 beta-galactosidase complex; PDB: 1KWK_A 1KWG_A 3U7V_A.
Probab=67.47 E-value=65 Score=34.76 Aligned_cols=87 Identities=17% Similarity=0.339 Sum_probs=41.2
Q ss_pred CCCceeEEEe-cCCCCCCCCCcccCCchhHHHHHHHHHHHcCCCEEEEeecccccCCCCCCccccccCCCCCCceEEEcc
Q 007713 437 ENGEEWWTYV-CMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPSAEIRFRRGLPPGDGVLFYP 515 (592)
Q Consensus 437 ~~G~~~W~Y~-C~~p~~~~pN~fi~~p~~~~R~~gw~~~~~g~~GfL~W~~n~w~~~~~P~~~~~~~~~~~~GD~~LvYP 515 (592)
+.|++.|.=- +.++ ..+...-..-.+-+.|...|++..+|.+|.++|.+...... .. .|..+.-
T Consensus 286 ~~~kpf~v~E~~~g~-~~~~~~~~~~~pg~~~~~~~~~~A~Ga~~i~~~~wr~~~~g---~E--~~~~g~~--------- 350 (374)
T PF02449_consen 286 AKGKPFWVMEQQPGP-VNWRPYNRPPRPGELRLWSWQAIAHGADGILFWQWRQSRFG---AE--QFHGGLV--------- 350 (374)
T ss_dssp TTT--EEEEEE--S---SSSSS-----TTHHHHHHHHHHHTT-S-EEEC-SB--SSS---TT--TTS--SB---------
T ss_pred cCCCceEeecCCCCC-CCCccCCCCCCCCHHHHHHHHHHHHhCCeeEeeeccCCCCC---ch--hhhcccC---------
Confidence 4789888652 3322 12322233344568899999999999999999998654222 11 1111111
Q ss_pred CccCCCCCCceechhHHHHHHHHHHHHH
Q 007713 516 GEVFSSSRQPVASLRLERILSGLQDIEY 543 (592)
Q Consensus 516 G~~~~~~~~Pv~SiRle~lreGieDye~ 543 (592)
+ .++..++.|++-+.+-.++++.
T Consensus 351 ~-----~dg~~~~~~~~e~~~~~~~l~~ 373 (374)
T PF02449_consen 351 D-----HDGREPTRRYREVAQLGRELKK 373 (374)
T ss_dssp ------TTS--B-HHHHHHHHHHHHHHT
T ss_pred C-----ccCCCCCcHHHHHHHHHHHHhc
Confidence 1 2335677888887777666553
No 11
>PF06280 DUF1034: Fn3-like domain (DUF1034); InterPro: IPR010435 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This domain of unknown function is present in bacterial and plant peptidases belonging to MEROPS peptidase family S8 (subfamily S8A subtilisin, clan SB). It is C-terminal to and adjacent to the S8 peptidase domain and can be found in conjunction with the PA (Protease associated) domain (IPR003137 from INTERPRO) and additionally in Gram-positive bacteria with the surface protein anchor domain (IPR001899 from INTERPRO).; GO: 0004252 serine-type endopeptidase activity, 0005618 cell wall, 0016020 membrane; PDB: 3EIF_A 1XF1_B.
Probab=61.04 E-value=7.7 Score=34.44 Aligned_cols=36 Identities=28% Similarity=0.426 Sum_probs=30.7
Q ss_pred cceeecCCCeeEEEEEEEcCCCCCC---ceeEEEEEEEe
Q 007713 94 CQISLIPGETTAVWVSIDAPYAQPP---GLYEGEIIITS 129 (592)
Q Consensus 94 ~~~~v~~~~~q~lWv~v~VP~~a~p---G~Y~g~i~v~~ 129 (592)
..++|+||+.+.+=|+|++|++..+ ..|.|-|.++.
T Consensus 62 ~~vTV~ag~s~~v~vti~~p~~~~~~~~~~~eG~I~~~~ 100 (112)
T PF06280_consen 62 DTVTVPAGQSKTVTVTITPPSGLDASNGPFYEGFITFKS 100 (112)
T ss_dssp EEEEE-TTEEEEEEEEEE--GGGHHTT-EEEEEEEEEES
T ss_pred CeEEECCCCEEEEEEEEEehhcCCcccCCEEEEEEEEEc
Confidence 5799999999999999999998886 99999999995
No 12
>PF01835 A2M_N: MG2 domain; InterPro: IPR002890 The proteinase-binding alpha-macroglobulins (A2M) [] are large glycoproteins found in the plasma of vertebrates, in the hemolymph of some invertebrates and in reptilian and avian egg white. A2M-like proteins are able to inhibit all four classes of proteinases by a 'trapping' mechanism. They have a peptide stretch, called the 'bait region', which contains specific cleavage sites for different proteinases. When a proteinase cleaves the bait region, a conformational change is induced in the protein, thus trapping the proteinase. The entrapped enzyme remains active against low molecular weight substrates, whilst its activity toward larger substrates is greatly reduced, due to steric hindrance. Following cleavage in the bait region, a thiol ester bond, formed between the side chains of a cysteine and a glutamine, is cleaved and mediates the covalent binding of the A2M-like protein to the proteinase. This family includes the N-terminal region of the alpha-2-macroglobulin family. The inhibitor domains belong to MEROPS inhibitor family I39.; GO: 0004866 endopeptidase inhibitor activity; PDB: 2B39_B 3KLS_B 3PRX_C 3KM9_B 3PVM_C 3CU7_A 4E0S_A 4A5W_A 4ACQ_C 2P9R_B ....
Probab=60.84 E-value=41 Score=28.77 Aligned_cols=26 Identities=19% Similarity=0.180 Sum_probs=17.2
Q ss_pred eeEEEEEEEcCCCCCCceeEEEEEEE
Q 007713 103 TTAVWVSIDAPYAQPPGLYEGEIIIT 128 (592)
Q Consensus 103 ~q~lWv~v~VP~~a~pG~Y~g~i~v~ 128 (592)
.-.+-.++.+|+++..|.|+.++...
T Consensus 61 ~G~~~~~~~lp~~~~~G~y~i~~~~~ 86 (99)
T PF01835_consen 61 NGIFSGSFQLPDDAPLGTYTIRVKTD 86 (99)
T ss_dssp TTEEEEEEE--SS---EEEEEEEEET
T ss_pred CCEEEEEEECCCCCCCEeEEEEEEEc
Confidence 33566789999999999999999885
No 13
>PF13731 WxL: WxL domain surface cell wall-binding
Probab=59.80 E-value=29 Score=34.63 Aligned_cols=79 Identities=24% Similarity=0.390 Sum_probs=45.5
Q ss_pred ceEEEEccccCCCCCccccccceEEEEeeecC---C--CCCc------cccCCCCccceeecCCCeeEEE----------
Q 007713 49 VVQVQCSDLCSASGDRLVVGQSLMLRRVVPML---G--VPDA------LVPLDLPVCQISLIPGETTAVW---------- 107 (592)
Q Consensus 49 ~v~v~~sdl~~~~G~~~i~~~~i~~~~v~~vp---g--~PD~------L~p~~~~~~~~~v~~~~~q~lW---------- 107 (592)
.|+|+.++|++.+|. .+.+..+.+....... . -|-. |.+-......++-..++-+..|
T Consensus 105 ~L~v~~s~F~~~~~~-~L~ga~l~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~v~~A~~~~g~G~~~~~~~~~~~~ 183 (215)
T PF13731_consen 105 TLTVKLSPFTNADGD-TLPGATLTFNNGKVQSTANNTNTPTTVSSNITLTPGGQAQTVMSAAKGQGQGTWSYSFGDQDAT 183 (215)
T ss_pred EEEEEeccccccCCc-CcccceEEecCceeEeecccccCCcccccceEeccCCcceeeEeecccccceEEEEEeCCcccc
Confidence 477788888888755 4556556554433221 0 0111 1111111111223345555555
Q ss_pred ----EEEEcCCCCC--CceeEEEEEEE
Q 007713 108 ----VSIDAPYAQP--PGLYEGEIIIT 128 (592)
Q Consensus 108 ----v~v~VP~~a~--pG~Y~g~i~v~ 128 (592)
|.+.||..+. +|.|+++|+=+
T Consensus 184 ~~~~v~L~VP~~~~~~ag~Yt~tlTWt 210 (215)
T PF13731_consen 184 ADTGVSLSVPANTAKQAGTYTATLTWT 210 (215)
T ss_pred cccceEEEeCCCCcccCCcEEEEEEEE
Confidence 8999999998 69999999976
No 14
>smart00633 Glyco_10 Glycosyl hydrolase family 10.
Probab=58.60 E-value=31 Score=35.18 Aligned_cols=99 Identities=12% Similarity=0.166 Sum_probs=58.9
Q ss_pred ChhH-HHHHHHHHHHHHhCCCCc--cccccCCccccccccCCCCCCCCCCcccccCcccccccccccCCCCCchhHHHHH
Q 007713 253 SDEW-YEALDQHFKWLLQYRISP--FFCRWGESMRVLTYTCPWPADHPKSDEYFSDPRLAAYAVPYSPVLSSNDGAKDYV 329 (592)
Q Consensus 253 ~~~~-f~~ldr~~~~~~~~ris~--~f~~wg~~~~i~~y~~pw~~~~~~~~~y~~~~~~~~y~~~~~~~l~g~~~~~~~L 329 (592)
.+.| |+.+|+.++|+.+++|.- +..-|+.. ...| .. ..+ .++ -.+.+.+|+
T Consensus 10 ~G~~n~~~~D~~~~~a~~~gi~v~gH~l~W~~~------~P~W-~~--------~~~-~~~----------~~~~~~~~i 63 (254)
T smart00633 10 RGQFNFSGADAIVNFAKENGIKVRGHTLVWHSQ------TPDW-VF--------NLS-KET----------LLARLENHI 63 (254)
T ss_pred CCccChHHHHHHHHHHHHCCCEEEEEEEeeccc------CCHh-hh--------cCC-HHH----------HHHHHHHHH
Confidence 3444 899999999999999972 11234331 1123 00 000 000 012456677
Q ss_pred HHHHHHHHHcccccceeeeecCCCCCcc-------c----h--HHHHHHHHHHHHhCCCCeEEEe
Q 007713 330 RKEIELLRTKAHWKKAYFYLWDEPLNME-------H----Y--SSVRNMASELHAYAPDARVLTT 381 (592)
Q Consensus 330 ~~~~~hl~~kg~~~~~y~~i~DEP~~~~-------~----~--~~~~~~~~~i~~~~P~~ki~~t 381 (592)
.+.++|.+.+ -..+ -+..||.+.. . + +.++.+.+.+|++.|++|++..
T Consensus 64 ~~v~~ry~g~---i~~w-dV~NE~~~~~~~~~~~~~w~~~~G~~~i~~af~~ar~~~P~a~l~~N 124 (254)
T smart00633 64 KTVVGRYKGK---IYAW-DVVNEALHDNGSGLRRSVWYQILGEDYIEKAFRYAREADPDAKLFYN 124 (254)
T ss_pred HHHHHHhCCc---ceEE-EEeeecccCCCcccccchHHHhcChHHHHHHHHHHHHhCCCCEEEEe
Confidence 7777776644 1122 3678885321 1 2 6678999999999999999875
No 15
>KOG1579 consensus Homocysteine S-methyltransferase [Amino acid transport and metabolism]
Probab=51.75 E-value=93 Score=33.14 Aligned_cols=125 Identities=13% Similarity=0.155 Sum_probs=76.5
Q ss_pred ccccccccCCCCCchhHHHHHHHHHHHHHHcccccceeeeecCCCCCccchHHHHHHHHHHHHhCCCCeEEEeeccCCCC
Q 007713 309 AAYAVPYSPVLSSNDGAKDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSD 388 (592)
Q Consensus 309 ~~y~~~~~~~l~g~~~~~~~L~~~~~hl~~kg~~~~~y~~i~DEP~~~~~~~~~~~~~~~i~~~~P~~ki~~t~~~~p~~ 388 (592)
.+|+..|+.-.+.. .+++|.+.-++-+-++|.-.-++=.| | +...-+++.+++.+..|+.++..+++|.++.
T Consensus 131 ~eytg~Y~~~~~~~-el~~~~k~qle~~~~~gvD~L~fETi---p----~~~EA~a~l~~l~~~~~~~p~~is~t~~d~g 202 (317)
T KOG1579|consen 131 SEYTGIYGDNVEFE-ELYDFFKQQLEVFLEAGVDLLAFETI---P----NVAEAKAALELLQELGPSKPFWISFTIKDEG 202 (317)
T ss_pred cccccccccccCHH-HHHHHHHHHHHHHHhCCCCEEEEeec---C----CHHHHHHHHHHHHhcCCCCcEEEEEEecCCC
Confidence 56777776665554 47888888899998998333333344 4 3334578889999999999999999988754
Q ss_pred CCCCCCCcccccc---cc-cccCCccccccccccccCCchhhhHHhHhhcccCCCceeEEEecCC
Q 007713 389 APLGPTPFESFVK---VP-KFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEEWWTYVCMG 449 (592)
Q Consensus 389 ~~~~~~~~e~~~~---vp-~~~~~~idi~c~~~wv~~~~~~~~~~~~~~~~~~~G~~~W~Y~C~~ 449 (592)
.......++.++. .. ++ ..|.++|.+. ...+....++.+ .-....+-.|...+
T Consensus 203 ~l~~G~t~e~~~~~~~~~~~~--~~IGvNC~~~---~~~~~~~~~L~~---~~~~~~llvYPNsG 259 (317)
T KOG1579|consen 203 RLRSGETGEEAAQLLKDGINL--LGIGVNCVSP---NFVEPLLKELMA---KLTKIPLLVYPNSG 259 (317)
T ss_pred cccCCCcHHHHHHHhccCCce--EEEEeccCCc---hhccHHHHHHhh---ccCCCeEEEecCCC
Confidence 4444445555432 11 11 2477888743 222333333331 23455666665543
No 16
>PF09608 Alph_Pro_TM: Putative transmembrane protein (Alph_Pro_TM); InterPro: IPR019088 This entry consists of predicted transmembrane proteins of about 270 amino acids. They are found predominantly, though not exclusively, in alphaproteobacteria, generally only once in each genome.
Probab=51.28 E-value=38 Score=34.64 Aligned_cols=53 Identities=19% Similarity=0.224 Sum_probs=38.5
Q ss_pred cceeecCCCeeEEEEEEEcCCCCCCceeEEEEEEEecCCccchhhccccccccccccceeeeeeeccCCC
Q 007713 94 CQISLIPGETTAVWVSIDAPYAQPPGLYEGEIIITSKADTELSSQCLGKGEKHRLFMELRNCLDNVEPIE 163 (592)
Q Consensus 94 ~~~~v~~~~~q~lWv~v~VP~~a~pG~Y~g~i~v~~~~~g~~~~~~~~~~~~~~l~~~l~~~l~V~~~~l 163 (592)
+.+.+..+ +-+..+|.+|++.++|.|+.++-+.. ||.++++. ...|+|.-.=+
T Consensus 147 ~~V~~~~~--~lFra~i~LPanvp~G~Y~v~v~l~r--dG~vv~~~-------------~~~l~V~KvG~ 199 (236)
T PF09608_consen 147 GGVQFLEG--TLFRARIPLPANVPPGDYTVRVYLFR--DGQVVASQ-------------ETPLRVRKVGF 199 (236)
T ss_pred CeEEEcCC--CeEEEEeEcCCCCCcceEEEEEEEEE--CCEEEEEE-------------eeEEEEEEccH
Confidence 34665444 47889999999999999999999984 67655433 66667755433
No 17
>cd00917 PG-PI_TP The phosphatidylinositol/phosphatidylglycerol transfer protein (PG/PI-TP) has been shown to bind phosphatidylglycerol and phosphatidylinositol, but the biological significance of this is still obscure. These proteins belong to the ML domain family.
Probab=50.15 E-value=27 Score=31.59 Aligned_cols=34 Identities=24% Similarity=0.365 Sum_probs=30.5
Q ss_pred ceeecCCCeeEEEEEEEcCCCCCCceeEEEEEEEe
Q 007713 95 QISLIPGETTAVWVSIDAPYAQPPGLYEGEIIITS 129 (592)
Q Consensus 95 ~~~v~~~~~q~lWv~v~VP~~a~pG~Y~g~i~v~~ 129 (592)
...+.+|+.. +=.++.||...++|.|+++.++..
T Consensus 76 ~CPi~~G~~~-~~~~~~ip~~~P~g~y~v~~~l~d 109 (122)
T cd00917 76 SCPIEPGDKF-LTKLVDLPGEIPPGKYTVSARAYT 109 (122)
T ss_pred cCCcCCCcEE-EEEEeeCCCCCCCceEEEEEEEEC
Confidence 5778899977 888899999999999999999995
No 18
>COG5520 O-Glycosyl hydrolase [Cell envelope biogenesis, outer membrane]
Probab=48.83 E-value=1.9e+02 Score=31.53 Aligned_cols=146 Identities=16% Similarity=0.281 Sum_probs=77.1
Q ss_pred hHHHHHHHHHHHHHHcccccceeeeecCCCCCccchHH----HHHHHHHHHHh----CCCCeEEEeeccCCCCCCCCCCC
Q 007713 324 GAKDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSS----VRNMASELHAY----APDARVLTTYYCGPSDAPLGPTP 395 (592)
Q Consensus 324 ~~~~~L~~~~~hl~~kg~~~~~y~~i~DEP~~~~~~~~----~~~~~~~i~~~----~P~~ki~~t~~~~p~~~~~~~~~ 395 (592)
.+-+||..|+..++..|.-..+ +.+-.||.-...++. ..+..++++++ ...+||+.- ++
T Consensus 150 ~yA~~l~~fv~~m~~nGvnlya-lSVQNEPd~~p~~d~~~wtpQe~~rF~~qyl~si~~~~rV~~p------------es 216 (433)
T COG5520 150 DYADYLNDFVLEMKNNGVNLYA-LSVQNEPDYAPTYDWCWWTPQEELRFMRQYLASINAEMRVIIP------------ES 216 (433)
T ss_pred HHHHHHHHHHHHHHhCCCceeE-EeeccCCcccCCCCcccccHHHHHHHHHHhhhhhccccEEecc------------hh
Confidence 3678999999999999976654 356799964433433 23445555555 334566652 12
Q ss_pred cccccc--cccccCCc----cccccccccccCCchhhhHHhHhhcccCCCceeEEEecCCCCCCCCCcccCCchhHHH-H
Q 007713 396 FESFVK--VPKFLRPH----TQIYCTSEWVLGNREDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHR-A 468 (592)
Q Consensus 396 ~e~~~~--vp~~~~~~----idi~c~~~wv~~~~~~~~~~~~~~~~~~~G~~~W~Y~C~~p~~~~pN~fi~~p~~~~R-~ 468 (592)
+....+ -|.+-+|. ++|. .-+|-.++-.++..-+. + +...||.+|+=-|..+. .=||.-.- ..+..- =
T Consensus 217 ~~~~~~~~dp~lnDp~a~a~~~il-g~H~Ygg~v~~~p~~la-k-~~~~gKdlwmte~y~~e-sd~~s~dr-~~~~~~~h 291 (433)
T COG5520 217 FKDLPNMSDPILNDPKALANMDIL-GTHLYGGQVSDQPYPLA-K-QKPAGKDLWMTECYPPE-SDPNSADR-EALHVALH 291 (433)
T ss_pred cccccccccccccCHhHhccccee-EeeecccccccchhhHh-h-CCCcCCceEEeecccCC-CCCCcchH-HHHHHHHH
Confidence 111111 12222222 2322 22344444444443332 2 33569999987887553 23332111 111111 1
Q ss_pred HHHHHHHcCCCEEEEeecc
Q 007713 469 VMWRVWKEGGTGFLYWGAN 487 (592)
Q Consensus 469 ~gw~~~~~g~~GfL~W~~n 487 (592)
+.--..+-|+.||+.|..-
T Consensus 292 i~~gm~~gg~~ayv~W~i~ 310 (433)
T COG5520 292 IHIGMTEGGFQAYVWWNIR 310 (433)
T ss_pred HHhhccccCccEEEEEEEe
Confidence 3445567789999999764
No 19
>PF04234 CopC: CopC domain; InterPro: IPR007348 CopC is a bacterial blue copper protein that binds 1 atom of copper per protein molecule. Along with CopA, CopC mediates copper resistance by sequestration of copper in the periplasm [].; GO: 0005507 copper ion binding, 0046688 response to copper ion, 0042597 periplasmic space; PDB: 1IX2_B 1LYQ_A 2C9P_C 2C9R_A 2C9Q_A 1M42_A 1OT4_A 1NM4_A.
Probab=44.86 E-value=37 Score=29.38 Aligned_cols=25 Identities=32% Similarity=0.533 Sum_probs=19.0
Q ss_pred EEEEcCCCCCCceeEEEEEEEecCCc
Q 007713 108 VSIDAPYAQPPGLYEGEIIITSKADT 133 (592)
Q Consensus 108 v~v~VP~~a~pG~Y~g~i~v~~~~~g 133 (592)
+.+.+|..-++|.|+..-+|.+ +||
T Consensus 61 ~~~~l~~~l~~G~YtV~wrvvs-~DG 85 (97)
T PF04234_consen 61 LTVPLPPPLPPGTYTVSWRVVS-ADG 85 (97)
T ss_dssp EEEEESS---SEEEEEEEEEEE-TTS
T ss_pred EEEECCCCCCCceEEEEEEEEe-cCC
Confidence 5788899999999999999986 566
No 20
>PF00150 Cellulase: Cellulase (glycosyl hydrolase family 5); InterPro: IPR001547 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 5 GH5 from CAZY comprises enzymes with several known activities; endoglucanase (3.2.1.4 from EC); beta-mannanase (3.2.1.78 from EC); exo-1,3-glucanase (3.2.1.58 from EC); endo-1,6-glucanase (3.2.1.75 from EC); xylanase (3.2.1.8 from EC); endoglycoceramidase (3.2.1.123 from EC). The microbial degradation of cellulose and xylans requires several types of enzymes. Fungi and bacteria produces a spectrum of cellulolytic enzymes (cellulases) and xylanases which, on the basis of sequence similarities, can be classified into families. One of these families is known as the cellulase family A [] or as the glycosyl hydrolases family 5 []. One of the conserved regions in this family contains a conserved glutamic acid residue which is potentially involved [] in the catalytic mechanism.; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds, 0005975 carbohydrate metabolic process; PDB: 3NDY_A 3NDZ_B 1LF1_A 1TVP_B 1TVN_A 3AYR_A 3AYS_A 1QI0_A 1W3K_A 1OCQ_A ....
Probab=43.82 E-value=2.2e+02 Score=28.42 Aligned_cols=102 Identities=13% Similarity=0.176 Sum_probs=58.9
Q ss_pred hHHHHHHHHHHHHHhCCCCccccccCCccccccccCCCCCCCCCCcccccCcccccccccccCCCCCchhHHHHHHHHHH
Q 007713 255 EWYEALDQHFKWLLQYRISPFFCRWGESMRVLTYTCPWPADHPKSDEYFSDPRLAAYAVPYSPVLSSNDGAKDYVRKEIE 334 (592)
Q Consensus 255 ~~f~~ldr~~~~~~~~ris~~f~~wg~~~~i~~y~~pw~~~~~~~~~y~~~~~~~~y~~~~~~~l~g~~~~~~~L~~~~~ 334 (592)
.+++.||+-++++.+++|.-+++-.+. ..|... ... +.. .. ...+....+++.++.
T Consensus 59 ~~~~~ld~~v~~a~~~gi~vild~h~~--------~~w~~~-~~~---~~~-----~~-------~~~~~~~~~~~~la~ 114 (281)
T PF00150_consen 59 TYLARLDRIVDAAQAYGIYVILDLHNA--------PGWANG-GDG---YGN-----ND-------TAQAWFKSFWRALAK 114 (281)
T ss_dssp HHHHHHHHHHHHHHHTT-EEEEEEEES--------TTCSSS-TST---TTT-----HH-------HHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHhCCCeEEEEeccC--------cccccc-ccc---ccc-----ch-------hhHHHHHhhhhhhcc
Confidence 458999999999999999744321110 113000 000 000 00 001134556677777
Q ss_pred HHHHcccccceeeeecCCCCCccc------------hHHHHHHHHHHHHhCCCCeEEEee
Q 007713 335 LLRTKAHWKKAYFYLWDEPLNMEH------------YSSVRNMASELHAYAPDARVLTTY 382 (592)
Q Consensus 335 hl~~kg~~~~~y~~i~DEP~~~~~------------~~~~~~~~~~i~~~~P~~ki~~t~ 382 (592)
+++.. -..+.+-|+.||..... .+.++++.+.||+..|+..|+...
T Consensus 115 ~y~~~--~~v~~~el~NEP~~~~~~~~w~~~~~~~~~~~~~~~~~~Ir~~~~~~~i~~~~ 172 (281)
T PF00150_consen 115 RYKDN--PPVVGWELWNEPNGGNDDANWNAQNPADWQDWYQRAIDAIRAADPNHLIIVGG 172 (281)
T ss_dssp HHTTT--TTTEEEESSSSGCSTTSTTTTSHHHTHHHHHHHHHHHHHHHHTTSSSEEEEEE
T ss_pred ccCCC--CcEEEEEecCCccccCCccccccccchhhhhHHHHHHHHHHhcCCcceeecCC
Confidence 77533 23556678999963211 356788999999999998877654
No 21
>PF12245 Big_3_2: Bacterial Ig-like domain (group 3); InterPro: IPR022038 This family of proteins is found in bacteria. They have two conserved sequence motifs: AGN and GMT.
Probab=41.63 E-value=61 Score=25.65 Aligned_cols=47 Identities=21% Similarity=0.274 Sum_probs=34.0
Q ss_pred EEEEEEEcCCCCCCceeEEEEEEEecCCccchhhccccccccccccceeeeeeeccCCCCCCh
Q 007713 105 AVWVSIDAPYAQPPGLYEGEIIITSKADTELSSQCLGKGEKHRLFMELRNCLDNVEPIEGKPL 167 (592)
Q Consensus 105 ~lWv~v~VP~~a~pG~Y~g~i~v~~~~~g~~~~~~~~~~~~~~l~~~l~~~l~V~~~~lp~~~ 167 (592)
..|.. -+|.+...|.|+.+|+++.+++.. .+.....-+++...|.|.
T Consensus 10 ~~~~~-~~P~~~~dg~yt~~v~a~D~AGN~---------------~~~~~~~~i~d~~~p~pt 56 (60)
T PF12245_consen 10 GVWST-VIPENDADGEYTLTVTATDKAGNT---------------SSSTTQIVIVDNTAPAPT 56 (60)
T ss_pred cceec-cccCccCCccEEEEEEEEECCCCE---------------EEeeeEEEEEcCCCCCcc
Confidence 34443 469998999999999999776442 344667778888878664
No 22
>TIGR02186 alph_Pro_TM conserved hypothetical protein. This family consists of predicted transmembrane proteins of about 270 amino acids. Members are found, so far, only among the Alphaproteobacteria and only once in each genome.
Probab=41.48 E-value=35 Score=35.42 Aligned_cols=50 Identities=14% Similarity=0.185 Sum_probs=36.5
Q ss_pred ceeecCCCeeEEEEEEEcCCCCCCceeEEEEEEEecCCccchhhccccccccccccceeeeeeeccC
Q 007713 95 QISLIPGETTAVWVSIDAPYAQPPGLYEGEIIITSKADTELSSQCLGKGEKHRLFMELRNCLDNVEP 161 (592)
Q Consensus 95 ~~~v~~~~~q~lWv~v~VP~~a~pG~Y~g~i~v~~~~~g~~~~~~~~~~~~~~l~~~l~~~l~V~~~ 161 (592)
.+++..+. =+--+|.+|++.+.|.|+.++-+.. +|.++++. ...|+|.-.
T Consensus 173 gV~~~~~~--LFra~i~LPAnvp~G~Y~v~v~L~r--~G~vv~~~-------------~t~l~V~Kv 222 (261)
T TIGR02186 173 GVTFISPT--LFRATLRLPANVPNGTHEVRAYLFR--GGVFIART-------------ELALEIVKT 222 (261)
T ss_pred eEEEcCCc--eEEEeeecCCCCCCceEEEEEEEEe--CCEEEEEE-------------EeEEEEEEe
Confidence 35554443 3678999999999999999999984 67655543 667777653
No 23
>PF09087 Cyc-maltodext_N: Cyclomaltodextrinase, N-terminal; InterPro: IPR015171 This domain is found at the N terminus of cyclomaltodextrinase. The domain assumes a beta-sandwich structure composed of the eight antiparallel beta-strands. A ten residue linker is also present at the C-terminal end, which connects the N-terminal domain to a distal domain in the protein. This domain participates in oligomerisation of the protein, wherein the N-terminal domain of one subunit contacts the active centre of the other subunit, and is also required for binding of cyclodextrin to substrate []. ; PDB: 3EDK_B 3EDD_A 3EDJ_B 3EDE_A 1H3G_B 3EDF_B.
Probab=38.20 E-value=1.9e+02 Score=25.10 Aligned_cols=20 Identities=20% Similarity=0.436 Sum_probs=13.1
Q ss_pred EEEEEEcCCCCCCceeEEEEE
Q 007713 106 VWVSIDAPYAQPPGLYEGEII 126 (592)
Q Consensus 106 lWv~v~VP~~a~pG~Y~g~i~ 126 (592)
|-|+++|- +|+||+++..++
T Consensus 51 LFv~L~i~-~akpg~~~i~~~ 70 (88)
T PF09087_consen 51 LFVYLDIS-DAKPGTFTINFK 70 (88)
T ss_dssp EEEEEEE--T--SEEEEEEEE
T ss_pred EEEEEecC-CCCCcEEEEEEE
Confidence 56777777 999999887666
No 24
>PRK10301 hypothetical protein; Provisional
Probab=37.36 E-value=60 Score=29.73 Aligned_cols=25 Identities=20% Similarity=0.303 Sum_probs=20.9
Q ss_pred EEEEcCCCCCCceeEEEEEEEecCCc
Q 007713 108 VSIDAPYAQPPGLYEGEIIITSKADT 133 (592)
Q Consensus 108 v~v~VP~~a~pG~Y~g~i~v~~~~~g 133 (592)
+.+.+|..-++|+|+.+-+|.+ +||
T Consensus 88 ~~v~l~~~L~~G~YtV~Wrvvs-~DG 112 (124)
T PRK10301 88 LIVPLADSLKPGTYTVDWHVVS-VDG 112 (124)
T ss_pred EEEECCCCCCCccEEEEEEEEe-cCC
Confidence 4777888889999999999986 466
No 25
>smart00737 ML Domain involved in innate immunity and lipid metabolism. ML (MD-2-related lipid-recognition) is a novel domain identified in MD-1, MD-2, GM2A, Npc2 and multiple proteins of unknown function in plants, animals and fungi. These single-domain proteins were predicted to form a beta-rich fold containing multiple strands, and to mediate diverse biological functions through interacting with specific lipids.
Probab=34.81 E-value=71 Score=28.31 Aligned_cols=35 Identities=29% Similarity=0.361 Sum_probs=29.7
Q ss_pred ceeecCCCeeEEEEEEEcCCCCCCceeEEEEEEEe
Q 007713 95 QISLIPGETTAVWVSIDAPYAQPPGLYEGEIIITS 129 (592)
Q Consensus 95 ~~~v~~~~~q~lWv~v~VP~~a~pG~Y~g~i~v~~ 129 (592)
...+.+|+..-.=.++.||...++|.|+++++++.
T Consensus 71 ~CPl~~G~~~~~~~~~~v~~~~P~~~~~v~~~l~d 105 (118)
T smart00737 71 KCPIEKGETVNYTNSLTVPGIFPPGKYTVKWELTD 105 (118)
T ss_pred CCCCCCCeeEEEEEeeEccccCCCeEEEEEEEEEc
Confidence 46778898765667789999999999999999995
No 26
>PLN00180 NDF6 (NDH-dependent flow 6); Provisional
Probab=34.22 E-value=36 Score=32.41 Aligned_cols=79 Identities=19% Similarity=0.335 Sum_probs=51.0
Q ss_pred cccCCchhHHHHHHHHHHHcCCCEEEEeecccccCCCCCCccc-cccCCCCCCceEEEccCccCCCCCCceechhHHHHH
Q 007713 457 WHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPSAEI-RFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERIL 535 (592)
Q Consensus 457 ~fi~~p~~~~R~~gw~~~~~g~~GfL~W~~n~w~~~~~P~~~~-~~~~~~~~GD~~LvYPG~~~~~~~~Pv~SiRle~lr 535 (592)
|++....+.|-....+.+ --||..|+.+..|||-|. .|+++-+.|-+.-||--. ....+|-|-|+||
T Consensus 90 wHLSD~aiKnVYtfY~mF-------T~WG~~fFgSmKDPfYDSe~YRgdGGDGT~hW~Yd~Q-----Ed~E~sAReeL~R 157 (180)
T PLN00180 90 WHLSDAAIKNVYTFYIMF-------TCWGCLFFGSMKDPFYDSEEYRGDGGDGTGHWVYERQ-----EDIEESARAELWR 157 (180)
T ss_pred hhccHHHHhHHHHHHHHH-------HHHHHhheeccCCcccchHHhcccCCCCceeeEeehH-----HHHHHHHHHHHHH
Confidence 455555566554333332 357777776666798765 455543566667788764 3378999999999
Q ss_pred HHHHHHHHHHHHHhhcC
Q 007713 536 SGLQDIEYLNLYASRYG 552 (592)
Q Consensus 536 eGieDye~L~lL~~~~~ 552 (592)
| |+|..++++.|
T Consensus 158 E-----ELiEEIEQkVG 169 (180)
T PLN00180 158 E-----ELIEEIEQKVG 169 (180)
T ss_pred H-----HHHHHHHHHhh
Confidence 8 55666666654
No 27
>PF13204 DUF4038: Protein of unknown function (DUF4038); PDB: 3KZS_D.
Probab=32.40 E-value=4.5e+02 Score=27.43 Aligned_cols=197 Identities=14% Similarity=0.149 Sum_probs=86.7
Q ss_pred ccccCChhHHHHHHHHHHHHHhCCCCc-cccccCCccccccccCCCCCCCCCCcccccCcccccccccccCCCCCchhHH
Q 007713 248 GVRHGSDEWYEALDQHFKWLLQYRISP-FFCRWGESMRVLTYTCPWPADHPKSDEYFSDPRLAAYAVPYSPVLSSNDGAK 326 (592)
Q Consensus 248 ~v~~~~~~~f~~ldr~~~~~~~~ris~-~f~~wg~~~~i~~y~~pw~~~~~~~~~y~~~~~~~~y~~~~~~~l~g~~~~~ 326 (592)
.+..-...||+.+|+-++.+.+++|-. +..-||.+. . -.-| +.. +-+=+.+..+
T Consensus 78 d~~~~N~~YF~~~d~~i~~a~~~Gi~~~lv~~wg~~~---~-~~~W----g~~-----------------~~~m~~e~~~ 132 (289)
T PF13204_consen 78 DFTRPNPAYFDHLDRRIEKANELGIEAALVPFWGCPY---V-PGTW----GFG-----------------PNIMPPENAE 132 (289)
T ss_dssp --TT----HHHHHHHHHHHHHHTT-EEEEESS-HHHH---H-----------------------------TTSS-HHHHH
T ss_pred CCCCCCHHHHHHHHHHHHHHHHCCCeEEEEEEECCcc---c-cccc----ccc-----------------ccCCCHHHHH
Confidence 334445678999999999999999985 333443221 0 0012 100 0111233567
Q ss_pred HHHHHHHHHHHHcccccceeeeecCCCCCccchHHHHHHHHHHHHhCCCCeEEEee-ccCCCCCCCCCCCcccccccccc
Q 007713 327 DYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTY-YCGPSDAPLGPTPFESFVKVPKF 405 (592)
Q Consensus 327 ~~L~~~~~hl~~kg~~~~~y~~i~DEP~~~~~~~~~~~~~~~i~~~~P~~ki~~t~-~~~p~~~~~~~~~~e~~~~vp~~ 405 (592)
.|++=+++.+++.- .-+++..-|.-......+.++++++.|++.+|.- +.|+ +|+. .+..+.|.+.
T Consensus 133 ~Y~~yv~~Ry~~~~--NviW~l~gd~~~~~~~~~~w~~~~~~i~~~dp~~--L~T~H~~~~------~~~~~~~~~~--- 199 (289)
T PF13204_consen 133 RYGRYVVARYGAYP--NVIWILGGDYFDTEKTRADWDAMARGIKENDPYQ--LITIHPCGR------TSSPDWFHDE--- 199 (289)
T ss_dssp HHHHHHHHHHTT-S--SEEEEEESSS--TTSSHHHHHHHHHHHHHH--SS---EEEEE-BT------EBTHHHHTT----
T ss_pred HHHHHHHHHHhcCC--CCEEEecCccCCCCcCHHHHHHHHHHHHhhCCCC--cEEEeCCCC------CCcchhhcCC---
Confidence 77777777776651 2234444555123567788899999999999965 4433 3221 0111112211
Q ss_pred cCCccccccccccccCCch--hhhHHhH---hhcccCCCceeEEEecCCCCCCCCCccc----CCchhHHHHHHHHHHHc
Q 007713 406 LRPHTQIYCTSEWVLGNRE--DLVKDIV---TELQPENGEEWWTYVCMGPSDPHPNWHL----GMRGSQHRAVMWRVWKE 476 (592)
Q Consensus 406 ~~~~idi~c~~~wv~~~~~--~~~~~~~---~~~~~~~G~~~W~Y~C~~p~~~~pN~fi----~~p~~~~R~~gw~~~~~ 476 (592)
+.+|+++.- .|+.. ...-..+ ...+....|++.-=-||. ...|...- .....+.|--.|.+.--
T Consensus 200 --~Wldf~~~Q---sgh~~~~~~~~~~~~~~~~~~~~p~KPvin~Ep~Y--Eg~~~~~~~~~~~~~~~dvrr~aw~svla 272 (289)
T PF13204_consen 200 --PWLDFNMYQ---SGHNRYDQDNWYYLPEEFDYRRKPVKPVINGEPCY--EGIPYSRWGYNGRFSAEDVRRRAWWSVLA 272 (289)
T ss_dssp --TT--SEEEB-----S--TT--THHHH--HHHHTSSS---EEESS-----BT-BTTSS-TS-B--HHHHHHHHHHHHHC
T ss_pred --CcceEEEee---cCCCcccchHHHHHhhhhhhhhCCCCCEEcCcccc--cCCCCCcCcccCCCCHHHHHHHHHHHHhc
Confidence 223433221 12210 1111111 222446677775334553 11222111 24456777789999999
Q ss_pred CC-CEEEEeecccc
Q 007713 477 GG-TGFLYWGANCY 489 (592)
Q Consensus 477 g~-~GfL~W~~n~w 489 (592)
|. -|+-|.+-.-|
T Consensus 273 Ga~aG~tYG~~~iW 286 (289)
T PF13204_consen 273 GAYAGHTYGAHGIW 286 (289)
T ss_dssp T--SEEEE-BHHHH
T ss_pred CCCccccCCCCCcc
Confidence 99 99998875555
No 28
>PF09099 Qn_am_d_aIII: Quinohemoprotein amine dehydrogenase, alpha subunit domain III; InterPro: IPR015183 This domain is predominantly found in the prokaryotic protein quinohemoprotein amine dehydrogenase, adopting an immunoglobulin-like beta-sandwich fold, with seven strands arranged into two beta sheets; the fold is possibly related to the immunoglobulin and/or fibronectin type III superfamilies. The precise function of this domain has not, as yet, been defined []. ; PDB: 1JMZ_A 1JMX_A 1PBY_A 1JJU_A.
Probab=26.94 E-value=62 Score=27.60 Aligned_cols=21 Identities=24% Similarity=0.370 Sum_probs=18.7
Q ss_pred EEEEEEEcCCCCCCceeEEEE
Q 007713 105 AVWVSIDAPYAQPPGLYEGEI 125 (592)
Q Consensus 105 ~lWv~v~VP~~a~pG~Y~g~i 125 (592)
.++++|.+.++++||.|+..+
T Consensus 49 ~v~v~V~~aa~a~~G~~~v~v 69 (81)
T PF09099_consen 49 EVVVRVKAAADAAPGIRTVRV 69 (81)
T ss_dssp CEEEEEEEECTSSSEEEEEEE
T ss_pred EEEEEEEEcCCCCCccEEEEe
Confidence 589999999999999998665
No 29
>PF08428 Rib: Rib/alpha-like repeat; InterPro: IPR012706 This entry represents a region of about 79 amino acids found tandemly repeated up to fourteen times within the proteins that contain it. The repeats lack cysteines and are highly conserved, even at the DNA level, within and between proteins []. Proteins containing these repeats include the Rib and alpha surface antigens of group B Streptococcus, Esp of Enterococcus faecalis (Streptococcus faecalis), and related proteins of Lactobacillus. Most members of this protein family also have the cell wall anchor motif, LPXTG, shared by many staphyloccal and streptococcal surface antigens. These repeats are thought to define protective epitopes and may play a role in generating phenotypic and genotypic variation [].
Probab=26.02 E-value=1.4e+02 Score=24.05 Aligned_cols=33 Identities=30% Similarity=0.528 Sum_probs=24.2
Q ss_pred eecCCCeeEEEEEEEcCCCCCCceeEEEEEEEecCCc
Q 007713 97 SLIPGETTAVWVSIDAPYAQPPGLYEGEIIITSKADT 133 (592)
Q Consensus 97 ~v~~~~~q~lWv~v~VP~~a~pG~Y~g~i~v~~~~~g 133 (592)
+++.|. .--|.+ .|....+|.|++.|+|+-. ||
T Consensus 20 ~lP~gt-~~~w~~--~pdt~~~G~~~~~V~Vtyp-Dg 52 (65)
T PF08428_consen 20 NLPAGT-TYSWKD--KPDTSKPGTKTGKVKVTYP-DG 52 (65)
T ss_pred cCCCCc-ceeecc--CCccccCccEEEEEEEEcC-CC
Confidence 444433 346766 8999999999999999952 44
No 30
>PF00868 Transglut_N: Transglutaminase family; InterPro: IPR001102 Synonym(s): Protein-glutamine gamma-glutamyltransferase, Fibrinoligase, TGase Protein-glutamine gamma-glutamyltransferases (2.3.2.13 from EC) (TGase) are calcium-dependent enzymes that catalyse the cross-linking of proteins by promoting the formation of isopeptide bonds between the gamma-carboxyl group of a glutamine in one polypeptide chain and the epsilon-amino group of a lysine in a second polypeptide chain. TGases also catalyse the conjugation of polyamines to proteins [, ]. Transglutaminases are widely distributed in various organs, tissues and body fluids. The best known transglutaminase is blood coagulation factor XIII, a plasma tetrameric protein composed of two catalytic A subunits and two non-catalytic B subunits. Factor XIII is responsible for cross-linking fibrin chains, thus stabilising the fibrin clot. There are commonly three domains: N-terminal, middle (IPR013808 from INTERPRO) and C-terminal (IPR013807 from INTERPRO). This entry represents the N-terminal domain found in transglutaminases.; GO: 0018149 peptide cross-linking; PDB: 1L9N_B 1NUF_A 1NUD_A 1NUG_B 1L9M_A 1KV3_C 3S3S_A 2Q3Z_A 3LY6_A 3S3P_A ....
Probab=25.45 E-value=5e+02 Score=23.36 Aligned_cols=29 Identities=24% Similarity=0.370 Sum_probs=20.0
Q ss_pred CCCeeEEEEEEEcCCCCCCceeEEEEEEE
Q 007713 100 PGETTAVWVSIDAPYAQPPGLYEGEIIIT 128 (592)
Q Consensus 100 ~~~~q~lWv~v~VP~~a~pG~Y~g~i~v~ 128 (592)
..+...+=|.|.+|++|.-|.|+-+|.++
T Consensus 89 ~~~~~~~tv~V~spa~A~VG~y~l~v~~~ 117 (118)
T PF00868_consen 89 SQDGNSVTVSVTSPANAPVGRYKLSVETK 117 (118)
T ss_dssp EEETTEEEEEEE--TTS--EEEEEEEEEE
T ss_pred ecCCCEEEEEEECCCCCceEEEEEEEEEe
Confidence 33444577899999999999999999886
No 31
>PF05345 He_PIG: Putative Ig domain; InterPro: IPR008009 This alignment represents the conserved core region of a ~90 residue repeat found in several haemagglutinins and other cell surface proteins. Sequence similarities to Hyalin (IPR003410 from INTERPRO) and the PKD domain (IPR000601 from INTERPRO) suggest an Ig-like fold so this family may be similar in function to the (IPR003791 from INTERPRO) and (IPR003790 from INTERPRO) protein families.
Probab=23.01 E-value=1.3e+02 Score=22.90 Aligned_cols=19 Identities=21% Similarity=0.315 Sum_probs=15.0
Q ss_pred EcCCCCCCceeEEEEEEEe
Q 007713 111 DAPYAQPPGLYEGEIIITS 129 (592)
Q Consensus 111 ~VP~~a~pG~Y~g~i~v~~ 129 (592)
.+....++|.|+.+|+++.
T Consensus 28 tp~~~~~~G~y~~~vtatd 46 (49)
T PF05345_consen 28 TPTSSVQPGTYTFTVTATD 46 (49)
T ss_pred ecCCCccccEEEEEEEEEc
Confidence 4555668899999999984
No 32
>PF14734 DUF4469: Domain of unknown function (DUF4469) with IG-like fold
Probab=22.35 E-value=89 Score=27.76 Aligned_cols=23 Identities=17% Similarity=0.110 Sum_probs=20.5
Q ss_pred EEEEEcCCCCCCceeEEEEEEEe
Q 007713 107 WVSIDAPYAQPPGLYEGEIIITS 129 (592)
Q Consensus 107 Wv~v~VP~~a~pG~Y~g~i~v~~ 129 (592)
=+.+.||++-++|.|+.+|+=..
T Consensus 65 ~l~~~lPa~L~~G~Y~l~V~Tq~ 87 (102)
T PF14734_consen 65 RLIFILPADLAAGEYTLEVRTQY 87 (102)
T ss_pred EEEEECcCccCceEEEEEEEEEe
Confidence 36889999999999999999875
No 33
>TIGR03769 P_ac_wall_RPT actinobacterial surface-anchored protein domain. This model describes a repeat domain that one to three times in Actinobacterial proteins, some of which have LPXTG-type sortase recognition motifs for covalent attachment to the Gram-positive cell wall. Where it occurs with duplication in an LPXTG-anchored protein, it tends to be adjacent to the substrate-binding protein of the gene trio of an ABC transporter system, where that substrate-binding protein has a single copy of this same domain. This arrangement suggests a substrate-binding relay system, with the LPXTG protein acting as a substrate receptor.
Probab=22.27 E-value=1e+02 Score=22.77 Aligned_cols=14 Identities=29% Similarity=0.465 Sum_probs=12.3
Q ss_pred CCCceeEEEEEEEe
Q 007713 116 QPPGLYEGEIIITS 129 (592)
Q Consensus 116 a~pG~Y~g~i~v~~ 129 (592)
.+||.|+.+++.+.
T Consensus 10 T~PG~Y~l~~~a~~ 23 (41)
T TIGR03769 10 TKPGTYTLTVQATA 23 (41)
T ss_pred CCCeEEEEEEEEEE
Confidence 58999999999973
No 34
>PRK09778 putative antitoxin of the YafO-YafN toxin-antitoxin system; Provisional
Probab=20.90 E-value=1.9e+02 Score=25.54 Aligned_cols=25 Identities=16% Similarity=0.198 Sum_probs=22.0
Q ss_pred hhHHHHHHHHHHHHHHHHHHhhcCc
Q 007713 529 LRLERILSGLQDIEYLNLYASRYGR 553 (592)
Q Consensus 529 iRle~lreGieDye~L~lL~~~~~~ 553 (592)
=-+|.|.+-++|+|+.++.+++...
T Consensus 44 ~~yE~m~e~LeD~eL~~l~~~R~~~ 68 (97)
T PRK09778 44 SAFEALMDMLAEQEEKKPIKARFRP 68 (97)
T ss_pred HHHHHHHHHHHhHHHHHHHHHHcCc
Confidence 3579999999999999999998664
No 35
>PF07495 Y_Y_Y: Y_Y_Y domain; InterPro: IPR011123 This region is mostly found at the end of the beta propellers (IPR011110 from INTERPRO) in a family of two component regulators. However they are also found tandemly repeated in Q891H4 from SWISSPROT without other signal conduction domains being present. It is named after the conserved tyrosines found in the alignment. The exact function is not known.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=20.83 E-value=1.4e+02 Score=23.16 Aligned_cols=14 Identities=36% Similarity=0.608 Sum_probs=11.3
Q ss_pred CCCceeEEEEEEEe
Q 007713 116 QPPGLYEGEIIITS 129 (592)
Q Consensus 116 a~pG~Y~g~i~v~~ 129 (592)
-+||.|+..|+++.
T Consensus 36 L~~G~Y~l~V~a~~ 49 (66)
T PF07495_consen 36 LPPGKYTLEVRAKD 49 (66)
T ss_dssp --SEEEEEEEEEEE
T ss_pred CCCEEEEEEEEEEC
Confidence 48999999999986
Done!