Query         007713
Match_columns 592
No_of_seqs    158 out of 194
Neff          6.4 
Searched_HMMs 46136
Date          Thu Mar 28 14:20:08 2013
Command       hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/007713.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/007713hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 PF13320 DUF4091:  Domain of un  99.9 1.5E-25 3.2E-30  182.8   7.6   68  476-549     1-68  (68)
  2 PF10633 NPCBM_assoc:  NPCBM-as  95.4    0.07 1.5E-06   44.5   7.5   32   97-128    45-76  (78)
  3 COG1470 Predicted membrane pro  88.4       5 0.00011   44.4  11.7   36   94-129   325-360 (513)
  4 PF01229 Glyco_hydro_39:  Glyco  87.5       2 4.3E-05   48.4   8.4  108  257-381    83-202 (486)
  5 PF06030 DUF916:  Bacterial pro  84.8      30 0.00065   31.6  13.2  109    4-129     3-120 (121)
  6 COG1470 Predicted membrane pro  84.0       9 0.00019   42.5  10.9   33   97-129   437-469 (513)
  7 PF02221 E1_DerP2_DerF2:  ML do  74.5     6.5 0.00014   35.6   5.4   36   95-130    85-120 (134)
  8 PF15418 DUF4625:  Domain of un  73.6     3.9 8.4E-05   38.0   3.7   88   19-131    28-119 (132)
  9 PF14352 DUF4402:  Domain of un  68.7     5.2 0.00011   36.6   3.4   34   96-129    93-128 (130)
 10 PF02449 Glyco_hydro_42:  Beta-  67.5      65  0.0014   34.8  12.1   87  437-543   286-373 (374)
 11 PF06280 DUF1034:  Fn3-like dom  61.0     7.7 0.00017   34.4   2.9   36   94-129    62-100 (112)
 12 PF01835 A2M_N:  MG2 domain;  I  60.8      41 0.00088   28.8   7.4   26  103-128    61-86  (99)
 13 PF13731 WxL:  WxL domain surfa  59.8      29 0.00062   34.6   7.0   79   49-128   105-210 (215)
 14 smart00633 Glyco_10 Glycosyl h  58.6      31 0.00067   35.2   7.2   99  253-381    10-124 (254)
 15 KOG1579 Homocysteine S-methylt  51.8      93   0.002   33.1   9.4  125  309-449   131-259 (317)
 16 PF09608 Alph_Pro_TM:  Putative  51.3      38 0.00082   34.6   6.3   53   94-163   147-199 (236)
 17 cd00917 PG-PI_TP The phosphati  50.1      27  0.0006   31.6   4.7   34   95-129    76-109 (122)
 18 COG5520 O-Glycosyl hydrolase [  48.8 1.9E+02  0.0041   31.5  11.1  146  324-487   150-310 (433)
 19 PF04234 CopC:  CopC domain;  I  44.9      37 0.00081   29.4   4.5   25  108-133    61-85  (97)
 20 PF00150 Cellulase:  Cellulase   43.8 2.2E+02  0.0048   28.4  10.8  102  255-382    59-172 (281)
 21 PF12245 Big_3_2:  Bacterial Ig  41.6      61  0.0013   25.7   4.9   47  105-167    10-56  (60)
 22 TIGR02186 alph_Pro_TM conserve  41.5      35 0.00075   35.4   4.3   50   95-161   173-222 (261)
 23 PF09087 Cyc-maltodext_N:  Cycl  38.2 1.9E+02   0.004   25.1   7.6   20  106-126    51-70  (88)
 24 PRK10301 hypothetical protein;  37.4      60  0.0013   29.7   4.8   25  108-133    88-112 (124)
 25 smart00737 ML Domain involved   34.8      71  0.0015   28.3   4.8   35   95-129    71-105 (118)
 26 PLN00180 NDF6 (NDH-dependent f  34.2      36 0.00077   32.4   2.8   79  457-552    90-169 (180)
 27 PF13204 DUF4038:  Protein of u  32.4 4.5E+02  0.0098   27.4  11.1  197  248-489    78-286 (289)
 28 PF09099 Qn_am_d_aIII:  Quinohe  26.9      62  0.0013   27.6   2.8   21  105-125    49-69  (81)
 29 PF08428 Rib:  Rib/alpha-like r  26.0 1.4E+02  0.0031   24.0   4.7   33   97-133    20-52  (65)
 30 PF00868 Transglut_N:  Transglu  25.5   5E+02   0.011   23.4   9.3   29  100-128    89-117 (118)
 31 PF05345 He_PIG:  Putative Ig d  23.0 1.3E+02  0.0028   22.9   3.7   19  111-129    28-46  (49)
 32 PF14734 DUF4469:  Domain of un  22.4      89  0.0019   27.8   3.1   23  107-129    65-87  (102)
 33 TIGR03769 P_ac_wall_RPT actino  22.3   1E+02  0.0022   22.8   2.8   14  116-129    10-23  (41)
 34 PRK09778 putative antitoxin of  20.9 1.9E+02   0.004   25.5   4.6   25  529-553    44-68  (97)
 35 PF07495 Y_Y_Y:  Y_Y_Y domain;   20.8 1.4E+02  0.0031   23.2   3.7   14  116-129    36-49  (66)

No 1  
>PF13320 DUF4091:  Domain of unknown function (DUF4091)
Probab=99.92  E-value=1.5e-25  Score=182.76  Aligned_cols=68  Identities=47%  Similarity=0.862  Sum_probs=63.6

Q ss_pred             cCCCEEEEeecccccCCCCCCccccccCCCCCCceEEEccCccCCCCCCceechhHHHHHHHHHHHHHHHHHHh
Q 007713          476 EGGTGFLYWGANCYEKATVPSAEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQDIEYLNLYAS  549 (592)
Q Consensus       476 ~g~~GfL~W~~n~w~~~~~P~~~~~~~~~~~~GD~~LvYPG~~~~~~~~Pv~SiRle~lreGieDye~L~lL~~  549 (592)
                      ||++|||||+||+|+++  |+.+++++. |++||++|||||++   .++|++|||||+||+||||||||++|++
T Consensus         1 y~~~G~L~W~~~~w~~d--P~~d~~~~~-~~~GD~~lvYPg~~---~~~p~~SiRle~lr~G~qD~e~l~~l~~   68 (68)
T PF13320_consen    1 YGFDGFLRWAYNFWNED--PWEDTRFRG-FPAGDGFLVYPGED---TGGPVSSIRLEVLREGIQDYEYLRLLEK   68 (68)
T ss_pred             CCCCeEEEecccccccC--cccccCcCc-CCCCCeEEEecCCC---CCCcccCHHHHHHHHHHHHHHHHHHHhC
Confidence            68999999999999887  999999996 99999999999983   3899999999999999999999999985


No 2  
>PF10633 NPCBM_assoc:  NPCBM-associated, NEW3 domain of alpha-galactosidase;  InterPro: IPR018905 This domain has been named NEW3, but its function is not known. It is found on proteins which are bacterial galactosidases [].; PDB: 1EUT_A 2BZD_A 1WCQ_C 2BER_A 1W8O_A 1EUU_A 1W8N_A.
Probab=95.43  E-value=0.07  Score=44.50  Aligned_cols=32  Identities=31%  Similarity=0.543  Sum_probs=25.8

Q ss_pred             eecCCCeeEEEEEEEcCCCCCCceeEEEEEEE
Q 007713           97 SLIPGETTAVWVSIDAPYAQPPGLYEGEIIIT  128 (592)
Q Consensus        97 ~v~~~~~q~lWv~v~VP~~a~pG~Y~g~i~v~  128 (592)
                      .|++|+.+.+=++|.+|+++.||.|+.+++++
T Consensus        45 ~l~pG~s~~~~~~V~vp~~a~~G~y~v~~~a~   76 (78)
T PF10633_consen   45 SLPPGESVTVTFTVTVPADAAPGTYTVTVTAR   76 (78)
T ss_dssp             -B-TTSEEEEEEEEEE-TT--SEEEEEEEEEE
T ss_pred             cCCCCCEEEEEEEEECCCCCCCceEEEEEEEE
Confidence            78899999999999999999999999999886


No 3  
>COG1470 Predicted membrane protein [Function unknown]
Probab=88.37  E-value=5  Score=44.42  Aligned_cols=36  Identities=28%  Similarity=0.458  Sum_probs=34.3

Q ss_pred             cceeecCCCeeEEEEEEEcCCCCCCceeEEEEEEEe
Q 007713           94 CQISLIPGETTAVWVSIDAPYAQPPGLYEGEIIITS  129 (592)
Q Consensus        94 ~~~~v~~~~~q~lWv~v~VP~~a~pG~Y~g~i~v~~  129 (592)
                      ..+.+.+|+...+-++|+.|++|.||.|..+|+++.
T Consensus       325 t~vkL~~gE~kdvtleV~ps~na~pG~Ynv~I~A~s  360 (513)
T COG1470         325 TSVKLKPGEEKDVTLEVYPSLNATPGTYNVTITASS  360 (513)
T ss_pred             EEEEecCCCceEEEEEEecCCCCCCCceeEEEEEec
Confidence            579999999999999999999999999999999985


No 4  
>PF01229 Glyco_hydro_39:  Glycosyl hydrolases family 39;  InterPro: IPR000514 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 39 GH39 from CAZY comprises enzymes with several known activities; alpha-L-iduronidase (3.2.1.76 from EC); beta-xylosidase (3.2.1.37 from EC). The most highly conserved regions in these enzymes are located in their N-terminal sections. These contain a glutamic acid residue which, on the basis of similarities with other families of glycosyl hydrolases [], probably acts as the proton donor in their catalytic mechanism.; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds, 0005975 carbohydrate metabolic process; PDB: 2BS9_D 2BFG_E 1W91_B 1UHV_D 1PX8_A.
Probab=87.49  E-value=2  Score=48.36  Aligned_cols=108  Identities=24%  Similarity=0.318  Sum_probs=62.1

Q ss_pred             HHHHHHHHHHHHhCCCCccccccCCccccccccCCCCCCCCCCcccccCcccccccccccCCCCCchhHHHHHHHHHHHH
Q 007713          257 YEALDQHFKWLLQYRISPFFCRWGESMRVLTYTCPWPADHPKSDEYFSDPRLAAYAVPYSPVLSSNDGAKDYVRKEIELL  336 (592)
Q Consensus       257 f~~ldr~~~~~~~~ris~~f~~wg~~~~i~~y~~pw~~~~~~~~~y~~~~~~~~y~~~~~~~l~g~~~~~~~L~~~~~hl  336 (592)
                      |+.+|+-++++++++|.|+. .       ++++..+-+. +.... |.      |.....|. ...+.|.+++++|++|+
T Consensus        83 f~~lD~i~D~l~~~g~~P~v-e-------l~f~p~~~~~-~~~~~-~~------~~~~~~pp-~~~~~W~~lv~~~~~h~  145 (486)
T PF01229_consen   83 FTYLDQILDFLLENGLKPFV-E-------LGFMPMALAS-GYQTV-FW------YKGNISPP-KDYEKWRDLVRAFARHY  145 (486)
T ss_dssp             -HHHHHHHHHHHHCT-EEEE-E-------E-SB-GGGBS-S--EE-TT------TTEE-S-B-S-HHHHHHHHHHHHHHH
T ss_pred             hHHHHHHHHHHHHcCCEEEE-E-------EEechhhhcC-CCCcc-cc------ccCCcCCc-ccHHHHHHHHHHHHHHH
Confidence            99999999999999999853 1       1211000000 11001 10      00011111 23358999999999999


Q ss_pred             HHc-cc--ccceeeeecCCCCCc---------cchHHHHHHHHHHHHhCCCCeEEEe
Q 007713          337 RTK-AH--WKKAYFYLWDEPLNM---------EHYSSVRNMASELHAYAPDARVLTT  381 (592)
Q Consensus       337 ~~k-g~--~~~~y~~i~DEP~~~---------~~~~~~~~~~~~i~~~~P~~ki~~t  381 (592)
                      ..+ |.  ....+|=++.||...         +=++.|+..++.||++.|++||---
T Consensus       146 ~~RYG~~ev~~W~fEiWNEPd~~~f~~~~~~~ey~~ly~~~~~~iK~~~p~~~vGGp  202 (486)
T PF01229_consen  146 IDRYGIEEVSTWYFEIWNEPDLKDFWWDGTPEEYFELYDATARAIKAVDPELKVGGP  202 (486)
T ss_dssp             HHHHHHHHHTTSEEEESS-TTSTTTSGGG-HHHHHHHHHHHHHHHHHH-TTSEEEEE
T ss_pred             HhhcCCccccceeEEeCcCCCcccccCCCCHHHHHHHHHHHHHHHHHhCCCCcccCc
Confidence            753 42  235567689999632         2244678889999999999998654


No 5  
>PF06030 DUF916:  Bacterial protein of unknown function (DUF916);  InterPro: IPR010317 This family consists of putative cell surface proteins, from Firmicutes, of unknown function. 
Probab=84.80  E-value=30  Score=31.58  Aligned_cols=109  Identities=17%  Similarity=0.226  Sum_probs=69.8

Q ss_pred             ccccCCCCCCC-CCCceEEEeecCceeEEEEEEecCcccCCCCCCcceEEEEcc-ccCCCCCccccccceEEEEeeecCC
Q 007713            4 TANVGPQEMPR-PLEPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSD-LCSASGDRLVVGQSLMLRRVVPMLG   81 (592)
Q Consensus         4 ~~~v~p~~~p~-~~~~~~l~aarnE~~sfQi~l~~~~~~~~~~~~~~v~v~~sd-l~~~~G~~~i~~~~i~~~~v~~vpg   81 (592)
                      ..-|.|+.+-. ....+.|.+.-|+...+|+-+.-.     +...-.|.|++.+ .++.+|.            +.|.+-
T Consensus         3 V~p~~p~~Q~~~~~~YFdL~~~P~q~~~l~v~i~N~-----s~~~~tv~v~~~~A~Tn~nG~------------I~Y~~~   65 (121)
T PF06030_consen    3 VTPVLPENQIDKNVSYFDLKVKPGQKQTLEVRITNN-----SDKEITVKVSANTATTNDNGV------------IDYSQN   65 (121)
T ss_pred             EeecCCccccCCCCCeEEEEeCCCCEEEEEEEEEeC-----CCCCEEEEEEEeeeEecCCEE------------EEECCC
Confidence            34577877654 457899999999999999999653     1222234443221 1122221            222210


Q ss_pred             --CCC--ccccCC---CCccceeecCCCeeEEEEEEEcCCCCCCceeEEEEEEEe
Q 007713           82 --VPD--ALVPLD---LPVCQISLIPGETTAVWVSIDAPYAQPPGLYEGEIIITS  129 (592)
Q Consensus        82 --~PD--~L~p~~---~~~~~~~v~~~~~q~lWv~v~VP~~a~pG~Y~g~i~v~~  129 (592)
                        -.|  +-.++.   .....++|+|++.+-+=++|.+|+..-.|+.-|.|.|+.
T Consensus        66 ~~~~d~sl~~~~~~~v~~~~~Vtl~~~~sk~V~~~i~~P~~~f~G~ilGGi~~~e  120 (121)
T PF06030_consen   66 NPKKDKSLKYPFSDLVKIPKEVTLPPNESKTVTFTIKMPKKAFDGIILGGIYFSE  120 (121)
T ss_pred             CcccCcccCcchHHhccCCcEEEECCCCEEEEEEEEEcCCCCcCCEEEeeEEEEe
Confidence              111  111111   011349999999999999999999999999999999983


No 6  
>COG1470 Predicted membrane protein [Function unknown]
Probab=84.03  E-value=9  Score=42.53  Aligned_cols=33  Identities=36%  Similarity=0.457  Sum_probs=30.3

Q ss_pred             eecCCCeeEEEEEEEcCCCCCCceeEEEEEEEe
Q 007713           97 SLIPGETTAVWVSIDAPYAQPPGLYEGEIIITS  129 (592)
Q Consensus        97 ~v~~~~~q~lWv~v~VP~~a~pG~Y~g~i~v~~  129 (592)
                      .+.||+...+=++|.||++|.+|.|..+|+.++
T Consensus       437 sL~pge~~tV~ltI~vP~~a~aGdY~i~i~~ks  469 (513)
T COG1470         437 SLEPGESKTVSLTITVPEDAGAGDYRITITAKS  469 (513)
T ss_pred             ccCCCCcceEEEEEEcCCCCCCCcEEEEEEEee
Confidence            457899999999999999999999999999985


No 7  
>PF02221 E1_DerP2_DerF2:  ML domain;  InterPro: IPR003172  The MD-2-related lipid-recognition (ML) domain is implicated in lipid recognition, particularly in the recognition of pathogen related products. It has an immunoglobulin-like beta-sandwich fold similar to that of E-set Ig domains. This domain is present in the following proteins:  Epididymal secretory protein E1 (also known as Niemann-Pick C2 protein), which is known to bind cholesterol. Niemann-Pick disease type C2 is a fatal hereditary disease characterised by accumulation of low-density lipoprotein-derived cholesterol in lysosomes [].  House-dust mite allergen proteins such as Der f 2 from Dermatophagoides farinae and Der p 2 from Dermatophagoides pteronyssinus [].  ; PDB: 2AG9_B 1G13_B 2AG2_B 2AG4_A 1TJJ_C 1PU5_C 1PUB_A 2AF9_A 3T6Q_D 3M7O_B ....
Probab=74.47  E-value=6.5  Score=35.58  Aligned_cols=36  Identities=25%  Similarity=0.342  Sum_probs=33.2

Q ss_pred             ceeecCCCeeEEEEEEEcCCCCCCceeEEEEEEEec
Q 007713           95 QISLIPGETTAVWVSIDAPYAQPPGLYEGEIIITSK  130 (592)
Q Consensus        95 ~~~v~~~~~q~lWv~v~VP~~a~pG~Y~g~i~v~~~  130 (592)
                      ...+.+|+....-+++.||...++|.|+++++++..
T Consensus        85 ~CPi~~G~~~~~~~~~~i~~~~p~~~~~i~~~l~d~  120 (134)
T PF02221_consen   85 SCPIKAGEYYTYTYTIPIPKIYPPGKYTIQWKLTDQ  120 (134)
T ss_dssp             TSTBTTTEEEEEEEEEEESTTSSSEEEEEEEEEEET
T ss_pred             cCccCCCcEEEEEEEEEcccceeeEEEEEEEEEEeC
Confidence            577889999999999999999999999999999963


No 8  
>PF15418 DUF4625:  Domain of unknown function (DUF4625)
Probab=73.55  E-value=3.9  Score=38.02  Aligned_cols=88  Identities=19%  Similarity=0.264  Sum_probs=52.4

Q ss_pred             eEEEeecCceeEEEEEEecCcccCCCCCCcceEEEEcc-c--cCCCCCccccccceEEEEeeecCCCCCccccCCCCccc
Q 007713           19 INLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSD-L--CSASGDRLVVGQSLMLRRVVPMLGVPDALVPLDLPVCQ   95 (592)
Q Consensus        19 ~~l~aarnE~~sfQi~l~~~~~~~~~~~~~~v~v~~sd-l--~~~~G~~~i~~~~i~~~~v~~vpg~PD~L~p~~~~~~~   95 (592)
                      -.-.+-||+...|..-+.+.      ..++.++|++-. +  -+-++.   .+               +...|+.- ...
T Consensus        28 ~~~~~~~G~~ihfe~~i~d~------~~i~si~VeIH~nfd~H~h~~~---~~---------------~~~~~~~~-~~~   82 (132)
T PF15418_consen   28 NCKVATRGDDIHFEADISDN------SAIKSIKVEIHNNFDHHTHSTE---AG---------------ECEKPWVF-EQD   82 (132)
T ss_pred             CCeEEecCCcEEEEEEEEcc------cceeEEEEEEecCcCccccccc---cc---------------ccccCcEE-EEE
Confidence            34566889999999888874      467788887620 0  011000   00               00112110 011


Q ss_pred             eeecCC-CeeEEEEEEEcCCCCCCceeEEEEEEEecC
Q 007713           96 ISLIPG-ETTAVWVSIDAPYAQPPGLYEGEIIITSKA  131 (592)
Q Consensus        96 ~~v~~~-~~q~lWv~v~VP~~a~pG~Y~g~i~v~~~~  131 (592)
                      +.+..| ...-+=..|.||++++||.|...|+|+.++
T Consensus        83 ~~~~~g~~~~~~h~~i~IPa~a~~G~YH~~i~VtD~~  119 (132)
T PF15418_consen   83 YDIYGGKKNYDFHEHIDIPADAPAGDYHFMITVTDAA  119 (132)
T ss_pred             EcccCCcccEeEEEeeeCCCCCCCcceEEEEEEEECC
Confidence            222222 245677899999999999999999999743


No 9  
>PF14352 DUF4402:  Domain of unknown function (DUF4402)
Probab=68.68  E-value=5.2  Score=36.55  Aligned_cols=34  Identities=26%  Similarity=0.485  Sum_probs=26.7

Q ss_pred             eeecCCCeeEEEE--EEEcCCCCCCceeEEEEEEEe
Q 007713           96 ISLIPGETTAVWV--SIDAPYAQPPGLYEGEIIITS  129 (592)
Q Consensus        96 ~~v~~~~~q~lWv--~v~VP~~a~pG~Y~g~i~v~~  129 (592)
                      ..+..+....+.|  +++|+.++++|.|+|+++|+.
T Consensus        93 ~~~~~~g~~~~~VGGtL~v~~~~~~G~YsGt~~VtV  128 (130)
T PF14352_consen   93 TTLDTGGSATFNVGGTLNVPANQAAGTYSGTFTVTV  128 (130)
T ss_pred             eEecCCCcEEEEEEEEEEcCCCCCCeEEEEEEEEEE
Confidence            3344555666776  589999999999999999984


No 10 
>PF02449 Glyco_hydro_42:  Beta-galactosidase;  InterPro: IPR013529 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. This group of beta-galactosidase enzymes (3.2.1.23 from EC) belong to the glycosyl hydrolase 42 family GH42 from CAZY. The enzyme catalyses the hydrolysis of terminal, non-reducing terminal beta-D-galactosidase residues.; GO: 0004565 beta-galactosidase activity, 0005975 carbohydrate metabolic process, 0009341 beta-galactosidase complex; PDB: 1KWK_A 1KWG_A 3U7V_A.
Probab=67.47  E-value=65  Score=34.76  Aligned_cols=87  Identities=17%  Similarity=0.339  Sum_probs=41.2

Q ss_pred             CCCceeEEEe-cCCCCCCCCCcccCCchhHHHHHHHHHHHcCCCEEEEeecccccCCCCCCccccccCCCCCCceEEEcc
Q 007713          437 ENGEEWWTYV-CMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPSAEIRFRRGLPPGDGVLFYP  515 (592)
Q Consensus       437 ~~G~~~W~Y~-C~~p~~~~pN~fi~~p~~~~R~~gw~~~~~g~~GfL~W~~n~w~~~~~P~~~~~~~~~~~~GD~~LvYP  515 (592)
                      +.|++.|.=- +.++ ..+...-..-.+-+.|...|++..+|.+|.++|.+......   ..  .|..+.-         
T Consensus       286 ~~~kpf~v~E~~~g~-~~~~~~~~~~~pg~~~~~~~~~~A~Ga~~i~~~~wr~~~~g---~E--~~~~g~~---------  350 (374)
T PF02449_consen  286 AKGKPFWVMEQQPGP-VNWRPYNRPPRPGELRLWSWQAIAHGADGILFWQWRQSRFG---AE--QFHGGLV---------  350 (374)
T ss_dssp             TTT--EEEEEE--S---SSSSS-----TTHHHHHHHHHHHTT-S-EEEC-SB--SSS---TT--TTS--SB---------
T ss_pred             cCCCceEeecCCCCC-CCCccCCCCCCCCHHHHHHHHHHHHhCCeeEeeeccCCCCC---ch--hhhcccC---------
Confidence            4789888652 3322 12322233344568899999999999999999998654222   11  1111111         


Q ss_pred             CccCCCCCCceechhHHHHHHHHHHHHH
Q 007713          516 GEVFSSSRQPVASLRLERILSGLQDIEY  543 (592)
Q Consensus       516 G~~~~~~~~Pv~SiRle~lreGieDye~  543 (592)
                      +     .++..++.|++-+.+-.++++.
T Consensus       351 ~-----~dg~~~~~~~~e~~~~~~~l~~  373 (374)
T PF02449_consen  351 D-----HDGREPTRRYREVAQLGRELKK  373 (374)
T ss_dssp             ------TTS--B-HHHHHHHHHHHHHHT
T ss_pred             C-----ccCCCCCcHHHHHHHHHHHHhc
Confidence            1     2335677888887777666553


No 11 
>PF06280 DUF1034:  Fn3-like domain (DUF1034);  InterPro: IPR010435 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This domain of unknown function is present in bacterial and plant peptidases belonging to MEROPS peptidase family S8 (subfamily S8A subtilisin, clan SB). It is C-terminal to and adjacent to the S8 peptidase domain and can be found in conjunction with the PA (Protease associated) domain (IPR003137 from INTERPRO) and additionally in Gram-positive bacteria with the surface protein anchor domain (IPR001899 from INTERPRO).; GO: 0004252 serine-type endopeptidase activity, 0005618 cell wall, 0016020 membrane; PDB: 3EIF_A 1XF1_B.
Probab=61.04  E-value=7.7  Score=34.44  Aligned_cols=36  Identities=28%  Similarity=0.426  Sum_probs=30.7

Q ss_pred             cceeecCCCeeEEEEEEEcCCCCCC---ceeEEEEEEEe
Q 007713           94 CQISLIPGETTAVWVSIDAPYAQPP---GLYEGEIIITS  129 (592)
Q Consensus        94 ~~~~v~~~~~q~lWv~v~VP~~a~p---G~Y~g~i~v~~  129 (592)
                      ..++|+||+.+.+=|+|++|++..+   ..|.|-|.++.
T Consensus        62 ~~vTV~ag~s~~v~vti~~p~~~~~~~~~~~eG~I~~~~  100 (112)
T PF06280_consen   62 DTVTVPAGQSKTVTVTITPPSGLDASNGPFYEGFITFKS  100 (112)
T ss_dssp             EEEEE-TTEEEEEEEEEE--GGGHHTT-EEEEEEEEEES
T ss_pred             CeEEECCCCEEEEEEEEEehhcCCcccCCEEEEEEEEEc
Confidence            5799999999999999999998886   99999999995


No 12 
>PF01835 A2M_N:  MG2 domain;  InterPro: IPR002890 The proteinase-binding alpha-macroglobulins (A2M) [] are large glycoproteins found in the plasma of vertebrates, in the hemolymph of some invertebrates and in reptilian and avian egg white. A2M-like proteins are able to inhibit all four classes of proteinases by a 'trapping' mechanism. They have a peptide stretch, called the 'bait region', which contains specific cleavage sites for different proteinases. When a proteinase cleaves the bait region, a conformational change is induced in the protein, thus trapping the proteinase. The entrapped enzyme remains active against low molecular weight substrates, whilst its activity toward larger substrates is greatly reduced, due to steric hindrance. Following cleavage in the bait region, a thiol ester bond, formed between the side chains of a cysteine and a glutamine, is cleaved and mediates the covalent binding of the A2M-like protein to the proteinase. This family includes the N-terminal region of the alpha-2-macroglobulin family. The inhibitor domains belong to MEROPS inhibitor family I39.; GO: 0004866 endopeptidase inhibitor activity; PDB: 2B39_B 3KLS_B 3PRX_C 3KM9_B 3PVM_C 3CU7_A 4E0S_A 4A5W_A 4ACQ_C 2P9R_B ....
Probab=60.84  E-value=41  Score=28.77  Aligned_cols=26  Identities=19%  Similarity=0.180  Sum_probs=17.2

Q ss_pred             eeEEEEEEEcCCCCCCceeEEEEEEE
Q 007713          103 TTAVWVSIDAPYAQPPGLYEGEIIIT  128 (592)
Q Consensus       103 ~q~lWv~v~VP~~a~pG~Y~g~i~v~  128 (592)
                      .-.+-.++.+|+++..|.|+.++...
T Consensus        61 ~G~~~~~~~lp~~~~~G~y~i~~~~~   86 (99)
T PF01835_consen   61 NGIFSGSFQLPDDAPLGTYTIRVKTD   86 (99)
T ss_dssp             TTEEEEEEE--SS---EEEEEEEEET
T ss_pred             CCEEEEEEECCCCCCCEeEEEEEEEc
Confidence            33566789999999999999999885


No 13 
>PF13731 WxL:  WxL domain surface cell wall-binding
Probab=59.80  E-value=29  Score=34.63  Aligned_cols=79  Identities=24%  Similarity=0.390  Sum_probs=45.5

Q ss_pred             ceEEEEccccCCCCCccccccceEEEEeeecC---C--CCCc------cccCCCCccceeecCCCeeEEE----------
Q 007713           49 VVQVQCSDLCSASGDRLVVGQSLMLRRVVPML---G--VPDA------LVPLDLPVCQISLIPGETTAVW----------  107 (592)
Q Consensus        49 ~v~v~~sdl~~~~G~~~i~~~~i~~~~v~~vp---g--~PD~------L~p~~~~~~~~~v~~~~~q~lW----------  107 (592)
                      .|+|+.++|++.+|. .+.+..+.+.......   .  -|-.      |.+-......++-..++-+..|          
T Consensus       105 ~L~v~~s~F~~~~~~-~L~ga~l~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~v~~A~~~~g~G~~~~~~~~~~~~  183 (215)
T PF13731_consen  105 TLTVKLSPFTNADGD-TLPGATLTFNNGKVQSTANNTNTPTTVSSNITLTPGGQAQTVMSAAKGQGQGTWSYSFGDQDAT  183 (215)
T ss_pred             EEEEEeccccccCCc-CcccceEEecCceeEeecccccCCcccccceEeccCCcceeeEeecccccceEEEEEeCCcccc
Confidence            477788888888755 4556556554433221   0  0111      1111111111223345555555          


Q ss_pred             ----EEEEcCCCCC--CceeEEEEEEE
Q 007713          108 ----VSIDAPYAQP--PGLYEGEIIIT  128 (592)
Q Consensus       108 ----v~v~VP~~a~--pG~Y~g~i~v~  128 (592)
                          |.+.||..+.  +|.|+++|+=+
T Consensus       184 ~~~~v~L~VP~~~~~~ag~Yt~tlTWt  210 (215)
T PF13731_consen  184 ADTGVSLSVPANTAKQAGTYTATLTWT  210 (215)
T ss_pred             cccceEEEeCCCCcccCCcEEEEEEEE
Confidence                8999999998  69999999976


No 14 
>smart00633 Glyco_10 Glycosyl hydrolase family 10.
Probab=58.60  E-value=31  Score=35.18  Aligned_cols=99  Identities=12%  Similarity=0.166  Sum_probs=58.9

Q ss_pred             ChhH-HHHHHHHHHHHHhCCCCc--cccccCCccccccccCCCCCCCCCCcccccCcccccccccccCCCCCchhHHHHH
Q 007713          253 SDEW-YEALDQHFKWLLQYRISP--FFCRWGESMRVLTYTCPWPADHPKSDEYFSDPRLAAYAVPYSPVLSSNDGAKDYV  329 (592)
Q Consensus       253 ~~~~-f~~ldr~~~~~~~~ris~--~f~~wg~~~~i~~y~~pw~~~~~~~~~y~~~~~~~~y~~~~~~~l~g~~~~~~~L  329 (592)
                      .+.| |+.+|+.++|+.+++|.-  +..-|+..      ...| ..        ..+ .++          -.+.+.+|+
T Consensus        10 ~G~~n~~~~D~~~~~a~~~gi~v~gH~l~W~~~------~P~W-~~--------~~~-~~~----------~~~~~~~~i   63 (254)
T smart00633       10 RGQFNFSGADAIVNFAKENGIKVRGHTLVWHSQ------TPDW-VF--------NLS-KET----------LLARLENHI   63 (254)
T ss_pred             CCccChHHHHHHHHHHHHCCCEEEEEEEeeccc------CCHh-hh--------cCC-HHH----------HHHHHHHHH
Confidence            3444 899999999999999972  11234331      1123 00        000 000          012456677


Q ss_pred             HHHHHHHHHcccccceeeeecCCCCCcc-------c----h--HHHHHHHHHHHHhCCCCeEEEe
Q 007713          330 RKEIELLRTKAHWKKAYFYLWDEPLNME-------H----Y--SSVRNMASELHAYAPDARVLTT  381 (592)
Q Consensus       330 ~~~~~hl~~kg~~~~~y~~i~DEP~~~~-------~----~--~~~~~~~~~i~~~~P~~ki~~t  381 (592)
                      .+.++|.+.+   -..+ -+..||.+..       .    +  +.++.+.+.+|++.|++|++..
T Consensus        64 ~~v~~ry~g~---i~~w-dV~NE~~~~~~~~~~~~~w~~~~G~~~i~~af~~ar~~~P~a~l~~N  124 (254)
T smart00633       64 KTVVGRYKGK---IYAW-DVVNEALHDNGSGLRRSVWYQILGEDYIEKAFRYAREADPDAKLFYN  124 (254)
T ss_pred             HHHHHHhCCc---ceEE-EEeeecccCCCcccccchHHHhcChHHHHHHHHHHHHhCCCCEEEEe
Confidence            7777776644   1122 3678885321       1    2  6678999999999999999875


No 15 
>KOG1579 consensus Homocysteine S-methyltransferase [Amino acid transport and metabolism]
Probab=51.75  E-value=93  Score=33.14  Aligned_cols=125  Identities=13%  Similarity=0.155  Sum_probs=76.5

Q ss_pred             ccccccccCCCCCchhHHHHHHHHHHHHHHcccccceeeeecCCCCCccchHHHHHHHHHHHHhCCCCeEEEeeccCCCC
Q 007713          309 AAYAVPYSPVLSSNDGAKDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSD  388 (592)
Q Consensus       309 ~~y~~~~~~~l~g~~~~~~~L~~~~~hl~~kg~~~~~y~~i~DEP~~~~~~~~~~~~~~~i~~~~P~~ki~~t~~~~p~~  388 (592)
                      .+|+..|+.-.+.. .+++|.+.-++-+-++|.-.-++=.|   |    +...-+++.+++.+..|+.++..+++|.++.
T Consensus       131 ~eytg~Y~~~~~~~-el~~~~k~qle~~~~~gvD~L~fETi---p----~~~EA~a~l~~l~~~~~~~p~~is~t~~d~g  202 (317)
T KOG1579|consen  131 SEYTGIYGDNVEFE-ELYDFFKQQLEVFLEAGVDLLAFETI---P----NVAEAKAALELLQELGPSKPFWISFTIKDEG  202 (317)
T ss_pred             cccccccccccCHH-HHHHHHHHHHHHHHhCCCCEEEEeec---C----CHHHHHHHHHHHHhcCCCCcEEEEEEecCCC
Confidence            56777776665554 47888888899998998333333344   4    3334578889999999999999999988754


Q ss_pred             CCCCCCCcccccc---cc-cccCCccccccccccccCCchhhhHHhHhhcccCCCceeEEEecCC
Q 007713          389 APLGPTPFESFVK---VP-KFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEEWWTYVCMG  449 (592)
Q Consensus       389 ~~~~~~~~e~~~~---vp-~~~~~~idi~c~~~wv~~~~~~~~~~~~~~~~~~~G~~~W~Y~C~~  449 (592)
                      .......++.++.   .. ++  ..|.++|.+.   ...+....++.+   .-....+-.|...+
T Consensus       203 ~l~~G~t~e~~~~~~~~~~~~--~~IGvNC~~~---~~~~~~~~~L~~---~~~~~~llvYPNsG  259 (317)
T KOG1579|consen  203 RLRSGETGEEAAQLLKDGINL--LGIGVNCVSP---NFVEPLLKELMA---KLTKIPLLVYPNSG  259 (317)
T ss_pred             cccCCCcHHHHHHHhccCCce--EEEEeccCCc---hhccHHHHHHhh---ccCCCeEEEecCCC
Confidence            4444445555432   11 11  2477888743   222333333331   23455666665543


No 16 
>PF09608 Alph_Pro_TM:  Putative transmembrane protein (Alph_Pro_TM);  InterPro: IPR019088  This entry consists of predicted transmembrane proteins of about 270 amino acids. They are found predominantly, though not exclusively, in alphaproteobacteria, generally only once in each genome. 
Probab=51.28  E-value=38  Score=34.64  Aligned_cols=53  Identities=19%  Similarity=0.224  Sum_probs=38.5

Q ss_pred             cceeecCCCeeEEEEEEEcCCCCCCceeEEEEEEEecCCccchhhccccccccccccceeeeeeeccCCC
Q 007713           94 CQISLIPGETTAVWVSIDAPYAQPPGLYEGEIIITSKADTELSSQCLGKGEKHRLFMELRNCLDNVEPIE  163 (592)
Q Consensus        94 ~~~~v~~~~~q~lWv~v~VP~~a~pG~Y~g~i~v~~~~~g~~~~~~~~~~~~~~l~~~l~~~l~V~~~~l  163 (592)
                      +.+.+..+  +-+..+|.+|++.++|.|+.++-+..  ||.++++.             ...|+|.-.=+
T Consensus       147 ~~V~~~~~--~lFra~i~LPanvp~G~Y~v~v~l~r--dG~vv~~~-------------~~~l~V~KvG~  199 (236)
T PF09608_consen  147 GGVQFLEG--TLFRARIPLPANVPPGDYTVRVYLFR--DGQVVASQ-------------ETPLRVRKVGF  199 (236)
T ss_pred             CeEEEcCC--CeEEEEeEcCCCCCcceEEEEEEEEE--CCEEEEEE-------------eeEEEEEEccH
Confidence            34665444  47889999999999999999999984  67655433             66667755433


No 17 
>cd00917 PG-PI_TP The phosphatidylinositol/phosphatidylglycerol transfer protein (PG/PI-TP) has been shown to bind phosphatidylglycerol and phosphatidylinositol, but the biological significance of this is still obscure. These proteins belong to the ML domain family.
Probab=50.15  E-value=27  Score=31.59  Aligned_cols=34  Identities=24%  Similarity=0.365  Sum_probs=30.5

Q ss_pred             ceeecCCCeeEEEEEEEcCCCCCCceeEEEEEEEe
Q 007713           95 QISLIPGETTAVWVSIDAPYAQPPGLYEGEIIITS  129 (592)
Q Consensus        95 ~~~v~~~~~q~lWv~v~VP~~a~pG~Y~g~i~v~~  129 (592)
                      ...+.+|+.. +=.++.||...++|.|+++.++..
T Consensus        76 ~CPi~~G~~~-~~~~~~ip~~~P~g~y~v~~~l~d  109 (122)
T cd00917          76 SCPIEPGDKF-LTKLVDLPGEIPPGKYTVSARAYT  109 (122)
T ss_pred             cCCcCCCcEE-EEEEeeCCCCCCCceEEEEEEEEC
Confidence            5778899977 888899999999999999999995


No 18 
>COG5520 O-Glycosyl hydrolase [Cell envelope biogenesis, outer membrane]
Probab=48.83  E-value=1.9e+02  Score=31.53  Aligned_cols=146  Identities=16%  Similarity=0.281  Sum_probs=77.1

Q ss_pred             hHHHHHHHHHHHHHHcccccceeeeecCCCCCccchHH----HHHHHHHHHHh----CCCCeEEEeeccCCCCCCCCCCC
Q 007713          324 GAKDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSS----VRNMASELHAY----APDARVLTTYYCGPSDAPLGPTP  395 (592)
Q Consensus       324 ~~~~~L~~~~~hl~~kg~~~~~y~~i~DEP~~~~~~~~----~~~~~~~i~~~----~P~~ki~~t~~~~p~~~~~~~~~  395 (592)
                      .+-+||..|+..++..|.-..+ +.+-.||.-...++.    ..+..++++++    ...+||+.-            ++
T Consensus       150 ~yA~~l~~fv~~m~~nGvnlya-lSVQNEPd~~p~~d~~~wtpQe~~rF~~qyl~si~~~~rV~~p------------es  216 (433)
T COG5520         150 DYADYLNDFVLEMKNNGVNLYA-LSVQNEPDYAPTYDWCWWTPQEELRFMRQYLASINAEMRVIIP------------ES  216 (433)
T ss_pred             HHHHHHHHHHHHHHhCCCceeE-EeeccCCcccCCCCcccccHHHHHHHHHHhhhhhccccEEecc------------hh
Confidence            3678999999999999976654 356799964433433    23445555555    334566652            12


Q ss_pred             cccccc--cccccCCc----cccccccccccCCchhhhHHhHhhcccCCCceeEEEecCCCCCCCCCcccCCchhHHH-H
Q 007713          396 FESFVK--VPKFLRPH----TQIYCTSEWVLGNREDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHR-A  468 (592)
Q Consensus       396 ~e~~~~--vp~~~~~~----idi~c~~~wv~~~~~~~~~~~~~~~~~~~G~~~W~Y~C~~p~~~~pN~fi~~p~~~~R-~  468 (592)
                      +....+  -|.+-+|.    ++|. .-+|-.++-.++..-+. + +...||.+|+=-|..+. .=||.-.- ..+..- =
T Consensus       217 ~~~~~~~~dp~lnDp~a~a~~~il-g~H~Ygg~v~~~p~~la-k-~~~~gKdlwmte~y~~e-sd~~s~dr-~~~~~~~h  291 (433)
T COG5520         217 FKDLPNMSDPILNDPKALANMDIL-GTHLYGGQVSDQPYPLA-K-QKPAGKDLWMTECYPPE-SDPNSADR-EALHVALH  291 (433)
T ss_pred             cccccccccccccCHhHhccccee-EeeecccccccchhhHh-h-CCCcCCceEEeecccCC-CCCCcchH-HHHHHHHH
Confidence            111111  12222222    2322 22344444444443332 2 33569999987887553 23332111 111111 1


Q ss_pred             HHHHHHHcCCCEEEEeecc
Q 007713          469 VMWRVWKEGGTGFLYWGAN  487 (592)
Q Consensus       469 ~gw~~~~~g~~GfL~W~~n  487 (592)
                      +.--..+-|+.||+.|..-
T Consensus       292 i~~gm~~gg~~ayv~W~i~  310 (433)
T COG5520         292 IHIGMTEGGFQAYVWWNIR  310 (433)
T ss_pred             HHhhccccCccEEEEEEEe
Confidence            3445567789999999764


No 19 
>PF04234 CopC:  CopC domain;  InterPro: IPR007348 CopC is a bacterial blue copper protein that binds 1 atom of copper per protein molecule. Along with CopA, CopC mediates copper resistance by sequestration of copper in the periplasm [].; GO: 0005507 copper ion binding, 0046688 response to copper ion, 0042597 periplasmic space; PDB: 1IX2_B 1LYQ_A 2C9P_C 2C9R_A 2C9Q_A 1M42_A 1OT4_A 1NM4_A.
Probab=44.86  E-value=37  Score=29.38  Aligned_cols=25  Identities=32%  Similarity=0.533  Sum_probs=19.0

Q ss_pred             EEEEcCCCCCCceeEEEEEEEecCCc
Q 007713          108 VSIDAPYAQPPGLYEGEIIITSKADT  133 (592)
Q Consensus       108 v~v~VP~~a~pG~Y~g~i~v~~~~~g  133 (592)
                      +.+.+|..-++|.|+..-+|.+ +||
T Consensus        61 ~~~~l~~~l~~G~YtV~wrvvs-~DG   85 (97)
T PF04234_consen   61 LTVPLPPPLPPGTYTVSWRVVS-ADG   85 (97)
T ss_dssp             EEEEESS---SEEEEEEEEEEE-TTS
T ss_pred             EEEECCCCCCCceEEEEEEEEe-cCC
Confidence            5788899999999999999986 566


No 20 
>PF00150 Cellulase:  Cellulase (glycosyl hydrolase family 5);  InterPro: IPR001547 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 5 GH5 from CAZY comprises enzymes with several known activities; endoglucanase (3.2.1.4 from EC); beta-mannanase (3.2.1.78 from EC); exo-1,3-glucanase (3.2.1.58 from EC); endo-1,6-glucanase (3.2.1.75 from EC); xylanase (3.2.1.8 from EC); endoglycoceramidase (3.2.1.123 from EC). The microbial degradation of cellulose and xylans requires several types of enzymes. Fungi and bacteria produces a spectrum of cellulolytic enzymes (cellulases) and xylanases which, on the basis of sequence similarities, can be classified into families. One of these families is known as the cellulase family A [] or as the glycosyl hydrolases family 5 []. One of the conserved regions in this family contains a conserved glutamic acid residue which is potentially involved [] in the catalytic mechanism.; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds, 0005975 carbohydrate metabolic process; PDB: 3NDY_A 3NDZ_B 1LF1_A 1TVP_B 1TVN_A 3AYR_A 3AYS_A 1QI0_A 1W3K_A 1OCQ_A ....
Probab=43.82  E-value=2.2e+02  Score=28.42  Aligned_cols=102  Identities=13%  Similarity=0.176  Sum_probs=58.9

Q ss_pred             hHHHHHHHHHHHHHhCCCCccccccCCccccccccCCCCCCCCCCcccccCcccccccccccCCCCCchhHHHHHHHHHH
Q 007713          255 EWYEALDQHFKWLLQYRISPFFCRWGESMRVLTYTCPWPADHPKSDEYFSDPRLAAYAVPYSPVLSSNDGAKDYVRKEIE  334 (592)
Q Consensus       255 ~~f~~ldr~~~~~~~~ris~~f~~wg~~~~i~~y~~pw~~~~~~~~~y~~~~~~~~y~~~~~~~l~g~~~~~~~L~~~~~  334 (592)
                      .+++.||+-++++.+++|.-+++-.+.        ..|... ...   +..     ..       ...+....+++.++.
T Consensus        59 ~~~~~ld~~v~~a~~~gi~vild~h~~--------~~w~~~-~~~---~~~-----~~-------~~~~~~~~~~~~la~  114 (281)
T PF00150_consen   59 TYLARLDRIVDAAQAYGIYVILDLHNA--------PGWANG-GDG---YGN-----ND-------TAQAWFKSFWRALAK  114 (281)
T ss_dssp             HHHHHHHHHHHHHHHTT-EEEEEEEES--------TTCSSS-TST---TTT-----HH-------HHHHHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHHHHhCCCeEEEEeccC--------cccccc-ccc---ccc-----ch-------hhHHHHHhhhhhhcc
Confidence            458999999999999999744321110        113000 000   000     00       001134556677777


Q ss_pred             HHHHcccccceeeeecCCCCCccc------------hHHHHHHHHHHHHhCCCCeEEEee
Q 007713          335 LLRTKAHWKKAYFYLWDEPLNMEH------------YSSVRNMASELHAYAPDARVLTTY  382 (592)
Q Consensus       335 hl~~kg~~~~~y~~i~DEP~~~~~------------~~~~~~~~~~i~~~~P~~ki~~t~  382 (592)
                      +++..  -..+.+-|+.||.....            .+.++++.+.||+..|+..|+...
T Consensus       115 ~y~~~--~~v~~~el~NEP~~~~~~~~w~~~~~~~~~~~~~~~~~~Ir~~~~~~~i~~~~  172 (281)
T PF00150_consen  115 RYKDN--PPVVGWELWNEPNGGNDDANWNAQNPADWQDWYQRAIDAIRAADPNHLIIVGG  172 (281)
T ss_dssp             HHTTT--TTTEEEESSSSGCSTTSTTTTSHHHTHHHHHHHHHHHHHHHHTTSSSEEEEEE
T ss_pred             ccCCC--CcEEEEEecCCccccCCccccccccchhhhhHHHHHHHHHHhcCCcceeecCC
Confidence            77533  23556678999963211            356788999999999998877654


No 21 
>PF12245 Big_3_2:  Bacterial Ig-like domain (group 3);  InterPro: IPR022038  This family of proteins is found in bacteria. They have two conserved sequence motifs: AGN and GMT. 
Probab=41.63  E-value=61  Score=25.65  Aligned_cols=47  Identities=21%  Similarity=0.274  Sum_probs=34.0

Q ss_pred             EEEEEEEcCCCCCCceeEEEEEEEecCCccchhhccccccccccccceeeeeeeccCCCCCCh
Q 007713          105 AVWVSIDAPYAQPPGLYEGEIIITSKADTELSSQCLGKGEKHRLFMELRNCLDNVEPIEGKPL  167 (592)
Q Consensus       105 ~lWv~v~VP~~a~pG~Y~g~i~v~~~~~g~~~~~~~~~~~~~~l~~~l~~~l~V~~~~lp~~~  167 (592)
                      ..|.. -+|.+...|.|+.+|+++.+++..               .+.....-+++...|.|.
T Consensus        10 ~~~~~-~~P~~~~dg~yt~~v~a~D~AGN~---------------~~~~~~~~i~d~~~p~pt   56 (60)
T PF12245_consen   10 GVWST-VIPENDADGEYTLTVTATDKAGNT---------------SSSTTQIVIVDNTAPAPT   56 (60)
T ss_pred             cceec-cccCccCCccEEEEEEEEECCCCE---------------EEeeeEEEEEcCCCCCcc
Confidence            34443 469998999999999999776442               344667778888878664


No 22 
>TIGR02186 alph_Pro_TM conserved hypothetical protein. This family consists of predicted transmembrane proteins of about 270 amino acids. Members are found, so far, only among the Alphaproteobacteria and only once in each genome.
Probab=41.48  E-value=35  Score=35.42  Aligned_cols=50  Identities=14%  Similarity=0.185  Sum_probs=36.5

Q ss_pred             ceeecCCCeeEEEEEEEcCCCCCCceeEEEEEEEecCCccchhhccccccccccccceeeeeeeccC
Q 007713           95 QISLIPGETTAVWVSIDAPYAQPPGLYEGEIIITSKADTELSSQCLGKGEKHRLFMELRNCLDNVEP  161 (592)
Q Consensus        95 ~~~v~~~~~q~lWv~v~VP~~a~pG~Y~g~i~v~~~~~g~~~~~~~~~~~~~~l~~~l~~~l~V~~~  161 (592)
                      .+++..+.  =+--+|.+|++.+.|.|+.++-+..  +|.++++.             ...|+|.-.
T Consensus       173 gV~~~~~~--LFra~i~LPAnvp~G~Y~v~v~L~r--~G~vv~~~-------------~t~l~V~Kv  222 (261)
T TIGR02186       173 GVTFISPT--LFRATLRLPANVPNGTHEVRAYLFR--GGVFIART-------------ELALEIVKT  222 (261)
T ss_pred             eEEEcCCc--eEEEeeecCCCCCCceEEEEEEEEe--CCEEEEEE-------------EeEEEEEEe
Confidence            35554443  3678999999999999999999984  67655543             667777653


No 23 
>PF09087 Cyc-maltodext_N:  Cyclomaltodextrinase, N-terminal;  InterPro: IPR015171 This domain is found at the N terminus of cyclomaltodextrinase. The domain assumes a beta-sandwich structure composed of the eight antiparallel beta-strands. A ten residue linker is also present at the C-terminal end, which connects the N-terminal domain to a distal domain in the protein. This domain participates in oligomerisation of the protein, wherein the N-terminal domain of one subunit contacts the active centre of the other subunit, and is also required for binding of cyclodextrin to substrate []. ; PDB: 3EDK_B 3EDD_A 3EDJ_B 3EDE_A 1H3G_B 3EDF_B.
Probab=38.20  E-value=1.9e+02  Score=25.10  Aligned_cols=20  Identities=20%  Similarity=0.436  Sum_probs=13.1

Q ss_pred             EEEEEEcCCCCCCceeEEEEE
Q 007713          106 VWVSIDAPYAQPPGLYEGEII  126 (592)
Q Consensus       106 lWv~v~VP~~a~pG~Y~g~i~  126 (592)
                      |-|+++|- +|+||+++..++
T Consensus        51 LFv~L~i~-~akpg~~~i~~~   70 (88)
T PF09087_consen   51 LFVYLDIS-DAKPGTFTINFK   70 (88)
T ss_dssp             EEEEEEE--T--SEEEEEEEE
T ss_pred             EEEEEecC-CCCCcEEEEEEE
Confidence            56777777 999999887666


No 24 
>PRK10301 hypothetical protein; Provisional
Probab=37.36  E-value=60  Score=29.73  Aligned_cols=25  Identities=20%  Similarity=0.303  Sum_probs=20.9

Q ss_pred             EEEEcCCCCCCceeEEEEEEEecCCc
Q 007713          108 VSIDAPYAQPPGLYEGEIIITSKADT  133 (592)
Q Consensus       108 v~v~VP~~a~pG~Y~g~i~v~~~~~g  133 (592)
                      +.+.+|..-++|+|+.+-+|.+ +||
T Consensus        88 ~~v~l~~~L~~G~YtV~Wrvvs-~DG  112 (124)
T PRK10301         88 LIVPLADSLKPGTYTVDWHVVS-VDG  112 (124)
T ss_pred             EEEECCCCCCCccEEEEEEEEe-cCC
Confidence            4777888889999999999986 466


No 25 
>smart00737 ML Domain involved in innate immunity and lipid metabolism. ML (MD-2-related lipid-recognition) is a novel domain identified in MD-1, MD-2, GM2A, Npc2 and multiple proteins of unknown function in plants, animals and fungi. These single-domain proteins were predicted to form a beta-rich fold containing multiple strands, and to mediate diverse biological functions through interacting with specific lipids.
Probab=34.81  E-value=71  Score=28.31  Aligned_cols=35  Identities=29%  Similarity=0.361  Sum_probs=29.7

Q ss_pred             ceeecCCCeeEEEEEEEcCCCCCCceeEEEEEEEe
Q 007713           95 QISLIPGETTAVWVSIDAPYAQPPGLYEGEIIITS  129 (592)
Q Consensus        95 ~~~v~~~~~q~lWv~v~VP~~a~pG~Y~g~i~v~~  129 (592)
                      ...+.+|+..-.=.++.||...++|.|+++++++.
T Consensus        71 ~CPl~~G~~~~~~~~~~v~~~~P~~~~~v~~~l~d  105 (118)
T smart00737       71 KCPIEKGETVNYTNSLTVPGIFPPGKYTVKWELTD  105 (118)
T ss_pred             CCCCCCCeeEEEEEeeEccccCCCeEEEEEEEEEc
Confidence            46778898765667789999999999999999995


No 26 
>PLN00180 NDF6 (NDH-dependent flow 6); Provisional
Probab=34.22  E-value=36  Score=32.41  Aligned_cols=79  Identities=19%  Similarity=0.335  Sum_probs=51.0

Q ss_pred             cccCCchhHHHHHHHHHHHcCCCEEEEeecccccCCCCCCccc-cccCCCCCCceEEEccCccCCCCCCceechhHHHHH
Q 007713          457 WHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPSAEI-RFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERIL  535 (592)
Q Consensus       457 ~fi~~p~~~~R~~gw~~~~~g~~GfL~W~~n~w~~~~~P~~~~-~~~~~~~~GD~~LvYPG~~~~~~~~Pv~SiRle~lr  535 (592)
                      |++....+.|-....+.+       --||..|+.+..|||-|. .|+++-+.|-+.-||--.     ....+|-|-|+||
T Consensus        90 wHLSD~aiKnVYtfY~mF-------T~WG~~fFgSmKDPfYDSe~YRgdGGDGT~hW~Yd~Q-----Ed~E~sAReeL~R  157 (180)
T PLN00180         90 WHLSDAAIKNVYTFYIMF-------TCWGCLFFGSMKDPFYDSEEYRGDGGDGTGHWVYERQ-----EDIEESARAELWR  157 (180)
T ss_pred             hhccHHHHhHHHHHHHHH-------HHHHHhheeccCCcccchHHhcccCCCCceeeEeehH-----HHHHHHHHHHHHH
Confidence            455555566554333332       357777776666798765 455543566667788764     3378999999999


Q ss_pred             HHHHHHHHHHHHHhhcC
Q 007713          536 SGLQDIEYLNLYASRYG  552 (592)
Q Consensus       536 eGieDye~L~lL~~~~~  552 (592)
                      |     |+|..++++.|
T Consensus       158 E-----ELiEEIEQkVG  169 (180)
T PLN00180        158 E-----ELIEEIEQKVG  169 (180)
T ss_pred             H-----HHHHHHHHHhh
Confidence            8     55666666654


No 27 
>PF13204 DUF4038:  Protein of unknown function (DUF4038); PDB: 3KZS_D.
Probab=32.40  E-value=4.5e+02  Score=27.43  Aligned_cols=197  Identities=14%  Similarity=0.149  Sum_probs=86.7

Q ss_pred             ccccCChhHHHHHHHHHHHHHhCCCCc-cccccCCccccccccCCCCCCCCCCcccccCcccccccccccCCCCCchhHH
Q 007713          248 GVRHGSDEWYEALDQHFKWLLQYRISP-FFCRWGESMRVLTYTCPWPADHPKSDEYFSDPRLAAYAVPYSPVLSSNDGAK  326 (592)
Q Consensus       248 ~v~~~~~~~f~~ldr~~~~~~~~ris~-~f~~wg~~~~i~~y~~pw~~~~~~~~~y~~~~~~~~y~~~~~~~l~g~~~~~  326 (592)
                      .+..-...||+.+|+-++.+.+++|-. +..-||.+.   . -.-|    +..                 +-+=+.+..+
T Consensus        78 d~~~~N~~YF~~~d~~i~~a~~~Gi~~~lv~~wg~~~---~-~~~W----g~~-----------------~~~m~~e~~~  132 (289)
T PF13204_consen   78 DFTRPNPAYFDHLDRRIEKANELGIEAALVPFWGCPY---V-PGTW----GFG-----------------PNIMPPENAE  132 (289)
T ss_dssp             --TT----HHHHHHHHHHHHHHTT-EEEEESS-HHHH---H-----------------------------TTSS-HHHHH
T ss_pred             CCCCCCHHHHHHHHHHHHHHHHCCCeEEEEEEECCcc---c-cccc----ccc-----------------ccCCCHHHHH
Confidence            334445678999999999999999985 333443221   0 0012    100                 0111233567


Q ss_pred             HHHHHHHHHHHHcccccceeeeecCCCCCccchHHHHHHHHHHHHhCCCCeEEEee-ccCCCCCCCCCCCcccccccccc
Q 007713          327 DYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTY-YCGPSDAPLGPTPFESFVKVPKF  405 (592)
Q Consensus       327 ~~L~~~~~hl~~kg~~~~~y~~i~DEP~~~~~~~~~~~~~~~i~~~~P~~ki~~t~-~~~p~~~~~~~~~~e~~~~vp~~  405 (592)
                      .|++=+++.+++.-  .-+++..-|.-......+.++++++.|++.+|.-  +.|+ +|+.      .+..+.|.+.   
T Consensus       133 ~Y~~yv~~Ry~~~~--NviW~l~gd~~~~~~~~~~w~~~~~~i~~~dp~~--L~T~H~~~~------~~~~~~~~~~---  199 (289)
T PF13204_consen  133 RYGRYVVARYGAYP--NVIWILGGDYFDTEKTRADWDAMARGIKENDPYQ--LITIHPCGR------TSSPDWFHDE---  199 (289)
T ss_dssp             HHHHHHHHHHTT-S--SEEEEEESSS--TTSSHHHHHHHHHHHHHH--SS---EEEEE-BT------EBTHHHHTT----
T ss_pred             HHHHHHHHHHhcCC--CCEEEecCccCCCCcCHHHHHHHHHHHHhhCCCC--cEEEeCCCC------CCcchhhcCC---
Confidence            77777777776651  2234444555123567788899999999999965  4433 3221      0111112211   


Q ss_pred             cCCccccccccccccCCch--hhhHHhH---hhcccCCCceeEEEecCCCCCCCCCccc----CCchhHHHHHHHHHHHc
Q 007713          406 LRPHTQIYCTSEWVLGNRE--DLVKDIV---TELQPENGEEWWTYVCMGPSDPHPNWHL----GMRGSQHRAVMWRVWKE  476 (592)
Q Consensus       406 ~~~~idi~c~~~wv~~~~~--~~~~~~~---~~~~~~~G~~~W~Y~C~~p~~~~pN~fi----~~p~~~~R~~gw~~~~~  476 (592)
                        +.+|+++.-   .|+..  ...-..+   ...+....|++.-=-||.  ...|...-    .....+.|--.|.+.--
T Consensus       200 --~Wldf~~~Q---sgh~~~~~~~~~~~~~~~~~~~~p~KPvin~Ep~Y--Eg~~~~~~~~~~~~~~~dvrr~aw~svla  272 (289)
T PF13204_consen  200 --PWLDFNMYQ---SGHNRYDQDNWYYLPEEFDYRRKPVKPVINGEPCY--EGIPYSRWGYNGRFSAEDVRRRAWWSVLA  272 (289)
T ss_dssp             --TT--SEEEB-----S--TT--THHHH--HHHHTSSS---EEESS-----BT-BTTSS-TS-B--HHHHHHHHHHHHHC
T ss_pred             --CcceEEEee---cCCCcccchHHHHHhhhhhhhhCCCCCEEcCcccc--cCCCCCcCcccCCCCHHHHHHHHHHHHhc
Confidence              223433221   12210  1111111   222446677775334553  11222111    24456777789999999


Q ss_pred             CC-CEEEEeecccc
Q 007713          477 GG-TGFLYWGANCY  489 (592)
Q Consensus       477 g~-~GfL~W~~n~w  489 (592)
                      |. -|+-|.+-.-|
T Consensus       273 Ga~aG~tYG~~~iW  286 (289)
T PF13204_consen  273 GAYAGHTYGAHGIW  286 (289)
T ss_dssp             T--SEEEE-BHHHH
T ss_pred             CCCccccCCCCCcc
Confidence            99 99998875555


No 28 
>PF09099 Qn_am_d_aIII:  Quinohemoprotein amine dehydrogenase, alpha subunit domain III;  InterPro: IPR015183 This domain is predominantly found in the prokaryotic protein quinohemoprotein amine dehydrogenase, adopting an immunoglobulin-like beta-sandwich fold, with seven strands arranged into two beta sheets; the fold is possibly related to the immunoglobulin and/or fibronectin type III superfamilies. The precise function of this domain has not, as yet, been defined []. ; PDB: 1JMZ_A 1JMX_A 1PBY_A 1JJU_A.
Probab=26.94  E-value=62  Score=27.60  Aligned_cols=21  Identities=24%  Similarity=0.370  Sum_probs=18.7

Q ss_pred             EEEEEEEcCCCCCCceeEEEE
Q 007713          105 AVWVSIDAPYAQPPGLYEGEI  125 (592)
Q Consensus       105 ~lWv~v~VP~~a~pG~Y~g~i  125 (592)
                      .++++|.+.++++||.|+..+
T Consensus        49 ~v~v~V~~aa~a~~G~~~v~v   69 (81)
T PF09099_consen   49 EVVVRVKAAADAAPGIRTVRV   69 (81)
T ss_dssp             CEEEEEEEECTSSSEEEEEEE
T ss_pred             EEEEEEEEcCCCCCccEEEEe
Confidence            589999999999999998665


No 29 
>PF08428 Rib:  Rib/alpha-like repeat;  InterPro: IPR012706 This entry represents a region of about 79 amino acids found tandemly repeated up to fourteen times within the proteins that contain it. The repeats lack cysteines and are highly conserved, even at the DNA level, within and between proteins []. Proteins containing these repeats include the Rib and alpha surface antigens of group B Streptococcus, Esp of Enterococcus faecalis (Streptococcus faecalis), and related proteins of Lactobacillus. Most members of this protein family also have the cell wall anchor motif, LPXTG, shared by many staphyloccal and streptococcal surface antigens. These repeats are thought to define protective epitopes and may play a role in generating phenotypic and genotypic variation [].
Probab=26.02  E-value=1.4e+02  Score=24.05  Aligned_cols=33  Identities=30%  Similarity=0.528  Sum_probs=24.2

Q ss_pred             eecCCCeeEEEEEEEcCCCCCCceeEEEEEEEecCCc
Q 007713           97 SLIPGETTAVWVSIDAPYAQPPGLYEGEIIITSKADT  133 (592)
Q Consensus        97 ~v~~~~~q~lWv~v~VP~~a~pG~Y~g~i~v~~~~~g  133 (592)
                      +++.|. .--|.+  .|....+|.|++.|+|+-. ||
T Consensus        20 ~lP~gt-~~~w~~--~pdt~~~G~~~~~V~Vtyp-Dg   52 (65)
T PF08428_consen   20 NLPAGT-TYSWKD--KPDTSKPGTKTGKVKVTYP-DG   52 (65)
T ss_pred             cCCCCc-ceeecc--CCccccCccEEEEEEEEcC-CC
Confidence            444433 346766  8999999999999999952 44


No 30 
>PF00868 Transglut_N:  Transglutaminase family;  InterPro: IPR001102 Synonym(s): Protein-glutamine gamma-glutamyltransferase, Fibrinoligase, TGase Protein-glutamine gamma-glutamyltransferases (2.3.2.13 from EC) (TGase) are calcium-dependent enzymes that catalyse the cross-linking of proteins by promoting the formation of isopeptide bonds between the gamma-carboxyl group of a glutamine in one polypeptide chain and the epsilon-amino group of a lysine in a second polypeptide chain. TGases also catalyse the conjugation of polyamines to proteins [, ]. Transglutaminases are widely distributed in various organs, tissues and body fluids. The best known transglutaminase is blood coagulation factor XIII, a plasma tetrameric protein composed of two catalytic A subunits and two non-catalytic B subunits. Factor XIII is responsible for cross-linking fibrin chains, thus stabilising the fibrin clot. There are commonly three domains: N-terminal, middle (IPR013808 from INTERPRO) and C-terminal (IPR013807 from INTERPRO). This entry represents the N-terminal domain found in transglutaminases.; GO: 0018149 peptide cross-linking; PDB: 1L9N_B 1NUF_A 1NUD_A 1NUG_B 1L9M_A 1KV3_C 3S3S_A 2Q3Z_A 3LY6_A 3S3P_A ....
Probab=25.45  E-value=5e+02  Score=23.36  Aligned_cols=29  Identities=24%  Similarity=0.370  Sum_probs=20.0

Q ss_pred             CCCeeEEEEEEEcCCCCCCceeEEEEEEE
Q 007713          100 PGETTAVWVSIDAPYAQPPGLYEGEIIIT  128 (592)
Q Consensus       100 ~~~~q~lWv~v~VP~~a~pG~Y~g~i~v~  128 (592)
                      ..+...+=|.|.+|++|.-|.|+-+|.++
T Consensus        89 ~~~~~~~tv~V~spa~A~VG~y~l~v~~~  117 (118)
T PF00868_consen   89 SQDGNSVTVSVTSPANAPVGRYKLSVETK  117 (118)
T ss_dssp             EEETTEEEEEEE--TTS--EEEEEEEEEE
T ss_pred             ecCCCEEEEEEECCCCCceEEEEEEEEEe
Confidence            33444577899999999999999999886


No 31 
>PF05345 He_PIG:  Putative Ig domain;  InterPro: IPR008009 This alignment represents the conserved core region of a ~90 residue repeat found in several haemagglutinins and other cell surface proteins. Sequence similarities to Hyalin (IPR003410 from INTERPRO) and the PKD domain (IPR000601 from INTERPRO) suggest an Ig-like fold so this family may be similar in function to the (IPR003791 from INTERPRO) and (IPR003790 from INTERPRO) protein families.
Probab=23.01  E-value=1.3e+02  Score=22.90  Aligned_cols=19  Identities=21%  Similarity=0.315  Sum_probs=15.0

Q ss_pred             EcCCCCCCceeEEEEEEEe
Q 007713          111 DAPYAQPPGLYEGEIIITS  129 (592)
Q Consensus       111 ~VP~~a~pG~Y~g~i~v~~  129 (592)
                      .+....++|.|+.+|+++.
T Consensus        28 tp~~~~~~G~y~~~vtatd   46 (49)
T PF05345_consen   28 TPTSSVQPGTYTFTVTATD   46 (49)
T ss_pred             ecCCCccccEEEEEEEEEc
Confidence            4555668899999999984


No 32 
>PF14734 DUF4469:  Domain of unknown function (DUF4469) with IG-like fold
Probab=22.35  E-value=89  Score=27.76  Aligned_cols=23  Identities=17%  Similarity=0.110  Sum_probs=20.5

Q ss_pred             EEEEEcCCCCCCceeEEEEEEEe
Q 007713          107 WVSIDAPYAQPPGLYEGEIIITS  129 (592)
Q Consensus       107 Wv~v~VP~~a~pG~Y~g~i~v~~  129 (592)
                      =+.+.||++-++|.|+.+|+=..
T Consensus        65 ~l~~~lPa~L~~G~Y~l~V~Tq~   87 (102)
T PF14734_consen   65 RLIFILPADLAAGEYTLEVRTQY   87 (102)
T ss_pred             EEEEECcCccCceEEEEEEEEEe
Confidence            36889999999999999999875


No 33 
>TIGR03769 P_ac_wall_RPT actinobacterial surface-anchored protein domain. This model describes a repeat domain that one to three times in Actinobacterial proteins, some of which have LPXTG-type sortase recognition motifs for covalent attachment to the Gram-positive cell wall. Where it occurs with duplication in an LPXTG-anchored protein, it tends to be adjacent to the substrate-binding protein of the gene trio of an ABC transporter system, where that substrate-binding protein has a single copy of this same domain. This arrangement suggests a substrate-binding relay system, with the LPXTG protein acting as a substrate receptor.
Probab=22.27  E-value=1e+02  Score=22.77  Aligned_cols=14  Identities=29%  Similarity=0.465  Sum_probs=12.3

Q ss_pred             CCCceeEEEEEEEe
Q 007713          116 QPPGLYEGEIIITS  129 (592)
Q Consensus       116 a~pG~Y~g~i~v~~  129 (592)
                      .+||.|+.+++.+.
T Consensus        10 T~PG~Y~l~~~a~~   23 (41)
T TIGR03769        10 TKPGTYTLTVQATA   23 (41)
T ss_pred             CCCeEEEEEEEEEE
Confidence            58999999999973


No 34 
>PRK09778 putative antitoxin of the YafO-YafN toxin-antitoxin system; Provisional
Probab=20.90  E-value=1.9e+02  Score=25.54  Aligned_cols=25  Identities=16%  Similarity=0.198  Sum_probs=22.0

Q ss_pred             hhHHHHHHHHHHHHHHHHHHhhcCc
Q 007713          529 LRLERILSGLQDIEYLNLYASRYGR  553 (592)
Q Consensus       529 iRle~lreGieDye~L~lL~~~~~~  553 (592)
                      =-+|.|.+-++|+|+.++.+++...
T Consensus        44 ~~yE~m~e~LeD~eL~~l~~~R~~~   68 (97)
T PRK09778         44 SAFEALMDMLAEQEEKKPIKARFRP   68 (97)
T ss_pred             HHHHHHHHHHHhHHHHHHHHHHcCc
Confidence            3579999999999999999998664


No 35 
>PF07495 Y_Y_Y:  Y_Y_Y domain;  InterPro: IPR011123 This region is mostly found at the end of the beta propellers (IPR011110 from INTERPRO) in a family of two component regulators. However they are also found tandemly repeated in Q891H4 from SWISSPROT without other signal conduction domains being present. It is named after the conserved tyrosines found in the alignment. The exact function is not known.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=20.83  E-value=1.4e+02  Score=23.16  Aligned_cols=14  Identities=36%  Similarity=0.608  Sum_probs=11.3

Q ss_pred             CCCceeEEEEEEEe
Q 007713          116 QPPGLYEGEIIITS  129 (592)
Q Consensus       116 a~pG~Y~g~i~v~~  129 (592)
                      -+||.|+..|+++.
T Consensus        36 L~~G~Y~l~V~a~~   49 (66)
T PF07495_consen   36 LPPGKYTLEVRAKD   49 (66)
T ss_dssp             --SEEEEEEEEEEE
T ss_pred             CCCEEEEEEEEEEC
Confidence            48999999999986


Done!