Query         psy4833
Match_columns 141
No_of_seqs    132 out of 414
Neff          4.2 
Searched_HMMs 46136
Date          Fri Aug 16 18:55:52 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy4833.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/4833hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG4659|consensus               99.8 8.9E-19 1.9E-23  169.5  11.1  126    2-129    45-182 (1899)
  2 PF13715 DUF4480:  Domain of un  99.7 1.6E-16 3.4E-21  108.7  10.4   79   10-93      1-80  (88)
  3 PF13620 CarboxypepD_reg:  Carb  99.5 7.1E-13 1.5E-17   88.9   9.6   79   10-90      1-82  (82)
  4 cd03863 M14_CPD_II The second   99.4 1.6E-12 3.5E-17  112.9   9.0   77    8-89    296-373 (375)
  5 cd06245 M14_CPD_III The third   99.3 3.6E-11 7.9E-16  103.9  10.1   76    8-89    286-361 (363)
  6 cd03858 M14_CP_N-E_like Carbox  99.2 6.5E-11 1.4E-15  101.2   9.2   72    9-84    298-370 (374)
  7 cd03864 M14_CPN Peptidase M14   99.1 2.5E-10 5.4E-15   99.9   9.6   71    9-84    316-387 (392)
  8 cd03867 M14_CPZ Peptidase M14-  99.1 3.2E-10 6.9E-15   98.8   9.3   66    9-78    318-383 (395)
  9 cd03868 M14_CPD_I The first ca  99.1 4.5E-10 9.8E-15   96.6   8.2   73    8-84    295-368 (372)
 10 cd03865 M14_CPE_H Peptidase M1  99.0 6.1E-10 1.3E-14   98.1   8.0   71   10-84    327-397 (402)
 11 cd03866 M14_CPM Peptidase M14   98.9 4.8E-09   1E-13   91.1   8.8   69    9-79    295-363 (376)
 12 PF08400 phage_tail_N:  Prophag  98.1 1.9E-05 4.2E-10   61.0   9.0   74    8-94      2-82  (134)
 13 cd03869 M14_CPX_like Peptidase  98.0   2E-05 4.4E-10   69.9   7.2   67    9-79    329-395 (405)
 14 PF07210 DUF1416:  Protein of u  97.7 0.00037 8.1E-09   50.5   9.1   67    3-71      2-70  (85)
 15 KOG2649|consensus               97.6 0.00024 5.3E-09   64.7   8.5   89    9-102   378-466 (500)
 16 PRK15036 hydroxyisourate hydro  97.6 0.00012 2.7E-09   56.3   4.9   61    9-69     27-96  (137)
 17 cd00421 intradiol_dioxygenase   97.4  0.0016 3.4E-08   49.8   9.4   74    5-78      8-107 (146)
 18 cd03461 1,2-HQD Hydroxyquinol   97.4   0.002 4.4E-08   54.8  10.2   75    5-80    117-232 (277)
 19 TIGR02423 protocat_alph protoc  97.3  0.0031 6.8E-08   51.0  10.5   77    6-82     37-146 (193)
 20 cd03458 Catechol_intradiol_dio  97.2  0.0037   8E-08   52.8  10.0   76    5-80    101-216 (256)
 21 TIGR02438 catachol_actin catec  97.2  0.0032 6.8E-08   53.9   9.7   78    4-81    128-245 (281)
 22 PF00775 Dioxygenase_C:  Dioxyg  97.2  0.0061 1.3E-07   48.7  10.5   48    4-51     25-89  (183)
 23 cd03462 1,2-CCD chlorocatechol  97.1  0.0045 9.8E-08   52.0   9.8   77    5-81     96-212 (247)
 24 cd03459 3,4-PCD Protocatechuat  97.1  0.0066 1.4E-07   47.6   9.9   74    5-78     12-118 (158)
 25 TIGR02439 catechol_proteo cate  97.1  0.0056 1.2E-07   52.4  10.2   79    5-83    125-243 (285)
 26 PF02369 Big_1:  Bacterial Ig-l  97.1  0.0047   1E-07   44.2   8.3   70    4-73     20-97  (100)
 27 TIGR02465 chlorocat_1_2 chloro  97.1  0.0064 1.4E-07   51.0  10.1   77    6-82     96-212 (246)
 28 TIGR02422 protocat_beta protoc  97.1   0.006 1.3E-07   50.4   9.7   75    7-81     59-168 (220)
 29 cd03460 1,2-CTD Catechol 1,2 d  97.0  0.0076 1.7E-07   51.6  10.2   79    5-83    121-239 (282)
 30 cd03464 3,4-PCD_beta Protocate  96.9    0.01 2.3E-07   49.0  10.1   75    6-80     63-172 (220)
 31 cd03463 3,4-PCD_alpha Protocat  96.9   0.011 2.3E-07   47.6   9.9   73    6-78     34-138 (185)
 32 smart00634 BID_1 Bacterial Ig-  96.9    0.01 2.2E-07   41.5   8.1   69    4-72     15-90  (92)
 33 PF10670 DUF4198:  Domain of un  96.5   0.012 2.5E-07   45.3   7.0   54   10-64    152-211 (215)
 34 COG3485 PcaH Protocatechuate 3  96.5  0.0064 1.4E-07   50.6   5.8   73    7-79     71-177 (226)
 35 KOG1948|consensus               96.4   0.014   3E-07   57.2   8.1   64    9-76    316-383 (1165)
 36 PRK15310 fimbrial outer membra  95.4   0.056 1.2E-06   52.6   7.7   78    8-92    785-863 (895)
 37 PF11974 MG1:  Alpha-2-macroglo  95.3   0.024 5.3E-07   40.7   3.7   39   15-53     19-60  (97)
 38 PF01190 Pollen_Ole_e_I:  Polle  95.2   0.044 9.6E-07   38.6   4.7   36   20-55     18-58  (97)
 39 cd03457 intradiol_dioxygenase_  94.7     0.1 2.2E-06   42.0   6.1   47    5-51     23-92  (188)
 40 COG5266 CbiK ABC-type Co2+ tra  92.4    0.22 4.8E-06   42.6   4.7   42   10-53    173-229 (264)
 41 PF12985 DUF3869:  Domain of un  92.2    0.75 1.6E-05   34.5   6.7   56    7-77     20-77  (104)
 42 PF14686 fn3_3:  Polysaccharide  91.5     1.9 4.1E-05   31.1   8.0   49   10-60      4-64  (95)
 43 PF08308 PEGA:  PEGA domain;  I  90.8    0.99 2.1E-05   29.6   5.6   52   24-84     11-63  (71)
 44 TIGR02962 hdxy_isourate hydrox  90.5    0.43 9.2E-06   35.7   4.0   59   11-69      3-71  (112)
 45 PF01060 DUF290:  Transthyretin  90.1    0.56 1.2E-05   32.3   4.0   43   12-55      1-48  (80)
 46 PF05738 Cna_B:  Cna protein B-  89.3       3 6.6E-05   26.9   6.9   47   23-70      1-53  (70)
 47 cd05822 TLP_HIUase HIUase (5-h  87.7       1 2.3E-05   33.6   4.3   58   11-68      3-70  (112)
 48 KOG1948|consensus               84.3     2.3 4.9E-05   42.4   5.8   48    9-56    119-167 (1165)
 49 PF10577 UPF0560:  Uncharacteri  82.4      11 0.00023   37.1   9.4   85   10-94      2-105 (807)
 50 PF12866 DUF3823:  Protein of u  80.5      10 0.00022   31.3   7.6   29    2-31     16-49  (222)
 51 cd05469 Transthyretin_like Tra  78.9     2.2 4.9E-05   32.1   2.9   57   11-67      3-69  (113)
 52 cd05821 TLP_Transthyretin Tran  78.6     2.7 5.8E-05   32.1   3.3   60    9-68      7-76  (121)
 53 PF00576 Transthyretin:  HIUase  77.2     2.6 5.5E-05   31.5   2.8   39   11-49      3-47  (112)
 54 smart00095 TR_THY Transthyreti  76.4     4.7  0.0001   30.7   4.1   58   10-67      5-72  (121)
 55 PF03785 Peptidase_C25_C:  Pept  75.9      15 0.00033   26.5   6.4   51   24-75     26-80  (81)
 56 PF09430 DUF2012:  Protein of u  73.1      23 0.00049   26.2   7.0   54   23-77      7-61  (123)
 57 PF13953 PapC_C:  PapC C-termin  68.4      32 0.00069   22.8   7.3   47   14-64      2-49  (68)
 58 COG2351 Transthyretin-like pro  62.8      18  0.0004   28.0   4.8   56   10-65     10-75  (124)
 59 KOG3006|consensus               57.0      65  0.0014   25.2   6.9   54   10-63     22-85  (132)
 60 PF14054 DUF4249:  Domain of un  56.4   1E+02  0.0023   24.8  10.5   66   22-90     59-132 (298)
 61 PF05688 DUF824:  Salmonella re  52.4      23 0.00051   22.9   3.3   26    8-33     14-39  (47)
 62 COG1470 Predicted membrane pro  51.3      85  0.0018   29.5   7.8   77    8-90    186-263 (513)
 63 PRK15314 outer membrane protei  46.7      49  0.0011   36.1   6.1   24   10-33   1609-1632(2435)
 64 PF09244 DUF1964:  Domain of un  45.5      46   0.001   23.3   4.0   27   44-72      2-28  (68)
 65 PF13115 YtkA:  YtkA-like        42.9      30 0.00065   23.1   2.9   22    8-32     23-44  (86)
 66 PF03717 PBP_dimer:  Penicillin  42.2      15 0.00033   27.7   1.4   17   10-26      4-20  (178)
 67 PF11184 DUF2969:  Protein of u  41.3      93   0.002   21.5   5.1   39   14-55     11-49  (71)
 68 PRK15207 long polar fimbrial o  38.2   1E+02  0.0022   30.1   6.5   48    3-51    751-799 (842)
 69 PF07492 Trehalase_Ca-bi:  Neut  38.0      35 0.00076   20.5   2.2   18   43-60      9-26  (30)
 70 PF06115 DUF956:  Domain of unk  37.5      68  0.0015   24.6   4.3   33   22-56     54-88  (118)
 71 COG3188 FimD P pilus assembly   35.1 1.6E+02  0.0035   28.9   7.4   49    3-53    747-796 (835)
 72 PF03983 SHD1:  SLA1 homology d  32.9      31 0.00067   24.1   1.7   43   40-92     14-56  (70)
 73 KOG4217|consensus               32.2      15 0.00032   34.6  -0.1   15  113-134   277-291 (605)
 74 PRK15248 fimbrial outer membra  31.4 1.6E+02  0.0034   28.8   6.7   47    2-51    762-809 (853)
 75 COG3422 Uncharacterized conser  31.1      58  0.0013   22.3   2.7   28   42-71      9-37  (59)
 76 PF14289 DUF4369:  Domain of un  31.0 1.7E+02  0.0036   19.6  10.1   44    7-53     11-55  (106)
 77 PF13753 SWM_repeat:  Putative   30.3      88  0.0019   26.0   4.2   44    6-55    264-308 (317)
 78 COG4594 FecB ABC-type Fe3+-cit  29.5      34 0.00074   30.1   1.7   33   38-70     33-65  (310)
 79 PRK15213 fimbrial outer membra  29.4 2.1E+02  0.0044   27.6   7.0   51    3-56    718-770 (797)
 80 PRK07121 hypothetical protein;  28.5      28 0.00061   30.7   1.1   15   13-27    439-453 (492)
 81 TIGR02485 CobZ_N-term precorri  28.2      27 0.00058   30.1   0.8   16   11-26    375-390 (432)
 82 PF06030 DUF916:  Bacterial pro  27.2 1.4E+02   0.003   22.2   4.4   38   42-79     12-50  (121)
 83 PRK15235 outer membrane fimbri  27.0 1.6E+02  0.0035   28.5   5.9   28    4-31    723-751 (814)
 84 PRK15223 pilin outer membrane   24.8   5E+02   0.011   25.3   8.7   58    3-64    749-808 (836)
 85 PRK12837 3-ketosteroid-delta-1  24.6      39 0.00084   30.3   1.2   16   11-26    457-472 (513)
 86 PF03404 Mo-co_dimer:  Mo-co ox  24.3 3.1E+02  0.0067   20.6   8.8   40    7-46     27-66  (131)
 87 PRK08274 tricarballylate dehyd  24.2      35 0.00075   29.6   0.8   16   11-26    406-421 (466)
 88 PRK15316 RatA-like protein; Pr  22.4 2.2E+02  0.0048   31.7   6.1   55    6-69   2160-2250(2683)
 89 PRK15298 fimbrial outer membra  22.3 3.1E+02  0.0067   26.8   6.9   45    6-51    763-808 (848)
 90 TIGR01813 flavo_cyto_c flavocy  22.0      45 0.00097   28.6   1.1   17   11-27    391-407 (439)
 91 PRK06481 fumarate reductase fl  21.1      47   0.001   29.7   1.0   16   11-26    449-464 (506)
 92 PF09624 DUF2393:  Protein of u  20.9 1.8E+02  0.0039   21.6   4.1   26    6-31     61-86  (149)
 93 PRK15273 putative fimbrial out  20.9 2.5E+02  0.0055   27.6   6.0   36   13-51    746-782 (881)
 94 PRK12834 putative FAD-binding   20.9      43 0.00094   30.2   0.8   16   11-26    492-507 (549)
 95 COG3549 HigB Plasmid maintenan  20.7 1.4E+02  0.0031   22.1   3.3   23   43-67     60-82  (94)
 96 PF00207 A2M:  Alpha-2-macroglo  20.7 2.6E+02  0.0057   19.0   4.6   51   40-92      8-61  (92)
 97 COG0196 RibF FAD synthase [Coe  20.3 1.6E+02  0.0035   25.6   4.1   57    7-70    184-244 (304)
 98 KOG1691|consensus               20.3 5.2E+02   0.011   21.7   8.2   72   10-84     44-124 (210)
 99 PRK15304 putative fimbrial out  20.1 2.9E+02  0.0064   26.7   6.2   46    3-51    712-758 (801)

No 1  
>KOG4659|consensus
Probab=99.78  E-value=8.9e-19  Score=169.52  Aligned_cols=126  Identities=36%  Similarity=0.549  Sum_probs=114.5

Q ss_pred             CcCCcceEEEEEEEECCCCCeeeeEEEEeccCCCeeeceEecCCceEEEEecCCCcEEEEEEecCccceEEEEEccCccE
Q psy4833           2 TDDELRVAVVRGQVLSVQGLGITGIRVSVDKDSGRFGFTLTRGGGWFDMLVNGGGAVTLQFQRSPYRPQTRTVFIPWNQI   81 (141)
Q Consensus         2 ~~d~~r~~~IrG~V~D~~GePL~GVsV~Vkgt~~~~G~TiTD~DG~FsL~V~~g~~~tL~FSyiGY~sqe~~V~~~~n~~   81 (141)
                      +||++|.++|||+|+-.+|.||.||+|+...++ . ++|+|+.||.|+|.|++|.++||+|-+..|.+|+++|++|||++
T Consensus        45 ~fne~~~~vIrgrvv~~~~~pLVGVrVS~~~~~-~-yfTlTR~DG~FDL~vnGg~svtLqF~R~pF~~qkr~v~vpwnq~  122 (1899)
T KOG4659|consen   45 QFNENRISVIRGRVVWGGGVPLVGVRVSDAAHP-L-YFTLTREDGYFDLTVNGGRSVTLQFLRTPFQSQKRSVFVPWNQI  122 (1899)
T ss_pred             hhccccceEEeccEeecCCcceEEEEeeccccc-c-eEEEEecCceEEEEEcccceEEEEEccCCCcccceeEEeChhhE
Confidence            589999999999999999999999999999999 7 49999999999999999999999999999999999999999999


Q ss_pred             EEeCCeEEEecCCC------------ccceeccccccCcCCCCCcccccccCCCCCCcce
Q psy4833          82 VVLPPVKMQLSDAS------------DIFREVASGVTPSYAPKGNSDLQHYGPRSSNKTV  129 (141)
Q Consensus        82 ~~~~~v~m~~~d~~------------~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~  129 (141)
                      +++++|+|+..+..            ..+..|.|.|.|+++..|-.+|-+.+++--..-|
T Consensus       123 i~id~vvm~~~~~~~~~~~~~~~C~~~~~~~pdpvvissw~~tf~ga~pdr~~Iv~esqv  182 (1899)
T KOG4659|consen  123 IHIDDVVMYRQEGGSPPAPARAKCSPTLRRIPDPVVISSWQYTFDGACPDRTSIVVESQV  182 (1899)
T ss_pred             EEEEeEEEEeecCCCCCCCCCCccChhhccCCCcEEecCcccccCCcCCCCcccccchhh
Confidence            99999999977653            2345678899999999999999999997654433


No 2  
>PF13715 DUF4480:  Domain of unknown function (DUF4480)
Probab=99.70  E-value=1.6e-16  Score=108.74  Aligned_cols=79  Identities=23%  Similarity=0.376  Sum_probs=68.2

Q ss_pred             EEEEEEECCC-CCeeeeEEEEeccCCCeeeceEecCCceEEEEecCCCcEEEEEEecCccceEEEEEccCccEEEeCCeE
Q psy4833          10 VVRGQVLSVQ-GLGITGIRVSVDKDSGRFGFTLTRGGGWFDMLVNGGGAVTLQFQRSPYRPQTRTVFIPWNQIVVLPPVK   88 (141)
Q Consensus        10 ~IrG~V~D~~-GePL~GVsV~Vkgt~~~~G~TiTD~DG~FsL~V~~g~~~tL~FSyiGY~sqe~~V~~~~n~~~~~~~v~   88 (141)
                      +|+|+|+|++ |+||+||+|.+++.. . + ++||.+|+|.|.++.+. .+|.||++||+++++.+....+....+ .+.
T Consensus         1 ti~G~V~d~~t~~pl~~a~V~~~~~~-~-~-~~Td~~G~F~i~~~~g~-~~l~is~~Gy~~~~~~i~~~~~~~~~~-~i~   75 (88)
T PF13715_consen    1 TISGKVVDSDTGEPLPGATVYLKNTK-K-G-TVTDENGRFSIKLPEGD-YTLKISYIGYETKTITISVNSNKNTNL-NIY   75 (88)
T ss_pred             CEEEEEEECCCCCCccCeEEEEeCCc-c-e-EEECCCeEEEEEEcCCC-eEEEEEEeCEEEEEEEEEecCCCEEEE-EEE
Confidence            5899999987 999999999999987 4 4 99999999999988665 579999999999999999987765555 688


Q ss_pred             EEecC
Q psy4833          89 MQLSD   93 (141)
Q Consensus        89 m~~~d   93 (141)
                      |.+..
T Consensus        76 L~~~~   80 (88)
T PF13715_consen   76 LEPKS   80 (88)
T ss_pred             EeeCc
Confidence            87653


No 3  
>PF13620 CarboxypepD_reg:  Carboxypeptidase regulatory-like domain; PDB: 3MN8_D 3P0D_I 3KCP_A 2B59_B 1UWY_A 1H8L_A 1QMU_A 2NSM_A.
Probab=99.46  E-value=7.1e-13  Score=88.90  Aligned_cols=79  Identities=24%  Similarity=0.409  Sum_probs=58.1

Q ss_pred             EEEEEEECCCCCeeeeEEEEeccCCC-eeeceEecCCceEEEE-ecCCCcEEEEEEecCccceEE-EEEccCccEEEeCC
Q psy4833          10 VVRGQVLSVQGLGITGIRVSVDKDSG-RFGFTLTRGGGWFDML-VNGGGAVTLQFQRSPYRPQTR-TVFIPWNQIVVLPP   86 (141)
Q Consensus        10 ~IrG~V~D~~GePL~GVsV~Vkgt~~-~~G~TiTD~DG~FsL~-V~~g~~~tL~FSyiGY~sqe~-~V~~~~n~~~~~~~   86 (141)
                      +|+|+|+|++|+||+||+|.+....+ ....+.||.+|+|.|. ++.| .++|.+++.||.+... .+.+..++...+ +
T Consensus         1 tI~G~V~d~~g~pv~~a~V~l~~~~~~~~~~~~Td~~G~f~~~~l~~g-~Y~l~v~~~g~~~~~~~~v~v~~~~~~~~-~   78 (82)
T PF13620_consen    1 TISGTVTDATGQPVPGATVTLTDQDGGTVYTTTTDSDGRFSFEGLPPG-TYTLRVSAPGYQPQTQENVTVTAGQTTTV-D   78 (82)
T ss_dssp             -EEEEEEETTSCBHTT-EEEET--TTTECCEEE--TTSEEEEEEE-SE-EEEEEEEBTTEE-EEEEEEEESSSSEEE---
T ss_pred             CEEEEEEcCCCCCcCCEEEEEEEeeCCCEEEEEECCCceEEEEccCCE-eEEEEEEECCcceEEEEEEEEeCCCEEEE-E
Confidence            58999999999999999999983211 1234999999999998 7765 5689999999999998 699998887776 7


Q ss_pred             eEEE
Q psy4833          87 VKMQ   90 (141)
Q Consensus        87 v~m~   90 (141)
                      +.|.
T Consensus        79 i~L~   82 (82)
T PF13620_consen   79 ITLE   82 (82)
T ss_dssp             EEEE
T ss_pred             EEEC
Confidence            7764


No 4  
>cd03863 M14_CPD_II The second carboxypeptidase (CP)-like domain of  Carboxypeptidase D (CPD; EC 3.4.17.22), domain II. CPD differs from all other metallocarboxypeptidases in that it contains multiple CP-like domains. CPD belongs to the N/E-like subfamily of the M14 family of metallocarboxypeptidases (MCPs).The M14 family are zinc-binding CPs which hydrolyze single, C-terminal amino acids from polypeptide chains, and have a recognition site for the free C-terminal carboxyl group, which is a key determinant of specificity. CPD is a single-chain protein containing a signal peptide, three tandem repeats of CP-like domains separated by short bridge regions, followed by a transmembrane domain, and a C-terminal cytosolic tail. The first two CP-like domains of CPD contain all of the essential active site and substrate-binding residues, while the third CP-like domain lacks critical residues necessary for enzymatic activity and is inactive towards standard CP substrates. Domain I is optimally ac
Probab=99.38  E-value=1.6e-12  Score=112.91  Aligned_cols=77  Identities=26%  Similarity=0.260  Sum_probs=66.0

Q ss_pred             eEEEEEEEECC-CCCeeeeEEEEeccCCCeeeceEecCCceEEEEecCCCcEEEEEEecCccceEEEEEccCccEEEeCC
Q psy4833           8 VAVVRGQVLSV-QGLGITGIRVSVDKDSGRFGFTLTRGGGWFDMLVNGGGAVTLQFQRSPYRPQTRTVFIPWNQIVVLPP   86 (141)
Q Consensus         8 ~~~IrG~V~D~-~GePL~GVsV~Vkgt~~~~G~TiTD~DG~FsL~V~~g~~~tL~FSyiGY~sqe~~V~~~~n~~~~~~~   86 (141)
                      ...|+|+|+|+ +|+||+||+|+|+|.. . | ++||.||+|.+.++.|. .+|++|++||+++++.|.+..++...+ +
T Consensus       296 ~~gI~G~V~D~~~g~pl~~AtV~V~g~~-~-~-~~Td~~G~f~~~l~pG~-ytl~vs~~GY~~~~~~v~V~~~~~~~~-~  370 (375)
T cd03863         296 HRGVRGFVLDATDGRGILNATISVADIN-H-P-VTTYKDGDYWRLLVPGT-YKVTASARGYDPVTKTVEVDSKGAVQV-N  370 (375)
T ss_pred             cCeEEEEEEeCCCCCCCCCeEEEEecCc-C-c-eEECCCccEEEccCCee-EEEEEEEcCcccEEEEEEEcCCCcEEE-E
Confidence            45799999996 7999999999999987 4 5 99999999999988775 689999999999999999987766554 4


Q ss_pred             eEE
Q psy4833          87 VKM   89 (141)
Q Consensus        87 v~m   89 (141)
                      +.|
T Consensus       371 ~~L  373 (375)
T cd03863         371 FTL  373 (375)
T ss_pred             EEe
Confidence            444


No 5  
>cd06245 M14_CPD_III The third carboxypeptidase (CP)-like domain of Carboxypeptidase D (CPD; EC 3.4.17.22), domain III. CPD differs from all other metallocarboxypeptidases in that it contains multiple CP-like domains. CPD belongs to the N/E-like subfamily of the M14 family of metallocarboxypeptidases (MCPs).The M14 family are zinc-binding CPs which hydrolyze single, C-terminal amino acids from polypeptide chains, and have a recognition site for the free C-terminal carboxyl group, which is a key determinant of specificity. CPD is a single-chain protein containing a signal peptide, three tandem repeats of CP-like domains separated by short bridge regions, followed by a transmembrane domain, and a C-terminal cytosolic tail. The first two CP-like domains of CPD contain all of the essential active site and substrate-binding residues, the third CP-like domain lacks critical residues necessary for enzymatic activity and is inactive towards standard CP substrates. Domain I is optimally active a
Probab=99.25  E-value=3.6e-11  Score=103.94  Aligned_cols=76  Identities=18%  Similarity=0.359  Sum_probs=66.3

Q ss_pred             eEEEEEEEECCCCCeeeeEEEEeccCCCeeeceEecCCceEEEEecCCCcEEEEEEecCccceEEEEEccCccEEEeCCe
Q psy4833           8 VAVVRGQVLSVQGLGITGIRVSVDKDSGRFGFTLTRGGGWFDMLVNGGGAVTLQFQRSPYRPQTRTVFIPWNQIVVLPPV   87 (141)
Q Consensus         8 ~~~IrG~V~D~~GePL~GVsV~Vkgt~~~~G~TiTD~DG~FsL~V~~g~~~tL~FSyiGY~sqe~~V~~~~n~~~~~~~v   87 (141)
                      -..|+|+|+|.+|+||+||+|.|+|..   + ++||.+|.|.+.++.|. .+|.+|++||++++++|.+..++...+ ++
T Consensus       286 ~~gI~G~V~d~~g~pi~~A~V~v~g~~---~-~~T~~~G~y~~~L~pG~-y~v~vs~~Gy~~~~~~V~v~~~~~~~~-~f  359 (363)
T cd06245         286 HKGVHGVVTDKAGKPISGATIVLNGGH---R-VYTKEGGYFHVLLAPGQ-HNINVIAEGYQQEHLPVVVSHDEASSV-KI  359 (363)
T ss_pred             CcEEEEEEEcCCCCCccceEEEEeCCC---c-eEeCCCcEEEEecCCce-EEEEEEEeCceeEEEEEEEcCCCeEEE-EE
Confidence            468999999999999999999999864   4 89999999999987665 589999999999999999988777665 55


Q ss_pred             EE
Q psy4833          88 KM   89 (141)
Q Consensus        88 ~m   89 (141)
                      .|
T Consensus       360 ~L  361 (363)
T cd06245         360 VL  361 (363)
T ss_pred             Ee
Confidence            55


No 6  
>cd03858 M14_CP_N-E_like Carboxypeptidase (CP) N/E-like subfamily of the M14 family of metallocarboxypeptidases (MCPs). The M14 family are zinc-binding CPs which hydrolyze single, C-terminal amino acids from polypeptide chains, and have a recognition site for the free C-terminal carboxyl group, which is a key determinant of specificity. The N/E subfamily includes eight members, of which five (CPN, CPE, CPM, CPD, CPZ) are considered enzymatically active, while the other three are non-active (CPX1, PCX2, ACLP/AEBP1) and lack the critical active site and substrate-binding residues considered necessary for CP activity. These non-active members may function as binding proteins or display catalytic activity towards other substrates. Unlike the A/B CP subfamily, enzymes belonging to the N/E subfamily are not produced as inactive precursors that require proteolysis to produce the active form; rather, they rely on their substrate specificity and subcellular compartmentalization to prevent inappr
Probab=99.21  E-value=6.5e-11  Score=101.22  Aligned_cols=72  Identities=26%  Similarity=0.350  Sum_probs=62.3

Q ss_pred             EEEEEEEECCCCCeeeeEEEEeccCCCeeeceEecCCceEEEEecCCCcEEEEEEecCccceEEEEEccC-ccEEEe
Q psy4833           9 AVVRGQVLSVQGLGITGIRVSVDKDSGRFGFTLTRGGGWFDMLVNGGGAVTLQFQRSPYRPQTRTVFIPW-NQIVVL   84 (141)
Q Consensus         9 ~~IrG~V~D~~GePL~GVsV~Vkgt~~~~G~TiTD~DG~FsL~V~~g~~~tL~FSyiGY~sqe~~V~~~~-n~~~~~   84 (141)
                      ..|+|+|+|.+|+||+||+|.++|.. . + ++||.+|+|.+.++.| ..+|.+|++||+++++++.+.. ++...+
T Consensus       298 ~~i~G~V~d~~g~pl~~A~V~i~~~~-~-~-~~Td~~G~f~~~l~~G-~y~l~vs~~Gy~~~~~~v~v~~~g~~~~~  370 (374)
T cd03858         298 RGIKGFVRDANGNPIANATISVEGIN-H-D-VTTAEDGDYWRLLLPG-TYNVTASAPGYEPQTKSVVVPNDNSAVVV  370 (374)
T ss_pred             CceEEEEECCCCCccCCeEEEEecce-e-e-eEECCCceEEEecCCE-eEEEEEEEcCcceEEEEEEEecCCceEEE
Confidence            38999999999999999999999887 4 5 9999999999998755 4689999999999999888876 555544


No 7  
>cd03864 M14_CPN Peptidase M14 Carboxypeptidase N (CPN, also known as kininase I, creatine kinase conversion factor, plasma carboxypeptidase B, arginine carboxypeptidase, and protaminase; EC 3.4.17.3) is an extracellular glycoprotein synthesized in the liver and released into the blood, where it is present in high concentrations. CPN belongs to the N/E subfamily of the M14 family of metallocarboxypeptidases (MCPs).The M14 family are zinc-binding carboxypeptidases (CPs) which hydrolyze single, C-terminal amino acids from polypeptide chains, and have a recognition site for the free C-terminal carboxyl group, which is a key determinant of specificity. CPN plays an important role in protecting the body from excessive buildup of potentially deleterious peptides that normally act as local autocrine or paracrine hormones. It specifically removes C-terminal basic residues. As CPN can cleave lysine more avidly than arginine residues it is also called lysine carboxypeptidase. CPN substrates inclu
Probab=99.13  E-value=2.5e-10  Score=99.93  Aligned_cols=71  Identities=21%  Similarity=0.227  Sum_probs=60.8

Q ss_pred             EEEEEEEECCCCCeeeeEEEEeccCCCeeeceEecCCceEEEE-ecCCCcEEEEEEecCccceEEEEEccCccEEEe
Q psy4833           9 AVVRGQVLSVQGLGITGIRVSVDKDSGRFGFTLTRGGGWFDML-VNGGGAVTLQFQRSPYRPQTRTVFIPWNQIVVL   84 (141)
Q Consensus         9 ~~IrG~V~D~~GePL~GVsV~Vkgt~~~~G~TiTD~DG~FsL~-V~~g~~~tL~FSyiGY~sqe~~V~~~~n~~~~~   84 (141)
                      ..|+|+|+|.+|+||+||+|.|+|.. . + ++||.+|.| +. ++.| ..+|.+|++||++++++|.+..++...+
T Consensus       316 ~gI~G~V~D~~g~pi~~A~V~v~g~~-~-~-~~T~~~G~y-~r~l~pG-~Y~l~vs~~Gy~~~t~~v~V~~~~~~~~  387 (392)
T cd03864         316 QGIKGMVTDENNNGIANAVISVSGIS-H-D-VTSGTLGDY-FRLLLPG-TYTVTASAPGYQPSTVTVTVGPAEATLV  387 (392)
T ss_pred             CeEEEEEECCCCCccCCeEEEEECCc-c-c-eEECCCCcE-EecCCCe-eEEEEEEEcCceeEEEEEEEcCCCcEEE
Confidence            47999999999999999999999997 4 5 999999999 54 5555 4689999999999999999887665443


No 8  
>cd03867 M14_CPZ Peptidase M14-like domain of carboxypeptidase (CP) Z (CPZ), CPZ belongs to the N/E subfamily of the M14 family of metallocarboxypeptidases (MCPs). The M14 family are zinc-binding CPs which hydrolyze single, C-terminal amino acids from polypeptide chains, and have a recognition site for the free C-terminal carboxyl group, which is a key determinant of specificity. CPZ is a secreted Zn-dependent enzyme whose biological function is largely unknown. Unlike other members of the N/E subfamily, CPZ has a bipartite structure, which consists of an N-terminal cysteine-rich domain (CRD) whose sequence is similar to Wnt-binding proteins, and a C-terminal CP catalytic domain that removes C-terminal Arg residues from substrates. CPZ is enriched in the extracellular matrix and is widely distributed during early embryogenesis.  That the CRD of CPZ can bind to Wnt4 suggests that CPZ plays a role in Wnt signaling.
Probab=99.11  E-value=3.2e-10  Score=98.76  Aligned_cols=66  Identities=24%  Similarity=0.230  Sum_probs=58.8

Q ss_pred             EEEEEEEECCCCCeeeeEEEEeccCCCeeeceEecCCceEEEEecCCCcEEEEEEecCccceEEEEEccC
Q psy4833           9 AVVRGQVLSVQGLGITGIRVSVDKDSGRFGFTLTRGGGWFDMLVNGGGAVTLQFQRSPYRPQTRTVFIPW   78 (141)
Q Consensus         9 ~~IrG~V~D~~GePL~GVsV~Vkgt~~~~G~TiTD~DG~FsL~V~~g~~~tL~FSyiGY~sqe~~V~~~~   78 (141)
                      ..|+|+|+|++|+||+||+|.++|+. . + +.||.+|.|...++.|. .+|.+|++||++++.+|.++.
T Consensus       318 ~~i~G~V~D~~g~pi~~A~V~v~g~~-~-~-~~Td~~G~y~~~l~~G~-y~l~vs~~Gy~~~~~~v~v~~  383 (395)
T cd03867         318 RGIKGFVKDKDGNPIKGARISVRGIR-H-D-ITTAEDGDYWRLLPPGI-HIVSAQAPGYTKVMKRVTLPA  383 (395)
T ss_pred             ceeEEEEEcCCCCccCCeEEEEeccc-c-c-eEECCCceEEEecCCCc-EEEEEEecCeeeEEEEEEeCC
Confidence            47999999999999999999999987 3 5 99999999987777665 579999999999999999864


No 9  
>cd03868 M14_CPD_I The first carboxypeptidase (CP)-like domain of  Carboxypeptidase D (CPD; EC 3.4.17.22), domain I. CPD differs from all other metallocarboxypeptidases in that it contains multiple CP-like domains. CPD belongs to the N/E-like subfamily of the M14 family of metallocarboxypeptidases (MCPs).The M14 family are zinc-binding CPs which hydrolyze single, C-terminal amino acids from polypeptide chains, and have a recognition site for the free C-terminal carboxyl group, which is a key determinant of specificity. CPD is a single-chain protein containing a signal peptide, three tandem repeats of CP-like domains separated by short bridge regions, followed by a transmembrane domain, and a C-terminal cytosolic tail. The first two CP-like domains of CPD contain all of the essential active site and substrate-binding residues, the third CP-like domain lacks critical residues necessary for enzymatic activity and is inactive towards standard CP substrates. Domain I is optimally active at p
Probab=99.07  E-value=4.5e-10  Score=96.56  Aligned_cols=73  Identities=22%  Similarity=0.167  Sum_probs=61.2

Q ss_pred             eEEEEEEEECCCCCeeeeEEEEeccCCCeeeceEecCCceEEEEecCCCcEEEEEEecCccceEEE-EEccCccEEEe
Q psy4833           8 VAVVRGQVLSVQGLGITGIRVSVDKDSGRFGFTLTRGGGWFDMLVNGGGAVTLQFQRSPYRPQTRT-VFIPWNQIVVL   84 (141)
Q Consensus         8 ~~~IrG~V~D~~GePL~GVsV~Vkgt~~~~G~TiTD~DG~FsL~V~~g~~~tL~FSyiGY~sqe~~-V~~~~n~~~~~   84 (141)
                      ...|+|+|+|.+|+||+||+|.+++.. . + ++||.+|.|.+.++.| ..+|.+|+.||++++++ +.+..++...+
T Consensus       295 ~~~i~G~V~d~~g~pv~~A~V~v~~~~-~-~-~~td~~G~y~~~l~~G-~Y~l~vs~~Gf~~~~~~~v~v~~g~~~~~  368 (372)
T cd03868         295 HIGVKGFVRDASGNPIEDATIMVAGID-H-N-VTTAKFGDYWRLLLPG-TYTITAVAPGYEPSTVTDVVVKEGEATSV  368 (372)
T ss_pred             CCceEEEEEcCCCCcCCCcEEEEEecc-c-c-eEeCCCceEEecCCCE-EEEEEEEecCCCceEEeeEEEcCCCeEEE
Confidence            346899999999999999999999987 4 5 9999999998777756 46899999999999886 45766665554


No 10 
>cd03865 M14_CPE_H Peptidase M14 Carboxypeptidase (CP) E (CPE, also known as carboxypeptidase H, and enkephalin convertase; EC 3.4.17.10) belongs to the N/E subfamily of the M14 family of metallocarboxypeptidases (MCPs).The M14 family are zinc-binding CPs which hydrolyze single, C-terminal amino acids from polypeptide chains, and have a recognition site for the free C-terminal carboxyl group, which is a key determinant of specificity. CPE is an important enzyme responsible for the proteolytic processing of prohormone intermediates (such as pro-insulin, pro-opiomelanocortin, or pro-gonadotropin-releasing hormone) by specifically removing C-terminal basic residues. In addition, it has been proposed that the regulated secretory pathway (RSP) of the nervous and endocrine systems utilizes membrane-bound CPE as a sorting receptor. A naturally occurring point mutation in CPE reduces the stability of the enzyme and causes its degradation, leading to an accumulation of numerous neuroendocrine pe
Probab=99.04  E-value=6.1e-10  Score=98.09  Aligned_cols=71  Identities=23%  Similarity=0.286  Sum_probs=62.1

Q ss_pred             EEEEEEECCCCCeeeeEEEEeccCCCeeeceEecCCceEEEEecCCCcEEEEEEecCccceEEEEEccCccEEEe
Q psy4833          10 VVRGQVLSVQGLGITGIRVSVDKDSGRFGFTLTRGGGWFDMLVNGGGAVTLQFQRSPYRPQTRTVFIPWNQIVVL   84 (141)
Q Consensus        10 ~IrG~V~D~~GePL~GVsV~Vkgt~~~~G~TiTD~DG~FsL~V~~g~~~tL~FSyiGY~sqe~~V~~~~n~~~~~   84 (141)
                      -|+|+|+|.+|+||+||+|+|+|.. . + +.||.+|.|.+.++.|. .+|.+|+.||++++..|.++.++...+
T Consensus       327 gI~G~V~D~~g~pI~~AtV~V~g~~-~-~-~~T~~~G~Y~~~L~pG~-Ytv~vsa~Gy~~~~~~V~V~~~~~~~v  397 (402)
T cd03865         327 GVKGFVKDLQGNPIANATISVEGID-H-D-ITSAKDGDYWRLLAPGN-YKLTASAPGYLAVVKKVAVPYSPAVRV  397 (402)
T ss_pred             ceEEEEECCCCCcCCCeEEEEEcCc-c-c-cEECCCeeEEECCCCEE-EEEEEEecCcccEEEEEEEcCCCcEEE
Confidence            4999999999999999999999987 4 5 99999999999777664 689999999999999999987765443


No 11 
>cd03866 M14_CPM Peptidase M14 Carboxypeptidase (CP) M (CPM) belongs to the N/E subfamily of the M14 family of metallocarboxypeptidases (MCPs).The M14 family are zinc-binding CPs which hydrolyze single, C-terminal amino acids from polypeptide chains, and have a recognition site for the free C-terminal carboxyl group, which is a key determinant of specificity. CPM is an extracellular glycoprotein, bound to cell membranes via a glycosyl-phosphatidylinositol on the C-terminus of the protein. It specifically removes C-terminal basic residues such as lysine and arginine from peptides and proteins. The highest levels of CPM have been found in human lung and placenta, but significant amounts are present in kidney, blood vessels, intestine, brain, and peripheral nerves. CPM has also been found in soluble form in various body fluids, including amniotic fluid, seminal plasma and urine. Due to its wide distribution in a variety of tissues, it is believed that it plays an important role in the cont
Probab=98.92  E-value=4.8e-09  Score=91.14  Aligned_cols=69  Identities=23%  Similarity=0.292  Sum_probs=58.9

Q ss_pred             EEEEEEEECCCCCeeeeEEEEeccCCCeeeceEecCCceEEEEecCCCcEEEEEEecCccceEEEEEccCc
Q psy4833           9 AVVRGQVLSVQGLGITGIRVSVDKDSGRFGFTLTRGGGWFDMLVNGGGAVTLQFQRSPYRPQTRTVFIPWN   79 (141)
Q Consensus         9 ~~IrG~V~D~~GePL~GVsV~Vkgt~~~~G~TiTD~DG~FsL~V~~g~~~tL~FSyiGY~sqe~~V~~~~n   79 (141)
                      .-|+|+|+|.+|+||+||+|.|.|.. ....+.||.+|.|.+.+..|. .+|.+|..||+++..++.++.+
T Consensus       295 ~gI~G~V~D~~g~pi~~A~V~v~g~~-~~~~~~T~~~G~y~~~l~pG~-Y~v~vsa~Gy~~~~~~v~v~~~  363 (376)
T cd03866         295 LGVKGQVFDSNGNPIPNAIVEVKGRK-HICPYRTNVNGEYFLLLLPGK-YMINVTAPGFKTVITNVIIPYN  363 (376)
T ss_pred             CceEEEEECCCCCccCCeEEEEEcCC-ceeEEEECCCceEEEecCCee-EEEEEEeCCcceEEEEEEeCCC
Confidence            46999999999999999999999886 311268999999999877665 5799999999999999988754


No 12 
>PF08400 phage_tail_N:  Prophage tail fibre N-terminal;  InterPro: IPR013609 This entry represents the N terminus of phage 933W tail fibre protein. The characteristics of the protein distribution suggest prophage matches.
Probab=98.14  E-value=1.9e-05  Score=61.04  Aligned_cols=74  Identities=12%  Similarity=0.119  Sum_probs=55.4

Q ss_pred             eEEEEEEEECCCCCeeeeEEEEeccCCC-------eeeceEecCCceEEEEecCCCcEEEEEEecCccceEEEEEccCcc
Q psy4833           8 VAVVRGQVLSVQGLGITGIRVSVDKDSG-------RFGFTLTRGGGWFDMLVNGGGAVTLQFQRSPYRPQTRTVFIPWNQ   80 (141)
Q Consensus         8 ~~~IrG~V~D~~GePL~GVsV~Vkgt~~-------~~G~TiTD~DG~FsL~V~~g~~~tL~FSyiGY~sqe~~V~~~~n~   80 (141)
                      .++|+|...|..|+|++|++|.++--.+       ......||.+|.|+|.+..|..             .+.+...|..
T Consensus         2 sV~ISGvL~dg~G~pv~g~~I~L~A~~tS~~Vv~~t~as~~t~~~G~Ys~~~epG~Y-------------~V~l~~~g~~   68 (134)
T PF08400_consen    2 SVKISGVLKDGAGKPVPGCTITLKARRTSSTVVVGTVASVVTGEAGEYSFDVEPGVY-------------RVTLKVEGRP   68 (134)
T ss_pred             eEEEEEEEeCCCCCcCCCCEEEEEEccCchheEEEEEEEEEcCCCceEEEEecCCeE-------------EEEEEECCCC
Confidence            3689999999999999999999874321       1244689999999999987753             3445566666


Q ss_pred             EEEeCCeEEEecCC
Q psy4833          81 IVVLPPVKMQLSDA   94 (141)
Q Consensus        81 ~~~~~~v~m~~~d~   94 (141)
                      ..++.+|..+.-+.
T Consensus        69 ~~~vG~I~V~~dS~   82 (134)
T PF08400_consen   69 PVYVGDITVYEDSK   82 (134)
T ss_pred             ceeEEEEEEecCCC
Confidence            67777887775544


No 13 
>cd03869 M14_CPX_like Peptidase M14-like domain of carboxypeptidase (CP)-like protein X (CPX), CPX forms a distinct subgroup of the N/E subfamily of the M14 family of metallocarboxypeptidases (MCPs). The M14 family are zinc-binding CPs which hydrolyze single, C-terminal amino acids from polypeptide chains, and have a recognition site for the free C-terminal carboxyl group, which is a key determinant of specificity. Proteins belonging to this subgroup include CP-like protein X1 (CPX1), CP-like protein X2 (CPX2),  and aortic CP-like protein (ACLP) and its isoform adipocyte enhancer binding protein-1 (AEBP1). AEBP1 is a truncated form of ACLP, which may arise from alternative splicing of the gene. These proteins are inactive towards standard CP substrates because they lack one or more critical active site and substrate-binding residues that are necessary for activity. They may function as binding proteins rather than as active CPs or display catalytic activity toward other substrates.  Pro
Probab=97.98  E-value=2e-05  Score=69.89  Aligned_cols=67  Identities=21%  Similarity=0.272  Sum_probs=55.8

Q ss_pred             EEEEEEEECCCCCeeeeEEEEeccCCCeeeceEecCCceEEEEecCCCcEEEEEEecCccceEEEEEccCc
Q psy4833           9 AVVRGQVLSVQGLGITGIRVSVDKDSGRFGFTLTRGGGWFDMLVNGGGAVTLQFQRSPYRPQTRTVFIPWN   79 (141)
Q Consensus         9 ~~IrG~V~D~~GePL~GVsV~Vkgt~~~~G~TiTD~DG~FsL~V~~g~~~tL~FSyiGY~sqe~~V~~~~n   79 (141)
                      .=|+|.|+|.+|.||++|+|+|+|.. . . ..|..+|.|-=-+.+|. .++.+|.-||.++.+++.++.+
T Consensus       329 ~GikG~V~d~~g~~i~~a~i~v~g~~-~-~-v~t~~~GdywRll~pG~-y~v~~~a~gy~~~~~~~~v~~~  395 (405)
T cd03869         329 RGIKGVVRDKTGKGIPNAIISVEGIN-H-D-IRTASDGDYWRLLNPGE-YRVTAHAEGYTSSTKNCEVGYE  395 (405)
T ss_pred             cCceEEEECCCCCcCCCcEEEEecCc-c-c-eeeCCCCceEEecCCce-EEEEEEecCCCcccEEEEEcCC
Confidence            35899999999999999999999976 3 3 77788998866655564 5799999999999998888754


No 14 
>PF07210 DUF1416:  Protein of unknown function (DUF1416);  InterPro: IPR010814 This family consists of several hypothetical bacterial proteins of around 100 residues in length. Members of this family appear to be Actinomycete specific. The function of this family is unknown.
Probab=97.74  E-value=0.00037  Score=50.49  Aligned_cols=67  Identities=24%  Similarity=0.307  Sum_probs=51.1

Q ss_pred             cCCcceEEEEEEEECCCCCeeeeEEEEeccCCCee-eceEecCCceEEEEecCCCcEEEE-EEecCccceE
Q psy4833           3 DDELRVAVVRGQVLSVQGLGITGIRVSVDKDSGRF-GFTLTRGGGWFDMLVNGGGAVTLQ-FQRSPYRPQT   71 (141)
Q Consensus         3 ~d~~r~~~IrG~V~D~~GePL~GVsV~Vkgt~~~~-G~TiTD~DG~FsL~V~~g~~~tL~-FSyiGY~sqe   71 (141)
                      .|.+...+|+|+|+ .+|+|++|+-|+++..+++| +.+.|.++|.|.+-..+|.= +|+ .+.-|-....
T Consensus         2 vd~~ke~VItG~V~-~~G~Pv~gAyVRLLD~sgEFtaEvvts~~G~FRFfaapG~W-tvRal~~~g~~d~~   70 (85)
T PF07210_consen    2 VDVEKETVITGRVT-RDGEPVGGAYVRLLDSSGEFTAEVVTSATGDFRFFAAPGSW-TVRALSRGGNGDAM   70 (85)
T ss_pred             cCccceEEEEEEEe-cCCcCCCCeEEEEEcCCCCeEEEEEecCCccEEEEeCCCce-EEEEEccCCCCceE
Confidence            57788899999999 99999999999998765433 56899999999998765643 343 3555555333


No 15 
>KOG2649|consensus
Probab=97.64  E-value=0.00024  Score=64.74  Aligned_cols=89  Identities=25%  Similarity=0.249  Sum_probs=68.5

Q ss_pred             EEEEEEEECCCCCeeeeEEEEeccCCCeeeceEecCCceEEEEecCCCcEEEEEEecCccceEEEEEccCccEEEeCCeE
Q psy4833           9 AVVRGQVLSVQGLGITGIRVSVDKDSGRFGFTLTRGGGWFDMLVNGGGAVTLQFQRSPYRPQTRTVFIPWNQIVVLPPVK   88 (141)
Q Consensus         9 ~~IrG~V~D~~GePL~GVsV~Vkgt~~~~G~TiTD~DG~FsL~V~~g~~~tL~FSyiGY~sqe~~V~~~~n~~~~~~~v~   88 (141)
                      .-|+|.|+|.+|+||+.|+|+|+|-+ - . +.|-.+|.|=--+++|.. .|.++.-||.+...+|.++. .--..=++.
T Consensus       378 ~GIkG~V~D~~G~~I~NA~IsV~gin-H-d-v~T~~~GDYWRLL~PG~y-~vta~A~Gy~~~tk~v~V~~-~~a~~~df~  452 (500)
T KOG2649|consen  378 RGIKGLVFDDTGNPIANATISVDGIN-H-D-VTTAKEGDYWRLLPPGKY-IITASAEGYDPVTKTVTVPP-DRAARVNFT  452 (500)
T ss_pred             hccceeEEcCCCCccCceEEEEecCc-C-c-eeecCCCceEEeeCCcce-EEEEecCCCcceeeEEEeCC-CCccceeEE
Confidence            46899999999999999999999887 3 3 888889999877776764 69999999999999999886 333333566


Q ss_pred             EEecCCCccceecc
Q psy4833          89 MQLSDASDIFREVA  102 (141)
Q Consensus        89 m~~~d~~~~~~~~~  102 (141)
                      |.....++.+.+..
T Consensus       453 L~~~~~~~~~r~~~  466 (500)
T KOG2649|consen  453 LQRSIPQPPFRELL  466 (500)
T ss_pred             EecCCCcchhhHHH
Confidence            66555444444333


No 16 
>PRK15036 hydroxyisourate hydrolase; Provisional
Probab=97.57  E-value=0.00012  Score=56.27  Aligned_cols=61  Identities=16%  Similarity=0.287  Sum_probs=45.7

Q ss_pred             EEEEEEEECC-CCCeeeeEEEEeccCCC----eeeceEecCCceEEEEe----cCCCcEEEEEEecCccc
Q psy4833           9 AVVRGQVLSV-QGLGITGIRVSVDKDSG----RFGFTLTRGGGWFDMLV----NGGGAVTLQFQRSPYRP   69 (141)
Q Consensus         9 ~~IrG~V~D~-~GePL~GVsV~Vkgt~~----~~G~TiTD~DG~FsL~V----~~g~~~tL~FSyiGY~s   69 (141)
                      ..|+|.|+|. .|.|++||.|.+.+...    .++.+.||+||+|....    -..+...|+|.--+|=.
T Consensus        27 ~~Is~HVLDt~~G~PA~gV~V~L~~~~~~~w~~l~~~~Td~dGR~~~l~~~~~~~~G~Y~L~F~t~~Yf~   96 (137)
T PRK15036         27 NILSVHILNQQTGKPAADVTVTLEKKADNGWLQLNTAKTDKDGRIKALWPEQTATTGDYRVVFKTGDYFK   96 (137)
T ss_pred             CCeEEEEEeCCCCcCCCCCEEEEEEccCCceEEEEEEEECCCCCCccccCcccCCCeeEEEEEEcchhhh
Confidence            3699999996 79999999999976431    12458999999998621    11245689998888765


No 17 
>cd00421 intradiol_dioxygenase Intradiol dioxygenases catalyze the critical ring-cleavage step in the conversion of catecholate derivatives to citric acid cycle intermediates. This family contains catechol 1,2-dioxygenases and protocatechuate 3,4-dioxygenases which are mononuclear non-heme iron enzymes that catalyze the oxygenation of catecholates to aliphatic acids via the cleavage of aromatic rings. The members are intradiol-cleaving enzymes which break the catechol C1-C2 bond and utilize Fe3+, as opposed to the extradiol-cleaving enzymes which break the C2-C3 or C1-C6 bond and utilize Fe2+ and Mn+. Catechol 1,2-dioxygenases are mostly homodimers with one catalytic ferric ion per monomer. Protocatechuate 3,4-dioxygenases form more diverse oligomers.
Probab=97.43  E-value=0.0016  Score=49.80  Aligned_cols=74  Identities=18%  Similarity=0.260  Sum_probs=51.1

Q ss_pred             CcceEEEEEEEECCCCCeeeeEEEEeccCCC-----------------eeeceEecCCceEEEE-ecCCCc-------EE
Q psy4833           5 ELRVAVVRGQVLSVQGLGITGIRVSVDKDSG-----------------RFGFTLTRGGGWFDML-VNGGGA-------VT   59 (141)
Q Consensus         5 ~~r~~~IrG~V~D~~GePL~GVsV~Vkgt~~-----------------~~G~TiTD~DG~FsL~-V~~g~~-------~t   59 (141)
                      +.....|+|+|+|.+|.|++||.|.+-....                 ..|...||++|+|.+. +.++..       +-
T Consensus         8 ~G~~l~l~G~V~D~~g~pv~~A~VeiW~~d~~G~Y~~~~~~~~~~~~~~rg~~~Td~~G~y~f~ti~Pg~Y~~~R~~HiH   87 (146)
T cd00421           8 PGEPLTLTGTVLDGDGCPVPDALVEIWQADADGRYSGQDDSGLDPEFFLRGRQITDADGRYRFRTIKPGPYPIGRPPHIH   87 (146)
T ss_pred             CCCEEEEEEEEECCCCCCCCCcEEEEEecCCCCccCCcCccccCCCCCCEEEEEECCCcCEEEEEEcCCCCCCCCCCEEE
Confidence            4567899999999999999999998843221                 1367899999999997 333321       23


Q ss_pred             EEEEecCc-cceEEEEEccC
Q psy4833          60 LQFQRSPY-RPQTRTVFIPW   78 (141)
Q Consensus        60 L~FSyiGY-~sqe~~V~~~~   78 (141)
                      +.+.--|| ....-+++.+.
T Consensus        88 ~~V~~~g~~~~l~Tqlyf~~  107 (146)
T cd00421          88 FKVFAPGYNRRLTTQLYFPG  107 (146)
T ss_pred             EEEECCCccCcEEEEEEeCC
Confidence            55555666 55555555553


No 18 
>cd03461 1,2-HQD Hydroxyquinol 1,2-dioxygenase (1,2-HQD) catalyzes the ring cleavage of hydroxyquinol (1,2,4-trihydroxybenzene), a intermediate in the degradation of a large variety of aromatic compounds including some polychloro- and nitroaromatic pollutants, to form 3-hydroxy-cis,cis-muconates. 1,2-HQD blongs to the aromatic dioxygenase family, a family of mononuclear non-heme intradiol-cleaving enzymes.
Probab=97.36  E-value=0.002  Score=54.85  Aligned_cols=75  Identities=17%  Similarity=0.194  Sum_probs=55.0

Q ss_pred             CcceEEEEEEEECCCCCeeeeEEEEecc----------------CCCeeeceEecCCceEEEE-ecCCC-----------
Q psy4833           5 ELRVAVVRGQVLSVQGLGITGIRVSVDK----------------DSGRFGFTLTRGGGWFDML-VNGGG-----------   56 (141)
Q Consensus         5 ~~r~~~IrG~V~D~~GePL~GVsV~Vkg----------------t~~~~G~TiTD~DG~FsL~-V~~g~-----------   56 (141)
                      +.....|+|+|+|.+|.||+||.|-+=.                .. ..|...||+||+|.+. +.++.           
T Consensus       117 ~G~~l~v~G~V~D~~G~Pv~gA~VeiWqad~~G~Y~~~~~~~~~~~-lRGr~~Td~~G~y~F~Ti~Pg~Ypip~dGp~g~  195 (277)
T cd03461         117 DGEPCFVHGRVTDTDGKPLPGATVDVWQADPNGLYDVQDPDQPEFN-LRGKFRTDEDGRYAFRTLRPTPYPIPTDGPVGK  195 (277)
T ss_pred             CCCEEEEEEEEEcCCCCCcCCcEEEEECcCCCCCcCCCCCCCCCCC-CeEEEEeCCCCCEEEEEECCCCcCCCCCCcHHH
Confidence            4556899999999999999999998821                11 2367899999999996 32221           


Q ss_pred             -------------cEEEEEEecCccceEEEEEccCcc
Q psy4833          57 -------------AVTLQFQRSPYRPQTRTVFIPWNQ   80 (141)
Q Consensus        57 -------------~~tL~FSyiGY~sqe~~V~~~~n~   80 (141)
                                   .+-+.++.-||+...-+++.+...
T Consensus       196 lL~~~grh~~RpaHIHf~V~a~G~~~L~Tqlyf~~d~  232 (277)
T cd03461         196 LLKAMGRHPMRPAHIHFMVTAPGYRTLVTQIFDSGDP  232 (277)
T ss_pred             HHHhhhccCCCCCeEEEEEEcCCcCceEEeEecCCCc
Confidence                         345667778887777777777554


No 19 
>TIGR02423 protocat_alph protocatechuate 3,4-dioxygenase, alpha subunit. This model represents the alpha chain of protocatechuate 3,4-dioxygenase. The most closely related family outside this family is that of the beta chain (TIGR02422), typically encoded in an adjacent locus. This enzyme acts in the degradation of aromatic compounds by way of p-hydroxybenzoate to succinate and acetyl-CoA.
Probab=97.34  E-value=0.0031  Score=50.98  Aligned_cols=77  Identities=18%  Similarity=0.217  Sum_probs=52.2

Q ss_pred             cceEEEEEEEECCCCCeeeeEEEEecc------------------CCC--eeeceEecCCceEEEE-ecCCC--------
Q psy4833           6 LRVAVVRGQVLSVQGLGITGIRVSVDK------------------DSG--RFGFTLTRGGGWFDML-VNGGG--------   56 (141)
Q Consensus         6 ~r~~~IrG~V~D~~GePL~GVsV~Vkg------------------t~~--~~G~TiTD~DG~FsL~-V~~g~--------   56 (141)
                      ...+.|+|+|+|.+|+||+||.|-+=.                  .++  ..|...||++|+|.+. +..+.        
T Consensus        37 G~~l~l~G~V~D~~g~Pv~~A~VeiWqad~~G~Y~~~~~~~~~~~~~~f~grGr~~Td~~G~y~f~TI~Pg~Yp~~~g~~  116 (193)
T TIGR02423        37 GERIRLEGRVLDGDGHPVPDALIEIWQADAAGRYNSPADLRAPATDPGFRGWGRTGTDESGEFTFETVKPGAVPDRDGVL  116 (193)
T ss_pred             CCEEEEEEEEECCCCCCCCCCEEEEEccCCCCccCCccCCcccccCCCCCCeEEEEECCCCCEEEEEEcCCCcCCCCCCC
Confidence            456899999999999999999998721                  000  1255789999999996 33332        


Q ss_pred             ---cEEEEEEecCc-cceEEEEEccCccEE
Q psy4833          57 ---AVTLQFQRSPY-RPQTRTVFIPWNQIV   82 (141)
Q Consensus        57 ---~~tL~FSyiGY-~sqe~~V~~~~n~~~   82 (141)
                         .+-+.+..-|| +...-+++.+.....
T Consensus       117 R~~HiH~~V~a~G~~~~LtTqlYf~~~~~~  146 (193)
T TIGR02423       117 QAPHINVSVFARGINRRLYTRLYFDDEAAA  146 (193)
T ss_pred             cCCeEEEEEECCCcccceEEEEEcCCCccc
Confidence               23355556676 556556666655444


No 20 
>cd03458 Catechol_intradiol_dioxygenases Catechol intradiol dioxygenases can be divided into several subgroups according to their substrate specificity for catechol, chlorocatechols and hydroxyquinols. Almost all members of this family are homodimers containing one ferric ion (Fe3+) per monomer. They belong to the intradiol dioxygenase family, a family of mononuclear non-heme iron intradiol-cleaving enzymes that catalyze the oxygenation of catecholates to aliphatic acids via the cleavage of aromatic rings.
Probab=97.22  E-value=0.0037  Score=52.76  Aligned_cols=76  Identities=17%  Similarity=0.220  Sum_probs=53.4

Q ss_pred             CcceEEEEEEEECCCCCeeeeEEEEeccCC---------------CeeeceEecCCceEEEE-ecCC-------------
Q psy4833           5 ELRVAVVRGQVLSVQGLGITGIRVSVDKDS---------------GRFGFTLTRGGGWFDML-VNGG-------------   55 (141)
Q Consensus         5 ~~r~~~IrG~V~D~~GePL~GVsV~Vkgt~---------------~~~G~TiTD~DG~FsL~-V~~g-------------   55 (141)
                      +.....|+|+|+|.+|+||+||.|-+=...               +..|...||+||+|.+. +.++             
T Consensus       101 ~G~~l~l~G~V~D~~G~Pv~~A~VeiWqad~~G~Y~~~~~~~~~~~lRG~~~Td~~G~y~f~Ti~P~~Ypip~dGp~g~l  180 (256)
T cd03458         101 DGEPLFVHGTVTDTDGKPLAGATVDVWHADPDGFYSQQDPDQPEFNLRGKFRTDEDGRYRFRTIRPVPYPIPPDGPTGEL  180 (256)
T ss_pred             CCcEEEEEEEEEcCCCCCCCCcEEEEEccCCCCCcCCCCCCCCCCCCEEEEEeCCCCCEEEEEECCCCccCCCCCcHHHH
Confidence            456789999999999999999999882110               02367899999999996 3332             


Q ss_pred             -----------CcEEEEEEecCccceEEEEEccCcc
Q psy4833          56 -----------GAVTLQFQRSPYRPQTRTVFIPWNQ   80 (141)
Q Consensus        56 -----------~~~tL~FSyiGY~sqe~~V~~~~n~   80 (141)
                                 ..+-+.++.-||+...-+++.+...
T Consensus       181 L~~~grh~~RpaHIHf~V~a~G~~~LtTqlyf~~d~  216 (256)
T cd03458         181 LEALGRHPWRPAHIHFMVSAPGYRTLTTQIYFEGDP  216 (256)
T ss_pred             HHhcccCCCCCCeEEEEEECCCcceeeEEEecCCCc
Confidence                       2234666777776666666666444


No 21 
>TIGR02438 catachol_actin catechol 1,2-dioxygenase, Actinobacterial. Members of this family are catechol 1,2-dioxygenases of the Actinobacteria. They are more closely related to actinobacterial chlorocatechol 1,2-dioxygenases than to proteobacterial catechol 1,2-dioxygenases, and so are built in this separate model. The member from Rhodococcus rhodochrous NCIMB 13259 (GB|AAC33003.1) is described as a homodimer with bound Fe, similarly active on catechol, 3-methylcatechol and 4-methylcatechol.
Probab=97.22  E-value=0.0032  Score=53.88  Aligned_cols=78  Identities=19%  Similarity=0.322  Sum_probs=53.8

Q ss_pred             CCcceEEEEEEEECCCCCeeeeEEEEe--ccCCC-------------eeeceEecCCceEEEE-ecCC------------
Q psy4833           4 DELRVAVVRGQVLSVQGLGITGIRVSV--DKDSG-------------RFGFTLTRGGGWFDML-VNGG------------   55 (141)
Q Consensus         4 d~~r~~~IrG~V~D~~GePL~GVsV~V--kgt~~-------------~~G~TiTD~DG~FsL~-V~~g------------   55 (141)
                      ++.....|+|+|+|.+|+||+||.|.+  .+..+             ..|...||+||+|.+. +.++            
T Consensus       128 ~~G~pl~v~G~V~D~~G~Pv~gA~VdiWqada~G~Ys~~~~~~~~~~lRGr~~TDadG~y~F~TI~Pg~YpiP~dGp~G~  207 (281)
T TIGR02438       128 EAGTPLVFSGQVTDLDGNGLAGAKVELWHADDDGFYSQFAPGIPEWNLRGTIIADDEGRFEITTMQPAPYQIPTDGPTGK  207 (281)
T ss_pred             CCCCEEEEEEEEEcCCCCCcCCCEEEEEecCCCCCcCCCCCCCCCCCCeEEEEeCCCCCEEEEEECCCCcCCCCCCchHH
Confidence            345568999999999999999999999  11111             1256789999999996 3222            


Q ss_pred             ------------CcEEEEEEecCccceEEEEEccCccE
Q psy4833          56 ------------GAVTLQFQRSPYRPQTRTVFIPWNQI   81 (141)
Q Consensus        56 ------------~~~tL~FSyiGY~sqe~~V~~~~n~~   81 (141)
                                  ..+-+.++.-||+...-+++.+...+
T Consensus       208 lL~~~Grh~~RpaHIHf~V~a~G~~~LtTqiyf~gd~~  245 (281)
T TIGR02438       208 FIAAAGGHPWRPAHLHLKVSAPGHELITTQLYFKGGEH  245 (281)
T ss_pred             HHHhcccCCCCCCEEEEEEECCCcceEEEeEecCCCCC
Confidence                        22345567777776666666664443


No 22 
>PF00775 Dioxygenase_C:  Dioxygenase;  InterPro: IPR000627 This entry represents the C-terminal domain common to several intradiol ring-cleavage dioxygenases. Dioxygenases catalyse the incorporation of both atoms of molecular oxygen into substrates using a variety of reaction mechanisms. Cleavage of aromatic rings is one of the most important functions of dioxygenases, which play key roles in the degradation of aromatic compounds. The substrates of ring-cleavage dioxygenases can be classified into two groups according to the mode of scission of the aromatic ring. Intradiol enzymes use a non-haem Fe(III) to cleave the aromatic ring between two hydroxyl groups (ortho-cleavage), whereas extradiol enzymes (IPR000486 from INTERPRO) use a non-haem Fe(II) to cleave the aromatic ring between a hydroxylated carbon and an adjacent non-hydroxylated carbon (meta-cleavage) []. These two subfamilies differ in sequence, structural fold, iron ligands, and the orientation of second sphere active site amino acid residues. Enzymes that belong to the intradiol family include catechol 1,2-dioxygenase (1,2-CTD) (1.13.11.1 from EC); protocatechuate 3,4-dioxygenase (3,4-PCD) (1.13.11.3 from EC); and chlorocatechol 1,2-dioxygenase (1.13.11.1 from EC) [].; GO: 0003824 catalytic activity, 0008199 ferric iron binding, 0006725 cellular aromatic compound metabolic process, 0055114 oxidation-reduction process; PDB: 2BUV_A 2BUX_A 2BUU_A 2BUR_A 1EO9_A 2BUZ_A 2BV0_A 1EO2_A 1EOC_A 1EOA_A ....
Probab=97.17  E-value=0.0061  Score=48.70  Aligned_cols=48  Identities=19%  Similarity=0.193  Sum_probs=36.4

Q ss_pred             CCcceEEEEEEEECCCCCeeeeEEEEec-----cC------------CCeeeceEecCCceEEEE
Q psy4833           4 DELRVAVVRGQVLSVQGLGITGIRVSVD-----KD------------SGRFGFTLTRGGGWFDML   51 (141)
Q Consensus         4 d~~r~~~IrG~V~D~~GePL~GVsV~Vk-----gt------------~~~~G~TiTD~DG~FsL~   51 (141)
                      .+.....|+|+|+|.+|+||+||.|-|=     |.            .+..|...||++|+|.+.
T Consensus        25 ~~G~~l~l~G~V~D~~g~Pv~~A~veiWqada~G~Ys~~~~~~~~~~~~~rG~~~Td~~G~y~f~   89 (183)
T PF00775_consen   25 APGEPLVLHGRVIDTDGKPVPGALVEIWQADADGRYSGQDPGSDQPDFNLRGRFRTDADGRYSFR   89 (183)
T ss_dssp             SSS-EEEEEEEEEETTSSB-TTEEEEEEE--TTS--TTTBTTSSSSTTTTEEEEEECTTSEEEEE
T ss_pred             CCCCEEEEEEEEECCCCCCCCCcEEEEEecCCCCccccccccccccCCCcceEEecCCCCEEEEE
Confidence            3556889999999999999999999982     10            012377899999999986


No 23 
>cd03462 1,2-CCD chlorocatechol 1,2-dioxygenases (1,2-CCDs) (type II enzymes) are homodimeric intradiol dioxygenases that degrade chlorocatechols via the addition of molecular oxygen and the subsequent cleavage between two adjacent hydroxyl groups. This reaction is part of the modified ortho-cleavage pathway which is a central oxidative bacterial pathway that channels chlorocatechols, derived from the degradation of chlorinated benzoic acids, phenoxyacetic acids, phenols, benzenes, and other aromatics into the energy-generating tricarboxylic acid pathway.
Probab=97.14  E-value=0.0045  Score=52.03  Aligned_cols=77  Identities=16%  Similarity=0.217  Sum_probs=54.8

Q ss_pred             CcceEEEEEEEECCCCCeeeeEEEEeccCC---------------CeeeceEecCCceEEEE-ecCC-------------
Q psy4833           5 ELRVAVVRGQVLSVQGLGITGIRVSVDKDS---------------GRFGFTLTRGGGWFDML-VNGG-------------   55 (141)
Q Consensus         5 ~~r~~~IrG~V~D~~GePL~GVsV~Vkgt~---------------~~~G~TiTD~DG~FsL~-V~~g-------------   55 (141)
                      +.....|+|+|+|.+|+||+||.|-+=...               +..|...||+||+|.+. +.++             
T Consensus        96 ~G~~l~l~G~V~D~~G~Pv~~A~VeiWqad~~G~Y~~~~~~~~~~~~RG~~~Td~~G~y~F~Ti~P~~YpiP~dGp~g~l  175 (247)
T cd03462          96 DHKPLLFRGTVKDLAGAPVAGAVIDVWHSTPDGKYSGFHPNIPEDYYRGKIRTDEDGRYEVRTTVPVPYQIPNDGPTGAL  175 (247)
T ss_pred             CCCEEEEEEEEEcCCCCCcCCcEEEEECCCCCCCcCCCCCCCCCCCCEEEEEeCCCCCEEEEEECCCCcCCCCCCcHHHH
Confidence            445689999999999999999999883111               01256789999999996 3322             


Q ss_pred             -----------CcEEEEEEecCccceEEEEEccCccE
Q psy4833          56 -----------GAVTLQFQRSPYRPQTRTVFIPWNQI   81 (141)
Q Consensus        56 -----------~~~tL~FSyiGY~sqe~~V~~~~n~~   81 (141)
                                 ..+-+.++.-||+...-+++.+...+
T Consensus       176 L~~~grh~~RpaHIHf~V~a~G~~~L~Tqlyf~~d~~  212 (247)
T cd03462         176 LEAMGGHSWRPAHVHFKVRADGYETLTTQLYFEGGEW  212 (247)
T ss_pred             HHhcccCCCCCCeEEEEEECCCcCceEEEEecCCCCC
Confidence                       23456777788877666777664444


No 24 
>cd03459 3,4-PCD Protocatechuate 3,4-dioxygenase (3,4-PCD) catalyzes the oxidative ring cleavage of 3,4-dihydroxybenzoate to produce beta-carboxy-cis,cis-muconate. 3,4-PCDs are large aggregates of 12 protomers, each composed of an alpha- and beta-subunit and an Fe3+ ion bound in the beta-subunit at the alpha-beta-subunit interface. 3,4-PCD is a member of the aromatic dioxygenases which are non-heme iron intradiol-cleaving enzymes that break the C1-C2 bond and utilize Fe3+.
Probab=97.11  E-value=0.0066  Score=47.56  Aligned_cols=74  Identities=16%  Similarity=0.267  Sum_probs=52.1

Q ss_pred             CcceEEEEEEEECCCCCeeeeEEEEeccC------------------CC--eeeceEecCCceEEEE-ecCCC-------
Q psy4833           5 ELRVAVVRGQVLSVQGLGITGIRVSVDKD------------------SG--RFGFTLTRGGGWFDML-VNGGG-------   56 (141)
Q Consensus         5 ~~r~~~IrG~V~D~~GePL~GVsV~Vkgt------------------~~--~~G~TiTD~DG~FsL~-V~~g~-------   56 (141)
                      +...+.|+|+|+|.+|+||+||.|.+=..                  ++  ..|...||+||+|.+. +.++.       
T Consensus        12 ~G~~l~l~g~V~D~~g~Pv~~A~veiWqad~~G~Y~~~~~~~~~~~~~~f~~rG~~~Td~~G~~~f~Ti~Pg~Y~~p~~~   91 (158)
T cd03459          12 IGERIILEGRVLDGDGRPVPDALVEIWQADAAGRYRHPRDSHRAPLDPNFTGFGRVLTDADGRYRFRTIKPGAYPWRNGA   91 (158)
T ss_pred             CCcEEEEEEEEECCCCCCCCCCEEEEEccCCCCccCCccCCcccccCCCCCceeEEEECCCCcEEEEEECCCCcCCCCCC
Confidence            34578999999999999999999987211                  00  1255789999999996 33221       


Q ss_pred             ----cEEEEEEecCc-cceEEEEEccC
Q psy4833          57 ----AVTLQFQRSPY-RPQTRTVFIPW   78 (141)
Q Consensus        57 ----~~tL~FSyiGY-~sqe~~V~~~~   78 (141)
                          .+-+.+.--|| +...-+++.+.
T Consensus        92 ~R~~HIH~~V~~~g~~~~L~Tqlyf~~  118 (158)
T cd03459          92 WRAPHIHVSVFARGLLERLVTRLYFPG  118 (158)
T ss_pred             CcCCEEEEEEECCCcccceEEeEecCC
Confidence                13356666788 66666777775


No 25 
>TIGR02439 catechol_proteo catechol 1,2-dioxygenase, proteobacterial. Members of this family known so far are catechol 1,2-dioxygenases of the Proteobacteria. They are distinct from catechol 1,2-dioxygenases and chlorocatechol 1,2-dioxygenases of the Actinobacteria, which are quite similar to each other and resolved by separate models. This enzyme catalyzes intradiol cleavage in which catechol + O2 becomes cis,cis-muconate. Catechol is an intermediate in the catabolism of many different aromatic compounds, as is the alternative intermediate protocatechuate. In Acinetobacter lwoffii, two isozymes are present with abilities, differing somewhat, to act on catechol analogs 3-methylcatechol, 4-methylcatechol, 4-methoxycatechol, and 4-chlorocatechol.
Probab=97.11  E-value=0.0056  Score=52.41  Aligned_cols=79  Identities=18%  Similarity=0.146  Sum_probs=56.0

Q ss_pred             CcceEEEEEEEECCCCCeeeeEEEEeccC-------------C--CeeeceEecCCceEEEE-ecCC-------------
Q psy4833           5 ELRVAVVRGQVLSVQGLGITGIRVSVDKD-------------S--GRFGFTLTRGGGWFDML-VNGG-------------   55 (141)
Q Consensus         5 ~~r~~~IrG~V~D~~GePL~GVsV~Vkgt-------------~--~~~G~TiTD~DG~FsL~-V~~g-------------   55 (141)
                      +.....|+|+|+|.+|+||+||.|-+=..             +  +..|...||+||+|.+. +.++             
T Consensus       125 ~G~pl~v~G~V~D~~G~PI~gA~VeIWqad~~G~Ys~~~~~~~~~~lRG~~~TD~~G~y~F~TI~P~~YpiP~dGp~g~l  204 (285)
T TIGR02439       125 PGETLFLHGQVTDADGKPIAGAKVELWHANTKGNYSHFDKSQSEFNLRRTIITDAEGRYRARSIVPSGYGCPPQGPTQQL  204 (285)
T ss_pred             CCcEEEEEEEEECCCCCCcCCcEEEEEccCCCCCcCCCCCCCCCCCceEEEEECCCCCEEEEEECCCCCcCCCCCcHHHH
Confidence            35678999999999999999999988211             0  02267899999999996 3222             


Q ss_pred             -----------CcEEEEEEecCccceEEEEEccCccEEE
Q psy4833          56 -----------GAVTLQFQRSPYRPQTRTVFIPWNQIVV   83 (141)
Q Consensus        56 -----------~~~tL~FSyiGY~sqe~~V~~~~n~~~~   83 (141)
                                 ..+-+.++.-||+...-+++.+...++.
T Consensus       205 L~~~grh~~RpaHIHf~V~a~G~~~LtTQiyf~gd~~l~  243 (285)
T TIGR02439       205 LNLLGRHGNRPAHVHFFVSAPGHRKLTTQINIEGDPYLW  243 (285)
T ss_pred             HHhccCCCCCCCeEEEEEecCCCCeeEEEEecCCCCCcc
Confidence                       2345667777887777777766554443


No 26 
>PF02369 Big_1:  Bacterial Ig-like domain (group 1);  InterPro: IPR003344 Proteins that contain this domain are found in a variety of bacterial and phage surface proteins such as intimins. Intimin is a bacterial cell-adhesion molecule that mediates the intimate bacterial host-cell interaction. It contains three domains; two immunoglobulin-like domains and a C-type lectin-like module implying that carbohydrate recognition may be important in intimin-mediated cell adhesion [].; PDB: 1CWV_A 4E9L_A 1F02_I 1F00_I.
Probab=97.11  E-value=0.0047  Score=44.25  Aligned_cols=70  Identities=14%  Similarity=0.187  Sum_probs=44.3

Q ss_pred             CCcceEEEEEEEECCCCCeeeeEEEEe--ccCCCeee----ceEecCCceEEEEecC--CCcEEEEEEecCccceEEE
Q psy4833           4 DELRVAVVRGQVLSVQGLGITGIRVSV--DKDSGRFG----FTLTRGGGWFDMLVNG--GGAVTLQFQRSPYRPQTRT   73 (141)
Q Consensus         4 d~~r~~~IrG~V~D~~GePL~GVsV~V--kgt~~~~G----~TiTD~DG~FsL~V~~--g~~~tL~FSyiGY~sqe~~   73 (141)
                      |.....+|+-+|.|.+|.||+|+.|.+  ......+.    ...||.+|.+.+.+.+  .+..++..+.-|.......
T Consensus        20 ~g~~~~tltatV~D~~gnpv~g~~V~f~~~~~~~~l~~~~~~~~Td~~G~a~~tltst~aG~~~VtA~~~~~~~~~~~   97 (100)
T PF02369_consen   20 DGSDTNTLTATVTDANGNPVPGQPVTFSSSSSGGTLSPTNTSATTDSNGIATVTLTSTKAGTYTVTATVDGGSTSVTS   97 (100)
T ss_dssp             SSSS-EEEEEEEEETTSEB-TS-EEEE--EESSSEES-CEE-EEE-TTSEEEEEEE-SS-EEEEEEEEETTEEEEEEE
T ss_pred             CCcCcEEEEEEEEcCCCCCCCCCEEEEEEcCCCcEEecCccccEECCCEEEEEEEEecCceEEEEEEEECCcceeEEe
Confidence            456678899999999999999999999  32221111    2589999999998753  2334566666665544433


No 27 
>TIGR02465 chlorocat_1_2 chlorocatechol 1,2-dioxygenase. Members of this protein family are chlorocatechol 1,2-dioxygenase. This protein is closely related to catechol 1,2-dioxygenase, TIGR02439, EC 1.13.11.1. Note that annotated database entries have appeared for the present protein family with the EC number that refers to that of family TIGR02439. This protein acts in pathways of the biodegradation of chlorinated aromatic compounds.
Probab=97.08  E-value=0.0064  Score=51.03  Aligned_cols=77  Identities=18%  Similarity=0.276  Sum_probs=54.1

Q ss_pred             cceEEEEEEEECCCCCeeeeEEEEeccCC---------------CeeeceEecCCceEEEE-ecCC--------------
Q psy4833           6 LRVAVVRGQVLSVQGLGITGIRVSVDKDS---------------GRFGFTLTRGGGWFDML-VNGG--------------   55 (141)
Q Consensus         6 ~r~~~IrG~V~D~~GePL~GVsV~Vkgt~---------------~~~G~TiTD~DG~FsL~-V~~g--------------   55 (141)
                      .-...|+|+|+|.+|+||+||.|-|=...               +..|...||+||+|.+. +.++              
T Consensus        96 G~~l~v~G~V~D~~G~Pv~~A~VeiWqad~~G~Y~~~~~~~~~~~lRG~~~Td~~G~y~F~Ti~P~~YpiP~dgp~g~lL  175 (246)
T TIGR02465        96 HKPLLIRGTVRDLSGTPVAGAVIDVWHSTPDGKYSGFHDNIPDDYYRGKLVTAADGSYEVRTTMPVPYQIPDAGPTGALL  175 (246)
T ss_pred             CcEEEEEEEEEcCCCCCcCCcEEEEECCCCCCCCCCCCCCCCCCCCeEEEEECCCCCEEEEEECCCCCCCCCCCchHHHH
Confidence            45689999999999999999999882110               01266889999999996 3222              


Q ss_pred             ----------CcEEEEEEecCccceEEEEEccCccEE
Q psy4833          56 ----------GAVTLQFQRSPYRPQTRTVFIPWNQIV   82 (141)
Q Consensus        56 ----------~~~tL~FSyiGY~sqe~~V~~~~n~~~   82 (141)
                                ..+-+.++.-||+...-+++.+....+
T Consensus       176 ~~~grh~~RpaHIH~~V~a~G~~~L~Tqiyf~~d~~l  212 (246)
T TIGR02465       176 ETMGRHSWRPAHVHYKVRADGYRPLTTQAYFEGGPYI  212 (246)
T ss_pred             HhcccCCCCCCeEEEEEECCCcCceEEeEecCCCccc
Confidence                      234566677788766666666644433


No 28 
>TIGR02422 protocat_beta protocatechuate 3,4-dioxygenase, beta subunit. This model represents the beta chain of protocatechuate 3,4-dioxygenase. The most closely related family outside this family is that of the alpha chain (TIGR02423), typically encoded in an adjacent locus. This enzyme acts in the degradation of aromatic compounds by way of p-hydroxybenzoate to succinate and acetyl-CoA.
Probab=97.07  E-value=0.006  Score=50.41  Aligned_cols=75  Identities=17%  Similarity=0.343  Sum_probs=50.5

Q ss_pred             ceEEEEEEEECCCCCeeeeEEEEeccC------------------CC--eeeceEecCCceEEEE-ecCCC---------
Q psy4833           7 RVAVVRGQVLSVQGLGITGIRVSVDKD------------------SG--RFGFTLTRGGGWFDML-VNGGG---------   56 (141)
Q Consensus         7 r~~~IrG~V~D~~GePL~GVsV~Vkgt------------------~~--~~G~TiTD~DG~FsL~-V~~g~---------   56 (141)
                      ..+.|+|+|+|.+|+||+||.|.|=..                  ++  .+|...||.||+|.+. +..+.         
T Consensus        59 ~~i~l~G~V~D~~g~PV~~A~VEIWQada~G~Y~~~~d~~~~~~~~~f~grGr~~TD~~G~y~F~TI~PG~Y~~p~~~~~  138 (220)
T TIGR02422        59 ERIIVHGRVLDEDGRPVPNTLVEVWQANAAGRYRHKNDQYLAPLDPNFGGVGRTLTDSDGYYRFRTIKPGPYPWGNHHNA  138 (220)
T ss_pred             CEEEEEEEEECCCCCCCCCCEEEEEecCCCCcccCccCccccccCCCCCCEEEEEECCCccEEEEEECCCCccCCCCCCC
Confidence            458999999999999999999987211                  00  1367889999999996 33332         


Q ss_pred             ----cEEEEEEecCc-cceEEEEEccCccE
Q psy4833          57 ----AVTLQFQRSPY-RPQTRTVFIPWNQI   81 (141)
Q Consensus        57 ----~~tL~FSyiGY-~sqe~~V~~~~n~~   81 (141)
                          .+-+.+..-|| ....-+++.+....
T Consensus       139 ~R~pHIH~~V~~~g~~~~L~TqlYF~dd~~  168 (220)
T TIGR02422       139 WRPAHIHFSLFGTSFAQRLITQMYFEGDPL  168 (220)
T ss_pred             CcCCeEEEEEECCCCccceEEeEECCCCcc
Confidence                22344455666 35555566665443


No 29 
>cd03460 1,2-CTD Catechol 1,2 dioxygenase (1,2-CTD) catalyzes an intradiol cleavage reaction of catechol to form cis,cis-muconate. 1,2-CTDs is homodimers with one catalytic non-heme ferric ion per monomer. They belong to the aromatic dioxygenase family, a family of mononuclear non-heme iron intradiol-cleaving enzymes that catalyze the oxygenation of catecholates to aliphatic acids via the cleavage of aromatic rings.
Probab=97.02  E-value=0.0076  Score=51.55  Aligned_cols=79  Identities=16%  Similarity=0.177  Sum_probs=55.2

Q ss_pred             CcceEEEEEEEECCCCCeeeeEEEEeccC-------------C--CeeeceEecCCceEEEE-ecCC-------------
Q psy4833           5 ELRVAVVRGQVLSVQGLGITGIRVSVDKD-------------S--GRFGFTLTRGGGWFDML-VNGG-------------   55 (141)
Q Consensus         5 ~~r~~~IrG~V~D~~GePL~GVsV~Vkgt-------------~--~~~G~TiTD~DG~FsL~-V~~g-------------   55 (141)
                      +.....|+|+|+|.+|+||+||.|-|=..             +  +..|...||++|+|.+. +.++             
T Consensus       121 ~Gepl~l~G~V~D~~G~PI~~A~VeiWqad~~G~Ys~~~~~~~~f~~RGr~~TD~~G~y~F~TI~P~~YpiP~dGp~g~l  200 (282)
T cd03460         121 DGETLVMHGTVTDTDGKPVPGAKVEVWHANSKGFYSHFDPTQSPFNLRRSIITDADGRYRFRSIMPSGYGVPPGGPTQQL  200 (282)
T ss_pred             CCCEEEEEEEEECCCCCCcCCcEEEEECCCCCCCcCCCCCCCCCCCCceEEEeCCCCCEEEEEECCCCCcCCCCCcHHHH
Confidence            35678999999999999999999988211             0  02266899999999996 3222             


Q ss_pred             -----------CcEEEEEEecCccceEEEEEccCccEEE
Q psy4833          56 -----------GAVTLQFQRSPYRPQTRTVFIPWNQIVV   83 (141)
Q Consensus        56 -----------~~~tL~FSyiGY~sqe~~V~~~~n~~~~   83 (141)
                                 ..+-+.++.-||+...-+++.+...++.
T Consensus       201 L~~~grh~~RpaHIHf~V~a~G~~~LtTqlyf~gd~~~~  239 (282)
T cd03460         201 LNALGRHGNRPAHIHFFVSAPGHRKLTTQINIEGDPYIW  239 (282)
T ss_pred             HHhhcCCCCCCCeEEEEEecCCcCceEEEEecCCCCccc
Confidence                       2244666777777766667766554443


No 30 
>cd03464 3,4-PCD_beta Protocatechuate 3,4-dioxygenase (3,4-PCD) , beta subunit. 3,4-PCD catalyzes the oxidative ring cleavage of 3,4-dihydroxybenzoate to produce beta-carboxy-cis,cis-muconate. 3,4-PCDs are large aggregates of 12 protomers, each composed of an alpha- and beta-subunit and an Fe3+ ion bound in the beta-subunit at the alpha-subunit-beta-subunit interface. 3,4-PCD is a member of the aromatic dioxygenases which are non-heme iron intradiol-cleaving enzymes that break the C1-C2 bond and utilize Fe3+.
Probab=96.95  E-value=0.01  Score=49.03  Aligned_cols=75  Identities=17%  Similarity=0.331  Sum_probs=50.3

Q ss_pred             cceEEEEEEEECCCCCeeeeEEEEeccC------------------CC--eeeceEecCCceEEEE-ecCCC--------
Q psy4833           6 LRVAVVRGQVLSVQGLGITGIRVSVDKD------------------SG--RFGFTLTRGGGWFDML-VNGGG--------   56 (141)
Q Consensus         6 ~r~~~IrG~V~D~~GePL~GVsV~Vkgt------------------~~--~~G~TiTD~DG~FsL~-V~~g~--------   56 (141)
                      ...+.|+|+|+|.+|+||+||.|.|=..                  ++  ..|...||+||+|.+. +..+.        
T Consensus        63 G~~i~l~G~V~D~~G~PV~~A~VEIWQad~~G~Y~~~~d~~~~~~~~~f~grGr~~TD~~G~y~F~TI~Pg~Yp~p~~r~  142 (220)
T cd03464          63 GERIIVHGRVLDEDGRPVPNTLVEIWQANAAGRYRHKRDQHDAPLDPNFGGAGRTLTDDDGYYRFRTIKPGAYPWGNHPN  142 (220)
T ss_pred             CCEEEEEEEEECCCCCCCCCCEEEEEecCCCCcccCccCCcccccCCCCCCEEEEEECCCccEEEEEECCCCccCCCCCC
Confidence            4458999999999999999999987211                  00  1356799999999996 33332        


Q ss_pred             -----cEEEEEEecCcc-ceEEEEEccCcc
Q psy4833          57 -----AVTLQFQRSPYR-PQTRTVFIPWNQ   80 (141)
Q Consensus        57 -----~~tL~FSyiGY~-sqe~~V~~~~n~   80 (141)
                           .+-+.++.-||. ...-+++.+...
T Consensus       143 ~~RppHIH~~V~a~G~~~~L~TqlYF~dd~  172 (220)
T cd03464         143 AWRPAHIHFSLFGPSFATRLVTQMYFPGDP  172 (220)
T ss_pred             CCcCCeEEEEEECCCccCceEEeEECCCCc
Confidence                 223444566765 455556666444


No 31 
>cd03463 3,4-PCD_alpha Protocatechuate 3,4-dioxygenase (3,4-PCD) , alpha subunit. 3,4-PCD catalyzes the oxidative ring cleavage of 3,4-dihydroxybenzoate to produce beta-carboxy-cis,cis-muconate. 3,4-PCDs are large aggregates of 12 protomers, each composed of an alpha- and beta-subunit and an Fe3+ ion bound in the beta-subunit at the alpha-subunit-beta-subunit interface. 3,4-PCD is a member of the aromatic dioxygenases which are non-heme iron intradiol-cleaving enzymes that break the C1-C2 bond and utilize Fe3+.
Probab=96.95  E-value=0.011  Score=47.64  Aligned_cols=73  Identities=16%  Similarity=0.263  Sum_probs=51.2

Q ss_pred             cceEEEEEEEECCCCCeeeeEEEEeccC-----------------CC--eeeceEecCCceEEEE-ecCCC---------
Q psy4833           6 LRVAVVRGQVLSVQGLGITGIRVSVDKD-----------------SG--RFGFTLTRGGGWFDML-VNGGG---------   56 (141)
Q Consensus         6 ~r~~~IrG~V~D~~GePL~GVsV~Vkgt-----------------~~--~~G~TiTD~DG~FsL~-V~~g~---------   56 (141)
                      .....|+|+|+|.+|+||+||.|-|=..                 ++  ..|...||+||+|.+. +..+.         
T Consensus        34 G~~l~l~G~V~D~~g~Pi~gA~VeiWqad~~G~Y~~~~~~~~~~~~~f~~rGr~~TD~~G~y~F~Ti~Pg~Y~~~~g~~R  113 (185)
T cd03463          34 GERITLEGRVYDGDGAPVPDAMLEIWQADAAGRYAHPADSRRRLDPGFRGFGRVATDADGRFSFTTVKPGAVPGRDGAGQ  113 (185)
T ss_pred             CCEEEEEEEEECCCCCCCCCCEEEEEcCCCCCccCCcCCcccccCCCCCcEEEEEECCCCCEEEEEEcCCCcCCCCCCCc
Confidence            4678999999999999999999987211                 10  1145789999999996 32221         


Q ss_pred             --cEEEEEEecCc-cceEEEEEccC
Q psy4833          57 --AVTLQFQRSPY-RPQTRTVFIPW   78 (141)
Q Consensus        57 --~~tL~FSyiGY-~sqe~~V~~~~   78 (141)
                        .+-+.+..-|| +...-+++.+.
T Consensus       114 ~~HIH~~V~~~g~~~~L~Tqlyf~~  138 (185)
T cd03463         114 APHINVWVFARGLLKHLFTRIYFPD  138 (185)
T ss_pred             CCeEEEEEECCCcccceEEeEecCC
Confidence              13356666777 66666777765


No 32 
>smart00634 BID_1 Bacterial Ig-like domain (group 1).
Probab=96.86  E-value=0.01  Score=41.47  Aligned_cols=69  Identities=16%  Similarity=0.197  Sum_probs=48.9

Q ss_pred             CCcceEEEEEEEECCCCCeeeeEEEEeccCCCe-----eeceEecCCceEEEEecC--CCcEEEEEEecCccceEE
Q psy4833           4 DELRVAVVRGQVLSVQGLGITGIRVSVDKDSGR-----FGFTLTRGGGWFDMLVNG--GGAVTLQFQRSPYRPQTR   72 (141)
Q Consensus         4 d~~r~~~IrG~V~D~~GePL~GVsV~Vkgt~~~-----~G~TiTD~DG~FsL~V~~--g~~~tL~FSyiGY~sqe~   72 (141)
                      |......|+=+|+|++|.|++|+.|.+.-....     -+...||.+|+..+.+.+  .+.+++..+.-|..+++.
T Consensus        15 dg~d~~~i~v~v~D~~Gnpv~~~~V~f~~~~~~~~~~~~~~~~Td~~G~a~~~l~~~~~G~~~vta~~~~~~~~~~   90 (92)
T smart00634       15 NGSDAITLTATVTDANGNPVAGQEVTFTTPSGGALTLSKGTATTDANGIATVTLTSTTAGVYTVTASLENGSSSVK   90 (92)
T ss_pred             cCcccEEEEEEEECCCCCCcCCCEEEEEECCCceeeccCCeeeeCCCCEEEEEEECCCCcEEEEEEEECCCcceee
Confidence            445677899999999999999988887633200     023579999999998753  345567777777665543


No 33 
>PF10670 DUF4198:  Domain of unknown function (DUF4198)
Probab=96.52  E-value=0.012  Score=45.31  Aligned_cols=54  Identities=19%  Similarity=0.133  Sum_probs=39.3

Q ss_pred             EEEEEEECCCCCeeeeEEEEeccCCCe------eeceEecCCceEEEEecCCCcEEEEEEe
Q psy4833          10 VVRGQVLSVQGLGITGIRVSVDKDSGR------FGFTLTRGGGWFDMLVNGGGAVTLQFQR   64 (141)
Q Consensus        10 ~IrG~V~D~~GePL~GVsV~Vkgt~~~------~G~TiTD~DG~FsL~V~~g~~~tL~FSy   64 (141)
                      .++.+|+ -+|+||+|+.|.+-.....      .....||++|++.|.++..+...|..++
T Consensus       152 ~~~~~vl-~~GkPl~~a~V~~~~~~~~~~~~~~~~~~~TD~~G~~~~~~~~~G~wli~a~~  211 (215)
T PF10670_consen  152 PLPFQVL-FDGKPLAGAEVEAFSPGGWYDVEHEAKTLKTDANGRATFTLPRPGLWLIRASH  211 (215)
T ss_pred             EEEEEEE-ECCeEcccEEEEEEECCCccccccceEEEEECCCCEEEEecCCCEEEEEEEEE
Confidence            4677887 7899999999998544210      1238899999999998755554565554


No 34 
>COG3485 PcaH Protocatechuate 3,4-dioxygenase beta subunit [Secondary metabolites biosynthesis, transport, and catabolism]
Probab=96.49  E-value=0.0064  Score=50.60  Aligned_cols=73  Identities=21%  Similarity=0.311  Sum_probs=49.1

Q ss_pred             ceEEEEEEEECCCCCeeeeEEEEec-----------cCC--------CeeeceEecCCceEEEE-ecCC------C----
Q psy4833           7 RVAVVRGQVLSVQGLGITGIRVSVD-----------KDS--------GRFGFTLTRGGGWFDML-VNGG------G----   56 (141)
Q Consensus         7 r~~~IrG~V~D~~GePL~GVsV~Vk-----------gt~--------~~~G~TiTD~DG~FsL~-V~~g------~----   56 (141)
                      ..+.|+|+|+|.+|.||+||-|-|=           ..+        +..|.++||.+|+|.+. +.++      .    
T Consensus        71 e~i~l~G~VlD~~G~Pv~~A~VEiWQAda~GrY~~~~d~~~~~~~~f~g~Gr~~Td~~G~y~F~Ti~Pg~yp~~~~~~~~  150 (226)
T COG3485          71 ERILLEGRVLDGNGRPVPDALVEIWQADADGRYSHPKDSRLAPLPNFNGRGRTITDEDGEYRFRTIKPGPYPWRNGGPMW  150 (226)
T ss_pred             ceEEEEEEEECCCCCCCCCCEEEEEEcCCCCcccCccccccCcCccccceEEEEeCCCceEEEEEeecccccCCCCCCcC
Confidence            5789999999999999999999871           110        01267999999999996 2222      1    


Q ss_pred             ---cEEEEEEecCccc-eEEEEEccCc
Q psy4833          57 ---AVTLQFQRSPYRP-QTRTVFIPWN   79 (141)
Q Consensus        57 ---~~tL~FSyiGY~s-qe~~V~~~~n   79 (141)
                         .+.+.++--|+.. .+-.++++..
T Consensus       151 R~aHih~~v~~~g~~~~L~TqlYf~dd  177 (226)
T COG3485         151 RPAHIHFSVFARGINTRLVTQLYFPDD  177 (226)
T ss_pred             ccceeEEEEeCCcccceEEEEEecCCC
Confidence               3456666666633 3444555544


No 35 
>KOG1948|consensus
Probab=96.39  E-value=0.014  Score=57.21  Aligned_cols=64  Identities=25%  Similarity=0.367  Sum_probs=45.4

Q ss_pred             EEEEEEEECC-CCCeeeeEEEEeccCCCeeeceEecCCceEEEEe-cCCCcEEEEEE--ecCccceEEEEEc
Q psy4833           9 AVVRGQVLSV-QGLGITGIRVSVDKDSGRFGFTLTRGGGWFDMLV-NGGGAVTLQFQ--RSPYRPQTRTVFI   76 (141)
Q Consensus         9 ~~IrG~V~D~-~GePL~GVsV~Vkgt~~~~G~TiTD~DG~FsL~V-~~g~~~tL~FS--yiGY~sqe~~V~~   76 (141)
                      ..|+|+|++. +|.||.||-|.|.|..   + .-||++|.|.|.- -..+.++|...  ..-|.+...++..
T Consensus       316 fSvtGRVl~g~~g~~l~gvvvlvngk~---~-~kTdaqGyykLen~~t~gtytI~a~kehlqFstv~~kv~p  383 (1165)
T KOG1948|consen  316 FSVTGRVLVGSKGLPLSGVVVLVNGKS---G-GKTDAQGYYKLENLKTDGTYTITAKKEHLQFSTVHAKVKP  383 (1165)
T ss_pred             EEeeeeEEeCCCCCCccceEEEEcCcc---c-ceEcccceEEeeeeeccCcEEEEEeccceeeeeEEEEecC
Confidence            3789999996 8999999999999887   4 8999999999974 22333455443  3334444444443


No 36 
>PRK15310 fimbrial outer membrane usher protein TcfC; Provisional
Probab=95.44  E-value=0.056  Score=52.65  Aligned_cols=78  Identities=15%  Similarity=0.089  Sum_probs=52.2

Q ss_pred             eEEEEEEEECCCCCeeeeEEEEeccCCCeeeceEecCCceEEEEecCCCcEEEEEEecCcc-ceEEEEEccCccEEEeCC
Q psy4833           8 VAVVRGQVLSVQGLGITGIRVSVDKDSGRFGFTLTRGGGWFDMLVNGGGAVTLQFQRSPYR-PQTRTVFIPWNQIVVLPP   86 (141)
Q Consensus         8 ~~~IrG~V~D~~GePL~GVsV~Vkgt~~~~G~TiTD~DG~FsL~V~~g~~~tL~FSyiGY~-sqe~~V~~~~n~~~~~~~   86 (141)
                      +.++-|++.|++|.||.++.  |+++.   + +.||+||.|+|.++.... +|-+-.=+=. .=+..|.=..+.+.++.+
T Consensus       785 ~~~yiGr~~~~~G~~l~~a~--IlN~~---~-~~td~~GgF~~d~d~~~~-~L~ll~~~~~~~C~L~v~k~R~~i~~VGd  857 (895)
T PRK15310        785 SRLYVGRVLDKDGRPLLDAQ--PLNYP---F-LSLGPSGRFSLQSEHKES-SLWLLSKNRILRCPMSVHKRRDVMQVVGD  857 (895)
T ss_pred             EEEEEEEEECCCCCCcccce--eecCc---c-ceecCCCCEEEEcccCCc-eEEEEeCCcEEEcccchhhhcceEEEEEe
Confidence            34478999999999999987  56676   6 999999999999985543 4533322111 112233333444557788


Q ss_pred             eEEEec
Q psy4833          87 VKMQLS   92 (141)
Q Consensus        87 v~m~~~   92 (141)
                      |.-.+.
T Consensus       858 V~C~~~  863 (895)
T PRK15310        858 VRCELS  863 (895)
T ss_pred             EEeccC
Confidence            877655


No 37 
>PF11974 MG1:  Alpha-2-macroglobulin MG1 domain;  InterPro: IPR021868  This is the N-terminal MG1 domain from alpha-2-macroglobulin []. 
Probab=95.27  E-value=0.024  Score=40.74  Aligned_cols=39  Identities=21%  Similarity=0.251  Sum_probs=30.7

Q ss_pred             EEC-CCCCeeeeEEEEecc-CCC-eeeceEecCCceEEEEec
Q psy4833          15 VLS-VQGLGITGIRVSVDK-DSG-RFGFTLTRGGGWFDMLVN   53 (141)
Q Consensus        15 V~D-~~GePL~GVsV~Vkg-t~~-~~G~TiTD~DG~FsL~V~   53 (141)
                      |.| .+|+|++||.|.+-+ ..+ .++...||++|...|...
T Consensus        19 v~~L~tg~Pv~ga~V~l~~~~~~~~l~~g~TD~~G~a~~~~~   60 (97)
T PF11974_consen   19 VTSLSTGKPVAGAEVELYDSRNGQVLASGKTDADGFASFDST   60 (97)
T ss_pred             EeeCCCCCccCCCEEEEEECCCCcEeeeeeeCCCceEEecCC
Confidence            455 579999999999965 321 346678999999999876


No 38 
>PF01190 Pollen_Ole_e_I:  Pollen proteins Ole e I like;  InterPro: IPR006041 Allergies are hypersensitivity reactions of the immune system to specific substances called allergens (such as pollen, stings, drugs, or food) that, in most people, result in no symptoms. A nomenclature system has been established for antigens (allergens) that cause IgE-mediated atopic allergies in humans [WHO/IUIS Allergen Nomenclature Subcommittee King T.P., Hoffmann D., Loewenstein H., Marsh D.G., Platts-Mills T.A.E., Thomas W. Bull. World Health Organ. 72:797-806(1994)]. This nomenclature system is defined by a designation that is composed of the first three letters of the genus; a space; the first letter of the species name; a space and an arabic number. In the event that two species names have identical designations, they are discriminated from one another by adding one or more letters (as necessary) to each species designation.  The allergens in this family include allergens with the following designations: Ole e 1. A number of plant pollen proteins, whose biological function is not yet known, are structurally related []. These proteins are most probably secreted and consist of about 145 residues. There are six cysteines which are conserved in the sequence of these proteins. They seem to be involved in disulphide bonds. 
Probab=95.17  E-value=0.044  Score=38.64  Aligned_cols=36  Identities=19%  Similarity=0.233  Sum_probs=27.4

Q ss_pred             CCeeeeEEEEeccCC-----CeeeceEecCCceEEEEecCC
Q psy4833          20 GLGITGIRVSVDKDS-----GRFGFTLTRGGGWFDMLVNGG   55 (141)
Q Consensus        20 GePL~GVsV~Vkgt~-----~~~G~TiTD~DG~FsL~V~~g   55 (141)
                      ..||+||.|.+.=..     .......||.+|.|.|.++..
T Consensus        18 ~~~l~GA~V~v~C~~~~~~~~~~~~~~Td~~G~F~i~l~~~   58 (97)
T PF01190_consen   18 AKPLPGAKVSVECKDGNGGVVFSAEAKTDENGYFSIELPSD   58 (97)
T ss_pred             CccCCCCEEEEECCCCCCCcEEEEEEEeCCCCEEEEEecCc
Confidence            579999999995111     023558999999999999853


No 39 
>cd03457 intradiol_dioxygenase_like Intradiol dioxygenase supgroup. Intradiol dioxygenases catalyze the critical ring-cleavage step in the conversion of catecholate derivatives to citric acid cycle intermediates. They break the catechol C1-C2 bond and utilize Fe3+, as opposed to  the extradiol-cleaving enzymes which break the C2-C3 or C1-C6 bond and utilize Fe2+ and Mn+. The family contains catechol 1,2-dioxygenases and protocatechuate 3,4-dioxygenases. The specific function of this subgroup is unknown.
Probab=94.69  E-value=0.1  Score=42.02  Aligned_cols=47  Identities=15%  Similarity=-0.053  Sum_probs=35.4

Q ss_pred             CcceEEEEEEEECC-CCCeeeeEEEEeccC----------------------CCeeeceEecCCceEEEE
Q psy4833           5 ELRVAVVRGQVLSV-QGLGITGIRVSVDKD----------------------SGRFGFTLTRGGGWFDML   51 (141)
Q Consensus         5 ~~r~~~IrG~V~D~-~GePL~GVsV~Vkgt----------------------~~~~G~TiTD~DG~FsL~   51 (141)
                      +--..+|+|+|+|. +++||+||.|-+=..                      ....|...||++|+|++.
T Consensus        23 ~G~pl~l~g~V~D~~~c~Pv~~a~VdiWh~da~G~Ys~~~~~~~~~~~~~~~~flRG~~~TD~~G~~~F~   92 (188)
T cd03457          23 PGVPLTLDLQVVDVATCCPPPNAAVDIWHCDATGVYSGYSAGGGGGEDTDDETFLRGVQPTDADGVVTFT   92 (188)
T ss_pred             CCCEEEEEEEEEeCCCCccCCCeEEEEecCCCCCCCCCccCCccccccccCCCcCEEEEEECCCccEEEE
Confidence            34467999999995 699999999987210                      002266889999999996


No 40 
>COG5266 CbiK ABC-type Co2+ transport system, periplasmic component [Inorganic ion transport and metabolism]
Probab=92.43  E-value=0.22  Score=42.61  Aligned_cols=42  Identities=24%  Similarity=0.337  Sum_probs=31.7

Q ss_pred             EEEEEEECCCCCeeeeEEEEec-------c----CCC----eeeceEecCCceEEEEec
Q psy4833          10 VVRGQVLSVQGLGITGIRVSVD-------K----DSG----RFGFTLTRGGGWFDMLVN   53 (141)
Q Consensus        10 ~IrG~V~D~~GePL~GVsV~Vk-------g----t~~----~~G~TiTD~DG~FsL~V~   53 (141)
                      .++|+|+|. |.||++|+|-+.       +    +.+    .+- --||.+|.|++...
T Consensus       173 ~f~~~vl~~-GkPv~nA~V~v~~~n~~~~d~~a~~~~~ek~~~~-~~TD~kG~~~fip~  229 (264)
T COG5266         173 VFRGKVLDN-GKPVPNATVEVEFDNIDTKDNRAKTGNTEKTALV-QFTDDKGEVSFIPL  229 (264)
T ss_pred             eEEEEEEEC-CccCCCcEEEEEEecccccccccccCCCCCcceE-EEcCCCceEEEEEc
Confidence            689999998 999999999887       1    110    112 46899999999744


No 41 
>PF12985 DUF3869:  Domain of unknown function (DUF3869);  InterPro: IPR024620 This functionally uncharacterised domain is found in bacterial sequences, including the N-terminal region of a putative pore-forming toxin (YP_001301288.1) from Bacteroides vulgatus ATCC 8482.; PDB: 3G3L_A 3KOG_A.
Probab=92.22  E-value=0.75  Score=34.52  Aligned_cols=56  Identities=23%  Similarity=0.323  Sum_probs=39.5

Q ss_pred             ceEEEEEEEECC-CCCeee-eEEEEeccCCCeeeceEecCCceEEEEecCCCcEEEEEEecCccceEEEEEcc
Q psy4833           7 RVAVVRGQVLSV-QGLGIT-GIRVSVDKDSGRFGFTLTRGGGWFDMLVNGGGAVTLQFQRSPYRPQTRTVFIP   77 (141)
Q Consensus         7 r~~~IrG~V~D~-~GePL~-GVsV~Vkgt~~~~G~TiTD~DG~FsL~V~~g~~~tL~FSyiGY~sqe~~V~~~   77 (141)
                      -..+|.|.|+|. +|++|. |+.+.+.+..          ||.+.-..     ..+.-|.-||.+.+++|.++
T Consensus        20 A~~tI~~tV~da~tG~~vt~~a~vti~~~~----------~~~~A~~a-----~~vtas~~~y~~~~~tV~vp   77 (104)
T PF12985_consen   20 AKYTIAGTVYDAETGEDVTTTATVTISAGS----------DGTLAAKA-----VTVTASKDGYMTGSVTVNVP   77 (104)
T ss_dssp             -EEEEEEEEEETTTTEE-CGCSEEEETTCC----------CCCE--EC-----CEEEEEECTCEEEEEEEEE-
T ss_pred             CEEEEEEEEEecCCCCeeeccceEEEccCC----------Cccccccc-----EEEEEEccCCeeeEEEEEeh
Confidence            356999999996 899998 8888886443          55555322     23555899999999999988


No 42 
>PF14686 fn3_3:  Polysaccharide lyase family 4, domain II; PDB: 1NKG_A 2XHN_B 3NJX_A 3NJV_A.
Probab=91.46  E-value=1.9  Score=31.06  Aligned_cols=49  Identities=22%  Similarity=0.310  Sum_probs=23.9

Q ss_pred             EEEEEEECCCC---Cee-eeEEEEec-------cCCCeeeceEecCCceEEEE-ecCCCcEEE
Q psy4833          10 VVRGQVLSVQG---LGI-TGIRVSVD-------KDSGRFGFTLTRGGGWFDML-VNGGGAVTL   60 (141)
Q Consensus        10 ~IrG~V~D~~G---ePL-~GVsV~Vk-------gt~~~~G~TiTD~DG~FsL~-V~~g~~~tL   60 (141)
                      .|+|+|...+|   .|. ..+.|-+.       +..-.| .+-||.+|+|+|. |.+|. ..|
T Consensus         4 ~VsG~l~l~dg~~~~~~~~~~~Vgl~~~~d~~q~~~yqY-wt~td~~G~Fti~~V~pGt-Y~L   64 (95)
T PF14686_consen    4 SVSGRLTLSDGVTNPPAGANAVVGLAPPGDFQQNKGYQY-WTRTDSDGNFTIPNVRPGT-YRL   64 (95)
T ss_dssp             EEEEEEE---SS--TT--S-EEEEEE--------SS-EE-EEE--TTSEEE---B-SEE-EEE
T ss_pred             EEEEEEEEccCcccCccceeEEEEeeeccccccCCCCcE-EEEeCCCCcEEeCCeeCcE-eEE
Confidence            68999987666   333 34555554       332122 3889999999997 55564 444


No 43 
>PF08308 PEGA:  PEGA domain;  InterPro: IPR013229 This domain is found in both archaea and bacteria and has similarity to S-layer (surface layer) proteins. It is named after the characteristic PEGA sequence motif found in this domain. The secondary structure of this domain is predicted to be beta-strands.
Probab=90.81  E-value=0.99  Score=29.61  Aligned_cols=52  Identities=23%  Similarity=0.337  Sum_probs=38.8

Q ss_pred             eeEEEEeccCCCeeeceEecCCceEEEE-ecCCCcEEEEEEecCccceEEEEEccCccEEEe
Q psy4833          24 TGIRVSVDKDSGRFGFTLTRGGGWFDML-VNGGGAVTLQFQRSPYRPQTRTVFIPWNQIVVL   84 (141)
Q Consensus        24 ~GVsV~Vkgt~~~~G~TiTD~DG~FsL~-V~~g~~~tL~FSyiGY~sqe~~V~~~~n~~~~~   84 (141)
                      .||.|.+.|..  .|      ..-..+. ++.|. .+|++..-||++.+..+.+..++...+
T Consensus        11 ~gA~V~vdg~~--~G------~tp~~~~~l~~G~-~~v~v~~~Gy~~~~~~v~v~~~~~~~v   63 (71)
T PF08308_consen   11 SGAEVYVDGKY--IG------TTPLTLKDLPPGE-HTVTVEKPGYEPYTKTVTVKPGETTTV   63 (71)
T ss_pred             CCCEEEECCEE--ec------cCcceeeecCCcc-EEEEEEECCCeeEEEEEEECCCCEEEE
Confidence            48899998865  35      2233555 66564 589999999999999999997776654


No 44 
>TIGR02962 hdxy_isourate hydroxyisourate hydrolase. Members of this family, hydroxyisourate hydrolase, represent a distinct clade of transthyretin-related proteins. Bacterial members typically are encoded next to ureidoglycolate hydrolase and often near either xanthine dehydrogenase or xanthine/uracil permease genes and have been demonstrated to have hydroxyisourate hydrolase activity. In eukaryotes, a clade separate from the transthyretins (a family of thyroid-hormone binding proteins) has also been shown to have HIU hydrolase activity in urate catabolizing organisms. Transthyretin, then, would appear to be the recently diverged paralog of the more ancient HIUH family.
Probab=90.53  E-value=0.43  Score=35.68  Aligned_cols=59  Identities=22%  Similarity=0.351  Sum_probs=40.2

Q ss_pred             EEEEEECC-CCCeeeeEEEEeccCCC----eeeceEecCCceEEEEec-----CCCcEEEEEEecCccc
Q psy4833          11 VRGQVLSV-QGLGITGIRVSVDKDSG----RFGFTLTRGGGWFDMLVN-----GGGAVTLQFQRSPYRP   69 (141)
Q Consensus        11 IrG~V~D~-~GePL~GVsV~Vkgt~~----~~G~TiTD~DG~FsL~V~-----~g~~~tL~FSyiGY~s   69 (141)
                      |+=.|+|. .|.|-+||.|.+..-.+    .++...||+||+..-...     ..+...|+|.--.|=.
T Consensus         3 lstHVLDt~~G~PAagv~V~L~~~~~~~~~~i~~~~Tn~DGR~~~~l~~~~~~~~G~Y~l~F~~g~Yf~   71 (112)
T TIGR02962         3 LSTHVLDTTSGKPAAGVPVTLYRLDGSGWTPLAEGVTNADGRCPDLLPEGETLAAGIYKLRFDTGDYFA   71 (112)
T ss_pred             ceEEEEeCCCCccCCCCEEEEEEecCCCeEEEEEEEECCCCCCcCcccCcccCCCeeEEEEEEhhhhhh
Confidence            56679995 79999999999964221    234588999999973111     1245678887655543


No 45 
>PF01060 DUF290:  Transthyretin-like family;  InterPro: IPR001534 This new apparently nematode-specific protein family has been called family 2 []. The proteins show weak similarity to transthyretin (formerly called prealbumin) which transports thyroid hormones. The specific function of this protein is unknown.; GO: 0005615 extracellular space
Probab=90.07  E-value=0.56  Score=32.31  Aligned_cols=43  Identities=21%  Similarity=0.281  Sum_probs=30.8

Q ss_pred             EEEEECCCCCeeeeEEEEeccCCC-----eeeceEecCCceEEEEecCC
Q psy4833          12 RGQVLSVQGLGITGIRVSVDKDSG-----RFGFTLTRGGGWFDMLVNGG   55 (141)
Q Consensus        12 rG~V~D~~GePL~GVsV~Vkgt~~-----~~G~TiTD~DG~FsL~V~~g   55 (141)
                      +|+++= +|+|+.+|.|++-....     .++.+.||.+|+|.|.-...
T Consensus         1 ~G~L~C-~~~P~~~~~V~L~e~d~~~~Ddll~~~~Td~~G~F~l~G~~~   48 (80)
T PF01060_consen    1 KGQLMC-GGKPAKNVKVKLWEDDYFDPDDLLDETKTDSDGNFELSGSTN   48 (80)
T ss_pred             CeEEEe-CCccCCCCEEEEEECCCCCCCceeEEEEECCCceEEEEEEcc
Confidence            355542 48999999998843321     34669999999999996533


No 46 
>PF05738 Cna_B:  Cna protein B-type domain;  InterPro: IPR008454 This entry represents a repeated B region domain found in the collagen-binding surface protein Cna in Staphylococcus aureus, as well as other related domains. The B region domain of Cna has a prealbumin-like beta-sandwich fold of seven strands in two sheets with a Greek key topology []. However, this domain does not mediate collagen binding, the IPR008456 from INTERPRO region carries out that function; instead it appears to form a stalk that presents the ligand binding domain away from the bacterial cell surface. Cna is a collagen-binding MSCRAMM (Microbial Surface Component Recognizing Adhesive Matrix Molecules), and is necessary and sufficient for S. aureus cells to adhere to cartilage.; PDB: 2X5P_A 3RKP_A 3KPT_A 1VLF_T 1TI2_F 1TI6_D 1TI4_J 1VLE_V 1VLD_X 3PF2_A ....
Probab=89.27  E-value=3  Score=26.86  Aligned_cols=47  Identities=17%  Similarity=0.229  Sum_probs=31.8

Q ss_pred             eeeEEEEeccCCCee-ec--eEecCCceEEEE-ecCCCcEEEEEEe--cCccce
Q psy4833          23 ITGIRVSVDKDSGRF-GF--TLTRGGGWFDML-VNGGGAVTLQFQR--SPYRPQ   70 (141)
Q Consensus        23 L~GVsV~Vkgt~~~~-G~--TiTD~DG~FsL~-V~~g~~~tL~FSy--iGY~sq   70 (141)
                      |+||.+.+....+.. ..  ..||.||.|.+. ++.|. ..|....  -||...
T Consensus         1 L~Ga~f~L~~~~~~~~~~~~~~Td~~G~~~f~~L~~G~-Y~l~E~~aP~GY~~~   53 (70)
T PF05738_consen    1 LAGATFELYDEDGNEVIEVTVTTDENGKYTFKNLPPGT-YTLKETKAPDGYQLD   53 (70)
T ss_dssp             -STEEEEEEETTSEEEEEEEEEGGTTSEEEEEEEESEE-EEEEEEETTTTEEEE
T ss_pred             CCCeEEEEEECCCCEEEEEEEEECCCCEEEEeecCCeE-EEEEEEECCCCCEEC
Confidence            689999997654221 21  579999999997 55565 5677666  677643


No 47 
>cd05822 TLP_HIUase HIUase (5-hydroxyisourate hydrolase) catalyzes the second step in a three-step ureide pathway in which 5-hydroxyisourate (HIU), a product of the uricase (urate oxidase) reaction, is hydrolyzed to 2-oxo-4-hydroxy-4-carboxy-5-ureidoimidazoline (OHCU). HIUase has high sequence similarity with transthyretins and is a member of the transthyretin-like protein (TLP) family.   HIUase is distinguished from transthyretins by a conserved signature motif at its C-terminus that forms part of the active site.  In HIUase, this motif is YRGS, while transthyretins have a conserved TAVV sequence in the same location.  Most HIUases are cytosolic but in plants and slime molds, they are peroxisomal based on the presence of N-terminal periplasmic localization sequences.  HIUase forms a homotetramer with each subunit consisting of eight beta-strands arranged in two sheets and a short alpha-helix.  The central channel of the tetramer contains two independent binding sites, each located betw
Probab=87.74  E-value=1  Score=33.62  Aligned_cols=58  Identities=24%  Similarity=0.382  Sum_probs=39.7

Q ss_pred             EEEEEECC-CCCeeeeEEEEeccCCC----eeeceEecCCceEEEEecC-----CCcEEEEEEecCcc
Q psy4833          11 VRGQVLSV-QGLGITGIRVSVDKDSG----RFGFTLTRGGGWFDMLVNG-----GGAVTLQFQRSPYR   68 (141)
Q Consensus        11 IrG~V~D~-~GePL~GVsV~Vkgt~~----~~G~TiTD~DG~FsL~V~~-----g~~~tL~FSyiGY~   68 (141)
                      |+=.|+|. .|.|-+|+.|.+-.-..    .++...||+||+..-....     .+...|+|-=-.|=
T Consensus         3 lstHVLDt~~G~PAagv~V~L~~~~~~~~~~i~~~~Td~DGR~~~~~~~~~~~~~G~Y~l~F~~~~Yf   70 (112)
T cd05822           3 LSTHVLDTATGKPAAGVAVTLYRLDGNGWTLLATGVTNADGRCDDLLPPGAQLAAGTYKLTFDTGAYF   70 (112)
T ss_pred             ceeEEEeCCCCcccCCCEEEEEEecCCCeEEEEEEEECCCCCccCcccccccCCCeeEEEEEEhhhhh
Confidence            45578995 79999999999854321    2466899999999753321     24457887654553


No 48 
>KOG1948|consensus
Probab=84.31  E-value=2.3  Score=42.43  Aligned_cols=48  Identities=29%  Similarity=0.443  Sum_probs=38.5

Q ss_pred             EEEEEEEECCCCCeeeeEEEEeccCCCeeeceEecCCceEEEE-ecCCC
Q psy4833           9 AVVRGQVLSVQGLGITGIRVSVDKDSGRFGFTLTRGGGWFDML-VNGGG   56 (141)
Q Consensus         9 ~~IrG~V~D~~GePL~GVsV~Vkgt~~~~G~TiTD~DG~FsL~-V~~g~   56 (141)
                      ..|+|+|+-+-|-..+||.|..+........|.|+.+|.|.+. +.+|.
T Consensus       119 Fsv~GkVlgaaggGpagV~velrs~e~~iast~T~~~Gky~f~~iiPG~  167 (1165)
T KOG1948|consen  119 FSVRGKVLGAAGGGPAGVLVELRSQEDPIASTKTEDGGKYEFRNIIPGK  167 (1165)
T ss_pred             eeEeeEEeeccCCCcccceeecccccCcceeeEecCCCeEEEEecCCCc
Confidence            4699999998888999999999876222457999999999997 54453


No 49 
>PF10577 UPF0560:  Uncharacterised protein family UPF0560;  InterPro: IPR018890  This family of proteins has no known function. 
Probab=82.40  E-value=11  Score=37.05  Aligned_cols=85  Identities=12%  Similarity=0.055  Sum_probs=62.2

Q ss_pred             EEEEEEECC-CCCeeeeEEEEeccCCCeeeceEecCCceEEEEec--CCCcEEEEEEecCccceEE-------------E
Q psy4833          10 VVRGQVLSV-QGLGITGIRVSVDKDSGRFGFTLTRGGGWFDMLVN--GGGAVTLQFQRSPYRPQTR-------------T   73 (141)
Q Consensus        10 ~IrG~V~D~-~GePL~GVsV~Vkgt~~~~G~TiTD~DG~FsL~V~--~g~~~tL~FSyiGY~sqe~-------------~   73 (141)
                      +|+=+|.|. +=+||++|.|.|-++........|+.||.=-|.++  -|..+++.-+.-||.+..+             +
T Consensus         2 ~lKV~V~d~~t~qpl~~A~V~V~~N~t~~~s~~t~~dG~~~l~~~yrlg~~l~v~a~k~gyv~ns~Pw~~~rlPv~ssvs   81 (807)
T PF10577_consen    2 TLKVQVSDASTRQPLSDASVEVFGNQTLTASGTTGNDGVVLLKFPYRLGTWLIVTASKDGYVPNSVPWKTSRLPVYSSVS   81 (807)
T ss_pred             eEEEEEecccCcccCCCCeEEEEeceeEeecceecCCceEEEEEeccCCCeEEEEEecCCccCCCCCcccccccccccee
Confidence            567788886 68999999999976543456689999999999987  4667788889999975422             2


Q ss_pred             EEc---cCccEEEeCCeEEEecCC
Q psy4833          74 VFI---PWNQIVVLPPVKMQLSDA   94 (141)
Q Consensus        74 V~~---~~n~~~~~~~v~m~~~d~   94 (141)
                      +++   ....+.+.||+++|....
T Consensus        82 L~Llp~r~a~l~lyeD~v~I~~~~  105 (807)
T PF10577_consen   82 LSLLPERQATLWLYEDVVQITGGS  105 (807)
T ss_pred             eeeeccCcceEEEEeeeeeeeccC
Confidence            222   134566679999986644


No 50 
>PF12866 DUF3823:  Protein of unknown function (DUF3823);  InterPro: IPR024278 This is a family of uncharacterised proteins from Bacteroidetes. These proteins have characteristic DN and DR sequence-motifs but their function is not known.; PDB: 3HN5_B 4EIU_A.
Probab=80.55  E-value=10  Score=31.27  Aligned_cols=29  Identities=21%  Similarity=0.401  Sum_probs=17.6

Q ss_pred             CcCCcceEEEEEEEEC-CCCCeee----eEEEEec
Q psy4833           2 TDDELRVAVVRGQVLS-VQGLGIT----GIRVSVD   31 (141)
Q Consensus         2 ~~d~~r~~~IrG~V~D-~~GePL~----GVsV~Vk   31 (141)
                      ||| +-...++|+|+| .+|||+.    ||.|++-
T Consensus        16 NYD-~P~s~l~G~iiD~~tgE~i~~~~~gv~i~l~   49 (222)
T PF12866_consen   16 NYD-EPDSTLTGRIIDVYTGEPIQTDIGGVRIQLY   49 (222)
T ss_dssp             -------EEEEEEEEECCTTEE----STSSEEEEE
T ss_pred             CCc-CCCceEEEEEEEeecCCeeeecCCceEEEEE
Confidence            344 456799999999 5899876    6777774


No 51 
>cd05469 Transthyretin_like Transthyretin_like.  This domain is present in the transthyretin-like protein (TLP) family which includes transthyretin (TTR) and a transthyretin-related protein called 5-hydroxyisourate hydrolase (HIUase).  TTR and HIUase are homotetrameric proteins with each subunit consisting of eight beta-strands arranged in two sheets and a short alpha-helix. The central channel of the tetramer contains two independent binding sites, each located between a pair of subunits. TTR transports thyroid hormones and retinol in the blood serum of vertebrates while HIUase catalyzes the second step in a three-step ureide pathway. TTRs are highly conserved and found only in vertebrates while the HIUases are found in a wide range of bacterial, plant, fungal, slime mold and vertebrate organisms.
Probab=78.85  E-value=2.2  Score=32.11  Aligned_cols=57  Identities=19%  Similarity=0.323  Sum_probs=38.4

Q ss_pred             EEEEEECC-CCCeeeeEEEEeccCC-----CeeeceEecCCceEEEEe-c---CCCcEEEEEEecCc
Q psy4833          11 VRGQVLSV-QGLGITGIRVSVDKDS-----GRFGFTLTRGGGWFDMLV-N---GGGAVTLQFQRSPY   67 (141)
Q Consensus        11 IrG~V~D~-~GePL~GVsV~Vkgt~-----~~~G~TiTD~DG~FsL~V-~---~g~~~tL~FSyiGY   67 (141)
                      |+=.|+|. .|.|-+|+.|.+....     ..++...||+||+..-.. .   ..+...|+|.=-.|
T Consensus         3 lStHVLDt~~G~PAagv~V~L~~~~~~~~w~~l~~~~Tn~DGR~~~~l~~~~~~~G~Y~l~F~t~~Y   69 (113)
T cd05469           3 LMVKVLDAVRGSPAANVAIKVFRKTADGSWEIFATGKTNEDGELHGLITEEEFXAGVYRVEFDTKSY   69 (113)
T ss_pred             ceEEEEeCCCCccCCCCEEEEEEecCCCceEEEEEEEECCCCCccCccccccccceEEEEEEehHHh
Confidence            56679995 7999999999994321     124568999999996321 1   12446788764444


No 52 
>cd05821 TLP_Transthyretin Transthyretin (TTR) is a 55 kDa protein responsible for the transport of thyroid hormones and retinol in vertebrates.  TTR distributes the two thyroid hormones T3 (3,5,3'-triiodo-L-thyronine) and T4 (Thyroxin, or 3,5,3',5'-tetraiodo-L-thyronine), as well as retinol (vitamin A) through the formation of a macromolecular complex that includes each of these as well as retinol-binding protein.  Misfolded forms of TTR are implicated in the amyloid diseases familial amyloidotic polyneuropathy and senile systemic amyloidosis. TTR forms a homotetramer with each subunit consisting of eight beta-strands arranged in two sheets and a short alpha-helix. The central channel of the tetramer contains two independent binding sites, each located between a pair of subunits, which differ in their ligand binding affinity.  A negative cooperativity has been observed for the binding of T4 and other TTR ligands. A fraction of plasma TTR is carried in high density lipoproteins by bindi
Probab=78.61  E-value=2.7  Score=32.09  Aligned_cols=60  Identities=22%  Similarity=0.250  Sum_probs=40.7

Q ss_pred             EEEEEEEECC-CCCeeeeEEEEecc-C-C---CeeeceEecCCceEEEEec----CCCcEEEEEEecCcc
Q psy4833           9 AVVRGQVLSV-QGLGITGIRVSVDK-D-S---GRFGFTLTRGGGWFDMLVN----GGGAVTLQFQRSPYR   68 (141)
Q Consensus         9 ~~IrG~V~D~-~GePL~GVsV~Vkg-t-~---~~~G~TiTD~DG~FsL~V~----~g~~~tL~FSyiGY~   68 (141)
                      ..|+=-|+|. .|.|-.||.|.+.. . .   ..++...||+||+..--..    ..+...|+|-=-.|=
T Consensus         7 ~~ittHVLDt~~G~PAaGV~V~L~~~~~~~~w~~l~~~~Tn~DGR~~~ll~~~~~~~G~Y~l~F~tg~Yf   76 (121)
T cd05821           7 CPLMVKVLDAVRGSPAANVAVKVFKKTADGSWEPFASGKTTETGEIHGLTTDEQFTEGVYKVEFDTKAYW   76 (121)
T ss_pred             CCcEEEEEECCCCccCCCCEEEEEEecCCCceEEEEEEEECCCCCCCCccCccccCCeeEEEEEehhHhh
Confidence            4567789995 79999999999942 1 1   1346689999999963111    124567888655553


No 53 
>PF00576 Transthyretin:  HIUase/Transthyretin family;  InterPro: IPR023416 This family includes transthyretin that is a thyroid hormone-binding protein that transports thyroxine from the bloodstream to the brain. However, most of the sequences listed in this family do not bind thyroid hormones. They are actually enzymes of the purine catabolism that catalyse the conversion of 5-hydroxyisourate (HIU) to OHCU [, ]. HIU hydrolysis is the original function of the family and is conserved from bacteria to mammals; transthyretins arose by gene duplications in the vertebrate lineage [, ]. HIUases are distinguished in the alignment from the conserved C-terminal YRGS sequence. Transthyretin (formerly prealbumin) is one of 3 thyroid hormone-binding proteins found in the blood of vertebrates []. It is produced in the liver and circulates in the bloodstream, where it binds retinol and thyroxine (T4) []. It differs from the other 2 hormone-binding proteins (T4-binding globulin and albumin) in 3 distinct ways: (1) the gene is expressed at a high rate in the brain choroid plexus; (2) it is enriched in cerebrospinal fluid; and (3) no genetically caused absence has been observed, suggesting an essential role in brain function, distinct from that played in the bloodstream []. The protein consists of around 130 amino acids, which assemble as a homotetramer that contains an internal channel in which T4 is bound. Within this complex, T4 appears to be transported across the blood-brain barrier, where, in the choroid plexus, the hormone stimulates further synthesis of transthyretin. The protein then diffuses back into the bloodstream, where it binds T4 for transport back to the brain [].; PDB: 1TFP_B 1KGJ_D 1IE4_C 1GKE_C 1KGI_D 2H0J_B 2H0E_B 2H0F_B 1ZD6_A 3DGD_D ....
Probab=77.17  E-value=2.6  Score=31.50  Aligned_cols=39  Identities=18%  Similarity=0.265  Sum_probs=27.6

Q ss_pred             EEEEEECC-CCCeeeeEEEEeccCC-----CeeeceEecCCceEE
Q psy4833          11 VRGQVLSV-QGLGITGIRVSVDKDS-----GRFGFTLTRGGGWFD   49 (141)
Q Consensus        11 IrG~V~D~-~GePL~GVsV~Vkgt~-----~~~G~TiTD~DG~Fs   49 (141)
                      |+=.|+|. .|.|-.||.|.+-...     ..++...||+||+..
T Consensus         3 iStHVLDt~~G~PA~gv~V~L~~~~~~~~~~~l~~~~Td~DGR~~   47 (112)
T PF00576_consen    3 ISTHVLDTTTGKPAAGVPVTLYRLDSDGSWTLLAEGVTDADGRIK   47 (112)
T ss_dssp             EEEEEEETTTTEE-TT-EEEEEEEETTSCEEEEEEEEBETTSEES
T ss_pred             cEEEEeeCCCCCCccCCEEEEEEecCCCCcEEEEEEEECCCCccc
Confidence            56679995 7999999999984221     133568999999993


No 54 
>smart00095 TR_THY Transthyretin.
Probab=76.42  E-value=4.7  Score=30.75  Aligned_cols=58  Identities=22%  Similarity=0.260  Sum_probs=38.9

Q ss_pred             EEEEEEECC-CCCeeeeEEEEecc-C-C---CeeeceEecCCceEEEEec----CCCcEEEEEEecCc
Q psy4833          10 VVRGQVLSV-QGLGITGIRVSVDK-D-S---GRFGFTLTRGGGWFDMLVN----GGGAVTLQFQRSPY   67 (141)
Q Consensus        10 ~IrG~V~D~-~GePL~GVsV~Vkg-t-~---~~~G~TiTD~DG~FsL~V~----~g~~~tL~FSyiGY   67 (141)
                      .|+=.|+|. .|.|-.|+.|.+-. . .   ..++...||+||+-.--.+    ..+...|+|-=-.|
T Consensus         5 plTtHVLDt~~G~PAagv~V~L~~~~~~~~w~~la~~~Tn~DGR~~~ll~~~~~~~G~Y~l~F~tg~Y   72 (121)
T smart00095        5 PLMVKVLDAVRGSPAVNVAVKVFKKTEEGTWEPFASGKTNESGEIHELTTDEKFVEGLYKVEFDTKSY   72 (121)
T ss_pred             CeEEEEEECCCCccCCCCEEEEEEeCCCCceEEEEEEecCCCccccCccCcccccceEEEEEEehhHh
Confidence            467789995 79999999999932 1 1   1246689999999963111    12445788854444


No 55 
>PF03785 Peptidase_C25_C:  Peptidase family C25, C terminal ig-like domain;  InterPro: IPR005536 This domain is found in almost all members of MEROPS peptidase family C25, (clan CD). Peptidase family C25 is a protein family found in the bacteria Porphyromonas gingivalis (Bacteroides gingivalis) a Gram-negative anaerobic bacterial species strongly associated with adult periodontitis. One of its distinguishing characteristics and putative virulence properties is the ability to agglutinate erythrocytes []. It is a highly proteolytic organism which metabolises small peptides and amino acids. Indirect evidence suggests that the proteases produced by this microorganism constitute an important virulence factor []. Protease-encoding genes have been shown to contain multiple copies of repeated nucleotide sequences. These conserved sequences have also been found in haemagglutinin genes [].; GO: 0008233 peptidase activity, 0006508 proteolysis; PDB: 1CVR_A.
Probab=75.94  E-value=15  Score=26.51  Aligned_cols=51  Identities=20%  Similarity=0.330  Sum_probs=30.2

Q ss_pred             eeEEEEeccCCCeeeceEecCCceEEEEec----CCCcEEEEEEecCccceEEEEE
Q psy4833          24 TGIRVSVDKDSGRFGFTLTRGGGWFDMLVN----GGGAVTLQFQRSPYRPQTRTVF   75 (141)
Q Consensus        24 ~GVsV~Vkgt~~~~G~TiTD~DG~FsL~V~----~g~~~tL~FSyiGY~sqe~~V~   75 (141)
                      +|+.|.+-.+...+|..++| +|++.|..+    +.+.++|.+...-|++...++.
T Consensus        26 ~gs~ValS~dg~l~G~ai~~-sG~ati~l~~~it~~~~~tlTit~~n~~t~i~~i~   80 (81)
T PF03785_consen   26 PGSYVALSQDGDLYGKAIVN-SGNATINLTNPITDEGTLTLTITAFNYVTYIKTIQ   80 (81)
T ss_dssp             TT-EEEEEETTEEEEEEE-B-TTEEEEE-SS--TT-SEEEEEEE-TTB--EEEEEE
T ss_pred             CCcEEEEecCCEEEEEEEec-CceEEEECCcccCCCceEEEEEEEEccEEEEEEee
Confidence            56677775443356888999 999999876    3455677777766666555543


No 56 
>PF09430 DUF2012:  Protein of unknown function (DUF2012);  InterPro: IPR019008  This domain is found in different proteins, including uncharacterised protein family UPF0480 and nodal modulators. A nodal modulator has been identified as part of a protein complex that participates in the nodal signaling pathway during vertebrate development []. 
Probab=73.14  E-value=23  Score=26.15  Aligned_cols=54  Identities=20%  Similarity=0.189  Sum_probs=36.3

Q ss_pred             eeeEEEEeccCCCeeeceEecCCceEEEE-ecCCCcEEEEEEecCccceEEEEEcc
Q psy4833          23 ITGIRVSVDKDSGRFGFTLTRGGGWFDML-VNGGGAVTLQFQRSPYRPQTRTVFIP   77 (141)
Q Consensus        23 L~GVsV~Vkgt~~~~G~TiTD~DG~FsL~-V~~g~~~tL~FSyiGY~sqe~~V~~~   77 (141)
                      ++..+|.+.+.....-.+..+.||.|.+. |+.|. ..|++....|.=....|.+.
T Consensus         7 ~~~t~V~L~~g~~~~~~~~v~~dG~F~f~~Vp~Gs-Y~L~V~s~~~~F~~~RVdV~   61 (123)
T PF09430_consen    7 PSSTRVTLNGGQYRPISAFVRSDGSFVFHNVPPGS-YLLEVHSPDYVFPPYRVDVS   61 (123)
T ss_pred             CCCEEEEEeCCCccceEEEecCCCEEEeCCCCCce-EEEEEECCCccccCEEEEEe
Confidence            45667777655410013888999999997 77664 57999888876544555544


No 57 
>PF13953 PapC_C:  PapC C-terminal domain; PDB: 3L48_E 2XET_A 3RFZ_E 2KT6_A.
Probab=68.35  E-value=32  Score=22.77  Aligned_cols=47  Identities=15%  Similarity=0.097  Sum_probs=25.8

Q ss_pred             EEECCCCCeee-eEEEEeccCCCeeeceEecCCceEEEEecCCCcEEEEEEe
Q psy4833          14 QVLSVQGLGIT-GIRVSVDKDSGRFGFTLTRGGGWFDMLVNGGGAVTLQFQR   64 (141)
Q Consensus        14 ~V~D~~GePL~-GVsV~Vkgt~~~~G~TiTD~DG~FsL~V~~g~~~tL~FSy   64 (141)
                      ++.+.+|+||| |+.|...+.. . - .+...+|+--|.-..... +|.+..
T Consensus         2 ~l~~~~G~~lPfGA~v~~~~g~-~-~-g~Vg~~G~vyl~~~~~~~-~L~V~w   49 (68)
T PF13953_consen    2 TLRDADGKPLPFGASVSDEDGN-N-I-GIVGQDGQVYLSGLPPKG-TLTVKW   49 (68)
T ss_dssp             EEEETTSEE--TT-EEEETTSS-E-E-EEB-GCGEEEEEEE-TCE-EEEEES
T ss_pred             EEEcCCCCcCCCCcEEEcCCCC-E-E-EEEcCCCEEEEECCCCCc-EEEEEE
Confidence            46788999997 7777765444 3 2 466667777776432221 465555


No 58 
>COG2351 Transthyretin-like protein [General function prediction only]
Probab=62.81  E-value=18  Score=27.99  Aligned_cols=56  Identities=21%  Similarity=0.340  Sum_probs=36.9

Q ss_pred             EEEEEEECC-CCCeeeeEEEEec---cCC-CeeeceEecCCceEEEEecCC-----CcEEEEEEec
Q psy4833          10 VVRGQVLSV-QGLGITGIRVSVD---KDS-GRFGFTLTRGGGWFDMLVNGG-----GAVTLQFQRS   65 (141)
Q Consensus        10 ~IrG~V~D~-~GePL~GVsV~Vk---gt~-~~~G~TiTD~DG~FsL~V~~g-----~~~tL~FSyi   65 (141)
                      .++=-|+|. .|.|-+||.|.+.   ++. ..+..+.||.||+-+...-.+     +...|+|--=
T Consensus        10 ~LTTHVLDta~GkPAagv~V~L~rl~~~~~~~l~t~~Tn~DGR~d~pll~g~~~~~G~Y~l~F~~g   75 (124)
T COG2351          10 RLTTHVLDTASGKPAAGVKVELYRLEGNQWELLKTVVTNADGRIDAPLLAGETLATGIYELVFHTG   75 (124)
T ss_pred             eeeeeeeecccCCcCCCCEEEEEEecCCcceeeeEEEecCCCcccccccCccccccceEEEEEEcc
Confidence            455678995 7999999999874   221 012458999999999443211     3456777543


No 59 
>KOG3006|consensus
Probab=56.99  E-value=65  Score=25.19  Aligned_cols=54  Identities=24%  Similarity=0.394  Sum_probs=37.2

Q ss_pred             EEEEEEECC-CCCeeeeEEEEec---cC--CCeeeceEecCCceEEEEecC----CCcEEEEEE
Q psy4833          10 VVRGQVLSV-QGLGITGIRVSVD---KD--SGRFGFTLTRGGGWFDMLVNG----GGAVTLQFQ   63 (141)
Q Consensus        10 ~IrG~V~D~-~GePL~GVsV~Vk---gt--~~~~G~TiTD~DG~FsL~V~~----g~~~tL~FS   63 (141)
                      -|+-.|+|. .|.|-+||.|.+-   ++  ...+|..-|+.||+-....+.    .+..+|+|-
T Consensus        22 ~itahVLd~s~GsPA~gVqV~~f~~~~~~~w~~igs~~T~~nGrv~~~~~~~tl~~GtYr~~~d   85 (132)
T KOG3006|consen   22 PITAHVLDISRGSPAAGVQVHLFILANDDTWTPIGSGFTQDNGRVDWVSPDFTLIPGTYRLVFD   85 (132)
T ss_pred             CcEeEEeecccCCcccceEEEEEEecCCCcccCccccccccCceeecccchhhhccceEEEEEe
Confidence            467789996 6999999998762   22  113577889999998865331    234566663


No 60 
>PF14054 DUF4249:  Domain of unknown function (DUF4249)
Probab=56.42  E-value=1e+02  Score=24.76  Aligned_cols=66  Identities=11%  Similarity=0.205  Sum_probs=40.4

Q ss_pred             eeeeEEEEe-ccCCCeeeceEecCC---ceEE----EEecCCCcEEEEEEecCccceEEEEEccCccEEEeCCeEEE
Q psy4833          22 GITGIRVSV-DKDSGRFGFTLTRGG---GWFD----MLVNGGGAVTLQFQRSPYRPQTRTVFIPWNQIVVLPPVKMQ   90 (141)
Q Consensus        22 PL~GVsV~V-kgt~~~~G~TiTD~D---G~Fs----L~V~~g~~~tL~FSyiGY~sqe~~V~~~~n~~~~~~~v~m~   90 (141)
                      ++.||+|.+ .+.. .......+.+   |.|.    +....|...+|.+++-|.+..+-...+|..-  -++.|...
T Consensus        59 ~v~~A~V~i~~~~~-~~~~~~~~~~~~~g~Y~~~~~~~~~~G~~Y~L~V~~~~~~~~sa~~~vp~~~--~i~~v~~~  132 (298)
T PF14054_consen   59 PVSGATVTIYEDGQ-GNEYLFEESSNNDGVYYSSNSFRGRPGRTYRLEVETPGGKTYSAETTVPPPP--PIDSVSYE  132 (298)
T ss_pred             ccCCcEEEEEeCCC-cceEeecccCCCcceEEecccccccCCCEEEEEEEECCCCEEEEEEEECCCC--ceeEEEEE
Confidence            389999999 4443 1122333333   8887    3344577789999998887766665555333  33444444


No 61 
>PF05688 DUF824:  Salmonella repeat of unknown function (DUF824);  InterPro: IPR008542 This family consists of a series of repeated sequences (of around 180 residues) which are found in Salmonella typhimurium, Salmonella typhi and Escherichia coli. These repeats are almost always found with this entry. The repeats are associated with RatA and RatB, the coding sequences of which are found in the pathogeneicity island of Salmonella. The sequences may be determinants of pathogenicity [, ].
Probab=52.41  E-value=23  Score=22.89  Aligned_cols=26  Identities=8%  Similarity=0.110  Sum_probs=20.9

Q ss_pred             eEEEEEEEECCCCCeeeeEEEEeccC
Q psy4833           8 VAVVRGQVLSVQGLGITGIRVSVDKD   33 (141)
Q Consensus         8 ~~~IrG~V~D~~GePL~GVsV~Vkgt   33 (141)
                      .+.++=+|.|.+|.|++++.+.++..
T Consensus        14 ~I~ltVt~kda~G~pv~n~~f~l~r~   39 (47)
T PF05688_consen   14 TIPLTVTVKDANGNPVPNAPFTLTRG   39 (47)
T ss_pred             eEEEEEEEECCCCCCcCCceEEEEec
Confidence            35566678999999999999888644


No 62 
>COG1470 Predicted membrane protein [Function unknown]
Probab=51.26  E-value=85  Score=29.50  Aligned_cols=77  Identities=16%  Similarity=0.157  Sum_probs=60.8

Q ss_pred             eEEEEEEEECCCCCeeeeEEEEeccCCCeeeceEecCCceEEEEecCCCcEEEEEEecC-ccceEEEEEccCccEEEeCC
Q psy4833           8 VAVVRGQVLSVQGLGITGIRVSVDKDSGRFGFTLTRGGGWFDMLVNGGGAVTLQFQRSP-YRPQTRTVFIPWNQIVVLPP   86 (141)
Q Consensus         8 ~~~IrG~V~D~~GePL~GVsV~Vkgt~~~~G~TiTD~DG~FsL~V~~g~~~tL~FSyiG-Y~sqe~~V~~~~n~~~~~~~   86 (141)
                      ....+=+++|++|.|++-+-|...+..     --|+.+|+=.+.+.+|.. .+.+..-| |+...+.+.++.+....++-
T Consensus       186 ~a~~~l~~vd~~G~gv~~~~v~~g~e~-----~ets~~g~~~~e~t~g~y-~~~i~~~g~ye~~~~av~l~d~~t~dLkl  259 (513)
T COG1470         186 KAQSKLRLVDDDGAGVPKALVKDGNES-----FETSSKGNLEVEITPGKY-VVLIAKKGIYEKKKRAVKLNDGETKDLKL  259 (513)
T ss_pred             cceEEEEEEccCCCccchheeecCcee-----EEeecccceeEEecCcce-EEEeccccceecceEEEEcCCCcccceeE
Confidence            345566889999999999999887554     457888988888887765 58889999 99999999999887766655


Q ss_pred             eEEE
Q psy4833          87 VKMQ   90 (141)
Q Consensus        87 v~m~   90 (141)
                      ....
T Consensus       260 s~~~  263 (513)
T COG1470         260 SVTE  263 (513)
T ss_pred             EEEe
Confidence            5444


No 63 
>PRK15314 outer membrane protein RatB; Provisional
Probab=46.70  E-value=49  Score=36.05  Aligned_cols=24  Identities=13%  Similarity=0.221  Sum_probs=19.5

Q ss_pred             EEEEEEECCCCCeeeeEEEEeccC
Q psy4833          10 VVRGQVLSVQGLGITGIRVSVDKD   33 (141)
Q Consensus        10 ~IrG~V~D~~GePL~GVsV~Vkgt   33 (141)
                      -++=+|+|..|+|++|+.|.++..
T Consensus      1609 PmtVTvtdsaGnP~~~a~~~l~Rg 1632 (2435)
T PRK15314       1609 AATIDVKDAAGQPMKNVMVKISRG 1632 (2435)
T ss_pred             eEEEEEecCCCCCCCCceEEEEec
Confidence            344478999999999999999743


No 64 
>PF09244 DUF1964:  Domain of unknown function (DUF1964);  InterPro: IPR015325 This domain is C-terminal to the catalytic sucrose phosphorylase beta/alpha barrel domain. It adopts a beta-sandwich fold, with Greek-key topology and is functionally uncharacterised []. ; PDB: 1R7A_B 2GDU_A 2GDV_A.
Probab=45.46  E-value=46  Score=23.27  Aligned_cols=27  Identities=15%  Similarity=0.218  Sum_probs=20.0

Q ss_pred             CCceEEEEecCCCcEEEEEEecCccceEE
Q psy4833          44 GGGWFDMLVNGGGAVTLQFQRSPYRPQTR   72 (141)
Q Consensus        44 ~DG~FsL~V~~g~~~tL~FSyiGY~sqe~   72 (141)
                      .||.|+..+.++.  .|.|+..|..++.-
T Consensus         2 FdG~FSy~~dgdt--Sitf~W~g~~t~at   28 (68)
T PF09244_consen    2 FDGEFSYEADGDT--SITFTWTGATTSAT   28 (68)
T ss_dssp             GGSEEEEEEETTT--EEEEEEE-SS-EEE
T ss_pred             ccceeeEecCCCc--EEEEEEeccccEEE
Confidence            4899999998665  48999999976643


No 65 
>PF13115 YtkA:  YtkA-like
Probab=42.89  E-value=30  Score=23.12  Aligned_cols=22  Identities=18%  Similarity=0.214  Sum_probs=17.4

Q ss_pred             eEEEEEEEECCCCCeeeeEEEEecc
Q psy4833           8 VAVVRGQVLSVQGLGITGIRVSVDK   32 (141)
Q Consensus         8 ~~~IrG~V~D~~GePL~GVsV~Vkg   32 (141)
                      .+.|+   +|.+|+|+.|+.|.+.-
T Consensus        23 ~i~v~---~~~~g~pv~~a~V~~~~   44 (86)
T PF13115_consen   23 TITVT---VDQGGKPVTDADVQFEI   44 (86)
T ss_pred             EEEEE---ECCCCCCCCCCEEEEEE
Confidence            34554   78899999999998863


No 66 
>PF03717 PBP_dimer:  Penicillin-binding Protein dimerisation domain;  InterPro: IPR005311 This domain is found at the N terminus of Class B High Molecular Weight Penicillin-Binding Proteins. Its function has not been precisely defined, but is strongly implicated in PBP polymerisation. The domain forms a largely disordered "sugar tongs" structure.; GO: 0008658 penicillin binding; PDB: 3EQU_B 3EQV_A 3UE3_A 3PBR_A 3PBT_A 1QME_A 1RP5_B 2ZC3_A 2Z2M_D 2ZC4_D ....
Probab=42.19  E-value=15  Score=27.72  Aligned_cols=17  Identities=18%  Similarity=0.294  Sum_probs=9.3

Q ss_pred             EEEEEEECCCCCeeeeE
Q psy4833          10 VVRGQVLSVQGLGITGI   26 (141)
Q Consensus        10 ~IrG~V~D~~GePL~GV   26 (141)
                      --||.+.|.+|.||++=
T Consensus         4 a~RG~I~DrnG~~La~n   20 (178)
T PF03717_consen    4 APRGNIYDRNGRPLATN   20 (178)
T ss_dssp             E----EE-TTS-EEEEE
T ss_pred             CCCCCEEeCCCCEeEEe
Confidence            46899999999999873


No 67 
>PF11184 DUF2969:  Protein of unknown function (DUF2969);  InterPro: IPR021351  This family of proteins with unknown function appears to be restricted to Lactobacillales. 
Probab=41.34  E-value=93  Score=21.50  Aligned_cols=39  Identities=15%  Similarity=0.323  Sum_probs=27.8

Q ss_pred             EEECCCCCeeeeEEEEeccCCCeeeceEecCCceEEEEecCC
Q psy4833          14 QVLSVQGLGITGIRVSVDKDSGRFGFTLTRGGGWFDMLVNGG   55 (141)
Q Consensus        14 ~V~D~~GePL~GVsV~Vkgt~~~~G~TiTD~DG~FsL~V~~g   55 (141)
                      +|.|.+..-.+|-.|.+ |.. ..| .|.-.||+|.+...++
T Consensus        11 ~i~d~~~~~~~~~~l~I-gkk-~IG-~I~e~d~~fav~~~~~   49 (71)
T PF11184_consen   11 EIKDTKVNGQPGYELFI-GKK-VIG-EIEEDDGRFAVVKNDN   49 (71)
T ss_pred             EEEecccCCeEEEEEEE-CCE-EEE-EEEEcCCcEEEEeCCC
Confidence            56676666677778888 665 567 8888899996665533


No 68 
>PRK15207 long polar fimbrial outer membrane usher protein LpfC; Provisional
Probab=38.17  E-value=1e+02  Score=30.08  Aligned_cols=48  Identities=10%  Similarity=0.031  Sum_probs=27.9

Q ss_pred             cCCcceEEEEEEEECCCCCeee-eEEEEeccCCCeeeceEecCCceEEEE
Q psy4833           3 DDELRVAVVRGQVLSVQGLGIT-GIRVSVDKDSGRFGFTLTRGGGWFDML   51 (141)
Q Consensus         3 ~d~~r~~~IrG~V~D~~GePL~-GVsV~Vkgt~~~~G~TiTD~DG~FsL~   51 (141)
                      |+-.+...+-=+|.|++|+||| |+.|.+.+.....- .+-..||.==|.
T Consensus       751 F~~~~g~~~ll~l~~~~G~~lP~Ga~V~~~~~~~~~~-g~Vg~~G~vyl~  799 (842)
T PRK15207        751 FDTRVGYRVLMSLKQANGNAVPFGATATLIDTTKEAS-SIVGEEGQLYIS  799 (842)
T ss_pred             EEEeeeeEEEEEEEcCCCCcCCccEEEEecCCCCceE-EEECCCCEEEEE
Confidence            4444444445567889999998 88877643321111 344556764444


No 69 
>PF07492 Trehalase_Ca-bi:  Neutral trehalase Ca2+ binding domain;  InterPro: IPR011120 Neutral trehalases mobilise trehalose accumulated by fungal cells as a protective and storage carbohydrate. This family represents a calcium-binding domain similar to EF hand. Residues 97 and 108 in O42893 from SWISSPROT have been implicated in this interaction. It is thought that this domain may provide a general mechanism for regulating neutral trehalase activity in yeasts and filamentous fungi [].; GO: 0004555 alpha,alpha-trehalase activity, 0005509 calcium ion binding, 0005993 trehalose catabolic process, 0005737 cytoplasm
Probab=37.98  E-value=35  Score=20.46  Aligned_cols=18  Identities=22%  Similarity=0.427  Sum_probs=13.5

Q ss_pred             cCCceEEEEecCCCcEEE
Q psy4833          43 RGGGWFDMLVNGGGAVTL   60 (141)
Q Consensus        43 D~DG~FsL~V~~g~~~tL   60 (141)
                      |-||+|.|.+...+..+|
T Consensus         9 DTDgn~qITIeD~GPKv~   26 (30)
T PF07492_consen    9 DTDGNFQITIEDTGPKVL   26 (30)
T ss_pred             ccCCCcEEEEecCCCeEE
Confidence            779999999975555444


No 70 
>PF06115 DUF956:  Domain of unknown function (DUF956);  InterPro: IPR010360 This is a family of bacterial sequences with undetermined function.
Probab=37.46  E-value=68  Score=24.63  Aligned_cols=33  Identities=21%  Similarity=0.272  Sum_probs=25.3

Q ss_pred             eeeeEEEEeccC--CCeeeceEecCCceEEEEecCCC
Q psy4833          22 GITGIRVSVDKD--SGRFGFTLTRGGGWFDMLVNGGG   56 (141)
Q Consensus        22 PL~GVsV~Vkgt--~~~~G~TiTD~DG~FsL~V~~g~   56 (141)
                      -.++|+|..+|.  + +|. ..|+.+|.|.+.....+
T Consensus        54 ~~V~a~V~fkgk~I~-RF~-I~Tk~~G~f~Fsskd~k   88 (118)
T PF06115_consen   54 DYVIASVSFKGKWIP-RFA-IFTKKNGKFTFSSKDSK   88 (118)
T ss_pred             eEEEEEEEECCCEEe-eEE-EEECCCCEEEEEECChH
Confidence            356889998876  4 555 88999999999986443


No 71 
>COG3188 FimD P pilus assembly protein, porin PapC [Cell motility and secretion / Intracellular trafficking and secretion]
Probab=35.14  E-value=1.6e+02  Score=28.88  Aligned_cols=49  Identities=10%  Similarity=-0.069  Sum_probs=34.1

Q ss_pred             cCCcceEEEEEEEECCCCCeee-eEEEEeccCCCeeeceEecCCceEEEEec
Q psy4833           3 DDELRVAVVRGQVLSVQGLGIT-GIRVSVDKDSGRFGFTLTRGGGWFDMLVN   53 (141)
Q Consensus         3 ~d~~r~~~IrG~V~D~~GePL~-GVsV~Vkgt~~~~G~TiTD~DG~FsL~V~   53 (141)
                      |+......+--++.+++|+||| |+.|...+.. ..- .+.+.||+.=|...
T Consensus       747 F~t~~g~~~li~l~~~dG~~lPfGa~v~~~~~~-~~~-g~Vg~~G~vyl~gl  796 (835)
T COG3188         747 FDTRIGYRALITLRRADGSPLPFGAEVTDEGGG-QNV-GIVGDDGQVYLRGL  796 (835)
T ss_pred             EeeeceeEEEEEEECCCCCcCCCceEEEECCCc-eEE-EEEecCCeEEEecC
Confidence            4444445555677889999998 7888877653 212 57788999999753


No 72 
>PF03983 SHD1:  SLA1 homology domain 1, SHD1 ;  InterPro: IPR007131 The SLA1 homology domain is found in the cytoskeleton assembly control protein SLA1, which is responsible for the correct formation of the actin cytoskeleton.; GO: 0008092 cytoskeletal protein binding, 0030674 protein binding, bridging, 0042802 identical protein binding, 0043130 ubiquitin binding; PDB: 2HBP_A.
Probab=32.91  E-value=31  Score=24.11  Aligned_cols=43  Identities=14%  Similarity=0.051  Sum_probs=23.7

Q ss_pred             eEecCCceEEEEecCCCcEEEEEEecCccceEEEEEccCccEEEeCCeEEEec
Q psy4833          40 TLTRGGGWFDMLVNGGGAVTLQFQRSPYRPQTRTVFIPWNQIVVLPPVKMQLS   92 (141)
Q Consensus        40 TiTD~DG~FsL~V~~g~~~tL~FSyiGY~sqe~~V~~~~n~~~~~~~v~m~~~   92 (141)
                      +=||.+|.|.+..          +|+|+..-.+.+.-..+..+-+|--.|...
T Consensus        14 tWtD~tG~f~VeA----------~fv~~~dgkV~L~k~nG~~i~VP~~kLS~~   56 (70)
T PF03983_consen   14 TWTDRTGKFKVEA----------EFVGVNDGKVHLHKTNGVKIAVPLEKLSDE   56 (70)
T ss_dssp             EEEBSSS--EEEE----------EEEEEETTEEEEE-TTS-EEEEETTSB-HH
T ss_pred             EEEeCCCCEEEEE----------EEEEeeCCEEEEEecCCeEEEeEhHHcCHH
Confidence            7899999998864          355555555655555555555555444443


No 73 
>KOG4217|consensus
Probab=32.16  E-value=15  Score=34.65  Aligned_cols=15  Identities=60%  Similarity=1.101  Sum_probs=11.5

Q ss_pred             CcccccccCCCCCCcceeeecc
Q psy4833         113 GNSDLQHYGPRSSNKTVRHCES  134 (141)
Q Consensus       113 ~~~~~~~~~~~~~~~~~~~~~~  134 (141)
                      -+++|||||       ||-||.
T Consensus       277 DnAaCqHYG-------vRTCEG  291 (605)
T KOG4217|consen  277 DNAACQHYG-------VRTCEG  291 (605)
T ss_pred             ChHHhhhcC-------cccccc
Confidence            468999999       566764


No 74 
>PRK15248 fimbrial outer membrane usher protein StbC; Provisional
Probab=31.41  E-value=1.6e+02  Score=28.76  Aligned_cols=47  Identities=13%  Similarity=-0.085  Sum_probs=28.6

Q ss_pred             CcCCcceEEEEEEEECCCCCeee-eEEEEeccCCCeeeceEecCCceEEEE
Q psy4833           2 TDDELRVAVVRGQVLSVQGLGIT-GIRVSVDKDSGRFGFTLTRGGGWFDML   51 (141)
Q Consensus         2 ~~d~~r~~~IrG~V~D~~GePL~-GVsV~Vkgt~~~~G~TiTD~DG~FsL~   51 (141)
                      .|+-.+...+--+|.+++|+||| |+.|...+.. . - .+-..+|.-=|.
T Consensus       762 ~F~~~~g~~~l~~l~~~dG~~lP~Ga~V~~~~g~-~-~-g~Vg~~G~vyl~  809 (853)
T PRK15248        762 NFATISGKAVLISVKMPDGGIPPMGADVFNGEGT-N-I-GMVGQSGQIYAR  809 (853)
T ss_pred             EEEEeeceEEEEEEEcCCCCcCCCceEEECCCCc-E-E-EEEcCCCEEEEE
Confidence            35555666666778899999986 6666543332 2 1 344556665554


No 75 
>COG3422 Uncharacterized conserved protein [Function unknown]
Probab=31.06  E-value=58  Score=22.28  Aligned_cols=28  Identities=11%  Similarity=0.359  Sum_probs=20.0

Q ss_pred             ecCCceEEEEecC-CCcEEEEEEecCccceE
Q psy4833          42 TRGGGWFDMLVNG-GGAVTLQFQRSPYRPQT   71 (141)
Q Consensus        42 TD~DG~FsL~V~~-g~~~tL~FSyiGY~sqe   71 (141)
                      -|.+|.|.+.... ++  .+.|+--||+++.
T Consensus         9 kdk~Ge~rfrlkA~N~--eiI~~sEgY~ska   37 (59)
T COG3422           9 KDKAGEYRFRLKAANG--EIILTSEGYKSKA   37 (59)
T ss_pred             EcCCCcEEEEEEccCc--cEEEeecccchhH
Confidence            4889998887653 33  3667889998764


No 76 
>PF14289 DUF4369:  Domain of unknown function (DUF4369)
Probab=31.03  E-value=1.7e+02  Score=19.61  Aligned_cols=44  Identities=11%  Similarity=0.145  Sum_probs=25.8

Q ss_pred             ceEEEEEEEECC-CCCeeeeEEEEeccCCCeeeceEecCCceEEEEec
Q psy4833           7 RVAVVRGQVLSV-QGLGITGIRVSVDKDSGRFGFTLTRGGGWFDMLVN   53 (141)
Q Consensus         7 r~~~IrG~V~D~-~GePL~GVsV~Vkgt~~~~G~TiTD~DG~FsL~V~   53 (141)
                      -..+|+|++.+. +|.-+-=...  .+.....-.+..+ ||+|++..+
T Consensus        11 ~~~~I~G~i~~~~~~~~vyL~~~--~~~~~~~ds~~v~-nG~F~f~~~   55 (106)
T PF14289_consen   11 KQFTIEGKIKGLPDGDKVYLYYY--DNGKVVIDSVVVK-NGKFSFKGP   55 (106)
T ss_pred             CcEEEEEEEcCCCCCCEEEEEEe--CCCCEEEEEEEEe-CCEEEEEEe
Confidence            468999999887 6655433322  1111011223334 999999865


No 77 
>PF13753 SWM_repeat:  Putative flagellar system-associated repeat
Probab=30.29  E-value=88  Score=26.01  Aligned_cols=44  Identities=20%  Similarity=0.368  Sum_probs=30.6

Q ss_pred             cceEEEEEEEEC-CCCCeeeeEEEEeccCCCeeeceEecCCceEEEEecCC
Q psy4833           6 LRVAVVRGQVLS-VQGLGITGIRVSVDKDSGRFGFTLTRGGGWFDMLVNGG   55 (141)
Q Consensus         6 ~r~~~IrG~V~D-~~GePL~GVsV~Vkgt~~~~G~TiTD~DG~FsL~V~~g   55 (141)
                      ....+|+|.|.+ +.|..+   .|.+.|..  +. +..+.+|.|++.+.+.
T Consensus       264 ~~~~t~sg~v~~~~~g~~v---~v~v~g~~--~~-t~v~~~g~~s~~v~~~  308 (317)
T PF13753_consen  264 NTTVTISGTVTDAEAGDTV---TVTVNGNT--YT-TTVDADGTWSVTVTGA  308 (317)
T ss_pred             cccEEEecccCCCCCCCEE---EEEECCEE--EE-EEEeCCCeEEEEcCHH
Confidence            346789999876 357764   46777655  33 5666799999998743


No 78 
>COG4594 FecB ABC-type Fe3+-citrate transport system, periplasmic component [Inorganic ion transport and metabolism]
Probab=29.45  E-value=34  Score=30.06  Aligned_cols=33  Identities=15%  Similarity=0.084  Sum_probs=26.9

Q ss_pred             eceEecCCceEEEEecCCCcEEEEEEecCccce
Q psy4833          38 GFTLTRGGGWFDMLVNGGGAVTLQFQRSPYRPQ   70 (141)
Q Consensus        38 G~TiTD~DG~FsL~V~~g~~~tL~FSyiGY~sq   70 (141)
                      ..|+.|..|.|.+.-.+...++|+.||+-+..+
T Consensus        33 ~~tVkde~Gt~tv~k~PKRVVVLE~SFaDaLaa   65 (310)
T COG4594          33 AHTVKDELGTFTVPKTPKRVVVLELSFADALAA   65 (310)
T ss_pred             eeeeeccCCceecCCCCceEEEEEecHHHHHHH
Confidence            358999999999987666677899999877543


No 79 
>PRK15213 fimbrial outer membrane usher protein PefC; Provisional
Probab=29.38  E-value=2.1e+02  Score=27.65  Aligned_cols=51  Identities=16%  Similarity=0.100  Sum_probs=31.7

Q ss_pred             cCCcceEEEEEEEECCCCCeee-eEEEEeccCCCeeeceEecCCceEEEE-ecCCC
Q psy4833           3 DDELRVAVVRGQVLSVQGLGIT-GIRVSVDKDSGRFGFTLTRGGGWFDML-VNGGG   56 (141)
Q Consensus         3 ~d~~r~~~IrG~V~D~~GePL~-GVsV~Vkgt~~~~G~TiTD~DG~FsL~-V~~g~   56 (141)
                      |+-.+...+-=++.|++|+||| |+.|...+.. . - .+...||.==|. ++..+
T Consensus       718 F~~~~g~~~ll~l~~~~G~~lP~Ga~v~~~~g~-~-~-g~Vg~~G~vyl~gl~~~~  770 (797)
T PRK15213        718 FKALKVLRYILRVKQKDGRFVPGGSWARNEQGT-P-L-GFVANNGVLLMNLLDAPG  770 (797)
T ss_pred             EEEEeeeEEEEEEEccCCCcCCCCeEEECCCCc-E-E-EEEcCCCEEEEEcCCCCC
Confidence            5555566666678899999987 7767543322 2 2 466778885554 44444


No 80 
>PRK07121 hypothetical protein; Validated
Probab=28.52  E-value=28  Score=30.68  Aligned_cols=15  Identities=40%  Similarity=0.711  Sum_probs=13.3

Q ss_pred             EEEECCCCCeeeeEE
Q psy4833          13 GQVLSVQGLGITGIR   27 (141)
Q Consensus        13 G~V~D~~GePL~GVs   27 (141)
                      ++|+|.+|+||+|.=
T Consensus       439 ~qVld~~g~pI~GLY  453 (492)
T PRK07121        439 GEVLRADGAPIPGLY  453 (492)
T ss_pred             ceEECCCCCCcCceE
Confidence            899999999999953


No 81 
>TIGR02485 CobZ_N-term precorrin 3B synthase CobZ. CobZ is essential for cobalamin biosynthesis (by knockout of the R. capsulatus gene ) and is complemented by the characterized precorrin 3B synthase CobG. The enzyme has been shown to contain flavin, heme and Fe-S cluster cofactors and is believed to require dioxygen as a substrate. This model identifies the N-terminal portion of the R. capsulatus gene which, in other species exists as a separate protein. The C-terminal portion is homologous to the 2-component signal transduction system protein CitB (TIGR02484).
Probab=28.24  E-value=27  Score=30.14  Aligned_cols=16  Identities=6%  Similarity=0.067  Sum_probs=13.7

Q ss_pred             EEEEEECCCCCeeeeE
Q psy4833          11 VRGQVLSVQGLGITGI   26 (141)
Q Consensus        11 IrG~V~D~~GePL~GV   26 (141)
                      -+++|+|++|+||+|-
T Consensus       375 ~~~~Vl~~~g~~I~GL  390 (432)
T TIGR02485       375 ATARVRLNDAVAPDNL  390 (432)
T ss_pred             CCceEECCCCCCCCCe
Confidence            4689999999999984


No 82 
>PF06030 DUF916:  Bacterial protein of unknown function (DUF916);  InterPro: IPR010317 This family consists of putative cell surface proteins, from Firmicutes, of unknown function. 
Probab=27.23  E-value=1.4e+02  Score=22.24  Aligned_cols=38  Identities=24%  Similarity=0.255  Sum_probs=25.5

Q ss_pred             ecC-CceEEEEecCCCcEEEEEEecCccceEEEEEccCc
Q psy4833          42 TRG-GGWFDMLVNGGGAVTLQFQRSPYRPQTRTVFIPWN   79 (141)
Q Consensus        42 TD~-DG~FsL~V~~g~~~tL~FSyiGY~sqe~~V~~~~n   79 (141)
                      .|. -|.|+|.+.++...+|++.----..+++++.+..+
T Consensus        12 ~~~~~~YFdL~~~P~q~~~l~v~i~N~s~~~~tv~v~~~   50 (121)
T PF06030_consen   12 IDKNVSYFDLKVKPGQKQTLEVRITNNSDKEITVKVSAN   50 (121)
T ss_pred             cCCCCCeEEEEeCCCCEEEEEEEEEeCCCCCEEEEEEEe
Confidence            355 49999999877766676666555566666655533


No 83 
>PRK15235 outer membrane fimbrial usher protein SefC; Provisional
Probab=27.02  E-value=1.6e+02  Score=28.55  Aligned_cols=28  Identities=11%  Similarity=0.064  Sum_probs=17.1

Q ss_pred             CCcceEEEEEEEECCCCCeee-eEEEEec
Q psy4833           4 DELRVAVVRGQVLSVQGLGIT-GIRVSVD   31 (141)
Q Consensus         4 d~~r~~~IrG~V~D~~GePL~-GVsV~Vk   31 (141)
                      +-.+...+-=++.+++|+||| |+.|.++
T Consensus       723 ~~~~g~~~ll~l~~~~G~~lPfGa~V~~~  751 (814)
T PRK15235        723 KTSVGTNALIRITRTNGKPLALGTVLSLK  751 (814)
T ss_pred             EEeeeeEEEEEEECCCCCcCCCceEEEec
Confidence            333344444467888999887 6656543


No 84 
>PRK15223 pilin outer membrane usher protein SafC; Provisional
Probab=24.82  E-value=5e+02  Score=25.28  Aligned_cols=58  Identities=17%  Similarity=0.152  Sum_probs=33.8

Q ss_pred             cCCcceEEEEEEEECCCCCeee-eEEEEeccCCCeeeceEecCCceEEEE-ecCCCcEEEEEEe
Q psy4833           3 DDELRVAVVRGQVLSVQGLGIT-GIRVSVDKDSGRFGFTLTRGGGWFDML-VNGGGAVTLQFQR   64 (141)
Q Consensus         3 ~d~~r~~~IrG~V~D~~GePL~-GVsV~Vkgt~~~~G~TiTD~DG~FsL~-V~~g~~~tL~FSy   64 (141)
                      |+-.+...+-=++.+++|+||| |+.|...+.... - .+...+|.-=|. ++..+  +|.+..
T Consensus       749 F~~~~g~~~ll~l~~~~G~~lP~Ga~v~~~~~g~~-~-g~Vg~~G~vyl~g~~~~~--~l~v~w  808 (836)
T PRK15223        749 FHTRIGARALMTLKREDGSAIPFGAQVTVNGQDGS-A-ALVDTDSQVYLTGLADKG--ELTVKW  808 (836)
T ss_pred             EEEeeeEEEEEEEECCCCCcCCCCeEEEecCCCce-E-EEECCCCEEEEEccCCCc--eEEEEE
Confidence            4444445555567889999987 777765432102 2 466778886665 34333  354444


No 85 
>PRK12837 3-ketosteroid-delta-1-dehydrogenase; Provisional
Probab=24.56  E-value=39  Score=30.28  Aligned_cols=16  Identities=31%  Similarity=0.553  Sum_probs=13.7

Q ss_pred             EEEEEECCCCCeeeeE
Q psy4833          11 VRGQVLSVQGLGITGI   26 (141)
Q Consensus        11 IrG~V~D~~GePL~GV   26 (141)
                      -.++|+|.+|+||+|.
T Consensus       457 ~~~qVl~~~g~pIpGL  472 (513)
T PRK12837        457 TAARVLDTDGRPIPGL  472 (513)
T ss_pred             CCceEECCCCCEeCCc
Confidence            3579999999999994


No 86 
>PF03404 Mo-co_dimer:  Mo-co oxidoreductase dimerisation domain;  InterPro: IPR005066 The majority of molybdenum-containing enzymes utilise a molybdenum cofactor (MoCF or Moco) consisting of a Mo atom coordinated via a cis-dithiolene moiety to molybdopterin (MPT). MoCF is ubiquitous in nature, and the pathway for MoCF biosynthesis is conserved in all three domains of life. MoCF-containing enzymes function as oxidoreductases in carbon, nitrogen, and sulphur metabolism [, ].  In Escherichia coli, biosynthesis of MoCF is a three stage process. It begins with the MoaA and MoaC conversion of GTP to the meta-stable pterin intermediate precursor Z. The second stage involves MPT synthase (MoaD and MoaE), which converts precursor Z to MPT; MoeB is involved in the recycling of MPT synthase. The final step in MoCF synthesis is the attachment of mononuclear Mo to MPT, a process that requires MoeA and which is enhanced by MogA in an Mg2 ATP-dependent manner []. MoCF is the active co-factor in eukaryotic and some prokaryotic molybdo-enzymes, but the majority of bacterial enzymes requiring MoCF, need a modification of MTP for it to be active; MobA is involved in the attachment of a nucleotide monophosphate to MPT resulting in the MGD co-factor, the active co-factor for most prokaryotic molybdo-enzymes. Bacterial two-hybrid studies have revealed the close interactions between MoeA, MogA, and MobA in the synthesis of MoCF []. Moreover the close functional association of MoeA and MogA in the synthesis of MoCF is supported by fact that the known eukaryotic homologues to MoeA and MogA exist as fusion proteins: CNX1 (Q39054 from SWISSPROT) of Arabidopsis thaliana (Mouse-ear cress), mammalian Gephryin (e.g. Q9NQX3 from SWISSPROT) and Drosophila melanogaster (Fruit fly) Cinnamon (P39205 from SWISSPROT) []. This domain is found in molybdopterin cofactor oxidoreductases, such as in the C-terminal of Mo-containing sulphite oxidase, which catalyses the conversion of sulphite to sulphate, the terminal step in the oxidative degradation of cysteine and methionine []. This domain is involved in dimer formation, and has an Ig-fold structure [].; GO: 0016491 oxidoreductase activity, 0030151 molybdenum ion binding, 0055114 oxidation-reduction process; PDB: 2C9X_A 2CA3_A 2BLF_A 2CA4_A 2BPB_A 2XTS_C 2BII_A 2BIH_A 1OGP_A 2A9A_B ....
Probab=24.30  E-value=3.1e+02  Score=20.55  Aligned_cols=40  Identities=28%  Similarity=0.266  Sum_probs=27.8

Q ss_pred             ceEEEEEEEECCCCCeeeeEEEEeccCCCeeeceEecCCc
Q psy4833           7 RVAVVRGQVLSVQGLGITGIRVSVDKDSGRFGFTLTRGGG   46 (141)
Q Consensus         7 r~~~IrG~V~D~~GePL~GVsV~Vkgt~~~~G~TiTD~DG   46 (141)
                      ..++|+|.-.+..|.++..|-|++.|..+=.-..+....+
T Consensus        27 ~~v~i~G~A~~g~g~~I~rVEVS~DgG~tW~~A~l~~~~~   66 (131)
T PF03404_consen   27 GTVTIRGYAWSGGGRGIARVEVSTDGGKTWQEATLDGPES   66 (131)
T ss_dssp             EEEEEEEEEE-STT--EEEEEEESSTTSSEEE-EEESTSC
T ss_pred             cEEEEEEEEEeCCCcceEEEEEEeCCCCCcEEeEeccCCC
Confidence            5789999999999999999999999765311236666666


No 87 
>PRK08274 tricarballylate dehydrogenase; Validated
Probab=24.15  E-value=35  Score=29.61  Aligned_cols=16  Identities=13%  Similarity=0.075  Sum_probs=13.4

Q ss_pred             EEEEEECCCCCeeeeE
Q psy4833          11 VRGQVLSVQGLGITGI   26 (141)
Q Consensus        11 IrG~V~D~~GePL~GV   26 (141)
                      .+|+|+|.+|+||+|.
T Consensus       406 ~~~~vl~~~g~~I~GL  421 (466)
T PRK08274        406 EDARVRFADGRPSPNL  421 (466)
T ss_pred             CCceEECCCCCCCCCc
Confidence            3588999999999985


No 88 
>PRK15316 RatA-like protein; Provisional
Probab=22.44  E-value=2.2e+02  Score=31.67  Aligned_cols=55  Identities=11%  Similarity=0.109  Sum_probs=0.0

Q ss_pred             cceEEEEEEEECCCCCeeeeEEEEeccCCCe------------------------------------eeceEecCCceEE
Q psy4833           6 LRVAVVRGQVLSVQGLGITGIRVSVDKDSGR------------------------------------FGFTLTRGGGWFD   49 (141)
Q Consensus         6 ~r~~~IrG~V~D~~GePL~GVsV~Vkgt~~~------------------------------------~G~TiTD~DG~Fs   49 (141)
                      .+.+.|+=+|.|.+|.|++|+.|.++... .                                    +. -.||.||.-.
T Consensus      2160 GEsiPLTVTVKD~~GnPv~nva~tLkRg~-A~nR~g~~~~~~~~ddm~~~~~tP~~at~~L~~~~~t~~-G~TgsDGtat 2237 (2683)
T PRK15316       2160 GEMLPIVVTIKDANGNPVGGEGVTLKRVQ-AKSRSGISVSSNTVDDLILDEVTPTSARISFNQNTSAWS-GFTGSDGTIT 2237 (2683)
T ss_pred             CCceeEEEEEEcCCCCCcCCceEEEeecc-ccccCCccccccccccceeccccccceeEEeccCcceEe-eeeCCCceEE


Q ss_pred             EEecCCCcEEEEEEecCccc
Q psy4833          50 MLVNGGGAVTLQFQRSPYRP   69 (141)
Q Consensus        50 L~V~~g~~~tL~FSyiGY~s   69 (141)
                      |.+...+.       +|+++
T Consensus      2238 ftvtQpnt-------~GLKT 2250 (2683)
T PRK15316       2238 FNVTQNNT-------VGLVT 2250 (2683)
T ss_pred             EEEecCCC-------Ccccc


No 89 
>PRK15298 fimbrial outer membrane usher protein StiC; Provisional
Probab=22.34  E-value=3.1e+02  Score=26.83  Aligned_cols=45  Identities=16%  Similarity=0.133  Sum_probs=24.2

Q ss_pred             cceEEEEEEEECCCCCeee-eEEEEeccCCCeeeceEecCCceEEEE
Q psy4833           6 LRVAVVRGQVLSVQGLGIT-GIRVSVDKDSGRFGFTLTRGGGWFDML   51 (141)
Q Consensus         6 ~r~~~IrG~V~D~~GePL~-GVsV~Vkgt~~~~G~TiTD~DG~FsL~   51 (141)
                      .+...+-=+|.|++|+||| |+.|.+.+.....- .+...||.==|.
T Consensus       763 ~~g~~~ll~l~~~~G~~lP~Ga~v~~~~~~~~~~-g~Vg~~G~vyl~  808 (848)
T PRK15298        763 HKGYRVLLTLTRSNGEPVPFGATASVDGQDANLA-SIVGDKGQVFLS  808 (848)
T ss_pred             EeeEEEEEEEECCCCCcCCCCcEEEEcCCCCeeE-EEECCCCEEEEE
Confidence            3333333457788999886 66666532210111 455667774444


No 90 
>TIGR01813 flavo_cyto_c flavocytochrome c. This model describes a family of redox proteins related to the succinate dehydrogenases and fumarate reductases of E. coli, mitochondria, and other well-characterized systems. A member of this family from Shewanella frigidimarina NCIMB400 is characterized as a water-soluble periplasmic protein with four heme groups, a non-covalently bound FAD, and essentially unidirectional fumarate reductase activity. At least seven distinct members of this family are found in Shewanella oneidensis, a species able to use a wide variety of pathways for respiraton.
Probab=22.01  E-value=45  Score=28.64  Aligned_cols=17  Identities=35%  Similarity=0.565  Sum_probs=13.6

Q ss_pred             EEEEEECCCCCeeeeEE
Q psy4833          11 VRGQVLSVQGLGITGIR   27 (141)
Q Consensus        11 IrG~V~D~~GePL~GVs   27 (141)
                      -.++|+|.+|+||+|.=
T Consensus       391 ~~~~vl~~~g~~IpGLy  407 (439)
T TIGR01813       391 TKAEVLDAQGKPIPGLF  407 (439)
T ss_pred             CCCeEECCCCCEecccE
Confidence            45788999999999953


No 91 
>PRK06481 fumarate reductase flavoprotein subunit; Validated
Probab=21.05  E-value=47  Score=29.71  Aligned_cols=16  Identities=38%  Similarity=0.551  Sum_probs=13.4

Q ss_pred             EEEEEECCCCCeeeeE
Q psy4833          11 VRGQVLSVQGLGITGI   26 (141)
Q Consensus        11 IrG~V~D~~GePL~GV   26 (141)
                      -.++|+|++|+||+|-
T Consensus       449 ~~~qVld~~g~pI~GL  464 (506)
T PRK06481        449 TNTEVLKKDGSPITGL  464 (506)
T ss_pred             CCceEEcCCCCEeCCe
Confidence            3578999999999995


No 92 
>PF09624 DUF2393:  Protein of unknown function (DUF2393);  InterPro: IPR013417  The function of this protein is unknown. It is always found as part of a two-gene operon with IPR013416 from INTERPRO, a protein that appears to span the membrane seven times. It has so far been found in the bacteria Anabaena sp. (strain PCC 7120), Agrobacterium tumefaciens, Rhizobium meliloti, and Gloeobacter violaceus.
Probab=20.90  E-value=1.8e+02  Score=21.59  Aligned_cols=26  Identities=19%  Similarity=0.239  Sum_probs=22.7

Q ss_pred             cceEEEEEEEECCCCCeeeeEEEEec
Q psy4833           6 LRVAVVRGQVLSVQGLGITGIRVSVD   31 (141)
Q Consensus         6 ~r~~~IrG~V~D~~GePL~GVsV~Vk   31 (141)
                      +-...|.|+|+...+.++-+|.|.++
T Consensus        61 ~~~~~v~g~V~N~g~~~i~~c~i~~~   86 (149)
T PF09624_consen   61 SESFYVDGTVTNTGKFTIKKCKITVK   86 (149)
T ss_pred             ccEEEEEEEEEECCCCEeeEEEEEEE
Confidence            34678999999999999999999886


No 93 
>PRK15273 putative fimbrial outer membrane usher protein SteB; Provisional
Probab=20.90  E-value=2.5e+02  Score=27.62  Aligned_cols=36  Identities=14%  Similarity=0.016  Sum_probs=22.3

Q ss_pred             EEEECCCCCeee-eEEEEeccCCCeeeceEecCCceEEEE
Q psy4833          13 GQVLSVQGLGIT-GIRVSVDKDSGRFGFTLTRGGGWFDML   51 (141)
Q Consensus        13 G~V~D~~GePL~-GVsV~Vkgt~~~~G~TiTD~DG~FsL~   51 (141)
                      =+|.+++|+||| |+.|...+.. . - .+...+|.==|.
T Consensus       746 ~~l~~~dG~~lPfGa~V~~~~g~-~-~-giVg~~G~vyl~  782 (881)
T PRK15273        746 AVFRLADGDFPPFGAEVKNERQQ-Q-L-GLVADDGNAWLA  782 (881)
T ss_pred             EEEEccCCCcCCCceEEEcCCCc-E-E-EEEcCCCEEEEe
Confidence            356788999987 7777644333 2 2 355667774444


No 94 
>PRK12834 putative FAD-binding dehydrogenase; Reviewed
Probab=20.85  E-value=43  Score=30.18  Aligned_cols=16  Identities=25%  Similarity=0.588  Sum_probs=13.6

Q ss_pred             EEEEEECCCCCeeeeE
Q psy4833          11 VRGQVLSVQGLGITGI   26 (141)
Q Consensus        11 IrG~V~D~~GePL~GV   26 (141)
                      -.++|+|.+|+||+|.
T Consensus       492 ~~~qVld~dg~pIpGL  507 (549)
T PRK12834        492 LDSRVLGADGTPLPGL  507 (549)
T ss_pred             CCCceeCCCCCEeCCe
Confidence            3478999999999995


No 95 
>COG3549 HigB Plasmid maintenance system killer protein [General function prediction only]
Probab=20.73  E-value=1.4e+02  Score=22.12  Aligned_cols=23  Identities=22%  Similarity=0.188  Sum_probs=18.5

Q ss_pred             cCCceEEEEecCCCcEEEEEEecCc
Q psy4833          43 RGGGWFDMLVNGGGAVTLQFQRSPY   67 (141)
Q Consensus        43 D~DG~FsL~V~~g~~~tL~FSyiGY   67 (141)
                      |.+|.|+|-|+.  ...|.|-+-|=
T Consensus        60 d~~g~ySIrvN~--QwrLiF~~~~~   82 (94)
T COG3549          60 DLKGLYSIRVNQ--QWRLIFRWEGG   82 (94)
T ss_pred             CccCcEEEEECC--EEEEEEEeCCC
Confidence            789999999983  34698988774


No 96 
>PF00207 A2M:  Alpha-2-macroglobulin family;  InterPro: IPR001599 This entry contains serum complement C3 and C4 precursors and alpha-macrogrobulins.  The alpha-macroglobulin (aM) family of proteins includes protease inhibitors [], typified by the human tetrameric a2-macroglobulin (a2M); they belong to the MEROPS proteinase inhibitor family I39, clan IL. These protease inhibitors share several defining properties, which include (i) the ability to inhibit proteases from all catalytic classes, (ii) the presence of a 'bait region' and a thiol ester, (iii) a similar protease inhibitory mechanism and (iv) the inactivation of the inhibitory capacity by reaction of the thiol ester with small primary amines. aM protease inhibitors inhibit by steric hindrance []. The mechanism involves protease cleavage of the bait region, a segment of the aM that is particularly susceptible to proteolytic cleavage, which initiates a conformational change such that the aM collapses about the protease. In the resulting aM-protease complex, the active site of the protease is sterically shielded, thus substantially decreasing access to protein substrates. Two additional events occur as a consequence of bait region cleavage, namely (i) the h-cysteinyl-g-glutamyl thiol ester becomes highly reactive and (ii) a major conformational change exposes a conserved COOH-terminal receptor binding domain [] (RBD). RBD exposure allows the aM protease complex to bind to clearance receptors and be removed from circulation []. Tetrameric, dimeric, and, more recently, monomeric aM protease inhibitors have been identified [, ].; GO: 0004866 endopeptidase inhibitor activity; PDB: 3KLS_B 3PRX_C 3KM9_B 3PVM_C 3CU7_A 4E0S_A 4A5W_A 2PN5_A 3FRP_G 3HRZ_B ....
Probab=20.70  E-value=2.6e+02  Score=19.00  Aligned_cols=51  Identities=18%  Similarity=0.151  Sum_probs=28.4

Q ss_pred             eEecCCceEEEEe--cCCCcEEEEEEecCccceEEEEEccCc-cEEEeCCeEEEec
Q psy4833          40 TLTRGGGWFDMLV--NGGGAVTLQFQRSPYRPQTRTVFIPWN-QIVVLPPVKMQLS   92 (141)
Q Consensus        40 TiTD~DG~FsL~V--~~g~~~tL~FSyiGY~sqe~~V~~~~n-~~~~~~~v~m~~~   92 (141)
                      .++|.+|+-.+..  |..- .+.+++.+++.... .+..... ++.+..++.|.+.
T Consensus         8 ~~~d~~G~~~~~~~lPd~i-t~w~v~a~a~s~~~-~~g~~~~~~~~v~~p~~i~~~   61 (92)
T PF00207_consen    8 VVTDANGKATFSFTLPDSI-TSWRVTAFAVSPTG-GFGIAEPPEITVFKPFFIQLN   61 (92)
T ss_dssp             EECCTTEEEEEEEE-SSSS-SEEEEEEEEEETTT-EEEEECCEEEEEB-SEEEEEE
T ss_pred             eccCCCCEEEEEEECCCCc-cEEEEEEEEECCCC-cceEecceEEEEEeeEEEEcC
Confidence            5677888877764  4332 24666666665431 3444444 5666666666654


No 97 
>COG0196 RibF FAD synthase [Coenzyme metabolism]
Probab=20.31  E-value=1.6e+02  Score=25.60  Aligned_cols=57  Identities=26%  Similarity=0.399  Sum_probs=39.8

Q ss_pred             ceEEEEEEEECCC--C--CeeeeEEEEeccCCCeeeceEecCCceEEEEecCCCcEEEEEEecCccce
Q psy4833           7 RVAVVRGQVLSVQ--G--LGITGIRVSVDKDSGRFGFTLTRGGGWFDMLVNGGGAVTLQFQRSPYRPQ   70 (141)
Q Consensus         7 r~~~IrG~V~D~~--G--ePL~GVsV~Vkgt~~~~G~TiTD~DG~FsL~V~~g~~~tL~FSyiGY~sq   70 (141)
                      |...|+|+|+..+  |  ...|=|++..+...      +. ..|-|...+...+..---++.+||.+.
T Consensus       184 ~py~i~G~Vv~G~~~Gr~lGfPTaNi~~~~~~------~~-~~GVYav~v~~~~~~~~gv~~vG~rpt  244 (304)
T COG0196         184 RPYSIEGKVVHGQKLGRTLGFPTANIYLKDNV------LP-AFGVYAVRVKLDGGVYPGVANVGYRPT  244 (304)
T ss_pred             CCeEEEEEEEcccccccccCCCcccccccccc------cc-CCeeEEEEEEECCccccceEEeccccc
Confidence            6678999998854  5  67788888887554      22 799999987632211256688888754


No 98 
>KOG1691|consensus
Probab=20.26  E-value=5.2e+02  Score=21.67  Aligned_cols=72  Identities=22%  Similarity=0.350  Sum_probs=42.0

Q ss_pred             EEEE--EEECCCCCeeeeEEEEeccCCCeeeceEecC----CceEEEEecCCCcEEEEEEe--cCccceE-EEEEccCcc
Q psy4833          10 VVRG--QVLSVQGLGITGIRVSVDKDSGRFGFTLTRG----GGWFDMLVNGGGAVTLQFQR--SPYRPQT-RTVFIPWNQ   80 (141)
Q Consensus        10 ~IrG--~V~D~~GePL~GVsV~Vkgt~~~~G~TiTD~----DG~FsL~V~~g~~~tL~FSy--iGY~sqe-~~V~~~~n~   80 (141)
                      .+.|  .+.|..|.-.+-.+|.|....   |-+.-.+    +|+|++.+...+.....|.-  .|..+.. +.|.+.|-.
T Consensus        44 lv~g~y~i~~~~~~~~~~~~~~Vts~~---G~~~~~~env~~gqFaFta~e~~~y~~Cf~~~~~~~~p~~~~~I~ld~k~  120 (210)
T KOG1691|consen   44 LVVGDYEIINPNGDHSHKLSVKVTSPY---GNNLHSKENVTKGQFAFTAEESGMYEACFTADVPGHKPETKRSIDLDWKT  120 (210)
T ss_pred             EEEEEEEEecCCCCccceEEEEEEcCC---CceeehhhccccceEEEEeccCCcEEEEEecccCCCCCCcceEEEEEeec
Confidence            3445  567765443255666664322   2222111    89999998766666677777  7777654 566666655


Q ss_pred             EEEe
Q psy4833          81 IVVL   84 (141)
Q Consensus        81 ~~~~   84 (141)
                      -+..
T Consensus       121 Gv~a  124 (210)
T KOG1691|consen  121 GVEA  124 (210)
T ss_pred             cccc
Confidence            4443


No 99 
>PRK15304 putative fimbrial outer membrane usher protein; Provisional
Probab=20.14  E-value=2.9e+02  Score=26.66  Aligned_cols=46  Identities=17%  Similarity=0.069  Sum_probs=23.8

Q ss_pred             cCCcceEEEEEEEECCCCCeee-eEEEEeccCCCeeeceEecCCceEEEE
Q psy4833           3 DDELRVAVVRGQVLSVQGLGIT-GIRVSVDKDSGRFGFTLTRGGGWFDML   51 (141)
Q Consensus         3 ~d~~r~~~IrG~V~D~~GePL~-GVsV~Vkgt~~~~G~TiTD~DG~FsL~   51 (141)
                      |+-.+...+-=++.|++|+||| |+.|...+.. . - .+...+|.-=|.
T Consensus       712 F~~~~g~~~l~~l~~~~G~~lPfGa~V~~~~g~-~-~-g~Vg~~G~vyl~  758 (801)
T PRK15304        712 FSVLNQRRVMLNVKRADGSPLPKGVSIVDEKGN-Y-L-TSAVDDGRVFLN  758 (801)
T ss_pred             EEEccceEEEEEEEccCCCcCCCcEEEECCCCc-E-E-EEEeCCCEEEEE
Confidence            3444444444467788888886 5555432222 2 1 234445665444


Done!