Query         psy5364
Match_columns 78
No_of_seqs    107 out of 383
Neff          6.9 
Searched_HMMs 46136
Date          Sat Aug 17 00:41:52 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy5364.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/5364hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG0518|consensus               99.7 6.6E-18 1.4E-22  133.7   8.2   77    1-78    724-801 (1113)
  2 KOG0518|consensus               99.7 2.6E-17 5.6E-22  130.5   7.3   64    2-78   1049-1112(1113)
  3 smart00557 IG_FLMN Filamin-typ  99.0 7.4E-10 1.6E-14   66.6   5.8   43   34-77      1-43  (93)
  4 smart00557 IG_FLMN Filamin-typ  98.9 2.6E-09 5.7E-14   64.2   4.6   29    2-30     64-92  (93)
  5 PF00630 Filamin:  Filamin/ABP2  98.8 9.7E-09 2.1E-13   61.4   4.5   45   32-77      2-53  (101)
  6 COG4856 Uncharacterized protei  84.1     5.5 0.00012   30.1   6.4   53    8-71     52-109 (403)
  7 PF00927 Transglut_C:  Transglu  81.0     8.8 0.00019   22.8   5.6   38    9-46      2-50  (107)
  8 PF10670 DUF4198:  Domain of un  72.4      24 0.00052   23.0   6.7   63    7-75    152-214 (215)
  9 TIGR03769 P_ac_wall_RPT actino  71.7     5.2 0.00011   20.6   2.4   20    3-22     10-29  (41)
 10 PF14347 DUF4399:  Domain of un  70.1     4.3 9.4E-05   24.2   2.1   22    3-24     59-81  (87)
 11 TIGR02148 Fibro_Slime fibro-sl  69.2     6.6 0.00014   23.8   2.8   26    3-28     60-86  (90)
 12 PF08770 SoxZ:  Sulphur oxidati  69.1      23  0.0005   21.5   5.8   25   52-77     67-91  (100)
 13 PF08400 phage_tail_N:  Prophag  68.8     3.1 6.8E-05   26.9   1.4   17    1-17     51-67  (134)
 14 PF13954 PapC_N:  PapC N-termin  68.6     6.9 0.00015   24.9   3.0   21    2-22     26-46  (146)
 15 PRK15275 putative fimbrial pro  67.6      27 0.00058   23.1   5.7   62    8-75     25-88  (166)
 16 PF07495 Y_Y_Y:  Y_Y_Y domain;   60.3     8.9 0.00019   20.5   2.0   14    2-15     35-48  (66)
 17 cd08548 Type_I_cohesin_like Ty  58.0      45 0.00097   21.0   5.9   34    4-41     11-48  (135)
 18 PRK10301 hypothetical protein;  57.3      17 0.00038   22.8   3.2   25    3-28     96-123 (124)
 19 PF09906 DUF2135:  Uncharacteri  53.9      28 0.00061   18.5   3.2   18    4-21      2-19  (50)
 20 PF09430 DUF2012:  Protein of u  47.7      59  0.0013   20.1   4.5   38    3-42     37-76  (123)
 21 PRK15286 putative minor fimbri  46.9      77  0.0017   21.2   5.2   62    7-76     28-93  (170)
 22 TIGR03517 GldM_gliding gliding  46.5      65  0.0014   25.3   5.4   24   22-46    340-363 (523)
 23 PF09087 Cyc-maltodext_N:  Cycl  46.2      58  0.0013   19.5   4.1   41   35-75     28-72  (88)
 24 PF10029 DUF2271:  Predicted pe  44.0      40 0.00086   21.6   3.4   26    4-29     75-103 (139)
 25 PRK15311 putative fimbrial pro  43.8 1.4E+02   0.003   22.5   6.7   67    6-76    277-356 (359)
 26 PF12080 GldM_C:  GldM C-termin  41.5 1.1E+02  0.0023   20.4   5.9   34    9-46      6-41  (181)
 27 PF01483 P_proprotein:  Proprot  41.2      46   0.001   18.9   3.1   22   55-78      7-28  (87)
 28 PF07523 Big_3:  Bacterial Ig-l  39.9      30 0.00065   18.9   2.1   14    3-16     45-58  (67)
 29 PF05359 DUF748:  Domain of Unk  39.8      90   0.002   19.2   4.6   52   11-71     14-65  (151)
 30 PF12863 DUF3821:  Domain of un  36.5      69  0.0015   22.1   3.8   30   49-78    103-142 (209)
 31 PF10632 He_PIG_assoc:  He_PIG   36.0      31 0.00067   16.7   1.4    9   20-28     13-21  (29)
 32 PF08141 SspH:  Small acid-solu  35.8      36 0.00077   18.9   1.9   12    8-19     14-25  (58)
 33 PF08308 PEGA:  PEGA domain;  I  35.6      36 0.00078   18.5   2.0   15    3-17     32-46  (71)
 34 PRK15315 outer membrane protei  35.0 2.2E+02  0.0048   25.9   7.1   63    8-77    229-299 (1865)
 35 PF05688 DUF824:  Salmonella re  34.5      36 0.00078   18.1   1.7   16   15-30     25-40  (47)
 36 TIGR03503 conserved hypothetic  34.3      50  0.0011   24.8   3.0   28    2-29    274-301 (374)
 37 PF02019 WIF:  WIF domain;  Int  32.4      63  0.0014   20.7   2.9   65    3-71     37-108 (132)
 38 PF09898 DUF2125:  Uncharacteri  32.3 1.4E+02   0.003   21.0   4.9   19   11-29     25-43  (312)
 39 PRK03174 sspH acid-soluble spo  32.0      39 0.00085   18.9   1.7   11    9-19     15-25  (59)
 40 COG3459 Cellobiose phosphoryla  31.6 2.5E+02  0.0054   24.2   6.7   60    3-76    997-1056(1056)
 41 PF07162 B9-C2:  Ciliary basal   31.3      55  0.0012   21.3   2.6   22    1-23    101-122 (168)
 42 TIGR02861 SASP_H small acid-so  31.1      41 0.00088   18.7   1.6   11    9-19     15-25  (58)
 43 PF04571 Lipin_N:  lipin, N-ter  30.5      78  0.0017   19.9   3.0   20   10-29     31-50  (110)
 44 PF13584 BatD:  Oxygen toleranc  30.3      78  0.0017   23.7   3.5   31    1-31     80-113 (484)
 45 PRK01625 sspH acid-soluble spo  30.2      43 0.00094   18.7   1.7   11    9-19     15-25  (59)
 46 KOG0416|consensus               27.6      66  0.0014   21.9   2.5   24   55-78     19-42  (189)
 47 PF10528 PA14_2:  GLEYA domain;  27.4      59  0.0013   20.1   2.1   18   61-78     80-97  (113)
 48 PRK15273 putative fimbrial out  27.2      68  0.0015   26.5   2.9   18    3-20     41-58  (881)
 49 PF05949 DUF881:  Bacterial pro  27.1      83  0.0018   20.2   2.8   18   11-29     80-97  (149)
 50 TIGR03773 anch_rpt_wall putati  26.6 1.1E+02  0.0024   24.0   3.8   32    3-34    408-445 (513)
 51 COG4856 Uncharacterized protei  26.0 1.4E+02  0.0031   22.8   4.2   43   36-78    345-387 (403)
 52 PF13600 DUF4140:  N-terminal d  25.9      89  0.0019   18.4   2.6   13   31-43     34-46  (104)
 53 PRK15298 fimbrial outer membra  25.4      77  0.0017   26.1   2.9   20    3-22     61-80  (848)
 54 PRK15255 fimbrial outer membra  25.1      78  0.0017   25.9   2.9   19    3-21     48-66  (829)
 55 PRK15314 outer membrane protei  25.0 4.2E+02   0.009   25.0   7.1   60   14-78    767-838 (2435)
 56 PF11797 DUF3324:  Protein of u  24.6      65  0.0014   20.3   2.0   14    2-15    102-115 (140)
 57 PRK15193 outer membrane usher   24.5      81  0.0017   26.1   2.9   22    3-24     63-84  (876)
 58 PF13620 CarboxypepD_reg:  Carb  24.4      95  0.0021   16.9   2.5   14    3-16     45-58  (82)
 59 PF08695 Coa1:  Cytochrome oxid  24.2      88  0.0019   18.8   2.4   15   63-77     66-80  (116)
 60 PRK15198 outer membrane usher   24.0      83  0.0018   25.9   2.8   24    3-26     54-77  (860)
 61 PRK15207 long polar fimbrial o  23.4      88  0.0019   25.7   2.9   24    3-28     62-85  (842)
 62 PF14900 DUF4493:  Domain of un  23.2      62  0.0013   22.0   1.7   24    2-25     61-86  (235)
 63 PRK15248 fimbrial outer membra  22.8 1.2E+02  0.0027   24.9   3.6   24    3-26     53-77  (853)
 64 KOG4680|consensus               22.0      79  0.0017   20.9   1.9   23    4-26     69-91  (153)
 65 PRK15284 putative fimbrial out  21.8      99  0.0022   25.6   2.9   18    3-20     59-76  (881)
 66 PF03422 CBM_6:  Carbohydrate b  21.8      67  0.0014   19.1   1.5   15    2-16     96-110 (125)
 67 COG1470 Predicted membrane pro  21.7 4.1E+02  0.0089   21.0   6.3   13    4-16    348-360 (513)
 68 TIGR02186 alph_Pro_TM conserve  21.7 3.1E+02  0.0068   19.6   5.1   46   22-77     35-80  (261)
 69 PRK09828 putative fimbrial out  21.4   1E+02  0.0022   25.4   2.9   20    2-21     59-78  (865)
 70 PRK15304 putative fimbrial out  21.4 1.2E+02  0.0026   24.8   3.2   19    3-21     57-75  (801)
 71 PRK15294 putative fimbrial out  21.3   1E+02  0.0022   25.3   2.8   19    3-21     57-75  (845)
 72 PF00801 PKD:  PKD domain;  Int  21.3   1E+02  0.0022   16.5   2.1   14    3-16     49-62  (69)
 73 PRK15316 RatA-like protein; Pr  21.2 2.5E+02  0.0055   26.4   5.2   62   14-78    753-824 (2683)
 74 PF14734 DUF4469:  Domain of un  20.7      93   0.002   19.0   2.0   12    3-14     74-85  (102)
 75 COG4136 ABC-type uncharacteriz  20.4      68  0.0015   22.0   1.4   19   54-78     21-39  (213)
 76 TIGR03292 PhnH_redo phosphonat  20.3 1.1E+02  0.0024   20.5   2.5   15   32-46    114-128 (174)
 77 PF02018 CBM_4_9:  Carbohydrate  20.2 1.9E+02  0.0041   16.6   3.3   26    3-30     58-84  (131)

No 1  
>KOG0518|consensus
Probab=99.74  E-value=6.6e-18  Score=133.75  Aligned_cols=77  Identities=40%  Similarity=0.628  Sum_probs=70.9

Q ss_pred             CCccceeeEEEEEECCEecCCCCeEEEecC-CCCCCeEEEeCCCcccceeceeceEEEEEccCCCccceEEEEECCCCC
Q psy5364           1 MSLRRRRINLRTEGEGVDVPGSPFTVRVSG-APDASKVRVYGPGIEHGVLATFQSRFICDTRGAGAGQLTVRVRGPKGF   78 (78)
Q Consensus         1 ~~~~~G~~~i~V~~~g~~V~gSPf~v~V~~-~~d~~kv~v~GpGL~~g~~~~~~~~F~Id~~~aG~g~l~v~v~gP~g~   78 (78)
                      .|.+.|.|.|+|+++|+||+||||+|+|.. ..||++++++|.||..|. .-+|++|+||+|+||.|.|.+.++||+++
T Consensus       724 ~P~e~GeH~I~Vk~~G~hVpgsPf~i~V~~~e~dAsk~~v~g~g~~~G~-t~ep~~fivDtr~agyGgLsi~~~Gpskv  801 (1113)
T KOG0518|consen  724 TPREVGEHKINVKVAGKHVPGSPFSIKVSESEIDASKVRVSGQGLKEGH-TFEPAEFIVDTRKAGYGGLSISVQGPSKV  801 (1113)
T ss_pred             CCCcCcceEEEEEEcceECCCCCeEEEeccccccceeEEEecccccccc-cccchheEeccccCCCCceEEEEeCCccc
Confidence            378999999999999999999999999985 348999999999999997 45899999999999999999999999953


No 2  
>KOG0518|consensus
Probab=99.70  E-value=2.6e-17  Score=130.45  Aligned_cols=64  Identities=41%  Similarity=0.744  Sum_probs=60.9

Q ss_pred             CccceeeEEEEEECCEecCCCCeEEEecCCCCCCeEEEeCCCcccceeceeceEEEEEccCCCccceEEEEECCCCC
Q psy5364           2 SLRRRRINLRTEGEGVDVPGSPFTVRVSGAPDASKVRVYGPGIEHGVLATFQSRFICDTRGAGAGQLTVRVRGPKGF   78 (78)
Q Consensus         2 ~~~~G~~~i~V~~~g~~V~gSPf~v~V~~~~d~~kv~v~GpGL~~g~~~~~~~~F~Id~~~aG~g~l~v~v~gP~g~   78 (78)
                      |.|+|.|+|.|+|||+||+||||++.|.       +++||+||++      +++|.|.|++||.|.|+|+|.||+++
T Consensus      1049 P~e~G~h~i~Vkfd~~hv~gSPF~i~v~-------vsayG~gle~------~sefvvnt~~agaG~lsv~~~gPsk~ 1112 (1113)
T KOG0518|consen 1049 PKENGVHTIDVKFDGEHVPGSPFKIRVG-------VSAYGPGLEG------KSEFVVNTSNAGAGALSVTIDGPSKA 1112 (1113)
T ss_pred             ecCCCcEEEEEEeCCccCCCCCeEEEEE-------EEeecCCcCc------cceEEeeccCCCcceeeeecCCCccC
Confidence            6799999999999999999999999995       9999999998      57899999999999999999999986


No 3  
>smart00557 IG_FLMN Filamin-type immunoglobulin domains. These form a rod-like structure in the actin-binding cytoskeleton protein, filamin. The C-terminal repeats of filamin bind beta1-integrin (CD29).
Probab=99.04  E-value=7.4e-10  Score=66.57  Aligned_cols=43  Identities=56%  Similarity=0.927  Sum_probs=40.0

Q ss_pred             CCeEEEeCCCcccceeceeceEEEEEccCCCccceEEEEECCCC
Q psy5364          34 ASKVRVYGPGIEHGVLATFQSRFICDTRGAGAGQLTVRVRGPKG   77 (78)
Q Consensus        34 ~~kv~v~GpGL~~g~~~~~~~~F~Id~~~aG~g~l~v~v~gP~g   77 (78)
                      |++|+|+|+||+.+. .++++.|+|+++++|.++|+|.|++|+|
T Consensus         1 p~~~~v~G~Gl~~~~-vg~~~~f~v~~~d~G~~~~~v~i~~p~g   43 (93)
T smart00557        1 ASKVKASGPGLEKGV-VGEPAEFTIDTRGAGGGELEVEVTGPSG   43 (93)
T ss_pred             CCeEEEECCCcCcee-cCCCEEEEEEcCCCCCCcEEEEEECCCC
Confidence            689999999999985 5689999999999999999999999997


No 4  
>smart00557 IG_FLMN Filamin-type immunoglobulin domains. These form a rod-like structure in the actin-binding cytoskeleton protein, filamin. The C-terminal repeats of filamin bind beta1-integrin (CD29).
Probab=98.90  E-value=2.6e-09  Score=64.15  Aligned_cols=29  Identities=31%  Similarity=0.505  Sum_probs=27.0

Q ss_pred             CccceeeEEEEEECCEecCCCCeEEEecC
Q psy5364           2 SLRRRRINLRTEGEGVDVPGSPFTVRVSG   30 (78)
Q Consensus         2 ~~~~G~~~i~V~~~g~~V~gSPf~v~V~~   30 (78)
                      |.++|.|+|+|+|+|+||+||||.++|.+
T Consensus        64 P~~~G~~~i~V~~~g~~I~gSPF~v~V~~   92 (93)
T smart00557       64 PTEPGDYTVTVKFGGEHIPGSPFTVKVGP   92 (93)
T ss_pred             eCCCEeEEEEEEECCEECCCCCEEEEEeC
Confidence            78999999999999999999999999853


No 5  
>PF00630 Filamin:  Filamin/ABP280 repeat;  InterPro: IPR017868 The many different actin cross-linking proteins share a common architecture, consisting of a globular actin-binding domain and an extended rod. Whereas their actin-binding domains consist of two calponin homology domains (see IPR001715 from INTERPRO), their rods fall into three families. The rod domain of the family including the Dictyostelium discoideum (Slime mould) gelation factor (ABP120) and human filamin (ABP280) is constructed from tandem repeats of a 100-residue motif that is glycine and proline rich []. The gelation factor's rod contains 6 copies of the repeat, whereas filamin has a rod constructed from 24 repeats. The resolution of the 3D structure of rod repeats from the gelation factor has shown that they consist of a beta-sandwich, formed by two beta-sheets arranged in an immunoglobulin-like fold [, ]. Because conserved residues that form the core of the repeats are preserved in filamin, the repeat structure should be common to the members of the gelation factor/filamin family. The head to tail homodimerisation is crucial to the function of the ABP120 and ABP280 proteins. This interaction involves a small portion at the distal end of the rod domains. For the gelation factor it has been shown that the carboxy-terminal repeat 6 dimerises through a double edge-to-edge extension of the beta-sheet and that repeat 5 contributes to dimerisation to some extent [, , ].; PDB: 2DI9_A 2EEC_A 2DIC_A 2EEA_A 2DMC_A 2EE9_A 2D7O_A 2D7N_A 2K7P_A 2NQC_A ....
Probab=98.79  E-value=9.7e-09  Score=61.43  Aligned_cols=45  Identities=40%  Similarity=0.674  Sum_probs=39.4

Q ss_pred             CCCCeEEEeCCCcccceeceeceEEEEEccCCCcc-------ceEEEEECCCC
Q psy5364          32 PDASKVRVYGPGIEHGVLATFQSRFICDTRGAGAG-------QLTVRVRGPKG   77 (78)
Q Consensus        32 ~d~~kv~v~GpGL~~g~~~~~~~~F~Id~~~aG~g-------~l~v~v~gP~g   77 (78)
                      .||++|+++|+||+.+. .++++.|+|++++++.+       .|.|.|.+|++
T Consensus         2 ~dp~~~~v~G~gl~~~~-~g~~~~F~V~~~d~~g~~~~~~~~~~~v~i~~p~~   53 (101)
T PF00630_consen    2 IDPSKCKVSGPGLEPAV-VGEPATFTVDTRDAGGNPVSSGGDEFQVTITSPDG   53 (101)
T ss_dssp             CSGCGEEEESGGGTEEE-TTSEEEEEEEETTTTSSBEESTSSEEEEEEESSSS
T ss_pred             CCCCEEEEECCccCCeE-CCCcEEEEEEEccCCCCccccCCceeEEEEeCCCC
Confidence            69999999999998885 66899999999999665       57799999976


No 6  
>COG4856 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=84.11  E-value=5.5  Score=30.10  Aligned_cols=53  Identities=26%  Similarity=0.333  Sum_probs=32.0

Q ss_pred             eEEEEEECC--EecCCCCeEEEecCCCCCCeEEEeCC-CcccceeceeceEEEE--EccCCCccceEEE
Q psy5364           8 INLRTEGEG--VDVPGSPFTVRVSGAPDASKVRVYGP-GIEHGVLATFQSRFIC--DTRGAGAGQLTVR   71 (78)
Q Consensus         8 ~~i~V~~~g--~~V~gSPf~v~V~~~~d~~kv~v~Gp-GL~~g~~~~~~~~F~I--d~~~aG~g~l~v~   71 (78)
                      --+.|+|+.  .+|+|-|++++|         .+.|| ++......  +..|.|  |-+.|+.|..+|.
T Consensus        52 vpvdvlYD~~~y~isg~~etV~V---------tl~G~ns~~~~~~~--~~dFkV~ADLt~a~~Gt~evk  109 (403)
T COG4856          52 VPVDVLYDSDKYFISGQPETVTV---------TLKGPNSIVLKSEK--PEDFKVVADLTHAGVGTHEVK  109 (403)
T ss_pred             ceeEEEEccccccccCCceEEEE---------EEeCCcceeeeeec--CcCeEEEEEhhhcCCCceEee
Confidence            346788974  789999999988         45552 23222222  333666  5567766655444


No 7  
>PF00927 Transglut_C:  Transglutaminase family, C-terminal ig like domain;  InterPro: IPR008958 Synonym(s): Protein-glutamine gamma-glutamyltransferase, Fibrinoligase, TGase  Transglutaminases catalyse the post-translational modification of proteins at glutamine residues, with formation of isopeptide bonds. Members of the transglutaminase family usually have three domains: N-terminal (IPR001102 from INTERPRO), middle (IPR013808 from INTERPRO) and C-terminal. The middle domain is usually well conserved, but family members can display major differences in their N- and C-terminal domains, although their overall structure is conserved []. This entry represents the C-terminal domain found in transglutaminases, which consists of an immunoglobulin-like beta-sandwich consisting of seven strands in two sheets with a Greek key topology. The best known transglutaminase is blood coagulation factor XIII, a plasma tetrameric protein composed of two catalytic A subunits and two non-catalytic B subunits. Factor XIII is responsible for cross-linking fibrin chains, thus stabilising the fibrin clot. Protein-glutamine gamma-glutamyltransferases (2.3.2.13 from EC) are calcium-dependent enzymes that catalyse the cross-linking of proteins by promoting the formation of isopeptide bonds between the gamma-carboxyl group of a glutamine in one polypeptide chain and the epsilon-amino group of a lysine in a second polypeptide chain. TGases also catalyse the conjugation of polyamines to proteins [, ].; GO: 0003810 protein-glutamine gamma-glutamyltransferase activity, 0018149 peptide cross-linking; PDB: 2XZZ_A 1GGY_B 1FIE_B 1GGU_B 1GGT_B 1F13_A 1QRK_B 1EVU_A 1EX0_B 1L9N_B ....
Probab=81.03  E-value=8.8  Score=22.82  Aligned_cols=38  Identities=26%  Similarity=0.355  Sum_probs=27.3

Q ss_pred             EEEEEECCEecCCCCeEEEec---CCCCC--------CeEEEeCCCccc
Q psy5364           9 NLRTEGEGVDVPGSPFTVRVS---GAPDA--------SKVRVYGPGIEH   46 (78)
Q Consensus         9 ~i~V~~~g~~V~gSPf~v~V~---~~~d~--------~kv~v~GpGL~~   46 (78)
                      .+.|+..+..+-|++|.+.+.   +...+        ..-.++|.|+..
T Consensus         2 ~~~i~~~~~~~vG~d~~v~v~~~N~~~~~l~~v~~~l~~~~v~ytG~~~   50 (107)
T PF00927_consen    2 EIKIKLPGDPVVGQDFTVSVSFTNPSSEPLRNVSLNLCAFTVEYTGLTR   50 (107)
T ss_dssp             EEEEEEESEEBTTSEEEEEEEEEE-SSS-EECEEEEEEEEEEECTTTEE
T ss_pred             eEEEEECCCccCCCCEEEEEEEEeCCcCccccceeEEEEEEEEECCccc
Confidence            477888899999999999876   22333        344568999864


No 8  
>PF10670 DUF4198:  Domain of unknown function (DUF4198)
Probab=72.37  E-value=24  Score=22.98  Aligned_cols=63  Identities=11%  Similarity=0.093  Sum_probs=36.6

Q ss_pred             eeEEEEEECCEecCCCCeEEEecCCCCCCeEEEeCCCcccceeceeceEEEEEccCCCccceEEEEECC
Q psy5364           7 RINLRTEGEGVDVPGSPFTVRVSGAPDASKVRVYGPGIEHGVLATFQSRFICDTRGAGAGQLTVRVRGP   75 (78)
Q Consensus         7 ~~~i~V~~~g~~V~gSPf~v~V~~~~d~~kv~v~GpGL~~g~~~~~~~~F~Id~~~aG~g~l~v~v~gP   75 (78)
                      ...+.|+|+|++++|.+..+.-..  +.......    ..-......-.|+|....+|.=.|.+..+.|
T Consensus       152 ~~~~~vl~~GkPl~~a~V~~~~~~--~~~~~~~~----~~~~~TD~~G~~~~~~~~~G~wli~a~~~~p  214 (215)
T PF10670_consen  152 PLPFQVLFDGKPLAGAEVEAFSPG--GWYDVEHE----AKTLKTDANGRATFTLPRPGLWLIRASHKDP  214 (215)
T ss_pred             EEEEEEEECCeEcccEEEEEEECC--Cccccccc----eEEEEECCCCEEEEecCCCEEEEEEEEEecC
Confidence            488999999999999766543321  11100000    1111122334677777777777777777666


No 9  
>TIGR03769 P_ac_wall_RPT actinobacterial surface-anchored protein domain. This model describes a repeat domain that one to three times in Actinobacterial proteins, some of which have LPXTG-type sortase recognition motifs for covalent attachment to the Gram-positive cell wall. Where it occurs with duplication in an LPXTG-anchored protein, it tends to be adjacent to the substrate-binding protein of the gene trio of an ABC transporter system, where that substrate-binding protein has a single copy of this same domain. This arrangement suggests a substrate-binding relay system, with the LPXTG protein acting as a substrate receptor.
Probab=71.71  E-value=5.2  Score=20.62  Aligned_cols=20  Identities=10%  Similarity=0.127  Sum_probs=15.7

Q ss_pred             ccceeeEEEEEECCEecCCC
Q psy5364           3 LRRRRINLRTEGEGVDVPGS   22 (78)
Q Consensus         3 ~~~G~~~i~V~~~g~~V~gS   22 (78)
                      .+||.|.|.+...+....|.
T Consensus        10 T~PG~Y~l~~~a~~~~~~G~   29 (41)
T TIGR03769        10 TKPGTYTLTVQATATLTDGK   29 (41)
T ss_pred             CCCeEEEEEEEEEEEeCCCc
Confidence            58999999999876555554


No 10 
>PF14347 DUF4399:  Domain of unknown function (DUF4399)
Probab=70.06  E-value=4.3  Score=24.21  Aligned_cols=22  Identities=23%  Similarity=0.194  Sum_probs=17.7

Q ss_pred             ccceeeEEEEEECC-EecCCCCe
Q psy5364           3 LRRRRINLRTEGEG-VDVPGSPF   24 (78)
Q Consensus         3 ~~~G~~~i~V~~~g-~~V~gSPf   24 (78)
                      +.||.|++.+.|.+ .|++-+|-
T Consensus        59 L~PG~htLtl~~~d~~h~~~~~~   81 (87)
T PF14347_consen   59 LPPGKHTLTLQLGDGDHVPHDPP   81 (87)
T ss_pred             eCCCCEEEEEEeCCCCcccCCCc
Confidence            68999999998864 78876664


No 11 
>TIGR02148 Fibro_Slime fibro-slime domain. This model represents a conserved region of about 90 amino acids, shared in at least 4 distinct large putative proteins from the slime mold Dictyostelium discoideum and 10 proteins from the rumen bacterium Fibrobacter succinogenes, and in no other species so far. We propose here the name fibro-slime domain
Probab=69.19  E-value=6.6  Score=23.82  Aligned_cols=26  Identities=19%  Similarity=0.227  Sum_probs=21.1

Q ss_pred             cccee-eEEEEEECCEecCCCCeEEEe
Q psy5364           3 LRRRR-INLRTEGEGVDVPGSPFTVRV   28 (78)
Q Consensus         3 ~~~G~-~~i~V~~~g~~V~gSPf~v~V   28 (78)
                      +++|. |.+++.|...|-.+|.|+++-
T Consensus        60 l~~g~~Y~~d~F~~ERht~~S~~~i~T   86 (90)
T TIGR02148        60 LKEGKTYPFDIFYCERHTTGSNFRIRT   86 (90)
T ss_pred             CccCcEeeEEEEEEeecCCCCCEEEEE
Confidence            34444 999999999999999998754


No 12 
>PF08770 SoxZ:  Sulphur oxidation protein SoxZ;  InterPro: IPR014880 SoxZ forms an anti parallel beta structure and forms a complex with SoxY. Sulphur oxidation occurs at the thiol of a conserved cysteine residue of the SoxY subunit []. ; PDB: 1V8H_B 2OX5_E 2OXG_E 2OXH_C.
Probab=69.11  E-value=23  Score=21.47  Aligned_cols=25  Identities=28%  Similarity=0.312  Sum_probs=14.8

Q ss_pred             eceEEEEEccCCCccceEEEEECCCC
Q psy5364          52 FQSRFICDTRGAGAGQLTVRVRGPKG   77 (78)
Q Consensus        52 ~~~~F~Id~~~aG~g~l~v~v~gP~g   77 (78)
                      +|. |+...+..+.|.|++.+++-+|
T Consensus        67 NP~-l~F~~~~~~~g~l~v~~~Dn~G   91 (100)
T PF08770_consen   67 NPY-LRFSFKGKKSGTLTVTWTDNKG   91 (100)
T ss_dssp             S-E-EEEEEEESSSEEEEEEEEETTS
T ss_pred             CCc-EEEEEecCCCcEEEEEEEECCC
Confidence            443 4444444445599999887665


No 13 
>PF08400 phage_tail_N:  Prophage tail fibre N-terminal;  InterPro: IPR013609 This entry represents the N terminus of phage 933W tail fibre protein. The characteristics of the protein distribution suggest prophage matches.
Probab=68.77  E-value=3.1  Score=26.91  Aligned_cols=17  Identities=12%  Similarity=0.128  Sum_probs=14.8

Q ss_pred             CCccceeeEEEEEECCE
Q psy5364           1 MSLRRRRINLRTEGEGV   17 (78)
Q Consensus         1 ~~~~~G~~~i~V~~~g~   17 (78)
                      |.++||.|.|.++.+|.
T Consensus        51 ~~~epG~Y~V~l~~~g~   67 (134)
T PF08400_consen   51 FDVEPGVYRVTLKVEGR   67 (134)
T ss_pred             EEecCCeEEEEEEECCC
Confidence            67899999999999875


No 14 
>PF13954 PapC_N:  PapC N-terminal domain; PDB: 2VQI_B 3FIP_A 3RFZ_E 3OHN_A 1ZDV_A 1ZE3_D 3BWU_D 1ZDX_A.
Probab=68.60  E-value=6.9  Score=24.86  Aligned_cols=21  Identities=5%  Similarity=-0.043  Sum_probs=17.0

Q ss_pred             CccceeeEEEEEECCEecCCC
Q psy5364           2 SLRRRRINLRTEGEGVDVPGS   22 (78)
Q Consensus         2 ~~~~G~~~i~V~~~g~~V~gS   22 (78)
                      .+.||.|.|+|.+|++.+...
T Consensus        26 ~~~pG~Y~vdv~vN~~~~~~~   46 (146)
T PF13954_consen   26 AIPPGEYSVDVYVNGKFIGRY   46 (146)
T ss_dssp             SS-SEEEEEEEEETTEEEEEE
T ss_pred             CCCCeEEEEEEEECCeeeeeE
Confidence            357999999999999998733


No 15 
>PRK15275 putative fimbrial protein SteD; Provisional
Probab=67.62  E-value=27  Score=23.10  Aligned_cols=62  Identities=18%  Similarity=0.156  Sum_probs=37.4

Q ss_pred             eEEEEEECCEecCCCCeEEEec--CCCCCCeEEEeCCCcccceeceeceEEEEEccCCCccceEEEEECC
Q psy5364           8 INLRTEGEGVDVPGSPFTVRVS--GAPDASKVRVYGPGIEHGVLATFQSRFICDTRGAGAGQLTVRVRGP   75 (78)
Q Consensus         8 ~~i~V~~~g~~V~gSPf~v~V~--~~~d~~kv~v~GpGL~~g~~~~~~~~F~Id~~~aG~g~l~v~v~gP   75 (78)
                      -..+|.|.|.-+.-.|..|.=.  +-.|-..-++.|     .. -.++..|+|+|.......+++++.|-
T Consensus        25 ~~~~~~~~g~l~~pppC~In~~~VdFG~I~~~~idg-----~~-~~~~~~y~I~Cd~~~~~~vkltf~Gt   88 (166)
T PRK15275         25 KTVQLELHLVVTQPPPCTVGGASVEFGDVLTTKVGD-----AS-QTKPVGYSLNCDGRASDYLKLQIQGT   88 (166)
T ss_pred             ccceEEEEEEEecCCCCEEcCCEEECCcceeeeEcC-----cc-cEeeecEEEECCCCCCCcEEEEEEcc
Confidence            3467788888887778776421  001212222233     11 12566899999877666888888874


No 16 
>PF07495 Y_Y_Y:  Y_Y_Y domain;  InterPro: IPR011123 This region is mostly found at the end of the beta propellers (IPR011110 from INTERPRO) in a family of two component regulators. However they are also found tandemly repeated in Q891H4 from SWISSPROT without other signal conduction domains being present. It is named after the conserved tyrosines found in the alignment. The exact function is not known.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=60.26  E-value=8.9  Score=20.50  Aligned_cols=14  Identities=14%  Similarity=0.266  Sum_probs=10.1

Q ss_pred             CccceeeEEEEEEC
Q psy5364           2 SLRRRRINLRTEGE   15 (78)
Q Consensus         2 ~~~~G~~~i~V~~~   15 (78)
                      ++.||.|++.|+-.
T Consensus        35 ~L~~G~Y~l~V~a~   48 (66)
T PF07495_consen   35 NLPPGKYTLEVRAK   48 (66)
T ss_dssp             S--SEEEEEEEEEE
T ss_pred             eCCCEEEEEEEEEE
Confidence            47899999999874


No 17 
>cd08548 Type_I_cohesin_like Type I cohesin domain, interaction partner of dockerin. Bacterial cohesin domains bind to a complementary protein domain named dockerin, and this interaction is required for the formation of the cellulosome, a cellulose-degrading complex. The cellulosome consists of scaffoldin, a noncatalytic scaffolding polypeptide, that comprises repeating cohesion modules and a single carbohydrate-binding module (CBM). Specific calcium-dependent interactions between cohesins and dockerins appear to be essential for cellulosome assembly. This subfamily represents type I cohesins; their interactions with dockerin mediate assembly of a range of dockerin-borne enzymes to the complex.
Probab=57.96  E-value=45  Score=20.99  Aligned_cols=34  Identities=15%  Similarity=0.228  Sum_probs=23.8

Q ss_pred             ccee-eEEEEEECCEe---cCCCCeEEEecCCCCCCeEEEeC
Q psy5364           4 RRRR-INLRTEGEGVD---VPGSPFTVRVSGAPDASKVRVYG   41 (78)
Q Consensus         4 ~~G~-~~i~V~~~g~~---V~gSPf~v~V~~~~d~~kv~v~G   41 (78)
                      ++|. .+|.|.+.+.+   |.+..|.+.-    |++..++..
T Consensus        11 ~~G~tv~VpV~~~~v~~~~i~~~~f~l~y----D~s~Lev~~   48 (135)
T cd08548          11 KPGDTVTVPVTLSNVPSKGIGACDFVLSY----DPSVLEVVS   48 (135)
T ss_pred             cCCCEEEEEEEEecCCccCEEEEEEEEEe----CCceeEEEe
Confidence            3444 78888888776   7788887765    666666654


No 18 
>PRK10301 hypothetical protein; Provisional
Probab=57.34  E-value=17  Score=22.78  Aligned_cols=25  Identities=20%  Similarity=0.322  Sum_probs=15.5

Q ss_pred             ccceeeEEEEEE---CCEecCCCCeEEEe
Q psy5364           3 LRRRRINLRTEG---EGVDVPGSPFTVRV   28 (78)
Q Consensus         3 ~~~G~~~i~V~~---~g~~V~gSPf~v~V   28 (78)
                      +.+|.|+|+-+.   +|.++.|+ |.+.|
T Consensus        96 L~~G~YtV~Wrvvs~DGH~~~G~-~~F~V  123 (124)
T PRK10301         96 LKPGTYTVDWHVVSVDGHKTKGH-YTFSV  123 (124)
T ss_pred             CCCccEEEEEEEEecCCCccCCe-EEEEE
Confidence            678888877554   56666655 44443


No 19 
>PF09906 DUF2135:  Uncharacterized protein conserved in bacteria (DUF2135);  InterPro: IPR019220  This entry, found in various hypothetical prokaryotic proteins, has no known function. 
Probab=53.87  E-value=28  Score=18.51  Aligned_cols=18  Identities=6%  Similarity=-0.167  Sum_probs=14.7

Q ss_pred             cceeeEEEEEECCEecCC
Q psy5364           4 RRRRINLRTEGEGVDVPG   21 (78)
Q Consensus         4 ~~G~~~i~V~~~g~~V~g   21 (78)
                      .+|.|.|-|.|.|-+=..
T Consensus         2 ~~G~Y~VyVnYyg~~~~~   19 (50)
T PF09906_consen    2 PPGTYLVYVNYYGNRSQS   19 (50)
T ss_pred             CCceEEEEEEeccCCCcC
Confidence            479999999999866554


No 20 
>PF09430 DUF2012:  Protein of unknown function (DUF2012);  InterPro: IPR019008  This domain is found in different proteins, including uncharacterised protein family UPF0480 and nodal modulators. A nodal modulator has been identified as part of a protein complex that participates in the nodal signaling pathway during vertebrate development []. 
Probab=47.65  E-value=59  Score=20.08  Aligned_cols=38  Identities=24%  Similarity=0.294  Sum_probs=25.8

Q ss_pred             ccceeeEEEEEECCEecCCCCeEEEecC--CCCCCeEEEeCC
Q psy5364           3 LRRRRINLRTEGEGVDVPGSPFTVRVSG--APDASKVRVYGP   42 (78)
Q Consensus         3 ~~~G~~~i~V~~~g~~V~gSPf~v~V~~--~~d~~kv~v~Gp   42 (78)
                      +.+|.|.++|.-.+-.-  +|++|.|.+  ...+.++.+..+
T Consensus        37 Vp~GsY~L~V~s~~~~F--~~~RVdV~~~~~~~~~~~~~~~~   76 (123)
T PF09430_consen   37 VPPGSYLLEVHSPDYVF--PPYRVDVSSSGKIRARKVNYWQP   76 (123)
T ss_pred             CCCceEEEEEECCCccc--cCEEEEEecCCCCCceEEEeecC
Confidence            57899999998776554  679998863  234445555444


No 21 
>PRK15286 putative minor fimbrial subunit StfE; Provisional
Probab=46.87  E-value=77  Score=21.25  Aligned_cols=62  Identities=16%  Similarity=0.143  Sum_probs=37.7

Q ss_pred             eeEEEEEECCEecCCCCeEEEecCCCCCCeEEEeCCCcc----cceeceeceEEEEEccCCCccceEEEEECCC
Q psy5364           7 RINLRTEGEGVDVPGSPFTVRVSGAPDASKVRVYGPGIE----HGVLATFQSRFICDTRGAGAGQLTVRVRGPK   76 (78)
Q Consensus         7 ~~~i~V~~~g~~V~gSPf~v~V~~~~d~~kv~v~GpGL~----~g~~~~~~~~F~Id~~~aG~g~l~v~v~gP~   76 (78)
                      .-..++.|.+.-+.--|+.|+=.       ..-+|. +-    .|.--.++..++++|.+.....|+++++|..
T Consensus        28 ~~~~~~~~~g~l~~pPpC~In~g-------~V~FGd-V~~~~IDG~nY~q~v~Y~l~C~~~~~~~lkL~i~G~~   93 (170)
T PRK15286         28 SKTVNMTLTIVVNAAPPCTVTGG-------EVEFGN-VLTTKVDGVNYRQAVGYRLSCNGRVSDYLKLQIQGNA   93 (170)
T ss_pred             CccceEEEEEEEecCCCcEEcCC-------EEECCc-cccccccCeeeeEecCeEEEcCCCCCCcEEEEEEccc
Confidence            34577777777776566655411       122232 21    1222235678999998877778999998753


No 22 
>TIGR03517 GldM_gliding gliding motility-associated protein GldM. This protein family, GldM, is named for the member from Flavobacterium johnsoniae, which is required for a type of rapid gliding motility found in certain members of the Bacteriodetes. However, members are found also in several members of the Bacteriodetes that appear not to be motile. The best conserved region, toward the N-terminus, is centered on a highly hydrobobic probable transmembrane helix. Two paralogs are found in Cytophaga hutchinsonii.
Probab=46.50  E-value=65  Score=25.26  Aligned_cols=24  Identities=25%  Similarity=0.465  Sum_probs=17.0

Q ss_pred             CCeEEEecCCCCCCeEEEeCCCccc
Q psy5364          22 SPFTVRVSGAPDASKVRVYGPGIEH   46 (78)
Q Consensus        22 SPf~v~V~~~~d~~kv~v~GpGL~~   46 (78)
                      =|..|.| |...+.+++++++||..
T Consensus       340 NPisIsv-pgv~~~~v~as~~gl~~  363 (523)
T TIGR03517       340 NPINISV-PGVGANYVTATGAGLKS  363 (523)
T ss_pred             CceEEEe-cCCCcceEEEEccccee
Confidence            3555555 34667889999999864


No 23 
>PF09087 Cyc-maltodext_N:  Cyclomaltodextrinase, N-terminal;  InterPro: IPR015171 This domain is found at the N terminus of cyclomaltodextrinase. The domain assumes a beta-sandwich structure composed of the eight antiparallel beta-strands. A ten residue linker is also present at the C-terminal end, which connects the N-terminal domain to a distal domain in the protein. This domain participates in oligomerisation of the protein, wherein the N-terminal domain of one subunit contacts the active centre of the other subunit, and is also required for binding of cyclodextrin to substrate []. ; PDB: 3EDK_B 3EDD_A 3EDJ_B 3EDE_A 1H3G_B 3EDF_B.
Probab=46.22  E-value=58  Score=19.47  Aligned_cols=41  Identities=20%  Similarity=0.299  Sum_probs=21.5

Q ss_pred             CeEEEeCCCcc--cceeceeceEEE--EEccCCCccceEEEEECC
Q psy5364          35 SKVRVYGPGIE--HGVLATFQSRFI--CDTRGAGAGQLTVRVRGP   75 (78)
Q Consensus        35 ~kv~v~GpGL~--~g~~~~~~~~F~--Id~~~aG~g~l~v~v~gP   75 (78)
                      .+|.+..+|+.  .-.....|+-..  ++.++|=+|.|.+.+..+
T Consensus        28 ~~v~i~~~gV~i~~v~~~~npNYLFv~L~i~~akpg~~~i~~~~~   72 (88)
T PF09087_consen   28 AEVSISYPGVTIKKVVKTDNPNYLFVYLDISDAKPGTFTINFKKG   72 (88)
T ss_dssp             SEEEE-BTTEEEEEEEE-SSTTEEEEEEEE-T--SEEEEEEEEET
T ss_pred             CEEEEeCCCeEEEEEEecCCCCEEEEEEecCCCCCcEEEEEEEcC
Confidence            45555666764  223344555443  345688888898888765


No 24 
>PF10029 DUF2271:  Predicted periplasmic protein (DUF2271);  InterPro: IPR014469 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=43.97  E-value=40  Score=21.60  Aligned_cols=26  Identities=27%  Similarity=0.466  Sum_probs=15.5

Q ss_pred             cceeeEEEEEE---CCEecCCCCeEEEec
Q psy5364           4 RRRRINLRTEG---EGVDVPGSPFTVRVS   29 (78)
Q Consensus         4 ~~G~~~i~V~~---~g~~V~gSPf~v~V~   29 (78)
                      .+|.|++....   .+..++..=|++.|+
T Consensus        75 ~~G~~~~~~d~~~~~~~~l~~g~Y~l~vE  103 (139)
T PF10029_consen   75 PPGKYTLSWDGTDDIGNPLPDGGYTLRVE  103 (139)
T ss_pred             CCCccEEEEEccccccCccCCCcEEEEEE
Confidence            46777666643   345566666777663


No 25 
>PRK15311 putative fimbrial protein TcfD; Provisional
Probab=43.85  E-value=1.4e+02  Score=22.48  Aligned_cols=67  Identities=22%  Similarity=0.286  Sum_probs=42.2

Q ss_pred             eeeEEEEEECCEecC---CCCeEEEecCCCCCCe-EEEeCCCcccceeceeceE-------EEEEccCCCc--cceEEEE
Q psy5364           6 RRINLRTEGEGVDVP---GSPFTVRVSGAPDASK-VRVYGPGIEHGVLATFQSR-------FICDTRGAGA--GQLTVRV   72 (78)
Q Consensus         6 G~~~i~V~~~g~~V~---gSPf~v~V~~~~d~~k-v~v~GpGL~~g~~~~~~~~-------F~Id~~~aG~--g~l~v~v   72 (78)
                      =+|.|+..|+|..|.   |-=|.++=   .|-.. --|.=||+...+.+ -|+.       |.|-.+.||.  |.|+|++
T Consensus       277 iDY~Vsl~y~G~~i~v~Nge~~~l~~---v~~~~~R~V~LPgIs~pV~C-vPtPLTL~t~~F~v~~k~aG~Y~G~LtVt~  352 (359)
T PRK15311        277 IDYAVSMNYGGRSIPVTRGVEFSLDN---VDKAATRPVVLPGQRQAVRC-VPVPLTLTTQPFNIREKRSGEYQGTLTVTM  352 (359)
T ss_pred             eeEEEEEecCCceeccCCCceEEEec---cchhceeeeecCCCccceEE-eccccEeecCcccccCCCCceeeeEEEEEE
Confidence            358899988876664   44444421   23233 33566888876643 4553       5555588985  9999988


Q ss_pred             ECCC
Q psy5364          73 RGPK   76 (78)
Q Consensus        73 ~gP~   76 (78)
                      +=++
T Consensus       353 ~~~t  356 (359)
T PRK15311        353 LMGT  356 (359)
T ss_pred             ecCC
Confidence            7543


No 26 
>PF12080 GldM_C:  GldM C-terminal domain;  InterPro: IPR022719  This domain is found in bacteria at the C terminus of the GldM protein. This domain is typically between 169 to 182 amino acids in length and has two completely conserved residues (Y and N) that may be functionally important. GldM, is named for the member from Bacteriodetes Flavobacterium johnsoniae, which is required for a type of rapid gliding motility found in certain members of the Bacteriodetes []. 
Probab=41.47  E-value=1.1e+02  Score=20.44  Aligned_cols=34  Identities=21%  Similarity=0.393  Sum_probs=23.8

Q ss_pred             EEEEEECCEecCCCCeEEEecCCCCCCeEEEeCCC--ccc
Q psy5364           9 NLRTEGEGVDVPGSPFTVRVSGAPDASKVRVYGPG--IEH   46 (78)
Q Consensus         9 ~i~V~~~g~~V~gSPf~v~V~~~~d~~kv~v~GpG--L~~   46 (78)
                      +++|+|.|.   .=|..+.+. ...+.++.++..|  |..
T Consensus         6 ~MNVLY~G~---~NpisIsvp-gv~~~~v~~s~~ggsl~~   41 (181)
T PF12080_consen    6 KMNVLYRGV---DNPISISVP-GVPSNKVPASATGGSLSK   41 (181)
T ss_pred             ccceEecCC---CCcEEEEeC-CCCccccEEEeeCCEEEe
Confidence            467888875   457777664 4567888888877  654


No 27 
>PF01483 P_proprotein:  Proprotein convertase P-domain;  InterPro: IPR002884 This domain, termed the P domain is approximately 150 amino acids in length and C-terminal to a serine endopeptidase domain which belong to MEROPS peptidase family S8 (clan SB), subfamily S8B (kexin). The domain is primarily associated with the calcium-dependent serine endopeptidases, kex2/subtilisin proprotein convertases (PCs), which have been identified in all eukaryotes [] and in the gammaproteobacteria, Nostoc (cyanobacteria) and in Streptomyces avermitilis. The P domain appears necessary for folding and maintaining the endopeptidase catalytic domain and to regulate its calcium and acidic pH dependence. In addition, contained within the middle of the P domain in most PC family members is the cognate integrin binding RGD sequence [], which may be required for intracellular compartmentalization and maintenance of enzyme stability within the ER. The integrity of the RGD sequence of proprotein convertase PC1 is critical for its zymogen and C-terminal processing and for its cellular trafficking [, ]. The carboxy-terminal tail provides uniqueness to each PC family member being the least conserved region of all convertases [].; GO: 0004252 serine-type endopeptidase activity, 0006508 proteolysis; PDB: 1P8J_E 1OT5_B 2ID4_A 1R64_A 3HJR_A.
Probab=41.17  E-value=46  Score=18.91  Aligned_cols=22  Identities=18%  Similarity=0.234  Sum_probs=13.3

Q ss_pred             EEEEEccCCCccceEEEEECCCCC
Q psy5364          55 RFICDTRGAGAGQLTVRVRGPKGF   78 (78)
Q Consensus        55 ~F~Id~~~aG~g~l~v~v~gP~g~   78 (78)
                      .+.|....  .|+|.+.+..|+|.
T Consensus         7 ~v~i~H~~--~gdL~i~L~SP~Gt   28 (87)
T PF01483_consen    7 SVDITHPY--RGDLRITLISPSGT   28 (87)
T ss_dssp             EEEEEESS--GGGEEEEEE-TT--
T ss_pred             EEEEEcCC--cCCEEEEEECCCCC
Confidence            34444443  67899999999983


No 28 
>PF07523 Big_3:  Bacterial Ig-like domain (group 3);  InterPro: IPR011080 This entry represents bacterial domains with an Ig-like fold. These domains are found in a variety of bacterial surface proteins.; PDB: 2L7Y_A 2KPN_A.
Probab=39.89  E-value=30  Score=18.86  Aligned_cols=14  Identities=7%  Similarity=0.050  Sum_probs=11.8

Q ss_pred             ccceeeEEEEEECC
Q psy5364           3 LRRRRINLRTEGEG   16 (78)
Q Consensus         3 ~~~G~~~i~V~~~g   16 (78)
                      .++|.|.|.++|.+
T Consensus        45 ~~~G~y~Vt~~y~~   58 (67)
T PF07523_consen   45 SKAGTYTVTYTYKG   58 (67)
T ss_dssp             TS-CCEEEEEEECT
T ss_pred             CCCceEEEEEEECC
Confidence            47899999999998


No 29 
>PF05359 DUF748:  Domain of Unknown Function (DUF748);  InterPro: IPR008023  This is a family of proteins of unknown function.
Probab=39.84  E-value=90  Score=19.19  Aligned_cols=52  Identities=17%  Similarity=0.270  Sum_probs=24.9

Q ss_pred             EEEECCEecCCCCeEEEecCCCCCCeEEEeCCCcccceeceeceEEEEEccCCCccceEEE
Q psy5364          11 RTEGEGVDVPGSPFTVRVSGAPDASKVRVYGPGIEHGVLATFQSRFICDTRGAGAGQLTVR   71 (78)
Q Consensus        11 ~V~~~g~~V~gSPf~v~V~~~~d~~kv~v~GpGL~~g~~~~~~~~F~Id~~~aG~g~l~v~   71 (78)
                      +|.|.+..++. ||++.+.+    -...+..  |...  .+.++.+.+.+.-.+.|.+.+.
T Consensus        14 ~i~f~D~~~~~-~~~~~l~~----i~~~l~~--l~~~--~~~~~~~~l~~~~~~~g~l~~~   65 (151)
T PF05359_consen   14 RINFTDQSVPP-PFKLELSD----INLTLDN--LSTD--SNEPAPVSLSASLNGGGTLSAS   65 (151)
T ss_pred             EEEEEeCCCCC-CceEEEEE----EEEEEEe--ccCC--CCCcEEEEEEEEeCCCCEEEEE
Confidence            45666655554 88776632    2223332  3211  1234555555554444444443


No 30 
>PF12863 DUF3821:  Domain of unknown function (DUF3821);  InterPro: IPR024277 This is a domain largely confined to sequences from Methanomicrobiales. It is found in putative lipases but the function is unknown.
Probab=36.52  E-value=69  Score=22.12  Aligned_cols=30  Identities=27%  Similarity=0.381  Sum_probs=20.4

Q ss_pred             eceeceEEEEEcc--------CCCc--cceEEEEECCCCC
Q psy5364          49 LATFQSRFICDTR--------GAGA--GQLTVRVRGPKGF   78 (78)
Q Consensus        49 ~~~~~~~F~Id~~--------~aG~--g~l~v~v~gP~g~   78 (78)
                      +.+....|+|++.        +.+.  +.+.+.|++|+|.
T Consensus       103 ~~G~~v~FrI~tNL~~~~~R~g~~~~~~~v~I~V~~P~G~  142 (209)
T PF12863_consen  103 PRGDNVNFRIDTNLYSIFQRGGYTPGDGPVDIKVTTPSGA  142 (209)
T ss_pred             ccCCeEEEEEcccHHHHhhcCCCCCCcceEEEEEeCCCCc
Confidence            3445678999863        1122  3399999999984


No 31 
>PF10632 He_PIG_assoc:  He_PIG associated, NEW1 domain of bacterial glycohydrolase;  InterPro: IPR019599 This domain has been named NEW1 but its actual function is not known. It is found on proteins which are bacterial galactosidases []. The domain is associated with IPR008009 from INTERPRO, a putative Ig-containing domain. 
Probab=36.05  E-value=31  Score=16.66  Aligned_cols=9  Identities=56%  Similarity=1.209  Sum_probs=7.7

Q ss_pred             CCCCeEEEe
Q psy5364          20 PGSPFTVRV   28 (78)
Q Consensus        20 ~gSPf~v~V   28 (78)
                      +||||...|
T Consensus        13 Pg~pfl~~I   21 (29)
T PF10632_consen   13 PGSPFLFTI   21 (29)
T ss_pred             CCCcEEEEe
Confidence            689998877


No 32 
>PF08141 SspH:  Small acid-soluble spore protein H family;  InterPro: IPR012610 This family consists of the small acid-soluble spore proteins (SASP) of the H type (sspH). SspH are unique to spores of Bacillus subtilis and are expressed only in the forespore compartment during sporulation of this organism. The sspH genes are monocistronic and are recognised by the forespore-specific sigma factor for RNA polymerase - sigma-G. The specific role of this protein is unclear but is thought to play a role in sporulation under conditions different from that of the common laboratory tests of spore properties [].; GO: 0030436 asexual sporulation, 0042601 endospore-forming forespore
Probab=35.80  E-value=36  Score=18.90  Aligned_cols=12  Identities=25%  Similarity=0.188  Sum_probs=9.5

Q ss_pred             eEEEEEECCEec
Q psy5364           8 INLRTEGEGVDV   19 (78)
Q Consensus         8 ~~i~V~~~g~~V   19 (78)
                      -.++|+|+|.+|
T Consensus        14 ~~i~V~y~G~pV   25 (58)
T PF08141_consen   14 DMIEVTYNGVPV   25 (58)
T ss_pred             ceEEEEECCEEE
Confidence            357899998776


No 33 
>PF08308 PEGA:  PEGA domain;  InterPro: IPR013229 This domain is found in both archaea and bacteria and has similarity to S-layer (surface layer) proteins. It is named after the characteristic PEGA sequence motif found in this domain. The secondary structure of this domain is predicted to be beta-strands.
Probab=35.62  E-value=36  Score=18.51  Aligned_cols=15  Identities=20%  Similarity=0.202  Sum_probs=12.1

Q ss_pred             ccceeeEEEEEECCE
Q psy5364           3 LRRRRINLRTEGEGV   17 (78)
Q Consensus         3 ~~~G~~~i~V~~~g~   17 (78)
                      +.+|.|.|.++..|-
T Consensus        32 l~~G~~~v~v~~~Gy   46 (71)
T PF08308_consen   32 LPPGEHTVTVEKPGY   46 (71)
T ss_pred             cCCccEEEEEEECCC
Confidence            568889988888873


No 34 
>PRK15315 outer membrane protein RatA; Provisional
Probab=34.96  E-value=2.2e+02  Score=25.88  Aligned_cols=63  Identities=27%  Similarity=0.402  Sum_probs=37.1

Q ss_pred             eEEEEEE-CCEecCCCCeEEEecCCC-------CCCeEEEeCCCcccceeceeceEEEEEccCCCccceEEEEECCCC
Q psy5364           8 INLRTEG-EGVDVPGSPFTVRVSGAP-------DASKVRVYGPGIEHGVLATFQSRFICDTRGAGAGQLTVRVRGPKG   77 (78)
Q Consensus         8 ~~i~V~~-~g~~V~gSPf~v~V~~~~-------d~~kv~v~GpGL~~g~~~~~~~~F~Id~~~aG~g~l~v~v~gP~g   77 (78)
                      .+|.++. +|+++++-||.|+..+..       ++.-|++-|--|     ....++|.= ..+| .|.++|.|+.|+|
T Consensus       229 LTVttk~~~g~~v~nv~f~i~~~da~NRQ~~vnn~~pV~vg~tel-----t~~~~~Y~G-~TDa-nG~aTv~vTqpnG  299 (1865)
T PRK15315        229 LTVTTKACDGEPAINAPFVIRREDAINRQGVVNNANPVRVGNTEL-----TTAQTEYHG-VTDG-QGNATVVVTQENG  299 (1865)
T ss_pred             EEEEeeccCCcccCceeEEEEeccchhccCCcCCCCceEECCeec-----ccccceEee-eeCC-CCeEEEEEeCCCC
Confidence            4555566 479999999999865422       223455532211     111234332 2333 6789999998887


No 35 
>PF05688 DUF824:  Salmonella repeat of unknown function (DUF824);  InterPro: IPR008542 This family consists of a series of repeated sequences (of around 180 residues) which are found in Salmonella typhimurium, Salmonella typhi and Escherichia coli. These repeats are almost always found with this entry. The repeats are associated with RatA and RatB, the coding sequences of which are found in the pathogeneicity island of Salmonella. The sequences may be determinants of pathogenicity [, ].
Probab=34.49  E-value=36  Score=18.09  Aligned_cols=16  Identities=38%  Similarity=0.715  Sum_probs=13.4

Q ss_pred             CCEecCCCCeEEEecC
Q psy5364          15 EGVDVPGSPFTVRVSG   30 (78)
Q Consensus        15 ~g~~V~gSPf~v~V~~   30 (78)
                      +|.++++-||.++-..
T Consensus        25 ~G~pv~n~~f~l~r~~   40 (47)
T PF05688_consen   25 NGNPVPNAPFTLTRGD   40 (47)
T ss_pred             CCCCcCCceEEEEecC
Confidence            6899999999987743


No 36 
>TIGR03503 conserved hypothetical protein TIGR03503. This set of conserved hypothetical protein has a phylogenetic range that closely matches that of TIGR03501, a putative C-terminal protein targeting signal.
Probab=34.29  E-value=50  Score=24.84  Aligned_cols=28  Identities=7%  Similarity=0.204  Sum_probs=24.9

Q ss_pred             CccceeeEEEEEECCEecCCCCeEEEec
Q psy5364           2 SLRRRRINLRTEGEGVDVPGSPFTVRVS   29 (78)
Q Consensus         2 ~~~~G~~~i~V~~~g~~V~gSPf~v~V~   29 (78)
                      ..+.|.|++.....+..+.|.||.+.+.
T Consensus       274 ~~~~G~Y~i~~~~~~Tt~~GRe~~~~lp  301 (374)
T TIGR03503       274 NYEFGKYRVKGTVFGTTITGRELQLTLP  301 (374)
T ss_pred             cCCCeEEEEEEEEEEecCCCCEEEEEcC
Confidence            3588999999999999999999998774


No 37 
>PF02019 WIF:  WIF domain;  InterPro: IPR003306 Wnt proteins constitute a large family of secreted molecules that are involved in intercellular signalling during development. The name derives from the first 2 members of the family to be discovered: int-1 (mouse) and wingless (Drosophila) []. It is now recognised that Wnt signalling controls many cell fate decisions in a variety of different organisms, including mammals []. Wnt signalling has been implicated in tumourigenesis, early mesodermal patterning of the embryo, morphogenesis of the brain and kidneys, regulation of mammary gland proliferation and Alzheimer's disease [, ]. Wnt-mediated signalling is believed to proceed initially through binding to cell surface receptors of the frizzled family; the signal is subsequently transduced through several cytoplasmic components to B-catenin, which enters the nucleus and activates the transcription of several genes important in development []. Several non-canonical Wnt signalling pathways have also been elucidated that act independently of B-catenin. Canonical and noncanonical Wnt signaling branches are highly interconnected, and cross-regulate each other []. Members of the Wnt gene family are defined by their sequence similarity to mouse Wnt-1 and Wingless in Drosophila. They encode proteins of ~350-400 residues in length, with orthologues identified in several, mostly vertebrate, species. Very little is known about the structure of Wnts as they are notoriously insoluble, but they share the following features characteristics of secretory proteins: a signal peptide, several potential N-glycosylation sites and 22 conserved cysteines [] that are probably involved in disulphide bonds. The Wnt proteins seem to adhere to the plasma membrane of the secreting cells and are therefore likely to signal over only few cell diameters. Fifteen major Wnt gene families have been identified in vertebrates, with multiple subtypes within some classes. This entry represents the WIF domain, and is found in the RYK tyrosine kinase receptors and WIF the Wnt-inhibitory-factor. The domain is extracellular and contains two conserved cysteines that may form a disulphide bridge. This domain is Wnt binding in WIF, and it has been suggested that RYK may also bind to Wnt [].; GO: 0004713 protein tyrosine kinase activity; PDB: 2YGP_A 2YGO_A 2YGN_A 2D3J_A 2YGQ_A.
Probab=32.35  E-value=63  Score=20.71  Aligned_cols=65  Identities=15%  Similarity=0.207  Sum_probs=27.4

Q ss_pred             ccceeeEEEEEECCEecCCCCeEEEecCCCCC-----CeEEEeCCCcccceeceeceEEEEEc--cCCCccceEEE
Q psy5364           3 LRRRRINLRTEGEGVDVPGSPFTVRVSGAPDA-----SKVRVYGPGIEHGVLATFQSRFICDT--RGAGAGQLTVR   71 (78)
Q Consensus         3 ~~~G~~~i~V~~~g~~V~gSPf~v~V~~~~d~-----~kv~v~GpGL~~g~~~~~~~~F~Id~--~~aG~g~l~v~   71 (78)
                      ..+..+.++.+|...-=..-++.+.+....|+     -.++++-    .|.+-..+..|.|+-  .+.-.|+..++
T Consensus        37 VPs~i~~l~FtW~s~~~~~~~Y~~~~l~s~d~~~l~~P~lnIs~----~G~VP~~~~~F~V~L~CtG~~~g~a~v~  108 (132)
T PF02019_consen   37 VPSEINYLNFTWQSLGKKPYPYNFDVLKSSDPSILPHPSLNISK----SGEVPHSPQVFSVELPCTGKRSGEATVT  108 (132)
T ss_dssp             B-TT-SEEEEEEEE-SSS-EEEEEEEEEES-TTTBB--EESS-S----EEEE-SS-EEEEEE--B-SSS-EEEEEE
T ss_pred             CCCCcceEEEEEEcCCCceEEEEEEEEEccCcccccCCccccCC----CeeccCCCEEEEEEEEecCccceEEEEE
Confidence            44556777777865333355666666222222     2222221    233334577898874  34323444443


No 38 
>PF09898 DUF2125:  Uncharacterized protein conserved in bacteria (DUF2125);  InterPro: IPR018666  This family of conserved hypothetical proteins has no known function. 
Probab=32.30  E-value=1.4e+02  Score=21.03  Aligned_cols=19  Identities=32%  Similarity=0.434  Sum_probs=16.8

Q ss_pred             EEEECCEecCCCCeEEEec
Q psy5364          11 RTEGEGVDVPGSPFTVRVS   29 (78)
Q Consensus        11 ~V~~~g~~V~gSPf~v~V~   29 (78)
                      .+.|.+..+.|-||++.+.
T Consensus        25 ~~~~~~~~~~GfPfR~~~~   43 (312)
T PF09898_consen   25 QVECADRSVGGFPFRFDVR   43 (312)
T ss_pred             EEEECCceeeccCeEEEEE
Confidence            5788999999999999875


No 39 
>PRK03174 sspH acid-soluble spore protein H; Provisional
Probab=32.00  E-value=39  Score=18.87  Aligned_cols=11  Identities=18%  Similarity=0.187  Sum_probs=8.9

Q ss_pred             EEEEEECCEec
Q psy5364           9 NLRTEGEGVDV   19 (78)
Q Consensus         9 ~i~V~~~g~~V   19 (78)
                      .++|+|+|.+|
T Consensus        15 ~i~VtY~G~pV   25 (59)
T PRK03174         15 MANVTYNGVPI   25 (59)
T ss_pred             ceEEEECCEEE
Confidence            57899998776


No 40 
>COG3459 Cellobiose phosphorylase [Carbohydrate transport and metabolism]
Probab=31.60  E-value=2.5e+02  Score=24.19  Aligned_cols=60  Identities=20%  Similarity=0.303  Sum_probs=40.3

Q ss_pred             ccceeeEEEEEECCEecCCCCeEEEecCCCCCCeEEEeCCCcccceeceeceEEEEEccCCCccceEEEEECCC
Q psy5364           3 LRRRRINLRTEGEGVDVPGSPFTVRVSGAPDASKVRVYGPGIEHGVLATFQSRFICDTRGAGAGQLTVRVRGPK   76 (78)
Q Consensus         3 ~~~G~~~i~V~~~g~~V~gSPf~v~V~~~~d~~kv~v~GpGL~~g~~~~~~~~F~Id~~~aG~g~l~v~v~gP~   76 (78)
                      .+=..|+++++|+     |+|++++|........+.+.|..++.-.   .|      -..+|...-.|.|.+|.
T Consensus       997 ~ew~G~~~~~~~~-----~~~~~i~V~~~~~~k~I~ldG~~~~~~~---~p------~~~~~~~~~evvv~~~~ 1056 (1056)
T COG3459         997 EEWDGYEATYELR-----NAPYDITVERPQGVKEILLDGQKIEGKI---LP------VGEAGDHKQEVVVKGPA 1056 (1056)
T ss_pred             cccCCeEEEEEEC-----CceEEEEEEcCCCcceEEEecccccccc---cc------ccccCCcceEEEEecCC
Confidence            3445577887776     6999999987555577888887665421   11      23467777778887763


No 41 
>PF07162 B9-C2:  Ciliary basal body-associated, B9 protein;  InterPro: IPR010796 Proteins in this entry include the MSK1 protein (Q9NXB0 from SWISSPROT) and other known or predicted flagellar basal body proteome components [] or cilia-containing species. Although the function is unknown, a cilia-specific role has been suggested for the poorly characterised B9 domain [, , ]. Mutations in MSK1 have been shown to cause Meckel syndrome type 1, a severe foetal development disorder that has been reported in most populations.
Probab=31.30  E-value=55  Score=21.28  Aligned_cols=22  Identities=18%  Similarity=0.131  Sum_probs=16.5

Q ss_pred             CCccceeeEEEEEECCEecCCCC
Q psy5364           1 MSLRRRRINLRTEGEGVDVPGSP   23 (78)
Q Consensus         1 ~~~~~G~~~i~V~~~g~~V~gSP   23 (78)
                      +|+.||.|+++|... .+..+|-
T Consensus       101 lP~~pG~h~~~v~~w-rP~~~s~  122 (168)
T PF07162_consen  101 LPTQPGRHEVEVPTW-RPVSGSI  122 (168)
T ss_pred             eCCCCceEEEEEEEE-eecCCCH
Confidence            488999999999777 4545543


No 42 
>TIGR02861 SASP_H small acid-soluble spore protein, H-type. This model is derived from pfam08141 but has been expanded to include in the seed corresponding proteins from three species of Clostridium. Members of this family should occur only in endospore-forming bacteria, typically with two members per genome, but may be absent from the genomes of some endospore-forming bacteria. SspH (previously designated YfjU) was shown to be expressed specifically in spores of Bacillus subtilis.
Probab=31.11  E-value=41  Score=18.67  Aligned_cols=11  Identities=27%  Similarity=0.326  Sum_probs=8.9

Q ss_pred             EEEEEECCEec
Q psy5364           9 NLRTEGEGVDV   19 (78)
Q Consensus         9 ~i~V~~~g~~V   19 (78)
                      .+.|+|+|++|
T Consensus        15 ~i~V~Y~G~pV   25 (58)
T TIGR02861        15 MINVTYKGVPV   25 (58)
T ss_pred             ceEEEECCEEE
Confidence            47899998776


No 43 
>PF04571 Lipin_N:  lipin, N-terminal conserved region;  InterPro: IPR007651 Mutations in the lipin gene lead to fatty liver dystrophy in mice. The protein has been shown to be phosphorylated by the TOR Ser/Thr protein kinases in response to insulin stimulation. This entry represents a conserved domain found at the N terminus of the member proteins [, ].
Probab=30.55  E-value=78  Score=19.86  Aligned_cols=20  Identities=30%  Similarity=0.449  Sum_probs=13.3

Q ss_pred             EEEEECCEecCCCCeEEEec
Q psy5364          10 LRTEGEGVDVPGSPFTVRVS   29 (78)
Q Consensus        10 i~V~~~g~~V~gSPf~v~V~   29 (78)
                      |-|+..+-...-|||-++..
T Consensus        31 IVV~q~DGs~~sSPFhVRFG   50 (110)
T PF04571_consen   31 IVVEQPDGSLKSSPFHVRFG   50 (110)
T ss_pred             EEEecCCCCEecCccEEEEc
Confidence            34556666777888877654


No 44 
>PF13584 BatD:  Oxygen tolerance
Probab=30.28  E-value=78  Score=23.67  Aligned_cols=31  Identities=23%  Similarity=0.335  Sum_probs=25.2

Q ss_pred             CCccceeeEE---EEEECCEecCCCCeEEEecCC
Q psy5364           1 MSLRRRRINL---RTEGEGVDVPGSPFTVRVSGA   31 (78)
Q Consensus         1 ~~~~~G~~~i---~V~~~g~~V~gSPf~v~V~~~   31 (78)
                      +|.++|.|+|   .|.++|+...=.|++++|.+.
T Consensus        80 ~p~~~G~~~IP~~~v~v~Gk~~~S~pi~i~V~~~  113 (484)
T PF13584_consen   80 QPKKTGTFTIPPFTVEVDGKTYKSQPITIEVSKA  113 (484)
T ss_pred             EecccceEEEceEEEEECCEEEeecCEEEEEEec
Confidence            4778999886   567799888889999999764


No 45 
>PRK01625 sspH acid-soluble spore protein H; Provisional
Probab=30.25  E-value=43  Score=18.68  Aligned_cols=11  Identities=36%  Similarity=0.456  Sum_probs=9.0

Q ss_pred             EEEEEECCEec
Q psy5364           9 NLRTEGEGVDV   19 (78)
Q Consensus         9 ~i~V~~~g~~V   19 (78)
                      .+.|+|+|.+|
T Consensus        15 ~i~V~Y~G~pV   25 (59)
T PRK01625         15 RIDVTYEGVPV   25 (59)
T ss_pred             ceEEEECCEEE
Confidence            57899998877


No 46 
>KOG0416|consensus
Probab=27.60  E-value=66  Score=21.93  Aligned_cols=24  Identities=21%  Similarity=0.382  Sum_probs=17.3

Q ss_pred             EEEEEccCCCccceEEEEECCCCC
Q psy5364          55 RFICDTRGAGAGQLTVRVRGPKGF   78 (78)
Q Consensus        55 ~F~Id~~~aG~g~l~v~v~gP~g~   78 (78)
                      .+.|+.-+-+-.+|.|+++||.++
T Consensus        19 ~yeV~~ind~m~ef~V~f~GP~ds   42 (189)
T KOG0416|consen   19 DYEVTIINDGMQEFYVKFHGPKDS   42 (189)
T ss_pred             CCeEEEecCcccEEEEEeeCCCCC
Confidence            456666666778888888888763


No 47 
>PF10528 PA14_2:  GLEYA domain;  InterPro: IPR018871  This presumed domain is found in fungal adhesins and is related to the PA14 domain. ; PDB: 4A3X_A.
Probab=27.42  E-value=59  Score=20.05  Aligned_cols=18  Identities=28%  Similarity=0.713  Sum_probs=13.0

Q ss_pred             cCCCccceEEEEECCCCC
Q psy5364          61 RGAGAGQLTVRVRGPKGF   78 (78)
Q Consensus        61 ~~aG~g~l~v~v~gP~g~   78 (78)
                      ...|.+.|..+|..|+|.
T Consensus        80 N~~g~~~~~~~i~~P~G~   97 (113)
T PF10528_consen   80 NGGGPGSFDFSITDPDGT   97 (113)
T ss_dssp             E-SS-EEEEEEEEETT-S
T ss_pred             cCCCceEEEEEEECCCCc
Confidence            446789999999999984


No 48 
>PRK15273 putative fimbrial outer membrane usher protein SteB; Provisional
Probab=27.15  E-value=68  Score=26.53  Aligned_cols=18  Identities=17%  Similarity=0.187  Sum_probs=16.5

Q ss_pred             ccceeeEEEEEECCEecC
Q psy5364           3 LRRRRINLRTEGEGVDVP   20 (78)
Q Consensus         3 ~~~G~~~i~V~~~g~~V~   20 (78)
                      +.||.|.|+|.+||+.+.
T Consensus        41 ~~pG~Y~vdv~vN~~~~~   58 (881)
T PRK15273         41 ILPGVYLLQIKINGQTLP   58 (881)
T ss_pred             cCCeEEEEEEEECCEEcc
Confidence            579999999999999996


No 49 
>PF05949 DUF881:  Bacterial protein of unknown function (DUF881);  InterPro: IPR010273 This family consists of a series of hypothetical bacterial proteins. One of the family members Q45543 from SWISSPROT from Bacillus subtilis is thought to be involved in cell division and sporulation [].; PDB: 3GMG_B.
Probab=27.15  E-value=83  Score=20.23  Aligned_cols=18  Identities=11%  Similarity=0.346  Sum_probs=14.3

Q ss_pred             EEEECCEecCCCCeEEEec
Q psy5364          11 RTEGEGVDVPGSPFTVRVS   29 (78)
Q Consensus        11 ~V~~~g~~V~gSPf~v~V~   29 (78)
                      .++.||+++. +||.+++.
T Consensus        80 ~i~Vng~~i~-~Py~I~AI   97 (149)
T PF05949_consen   80 TILVNGRPIS-PPYVIKAI   97 (149)
T ss_dssp             EEEETTEEE--SSEEEEEE
T ss_pred             EEEECCEEcc-CCEEEEEE
Confidence            4678999996 89999876


No 50 
>TIGR03773 anch_rpt_wall putative ABC transporter-associated repeat protein. Members of this protein family occur in genomes that contain a three-gene ABC transporter operon associated with the presence of domain TIGR03769. That domain occurs as a single-copy insert in the substrate-binding protein, and occurs in two or more copies in members of this protein family. Members of this family typically are encoded adjacent to the said transporter operon and may serve as a substrate receptor.
Probab=26.61  E-value=1.1e+02  Score=23.97  Aligned_cols=32  Identities=16%  Similarity=0.224  Sum_probs=19.9

Q ss_pred             ccceeeEEEEEEC-----CEecC-CCCeEEEecCCCCC
Q psy5364           3 LRRRRINLRTEGE-----GVDVP-GSPFTVRVSGAPDA   34 (78)
Q Consensus         3 ~~~G~~~i~V~~~-----g~~V~-gSPf~v~V~~~~d~   34 (78)
                      .+||.|+|.+++.     |..+. -.-+++.|....||
T Consensus       408 TkPG~Y~l~~~~sa~~~~G~~vS~~~tltfaVG~~~~~  445 (513)
T TIGR03773       408 SKPGTYKLQVTQSATTTNGKKASDTTTLTFAVGDNNDP  445 (513)
T ss_pred             CCCcEEEEEEEEEEEeCCCceecccEEEEEEeCCCCCc
Confidence            5899999999974     44443 23345566554444


No 51 
>COG4856 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=25.99  E-value=1.4e+02  Score=22.77  Aligned_cols=43  Identities=19%  Similarity=0.268  Sum_probs=24.9

Q ss_pred             eEEEeCCCcccceeceeceEEEEEccCCCccceEEEEECCCCC
Q psy5364          36 KVRVYGPGIEHGVLATFQSRFICDTRGAGAGQLTVRVRGPKGF   78 (78)
Q Consensus        36 kv~v~GpGL~~g~~~~~~~~F~Id~~~aG~g~l~v~v~gP~g~   78 (78)
                      -+.++|.+-..=..+...+.|.+..-..|.+...+.|+||++.
T Consensus       345 ~~ta~g~~~~~v~~~di~~~~d~~~L~~g~~~v~v~v~gp~~l  387 (403)
T COG4856         345 DVTAKGYSTNNVKDGDIKAYVDVSNLAEGNHSVSVEVTGPSGL  387 (403)
T ss_pred             eeeeccCcccceeecceeEEEehhhcccccceeEEEEECCccc
Confidence            3556665542211122233344444457889999999999973


No 52 
>PF13600 DUF4140:  N-terminal domain of unknown function (DUF4140)
Probab=25.90  E-value=89  Score=18.37  Aligned_cols=13  Identities=38%  Similarity=0.644  Sum_probs=10.1

Q ss_pred             CCCCCeEEEeCCC
Q psy5364          31 APDASKVRVYGPG   43 (78)
Q Consensus        31 ~~d~~kv~v~GpG   43 (78)
                      ..|+.-+++.|+|
T Consensus        34 ~~d~~Sl~V~~~g   46 (104)
T PF13600_consen   34 SLDPDSLRVSGEG   46 (104)
T ss_pred             ccCCCcEEEEecC
Confidence            3678888888877


No 53 
>PRK15298 fimbrial outer membrane usher protein StiC; Provisional
Probab=25.36  E-value=77  Score=26.06  Aligned_cols=20  Identities=0%  Similarity=-0.102  Sum_probs=17.3

Q ss_pred             ccceeeEEEEEECCEecCCC
Q psy5364           3 LRRRRINLRTEGEGVDVPGS   22 (78)
Q Consensus         3 ~~~G~~~i~V~~~g~~V~gS   22 (78)
                      +.||.|.|+|.+||+.+...
T Consensus        61 ~~pG~Y~vdi~vN~~~~~~~   80 (848)
T PRK15298         61 QLPGTYYVTININKTEIGAR   80 (848)
T ss_pred             cCCeEEEEEEEECCeEcccE
Confidence            46999999999999998653


No 54 
>PRK15255 fimbrial outer membrane usher protein StdB; Provisional
Probab=25.07  E-value=78  Score=25.93  Aligned_cols=19  Identities=11%  Similarity=-0.155  Sum_probs=16.8

Q ss_pred             ccceeeEEEEEECCEecCC
Q psy5364           3 LRRRRINLRTEGEGVDVPG   21 (78)
Q Consensus         3 ~~~G~~~i~V~~~g~~V~g   21 (78)
                      +.||.|.|+|.+||+.+..
T Consensus        48 ~~pG~Y~vdi~vN~~~~~~   66 (829)
T PRK15255         48 VAPGNYILDIWLNDQPVRE   66 (829)
T ss_pred             cCCeEEEEEEEECCEEccc
Confidence            4699999999999999954


No 55 
>PRK15314 outer membrane protein RatB; Provisional
Probab=24.99  E-value=4.2e+02  Score=24.97  Aligned_cols=60  Identities=28%  Similarity=0.413  Sum_probs=34.5

Q ss_pred             ECCEecCCCCeEEEecCC----------CCCC--eEEEeCCCcccceeceeceEEEEEccCCCccceEEEEECCCCC
Q psy5364          14 GEGVDVPGSPFTVRVSGA----------PDAS--KVRVYGPGIEHGVLATFQSRFICDTRGAGAGQLTVRVRGPKGF   78 (78)
Q Consensus        14 ~~g~~V~gSPf~v~V~~~----------~d~~--kv~v~GpGL~~g~~~~~~~~F~Id~~~aG~g~l~v~v~gP~g~   78 (78)
                      .+|.+|++-+|+++-...          .||+  .+.+-  |+--+  .++++...=...+| .|.+.|.|+.|+|.
T Consensus       767 ~ng~pvp~~~ftvt~~~g~nRqg~~~G~~D~~~g~v~i~--G~~y~--t~q~s~~Y~G~Tda-~G~~tv~ITqP~Gv  838 (2435)
T PRK15314        767 LNGLSVPNASFTVTMSHGKNRDNATTGFTDPSDGTLVMG--GTPFG--SSQASMTYQGMTDA-EGNATLVIEQPQGV  838 (2435)
T ss_pred             cCCCccCCcceEEEeccccccCCcccCCCCCCcceEEEC--ccccc--cCccceeeeeeecC-CCcEEEEEeCCCCc
Confidence            358999999999976532          1221  11221  11111  13333333344555 78999999999884


No 56 
>PF11797 DUF3324:  Protein of unknown function C-terminal (DUF3324);  InterPro: IPR021759  This family consists of several hypothetical bacterial proteins of unknown function. 
Probab=24.57  E-value=65  Score=20.33  Aligned_cols=14  Identities=14%  Similarity=0.361  Sum_probs=10.9

Q ss_pred             CccceeeEEEEEEC
Q psy5364           2 SLRRRRINLRTEGE   15 (78)
Q Consensus         2 ~~~~G~~~i~V~~~   15 (78)
                      .++||.|++.+...
T Consensus       102 ~lk~G~Y~l~~~~~  115 (140)
T PF11797_consen  102 KLKPGKYTLKITAK  115 (140)
T ss_pred             CccCCEEEEEEEEE
Confidence            46899999987654


No 57 
>PRK15193 outer membrane usher protein; Provisional
Probab=24.52  E-value=81  Score=26.08  Aligned_cols=22  Identities=5%  Similarity=-0.127  Sum_probs=18.0

Q ss_pred             ccceeeEEEEEECCEecCCCCe
Q psy5364           3 LRRRRINLRTEGEGVDVPGSPF   24 (78)
Q Consensus         3 ~~~G~~~i~V~~~g~~V~gSPf   24 (78)
                      +.||.|.|+|.+||+.+.-.-.
T Consensus        63 ~~pG~Y~vdi~vN~~~~~~~~i   84 (876)
T PRK15193         63 IAAGVYRVDIVLNNTFVATRDV   84 (876)
T ss_pred             cCCeEEEEEEEECCeEcccEEE
Confidence            4699999999999999865433


No 58 
>PF13620 CarboxypepD_reg:  Carboxypeptidase regulatory-like domain; PDB: 3MN8_D 3P0D_I 3KCP_A 2B59_B 1UWY_A 1H8L_A 1QMU_A 2NSM_A.
Probab=24.38  E-value=95  Score=16.85  Aligned_cols=14  Identities=29%  Similarity=0.389  Sum_probs=7.9

Q ss_pred             ccceeeEEEEEECC
Q psy5364           3 LRRRRINLRTEGEG   16 (78)
Q Consensus         3 ~~~G~~~i~V~~~g   16 (78)
                      +.+|.|.|.+...|
T Consensus        45 l~~g~Y~l~v~~~g   58 (82)
T PF13620_consen   45 LPPGTYTLRVSAPG   58 (82)
T ss_dssp             E-SEEEEEEEEBTT
T ss_pred             cCCEeEEEEEEECC
Confidence            34566666666555


No 59 
>PF08695 Coa1:  Cytochrome oxidase complex assembly protein 1;  InterPro: IPR014807 Coa1 is an inner mitochondrial membrane protein that associates with Shy1 and is required for cytochrome oxidase complex IV assembly. It contains a conserved hydrophobic segment (amino acids 74-92) with the potential to form a membrane-spanning helix. The N terminus of Coa1 is rich in positively charged amino acids and could form an amphipathic alpha helix, characteristic of a mitochondrial presequence. A cleavage site for the mitochondrial processing peptidase is predicted adjacent to the presequence. Upon in vitro import into mitochondria, Coa1 is processed to a mature form, indicating that it possesses a cleavable presequence []. The eukaryotic cytochrome oxidase complex consists of 12-13 subunits, with three mitochondrial encoded subunits, Cox1-Cox3, forming the core enzyme. Translation of the Cox1 transcript requires the two promoters, Pet309 and Mss51, and the latter has an additional role in translational elongation. Coa1 is necessary for linking the activity of Mss51 to Cox1 insertion into the assembly complex [].
Probab=24.19  E-value=88  Score=18.78  Aligned_cols=15  Identities=47%  Similarity=0.928  Sum_probs=8.5

Q ss_pred             CCccceEEEEECCCC
Q psy5364          63 AGAGQLTVRVRGPKG   77 (78)
Q Consensus        63 aG~g~l~v~v~gP~g   77 (78)
                      .|.-+|++.|+||++
T Consensus        66 ~g~a~~~~pV~G~k~   80 (116)
T PF08695_consen   66 KGRADLSFPVKGPKG   80 (116)
T ss_pred             CcEEEEEEEEEcCCC
Confidence            445556666666654


No 60 
>PRK15198 outer membrane usher protein FimD; Provisional
Probab=24.03  E-value=83  Score=25.91  Aligned_cols=24  Identities=0%  Similarity=-0.216  Sum_probs=18.9

Q ss_pred             ccceeeEEEEEECCEecCCCCeEE
Q psy5364           3 LRRRRINLRTEGEGVDVPGSPFTV   26 (78)
Q Consensus         3 ~~~G~~~i~V~~~g~~V~gSPf~v   26 (78)
                      +.||.|.|+|.+||+.+.-.-..+
T Consensus        54 ~~pG~Y~vdi~vN~~~~~~~~i~f   77 (860)
T PRK15198         54 QPPGVYRVDIWRNDEFIGTQDIRF   77 (860)
T ss_pred             cCCeEEEEEEEECCeEcccEEEEE
Confidence            469999999999999886543333


No 61 
>PRK15207 long polar fimbrial outer membrane usher protein LpfC; Provisional
Probab=23.45  E-value=88  Score=25.71  Aligned_cols=24  Identities=4%  Similarity=-0.056  Sum_probs=19.0

Q ss_pred             ccceeeEEEEEECCEecCCCCeEEEe
Q psy5364           3 LRRRRINLRTEGEGVDVPGSPFTVRV   28 (78)
Q Consensus         3 ~~~G~~~i~V~~~g~~V~gSPf~v~V   28 (78)
                      ..||.|.|+|.+||+.+...  .|.+
T Consensus        62 ~~pG~Y~vdi~vN~~~~~~~--~i~f   85 (842)
T PRK15207         62 QLPGKYRVSVFVNEEKMESR--TLEF   85 (842)
T ss_pred             cCCeEEEEEEEECCeEcccE--EEEE
Confidence            46999999999999998543  4444


No 62 
>PF14900 DUF4493:  Domain of unknown function (DUF4493)
Probab=23.16  E-value=62  Score=21.96  Aligned_cols=24  Identities=8%  Similarity=0.097  Sum_probs=16.8

Q ss_pred             CccceeeEEEEEECCEecC--CCCeE
Q psy5364           2 SLRRRRINLRTEGEGVDVP--GSPFT   25 (78)
Q Consensus         2 ~~~~G~~~i~V~~~g~~V~--gSPf~   25 (78)
                      .+..|.|+|.+.+++....  ..||-
T Consensus        61 ~L~~G~Ytv~A~~g~~~~~~~d~pyy   86 (235)
T PF14900_consen   61 ELPVGSYTVKASYGDNVAAGFDKPYY   86 (235)
T ss_pred             eecCCcEEEEEEcCCCccccccCcee
Confidence            5788999999997654443  25763


No 63 
>PRK15248 fimbrial outer membrane usher protein StbC; Provisional
Probab=22.85  E-value=1.2e+02  Score=24.88  Aligned_cols=24  Identities=13%  Similarity=-0.071  Sum_probs=19.6

Q ss_pred             ccceeeEEEEEECCEecCCC-CeEE
Q psy5364           3 LRRRRINLRTEGEGVDVPGS-PFTV   26 (78)
Q Consensus         3 ~~~G~~~i~V~~~g~~V~gS-Pf~v   26 (78)
                      +.||.|.|+|.+||+.+... -.++
T Consensus        53 ~~pG~Y~vdi~vN~~~~~~~~~i~f   77 (853)
T PRK15248         53 TPSGDYLADVYVNGTLVTSSTNIRF   77 (853)
T ss_pred             cCCeEEEEEEEECCEEcCceeEEEE
Confidence            46999999999999998764 5554


No 64 
>KOG4680|consensus
Probab=21.98  E-value=79  Score=20.89  Aligned_cols=23  Identities=4%  Similarity=0.167  Sum_probs=19.1

Q ss_pred             cceeeEEEEEECCEecCCCCeEE
Q psy5364           4 RRRRINLRTEGEGVDVPGSPFTV   26 (78)
Q Consensus         4 ~~G~~~i~V~~~g~~V~gSPf~v   26 (78)
                      ..|.|-|+|+|.+.+|..-=|.+
T Consensus        69 s~Gk~VIeV~y~gi~ihsethDL   91 (153)
T KOG4680|consen   69 SEGKYVIEVSYGGIRIHSETHDL   91 (153)
T ss_pred             eCCeEEEEEEEeeEEEeeccccc
Confidence            57999999999999998766654


No 65 
>PRK15284 putative fimbrial outer membrane usher protein StfC; Provisional
Probab=21.82  E-value=99  Score=25.59  Aligned_cols=18  Identities=11%  Similarity=0.161  Sum_probs=16.3

Q ss_pred             ccceeeEEEEEECCEecC
Q psy5364           3 LRRRRINLRTEGEGVDVP   20 (78)
Q Consensus         3 ~~~G~~~i~V~~~g~~V~   20 (78)
                      ..||.|.|+|.+|++.+.
T Consensus        59 ~~pG~Y~v~v~vN~~~~~   76 (881)
T PRK15284         59 VEPGKYNLQVQLNKQPLA   76 (881)
T ss_pred             cCCeEEEEEEEECCEEcc
Confidence            469999999999999986


No 66 
>PF03422 CBM_6:  Carbohydrate binding module (family 6);  InterPro: IPR005084 A carbohydrate-binding module (CBM) is defined as a contiguous amino acid sequence within a carbohydrate-active enzyme with a discreet fold having carbohydrate-binding activity. A few exceptions are CBMs in cellulosomal scaffolding proteins and rare instances of independent putative CBMs. The requirement of CBMs existing as modules within larger enzymes sets this class of carbohydrate-binding protein apart from other non-catalytic sugar binding proteins such as lectins and sugar transport proteins. CBMs were previously classified as cellulose-binding domains (CBDs) based on the initial discovery of several modules that bound cellulose [, ]. However, additional modules in carbohydrate-active enzymes are continually being found that bind carbohydrates other than cellulose yet otherwise meet the CBM criteria, hence the need to reclassify these polypeptides using more inclusive terminology. Previous classification of cellulose-binding domains were based on amino acid similarity. Groupings of CBDs were called "Types" and numbered with roman numerals (e.g. Type I or Type II CBDs). In keeping with the glycoside hydrolase classification, these groupings are now called families and numbered with Arabic numerals. Families 1 to 13 are the same as Types I to XIII. For a detailed review on the structure and binding modes of CBMs see [].  This entry represents CBM6 from CAZY which was previously known as cellulose-binding domain family VI (CBD VI). CBM6 bind to amorphous cellulose, xylan, mixed beta-(1,3)(1,4)glucan and beta-1,3-glucan[, , ]. CBM6 adopts a classic lectin-like beta-jelly roll fold, predominantly consisting of five antiparallel beta-strands on one face and four antiparallel beta-strands on the other face. It contains two potential ligand binding sites, named respectively cleft A and B. These clefts include aromatic residues which are probably involved in the substrate binding. The cleft B is located on the concave surface of one beta-sheet, and the cleft A on one edge of the protein between the loop that connects the inner and outer beta-sheets of the jellyroll fold []. The multiple binding clefts confer the extensive range of specificities displayed by the domain [, , ].; GO: 0030246 carbohydrate binding; PDB: 1UY1_A 1UY3_A 1UY4_A 1UY2_A 1UYY_A 1UXZ_B 1UYZ_A 1UY0_B 1UYX_A 1UZ0_A ....
Probab=21.77  E-value=67  Score=19.07  Aligned_cols=15  Identities=13%  Similarity=0.062  Sum_probs=12.0

Q ss_pred             CccceeeEEEEEECC
Q psy5364           2 SLRRRRINLRTEGEG   16 (78)
Q Consensus         2 ~~~~G~~~i~V~~~g   16 (78)
                      .+..|.|+|.+.+.+
T Consensus        96 ~l~~G~h~i~l~~~~  110 (125)
T PF03422_consen   96 KLPAGKHTIYLVFNG  110 (125)
T ss_dssp             EEESEEEEEEEEESS
T ss_pred             eeCCCeeEEEEEEEC
Confidence            356799999999876


No 67 
>COG1470 Predicted membrane protein [Function unknown]
Probab=21.75  E-value=4.1e+02  Score=21.02  Aligned_cols=13  Identities=8%  Similarity=0.074  Sum_probs=8.7

Q ss_pred             cceeeEEEEEECC
Q psy5364           4 RRRRINLRTEGEG   16 (78)
Q Consensus         4 ~~G~~~i~V~~~g   16 (78)
                      +||.|.+.|.--+
T Consensus       348 ~pG~Ynv~I~A~s  360 (513)
T COG1470         348 TPGTYNVTITASS  360 (513)
T ss_pred             CCCceeEEEEEec
Confidence            6777777776543


No 68 
>TIGR02186 alph_Pro_TM conserved hypothetical protein. This family consists of predicted transmembrane proteins of about 270 amino acids. Members are found, so far, only among the Alphaproteobacteria and only once in each genome.
Probab=21.73  E-value=3.1e+02  Score=19.64  Aligned_cols=46  Identities=15%  Similarity=0.359  Sum_probs=26.8

Q ss_pred             CCeEEEecCCCCCCeEEEeCCCcccceeceeceEEEEEccCCCccceEEEEECCCC
Q psy5364          22 SPFTVRVSGAPDASKVRVYGPGIEHGVLATFQSRFICDTRGAGAGQLTVRVRGPKG   77 (78)
Q Consensus        22 SPf~v~V~~~~d~~kv~v~GpGL~~g~~~~~~~~F~Id~~~aG~g~l~v~v~gP~g   77 (78)
                      |+-+|.|....+-+++.++|-     + .+ +..   ...+.|.-++-|.|+||..
T Consensus        35 S~~~V~Its~F~G~elliFGA-----v-~~-~~~---~~~~~~~~dVVV~v~GP~~   80 (261)
T TIGR02186        35 STEEIAITSNFRGADLTLFGA-----L-DN-DDP---TLLAQGAYDIVVTLEGPRD   80 (261)
T ss_pred             cCceEEEecCccCcEEEEEEe-----e-cC-Ccc---CCCCCCCccEEEEEeCCCC
Confidence            566667766677777777771     1 11 110   0122346678888888864


No 69 
>PRK09828 putative fimbrial outer membrane usher protein; Provisional
Probab=21.38  E-value=1e+02  Score=25.35  Aligned_cols=20  Identities=0%  Similarity=-0.087  Sum_probs=16.9

Q ss_pred             CccceeeEEEEEECCEecCC
Q psy5364           2 SLRRRRINLRTEGEGVDVPG   21 (78)
Q Consensus         2 ~~~~G~~~i~V~~~g~~V~g   21 (78)
                      +..||.|.++|..||+.+..
T Consensus        59 ~~~pG~y~v~v~vNg~~~~~   78 (865)
T PRK09828         59 PTLPGTYDVSVYVNDQPISS   78 (865)
T ss_pred             ccCCeEEEEEEEECCeEccc
Confidence            35799999999999998754


No 70 
>PRK15304 putative fimbrial outer membrane usher protein; Provisional
Probab=21.35  E-value=1.2e+02  Score=24.75  Aligned_cols=19  Identities=5%  Similarity=-0.062  Sum_probs=16.5

Q ss_pred             ccceeeEEEEEECCEecCC
Q psy5364           3 LRRRRINLRTEGEGVDVPG   21 (78)
Q Consensus         3 ~~~G~~~i~V~~~g~~V~g   21 (78)
                      +.||.|.++|..||+.+.-
T Consensus        57 ~~pG~Y~v~v~vNg~~~~~   75 (801)
T PRK15304         57 FLPGSHSVMVKINGKKRGS   75 (801)
T ss_pred             cCCeEeEEEEEECCeEccc
Confidence            5799999999999998753


No 71 
>PRK15294 putative fimbrial outer membrane usher protein SthC; Provisional
Probab=21.29  E-value=1e+02  Score=25.30  Aligned_cols=19  Identities=11%  Similarity=-0.043  Sum_probs=16.2

Q ss_pred             ccceeeEEEEEECCEecCC
Q psy5364           3 LRRRRINLRTEGEGVDVPG   21 (78)
Q Consensus         3 ~~~G~~~i~V~~~g~~V~g   21 (78)
                      +.||.|.|+|.+||+.+..
T Consensus        57 ~~pG~y~vdi~vNg~~~~~   75 (845)
T PRK15294         57 QAEGTYNVDIYINNTSVET   75 (845)
T ss_pred             CCCeEEEEEEEECCeEccc
Confidence            3599999999999998753


No 72 
>PF00801 PKD:  PKD domain;  InterPro: IPR000601 The PKD (Polycystic Kidney Disease) domain was first identified in the Polycystic Kidney Disease protein, polycystin-1 (PDK1 gene), and contains an Ig-like fold consisting of a beta-sandwich of seven strands in two sheets with a Greek key topology, although some members have additional strands []. Polycystin-1 is a large cell-surface glycoprotein involved in adhesive protein-protein and protein-carbohydrate interactions; however it is not clear if the PKD domain mediates any of these interactions.  PKD domains are also found in other proteins, usually in the extracellular parts of proteins involved in interactions with other proteins. For example, domains with a PKD-type fold are found in archaeal surface layer proteins that protect the cell from extreme environments [], and in the human VPS10 domain-containing receptor SorCS2 [].; PDB: 1B4R_A 2KZW_A 2C4X_A 2C26_A 2Y72_B 3JQU_A 3JS7_B 1WGO_A 1L0Q_A.
Probab=21.28  E-value=1e+02  Score=16.50  Aligned_cols=14  Identities=0%  Similarity=-0.173  Sum_probs=10.6

Q ss_pred             ccceeeEEEEEECC
Q psy5364           3 LRRRRINLRTEGEG   16 (78)
Q Consensus         3 ~~~G~~~i~V~~~g   16 (78)
                      ..+|.|.|.|...+
T Consensus        49 ~~~G~y~V~ltv~n   62 (69)
T PF00801_consen   49 SSPGTYTVTLTVTN   62 (69)
T ss_dssp             SSSEEEEEEEEEEE
T ss_pred             CCCeEEEEEEEEEE
Confidence            46899999887643


No 73 
>PRK15316 RatA-like protein; Provisional
Probab=21.17  E-value=2.5e+02  Score=26.37  Aligned_cols=62  Identities=23%  Similarity=0.244  Sum_probs=37.3

Q ss_pred             ECCEecCCCCeEEEecC----------CCCCCeEEEeCCCcccceeceeceEEEEEccCCCccceEEEEECCCCC
Q psy5364          14 GEGVDVPGSPFTVRVSG----------APDASKVRVYGPGIEHGVLATFQSRFICDTRGAGAGQLTVRVRGPKGF   78 (78)
Q Consensus        14 ~~g~~V~gSPf~v~V~~----------~~d~~kv~v~GpGL~~g~~~~~~~~F~Id~~~aG~g~l~v~v~gP~g~   78 (78)
                      +||.+|++-=|+|+...          ..||+...+.=.|..-+  ..|+..-.=+..+| .|.++|.|+.|.|+
T Consensus       753 ln~~pV~~~aftVt~~~g~nR~g~~tgf~dps~g~i~~nG~~y~--~~q~s~~Y~G~TDa-~G~atv~itqP~Gv  824 (2683)
T PRK15316        753 LNNAPVPYTAFTITKDMGKNRQGQTTGFDDPTRGAIEMNGTLYG--TSQPSLVYAGTTDA-QGFATVEIKQSQGV  824 (2683)
T ss_pred             cCCCcccceeEEEEeccccccCCcccCCCCCCCceEEECCEecc--cCCCcceeeeeecC-CCeEEEEEeCCCCc
Confidence            46889999999887652          24555544433232222  12333333344454 78899999999985


No 74 
>PF14734 DUF4469:  Domain of unknown function (DUF4469) with IG-like fold
Probab=20.68  E-value=93  Score=18.98  Aligned_cols=12  Identities=17%  Similarity=0.160  Sum_probs=10.1

Q ss_pred             ccceeeEEEEEE
Q psy5364           3 LRRRRINLRTEG   14 (78)
Q Consensus         3 ~~~G~~~i~V~~   14 (78)
                      +.+|.|+|.|+-
T Consensus        74 L~~G~Y~l~V~T   85 (102)
T PF14734_consen   74 LAAGEYTLEVRT   85 (102)
T ss_pred             cCceEEEEEEEE
Confidence            689999999964


No 75 
>COG4136 ABC-type uncharacterized transport system, ATPase component [General function prediction only]
Probab=20.39  E-value=68  Score=21.98  Aligned_cols=19  Identities=32%  Similarity=0.623  Sum_probs=13.6

Q ss_pred             eEEEEEccCCCccceEEEEECCCCC
Q psy5364          54 SRFICDTRGAGAGQLTVRVRGPKGF   78 (78)
Q Consensus        54 ~~F~Id~~~aG~g~l~v~v~gP~g~   78 (78)
                      .+|+|+-     |+ -|+++|||||
T Consensus        21 ~n~Tia~-----Ge-ivtlMGPSGc   39 (213)
T COG4136          21 VNFTIAK-----GE-IVTLMGPSGC   39 (213)
T ss_pred             eeEEecC-----Cc-EEEEECCCCc
Confidence            4677753     34 3889999997


No 76 
>TIGR03292 PhnH_redo phosphonate C-P lyase system protein PhnH. PhnH is a component of the C-P lyase system (GenProp0232) for the catabolism of phosphonate compounds. The specific function of this component is unknown. This model is based on Pfam model pfam05845.2, and has been broadened to include sequences missed by that model which are clearly true positive hits based on genome context.
Probab=20.33  E-value=1.1e+02  Score=20.47  Aligned_cols=15  Identities=27%  Similarity=0.603  Sum_probs=11.7

Q ss_pred             CCCCeEEEeCCCccc
Q psy5364          32 PDASKVRVYGPGIEH   46 (78)
Q Consensus        32 ~d~~kv~v~GpGL~~   46 (78)
                      .+...++.+||||+.
T Consensus       114 ~~g~~l~L~GPGI~~  128 (174)
T TIGR03292       114 ASGPPLTLTGPGIAT  128 (174)
T ss_pred             CCCCceEEECCCcCC
Confidence            355679999999974


No 77 
>PF02018 CBM_4_9:  Carbohydrate binding domain;  InterPro: IPR003305 The 1,4-beta-glucanase CenC from Cellulomonas fimi contains two cellulose-binding domains, CBD(N1) and CBD(N2), arranged in tandem at its N terminus. These homologous CBDs are distinct in their selectivity for binding amorphous and not crystalline cellulose []. Multidimensional heteronuclear nuclear magnetic resonance (NMR) spectroscopy was used to determine the tertiary structure of the 152 amino acid N-terminal cellulose-binding domain from C. fimi 1,4-beta-glucanase CenC (CBDN1) []. The tertiary structure of CBDN1 is strikingly similar to that of the bacterial 1,3-1,4-beta-glucanases, as well as other sugar-binding proteins with jelly-roll folds.; GO: 0016798 hydrolase activity, acting on glycosyl bonds; PDB: 3OEA_B 2ZEX_B 3OEB_A 2ZEY_A 2ZEW_A 1GUI_A 2W5F_A 2WZE_A 2WYS_A 2ZEZ_B ....
Probab=20.17  E-value=1.9e+02  Score=16.64  Aligned_cols=26  Identities=8%  Similarity=0.029  Sum_probs=13.2

Q ss_pred             cccee-eEEEEEECCEecCCCCeEEEecC
Q psy5364           3 LRRRR-INLRTEGEGVDVPGSPFTVRVSG   30 (78)
Q Consensus         3 ~~~G~-~~i~V~~~g~~V~gSPf~v~V~~   30 (78)
                      +++|. |++++.......  .++.+.+..
T Consensus        58 l~~G~~Y~~s~~vk~~~~--~~~~~~~~~   84 (131)
T PF02018_consen   58 LKPGKTYTVSFWVKADSG--GTVSVSLRD   84 (131)
T ss_dssp             E-TTSEEEEEEEEEESSS--EEEEEEEEE
T ss_pred             ecCCCEEEEEEEEEeCCC--CEEEEEEEE
Confidence            34444 666666553222  556665554


Done!