Query         028956
Match_columns 201
No_of_seqs    148 out of 594
Neff          5.7 
Searched_HMMs 46136
Date          Fri Mar 29 05:11:23 2013
Command       hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/028956.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/028956hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG0439 VAMP-associated protei 100.0 1.2E-30 2.6E-35  219.7  14.7  184   10-200     8-218 (218)
  2 COG5066 SCS2 VAMP-associated p 100.0 2.1E-28 4.5E-33  206.3  12.0  118   10-135     2-121 (242)
  3 PF00635 Motile_Sperm:  MSP (Ma  99.9 2.2E-23 4.7E-28  156.7  12.7  104   10-121     1-108 (109)
  4 PF14874 PapD-like:  Flagellar-  98.5 1.3E-06 2.8E-11   64.9  10.3   66   10-79      3-71  (102)
  5 PF00345 PapD_N:  Pili and flag  96.7    0.05 1.1E-06   41.6  11.6   63   10-79      1-72  (122)
  6 PF14646 MYCBPAP:  MYCBP-associ  94.3     1.2 2.6E-05   41.6  13.2  112   20-152   237-361 (426)
  7 PF11614 FixG_C:  IG-like fold   92.6    0.72 1.6E-05   35.1   7.4   71    7-80      9-83  (118)
  8 PRK09918 putative fimbrial cha  92.2     6.8 0.00015   33.7  13.8  108   10-138    25-137 (230)
  9 PF07610 DUF1573:  Protein of u  92.0    0.71 1.5E-05   29.8   5.7   42   36-78      2-44  (45)
 10 PRK11385 putativi pili assembl  88.7      14 0.00031   32.1  12.6  107   10-137    27-147 (236)
 11 PRK15249 fimbrial chaperone pr  86.8       8 0.00017   33.8  10.1   63   10-79     29-102 (253)
 12 PRK09926 putative chaperone pr  86.5     8.8 0.00019   33.3  10.1   63   10-79     26-98  (246)
 13 PF06280 DUF1034:  Fn3-like dom  85.9     2.1 4.6E-05   32.2   5.3   51   29-79      7-78  (112)
 14 smart00809 Alpha_adaptinC2 Ada  84.0     5.7 0.00012   29.0   6.8   50   30-79     18-71  (104)
 15 PRK15211 fimbrial chaperone pr  83.0      18 0.00039   31.3  10.4   63   10-79     23-91  (229)
 16 PF03173 CHB_HEX:  Putative car  82.4     2.5 5.5E-05   34.9   4.7   32   48-79     69-102 (164)
 17 PF05506 DUF756:  Domain of unk  82.2     6.6 0.00014   28.5   6.4   45   32-79     20-65  (89)
 18 PRK15295 fimbrial assembly cha  81.9      20 0.00044   30.8  10.3   61   10-79     20-89  (226)
 19 PRK15299 fimbrial chaperone pr  81.7      32  0.0007   29.4  12.9  110   10-137    23-140 (227)
 20 PRK15246 fimbrial assembly cha  81.4      35 0.00075   29.5  13.7  111   10-138    11-134 (233)
 21 PF02883 Alpha_adaptinC2:  Adap  78.3     7.3 0.00016   29.1   5.7   51   29-79     23-77  (115)
 22 PF00927 Transglut_C:  Transglu  76.6      10 0.00022   28.1   6.0   52   28-79     13-74  (107)
 23 PF06030 DUF916:  Bacterial pro  76.4      20 0.00044   27.9   7.9   58   22-79     19-101 (121)
 24 PF10633 NPCBM_assoc:  NPCBM-as  76.2     4.6  0.0001   28.4   3.9   52   28-79      3-58  (78)
 25 PRK15208 long polar fimbrial c  74.3      44 0.00096   28.6  10.2   63   10-79     22-90  (228)
 26 PRK15290 lfpB fimbrial chapero  73.9      46 0.00099   29.1  10.2  111   10-137    38-156 (243)
 27 TIGR03079 CH4_NH3mon_ox_B meth  73.9     8.9 0.00019   35.8   6.0   52   28-79    280-352 (399)
 28 PRK15249 fimbrial chaperone pr  73.6     5.6 0.00012   34.8   4.5   74    2-77    145-219 (253)
 29 PRK15195 fimbrial chaperone pr  72.7      44 0.00096   28.8   9.8   61   10-79     26-94  (229)
 30 PRK15192 fimbrial chaperone Bc  71.6      68  0.0015   27.9  13.8   84    9-108    22-119 (234)
 31 PRK15254 fimbrial chaperone pr  70.7      65  0.0014   28.0  10.4   87   10-108    17-110 (239)
 32 PRK15192 fimbrial chaperone Bc  70.0     9.2  0.0002   33.2   5.0   67    2-77    135-202 (234)
 33 COG3121 FimC P pilus assembly   69.4     9.7 0.00021   32.9   5.0   69    2-77    138-208 (235)
 34 PRK15188 fimbrial chaperone pr  68.1      80  0.0017   27.3  12.9  111   10-138    28-145 (228)
 35 PRK09926 putative chaperone pr  67.8      11 0.00024   32.7   5.1   73    2-77    142-216 (246)
 36 PF11611 DUF4352:  Domain of un  67.4      25 0.00054   26.0   6.4   52   28-79     34-99  (123)
 37 PRK15246 fimbrial assembly cha  67.2      11 0.00023   32.8   4.8   66    3-77    127-192 (233)
 38 PRK15290 lfpB fimbrial chapero  65.8      11 0.00025   32.8   4.7   66    2-77    149-215 (243)
 39 COG3121 FimC P pilus assembly   65.5      89  0.0019   26.9  13.7  112   10-141    28-149 (235)
 40 PF12690 BsuPI:  Intracellular   65.1      32  0.0007   24.9   6.3   47   33-79      3-68  (82)
 41 TIGR02745 ccoG_rdxA_fixG cytoc  64.6      75  0.0016   30.2  10.2   72    6-79    323-397 (434)
 42 PRK15218 fimbrial chaperone pr  64.4      66  0.0014   27.7   9.1  101   21-137    26-139 (226)
 43 PRK15233 putative fimbrial cha  63.7      96  0.0021   27.2  10.1   69   32-109    56-131 (246)
 44 PRK15295 fimbrial assembly cha  62.2      15 0.00032   31.6   4.7   67    3-78    130-198 (226)
 45 PRK15224 pili assembly chapero  61.6      69  0.0015   27.9   8.8   76   21-108    36-118 (237)
 46 PRK15188 fimbrial chaperone pr  61.6      15 0.00032   31.8   4.6   65    2-75    137-201 (228)
 47 PRK11385 putativi pili assembl  60.7      17 0.00037   31.6   4.9   67    2-77    140-207 (236)
 48 PF02753 PapD_C:  Pili assembly  60.2      10 0.00022   25.8   2.8   44   36-79      1-45  (68)
 49 PRK15285 putative fimbrial cha  59.6 1.2E+02  0.0027   26.5  10.5   56   21-79     33-95  (250)
 50 PRK15208 long polar fimbrial c  59.4      18 0.00039   31.1   4.7   67    2-77    131-198 (228)
 51 PRK15299 fimbrial chaperone pr  58.3      19 0.00042   30.8   4.8   66    3-77    134-200 (227)
 52 PF04744 Monooxygenase_B:  Mono  57.0      47   0.001   31.1   7.2   58   21-80    255-334 (381)
 53 PRK15211 fimbrial chaperone pr  56.1      22 0.00048   30.7   4.8   69    2-76    131-204 (229)
 54 PRK15253 putative fimbrial ass  55.1 1.1E+02  0.0024   26.6   9.0   76   21-108    41-128 (242)
 55 PF00553 CBM_2:  Cellulose bind  54.9      30 0.00064   25.7   4.7   50   30-79     13-82  (101)
 56 PRK09918 putative fimbrial cha  54.7      29 0.00063   29.7   5.3   69    2-78    129-200 (230)
 57 PF05753 TRAP_beta:  Translocon  53.1      69  0.0015   26.7   7.1   51   28-79     36-95  (181)
 58 PRK15274 putative periplasmic   52.7 1.6E+02  0.0036   25.9  10.5   78   21-108    34-120 (257)
 59 smart00637 CBD_II CBD_II domai  50.9      62  0.0013   23.2   5.8   48   32-79      8-75  (92)
 60 PHA01327 hypothetical protein   48.3      18 0.00039   23.7   2.2   14  180-193    20-33  (49)
 61 PF07705 CARDB:  CARDB;  InterP  45.0      71  0.0015   22.3   5.3   52   28-79     17-69  (101)
 62 PRK15253 putative fimbrial ass  44.7      43 0.00092   29.2   4.8   65    4-76    149-214 (242)
 63 PF14796 AP3B1_C:  Clathrin-ada  43.9 1.3E+02  0.0028   24.4   7.1   59   21-79     72-138 (145)
 64 PRK15218 fimbrial chaperone pr  42.9      51  0.0011   28.4   5.0   66    4-77    134-199 (226)
 65 PRK15224 pili assembly chapero  42.6      40 0.00086   29.4   4.3   62    6-76    146-208 (237)
 66 PF07233 DUF1425:  Protein of u  41.4 1.3E+02  0.0027   22.2   6.3   51   29-79     23-80  (94)
 67 PRK15195 fimbrial chaperone pr  40.4      53  0.0011   28.3   4.6   64    3-76    137-201 (229)
 68 PF08277 PAN_3:  PAN-like domai  40.3      40 0.00086   22.8   3.2   19   32-50     53-71  (71)
 69 PF13473 Cupredoxin_1:  Cupredo  39.8 1.5E+02  0.0032   21.6   6.8   53   11-79     30-82  (104)
 70 TIGR02656 cyanin_plasto plasto  38.3 1.2E+02  0.0026   22.1   5.8   15   64-78     61-75  (99)
 71 PRK15254 fimbrial chaperone pr  37.6      72  0.0016   27.8   5.1   70    2-78    126-199 (239)
 72 PRK15308 putative fimbrial pro  36.6 2.9E+02  0.0062   24.1  10.7   85   10-110    17-119 (234)
 73 smart00605 CW CW domain.        34.8      71  0.0015   23.1   4.0   22   35-56     58-80  (94)
 74 PF03168 LEA_2:  Late embryogen  33.8 1.6E+02  0.0036   20.4   6.2   45   35-79      1-51  (101)
 75 PF11616 EZH2_WD-Binding:  WD r  32.8      21 0.00046   21.6   0.7    9  178-186    18-26  (30)
 76 PF06159 DUF974:  Protein of un  31.6 1.5E+02  0.0033   25.7   6.1   51   29-79     13-75  (249)
 77 PF06483 ChiC:  Chitinase C;  I  29.7      62  0.0014   27.3   3.2   26   44-80    116-141 (180)
 78 PF13205 Big_5:  Bacterial Ig-l  29.2 2.1E+02  0.0046   20.3   6.6   56   21-79     26-84  (107)
 79 PF11538 Snurportin1:  Snurport  28.0      44 0.00096   21.4   1.6   15  181-195    15-29  (40)
 80 PRK02710 plastocyanin; Provisi  27.8 2.2E+02  0.0048   21.6   5.9   15   65-79     82-96  (119)
 81 PF11668 Gp_UL130:  HCMV glycop  25.5 1.8E+02  0.0039   23.9   5.1   45   20-64    100-154 (156)
 82 PF09640 DUF2027:  Domain of un  25.3 2.6E+02  0.0055   23.3   6.0   68   32-110    18-85  (162)
 83 PF12086 DUF3563:  Protein of u  23.1      56  0.0012   22.8   1.5   13  181-193    18-30  (59)
 84 PF00127 Copper-bind:  Copper b  23.1   2E+02  0.0043   20.9   4.7   16   63-78     60-75  (99)
 85 PRK15285 putative fimbrial cha  22.1 2.3E+02   0.005   24.8   5.6   66    6-77    139-208 (250)
 86 PRK15233 putative fimbrial cha  22.1   1E+02  0.0022   27.0   3.4   62    6-74    160-221 (246)
 87 PF11906 DUF3426:  Protein of u  21.1 2.5E+02  0.0055   21.7   5.2   50   30-79     68-134 (149)
 88 PF09478 CBM49:  Carbohydrate b  20.9 3.1E+02  0.0067   19.3   5.4   21   33-53     20-41  (80)
 89 KOG1769 Ubiquitin-like protein  20.7 1.2E+02  0.0027   23.2   3.1   26   32-57     19-44  (99)
 90 COG3311 AlpA Predicted transcr  20.5      51  0.0011   23.6   0.9   22  165-187    42-65  (70)
 91 PRK15274 putative periplasmic   20.4 2.3E+02  0.0049   25.0   5.2   66    5-77    139-209 (257)

No 1  
>KOG0439 consensus VAMP-associated protein involved in inositol metabolism [Intracellular trafficking, secretion, and vesicular transport]
Probab=99.97  E-value=1.2e-30  Score=219.71  Aligned_cols=184  Identities=34%  Similarity=0.464  Sum_probs=157.5

Q ss_pred             eeEEccCCCceeEEEccCCCCceEEEEEEEcCCCCeEEEEEeeCCCcceeecCCceeeCCCCeEEEEEEeeccCCCCCCC
Q 028956           10 HDHFYPNGAVSSLILPISSTFPVRSAIKIKNTSKSHVAFKFQTTAPKSCFMRPPGAILAPGESLIATVFKFVELPENNEK   89 (201)
Q Consensus        10 ~L~i~P~~~~~eL~F~~~~~k~~~~~LtLtN~S~~~VAFKVKTTaP~~Y~VrP~~GiI~Pg~s~~I~V~l~~e~p~~~e~   89 (201)
                      .|.++|.+   +|.|.+++.+++++.|+|+|+++.++|||||||+|++||||||.|+|.||+++.|.|.+   +| ....
T Consensus         8 ~l~i~P~~---~l~F~~~~~~~~~~~l~l~N~t~~~vaFKvktT~p~~y~VrP~~G~i~p~~t~~i~v~~---q~-~~~~   80 (218)
T KOG0439|consen    8 LLEIEPSD---ELVFPLPLNEQVKCSLTLKNPTKLRVAFKVKTTAPKLYCVRPNGGVIDPGSTVEIEVTH---QP-FEKS   80 (218)
T ss_pred             ccccCCCc---eEEeccCCCceEEEEEEEecCCCCceEEEEEcCCCCeEEEcCCcceECCCCcEEEEEEe---cc-CccC
Confidence            57889974   99999999999999999999999999999999999999999999999999999999998   67 3445


Q ss_pred             CCCCCCCCeEEEEEEEeCCC-CCChhHhhhccC--CCcceeEEeEEEEeCCCCCCchhHHH---HhhhhhHHHHHHHhcC
Q 028956           90 PMYQKSRDKFKIISMKVKAD-VDYVPELFDEQK--DQTAAEQILRVVFLNPERPEPALEKL---KRQLADADAAVAARKK  163 (201)
Q Consensus        90 p~~~~~kDKFlVqs~~v~~~-~d~~~~lwk~~~--~~~i~e~kLrV~f~~p~~ps~~~e~l---~~~l~~a~~~~~~~~~  163 (201)
                      |.+++|+|||+||++.++.+ .....++|+..+  +..+.+.+++|.|+.|..+....++.   ..+..+.++...+...
T Consensus        81 P~d~~~r~kF~v~~~~~~~~~~~~~~~~~~~~k~~~~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  160 (218)
T KOG0439|consen   81 PPDFKSRHKFLIQSLKAPPPTTRDVVDLWKFQKETPKESFETKLRVVFVAPTETDSVVAKLQKAKKKEAEKEAFGEATKE  160 (218)
T ss_pred             chhhcccceEEEEEEecCCccccchhhhccccccccccccceeeEEEeeCCCCCcccccccccccccCCccccccccccc
Confidence            78889999999999999976 567899999988  78899999999999887777666555   5566666677777666


Q ss_pred             CCCCC---CCceeeeeeehhHH------------------HHHHHHHHHHhhhcccCC
Q 028956          164 PPEDT---GPRIIGEGLVIDEW------------------KERRERYLARQQVEGVDS  200 (201)
Q Consensus       164 ~~~~~---~~~~~~~~~~~~~~------------------~~~r~~~~~~~~~~~~~~  200 (201)
                      .....   .+...++.+++++|                  +++++++++.+|.+...|
T Consensus       161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  218 (218)
T KOG0439|consen  161 ASDGEVCVKSKEFGEKLELKEELKAFKLKANKVDEERLLKKKKEGRLLAELQAELVIS  218 (218)
T ss_pred             cCcccccchhhhhhccccchhhhhccccccccccccchhhhhhhHHHHHhhhhhhccC
Confidence            65543   46677899999999                  889999999999876543


No 2  
>COG5066 SCS2 VAMP-associated protein involved in inositol metabolism [Intracellular trafficking and secretion]
Probab=99.95  E-value=2.1e-28  Score=206.28  Aligned_cols=118  Identities=22%  Similarity=0.284  Sum_probs=109.6

Q ss_pred             eeEEccCCCceeEEEccCCCCceEEEEEEEcCCCCeEEEEEeeCCCcceeecCCceeeCCCCeEEEEEEeeccCCCCCCC
Q 028956           10 HDHFYPNGAVSSLILPISSTFPVRSAIKIKNTSKSHVAFKFQTTAPKSCFMRPPGAILAPGESLIATVFKFVELPENNEK   89 (201)
Q Consensus        10 ~L~i~P~~~~~eL~F~~~~~k~~~~~LtLtN~S~~~VAFKVKTTaP~~Y~VrP~~GiI~Pg~s~~I~V~l~~e~p~~~e~   89 (201)
                      .+.++|.     +.|..++....++.+-|.|++..+|+||||||+|+.||||||.|+|.|++++.|.|+|   +++.+++
T Consensus         2 aveisp~-----~~fy~Plt~~ske~~sv~NnspepvgfKVKTTaPK~YcVRPN~g~Iep~stv~VeVil---q~l~eEp   73 (242)
T COG5066           2 AVEISPQ-----TTFYVPLTNKSKEMFSVQNNSPEPVGFKVKTTAPKDYCVRPNMGLIEPMSTVEVEVIL---QGLTEEP   73 (242)
T ss_pred             ceEecCc-----eEEecccccccceeeEeecCCCCceeEEeeccCCcceeEcCCCceeccCCeeEEEEEe---eccccCC
Confidence            5677777     8888899999999999999999999999999999999999999999999999999999   8999999


Q ss_pred             CCCCCCCCeEEEEEEEeCCCC--CChhHhhhccCCCcceeEEeEEEEe
Q 028956           90 PMYQKSRDKFKIISMKVKADV--DYVPELFDEQKDQTAAEQILRVVFL  135 (201)
Q Consensus        90 p~~~~~kDKFlVqs~~v~~~~--d~~~~lwk~~~~~~i~e~kLrV~f~  135 (201)
                      .++.+|+||||||++..+.+.  .+..++|+...+.-|+++||+|+|.
T Consensus        74 apdfKCrdKFLiqs~~~~~~l~g~d~ad~wt~~sk~~i~~rkIrcvys  121 (242)
T COG5066          74 APDFKCRDKFLIQSYRFDWRLSGSDFADHWTSSSKKPIWTRKIRCVYS  121 (242)
T ss_pred             CCCccccceeEEEEeccChhhccchHHHHHHhhccccchhhheeEEee
Confidence            999999999999999998654  4569999999988899999999998


No 3  
>PF00635 Motile_Sperm:  MSP (Major sperm protein) domain;  InterPro: IPR000535 Major sperm proteins (MSP) are central components in molecular interactions underlying sperm motility in Caenorhabditis elegans, whose sperm employ an amoebae-like crawling motion using a MSP-containing lamellipod, rather than the flagellar-based swimming motion associated with other sperm. These proteins oligomerise to form an extensive filament system that extends from sperm villipoda, along the leading edge of the pseudopod. About 30 MSP isoforms may exist in C. elegans. MSPs form a fibrous network, whereby MSP dimers form helical subfilaments that coil around one another to produce filaments, which in turn form supercoils to produce bundles. The crystal structure of MSP from C. elegans reveals an immunoglobulin (Ig)-like seven-stranded beta sandwich fold []. ; GO: 0005198 structural molecule activity; PDB: 1MSP_A 3MSP_B 2BVU_B 2MSP_C 1Z9O_F 1Z9L_A 3IKK_A 1WIC_A 2CRI_A 2RR3_A ....
Probab=99.90  E-value=2.2e-23  Score=156.67  Aligned_cols=104  Identities=25%  Similarity=0.353  Sum_probs=82.8

Q ss_pred             eeEEccCCCceeEEEccCCCCceEEEEEEEcCCCCeEEEEEeeCCCcceeecCCceeeCCCCeEEEEEEeeccCCCCCCC
Q 028956           10 HDHFYPNGAVSSLILPISSTFPVRSAIKIKNTSKSHVAFKFQTTAPKSCFMRPPGAILAPGESLIATVFKFVELPENNEK   89 (201)
Q Consensus        10 ~L~i~P~~~~~eL~F~~~~~k~~~~~LtLtN~S~~~VAFKVKTTaP~~Y~VrP~~GiI~Pg~s~~I~V~l~~e~p~~~e~   89 (201)
                      .|.|+|.+   .|.|..+.++..++.|+|+|+++.+||||||||+|.+|+|+|+.|+|.||+++.|.|++   ++.....
T Consensus         1 ~l~v~P~~---~i~F~~~~~~~~~~~l~l~N~s~~~i~fKiktt~~~~y~v~P~~G~i~p~~~~~i~I~~---~~~~~~~   74 (109)
T PF00635_consen    1 DLSVEPSE---LIFFNAPFNKQQSCELTLTNPSDKPIAFKIKTTNPNRYRVKPSYGIIEPGESVEITITF---QPFDFEP   74 (109)
T ss_dssp             -CEEESSS---EEEEESSTSS-EEEEEEEEE-SSSEEEEEEEES-TTTEEEESSEEEE-TTEEEEEEEEE----SSSTTT
T ss_pred             CeEEeCCc---ceEEcCCCCceEEEEEEEECCCCCcEEEEEEcCCCceEEecCCCEEECCCCEEEEEEEE---EecccCC
Confidence            37899995   99999999999999999999999999999999999999999999999999999999998   6654432


Q ss_pred             CCCCCCCCeEEEEEEEeCCCCC----ChhHhhhccC
Q 028956           90 PMYQKSRDKFKIISMKVKADVD----YVPELFDEQK  121 (201)
Q Consensus        90 p~~~~~kDKFlVqs~~v~~~~d----~~~~lwk~~~  121 (201)
                       . ...+|||+|+++.++++..    .+..+|++..
T Consensus        75 -~-~~~~dkf~I~~~~~~~~~~~~~~~~~~~~~~~~  108 (109)
T PF00635_consen   75 -S-NKKKDKFLIQSIVVPDNATDPKKDFKQIWKNGK  108 (109)
T ss_dssp             -T-STSSEEEEEEEEEE-TT-SSSHHHHHCCHHHSS
T ss_pred             -C-CCCCCEEEEEEEEcCCCccchhhhHHHHHhccC
Confidence             1 1239999999999987763    3567777654


No 4  
>PF14874 PapD-like:  Flagellar-associated PapD-like
Probab=98.53  E-value=1.3e-06  Score=64.91  Aligned_cols=66  Identities=23%  Similarity=0.265  Sum_probs=58.7

Q ss_pred             eeEEccCCCceeEEEccC-CCCceEEEEEEEcCCCCeEEEEEeeCC--CcceeecCCceeeCCCCeEEEEEEe
Q 028956           10 HDHFYPNGAVSSLILPIS-STFPVRSAIKIKNTSKSHVAFKFQTTA--PKSCFMRPPGAILAPGESLIATVFK   79 (201)
Q Consensus        10 ~L~i~P~~~~~eL~F~~~-~~k~~~~~LtLtN~S~~~VAFKVKTTa--P~~Y~VrP~~GiI~Pg~s~~I~V~l   79 (201)
                      .|.++|.    +|.|-.- .+...+..++|+|.|..+..|+|+...  ...|.|.|..|.|+||++..+.|++
T Consensus         3 ~l~v~P~----~ldFG~v~~g~~~~~~v~l~N~s~~p~~f~v~~~~~~~~~~~v~~~~g~l~PG~~~~~~V~~   71 (102)
T PF14874_consen    3 TLEVSPK----ELDFGNVFVGQTYSRTVTLTNTSSIPARFRVRQPESLSSFFSVEPPSGFLAPGESVELEVTF   71 (102)
T ss_pred             EEEEeCC----EEEeeEEccCCEEEEEEEEEECCCCCEEEEEEeCCcCCCCEEEECCCCEECCCCEEEEEEEE
Confidence            7899999    8999764 566788999999999999999997533  5689999999999999999999998


No 5  
>PF00345 PapD_N:  Pili and flagellar-assembly chaperone, PapD N-terminal domain;  InterPro: IPR016147 Most Gram-negative bacteria possess a supramolecular structure - the pili - on their surface, which mediates attachment to specific receptors. Many interactive subunits are required to assemble pili, but their assembly only takes place after translocation across the cytoplasmic membrane. Periplasmic chaperones assist pili assembly by binding to the subunits, thereby preventing premature aggregation [, ]. Pili chaperones are structurally, and possibly evolutionarily, related to the immunoglobulin superfamily [, ]: they contain two globular domains, with a topology identical to an immunoglobulin fold. This entry represents the N-terminal domain of pili assembly chaperone, and has a beta-sandwich fold consisting of seven strands in two sheets with a Greek key topology.; GO: 0007047 cellular cell wall organization, 0030288 outer membrane-bounded periplasmic space; PDB: 2CO6_B 2CO7_B 1L4I_B 3GFU_A 3F65_F 3F6L_A 3F6I_A 3GEW_B 3DSN_D 2OS7_B ....
Probab=96.69  E-value=0.05  Score=41.61  Aligned_cols=63  Identities=22%  Similarity=0.313  Sum_probs=52.0

Q ss_pred             eeEEccCCCceeEEEccCCCCceEEEEEEEcCCCCeEEEEEeeCC----C-----cceeecCCceeeCCCCeEEEEEEe
Q 028956           10 HDHFYPNGAVSSLILPISSTFPVRSAIKIKNTSKSHVAFKFQTTA----P-----KSCFMRPPGAILAPGESLIATVFK   79 (201)
Q Consensus        10 ~L~i~P~~~~~eL~F~~~~~k~~~~~LtLtN~S~~~VAFKVKTTa----P-----~~Y~VrP~~GiI~Pg~s~~I~V~l   79 (201)
                      .|.|+|.    .+.|...   .....++|+|.++.++.+.+....    +     ..|.|-|+.-.|+||++..|.|..
T Consensus         1 gi~i~~t----rii~~~~---~~~~~i~v~N~~~~~~~vq~~v~~~~~~~~~~~~~~~~vsPp~~~L~pg~~q~vRv~~   72 (122)
T PF00345_consen    1 GIQISPT----RIIFNES---QRSASITVTNNSDQPYLVQVWVYDQDDEDEDEPTDPFIVSPPIFRLEPGESQTVRVYR   72 (122)
T ss_dssp             EEEESSS----EEEEETT---SSEEEEEEEESSSSEEEEEEEEEETTSTTSSSSSSSEEEESSEEEEETTEEEEEEEEE
T ss_pred             CEEEccE----EEEEeCC---CCEEEEEEEcCCCCcEEEEEEEEcCCCcccccccccEEEeCCceEeCCCCcEEEEEEe
Confidence            3678888    7887763   358899999999999999987553    1     268999999999999999999943


No 6  
>PF14646 MYCBPAP:  MYCBP-associated protein family
Probab=94.31  E-value=1.2  Score=41.57  Aligned_cols=112  Identities=13%  Similarity=0.219  Sum_probs=76.5

Q ss_pred             eeEEEccCCCCceEEEEE-EEcCCCCeEEEEEeeCC------------CcceeecCCceeeCCCCeEEEEEEeeccCCCC
Q 028956           20 SSLILPISSTFPVRSAIK-IKNTSKSHVAFKFQTTA------------PKSCFMRPPGAILAPGESLIATVFKFVELPEN   86 (201)
Q Consensus        20 ~eL~F~~~~~k~~~~~Lt-LtN~S~~~VAFKVKTTa------------P~~Y~VrP~~GiI~Pg~s~~I~V~l~~e~p~~   86 (201)
                      -.|.|.-.++......|+ |.|.+..-|-|..+--.            ...|......|+|.||++..+.|+.   +|..
T Consensus       237 ~~l~Fe~~p~e~~~~~v~~l~N~Gt~~I~y~W~~~~~~~~~~~~~~~~~~~F~Fd~~~gvilPGe~~~~~~~F---~s~~  313 (426)
T PF14646_consen  237 IRLTFECHPGERVSKEVVRLENNGTTAIYYSWRRVPFFKNFGSLFRAQDQRFYFDTSSGVILPGETRNFPFMF---KSRK  313 (426)
T ss_pred             eEEEEEcccCceeeEEEEEEecCCceEEEEEEEecccccccchhccccCCeEEEeCCCCEECCCceEEEEEEE---eCCC
Confidence            478999999987877777 99999999999986432            3468889999999999999999997   5542


Q ss_pred             CCCCCCCCCCCeEEEEEEEeCCCCCChhHhhhccCCCcceeEEeEEEEeCCCCCCchhHHHHhhhh
Q 028956           87 NEKPMYQKSRDKFKIISMKVKADVDYVPELFDEQKDQTAAEQILRVVFLNPERPEPALEKLKRQLA  152 (201)
Q Consensus        87 ~e~p~~~~~kDKFlVqs~~v~~~~d~~~~lwk~~~~~~i~e~kLrV~f~~p~~ps~~~e~l~~~l~  152 (201)
                      .     .-.++...+..-+.         +|.    .....-+|+.+-+++..-...++.+++.|+
T Consensus       314 ~-----Gif~E~W~L~t~P~---------l~~----~~~l~v~L~G~~~~~~~~~~~~~~~~~~l~  361 (426)
T PF14646_consen  314 V-----GIFKERWELRTFPP---------LFG----GASLTVRLHGVCTPPDEYLDKRKMLEEELA  361 (426)
T ss_pred             c-----eEEEEEEEEEEecc---------ccC----CCceEEEEEEEEcCchHhHHHHHHHHHHHH
Confidence            1     12356666655332         222    122445777777765444444555555443


No 7  
>PF11614 FixG_C:  IG-like fold at C-terminal of FixG, putative oxidoreductase; PDB: 2R39_A.
Probab=92.58  E-value=0.72  Score=35.07  Aligned_cols=71  Identities=14%  Similarity=0.241  Sum_probs=41.4

Q ss_pred             eCceeEEccCCCceeEEEc-cCCCC-ceEEEEEEEcCCCCeEEEEEeeCCCcceee-cCCce-eeCCCCeEEEEEEee
Q 028956            7 LPLHDHFYPNGAVSSLILP-ISSTF-PVRSAIKIKNTSKSHVAFKFQTTAPKSCFM-RPPGA-ILAPGESLIATVFKF   80 (201)
Q Consensus         7 ~p~~L~i~P~~~~~eL~F~-~~~~k-~~~~~LtLtN~S~~~VAFKVKTTaP~~Y~V-rP~~G-iI~Pg~s~~I~V~l~   80 (201)
                      .|+.+.+-+.   +-..|. ...|. ...-.|+|.|.+.++..|.|+...+..+.+ .|... .|.||++..+.|.+.
T Consensus         9 ~~~~~~V~rd---r~~ly~~~~dg~I~N~Y~lkl~Nkt~~~~~~~i~~~g~~~~~l~~~~~~i~v~~g~~~~~~v~v~   83 (118)
T PF11614_consen    9 KPVELNVLRD---RGPLYRELSDGSIRNQYTLKLTNKTNQPRTYTISVEGLPGAELQGPENTITVPPGETREVPVFVT   83 (118)
T ss_dssp             -SEEEEEEE----SS---------SEEEEEEEEEEE-SSS-EEEEEEEES-SS-EE-ES--EEEE-TT-EEEEEEEEE
T ss_pred             CcEEEEEEec---CCCcEEEcCCCeEEEEEEEEEEECCCCCEEEEEEEecCCCeEEECCCcceEECCCCEEEEEEEEE
Confidence            3556666665   344444 33333 446799999999999999999888888888 66455 599999999888873


No 8  
>PRK09918 putative fimbrial chaperone protein; Provisional
Probab=92.23  E-value=6.8  Score=33.66  Aligned_cols=108  Identities=18%  Similarity=0.140  Sum_probs=68.2

Q ss_pred             eeEEccCCCceeEEEccCCCCceEEEEEEEcCCCCeEEEEEeeCCC-----cceeecCCceeeCCCCeEEEEEEeeccCC
Q 028956           10 HDHFYPNGAVSSLILPISSTFPVRSAIKIKNTSKSHVAFKFQTTAP-----KSCFMRPPGAILAPGESLIATVFKFVELP   84 (201)
Q Consensus        10 ~L~i~P~~~~~eL~F~~~~~k~~~~~LtLtN~S~~~VAFKVKTTaP-----~~Y~VrP~~GiI~Pg~s~~I~V~l~~e~p   84 (201)
                      .|.++|.    .+.|...   ....+++|+|.++.++.........     .-|.|-|+.-.|+||++..|.|.+   ..
T Consensus        25 ~v~l~~t----Rvi~~~~---~~~~si~v~N~~~~p~lvQ~wv~~~~~~~~~~fivtPPl~rl~pg~~q~vRii~---~~   94 (230)
T PRK09918         25 GMVPETS----VVIVEES---DGEGSINVKNTDSNPILLYTTLVDLPEDKSKLLLVTPPVARVEPGQSQQVRFIL---KS   94 (230)
T ss_pred             eEEEccE----EEEEECC---CCeEEEEEEcCCCCcEEEEEEEecCCCCCCCCEEEcCCeEEECCCCceEEEEEE---CC
Confidence            5677777    6777654   4678999999998876665543211     259999999999999999999987   22


Q ss_pred             CCCCCCCCCCCCCeEEEEEEEeCCCCCChhHhhhccCCCcceeEEeEEEEeCCC
Q 028956           85 ENNEKPMYQKSRDKFKIISMKVKADVDYVPELFDEQKDQTAAEQILRVVFLNPE  138 (201)
Q Consensus        85 ~~~e~p~~~~~kDKFlVqs~~v~~~~d~~~~lwk~~~~~~i~e~kLrV~f~~p~  138 (201)
                         ..|.+.  ---|.+....++...+.      +..=+.....++++-|-+..
T Consensus        95 ---~lp~dr--Es~f~l~v~~IP~~~~~------~~~l~ia~r~~iklfyRP~~  137 (230)
T PRK09918         95 ---GSPLNT--EHLLRVSFEGVPPKPGG------KNKVVMPIRQDLPVLIQPAA  137 (230)
T ss_pred             ---CCCCCe--eEEEEEEEEEcCCCCCC------CCEEEEEEEeEEEEEEeCCC
Confidence               123322  12366666666653221      10112223556777676444


No 9  
>PF07610 DUF1573:  Protein of unknown function (DUF1573);  InterPro: IPR011467 These hypothetical proteins from bacteria, such as Rhodopirellula baltica, Bacteroides thetaiotaomicron and Porphyromonas gingivalis, share a region of conserved sequence towards their N termini.
Probab=91.96  E-value=0.71  Score=29.77  Aligned_cols=42  Identities=26%  Similarity=0.184  Sum_probs=33.9

Q ss_pred             EEEEcCCCCeE-EEEEeeCCCcceeecCCceeeCCCCeEEEEEE
Q 028956           36 IKIKNTSKSHV-AFKFQTTAPKSCFMRPPGAILAPGESLIATVF   78 (201)
Q Consensus        36 LtLtN~S~~~V-AFKVKTTaP~~Y~VrP~~GiI~Pg~s~~I~V~   78 (201)
                      ++++|.++.++ -.+|+|+ =+=..+..+...|.||++..|.|+
T Consensus         2 F~~~N~g~~~L~I~~v~ts-CgCt~~~~~~~~i~PGes~~i~v~   44 (45)
T PF07610_consen    2 FEFTNTGDSPLVITDVQTS-CGCTTAEYSKKPIAPGESGKIKVT   44 (45)
T ss_pred             EEEEECCCCcEEEEEeeEc-cCCEEeeCCcceECCCCEEEEEEE
Confidence            68999998776 6677664 556677778889999999999987


No 10 
>PRK11385 putativi pili assembly chaperone; Provisional
Probab=88.65  E-value=14  Score=32.06  Aligned_cols=107  Identities=16%  Similarity=0.228  Sum_probs=65.3

Q ss_pred             eeEEccCCCceeEEEccCCCCceEEEEEEEcCCCCeEEEEEee--C------------CCcceeecCCceeeCCCCeEEE
Q 028956           10 HDHFYPNGAVSSLILPISSTFPVRSAIKIKNTSKSHVAFKFQT--T------------APKSCFMRPPGAILAPGESLIA   75 (201)
Q Consensus        10 ~L~i~P~~~~~eL~F~~~~~k~~~~~LtLtN~S~~~VAFKVKT--T------------aP~~Y~VrP~~GiI~Pg~s~~I   75 (201)
                      .+.+++.    .+.|...   ....+++|.|.++.++  =|++  .            ...-|.|-|+.=-|+||+...+
T Consensus        27 ~v~l~~T----RvIy~~~---~~~~sv~l~N~~~~p~--LvQswv~~~~~~~~~~~~~~~~pFivtPPlfrl~p~~~q~l   97 (236)
T PRK11385         27 GVVVGGT----RFIFPAD---RESISILLTNTSQESW--LINSKINRPTRWAGGEASTVPAPLLAAPPLILLKPGTTGTL   97 (236)
T ss_pred             eEEeCce----EEEEcCC---CceEEEEEEeCCCCcE--EEEEEcccCccccCcccccccCCEEEcCCeEEECCCCceEE
Confidence            4566776    6777653   4778999999998864  4444  1            1124999999999999999999


Q ss_pred             EEEeeccCCCCCCCCCCCCCCCeEEEEEEEeCCCCCChhHhhhccCCCcceeEEeEEEEeCC
Q 028956           76 TVFKFVELPENNEKPMYQKSRDKFKIISMKVKADVDYVPELFDEQKDQTAAEQILRVVFLNP  137 (201)
Q Consensus        76 ~V~l~~e~p~~~e~p~~~~~kDKFlVqs~~v~~~~d~~~~lwk~~~~~~i~e~kLrV~f~~p  137 (201)
                      .|..   ... ...|.|.  -.-|-+....+++..+.      ...=......+|++-|-+.
T Consensus        98 RIi~---~~~-~~LP~DR--ESlf~lnv~~IPp~~~~------~n~L~iair~riKLFyRP~  147 (236)
T PRK11385         98 RLLR---TES-DILPVDR--ETLFELSIASVPSGKVE------NQSVKVAMRSVFKLFWRPE  147 (236)
T ss_pred             EEEE---CCC-CCCCCCc--eEEEEEEEEecCCCcCC------CceEEEEEEeeEEEEEccc
Confidence            9997   221 1123332  23355666666653211      0011223466677766543


No 11 
>PRK15249 fimbrial chaperone protein StbB; Provisional
Probab=86.81  E-value=8  Score=33.81  Aligned_cols=63  Identities=14%  Similarity=0.210  Sum_probs=47.6

Q ss_pred             eeEEccCCCceeEEEccCCCCceEEEEEEEcCCCCeEEEEEeeCC------C-----cceeecCCceeeCCCCeEEEEEE
Q 028956           10 HDHFYPNGAVSSLILPISSTFPVRSAIKIKNTSKSHVAFKFQTTA------P-----KSCFMRPPGAILAPGESLIATVF   78 (201)
Q Consensus        10 ~L~i~P~~~~~eL~F~~~~~k~~~~~LtLtN~S~~~VAFKVKTTa------P-----~~Y~VrP~~GiI~Pg~s~~I~V~   78 (201)
                      .|.|+|.    .+.|...   ....+|+|.|.++.++....-+..      |     .-|.|-|+.--|+||+...|.|.
T Consensus        29 ~l~l~~T----Rviy~~~---~~~~sl~l~N~~~~p~LvQsWv~~~~~~~~p~~~~~~pFivtPPlfrl~p~~~q~lRI~  101 (253)
T PRK15249         29 SVTILGS----RIIYPST---ASSVDVQLKNNDAIPYIVQTWFDDGDMNTSPENSSAMPFIATPPVFRIQPKAGQVVRVI  101 (253)
T ss_pred             EEEeCce----EEEEeCC---CcceeEEEEcCCCCcEEEEEEEeCCCCCCCccccccCcEEEcCCeEEecCCCceEEEEE
Confidence            4778888    7777644   367899999999887554442211      1     13999999999999999999999


Q ss_pred             e
Q 028956           79 K   79 (201)
Q Consensus        79 l   79 (201)
                      .
T Consensus       102 ~  102 (253)
T PRK15249        102 Y  102 (253)
T ss_pred             E
Confidence            7


No 12 
>PRK09926 putative chaperone protein EcpD; Provisional
Probab=86.54  E-value=8.8  Score=33.35  Aligned_cols=63  Identities=8%  Similarity=0.101  Sum_probs=49.1

Q ss_pred             eeEEccCCCceeEEEccCCCCceEEEEEEEcCCCCeEEEEEeeCCCc----------ceeecCCceeeCCCCeEEEEEEe
Q 028956           10 HDHFYPNGAVSSLILPISSTFPVRSAIKIKNTSKSHVAFKFQTTAPK----------SCFMRPPGAILAPGESLIATVFK   79 (201)
Q Consensus        10 ~L~i~P~~~~~eL~F~~~~~k~~~~~LtLtN~S~~~VAFKVKTTaP~----------~Y~VrP~~GiI~Pg~s~~I~V~l   79 (201)
                      .|.|+|.    .+.|...   ....+++|.|.++.++.--.-...-+          -|.|-|+.--|+||+...|.|..
T Consensus        26 ~i~l~~T----RvI~~~~---~~~~sv~l~N~~~~p~LvQ~Wvd~~~~~~~p~~~~~pfivtPPl~rl~p~~~q~lRIi~   98 (246)
T PRK09926         26 DIVISGT----RIIYKSD---QKDVNVRLENKGNNPLLVQSWLDTGDDNAEPGSIKVPFTATPPVSRIDPKRGQTIKLMY   98 (246)
T ss_pred             eEEeCce----EEEEeCC---CceEEEEEEeCCCCcEEEEEEecCCCCccCccccCCCEEEcCCeEEECCCCccEEEEEe
Confidence            5677887    7887764   46789999999988766554332211          39999999999999999999997


No 13 
>PF06280 DUF1034:  Fn3-like domain (DUF1034);  InterPro: IPR010435 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This domain of unknown function is present in bacterial and plant peptidases belonging to MEROPS peptidase family S8 (subfamily S8A subtilisin, clan SB). It is C-terminal to and adjacent to the S8 peptidase domain and can be found in conjunction with the PA (Protease associated) domain (IPR003137 from INTERPRO) and additionally in Gram-positive bacteria with the surface protein anchor domain (IPR001899 from INTERPRO).; GO: 0004252 serine-type endopeptidase activity, 0005618 cell wall, 0016020 membrane; PDB: 3EIF_A 1XF1_B.
Probab=85.86  E-value=2.1  Score=32.24  Aligned_cols=51  Identities=16%  Similarity=0.188  Sum_probs=32.3

Q ss_pred             CCceEEEEEEEcCCCCeEEEEEeeC-----C---Ccceee-c------------CCceeeCCCCeEEEEEEe
Q 028956           29 TFPVRSAIKIKNTSKSHVAFKFQTT-----A---PKSCFM-R------------PPGAILAPGESLIATVFK   79 (201)
Q Consensus        29 ~k~~~~~LtLtN~S~~~VAFKVKTT-----a---P~~Y~V-r------------P~~GiI~Pg~s~~I~V~l   79 (201)
                      +...+..|+|+|.+++.+.|++.-.     .   .+.|.. .            |..=.|+||++..|.|++
T Consensus         7 ~~~~~~~itl~N~~~~~~ty~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~vTV~ag~s~~v~vti   78 (112)
T PF06280_consen    7 GNKFSFTITLHNYGDKPVTYTLSHVPVLTDKTDTEEGYSILVPPVPSISTVSFSPDTVTVPAGQSKTVTVTI   78 (112)
T ss_dssp             -SEEEEEEEEEE-SSS-EEEEEEEE-EEEEEE--ETTEEEEEEEE----EEE---EEEEE-TTEEEEEEEEE
T ss_pred             CCceEEEEEEEECCCCCEEEEEeeEEEEeeEeeccCCcccccccccceeeEEeCCCeEEECCCCEEEEEEEE
Confidence            3447889999999999999998644     1   112221 1            222368999999999998


No 14 
>smart00809 Alpha_adaptinC2 Adaptin C-terminal domain. Adaptins are components of the adaptor complexes which link clathrin to receptors in coated vesicles. Clathrin-associated protein complexes are believed to interact with the cytoplasmic tails of membrane proteins, leading to their selection and concentration. Gamma-adaptin is a subunit of the golgi adaptor. Alpha adaptin is a heterotetramer that regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site. This Ig-fold domain is found in alpha, beta and gamma adaptins and consists of a beta-sandwich containing 7 strands in 2 beta-sheets in a greek-key topology PUBMED:10430869, PUBMED:12176391. The adaptor appendage contains an additional N-terminal strand.
Probab=83.99  E-value=5.7  Score=29.02  Aligned_cols=50  Identities=30%  Similarity=0.513  Sum_probs=39.0

Q ss_pred             CceEEEEEEEcCCCCeEE-EEEeeCCCcceeec--CCce-eeCCCCeEEEEEEe
Q 028956           30 FPVRSAIKIKNTSKSHVA-FKFQTTAPKSCFMR--PPGA-ILAPGESLIATVFK   79 (201)
Q Consensus        30 k~~~~~LtLtN~S~~~VA-FKVKTTaP~~Y~Vr--P~~G-iI~Pg~s~~I~V~l   79 (201)
                      ....-.++..|.+..++. |.++-..|+.+.++  |..| .|+||+.+.-.+.+
T Consensus        18 ~~~~i~~~~~N~s~~~it~f~~~~avpk~~~l~l~~~s~~~l~p~~~i~q~~~i   71 (104)
T smart00809       18 GLIRITLTFTNKSPSPITNFSFQAAVPKSLKLQLQPPSSPTLPPGGQITQVLKV   71 (104)
T ss_pred             CeEEEEEEEEeCCCCeeeeEEEEEEcccceEEEEcCCCCCccCCCCCEEEEEEE
Confidence            457788899999988874 88888888887776  5544 89999887666665


No 15 
>PRK15211 fimbrial chaperone protein PefD; Provisional
Probab=83.02  E-value=18  Score=31.29  Aligned_cols=63  Identities=13%  Similarity=0.083  Sum_probs=46.5

Q ss_pred             eeEEccCCCceeEEEccCCCCceEEEEEEEcCCCCeEEEEEeeC------CCcceeecCCceeeCCCCeEEEEEEe
Q 028956           10 HDHFYPNGAVSSLILPISSTFPVRSAIKIKNTSKSHVAFKFQTT------APKSCFMRPPGAILAPGESLIATVFK   79 (201)
Q Consensus        10 ~L~i~P~~~~~eL~F~~~~~k~~~~~LtLtN~S~~~VAFKVKTT------aP~~Y~VrP~~GiI~Pg~s~~I~V~l   79 (201)
                      .|.+++.    .+.|...   ....+++|+|.++.++.-.....      ..--|.|-|+.-.|+||+...|.|..
T Consensus        23 ~v~l~~T----RvIy~~~---~~~~si~i~N~~~~p~LvQswv~~~~~~~~~~pFivtPPlfrl~p~~~q~lRI~~   91 (229)
T PRK15211         23 AFVLNGT----RFIYDEG---RKNISFEVTNQADQTYGGQVWIDNTTQGSSTVYMVPAPPFFKVRPKEKQIIRIMK   91 (229)
T ss_pred             EEEECce----EEEEcCC---CceEEEEEEeCCCCcEEEEEEEecCCCCCccCCEEEcCCeEEECCCCceEEEEEE
Confidence            4566666    6777653   46889999999988744333221      11249999999999999999999997


No 16 
>PF03173 CHB_HEX:  Putative carbohydrate binding domain;  InterPro: IPR004866 This domain represents the N-terminal domain in chitobiases and beta-hexosaminidases 3.2.1.52 from EC. Chitobiases degrade chitin, which forms the exoskeleton in insects and crustaceans, and which is one of the most abundant polysaccharides on earth []. Beta-hexosaminidases are composed of either a HexA/HexB heterodimer or a HexB homodimer, and can hydrolyse diverse substrates, including GM(2)-gangliosides; mutations in this enzyme are associated with Tay-Sachs disease []. HexB is structurally similar to chitobiase, consisting of a beta sandwich structure; this structure is similar to that found in the cellulose-binding domain of cellulase from Cellulomonas fimi (IPR001919 from INTERPRO), suggesting that it may function as a carbohydrate-binding domain.; GO: 0030246 carbohydrate binding; PDB: 1C7T_A 1QBA_A 1QBB_A 1C7S_A.
Probab=82.45  E-value=2.5  Score=34.86  Aligned_cols=32  Identities=31%  Similarity=0.424  Sum_probs=24.6

Q ss_pred             EEEeeCCCcceeecCCcee--eCCCCeEEEEEEe
Q 028956           48 FKFQTTAPKSCFMRPPGAI--LAPGESLIATVFK   79 (201)
Q Consensus        48 FKVKTTaP~~Y~VrP~~Gi--I~Pg~s~~I~V~l   79 (201)
                      |+|.-=+-+.|++.|.-|+  |.||+++.|.++-
T Consensus        69 f~i~hinGDl~kl~Pt~~F~gl~~Ges~~I~~~~  102 (164)
T PF03173_consen   69 FKITHINGDLHKLTPTAGFKGLAPGESLEIPFVG  102 (164)
T ss_dssp             EEEEE-STTEEEEEE-TT---B-TTEEEEEEEEE
T ss_pred             eEEEEEcCeEEEEeECCCCCccCCCCEEEEEEEc
Confidence            6777777788999999997  8999999999885


No 17 
>PF05506 DUF756:  Domain of unknown function (DUF756);  InterPro: IPR008475 This domain is found, normally as a tandem repeat, at the C terminus of bacterial phospholipase C proteins.; GO: 0004629 phospholipase C activity, 0016042 lipid catabolic process
Probab=82.20  E-value=6.6  Score=28.46  Aligned_cols=45  Identities=16%  Similarity=0.087  Sum_probs=31.4

Q ss_pred             eEEEEEEEcCCCCeEEEEEeeCCCccee-ecCCceeeCCCCeEEEEEEe
Q 028956           32 VRSAIKIKNTSKSHVAFKFQTTAPKSCF-MRPPGAILAPGESLIATVFK   79 (201)
Q Consensus        32 ~~~~LtLtN~S~~~VAFKVKTTaP~~Y~-VrP~~GiI~Pg~s~~I~V~l   79 (201)
                      ..-.|+|.|.+...+.|.|...   .|. -.|-.=.|.||++..+..-+
T Consensus        20 g~l~l~l~N~g~~~~~~~v~~~---~y~~~~~~~~~v~ag~~~~~~w~l   65 (89)
T PF05506_consen   20 GNLRLTLSNPGSAAVTFTVYDN---AYGGGGPWTYTVAAGQTVSLTWPL   65 (89)
T ss_pred             CEEEEEEEeCCCCcEEEEEEeC---CcCCCCCEEEEECCCCEEEEEEee
Confidence            4678999999999999999862   122 22333355668877777665


No 18 
>PRK15295 fimbrial assembly chaperone SthB; Provisional
Probab=81.89  E-value=20  Score=30.78  Aligned_cols=61  Identities=16%  Similarity=0.178  Sum_probs=46.1

Q ss_pred             eeEEccCCCceeEEEccCCCCceEEEEEEEcCCCCeEEEEEee--CC-------CcceeecCCceeeCCCCeEEEEEEe
Q 028956           10 HDHFYPNGAVSSLILPISSTFPVRSAIKIKNTSKSHVAFKFQT--TA-------PKSCFMRPPGAILAPGESLIATVFK   79 (201)
Q Consensus        10 ~L~i~P~~~~~eL~F~~~~~k~~~~~LtLtN~S~~~VAFKVKT--Ta-------P~~Y~VrP~~GiI~Pg~s~~I~V~l   79 (201)
                      .|.+++.    .+.|...   ....+++|.|.++.++  =|++  ..       ..-|.|-|+.=.|+||+...|.|..
T Consensus        20 ~i~l~~T----RvI~~~~---~~~~si~i~N~~~~p~--LvQsWv~~~~~~~~~~~pFivtPPl~rl~p~~~q~lRI~~   89 (226)
T PRK15295         20 SIVVGGT----RLVFDGN---NDESSINVENKDSKAN--LVQSWLSVVDPQVTNKQAFIITPPLFRLDAGQKNSIRVIR   89 (226)
T ss_pred             cEEeCce----EEEEeCC---CceeEEEEEeCCCCcE--EEEEEEeCCCCCCCCCCCEEEcCCeEEECCCCceEEEEEE
Confidence            4567777    6777664   3578999999998764  3553  11       1249999999999999999999987


No 19 
>PRK15299 fimbrial chaperone protein StiB; Provisional
Probab=81.71  E-value=32  Score=29.41  Aligned_cols=110  Identities=9%  Similarity=0.088  Sum_probs=66.5

Q ss_pred             eeEEccCCCceeEEEccCCCCceEEEEEEEcCCCCeEEEEEeeCC--------CcceeecCCceeeCCCCeEEEEEEeec
Q 028956           10 HDHFYPNGAVSSLILPISSTFPVRSAIKIKNTSKSHVAFKFQTTA--------PKSCFMRPPGAILAPGESLIATVFKFV   81 (201)
Q Consensus        10 ~L~i~P~~~~~eL~F~~~~~k~~~~~LtLtN~S~~~VAFKVKTTa--------P~~Y~VrP~~GiI~Pg~s~~I~V~l~~   81 (201)
                      .|.|+|.    .+.|...   ....+|+|+|.++.++.-..-+..        ..-|.|-|+.-.|+||+...|.|..  
T Consensus        23 ~i~l~~T----Rvi~~~~---~~~~sl~l~N~~~~p~lvQsWv~~~~~~~~~~~~pfivtPPl~rl~p~~~q~lRI~~--   93 (227)
T PRK15299         23 GINIGTT----RVIFHGD---AKDASISISNSDNVPYLIQSWAQSISETGASGDAPFMVTPPLFRLNGGQKNVLRIIR--   93 (227)
T ss_pred             eEEECce----EEEEeCC---CcEEEEEEEeCCCCcEEEEEEeecCCCCCCcCCCCEEEcCCeEEECCCCccEEEEEE--
Confidence            4677777    6777665   367899999998876544432211        1239999999999999999999997  


Q ss_pred             cCCCCCCCCCCCCCCCeEEEEEEEeCCCCCChhHhhhccCCCcceeEEeEEEEeCC
Q 028956           82 ELPENNEKPMYQKSRDKFKIISMKVKADVDYVPELFDEQKDQTAAEQILRVVFLNP  137 (201)
Q Consensus        82 e~p~~~e~p~~~~~kDKFlVqs~~v~~~~d~~~~lwk~~~~~~i~e~kLrV~f~~p  137 (201)
                       ...  ..|.+..  .-|.+....++...+...   ++ .=.....++|++.|-++
T Consensus        94 -~~~--~lP~DrE--slf~lnv~eIP~~~~~~~---~n-~l~iavr~riKLfyRP~  140 (227)
T PRK15299         94 -TGG--NLPEDRE--SLYWLDIKSIPSSNPDNK---HN-TLMLAVKAEFKLIYRPK  140 (227)
T ss_pred             -CCC--CCCCcce--EEEEEEeEecCCCCcccc---cc-eEEEEEeeeeeEEEccc
Confidence             221  1233321  236666666665321100   00 01223456777777544


No 20 
>PRK15246 fimbrial assembly chaperone StbE; Provisional
Probab=81.39  E-value=35  Score=29.55  Aligned_cols=111  Identities=11%  Similarity=0.173  Sum_probs=67.0

Q ss_pred             eeEEccCCCceeEEEccCCCCceEEEEEEEcCCCCeEEEEEee--C------CCc----ceeecCCceeeCCCCeEEEEE
Q 028956           10 HDHFYPNGAVSSLILPISSTFPVRSAIKIKNTSKSHVAFKFQT--T------APK----SCFMRPPGAILAPGESLIATV   77 (201)
Q Consensus        10 ~L~i~P~~~~~eL~F~~~~~k~~~~~LtLtN~S~~~VAFKVKT--T------aP~----~Y~VrP~~GiI~Pg~s~~I~V   77 (201)
                      .|.|++.    .+.|...   ....+++|.|.++.++  -|++  .      .|.    -|.|-|+.=.|+||+...+.|
T Consensus        11 ~v~l~~T----RvI~~~~---~~~~sv~l~N~~~~p~--LvQsWvd~~~~~~~p~~~~~pFivtPPlfrl~~~~~~~lRI   81 (233)
T PRK15246         11 AVNIDRT----RIIFASD---DVAQSLTLSNDNTTPM--LLQVWTDAGNIDASPDNSKTPLVALPPVFKMQPGELRTLRL   81 (233)
T ss_pred             EEEECce----EEEEcCC---CceEEEEEEeCCCCcE--EEEEEEeCCCCccCcccccCcEEECCcceEECCCCceEEEE
Confidence            4667777    7777753   4678999999998764  4443  1      111    499999999999999999999


Q ss_pred             EeeccCCCCCCCCCCCCCCCeEEEEEEEeCCCCCChhHhhhccCC-CcceeEEeEEEEeCCC
Q 028956           78 FKFVELPENNEKPMYQKSRDKFKIISMKVKADVDYVPELFDEQKD-QTAAEQILRVVFLNPE  138 (201)
Q Consensus        78 ~l~~e~p~~~e~p~~~~~kDKFlVqs~~v~~~~d~~~~lwk~~~~-~~i~e~kLrV~f~~p~  138 (201)
                      .+   ... ...|.|.  ---|-+....+++..+...   +.... ......+|++-|-+..
T Consensus        82 ~~---~~~-~~LP~DR--ESlf~lnv~~IP~~~~~~~---~~~~~l~iair~rIKlFyRP~~  134 (233)
T PRK15246         82 LL---SSR-QQLATDR--ESLFWLNIYQIPPVTQDIK---NHPRKLVLPLRLRLKILIRPTG  134 (233)
T ss_pred             EE---CCC-CCCCCCc--eEEEEEEEEEcCCCCcccc---cccceEEEEeeeEEEEEECCcc
Confidence            97   221 1123322  1236666666665322100   00001 1223667777776443


No 21 
>PF02883 Alpha_adaptinC2:  Adaptin C-terminal domain;  InterPro: IPR008152 Proteins synthesized on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. These vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transport []. Clathrin coats contain both clathrin (acts as a scaffold) and adaptor complexes that link clathrin to receptors in coated vesicles. Clathrin-associated protein complexes are believed to interact with the cytoplasmic tails of membrane proteins, leading to their selection and concentration. The two major types of clathrin adaptor complexes are the heterotetrameric adaptor protein (AP) complexes, and the monomeric GGA (Golgi-localising, Gamma-adaptin ear domain homology, ARF-binding proteins) adaptors [, ]. AP (adaptor protein) complexes are found in coated vesicles and clathrin-coated pits. AP complexes connect cargo proteins and lipids to clathrin at vesicle budding sites, as well as binding accessory proteins that regulate coat assembly and disassembly (such as AP180, epsins and auxilin). There are different AP complexes in mammals. AP1 is responsible for the transport of lysosomal hydrolases between the TGN and endosomes []. AP2 associates with the plasma membrane and is responsible for endocytosis []. AP3 is responsible for protein trafficking to lysosomes and other related organelles []. AP4 is less well characterised. AP complexes are heterotetramers composed of two large subunits (adaptins), a medium subunit (mu) and a small subunit (sigma). For example, in AP1 these subunits are gamma-1-adaptin, beta-1-adaptin, mu-1 and sigma-1, while in AP2 they are alpha-adaptin, beta-2-adaptin, mu-2 and sigma-2. Each subunit has a specific function. Adaptins recognise and bind to clathrin through their hinge region (clathrin box), and recruit accessory proteins that modulate AP function through their C-terminal ear (appendage) domains. Mu recognises tyrosine-based sorting signals within the cytoplasmic domains of transmembrane cargo proteins []. One function of clathrin and AP2 complex-mediated endocytosis is to regulate the number of GABA(A) receptors available at the cell surface [].  GGAs (Golgi-localising, Gamma-adaptin ear domain homology, ARF-binding proteins) are a family of monomeric clathrin adaptor proteins that are conserved from yeasts to humans. GGAs regulate clathrin-mediated the transport of proteins (such as mannose 6-phosphate receptors) from the TGN to endosomes and lysosomes through interactions with TGN-sorting receptors, sometimes in conjunction with AP-1 [, ]. GGAs bind cargo, membranes, clathrin and accessory factors. GGA1, GGA2 and GGA3 all contain a domain homologous to the ear domain of gamma-adaptin. GGAs are composed of a single polypeptide with four domains: an N-terminal VHS (Vps27p/Hrs/Stam) domain, a GAT (GGA and Tom1) domain, a hinge region, and a C-terminal GAE (gamma-adaptin ear) domain. The VHS domain is responsible for endocytosis and signal transduction, recognising transmembrane cargo through the ACLL sequence in the cytoplasmic domains of sorting receptors []. The GAT domain (also found in Tom1 proteins) interacts with ARF (ADP-ribosylation factor) to regulate membrane trafficking [], and with ubiquitin for receptor sorting []. The hinge region contains a clathrin box for recognition and binding to clathrin, similar to that found in AP adaptins. The GAE domain is similar to the AP gamma-adaptin ear domain, and is responsible for the recruitment of accessory proteins that regulate clathrin-mediated endocytosis [].  This entry represents a beta-sandwich structural motif found in the appendage (ear) domain of alpha-, beta- and gamma-adaptin from AP clathrin adaptor complexes, and the GAE (gamma-adaptin ear) domain of GGA adaptor proteins. These domains have an immunoglobulin-like beta-sandwich fold containing 7 or 8 strands in 2 beta-sheets in a Greek key topology [, ]. Although these domains share a similar fold, there is little sequence identity between the alpha/beta-adaptins and gamma-adaptin/GAE. More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0006886 intracellular protein transport, 0016192 vesicle-mediated transport, 0030131 clathrin adaptor complex; PDB: 3MNM_B 3ZY7_B 1GYU_A 1GYW_B 2A7B_A 1GYV_A 2E9G_A 1E42_B 2G30_A 2IV9_B ....
Probab=78.28  E-value=7.3  Score=29.10  Aligned_cols=51  Identities=25%  Similarity=0.488  Sum_probs=35.7

Q ss_pred             CCceEEEEEEEcCCCCeEE-EEEeeCCCcceeec--CC-ceeeCCCCeEEEEEEe
Q 028956           29 TFPVRSAIKIKNTSKSHVA-FKFQTTAPKSCFMR--PP-GAILAPGESLIATVFK   79 (201)
Q Consensus        29 ~k~~~~~LtLtN~S~~~VA-FKVKTTaP~~Y~Vr--P~-~GiI~Pg~s~~I~V~l   79 (201)
                      +....-.++..|.+..++. |.++-..|+.|.++  |. ...|+|+..++-.+.+
T Consensus        23 ~~~~~i~~~f~N~s~~~it~f~~q~avpk~~~l~l~~~s~~~i~p~~~i~Q~~~v   77 (115)
T PF02883_consen   23 PNQGRIKLTFGNKSSQPITNFSFQAAVPKSFKLQLQPPSSSTIPPGQQITQVIKV   77 (115)
T ss_dssp             TTEEEEEEEEEE-SSS-BEEEEEEEEEBTTSEEEEEESS-SSB-TTTEEEEEEEE
T ss_pred             CCEEEEEEEEEECCCCCcceEEEEEEeccccEEEEeCCCCCeeCCCCeEEEEEEE
Confidence            4567889999999988875 88777667765554  65 5599999988877776


No 22 
>PF00927 Transglut_C:  Transglutaminase family, C-terminal ig like domain;  InterPro: IPR008958 Synonym(s): Protein-glutamine gamma-glutamyltransferase, Fibrinoligase, TGase  Transglutaminases catalyse the post-translational modification of proteins at glutamine residues, with formation of isopeptide bonds. Members of the transglutaminase family usually have three domains: N-terminal (IPR001102 from INTERPRO), middle (IPR013808 from INTERPRO) and C-terminal. The middle domain is usually well conserved, but family members can display major differences in their N- and C-terminal domains, although their overall structure is conserved []. This entry represents the C-terminal domain found in transglutaminases, which consists of an immunoglobulin-like beta-sandwich consisting of seven strands in two sheets with a Greek key topology. The best known transglutaminase is blood coagulation factor XIII, a plasma tetrameric protein composed of two catalytic A subunits and two non-catalytic B subunits. Factor XIII is responsible for cross-linking fibrin chains, thus stabilising the fibrin clot. Protein-glutamine gamma-glutamyltransferases (2.3.2.13 from EC) are calcium-dependent enzymes that catalyse the cross-linking of proteins by promoting the formation of isopeptide bonds between the gamma-carboxyl group of a glutamine in one polypeptide chain and the epsilon-amino group of a lysine in a second polypeptide chain. TGases also catalyse the conjugation of polyamines to proteins [, ].; GO: 0003810 protein-glutamine gamma-glutamyltransferase activity, 0018149 peptide cross-linking; PDB: 2XZZ_A 1GGY_B 1FIE_B 1GGU_B 1GGT_B 1F13_A 1QRK_B 1EVU_A 1EX0_B 1L9N_B ....
Probab=76.56  E-value=10  Score=28.12  Aligned_cols=52  Identities=19%  Similarity=0.142  Sum_probs=38.4

Q ss_pred             CCCceEEEEEEEcCCCCe--------EEEEEeeCCCc--ceeecCCceeeCCCCeEEEEEEe
Q 028956           28 STFPVRSAIKIKNTSKSH--------VAFKFQTTAPK--SCFMRPPGAILAPGESLIATVFK   79 (201)
Q Consensus        28 ~~k~~~~~LtLtN~S~~~--------VAFKVKTTaP~--~Y~VrP~~GiI~Pg~s~~I~V~l   79 (201)
                      -|+.....++++|+++.+        .|+-|--|.--  .+..+-..+.|.||++..+.+.+
T Consensus        13 vG~d~~v~v~~~N~~~~~l~~v~~~l~~~~v~ytG~~~~~~~~~~~~~~l~p~~~~~~~~~i   74 (107)
T PF00927_consen   13 VGQDFTVSVSFTNPSSEPLRNVSLNLCAFTVEYTGLTRDQFKKEKFEVTLKPGETKSVEVTI   74 (107)
T ss_dssp             TTSEEEEEEEEEE-SSS-EECEEEEEEEEEEECTTTEEEEEEEEEEEEEE-TTEEEEEEEEE
T ss_pred             CCCCEEEEEEEEeCCcCccccceeEEEEEEEEECCcccccEeEEEcceeeCCCCEEEEEEEE
Confidence            578899999999999876        55555544332  25677889999999999999998


No 23 
>PF06030 DUF916:  Bacterial protein of unknown function (DUF916);  InterPro: IPR010317 This family consists of putative cell surface proteins, from Firmicutes, of unknown function. 
Probab=76.38  E-value=20  Score=27.86  Aligned_cols=58  Identities=21%  Similarity=0.213  Sum_probs=40.8

Q ss_pred             EEEccCCCCceEEEEEEEcCCCCeEEEEEe-----eCCCcc--e------------------eecCCceeeCCCCeEEEE
Q 028956           22 LILPISSTFPVRSAIKIKNTSKSHVAFKFQ-----TTAPKS--C------------------FMRPPGAILAPGESLIAT   76 (201)
Q Consensus        22 L~F~~~~~k~~~~~LtLtN~S~~~VAFKVK-----TTaP~~--Y------------------~VrP~~GiI~Pg~s~~I~   76 (201)
                      ..+...+++..+-.++|+|.|++.+.|+|.     |+..+.  |                  .=.|..-.|+|+++..|.
T Consensus        19 FdL~~~P~q~~~l~v~i~N~s~~~~tv~v~~~~A~Tn~nG~I~Y~~~~~~~d~sl~~~~~~~v~~~~~Vtl~~~~sk~V~   98 (121)
T PF06030_consen   19 FDLKVKPGQKQTLEVRITNNSDKEITVKVSANTATTNDNGVIDYSQNNPKKDKSLKYPFSDLVKIPKEVTLPPNESKTVT   98 (121)
T ss_pred             EEEEeCCCCEEEEEEEEEeCCCCCEEEEEEEeeeEecCCEEEEECCCCcccCcccCcchHHhccCCcEEEECCCCEEEEE
Confidence            444456777888999999999999999985     333331  2                  112333578889988888


Q ss_pred             EEe
Q 028956           77 VFK   79 (201)
Q Consensus        77 V~l   79 (201)
                      +++
T Consensus        99 ~~i  101 (121)
T PF06030_consen   99 FTI  101 (121)
T ss_pred             EEE
Confidence            876


No 24 
>PF10633 NPCBM_assoc:  NPCBM-associated, NEW3 domain of alpha-galactosidase;  InterPro: IPR018905 This domain has been named NEW3, but its function is not known. It is found on proteins which are bacterial galactosidases [].; PDB: 1EUT_A 2BZD_A 1WCQ_C 2BER_A 1W8O_A 1EUU_A 1W8N_A.
Probab=76.18  E-value=4.6  Score=28.39  Aligned_cols=52  Identities=21%  Similarity=0.323  Sum_probs=31.6

Q ss_pred             CCCceEEEEEEEcCCCCeE-EEEEeeCCCccee--ecCCce-eeCCCCeEEEEEEe
Q 028956           28 STFPVRSAIKIKNTSKSHV-AFKFQTTAPKSCF--MRPPGA-ILAPGESLIATVFK   79 (201)
Q Consensus        28 ~~k~~~~~LtLtN~S~~~V-AFKVKTTaP~~Y~--VrP~~G-iI~Pg~s~~I~V~l   79 (201)
                      .|...+-.++++|.++.++ ..++.-..|.-..  ..|... .|.||++..+++.+
T Consensus         3 ~G~~~~~~~tv~N~g~~~~~~v~~~l~~P~GW~~~~~~~~~~~l~pG~s~~~~~~V   58 (78)
T PF10633_consen    3 PGETVTVTLTVTNTGTAPLTNVSLSLSLPEGWTVSASPASVPSLPPGESVTVTFTV   58 (78)
T ss_dssp             TTEEEEEEEEEE--SSS-BSS-EEEEE--TTSE---EEEEE--B-TTSEEEEEEEE
T ss_pred             CCCEEEEEEEEEECCCCceeeEEEEEeCCCCccccCCccccccCCCCCEEEEEEEE
Confidence            4566788999999987543 3555555688777  445443 79999999999887


No 25 
>PRK15208 long polar fimbrial chaperone LpfB; Provisional
Probab=74.34  E-value=44  Score=28.63  Aligned_cols=63  Identities=16%  Similarity=0.220  Sum_probs=45.6

Q ss_pred             eeEEccCCCceeEEEccCCCCceEEEEEEEcCCCC-eE-EEEEeeCCC----cceeecCCceeeCCCCeEEEEEEe
Q 028956           10 HDHFYPNGAVSSLILPISSTFPVRSAIKIKNTSKS-HV-AFKFQTTAP----KSCFMRPPGAILAPGESLIATVFK   79 (201)
Q Consensus        10 ~L~i~P~~~~~eL~F~~~~~k~~~~~LtLtN~S~~-~V-AFKVKTTaP----~~Y~VrP~~GiI~Pg~s~~I~V~l   79 (201)
                      .|.++|.    .+.|...   ....+++|+|.+++ ++ .+..-....    .-|.|-|+.-.|+||+...|.|..
T Consensus        22 gv~l~~T----RvI~~~~---~~~~si~i~N~~~~~~~LvQsWv~~~~~~~~~pfivtPPl~rl~p~~~q~lRIi~   90 (228)
T PRK15208         22 GVALSST----RVIYDGS---KKEASLTVNNKSKTEEFLIQSWIDDANGNKKTPFIITPPLFKLDPTKNNVLRIVN   90 (228)
T ss_pred             cEEeCce----EEEEeCC---CceEEEEEEeCCCCCcEEEEEEEECCCCCccCCEEECCCeEEECCCCccEEEEEE
Confidence            5677777    6777764   36789999999863 33 333221111    129999999999999999999987


No 26 
>PRK15290 lfpB fimbrial chaperone protein; Provisional
Probab=73.95  E-value=46  Score=29.06  Aligned_cols=111  Identities=7%  Similarity=0.058  Sum_probs=66.4

Q ss_pred             eeEEccCCCceeEEEccCCCCceEEEEEEEcCCC-CeEEEEEeeC--C-C----cceeecCCceeeCCCCeEEEEEEeec
Q 028956           10 HDHFYPNGAVSSLILPISSTFPVRSAIKIKNTSK-SHVAFKFQTT--A-P----KSCFMRPPGAILAPGESLIATVFKFV   81 (201)
Q Consensus        10 ~L~i~P~~~~~eL~F~~~~~k~~~~~LtLtN~S~-~~VAFKVKTT--a-P----~~Y~VrP~~GiI~Pg~s~~I~V~l~~   81 (201)
                      .|.+++.    .+.|+..   ....+++|+|.++ .++.--.-..  . .    .-|.|-|+.-.|+||+...+.|..  
T Consensus        38 gv~l~~T----RvIy~~~---~~~~sl~v~N~~~~~p~LvQsWvd~~~~~~~~~~pFivtPPlfrl~p~~~q~lRIi~--  108 (243)
T PRK15290         38 GVVIGGT----RVVYLSN---NPDKSISVFSKEEKIPYLIQAWVDPFNKEDKSKAPFTVIPPVSRLEPSQEKVLRIIH--  108 (243)
T ss_pred             eEEECce----EEEEeCC---CceEEEEEEeCCCCCcEEEEEEEecCCCCCcccCCEEEcCCeEEECCCCceEEEEEE--
Confidence            4667777    7777753   4677999999986 4555444332  1 1    139999999999999999999997  


Q ss_pred             cCCCCCCCCCCCCCCCeEEEEEEEeCCCCCChhHhhhccCCCcceeEEeEEEEeCC
Q 028956           82 ELPENNEKPMYQKSRDKFKIISMKVKADVDYVPELFDEQKDQTAAEQILRVVFLNP  137 (201)
Q Consensus        82 e~p~~~e~p~~~~~kDKFlVqs~~v~~~~d~~~~lwk~~~~~~i~e~kLrV~f~~p  137 (201)
                       .... ..|.|.  ---|-+....+++..+. .+  ++ .=+.....++++-|-+.
T Consensus       109 -~~~~-~LP~DR--ESlf~lnv~eIPp~~~~-~~--~n-~L~iair~rIKlFyRP~  156 (243)
T PRK15290        109 -TKGV-SLPDDR--ESVFWLNIKNIPPSASN-KA--TN-SLEIAVKTRIKLFWRPA  156 (243)
T ss_pred             -cCCC-CCCCCe--eEEEEEEEEEcCCCCcc-cc--cc-eEEEEEEEeeeEEEecc
Confidence             2211 123332  23366666666653211 00  00 11223466677777644


No 27 
>TIGR03079 CH4_NH3mon_ox_B methane monooxygenase/ammonia monooxygenase, subunit B. Both ammonia oxidizers such as Nitrosomonas europaea and methanotrophs (obligate methane oxidizers) such as Methylococcus capsulatus each can grow only on their own characteristic substrate. However, both groups have the ability to oxidize both substrates, and so the relevant enzymes must be named here according to their ability to oxidze both. The protein family represented here reflects subunit B of both the particulate methane monooxygenase of methylotrophs and the ammonia monooxygenase of nitrifying bacteria.
Probab=73.91  E-value=8.9  Score=35.82  Aligned_cols=52  Identities=21%  Similarity=0.245  Sum_probs=38.3

Q ss_pred             CCCceEEEEEEEcCCCCeEEEEEeeCCCc-------ceeecCCce--------------eeCCCCeEEEEEEe
Q 028956           28 STFPVRSAIKIKNTSKSHVAFKFQTTAPK-------SCFMRPPGA--------------ILAPGESLIATVFK   79 (201)
Q Consensus        28 ~~k~~~~~LtLtN~S~~~VAFKVKTTaP~-------~Y~VrP~~G--------------iI~Pg~s~~I~V~l   79 (201)
                      +++..+-++++||.++++|-.+==+|+.-       .|...|++.              -|.||||.+|.|..
T Consensus       280 PGR~l~~~~~VTN~g~~~vrlgEF~TA~vRFlN~~~v~~~~~~yP~~lla~GL~v~d~~pI~PGETr~v~v~a  352 (399)
T TIGR03079       280 PGRALRVTMEITNNGDQVISIGEFTTAGIRFMNANGVRVLDPDYPRELLAEGLEVDDQSAIAPGETVEVKMEA  352 (399)
T ss_pred             CCcEEEEEEEEEcCCCCceEEEeEeecceEeeCcccccccCCCChHHHhhccceeCCCCCcCCCcceEEEEEE
Confidence            57788999999999999998774444443       333333332              38999999999998


No 28 
>PRK15249 fimbrial chaperone protein StbB; Provisional
Probab=73.60  E-value=5.6  Score=34.77  Aligned_cols=74  Identities=14%  Similarity=0.068  Sum_probs=44.6

Q ss_pred             cccceeCceeEEccCCCceeEEEccCCCCceEEEEEEEcCCCCeEEEE-EeeCCCcceeecCCceeeCCCCeEEEEE
Q 028956            2 VPVRFLPLHDHFYPNGAVSSLILPISSTFPVRSAIKIKNTSKSHVAFK-FQTTAPKSCFMRPPGAILAPGESLIATV   77 (201)
Q Consensus         2 ~~~~~~p~~L~i~P~~~~~eL~F~~~~~k~~~~~LtLtN~S~~~VAFK-VKTTaP~~Y~VrP~~GiI~Pg~s~~I~V   77 (201)
                      +|+-|=|..| -.|.++.+.|.|....+......|+++|+|..++.|- ++....++ .+....|+|.|+++..+.+
T Consensus       145 IKLFyRP~~L-~~~~~~~~~L~~~~~~~~~~~~~l~v~Nptpyyitl~~l~~~~~~~-~~~~~~~mv~P~s~~~~~l  219 (253)
T PRK15249        145 IKLFYRPDGL-GKPDNLAKKLQIKTVNKGSGKSGIVIVNPQPWFASLSNLNVKVNGA-SYNLDADMIAPFSSQTWWL  219 (253)
T ss_pred             EEEEEccccC-CChhHhhhceEEEEeeccCCCCEEEEECCCceEEEeeeeeeccCCe-ecCCCCceECCCCccEEEc
Confidence            4566667666 4554444567777533111122599999999988775 33211221 1222458999999988764


No 29 
>PRK15195 fimbrial chaperone protein FimC; Provisional
Probab=72.74  E-value=44  Score=28.76  Aligned_cols=61  Identities=15%  Similarity=0.256  Sum_probs=45.1

Q ss_pred             eeEEccCCCceeEEEccCCCCceEEEEEEEcCCCC-eEEEEEee---CC----CcceeecCCceeeCCCCeEEEEEEe
Q 028956           10 HDHFYPNGAVSSLILPISSTFPVRSAIKIKNTSKS-HVAFKFQT---TA----PKSCFMRPPGAILAPGESLIATVFK   79 (201)
Q Consensus        10 ~L~i~P~~~~~eL~F~~~~~k~~~~~LtLtN~S~~-~VAFKVKT---Ta----P~~Y~VrP~~GiI~Pg~s~~I~V~l   79 (201)
                      .|.+++.    .+.|....   ...+++|+|.++. +..  |++   +.    ..-|.|-|+.=-|+||+...|.|..
T Consensus        26 gi~i~~T----RvIy~~~~---~~~si~l~N~~~~~~~L--vQsWv~~~~~~~~~pfivtPPlfrl~p~~~q~lRIi~   94 (229)
T PRK15195         26 GIALGAT----RVIYPADA---KQTSLAIRNSHTNERYL--VNSWIENSSGVKEKSFIVTPPLFVSEPKSENTLRIIY   94 (229)
T ss_pred             eEEECCe----EEEEeCCC---ceEEEEEEeCCCCccEE--EEEEecCCCCCccCCEEEcCCeEEECCCCceEEEEEE
Confidence            4667777    67776543   4589999999864 333  442   11    1249999999999999999999997


No 30 
>PRK15192 fimbrial chaperone BcfG; Provisional
Probab=71.61  E-value=68  Score=27.87  Aligned_cols=84  Identities=12%  Similarity=0.112  Sum_probs=56.5

Q ss_pred             ceeEEccCCCceeEEEccCCCCceEEEEEEEcCCCCeEEEEEeeC----------C----CcceeecCCceeeCCCCeEE
Q 028956            9 LHDHFYPNGAVSSLILPISSTFPVRSAIKIKNTSKSHVAFKFQTT----------A----PKSCFMRPPGAILAPGESLI   74 (201)
Q Consensus         9 ~~L~i~P~~~~~eL~F~~~~~k~~~~~LtLtN~S~~~VAFKVKTT----------a----P~~Y~VrP~~GiI~Pg~s~~   74 (201)
                      -.|.++..    .+.|+..   ....+++|.|.++.+  |=|++.          .    ..-|.|-|+.-.|+||+...
T Consensus        22 Agi~l~~T----RvIy~~~---~k~~sv~l~N~~~~p--~LvQswv~~~~~w~~~~~~~~~~PFivtPPlfrl~p~~~~~   92 (234)
T PRK15192         22 AGVVIGGT----RFIYHAG---APALSVPVSNHSEAS--WLIDTHILPGGRWPGTKNEGNITPFVVTPPLFMLSARQENS   92 (234)
T ss_pred             eeEEeCce----EEEEcCC---CceEEEEEEeCCCCc--EEEEEEeccCccccccCCccccCCEEEcCCeEEECCCCceE
Confidence            35667766    6777663   467899999999876  555551          1    11399999999999999999


Q ss_pred             EEEEeeccCCCCCCCCCCCCCCCeEEEEEEEeCC
Q 028956           75 ATVFKFVELPENNEKPMYQKSRDKFKIISMKVKA  108 (201)
Q Consensus        75 I~V~l~~e~p~~~e~p~~~~~kDKFlVqs~~v~~  108 (201)
                      +.|..   ...  ..|.|.  ---|-+....+++
T Consensus        93 lRI~~---~~~--~LP~DR--ESlf~lnv~~IPp  119 (234)
T PRK15192         93 MRVVY---TGA--PLPADR--ESLFTLSIAAIPS  119 (234)
T ss_pred             EEEEE---CCC--CCCCcc--eEEEEEEEEecCC
Confidence            99997   221  123332  2335666666665


No 31 
>PRK15254 fimbrial chaperone protein StdC; Provisional
Probab=70.69  E-value=65  Score=28.02  Aligned_cols=87  Identities=13%  Similarity=0.125  Sum_probs=55.1

Q ss_pred             eeEEccCCCceeEEEccCCCCceEEEEEEEcCCC-CeEEEEEee-C-C--C-cceeecCCceeeCCCCeEEEEEEeeccC
Q 028956           10 HDHFYPNGAVSSLILPISSTFPVRSAIKIKNTSK-SHVAFKFQT-T-A--P-KSCFMRPPGAILAPGESLIATVFKFVEL   83 (201)
Q Consensus        10 ~L~i~P~~~~~eL~F~~~~~k~~~~~LtLtN~S~-~~VAFKVKT-T-a--P-~~Y~VrP~~GiI~Pg~s~~I~V~l~~e~   83 (201)
                      .+.+++.    .+.|...   ....+++|.|.++ .++-=..-. . .  + .-|.|-|+.-.|+||+...|.|..   .
T Consensus        17 ~v~l~~T----RvIy~~~---~~~~sv~v~N~~~~~p~LvQsWv~d~~~~~~~pFivtPPlfrl~p~~~~~lRI~~---~   86 (239)
T PRK15254         17 AVNVDRT----RIIMDAP---QKTVAITLNNDDKTTPFLAQSWVTDADGVRTDALMALPPLQRIDAGQKSQVRITQ---V   86 (239)
T ss_pred             eEEECce----EEEEeCC---CceEEEEEEeCCCCCcEEEEEEEecCCCCCcCCEEEcCCeEEECCCCceEEEEEE---c
Confidence            4667776    6777753   4678999999986 354333211 1 1  1 249999999999999999999987   2


Q ss_pred             CC-CCCCCCCCCCCCeEEEEEEEeCC
Q 028956           84 PE-NNEKPMYQKSRDKFKIISMKVKA  108 (201)
Q Consensus        84 p~-~~e~p~~~~~kDKFlVqs~~v~~  108 (201)
                      .. ....|.|.  -.-|-+....+++
T Consensus        87 ~~~~~~lP~DR--ESlf~lnv~~IP~  110 (239)
T PRK15254         87 RGLTDKLPQDR--ETLFWFNVRGVPP  110 (239)
T ss_pred             ccCCCCCCCCc--eEEEEEEEEEcCC
Confidence            11 11223332  2335555555655


No 32 
>PRK15192 fimbrial chaperone BcfG; Provisional
Probab=69.96  E-value=9.2  Score=33.24  Aligned_cols=67  Identities=13%  Similarity=0.184  Sum_probs=45.6

Q ss_pred             cccceeCceeEEccCCCceeEEEccCCCCceEEEEEEEcCCCCeEEEE-EeeCCCcceeecCCceeeCCCCeEEEEE
Q 028956            2 VPVRFLPLHDHFYPNGAVSSLILPISSTFPVRSAIKIKNTSKSHVAFK-FQTTAPKSCFMRPPGAILAPGESLIATV   77 (201)
Q Consensus         2 ~~~~~~p~~L~i~P~~~~~eL~F~~~~~k~~~~~LtLtN~S~~~VAFK-VKTTaP~~Y~VrP~~GiI~Pg~s~~I~V   77 (201)
                      +++-|=|..|.-.|.++.+.|.|....+     .|+++|+|..+|.|- ++-. .+.  + ...++|+|.++..+.+
T Consensus       135 iKlFYRP~~L~~~~~~a~~~L~~~~~~~-----~l~v~NpTPyyvtl~~l~v~-~~~--~-~~~~miaPfs~~~~~~  202 (234)
T PRK15192        135 LKLLYRPEGLAGNPQQAYRHLIWSLTPD-----GATVRNPTPYYVTLFLLRAN-ERA--Q-DNAGVVAPFATRQTDW  202 (234)
T ss_pred             eeEEEccccccCChhhhhhheEEEEcCC-----EEEEECCCCcEEEEEeEEEc-Ccc--c-CCCceECCCCccEEec
Confidence            3556777777777766666888886533     299999999999875 2322 222  2 2457899999887764


No 33 
>COG3121 FimC P pilus assembly protein, chaperone PapD [Cell motility and secretion / Intracellular trafficking and secretion]
Probab=69.44  E-value=9.7  Score=32.89  Aligned_cols=69  Identities=14%  Similarity=0.076  Sum_probs=47.3

Q ss_pred             cccceeCceeEEccCCCceeEEEccCCCCceEEEEEEEcCCCCeEEEE--EeeCCCcceeecCCceeeCCCCeEEEEE
Q 028956            2 VPVRFLPLHDHFYPNGAVSSLILPISSTFPVRSAIKIKNTSKSHVAFK--FQTTAPKSCFMRPPGAILAPGESLIATV   77 (201)
Q Consensus         2 ~~~~~~p~~L~i~P~~~~~eL~F~~~~~k~~~~~LtLtN~S~~~VAFK--VKTTaP~~Y~VrP~~GiI~Pg~s~~I~V   77 (201)
                      +++-|=|-.|.-.|.++...|.|....     ..|+++|++..++.|-  .-+. .++-.. -+.+.|.|+++..+.+
T Consensus       138 iKlf~RP~~l~~~~~~~~~~L~~~~~~-----~~l~v~Nptpy~vtl~~~~l~~-~~~~~~-~~~~mv~P~s~~~~~l  208 (235)
T COG3121         138 IKLFYRPAGLAGPPAEAPKKLTWSRSG-----NLLTVKNPTPYYVTLANLTLNV-GGRKLG-LNSGMVAPFSTRQFPL  208 (235)
T ss_pred             eeEEECcccCCCChhHhhheEEEEEcC-----CEEEEECCCCcEEEEEEEEEee-CceecC-CCcceECCCccceeec
Confidence            345566666766665444567776654     6899999999999987  3333 333322 7889999999988443


No 34 
>PRK15188 fimbrial chaperone protein BcfB; Provisional
Probab=68.06  E-value=80  Score=27.31  Aligned_cols=111  Identities=14%  Similarity=0.191  Sum_probs=64.7

Q ss_pred             eeEEccCCCceeEEEccCCCCceEEEEEEEcCCCC-eEE-EEE-eeCC---CcceeecCCceeeCCCCeEEEEEEeeccC
Q 028956           10 HDHFYPNGAVSSLILPISSTFPVRSAIKIKNTSKS-HVA-FKF-QTTA---PKSCFMRPPGAILAPGESLIATVFKFVEL   83 (201)
Q Consensus        10 ~L~i~P~~~~~eL~F~~~~~k~~~~~LtLtN~S~~-~VA-FKV-KTTa---P~~Y~VrP~~GiI~Pg~s~~I~V~l~~e~   83 (201)
                      .|.+++.    .+.|+..   ....+++|+|.+++ +.. ... ....   ..-|.|.|+.--|+||+...+.|..   .
T Consensus        28 gi~l~~T----RvIy~~~---~~~~sv~i~N~~~~~p~LvQsWv~~~~~~~~~pFivtPPlfrl~~~~~~~lRI~~---~   97 (228)
T PRK15188         28 GIALGAT----RVIYPQG---SKQTSLPIINSSASNVFLIQSWVANADGSRSTDFIITPPLFVIQPKKENILRIMY---V   97 (228)
T ss_pred             eEEECcE----EEEEcCC---CceEEEEEEeCCCCccEEEEEEEecCCCCccCCEEEcCCeEEECCCCceEEEEEE---C
Confidence            4667777    6777664   46789999999864 333 221 1111   1249999999999999999999997   2


Q ss_pred             CCCCCCCCCCCCCCeEEEEEEEeCCCCCChhHhhhccCC-CcceeEEeEEEEeCCC
Q 028956           84 PENNEKPMYQKSRDKFKIISMKVKADVDYVPELFDEQKD-QTAAEQILRVVFLNPE  138 (201)
Q Consensus        84 p~~~e~p~~~~~kDKFlVqs~~v~~~~d~~~~lwk~~~~-~~i~e~kLrV~f~~p~  138 (201)
                      ..  ..|.|..  --|-+....+++..+..    .+... ......+|++-|-+..
T Consensus        98 ~~--~lP~DRE--Slf~lnv~~IP~~~~~~----~~~n~l~ia~r~~IKLFyRP~~  145 (228)
T PRK15188         98 GP--SLPTDRE--SVFYLNSKAIPSVDKNK----LTGNSLQIATQSVIKLFIRPKN  145 (228)
T ss_pred             CC--CCCCCce--EEEEEEEEecCCCCccc----cccceEEEEEeeeEEEEECCcc
Confidence            21  1233322  33556666666532210    01111 2233666777776443


No 35 
>PRK09926 putative chaperone protein EcpD; Provisional
Probab=67.81  E-value=11  Score=32.68  Aligned_cols=73  Identities=15%  Similarity=0.124  Sum_probs=46.1

Q ss_pred             cccceeCceeEEccCCCceeEEEccCCCCceEEEEEEEcCCCCeEEEE-EeeC-CCcceeecCCceeeCCCCeEEEEE
Q 028956            2 VPVRFLPLHDHFYPNGAVSSLILPISSTFPVRSAIKIKNTSKSHVAFK-FQTT-APKSCFMRPPGAILAPGESLIATV   77 (201)
Q Consensus         2 ~~~~~~p~~L~i~P~~~~~eL~F~~~~~k~~~~~LtLtN~S~~~VAFK-VKTT-aP~~Y~VrP~~GiI~Pg~s~~I~V   77 (201)
                      +|+-|=|-.|.-.|.++.+.|.|....+. ....|+++|+|..++.|- ++-. .-+.+.+  ..+.|.|+++..+.+
T Consensus       142 IKLFyRP~~l~~~~~~~~~~L~~~~~~~~-~~~~L~v~Nptpy~itl~~l~~~~~g~~~~~--~~~mi~P~s~~~~~l  216 (246)
T PRK09926        142 IKLFYRPDGLKGNPSEAPLALKWSWAGSE-GKASLRVTNPTPYYVSFSSGDLEAGGKRYPV--DSKMIAPFSDESMKV  216 (246)
T ss_pred             EEEEEcCccCCCChhHhhhccEEEEecCC-CeEEEEEECCCceEEEEEeeeeecCCeeccc--CcceECCCCcceEec
Confidence            35556666666555555567888754332 235699999999988764 3322 2222322  347899999888764


No 36 
>PF11611 DUF4352:  Domain of unknown function (DUF4352);  InterPro: IPR021652 This entry is represented by Bacteriophage A118, Gp32. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This entry represents a group of putative lipoproteins of unknown function.; PDB: 3CFU_A.
Probab=67.40  E-value=25  Score=25.99  Aligned_cols=52  Identities=17%  Similarity=0.178  Sum_probs=32.1

Q ss_pred             CCCceEEEEEEEcCCCCeEE-----EEEeeCCCcceeecC---------CceeeCCCCeEEEEEEe
Q 028956           28 STFPVRSAIKIKNTSKSHVA-----FKFQTTAPKSCFMRP---------PGAILAPGESLIATVFK   79 (201)
Q Consensus        28 ~~k~~~~~LtLtN~S~~~VA-----FKVKTTaP~~Y~VrP---------~~GiI~Pg~s~~I~V~l   79 (201)
                      .++-..-.++++|.++.++.     |++.+..-..|....         ..+.|.||+++.-.+..
T Consensus        34 g~~fv~v~v~v~N~~~~~~~~~~~~f~l~d~~g~~~~~~~~~~~~~~~~~~~~i~pG~~~~g~l~F   99 (123)
T PF11611_consen   34 GNKFVVVDVTVKNNGDEPLDFSPSDFKLYDSDGNKYDPDFSASSNDNDLFSETIKPGESVTGKLVF   99 (123)
T ss_dssp             -SEEEEEEEEEEE-SSS-EEEEGGGEEEE-TT--B--EEE-CCCTTTB--EEEE-TT-EEEEEEEE
T ss_pred             CCEEEEEEEEEEECCCCcEEecccceEEEeCCCCEEcccccchhccccccccEECCCCEEEEEEEE
Confidence            34456789999999987775     788876656665332         45899999999999886


No 37 
>PRK15246 fimbrial assembly chaperone StbE; Provisional
Probab=67.19  E-value=11  Score=32.75  Aligned_cols=66  Identities=15%  Similarity=0.168  Sum_probs=44.7

Q ss_pred             ccceeCceeEEccCCCceeEEEccCCCCceEEEEEEEcCCCCeEEEEEeeCCCcceeecCCceeeCCCCeEEEEE
Q 028956            3 PVRFLPLHDHFYPNGAVSSLILPISSTFPVRSAIKIKNTSKSHVAFKFQTTAPKSCFMRPPGAILAPGESLIATV   77 (201)
Q Consensus         3 ~~~~~p~~L~i~P~~~~~eL~F~~~~~k~~~~~LtLtN~S~~~VAFKVKTTaP~~Y~VrP~~GiI~Pg~s~~I~V   77 (201)
                      ++-|=|-.|.-.|.++.+.|.|....+     .|++.|+|..+|.|---.-.-+.  +  ..++|.|+++..+.+
T Consensus       127 KlFyRP~~L~~~~~~a~~~L~~~~~~~-----~l~v~NpTPyyvtl~~l~~~~~~--~--~~~mi~P~s~~~~~~  192 (233)
T PRK15246        127 KILIRPTGLKAPTEAEEKKLRFIAKEN-----TIRIVNPTSWYMSLTLTMDNKKS--I--GDIMVAPKTALDVPL  192 (233)
T ss_pred             EEEECCcccCCChhhhhhccEEEEcCC-----EEEEECCCCcEEEEEeEEECCcc--c--CcceECCCCccEEEc
Confidence            455667777666666667888885432     39999999999988632222222  2  246899999887753


No 38 
>PRK15290 lfpB fimbrial chaperone protein; Provisional
Probab=65.82  E-value=11  Score=32.80  Aligned_cols=66  Identities=14%  Similarity=0.214  Sum_probs=45.2

Q ss_pred             cccceeCceeEEccCCCceeEEEccCCCCceEEEEEEEcCCCCeEEE-EEeeCCCcceeecCCceeeCCCCeEEEEE
Q 028956            2 VPVRFLPLHDHFYPNGAVSSLILPISSTFPVRSAIKIKNTSKSHVAF-KFQTTAPKSCFMRPPGAILAPGESLIATV   77 (201)
Q Consensus         2 ~~~~~~p~~L~i~P~~~~~eL~F~~~~~k~~~~~LtLtN~S~~~VAF-KVKTTaP~~Y~VrP~~GiI~Pg~s~~I~V   77 (201)
                      +++-|=|-.|.-.|.++.+.|.|....     ..|+++|+|..++.| .++-. .+  .+  ..+.|.|+++..+.+
T Consensus       149 IKlFyRP~~L~~~~~~a~~~L~~~~~g-----~~l~v~NptPyyvtl~~l~v~-g~--~~--~~~mI~P~s~~~~~~  215 (243)
T PRK15290        149 IKLFWRPASIRLIPEDAAPKVKWRREG-----RNLIAENPNPIHISVMDVIVD-GH--DV--PLNMIRPFETLTLPL  215 (243)
T ss_pred             eeEEEeccccCCChhHhhhhcEEEEcC-----CEEEEECCCcEEEEEEEEeec-Cc--cc--CCceECCCCceEEEc
Confidence            456677777766666555678888432     259999999999987 45432 22  22  247899999987753


No 39 
>COG3121 FimC P pilus assembly protein, chaperone PapD [Cell motility and secretion / Intracellular trafficking and secretion]
Probab=65.47  E-value=89  Score=26.91  Aligned_cols=112  Identities=12%  Similarity=0.187  Sum_probs=73.9

Q ss_pred             eeEEccCCCceeEEEccCCCCceEEEEEEEcCCCCeEEEEEeeC-------CCcceeecCCceeeCCCCeEEEEEEeecc
Q 028956           10 HDHFYPNGAVSSLILPISSTFPVRSAIKIKNTSKSHVAFKFQTT-------APKSCFMRPPGAILAPGESLIATVFKFVE   82 (201)
Q Consensus        10 ~L~i~P~~~~~eL~F~~~~~k~~~~~LtLtN~S~~~VAFKVKTT-------aP~~Y~VrP~~GiI~Pg~s~~I~V~l~~e   82 (201)
                      .+.+++.    .+.|+..   ....+++|.|.++.++.-.+..-       ...-|-|-|+.=.|+||+...|.|.+   
T Consensus        28 ~v~i~~T----RiI~~~~---~k~~sl~l~N~~~~p~LvQ~wvd~~~~~~~~~~pfvvtPPv~rl~p~~~q~vRi~~---   97 (235)
T COG3121          28 GVVLGGT----RIIYPAG---DKETSLTLRNDGNQPYLVQSWVDDGLEPEKSTVPFVVTPPVFRLEPGQEQQLRILY---   97 (235)
T ss_pred             eEEecce----EEEEeCC---CceeEEEEEcCCCCCEEEEEEEcCCCCCccccCCEEecCCeEEECCCCccEEEEEe---
Confidence            4556666    6776665   36789999998889998886543       23359999999999999999999998   


Q ss_pred             CCCCCCCCCCCCCCCeEEEEEEEeCCCCCChhHhhhccCC---CcceeEEeEEEEeCCCCCC
Q 028956           83 LPENNEKPMYQKSRDKFKIISMKVKADVDYVPELFDEQKD---QTAAEQILRVVFLNPERPE  141 (201)
Q Consensus        83 ~p~~~e~p~~~~~kDKFlVqs~~v~~~~d~~~~lwk~~~~---~~i~e~kLrV~f~~p~~ps  141 (201)
                      .+. . .|.|.  ..-|-+....++....      +...+   +....++|++-|-++.-..
T Consensus        98 ~~~-~-lP~dr--Eslf~lnv~eIPp~~~------~~~~~n~lq~a~r~riKlf~RP~~l~~  149 (235)
T COG3121          98 TGN-K-LPADR--ESLFRLNVDEIPPKSK------DDKGPNVLQLALRSRIKLFYRPAGLAG  149 (235)
T ss_pred             cCC-C-CCCCc--eeEEEEEeeecCCCCc------ccCCcceEEEEeeeeeeEEECcccCCC
Confidence            332 1 23432  3457777777765431      11111   2334678888777654443


No 40 
>PF12690 BsuPI:  Intracellular proteinase inhibitor;  InterPro: IPR020481 BsuPI is a intracellular proteinase inhibitor that directly regulates the major intracellular proteinase (ISP-1) activity in vivo. It inhibits ISP-1 in the early stages of sporulation and then may be inactivated by a membrane-bound proteinase [].; PDB: 3ISY_A.
Probab=65.12  E-value=32  Score=24.91  Aligned_cols=47  Identities=21%  Similarity=0.282  Sum_probs=22.3

Q ss_pred             EEEEEEEcCCCCeEEEEEeeCCCcceeec-------------------CCceeeCCCCeEEEEEEe
Q 028956           33 RSAIKIKNTSKSHVAFKFQTTAPKSCFMR-------------------PPGAILAPGESLIATVFK   79 (201)
Q Consensus        33 ~~~LtLtN~S~~~VAFKVKTTaP~~Y~Vr-------------------P~~GiI~Pg~s~~I~V~l   79 (201)
                      .-.|+|+|.++.+|-+.+.|..-=-|.|+                   -..=.|+||++....+++
T Consensus         3 ~~~l~v~N~s~~~v~l~f~sgq~~D~~v~d~~g~~vwrwS~~~~FtQal~~~~l~pGe~~~~~~~~   68 (82)
T PF12690_consen    3 EFTLTVTNNSDEPVTLQFPSGQRYDFVVKDKEGKEVWRWSDGKMFTQALQEETLEPGESLTYEETW   68 (82)
T ss_dssp             EEEEEEEE-SSS-EEEEESSS--EEEEEE-TT--EEEETTTT-------EEEEE-TT-EEEEEEEE
T ss_pred             EEEEEEEeCCCCeEEEEeCCCCEEEEEEECCCCCEEEEecCCchhhheeeEEEECCCCEEEEEEEE
Confidence            44666777766666665533222222222                   223356777777777776


No 41 
>TIGR02745 ccoG_rdxA_fixG cytochrome c oxidase accessory protein FixG. Member of this ferredoxin-like protein family are found exclusively in species with an operon encoding the cbb3 type of cytochrome c oxidase (cco-cbb3), and near the cco-cbb3 operon in about half the cases. The cco-cbb3 is found in a variety of proteobacteria and almost nowhere else, and is associated with oxygen use under microaerobic conditions. Some (but not all) of these proteobacteria are also nitrogen-fixing, hence the gene symbol fixG. FixG was shown essential for functional cco-cbb3 expression in Bradyrhizobium japonicum.
Probab=64.58  E-value=75  Score=30.15  Aligned_cols=72  Identities=15%  Similarity=0.161  Sum_probs=49.4

Q ss_pred             eeCceeEEccCCCceeEEEccCCCC-ceEEEEEEEcCCCCeEEEEEeeCCCcceeec-C-CceeeCCCCeEEEEEEe
Q 028956            6 FLPLHDHFYPNGAVSSLILPISSTF-PVRSAIKIKNTSKSHVAFKFQTTAPKSCFMR-P-PGAILAPGESLIATVFK   79 (201)
Q Consensus         6 ~~p~~L~i~P~~~~~eL~F~~~~~k-~~~~~LtLtN~S~~~VAFKVKTTaP~~Y~Vr-P-~~GiI~Pg~s~~I~V~l   79 (201)
                      ..|+.|.+..+-  .-|....+.+. ...-+++|.|.+.++..|.++........+. + +.=.|+||+...+.|++
T Consensus       323 r~~~~~~v~r~r--~~l~~~~~~g~i~N~Y~~~i~Nk~~~~~~~~l~v~g~~~~~~~~~~~~i~v~~g~~~~~~v~v  397 (434)
T TIGR02745       323 REPMDLNVLRDR--NLLYVRNSDGVVENTYTLKILNKTEQPHEYYLSVLGLPGIKIEGPGAPIHVKAGEKVKLPVFL  397 (434)
T ss_pred             CCceEEEEEecC--CcceEECCCCcEEEEEEEEEEECCCCCEEEEEEEecCCCcEEEcCCceEEECCCCEEEEEEEE
Confidence            457788887773  22444444443 5577999999999888888876554443333 2 23489999999888887


No 42 
>PRK15218 fimbrial chaperone protein PegB; Provisional
Probab=64.39  E-value=66  Score=27.71  Aligned_cols=101  Identities=18%  Similarity=0.292  Sum_probs=60.8

Q ss_pred             eEEEccCCCCceEEEEEEEcCCCCeEEEEEee---CC-----C----cceeecCCceeeCCCCeEEEEEEeeccCCCCCC
Q 028956           21 SLILPISSTFPVRSAIKIKNTSKSHVAFKFQT---TA-----P----KSCFMRPPGAILAPGESLIATVFKFVELPENNE   88 (201)
Q Consensus        21 eL~F~~~~~k~~~~~LtLtN~S~~~VAFKVKT---Ta-----P----~~Y~VrP~~GiI~Pg~s~~I~V~l~~e~p~~~e   88 (201)
                      .+.|+..   ....+++|.|.++.+  |=|++   +.     |    .-|.|.|+.=.|+||+...+.|..   .. . .
T Consensus        26 RvIy~~~---~~~~si~i~N~~~~p--yLvQsWvd~~~~~~~~~~~~~pFivtPPlfRl~p~~~~~lRI~~---~~-~-~   95 (226)
T PRK15218         26 RIIYPAQ---KKDITVQLMNDGKRS--SLIQAWIDNGDTSLPPEKLQVPFIMTPPVIRVAANSGQQLKIKK---LA-N-N   95 (226)
T ss_pred             EEEEcCC---CcEEEEEEEcCCCCc--EEEEEEEeCCCCCCCcccccCCEEECCCeEEECCCCceEEEEEE---CC-C-C
Confidence            5666643   467799999999876  55554   11     1    149999999999999999999997   22 1 1


Q ss_pred             CCCCCCCCCeEEEEEEEeCCCCCChhHhhhccCC-CcceeEEeEEEEeCC
Q 028956           89 KPMYQKSRDKFKIISMKVKADVDYVPELFDEQKD-QTAAEQILRVVFLNP  137 (201)
Q Consensus        89 ~p~~~~~kDKFlVqs~~v~~~~d~~~~lwk~~~~-~~i~e~kLrV~f~~p  137 (201)
                      .|.|.  ---|-+....+++..+..    +.... +.....+|++-|-+.
T Consensus        96 LP~DR--ESlfwlnv~~IPp~~~~~----~~~n~L~iairtrIKLfYRP~  139 (226)
T PRK15218         96 LPGDR--ESLFYLNVLDIPPNSDEN----KDKNIIKFALQNRIKLIYRPP  139 (226)
T ss_pred             CCcce--eEEEEEEEEEcCCCCCCc----CcCcEEEEEeeeEEEEEEccc
Confidence            23332  223666666666532210    01111 223366677767543


No 43 
>PRK15233 putative fimbrial chaperone protein SefB; Provisional
Probab=63.73  E-value=96  Score=27.24  Aligned_cols=69  Identities=13%  Similarity=0.051  Sum_probs=46.7

Q ss_pred             eEEEEEEEcCCCCeEEEEEee---CC----CcceeecCCceeeCCCCeEEEEEEeeccCCCCCCCCCCCCCCCeEEEEEE
Q 028956           32 VRSAIKIKNTSKSHVAFKFQT---TA----PKSCFMRPPGAILAPGESLIATVFKFVELPENNEKPMYQKSRDKFKIISM  104 (201)
Q Consensus        32 ~~~~LtLtN~S~~~VAFKVKT---Ta----P~~Y~VrP~~GiI~Pg~s~~I~V~l~~e~p~~~e~p~~~~~kDKFlVqs~  104 (201)
                      ...+++|.|.++.+  |-|++   +.    ..-|.|.|+.=.|+||+...+.|..   . +. ..|.|..  --|.+...
T Consensus        56 ~~~sl~i~N~~~~p--~LvQsWvd~~~~~~~~pFiVtPPLfRLep~~~~~lRIi~---~-~~-~LP~DRE--Slfwlnv~  126 (246)
T PRK15233         56 PSTSFWIMNEKEYP--ILVQTQVYNDDKSSKAPFIVTPPILKVESNARTRLKVIP---T-SN-LFNKNEE--SLYWLCVK  126 (246)
T ss_pred             cEEEEEEEcCCCCc--EEEEEEEecCCCCccCCEEECCCeEEECCCCceEEEEEE---C-CC-CCCcCce--EEEEEEEE
Confidence            67899999987776  55554   11    1249999999999999999999997   2 21 2233321  12666666


Q ss_pred             EeCCC
Q 028956          105 KVKAD  109 (201)
Q Consensus       105 ~v~~~  109 (201)
                      .+|+.
T Consensus       127 ~IPp~  131 (246)
T PRK15233        127 GVPPL  131 (246)
T ss_pred             EcCCC
Confidence            66653


No 44 
>PRK15295 fimbrial assembly chaperone SthB; Provisional
Probab=62.16  E-value=15  Score=31.58  Aligned_cols=67  Identities=12%  Similarity=0.177  Sum_probs=44.0

Q ss_pred             ccceeCceeEE-ccCCCceeEEEccCCCCceEEEEEEEcCCCCeEEEE-EeeCCCcceeecCCceeeCCCCeEEEEEE
Q 028956            3 PVRFLPLHDHF-YPNGAVSSLILPISSTFPVRSAIKIKNTSKSHVAFK-FQTTAPKSCFMRPPGAILAPGESLIATVF   78 (201)
Q Consensus         3 ~~~~~p~~L~i-~P~~~~~eL~F~~~~~k~~~~~LtLtN~S~~~VAFK-VKTTaP~~Y~VrP~~GiI~Pg~s~~I~V~   78 (201)
                      ++-|=|-.|.- .|.++.+.|.|....+     .|+++|+|..++.|- ++... +.  +. +.|.|.|+++..+.+-
T Consensus       130 KLFyRP~~L~~~~~~~~~~~L~~~~~~~-----~l~v~NptPyyitl~~l~~~~-~~--~~-~~~mI~P~s~~~~~~~  198 (226)
T PRK15295        130 KLIYRPPALTKSTPDSQSQQLKWQTAGD-----VITVNNPTPYYMNFASVTLNS-HE--VK-SATFVPPKSSASFKLG  198 (226)
T ss_pred             eEEEchhhcCCCChhHhhhccEEEEcCC-----EEEEECCCceEEEEEEEEECC-cc--cC-CCceECCCCccEEEcc
Confidence            44555666654 3454556788885432     499999999999764 54432 22  22 3589999999888643


No 45 
>PRK15224 pili assembly chaperone protein SafB; Provisional
Probab=61.62  E-value=69  Score=27.88  Aligned_cols=76  Identities=12%  Similarity=0.166  Sum_probs=50.9

Q ss_pred             eEEEccCCCCceEEEEEEEcCCCCeEEEEEee----CC---CcceeecCCceeeCCCCeEEEEEEeeccCCCCCCCCCCC
Q 028956           21 SLILPISSTFPVRSAIKIKNTSKSHVAFKFQT----TA---PKSCFMRPPGAILAPGESLIATVFKFVELPENNEKPMYQ   93 (201)
Q Consensus        21 eL~F~~~~~k~~~~~LtLtN~S~~~VAFKVKT----Ta---P~~Y~VrP~~GiI~Pg~s~~I~V~l~~e~p~~~e~p~~~   93 (201)
                      .+.|+..   ....+++|.|.++.+  |=|++    ..   ..-|.|.|+.=.|+|++...+.|..   .. . ..|.|.
T Consensus        36 RvIy~~~---~k~~sl~v~N~~~~p--yLvQsWvd~~~~~~~~pFivtPPlfRlep~~~~~lRI~~---~~-~-~LP~DR  105 (237)
T PRK15224         36 RVIYHAG---TAGATLSVSNPQNYP--ILVQSSVKAADKSSPAPFLVMPPLFRLEANQQSQLRIVR---TG-G-DMPTDR  105 (237)
T ss_pred             EEEEeCC---CcEEEEEEEcCCCCc--EEEEEEEeCCCCCccCCEEECCCeEEECCCCceEEEEEE---CC-C-CCCCce
Confidence            5665543   467899999998876  66665    11   1239999999999999999999997   22 1 123332


Q ss_pred             CCCCeEEEEEEEeCC
Q 028956           94 KSRDKFKIISMKVKA  108 (201)
Q Consensus        94 ~~kDKFlVqs~~v~~  108 (201)
                      .  --|-+....+++
T Consensus       106 E--SlFwlnv~~IPp  118 (237)
T PRK15224        106 E--TLQWVCIKAVPP  118 (237)
T ss_pred             e--EEEEEEEEEcCC
Confidence            2  235555555665


No 46 
>PRK15188 fimbrial chaperone protein BcfB; Provisional
Probab=61.56  E-value=15  Score=31.81  Aligned_cols=65  Identities=20%  Similarity=0.185  Sum_probs=43.3

Q ss_pred             cccceeCceeEEccCCCceeEEEccCCCCceEEEEEEEcCCCCeEEEEEeeCCCcceeecCCceeeCCCCeEEE
Q 028956            2 VPVRFLPLHDHFYPNGAVSSLILPISSTFPVRSAIKIKNTSKSHVAFKFQTTAPKSCFMRPPGAILAPGESLIA   75 (201)
Q Consensus         2 ~~~~~~p~~L~i~P~~~~~eL~F~~~~~k~~~~~LtLtN~S~~~VAFKVKTTaP~~Y~VrP~~GiI~Pg~s~~I   75 (201)
                      +++-|=|..|.-.|.++.+.|.|....+     .|+++|+|..+|.|-=-+-+-+.  +  ..++|+|.++..+
T Consensus       137 IKLFyRP~~l~~~~~~a~~~L~~~~~~~-----~l~v~NpTPyyvtl~~l~v~~~~--~--~~~mi~P~s~~~~  201 (228)
T PRK15188        137 IKLFIRPKNLAEAPAHAPSTLRCRNERG-----QLTITNPSPYYVSMVELYSAGKK--L--PNTMVPPKGAITL  201 (228)
T ss_pred             EEEEECCccCCCChhhhhhceEEEEcCC-----EEEEECCCCcEEEEEEEEECCcc--c--CCceECCCCccee
Confidence            3566778788777776667889986533     49999999999987421112222  2  2367888888543


No 47 
>PRK11385 putativi pili assembly chaperone; Provisional
Probab=60.75  E-value=17  Score=31.58  Aligned_cols=67  Identities=12%  Similarity=0.098  Sum_probs=46.0

Q ss_pred             cccceeCceeEEccCCCceeEEEccCCCCceEEEEEEEcCCCCeEEEE-EeeCCCcceeecCCceeeCCCCeEEEEE
Q 028956            2 VPVRFLPLHDHFYPNGAVSSLILPISSTFPVRSAIKIKNTSKSHVAFK-FQTTAPKSCFMRPPGAILAPGESLIATV   77 (201)
Q Consensus         2 ~~~~~~p~~L~i~P~~~~~eL~F~~~~~k~~~~~LtLtN~S~~~VAFK-VKTTaP~~Y~VrP~~GiI~Pg~s~~I~V   77 (201)
                      +++-|=|-.|.-.|.++.+.|.|....+     .|+++|+|..++.|- ++-. .+  .+ .+.++|+|.++..+.+
T Consensus       140 iKLFyRP~~L~~~~~~a~~~L~~~~~g~-----~l~v~NpTPyyvtl~~l~~~-~~--~~-~~~~mi~Pfs~~~~~~  207 (236)
T PRK11385        140 FKLFWRPEGLPGDPLEAYQQLRWTRNSQ-----GVQLTNPTPYYINLIQVSVN-GK--AL-SNAGVVPPKSQRQTSW  207 (236)
T ss_pred             EEEEEcccccCCChhhhhhheEEEEcCC-----EEEEECCCCcEEEEEeEEEC-Cc--cc-CCCceECCCCccEEec
Confidence            3556667777777776677899885432     499999999999763 3332 22  22 2356899999887764


No 48 
>PF02753 PapD_C:  Pili assembly chaperone PapD, C-terminal domain;  InterPro: IPR016148 Most Gram-negative bacteria possess a supramolecular structure - the pili - on their surface, which mediates attachment to specific receptors. Many interactive subunits are required to assemble pili, but their assembly only takes place after translocation across the cytoplasmic membrane. Periplasmic chaperones assist pili assembly by binding to the subunits, thereby preventing premature aggregation [, ]. Pili chaperones are structurally, and possibly evolutionarily, related to the immunoglobulin superfamily [, ]: they contain two globular domains, with a topology identical to an immunoglobulin fold. This entry represents the C-terminal domain of pili assembly chaperone, and has a beta-sandwich fold consisting of eight strands in two sheets with a Greek key topology.; GO: 0007047 cellular cell wall organization, 0030288 outer membrane-bounded periplasmic space; PDB: 2UY7_C 2UY6_A 2W07_A 3GFU_A 3F65_F 3F6L_A 3F6I_A 3GEW_B 1PDK_A 2XG4_A ....
Probab=60.21  E-value=10  Score=25.82  Aligned_cols=44  Identities=14%  Similarity=0.042  Sum_probs=27.4

Q ss_pred             EEEEcCCCCeEEEEE-eeCCCcceeecCCceeeCCCCeEEEEEEe
Q 028956           36 IKIKNTSKSHVAFKF-QTTAPKSCFMRPPGAILAPGESLIATVFK   79 (201)
Q Consensus        36 LtLtN~S~~~VAFKV-KTTaP~~Y~VrP~~GiI~Pg~s~~I~V~l   79 (201)
                      |+++|+|.-+|.|-= +....+.=.--...++|+|+++..+.+.-
T Consensus         1 L~v~NpTPy~vtl~~~~~~~~~~~~~~~~~~mi~P~s~~~~~~~~   45 (68)
T PF02753_consen    1 LTVKNPTPYYVTLSSLKLNGGGKKKKIDNSGMIAPFSSKSFPLPA   45 (68)
T ss_dssp             EEEEE-SSS-EEEEEEEETHHHCCEECCCETEE-TTEEEEEETST
T ss_pred             CEEECCCCcEEEEEeeeecccccccccCCceEECCCCceEEeccC
Confidence            689999999998864 44433332233444499999998877653


No 49 
>PRK15285 putative fimbrial chaperone protein StfD; Provisional
Probab=59.58  E-value=1.2e+02  Score=26.53  Aligned_cols=56  Identities=11%  Similarity=0.106  Sum_probs=39.7

Q ss_pred             eEEEccCCCCceEEEEEEEcCCCC-eEEEEE--eeCCCc----ceeecCCceeeCCCCeEEEEEEe
Q 028956           21 SLILPISSTFPVRSAIKIKNTSKS-HVAFKF--QTTAPK----SCFMRPPGAILAPGESLIATVFK   79 (201)
Q Consensus        21 eL~F~~~~~k~~~~~LtLtN~S~~-~VAFKV--KTTaP~----~Y~VrP~~GiI~Pg~s~~I~V~l   79 (201)
                      .+.|+..   ....+++|+|.++. ++.=-.  .....+    -|.|.|+.-.|+||+...+.|..
T Consensus        33 RVIy~~~---~~~~sv~i~N~~~~~p~LvQsWvd~~~~~~~~~pFiVtPPlfRl~p~~~~~lRI~~   95 (250)
T PRK15285         33 RLVFRGE---DKSISVDLKNANSKLPYLAQSWVEDEKGVKITSPLIVVPPVQRIEPSAIGQVKIQG   95 (250)
T ss_pred             EEEEcCC---CceEEEEEEeCCCCCcEEEEEEeeCCCCCcccCCEEEcCCeEEECCCCceEEEEEE
Confidence            5666643   46789999999864 432222  111111    39999999999999999999996


No 50 
>PRK15208 long polar fimbrial chaperone LpfB; Provisional
Probab=59.42  E-value=18  Score=31.08  Aligned_cols=67  Identities=16%  Similarity=0.199  Sum_probs=43.8

Q ss_pred             cccceeCceeEEccCCCceeEEEccCCCCceEEEEEEEcCCCCeEEEE-EeeCCCcceeecCCceeeCCCCeEEEEE
Q 028956            2 VPVRFLPLHDHFYPNGAVSSLILPISSTFPVRSAIKIKNTSKSHVAFK-FQTTAPKSCFMRPPGAILAPGESLIATV   77 (201)
Q Consensus         2 ~~~~~~p~~L~i~P~~~~~eL~F~~~~~k~~~~~LtLtN~S~~~VAFK-VKTTaP~~Y~VrP~~GiI~Pg~s~~I~V   77 (201)
                      +|+-|=|-.|.-.+.++.+.|.|....     ..|+++|+|..++.|- ++-. -+  .+. ..|.|.|+++..+.+
T Consensus       131 IKlFyRP~~l~~~~~~~~~~L~~~~~~-----~~l~v~Nptpy~vtl~~l~~~-g~--~~~-~~~mi~P~s~~~~~l  198 (228)
T PRK15208        131 LKLFYRPAGLKGNSMDAWNKLRFTRAG-----NQIKVENPSAFNLTFNQFYAN-GR--DIE-KAGMVPAKGSLNIEL  198 (228)
T ss_pred             eeEEEchhhcCCchhhhhhceEEEEeC-----CEEEEECCCccEEEEEEEEEC-Cc--ccC-CCceECCCCccEEEc
Confidence            355566666655555545577887432     2599999999999875 4432 22  222 368999999888764


No 51 
>PRK15299 fimbrial chaperone protein StiB; Provisional
Probab=58.34  E-value=19  Score=30.79  Aligned_cols=66  Identities=14%  Similarity=0.160  Sum_probs=43.4

Q ss_pred             ccceeCceeEEccCCCceeEEEccCCCCceEEEEEEEcCCCCeEEEE-EeeCCCcceeecCCceeeCCCCeEEEEE
Q 028956            3 PVRFLPLHDHFYPNGAVSSLILPISSTFPVRSAIKIKNTSKSHVAFK-FQTTAPKSCFMRPPGAILAPGESLIATV   77 (201)
Q Consensus         3 ~~~~~p~~L~i~P~~~~~eL~F~~~~~k~~~~~LtLtN~S~~~VAFK-VKTTaP~~Y~VrP~~GiI~Pg~s~~I~V   77 (201)
                      ++.|=|..|.-.+.++.+.|.|....     ..|+++|+|..++.|- ++-... .  + ...|.|.|+++..+.+
T Consensus       134 KLfyRP~~l~~~~~~~~~~L~~~~~~-----~~l~v~Nptpy~vtl~~l~~~~~-~--~-~~~~mv~P~s~~~~~l  200 (227)
T PRK15299        134 KLIYRPKALTQKPEEVADRLTWSRQG-----RTLTVKNPTPYYMNFATLSVGSQ-K--V-KAPRYVAPFGNAQYTL  200 (227)
T ss_pred             eEEEcccccCcchhhhhhccEEEEcC-----CEEEEECCCccEEEEEeEEECCc-c--c-CCCceECCCCccEEEc
Confidence            45677777766555455678887432     2599999999888763 333322 1  2 2358999999887764


No 52 
>PF04744 Monooxygenase_B:  Monooxygenase subunit B protein;  InterPro: IPR006833 Ammonia monooxygenase and the particulate methane monooxygenase are both integral membrane proteins, occurring in ammonia oxidisers and methanotrophs respectively, which are thought to be evolutionarily related []. These enzymes have a relatively wide substrate specificity and can catalyse the oxidation of a range of substrates including ammonia, methane, halogenated hydrocarbons and aromatic molecules []. These enzymes are composed of 3 subunits - A (IPR003393 from INTERPRO), B (IPR006833 from INTERPRO) and C (IPR006980 from INTERPRO) - and contain various metal centres, including copper. Particulate methane monooxygenase from Methylococcus capsulatus str. Bath is an ABC homotrimer, which contains mononuclear and dinuclear copper metal centres, and a third metal centre containing a metal ion whose identity in vivo is not certain[]. The soluble regions of these enzymes derive primarily from the B subunit. This subunit forms two antiparallel beta-barrel-like structures and contains the mono- and di- nuclear copper metal centres [].; PDB: 3CHX_E 3RFR_A 3RGB_A 1YEW_A.
Probab=57.00  E-value=47  Score=31.13  Aligned_cols=58  Identities=19%  Similarity=0.177  Sum_probs=36.8

Q ss_pred             eEEEccCCCCceEEEEEEEcCCCCeEEEEEeeCCCccee----------------------ecCCceeeCCCCeEEEEEE
Q 028956           21 SLILPISSTFPVRSAIKIKNTSKSHVAFKFQTTAPKSCF----------------------MRPPGAILAPGESLIATVF   78 (201)
Q Consensus        21 eL~F~~~~~k~~~~~LtLtN~S~~~VAFKVKTTaP~~Y~----------------------VrP~~GiI~Pg~s~~I~V~   78 (201)
                      .-.|.- +++..+-.|+++|.++++|-..==+|+.-+|.                      |.| .+-|.||++.+++|.
T Consensus       255 ~A~Y~v-pgR~l~~~l~VtN~g~~pv~LgeF~tA~vrFln~~v~~~~~~~P~~l~A~~gL~vs~-~~pI~PGETrtl~V~  332 (381)
T PF04744_consen  255 DATYRV-PGRTLTMTLTVTNNGDSPVRLGEFNTANVRFLNPDVPTDDPDYPDELLAERGLSVSD-NSPIAPGETRTLTVE  332 (381)
T ss_dssp             EEEEES-SSSEEEEEEEEEEESSS-BEEEEEESSS-EEE-TTT-SS-S---TTTEETT-EEES---S-B-TT-EEEEEEE
T ss_pred             ccEEec-CCcEEEEEEEEEcCCCCceEeeeEEeccEEEeCcccccCCCCCchhhhccCcceeCC-CCCcCCCceEEEEEE
Confidence            345544 47789999999999999998774344443332                      233 346899999999999


Q ss_pred             ee
Q 028956           79 KF   80 (201)
Q Consensus        79 l~   80 (201)
                      .+
T Consensus       333 a~  334 (381)
T PF04744_consen  333 AQ  334 (381)
T ss_dssp             EE
T ss_pred             ee
Confidence            84


No 53 
>PRK15211 fimbrial chaperone protein PefD; Provisional
Probab=56.11  E-value=22  Score=30.72  Aligned_cols=69  Identities=16%  Similarity=0.039  Sum_probs=44.7

Q ss_pred             cccceeCceeEEccCCCceeEEEccCCCCceEEEEEEEcCCCCeEEEE-EeeCCCcceeecC----CceeeCCCCeEEEE
Q 028956            2 VPVRFLPLHDHFYPNGAVSSLILPISSTFPVRSAIKIKNTSKSHVAFK-FQTTAPKSCFMRP----PGAILAPGESLIAT   76 (201)
Q Consensus         2 ~~~~~~p~~L~i~P~~~~~eL~F~~~~~k~~~~~LtLtN~S~~~VAFK-VKTTaP~~Y~VrP----~~GiI~Pg~s~~I~   76 (201)
                      +++-|=|..|.-+|.++.+.|.|....+     .|++.|+|..++.|- ++- ..+......    ..+.|+|.++..+.
T Consensus       131 iKLfyRP~~L~~~~~~~~~~L~~~~~~~-----~l~v~NpTPyyitl~~l~v-~~~~~~~~~~~~~~~~mi~Pfs~~~~~  204 (229)
T PRK15211        131 VKLIYRPKALKEGRENAEKNLQIIHRGG-----ETYLKNPTPYYFAVTGVKL-NGQPVKLSDRVMNEIAQLAPFSEVSLG  204 (229)
T ss_pred             eeeEEcchhcccChhhhhhccEEEEeCC-----EEEEECCCCcEEEeEeEEE-CCeeccccccccccceeECCCCcceEe
Confidence            3556777778777766667888875433     499999999999874 332 222221111    23579999987664


No 54 
>PRK15253 putative fimbrial assembly chaperone protein StcB; Provisional
Probab=55.06  E-value=1.1e+02  Score=26.61  Aligned_cols=76  Identities=18%  Similarity=0.312  Sum_probs=50.6

Q ss_pred             eEEEccCCCCceEEEEEEEcCCCCeEEEEEee---CC-----C----cceeecCCceeeCCCCeEEEEEEeeccCCCCCC
Q 028956           21 SLILPISSTFPVRSAIKIKNTSKSHVAFKFQT---TA-----P----KSCFMRPPGAILAPGESLIATVFKFVELPENNE   88 (201)
Q Consensus        21 eL~F~~~~~k~~~~~LtLtN~S~~~VAFKVKT---Ta-----P----~~Y~VrP~~GiI~Pg~s~~I~V~l~~e~p~~~e   88 (201)
                      .+.|+..   ....+++|.|.++.+  |=|++   ..     |    .-|.|.|+.=.|+||+...+.|..   .+ . .
T Consensus        41 RvIy~~~---~k~~sv~i~N~~~~p--yLvQsWvd~~~~~~~~~~~~~pFivtPPlfRl~p~~~~~lRI~~---~~-~-~  110 (242)
T PRK15253         41 RVIYPAE---KKEVVVQLVNQGEQA--SLVQSWIDDGNTSLPPEKIQVPFMLTPPVARVAAESGQQIKIKK---MP-N-S  110 (242)
T ss_pred             EEEEeCC---CceEEEEEEcCCCCc--EEEEEEEECCCCCCCcccccCCEEECCCeEEECCCCceEEEEEE---CC-C-C
Confidence            5666643   467799999999875  55554   11     1    149999999999999999999996   22 1 1


Q ss_pred             CCCCCCCCCeEEEEEEEeCC
Q 028956           89 KPMYQKSRDKFKIISMKVKA  108 (201)
Q Consensus        89 ~p~~~~~kDKFlVqs~~v~~  108 (201)
                      .|.|.  ---|-+....+|+
T Consensus       111 LP~DR--ESlfwlnv~~IPp  128 (242)
T PRK15253        111 LPDNK--ESLFYLNVLDIPP  128 (242)
T ss_pred             CCcce--eEEEEEEEEEcCC
Confidence            23332  1235566666665


No 55 
>PF00553 CBM_2:  Cellulose binding domain;  InterPro: IPR001919 The microbial degradation of cellulose and xylans requires several types of enzyme such as endoglucanases (3.2.1.4 from EC), cellobiohydrolases (3.2.1.91 from EC) (exoglucanases), or xylanases (3.2.1.8 from EC) []. Structurally, cellulases and xylanases generally consist of a catalytic domain joined to a cellulose-binding domain (CBD) by a short linker sequence rich in proline and/or hydroxy-amino acids. The CBD domain is found either at the N-terminal or at the C-terminal extremity of these enzymes. As it is shown in the following schematic representation, there are two conserved cysteines in this CBD domain - one at each extremity of the domain - which have been shown [] to be involved in a disulphide bond. There are also four conserved tryptophan, two are involved in cellulose binding. The CBD of a number of bacterial cellulases has been shown to consist of about 105 amino acid residues [, ].  +-------------------------------------------------+ | | xCxxxxWxxxxxNxxxWxxxxxxxWxxxxxxxxWNxxxxxGxxxxxxxxxxCx 'C': conserved cysteine involved in a disulphide bond. ; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds, 0030246 carbohydrate binding, 0005975 carbohydrate metabolic process; PDB: 2CZN_A 2CWR_A 1HEH_C 1HEJ_C 3NDZ_E 3NDY_E 2XBD_A 1E5C_A 1XBD_A 1E5B_A ....
Probab=54.93  E-value=30  Score=25.67  Aligned_cols=50  Identities=18%  Similarity=0.294  Sum_probs=34.0

Q ss_pred             CceEEEEEEEcCCCCeEE-EEEeeCCC-----------------cceeecCCc--eeeCCCCeEEEEEEe
Q 028956           30 FPVRSAIKIKNTSKSHVA-FKFQTTAP-----------------KSCFMRPPG--AILAPGESLIATVFK   79 (201)
Q Consensus        30 k~~~~~LtLtN~S~~~VA-FKVKTTaP-----------------~~Y~VrP~~--GiI~Pg~s~~I~V~l   79 (201)
                      ......|+|+|.++.++. ++|.=+-|                 ..|.|+|..  +.|+||+++.+-+..
T Consensus        13 ~Gf~~~v~v~N~~~~~i~~W~v~~~~~~~~~i~~~Wna~~s~~g~~~~v~~~~wn~~i~~G~s~~~Gf~~   82 (101)
T PF00553_consen   13 GGFQGEVTVTNNGSSPINGWTVTFTFPSGQTITSSWNATVSQSGNTVTVTNPSWNGTIAPGGSVTFGFQA   82 (101)
T ss_dssp             SEEEEEEEEEESSSSTEESEEEEEEESTTEEEEEEESCEEEEETTEEEEEESSTCSEEEESEEEEEEEEE
T ss_pred             CCeEEEEEEEECCCCccCCEEEEEEeCCCCEEeeeeccEEEecCCEEEEEcCCcCcccCCCCeEEEEEEE
Confidence            345668899998877652 33332222                 358888654  699999999887775


No 56 
>PRK09918 putative fimbrial chaperone protein; Provisional
Probab=54.72  E-value=29  Score=29.75  Aligned_cols=69  Identities=23%  Similarity=0.268  Sum_probs=42.6

Q ss_pred             cccceeCceeEEccCCCceeEEEccCCCCceEEEEEEEcCCCCeEEEE-E-eeC-CCcceeecCCceeeCCCCeEEEEEE
Q 028956            2 VPVRFLPLHDHFYPNGAVSSLILPISSTFPVRSAIKIKNTSKSHVAFK-F-QTT-APKSCFMRPPGAILAPGESLIATVF   78 (201)
Q Consensus         2 ~~~~~~p~~L~i~P~~~~~eL~F~~~~~k~~~~~LtLtN~S~~~VAFK-V-KTT-aP~~Y~VrP~~GiI~Pg~s~~I~V~   78 (201)
                      +|+-|-|-.|. .+.++.+.|.|....     ..|+++|++..++-|- + +.. .-+.+.+  ..|.|.|+++..+.+-
T Consensus       129 iklfyRP~~l~-~~~~~~~~l~~~~~~-----~~l~v~N~~p~~i~l~~l~~~~~~g~~~~~--~~~~v~P~s~~~~~l~  200 (230)
T PRK09918        129 LPVLIQPAALP-VVRDPWKLLVWSISG-----NNLVVSNPSPYVVRLGQQVILLPSGKVVAL--PKPYILPGESLTVAIT  200 (230)
T ss_pred             EEEEEeCCCCC-CcccchhccEEEEeC-----CEEEEECCCCEEEEEeccEEEccCCceecc--CCceECCCceEEEEcc
Confidence            45666676665 333333457776542     2499999999888764 2 221 2333333  3489999999888643


No 57 
>PF05753 TRAP_beta:  Translocon-associated protein beta (TRAPB);  InterPro: IPR008856 This family consists of several eukaryotic translocon-associated protein beta (TRAPB) or signal sequence receptor beta subunit (SSR-beta) proteins. The normal translocation of nascent polypeptides into the lumen of the endoplasmic reticulum (ER) is thought to be aided in part by a translocon-associated protein (TRAP) complex consisting of 4 protein subunits. The association of mature proteins with the ER and Golgi, or other intracellular locales, such as lysosomes, depends on the initial targeting of the nascent polypeptide to the ER membrane. A similar scenario must also exist for proteins destined for secretion [].; GO: 0005783 endoplasmic reticulum, 0016021 integral to membrane
Probab=53.13  E-value=69  Score=26.70  Aligned_cols=51  Identities=20%  Similarity=0.232  Sum_probs=36.9

Q ss_pred             CCCceEEEEEEEcCCCCeEEEEEeeCC----CcceeecC-----CceeeCCCCeEEEEEEe
Q 028956           28 STFPVRSAIKIKNTSKSHVAFKFQTTA----PKSCFMRP-----PGAILAPGESLIATVFK   79 (201)
Q Consensus        28 ~~k~~~~~LtLtN~S~~~VAFKVKTTa----P~~Y~VrP-----~~GiI~Pg~s~~I~V~l   79 (201)
                      .++..+..++|.|.++. -||.|+-+.    ++.|-+.-     ....|+||+++.-.+++
T Consensus        36 ~g~~v~V~~~iyN~G~~-~A~dV~l~D~~fp~~~F~lvsG~~s~~~~~i~pg~~vsh~~vv   95 (181)
T PF05753_consen   36 EGEDVTVTYTIYNVGSS-AAYDVKLTDDSFPPEDFELVSGSLSASWERIPPGENVSHSYVV   95 (181)
T ss_pred             CCcEEEEEEEEEECCCC-eEEEEEEECCCCCccccEeccCceEEEEEEECCCCeEEEEEEE
Confidence            47789999999999877 799998776    24444332     13567788877777776


No 58 
>PRK15274 putative periplasmic fimbrial chaperone protein SteC; Provisional
Probab=52.67  E-value=1.6e+02  Score=25.89  Aligned_cols=78  Identities=15%  Similarity=0.152  Sum_probs=50.0

Q ss_pred             eEEEccCCCCceEEEEEEEcCCCC-eEEEEEee---CC-----CcceeecCCceeeCCCCeEEEEEEeeccCCCCCCCCC
Q 028956           21 SLILPISSTFPVRSAIKIKNTSKS-HVAFKFQT---TA-----PKSCFMRPPGAILAPGESLIATVFKFVELPENNEKPM   91 (201)
Q Consensus        21 eL~F~~~~~k~~~~~LtLtN~S~~-~VAFKVKT---Ta-----P~~Y~VrP~~GiI~Pg~s~~I~V~l~~e~p~~~e~p~   91 (201)
                      .+.|...   ....+|+|+|.++. ++  =|++   ..     ..-|.|.|+.-.|+||+...|.|..   .+.....|.
T Consensus        34 RvIy~e~---~~~~sv~v~N~~~~~p~--LVQsWvdd~~~~~~~~pFivtPPLfRlep~~~q~lRI~~---~~~~~~LP~  105 (257)
T PRK15274         34 RVIFNGN---ENSITVTLKNGNATLPY--LAQAWLEDDKFAKDTRYFTALPPLQRIEPKSDGQVKVQP---LPAAASLPQ  105 (257)
T ss_pred             EEEEeCC---CceEEEEEEeCCCCCcE--EEEEEccCCCCCcccCCEEEcCCeEEECCCCceEEEEEE---CCCCCCCCC
Confidence            5666643   46789999999865 53  3333   11     1149999999999999999999997   221112233


Q ss_pred             CCCCCCeEEEEEEEeCC
Q 028956           92 YQKSRDKFKIISMKVKA  108 (201)
Q Consensus        92 ~~~~kDKFlVqs~~v~~  108 (201)
                      |..  --|-+....+|+
T Consensus       106 DRE--SlFwlNv~eIPp  120 (257)
T PRK15274        106 DRE--SLFYFNVREIPP  120 (257)
T ss_pred             cee--EEEEEEEEEcCC
Confidence            322  235666666665


No 59 
>smart00637 CBD_II CBD_II domain.
Probab=50.91  E-value=62  Score=23.24  Aligned_cols=48  Identities=17%  Similarity=0.250  Sum_probs=30.5

Q ss_pred             eEEEEEEEcCCCCeE-----EEEEee-------------CCCcceeecCCc--eeeCCCCeEEEEEEe
Q 028956           32 VRSAIKIKNTSKSHV-----AFKFQT-------------TAPKSCFMRPPG--AILAPGESLIATVFK   79 (201)
Q Consensus        32 ~~~~LtLtN~S~~~V-----AFKVKT-------------TaP~~Y~VrP~~--GiI~Pg~s~~I~V~l   79 (201)
                      ....|+|+|.++.++     .|.+--             .....|.++|..  +.|+||+++.+-+..
T Consensus         8 ~~~~v~vtN~~~~~~~~W~v~~~~~~~~~i~~~Wn~~~~~~g~~~~~~~~~wn~~i~~G~s~~~gf~~   75 (92)
T smart00637        8 FTANVTVTNTGSSAINGWTVTFDLPGGQTVTNSWNATVSQSGGHVTATNASWNGTIAPGGSVSFGFQG   75 (92)
T ss_pred             EEEEEEEEeCCCCcccCeEEEEEcCCCcEEeeeEEEEEEecCCEEEEecCccccccCCCCEEEEEEEe
Confidence            456788888776443     333311             123368888644  799999998876654


No 60 
>PHA01327 hypothetical protein
Probab=48.34  E-value=18  Score=23.72  Aligned_cols=14  Identities=43%  Similarity=1.037  Sum_probs=11.7

Q ss_pred             hHHHHHHHHHHHHh
Q 028956          180 DEWKERRERYLARQ  193 (201)
Q Consensus       180 ~~~~~~r~~~~~~~  193 (201)
                      ++|.|||+++-.|.
T Consensus        20 e~wqer~drmkkrh   33 (49)
T PHA01327         20 EEWQERKDRMKKRH   33 (49)
T ss_pred             HHHHHHHHHHHHHH
Confidence            68999999998753


No 61 
>PF07705 CARDB:  CARDB;  InterPro: IPR011635 The APHP (acidic peptide-dependent hydrolases/peptidase) domain is found in a variety of different proteins.; PDB: 2KUT_A 2L0D_A 3IDU_A 2KL6_A.
Probab=45.00  E-value=71  Score=22.29  Aligned_cols=52  Identities=21%  Similarity=0.161  Sum_probs=34.5

Q ss_pred             CCCceEEEEEEEcCCCC-eEEEEEeeCCCcceeecCCceeeCCCCeEEEEEEe
Q 028956           28 STFPVRSAIKIKNTSKS-HVAFKFQTTAPKSCFMRPPGAILAPGESLIATVFK   79 (201)
Q Consensus        28 ~~k~~~~~LtLtN~S~~-~VAFKVKTTaP~~Y~VrP~~GiI~Pg~s~~I~V~l   79 (201)
                      .++..+-.++|+|.+.. .=.|+|+-...+.-.-.-..+-|.||++..+++.+
T Consensus        17 ~g~~~~i~~~V~N~G~~~~~~~~v~~~~~~~~~~~~~i~~L~~g~~~~v~~~~   69 (101)
T PF07705_consen   17 PGEPVTITVTVKNNGTADAENVTVRLYLDGNSVSTVTIPSLAPGESETVTFTW   69 (101)
T ss_dssp             TTSEEEEEEEEEE-SSS-BEEEEEEEEETTEEEEEEEESEB-TTEEEEEEEEE
T ss_pred             CCCEEEEEEEEEECCCCCCCCEEEEEEECCceeccEEECCcCCCcEEEEEEEE
Confidence            56788999999999764 34567664333333322333788999999999998


No 62 
>PRK15253 putative fimbrial assembly chaperone protein StcB; Provisional
Probab=44.72  E-value=43  Score=29.23  Aligned_cols=65  Identities=11%  Similarity=0.116  Sum_probs=43.2

Q ss_pred             cceeCceeEEccCCCceeEEEccCCCCceEEEEEEEcCCCCeEEEE-EeeCCCcceeecCCceeeCCCCeEEEE
Q 028956            4 VRFLPLHDHFYPNGAVSSLILPISSTFPVRSAIKIKNTSKSHVAFK-FQTTAPKSCFMRPPGAILAPGESLIAT   76 (201)
Q Consensus         4 ~~~~p~~L~i~P~~~~~eL~F~~~~~k~~~~~LtLtN~S~~~VAFK-VKTTaP~~Y~VrP~~GiI~Pg~s~~I~   76 (201)
                      +-|=|-.|...+.++.+.|.|....+     .|+++|+|..++.|- ++- ..+  .+....++|+|.++..+.
T Consensus       149 LFYRP~~L~~~~~~~~~~L~~~~~g~-----~l~v~NpTPyyvtl~~l~~-~~~--~~~~~~~mi~Pfs~~~~~  214 (242)
T PRK15253        149 LIWRPSRIAAVTKDSFQRIGLFRSNK-----TVIMKNDTANWITVTDVKA-GNT--KINDQTIMLPPLSTQNIN  214 (242)
T ss_pred             EEEcchhcccchhhhhhceEEEEcCC-----EEEEECCCCcEEEeEeeEE-CCc--ccCCCCceECCCCccEEe
Confidence            34556666666666666888885432     499999999999875 232 222  233345689999988765


No 63 
>PF14796 AP3B1_C:  Clathrin-adaptor complex-3 beta-1 subunit C-terminal
Probab=43.88  E-value=1.3e+02  Score=24.43  Aligned_cols=59  Identities=15%  Similarity=0.192  Sum_probs=38.1

Q ss_pred             eEEEccCC----CCceEEEEEEEcCCCCeEE-EEEeeCC-Ccceeec--CCceeeCCCCeEEEEEEe
Q 028956           21 SLILPISS----TFPVRSAIKIKNTSKSHVA-FKFQTTA-PKSCFMR--PPGAILAPGESLIATVFK   79 (201)
Q Consensus        21 eL~F~~~~----~k~~~~~LtLtN~S~~~VA-FKVKTTa-P~~Y~Vr--P~~GiI~Pg~s~~I~V~l   79 (201)
                      +..|...+    ...+.-.|+++|.++..+. -+|.... +.--+|+  |..+.|+||+++.+.+-.
T Consensus        72 ~Y~F~RqP~~~s~~mvsIql~ftN~s~~~i~~I~i~~k~l~~g~~i~~F~~I~~L~pg~s~t~~lgI  138 (145)
T PF14796_consen   72 EYRFSRQPSLYSPSMVSIQLTFTNNSDEPIKNIHIGEKKLPAGMRIHEFPEIESLEPGASVTVSLGI  138 (145)
T ss_pred             EEEEccCCcCCCCCcEEEEEEEEecCCCeecceEECCCCCCCCcEeeccCcccccCCCCeEEEEEEE
Confidence            45566632    3455678999999997652 3333322 2233444  888999999998877654


No 64 
>PRK15218 fimbrial chaperone protein PegB; Provisional
Probab=42.92  E-value=51  Score=28.40  Aligned_cols=66  Identities=21%  Similarity=0.180  Sum_probs=42.6

Q ss_pred             cceeCceeEEccCCCceeEEEccCCCCceEEEEEEEcCCCCeEEEEEeeCCCcceeecCCceeeCCCCeEEEEE
Q 028956            4 VRFLPLHDHFYPNGAVSSLILPISSTFPVRSAIKIKNTSKSHVAFKFQTTAPKSCFMRPPGAILAPGESLIATV   77 (201)
Q Consensus         4 ~~~~p~~L~i~P~~~~~eL~F~~~~~k~~~~~LtLtN~S~~~VAFKVKTTaP~~Y~VrP~~GiI~Pg~s~~I~V   77 (201)
                      +-|=|-.|.-.|.++.+.|.|....+     .|+++|+|..++.|-==. ..+.  +....+.|+|.++..+.+
T Consensus       134 LfYRP~~L~~~~~~a~~~L~~~~~~~-----~l~v~NpTPyyitl~~l~-~~~~--~~~~~~mi~Pfs~~~~~~  199 (226)
T PRK15218        134 LIYRPPGVQKVDKATFSRLQLLRAPG-----SISVKNNSANWITIPEIK-AKSK--VNKETLLLAPWSSQSITT  199 (226)
T ss_pred             EEEcccccccChhhhhhccEEEEcCC-----EEEEECCCCcEEEeEeee-cCCc--ccCCcceECCCCccEEEc
Confidence            34556666656665556888875432     499999999999885222 1222  222346899999887753


No 65 
>PRK15224 pili assembly chaperone protein SafB; Provisional
Probab=42.65  E-value=40  Score=29.37  Aligned_cols=62  Identities=13%  Similarity=0.114  Sum_probs=40.5

Q ss_pred             eeCceeEEccCCCceeEEEccCCCCceEEEEEEEcCCCCeEEEE-EeeCCCcceeecCCceeeCCCCeEEEE
Q 028956            6 FLPLHDHFYPNGAVSSLILPISSTFPVRSAIKIKNTSKSHVAFK-FQTTAPKSCFMRPPGAILAPGESLIAT   76 (201)
Q Consensus         6 ~~p~~L~i~P~~~~~eL~F~~~~~k~~~~~LtLtN~S~~~VAFK-VKTTaP~~Y~VrP~~GiI~Pg~s~~I~   76 (201)
                      |=|-.|...|.++.+.|.|....+     .|++.|+|..+|.|- ++. ..+.  + .+.++|+|.++..+.
T Consensus       146 YRP~~L~~~~~~a~~~L~~~~~g~-----~l~v~NpTPYyvtl~~l~~-~~~~--~-~~~~miaPfs~~~~~  208 (237)
T PRK15224        146 FRPDAVKGTPEDVAGNLRWVETGN-----KLKVENPTPFYMNLASVTV-GGKP--I-TGLEYIPPFADKTLN  208 (237)
T ss_pred             EchhhcCCChhHhhhccEEEEcCC-----EEEEECCCCcEEEeEeEEE-CCcc--c-CCceeECCCCccEEE
Confidence            445556555655556788875432     499999999999874 333 2222  2 224689999987765


No 66 
>PF07233 DUF1425:  Protein of unknown function (DUF1425);  InterPro: IPR010824 This family consists of several hypothetical bacterial proteins of around 125 residues in length. Several members of this family are described as putative lipoproteins and are often known as YcfL. The function of this family is unknown.; PDB: 3O0L_A.
Probab=41.45  E-value=1.3e+02  Score=22.16  Aligned_cols=51  Identities=14%  Similarity=0.221  Sum_probs=31.0

Q ss_pred             CCceEEEEEEEcCCCCe--EEEEEeeCCCcceeecCC-----ceeeCCCCeEEEEEEe
Q 028956           29 TFPVRSAIKIKNTSKSH--VAFKFQTTAPKSCFMRPP-----GAILAPGESLIATVFK   79 (201)
Q Consensus        29 ~k~~~~~LtLtN~S~~~--VAFKVKTTaP~~Y~VrP~-----~GiI~Pg~s~~I~V~l   79 (201)
                      ++.....+.|+|.++.+  +.||+-==..+-+.|.|.     .=+|.++++..|.-+-
T Consensus        23 ~g~~~~~~~l~N~~~~~~~l~Yrf~WyD~~G~~v~~~~~~w~~~~l~~~~~~~l~~~a   80 (94)
T PF07233_consen   23 NGLLRAQATLSNKSSKPLTLQYRFYWYDKQGLEVDPEQSPWQSLTLPGGQTVTLSAVA   80 (94)
T ss_dssp             CCEEEEEEEEEE-SSS-EEEEEEEEEE-TTS-EE--TT---EEEEE-TT-EEEEEEE-
T ss_pred             CCeEEEEEEEEECCCCcEEEEEEEEEECCCCCCcCCCCCCCEEEEEcCCCEEEEEEEC
Confidence            66789999999999765  788886556666777766     3366677776666553


No 67 
>PRK15195 fimbrial chaperone protein FimC; Provisional
Probab=40.39  E-value=53  Score=28.30  Aligned_cols=64  Identities=20%  Similarity=0.237  Sum_probs=40.3

Q ss_pred             ccceeCceeEEccCCCceeEEEccCCCCceEEEEEEEcCCCCeEEEE-EeeCCCcceeecCCceeeCCCCeEEEE
Q 028956            3 PVRFLPLHDHFYPNGAVSSLILPISSTFPVRSAIKIKNTSKSHVAFK-FQTTAPKSCFMRPPGAILAPGESLIAT   76 (201)
Q Consensus         3 ~~~~~p~~L~i~P~~~~~eL~F~~~~~k~~~~~LtLtN~S~~~VAFK-VKTTaP~~Y~VrP~~GiI~Pg~s~~I~   76 (201)
                      |+-|=|-.|.-.|.++...|.|....     ..|+++|+|..++.|. ++-. -+  .+.+  +.|.|+++..+.
T Consensus       137 KlFyRP~~l~~~~~~~~~~L~~~~~~-----~~L~v~NptPy~vtl~~l~v~-~~--~i~~--~mv~P~s~~~~~  201 (229)
T PRK15195        137 KLFVRPINLQELPEEAPDTLKFSRSG-----NHLKVHNPSPYYVTLVNLQVG-SQ--KLGN--AMIAPKVNSQIP  201 (229)
T ss_pred             EEEEcccccCCChhhhhhccEEEEcC-----CEEEEECCCceEEEEEEEEEC-Cc--ccCc--eEECCCCceEEE
Confidence            44555655655554444577787432     2599999999999885 3332 22  2322  478898887775


No 68 
>PF08277 PAN_3:  PAN-like domain;  InterPro: IPR006583 PAN domains have significant functional versatility fulfilling diverse biological functions by mediating protein-protein or protein-carbohydrate interactions []. These domains contain a hair-pin loop like structure, similar to knottins, but the pattern of disulphide bonds differs The PAN-3 or CW is a domain associated with a number of Caenorhabditis elegans hypothetical proteins.
Probab=40.26  E-value=40  Score=22.78  Aligned_cols=19  Identities=32%  Similarity=0.487  Sum_probs=13.2

Q ss_pred             eEEEEEEEcCCCCeEEEEE
Q 028956           32 VRSAIKIKNTSKSHVAFKF   50 (201)
Q Consensus        32 ~~~~LtLtN~S~~~VAFKV   50 (201)
                      ....-++...+...||||+
T Consensus        53 i~~v~~~~~~~~~~VA~K~   71 (71)
T PF08277_consen   53 ISTVQKTDSSSGNKVAFKI   71 (71)
T ss_pred             EEEEEEeecCCCeEEEEEC
Confidence            4455556666678999996


No 69 
>PF13473 Cupredoxin_1:  Cupredoxin-like domain; PDB: 1IBZ_D 1IC0_E 1IBY_D.
Probab=39.81  E-value=1.5e+02  Score=21.64  Aligned_cols=53  Identities=17%  Similarity=0.254  Sum_probs=31.2

Q ss_pred             eEEccCCCceeEEEccCCCCceEEEEEEEcCCCCeEEEEEeeCCCcceeecCCceeeCCCCeEEEEEEe
Q 028956           11 DHFYPNGAVSSLILPISSTFPVRSAIKIKNTSKSHVAFKFQTTAPKSCFMRPPGAILAPGESLIATVFK   79 (201)
Q Consensus        11 L~i~P~~~~~eL~F~~~~~k~~~~~LtLtN~S~~~VAFKVKTTaP~~Y~VrP~~GiI~Pg~s~~I~V~l   79 (201)
                      ..++|+    ++..+.  |  ..-.|+++|.++..-.|-+..     +.+   ...|.||++..++++-
T Consensus        30 ~~f~P~----~i~v~~--G--~~v~l~~~N~~~~~h~~~i~~-----~~~---~~~l~~g~~~~~~f~~   82 (104)
T PF13473_consen   30 FGFSPS----TITVKA--G--QPVTLTFTNNDSRPHEFVIPD-----LGI---SKVLPPGETATVTFTP   82 (104)
T ss_dssp             EEEES-----EEEEET--T--CEEEEEEEE-SSS-EEEEEGG-----GTE---EEEE-TT-EEEEEEEE
T ss_pred             CeEecC----EEEEcC--C--CeEEEEEEECCCCcEEEEECC-----Cce---EEEECCCCEEEEEEcC
Confidence            345666    555433  2  234588999988888887755     111   2689999999999853


No 70 
>TIGR02656 cyanin_plasto plastocyanin. Members of this family are plastocyanin, a blue copper protein related to pseudoazurin, halocyanin, amicyanin, etc. This protein, located in the thylakoid luman, performs electron transport to photosystem I in Cyanobacteria and chloroplasts.
Probab=38.26  E-value=1.2e+02  Score=22.13  Aligned_cols=15  Identities=33%  Similarity=0.379  Sum_probs=11.8

Q ss_pred             ceeeCCCCeEEEEEE
Q 028956           64 GAILAPGESLIATVF   78 (201)
Q Consensus        64 ~GiI~Pg~s~~I~V~   78 (201)
                      .+.+.||++..+++.
T Consensus        61 ~~~~~pG~t~~~tF~   75 (99)
T TIGR02656        61 DLLNSPGESYEVTFS   75 (99)
T ss_pred             ccccCCCCEEEEEeC
Confidence            467899999888655


No 71 
>PRK15254 fimbrial chaperone protein StdC; Provisional
Probab=37.59  E-value=72  Score=27.76  Aligned_cols=70  Identities=11%  Similarity=0.133  Sum_probs=43.0

Q ss_pred             cccceeCceeEEccCC-CceeEEEccCCCCceEEEEEEEcCCCCeEEEE-EeeCCCcceeec--CCceeeCCCCeEEEEE
Q 028956            2 VPVRFLPLHDHFYPNG-AVSSLILPISSTFPVRSAIKIKNTSKSHVAFK-FQTTAPKSCFMR--PPGAILAPGESLIATV   77 (201)
Q Consensus         2 ~~~~~~p~~L~i~P~~-~~~eL~F~~~~~k~~~~~LtLtN~S~~~VAFK-VKTTaP~~Y~Vr--P~~GiI~Pg~s~~I~V   77 (201)
                      +++-|=|-.|.-.+.+ +-+.|.|....+     .|+++|+|..+|.|- ++-.  ++-.+.  ...++|+|.++..+.+
T Consensus       126 iKLFyRP~~L~~~~~~~~~~~L~~~~~g~-----~l~v~NpTPYyitl~~l~~~--~~~~~~~~~~~~mi~Pfs~~~~~~  198 (239)
T PRK15254        126 LKLFYRPKAIIRSSSDQPERKLTAERNAG-----HLTLRNPTPYYITVAWLGAD--RSHRLSGFREGVMVPPLGSLPLKA  198 (239)
T ss_pred             EeEEEccccccCCcccchhhceEEEEeCC-----EEEEECCCceEEEeeeEEec--CcccccccCCCceECCCCceEEec
Confidence            3556667777545442 225788875422     499999999999874 3321  111121  1446899999887765


Q ss_pred             E
Q 028956           78 F   78 (201)
Q Consensus        78 ~   78 (201)
                      .
T Consensus       199 ~  199 (239)
T PRK15254        199 V  199 (239)
T ss_pred             c
Confidence            3


No 72 
>PRK15308 putative fimbrial protein TcfA; Provisional
Probab=36.57  E-value=2.9e+02  Score=24.07  Aligned_cols=85  Identities=11%  Similarity=0.133  Sum_probs=56.6

Q ss_pred             eeEEccCCCceeEEEccCCCCceEEEEEEEcCCCCeEEEEEee---CCC----c-----------ceeecCCceeeCCCC
Q 028956           10 HDHFYPNGAVSSLILPISSTFPVRSAIKIKNTSKSHVAFKFQT---TAP----K-----------SCFMRPPGAILAPGE   71 (201)
Q Consensus        10 ~L~i~P~~~~~eL~F~~~~~k~~~~~LtLtN~S~~~VAFKVKT---TaP----~-----------~Y~VrP~~GiI~Pg~   71 (201)
                      .|.+.|-    .+.+..  +.+....++|+|.++.+..++|..   ++|    +           .-.+-|+.-+|.||+
T Consensus        17 ~l~V~Pi----~~~i~a--~~~~~~~v~V~N~g~~~~~vqV~v~r~~~PG~~~e~~~~~~~~~~~eLiaSP~~l~L~pg~   90 (234)
T PRK15308         17 NMLVYPM----AAEIGA--GREEATSLFVYSKSDHTQYVRTRIKRIEHPATPQEKEVPAGNDIETGLVVSPEKFALPAGT   90 (234)
T ss_pred             eEEEEEe----EEEecC--CCcceEEEEEEeCCCCcEEEEEEEEEEcCCCCCCCcccccccCCCCcEEEcCceeEECCCC
Confidence            3556666    344332  235678999999999888777652   343    1           366789999999999


Q ss_pred             eEEEEEEeeccCCCCCCCCCCCCCCCeEEEEEEEeCCCC
Q 028956           72 SLIATVFKFVELPENNEKPMYQKSRDKFKIISMKVKADV  110 (201)
Q Consensus        72 s~~I~V~l~~e~p~~~e~p~~~~~kDKFlVqs~~v~~~~  110 (201)
                      +-.|.+...  ++..        .-.-|.|...+++...
T Consensus        91 ~q~IRli~l--g~~~--------kE~~YRl~~~pvp~~~  119 (234)
T PRK15308         91 TRTVRVISL--QAPE--------REEAWRVYFEPVAELE  119 (234)
T ss_pred             eEEEEEEEc--CCCC--------cEEEEEEEEEecCCcc
Confidence            999998862  1211        1344777777776543


No 73 
>smart00605 CW CW domain.
Probab=34.77  E-value=71  Score=23.10  Aligned_cols=22  Identities=36%  Similarity=0.504  Sum_probs=13.8

Q ss_pred             EEEEEcC-CCCeEEEEEeeCCCc
Q 028956           35 AIKIKNT-SKSHVAFKFQTTAPK   56 (201)
Q Consensus        35 ~LtLtN~-S~~~VAFKVKTTaP~   56 (201)
                      .++-.+. +...||||+.++.+.
T Consensus        58 ~v~~~~~~~~~~VAfK~~~~~~~   80 (94)
T smart00605       58 TVKKLSSSSGKKVAFKVSTDQPS   80 (94)
T ss_pred             EEEEccCCCCcEEEEEEeCCCCC
Confidence            3344444 458899999865444


No 74 
>PF03168 LEA_2:  Late embryogenesis abundant protein;  InterPro: IPR004864 Different types of LEA proteins are expressed at different stages of late embryogenesis in higher plant seed embryos and under conditions of dehydration stress [, ]. The function of these proteins is unknown. ; PDB: 3BUT_A 1XO8_A 1YYC_A.
Probab=33.83  E-value=1.6e+02  Score=20.45  Aligned_cols=45  Identities=9%  Similarity=0.066  Sum_probs=22.7

Q ss_pred             EEEEEcCCCCeE-----EEEEeeCCCcce-eecCCceeeCCCCeEEEEEEe
Q 028956           35 AIKIKNTSKSHV-----AFKFQTTAPKSC-FMRPPGAILAPGESLIATVFK   79 (201)
Q Consensus        35 ~LtLtN~S~~~V-----AFKVKTTaP~~Y-~VrP~~GiI~Pg~s~~I~V~l   79 (201)
                      +|+++|++...+     .+.|.-..-..- ......+.++|+++..+.+.+
T Consensus         1 ~l~v~NPN~~~i~~~~~~~~v~~~g~~v~~~~~~~~~~i~~~~~~~v~~~v   51 (101)
T PF03168_consen    1 TLSVRNPNSFGIRYDSIEYDVYYNGQRVGTGGSLPPFTIPARSSTTVPVPV   51 (101)
T ss_dssp             EEEEEESSSS-EEEEEEEEEEEESSSEEEEEEECE-EEESSSCEEEEEEEE
T ss_pred             CEEEECCCceeEEEeCEEEEEEECCEEEECccccCCeEECCCCcEEEEEEE
Confidence            477888887333     233333222222 344555566666666655554


No 75 
>PF11616 EZH2_WD-Binding:  WD repeat binding protein EZH2;  InterPro: IPR021654  This family of proteins represents Enhancer of zest homologue 2, (EZH2) a 30 residue peptide which binds to a WD-repeat domain of EED by residues 39-68. EED is a component of PRC2 complex which is involved in gene expression []. This interaction is required for the HMTase activity of PCR2 []. ; GO: 0018024 histone-lysine N-methyltransferase activity; PDB: 2QXV_B.
Probab=32.79  E-value=21  Score=21.55  Aligned_cols=9  Identities=44%  Similarity=0.977  Sum_probs=5.4

Q ss_pred             ehhHHHHHH
Q 028956          178 VIDEWKERR  186 (201)
Q Consensus       178 ~~~~~~~~r  186 (201)
                      .=+|||.+|
T Consensus        18 LN~eWk~lR   26 (30)
T PF11616_consen   18 LNEEWKKLR   26 (30)
T ss_dssp             HHHHHHH--
T ss_pred             HHHHHHHhc
Confidence            457899998


No 76 
>PF06159 DUF974:  Protein of unknown function (DUF974);  InterPro: IPR010378 This is a family of uncharacterised eukaryotic proteins.
Probab=31.57  E-value=1.5e+02  Score=25.73  Aligned_cols=51  Identities=27%  Similarity=0.256  Sum_probs=35.4

Q ss_pred             CCceEEEEEEEcCCC---CeEEEEEe--eCCCc-ceeecC---C---ceeeCCCCeEEEEEEe
Q 028956           29 TFPVRSAIKIKNTSK---SHVAFKFQ--TTAPK-SCFMRP---P---GAILAPGESLIATVFK   79 (201)
Q Consensus        29 ~k~~~~~LtLtN~S~---~~VAFKVK--TTaP~-~Y~VrP---~---~GiI~Pg~s~~I~V~l   79 (201)
                      |...+|.|-+.|.+.   ..|.+||-  |.+-. +...-+   .   .+-|+||++++..|.+
T Consensus        13 GEtF~~~l~~~N~s~~~v~~v~ikvemqT~s~~~r~~L~~~~~~~~~~~~L~p~~~l~~iv~~   75 (249)
T PF06159_consen   13 GETFSCYLSVNNDSNKPVRNVRIKVEMQTPSQSLRLPLSDNENSDSPVASLAPGESLDFIVSH   75 (249)
T ss_pred             cCCEEEEEEeecCCCCceEEeEEEEEEeCCCCCccccCCCCccccccccccCCCCeEeEEEEE
Confidence            557899999999987   45689984  54331 332222   2   3469999999987775


No 77 
>PF06483 ChiC:  Chitinase C;  InterPro: IPR009470 This ~170 aa region is found at the C-terminal to the catalytic domain (IPR001223 from INTERPRO) found in members of glycoside hydrolase family 18.
Probab=29.69  E-value=62  Score=27.32  Aligned_cols=26  Identities=31%  Similarity=0.530  Sum_probs=22.6

Q ss_pred             CeEEEEEeeCCCcceeecCCceeeCCCCeEEEEEEee
Q 028956           44 SHVAFKFQTTAPKSCFMRPPGAILAPGESLIATVFKF   80 (201)
Q Consensus        44 ~~VAFKVKTTaP~~Y~VrP~~GiI~Pg~s~~I~V~l~   80 (201)
                      ++|+||+           |....|+||+++++.+..+
T Consensus       116 Hrvs~tl-----------p~wqslapG~s~~~~~~Yy  141 (180)
T PF06483_consen  116 HRVSFTL-----------PAWQSLAPGASVELDMVYY  141 (180)
T ss_pred             EEEEEEC-----------CCccccCCCCEEEEeEEEE
Confidence            6777776           8889999999999999876


No 78 
>PF13205 Big_5:  Bacterial Ig-like domain
Probab=29.23  E-value=2.1e+02  Score=20.28  Aligned_cols=56  Identities=23%  Similarity=0.288  Sum_probs=34.3

Q ss_pred             eEEEccCCC-CceEEEEEEEc--CCCCeEEEEEeeCCCcceeecCCceeeCCCCeEEEEEEe
Q 028956           21 SLILPISST-FPVRSAIKIKN--TSKSHVAFKFQTTAPKSCFMRPPGAILAPGESLIATVFK   79 (201)
Q Consensus        21 eL~F~~~~~-k~~~~~LtLtN--~S~~~VAFKVKTTaP~~Y~VrP~~GiI~Pg~s~~I~V~l   79 (201)
                      .|.|.-+-. ......+.+.+  ....+|.+.  ....+.+.+.|. +-|.+|.+..|+|.-
T Consensus        26 ~i~Fs~~v~~~s~~~~~~~~~~~~~~~~v~~~--~~~~~~~~i~p~-~~L~~~t~Y~v~i~~   84 (107)
T PF13205_consen   26 VITFSEPVDPASVSSAITITDSNGSGVPVSFS--SWDGNTLTITPS-QPLKPGTTYTVTIDS   84 (107)
T ss_pred             EEEECCceecCccceEEEEEecCCCcEEEEEE--EccCCEEEEEEC-CcCCCCCEEEEEECC
Confidence            344544432 23445566654  334555555  334488899998 557889999998864


No 79 
>PF11538 Snurportin1:  Snurportin1;  InterPro: IPR024721 Snurportin-1 is a nuclear import receptor that contains an N-terminal importin beta binding domain which is essential for its function as an snRNP-specific nuclear import receptor []. Snurportin-1 interacts with m3G-cap where it enhances the m3G-cap dependent nuclear import of U snRNPs in Xenopus laevis oocytes and digitonin-permeabilized HeLa cells []. This entry represents the snurportin-1 N-terminal importin beta binding domain (IBB). The essential role of the IBB domain for snurportin-1 function suggests that snurportin-1 cooperates with importin beta in mediating nuclear import of snRNPs.; GO: 0005515 protein binding; PDB: 3LWW_D 3NC0_E 3NBZ_E 2Q5D_C 3NBY_B 3GB8_B 3GJX_B 2QNA_B 2P8Q_B.
Probab=28.00  E-value=44  Score=21.44  Aligned_cols=15  Identities=40%  Similarity=0.685  Sum_probs=11.8

Q ss_pred             HHHHHHHHHHHHhhh
Q 028956          181 EWKERRERYLARQQV  195 (201)
Q Consensus       181 ~~~~~r~~~~~~~~~  195 (201)
                      .=.|||+++|++|-.
T Consensus        15 ~Q~eRR~~~Le~QK~   29 (40)
T PF11538_consen   15 DQEERRREFLERQKN   29 (40)
T ss_dssp             SHHHHHHHHHHHHHS
T ss_pred             hHHHHHHHHHHHHHH
Confidence            457999999998753


No 80 
>PRK02710 plastocyanin; Provisional
Probab=27.81  E-value=2.2e+02  Score=21.59  Aligned_cols=15  Identities=40%  Similarity=0.446  Sum_probs=11.4

Q ss_pred             eeeCCCCeEEEEEEe
Q 028956           65 AILAPGESLIATVFK   79 (201)
Q Consensus        65 GiI~Pg~s~~I~V~l   79 (201)
                      +.+.||++..+++.-
T Consensus        82 ~~~~pg~t~~~tF~~   96 (119)
T PRK02710         82 LAFAPGESWEETFSE   96 (119)
T ss_pred             cccCCCCEEEEEecC
Confidence            468999998877663


No 81 
>PF11668 Gp_UL130:  HCMV glycoprotein pUL130;  InterPro: IPR021038 This entry represents UL130 from Human cytomegalovirus, a glycoprotein secreted from infected cells that is incorporated into the virion envelope as a Golgi-matured form. The protein promotes endothelial cell infection through a producer cell modification of the virion [].
Probab=25.51  E-value=1.8e+02  Score=23.94  Aligned_cols=45  Identities=9%  Similarity=0.129  Sum_probs=30.7

Q ss_pred             eeEEEccCCCC-ceEEEEEEEcC---CCCeEEEEEee------CCCcceeecCCc
Q 028956           20 SSLILPISSTF-PVRSAIKIKNT---SKSHVAFKFQT------TAPKSCFMRPPG   64 (201)
Q Consensus        20 ~eL~F~~~~~k-~~~~~LtLtN~---S~~~VAFKVKT------TaP~~Y~VrP~~   64 (201)
                      +-|.|...+|- -..|.++|.--   ....|+|++|-      ..+..+|+||+-
T Consensus       100 ~~Lry~vkDG~~~~~C~m~v~TwA~~~~~~i~Fq~kiel~~A~~~~stiCthPnl  154 (156)
T PF11668_consen  100 KLLRYRVKDGTRWEMCIMRVQTWAHTKSNYIQFQVKIELTHAYRQPSTICTHPNL  154 (156)
T ss_pred             eeEEEEeccCCceeeEEEEeeehhhhhcccEEEEEEEEEeeccCCccceeccccc
Confidence            35667666553 55788988743   24669999983      345668888874


No 82 
>PF09640 DUF2027:  Domain of unknown function (DUF2027);  InterPro: IPR018598  This protein domain is of unknown function. though putatively involved in DNA mismatch repair. It is associated with IPR002625 from INTERPRO. ; PDB: 2HUH_A.
Probab=25.26  E-value=2.6e+02  Score=23.30  Aligned_cols=68  Identities=18%  Similarity=0.265  Sum_probs=47.0

Q ss_pred             eEEEEEEEcCCCCeEEEEEeeCCCcceeecCCceeeCCCCeEEEEEEeeccCCCCCCCCCCCCCCCeEEEEEEEeCCCC
Q 028956           32 VRSAIKIKNTSKSHVAFKFQTTAPKSCFMRPPGAILAPGESLIATVFKFVELPENNEKPMYQKSRDKFKIISMKVKADV  110 (201)
Q Consensus        32 ~~~~LtLtN~S~~~VAFKVKTTaP~~Y~VrP~~GiI~Pg~s~~I~V~l~~e~p~~~e~p~~~~~kDKFlVqs~~v~~~~  110 (201)
                      .+-..-|.|.|.-.+-|=.-+...+.|.+| +.|.|+|+..+-|.-.-   ...     .  ..-.+.-||.+.-..+.
T Consensus        18 T~fE~YlVNDSNYy~~y~y~~~~g~~w~lr-s~G~iEPNtKl~ieef~---~~e-----L--N~~~~v~vQ~iAyK~~K   85 (162)
T PF09640_consen   18 TRFECYLVNDSNYYLHYTYLTAEGNSWTLR-SAGEIEPNTKLFIEEFS---KEE-----L--NDLERVAVQLIAYKKDK   85 (162)
T ss_dssp             --EEEEEEE-SSSEEEEEEEEEETTEEEEE-EEEEE-TTEEEEEEEE----GGG-----G--GG-SSEEEEEEEE-SSS
T ss_pred             CceEEEEEecCccEEEEEEEeccCCeEEEE-ecceECCCceeehhhcC---HHH-----h--hccceeEEEEEEEcCCC
Confidence            455778899999999999998888889888 78999999998887553   111     1  23466778888876654


No 83 
>PF12086 DUF3563:  Protein of unknown function (DUF3563);  InterPro: IPR021946  This family of proteins is functionally uncharacterised. This protein is found in bacteria. Proteins in this family are about 50 amino acids in length. This protein has conserved AYL and DLE sequence motifs. 
Probab=23.11  E-value=56  Score=22.78  Aligned_cols=13  Identities=54%  Similarity=0.782  Sum_probs=10.2

Q ss_pred             HHHHHHHHHHHHh
Q 028956          181 EWKERRERYLARQ  193 (201)
Q Consensus       181 ~~~~~r~~~~~~~  193 (201)
                      .-++|||.|||+-
T Consensus        18 ~~~~r~eaYLA~s   30 (59)
T PF12086_consen   18 SERERREAYLAQS   30 (59)
T ss_pred             HHHHHHHHHHHhc
Confidence            3478999999963


No 84 
>PF00127 Copper-bind:  Copper binding proteins, plastocyanin/azurin family;  InterPro: IPR000923 Blue (type 1) copper proteins are small proteins which bind a single copper atom and which are characterised by an intense electronic absorption band near 600 nm [, ]. The most well known members of this class of proteins are the plant chloroplastic plastocyanins, which exchange electrons with cytochrome c6, and the distantly related bacterial azurins, which exchange electrons with cytochrome c551. This family of proteins also includes amicyanin from bacteria such as Methylobacterium extorquens or Paracoccus versutus (Thiobacillus versutus) that can grow on methylamine; auracyanins A and B from Chloroflexus aurantiacus []; blue copper protein from Alcaligenes faecalis; cupredoxin (CPC) from Cucumis sativus (Cucumber) peelings []; cusacyanin (basic blue protein; plantacyanin, CBP) from cucumber; halocyanin from Natronomonas pharaonis (Natronobacterium pharaonis) [], a membrane associated copper-binding protein; pseudoazurin from Pseudomonas; rusticyanin from Thiobacillus ferrooxidans []; stellacyanin from Rhus vernicifera (Japanese lacquer tree); umecyanin from the roots of Armoracia rusticana (Horseradish); and allergen Ra3 from ragweed. This pollen protein is evolutionary related to the above proteins, but seems to have lost the ability to bind copper. Although there is an appreciable amount of divergence in the sequences of all these proteins, the copper ligand sites are conserved.; GO: 0005507 copper ion binding, 0009055 electron carrier activity; PDB: 1UAT_A 1CUO_A 1PLC_A 4PCY_A 3PCY_A 1PND_A 1PNC_A 1JXG_A 6PCY_A 1TKW_A ....
Probab=23.07  E-value=2e+02  Score=20.89  Aligned_cols=16  Identities=38%  Similarity=0.590  Sum_probs=13.7

Q ss_pred             CceeeCCCCeEEEEEE
Q 028956           63 PGAILAPGESLIATVF   78 (201)
Q Consensus        63 ~~GiI~Pg~s~~I~V~   78 (201)
                      ...++.||++..++++
T Consensus        60 ~~~~~~~G~~~~~tF~   75 (99)
T PF00127_consen   60 SSPLLAPGETYSVTFT   75 (99)
T ss_dssp             EEEEBSTTEEEEEEEE
T ss_pred             cceecCCCCEEEEEeC
Confidence            5779999999888877


No 85 
>PRK15285 putative fimbrial chaperone protein StfD; Provisional
Probab=22.14  E-value=2.3e+02  Score=24.82  Aligned_cols=66  Identities=9%  Similarity=0.033  Sum_probs=37.4

Q ss_pred             eeCceeEEccCCC--ceeEEEccCCCCceEEEEEEEcCCCCeEEEE-EeeCCCc-ceeecCCceeeCCCCeEEEEE
Q 028956            6 FLPLHDHFYPNGA--VSSLILPISSTFPVRSAIKIKNTSKSHVAFK-FQTTAPK-SCFMRPPGAILAPGESLIATV   77 (201)
Q Consensus         6 ~~p~~L~i~P~~~--~~eL~F~~~~~k~~~~~LtLtN~S~~~VAFK-VKTTaP~-~Y~VrP~~GiI~Pg~s~~I~V   77 (201)
                      |=|-.|.-.+.+.  ...|.|....+     .|++.|+|..+|.|- ++-+..+ .+ .....+.|+|.++..+.+
T Consensus       139 YRP~~L~~~~~~~~~~~~l~~~~~g~-----~l~i~NPTPyyvtl~~l~~~~~~~~~-~~~~~~mi~Pfs~~~~~~  208 (250)
T PRK15285        139 YRPQALSKIDMQHPWQYKITLQRQGN-----GYQVSNPTGYYVVLSNASNRMDGTPA-RGFSPLVIAPKSNVTLGG  208 (250)
T ss_pred             ECcccccCChhhhhhhceEEEEEeCC-----EEEEECCCCcEEEeeheeeccCCeec-ccCCCceECCCCccEEEc
Confidence            4455554333321  12466654322     499999999999884 2211222 22 111446899999988754


No 86 
>PRK15233 putative fimbrial chaperone protein SefB; Provisional
Probab=22.05  E-value=1e+02  Score=27.04  Aligned_cols=62  Identities=19%  Similarity=0.155  Sum_probs=37.3

Q ss_pred             eeCceeEEccCCCceeEEEccCCCCceEEEEEEEcCCCCeEEEEEeeCCCcceeecCCceeeCCCCeEE
Q 028956            6 FLPLHDHFYPNGAVSSLILPISSTFPVRSAIKIKNTSKSHVAFKFQTTAPKSCFMRPPGAILAPGESLI   74 (201)
Q Consensus         6 ~~p~~L~i~P~~~~~eL~F~~~~~k~~~~~LtLtN~S~~~VAFKVKTTaP~~Y~VrP~~GiI~Pg~s~~   74 (201)
                      |=|-.|.-.|.++.+.|.|....+     .|+++|+|..++.|.==+-..+.  +.-..+.|+|.++..
T Consensus       160 YRP~~L~~~~~~a~~kL~~~~~g~-----~l~i~NPTPYyitl~~l~v~~~~--i~~~~~mIaPfs~~~  221 (246)
T PRK15233        160 YRPKTIDLTTMEIADKLKLERKGN-----SIVIKNPTSSYVNIANIKSGNLS--FNIPNGYIEPFGYAQ  221 (246)
T ss_pred             EchhhcCCChhhhhhccEEEEcCC-----EEEEECCCCcEEEeEeEEECCcc--cCCCCceECCCCccc
Confidence            445555555655556788875432     49999999999987521112222  221345788887775


No 87 
>PF11906 DUF3426:  Protein of unknown function (DUF3426);  InterPro: IPR021834  This family of proteins are functionally uncharacterised. This protein is found in bacteria. Proteins in this family are typically between 262 to 463 amino acids in length. 
Probab=21.07  E-value=2.5e+02  Score=21.70  Aligned_cols=50  Identities=20%  Similarity=0.145  Sum_probs=30.0

Q ss_pred             CceEEEEEEEcCCCCeEEE---EE------------eeCCCcceeecC--CceeeCCCCeEEEEEEe
Q 028956           30 FPVRSAIKIKNTSKSHVAF---KF------------QTTAPKSCFMRP--PGAILAPGESLIATVFK   79 (201)
Q Consensus        30 k~~~~~LtLtN~S~~~VAF---KV------------KTTaP~~Y~VrP--~~GiI~Pg~s~~I~V~l   79 (201)
                      ....-..+|.|.++.+++|   ++            ++-.|..|...+  +..-|.||+++.+.+.+
T Consensus        68 ~~l~v~g~i~N~~~~~~~~P~l~l~L~D~~g~~l~~r~~~P~~yl~~~~~~~~~l~pg~~~~~~~~~  134 (149)
T PF11906_consen   68 GVLVVSGTIRNRADFPQALPALELSLLDAQGQPLARRVFTPADYLPPGLAAQAGLPPGESVPFRLRL  134 (149)
T ss_pred             CEEEEEEEEEeCCCCcccCceEEEEEECCCCCEEEEEEEChHHhcccccccccccCCCCeEEEEEEe
Confidence            3445555666666655443   11            122355555544  34459999999999997


No 88 
>PF09478 CBM49:  Carbohydrate binding domain CBM49;  InterPro: IPR019028 A carbohydrate-binding module (CBM) is defined as a contiguous amino acid sequence within a carbohydrate-active enzyme with a discreet fold having carbohydrate-binding activity. A few exceptions are CBMs in cellulosomal scaffolding proteins and rare instances of independent putative CBMs. The requirement of CBMs existing as modules within larger enzymes sets this class of carbohydrate-binding protein apart from other non-catalytic sugar binding proteins such as lectins and sugar transport proteins. CBMs were previously classified as cellulose-binding domains (CBDs) based on the initial discovery of several modules that bound cellulose [, ]. However, additional modules in carbohydrate-active enzymes are continually being found that bind carbohydrates other than cellulose yet otherwise meet the CBM criteria, hence the need to reclassify these polypeptides using more inclusive terminology. Previous classification of cellulose-binding domains were based on amino acid similarity. Groupings of CBDs were called "Types" and numbered with roman numerals (e.g. Type I or Type II CBDs). In keeping with the glycoside hydrolase classification, these groupings are now called families and numbered with Arabic numerals. Families 1 to 13 are the same as Types I to XIII. For a detailed review on the structure and binding modes of CBMs see [].  This domain is found at the C-terminal of cellulases and in vitro binding studies have shown it to binds to crystalline cellulose []. ; GO: 0030246 carbohydrate binding, 0005576 extracellular region
Probab=20.91  E-value=3.1e+02  Score=19.28  Aligned_cols=21  Identities=14%  Similarity=0.325  Sum_probs=13.9

Q ss_pred             EEEEEEEcCCCCeE-EEEEeeC
Q 028956           33 RSAIKIKNTSKSHV-AFKFQTT   53 (201)
Q Consensus        33 ~~~LtLtN~S~~~V-AFKVKTT   53 (201)
                      .-.++|+|.+..+| -+++.+.
T Consensus        20 qy~v~I~N~~~~~I~~~~i~~~   41 (80)
T PF09478_consen   20 QYDVTITNNGSKPIKSLKISID   41 (80)
T ss_pred             EEEEEEEECCCCeEEEEEEEEC
Confidence            45789999998666 3444433


No 89 
>KOG1769 consensus Ubiquitin-like proteins [Posttranslational modification, protein turnover, chaperones]
Probab=20.69  E-value=1.2e+02  Score=23.18  Aligned_cols=26  Identities=27%  Similarity=0.396  Sum_probs=19.5

Q ss_pred             eEEEEEEEcCCCCeEEEEEeeCCCcc
Q 028956           32 VRSAIKIKNTSKSHVAFKFQTTAPKS   57 (201)
Q Consensus        32 ~~~~LtLtN~S~~~VAFKVKTTaP~~   57 (201)
                      -.-.|++.+-...-+-||||.++|-+
T Consensus        19 ~hi~LKV~gqd~~~~~Fkikr~t~Lk   44 (99)
T KOG1769|consen   19 EHINLKVKGQDGSVVVFKIKRHTPLK   44 (99)
T ss_pred             ceEEEEEecCCCCEEEEEeecCChHH
Confidence            34567777855677899999988854


No 90 
>COG3311 AlpA Predicted transcriptional regulator [Transcription]
Probab=20.46  E-value=51  Score=23.62  Aligned_cols=22  Identities=41%  Similarity=0.737  Sum_probs=15.3

Q ss_pred             CCCCCCceee--eeeehhHHHHHHH
Q 028956          165 PEDTGPRIIG--EGLVIDEWKERRE  187 (201)
Q Consensus       165 ~~~~~~~~~~--~~~~~~~~~~~r~  187 (201)
                      +.-.|+++++  |. .||+|.++|-
T Consensus        42 pvklG~r~v~W~~S-EI~~Wi~~~~   65 (70)
T COG3311          42 PVKLGGRSVAWPES-EIDEWIASRK   65 (70)
T ss_pred             CeecCcccccccHH-HHHHHHHHHH
Confidence            3445666554  55 7999999885


No 91 
>PRK15274 putative periplasmic fimbrial chaperone protein SteC; Provisional
Probab=20.41  E-value=2.3e+02  Score=25.00  Aligned_cols=66  Identities=11%  Similarity=0.142  Sum_probs=37.9

Q ss_pred             ceeCceeEEccCC---CceeEEEccCCCCceEEEEEEEcCCCCeEEEE-EeeCCCcc-eeecCCceeeCCCCeEEEEE
Q 028956            5 RFLPLHDHFYPNG---AVSSLILPISSTFPVRSAIKIKNTSKSHVAFK-FQTTAPKS-CFMRPPGAILAPGESLIATV   77 (201)
Q Consensus         5 ~~~p~~L~i~P~~---~~~eL~F~~~~~k~~~~~LtLtN~S~~~VAFK-VKTTaP~~-Y~VrP~~GiI~Pg~s~~I~V   77 (201)
                      -|=|-.|...|.+   ....|.|....+     .|+++|+|..+|.|. ++-...+. ..+  ..++|+|.++..+.+
T Consensus       139 FYRP~~L~~~~~~~~~~~~~l~~~~~g~-----~l~v~NpTPYyvtl~~i~~~~~~~~~~~--~~~mi~Pfs~~~~~l  209 (257)
T PRK15274        139 FYRPVAVARQVDKTHPWQTKLTLTYQGD-----GVIFDNPTPFYLVISNAGSKENETASGF--KNLLIAPREKVTSPI  209 (257)
T ss_pred             EEcccccCCCchhcchhhceEEEEEeCC-----EEEEECCCCcEEEeeeeEeccCcccccC--CCceECCCCccEEec
Confidence            3445556444442   123566654322     499999999999885 22211111 011  346899999877754


Done!