Query         psy4837
Match_columns 100
No_of_seqs    107 out of 420
Neff          4.0 
Searched_HMMs 46136
Date          Fri Aug 16 19:01:03 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy4837.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/4837hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 PF04442 CtaG_Cox11:  Cytochrom 100.0   8E-42 1.7E-46  251.9   4.8   86    3-88     53-143 (152)
  2 PTZ00128 cytochrome c oxidase  100.0 1.4E-40   3E-45  259.1   8.3   87    3-89    124-215 (232)
  3 PRK05089 cytochrome C oxidase  100.0 1.4E-39   3E-44  246.9   8.1   77    3-79     80-161 (188)
  4 COG3175 COX11 Cytochrome oxida 100.0 4.2E-39 9.1E-44  245.2   7.1   87    3-89     79-170 (195)
  5 KOG2540|consensus              100.0 5.3E-34 1.1E-38  223.9   5.8   88    4-91    150-242 (269)
  6 PF06030 DUF916:  Bacterial pro  84.0     2.8 6.2E-05   29.5   4.8   69   10-79     20-106 (121)
  7 PF14874 PapD-like:  Flagellar-  82.3     4.1 8.9E-05   26.4   4.8   55   12-74     15-73  (102)
  8 PF07610 DUF1573:  Protein of u  81.1     4.1   9E-05   23.8   4.0   36   23-71      2-41  (45)
  9 PF10633 NPCBM_assoc:  NPCBM-as  81.0    0.93   2E-05   28.7   1.3   62   13-80      1-64  (78)
 10 TIGR02756 TraK_Ftype type-F co  80.7     2.6 5.7E-05   32.7   3.9   47   17-71    182-228 (232)
 11 PRK15218 fimbrial chaperone pr  66.4      19 0.00041   27.9   5.5   67   12-82     27-102 (226)
 12 PRK15192 fimbrial chaperone Bc  66.2      19 0.00042   28.1   5.6   66   12-83     31-109 (234)
 13 PRK15224 pili assembly chapero  63.6      22 0.00047   27.9   5.4   66   12-83     37-108 (237)
 14 PRK15246 fimbrial assembly cha  63.5      22 0.00048   27.6   5.4   64   13-82     20-95  (233)
 15 PF13473 Cupredoxin_1:  Cupredo  63.2      24 0.00053   23.1   4.9   45    8-71     34-78  (104)
 16 PRK15295 fimbrial assembly cha  62.0      24 0.00051   27.2   5.3   67   12-82     28-100 (226)
 17 PF06586 TraK:  TraK protein;    61.9      12 0.00026   28.2   3.6   47   17-71    186-232 (234)
 18 PF07705 CARDB:  CARDB;  InterP  60.9      16 0.00036   22.6   3.6   55    9-71     11-65  (101)
 19 PRK15233 putative fimbrial cha  60.1      27 0.00059   27.7   5.5   65   12-82     49-119 (246)
 20 PRK15253 putative fimbrial ass  58.2      29 0.00063   27.2   5.3   67   12-82     42-117 (242)
 21 PF07177 Neuralized:  Neuralize  57.1     5.7 0.00012   25.6   1.0   17   55-71     31-47  (69)
 22 PRK15211 fimbrial chaperone pr  57.0      32  0.0007   26.7   5.3   66   12-82     31-102 (229)
 23 PRK11385 putativi pili assembl  52.4      41 0.00088   26.3   5.2   72   12-83     35-114 (236)
 24 PF11611 DUF4352:  Domain of un  51.9      27 0.00058   22.8   3.6   64   16-80     35-105 (123)
 25 PF00207 A2M:  Alpha-2-macroglo  51.9      33 0.00071   22.2   4.0   26   12-37     65-90  (92)
 26 PF06205 GT36_AF:  Glycosyltran  51.8      21 0.00045   23.7   3.1   19    7-25     64-82  (90)
 27 PF01345 DUF11:  Domain of unkn  50.8      35 0.00077   21.1   3.9   35    9-43     33-67  (76)
 28 PRK13736 conjugal transfer pro  49.5      33 0.00072   27.0   4.4   47   17-71    185-233 (245)
 29 PRK09926 putative chaperone pr  49.4      45 0.00098   25.9   5.1   64   13-82     35-110 (246)
 30 TIGR03079 CH4_NH3mon_ox_B meth  47.4      57  0.0012   28.1   5.7   55   15-71    280-348 (399)
 31 PF14646 MYCBPAP:  MYCBP-associ  46.6      68  0.0015   26.6   5.9   64    8-71    238-305 (426)
 32 PF02752 Arrestin_C:  Arrestin   44.9      52  0.0011   21.2   4.2   29    9-37     12-40  (136)
 33 PF04744 Monooxygenase_B:  Mono  44.4      47   0.001   28.4   4.7   57   14-71    260-329 (381)
 34 PRK15249 fimbrial chaperone pr  43.4      66  0.0014   25.2   5.1   65   14-83     39-115 (253)
 35 smart00588 NEUZ domain in neur  41.8      25 0.00054   24.8   2.4   17   55-71     32-48  (123)
 36 PF09624 DUF2393:  Protein of u  39.2 1.3E+02  0.0029   21.0   6.4   59   13-71     58-127 (149)
 37 PF14310 Fn3-like:  Fibronectin  39.0      52  0.0011   20.4   3.3   19    9-27     25-43  (71)
 38 TIGR02656 cyanin_plasto plasto  39.0 1.1E+02  0.0024   20.0   5.5   56    9-71     17-72  (99)
 39 PRK02710 plastocyanin; Provisi  37.5   1E+02  0.0022   21.0   4.9   46    9-71     47-92  (119)
 40 PRK15274 putative periplasmic   32.8 1.2E+02  0.0025   24.2   5.1   65   12-82     35-109 (257)
 41 TIGR01451 B_ant_repeat conserv  32.8 1.1E+02  0.0023   18.4   3.9   35   10-44      5-39  (53)
 42 PF08308 PEGA:  PEGA domain;  I  32.4      90   0.002   18.9   3.6   22    5-26     47-68  (71)
 43 PF00362 Integrin_beta:  Integr  32.0      77  0.0017   26.9   4.2   37    2-38     72-108 (426)
 44 PF00127 Copper-bind:  Copper b  30.3 1.4E+02  0.0031   19.4   4.5   56    8-71     16-72  (99)
 45 TIGR03096 nitroso_cyanin nitro  29.8      80  0.0017   23.1   3.5   44    9-71     61-104 (135)
 46 KOG3960|consensus               29.6      12 0.00026   30.8  -1.0    9   86-94    149-157 (284)
 47 PF02018 CBM_4_9:  Carbohydrate  29.5 1.4E+02   0.003   19.0   4.2   25    9-33     54-78  (131)
 48 PRK11055 galM galactose-1-epim  29.5 1.1E+02  0.0024   24.8   4.5   44    8-62    265-308 (342)
 49 cd06184 flavohem_like_fad_nad_  29.4 1.5E+02  0.0032   21.8   4.9   24    2-25     24-49  (247)
 50 PF14415 DUF4424:  Domain of un  29.0      87  0.0019   24.9   3.8   23   17-39      1-23  (253)
 51 PRK15195 fimbrial chaperone pr  28.7 1.6E+02  0.0036   22.7   5.2   63   14-82     36-105 (229)
 52 smart00391 MBD Methyl-CpG bind  28.1      25 0.00055   23.0   0.6   16   49-64     56-71  (77)
 53 PF11325 DUF3127:  Domain of un  27.4      61  0.0013   22.0   2.3   30   12-56     52-81  (84)
 54 PRK15188 fimbrial chaperone pr  26.1   2E+02  0.0043   22.4   5.3   65   12-82     36-107 (228)
 55 PF06650 DUF1162:  Protein of u  26.0 2.1E+02  0.0045   21.4   5.2   22   55-76    252-275 (277)
 56 PF00345 PapD_N:  Pili and flag  25.2   2E+02  0.0042   19.1   4.5   71    8-81      6-81  (122)
 57 PF00963 Cohesin:  Cohesin doma  25.2 2.3E+02  0.0049   19.3   5.1   36    9-44      6-41  (141)
 58 PF01429 MBD:  Methyl-CpG bindi  25.1      31 0.00067   22.2   0.5   18   48-65     59-76  (77)
 59 KOG0360|consensus               24.9      36 0.00078   30.3   1.0   29   51-79    235-265 (545)
 60 PRK13202 ureB urease subunit b  24.7   1E+02  0.0022   22.0   3.1   67    5-71      7-80  (104)
 61 cd05711 Ig_FcalphaRI Immunoglo  24.4 1.2E+02  0.0027   19.5   3.3   14   37-50     58-71  (94)
 62 PF04314 DUF461:  Protein of un  24.4 1.5E+02  0.0033   20.0   3.9   53   17-71     14-73  (110)
 63 TIGR02636 galM_Leloir galactos  24.3 1.4E+02   0.003   23.9   4.2   46    8-64    260-305 (335)
 64 PRK00415 rps27e 30S ribosomal   24.0      30 0.00065   22.3   0.3   25   47-71      6-31  (59)
 65 smart00187 INB Integrin beta s  23.7 1.2E+02  0.0026   26.2   3.9   34    2-35     69-102 (423)
 66 KOG0121|consensus               23.7      29 0.00062   26.3   0.2   13   49-61     72-85  (153)
 67 PF00635 Motile_Sperm:  MSP (Ma  23.4 1.5E+02  0.0032   18.9   3.5   61    2-71      2-63  (109)
 68 PTZ00485 aldolase 1-epimerase;  23.1      94   0.002   26.0   3.1   48    7-64    292-339 (376)
 69 PF13815 Dzip-like_N:  Iguana/D  23.1      18 0.00038   25.1  -1.0   40   56-99      1-46  (118)
 70 PF06280 DUF1034:  Fn3-like dom  21.2 1.5E+02  0.0033   19.6   3.3   26    4-29     57-82  (112)
 71 PF07883 Cupin_2:  Cupin domain  20.9 1.9E+02   0.004   16.7   4.3   35    4-38     32-70  (71)
 72 PRK13203 ureB urease subunit b  20.7 2.5E+02  0.0054   19.9   4.4   66    5-71      7-79  (102)
 73 COG1470 Predicted membrane pro  20.6 2.6E+02  0.0056   25.0   5.4   64    7-71    274-337 (513)
 74 PF12690 BsuPI:  Intracellular   20.5 1.3E+02  0.0029   19.6   2.9   49   19-71      2-64  (82)

No 1  
>PF04442 CtaG_Cox11:  Cytochrome c oxidase assembly protein CtaG/Cox11;  InterPro: IPR007533 Cytochrome c oxidase assembly protein is essential for the assembly of functional cytochrome oxidase protein. In eukaryotes it is an integral protein of the mitochondrial inner membrane. Cox11 is essential for the insertion of Cu(I) ions to form the CuB site. This is essential for the stability of other structures in subunit I, for example haems a and a3, and the magnesium/manganese centre. Cox11 is probably only required in sub-stoichiometric amounts relative to the structural units []. The C-terminal region of the protein is known to form a dimer. Each monomer coordinates one Cu(I) ion via three conserved cysteine residues (111, 208 and 210) in Saccharomyces cerevisiae (P19516 from SWISSPROT). Met 224 is also thought to play a role in copper transfer or stabilising the copper site [].; GO: 0005507 copper ion binding; PDB: 1SO9_A 1SP0_A.
Probab=100.00  E-value=8e-42  Score=251.94  Aligned_cols=86  Identities=45%  Similarity=0.785  Sum_probs=64.5

Q ss_pred             eeceeceEEEeecCCeEEEEEEEECCCCCcEEEEEeCcccccchhhcccceeeeeccccccCCCCeeec-----cCCCCC
Q psy4837           3 TTHRYLLTLHVYPGETALAFYTAKNPTNSPIVGISTYNVVPFDAGQYFNKIQCFCFEEQQLNPHEELQI-----FAPKKP   77 (100)
Q Consensus         3 ~~~p~q~~v~V~pGE~~~v~y~a~N~sd~~v~GqAvpsVsP~~A~~YF~KieCFCF~eQ~L~pgE~~~M-----vDP~i~   77 (100)
                      ..+|+|++|.|||||+++++|+|+|++|++++|||+|||+|++||.||||||||||+||+|+|||+++|     |||+|+
T Consensus        53 ~F~P~q~~v~V~pGe~~~~~y~a~N~s~~~i~g~A~~nV~P~~a~~YF~KieCFCF~eQ~L~pgE~~~mPv~F~IDp~i~  132 (152)
T PF04442_consen   53 EFKPEQRSVKVHPGETALVFYEATNPSDKPITGQAIPNVTPGEAGKYFNKIECFCFEEQTLAPGETVDMPVVFYIDPDIP  132 (152)
T ss_dssp             EEE-S-SEEEEETT--EEEEEEEEE-SSS-EE---EEEE-SSS-STTECCS-TTS-S--EE-TT-EEEEEEEEEE-GGGG
T ss_pred             EEEeeeeeEEeCCCCEEEEEEEEECCCCCcEEEEEeeeECHHHhhhhccccceEeccCcCcCCCCeEEEEEEEEECCccc
Confidence            568999999999999999999999999999999999999999999999999999999999999999999     999999


Q ss_pred             CCCchhhhccc
Q psy4837          78 HHNEHKMRSQI   88 (100)
Q Consensus        78 ~d~~~~~~~~~   88 (100)
                      +|.++++++.|
T Consensus       133 ~d~~~~~v~~i  143 (152)
T PF04442_consen  133 EDPDMKDVKTI  143 (152)
T ss_dssp             SSTTTTT--BE
T ss_pred             CCcccCCcCEE
Confidence            99999999987


No 2  
>PTZ00128 cytochrome c oxidase assembly protein-like; Provisional
Probab=100.00  E-value=1.4e-40  Score=259.07  Aligned_cols=87  Identities=48%  Similarity=0.872  Sum_probs=84.8

Q ss_pred             eeceeceEEEeecCCeEEEEEEEECCCCCcEEEEEeCcccccchhhcccceeeeeccccccCCCCeeec-----cCCCCC
Q psy4837           3 TTHRYLLTLHVYPGETALAFYTAKNPTNSPIVGISTYNVVPFDAGQYFNKIQCFCFEEQQLNPHEELQI-----FAPKKP   77 (100)
Q Consensus         3 ~~~p~q~~v~V~pGE~~~v~y~a~N~sd~~v~GqAvpsVsP~~A~~YF~KieCFCF~eQ~L~pgE~~~M-----vDP~i~   77 (100)
                      ..+|+|++|.|||||+++++|+|+|++|++++|||+|||+|++||.||||||||||+||+|+|||+++|     |||+|+
T Consensus       124 ~F~P~q~~v~V~pGE~~lv~Y~a~N~sd~~i~G~A~ynV~P~~Ag~YFnKieCFCF~eQ~L~pgE~~~MPV~F~IDP~i~  203 (232)
T PTZ00128        124 EFEPLQKEVEVLPGETALAFYRAKNRSDKPVIGVATYHIAPPEAGLYFNKIQCFCFEEQRLNPHEEVDMPVFFYIDPDIL  203 (232)
T ss_pred             eEEeeeeEEEEcCCCeEEEEEEEECCCCCcEEEEEecccCHHHHhhhccceeeecccccccCCCCeEecCEEEEECCCCC
Confidence            568999999999999999999999999999999999999999999999999999999999999999999     999999


Q ss_pred             CCCchhhhcccc
Q psy4837          78 HHNEHKMRSQIL   89 (100)
Q Consensus        78 ~d~~~~~~~~~~   89 (100)
                      +|.++++++.|-
T Consensus       204 ~D~~~~~v~~IT  215 (232)
T PTZ00128        204 NDPRLKWVDEIT  215 (232)
T ss_pred             CCcccCCcCEEE
Confidence            999999999874


No 3  
>PRK05089 cytochrome C oxidase assembly protein; Provisional
Probab=100.00  E-value=1.4e-39  Score=246.89  Aligned_cols=77  Identities=42%  Similarity=0.747  Sum_probs=75.9

Q ss_pred             eeceeceEEEeecCCeEEEEEEEECCCCCcEEEEEeCcccccchhhcccceeeeeccccccCCCCeeec-----cCCCCC
Q psy4837           3 TTHRYLLTLHVYPGETALAFYTAKNPTNSPIVGISTYNVVPFDAGQYFNKIQCFCFEEQQLNPHEELQI-----FAPKKP   77 (100)
Q Consensus         3 ~~~p~q~~v~V~pGE~~~v~y~a~N~sd~~v~GqAvpsVsP~~A~~YF~KieCFCF~eQ~L~pgE~~~M-----vDP~i~   77 (100)
                      ..+|+|++|+|||||+++++|+|+|++|++++|||+|||+|++||.||||||||||+||+|+|||+++|     |||+|+
T Consensus        80 ~F~P~q~~v~V~pGE~~~~~y~a~N~sd~~i~g~A~~nV~P~~a~~YF~KieCFCF~eQ~L~pgE~~~mPV~F~IDP~i~  159 (188)
T PRK05089         80 EFKPEQRSVDVHPGELNLVFYEAENLSDRPIVGQAIPSVTPGQAGAYFNKIECFCFTQQTLQPGETREMPVVFYVDPDLP  159 (188)
T ss_pred             eEEeeeeEEEEcCCCeEEEEEEEECCCCCcEEEEEecccCHHHHhhhccceeeecccCcccCCCCeEecCEEEEECCCcc
Confidence            568999999999999999999999999999999999999999999999999999999999999999999     999999


Q ss_pred             CC
Q psy4837          78 HH   79 (100)
Q Consensus        78 ~d   79 (100)
                      +|
T Consensus       160 ~d  161 (188)
T PRK05089        160 KD  161 (188)
T ss_pred             cc
Confidence            99


No 4  
>COG3175 COX11 Cytochrome oxidase assembly factor [Posttranslational modification, protein turnover, chaperones]
Probab=100.00  E-value=4.2e-39  Score=245.23  Aligned_cols=87  Identities=43%  Similarity=0.712  Sum_probs=84.5

Q ss_pred             eeceeceEEEeecCCeEEEEEEEECCCCCcEEEEEeCcccccchhhcccceeeeeccccccCCCCeeec-----cCCCCC
Q psy4837           3 TTHRYLLTLHVYPGETALAFYTAKNPTNSPIVGISTYNVVPFDAGQYFNKIQCFCFEEQQLNPHEELQI-----FAPKKP   77 (100)
Q Consensus         3 ~~~p~q~~v~V~pGE~~~v~y~a~N~sd~~v~GqAvpsVsP~~A~~YF~KieCFCF~eQ~L~pgE~~~M-----vDP~i~   77 (100)
                      ...|.|++|.|||||+++++|+|+|++|++++|||+|||+|++||+||||+|||||+||+|+|||+++|     ||||+.
T Consensus        79 ~F~p~q~~v~v~pGet~~~~y~a~N~sd~~itg~A~~nv~P~~Ag~YF~KveCFCFteq~L~pgE~vemPV~FfVDpd~~  158 (195)
T COG3175          79 RFRPVQREVYVRPGETNLIFYEAENLSDKPITGQATYNVAPGQAGAYFNKVECFCFTEQTLKPGETVEMPVVFFVDPDFA  158 (195)
T ss_pred             eeEecCceeEeccCceEEEEEEEecCCCCCceeEEecccChhHhhhheeeeeEEEeeecccCCCCeEeccEEEEECcccc
Confidence            357999999999999999999999999999999999999999999999999999999999999999999     999999


Q ss_pred             CCCchhhhcccc
Q psy4837          78 HHNEHKMRSQIL   89 (100)
Q Consensus        78 ~d~~~~~~~~~~   89 (100)
                      +|++++|++.|.
T Consensus       159 ~dPe~kdvk~iT  170 (195)
T COG3175         159 DDPEMKDVKTIT  170 (195)
T ss_pred             cCcccCCCCeEE
Confidence            999999999874


No 5  
>KOG2540|consensus
Probab=100.00  E-value=5.3e-34  Score=223.95  Aligned_cols=88  Identities=49%  Similarity=0.819  Sum_probs=85.4

Q ss_pred             eceeceEEEeecCCeEEEEEEEECCCCCcEEEEEeCcccccchhhcccceeeeeccccccCCCCeeec-----cCCCCCC
Q psy4837           4 THRYLLTLHVYPGETALAFYTAKNPTNSPIVGISTYNVVPFDAGQYFNKIQCFCFEEQQLNPHEELQI-----FAPKKPH   78 (100)
Q Consensus         4 ~~p~q~~v~V~pGE~~~v~y~a~N~sd~~v~GqAvpsVsP~~A~~YF~KieCFCF~eQ~L~pgE~~~M-----vDP~i~~   78 (100)
                      ..|.|++|.|+|||++++||.|+|.||++++|.|+||++|++||.||||||||||+||.|.|||+++|     |||+...
T Consensus       150 f~PqQrEiyV~PGEtALaFYta~N~sdkpIiGvstYni~P~~Aa~YFnKiqCFCFEEQ~L~pgE~vDmPVFFyIDPefa~  229 (269)
T KOG2540|consen  150 FTPQQREIYVLPGETALAFYTAENPSDKPIIGVSTYNITPGQAAVYFNKIQCFCFEEQKLNPGEQVDMPVFFYIDPEFAT  229 (269)
T ss_pred             ccccceEEEEcCCcceeeeEeccCCCCCCceeeEeeccCccHhhhheeceeEEeehhhccCCCcccCcceEEEeCccccc
Confidence            57999999999999999999999999999999999999999999999999999999999999999999     9999999


Q ss_pred             CCchhhhcccccc
Q psy4837          79 HNEHKMRSQILPK   91 (100)
Q Consensus        79 d~~~~~~~~~~~~   91 (100)
                      |..+++++.||=.
T Consensus       230 DP~m~~id~i~Ls  242 (269)
T KOG2540|consen  230 DPAMDGIDDILLS  242 (269)
T ss_pred             CcccccccceEEE
Confidence            9999999998743


No 6  
>PF06030 DUF916:  Bacterial protein of unknown function (DUF916);  InterPro: IPR010317 This family consists of putative cell surface proteins, from Firmicutes, of unknown function. 
Probab=83.97  E-value=2.8  Score=29.51  Aligned_cols=69  Identities=20%  Similarity=0.223  Sum_probs=45.1

Q ss_pred             EEEeecCCeEEEEEEEECCCCCcEEEEEeC-----------cccccchh------hcccceeeeeccccccCCCCeeec-
Q psy4837          10 TLHVYPGETALAFYTAKNPTNSPIVGISTY-----------NVVPFDAG------QYFNKIQCFCFEEQQLNPHEELQI-   71 (100)
Q Consensus        10 ~v~V~pGE~~~v~y~a~N~sd~~v~GqAvp-----------sVsP~~A~------~YF~KieCFCF~eQ~L~pgE~~~M-   71 (100)
                      .+.+.||+...+...++|.++++++-.+..           .+++....      .-|.++-=.=- +-+|.|||++.. 
T Consensus        20 dL~~~P~q~~~l~v~i~N~s~~~~tv~v~~~~A~Tn~nG~I~Y~~~~~~~d~sl~~~~~~~v~~~~-~Vtl~~~~sk~V~   98 (121)
T PF06030_consen   20 DLKVKPGQKQTLEVRITNNSDKEITVKVSANTATTNDNGVIDYSQNNPKKDKSLKYPFSDLVKIPK-EVTLPPNESKTVT   98 (121)
T ss_pred             EEEeCCCCEEEEEEEEEeCCCCCEEEEEEEeeeEecCCEEEEECCCCcccCcccCcchHHhccCCc-EEEECCCCEEEEE
Confidence            477999999999999999999998754432           22222211      13333322222 278999999988 


Q ss_pred             cCCCCCCC
Q psy4837          72 FAPKKPHH   79 (100)
Q Consensus        72 vDP~i~~d   79 (100)
                      +.=.+|+.
T Consensus        99 ~~i~~P~~  106 (121)
T PF06030_consen   99 FTIKMPKK  106 (121)
T ss_pred             EEEEcCCC
Confidence            65555654


No 7  
>PF14874 PapD-like:  Flagellar-associated PapD-like
Probab=82.32  E-value=4.1  Score=26.36  Aligned_cols=55  Identities=25%  Similarity=0.413  Sum_probs=38.5

Q ss_pred             EeecCCeEEEEEEEECCCCCcEEEEEeCccc-ccchhhcccceeeeeccccccCCCCeeec---cCC
Q psy4837          12 HVYPGETALAFYTAKNPTNSPIVGISTYNVV-PFDAGQYFNKIQCFCFEEQQLNPHEELQI---FAP   74 (100)
Q Consensus        12 ~V~pGE~~~v~y~a~N~sd~~v~GqAvpsVs-P~~A~~YF~KieCFCF~eQ~L~pgE~~~M---vDP   74 (100)
                      .|..|+.+.....++|.+..+..    |.+. |.....+|   .+. ..+..|.||++.++   +.|
T Consensus        15 ~v~~g~~~~~~v~l~N~s~~p~~----f~v~~~~~~~~~~---~v~-~~~g~l~PG~~~~~~V~~~~   73 (102)
T PF14874_consen   15 NVFVGQTYSRTVTLTNTSSIPAR----FRVRQPESLSSFF---SVE-PPSGFLAPGESVELEVTFSP   73 (102)
T ss_pred             EEccCCEEEEEEEEEECCCCCEE----EEEEeCCcCCCCE---EEE-CCCCEECCCCEEEEEEEEEe
Confidence            57899999999999999999854    3332 22222222   222 35789999999999   663


No 8  
>PF07610 DUF1573:  Protein of unknown function (DUF1573);  InterPro: IPR011467 These hypothetical proteins from bacteria, such as Rhodopirellula baltica, Bacteroides thetaiotaomicron and Porphyromonas gingivalis, share a region of conserved sequence towards their N termini.
Probab=81.07  E-value=4.1  Score=23.84  Aligned_cols=36  Identities=25%  Similarity=0.484  Sum_probs=28.1

Q ss_pred             EEEECCCCCcEEEEEeCcccccchhhcccceeeeec----cccccCCCCeeec
Q psy4837          23 YTAKNPTNSPIVGISTYNVVPFDAGQYFNKIQCFCF----EEQQLNPHEELQI   71 (100)
Q Consensus        23 y~a~N~sd~~v~GqAvpsVsP~~A~~YF~KieCFCF----~eQ~L~pgE~~~M   71 (100)
                      |.++|.+++++.=..+             +..|=|.    ...+|+|||+..+
T Consensus         2 F~~~N~g~~~L~I~~v-------------~tsCgCt~~~~~~~~i~PGes~~i   41 (45)
T PF07610_consen    2 FEFTNTGDSPLVITDV-------------QTSCGCTTAEYSKKPIAPGESGKI   41 (45)
T ss_pred             EEEEECCCCcEEEEEe-------------eEccCCEEeeCCcceECCCCEEEE
Confidence            8899999999876554             3567774    5688999998765


No 9  
>PF10633 NPCBM_assoc:  NPCBM-associated, NEW3 domain of alpha-galactosidase;  InterPro: IPR018905 This domain has been named NEW3, but its function is not known. It is found on proteins which are bacterial galactosidases [].; PDB: 1EUT_A 2BZD_A 1WCQ_C 2BER_A 1W8O_A 1EUU_A 1W8N_A.
Probab=80.99  E-value=0.93  Score=28.69  Aligned_cols=62  Identities=23%  Similarity=0.318  Sum_probs=29.9

Q ss_pred             eecCCeEEEEEEEECCCCCcEEE-EEeCcccccchhhcccceeeeeccccccCCCCeeec-cCCCCCCCC
Q psy4837          13 VYPGETALAFYTAKNPTNSPIVG-ISTYNVVPFDAGQYFNKIQCFCFEEQQLNPHEELQI-FAPKKPHHN   80 (100)
Q Consensus        13 V~pGE~~~v~y~a~N~sd~~v~G-qAvpsVsP~~A~~YF~KieCFCF~eQ~L~pgE~~~M-vDP~i~~d~   80 (100)
                      |.||+...+...++|..+.++.. ....++ |.-...-+.--+-     ..|.|||++.. +.=..|.|.
T Consensus         1 v~~G~~~~~~~tv~N~g~~~~~~v~~~l~~-P~GW~~~~~~~~~-----~~l~pG~s~~~~~~V~vp~~a   64 (78)
T PF10633_consen    1 VTPGETVTVTLTVTNTGTAPLTNVSLSLSL-PEGWTVSASPASV-----PSLPPGESVTVTFTVTVPADA   64 (78)
T ss_dssp             --TTEEEEEEEEEE--SSS-BSS-EEEEE---TTSE---EEEEE-------B-TTSEEEEEEEEEE-TT-
T ss_pred             CCCCCEEEEEEEEEECCCCceeeEEEEEeC-CCCccccCCcccc-----ccCCCCCEEEEEEEEECCCCC
Confidence            57999999999999999877543 222333 5322211111111     18999999988 444445543


No 10 
>TIGR02756 TraK_Ftype type-F conjugative transfer system secretin TraK. The TraK protein is predicted to interact with the TraV and TraB proteins as part of the scaffold which extends from the inner membrane, through the periplasm to the cell envelope and through which the F-type conjugative pilus passes. TraK is homologous to the P-type IV secretion system protein TrbG, the Ti-type protein VirB9 and the I-type TraN protein. The protein is related to the secretin family especially the HrcC subgroup of the type III secretion system. The protein is hypothesized to oligomerize to form a ring structure akin to other secretins.
Probab=80.70  E-value=2.6  Score=32.69  Aligned_cols=47  Identities=23%  Similarity=0.236  Sum_probs=36.6

Q ss_pred             CeEEEEEEEECCCCCcEEEEEeCcccccchhhcccceeeeeccccccCCCCeeec
Q psy4837          17 ETALAFYTAKNPTNSPIVGISTYNVVPFDAGQYFNKIQCFCFEEQQLNPHEELQI   71 (100)
Q Consensus        17 E~~~v~y~a~N~sd~~v~GqAvpsVsP~~A~~YF~KieCFCF~eQ~L~pgE~~~M   71 (100)
                      +..-.-|+++|.+++++.-.--=-.+|+-.|.-|.+        +.|.|||+..+
T Consensus       182 ~l~g~~y~l~N~s~~~~~L~E~~F~~~gv~AVa~~~--------~~L~PGe~t~v  228 (232)
T TIGR02756       182 HLKGERFELENKTNSPLELTESWFWQPGTRAVALSK--------PQLAPGETADL  228 (232)
T ss_pred             CcEEEEEEEEcCCCCCeEechHHhCCcCcEEEEecc--------CccCCCCEEEE
Confidence            455678999999999998876666677666665544        79999998765


No 11 
>PRK15218 fimbrial chaperone protein PegB; Provisional
Probab=66.37  E-value=19  Score=27.94  Aligned_cols=67  Identities=13%  Similarity=0.057  Sum_probs=42.8

Q ss_pred             EeecCCeEEEEEEEECCCCCcEEEEEeCcc-----cccchhhcccceeeeeccccccCCCCeeec----cCCCCCCCCch
Q psy4837          12 HVYPGETALAFYTAKNPTNSPIVGISTYNV-----VPFDAGQYFNKIQCFCFEEQQLNPHEELQI----FAPKKPHHNEH   82 (100)
Q Consensus        12 ~V~pGE~~~v~y~a~N~sd~~v~GqAvpsV-----sP~~A~~YF~KieCFCF~eQ~L~pgE~~~M----vDP~i~~d~~~   82 (100)
                      -|.+|....+...+.|.++.+...|+--.-     +|.....-|-=+    =-=..|+||+...+    ....||+|+|.
T Consensus        27 vIy~~~~~~~si~i~N~~~~pyLvQsWvd~~~~~~~~~~~~~pFivt----PPlfRl~p~~~~~lRI~~~~~~LP~DRES  102 (226)
T PRK15218         27 IIYPAQKKDITVQLMNDGKRSSLIQAWIDNGDTSLPPEKLQVPFIMT----PPVIRVAANSGQQLKIKKLANNLPGDRES  102 (226)
T ss_pred             EEEcCCCcEEEEEEEcCCCCcEEEEEEEeCCCCCCCcccccCCEEEC----CCeEEECCCCceEEEEEECCCCCCcceeE
Confidence            467788889999999999999999984321     111110111100    01246889887777    66778888765


No 12 
>PRK15192 fimbrial chaperone BcfG; Provisional
Probab=66.15  E-value=19  Score=28.14  Aligned_cols=66  Identities=11%  Similarity=0.012  Sum_probs=42.8

Q ss_pred             EeecCCeEEEEEEEECCCCCcEEEEEeCc-------ccccchhhcccceeeeecc--ccccCCCCeeec----cCCCCCC
Q psy4837          12 HVYPGETALAFYTAKNPTNSPIVGISTYN-------VVPFDAGQYFNKIQCFCFE--EQQLNPHEELQI----FAPKKPH   78 (100)
Q Consensus        12 ~V~pGE~~~v~y~a~N~sd~~v~GqAvps-------VsP~~A~~YF~KieCFCF~--eQ~L~pgE~~~M----vDP~i~~   78 (100)
                      -+.+|....+...+.|.++.+...|+--.       -.+.....      =|=-+  -..|+||++..+    ....||+
T Consensus        31 vIy~~~~k~~sv~l~N~~~~p~LvQswv~~~~~w~~~~~~~~~~------PFivtPPlfrl~p~~~~~lRI~~~~~~LP~  104 (234)
T PRK15192         31 FIYHAGAPALSVPVSNHSEASWLIDTHILPGGRWPGTKNEGNIT------PFVVTPPLFMLSARQENSMRVVYTGAPLPA  104 (234)
T ss_pred             EEEcCCCceEEEEEEeCCCCcEEEEEEeccCccccccCCccccC------CEEEcCCeEEECCCCceEEEEEECCCCCCC
Confidence            46677888999999999999999998421       11101000      02111  246789988777    7777888


Q ss_pred             CCchh
Q psy4837          79 HNEHK   83 (100)
Q Consensus        79 d~~~~   83 (100)
                      |+|.-
T Consensus       105 DRESl  109 (234)
T PRK15192        105 DRESL  109 (234)
T ss_pred             cceEE
Confidence            87653


No 13 
>PRK15224 pili assembly chaperone protein SafB; Provisional
Probab=63.62  E-value=22  Score=27.93  Aligned_cols=66  Identities=23%  Similarity=0.234  Sum_probs=43.1

Q ss_pred             EeecCCeEEEEEEEECCCCCcEEEEEeCcccccchhhcccceeeeecc--ccccCCCCeeec----cCCCCCCCCchh
Q psy4837          12 HVYPGETALAFYTAKNPTNSPIVGISTYNVVPFDAGQYFNKIQCFCFE--EQQLNPHEELQI----FAPKKPHHNEHK   83 (100)
Q Consensus        12 ~V~pGE~~~v~y~a~N~sd~~v~GqAvpsVsP~~A~~YF~KieCFCF~--eQ~L~pgE~~~M----vDP~i~~d~~~~   83 (100)
                      -+.+|....+...++|.++.+...|+--.=.-+..     +. =|=-+  -..|+||+...+    ....||.|+|.=
T Consensus        37 vIy~~~~k~~sl~v~N~~~~pyLvQsWvd~~~~~~-----~~-pFivtPPlfRlep~~~~~lRI~~~~~~LP~DRESl  108 (237)
T PRK15224         37 VIYHAGTAGATLSVSNPQNYPILVQSSVKAADKSS-----PA-PFLVMPPLFRLEANQQSQLRIVRTGGDMPTDRETL  108 (237)
T ss_pred             EEEeCCCcEEEEEEEcCCCCcEEEEEEEeCCCCCc-----cC-CEEECCCeEEECCCCceEEEEEECCCCCCCceeEE
Confidence            45677778899999999999999998443111110     00 01111  246889987777    666789887753


No 14 
>PRK15246 fimbrial assembly chaperone StbE; Provisional
Probab=63.51  E-value=22  Score=27.63  Aligned_cols=64  Identities=11%  Similarity=0.129  Sum_probs=39.5

Q ss_pred             eecCCeEEEEEEEECCCCCcEEEEEeCc-----ccccchhhcccceeeeec--cccccCCCCeeec----c-CCCCCCCC
Q psy4837          13 VYPGETALAFYTAKNPTNSPIVGISTYN-----VVPFDAGQYFNKIQCFCF--EEQQLNPHEELQI----F-APKKPHHN   80 (100)
Q Consensus        13 V~pGE~~~v~y~a~N~sd~~v~GqAvps-----VsP~~A~~YF~KieCFCF--~eQ~L~pgE~~~M----v-DP~i~~d~   80 (100)
                      +.++....+...++|.++.+...|+--.     ..|.....-      |=-  --..|+||++..+    . +..||+|+
T Consensus        20 I~~~~~~~~sv~l~N~~~~p~LvQsWvd~~~~~~~p~~~~~p------FivtPPlfrl~~~~~~~lRI~~~~~~~LP~DR   93 (233)
T PRK15246         20 IFASDDVAQSLTLSNDNTTPMLLQVWTDAGNIDASPDNSKTP------LVALPPVFKMQPGELRTLRLLLSSRQQLATDR   93 (233)
T ss_pred             EEcCCCceEEEEEEeCCCCcEEEEEEEeCCCCccCcccccCc------EEECCcceEECCCCceEEEEEECCCCCCCCCc
Confidence            4556678899999999999999998221     111110000      111  1246788887776    5 35788887


Q ss_pred             ch
Q psy4837          81 EH   82 (100)
Q Consensus        81 ~~   82 (100)
                      |.
T Consensus        94 ES   95 (233)
T PRK15246         94 ES   95 (233)
T ss_pred             eE
Confidence            65


No 15 
>PF13473 Cupredoxin_1:  Cupredoxin-like domain; PDB: 1IBZ_D 1IC0_E 1IBY_D.
Probab=63.19  E-value=24  Score=23.15  Aligned_cols=45  Identities=18%  Similarity=0.203  Sum_probs=26.0

Q ss_pred             ceEEEeecCCeEEEEEEEECCCCCcEEEEEeCcccccchhhcccceeeeeccccccCCCCeeec
Q psy4837           8 LLTLHVYPGETALAFYTAKNPTNSPIVGISTYNVVPFDAGQYFNKIQCFCFEEQQLNPHEELQI   71 (100)
Q Consensus         8 q~~v~V~pGE~~~v~y~a~N~sd~~v~GqAvpsVsP~~A~~YF~KieCFCF~eQ~L~pgE~~~M   71 (100)
                      -..++|+.|+.+++.+  +|..+.+ ..-.++.               + .....|.||++..+
T Consensus        34 P~~i~v~~G~~v~l~~--~N~~~~~-h~~~i~~---------------~-~~~~~l~~g~~~~~   78 (104)
T PF13473_consen   34 PSTITVKAGQPVTLTF--TNNDSRP-HEFVIPD---------------L-GISKVLPPGETATV   78 (104)
T ss_dssp             S-EEEEETTCEEEEEE--EE-SSS--EEEEEGG---------------G-TEEEEE-TT-EEEE
T ss_pred             cCEEEEcCCCeEEEEE--EECCCCc-EEEEECC---------------C-ceEEEECCCCEEEE
Confidence            3589999999888664  6776665 3333333               1 12267888887766


No 16 
>PRK15295 fimbrial assembly chaperone SthB; Provisional
Probab=62.02  E-value=24  Score=27.22  Aligned_cols=67  Identities=18%  Similarity=0.160  Sum_probs=41.0

Q ss_pred             EeecCCeEEEEEEEECCCCCcEEEEEeCcccccchhhcccceeeeec--cccccCCCCeeec----cCCCCCCCCch
Q psy4837          12 HVYPGETALAFYTAKNPTNSPIVGISTYNVVPFDAGQYFNKIQCFCF--EEQQLNPHEELQI----FAPKKPHHNEH   82 (100)
Q Consensus        12 ~V~pGE~~~v~y~a~N~sd~~v~GqAvpsVsP~~A~~YF~KieCFCF--~eQ~L~pgE~~~M----vDP~i~~d~~~   82 (100)
                      -+.+|+...+...+.|.++.+...|+--.-.....    .+-.=|=-  --.+|+||++-.+    ..+.||.|.|.
T Consensus        28 vI~~~~~~~~si~i~N~~~~p~LvQsWv~~~~~~~----~~~~pFivtPPl~rl~p~~~q~lRI~~~~~~LP~DrEs  100 (226)
T PRK15295         28 LVFDGNNDESSINVENKDSKANLVQSWLSVVDPQV----TNKQAFIITPPLFRLDAGQKNSIRVIRSGAPLPADRES  100 (226)
T ss_pred             EEEeCCCceeEEEEEeCCCCcEEEEEEEeCCCCCC----CCCCCEEEcCCeEEECCCCceEEEEEECCCCCCCCceE
Confidence            35666778899999999999999987431110000    00000111  1235788987777    66778888765


No 17 
>PF06586 TraK:  TraK protein;  InterPro: IPR010563 This family consists of several TraK proteins from Escherichia coli, Salmonella typhi and Salmonella typhimurium. TraK is known to be essential for pilus assembly but its exact role in this process is unknown [].
Probab=61.91  E-value=12  Score=28.18  Aligned_cols=47  Identities=15%  Similarity=0.081  Sum_probs=34.1

Q ss_pred             CeEEEEEEEECCCCCcEEEEEeCcccccchhhcccceeeeeccccccCCCCeeec
Q psy4837          17 ETALAFYTAKNPTNSPIVGISTYNVVPFDAGQYFNKIQCFCFEEQQLNPHEELQI   71 (100)
Q Consensus        17 E~~~v~y~a~N~sd~~v~GqAvpsVsP~~A~~YF~KieCFCF~eQ~L~pgE~~~M   71 (100)
                      ...-.-|+++|.++.++.-.--==.+|+..|..|        +...|.|||+.+.
T Consensus       186 ~l~~~~y~v~N~~~~~v~l~E~~f~~~~v~AVa~--------~~~~L~PGe~t~v  232 (234)
T PF06586_consen  186 GLRGEVYRVTNTSDQPVELDERDFYSPGVRAVAL--------WPPTLAPGESTEV  232 (234)
T ss_pred             ceEEEEEEEEeCCCCCEEecHHHhCCCCcEEEEe--------cccccCCCCEEEE
Confidence            4556789999999998876554444576666655        4458999998764


No 18 
>PF07705 CARDB:  CARDB;  InterPro: IPR011635 The APHP (acidic peptide-dependent hydrolases/peptidase) domain is found in a variety of different proteins.; PDB: 2KUT_A 2L0D_A 3IDU_A 2KL6_A.
Probab=60.89  E-value=16  Score=22.63  Aligned_cols=55  Identities=20%  Similarity=0.231  Sum_probs=30.5

Q ss_pred             eEEEeecCCeEEEEEEEECCCCCcEEEEEeCcccccchhhcccceeeeeccccccCCCCeeec
Q psy4837           9 LTLHVYPGETALAFYTAKNPTNSPIVGISTYNVVPFDAGQYFNKIQCFCFEEQQLNPHEELQI   71 (100)
Q Consensus         9 ~~v~V~pGE~~~v~y~a~N~sd~~v~GqAvpsVsP~~A~~YF~KieCFCF~eQ~L~pgE~~~M   71 (100)
                      ..-.+.+|+..++.+.++|.......+.-+ .       .|...-..==..-..|+|||+...
T Consensus        11 ~~~~~~~g~~~~i~~~V~N~G~~~~~~~~v-~-------~~~~~~~~~~~~i~~L~~g~~~~v   65 (101)
T PF07705_consen   11 SPSNVVPGEPVTITVTVKNNGTADAENVTV-R-------LYLDGNSVSTVTIPSLAPGESETV   65 (101)
T ss_dssp             C-SEEETTSEEEEEEEEEE-SSS-BEEEEE-E-------EEETTEEEEEEEESEB-TTEEEEE
T ss_pred             CCCcccCCCEEEEEEEEEECCCCCCCCEEE-E-------EEECCceeccEEECCcCCCcEEEE
Confidence            345678999999999999998876433221 1       122211110011167888888776


No 19 
>PRK15233 putative fimbrial chaperone protein SefB; Provisional
Probab=60.13  E-value=27  Score=27.66  Aligned_cols=65  Identities=11%  Similarity=0.083  Sum_probs=41.7

Q ss_pred             EeecCCeEEEEEEEECCCCCcEEEEEeCcccccchhhcccceeeeec--cccccCCCCeeec----cCCCCCCCCch
Q psy4837          12 HVYPGETALAFYTAKNPTNSPIVGISTYNVVPFDAGQYFNKIQCFCF--EEQQLNPHEELQI----FAPKKPHHNEH   82 (100)
Q Consensus        12 ~V~pGE~~~v~y~a~N~sd~~v~GqAvpsVsP~~A~~YF~KieCFCF--~eQ~L~pgE~~~M----vDP~i~~d~~~   82 (100)
                      -+.+|....+...+.|.++.+...|+--.=.....     +. =|==  --..|+||+.-.+    ....||+|+|.
T Consensus        49 vIy~~~~~~~sl~i~N~~~~p~LvQsWvd~~~~~~-----~~-pFiVtPPLfRLep~~~~~lRIi~~~~~LP~DRES  119 (246)
T PRK15233         49 VIYKEDAPSTSFWIMNEKEYPILVQTQVYNDDKSS-----KA-PFIVTPPILKVESNARTRLKVIPTSNLFNKNEES  119 (246)
T ss_pred             EEEeCCCcEEEEEEEcCCCCcEEEEEEEecCCCCc-----cC-CEEECCCeEEECCCCceEEEEEECCCCCCcCceE
Confidence            45677778999999999999999998432111110     00 0110  1235788887777    67778888665


No 20 
>PRK15253 putative fimbrial assembly chaperone protein StcB; Provisional
Probab=58.16  E-value=29  Score=27.23  Aligned_cols=67  Identities=16%  Similarity=0.112  Sum_probs=42.1

Q ss_pred             EeecCCeEEEEEEEECCCCCcEEEEEeCcc-----cccchhhcccceeeeeccccccCCCCeeec----cCCCCCCCCch
Q psy4837          12 HVYPGETALAFYTAKNPTNSPIVGISTYNV-----VPFDAGQYFNKIQCFCFEEQQLNPHEELQI----FAPKKPHHNEH   82 (100)
Q Consensus        12 ~V~pGE~~~v~y~a~N~sd~~v~GqAvpsV-----sP~~A~~YF~KieCFCF~eQ~L~pgE~~~M----vDP~i~~d~~~   82 (100)
                      -|.+|....+.+.+.|.++.+...|+--.=     +|......|-=+    =--..|+||+...+    ....||+|+|.
T Consensus        42 vIy~~~~k~~sv~i~N~~~~pyLvQsWvd~~~~~~~~~~~~~pFivt----PPlfRl~p~~~~~lRI~~~~~~LP~DRES  117 (242)
T PRK15253         42 VIYPAEKKEVVVQLVNQGEQASLVQSWIDDGNTSLPPEKIQVPFMLT----PPVARVAAESGQQIKIKKMPNSLPDNKES  117 (242)
T ss_pred             EEEeCCCceEEEEEEcCCCCcEEEEEEEECCCCCCCcccccCCEEEC----CCeEEECCCCceEEEEEECCCCCCcceeE
Confidence            456777788999999999999999984421     111110111100    01246888887777    55568888765


No 21 
>PF07177 Neuralized:  Neuralized;  InterPro: IPR006573 NEUZ is a domain of unknown function found in neuralized proteins, i.e. proteins involved in the specification of the neuroblast during cellular differentiation. ; PDB: 2YUE_A 2E63_A.
Probab=57.13  E-value=5.7  Score=25.59  Aligned_cols=17  Identities=18%  Similarity=0.386  Sum_probs=12.5

Q ss_pred             eeeccccccCCCCeeec
Q psy4837          55 CFCFEEQQLNPHEELQI   71 (100)
Q Consensus        55 CFCF~eQ~L~pgE~~~M   71 (100)
                      +.+|++++|.+||.+++
T Consensus        31 giVFS~rPl~~~E~~~v   47 (69)
T PF07177_consen   31 GIVFSSRPLRIGEKFEV   47 (69)
T ss_dssp             -EEEESS-B-TT-EEEE
T ss_pred             eEEEecCCccCCCEEEE
Confidence            78999999999999998


No 22 
>PRK15211 fimbrial chaperone protein PefD; Provisional
Probab=56.98  E-value=32  Score=26.72  Aligned_cols=66  Identities=12%  Similarity=0.113  Sum_probs=42.4

Q ss_pred             EeecCCeEEEEEEEECCCCCcEEEEEeCcccccchhhcccceeeeecc--ccccCCCCeeec----cCCCCCCCCch
Q psy4837          12 HVYPGETALAFYTAKNPTNSPIVGISTYNVVPFDAGQYFNKIQCFCFE--EQQLNPHEELQI----FAPKKPHHNEH   82 (100)
Q Consensus        12 ~V~pGE~~~v~y~a~N~sd~~v~GqAvpsVsP~~A~~YF~KieCFCF~--eQ~L~pgE~~~M----vDP~i~~d~~~   82 (100)
                      -+.++....+...++|.++.+...|+--.-.......     .=|=-+  -..|+||++..+    .+..||+|+|.
T Consensus        31 vIy~~~~~~~si~i~N~~~~p~LvQswv~~~~~~~~~-----~pFivtPPlfrl~p~~~q~lRI~~~~~~LP~DRES  102 (229)
T PRK15211         31 FIYDEGRKNISFEVTNQADQTYGGQVWIDNTTQGSST-----VYMVPAPPFFKVRPKEKQIIRIMKTDSALPKDRES  102 (229)
T ss_pred             EEEcCCCceEEEEEEeCCCCcEEEEEEEecCCCCCcc-----CCEEEcCCeEEECCCCceEEEEEECCCCCCCCceE
Confidence            3566777889999999999999999854322111110     002221  246788987777    66678888765


No 23 
>PRK11385 putativi pili assembly chaperone; Provisional
Probab=52.44  E-value=41  Score=26.27  Aligned_cols=72  Identities=14%  Similarity=0.015  Sum_probs=42.5

Q ss_pred             EeecCCeEEEEEEEECCCCCcEEEEEeCcccccch-hhcccceeeeecc--ccccCCCCeeec----cC-CCCCCCCchh
Q psy4837          12 HVYPGETALAFYTAKNPTNSPIVGISTYNVVPFDA-GQYFNKIQCFCFE--EQQLNPHEELQI----FA-PKKPHHNEHK   83 (100)
Q Consensus        12 ~V~pGE~~~v~y~a~N~sd~~v~GqAvpsVsP~~A-~~YF~KieCFCF~--eQ~L~pgE~~~M----vD-P~i~~d~~~~   83 (100)
                      -+.++....+...+.|.++++...|+--.-.+..+ +.--++..=|=-+  =..|+||++..+    .. ..||+|+|.-
T Consensus        35 vIy~~~~~~~sv~l~N~~~~p~LvQswv~~~~~~~~~~~~~~~~pFivtPPlfrl~p~~~q~lRIi~~~~~~LP~DRESl  114 (236)
T PRK11385         35 FIFPADRESISILLTNTSQESWLINSKINRPTRWAGGEASTVPAPLLAAPPLILLKPGTTGTLRLLRTESDILPVDRETL  114 (236)
T ss_pred             EEEcCCCceEEEEEEeCCCCcEEEEEEcccCccccCcccccccCCEEEcCCeEEECCCCceEEEEEECCCCCCCCCceEE
Confidence            35567778899999999999999999443111000 0000000112222  236789998777    54 4789997753


No 24 
>PF11611 DUF4352:  Domain of unknown function (DUF4352);  InterPro: IPR021652 This entry is represented by Bacteriophage A118, Gp32. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This entry represents a group of putative lipoproteins of unknown function.; PDB: 3CFU_A.
Probab=51.94  E-value=27  Score=22.81  Aligned_cols=64  Identities=13%  Similarity=0.133  Sum_probs=30.9

Q ss_pred             CCeEEEEEEEECCCCCcEEEEEeCcccc-cchhhcccceeeee-----ccccccCCCCeeec-cCCCCCCCC
Q psy4837          16 GETALAFYTAKNPTNSPIVGISTYNVVP-FDAGQYFNKIQCFC-----FEEQQLNPHEELQI-FAPKKPHHN   80 (100)
Q Consensus        16 GE~~~v~y~a~N~sd~~v~GqAvpsVsP-~~A~~YF~KieCFC-----F~eQ~L~pgE~~~M-vDP~i~~d~   80 (100)
                      ++...|.+.++|.+++++.-... ...= ...+.-+.--.=..     +..+.|+||++++- +-=++|++.
T Consensus        35 ~~fv~v~v~v~N~~~~~~~~~~~-~f~l~d~~g~~~~~~~~~~~~~~~~~~~~i~pG~~~~g~l~F~vp~~~  105 (123)
T PF11611_consen   35 NKFVVVDVTVKNNGDEPLDFSPS-DFKLYDSDGNKYDPDFSASSNDNDLFSETIKPGESVTGKLVFEVPKDD  105 (123)
T ss_dssp             SEEEEEEEEEEE-SSS-EEEEGG-GEEEE-TT--B--EEE-CCCTTTB--EEEE-TT-EEEEEEEEEESTT-
T ss_pred             CEEEEEEEEEEECCCCcEEeccc-ceEEEeCCCCEEcccccchhccccccccEECCCCEEEEEEEEEECCCC
Confidence            56788999999999999864322 1111 11111111111111     55689999999988 333556653


No 25 
>PF00207 A2M:  Alpha-2-macroglobulin family;  InterPro: IPR001599 This entry contains serum complement C3 and C4 precursors and alpha-macrogrobulins.  The alpha-macroglobulin (aM) family of proteins includes protease inhibitors [], typified by the human tetrameric a2-macroglobulin (a2M); they belong to the MEROPS proteinase inhibitor family I39, clan IL. These protease inhibitors share several defining properties, which include (i) the ability to inhibit proteases from all catalytic classes, (ii) the presence of a 'bait region' and a thiol ester, (iii) a similar protease inhibitory mechanism and (iv) the inactivation of the inhibitory capacity by reaction of the thiol ester with small primary amines. aM protease inhibitors inhibit by steric hindrance []. The mechanism involves protease cleavage of the bait region, a segment of the aM that is particularly susceptible to proteolytic cleavage, which initiates a conformational change such that the aM collapses about the protease. In the resulting aM-protease complex, the active site of the protease is sterically shielded, thus substantially decreasing access to protein substrates. Two additional events occur as a consequence of bait region cleavage, namely (i) the h-cysteinyl-g-glutamyl thiol ester becomes highly reactive and (ii) a major conformational change exposes a conserved COOH-terminal receptor binding domain [] (RBD). RBD exposure allows the aM protease complex to bind to clearance receptors and be removed from circulation []. Tetrameric, dimeric, and, more recently, monomeric aM protease inhibitors have been identified [, ].; GO: 0004866 endopeptidase inhibitor activity; PDB: 3KLS_B 3PRX_C 3KM9_B 3PVM_C 3CU7_A 4E0S_A 4A5W_A 2PN5_A 3FRP_G 3HRZ_B ....
Probab=51.88  E-value=33  Score=22.20  Aligned_cols=26  Identities=15%  Similarity=-0.055  Sum_probs=20.9

Q ss_pred             EeecCCeEEEEEEEECCCCCcEEEEE
Q psy4837          12 HVYPGETALAFYTAKNPTNSPIVGIS   37 (100)
Q Consensus        12 ~V~pGE~~~v~y~a~N~sd~~v~GqA   37 (100)
                      .+..||...+...+.|.+++.+....
T Consensus        65 ~l~~GD~~~i~v~v~N~~~~~~~v~V   90 (92)
T PF00207_consen   65 SLRRGDQIQIPVTVFNYTDKDQEVTV   90 (92)
T ss_dssp             EEETTSEEEEEEEEEE-SSS-EEEEE
T ss_pred             EEecCCEEEEEEEEEeCCCCCEEEEE
Confidence            47899999999999999999887653


No 26 
>PF06205 GT36_AF:  Glycosyltransferase 36 associated family  ;  InterPro: IPR010403 This domain is found in the NvdB protein (P20471 from SWISSPROT), which is involved in the production of beta-(1-->2)-glucan.; PDB: 1V7V_A 1V7W_A 1V7X_A 3ACT_B 2CQT_A 3QFY_B 3QFZ_A 2CQS_A 3QG0_B 3AFJ_A ....
Probab=51.78  E-value=21  Score=23.71  Aligned_cols=19  Identities=16%  Similarity=0.315  Sum_probs=14.9

Q ss_pred             eceEEEeecCCeEEEEEEE
Q psy4837           7 YLLTLHVYPGETALAFYTA   25 (100)
Q Consensus         7 ~q~~v~V~pGE~~~v~y~a   25 (100)
                      .+..+++.|||...+.|..
T Consensus        64 l~~~v~L~PGe~~~v~f~l   82 (90)
T PF06205_consen   64 LQVRVTLEPGEEKEVVFLL   82 (90)
T ss_dssp             EEEEEEE-TT-EEEEEEEE
T ss_pred             EEEEEEECCCCEEEEEEEE
Confidence            5778999999999999975


No 27 
>PF01345 DUF11:  Domain of unknown function DUF11;  InterPro: IPR001434 This group of sequences is represented by a conserved region of about 53 amino acids shared between regions, usually repeated, of proteins from a small number of phylogenetically distant prokaryotes. Examples include a 132-residue region found repeated in three of the five longest proteins of Bacillus anthracis, a 131-residue repeat in a cell wall-anchored protein of Enterococcus faecalis (Streptococcus faecalis), and a 120-residue repeat in Methanobacterium thermoautotrophicum. A similar region is found in some Chlamydia trachomatis outer membrane proteins.  In C. trachomatis, three cysteine-rich proteins (also believed to be lipoproteins), MOMP, OMP6 and OMP3, make up the extracellular matrix of the outer membrane []. They are involved in the essential structural integrity of both the elementary body (EB) and recticulate body (RB) phase. They are thought to be involved in porin formation and, as these bacteria lack the peptidoglycan layer common to most Gram-negative microbes, such proteins are highly important in the pathogenicity of the organism.; GO: 0005727 extrachromosomal circular DNA
Probab=50.81  E-value=35  Score=21.08  Aligned_cols=35  Identities=20%  Similarity=0.299  Sum_probs=28.7

Q ss_pred             eEEEeecCCeEEEEEEEECCCCCcEEEEEeCcccc
Q psy4837           9 LTLHVYPGETALAFYTAKNPTNSPIVGISTYNVVP   43 (100)
Q Consensus         9 ~~v~V~pGE~~~v~y~a~N~sd~~v~GqAvpsVsP   43 (100)
                      ..-.+.||+.......++|..+.+..+..+-..-|
T Consensus        33 ~~~~~~~Gd~v~ytitvtN~G~~~a~nv~v~D~lp   67 (76)
T PF01345_consen   33 NPSTANPGDTVTYTITVTNTGPAPATNVVVTDTLP   67 (76)
T ss_pred             CCCcccCCCEEEEEEEEEECCCCeeEeEEEEEcCC
Confidence            44568999999999999999999988776655444


No 28 
>PRK13736 conjugal transfer protein TraK; Provisional
Probab=49.47  E-value=33  Score=27.05  Aligned_cols=47  Identities=19%  Similarity=0.150  Sum_probs=35.7

Q ss_pred             CeEEEEEEEECCCCCcEEEEEeCcccccchhhcccceeeeeccc--cccCCCCeeec
Q psy4837          17 ETALAFYTAKNPTNSPIVGISTYNVVPFDAGQYFNKIQCFCFEE--QQLNPHEELQI   71 (100)
Q Consensus        17 E~~~v~y~a~N~sd~~v~GqAvpsVsP~~A~~YF~KieCFCF~e--Q~L~pgE~~~M   71 (100)
                      +..-+-|+++|.+++++.-.--==.+|+--        -+=|..  +.|.|||+..+
T Consensus       185 ~l~g~~y~l~N~~~~~v~L~E~~F~~~gvr--------AVa~~~~~~~L~PG~~t~v  233 (245)
T PRK13736        185 HLKVVRYRVENPTLSARNLRESDFWQPGTR--------AVMFSQPARQLLAGGRMDV  233 (245)
T ss_pred             CcEEEEEEEEcCCCCCeEechHHhCCCCce--------EEEecCCcccCCCCCEEEE
Confidence            466788999999999988765555555433        345677  99999999887


No 29 
>PRK09926 putative chaperone protein EcpD; Provisional
Probab=49.40  E-value=45  Score=25.89  Aligned_cols=64  Identities=20%  Similarity=0.288  Sum_probs=39.3

Q ss_pred             eecCCeEEEEEEEECCCCCcEEEEEeCc-----ccccchhhcccceeeeec--cccccCCCCeeec----cCC-CCCCCC
Q psy4837          13 VYPGETALAFYTAKNPTNSPIVGISTYN-----VVPFDAGQYFNKIQCFCF--EEQQLNPHEELQI----FAP-KKPHHN   80 (100)
Q Consensus        13 V~pGE~~~v~y~a~N~sd~~v~GqAvps-----VsP~~A~~YF~KieCFCF--~eQ~L~pgE~~~M----vDP-~i~~d~   80 (100)
                      +.++....+...++|.++++...|+--.     .+|...     |.. |=-  --..|+||++..+    ..+ .+|.|+
T Consensus        35 I~~~~~~~~sv~l~N~~~~p~LvQ~Wvd~~~~~~~p~~~-----~~p-fivtPPl~rl~p~~~q~lRIi~~~~~~lP~Dr  108 (246)
T PRK09926         35 IYKSDQKDVNVRLENKGNNPLLVQSWLDTGDDNAEPGSI-----KVP-FTATPPVSRIDPKRGQTIKLMYTASTALPKDR  108 (246)
T ss_pred             EEeCCCceEEEEEEeCCCCcEEEEEEecCCCCccCcccc-----CCC-EEEcCCeEEECCCCccEEEEEeCCCCCCCCCc
Confidence            4456667888899999999999997331     111110     000 111  1246788887776    666 688887


Q ss_pred             ch
Q psy4837          81 EH   82 (100)
Q Consensus        81 ~~   82 (100)
                      |.
T Consensus       109 ES  110 (246)
T PRK09926        109 ES  110 (246)
T ss_pred             eE
Confidence            65


No 30 
>TIGR03079 CH4_NH3mon_ox_B methane monooxygenase/ammonia monooxygenase, subunit B. Both ammonia oxidizers such as Nitrosomonas europaea and methanotrophs (obligate methane oxidizers) such as Methylococcus capsulatus each can grow only on their own characteristic substrate. However, both groups have the ability to oxidize both substrates, and so the relevant enzymes must be named here according to their ability to oxidze both. The protein family represented here reflects subunit B of both the particulate methane monooxygenase of methylotrophs and the ammonia monooxygenase of nitrifying bacteria.
Probab=47.42  E-value=57  Score=28.14  Aligned_cols=55  Identities=13%  Similarity=0.153  Sum_probs=38.6

Q ss_pred             cCCeEEEEEEEECCCCCcEEEE-------------EeCcccccchhhcccceeeeec-cccccCCCCeeec
Q psy4837          15 PGETALAFYTAKNPTNSPIVGI-------------STYNVVPFDAGQYFNKIQCFCF-EEQQLNPHEELQI   71 (100)
Q Consensus        15 pGE~~~v~y~a~N~sd~~v~Gq-------------AvpsVsP~~A~~YF~KieCFCF-~eQ~L~pgE~~~M   71 (100)
                      ||-.-++...++|.+|+++.--             .+|...|..-..+.-+  ---. ..-+++|||+++.
T Consensus       280 PGR~l~~~~~VTN~g~~~vrlgEF~TA~vRFlN~~~v~~~~~~yP~~lla~--GL~v~d~~pI~PGETr~v  348 (399)
T TIGR03079       280 PGRALRVTMEITNNGDQVISIGEFTTAGIRFMNANGVRVLDPDYPRELLAE--GLEVDDQSAIAPGETVEV  348 (399)
T ss_pred             CCcEEEEEEEEEcCCCCceEEEeEeecceEeeCcccccccCCCChHHHhhc--cceeCCCCCcCCCcceEE
Confidence            8889999999999999998531             3555555544444444  3333 3447999999987


No 31 
>PF14646 MYCBPAP:  MYCBP-associated protein family
Probab=46.58  E-value=68  Score=26.62  Aligned_cols=64  Identities=19%  Similarity=0.116  Sum_probs=44.6

Q ss_pred             ceEEEeecCCeEEEEEE-EECCCCCcEEEEEeCcccccch-hhcccceeeeecccc--ccCCCCeeec
Q psy4837           8 LLTLHVYPGETALAFYT-AKNPTNSPIVGISTYNVVPFDA-GQYFNKIQCFCFEEQ--QLNPHEELQI   71 (100)
Q Consensus         8 q~~v~V~pGE~~~v~y~-a~N~sd~~v~GqAvpsVsP~~A-~~YF~KieCFCF~eQ--~L~pgE~~~M   71 (100)
                      .-....+|||...=.-. ++|.....+.=+=.-.-.+... ...-...+||=|+..  +|.|||+..+
T Consensus       238 ~l~Fe~~p~e~~~~~v~~l~N~Gt~~I~y~W~~~~~~~~~~~~~~~~~~~F~Fd~~~gvilPGe~~~~  305 (426)
T PF14646_consen  238 RLTFECHPGERVSKEVVRLENNGTTAIYYSWRRVPFFKNFGSLFRAQDQRFYFDTSSGVILPGETRNF  305 (426)
T ss_pred             EEEEEcccCceeeEEEEEEecCCceEEEEEEEecccccccchhccccCCeEEEeCCCCEECCCceEEE
Confidence            34567899999888888 9999888776543332222222 333445888888765  6899999987


No 32 
>PF02752 Arrestin_C:  Arrestin (or S-antigen), C-terminal domain;  InterPro: IPR011022 G protein-coupled receptors are a large family of signalling molecules that respond to a wide variety of extracellular stimuli. The receptors relay the information encoded by the ligand through the activation of heterotrimeric G proteins and intracellular effector molecules. To ensure the appropriate regulation of the signalling cascade, it is vital to properly inactivate the receptor. This inactivation is achieved, in part, by the binding of a soluble protein, arrestin, which uncouples the receptor from the downstream G protein after the receptors are phosphorylated by G protein-coupled receptor kinases. In addition to the inactivation of G protein-coupled receptors, arrestins have also been implicated in the endocytosis of receptors and cross talk with other signalling pathways. Arrestin (retinal S-antigen) is a major protein of the retinal rod outer segments. It interacts with photo-activated phosphorylated rhodopsin, inhibiting or 'arresting' its ability to interact with transducin []. The protein binds calcium, and shows similarity in its C terminus to alpha-transducin and other purine nucleotide-binding proteins. In mammals, arrestin is associated with autoimmune uveitis. Arrestins comprise a family of closely-related proteins that includes beta-arrestin-1 and -2, which regulate the function of beta-adrenergic receptors by binding to their phosphorylated forms, impairing their capacity to activate G(S) proteins; Cone photoreceptors C-arrestin (arrestin-X) [], which could bind to phosphorylated red/green opsins; and Drosophila phosrestins I and II, which undergo light-induced phosphorylation, and probably play a role in photoreceptor transduction [, , ].  The crystal structure of bovine retinal arrestin comprises two domains of antiparallel beta-sheets connected through a hinge region and one short alpha-helix on the back of the amino-terminal fold []. The binding region for phosphorylated light-activated rhodopsin is located at the N-terminal domain, as indicated by the docking of the photoreceptor to the three-dimensional structure of arrestin.  The C-terminal domain consists of an immunoglobulin-like beta-sandwich structure. This entry represents proteins with immunoglobulin-like domains that are similar to those found in arrestin.; PDB: 1SUJ_A 3UGX_A 1CF1_B 1AYR_A 3UGU_A 3P2D_B 1ZSH_A 2WTR_B 3GC3_A 1G4R_A ....
Probab=44.92  E-value=52  Score=21.24  Aligned_cols=29  Identities=24%  Similarity=0.193  Sum_probs=20.6

Q ss_pred             eEEEeecCCeEEEEEEEECCCCCcEEEEE
Q psy4837           9 LTLHVYPGETALAFYTAKNPTNSPIVGIS   37 (100)
Q Consensus         9 ~~v~V~pGE~~~v~y~a~N~sd~~v~GqA   37 (100)
                      ..-...|||...+...+.|.+++.+.+.-
T Consensus        12 ~~~~~~~Ge~i~v~v~i~n~s~~~i~~I~   40 (136)
T PF02752_consen   12 PRTAYVPGETIPVNVEIDNQSKKKIKKIK   40 (136)
T ss_dssp             S-SEEETT--EEEEEEEEE-SSSEEEEEE
T ss_pred             CCCEECCCCEEEEEEEEEECCCCEEEEEE
Confidence            34456799999999999999999887743


No 33 
>PF04744 Monooxygenase_B:  Monooxygenase subunit B protein;  InterPro: IPR006833 Ammonia monooxygenase and the particulate methane monooxygenase are both integral membrane proteins, occurring in ammonia oxidisers and methanotrophs respectively, which are thought to be evolutionarily related []. These enzymes have a relatively wide substrate specificity and can catalyse the oxidation of a range of substrates including ammonia, methane, halogenated hydrocarbons and aromatic molecules []. These enzymes are composed of 3 subunits - A (IPR003393 from INTERPRO), B (IPR006833 from INTERPRO) and C (IPR006980 from INTERPRO) - and contain various metal centres, including copper. Particulate methane monooxygenase from Methylococcus capsulatus str. Bath is an ABC homotrimer, which contains mononuclear and dinuclear copper metal centres, and a third metal centre containing a metal ion whose identity in vivo is not certain[]. The soluble regions of these enzymes derive primarily from the B subunit. This subunit forms two antiparallel beta-barrel-like structures and contains the mono- and di- nuclear copper metal centres [].; PDB: 3CHX_E 3RFR_A 3RGB_A 1YEW_A.
Probab=44.35  E-value=47  Score=28.45  Aligned_cols=57  Identities=19%  Similarity=0.198  Sum_probs=30.3

Q ss_pred             ecCCeEEEEEEEECCCCCcEEEEE---------eCcccccchh---hcccceeeeec-cccccCCCCeeec
Q psy4837          14 YPGETALAFYTAKNPTNSPIVGIS---------TYNVVPFDAG---QYFNKIQCFCF-EEQQLNPHEELQI   71 (100)
Q Consensus        14 ~pGE~~~v~y~a~N~sd~~v~GqA---------vpsVsP~~A~---~YF~KieCFCF-~eQ~L~pgE~~~M   71 (100)
                      -||-.-++...++|.+++++.-.-         -+.|. ....   .++-..+---. ...+++|||++++
T Consensus       260 vpgR~l~~~l~VtN~g~~pv~LgeF~tA~vrFln~~v~-~~~~~~P~~l~A~~gL~vs~~~pI~PGETrtl  329 (381)
T PF04744_consen  260 VPGRTLTMTLTVTNNGDSPVRLGEFNTANVRFLNPDVP-TDDPDYPDELLAERGLSVSDNSPIAPGETRTL  329 (381)
T ss_dssp             SSSSEEEEEEEEEEESSS-BEEEEEESSS-EEE-TTT--SS-S---TTTEETT-EEES--S-B-TT-EEEE
T ss_pred             cCCcEEEEEEEEEcCCCCceEeeeEEeccEEEeCcccc-cCCCCCchhhhccCcceeCCCCCcCCCceEEE
Confidence            489999999999999999987543         11111 1111   12222222233 3459999999998


No 34 
>PRK15249 fimbrial chaperone protein StbB; Provisional
Probab=43.36  E-value=66  Score=25.19  Aligned_cols=65  Identities=18%  Similarity=0.232  Sum_probs=38.7

Q ss_pred             ecCCeEEEEEEEECCCCCcEEEEEe-----Ccccccchhhcccceeeeecc--ccccCCCCeeec----cCC-CCCCCCc
Q psy4837          14 YPGETALAFYTAKNPTNSPIVGIST-----YNVVPFDAGQYFNKIQCFCFE--EQQLNPHEELQI----FAP-KKPHHNE   81 (100)
Q Consensus        14 ~pGE~~~v~y~a~N~sd~~v~GqAv-----psVsP~~A~~YF~KieCFCF~--eQ~L~pgE~~~M----vDP-~i~~d~~   81 (100)
                      .+|....+...++|.++++...|+-     -+.+|....    +. =|=-+  -.+|+||++..+    .+. .+|+|.|
T Consensus        39 y~~~~~~~sl~l~N~~~~p~LvQsWv~~~~~~~~p~~~~----~~-pFivtPPlfrl~p~~~q~lRI~~~~~~~lP~DRE  113 (253)
T PRK15249         39 YPSTASSVDVQLKNNDAIPYIVQTWFDDGDMNTSPENSS----AM-PFIATPPVFRIQPKAGQVVRVIYNNTKKLPQDRE  113 (253)
T ss_pred             EeCCCcceeEEEEcCCCCcEEEEEEEeCCCCCCCccccc----cC-cEEEcCCeEEecCCCceEEEEEEcCCCCCCCCce
Confidence            3555678888999999999999974     122221110    00 01111  135778887666    553 7888876


Q ss_pred             hh
Q psy4837          82 HK   83 (100)
Q Consensus        82 ~~   83 (100)
                      .-
T Consensus       114 Sl  115 (253)
T PRK15249        114 SV  115 (253)
T ss_pred             EE
Confidence            53


No 35 
>smart00588 NEUZ domain in neuralized proteins.
Probab=41.80  E-value=25  Score=24.76  Aligned_cols=17  Identities=18%  Similarity=0.423  Sum_probs=15.6

Q ss_pred             eeeccccccCCCCeeec
Q psy4837          55 CFCFEEQQLNPHEELQI   71 (100)
Q Consensus        55 CFCF~eQ~L~pgE~~~M   71 (100)
                      +.||+..+|++||.+.+
T Consensus        32 givFS~rPl~~~E~~~v   48 (123)
T smart00588       32 ALVFSARPLRINELFEV   48 (123)
T ss_pred             eEEecCCCCcCCCEEEE
Confidence            57999999999999887


No 36 
>PF09624 DUF2393:  Protein of unknown function (DUF2393);  InterPro: IPR013417  The function of this protein is unknown. It is always found as part of a two-gene operon with IPR013416 from INTERPRO, a protein that appears to span the membrane seven times. It has so far been found in the bacteria Anabaena sp. (strain PCC 7120), Agrobacterium tumefaciens, Rhizobium meliloti, and Gloeobacter violaceus.
Probab=39.23  E-value=1.3e+02  Score=20.97  Aligned_cols=59  Identities=20%  Similarity=0.115  Sum_probs=44.6

Q ss_pred             eecCCeEEEEEEEECCCCCcEEEEE----eCc---ccccchhhcccceeeeec----cccccCCCCeeec
Q psy4837          13 VYPGETALAFYTAKNPTNSPIVGIS----TYN---VVPFDAGQYFNKIQCFCF----EEQQLNPHEELQI   71 (100)
Q Consensus        13 V~pGE~~~v~y~a~N~sd~~v~GqA----vps---VsP~~A~~YF~KieCFCF----~eQ~L~pgE~~~M   71 (100)
                      +.-++..-+.+.++|.+++++.+--    +++   +....-..|+.|..=|=-    -...|.|||+++.
T Consensus        58 l~~~~~~~v~g~V~N~g~~~i~~c~i~~~l~~~~~~~~n~~~~~~~~~~~f~~~~~~i~~~L~~~e~~~f  127 (149)
T PF09624_consen   58 LQYSESFYVDGTVTNTGKFTIKKCKITVKLYNDKQVSGNKFKEIFYQQIPFVKKSIPIADNLKPGESKEF  127 (149)
T ss_pred             eeeccEEEEEEEEEECCCCEeeEEEEEEEEEeCCCccCchhhhhhccccchhccceeHHhhcCcccceeE
Confidence            4578999999999999999987643    233   666777888888876411    1334999999999


No 37 
>PF14310 Fn3-like:  Fibronectin type III-like domain; PDB: 3ABZ_D 3AC0_D 2X40_A 2X41_A 2X42_A.
Probab=38.98  E-value=52  Score=20.38  Aligned_cols=19  Identities=21%  Similarity=0.317  Sum_probs=14.0

Q ss_pred             eEEEeecCCeEEEEEEEEC
Q psy4837           9 LTLHVYPGETALAFYTAKN   27 (100)
Q Consensus         9 ~~v~V~pGE~~~v~y~a~N   27 (100)
                      ..+.+.|||..++.|.+..
T Consensus        25 ~rv~l~pGes~~v~~~l~~   43 (71)
T PF14310_consen   25 ERVSLAPGESKTVSFTLPP   43 (71)
T ss_dssp             EEEEE-TT-EEEEEEEEEH
T ss_pred             EEEEECCCCEEEEEEEECH
Confidence            4677999999999998864


No 38 
>TIGR02656 cyanin_plasto plastocyanin. Members of this family are plastocyanin, a blue copper protein related to pseudoazurin, halocyanin, amicyanin, etc. This protein, located in the thylakoid luman, performs electron transport to photosystem I in Cyanobacteria and chloroplasts.
Probab=38.95  E-value=1.1e+02  Score=20.03  Aligned_cols=56  Identities=14%  Similarity=0.256  Sum_probs=27.7

Q ss_pred             eEEEeecCCeEEEEEEEECCCCCcEEEEEeCcccccchhhcccceeeeeccccccCCCCeeec
Q psy4837           9 LTLHVYPGETALAFYTAKNPTNSPIVGISTYNVVPFDAGQYFNKIQCFCFEEQQLNPHEELQI   71 (100)
Q Consensus         9 ~~v~V~pGE~~~v~y~a~N~sd~~v~GqAvpsVsP~~A~~YF~KieCFCF~eQ~L~pgE~~~M   71 (100)
                      ..++|++|+++  .|.  |....+-.-.+.-.=.|..+......+.   -+...|+|||+.++
T Consensus        17 ~~i~v~~G~~V--~~~--N~~~~~H~~~~~~~~~~~~~~~~~~~~~---~~~~~~~pG~t~~~   72 (99)
T TIGR02656        17 AKISIAAGDTV--EWV--NNKGGPHNVVFDEDAVPAGVKELAKSLS---HKDLLNSPGESYEV   72 (99)
T ss_pred             CEEEECCCCEE--EEE--ECCCCCceEEECCCCCccchhhhccccc---ccccccCCCCEEEE
Confidence            57999999975  343  6543332222211112322211111110   13457899999988


No 39 
>PRK02710 plastocyanin; Provisional
Probab=37.46  E-value=1e+02  Score=21.04  Aligned_cols=46  Identities=22%  Similarity=0.239  Sum_probs=26.8

Q ss_pred             eEEEeecCCeEEEEEEEECCCCCcEEEEEeCcccccchhhcccceeeeeccccccCCCCeeec
Q psy4837           9 LTLHVYPGETALAFYTAKNPTNSPIVGISTYNVVPFDAGQYFNKIQCFCFEEQQLNPHEELQI   71 (100)
Q Consensus         9 ~~v~V~pGE~~~v~y~a~N~sd~~v~GqAvpsVsP~~A~~YF~KieCFCF~eQ~L~pgE~~~M   71 (100)
                      ..++|++|+++  .|.  |..+.+-.      +.....       +=+.++...++|||+.+.
T Consensus        47 ~~i~v~~Gd~V--~~~--N~~~~~H~------v~~~~~-------~~~~~~~~~~~pg~t~~~   92 (119)
T PRK02710         47 STLTIKAGDTV--KWV--NNKLAPHN------AVFDGA-------KELSHKDLAFAPGESWEE   92 (119)
T ss_pred             CEEEEcCCCEE--EEE--ECCCCCce------EEecCC-------ccccccccccCCCCEEEE
Confidence            67999999974  443  55433222      111111       112345677899999887


No 40 
>PRK15274 putative periplasmic fimbrial chaperone protein SteC; Provisional
Probab=32.84  E-value=1.2e+02  Score=24.19  Aligned_cols=65  Identities=18%  Similarity=0.225  Sum_probs=40.8

Q ss_pred             EeecCCeEEEEEEEECCCCC-cEEEEEeCcccccchhhcccceee-eecc--ccccCCCCeeec----cC--CCCCCCCc
Q psy4837          12 HVYPGETALAFYTAKNPTNS-PIVGISTYNVVPFDAGQYFNKIQC-FCFE--EQQLNPHEELQI----FA--PKKPHHNE   81 (100)
Q Consensus        12 ~V~pGE~~~v~y~a~N~sd~-~v~GqAvpsVsP~~A~~YF~KieC-FCF~--eQ~L~pgE~~~M----vD--P~i~~d~~   81 (100)
                      -|.+|....+.+.+.|.++. +...|+--.  -+...    +... |=-+  -..|+||+.-.+    .+  ..||+|+|
T Consensus        35 vIy~e~~~~~sv~v~N~~~~~p~LVQsWvd--d~~~~----~~~~pFivtPPLfRlep~~~q~lRI~~~~~~~~LP~DRE  108 (257)
T PRK15274         35 VIFNGNENSITVTLKNGNATLPYLAQAWLE--DDKFA----KDTRYFTALPPLQRIEPKSDGQVKVQPLPAAASLPQDRE  108 (257)
T ss_pred             EEEeCCCceEEEEEEeCCCCCcEEEEEEcc--CCCCC----cccCCEEEcCCeEEECCCCceEEEEEECCCCCCCCCcee
Confidence            45667778899999999866 899998432  21100    1101 1111  236789987777    53  67999976


Q ss_pred             h
Q psy4837          82 H   82 (100)
Q Consensus        82 ~   82 (100)
                      .
T Consensus       109 S  109 (257)
T PRK15274        109 S  109 (257)
T ss_pred             E
Confidence            4


No 41 
>TIGR01451 B_ant_repeat conserved repeat domain. This model represents the conserved region of about 53 amino acids shared between regions, usually repeated, of proteins from a small number of phylogenetically distant prokaryotes. Examples include a 132-residue region found repeated in three of the five longest proteins of Bacillus anthracis, a 131-residue repeat in a cell wall-anchored protein of Enterococcus faecalis, and a 120-residue repeat in Methanobacterium thermoautotrophicum. A similar region is found in some Chlamydial outer membrane proteins.
Probab=32.76  E-value=1.1e+02  Score=18.36  Aligned_cols=35  Identities=20%  Similarity=0.227  Sum_probs=29.8

Q ss_pred             EEEeecCCeEEEEEEEECCCCCcEEEEEeCccccc
Q psy4837          10 TLHVYPGETALAFYTAKNPTNSPIVGISTYNVVPF   44 (100)
Q Consensus        10 ~v~V~pGE~~~v~y~a~N~sd~~v~GqAvpsVsP~   44 (100)
                      .-.+.||+..+-.-.++|..+.+..+..+-..-|.
T Consensus         5 ~~~~~~Gd~v~Yti~v~N~g~~~a~~v~v~D~lP~   39 (53)
T TIGR01451         5 KTVATIGDTITYTITVTNNGNVPATNVVVTDILPS   39 (53)
T ss_pred             ccccCCCCEEEEEEEEEECCCCceEeEEEEEcCCC
Confidence            44678999999999999999999988888777663


No 42 
>PF08308 PEGA:  PEGA domain;  InterPro: IPR013229 This domain is found in both archaea and bacteria and has similarity to S-layer (surface layer) proteins. It is named after the characteristic PEGA sequence motif found in this domain. The secondary structure of this domain is predicted to be beta-strands.
Probab=32.41  E-value=90  Score=18.88  Aligned_cols=22  Identities=36%  Similarity=0.413  Sum_probs=17.8

Q ss_pred             ceeceEEEeecCCeEEEEEEEE
Q psy4837           5 HRYLLTLHVYPGETALAFYTAK   26 (100)
Q Consensus         5 ~p~q~~v~V~pGE~~~v~y~a~   26 (100)
                      .+....+.|.+|+...+.+.++
T Consensus        47 ~~~~~~v~v~~~~~~~v~~~L~   68 (71)
T PF08308_consen   47 EPYTKTVTVKPGETTTVNVTLE   68 (71)
T ss_pred             eeEEEEEEECCCCEEEEEEEEE
Confidence            3566789999999999988765


No 43 
>PF00362 Integrin_beta:  Integrin, beta chain;  InterPro: IPR002369 Integrins are the major metazoan receptors for cell adhesion to extracellular matrix proteins and, in vertebrates, also play important roles in certain cell-cell adhesions, make transmembrane connections to the cytoskeleton and activate many intracellular signalling pathways [, ]. The integrin receptors are composed of alpha and beta subunit heterodimers. Each subunit crosses the membrane once, with most of the polypeptide residing in the extracellular space, and has two short cytoplasmic domains. Some members of this family have EGF repeats at the C terminus and also have a vWA domain inserted within the integrin domain at the N terminus.  Most integrins recognise relatively short peptide motifs, and in general require an acidic amino acid to be present. Ligand specificity depends upon both the alpha and beta subunits []. There are at least 18 types of alpha and 8 types of beta subunits recognised in humans []. Each alpha subunit tends to associate only with one type of beta subunit, but there are exceptions to this rule []. Each association of alpha and beta subunits has its own binding specificity and signalling properties. Many integrins require activation on the cell surface before they can bind ligands. Integrins frequently intercommunicate, and binding at one integrin receptor activate or inhibit another.  The structure of unliganded alphaV beta3 showed the molecule to be folded, with the head bent over towards the C termini of the legs which would normally be inserted into the membrane []. The head comprises a beta propeller domain at the end terminus of the alphaV subunit and an I/A domain inserted into a loop on the top of the hybrid domain in the beta subunit. The I/A domain consists of a Rossman fold with a core of beta parallel sheets surrounded by amphipathic alpha helices.  Integrins are important therapeutic targets in conditions such as atherosclerosis, thrombosis, cancer and asthma []. At the N terminus of the beta subunit is a cysteine-containing domain reminiscent of that found in presenillins and semaphorins, which has hence been termed the PSI domain. C-terminal to the PSI domain is an A-domain, which has been predicted to adopt a Rossmann fold similar to that of the alpha subunit, but with additional loops between the second and third beta strands []. The murine gene Pactolus shares significant similarity with the beta subunit [], but lacks either one or both of the inserted loops. The C-terminal portion of the beta subunit extracellular domain contains an internally disulphide-bonded cysteine-rich region, while the intracellular tail contains putative sites of interaction with a variety of intracellular signalling and cytoskeletal proteins, such as focal adhesion kinase and alpha-actinin respectively []. Integrin cytoplasmic domains are normally less than 50 amino acids in length, with the beta-subunit sequences exhibiting greater homology to each other than the alpha-subunit sequences. This is consistent with current evidence that the beta subunit is the principal site for binding of cytoskeletal and signalling molecules, whereas the alpha subunit has a regulatory role. The first 20 amino acids of the beta-subunit cytoplasmic domain are also alpha helical, but the final 25 residues are disordered and, apart from a turn that follows a conserved NPxY motif, appear to lack defined structure, suggesting that this is adopted on effector binding. The two membrane-proximal helices mediate the link between the subunits via a series of hydrophobic and electrostatic contacts. This entry represents the N-terminal portion of the extracellular region of integrin beta subunits.; GO: 0005488 binding, 0007155 cell adhesion, 0007160 cell-matrix adhesion; PDB: 3VI4_B 3VI3_B 2VDQ_B 3IJE_B 1M1X_B 2VDR_B 3NIF_B 3NID_D 1TYE_F 2Q6W_F ....
Probab=32.00  E-value=77  Score=26.91  Aligned_cols=37  Identities=16%  Similarity=0.177  Sum_probs=29.4

Q ss_pred             eeeceeceEEEeecCCeEEEEEEEECCCCCcEEEEEe
Q psy4837           2 ITTHRYLLTLHVYPGETALAFYTAKNPTNSPIVGIST   38 (100)
Q Consensus         2 ~~~~p~q~~v~V~pGE~~~v~y~a~N~sd~~v~GqAv   38 (100)
                      +-+.|..-+++++||+...+.+.++=..+.|+--.-.
T Consensus        72 vQl~PQ~v~l~LRpG~~~~f~v~~~~a~~yPvDLYyL  108 (426)
T PF00362_consen   72 VQLSPQKVSLKLRPGEPVTFNVTVRPAEDYPVDLYYL  108 (426)
T ss_dssp             BSEESSEEEEEEETTBEEEEEEEEEBSSS--EEEEEE
T ss_pred             eeeccceeEEEeecceeEEEEEEEeeccccceeEEEE
Confidence            3478999999999999999999999888888765433


No 44 
>PF00127 Copper-bind:  Copper binding proteins, plastocyanin/azurin family;  InterPro: IPR000923 Blue (type 1) copper proteins are small proteins which bind a single copper atom and which are characterised by an intense electronic absorption band near 600 nm [, ]. The most well known members of this class of proteins are the plant chloroplastic plastocyanins, which exchange electrons with cytochrome c6, and the distantly related bacterial azurins, which exchange electrons with cytochrome c551. This family of proteins also includes amicyanin from bacteria such as Methylobacterium extorquens or Paracoccus versutus (Thiobacillus versutus) that can grow on methylamine; auracyanins A and B from Chloroflexus aurantiacus []; blue copper protein from Alcaligenes faecalis; cupredoxin (CPC) from Cucumis sativus (Cucumber) peelings []; cusacyanin (basic blue protein; plantacyanin, CBP) from cucumber; halocyanin from Natronomonas pharaonis (Natronobacterium pharaonis) [], a membrane associated copper-binding protein; pseudoazurin from Pseudomonas; rusticyanin from Thiobacillus ferrooxidans []; stellacyanin from Rhus vernicifera (Japanese lacquer tree); umecyanin from the roots of Armoracia rusticana (Horseradish); and allergen Ra3 from ragweed. This pollen protein is evolutionary related to the above proteins, but seems to have lost the ability to bind copper. Although there is an appreciable amount of divergence in the sequences of all these proteins, the copper ligand sites are conserved.; GO: 0005507 copper ion binding, 0009055 electron carrier activity; PDB: 1UAT_A 1CUO_A 1PLC_A 4PCY_A 3PCY_A 1PND_A 1PNC_A 1JXG_A 6PCY_A 1TKW_A ....
Probab=30.31  E-value=1.4e+02  Score=19.39  Aligned_cols=56  Identities=16%  Similarity=0.252  Sum_probs=32.1

Q ss_pred             ceEEEeecCCeEEEEEEEECCCCCcEEEEEeCcccc-cchhhcccceeeeeccccccCCCCeeec
Q psy4837           8 LLTLHVYPGETALAFYTAKNPTNSPIVGISTYNVVP-FDAGQYFNKIQCFCFEEQQLNPHEELQI   71 (100)
Q Consensus         8 q~~v~V~pGE~~~v~y~a~N~sd~~v~GqAvpsVsP-~~A~~YF~KieCFCF~eQ~L~pgE~~~M   71 (100)
                      ...++|++|++  |.|...+...+.++.-+  .-.| +....+...-.    ....|+||++...
T Consensus        16 P~~i~V~~G~t--V~~~n~~~~~Hnv~~~~--~~~~~~~~~~~~~~~~----~~~~~~~G~~~~~   72 (99)
T PF00127_consen   16 PSEITVKAGDT--VTFVNNDSMPHNVVFVA--DGMPAGADSDYVPPGD----SSPLLAPGETYSV   72 (99)
T ss_dssp             SSEEEEETTEE--EEEEEESSSSBEEEEET--TSSHTTGGHCHHSTTC----EEEEBSTTEEEEE
T ss_pred             CCEEEECCCCE--EEEEECCCCCceEEEec--ccccccccccccCccc----cceecCCCCEEEE
Confidence            36799999996  45666666666666554  1111 12222222222    4556788887665


No 45 
>TIGR03096 nitroso_cyanin nitrosocyanin. Nitrosocyanin, as described from the obligate chemolithoautotroph Nitrosomonas europaea, is a red copper protein of unknown function with sequence similarity to a number of blue copper redox proteins.
Probab=29.81  E-value=80  Score=23.14  Aligned_cols=44  Identities=20%  Similarity=0.111  Sum_probs=28.4

Q ss_pred             eEEEeecCCeEEEEEEEECCCCCcEEEEEeCcccccchhhcccceeeeeccccccCCCCeeec
Q psy4837           9 LTLHVYPGETALAFYTAKNPTNSPIVGISTYNVVPFDAGQYFNKIQCFCFEEQQLNPHEELQI   71 (100)
Q Consensus         9 ~~v~V~pGE~~~v~y~a~N~sd~~v~GqAvpsVsP~~A~~YF~KieCFCF~eQ~L~pgE~~~M   71 (100)
                      ..++|+.|+.+++.+.  |.++- ..+-.+..        |        ..+..|.|||+...
T Consensus        61 ~~I~VkaGD~Vtl~vt--N~d~~-~H~f~i~~--------~--------gis~~I~pGet~Ti  104 (135)
T TIGR03096        61 EALVVKKGTPVKVTVE--NKSPI-SEGFSIDA--------Y--------GISEVIKAGETKTI  104 (135)
T ss_pred             CEEEECCCCEEEEEEE--eCCCC-ccceEECC--------C--------CcceEECCCCeEEE
Confidence            5688999998888775  87774 22222221        1        12677888988754


No 46 
>KOG3960|consensus
Probab=29.57  E-value=12  Score=30.77  Aligned_cols=9  Identities=78%  Similarity=0.999  Sum_probs=6.2

Q ss_pred             cccccceeE
Q psy4837          86 SQILPKVEI   94 (100)
Q Consensus        86 ~~~~~~~~~   94 (100)
                      +|-||||||
T Consensus       149 NQRLPKVEI  157 (284)
T KOG3960|consen  149 NQRLPKVEI  157 (284)
T ss_pred             cccccHHHH
Confidence            466777776


No 47 
>PF02018 CBM_4_9:  Carbohydrate binding domain;  InterPro: IPR003305 The 1,4-beta-glucanase CenC from Cellulomonas fimi contains two cellulose-binding domains, CBD(N1) and CBD(N2), arranged in tandem at its N terminus. These homologous CBDs are distinct in their selectivity for binding amorphous and not crystalline cellulose []. Multidimensional heteronuclear nuclear magnetic resonance (NMR) spectroscopy was used to determine the tertiary structure of the 152 amino acid N-terminal cellulose-binding domain from C. fimi 1,4-beta-glucanase CenC (CBDN1) []. The tertiary structure of CBDN1 is strikingly similar to that of the bacterial 1,3-1,4-beta-glucanases, as well as other sugar-binding proteins with jelly-roll folds.; GO: 0016798 hydrolase activity, acting on glycosyl bonds; PDB: 3OEA_B 2ZEX_B 3OEB_A 2ZEY_A 2ZEW_A 1GUI_A 2W5F_A 2WZE_A 2WYS_A 2ZEZ_B ....
Probab=29.53  E-value=1.4e+02  Score=19.02  Aligned_cols=25  Identities=20%  Similarity=0.326  Sum_probs=20.1

Q ss_pred             eEEEeecCCeEEEEEEEECCCCCcE
Q psy4837           9 LTLHVYPGETALAFYTAKNPTNSPI   33 (100)
Q Consensus         9 ~~v~V~pGE~~~v~y~a~N~sd~~v   33 (100)
                      ..+.|.+|+.+++.|.++-.....+
T Consensus        54 ~~~~l~~G~~Y~~s~~vk~~~~~~~   78 (131)
T PF02018_consen   54 QTISLKPGKTYTVSFWVKADSGGTV   78 (131)
T ss_dssp             EEEEE-TTSEEEEEEEEEESSSEEE
T ss_pred             cceEecCCCEEEEEEEEEeCCCCEE
Confidence            3489999999999999999888333


No 48 
>PRK11055 galM galactose-1-epimerase; Provisional
Probab=29.45  E-value=1.1e+02  Score=24.84  Aligned_cols=44  Identities=25%  Similarity=0.416  Sum_probs=31.8

Q ss_pred             ceEEEeecCCeEEEEEEEECCCCCcEEEEEeCcccccchhhcccceeeeeccccc
Q psy4837           8 LLTLHVYPGETALAFYTAKNPTNSPIVGISTYNVVPFDAGQYFNKIQCFCFEEQQ   62 (100)
Q Consensus         8 q~~v~V~pGE~~~v~y~a~N~sd~~v~GqAvpsVsP~~A~~YF~KieCFCF~eQ~   62 (100)
                      .++|.|..-+...+.|.+....+.           ++..|.-+.|-.+||+|-|.
T Consensus       265 g~~l~v~t~~p~lqvYT~n~~~~~-----------~~~~g~~~~~~~gialE~q~  308 (342)
T PRK11055        265 KLQMKVYTTAPALQFYSGNFLAGT-----------PSRGGGPYADYAGLALESQF  308 (342)
T ss_pred             CeEEEEEcCCCEEEEecCCccCCc-----------cCCCCcEeCCCceEEEEccc
Confidence            467888888888888887544321           24456777889999999664


No 49 
>cd06184 flavohem_like_fad_nad_binding FAD_NAD(P)H binding domain of flavohemoglobin. Flavohemoglobins have a globin domain containing a B-type heme fused with a ferredoxin reductase-like FAD/NAD-binding domain. Flavohemoglobins detoxify nitric oxide (NO) via an NO dioxygenase reaction. The hemoglobin domain adopts a globin fold with an embedded heme molecule. Flavohemoglobins also have a C-terminal reductase domain with bindiing sites for FAD and NAD(P)H. This domain catalyzes the conversion of NO + O2 + NAD(P)H to NO3- + NAD(P)+.  Instead of the oxygen transport function of hemoglobins, flavohemoglobins seem to act in NO dioxygenation and NO signalling.
Probab=29.38  E-value=1.5e+02  Score=21.80  Aligned_cols=24  Identities=8%  Similarity=-0.168  Sum_probs=15.0

Q ss_pred             eeeceeceE--EEeecCCeEEEEEEE
Q psy4837           2 ITTHRYLLT--LHVYPGETALAFYTA   25 (100)
Q Consensus         2 ~~~~p~q~~--v~V~pGE~~~v~y~a   25 (100)
                      +++++....  ...+|||...+.+..
T Consensus        24 l~l~~~~~~~~~~~~pGQ~v~l~~~~   49 (247)
T cd06184          24 FYLEPADGGPLPPFLPGQYLSVRVKL   49 (247)
T ss_pred             EEEEeCCCCcCCCCCCCCEEEEEEec
Confidence            455554432  578888887777543


No 50 
>PF14415 DUF4424:  Domain of unknown function (DUF4424)
Probab=29.00  E-value=87  Score=24.87  Aligned_cols=23  Identities=22%  Similarity=0.364  Sum_probs=18.3

Q ss_pred             CeEEEEEEEECCCCCcEEEEEeC
Q psy4837          17 ETALAFYTAKNPTNSPIVGISTY   39 (100)
Q Consensus        17 E~~~v~y~a~N~sd~~v~GqAvp   39 (100)
                      +.++|.|+.+|.++++++..-.|
T Consensus         1 ~~I~V~Y~F~N~t~~dv~~~VaF   23 (253)
T PF14415_consen    1 ERIRVRYVFRNPTDQDVTVTVAF   23 (253)
T ss_pred             CEEEEEEEEeCCCCCcEEEEEEE
Confidence            45789999999999998555444


No 51 
>PRK15195 fimbrial chaperone protein FimC; Provisional
Probab=28.70  E-value=1.6e+02  Score=22.66  Aligned_cols=63  Identities=16%  Similarity=0.151  Sum_probs=35.9

Q ss_pred             ecCCeEEEEEEEECCCCC-cEEEEEeCcccccchhhcccceeeeecc--ccccCCCCeeec----cCCCCCCCCch
Q psy4837          14 YPGETALAFYTAKNPTNS-PIVGISTYNVVPFDAGQYFNKIQCFCFE--EQQLNPHEELQI----FAPKKPHHNEH   82 (100)
Q Consensus        14 ~pGE~~~v~y~a~N~sd~-~v~GqAvpsVsP~~A~~YF~KieCFCF~--eQ~L~pgE~~~M----vDP~i~~d~~~   82 (100)
                      .+++...+.+.++|.++. +...|+--.  -+...    +..=|=-+  =..|+||++..+    .++.||+|+|.
T Consensus        36 y~~~~~~~si~l~N~~~~~~~LvQsWv~--~~~~~----~~~pfivtPPlfrl~p~~~q~lRIi~~~~~LP~DrES  105 (229)
T PRK15195         36 YPADAKQTSLAIRNSHTNERYLVNSWIE--NSSGV----KEKSFIVTPPLFVSEPKSENTLRIIYAGPPLAADRES  105 (229)
T ss_pred             EeCCCceEEEEEEeCCCCccEEEEEEec--CCCCC----ccCCEEEcCCeEEECCCCceEEEEEECCCCCCCCeeE
Confidence            345555588999999865 677776331  11100    00112111  135788887777    77778888665


No 52 
>smart00391 MBD Methyl-CpG binding domain. Methyl-CpG binding domain, also known as the TAM (TTF-IIP5, ARBP, MeCP1) domain
Probab=28.14  E-value=25  Score=23.02  Aligned_cols=16  Identities=19%  Similarity=0.343  Sum_probs=11.6

Q ss_pred             cccceeeeeccccccC
Q psy4837          49 YFNKIQCFCFEEQQLN   64 (100)
Q Consensus        49 YF~KieCFCF~eQ~L~   64 (100)
                      ++-.++||||+-+.+.
T Consensus        56 ~~~~~~~F~F~~~~~~   71 (77)
T smart00391       56 LSLDLECFDFNATVPV   71 (77)
T ss_pred             cccccccccCcCCccc
Confidence            4455688999987654


No 53 
>PF11325 DUF3127:  Domain of unknown function (DUF3127);  InterPro: IPR021474  This bacterial family of proteins has no known function. 
Probab=27.40  E-value=61  Score=21.99  Aligned_cols=30  Identities=17%  Similarity=0.406  Sum_probs=21.9

Q ss_pred             EeecCCeEEEEEEEECCCCCcEEEEEeCcccccchhhcccceeee
Q psy4837          12 HVYPGETALAFYTAKNPTNSPIVGISTYNVVPFDAGQYFNKIQCF   56 (100)
Q Consensus        12 ~V~pGE~~~v~y~a~N~sd~~v~GqAvpsVsP~~A~~YF~KieCF   56 (100)
                      .+++|+...|+|.++=+.   -.            ++||+-+.+.
T Consensus        52 ~~~~Gd~V~Vsf~i~~RE---~~------------gr~fn~i~aW   81 (84)
T PF11325_consen   52 NFQVGDEVKVSFNIEGRE---WN------------GRWFNSIRAW   81 (84)
T ss_pred             cCCCCCEEEEEEEeeccE---ec------------ceEeeEeEEE
Confidence            578999999999776432   22            7888877653


No 54 
>PRK15188 fimbrial chaperone protein BcfB; Provisional
Probab=26.12  E-value=2e+02  Score=22.41  Aligned_cols=65  Identities=17%  Similarity=0.163  Sum_probs=38.9

Q ss_pred             EeecCCeEEEEEEEECCCCC-cEEEEEeCcccccchhhcccceeeeecc--ccccCCCCeeec----cCCCCCCCCch
Q psy4837          12 HVYPGETALAFYTAKNPTNS-PIVGISTYNVVPFDAGQYFNKIQCFCFE--EQQLNPHEELQI----FAPKKPHHNEH   82 (100)
Q Consensus        12 ~V~pGE~~~v~y~a~N~sd~-~v~GqAvpsVsP~~A~~YF~KieCFCF~--eQ~L~pgE~~~M----vDP~i~~d~~~   82 (100)
                      -+.+|....+.+.++|.+++ +...|+--. ......    + .=|=-+  -..|+||++..+    ....+|+|+|.
T Consensus        36 vIy~~~~~~~sv~i~N~~~~~p~LvQsWv~-~~~~~~----~-~pFivtPPlfrl~~~~~~~lRI~~~~~~lP~DRES  107 (228)
T PRK15188         36 VIYPQGSKQTSLPIINSSASNVFLIQSWVA-NADGSR----S-TDFIITPPLFVIQPKKENILRIMYVGPSLPTDRES  107 (228)
T ss_pred             EEEcCCCceEEEEEEeCCCCccEEEEEEEe-cCCCCc----c-CCEEEcCCeEEECCCCceEEEEEECCCCCCCCceE
Confidence            34566777899999999855 677777442 111100    0 001111  235788887777    67778888765


No 55 
>PF06650 DUF1162:  Protein of unknown function (DUF1162);  InterPro: IPR009543 Proteins in this entry may play a role in the control of protein cycling through the trans-Golgi network. Vacuolar sorting protein is an ATPase required for endosomal trafficking []. Defects in the human protein VPS13A cause chorea-acanthocytosis, an autosomal recessive neurodegenerative disorder characterised by the gradual onset of hyperkinetic movements and abnormal erythrocyte morphology [].; GO: 0008104 protein localization
Probab=26.00  E-value=2.1e+02  Score=21.44  Aligned_cols=22  Identities=18%  Similarity=0.007  Sum_probs=15.2

Q ss_pred             eeeccccccCCCCeeec--cCCCC
Q psy4837          55 CFCFEEQQLNPHEELQI--FAPKK   76 (100)
Q Consensus        55 CFCF~eQ~L~pgE~~~M--vDP~i   76 (100)
                      |.+-....|.||++...  =||..
T Consensus       252 ~~~~~~~~l~p~~~~~yaWd~P~~  275 (277)
T PF06650_consen  252 GGSSEWYVLPPGSSMPYAWDDPTA  275 (277)
T ss_pred             cCCeeeEEECCCcceeEEccCcCC
Confidence            45555668999998876  55544


No 56 
>PF00345 PapD_N:  Pili and flagellar-assembly chaperone, PapD N-terminal domain;  InterPro: IPR016147 Most Gram-negative bacteria possess a supramolecular structure - the pili - on their surface, which mediates attachment to specific receptors. Many interactive subunits are required to assemble pili, but their assembly only takes place after translocation across the cytoplasmic membrane. Periplasmic chaperones assist pili assembly by binding to the subunits, thereby preventing premature aggregation [, ]. Pili chaperones are structurally, and possibly evolutionarily, related to the immunoglobulin superfamily [, ]: they contain two globular domains, with a topology identical to an immunoglobulin fold. This entry represents the N-terminal domain of pili assembly chaperone, and has a beta-sandwich fold consisting of seven strands in two sheets with a Greek key topology.; GO: 0007047 cellular cell wall organization, 0030288 outer membrane-bounded periplasmic space; PDB: 2CO6_B 2CO7_B 1L4I_B 3GFU_A 3F65_F 3F6L_A 3F6I_A 3GEW_B 3DSN_D 2OS7_B ....
Probab=25.25  E-value=2e+02  Score=19.13  Aligned_cols=71  Identities=15%  Similarity=0.039  Sum_probs=41.0

Q ss_pred             ceEEEeecCCeEEEEEEEECCCCCcEEEEEeCcccccchhhcccceeeeecc--ccccCCCCeeec---cCCCCCCCCc
Q psy4837           8 LLTLHVYPGETALAFYTAKNPTNSPIVGISTYNVVPFDAGQYFNKIQCFCFE--EQQLNPHEELQI---FAPKKPHHNE   81 (100)
Q Consensus         8 q~~v~V~pGE~~~v~y~a~N~sd~~v~GqAvpsVsP~~A~~YF~KieCFCF~--eQ~L~pgE~~~M---vDP~i~~d~~   81 (100)
                      ...+.+..++. .+.+.+.|.+++++..|+...=...  ..-..+..-|=++  .-.|+||++..+   ..+.++.|.|
T Consensus         6 ~trii~~~~~~-~~~i~v~N~~~~~~~vq~~v~~~~~--~~~~~~~~~~~vsPp~~~L~pg~~q~vRv~~~~~~~~~~E   81 (122)
T PF00345_consen    6 PTRIIFNESQR-SASITVTNNSDQPYLVQVWVYDQDD--EDEDEPTDPFIVSPPIFRLEPGESQTVRVYRGSKLPIDRE   81 (122)
T ss_dssp             SSEEEEETTSS-EEEEEEEESSSSEEEEEEEEEETTS--TTSSSSSSSEEEESSEEEEETTEEEEEEEEECSGS-SSS-
T ss_pred             cEEEEEeCCCC-EEEEEEEcCCCCcEEEEEEEEcCCC--cccccccccEEEeCCceEeCCCCcEEEEEEecCCCCCCce
Confidence            34566666554 7899999999999999987643111  0011122223332  346888887777   4344444443


No 57 
>PF00963 Cohesin:  Cohesin domain;  InterPro: IPR002102 Cohesin domains interact with a complementary domain, termed the dockerin domain (see IPR002105 from INTERPRO). The cohesin-dockerin interaction is the crucial interaction for complex formation in the cellulosome []. The scaffoldin component of the cellulolytic bacterium Clostridium thermocellum is a non-hydrolytic protein which organises the hydrolytic enzymes in a large complex, called the cellulosome. Scaffoldin comprises a series of functional domains, amongst which is a single cellulose-binding domain and nine cohesin domains which are responsible for integrating the individual enzymatic subunits into the complex.; GO: 0030246 carbohydrate binding, 0000272 polysaccharide catabolic process; PDB: 2BM3_A 3P0D_I 3KCP_A 2B59_A 3L8Q_B 3FNK_C 3GHP_B 2CCL_A 1ANU_A 1OHZ_A ....
Probab=25.20  E-value=2.3e+02  Score=19.31  Aligned_cols=36  Identities=25%  Similarity=0.103  Sum_probs=25.5

Q ss_pred             eEEEeecCCeEEEEEEEECCCCCcEEEEEeCccccc
Q psy4837           9 LTLHVYPGETALAFYTAKNPTNSPIVGISTYNVVPF   44 (100)
Q Consensus         9 ~~v~V~pGE~~~v~y~a~N~sd~~v~GqAvpsVsP~   44 (100)
                      .+..++||++..+.-.+.|..+.....+...++=|.
T Consensus         6 ~~~~a~~G~tv~V~V~v~~~~~~i~~~~~~l~yDp~   41 (141)
T PF00963_consen    6 DSVSAKPGETVTVPVNVSNVSNSIAGMQFTLSYDPS   41 (141)
T ss_dssp             SECEE-TTSEEEEEEEEESCTTTEEEEEEEEEE-TT
T ss_pred             CCceECCCCEEEEEEEEEcCCCcEEEEEEEEEeCCc
Confidence            455679999999999999998874455555555554


No 58 
>PF01429 MBD:  Methyl-CpG binding domain;  InterPro: IPR001739 Methylation at CpG dinucleotide, the most common DNA modification in eukaryotes, has been correlated with gene silencing associated with various phenomena such as genomic imprinting, transposon and chromosome X inactivation, differentiation, and cancer. Effects of DNA methylation are mediated through proteins which bind to symmetrically methylated CpGs. Such proteins contain a specific domain of ~70 residues, the methyl-CpG-binding domain (MBD), which is linked to additional domains associated with chromatin, such as the bromodomain, the AT hook motif,the SET domain, or the PHD finger. MBD-containing proteins appear to act as structural proteins, which recruit a variety of histone deacetylase (HDAC) complexes and chromatin remodelling factors, leading to chromatin compaction and, consequently, to transcriptional repression. The MBD of MeCP2, MBD1, MBD2, MBD4 and BAZ2 mediates binding to DNA, in case of MeCP2, MBD1 and MBD2 preferentially to methylated CpG. In case of human MBD3 and SETDB1 the MBD has been shown to mediate protein-protein interactions [, ]. The MBD folds into an alpha/beta sandwich structure comprising a layer of twisted beta sheet, backed by another layer formed by the alpha1 helix and a hairpin loop at the C terminus. These layers are both amphipathic, with the alpha1 helix and the beta sheet lying parallel and the hydrophobic faces tightly packed against each other. The beta sheet is composed of two long inner strands (beta2 and beta3) sandwiched by two shorter outer strands (beta1 and beta4) [].; GO: 0003677 DNA binding, 0005634 nucleus; PDB: 2KY8_A 1UB1_A 1D9N_A 1IG4_A 1QK9_A 3C2I_A.
Probab=25.08  E-value=31  Score=22.23  Aligned_cols=18  Identities=17%  Similarity=0.117  Sum_probs=10.6

Q ss_pred             hcccceeeeeccccccCC
Q psy4837          48 QYFNKIQCFCFEEQQLNP   65 (100)
Q Consensus        48 ~YF~KieCFCF~eQ~L~p   65 (100)
                      .+.-++++|||+.+...+
T Consensus        59 ~~~l~~~~F~F~~~~~~~   76 (77)
T PF01429_consen   59 EHDLKPENFSFSKRLIML   76 (77)
T ss_dssp             --SS-CTTBBTTTTB---
T ss_pred             cccCCHhHCCCCCCcccC
Confidence            456679999999887654


No 59 
>KOG0360|consensus
Probab=24.89  E-value=36  Score=30.35  Aligned_cols=29  Identities=24%  Similarity=0.402  Sum_probs=20.9

Q ss_pred             cceeeeeccccc--cCCCCeeeccCCCCCCC
Q psy4837          51 NKIQCFCFEEQQ--LNPHEELQIFAPKKPHH   79 (100)
Q Consensus        51 ~KieCFCF~eQ~--L~pgE~~~MvDP~i~~d   79 (100)
                      -||.||||+-|.  .+=|+.+-..||+-.++
T Consensus       235 ~Kiacld~~Lqk~km~lGv~vvv~dp~kle~  265 (545)
T KOG0360|consen  235 AKIACLDFTLQKTKMKLGVQVVVDDPEKLEQ  265 (545)
T ss_pred             ceeeeEecccccccccccceEEEcChHHHHH
Confidence            499999999885  34456655588876554


No 60 
>PRK13202 ureB urease subunit beta; Reviewed
Probab=24.67  E-value=1e+02  Score=22.02  Aligned_cols=67  Identities=7%  Similarity=-0.000  Sum_probs=40.1

Q ss_pred             ceeceEEEeecCCeEEEEEEEECCCCCcEEEEEeCcccccchhhcccceeeee-------ccccccCCCCeeec
Q psy4837           5 HRYLLTLHVYPGETALAFYTAKNPTNSPIVGISTYNVVPFDAGQYFNKIQCFC-------FEEQQLNPHEELQI   71 (100)
Q Consensus         5 ~p~q~~v~V~pGE~~~v~y~a~N~sd~~v~GqAvpsVsP~~A~~YF~KieCFC-------F~eQ~L~pgE~~~M   71 (100)
                      .+....+.+.+|..-.+.-.++|..|+|+---+=|-..=...+.-|.--..+=       =+--.++|||+++.
T Consensus         7 ~~~~~~I~ln~grr~~~~l~V~NtGDRPIQVGSHyHF~E~N~aL~FDR~~A~G~RLdIpaGTavRFEPG~~k~V   80 (104)
T PRK13202          7 FYGSGDIEMNAAALSRLQMRIINAGDRPVQVGSHVHLPQANRALSFDRATAHGYRLDIPAATAVRFEPGIPQIV   80 (104)
T ss_pred             ecCCCCEEeCCCCCceEEEEEEeCCCCceEEccccchhhcCcceeecHhHhcCcccccCCCCeEEECCCCeEEE
Confidence            34556789999965667789999999998655554433332222222222221       23345778877765


No 61 
>cd05711 Ig_FcalphaRI Immunoglobulin (IG)-like domain of of FcalphaRI. IG_FcalphaRI : immunoglobulin (IG)-like domain of of FcalphaRI. FcalphaRI (CD89) is an IgA-specific receptor that is expressed on monocytes, eosinophils, neutrophils and macrophages. FcalphaRI mediates IgA-induced immune effector responses such as phagocytosis, antibody-dependent cell-mediated cytotoxicity and respiratory burst. Both monomeric and dimeric IgA can bind to FcalphaRI, and monomeric or dimeric IgA immune complexes can activate phagocytosis and other immune responses through the clustering of FcalphaRI. The Fc RI ectodomain is comprised of two Ig-like domains oriented at about 90 degree to each another.
Probab=24.43  E-value=1.2e+02  Score=19.47  Aligned_cols=14  Identities=36%  Similarity=0.403  Sum_probs=11.2

Q ss_pred             EeCcccccchhhcc
Q psy4837          37 STYNVVPFDAGQYF   50 (100)
Q Consensus        37 AvpsVsP~~A~~YF   50 (100)
                      -+.+++|..+|.|+
T Consensus        58 ~I~~~~~~~~G~Y~   71 (94)
T cd05711          58 PLGPVTPAHAGTYR   71 (94)
T ss_pred             EecCCCcccCEEEE
Confidence            56677889999996


No 62 
>PF04314 DUF461:  Protein of unknown function (DUF461);  InterPro: IPR007410 This entry represents a domain found in of proteins of unknown function, including DR1885 from Deinococcus radiodurans and CC3502 from Caulobacter crescentus (Caulobacter vibrioides), which share a potential metal binding motif H(M)X10MX21HXM. DR1885 was found to bind copper(I) through a histidine and three Mets in a cupredoxin-like fold []. The surface location of the copper-binding site as well as the type of coordination are well poised for metal transfer chemistry, suggesting that DR1885 might transfer copper, taking the role of Cox17 in bacteria (Cox17 being an accessory protein required for correct assembly of eukaryotic cyochrome c oxidase). ; PDB: 2K6W_A 2K6Z_A 2K6Y_A 2K70_A 1X9L_A 2JQA_A.
Probab=24.37  E-value=1.5e+02  Score=19.98  Aligned_cols=53  Identities=6%  Similarity=0.037  Sum_probs=29.2

Q ss_pred             CeEEEEEEEECCCCCcEEEEEeCcccccchhhccccee-------eeeccccccCCCCeeec
Q psy4837          17 ETALAFYTAKNPTNSPIVGISTYNVVPFDAGQYFNKIQ-------CFCFEEQQLNPHEELQI   71 (100)
Q Consensus        17 E~~~v~y~a~N~sd~~v~GqAvpsVsP~~A~~YF~Kie-------CFCF~eQ~L~pgE~~~M   71 (100)
                      ...-+||.++|.+++++.-.++-+=.-+.  .=++...       =.=.+.-.+.||+++.+
T Consensus        14 ~~~a~y~ti~N~g~~~~~L~~v~s~~a~~--v~lh~~~~~~g~~~m~~v~~i~ipa~~~v~l   73 (110)
T PF04314_consen   14 NVTAAYFTITNNGDQDDRLVGVSSPAAAR--VELHETVMEDGVMKMRPVDSIPIPAGSTVEL   73 (110)
T ss_dssp             SEEEEEEEEE-CSSSEEEEEEEE-TTCCE--EEEEEECCCCCEEEECCSS-EEEETT-EEEE
T ss_pred             ccEEEEEEEEeCCCCCeEEEEEEcCCCce--EEEEEEEccCCeEEEEECCCEEECCCCeEEe
Confidence            58899999999999987666555422222  2222211       11122556777777777


No 63 
>TIGR02636 galM_Leloir galactose mutarotase. Members of this protein family act as galactose mutarotase (D-galactose 1-epimerase) and participate in the Leloir pathway for galactose/glucose interconversion. All members of the seed alignment for this model are found in gene clusters with other enzymes of the Leloir pathway. This enzyme family belongs to the aldose 1-epimerase family, described by pfam model pfam01263. However, the enzyme described as aldose 1-epimerase itself (EC 5.1.3.3) is called broadly specific for D-glucose, L-arabinose, D-xylose, D-galactose, maltose and lactose. The restricted genome context for genes in this family suggests members should act primarily on D-galactose.
Probab=24.27  E-value=1.4e+02  Score=23.94  Aligned_cols=46  Identities=26%  Similarity=0.367  Sum_probs=30.6

Q ss_pred             ceEEEeecCCeEEEEEEEECCCCCcEEEEEeCcccccchhhcccceeeeeccccccC
Q psy4837           8 LLTLHVYPGETALAFYTAKNPTNSPIVGISTYNVVPFDAGQYFNKIQCFCFEEQQLN   64 (100)
Q Consensus         8 q~~v~V~pGE~~~v~y~a~N~sd~~v~GqAvpsVsP~~A~~YF~KieCFCF~eQ~L~   64 (100)
                      .++|+|..-+...+.|......+           .++..|.-+.+=.+||+|-|.+-
T Consensus       260 g~~l~v~t~~p~~~vyT~~~~~~-----------~~~~~g~~~~~~~gialE~q~~p  305 (335)
T TIGR02636       260 DLSLEVFTTQPALQIYTGNFLAG-----------TPNRGGKKYVDHAGLALETQFLP  305 (335)
T ss_pred             CcEEEEecCCCEEEEecCCCcCC-----------ccCCCCcEeCCCcEEEEecccCC
Confidence            35677777777777777644422           12334566678899999988764


No 64 
>PRK00415 rps27e 30S ribosomal protein S27e; Reviewed
Probab=24.01  E-value=30  Score=22.32  Aligned_cols=25  Identities=28%  Similarity=0.452  Sum_probs=19.9

Q ss_pred             hhcccceee-eeccccccCCCCeeec
Q psy4837          47 GQYFNKIQC-FCFEEQQLNPHEELQI   71 (100)
Q Consensus        47 ~~YF~KieC-FCF~eQ~L~pgE~~~M   71 (100)
                      .-||-|+.| =|.++|++=.+-+...
T Consensus         6 ~S~F~~VkCp~C~n~q~vFsha~t~V   31 (59)
T PRK00415          6 RSRFLKVKCPDCGNEQVVFSHASTVV   31 (59)
T ss_pred             CCeEEEEECCCCCCeEEEEecCCcEE
Confidence            359999999 7999998877665544


No 65 
>smart00187 INB Integrin beta subunits (N-terminal portion of extracellular region). Portion of beta integrins that lies N-terminal to their EGF-like repeats. Integrins are cell adhesion molecules that mediate cell-extracellular  matrix and cell-cell interactions. They contain both alpha and beta subunits. Beta integrins are proposed to have a von Willebrand factor type-A "insert" or "I" -like domain (although this remains to be confirmed).
Probab=23.72  E-value=1.2e+02  Score=26.25  Aligned_cols=34  Identities=18%  Similarity=0.241  Sum_probs=27.9

Q ss_pred             eeeceeceEEEeecCCeEEEEEEEECCCCCcEEE
Q psy4837           2 ITTHRYLLTLHVYPGETALAFYTAKNPTNSPIVG   35 (100)
Q Consensus         2 ~~~~p~q~~v~V~pGE~~~v~y~a~N~sd~~v~G   35 (100)
                      +.+.|..-.++++||+..++...++=..+.|+--
T Consensus        69 vQi~PQ~v~l~LRpG~~~~f~~~~~~a~~yPvDL  102 (423)
T smart00187       69 VQVSPQRVRLKLRPGEPQNFTLTVRQAEDYPVDL  102 (423)
T ss_pred             eEeccceEEEEeccCCcEEEEEEEEecccCccce
Confidence            4568899999999999999998888777766643


No 66 
>KOG0121|consensus
Probab=23.70  E-value=29  Score=26.28  Aligned_cols=13  Identities=54%  Similarity=1.162  Sum_probs=9.6

Q ss_pred             cccceee-eecccc
Q psy4837          49 YFNKIQC-FCFEEQ   61 (100)
Q Consensus        49 YF~KieC-FCF~eQ   61 (100)
                      =|.|+-| |||-+-
T Consensus        72 r~kktpCGFCFVey   85 (153)
T KOG0121|consen   72 RFKKTPCGFCFVEY   85 (153)
T ss_pred             cCCcCccceEEEEE
Confidence            3677777 999764


No 67 
>PF00635 Motile_Sperm:  MSP (Major sperm protein) domain;  InterPro: IPR000535 Major sperm proteins (MSP) are central components in molecular interactions underlying sperm motility in Caenorhabditis elegans, whose sperm employ an amoebae-like crawling motion using a MSP-containing lamellipod, rather than the flagellar-based swimming motion associated with other sperm. These proteins oligomerise to form an extensive filament system that extends from sperm villipoda, along the leading edge of the pseudopod. About 30 MSP isoforms may exist in C. elegans. MSPs form a fibrous network, whereby MSP dimers form helical subfilaments that coil around one another to produce filaments, which in turn form supercoils to produce bundles. The crystal structure of MSP from C. elegans reveals an immunoglobulin (Ig)-like seven-stranded beta sandwich fold []. ; GO: 0005198 structural molecule activity; PDB: 1MSP_A 3MSP_B 2BVU_B 2MSP_C 1Z9O_F 1Z9L_A 3IKK_A 1WIC_A 2CRI_A 2RR3_A ....
Probab=23.41  E-value=1.5e+02  Score=18.92  Aligned_cols=61  Identities=18%  Similarity=0.283  Sum_probs=33.8

Q ss_pred             eeeceeceE-EEeecCCeEEEEEEEECCCCCcEEEEEeCcccccchhhcccceeeeeccccccCCCCeeec
Q psy4837           2 ITTHRYLLT-LHVYPGETALAFYTAKNPTNSPIVGISTYNVVPFDAGQYFNKIQCFCFEEQQLNPHEELQI   71 (100)
Q Consensus         2 ~~~~p~q~~-v~V~pGE~~~v~y~a~N~sd~~v~GqAvpsVsP~~A~~YF~KieCFCF~eQ~L~pgE~~~M   71 (100)
                      |.+.|...- .....+........++|.++.++.    |.|-=.....|.-+-.     .=.|+|||++++
T Consensus         2 l~v~P~~~i~F~~~~~~~~~~~l~l~N~s~~~i~----fKiktt~~~~y~v~P~-----~G~i~p~~~~~i   63 (109)
T PF00635_consen    2 LSVEPSELIFFNAPFNKQQSCELTLTNPSDKPIA----FKIKTTNPNRYRVKPS-----YGIIEPGESVEI   63 (109)
T ss_dssp             CEEESSSEEEEESSTSS-EEEEEEEEE-SSSEEE----EEEEES-TTTEEEESS-----EEEE-TTEEEEE
T ss_pred             eEEeCCcceEEcCCCCceEEEEEEEECCCCCcEE----EEEEcCCCceEEecCC-----CEEECCCCEEEE
Confidence            345566333 333346778889999999999653    3333333334443322     235899998888


No 68 
>PTZ00485 aldolase 1-epimerase; Provisional
Probab=23.15  E-value=94  Score=25.97  Aligned_cols=48  Identities=23%  Similarity=0.296  Sum_probs=35.1

Q ss_pred             eceEEEeecCCeEEEEEEEECCCCCcEEEEEeCcccccchhhcccceeeeeccccccC
Q psy4837           7 YLLTLHVYPGETALAFYTAKNPTNSPIVGISTYNVVPFDAGQYFNKIQCFCFEEQQLN   64 (100)
Q Consensus         7 ~q~~v~V~pGE~~~v~y~a~N~sd~~v~GqAvpsVsP~~A~~YF~KieCFCF~eQ~L~   64 (100)
                      .-++|+|.--+...+.|.+-.+.+ .         .++..|.-|.+-.+||||-|.+-
T Consensus       292 sg~~l~v~T~~P~~qiYT~n~l~~-~---------~~~~~g~~~~~~~giclE~Q~~P  339 (376)
T PTZ00485        292 TNICMKVYSTFPCMWVYTANNKPL-P---------ASGGPGQRYARWTGMGLEPQYFP  339 (376)
T ss_pred             CCCEEEEEECCCEEEEECCCCCCc-c---------ccCCCCcCcCCCCEEEEeccCCC
Confidence            346899999999999998755532 1         12344556788899999999874


No 69 
>PF13815 Dzip-like_N:  Iguana/Dzip1-like DAZ-interacting protein N-terminal
Probab=23.09  E-value=18  Score=25.08  Aligned_cols=40  Identities=10%  Similarity=0.187  Sum_probs=31.6

Q ss_pred             eeccccccCCCCeeec-----cCC-CCCCCCchhhhcccccceeEeeccc
Q psy4837          56 FCFEEQQLNPHEELQI-----FAP-KKPHHNEHKMRSQILPKVEIANVSS   99 (100)
Q Consensus        56 FCF~eQ~L~pgE~~~M-----vDP-~i~~d~~~~~~~~~~~~~~~~~~~~   99 (100)
                      |||..    ..|.+++     ||+ .|.++.+.+.+..+++.+-.++++.
T Consensus         1 F~f~~----r~~~iDWr~i~~iDvd~i~~~~Di~~Lq~~i~~vtf~~l~~   46 (118)
T PF13815_consen    1 FQFRP----RREPIDWRLISAIDVDRIVRELDIDTLQENIENVTFCDLEN   46 (118)
T ss_pred             CCCCC----CCCCCcHHHHhccCHHHHHhccCHHHHHHHHHhcceeccCh
Confidence            67763    3577777     898 6888899999999999888887764


No 70 
>PF06280 DUF1034:  Fn3-like domain (DUF1034);  InterPro: IPR010435 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This domain of unknown function is present in bacterial and plant peptidases belonging to MEROPS peptidase family S8 (subfamily S8A subtilisin, clan SB). It is C-terminal to and adjacent to the S8 peptidase domain and can be found in conjunction with the PA (Protease associated) domain (IPR003137 from INTERPRO) and additionally in Gram-positive bacteria with the surface protein anchor domain (IPR001899 from INTERPRO).; GO: 0004252 serine-type endopeptidase activity, 0005618 cell wall, 0016020 membrane; PDB: 3EIF_A 1XF1_B.
Probab=21.17  E-value=1.5e+02  Score=19.60  Aligned_cols=26  Identities=19%  Similarity=0.103  Sum_probs=17.3

Q ss_pred             eceeceEEEeecCCeEEEEEEEECCC
Q psy4837           4 THRYLLTLHVYPGETALAFYTAKNPT   29 (100)
Q Consensus         4 ~~p~q~~v~V~pGE~~~v~y~a~N~s   29 (100)
                      ..-...+++|.||+...+...+.-.+
T Consensus        57 ~~~~~~~vTV~ag~s~~v~vti~~p~   82 (112)
T PF06280_consen   57 VSFSPDTVTVPAGQSKTVTVTITPPS   82 (112)
T ss_dssp             EE---EEEEE-TTEEEEEEEEEE--G
T ss_pred             EEeCCCeEEECCCCEEEEEEEEEehh
Confidence            34456899999999999999988755


No 71 
>PF07883 Cupin_2:  Cupin domain;  InterPro: IPR013096 This family represents the conserved barrel domain of the cupin superfamily [] (cupa is the Latin term for a small barrel). ; PDB: 2OPK_C 3BU7_B 2PHD_D 3NVC_A 3NKT_A 3NJZ_A 3NW4_A 3NST_A 3NL1_A 2H0V_A ....
Probab=20.91  E-value=1.9e+02  Score=16.72  Aligned_cols=35  Identities=14%  Similarity=0.144  Sum_probs=22.0

Q ss_pred             eceeceEEEeecCCeEEE----EEEEECCCCCcEEEEEe
Q psy4837           4 THRYLLTLHVYPGETALA----FYTAKNPTNSPIVGIST   38 (100)
Q Consensus         4 ~~p~q~~v~V~pGE~~~v----~y~a~N~sd~~v~GqAv   38 (100)
                      +......+.+++|+...+    -+.+.|.++.+..--.|
T Consensus        32 ~~~~~~~~~l~~Gd~~~i~~~~~H~~~n~~~~~~~~l~V   70 (71)
T PF07883_consen   32 LTVDGERVELKPGDAIYIPPGVPHQVRNPGDEPARFLVV   70 (71)
T ss_dssp             EEETTEEEEEETTEEEEEETTSEEEEEEESSSEEEEEEE
T ss_pred             EEEccEEeEccCCEEEEECCCCeEEEEECCCCCEEEEEE
Confidence            334456777777776664    46667777776654443


No 72 
>PRK13203 ureB urease subunit beta; Reviewed
Probab=20.71  E-value=2.5e+02  Score=19.94  Aligned_cols=66  Identities=14%  Similarity=0.143  Sum_probs=40.3

Q ss_pred             ceeceEEEeecCCeEEEEEEEECCCCCcEEEEEeCcccccchhhcccceeeee-------ccccccCCCCeeec
Q psy4837           5 HRYLLTLHVYPGETALAFYTAKNPTNSPIVGISTYNVVPFDAGQYFNKIQCFC-------FEEQQLNPHEELQI   71 (100)
Q Consensus         5 ~p~q~~v~V~pGE~~~v~y~a~N~sd~~v~GqAvpsVsP~~A~~YF~KieCFC-------F~eQ~L~pgE~~~M   71 (100)
                      .+....+.+.+|... +.-.++|..|||+---+=|-..=...+.-|.--..|=       =+--.++|||+++.
T Consensus         7 ~~~~~~I~ln~gr~~-~~l~V~NtGDRPIQVGSH~HF~E~N~aL~FDR~~A~G~RLdIpaGTavRFEPG~~k~V   79 (102)
T PRK13203          7 ITADGEIELNAGRET-VTLTVANTGDRPIQVGSHYHFFEVNPALSFDREAARGMRLNIPAGTAVRFEPGQTREV   79 (102)
T ss_pred             ecCCCCEEeCCCCCE-EEEEEEeCCCCceEEccccchhhcCcchhccHhhhcCcccccCCCCeEeECCCCeEEE
Confidence            345567888888654 6778999999998765555444333333333222222       23456778887765


No 73 
>COG1470 Predicted membrane protein [Function unknown]
Probab=20.62  E-value=2.6e+02  Score=25.01  Aligned_cols=64  Identities=17%  Similarity=0.073  Sum_probs=47.8

Q ss_pred             eceEEEeecCCeEEEEEEEECCCCCcEEEEEeCcccccchhhcccceeeeeccccccCCCCeeec
Q psy4837           7 YLLTLHVYPGETALAFYTAKNPTNSPIVGISTYNVVPFDAGQYFNKIQCFCFEEQQLNPHEELQI   71 (100)
Q Consensus         7 ~q~~v~V~pGE~~~v~y~a~N~sd~~v~GqAvpsVsP~~A~~YF~KieCFCF~eQ~L~pgE~~~M   71 (100)
                      .+....+++|++....-.++|+...+-.--.--+=.|.-....|. =+=---+.-.|+|||+++.
T Consensus       274 s~~~~~i~~~~t~sf~V~IeN~g~~~d~y~Le~~g~pe~w~~~Ft-eg~~~vt~vkL~~gE~kdv  337 (513)
T COG1470         274 SDIYLEISPSTTASFTVSIENRGKQDDEYALELSGLPEGWTAEFT-EGELRVTSVKLKPGEEKDV  337 (513)
T ss_pred             ccceeEEccCCceEEEEEEccCCCCCceeEEEeccCCCCcceEEe-eCceEEEEEEecCCCceEE
Confidence            345578999999999999999999887665555546777777776 2222334567999999999


No 74 
>PF12690 BsuPI:  Intracellular proteinase inhibitor;  InterPro: IPR020481 BsuPI is a intracellular proteinase inhibitor that directly regulates the major intracellular proteinase (ISP-1) activity in vivo. It inhibits ISP-1 in the early stages of sporulation and then may be inactivated by a membrane-bound proteinase [].; PDB: 3ISY_A.
Probab=20.51  E-value=1.3e+02  Score=19.61  Aligned_cols=49  Identities=22%  Similarity=0.368  Sum_probs=23.9

Q ss_pred             EEEEEEEECCCCCcEEEEEe------Ccc--------cccchhhcccceeeeeccccccCCCCeeec
Q psy4837          19 ALAFYTAKNPTNSPIVGIST------YNV--------VPFDAGQYFNKIQCFCFEEQQLNPHEELQI   71 (100)
Q Consensus        19 ~~v~y~a~N~sd~~v~GqAv------psV--------sP~~A~~YF~KieCFCF~eQ~L~pgE~~~M   71 (100)
                      ..+.+.++|.++++++-+=.      +.|        --.+.+.-|...    +.+-+|+|||++..
T Consensus         2 v~~~l~v~N~s~~~v~l~f~sgq~~D~~v~d~~g~~vwrwS~~~~FtQa----l~~~~l~pGe~~~~   64 (82)
T PF12690_consen    2 VEFTLTVTNNSDEPVTLQFPSGQRYDFVVKDKEGKEVWRWSDGKMFTQA----LQEETLEPGESLTY   64 (82)
T ss_dssp             EEEEEEEEE-SSS-EEEEESSS--EEEEEE-TT--EEEETTTT-----------EEEEE-TT-EEEE
T ss_pred             EEEEEEEEeCCCCeEEEEeCCCCEEEEEEECCCCCEEEEecCCchhhhe----eeEEEECCCCEEEE
Confidence            45778888888888775511      111        122355555544    33678999999888


Done!