Query         psy16510
Match_columns 147
No_of_seqs    113 out of 1126
Neff          5.8 
Searched_HMMs 46136
Date          Fri Aug 16 23:10:00 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy16510.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/16510hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 PF00379 Chitin_bind_4:  Insect  99.7 4.7E-17   1E-21  103.9   6.7   52   54-110     1-52  (52)
  2 PF06004 DUF903:  Bacterial pro  89.2     1.2 2.6E-05   28.3   4.8   39   56-102     2-40  (50)
  3 PRK09408 ompX outer membrane p  79.0     6.3 0.00014   30.8   5.6   12   83-94     60-71  (171)
  4 PF05984 Cytomega_UL20A:  Cytom  78.5     6.6 0.00014   27.9   5.0   21    7-27      2-22  (100)
  5 COG5510 Predicted small secret  78.3     2.3   5E-05   26.4   2.4   22    1-24      1-22  (44)
  6 PF06286 Coleoptericin:  Coleop  73.5     1.4   3E-05   33.3   0.6   19    1-28      1-19  (143)
  7 PF06316 Ail_Lom:  Enterobacter  73.2     7.7 0.00017   31.4   4.8   15   83-97     63-77  (199)
  8 PF07172 GRP:  Glycine rich pro  68.6     4.6  0.0001   28.8   2.3    9   17-25     14-22  (95)
  9 PF10731 Anophelin:  Thrombin i  66.4       8 0.00017   25.7   2.9   19   11-29      5-23  (65)
 10 COG5294 Uncharacterized protei  66.3      25 0.00054   26.0   5.8   14   94-107    63-76  (113)
 11 PF12930 DUF3836:  Family of un  65.6      27 0.00059   26.2   6.1   20   53-72     63-82  (132)
 12 COG4907 Predicted membrane pro  64.6      14 0.00031   33.7   5.1   23   83-105    51-73  (595)
 13 PHA02698 hypothetical protein;  64.5     4.1 8.8E-05   28.4   1.3   21  108-128    19-39  (89)
 14 PRK09980 ompL outer membrane p  62.4      40 0.00086   27.7   7.0   60   53-112    40-106 (230)
 15 PRK15346 outer membrane secret  60.2     8.9 0.00019   34.5   3.1   20    1-27      1-20  (499)
 16 PF15240 Pro-rich:  Proline-ric  58.6     9.9 0.00021   30.3   2.7   18   11-28      2-19  (179)
 17 COG3117 Uncharacterized protei  58.2      25 0.00054   28.2   5.0   25    1-26      1-25  (188)
 18 PRK13254 cytochrome c-type bio  56.6      11 0.00025   28.8   2.7   21    1-22      1-21  (148)
 19 PF12729 4HB_MCP_1:  Four helix  55.3      10 0.00022   27.2   2.2   23    2-24      2-24  (181)
 20 KOG1691|consensus               55.1      96  0.0021   25.3   7.9   31   84-114    80-110 (210)
 21 TIGR03795 chp_BMA0021 conserve  54.4      21 0.00044   26.5   3.7   24  116-139    88-112 (114)
 22 COG3017 LolB Outer membrane li  54.2      40 0.00086   27.4   5.6   15   12-26     10-24  (206)
 23 PRK10894 lipopolysaccharide tr  53.7      24 0.00052   27.3   4.2   35   11-46      6-40  (180)
 24 PRK13159 cytochrome c-type bio  52.0      13 0.00029   28.8   2.5   13  123-135   128-140 (155)
 25 PF08138 Sex_peptide:  Sex pept  51.8     4.8  0.0001   26.1   0.0   16   10-25      4-19  (56)
 26 PF11153 DUF2931:  Protein of u  51.0      62  0.0013   25.6   6.3   22  105-133   107-128 (216)
 27 PF06316 Ail_Lom:  Enterobacter  50.0      80  0.0017   25.5   6.7   27   81-107    49-75  (199)
 28 PRK13150 cytochrome c-type bio  49.8      15 0.00033   28.6   2.5   17  123-139   135-151 (159)
 29 COG3026 RseB Negative regulato  49.4      24 0.00052   30.4   3.8   22   81-102    82-103 (320)
 30 PRK15401 alpha-ketoglutarate-d  47.9      44 0.00095   27.1   5.0   49   87-135    46-105 (213)
 31 PF02402 Lysis_col:  Lysis prot  47.8      13 0.00027   23.3   1.4   17    7-23      3-19  (46)
 32 COG4939 Major membrane immunog  44.8 1.1E+02  0.0024   23.3   6.4   44   50-98     30-73  (147)
 33 PRK10523 lipoprotein involved   44.5 1.8E+02  0.0039   24.0   8.1   12   45-57     61-72  (234)
 34 PRK10755 sensor protein BasS/P  43.8      19 0.00041   29.6   2.3   18    1-19      1-18  (356)
 35 PF03562 MltA:  MltA specific i  42.8      26 0.00057   27.2   2.8   57   81-140    80-143 (158)
 36 PF13511 DUF4124:  Domain of un  42.4      29 0.00063   21.6   2.6   17   87-106    14-30  (60)
 37 COG2332 CcmE Cytochrome c-type  42.2 1.2E+02  0.0026   23.6   6.3   14  123-136   129-142 (153)
 38 PF00879 Defensin_propep:  Defe  42.2      66  0.0014   20.6   4.1   12   14-25      5-16  (52)
 39 smart00817 Amelin Ameloblastin  41.0      25 0.00055   31.0   2.7   18   13-30      4-21  (411)
 40 PF13623 SurA_N_2:  SurA N-term  40.9      22 0.00048   26.9   2.2   20    3-23      3-22  (145)
 41 KOG3866|consensus               39.5      29 0.00064   30.4   2.9   26    2-27      1-26  (442)
 42 PF10929 DUF2811:  Protein of u  36.9      32  0.0007   22.4   2.1   20  122-141     5-25  (57)
 43 TIGR01643 YD_repeat_2x YD repe  33.6      71  0.0015   18.1   3.1   20   90-111     9-28  (42)
 44 PRK13165 cytochrome c-type bio  33.4      42  0.0009   26.2   2.6   16  123-138   135-150 (160)
 45 TIGR00219 mreC rod shape-deter  33.3      37 0.00079   28.4   2.5   19    2-20      1-19  (283)
 46 PLN03207 stomagen; Provisional  33.2      55  0.0012   23.9   3.0   26    1-26      5-30  (113)
 47 PRK05461 apaG CO2+/MG2+ efflux  33.1      54  0.0012   24.4   3.1   21   81-101    93-113 (127)
 48 cd07252 BphC1-RGP6_N_like N-te  33.0      60  0.0013   22.3   3.2   21   87-107   100-120 (120)
 49 PF09244 DUF1964:  Domain of un  32.7      52  0.0011   22.0   2.6   22   86-107     3-24  (68)
 50 COG3137 Putative salt-induced   31.6      68  0.0015   26.8   3.7   15   51-65     51-65  (262)
 51 PRK14818 NADH dehydrogenase su  31.6      38 0.00083   26.8   2.2   24  112-135   129-152 (173)
 52 PF05581 MCP_N:  Vibrio chemota  31.6      16 0.00034   26.6   0.0   24    2-25      1-24  (103)
 53 cd07240 ED_TypeI_classII_N N-t  31.5      73  0.0016   21.1   3.4   21   87-107    97-117 (117)
 54 PF07437 YfaZ:  YfaZ precursor;  31.4 1.9E+02  0.0041   22.6   6.1   13   83-95     77-89  (180)
 55 TIGR03352 VI_chp_3 type VI sec  30.9      62  0.0013   24.4   3.2   13   13-25      5-17  (146)
 56 COG2967 ApaG Uncharacterized p  30.2 1.4E+02   0.003   22.5   4.8   21   81-101    92-112 (126)
 57 COG1934 Uncharacterized protei  29.9      71  0.0015   25.1   3.4   22   35-56     30-51  (173)
 58 KOG4824|consensus               29.4 2.9E+02  0.0064   22.6   6.9   22   82-103    65-86  (224)
 59 PF03100 CcmE:  CcmE;  InterPro  28.7 2.4E+02  0.0051   20.7   6.2    7    2-8       1-7   (131)
 60 PF05899 Cupin_3:  Protein of u  28.6 1.3E+02  0.0028   19.7   4.1   30   81-112    28-57  (74)
 61 PF05538 Campylo_MOMP:  Campylo  28.4 1.9E+02  0.0041   26.1   6.1   15   46-61     31-45  (431)
 62 COG0450 AhpC Peroxiredoxin [Po  28.2   1E+02  0.0022   24.8   4.1   48   81-138   120-167 (194)
 63 PHA02978 hypothetical protein;  27.8      58  0.0012   24.2   2.4   16   87-102    75-90  (135)
 64 PRK00315 potassium-transportin  27.0      42 0.00092   27.0   1.7   34  106-139    74-112 (193)
 65 KOG4063|consensus               27.0      60  0.0013   25.4   2.5   17   11-27      6-22  (158)
 66 PTZ00459 mucin-associated surf  26.6      43 0.00093   28.3   1.8    9   16-24     11-19  (291)
 67 PF04710 Pellino:  Pellino;  In  26.4   1E+02  0.0022   27.6   4.1   53   60-112    46-100 (416)
 68 PRK13999 potassium-transportin  26.3      44 0.00095   27.1   1.7   32  107-138    86-122 (201)
 69 PF12984 DUF3868:  Domain of un  26.1      39 0.00084   24.5   1.3   11   10-20      5-15  (115)
 70 PF05423 Mycobact_memb:  Mycoba  25.6 1.4E+02  0.0031   22.5   4.3   30   88-118    54-85  (140)
 71 PF13124 DUF3963:  Protein of u  25.6      69  0.0015   19.2   2.0   13    6-18     21-33  (40)
 72 PF00659 POLO_box:  POLO box du  25.5 1.9E+02   0.004   18.4   4.7   64   53-135     4-68  (68)
 73 PF11912 DUF3430:  Protein of u  25.4      57  0.0012   25.2   2.2    8   53-60     40-47  (212)
 74 TIGR01655 yxeA_fam conserved h  25.0 2.7E+02  0.0058   20.0   6.6   14   93-106    63-76  (114)
 75 PRK09558 ushA bifunctional UDP  24.8      57  0.0012   29.5   2.4   10    1-10      1-10  (551)
 76 PRK11627 hypothetical protein;  24.3      68  0.0015   25.5   2.5    9   84-92    114-122 (192)
 77 PRK14864 putative biofilm stre  23.8      85  0.0019   22.7   2.7    8    1-8       1-8   (104)
 78 cd08344 MhqB_like_N N-terminal  23.6      95   0.002   20.9   2.8   16   88-103    93-108 (112)
 79 COG2834 LolA Outer membrane li  23.0 1.9E+02  0.0042   22.7   4.8   30   83-112    83-113 (211)
 80 PF07263 DMP1:  Dentin matrix p  22.9      38 0.00083   31.0   0.9   13   14-26      7-19  (514)
 81 PF07584 BatA:  Aerotolerance r  22.7      84  0.0018   20.8   2.3   16   10-25     60-75  (77)
 82 PRK09455 rseB anti-sigma E fac  22.7 1.4E+02   0.003   25.5   4.2   20   81-100    83-102 (319)
 83 TIGR01200 Porphyromonas gingiv  22.7      96  0.0021   26.5   3.1   11   13-23      7-17  (292)
 84 PHA02823 chemokine binding pro  22.6      71  0.0015   26.4   2.3   13   14-26      5-17  (255)
 85 PF10902 DUF2693:  Protein of u  21.7 2.5E+02  0.0055   19.5   4.6   23   52-74      1-23  (83)
 86 PF03866 HAP:  Hydrophobic abun  21.6      71  0.0015   24.6   1.9   17    1-25      1-17  (164)
 87 COG4594 FecB ABC-type Fe3+-cit  21.6 1.8E+02  0.0039   24.9   4.5   27   65-94     36-62  (310)
 88 COG3354 FlaG Putative archaeal  21.3 1.4E+02   0.003   23.2   3.5   27   36-62     50-78  (154)
 89 TIGR01965 VCBS_repeat VCBS rep  21.2 1.6E+02  0.0034   21.1   3.6   20   83-102    55-74  (99)
 90 PRK00178 tolB translocation pr  20.5      84  0.0018   26.8   2.5   18   10-27      5-22  (430)
 91 COG3422 Uncharacterized conser  20.2      87  0.0019   20.6   1.9   22   50-71     11-33  (59)
 92 cd09013 BphC-JF8_N_like N-term  20.1 1.3E+02  0.0029   20.3   3.0   16   88-103   102-117 (121)

No 1  
>PF00379 Chitin_bind_4:  Insect cuticle protein;  InterPro: IPR000618 Insect cuticle is composed of proteins and chitin. The cuticular proteins seem to be specific to the type of cuticle (flexible or stiff) that occur at stages of the insect development. The proteins found in the flexible cuticle of larva and pupa of different insects share a conserved C-terminal section [] such a region is also found in the soft endocuticle of adults insects [] as well as in other cuticular proteins including in arachnids []. In addition, cuticular proteins share hydrophobic regions dominated by tetrapeptide repeats (A-A-P-A/V), which are presumed to be functionally important [, ]. Many insect cuticle proteins also include a 35-36 amino acid motif known as the R and R consensus. An extended form of this motif has been shown [] to bind chitin. It has no sequence similiarity to the cysteine-containing chitin-binding domain of chitinases and some peritrophic membrane proteins, suggesting that arthropods have two distinct classes of chitin-binding proteins, those with the chitin-binding domains found in lectins, chitinases and peritrophic membranes (cysCBD), and those with the type of chitin-binding domains found in cuticular proteins (non-cysCBD) []. The cuticle protein signature has been found in locust cuticle proteins 7 (LM-7), 8 (LM-8), 19 (LM-19) and endocuticle structural glycoprotein ABD-4; Hyalophora cecropia (Cecropia moth) cuticle proteins 12 and 66; Drosophila melanogaster (Fruit fly) larval cuticles proteins I, II, III and IV (LCP1 to LCP4); drosophila pupal cuticle proteins PCP, EDG-78E and EDG-84E; Manduca sexta (Tobacco hawkmoth) cuticle protein LCP-14; Tenebrio molitor (Yellow mealworm) cuticle proteins ACP-20, A1A, A2B and A3A; and Araneus diadematus (Spider) cuticle proteins ACP 11.9, ACP 12.4, ACP 12.6, ACP 15.5 and ACP 15.7.; GO: 0042302 structural constituent of cuticle
Probab=99.70  E-value=4.7e-17  Score=103.92  Aligned_cols=52  Identities=42%  Similarity=0.876  Sum_probs=46.1

Q ss_pred             eeEEEeeCCceeeeecceecCCCCCcccccceeeEEEEECCCCcEEEEEEEcCCCCc
Q psy16510         54 YRYSYETGNGIAANEQGYLKNPGQKDLEAQTAQGQFTYTAPDGTPITVQWFADETGF  110 (147)
Q Consensus        54 Y~~~Ye~~dG~~~~E~G~~k~~g~~~~~~~~v~GsYsy~~pdG~~~~V~Y~ADe~Gf  110 (147)
                      |+|+|+++|| .++|+|.   .++++ ..++|+|+|+|++|||+.++|+|+||++||
T Consensus         1 Y~f~Y~~~dg-~~~e~~~---~~~~~-~~~~v~GsY~y~~pdG~~~~V~Y~Ad~~Gf   52 (52)
T PF00379_consen    1 YSFGYETSDG-SRQEEGR---PETED-EGGVVRGSYSYIDPDGQTRTVTYVADENGF   52 (52)
T ss_pred             CeeEeEcCCC-CEEEEEE---cccCC-CCCEEEEEEEEECCCCCEEEEEEECCCCCC
Confidence            7899999999 8888876   23334 689999999999999999999999999998


No 2  
>PF06004 DUF903:  Bacterial protein of unknown function (DUF903);  InterPro: IPR010305 This family consists of several small bacterial proteins several of which are classified as putative lipoproteins. The function of this family is unknown.; PDB: 2RB6_B 3BDU_F 2RA2_F 2RD1_C 3FIF_H 2JN0_A 2K57_A.
Probab=89.23  E-value=1.2  Score=28.28  Aligned_cols=39  Identities=26%  Similarity=0.453  Sum_probs=28.9

Q ss_pred             EEEeeCCceeeeecceecCCCCCcccccceeeEEEEECCCCcEEEEE
Q psy16510         56 YSYETGNGIAANEQGYLKNPGQKDLEAQTAQGQFTYTAPDGTPITVQ  102 (147)
Q Consensus        56 ~~Ye~~dG~~~~E~G~~k~~g~~~~~~~~v~GsYsy~~pdG~~~~V~  102 (147)
                      |--.+.||....-.|..+        .-.-.|.|+|.+.+|+.+.++
T Consensus         2 yvi~t~dG~~i~t~gkP~--------~D~~tG~~~y~D~~G~~~qIn   40 (50)
T PF06004_consen    2 YVITTNDGRTIVTDGKPK--------YDKDTGMYSYTDADGKEQQIN   40 (50)
T ss_dssp             EEEEETTSEEEEESSEEE--------EETTTTEEEEEBTTS-EEEEE
T ss_pred             cEEEeCCCcEEEeCCCcc--------ccCCCCcEEEECCCCCEEEEc
Confidence            345678888877777764        234579999999999988875


No 3  
>PRK09408 ompX outer membrane protein X; Provisional
Probab=78.99  E-value=6.3  Score=30.78  Aligned_cols=12  Identities=42%  Similarity=0.598  Sum_probs=7.3

Q ss_pred             cceeeEEEEECC
Q psy16510         83 QTAQGQFTYTAP   94 (147)
Q Consensus        83 ~~v~GsYsy~~p   94 (147)
                      +-+.||++|...
T Consensus        60 ~Gvi~sfty~~~   71 (171)
T PRK09408         60 LGVIGSFTYTEK   71 (171)
T ss_pred             EEEEEEEEEecC
Confidence            456666666654


No 4  
>PF05984 Cytomega_UL20A:  Cytomegalovirus UL20A protein;  InterPro: IPR009245 This family consists of several Cytomegalovirus UL20A proteins. UL20A is thought to be a glycoprotein [].
Probab=78.49  E-value=6.6  Score=27.88  Aligned_cols=21  Identities=19%  Similarity=0.169  Sum_probs=17.5

Q ss_pred             chhhHHHHHHHHHHHHHhhcc
Q psy16510          7 DIGALAISFVALFIAVVSAQR   27 (147)
Q Consensus         7 ~~~~~~~~i~~~~la~a~A~~   27 (147)
                      .|+.|++.+|++-|.+|+|++
T Consensus         2 aRRlwiLslLAVtLtVALAAP   22 (100)
T PF05984_consen    2 ARRLWILSLLAVTLTVALAAP   22 (100)
T ss_pred             chhhHHHHHHHHHHHHHhhcc
Confidence            467888899999998888875


No 5  
>COG5510 Predicted small secreted protein [Function unknown]
Probab=78.31  E-value=2.3  Score=26.41  Aligned_cols=22  Identities=27%  Similarity=0.288  Sum_probs=9.8

Q ss_pred             CcccccchhhHHHHHHHHHHHHHh
Q psy16510          1 MMKIEKDIGALAISFVALFIAVVS   24 (147)
Q Consensus         1 ~m~~~~~~~~~~~~i~~~~la~a~   24 (147)
                      |||-  .|-..++++++|++..+|
T Consensus         1 mmk~--t~l~i~~vll~s~llaaC   22 (44)
T COG5510           1 MMKK--TILLIALVLLASTLLAAC   22 (44)
T ss_pred             CchH--HHHHHHHHHHHHHHHHHh
Confidence            6773  333333344444444443


No 6  
>PF06286 Coleoptericin:  Coleoptericin;  InterPro: IPR009382 This family consists of several insect coleoptericin, acaloleptin, holotricin and rhinocerosin proteins which are all known to be antibacterial proteins []. These all appear to be short, glycine-rich molecules, inducible by infection.; GO: 0042742 defense response to bacterium, 0005576 extracellular region
Probab=73.46  E-value=1.4  Score=33.34  Aligned_cols=19  Identities=32%  Similarity=0.303  Sum_probs=14.0

Q ss_pred             CcccccchhhHHHHHHHHHHHHHhhccC
Q psy16510          1 MMKIEKDIGALAISFVALFIAVVSAQRG   28 (147)
Q Consensus         1 ~m~~~~~~~~~~~~i~~~~la~a~A~~~   28 (147)
                      |||+         .|++||+++++|-+.
T Consensus         1 mmkl---------~i~~~lia~saay~v   19 (143)
T PF06286_consen    1 MMKL---------YIIFGLIALSAAYAV   19 (143)
T ss_pred             CceE---------eeehhHHHHHHhhcc
Confidence            8998         777788887765443


No 7  
>PF06316 Ail_Lom:  Enterobacterial Ail/Lom protein;  InterPro: IPR000758 Virulence-related outer membrane proteins are expressed in Gram-negative bacteria and are essential to bacterial survival within macrophages and for eukaryotic cell invasion. Members of this group include: PagC, required by Salmonella typhimurium for survival in macrophages and for virulence in mice [] Rck outer membrane protein of the S. typhimurium virulence plasmid [] Ail, a product of the Yersinia enterocolitica chromosome capable of mediating bacterial adherence to and invasion of epithelial cell lines []  OmpX from Escherichia coli that promotes adhesion to and entry into mammalian cells. It also has a role in the resistance against attack by the human complement system []  a Bacteriophage lambda outer membrane protein, Lom [] The crystal structure of OmpX from E. coli reveals that OmpX consists of an eight-stranded antiparallel all-next-neighbour beta barrel []. The structure shows two girdles of aromatic amino acid residues and a ribbon of nonpolar residues that attach to the membrane interior. The core of the barrel consists of an extended hydrogen-bonding network of highly conserved residues. OmpX thus resembles an inverse micelle. The OmpX structure shows that the membrane-spanning part of the protein is much better conserved than the extracellular loops. Moreover, these loops form a protruding beta sheet, the edge of which presumably binds to external proteins. It is suggested that this type of binding promotes cell adhesion and invasion and helps defend against the complement system. Although OmpX has the same beta-sheet topology as the structurally related outer membrane protein A (OmpA) IPR000498 from INTERPRO, their barrels differ with respect to the shear numbers and internal hydrogen-bonding networks.; GO: 0009279 cell outer membrane
Probab=73.17  E-value=7.7  Score=31.36  Aligned_cols=15  Identities=20%  Similarity=0.386  Sum_probs=8.9

Q ss_pred             cceeeEEEEECCCCc
Q psy16510         83 QTAQGQFTYTAPDGT   97 (147)
Q Consensus        83 ~~v~GsYsy~~pdG~   97 (147)
                      +-|.|||+|...++.
T Consensus        63 lGvItSfSY~~~~~~   77 (199)
T PF06316_consen   63 LGVITSFSYASGDDS   77 (199)
T ss_pred             eeeeEEEEEeecCCC
Confidence            445666666665554


No 8  
>PF07172 GRP:  Glycine rich protein family;  InterPro: IPR010800 This family consists of glycine rich proteins. Some of them may be involved in resistance to environmental stress [].
Probab=68.64  E-value=4.6  Score=28.75  Aligned_cols=9  Identities=22%  Similarity=0.338  Sum_probs=3.4

Q ss_pred             HHHHHHHhh
Q psy16510         17 ALFIAVVSA   25 (147)
Q Consensus        17 ~~~la~a~A   25 (147)
                      +++|+++++
T Consensus        14 A~lLlisSe   22 (95)
T PF07172_consen   14 AALLLISSE   22 (95)
T ss_pred             HHHHHHHhh
Confidence            333444333


No 9  
>PF10731 Anophelin:  Thrombin inhibitor from mosquito;  InterPro: IPR018932  Members of this family are all inhibitors of thrombin, the peptidase that is at the end of the blood coagulation cascade and which creates the clot by cleaving fibrinogen. The interaction between thrombin and fibrinogen involves two different areas of contact - via the thrombin active site and via a second substrate-binding site known as an exosite. The inhibitor acts by blocking the exosite, rather than by interacting with the active site. The inhibitors are from mosquitoes that feed on human blood and which, by inhibiting thrombin, prevent the blood from clotting and keep it flowing. 
Probab=66.38  E-value=8  Score=25.71  Aligned_cols=19  Identities=21%  Similarity=0.209  Sum_probs=11.0

Q ss_pred             HHHHHHHHHHHHHhhccCC
Q psy16510         11 LAISFVALFIAVVSAQRGP   29 (147)
Q Consensus        11 ~~~~i~~~~la~a~A~~~~   29 (147)
                      |++|-|+|+.++|.++++|
T Consensus         5 l~vialLC~aLva~vQ~AP   23 (65)
T PF10731_consen    5 LIVIALLCVALVAIVQSAP   23 (65)
T ss_pred             hhHHHHHHHHHHHHHhcCc
Confidence            3445566666666666544


No 10 
>COG5294 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=66.26  E-value=25  Score=26.04  Aligned_cols=14  Identities=21%  Similarity=0.349  Sum_probs=12.0

Q ss_pred             CCCcEEEEEEEcCC
Q psy16510         94 PDGTPITVQWFADE  107 (147)
Q Consensus        94 pdG~~~~V~Y~ADe  107 (147)
                      .+|+.-+|+|+|+.
T Consensus        63 ~~Gkkk~v~f~a~~   76 (113)
T COG5294          63 KNGKKKEVKFTATH   76 (113)
T ss_pred             cCCcEEEEEEEecC
Confidence            47999999999985


No 11 
>PF12930 DUF3836:  Family of unknown function (DUF3836);  InterPro: IPR024339 This entry represents a family of bacterial proteins of unknown function.; PDB: 3MSW_A.
Probab=65.58  E-value=27  Score=26.18  Aligned_cols=20  Identities=20%  Similarity=0.454  Sum_probs=9.6

Q ss_pred             CeeEEEeeCCceeeeeccee
Q psy16510         53 SYRYSYETGNGIAANEQGYL   72 (147)
Q Consensus        53 ~Y~~~Ye~~dG~~~~E~G~~   72 (147)
                      .|.|.|...+-....|.-.+
T Consensus        63 KY~Y~YD~~grl~eK~a~kW   82 (132)
T PF12930_consen   63 KYNYKYDDNGRLTEKEAYKW   82 (132)
T ss_dssp             EEEEEEETTTTEEEEEEEEE
T ss_pred             EEEEEECCCCCEeeeeeeee
Confidence            45566655444444443333


No 12 
>COG4907 Predicted membrane protein [Function unknown]
Probab=64.63  E-value=14  Score=33.72  Aligned_cols=23  Identities=9%  Similarity=0.195  Sum_probs=15.1

Q ss_pred             cceeeEEEEECCCCcEEEEEEEc
Q psy16510         83 QTAQGQFTYTAPDGTPITVQWFA  105 (147)
Q Consensus        83 ~~v~GsYsy~~pdG~~~~V~Y~A  105 (147)
                      +.+.++|||++-+|..+...|.-
T Consensus        51 ~~ieeiysy~~~eG~~y~~~~rd   73 (595)
T COG4907          51 DEIEEIYSYIDAEGYGYITSLRD   73 (595)
T ss_pred             ceeeeeeeeeeccccceEEEEee
Confidence            45677777777777665555543


No 13 
>PHA02698 hypothetical protein; Provisional
Probab=64.48  E-value=4.1  Score=28.40  Aligned_cols=21  Identities=33%  Similarity=0.727  Sum_probs=17.6

Q ss_pred             CCcccccCCCCCCCCCcHHHH
Q psy16510        108 TGFHASGAHLPTPPPIPEAIL  128 (147)
Q Consensus       108 ~Gf~p~g~hlP~~pp~p~~i~  128 (147)
                      .|-|+.|+..|+|-|+|+++-
T Consensus        19 ddvqvnGDlmPtp~p~peeV~   39 (89)
T PHA02698         19 DDVQVNGDLMPTPEPTPEEVP   39 (89)
T ss_pred             hccccCCcccCCCCCChhhhc
Confidence            477889999999999887764


No 14 
>PRK09980 ompL outer membrane porin L; Provisional
Probab=62.40  E-value=40  Score=27.67  Aligned_cols=60  Identities=12%  Similarity=0.072  Sum_probs=33.9

Q ss_pred             CeeEEEeeCCceeeeecceecCCCCCc--ccccceeeEEEE-ECCCCcEEE----EEEEcCCCCccc
Q psy16510         53 SYRYSYETGNGIAANEQGYLKNPGQKD--LEAQTAQGQFTY-TAPDGTPIT----VQWFADETGFHA  112 (147)
Q Consensus        53 ~Y~~~Ye~~dG~~~~E~G~~k~~g~~~--~~~~~v~GsYsy-~~pdG~~~~----V~Y~ADe~Gf~p  112 (147)
                      ....+...+||.-...+..++..+.++  .....+.++|.| ...|.-..+    +....+.++|+|
T Consensus        40 Ri~vsh~f~nG~Gf~~eak~~~~~dd~~~~~~~E~~~sY~~~k~~d~~tl~PG~~~~s~s~~~~Y~P  106 (230)
T PRK09980         40 MLRVGYNFDMGAGIMLTNTYTFQRDDELKHGYNEIEGWYPLFKPTDKLTIQPGGLINDKSIGSGGAV  106 (230)
T ss_pred             EEEEEEEccCceeEEEEEEeccCCCccccccceEEEEEEEeEecCCCEEEecceEEEecCCCceEee
Confidence            677788888888766666664322211  135667788874 555543333    233444455555


No 15 
>PRK15346 outer membrane secretin SsaC; Provisional
Probab=60.16  E-value=8.9  Score=34.49  Aligned_cols=20  Identities=15%  Similarity=0.220  Sum_probs=13.7

Q ss_pred             CcccccchhhHHHHHHHHHHHHHhhcc
Q psy16510          1 MMKIEKDIGALAISFVALFIAVVSAQR   27 (147)
Q Consensus         1 ~m~~~~~~~~~~~~i~~~~la~a~A~~   27 (147)
                      |||.       ++++|+|||+++.+++
T Consensus         1 ~~~~-------~~~~~~~~~~~~~~~~   20 (499)
T PRK15346          1 MKKL-------LILIFLFLLNTAKFAA   20 (499)
T ss_pred             Cchh-------HHHHHHHHHhhhhhhc
Confidence            7885       5677777777765544


No 16 
>PF15240 Pro-rich:  Proline-rich
Probab=58.62  E-value=9.9  Score=30.26  Aligned_cols=18  Identities=28%  Similarity=0.329  Sum_probs=10.6

Q ss_pred             HHHHHHHHHHHHHhhccC
Q psy16510         11 LAISFVALFIAVVSAQRG   28 (147)
Q Consensus        11 ~~~~i~~~~la~a~A~~~   28 (147)
                      |+++|.++|||+.+|...
T Consensus         2 LlVLLSvALLALSSAQ~~   19 (179)
T PF15240_consen    2 LLVLLSVALLALSSAQST   19 (179)
T ss_pred             hhHHHHHHHHHhhhcccc
Confidence            444555666666666543


No 17 
>COG3117 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=58.17  E-value=25  Score=28.21  Aligned_cols=25  Identities=4%  Similarity=-0.125  Sum_probs=16.4

Q ss_pred             CcccccchhhHHHHHHHHHHHHHhhc
Q psy16510          1 MMKIEKDIGALAISFVALFIAVVSAQ   26 (147)
Q Consensus         1 ~m~~~~~~~~~~~~i~~~~la~a~A~   26 (147)
                      +|++ |-+.++++++++|++++....
T Consensus         1 ~~~~-Rw~~~ILll~a~~~~~w~~~~   25 (188)
T COG3117           1 SMSR-RWVYLILLLAALALSGWLLGL   25 (188)
T ss_pred             Ccch-hHHHHHHHHHHHHHHHHhhhc
Confidence            4665 555567777777777776654


No 18 
>PRK13254 cytochrome c-type biogenesis protein CcmE; Reviewed
Probab=56.62  E-value=11  Score=28.81  Aligned_cols=21  Identities=29%  Similarity=0.175  Sum_probs=10.7

Q ss_pred             CcccccchhhHHHHHHHHHHHH
Q psy16510          1 MMKIEKDIGALAISFVALFIAV   22 (147)
Q Consensus         1 ~m~~~~~~~~~~~~i~~~~la~   22 (147)
                      ||+ .+.++.++++++++++++
T Consensus         1 m~~-~~~~rl~~~~~~~~~~~~   21 (148)
T PRK13254          1 MMK-RKRRRLLIILGALAALGL   21 (148)
T ss_pred             CCc-cchhHHHHHHHHHHHHHH
Confidence            676 455565544444444443


No 19 
>PF12729 4HB_MCP_1:  Four helix bundle sensory module for signal transduction;  InterPro: IPR024478 This entry represents a four-helix bundle that operates as a ubiquitous sensory module in prokaryotic signal-transduction, which is known as four-helix bundles methyl-accepting chemotaxis protein (4HB_MCP) domain. The 4HB_MCP is always found between two predicted transmembrane helices indicating that it detects only extracellular signals. In many cases the domain is associated with a cytoplasmic HAMP domain suggesting that most proteins carrying the bundle might share the mechanism of transmembrane signalling which is well-characterised in E coli chemoreceptors [].
Probab=55.33  E-value=10  Score=27.20  Aligned_cols=23  Identities=13%  Similarity=0.015  Sum_probs=15.0

Q ss_pred             cccccchhhHHHHHHHHHHHHHh
Q psy16510          2 MKIEKDIGALAISFVALFIAVVS   24 (147)
Q Consensus         2 m~~~~~~~~~~~~i~~~~la~a~   24 (147)
                      |||.+++...+.+++++++++++
T Consensus         2 l~I~~KL~~~f~~~~~l~~~~~~   24 (181)
T PF12729_consen    2 LSIRTKLILGFGLIILLLLIVGI   24 (181)
T ss_pred             CcHHHHHHHHHHHHHHHHHHHHH
Confidence            78888888666555555555543


No 20 
>KOG1691|consensus
Probab=55.13  E-value=96  Score=25.32  Aligned_cols=31  Identities=26%  Similarity=0.412  Sum_probs=27.2

Q ss_pred             ceeeEEEEECCCCcEEEEEEEcCCCCccccc
Q psy16510         84 TAQGQFTYTAPDGTPITVQWFADETGFHASG  114 (147)
Q Consensus        84 ~v~GsYsy~~pdG~~~~V~Y~ADe~Gf~p~g  114 (147)
                      +..|+|+|+..+-..+.+.++||..|.+|..
T Consensus        80 v~~gqFaFta~e~~~y~~Cf~~~~~~~~p~~  110 (210)
T KOG1691|consen   80 VTKGQFAFTAEESGMYEACFTADVPGHKPET  110 (210)
T ss_pred             cccceEEEEeccCCcEEEEEecccCCCCCCc
Confidence            3499999999998899999999998888843


No 21 
>TIGR03795 chp_BMA0021 conserved hypothetical protein, BMA_0021 family. Members of this protein family are found sparsely, mostly in members of the genus Burkholderia. Members often occur as tandem homologous genes, such as BMA_0021 and BMA_0022 in Burkholderia mallei ATCC 23344. The genes regularly are encoded near the so-called docking protein of TOMM (thiazole/oxazole-modified microcins) biosynthetic clusters, suggesting a role in bacteriocin biosynthesis. The function is unknown.
Probab=54.42  E-value=21  Score=26.46  Aligned_cols=24  Identities=17%  Similarity=0.228  Sum_probs=16.2

Q ss_pred             CCCCCCCCcHHHHHHHH-HHhcCCC
Q psy16510        116 HLPTPPPIPEAILKSLQ-QVQVSPP  139 (147)
Q Consensus       116 hlP~~pp~p~~i~~al~-~~~~~p~  139 (147)
                      .||.+|..-+...-||+ |+.+|+.
T Consensus        88 ~iP~aP~~~~~~a~ALAayn~~~~~  112 (114)
T TIGR03795        88 GIPPAPEHLAEEAIALAAYNDAGPA  112 (114)
T ss_pred             EcCCCCCccchHHHHHHHHhccCcc
Confidence            47877665444555776 9988874


No 22 
>COG3017 LolB Outer membrane lipoprotein involved in outer membrane biogenesis [Cell envelope biogenesis, outer membrane]
Probab=54.21  E-value=40  Score=27.44  Aligned_cols=15  Identities=7%  Similarity=0.049  Sum_probs=8.4

Q ss_pred             HHHHHHHHHHHHhhc
Q psy16510         12 AISFVALFIAVVSAQ   26 (147)
Q Consensus        12 ~~~i~~~~la~a~A~   26 (147)
                      ++..++|+|+.+|+.
T Consensus        10 ~l~~~As~LL~aC~~   24 (206)
T COG3017          10 LLLALASLLLTACTL   24 (206)
T ss_pred             HHHHHHHHHHHhccC
Confidence            335556666666644


No 23 
>PRK10894 lipopolysaccharide transport periplasmic protein LptA; Provisional
Probab=53.68  E-value=24  Score=27.33  Aligned_cols=35  Identities=20%  Similarity=0.060  Sum_probs=14.7

Q ss_pred             HHHHHHHHHHHHHhhccCCCCCCCCCCcceeeeccC
Q psy16510         11 LAISFVALFIAVVSAQRGPPQYAPGQFIPIISYQNE   46 (147)
Q Consensus        11 ~~~~i~~~~la~a~A~~~~~~~~~~~~~~il~~~~~   46 (147)
                      |-++|++||+++++++.+-+ .....++.|-....+
T Consensus         6 ~~~~~~~~ll~~~~~a~A~~-~d~~~pI~I~AD~~~   40 (180)
T PRK10894          6 LNLLLASSLLAASIPAFALT-GDTDQPIHIDSDQQS   40 (180)
T ss_pred             HHHHHHHHHHHHHHHHhhcc-cccCCCEEEEeCceE
Confidence            34444455554433332221 123345665554433


No 24 
>PRK13159 cytochrome c-type biogenesis protein CcmE; Reviewed
Probab=52.04  E-value=13  Score=28.79  Aligned_cols=13  Identities=8%  Similarity=0.406  Sum_probs=8.1

Q ss_pred             CcHHHHHHHHHHh
Q psy16510        123 IPEAILKSLQQVQ  135 (147)
Q Consensus       123 ~p~~i~~al~~~~  135 (147)
                      .|+++.++|....
T Consensus       128 mP~Ev~~al~~~~  140 (155)
T PRK13159        128 MPKELKDAMAEGH  140 (155)
T ss_pred             CCHHHHHHHHhcc
Confidence            4566777776443


No 25 
>PF08138 Sex_peptide:  Sex peptide (SP) family;  InterPro: IPR012608 This family consists of Sex Peptides (SP) that are found in Drosophila. On mating, Drosophila females decreases her remating rate and increases her egg-laying rate due, in part, to the transfer of SP from the male to the female. SP are found in seminal fluids transferred from the male to the female during mating. The male seminal fluid proteins are referred to as accessory gland proteins (Acps). The SP is one of the most interesting Acps and plays an important role in reproduction [].; GO: 0005179 hormone activity, 0046008 regulation of female receptivity, post-mating, 0005576 extracellular region; PDB: 2LAQ_A.
Probab=51.85  E-value=4.8  Score=26.08  Aligned_cols=16  Identities=6%  Similarity=0.374  Sum_probs=0.0

Q ss_pred             hHHHHHHHHHHHHHhh
Q psy16510         10 ALAISFVALFIAVVSA   25 (147)
Q Consensus        10 ~~~~~i~~~~la~a~A   25 (147)
                      ++++++++|++.+|.+
T Consensus         4 p~~llllvlllGla~s   19 (56)
T PF08138_consen    4 PIFLLLLVLLLGLAQS   19 (56)
T ss_dssp             ----------------
T ss_pred             hHHHHHHHHHHHHHhc
Confidence            6777788887777766


No 26 
>PF11153 DUF2931:  Protein of unknown function (DUF2931);  InterPro: IPR021326  Some members in this family of proteins are annotated as lipoproteins however this cannot be confirmed. Currently, there is no known function. 
Probab=51.00  E-value=62  Score=25.57  Aligned_cols=22  Identities=23%  Similarity=0.363  Sum_probs=11.5

Q ss_pred             cCCCCcccccCCCCCCCCCcHHHHHHHHH
Q psy16510        105 ADETGFHASGAHLPTPPPIPEAILKSLQQ  133 (147)
Q Consensus       105 ADe~Gf~p~g~hlP~~pp~p~~i~~al~~  133 (147)
                      +|+.=|+...+       +|+.+.+.+.-
T Consensus       107 ~ekK~Y~~~i~-------ip~~~~~~m~~  128 (216)
T PF11153_consen  107 AEKKFYETKID-------IPEEIWQKMLT  128 (216)
T ss_pred             ccCcEEEEEEE-------cCHHHHHHhcc
Confidence            34555555444       45566665543


No 27 
>PF06316 Ail_Lom:  Enterobacterial Ail/Lom protein;  InterPro: IPR000758 Virulence-related outer membrane proteins are expressed in Gram-negative bacteria and are essential to bacterial survival within macrophages and for eukaryotic cell invasion. Members of this group include: PagC, required by Salmonella typhimurium for survival in macrophages and for virulence in mice [] Rck outer membrane protein of the S. typhimurium virulence plasmid [] Ail, a product of the Yersinia enterocolitica chromosome capable of mediating bacterial adherence to and invasion of epithelial cell lines []  OmpX from Escherichia coli that promotes adhesion to and entry into mammalian cells. It also has a role in the resistance against attack by the human complement system []  a Bacteriophage lambda outer membrane protein, Lom [] The crystal structure of OmpX from E. coli reveals that OmpX consists of an eight-stranded antiparallel all-next-neighbour beta barrel []. The structure shows two girdles of aromatic amino acid residues and a ribbon of nonpolar residues that attach to the membrane interior. The core of the barrel consists of an extended hydrogen-bonding network of highly conserved residues. OmpX thus resembles an inverse micelle. The OmpX structure shows that the membrane-spanning part of the protein is much better conserved than the extracellular loops. Moreover, these loops form a protruding beta sheet, the edge of which presumably binds to external proteins. It is suggested that this type of binding promotes cell adhesion and invasion and helps defend against the complement system. Although OmpX has the same beta-sheet topology as the structurally related outer membrane protein A (OmpA) IPR000498 from INTERPRO, their barrels differ with respect to the shear numbers and internal hydrogen-bonding networks.; GO: 0009279 cell outer membrane
Probab=50.02  E-value=80  Score=25.54  Aligned_cols=27  Identities=7%  Similarity=0.289  Sum_probs=22.0

Q ss_pred             cccceeeEEEEECCCCcEEEEEEEcCC
Q psy16510         81 EAQTAQGQFTYTAPDGTPITVQWFADE  107 (147)
Q Consensus        81 ~~~~v~GsYsy~~pdG~~~~V~Y~ADe  107 (147)
                      .+.+++=.|.|.||=|.+-..+|..++
T Consensus        49 ~GvNvKYRYE~~d~lGvItSfSY~~~~   75 (199)
T PF06316_consen   49 RGVNVKYRYEFDDPLGVITSFSYASGD   75 (199)
T ss_pred             CccEEEEEEecccceeeeEEEEEeecC
Confidence            567888888888888888888887765


No 28 
>PRK13150 cytochrome c-type biogenesis protein CcmE; Reviewed
Probab=49.77  E-value=15  Score=28.60  Aligned_cols=17  Identities=24%  Similarity=0.442  Sum_probs=12.5

Q ss_pred             CcHHHHHHHHHHhcCCC
Q psy16510        123 IPEAILKSLQQVQVSPP  139 (147)
Q Consensus       123 ~p~~i~~al~~~~~~p~  139 (147)
                      .|+++..+|....+.|.
T Consensus       135 mPpEv~~al~~~~~~~~  151 (159)
T PRK13150        135 TPPEVEKAMQENHRRPQ  151 (159)
T ss_pred             CCHHHHHHHHHhCCCCc
Confidence            67889999986666554


No 29 
>COG3026 RseB Negative regulator of sigma E activity [Signal transduction mechanisms]
Probab=49.39  E-value=24  Score=30.36  Aligned_cols=22  Identities=23%  Similarity=0.403  Sum_probs=15.4

Q ss_pred             cccceeeEEEEECCCCcEEEEE
Q psy16510         81 EAQTAQGQFTYTAPDGTPITVQ  102 (147)
Q Consensus        81 ~~~~v~GsYsy~~pdG~~~~V~  102 (147)
                      +.+..-+.-||+.|.=+.++++
T Consensus        82 Evvrr~n~vsyf~pg~~~ft~~  103 (320)
T COG3026          82 EVVRRGNEVSYFEPGLEPFTLN  103 (320)
T ss_pred             eeeeeCCEEEeecCCcceeeec
Confidence            6666777788888876666543


No 30 
>PRK15401 alpha-ketoglutarate-dependent dioxygenase AlkB; Provisional
Probab=47.86  E-value=44  Score=27.09  Aligned_cols=49  Identities=24%  Similarity=0.484  Sum_probs=33.7

Q ss_pred             eEEEEECCCCcEEEE--------EEEcCCCCcccccCC--CCCC-CCCcHHHHHHHHHHh
Q psy16510         87 GQFTYTAPDGTPITV--------QWFADETGFHASGAH--LPTP-PPIPEAILKSLQQVQ  135 (147)
Q Consensus        87 GsYsy~~pdG~~~~V--------~Y~ADe~Gf~p~g~h--lP~~-pp~p~~i~~al~~~~  135 (147)
                      .-+...-|.|..-+|        -|.+|+.||+=.+.+  -+.| ||+|+.+++....++
T Consensus        46 p~~~~~~~gg~~msv~mt~~G~~~W~~d~~~YrYs~~~~~~~~pwp~~P~~l~~L~~~~~  105 (213)
T PRK15401         46 PFRHMVTPGGYTMSVAMTNCGALGWVTDRRGYRYSPIDPLTGKPWPAMPASFLALAQRAA  105 (213)
T ss_pred             CccceecCCCCcceeEEeccccceEecCCCCcccCCcCCCCCCCCCCchHHHHHHHHHHH
Confidence            345567777765544        789999999998764  3444 568877777666554


No 31 
>PF02402 Lysis_col:  Lysis protein;  InterPro: IPR003059 The DNA sequence of the entire colicin E2 operon has been determined []. The operon comprises the colicin activity gene (ceaB), the colicin immunity gene (ceiB) and the lysis gene (celB), which is essential for colicin release from producing cells []. A putative LexA binding site is located upstream from ceaB, and a rho-independent terminator structure is located downstream from celB []. Comparison of the amino acid sequences of colicin E2 and cloacin DF13 reveal extensive similarity. These colicins have different modes of action and recognise different cell surface receptors; the two major regions of heterology at the C terminus, and in the C-terminal end of the central region are thought to correspond to the catalytic and receptor-recognition domains, respectively [].  Sequence similarities between colicins E2, A and E1 [] are less striking. The colicin E2 (pyocin) immunity protein does not share similarity with either the colicin E3 or cloacin DF13 [] immunity proteins. By contrast, the lysis proteins of the ColE2, ColE1 and CloDF13 plasmids are almost identical except in the N-terminal regions, which themselves are similar to lipoprotein signal peptides []. Processing of the ColE2 prolysis protein to the mature form is prevented by globomycin, a specific inhibitor of the lipoprotein signal peptidase []. The mature ColE2 lysis protein is located in the cell envelope [].; GO: 0009405 pathogenesis, 0019835 cytolysis, 0019867 outer membrane
Probab=47.83  E-value=13  Score=23.26  Aligned_cols=17  Identities=18%  Similarity=0.329  Sum_probs=9.3

Q ss_pred             chhhHHHHHHHHHHHHH
Q psy16510          7 DIGALAISFVALFIAVV   23 (147)
Q Consensus         7 ~~~~~~~~i~~~~la~a   23 (147)
                      +|.+++++++..+|+.+
T Consensus         3 Ki~~~~i~~~~~~L~aC   19 (46)
T PF02402_consen    3 KIIFIGIFLLTMLLAAC   19 (46)
T ss_pred             EEEEeHHHHHHHHHHHh
Confidence            56666666665444433


No 32 
>COG4939 Major membrane immunogen, membrane-anchored lipoprotein [Function unknown]
Probab=44.83  E-value=1.1e+02  Score=23.33  Aligned_cols=44  Identities=16%  Similarity=0.347  Sum_probs=24.9

Q ss_pred             CCCCeeEEEeeCCceeeeecceecCCCCCcccccceeeEEEEECCCCcE
Q psy16510         50 GDGSYRYSYETGNGIAANEQGYLKNPGQKDLEAQTAQGQFTYTAPDGTP   98 (147)
Q Consensus        50 ~dG~Y~~~Ye~~dG~~~~E~G~~k~~g~~~~~~~~v~GsYsy~~pdG~~   98 (147)
                      +||.|+-.|+.-|...  -.+.+ .+- -+ .+-.|.=.|-|.+.+|+.
T Consensus        30 ~dGtY~~~y~~fDd~g--wk~f~-~it-i~-dGKiv~~~ydy~~k~G~~   73 (147)
T COG4939          30 NDGTYQGHYESFDDHG--WKAFV-TIT-IQ-DGKIVACTYDYRDKKGNI   73 (147)
T ss_pred             cCCceeeeeccccccC--ccceE-EEE-Ee-CCEEEEEEeeeecCCCCc
Confidence            5889999998776221  01111 000 00 344567778888887765


No 33 
>PRK10523 lipoprotein involved with copper homeostasis and adhesion; Provisional
Probab=44.48  E-value=1.8e+02  Score=24.01  Aligned_cols=12  Identities=17%  Similarity=0.174  Sum_probs=6.2

Q ss_pred             cCCCCCCCCeeEE
Q psy16510         45 NEPNQGDGSYRYS   57 (147)
Q Consensus        45 ~~~~~~dG~Y~~~   57 (147)
                      -..+ .||.|...
T Consensus        61 LtL~-~DgTY~L~   72 (234)
T PRK10523         61 LFLE-KDGTWVMN   72 (234)
T ss_pred             EEEc-CCCCEEEE
Confidence            3445 56655543


No 34 
>PRK10755 sensor protein BasS/PmrB; Provisional
Probab=43.79  E-value=19  Score=29.56  Aligned_cols=18  Identities=22%  Similarity=0.392  Sum_probs=9.4

Q ss_pred             CcccccchhhHHHHHHHHH
Q psy16510          1 MMKIEKDIGALAISFVALF   19 (147)
Q Consensus         1 ~m~~~~~~~~~~~~i~~~~   19 (147)
                      |||+ +++.++++++++|+
T Consensus         1 ~~~~-~~~~~~~~~~~~~~   18 (356)
T PRK10755          1 MISL-RRRLLLTIGLILLV   18 (356)
T ss_pred             CchH-HHHHHHHHHHHHHH
Confidence            7887 45554444444433


No 35 
>PF03562 MltA:  MltA specific insert domain;  InterPro: IPR005300 This group of proteins includes MltA; a membrane-bound, murein degrading transglycosylase enzyme which plays an important role in the controlled growth of the stress-bearing sacculus of Escherichia coli [, ].; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds; PDB: 2PI8_D 2AE0_X 2PIC_A 2GAE_A 2PJJ_A 3CZB_A 2G6G_A 2PNW_A 2G5D_A.
Probab=42.79  E-value=26  Score=27.19  Aligned_cols=57  Identities=16%  Similarity=0.251  Sum_probs=35.8

Q ss_pred             cccceeeEEEEECCCCcEEEEEEEcC-CCCcccccC------CCCCCCCCcHHHHHHHHHHhcCCCC
Q psy16510         81 EAQTAQGQFTYTAPDGTPITVQWFAD-ETGFHASGA------HLPTPPPIPEAILKSLQQVQVSPPG  140 (147)
Q Consensus        81 ~~~~v~GsYsy~~pdG~~~~V~Y~AD-e~Gf~p~g~------hlP~~pp~p~~i~~al~~~~~~p~~  140 (147)
                      ..+.|+||=.-.-|||....|.|-+. -.-|+..|.      .|+..   --..+..-+|+++||.+
T Consensus        80 fflqIQGSGrl~l~DG~~~rvgYAg~NG~pY~sIGr~Li~~G~i~~~---~~Smq~Ir~wl~~~P~~  143 (158)
T PF03562_consen   80 FFLQIQGSGRLRLPDGSTVRVGYAGQNGHPYTSIGRLLIDRGEIPRE---QMSMQAIRAWLRAHPEE  143 (158)
T ss_dssp             HHHHHHSEEEEE-TTSSEEEEEEEEE-SS----HHHHHHHTTSS-TT---S-SHHHHHHHHHHTGGG
T ss_pred             HHeeecCcEEEEcCCCCEEEEEEcccCCCccccHHHHHHHcCCcChh---hCCHHHHHHHHHHCHHH
Confidence            44789999999999999999999654 234666544      35432   12345556799999864


No 36 
>PF13511 DUF4124:  Domain of unknown function (DUF4124)
Probab=42.38  E-value=29  Score=21.62  Aligned_cols=17  Identities=12%  Similarity=0.405  Sum_probs=11.1

Q ss_pred             eEEEEECCCCcEEEEEEEcC
Q psy16510         87 GQFTYTAPDGTPITVQWFAD  106 (147)
Q Consensus        87 GsYsy~~pdG~~~~V~Y~AD  106 (147)
                      .-|+|+|.+|+   |.|+-.
T Consensus        14 ~vYk~~D~~G~---v~ysd~   30 (60)
T PF13511_consen   14 EVYKWVDENGV---VHYSDT   30 (60)
T ss_pred             cEEEEECCCCC---EEECcc
Confidence            56788888885   355544


No 37 
>COG2332 CcmE Cytochrome c-type biogenesis protein CcmE [Posttranslational modification, protein turnover, chaperones]
Probab=42.20  E-value=1.2e+02  Score=23.60  Aligned_cols=14  Identities=21%  Similarity=0.446  Sum_probs=9.0

Q ss_pred             CcHHHHHHHHHHhc
Q psy16510        123 IPEAILKSLQQVQV  136 (147)
Q Consensus       123 ~p~~i~~al~~~~~  136 (147)
                      +|+++.+++.+.-.
T Consensus       129 ~P~ev~~~mk~~~~  142 (153)
T COG2332         129 TPPEVAKAMKKNGV  142 (153)
T ss_pred             CCHHHHHHhhhcCC
Confidence            56777777765443


No 38 
>PF00879 Defensin_propep:  Defensin propeptide The pattern for this Prosite entry doesn't match the propeptide.;  InterPro: IPR002366 Defensins are 2-6 kDa, cationic, microbicidal peptides active against many Gram-negative and Gram-positive bacteria, fungi, and enveloped viruses [], containing three pairs of intramolecular disulphide bonds []. On the basis of their size and pattern of disulphide bonding, mammalian defensins are classified into alpha, beta and theta categories. Alpha-defensins, which have been identified in humans, monkeys and several rodent species, are particularly abundant in neutrophils, certain macrophage populations and Paneth cells of the small intestine. Every mammalian species explored thus far has beta-defensins. In cows, as many as 13 beta-defensins exist in neutrophils. However, in other species, beta-defensins are more often produced by epithelial cells lining various organs (e.g. the epidermis, bronchial tree and genitourinary tract). Theta-defensins are cyclic and have so far only been identified in primate phagocytes.   Defensins are produced constitutively and/or in response to microbial products or proinflammatory cytokines. Some defensins are also called corticostatins (CS) because they inhibit corticotropin-stimulated corticosteroid production. The mechanism(s) by which microorganisms are killed and/or inactivated by defensins is not understood completely. However, it is generally believed that killing is a consequence of disruption of the microbial membrane. The polar topology of defensins, with spatially separated charged and hydrophobic regions, allows them to insert themselves into the phospholipid membranes so that their hydrophobic regions are buried within the lipid membrane interior and their charged (mostly cationic) regions interact with anionic phospholipid head groups and water. Subsequently, some defensins can aggregate to form `channel-like' pores; others might bind to and cover the microbial membrane in a `carpet-like' manner. The net outcome is the disruption of membrane integrity and function, which ultimately leads to the lysis of microorganisms. Some defensins are synthesized as propeptides which may be relevant to this process - in neutrophils only the mature peptides have been identified but in Paneth cells, the propeptide is stored in vesicles [] and appears to be cleaved by trypsin on activation.  ; GO: 0006952 defense response
Probab=42.16  E-value=66  Score=20.62  Aligned_cols=12  Identities=8%  Similarity=0.260  Sum_probs=5.2

Q ss_pred             HHHHHHHHHHhh
Q psy16510         14 SFVALFIAVVSA   25 (147)
Q Consensus        14 ~i~~~~la~a~A   25 (147)
                      .|++++|++|+-
T Consensus         5 ~LLaAlLLlAlq   16 (52)
T PF00879_consen    5 ALLAALLLLALQ   16 (52)
T ss_pred             HHHHHHHHHHHH
Confidence            344444444443


No 39 
>smart00817 Amelin Ameloblastin precursor (Amelin). This family consists of several mammalian Ameloblastin precursor (Amelin) proteins. Matrix proteins of tooth enamel consist mainly of amelogenin but also of non-amelogenin proteins, which, although their volumetric percentage is low, have an important role in enamel mineralisation. One of the non-amelogenin proteins is ameloblastin, also known as amelin and sheathlin. Ameloblastin (AMBN) is one of the enamel sheath proteins which is though to have a role in determining the prismatic structure of growing enamel crystals.
Probab=40.99  E-value=25  Score=31.01  Aligned_cols=18  Identities=17%  Similarity=0.215  Sum_probs=14.8

Q ss_pred             HHHHHHHHHHHhhccCCC
Q psy16510         13 ISFVALFIAVVSAQRGPP   30 (147)
Q Consensus        13 ~~i~~~~la~a~A~~~~~   30 (147)
                      ++|++|||..++|.+..+
T Consensus         4 ~iL~~cLl~~afA~Pl~P   21 (411)
T smart00817        4 LALVLCLLKTSFAVPAFP   21 (411)
T ss_pred             hhHHHHHHhhhhccccCC
Confidence            388899999999987665


No 40 
>PF13623 SurA_N_2:  SurA N-terminal domain
Probab=40.88  E-value=22  Score=26.92  Aligned_cols=20  Identities=30%  Similarity=0.444  Sum_probs=10.9

Q ss_pred             ccccchhhHHHHHHHHHHHHH
Q psy16510          3 KIEKDIGALAISFVALFIAVV   23 (147)
Q Consensus         3 ~~~~~~~~~~~~i~~~~la~a   23 (147)
                      || |+++.++++|++++|.+.
T Consensus         3 kI-R~r~~lLi~vIglAL~aF   22 (145)
T PF13623_consen    3 KI-RQRGGLLIIVIGLALFAF   22 (145)
T ss_pred             hH-hhcchHHHHHHHHHHHHH
Confidence            67 566765555544444443


No 41 
>KOG3866|consensus
Probab=39.55  E-value=29  Score=30.40  Aligned_cols=26  Identities=31%  Similarity=0.321  Sum_probs=18.4

Q ss_pred             cccccchhhHHHHHHHHHHHHHhhcc
Q psy16510          2 MKIEKDIGALAISFVALFIAVVSAQR   27 (147)
Q Consensus         2 m~~~~~~~~~~~~i~~~~la~a~A~~   27 (147)
                      |..++-|-++.+++++|++-++||.+
T Consensus         1 ~~~~~~~l~l~~llvllll~av~avP   26 (442)
T KOG3866|consen    1 MRWRTMILPLVILLVLLLLDAVCAVP   26 (442)
T ss_pred             CcchhhHHHHHHHHHHHHHhhhcCCc
Confidence            44555566788788888888777754


No 42 
>PF10929 DUF2811:  Protein of unknown function (DUF2811);  InterPro: IPR021231  This is a bacterial family of uncharacterised proteins. 
Probab=36.92  E-value=32  Score=22.45  Aligned_cols=20  Identities=20%  Similarity=0.438  Sum_probs=17.1

Q ss_pred             CCcHHHHHHHH-HHhcCCCCC
Q psy16510        122 PIPEAILKSLQ-QVQVSPPGP  141 (147)
Q Consensus       122 p~p~~i~~al~-~~~~~p~~~  141 (147)
                      .||+.+.+++. ||.+||.=+
T Consensus         5 eiPe~L~~~m~~fie~hP~WD   25 (57)
T PF10929_consen    5 EIPEDLHQAMKDFIETHPNWD   25 (57)
T ss_pred             cccHHHHHHHHHHHHcCCCch
Confidence            47899999998 999999643


No 43 
>TIGR01643 YD_repeat_2x YD repeat (two copies). This model describes two tandem copies of a 21-residue extracellular repeat found in Gram-negative, Gram-positive, and animal proteins. The repeat is named for a YD dipeptide, the most strongly conserved motif of the repeat. These repeats appear in general to be involved in binding carbohydrate; the chicken teneurin-1 YD-repeat region has been shown to bind heparin.
Probab=33.60  E-value=71  Score=18.15  Aligned_cols=20  Identities=30%  Similarity=0.375  Sum_probs=12.1

Q ss_pred             EEECCCCcEEEEEEEcCCCCcc
Q psy16510         90 TYTAPDGTPITVQWFADETGFH  111 (147)
Q Consensus        90 sy~~pdG~~~~V~Y~ADe~Gf~  111 (147)
                      +.++|+|...+..|  |..|-.
T Consensus         9 ~~~~p~G~~~~~~Y--D~~Grl   28 (42)
T TIGR01643         9 GSTDADGTTTRYTY--DAAGRL   28 (42)
T ss_pred             EEECCCCCEEEEEE--CCCCCE
Confidence            35677777666666  555543


No 44 
>PRK13165 cytochrome c-type biogenesis protein CcmE; Reviewed
Probab=33.44  E-value=42  Score=26.21  Aligned_cols=16  Identities=13%  Similarity=0.403  Sum_probs=10.7

Q ss_pred             CcHHHHHHHHHHhcCC
Q psy16510        123 IPEAILKSLQQVQVSP  138 (147)
Q Consensus       123 ~p~~i~~al~~~~~~p  138 (147)
                      .|+++..+|......|
T Consensus       135 mPpEv~~al~~~~~~~  150 (160)
T PRK13165        135 TPPEVEEAMKKNHRRP  150 (160)
T ss_pred             CCHHHHHHHHhccCCC
Confidence            5677888887554444


No 45 
>TIGR00219 mreC rod shape-determining protein MreC. MreC (murein formation C) is involved in the rod shape determination in E. coli, and more generally in cell shape determination of bacteria whether or not they are rod-shaped. Cells defective in MreC are round. Species with MreC include many of the Proteobacteria, Gram-positives, and spirochetes.
Probab=33.34  E-value=37  Score=28.42  Aligned_cols=19  Identities=21%  Similarity=0.188  Sum_probs=10.9

Q ss_pred             cccccchhhHHHHHHHHHH
Q psy16510          2 MKIEKDIGALAISFVALFI   20 (147)
Q Consensus         2 m~~~~~~~~~~~~i~~~~l   20 (147)
                      ||+.+...+++.+++++++
T Consensus         1 ~~f~~~~~~~~~~~~~~~~   19 (283)
T TIGR00219         1 MKFLVKPKLFIRLLLALIV   19 (283)
T ss_pred             CCCCcCCchHHHHHHHHHH
Confidence            8886666655444444444


No 46 
>PLN03207 stomagen; Provisional
Probab=33.24  E-value=55  Score=23.91  Aligned_cols=26  Identities=15%  Similarity=0.093  Sum_probs=12.2

Q ss_pred             CcccccchhhHHHHHHHHHHHHHhhc
Q psy16510          1 MMKIEKDIGALAISFVALFIAVVSAQ   26 (147)
Q Consensus         1 ~m~~~~~~~~~~~~i~~~~la~a~A~   26 (147)
                      ||-+.+.--+||++++.++|+.++.+
T Consensus         5 ~~~~tt~~~~lffLl~~llla~~v~q   30 (113)
T PLN03207          5 CMTATTRCLTLFFLLFFLLLGAYVIQ   30 (113)
T ss_pred             cccccchhHHHHHHHHHHHHHHHHHh
Confidence            44443333455555554454444444


No 47 
>PRK05461 apaG CO2+/MG2+ efflux protein ApaG; Reviewed
Probab=33.05  E-value=54  Score=24.45  Aligned_cols=21  Identities=29%  Similarity=0.487  Sum_probs=17.0

Q ss_pred             cccceeeEEEEECCCCcEEEE
Q psy16510         81 EAQTAQGQFTYTAPDGTPITV  101 (147)
Q Consensus        81 ~~~~v~GsYsy~~pdG~~~~V  101 (147)
                      ..+.++|+|.+++++|....|
T Consensus        93 p~G~M~G~y~~~~~~G~~F~v  113 (127)
T PRK05461         93 PSGTMQGHYQMVDEDGERFEV  113 (127)
T ss_pred             CCEEEEEEEEEEeCCCCEEEE
Confidence            457899999999999986554


No 48 
>cd07252 BphC1-RGP6_N_like N-terminal domain of 2,3-dihydroxybiphenyl 1,2-dioxygenase (BphC, EC 1.13.11.39) 1 from Rhodococcus globerulus P6 (BphC1-RGP6) and similar proteins. This subfamily contains the N-terminal, non-catalytic, domain of BphC1-RGP6 and similar proteins. BphC catalyzes the extradiol ring cleavage reaction of 2,3-dihydroxybiphenyl, the third step in the polychlorinated biphenyls (PCBs) degradation pathway (bph pathway). This subfamily of BphCs belongs to the type I extradiol dioxygenase family, which require a metal in the active site in its catalytic mechanism. Polychlorinated biphenyl degrading bacteria demonstrate a multiplicity of 2,3-dihydroxybiphenyl 1,2-dioxygenases. For example, three types of BphC enzymes have been found in Rhodococcus globerulus (BphC1-RGP6 - BphC3-RGP6), all three enzymes are type I extradiol dioxygenases. BphC1-RGP6 has an internal duplication, it is a two-domain dioxygenase which forms octamers, and has Fe(II) at the catalytic site. Its N-
Probab=32.96  E-value=60  Score=22.28  Aligned_cols=21  Identities=29%  Similarity=0.635  Sum_probs=17.0

Q ss_pred             eEEEEECCCCcEEEEEEEcCC
Q psy16510         87 GQFTYTAPDGTPITVQWFADE  107 (147)
Q Consensus        87 GsYsy~~pdG~~~~V~Y~ADe  107 (147)
                      .++.+.||||..+++.|..+|
T Consensus       100 ~~~~~~DPdG~~iE~~~~~~~  120 (120)
T cd07252         100 GLIRFADPDGNRHELFWGPKE  120 (120)
T ss_pred             EEEEEECCCCCEEEEEecccC
Confidence            368899999999998876543


No 49 
>PF09244 DUF1964:  Domain of unknown function (DUF1964);  InterPro: IPR015325 This domain is C-terminal to the catalytic sucrose phosphorylase beta/alpha barrel domain. It adopts a beta-sandwich fold, with Greek-key topology and is functionally uncharacterised []. ; PDB: 1R7A_B 2GDU_A 2GDV_A.
Probab=32.69  E-value=52  Score=22.04  Aligned_cols=22  Identities=36%  Similarity=0.741  Sum_probs=16.0

Q ss_pred             eeEEEEECCCCcEEEEEEEcCC
Q psy16510         86 QGQFTYTAPDGTPITVQWFADE  107 (147)
Q Consensus        86 ~GsYsy~~pdG~~~~V~Y~ADe  107 (147)
                      -|.|||-..++..++.+|++.+
T Consensus         3 dG~FSy~~dgdtSitf~W~g~~   24 (68)
T PF09244_consen    3 DGEFSYEADGDTSITFTWTGAT   24 (68)
T ss_dssp             GSEEEEEEETTTEEEEEEE-SS
T ss_pred             cceeeEecCCCcEEEEEEeccc
Confidence            3778888777778888887764


No 50 
>COG3137 Putative salt-induced outer membrane protein [Cell envelope biogenesis, outer membrane]
Probab=31.64  E-value=68  Score=26.83  Aligned_cols=15  Identities=27%  Similarity=0.421  Sum_probs=8.4

Q ss_pred             CCCeeEEEeeCCcee
Q psy16510         51 DGSYRYSYETGNGIA   65 (147)
Q Consensus        51 dG~Y~~~Ye~~dG~~   65 (147)
                      +|+-.++|...-|-.
T Consensus        51 ~gsve~Gy~a~sGNT   65 (262)
T COG3137          51 EGSVEAGYLAQSGNT   65 (262)
T ss_pred             ccccceeeeeccCCc
Confidence            456666666554443


No 51 
>PRK14818 NADH dehydrogenase subunit B; Provisional
Probab=31.60  E-value=38  Score=26.80  Aligned_cols=24  Identities=33%  Similarity=0.562  Sum_probs=18.6

Q ss_pred             cccCCCCCCCCCcHHHHHHHHHHh
Q psy16510        112 ASGAHLPTPPPIPEAILKSLQQVQ  135 (147)
Q Consensus       112 p~g~hlP~~pp~p~~i~~al~~~~  135 (147)
                      ++.-.+|-=||.|++|+++|..+.
T Consensus       129 pVDvyIPGCPP~PeaIl~gil~L~  152 (173)
T PRK14818        129 PVDVYVPGCPPRPEALTEGLLRLQ  152 (173)
T ss_pred             CCcEEccCCCCCHHHHHHHHHHHH
Confidence            344478888999999999886554


No 52 
>PF05581 MCP_N:  Vibrio chemotaxis protein N terminus;  InterPro: IPR008762 Methyl-accepting chemotaxis proteins (MCPs) are a family of bacterial receptors that mediate chemotaxis to diverse signals, responding to changes in the concentration of attractants and repellents in the environment by altering swimming behaviour []. Environmental diversity gives rise to diversity in bacterial signalling receptors, and consequently there are many genes encoding MCPs []. For example, there are four well-characterised MCPs found in Escherichia coli: Tar (taxis towards aspartate and maltose, away from nickel and cobalt), Tsr (taxis towards serine, away from leucine, indole and weak acids), Trg (taxis towards galactose and ribose) and Tap (taxis towards dipeptides).  MCPs share similar topology and signalling mechanisms. MCPs either bind ligands directly or interact with ligand-binding proteins, transducing the signal to downstream signalling proteins in the cytoplasm. MCPs undergo two covalent modifications: deamidation and reversible methylation at a number of glutamate residues. Attractants increase the level of methylation, while repellents decrease it. The methyl groups are added by the methyl-transferase cheR and are removed by the methylesterase cheB. Most MCPs are homodimers that contain the following organisation: an N-terminal signal sequence that acts as a transmembrane domain in the mature protein; a poorly-conserved periplasmic receptor (ligand-binding) domain; a second transmembrane domain; and a highly-conserved C-terminal cytoplasmic domain that interacts with downstream signalling components. The C-terminal domain contains the glycosylated glutamate residues.  This entry represents the N-terminal domain found in chemotaxis methyl-accepting proteins primarily from from Vibrio species.; PDB: 3C8C_A.
Probab=31.56  E-value=16  Score=26.56  Aligned_cols=24  Identities=17%  Similarity=0.228  Sum_probs=0.0

Q ss_pred             cccccchhhHHHHHHHHHHHHHhh
Q psy16510          2 MKIEKDIGALAISFVALFIAVVSA   25 (147)
Q Consensus         2 m~~~~~~~~~~~~i~~~~la~a~A   25 (147)
                      ||+.|+|...-.+|+++.+++.+.
T Consensus         1 MKFshKIv~AsS~lLlvt~slls~   24 (103)
T PF05581_consen    1 MKFSHKIVLASSLLLLVTVSLLSL   24 (103)
T ss_dssp             ------------------------
T ss_pred             CchhHHHHHHHHHHHHHHHHHHHH
Confidence            999999995554555555555443


No 53 
>cd07240 ED_TypeI_classII_N N-terminal domain of type I, class II extradiol dioxygenases; non-catalytic domain. This family contains the N-terminal, non-catalytic, domain of type I, class II extradiol dioxygenases. Dioxygenases catalyze the incorporation of both atoms of molecular oxygen into substrates using a variety of reaction mechanisms, resulting in the cleavage of aromatic rings. Two major groups of dioxygenases have been identified according to the cleavage site; extradiol enzymes cleave the aromatic ring between a hydroxylated carbon and an adjacent non-hydroxylated carbon, whereas intradiol enzymes cleave the aromatic ring between two hydroxyl groups. Extradiol dioxygenases are classified into type I and type II enzymes. Type I extradiol dioxygenases include class I and class II enzymes. These two classes of enzymes show sequence similarity; the two-domain class II enzymes evolved from a class I enzyme through gene duplication. The extradiol dioxygenases represented in this fa
Probab=31.48  E-value=73  Score=21.13  Aligned_cols=21  Identities=24%  Similarity=0.392  Sum_probs=16.3

Q ss_pred             eEEEEECCCCcEEEEEEEcCC
Q psy16510         87 GQFTYTAPDGTPITVQWFADE  107 (147)
Q Consensus        87 GsYsy~~pdG~~~~V~Y~ADe  107 (147)
                      -++.+.||||..+++.+..|+
T Consensus        97 ~~~~~~DP~G~~ie~~~~~~~  117 (117)
T cd07240          97 RGLRFQDPDGHLLELFVEADK  117 (117)
T ss_pred             eEEEEECCCCCEEEEEEccCC
Confidence            355689999999998877553


No 54 
>PF07437 YfaZ:  YfaZ precursor;  InterPro: IPR009998 This family contains the precursor of the bacterial protein YfaZ (approximately 180 residues long). Many members of this family are hypothetical proteins.
Probab=31.40  E-value=1.9e+02  Score=22.64  Aligned_cols=13  Identities=31%  Similarity=0.498  Sum_probs=6.4

Q ss_pred             cceeeEEEEECCC
Q psy16510         83 QTAQGQFTYTAPD   95 (147)
Q Consensus        83 ~~v~GsYsy~~pd   95 (147)
                      +.+-|...|+++|
T Consensus        77 ~~vG~K~~~~~~~   89 (180)
T PF07437_consen   77 LGVGGKALYLNPD   89 (180)
T ss_pred             EEEeEEEEEeccc
Confidence            3444555555553


No 55 
>TIGR03352 VI_chp_3 type VI secretion lipoprotein, VC_A0113 family. Work by Mougous, et al. (2006), describes IAHP-related loci as a type VI secretion system (PubMed:16763151). This protein family is associated with type VI secretion loci, although not treated explicitly by Mougous, et al.
Probab=30.88  E-value=62  Score=24.44  Aligned_cols=13  Identities=8%  Similarity=0.146  Sum_probs=6.2

Q ss_pred             HHHHHHHHHHHhh
Q psy16510         13 ISFVALFIAVVSA   25 (147)
Q Consensus        13 ~~i~~~~la~a~A   25 (147)
                      +++++|+++.+|+
T Consensus         5 ~~l~~~llL~gC~   17 (146)
T TIGR03352         5 VLLAACLLLAGCS   17 (146)
T ss_pred             HHHHHHHHHhhcc
Confidence            3444555544444


No 56 
>COG2967 ApaG Uncharacterized protein affecting Mg2+/Co2+ transport [Inorganic ion transport and metabolism]
Probab=30.24  E-value=1.4e+02  Score=22.49  Aligned_cols=21  Identities=29%  Similarity=0.493  Sum_probs=18.1

Q ss_pred             cccceeeEEEEECCCCcEEEE
Q psy16510         81 EAQTAQGQFTYTAPDGTPITV  101 (147)
Q Consensus        81 ~~~~v~GsYsy~~pdG~~~~V  101 (147)
                      ..+..+|.|..++.+|...+|
T Consensus        92 p~G~M~GhY~M~~e~G~~F~v  112 (126)
T COG2967          92 PSGTMQGHYEMIDEDGETFDV  112 (126)
T ss_pred             CcceEEEEEEEecCCCcEEEe
Confidence            457899999999999998765


No 57 
>COG1934 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=29.90  E-value=71  Score=25.14  Aligned_cols=22  Identities=27%  Similarity=0.303  Sum_probs=12.9

Q ss_pred             CCCcceeeeccCCCCCCCCeeE
Q psy16510         35 GQFIPIISYQNEPNQGDGSYRY   56 (147)
Q Consensus        35 ~~~~~il~~~~~~~~~dG~Y~~   56 (147)
                      ..++.|.....+.++.++...|
T Consensus        30 ~qPi~i~ad~~el~~~~~~a~f   51 (173)
T COG1934          30 DQPITIEADQQELDDKNGVATF   51 (173)
T ss_pred             cCCEEEEccceeeeccCCEEEE
Confidence            5677777666554434555554


No 58 
>KOG4824|consensus
Probab=29.44  E-value=2.9e+02  Score=22.65  Aligned_cols=22  Identities=14%  Similarity=0.306  Sum_probs=15.5

Q ss_pred             ccceeeEEEEECCCCcEEEEEE
Q psy16510         82 AQTAQGQFTYTAPDGTPITVQW  103 (147)
Q Consensus        82 ~~~v~GsYsy~~pdG~~~~V~Y  103 (147)
                      ++..+|.|++-.+||++...+-
T Consensus        65 ~gctt~~y~~~nkngkI~Vln~   86 (224)
T KOG4824|consen   65 GGCTTGAYTFDNKNGKIHVLNE   86 (224)
T ss_pred             CCceeeeeEecCCCceEEEeee
Confidence            3467788888888887666543


No 59 
>PF03100 CcmE:  CcmE;  InterPro: IPR004329 CcmE is the product of one of a cluster of Ccm genes that are necessary for cytochrome c biosynthesis in eubacteria. Expression of these proteins is induced when the organisms are grown under anaerobic conditions with nitrate or nitrite as the final electron acceptor.; GO: 0017003 protein-heme linkage, 0017004 cytochrome complex assembly, 0005886 plasma membrane; PDB: 1SR3_A 2KCT_A 1J6Q_A 1LM0_A.
Probab=28.74  E-value=2.4e+02  Score=20.69  Aligned_cols=7  Identities=43%  Similarity=0.325  Sum_probs=0.0

Q ss_pred             cccccch
Q psy16510          2 MKIEKDI    8 (147)
Q Consensus         2 m~~~~~~    8 (147)
                      ||-++.+
T Consensus         1 ~~~~~~r    7 (131)
T PF03100_consen    1 MKRRKKR    7 (131)
T ss_dssp             -------
T ss_pred             CCcceee
Confidence            6643333


No 60 
>PF05899 Cupin_3:  Protein of unknown function (DUF861);  InterPro: IPR008579 The function of the proteins in this entry are unknown. They contain the conserved barrel domain of the 'cupin' superfamily and members are specific to plants and bacteria.; PDB: 1RC6_A 3MYX_A 1O5U_A 2K9Z_A 1LKN_A 3ES4_A 1SFN_B 3BCW_A.
Probab=28.60  E-value=1.3e+02  Score=19.69  Aligned_cols=30  Identities=27%  Similarity=0.285  Sum_probs=22.1

Q ss_pred             cccceeeEEEEECCCCcEEEEEEEcCCCCccc
Q psy16510         81 EAQTAQGQFTYTAPDGTPITVQWFADETGFHA  112 (147)
Q Consensus        81 ~~~~v~GsYsy~~pdG~~~~V~Y~ADe~Gf~p  112 (147)
                      -..++.|+...++.+|..  +++.|++-=|-|
T Consensus        28 ~~~vleG~v~it~~~G~~--~~~~aGD~~~~p   57 (74)
T PF05899_consen   28 FFYVLEGEVTITDEDGET--VTFKAGDAFFLP   57 (74)
T ss_dssp             EEEEEEEEEEEEETTTEE--EEEETTEEEEE-
T ss_pred             EEEEEEeEEEEEECCCCE--EEEcCCcEEEEC
Confidence            457899999999999986  567777643333


No 61 
>PF05538 Campylo_MOMP:  Campylobacter major outer membrane protein;  InterPro: IPR008439 Campylobacter are Gram-negative, spiral, microaerophilic bacteria. Campylobacter jejuni is one of the main causative agents of food poisoning in the developed world. This family consists of Campylobacter major outer membrane proteins. The major outer membrane protein (MOMP), a putative porin and a multifunction surface protein of Campylobacter jejuni, may play an important role in the adaptation of the organism to various host environments [].
Probab=28.40  E-value=1.9e+02  Score=26.14  Aligned_cols=15  Identities=27%  Similarity=0.490  Sum_probs=8.8

Q ss_pred             CCCCCCCCeeEEEeeC
Q psy16510         46 EPNQGDGSYRYSYETG   61 (147)
Q Consensus        46 ~~~~~dG~Y~~~Ye~~   61 (147)
                      +++ -.|.-.|+|+..
T Consensus        31 dVd-vSG~~RYRY~~~   45 (431)
T PF05538_consen   31 DVD-VSGFARYRYTDD   45 (431)
T ss_pred             ccc-cceEEEEEecCC
Confidence            344 556666777654


No 62 
>COG0450 AhpC Peroxiredoxin [Posttranslational modification, protein turnover, chaperones]
Probab=28.23  E-value=1e+02  Score=24.81  Aligned_cols=48  Identities=21%  Similarity=0.216  Sum_probs=34.0

Q ss_pred             cccceeeEEEEECCCCcEEEEEEEcCCCCcccccCCCCCCCCCcHHHHHHHHHHhcCC
Q psy16510         81 EAQTAQGQFTYTAPDGTPITVQWFADETGFHASGAHLPTPPPIPEAILKSLQQVQVSP  138 (147)
Q Consensus        81 ~~~~v~GsYsy~~pdG~~~~V~Y~ADe~Gf~p~g~hlP~~pp~p~~i~~al~~~~~~p  138 (147)
                      ++..+||.| -+||||..+.+.+.....|=.+. +        .-.+++||++.++|.
T Consensus       120 ~g~a~R~~F-IIDp~g~ir~~~v~~~~iGRn~d-E--------ilR~idAlq~~~~hg  167 (194)
T COG0450         120 EGLALRGTF-IIDPDGVIRHILVNPLTIGRNVD-E--------ILRVIDALQFVAKHG  167 (194)
T ss_pred             CCcceeEEE-EECCCCeEEEEEEecCCCCcCHH-H--------HHHHHHHHHHHHHhC
Confidence            456899999 69999999999888776443221 0        124556888888874


No 63 
>PHA02978 hypothetical protein; Provisional
Probab=27.81  E-value=58  Score=24.23  Aligned_cols=16  Identities=44%  Similarity=0.764  Sum_probs=12.6

Q ss_pred             eEEEEECCCCcEEEEE
Q psy16510         87 GQFTYTAPDGTPITVQ  102 (147)
Q Consensus        87 GsYsy~~pdG~~~~V~  102 (147)
                      =.+||.+|||-.++|+
T Consensus        75 iy~sy~~~~gisiqvs   90 (135)
T PHA02978         75 IYFSYADPDGISIQVS   90 (135)
T ss_pred             EEEEecCCCceEEEEe
Confidence            3566899999988875


No 64 
>PRK00315 potassium-transporting ATPase subunit C; Reviewed
Probab=27.05  E-value=42  Score=26.97  Aligned_cols=34  Identities=15%  Similarity=0.326  Sum_probs=18.9

Q ss_pred             CCCCcccccC----CCCCCCCCcHHHHHHHHH-HhcCCC
Q psy16510        106 DETGFHASGA----HLPTPPPIPEAILKSLQQ-VQVSPP  139 (147)
Q Consensus       106 De~Gf~p~g~----hlP~~pp~p~~i~~al~~-~~~~p~  139 (147)
                      +++||.+.++    .-|+-|-.-+.|.+.++. .+++|.
T Consensus        74 ~~~~y~~~~SggSNl~psnp~l~~~v~~r~~~~~~~~~~  112 (193)
T PRK00315         74 APMPYNPQASGGSNLAPSNPALDDAIKARVAALRAANPG  112 (193)
T ss_pred             CCCCCCcccccccCCCCCCHHHHHHHHHHHHHHHHhCCC
Confidence            3557877543    345554455556666664 455653


No 65 
>KOG4063|consensus
Probab=26.99  E-value=60  Score=25.35  Aligned_cols=17  Identities=18%  Similarity=0.311  Sum_probs=10.4

Q ss_pred             HHHHHHHHHHHHHhhcc
Q psy16510         11 LAISFVALFIAVVSAQR   27 (147)
Q Consensus        11 ~~~~i~~~~la~a~A~~   27 (147)
                      +.+++|+++|+.++|++
T Consensus         6 ~~~v~l~alls~a~aq~   22 (158)
T KOG4063|consen    6 LKTVILLALLSLAAAQA   22 (158)
T ss_pred             HHHHHHHHHHHHhhhcc
Confidence            45566666666666553


No 66 
>PTZ00459 mucin-associated surface protein (MASP); Provisional
Probab=26.60  E-value=43  Score=28.26  Aligned_cols=9  Identities=11%  Similarity=0.124  Sum_probs=3.7

Q ss_pred             HHHHHHHHh
Q psy16510         16 VALFIAVVS   24 (147)
Q Consensus        16 ~~~~la~a~   24 (147)
                      |+|+|+|.+
T Consensus        11 LVCALCVLW   19 (291)
T PTZ00459         11 LVCALCVLW   19 (291)
T ss_pred             HHHHHHHHh
Confidence            344444433


No 67 
>PF04710 Pellino:  Pellino;  InterPro: IPR006800 Pellino is involved in Toll-like signalling pathways, and associates with the kinase domain of the Pelle Ser/Thr kinase [, , ].; PDB: 3EGB_B 3EGA_A.
Probab=26.41  E-value=1e+02  Score=27.63  Aligned_cols=53  Identities=17%  Similarity=0.321  Sum_probs=32.7

Q ss_pred             eCCceeeeecceecCCCCCcccccceeeEEEEECCCCcEEEEEEEcCCC--Cccc
Q psy16510         60 TGNGIAANEQGYLKNPGQKDLEAQTAQGQFTYTAPDGTPITVQWFADET--GFHA  112 (147)
Q Consensus        60 ~~dG~~~~E~G~~k~~g~~~~~~~~v~GsYsy~~pdG~~~~V~Y~ADe~--Gf~p  112 (147)
                      ..||++....=.++++.....-...-+.|-||+..-.+.+.|.|+.|++  =||+
T Consensus        46 ~aNGVkps~~h~v~tp~~ska~~~~~~Hsisytlsr~~~VvVEY~~D~~tDMFQI  100 (416)
T PF04710_consen   46 KANGVKPSTVHIVSTPQASKAISNKSQHSISYTLSRNQTVVVEYTHDPDTDMFQI  100 (416)
T ss_dssp             S--EEEEEEEEEEE--TT----EETTCCEEEEE-SSSEEEEEEEEEETTEEEEEE
T ss_pred             cCCCcCccceEEecCCCccceecccCCceEEEEecCCceeeeeeecCCCcchhhh
Confidence            4477776665544433222112234588999999999999999999974  6777


No 68 
>PRK13999 potassium-transporting ATPase subunit C; Provisional
Probab=26.33  E-value=44  Score=27.07  Aligned_cols=32  Identities=13%  Similarity=0.319  Sum_probs=16.7

Q ss_pred             CCCcccccC---C-CCCCCCCcHHHHHHHH-HHhcCC
Q psy16510        107 ETGFHASGA---H-LPTPPPIPEAILKSLQ-QVQVSP  138 (147)
Q Consensus       107 e~Gf~p~g~---h-lP~~pp~p~~i~~al~-~~~~~p  138 (147)
                      +.||.+.++   + -|+-|-.-..|.+.++ +.+++|
T Consensus        86 ~~~y~~~~SGgSNlgpsnp~L~~~v~~r~~~~~~~~~  122 (201)
T PRK13999         86 PAPYNAANSMGSNLGPTSKALADRVKEDVDALKAENP  122 (201)
T ss_pred             CCCCCcccccccCCCCCCHHHHHHHHHHHHHHHHhCC
Confidence            446777543   3 4444444455555555 455565


No 69 
>PF12984 DUF3868:  Domain of unknown function, B. Theta Gene description (DUF3868);  InterPro: IPR024480 This domain of unknown function is found in a number of bacterial proteins. The function of the proteins is not known, but the Bacteroides thetaiotaomicron gene appears to be upregulated in the presence of host or other bacterial species compared to pure culture [, ].
Probab=26.12  E-value=39  Score=24.53  Aligned_cols=11  Identities=0%  Similarity=-0.040  Sum_probs=4.8

Q ss_pred             hHHHHHHHHHH
Q psy16510         10 ALAISFVALFI   20 (147)
Q Consensus        10 ~~~~~i~~~~l   20 (147)
                      +++++|++|++
T Consensus         5 ~i~~~Ll~~~~   15 (115)
T PF12984_consen    5 FILFFLLLCSL   15 (115)
T ss_pred             EHHHHHHHHhh
Confidence            44444444444


No 70 
>PF05423 Mycobact_memb:  Mycobacterium membrane protein;  InterPro: IPR008693 This family contains several membrane proteins from Mycobacterium species [].
Probab=25.56  E-value=1.4e+02  Score=22.53  Aligned_cols=30  Identities=27%  Similarity=0.617  Sum_probs=17.5

Q ss_pred             EEEEECCCCcEEEEEEEcCCCC--cccccCCCC
Q psy16510         88 QFTYTAPDGTPITVQWFADETG--FHASGAHLP  118 (147)
Q Consensus        88 sYsy~~pdG~~~~V~Y~ADe~G--f~p~g~hlP  118 (147)
                      .|.-.++.|...+|+|. |++|  -+.....||
T Consensus        54 ~YEV~G~~G~~~~I~Y~-D~~~~~~~~~~v~LP   85 (140)
T PF05423_consen   54 TYEVTGPPGSTATISYL-DADGQPQQVDNVSLP   85 (140)
T ss_pred             EEEEEcCCCCeEEEEEE-cCCCceEeecCcCCC
Confidence            56666677777778886 4443  222344566


No 71 
>PF13124 DUF3963:  Protein of unknown function (DUF3963)
Probab=25.56  E-value=69  Score=19.19  Aligned_cols=13  Identities=15%  Similarity=0.158  Sum_probs=5.9

Q ss_pred             cchhhHHHHHHHH
Q psy16510          6 KDIGALAISFVAL   18 (147)
Q Consensus         6 ~~~~~~~~~i~~~   18 (147)
                      |+|++.+-+++++
T Consensus        21 rnit~cfal~vv~   33 (40)
T PF13124_consen   21 RNITFCFALLVVV   33 (40)
T ss_pred             HHHHHHHHHHHHH
Confidence            4555544333333


No 72 
>PF00659 POLO_box:  POLO box duplicated region;  InterPro: IPR000959 A subgroup of serine/threonine protein kinases, Polo or Polo-like kinases play multiple roles during the cell cycle. Polo kinases are required at several key points through mitosis, starting from control of the G2/M transition through phosphorylation of Cdc25C and mitotic cyclins. Polo kinases are characterised by an amino terminal catalytic domain, and a carboxy terminal non-catalytic domain consisting of three blocks of conserved sequences known as polo boxes which form one single functional domain []. The domain is named after its founding member encoded by the polo gene of Drosophila melanogaster []. This domain of around 70 amino acids has been found in species ranging from yeast to mammals. Polo boxes appear to mediate interaction with multiple proteins through protein:protein interactions; some but not all of these proteins are substrates for the kinase domain of the molecule [].  The crystal structure of the polo domain of the murine protein, Sak, is dimeric, consisting of two alpha-helices and two six-stranded beta-sheets []. The topology of one polypeptide subunit of the dimer consists of, from its N- to C terminus, an extended strand segment, five beta-strands, one alpha-helix (A) and a C-terminal beta-strand. Beta-strands from one subunit form a contiguous antiparallel beta-sheet with beta-strands from the second subunit. The two beta-sheets pack with a crossing angle of 110 degrees, orienting the hydrophobic surfaces inward and the hydrophilic surfaces outward. Helix A, which is colinear with beta-strand 6 of the same polypeptide, buries a large portion of the non-overlapping hydrophobic beta-sheet surfaces. Interactions involving helices A comprise a majority of the hydrophobic core structure and also the dimer interface. Point mutations in the Polo box of the budding yeast Cdc5 protein abolish the ability of overexpressed Cdc5 to interact with the spindle poles and to organise cytokinetic structures [].; GO: 0005515 protein binding; PDB: 1MBY_B 3P37_A 3MHN_A 1Q4K_A 3HIK_A 3Q1I_A 3P35_A 3MHQ_A 1UMW_B 3MQ8_B ....
Probab=25.54  E-value=1.9e+02  Score=18.43  Aligned_cols=64  Identities=25%  Similarity=0.341  Sum_probs=31.7

Q ss_pred             CeeEEEeeCCceeeeecceecCCCCCccccc-ceeeEEEEECCCCcEEEEEEEcCCCCcccccCCCCCCCCCcHHHHHHH
Q psy16510         53 SYRYSYETGNGIAANEQGYLKNPGQKDLEAQ-TAQGQFTYTAPDGTPITVQWFADETGFHASGAHLPTPPPIPEAILKSL  131 (147)
Q Consensus        53 ~Y~~~Ye~~dG~~~~E~G~~k~~g~~~~~~~-~v~GsYsy~~pdG~~~~V~Y~ADe~Gf~p~g~hlP~~pp~p~~i~~al  131 (147)
                      .|.+.|..+||+..--   + +.++.  -.+ .-.....|++++|+..+  |..++         +...  .|+++.+-|
T Consensus         4 k~gi~~~LSng~vqv~---F-nD~tk--ivl~~~~~~v~yi~~~~~~~~--~~~~~---------~~~~--~p~~l~~kl   64 (68)
T PF00659_consen    4 KYGIGYQLSNGTVQVN---F-NDHTK--IVLSPDGRLVTYIDRDGERQT--YSLSS---------LLED--FPEDLKKKL   64 (68)
T ss_dssp             CSEEEEEETTSEEEEE---E-TTS-E--EEEETTCCEEEEE-TTS-EEE--EECTC---------HHHH----HHHHHHH
T ss_pred             ceEEEEEEeCCCEEEE---E-eCCCE--EEECCCCCEEEEECCCCcEEE--EEccc---------cccC--CCHHHHHHh
Confidence            4778999999986322   2 22221  001 11247788888776443  43332         0000  367777777


Q ss_pred             HHHh
Q psy16510        132 QQVQ  135 (147)
Q Consensus       132 ~~~~  135 (147)
                      .+++
T Consensus        65 ~~~k   68 (68)
T PF00659_consen   65 TYLK   68 (68)
T ss_dssp             HHHH
T ss_pred             hccC
Confidence            7653


No 73 
>PF11912 DUF3430:  Protein of unknown function (DUF3430);  InterPro: IPR021837  This family of proteins are functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 209 to 265 amino acids in length. 
Probab=25.41  E-value=57  Score=25.17  Aligned_cols=8  Identities=38%  Similarity=0.671  Sum_probs=4.4

Q ss_pred             CeeEEEee
Q psy16510         53 SYRYSYET   60 (147)
Q Consensus        53 ~Y~~~Ye~   60 (147)
                      .+.++|..
T Consensus        40 ~~G~Gys~   47 (212)
T PF11912_consen   40 SYGIGYSI   47 (212)
T ss_pred             CcEEEEEe
Confidence            45666554


No 74 
>TIGR01655 yxeA_fam conserved hypothetical protein TIGR01655. This model represents a family of small (about 115 amino acids) uncharacterized proteins with N-terminal signal sequences, found exclusively in Gram-positive organisms. Most genomes that have any members of this family have at least two members.
Probab=24.97  E-value=2.7e+02  Score=20.04  Aligned_cols=14  Identities=21%  Similarity=0.411  Sum_probs=8.6

Q ss_pred             CCCCcEEEEEEEcC
Q psy16510         93 APDGTPITVQWFAD  106 (147)
Q Consensus        93 ~pdG~~~~V~Y~AD  106 (147)
                      |.+|+..++++.|.
T Consensus        63 d~~G~~k~i~f~~~   76 (114)
T TIGR01655        63 DSSGKKHKVKFMAG   76 (114)
T ss_pred             CCCCCEEEEEEEcC
Confidence            34666666666655


No 75 
>PRK09558 ushA bifunctional UDP-sugar hydrolase/5'-nucleotidase periplasmic precursor; Reviewed
Probab=24.83  E-value=57  Score=29.47  Aligned_cols=10  Identities=50%  Similarity=0.494  Sum_probs=7.6

Q ss_pred             Ccccccchhh
Q psy16510          1 MMKIEKDIGA   10 (147)
Q Consensus         1 ~m~~~~~~~~   10 (147)
                      |||+++.|..
T Consensus         1 ~~~~~~~~~~   10 (551)
T PRK09558          1 MMKFLKRLVA   10 (551)
T ss_pred             CcchHHHHHH
Confidence            9999777653


No 76 
>PRK11627 hypothetical protein; Provisional
Probab=24.33  E-value=68  Score=25.50  Aligned_cols=9  Identities=22%  Similarity=0.678  Sum_probs=6.2

Q ss_pred             ceeeEEEEE
Q psy16510         84 TAQGQFTYT   92 (147)
Q Consensus        84 ~v~GsYsy~   92 (147)
                      +-+|.|+|.
T Consensus       114 V~~g~~~~~  122 (192)
T PRK11627        114 VSEGNVRYN  122 (192)
T ss_pred             cccceeeec
Confidence            347888776


No 77 
>PRK14864 putative biofilm stress and motility protein A; Provisional
Probab=23.79  E-value=85  Score=22.72  Aligned_cols=8  Identities=0%  Similarity=-0.031  Sum_probs=3.5

Q ss_pred             Ccccccch
Q psy16510          1 MMKIEKDI    8 (147)
Q Consensus         1 ~m~~~~~~    8 (147)
                      ||++-|.+
T Consensus         1 ~~~~mk~~    8 (104)
T PRK14864          1 RNMVMRRF    8 (104)
T ss_pred             CchHHHHH
Confidence            35553333


No 78 
>cd08344 MhqB_like_N N-terminal domain of MhqB, a type I extradiol dioxygenase, and similar proteins. This subfamily contains the N-terminal, non-catalytic, domain of Burkholderia sp. NF100 MhqB and similar proteins. MhqB is a type I extradiol dioxygenase involved in the catabolism of methylhydroquinone, an intermediate in the degradation of fenitrothion. The purified enzyme has shown extradiol ring cleavage activity toward 3-methylcatechol. Fe2+ was suggested as a cofactor, the same as most other enzymes in the family. Burkholderia sp. NF100 MhqB is encoded on the plasmid pNF1. The type I family of extradiol dioxygenases contains two structurally homologous barrel-shaped domains at the N- and C-terminal. The active-site metal is located in the C-terminal barrel and plays an essential role in the catalytic mechanism.
Probab=23.62  E-value=95  Score=20.91  Aligned_cols=16  Identities=25%  Similarity=0.468  Sum_probs=13.7

Q ss_pred             EEEEECCCCcEEEEEE
Q psy16510         88 QFTYTAPDGTPITVQW  103 (147)
Q Consensus        88 sYsy~~pdG~~~~V~Y  103 (147)
                      ++.+.||||..+++.+
T Consensus        93 ~~~~~DP~Gn~iel~~  108 (112)
T cd08344          93 GVWFRDPDGNLLQVKV  108 (112)
T ss_pred             EEEEECCCCCEEEEec
Confidence            4779999999999875


No 79 
>COG2834 LolA Outer membrane lipoprotein-sorting protein [Cell envelope biogenesis, outer membrane]
Probab=23.04  E-value=1.9e+02  Score=22.68  Aligned_cols=30  Identities=23%  Similarity=0.520  Sum_probs=16.4

Q ss_pred             cceeeEEEEE-CCCCcEEEEEEEcCCCCccc
Q psy16510         83 QTAQGQFTYT-APDGTPITVQWFADETGFHA  112 (147)
Q Consensus        83 ~~v~GsYsy~-~pdG~~~~V~Y~ADe~Gf~p  112 (147)
                      ++..|..-|+ +|+...+++++..+..+.-|
T Consensus        83 ivsdG~~v~iydp~~~q~~~~~~~~~~~~tp  113 (211)
T COG2834          83 IVSDGKTVWIYDPDLEQVTKTWLSEATGNTP  113 (211)
T ss_pred             EEECCCEEEEECCCCceEEEEecCCCCCCCc
Confidence            4444544443 45666666666666554444


No 80 
>PF07263 DMP1:  Dentin matrix protein 1 (DMP1);  InterPro: IPR009889 This family consists of several mammalian dentin matrix protein 1 (DMP1) sequences. The dentin matrix acidic phosphoprotein 1 (DMP1) gene has been mapped to human chromosome 4q21 []. DMP1 is a bone and teeth specific protein initially identified from mineralised dentin. DMP1 is primarily localised in the nuclear compartment of undifferentiated osteoblasts. In the nucleus, DMP1 acts as a transcriptional component for activation of osteoblast-specific genes like osteocalcin. During the early phase of osteoblast maturation, Ca2+ surges into the nucleus from the cytoplasm, triggering the phosphorylation of DMP1 by a nuclear isoform of casein kinase II. This phosphorylated DMP1 is then exported out into the extracellular matrix, where it regulates nucleation of hydroxyapatite. DMP1 is a unique molecule that initiates osteoblast differentiation by transcription in the nucleus and orchestrates mineralised matrix formation extracellularly, at later stages of osteoblast maturation []. The DMP1 gene has been found to be ectopically expressed in lung cancer although the reason for this is unknown [].; GO: 0001503 ossification, 0030198 extracellular matrix organization
Probab=22.94  E-value=38  Score=30.95  Aligned_cols=13  Identities=0%  Similarity=-0.171  Sum_probs=7.4

Q ss_pred             HHHHHHHHHHhhc
Q psy16510         14 SFVALFIAVVSAQ   26 (147)
Q Consensus        14 ~i~~~~la~a~A~   26 (147)
                      ||||+.|.||+.+
T Consensus         7 l~~lwgls~alpv   19 (514)
T PF07263_consen    7 LMFLWGLSCALPV   19 (514)
T ss_pred             HHHHHHhhcccch
Confidence            4556666666544


No 81 
>PF07584 BatA:  Aerotolerance regulator N-terminal;  InterPro: IPR024163 The Batl operon appears to be important in pathogenicity and aerotolerance. BatA ensures bacterial survival in the early stages of the infection process, when the infected sites are aerobic, and is produced under conditions of oxidative stress []. Proteins produced by the Batl operon share a highly-conserved sequence at their N terminus and the full length proteins carry multiple membrane-spanning domains []. This entry represents the conserved N-terminal domain, which is also found in some uncharacterised proteins.
Probab=22.66  E-value=84  Score=20.76  Aligned_cols=16  Identities=13%  Similarity=0.065  Sum_probs=6.3

Q ss_pred             hHHHHHHHHHHHHHhh
Q psy16510         10 ALAISFVALFIAVVSA   25 (147)
Q Consensus        10 ~~~~~i~~~~la~a~A   25 (147)
                      .++.++.++++.+|.|
T Consensus        60 l~L~lLal~lli~AlA   75 (77)
T PF07584_consen   60 LLLRLLALALLILALA   75 (77)
T ss_pred             HHHHHHHHHHHHHHHc
Confidence            3333333444444443


No 82 
>PRK09455 rseB anti-sigma E factor; Provisional
Probab=22.66  E-value=1.4e+02  Score=25.46  Aligned_cols=20  Identities=25%  Similarity=0.398  Sum_probs=11.2

Q ss_pred             cccceeeEEEEECCCCcEEE
Q psy16510         81 EAQTAQGQFTYTAPDGTPIT  100 (147)
Q Consensus        81 ~~~~v~GsYsy~~pdG~~~~  100 (147)
                      +.+.....-+|..|+.+.++
T Consensus        83 EviR~~d~V~~~~p~~~~~~  102 (319)
T PRK09455         83 EIIQRGNEISYFEPGLEPFT  102 (319)
T ss_pred             EEEEECCEEEEEecCCceEE
Confidence            55555555666666655543


No 83 
>TIGR01200 Porphyromonas gingivalis paralogous family TIGR01200. No identifiable homologs outside Porphyromonas gingivalis.
Probab=22.65  E-value=96  Score=26.54  Aligned_cols=11  Identities=9%  Similarity=0.368  Sum_probs=6.5

Q ss_pred             HHHHHHHHHHH
Q psy16510         13 ISFVALFIAVV   23 (147)
Q Consensus        13 ~~i~~~~la~a   23 (147)
                      ++|+.|+|...
T Consensus         7 ~~~~~~~~~~~   17 (292)
T TIGR01200         7 LYILICFLLSI   17 (292)
T ss_pred             HHHHHHHHHhh
Confidence            36666766543


No 84 
>PHA02823 chemokine binding protein; Provisional
Probab=22.59  E-value=71  Score=26.41  Aligned_cols=13  Identities=15%  Similarity=0.110  Sum_probs=5.7

Q ss_pred             HHHHHHHHHHhhc
Q psy16510         14 SFVALFIAVVSAQ   26 (147)
Q Consensus        14 ~i~~~~la~a~A~   26 (147)
                      ++++|..+.|.|+
T Consensus         5 ~v~~~~cl~a~~~   17 (255)
T PHA02823          5 IVLACMCLAATAA   17 (255)
T ss_pred             hhHHHHHHHHHhc
Confidence            4444444444444


No 85 
>PF10902 DUF2693:  Protein of unknown function (DUF2693);  InterPro: IPR024401 This family of proteins is found in bacteria and bacteriophages. Its function is unknown. 
Probab=21.72  E-value=2.5e+02  Score=19.52  Aligned_cols=23  Identities=17%  Similarity=0.332  Sum_probs=19.3

Q ss_pred             CCeeEEEeeCCceeeeecceecC
Q psy16510         52 GSYRYSYETGNGIAANEQGYLKN   74 (147)
Q Consensus        52 G~Y~~~Ye~~dG~~~~E~G~~k~   74 (147)
                      |.-.|-|+-.||+.|.-.|.++.
T Consensus         1 G~h~~~feK~DGtiR~a~gT~d~   23 (83)
T PF10902_consen    1 GEHEFVFEKVDGTIRVAKGTRDP   23 (83)
T ss_pred             CcEEEEEEecCCCEEEEEEecCc
Confidence            45578899999999999999764


No 86 
>PF03866 HAP:  Hydrophobic abundant protein (HAP)        ;  InterPro: IPR005566  Expression of Hydrophobic Abundant protein is thought to be developmentally regulated and possibly involved in spherule cell wall formation []. 
Probab=21.56  E-value=71  Score=24.61  Aligned_cols=17  Identities=35%  Similarity=0.413  Sum_probs=12.3

Q ss_pred             CcccccchhhHHHHHHHHHHHHHhh
Q psy16510          1 MMKIEKDIGALAISFVALFIAVVSA   25 (147)
Q Consensus         1 ~m~~~~~~~~~~~~i~~~~la~a~A   25 (147)
                      |||-        +|+.+|+.++++-
T Consensus         1 mmky--------vfvalc~~avval   17 (164)
T PF03866_consen    1 MMKY--------VFVALCLFAVVAL   17 (164)
T ss_pred             Cchh--------HHHHHHHHHHHHH
Confidence            8995        4777888877643


No 87 
>COG4594 FecB ABC-type Fe3+-citrate transport system, periplasmic component [Inorganic ion transport and metabolism]
Probab=21.55  E-value=1.8e+02  Score=24.94  Aligned_cols=27  Identities=11%  Similarity=0.267  Sum_probs=15.8

Q ss_pred             eeeecceecCCCCCcccccceeeEEEEECC
Q psy16510         65 AANEQGYLKNPGQKDLEAQTAQGQFTYTAP   94 (147)
Q Consensus        65 ~~~E~G~~k~~g~~~~~~~~v~GsYsy~~p   94 (147)
                      ...|.|.++-.+++.   =+|.=.|||.|.
T Consensus        36 Vkde~Gt~tv~k~PK---RVVVLE~SFaDa   62 (310)
T COG4594          36 VKDELGTFTVPKTPK---RVVVLELSFADA   62 (310)
T ss_pred             eeccCCceecCCCCc---eEEEEEecHHHH
Confidence            445666666555443   345567888763


No 88 
>COG3354 FlaG Putative archaeal flagellar protein G [Cell motility and secretion]
Probab=21.27  E-value=1.4e+02  Score=23.21  Aligned_cols=27  Identities=26%  Similarity=0.384  Sum_probs=14.7

Q ss_pred             CCcceeeeccCC--CCCCCCeeEEEeeCC
Q psy16510         36 QFIPIISYQNEP--NQGDGSYRYSYETGN   62 (147)
Q Consensus        36 ~~~~il~~~~~~--~~~dG~Y~~~Ye~~d   62 (147)
                      .+..|+......  +..+|.|.|.|-..|
T Consensus        50 ~dFaIIndPg~i~~~~~~g~~t~t~yiKN   78 (154)
T COG3354          50 TDFAIINDPGQIPYVGTDGPYTYTFYIKN   78 (154)
T ss_pred             ccEEEecCCCCCccccCCCceEEEEEEec
Confidence            356666554332  213477777776554


No 89 
>TIGR01965 VCBS_repeat VCBS repeat. This domain of about 100 residues is found multiple (up to 35) copies in long proteins from several species of Vibrio, Colwellia, Bradyrhizobium, and Shewanella (hence the name VCBS) and in smaller copy numbers in proteins from several other bacteria. The large protein size and repeat copy numbers, species distribution, and suggested activities of several member proteins suggests a role for this domain in adhesion.
Probab=21.18  E-value=1.6e+02  Score=21.10  Aligned_cols=20  Identities=45%  Similarity=0.561  Sum_probs=14.8

Q ss_pred             cceeeEEEEECCCCcEEEEE
Q psy16510         83 QTAQGQFTYTAPDGTPITVQ  102 (147)
Q Consensus        83 ~~v~GsYsy~~pdG~~~~V~  102 (147)
                      -.+.=+|.|...||...+|+
T Consensus        55 e~~tdsFtvtv~DGtt~~vt   74 (99)
T TIGR01965        55 ETLTDTFTVTSADGTSQTVT   74 (99)
T ss_pred             CEEEEEEEEEEeCCCeEEEE
Confidence            45667899999999766654


No 90 
>PRK00178 tolB translocation protein TolB; Provisional
Probab=20.54  E-value=84  Score=26.78  Aligned_cols=18  Identities=22%  Similarity=0.089  Sum_probs=11.1

Q ss_pred             hHHHHHHHHHHHHHhhcc
Q psy16510         10 ALAISFVALFIAVVSAQR   27 (147)
Q Consensus        10 ~~~~~i~~~~la~a~A~~   27 (147)
                      ++++++++||++.+++++
T Consensus         5 ~~~~~~~~~~~~~~~~~~   22 (430)
T PRK00178          5 KRIALFVLCLLAGGAAAA   22 (430)
T ss_pred             hHHHHHHHHHHhhHhhcc
Confidence            455666777776666543


No 91 
>COG3422 Uncharacterized conserved protein [Function unknown]
Probab=20.25  E-value=87  Score=20.59  Aligned_cols=22  Identities=32%  Similarity=0.721  Sum_probs=15.8

Q ss_pred             CCCCeeEEEeeCCcee-eeecce
Q psy16510         50 GDGSYRYSYETGNGIA-ANEQGY   71 (147)
Q Consensus        50 ~dG~Y~~~Ye~~dG~~-~~E~G~   71 (147)
                      .+|.|.|.+.-+||.. .+.+|+
T Consensus        11 k~Ge~rfrlkA~N~eiI~~sEgY   33 (59)
T COG3422          11 KAGEYRFRLKAANGEIILTSEGY   33 (59)
T ss_pred             CCCcEEEEEEccCccEEEeeccc
Confidence            7899999999988753 233444


No 92 
>cd09013 BphC-JF8_N_like N-terminal, non-catalytic, domain of BphC_JF8, (2,3-dihydroxybiphenyl 1,2-dioxygenase) from Bacillus sp. JF8 and similar proteins. 2,3-dihydroxybiphenyl 1,2-dioxygenase (BphC) catalyzes the extradiol ring cleavage reaction of 2,3-dihydroxybiphenyl, a key step in the polychlorinated biphenyls (PCBs) degradation pathway (bph pathway). BphC belongs to the type I extradiol dioxygenase family, which requires a metal ion in the active site in its catalytic mechanism. Polychlorinated biphenyl degrading bacteria demonstrate a multiplicity of BphCs. This subfamily of BphC is represented by the enzyme purified from the thermophilic biphenyl and naphthalene degrader, Bacillus sp. JF8. The members in this family of BphC enzymes may use either Mn(II) or Fe(II) as cofactors. The enzyme purified from Bacillus sp. JF8 is Mn(II)-dependent, however, the enzyme from Rhodococcus jostii RHAI has Fe(II) bound to it. BphC_JF8 is thermostable and its optimum activity is at 85 degrees C
Probab=20.06  E-value=1.3e+02  Score=20.33  Aligned_cols=16  Identities=31%  Similarity=1.115  Sum_probs=13.8

Q ss_pred             EEEEECCCCcEEEEEE
Q psy16510         88 QFTYTAPDGTPITVQW  103 (147)
Q Consensus        88 sYsy~~pdG~~~~V~Y  103 (147)
                      ++.+.||||..+++.|
T Consensus       102 ~~~~~DPdG~~iEl~~  117 (121)
T cd09013         102 AYRFRSPDGHPMELYW  117 (121)
T ss_pred             eEEEECCCCCEEEEEE
Confidence            5679999999998876


Done!