Query psy2711
Match_columns 114
No_of_seqs 131 out of 1036
Neff 6.3
Searched_HMMs 46136
Date Fri Aug 16 23:16:51 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy2711.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/2711hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF00379 Chitin_bind_4: Insect 99.7 5.1E-18 1.1E-22 102.6 7.1 47 41-88 1-52 (52)
2 COG5294 Uncharacterized protei 93.3 0.47 1E-05 33.0 6.4 45 39-85 28-76 (113)
3 PF13956 Ibs_toxin: Toxin Ibs, 91.9 0.098 2.1E-06 25.2 1.1 12 1-12 1-12 (19)
4 PF06004 DUF903: Bacterial pro 86.8 0.58 1.3E-05 28.1 2.0 37 44-80 3-40 (50)
5 TIGR01655 yxeA_fam conserved h 86.0 5 0.00011 27.5 6.7 16 70-85 62-77 (114)
6 PF13511 DUF4124: Domain of un 84.2 2.8 6E-05 24.9 4.3 11 65-75 14-24 (60)
7 PRK05461 apaG CO2+/MG2+ efflux 82.5 10 0.00022 26.7 7.1 20 60-79 94-113 (127)
8 PF06286 Coleoptericin: Coleop 81.0 0.44 9.5E-06 34.0 -0.2 15 1-15 1-15 (143)
9 COG2967 ApaG Uncharacterized p 75.6 20 0.00043 25.4 6.7 21 60-80 93-113 (126)
10 PF05404 TRAP-delta: Transloco 71.0 24 0.00052 26.2 6.6 15 1-15 1-16 (167)
11 PF15240 Pro-rich: Proline-ric 68.4 4.7 0.0001 30.2 2.4 17 1-17 1-17 (179)
12 PF07437 YfaZ: YfaZ precursor; 67.5 32 0.0007 25.5 6.8 10 1-10 1-10 (180)
13 PF11153 DUF2931: Protein of u 67.2 12 0.00026 28.0 4.5 16 1-16 1-18 (216)
14 PF12930 DUF3836: Family of un 66.3 24 0.00051 25.0 5.6 43 40-83 63-110 (132)
15 PRK09408 ompX outer membrane p 66.2 44 0.00095 24.5 7.2 12 62-73 61-72 (171)
16 KOG4088|consensus 61.8 47 0.001 24.3 6.4 17 1-17 1-18 (167)
17 COG4939 Major membrane immunog 61.1 53 0.0012 23.6 6.5 42 36-77 29-74 (147)
18 PF13473 Cupredoxin_1: Cupredo 59.6 19 0.00042 23.5 4.0 42 41-87 45-86 (104)
19 PF11777 DUF3316: Protein of u 56.3 7.9 0.00017 26.3 1.7 10 1-10 1-10 (114)
20 PF11948 DUF3465: Protein of u 52.6 83 0.0018 22.5 6.7 51 37-90 55-119 (131)
21 PRK11443 lipoprotein; Provisio 50.4 14 0.00031 25.9 2.2 16 1-16 1-18 (124)
22 PF11714 Inhibitor_I53: Thromb 49.7 15 0.00032 23.7 2.0 10 1-10 1-10 (78)
23 KOG4824|consensus 48.1 72 0.0016 24.6 5.8 24 60-83 65-88 (224)
24 PF10855 DUF2648: Protein of u 47.0 6.5 0.00014 21.6 0.0 12 1-12 1-12 (33)
25 PF00879 Defensin_propep: Defe 45.7 60 0.0013 19.6 4.1 12 1-12 1-12 (52)
26 PRK09980 ompL outer membrane p 44.9 68 0.0015 24.8 5.4 34 40-73 40-85 (230)
27 PF11803 UXS1_N: UDP-glucurona 44.8 14 0.0003 24.1 1.3 14 1-14 17-30 (78)
28 PRK15240 resistance to complem 42.5 93 0.002 23.0 5.7 10 1-10 1-10 (185)
29 PRK15209 long polar fimbrial p 42.3 27 0.0006 24.9 2.7 12 1-12 1-12 (174)
30 PF09403 FadA: Adhesion protei 39.9 9.6 0.00021 26.9 0.0 12 1-12 1-12 (126)
31 PRK09810 entericidin A; Provis 39.4 26 0.00057 20.1 1.8 8 1-8 2-9 (41)
32 COG5455 Predicted integral mem 39.2 23 0.00051 25.2 1.9 15 1-15 2-16 (129)
33 PF06316 Ail_Lom: Enterobacter 38.9 88 0.0019 23.8 5.1 14 63-76 65-78 (199)
34 PF08139 LPAM_1: Prokaryotic m 38.5 24 0.00053 18.1 1.4 8 2-9 8-15 (25)
35 PRK11372 lysozyme inhibitor; P 38.3 24 0.00051 24.1 1.8 16 1-16 3-20 (109)
36 PF06291 Lambda_Bor: Bor prote 38.1 28 0.0006 23.5 2.0 10 1-10 1-10 (97)
37 COG2821 MltA Membrane-bound ly 36.7 44 0.00096 27.8 3.4 27 62-89 192-218 (373)
38 TIGR02052 MerP mercuric transp 36.0 30 0.00066 20.5 1.9 15 1-15 1-15 (92)
39 TIGR00156 conserved hypothetic 36.0 25 0.00055 24.8 1.6 10 1-10 1-10 (126)
40 PRK10159 outer membrane phosph 35.2 35 0.00076 27.4 2.5 12 1-12 1-12 (351)
41 PF12276 DUF3617: Protein of u 34.7 68 0.0015 22.4 3.7 12 1-12 1-12 (162)
42 cd07240 ED_TypeI_classII_N N-t 34.5 67 0.0014 20.1 3.4 20 65-84 97-116 (117)
43 PRK10053 hypothetical protein; 34.2 33 0.00071 24.3 2.0 10 1-10 1-10 (130)
44 PF00659 POLO_box: POLO box du 33.9 1.1E+02 0.0024 18.4 5.3 40 40-79 4-47 (68)
45 TIGR01643 YD_repeat_2x YD repe 32.8 72 0.0016 17.0 2.9 14 68-81 9-22 (42)
46 PRK10350 hypothetical protein; 32.4 32 0.00069 24.7 1.7 13 1-13 1-13 (145)
47 PF05423 Mycobact_memb: Mycoba 32.2 1.1E+02 0.0024 21.7 4.5 24 63-87 51-74 (140)
48 COG2140 Thermophilic glucose-6 31.9 61 0.0013 24.9 3.2 29 64-92 113-141 (209)
49 COG3422 Uncharacterized conser 31.4 65 0.0014 19.9 2.7 20 33-52 7-26 (59)
50 cd07252 BphC1-RGP6_N_like N-te 31.1 74 0.0016 20.5 3.3 20 65-84 100-119 (120)
51 PHA02698 hypothetical protein; 31.1 29 0.00063 22.8 1.2 25 86-110 19-43 (89)
52 KOG4327|consensus 30.9 38 0.00082 26.0 1.9 16 94-109 168-183 (218)
53 PF03562 MltA: MltA specific i 30.7 33 0.00071 25.1 1.6 31 62-92 83-114 (158)
54 PF10584 Proteasome_A_N: Prote 30.7 1.8 3.9E-05 22.0 -3.6 11 71-81 9-19 (23)
55 PRK11528 hypothetical protein; 30.3 40 0.00087 26.4 2.1 13 1-13 1-13 (254)
56 PRK10290 superoxide dismutase; 29.9 1.2E+02 0.0026 22.4 4.5 14 1-14 1-14 (173)
57 PRK10397 lipoprotein; Provisio 29.6 40 0.00086 24.2 1.8 14 1-14 1-14 (137)
58 PF11106 YjbE: Exopolysacchari 29.6 47 0.001 21.7 2.0 8 1-8 1-8 (80)
59 smart00817 Amelin Ameloblastin 29.3 47 0.001 27.7 2.4 16 1-16 1-16 (411)
60 COG2834 LolA Outer membrane li 29.2 85 0.0018 23.3 3.6 21 70-90 93-113 (211)
61 COG5633 Predicted periplasmic 29.2 2.1E+02 0.0045 20.2 6.4 18 60-77 69-86 (123)
62 PRK13791 lysozyme inhibitor; P 29.1 61 0.0013 22.4 2.6 8 40-47 41-48 (113)
63 PF13127 DUF3955: Protein of u 29.1 33 0.00071 21.2 1.2 9 1-9 1-9 (63)
64 PHA02097 hypothetical protein 28.8 1.1E+02 0.0023 18.7 3.3 24 67-90 14-37 (59)
65 PF09244 DUF1964: Domain of un 28.2 69 0.0015 20.2 2.5 21 65-85 4-24 (68)
66 PF05968 Bacillus_PapR: Bacill 28.0 54 0.0012 19.3 1.9 10 1-10 1-10 (48)
67 PRK10449 heat-inducible protei 27.9 46 0.001 23.3 1.9 9 1-9 1-9 (140)
68 PF10956 DUF2756: Protein of u 27.8 37 0.00081 23.3 1.3 12 1-12 1-12 (104)
69 cd08344 MhqB_like_N N-terminal 27.4 79 0.0017 20.1 2.9 16 66-81 93-108 (112)
70 TIGR03836 termin_org_HMW1 cyta 27.3 57 0.0012 21.2 2.1 20 71-91 29-48 (82)
71 PRK15184 curli production asse 26.4 63 0.0014 25.9 2.6 12 1-12 1-12 (277)
72 COG3637 Opacity protein and re 26.1 49 0.0011 24.6 1.8 12 1-12 1-12 (199)
73 PF09242 FCSD-flav_bind: Flavo 26.1 1E+02 0.0022 19.3 3.1 27 65-91 9-36 (71)
74 PRK11627 hypothetical protein; 25.9 54 0.0012 24.6 2.0 8 63-70 115-122 (192)
75 KOG4737|consensus 25.6 27 0.00059 28.2 0.4 17 1-17 1-17 (326)
76 PRK12698 flgH flagellar basal 25.4 54 0.0012 25.2 2.0 9 1-9 1-9 (224)
77 PRK06724 hypothetical protein; 25.4 96 0.0021 20.9 3.1 21 64-84 103-125 (128)
78 PF08194 DIM: DIM protein; In 25.2 79 0.0017 17.7 2.1 6 1-6 1-6 (36)
79 PRK15346 outer membrane secret 25.0 48 0.001 28.1 1.8 12 1-12 1-12 (499)
80 PF05593 RHS_repeat: RHS Repea 24.9 1.2E+02 0.0026 16.2 2.9 8 70-77 11-18 (38)
81 cd09013 BphC-JF8_N_like N-term 24.9 1E+02 0.0022 19.7 3.1 17 66-82 102-118 (121)
82 PF15339 Afaf: Acrosome format 24.7 43 0.00093 25.3 1.3 15 2-16 139-153 (200)
83 PRK10959 outer membrane protei 24.4 60 0.0013 24.0 2.1 10 1-10 1-10 (212)
84 PF10972 DUF2803: Protein of u 24.4 99 0.0022 23.2 3.3 15 1-15 1-15 (213)
85 COG4764 Uncharacterized protei 23.6 70 0.0015 24.0 2.2 17 1-17 5-23 (197)
86 PF04379 DUF525: Protein of un 23.5 1.2E+02 0.0026 19.9 3.1 14 60-73 77-90 (90)
87 PRK13682 hypothetical protein; 23.3 71 0.0015 19.2 1.8 13 1-13 1-13 (51)
88 PF03032 Brevenin: Brevenin/es 23.0 66 0.0014 18.8 1.6 15 1-15 3-17 (46)
89 PF15016 DUF4520: Domain of un 22.8 2.3E+02 0.005 18.6 6.0 51 27-80 4-64 (85)
90 PF06788 UPF0257: Uncharacteri 22.8 2.1E+02 0.0045 22.3 4.8 28 62-90 140-167 (236)
91 PF05899 Cupin_3: Protein of u 22.7 1.3E+02 0.0029 18.5 3.1 24 61-86 30-53 (74)
92 PF11355 DUF3157: Protein of u 22.6 1E+02 0.0023 23.5 3.0 8 40-47 37-44 (199)
93 PF09919 DUF2149: Uncharacteri 22.4 2.4E+02 0.0053 18.6 5.4 19 60-79 72-90 (92)
94 cd07266 HPCD_N_class_II N-term 22.3 1.3E+02 0.0027 19.1 3.1 16 66-81 102-117 (121)
95 TIGR00547 lolA periplasmic cha 22.1 68 0.0015 24.0 1.9 12 1-12 1-12 (204)
96 TIGR00616 rect recombinase, ph 22.1 1.2E+02 0.0026 23.6 3.4 44 40-84 145-189 (241)
97 cd07244 FosA FosA, a Fosfomyci 22.1 1.1E+02 0.0025 19.6 2.9 18 66-83 94-111 (121)
98 PF05643 DUF799: Putative bact 21.9 65 0.0014 24.8 1.8 9 1-9 1-9 (215)
99 cd08357 Glo_EDI_BRP_like_18 Th 21.6 1.3E+02 0.0028 19.0 3.0 15 67-81 110-124 (125)
100 PHA02978 hypothetical protein; 21.6 86 0.0019 22.0 2.2 19 62-80 72-90 (135)
101 cd09012 Glo_EDI_BRP_like_24 Th 20.7 1.4E+02 0.003 19.1 3.1 15 67-81 109-123 (124)
102 cd08361 PpCmtC_N N-terminal do 20.6 1.5E+02 0.0033 19.2 3.3 16 67-82 104-119 (124)
103 PF12771 SusD-like_2: Starch-b 20.6 33 0.00072 28.7 0.0 10 1-10 1-10 (488)
104 cd07241 Glo_EDI_BRP_like_3 Thi 20.5 1.1E+02 0.0024 19.2 2.5 11 69-79 114-124 (125)
105 PF12445 FliC: Flagellin prote 20.5 1.1E+02 0.0024 21.7 2.6 24 66-89 78-106 (128)
No 1
>PF00379 Chitin_bind_4: Insect cuticle protein; InterPro: IPR000618 Insect cuticle is composed of proteins and chitin. The cuticular proteins seem to be specific to the type of cuticle (flexible or stiff) that occur at stages of the insect development. The proteins found in the flexible cuticle of larva and pupa of different insects share a conserved C-terminal section [] such a region is also found in the soft endocuticle of adults insects [] as well as in other cuticular proteins including in arachnids []. In addition, cuticular proteins share hydrophobic regions dominated by tetrapeptide repeats (A-A-P-A/V), which are presumed to be functionally important [, ]. Many insect cuticle proteins also include a 35-36 amino acid motif known as the R and R consensus. An extended form of this motif has been shown [] to bind chitin. It has no sequence similiarity to the cysteine-containing chitin-binding domain of chitinases and some peritrophic membrane proteins, suggesting that arthropods have two distinct classes of chitin-binding proteins, those with the chitin-binding domains found in lectins, chitinases and peritrophic membranes (cysCBD), and those with the type of chitin-binding domains found in cuticular proteins (non-cysCBD) []. The cuticle protein signature has been found in locust cuticle proteins 7 (LM-7), 8 (LM-8), 19 (LM-19) and endocuticle structural glycoprotein ABD-4; Hyalophora cecropia (Cecropia moth) cuticle proteins 12 and 66; Drosophila melanogaster (Fruit fly) larval cuticles proteins I, II, III and IV (LCP1 to LCP4); drosophila pupal cuticle proteins PCP, EDG-78E and EDG-84E; Manduca sexta (Tobacco hawkmoth) cuticle protein LCP-14; Tenebrio molitor (Yellow mealworm) cuticle proteins ACP-20, A1A, A2B and A3A; and Araneus diadematus (Spider) cuticle proteins ACP 11.9, ACP 12.4, ACP 12.6, ACP 15.5 and ACP 15.7.; GO: 0042302 structural constituent of cuticle
Probab=99.75 E-value=5.1e-18 Score=102.57 Aligned_cols=47 Identities=51% Similarity=0.912 Sum_probs=43.7
Q ss_pred eeeeEecCCCcEEEEeec-----CCCeEEeEEEEECCCCcEEEEEEEcCCCCe
Q psy2711 41 FGSAYTQEDGVQFTEESD-----QDGNRRGAYSYIDPSGQRRTITYTAGKNGF 88 (114)
Q Consensus 41 y~f~y~~~dg~~~~E~g~-----~~g~v~GsYsy~~pdG~~~~V~Y~AD~~Gf 88 (114)
|+|+|+++|| .++|++. ..++|+|+|+|++|||++++|+|+||++||
T Consensus 1 Y~f~Y~~~dg-~~~e~~~~~~~~~~~~v~GsY~y~~pdG~~~~V~Y~Ad~~Gf 52 (52)
T PF00379_consen 1 YSFGYETSDG-SRQEEGRPETEDEGGVVRGSYSYIDPDGQTRTVTYVADENGF 52 (52)
T ss_pred CeeEeEcCCC-CEEEEEEcccCCCCCEEEEEEEEECCCCCEEEEEEECCCCCC
Confidence 7899999999 7777775 788999999999999999999999999998
No 2
>COG5294 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=93.29 E-value=0.47 Score=33.03 Aligned_cols=45 Identities=27% Similarity=0.524 Sum_probs=26.1
Q ss_pred CceeeeEecCCCcEEEEeecCCCeEEeEEEEE----CCCCcEEEEEEEcCC
Q psy2711 39 GQFGSAYTQEDGVQFTEESDQDGNRRGAYSYI----DPSGQRRTITYTAGK 85 (114)
Q Consensus 39 g~y~f~y~~~dg~~~~E~g~~~g~v~GsYsy~----~pdG~~~~V~Y~AD~ 85 (114)
|+|++--...|.-...++...++.. .|+|. +.+|+.-+|+|+|+.
T Consensus 28 d~fnpyvk~~~~y~k~~~~t~~~~~--~y~y~i~ayn~~Gkkk~v~f~a~~ 76 (113)
T COG5294 28 DRFNPYVKITDSYAKVEEDTQDGSP--GYEYTITAYNKNGKKKEVKFTATH 76 (113)
T ss_pred CCCCceEEEeccceecccccccCCc--cceeeehhhccCCcEEEEEEEecC
Confidence 3444444444433344544444433 45443 468999999999984
No 3
>PF13956 Ibs_toxin: Toxin Ibs, type I toxin-antitoxin system
Probab=91.90 E-value=0.098 Score=25.21 Aligned_cols=12 Identities=33% Similarity=0.517 Sum_probs=8.2
Q ss_pred CchHHHHHHHHH
Q psy2711 1 MEKLVFLFAIAS 12 (114)
Q Consensus 1 M~k~v~~~~l~a 12 (114)
|||+++++.++.
T Consensus 1 MMk~vIIlvvLL 12 (19)
T PF13956_consen 1 MMKLVIILVVLL 12 (19)
T ss_pred CceehHHHHHHH
Confidence 888887765443
No 4
>PF06004 DUF903: Bacterial protein of unknown function (DUF903); InterPro: IPR010305 This family consists of several small bacterial proteins several of which are classified as putative lipoproteins. The function of this family is unknown.; PDB: 2RB6_B 3BDU_F 2RA2_F 2RD1_C 3FIF_H 2JN0_A 2K57_A.
Probab=86.78 E-value=0.58 Score=28.05 Aligned_cols=37 Identities=27% Similarity=0.545 Sum_probs=25.9
Q ss_pred eEecCCCcEEEEeecCC-CeEEeEEEEECCCCcEEEEE
Q psy2711 44 AYTQEDGVQFTEESDQD-GNRRGAYSYIDPSGQRRTIT 80 (114)
Q Consensus 44 ~y~~~dg~~~~E~g~~~-g~v~GsYsy~~pdG~~~~V~ 80 (114)
--.+.||....-.|.+. ..-.|-|+|.|.+|+.+.|+
T Consensus 3 vi~t~dG~~i~t~gkP~~D~~tG~~~y~D~~G~~~qIn 40 (50)
T PF06004_consen 3 VITTNDGRTIVTDGKPKYDKDTGMYSYTDADGKEQQIN 40 (50)
T ss_dssp EEEETTSEEEEESSEEEEETTTTEEEEEBTTS-EEEEE
T ss_pred EEEeCCCcEEEeCCCccccCCCCcEEEECCCCCEEEEc
Confidence 34567887666655432 23569999999999998875
No 5
>TIGR01655 yxeA_fam conserved hypothetical protein TIGR01655. This model represents a family of small (about 115 amino acids) uncharacterized proteins with N-terminal signal sequences, found exclusively in Gram-positive organisms. Most genomes that have any members of this family have at least two members.
Probab=85.99 E-value=5 Score=27.53 Aligned_cols=16 Identities=31% Similarity=0.727 Sum_probs=12.1
Q ss_pred ECCCCcEEEEEEEcCC
Q psy2711 70 IDPSGQRRTITYTAGK 85 (114)
Q Consensus 70 ~~pdG~~~~V~Y~AD~ 85 (114)
.|.+|+.+++++.+..
T Consensus 62 yd~~G~~k~i~f~~~~ 77 (114)
T TIGR01655 62 YDSSGKKHKVKFMAGT 77 (114)
T ss_pred ECCCCCEEEEEEEcCC
Confidence 4678888888887763
No 6
>PF13511 DUF4124: Domain of unknown function (DUF4124)
Probab=84.21 E-value=2.8 Score=24.93 Aligned_cols=11 Identities=27% Similarity=0.745 Sum_probs=7.2
Q ss_pred eEEEEECCCCc
Q psy2711 65 GAYSYIDPSGQ 75 (114)
Q Consensus 65 GsYsy~~pdG~ 75 (114)
.-|+|+|.+|+
T Consensus 14 ~vYk~~D~~G~ 24 (60)
T PF13511_consen 14 EVYKWVDENGV 24 (60)
T ss_pred cEEEEECCCCC
Confidence 34677777775
No 7
>PRK05461 apaG CO2+/MG2+ efflux protein ApaG; Reviewed
Probab=82.55 E-value=10 Score=26.75 Aligned_cols=20 Identities=30% Similarity=0.694 Sum_probs=16.4
Q ss_pred CCeEEeEEEEECCCCcEEEE
Q psy2711 60 DGNRRGAYSYIDPSGQRRTI 79 (114)
Q Consensus 60 ~g~v~GsYsy~~pdG~~~~V 79 (114)
.|..+|+|..+++||+...|
T Consensus 94 ~G~M~G~y~~~~~~G~~F~v 113 (127)
T PRK05461 94 SGTMQGHYQMVDEDGERFEV 113 (127)
T ss_pred CEEEEEEEEEEeCCCCEEEE
Confidence 46799999999999987554
No 8
>PF06286 Coleoptericin: Coleoptericin; InterPro: IPR009382 This family consists of several insect coleoptericin, acaloleptin, holotricin and rhinocerosin proteins which are all known to be antibacterial proteins []. These all appear to be short, glycine-rich molecules, inducible by infection.; GO: 0042742 defense response to bacterium, 0005576 extracellular region
Probab=81.03 E-value=0.44 Score=33.97 Aligned_cols=15 Identities=40% Similarity=0.647 Sum_probs=12.4
Q ss_pred CchHHHHHHHHHHhh
Q psy2711 1 MEKLVFLFAIASLSV 15 (114)
Q Consensus 1 M~k~v~~~~l~a~~a 15 (114)
||||.|+|+|++..+
T Consensus 1 mmkl~i~~~lia~sa 15 (143)
T PF06286_consen 1 MMKLYIIFGLIALSA 15 (143)
T ss_pred CceEeeehhHHHHHH
Confidence 999999999887444
No 9
>COG2967 ApaG Uncharacterized protein affecting Mg2+/Co2+ transport [Inorganic ion transport and metabolism]
Probab=75.58 E-value=20 Score=25.42 Aligned_cols=21 Identities=29% Similarity=0.608 Sum_probs=17.5
Q ss_pred CCeEEeEEEEECCCCcEEEEE
Q psy2711 60 DGNRRGAYSYIDPSGQRRTIT 80 (114)
Q Consensus 60 ~g~v~GsYsy~~pdG~~~~V~ 80 (114)
.|..+|+|.-++.||...+|.
T Consensus 93 ~G~M~GhY~M~~e~G~~F~v~ 113 (126)
T COG2967 93 SGTMQGHYEMIDEDGETFDVA 113 (126)
T ss_pred cceEEEEEEEecCCCcEEEee
Confidence 367899999999999987664
No 10
>PF05404 TRAP-delta: Translocon-associated protein, delta subunit precursor (TRAP-delta); InterPro: IPR008855 This family consists of several eukaryotic translocon-associated protein, delta subunit precursors (TRAP-delta or SSR-delta). The exact function of this protein is unknown [].; GO: 0005783 endoplasmic reticulum, 0016021 integral to membrane
Probab=71.04 E-value=24 Score=26.15 Aligned_cols=15 Identities=33% Similarity=0.529 Sum_probs=9.5
Q ss_pred CchHHHHHHHHH-Hhh
Q psy2711 1 MEKLVFLFAIAS-LSV 15 (114)
Q Consensus 1 M~k~v~~~~l~a-~~a 15 (114)
|+++++++++++ +|+
T Consensus 1 m~~~~~~~~~~~~~~~ 16 (167)
T PF05404_consen 1 MKKIVALLALLVSFCA 16 (167)
T ss_pred ChhHHHHHHHHHHHhh
Confidence 888876666544 444
No 11
>PF15240 Pro-rich: Proline-rich
Probab=68.41 E-value=4.7 Score=30.24 Aligned_cols=17 Identities=35% Similarity=0.362 Sum_probs=11.0
Q ss_pred CchHHHHHHHHHHhhcc
Q psy2711 1 MEKLVFLFAIASLSVAQ 17 (114)
Q Consensus 1 M~k~v~~~~l~a~~aa~ 17 (114)
|+.+||.+||||+.+|.
T Consensus 1 MLlVLLSvALLALSSAQ 17 (179)
T PF15240_consen 1 MLLVLLSVALLALSSAQ 17 (179)
T ss_pred ChhHHHHHHHHHhhhcc
Confidence 66666667777655544
No 12
>PF07437 YfaZ: YfaZ precursor; InterPro: IPR009998 This family contains the precursor of the bacterial protein YfaZ (approximately 180 residues long). Many members of this family are hypothetical proteins.
Probab=67.54 E-value=32 Score=25.46 Aligned_cols=10 Identities=30% Similarity=0.404 Sum_probs=7.5
Q ss_pred CchHHHHHHH
Q psy2711 1 MEKLVFLFAI 10 (114)
Q Consensus 1 M~k~v~~~~l 10 (114)
|+|+++++++
T Consensus 1 m~k~~~a~~~ 10 (180)
T PF07437_consen 1 MKKFLLASAA 10 (180)
T ss_pred CchHHHHHHH
Confidence 8998876654
No 13
>PF11153 DUF2931: Protein of unknown function (DUF2931); InterPro: IPR021326 Some members in this family of proteins are annotated as lipoproteins however this cannot be confirmed. Currently, there is no known function.
Probab=67.22 E-value=12 Score=27.96 Aligned_cols=16 Identities=19% Similarity=0.312 Sum_probs=9.0
Q ss_pred CchHHHHHHHHH--Hhhc
Q psy2711 1 MEKLVFLFAIAS--LSVA 16 (114)
Q Consensus 1 M~k~v~~~~l~a--~~aa 16 (114)
|+|++++++++. +|+.
T Consensus 1 mk~i~~l~l~lll~~C~~ 18 (216)
T PF11153_consen 1 MKKILLLLLLLLLTGCST 18 (216)
T ss_pred ChHHHHHHHHHHHHhhcC
Confidence 778776554333 4543
No 14
>PF12930 DUF3836: Family of unknown function (DUF3836); InterPro: IPR024339 This entry represents a family of bacterial proteins of unknown function.; PDB: 3MSW_A.
Probab=66.31 E-value=24 Score=24.99 Aligned_cols=43 Identities=16% Similarity=0.245 Sum_probs=21.0
Q ss_pred ceeeeEecCCCcEEEEee--cC---CCeEEeEEEEECCCCcEEEEEEEc
Q psy2711 40 QFGSAYTQEDGVQFTEES--DQ---DGNRRGAYSYIDPSGQRRTITYTA 83 (114)
Q Consensus 40 ~y~f~y~~~dg~~~~E~g--~~---~g~v~GsYsy~~pdG~~~~V~Y~A 83 (114)
.|.|.|+..+-....|.- +. ...-.-.+.|. -+|..++++|..
T Consensus 63 KY~Y~YD~~grl~eK~a~kWn~~k~~W~~~~~~~y~-Y~~~~~~~~Y~~ 110 (132)
T PF12930_consen 63 KYNYKYDDNGRLTEKEAYKWNSNKNGWENYYKLNYS-YNGNEVTIEYAK 110 (132)
T ss_dssp EEEEEEETTTTEEEEEEEEEETTTTEEEEEEEEEEE-EETTEEEEEEEE
T ss_pred EEEEEECCCCCEeeeeeeeecccccceeeeEEEEEE-eCCcEEEEEEee
Confidence 677888776655444432 11 12222333333 233367777754
No 15
>PRK09408 ompX outer membrane protein X; Provisional
Probab=66.18 E-value=44 Score=24.54 Aligned_cols=12 Identities=25% Similarity=0.606 Sum_probs=7.0
Q ss_pred eEEeEEEEECCC
Q psy2711 62 NRRGAYSYIDPS 73 (114)
Q Consensus 62 ~v~GsYsy~~pd 73 (114)
-+.||++|...|
T Consensus 61 Gvi~sfty~~~~ 72 (171)
T PRK09408 61 GVIGSFTYTEKS 72 (171)
T ss_pred EEEEEEEEecCc
Confidence 356666666543
No 16
>KOG4088|consensus
Probab=61.76 E-value=47 Score=24.34 Aligned_cols=17 Identities=12% Similarity=0.335 Sum_probs=11.7
Q ss_pred CchHHHHHHHHH-Hhhcc
Q psy2711 1 MEKLVFLFAIAS-LSVAQ 17 (114)
Q Consensus 1 M~k~v~~~~l~a-~~aa~ 17 (114)
|.|.++.+||++ +|+++
T Consensus 1 ms~~a~AlcLvas~C~~~ 18 (167)
T KOG4088|consen 1 MSKIAVALCLVASACLCA 18 (167)
T ss_pred CchhHHHHHHHHHHHHHH
Confidence 777777777776 55554
No 17
>COG4939 Major membrane immunogen, membrane-anchored lipoprotein [Function unknown]
Probab=61.05 E-value=53 Score=23.64 Aligned_cols=42 Identities=21% Similarity=0.294 Sum_probs=27.8
Q ss_pred CCCCceeeeEecCCCcE----EEEeecCCCeEEeEEEEECCCCcEE
Q psy2711 36 SNNGQFGSAYTQEDGVQ----FTEESDQDGNRRGAYSYIDPSGQRR 77 (114)
Q Consensus 36 ~~~g~y~f~y~~~dg~~----~~E~g~~~g~v~GsYsy~~pdG~~~ 77 (114)
-.+|.|+-.|+.-|... -+-+-+.+..|+=.|-|.+-+|+..
T Consensus 29 ~~dGtY~~~y~~fDd~gwk~f~~iti~dGKiv~~~ydy~~k~G~~K 74 (147)
T COG4939 29 FNDGTYQGHYESFDDHGWKAFVTITIQDGKIVACTYDYRDKKGNIK 74 (147)
T ss_pred ccCCceeeeeccccccCccceEEEEEeCCEEEEEEeeeecCCCCcc
Confidence 46788998888765321 1112234457888899999998864
No 18
>PF13473 Cupredoxin_1: Cupredoxin-like domain; PDB: 1IBZ_D 1IC0_E 1IBY_D.
Probab=59.59 E-value=19 Score=23.48 Aligned_cols=42 Identities=24% Similarity=0.260 Sum_probs=17.0
Q ss_pred eeeeEecCCCcEEEEeecCCCeEEeEEEEECCCCcEEEEEEEcCCCC
Q psy2711 41 FGSAYTQEDGVQFTEESDQDGNRRGAYSYIDPSGQRRTITYTAGKNG 87 (114)
Q Consensus 41 y~f~y~~~dg~~~~E~g~~~g~v~GsYsy~~pdG~~~~V~Y~AD~~G 87 (114)
+.+.+...|...+.=.-+. .. -+-..+-|+..+++.++++-|
T Consensus 45 v~l~~~N~~~~~h~~~i~~-~~----~~~~l~~g~~~~~~f~~~~~G 86 (104)
T PF13473_consen 45 VTLTFTNNDSRPHEFVIPD-LG----ISKVLPPGETATVTFTPLKPG 86 (104)
T ss_dssp EEEEEEE-SSS-EEEEEGG-GT----EEEEE-TT-EEEEEEEE-S-E
T ss_pred EEEEEEECCCCcEEEEECC-Cc----eEEEECCCCEEEEEEcCCCCE
Confidence 7777766655542222221 11 112334456666666666544
No 19
>PF11777 DUF3316: Protein of unknown function (DUF3316); InterPro: IPR016879 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=56.29 E-value=7.9 Score=26.34 Aligned_cols=10 Identities=30% Similarity=0.740 Sum_probs=7.5
Q ss_pred CchHHHHHHH
Q psy2711 1 MEKLVFLFAI 10 (114)
Q Consensus 1 M~k~v~~~~l 10 (114)
|+|+++++++
T Consensus 1 MKk~~ll~~~ 10 (114)
T PF11777_consen 1 MKKIILLASL 10 (114)
T ss_pred CchHHHHHHH
Confidence 8898877754
No 20
>PF11948 DUF3465: Protein of unknown function (DUF3465); InterPro: IPR021856 This family of proteins are functionally uncharacterised. This protein is found in bacteria. Proteins in this family are typically between 131 to 151 amino acids in length. This protein has a conserved HWTH sequence motif.
Probab=52.58 E-value=83 Score=22.46 Aligned_cols=51 Identities=14% Similarity=0.175 Sum_probs=29.1
Q ss_pred CCCceeeeEecCCCcEEEEeec-----------CC--CeEEeEEEEECCCCcEEEEEEEc-CCCCeee
Q psy2711 37 NNGQFGSAYTQEDGVQFTEESD-----------QD--GNRRGAYSYIDPSGQRRTITYTA-GKNGFQA 90 (114)
Q Consensus 37 ~~g~y~f~y~~~dg~~~~E~g~-----------~~--g~v~GsYsy~~pdG~~~~V~Y~A-D~~Gf~~ 90 (114)
+...-.|-.+.++|.+-.-.-+ .+ -.+.|+|.|.. .|. .|.||- |..|=++
T Consensus 55 GsrHQ~Fiv~l~~g~tllIahNIDlaprip~l~~GD~V~f~GeYe~n~-kgg--vIHWTH~dp~~~h~ 119 (131)
T PF11948_consen 55 GSRHQRFIVRLSSGQTLLIAHNIDLAPRIPWLQKGDQVEFYGEYEWNP-KGG--VIHWTHHDPRGRHP 119 (131)
T ss_pred CCcceEEEEEeCCCCEEEEEeccCccccCcCcCCCCEEEEEEEEEECC-CCC--EEEeeccCCCCCCC
Confidence 3345667777888876332211 11 25889998877 554 456665 4444333
No 21
>PRK11443 lipoprotein; Provisional
Probab=50.37 E-value=14 Score=25.87 Aligned_cols=16 Identities=19% Similarity=0.077 Sum_probs=9.6
Q ss_pred CchHHHHHHH--HHHhhc
Q psy2711 1 MEKLVFLFAI--ASLSVA 16 (114)
Q Consensus 1 M~k~v~~~~l--~a~~aa 16 (114)
|+++++++++ |+.|++
T Consensus 1 Mk~~~~~~~~~lLsgCa~ 18 (124)
T PRK11443 1 MKKFIAPLLALLLSGCQI 18 (124)
T ss_pred ChHHHHHHHHHHHHhccC
Confidence 8887766653 334554
No 22
>PF11714 Inhibitor_I53: Thrombin inhibitor Madanin ; InterPro: IPR021716 Members of this family are the peptidase inhibitor madanin proteins. These proteins were isolated from tick saliva [].
Probab=49.70 E-value=15 Score=23.70 Aligned_cols=10 Identities=20% Similarity=0.408 Sum_probs=7.2
Q ss_pred CchHHHHHHH
Q psy2711 1 MEKLVFLFAI 10 (114)
Q Consensus 1 M~k~v~~~~l 10 (114)
|++|.+|+++
T Consensus 1 MKhFaiLila 10 (78)
T PF11714_consen 1 MKHFAILILA 10 (78)
T ss_pred CchHHHHHHH
Confidence 8888766554
No 23
>KOG4824|consensus
Probab=48.08 E-value=72 Score=24.63 Aligned_cols=24 Identities=21% Similarity=0.461 Sum_probs=19.7
Q ss_pred CCeEEeEEEEECCCCcEEEEEEEc
Q psy2711 60 DGNRRGAYSYIDPSGQRRTITYTA 83 (114)
Q Consensus 60 ~g~v~GsYsy~~pdG~~~~V~Y~A 83 (114)
+|..+|.|++.++||+++..+-.-
T Consensus 65 ~gctt~~y~~~nkngkI~Vln~~v 88 (224)
T KOG4824|consen 65 GGCTTGAYTFDNKNGKIHVLNECV 88 (224)
T ss_pred CCceeeeeEecCCCceEEEeeeee
Confidence 457899999999999998877554
No 24
>PF10855 DUF2648: Protein of unknown function (DUF2648); InterPro: IPR022561 This family of proteins with unknown function appears to be restricted to eubacteia.
Probab=47.01 E-value=6.5 Score=21.55 Aligned_cols=12 Identities=25% Similarity=0.598 Sum_probs=9.0
Q ss_pred CchHHHHHHHHH
Q psy2711 1 MEKLVFLFAIAS 12 (114)
Q Consensus 1 M~k~v~~~~l~a 12 (114)
|+||.+++.|..
T Consensus 1 MKkl~i~L~l~g 12 (33)
T PF10855_consen 1 MKKLAIILILGG 12 (33)
T ss_pred CCceeehhhhhh
Confidence 889888777654
No 25
>PF00879 Defensin_propep: Defensin propeptide The pattern for this Prosite entry doesn't match the propeptide.; InterPro: IPR002366 Defensins are 2-6 kDa, cationic, microbicidal peptides active against many Gram-negative and Gram-positive bacteria, fungi, and enveloped viruses [], containing three pairs of intramolecular disulphide bonds []. On the basis of their size and pattern of disulphide bonding, mammalian defensins are classified into alpha, beta and theta categories. Alpha-defensins, which have been identified in humans, monkeys and several rodent species, are particularly abundant in neutrophils, certain macrophage populations and Paneth cells of the small intestine. Every mammalian species explored thus far has beta-defensins. In cows, as many as 13 beta-defensins exist in neutrophils. However, in other species, beta-defensins are more often produced by epithelial cells lining various organs (e.g. the epidermis, bronchial tree and genitourinary tract). Theta-defensins are cyclic and have so far only been identified in primate phagocytes. Defensins are produced constitutively and/or in response to microbial products or proinflammatory cytokines. Some defensins are also called corticostatins (CS) because they inhibit corticotropin-stimulated corticosteroid production. The mechanism(s) by which microorganisms are killed and/or inactivated by defensins is not understood completely. However, it is generally believed that killing is a consequence of disruption of the microbial membrane. The polar topology of defensins, with spatially separated charged and hydrophobic regions, allows them to insert themselves into the phospholipid membranes so that their hydrophobic regions are buried within the lipid membrane interior and their charged (mostly cationic) regions interact with anionic phospholipid head groups and water. Subsequently, some defensins can aggregate to form `channel-like' pores; others might bind to and cover the microbial membrane in a `carpet-like' manner. The net outcome is the disruption of membrane integrity and function, which ultimately leads to the lysis of microorganisms. Some defensins are synthesized as propeptides which may be relevant to this process - in neutrophils only the mature peptides have been identified but in Paneth cells, the propeptide is stored in vesicles [] and appears to be cleaved by trypsin on activation. ; GO: 0006952 defense response
Probab=45.67 E-value=60 Score=19.56 Aligned_cols=12 Identities=33% Similarity=0.404 Sum_probs=8.5
Q ss_pred CchHHHHHHHHH
Q psy2711 1 MEKLVFLFAIAS 12 (114)
Q Consensus 1 M~k~v~~~~l~a 12 (114)
|..|+++.+|+.
T Consensus 1 MRTL~LLaAlLL 12 (52)
T PF00879_consen 1 MRTLALLAALLL 12 (52)
T ss_pred CcHHHHHHHHHH
Confidence 777877777554
No 26
>PRK09980 ompL outer membrane porin L; Provisional
Probab=44.88 E-value=68 Score=24.76 Aligned_cols=34 Identities=12% Similarity=0.134 Sum_probs=18.1
Q ss_pred ceeeeEecCCCcEEEEeec-----------CCCeEEeEEEE-ECCC
Q psy2711 40 QFGSAYTQEDGVQFTEESD-----------QDGNRRGAYSY-IDPS 73 (114)
Q Consensus 40 ~y~f~y~~~dg~~~~E~g~-----------~~g~v~GsYsy-~~pd 73 (114)
....++..+||.-...+.. ...++.++|.| ..-|
T Consensus 40 Ri~vsh~f~nG~Gf~~eak~~~~~dd~~~~~~~E~~~sY~~~k~~d 85 (230)
T PRK09980 40 MLRVGYNFDMGAGIMLTNTYTFQRDDELKHGYNEIEGWYPLFKPTD 85 (230)
T ss_pred EEEEEEEccCceeEEEEEEeccCCCccccccceEEEEEEEeEecCC
Confidence 5666666667664433321 12356678864 4444
No 27
>PF11803 UXS1_N: UDP-glucuronate decarboxylase N-terminal; InterPro: IPR021761 The N terminus of the UDP-glucuronate decarboxylases may be involved in localisation to the perinuclear Golgi membrane. ; GO: 0048040 UDP-glucuronate decarboxylase activity
Probab=44.84 E-value=14 Score=24.08 Aligned_cols=14 Identities=21% Similarity=0.207 Sum_probs=10.9
Q ss_pred CchHHHHHHHHHHh
Q psy2711 1 MEKLVFLFAIASLS 14 (114)
Q Consensus 1 M~k~v~~~~l~a~~ 14 (114)
|||+++.++|++-+
T Consensus 17 ~mkll~~lal~ayi 30 (78)
T PF11803_consen 17 RMKLLLGLALIAYI 30 (78)
T ss_pred HHHHHHHHHHHHHH
Confidence 78999888887733
No 28
>PRK15240 resistance to complement killing; Provisional
Probab=42.52 E-value=93 Score=22.96 Aligned_cols=10 Identities=30% Similarity=0.454 Sum_probs=6.9
Q ss_pred CchHHHHHHH
Q psy2711 1 MEKLVFLFAI 10 (114)
Q Consensus 1 M~k~v~~~~l 10 (114)
|+|.++++++
T Consensus 1 Mkk~~~~~~~ 10 (185)
T PRK15240 1 MKKIVLSSLL 10 (185)
T ss_pred CchhHHHHHH
Confidence 8888765544
No 29
>PRK15209 long polar fimbrial protein LpfA; Provisional
Probab=42.27 E-value=27 Score=24.93 Aligned_cols=12 Identities=33% Similarity=0.609 Sum_probs=8.1
Q ss_pred CchHHHHHHHHH
Q psy2711 1 MEKLVFLFAIAS 12 (114)
Q Consensus 1 M~k~v~~~~l~a 12 (114)
|+|.++.+++++
T Consensus 1 Mkk~~~~~~~~~ 12 (174)
T PRK15209 1 MKKVVFALSALA 12 (174)
T ss_pred CchHHHHHHHHH
Confidence 899877665443
No 30
>PF09403 FadA: Adhesion protein FadA; InterPro: IPR018543 FadA (Fusobacterium adhesin A) is an adhesin which forms two alpha helices. ; PDB: 3ETZ_B 3ETY_A 2GL2_B 3ETX_C 3ETW_A.
Probab=39.90 E-value=9.6 Score=26.92 Aligned_cols=12 Identities=25% Similarity=0.359 Sum_probs=0.0
Q ss_pred CchHHHHHHHHH
Q psy2711 1 MEKLVFLFAIAS 12 (114)
Q Consensus 1 M~k~v~~~~l~a 12 (114)
|+|++|++.|++
T Consensus 1 MKK~ll~~~lll 12 (126)
T PF09403_consen 1 MKKILLLGMLLL 12 (126)
T ss_dssp ------------
T ss_pred ChHHHHHHHHHH
Confidence 889877655443
No 31
>PRK09810 entericidin A; Provisional
Probab=39.45 E-value=26 Score=20.09 Aligned_cols=8 Identities=38% Similarity=0.812 Sum_probs=3.2
Q ss_pred CchHHHHH
Q psy2711 1 MEKLVFLF 8 (114)
Q Consensus 1 M~k~v~~~ 8 (114)
|+|+++++
T Consensus 2 Mkk~~~l~ 9 (41)
T PRK09810 2 MKRLIVLV 9 (41)
T ss_pred hHHHHHHH
Confidence 34444333
No 32
>COG5455 Predicted integral membrane protein [Function unknown]
Probab=39.18 E-value=23 Score=25.19 Aligned_cols=15 Identities=27% Similarity=0.244 Sum_probs=8.6
Q ss_pred CchHHHHHHHHHHhh
Q psy2711 1 MEKLVFLFAIASLSV 15 (114)
Q Consensus 1 M~k~v~~~~l~a~~a 15 (114)
|+|||++++++++++
T Consensus 2 mkrlv~~lla~s~~a 16 (129)
T COG5455 2 MKRLVSALLAVSSVA 16 (129)
T ss_pred chhHHHHHHHHHHhh
Confidence 666666665555443
No 33
>PF06316 Ail_Lom: Enterobacterial Ail/Lom protein; InterPro: IPR000758 Virulence-related outer membrane proteins are expressed in Gram-negative bacteria and are essential to bacterial survival within macrophages and for eukaryotic cell invasion. Members of this group include: PagC, required by Salmonella typhimurium for survival in macrophages and for virulence in mice [] Rck outer membrane protein of the S. typhimurium virulence plasmid [] Ail, a product of the Yersinia enterocolitica chromosome capable of mediating bacterial adherence to and invasion of epithelial cell lines [] OmpX from Escherichia coli that promotes adhesion to and entry into mammalian cells. It also has a role in the resistance against attack by the human complement system [] a Bacteriophage lambda outer membrane protein, Lom [] The crystal structure of OmpX from E. coli reveals that OmpX consists of an eight-stranded antiparallel all-next-neighbour beta barrel []. The structure shows two girdles of aromatic amino acid residues and a ribbon of nonpolar residues that attach to the membrane interior. The core of the barrel consists of an extended hydrogen-bonding network of highly conserved residues. OmpX thus resembles an inverse micelle. The OmpX structure shows that the membrane-spanning part of the protein is much better conserved than the extracellular loops. Moreover, these loops form a protruding beta sheet, the edge of which presumably binds to external proteins. It is suggested that this type of binding promotes cell adhesion and invasion and helps defend against the complement system. Although OmpX has the same beta-sheet topology as the structurally related outer membrane protein A (OmpA) IPR000498 from INTERPRO, their barrels differ with respect to the shear numbers and internal hydrogen-bonding networks.; GO: 0009279 cell outer membrane
Probab=38.87 E-value=88 Score=23.84 Aligned_cols=14 Identities=14% Similarity=0.226 Sum_probs=10.0
Q ss_pred EEeEEEEECCCCcE
Q psy2711 63 RRGAYSYIDPSGQR 76 (114)
Q Consensus 63 v~GsYsy~~pdG~~ 76 (114)
+.||++|...|+..
T Consensus 65 vItSfSY~~~~~~~ 78 (199)
T PF06316_consen 65 VITSFSYASGDDSS 78 (199)
T ss_pred eeEEEEEeecCCCc
Confidence 67788887777654
No 34
>PF08139 LPAM_1: Prokaryotic membrane lipoprotein lipid attachment site; InterPro: IPR012640 In prokaryotes, membrane lipoproteins are synthesized with a precursor signal peptide, which is cleaved by a specific lipoprotein signal peptidase (signal peptidase II). The peptidase recognises a conserved sequence and cuts upstream of a cysteine residue to which a glyceride-fatty acid lipid is attached [,]. This lipid attachment site is found in homologues of the VirB proteins of type IV secretion systems (T4SS). Conjugal transfer across the cell envelope of Gram-negative bacteria is mediated by a supramolecular structure termed mating pair formation (Mpf) complex. Collectively, secretion pathways ancestrally related to bacterial conjugation systems are now known as T4SS. T4SS are involved in the delivery of effector molecules to eukaryotic target cells; each of these systems exports distinct DNA or protein substrates to effect a myriad of changes in host cell physiology during infection [].
Probab=38.52 E-value=24 Score=18.13 Aligned_cols=8 Identities=25% Similarity=0.604 Sum_probs=3.9
Q ss_pred chHHHHHH
Q psy2711 2 EKLVFLFA 9 (114)
Q Consensus 2 ~k~v~~~~ 9 (114)
+|+++.++
T Consensus 8 Kkil~~l~ 15 (25)
T PF08139_consen 8 KKILFPLL 15 (25)
T ss_pred HHHHHHHH
Confidence 55554444
No 35
>PRK11372 lysozyme inhibitor; Provisional
Probab=38.32 E-value=24 Score=24.12 Aligned_cols=16 Identities=25% Similarity=0.233 Sum_probs=9.0
Q ss_pred CchHHHHHHH--HHHhhc
Q psy2711 1 MEKLVFLFAI--ASLSVA 16 (114)
Q Consensus 1 M~k~v~~~~l--~a~~aa 16 (114)
|+++++++++ |+.|+.
T Consensus 3 mk~ll~~~~~~lL~gCs~ 20 (109)
T PRK11372 3 MKKLLIICLPVLLTGCSA 20 (109)
T ss_pred hHHHHHHHHHHHHHHhcC
Confidence 6776666553 444543
No 36
>PF06291 Lambda_Bor: Bor protein; InterPro: IPR010438 This family consists of several Bacteriophage lambda Bor and Escherichia coli Iss proteins. Expression of bor significantly increases the survival of the E. coli host cell in animal serum. This property is a well known bacterial virulence determinant indeed, bor and its adjacent sequences are highly homologous to the iss serum resistance locus of the plasmid ColV2-K94, which confers virulence in animals. It has been suggested that lysogeny may generally have a role in bacterial survival in animal hosts, and perhaps in pathogenesis [].
Probab=38.06 E-value=28 Score=23.51 Aligned_cols=10 Identities=40% Similarity=0.607 Sum_probs=7.2
Q ss_pred CchHHHHHHH
Q psy2711 1 MEKLVFLFAI 10 (114)
Q Consensus 1 M~k~v~~~~l 10 (114)
|+|++++++|
T Consensus 1 mKk~ll~~~l 10 (97)
T PF06291_consen 1 MKKLLLAAAL 10 (97)
T ss_pred CcHHHHHHHH
Confidence 8888776654
No 37
>COG2821 MltA Membrane-bound lytic murein transglycosylase [Cell envelope biogenesis, outer membrane]
Probab=36.66 E-value=44 Score=27.81 Aligned_cols=27 Identities=30% Similarity=0.388 Sum_probs=23.1
Q ss_pred eEEeEEEEECCCCcEEEEEEEcCCCCee
Q psy2711 62 NRRGAYSYIDPSGQRRTITYTAGKNGFQ 89 (114)
Q Consensus 62 ~v~GsYsy~~pdG~~~~V~Y~AD~~Gf~ 89 (114)
.|+||=...-+||++..+.| |++||+-
T Consensus 192 ~VQGSg~l~~~dG~~~r~~Y-Agkng~p 218 (373)
T COG2821 192 HVQGSGRLDLGDGRLNRFGY-AGKNGHP 218 (373)
T ss_pred eecccEEEEcCCCCEEEEEE-cccCCCc
Confidence 59999999999999999998 6667653
No 38
>TIGR02052 MerP mercuric transport protein periplasmic component. This model represents the periplasmic mercury (II) binding protein of the bacterial mercury detoxification system which passes mercuric ion to the MerT transporter for subsequent reduction to Hg(0) by the mercuric reductase MerA. MerP contains a distinctive GMTCXXC motif associated with metal binding. MerP is related to a larger family of metal binding proteins (pfam00403).
Probab=36.00 E-value=30 Score=20.50 Aligned_cols=15 Identities=40% Similarity=0.512 Sum_probs=10.6
Q ss_pred CchHHHHHHHHHHhh
Q psy2711 1 MEKLVFLFAIASLSV 15 (114)
Q Consensus 1 M~k~v~~~~l~a~~a 15 (114)
|+|++-+++|+....
T Consensus 1 ~~~~~~~~~~~~~~~ 15 (92)
T TIGR02052 1 MKKLATLLALFVLTS 15 (92)
T ss_pred ChhHHHHHHHHHHhc
Confidence 888887777765443
No 39
>TIGR00156 conserved hypothetical protein TIGR00156. As of the last revision, this family consists only of two proteins from Escherichia coli and one from the related species Haemophilus influenzae.
Probab=35.97 E-value=25 Score=24.82 Aligned_cols=10 Identities=20% Similarity=0.205 Sum_probs=6.9
Q ss_pred CchHHHHHHH
Q psy2711 1 MEKLVFLFAI 10 (114)
Q Consensus 1 M~k~v~~~~l 10 (114)
|+|++++++|
T Consensus 1 MKK~~~~~~~ 10 (126)
T TIGR00156 1 MKFQAIVLAS 10 (126)
T ss_pred CchHHHHHHH
Confidence 8887765554
No 40
>PRK10159 outer membrane phosphoporin protein E; Provisional
Probab=35.15 E-value=35 Score=27.38 Aligned_cols=12 Identities=17% Similarity=0.343 Sum_probs=7.9
Q ss_pred CchHHHHHHHHH
Q psy2711 1 MEKLVFLFAIAS 12 (114)
Q Consensus 1 M~k~v~~~~l~a 12 (114)
|+|.++.+++++
T Consensus 1 Mkk~l~a~~~~a 12 (351)
T PRK10159 1 MKKSTLALVVMG 12 (351)
T ss_pred CchhhHHHHHHH
Confidence 888776665554
No 41
>PF12276 DUF3617: Protein of unknown function (DUF3617); InterPro: IPR022061 This family of proteins is found in bacteria. Proteins in this family are typically between 155 and 179 amino acids in length. There is a single completely conserved residue C that may be functionally important.
Probab=34.65 E-value=68 Score=22.38 Aligned_cols=12 Identities=17% Similarity=0.263 Sum_probs=7.8
Q ss_pred CchHHHHHHHHH
Q psy2711 1 MEKLVFLFAIAS 12 (114)
Q Consensus 1 M~k~v~~~~l~a 12 (114)
|.++++++++++
T Consensus 1 M~~~~~~~~~~~ 12 (162)
T PF12276_consen 1 MKRRLLLALALA 12 (162)
T ss_pred CchHHHHHHHHH
Confidence 777776666544
No 42
>cd07240 ED_TypeI_classII_N N-terminal domain of type I, class II extradiol dioxygenases; non-catalytic domain. This family contains the N-terminal, non-catalytic, domain of type I, class II extradiol dioxygenases. Dioxygenases catalyze the incorporation of both atoms of molecular oxygen into substrates using a variety of reaction mechanisms, resulting in the cleavage of aromatic rings. Two major groups of dioxygenases have been identified according to the cleavage site; extradiol enzymes cleave the aromatic ring between a hydroxylated carbon and an adjacent non-hydroxylated carbon, whereas intradiol enzymes cleave the aromatic ring between two hydroxyl groups. Extradiol dioxygenases are classified into type I and type II enzymes. Type I extradiol dioxygenases include class I and class II enzymes. These two classes of enzymes show sequence similarity; the two-domain class II enzymes evolved from a class I enzyme through gene duplication. The extradiol dioxygenases represented in this fa
Probab=34.54 E-value=67 Score=20.09 Aligned_cols=20 Identities=20% Similarity=0.343 Sum_probs=15.8
Q ss_pred eEEEEECCCCcEEEEEEEcC
Q psy2711 65 GAYSYIDPSGQRRTITYTAG 84 (114)
Q Consensus 65 GsYsy~~pdG~~~~V~Y~AD 84 (114)
-++.+.||||..+++.+..|
T Consensus 97 ~~~~~~DP~G~~ie~~~~~~ 116 (117)
T cd07240 97 RGLRFQDPDGHLLELFVEAD 116 (117)
T ss_pred eEEEEECCCCCEEEEEEccC
Confidence 35678999999999887655
No 43
>PRK10053 hypothetical protein; Provisional
Probab=34.17 E-value=33 Score=24.34 Aligned_cols=10 Identities=20% Similarity=0.145 Sum_probs=6.9
Q ss_pred CchHHHHHHH
Q psy2711 1 MEKLVFLFAI 10 (114)
Q Consensus 1 M~k~v~~~~l 10 (114)
|+|+++++++
T Consensus 1 MKK~~~~~~~ 10 (130)
T PRK10053 1 MKLQAIALAS 10 (130)
T ss_pred CcHHHHHHHH
Confidence 8997765554
No 44
>PF00659 POLO_box: POLO box duplicated region; InterPro: IPR000959 A subgroup of serine/threonine protein kinases, Polo or Polo-like kinases play multiple roles during the cell cycle. Polo kinases are required at several key points through mitosis, starting from control of the G2/M transition through phosphorylation of Cdc25C and mitotic cyclins. Polo kinases are characterised by an amino terminal catalytic domain, and a carboxy terminal non-catalytic domain consisting of three blocks of conserved sequences known as polo boxes which form one single functional domain []. The domain is named after its founding member encoded by the polo gene of Drosophila melanogaster []. This domain of around 70 amino acids has been found in species ranging from yeast to mammals. Polo boxes appear to mediate interaction with multiple proteins through protein:protein interactions; some but not all of these proteins are substrates for the kinase domain of the molecule []. The crystal structure of the polo domain of the murine protein, Sak, is dimeric, consisting of two alpha-helices and two six-stranded beta-sheets []. The topology of one polypeptide subunit of the dimer consists of, from its N- to C terminus, an extended strand segment, five beta-strands, one alpha-helix (A) and a C-terminal beta-strand. Beta-strands from one subunit form a contiguous antiparallel beta-sheet with beta-strands from the second subunit. The two beta-sheets pack with a crossing angle of 110 degrees, orienting the hydrophobic surfaces inward and the hydrophilic surfaces outward. Helix A, which is colinear with beta-strand 6 of the same polypeptide, buries a large portion of the non-overlapping hydrophobic beta-sheet surfaces. Interactions involving helices A comprise a majority of the hydrophobic core structure and also the dimer interface. Point mutations in the Polo box of the budding yeast Cdc5 protein abolish the ability of overexpressed Cdc5 to interact with the spindle poles and to organise cytokinetic structures [].; GO: 0005515 protein binding; PDB: 1MBY_B 3P37_A 3MHN_A 1Q4K_A 3HIK_A 3Q1I_A 3P35_A 3MHQ_A 1UMW_B 3MQ8_B ....
Probab=33.92 E-value=1.1e+02 Score=18.40 Aligned_cols=40 Identities=25% Similarity=0.422 Sum_probs=23.1
Q ss_pred ceeeeEecCCCcEEEEeecCCCeE----EeEEEEECCCCcEEEE
Q psy2711 40 QFGSAYTQEDGVQFTEESDQDGNR----RGAYSYIDPSGQRRTI 79 (114)
Q Consensus 40 ~y~f~y~~~dg~~~~E~g~~~g~v----~GsYsy~~pdG~~~~V 79 (114)
.|.+.|..+||+.+-.--|....+ .....|++++|+..+.
T Consensus 4 k~gi~~~LSng~vqv~FnD~tkivl~~~~~~v~yi~~~~~~~~~ 47 (68)
T PF00659_consen 4 KYGIGYQLSNGTVQVNFNDHTKIVLSPDGRLVTYIDRDGERQTY 47 (68)
T ss_dssp CSEEEEEETTSEEEEEETTS-EEEEETTCCEEEEE-TTS-EEEE
T ss_pred ceEEEEEEeCCCEEEEEeCCCEEEECCCCCEEEEECCCCcEEEE
Confidence 367888889988755433322111 1377888888876544
No 45
>TIGR01643 YD_repeat_2x YD repeat (two copies). This model describes two tandem copies of a 21-residue extracellular repeat found in Gram-negative, Gram-positive, and animal proteins. The repeat is named for a YD dipeptide, the most strongly conserved motif of the repeat. These repeats appear in general to be involved in binding carbohydrate; the chicken teneurin-1 YD-repeat region has been shown to bind heparin.
Probab=32.76 E-value=72 Score=17.03 Aligned_cols=14 Identities=29% Similarity=0.453 Sum_probs=8.9
Q ss_pred EEECCCCcEEEEEE
Q psy2711 68 SYIDPSGQRRTITY 81 (114)
Q Consensus 68 sy~~pdG~~~~V~Y 81 (114)
+.++|+|...+.+|
T Consensus 9 ~~~~p~G~~~~~~Y 22 (42)
T TIGR01643 9 GSTDADGTTTRYTY 22 (42)
T ss_pred EEECCCCCEEEEEE
Confidence 35667777666666
No 46
>PRK10350 hypothetical protein; Provisional
Probab=32.43 E-value=32 Score=24.73 Aligned_cols=13 Identities=31% Similarity=0.621 Sum_probs=10.1
Q ss_pred CchHHHHHHHHHH
Q psy2711 1 MEKLVFLFAIASL 13 (114)
Q Consensus 1 M~k~v~~~~l~a~ 13 (114)
|++|+++.+|+-+
T Consensus 1 MKr~L~~~allP~ 13 (145)
T PRK10350 1 MKRLLLLTALLPF 13 (145)
T ss_pred ChhHHHHHHHhHH
Confidence 8899988887653
No 47
>PF05423 Mycobact_memb: Mycobacterium membrane protein; InterPro: IPR008693 This family contains several membrane proteins from Mycobacterium species [].
Probab=32.19 E-value=1.1e+02 Score=21.72 Aligned_cols=24 Identities=29% Similarity=0.601 Sum_probs=16.0
Q ss_pred EEeEEEEECCCCcEEEEEEEcCCCC
Q psy2711 63 RRGAYSYIDPSGQRRTITYTAGKNG 87 (114)
Q Consensus 63 v~GsYsy~~pdG~~~~V~Y~AD~~G 87 (114)
.+=.|....+.|...+|+|. |++|
T Consensus 51 k~V~YEV~G~~G~~~~I~Y~-D~~~ 74 (140)
T PF05423_consen 51 KTVTYEVTGPPGSTATISYL-DADG 74 (140)
T ss_pred cEEEEEEEcCCCCeEEEEEE-cCCC
Confidence 34467777777887888886 4444
No 48
>COG2140 Thermophilic glucose-6-phosphate isomerase and related metalloenzymes [Carbohydrate transport and metabolism / General function prediction only]
Probab=31.93 E-value=61 Score=24.89 Aligned_cols=29 Identities=28% Similarity=0.447 Sum_probs=26.9
Q ss_pred EeEEEEECCCCcEEEEEEEcCCCCeeecc
Q psy2711 64 RGAYSYIDPSGQRRTITYTAGKNGFQASG 92 (114)
Q Consensus 64 ~GsYsy~~pdG~~~~V~Y~AD~~Gf~~~~ 92 (114)
+|.+-.-+++|+.+++.+.++.-||.|.+
T Consensus 113 ~g~m~v~~~~G~~~v~~~~~Gd~iyVPp~ 141 (209)
T COG2140 113 EGRMLVQKPEGEARVIAVRAGDVIYVPPG 141 (209)
T ss_pred cEEEEEEcCCCcEEEEEecCCcEEEeCCC
Confidence 58899999999999999999999999976
No 49
>COG3422 Uncharacterized conserved protein [Function unknown]
Probab=31.40 E-value=65 Score=19.94 Aligned_cols=20 Identities=15% Similarity=0.305 Sum_probs=16.3
Q ss_pred eccCCCCceeeeEecCCCcE
Q psy2711 33 RYLSNNGQFGSAYTQEDGVQ 52 (114)
Q Consensus 33 ~~~~~~g~y~f~y~~~dg~~ 52 (114)
-+.+.+|.|.|.+...||..
T Consensus 7 i~kdk~Ge~rfrlkA~N~ei 26 (59)
T COG3422 7 IYKDKAGEYRFRLKAANGEI 26 (59)
T ss_pred EEEcCCCcEEEEEEccCccE
Confidence 35678899999999988864
No 50
>cd07252 BphC1-RGP6_N_like N-terminal domain of 2,3-dihydroxybiphenyl 1,2-dioxygenase (BphC, EC 1.13.11.39) 1 from Rhodococcus globerulus P6 (BphC1-RGP6) and similar proteins. This subfamily contains the N-terminal, non-catalytic, domain of BphC1-RGP6 and similar proteins. BphC catalyzes the extradiol ring cleavage reaction of 2,3-dihydroxybiphenyl, the third step in the polychlorinated biphenyls (PCBs) degradation pathway (bph pathway). This subfamily of BphCs belongs to the type I extradiol dioxygenase family, which require a metal in the active site in its catalytic mechanism. Polychlorinated biphenyl degrading bacteria demonstrate a multiplicity of 2,3-dihydroxybiphenyl 1,2-dioxygenases. For example, three types of BphC enzymes have been found in Rhodococcus globerulus (BphC1-RGP6 - BphC3-RGP6), all three enzymes are type I extradiol dioxygenases. BphC1-RGP6 has an internal duplication, it is a two-domain dioxygenase which forms octamers, and has Fe(II) at the catalytic site. Its N-
Probab=31.15 E-value=74 Score=20.54 Aligned_cols=20 Identities=25% Similarity=0.615 Sum_probs=16.2
Q ss_pred eEEEEECCCCcEEEEEEEcC
Q psy2711 65 GAYSYIDPSGQRRTITYTAG 84 (114)
Q Consensus 65 GsYsy~~pdG~~~~V~Y~AD 84 (114)
....+.||||...++.|.-+
T Consensus 100 ~~~~~~DPdG~~iE~~~~~~ 119 (120)
T cd07252 100 GLIRFADPDGNRHELFWGPK 119 (120)
T ss_pred EEEEEECCCCCEEEEEeccc
Confidence 36899999999988877544
No 51
>PHA02698 hypothetical protein; Provisional
Probab=31.08 E-value=29 Score=22.78 Aligned_cols=25 Identities=28% Similarity=0.608 Sum_probs=19.5
Q ss_pred CCeeeccCCCCCCCCCCCCCCCCCC
Q psy2711 86 NGFQASGDDIPVAPPAPPAPAPVAP 110 (114)
Q Consensus 86 ~Gf~~~~~~lP~~pp~p~~~~~~~~ 110 (114)
.|-|+.|+..|+|-|+|+.-..-+|
T Consensus 19 ddvqvnGDlmPtp~p~peeV~~CsP 43 (89)
T PHA02698 19 DDVQVNGDLMPTPEPTPEEVPQCSP 43 (89)
T ss_pred hccccCCcccCCCCCChhhhccCCH
Confidence 4778899999999998877655444
No 52
>KOG4327|consensus
Probab=30.89 E-value=38 Score=26.05 Aligned_cols=16 Identities=38% Similarity=0.787 Sum_probs=11.2
Q ss_pred CCCCCCCCCCCCCCCC
Q psy2711 94 DIPVAPPAPPAPAPVA 109 (114)
Q Consensus 94 ~lP~~pp~p~~~~~~~ 109 (114)
..|.+||+|+.+.|+.
T Consensus 168 fmpppPP~pp~i~p~~ 183 (218)
T KOG4327|consen 168 FMPPPPPMPPPICPDS 183 (218)
T ss_pred cCCCCCCCCcccCCCC
Confidence 3677777777777764
No 53
>PF03562 MltA: MltA specific insert domain; InterPro: IPR005300 This group of proteins includes MltA; a membrane-bound, murein degrading transglycosylase enzyme which plays an important role in the controlled growth of the stress-bearing sacculus of Escherichia coli [, ].; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds; PDB: 2PI8_D 2AE0_X 2PIC_A 2GAE_A 2PJJ_A 3CZB_A 2G6G_A 2PNW_A 2G5D_A.
Probab=30.69 E-value=33 Score=25.10 Aligned_cols=31 Identities=16% Similarity=0.311 Sum_probs=22.0
Q ss_pred eEEeEEEEECCCCcEEEEEEEcC-CCCeeecc
Q psy2711 62 NRRGAYSYIDPSGQRRTITYTAG-KNGFQASG 92 (114)
Q Consensus 62 ~v~GsYsy~~pdG~~~~V~Y~AD-~~Gf~~~~ 92 (114)
.|+||=....|||+.+.|.|-+. ..-|+..|
T Consensus 83 qIQGSGrl~l~DG~~~rvgYAg~NG~pY~sIG 114 (158)
T PF03562_consen 83 QIQGSGRLRLPDGSTVRVGYAGQNGHPYTSIG 114 (158)
T ss_dssp HHHSEEEEE-TTSSEEEEEEEEE-SS----HH
T ss_pred eecCcEEEEcCCCCEEEEEEcccCCCccccHH
Confidence 48899999999999999999775 35677665
No 54
>PF10584 Proteasome_A_N: Proteasome subunit A N-terminal signature; InterPro: IPR000426 The proteasome (or macropain) (3.4.25.1 from EC) [, , , , ] is a eukaryotic and archaeal multicatalytic proteinase complex that seems to be involved in an ATP/ubiquitin-dependent nonlysosomal proteolytic pathway. In eukaryotes the proteasome is composed of about 28 distinct subunits which form a highly ordered ring-shaped structure (20S ring) of about 700 kDa. Most proteasome subunits can be classified, on the basis on sequence similarities into two groups, alpha (A) and beta (B). This family contains the alpha subunit sequences which range from 210 to 290 amino acids. These sequences are classified as non-peptidase homologues in MEROPS peptidase family T1 (clan PB(T)). ; GO: 0004175 endopeptidase activity, 0006511 ubiquitin-dependent protein catabolic process, 0019773 proteasome core complex, alpha-subunit complex; PDB: 3H4P_M 1IRU_O 3UN4_U 1FNT_A 3OEV_G 3OEU_U 3SDK_U 3DY3_G 3MG7_G 3L5Q_C ....
Probab=30.68 E-value=1.8 Score=21.97 Aligned_cols=11 Identities=27% Similarity=0.670 Sum_probs=7.8
Q ss_pred CCCCcEEEEEE
Q psy2711 71 DPSGQRRTITY 81 (114)
Q Consensus 71 ~pdG~~~~V~Y 81 (114)
+|||++..|+|
T Consensus 9 Sp~Grl~QVEY 19 (23)
T PF10584_consen 9 SPDGRLFQVEY 19 (23)
T ss_dssp BTTSSBHHHHH
T ss_pred CCCCeEEeeEe
Confidence 58888776665
No 55
>PRK11528 hypothetical protein; Provisional
Probab=30.27 E-value=40 Score=26.36 Aligned_cols=13 Identities=15% Similarity=0.319 Sum_probs=8.7
Q ss_pred CchHHHHHHHHHH
Q psy2711 1 MEKLVFLFAIASL 13 (114)
Q Consensus 1 M~k~v~~~~l~a~ 13 (114)
|+|.+++++|+++
T Consensus 1 ~~k~llal~l~a~ 13 (254)
T PRK11528 1 MRKSLLALTLLAT 13 (254)
T ss_pred CchHHHHHHHHHH
Confidence 7887777666543
No 56
>PRK10290 superoxide dismutase; Provisional
Probab=29.94 E-value=1.2e+02 Score=22.43 Aligned_cols=14 Identities=7% Similarity=0.209 Sum_probs=6.5
Q ss_pred CchHHHHHHHHHHh
Q psy2711 1 MEKLVFLFAIASLS 14 (114)
Q Consensus 1 M~k~v~~~~l~a~~ 14 (114)
|.++++++..+.+|
T Consensus 1 ~~~~~~~~~~~~~~ 14 (173)
T PRK10290 1 MKRFSLAILALVVC 14 (173)
T ss_pred CcchHHHHHHHHhh
Confidence 55665444433333
No 57
>PRK10397 lipoprotein; Provisional
Probab=29.62 E-value=40 Score=24.18 Aligned_cols=14 Identities=36% Similarity=0.432 Sum_probs=10.0
Q ss_pred CchHHHHHHHHHHh
Q psy2711 1 MEKLVFLFAIASLS 14 (114)
Q Consensus 1 M~k~v~~~~l~a~~ 14 (114)
|+|++++.++++.+
T Consensus 1 mKKii~~~a~~~La 14 (137)
T PRK10397 1 MKKLAIAGALMALA 14 (137)
T ss_pred CchhHHHHHHHHhh
Confidence 88988877765543
No 58
>PF11106 YjbE: Exopolysaccharide production protein YjbE
Probab=29.62 E-value=47 Score=21.71 Aligned_cols=8 Identities=25% Similarity=0.721 Sum_probs=5.7
Q ss_pred CchHHHHH
Q psy2711 1 MEKLVFLF 8 (114)
Q Consensus 1 M~k~v~~~ 8 (114)
|||++..+
T Consensus 1 MKK~~~~~ 8 (80)
T PF11106_consen 1 MKKIIYGL 8 (80)
T ss_pred ChhHHHHH
Confidence 88887643
No 59
>smart00817 Amelin Ameloblastin precursor (Amelin). This family consists of several mammalian Ameloblastin precursor (Amelin) proteins. Matrix proteins of tooth enamel consist mainly of amelogenin but also of non-amelogenin proteins, which, although their volumetric percentage is low, have an important role in enamel mineralisation. One of the non-amelogenin proteins is ameloblastin, also known as amelin and sheathlin. Ameloblastin (AMBN) is one of the enamel sheath proteins which is though to have a role in determining the prismatic structure of growing enamel crystals.
Probab=29.27 E-value=47 Score=27.69 Aligned_cols=16 Identities=25% Similarity=0.449 Sum_probs=12.3
Q ss_pred CchHHHHHHHHHHhhc
Q psy2711 1 MEKLVFLFAIASLSVA 16 (114)
Q Consensus 1 M~k~v~~~~l~a~~aa 16 (114)
|+.|++++||+.++.|
T Consensus 1 M~~~iL~~cLl~~afA 16 (411)
T smart00817 1 MKDLALVLCLLKTSFA 16 (411)
T ss_pred CcchhHHHHHHhhhhc
Confidence 7889999998875443
No 60
>COG2834 LolA Outer membrane lipoprotein-sorting protein [Cell envelope biogenesis, outer membrane]
Probab=29.18 E-value=85 Score=23.30 Aligned_cols=21 Identities=24% Similarity=0.300 Sum_probs=14.0
Q ss_pred ECCCCcEEEEEEEcCCCCeee
Q psy2711 70 IDPSGQRRTITYTAGKNGFQA 90 (114)
Q Consensus 70 ~~pdG~~~~V~Y~AD~~Gf~~ 90 (114)
.+||-...++++..+..+.-+
T Consensus 93 ydp~~~q~~~~~~~~~~~~tp 113 (211)
T COG2834 93 YDPDLEQVTKTWLSEATGNTP 113 (211)
T ss_pred ECCCCceEEEEecCCCCCCCc
Confidence 467777777777777654444
No 61
>COG5633 Predicted periplasmic lipoprotein [General function prediction only]
Probab=29.15 E-value=2.1e+02 Score=20.21 Aligned_cols=18 Identities=11% Similarity=0.172 Sum_probs=12.3
Q ss_pred CCeEEeEEEEECCCCcEE
Q psy2711 60 DGNRRGAYSYIDPSGQRR 77 (114)
Q Consensus 60 ~g~v~GsYsy~~pdG~~~ 77 (114)
...++=.+-|.|.+|-.+
T Consensus 69 pv~v~YrfYWYD~~Gle~ 86 (123)
T COG5633 69 PVTVHYRFYWYDAQGLEQ 86 (123)
T ss_pred ceEEEEEEEEEcCCCcee
Confidence 345667777788888754
No 62
>PRK13791 lysozyme inhibitor; Provisional
Probab=29.14 E-value=61 Score=22.36 Aligned_cols=8 Identities=13% Similarity=0.268 Sum_probs=4.1
Q ss_pred ceeeeEec
Q psy2711 40 QFGSAYTQ 47 (114)
Q Consensus 40 ~y~f~y~~ 47 (114)
++.=.|-.
T Consensus 41 ~l~V~y~n 48 (113)
T PRK13791 41 VLEVIYVN 48 (113)
T ss_pred eEEEEEeC
Confidence 35555554
No 63
>PF13127 DUF3955: Protein of unknown function (DUF3955)
Probab=29.07 E-value=33 Score=21.24 Aligned_cols=9 Identities=22% Similarity=0.479 Sum_probs=4.9
Q ss_pred CchHHHHHH
Q psy2711 1 MEKLVFLFA 9 (114)
Q Consensus 1 M~k~v~~~~ 9 (114)
|+|+.+.+.
T Consensus 1 m~~~~l~~~ 9 (63)
T PF13127_consen 1 MKKYILSLI 9 (63)
T ss_pred CcchHHHHH
Confidence 666655444
No 64
>PHA02097 hypothetical protein
Probab=28.83 E-value=1.1e+02 Score=18.65 Aligned_cols=24 Identities=21% Similarity=0.200 Sum_probs=18.9
Q ss_pred EEEECCCCcEEEEEEEcCCCCeee
Q psy2711 67 YSYIDPSGQRRTITYTAGKNGFQA 90 (114)
Q Consensus 67 Ysy~~pdG~~~~V~Y~AD~~Gf~~ 90 (114)
+.-..|.|.+++|.|+-|..-|..
T Consensus 14 ~amntp~gv~iri~~tf~~~~f~~ 37 (59)
T PHA02097 14 SAMNTPGGVIIRIAHTFDVSNFKI 37 (59)
T ss_pred EEeeCCCcEEEEEEeEEeeccceE
Confidence 456679999999999988666654
No 65
>PF09244 DUF1964: Domain of unknown function (DUF1964); InterPro: IPR015325 This domain is C-terminal to the catalytic sucrose phosphorylase beta/alpha barrel domain. It adopts a beta-sandwich fold, with Greek-key topology and is functionally uncharacterised []. ; PDB: 1R7A_B 2GDU_A 2GDV_A.
Probab=28.20 E-value=69 Score=20.19 Aligned_cols=21 Identities=29% Similarity=0.471 Sum_probs=13.7
Q ss_pred eEEEEECCCCcEEEEEEEcCC
Q psy2711 65 GAYSYIDPSGQRRTITYTAGK 85 (114)
Q Consensus 65 GsYsy~~pdG~~~~V~Y~AD~ 85 (114)
|.|||...++.-++.+|+..+
T Consensus 4 G~FSy~~dgdtSitf~W~g~~ 24 (68)
T PF09244_consen 4 GEFSYEADGDTSITFTWTGAT 24 (68)
T ss_dssp SEEEEEEETTTEEEEEEE-SS
T ss_pred ceeeEecCCCcEEEEEEeccc
Confidence 667777766677777777653
No 66
>PF05968 Bacillus_PapR: Bacillus PapR protein; InterPro: IPR009239 This family consists of the Bacillus species-specific PapR protein. The papR gene belongs to the PlcR regulon and is located 70 bp downstream from plcR. It encodes a 48-amino-acid peptide. Disruption of the papR gene abolishes expression of the PlcR regulon, resulting in a large decrease in haemolysis and virulence in insect larvae. A processed form of PapR activates the PlcR regulon by allowing PlcR to bind to its DNA target. This activating mechanism is strain specific [].
Probab=28.03 E-value=54 Score=19.35 Aligned_cols=10 Identities=30% Similarity=0.331 Sum_probs=7.5
Q ss_pred CchHHHHHHH
Q psy2711 1 MEKLVFLFAI 10 (114)
Q Consensus 1 M~k~v~~~~l 10 (114)
|+||++...|
T Consensus 1 mkkll~~sll 10 (48)
T PF05968_consen 1 MKKLLIGSLL 10 (48)
T ss_pred CchHHHhHHH
Confidence 8888876665
No 67
>PRK10449 heat-inducible protein; Provisional
Probab=27.90 E-value=46 Score=23.35 Aligned_cols=9 Identities=56% Similarity=0.671 Sum_probs=6.1
Q ss_pred CchHHHHHH
Q psy2711 1 MEKLVFLFA 9 (114)
Q Consensus 1 M~k~v~~~~ 9 (114)
|+|++++++
T Consensus 1 mk~~~~~~~ 9 (140)
T PRK10449 1 MKKVVALVA 9 (140)
T ss_pred ChhHHHHHH
Confidence 888875544
No 68
>PF10956 DUF2756: Protein of unknown function (DUF2756); InterPro: IPR020158 This entry contains proteins with no known function.
Probab=27.80 E-value=37 Score=23.26 Aligned_cols=12 Identities=33% Similarity=0.587 Sum_probs=8.8
Q ss_pred CchHHHHHHHHH
Q psy2711 1 MEKLVFLFAIAS 12 (114)
Q Consensus 1 M~k~v~~~~l~a 12 (114)
|++++++++|+-
T Consensus 1 MK~ll~laalLP 12 (104)
T PF10956_consen 1 MKRLLILAALLP 12 (104)
T ss_pred ChHHHHHHHHcc
Confidence 788887777654
No 69
>cd08344 MhqB_like_N N-terminal domain of MhqB, a type I extradiol dioxygenase, and similar proteins. This subfamily contains the N-terminal, non-catalytic, domain of Burkholderia sp. NF100 MhqB and similar proteins. MhqB is a type I extradiol dioxygenase involved in the catabolism of methylhydroquinone, an intermediate in the degradation of fenitrothion. The purified enzyme has shown extradiol ring cleavage activity toward 3-methylcatechol. Fe2+ was suggested as a cofactor, the same as most other enzymes in the family. Burkholderia sp. NF100 MhqB is encoded on the plasmid pNF1. The type I family of extradiol dioxygenases contains two structurally homologous barrel-shaped domains at the N- and C-terminal. The active-site metal is located in the C-terminal barrel and plays an essential role in the catalytic mechanism.
Probab=27.36 E-value=79 Score=20.07 Aligned_cols=16 Identities=19% Similarity=0.347 Sum_probs=13.7
Q ss_pred EEEEECCCCcEEEEEE
Q psy2711 66 AYSYIDPSGQRRTITY 81 (114)
Q Consensus 66 sYsy~~pdG~~~~V~Y 81 (114)
++.+.||||.++++.+
T Consensus 93 ~~~~~DP~Gn~iel~~ 108 (112)
T cd08344 93 GVWFRDPDGNLLQVKV 108 (112)
T ss_pred EEEEECCCCCEEEEec
Confidence 5789999999998874
No 70
>TIGR03836 termin_org_HMW1 cytadherence high molecular weight protein 1 N-terminal region. This model describes the N-terminal region of the Mycoplasma cytadherence protein HMW1, up to but not including the first EAGR box domain. The apparent orthologs in different Mycoplasma species differ profoundly in archictecture C-terminally to the region described here.
Probab=27.30 E-value=57 Score=21.15 Aligned_cols=20 Identities=25% Similarity=0.514 Sum_probs=15.3
Q ss_pred CCCCcEEEEEEEcCCCCeeec
Q psy2711 71 DPSGQRRTITYTAGKNGFQAS 91 (114)
Q Consensus 71 ~pdG~~~~V~Y~AD~~Gf~~~ 91 (114)
-.||. +.|-|-||.|||-..
T Consensus 29 ~eDg~-ikvAyDaDgNgYYIa 48 (82)
T TIGR03836 29 PEDGS-IKVAYDADGNGYYIA 48 (82)
T ss_pred CCCCc-EEEEEecCCCEEEEE
Confidence 34665 589999999998764
No 71
>PRK15184 curli production assembly/transport protein CsgG; Provisional
Probab=26.43 E-value=63 Score=25.85 Aligned_cols=12 Identities=33% Similarity=0.600 Sum_probs=8.3
Q ss_pred CchHHHHHHHHH
Q psy2711 1 MEKLVFLFAIAS 12 (114)
Q Consensus 1 M~k~v~~~~l~a 12 (114)
|++++++++++.
T Consensus 1 m~~~~~~~~~~~ 12 (277)
T PRK15184 1 MQRLFLLVAVML 12 (277)
T ss_pred CchhHHHHHHHH
Confidence 888777666543
No 72
>COG3637 Opacity protein and related surface antigens [Cell envelope biogenesis, outer membrane]
Probab=26.14 E-value=49 Score=24.56 Aligned_cols=12 Identities=42% Similarity=0.484 Sum_probs=7.1
Q ss_pred CchHHHHHHHHH
Q psy2711 1 MEKLVFLFAIAS 12 (114)
Q Consensus 1 M~k~v~~~~l~a 12 (114)
|++++++++|++
T Consensus 1 mk~~l~~a~~~~ 12 (199)
T COG3637 1 MKKLLAAAALAA 12 (199)
T ss_pred ChhHHHHHHHHH
Confidence 677665555544
No 73
>PF09242 FCSD-flav_bind: Flavocytochrome c sulphide dehydrogenase, flavin-binding; InterPro: IPR015323 This entry represents the flavin-binding domain of flavocytochrome c sulphide dehydrogenase (FCSD), enzymes found in sulphur-oxidising bacteria such as the purple phototrophic bacteria Chromatium vinosum [, ]. These enzymes are complexes of flavoprotein and a dihaem cytochrome that carry out hydrogen sulphide-dependent cytochrome C reduction. The dihaem cytochrome folds into two domains, each of which resembles mitochondrial cytochrome c, with the two haem groups bound to the interior of the subunit. The flavoprotein subunit has a glutathione reductase-like fold consisting of a beta(3,4)-alpha(3) core, and an alpha+beta sandwich. The active site of the flavoprotein subunit contains a catalytically important disulphide bridge located above the pyrimidine portion of the flavin ring []. Electrons are transferred from the flavin to one of the haem groups in the cytochrome. This entry represents a flavoprotein domain required for binding to flavin, and subsequent electron transfer. ; PDB: 1FCD_A.
Probab=26.07 E-value=1e+02 Score=19.25 Aligned_cols=27 Identities=15% Similarity=0.207 Sum_probs=19.3
Q ss_pred eEEEEECCC-CcEEEEEEEcCCCCeeec
Q psy2711 65 GAYSYIDPS-GQRRTITYTAGKNGFQAS 91 (114)
Q Consensus 65 GsYsy~~pd-G~~~~V~Y~AD~~Gf~~~ 91 (114)
=.||+++|| |-.+.-.|.-++.++...
T Consensus 9 tCYSlv~p~~aIsVa~vy~~~~~~i~~v 36 (71)
T PF09242_consen 9 TCYSLVAPDYAISVAGVYRYDDGKIVEV 36 (71)
T ss_dssp EEEEEEETTEEEEEEEEEEE-TTTEEE-
T ss_pred eEEEEeCCCceEEEEEEEEECCCeEEEe
Confidence 368899888 777777888887776544
No 74
>PRK11627 hypothetical protein; Provisional
Probab=25.92 E-value=54 Score=24.57 Aligned_cols=8 Identities=25% Similarity=0.426 Sum_probs=5.2
Q ss_pred EEeEEEEE
Q psy2711 63 RRGAYSYI 70 (114)
Q Consensus 63 v~GsYsy~ 70 (114)
.+|.++|.
T Consensus 115 ~~g~~~~~ 122 (192)
T PRK11627 115 SEGNVRYN 122 (192)
T ss_pred ccceeeec
Confidence 35777665
No 75
>KOG4737|consensus
Probab=25.59 E-value=27 Score=28.22 Aligned_cols=17 Identities=24% Similarity=0.358 Sum_probs=13.8
Q ss_pred CchHHHHHHHHHHhhcc
Q psy2711 1 MEKLVFLFAIASLSVAQ 17 (114)
Q Consensus 1 M~k~v~~~~l~a~~aa~ 17 (114)
|+.|+++.+|+|+|++.
T Consensus 1 m~vfvl~salvAa~~as 17 (326)
T KOG4737|consen 1 MKVFVLVSALVAACAAS 17 (326)
T ss_pred CeEEeeHHHHHHHHhcc
Confidence 77788888899888764
No 76
>PRK12698 flgH flagellar basal body L-ring protein; Reviewed
Probab=25.45 E-value=54 Score=25.19 Aligned_cols=9 Identities=22% Similarity=0.467 Sum_probs=4.8
Q ss_pred CchHHHHHH
Q psy2711 1 MEKLVFLFA 9 (114)
Q Consensus 1 M~k~v~~~~ 9 (114)
|||.+++++
T Consensus 1 ~~~~~~~~~ 9 (224)
T PRK12698 1 MARYILLAL 9 (224)
T ss_pred ChhHHHHHH
Confidence 666555444
No 77
>PRK06724 hypothetical protein; Provisional
Probab=25.41 E-value=96 Score=20.93 Aligned_cols=21 Identities=33% Similarity=0.722 Sum_probs=15.7
Q ss_pred EeEEE--EECCCCcEEEEEEEcC
Q psy2711 64 RGAYS--YIDPSGQRRTITYTAG 84 (114)
Q Consensus 64 ~GsYs--y~~pdG~~~~V~Y~AD 84 (114)
.|.|+ +.||||..+++.|.++
T Consensus 103 ~g~~~~~f~DPdG~~iEl~~~~~ 125 (128)
T PRK06724 103 EGYYTIDFYDPNGFIIEVAYTPN 125 (128)
T ss_pred CCEEEEEEECCCCCEEEEEeCCC
Confidence 34443 7799999999988754
No 78
>PF08194 DIM: DIM protein; InterPro: IPR013172 Drosophila immune-induced molecules (DIMs) are short proteins induced during the immune response of Drosophila []. This entry includes DIMs 1 to 4 and DIM23.
Probab=25.18 E-value=79 Score=17.66 Aligned_cols=6 Identities=33% Similarity=0.429 Sum_probs=3.1
Q ss_pred CchHHH
Q psy2711 1 MEKLVF 6 (114)
Q Consensus 1 M~k~v~ 6 (114)
|+.|.+
T Consensus 1 Mk~l~~ 6 (36)
T PF08194_consen 1 MKCLSL 6 (36)
T ss_pred CceeHH
Confidence 555544
No 79
>PRK15346 outer membrane secretin SsaC; Provisional
Probab=25.03 E-value=48 Score=28.11 Aligned_cols=12 Identities=33% Similarity=0.462 Sum_probs=7.4
Q ss_pred CchHHHHHHHHH
Q psy2711 1 MEKLVFLFAIAS 12 (114)
Q Consensus 1 M~k~v~~~~l~a 12 (114)
|||++++.+|+.
T Consensus 1 ~~~~~~~~~~~~ 12 (499)
T PRK15346 1 MKKLLILIFLFL 12 (499)
T ss_pred CchhHHHHHHHH
Confidence 888766554433
No 80
>PF05593 RHS_repeat: RHS Repeat; InterPro: IPR006530 These sequences contain two tandem copies of a 21-residue extracellular repeat that is found in Gram-negative, Gram-positive, and animal proteins. The repeat is named for a YD dipeptide, the most strongly conserved motif of the repeat. These repeats appear in general to be involved in binding carbohydrate; the chicken teneurin-1 YD-repeat region has been shown to bind heparin [, , ].
Probab=24.88 E-value=1.2e+02 Score=16.22 Aligned_cols=8 Identities=38% Similarity=0.829 Sum_probs=3.7
Q ss_pred ECCCCcEE
Q psy2711 70 IDPSGQRR 77 (114)
Q Consensus 70 ~~pdG~~~ 77 (114)
++|+|...
T Consensus 11 ~d~~G~~~ 18 (38)
T PF05593_consen 11 TDPDGRTT 18 (38)
T ss_pred EcCCCCEE
Confidence 34555544
No 81
>cd09013 BphC-JF8_N_like N-terminal, non-catalytic, domain of BphC_JF8, (2,3-dihydroxybiphenyl 1,2-dioxygenase) from Bacillus sp. JF8 and similar proteins. 2,3-dihydroxybiphenyl 1,2-dioxygenase (BphC) catalyzes the extradiol ring cleavage reaction of 2,3-dihydroxybiphenyl, a key step in the polychlorinated biphenyls (PCBs) degradation pathway (bph pathway). BphC belongs to the type I extradiol dioxygenase family, which requires a metal ion in the active site in its catalytic mechanism. Polychlorinated biphenyl degrading bacteria demonstrate a multiplicity of BphCs. This subfamily of BphC is represented by the enzyme purified from the thermophilic biphenyl and naphthalene degrader, Bacillus sp. JF8. The members in this family of BphC enzymes may use either Mn(II) or Fe(II) as cofactors. The enzyme purified from Bacillus sp. JF8 is Mn(II)-dependent, however, the enzyme from Rhodococcus jostii RHAI has Fe(II) bound to it. BphC_JF8 is thermostable and its optimum activity is at 85 degrees C
Probab=24.85 E-value=1e+02 Score=19.73 Aligned_cols=17 Identities=24% Similarity=0.614 Sum_probs=14.1
Q ss_pred EEEEECCCCcEEEEEEE
Q psy2711 66 AYSYIDPSGQRRTITYT 82 (114)
Q Consensus 66 sYsy~~pdG~~~~V~Y~ 82 (114)
++.+.||||...++.|-
T Consensus 102 ~~~~~DPdG~~iEl~~~ 118 (121)
T cd09013 102 AYRFRSPDGHPMELYWE 118 (121)
T ss_pred eEEEECCCCCEEEEEEe
Confidence 57799999999888763
No 82
>PF15339 Afaf: Acrosome formation-associated factor
Probab=24.66 E-value=43 Score=25.27 Aligned_cols=15 Identities=13% Similarity=0.262 Sum_probs=9.3
Q ss_pred chHHHHHHHHHHhhc
Q psy2711 2 EKLVFLFAIASLSVA 16 (114)
Q Consensus 2 ~k~v~~~~l~a~~aa 16 (114)
|.|+|++.|+++|.|
T Consensus 139 mTl~lfv~Ll~~c~a 153 (200)
T PF15339_consen 139 MTLFLFVILLAFCSA 153 (200)
T ss_pred HHHHHHHHHHHHHHH
Confidence 455666667776654
No 83
>PRK10959 outer membrane protein W; Provisional
Probab=24.39 E-value=60 Score=23.99 Aligned_cols=10 Identities=30% Similarity=0.245 Sum_probs=5.6
Q ss_pred CchHHHHHHH
Q psy2711 1 MEKLVFLFAI 10 (114)
Q Consensus 1 M~k~v~~~~l 10 (114)
|+|+++++++
T Consensus 1 Mk~~~~~~~~ 10 (212)
T PRK10959 1 MKKLTVAALA 10 (212)
T ss_pred CchHHHHHHH
Confidence 7777544333
No 84
>PF10972 DUF2803: Protein of unknown function (DUF2803); InterPro: IPR021241 This family of proteins with unknown function appears to be restricted to Gammaproteobacteria.
Probab=24.38 E-value=99 Score=23.24 Aligned_cols=15 Identities=20% Similarity=0.370 Sum_probs=10.4
Q ss_pred CchHHHHHHHHHHhh
Q psy2711 1 MEKLVFLFAIASLSV 15 (114)
Q Consensus 1 M~k~v~~~~l~a~~a 15 (114)
|++++++++|+.++.
T Consensus 1 ~~~ll~lll~~~s~~ 15 (213)
T PF10972_consen 1 MKALLLLLLLLVSPT 15 (213)
T ss_pred CHHHHHHHHHHhChh
Confidence 888887777665443
No 85
>COG4764 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=23.63 E-value=70 Score=23.97 Aligned_cols=17 Identities=24% Similarity=0.341 Sum_probs=9.7
Q ss_pred CchHHHHHHHHH--Hhhcc
Q psy2711 1 MEKLVFLFAIAS--LSVAQ 17 (114)
Q Consensus 1 M~k~v~~~~l~a--~~aa~ 17 (114)
||.++++++||+ .|+..
T Consensus 5 ~~r~~~~v~lL~LagCaTa 23 (197)
T COG4764 5 MMRLVFAVVLLALAGCATA 23 (197)
T ss_pred hHHHHHHHHHHHHhhcccC
Confidence 677766666554 45433
No 86
>PF04379 DUF525: Protein of unknown function (DUF525); InterPro: IPR007474 This domain is found in the bacterial protein ApaG and at the C termini of some F-box proteins (IPR001810 from INTERPRO). F-box proteins contain a carboxy-terminal domain that interacts with protein substrates []. The ApaG domain is ~125 amino acids in length, and is named after the bacterial ApaG protein, of which it forms the core. The Salmonella typhimurium ApaG domain protein, CorD, is involved in Co(2+) resistance and Mg(2+) efflux. Tertiary structures from different ApaG proteins show a fold of several beta-sheets. The ApaG domain may be involved in protein-protein interactions which could be implicated in substrate-specificity [, , ].; PDB: 2F1E_A 1XVS_A 1TZA_A 1XQ4_D.
Probab=23.49 E-value=1.2e+02 Score=19.93 Aligned_cols=14 Identities=29% Similarity=0.745 Sum_probs=9.5
Q ss_pred CCeEEeEEEEECCC
Q psy2711 60 DGNRRGAYSYIDPS 73 (114)
Q Consensus 60 ~g~v~GsYsy~~pd 73 (114)
.|.++|+|.+++.|
T Consensus 77 ~G~M~G~y~~~~~~ 90 (90)
T PF04379_consen 77 SGSMEGSYTMVDED 90 (90)
T ss_dssp EEEEEEEEEEEET-
T ss_pred CEEEEEEEEEEECc
Confidence 46778888877655
No 87
>PRK13682 hypothetical protein; Provisional
Probab=23.26 E-value=71 Score=19.20 Aligned_cols=13 Identities=31% Similarity=0.542 Sum_probs=9.0
Q ss_pred CchHHHHHHHHHH
Q psy2711 1 MEKLVFLFAIASL 13 (114)
Q Consensus 1 M~k~v~~~~l~a~ 13 (114)
|+++.+++.+++.
T Consensus 1 Ml~waliFliiA~ 13 (51)
T PRK13682 1 MLKWAIIFLVIAL 13 (51)
T ss_pred ChhHHHHHHHHHH
Confidence 7788777776553
No 88
>PF03032 Brevenin: Brevenin/esculentin/gaegurin/rugosin family; InterPro: IPR004275 In addition to the highly specific cell-mediated immune system, vertebrates possess an efficient host-defence mechanism against invading microorganisms which involves the synthesis of highly potent antimicrobial peptides with a large spectrum of activity. This entry represents a number of these defence peptides secreted from the skin of amphibians, including the opiate-like dermorphins and deltorphins, and the antimicrobial dermoseptins and temporins.; GO: 0006952 defense response, 0042742 defense response to bacterium, 0005576 extracellular region
Probab=23.01 E-value=66 Score=18.79 Aligned_cols=15 Identities=13% Similarity=0.195 Sum_probs=8.7
Q ss_pred CchHHHHHHHHHHhh
Q psy2711 1 MEKLVFLFAIASLSV 15 (114)
Q Consensus 1 M~k~v~~~~l~a~~a 15 (114)
|+|-++++.++..++
T Consensus 3 lKKsllLlfflG~IS 17 (46)
T PF03032_consen 3 LKKSLLLLFFLGTIS 17 (46)
T ss_pred chHHHHHHHHHHHcc
Confidence 677666666554443
No 89
>PF15016 DUF4520: Domain of unknown function (DUF4520)
Probab=22.83 E-value=2.3e+02 Score=18.57 Aligned_cols=51 Identities=22% Similarity=0.279 Sum_probs=33.5
Q ss_pred eeeeeeeccCCCCceeeeEecCCCcEEEEeec----------CCCeEEeEEEEECCCCcEEEEE
Q psy2711 27 AILSEKRYLSNNGQFGSAYTQEDGVQFTEESD----------QDGNRRGAYSYIDPSGQRRTIT 80 (114)
Q Consensus 27 ~i~~~~~~~~~~g~y~f~y~~~dg~~~~E~g~----------~~g~v~GsYsy~~pdG~~~~V~ 80 (114)
+++-++...++.|+|. .-+||..+----| ..-...|....+.|||+...|.
T Consensus 4 ~vv~~e~~i~~~GrF~---AysDgrVr~~F~Drt~L~l~~~~~~~~~~~~c~l~~pDG~~~~i~ 64 (85)
T PF15016_consen 4 PVVLEESSIPGVGRFT---AYSDGRVRVHFDDRTILTLIWNFSSREELGWCKLTFPDGQQVLIQ 64 (85)
T ss_pred ceeEEEEecCCceEEE---EEcCCeEEEEEcCCCEEEEEeCCCCcccCCEEEEEccCCCEEEEE
Confidence 3445566678888884 3356665443222 1223679999999999998886
No 90
>PF06788 UPF0257: Uncharacterised protein family (UPF0257); InterPro: IPR010646 This is a group of proteins of unknown function.; GO: 0005886 plasma membrane
Probab=22.81 E-value=2.1e+02 Score=22.32 Aligned_cols=28 Identities=21% Similarity=0.175 Sum_probs=19.1
Q ss_pred eEEeEEEEECCCCcEEEEEEEcCCCCeee
Q psy2711 62 NRRGAYSYIDPSGQRRTITYTAGKNGFQA 90 (114)
Q Consensus 62 ~v~GsYsy~~pdG~~~~V~Y~AD~~Gf~~ 90 (114)
.++.+|+|-+ +|.+...++.++++=+..
T Consensus 140 e~~~~Y~Yd~-~g~pl~~t~~~~~k~~~v 167 (236)
T PF06788_consen 140 ETKVTYRYDD-EGFPLGKTTYYDKKVLSV 167 (236)
T ss_pred CcceEEEECC-CCCccceEEecCCceEEE
Confidence 3556788877 787777777777654543
No 91
>PF05899 Cupin_3: Protein of unknown function (DUF861); InterPro: IPR008579 The function of the proteins in this entry are unknown. They contain the conserved barrel domain of the 'cupin' superfamily and members are specific to plants and bacteria.; PDB: 1RC6_A 3MYX_A 1O5U_A 2K9Z_A 1LKN_A 3ES4_A 1SFN_B 3BCW_A.
Probab=22.72 E-value=1.3e+02 Score=18.48 Aligned_cols=24 Identities=25% Similarity=0.425 Sum_probs=19.1
Q ss_pred CeEEeEEEEECCCCcEEEEEEEcCCC
Q psy2711 61 GNRRGAYSYIDPSGQRRTITYTAGKN 86 (114)
Q Consensus 61 g~v~GsYsy~~pdG~~~~V~Y~AD~~ 86 (114)
-++.|+-..++.||.. +++.|++-
T Consensus 30 ~vleG~v~it~~~G~~--~~~~aGD~ 53 (74)
T PF05899_consen 30 YVLEGEVTITDEDGET--VTFKAGDA 53 (74)
T ss_dssp EEEEEEEEEEETTTEE--EEEETTEE
T ss_pred EEEEeEEEEEECCCCE--EEEcCCcE
Confidence 3688999999999996 66677653
No 92
>PF11355 DUF3157: Protein of unknown function (DUF3157); InterPro: IPR021501 This family of proteins with unknown function appears to be restricted to Gammaproteobacteria.
Probab=22.61 E-value=1e+02 Score=23.46 Aligned_cols=8 Identities=25% Similarity=0.393 Sum_probs=6.4
Q ss_pred ceeeeEec
Q psy2711 40 QFGSAYTQ 47 (114)
Q Consensus 40 ~y~f~y~~ 47 (114)
.|.|.|-.
T Consensus 37 DFTWeYv~ 44 (199)
T PF11355_consen 37 DFTWEYVI 44 (199)
T ss_pred CceEEEEe
Confidence 59999965
No 93
>PF09919 DUF2149: Uncharacterized conserved protein (DUF2149); InterPro: IPR018676 This family of conserved hypothetical proteins has no known function.
Probab=22.39 E-value=2.4e+02 Score=18.63 Aligned_cols=19 Identities=26% Similarity=0.436 Sum_probs=11.3
Q ss_pred CCeEEeEEEEECCCCcEEEE
Q psy2711 60 DGNRRGAYSYIDPSGQRRTI 79 (114)
Q Consensus 60 ~g~v~GsYsy~~pdG~~~~V 79 (114)
.|.-.|. -|..+||+.+-|
T Consensus 72 ~G~~~G~-aYrl~~Gk~I~V 90 (92)
T PF09919_consen 72 SGERLGT-AYRLKDGKLIYV 90 (92)
T ss_pred CCeECeE-EEEcCCceEEEe
Confidence 3444454 234888988755
No 94
>cd07266 HPCD_N_class_II N-terminal domain of 3,4-dihydroxyphenylacetate 2,3-dioxygenase (HPCD); belongs to the type I class II family of extradiol dioxygenases. This subfamily contains the N-terminal, non-catalytic, domain of HPCD. HPCD catalyses the second step in the degradation of 4-hydroxyphenylacetate to succinate and pyruvate. The aromatic ring of 4-hydroxyphenylacetate is opened by this dioxygenase to yield the 3,4-diol product, 2-hydroxy-5-carboxymethylmuconate semialdehyde. HPCD is a homotetramer and each monomer contains two structurally homologous barrel-shaped domains at the N- and C-terminus. The active-site metal is located in the C-terminal barrel and plays an essential role in the catalytic mechanism. Most extradiol dioxygenases contain Fe(II) in their active site, but HPCD can be activated by either Mn(II) or Fe(II). These enzymes belong to the type I class II family of extradiol dioxygenases. The class III 3,4-dihydroxyphenylacetate 2,3-dioxygenases belong to a differ
Probab=22.29 E-value=1.3e+02 Score=19.13 Aligned_cols=16 Identities=25% Similarity=0.129 Sum_probs=12.9
Q ss_pred EEEEECCCCcEEEEEE
Q psy2711 66 AYSYIDPSGQRRTITY 81 (114)
Q Consensus 66 sYsy~~pdG~~~~V~Y 81 (114)
+..+.||||..+++.+
T Consensus 102 ~~~~~DPdG~~ve~~~ 117 (121)
T cd07266 102 ALRVEDPLGFPIEFYA 117 (121)
T ss_pred EEEEECCCCCEEEEEe
Confidence 5678899999888765
No 95
>TIGR00547 lolA periplasmic chaperone LolA. This protein, LolA, is known so far only in the gamma and beta subdivisions of the Proteobacteria. The E. coli major outer lipoprotein (Lpp) of E. coli is released from the inner membrane as a complex with this chaperone in an energy-requiring process, and is then delivered to LolB for insertion into the outer membrane. LolA is involved in the delivery of lipoproteins generally, rather than just Lpp, and is an essential protein in E. coli, unlike Lpp itself.
Probab=22.13 E-value=68 Score=23.99 Aligned_cols=12 Identities=33% Similarity=0.343 Sum_probs=7.9
Q ss_pred CchHHHHHHHHH
Q psy2711 1 MEKLVFLFAIAS 12 (114)
Q Consensus 1 M~k~v~~~~l~a 12 (114)
|+|+++++||++
T Consensus 1 mkk~~~~~~l~~ 12 (204)
T TIGR00547 1 MKKIAIKCAALS 12 (204)
T ss_pred CchHHHHHHHHH
Confidence 788777655543
No 96
>TIGR00616 rect recombinase, phage RecT family. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). RecT and homologs are found in prophage regions of bacterial genomes. RecT works with a partner protein, RecE.
Probab=22.09 E-value=1.2e+02 Score=23.64 Aligned_cols=44 Identities=14% Similarity=0.164 Sum_probs=25.6
Q ss_pred ceeeeEecCCCcEEEEeecCCCeEEeEEEEE-CCCCcEEEEEEEcC
Q psy2711 40 QFGSAYTQEDGVQFTEESDQDGNRRGAYSYI-DPSGQRRTITYTAG 84 (114)
Q Consensus 40 ~y~f~y~~~dg~~~~E~g~~~g~v~GsYsy~-~pdG~~~~V~Y~AD 84 (114)
.|+|++..+....+.+ ....|.+.|.|.+. ..||....+-.+.+
T Consensus 145 ~f~~~~g~~~~~~h~p-~~~~g~iiG~YA~~~l~dG~~~~~~Mt~e 189 (241)
T TIGR00616 145 LFKWEPGLDEKLIHRP-GRESDAPVGYYAVAELKDGGRKFVVMTKA 189 (241)
T ss_pred eEEeccCCCCccccCC-CCCCCCEEEEEEEEEEcCCCEEEEEeeHH
Confidence 4555554433333222 23446789999954 58888766655544
No 97
>cd07244 FosA FosA, a Fosfomycin resistance protein, catalyzes the addition of glutathione to the antibiotic fosfomycin, making it inactive. This subfamily family contains FosA, a fosfomycin resistant protein. Fosfomycin inhibits the enzyme UDP-N-acetylglucosamine-3-enolpyruvyltransferase (MurA), which catalyzes the first committed step in bacterial cell wall biosynthesis. FosA, catalyzes the addition of glutathione to the antibiotic fosfomycin, (1R,2S)-epoxypropylphosphonic acid, making it inactive. FosA is a Mn(II) dependent enzyme. It is evolutionarily related to glyoxalase I and type I extradiol dioxygenases.
Probab=22.07 E-value=1.1e+02 Score=19.60 Aligned_cols=18 Identities=17% Similarity=0.641 Sum_probs=14.4
Q ss_pred EEEEECCCCcEEEEEEEc
Q psy2711 66 AYSYIDPSGQRRTITYTA 83 (114)
Q Consensus 66 sYsy~~pdG~~~~V~Y~A 83 (114)
+..+.||||..+++.+.+
T Consensus 94 ~~~f~DPdG~~ie~~~~~ 111 (121)
T cd07244 94 SFYFLDPDGHKLELHVGS 111 (121)
T ss_pred EEEEECCCCCEEEEEeCC
Confidence 567899999999888654
No 98
>PF05643 DUF799: Putative bacterial lipoprotein (DUF799); InterPro: IPR008517 This family consists of several bacterial proteins of unknown function. Some of the family members are described as putative lipoproteins.
Probab=21.87 E-value=65 Score=24.82 Aligned_cols=9 Identities=44% Similarity=0.659 Sum_probs=7.0
Q ss_pred CchHHHHHH
Q psy2711 1 MEKLVFLFA 9 (114)
Q Consensus 1 M~k~v~~~~ 9 (114)
|+|++++++
T Consensus 1 mk~l~~~l~ 9 (215)
T PF05643_consen 1 MKKLILGLA 9 (215)
T ss_pred ChhHHHHHH
Confidence 888887665
No 99
>cd08357 Glo_EDI_BRP_like_18 This conserved domain belongs to a superfamily including the bleomycin resistance protein, glyoxalase I, and type I ring-cleaving dioxygenases. This protein family belongs to a conserved domain superfamily that is found in a variety of structurally related metalloproteins, including the bleomycin resistance protein, glyoxalase I, and type I ring-cleaving dioxygenases. A bound metal ion is required for protein activities for the members of this superfamily. A variety of metal ions have been found in the catalytic centers of these proteins including Fe(II), Mn(II), Zn(II), Ni(II) and Mg(II). The protein superfamily contains members with or without domain swapping. The proteins of this family share three conserved metal binding amino acids with the type I extradiol dioxygenases, which shows no domain swapping.
Probab=21.63 E-value=1.3e+02 Score=18.98 Aligned_cols=15 Identities=27% Similarity=0.401 Sum_probs=11.6
Q ss_pred EEEECCCCcEEEEEE
Q psy2711 67 YSYIDPSGQRRTITY 81 (114)
Q Consensus 67 Ysy~~pdG~~~~V~Y 81 (114)
..+.||||..++++.
T Consensus 110 ~~~~DPdG~~iE~~~ 124 (125)
T cd08357 110 FFLKDPSGNALEFKA 124 (125)
T ss_pred EEEECCCCCEEEEee
Confidence 568899998887763
No 100
>PHA02978 hypothetical protein; Provisional
Probab=21.56 E-value=86 Score=21.97 Aligned_cols=19 Identities=26% Similarity=0.408 Sum_probs=14.4
Q ss_pred eEEeEEEEECCCCcEEEEE
Q psy2711 62 NRRGAYSYIDPSGQRRTIT 80 (114)
Q Consensus 62 ~v~GsYsy~~pdG~~~~V~ 80 (114)
...=.+||.+|||-.+.|+
T Consensus 72 ~iiiy~sy~~~~gisiqvs 90 (135)
T PHA02978 72 EIIIYFSYADPDGISIQVS 90 (135)
T ss_pred EEEEEEEecCCCceEEEEe
Confidence 3445667899999988875
No 101
>cd09012 Glo_EDI_BRP_like_24 This conserved domain belongs to a superfamily including the bleomycin resistance protein, glyoxalase I, and type I ring-cleaving dioxygenases. This protein family belongs to a conserved domain superfamily that is found in a variety of structurally related metalloproteins, including the bleomycin resistance protein, glyoxalase I, and type I ring-cleaving dioxygenases. A bound metal ion is required for protein activities for the members of this superfamily. A variety of metal ions have been found in the catalytic centers of these proteins including Fe(II), Mn(II), Zn(II), Ni(II) and Mg(II). The protein superfamily contains members with or without domain swapping.
Probab=20.73 E-value=1.4e+02 Score=19.14 Aligned_cols=15 Identities=20% Similarity=0.388 Sum_probs=12.2
Q ss_pred EEEECCCCcEEEEEE
Q psy2711 67 YSYIDPSGQRRTITY 81 (114)
Q Consensus 67 Ysy~~pdG~~~~V~Y 81 (114)
..+.||||...++.|
T Consensus 109 ~~~~DPdG~~ie~~~ 123 (124)
T cd09012 109 RSFADLDGHLWEVLW 123 (124)
T ss_pred EEEECCCCCEEEEEE
Confidence 348999999988876
No 102
>cd08361 PpCmtC_N N-terminal domain of 2,3-dihydroxy-p-cumate-3,4-dioxygenase (PpCmtC). This subfamily contains the N-terminal, non-catalytic, domain of PpCmtC. 2,3-dihydroxy-p-cumate-3,4-dioxygenase (CmtC of Pseudomonas putida F1) is a dioxygenase involved in the eight-step catabolism pathway of p-cymene. CmtC acts upon the reaction intermediate 2,3-dihydroxy-p-cumate, yielding 2-hydroxy-3-carboxy-6-oxo-7-methylocta-2,4-dienoate. The CmtC belongs to the type I family of extradiol dioxygenases. Fe2+ was suggested as a cofactor, same as other enzymes in the family. The type I family of extradiol dioxygenases contains two structurally homologous barrel-shaped domains at the N- and C-terminal. The active-site metal is located in the C-terminal barrel and plays an essential role in the catalytic mechanism.
Probab=20.60 E-value=1.5e+02 Score=19.24 Aligned_cols=16 Identities=25% Similarity=0.495 Sum_probs=13.1
Q ss_pred EEEECCCCcEEEEEEE
Q psy2711 67 YSYIDPSGQRRTITYT 82 (114)
Q Consensus 67 Ysy~~pdG~~~~V~Y~ 82 (114)
..+.||||..+++.+.
T Consensus 104 ~~f~DPdG~~iE~~~~ 119 (124)
T cd08361 104 IAFRDPSGNSIELVVR 119 (124)
T ss_pred EEEECcCCCEEEEEEe
Confidence 3689999999988754
No 103
>PF12771 SusD-like_2: Starch-binding associating with outer membrane; PDB: 3EJN_A 3FDH_A 3MX3_A.
Probab=20.57 E-value=33 Score=28.71 Aligned_cols=10 Identities=20% Similarity=0.733 Sum_probs=0.0
Q ss_pred CchHHHHHHH
Q psy2711 1 MEKLVFLFAI 10 (114)
Q Consensus 1 M~k~v~~~~l 10 (114)
|||.|++++|
T Consensus 1 MKK~Il~i~l 10 (488)
T PF12771_consen 1 MKKIILIILL 10 (488)
T ss_dssp ----------
T ss_pred CcHHHHHHHH
Confidence 8898776554
No 104
>cd07241 Glo_EDI_BRP_like_3 This conserved domain belongs to a superfamily including the bleomycin resistance protein, glyoxalase I, and type I ring-cleaving dioxygenases. This protein family belongs to a conserved domain superfamily that is found in a variety of structurally related metalloproteins, including the bleomycin resistance protein, glyoxalase I, and type I ring-cleaving dioxygenases. A bound metal ion is required for protein activities for the members of this superfamily. A variety of metal ions have been found in the catalytic centers of these proteins including Fe(II), Mn(II), Zn(II), Ni(II) and Mg(II). The protein superfamily contains members with or without domain swapping. The proteins of this family share three conserved metal binding amino acids with the type I extradiol dioxygenases, which shows no domain swapping.
Probab=20.52 E-value=1.1e+02 Score=19.20 Aligned_cols=11 Identities=45% Similarity=0.851 Sum_probs=8.8
Q ss_pred EECCCCcEEEE
Q psy2711 69 YIDPSGQRRTI 79 (114)
Q Consensus 69 y~~pdG~~~~V 79 (114)
+.||||..+++
T Consensus 114 ~~DPdG~~iE~ 124 (125)
T cd07241 114 ILDPEGNRIEI 124 (125)
T ss_pred EECCCCCEEEe
Confidence 78888887765
No 105
>PF12445 FliC: Flagellin protein
Probab=20.51 E-value=1.1e+02 Score=21.66 Aligned_cols=24 Identities=38% Similarity=0.462 Sum_probs=18.0
Q ss_pred EEEEECCCCcE-----EEEEEEcCCCCee
Q psy2711 66 AYSYIDPSGQR-----RTITYTAGKNGFQ 89 (114)
Q Consensus 66 sYsy~~pdG~~-----~~V~Y~AD~~Gf~ 89 (114)
.=.|+|.||.+ ++++|..+++|=.
T Consensus 78 ~~~Yvd~dG~lTt~~~~tt~Y~~~~dGsV 106 (128)
T PF12445_consen 78 ADVYVDDDGALTTTNTYTTTYFVQKDGSV 106 (128)
T ss_pred cceEEcCCCcEEecCcceEEEEEcCCCCE
Confidence 56788899865 4588998888743
Done!