Query psy16510
Match_columns 147
No_of_seqs 113 out of 1126
Neff 5.8
Searched_HMMs 46136
Date Fri Aug 16 23:10:00 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy16510.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/16510hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF00379 Chitin_bind_4: Insect 99.7 4.7E-17 1E-21 103.9 6.7 52 54-110 1-52 (52)
2 PF06004 DUF903: Bacterial pro 89.2 1.2 2.6E-05 28.3 4.8 39 56-102 2-40 (50)
3 PRK09408 ompX outer membrane p 79.0 6.3 0.00014 30.8 5.6 12 83-94 60-71 (171)
4 PF05984 Cytomega_UL20A: Cytom 78.5 6.6 0.00014 27.9 5.0 21 7-27 2-22 (100)
5 COG5510 Predicted small secret 78.3 2.3 5E-05 26.4 2.4 22 1-24 1-22 (44)
6 PF06286 Coleoptericin: Coleop 73.5 1.4 3E-05 33.3 0.6 19 1-28 1-19 (143)
7 PF06316 Ail_Lom: Enterobacter 73.2 7.7 0.00017 31.4 4.8 15 83-97 63-77 (199)
8 PF07172 GRP: Glycine rich pro 68.6 4.6 0.0001 28.8 2.3 9 17-25 14-22 (95)
9 PF10731 Anophelin: Thrombin i 66.4 8 0.00017 25.7 2.9 19 11-29 5-23 (65)
10 COG5294 Uncharacterized protei 66.3 25 0.00054 26.0 5.8 14 94-107 63-76 (113)
11 PF12930 DUF3836: Family of un 65.6 27 0.00059 26.2 6.1 20 53-72 63-82 (132)
12 COG4907 Predicted membrane pro 64.6 14 0.00031 33.7 5.1 23 83-105 51-73 (595)
13 PHA02698 hypothetical protein; 64.5 4.1 8.8E-05 28.4 1.3 21 108-128 19-39 (89)
14 PRK09980 ompL outer membrane p 62.4 40 0.00086 27.7 7.0 60 53-112 40-106 (230)
15 PRK15346 outer membrane secret 60.2 8.9 0.00019 34.5 3.1 20 1-27 1-20 (499)
16 PF15240 Pro-rich: Proline-ric 58.6 9.9 0.00021 30.3 2.7 18 11-28 2-19 (179)
17 COG3117 Uncharacterized protei 58.2 25 0.00054 28.2 5.0 25 1-26 1-25 (188)
18 PRK13254 cytochrome c-type bio 56.6 11 0.00025 28.8 2.7 21 1-22 1-21 (148)
19 PF12729 4HB_MCP_1: Four helix 55.3 10 0.00022 27.2 2.2 23 2-24 2-24 (181)
20 KOG1691|consensus 55.1 96 0.0021 25.3 7.9 31 84-114 80-110 (210)
21 TIGR03795 chp_BMA0021 conserve 54.4 21 0.00044 26.5 3.7 24 116-139 88-112 (114)
22 COG3017 LolB Outer membrane li 54.2 40 0.00086 27.4 5.6 15 12-26 10-24 (206)
23 PRK10894 lipopolysaccharide tr 53.7 24 0.00052 27.3 4.2 35 11-46 6-40 (180)
24 PRK13159 cytochrome c-type bio 52.0 13 0.00029 28.8 2.5 13 123-135 128-140 (155)
25 PF08138 Sex_peptide: Sex pept 51.8 4.8 0.0001 26.1 0.0 16 10-25 4-19 (56)
26 PF11153 DUF2931: Protein of u 51.0 62 0.0013 25.6 6.3 22 105-133 107-128 (216)
27 PF06316 Ail_Lom: Enterobacter 50.0 80 0.0017 25.5 6.7 27 81-107 49-75 (199)
28 PRK13150 cytochrome c-type bio 49.8 15 0.00033 28.6 2.5 17 123-139 135-151 (159)
29 COG3026 RseB Negative regulato 49.4 24 0.00052 30.4 3.8 22 81-102 82-103 (320)
30 PRK15401 alpha-ketoglutarate-d 47.9 44 0.00095 27.1 5.0 49 87-135 46-105 (213)
31 PF02402 Lysis_col: Lysis prot 47.8 13 0.00027 23.3 1.4 17 7-23 3-19 (46)
32 COG4939 Major membrane immunog 44.8 1.1E+02 0.0024 23.3 6.4 44 50-98 30-73 (147)
33 PRK10523 lipoprotein involved 44.5 1.8E+02 0.0039 24.0 8.1 12 45-57 61-72 (234)
34 PRK10755 sensor protein BasS/P 43.8 19 0.00041 29.6 2.3 18 1-19 1-18 (356)
35 PF03562 MltA: MltA specific i 42.8 26 0.00057 27.2 2.8 57 81-140 80-143 (158)
36 PF13511 DUF4124: Domain of un 42.4 29 0.00063 21.6 2.6 17 87-106 14-30 (60)
37 COG2332 CcmE Cytochrome c-type 42.2 1.2E+02 0.0026 23.6 6.3 14 123-136 129-142 (153)
38 PF00879 Defensin_propep: Defe 42.2 66 0.0014 20.6 4.1 12 14-25 5-16 (52)
39 smart00817 Amelin Ameloblastin 41.0 25 0.00055 31.0 2.7 18 13-30 4-21 (411)
40 PF13623 SurA_N_2: SurA N-term 40.9 22 0.00048 26.9 2.2 20 3-23 3-22 (145)
41 KOG3866|consensus 39.5 29 0.00064 30.4 2.9 26 2-27 1-26 (442)
42 PF10929 DUF2811: Protein of u 36.9 32 0.0007 22.4 2.1 20 122-141 5-25 (57)
43 TIGR01643 YD_repeat_2x YD repe 33.6 71 0.0015 18.1 3.1 20 90-111 9-28 (42)
44 PRK13165 cytochrome c-type bio 33.4 42 0.0009 26.2 2.6 16 123-138 135-150 (160)
45 TIGR00219 mreC rod shape-deter 33.3 37 0.00079 28.4 2.5 19 2-20 1-19 (283)
46 PLN03207 stomagen; Provisional 33.2 55 0.0012 23.9 3.0 26 1-26 5-30 (113)
47 PRK05461 apaG CO2+/MG2+ efflux 33.1 54 0.0012 24.4 3.1 21 81-101 93-113 (127)
48 cd07252 BphC1-RGP6_N_like N-te 33.0 60 0.0013 22.3 3.2 21 87-107 100-120 (120)
49 PF09244 DUF1964: Domain of un 32.7 52 0.0011 22.0 2.6 22 86-107 3-24 (68)
50 COG3137 Putative salt-induced 31.6 68 0.0015 26.8 3.7 15 51-65 51-65 (262)
51 PRK14818 NADH dehydrogenase su 31.6 38 0.00083 26.8 2.2 24 112-135 129-152 (173)
52 PF05581 MCP_N: Vibrio chemota 31.6 16 0.00034 26.6 0.0 24 2-25 1-24 (103)
53 cd07240 ED_TypeI_classII_N N-t 31.5 73 0.0016 21.1 3.4 21 87-107 97-117 (117)
54 PF07437 YfaZ: YfaZ precursor; 31.4 1.9E+02 0.0041 22.6 6.1 13 83-95 77-89 (180)
55 TIGR03352 VI_chp_3 type VI sec 30.9 62 0.0013 24.4 3.2 13 13-25 5-17 (146)
56 COG2967 ApaG Uncharacterized p 30.2 1.4E+02 0.003 22.5 4.8 21 81-101 92-112 (126)
57 COG1934 Uncharacterized protei 29.9 71 0.0015 25.1 3.4 22 35-56 30-51 (173)
58 KOG4824|consensus 29.4 2.9E+02 0.0064 22.6 6.9 22 82-103 65-86 (224)
59 PF03100 CcmE: CcmE; InterPro 28.7 2.4E+02 0.0051 20.7 6.2 7 2-8 1-7 (131)
60 PF05899 Cupin_3: Protein of u 28.6 1.3E+02 0.0028 19.7 4.1 30 81-112 28-57 (74)
61 PF05538 Campylo_MOMP: Campylo 28.4 1.9E+02 0.0041 26.1 6.1 15 46-61 31-45 (431)
62 COG0450 AhpC Peroxiredoxin [Po 28.2 1E+02 0.0022 24.8 4.1 48 81-138 120-167 (194)
63 PHA02978 hypothetical protein; 27.8 58 0.0012 24.2 2.4 16 87-102 75-90 (135)
64 PRK00315 potassium-transportin 27.0 42 0.00092 27.0 1.7 34 106-139 74-112 (193)
65 KOG4063|consensus 27.0 60 0.0013 25.4 2.5 17 11-27 6-22 (158)
66 PTZ00459 mucin-associated surf 26.6 43 0.00093 28.3 1.8 9 16-24 11-19 (291)
67 PF04710 Pellino: Pellino; In 26.4 1E+02 0.0022 27.6 4.1 53 60-112 46-100 (416)
68 PRK13999 potassium-transportin 26.3 44 0.00095 27.1 1.7 32 107-138 86-122 (201)
69 PF12984 DUF3868: Domain of un 26.1 39 0.00084 24.5 1.3 11 10-20 5-15 (115)
70 PF05423 Mycobact_memb: Mycoba 25.6 1.4E+02 0.0031 22.5 4.3 30 88-118 54-85 (140)
71 PF13124 DUF3963: Protein of u 25.6 69 0.0015 19.2 2.0 13 6-18 21-33 (40)
72 PF00659 POLO_box: POLO box du 25.5 1.9E+02 0.004 18.4 4.7 64 53-135 4-68 (68)
73 PF11912 DUF3430: Protein of u 25.4 57 0.0012 25.2 2.2 8 53-60 40-47 (212)
74 TIGR01655 yxeA_fam conserved h 25.0 2.7E+02 0.0058 20.0 6.6 14 93-106 63-76 (114)
75 PRK09558 ushA bifunctional UDP 24.8 57 0.0012 29.5 2.4 10 1-10 1-10 (551)
76 PRK11627 hypothetical protein; 24.3 68 0.0015 25.5 2.5 9 84-92 114-122 (192)
77 PRK14864 putative biofilm stre 23.8 85 0.0019 22.7 2.7 8 1-8 1-8 (104)
78 cd08344 MhqB_like_N N-terminal 23.6 95 0.002 20.9 2.8 16 88-103 93-108 (112)
79 COG2834 LolA Outer membrane li 23.0 1.9E+02 0.0042 22.7 4.8 30 83-112 83-113 (211)
80 PF07263 DMP1: Dentin matrix p 22.9 38 0.00083 31.0 0.9 13 14-26 7-19 (514)
81 PF07584 BatA: Aerotolerance r 22.7 84 0.0018 20.8 2.3 16 10-25 60-75 (77)
82 PRK09455 rseB anti-sigma E fac 22.7 1.4E+02 0.003 25.5 4.2 20 81-100 83-102 (319)
83 TIGR01200 Porphyromonas gingiv 22.7 96 0.0021 26.5 3.1 11 13-23 7-17 (292)
84 PHA02823 chemokine binding pro 22.6 71 0.0015 26.4 2.3 13 14-26 5-17 (255)
85 PF10902 DUF2693: Protein of u 21.7 2.5E+02 0.0055 19.5 4.6 23 52-74 1-23 (83)
86 PF03866 HAP: Hydrophobic abun 21.6 71 0.0015 24.6 1.9 17 1-25 1-17 (164)
87 COG4594 FecB ABC-type Fe3+-cit 21.6 1.8E+02 0.0039 24.9 4.5 27 65-94 36-62 (310)
88 COG3354 FlaG Putative archaeal 21.3 1.4E+02 0.003 23.2 3.5 27 36-62 50-78 (154)
89 TIGR01965 VCBS_repeat VCBS rep 21.2 1.6E+02 0.0034 21.1 3.6 20 83-102 55-74 (99)
90 PRK00178 tolB translocation pr 20.5 84 0.0018 26.8 2.5 18 10-27 5-22 (430)
91 COG3422 Uncharacterized conser 20.2 87 0.0019 20.6 1.9 22 50-71 11-33 (59)
92 cd09013 BphC-JF8_N_like N-term 20.1 1.3E+02 0.0029 20.3 3.0 16 88-103 102-117 (121)
No 1
>PF00379 Chitin_bind_4: Insect cuticle protein; InterPro: IPR000618 Insect cuticle is composed of proteins and chitin. The cuticular proteins seem to be specific to the type of cuticle (flexible or stiff) that occur at stages of the insect development. The proteins found in the flexible cuticle of larva and pupa of different insects share a conserved C-terminal section [] such a region is also found in the soft endocuticle of adults insects [] as well as in other cuticular proteins including in arachnids []. In addition, cuticular proteins share hydrophobic regions dominated by tetrapeptide repeats (A-A-P-A/V), which are presumed to be functionally important [, ]. Many insect cuticle proteins also include a 35-36 amino acid motif known as the R and R consensus. An extended form of this motif has been shown [] to bind chitin. It has no sequence similiarity to the cysteine-containing chitin-binding domain of chitinases and some peritrophic membrane proteins, suggesting that arthropods have two distinct classes of chitin-binding proteins, those with the chitin-binding domains found in lectins, chitinases and peritrophic membranes (cysCBD), and those with the type of chitin-binding domains found in cuticular proteins (non-cysCBD) []. The cuticle protein signature has been found in locust cuticle proteins 7 (LM-7), 8 (LM-8), 19 (LM-19) and endocuticle structural glycoprotein ABD-4; Hyalophora cecropia (Cecropia moth) cuticle proteins 12 and 66; Drosophila melanogaster (Fruit fly) larval cuticles proteins I, II, III and IV (LCP1 to LCP4); drosophila pupal cuticle proteins PCP, EDG-78E and EDG-84E; Manduca sexta (Tobacco hawkmoth) cuticle protein LCP-14; Tenebrio molitor (Yellow mealworm) cuticle proteins ACP-20, A1A, A2B and A3A; and Araneus diadematus (Spider) cuticle proteins ACP 11.9, ACP 12.4, ACP 12.6, ACP 15.5 and ACP 15.7.; GO: 0042302 structural constituent of cuticle
Probab=99.70 E-value=4.7e-17 Score=103.92 Aligned_cols=52 Identities=42% Similarity=0.876 Sum_probs=46.1
Q ss_pred eeEEEeeCCceeeeecceecCCCCCcccccceeeEEEEECCCCcEEEEEEEcCCCCc
Q psy16510 54 YRYSYETGNGIAANEQGYLKNPGQKDLEAQTAQGQFTYTAPDGTPITVQWFADETGF 110 (147)
Q Consensus 54 Y~~~Ye~~dG~~~~E~G~~k~~g~~~~~~~~v~GsYsy~~pdG~~~~V~Y~ADe~Gf 110 (147)
|+|+|+++|| .++|+|. .++++ ..++|+|+|+|++|||+.++|+|+||++||
T Consensus 1 Y~f~Y~~~dg-~~~e~~~---~~~~~-~~~~v~GsY~y~~pdG~~~~V~Y~Ad~~Gf 52 (52)
T PF00379_consen 1 YSFGYETSDG-SRQEEGR---PETED-EGGVVRGSYSYIDPDGQTRTVTYVADENGF 52 (52)
T ss_pred CeeEeEcCCC-CEEEEEE---cccCC-CCCEEEEEEEEECCCCCEEEEEEECCCCCC
Confidence 7899999999 8888876 23334 689999999999999999999999999998
No 2
>PF06004 DUF903: Bacterial protein of unknown function (DUF903); InterPro: IPR010305 This family consists of several small bacterial proteins several of which are classified as putative lipoproteins. The function of this family is unknown.; PDB: 2RB6_B 3BDU_F 2RA2_F 2RD1_C 3FIF_H 2JN0_A 2K57_A.
Probab=89.23 E-value=1.2 Score=28.28 Aligned_cols=39 Identities=26% Similarity=0.453 Sum_probs=28.9
Q ss_pred EEEeeCCceeeeecceecCCCCCcccccceeeEEEEECCCCcEEEEE
Q psy16510 56 YSYETGNGIAANEQGYLKNPGQKDLEAQTAQGQFTYTAPDGTPITVQ 102 (147)
Q Consensus 56 ~~Ye~~dG~~~~E~G~~k~~g~~~~~~~~v~GsYsy~~pdG~~~~V~ 102 (147)
|--.+.||....-.|..+ .-.-.|.|+|.+.+|+.+.++
T Consensus 2 yvi~t~dG~~i~t~gkP~--------~D~~tG~~~y~D~~G~~~qIn 40 (50)
T PF06004_consen 2 YVITTNDGRTIVTDGKPK--------YDKDTGMYSYTDADGKEQQIN 40 (50)
T ss_dssp EEEEETTSEEEEESSEEE--------EETTTTEEEEEBTTS-EEEEE
T ss_pred cEEEeCCCcEEEeCCCcc--------ccCCCCcEEEECCCCCEEEEc
Confidence 345678888877777764 234579999999999988875
No 3
>PRK09408 ompX outer membrane protein X; Provisional
Probab=78.99 E-value=6.3 Score=30.78 Aligned_cols=12 Identities=42% Similarity=0.598 Sum_probs=7.3
Q ss_pred cceeeEEEEECC
Q psy16510 83 QTAQGQFTYTAP 94 (147)
Q Consensus 83 ~~v~GsYsy~~p 94 (147)
+-+.||++|...
T Consensus 60 ~Gvi~sfty~~~ 71 (171)
T PRK09408 60 LGVIGSFTYTEK 71 (171)
T ss_pred EEEEEEEEEecC
Confidence 456666666654
No 4
>PF05984 Cytomega_UL20A: Cytomegalovirus UL20A protein; InterPro: IPR009245 This family consists of several Cytomegalovirus UL20A proteins. UL20A is thought to be a glycoprotein [].
Probab=78.49 E-value=6.6 Score=27.88 Aligned_cols=21 Identities=19% Similarity=0.169 Sum_probs=17.5
Q ss_pred chhhHHHHHHHHHHHHHhhcc
Q psy16510 7 DIGALAISFVALFIAVVSAQR 27 (147)
Q Consensus 7 ~~~~~~~~i~~~~la~a~A~~ 27 (147)
.|+.|++.+|++-|.+|+|++
T Consensus 2 aRRlwiLslLAVtLtVALAAP 22 (100)
T PF05984_consen 2 ARRLWILSLLAVTLTVALAAP 22 (100)
T ss_pred chhhHHHHHHHHHHHHHhhcc
Confidence 467888899999998888875
No 5
>COG5510 Predicted small secreted protein [Function unknown]
Probab=78.31 E-value=2.3 Score=26.41 Aligned_cols=22 Identities=27% Similarity=0.288 Sum_probs=9.8
Q ss_pred CcccccchhhHHHHHHHHHHHHHh
Q psy16510 1 MMKIEKDIGALAISFVALFIAVVS 24 (147)
Q Consensus 1 ~m~~~~~~~~~~~~i~~~~la~a~ 24 (147)
|||- .|-..++++++|++..+|
T Consensus 1 mmk~--t~l~i~~vll~s~llaaC 22 (44)
T COG5510 1 MMKK--TILLIALVLLASTLLAAC 22 (44)
T ss_pred CchH--HHHHHHHHHHHHHHHHHh
Confidence 6773 333333344444444443
No 6
>PF06286 Coleoptericin: Coleoptericin; InterPro: IPR009382 This family consists of several insect coleoptericin, acaloleptin, holotricin and rhinocerosin proteins which are all known to be antibacterial proteins []. These all appear to be short, glycine-rich molecules, inducible by infection.; GO: 0042742 defense response to bacterium, 0005576 extracellular region
Probab=73.46 E-value=1.4 Score=33.34 Aligned_cols=19 Identities=32% Similarity=0.303 Sum_probs=14.0
Q ss_pred CcccccchhhHHHHHHHHHHHHHhhccC
Q psy16510 1 MMKIEKDIGALAISFVALFIAVVSAQRG 28 (147)
Q Consensus 1 ~m~~~~~~~~~~~~i~~~~la~a~A~~~ 28 (147)
|||+ .|++||+++++|-+.
T Consensus 1 mmkl---------~i~~~lia~saay~v 19 (143)
T PF06286_consen 1 MMKL---------YIIFGLIALSAAYAV 19 (143)
T ss_pred CceE---------eeehhHHHHHHhhcc
Confidence 8998 777788887765443
No 7
>PF06316 Ail_Lom: Enterobacterial Ail/Lom protein; InterPro: IPR000758 Virulence-related outer membrane proteins are expressed in Gram-negative bacteria and are essential to bacterial survival within macrophages and for eukaryotic cell invasion. Members of this group include: PagC, required by Salmonella typhimurium for survival in macrophages and for virulence in mice [] Rck outer membrane protein of the S. typhimurium virulence plasmid [] Ail, a product of the Yersinia enterocolitica chromosome capable of mediating bacterial adherence to and invasion of epithelial cell lines [] OmpX from Escherichia coli that promotes adhesion to and entry into mammalian cells. It also has a role in the resistance against attack by the human complement system [] a Bacteriophage lambda outer membrane protein, Lom [] The crystal structure of OmpX from E. coli reveals that OmpX consists of an eight-stranded antiparallel all-next-neighbour beta barrel []. The structure shows two girdles of aromatic amino acid residues and a ribbon of nonpolar residues that attach to the membrane interior. The core of the barrel consists of an extended hydrogen-bonding network of highly conserved residues. OmpX thus resembles an inverse micelle. The OmpX structure shows that the membrane-spanning part of the protein is much better conserved than the extracellular loops. Moreover, these loops form a protruding beta sheet, the edge of which presumably binds to external proteins. It is suggested that this type of binding promotes cell adhesion and invasion and helps defend against the complement system. Although OmpX has the same beta-sheet topology as the structurally related outer membrane protein A (OmpA) IPR000498 from INTERPRO, their barrels differ with respect to the shear numbers and internal hydrogen-bonding networks.; GO: 0009279 cell outer membrane
Probab=73.17 E-value=7.7 Score=31.36 Aligned_cols=15 Identities=20% Similarity=0.386 Sum_probs=8.9
Q ss_pred cceeeEEEEECCCCc
Q psy16510 83 QTAQGQFTYTAPDGT 97 (147)
Q Consensus 83 ~~v~GsYsy~~pdG~ 97 (147)
+-|.|||+|...++.
T Consensus 63 lGvItSfSY~~~~~~ 77 (199)
T PF06316_consen 63 LGVITSFSYASGDDS 77 (199)
T ss_pred eeeeEEEEEeecCCC
Confidence 445666666665554
No 8
>PF07172 GRP: Glycine rich protein family; InterPro: IPR010800 This family consists of glycine rich proteins. Some of them may be involved in resistance to environmental stress [].
Probab=68.64 E-value=4.6 Score=28.75 Aligned_cols=9 Identities=22% Similarity=0.338 Sum_probs=3.4
Q ss_pred HHHHHHHhh
Q psy16510 17 ALFIAVVSA 25 (147)
Q Consensus 17 ~~~la~a~A 25 (147)
+++|+++++
T Consensus 14 A~lLlisSe 22 (95)
T PF07172_consen 14 AALLLISSE 22 (95)
T ss_pred HHHHHHHhh
Confidence 333444333
No 9
>PF10731 Anophelin: Thrombin inhibitor from mosquito; InterPro: IPR018932 Members of this family are all inhibitors of thrombin, the peptidase that is at the end of the blood coagulation cascade and which creates the clot by cleaving fibrinogen. The interaction between thrombin and fibrinogen involves two different areas of contact - via the thrombin active site and via a second substrate-binding site known as an exosite. The inhibitor acts by blocking the exosite, rather than by interacting with the active site. The inhibitors are from mosquitoes that feed on human blood and which, by inhibiting thrombin, prevent the blood from clotting and keep it flowing.
Probab=66.38 E-value=8 Score=25.71 Aligned_cols=19 Identities=21% Similarity=0.209 Sum_probs=11.0
Q ss_pred HHHHHHHHHHHHHhhccCC
Q psy16510 11 LAISFVALFIAVVSAQRGP 29 (147)
Q Consensus 11 ~~~~i~~~~la~a~A~~~~ 29 (147)
|++|-|+|+.++|.++++|
T Consensus 5 l~vialLC~aLva~vQ~AP 23 (65)
T PF10731_consen 5 LIVIALLCVALVAIVQSAP 23 (65)
T ss_pred hhHHHHHHHHHHHHHhcCc
Confidence 3445566666666666544
No 10
>COG5294 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=66.26 E-value=25 Score=26.04 Aligned_cols=14 Identities=21% Similarity=0.349 Sum_probs=12.0
Q ss_pred CCCcEEEEEEEcCC
Q psy16510 94 PDGTPITVQWFADE 107 (147)
Q Consensus 94 pdG~~~~V~Y~ADe 107 (147)
.+|+.-+|+|+|+.
T Consensus 63 ~~Gkkk~v~f~a~~ 76 (113)
T COG5294 63 KNGKKKEVKFTATH 76 (113)
T ss_pred cCCcEEEEEEEecC
Confidence 47999999999985
No 11
>PF12930 DUF3836: Family of unknown function (DUF3836); InterPro: IPR024339 This entry represents a family of bacterial proteins of unknown function.; PDB: 3MSW_A.
Probab=65.58 E-value=27 Score=26.18 Aligned_cols=20 Identities=20% Similarity=0.454 Sum_probs=9.6
Q ss_pred CeeEEEeeCCceeeeeccee
Q psy16510 53 SYRYSYETGNGIAANEQGYL 72 (147)
Q Consensus 53 ~Y~~~Ye~~dG~~~~E~G~~ 72 (147)
.|.|.|...+-....|.-.+
T Consensus 63 KY~Y~YD~~grl~eK~a~kW 82 (132)
T PF12930_consen 63 KYNYKYDDNGRLTEKEAYKW 82 (132)
T ss_dssp EEEEEEETTTTEEEEEEEEE
T ss_pred EEEEEECCCCCEeeeeeeee
Confidence 45566655444444443333
No 12
>COG4907 Predicted membrane protein [Function unknown]
Probab=64.63 E-value=14 Score=33.72 Aligned_cols=23 Identities=9% Similarity=0.195 Sum_probs=15.1
Q ss_pred cceeeEEEEECCCCcEEEEEEEc
Q psy16510 83 QTAQGQFTYTAPDGTPITVQWFA 105 (147)
Q Consensus 83 ~~v~GsYsy~~pdG~~~~V~Y~A 105 (147)
+.+.++|||++-+|..+...|.-
T Consensus 51 ~~ieeiysy~~~eG~~y~~~~rd 73 (595)
T COG4907 51 DEIEEIYSYIDAEGYGYITSLRD 73 (595)
T ss_pred ceeeeeeeeeeccccceEEEEee
Confidence 45677777777777665555543
No 13
>PHA02698 hypothetical protein; Provisional
Probab=64.48 E-value=4.1 Score=28.40 Aligned_cols=21 Identities=33% Similarity=0.727 Sum_probs=17.6
Q ss_pred CCcccccCCCCCCCCCcHHHH
Q psy16510 108 TGFHASGAHLPTPPPIPEAIL 128 (147)
Q Consensus 108 ~Gf~p~g~hlP~~pp~p~~i~ 128 (147)
.|-|+.|+..|+|-|+|+++-
T Consensus 19 ddvqvnGDlmPtp~p~peeV~ 39 (89)
T PHA02698 19 DDVQVNGDLMPTPEPTPEEVP 39 (89)
T ss_pred hccccCCcccCCCCCChhhhc
Confidence 477889999999999887764
No 14
>PRK09980 ompL outer membrane porin L; Provisional
Probab=62.40 E-value=40 Score=27.67 Aligned_cols=60 Identities=12% Similarity=0.072 Sum_probs=33.9
Q ss_pred CeeEEEeeCCceeeeecceecCCCCCc--ccccceeeEEEE-ECCCCcEEE----EEEEcCCCCccc
Q psy16510 53 SYRYSYETGNGIAANEQGYLKNPGQKD--LEAQTAQGQFTY-TAPDGTPIT----VQWFADETGFHA 112 (147)
Q Consensus 53 ~Y~~~Ye~~dG~~~~E~G~~k~~g~~~--~~~~~v~GsYsy-~~pdG~~~~----V~Y~ADe~Gf~p 112 (147)
....+...+||.-...+..++..+.++ .....+.++|.| ...|.-..+ +....+.++|+|
T Consensus 40 Ri~vsh~f~nG~Gf~~eak~~~~~dd~~~~~~~E~~~sY~~~k~~d~~tl~PG~~~~s~s~~~~Y~P 106 (230)
T PRK09980 40 MLRVGYNFDMGAGIMLTNTYTFQRDDELKHGYNEIEGWYPLFKPTDKLTIQPGGLINDKSIGSGGAV 106 (230)
T ss_pred EEEEEEEccCceeEEEEEEeccCCCccccccceEEEEEEEeEecCCCEEEecceEEEecCCCceEee
Confidence 677788888888766666664322211 135667788874 555543333 233444455555
No 15
>PRK15346 outer membrane secretin SsaC; Provisional
Probab=60.16 E-value=8.9 Score=34.49 Aligned_cols=20 Identities=15% Similarity=0.220 Sum_probs=13.7
Q ss_pred CcccccchhhHHHHHHHHHHHHHhhcc
Q psy16510 1 MMKIEKDIGALAISFVALFIAVVSAQR 27 (147)
Q Consensus 1 ~m~~~~~~~~~~~~i~~~~la~a~A~~ 27 (147)
|||. ++++|+|||+++.+++
T Consensus 1 ~~~~-------~~~~~~~~~~~~~~~~ 20 (499)
T PRK15346 1 MKKL-------LILIFLFLLNTAKFAA 20 (499)
T ss_pred Cchh-------HHHHHHHHHhhhhhhc
Confidence 7885 5677777777765544
No 16
>PF15240 Pro-rich: Proline-rich
Probab=58.62 E-value=9.9 Score=30.26 Aligned_cols=18 Identities=28% Similarity=0.329 Sum_probs=10.6
Q ss_pred HHHHHHHHHHHHHhhccC
Q psy16510 11 LAISFVALFIAVVSAQRG 28 (147)
Q Consensus 11 ~~~~i~~~~la~a~A~~~ 28 (147)
|+++|.++|||+.+|...
T Consensus 2 LlVLLSvALLALSSAQ~~ 19 (179)
T PF15240_consen 2 LLVLLSVALLALSSAQST 19 (179)
T ss_pred hhHHHHHHHHHhhhcccc
Confidence 444555666666666543
No 17
>COG3117 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=58.17 E-value=25 Score=28.21 Aligned_cols=25 Identities=4% Similarity=-0.125 Sum_probs=16.4
Q ss_pred CcccccchhhHHHHHHHHHHHHHhhc
Q psy16510 1 MMKIEKDIGALAISFVALFIAVVSAQ 26 (147)
Q Consensus 1 ~m~~~~~~~~~~~~i~~~~la~a~A~ 26 (147)
+|++ |-+.++++++++|++++....
T Consensus 1 ~~~~-Rw~~~ILll~a~~~~~w~~~~ 25 (188)
T COG3117 1 SMSR-RWVYLILLLAALALSGWLLGL 25 (188)
T ss_pred Ccch-hHHHHHHHHHHHHHHHHhhhc
Confidence 4665 555567777777777776654
No 18
>PRK13254 cytochrome c-type biogenesis protein CcmE; Reviewed
Probab=56.62 E-value=11 Score=28.81 Aligned_cols=21 Identities=29% Similarity=0.175 Sum_probs=10.7
Q ss_pred CcccccchhhHHHHHHHHHHHH
Q psy16510 1 MMKIEKDIGALAISFVALFIAV 22 (147)
Q Consensus 1 ~m~~~~~~~~~~~~i~~~~la~ 22 (147)
||+ .+.++.++++++++++++
T Consensus 1 m~~-~~~~rl~~~~~~~~~~~~ 21 (148)
T PRK13254 1 MMK-RKRRRLLIILGALAALGL 21 (148)
T ss_pred CCc-cchhHHHHHHHHHHHHHH
Confidence 676 455565544444444443
No 19
>PF12729 4HB_MCP_1: Four helix bundle sensory module for signal transduction; InterPro: IPR024478 This entry represents a four-helix bundle that operates as a ubiquitous sensory module in prokaryotic signal-transduction, which is known as four-helix bundles methyl-accepting chemotaxis protein (4HB_MCP) domain. The 4HB_MCP is always found between two predicted transmembrane helices indicating that it detects only extracellular signals. In many cases the domain is associated with a cytoplasmic HAMP domain suggesting that most proteins carrying the bundle might share the mechanism of transmembrane signalling which is well-characterised in E coli chemoreceptors [].
Probab=55.33 E-value=10 Score=27.20 Aligned_cols=23 Identities=13% Similarity=0.015 Sum_probs=15.0
Q ss_pred cccccchhhHHHHHHHHHHHHHh
Q psy16510 2 MKIEKDIGALAISFVALFIAVVS 24 (147)
Q Consensus 2 m~~~~~~~~~~~~i~~~~la~a~ 24 (147)
|||.+++...+.+++++++++++
T Consensus 2 l~I~~KL~~~f~~~~~l~~~~~~ 24 (181)
T PF12729_consen 2 LSIRTKLILGFGLIILLLLIVGI 24 (181)
T ss_pred CcHHHHHHHHHHHHHHHHHHHHH
Confidence 78888888666555555555543
No 20
>KOG1691|consensus
Probab=55.13 E-value=96 Score=25.32 Aligned_cols=31 Identities=26% Similarity=0.412 Sum_probs=27.2
Q ss_pred ceeeEEEEECCCCcEEEEEEEcCCCCccccc
Q psy16510 84 TAQGQFTYTAPDGTPITVQWFADETGFHASG 114 (147)
Q Consensus 84 ~v~GsYsy~~pdG~~~~V~Y~ADe~Gf~p~g 114 (147)
+..|+|+|+..+-..+.+.++||..|.+|..
T Consensus 80 v~~gqFaFta~e~~~y~~Cf~~~~~~~~p~~ 110 (210)
T KOG1691|consen 80 VTKGQFAFTAEESGMYEACFTADVPGHKPET 110 (210)
T ss_pred cccceEEEEeccCCcEEEEEecccCCCCCCc
Confidence 3499999999998899999999998888843
No 21
>TIGR03795 chp_BMA0021 conserved hypothetical protein, BMA_0021 family. Members of this protein family are found sparsely, mostly in members of the genus Burkholderia. Members often occur as tandem homologous genes, such as BMA_0021 and BMA_0022 in Burkholderia mallei ATCC 23344. The genes regularly are encoded near the so-called docking protein of TOMM (thiazole/oxazole-modified microcins) biosynthetic clusters, suggesting a role in bacteriocin biosynthesis. The function is unknown.
Probab=54.42 E-value=21 Score=26.46 Aligned_cols=24 Identities=17% Similarity=0.228 Sum_probs=16.2
Q ss_pred CCCCCCCCcHHHHHHHH-HHhcCCC
Q psy16510 116 HLPTPPPIPEAILKSLQ-QVQVSPP 139 (147)
Q Consensus 116 hlP~~pp~p~~i~~al~-~~~~~p~ 139 (147)
.||.+|..-+...-||+ |+.+|+.
T Consensus 88 ~iP~aP~~~~~~a~ALAayn~~~~~ 112 (114)
T TIGR03795 88 GIPPAPEHLAEEAIALAAYNDAGPA 112 (114)
T ss_pred EcCCCCCccchHHHHHHHHhccCcc
Confidence 47877665444555776 9988874
No 22
>COG3017 LolB Outer membrane lipoprotein involved in outer membrane biogenesis [Cell envelope biogenesis, outer membrane]
Probab=54.21 E-value=40 Score=27.44 Aligned_cols=15 Identities=7% Similarity=0.049 Sum_probs=8.4
Q ss_pred HHHHHHHHHHHHhhc
Q psy16510 12 AISFVALFIAVVSAQ 26 (147)
Q Consensus 12 ~~~i~~~~la~a~A~ 26 (147)
++..++|+|+.+|+.
T Consensus 10 ~l~~~As~LL~aC~~ 24 (206)
T COG3017 10 LLLALASLLLTACTL 24 (206)
T ss_pred HHHHHHHHHHHhccC
Confidence 335556666666644
No 23
>PRK10894 lipopolysaccharide transport periplasmic protein LptA; Provisional
Probab=53.68 E-value=24 Score=27.33 Aligned_cols=35 Identities=20% Similarity=0.060 Sum_probs=14.7
Q ss_pred HHHHHHHHHHHHHhhccCCCCCCCCCCcceeeeccC
Q psy16510 11 LAISFVALFIAVVSAQRGPPQYAPGQFIPIISYQNE 46 (147)
Q Consensus 11 ~~~~i~~~~la~a~A~~~~~~~~~~~~~~il~~~~~ 46 (147)
|-++|++||+++++++.+-+ .....++.|-....+
T Consensus 6 ~~~~~~~~ll~~~~~a~A~~-~d~~~pI~I~AD~~~ 40 (180)
T PRK10894 6 LNLLLASSLLAASIPAFALT-GDTDQPIHIDSDQQS 40 (180)
T ss_pred HHHHHHHHHHHHHHHHhhcc-cccCCCEEEEeCceE
Confidence 34444455554433332221 123345665554433
No 24
>PRK13159 cytochrome c-type biogenesis protein CcmE; Reviewed
Probab=52.04 E-value=13 Score=28.79 Aligned_cols=13 Identities=8% Similarity=0.406 Sum_probs=8.1
Q ss_pred CcHHHHHHHHHHh
Q psy16510 123 IPEAILKSLQQVQ 135 (147)
Q Consensus 123 ~p~~i~~al~~~~ 135 (147)
.|+++.++|....
T Consensus 128 mP~Ev~~al~~~~ 140 (155)
T PRK13159 128 MPKELKDAMAEGH 140 (155)
T ss_pred CCHHHHHHHHhcc
Confidence 4566777776443
No 25
>PF08138 Sex_peptide: Sex peptide (SP) family; InterPro: IPR012608 This family consists of Sex Peptides (SP) that are found in Drosophila. On mating, Drosophila females decreases her remating rate and increases her egg-laying rate due, in part, to the transfer of SP from the male to the female. SP are found in seminal fluids transferred from the male to the female during mating. The male seminal fluid proteins are referred to as accessory gland proteins (Acps). The SP is one of the most interesting Acps and plays an important role in reproduction [].; GO: 0005179 hormone activity, 0046008 regulation of female receptivity, post-mating, 0005576 extracellular region; PDB: 2LAQ_A.
Probab=51.85 E-value=4.8 Score=26.08 Aligned_cols=16 Identities=6% Similarity=0.374 Sum_probs=0.0
Q ss_pred hHHHHHHHHHHHHHhh
Q psy16510 10 ALAISFVALFIAVVSA 25 (147)
Q Consensus 10 ~~~~~i~~~~la~a~A 25 (147)
++++++++|++.+|.+
T Consensus 4 p~~llllvlllGla~s 19 (56)
T PF08138_consen 4 PIFLLLLVLLLGLAQS 19 (56)
T ss_dssp ----------------
T ss_pred hHHHHHHHHHHHHHhc
Confidence 6777788887777766
No 26
>PF11153 DUF2931: Protein of unknown function (DUF2931); InterPro: IPR021326 Some members in this family of proteins are annotated as lipoproteins however this cannot be confirmed. Currently, there is no known function.
Probab=51.00 E-value=62 Score=25.57 Aligned_cols=22 Identities=23% Similarity=0.363 Sum_probs=11.5
Q ss_pred cCCCCcccccCCCCCCCCCcHHHHHHHHH
Q psy16510 105 ADETGFHASGAHLPTPPPIPEAILKSLQQ 133 (147)
Q Consensus 105 ADe~Gf~p~g~hlP~~pp~p~~i~~al~~ 133 (147)
+|+.=|+...+ +|+.+.+.+.-
T Consensus 107 ~ekK~Y~~~i~-------ip~~~~~~m~~ 128 (216)
T PF11153_consen 107 AEKKFYETKID-------IPEEIWQKMLT 128 (216)
T ss_pred ccCcEEEEEEE-------cCHHHHHHhcc
Confidence 34555555444 45566665543
No 27
>PF06316 Ail_Lom: Enterobacterial Ail/Lom protein; InterPro: IPR000758 Virulence-related outer membrane proteins are expressed in Gram-negative bacteria and are essential to bacterial survival within macrophages and for eukaryotic cell invasion. Members of this group include: PagC, required by Salmonella typhimurium for survival in macrophages and for virulence in mice [] Rck outer membrane protein of the S. typhimurium virulence plasmid [] Ail, a product of the Yersinia enterocolitica chromosome capable of mediating bacterial adherence to and invasion of epithelial cell lines [] OmpX from Escherichia coli that promotes adhesion to and entry into mammalian cells. It also has a role in the resistance against attack by the human complement system [] a Bacteriophage lambda outer membrane protein, Lom [] The crystal structure of OmpX from E. coli reveals that OmpX consists of an eight-stranded antiparallel all-next-neighbour beta barrel []. The structure shows two girdles of aromatic amino acid residues and a ribbon of nonpolar residues that attach to the membrane interior. The core of the barrel consists of an extended hydrogen-bonding network of highly conserved residues. OmpX thus resembles an inverse micelle. The OmpX structure shows that the membrane-spanning part of the protein is much better conserved than the extracellular loops. Moreover, these loops form a protruding beta sheet, the edge of which presumably binds to external proteins. It is suggested that this type of binding promotes cell adhesion and invasion and helps defend against the complement system. Although OmpX has the same beta-sheet topology as the structurally related outer membrane protein A (OmpA) IPR000498 from INTERPRO, their barrels differ with respect to the shear numbers and internal hydrogen-bonding networks.; GO: 0009279 cell outer membrane
Probab=50.02 E-value=80 Score=25.54 Aligned_cols=27 Identities=7% Similarity=0.289 Sum_probs=22.0
Q ss_pred cccceeeEEEEECCCCcEEEEEEEcCC
Q psy16510 81 EAQTAQGQFTYTAPDGTPITVQWFADE 107 (147)
Q Consensus 81 ~~~~v~GsYsy~~pdG~~~~V~Y~ADe 107 (147)
.+.+++=.|.|.||=|.+-..+|..++
T Consensus 49 ~GvNvKYRYE~~d~lGvItSfSY~~~~ 75 (199)
T PF06316_consen 49 RGVNVKYRYEFDDPLGVITSFSYASGD 75 (199)
T ss_pred CccEEEEEEecccceeeeEEEEEeecC
Confidence 567888888888888888888887765
No 28
>PRK13150 cytochrome c-type biogenesis protein CcmE; Reviewed
Probab=49.77 E-value=15 Score=28.60 Aligned_cols=17 Identities=24% Similarity=0.442 Sum_probs=12.5
Q ss_pred CcHHHHHHHHHHhcCCC
Q psy16510 123 IPEAILKSLQQVQVSPP 139 (147)
Q Consensus 123 ~p~~i~~al~~~~~~p~ 139 (147)
.|+++..+|....+.|.
T Consensus 135 mPpEv~~al~~~~~~~~ 151 (159)
T PRK13150 135 TPPEVEKAMQENHRRPQ 151 (159)
T ss_pred CCHHHHHHHHHhCCCCc
Confidence 67889999986666554
No 29
>COG3026 RseB Negative regulator of sigma E activity [Signal transduction mechanisms]
Probab=49.39 E-value=24 Score=30.36 Aligned_cols=22 Identities=23% Similarity=0.403 Sum_probs=15.4
Q ss_pred cccceeeEEEEECCCCcEEEEE
Q psy16510 81 EAQTAQGQFTYTAPDGTPITVQ 102 (147)
Q Consensus 81 ~~~~v~GsYsy~~pdG~~~~V~ 102 (147)
+.+..-+.-||+.|.=+.++++
T Consensus 82 Evvrr~n~vsyf~pg~~~ft~~ 103 (320)
T COG3026 82 EVVRRGNEVSYFEPGLEPFTLN 103 (320)
T ss_pred eeeeeCCEEEeecCCcceeeec
Confidence 6666777788888876666543
No 30
>PRK15401 alpha-ketoglutarate-dependent dioxygenase AlkB; Provisional
Probab=47.86 E-value=44 Score=27.09 Aligned_cols=49 Identities=24% Similarity=0.484 Sum_probs=33.7
Q ss_pred eEEEEECCCCcEEEE--------EEEcCCCCcccccCC--CCCC-CCCcHHHHHHHHHHh
Q psy16510 87 GQFTYTAPDGTPITV--------QWFADETGFHASGAH--LPTP-PPIPEAILKSLQQVQ 135 (147)
Q Consensus 87 GsYsy~~pdG~~~~V--------~Y~ADe~Gf~p~g~h--lP~~-pp~p~~i~~al~~~~ 135 (147)
.-+...-|.|..-+| -|.+|+.||+=.+.+ -+.| ||+|+.+++....++
T Consensus 46 p~~~~~~~gg~~msv~mt~~G~~~W~~d~~~YrYs~~~~~~~~pwp~~P~~l~~L~~~~~ 105 (213)
T PRK15401 46 PFRHMVTPGGYTMSVAMTNCGALGWVTDRRGYRYSPIDPLTGKPWPAMPASFLALAQRAA 105 (213)
T ss_pred CccceecCCCCcceeEEeccccceEecCCCCcccCCcCCCCCCCCCCchHHHHHHHHHHH
Confidence 345567777765544 789999999998764 3444 568877777666554
No 31
>PF02402 Lysis_col: Lysis protein; InterPro: IPR003059 The DNA sequence of the entire colicin E2 operon has been determined []. The operon comprises the colicin activity gene (ceaB), the colicin immunity gene (ceiB) and the lysis gene (celB), which is essential for colicin release from producing cells []. A putative LexA binding site is located upstream from ceaB, and a rho-independent terminator structure is located downstream from celB []. Comparison of the amino acid sequences of colicin E2 and cloacin DF13 reveal extensive similarity. These colicins have different modes of action and recognise different cell surface receptors; the two major regions of heterology at the C terminus, and in the C-terminal end of the central region are thought to correspond to the catalytic and receptor-recognition domains, respectively []. Sequence similarities between colicins E2, A and E1 [] are less striking. The colicin E2 (pyocin) immunity protein does not share similarity with either the colicin E3 or cloacin DF13 [] immunity proteins. By contrast, the lysis proteins of the ColE2, ColE1 and CloDF13 plasmids are almost identical except in the N-terminal regions, which themselves are similar to lipoprotein signal peptides []. Processing of the ColE2 prolysis protein to the mature form is prevented by globomycin, a specific inhibitor of the lipoprotein signal peptidase []. The mature ColE2 lysis protein is located in the cell envelope [].; GO: 0009405 pathogenesis, 0019835 cytolysis, 0019867 outer membrane
Probab=47.83 E-value=13 Score=23.26 Aligned_cols=17 Identities=18% Similarity=0.329 Sum_probs=9.3
Q ss_pred chhhHHHHHHHHHHHHH
Q psy16510 7 DIGALAISFVALFIAVV 23 (147)
Q Consensus 7 ~~~~~~~~i~~~~la~a 23 (147)
+|.+++++++..+|+.+
T Consensus 3 Ki~~~~i~~~~~~L~aC 19 (46)
T PF02402_consen 3 KIIFIGIFLLTMLLAAC 19 (46)
T ss_pred EEEEeHHHHHHHHHHHh
Confidence 56666666665444433
No 32
>COG4939 Major membrane immunogen, membrane-anchored lipoprotein [Function unknown]
Probab=44.83 E-value=1.1e+02 Score=23.33 Aligned_cols=44 Identities=16% Similarity=0.347 Sum_probs=24.9
Q ss_pred CCCCeeEEEeeCCceeeeecceecCCCCCcccccceeeEEEEECCCCcE
Q psy16510 50 GDGSYRYSYETGNGIAANEQGYLKNPGQKDLEAQTAQGQFTYTAPDGTP 98 (147)
Q Consensus 50 ~dG~Y~~~Ye~~dG~~~~E~G~~k~~g~~~~~~~~v~GsYsy~~pdG~~ 98 (147)
+||.|+-.|+.-|... -.+.+ .+- -+ .+-.|.=.|-|.+.+|+.
T Consensus 30 ~dGtY~~~y~~fDd~g--wk~f~-~it-i~-dGKiv~~~ydy~~k~G~~ 73 (147)
T COG4939 30 NDGTYQGHYESFDDHG--WKAFV-TIT-IQ-DGKIVACTYDYRDKKGNI 73 (147)
T ss_pred cCCceeeeeccccccC--ccceE-EEE-Ee-CCEEEEEEeeeecCCCCc
Confidence 5889999998776221 01111 000 00 344567778888887765
No 33
>PRK10523 lipoprotein involved with copper homeostasis and adhesion; Provisional
Probab=44.48 E-value=1.8e+02 Score=24.01 Aligned_cols=12 Identities=17% Similarity=0.174 Sum_probs=6.2
Q ss_pred cCCCCCCCCeeEE
Q psy16510 45 NEPNQGDGSYRYS 57 (147)
Q Consensus 45 ~~~~~~dG~Y~~~ 57 (147)
-..+ .||.|...
T Consensus 61 LtL~-~DgTY~L~ 72 (234)
T PRK10523 61 LFLE-KDGTWVMN 72 (234)
T ss_pred EEEc-CCCCEEEE
Confidence 3445 56655543
No 34
>PRK10755 sensor protein BasS/PmrB; Provisional
Probab=43.79 E-value=19 Score=29.56 Aligned_cols=18 Identities=22% Similarity=0.392 Sum_probs=9.4
Q ss_pred CcccccchhhHHHHHHHHH
Q psy16510 1 MMKIEKDIGALAISFVALF 19 (147)
Q Consensus 1 ~m~~~~~~~~~~~~i~~~~ 19 (147)
|||+ +++.++++++++|+
T Consensus 1 ~~~~-~~~~~~~~~~~~~~ 18 (356)
T PRK10755 1 MISL-RRRLLLTIGLILLV 18 (356)
T ss_pred CchH-HHHHHHHHHHHHHH
Confidence 7887 45554444444433
No 35
>PF03562 MltA: MltA specific insert domain; InterPro: IPR005300 This group of proteins includes MltA; a membrane-bound, murein degrading transglycosylase enzyme which plays an important role in the controlled growth of the stress-bearing sacculus of Escherichia coli [, ].; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds; PDB: 2PI8_D 2AE0_X 2PIC_A 2GAE_A 2PJJ_A 3CZB_A 2G6G_A 2PNW_A 2G5D_A.
Probab=42.79 E-value=26 Score=27.19 Aligned_cols=57 Identities=16% Similarity=0.251 Sum_probs=35.8
Q ss_pred cccceeeEEEEECCCCcEEEEEEEcC-CCCcccccC------CCCCCCCCcHHHHHHHHHHhcCCCC
Q psy16510 81 EAQTAQGQFTYTAPDGTPITVQWFAD-ETGFHASGA------HLPTPPPIPEAILKSLQQVQVSPPG 140 (147)
Q Consensus 81 ~~~~v~GsYsy~~pdG~~~~V~Y~AD-e~Gf~p~g~------hlP~~pp~p~~i~~al~~~~~~p~~ 140 (147)
..+.|+||=.-.-|||....|.|-+. -.-|+..|. .|+.. --..+..-+|+++||.+
T Consensus 80 fflqIQGSGrl~l~DG~~~rvgYAg~NG~pY~sIGr~Li~~G~i~~~---~~Smq~Ir~wl~~~P~~ 143 (158)
T PF03562_consen 80 FFLQIQGSGRLRLPDGSTVRVGYAGQNGHPYTSIGRLLIDRGEIPRE---QMSMQAIRAWLRAHPEE 143 (158)
T ss_dssp HHHHHHSEEEEE-TTSSEEEEEEEEE-SS----HHHHHHHTTSS-TT---S-SHHHHHHHHHHTGGG
T ss_pred HHeeecCcEEEEcCCCCEEEEEEcccCCCccccHHHHHHHcCCcChh---hCCHHHHHHHHHHCHHH
Confidence 44789999999999999999999654 234666544 35432 12345556799999864
No 36
>PF13511 DUF4124: Domain of unknown function (DUF4124)
Probab=42.38 E-value=29 Score=21.62 Aligned_cols=17 Identities=12% Similarity=0.405 Sum_probs=11.1
Q ss_pred eEEEEECCCCcEEEEEEEcC
Q psy16510 87 GQFTYTAPDGTPITVQWFAD 106 (147)
Q Consensus 87 GsYsy~~pdG~~~~V~Y~AD 106 (147)
.-|+|+|.+|+ |.|+-.
T Consensus 14 ~vYk~~D~~G~---v~ysd~ 30 (60)
T PF13511_consen 14 EVYKWVDENGV---VHYSDT 30 (60)
T ss_pred cEEEEECCCCC---EEECcc
Confidence 56788888885 355544
No 37
>COG2332 CcmE Cytochrome c-type biogenesis protein CcmE [Posttranslational modification, protein turnover, chaperones]
Probab=42.20 E-value=1.2e+02 Score=23.60 Aligned_cols=14 Identities=21% Similarity=0.446 Sum_probs=9.0
Q ss_pred CcHHHHHHHHHHhc
Q psy16510 123 IPEAILKSLQQVQV 136 (147)
Q Consensus 123 ~p~~i~~al~~~~~ 136 (147)
+|+++.+++.+.-.
T Consensus 129 ~P~ev~~~mk~~~~ 142 (153)
T COG2332 129 TPPEVAKAMKKNGV 142 (153)
T ss_pred CCHHHHHHhhhcCC
Confidence 56777777765443
No 38
>PF00879 Defensin_propep: Defensin propeptide The pattern for this Prosite entry doesn't match the propeptide.; InterPro: IPR002366 Defensins are 2-6 kDa, cationic, microbicidal peptides active against many Gram-negative and Gram-positive bacteria, fungi, and enveloped viruses [], containing three pairs of intramolecular disulphide bonds []. On the basis of their size and pattern of disulphide bonding, mammalian defensins are classified into alpha, beta and theta categories. Alpha-defensins, which have been identified in humans, monkeys and several rodent species, are particularly abundant in neutrophils, certain macrophage populations and Paneth cells of the small intestine. Every mammalian species explored thus far has beta-defensins. In cows, as many as 13 beta-defensins exist in neutrophils. However, in other species, beta-defensins are more often produced by epithelial cells lining various organs (e.g. the epidermis, bronchial tree and genitourinary tract). Theta-defensins are cyclic and have so far only been identified in primate phagocytes. Defensins are produced constitutively and/or in response to microbial products or proinflammatory cytokines. Some defensins are also called corticostatins (CS) because they inhibit corticotropin-stimulated corticosteroid production. The mechanism(s) by which microorganisms are killed and/or inactivated by defensins is not understood completely. However, it is generally believed that killing is a consequence of disruption of the microbial membrane. The polar topology of defensins, with spatially separated charged and hydrophobic regions, allows them to insert themselves into the phospholipid membranes so that their hydrophobic regions are buried within the lipid membrane interior and their charged (mostly cationic) regions interact with anionic phospholipid head groups and water. Subsequently, some defensins can aggregate to form `channel-like' pores; others might bind to and cover the microbial membrane in a `carpet-like' manner. The net outcome is the disruption of membrane integrity and function, which ultimately leads to the lysis of microorganisms. Some defensins are synthesized as propeptides which may be relevant to this process - in neutrophils only the mature peptides have been identified but in Paneth cells, the propeptide is stored in vesicles [] and appears to be cleaved by trypsin on activation. ; GO: 0006952 defense response
Probab=42.16 E-value=66 Score=20.62 Aligned_cols=12 Identities=8% Similarity=0.260 Sum_probs=5.2
Q ss_pred HHHHHHHHHHhh
Q psy16510 14 SFVALFIAVVSA 25 (147)
Q Consensus 14 ~i~~~~la~a~A 25 (147)
.|++++|++|+-
T Consensus 5 ~LLaAlLLlAlq 16 (52)
T PF00879_consen 5 ALLAALLLLALQ 16 (52)
T ss_pred HHHHHHHHHHHH
Confidence 344444444443
No 39
>smart00817 Amelin Ameloblastin precursor (Amelin). This family consists of several mammalian Ameloblastin precursor (Amelin) proteins. Matrix proteins of tooth enamel consist mainly of amelogenin but also of non-amelogenin proteins, which, although their volumetric percentage is low, have an important role in enamel mineralisation. One of the non-amelogenin proteins is ameloblastin, also known as amelin and sheathlin. Ameloblastin (AMBN) is one of the enamel sheath proteins which is though to have a role in determining the prismatic structure of growing enamel crystals.
Probab=40.99 E-value=25 Score=31.01 Aligned_cols=18 Identities=17% Similarity=0.215 Sum_probs=14.8
Q ss_pred HHHHHHHHHHHhhccCCC
Q psy16510 13 ISFVALFIAVVSAQRGPP 30 (147)
Q Consensus 13 ~~i~~~~la~a~A~~~~~ 30 (147)
++|++|||..++|.+..+
T Consensus 4 ~iL~~cLl~~afA~Pl~P 21 (411)
T smart00817 4 LALVLCLLKTSFAVPAFP 21 (411)
T ss_pred hhHHHHHHhhhhccccCC
Confidence 388899999999987665
No 40
>PF13623 SurA_N_2: SurA N-terminal domain
Probab=40.88 E-value=22 Score=26.92 Aligned_cols=20 Identities=30% Similarity=0.444 Sum_probs=10.9
Q ss_pred ccccchhhHHHHHHHHHHHHH
Q psy16510 3 KIEKDIGALAISFVALFIAVV 23 (147)
Q Consensus 3 ~~~~~~~~~~~~i~~~~la~a 23 (147)
|| |+++.++++|++++|.+.
T Consensus 3 kI-R~r~~lLi~vIglAL~aF 22 (145)
T PF13623_consen 3 KI-RQRGGLLIIVIGLALFAF 22 (145)
T ss_pred hH-hhcchHHHHHHHHHHHHH
Confidence 67 566765555544444443
No 41
>KOG3866|consensus
Probab=39.55 E-value=29 Score=30.40 Aligned_cols=26 Identities=31% Similarity=0.321 Sum_probs=18.4
Q ss_pred cccccchhhHHHHHHHHHHHHHhhcc
Q psy16510 2 MKIEKDIGALAISFVALFIAVVSAQR 27 (147)
Q Consensus 2 m~~~~~~~~~~~~i~~~~la~a~A~~ 27 (147)
|..++-|-++.+++++|++-++||.+
T Consensus 1 ~~~~~~~l~l~~llvllll~av~avP 26 (442)
T KOG3866|consen 1 MRWRTMILPLVILLVLLLLDAVCAVP 26 (442)
T ss_pred CcchhhHHHHHHHHHHHHHhhhcCCc
Confidence 44555566788788888888777754
No 42
>PF10929 DUF2811: Protein of unknown function (DUF2811); InterPro: IPR021231 This is a bacterial family of uncharacterised proteins.
Probab=36.92 E-value=32 Score=22.45 Aligned_cols=20 Identities=20% Similarity=0.438 Sum_probs=17.1
Q ss_pred CCcHHHHHHHH-HHhcCCCCC
Q psy16510 122 PIPEAILKSLQ-QVQVSPPGP 141 (147)
Q Consensus 122 p~p~~i~~al~-~~~~~p~~~ 141 (147)
.||+.+.+++. ||.+||.=+
T Consensus 5 eiPe~L~~~m~~fie~hP~WD 25 (57)
T PF10929_consen 5 EIPEDLHQAMKDFIETHPNWD 25 (57)
T ss_pred cccHHHHHHHHHHHHcCCCch
Confidence 47899999998 999999643
No 43
>TIGR01643 YD_repeat_2x YD repeat (two copies). This model describes two tandem copies of a 21-residue extracellular repeat found in Gram-negative, Gram-positive, and animal proteins. The repeat is named for a YD dipeptide, the most strongly conserved motif of the repeat. These repeats appear in general to be involved in binding carbohydrate; the chicken teneurin-1 YD-repeat region has been shown to bind heparin.
Probab=33.60 E-value=71 Score=18.15 Aligned_cols=20 Identities=30% Similarity=0.375 Sum_probs=12.1
Q ss_pred EEECCCCcEEEEEEEcCCCCcc
Q psy16510 90 TYTAPDGTPITVQWFADETGFH 111 (147)
Q Consensus 90 sy~~pdG~~~~V~Y~ADe~Gf~ 111 (147)
+.++|+|...+..| |..|-.
T Consensus 9 ~~~~p~G~~~~~~Y--D~~Grl 28 (42)
T TIGR01643 9 GSTDADGTTTRYTY--DAAGRL 28 (42)
T ss_pred EEECCCCCEEEEEE--CCCCCE
Confidence 35677777666666 555543
No 44
>PRK13165 cytochrome c-type biogenesis protein CcmE; Reviewed
Probab=33.44 E-value=42 Score=26.21 Aligned_cols=16 Identities=13% Similarity=0.403 Sum_probs=10.7
Q ss_pred CcHHHHHHHHHHhcCC
Q psy16510 123 IPEAILKSLQQVQVSP 138 (147)
Q Consensus 123 ~p~~i~~al~~~~~~p 138 (147)
.|+++..+|......|
T Consensus 135 mPpEv~~al~~~~~~~ 150 (160)
T PRK13165 135 TPPEVEEAMKKNHRRP 150 (160)
T ss_pred CCHHHHHHHHhccCCC
Confidence 5677888887554444
No 45
>TIGR00219 mreC rod shape-determining protein MreC. MreC (murein formation C) is involved in the rod shape determination in E. coli, and more generally in cell shape determination of bacteria whether or not they are rod-shaped. Cells defective in MreC are round. Species with MreC include many of the Proteobacteria, Gram-positives, and spirochetes.
Probab=33.34 E-value=37 Score=28.42 Aligned_cols=19 Identities=21% Similarity=0.188 Sum_probs=10.9
Q ss_pred cccccchhhHHHHHHHHHH
Q psy16510 2 MKIEKDIGALAISFVALFI 20 (147)
Q Consensus 2 m~~~~~~~~~~~~i~~~~l 20 (147)
||+.+...+++.+++++++
T Consensus 1 ~~f~~~~~~~~~~~~~~~~ 19 (283)
T TIGR00219 1 MKFLVKPKLFIRLLLALIV 19 (283)
T ss_pred CCCCcCCchHHHHHHHHHH
Confidence 8886666655444444444
No 46
>PLN03207 stomagen; Provisional
Probab=33.24 E-value=55 Score=23.91 Aligned_cols=26 Identities=15% Similarity=0.093 Sum_probs=12.2
Q ss_pred CcccccchhhHHHHHHHHHHHHHhhc
Q psy16510 1 MMKIEKDIGALAISFVALFIAVVSAQ 26 (147)
Q Consensus 1 ~m~~~~~~~~~~~~i~~~~la~a~A~ 26 (147)
||-+.+.--+||++++.++|+.++.+
T Consensus 5 ~~~~tt~~~~lffLl~~llla~~v~q 30 (113)
T PLN03207 5 CMTATTRCLTLFFLLFFLLLGAYVIQ 30 (113)
T ss_pred cccccchhHHHHHHHHHHHHHHHHHh
Confidence 44443333455555554454444444
No 47
>PRK05461 apaG CO2+/MG2+ efflux protein ApaG; Reviewed
Probab=33.05 E-value=54 Score=24.45 Aligned_cols=21 Identities=29% Similarity=0.487 Sum_probs=17.0
Q ss_pred cccceeeEEEEECCCCcEEEE
Q psy16510 81 EAQTAQGQFTYTAPDGTPITV 101 (147)
Q Consensus 81 ~~~~v~GsYsy~~pdG~~~~V 101 (147)
..+.++|+|.+++++|....|
T Consensus 93 p~G~M~G~y~~~~~~G~~F~v 113 (127)
T PRK05461 93 PSGTMQGHYQMVDEDGERFEV 113 (127)
T ss_pred CCEEEEEEEEEEeCCCCEEEE
Confidence 457899999999999986554
No 48
>cd07252 BphC1-RGP6_N_like N-terminal domain of 2,3-dihydroxybiphenyl 1,2-dioxygenase (BphC, EC 1.13.11.39) 1 from Rhodococcus globerulus P6 (BphC1-RGP6) and similar proteins. This subfamily contains the N-terminal, non-catalytic, domain of BphC1-RGP6 and similar proteins. BphC catalyzes the extradiol ring cleavage reaction of 2,3-dihydroxybiphenyl, the third step in the polychlorinated biphenyls (PCBs) degradation pathway (bph pathway). This subfamily of BphCs belongs to the type I extradiol dioxygenase family, which require a metal in the active site in its catalytic mechanism. Polychlorinated biphenyl degrading bacteria demonstrate a multiplicity of 2,3-dihydroxybiphenyl 1,2-dioxygenases. For example, three types of BphC enzymes have been found in Rhodococcus globerulus (BphC1-RGP6 - BphC3-RGP6), all three enzymes are type I extradiol dioxygenases. BphC1-RGP6 has an internal duplication, it is a two-domain dioxygenase which forms octamers, and has Fe(II) at the catalytic site. Its N-
Probab=32.96 E-value=60 Score=22.28 Aligned_cols=21 Identities=29% Similarity=0.635 Sum_probs=17.0
Q ss_pred eEEEEECCCCcEEEEEEEcCC
Q psy16510 87 GQFTYTAPDGTPITVQWFADE 107 (147)
Q Consensus 87 GsYsy~~pdG~~~~V~Y~ADe 107 (147)
.++.+.||||..+++.|..+|
T Consensus 100 ~~~~~~DPdG~~iE~~~~~~~ 120 (120)
T cd07252 100 GLIRFADPDGNRHELFWGPKE 120 (120)
T ss_pred EEEEEECCCCCEEEEEecccC
Confidence 368899999999998876543
No 49
>PF09244 DUF1964: Domain of unknown function (DUF1964); InterPro: IPR015325 This domain is C-terminal to the catalytic sucrose phosphorylase beta/alpha barrel domain. It adopts a beta-sandwich fold, with Greek-key topology and is functionally uncharacterised []. ; PDB: 1R7A_B 2GDU_A 2GDV_A.
Probab=32.69 E-value=52 Score=22.04 Aligned_cols=22 Identities=36% Similarity=0.741 Sum_probs=16.0
Q ss_pred eeEEEEECCCCcEEEEEEEcCC
Q psy16510 86 QGQFTYTAPDGTPITVQWFADE 107 (147)
Q Consensus 86 ~GsYsy~~pdG~~~~V~Y~ADe 107 (147)
-|.|||-..++..++.+|++.+
T Consensus 3 dG~FSy~~dgdtSitf~W~g~~ 24 (68)
T PF09244_consen 3 DGEFSYEADGDTSITFTWTGAT 24 (68)
T ss_dssp GSEEEEEEETTTEEEEEEE-SS
T ss_pred cceeeEecCCCcEEEEEEeccc
Confidence 3778888777778888887764
No 50
>COG3137 Putative salt-induced outer membrane protein [Cell envelope biogenesis, outer membrane]
Probab=31.64 E-value=68 Score=26.83 Aligned_cols=15 Identities=27% Similarity=0.421 Sum_probs=8.4
Q ss_pred CCCeeEEEeeCCcee
Q psy16510 51 DGSYRYSYETGNGIA 65 (147)
Q Consensus 51 dG~Y~~~Ye~~dG~~ 65 (147)
+|+-.++|...-|-.
T Consensus 51 ~gsve~Gy~a~sGNT 65 (262)
T COG3137 51 EGSVEAGYLAQSGNT 65 (262)
T ss_pred ccccceeeeeccCCc
Confidence 456666666554443
No 51
>PRK14818 NADH dehydrogenase subunit B; Provisional
Probab=31.60 E-value=38 Score=26.80 Aligned_cols=24 Identities=33% Similarity=0.562 Sum_probs=18.6
Q ss_pred cccCCCCCCCCCcHHHHHHHHHHh
Q psy16510 112 ASGAHLPTPPPIPEAILKSLQQVQ 135 (147)
Q Consensus 112 p~g~hlP~~pp~p~~i~~al~~~~ 135 (147)
++.-.+|-=||.|++|+++|..+.
T Consensus 129 pVDvyIPGCPP~PeaIl~gil~L~ 152 (173)
T PRK14818 129 PVDVYVPGCPPRPEALTEGLLRLQ 152 (173)
T ss_pred CCcEEccCCCCCHHHHHHHHHHHH
Confidence 344478888999999999886554
No 52
>PF05581 MCP_N: Vibrio chemotaxis protein N terminus; InterPro: IPR008762 Methyl-accepting chemotaxis proteins (MCPs) are a family of bacterial receptors that mediate chemotaxis to diverse signals, responding to changes in the concentration of attractants and repellents in the environment by altering swimming behaviour []. Environmental diversity gives rise to diversity in bacterial signalling receptors, and consequently there are many genes encoding MCPs []. For example, there are four well-characterised MCPs found in Escherichia coli: Tar (taxis towards aspartate and maltose, away from nickel and cobalt), Tsr (taxis towards serine, away from leucine, indole and weak acids), Trg (taxis towards galactose and ribose) and Tap (taxis towards dipeptides). MCPs share similar topology and signalling mechanisms. MCPs either bind ligands directly or interact with ligand-binding proteins, transducing the signal to downstream signalling proteins in the cytoplasm. MCPs undergo two covalent modifications: deamidation and reversible methylation at a number of glutamate residues. Attractants increase the level of methylation, while repellents decrease it. The methyl groups are added by the methyl-transferase cheR and are removed by the methylesterase cheB. Most MCPs are homodimers that contain the following organisation: an N-terminal signal sequence that acts as a transmembrane domain in the mature protein; a poorly-conserved periplasmic receptor (ligand-binding) domain; a second transmembrane domain; and a highly-conserved C-terminal cytoplasmic domain that interacts with downstream signalling components. The C-terminal domain contains the glycosylated glutamate residues. This entry represents the N-terminal domain found in chemotaxis methyl-accepting proteins primarily from from Vibrio species.; PDB: 3C8C_A.
Probab=31.56 E-value=16 Score=26.56 Aligned_cols=24 Identities=17% Similarity=0.228 Sum_probs=0.0
Q ss_pred cccccchhhHHHHHHHHHHHHHhh
Q psy16510 2 MKIEKDIGALAISFVALFIAVVSA 25 (147)
Q Consensus 2 m~~~~~~~~~~~~i~~~~la~a~A 25 (147)
||+.|+|...-.+|+++.+++.+.
T Consensus 1 MKFshKIv~AsS~lLlvt~slls~ 24 (103)
T PF05581_consen 1 MKFSHKIVLASSLLLLVTVSLLSL 24 (103)
T ss_dssp ------------------------
T ss_pred CchhHHHHHHHHHHHHHHHHHHHH
Confidence 999999995554555555555443
No 53
>cd07240 ED_TypeI_classII_N N-terminal domain of type I, class II extradiol dioxygenases; non-catalytic domain. This family contains the N-terminal, non-catalytic, domain of type I, class II extradiol dioxygenases. Dioxygenases catalyze the incorporation of both atoms of molecular oxygen into substrates using a variety of reaction mechanisms, resulting in the cleavage of aromatic rings. Two major groups of dioxygenases have been identified according to the cleavage site; extradiol enzymes cleave the aromatic ring between a hydroxylated carbon and an adjacent non-hydroxylated carbon, whereas intradiol enzymes cleave the aromatic ring between two hydroxyl groups. Extradiol dioxygenases are classified into type I and type II enzymes. Type I extradiol dioxygenases include class I and class II enzymes. These two classes of enzymes show sequence similarity; the two-domain class II enzymes evolved from a class I enzyme through gene duplication. The extradiol dioxygenases represented in this fa
Probab=31.48 E-value=73 Score=21.13 Aligned_cols=21 Identities=24% Similarity=0.392 Sum_probs=16.3
Q ss_pred eEEEEECCCCcEEEEEEEcCC
Q psy16510 87 GQFTYTAPDGTPITVQWFADE 107 (147)
Q Consensus 87 GsYsy~~pdG~~~~V~Y~ADe 107 (147)
-++.+.||||..+++.+..|+
T Consensus 97 ~~~~~~DP~G~~ie~~~~~~~ 117 (117)
T cd07240 97 RGLRFQDPDGHLLELFVEADK 117 (117)
T ss_pred eEEEEECCCCCEEEEEEccCC
Confidence 355689999999998877553
No 54
>PF07437 YfaZ: YfaZ precursor; InterPro: IPR009998 This family contains the precursor of the bacterial protein YfaZ (approximately 180 residues long). Many members of this family are hypothetical proteins.
Probab=31.40 E-value=1.9e+02 Score=22.64 Aligned_cols=13 Identities=31% Similarity=0.498 Sum_probs=6.4
Q ss_pred cceeeEEEEECCC
Q psy16510 83 QTAQGQFTYTAPD 95 (147)
Q Consensus 83 ~~v~GsYsy~~pd 95 (147)
+.+-|...|+++|
T Consensus 77 ~~vG~K~~~~~~~ 89 (180)
T PF07437_consen 77 LGVGGKALYLNPD 89 (180)
T ss_pred EEEeEEEEEeccc
Confidence 3444555555553
No 55
>TIGR03352 VI_chp_3 type VI secretion lipoprotein, VC_A0113 family. Work by Mougous, et al. (2006), describes IAHP-related loci as a type VI secretion system (PubMed:16763151). This protein family is associated with type VI secretion loci, although not treated explicitly by Mougous, et al.
Probab=30.88 E-value=62 Score=24.44 Aligned_cols=13 Identities=8% Similarity=0.146 Sum_probs=6.2
Q ss_pred HHHHHHHHHHHhh
Q psy16510 13 ISFVALFIAVVSA 25 (147)
Q Consensus 13 ~~i~~~~la~a~A 25 (147)
+++++|+++.+|+
T Consensus 5 ~~l~~~llL~gC~ 17 (146)
T TIGR03352 5 VLLAACLLLAGCS 17 (146)
T ss_pred HHHHHHHHHhhcc
Confidence 3444555544444
No 56
>COG2967 ApaG Uncharacterized protein affecting Mg2+/Co2+ transport [Inorganic ion transport and metabolism]
Probab=30.24 E-value=1.4e+02 Score=22.49 Aligned_cols=21 Identities=29% Similarity=0.493 Sum_probs=18.1
Q ss_pred cccceeeEEEEECCCCcEEEE
Q psy16510 81 EAQTAQGQFTYTAPDGTPITV 101 (147)
Q Consensus 81 ~~~~v~GsYsy~~pdG~~~~V 101 (147)
..+..+|.|..++.+|...+|
T Consensus 92 p~G~M~GhY~M~~e~G~~F~v 112 (126)
T COG2967 92 PSGTMQGHYEMIDEDGETFDV 112 (126)
T ss_pred CcceEEEEEEEecCCCcEEEe
Confidence 457899999999999998765
No 57
>COG1934 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=29.90 E-value=71 Score=25.14 Aligned_cols=22 Identities=27% Similarity=0.303 Sum_probs=12.9
Q ss_pred CCCcceeeeccCCCCCCCCeeE
Q psy16510 35 GQFIPIISYQNEPNQGDGSYRY 56 (147)
Q Consensus 35 ~~~~~il~~~~~~~~~dG~Y~~ 56 (147)
..++.|.....+.++.++...|
T Consensus 30 ~qPi~i~ad~~el~~~~~~a~f 51 (173)
T COG1934 30 DQPITIEADQQELDDKNGVATF 51 (173)
T ss_pred cCCEEEEccceeeeccCCEEEE
Confidence 5677777666554434555554
No 58
>KOG4824|consensus
Probab=29.44 E-value=2.9e+02 Score=22.65 Aligned_cols=22 Identities=14% Similarity=0.306 Sum_probs=15.5
Q ss_pred ccceeeEEEEECCCCcEEEEEE
Q psy16510 82 AQTAQGQFTYTAPDGTPITVQW 103 (147)
Q Consensus 82 ~~~v~GsYsy~~pdG~~~~V~Y 103 (147)
++..+|.|++-.+||++...+-
T Consensus 65 ~gctt~~y~~~nkngkI~Vln~ 86 (224)
T KOG4824|consen 65 GGCTTGAYTFDNKNGKIHVLNE 86 (224)
T ss_pred CCceeeeeEecCCCceEEEeee
Confidence 3467788888888887666543
No 59
>PF03100 CcmE: CcmE; InterPro: IPR004329 CcmE is the product of one of a cluster of Ccm genes that are necessary for cytochrome c biosynthesis in eubacteria. Expression of these proteins is induced when the organisms are grown under anaerobic conditions with nitrate or nitrite as the final electron acceptor.; GO: 0017003 protein-heme linkage, 0017004 cytochrome complex assembly, 0005886 plasma membrane; PDB: 1SR3_A 2KCT_A 1J6Q_A 1LM0_A.
Probab=28.74 E-value=2.4e+02 Score=20.69 Aligned_cols=7 Identities=43% Similarity=0.325 Sum_probs=0.0
Q ss_pred cccccch
Q psy16510 2 MKIEKDI 8 (147)
Q Consensus 2 m~~~~~~ 8 (147)
||-++.+
T Consensus 1 ~~~~~~r 7 (131)
T PF03100_consen 1 MKRRKKR 7 (131)
T ss_dssp -------
T ss_pred CCcceee
Confidence 6643333
No 60
>PF05899 Cupin_3: Protein of unknown function (DUF861); InterPro: IPR008579 The function of the proteins in this entry are unknown. They contain the conserved barrel domain of the 'cupin' superfamily and members are specific to plants and bacteria.; PDB: 1RC6_A 3MYX_A 1O5U_A 2K9Z_A 1LKN_A 3ES4_A 1SFN_B 3BCW_A.
Probab=28.60 E-value=1.3e+02 Score=19.69 Aligned_cols=30 Identities=27% Similarity=0.285 Sum_probs=22.1
Q ss_pred cccceeeEEEEECCCCcEEEEEEEcCCCCccc
Q psy16510 81 EAQTAQGQFTYTAPDGTPITVQWFADETGFHA 112 (147)
Q Consensus 81 ~~~~v~GsYsy~~pdG~~~~V~Y~ADe~Gf~p 112 (147)
-..++.|+...++.+|.. +++.|++-=|-|
T Consensus 28 ~~~vleG~v~it~~~G~~--~~~~aGD~~~~p 57 (74)
T PF05899_consen 28 FFYVLEGEVTITDEDGET--VTFKAGDAFFLP 57 (74)
T ss_dssp EEEEEEEEEEEEETTTEE--EEEETTEEEEE-
T ss_pred EEEEEEeEEEEEECCCCE--EEEcCCcEEEEC
Confidence 457899999999999986 567777643333
No 61
>PF05538 Campylo_MOMP: Campylobacter major outer membrane protein; InterPro: IPR008439 Campylobacter are Gram-negative, spiral, microaerophilic bacteria. Campylobacter jejuni is one of the main causative agents of food poisoning in the developed world. This family consists of Campylobacter major outer membrane proteins. The major outer membrane protein (MOMP), a putative porin and a multifunction surface protein of Campylobacter jejuni, may play an important role in the adaptation of the organism to various host environments [].
Probab=28.40 E-value=1.9e+02 Score=26.14 Aligned_cols=15 Identities=27% Similarity=0.490 Sum_probs=8.8
Q ss_pred CCCCCCCCeeEEEeeC
Q psy16510 46 EPNQGDGSYRYSYETG 61 (147)
Q Consensus 46 ~~~~~dG~Y~~~Ye~~ 61 (147)
+++ -.|.-.|+|+..
T Consensus 31 dVd-vSG~~RYRY~~~ 45 (431)
T PF05538_consen 31 DVD-VSGFARYRYTDD 45 (431)
T ss_pred ccc-cceEEEEEecCC
Confidence 344 556666777654
No 62
>COG0450 AhpC Peroxiredoxin [Posttranslational modification, protein turnover, chaperones]
Probab=28.23 E-value=1e+02 Score=24.81 Aligned_cols=48 Identities=21% Similarity=0.216 Sum_probs=34.0
Q ss_pred cccceeeEEEEECCCCcEEEEEEEcCCCCcccccCCCCCCCCCcHHHHHHHHHHhcCC
Q psy16510 81 EAQTAQGQFTYTAPDGTPITVQWFADETGFHASGAHLPTPPPIPEAILKSLQQVQVSP 138 (147)
Q Consensus 81 ~~~~v~GsYsy~~pdG~~~~V~Y~ADe~Gf~p~g~hlP~~pp~p~~i~~al~~~~~~p 138 (147)
++..+||.| -+||||..+.+.+.....|=.+. + .-.+++||++.++|.
T Consensus 120 ~g~a~R~~F-IIDp~g~ir~~~v~~~~iGRn~d-E--------ilR~idAlq~~~~hg 167 (194)
T COG0450 120 EGLALRGTF-IIDPDGVIRHILVNPLTIGRNVD-E--------ILRVIDALQFVAKHG 167 (194)
T ss_pred CCcceeEEE-EECCCCeEEEEEEecCCCCcCHH-H--------HHHHHHHHHHHHHhC
Confidence 456899999 69999999999888776443221 0 124556888888874
No 63
>PHA02978 hypothetical protein; Provisional
Probab=27.81 E-value=58 Score=24.23 Aligned_cols=16 Identities=44% Similarity=0.764 Sum_probs=12.6
Q ss_pred eEEEEECCCCcEEEEE
Q psy16510 87 GQFTYTAPDGTPITVQ 102 (147)
Q Consensus 87 GsYsy~~pdG~~~~V~ 102 (147)
=.+||.+|||-.++|+
T Consensus 75 iy~sy~~~~gisiqvs 90 (135)
T PHA02978 75 IYFSYADPDGISIQVS 90 (135)
T ss_pred EEEEecCCCceEEEEe
Confidence 3566899999988875
No 64
>PRK00315 potassium-transporting ATPase subunit C; Reviewed
Probab=27.05 E-value=42 Score=26.97 Aligned_cols=34 Identities=15% Similarity=0.326 Sum_probs=18.9
Q ss_pred CCCCcccccC----CCCCCCCCcHHHHHHHHH-HhcCCC
Q psy16510 106 DETGFHASGA----HLPTPPPIPEAILKSLQQ-VQVSPP 139 (147)
Q Consensus 106 De~Gf~p~g~----hlP~~pp~p~~i~~al~~-~~~~p~ 139 (147)
+++||.+.++ .-|+-|-.-+.|.+.++. .+++|.
T Consensus 74 ~~~~y~~~~SggSNl~psnp~l~~~v~~r~~~~~~~~~~ 112 (193)
T PRK00315 74 APMPYNPQASGGSNLAPSNPALDDAIKARVAALRAANPG 112 (193)
T ss_pred CCCCCCcccccccCCCCCCHHHHHHHHHHHHHHHHhCCC
Confidence 3557877543 345554455556666664 455653
No 65
>KOG4063|consensus
Probab=26.99 E-value=60 Score=25.35 Aligned_cols=17 Identities=18% Similarity=0.311 Sum_probs=10.4
Q ss_pred HHHHHHHHHHHHHhhcc
Q psy16510 11 LAISFVALFIAVVSAQR 27 (147)
Q Consensus 11 ~~~~i~~~~la~a~A~~ 27 (147)
+.+++|+++|+.++|++
T Consensus 6 ~~~v~l~alls~a~aq~ 22 (158)
T KOG4063|consen 6 LKTVILLALLSLAAAQA 22 (158)
T ss_pred HHHHHHHHHHHHhhhcc
Confidence 45566666666666553
No 66
>PTZ00459 mucin-associated surface protein (MASP); Provisional
Probab=26.60 E-value=43 Score=28.26 Aligned_cols=9 Identities=11% Similarity=0.124 Sum_probs=3.7
Q ss_pred HHHHHHHHh
Q psy16510 16 VALFIAVVS 24 (147)
Q Consensus 16 ~~~~la~a~ 24 (147)
|+|+|+|.+
T Consensus 11 LVCALCVLW 19 (291)
T PTZ00459 11 LVCALCVLW 19 (291)
T ss_pred HHHHHHHHh
Confidence 344444433
No 67
>PF04710 Pellino: Pellino; InterPro: IPR006800 Pellino is involved in Toll-like signalling pathways, and associates with the kinase domain of the Pelle Ser/Thr kinase [, , ].; PDB: 3EGB_B 3EGA_A.
Probab=26.41 E-value=1e+02 Score=27.63 Aligned_cols=53 Identities=17% Similarity=0.321 Sum_probs=32.7
Q ss_pred eCCceeeeecceecCCCCCcccccceeeEEEEECCCCcEEEEEEEcCCC--Cccc
Q psy16510 60 TGNGIAANEQGYLKNPGQKDLEAQTAQGQFTYTAPDGTPITVQWFADET--GFHA 112 (147)
Q Consensus 60 ~~dG~~~~E~G~~k~~g~~~~~~~~v~GsYsy~~pdG~~~~V~Y~ADe~--Gf~p 112 (147)
..||++....=.++++.....-...-+.|-||+..-.+.+.|.|+.|++ =||+
T Consensus 46 ~aNGVkps~~h~v~tp~~ska~~~~~~Hsisytlsr~~~VvVEY~~D~~tDMFQI 100 (416)
T PF04710_consen 46 KANGVKPSTVHIVSTPQASKAISNKSQHSISYTLSRNQTVVVEYTHDPDTDMFQI 100 (416)
T ss_dssp S--EEEEEEEEEEE--TT----EETTCCEEEEE-SSSEEEEEEEEEETTEEEEEE
T ss_pred cCCCcCccceEEecCCCccceecccCCceEEEEecCCceeeeeeecCCCcchhhh
Confidence 4477776665544433222112234588999999999999999999974 6777
No 68
>PRK13999 potassium-transporting ATPase subunit C; Provisional
Probab=26.33 E-value=44 Score=27.07 Aligned_cols=32 Identities=13% Similarity=0.319 Sum_probs=16.7
Q ss_pred CCCcccccC---C-CCCCCCCcHHHHHHHH-HHhcCC
Q psy16510 107 ETGFHASGA---H-LPTPPPIPEAILKSLQ-QVQVSP 138 (147)
Q Consensus 107 e~Gf~p~g~---h-lP~~pp~p~~i~~al~-~~~~~p 138 (147)
+.||.+.++ + -|+-|-.-..|.+.++ +.+++|
T Consensus 86 ~~~y~~~~SGgSNlgpsnp~L~~~v~~r~~~~~~~~~ 122 (201)
T PRK13999 86 PAPYNAANSMGSNLGPTSKALADRVKEDVDALKAENP 122 (201)
T ss_pred CCCCCcccccccCCCCCCHHHHHHHHHHHHHHHHhCC
Confidence 446777543 3 4444444455555555 455565
No 69
>PF12984 DUF3868: Domain of unknown function, B. Theta Gene description (DUF3868); InterPro: IPR024480 This domain of unknown function is found in a number of bacterial proteins. The function of the proteins is not known, but the Bacteroides thetaiotaomicron gene appears to be upregulated in the presence of host or other bacterial species compared to pure culture [, ].
Probab=26.12 E-value=39 Score=24.53 Aligned_cols=11 Identities=0% Similarity=-0.040 Sum_probs=4.8
Q ss_pred hHHHHHHHHHH
Q psy16510 10 ALAISFVALFI 20 (147)
Q Consensus 10 ~~~~~i~~~~l 20 (147)
+++++|++|++
T Consensus 5 ~i~~~Ll~~~~ 15 (115)
T PF12984_consen 5 FILFFLLLCSL 15 (115)
T ss_pred EHHHHHHHHhh
Confidence 44444444444
No 70
>PF05423 Mycobact_memb: Mycobacterium membrane protein; InterPro: IPR008693 This family contains several membrane proteins from Mycobacterium species [].
Probab=25.56 E-value=1.4e+02 Score=22.53 Aligned_cols=30 Identities=27% Similarity=0.617 Sum_probs=17.5
Q ss_pred EEEEECCCCcEEEEEEEcCCCC--cccccCCCC
Q psy16510 88 QFTYTAPDGTPITVQWFADETG--FHASGAHLP 118 (147)
Q Consensus 88 sYsy~~pdG~~~~V~Y~ADe~G--f~p~g~hlP 118 (147)
.|.-.++.|...+|+|. |++| -+.....||
T Consensus 54 ~YEV~G~~G~~~~I~Y~-D~~~~~~~~~~v~LP 85 (140)
T PF05423_consen 54 TYEVTGPPGSTATISYL-DADGQPQQVDNVSLP 85 (140)
T ss_pred EEEEEcCCCCeEEEEEE-cCCCceEeecCcCCC
Confidence 56666677777778886 4443 222344566
No 71
>PF13124 DUF3963: Protein of unknown function (DUF3963)
Probab=25.56 E-value=69 Score=19.19 Aligned_cols=13 Identities=15% Similarity=0.158 Sum_probs=5.9
Q ss_pred cchhhHHHHHHHH
Q psy16510 6 KDIGALAISFVAL 18 (147)
Q Consensus 6 ~~~~~~~~~i~~~ 18 (147)
|+|++.+-+++++
T Consensus 21 rnit~cfal~vv~ 33 (40)
T PF13124_consen 21 RNITFCFALLVVV 33 (40)
T ss_pred HHHHHHHHHHHHH
Confidence 4555544333333
No 72
>PF00659 POLO_box: POLO box duplicated region; InterPro: IPR000959 A subgroup of serine/threonine protein kinases, Polo or Polo-like kinases play multiple roles during the cell cycle. Polo kinases are required at several key points through mitosis, starting from control of the G2/M transition through phosphorylation of Cdc25C and mitotic cyclins. Polo kinases are characterised by an amino terminal catalytic domain, and a carboxy terminal non-catalytic domain consisting of three blocks of conserved sequences known as polo boxes which form one single functional domain []. The domain is named after its founding member encoded by the polo gene of Drosophila melanogaster []. This domain of around 70 amino acids has been found in species ranging from yeast to mammals. Polo boxes appear to mediate interaction with multiple proteins through protein:protein interactions; some but not all of these proteins are substrates for the kinase domain of the molecule []. The crystal structure of the polo domain of the murine protein, Sak, is dimeric, consisting of two alpha-helices and two six-stranded beta-sheets []. The topology of one polypeptide subunit of the dimer consists of, from its N- to C terminus, an extended strand segment, five beta-strands, one alpha-helix (A) and a C-terminal beta-strand. Beta-strands from one subunit form a contiguous antiparallel beta-sheet with beta-strands from the second subunit. The two beta-sheets pack with a crossing angle of 110 degrees, orienting the hydrophobic surfaces inward and the hydrophilic surfaces outward. Helix A, which is colinear with beta-strand 6 of the same polypeptide, buries a large portion of the non-overlapping hydrophobic beta-sheet surfaces. Interactions involving helices A comprise a majority of the hydrophobic core structure and also the dimer interface. Point mutations in the Polo box of the budding yeast Cdc5 protein abolish the ability of overexpressed Cdc5 to interact with the spindle poles and to organise cytokinetic structures [].; GO: 0005515 protein binding; PDB: 1MBY_B 3P37_A 3MHN_A 1Q4K_A 3HIK_A 3Q1I_A 3P35_A 3MHQ_A 1UMW_B 3MQ8_B ....
Probab=25.54 E-value=1.9e+02 Score=18.43 Aligned_cols=64 Identities=25% Similarity=0.341 Sum_probs=31.7
Q ss_pred CeeEEEeeCCceeeeecceecCCCCCccccc-ceeeEEEEECCCCcEEEEEEEcCCCCcccccCCCCCCCCCcHHHHHHH
Q psy16510 53 SYRYSYETGNGIAANEQGYLKNPGQKDLEAQ-TAQGQFTYTAPDGTPITVQWFADETGFHASGAHLPTPPPIPEAILKSL 131 (147)
Q Consensus 53 ~Y~~~Ye~~dG~~~~E~G~~k~~g~~~~~~~-~v~GsYsy~~pdG~~~~V~Y~ADe~Gf~p~g~hlP~~pp~p~~i~~al 131 (147)
.|.+.|..+||+..-- + +.++. -.+ .-.....|++++|+..+ |..++ +... .|+++.+-|
T Consensus 4 k~gi~~~LSng~vqv~---F-nD~tk--ivl~~~~~~v~yi~~~~~~~~--~~~~~---------~~~~--~p~~l~~kl 64 (68)
T PF00659_consen 4 KYGIGYQLSNGTVQVN---F-NDHTK--IVLSPDGRLVTYIDRDGERQT--YSLSS---------LLED--FPEDLKKKL 64 (68)
T ss_dssp CSEEEEEETTSEEEEE---E-TTS-E--EEEETTCCEEEEE-TTS-EEE--EECTC---------HHHH----HHHHHHH
T ss_pred ceEEEEEEeCCCEEEE---E-eCCCE--EEECCCCCEEEEECCCCcEEE--EEccc---------cccC--CCHHHHHHh
Confidence 4778999999986322 2 22221 001 11247788888776443 43332 0000 367777777
Q ss_pred HHHh
Q psy16510 132 QQVQ 135 (147)
Q Consensus 132 ~~~~ 135 (147)
.+++
T Consensus 65 ~~~k 68 (68)
T PF00659_consen 65 TYLK 68 (68)
T ss_dssp HHHH
T ss_pred hccC
Confidence 7653
No 73
>PF11912 DUF3430: Protein of unknown function (DUF3430); InterPro: IPR021837 This family of proteins are functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 209 to 265 amino acids in length.
Probab=25.41 E-value=57 Score=25.17 Aligned_cols=8 Identities=38% Similarity=0.671 Sum_probs=4.4
Q ss_pred CeeEEEee
Q psy16510 53 SYRYSYET 60 (147)
Q Consensus 53 ~Y~~~Ye~ 60 (147)
.+.++|..
T Consensus 40 ~~G~Gys~ 47 (212)
T PF11912_consen 40 SYGIGYSI 47 (212)
T ss_pred CcEEEEEe
Confidence 45666554
No 74
>TIGR01655 yxeA_fam conserved hypothetical protein TIGR01655. This model represents a family of small (about 115 amino acids) uncharacterized proteins with N-terminal signal sequences, found exclusively in Gram-positive organisms. Most genomes that have any members of this family have at least two members.
Probab=24.97 E-value=2.7e+02 Score=20.04 Aligned_cols=14 Identities=21% Similarity=0.411 Sum_probs=8.6
Q ss_pred CCCCcEEEEEEEcC
Q psy16510 93 APDGTPITVQWFAD 106 (147)
Q Consensus 93 ~pdG~~~~V~Y~AD 106 (147)
|.+|+..++++.|.
T Consensus 63 d~~G~~k~i~f~~~ 76 (114)
T TIGR01655 63 DSSGKKHKVKFMAG 76 (114)
T ss_pred CCCCCEEEEEEEcC
Confidence 34666666666655
No 75
>PRK09558 ushA bifunctional UDP-sugar hydrolase/5'-nucleotidase periplasmic precursor; Reviewed
Probab=24.83 E-value=57 Score=29.47 Aligned_cols=10 Identities=50% Similarity=0.494 Sum_probs=7.6
Q ss_pred Ccccccchhh
Q psy16510 1 MMKIEKDIGA 10 (147)
Q Consensus 1 ~m~~~~~~~~ 10 (147)
|||+++.|..
T Consensus 1 ~~~~~~~~~~ 10 (551)
T PRK09558 1 MMKFLKRLVA 10 (551)
T ss_pred CcchHHHHHH
Confidence 9999777653
No 76
>PRK11627 hypothetical protein; Provisional
Probab=24.33 E-value=68 Score=25.50 Aligned_cols=9 Identities=22% Similarity=0.678 Sum_probs=6.2
Q ss_pred ceeeEEEEE
Q psy16510 84 TAQGQFTYT 92 (147)
Q Consensus 84 ~v~GsYsy~ 92 (147)
+-+|.|+|.
T Consensus 114 V~~g~~~~~ 122 (192)
T PRK11627 114 VSEGNVRYN 122 (192)
T ss_pred cccceeeec
Confidence 347888776
No 77
>PRK14864 putative biofilm stress and motility protein A; Provisional
Probab=23.79 E-value=85 Score=22.72 Aligned_cols=8 Identities=0% Similarity=-0.031 Sum_probs=3.5
Q ss_pred Ccccccch
Q psy16510 1 MMKIEKDI 8 (147)
Q Consensus 1 ~m~~~~~~ 8 (147)
||++-|.+
T Consensus 1 ~~~~mk~~ 8 (104)
T PRK14864 1 RNMVMRRF 8 (104)
T ss_pred CchHHHHH
Confidence 35553333
No 78
>cd08344 MhqB_like_N N-terminal domain of MhqB, a type I extradiol dioxygenase, and similar proteins. This subfamily contains the N-terminal, non-catalytic, domain of Burkholderia sp. NF100 MhqB and similar proteins. MhqB is a type I extradiol dioxygenase involved in the catabolism of methylhydroquinone, an intermediate in the degradation of fenitrothion. The purified enzyme has shown extradiol ring cleavage activity toward 3-methylcatechol. Fe2+ was suggested as a cofactor, the same as most other enzymes in the family. Burkholderia sp. NF100 MhqB is encoded on the plasmid pNF1. The type I family of extradiol dioxygenases contains two structurally homologous barrel-shaped domains at the N- and C-terminal. The active-site metal is located in the C-terminal barrel and plays an essential role in the catalytic mechanism.
Probab=23.62 E-value=95 Score=20.91 Aligned_cols=16 Identities=25% Similarity=0.468 Sum_probs=13.7
Q ss_pred EEEEECCCCcEEEEEE
Q psy16510 88 QFTYTAPDGTPITVQW 103 (147)
Q Consensus 88 sYsy~~pdG~~~~V~Y 103 (147)
++.+.||||..+++.+
T Consensus 93 ~~~~~DP~Gn~iel~~ 108 (112)
T cd08344 93 GVWFRDPDGNLLQVKV 108 (112)
T ss_pred EEEEECCCCCEEEEec
Confidence 4779999999999875
No 79
>COG2834 LolA Outer membrane lipoprotein-sorting protein [Cell envelope biogenesis, outer membrane]
Probab=23.04 E-value=1.9e+02 Score=22.68 Aligned_cols=30 Identities=23% Similarity=0.520 Sum_probs=16.4
Q ss_pred cceeeEEEEE-CCCCcEEEEEEEcCCCCccc
Q psy16510 83 QTAQGQFTYT-APDGTPITVQWFADETGFHA 112 (147)
Q Consensus 83 ~~v~GsYsy~-~pdG~~~~V~Y~ADe~Gf~p 112 (147)
++..|..-|+ +|+...+++++..+..+.-|
T Consensus 83 ivsdG~~v~iydp~~~q~~~~~~~~~~~~tp 113 (211)
T COG2834 83 IVSDGKTVWIYDPDLEQVTKTWLSEATGNTP 113 (211)
T ss_pred EEECCCEEEEECCCCceEEEEecCCCCCCCc
Confidence 4444544443 45666666666666554444
No 80
>PF07263 DMP1: Dentin matrix protein 1 (DMP1); InterPro: IPR009889 This family consists of several mammalian dentin matrix protein 1 (DMP1) sequences. The dentin matrix acidic phosphoprotein 1 (DMP1) gene has been mapped to human chromosome 4q21 []. DMP1 is a bone and teeth specific protein initially identified from mineralised dentin. DMP1 is primarily localised in the nuclear compartment of undifferentiated osteoblasts. In the nucleus, DMP1 acts as a transcriptional component for activation of osteoblast-specific genes like osteocalcin. During the early phase of osteoblast maturation, Ca2+ surges into the nucleus from the cytoplasm, triggering the phosphorylation of DMP1 by a nuclear isoform of casein kinase II. This phosphorylated DMP1 is then exported out into the extracellular matrix, where it regulates nucleation of hydroxyapatite. DMP1 is a unique molecule that initiates osteoblast differentiation by transcription in the nucleus and orchestrates mineralised matrix formation extracellularly, at later stages of osteoblast maturation []. The DMP1 gene has been found to be ectopically expressed in lung cancer although the reason for this is unknown [].; GO: 0001503 ossification, 0030198 extracellular matrix organization
Probab=22.94 E-value=38 Score=30.95 Aligned_cols=13 Identities=0% Similarity=-0.171 Sum_probs=7.4
Q ss_pred HHHHHHHHHHhhc
Q psy16510 14 SFVALFIAVVSAQ 26 (147)
Q Consensus 14 ~i~~~~la~a~A~ 26 (147)
||||+.|.||+.+
T Consensus 7 l~~lwgls~alpv 19 (514)
T PF07263_consen 7 LMFLWGLSCALPV 19 (514)
T ss_pred HHHHHHhhcccch
Confidence 4556666666544
No 81
>PF07584 BatA: Aerotolerance regulator N-terminal; InterPro: IPR024163 The Batl operon appears to be important in pathogenicity and aerotolerance. BatA ensures bacterial survival in the early stages of the infection process, when the infected sites are aerobic, and is produced under conditions of oxidative stress []. Proteins produced by the Batl operon share a highly-conserved sequence at their N terminus and the full length proteins carry multiple membrane-spanning domains []. This entry represents the conserved N-terminal domain, which is also found in some uncharacterised proteins.
Probab=22.66 E-value=84 Score=20.76 Aligned_cols=16 Identities=13% Similarity=0.065 Sum_probs=6.3
Q ss_pred hHHHHHHHHHHHHHhh
Q psy16510 10 ALAISFVALFIAVVSA 25 (147)
Q Consensus 10 ~~~~~i~~~~la~a~A 25 (147)
.++.++.++++.+|.|
T Consensus 60 l~L~lLal~lli~AlA 75 (77)
T PF07584_consen 60 LLLRLLALALLILALA 75 (77)
T ss_pred HHHHHHHHHHHHHHHc
Confidence 3333333444444443
No 82
>PRK09455 rseB anti-sigma E factor; Provisional
Probab=22.66 E-value=1.4e+02 Score=25.46 Aligned_cols=20 Identities=25% Similarity=0.398 Sum_probs=11.2
Q ss_pred cccceeeEEEEECCCCcEEE
Q psy16510 81 EAQTAQGQFTYTAPDGTPIT 100 (147)
Q Consensus 81 ~~~~v~GsYsy~~pdG~~~~ 100 (147)
+.+.....-+|..|+.+.++
T Consensus 83 EviR~~d~V~~~~p~~~~~~ 102 (319)
T PRK09455 83 EIIQRGNEISYFEPGLEPFT 102 (319)
T ss_pred EEEEECCEEEEEecCCceEE
Confidence 55555555666666655543
No 83
>TIGR01200 Porphyromonas gingivalis paralogous family TIGR01200. No identifiable homologs outside Porphyromonas gingivalis.
Probab=22.65 E-value=96 Score=26.54 Aligned_cols=11 Identities=9% Similarity=0.368 Sum_probs=6.5
Q ss_pred HHHHHHHHHHH
Q psy16510 13 ISFVALFIAVV 23 (147)
Q Consensus 13 ~~i~~~~la~a 23 (147)
++|+.|+|...
T Consensus 7 ~~~~~~~~~~~ 17 (292)
T TIGR01200 7 LYILICFLLSI 17 (292)
T ss_pred HHHHHHHHHhh
Confidence 36666766543
No 84
>PHA02823 chemokine binding protein; Provisional
Probab=22.59 E-value=71 Score=26.41 Aligned_cols=13 Identities=15% Similarity=0.110 Sum_probs=5.7
Q ss_pred HHHHHHHHHHhhc
Q psy16510 14 SFVALFIAVVSAQ 26 (147)
Q Consensus 14 ~i~~~~la~a~A~ 26 (147)
++++|..+.|.|+
T Consensus 5 ~v~~~~cl~a~~~ 17 (255)
T PHA02823 5 IVLACMCLAATAA 17 (255)
T ss_pred hhHHHHHHHHHhc
Confidence 4444444444444
No 85
>PF10902 DUF2693: Protein of unknown function (DUF2693); InterPro: IPR024401 This family of proteins is found in bacteria and bacteriophages. Its function is unknown.
Probab=21.72 E-value=2.5e+02 Score=19.52 Aligned_cols=23 Identities=17% Similarity=0.332 Sum_probs=19.3
Q ss_pred CCeeEEEeeCCceeeeecceecC
Q psy16510 52 GSYRYSYETGNGIAANEQGYLKN 74 (147)
Q Consensus 52 G~Y~~~Ye~~dG~~~~E~G~~k~ 74 (147)
|.-.|-|+-.||+.|.-.|.++.
T Consensus 1 G~h~~~feK~DGtiR~a~gT~d~ 23 (83)
T PF10902_consen 1 GEHEFVFEKVDGTIRVAKGTRDP 23 (83)
T ss_pred CcEEEEEEecCCCEEEEEEecCc
Confidence 45578899999999999999764
No 86
>PF03866 HAP: Hydrophobic abundant protein (HAP) ; InterPro: IPR005566 Expression of Hydrophobic Abundant protein is thought to be developmentally regulated and possibly involved in spherule cell wall formation [].
Probab=21.56 E-value=71 Score=24.61 Aligned_cols=17 Identities=35% Similarity=0.413 Sum_probs=12.3
Q ss_pred CcccccchhhHHHHHHHHHHHHHhh
Q psy16510 1 MMKIEKDIGALAISFVALFIAVVSA 25 (147)
Q Consensus 1 ~m~~~~~~~~~~~~i~~~~la~a~A 25 (147)
|||- +|+.+|+.++++-
T Consensus 1 mmky--------vfvalc~~avval 17 (164)
T PF03866_consen 1 MMKY--------VFVALCLFAVVAL 17 (164)
T ss_pred Cchh--------HHHHHHHHHHHHH
Confidence 8995 4777888877643
No 87
>COG4594 FecB ABC-type Fe3+-citrate transport system, periplasmic component [Inorganic ion transport and metabolism]
Probab=21.55 E-value=1.8e+02 Score=24.94 Aligned_cols=27 Identities=11% Similarity=0.267 Sum_probs=15.8
Q ss_pred eeeecceecCCCCCcccccceeeEEEEECC
Q psy16510 65 AANEQGYLKNPGQKDLEAQTAQGQFTYTAP 94 (147)
Q Consensus 65 ~~~E~G~~k~~g~~~~~~~~v~GsYsy~~p 94 (147)
...|.|.++-.+++. =+|.=.|||.|.
T Consensus 36 Vkde~Gt~tv~k~PK---RVVVLE~SFaDa 62 (310)
T COG4594 36 VKDELGTFTVPKTPK---RVVVLELSFADA 62 (310)
T ss_pred eeccCCceecCCCCc---eEEEEEecHHHH
Confidence 445666666555443 345567888763
No 88
>COG3354 FlaG Putative archaeal flagellar protein G [Cell motility and secretion]
Probab=21.27 E-value=1.4e+02 Score=23.21 Aligned_cols=27 Identities=26% Similarity=0.384 Sum_probs=14.7
Q ss_pred CCcceeeeccCC--CCCCCCeeEEEeeCC
Q psy16510 36 QFIPIISYQNEP--NQGDGSYRYSYETGN 62 (147)
Q Consensus 36 ~~~~il~~~~~~--~~~dG~Y~~~Ye~~d 62 (147)
.+..|+...... +..+|.|.|.|-..|
T Consensus 50 ~dFaIIndPg~i~~~~~~g~~t~t~yiKN 78 (154)
T COG3354 50 TDFAIINDPGQIPYVGTDGPYTYTFYIKN 78 (154)
T ss_pred ccEEEecCCCCCccccCCCceEEEEEEec
Confidence 356666554332 213477777776554
No 89
>TIGR01965 VCBS_repeat VCBS repeat. This domain of about 100 residues is found multiple (up to 35) copies in long proteins from several species of Vibrio, Colwellia, Bradyrhizobium, and Shewanella (hence the name VCBS) and in smaller copy numbers in proteins from several other bacteria. The large protein size and repeat copy numbers, species distribution, and suggested activities of several member proteins suggests a role for this domain in adhesion.
Probab=21.18 E-value=1.6e+02 Score=21.10 Aligned_cols=20 Identities=45% Similarity=0.561 Sum_probs=14.8
Q ss_pred cceeeEEEEECCCCcEEEEE
Q psy16510 83 QTAQGQFTYTAPDGTPITVQ 102 (147)
Q Consensus 83 ~~v~GsYsy~~pdG~~~~V~ 102 (147)
-.+.=+|.|...||...+|+
T Consensus 55 e~~tdsFtvtv~DGtt~~vt 74 (99)
T TIGR01965 55 ETLTDTFTVTSADGTSQTVT 74 (99)
T ss_pred CEEEEEEEEEEeCCCeEEEE
Confidence 45667899999999766654
No 90
>PRK00178 tolB translocation protein TolB; Provisional
Probab=20.54 E-value=84 Score=26.78 Aligned_cols=18 Identities=22% Similarity=0.089 Sum_probs=11.1
Q ss_pred hHHHHHHHHHHHHHhhcc
Q psy16510 10 ALAISFVALFIAVVSAQR 27 (147)
Q Consensus 10 ~~~~~i~~~~la~a~A~~ 27 (147)
++++++++||++.+++++
T Consensus 5 ~~~~~~~~~~~~~~~~~~ 22 (430)
T PRK00178 5 KRIALFVLCLLAGGAAAA 22 (430)
T ss_pred hHHHHHHHHHHhhHhhcc
Confidence 455666777776666543
No 91
>COG3422 Uncharacterized conserved protein [Function unknown]
Probab=20.25 E-value=87 Score=20.59 Aligned_cols=22 Identities=32% Similarity=0.721 Sum_probs=15.8
Q ss_pred CCCCeeEEEeeCCcee-eeecce
Q psy16510 50 GDGSYRYSYETGNGIA-ANEQGY 71 (147)
Q Consensus 50 ~dG~Y~~~Ye~~dG~~-~~E~G~ 71 (147)
.+|.|.|.+.-+||.. .+.+|+
T Consensus 11 k~Ge~rfrlkA~N~eiI~~sEgY 33 (59)
T COG3422 11 KAGEYRFRLKAANGEIILTSEGY 33 (59)
T ss_pred CCCcEEEEEEccCccEEEeeccc
Confidence 7899999999988753 233444
No 92
>cd09013 BphC-JF8_N_like N-terminal, non-catalytic, domain of BphC_JF8, (2,3-dihydroxybiphenyl 1,2-dioxygenase) from Bacillus sp. JF8 and similar proteins. 2,3-dihydroxybiphenyl 1,2-dioxygenase (BphC) catalyzes the extradiol ring cleavage reaction of 2,3-dihydroxybiphenyl, a key step in the polychlorinated biphenyls (PCBs) degradation pathway (bph pathway). BphC belongs to the type I extradiol dioxygenase family, which requires a metal ion in the active site in its catalytic mechanism. Polychlorinated biphenyl degrading bacteria demonstrate a multiplicity of BphCs. This subfamily of BphC is represented by the enzyme purified from the thermophilic biphenyl and naphthalene degrader, Bacillus sp. JF8. The members in this family of BphC enzymes may use either Mn(II) or Fe(II) as cofactors. The enzyme purified from Bacillus sp. JF8 is Mn(II)-dependent, however, the enzyme from Rhodococcus jostii RHAI has Fe(II) bound to it. BphC_JF8 is thermostable and its optimum activity is at 85 degrees C
Probab=20.06 E-value=1.3e+02 Score=20.33 Aligned_cols=16 Identities=31% Similarity=1.115 Sum_probs=13.8
Q ss_pred EEEEECCCCcEEEEEE
Q psy16510 88 QFTYTAPDGTPITVQW 103 (147)
Q Consensus 88 sYsy~~pdG~~~~V~Y 103 (147)
++.+.||||..+++.|
T Consensus 102 ~~~~~DPdG~~iEl~~ 117 (121)
T cd09013 102 AYRFRSPDGHPMELYW 117 (121)
T ss_pred eEEEECCCCCEEEEEE
Confidence 5679999999998876
Done!