Query 032541
Match_columns 138
No_of_seqs 56 out of 58
Neff 2.3
Searched_HMMs 46136
Date Fri Mar 29 02:46:57 2013
Command hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/032541.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/032541hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF10251 PEN-2: Presenilin enh 94.2 0.16 3.4E-06 37.5 5.6 49 82-131 13-68 (94)
2 PF07787 DUF1625: Protein of u 89.8 1.3 2.8E-05 35.5 6.4 55 77-138 188-244 (248)
3 KOG3402 Predicted membrane pro 80.6 2.8 6E-05 31.9 3.9 48 82-129 18-71 (101)
4 PF01034 Syndecan: Syndecan do 78.7 0.65 1.4E-05 32.6 0.0 25 113-137 12-36 (64)
5 PF05915 DUF872: Eukaryotic pr 69.3 2.7 5.9E-05 31.4 1.3 46 72-123 40-85 (115)
6 PF12158 DUF3592: Protein of u 68.2 12 0.00026 26.1 4.3 43 52-99 97-145 (148)
7 KOG2621 Prohibitins and stomat 65.2 8.3 0.00018 33.7 3.6 26 65-94 22-50 (288)
8 PLN03160 uncharacterized prote 52.9 4.7 0.0001 32.4 0.1 27 67-93 33-59 (219)
9 PF05393 Hum_adeno_E3A: Human 50.6 25 0.00054 26.5 3.6 34 60-95 19-53 (94)
10 PF06687 SUR7: SUR7/PalI famil 49.9 1E+02 0.0022 23.0 6.8 52 76-131 114-166 (212)
11 PF08999 SP_C-Propep: Surfacta 49.7 30 0.00066 26.0 3.9 36 70-106 25-61 (93)
12 COG2738 Predicted Zn-dependent 49.6 22 0.00047 30.3 3.5 25 70-94 121-148 (226)
13 PRK11383 hypothetical protein; 49.6 48 0.001 26.6 5.2 65 68-133 6-93 (145)
14 PF01284 MARVEL: Membrane-asso 47.1 92 0.002 21.4 6.0 27 80-106 77-103 (144)
15 KOG4788 Members of chemokine-l 46.7 70 0.0015 24.8 5.7 57 76-134 64-126 (172)
16 PF09878 DUF2105: Predicted me 45.5 26 0.00057 29.6 3.4 25 76-100 164-192 (212)
17 PF04835 Pox_A9: A9 protein co 44.3 41 0.00089 23.1 3.6 31 108-138 14-44 (54)
18 PF07954 DUF1689: Protein of u 44.2 21 0.00045 28.2 2.5 37 77-115 35-71 (152)
19 cd02435 CCC1 CCC1. CCC1: This 44.1 74 0.0016 26.2 5.7 16 79-94 160-176 (241)
20 COG3671 Predicted membrane pro 41.6 1.7E+02 0.0036 23.1 7.0 35 67-108 19-53 (125)
21 PF11694 DUF3290: Protein of u 41.0 1E+02 0.0022 24.0 5.8 32 79-112 16-47 (149)
22 TIGR01191 ccmC heme exporter p 40.0 85 0.0018 25.0 5.3 31 107-137 105-135 (184)
23 KOG2927 Membrane component of 39.5 24 0.00053 31.9 2.4 19 104-122 206-225 (372)
24 cd02432 Nodulin-21_like_1 Nodu 39.2 1.1E+02 0.0023 24.8 5.9 18 77-94 141-159 (218)
25 PRK09459 pspG phage shock prot 38.5 1.5E+02 0.0034 21.5 6.0 10 81-90 4-13 (76)
26 TIGR00267 conserved hypothetic 38.2 1.2E+02 0.0026 23.4 5.8 13 81-93 97-110 (169)
27 cd02433 Nodulin-21_like_2 Nodu 38.0 1E+02 0.0022 25.3 5.6 15 79-93 157-172 (234)
28 PF03733 DUF307: Domain of unk 37.9 56 0.0012 21.4 3.4 24 78-102 5-28 (53)
29 PRK09554 feoB ferrous iron tra 37.4 79 0.0017 30.1 5.5 52 77-132 689-740 (772)
30 PF09323 DUF1980: Domain of un 34.5 1.3E+02 0.0028 23.1 5.4 45 79-128 2-48 (182)
31 KOG2887 Membrane protein invol 33.9 45 0.00098 27.3 3.0 83 44-127 18-129 (175)
32 COG5336 Uncharacterized protei 33.3 26 0.00057 27.2 1.4 26 74-102 58-84 (116)
33 PF03845 Spore_permease: Spore 32.8 1.5E+02 0.0033 23.7 5.8 55 78-133 172-230 (320)
34 PF02687 FtsX: FtsX-like perme 32.2 70 0.0015 20.7 3.2 7 106-112 88-94 (121)
35 PF11511 RhodobacterPufX: Intr 31.6 42 0.00091 23.9 2.1 20 82-101 35-54 (67)
36 COG5503 Uncharacterized conser 31.3 25 0.00054 25.3 1.0 19 14-32 41-59 (69)
37 COG2194 Predicted membrane-ass 31.0 2E+02 0.0044 26.7 6.9 32 77-108 11-42 (555)
38 COG1347 NqrD Na+-transporting 30.9 30 0.00066 29.1 1.6 24 66-89 123-152 (208)
39 PF13886 DUF4203: Domain of un 30.5 2E+02 0.0044 22.2 6.0 42 82-130 26-69 (210)
40 PF04298 Zn_peptidase_2: Putat 29.5 71 0.0015 26.8 3.5 23 67-89 113-137 (222)
41 PRK00968 tetrahydromethanopter 28.4 67 0.0015 27.7 3.2 27 107-133 208-237 (240)
42 PF06738 DUF1212: Protein of u 28.3 2.3E+02 0.005 21.1 5.8 27 107-133 145-171 (193)
43 PF02285 COX8: Cytochrome oxid 28.3 1.2E+02 0.0025 19.9 3.6 28 107-134 4-31 (44)
44 PF07234 DUF1426: Protein of u 28.0 93 0.002 24.2 3.7 33 78-119 12-45 (117)
45 PF01102 Glycophorin_A: Glycop 27.1 96 0.0021 23.7 3.6 10 82-91 68-77 (122)
46 PF05478 Prominin: Prominin; 26.8 2.4E+02 0.0052 26.7 6.8 22 84-105 98-120 (806)
47 TIGR00927 2A1904 K+-dependent 26.7 1.5E+02 0.0033 30.4 5.7 13 73-85 1012-1024(1096)
48 COG4036 Predicted membrane pro 26.3 80 0.0017 26.8 3.3 25 76-100 166-194 (224)
49 COG1457 CodB Purine-cytosine p 26.0 2E+02 0.0043 26.2 5.9 56 79-134 191-248 (442)
50 KOG1172 Na+-independent Cl/HCO 25.9 23 0.00051 35.0 0.1 27 74-100 664-696 (876)
51 PRK10429 melibiose:sodium symp 25.8 1.9E+02 0.0041 24.2 5.4 12 107-118 136-147 (473)
52 PF11377 DUF3180: Protein of u 25.7 1.7E+02 0.0036 22.2 4.6 12 91-102 87-98 (138)
53 PF09948 DUF2182: Predicted me 25.7 1.5E+02 0.0033 23.9 4.7 48 62-113 124-174 (191)
54 PF14023 DUF4239: Protein of u 25.6 1.9E+02 0.004 22.1 4.9 21 110-130 165-185 (209)
55 PF03631 Virul_fac_BrkB: Virul 25.4 3.2E+02 0.007 21.2 6.7 50 78-127 72-121 (260)
56 PF03729 DUF308: Short repeat 25.3 1.7E+02 0.0036 17.8 5.7 22 108-129 49-70 (72)
57 PF14017 DUF4233: Protein of u 24.8 96 0.0021 23.0 3.1 18 80-97 77-94 (107)
58 PF01036 Bac_rhodopsin: Bacter 24.5 2E+02 0.0043 22.5 5.0 46 80-128 1-47 (222)
59 PRK09597 lipid A 1-phosphatase 24.3 32 0.0007 28.1 0.6 17 75-91 22-38 (190)
60 PRK10714 undecaprenyl phosphat 24.2 3.7E+02 0.0079 22.2 6.7 31 67-98 227-257 (325)
61 PF05255 UPF0220: Uncharacteri 24.2 1.4E+02 0.0031 23.6 4.1 55 79-135 23-78 (166)
62 cd02434 Nodulin-21_like_3 Nodu 23.7 2.1E+02 0.0045 23.1 5.1 16 79-94 141-157 (225)
63 PF14362 DUF4407: Domain of un 23.6 3.5E+02 0.0076 21.9 6.4 25 77-101 17-41 (301)
64 COG2364 Predicted membrane pro 23.5 1.2E+02 0.0026 25.6 3.8 36 74-122 48-84 (210)
65 KOG0581 Mitogen-activated prot 23.4 30 0.00065 31.1 0.3 39 78-116 285-330 (364)
66 PRK09669 putative symporter Ya 23.2 2.2E+02 0.0047 23.3 5.2 10 106-115 138-147 (444)
67 PRK11770 hypothetical protein; 23.0 1.2E+02 0.0025 23.5 3.4 20 79-99 8-27 (135)
68 COG1814 Uncharacterized membra 22.5 3.7E+02 0.008 21.6 6.3 17 79-95 149-166 (229)
69 TIGR02847 CyoD cytochrome o ub 22.1 1.4E+02 0.0031 21.8 3.6 33 104-136 54-86 (96)
70 PF13430 DUF4112: Domain of un 21.7 1.6E+02 0.0035 21.5 3.8 53 67-128 13-72 (106)
71 PLN02715 lipid phosphate phosp 21.6 2.9E+02 0.0064 24.1 5.9 25 61-91 81-105 (327)
72 PF13779 DUF4175: Domain of un 21.2 2.7E+02 0.0058 27.3 6.1 22 79-101 8-29 (820)
73 PF01988 VIT1: VIT family; In 21.0 3.8E+02 0.0082 21.0 6.0 15 79-93 138-153 (213)
74 PF07213 DAP10: DAP10 membrane 20.9 1.4E+02 0.0031 21.7 3.3 6 72-77 24-29 (79)
75 PF06210 DUF1003: Protein of u 20.7 3.2E+02 0.0069 20.2 5.2 46 83-129 6-54 (108)
76 TIGR02975 phageshock_pspG phag 20.6 3.2E+02 0.007 19.3 5.6 10 81-90 3-12 (64)
77 PRK14398 membrane protein; Pro 20.5 1E+02 0.0022 24.9 2.8 30 80-113 4-34 (191)
78 TIGR01571 A_thal_Cys_rich unch 20.4 41 0.00089 24.0 0.4 39 71-117 40-78 (104)
79 PF09583 Phageshock_PspG: Phag 20.4 3.3E+02 0.0071 19.3 6.0 10 81-90 4-13 (65)
80 PF13664 DUF4149: Domain of un 20.1 2.9E+02 0.0062 18.6 5.1 12 109-120 26-37 (101)
No 1
>PF10251 PEN-2: Presenilin enhancer-2 subunit of gamma secretase; InterPro: IPR019379 This entry is a short, 101 peptide protein, which is the smallest subunit of the gamma-secretase aspartyl protease complex. It catalyses the intra-membrane cleavage of a subset of type I transmembrane proteins. The other active constituents of the complex are presenilin (PS) nicastrin and anterior pharynx defective-1 (APH-1) protein. Presenilin enhancer-2 (PEN-2) adopts a hairpin orientation in the membrane with its N- and C-terminal domains facing the luminal/extracellular space. The C-terminal domain maintains PS stability within the complex [].
Probab=94.17 E-value=0.16 Score=37.50 Aligned_cols=49 Identities=31% Similarity=0.604 Sum_probs=37.5
Q ss_pred HHHHHHHH-hHHHHHHHHHhhcccccCC------cccchhHHHHHHHHHHHHHHHHH
Q 032541 82 SFLLGFVF-PLMWYYGTFLYFGNHCRKD------PRERAGLAASAIAAMACSVVMLV 131 (138)
Q Consensus 82 lFllGFf~-~ipWYvgafl~~~~~~r~D------pRErpGl~AcaIAa~v~tia~~I 131 (138)
.|+.||++ |..|.+-++-++- ....+ ++-|.=.+.|+|.+++.+++++.
T Consensus 13 yf~~GFa~LP~lW~vN~~wF~~-~af~~p~~~~~~~Ir~YVi~SaiG~~vw~v~l~~ 68 (94)
T PF10251_consen 13 YFLGGFAFLPFLWLVNVVWFFR-EAFSKPPYDEQPQIRKYVIRSAIGFLVWTVVLIS 68 (94)
T ss_pred HHHHHHHHhHHHHHHHHHHHhH-HHhcCCCcchhHHHHHHHHHHHHHHHHHHHHHHH
Confidence 58899988 7999999877652 33333 36677788999999999988765
No 2
>PF07787 DUF1625: Protein of unknown function (DUF1625); InterPro: IPR012430 Sequences making up this family are derived from hypothetical proteins expressed by both prokaryotic and eukaryotic species. The region in question is approximately 250 residues long.
Probab=89.81 E-value=1.3 Score=35.48 Aligned_cols=55 Identities=13% Similarity=0.337 Sum_probs=34.5
Q ss_pred chHHHHHHHHHHHh--HHHHHHHHHhhcccccCCcccchhHHHHHHHHHHHHHHHHHHHHhhcC
Q 032541 77 GVGWFSFLLGFVFP--LMWYYGTFLYFGNHCRKDPRERAGLAASAIAAMACSVVMLVIVVFRLM 138 (138)
Q Consensus 77 GiGWflFllGFf~~--ipWYvgafl~~~~~~r~DpRErpGl~AcaIAa~v~tia~~Il~~~r~~ 138 (138)
.+||++..+||.+- +.+.+..++++ .. .-.+..-+.+|.++...+.+++.|+.|+
T Consensus 188 ~~G~llmf~G~~~~~~~l~~l~~~~P~--lg-----~l~~~~~~~~~~~~s~~lsl~~Ia~aW~ 244 (248)
T PF07787_consen 188 FIGWLLMFIGFFLLFSPLYTLVDWIPL--LG-----NLVGFGLFLVAFIISFSLSLLTIALAWL 244 (248)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhce--ee-----chhhhHHHHHHHHHHHHHHHHHHHHhhe
Confidence 56999999998884 44444455554 11 2445556666666666666667777664
No 3
>KOG3402 consensus Predicted membrane protein [Function unknown]
Probab=80.65 E-value=2.8 Score=31.85 Aligned_cols=48 Identities=31% Similarity=0.697 Sum_probs=32.0
Q ss_pred HHHHHHHH-hHHHHHHHHHhhcc--cccCCc--cc-chhHHHHHHHHHHHHHHH
Q 032541 82 SFLLGFVF-PLMWYYGTFLYFGN--HCRKDP--RE-RAGLAASAIAAMACSVVM 129 (138)
Q Consensus 82 lFllGFf~-~ipWYvgafl~~~~--~~r~Dp--RE-rpGl~AcaIAa~v~tia~ 129 (138)
-+++||-| |..|.+-.|-++-- +.+..| |. |.=.++|+|+.++.+|++
T Consensus 18 yyl~GfafLP~lW~VN~FwFf~~af~~pa~~~r~QIr~YVvrSavGf~fw~ivL 71 (101)
T KOG3402|consen 18 YYLFGFAFLPWLWFVNCFWFFPVAFHSPAFPHRRQIRNYVVRSAVGFSFWTIVL 71 (101)
T ss_pred HHHhhHHHHHHHHHHHHHHHhHHHHcCcccchHHHHHHHHHHHHHHHHHHHHHH
Confidence 37889977 89999999987631 333344 12 233468888888777765
No 4
>PF01034 Syndecan: Syndecan domain; InterPro: IPR001050 The syndecans are transmembrane proteoglycans which are involved in the organisation of cytoskeleton and/or actin microfilaments, and have important roles as cell surface receptors during cell-cell and/or cell-matrix interactions [, ]. Structurally, these proteins consist of four separate domains: A signal sequence; An extracellular domain (ectodomain) of variable length whose sequence is not evolutionary conserved in the various forms of syndecans. The ectodomain contains the sites of attachment of the heparan sulphate glycosaminoglycan side chains; A transmembrane region; A highly conserved cytoplasmic domain of about 30 to 35 residues, which could interact with cytoskeletal proteins. The proteins known to belong to this family are: Syndecan 1. Syndecan 2 or fibroglycan. Syndecan 3 or neuroglycan or N-syndecan. Syndecan 4 or amphiglycan or ryudocan. Drosophila syndecan. Caenorhabditis elegans probable syndecan (F57C7.3). Syndecan-4, a transmembrane heparan sulphate proteoglycan, is a coreceptor with integrins in cell adhesion. It has been suggested to form a ternary signalling complex with protein kinase Calpha and phosphatidylinositol 4,5-bisphosphate (PIP2). Structural studies have demonstrated that the cytoplasmic domain undergoes a conformational transition and forms a symmetric dimer in the presence of phospholipid activator PIP2, and whose overall structure in solution exhibits a twisted clamp shape having a cavity in the centre of dimeric interface. In addition, it has been observed that the syndecan-4 variable domain interacts, strongly, not only with fatty acyl groups but also the anionic head group of PIP2. These findings indicate that PIP2 promotes oligomerisation of the syndecan-4 cytoplasmic domain for transmembrane signalling and cell-matrix adhesion [, ].; GO: 0008092 cytoskeletal protein binding, 0016020 membrane; PDB: 1EJQ_B 1EJP_B 1YBO_C 1OBY_Q.
Probab=78.67 E-value=0.65 Score=32.57 Aligned_cols=25 Identities=12% Similarity=0.582 Sum_probs=0.0
Q ss_pred hhHHHHHHHHHHHHHHHHHHHHhhc
Q 032541 113 AGLAASAIAAMACSVVMLVIVVFRL 137 (138)
Q Consensus 113 pGl~AcaIAa~v~tia~~Il~~~r~ 137 (138)
.|++|+.+++++++|+++++..+|.
T Consensus 12 aavIaG~Vvgll~ailLIlf~iyR~ 36 (64)
T PF01034_consen 12 AAVIAGGVVGLLFAILLILFLIYRM 36 (64)
T ss_dssp -------------------------
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 5677788888899999988888774
No 5
>PF05915 DUF872: Eukaryotic protein of unknown function (DUF872); InterPro: IPR008590 This entry represents several uncharacterised eukaryotic transmembrane proteins. The function of this currently unknown.
Probab=69.29 E-value=2.7 Score=31.41 Aligned_cols=46 Identities=22% Similarity=0.403 Sum_probs=24.5
Q ss_pred cccccchHHHHHHHHHHHhHHHHHHHHHhhcccccCCcccchhHHHHHHHHH
Q 032541 72 PCFGCGVGWFSFLLGFVFPLMWYYGTFLYFGNHCRKDPRERAGLAASAIAAM 123 (138)
Q Consensus 72 PCcG~GiGWflFllGFf~~ipWYvgafl~~~~~~r~DpRErpGl~AcaIAa~ 123 (138)
|-=-+-++-+||++|.++-+ +|.+++. .++|-....+++--.++.+
T Consensus 40 pwK~I~la~~Lli~G~~li~---~g~l~~~---~~i~~~~~~~~~llilG~L 85 (115)
T PF05915_consen 40 PWKSIALAVFLLIFGTVLII---IGLLLFF---GHIDGDRDRGWALLILGIL 85 (115)
T ss_pred HHHHHHHHHHHHHHHHHHHH---HHHHHHh---cccCCCCcccchHHHHHHH
Confidence 45556667788888877653 3444444 2334334445544444444
No 6
>PF12158 DUF3592: Protein of unknown function (DUF3592); InterPro: IPR021994 This family of proteins is functionally uncharacterised.This family of proteins is found in bacteria, archaea, eukaryotes and viruses. Proteins in this family are typically between 150 and 242 amino acids in length.
Probab=68.19 E-value=12 Score=26.11 Aligned_cols=43 Identities=23% Similarity=0.554 Sum_probs=26.9
Q ss_pred ceeeecCCccccccccc---CCCcccccchHHHHHHHHHH--Hh-HHHHHHHHH
Q 032541 52 RYTLIRDPENFQFGIYD---KPLPCFGCGVGWFSFLLGFV--FP-LMWYYGTFL 99 (138)
Q Consensus 52 ~Y~lird~e~~~~g~~~---~rLPCcG~GiGWflFllGFf--~~-ipWYvgafl 99 (138)
+-++.+|++||+...-+ ++. +.-|..++++++ ++ +...+|.|+
T Consensus 97 ~V~V~Y~P~~P~~~~l~~~~~~~-----~~~~~~~~~~~~~~lG~~~~~~gl~~ 145 (148)
T PF12158_consen 97 TVTVYYNPNNPEEARLEPRKRPW-----SGLWLMFIFGFGFILGLIFFLVGLFM 145 (148)
T ss_pred EEEEEECCcCCCeEEEeeecCch-----HHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 45666999999765555 333 367888888777 34 344444444
No 7
>KOG2621 consensus Prohibitins and stomatins of the PID superfamily [Energy production and conversion]
Probab=65.23 E-value=8.3 Score=33.69 Aligned_cols=26 Identities=23% Similarity=0.405 Sum_probs=20.9
Q ss_pred ccccCCCcccccchHHHHHHHHHHHh---HHHH
Q 032541 65 GIYDKPLPCFGCGVGWFSFLLGFVFP---LMWY 94 (138)
Q Consensus 65 g~~~~rLPCcG~GiGWflFllGFf~~---ipWY 94 (138)
+...+++.|| +|++|+++|+|- +||-
T Consensus 22 ~~~~~~~~~~----~~~l~~~S~llvi~TfP~S 50 (288)
T KOG2621|consen 22 EDDSKPLGAC----EWLLVILSFLLVLMTFPIS 50 (288)
T ss_pred ccccCCcchH----HHHHHHHHHHHHHHHhHHH
Confidence 4456789999 999999999985 5664
No 8
>PLN03160 uncharacterized protein; Provisional
Probab=52.94 E-value=4.7 Score=32.35 Aligned_cols=27 Identities=19% Similarity=0.396 Sum_probs=16.4
Q ss_pred ccCCCcccccchHHHHHHHHHHHhHHH
Q 032541 67 YDKPLPCFGCGVGWFSFLLGFVFPLMW 93 (138)
Q Consensus 67 ~~~rLPCcG~GiGWflFllGFf~~ipW 93 (138)
+.++.-||||-+.++++|.+.++.+.|
T Consensus 33 r~~~~~c~~~~~a~~l~l~~v~~~l~~ 59 (219)
T PLN03160 33 RRNCIKCCGCITATLLILATTILVLVF 59 (219)
T ss_pred cccceEEHHHHHHHHHHHHHHHHheee
Confidence 334566887777777777554444443
No 9
>PF05393 Hum_adeno_E3A: Human adenovirus early E3A glycoprotein; InterPro: IPR008652 This family consists of several early glycoproteins (E3A), from human adenovirus type 2.; GO: 0016021 integral to membrane
Probab=50.62 E-value=25 Score=26.52 Aligned_cols=34 Identities=15% Similarity=0.420 Sum_probs=20.5
Q ss_pred cccccccccCCCcccccchHHHHHHHHHHHh-HHHHH
Q 032541 60 ENFQFGIYDKPLPCFGCGVGWFSFLLGFVFP-LMWYY 95 (138)
Q Consensus 60 e~~~~g~~~~rLPCcG~GiGWflFllGFf~~-ipWYv 95 (138)
|-++.-++..+.| |+||.|++..+=|++. |+|++
T Consensus 19 ~~p~~~~~~n~~~--~Lgm~~lvI~~iFil~Vilwfv 53 (94)
T PF05393_consen 19 ETPVVSMFVNNWP--NLGMWFLVICGIFILLVILWFV 53 (94)
T ss_pred ccceeEeecCCCC--ccchhHHHHHHHHHHHHHHHHH
Confidence 3444444666666 8998555544444555 88865
No 10
>PF06687 SUR7: SUR7/PalI family; InterPro: IPR009571 This family consists of several fungal-specific SUR7 proteins. Its activity regulates expression of RVS161, a homologue of human endophilin, suggesting a function for both in endocytosis [, ]. The protein carries four transmembrane domains and is thus likely to act as an anchoring protein for the eisosome to the plasma membrane. Eisosomes are the immobile protein complexes, that include the proteins Pil1 and Lsp1, which co-localise with sites of protein and lipid endocytosis at the plasma membrane. SUR7 protein may play a role in sporulation []. Two-component signal transduction systems enable bacteria to sense, respond, and adapt to a wide range of environments, stressors, and growth conditions []. Some bacteria can contain up to as many as 200 two-component systems that need tight regulation to prevent unwanted cross-talk []. These pathways have been adapted to response to a wide variety of stimuli, including nutrients, cellular redox state, changes in osmolarity, quorum signals, antibiotics, and more []. Two-component systems are comprised of a sensor histidine kinase (HK) and its cognate response regulator (RR) []. The HK catalyses its own auto-phosphorylation followed by the transfer of the phosphoryl group to the receiver domain on RR; phosphorylation of the RR usually activates an attached output domain, which can then effect changes in cellular physiology, often by regulating gene expression. Some HK are bifunctional, catalysing both the phosphorylation and dephosphorylation of their cognate RR. The input stimuli can regulate either the kinase or phosphatase activity of the bifunctional HK. A variant of the two-component system is the phospho-relay system. Here a hybrid HK auto-phosphorylates and then transfers the phosphoryl group to an internal receiver domain, rather than to a separate RR protein. The phosphoryl group is then shuttled to histidine phosphotransferase (HPT) and subsequently to a terminal RR, which can evoke the desired response [, ]. This family also includes PalI which is part of a pH signal transduction cascade. Based on the similarity of PalI to the yeast Rim9 meiotic signal transduction component it has been suggested that PalI might be a membrane sensor for ambient pH [].
Probab=49.88 E-value=1e+02 Score=23.04 Aligned_cols=52 Identities=21% Similarity=0.349 Sum_probs=30.0
Q ss_pred cchHHHHHHHHHHHh-HHHHHHHHHhhcccccCCcccchhHHHHHHHHHHHHHHHHH
Q 032541 76 CGVGWFSFLLGFVFP-LMWYYGTFLYFGNHCRKDPRERAGLAASAIAAMACSVVMLV 131 (138)
Q Consensus 76 ~GiGWflFllGFf~~-ipWYvgafl~~~~~~r~DpRErpGl~AcaIAa~v~tia~~I 131 (138)
......++++|+.+. +......++-+ +.| +|++.+....++.+++..+..++
T Consensus 114 ~~~~~~l~~ia~~~t~l~~~~~~~~~~--~~~--~~~~~~~~~~~~~s~~a~~~~lv 166 (212)
T PF06687_consen 114 LKAMFILYPIAIVFTFLALILSGLLAF--FSR--PRNTILSLVASILSLLAFIFLLV 166 (212)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH--Hcc--chhHHHHHHHHHHHHHHHHHHHH
Confidence 445778889998875 66666444333 232 44456666655555544444443
No 11
>PF08999 SP_C-Propep: Surfactant protein C, N terminal propeptide; InterPro: IPR015091 The N-terminal propeptide of surfactant protein C adopts an alpha-helical structure, with turn and extended regions. Its main function is the stabilisation of metastable surfactant protein C (SP-C), since the latter can irreversibly transform from its native alpha-helical structure to beta-sheet aggregates and form amyloid-like fibrils. The correct intracellular trafficking of proSP-C has also been reported to depend on the propeptide []. ; PDB: 1SPF_A 2YAD_F.
Probab=49.66 E-value=30 Score=25.97 Aligned_cols=36 Identities=19% Similarity=0.383 Sum_probs=18.4
Q ss_pred CCcccccchHHHHHHHHHHHh-HHHHHHHHHhhccccc
Q 032541 70 PLPCFGCGVGWFSFLLGFVFP-LMWYYGTFLYFGNHCR 106 (138)
Q Consensus 70 rLPCcG~GiGWflFllGFf~~-ipWYvgafl~~~~~~r 106 (138)
.+|||++++-=++.|.=.+.- +.=.+|+ |+++....
T Consensus 25 ~iPc~p~~lKrlliivvVvVlvVvvivg~-LLMGLhms 61 (93)
T PF08999_consen 25 GIPCCPVNLKRLLIIVVVVVLVVVVIVGA-LLMGLHMS 61 (93)
T ss_dssp --SSS-SHHHHHHHHHHHHHHHHHHHHHH-HHH-----
T ss_pred CCCccccccceEEEEEEeeehhHHHHHHH-HHHHhhhh
Confidence 799999999888877655554 3334444 44444443
No 12
>COG2738 Predicted Zn-dependent protease [General function prediction only]
Probab=49.58 E-value=22 Score=30.30 Aligned_cols=25 Identities=32% Similarity=0.850 Sum_probs=19.7
Q ss_pred CCcccccchHHHHHHHHHHHh---HHHH
Q 032541 70 PLPCFGCGVGWFSFLLGFVFP---LMWY 94 (138)
Q Consensus 70 rLPCcG~GiGWflFllGFf~~---ipWY 94 (138)
|.--+|-.+.|.+|++|+++. +.|.
T Consensus 121 Pv~~~gSn~a~~l~i~Gil~~~~~ll~l 148 (226)
T COG2738 121 PVANFGSNLAPLLFILGILLGSTGLLWL 148 (226)
T ss_pred ceeccccchhHHHHHHHHHHcchHHHHH
Confidence 444578889999999999994 5663
No 13
>PRK11383 hypothetical protein; Provisional
Probab=49.56 E-value=48 Score=26.59 Aligned_cols=65 Identities=25% Similarity=0.452 Sum_probs=39.4
Q ss_pred cCCCcccccchHHHHHHHHHHHh--HHH----------HHHHHHhhccc----ccCCcccc----h---hHHHHHHHHHH
Q 032541 68 DKPLPCFGCGVGWFSFLLGFVFP--LMW----------YYGTFLYFGNH----CRKDPRER----A---GLAASAIAAMA 124 (138)
Q Consensus 68 ~~rLPCcG~GiGWflFllGFf~~--ipW----------Yvgafl~~~~~----~r~DpREr----p---Gl~AcaIAa~v 124 (138)
++|-|-+ .|+.|+.|+.|.+.- -.| ||.+.+.++.+ +++.-|.+ | .+.+-+=.+++
T Consensus 6 ~~~t~af-~~~sw~al~~g~~~y~iGLwnA~~~LsEKGyY~~vl~lglF~avs~QK~vRD~~egi~vt~~f~~~cw~a~l 84 (145)
T PRK11383 6 STYSPAF-SIVSWIALVGGIVTYLLGLWNAEMQLNEKGYYFAVLVLGLFSAASYQKTVRDKYEGIPTTSIYYMTCLTVFI 84 (145)
T ss_pred CCCcHHH-HHHHHHHHHHHHHHHHHHHhhcccccCcccHHHHHHHHHHHHHHHHHHHHhhcccCCChhHHHHHHHHHHHH
Confidence 5566666 799999999998773 456 55555544443 33444545 4 55555555555
Q ss_pred HHHHHHHHH
Q 032541 125 CSVVMLVIV 133 (138)
Q Consensus 125 ~tia~~Il~ 133 (138)
.++.++.+.
T Consensus 85 ~~i~LL~iG 93 (145)
T PRK11383 85 ISVALLMVG 93 (145)
T ss_pred HHHHHHHHH
Confidence 666555543
No 14
>PF01284 MARVEL: Membrane-associating domain; InterPro: IPR021128 This entry represents the ~130-residue MARVEL (MAL and related proteins for vesicle trafficking and membrane link) domain. The MARVEL domain is a module with a four transmembrane-helix architecture that has been identified in proteins of the myelin and lymphocyte (MAL), physins, gyrins and occludin families. All described MARVEL domain-containing proteins are consistent with the M-shaped topology: four transmembrane-helix region architecture with cytoplasmic N- and C-terminal regions. Their function could be related to cholesterol-rich membrane apposition events in a variety of cellular processes, such as biogenesis of vesicular transport carriers or tight junction regulation [].
Probab=47.07 E-value=92 Score=21.35 Aligned_cols=27 Identities=15% Similarity=0.375 Sum_probs=16.2
Q ss_pred HHHHHHHHHHhHHHHHHHHHhhccccc
Q 032541 80 WFSFLLGFVFPLMWYYGTFLYFGNHCR 106 (138)
Q Consensus 80 WflFllGFf~~ipWYvgafl~~~~~~r 106 (138)
+..++.-.+..+.|.++..++--+..+
T Consensus 77 ~~~~~~~~v~~il~l~a~~~~a~~~~~ 103 (144)
T PF01284_consen 77 LVEFIFDAVFAILWLAAFIALAAYLSD 103 (144)
T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHhcC
Confidence 444555556667777777766644444
No 15
>KOG4788 consensus Members of chemokine-like factor super family and related proteins [Defense mechanisms]
Probab=46.65 E-value=70 Score=24.78 Aligned_cols=57 Identities=18% Similarity=0.273 Sum_probs=37.0
Q ss_pred cchHHHHHHHHHHH--hHHHHHHHHHhhccccc-CCcccch---hHHHHHHHHHHHHHHHHHHHH
Q 032541 76 CGVGWFSFLLGFVF--PLMWYYGTFLYFGNHCR-KDPRERA---GLAASAIAAMACSVVMLVIVV 134 (138)
Q Consensus 76 ~GiGWflFllGFf~--~ipWYvgafl~~~~~~r-~DpRErp---Gl~AcaIAa~v~tia~~Il~~ 134 (138)
.+.+|+.|...+.+ -++-++..+..+ +.+ .++=-+| .++=+.++++++.++..++..
T Consensus 64 ~~~~~~~~vsv~~~i~tl~fl~~~~~~~--~~~~~~~i~wp~~~~l~~~~v~~~~~~i~~~~~~~ 126 (172)
T KOG4788|consen 64 LALAFFEFVSVFAFLLTLAFLILYLTLL--HETIVLPIRWPFLLDLLNLVVALLLFAIASWVLAQ 126 (172)
T ss_pred CcceeeeHHHHHHHHHHHHHHHHHHHHh--hhccccccCcHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 67788887776655 355555555444 444 4455556 888888888877777766554
No 16
>PF09878 DUF2105: Predicted membrane protein (DUF2105); InterPro: IPR019212 This entry represents a protein found in various hypothetical archaeal proteins, has no known function.
Probab=45.50 E-value=26 Score=29.58 Aligned_cols=25 Identities=28% Similarity=0.792 Sum_probs=18.1
Q ss_pred cchHHHHHHHHHHH----hHHHHHHHHHh
Q 032541 76 CGVGWFSFLLGFVF----PLMWYYGTFLY 100 (138)
Q Consensus 76 ~GiGWflFllGFf~----~ipWYvgafl~ 100 (138)
-|+.|.+-++||+. |=-|.++.++-
T Consensus 164 SGiaWalWi~gF~~Ff~~P~~Wl~~L~lA 192 (212)
T PF09878_consen 164 SGIAWALWIAGFIGFFLFPQYWLLALMLA 192 (212)
T ss_pred hhHHHHHHHHHHHHHHHhHHHHHHHHHHH
Confidence 48999999999864 45577666553
No 17
>PF04835 Pox_A9: A9 protein conserved region; InterPro: IPR006920 This entry represents a family of Chordopoxvirus A9 proteins. Chordopoxvirus belongs to the family Poxviridae and is the cause of vertebrate infections [].
Probab=44.27 E-value=41 Score=23.14 Aligned_cols=31 Identities=19% Similarity=0.208 Sum_probs=26.9
Q ss_pred CcccchhHHHHHHHHHHHHHHHHHHHHhhcC
Q 032541 108 DPRERAGLAASAIAAMACSVVMLVIVVFRLM 138 (138)
Q Consensus 108 DpRErpGl~AcaIAa~v~tia~~Il~~~r~~ 138 (138)
|-|-||--.+-.|.-++.+.++..+.|+.++
T Consensus 14 e~k~R~NsF~fViik~vismimylilGi~L~ 44 (54)
T PF04835_consen 14 ENKLRPNSFWFVIIKSVISMIMYLILGIALI 44 (54)
T ss_pred HhhcCCchHHHHHHHHHHHHHHHHHHHHHHh
Confidence 7889999999999999999999888887653
No 18
>PF07954 DUF1689: Protein of unknown function (DUF1689) ; InterPro: IPR012470 Family of fungal proteins with unknown function. A member of this family has been found to localise in the mitochondria [].
Probab=44.18 E-value=21 Score=28.23 Aligned_cols=37 Identities=24% Similarity=0.282 Sum_probs=23.5
Q ss_pred chHHHHHHHHHHHhHHHHHHHHHhhcccccCCcccchhH
Q 032541 77 GVGWFSFLLGFVFPLMWYYGTFLYFGNHCRKDPRERAGL 115 (138)
Q Consensus 77 GiGWflFllGFf~~ipWYvgafl~~~~~~r~DpRErpGl 115 (138)
-.||..|++||+.|..||.-----. -.-.-||.||-+
T Consensus 35 ~~g~~~~~~gF~~Pt~y~~yk~~~~--~gv~~~~~~pfl 71 (152)
T PF07954_consen 35 LGGYGGFMAGFFAPTAYYRYKTGAI--KGVPVPRQKPFL 71 (152)
T ss_pred HHHHHHHHHHHhhHHHHHHHhcccc--cCCcCCccCcch
Confidence 3599999999999988876310000 111356777754
No 19
>cd02435 CCC1 CCC1. CCC1: This domain is present in the CCC1, an iron and manganese transporter of Saccharomyces cerevisiae. CCC1 is a transmembrane protein that is located in the vacuole and transfers the iron and manganese ions from the cytosol to the vacuole. This domain may be unique to certain fungi and plants.
Probab=44.07 E-value=74 Score=26.17 Aligned_cols=16 Identities=25% Similarity=0.588 Sum_probs=11.1
Q ss_pred HHHHHHHHHHHh-HHHH
Q 032541 79 GWFSFLLGFVFP-LMWY 94 (138)
Q Consensus 79 GWflFllGFf~~-ipWY 94 (138)
--++|++|=++| +|++
T Consensus 160 s~lsf~lG~liPLlPy~ 176 (241)
T cd02435 160 IGLSYFIGGLIPLLPYF 176 (241)
T ss_pred HHHHHHHHHHHHHHHHH
Confidence 345688888888 7753
No 20
>COG3671 Predicted membrane protein [Function unknown]
Probab=41.64 E-value=1.7e+02 Score=23.12 Aligned_cols=35 Identities=26% Similarity=0.355 Sum_probs=28.2
Q ss_pred ccCCCcccccchHHHHHHHHHHHhHHHHHHHHHhhcccccCC
Q 032541 67 YDKPLPCFGCGVGWFSFLLGFVFPLMWYYGTFLYFGNHCRKD 108 (138)
Q Consensus 67 ~~~rLPCcG~GiGWflFllGFf~~ipWYvgafl~~~~~~r~D 108 (138)
.+|++|-- -..|+++|+.-++.=.+|+|.- |.++|
T Consensus 19 ~~k~l~~v----vY~Ly~~G~v~git~lvgvi~A---Yv~rd 53 (125)
T COG3671 19 SGKKLPIV----VYILYLLGAVTGITPLVGVIFA---YVNRD 53 (125)
T ss_pred ccccchHH----HHHHHHHHHHHHHHHHHHHHHH---hcccc
Confidence 56788865 8899999999998888888765 56666
No 21
>PF11694 DUF3290: Protein of unknown function (DUF3290); InterPro: IPR021707 This family of proteins with unknown function appears to be restricted to Firmicutes.
Probab=41.05 E-value=1e+02 Score=23.98 Aligned_cols=32 Identities=19% Similarity=0.404 Sum_probs=22.9
Q ss_pred HHHHHHHHHHHhHHHHHHHHHhhcccccCCcccc
Q 032541 79 GWFSFLLGFVFPLMWYYGTFLYFGNHCRKDPRER 112 (138)
Q Consensus 79 GWflFllGFf~~ipWYvgafl~~~~~~r~DpREr 112 (138)
-|+-.++.+++.+.+-+.++.|+ .-|+|-|=|
T Consensus 16 ~~~~~~~i~~ll~~l~~~~~~Y~--r~r~~tKyR 47 (149)
T PF11694_consen 16 DYLRYILIIILLLVLIFFFIKYL--RNRLDTKYR 47 (149)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHH--HhcCcchhh
Confidence 46667777777788888999999 555555444
No 22
>TIGR01191 ccmC heme exporter protein CcmC. This model describes the cyt c biogenesis protein encoded by ccmC in bacteria. It must be noted an arabidopsis, a tritcum and a piscum plant proteins were recognizable in the clade. Quite likely they are of organellar origin. Bacterial c-type cytocromes are located on the periplasmic side of the cytoplasmic membrane. Several gene products encoded in a locus designated as 'ccm' are implicated in the transport and assembly of the functional cytochrome C. This cluster includes genes, ccmA;B;C;D;E;F;G and H. The posttranslational pathway includes the transport of heme moiety, the secretion of the apoprotein and the covalent attachment of the heme with the apoprotein. The proteins ccmA and B represent an ABC transporter; ccmC and D participate in the heme transfer to ccmE, which function as a periplasmic heme chaperone. The presence of ccmF, G and H is suggested to be obligatory for the final functional assembly of cytochrome c.
Probab=40.02 E-value=85 Score=25.03 Aligned_cols=31 Identities=16% Similarity=0.201 Sum_probs=24.1
Q ss_pred CCcccchhHHHHHHHHHHHHHHHHHHHHhhc
Q 032541 107 KDPRERAGLAASAIAAMACSVVMLVIVVFRL 137 (138)
Q Consensus 107 ~DpRErpGl~AcaIAa~v~tia~~Il~~~r~ 137 (138)
.+.|||.|-.|+..+-+.+..++++..+++|
T Consensus 105 ~~~~~~~~r~aAvl~i~gfi~vpi~~~~V~~ 135 (184)
T TIGR01191 105 IDNRDSAAKAAGILCLVGVVNIPIIKFSVEW 135 (184)
T ss_pred ccChhhhhHHHHHHHHHHHHHHHHHHHHHHH
Confidence 5777888888888777777777777788776
No 23
>KOG2927 consensus Membrane component of ER protein translocation complex [Intracellular trafficking, secretion, and vesicular transport]
Probab=39.52 E-value=24 Score=31.89 Aligned_cols=19 Identities=32% Similarity=0.244 Sum_probs=8.9
Q ss_pred cccCCc-ccchhHHHHHHHH
Q 032541 104 HCRKDP-RERAGLAASAIAA 122 (138)
Q Consensus 104 ~~r~Dp-RErpGl~AcaIAa 122 (138)
++.+=| +=|.|.-=.+|++
T Consensus 206 LFPLWP~~mR~gvyY~sig~ 225 (372)
T KOG2927|consen 206 LFPLWPRRMRQGVYYLSIGA 225 (372)
T ss_pred hcccCcHHHhcceeeeecch
Confidence 344334 3467764444433
No 24
>cd02432 Nodulin-21_like_1 Nodulin-21 and CCC1-related protein family. Nodulin-21_like_1: This is a family of proteins closely related to nodulin-21, a plant nodule-specific protein that may be involved in symbiotic nitrogen fixation. This family is also related to CCC1, a yeast vacuole transmembrane protein that functions as an iron and manganese transporter.
Probab=39.20 E-value=1.1e+02 Score=24.80 Aligned_cols=18 Identities=28% Similarity=0.364 Sum_probs=13.4
Q ss_pred chHHHHHHHHHHHh-HHHH
Q 032541 77 GVGWFSFLLGFVFP-LMWY 94 (138)
Q Consensus 77 GiGWflFllGFf~~-ipWY 94 (138)
-.--++|++|=++| +|+.
T Consensus 141 l~s~~sf~lg~liPllpy~ 159 (218)
T cd02432 141 LASAISFSVGALLPLLAIL 159 (218)
T ss_pred HHHHHHHHHHHHHHHHHHH
Confidence 33567899999999 7743
No 25
>PRK09459 pspG phage shock protein G; Reviewed
Probab=38.52 E-value=1.5e+02 Score=21.52 Aligned_cols=10 Identities=30% Similarity=0.763 Sum_probs=7.8
Q ss_pred HHHHHHHHHh
Q 032541 81 FSFLLGFVFP 90 (138)
Q Consensus 81 flFllGFf~~ 90 (138)
++|++|||..
T Consensus 4 llFvl~F~~~ 13 (76)
T PRK09459 4 LLFVIGFFVM 13 (76)
T ss_pred hHHHHHHHHH
Confidence 5788888875
No 26
>TIGR00267 conserved hypothetical protein TIGR00267. This family is represented in three of the first four completed archaeal genomes, with two members in A. fulgidus.
Probab=38.16 E-value=1.2e+02 Score=23.38 Aligned_cols=13 Identities=38% Similarity=0.733 Sum_probs=10.8
Q ss_pred HHHHHHHHHh-HHH
Q 032541 81 FSFLLGFVFP-LMW 93 (138)
Q Consensus 81 flFllGFf~~-ipW 93 (138)
++|++|+++| +|.
T Consensus 97 ls~~~g~liPllp~ 110 (169)
T TIGR00267 97 FSTFMGSFVPVLPF 110 (169)
T ss_pred HHHHHHHHHHHHHH
Confidence 6789999999 774
No 27
>cd02433 Nodulin-21_like_2 Nodulin-21 and CCC1-related protein family. Nodulin-21_like_2: This is a family of proteins closely related to nodulin-21, a plant nodule-specific protein that may be involved in symbiotic nitrogen fixation. This family is also related to CCC1, a yeast vacuole transmembrane protein that functions as an iron and manganese transporter.
Probab=37.98 E-value=1e+02 Score=25.27 Aligned_cols=15 Identities=33% Similarity=0.746 Sum_probs=11.5
Q ss_pred HHHHHHHHHHHh-HHH
Q 032541 79 GWFSFLLGFVFP-LMW 93 (138)
Q Consensus 79 GWflFllGFf~~-ipW 93 (138)
.-++|++|=++| +|.
T Consensus 157 sflsF~ig~liPLLPf 172 (234)
T cd02433 157 SFLLFALGALIPVLPF 172 (234)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 456788999998 774
No 28
>PF03733 DUF307: Domain of unknown function (DUF307); InterPro: IPR005185 This proteins contain a domain which occurs as one or more copies in a small family of putative membrane proteins.
Probab=37.89 E-value=56 Score=21.42 Aligned_cols=24 Identities=25% Similarity=0.713 Sum_probs=19.1
Q ss_pred hHHHHHHHHHHHhHHHHHHHHHhhc
Q 032541 78 VGWFSFLLGFVFPLMWYYGTFLYFG 102 (138)
Q Consensus 78 iGWflFllGFf~~ipWYvgafl~~~ 102 (138)
+-|++ +.|+.+++.|.+++.+.+.
T Consensus 5 ilW~i-~~G~~lal~~~~~~~~~~i 28 (53)
T PF03733_consen 5 ILWFI-FFGWWLALIWLLAGILCCI 28 (53)
T ss_pred HHHHH-HHHHHHHHHHHHHHHHHHH
Confidence 45776 7899999999998888763
No 29
>PRK09554 feoB ferrous iron transport protein B; Reviewed
Probab=37.39 E-value=79 Score=30.14 Aligned_cols=52 Identities=13% Similarity=0.263 Sum_probs=37.1
Q ss_pred chHHHHHHHHHHHhHHHHHHHHHhhcccccCCcccchhHHHHHHHHHHHHHHHHHH
Q 032541 77 GVGWFSFLLGFVFPLMWYYGTFLYFGNHCRKDPRERAGLAASAIAAMACSVVMLVI 132 (138)
Q Consensus 77 GiGWflFllGFf~~ipWYvgafl~~~~~~r~DpRErpGl~AcaIAa~v~tia~~Il 132 (138)
|..|.+|.+++-+.+.|-++++.|-. .+.- ..|+..+.+|++++..++.++.
T Consensus 689 ~~kw~~~~~~~~~~~Ay~~a~~~yq~--~~~~--~~~~~~~~~~~~~~~~~~~~~~ 740 (772)
T PRK09554 689 SRGWMGFSILWGLNIAYSLATLFYQV--ASFS--QHPTYSLVCILAVILFNIVVLG 740 (772)
T ss_pred chHHHHHHHHHHHHHHHHHHHHHHHH--HHHH--hccchHHHHHHHHHHHHHHHHH
Confidence 68999999999999999988888752 2211 3577777777776555555443
No 30
>PF09323 DUF1980: Domain of unknown function (DUF1980); InterPro: IPR015402 Members of this occur in gene pairs with members of PF03773 from PFAM. The N-terminal region contains several predicted transmembrane helix regions while the few invariant residues (G, CxxD, and W) occur in the C-terminal region. Members of this family are found in a set of prokaryotic hypothetical proteins. Their exact function has not, as yet, been defined.
Probab=34.46 E-value=1.3e+02 Score=23.07 Aligned_cols=45 Identities=29% Similarity=0.340 Sum_probs=26.7
Q ss_pred HHHHHHHHHHHhHHHHHH--HHHhhcccccCCcccchhHHHHHHHHHHHHHH
Q 032541 79 GWFSFLLGFVFPLMWYYG--TFLYFGNHCRKDPRERAGLAASAIAAMACSVV 128 (138)
Q Consensus 79 GWflFllGFf~~ipWYvg--afl~~~~~~r~DpRErpGl~AcaIAa~v~tia 128 (138)
-|++.|+||.+-+.++.- -+.++ +.||=.+=+..++|..++.+++
T Consensus 2 ir~liL~~~~~l~~~l~~sG~i~~Y-----I~P~~~~~~~~a~i~l~ilai~ 48 (182)
T PF09323_consen 2 IRFLILLGFGILLFYLILSGKILLY-----IHPRYIPLLYFAAILLLILAIV 48 (182)
T ss_pred HHHHHHHHHHHHHHHHHHhCcHHHH-----hCccHHHHHHHHHHHHHHHHHH
Confidence 477888888776655544 34433 3677776666665555444433
No 31
>KOG2887 consensus Membrane protein involved in ER to Golgi transport [Intracellular trafficking, secretion, and vesicular transport]
Probab=33.90 E-value=45 Score=27.33 Aligned_cols=83 Identities=19% Similarity=0.379 Sum_probs=48.2
Q ss_pred CcccccCCceeeecCCccc--ccccccCCCccccc--chHHHHHHHHHHH-----------hHHHHHHHHHhhcc-----
Q 032541 44 DTADREKGRYTLIRDPENF--QFGIYDKPLPCFGC--GVGWFSFLLGFVF-----------PLMWYYGTFLYFGN----- 103 (138)
Q Consensus 44 d~~~~~~g~Y~lird~e~~--~~g~~~~rLPCcG~--GiGWflFllGFf~-----------~ipWYvgafl~~~~----- 103 (138)
|+.+..++.--...|.++. .+ .+-.|+-|+|+ +.|-++|+++.+. +++|-.|-++.++.
T Consensus 18 d~~~~~~~~~~~~~~~~~~~fsL-s~~qR~~~F~~cl~~gv~c~~l~~~lf~v~~~~~~kFal~~TlGnll~i~sf~fLm 96 (175)
T KOG2887|consen 18 DPGDHQTEERSFTSDLQESTFSL-SRTQRIMGFGICLAGGVLCFLLAMVLFPVLVVSPRKFALLYTLGNLLAIGSFAFLM 96 (175)
T ss_pred CCCccccccccchhhhhhhhccC-cHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccceeehhHHHHHHHHHHHHHHHH
Confidence 6666655555555555433 22 23335555554 4588899988764 34676665544321
Q ss_pred ----ccc--CCcccch---hHHHHHHHHHHHHH
Q 032541 104 ----HCR--KDPRERA---GLAASAIAAMACSV 127 (138)
Q Consensus 104 ----~~r--~DpRErp---Gl~AcaIAa~v~ti 127 (138)
+.+ -||+.+| .++||.++++.+++
T Consensus 97 GP~~ql~~m~~p~Rl~~T~~~l~~~~~Tly~al 129 (175)
T KOG2887|consen 97 GPVSQLKHMFSPERLPATLSYLATMVLTLYVAL 129 (175)
T ss_pred hHHHHHHHhcChhHHHHHHHHHHHHHHHHHHHH
Confidence 222 3776665 67888888876654
No 32
>COG5336 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=33.28 E-value=26 Score=27.25 Aligned_cols=26 Identities=31% Similarity=0.659 Sum_probs=16.0
Q ss_pred cccchHHHHHHHHHHHh-HHHHHHHHHhhc
Q 032541 74 FGCGVGWFSFLLGFVFP-LMWYYGTFLYFG 102 (138)
Q Consensus 74 cG~GiGWflFllGFf~~-ipWYvgafl~~~ 102 (138)
-|.||||++ =-|+. -||..-.|++++
T Consensus 58 VGa~iG~ll---D~~agTsPwglIv~lllG 84 (116)
T COG5336 58 VGAGIGWLL---DKFAGTSPWGLIVFLLLG 84 (116)
T ss_pred HHHHHHHHH---HHhcCCCcHHHHHHHHHH
Confidence 588899975 22333 666666666654
No 33
>PF03845 Spore_permease: Spore germination protein; InterPro: IPR004761 Amino acid permeases are integral membrane proteins involved in the transport of amino acids into the cell. A number of such proteins have been found to be evolutionary related [, , ]. These proteins seem to contain up to 12 transmembrane segments. The best conserved region in this family is located in the second transmembrane segment. Spore germination protein (amino acid permease) is involved in the response to the germinative mixture of L-asparagine, glucose, fructose and potassium ions (AFFK). These proteins could be amino acid transporters.; GO: 0009847 spore germination, 0016021 integral to membrane
Probab=32.84 E-value=1.5e+02 Score=23.74 Aligned_cols=55 Identities=16% Similarity=0.197 Sum_probs=31.5
Q ss_pred hHHHHHHHHHHHhHHHHHHHHHhh--cccccCCccc--chhHHHHHHHHHHHHHHHHHHH
Q 032541 78 VGWFSFLLGFVFPLMWYYGTFLYF--GNHCRKDPRE--RAGLAASAIAAMACSVVMLVIV 133 (138)
Q Consensus 78 iGWflFllGFf~~ipWYvgafl~~--~~~~r~DpRE--rpGl~AcaIAa~v~tia~~Il~ 133 (138)
-||--++-|......||.+..+++ ..+. +|+++ |..+.|..+.++.++...++..
T Consensus 172 ~g~~~i~~~~~~~~~~~~~~~~~l~~~p~~-~~~~~~~k~~~~~~~~~~~~~~~~~~~~i 230 (320)
T PF03845_consen 172 SGIKPILKGSLVISFPFGGIEILLFLFPFV-KDKKKLKKSLLIAILISGLFLLFIIFITI 230 (320)
T ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHHc-CCchHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 356667778666677777664332 2223 34433 6666677676666665554433
No 34
>PF02687 FtsX: FtsX-like permease family; InterPro: IPR003838 This domain is found in predicted permeases and hypothetical transmembrane proteins. P57382 from SWISSPROT has been shown to transport lipids targeted to the outer membrane across the inner membrane. Both P57382 and O54500 from SWISSPROT have been shown to require ATP. This domain contains three transmembrane helices.; GO: 0016020 membrane
Probab=32.19 E-value=70 Score=20.68 Aligned_cols=7 Identities=14% Similarity=0.060 Sum_probs=3.4
Q ss_pred cCCcccc
Q 032541 106 RKDPRER 112 (138)
Q Consensus 106 r~DpREr 112 (138)
..++..=
T Consensus 88 ~~~~~~~ 94 (121)
T PF02687_consen 88 TISPWSF 94 (121)
T ss_pred eeCHHHH
Confidence 3456543
No 35
>PF11511 RhodobacterPufX: Intrinsic membrane protein PufX; InterPro: IPR020169 PufX organises RC-LH1, the photosynthesis reaction centre-light harvesting complex 1 core complex of Rhodobacter sphaeroides []. It also facilitates the exchange of quinol for quinone between the reaction centre and cytochrome bc(1) complexes. In organic solvent, PufX contains two hydrophobic helices which are flanked by unstructured regions and connected by a helical bend [].; PDB: 2DW3_A 2ITA_A 2NRG_A.
Probab=31.59 E-value=42 Score=23.86 Aligned_cols=20 Identities=20% Similarity=0.304 Sum_probs=13.1
Q ss_pred HHHHHHHHhHHHHHHHHHhh
Q 032541 82 SFLLGFVFPLMWYYGTFLYF 101 (138)
Q Consensus 82 lFllGFf~~ipWYvgafl~~ 101 (138)
+|.+.|++...|++|.+|+-
T Consensus 35 ~~~~~~~l~~~~~iG~~LPe 54 (67)
T PF11511_consen 35 FLGLWFLLVALYFIGLLLPE 54 (67)
T ss_dssp HHHHHHHHHHHHHHHHSSTT
T ss_pred HHHHHHHHHHHHHHHHhCch
Confidence 34444444588888888776
No 36
>COG5503 Uncharacterized conserved small protein [Function unknown]
Probab=31.27 E-value=25 Score=25.25 Aligned_cols=19 Identities=32% Similarity=0.600 Sum_probs=15.8
Q ss_pred CCceeeeeeeccccccccC
Q 032541 14 KTGLELVKSVSDKHLDLLR 32 (138)
Q Consensus 14 ~~~~el~~svsdkh~~llr 32 (138)
.-.+|.+.++||+|+|-=+
T Consensus 41 ~yniEFI~~lsd~~L~YEk 59 (69)
T COG5503 41 NYNIEFITPLSDAHLDYEK 59 (69)
T ss_pred CcceEEEeecchhhhhhhh
Confidence 5579999999999998543
No 37
>COG2194 Predicted membrane-associated, metal-dependent hydrolase [General function prediction only]
Probab=30.98 E-value=2e+02 Score=26.71 Aligned_cols=32 Identities=25% Similarity=0.201 Sum_probs=23.7
Q ss_pred chHHHHHHHHHHHhHHHHHHHHHhhcccccCC
Q 032541 77 GVGWFSFLLGFVFPLMWYYGTFLYFGNHCRKD 108 (138)
Q Consensus 77 GiGWflFllGFf~~ipWYvgafl~~~~~~r~D 108 (138)
+-.|+++++.|.+.+.|-..+|.........+
T Consensus 11 ~~~~l~ll~a~~~~l~~n~~~~~~~~~~~~~~ 42 (555)
T COG2194 11 TKLSLSLLLAWYFLLLLNFAFFLQVFLINSLD 42 (555)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhccch
Confidence 44688888888888888888887775555544
No 38
>COG1347 NqrD Na+-transporting NADH:ubiquinone oxidoreductase, subunit NqrD [Energy production and conversion]
Probab=30.93 E-value=30 Score=29.06 Aligned_cols=24 Identities=38% Similarity=0.938 Sum_probs=18.4
Q ss_pred cccCCCccc------ccchHHHHHHHHHHH
Q 032541 66 IYDKPLPCF------GCGVGWFSFLLGFVF 89 (138)
Q Consensus 66 ~~~~rLPCc------G~GiGWflFllGFf~ 89 (138)
+...|+|-+ |+|.||.|..+||+=
T Consensus 123 m~~~Pi~sf~DGignGlGYg~~L~~v~~iR 152 (208)
T COG1347 123 MKSPPIESFLDGIGNGLGYGWMLLVVGFVR 152 (208)
T ss_pred ccCCCcHHHHhhccccccchHHHHHHHHHH
Confidence 344566654 899999999999973
No 39
>PF13886 DUF4203: Domain of unknown function (DUF4203)
Probab=30.49 E-value=2e+02 Score=22.16 Aligned_cols=42 Identities=29% Similarity=0.354 Sum_probs=23.3
Q ss_pred HHHHHHHHh-HHHHHHHHHhhcccccC-CcccchhHHHHHHHHHHHHHHHH
Q 032541 82 SFLLGFVFP-LMWYYGTFLYFGNHCRK-DPRERAGLAASAIAAMACSVVML 130 (138)
Q Consensus 82 lFllGFf~~-ipWYvgafl~~~~~~r~-DpRErpGl~AcaIAa~v~tia~~ 130 (138)
+|+.||++. +..++- +.+. ++....-..|+.+++++..+...
T Consensus 26 ~fl~Gf~~g~~~~~~i-------~~~~~~~~~~~~~~~~~v~g~~~G~i~g 69 (210)
T PF13886_consen 26 MFLSGFLFGSLITFVI-------ILRINVLVSNANLGASVVAGVLGGIILG 69 (210)
T ss_pred HHHHHHHHHHHHHHHH-------HHHhcccchhHHHHHHHHHHHHHHHHHH
Confidence 488888886 444332 2232 33333556777777776654443
No 40
>PF04298 Zn_peptidase_2: Putative neutral zinc metallopeptidase; InterPro: IPR007395 Members of this family of bacterial proteins are described as hypothetical proteins or zinc-dependent proteases. The majority have a HExxH zinc-binding motif characteristic of neutral zinc metallopeptidases, however there is no evidence to support their function as metallopeptidases.
Probab=29.53 E-value=71 Score=26.77 Aligned_cols=23 Identities=22% Similarity=0.652 Sum_probs=16.6
Q ss_pred ccCCCccc--ccchHHHHHHHHHHH
Q 032541 67 YDKPLPCF--GCGVGWFSFLLGFVF 89 (138)
Q Consensus 67 ~~~rLPCc--G~GiGWflFllGFf~ 89 (138)
|..-.|-. |--++|.+|++|+|+
T Consensus 113 Rs~lvP~~~~~s~~~~~l~~~G~~l 137 (222)
T PF04298_consen 113 RSALVPVANIGSNLSWILLILGLFL 137 (222)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33444544 455799999999999
No 41
>PRK00968 tetrahydromethanopterin S-methyltransferase subunit D; Provisional
Probab=28.39 E-value=67 Score=27.67 Aligned_cols=27 Identities=26% Similarity=0.618 Sum_probs=22.1
Q ss_pred CCcc---cchhHHHHHHHHHHHHHHHHHHH
Q 032541 107 KDPR---ERAGLAASAIAAMACSVVMLVIV 133 (138)
Q Consensus 107 ~DpR---ErpGl~AcaIAa~v~tia~~Il~ 133 (138)
+||. -..|.+||.||++++.+..+++.
T Consensus 208 HDPKFKr~p~~vias~vaS~~~gii~v~~~ 237 (240)
T PRK00968 208 HDPKFKRWPRAVIASFVASLVCGIVAVLMI 237 (240)
T ss_pred CCcccccchHHHHHHHHHHHHHHHHHHHHh
Confidence 5774 45799999999999999887764
No 42
>PF06738 DUF1212: Protein of unknown function (DUF1212); InterPro: IPR010619 This entry represents a predicted domain found within a number of hypothetical proteins of unknown function found in eukaryotes, bacteria and archaea. Some of these sequences are predicted to be membrane proteins.
Probab=28.34 E-value=2.3e+02 Score=21.15 Aligned_cols=27 Identities=19% Similarity=0.220 Sum_probs=20.4
Q ss_pred CCcccchhHHHHHHHHHHHHHHHHHHH
Q 032541 107 KDPRERAGLAASAIAAMACSVVMLVIV 133 (138)
Q Consensus 107 ~DpRErpGl~AcaIAa~v~tia~~Il~ 133 (138)
...++.+-.+.-.+||++.+++..++.
T Consensus 145 ~~r~~~~~~~~~~~aa~~~~~~a~~~~ 171 (193)
T PF06738_consen 145 LSRRRLNSFIQEFIAAFLASLLAALLA 171 (193)
T ss_pred HHhccchHHHHHHHHHHHHHHHHHHHH
Confidence 466777888888888888877776654
No 43
>PF02285 COX8: Cytochrome oxidase c subunit VIII; InterPro: IPR003205 Cytochrome c oxidase (1.9.3.1 from EC) is an oligomeric enzymatic complex which is a component of the respiratory chain complex and is involved in the transfer of electrons from cytochrome c to oxygen []. In eukaryotes this enzyme complex is located in the mitochondrial inner membrane; in aerobic prokaryotes it is found in the plasma membrane. In eukaryotes, in addition to the three large subunits, I, II and III, that form the catalytic centre of the enzyme complex, there are a variable number of small polypeptidic subunits.This family is composed of cytochrome c oxidase subunit VIII. ; GO: 0004129 cytochrome-c oxidase activity; PDB: 3AG3_Z 3ABM_M 1OCC_Z 3ASO_Z 3AG2_Z 3ABL_M 3AG4_M 3AG1_M 3ASN_M 1OCZ_M ....
Probab=28.33 E-value=1.2e+02 Score=19.90 Aligned_cols=28 Identities=36% Similarity=0.368 Sum_probs=18.5
Q ss_pred CCcccchhHHHHHHHHHHHHHHHHHHHH
Q 032541 107 KDPRERAGLAASAIAAMACSVVMLVIVV 134 (138)
Q Consensus 107 ~DpRErpGl~AcaIAa~v~tia~~Il~~ 134 (138)
+-||++-|-+-.+|+-.++.+.+++-.+
T Consensus 4 kP~~~~~s~~e~aigltv~f~~~L~Pag 31 (44)
T PF02285_consen 4 KPPREPLSPAEQAIGLTVCFVTFLGPAG 31 (44)
T ss_dssp ---SS---HHHHHHHHHHHHHHHHHHHH
T ss_pred CCCCCCCCHHHHHHHHHHHHHHHHhhHH
Confidence 4689999999999998888888777554
No 44
>PF07234 DUF1426: Protein of unknown function (DUF1426); InterPro: IPR009871 This family consists of several Banana bunchy top virus proteins of around 120 residues in length. Q9IGU4 from SWISSPROT is annotated a movement protein whereas most other family members are hypothetical. The function of this family is unknown.
Probab=27.99 E-value=93 Score=24.20 Aligned_cols=33 Identities=21% Similarity=0.356 Sum_probs=22.7
Q ss_pred hHHHHHHHHHHHh-HHHHHHHHHhhcccccCCcccchhHHHHH
Q 032541 78 VGWFSFLLGFVFP-LMWYYGTFLYFGNHCRKDPRERAGLAASA 119 (138)
Q Consensus 78 iGWflFllGFf~~-ipWYvgafl~~~~~~r~DpRErpGl~Aca 119 (138)
+-||||+-..|.+ -.-|+-.-++| |-|-|+--.
T Consensus 12 FEwFLF~~AIFiAItIlYILLalL~---------EvPkYIK~~ 45 (117)
T PF07234_consen 12 FEWFLFFGAIFIAITILYILLALLF---------EVPKYIKEL 45 (117)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH---------hhHHHHHHH
Confidence 3699999999998 34566555666 666665443
No 45
>PF01102 Glycophorin_A: Glycophorin A; InterPro: IPR001195 Proteins in this group are responsible for the molecular basis of the blood group antigens, surface markers on the outside of the red blood cell membrane. Most of these markers are proteins, but some are carbohydrates attached to lipids or proteins [Reid M.E., Lomas-Francis C. The Blood Group Antigen FactsBook Academic Press, London / San Diego, (1997)]. Glycophorin A (PAS-2) and glycophorin B (PAS-3) belong to the MNS blood group system and are associated with antigens that include M/N, S/s, U, He, Mi(a), M(c), Vw, Mur, M(g), Vr, M(e), Mt(a), St(a), Ri(a), Cl(a), Ny(a), Hut, Hil, M(v), Far, Mit, Dantu, Hop, Nob, En(a), ENKT, amongst others. Glycophorin A is the major sialoglycoprotein of the erythrocyte membrane []. Structurally, glycophorin A consists of an N-terminal extracellular domain, heavily glycosylated on serine and threonine residues, followed by a transmembrane region and a C-terminal cytoplasmic domain. Other glycophorins in this entry such as Glycophorin B and Glycophorin E represent minor sialoglycoproteins in the erythrocyte membrane.; GO: 0016021 integral to membrane; PDB: 2KPF_B 1AFO_B 2KPE_A.
Probab=27.11 E-value=96 Score=23.68 Aligned_cols=10 Identities=10% Similarity=0.414 Sum_probs=4.7
Q ss_pred HHHHHHHHhH
Q 032541 82 SFLLGFVFPL 91 (138)
Q Consensus 82 lFllGFf~~i 91 (138)
+.++|-.+++
T Consensus 68 ~Ii~gv~aGv 77 (122)
T PF01102_consen 68 GIIFGVMAGV 77 (122)
T ss_dssp HHHHHHHHHH
T ss_pred ehhHHHHHHH
Confidence 3444555543
No 46
>PF05478 Prominin: Prominin; InterPro: IPR008795 The prominins are an emerging family of proteins that, among the multispan membrane proteins, display a novel topology. Mouse and Homo sapiens prominin and (Mus musculus) prominin-like 1 (PROML1) are predicted to contain five membrane spanning domains, with an N-terminal domain exposed to the extracellular space followed by four, alternating small cytoplasmic and large extracellular, loops and a cytoplasmic C-terminal domain []. The exact function of prominin is unknown although in humans defects in PROM1, the gene coding for prominin, cause retinal degeneration [].; GO: 0016021 integral to membrane
Probab=26.82 E-value=2.4e+02 Score=26.72 Aligned_cols=22 Identities=27% Similarity=0.535 Sum_probs=12.3
Q ss_pred HHHHHHh-HHHHHHHHHhhcccc
Q 032541 84 LLGFVFP-LMWYYGTFLYFGNHC 105 (138)
Q Consensus 84 llGFf~~-ipWYvgafl~~~~~~ 105 (138)
++|.+|. ++=.+|.+.-+|.|-
T Consensus 98 ~i~ll~~il~P~vg~~fCcCRCc 120 (806)
T PF05478_consen 98 VIGLLFIILMPLVGLCFCCCRCC 120 (806)
T ss_pred HHHHHHHHHHHHHHHHHhccccC
Confidence 4465554 555667766555443
No 47
>TIGR00927 2A1904 K+-dependent Na+/Ca+ exchanger.
Probab=26.68 E-value=1.5e+02 Score=30.38 Aligned_cols=13 Identities=38% Similarity=0.715 Sum_probs=8.4
Q ss_pred ccccchHHHHHHH
Q 032541 73 CFGCGVGWFSFLL 85 (138)
Q Consensus 73 CcG~GiGWflFll 85 (138)
+.|+|+-|+++.+
T Consensus 1012 llgLGlPWlI~~l 1024 (1096)
T TIGR00927 1012 TVGLPVPWLLFSL 1024 (1096)
T ss_pred eeeccHHHHHHHH
Confidence 3567788876544
No 48
>COG4036 Predicted membrane protein [Function unknown]
Probab=26.26 E-value=80 Score=26.83 Aligned_cols=25 Identities=28% Similarity=0.808 Sum_probs=19.1
Q ss_pred cchHHHHHHHHHHH----hHHHHHHHHHh
Q 032541 76 CGVGWFSFLLGFVF----PLMWYYGTFLY 100 (138)
Q Consensus 76 ~GiGWflFllGFf~----~ipWYvgafl~ 100 (138)
-|++|.+.+.||.. |=.|..+.|+-
T Consensus 166 SGi~WalWvaGF~~FF~~P~~WLlaL~ma 194 (224)
T COG4036 166 SGIGWALWVAGFSTFFLHPKAWLLALIMA 194 (224)
T ss_pred chHHHHHHHHHHHHHHhcHHHHHHHHHHc
Confidence 38999999988732 67798877654
No 49
>COG1457 CodB Purine-cytosine permease and related proteins [Nucleotide transport and metabolism]
Probab=26.01 E-value=2e+02 Score=26.23 Aligned_cols=56 Identities=14% Similarity=0.168 Sum_probs=44.3
Q ss_pred HHHHHHHHHHHhHHHHHHHHHhhcccccCCcccc--hhHHHHHHHHHHHHHHHHHHHH
Q 032541 79 GWFSFLLGFVFPLMWYYGTFLYFGNHCRKDPRER--AGLAASAIAAMACSVVMLVIVV 134 (138)
Q Consensus 79 GWflFllGFf~~ipWYvgafl~~~~~~r~DpREr--pGl~AcaIAa~v~tia~~Il~~ 134 (138)
.|-.|+.++-+.+-|+.+.--+-.-|.|.-|++| .--.++.++..+.+..++++.+
T Consensus 191 ~~~~fl~a~slv~g~~~sw~~~~aDysRy~~~~t~~~~~~~~~~G~~l~~~~~~ilGa 248 (442)
T COG1457 191 SPLSFLSALSLVIGSFASWGPYAADYSRYAPSPTPSKAFLAAVLGFFLGTSFMMILGA 248 (442)
T ss_pred cchhHHHHHHHHHHHHHhhhhhhhhhhhhcCCCchHHHHHHHHHHHHHHHHHHHHHHH
Confidence 5888999998989999988888899999999999 5555566666666666666554
No 50
>KOG1172 consensus Na+-independent Cl/HCO3 exchanger AE1 and related transporters (SLC4 family) [Inorganic ion transport and metabolism]
Probab=25.86 E-value=23 Score=35.01 Aligned_cols=27 Identities=26% Similarity=0.338 Sum_probs=22.6
Q ss_pred cccchHHHHHHHHHHH------hHHHHHHHHHh
Q 032541 74 FGCGVGWFSFLLGFVF------PLMWYYGTFLY 100 (138)
Q Consensus 74 cG~GiGWflFllGFf~------~ipWYvgafl~ 100 (138)
=|+|.=|=+|++|+.. ++||+.||..-
T Consensus 664 KgsgyH~DLlllgil~~icsllGLPw~~~a~p~ 696 (876)
T KOG1172|consen 664 KGSGYHLDLLLLGILTLICSLLGLPWSNAATVQ 696 (876)
T ss_pred CCcchhHHHHHHHHHHHHHHhcCCCcccccccc
Confidence 4789999999999963 59999998753
No 51
>PRK10429 melibiose:sodium symporter; Provisional
Probab=25.81 E-value=1.9e+02 Score=24.23 Aligned_cols=12 Identities=42% Similarity=0.459 Sum_probs=9.5
Q ss_pred CCcccchhHHHH
Q 032541 107 KDPRERAGLAAS 118 (138)
Q Consensus 107 ~DpRErpGl~Ac 118 (138)
.||+||..+.+.
T Consensus 136 ~~~~eR~~l~~~ 147 (473)
T PRK10429 136 LDKREREQLVPY 147 (473)
T ss_pred CCHHHHHHHHHH
Confidence 599999986654
No 52
>PF11377 DUF3180: Protein of unknown function (DUF3180); InterPro: IPR021517 Some members in this family of proteins are annotated as membrane proteins however this cannot be confirmed. Currently there is no known function.
Probab=25.71 E-value=1.7e+02 Score=22.20 Aligned_cols=12 Identities=33% Similarity=0.858 Sum_probs=9.5
Q ss_pred HHHHHHHHHhhc
Q 032541 91 LMWYYGTFLYFG 102 (138)
Q Consensus 91 ipWYvgafl~~~ 102 (138)
--||.|..+|+.
T Consensus 87 ~G~~~G~~~~~l 98 (138)
T PF11377_consen 87 AGWYAGQLVYLL 98 (138)
T ss_pred HHHHHHHHHHHH
Confidence 358999999984
No 53
>PF09948 DUF2182: Predicted metal-binding integral membrane protein (DUF2182); InterPro: IPR018688 This family of various hypothetical bacterial membrane proteins having predicted metal-binding properties has no known function.
Probab=25.67 E-value=1.5e+02 Score=23.88 Aligned_cols=48 Identities=27% Similarity=0.437 Sum_probs=30.8
Q ss_pred cccccccCCCcccccchHHHHHHHHHHHh---HHHHHHHHHhhcccccCCcccch
Q 032541 62 FQFGIYDKPLPCFGCGVGWFSFLLGFVFP---LMWYYGTFLYFGNHCRKDPRERA 113 (138)
Q Consensus 62 ~~~g~~~~rLPCcG~GiGWflFllGFf~~---ipWYvgafl~~~~~~r~DpRErp 113 (138)
.+.|+++ -+=|.| .-|-+.++=|..+ +.|-.+.-++.- .-|..|+.+.
T Consensus 124 lr~Gl~h-G~~CvG--CCWaLMllmfv~G~mnl~wMa~lt~~~~-~EK~~p~g~~ 174 (191)
T PF09948_consen 124 LRMGLRH-GLYCVG--CCWALMLLMFVVGVMNLAWMAALTALMF-AEKLLPWGRR 174 (191)
T ss_pred HHHHHHH-ccHHHH--HHHHHHHHHHHhccccHHHHHHHHHHHH-HHHhCCcchH
Confidence 3444443 234543 3699999999997 889887766661 3445676553
No 54
>PF14023 DUF4239: Protein of unknown function (DUF4239)
Probab=25.60 E-value=1.9e+02 Score=22.08 Aligned_cols=21 Identities=29% Similarity=0.360 Sum_probs=8.6
Q ss_pred ccchhHHHHHHHHHHHHHHHH
Q 032541 110 RERAGLAASAIAAMACSVVML 130 (138)
Q Consensus 110 RErpGl~AcaIAa~v~tia~~ 130 (138)
+.++-+++.++.+...+.+++
T Consensus 165 ~~~~~~~~~~l~a~~i~~~l~ 185 (209)
T PF14023_consen 165 NRRAHLIAIALFAASIALALF 185 (209)
T ss_pred chhHHHHHHHHHHHHHHHHHH
Confidence 334444444444433333333
No 55
>PF03631 Virul_fac_BrkB: Virulence factor BrkB; InterPro: IPR017039 This entry represents the uncharacterised protein family UPF0761. It includes the E. coli gene product of yihY, and was previously thought to be a family of tRNA-processing ribonuclease BN proteins []. This has been shown to be incorrect [].; GO: 0004540 ribonuclease activity
Probab=25.40 E-value=3.2e+02 Score=21.17 Aligned_cols=50 Identities=14% Similarity=0.168 Sum_probs=25.1
Q ss_pred hHHHHHHHHHHHhHHHHHHHHHhhcccccCCcccchhHHHHHHHHHHHHH
Q 032541 78 VGWFSFLLGFVFPLMWYYGTFLYFGNHCRKDPRERAGLAASAIAAMACSV 127 (138)
Q Consensus 78 iGWflFllGFf~~ipWYvgafl~~~~~~r~DpRErpGl~AcaIAa~v~ti 127 (138)
.+.+.++.-++...-+.-+.---+-+-++.+|+|+......-+-+++.++
T Consensus 72 ~~~i~~~~ll~~a~~~~~~l~~a~~~i~~~~~~~~r~~~~~~~~~~~~~i 121 (260)
T PF03631_consen 72 LGLIGILILLWSASSFFASLQRALNRIYGVPPRERRSFWKRRLIALLFLI 121 (260)
T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccCCHHHHHHHHHHHHH
Confidence 34444444345555555554444444566666776555544444443333
No 56
>PF03729 DUF308: Short repeat of unknown function (DUF308); InterPro: IPR005325 This represents a group of short repeats that occurs in a limited number of membrane proteins. It may divide further in short repeats of around 7-10 residues of the pattern G-#-X(2)-#(2)-X (#=hydrophobic).
Probab=25.34 E-value=1.7e+02 Score=17.84 Aligned_cols=22 Identities=9% Similarity=0.090 Sum_probs=11.5
Q ss_pred CcccchhHHHHHHHHHHHHHHH
Q 032541 108 DPRERAGLAASAIAAMACSVVM 129 (138)
Q Consensus 108 DpRErpGl~AcaIAa~v~tia~ 129 (138)
+.+.+...+...+..+++.+.+
T Consensus 49 ~~~~~~~~l~~gi~~i~~Gi~~ 70 (72)
T PF03729_consen 49 GSKGWWWSLLSGILSIVLGIIL 70 (72)
T ss_pred cchhhHHHHHHHHHHHHHHHHH
Confidence 3345555555555555555444
No 57
>PF14017 DUF4233: Protein of unknown function (DUF4233)
Probab=24.82 E-value=96 Score=22.99 Aligned_cols=18 Identities=28% Similarity=0.827 Sum_probs=10.3
Q ss_pred HHHHHHHHHHhHHHHHHH
Q 032541 80 WFSFLLGFVFPLMWYYGT 97 (138)
Q Consensus 80 WflFllGFf~~ipWYvga 97 (138)
|..|++|..|...|.++.
T Consensus 77 p~m~vvG~iF~~~W~~~l 94 (107)
T PF14017_consen 77 PAMFVVGVIFAAVWWYAL 94 (107)
T ss_pred HHHHHHHHHHHHHHHHHH
Confidence 344666666666665543
No 58
>PF01036 Bac_rhodopsin: Bacteriorhodopsin-like protein; InterPro: IPR001425 The bacterial opsins are retinal-binding proteins that provide light- dependent ion transport and sensory functions to a family of halophilic bacteria [, ]. They are integral membrane proteins believed to contain seven transmembrane (TM) domains, the last of which contains the attachment point for retinal (a conserved lysine). There are several classes of these bacterial proteins: they include bacteriorhodopsin and archaerhodopsin, which are light-driven proton pumps; halorhodopsin, a light-driven chloride pump; and sensory rhodopsin, which mediates both photoattractant (in the red) and photophobic (in the UV) responses.; GO: 0005216 ion channel activity, 0006811 ion transport, 0016020 membrane; PDB: 3QBI_B 3QBK_D 3QBL_D 3QBG_B 3AM6_D 1UAZ_B 1E12_A 2JAF_A 2JAG_A 3UG9_A ....
Probab=24.54 E-value=2e+02 Score=22.51 Aligned_cols=46 Identities=22% Similarity=0.231 Sum_probs=24.4
Q ss_pred HHHHHHHHHHhHHHHHHHHHhhccccc-CCcccchhHHHHHHHHHHHHHH
Q 032541 80 WFSFLLGFVFPLMWYYGTFLYFGNHCR-KDPRERAGLAASAIAAMACSVV 128 (138)
Q Consensus 80 WflFllGFf~~ipWYvgafl~~~~~~r-~DpRErpGl~AcaIAa~v~tia 128 (138)
|..|-+++.+-+ ++++.++..-.| .+++.|..+..++.-..+.+++
T Consensus 1 ~~~~~v~~~~~~---~~~l~f~~~~~~~~~~~~R~~~~~~~~i~~iaa~a 47 (222)
T PF01036_consen 1 WTWFWVFAAAML---VSTLFFLLWSRRVTSPRKRYFYYLSALITGIAAIA 47 (222)
T ss_dssp HHHHHHHHHHHH---HHHHHHHHHHTTSSTTHHHHHHHHHHHHHHHHHHH
T ss_pred CHHHHHHHHHHH---HHHHHHHHHHhcCCCccchhHHHHHHHHHHHHHHH
Confidence 445555544432 344444433556 4777788776665544444444
No 59
>PRK09597 lipid A 1-phosphatase; Reviewed
Probab=24.29 E-value=32 Score=28.06 Aligned_cols=17 Identities=24% Similarity=0.253 Sum_probs=14.9
Q ss_pred ccchHHHHHHHHHHHhH
Q 032541 75 GCGVGWFSFLLGFVFPL 91 (138)
Q Consensus 75 G~GiGWflFllGFf~~i 91 (138)
=+|+||.+-|+|.|+|.
T Consensus 22 ~~~~~~~~~~~~~~~~~ 38 (190)
T PRK09597 22 LLALSLGLILLGIFAPF 38 (190)
T ss_pred HHHHHHHHHHHHhccCC
Confidence 47999999999999874
No 60
>PRK10714 undecaprenyl phosphate 4-deoxy-4-formamido-L-arabinose transferase; Provisional
Probab=24.19 E-value=3.7e+02 Score=22.21 Aligned_cols=31 Identities=16% Similarity=0.211 Sum_probs=17.8
Q ss_pred ccCCCcccccchHHHHHHHHHHHhHHHHHHHH
Q 032541 67 YDKPLPCFGCGVGWFSFLLGFVFPLMWYYGTF 98 (138)
Q Consensus 67 ~~~rLPCcG~GiGWflFllGFf~~ipWYvgaf 98 (138)
..+||--+ ..+|-++|++||++.+.+.+.-+
T Consensus 227 s~~Plr~~-~~~g~~~~~~~~~~~~~~~~~~~ 257 (325)
T PRK10714 227 TTTPLRLL-SLLGSIIAIGGFSLAVLLVVLRL 257 (325)
T ss_pred chhhHHHH-HHHHHHHHHHHHHHHHHHHHHHH
Confidence 45566555 34566677777776654444333
No 61
>PF05255 UPF0220: Uncharacterised protein family (UPF0220); InterPro: IPR007919 This family of proteins is functionally uncharacterised.
Probab=24.17 E-value=1.4e+02 Score=23.58 Aligned_cols=55 Identities=13% Similarity=0.141 Sum_probs=31.8
Q ss_pred HHHHHHHHHHHhHHHHHHHHHhhccccc-CCcccchhHHHHHHHHHHHHHHHHHHHHh
Q 032541 79 GWFSFLLGFVFPLMWYYGTFLYFGNHCR-KDPRERAGLAASAIAAMACSVVMLVIVVF 135 (138)
Q Consensus 79 GWflFllGFf~~ipWYvgafl~~~~~~r-~DpRErpGl~AcaIAa~v~tia~~Il~~~ 135 (138)
.+.+.+-|++|.+-|.+-.=-.. ++. .++.+-+=--+--+-.++.|+.+++++.+
T Consensus 23 ~~~~~~AGaLF~~gwWi~iDa~v--~s~~~~~~~~~~~f~~~ipgI~stlgm~mvN~V 78 (166)
T PF05255_consen 23 AIGSYVAGALFALGWWIFIDAAV--YSKHANGSDVHVTFVDWIPGIFSTLGMFMVNSV 78 (166)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHH--hCCccCCCCccccceeeehHHHHHHHHHHhccc
Confidence 67889999999988887554444 222 22222111122233445667777777765
No 62
>cd02434 Nodulin-21_like_3 Nodulin-21 and CCC1-related protein family. Nodulin-21_like_3: This is a family of proteins closely related to nodulin-21, a plant nodule-specific protein that may be involved in symbiotic nitrogen fixation. This family is also related to CCC1, a yeast vacuole transmembrane protein that functions as an iron and manganese transporter.
Probab=23.67 E-value=2.1e+02 Score=23.13 Aligned_cols=16 Identities=31% Similarity=0.552 Sum_probs=12.2
Q ss_pred HHHHHHHHHHHh-HHHH
Q 032541 79 GWFSFLLGFVFP-LMWY 94 (138)
Q Consensus 79 GWflFllGFf~~-ipWY 94 (138)
--++|++|=++| +|++
T Consensus 141 sflsf~~ggliPLlp~~ 157 (225)
T cd02434 141 TFLSFLVFGIIPLLPYL 157 (225)
T ss_pred HHHHHHHHHHHHHHHHH
Confidence 556788998998 7754
No 63
>PF14362 DUF4407: Domain of unknown function (DUF4407)
Probab=23.59 E-value=3.5e+02 Score=21.95 Aligned_cols=25 Identities=20% Similarity=0.319 Sum_probs=15.5
Q ss_pred chHHHHHHHHHHHhHHHHHHHHHhh
Q 032541 77 GVGWFSFLLGFVFPLMWYYGTFLYF 101 (138)
Q Consensus 77 GiGWflFllGFf~~ipWYvgafl~~ 101 (138)
|+|=..|+.+.+-.+-+++++.-++
T Consensus 17 ~~G~~vl~ta~la~~s~~~a~~~~~ 41 (301)
T PF14362_consen 17 GIGAAVLFTALLAGLSGGYALYTVF 41 (301)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 4566666666666666666665555
No 64
>COG2364 Predicted membrane protein [Function unknown]
Probab=23.47 E-value=1.2e+02 Score=25.59 Aligned_cols=36 Identities=33% Similarity=0.589 Sum_probs=23.8
Q ss_pred cccchHHHHHHHHHHHh-HHHHHHHHHhhcccccCCcccchhHHHHHHHH
Q 032541 74 FGCGVGWFSFLLGFVFP-LMWYYGTFLYFGNHCRKDPRERAGLAASAIAA 122 (138)
Q Consensus 74 cG~GiGWflFllGFf~~-ipWYvgafl~~~~~~r~DpRErpGl~AcaIAa 122 (138)
.|+-+||.++++|+++- .-|.. . ||||++.+--.+-
T Consensus 48 ~gLtvG~wsi~l~~~li~~~~~~-----l--------r~~~~Lg~lln~l 84 (210)
T COG2364 48 FGLTVGSWSIILGSCLIGCTWIL-----L--------RKKPGLGTLLNAL 84 (210)
T ss_pred cCcceeeHHHHHHHHHHHHHHHH-----H--------hcchhHHHHHHHH
Confidence 57778988888887763 44421 1 7899887755443
No 65
>KOG0581 consensus Mitogen-activated protein kinase kinase (MAP2K) [Signal transduction mechanisms]
Probab=23.38 E-value=30 Score=31.05 Aligned_cols=39 Identities=28% Similarity=0.344 Sum_probs=23.8
Q ss_pred hHHHHHHHHHHH-h---HH---HHHHHHHhhcccccCCcccchhHH
Q 032541 78 VGWFSFLLGFVF-P---LM---WYYGTFLYFGNHCRKDPRERAGLA 116 (138)
Q Consensus 78 iGWflFllGFf~-~---ip---WYvgafl~~~~~~r~DpRErpGl~ 116 (138)
.+||..+--... | +| |-=-.--+...|.|+|||||+..-
T Consensus 285 ~~~~~Ll~~Iv~~ppP~lP~~~fS~ef~~FV~~CL~Kdp~~R~s~~ 330 (364)
T KOG0581|consen 285 LDIFELLCAIVDEPPPRLPEGEFSPEFRSFVSCCLRKDPSERPSAK 330 (364)
T ss_pred CCHHHHHHHHhcCCCCCCCcccCCHHHHHHHHHHhcCCcccCCCHH
Confidence 477777666665 2 33 221122334449999999999753
No 66
>PRK09669 putative symporter YagG; Provisional
Probab=23.22 E-value=2.2e+02 Score=23.33 Aligned_cols=10 Identities=60% Similarity=0.983 Sum_probs=7.4
Q ss_pred cCCcccchhH
Q 032541 106 RKDPRERAGL 115 (138)
Q Consensus 106 r~DpRErpGl 115 (138)
..||+||.-+
T Consensus 138 t~~~~eR~~l 147 (444)
T PRK09669 138 TNDPRERHSL 147 (444)
T ss_pred cCCHHHHHHH
Confidence 4699999844
No 67
>PRK11770 hypothetical protein; Provisional
Probab=22.96 E-value=1.2e+02 Score=23.55 Aligned_cols=20 Identities=30% Similarity=0.672 Sum_probs=10.0
Q ss_pred HHHHHHHHHHHhHHHHHHHHH
Q 032541 79 GWFSFLLGFVFPLMWYYGTFL 99 (138)
Q Consensus 79 GWflFllGFf~~ipWYvgafl 99 (138)
-||.| .||..++-|+.+..+
T Consensus 8 lW~i~-gG~~~al~~~~~g~l 27 (135)
T PRK11770 8 IWFVL-GGFWTALGWLLAGLV 27 (135)
T ss_pred HHHHH-HHHHHHHHHHHHHHH
Confidence 45442 455555555555443
No 68
>COG1814 Uncharacterized membrane protein [Function unknown]
Probab=22.51 E-value=3.7e+02 Score=21.57 Aligned_cols=17 Identities=29% Similarity=0.848 Sum_probs=10.3
Q ss_pred HHHHHHHHHHHh-HHHHH
Q 032541 79 GWFSFLLGFVFP-LMWYY 95 (138)
Q Consensus 79 GWflFllGFf~~-ipWYv 95 (138)
-=++|++|-++| +|-|+
T Consensus 149 sg~s~~~G~l~Pllp~~~ 166 (229)
T COG1814 149 SGISFIIGALLPLLPFFF 166 (229)
T ss_pred HHHHHHHHHHHHHHHHHH
Confidence 345677777777 55443
No 69
>TIGR02847 CyoD cytochrome o ubiquinol oxidase subunit IV. Cytochrome o terminal oxidase complex is the component of the aerobic respiratory chain which reacts with oxygen, reducing it to water with the concomitant transport of 4 protons across the membrane. Also known as the cytochrome bo complex, cytochrome o ubiquinol oxidase contains four subunits, two heme b cofactors and a copper atom which is believed to be the oxygen active site. This complex is structurally related to the cytochrome caa3 oxidases which utilize cytochrome c as the reductant and contain heme a cofactors, as well as the intermediate form aa3 oxidases which also react directly with quinones as the reductant.
Probab=22.10 E-value=1.4e+02 Score=21.77 Aligned_cols=33 Identities=3% Similarity=-0.063 Sum_probs=19.6
Q ss_pred cccCCcccchhHHHHHHHHHHHHHHHHHHHHhh
Q 032541 104 HCRKDPRERAGLAASAIAAMACSVVMLVIVVFR 136 (138)
Q Consensus 104 ~~r~DpRErpGl~AcaIAa~v~tia~~Il~~~r 136 (138)
+.|.|.++++++--.+..-.+..+++++...+|
T Consensus 54 FlHl~~~~~~~~n~~~l~Ft~~i~~iiv~GSiW 86 (96)
T TIGR02847 54 FLHLNTSSEQRWNLISLLFTILIIFILIGGSIW 86 (96)
T ss_pred HhhccCccccchHHHHHHHHHHHHHHHHHHHHH
Confidence 568888888887655544444444444455444
No 70
>PF13430 DUF4112: Domain of unknown function (DUF4112)
Probab=21.74 E-value=1.6e+02 Score=21.53 Aligned_cols=53 Identities=19% Similarity=0.374 Sum_probs=28.5
Q ss_pred ccCCCcccc--cchHHHHHHHHHHHh-----HHHHHHHHHhhcccccCCcccchhHHHHHHHHHHHHHH
Q 032541 67 YDKPLPCFG--CGVGWFSFLLGFVFP-----LMWYYGTFLYFGNHCRKDPRERAGLAASAIAAMACSVV 128 (138)
Q Consensus 67 ~~~rLPCcG--~GiGWflFllGFf~~-----ipWYvgafl~~~~~~r~DpRErpGl~AcaIAa~v~tia 128 (138)
.|+...||| .++||= .|+|++ | +...++..++. ...|+ |+-....+.+++-++
T Consensus 13 lD~~~~i~g~~~~~Gld-~iiglI-P~vGD~~~~~~s~~iv~-~a~~~------g~p~~l~~~M~~Ni~ 72 (106)
T PF13430_consen 13 LDRAFRIPGTNFRFGLD-PIIGLI-PVVGDIISALLSLYIVY-EARRL------GLPKWLLARMLFNIL 72 (106)
T ss_pred HhcccCCCCCCcccchH-HHHHHh-ccHhHHHHHHHHHHHHH-HHHHc------CCCHHHHHHHHHHHH
Confidence 355667777 888985 567876 4 34444444443 12222 444445555544433
No 71
>PLN02715 lipid phosphate phosphatase
Probab=21.60 E-value=2.9e+02 Score=24.06 Aligned_cols=25 Identities=16% Similarity=0.558 Sum_probs=17.9
Q ss_pred ccccccccCCCcccccchHHHHHHHHHHHhH
Q 032541 61 NFQFGIYDKPLPCFGCGVGWFSFLLGFVFPL 91 (138)
Q Consensus 61 ~~~~g~~~~rLPCcG~GiGWflFllGFf~~i 91 (138)
+-+-|..+...| .|.++++++++|+
T Consensus 81 ~i~yP~~~~tVp------~~~l~vi~~liPi 105 (327)
T PLN02715 81 DLKYPFKDNTVP------IWSVPVYAVLLPI 105 (327)
T ss_pred hccCCCCCCccc------HHHHHHHHHHHHH
Confidence 334455556666 7999999998885
No 72
>PF13779 DUF4175: Domain of unknown function (DUF4175)
Probab=21.23 E-value=2.7e+02 Score=27.30 Aligned_cols=22 Identities=18% Similarity=0.342 Sum_probs=14.5
Q ss_pred HHHHHHHHHHHhHHHHHHHHHhh
Q 032541 79 GWFSFLLGFVFPLMWYYGTFLYF 101 (138)
Q Consensus 79 GWflFllGFf~~ipWYvgafl~~ 101 (138)
-+++.++|+|+.+.| .|.+..+
T Consensus 8 ~p~~~v~~lflal~~-lGl~~~l 29 (820)
T PF13779_consen 8 WPLLSVLALFLALSW-LGLWDLL 29 (820)
T ss_pred HHHHHHHHHHHHHHH-HhHHHhc
Confidence 566667777776665 4666665
No 73
>PF01988 VIT1: VIT family; InterPro: IPR008217 Proteins containing this entry have no known function and are predicted to be integral membrane proteins. They include the Ccc1 protein from Saccharomyces cerevisiae (Baker's yeast) (P47818 from SWISSPROT) that may have a role in regulating calcium levels [].
Probab=21.04 E-value=3.8e+02 Score=20.98 Aligned_cols=15 Identities=40% Similarity=0.844 Sum_probs=9.8
Q ss_pred HHHHHHHHHHHh-HHH
Q 032541 79 GWFSFLLGFVFP-LMW 93 (138)
Q Consensus 79 GWflFllGFf~~-ipW 93 (138)
.-++|++|-++| +|.
T Consensus 138 ~~~sf~lg~liPllp~ 153 (213)
T PF01988_consen 138 TFLSFILGGLIPLLPY 153 (213)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 445677777777 554
No 74
>PF07213 DAP10: DAP10 membrane protein; InterPro: IPR009861 This family consists of several mammalian DAP10 membrane proteins. In activated mouse natural killer (NK) cells, the NKG2D receptor associates with two intracellular adaptors, DAP10 and DAP12, which trigger phosphatidyl inositol 3 kinase (PI3K) and Syk family protein tyrosine kinases, respectively. It has been suggested that the DAP10-PI3K pathway is sufficient to initiate NKG2D-mediated killing of target cells [].
Probab=20.87 E-value=1.4e+02 Score=21.73 Aligned_cols=6 Identities=50% Similarity=1.287 Sum_probs=2.8
Q ss_pred cccccc
Q 032541 72 PCFGCG 77 (138)
Q Consensus 72 PCcG~G 77 (138)
+|||||
T Consensus 24 scs~C~ 29 (79)
T PF07213_consen 24 SCSGCY 29 (79)
T ss_pred CCCCcc
Confidence 455543
No 75
>PF06210 DUF1003: Protein of unknown function (DUF1003); InterPro: IPR010406 This entry consists of several hypothetical bacterial proteins of unknown function.
Probab=20.68 E-value=3.2e+02 Score=20.22 Aligned_cols=46 Identities=22% Similarity=0.243 Sum_probs=23.4
Q ss_pred HHHHHHHh-HHHHHHHHHhhcccccCCccc--chhHHHHHHHHHHHHHHH
Q 032541 83 FLLGFVFP-LMWYYGTFLYFGNHCRKDPRE--RAGLAASAIAAMACSVVM 129 (138)
Q Consensus 83 FllGFf~~-ipWYvgafl~~~~~~r~DpRE--rpGl~AcaIAa~v~tia~ 129 (138)
|+++|..- +.|-+.-.+.... .+-||-- =--++.|..|++.+++.+
T Consensus 6 Fi~~~~~~~~~Wi~~N~~~~~~-~~fDpyPFilLnl~lS~~Aa~~ap~Il 54 (108)
T PF06210_consen 6 FIIIFTVFLAVWILLNILAPPR-PAFDPYPFILLNLVLSLEAAYQAPLIL 54 (108)
T ss_pred HHHHHHHHHHHHHHHHhhcccc-CCCCCccHHHHHHHHHHHHHHHHHHHH
Confidence 44444443 5676655554421 3557532 224556666666555444
No 76
>TIGR02975 phageshock_pspG phage shock protein G. This protein previously was designated yjbO in E. coli. It is found only in genomes that have the phage shock operon (psp), but only rarely is encoded near other psp genes. The psp regulon is upregulated in response to a number of stress conditions, including ethanol, expression of the filamentous phage secretin protein IV and other secretins, and heat shock.
Probab=20.55 E-value=3.2e+02 Score=19.33 Aligned_cols=10 Identities=40% Similarity=0.813 Sum_probs=6.8
Q ss_pred HHHHHHHHHh
Q 032541 81 FSFLLGFVFP 90 (138)
Q Consensus 81 flFllGFf~~ 90 (138)
++|++||+..
T Consensus 3 liFvl~F~~~ 12 (64)
T TIGR02975 3 LIFVLGFFVM 12 (64)
T ss_pred ehHHHHHHHH
Confidence 3677787765
No 77
>PRK14398 membrane protein; Provisional
Probab=20.54 E-value=1e+02 Score=24.94 Aligned_cols=30 Identities=10% Similarity=0.193 Sum_probs=20.9
Q ss_pred HHHHHHHHHHh-HHHHHHHHHhhcccccCCcccch
Q 032541 80 WFSFLLGFVFP-LMWYYGTFLYFGNHCRKDPRERA 113 (138)
Q Consensus 80 WflFllGFf~~-ipWYvgafl~~~~~~r~DpRErp 113 (138)
+.++++|++++ +|+-+=. ++...+|.|+.-
T Consensus 4 ~~~~~~~YllGsip~~~li----~k~~g~DiR~~G 34 (191)
T PRK14398 4 YIVLILSYILGSIPFSLII----TKIKGINLREVG 34 (191)
T ss_pred HHHHHHHHHHHhhHHHHHH----HHHcCCCccccC
Confidence 45678899997 8885432 234667999853
No 78
>TIGR01571 A_thal_Cys_rich uncharacterized Cys-rich domain. This model describes an uncharacterized domain of about 100 residues. It is common in plants but found also in Homo sapiens, Dictyostelium, and Leishmania; at least 12 distinct members are found in Arabidopsis. Most members of this family contain more than 10 per cent Cys, but no Cys residue is invariant across the family.
Probab=20.44 E-value=41 Score=23.96 Aligned_cols=39 Identities=18% Similarity=0.261 Sum_probs=22.7
Q ss_pred CcccccchHHHHHHHHHHHhHHHHHHHHHhhcccccCCcccchhHHH
Q 032541 71 LPCFGCGVGWFSFLLGFVFPLMWYYGTFLYFGNHCRKDPRERAGLAA 117 (138)
Q Consensus 71 LPCcG~GiGWflFllGFf~~ipWYvgafl~~~~~~r~DpRErpGl~A 117 (138)
.+|...| ++..+++.+++++|.+.. .+|..-|||-|+-.
T Consensus 40 ~~C~~~~--~~~~~~~~~~~~~~~~~~------~~R~~~R~ry~i~g 78 (104)
T TIGR01571 40 GECLCGG--LTAIAMSALCGFCGCYTC------FIRIKLREKYGIQG 78 (104)
T ss_pred CchhhHH--HHHHHHHHHHhHHHHHHH------HHHHHHHHHhCCCC
Confidence 4676555 333344445556665432 46778888877654
No 79
>PF09583 Phageshock_PspG: Phage shock protein G (Phageshock_PspG); InterPro: IPR014318 This protein previously was designated yjbO in Escherichia coli. It is found only in genomes that have the phage shock operon (psp), but it is only rarely encoded near other psp genes. The psp regulon is upregulated in response to a number of stress conditions, including ethanol, expression of the filamentous phage secretin protein IV and other secretins and heat shock.
Probab=20.35 E-value=3.3e+02 Score=19.35 Aligned_cols=10 Identities=40% Similarity=0.733 Sum_probs=7.2
Q ss_pred HHHHHHHHHh
Q 032541 81 FSFLLGFVFP 90 (138)
Q Consensus 81 flFllGFf~~ 90 (138)
++|++||+..
T Consensus 4 liFvl~F~~~ 13 (65)
T PF09583_consen 4 LIFVLGFFAM 13 (65)
T ss_pred HHHHHHHHHH
Confidence 4688888765
No 80
>PF13664 DUF4149: Domain of unknown function (DUF4149)
Probab=20.09 E-value=2.9e+02 Score=18.57 Aligned_cols=12 Identities=33% Similarity=0.673 Sum_probs=6.4
Q ss_pred cccchhHHHHHH
Q 032541 109 PRERAGLAASAI 120 (138)
Q Consensus 109 pRErpGl~AcaI 120 (138)
||+..|-++..+
T Consensus 26 ~~~~ag~i~~~l 37 (101)
T PF13664_consen 26 PRQQAGKIQGKL 37 (101)
T ss_pred CHHHHHHHHHHH
Confidence 555555555444
Done!