Query psy6104
Match_columns 118
No_of_seqs 41 out of 43
Neff 3.3
Searched_HMMs 46136
Date Fri Aug 16 17:41:51 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy6104.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/6104hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1420|consensus 100.0 5.7E-35 1.2E-39 262.5 6.3 113 5-118 17-133 (1103)
2 PF07204 Orthoreo_P10: Orthore 76.3 2.1 4.5E-05 31.5 2.1 28 33-66 41-69 (98)
3 PRK04960 universal stress prot 72.9 7.2 0.00016 29.2 4.2 17 24-42 80-96 (111)
4 PF12606 RELT: Tumour necrosis 63.6 9.6 0.00021 24.7 2.9 23 35-58 3-25 (50)
5 PF14898 DUF4491: Domain of un 59.5 9.1 0.0002 27.9 2.5 22 29-50 31-52 (94)
6 MTH00180 ND4L NADH dehydrogena 59.0 13 0.00029 26.0 3.2 24 32-55 1-24 (99)
7 PF11947 DUF3464: Protein of u 54.4 13 0.00028 28.8 2.7 18 34-51 97-114 (153)
8 PF11151 DUF2929: Protein of u 54.2 13 0.00028 24.2 2.3 18 32-49 7-24 (57)
9 MTH00015 ND6 NADH dehydrogenas 42.1 34 0.00074 25.5 3.3 23 33-57 48-70 (155)
10 PF10779 XhlA: Haemolysin XhlA 38.1 42 0.00092 22.0 2.9 21 30-50 50-70 (71)
11 MTH00001 ND4L NADH dehydrogena 37.3 49 0.0011 23.1 3.3 23 33-55 2-24 (99)
12 PF11255 DUF3054: Protein of u 36.8 38 0.00082 24.6 2.7 22 34-58 90-111 (112)
13 COG1786 Swiveling domain assoc 36.7 18 0.00039 27.8 1.0 13 104-116 44-56 (131)
14 TIGR03715 KxYKxGKxW KxYKxGKxW 35.7 42 0.00092 18.8 2.3 17 32-48 11-27 (29)
15 MTH00166 ND6 NADH dehydrogenas 35.6 50 0.0011 24.6 3.3 22 33-56 47-68 (160)
16 PF06678 DUF1179: Protein of u 34.0 84 0.0018 23.3 4.1 27 35-65 5-31 (103)
17 KOG1278|consensus 32.2 11 0.00024 35.2 -0.9 31 28-58 260-292 (628)
18 PF13962 PGG: Domain of unknow 30.6 72 0.0016 22.2 3.2 24 32-55 51-74 (113)
19 PF03729 DUF308: Short repeat 30.5 61 0.0013 19.6 2.6 22 30-51 50-71 (72)
20 PF04120 Iron_permease: Low af 29.5 61 0.0013 24.5 2.9 26 33-58 7-32 (132)
21 KOG3188|consensus 29.3 51 0.0011 27.6 2.6 24 31-54 11-34 (246)
22 PF01989 DUF126: Protein of un 28.3 31 0.00067 24.1 1.0 13 104-116 22-34 (82)
23 PF07243 Phlebovirus_G1: Phleb 28.0 90 0.002 28.8 4.1 26 34-59 421-446 (526)
24 MTH00212 ND6 NADH dehydrogenas 27.8 80 0.0017 23.9 3.3 23 33-57 48-70 (160)
25 KOG4482|consensus 27.4 1.1E+02 0.0024 27.7 4.5 31 39-69 296-328 (449)
26 MTH00152 ND6 NADH dehydrogenas 25.8 91 0.002 23.4 3.3 22 33-56 48-69 (163)
27 PF15159 PIG-Y: Phosphatidylin 25.7 1E+02 0.0022 21.1 3.2 21 34-54 4-24 (72)
28 PF11188 DUF2975: Protein of u 25.5 1.2E+02 0.0027 20.6 3.7 25 34-58 99-123 (136)
29 PF11807 DUF3328: Domain of un 24.5 1.4E+02 0.003 21.4 3.9 28 31-58 8-35 (217)
30 cd01356 AcnX_swivel Putative A 22.9 44 0.00096 25.0 1.1 13 104-116 37-49 (123)
31 PF05510 Sarcoglycan_2: Sarcog 22.7 88 0.0019 27.6 3.0 29 39-67 284-314 (386)
32 TIGR03017 EpsF chain length de 22.6 1.2E+02 0.0026 25.3 3.6 22 31-52 12-33 (444)
33 PF07589 VPEP: PEP-CTERM motif 22.2 1.4E+02 0.003 16.5 2.7 15 39-53 6-20 (25)
34 PF12729 4HB_MCP_1: Four helix 22.2 2.5E+02 0.0055 18.8 4.9 31 28-58 4-34 (181)
35 PF13334 DUF4094: Domain of un 22.1 98 0.0021 21.9 2.6 11 40-50 9-19 (95)
36 PF05151 PsbM: Photosystem II 21.7 1.9E+02 0.0041 17.3 3.4 21 34-54 6-26 (31)
37 KOG1500|consensus 21.5 47 0.001 30.1 1.1 12 105-116 423-434 (517)
38 TIGR02848 spore_III_AC stage I 21.5 1E+02 0.0022 21.1 2.5 15 42-56 34-48 (64)
39 PTZ00443 Thioredoxin domain-co 21.2 2.1E+02 0.0047 22.8 4.7 23 34-56 177-199 (224)
40 PF10805 DUF2730: Protein of u 21.1 1.3E+02 0.0028 21.3 3.1 27 30-57 5-31 (106)
41 PF09301 DUF1970: Domain of un 20.3 88 0.0019 23.2 2.2 14 45-58 11-24 (117)
42 PF04689 S1FA: DNA binding pro 20.3 1.3E+02 0.0028 21.0 2.8 17 37-53 20-36 (69)
43 PF05767 Pox_A14: Poxvirus vir 20.3 2.1E+02 0.0046 20.8 4.1 26 39-64 47-74 (92)
44 PF00858 ASC: Amiloride-sensit 20.2 1.7E+02 0.0037 23.4 3.9 30 29-58 21-50 (439)
No 1
>KOG1420|consensus
Probab=100.00 E-value=5.7e-35 Score=262.47 Aligned_cols=113 Identities=39% Similarity=0.584 Sum_probs=96.5
Q ss_pred cccccccccc-eeeeccCCc---hhhHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCCCCCCchhhhHHH
Q psy6104 5 CCVLCKPTES-ILISTATSD---IDICLQERKWWCFLLSSIFTFILGLVIVLLWRLFAFICCRKEPELSPNDPKQKEQKA 80 (118)
Q Consensus 5 ~~~~~~~~~~-~~~~~~~~~---~~~C~~~R~~waFLlSSivTF~gGLliIliwR~i~~~cc~k~~~~~~~~~~~~~~~~ 80 (118)
-+++|+.+.+ ....+++++ ...|++.|||||||+||+|||||||||||+||.++|+||++.+...++.++|+.++.
T Consensus 17 ~~s~~~~ss~~s~~~ep~~~~~~~~~~m~vr~~w~fl~ss~~tf~~gl~iillwr~~~~~~~~~~~~~g~~~~~qk~~~~ 96 (1103)
T KOG1420|consen 17 SSSSSSSSSSSSSVHEPKMDALIIPVTMEVRMWWAFLASSMVTFFGGLFIILLWRTLKYLWTVCCHCGGKTKEAQKINNG 96 (1103)
T ss_pred CCCCCCcccccccccCCccCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHheeeeccCCCCCcHhhhhhcc
Confidence 3556655432 233444444 679999999999999999999999999999999999999999988888888888877
Q ss_pred HhccccceeeeehhhhhhHHHhHhhhccccceEEEEeC
Q psy6104 81 IRQGKQEFEGTFMTEAKDWAGELISGQTTTGRILVPNV 118 (118)
Q Consensus 81 ~~~~~~~~evgwmt~~KDwag~LISaQT~tGRILVVLV 118 (118)
+.|.. +.|+||||++|||||+||||||+|||||||||
T Consensus 97 s~~~~-~~e~~~mt~akdwagelisgqtltgr~lvvlv 133 (1103)
T KOG1420|consen 97 SSQAD-AAEVGWMTSAKDWAGELISGQTLTGRVLVVLV 133 (1103)
T ss_pred cchhh-hhhheeeEEehhhhcceeecccccceeeehhH
Confidence 76654 69999999999999999999999999999997
No 2
>PF07204 Orthoreo_P10: Orthoreovirus membrane fusion protein p10; InterPro: IPR009854 This family consists of several Orthoreovirus membrane fusion protein p10 sequences. p10 is thought to be a multifunctional protein that plays a key role in virus-host interaction [].
Probab=76.32 E-value=2.1 Score=31.48 Aligned_cols=28 Identities=32% Similarity=0.878 Sum_probs=19.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhhhc-cCC
Q psy6104 33 WWCFLLSSIFTFILGLVIVLLWRLFAFICCR-KEP 66 (118)
Q Consensus 33 ~waFLlSSivTF~gGLliIliwR~i~~~cc~-k~~ 66 (118)
||.+|+.- ||+++|+|. +.-..||+ |+|
T Consensus 41 yWpyLA~G-----GG~iLilIi-i~Lv~CC~~K~K 69 (98)
T PF07204_consen 41 YWPYLAAG-----GGLILILII-IALVCCCRAKHK 69 (98)
T ss_pred hhHHhhcc-----chhhhHHHH-HHHHHHhhhhhh
Confidence 79998876 888888888 44445555 554
No 3
>PRK04960 universal stress protein UspB; Provisional
Probab=72.87 E-value=7.2 Score=29.24 Aligned_cols=17 Identities=35% Similarity=0.647 Sum_probs=14.1
Q ss_pred hhhHHhHHHHHHHHHHHHH
Q psy6104 24 IDICLQERKWWCFLLSSIF 42 (118)
Q Consensus 24 ~~~C~~~R~~waFLlSSiv 42 (118)
.++|+..|++ |+|||-.
T Consensus 80 i~kCervR~~--FiL~s~L 96 (111)
T PRK04960 80 IRRCERVRRQ--FILTSAL 96 (111)
T ss_pred HHHHHHHHHH--HHHHHHH
Confidence 7899888886 8888865
No 4
>PF12606 RELT: Tumour necrosis factor receptor superfamily member 19; InterPro: IPR022248 The members of tumor necrosis factor receptor (TNFR) superfamily have been designated as the "guardians of the immune system" due to their roles in immune cell proliferation, differentiation, activation, and death (apoptosis). RELT (receptor expressed in lymphoid tissues) is a member of the TNFR superfamily. The messenger RNA of RELT is especially abundant in hematologic tissues such as spleen, lymph node, and peripheral blood leukocytes as well as in leukemias and lymphomas. RELT is able to activate the NF-kappaB pathway and selectively binds tumor necrosis factor receptor-associated factor 1 []. RELT like proteins 1 and 2 (RELL1 and RELL2) are two RELT homologues that bind to RELT. The expression of RELL1 at the mRNA level is ubiquitous, whereas expression of RELL2 mRNA is more restricted to particular tissues [].
Probab=63.59 E-value=9.6 Score=24.65 Aligned_cols=23 Identities=30% Similarity=0.622 Sum_probs=16.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHH
Q psy6104 35 CFLLSSIFTFILGLVIVLLWRLFA 58 (118)
Q Consensus 35 aFLlSSivTF~gGLliIliwR~i~ 58 (118)
+|++=|++ |..|++.++++-+.+
T Consensus 3 ~~~iV~i~-iv~~lLg~~I~~~~K 25 (50)
T PF12606_consen 3 AFLIVSIF-IVMGLLGLSICTTLK 25 (50)
T ss_pred ehHHHHHH-HHHHHHHHHHHHHhh
Confidence 45665554 778899999998874
No 5
>PF14898 DUF4491: Domain of unknown function (DUF4491)
Probab=59.54 E-value=9.1 Score=27.88 Aligned_cols=22 Identities=36% Similarity=0.587 Sum_probs=18.7
Q ss_pred hHHHHHHHHHHHHHHHHHHHHH
Q psy6104 29 QERKWWCFLLSSIFTFILGLVI 50 (118)
Q Consensus 29 ~~R~~waFLlSSivTF~gGLli 50 (118)
++|.||.||+..++.-.++|++
T Consensus 31 g~~~W~~FL~~Gi~~~~~Sl~~ 52 (94)
T PF14898_consen 31 GTRIWPIFLLAGIACIIASLFV 52 (94)
T ss_pred CCCcHHHHHHHHHHHHHHHHHH
Confidence 6799999999999887777765
No 6
>MTH00180 ND4L NADH dehydrogenase subunit 4L; Provisional
Probab=59.03 E-value=13 Score=25.97 Aligned_cols=24 Identities=33% Similarity=0.631 Sum_probs=21.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHH
Q psy6104 32 KWWCFLLSSIFTFILGLVIVLLWR 55 (118)
Q Consensus 32 ~~waFLlSSivTF~gGLliIliwR 55 (118)
+|+.|+.-+...|+.|++-++..|
T Consensus 1 ~~~~~l~~~~~lf~~gl~~~~~~r 24 (99)
T MTH00180 1 MYYEYLTVGIILFILGVLGIVLNR 24 (99)
T ss_pred CcHHHHHHHHHHHHHHHHHHHHHH
Confidence 478899999999999999998855
No 7
>PF11947 DUF3464: Protein of unknown function (DUF3464); InterPro: IPR021855 This family of proteins are functionally uncharacterised. This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 137 to 196 amino acids in length.
Probab=54.40 E-value=13 Score=28.84 Aligned_cols=18 Identities=44% Similarity=0.781 Sum_probs=15.8
Q ss_pred HHHHHHHHHHHHHHHHHH
Q psy6104 34 WCFLLSSIFTFILGLVIV 51 (118)
Q Consensus 34 waFLlSSivTF~gGLliI 51 (118)
|+.++.|+++|..||+=|
T Consensus 97 ~~~~~~S~~~Fg~gllGi 114 (153)
T PF11947_consen 97 WAVLLVSLVFFGLGLLGI 114 (153)
T ss_pred hHHHHHHHHHHHHHHHhh
Confidence 889999999999998753
No 8
>PF11151 DUF2929: Protein of unknown function (DUF2929); InterPro: IPR021324 This family of proteins with unknown function appears to be restricted to Firmicutes.
Probab=54.18 E-value=13 Score=24.19 Aligned_cols=18 Identities=22% Similarity=0.665 Sum_probs=14.5
Q ss_pred HHHHHHHHHHHHHHHHHH
Q psy6104 32 KWWCFLLSSIFTFILGLV 49 (118)
Q Consensus 32 ~~waFLlSSivTF~gGLl 49 (118)
.-|+||++-++.|.+|=+
T Consensus 7 ~fWs~il~~vvgyI~ssL 24 (57)
T PF11151_consen 7 FFWSFILGEVVGYIGSSL 24 (57)
T ss_pred hHHHHHHHHHHHHHHHHH
Confidence 359999999999988743
No 9
>MTH00015 ND6 NADH dehydrogenase subunit 6; Validated
Probab=42.07 E-value=34 Score=25.53 Aligned_cols=23 Identities=22% Similarity=0.775 Sum_probs=18.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH
Q psy6104 33 WWCFLLSSIFTFILGLVIVLLWRLF 57 (118)
Q Consensus 33 ~waFLlSSivTF~gGLliIliwR~i 57 (118)
|+.|++ +..|.|||++++++=.-
T Consensus 48 W~syil--fLI~iGGmLVlF~Y~~s 70 (155)
T MTH00015 48 WFAFLT--FLIYVGGMLVMFAYFLA 70 (155)
T ss_pred HHHHHH--HHHHHhHHHHHHHHHHH
Confidence 888887 68899999999886443
No 10
>PF10779 XhlA: Haemolysin XhlA; InterPro: IPR019715 Haemolysin XhlA is a cell-surface associated haemolysin that lyses the two most prevalent types of insect immune cells (granulocytes and plasmatocytes) as well as rabbit and horse erythrocytes [].
Probab=38.06 E-value=42 Score=22.02 Aligned_cols=21 Identities=29% Similarity=0.529 Sum_probs=16.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHH
Q psy6104 30 ERKWWCFLLSSIFTFILGLVI 50 (118)
Q Consensus 30 ~R~~waFLlSSivTF~gGLli 50 (118)
+.-.|-++.+.+++|++++++
T Consensus 50 ~kW~~r~iiGaiI~~i~~~i~ 70 (71)
T PF10779_consen 50 TKWIWRTIIGAIITAIIYLII 70 (71)
T ss_pred HHHHHHHHHHHHHHHHHHHHh
Confidence 444578999999999988874
No 11
>MTH00001 ND4L NADH dehydrogenase subunit 4L; Provisional
Probab=37.33 E-value=49 Score=23.13 Aligned_cols=23 Identities=26% Similarity=0.639 Sum_probs=18.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Q psy6104 33 WWCFLLSSIFTFILGLVIVLLWR 55 (118)
Q Consensus 33 ~waFLlSSivTF~gGLliIliwR 55 (118)
|+.|+.-+++-|+.|++-++..|
T Consensus 2 ~~~~l~~~~~lf~~gl~g~~~~r 24 (99)
T MTH00001 2 YYKYLILVVILFLLGIWGIILNR 24 (99)
T ss_pred cHHHHHHHHHHHHHHHHHHHHHH
Confidence 56777778889999999888865
No 12
>PF11255 DUF3054: Protein of unknown function (DUF3054); InterPro: IPR021414 Some members in this family of proteins are annotated as membrane proteins however this cannot be confirmed. Currently no function is known.
Probab=36.75 E-value=38 Score=24.63 Aligned_cols=22 Identities=32% Similarity=0.862 Sum_probs=18.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH
Q psy6104 34 WCFLLSSIFTFILGLVIVLLWRLFA 58 (118)
Q Consensus 34 waFLlSSivTF~gGLliIliwR~i~ 58 (118)
|.|++ |++..++++++-||.+.
T Consensus 90 ~~Fii---Va~~~~~vlL~gWR~~~ 111 (112)
T PF11255_consen 90 WSFII---VALVFLAVLLLGWRAVA 111 (112)
T ss_pred cchHH---HHHHHHHHHHHHHHHHh
Confidence 66776 99999999999999874
No 13
>COG1786 Swiveling domain associated with predicted aconitase [Energy production and conversion]
Probab=36.75 E-value=18 Score=27.82 Aligned_cols=13 Identities=46% Similarity=0.782 Sum_probs=11.6
Q ss_pred hhhccccceEEEE
Q psy6104 104 ISGQTTTGRILVP 116 (118)
Q Consensus 104 ISaQT~tGRILVV 116 (118)
+.||.++|||||.
T Consensus 44 l~G~~l~Gkilv~ 56 (131)
T COG1786 44 LHGESLTGKILVF 56 (131)
T ss_pred cccccccceEEEe
Confidence 6799999999985
No 14
>TIGR03715 KxYKxGKxW KxYKxGKxW signal peptide. This model describes a novel form of signal peptide that occurs as an N-terminal domain with a recognizable motif, reminiscent of the YSIRK and PEP-CTERM forms of signal peptide. This domain tends to occur on long, low-complexity (usually Serine-rich and heavily glycosylated) proteins of the Firmicutes, and (as with YSIRK) the majority of these proteins have the LPXTG cell wall-anchoring motif at the C-terminus.
Probab=35.69 E-value=42 Score=18.79 Aligned_cols=17 Identities=24% Similarity=0.456 Sum_probs=13.6
Q ss_pred HHHHHHHHHHHHHHHHH
Q psy6104 32 KWWCFLLSSIFTFILGL 48 (118)
Q Consensus 32 ~~waFLlSSivTF~gGL 48 (118)
|.|+|...+.++++++.
T Consensus 11 K~Wv~a~~~~~~l~~~~ 27 (29)
T TIGR03715 11 KQWVFAAITTLALAGGA 27 (29)
T ss_pred cHHHHHHHHHHHHHHhc
Confidence 67999988888877764
No 15
>MTH00166 ND6 NADH dehydrogenase subunit 6; Provisional
Probab=35.63 E-value=50 Score=24.60 Aligned_cols=22 Identities=18% Similarity=0.892 Sum_probs=16.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHH
Q psy6104 33 WWCFLLSSIFTFILGLVIVLLWRL 56 (118)
Q Consensus 33 ~waFLlSSivTF~gGLliIliwR~ 56 (118)
|+.|++ +..|.||+++++++=.
T Consensus 47 W~syiL--fLI~iGGmlVlF~Y~~ 68 (160)
T MTH00166 47 WFSYIL--FLIFLGGMLVLFIYIT 68 (160)
T ss_pred HHHHHH--HHHHHhHHHHHHHHHH
Confidence 666665 6789999999887543
No 16
>PF06678 DUF1179: Protein of unknown function (DUF1179); InterPro: IPR009564 This family consists of several hypothetical Caenorhabditis elegans proteins of around 106 residues in length. The function of the family is unknown.
Probab=33.97 E-value=84 Score=23.32 Aligned_cols=27 Identities=33% Similarity=0.668 Sum_probs=20.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhhhccC
Q psy6104 35 CFLLSSIFTFILGLVIVLLWRLFAFICCRKE 65 (118)
Q Consensus 35 aFLlSSivTF~gGLliIliwR~i~~~cc~k~ 65 (118)
.||+.+|+- .|++++|+.-++ .|.+|.
T Consensus 5 ~~i~~~i~l--~g~~lll~~~iv--~C~sKK 31 (103)
T PF06678_consen 5 WFILKSIFL--FGPFLLLIVSIV--QCQSKK 31 (103)
T ss_pred HHHHHHHHH--hhHHHHHHHHHH--Hhhhcc
Confidence 588888765 788888888887 566653
No 17
>KOG1278|consensus
Probab=32.17 E-value=11 Score=35.20 Aligned_cols=31 Identities=23% Similarity=0.499 Sum_probs=26.4
Q ss_pred HhHHHHHHHHHHH--HHHHHHHHHHHHHHHHHH
Q psy6104 28 LQERKWWCFLLSS--IFTFILGLVIVLLWRLFA 58 (118)
Q Consensus 28 ~~~R~~waFLlSS--ivTF~gGLliIliwR~i~ 58 (118)
.+..++|-.|+-| +|+|+.|++.+.+.|.++
T Consensus 260 ~~~qIhWfSIiNSlvIVlfLSgiv~mI~lRtl~ 292 (628)
T KOG1278|consen 260 EDVQIHWFSIINSLVIVLFLSGIVAMIMLRTLY 292 (628)
T ss_pred CCCceEEEehhhhHHHHHHHHHHHHHHHHHHHH
Confidence 3567889999877 578999999999999996
No 18
>PF13962 PGG: Domain of unknown function
Probab=30.62 E-value=72 Score=22.16 Aligned_cols=24 Identities=25% Similarity=0.469 Sum_probs=21.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHH
Q psy6104 32 KWWCFLLSSIFTFILGLVIVLLWR 55 (118)
Q Consensus 32 ~~waFLlSSivTF~gGLliIliwR 55 (118)
.+++|+.+.-+.|+.++..+++.-
T Consensus 51 ~f~~F~~~nt~af~~S~~~i~~l~ 74 (113)
T PF13962_consen 51 AFKAFLISNTIAFFSSLAAIFLLI 74 (113)
T ss_pred hhhhHHHHHHHHHHHHHHHHHHHH
Confidence 789999999999999999888765
No 19
>PF03729 DUF308: Short repeat of unknown function (DUF308); InterPro: IPR005325 This represents a group of short repeats that occurs in a limited number of membrane proteins. It may divide further in short repeats of around 7-10 residues of the pattern G-#-X(2)-#(2)-X (#=hydrophobic).
Probab=30.54 E-value=61 Score=19.63 Aligned_cols=22 Identities=36% Similarity=1.055 Sum_probs=18.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHH
Q psy6104 30 ERKWWCFLLSSIFTFILGLVIV 51 (118)
Q Consensus 30 ~R~~waFLlSSivTF~gGLliI 51 (118)
.+.||-.++++++....|.+++
T Consensus 50 ~~~~~~~l~~gi~~i~~Gi~~l 71 (72)
T PF03729_consen 50 SKGWWWSLLSGILSIVLGIILL 71 (72)
T ss_pred chhhHHHHHHHHHHHHHHHHHH
Confidence 4577888999999999998875
No 20
>PF04120 Iron_permease: Low affinity iron permease ; InterPro: IPR007251 Although originally identified as a low-affinity iron(II) permease [, ], Fet4 has since been shown to import several other transition metal ions, including copper [, ] and zinc []. Copper, cobalt, and cadmium inhibit Fet4 [, ]. Fet4 is an integral protein of the plasma membrane [, ]. FET4 is not essential, not even in fet3 fet4 double mutants []. Over expression of FET4 improves growth under alkaline conditions []. Transcription of FET4 is induced by Aft1 in response to low levels of iron [, , ] or by Zap1 in response to low zinc [, ], but not in response to low copper []. When the high-affinity iron permease component Fet3 is deleted, FET4 is induced by the addition of copper, zinc, cobalt, or manganese []. It is also induced under anaerobic conditions [, , ] and repressed by Rox1 in aerobic conditions [, ]. Rox1 attenuates the activation of FET4 by Aft1 or Zap1 []. ; GO: 0055085 transmembrane transport
Probab=29.51 E-value=61 Score=24.50 Aligned_cols=26 Identities=15% Similarity=0.680 Sum_probs=21.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy6104 33 WWCFLLSSIFTFILGLVIVLLWRLFA 58 (118)
Q Consensus 33 ~waFLlSSivTF~gGLliIliwR~i~ 58 (118)
+.+=.+||..+|..++++|+.|=+..
T Consensus 7 ~is~~~gs~~~f~~~~~~Ii~W~i~G 32 (132)
T PF04120_consen 7 WISDVAGSPWAFVIAVAVIIVWAISG 32 (132)
T ss_pred HHHHHHCCHHHHHHHHHHHHHHHHHh
Confidence 34567799999999999999998874
No 21
>KOG3188|consensus
Probab=29.26 E-value=51 Score=27.64 Aligned_cols=24 Identities=21% Similarity=0.568 Sum_probs=19.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHH
Q psy6104 31 RKWWCFLLSSIFTFILGLVIVLLW 54 (118)
Q Consensus 31 R~~waFLlSSivTF~gGLliIliw 54 (118)
=++|.||=-|+|||+.|.+==.+-
T Consensus 11 iR~WVlLPI~ivm~liGilRhyvs 34 (246)
T KOG3188|consen 11 IRYWVLLPIVIVMFLIGILRHYVS 34 (246)
T ss_pred hhhhhHHHHHHHHHHHHHHHHHHH
Confidence 357999999999999998744433
No 22
>PF01989 DUF126: Protein of unknown function DUF126; InterPro: IPR002840 These archaebacterial proteins have no known function.; PDB: 2HI6_A.
Probab=28.30 E-value=31 Score=24.09 Aligned_cols=13 Identities=46% Similarity=0.787 Sum_probs=9.4
Q ss_pred hhhccccceEEEE
Q psy6104 104 ISGQTTTGRILVP 116 (118)
Q Consensus 104 ISaQT~tGRILVV 116 (118)
+.||...|||||.
T Consensus 22 l~G~si~gkILv~ 34 (82)
T PF01989_consen 22 LEGQSIAGKILVF 34 (82)
T ss_dssp STT-B-TTSEEEE
T ss_pred cCCCcccCeEEEe
Confidence 4799999999985
No 23
>PF07243 Phlebovirus_G1: Phlebovirus glycoprotein G1; InterPro: IPR010826 This domain is found in several Phlebovirus glycoprotein G1 sequences. Members of the Bunyaviridae family acquire an envelope by budding through the lipid bilayer of the Golgi complex. The budding compartment is thought to be determined by the accumulation of the two heterodimeric membrane glycoproteins G1 and G2 in the Golgi [].; GO: 0016021 integral to membrane, 0019012 virion
Probab=27.99 E-value=90 Score=28.78 Aligned_cols=26 Identities=15% Similarity=0.440 Sum_probs=17.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy6104 34 WCFLLSSIFTFILGLVIVLLWRLFAF 59 (118)
Q Consensus 34 waFLlSSivTF~gGLliIliwR~i~~ 59 (118)
=||++|.++.-+.=|++.+|+|++.+
T Consensus 421 SAlvVStliss~iylil~IL~K~L~~ 446 (526)
T PF07243_consen 421 SALVVSTLISSLIYLILSILSKVLYF 446 (526)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 35666666666666667777777743
No 24
>MTH00212 ND6 NADH dehydrogenase subunit 6; Provisional
Probab=27.83 E-value=80 Score=23.90 Aligned_cols=23 Identities=22% Similarity=0.755 Sum_probs=17.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH
Q psy6104 33 WWCFLLSSIFTFILGLVIVLLWRLF 57 (118)
Q Consensus 33 ~waFLlSSivTF~gGLliIliwR~i 57 (118)
|+.|++ +..|.||+++++++=.-
T Consensus 48 w~s~il--fli~iGGmlVlF~Y~~s 70 (160)
T MTH00212 48 WYGYIL--FLIYIGGLLVLFGYMVA 70 (160)
T ss_pred HHHHHH--HHHHHhHHHHHHHHHHH
Confidence 666666 67899999999876443
No 25
>KOG4482|consensus
Probab=27.41 E-value=1.1e+02 Score=27.72 Aligned_cols=31 Identities=35% Similarity=0.697 Sum_probs=16.0
Q ss_pred HHHHHHHHHH-HHHHHHHHHHh-hhhccCCCCC
Q psy6104 39 SSIFTFILGL-VIVLLWRLFAF-ICCRKEPELS 69 (118)
Q Consensus 39 SSivTF~gGL-liIliwR~i~~-~cc~k~~~~~ 69 (118)
=+.+||..=+ +.+|+.-++.| +||++|...+
T Consensus 296 df~~tfaIpl~Valll~~~La~imc~rrEg~~~ 328 (449)
T KOG4482|consen 296 DFLHTFAIPLGVALLLVLALAYIMCCRREGQKK 328 (449)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccc
Confidence 3445665422 22333334455 5999876544
No 26
>MTH00152 ND6 NADH dehydrogenase subunit 6; Provisional
Probab=25.84 E-value=91 Score=23.44 Aligned_cols=22 Identities=23% Similarity=0.878 Sum_probs=17.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHH
Q psy6104 33 WWCFLLSSIFTFILGLVIVLLWRL 56 (118)
Q Consensus 33 ~waFLlSSivTF~gGLliIliwR~ 56 (118)
|+.|++ +..|.||+++++++=.
T Consensus 48 w~s~il--fli~iGGmlVlF~Y~~ 69 (163)
T MTH00152 48 WYGYLL--FLVYVGGLLVMFAYVV 69 (163)
T ss_pred HHHHHH--HHHHHhHHHHHHHHHH
Confidence 777765 6789999999987543
No 27
>PF15159 PIG-Y: Phosphatidylinositol N-acetylglucosaminyltransferase subunit Y
Probab=25.72 E-value=1e+02 Score=21.13 Aligned_cols=21 Identities=14% Similarity=0.352 Sum_probs=17.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHH
Q psy6104 34 WCFLLSSIFTFILGLVIVLLW 54 (118)
Q Consensus 34 waFLlSSivTF~gGLliIliw 54 (118)
|++++.+.++|.+++..++.-
T Consensus 4 ~~~il~~~v~fv~~~y~~v~s 24 (72)
T PF15159_consen 4 WLLILFTLVFFVGFFYAAVFS 24 (72)
T ss_pred hHHHHHHHHHHHHHHHHHHHc
Confidence 788999999999988877543
No 28
>PF11188 DUF2975: Protein of unknown function (DUF2975); InterPro: IPR021354 This family of proteins have no known function. Some members are annotated as membrane proteins however this cannot be confirmed.
Probab=25.50 E-value=1.2e+02 Score=20.60 Aligned_cols=25 Identities=28% Similarity=0.502 Sum_probs=20.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH
Q psy6104 34 WCFLLSSIFTFILGLVIVLLWRLFA 58 (118)
Q Consensus 34 waFLlSSivTF~gGLliIliwR~i~ 58 (118)
..+...++..++.|++++++-++++
T Consensus 99 ~~~~~~~~~~~~~gl~~~vla~if~ 123 (136)
T PF11188_consen 99 IFFSGFDFLLLIIGLIILVLAEIFK 123 (136)
T ss_pred HhhhHHHHHHHHHHHHHHHHHHHHH
Confidence 5566677788888899999999886
No 29
>PF11807 DUF3328: Domain of unknown function (DUF3328); InterPro: IPR021765 This family of proteins are functionally uncharacterised. This family is only found in eukaryotes.
Probab=24.48 E-value=1.4e+02 Score=21.42 Aligned_cols=28 Identities=32% Similarity=0.629 Sum_probs=17.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy6104 31 RKWWCFLLSSIFTFILGLVIVLLWRLFA 58 (118)
Q Consensus 31 R~~waFLlSSivTF~gGLliIliwR~i~ 58 (118)
|.|.-.++.+++.++.+++++++++.+.
T Consensus 8 ~~w~~~~~~~~~~l~~~l~~~~~~~~~~ 35 (217)
T PF11807_consen 8 RRWRRLLLLFLLLLILSLLLLVLAVLLR 35 (217)
T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 3444445666666666777777776663
No 30
>cd01356 AcnX_swivel Putative Aconitase X swivel domain. It is predicted by comparative genomic analysis. The proteins are mainly found in archaea and proteobacteria. They are distantly related to Aconitase family of proteins by sequence similarity and seconary structure prediction. The functions have not yet been experimentally characterized. Thus, the prediction should be treated with caution.
Probab=22.90 E-value=44 Score=24.96 Aligned_cols=13 Identities=31% Similarity=0.723 Sum_probs=11.7
Q ss_pred hhhccccceEEEE
Q psy6104 104 ISGQTTTGRILVP 116 (118)
Q Consensus 104 ISaQT~tGRILVV 116 (118)
+.||+..|||||.
T Consensus 37 l~G~si~gkILv~ 49 (123)
T cd01356 37 LYGESIAGKVLVL 49 (123)
T ss_pred cCCCcccceEEEe
Confidence 6899999999985
No 31
>PF05510 Sarcoglycan_2: Sarcoglycan alpha/epsilon; InterPro: IPR008908 Sarcoglycans are a subcomplex of transmembrane proteins which are part of the dystrophin-glycoprotein complex. They are expressed in the skeletal, cardiac and smooth muscle. Although numerous studies have been conducted on the sarcoglycan subcomplex in skeletal and cardiac muscle, the manner of the distribution and localisation of these proteins along the nonjunctional sarcolemma is not clear []. This family contains alpha and epsilon members.; GO: 0016012 sarcoglycan complex
Probab=22.73 E-value=88 Score=27.62 Aligned_cols=29 Identities=34% Similarity=0.789 Sum_probs=15.9
Q ss_pred HHHHHHHHHHHH-HHHHHHHHh-hhhccCCC
Q psy6104 39 SSIFTFILGLVI-VLLWRLFAF-ICCRKEPE 67 (118)
Q Consensus 39 SSivTF~gGLli-IliwR~i~~-~cc~k~~~ 67 (118)
-.++||+.=++| +++.=++.| +|||+|..
T Consensus 284 d~~vtl~iPl~i~llL~llLs~Imc~rREG~ 314 (386)
T PF05510_consen 284 DFLVTLAIPLIIALLLLLLLSYIMCCRREGV 314 (386)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHheechHHh
Confidence 345555543333 444445566 59998654
No 32
>TIGR03017 EpsF chain length determinant protein EpsF. Sequences in this family of proteins are members of the chain length determinant family (pfam02706) which includes the wzc protein from E.coli. This family of proteins are homologous to the EpsF protein of the methanolan biosynthesis operon of Methylobacillus species strain 12S. The distribution of this protein appears to be restricted to a subset of exopolysaccharide operons containing a syntenic grouping of genes including a variant of the EpsH exosortase protein. Exosortase has been proposed to be involved in the targetting and processing of proteins containing the PEP-CTERM domain to the exopolysaccharide layer.
Probab=22.62 E-value=1.2e+02 Score=25.26 Aligned_cols=22 Identities=23% Similarity=0.554 Sum_probs=14.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHH
Q psy6104 31 RKWWCFLLSSIFTFILGLVIVL 52 (118)
Q Consensus 31 R~~waFLlSSivTF~gGLliIl 52 (118)
|.||.+++..+++|.+|++..+
T Consensus 12 rr~~lil~v~~~~~~~~~~~~~ 33 (444)
T TIGR03017 12 ARYWIVLFTLLITVTTTAVVSL 33 (444)
T ss_pred HHHHHHHHHHHHHHHHHHHHHH
Confidence 5566677777777777766554
No 33
>PF07589 VPEP: PEP-CTERM motif; InterPro: IPR013424 This entry describes a 25-residue region including an invariant Pro-Glu-Pro (PEP) motif, a thirteen residue strongly hydrophobic sequence likely to span the membrane, and a five-residue strongly basic motif that often contains four Arg residues. In most cases, this motif is found within nine residues of the C-terminal end of the protein. Proteins containing this motif typically have signal sequences at the N terminus [].
Probab=22.21 E-value=1.4e+02 Score=16.47 Aligned_cols=15 Identities=27% Similarity=0.332 Sum_probs=10.1
Q ss_pred HHHHHHHHHHHHHHH
Q psy6104 39 SSIFTFILGLVIVLL 53 (118)
Q Consensus 39 SSivTF~gGLliIli 53 (118)
||+.-|..|++.+..
T Consensus 6 st~~l~~~gl~~l~~ 20 (25)
T PF07589_consen 6 STLALLGLGLLGLAF 20 (25)
T ss_pred HHHHHHHHHHHHHHH
Confidence 566777777666665
No 34
>PF12729 4HB_MCP_1: Four helix bundle sensory module for signal transduction; InterPro: IPR024478 This entry represents a four-helix bundle that operates as a ubiquitous sensory module in prokaryotic signal-transduction, which is known as four-helix bundles methyl-accepting chemotaxis protein (4HB_MCP) domain. The 4HB_MCP is always found between two predicted transmembrane helices indicating that it detects only extracellular signals. In many cases the domain is associated with a cytoplasmic HAMP domain suggesting that most proteins carrying the bundle might share the mechanism of transmembrane signalling which is well-characterised in E coli chemoreceptors [].
Probab=22.20 E-value=2.5e+02 Score=18.78 Aligned_cols=31 Identities=16% Similarity=0.225 Sum_probs=24.4
Q ss_pred HhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy6104 28 LQERKWWCFLLSSIFTFILGLVIVLLWRLFA 58 (118)
Q Consensus 28 ~~~R~~waFLlSSivTF~gGLliIliwR~i~ 58 (118)
.+.|...+|.+..++..+.|.+-+.-.+-+.
T Consensus 4 I~~KL~~~f~~~~~l~~~~~~~~~~~l~~~~ 34 (181)
T PF12729_consen 4 IRTKLILGFGLIILLLLIVGIVGLYSLSQIN 34 (181)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3678899999999888888887777766653
No 35
>PF13334 DUF4094: Domain of unknown function (DUF4094)
Probab=22.05 E-value=98 Score=21.93 Aligned_cols=11 Identities=27% Similarity=0.612 Sum_probs=9.0
Q ss_pred HHHHHHHHHHH
Q psy6104 40 SIFTFILGLVI 50 (118)
Q Consensus 40 SivTF~gGLli 50 (118)
+++-||+|+++
T Consensus 9 c~~SF~~G~lf 19 (95)
T PF13334_consen 9 CIASFCAGMLF 19 (95)
T ss_pred HHHHHHHHHHH
Confidence 58889999874
No 36
>PF05151 PsbM: Photosystem II reaction centre M protein (PsbM); InterPro: IPR007826 Oxygenic photosynthesis uses two multi-subunit photosystems (I and II) located in the cell membranes of cyanobacteria and in the thylakoid membranes of chloroplasts in plants and algae. Photosystem II (PSII) has a P680 reaction centre containing chlorophyll 'a' that uses light energy to carry out the oxidation (splitting) of water molecules, and to produce ATP via a proton pump. Photosystem I (PSI) has a P700 reaction centre containing chlorophyll that takes the electron and associated hydrogen donated from PSII to reduce NADP+ to NADPH. Both ATP and NADPH are subsequently used in the light-independent reactions to convert carbon dioxide to glucose using the hydrogen atom extracted from water by PSII, releasing oxygen as a by-product. PSII is a multisubunit protein-pigment complex containing polypeptides both intrinsic and extrinsic to the photosynthetic membrane [, ]. Within the core of the complex, the chlorophyll and beta-carotene pigments are mainly bound to the antenna proteins CP43 (PsbC) and CP47 (PsbB), which pass the excitation energy on to the reaction centre proteins D1 (Qb, PsbA) and D2 (Qa, PsbD) that bind all the redox-active cofactors involved in the energy conversion process. The PSII oxygen-evolving complex (OEC) oxidises water to provide protons for use by PSI, and consists of OEE1 (PsbO), OEE2 (PsbP) and OEE3 (PsbQ). The remaining subunits in PSII are of low molecular weight (less than 10 kDa), and are involved in PSII assembly, stabilisation, dimerisation, and photo-protection []. This family represents the low molecular weight transmembrane protein PsbM found in PSII. PsbM is one of the most hydrophobic proteins in the thylakoid membrane. The function of this protein is unknown.; GO: 0015979 photosynthesis, 0019684 photosynthesis, light reaction, 0009523 photosystem II, 0016021 integral to membrane; PDB: 3A0H_m 3ARC_m 3A0B_M 3PRR_M 3PRQ_M 1S5L_M 4FBY_e 3BZ2_M 3BZ1_M 2AXT_M ....
Probab=21.70 E-value=1.9e+02 Score=17.28 Aligned_cols=21 Identities=19% Similarity=0.611 Sum_probs=15.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHH
Q psy6104 34 WCFLLSSIFTFILGLVIVLLW 54 (118)
Q Consensus 34 waFLlSSivTF~gGLliIliw 54 (118)
.+|+++-+++++--.|++.+|
T Consensus 6 l~fiAtaLfi~iPt~FLiily 26 (31)
T PF05151_consen 6 LAFIATALFILIPTAFLIILY 26 (31)
T ss_dssp THHHHHHHHHHHHHHHHHHHH
T ss_pred hhHHHHHHHHHHHHHHHhheE
Confidence 478888888887777766654
No 37
>KOG1500|consensus
Probab=21.51 E-value=47 Score=30.12 Aligned_cols=12 Identities=58% Similarity=0.872 Sum_probs=8.9
Q ss_pred hhccccceEEEE
Q psy6104 105 SGQTTTGRILVP 116 (118)
Q Consensus 105 SaQT~tGRILVV 116 (118)
-|||+|||+|.|
T Consensus 423 aGq~ltGr~~Li 434 (517)
T KOG1500|consen 423 AGQTLTGRLLLI 434 (517)
T ss_pred cCCeeeeeEEEE
Confidence 478888887765
No 38
>TIGR02848 spore_III_AC stage III sporulation protein AC. Members of this protein family are designated SpoIIIAC, part of the spoIIIA operon of sporulation genes whose mutant phenotype is linked to sporulation stage III. Members of this family are encoded by the genome of a species if and only if that species is capable of endospore formation, as in Bacillus subtilis. The molecular function of this small, probable integral membrane protein is unknown.
Probab=21.46 E-value=1e+02 Score=21.07 Aligned_cols=15 Identities=27% Similarity=0.580 Sum_probs=11.5
Q ss_pred HHHHHHHHHHHHHHH
Q psy6104 42 FTFILGLVIVLLWRL 56 (118)
Q Consensus 42 vTF~gGLliIliwR~ 56 (118)
.|.++|++++|.|=+
T Consensus 34 ~~tLaG~iiVL~~Vi 48 (64)
T TIGR02848 34 MVTLAGIVVVLFMVI 48 (64)
T ss_pred HHHHHHHHHHHHHHH
Confidence 466899999998743
No 39
>PTZ00443 Thioredoxin domain-containing protein; Provisional
Probab=21.19 E-value=2.1e+02 Score=22.77 Aligned_cols=23 Identities=30% Similarity=0.564 Sum_probs=15.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Q psy6104 34 WCFLLSSIFTFILGLVIVLLWRL 56 (118)
Q Consensus 34 waFLlSSivTF~gGLliIliwR~ 56 (118)
..|-.=|.++|++|+|+-++.-.
T Consensus 177 ~~~~~~~~~~~~~g~~~~~~~~~ 199 (224)
T PTZ00443 177 AGFFTISSFAFLFGILMGLMIAL 199 (224)
T ss_pred hhHHHHHHHHHHHHHHHHHHHHH
Confidence 34556677789999887766543
No 40
>PF10805 DUF2730: Protein of unknown function (DUF2730); InterPro: IPR020269 This entry represents a family of various hypothetical proteins. The proteins, which include HI1498 and Gp25, from phage Mu, are currently uncharacterised.
Probab=21.10 E-value=1.3e+02 Score=21.26 Aligned_cols=27 Identities=22% Similarity=0.501 Sum_probs=17.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy6104 30 ERKWWCFLLSSIFTFILGLVIVLLWRLF 57 (118)
Q Consensus 30 ~R~~waFLlSSivTF~gGLliIliwR~i 57 (118)
-++||..+. ++++.++|++...+-|.+
T Consensus 5 ~~~~w~ii~-a~~~~~~~~~~~~l~~~~ 31 (106)
T PF10805_consen 5 IKKNWGIIW-AVFGIAGGIFWLWLRRTY 31 (106)
T ss_pred HHhCcHHHH-HHHHHHHHHHHHHHHHhh
Confidence 478999854 455666666666665543
No 41
>PF09301 DUF1970: Domain of unknown function (DUF1970); InterPro: IPR015380 This entry is represented by Bacteriophage PRD1, P16; it is a family of uncharacterised viral proteins.; PDB: 1W8X_P.
Probab=20.30 E-value=88 Score=23.23 Aligned_cols=14 Identities=43% Similarity=0.845 Sum_probs=11.7
Q ss_pred HHHHHHHHHHHHHH
Q psy6104 45 ILGLVIVLLWRLFA 58 (118)
Q Consensus 45 ~gGLliIliwR~i~ 58 (118)
-|||++||||--|+
T Consensus 11 ggglvliliwlwfr 24 (117)
T PF09301_consen 11 GGGLVLILIWLWFR 24 (117)
T ss_dssp HHHHHHHHHHHHHH
T ss_pred cCchhhHHHHHHHc
Confidence 48999999997775
No 42
>PF04689 S1FA: DNA binding protein S1FA; InterPro: IPR006779 S1FA is an unusual small plant peptide of only 70 amino acids with a basic domain which contains a nuclear localization signal and a putative DNA binding helix. S1FA is highly conserved between dicotyledonous and monocotyledonous plants and may be a DNA-binding protein that specifically recognises the negative promoter element S1F [].; GO: 0003677 DNA binding, 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus
Probab=20.30 E-value=1.3e+02 Score=20.95 Aligned_cols=17 Identities=12% Similarity=0.481 Sum_probs=12.4
Q ss_pred HHHHHHHHHHHHHHHHH
Q psy6104 37 LLSSIFTFILGLVIVLL 53 (118)
Q Consensus 37 LlSSivTF~gGLliIli 53 (118)
+..++++|+.|.+++..
T Consensus 20 V~g~ll~flvGnyvlY~ 36 (69)
T PF04689_consen 20 VAGLLLVFLVGNYVLYV 36 (69)
T ss_pred HHHHHHHHHHHHHHHHH
Confidence 45677888888877754
No 43
>PF05767 Pox_A14: Poxvirus virion envelope protein A14; InterPro: IPR008785 This family consists of several Poxvirus virion envelope protein A14-like sequences. A14 is a component of the virion membrane and has been found to be an H1 phosphatase substrate in vivo and in vitro. A14 is hyperphosphorylated on serine residues in the absence of H1 expression [].; GO: 0019031 viral envelope
Probab=20.29 E-value=2.1e+02 Score=20.84 Aligned_cols=26 Identities=38% Similarity=0.588 Sum_probs=19.1
Q ss_pred HHHHHHHHHHHHHHHHHHH--Hhhhhcc
Q psy6104 39 SSIFTFILGLVIVLLWRLF--AFICCRK 64 (118)
Q Consensus 39 SSivTF~gGLliIliwR~i--~~~cc~k 64 (118)
-|++.|..|++++|--=++ +++-|+-
T Consensus 47 lSii~FI~giil~lG~~i~s~ygr~C~~ 74 (92)
T PF05767_consen 47 LSIICFILGIILTLGIVIFSMYGRYCRP 74 (92)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhcCC
Confidence 5899999999888766555 2367753
No 44
>PF00858 ASC: Amiloride-sensitive sodium channel; InterPro: IPR001873 The apical membrane of many tight epithelia contains sodium channels that are primarily characterised by their high affinity to the diuretic blocker amiloride [, , , ]. These channels mediate the first step of active sodium reabsorption essential for the maintenance of body salt and water homeostasis []. In vertebrates, the channels control reabsorption of sodium in kidney, colon, lung and sweat glands; they also play a role in taste perception. Members of the epithelial Na+ channel (ENaC) family fall into four subfamilies, termed alpha, beta, gamma and delta []. The proteins exhibit the same apparent topology, each with two transmembrane (TM) spanning segments, separated by a large extracellular loop. In most ENaC proteins studied to date, the extracellular domains are highly conserved and contain numerous cysteine residues, with flanking C-terminal amphipathic TM regions, postulated to contribute to the formation of the hydrophilic pores of the oligomeric channel protein complexes. It is thought that the well-conserved extracellular domains serve as receptors to control the activities of the channels. Vertebrate ENaC proteins are similar to degenerins of Caenorhabditis elegans []: deg-1, del-1, mec-4, mec-10 and unc-8. These proteins can be mutated to cause neuronal degradation, and are also thought to form sodium channels. Structurally, the proteins that belong to this family consist of about 510 to 920 amino acid residues. They are made of an intracellular N terminus region followed by a transmembrane domain, a large extracellular loop, a second transmembrane segment and a C-terminal intracellular tail [].; GO: 0005272 sodium channel activity, 0006814 sodium ion transport, 0016020 membrane; PDB: 2QTS_B 3S3W_C 3IJ4_A 3S3X_A 3HGC_A 2K2B_A.
Probab=20.16 E-value=1.7e+02 Score=23.41 Aligned_cols=30 Identities=20% Similarity=0.476 Sum_probs=18.6
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy6104 29 QERKWWCFLLSSIFTFILGLVIVLLWRLFA 58 (118)
Q Consensus 29 ~~R~~waFLlSSivTF~gGLliIliwR~i~ 58 (118)
-+|..|++++.+.+.++.-.+..++-|...
T Consensus 21 ~~R~~W~~~~~~~~~~~~~~~~~~~~~y~~ 50 (439)
T PF00858_consen 21 FERLFWLLVVVVSFILFIYQIYLLIERYLS 50 (439)
T ss_dssp HCHHHHHHHHHHHHHHHHHHHHHHHHHHTT
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHhcc
Confidence 357788877766666655555555555443
Done!