Query 027616
Match_columns 221
No_of_seqs 64 out of 66
Neff 2.1
Searched_HMMs 46136
Date Fri Mar 29 12:37:30 2013
Command hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/027616.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/027616hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF10716 NdhL: NADH dehydrogen 100.0 1.7E-40 3.8E-45 251.4 10.1 81 95-189 1-81 (81)
2 PRK13455 F0F1 ATP synthase sub 75.0 8.1 0.00018 31.6 5.4 50 101-151 7-59 (184)
3 PRK10921 twin-arginine protein 73.4 16 0.00034 32.3 7.1 60 122-181 112-174 (258)
4 PF06550 DUF1119: Protein of u 71.4 27 0.00058 32.4 8.4 73 93-172 5-79 (283)
5 PF08560 DUF1757: Protein of u 69.1 13 0.00027 31.2 5.3 64 93-156 33-104 (155)
6 PF10063 DUF2301: Uncharacteri 68.3 25 0.00054 29.4 6.8 55 117-171 45-114 (135)
7 PRK08475 F0F1 ATP synthase sub 67.9 12 0.00025 30.7 4.8 49 103-151 6-54 (167)
8 PF09991 DUF2232: Predicted me 67.2 28 0.0006 28.7 6.9 49 127-177 234-282 (290)
9 CHL00182 tatC Sec-independent 64.8 30 0.00065 30.5 7.1 60 122-181 122-184 (249)
10 PF13994 PgaD: PgaD-like prote 61.8 34 0.00073 27.3 6.3 64 121-184 14-87 (138)
11 TIGR01912 TatC-Arch Twin argin 59.4 29 0.00064 30.2 6.0 60 122-181 109-172 (237)
12 PF07264 EI24: Etoposide-induc 55.9 36 0.00079 27.3 5.6 42 142-183 159-215 (219)
13 TIGR00945 tatC Twin arginine t 53.9 74 0.0016 27.0 7.4 60 122-181 101-163 (215)
14 PF10260 SAYSvFN: Uncharacteri 51.1 19 0.00041 27.2 3.1 32 164-198 18-49 (71)
15 KOG3114 Uncharacterized conser 50.9 30 0.00066 32.1 4.9 49 129-177 159-213 (290)
16 PF08019 DUF1705: Domain of un 49.5 1.3E+02 0.0029 24.0 7.8 57 117-173 55-113 (156)
17 PF11893 DUF3413: Domain of un 49.2 1.2E+02 0.0026 26.8 8.1 63 137-200 145-211 (253)
18 PF00902 TatC: Sec-independent 48.2 96 0.0021 26.0 7.1 60 122-181 105-168 (215)
19 PF01102 Glycophorin_A: Glycop 47.7 36 0.00077 27.8 4.4 26 124-151 66-91 (122)
20 PF05232 BTP: Bacterial Transm 47.2 37 0.00081 24.6 4.0 55 91-155 9-64 (67)
21 PF14798 Ca_hom_mod: Calcium h 40.5 17 0.00036 32.5 1.6 26 160-185 47-74 (251)
22 PF05934 MCLC: Mid-1-related c 39.5 88 0.0019 31.7 6.4 26 141-168 196-221 (549)
23 PF07123 PsbW: Photosystem II 39.2 47 0.001 28.2 3.9 32 106-138 79-123 (138)
24 TIGR02484 CitB CitB domain pro 39.0 72 0.0016 30.5 5.6 60 118-178 246-312 (372)
25 TIGR01433 CyoA cytochrome o ub 38.8 24 0.00052 30.6 2.3 76 101-177 9-99 (226)
26 PHA02702 ORF033 IMV membrane p 38.5 75 0.0016 24.9 4.6 48 126-177 15-64 (78)
27 PF04632 FUSC: Fusaric acid re 37.7 59 0.0013 30.4 4.7 20 157-176 416-435 (650)
28 PLN00077 photosystem II reacti 37.2 68 0.0015 27.1 4.5 10 103-112 65-74 (128)
29 PRK13275 mtrF tetrahydromethan 36.9 65 0.0014 24.3 4.0 27 122-148 40-66 (67)
30 PF04612 T2SM: Type II secreti 36.0 12 0.00026 29.1 0.0 43 145-187 1-46 (160)
31 PRK11560 phosphoethanolamine t 35.4 77 0.0017 31.2 5.3 52 118-170 111-167 (558)
32 TIGR01478 STEVOR variant surfa 35.0 44 0.00095 31.4 3.4 20 132-151 266-285 (295)
33 PTZ00370 STEVOR; Provisional 34.8 44 0.00096 31.3 3.4 20 132-151 262-281 (296)
34 TIGR01432 QOXA cytochrome aa3 34.2 1.4E+02 0.0029 25.5 6.0 24 156-179 65-92 (217)
35 KOG4040 NADH:ubiquinone oxidor 33.4 65 0.0014 28.5 4.0 69 140-220 105-179 (186)
36 PHA02974 putative IMV membrane 33.0 1.3E+02 0.0028 23.7 5.2 51 126-177 16-68 (81)
37 PF05529 Bap31: B-cell recepto 32.3 2.3E+02 0.0049 23.2 6.8 63 122-186 6-72 (192)
38 PRK14584 hmsS hemin storage sy 31.4 1.9E+02 0.0042 24.7 6.4 60 121-183 18-85 (153)
39 COG2194 Predicted membrane-ass 31.3 2.4E+02 0.0053 28.0 8.0 35 118-152 107-141 (555)
40 PLN02755 complex I subunit 30.9 28 0.00062 26.7 1.3 20 130-149 34-53 (71)
41 PRK09173 F0F1 ATP synthase sub 30.9 1.1E+02 0.0023 24.3 4.6 31 121-151 3-34 (159)
42 PRK00068 hypothetical protein; 30.6 1.7E+02 0.0037 31.3 7.2 58 131-188 22-88 (970)
43 PF06814 Lung_7-TM_R: Lung sev 29.2 2.6E+02 0.0056 24.3 7.0 32 149-190 264-295 (295)
44 PF09972 DUF2207: Predicted me 29.2 57 0.0012 28.8 3.0 41 170-210 236-280 (511)
45 TIGR01167 LPXTG_anchor LPXTG-m 28.9 83 0.0018 19.0 2.9 6 110-115 2-7 (34)
46 PF07136 DUF1385: Protein of u 28.2 3.2E+02 0.007 24.6 7.6 32 117-148 38-69 (236)
47 PF07760 DUF1616: Protein of u 27.8 1E+02 0.0022 27.2 4.3 45 129-173 26-72 (287)
48 PF12292 DUF3624: Protein of u 27.6 1.1E+02 0.0024 23.7 4.0 34 117-150 42-75 (77)
49 TIGR02507 MtrF tetrahydrometha 26.7 1.1E+02 0.0023 23.2 3.6 26 121-146 39-64 (65)
50 PLN00092 photosystem I reactio 26.6 1.1E+02 0.0024 26.1 4.1 9 104-112 76-84 (137)
51 PRK10263 DNA translocase FtsK; 25.8 2.1E+02 0.0046 31.9 7.0 47 107-154 48-100 (1355)
52 PF09624 DUF2393: Protein of u 25.4 1.2E+02 0.0027 23.7 4.1 27 125-151 18-44 (149)
53 PF09472 MtrF: Tetrahydrometha 25.4 89 0.0019 23.3 3.0 24 123-146 41-64 (64)
54 TIGR03469 HonB hopene-associat 25.3 1.5E+02 0.0032 26.5 4.9 36 149-185 337-372 (384)
55 PF04144 SCAMP: SCAMP family; 25.2 1.9E+02 0.0041 24.1 5.3 40 131-170 69-108 (177)
56 PRK14475 F0F1 ATP synthase sub 25.1 1.9E+02 0.0041 23.5 5.1 32 120-151 9-42 (167)
57 PF13858 DUF4199: Protein of u 24.1 3.4E+02 0.0074 21.0 7.8 38 139-176 38-81 (163)
58 PRK01026 tetrahydromethanopter 23.6 1.6E+02 0.0034 23.0 4.1 21 125-145 50-70 (77)
59 PF01529 zf-DHHC: DHHC palmito 23.5 2.9E+02 0.0062 21.3 5.7 27 112-140 85-111 (174)
60 PF15102 TMEM154: TMEM154 prot 22.9 14 0.0003 31.3 -1.8 48 103-153 38-86 (146)
61 PRK14585 pgaD putative PGA bio 22.7 4.1E+02 0.0088 22.6 6.8 55 128-184 23-78 (137)
62 PRK12438 hypothetical protein; 22.6 2.9E+02 0.0063 29.8 7.2 51 137-187 30-89 (991)
63 TIGR03426 shape_MreD rod shape 22.6 1.8E+02 0.0039 22.4 4.4 41 131-171 67-109 (154)
64 PRK14740 kdbF potassium-transp 22.5 1.3E+02 0.0027 19.9 2.9 18 123-140 5-22 (29)
65 KOG0812 SNARE protein SED5/Syn 22.1 82 0.0018 29.9 2.8 23 149-171 287-309 (311)
66 PF15071 TMEM220: Transmembran 21.4 1.6E+02 0.0035 22.8 3.9 36 115-150 8-43 (104)
67 PRK06568 F0F1 ATP synthase sub 21.1 2.1E+02 0.0045 24.0 4.7 33 120-152 5-37 (154)
68 PF05884 ZYG-11_interact: Inte 20.7 1.8E+02 0.0039 27.4 4.7 48 129-180 138-185 (299)
69 KOG1311 DHHC-type Zn-finger pr 20.3 4.7E+02 0.01 22.7 7.0 68 112-181 150-228 (299)
70 PF11833 DUF3353: Protein of u 20.1 5.6E+02 0.012 22.1 7.3 22 147-171 131-152 (194)
No 1
>PF10716 NdhL: NADH dehydrogenase transmembrane subunit; InterPro: IPR019654 NAD(P)H-quinone oxidoreductase subunit L (NdhL) is a component of the NDH-1L complex that is one of the proton-pumping NADH:ubiquinone oxidoreductases that catalyse the electron transfer from NADH to ubiquinone linked with proton translocation across the membrane. NDH-1L is essential for photoheterotrophic cell growth. NdhL appears to contain two transmembrane helices and is necessary for the functioning, though not the correct assembly, of the NDH-1 complex in Synechocystis 6803. The conservation between cyanobacteria and green plants suggests that chloroplast NDH-1 complexes contain related subunits.; GO: 0016655 oxidoreductase activity, acting on NADH or NADPH, quinone or similar compound as acceptor, 0055114 oxidation-reduction process
Probab=100.00 E-value=1.7e-40 Score=251.40 Aligned_cols=81 Identities=43% Similarity=0.874 Sum_probs=77.8
Q ss_pred hhHHHHHHHhhcCCceeeecccCCchhHHHHHHHHHHHHHHHHHhHhHHHHHHHHHHHHhhhhHHHHHHHHHHHHhhhhh
Q 027616 95 IQAGAVLLATLEQPALAVTGENNHEIDLTVALIKVGIIAFWYFLIMPPIIMNWLRVRWYKRKLFEMYVQFMFVFMFFPGL 174 (221)
Q Consensus 95 lq~Ga~llA~~e~PAlAvtg~~n~~~D~l~Vll~Y~~La~~YLLVvP~~ly~wLn~RWy~~~~~ER~~mY~LVF~FFPGl 174 (221)
.||||++ +++ ++|+++|+++|++++|+||+|+|+++|+|||+|||+||++||++||||||+|||||
T Consensus 1 m~~~~l~-~~i-------------~~~~l~vl~~y~~l~~~YLlVvP~~l~~wm~~RWy~~~~~Er~~~y~lvF~FFPGl 66 (81)
T PF10716_consen 1 MQCGALL-SSI-------------PSDTLLVLLAYAALAGLYLLVVPLILYFWMNKRWYVMSSFERLFMYFLVFLFFPGL 66 (81)
T ss_pred CcHHHHH-HHc-------------chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHH
Confidence 3899999 655 78999999999999999999999999999999999999999999999999999999
Q ss_pred HhhhhccCcccCCCC
Q 027616 175 LLWAPFLNFRKLPRD 189 (221)
Q Consensus 175 lL~APFLNFR~~pR~ 189 (221)
+|||||+|||++|||
T Consensus 67 lL~aPFlNfR~~~r~ 81 (81)
T PF10716_consen 67 LLLAPFLNFRPKPRQ 81 (81)
T ss_pred HHHhhhcCCCCCCCC
Confidence 999999999999997
No 2
>PRK13455 F0F1 ATP synthase subunit B; Provisional
Probab=74.98 E-value=8.1 Score=31.59 Aligned_cols=50 Identities=24% Similarity=0.266 Sum_probs=27.4
Q ss_pred HHHhhcCCceeeecccC--CchhHHHHHHHHHHHHH-HHHHhHhHHHHHHHHHH
Q 027616 101 LLATLEQPALAVTGENN--HEIDLTVALIKVGIIAF-WYFLIMPPIIMNWLRVR 151 (221)
Q Consensus 101 llA~~e~PAlAvtg~~n--~~~D~l~Vll~Y~~La~-~YLLVvP~~ly~wLn~R 151 (221)
++.....+|+|.+|.-- |.-+++ .++.++++.+ ++.++.|+.+..+|..|
T Consensus 7 ~~~~~~~~~~~~~g~~~~~~~t~~~-~~inflil~~iL~~f~~~~~v~~~L~~R 59 (184)
T PRK13455 7 LAALAASPALAAGGPFFSLSNTDFV-VTLAFLLFIGILVYFKVPGMIGGMLDKR 59 (184)
T ss_pred HHHHccchHhhcCCCCCCCcchHHH-HHHHHHHHHHHHHHHhccHHHHHHHHHH
Confidence 33555566888877521 233443 3344444443 44445677777777776
No 3
>PRK10921 twin-arginine protein translocation system subunit TatC; Provisional
Probab=73.35 E-value=16 Score=32.30 Aligned_cols=60 Identities=17% Similarity=0.232 Sum_probs=44.3
Q ss_pred HHHHHHHHHHHHHHHHHhHhHHHHHHHHHHHH---hhhhHHHHHHHHHHHHhhhhhHhhhhcc
Q 027616 122 LTVALIKVGIIAFWYFLIMPPIIMNWLRVRWY---KRKLFEMYVQFMFVFMFFPGLLLWAPFL 181 (221)
Q Consensus 122 ~l~Vll~Y~~La~~YLLVvP~~ly~wLn~RWy---~~~~~ER~~mY~LVF~FFPGllL~APFL 181 (221)
...+++.++++++.|++|+|.++-+-++-.-- ..-+++.|+.|.+.+++--|+..--|++
T Consensus 112 ~~s~~LF~~G~~f~y~~vlP~~~~Fl~~f~~~~~~~~~~i~~Y~~fv~~~~l~fGl~FelPli 174 (258)
T PRK10921 112 VSSSLLFYIGMAFAYFVVFPLAFGFLAKTAPEGVQVSTDIASYLSFVMALFMAFGVSFEVPVA 174 (258)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 44567788889999999999998887764221 3346888888888887777776666654
No 4
>PF06550 DUF1119: Protein of unknown function (DUF1119); InterPro: IPR010545 This family consists of several hypothetical archaeal proteins of unknown function.
Probab=71.38 E-value=27 Score=32.41 Aligned_cols=73 Identities=19% Similarity=0.245 Sum_probs=40.7
Q ss_pred hhhhHHHHHHHhhcCCceeeecccCCchhHHHHHHHHHHHHHHHHHhHhHHHHHHHH--HHHHhhhhHHHHHHHHHHHHh
Q 027616 93 LGIQAGAVLLATLEQPALAVTGENNHEIDLTVALIKVGIIAFWYFLIMPPIIMNWLR--VRWYKRKLFEMYVQFMFVFMF 170 (221)
Q Consensus 93 lalq~Ga~llA~~e~PAlAvtg~~n~~~D~l~Vll~Y~~La~~YLLVvP~~ly~wLn--~RWy~~~~~ER~~mY~LVF~F 170 (221)
++.|+||++|+..=..+=--.-| ||+|.+.-+. |.++. +++..++...++ .+|-.+.-+.=...+...+.|
T Consensus 5 l~vql~Al~L~~~~~~~~~~a~e--dP~~~~Nsl~-YI~~i----L~fT~~mL~~ik~~~~~~I~~ii~~~i~~~~~YVf 77 (283)
T PF06550_consen 5 LIVQLGALLLVPPFEEAGYQAFE--DPSSPSNSLY-YIIAI----LVFTAFMLLAIKYGKKWIIRLIIYLAIFLTIFYVF 77 (283)
T ss_pred HHHHHHHHHHcCchhhcCCeeec--CCCCchHHHH-HHHHH----HHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHH
Confidence 67899999865421111001223 7888777655 44333 444544444444 367777777666655555555
Q ss_pred hh
Q 027616 171 FP 172 (221)
Q Consensus 171 FP 172 (221)
++
T Consensus 78 ~~ 79 (283)
T PF06550_consen 78 SA 79 (283)
T ss_pred HH
Confidence 54
No 5
>PF08560 DUF1757: Protein of unknown function (DUF1757); InterPro: IPR013869 This entry shows proteins that are about 150 amino acids in length and have no known function.
Probab=69.07 E-value=13 Score=31.18 Aligned_cols=64 Identities=20% Similarity=0.174 Sum_probs=42.2
Q ss_pred hhhhHHHHHHHhhcCCce-eeecccCCchhHHHHHHHHHHHHHH-HHHhHhHHHHHHHHH------HHHhhh
Q 027616 93 LGIQAGAVLLATLEQPAL-AVTGENNHEIDLTVALIKVGIIAFW-YFLIMPPIIMNWLRV------RWYKRK 156 (221)
Q Consensus 93 lalq~Ga~llA~~e~PAl-Avtg~~n~~~D~l~Vll~Y~~La~~-YLLVvP~~ly~wLn~------RWy~~~ 156 (221)
-++|+|+++=+.+.+|.. +....+.++++++--..-++..+++ =+++.|.+.|..|+. .|+.+.
T Consensus 33 k~~q~gs~lGsl~~~Pi~~~~~~~~~~~~~~~~~~~~~~~~G~l~G~~~gp~m~~~rmr~~~~~~~e~~DR~ 104 (155)
T PF08560_consen 33 KGAQAGSFLGSLIVGPIYRLLKQPRLNPKELTNRFVKGGRNGALAGAVLGPVMTYARMRGSSLEEIELQDRC 104 (155)
T ss_pred HHHHHHHHHHHHHhHHHHHHHhCccccHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHhccccchHHHHHHH
Confidence 368999999555558954 4555533677776554444433322 357889999999988 677664
No 6
>PF10063 DUF2301: Uncharacterized integral membrane protein (DUF2301); InterPro: IPR019275 This family contains uncharacterised integral membrane proteins.
Probab=68.31 E-value=25 Score=29.41 Aligned_cols=55 Identities=11% Similarity=0.153 Sum_probs=37.5
Q ss_pred CCchhHHHHHHHHHHHHHHHH---------------HhHhHHHHHHHHHHHHhhhhHHHHHHHHHHHHhh
Q 027616 117 NHEIDLTVALIKVGIIAFWYF---------------LIMPPIIMNWLRVRWYKRKLFEMYVQFMFVFMFF 171 (221)
Q Consensus 117 n~~~D~l~Vll~Y~~La~~YL---------------LVvP~~ly~wLn~RWy~~~~~ER~~mY~LVF~FF 171 (221)
|+|..++.|...+++++|+++ +++|.++.-.+-.-|......--+...+..|+.|
T Consensus 45 ~~P~~~l~vG~~FaaLtGi~fKE~FCF~~~e~~~l~~llp~llLghl~g~~~~~~~~~ll~~~~~L~~i~ 114 (135)
T PF10063_consen 45 GQPLWLLAVGPLFAALTGIAFKEYFCFRRPEAKLLTFLLPLLLLGHLFGLLPASVELALLGIWALLFLIF 114 (135)
T ss_pred cCccHHHHHHHHHHHHHhHHhhchhhhhhHHHhhHHHHHHHHHHHHHHCCCcHHHHHHHHHHHHHHHHHH
Confidence 589999999999999999986 5677776666655555444443333335555444
No 7
>PRK08475 F0F1 ATP synthase subunit B; Validated
Probab=67.87 E-value=12 Score=30.72 Aligned_cols=49 Identities=16% Similarity=0.122 Sum_probs=26.1
Q ss_pred HhhcCCceeeecccCCchhHHHHHHHHHHHHHHHHHhHhHHHHHHHHHH
Q 027616 103 ATLEQPALAVTGENNHEIDLTVALIKVGIIAFWYFLIMPPIIMNWLRVR 151 (221)
Q Consensus 103 A~~e~PAlAvtg~~n~~~D~l~Vll~Y~~La~~YLLVvP~~ly~wLn~R 151 (221)
+....+|+|-+|+-.|..|+++.++.++++.++--.++.--+...|..|
T Consensus 6 ~~~~~~a~~~~~~~~~~~~~~~~~inflil~~lL~~fl~kPi~~~l~~R 54 (167)
T PRK08475 6 LLLGFYAFAASLGATEQYDIIERTINFLIFVGILWYFAAKPLKNFYKSR 54 (167)
T ss_pred HHHHHHHHHcccCCcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3334445555676445577777776666654433333333344555555
No 8
>PF09991 DUF2232: Predicted membrane protein (DUF2232); InterPro: IPR018710 This family of bacterial and eukaryotic proteins has no known function; however, this signature belongs to a Pfam Gx transporter clan.
Probab=67.22 E-value=28 Score=28.72 Aligned_cols=49 Identities=16% Similarity=0.320 Sum_probs=33.7
Q ss_pred HHHHHHHHHHHHhHhHHHHHHHHHHHHhhhhHHHHHHHHHHHHhhhhhHhh
Q 027616 127 IKVGIIAFWYFLIMPPIIMNWLRVRWYKRKLFEMYVQFMFVFMFFPGLLLW 177 (221)
Q Consensus 127 l~Y~~La~~YLLVvP~~ly~wLn~RWy~~~~~ER~~mY~LVF~FFPGllL~ 177 (221)
-...++.++|++--=.++.+|+++| +.+++=|.+.|.+++++.+...++
T Consensus 234 Nl~~v~~~l~~~qGla~~~~~~~~~--~~~~~~~~l~~~~~i~~~~~~~~l 282 (290)
T PF09991_consen 234 NLLIVLSFLFFIQGLAVIHFFLKRR--KMSKFLRVLLYILLILFPFLIVIL 282 (290)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHc--CCcHHHHHHHHHHHHHHHHHHHHH
Confidence 3455666677766656666676666 777777999998888776555443
No 9
>CHL00182 tatC Sec-independent translocase component C; Provisional
Probab=64.83 E-value=30 Score=30.45 Aligned_cols=60 Identities=20% Similarity=0.400 Sum_probs=45.6
Q ss_pred HHHHHHHHHHHHHHHHHhHhHHHHHHHHHHHH---hhhhHHHHHHHHHHHHhhhhhHhhhhcc
Q 027616 122 LTVALIKVGIIAFWYFLIMPPIIMNWLRVRWY---KRKLFEMYVQFMFVFMFFPGLLLWAPFL 181 (221)
Q Consensus 122 ~l~Vll~Y~~La~~YLLVvP~~ly~wLn~RWy---~~~~~ER~~mY~LVF~FFPGllL~APFL 181 (221)
...+++-+++.++.|.+|+|.++-+-++-.-- ..-++..|+.|.+.+++.-|+..--|++
T Consensus 122 ~~s~~lF~~G~~f~y~vvlP~~~~Fl~~f~~~~~~~~~~i~~Yl~f~~~~~l~fGl~FelPvi 184 (249)
T CHL00182 122 ISSLVLFGLGLIFAYFVLVPAALNFFINYGSDVVEPLWSFDQYFDFILVLFFSTGLAFQIPII 184 (249)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhchhhhhhhccHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 45667788899999999999999887763211 2236888999998888888888776654
No 10
>PF13994 PgaD: PgaD-like protein
Probab=61.83 E-value=34 Score=27.27 Aligned_cols=64 Identities=9% Similarity=0.099 Sum_probs=41.3
Q ss_pred hHHHHHHHHHHHHHHHHHhHhH--HHHHHHHHHHH--------hhhhHHHHHHHHHHHHhhhhhHhhhhccCcc
Q 027616 121 DLTVALIKVGIIAFWYFLIMPP--IIMNWLRVRWY--------KRKLFEMYVQFMFVFMFFPGLLLWAPFLNFR 184 (221)
Q Consensus 121 D~l~Vll~Y~~La~~YLLVvP~--~ly~wLn~RWy--------~~~~~ER~~mY~LVF~FFPGllL~APFLNFR 184 (221)
=+...+++.++=+++-.|..|+ .++..++.+=. ..+..+++.+|.++.++..++++.==.+|-|
T Consensus 14 r~~~~~lT~~~W~~~~yL~~pl~~ll~~ll~~~~~~~~~~~~~~~~~~~~l~~y~~i~~~~a~~Li~Wa~yn~~ 87 (138)
T PF13994_consen 14 RLIDYFLTLLFWGGFIYLWRPLLTLLAWLLGLHLFYPQMSLGGFLSSLNTLQIYLLIALVNAVILILWAKYNRL 87 (138)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccccccchhhhcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3444455555555555556676 33333332222 2678999999999999999988877777743
No 11
>TIGR01912 TatC-Arch Twin arginine targeting (Tat) protein translocase TatC, Archaeal clade. This model represents the TatC translocase component of the Sec-independent protein translocation system. This system is responsible for translocation of folded proteins, often with bound cofactors across the periplasmic membrane. A related model (TIGR00945) represents the bacterial clade of this family. TatC is often found (in bacteria) in a gene cluster with the two other components of the system, TatA/E (TIGR01411) and TatB (TIGR01410). A model also exists for the Twin-arginine signal sequence (TIGR01409).
Probab=59.37 E-value=29 Score=30.16 Aligned_cols=60 Identities=17% Similarity=0.196 Sum_probs=45.7
Q ss_pred HHHHHHHHHHHHHHHHHhHhHHHHHHHHHHH-H---hhhhHHHHHHHHHHHHhhhhhHhhhhcc
Q 027616 122 LTVALIKVGIIAFWYFLIMPPIIMNWLRVRW-Y---KRKLFEMYVQFMFVFMFFPGLLLWAPFL 181 (221)
Q Consensus 122 ~l~Vll~Y~~La~~YLLVvP~~ly~wLn~RW-y---~~~~~ER~~mY~LVF~FFPGllL~APFL 181 (221)
....++-+++.++.|.+|+|.++-+-++--- . ..-+++.|+.|.+.+++--|+..--|++
T Consensus 109 ~~~~~lF~~G~~f~y~~vlP~~~~f~~~f~~~~~~~~~~~i~~Y~~f~~~~~~~fGl~FelPvv 172 (237)
T TIGR01912 109 VIAVGLFAFGALFAYWVIFPLIFQILFEFASPLGLSAIMDIRKYTSFALKLILSFGLAFETPVV 172 (237)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHcccccccceeecHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4566777888899999999999998876421 1 1227889999988888888887777754
No 12
>PF07264 EI24: Etoposide-induced protein 2.4 (EI24); PDB: 3TX3_B.
Probab=55.87 E-value=36 Score=27.30 Aligned_cols=42 Identities=24% Similarity=0.483 Sum_probs=25.6
Q ss_pred HHHHHHHHHH-------HHhh--------hhHHHHHHHHHHHHhhhhhHhhhhccCc
Q 027616 142 PIIMNWLRVR-------WYKR--------KLFEMYVQFMFVFMFFPGLLLWAPFLNF 183 (221)
Q Consensus 142 ~~ly~wLn~R-------Wy~~--------~~~ER~~mY~LVF~FFPGllL~APFLNF 183 (221)
.++++|++.+ |+.+ ..+|+.-.|++.|=+.=.+++.-|++|+
T Consensus 159 ~~~~~~l~~~~~~~e~~~~~~~~~~~er~~~~~~~~~~~~gfG~~~~ll~~IP~~~l 215 (219)
T PF07264_consen 159 FVLWFWLNAYFLGFEYLWSSLGRSFEERKRFLERNRGYFLGFGLPFALLLLIPLVNL 215 (219)
T ss_dssp -HHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHCTHHHHHHHHHHHHHHTTSCCHHC
T ss_pred HHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHH
Confidence 4667777765 4444 3444445566666666666777888775
No 13
>TIGR00945 tatC Twin arginine targeting (Tat) protein translocase TatC. This model represents the TatC translocase component of the Sec-independent protein translocation system. This system is responsible for translocation of folded proteins, often with bound cofactors across the periplasmic membrane. A related model (TIGR01912) represents the archaeal clade of this family. TatC is often found in a gene cluster with the two other components of the system, TatA/E (TIGR01411) and TatB (TIGR01410). A model also exists for the Twin-arginine signal sequence (TIGR01409).
Probab=53.90 E-value=74 Score=27.00 Aligned_cols=60 Identities=22% Similarity=0.365 Sum_probs=41.3
Q ss_pred HHHHHHHHHHHHHHHHHhHhHHHHHHHHHHHH---hhhhHHHHHHHHHHHHhhhhhHhhhhcc
Q 027616 122 LTVALIKVGIIAFWYFLIMPPIIMNWLRVRWY---KRKLFEMYVQFMFVFMFFPGLLLWAPFL 181 (221)
Q Consensus 122 ~l~Vll~Y~~La~~YLLVvP~~ly~wLn~RWy---~~~~~ER~~mY~LVF~FFPGllL~APFL 181 (221)
...+++-.+++++.|.+|+|.++-+-++-.-- ..-+++.|+.+.+.+++.=|+..--|.+
T Consensus 101 ~~~~~lF~~G~~f~y~~vlP~~~~F~~~~~~~~~~~~~~i~~y~~f~~~~~l~fGl~FqlPli 163 (215)
T TIGR00945 101 LGSILLFLAGLAFAYYVLFPAALNFLLTYGADVVEILLSIDQYFEFVLKLLFSFGVAFQVPVL 163 (215)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhccHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 44566677888899999999999887764322 2335778877777776666665554443
No 14
>PF10260 SAYSvFN: Uncharacterized conserved domain (SAYSvFN); InterPro: IPR019387 This domain of approximately 75 residues contains a highly conserved SATSv/iFN motif. The function is unknown but the domain is conserved from plants to humans.
Probab=51.07 E-value=19 Score=27.16 Aligned_cols=32 Identities=31% Similarity=0.617 Sum_probs=25.2
Q ss_pred HHHHHHhhhhhHhhhhccCcccCCCCCCCCCCCCC
Q 027616 164 FMFVFMFFPGLLLWAPFLNFRKLPRDPSMKAPWDT 198 (221)
Q Consensus 164 Y~LVF~FFPGllL~APFLNFR~~pR~psmkyPWs~ 198 (221)
|+.||+.+-|+.++ |.|+|. +|++...=.||.
T Consensus 18 fG~vf~i~s~f~~I--~~Nl~~-~r~~ge~SAYSV 49 (71)
T PF10260_consen 18 FGPVFFILSGFYLI--FTNLGT-PRKPGELSAYSV 49 (71)
T ss_pred hhHHHHHHHHHHHH--HHcCCC-CCCCCCccchhh
Confidence 78888888888877 889999 888776655554
No 15
>KOG3114 consensus Uncharacterized conserved protein [Function unknown]
Probab=50.92 E-value=30 Score=32.11 Aligned_cols=49 Identities=27% Similarity=0.587 Sum_probs=35.9
Q ss_pred HHHHHHHHHHhHhHHHHHHHHHHHHhh-----hhHHHHHHHHH-HHHhhhhhHhh
Q 027616 129 VGIIAFWYFLIMPPIIMNWLRVRWYKR-----KLFEMYVQFMF-VFMFFPGLLLW 177 (221)
Q Consensus 129 Y~~La~~YLLVvP~~ly~wLn~RWy~~-----~~~ER~~mY~L-VF~FFPGllL~ 177 (221)
-+.+..+|+..+|+++|-.|+.|=|.+ ..+|.++.|+- .|+|||-+++|
T Consensus 159 aa~~iy~Y~~ivp~~l~~iL~~~~~~~~~~~~~l~~~~~iygysl~i~ip~~vl~ 213 (290)
T KOG3114|consen 159 AATLIYGYLTIVPLALWGILSWNGYSLLLHCYVLLELVCIYGYSLFIFIPLLVLW 213 (290)
T ss_pred HHHHHHHHHHHHHHHHHHHHHhccccccccceehhhHHHHHhhHHHHHHHHHHHH
Confidence 345667899999999988765444433 35889999986 68899966655
No 16
>PF08019 DUF1705: Domain of unknown function (DUF1705); InterPro: IPR012549 Some members of this family are putative bacterial membrane proteins. This domain is found immediately N-terminal to the sulphatase domain in many sulphatases.; GO: 0016021 integral to membrane
Probab=49.49 E-value=1.3e+02 Score=24.00 Aligned_cols=57 Identities=14% Similarity=0.335 Sum_probs=37.2
Q ss_pred CCchhHHHHHHHHHHHHHHHHHhHhHHHHHHHHHH--HHhhhhHHHHHHHHHHHHhhhh
Q 027616 117 NHEIDLTVALIKVGIIAFWYFLIMPPIIMNWLRVR--WYKRKLFEMYVQFMFVFMFFPG 173 (221)
Q Consensus 117 n~~~D~l~Vll~Y~~La~~YLLVvP~~ly~wLn~R--Wy~~~~~ER~~mY~LVF~FFPG 173 (221)
.|+.|....+-...++-++.+.++|.++.++++.+ =..+....|.....+..+.+-|
T Consensus 55 Tn~~Ea~ells~~~~~~~l~~~vlP~~~l~~~~i~~~~~~~~~~~r~~~~~~~l~~~~~ 113 (156)
T PF08019_consen 55 TNTAEASELLSWKLILWLLLLGVLPALLLWRVRIKKRSWKRELLRRLLLILLSLLVIAG 113 (156)
T ss_pred cCHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHH
Confidence 47777777788888888999999999888887651 1123444555444444443333
No 17
>PF11893 DUF3413: Domain of unknown function (DUF3413); InterPro: IPR024588 This entry represents an uncharacterised domain found in the N-terminal of predicted HI0842 type membrane sulphatases.
Probab=49.18 E-value=1.2e+02 Score=26.79 Aligned_cols=63 Identities=22% Similarity=0.440 Sum_probs=38.4
Q ss_pred HHhHhHHHHHHHHH--HHHhhhhHHHHHH--HHHHHHhhhhhHhhhhccCcccCCCCCCCCCCCCCCC
Q 027616 137 FLIMPPIIMNWLRV--RWYKRKLFEMYVQ--FMFVFMFFPGLLLWAPFLNFRKLPRDPSMKAPWDTPA 200 (221)
Q Consensus 137 LLVvP~~ly~wLn~--RWy~~~~~ER~~m--Y~LVF~FFPGllL~APFLNFR~~pR~psmkyPWs~P~ 200 (221)
++++=+++-+|+++ |-..+.++=|.+. +++.|+.-=++=.||=.-+.|+-.++-+ .+|++.|-
T Consensus 145 il~~~~~~a~~~w~kl~~~~~~~~~~~~~~~~~~~fl~sh~ih~wadA~~~~~It~~~~-~lPL~yP~ 211 (253)
T PF11893_consen 145 ILLLELLLANWLWKKLRKLQRRKLGRPVAALFFLCFLASHLIHIWADANLYRPITQQDN-NLPLYYPL 211 (253)
T ss_pred HHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCchhhhhc-cCCccchh
Confidence 44555555555544 4444444555543 3444444456677888888888887755 46888773
No 18
>PF00902 TatC: Sec-independent protein translocase protein (TatC); InterPro: IPR002033 Proteins encoded by the mttABC operon (formerly yigTUW), mediate a novel Sec-independent membrane targeting and translocation system in Escherichia coli that interacts with cofactor-containing redox proteins having a S/TRRXFLK "twin arginine" leader motif. This family contains the E. coli mttB gene (TATC) []. A functional Tat system or Delta pH-dependent pathway requires three integral membrane proteins: TatA/Tha4, TatB/Hcf106 and TatC/cpTatC. The TatC protein is essential for the function of both pathways. It might be involved in twin-arginine signal peptide recognition, protein translocation and proton translocation. Sequence analysis predicts that TatC contains six transmembrane helices (TMHs), and experimental data confirmed that N and C termini of TatC or cpTatC are exposed to the cytoplasmic or stromal face of the membrane. The cytoplasmic N terminus and the first cytoplasmic loop region of the E. coli TatC protein are essential for protein export. At least two TatC molecules co-exist within each Tat translocon [, ].
Probab=48.19 E-value=96 Score=25.97 Aligned_cols=60 Identities=25% Similarity=0.461 Sum_probs=42.8
Q ss_pred HHHHHHHHHHHHHHHHHhHhHHHHHHHHHH----HHhhhhHHHHHHHHHHHHhhhhhHhhhhcc
Q 027616 122 LTVALIKVGIIAFWYFLIMPPIIMNWLRVR----WYKRKLFEMYVQFMFVFMFFPGLLLWAPFL 181 (221)
Q Consensus 122 ~l~Vll~Y~~La~~YLLVvP~~ly~wLn~R----Wy~~~~~ER~~mY~LVF~FFPGllL~APFL 181 (221)
...+++-++++++.|.+++|.++-+-++-- ....-+++.|+-+.+.+++.=|++.--|.+
T Consensus 105 ~~~~~lf~~g~~f~y~~ilP~~~~fl~~f~~~~~~~~~~~i~~y~~f~~~~~~~~gl~FqlPli 168 (215)
T PF00902_consen 105 LISFILFLLGVAFAYFVILPLILKFLLSFSPTSGIQPEPSISSYLNFVIQFLLIFGLIFQLPLI 168 (215)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcchhhhccHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 455667788889999999999988877611 133345778888887777777776655544
No 19
>PF01102 Glycophorin_A: Glycophorin A; InterPro: IPR001195 Proteins in this group are responsible for the molecular basis of the blood group antigens, surface markers on the outside of the red blood cell membrane. Most of these markers are proteins, but some are carbohydrates attached to lipids or proteins [Reid M.E., Lomas-Francis C. The Blood Group Antigen FactsBook Academic Press, London / San Diego, (1997)]. Glycophorin A (PAS-2) and glycophorin B (PAS-3) belong to the MNS blood group system and are associated with antigens that include M/N, S/s, U, He, Mi(a), M(c), Vw, Mur, M(g), Vr, M(e), Mt(a), St(a), Ri(a), Cl(a), Ny(a), Hut, Hil, M(v), Far, Mit, Dantu, Hop, Nob, En(a), ENKT, amongst others. Glycophorin A is the major sialoglycoprotein of the erythrocyte membrane []. Structurally, glycophorin A consists of an N-terminal extracellular domain, heavily glycosylated on serine and threonine residues, followed by a transmembrane region and a C-terminal cytoplasmic domain. Other glycophorins in this entry such as Glycophorin B and Glycophorin E represent minor sialoglycoproteins in the erythrocyte membrane.; GO: 0016021 integral to membrane; PDB: 2KPF_B 1AFO_B 2KPE_A.
Probab=47.71 E-value=36 Score=27.77 Aligned_cols=26 Identities=23% Similarity=0.151 Sum_probs=14.6
Q ss_pred HHHHHHHHHHHHHHHhHhHHHHHHHHHH
Q 027616 124 VALIKVGIIAFWYFLIMPPIIMNWLRVR 151 (221)
Q Consensus 124 ~Vll~Y~~La~~YLLVvP~~ly~wLn~R 151 (221)
.+++.+++++|.-+++ ++|+|++++|
T Consensus 66 i~~Ii~gv~aGvIg~I--lli~y~irR~ 91 (122)
T PF01102_consen 66 IIGIIFGVMAGVIGII--LLISYCIRRL 91 (122)
T ss_dssp HHHHHHHHHHHHHHHH--HHHHHHHHHH
T ss_pred eeehhHHHHHHHHHHH--HHHHHHHHHH
Confidence 3556667777764444 3555555443
No 20
>PF05232 BTP: Bacterial Transmembrane Pair family; InterPro: IPR007896 This domain represents a conserved pair of transmembrane helices. It appears to be found as two tandem repeats in a family of hypothetical proteins.
Probab=47.23 E-value=37 Score=24.60 Aligned_cols=55 Identities=22% Similarity=0.318 Sum_probs=38.2
Q ss_pred hhhhhhHHHHHHHhhcCCcee-eecccCCchhHHHHHHHHHHHHHHHHHhHhHHHHHHHHHHHHhh
Q 027616 91 SSLGIQAGAVLLATLEQPALA-VTGENNHEIDLTVALIKVGIIAFWYFLIMPPIIMNWLRVRWYKR 155 (221)
Q Consensus 91 ~~lalq~Ga~llA~~e~PAlA-vtg~~n~~~D~l~Vll~Y~~La~~YLLVvP~~ly~wLn~RWy~~ 155 (221)
+++.-+.|+++ +..|.+| ++|. +-.|...+.+...+++..|= .+|||+-.||..+
T Consensus 9 hai~FE~~~l~---~~~P~~a~~~~~--~~~~a~~l~v~~s~~a~~wn-----~ifN~~FD~~~~r 64 (67)
T PF05232_consen 9 HAILFEVGALL---ISVPLIAWWLGI--SLWQAGALDVGLSLFAMVWN-----YIFNWLFDKIEPR 64 (67)
T ss_pred HHHHHHHHHHH---HHHHHHHHHHCC--CHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHhcc
Confidence 45667888887 4458777 5776 66777777766666666664 5788888877653
No 21
>PF14798 Ca_hom_mod: Calcium homeostasis modulator
Probab=40.49 E-value=17 Score=32.49 Aligned_cols=26 Identities=31% Similarity=0.570 Sum_probs=19.6
Q ss_pred HHHHHHHHHHhhhhhHhhhh--ccCccc
Q 027616 160 MYVQFMFVFMFFPGLLLWAP--FLNFRK 185 (221)
Q Consensus 160 R~~mY~LVF~FFPGllL~AP--FLNFR~ 185 (221)
.-+.||+||++-|+++|+.- ++|-|-
T Consensus 47 ~N~~YGl~fLlvPAl~LfllG~~ln~~~ 74 (251)
T PF14798_consen 47 WNFLYGLVFLLVPALVLFLLGYLLNRRT 74 (251)
T ss_pred ccchhHhHHHHHHHHHHHHHHHHHhccc
Confidence 45789999999999887643 456554
No 22
>PF05934 MCLC: Mid-1-related chloride channel (MCLC); InterPro: IPR009231 This entry consists of several chloride channel CLIC-like proteins, which function as chloride channels when incorporated in a planar lipid bilayer.
Probab=39.55 E-value=88 Score=31.66 Aligned_cols=26 Identities=15% Similarity=0.423 Sum_probs=17.8
Q ss_pred hHHHHHHHHHHHHhhhhHHHHHHHHHHH
Q 027616 141 PPIIMNWLRVRWYKRKLFEMYVQFMFVF 168 (221)
Q Consensus 141 P~~ly~wLn~RWy~~~~~ER~~mY~LVF 168 (221)
=.+...|...|||.. +-|+|+..+++
T Consensus 196 iVAteLwt~V~W~~Q--l~R~fvisFLi 221 (549)
T PF05934_consen 196 IVATELWTYVSWFTQ--LRRMFVISFLI 221 (549)
T ss_pred HHHHHHHHHHHHHHH--HHHHHHHHHHH
Confidence 345789999999876 66665444433
No 23
>PF07123 PsbW: Photosystem II reaction centre W protein (PsbW); InterPro: IPR009806 Oxygenic photosynthesis uses two multi-subunit photosystems (I and II) located in the cell membranes of cyanobacteria and in the thylakoid membranes of chloroplasts in plants and algae. Photosystem II (PSII) has a P680 reaction centre containing chlorophyll 'a' that uses light energy to carry out the oxidation (splitting) of water molecules, and to produce ATP via a proton pump. Photosystem I (PSI) has a P700 reaction centre containing chlorophyll that takes the electron and associated hydrogen donated from PSII to reduce NADP+ to NADPH. Both ATP and NADPH are subsequently used in the light-independent reactions to convert carbon dioxide to glucose using the hydrogen atom extracted from water by PSII, releasing oxygen as a by-product. PSII is a multisubunit protein-pigment complex containing polypeptides both intrinsic and extrinsic to the photosynthetic membrane [, ]. Within the core of the complex, the chlorophyll and beta-carotene pigments are mainly bound to the antenna proteins CP43 (PsbC) and CP47 (PsbB), which pass the excitation energy on to the reaction centre proteins D1 (Qb, PsbA) and D2 (Qa, PsbD) that bind all the redox-active cofactors involved in the energy conversion process. The PSII oxygen-evolving complex (OEC) oxidises water to provide protons for use by PSI, and consists of OEE1 (PsbO), OEE2 (PsbP) and OEE3 (PsbQ). The remaining subunits in PSII are of low molecular weight (less than 10 kDa), and are involved in PSII assembly, stabilisation, dimerisation, and photo-protection []. This family represents the low molecular weight transmembrane protein PsbW found in PSII, where it is a subunit of the oxygen-evolving complex. PsbW appears to have several roles, including guiding PSII biogenesis and assembly, stabilising dimeric PSII [], and facilitating PSII repair after photo-inhibition []. 
There appear to be two classes of PsbW, class 1 being found predominantly in algae and cyanobacteria, and class 2 being found predominantly in plants. This entry represents class 2 PsbW.; GO: 0015979 photosynthesis, 0009507 chloroplast, 0009523 photosystem II
Probab=39.20 E-value=47 Score=28.19 Aligned_cols=32 Identities=34% Similarity=0.431 Sum_probs=17.6
Q ss_pred cCCceee---------e----cccCCchhHHHHHHHHHHHHHHHHH
Q 027616 106 EQPALAV---------T----GENNHEIDLTVALIKVGIIAFWYFL 138 (221)
Q Consensus 106 e~PAlAv---------t----g~~n~~~D~l~Vll~Y~~La~~YLL 138 (221)
.+||+|+ | |. ||+.-.+..++.++.+-.+|++
T Consensus 79 a~PA~ALVDeRlsteGTGL~lGl-sn~~LgwIL~gVf~lIWslY~~ 123 (138)
T PF07123_consen 79 ASPALALVDERLSTEGTGLPLGL-SNNLLGWILLGVFGLIWSLYFV 123 (138)
T ss_pred cCcHHHHHHHHhcCCCccccccc-cCchhHHHHHHHHHHHHHHHHh
Confidence 7899996 2 23 2333334444455555555554
No 24
>TIGR02484 CitB CitB domain protein. CobZ is essential for cobalamin biosynthesis (by knockout of the R. capsulatus gene) and is complemented by the characterized precorrin 3B synthase CobG. The enzyme has been shown to contain flavin, heme and Fe-S cluster cofactors and is believed to require dioxygen as a substrate. This model identifies the C-terminal domain of the R. capsulatus CobZ, which, in most other species, exists as a separate gene adjacent to CobZ.
Probab=39.01 E-value=72 Score=30.49 Aligned_cols=60 Identities=15% Similarity=0.087 Sum_probs=44.8
Q ss_pred CchhHHHHHHHHHHHHHHHHHhHhHHHHHHHHHHHHhh-----hh--HHHHHHHHHHHHhhhhhHhhh
Q 027616 118 HEIDLTVALIKVGIIAFWYFLIMPPIIMNWLRVRWYKR-----KL--FEMYVQFMFVFMFFPGLLLWA 178 (221)
Q Consensus 118 ~~~D~l~Vll~Y~~La~~YLLVvP~~ly~wLn~RWy~~-----~~--~ER~~mY~LVF~FFPGllL~A 178 (221)
-|-+++...-+.+.++++=|++-+.. ++|++.|.... +. .+++|..++.+.-.=|+++++
T Consensus 246 aPypl~s~pklLG~~GGi~Ll~G~~~-l~~l~~R~~~~~~~~~~~~~~D~~fl~lL~lv~~TGl~l~~ 312 (372)
T TIGR02484 246 APYPLLSLPVILGLVGGVAMLAGAAG-LSGLEARADPEPLKTPAMLRSDRFLLGQLALLAGTGLALLA 312 (372)
T ss_pred CCCCcccHHHHHHHHHHHHHHHHHHH-HHHHHHhcCcccccccccccchHHHHHHHHHHHHHHHHHHH
Confidence 46666766666777777777777554 78999999532 23 599999999888888988874
No 25
>TIGR01433 CyoA cytochrome o ubiquinol oxidase subunit II. This enzyme catalyzes the oxidation of ubiquinol with the concomitant reduction of molecular oxygen to water. This acts as the terminal electron acceptor in the respiratory chain. Subunit II is responsible for binding and oxidation of the ubiquinone substrate. This sequence is closely related to QoxA, which oxidizes quinol in gram positive bacteria but which is in complex with subunits which utilize cytochromes a in the reduction of molecular oxygen. Slightly more distantly related is subunit II of cytochrome c oxidase which uses cyt. c as the oxidant.
Probab=38.79 E-value=24 Score=30.63 Aligned_cols=76 Identities=13% Similarity=0.143 Sum_probs=40.6
Q ss_pred HHHhhcCCceeeecccCCchhHHHHHHHHHHHHHHHHHhHhHHHHHHHHHH-----------HHhhhhHHHHHH----HH
Q 027616 101 LLATLEQPALAVTGENNHEIDLTVALIKVGIIAFWYFLIMPPIIMNWLRVR-----------WYKRKLFEMYVQ----FM 165 (221)
Q Consensus 101 llA~~e~PAlAvtg~~n~~~D~l~Vll~Y~~La~~YLLVvP~~ly~wLn~R-----------Wy~~~~~ER~~m----Y~ 165 (221)
+|+...+++|--.|.....+|.++++... +.+++.++|.=+++|+..+-| |.....+|.... ..
T Consensus 9 ~l~g~~~~~l~p~g~~a~~~~~l~~~~~~-~~~ii~v~v~~~~~~~~~r~r~~~~~~~~~p~~~~~~~lE~~wt~iP~ii 87 (226)
T TIGR01433 9 LLSGCNSALLDPKGQIGLEERSLILTAFG-LMLLVVIPVILMTLFFAWKYRATNKDADYSPNWHHSTKIEIVVWTIPILI 87 (226)
T ss_pred HHcCCCccccCCCChhHHHHHHHHHHHHH-HHHHHHHHHHHHHheeeEEEeccCCcCCCCCcccCCceeehhhHHHHHHH
Confidence 33555566777677655667776654333 333333445444455554433 334456886432 33
Q ss_pred HHHHhhhhhHhh
Q 027616 166 FVFMFFPGLLLW 177 (221)
Q Consensus 166 LVF~FFPGllL~ 177 (221)
++++++|++-+.
T Consensus 88 l~~l~~~s~~~~ 99 (226)
T TIGR01433 88 IIFLGVLTWITT 99 (226)
T ss_pred HHHHHHHHHHHH
Confidence 456666666555
No 26
>PHA02702 ORF033 IMV membrane protein; Provisional
Probab=38.49 E-value=75 Score=24.90 Aligned_cols=48 Identities=13% Similarity=0.095 Sum_probs=34.3
Q ss_pred HHHHHHHHHHHHHh--HhHHHHHHHHHHHHhhhhHHHHHHHHHHHHhhhhhHhh
Q 027616 126 LIKVGIIAFWYFLI--MPPIIMNWLRVRWYKRKLFEMYVQFMFVFMFFPGLLLW 177 (221)
Q Consensus 126 ll~Y~~La~~YLLV--vP~~ly~wLn~RWy~~~~~ER~~mY~LVF~FFPGllL~ 177 (221)
+.+..+++|.=+++ +|+ -.-||.+ ..+.+=|.+=|+-+++|.||.+.+
T Consensus 15 LmLlMvisGgali~r~~~p--~l~~rS~--~~~Rvltvle~va~l~~IPgtIiL 64 (78)
T PHA02702 15 LMLLIVVTGGATIARRGAP--SLGIRSR--GALRVLTVLDFVSLLTTIPCTIIL 64 (78)
T ss_pred HHHHHHHhhHHHHHhhcCc--hhheecc--cchhHHHHHHHHHHHHHhchHHHH
Confidence 33444455554443 455 4456666 899999999999999999998765
No 27
>PF04632 FUSC: Fusaric acid resistance protein family; InterPro: IPR006726 This entry represents the p-hydroxybenzoic acid efflux pump subunit AaeB (pHBA efflux pump protein B) whose substrates are p-hydroxybenzoic acid (pHBA), 6-hydroxy-2-naphthoic and 2-hydroxycinnamate. It could function as a metabolic relief valve, allowing certain compounds to be eliminated when they accumulate to high levels in the cell []. This family also includes fusaric acid resistance proteins [], which are likely to be membrane transporter proteins, and uncharacterised transporter YdhK.; GO: 0006810 transport, 0005886 plasma membrane
Probab=37.68 E-value=59 Score=30.41 Aligned_cols=20 Identities=30% Similarity=0.614 Sum_probs=11.0
Q ss_pred hHHHHHHHHHHHHhhhhhHh
Q 027616 157 LFEMYVQFMFVFMFFPGLLL 176 (221)
Q Consensus 157 ~~ER~~mY~LVF~FFPGllL 176 (221)
.+|.+.+.+.+|+|+=|++.
T Consensus 416 ~f~~L~l~l~~~l~~~~~~~ 435 (650)
T PF04632_consen 416 GFPLLALVLAPFLFLGGLLM 435 (650)
T ss_pred cHHHHHHHHHHHHHHHHHHH
Confidence 36666666655555544443
No 28
>PLN00077 photosystem II reaction centre W protein; Provisional
Probab=37.18 E-value=68 Score=27.06 Aligned_cols=10 Identities=30% Similarity=0.577 Sum_probs=8.2
Q ss_pred HhhcCCceee
Q 027616 103 ATLEQPALAV 112 (221)
Q Consensus 103 A~~e~PAlAv 112 (221)
++..+||+|+
T Consensus 65 ~a~a~PA~Al 74 (128)
T PLN00077 65 MAYAHPAFAL 74 (128)
T ss_pred HhccccHHHH
Confidence 5668999997
No 29
>PRK13275 mtrF tetrahydromethanopterin S-methyltransferase subunit F; Provisional
Probab=36.87 E-value=65 Score=24.29 Aligned_cols=27 Identities=11% Similarity=0.445 Sum_probs=20.5
Q ss_pred HHHHHHHHHHHHHHHHHhHhHHHHHHH
Q 027616 122 LTVALIKVGIIAFWYFLIMPPIIMNWL 148 (221)
Q Consensus 122 ~l~Vll~Y~~La~~YLLVvP~~ly~wL 148 (221)
+-....+++.+..+-|+++|+++++.+
T Consensus 40 ~~~~G~aiG~~~AlvLv~ip~~l~~~~ 66 (67)
T PRK13275 40 TGIIGFAIGFLLALLLVVVPPLLYGLV 66 (67)
T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 344667778888888889999988754
No 30
>PF04612 T2SM: Type II secretion system (T2SS), protein M; InterPro: IPR007690 General secretion pathway (GSP) protein M is a membrane protein involved in the export of proteins in bacteria. It consists of a short cytosolic N-terminal domain, a transmembrane domain, and a C-terminal periplasmic domain. The precise function of this protein is unknown, though in Vibrio cholerae, the EpsM protein interacts with the EpsL protein, and also forms homodimers [].; GO: 0006858 extracellular transport; PDB: 1UV7_A.
Probab=35.99 E-value=12 Score=29.05 Aligned_cols=43 Identities=26% Similarity=0.430 Sum_probs=0.0
Q ss_pred HHHHHHHHHhhhhHHHHHHHHH---HHHhhhhhHhhhhccCcccCC
Q 027616 145 MNWLRVRWYKRKLFEMYVQFMF---VFMFFPGLLLWAPFLNFRKLP 187 (221)
Q Consensus 145 y~wLn~RWy~~~~~ER~~mY~L---VF~FFPGllL~APFLNFR~~p 187 (221)
|.-++.||...+.=||.++..+ +++|+-.+++|.|..+-|-.-
T Consensus 1 m~~l~~~w~~ls~REr~ll~~~~~~l~~~l~~~~~~~P~~~~~~~~ 46 (160)
T PF04612_consen 1 MQQLKQWWQSLSPRERRLLLVLGVVLLLALLYLLLWQPLLERRDQL 46 (160)
T ss_dssp ----------------------------------------------
T ss_pred ChHHHHHHHhCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4457889999999999887754 444555667889999876543
No 31
>PRK11560 phosphoethanolamine transferase; Provisional
Probab=35.39 E-value=77 Score=31.20 Aligned_cols=52 Identities=17% Similarity=0.030 Sum_probs=36.6
Q ss_pred CchhHHHHHHHHHHHHHHHHHhHhHHHHHHHHHHHHh--hhhHH---HHHHHHHHHHh
Q 027616 118 HEIDLTVALIKVGIIAFWYFLIMPPIIMNWLRVRWYK--RKLFE---MYVQFMFVFMF 170 (221)
Q Consensus 118 ~~~D~l~Vll~Y~~La~~YLLVvP~~ly~wLn~RWy~--~~~~E---R~~mY~LVF~F 170 (221)
|..|....+-..+++-++-+-|+|.++.++.+.+ +. +...+ |+.....+++.
T Consensus 111 d~~Ea~~lls~~~~~~~l~~gvlP~~~i~~~~i~-~~~~~~~~~~~~~~~~~~~~~~~ 167 (558)
T PRK11560 111 DIDLSKEVVGLHFILWLVAVSALPLILIWNNRCR-YTLLRQLRTPGQRIRSLAVVVLA 167 (558)
T ss_pred CHHHHHHhcCHHHHHHHHHHHHHHHHHHHHhhcc-ccchhHHHHHHHHHHHHHHHHHH
Confidence 6777777777888888888899999999999875 22 44445 55444444444
No 32
>TIGR01478 STEVOR variant surface antigen, stevor family. This model represents the stevor branch of the rifin/stevor family (pfam02009) of predicted variant surface antigens as found in Plasmodium falciparum. This model is based on a set of stevor sequences kindly provided by Matt Berriman from the Sanger Center. This is a global model and assesses a penalty for incomplete sequence. Additional fragmentary sequences may be found with the fragment model and a cutoff of 8 bits.
Probab=35.02 E-value=44 Score=31.36 Aligned_cols=20 Identities=20% Similarity=0.431 Sum_probs=14.5
Q ss_pred HHHHHHHhHhHHHHHHHHHH
Q 027616 132 IAFWYFLIMPPIIMNWLRVR 151 (221)
Q Consensus 132 La~~YLLVvP~~ly~wLn~R 151 (221)
|..+-|.||=.|+|.||.+|
T Consensus 266 lvllil~vvliiLYiWlyrr 285 (295)
T TIGR01478 266 LVLIILTVVLIILYIWLYRR 285 (295)
T ss_pred HHHHHHHHHHHHHHHHHHHh
Confidence 33445567778889999887
No 33
>PTZ00370 STEVOR; Provisional
Probab=34.80 E-value=44 Score=31.32 Aligned_cols=20 Identities=20% Similarity=0.451 Sum_probs=14.5
Q ss_pred HHHHHHHhHhHHHHHHHHHH
Q 027616 132 IAFWYFLIMPPIIMNWLRVR 151 (221)
Q Consensus 132 La~~YLLVvP~~ly~wLn~R 151 (221)
|..+-|.||=.|+|.||.+|
T Consensus 262 lvllil~vvliilYiwlyrr 281 (296)
T PTZ00370 262 LVLLILAVVLIILYIWLYRR 281 (296)
T ss_pred HHHHHHHHHHHHHHHHHHHh
Confidence 33445567778889999887
No 34
>TIGR01432 QOXA cytochrome aa3 quinol oxidase, subunit II. This enzyme catalyzes the oxidation of quinol with the concomitant reduction of molecular oxygen to water. This acts as the terminal electron acceptor in the respiratory chain. This subunit contains two transmembrane helices and a large external domain responsible for the binding and oxidation of quinol. QuoX is (presently) only found in gram positive bacteria of the Bacillus/Staphylococcus group. Like CyoA, the ubiquinol oxidase found in proteobacteria, the residues responsible for the ligation of Cu(a) and cytochrome c (found in the related cyt. c oxidases) are absent. Unlike CyoA, QoxA is in complex with a subunit I which contains cytochromes a similar to the cyt. c oxidases (as opposed to cytochromes b).
Probab=34.15 E-value=1.4e+02 Score=25.46 Aligned_cols=24 Identities=8% Similarity=0.117 Sum_probs=16.2
Q ss_pred hhHHHHH----HHHHHHHhhhhhHhhhh
Q 027616 156 KLFEMYV----QFMFVFMFFPGLLLWAP 179 (221)
Q Consensus 156 ~~~ER~~----mY~LVF~FFPGllL~AP 179 (221)
..+|... +..++++++|++-++.-
T Consensus 65 ~~LEiiWTiIP~lIl~~L~~~s~~~~~~ 92 (217)
T TIGR01432 65 AILETIWTVIPIIIVIALAIPTVKTIYD 92 (217)
T ss_pred chhhHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 4477544 45678888888877643
No 35
>KOG4040 consensus NADH:ubiquinone oxidoreductase, NDUFB8/ASHI subunit [Energy production and conversion]
Probab=33.43 E-value=65 Score=28.53 Aligned_cols=69 Identities=19% Similarity=0.241 Sum_probs=47.9
Q ss_pred HhHHHHHHHHHHHHhhhhHHH----HHHHHHHHHhhhhhHhhhhccCcccCCCCCCC--CCCCCCCCCCcccccccccCC
Q 027616 140 MPPIIMNWLRVRWYKRKLFEM----YVQFMFVFMFFPGLLLWAPFLNFRKLPRDPSM--KAPWDTPADPSKVKNAYLKFP 213 (221)
Q Consensus 140 vP~~ly~wLn~RWy~~~~~ER----~~mY~LVF~FFPGllL~APFLNFR~~pR~psm--kyPWs~P~dps~ikn~y~kyP 213 (221)
||.=...|=..||....+ |+ -.|+|+.|-|.|++|+..=|.|=-|.=| |-| +||. |.|.-||
T Consensus 105 v~~d~d~Y~~dR~t~~e~-p~y~~w~~~~mcl~g~~~~~l~~~y~~d~~p~yk-Pv~pKQYpy----------~~~~~y~ 172 (186)
T KOG4040|consen 105 VPIDMDRYRGDRFTGLEA-PDYTTWNSIVMCLRGLVPMALLAWYFTDEHPRYK-PVMPKQYPY----------DFYRAYP 172 (186)
T ss_pred cchhHhhhccccccccCC-CCcccHHHHHHHHHHHHHHHHHHHHHcccccccc-cCCcccCCC----------CCeeecc
Confidence 455567777888776543 22 3567888889999999999998777544 655 4554 3466678
Q ss_pred CCCCCCC
Q 027616 214 WAQVEDY 220 (221)
Q Consensus 214 ~A~pEDY 220 (221)
|.+|..|
T Consensus 173 f~dp~k~ 179 (186)
T KOG4040|consen 173 FDDPRKY 179 (186)
T ss_pred CCCCccC
Confidence 8777654
No 36
>PHA02974 putative IMV membrane protein; Provisional
Probab=33.02 E-value=1.3e+02 Score=23.67 Aligned_cols=51 Identities=16% Similarity=0.394 Sum_probs=34.7
Q ss_pred HHHHHHHHHHHHHh--HhHHHHHHHHHHHHhhhhHHHHHHHHHHHHhhhhhHhh
Q 027616 126 LIKVGIIAFWYFLI--MPPIIMNWLRVRWYKRKLFEMYVQFMFVFMFFPGLLLW 177 (221)
Q Consensus 126 ll~Y~~La~~YLLV--vP~~ly~wLn~RWy~~~~~ER~~mY~LVF~FFPGllL~ 177 (221)
+....+++|.=|++ +|+ .+-.+-+|=...+++=|.+=|.-+++|.||-+.+
T Consensus 16 L~llMiisG~aLi~k~~~P-~~~~v~~ss~tf~rvv~~lE~vailifiPGti~L 68 (81)
T PHA02974 16 LVVLLIISGFSLILRLIPG-VYSSVIRSSFTAGKILRFMEIFSTIMFIPGIIIL 68 (81)
T ss_pred HHHHHHHhChHHHHhhcCc-hhhhhhhHHHHHHHHHHHHHHHHHhheeccHHHH
Confidence 33444555555543 344 3344556666778899999999999999998754
No 37
>PF05529 Bap31: B-cell receptor-associated protein 31-like ; InterPro: IPR008417 Bap31 is a polytopic integral protein of the endoplasmic reticulum membrane and a substrate of caspase-8. Bap31 is cleaved within its cytosolic domain, generating pro-apoptotic p20 Bap31 [].; GO: 0006886 intracellular protein transport, 0005783 endoplasmic reticulum, 0016021 integral to membrane
Probab=32.26 E-value=2.3e+02 Score=23.22 Aligned_cols=63 Identities=16% Similarity=0.168 Sum_probs=34.2
Q ss_pred HHHHHHHHHHHHHHHHHhHhHHHHHHHHHH---HHhhhhHHHHHH-HHHHHHhhhhhHhhhhccCcccC
Q 027616 122 LTVALIKVGIIAFWYFLIMPPIIMNWLRVR---WYKRKLFEMYVQ-FMFVFMFFPGLLLWAPFLNFRKL 186 (221)
Q Consensus 122 ~l~Vll~Y~~La~~YLLVvP~~ly~wLn~R---Wy~~~~~ER~~m-Y~LVF~FFPGllL~APFLNFR~~ 186 (221)
.+...++|+-++++-+|++|+.--. ++. +...+...+.+. ++.+.++|=+++++.-+-+-++.
T Consensus 6 ~lvf~~L~~Ei~~~~lL~lPlp~~~--R~~i~~~~~~~~~~~~~~~~~~~~~~~~~~lf~ds~~~~~k~ 72 (192)
T PF05529_consen 6 SLVFGLLYAEIAVLLLLVLPLPSPI--RRKIFKFLDKSFFSGKFKTVFKILLAILLLLFLDSIRRMYKY 72 (192)
T ss_pred HHHHHHHHHHHHHHHHHHHhCCcHH--HHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3455677788888999999865322 222 222233333333 33444455566666665554443
No 38
>PRK14584 hmsS hemin storage system protein; Provisional
Probab=31.38 E-value=1.9e+02 Score=24.67 Aligned_cols=60 Identities=17% Similarity=0.232 Sum_probs=36.1
Q ss_pred hHHHHHHHHHHHHHHHHHhHhHHHHHHH------HHHH-HhhhhHHHHHHHHHHHHhhhhh-HhhhhccCc
Q 027616 121 DLTVALIKVGIIAFWYFLIMPPIIMNWL------RVRW-YKRKLFEMYVQFMFVFMFFPGL-LLWAPFLNF 183 (221)
Q Consensus 121 D~l~Vll~Y~~La~~YLLVvP~~ly~wL------n~RW-y~~~~~ER~~mY~LVF~FFPGl-lL~APFLNF 183 (221)
|.++.++++ ++|+||++.=++-+++= +-.| +.-+.+-++..|.++.+|.=++ ++|| ..|-
T Consensus 18 D~~lT~~aW--~gfi~l~~~~~~~~~~~~~~~gp~~~~~~~~s~~~tl~~yl~ial~nAvlLI~WA-~YN~ 85 (153)
T PRK14584 18 DIILTALAW--FGFLFLLVRGLLEMISRAPHMGPIPLRIYILSGLTTIALYLAIAAFNAVLLIIWA-KYNQ 85 (153)
T ss_pred HHHHHHHHH--HHHHHHHHHHHHHHhccCcccCCcchhHHHhhhHHHHHHHHHHHHHHHHHHHHHH-HHHH
Confidence 444444433 45667776655555431 1234 4457788899999999999854 4555 4453
No 39
>COG2194 Predicted membrane-associated, metal-dependent hydrolase [General function prediction only]
Probab=31.31 E-value=2.4e+02 Score=27.95 Aligned_cols=35 Identities=9% Similarity=0.206 Sum_probs=29.8
Q ss_pred CchhHHHHHHHHHHHHHHHHHhHhHHHHHHHHHHH
Q 027616 118 HEIDLTVALIKVGIIAFWYFLIMPPIIMNWLRVRW 152 (221)
Q Consensus 118 ~~~D~l~Vll~Y~~La~~YLLVvP~~ly~wLn~RW 152 (221)
|..|...++.++.++..+...++|.++..+++.|-
T Consensus 107 n~~E~~el~t~~~~~~l~~~g~l~~ll~~~~~~r~ 141 (555)
T COG2194 107 NTAESSELLTLYFLLWLVLVGLLPALLIVLVIIRY 141 (555)
T ss_pred ChhhhhhhHhHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence 66677778888888888888999999999999883
No 40
>PLN02755 complex I subunit
Probab=30.95 E-value=28 Score=26.70 Aligned_cols=20 Identities=5% Similarity=0.049 Sum_probs=16.1
Q ss_pred HHHHHHHHHhHhHHHHHHHH
Q 027616 130 GIIAFWYFLIMPPIIMNWLR 149 (221)
Q Consensus 130 ~~La~~YLLVvP~~ly~wLn 149 (221)
+.++++|-++||.++|.=.-
T Consensus 34 ~~i~~ifgv~VP~liy~giv 53 (71)
T PLN02755 34 LAVVGIFGIAVPILVYKGIV 53 (71)
T ss_pred hhhhhhhhhhhhHHhhhhhh
Confidence 56788999999999986443
No 41
>PRK09173 F0F1 ATP synthase subunit B; Validated
Probab=30.89 E-value=1.1e+02 Score=24.35 Aligned_cols=31 Identities=19% Similarity=0.248 Sum_probs=17.7
Q ss_pred hHHHHHHHHHHHHHHH-HHhHhHHHHHHHHHH
Q 027616 121 DLTVALIKVGIIAFWY-FLIMPPIIMNWLRVR 151 (221)
Q Consensus 121 D~l~Vll~Y~~La~~Y-LLVvP~~ly~wLn~R 151 (221)
|++|.++.++++.++. .+..|--+...|..|
T Consensus 3 ~~~w~~i~f~i~l~~l~~~~~~~pi~~~l~~R 34 (159)
T PRK09173 3 ATFWAFVGLVLFLALVVYLKVPGMIARSLDAR 34 (159)
T ss_pred chHHHHHHHHHHHHHHHHHHhHHHHHHHHHHH
Confidence 5666666666554442 223555566667666
No 42
>PRK00068 hypothetical protein; Validated
Probab=30.65 E-value=1.7e+02 Score=31.33 Aligned_cols=58 Identities=22% Similarity=0.468 Sum_probs=40.7
Q ss_pred HHHHHHHHhHhHHHHHHHHHHHHhh---------hhHHHHHHHHHHHHhhhhhHhhhhccCcccCCC
Q 027616 131 IIAFWYFLIMPPIIMNWLRVRWYKR---------KLFEMYVQFMFVFMFFPGLLLWAPFLNFRKLPR 188 (221)
Q Consensus 131 ~La~~YLLVvP~~ly~wLn~RWy~~---------~~~ER~~mY~LVF~FFPGllL~APFLNFR~~pR 188 (221)
++.++-+++++.+.=+|.+.-||.. -.+=|+.+|..+|+..=+++.++=++.+|-+|.
T Consensus 22 ~~i~vll~~~~~~~~~~td~lWF~~lgy~~Vf~t~l~t~~~Lf~~~~~~~a~~~~~~~~la~r~rp~ 88 (970)
T PRK00068 22 LIIILLLLFGPRLVDFYIDWLWFGEVGYRSVFFTKLVTRIVLFIPVGLLVGGIVFISLWLAYRSRPV 88 (970)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhCCCceeehHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCc
Confidence 3333445556677788899999864 445578888888887777777777777776554
No 43
>PF06814 Lung_7-TM_R: Lung seven transmembrane receptor; InterPro: IPR009637 This family represents a conserved region within eukaryotic lung seven transmembrane receptors and related proteins.; GO: 0016021 integral to membrane
Probab=29.23 E-value=2.6e+02 Score=24.27 Aligned_cols=32 Identities=13% Similarity=0.437 Sum_probs=19.3
Q ss_pred HHHHHhhhhHHHHHHHHHHHHhhhhhHhhhhccCcccCCCCC
Q 027616 149 RVRWYKRKLFEMYVQFMFVFMFFPGLLLWAPFLNFRKLPRDP 190 (221)
Q Consensus 149 n~RWy~~~~~ER~~mY~LVF~FFPGllL~APFLNFR~~pR~p 190 (221)
+.+|.....+|-+...+++.+. ..|||..+.|
T Consensus 264 ~~~W~~~~~~~~l~~~~~~~i~----------~lwRPs~~n~ 295 (295)
T PF06814_consen 264 KYQWFIEAFWELLYFVFLVAIM----------YLWRPSENNQ 295 (295)
T ss_pred HHHhHHHHHHHHHHHHHHHHHH----------heeCCCCCCc
Confidence 4579888888765555544443 2566665543
No 44
>PF09972 DUF2207: Predicted membrane protein (DUF2207); InterPro: IPR018702 This domain has no known function.
Probab=29.17 E-value=57 Score=28.81 Aligned_cols=41 Identities=32% Similarity=0.370 Sum_probs=25.2
Q ss_pred hhhhhHhhhhccCcccCCCCCC----CCCCCCCCCCCcccccccc
Q 027616 170 FFPGLLLWAPFLNFRKLPRDPS----MKAPWDTPADPSKVKNAYL 210 (221)
Q Consensus 170 FFPGllL~APFLNFR~~pR~ps----mkyPWs~P~dps~ikn~y~ 210 (221)
...+++++.+++-++.++|++. ..|-.+.|+|-++.--+|+
T Consensus 236 ~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~P~~~~Pa~v~~l 280 (511)
T PF09972_consen 236 VLGILLLLIFLIIWRKYGRDPKKGKPGEYYREPPEDLSPAVVGYL 280 (511)
T ss_pred HHHHHHHHHHHHHhhhcccccccCCCCceeeCCCCCCChHHhhHh
Confidence 3444445566667777777765 4556778877666554444
No 45
>TIGR01167 LPXTG_anchor LPXTG-motif cell wall anchor domain. A common feature of the proteins containing this domain appears to be a high proportion of charged and zwitterionic residues immediately upstream of the LPXTG motif. This model differs from other descriptions of the LPXTG region by including a portion of that upstream charged region.
Probab=28.95 E-value=83 Score=18.96 Aligned_cols=6 Identities=67% Similarity=0.850 Sum_probs=4.0
Q ss_pred eeeecc
Q 027616 110 LAVTGE 115 (221)
Q Consensus 110 lAvtg~ 115 (221)
|--||+
T Consensus 2 LP~TG~ 7 (34)
T TIGR01167 2 LPKTGE 7 (34)
T ss_pred CCCCCC
Confidence 345898
No 46
>PF07136 DUF1385: Protein of unknown function (DUF1385); InterPro: IPR010787 This family contains a number of hypothetical bacterial proteins of unknown function approximately 300 residues in length. Some family members are predicted to be metal-dependent.
Probab=28.19 E-value=3.2e+02 Score=24.64 Aligned_cols=32 Identities=16% Similarity=0.341 Sum_probs=25.3
Q ss_pred CCchhHHHHHHHHHHHHHHHHHhHhHHHHHHH
Q 027616 117 NHEIDLTVALIKVGIIAFWYFLIMPPIIMNWL 148 (221)
Q Consensus 117 n~~~D~l~Vll~Y~~La~~YLLVvP~~ly~wL 148 (221)
-++.+..+..+.-.+++.+.|+|+|..+-.++
T Consensus 38 ~~~~~~~~~~~~s~~~~i~lF~~lP~~l~~~~ 69 (236)
T PF07136_consen 38 LSSWEMALTVILSLALAIGLFVVLPTFLAGLL 69 (236)
T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34556666667777888899999999999888
No 47
>PF07760 DUF1616: Protein of unknown function (DUF1616); InterPro: IPR011674 This is a group of sequences from hypothetical archaeal proteins. The region in question is approximately 330 amino acid residues long.
Probab=27.79 E-value=1e+02 Score=27.22 Aligned_cols=45 Identities=16% Similarity=0.078 Sum_probs=21.5
Q ss_pred HHHHHHHHHHhHhHHHHHHHH-HHHHhhhhHHHHHH-HHHHHHhhhh
Q 027616 129 VGIIAFWYFLIMPPIIMNWLR-VRWYKRKLFEMYVQ-FMFVFMFFPG 173 (221)
Q Consensus 129 Y~~La~~YLLVvP~~ly~wLn-~RWy~~~~~ER~~m-Y~LVF~FFPG 173 (221)
=.++++.|++.+|.-...-.= =|=..-+.+||..+ +++-....|-
T Consensus 26 r~~~g~~~vlf~PGy~l~~~lfp~~~~l~~~er~~ls~glSi~~~~~ 72 (287)
T PF07760_consen 26 RVILGFPFVLFLPGYALVAALFPRKHDLDGIERLALSVGLSIAIVPL 72 (287)
T ss_pred HHHHHHHHHHHhccHHHHHHHccCcCCCcHHHHHHHHHHHHHHHHHH
Confidence 345666666667654332221 01223366777644 3443444443
No 48
>PF12292 DUF3624: Protein of unknown function (DUF3624); InterPro: IPR022072 This family of proteins is found in bacteria. Proteins in this family are approximately 90 amino acids in length. There is a conserved GRC sequence motif.
Probab=27.63 E-value=1.1e+02 Score=23.75 Aligned_cols=34 Identities=15% Similarity=0.262 Sum_probs=28.9
Q ss_pred CCchhHHHHHHHHHHHHHHHHHhHhHHHHHHHHH
Q 027616 117 NHEIDLTVALIKVGIIAFWYFLIMPPIIMNWLRV 150 (221)
Q Consensus 117 n~~~D~l~Vll~Y~~La~~YLLVvP~~ly~wLn~ 150 (221)
++|...-.|+++.++.++.=|+..-++++-|-+.
T Consensus 42 d~P~sieSIALl~~~~AfsgLL~lHLvv~~~r~~ 75 (77)
T PF12292_consen 42 DTPTSIESIALLFFCFAFSGLLFLHLVVFPWRRR 75 (77)
T ss_pred CCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 5788888899999999999999999988887654
No 49
>TIGR02507 MtrF tetrahydromethanopterin S-methyltransferase, F subunit. Subunit of the N5-methyltetrahydromethanopterin: coenzyme M methyltransferase in methanogenic archaea. This methyltransferase is a membrane-associated enzyme complex that uses a methyl-transfer reaction to drive a sodium-ion pump. Members of the Archaea domain have evolved energy-yielding pathways marked by one-carbon biochemistry featuring novel cofactors and enzymes. This transferase is involved in the transfer of the methyl group from N5-methyltetrahydromethanopterin to coenzyme M. In an accompanying reaction, methane is produced by two-electron reduction of the methyl moiety in methyl-coenzyme M by another enzyme, methyl-coenzyme M reductase.
Probab=26.74 E-value=1.1e+02 Score=23.18 Aligned_cols=26 Identities=12% Similarity=0.153 Sum_probs=19.2
Q ss_pred hHHHHHHHHHHHHHHHHHhHhHHHHH
Q 027616 121 DLTVALIKVGIIAFWYFLIMPPIIMN 146 (221)
Q Consensus 121 D~l~Vll~Y~~La~~YLLVvP~~ly~ 146 (221)
.+-.....++.+..+-|+++|+++++
T Consensus 39 ~~~~~G~~iG~~~Al~lV~IP~ll~~ 64 (65)
T TIGR02507 39 TTTITGLAYGFLFAVLLVAVPIAMKF 64 (65)
T ss_pred hhhhHHHHHHHHHHHHHHHHHHHHHh
Confidence 34456667777777778889998875
No 50
>PLN00092 photosystem I reaction center subunit V (PsaG); Provisional
Probab=26.59 E-value=1.1e+02 Score=26.06 Aligned_cols=9 Identities=33% Similarity=0.538 Sum_probs=6.9
Q ss_pred hhcCCceee
Q 027616 104 TLEQPALAV 112 (221)
Q Consensus 104 ~~e~PAlAv 112 (221)
+..+||+|+
T Consensus 76 ~~a~PA~Al 84 (137)
T PLN00092 76 MSASPAMAL 84 (137)
T ss_pred hhcCcHHHH
Confidence 346999997
No 51
>PRK10263 DNA translocase FtsK; Provisional
Probab=25.76 E-value=2.1e+02 Score=31.87 Aligned_cols=47 Identities=17% Similarity=0.114 Sum_probs=20.2
Q ss_pred CCceeeecccCCchhHHHHHHHHH------HHHHHHHHhHhHHHHHHHHHHHHh
Q 027616 107 QPALAVTGENNHEIDLTVALIKVG------IIAFWYFLIMPPIIMNWLRVRWYK 154 (221)
Q Consensus 107 ~PAlAvtg~~n~~~D~l~Vll~Y~------~La~~YLLVvP~~ly~wLn~RWy~ 154 (221)
.|.+--|+.+..-..+.++++.|+ ++|... +++|++++++.+..|..
T Consensus 48 DPSwS~sa~~~~V~Nl~GiVGA~LAD~L~~LFGl~A-YLLP~LL~~~a~~l~R~ 100 (1355)
T PRK10263 48 DPSWSQTAWHEPIHNLGGMPGAWLADTLFFIFGVMA-YTIPVIIVGGCWFAWRH 100 (1355)
T ss_pred CCcccccCcccccccccchHHHHHHHHHHHHHhHHH-HHHHHHHHHHHHHHHhc
Confidence 344433332223445555555543 233222 23445554454444544
No 52
>PF09624 DUF2393: Protein of unknown function (DUF2393); InterPro: IPR013417 The function of this protein is unknown. It is always found as part of a two-gene operon with IPR013416 from INTERPRO, a protein that appears to span the membrane seven times. It has so far been found in the bacteria Anabaena sp. (strain PCC 7120), Agrobacterium tumefaciens, Rhizobium meliloti, and Gloeobacter violaceus.
Probab=25.41 E-value=1.2e+02 Score=23.67 Aligned_cols=27 Identities=19% Similarity=0.294 Sum_probs=21.4
Q ss_pred HHHHHHHHHHHHHHhHhHHHHHHHHHH
Q 027616 125 ALIKVGIIAFWYFLIMPPIIMNWLRVR 151 (221)
Q Consensus 125 Vll~Y~~La~~YLLVvP~~ly~wLn~R 151 (221)
+++.+++++++-++.+|.++|.||.+.
T Consensus 18 ~~~~~~~~~~i~~~~~~~~~~~~l~~~ 44 (149)
T PF09624_consen 18 LALSFIIASFILAFLIPFFGYYWLDKY 44 (149)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhh
Confidence 566677777777777999999999865
No 53
>PF09472 MtrF: Tetrahydromethanopterin S-methyltransferase, F subunit (MtrF); InterPro: IPR013347 Many archaea have evolved energy-yielding pathways marked by one-carbon biochemistry featuring novel cofactors and enzymes. This domain is mostly found in MtrF, where it covers the entire length of the protein. This polypeptide is one of eight subunits of the N5-methyltetrahydromethanopterin: coenzyme M methyltransferase complex found in methanogenic archaea. This is a membrane-associated enzyme complex that uses methyl-transfer reactions to drive a sodium-ion pump []. MtrF itself is involved in the transfer of the methyl group from N5-methyltetrahydromethanopterin to coenzyme M. Subsequently, methane is produced by two-electron reduction of the methyl moiety in methyl-coenzyme M by another enzyme, methyl-coenzyme M reductase. In some organisms this domain is found at the C-terminal region of what appears to be a fusion of the MtrA and MtrF proteins [, ]. The function of these proteins is unknown, though it is likely that they are involved in C1 metabolism.; GO: 0030269 tetrahydromethanopterin S-methyltransferase activity, 0015948 methanogenesis, 0016020 membrane
Probab=25.36 E-value=89 Score=23.28 Aligned_cols=24 Identities=13% Similarity=0.259 Sum_probs=17.1
Q ss_pred HHHHHHHHHHHHHHHHhHhHHHHH
Q 027616 123 TVALIKVGIIAFWYFLIMPPIIMN 146 (221)
Q Consensus 123 l~Vll~Y~~La~~YLLVvP~~ly~ 146 (221)
-.+...++.+..+-|+++|++++|
T Consensus 41 ~~~GfaiG~~~AlvLv~ip~~l~~ 64 (64)
T PF09472_consen 41 GIKGFAIGFLFALVLVGIPILLMF 64 (64)
T ss_pred hhHHHHHHHHHHHHHHHHHHHHhC
Confidence 345667777777778888887764
No 54
>TIGR03469 HonB hopene-associated glycosyltransferase HpnB. This family of genes includes a glycosyl transferase, group 2 domain (pfam00535); such domains are generally responsible for the transfer of nucleotide-diphosphate sugars to substrates such as polysaccharides and lipids. The genes of this family are often found in the same genetic locus as squalene-hopene cyclase genes, and are never associated with genes for the metabolism of phytoene. Indeed, members of this family appear never to be found in a genome lacking squalene-hopene cyclase (SHC), although not all genomes encoding SHC have this glycosyl transferase. In the organism Zymomonas mobilis the linkage of this gene to hopanoid biosynthesis has been noted and the gene named HpnB. Hopanoids are known to feature polar glycosyl head groups in many organisms.
Probab=25.30 E-value=1.5e+02 Score=26.52 Aligned_cols=36 Identities=14% Similarity=0.243 Sum_probs=27.0
Q ss_pred HHHHHhhhhHHHHHHHHHHHHhhhhhHhhhhccCccc
Q 027616 149 RVRWYKRKLFEMYVQFMFVFMFFPGLLLWAPFLNFRK 185 (221)
Q Consensus 149 n~RWy~~~~~ER~~mY~LVF~FFPGllL~APFLNFR~ 185 (221)
-.|.+..+.+. .+.|-+.+++|+.+++.|=+...|.
T Consensus 337 ~~~~~~~~~~~-~~~~p~~~~~~~~~~~~s~~~~~~~ 372 (384)
T TIGR03469 337 TLRFYRLPPLW-ALALPLIALFYTLATLDSARRHWRG 372 (384)
T ss_pred HHHHhCCChHH-HHHHHHHHHHHHHHHHHHHHHHHcC
Confidence 35777777776 4678899999999888887765543
No 55
>PF04144 SCAMP: SCAMP family; InterPro: IPR007273 In vertebrates, secretory carrier membrane proteins (SCAMPs) 1-3 constitute a family of putative membrane-trafficking proteins composed of cytoplasmic N-terminal sequences with NPF repeats, four central transmembrane regions (TMRs), and a cytoplasmic tail. SCAMPs probably function in endocytosis by recruiting EH-domain proteins to the N-terminal NPF repeats but may have additional functions mediated by their other sequences [].; GO: 0015031 protein transport, 0016021 integral to membrane
Probab=25.22 E-value=1.9e+02 Score=24.10 Aligned_cols=40 Identities=25% Similarity=0.375 Sum_probs=22.2
Q ss_pred HHHHHHHHhHhHHHHHHHHHHHHhhhhHHHHHHHHHHHHh
Q 027616 131 IIAFWYFLIMPPIIMNWLRVRWYKRKLFEMYVQFMFVFMF 170 (221)
Q Consensus 131 ~La~~YLLVvP~~ly~wLn~RWy~~~~~ER~~mY~LVF~F 170 (221)
+++.+|+++..++.|+---++=|++-+=|+-+-|+.-|++
T Consensus 69 ~lai~y~~~~~P~sf~~wyrplY~A~r~dss~~f~~ff~~ 108 (177)
T PF04144_consen 69 GLAILYLLLGTPASFFCWYRPLYKAFRTDSSFRFMWFFFF 108 (177)
T ss_pred hHHHHHHHHHhHHHHHHHHHHHHHHHhcccchHHHHHHHH
Confidence 6677785555555444333456776666665554444433
No 56
>PRK14475 F0F1 ATP synthase subunit B; Provisional
Probab=25.05 E-value=1.9e+02 Score=23.45 Aligned_cols=32 Identities=9% Similarity=-0.138 Sum_probs=17.6
Q ss_pred hhHHHHHHHHHHHHH--HHHHhHhHHHHHHHHHH
Q 027616 120 IDLTVALIKVGIIAF--WYFLIMPPIIMNWLRVR 151 (221)
Q Consensus 120 ~D~l~Vll~Y~~La~--~YLLVvP~~ly~wLn~R 151 (221)
...+|.++.++++.+ +|+.+.+.-|...|..|
T Consensus 9 ~~~~w~~i~f~il~~iL~~~k~l~~pi~~~le~R 42 (167)
T PRK14475 9 NPEFWVGAGLLIFFGILIALKVLPKALAGALDAY 42 (167)
T ss_pred chHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHH
Confidence 444666666666554 33444455555666655
No 57
>PF13858 DUF4199: Protein of unknown function (DUF4199)
Probab=24.13 E-value=3.4e+02 Score=20.98 Aligned_cols=38 Identities=24% Similarity=0.515 Sum_probs=26.6
Q ss_pred hHhHHHHHHHHHHHHhhh------hHHHHHHHHHHHHhhhhhHh
Q 027616 139 IMPPIIMNWLRVRWYKRK------LFEMYVQFMFVFMFFPGLLL 176 (221)
Q Consensus 139 VvP~~ly~wLn~RWy~~~------~~ER~~mY~LVF~FFPGllL 176 (221)
.++.++..+.-.|.|+.+ ++-+.+.++++..++=|++.
T Consensus 38 ~~~~~~~i~~~i~~~R~~~~~g~isf~~a~~~g~~~~~ia~li~ 81 (163)
T PF13858_consen 38 MVITIIFIYFAIRRYRKKYNGGFISFGQAFKVGFLISLIAGLIS 81 (163)
T ss_pred HHHHHHHHHHHHHHHHHHccCCCeeHHHHHHHHHHHHHHHHHHH
Confidence 334444447777888854 78888888888888777654
No 58
>PRK01026 tetrahydromethanopterin S-methyltransferase subunit G; Provisional
Probab=23.59 E-value=1.6e+02 Score=22.99 Aligned_cols=21 Identities=14% Similarity=0.134 Sum_probs=15.3
Q ss_pred HHHHHHHHHHHHHHhHhHHHH
Q 027616 125 ALIKVGIIAFWYFLIMPPIIM 145 (221)
Q Consensus 125 Vll~Y~~La~~YLLVvP~~ly 145 (221)
|.++|+++.|+.++++=..+.
T Consensus 50 iGIlYG~viGlli~~i~~~~~ 70 (77)
T PRK01026 50 IGILYGLVIGLLIVLVYIILS 70 (77)
T ss_pred HHHHHHHHHHHHHHHHHHHHH
Confidence 678999999988866544433
No 59
>PF01529 zf-DHHC: DHHC palmitoyltransferase; InterPro: IPR001594 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents the DHHC-type zinc finger domain, which is also known as NEW1 []. The DHHC Zn-finger was first isolated in the Drosophila putative transcription factor DNZ1 and was named after a conserved sequence motif []. 
This domain has palmitoyltransferase activity; this post-translational modification attaches the C16 saturated fatty acid palmitate via a thioester linkage, predominantly to cysteine residues []. This domain is found in the DHHC proteins which are palmitoyl transferases []; the DHHC motif is found within a cysteine-rich domain which is thought to contain the catalytic site. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding
Probab=23.55 E-value=2.9e+02 Score=21.34 Aligned_cols=27 Identities=22% Similarity=0.413 Sum_probs=16.1
Q ss_pred eecccCCchhHHHHHHHHHHHHHHHHHhH
Q 027616 112 VTGENNHEIDLTVALIKVGIIAFWYFLIM 140 (221)
Q Consensus 112 vtg~~n~~~D~l~Vll~Y~~La~~YLLVv 140 (221)
-.|++|+ -...+.+.|+.++.++.+++
T Consensus 85 cIG~~N~--~~F~~fl~~~~~~~~~~~~~ 111 (174)
T PF01529_consen 85 CIGRRNH--RYFLLFLLYLCLYCLYFFIL 111 (174)
T ss_pred ccccccH--HHHHHHHHHHHHHHHHHHHH
Confidence 4677565 33555666666666666653
No 60
>PF15102 TMEM154: TMEM154 protein family
Probab=22.92 E-value=14 Score=31.33 Aligned_cols=48 Identities=15% Similarity=0.290 Sum_probs=23.3
Q ss_pred HhhcCCceeeecccCCchh-HHHHHHHHHHHHHHHHHhHhHHHHHHHHHHHH
Q 027616 103 ATLEQPALAVTGENNHEID-LTVALIKVGIIAFWYFLIMPPIIMNWLRVRWY 153 (221)
Q Consensus 103 A~~e~PAlAvtg~~n~~~D-~l~Vll~Y~~La~~YLLVvP~~ly~wLn~RWy 153 (221)
|.+.....-.|.++.+..+ .|.|++..++|+.+.|+||=++++. +||-
T Consensus 38 a~~~st~~~~t~~~~~q~efiLmIlIP~VLLvlLLl~vV~lv~~~---kRkr 86 (146)
T PF15102_consen 38 ANINSTETSLTEEDSSQLEFILMILIPLVLLVLLLLSVVCLVIYY---KRKR 86 (146)
T ss_pred cccCcccccccCCCCCCcceEEEEeHHHHHHHHHHHHHHHheeEE---eecc
Confidence 4444444323444334444 5666666655555555554333332 6663
No 61
>PRK14585 pgaD putative PGA biosynthesis protein; Provisional
Probab=22.75 E-value=4.1e+02 Score=22.60 Aligned_cols=55 Identities=13% Similarity=0.237 Sum_probs=33.9
Q ss_pred HHHHHHHHHHHhHhHHHHHHHHHHHHhhhhHHHHHHHHHHHHhhh-hhHhhhhccCcc
Q 027616 128 KVGIIAFWYFLIMPPIIMNWLRVRWYKRKLFEMYVQFMFVFMFFP-GLLLWAPFLNFR 184 (221)
Q Consensus 128 ~Y~~La~~YLLVvP~~ly~wLn~RWy~~~~~ER~~mY~LVF~FFP-GllL~APFLNFR 184 (221)
+.+=+||+||+|.=++-+. =.-+|-.- .+-|+..|.++-++.- -+|.||=-=-+|
T Consensus 23 ~laW~gf~~~~~~~l~~~l-~~p~~~~~-~l~tl~~Y~~iAv~nAvvLI~WA~YNq~R 78 (137)
T PRK14585 23 TILWTLFALFIFLFAMDLL-TGYYWQSE-ARSRLQFYFLLAVANAVVLIVWALYNKLR 78 (137)
T ss_pred HHHHHHHHHHHHHHHHHHh-cchHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3344677777776554332 22445222 7778899999988877 566777553333
No 62
>PRK12438 hypothetical protein; Provisional
Probab=22.64 E-value=2.9e+02 Score=29.80 Aligned_cols=51 Identities=24% Similarity=0.438 Sum_probs=36.4
Q ss_pred HHhHhHHHHHHHHHHHHhh---------hhHHHHHHHHHHHHhhhhhHhhhhccCcccCC
Q 027616 137 FLIMPPIIMNWLRVRWYKR---------KLFEMYVQFMFVFMFFPGLLLWAPFLNFRKLP 187 (221)
Q Consensus 137 LLVvP~~ly~wLn~RWy~~---------~~~ER~~mY~LVF~FFPGllL~APFLNFR~~p 187 (221)
+++++.++=+|.+.-||.. -.+=|+.+|.++|+++=+.+.++=++.+|-+|
T Consensus 30 ~~~~~~~~~~~td~lWf~~lgy~~Vf~t~l~tr~~Lf~~~~~~~~~~v~~~~~la~r~rp 89 (991)
T PRK12438 30 LLFGPRLVDIYTDWLWFGEVGFRSVWITVLLTRLALFAAVALVVGGIVLAALLLAYRSRP 89 (991)
T ss_pred HHHHHHHHHHHHHHHHHHhCCCceehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCc
Confidence 3345677888999999864 34557788888887777777766777777555
No 63
>TIGR03426 shape_MreD rod shape-determining protein MreD. Members of this protein family are the MreD protein of bacterial cell shape determination. Most rod-shaped bacteria depend on MreB and RodA to achieve either a rod shape or some other non-spherical morphology such as coil or stalk formation. MreD is encoded in an operon with MreB, and often with RodA and PBP-2 as well. It is highly hydrophobic (therefore somewhat low-complexity) and highly divergent, and therefore sometimes tricky to discover by homology, but this model finds most examples.
Probab=22.60 E-value=1.8e+02 Score=22.45 Aligned_cols=41 Identities=7% Similarity=0.008 Sum_probs=30.5
Q ss_pred HHHHHHHHhHhHHHHHH--HHHHHHhhhhHHHHHHHHHHHHhh
Q 027616 131 IIAFWYFLIMPPIIMNW--LRVRWYKRKLFEMYVQFMFVFMFF 171 (221)
Q Consensus 131 ~La~~YLLVvP~~ly~w--Ln~RWy~~~~~ER~~mY~LVF~FF 171 (221)
...|.|.+..+++.|.- +.+|++..+.+.+....++..+.+
T Consensus 67 ~~lG~~al~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 109 (154)
T TIGR03426 67 SPLGVHALALSLVAYLAASKFQRFRQFSLWQQALIIFLLLILL 109 (154)
T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHcccHHHHHHHHHHHHHHH
Confidence 34578888888887775 678999999999987666544444
No 64
>PRK14740 kdbF potassium-transporting ATPase subunit F; Provisional
Probab=22.48 E-value=1.3e+02 Score=19.90 Aligned_cols=18 Identities=17% Similarity=0.244 Sum_probs=13.1
Q ss_pred HHHHHHHHHHHHHHHHhH
Q 027616 123 TVALIKVGIIAFWYFLIM 140 (221)
Q Consensus 123 l~Vll~Y~~La~~YLLVv 140 (221)
.++.++.++..+.||++.
T Consensus 5 ~wls~a~a~~Lf~YLv~A 22 (29)
T PRK14740 5 DWLSLALATGLFVYLLVA 22 (29)
T ss_pred HHHHHHHHHHHHHHHHHH
Confidence 456677778888888764
No 65
>KOG0812 consensus SNARE protein SED5/Syntaxin 5 [Intracellular trafficking, secretion, and vesicular transport]
Probab=22.09 E-value=82 Score=29.86 Aligned_cols=23 Identities=35% Similarity=0.883 Sum_probs=17.9
Q ss_pred HHHHHhhhhHHHHHHHHHHHHhh
Q 027616 149 RVRWYKRKLFEMYVQFMFVFMFF 171 (221)
Q Consensus 149 n~RWy~~~~~ER~~mY~LVF~FF 171 (221)
.+||.-+..|==+++||+||.+|
T Consensus 287 SNRwLmvkiF~i~ivFflvfvlf 309 (311)
T KOG0812|consen 287 SNRWLMVKIFGILIVFFLVFVLF 309 (311)
T ss_pred cchHHHHHHHHHHHHHHHHHHHh
Confidence 47999888877777777777776
No 66
>PF15071 TMEM220: Transmembrane family 220, helix
Probab=21.41 E-value=1.6e+02 Score=22.82 Aligned_cols=36 Identities=19% Similarity=0.171 Sum_probs=27.1
Q ss_pred ccCCchhHHHHHHHHHHHHHHHHHhHhHHHHHHHHH
Q 027616 115 ENNHEIDLTVALIKVGIIAFWYFLIMPPIIMNWLRV 150 (221)
Q Consensus 115 ~~n~~~D~l~Vll~Y~~La~~YLLVvP~~ly~wLn~ 150 (221)
++.||.|...=...|++-+++-.++.|.....++++
T Consensus 8 vQ~NDPD~~lWv~iY~i~a~~~~~~~~~~~~~~~~~ 43 (104)
T PF15071_consen 8 VQINDPDPELWVPIYGIAAVLCVLANFGVTPNWIWK 43 (104)
T ss_pred eecCCCCHHHHHHHHHHHHHHHHHHhhccchHHHHH
Confidence 456777888888899998888777777777666543
No 67
>PRK06568 F0F1 ATP synthase subunit B; Validated
Probab=21.06 E-value=2.1e+02 Score=23.97 Aligned_cols=33 Identities=12% Similarity=0.083 Sum_probs=24.7
Q ss_pred hhHHHHHHHHHHHHHHHHHhHhHHHHHHHHHHH
Q 027616 120 IDLTVALIKVGIIAFWYFLIMPPIIMNWLRVRW 152 (221)
Q Consensus 120 ~D~l~Vll~Y~~La~~YLLVvP~~ly~wLn~RW 152 (221)
.+++|+++.++++.++.--..+.-|...|..|=
T Consensus 5 ~~~fwq~I~FlIll~ll~kfawkPI~~~LeeR~ 37 (154)
T PRK06568 5 DESFWLAVSFVIFVYLIYRPAKKAILNSLDAKI 37 (154)
T ss_pred HhHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHH
Confidence 456778888888777766667777888888773
No 68
>PF05884 ZYG-11_interact: Interactor of ZYG-11; InterPro: IPR008574 This family consists of proteins of unknown function found in Caenorhabditis species.
Probab=20.66 E-value=1.8e+02 Score=27.37 Aligned_cols=48 Identities=23% Similarity=0.463 Sum_probs=30.4
Q ss_pred HHHHHHHHHHhHhHHHHHHHHHHHHhhhhHHHHHHHHHHHHhhhhhHhhhhc
Q 027616 129 VGIIAFWYFLIMPPIIMNWLRVRWYKRKLFEMYVQFMFVFMFFPGLLLWAPF 180 (221)
Q Consensus 129 Y~~La~~YLLVvP~~ly~wLn~RWy~~~~~ER~~mY~LVF~FFPGllL~APF 180 (221)
+++..+. .+|+|++.|+++|.| .++.-|+-++ -+++-+|=|++.=+=+
T Consensus 138 ~gAaila-~iviP~~~~y~ln~~--~~s~~~~R~~-ll~~a~~QGvL~Ga~l 185 (299)
T PF05884_consen 138 FGAAILA-YIVIPLIAYYYLNKE--DGSLAESRLA-LLFFALFQGVLVGAGL 185 (299)
T ss_pred hhHHHHH-HHHHHHHHHhhcccc--cCchHHHHHH-HHHHHHHHHHHHHHHh
Confidence 3344444 468999999999996 5555554332 3456677787765443
No 69
>KOG1311 consensus DHHC-type Zn-finger proteins [General function prediction only]
Probab=20.31 E-value=4.7e+02 Score=22.70 Aligned_cols=68 Identities=15% Similarity=0.068 Sum_probs=34.9
Q ss_pred eecccCCchhHHHHHHHHHHHHHHHHHhHhHHHHHHH----H------HHHHhhhhHHHHHHHHHHHHhhhh-hHhhhhc
Q 027616 112 VTGENNHEIDLTVALIKVGIIAFWYFLIMPPIIMNWL----R------VRWYKRKLFEMYVQFMFVFMFFPG-LLLWAPF 180 (221)
Q Consensus 112 vtg~~n~~~D~l~Vll~Y~~La~~YLLVvP~~ly~wL----n------~RWy~~~~~ER~~mY~LVF~FFPG-llL~APF 180 (221)
-.|++|..-=. ..+.|.+++.+|.+++=.+.+.++ . .-|.......-++++.+++++++| |+.+-..
T Consensus 150 CVG~rNyr~F~--~f~~~~~l~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~l~~fh~~ 227 (299)
T KOG1311|consen 150 CIGERNYRYFV--LFLFYLALGVLLALAFLFYELLQRADNLKVNLTPVLIPAGTFLSALLGLLSALFLAFTSALLCFHIY 227 (299)
T ss_pred eECCCchHHHH--HHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccchhHHHHHHHHHHHHHHHHHHHHHHHhhee
Confidence 36775544444 333377777777665543332220 1 222233333445567788888888 3333333
Q ss_pred c
Q 027616 181 L 181 (221)
Q Consensus 181 L 181 (221)
+
T Consensus 228 l 228 (299)
T KOG1311|consen 228 L 228 (299)
T ss_pred e
Confidence 3
No 70
>PF11833 DUF3353: Protein of unknown function (DUF3353); InterPro: IPR021788 This family of proteins is functionally uncharacterised. This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 205 and 258 amino acids in length.
Probab=20.12 E-value=5.6e+02 Score=22.06 Aligned_cols=22 Identities=5% Similarity=0.076 Sum_probs=16.9
Q ss_pred HHHHHHHhhhhHHHHHHHHHHHHhh
Q 027616 147 WLRVRWYKRKLFEMYVQFMFVFMFF 171 (221)
Q Consensus 147 wLn~RWy~~~~~ER~~mY~LVF~FF 171 (221)
.+|+| ..++=|.+.+.++.+.+
T Consensus 131 fl~~K---~~~~~rA~~~~~~~L~~ 152 (194)
T PF11833_consen 131 FLNRK---ERKLGRAFLWTLGGLVV 152 (194)
T ss_pred HHHHh---cchHHHHHHHHHHHHHH
Confidence 45665 77888999999888754
Done!