Query psy6892
Match_columns 190
No_of_seqs 310 out of 1606
Neff 7.9
Searched_HMMs 46136
Date Fri Aug 16 21:08:16 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy6892.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/6892hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG4219|consensus 99.6 2.4E-16 5.3E-21 134.6 5.8 97 8-104 29-128 (423)
2 PHA03234 DNA packaging protein 99.4 2.2E-13 4.9E-18 116.5 2.9 89 13-103 31-124 (338)
3 KOG4220|consensus 99.4 6.4E-14 1.4E-18 120.2 -1.1 76 15-90 31-106 (503)
4 PHA02834 chemokine receptor-li 99.2 6.5E-12 1.4E-16 106.6 3.2 78 13-93 27-104 (323)
5 PHA02638 CC chemokine receptor 99.1 1.9E-11 4.1E-16 107.4 2.5 80 13-95 97-176 (417)
6 PHA03087 G protein-coupled che 99.1 1.3E-10 2.8E-15 98.5 4.4 80 12-93 38-117 (335)
7 PHA03235 DNA packaging protein 99.0 9.9E-11 2.2E-15 102.6 2.9 89 13-103 31-124 (409)
8 PF00001 7tm_1: 7 transmembran 98.7 1.3E-09 2.8E-14 86.4 -1.8 64 31-94 1-64 (257)
9 KOG2087|consensus 98.4 8.1E-08 1.7E-12 81.7 1.4 85 16-100 26-117 (363)
10 PF10320 7TM_GPCR_Srsx: Serpen 98.3 5.3E-08 1.2E-12 80.3 -1.9 50 26-75 2-51 (257)
11 PF05296 TAS2R: Mammalian tast 96.9 0.0006 1.3E-08 57.6 2.2 62 12-73 4-68 (303)
12 PF05462 Dicty_CAR: Slime mold 96.6 0.0014 3.1E-08 55.5 2.6 55 14-68 6-60 (303)
13 PF10328 7TM_GPCR_Srx: Serpent 96.1 0.001 2.3E-08 55.0 -0.8 45 24-68 3-47 (274)
14 PF11710 Git3: G protein-coupl 96.1 0.0019 4.1E-08 51.5 0.4 52 43-94 30-81 (201)
15 PF10324 7TM_GPCR_Srw: Serpent 95.8 0.0021 4.6E-08 54.1 -0.6 52 23-75 5-57 (318)
16 PF04931 DNA_pol_phi: DNA poly 95.7 0.0016 3.4E-08 61.8 -1.5 17 57-73 589-605 (784)
17 PF10317 7TM_GPCR_Srd: Serpent 95.3 0.0066 1.4E-07 50.8 1.0 49 19-67 3-52 (292)
18 PF10321 7TM_GPCR_Srt: Serpent 95.0 0.018 3.8E-07 49.1 2.6 56 11-66 29-84 (313)
19 KOG3064|consensus 93.0 0.024 5.2E-07 46.5 -0.4 14 26-39 124-137 (303)
20 PF00002 7tm_2: 7 transmembran 92.9 0.061 1.3E-06 43.3 1.9 49 19-67 5-53 (242)
21 PF03839 Sec62: Translocation 92.4 0.087 1.9E-06 42.7 2.1 15 50-64 141-155 (224)
22 PF02724 CDC45: CDC45-like pro 92.1 0.056 1.2E-06 50.2 0.8 13 51-63 55-68 (622)
23 KOG1189|consensus 92.0 0.04 8.8E-07 51.5 -0.3 12 157-168 928-939 (960)
24 PF10327 7TM_GPCR_Sri: Serpent 90.5 0.61 1.3E-05 39.4 5.5 65 14-78 8-76 (303)
25 PF03402 V1R: Vomeronasal orga 87.6 0.55 1.2E-05 39.1 3.1 45 46-92 8-53 (265)
26 KOG2927|consensus 84.4 0.27 5.8E-06 42.3 -0.3 6 79-84 256-261 (372)
27 PF09882 DUF2109: Predicted me 84.2 0.26 5.7E-06 33.0 -0.4 48 26-73 5-52 (78)
28 PF10856 DUF2678: Protein of u 83.6 0.38 8.3E-06 34.9 0.3 23 82-104 87-109 (118)
29 PF10326 7TM_GPCR_Str: Serpent 82.3 0.43 9.3E-06 39.9 0.1 47 20-66 4-51 (307)
30 COG5406 Nucleosome binding fac 79.1 0.64 1.4E-05 43.1 0.1 13 97-109 919-931 (1001)
31 PF09726 Macoilin: Transmembra 76.9 1.2 2.7E-05 42.0 1.3 22 49-71 76-97 (697)
32 KOG1832|consensus 75.8 0.76 1.6E-05 44.3 -0.4 14 50-63 1314-1327(1516)
33 PF08229 SHR3_chaperone: ER me 73.3 5.7 0.00012 31.6 4.0 22 15-36 59-80 (196)
34 PF03344 Daxx: Daxx Family; I 72.5 1.2 2.6E-05 42.2 0.0 7 50-56 346-352 (713)
35 PF10319 7TM_GPCR_Srj: Serpent 70.7 10 0.00022 32.3 5.2 53 20-72 10-65 (310)
36 PF02101 Ocular_alb: Ocular al 70.5 6.2 0.00013 34.6 3.9 57 20-76 32-94 (405)
37 PHA03055 Hypothetical protein; 69.9 7.9 0.00017 26.1 3.5 57 20-76 13-69 (79)
38 PF10292 7TM_GPCR_Srab: Serpen 66.2 5 0.00011 34.1 2.5 53 14-66 16-68 (324)
39 KOG4193|consensus 66.0 2 4.4E-05 39.9 0.1 37 23-59 328-364 (610)
40 KOG2082|consensus 64.9 5 0.00011 38.3 2.4 30 51-82 581-610 (1075)
41 PHA03062 putative IMV membrane 64.7 10 0.00022 25.6 3.2 56 20-75 14-69 (78)
42 PF11446 DUF2897: Protein of u 63.8 2.9 6.4E-05 26.3 0.5 12 27-38 14-25 (55)
43 PF04713 Pox_I5: Poxvirus prot 63.3 12 0.00026 25.2 3.4 54 22-75 15-68 (77)
44 KOG3114|consensus 63.2 5.2 0.00011 33.7 2.0 31 8-38 110-140 (290)
45 KOG0205|consensus 62.0 18 0.00038 34.3 5.3 48 20-67 758-806 (942)
46 PF02532 PsbI: Photosystem II 60.5 11 0.00023 21.5 2.3 20 15-34 9-28 (36)
47 PF10316 7TM_GPCR_Srbc: Serpen 57.2 3.4 7.3E-05 34.6 -0.1 57 16-72 7-63 (273)
48 PF01102 Glycophorin_A: Glycop 56.8 2.6 5.6E-05 31.0 -0.7 8 36-43 85-92 (122)
49 CHL00024 psbI photosystem II p 56.6 9.7 0.00021 21.6 1.7 20 15-34 9-28 (36)
50 PRK02655 psbI photosystem II r 54.8 10 0.00023 21.7 1.7 20 15-34 9-28 (38)
51 PF10323 7TM_GPCR_Srv: Serpent 53.8 2.4 5.1E-05 35.4 -1.6 23 45-67 29-51 (283)
52 KOG1832|consensus 53.5 2.3 5E-05 41.2 -1.8 9 99-107 1394-1402(1516)
53 PF10192 GpcrRhopsn4: Rhodopsi 53.2 18 0.00039 29.7 3.6 45 15-59 29-73 (257)
54 PRK03557 zinc transporter ZitB 52.6 9.2 0.0002 32.4 1.8 21 24-44 127-147 (312)
55 PF11861 DUF3381: Domain of un 50.8 24 0.00051 27.1 3.7 6 58-63 69-74 (159)
56 PF05297 Herpes_LMP1: Herpesvi 50.6 5.2 0.00011 33.8 0.0 7 40-46 71-77 (381)
57 PF14851 FAM176: FAM176 family 50.2 5.8 0.00013 30.3 0.2 34 29-63 4-37 (153)
58 PF10318 7TM_GPCR_Srh: Serpent 50.0 17 0.00037 30.1 3.0 60 17-76 6-70 (302)
59 KOG1482|consensus 49.6 6.4 0.00014 34.3 0.4 39 57-95 233-271 (379)
60 PF10329 DUF2417: Region of un 49.6 11 0.00024 30.8 1.8 14 50-63 105-118 (232)
61 KOG4349|consensus 48.2 8.8 0.00019 28.2 0.9 19 28-46 61-79 (143)
62 KOG2592|consensus 48.2 18 0.0004 31.8 3.0 30 10-39 177-206 (426)
63 COG2181 NarI Nitrate reductase 46.9 18 0.00039 29.5 2.5 30 43-72 118-147 (228)
64 COG5406 Nucleosome binding fac 46.7 4.2 9.2E-05 37.9 -1.2 9 157-165 978-986 (1001)
65 PF14940 TMEM219: Transmembran 46.5 12 0.00025 30.5 1.4 45 17-63 11-60 (223)
66 PHA02702 ORF033 IMV membrane p 46.2 15 0.00033 24.5 1.7 26 51-76 41-66 (78)
67 COG1230 CzcD Co/Zn/Cd efflux s 45.9 12 0.00025 31.8 1.4 57 20-78 126-183 (296)
68 PRK13183 psbN photosystem II r 44.9 29 0.00063 20.9 2.6 32 17-48 9-40 (46)
69 PHA03099 epidermal growth fact 44.0 25 0.00053 26.2 2.6 32 14-45 100-131 (139)
70 TIGR00869 sec62 protein transl 43.4 16 0.00036 29.8 1.8 6 80-85 184-189 (232)
71 CHL00020 psbN photosystem II p 43.3 30 0.00064 20.6 2.4 32 17-48 6-37 (43)
72 PF10856 DUF2678: Protein of u 43.2 33 0.00071 25.0 3.1 9 50-58 62-70 (118)
73 PF02009 Rifin_STEVOR: Rifin/s 42.4 7.2 0.00016 33.1 -0.3 21 24-44 263-283 (299)
74 PF15330 SIT: SHP2-interacting 42.1 12 0.00026 26.9 0.7 30 16-45 2-31 (107)
75 KOG4564|consensus 42.0 18 0.0004 32.6 2.1 51 20-70 150-200 (473)
76 PF13074 DUF3938: Protein of u 41.6 66 0.0014 21.6 4.2 17 92-108 78-94 (102)
77 PF03348 Serinc: Serine incorp 41.1 46 0.00099 29.7 4.4 55 12-69 179-235 (429)
78 KOG0476|consensus 40.3 18 0.0004 34.6 1.9 31 53-83 372-402 (931)
79 PF12606 RELT: Tumour necrosis 40.0 7.7 0.00017 23.9 -0.4 9 23-31 9-17 (50)
80 PF06398 Pex24p: Integral pero 38.1 39 0.00085 29.0 3.5 52 17-69 15-66 (359)
81 PF05745 CRPA: Chlamydia 15 kD 38.0 47 0.001 24.7 3.3 10 57-66 104-113 (150)
82 COG4420 Predicted membrane pro 38.0 67 0.0015 25.4 4.4 38 17-59 60-97 (191)
83 PF05915 DUF872: Eukaryotic pr 37.4 72 0.0016 23.1 4.3 14 19-32 48-61 (115)
84 PF09802 Sec66: Preprotein tra 36.7 5.6 0.00012 31.5 -1.8 45 23-68 10-57 (190)
85 PF14494 DUF4436: Domain of un 35.8 25 0.00054 29.2 1.8 29 20-48 173-201 (256)
86 PF02468 PsbN: Photosystem II 35.6 37 0.00079 20.2 2.0 31 18-48 7-37 (43)
87 PF05934 MCLC: Mid-1-related c 35.2 15 0.00032 33.3 0.5 27 14-40 181-207 (549)
88 PF09788 Tmemb_55A: Transmembr 35.0 55 0.0012 27.1 3.6 12 16-27 199-210 (256)
89 PF03006 HlyIII: Haemolysin-II 34.9 77 0.0017 24.7 4.5 42 13-54 106-147 (222)
90 PF07787 DUF1625: Protein of u 34.8 72 0.0016 26.0 4.4 10 50-59 222-231 (248)
91 PF06210 DUF1003: Protein of u 34.5 1.2E+02 0.0026 21.7 5.0 37 18-59 7-43 (108)
92 COG0341 SecF Preprotein transl 34.1 17 0.00036 31.0 0.6 46 21-66 191-239 (305)
93 PF14068 YuiB: Putative membra 33.9 54 0.0012 23.2 3.0 19 50-68 63-81 (102)
94 TIGR01477 RIFIN variant surfac 33.4 15 0.00032 31.9 0.2 29 18-46 311-339 (353)
95 PF04789 DUF621: Protein of un 33.1 41 0.0009 28.4 2.7 36 33-68 33-68 (305)
96 PF03729 DUF308: Short repeat 33.1 82 0.0018 19.6 3.7 21 46-66 49-69 (72)
97 PF13903 Claudin_2: PMP-22/EMP 32.9 55 0.0012 24.1 3.3 23 22-44 76-98 (172)
98 PTZ00046 rifin; Provisional 32.9 16 0.00034 31.8 0.2 28 19-46 317-344 (358)
99 KOG1056|consensus 32.4 49 0.0011 32.3 3.4 43 17-59 561-610 (878)
100 PF14654 Epiglycanin_C: Mucin, 32.1 35 0.00076 24.1 1.8 23 12-34 17-39 (106)
101 PF11683 DUF3278: Protein of u 31.8 31 0.00067 25.3 1.6 26 19-44 35-60 (129)
102 PF09323 DUF1980: Domain of un 31.7 73 0.0016 24.6 3.8 8 22-29 40-47 (182)
103 PF13127 DUF3955: Protein of u 31.7 1.3E+02 0.0028 19.3 4.3 25 39-65 28-52 (63)
104 KOG4404|consensus 31.6 21 0.00045 30.7 0.7 17 47-63 156-172 (350)
105 PRK13664 hypothetical protein; 31.2 38 0.00082 21.5 1.7 16 19-34 9-24 (62)
106 KOG2443|consensus 31.2 9.1 0.0002 33.0 -1.5 27 58-84 152-178 (362)
107 PF15145 DUF4577: Domain of un 30.8 30 0.00064 25.1 1.3 27 14-40 60-86 (128)
108 KOG3850|consensus 30.4 50 0.0011 29.1 2.8 37 18-57 398-434 (455)
109 PF07226 DUF1422: Protein of u 30.3 10 0.00022 27.6 -1.2 27 42-68 53-79 (117)
110 KOG1281|consensus 30.2 1E+02 0.0022 28.7 4.9 29 11-39 469-497 (586)
111 PHA02974 putative IMV membrane 30.2 72 0.0016 21.6 3.0 56 20-75 13-69 (81)
112 KOG2442|consensus 29.9 11 0.00023 34.1 -1.3 29 51-79 306-334 (541)
113 PF11044 TMEMspv1-c74-12: Plec 29.8 62 0.0013 19.4 2.3 16 16-31 7-22 (49)
114 PF12191 stn_TNFRSF12A: Tumour 28.9 19 0.00041 26.6 0.0 14 18-31 83-96 (129)
115 PF02118 Srg: Srg family chemo 28.8 22 0.00047 28.9 0.4 45 22-66 5-52 (275)
116 PRK06758 hypothetical protein; 28.6 1.3E+02 0.0028 21.1 4.2 17 92-108 102-118 (128)
117 cd07912 Tweety_N N-terminal do 28.2 1E+02 0.0022 27.6 4.4 15 49-63 79-93 (418)
118 PF14184 YrvL: Regulatory prot 28.2 61 0.0013 24.0 2.7 20 16-35 41-60 (132)
119 PRK11056 hypothetical protein; 28.0 15 0.00033 26.8 -0.6 29 41-69 52-80 (120)
120 PF10873 DUF2668: Protein of u 27.8 8.8 0.00019 29.0 -1.9 25 16-40 63-87 (155)
121 KOG0473|consensus 27.6 60 0.0013 27.2 2.7 35 21-55 265-300 (326)
122 COG2116 FocA Formate/nitrite f 27.3 38 0.00082 28.3 1.6 43 21-63 158-201 (265)
123 PTZ00415 transmission-blocking 27.3 14 0.00031 38.6 -1.1 6 29-34 30-35 (2849)
124 cd08763 Cyt_b561_CYB561 Verteb 27.3 84 0.0018 23.6 3.3 45 20-64 12-58 (143)
125 COG5130 YIP3 Prenylated rab ac 27.1 33 0.00072 26.0 1.1 23 48-70 113-135 (169)
126 KOG0065|consensus 26.4 51 0.0011 33.7 2.5 35 32-66 1250-1284(1391)
127 PRK11562 nitrite transporter N 26.0 36 0.00078 28.4 1.2 34 21-54 150-183 (268)
128 PF15109 TMEM125: TMEM125 prot 25.9 1.8E+02 0.0038 20.6 4.4 46 16-61 21-66 (112)
129 PHA03164 hypothetical protein; 25.8 52 0.0011 22.1 1.7 22 46-67 56-77 (88)
130 KOG2632|consensus 25.6 60 0.0013 27.0 2.4 58 14-71 18-79 (258)
131 PF06024 DUF912: Nucleopolyhed 25.5 73 0.0016 22.3 2.6 11 34-44 81-91 (101)
132 COG4970 FimT Tfp pilus assembl 24.8 76 0.0017 24.9 2.8 44 39-82 36-79 (181)
133 PF12304 BCLP: Beta-casein lik 24.7 1E+02 0.0022 24.4 3.4 44 20-66 46-90 (188)
134 KOG3832|consensus 24.7 2.5E+02 0.0053 23.0 5.6 62 35-100 225-286 (319)
135 PF10267 Tmemb_cc2: Predicted 24.3 50 0.0011 29.2 1.8 15 43-57 367-381 (395)
136 COG1272 Predicted membrane pro 24.2 1.6E+02 0.0034 24.0 4.5 42 19-60 119-160 (226)
137 COG5456 Predicted integral mem 24.1 50 0.0011 25.2 1.6 34 29-62 30-63 (166)
138 KOG4536|consensus 24.1 1.1E+02 0.0024 25.9 3.6 43 23-65 200-244 (347)
139 COG3125 CyoD Heme/copper-type 23.9 79 0.0017 22.8 2.5 20 46-65 45-64 (111)
140 PF01545 Cation_efflux: Cation 23.8 20 0.00043 29.3 -0.7 10 56-65 143-152 (284)
141 PF13120 DUF3974: Domain of un 23.6 53 0.0011 23.1 1.5 17 23-39 10-26 (126)
142 PF09801 SYS1: Integral membra 23.5 1.3E+02 0.0028 22.7 3.7 10 50-59 93-102 (144)
143 KOG2513|consensus 23.4 1.4E+02 0.0031 28.0 4.6 52 15-66 532-586 (647)
144 KOG2302|consensus 23.4 76 0.0016 31.9 2.9 24 43-66 1191-1214(1956)
145 PF06706 CTV_P6: Citrus triste 23.2 1.5E+02 0.0032 17.7 3.1 15 28-42 20-34 (51)
146 PF09971 DUF2206: Predicted me 23.1 88 0.0019 27.3 3.1 58 14-72 114-171 (367)
147 PF03839 Sec62: Translocation 22.9 48 0.001 27.0 1.3 7 79-85 177-183 (224)
148 PF10726 DUF2518: Protein of f 22.8 67 0.0014 24.3 2.0 9 51-59 44-52 (145)
149 PF05751 FixH: FixH; InterPro 22.5 75 0.0016 23.2 2.3 40 20-59 7-46 (146)
150 PF10190 Tmemb_170: Putative t 21.9 50 0.0011 23.6 1.1 11 79-89 74-84 (105)
151 cd08762 Cyt_b561_CYBASC3 Verte 21.8 1.2E+02 0.0025 23.9 3.2 47 19-65 39-89 (179)
152 KOG3058|consensus 21.8 43 0.00094 29.0 0.9 54 20-73 70-132 (351)
153 PF03073 TspO_MBR: TspO/MBR fa 21.7 1.9E+02 0.0041 21.4 4.4 20 42-61 66-85 (148)
154 COG4758 Predicted membrane pro 21.6 50 0.0011 27.0 1.2 17 17-33 10-26 (235)
155 COG5521 Predicted integral mem 21.5 2.3E+02 0.0049 23.6 4.9 14 50-63 221-234 (275)
156 COG2456 Uncharacterized conser 21.4 56 0.0012 23.6 1.3 40 31-70 19-58 (121)
157 cd08762 Cyt_b561_CYBASC3 Verte 21.3 1.8E+02 0.004 22.8 4.3 20 46-65 108-127 (179)
158 TIGR00815 sulP high affinity s 21.2 87 0.0019 28.8 2.8 54 17-70 261-315 (563)
159 PF14798 Ca_hom_mod: Calcium h 21.0 3.4E+02 0.0075 22.4 6.0 12 21-32 20-31 (251)
160 KOG2450|consensus 20.9 1.3E+02 0.0028 27.5 3.7 42 29-80 184-225 (501)
161 KOG3145|consensus 20.9 2.2E+02 0.0047 24.5 4.8 26 79-104 294-319 (372)
162 PHA02910 hypothetical protein; 20.8 43 0.00092 24.5 0.6 23 14-36 23-45 (171)
163 PRK05771 V-type ATP synthase s 20.7 1.4E+02 0.0031 27.8 4.2 43 24-66 359-409 (646)
164 PF12597 DUF3767: Protein of u 20.7 1.1E+02 0.0023 22.2 2.7 16 44-59 64-79 (118)
165 TIGR01297 CDF cation diffusion 20.6 30 0.00065 28.1 -0.3 42 25-66 100-142 (268)
166 KOG1734|consensus 20.4 1E+02 0.0022 26.0 2.8 34 6-39 129-162 (328)
167 KOG4452|consensus 20.3 2.4E+02 0.0052 18.5 3.9 23 37-59 39-61 (79)
168 TIGR00834 ae anion exchange pr 20.1 1.7E+02 0.0036 28.9 4.5 42 31-72 672-715 (900)
No 1
>KOG4219|consensus
Probab=99.64 E-value=2.4e-16 Score=134.59 Aligned_cols=97 Identities=26% Similarity=0.349 Sum_probs=84.0
Q ss_pred CCCchhhHHHHHHHHHHHHhhhccchheeeeeeccCCCCchhHHHHHHHHHHHHHHhhhhcccceeeecccccccccccc
Q psy6892 8 GLDAASWNIWAVVTLILVFCTATGNILVCLAISWEKRLQNVTNYFLMSLAITDLMVAVLVMPLGILTLVRVRTNLSGAVL 87 (190)
Q Consensus 8 ~~~~~~~~~~~v~~~ii~vlgi~gN~lvi~vi~~~~~L~~~~~~fl~nLAvaDLl~~l~~lP~~l~~~l~~~~~~s~~lC 87 (190)
..+...+.+++++|+++.+++++||++|+|++..+|++|+.+|+||+|||+||+++++++.|+.....+...|++|.++|
T Consensus 29 ~lp~~~~~~wai~yg~l~~vAv~GN~iVlwIil~hrrMRtvtnyfL~NLAfADl~~s~Fn~~f~f~yal~~~W~~G~f~C 108 (423)
T KOG4219|consen 29 VLPAWQQALWAIAYGLLVFVAVVGNLIVLWIILAHRRMRTVTNYFLVNLAFADLSMSIFNTVFNFQYALHQEWYFGSFYC 108 (423)
T ss_pred cCCHHHHHHHHHHHHHHHHHHHhcCceEEEEEeehhehhhhHHHHHHHHHHHHHHHHHHhhHHHHHHHHHhcccccccee
Confidence 33555677888999999999999999999999999999999999999999999999999999999988889999999999
Q ss_pred eeeccchh---hhhhhhhcc
Q psy6892 88 RLKKNILC---ENQLFKAGA 104 (190)
Q Consensus 88 ~i~~~i~c---~~slfSl~s 104 (190)
++..++.- .++.|++.+
T Consensus 109 ~f~nf~~itav~vSVfTlvA 128 (423)
T KOG4219|consen 109 RFVNFFPITAVFVSVFTLVA 128 (423)
T ss_pred eeccccchhhhhHhHHHHHH
Confidence 98765554 445555443
No 2
>PHA03234 DNA packaging protein UL33; Provisional
Probab=99.37 E-value=2.2e-13 Score=116.47 Aligned_cols=89 Identities=15% Similarity=0.154 Sum_probs=67.3
Q ss_pred hhHHHHHHHHHHHHhhhccchheeeee--eccCCCCchhHHHHHHHHHHHHHHhhhhcccceeeecccccccccccceee
Q psy6892 13 SWNIWAVVTLILVFCTATGNILVCLAI--SWEKRLQNVTNYFLMSLAITDLMVAVLVMPLGILTLVRVRTNLSGAVLRLK 90 (190)
Q Consensus 13 ~~~~~~v~~~ii~vlgi~gN~lvi~vi--~~~~~L~~~~~~fl~nLAvaDLl~~l~~lP~~l~~~l~~~~~~s~~lC~i~ 90 (190)
.+.++.++|.+++++|++||++|++++ ++++++|+++|+|++|||+||++++++ +|+.+.. ....|++|..+|++.
T Consensus 31 ~~~~~~~~y~~vf~~gl~gN~lvl~v~~~~~~~~~rt~tn~fi~NLAvaDLL~~l~-lp~~~~~-~~~~w~fG~~lCk~~ 108 (338)
T PHA03234 31 AQILESAINGIMLTLIIPMIIIVICTLIIYHKVAKHNATSFYLITLFASDFLHMLC-VFFLTLN-REALFNFNQAFCQCV 108 (338)
T ss_pred HHHHhhHHHHHHHHHHhhhHHHHHHHHHHHhccccccHHHHHHHHHHHHHHHHHHH-HHHHHHH-HhCCccCchhHHHHH
Confidence 456777899999999999999999854 456677999999999999999999765 4544433 245699999999986
Q ss_pred ccch---hhhhhhhhc
Q psy6892 91 KNIL---CENQLFKAG 103 (190)
Q Consensus 91 ~~i~---c~~slfSl~ 103 (190)
+++. +.+++++++
T Consensus 109 ~~~~~~~~~~Si~~L~ 124 (338)
T PHA03234 109 LFIYHASCSYSICMLA 124 (338)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 5333 344444443
No 3
>KOG4220|consensus
Probab=99.36 E-value=6.4e-14 Score=120.18 Aligned_cols=76 Identities=34% Similarity=0.382 Sum_probs=71.5
Q ss_pred HHHHHHHHHHHHhhhccchheeeeeeccCCCCchhHHHHHHHHHHHHHHhhhhcccceeeecccccccccccceee
Q psy6892 15 NIWAVVTLILVFCTATGNILVCLAISWEKRLQNVTNYFLMSLAITDLMVAVLVMPLGILTLVRVRTNLSGAVLRLK 90 (190)
Q Consensus 15 ~~~~v~~~ii~vlgi~gN~lvi~vi~~~~~L~~~~~~fl~nLAvaDLl~~l~~lP~~l~~~l~~~~~~s~~lC~i~ 90 (190)
..++++.+++.+++++||++|++.+..+++||+..||||++||+|||++|.+.+|++.++++.+.|++|..+|.+.
T Consensus 31 v~i~~v~~~lsLVTv~GNlLVmiSfKvnrqLqTVnNYfLfSLAcADliIG~~SMnl~t~Y~lmg~W~LG~~~CdlW 106 (503)
T KOG4220|consen 31 VFIVVVTGSLSLVTVVGNLLVMISFKVNRQLQTVNNYFLFSLACADLIIGAFSMNLYTTYTLMGYWPLGPLVCDLW 106 (503)
T ss_pred EeeehhhhHHHHHhhhccEEEEEEEEecceeeeecceeehHHHHhhhhhheeechHHHHHHHHcccccchHHHHHH
Confidence 4566778889999999999999999999999999999999999999999999999999999999999999999875
No 4
>PHA02834 chemokine receptor-like protein; Provisional
Probab=99.21 E-value=6.5e-12 Score=106.55 Aligned_cols=78 Identities=21% Similarity=0.272 Sum_probs=63.3
Q ss_pred hhHHHHHHHHHHHHhhhccchheeeeeeccCCCCchhHHHHHHHHHHHHHHhhhhcccceeeecccccccccccceeecc
Q psy6892 13 SWNIWAVVTLILVFCTATGNILVCLAISWEKRLQNVTNYFLMSLAITDLMVAVLVMPLGILTLVRVRTNLSGAVLRLKKN 92 (190)
Q Consensus 13 ~~~~~~v~~~ii~vlgi~gN~lvi~vi~~~~~L~~~~~~fl~nLAvaDLl~~l~~lP~~l~~~l~~~~~~s~~lC~i~~~ 92 (190)
.+.++.+++++++++|++||++|++++.++++ ++++|+|++|||++|+++ ++.+|+.+.... ..|.+|..+|++.++
T Consensus 27 ~~~~~~~~~~li~v~~~~gN~lVi~vi~~~~~-~~~~n~~i~nLAiaDll~-~~~lP~~i~~~~-~~w~~g~~~C~~~~~ 103 (323)
T PHA02834 27 VNYFVIVFYILLFIFGLIGNVLVIAVLIVKRF-MFVVDVYLFNIAMSDLML-VFSFPFIIHNDL-NEWIFGEFMCKLVLG 103 (323)
T ss_pred hhhhHHHHHHHHHHHHHhhHHHHHHHHHhccc-cchhhhhhHHHHHHHHHH-HHHHHHHHHHHc-CCcCCcchHHHhHHH
Confidence 45677899999999999999999998877665 457899999999999986 567887665443 468999999998653
Q ss_pred c
Q psy6892 93 I 93 (190)
Q Consensus 93 i 93 (190)
+
T Consensus 104 ~ 104 (323)
T PHA02834 104 V 104 (323)
T ss_pred H
Confidence 3
No 5
>PHA02638 CC chemokine receptor-like protein; Provisional
Probab=99.13 E-value=1.9e-11 Score=107.40 Aligned_cols=80 Identities=15% Similarity=0.241 Sum_probs=65.4
Q ss_pred hhHHHHHHHHHHHHhhhccchheeeeeeccCCCCchhHHHHHHHHHHHHHHhhhhcccceeeecccccccccccceeecc
Q psy6892 13 SWNIWAVVTLILVFCTATGNILVCLAISWEKRLQNVTNYFLMSLAITDLMVAVLVMPLGILTLVRVRTNLSGAVLRLKKN 92 (190)
Q Consensus 13 ~~~~~~v~~~ii~vlgi~gN~lvi~vi~~~~~L~~~~~~fl~nLAvaDLl~~l~~lP~~l~~~l~~~~~~s~~lC~i~~~ 92 (190)
...++.+++.+++++|++||++|++++. ++++|+++|++++|||++|+++. +.+|+.+... .+.|.++..+|++.+.
T Consensus 97 ~~~~l~~~y~lvfvlgliGN~LVl~il~-~k~lrt~t~i~llnLAisDLl~~-l~lPf~i~~~-~~~W~fg~~~Ck~~~~ 173 (417)
T PHA02638 97 ISEYIKIFYIIIFILGLFGNAAIIMILF-CKKIKTITDIYIFNLAISDLIFV-IDFPFIIYNE-FDQWIFGDFMCKVISA 173 (417)
T ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHH-hccCCCHhHHHHHHHHHHHHHHH-HHHHHHHHHH-hccccccccchhhHHH
Confidence 3456778899999999999999997765 47889999999999999999884 5788776654 4579999999998654
Q ss_pred chh
Q psy6892 93 ILC 95 (190)
Q Consensus 93 i~c 95 (190)
+.+
T Consensus 174 l~~ 176 (417)
T PHA02638 174 SYY 176 (417)
T ss_pred HHH
Confidence 444
No 6
>PHA03087 G protein-coupled chemokine receptor-like protein; Provisional
Probab=99.06 E-value=1.3e-10 Score=98.55 Aligned_cols=80 Identities=19% Similarity=0.387 Sum_probs=66.5
Q ss_pred hhhHHHHHHHHHHHHhhhccchheeeeeeccCCCCchhHHHHHHHHHHHHHHhhhhcccceeeecccccccccccceeec
Q psy6892 12 ASWNIWAVVTLILVFCTATGNILVCLAISWEKRLQNVTNYFLMSLAITDLMVAVLVMPLGILTLVRVRTNLSGAVLRLKK 91 (190)
Q Consensus 12 ~~~~~~~v~~~ii~vlgi~gN~lvi~vi~~~~~L~~~~~~fl~nLAvaDLl~~l~~lP~~l~~~l~~~~~~s~~lC~i~~ 91 (190)
....++.+++.+++++|++||++++++++++ ++|+++|+|++|||++|++.++ .+|..+.......|.++..+|++.+
T Consensus 38 ~~~~~~~~~~~~i~~~gl~gN~lvl~~~~~~-~~~~~~~~ll~~laisDll~~~-~~~~~~~~~~~~~~~~~~~~C~~~~ 115 (335)
T PHA03087 38 TNSTILIVVYSTIFFFGLVGNIIVIYVLTKT-KIKTPMDIYLLNLAVSDLLFVM-TLPFQIYYYILFQWSFGEFACKIVS 115 (335)
T ss_pred chhhHHHHHHHHHHHHHHHhhHhEEeeehhc-cccCchHHHHHHHHHHHHHHHH-hHHHHHHHHhCCCCCCCcHHHHHHH
Confidence 3456788889999999999999999999888 8999999999999999998765 5666655555677888889999765
Q ss_pred cc
Q psy6892 92 NI 93 (190)
Q Consensus 92 ~i 93 (190)
++
T Consensus 116 ~~ 117 (335)
T PHA03087 116 GL 117 (335)
T ss_pred HH
Confidence 44
No 7
>PHA03235 DNA packaging protein UL33; Provisional
Probab=99.04 E-value=9.9e-11 Score=102.63 Aligned_cols=89 Identities=15% Similarity=0.234 Sum_probs=61.8
Q ss_pred hhHHHHHHHHHHHHhhhccchheeeeeeccCCC--CchhHHHHHHHHHHHHHHhhhhcccceeeecccccccccccceee
Q psy6892 13 SWNIWAVVTLILVFCTATGNILVCLAISWEKRL--QNVTNYFLMSLAITDLMVAVLVMPLGILTLVRVRTNLSGAVLRLK 90 (190)
Q Consensus 13 ~~~~~~v~~~ii~vlgi~gN~lvi~vi~~~~~L--~~~~~~fl~nLAvaDLl~~l~~lP~~l~~~l~~~~~~s~~lC~i~ 90 (190)
.+.+..+++++++++|++||++|++++.+++++ ++++++|++|||++|+++ ++.+|+.+... ...|..+..+|++.
T Consensus 31 ~~~~~~~~~~li~vvGiigN~lVL~~~~~~~r~~~~~~~~~~I~NLAvsDLl~-l~~lP~~i~~~-~~~~~~g~~~Ck~~ 108 (409)
T PHA03235 31 ARTTETFINLLIISVGGPLNLIVLVTQLLANRVHGFSTPTLYMTNLYLANLLT-VFVLPFIMLSN-QGLLSGSVAGCKFA 108 (409)
T ss_pred hHhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccCCccHHHHHHHHHHHHHH-HHHHHHHHHhc-CccccCCCCeehhH
Confidence 456788899999999999999999875543332 356789999999999987 56788776432 11223345789987
Q ss_pred ccc---hhhhhhhhhc
Q psy6892 91 KNI---LCENQLFKAG 103 (190)
Q Consensus 91 ~~i---~c~~slfSl~ 103 (190)
+++ .+.+++++++
T Consensus 109 ~~l~~~~~~~Si~tL~ 124 (409)
T PHA03235 109 SLLYYASCTVGFATVA 124 (409)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 543 3344554443
No 8
>PF00001 7tm_1: 7 transmembrane receptor (rhodopsin family) Rhodopsin-like GPCR superfamily signature 5-hydroxytryptamine 7 receptor signature bradykinin receptor signature gastrin receptor signature melatonin receptor signature olfactory receptor signature; InterPro: IPR000276 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The rhodopsin-like GPCRs themselves represent a widespread protein family that includes hormone, neurotransmitter and light receptors, all of which transduce extracellular signals through interaction with guanine nucleotide-binding (G) proteins. Although their activating ligands vary widely in structure and character, the amino acid sequences of the receptors are very similar and are believed to adopt a common structural framework comprising 7 transmembrane (TM) helices [, , ].; GO: 0007186 G-protein coupled receptor protein signaling pathway, 0016021 integral to membrane; PDB: 2KI9_A 3QAK_A 2YDV_A 3VGA_A 3PWH_A 3RFM_A 3EML_A 3VG9_A 3REY_A 3UZA_A ....
Probab=98.70 E-value=1.3e-09 Score=86.44 Aligned_cols=64 Identities=34% Similarity=0.496 Sum_probs=55.6
Q ss_pred cchheeeeeeccCCCCchhHHHHHHHHHHHHHHhhhhcccceeeecccccccccccceeeccch
Q psy6892 31 GNILVCLAISWEKRLQNVTNYFLMSLAITDLMVAVLVMPLGILTLVRVRTNLSGAVLRLKKNIL 94 (190)
Q Consensus 31 gN~lvi~vi~~~~~L~~~~~~fl~nLAvaDLl~~l~~lP~~l~~~l~~~~~~s~~lC~i~~~i~ 94 (190)
||+++++++.+++++|+++++|++|||++|++++++.+|..+.....+.|.++..+|++.+++.
T Consensus 1 GN~lvi~~~~~~~~~~~~~~~~l~~Lav~Dll~~~~~~~~~~~~~~~~~~~~~~~~C~~~~~~~ 64 (257)
T PF00001_consen 1 GNILVILVILRSKRLRTPSNILLLNLAVADLLVGLFCIPFYIYSLLFDDWIFSSFLCRIFGFLF 64 (257)
T ss_dssp HHHHHHHHHHHSGGG-SHHHHHHHHHHHHHHHHHHTHHHHHHHHHHHSSCTSHHHHHHHHHHHH
T ss_pred CchhehhhhhhhccCCChhHHHHHHHHHHHHhhccccccccccccccccccccccccccccccc
Confidence 8999999999999999999999999999999999988888877776677888889998865443
No 9
>KOG2087|consensus
Probab=98.41 E-value=8.1e-08 Score=81.75 Aligned_cols=85 Identities=14% Similarity=0.172 Sum_probs=66.1
Q ss_pred HHHHHHHHHHHhhhccchheeeeeeccCCCCchhHHHHHHHHHHHHHHhhhhcccceeeecc-------cccccccccce
Q psy6892 16 IWAVVTLILVFCTATGNILVCLAISWEKRLQNVTNYFLMSLAITDLMVAVLVMPLGILTLVR-------VRTNLSGAVLR 88 (190)
Q Consensus 16 ~~~v~~~ii~vlgi~gN~lvi~vi~~~~~L~~~~~~fl~nLAvaDLl~~l~~lP~~l~~~l~-------~~~~~s~~lC~ 88 (190)
++-+..+++.+++++||++|++++...+...++..++++|||++|+++++....+..+.... ..||..+..|+
T Consensus 26 ~lRi~vW~i~~lAi~gN~~Vl~~~~~~~~~~~~~~~li~~la~ad~~mGiYl~~ia~vD~~~~gey~~~ai~W~tg~gC~ 105 (363)
T KOG2087|consen 26 ILRISVWVIALLAIVGNLLVLLTRFTSRYELNSHRFLICNLAFADLLMGIYLGLIASVDAKTRGEYYKHAIDWQTGLGCP 105 (363)
T ss_pred eeeehhhhhhhHHhccCeeeeeeeeehhhhccchHHHHHHHHHHHHHcchHHHHHHHhhHHHHHHHHHHHHhhhhcCCCc
Confidence 44456678888999999999999988888888999999999999999997765555544432 23777888899
Q ss_pred eeccchhhhhhh
Q psy6892 89 LKKNILCENQLF 100 (190)
Q Consensus 89 i~~~i~c~~slf 100 (190)
+.|+....++..
T Consensus 106 ~aGflavFASEl 117 (363)
T KOG2087|consen 106 VAGFLAVFASEL 117 (363)
T ss_pred hHHHHHHHHHHH
Confidence 988776654433
No 10
>PF10320 7TM_GPCR_Srsx: Serpentine type 7TM GPCR chemoreceptor Srsx; InterPro: IPR019424 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class sx (Srsx), which is a solo family amongst the superfamilies of chemoreceptors. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' [].
Probab=98.31 E-value=5.3e-08 Score=80.29 Aligned_cols=50 Identities=20% Similarity=0.337 Sum_probs=45.4
Q ss_pred HhhhccchheeeeeeccCCCCchhHHHHHHHHHHHHHHhhhhcccceeee
Q psy6892 26 FCTATGNILVCLAISWEKRLQNVTNYFLMSLAITDLMVAVLVMPLGILTL 75 (190)
Q Consensus 26 vlgi~gN~lvi~vi~~~~~L~~~~~~fl~nLAvaDLl~~l~~lP~~l~~~ 75 (190)
++|++||++++++++++|+||+|++++++.+|++|++.++..+|..+..+
T Consensus 2 ~ig~~gN~~~i~~~~~~~~Lrs~~~~li~~~~~~d~~~~~~~~~~~~~~~ 51 (257)
T PF10320_consen 2 IIGLFGNLLLIILIFRNKSLRSPCYILICILCFADLICLLGTLPFMLFLF 51 (257)
T ss_pred EEEEEccHHHHHHHHhccccccchHHHHHHHHHHHHHHHhhHHHHHHHHH
Confidence 57999999999999999999999999999999999999888888776443
No 11
>PF05296 TAS2R: Mammalian taste receptor protein (TAS2R); InterPro: IPR007960 This family consists of several forms of mammalian taste receptor proteins (TAS2Rs). TAS2Rs are G protein-coupled receptors expressed in subsets of taste receptor cells of the tongue and palate epithelia and are organised in the genome in clusters. The proteins are genetically linked to loci that influence bitter perception in mice and humans [].; GO: 0004930 G-protein coupled receptor activity, 0007186 G-protein coupled receptor protein signaling pathway, 0050909 sensory perception of taste, 0016021 integral to membrane
Probab=96.85 E-value=0.0006 Score=57.63 Aligned_cols=62 Identities=18% Similarity=0.245 Sum_probs=45.8
Q ss_pred hhhHHHHHHHHHHHHhhhccchheeeeeec---cCCCCchhHHHHHHHHHHHHHHhhhhccccee
Q psy6892 12 ASWNIWAVVTLILVFCTATGNILVCLAISW---EKRLQNVTNYFLMSLAITDLMVAVLVMPLGIL 73 (190)
Q Consensus 12 ~~~~~~~v~~~ii~vlgi~gN~lvi~vi~~---~~~L~~~~~~fl~nLAvaDLl~~l~~lP~~l~ 73 (190)
....++.++.++.+++|++||+.++++... +++.-+|.+..+++||++.++.-.+.+...+.
T Consensus 4 ~~~~i~~~i~~~~~~~Gi~~N~FI~~vn~~~w~k~~~l~~~d~IL~~La~sr~~l~~~~~~~~~~ 68 (303)
T PF05296_consen 4 SLEIIFLIILVVEFIIGILGNGFIVLVNCSDWVKSRKLSPSDQILTSLAISRILLQWVILLNSFL 68 (303)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHcCCCCChHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 445667888889999999999998766543 33334689999999999998876554433333
No 12
>PF05462 Dicty_CAR: Slime mold cyclic AMP receptor
Probab=96.59 E-value=0.0014 Score=55.48 Aligned_cols=55 Identities=20% Similarity=0.330 Sum_probs=47.1
Q ss_pred hHHHHHHHHHHHHhhhccchheeeeeeccCCCCchhHHHHHHHHHHHHHHhhhhc
Q psy6892 14 WNIWAVVTLILVFCTATGNILVCLAISWEKRLQNVTNYFLMSLAITDLMVAVLVM 68 (190)
Q Consensus 14 ~~~~~v~~~ii~vlgi~gN~lvi~vi~~~~~L~~~~~~fl~nLAvaDLl~~l~~l 68 (190)
...+.++..+..+++++|.++++..+++.+++|++.+-++.-++++|++..+..+
T Consensus 6 ~~~~~~i~~~~s~lSllGclfiI~tf~~~k~~r~~~~rli~yl~~~~ll~~v~~~ 60 (303)
T PF05462_consen 6 IRTLYAIELVASVLSLLGCLFIIITFCLFKRLRKPINRLIFYLSIANLLTNVASM 60 (303)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCccHHHHHHHHHHHHHHHHHHHH
Confidence 3456677778889999999999999999999999999999999999998775543
No 13
>PF10328 7TM_GPCR_Srx: Serpentine type 7TM GPCR chemoreceptor Srx; InterPro: IPR019430 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class x (Srx) from the Srg superfamily [, ]. Srg receptors contain seven hydrophobic, putative transmembrane, regions and can be distinguished from other 7TM GPCR receptors by their own characteristic TM signatures.
Probab=96.13 E-value=0.001 Score=55.01 Aligned_cols=45 Identities=22% Similarity=0.262 Sum_probs=40.4
Q ss_pred HHHhhhccchheeeeeeccCCCCchhHHHHHHHHHHHHHHhhhhc
Q psy6892 24 LVFCTATGNILVCLAISWEKRLQNVTNYFLMSLAITDLMVAVLVM 68 (190)
Q Consensus 24 i~vlgi~gN~lvi~vi~~~~~L~~~~~~fl~nLAvaDLl~~l~~l 68 (190)
+.++|++.|+++++.+.+.+.+|++.+++..+.|++|++++++.+
T Consensus 3 ~s~~G~~~N~~v~~~~~~~~~~~~sF~~l~~~~a~~n~i~~~~~l 47 (274)
T PF10328_consen 3 ISIIGIILNWLVFIIIFKLKSLRNSFGILCASQAIANIIICLIFL 47 (274)
T ss_pred eeHHHHHHHHHHHHHHHhcccccCCHHHHHHHHHHHHHHHHHHHH
Confidence 467899999999999999999999999999999999999877543
No 14
>PF11710 Git3: G protein-coupled glucose receptor regulating Gpa2; InterPro: IPR023041 This entry contains a functionally uncharacterised region belonging to the Git3 G-protein coupled receptor. Git3 is one of six proteins required for glucose-triggered adenylate cyclase activation, and is a G protein-coupled receptor responsible for the activation of adenylate cyclase through Gpa2 - heterotrimeric G protein alpha subunit, part of the glucose-detection pathway. Git3 contains seven predicted transmembrane domains, a third cytoplasmic loop and a cytoplasmic tail []. This is the conserved N-terminal domain of the member proteins.
Probab=96.06 E-value=0.0019 Score=51.52 Aligned_cols=52 Identities=15% Similarity=0.160 Sum_probs=34.7
Q ss_pred CCCCchhHHHHHHHHHHHHHHhhhhcccceeeecccccccccccceeeccch
Q psy6892 43 KRLQNVTNYFLMSLAITDLMVAVLVMPLGILTLVRVRTNLSGAVLRLKKNIL 94 (190)
Q Consensus 43 ~~L~~~~~~fl~nLAvaDLl~~l~~lP~~l~~~l~~~~~~s~~lC~i~~~i~ 94 (190)
++++..-+.++.||.++|++.++..+...+.++..+.-..+..+|...|++.
T Consensus 30 ~r~~~fR~~LIl~L~~aD~~qal~~~i~~~~~l~~~~i~~~s~~C~aqGf~~ 81 (201)
T PF11710_consen 30 YRRRSFRHQLILNLLLADFIQALAFLISPIRWLARGGIIAPSPFCQAQGFFL 81 (201)
T ss_pred hhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeCCCCchhhhHHHH
Confidence 3445666779999999999998876554444444444444567777765443
No 15
>PF10324 7TM_GPCR_Srw: Serpentine type 7TM GPCR chemoreceptor Srw; InterPro: IPR019427 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class w (Srw), which is a solo family amongst the superfamilies of chemoreceptors. The genes encoding Srw do not appear to be under as strong an adaptive evolutionary pressure as those of Srz [].
Probab=95.75 E-value=0.0021 Score=54.08 Aligned_cols=52 Identities=21% Similarity=0.323 Sum_probs=41.3
Q ss_pred HHHHhhhccchheeeeeeccCCCCc-hhHHHHHHHHHHHHHHhhhhcccceeee
Q psy6892 23 ILVFCTATGNILVCLAISWEKRLQN-VTNYFLMSLAITDLMVAVLVMPLGILTL 75 (190)
Q Consensus 23 ii~vlgi~gN~lvi~vi~~~~~L~~-~~~~fl~nLAvaDLl~~l~~lP~~l~~~ 75 (190)
++.++|+++|++.+.++ .+|.||+ ++|.+++.+|++|++..+..++..+...
T Consensus 5 ~~~~~g~~~N~~h~~VL-trk~mR~~~in~~l~~Iai~Dl~~~~~~~~~~~~~~ 57 (318)
T PF10324_consen 5 ILSIFGLFINIFHLIVL-TRKSMRSSSINILLIGIAICDLLYMLSILIWELFFF 57 (318)
T ss_pred EEeHHHHHHHHHHhhhc-CChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 35678999999988776 4566775 8999999999999999888777665443
No 16
>PF04931 DNA_pol_phi: DNA polymerase phi; InterPro: IPR007015 Proteins of this family are predominantly nucleolar. The majority are described as transcription factor transactivators. The family also includes the fifth essential DNA polymerase (Pol5p) of Schizosaccharomyces pombe (Fission yeast) and Saccharomyces cerevisiae (Baker's yeast) (2.7.7.7 from EC). Pol5p is localized exclusively to the nucleolus and binds near or at the enhancer region of rRNA-encoding DNA repeating units.; GO: 0003677 DNA binding, 0003887 DNA-directed DNA polymerase activity, 0006351 transcription, DNA-dependent
Probab=95.74 E-value=0.0016 Score=61.85 Aligned_cols=17 Identities=18% Similarity=0.602 Sum_probs=12.0
Q ss_pred HHHHHHHhhhhccccee
Q psy6892 57 AITDLMVAVLVMPLGIL 73 (190)
Q Consensus 57 AvaDLl~~l~~lP~~l~ 73 (190)
.+.|++++++.-|..+.
T Consensus 589 vlveiLLslls~~s~ll 605 (784)
T PF04931_consen 589 VLVEILLSLLSQPSALL 605 (784)
T ss_pred HHHHHHHHHHhCcchHH
Confidence 56788888877776543
No 17
>PF10317 7TM_GPCR_Srd: Serpentine type 7TM GPCR chemoreceptor Srd; InterPro: IPR019421 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents the chemoreceptor Srd [].
Probab=95.33 E-value=0.0066 Score=50.81 Aligned_cols=49 Identities=20% Similarity=0.247 Sum_probs=38.5
Q ss_pred HHHHHHHHhhhccchheeeeeecc-CCCCchhHHHHHHHHHHHHHHhhhh
Q psy6892 19 VVTLILVFCTATGNILVCLAISWE-KRLQNVTNYFLMSLAITDLMVAVLV 67 (190)
Q Consensus 19 v~~~ii~vlgi~gN~lvi~vi~~~-~~L~~~~~~fl~nLAvaDLl~~l~~ 67 (190)
+++.+++++|++.|++++++++++ ++.-+...+++.|-|++|++.++..
T Consensus 3 i~~~~~~~~~~~~n~~Ll~~i~~~tp~~l~~~~~~l~~~~~~~~~~~~~~ 52 (292)
T PF10317_consen 3 IYHPIFFILGIILNILLLYLIIFKTPKSLRTYSILLLNTAIFDLISIISA 52 (292)
T ss_pred EeHHHHHHHHHHHHHHHHHHHHHhChHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 356788899999999999877654 4444667899999999999987653
No 18
>PF10321 7TM_GPCR_Srt: Serpentine type 7TM GPCR chemoreceptor Srt; InterPro: IPR019425 Chemoreception is mediated in Caenorhabditis elegans by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs) of proteins which are of the serpentine type []. Srt is a member of the Srg superfamily of chemoreceptors. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' [].
Probab=94.99 E-value=0.018 Score=49.07 Aligned_cols=56 Identities=20% Similarity=0.140 Sum_probs=47.1
Q ss_pred chhhHHHHHHHHHHHHhhhccchheeeeeeccCCCCchhHHHHHHHHHHHHHHhhh
Q psy6892 11 AASWNIWAVVTLILVFCTATGNILVCLAISWEKRLQNVTNYFLMSLAITDLMVAVL 66 (190)
Q Consensus 11 ~~~~~~~~v~~~ii~vlgi~gN~lvi~vi~~~~~L~~~~~~fl~nLAvaDLl~~l~ 66 (190)
...++++++.+++.+++.++..+++++++.+.+.+|.|.|-++.-||+.|++....
T Consensus 29 g~~~p~~G~~~~~~g~~~~~lY~p~~~~i~~~~~~k~~~ykiM~~L~i~Di~~l~~ 84 (313)
T PF10321_consen 29 GVKRPILGIYFLIFGIIIIILYIPCLIAIFKKKLFKMSCYKIMFFLAIFDIIQLFI 84 (313)
T ss_pred CCcccchhHHHHHHHHHHHHHHHHHHHHHHHhccccCcHHHHHHHHHHHHHHHHHh
Confidence 34567788888888888888899999998887777899999999999999998644
No 19
>KOG3064|consensus
Probab=92.97 E-value=0.024 Score=46.51 Aligned_cols=14 Identities=14% Similarity=0.007 Sum_probs=6.4
Q ss_pred Hhhhccchheeeee
Q psy6892 26 FCTATGNILVCLAI 39 (190)
Q Consensus 26 vlgi~gN~lvi~vi 39 (190)
-+++-|+-..++.+
T Consensus 124 klalr~~q~~~vp~ 137 (303)
T KOG3064|consen 124 KLALRGRQKKLVPI 137 (303)
T ss_pred HHHhccCceEEeec
Confidence 44555554444333
No 20
>PF00002 7tm_2: 7 transmembrane receptor (Secretin family); InterPro: IPR000832 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The secretin-like GPCRs include secretin [], calcitonin [], parathyroid hormone/parathyroid hormone-related peptides [] and vasoactive intestinal peptide [], all of which activate adenylyl cyclase and the phosphatidyl-inositol-calcium pathway. These receptors contain seven transmembrane regions, in a manner reminiscent of the rhodopsins and other receptors believed to interact with G-proteins (however there is no significant sequence identity between these families, the secretin-like receptors thus bear their own unique '7TM' signature). Their N terminus is probably located on the extracellular side of the membrane and potentially glycosylated. This N-terminal region contains a long conserved region which allow the binding of large peptidic ligand such as glucagon, secretin, VIP and PACAP; this region contains five conserved cysteines residues which could be involved in disulphide bond. The C-terminal region of these receptor is probably cytoplasmic. Every receptor gene in this family is encoded on multiple exons, and several of these genes are alternatively spliced to yield functionally distinct products. ; GO: 0004930 G-protein coupled receptor activity, 0007186 G-protein coupled receptor protein signaling pathway, 0016021 integral to membrane; PDB: 3L2J_A 1BL1_A.
Probab=92.93 E-value=0.061 Score=43.30 Aligned_cols=49 Identities=10% Similarity=0.181 Sum_probs=4.0
Q ss_pred HHHHHHHHhhhccchheeeeeeccCCCCchhHHHHHHHHHHHHHHhhhh
Q psy6892 19 VVTLILVFCTATGNILVCLAISWEKRLQNVTNYFLMSLAITDLMVAVLV 67 (190)
Q Consensus 19 v~~~ii~vlgi~gN~lvi~vi~~~~~L~~~~~~fl~nLAvaDLl~~l~~ 67 (190)
++..+...+++++-++++.+....|++|+..+....||+++.++..+..
T Consensus 5 ~i~~vg~~~Si~~ll~~i~~~~~~r~lr~~~~~i~~~l~~sll~~~~~~ 53 (242)
T PF00002_consen 5 IISYVGCSLSIICLLLTIITYLLFRKLRSFRNKIHLNLCLSLLLANLSF 53 (242)
T ss_dssp HHHHHHHH-----------------------------------------
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHhhcccchhhhhhhHHHHHHHHHHH
Confidence 3444555566666666666666668888888888899999988776543
No 21
>PF03839 Sec62: Translocation protein Sec62; InterPro: IPR004728 Members of the NSCC2 family have been sequenced from various yeast, fungal and animals species including Saccharomyces cerevisiae, Drosophila melanogaster and Homo sapiens. These proteins are the Sec62 proteins, believed to be associated with the Sec61 and Sec63 constituents of the general protein secretary systems of yeast microsomes. They are also the non-selective cation (NS) channels of the mammalian cytoplasmic membrane. The yeast Sec62 protein has been shown to be essential for cell growth. The mammalian NS channel proteins have been implicated in platelet derived growth factor(PGDF) dependent single channel current in fibroblasts. These channels are essentially closed in serum deprived tissue-culture cells and are specifically opened by exposure to PDGF. These channels are reported to exhibit equal selectivity for Na+, K+ and Cs+ with low permeability to Ca2+, and no permeability to anions.; GO: 0008565 protein transporter activity, 0015031 protein transport, 0016021 integral to membrane
Probab=92.40 E-value=0.087 Score=42.73 Aligned_cols=15 Identities=20% Similarity=0.211 Sum_probs=6.7
Q ss_pred HHHHHHHHHHHHHHh
Q psy6892 50 NYFLMSLAITDLMVA 64 (190)
Q Consensus 50 ~~fl~nLAvaDLl~~ 64 (190)
|+.++.|++.-++++
T Consensus 141 YlS~~~lgll~~~~~ 155 (224)
T PF03839_consen 141 YLSVGALGLLGLFFA 155 (224)
T ss_pred hhHHHHHHHHHHHHH
Confidence 444444444444443
No 22
>PF02724 CDC45: CDC45-like protein; InterPro: IPR003874 CDC45 is an essential gene required for initiation of DNA replication in Saccharomyces cerevisiae (cell division control protein 45), forming a complex with MCM5/CDC46. Homologs of CDC45 have been identified in human [], mouse and the smut fungus, Melampsora spp., (tsd2 protein) among others.; GO: 0006270 DNA-dependent DNA replication initiation
Probab=92.12 E-value=0.056 Score=50.20 Aligned_cols=13 Identities=31% Similarity=0.319 Sum_probs=5.4
Q ss_pred HHHHHH-HHHHHHH
Q psy6892 51 YFLMSL-AITDLMV 63 (190)
Q Consensus 51 ~fl~nL-AvaDLl~ 63 (190)
++++|. |..||.-
T Consensus 55 vilIncGa~~dl~~ 68 (622)
T PF02724_consen 55 VILINCGATVDLEE 68 (622)
T ss_pred EEEEecCchhhHHH
Confidence 333443 3345543
No 23
>KOG1189|consensus
Probab=91.95 E-value=0.04 Score=51.45 Aligned_cols=12 Identities=25% Similarity=0.263 Sum_probs=5.6
Q ss_pred hhhhhhhhhhhh
Q psy6892 157 EEEEDKQGEEKR 168 (190)
Q Consensus 157 ~~~~~~~~~~~~ 168 (190)
++.+++++.+.+
T Consensus 928 E~gkdwdele~e 939 (960)
T KOG1189|consen 928 ESGKDWDELERE 939 (960)
T ss_pred hhccchhhhHHH
Confidence 444455544443
No 24
>PF10327 7TM_GPCR_Sri: Serpentine type 7TM GPCR chemoreceptor Sri; InterPro: IPR019429 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents Sri, which is part of the Str superfamily of chemoreceptors.
Probab=90.55 E-value=0.61 Score=39.40 Aligned_cols=65 Identities=20% Similarity=0.231 Sum_probs=45.4
Q ss_pred hHHHHHHHHHHHHhhhccchheeeeeec-cCCCCchhHHHH---HHHHHHHHHHhhhhcccceeeeccc
Q psy6892 14 WNIWAVVTLILVFCTATGNILVCLAISW-EKRLQNVTNYFL---MSLAITDLMVAVLVMPLGILTLVRV 78 (190)
Q Consensus 14 ~~~~~v~~~ii~vlgi~gN~lvi~vi~~-~~~L~~~~~~fl---~nLAvaDLl~~l~~lP~~l~~~l~~ 78 (190)
..++...+-++++++++.|.++++.++. .+++.+-.++++ +...++|+..+++..|..+..+..+
T Consensus 8 P~~li~~~~~ig~iS~~~n~~~iyLi~fks~k~~~fry~ll~~Qi~~~l~di~~t~L~qpipLfP~~ag 76 (303)
T PF10327_consen 8 PQWLINYYHIIGVISFILNSLGIYLIIFKSPKLDNFRYYLLYFQISCTLTDIHLTFLMQPIPLFPIPAG 76 (303)
T ss_pred cHHHHHHHHHHHHHHHHHHHHHheeEEEecCCccchhhHHHHHHHHHHHhhhhhhhhccchhhcceeEE
Confidence 3456677888999999999999877665 455555444333 3455679999988888776665544
No 25
>PF03402 V1R: Vomeronasal organ pheromone receptor family, V1R; InterPro: IPR004072 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The rhodopsin-like GPCRs themselves represent a widespread protein family that includes hormone, neurotransmitter and light receptors, all of which transduce extracellular signals through interaction with guanine nucleotide-binding (G) proteins. Although their activating ligands vary widely in structure and character, the amino acid sequences of the receptors are very similar and are believed to adopt a common structural framework comprising 7 transmembrane (TM) helices [, , ]. Pheromones have evolved in all animal phyla, to signal sex and dominance status, and are responsible for stereotypical social and sexual behaviour among members of the same species. In mammals, these chemical signals are believed to be detected primarily by the vomeronasal organ (VNO), a chemosensory organ located at the base of the nasal septum []. The VNO is present in most amphibia, reptiles and non-primate mammals but is absent in birds, adult catarrhine monkeys and apes []. An active role for the human VNO in the detection of pheromones is disputed; the VNO is clearly present in the foetus but appears to be atrophied or absent in adults. Three distinct families of putative pheromone receptors have been identified in the vomeronasal organ (V1Rs, V2Rs and V3Rs). All are G protein-coupled receptors but are only distantly related to the receptors of the main olfactory system, highlighting their different role []. The V1 receptors share between 50 and 90% sequence identity but have little similarity to other families of G protein-coupled receptors. They appear to be distantly related to the mammalian T2R bitter taste receptors and the rhodopsin-like GPCRs []. In rat, the family comprises 30-40 genes. These are expressed in the apical regions of the VNO, in neurons expressing Gi2. Coupling of the receptors to this protein mediates inositol trisphosphate signalling []. A number of human V1 receptor homologues have also been found. The majority of these human sequences are pseudogenes [] but an apparently functional receptor has been identified that is expressed in the human olfactory system [].; GO: 0016503 pheromone receptor activity, 0007186 G-protein coupled receptor protein signaling pathway, 0016021 integral to membrane
Probab=87.62 E-value=0.55 Score=39.13 Aligned_cols=45 Identities=11% Similarity=0.106 Sum_probs=30.8
Q ss_pred CchhHHHHHHHHHHHHHHhhh-hcccceeeecccccccccccceeecc
Q psy6892 46 QNVTNYFLMSLAITDLMVAVL-VMPLGILTLVRVRTNLSGAVLRLKKN 92 (190)
Q Consensus 46 ~~~~~~fl~nLAvaDLl~~l~-~lP~~l~~~l~~~~~~s~~lC~i~~~ 92 (190)
.+|++..+.|||++++++.++ ++|..... .+...++...|++..+
T Consensus 8 ~kp~dlIl~hLa~aN~lvLl~rGip~~~~~--~~~~~~~d~gCK~v~Y 53 (265)
T PF03402_consen 8 LKPIDLILIHLALANILVLLSRGIPQTMAF--FGWKFFDDIGCKIVFY 53 (265)
T ss_pred CCcHHHHHHHHHHHHHHHHHHhhHHHHHHH--hhcccCCCceeeeeee
Confidence 468999999999999988654 55643222 2222457888997643
No 26
>KOG2927|consensus
Probab=84.42 E-value=0.27 Score=42.28 Aligned_cols=6 Identities=0% Similarity=-0.379 Sum_probs=3.0
Q ss_pred cccccc
Q psy6892 79 RTNLSG 84 (190)
Q Consensus 79 ~~~~s~ 84 (190)
.|.|+.
T Consensus 256 ~W~FPN 261 (372)
T KOG2927|consen 256 FWLFPN 261 (372)
T ss_pred eEeccc
Confidence 455544
No 27
>PF09882 DUF2109: Predicted membrane protein (DUF2109); InterPro: IPR019214 This entry is found in various hypothetical archaeal proteins and has no known function.
Probab=84.20 E-value=0.26 Score=33.04 Aligned_cols=48 Identities=13% Similarity=0.225 Sum_probs=34.2
Q ss_pred HhhhccchheeeeeeccCCCCchhHHHHHHHHHHHHHHhhhhccccee
Q psy6892 26 FCTATGNILVCLAISWEKRLQNVTNYFLMSLAITDLMVAVLVMPLGIL 73 (190)
Q Consensus 26 vlgi~gN~lvi~vi~~~~~L~~~~~~fl~nLAvaDLl~~l~~lP~~l~ 73 (190)
++|+++-+.++.++..+.+.++.+|+..+|.+++-++....-.|+.++
T Consensus 5 i~g~Iai~~~iR~~~~~~r~~KL~yLnv~~F~iaalIaL~i~~P~g~i 52 (78)
T PF09882_consen 5 IIGIIAILMAIRIFLTKSRARKLLYLNVINFAIAALIALYIKSPMGAI 52 (78)
T ss_pred HHHHHHHHHHHHHHHhHhHHHhhhHHHHHHHHHHHHHHHHhCCcHHHH
Confidence 344444445555666667778888999999999999987777776543
No 28
>PF10856 DUF2678: Protein of unknown function (DUF2678); InterPro: IPR022564 This family of proteins has no known function.
Probab=83.62 E-value=0.38 Score=34.85 Aligned_cols=23 Identities=17% Similarity=-0.133 Sum_probs=9.8
Q ss_pred cccccceeeccchhhhhhhhhcc
Q psy6892 82 LSGAVLRLKKNILCENQLFKAGA 104 (190)
Q Consensus 82 ~s~~lC~i~~~i~c~~slfSl~s 104 (190)
+..-+-++..++++.+.+..+++
T Consensus 87 l~Pkfr~li~~~~~~ivllci~a 109 (118)
T PF10856_consen 87 LDPKFRYLIYYNCFSIVLLCICA 109 (118)
T ss_pred CChhHHHHHHHHHHHHHHHHHhh
Confidence 33333333444444444444443
No 29
>PF10326 7TM_GPCR_Str: Serpentine type 7TM GPCR chemoreceptor Str; InterPro: IPR019428 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class r (Str) from the Str superfamily [, ]. Almost a quarter (22.5%) of str and srj family genes and pseudogenes in C. elegans appear to have been newly formed by gene duplications since the species split [].
Probab=82.29 E-value=0.43 Score=39.90 Aligned_cols=47 Identities=15% Similarity=0.192 Sum_probs=30.4
Q ss_pred HHHHHHHhhhccchheeeeeeccCCCCc-hhHHHHHHHHHHHHHHhhh
Q psy6892 20 VTLILVFCTATGNILVCLAISWEKRLQN-VTNYFLMSLAITDLMVAVL 66 (190)
Q Consensus 20 ~~~ii~vlgi~gN~lvi~vi~~~~~L~~-~~~~fl~nLAvaDLl~~l~ 66 (190)
+..+.++++++.|+++++.+.++.+.+. .--++++..|+.+++++++
T Consensus 4 ~~~~~~~~s~~~N~~Li~Li~~~s~k~~G~Yk~Lm~~fs~~~i~fs~~ 51 (307)
T PF10326_consen 4 IQYIGFVLSLFLNSLLIYLILTKSPKSLGSYKYLMIYFSIFEIIFSIL 51 (307)
T ss_pred hHHHHHHHHHHHHHHHHHHHHhccCCCCCCEEEEEehhHHHHHHHHHH
Confidence 3456677889999999988765433332 2235666667777776655
No 30
>COG5406 Nucleosome binding factor SPN, SPT16 subunit [Transcription / DNA replication, recombination, and repair / Chromatin structure and dynamics]
Probab=79.11 E-value=0.64 Score=43.08 Aligned_cols=13 Identities=15% Similarity=0.136 Sum_probs=5.6
Q ss_pred hhhhhhccccccc
Q psy6892 97 NQLFKAGARDETA 109 (190)
Q Consensus 97 ~slfSl~se~~~~ 109 (190)
...|....+.++.
T Consensus 919 gW~fL~~gsddE~ 931 (1001)
T COG5406 919 GWSFLMVGSDDES 931 (1001)
T ss_pred cceeeecCCcccc
Confidence 3344444444444
No 31
>PF09726 Macoilin: Transmembrane protein; InterPro: IPR019130 This entry represents the multi-pass transmembrane protein Macoilin, which is highly conserved in eukaryotes. ; GO: 0016021 integral to membrane
Probab=76.89 E-value=1.2 Score=41.98 Aligned_cols=22 Identities=18% Similarity=0.508 Sum_probs=13.6
Q ss_pred hHHHHHHHHHHHHHHhhhhcccc
Q psy6892 49 TNYFLMSLAITDLMVAVLVMPLG 71 (190)
Q Consensus 49 ~~~fl~nLAvaDLl~~l~~lP~~ 71 (190)
+-+|+|-.+.+||++ ++++|+.
T Consensus 76 ~~~~~~~~~~~d~~~-~~~~p~~ 97 (697)
T PF09726_consen 76 SVFFVCIAFTSDLIC-LFFIPVH 97 (697)
T ss_pred HHHHHHHHHHHHHHH-HHHHHHH
Confidence 345555555678876 5666653
No 32
>KOG1832|consensus
Probab=75.83 E-value=0.76 Score=44.31 Aligned_cols=14 Identities=29% Similarity=0.430 Sum_probs=6.1
Q ss_pred HHHHHHHHHHHHHH
Q psy6892 50 NYFLMSLAITDLMV 63 (190)
Q Consensus 50 ~~fl~nLAvaDLl~ 63 (190)
++....|-++|...
T Consensus 1314 ~VmYa~~~~~d~~s 1327 (1516)
T KOG1832|consen 1314 DVMYAMLNIEDVMS 1327 (1516)
T ss_pred cchhhhhhhhhhhh
Confidence 34444444444443
No 33
>PF08229 SHR3_chaperone: ER membrane protein SH3 ; InterPro: IPR013248 This family of proteins are membrane localised chaperones that are required for correct plasma membrane localisation of amino acid permeases (AAPs) []. Shr3 prevents AAPs proteins from aggregating and assists in their correct folding. In the absence of Shr3, AAPs are retained in the ER.
Probab=73.31 E-value=5.7 Score=31.60 Aligned_cols=22 Identities=14% Similarity=0.066 Sum_probs=12.1
Q ss_pred HHHHHHHHHHHHhhhccchhee
Q psy6892 15 NIWAVVTLILVFCTATGNILVC 36 (190)
Q Consensus 15 ~~~~v~~~ii~vlgi~gN~lvi 36 (190)
.++..++.++..+|++|.++=+
T Consensus 59 ~~v~~~Lh~v~~lglig~~iKl 80 (196)
T PF08229_consen 59 PIVKYILHIVIGLGLIGLLIKL 80 (196)
T ss_pred hHHHHHHHHHHHHHHHHHHHHH
Confidence 3444455556666676665443
No 34
>PF03344 Daxx: Daxx Family; InterPro: IPR005012 Daxx is a ubiquitously expressed protein that functions, in part, as a transcriptional co-repressor through its interaction with a growing number of nuclear, DNA-associated proteins. Human Daxx contains four structural domains commonly found in transcriptional regulatory proteins: two predicted paired amphipathic helices, an acid-rich domain and a Ser/Pro/Thr (SPT)-rich domain. The post-translational modification status of the SPT-domain of hDaxx regulates its association with transcription factors such as Pax3 and ETS-1, effectively bringing hDaxx to sites of active transcription. Through its presence at the site of active transcription, hDaxx could then be able to associate with acetylated histones present in the nucleosomes and Dek that is associated with chromatin. Through its association with the SPT-domain of hDaxx, histone deacetylases may also be brought to the site of active transcription. As a consequence, nucleosomes in the vicinity of the site of active transcription will have the histone tails deacetylated, allowing the deactylated tail to bind to DNA, thereby leading to an inactive chromatin structure and transcriptional repression []. The Daxx protein (also known as the Fas-binding protein) is thought to play a role in apoptosis as a component of nuclear promyelocytic leukemia protein (PML) oncogenic domains (PODS). Daxx associates with PODs through a direct interaction with PML, a critical component of PODs. The interaction is a dynamic, cell cycle regulated event and is dependent on the post-translational modification of PML by the small ubiquitin-related modifier SUMO-1. ; PDB: 2KZS_A 2KZU_A.
Probab=72.50 E-value=1.2 Score=42.17 Aligned_cols=7 Identities=29% Similarity=0.130 Sum_probs=0.0
Q ss_pred HHHHHHH
Q psy6892 50 NYFLMSL 56 (190)
Q Consensus 50 ~~fl~nL 56 (190)
|-|.++|
T Consensus 346 ~~~~~~l 352 (713)
T PF03344_consen 346 YNFGCHL 352 (713)
T ss_dssp -------
T ss_pred ccccccc
Confidence 4444444
No 35
>PF10319 7TM_GPCR_Srj: Serpentine type 7TM GPCR chemoreceptor Srj; InterPro: IPR019423 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class j (Srj) from the Str superfamily [, ]. The Srj family is designated as the out-group based on its location in preliminary phylogenetic analyses of the entire superfamily [].
Probab=70.74 E-value=10 Score=32.34 Aligned_cols=53 Identities=25% Similarity=0.310 Sum_probs=39.7
Q ss_pred HHHHHHHhhhccchheeeeeeccCCCCch-hHHHHHHHHHHHHHHhhh--hcccce
Q psy6892 20 VTLILVFCTATGNILVCLAISWEKRLQNV-TNYFLMSLAITDLMVAVL--VMPLGI 72 (190)
Q Consensus 20 ~~~ii~vlgi~gN~lvi~vi~~~~~L~~~-~~~fl~nLAvaDLl~~l~--~lP~~l 72 (190)
+.-+.++++.+-|-+.++++...++.+-- --++++..|+-|++.+++ .+|..+
T Consensus 10 ~Pk~~~~lsf~~Np~fiyli~~~~~~~~G~Yr~LL~~Fa~fn~~~S~~~~~vp~~v 65 (310)
T PF10319_consen 10 IPKIFGILSFIVNPIFIYLIFTEKKSQFGNYRYLLLFFAIFNLIYSVVDLLVPICV 65 (310)
T ss_pred HHHHHHHHHHHHhhhhheeEEcccccccccHHHHHHHHHHHHHHHHHHHHHhhhee
Confidence 44566777888899999999888877643 357888999999998876 445443
No 36
>PF02101 Ocular_alb: Ocular albinism type 1 protein; InterPro: IPR001414 Ocular albinism type 1 (OA1) is an X-linked disorder characterised by severe impairment of visual acuity, retinal hypopigmentation and the presence of macromelanosomes. A novel transcript from the OA1 critical region is expressed in high levels in RNA samples from retina and from melanoma and encodes a potential integral membrane protein []. This protein is of unknown function but is known to bind heterotrimeric G proteins.; GO: 0016020 membrane
Probab=70.52 E-value=6.2 Score=34.63 Aligned_cols=57 Identities=21% Similarity=0.165 Sum_probs=31.4
Q ss_pred HHHHHHHhhhccchheeeeeeccC----CCCch--hHHHHHHHHHHHHHHhhhhcccceeeec
Q psy6892 20 VTLILVFCTATGNILVCLAISWEK----RLQNV--TNYFLMSLAITDLMVAVLVMPLGILTLV 76 (190)
Q Consensus 20 ~~~ii~vlgi~gN~lvi~vi~~~~----~L~~~--~~~fl~nLAvaDLl~~l~~lP~~l~~~l 76 (190)
+.+.-.++|++|-++-|+--.|.. +.++| ..-.+..||+||++.++..+...++++.
T Consensus 32 vCLgSs~l~l~gallQLlp~rr~~~~~~~~~sp~~~~rIl~~la~aDlLaclGVivRS~vWl~ 94 (405)
T PF02101_consen 32 VCLGSSVLSLLGALLQLLPRRRSAGPRAPARSPSSSRRILFWLAVADLLACLGVIVRSSVWLG 94 (405)
T ss_pred hHHHHHHHHHHHHHHhhccccccccccccccCCcCCchhHHHHHHHHHHhhhhHHHHhhhhhc
Confidence 334444556666555554222211 11122 3468889999999988766555444443
No 37
>PHA03055 Hypothetical protein; Provisional
Probab=69.87 E-value=7.9 Score=26.09 Aligned_cols=57 Identities=11% Similarity=0.219 Sum_probs=34.9
Q ss_pred HHHHHHHhhhccchheeeeeeccCCCCchhHHHHHHHHHHHHHHhhhhcccceeeec
Q psy6892 20 VTLILVFCTATGNILVCLAISWEKRLQNVTNYFLMSLAITDLMVAVLVMPLGILTLV 76 (190)
Q Consensus 20 ~~~ii~vlgi~gN~lvi~vi~~~~~L~~~~~~fl~nLAvaDLl~~l~~lP~~l~~~l 76 (190)
+.+++.+..+.|-.+++-.+.-++.+-..+..+.--|.+.+++..++.+|..+..+.
T Consensus 13 ~TlL~llMiisG~ali~k~~~p~r~~~~RS~~~~rVl~~lE~va~lifIPgti~LY~ 69 (79)
T PHA03055 13 ITVLMLLMVISGTAMIVKELNPNDIFTMQSLKFNRAVTIFKYIGLFIYIPGTIILYA 69 (79)
T ss_pred HHHHHHHHHHhccHHHhhhcCccceeeehhhhhHHHHHHHHHHHHHHhhccHHHHHH
Confidence 333444445566666654444443333344467777888899888888887765543
No 38
>PF10292 7TM_GPCR_Srab: Serpentine type 7TM GPCR receptor class ab chemoreceptor; InterPro: IPR019408 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. Srab is part of the Sra superfamily of chemoreceptors. The expression pattern of the srab genes is biologically intriguing. Of the six promoters successfully expressed in transgenic organisms, one was exclusively expressed in the tail phasmid neurons, two were exclusively expressed in a head amphid neuron, and two were expressed both in the head and tail neurons as well as a limited number of other cells [].
Probab=66.24 E-value=5 Score=34.06 Aligned_cols=53 Identities=13% Similarity=0.060 Sum_probs=39.1
Q ss_pred hHHHHHHHHHHHHhhhccchheeeeeeccCCCCchhHHHHHHHHHHHHHHhhh
Q psy6892 14 WNIWAVVTLILVFCTATGNILVCLAISWEKRLQNVTNYFLMSLAITDLMVAVL 66 (190)
Q Consensus 14 ~~~~~v~~~ii~vlgi~gN~lvi~vi~~~~~L~~~~~~fl~nLAvaDLl~~l~ 66 (190)
..+..++.+++.++|++.++.+++.+..++.+|.-+-+++....++-++.++.
T Consensus 16 l~~~~~~~~~~s~~~~~~~~~~~~~~~~~~~~H~N~ril~~~~~~~~l~~~~~ 68 (324)
T PF10292_consen 16 LRLSLIFNLLLSIIAFPVIIYALWKIRNSKLFHFNTRILFIVHCFSFLIHCTG 68 (324)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhcchhchhHHHHHHHHHHHHHHHHHH
Confidence 34455567777788888888888888788888877777777777777766543
No 39
>KOG4193|consensus
Probab=66.00 E-value=2 Score=39.92 Aligned_cols=37 Identities=14% Similarity=0.111 Sum_probs=22.2
Q ss_pred HHHHhhhccchheeeeeeccCCCCchhHHHHHHHHHH
Q psy6892 23 ILVFCTATGNILVCLAISWEKRLQNVTNYFLMSLAIT 59 (190)
Q Consensus 23 ii~vlgi~gN~lvi~vi~~~~~L~~~~~~fl~nLAva 59 (190)
+-.++++++=++.+++.+..|++++..+...+||+++
T Consensus 328 VG~~iSlvcL~lti~ty~~~~~l~~~~~~i~~~l~~~ 364 (610)
T KOG4193|consen 328 VGCIISLVCLLLTIATYLLFRKLQNDRTKIHINLCLC 364 (610)
T ss_pred hhhHHHHHHHHHHHHHHHHHHHHHhhcchhHHHHHHH
Confidence 3334444444555555556666665557777888877
No 40
>KOG2082|consensus
Probab=64.95 E-value=5 Score=38.35 Aligned_cols=30 Identities=20% Similarity=0.215 Sum_probs=14.1
Q ss_pred HHHHHHHHHHHHHhhhhcccceeeeccccccc
Q psy6892 51 YFLMSLAITDLMVAVLVMPLGILTLVRVRTNL 82 (190)
Q Consensus 51 ~fl~nLAvaDLl~~l~~lP~~l~~~l~~~~~~ 82 (190)
|..+||||+ +.+++-.|.+-..+-.++|.+
T Consensus 581 Y~fVNLaCa--vqtLLrtPnWRPRfkyyHW~L 610 (1075)
T KOG2082|consen 581 YLFVNLACA--VQTLLRTPNWRPRFKYYHWSL 610 (1075)
T ss_pred HHHHhHHHH--HHHHhcCCCCCccchhhhhHH
Confidence 334455554 223445565544444445543
No 41
>PHA03062 putative IMV membrane protein; Provisional
Probab=64.74 E-value=10 Score=25.56 Aligned_cols=56 Identities=13% Similarity=0.225 Sum_probs=34.1
Q ss_pred HHHHHHHhhhccchheeeeeeccCCCCchhHHHHHHHHHHHHHHhhhhcccceeee
Q psy6892 20 VTLILVFCTATGNILVCLAISWEKRLQNVTNYFLMSLAITDLMVAVLVMPLGILTL 75 (190)
Q Consensus 20 ~~~ii~vlgi~gN~lvi~vi~~~~~L~~~~~~fl~nLAvaDLl~~l~~lP~~l~~~ 75 (190)
+.+++.+..+.|..+++-.+.-++.+-..+..+.--+.+.+++..++.+|..+..+
T Consensus 14 ~TlL~llMivsG~ali~k~~~p~r~~~~RS~~~~rVl~~lE~va~lifIPgti~LY 69 (78)
T PHA03062 14 LTLLALLMIVSGGALLFKSLAPHRVVMMRSATFNKVVSFLEYVAILIFVPGTIALY 69 (78)
T ss_pred HHHHHHHHHHhChHHHHhhcCccceeeehhhhhHHHHHHHHHHHHHHhhccHHHHH
Confidence 33344444556666665444433333333446777788889888888888776554
No 42
>PF11446 DUF2897: Protein of unknown function (DUF2897); InterPro: IPR021550 This is a bacterial family of uncharacterised proteins.
Probab=63.81 E-value=2.9 Score=26.29 Aligned_cols=12 Identities=25% Similarity=0.116 Sum_probs=8.2
Q ss_pred hhhccchheeee
Q psy6892 27 CTATGNILVCLA 38 (190)
Q Consensus 27 lgi~gN~lvi~v 38 (190)
-.|+||+++|-.
T Consensus 14 gvIigNia~LK~ 25 (55)
T PF11446_consen 14 GVIIGNIAALKY 25 (55)
T ss_pred HHHHhHHHHHHH
Confidence 347899887743
No 43
>PF04713 Pox_I5: Poxvirus protein I5; InterPro: IPR006803 This entry represents the Poxvirus protein I5.
Probab=63.30 E-value=12 Score=25.15 Aligned_cols=54 Identities=15% Similarity=0.285 Sum_probs=33.5
Q ss_pred HHHHHhhhccchheeeeeeccCCCCchhHHHHHHHHHHHHHHhhhhcccceeee
Q psy6892 22 LILVFCTATGNILVCLAISWEKRLQNVTNYFLMSLAITDLMVAVLVMPLGILTL 75 (190)
Q Consensus 22 ~ii~vlgi~gN~lvi~vi~~~~~L~~~~~~fl~nLAvaDLl~~l~~lP~~l~~~ 75 (190)
+++.+..+.|-.+++-.+.-++.+-..+..+.--|.+.|.+..++.+|..+..+
T Consensus 15 lL~llMvisG~ali~k~~ap~~~v~~rS~~~~rVl~~le~vailifIPGTIiLY 68 (77)
T PF04713_consen 15 LLMLLMVISGGALIFKRIAPHRSVMMRSATFNRVLTFLEYVAILIFIPGTIILY 68 (77)
T ss_pred HHHHHHHHhChHHHhhccCCcceehhhhhhhHHHHHHHHHHHHHhhcccHHHHH
Confidence 333444456666665444444444444556677788888888888888776544
No 44
>KOG3114|consensus
Probab=63.21 E-value=5.2 Score=33.68 Aligned_cols=31 Identities=6% Similarity=0.163 Sum_probs=22.7
Q ss_pred CCCchhhHHHHHHHHHHHHhhhccchheeee
Q psy6892 8 GLDAASWNIWAVVTLILVFCTATGNILVCLA 38 (190)
Q Consensus 8 ~~~~~~~~~~~v~~~ii~vlgi~gN~lvi~v 38 (190)
...+-.+..+++...+++++++.||+..++.
T Consensus 110 ~~~PDLYGPfWI~~TlVf~l~~~g~~~~~i~ 140 (290)
T KOG3114|consen 110 QDNPDLYGPFWITATLVFALAISGNLATFIR 140 (290)
T ss_pred CCCccccccHHHHHHHHHHHHHcccHHHHHH
Confidence 3355566778888888999999998775543
No 45
>KOG0205|consensus
Probab=61.96 E-value=18 Score=34.29 Aligned_cols=48 Identities=8% Similarity=0.168 Sum_probs=34.3
Q ss_pred HHHHHHHhhhccchheeeeeeccCCCC-chhHHHHHHHHHHHHHHhhhh
Q psy6892 20 VTLILVFCTATGNILVCLAISWEKRLQ-NVTNYFLMSLAITDLMVAVLV 67 (190)
Q Consensus 20 ~~~ii~vlgi~gN~lvi~vi~~~~~L~-~~~~~fl~nLAvaDLl~~l~~ 67 (190)
...+...++++.|.+++++..|.+..+ +|...++...-+++++.+++.
T Consensus 758 ~~a~ylqvsi~sqaliFvtrsr~w~~~erpg~~L~~af~~aqliatlia 806 (942)
T KOG0205|consen 758 MSALYLQVSIISQALIFVTRSRSWSFVERPGWLLLIAFFAAQLIATLIA 806 (942)
T ss_pred HHhhhhhheehhceeeEEEeccCCccccCcHHHHHHHHHHHHHHHHHHH
Confidence 334455667888888888877777765 677777777777888877654
No 46
>PF02532 PsbI: Photosystem II reaction centre I protein (PSII 4.8 kDa protein); InterPro: IPR003686 Oxygenic photosynthesis uses two multi-subunit photosystems (I and II) located in the cell membranes of cyanobacteria and in the thylakoid membranes of chloroplasts in plants and algae. Photosystem II (PSII) has a P680 reaction centre containing chlorophyll 'a' that uses light energy to carry out the oxidation (splitting) of water molecules, and to produce ATP via a proton pump. Photosystem I (PSI) has a P700 reaction centre containing chlorophyll that takes the electron and associated hydrogen donated from PSII to reduce NADP+ to NADPH. Both ATP and NADPH are subsequently used in the light-independent reactions to convert carbon dioxide to glucose using the hydrogen atom extracted from water by PSII, releasing oxygen as a by-product. PSII is a multisubunit protein-pigment complex containing polypeptides both intrinsic and extrinsic to the photosynthetic membrane [, ]. Within the core of the complex, the chlorophyll and beta-carotene pigments are mainly bound to the antenna proteins CP43 (PsbC) and CP47 (PsbB), which pass the excitation energy on to the reaction centre proteins D1 (Qb, PsbA) and D2 (Qa, PsbD) that bind all the redox-active cofactors involved in the energy conversion process. The PSII oxygen-evolving complex (OEC) oxidises water to provide protons for use by PSI, and consists of OEE1 (PsbO), OEE2 (PsbP) and OEE3 (PsbQ). The remaining subunits in PSII are of low molecular weight (less than 10 kDa), and are involved in PSII assembly, stabilisation, dimerisation, and photo-protection []. This family represents the low molecular weight transmembrane protein PsbI, which is tightly associated with the D1/D2 heterodimer in PSII. The function of PsbI is unknown, but it may be involved in the assembly, dimerisation or stabilisation of PSII dimers [].; GO: 0015979 photosynthesis, 0009523 photosystem II, 0009539 photosystem II reaction center, 0016020 membrane; PDB: 3A0H_i 3ARC_I 3A0B_i 3BZ2_I 3PRQ_I 3KZI_I 3PRR_I 2AXT_i 4FBY_I 1S5L_i ....
Probab=60.51 E-value=11 Score=21.46 Aligned_cols=20 Identities=10% Similarity=-0.094 Sum_probs=14.2
Q ss_pred HHHHHHHHHHHHhhhccchh
Q psy6892 15 NIWAVVTLILVFCTATGNIL 34 (190)
Q Consensus 15 ~~~~v~~~ii~vlgi~gN~l 34 (190)
+...++++.++++|.+.|-.
T Consensus 9 y~vV~ffv~LFifGflsnDp 28 (36)
T PF02532_consen 9 YTVVIFFVSLFIFGFLSNDP 28 (36)
T ss_dssp HHHHHHHHHHHHHHHHTTCT
T ss_pred hhhHHHHHHHHhccccCCCC
Confidence 44556777888888888743
No 47
>PF10316 7TM_GPCR_Srbc: Serpentine type 7TM GPCR chemoreceptor Srbc ; InterPro: IPR019420 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class b (Srb) from the Sra superfamily []. Srb receptors contain 6-8 hydrophobic, putative transmembrane, regions and can be distinguished from other 7TM GPCR receptors by their own characteristic TM signatures. Srbc is a solo family amongst the superfamilies of chemoreceptors.
Probab=57.22 E-value=3.4 Score=34.57 Aligned_cols=57 Identities=16% Similarity=0.155 Sum_probs=37.1
Q ss_pred HHHHHHHHHHHhhhccchheeeeeeccCCCCchhHHHHHHHHHHHHHHhhhhcccce
Q psy6892 16 IWAVVTLILVFCTATGNILVCLAISWEKRLQNVTNYFLMSLAITDLMVAVLVMPLGI 72 (190)
Q Consensus 16 ~~~v~~~ii~vlgi~gN~lvi~vi~~~~~L~~~~~~fl~nLAvaDLl~~l~~lP~~l 72 (190)
+..++-++..+++++.|..+++.+...++.+++...++--..+.|.+++....+..+
T Consensus 7 iv~~i~i~~s~~~~~iN~~lL~~if~~Kk~kk~~l~LfY~Rf~~D~~~~~~~~~~~~ 63 (273)
T PF10316_consen 7 IVSIIGIIFSIITCLINFYLLYSIFYSKKKKKPDLSLFYFRFAIDVFYGFSVFIYLI 63 (273)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCEEeeHHHHHHHHHHHHHHHHHHH
Confidence 344455556677788899888887766664455444555567788888877655433
No 48
>PF01102 Glycophorin_A: Glycophorin A; InterPro: IPR001195 Proteins in this group are responsible for the molecular basis of the blood group antigens, surface markers on the outside of the red blood cell membrane. Most of these markers are proteins, but some are carbohydrates attached to lipids or proteins [Reid M.E., Lomas-Francis C. The Blood Group Antigen FactsBook Academic Press, London / San Diego, (1997)]. Glycophorin A (PAS-2) and glycophorin B (PAS-3) belong to the MNS blood group system and are associated with antigens that include M/N, S/s, U, He, Mi(a), M(c), Vw, Mur, M(g), Vr, M(e), Mt(a), St(a), Ri(a), Cl(a), Ny(a), Hut, Hil, M(v), Far, Mit, Dantu, Hop, Nob, En(a), ENKT, amongst others. Glycophorin A is the major sialoglycoprotein of the erythrocyte membrane []. Structurally, glycophorin A consists of an N-terminal extracellular domain, heavily glycosylated on serine and threonine residues, followed by a transmembrane region and a C-terminal cytoplasmic domain. Other glycophorins in this entry such as Glycophorin B and Glycophorin E represent minor sialoglycoproteins in the erythrocyte membrane.; GO: 0016021 integral to membrane; PDB: 2KPF_B 1AFO_B 2KPE_A.
Probab=56.83 E-value=2.6 Score=31.00 Aligned_cols=8 Identities=13% Similarity=0.115 Sum_probs=3.2
Q ss_pred eeeeeccC
Q psy6892 36 CLAISWEK 43 (190)
Q Consensus 36 i~vi~~~~ 43 (190)
+|+++|++
T Consensus 85 ~y~irR~~ 92 (122)
T PF01102_consen 85 SYCIRRLR 92 (122)
T ss_dssp HHHHHHHS
T ss_pred HHHHHHHh
Confidence 34444333
No 49
>CHL00024 psbI photosystem II protein I
Probab=56.59 E-value=9.7 Score=21.65 Aligned_cols=20 Identities=10% Similarity=-0.094 Sum_probs=14.6
Q ss_pred HHHHHHHHHHHHhhhccchh
Q psy6892 15 NIWAVVTLILVFCTATGNIL 34 (190)
Q Consensus 15 ~~~~v~~~ii~vlgi~gN~l 34 (190)
+...++++.++++|.+.|-.
T Consensus 9 y~vV~ffvsLFifGFlsnDp 28 (36)
T CHL00024 9 YTVVIFFVSLFIFGFLSNDP 28 (36)
T ss_pred hhHHHHHHHHHHccccCCCC
Confidence 45566778888888888753
No 50
>PRK02655 psbI photosystem II reaction center I protein I; Provisional
Probab=54.83 E-value=10 Score=21.73 Aligned_cols=20 Identities=10% Similarity=-0.175 Sum_probs=14.5
Q ss_pred HHHHHHHHHHHHhhhccchh
Q psy6892 15 NIWAVVTLILVFCTATGNIL 34 (190)
Q Consensus 15 ~~~~v~~~ii~vlgi~gN~l 34 (190)
+.+.++++.++++|.+.|-.
T Consensus 9 y~vV~ffvsLFiFGflsnDP 28 (38)
T PRK02655 9 YIVVFFFVGLFVFGFLSSDP 28 (38)
T ss_pred hhhHHHHHHHHHcccCCCCC
Confidence 44566778888888888743
No 51
>PF10323 7TM_GPCR_Srv: Serpentine type 7TM GPCR chemoreceptor Srv; InterPro: IPR019426 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class v (Srv) from the Srg superfamily [, ]. Srg receptors contain seven hydrophobic, putative transmembrane, regions and can be distinguished from other 7TM GPCR receptors by their own characteristic TM signatures.
Probab=53.76 E-value=2.4 Score=35.36 Aligned_cols=23 Identities=13% Similarity=0.322 Sum_probs=19.9
Q ss_pred CCchhHHHHHHHHHHHHHHhhhh
Q psy6892 45 LQNVTNYFLMSLAITDLMVAVLV 67 (190)
Q Consensus 45 L~~~~~~fl~nLAvaDLl~~l~~ 67 (190)
++++.+.++.+-+++|++..+..
T Consensus 29 ~~~~Fy~l~~~~~iaDi~~~~~~ 51 (283)
T PF10323_consen 29 FKSTFYTLLIQHCIADILSMLFY 51 (283)
T ss_pred cCCHHHHHHHHHHHHHHHHHHHH
Confidence 67999999999999999986653
No 52
>KOG1832|consensus
Probab=53.50 E-value=2.3 Score=41.19 Aligned_cols=9 Identities=22% Similarity=0.674 Sum_probs=3.6
Q ss_pred hhhhccccc
Q psy6892 99 LFKAGARDE 107 (190)
Q Consensus 99 lfSl~se~~ 107 (190)
++-+.-.+.
T Consensus 1394 ~yEIGR~r~ 1402 (1516)
T KOG1832|consen 1394 MYEIGRRRP 1402 (1516)
T ss_pred hhhhcccCC
Confidence 333444444
No 53
>PF10192 GpcrRhopsn4: Rhodopsin-like GPCR transmembrane domain; InterPro: IPR019336 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). This region of 270 amino acids is the seven transmembrane alpha-helical domains included within five GPCRRHODOPSN4 motifs of a G-protein-coupled-receptor (GPCR) protein, conserved from nematodes to humans [].
Probab=53.23 E-value=18 Score=29.69 Aligned_cols=45 Identities=13% Similarity=0.093 Sum_probs=24.2
Q ss_pred HHHHHHHHHHHHhhhccchheeeeeeccCCCCchhHHHHHHHHHH
Q psy6892 15 NIWAVVTLILVFCTATGNILVCLAISWEKRLQNVTNYFLMSLAIT 59 (190)
Q Consensus 15 ~~~~v~~~ii~vlgi~gN~lvi~vi~~~~~L~~~~~~fl~nLAva 59 (190)
..+..++++++++.++-++..+..+.+++.+|.+...+.+++.+.
T Consensus 29 ~~~~~~~~~~y~vl~~~~~~~~~~l~~~~~~h~~~~l~~~~l~l~ 73 (257)
T PF10192_consen 29 LEIYLLFLLLYIVLSIISIYSIQSLKKRGLMHPVYKLFSAALLLQ 73 (257)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHH
Confidence 333334444444444444444455556666677777777776654
No 54
>PRK03557 zinc transporter ZitB; Provisional
Probab=52.60 E-value=9.2 Score=32.38 Aligned_cols=21 Identities=14% Similarity=0.161 Sum_probs=11.0
Q ss_pred HHHhhhccchheeeeeeccCC
Q psy6892 24 LVFCTATGNILVCLAISWEKR 44 (190)
Q Consensus 24 i~vlgi~gN~lvi~vi~~~~~ 44 (190)
+.++|++.|++..+..++.++
T Consensus 127 v~~~~~~~~~~~~~~~~~~~~ 147 (312)
T PRK03557 127 IAVAGLLANILSFWLLHHGSE 147 (312)
T ss_pred HHHHHHHHHHHHHHHHhcccc
Confidence 344566667665554444333
No 55
>PF11861 DUF3381: Domain of unknown function (DUF3381); InterPro: IPR024576 This uncharacterised domain is found in eukaryotic rRNA methyltransferases like yeast Spb1 and mammalian homologue Ftsj3. Spb1 is required for proper assembly of pre-ribosomal particles during the biogenesis of the 60S ribosomal subunit [, ].
Probab=50.79 E-value=24 Score=27.07 Aligned_cols=6 Identities=50% Similarity=0.540 Sum_probs=2.7
Q ss_pred HHHHHH
Q psy6892 58 ITDLMV 63 (190)
Q Consensus 58 vaDLl~ 63 (190)
|.||-+
T Consensus 69 c~DLKV 74 (159)
T PF11861_consen 69 CKDLKV 74 (159)
T ss_pred HHHHHh
Confidence 345444
No 56
>PF05297 Herpes_LMP1: Herpesvirus latent membrane protein 1 (LMP1); InterPro: IPR007961 This family consists of several latent membrane protein 1 or LMP1s mostly from Epstein-Barr virus (strain GD1) (HHV-4) (Human herpesvirus 4). LMP1 of HHV-4 is a 62-65 kDa plasma membrane protein possessing six membrane spanning regions, a short cytoplasmic N terminus and a long cytoplasmic carboxy tail of 200 amino acids. HHV-4 virus latent membrane protein 1 (LMP1) is essential for HHV-4 mediated transformation and has been associated with several cases of malignancies. HHV-4-like viruses in Macaca fascicularis (Cynomolgus monkeys) have been associated with high lymphoma rates in immunosuppressed monkeys [].; GO: 0019087 transformation of host cell by virus, 0016021 integral to membrane; PDB: 1CZY_E 1ZMS_B.
Probab=50.57 E-value=5.2 Score=33.80 Aligned_cols=7 Identities=29% Similarity=0.596 Sum_probs=0.0
Q ss_pred eccCCCC
Q psy6892 40 SWEKRLQ 46 (190)
Q Consensus 40 ~~~~~L~ 46 (190)
..+|+|-
T Consensus 71 lF~RrLL 77 (381)
T PF05297_consen 71 LFKRRLL 77 (381)
T ss_dssp -------
T ss_pred HHHHhhc
Confidence 3444443
No 57
>PF14851 FAM176: FAM176 family
Probab=50.20 E-value=5.8 Score=30.30 Aligned_cols=34 Identities=18% Similarity=0.221 Sum_probs=18.1
Q ss_pred hccchheeeeeeccCCCCchhHHHHHHHHHHHHHH
Q psy6892 29 ATGNILVCLAISWEKRLQNVTNYFLMSLAITDLMV 63 (190)
Q Consensus 29 i~gN~lvi~vi~~~~~L~~~~~~fl~nLAvaDLl~ 63 (190)
|+.|.+..++.++... =+..=||+++.++.=+++
T Consensus 4 llSnsLaaya~I~~~P-E~~aLYFv~gVC~GLlLt 37 (153)
T PF14851_consen 4 LLSNSLAAYAHIRDNP-ERFALYFVSGVCAGLLLT 37 (153)
T ss_pred HHHHHHHHHHHHHhCh-HHHHHHHHHHHHHHHHHH
Confidence 4567777666554332 233345666666554443
No 58
>PF10318 7TM_GPCR_Srh: Serpentine type 7TM GPCR chemoreceptor Srh; InterPro: IPR019422 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. Srh is part of the Str superfamily of chemoreceptors [].
Probab=50.03 E-value=17 Score=30.13 Aligned_cols=60 Identities=15% Similarity=0.319 Sum_probs=35.6
Q ss_pred HHHHHHHHHHhhhccchheeeeeecc-CCCCchhHHHHHHHH----HHHHHHhhhhcccceeeec
Q psy6892 17 WAVVTLILVFCTATGNILVCLAISWE-KRLQNVTNYFLMSLA----ITDLMVAVLVMPLGILTLV 76 (190)
Q Consensus 17 ~~v~~~ii~vlgi~gN~lvi~vi~~~-~~L~~~~~~fl~nLA----vaDLl~~l~~lP~~l~~~l 76 (190)
+..+.-++.+++++-|++.++++.++ ++-.+...+.+.|+. +.|+..+++..|..+....
T Consensus 6 ~~~~~h~~~~i~~Pi~~~~~y~Il~ktp~~m~~~k~~ll~~~~~~~~~d~~~~~l~~P~~~~P~~ 70 (302)
T PF10318_consen 6 YSIVLHIITIISIPIYIFGFYCILFKTPKSMKSVKWYLLNHHFWNFLLDLYLSFLIIPYFFFPSP 70 (302)
T ss_pred HHHHHHHHHHHHHHHHHHHHHheeEeChhhhHHHHHHHHHHHHHHHHHHHHhhhEEeeeeccchH
Confidence 34455566677777787777665543 433333444444444 4688888877776555443
No 59
>KOG1482|consensus
Probab=49.60 E-value=6.4 Score=34.30 Aligned_cols=39 Identities=13% Similarity=0.260 Sum_probs=30.8
Q ss_pred HHHHHHHhhhhcccceeeecccccccccccceeeccchh
Q psy6892 57 AITDLMVAVLVMPLGILTLVRVRTNLSGAVLRLKKNILC 95 (190)
Q Consensus 57 AvaDLl~~l~~lP~~l~~~l~~~~~~s~~lC~i~~~i~c 95 (190)
++.|++.++..+...++.++.+.|......|.+...+..
T Consensus 233 VlGDliQSvGV~iaa~Ii~f~P~~~i~DpICT~~FSiiv 271 (379)
T KOG1482|consen 233 VLGDLIQSVGVLIAALIIYFKPEYKIADPICTFVFSIIV 271 (379)
T ss_pred HHHHHHHHHHHHhhheeEEecccceecCchhhhhHHHHH
Confidence 356899888888888888888899999999987654444
No 60
>PF10329 DUF2417: Region of unknown function (DUF2417); InterPro: IPR019431 This entry represents a family of fungal proteins with no known function. In some cases these proteins also contain an alpha/beta hydrolase fold (IPR000073 from INTERPRO).
Probab=49.55 E-value=11 Score=30.79 Aligned_cols=14 Identities=36% Similarity=0.608 Sum_probs=5.7
Q ss_pred HHHHHHHHHHHHHH
Q psy6892 50 NYFLMSLAITDLMV 63 (190)
Q Consensus 50 ~~fl~nLAvaDLl~ 63 (190)
++.++.|-+.|+++
T Consensus 105 ~~vl~~Lllvdlil 118 (232)
T PF10329_consen 105 NIVLAGLLLVDLIL 118 (232)
T ss_pred HHHHHHHHHHHHHH
Confidence 33444444444433
No 61
>KOG4349|consensus
Probab=48.22 E-value=8.8 Score=28.24 Aligned_cols=19 Identities=16% Similarity=0.100 Sum_probs=9.1
Q ss_pred hhccchheeeeeeccCCCC
Q psy6892 28 TATGNILVCLAISWEKRLQ 46 (190)
Q Consensus 28 gi~gN~lvi~vi~~~~~L~ 46 (190)
.+..-++.++.++++++-|
T Consensus 61 h~~A~ilalFTLRkHkygr 79 (143)
T KOG4349|consen 61 HLGATILALFTLRKHKYGR 79 (143)
T ss_pred HhhHHHHHHHHhhcCccce
Confidence 3444445555555555444
No 62
>KOG2592|consensus
Probab=48.20 E-value=18 Score=31.83 Aligned_cols=30 Identities=13% Similarity=0.191 Sum_probs=20.1
Q ss_pred CchhhHHHHHHHHHHHHhhhccchheeeee
Q psy6892 10 DAASWNIWAVVTLILVFCTATGNILVCLAI 39 (190)
Q Consensus 10 ~~~~~~~~~v~~~ii~vlgi~gN~lvi~vi 39 (190)
+...+..++++.++.+++++++-+++.+.+
T Consensus 177 sr~wy~~Ll~~T~~~Y~~s~~~~~l~fv~f 206 (426)
T KOG2592|consen 177 SRFWYAALLGVTLLMYLLSLVATVLLFVYF 206 (426)
T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHhheEe
Confidence 344566777778888888888765544333
No 63
>COG2181 NarI Nitrate reductase gamma subunit [Energy production and conversion]
Probab=46.94 E-value=18 Score=29.48 Aligned_cols=30 Identities=13% Similarity=0.400 Sum_probs=22.7
Q ss_pred CCCCchhHHHHHHHHHHHHHHhhhhcccce
Q psy6892 43 KRLQNVTNYFLMSLAITDLMVAVLVMPLGI 72 (190)
Q Consensus 43 ~~L~~~~~~fl~nLAvaDLl~~l~~lP~~l 72 (190)
|...+|.++|+..|-++-++.++...++..
T Consensus 118 R~~St~~D~f~lilLla~~~~Gl~~~~~~~ 147 (228)
T COG2181 118 RATSTPSDIFALLLLLAQLLLGLYATPFSA 147 (228)
T ss_pred eecCChhHHHHHHHHHHHHHhhhhhhhhcc
Confidence 334578889988888888888887777654
No 64
>COG5406 Nucleosome binding factor SPN, SPT16 subunit [Transcription / DNA replication, recombination, and repair / Chromatin structure and dynamics]
Probab=46.65 E-value=4.2 Score=37.94 Aligned_cols=9 Identities=33% Similarity=0.284 Sum_probs=3.8
Q ss_pred hhhhhhhhh
Q psy6892 157 EEEEDKQGE 165 (190)
Q Consensus 157 ~~~~~~~~~ 165 (190)
++.|++++.
T Consensus 978 E~gEDwdel 986 (1001)
T COG5406 978 EDGEDWDEL 986 (1001)
T ss_pred ccccchhhH
Confidence 444444443
No 65
>PF14940 TMEM219: Transmembrane 219
Probab=46.54 E-value=12 Score=30.50 Aligned_cols=45 Identities=31% Similarity=0.262 Sum_probs=26.4
Q ss_pred HHHHHHHHHHhhhccchheeeeeeccCCCCch-----hHHHHHHHHHHHHHH
Q psy6892 17 WAVVTLILVFCTATGNILVCLAISWEKRLQNV-----TNYFLMSLAITDLMV 63 (190)
Q Consensus 17 ~~v~~~ii~vlgi~gN~lvi~vi~~~~~L~~~-----~~~fl~nLAvaDLl~ 63 (190)
+.++++.+.++||.+ +.+..+.+...+++| -|.|+.+|+=.|+++
T Consensus 11 lVvF~l~Ll~~aI~~--l~Lg~yi~~~~l~nPDi~~DWN~fL~~ls~l~fC~ 60 (223)
T PF14940_consen 11 LVVFTLCLLLLAISF--LCLGYYIKRNELKNPDIPQDWNTFLLSLSQLDFCV 60 (223)
T ss_pred chHHHHHHHHHHHHH--heeeeEecccCCCcccchhhHHHHHHhhcCeeEcc
Confidence 344444444455433 333333355566664 478998888878777
No 66
>PHA02702 ORF033 IMV membrane protein; Provisional
Probab=46.21 E-value=15 Score=24.52 Aligned_cols=26 Identities=19% Similarity=0.272 Sum_probs=18.7
Q ss_pred HHHHHHHHHHHHHhhhhcccceeeec
Q psy6892 51 YFLMSLAITDLMVAVLVMPLGILTLV 76 (190)
Q Consensus 51 ~fl~nLAvaDLl~~l~~lP~~l~~~l 76 (190)
.+.--|.+.+++..++.+|..++.+.
T Consensus 41 ~~~Rvltvle~va~l~~IPgtIiLY~ 66 (78)
T PHA02702 41 GALRVLTVLDFVSLLTTIPCTIILYF 66 (78)
T ss_pred cchhHHHHHHHHHHHHHhchHHHHHH
Confidence 45556788888888888887766543
No 67
>COG1230 CzcD Co/Zn/Cd efflux system component [Inorganic ion transport and metabolism]
Probab=45.93 E-value=12 Score=31.81 Aligned_cols=57 Identities=16% Similarity=0.065 Sum_probs=34.8
Q ss_pred HHHHHHHhhhccchheeeeeeccCCCCchhHHHHHHHHH-HHHHHhhhhcccceeeeccc
Q psy6892 20 VTLILVFCTATGNILVCLAISWEKRLQNVTNYFLMSLAI-TDLMVAVLVMPLGILTLVRV 78 (190)
Q Consensus 20 ~~~ii~vlgi~gN~lvi~vi~~~~~L~~~~~~fl~nLAv-aDLl~~l~~lP~~l~~~l~~ 78 (190)
.++++.++|++.|++..+.+.+.+. ...|.=-..|.+ +|++.++..+...++.++.+
T Consensus 126 ~ml~va~~GL~vN~~~a~ll~~~~~--~~lN~r~a~LHvl~D~Lgsv~vIia~i~i~~~~ 183 (296)
T COG1230 126 GMLVVAIIGLVVNLVSALLLHKGHE--ENLNMRGAYLHVLGDALGSVGVIIAAIVIRFTG 183 (296)
T ss_pred chHHHHHHHHHHHHHHHHHhhCCCc--ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHhC
Confidence 3556778899999998888776511 123443444443 68887766555555544444
No 68
>PRK13183 psbN photosystem II reaction center protein N; Provisional
Probab=44.92 E-value=29 Score=20.93 Aligned_cols=32 Identities=16% Similarity=0.083 Sum_probs=16.5
Q ss_pred HHHHHHHHHHhhhccchheeeeeeccCCCCch
Q psy6892 17 WAVVTLILVFCTATGNILVCLAISWEKRLQNV 48 (190)
Q Consensus 17 ~~v~~~ii~vlgi~gN~lvi~vi~~~~~L~~~ 48 (190)
++++++..+++|+.|-.+-...=--.+.||.|
T Consensus 9 ~~~i~i~~lL~~~TgyaiYtaFGppSk~LrDP 40 (46)
T PRK13183 9 SLAITILAILLALTGFGIYTAFGPPSKELDDP 40 (46)
T ss_pred HHHHHHHHHHHHHhhheeeeccCCcccccCCc
Confidence 34455566666776654433332234556655
No 69
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=44.02 E-value=25 Score=26.15 Aligned_cols=32 Identities=19% Similarity=0.151 Sum_probs=17.3
Q ss_pred hHHHHHHHHHHHHhhhccchheeeeeeccCCC
Q psy6892 14 WNIWAVVTLILVFCTATGNILVCLAISWEKRL 45 (190)
Q Consensus 14 ~~~~~v~~~ii~vlgi~gN~lvi~vi~~~~~L 45 (190)
+.++.+++++++.+.+.|-++.++-..|+++|
T Consensus 100 Yia~~~il~il~~i~is~~~~~~yr~~r~~~~ 131 (139)
T PHA03099 100 YIPSPGIVLVLVGIIITCCLLSVYRFTRRTKL 131 (139)
T ss_pred hhhhhHHHHHHHHHHHHHHHHhhheeeecccC
Confidence 34445555556666666655555554444443
No 70
>TIGR00869 sec62 protein translocation protein, Sec62 family. protein secretary systems of yeast microsomes. They are also the non-selective cation (NS) channels of the mammalian cytoplasmic membrane. The yeast Sec62 protein has been shown to be essential for cell growth. The mammalian NS channel proteins has been implicated in platelet derived growth factor(PGDF) dependent single channel current in fibroblasts. These channels are essentially closed in serum deprived tissue-culture cells and are specifically opened by exposure to PDGF. These channels are reported to exhibit equal selectivity for Na+, K+ and Cs+ with low permeability to Ca2+, and no permeability to anions.
Probab=43.38 E-value=16 Score=29.79 Aligned_cols=6 Identities=0% Similarity=-0.257 Sum_probs=2.7
Q ss_pred cccccc
Q psy6892 80 TNLSGA 85 (190)
Q Consensus 80 ~~~s~~ 85 (190)
|.++..
T Consensus 184 WlfPNL 189 (232)
T TIGR00869 184 WIFPNL 189 (232)
T ss_pred eeecch
Confidence 444433
No 71
>CHL00020 psbN photosystem II protein N
Probab=43.31 E-value=30 Score=20.61 Aligned_cols=32 Identities=16% Similarity=0.078 Sum_probs=15.9
Q ss_pred HHHHHHHHHHhhhccchheeeeeeccCCCCch
Q psy6892 17 WAVVTLILVFCTATGNILVCLAISWEKRLQNV 48 (190)
Q Consensus 17 ~~v~~~ii~vlgi~gN~lvi~vi~~~~~L~~~ 48 (190)
++++++...++|+.|-.+-...=--.+.||.|
T Consensus 6 ~~~i~i~~ll~~~Tgy~iYtaFGppSk~LrDP 37 (43)
T CHL00020 6 LVAIFISGLLVSFTGYALYTAFGQPSKQLRDP 37 (43)
T ss_pred hHHHHHHHHHHHhhheeeeeccCCchhccCCc
Confidence 34455556666666654433322234455554
No 72
>PF10856 DUF2678: Protein of unknown function (DUF2678); InterPro: IPR022564 This family of proteins has no known function.
Probab=43.20 E-value=33 Score=24.99 Aligned_cols=9 Identities=11% Similarity=0.526 Sum_probs=3.9
Q ss_pred HHHHHHHHH
Q psy6892 50 NYFLMSLAI 58 (190)
Q Consensus 50 ~~fl~nLAv 58 (190)
.+|.+++.+
T Consensus 62 iffavcI~l 70 (118)
T PF10856_consen 62 IFFAVCILL 70 (118)
T ss_pred EehHHHHHH
Confidence 344444443
No 73
>PF02009 Rifin_STEVOR: Rifin/stevor family; InterPro: IPR002858 Malaria is still a major cause of mortality in many areas of the world. Plasmodium falciparum causes the most severe human form of the disease and is responsible for most fatalities. Severe cases of malaria can occur when the parasite invades and then proliferates within red blood cell erythrocytes. The parasite produces many variant antigenic proteins, encoded by multigene families, which are present on the surface of the infected erythrocyte and play important roles in virulence. A crucial survival mechanism for the malaria parasite is its ability to evade the immune response by switching these variant surface antigens. The high virulence of P. falciparum relative to other malarial parasites is in large part due to the fact that in this organism many of these surface antigens mediate the binding of infected erythrocytes to the vascular endothelium (cytoadherence) and non-infected erythrocytes (rosetting). This can lead to the accumulation of infected cells in the vasculature of a variety of organs, blocking the blood flow and reducing the oxygen supply. Clinical symptoms of severe infection can include fever, progressive anaemia, multi-organ dysfunction and coma. For more information see []. Several multicopy gene families have been described in Plasmodium falciparum, including the stevor family of subtelomeric open reading frames and the rif interspersed repetitive elements. Both families contain three predicted transmembrane segments. It has been proposed that stevor and rif are members of a larger superfamily that code for variant surface antigens [].
Probab=42.44 E-value=7.2 Score=33.08 Aligned_cols=21 Identities=10% Similarity=0.064 Sum_probs=10.3
Q ss_pred HHHhhhccchheeeeeeccCC
Q psy6892 24 LVFCTATGNILVCLAISWEKR 44 (190)
Q Consensus 24 i~vlgi~gN~lvi~vi~~~~~ 44 (190)
+.++.|+.-.++|+.|+|+||
T Consensus 263 iaIliIVLIMvIIYLILRYRR 283 (299)
T PF02009_consen 263 IAILIIVLIMVIIYLILRYRR 283 (299)
T ss_pred HHHHHHHHHHHHHHHHHHHHH
Confidence 333333333455566666554
No 74
>PF15330 SIT: SHP2-interacting transmembrane adaptor protein, SIT
Probab=42.06 E-value=12 Score=26.85 Aligned_cols=30 Identities=10% Similarity=0.174 Sum_probs=20.0
Q ss_pred HHHHHHHHHHHhhhccchheeeeeeccCCC
Q psy6892 16 IWAVVTLILVFCTATGNILVCLAISWEKRL 45 (190)
Q Consensus 16 ~~~v~~~ii~vlgi~gN~lvi~vi~~~~~L 45 (190)
++.+++++++++.++.|+++.....+..+.
T Consensus 2 ~Ll~il~llLll~l~asl~~wr~~~rq~k~ 31 (107)
T PF15330_consen 2 LLLGILALLLLLSLAASLLAWRMKQRQKKA 31 (107)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHhhhccc
Confidence 345667777777888888876666555443
No 75
>KOG4564|consensus
Probab=41.98 E-value=18 Score=32.64 Aligned_cols=51 Identities=14% Similarity=0.126 Sum_probs=33.3
Q ss_pred HHHHHHHhhhccchheeeeeeccCCCCchhHHHHHHHHHHHHHHhhhhccc
Q psy6892 20 VTLILVFCTATGNILVCLAISWEKRLQNVTNYFLMSLAITDLMVAVLVMPL 70 (190)
Q Consensus 20 ~~~ii~vlgi~gN~lvi~vi~~~~~L~~~~~~fl~nLAvaDLl~~l~~lP~ 70 (190)
+|.+-+-++++.=++.+.++..-|+||-..|++=.||.++=++.+++.+..
T Consensus 150 lytvGyslSl~sL~vAl~If~~FR~L~CtRn~IH~nLF~SfiLra~~~~i~ 200 (473)
T KOG4564|consen 150 LYTVGYSLSLVSLLVALIIFLYFRSLHCTRNYIHMNLFASFILRAASVLIK 200 (473)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhhcchHHHHHHHHHHHHHHHHHHHHHH
Confidence 444444444444444555666778899888888889988877766554443
No 76
>PF13074 DUF3938: Protein of unknown function (DUF3938)
Probab=41.62 E-value=66 Score=21.64 Aligned_cols=17 Identities=18% Similarity=0.333 Sum_probs=11.6
Q ss_pred cchhhhhhhhhcccccc
Q psy6892 92 NILCENQLFKAGARDET 108 (190)
Q Consensus 92 ~i~c~~slfSl~se~~~ 108 (190)
..+|.+..|.+..+.++
T Consensus 78 iafcladvfnllqdnee 94 (102)
T PF13074_consen 78 IAFCLADVFNLLQDNEE 94 (102)
T ss_pred HHHHHHHHHHHhcCchh
Confidence 34788888888776544
No 77
>PF03348 Serinc: Serine incorporator (Serinc); InterPro: IPR005016 This is a family of proteins which display differential expression in various tumour and cell lines. The function of these proteins is unknown. ; GO: 0016020 membrane
Probab=41.07 E-value=46 Score=29.73 Aligned_cols=55 Identities=18% Similarity=0.267 Sum_probs=30.6
Q ss_pred hhhHHHHHHHHHHHHhhhccchheeeeeeccCCCCchhHHH--HHHHHHHHHHHhhhhcc
Q psy6892 12 ASWNIWAVVTLILVFCTATGNILVCLAISWEKRLQNVTNYF--LMSLAITDLMVAVLVMP 69 (190)
Q Consensus 12 ~~~~~~~v~~~ii~vlgi~gN~lvi~vi~~~~~L~~~~~~f--l~nLAvaDLl~~l~~lP 69 (190)
..+..++++.++++++++.+.++..+.+. ... -..|.+ .+||.++=++..+...|
T Consensus 179 ~w~~~Li~~T~~~y~~si~~~v~~y~~f~-~~~--C~lN~~fIt~nliL~vi~s~lSv~p 235 (429)
T PF03348_consen 179 RWYIALIGVTLLFYAASIAGIVLMYVFFT-PSG--CSLNKFFITFNLILCVIISVLSVLP 235 (429)
T ss_pred eehhHHHHHHHHHHHHHHHHHHHHHHHhC-CCC--CchhHHHHHHHHHHHHHHHHHHhhh
Confidence 34566666777778888877666554433 221 223433 36666665555444445
No 78
>KOG0476|consensus
Probab=40.35 E-value=18 Score=34.59 Aligned_cols=31 Identities=23% Similarity=0.354 Sum_probs=19.5
Q ss_pred HHHHHHHHHHHhhhhcccceeeecccccccc
Q psy6892 53 LMSLAITDLMVAVLVMPLGILTLVRVRTNLS 83 (190)
Q Consensus 53 l~nLAvaDLl~~l~~lP~~l~~~l~~~~~~s 83 (190)
++--+++-++++.+..|..+-.++.+...+.
T Consensus 372 llyp~~~a~v~ssltfP~GlG~f~aG~l~f~ 402 (931)
T KOG0476|consen 372 LLYPAFIALVFSSLTFPPGLGQFLAGRLSFR 402 (931)
T ss_pred cHHHHHHHHHHhheecCCcccccccccccHH
Confidence 3344566677777888877766666654443
No 79
>PF12606 RELT: Tumour necrosis factor receptor superfamily member 19; InterPro: IPR022248 The members of tumor necrosis factor receptor (TNFR) superfamily have been designated as the "guardians of the immune system" due to their roles in immune cell proliferation, differentiation, activation, and death (apoptosis). RELT (receptor expressed in lymphoid tissues) is a member of the TNFR superfamily. The messenger RNA of RELT is especially abundant in hematologic tissues such as spleen, lymph node, and peripheral blood leukocytes as well as in leukemias and lymphomas. RELT is able to activate the NF-kappaB pathway and selectively binds tumor necrosis factor receptor-associated factor 1 []. RELT like proteins 1 and 2 (RELL1 and RELL2) are two RELT homologues that bind to RELT. The expression of RELL1 at the mRNA level is ubiquitous, whereas expression of RELL2 mRNA is more restricted to particular tissues [].
Probab=39.97 E-value=7.7 Score=23.91 Aligned_cols=9 Identities=22% Similarity=0.419 Sum_probs=3.5
Q ss_pred HHHHhhhcc
Q psy6892 23 ILVFCTATG 31 (190)
Q Consensus 23 ii~vlgi~g 31 (190)
+++++|++|
T Consensus 9 i~iv~~lLg 17 (50)
T PF12606_consen 9 IFIVMGLLG 17 (50)
T ss_pred HHHHHHHHH
Confidence 333444433
No 80
>PF06398 Pex24p: Integral peroxisomal membrane peroxin; InterPro: IPR010482 Peroxisomes play diverse roles in the cell, compartmentalising many activities related to lipid metabolism and functioning in the decomposition of toxic hydrogen peroxide. Sequence similarity was identified between two hypothetical proteins and the peroxin integral membrane protein Pex24p [].
Probab=38.14 E-value=39 Score=28.98 Aligned_cols=52 Identities=6% Similarity=-0.018 Sum_probs=29.5
Q ss_pred HHHHHHHHHHhhhccchheeeeeeccCCCCchhHHHHHHHHHHHHHHhhhhcc
Q psy6892 17 WAVVTLILVFCTATGNILVCLAISWEKRLQNVTNYFLMSLAITDLMVAVLVMP 69 (190)
Q Consensus 17 ~~v~~~ii~vlgi~gN~lvi~vi~~~~~L~~~~~~fl~nLAvaDLl~~l~~lP 69 (190)
+...+..+|.+.++.+.++-++.|+++. ++.+.++++.+.|......++.+|
T Consensus 15 l~~~l~~~f~~~~~~d~vl~il~W~~p~-~t~~~L~l~t~~~l~p~l~l~~lp 66 (359)
T PF06398_consen 15 LSSRLGPIFPFQLILDRVLRILTWTNPD-YTLSFLLLYTFLCLNPYLLLLSLP 66 (359)
T ss_pred HHHHHHHhhHHHHHHHHHHHeEEeCCCC-cchHHHHHHHHHHHHHHHHHHHHH
Confidence 3344455555555556666667777776 455556666666655555444344
No 81
>PF05745 CRPA: Chlamydia 15 kDa cysteine-rich outer membrane protein (CRPA); InterPro: IPR008436 Chlamydia is a genus of bacteria, which causes the most common bacterial sexually transmitted diseases. They are obligate intracellular bacterial pathogens. Members of this genus lack a peptidoglycan layer, but as a substitute, it has been proposed that they have several cysteine rich membrane proteins. This includes the major outer membrane protein (MOMP). These form disulphide bonds to provide rigidity to the cell wall. The alignment of the amino acid sequences of the MOMP from various serovars of Chlamydia show that they have between seven and ten cysteine residues; seven of which are highly conserved []. The MOMP has been the focus of efforts to produce a vaccine for Chlamydia trachomatis []. The 15 kDa cysteine-rich protein in this entry is a multi-pass outer membrane protein. They are associated with the differentiation of reticulate bodies (RBs) into elementary bodies (EBs) []. They immunolocalise to the inclusion membrane, which is the membrane that surrounds the intracellular parasite. These proteins are recognised by CD8+ T cells in both human and mouse infections, suggesting they gain access to the host cytoplasm.; GO: 0019867 outer membrane
Probab=38.03 E-value=47 Score=24.74 Aligned_cols=10 Identities=20% Similarity=0.577 Sum_probs=4.5
Q ss_pred HHHHHHHhhh
Q psy6892 57 AITDLMVAVL 66 (190)
Q Consensus 57 AvaDLl~~l~ 66 (190)
.+..++++.+
T Consensus 104 Glvkll~ts~ 113 (150)
T PF05745_consen 104 GLVKLLITSL 113 (150)
T ss_pred HHHHHHHHHH
Confidence 3445555443
No 82
>COG4420 Predicted membrane protein [Function unknown]
Probab=38.00 E-value=67 Score=25.42 Aligned_cols=38 Identities=13% Similarity=0.426 Sum_probs=22.0
Q ss_pred HHHHHHHHHHhhhccchheeeeeeccCCCCchhHHHHHHHHHH
Q psy6892 17 WAVVTLILVFCTATGNILVCLAISWEKRLQNVTNYFLMSLAIT 59 (190)
Q Consensus 17 ~~v~~~ii~vlgi~gN~lvi~vi~~~~~L~~~~~~fl~nLAva 59 (190)
+++.+++++++=|+.|+.++....+.+ .-|++.||.++
T Consensus 60 fil~~~~~ll~Wi~lNl~~~~~~~wDp-----yPFi~LnLllS 97 (191)
T COG4420 60 FILTFTLLLLLWIVLNLFLVPGLAWDP-----YPFILLNLLLS 97 (191)
T ss_pred HHHHHHHHHHHHHHHHHhhhcCCcCCC-----ccHHHHHHHHH
Confidence 334444555555677877766543333 34666777765
No 83
>PF05915 DUF872: Eukaryotic protein of unknown function (DUF872); InterPro: IPR008590 This entry represents several uncharacterised eukaryotic transmembrane proteins. The function of this currently unknown.
Probab=37.38 E-value=72 Score=23.08 Aligned_cols=14 Identities=21% Similarity=0.000 Sum_probs=5.2
Q ss_pred HHHHHHHHhhhccc
Q psy6892 19 VVTLILVFCTATGN 32 (190)
Q Consensus 19 v~~~ii~vlgi~gN 32 (190)
+++++++++.++..
T Consensus 48 ~~Lli~G~~li~~g 61 (115)
T PF05915_consen 48 VFLLIFGTVLIIIG 61 (115)
T ss_pred HHHHHHHHHHHHHH
Confidence 33333333333333
No 84
>PF09802 Sec66: Preprotein translocase subunit Sec66; InterPro: IPR018624 Members of this family of proteins are a component of the heterotetrameric Sec62/63 complex composed of SEC62, SEC63, SEC66 and SEC72. The Sec62/63 complex associates with the Sec61 complex to form the Sec complex. Sec 66 is involved in SRP-independent post-translational translocation across the endoplasmic reticulum and functions together with the Sec61 complex and KAR2 in a channel-forming translocon complex. Furthermore, Sec66 is also required for growth at elevated temperatures [, , , ].
Probab=36.72 E-value=5.6 Score=31.51 Aligned_cols=45 Identities=16% Similarity=0.320 Sum_probs=25.1
Q ss_pred HHHHhhhccchheeeeeeccCCCCc---hhHHHHHHHHHHHHHHhhhhc
Q psy6892 23 ILVFCTATGNILVCLAISWEKRLQN---VTNYFLMSLAITDLMVAVLVM 68 (190)
Q Consensus 23 ii~vlgi~gN~lvi~vi~~~~~L~~---~~~~fl~nLAvaDLl~~l~~l 68 (190)
++|+..++|-+.++..++|+|+++. ...+|.-|++ .|+.+++..+
T Consensus 10 ~~Y~~vl~~sl~~Fs~~YRkr~~~~~~~l~p~F~~~~~-rdiY~sL~~~ 57 (190)
T PF09802_consen 10 LAYVAVLVGSLATFSSIYRKRKAAKSASLEPWFPEHLQ-RDIYLSLLHM 57 (190)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCchhH-HHHHHHHHhc
Confidence 3344444555556656677766542 3456666664 5666666544
No 85
>PF14494 DUF4436: Domain of unknown function (DUF4436)
Probab=35.79 E-value=25 Score=29.19 Aligned_cols=29 Identities=3% Similarity=0.135 Sum_probs=13.5
Q ss_pred HHHHHHHhhhccchheeeeeeccCCCCch
Q psy6892 20 VTLILVFCTATGNILVCLAISWEKRLQNV 48 (190)
Q Consensus 20 ~~~ii~vlgi~gN~lvi~vi~~~~~L~~~ 48 (190)
+++++.++++++=++.+.++.++|++..+
T Consensus 173 i~~vm~~L~~~al~~ai~~~~~rr~~~~~ 201 (256)
T PF14494_consen 173 ICLVMWALATLALFVAIQVLRRRRKFEPP 201 (256)
T ss_pred HHHHHHHHHHHHHHHHHHHHhCccccccc
Confidence 33333444443333444455555666555
No 86
>PF02468 PsbN: Photosystem II reaction centre N protein (psbN); InterPro: IPR003398 Oxygenic photosynthesis uses two multi-subunit photosystems (I and II) located in the cell membranes of cyanobacteria and in the thylakoid membranes of chloroplasts in plants and algae. Photosystem II (PSII) has a P680 reaction centre containing chlorophyll 'a' that uses light energy to carry out the oxidation (splitting) of water molecules, and to produce ATP via a proton pump. Photosystem I (PSI) has a P700 reaction centre containing chlorophyll that takes the electron and associated hydrogen donated from PSII to reduce NADP+ to NADPH. Both ATP and NADPH are subsequently used in the light-independent reactions to convert carbon dioxide to glucose using the hydrogen atom extracted from water by PSII, releasing oxygen as a by-product. PSII is a multisubunit protein-pigment complex containing polypeptides both intrinsic and extrinsic to the photosynthetic membrane [, ]. Within the core of the complex, the chlorophyll and beta-carotene pigments are mainly bound to the antenna proteins CP43 (PsbC) and CP47 (PsbB), which pass the excitation energy on to the reaction centre proteins D1 (Qb, PsbA) and D2 (Qa, PsbD) that bind all the redox-active cofactors involved in the energy conversion process. The PSII oxygen-evolving complex (OEC) oxidises water to provide protons for use by PSI, and consists of OEE1 (PsbO), OEE2 (PsbP) and OEE3 (PsbQ). The remaining subunits in PSII are of low molecular weight (less than 10 kDa), and are involved in PSII assembly, stabilisation, dimerisation, and photo-protection []. This family represents the low molecular weight transmembrane protein PsbN found in PSII. PsbN may have a role in PSII stability, however its actual function unknown. PsbN does not appear to be essential for photoautotrophic growth or normal PSII function.; GO: 0015979 photosynthesis, 0009523 photosystem II, 0009539 photosystem II reaction center, 0016020 membrane
Probab=35.60 E-value=37 Score=20.24 Aligned_cols=31 Identities=13% Similarity=0.099 Sum_probs=14.9
Q ss_pred HHHHHHHHHhhhccchheeeeeeccCCCCch
Q psy6892 18 AVVTLILVFCTATGNILVCLAISWEKRLQNV 48 (190)
Q Consensus 18 ~v~~~ii~vlgi~gN~lvi~vi~~~~~L~~~ 48 (190)
+++++.+.++|+.|-.+-...=--.+.||.|
T Consensus 7 ~~i~i~~~lv~~Tgy~iYtaFGppSk~LrDP 37 (43)
T PF02468_consen 7 LAIFISCLLVSITGYAIYTAFGPPSKELRDP 37 (43)
T ss_pred HHHHHHHHHHHHHhhhhhheeCCCccccCCc
Confidence 4455555666666633322222234556654
No 87
>PF05934 MCLC: Mid-1-related chloride channel (MCLC); InterPro: IPR009231 This entry consists of several Chloride channel CLIC-like proteins, which function as a chloride channel when incorporated in the planar lipid bilayer [].
Probab=35.16 E-value=15 Score=33.35 Aligned_cols=27 Identities=11% Similarity=0.132 Sum_probs=14.2
Q ss_pred hHHHHHHHHHHHHhhhccchheeeeee
Q psy6892 14 WNIWAVVTLILVFCTATGNILVCLAIS 40 (190)
Q Consensus 14 ~~~~~v~~~ii~vlgi~gN~lvi~vi~ 40 (190)
+.++.+++++++++.|+.+-+..++.|
T Consensus 181 ~t~l~v~l~lL~IV~iVAteLwt~V~W 207 (549)
T PF05934_consen 181 YTVLMVLLCLLCIVAIVATELWTYVSW 207 (549)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 345555555566666665554444443
No 88
>PF09788 Tmemb_55A: Transmembrane protein 55A; InterPro: IPR019178 Members of this family catalyse the hydrolysis of the 4-position phosphate of phosphatidylinositol 4,5-bisphosphate, in the reaction: 1-phosphatidyl-myo-inositol 4,5-bisphosphate + H(2)O = 1-phosphatidyl-1D-myo-inositol 5-phosphate + phosphate.
Probab=34.98 E-value=55 Score=27.13 Aligned_cols=12 Identities=17% Similarity=0.110 Sum_probs=4.5
Q ss_pred HHHHHHHHHHHh
Q psy6892 16 IWAVVTLILVFC 27 (190)
Q Consensus 16 ~~~v~~~ii~vl 27 (190)
++++++++++++
T Consensus 199 i~f~llgllfli 210 (256)
T PF09788_consen 199 IIFFLLGLLFLI 210 (256)
T ss_pred HHHHHHHHHHHH
Confidence 333333333333
No 89
>PF03006 HlyIII: Haemolysin-III related; InterPro: IPR004254 Members of this family are integral membrane proteins. This family includes proteins that are hemolysin-III homologs.; GO: 0016021 integral to membrane
Probab=34.93 E-value=77 Score=24.71 Aligned_cols=42 Identities=12% Similarity=0.120 Sum_probs=17.8
Q ss_pred hhHHHHHHHHHHHHhhhccchheeeeeeccCCCCchhHHHHH
Q psy6892 13 SWNIWAVVTLILVFCTATGNILVCLAISWEKRLQNVTNYFLM 54 (190)
Q Consensus 13 ~~~~~~v~~~ii~vlgi~gN~lvi~vi~~~~~L~~~~~~fl~ 54 (190)
.+....++..++.++|++.++.......+.+.+|+..+..+.
T Consensus 106 ~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~r~~~~~~~g 147 (222)
T PF03006_consen 106 LGWFYLAFIWILALIGIVLSLFPCFSSPRFRWLRTIFFLLLG 147 (222)
T ss_pred HHHHHHHHHHHHHHHhHHhhcchhhcCCccceeeehHhHHHH
Confidence 334444444455555554443322222333444555444443
No 90
>PF07787 DUF1625: Protein of unknown function (DUF1625); InterPro: IPR012430 Sequences making up this family are derived from hypothetical proteins expressed by both prokaryotic and eukaryotic species. The region in question is approximately 250 residues long.
Probab=34.77 E-value=72 Score=25.97 Aligned_cols=10 Identities=30% Similarity=0.693 Sum_probs=4.2
Q ss_pred HHHHHHHHHH
Q psy6892 50 NYFLMSLAIT 59 (190)
Q Consensus 50 ~~fl~nLAva 59 (190)
-++++++.++
T Consensus 222 ~~~~~~~~~s 231 (248)
T PF07787_consen 222 GLFLVAFIIS 231 (248)
T ss_pred HHHHHHHHHH
Confidence 3444444433
No 91
>PF06210 DUF1003: Protein of unknown function (DUF1003); InterPro: IPR010406 This entry consists of several hypothetical bacterial proteins of unknown function.
Probab=34.54 E-value=1.2e+02 Score=21.70 Aligned_cols=37 Identities=14% Similarity=0.246 Sum_probs=17.6
Q ss_pred HHHHHHHHHhhhccchheeeeeeccCCCCchhHHHHHHHHHH
Q psy6892 18 AVVTLILVFCTATGNILVCLAISWEKRLQNVTNYFLMSLAIT 59 (190)
Q Consensus 18 ~v~~~ii~vlgi~gN~lvi~vi~~~~~L~~~~~~fl~nLAva 59 (190)
++++++++++=++.|++.+......|+ + |.+.||.++
T Consensus 7 i~~~~~~~~~Wi~~N~~~~~~~~fDpy----P-FilLnl~lS 43 (108)
T PF06210_consen 7 IIIFTVFLAVWILLNILAPPRPAFDPY----P-FILLNLVLS 43 (108)
T ss_pred HHHHHHHHHHHHHHHhhccccCCCCCc----c-HHHHHHHHH
Confidence 334444444446667766544333333 2 445555543
No 92
>COG0341 SecF Preprotein translocase subunit SecF [Intracellular trafficking and secretion]
Probab=34.10 E-value=17 Score=30.99 Aligned_cols=46 Identities=9% Similarity=0.072 Sum_probs=24.0
Q ss_pred HHHHHHhhhccchheeeeeeccC---CCCchhHHHHHHHHHHHHHHhhh
Q psy6892 21 TLILVFCTATGNILVCLAISWEK---RLQNVTNYFLMSLAITDLMVAVL 66 (190)
Q Consensus 21 ~~ii~vlgi~gN~lvi~vi~~~~---~L~~~~~~fl~nLAvaDLl~~l~ 66 (190)
-.++.++|..-|-.+++.=+.+. +.|..+..-++|.|+...+..++
T Consensus 191 AAlLtiIGYSvNDtIVvfDRIREn~r~~~~~~~~~iin~si~qTlsRti 239 (305)
T COG0341 191 AALLTIIGYSVNDTIVVFDRIRENLRKYRRETLREIINTSINQTLTRTI 239 (305)
T ss_pred HHHHHHeeeccCCeEEEEhHHHHHHhhhccCCHHHHHHHHHHHHHHHHH
Confidence 33455555555544443322222 23333344789999887665544
No 93
>PF14068 YuiB: Putative membrane protein
Probab=33.88 E-value=54 Score=23.25 Aligned_cols=19 Identities=26% Similarity=0.366 Sum_probs=13.4
Q ss_pred HHHHHHHHHHHHHHhhhhc
Q psy6892 50 NYFLMSLAITDLMVAVLVM 68 (190)
Q Consensus 50 ~~fl~nLAvaDLl~~l~~l 68 (190)
.--+.+|+.+|++.....+
T Consensus 63 ~~~l~~l~~~Di~il~~Gl 81 (102)
T PF14068_consen 63 GENLVSLALADILILSSGL 81 (102)
T ss_pred HHHHhhhhHHHHHHHHHHH
Confidence 3457788999988765543
No 94
>TIGR01477 RIFIN variant surface antigen, rifin family. This model represents the rifin branch of the rifin/stevor family (pfam02009) of predicted variant surface antigens as found in Plasmodium falciparum. This model is based on a set of rifin sequences kindly provided by Matt Berriman from the Sanger Center. This is a global model and assesses a penalty for incomplete sequence. Additional fragmentary sequences may be found with the fragment model and a cutoff of 20 bits.
Probab=33.45 E-value=15 Score=31.89 Aligned_cols=29 Identities=10% Similarity=0.143 Sum_probs=17.1
Q ss_pred HHHHHHHHHhhhccchheeeeeeccCCCC
Q psy6892 18 AVVTLILVFCTATGNILVCLAISWEKRLQ 46 (190)
Q Consensus 18 ~v~~~ii~vlgi~gN~lvi~vi~~~~~L~ 46 (190)
.+++.++.++.|+.-.++|++|.|+||-+
T Consensus 311 ~IiaSiIAIvvIVLIMvIIYLILRYRRKK 339 (353)
T TIGR01477 311 PIIASIIAILIIVLIMVIIYLILRYRRKK 339 (353)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhhcc
Confidence 34444555555555566777777777643
No 95
>PF04789 DUF621: Protein of unknown function (DUF621); InterPro: IPR006874 This is a conserved region found in uncharacterised proteins from Caenorhabditis elegans, and is noted to have possible G-protein-coupled receptor-like activity.
Probab=33.12 E-value=41 Score=28.40 Aligned_cols=36 Identities=25% Similarity=0.375 Sum_probs=24.1
Q ss_pred hheeeeeeccCCCCchhHHHHHHHHHHHHHHhhhhc
Q psy6892 33 ILVCLAISWEKRLQNVTNYFLMSLAITDLMVAVLVM 68 (190)
Q Consensus 33 ~lvi~vi~~~~~L~~~~~~fl~nLAvaDLl~~l~~l 68 (190)
.+|+.++.+.+-.+-+..+|+..|.++-++++.+.+
T Consensus 33 Flv~~i~lW~~Fk~m~ffwFl~qlt~s~fi~S~lNl 68 (305)
T PF04789_consen 33 FLVLSIILWSHFKPMKFFWFLTQLTISVFIISSLNL 68 (305)
T ss_pred HHHHHHHHHHhcccchHHHHHHHHHHHHHHHHhhhh
Confidence 344444444444456778999999999888876643
No 96
>PF03729 DUF308: Short repeat of unknown function (DUF308); InterPro: IPR005325 This represents a group of short repeats that occurs in a limited number of membrane proteins. It may divide further in short repeats of around 7-10 residues of the pattern G-#-X(2)-#(2)-X (#=hydrophobic).
Probab=33.12 E-value=82 Score=19.57 Aligned_cols=21 Identities=5% Similarity=0.224 Sum_probs=12.7
Q ss_pred CchhHHHHHHHHHHHHHHhhh
Q psy6892 46 QNVTNYFLMSLAITDLMVAVL 66 (190)
Q Consensus 46 ~~~~~~fl~nLAvaDLl~~l~ 66 (190)
+.+...+..-+++.+++.+.+
T Consensus 49 ~~~~~~~~l~~gi~~i~~Gi~ 69 (72)
T PF03729_consen 49 GSKGWWWSLLSGILSIVLGII 69 (72)
T ss_pred cchhhHHHHHHHHHHHHHHHH
Confidence 444555666667777776654
No 97
>PF13903 Claudin_2: PMP-22/EMP/MP20/Claudin tight junction
Probab=32.89 E-value=55 Score=24.13 Aligned_cols=23 Identities=0% Similarity=-0.005 Sum_probs=11.9
Q ss_pred HHHHHhhhccchheeeeeeccCC
Q psy6892 22 LILVFCTATGNILVCLAISWEKR 44 (190)
Q Consensus 22 ~ii~vlgi~gN~lvi~vi~~~~~ 44 (190)
++-.++.++|.++.++..+++++
T Consensus 76 ~l~~~~~~~a~~~~~~~~~~~~~ 98 (172)
T PF13903_consen 76 ILGLLLLLFAFVFALIGFCKRSY 98 (172)
T ss_pred HHHHHHHHHHHHHHHHHhhcccc
Confidence 33334455666666555555444
No 98
>PTZ00046 rifin; Provisional
Probab=32.87 E-value=16 Score=31.81 Aligned_cols=28 Identities=11% Similarity=0.158 Sum_probs=16.9
Q ss_pred HHHHHHHHhhhccchheeeeeeccCCCC
Q psy6892 19 VVTLILVFCTATGNILVCLAISWEKRLQ 46 (190)
Q Consensus 19 v~~~ii~vlgi~gN~lvi~vi~~~~~L~ 46 (190)
+++.++.++.|+.-.++|++|.|+||-+
T Consensus 317 IiaSiiAIvVIVLIMvIIYLILRYRRKK 344 (358)
T PTZ00046 317 IIASIVAIVVIVLIMVIIYLILRYRRKK 344 (358)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhcc
Confidence 4445555555555567777777777643
No 99
>KOG1056|consensus
Probab=32.40 E-value=49 Score=32.27 Aligned_cols=43 Identities=12% Similarity=0.352 Sum_probs=24.0
Q ss_pred HHHHHHHHHHhhhccchheeeeeeccCCC-------CchhHHHHHHHHHH
Q psy6892 17 WAVVTLILVFCTATGNILVCLAISWEKRL-------QNVTNYFLMSLAIT 59 (190)
Q Consensus 17 ~~v~~~ii~vlgi~gN~lvi~vi~~~~~L-------~~~~~~fl~nLAva 59 (190)
+.++.+++.++||+.-++|++++.++... |..+|+++..+.++
T Consensus 561 ~~i~~l~lA~LGii~tl~vlv~fvr~~~TpvVkasgreLsyiLL~gi~l~ 610 (878)
T KOG1056|consen 561 WSIAPLLLAILGIIATLFVLVLFVRYNDTPVVKASGRELSYILLLGIFLC 610 (878)
T ss_pred ccHHHHHHHHHHHHHHHHhheeEEEecCCceEEecCcEehHHHHHHHHHH
Confidence 33455555566665555555555444322 45567888777643
No 100
>PF14654 Epiglycanin_C: Mucin, catalytic, TM and cytoplasmic tail region
Probab=32.12 E-value=35 Score=24.05 Aligned_cols=23 Identities=4% Similarity=-0.042 Sum_probs=12.2
Q ss_pred hhhHHHHHHHHHHHHhhhccchh
Q psy6892 12 ASWNIWAVVTLILVFCTATGNIL 34 (190)
Q Consensus 12 ~~~~~~~v~~~ii~vlgi~gN~l 34 (190)
++.++++.+..++..+|++.-++
T Consensus 17 PWeIfLItLasVvvavGl~aGLf 39 (106)
T PF14654_consen 17 PWEIFLITLASVVVAVGLFAGLF 39 (106)
T ss_pred chHHHHHHHHHHHHHHHHHHHHH
Confidence 33445555555666666654433
No 101
>PF11683 DUF3278: Protein of unknown function (DUF3278); InterPro: IPR021697 This bacterial family of proteins has no known function.
Probab=31.84 E-value=31 Score=25.30 Aligned_cols=26 Identities=19% Similarity=0.161 Sum_probs=16.9
Q ss_pred HHHHHHHHhhhccchheeeeeeccCC
Q psy6892 19 VVTLILVFCTATGNILVCLAISWEKR 44 (190)
Q Consensus 19 v~~~ii~vlgi~gN~lvi~vi~~~~~ 44 (190)
-.++.++...+++|++.++...+.+.
T Consensus 35 ~a~i~l~~~~l~~~li~l~l~~~~~~ 60 (129)
T PF11683_consen 35 NAFIILFYYSLLLNLISLLLASRYPE 60 (129)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhHH
Confidence 35666777778888887765444433
No 102
>PF09323 DUF1980: Domain of unknown function (DUF1980); InterPro: IPR015402 Members of this occur in gene pairs with members of PF03773 from PFAM. The N-terminal region contains several predicted transmembrane helix regions while the few invariant residues (G, CxxD, and W) occur in the C-terminal region. Members of this family are found in a set of prokaryotic hypothetical proteins. Their exact function has not, as yet, been defined.
Probab=31.72 E-value=73 Score=24.63 Aligned_cols=8 Identities=13% Similarity=0.447 Sum_probs=3.0
Q ss_pred HHHHHhhh
Q psy6892 22 LILVFCTA 29 (190)
Q Consensus 22 ~ii~vlgi 29 (190)
++++++|+
T Consensus 40 i~l~ilai 47 (182)
T PF09323_consen 40 ILLLILAI 47 (182)
T ss_pred HHHHHHHH
Confidence 33333433
No 103
>PF13127 DUF3955: Protein of unknown function (DUF3955)
Probab=31.66 E-value=1.3e+02 Score=19.31 Aligned_cols=25 Identities=20% Similarity=0.450 Sum_probs=14.2
Q ss_pred eeccCCCCchhHHHHHHHHHHHHHHhh
Q psy6892 39 ISWEKRLQNVTNYFLMSLAITDLMVAV 65 (190)
Q Consensus 39 i~~~~~L~~~~~~fl~nLAvaDLl~~l 65 (190)
+-.+-.+|-| ++++.++..=++.++
T Consensus 28 Vd~~G~L~Ep--FfLiPlg~l~~~~g~ 52 (63)
T PF13127_consen 28 VDEDGVLHEP--FFLIPLGYLFLLIGI 52 (63)
T ss_pred ECCCCeEecc--cHHHHHHHHHHHHHH
Confidence 3455667877 566666655444443
No 104
>KOG4404|consensus
Probab=31.64 E-value=21 Score=30.74 Aligned_cols=17 Identities=24% Similarity=0.356 Sum_probs=8.9
Q ss_pred chhHHHHHHHHHHHHHH
Q psy6892 47 NVTNYFLMSLAITDLMV 63 (190)
Q Consensus 47 ~~~~~fl~nLAvaDLl~ 63 (190)
++.|++++.+.++-++.
T Consensus 156 S~~~l~~i~~~~~~~~i 172 (350)
T KOG4404|consen 156 SVYNLVLILFTACILLI 172 (350)
T ss_pred cHHHHHHHHHHHHHHHH
Confidence 45566666555544443
No 105
>PRK13664 hypothetical protein; Provisional
Probab=31.24 E-value=38 Score=21.48 Aligned_cols=16 Identities=6% Similarity=0.225 Sum_probs=10.6
Q ss_pred HHHHHHHHhhhccchh
Q psy6892 19 VVTLILVFCTATGNIL 34 (190)
Q Consensus 19 v~~~ii~vlgi~gN~l 34 (190)
.++++++++|++-|++
T Consensus 9 Wilill~lvG~i~N~i 24 (62)
T PRK13664 9 WILVLVFLVGVLLNVI 24 (62)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 3445667778877765
No 106
>KOG2443|consensus
Probab=31.15 E-value=9.1 Score=33.02 Aligned_cols=27 Identities=11% Similarity=0.041 Sum_probs=17.9
Q ss_pred HHHHHHhhhhcccceeeeccccccccc
Q psy6892 58 ITDLMVAVLVMPLGILTLVRVRTNLSG 84 (190)
Q Consensus 58 vaDLl~~l~~lP~~l~~~l~~~~~~s~ 84 (190)
-+|++..+.+.+..+.+++.++|....
T Consensus 152 ~~~iv~~vls~~i~v~~ll~~HWl~nN 178 (362)
T KOG2443|consen 152 RAQIVALVLSSMIVVWYLLTKHWLANN 178 (362)
T ss_pred HHHHHHHHHHHHHHHHHHhhhHHHHHh
Confidence 356666666666777777778876543
No 107
>PF15145 DUF4577: Domain of unknown function (DUF4577)
Probab=30.78 E-value=30 Score=25.06 Aligned_cols=27 Identities=15% Similarity=0.167 Sum_probs=13.4
Q ss_pred hHHHHHHHHHHHHhhhccchheeeeee
Q psy6892 14 WNIWAVVTLILVFCTATGNILVCLAIS 40 (190)
Q Consensus 14 ~~~~~v~~~ii~vlgi~gN~lvi~vi~ 40 (190)
+.++++.++++.++++..-.+||+.+.
T Consensus 60 ~~lffvglii~LivSLaLVsFvIFLii 86 (128)
T PF15145_consen 60 RSLFFVGLIIVLIVSLALVSFVIFLII 86 (128)
T ss_pred eeehHHHHHHHHHHHHHHHHHHHHhee
Confidence 445555555555555544445554443
No 108
>KOG3850|consensus
Probab=30.40 E-value=50 Score=29.11 Aligned_cols=37 Identities=27% Similarity=0.468 Sum_probs=19.5
Q ss_pred HHHHHHHHHhhhccchheeeeeeccCCCCchhHHHHHHHH
Q psy6892 18 AVVTLILVFCTATGNILVCLAISWEKRLQNVTNYFLMSLA 57 (190)
Q Consensus 18 ~v~~~ii~vlgi~gN~lvi~vi~~~~~L~~~~~~fl~nLA 57 (190)
.++.+++++++.+.|++. .....++|+.+.+|++-|+
T Consensus 398 alm~VlLvfVSTIa~~v~---PLmkSR~rt~~t~~LV~l~ 434 (455)
T KOG3850|consen 398 ALMTVLLVFVSTIANCVS---PLMKSRNRTASTFFLVFLL 434 (455)
T ss_pred HHHHHHHHHHHHHHhhcc---HHhhhhhHHHHHHHHHHHH
Confidence 333444455555555443 3344556677766666554
No 109
>PF07226 DUF1422: Protein of unknown function (DUF1422); InterPro: IPR009867 This family consists of several hypothetical bacterial proteins of around 120 residues in length. The function of this family is unknown.
Probab=30.28 E-value=10 Score=27.58 Aligned_cols=27 Identities=11% Similarity=0.082 Sum_probs=15.1
Q ss_pred cCCCCchhHHHHHHHHHHHHHHhhhhc
Q psy6892 42 EKRLQNVTNYFLMSLAITDLMVAVLVM 68 (190)
Q Consensus 42 ~~~L~~~~~~fl~nLAvaDLl~~l~~l 68 (190)
.++++.|+.-=+.-+|++-++++++..
T Consensus 53 Q~Yl~~~m~eg~P~~a~acFflG~f~y 79 (117)
T PF07226_consen 53 QRYLNHPMPEGTPKLALACFFLGLFGY 79 (117)
T ss_pred HHHhcCCCCCCChHHHHHHHHHHHHHH
Confidence 345566665555555666566665543
No 110
>KOG1281|consensus
Probab=30.23 E-value=1e+02 Score=28.74 Aligned_cols=29 Identities=17% Similarity=0.047 Sum_probs=16.2
Q ss_pred chhhHHHHHHHHHHHHhhhccchheeeee
Q psy6892 11 AASWNIWAVVTLILVFCTATGNILVCLAI 39 (190)
Q Consensus 11 ~~~~~~~~v~~~ii~vlgi~gN~lvi~vi 39 (190)
+.....+++..+++.+..++.|.+...++
T Consensus 469 p~~~i~li~~ll~~v~TeF~Sntata~If 497 (586)
T KOG1281|consen 469 PVFAILLIFSLLILVVTEFVSNTATASIF 497 (586)
T ss_pred ChHHHHHHHHHHHHHHHHHhcchhheeeh
Confidence 33334444444555566688887765444
No 111
>PHA02974 putative IMV membrane protein; Provisional
Probab=30.22 E-value=72 Score=21.61 Aligned_cols=56 Identities=14% Similarity=0.137 Sum_probs=28.5
Q ss_pred HHHHHHHhhhccchheeeeeeccCCCC-chhHHHHHHHHHHHHHHhhhhcccceeee
Q psy6892 20 VTLILVFCTATGNILVCLAISWEKRLQ-NVTNYFLMSLAITDLMVAVLVMPLGILTL 75 (190)
Q Consensus 20 ~~~ii~vlgi~gN~lvi~vi~~~~~L~-~~~~~fl~nLAvaDLl~~l~~lP~~l~~~ 75 (190)
+.++..+..+.|-.+++-.+.-++.+- +....|---+.+..++..++.+|..+..+
T Consensus 13 iTlL~llMiisG~aLi~k~~~P~~~~v~~ss~tf~rvv~~lE~vailifiPGti~LY 69 (81)
T PHA02974 13 ITILVVLLIISGFSLILRLIPGVYSSVIRSSFTAGKILRFMEIFSTIMFIPGIIILY 69 (81)
T ss_pred HHHHHHHHHHhChHHHHhhcCchhhhhhhHHHHHHHHHHHHHHHHHhheeccHHHHH
Confidence 334444445666666654443332221 22224445566666666666777655443
No 112
>KOG2442|consensus
Probab=29.89 E-value=11 Score=34.10 Aligned_cols=29 Identities=14% Similarity=0.157 Sum_probs=17.3
Q ss_pred HHHHHHHHHHHHHhhhhcccceeeecccc
Q psy6892 51 YFLMSLAITDLMVAVLVMPLGILTLVRVR 79 (190)
Q Consensus 51 ~fl~nLAvaDLl~~l~~lP~~l~~~l~~~ 79 (190)
.++-++...-++++++++.+.+++...++
T Consensus 306 ~~l~~~~~~~l~l~~~Cia~aV~W~v~R~ 334 (541)
T KOG2442|consen 306 PRLGNMSYRLLFLSILCIAVAVVWAVFRN 334 (541)
T ss_pred cccCChhHHHHHHHHhhhheeEEEEEeec
Confidence 44455555556666667667666665544
No 113
>PF11044 TMEMspv1-c74-12: Plectrovirus spv1-c74 ORF 12 transmembrane protein; InterPro: IPR022743 This is a group of proteins expressed by Plectroviruses. The Plectroviruses are single-stranded DNA viruses belonging to the Inoviridae. This entry represents putative transmembrane proteins of unknown function.
Probab=29.77 E-value=62 Score=19.42 Aligned_cols=16 Identities=25% Similarity=0.177 Sum_probs=6.4
Q ss_pred HHHHHHHHHHHhhhcc
Q psy6892 16 IWAVVTLILVFCTATG 31 (190)
Q Consensus 16 ~~~v~~~ii~vlgi~g 31 (190)
.++.+.+++++++.+|
T Consensus 7 ~iFsvvIil~If~~iG 22 (49)
T PF11044_consen 7 TIFSVVIILGIFAWIG 22 (49)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 3333333444433344
No 114
>PF12191 stn_TNFRSF12A: Tumour necrosis factor receptor stn_TNFRSF12A_TNFR domain; InterPro: IPR022316 The tumour necrosis factor (TNF) receptor (TNFR) superfamily comprises more than 20 type-I transmembrane proteins. Family members are defined based on similarity in their extracellular domain - a region that contains many cysteine residues arranged in a specific repetitive pattern []. The cysteines allow formation of an extended rod-like structure, responsible for ligand binding []. Upon receptor activation, different intracellular signalling complexes are assembled for different members of the TNFR superfamily, depending on their intracellular domains and sequences []. Activation of TNFRs can therefore induce a range of disparate effects, including cell proliferation, differentiation, survival, or apoptotic cell death, depending upon the receptor involved []. TNFRs are widely distributed and play important roles in many crucial biological processes, such as lymphoid and neuronal development, innate and adaptive immunity, and maintenance of cellular homeostasis []. Drugs that manipulate their signalling have potential roles in the prevention and treatment of many diseases, such as viral infections, coronary heart disease, transplant rejection, and immune disease []. TNF receptor 12 (also known as TWEAK receptor, and fibroblast growth factor-inducible-14 (Fn14)) has been implicated in endothelial cell growth and migration []. The receptor may also play a role in cell-matrix interactions [].; PDB: 2KN0_A 2RPJ_A 2KMZ_A 2EQP_A.
Probab=28.89 E-value=19 Score=26.62 Aligned_cols=14 Identities=0% Similarity=0.003 Sum_probs=0.0
Q ss_pred HHHHHHHHHhhhcc
Q psy6892 18 AVVTLILVFCTATG 31 (190)
Q Consensus 18 ~v~~~ii~vlgi~g 31 (190)
+.+..++++++++.
T Consensus 83 ~sal~v~lVl~lls 96 (129)
T PF12191_consen 83 GSALSVVLVLALLS 96 (129)
T ss_dssp --------------
T ss_pred hhHHHHHHHHHHHH
Confidence 34444444555544
No 115
>PF02118 Srg: Srg family chemoreceptor; InterPro: IPR000609 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class g (Srg) from the Srg superfamily [, ]. Srg receptors contain seven hydrophobic, putative transmembrane, regions and can be distinguished from other 7TM GPCR receptors by their own characteristic TM signatures. ; GO: 0004888 transmembrane signaling receptor activity, 0007606 sensory perception of chemical stimulus, 0016020 membrane
Probab=28.81 E-value=22 Score=28.86 Aligned_cols=45 Identities=13% Similarity=0.085 Sum_probs=27.3
Q ss_pred HHHHHhhhccchheeeeeeccCC---CCchhHHHHHHHHHHHHHHhhh
Q psy6892 22 LILVFCTATGNILVCLAISWEKR---LQNVTNYFLMSLAITDLMVAVL 66 (190)
Q Consensus 22 ~ii~vlgi~gN~lvi~vi~~~~~---L~~~~~~fl~nLAvaDLl~~l~ 66 (190)
++-++.+++.+++++++++++++ .+++.+.+...=+++++++-+.
T Consensus 5 ~~Ygi~s~~l~~~~~~~i~~~~~~~~~~~sFy~l~~~d~~~ni~~~ln 52 (275)
T PF02118_consen 5 LIYGIPSLILYIFILYVIWKSKKKSYFKSSFYRLYIMDLIMNILTYLN 52 (275)
T ss_pred hhHHHHHHHHHHHHHHHHHHhccccccCCccHHHHHHHhHHHHHHHHH
Confidence 34455566666677777766665 3666666666656666665433
No 116
>PRK06758 hypothetical protein; Provisional
Probab=28.59 E-value=1.3e+02 Score=21.13 Aligned_cols=17 Identities=18% Similarity=0.333 Sum_probs=11.7
Q ss_pred cchhhhhhhhhcccccc
Q psy6892 92 NILCENQLFKAGARDET 108 (190)
Q Consensus 92 ~i~c~~slfSl~se~~~ 108 (190)
..+|.+..|.+-.+.++
T Consensus 102 iafcladvfnllqd~ee 118 (128)
T PRK06758 102 IAFCLADVFNLLQDNEE 118 (128)
T ss_pred HHHHHHHHHHHhcCchh
Confidence 34788888888776543
No 117
>cd07912 Tweety_N N-terminal domain of the protein encoded by the Drosophila tweety gene and related proteins, a family of chloride ion channels. The protein product of the Drosophila tweety (tty) gene is thought to form a trans-membrane protein with five membrane-spanning regions and a cytoplasmic C-terminus. This N-terminal domain contains the putative transmembrane spanning regions. Tweety has been suggested as a candidate for a large conductance chloride channel, both in vertebrate and insect cells. Three human homologs have been identified and designated TTYH1-3. TTYH2 has been associated with the progression of cancer, and Drosophila melanogaster tweety has been assumed to play a role in development. TTYH2, and TTYH3 bind to and are ubiquinated by Nedd4-2, a HECT type E3 ubiquitin ligase, which most likely plays a role in controlling the cellular levels of tweety family proteins.
Probab=28.24 E-value=1e+02 Score=27.56 Aligned_cols=15 Identities=27% Similarity=0.306 Sum_probs=8.2
Q ss_pred hHHHHHHHHHHHHHH
Q psy6892 49 TNYFLMSLAITDLMV 63 (190)
Q Consensus 49 ~~~fl~nLAvaDLl~ 63 (190)
...+...|.+.-++.
T Consensus 79 ~~c~~~sLiiltL~~ 93 (418)
T cd07912 79 ICCLKWSLVIATLLC 93 (418)
T ss_pred ccHHHHHHHHHHHHH
Confidence 345555666555554
No 118
>PF14184 YrvL: Regulatory protein YrvL
Probab=28.19 E-value=61 Score=24.03 Aligned_cols=20 Identities=10% Similarity=0.034 Sum_probs=9.6
Q ss_pred HHHHHHHHHHHhhhccchhe
Q psy6892 16 IWAVVTLILVFCTATGNILV 35 (190)
Q Consensus 16 ~~~v~~~ii~vlgi~gN~lv 35 (190)
.++.++++.+++|++-+.+.
T Consensus 41 ~llLF~li~~~lg~~~e~~~ 60 (132)
T PF14184_consen 41 SLLLFFLIIFVLGLPFELFE 60 (132)
T ss_pred HHHHHHHHHHHHHHHHHHHH
Confidence 33444444455555555443
No 119
>PRK11056 hypothetical protein; Provisional
Probab=28.02 E-value=15 Score=26.76 Aligned_cols=29 Identities=17% Similarity=0.162 Sum_probs=18.3
Q ss_pred ccCCCCchhHHHHHHHHHHHHHHhhhhcc
Q psy6892 41 WEKRLQNVTNYFLMSLAITDLMVAVLVMP 69 (190)
Q Consensus 41 ~~~~L~~~~~~fl~nLAvaDLl~~l~~lP 69 (190)
..++++.|+.-=+.-+|++-++++++...
T Consensus 52 yQ~Yl~~~m~eg~P~~a~acFflG~f~yS 80 (120)
T PRK11056 52 HQRYLNRPMPEGLPGLAAACFFLGVFLYS 80 (120)
T ss_pred HHHHhcCCCCCCChHHHHHHHHHHHHHHH
Confidence 34566777766666677776777665443
No 120
>PF10873 DUF2668: Protein of unknown function (DUF2668); InterPro: IPR022640 Members in this family of proteins are annotated as cysteine and tyrosine-rich protein 1, however currently no function is known [].
Probab=27.83 E-value=8.8 Score=29.03 Aligned_cols=25 Identities=20% Similarity=0.388 Sum_probs=16.0
Q ss_pred HHHHHHHHHHHhhhccchheeeeee
Q psy6892 16 IWAVVTLILVFCTATGNILVCLAIS 40 (190)
Q Consensus 16 ~~~v~~~ii~vlgi~gN~lvi~vi~ 40 (190)
+..++|.++|++|+++-+.+++..+
T Consensus 63 IaGIVfgiVfimgvva~i~icvCmc 87 (155)
T PF10873_consen 63 IAGIVFGIVFIMGVVAGIAICVCMC 87 (155)
T ss_pred eeeeehhhHHHHHHHHHHHHHHhhh
Confidence 4456777788888777666554443
No 121
>KOG0473|consensus
Probab=27.65 E-value=60 Score=27.16 Aligned_cols=35 Identities=11% Similarity=-0.045 Sum_probs=16.5
Q ss_pred HHHHHHhhhccchheeeeeec-cCCCCchhHHHHHH
Q psy6892 21 TLILVFCTATGNILVCLAISW-EKRLQNVTNYFLMS 55 (190)
Q Consensus 21 ~~ii~vlgi~gN~lvi~vi~~-~~~L~~~~~~fl~n 55 (190)
.++.|.+++.||++.++.+.. .+.+.+..+-++.|
T Consensus 265 ~L~~FgVa~~~~~l~V~mv~q~~~d~~~sv~~~~~h 300 (326)
T KOG0473|consen 265 GLKQFGVALEGFVLGVKMVFQVVADALRSVGALLVH 300 (326)
T ss_pred hHHHHHHHhhhHHhhhHhhhhhccHHHHHHHHHHHH
Confidence 344455556666665544432 23333334444444
No 122
>COG2116 FocA Formate/nitrite family of transporters [Inorganic ion transport and metabolism]
Probab=27.33 E-value=38 Score=28.28 Aligned_cols=43 Identities=21% Similarity=0.192 Sum_probs=27.7
Q ss_pred HHHHHHhhhccchheeeeeeccCCCCchh-HHHHHHHHHHHHHH
Q psy6892 21 TLILVFCTATGNILVCLAISWEKRLQNVT-NYFLMSLAITDLMV 63 (190)
Q Consensus 21 ~~ii~vlgi~gN~lvi~vi~~~~~L~~~~-~~fl~nLAvaDLl~ 63 (190)
+.-+++.||+.|.+|+.++|...+.+... -++++-+.+..+..
T Consensus 158 ~~~~F~~GI~cN~lVclAvwm~~~~k~~~~K~~~m~~pv~~Fv~ 201 (265)
T COG2116 158 FLEAFAKGILCNWLVCLAVWMSYSAKDGAAKIFAIILPVMLFVA 201 (265)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHhcccchhhHHHHHHHHHHHHHH
Confidence 46677888888888888887766655432 45555555544443
No 123
>PTZ00415 transmission-blocking target antigen s230; Provisional
Probab=27.32 E-value=14 Score=38.55 Aligned_cols=6 Identities=33% Similarity=0.407 Sum_probs=2.8
Q ss_pred hccchh
Q psy6892 29 ATGNIL 34 (190)
Q Consensus 29 i~gN~l 34 (190)
++||.+
T Consensus 30 ~~~~~~ 35 (2849)
T PTZ00415 30 IIGNGI 35 (2849)
T ss_pred eecCCc
Confidence 445543
No 124
>cd08763 Cyt_b561_CYB561 Vertebrate cytochrome b(561), CYB561 gene product. Cytochrome b(561), as found in vertebrates, which might act as a ferric-chelate reductase, catalyzing the reduction of Fe(3+) to Fe(2+), such as associated with the transport of iron from the endosome to the cytoplasm. It is assumed that this protein uses ascorbate as the electron donor. Belongs to the cytochrome b(561) family, which are secretory vesicle-specific electron transport proteins. Cytochromes b(561) are integral membrane proteins that bind two heme groups non-covalently, and may have six alpha-helical trans-membrane segments.
Probab=27.32 E-value=84 Score=23.61 Aligned_cols=45 Identities=18% Similarity=0.252 Sum_probs=21.7
Q ss_pred HHHHHHHhhhccchheeeeeec--cCCCCchhHHHHHHHHHHHHHHh
Q psy6892 20 VTLILVFCTATGNILVCLAISW--EKRLQNVTNYFLMSLAITDLMVA 64 (190)
Q Consensus 20 ~~~ii~vlgi~gN~lvi~vi~~--~~~L~~~~~~fl~nLAvaDLl~~ 64 (190)
++++++.+-+.|+.++++-..+ .++..+..+..+--+|+.=.+++
T Consensus 12 ~lm~~G~i~l~geaiL~~~~~~~~~k~~~k~~H~~L~~la~~~~~~G 58 (143)
T cd08763 12 LCMVLGLVFLCGEALLVYRVFRNETKRSTKILHGLLHIMALVISLVG 58 (143)
T ss_pred HHHHHHHHHHHHHHHHHhccccccccchhHHHHHHHHHHHHHHHHHH
Confidence 4445555555566666554333 22223445555555555444443
No 125
>COG5130 YIP3 Prenylated rab acceptor 1 and related proteins [Intracellular trafficking and secretion / Signal transduction mechanisms]
Probab=27.13 E-value=33 Score=25.96 Aligned_cols=23 Identities=9% Similarity=0.157 Sum_probs=12.6
Q ss_pred hhHHHHHHHHHHHHHHhhhhccc
Q psy6892 48 VTNYFLMSLAITDLMVAVLVMPL 70 (190)
Q Consensus 48 ~~~~fl~nLAvaDLl~~l~~lP~ 70 (190)
|.+.+...|++.-+.++++..|.
T Consensus 113 ~~~~ly~glvcvlip~gffaspI 135 (169)
T COG5130 113 PRSVLYAGLVCVLIPFGFFASPI 135 (169)
T ss_pred ecchhhhhHHHHHHHHHHHHhHH
Confidence 34556666666555555554443
No 126
>KOG0065|consensus
Probab=26.36 E-value=51 Score=33.70 Aligned_cols=35 Identities=17% Similarity=0.174 Sum_probs=18.8
Q ss_pred chheeeeeeccCCCCchhHHHHHHHHHHHHHHhhh
Q psy6892 32 NILVCLAISWEKRLQNVTNYFLMSLAITDLMVAVL 66 (190)
Q Consensus 32 N~lvi~vi~~~~~L~~~~~~fl~nLAvaDLl~~l~ 66 (190)
+.+=++++...+.+++..++.-.-+++..++.+++
T Consensus 1250 ~~~Gmm~~s~tPn~~~Aav~~s~~~s~~~~F~G~l 1284 (1391)
T KOG0065|consen 1250 TTLGMMLVSLTPNLQTAAVIASLFFSFWNLFSGFL 1284 (1391)
T ss_pred HHHHHHHHHhCCChhHHHHHHHHHHHHHHHhcccc
Confidence 33334455556666666555555555555555543
No 127
>PRK11562 nitrite transporter NirC; Provisional
Probab=25.95 E-value=36 Score=28.38 Aligned_cols=34 Identities=32% Similarity=0.512 Sum_probs=22.1
Q ss_pred HHHHHHhhhccchheeeeeeccCCCCchhHHHHH
Q psy6892 21 TLILVFCTATGNILVCLAISWEKRLQNVTNYFLM 54 (190)
Q Consensus 21 ~~ii~vlgi~gN~lvi~vi~~~~~L~~~~~~fl~ 54 (190)
+..+++.||++|.+|+..++...+.+....++.+
T Consensus 150 ~~~~f~~gI~CNwLVclavw~~~~~~~~gKi~~i 183 (268)
T PRK11562 150 AMVLFFKGALCNWLVCLAIWMAIRTEGAAKFIAI 183 (268)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccccchhHHH
Confidence 4577888999999998877755543333333333
No 128
>PF15109 TMEM125: TMEM125 protein family
Probab=25.87 E-value=1.8e+02 Score=20.61 Aligned_cols=46 Identities=20% Similarity=0.182 Sum_probs=22.2
Q ss_pred HHHHHHHHHHHhhhccchheeeeeeccCCCCchhHHHHHHHHHHHH
Q psy6892 16 IWAVVTLILVFCTATGNILVCLAISWEKRLQNVTNYFLMSLAITDL 61 (190)
Q Consensus 16 ~~~v~~~ii~vlgi~gN~lvi~vi~~~~~L~~~~~~fl~nLAvaDL 61 (190)
.+.+...++...|+-|-.+.-.+..+....|-....-+|-||+.=+
T Consensus 21 Cy~~aV~Lil~cG~gGv~LLstttS~sgeWRL~vGt~LCLLAL~vL 66 (112)
T PF15109_consen 21 CYCVAVALILGCGLGGVGLLSTTTSRSGEWRLAVGTTLCLLALLVL 66 (112)
T ss_pred HHHHHHHHHHhcCcCceEEEecccccccceehhhhHHHHHHHHHHH
Confidence 3333344444445544444434444444455555556666665433
No 129
>PHA03164 hypothetical protein; Provisional
Probab=25.79 E-value=52 Score=22.11 Aligned_cols=22 Identities=27% Similarity=0.407 Sum_probs=15.8
Q ss_pred CchhHHHHHHHHHHHHHHhhhh
Q psy6892 46 QNVTNYFLMSLAITDLMVAVLV 67 (190)
Q Consensus 46 ~~~~~~fl~nLAvaDLl~~l~~ 67 (190)
++.+.+.+..||++=+++.++.
T Consensus 56 ktftFlvLtgLaIamILfiifv 77 (88)
T PHA03164 56 KTFTFLVLTGLAIAMILFIIFV 77 (88)
T ss_pred heeehHHHHHHHHHHHHHHHHH
Confidence 4567788888999877765443
No 130
>KOG2632|consensus
Probab=25.57 E-value=60 Score=27.00 Aligned_cols=58 Identities=16% Similarity=0.211 Sum_probs=34.2
Q ss_pred hHHHHHHHHHHHHhhhccchhee----eeeeccCCCCchhHHHHHHHHHHHHHHhhhhcccc
Q psy6892 14 WNIWAVVTLILVFCTATGNILVC----LAISWEKRLQNVTNYFLMSLAITDLMVAVLVMPLG 71 (190)
Q Consensus 14 ~~~~~v~~~ii~vlgi~gN~lvi----~vi~~~~~L~~~~~~fl~nLAvaDLl~~l~~lP~~ 71 (190)
...+.++..++++++++.-+... ..+.+++++++-..+-+.|+++-.+++..+.++..
T Consensus 18 ts~~~~~~~~i~lv~~~~~i~~~~~l~~~~l~~~ql~RL~Ty~l~H~s~~hllfnmlaL~~~ 79 (258)
T KOG2632|consen 18 TSIVVVLAILIYLVSFFPGIVEVLGLPSELLINWQLYRLITYALVHLSLPHLLFNMLALWPL 79 (258)
T ss_pred HHHHHHHHHHHHHHhccchhhhHhcCCHHHhhhHHHHHHHHHHHHhccHHHHHHHHHHHHhc
Confidence 34444555555555554444321 22345666777777778888888888776655433
No 131
>PF06024 DUF912: Nucleopolyhedrovirus protein of unknown function (DUF912); InterPro: IPR009261 This entry is represented by Autographa californica nuclear polyhedrosis virus (AcMNPV), Orf78; it is a family of uncharacterised viral proteins.
Probab=25.54 E-value=73 Score=22.32 Aligned_cols=11 Identities=18% Similarity=0.347 Sum_probs=5.5
Q ss_pred heeeeeeccCC
Q psy6892 34 LVCLAISWEKR 44 (190)
Q Consensus 34 lvi~vi~~~~~ 44 (190)
+..++|.|.|+
T Consensus 81 IyYFVILRer~ 91 (101)
T PF06024_consen 81 IYYFVILRERQ 91 (101)
T ss_pred heEEEEEeccc
Confidence 34455555554
No 132
>COG4970 FimT Tfp pilus assembly protein FimT [Cell motility and secretion / Intracellular trafficking and secretion]
Probab=24.85 E-value=76 Score=24.93 Aligned_cols=44 Identities=11% Similarity=0.049 Sum_probs=29.5
Q ss_pred eeccCCCCchhHHHHHHHHHHHHHHhhhhcccceeeeccccccc
Q psy6892 39 ISWEKRLQNVTNYFLMSLAITDLMVAVLVMPLGILTLVRVRTNL 82 (190)
Q Consensus 39 i~~~~~L~~~~~~fl~nLAvaDLl~~l~~lP~~l~~~l~~~~~~ 82 (190)
.++..++++..+-+...|.++-.-......|..+.....+.|..
T Consensus 36 ~i~~~rl~s~a~eL~a~l~lARteAv~~~~pv~~~~~~~~~~~~ 79 (181)
T COG4970 36 WIRSQRLRSAADELAAALQLARTEAVAQNTPVQVCPDWDGGWCL 79 (181)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeeeeCCCCCeec
Confidence 34667788888989888888765554445676666555555444
No 133
>PF12304 BCLP: Beta-casein like protein; InterPro: IPR020977 This entry represents eukaryotic proteins that are typically between 216 to 240 amino acids in length which have two conserved sequence motifs: VLR and TRIY. Beta-casein-like protein is associated with cell morphology and a regulation of growth pattern of tumours. It is found in adenocarcinomas of uterine cervical tissues[].
Probab=24.72 E-value=1e+02 Score=24.36 Aligned_cols=44 Identities=7% Similarity=0.220 Sum_probs=23.5
Q ss_pred HHHHHHHhhhccchheeeeeeccCCC-CchhHHHHHHHHHHHHHHhhh
Q psy6892 20 VTLILVFCTATGNILVCLAISWEKRL-QNVTNYFLMSLAITDLMVAVL 66 (190)
Q Consensus 20 ~~~ii~vlgi~gN~lvi~vi~~~~~L-~~~~~~fl~nLAvaDLl~~l~ 66 (190)
+-+.-++++|.+.++.|++ .|++ +.+..+.+..+++...+++..
T Consensus 46 isv~Sgll~I~~GI~AIvl---Srnl~~~~L~W~Ll~~S~ln~LlSaA 90 (188)
T PF12304_consen 46 ISVTSGLLSIICGIVAIVL---SRNLRNRPLHWTLLVVSLLNALLSAA 90 (188)
T ss_pred HHHHHHHHHHHHhHHHHhh---hccCCCCcchHHHHHHHHHHHHHHHH
Confidence 3334445555554444333 3333 355677777777776666544
No 134
>KOG3832|consensus
Probab=24.70 E-value=2.5e+02 Score=23.03 Aligned_cols=62 Identities=8% Similarity=0.056 Sum_probs=30.1
Q ss_pred eeeeeeccCCCCchhHHHHHHHHHHHHHHhhhhcccceeeecccccccccccceeeccchhhhhhh
Q psy6892 35 VCLAISWEKRLQNVTNYFLMSLAITDLMVAVLVMPLGILTLVRVRTNLSGAVLRLKKNILCENQLF 100 (190)
Q Consensus 35 vi~vi~~~~~L~~~~~~fl~nLAvaDLl~~l~~lP~~l~~~l~~~~~~s~~lC~i~~~i~c~~slf 100 (190)
++.-+.+.+++ .++..-+-+++.+.++....+.+..-....|+...++|-+-....|.+..+
T Consensus 225 tf~diqktk~i----qiltslmrw~~~~~g~~l~tmqaeedaa~a~p~arc~cgfgkl~~~aly~~ 286 (319)
T KOG3832|consen 225 TFADIQKTKHI----QILTSLMRWAAFILGISLATMQAEEDAAAAHPPARCFCGFGKLFGCALYAF 286 (319)
T ss_pred eEeccccchhH----HHHHHHHHHHhccchhhhhhhhhcccccccCCccccccccccccchhhHHH
Confidence 34444444443 333333446666655544434444444455666666666544444544333
No 135
>PF10267 Tmemb_cc2: Predicted transmembrane and coiled-coil 2 protein; InterPro: IPR019394 This family of transmembrane coiled-coil containing proteins is conserved from worms to humans. Its function is unknown.
Probab=24.33 E-value=50 Score=29.23 Aligned_cols=15 Identities=7% Similarity=-0.135 Sum_probs=8.1
Q ss_pred CCCCchhHHHHHHHH
Q psy6892 43 KRLQNVTNYFLMSLA 57 (190)
Q Consensus 43 ~~L~~~~~~fl~nLA 57 (190)
|-++++..++..-|+
T Consensus 367 Pl~~tR~r~~~t~~~ 381 (395)
T PF10267_consen 367 PLTRTRLRTLTTLLL 381 (395)
T ss_pred HHhhccHHHHHHHHH
Confidence 445666555554444
No 136
>COG1272 Predicted membrane protein, hemolysin III homolog [General function prediction only]
Probab=24.17 E-value=1.6e+02 Score=24.03 Aligned_cols=42 Identities=10% Similarity=-0.014 Sum_probs=24.3
Q ss_pred HHHHHHHHhhhccchheeeeeeccCCCCchhHHHHHHHHHHH
Q psy6892 19 VVTLILVFCTATGNILVCLAISWEKRLQNVTNYFLMSLAITD 60 (190)
Q Consensus 19 v~~~ii~vlgi~gN~lvi~vi~~~~~L~~~~~~fl~nLAvaD 60 (190)
++++++..+|++|-++-+....+.|+++...++.+.-+++.=
T Consensus 119 ~~~~iiW~lal~Gi~~kl~~~~~~r~ls~~~yl~mGw~~v~~ 160 (226)
T COG1272 119 ILLGLIWGLALAGILFKLFFKKRFRKLSLVLYLAMGWLGLIV 160 (226)
T ss_pred HHHHHHHHHHHHHHhhhhhccCcCceeeehhhHHHHHHHHHH
Confidence 344445555555555555555666677777776666655543
No 137
>COG5456 Predicted integral membrane protein linked to a cation pump [Inorganic ion transport and metabolism]
Probab=24.14 E-value=50 Score=25.22 Aligned_cols=34 Identities=15% Similarity=0.045 Sum_probs=23.7
Q ss_pred hccchheeeeeeccCCCCchhHHHHHHHHHHHHH
Q psy6892 29 ATGNILVCLAISWEKRLQNVTNYFLMSLAITDLM 62 (190)
Q Consensus 29 i~gN~lvi~vi~~~~~L~~~~~~fl~nLAvaDLl 62 (190)
|.-|+++.+.-.+...=....|.|+.++.+.+-.
T Consensus 30 IaVnlvma~~A~~SwSGlVv~NtYvAsqqFN~ka 63 (166)
T COG5456 30 IAVNLVMAWNASRSWSGLVVENTYVASQQFNRKA 63 (166)
T ss_pred HHHHHHHHHHhhccccceEecchhHHHHHHHHHH
Confidence 4457777666555555556779999999987644
No 138
>KOG4536|consensus
Probab=24.06 E-value=1.1e+02 Score=25.93 Aligned_cols=43 Identities=19% Similarity=0.326 Sum_probs=22.9
Q ss_pred HHHHhhhccchheeeeeeccCCC--CchhHHHHHHHHHHHHHHhh
Q psy6892 23 ILVFCTATGNILVCLAISWEKRL--QNVTNYFLMSLAITDLMVAV 65 (190)
Q Consensus 23 ii~vlgi~gN~lvi~vi~~~~~L--~~~~~~fl~nLAvaDLl~~l 65 (190)
.++++++.|-++-++....+-+| |...+.++.-|++.+++.++
T Consensus 200 ~l~Lvy~~~milp~~~~k~r~kLPsr~sFy~Y~~im~~Ln~Lq~~ 244 (347)
T KOG4536|consen 200 KLLLVYSYGMILPMYNSKWREKLPSRPSFYVYIGIMAALNLLQLL 244 (347)
T ss_pred HHHHHHHHHheeeccchhhhhcCCCcchHHHHHHHHHHHHHHHHH
Confidence 33344444444444433333333 34457777778887877654
No 139
>COG3125 CyoD Heme/copper-type cytochrome/quinol oxidase, subunit 4 [Energy production and conversion]
Probab=23.88 E-value=79 Score=22.82 Aligned_cols=20 Identities=25% Similarity=0.496 Sum_probs=14.8
Q ss_pred CchhHHHHHHHHHHHHHHhh
Q psy6892 46 QNVTNYFLMSLAITDLMVAV 65 (190)
Q Consensus 46 ~~~~~~fl~nLAvaDLl~~l 65 (190)
++....++..||+..+++=+
T Consensus 45 ~~~~~~~i~~lA~iQi~vqL 64 (111)
T COG3125 45 STVTLIIILGLAVIQILVHL 64 (111)
T ss_pred hhhHHHHHHHHHHHHHHHHH
Confidence 55677888888888777644
No 140
>PF01545 Cation_efflux: Cation efflux family; InterPro: IPR002524 Members of this family are integral membrane proteins, that are found to increase tolerance to divalent metal ions such as cadmium, zinc, and cobalt. These proteins are considered to be efflux pumps that remove these ions from cells [, ], however others are implicated in ion uptake []. The family has six predicted transmembrane domains. Members of the family are variable in length because of variably sized inserts, often containing low-complexity sequence.; GO: 0008324 cation transmembrane transporter activity, 0006812 cation transport, 0055085 transmembrane transport, 0016021 integral to membrane; PDB: 3BYP_B 3BYR_A 3H90_B 2QFI_B 2ZZT_A.
Probab=23.80 E-value=20 Score=29.33 Aligned_cols=10 Identities=10% Similarity=0.421 Sum_probs=4.0
Q ss_pred HHHHHHHHhh
Q psy6892 56 LAITDLMVAV 65 (190)
Q Consensus 56 LAvaDLl~~l 65 (190)
-+.+|.+.++
T Consensus 143 ~~~~d~~~s~ 152 (284)
T PF01545_consen 143 HSLIDVLSSL 152 (284)
T ss_dssp HHHHHTS-SS
T ss_pred hcccchhHHH
Confidence 3344555443
No 141
>PF13120 DUF3974: Domain of unknown function (DUF3974)
Probab=23.58 E-value=53 Score=23.14 Aligned_cols=17 Identities=18% Similarity=0.208 Sum_probs=6.9
Q ss_pred HHHHhhhccchheeeee
Q psy6892 23 ILVFCTATGNILVCLAI 39 (190)
Q Consensus 23 ii~vlgi~gN~lvi~vi 39 (190)
+++++-++|-.+|++++
T Consensus 10 l~g~llligftivvl~v 26 (126)
T PF13120_consen 10 LIGTLLLIGFTIVVLLV 26 (126)
T ss_pred HHHHHHHHHHHHHhhhh
Confidence 33333344444444333
No 142
>PF09801 SYS1: Integral membrane protein S linking to the trans Golgi network; InterPro: IPR019185 Members of this family are integral membrane proteins involved in protein trafficking between the late Golgi and endosome. They may also serve as a receptor for ADP-ribosylation factor-related protein 1 (ARFRP1) []. Sys1p is a small integral membrane protein with four predicted transmembrane domains that localises to the Trans Golgi network TGN in yeast and human cells [].
Probab=23.50 E-value=1.3e+02 Score=22.65 Aligned_cols=10 Identities=20% Similarity=0.381 Sum_probs=4.3
Q ss_pred HHHHHHHHHH
Q psy6892 50 NYFLMSLAIT 59 (190)
Q Consensus 50 ~~fl~nLAva 59 (190)
.++++||.++
T Consensus 93 Tlh~iHli~~ 102 (144)
T PF09801_consen 93 TLHFIHLIIC 102 (144)
T ss_pred HHHHHHHHHH
Confidence 3444444433
No 143
>KOG2513|consensus
Probab=23.41 E-value=1.4e+02 Score=27.98 Aligned_cols=52 Identities=12% Similarity=0.162 Sum_probs=33.3
Q ss_pred HHHHHHHHHHHHhhhccchheeeeeeccCCCC---chhHHHHHHHHHHHHHHhhh
Q psy6892 15 NIWAVVTLILVFCTATGNILVCLAISWEKRLQ---NVTNYFLMSLAITDLMVAVL 66 (190)
Q Consensus 15 ~~~~v~~~ii~vlgi~gN~lvi~vi~~~~~L~---~~~~~fl~nLAvaDLl~~l~ 66 (190)
..+--++.++.++|++.|..+++......++. ++.+..+.-+.+-.+++++.
T Consensus 532 G~Wq~~l~~lSvls~vtncaLi~~~~~~~~~~~~~s~~~~il~~V~~EH~~l~Lk 586 (647)
T KOG2513|consen 532 GAWQNALELLSVLSCVTNCALIGMYPQVNKLWPLLSPENAILIIVILEHVLLLLK 586 (647)
T ss_pred HHHHHHHHHHHHHHHHHhHHHHhccccccccCcccchHHHHHHHHHHHHHHHHHH
Confidence 34445666777888888887776655444433 46677776666666666543
No 144
>KOG2302|consensus
Probab=23.39 E-value=76 Score=31.86 Aligned_cols=24 Identities=25% Similarity=0.459 Sum_probs=18.3
Q ss_pred CCCCchhHHHHHHHHHHHHHHhhh
Q psy6892 43 KRLQNVTNYFLMSLAITDLMVAVL 66 (190)
Q Consensus 43 ~~L~~~~~~fl~nLAvaDLl~~l~ 66 (190)
+..++....|++.+|+.|++++..
T Consensus 1191 ~ssWN~LDgflv~vsviDilvs~a 1214 (1956)
T KOG2302|consen 1191 RSSWNVLDGFLVAVSVIDILVSQA 1214 (1956)
T ss_pred HHHHHhhhHHHHHHHHHHHHHHHh
Confidence 334566788999999999998754
No 145
>PF06706 CTV_P6: Citrus tristeza virus 6-kDa protein; InterPro: IPR009584 This family consists of several Citrus tristeza virus (CTV) 6 kDa, 51 residue long hydrophobic (P6) proteins. The function of this family is unknown.
Probab=23.18 E-value=1.5e+02 Score=17.70 Aligned_cols=15 Identities=13% Similarity=0.162 Sum_probs=6.8
Q ss_pred hhccchheeeeeecc
Q psy6892 28 TATGNILVCLAISWE 42 (190)
Q Consensus 28 gi~gN~lvi~vi~~~ 42 (190)
++++-++++..++|+
T Consensus 20 AF~~~iiIvitiyrc 34 (51)
T PF06706_consen 20 AFAALIIIVITIYRC 34 (51)
T ss_pred HHHHHhEEEEEEEEe
Confidence 344444444445544
No 146
>PF09971 DUF2206: Predicted membrane protein (DUF2206); InterPro: IPR018701 This family of predicted membrane proteins from archaea has no known function.
Probab=23.14 E-value=88 Score=27.33 Aligned_cols=58 Identities=22% Similarity=0.321 Sum_probs=25.3
Q ss_pred hHHHHHHHHHHHHhhhccchheeeeeeccCCCCchhHHHHHHHHHHHHHHhhhhcccce
Q psy6892 14 WNIWAVVTLILVFCTATGNILVCLAISWEKRLQNVTNYFLMSLAITDLMVAVLVMPLGI 72 (190)
Q Consensus 14 ~~~~~v~~~ii~vlgi~gN~lvi~vi~~~~~L~~~~~~fl~nLAvaDLl~~l~~lP~~l 72 (190)
+.+...+.+++..+.++|=+.+++-..+++.... ..++..++-..=++++.+.+|.+.
T Consensus 114 ~~i~k~i~l~~~~~i~IG~l~~~~~~~~~k~~~~-~~Yl~fs~~~~iiLia~i~lP~fa 171 (367)
T PF09971_consen 114 HSISKYIHLFIQFFIIIGFLALILKRIYKKIKFN-IEYLAFSLVSLIILIASIVLPFFA 171 (367)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhh-HHHHHHHHHHHHHHHHHHhccchh
Confidence 3333444444444445554444433323222222 333444444434445555677664
No 147
>PF03839 Sec62: Translocation protein Sec62; InterPro: IPR004728 Members of the NSCC2 family have been sequenced from various yeast, fungal and animals species including Saccharomyces cerevisiae, Drosophila melanogaster and Homo sapiens. These proteins are the Sec62 proteins, believed to be associated with the Sec61 and Sec63 constituents of the general protein secretary systems of yeast microsomes. They are also the non-selective cation (NS) channels of the mammalian cytoplasmic membrane. The yeast Sec62 protein has been shown to be essential for cell growth. The mammalian NS channel proteins have been implicated in platelet derived growth factor(PGDF) dependent single channel current in fibroblasts. These channels are essentially closed in serum deprived tissue-culture cells and are specifically opened by exposure to PDGF. These channels are reported to exhibit equal selectivity for Na+, K+ and Cs+ with low permeability to Ca2+, and no permeability to anions.; GO: 0008565 protein transporter activity, 0015031 protein transport, 0016021 integral to membrane
Probab=22.93 E-value=48 Score=26.97 Aligned_cols=7 Identities=0% Similarity=-0.382 Sum_probs=3.4
Q ss_pred ccccccc
Q psy6892 79 RTNLSGA 85 (190)
Q Consensus 79 ~~~~s~~ 85 (190)
.|.++..
T Consensus 177 fWlfPNL 183 (224)
T PF03839_consen 177 FWLFPNL 183 (224)
T ss_pred EEeCCcc
Confidence 3555544
No 148
>PF10726 DUF2518: Protein of function (DUF2518); InterPro: IPR019664 This entry contains the Ycf51 protein family, which is conserved in Cyanobacteria. The function is not known.
Probab=22.79 E-value=67 Score=24.30 Aligned_cols=9 Identities=22% Similarity=0.604 Sum_probs=4.0
Q ss_pred HHHHHHHHH
Q psy6892 51 YFLMSLAIT 59 (190)
Q Consensus 51 ~fl~nLAva 59 (190)
-|+.-|+++
T Consensus 44 ~F~~vLt~g 52 (145)
T PF10726_consen 44 SFMLVLTVG 52 (145)
T ss_pred HHHHHHHHH
Confidence 344444444
No 149
>PF05751 FixH: FixH; InterPro: IPR008620 This family consists of several Rhizobium FixH like proteins. It has been suggested that the four proteins FixG, FixH, FixI, and FixS may participate in a membrane-bound complex coupling the FixI cation pump with a redox process catalysed by FixG [].
Probab=22.50 E-value=75 Score=23.22 Aligned_cols=40 Identities=13% Similarity=0.101 Sum_probs=23.9
Q ss_pred HHHHHHHhhhccchheeeeeeccCCCCchhHHHHHHHHHH
Q psy6892 20 VTLILVFCTATGNILVCLAISWEKRLQNVTNYFLMSLAIT 59 (190)
Q Consensus 20 ~~~ii~vlgi~gN~lvi~vi~~~~~L~~~~~~fl~nLAva 59 (190)
++++++++.++.|+.++++-.+.+.-....+||--.++.-
T Consensus 7 ~ii~~~~~~v~~~~~~v~~A~~~~~~lV~~dYY~~g~~y~ 46 (146)
T PF05751_consen 7 FIIAFFAVFVVANVTMVYIAISTPDGLVVDDYYEKGLAYN 46 (146)
T ss_pred hhHHhhhhEeeeeeeEEeeeccCCCCceeccHHHhhhhhh
Confidence 3444455556667666665555554334457778777765
No 150
>PF10190 Tmemb_170: Putative transmembrane protein 170; InterPro: IPR019334 This entry represents a group of putative transmembrane proteins conserved from nematodes to humans. The protein is only approximately 130 amino acids in length. The function is unknown.
Probab=21.88 E-value=50 Score=23.58 Aligned_cols=11 Identities=9% Similarity=-0.167 Sum_probs=4.3
Q ss_pred cccccccccee
Q psy6892 79 RTNLSGAVLRL 89 (190)
Q Consensus 79 ~~~~s~~lC~i 89 (190)
....+...|.+
T Consensus 74 ~~~M~~~~a~v 84 (105)
T PF10190_consen 74 GFRMSTWEAMV 84 (105)
T ss_pred CCcccHHHHHH
Confidence 33444443333
No 151
>cd08762 Cyt_b561_CYBASC3 Vertebrate cytochrome b(561), CYBASC3 gene product. Cytochrome b ascorbate-dependent 3, as found in vertebrates, which might act as a ferric-chelate reductase, catalyzing the reduction of Fe(3+) to Fe(2+), such as associated with the transport of iron from the endosome to the cytoplasm. It is assumed that this protein uses ascorbate as the electron donor. Belongs to the cytochrome b(561) family, which are secretory vesicle-specific electron transport proteins. Cytochromes b(561) are integral membrane proteins that bind two heme groups non-covalently, and may have six alpha-helical trans-membrane segments.
Probab=21.80 E-value=1.2e+02 Score=23.89 Aligned_cols=47 Identities=17% Similarity=0.124 Sum_probs=26.0
Q ss_pred HHHHHHHHhhhccchheeeeeec----cCCCCchhHHHHHHHHHHHHHHhh
Q psy6892 19 VVTLILVFCTATGNILVCLAISW----EKRLQNVTNYFLMSLAITDLMVAV 65 (190)
Q Consensus 19 v~~~ii~vlgi~gN~lvi~vi~~----~~~L~~~~~~fl~nLAvaDLl~~l 65 (190)
.+++++..+-+.|+.++++-+.+ .++.+++.+.++-.+|+.=.++++
T Consensus 39 P~lMv~Gfi~L~geAiL~Yr~~~~~~~~k~~~K~~H~~L~~~Al~~~vvGl 89 (179)
T cd08762 39 PVLMVTGMVVLYGNAALVYRIPLTWGGPKLPWKLLHAGLLLLAFILTVIGL 89 (179)
T ss_pred HHHHHHHHHHHHHHHHHhhcccccccccchhHHHHHHHHHHHHHHHHHHHH
Confidence 44555555566788777665433 223345666666666655444443
No 152
>KOG3058|consensus
Probab=21.79 E-value=43 Score=29.05 Aligned_cols=54 Identities=15% Similarity=0.226 Sum_probs=26.8
Q ss_pred HHHHHHHhhhccchheeeeeec-cCCC-CchhHHHHHH-------HHHHHHHHhhhhccccee
Q psy6892 20 VTLILVFCTATGNILVCLAISW-EKRL-QNVTNYFLMS-------LAITDLMVAVLVMPLGIL 73 (190)
Q Consensus 20 ~~~ii~vlgi~gN~lvi~vi~~-~~~L-~~~~~~fl~n-------LAvaDLl~~l~~lP~~l~ 73 (190)
+..+...++.+.|.+++.+++- .++. .+.+.+++.+ .++++++..+..+..+++
T Consensus 70 lafl~~~~~~~l~~v~l~~vHervP~~~pPLPDi~f~~vp~i~wa~~~~e~~~~~~~~~~f~l 132 (351)
T KOG3058|consen 70 LAFLYLFVAALLNSVTLVYVHERVPDPYPPLPDIFFDLVPEIPWAFSLCEIIGMILVVLLFTL 132 (351)
T ss_pred HHHHHHHHHHHHHHHHHHHhhhhcCCCCCCCCcHHHHhcccchHHHHHHHHHHHHHHHHHHHH
Confidence 3344445667788888766652 2321 1223333332 456676665444333333
No 153
>PF03073 TspO_MBR: TspO/MBR family; InterPro: IPR004307 Members of this group are involved in transmembrane signalling. In both prokaryotes and mitochondria they are localized to the outer membrane, and have been shown to bind and transport dicarboxylic tetrapyrrole intermediates of the haem biosynthetic pathway [, ]. They are associated with the major outer membrane porins (in prokaryotes) and with the voltage-dependent anion channel (in mitochondria) []. Rhodobacter sphaeroides TspO (previously CrtK) is involved in signal transduction, functioning as a negative regulator of the expression of some photosynthesis genes (PpsR/AppA repressor/antirepressor regulon). This down-regulation is believed to be in response to oxygen levels. TspO works through (or modulates) the PpsR/AppA system and acts upstream of the site of action of these regulatory proteins []. It has been suggested that the TspO regulatory pathway works by regulating the efflux of certain tetrapyrrole intermediates of the haem/bacteriochlorophyll biosynthetic pathways in response to the availability of molecular oxygen, thereby causing the accumulation of a biosynthetic intermediate that serves as a corepressor for the regulated genes []. A homologue of the TspO protein in Rhizobium meliloti (Sinorhizobium meliloti) is involved in regulating expression of the ndi locus in response to stress conditions []. There is evidence that the S. meliloti TspO acts through, or in addition to, the FixL regulatory system. In animals, the peripheral-type benzodiazepine receptor (PBR, MBR) is a mitochondrial protein (located in the outer mitochondrial membrane) characterised by its ability to bind with nanomolar affinity to a variety of benzodiazepine-like drugs, as well as to dicarboxylic tetrapyrrole intermediates of the haem biosynthetic pathway. Depending upon the tissue, it was shown to be involved in steroidogenesis, haem biosynthesis, apoptosis, cell growth and differentiation, mitochondrial respiratory control, and immune and stress response, but the precise function of the PBR remains unclear. The role of PBR in the regulation of cholesterol transport from the outer to the inner mitochondrial membrane, the rate-determining step in steroid biosynthesis, has been studied in detail. PBR is required for the binding, uptake and release, upon ligand activation, of the substrate cholesterol []. PBR forms a multimeric complex with the voltage-dependent anion channel (VDAC) [] and adenine nucleotide carrier []. Molecular modeling of PBR suggested that it might function as a channel for cholesterol. Indeed, cholesterol uptake and transport by bacterial cells was induced upon PBR expression. Mutagenesis studies identified a cholesterol recognition/interaction motif (CRAC) in the cytoplasmic C terminus of PBR [, ]. In complementation experiments, rat PBR (pk18) functionally substitutes for its homologue TspO in R. sphaeroides, negatively affecting transcription of specific photosynthesis genes []. This suggests that PBR may function as an oxygen sensor, transducing an oxygen-triggered signal leading to an adaptive cellular response. These observations suggest that fundamental aspects of this receptor and the downstream signal transduction pathway are conserved in bacteria and higher eukaryotic mitochondria. The alpha-3 subdivision of the purple bacteria is considered to be a likely source of the endosymbiont that ultimately gave rise to the mitochondrion. Therefore, it is possible that the mammalian PBR remains both evolutionarily and functionally related to the TspO of R. sphaeroides.; GO: 0016021 integral to membrane
Probab=21.73 E-value=1.9e+02 Score=21.41 Aligned_cols=20 Identities=15% Similarity=0.218 Sum_probs=11.7
Q ss_pred cCCCCchhHHHHHHHHHHHH
Q psy6892 42 EKRLQNVTNYFLMSLAITDL 61 (190)
Q Consensus 42 ~~~L~~~~~~fl~nLAvaDL 61 (190)
.+..+.+..++.++|++.-+
T Consensus 66 ~~~~~~~l~l~~~~l~ln~~ 85 (148)
T PF03073_consen 66 SPRRRRALALYAIQLALNFA 85 (148)
T ss_pred cchhHHHHHHHHHHHHHHHH
Confidence 34445566677777766543
No 154
>COG4758 Predicted membrane protein [Function unknown]
Probab=21.64 E-value=50 Score=26.98 Aligned_cols=17 Identities=12% Similarity=0.044 Sum_probs=8.1
Q ss_pred HHHHHHHHHHhhhccch
Q psy6892 17 WAVVTLILVFCTATGNI 33 (190)
Q Consensus 17 ~~v~~~ii~vlgi~gN~ 33 (190)
+.++++++.+++++++.
T Consensus 10 Ii~Ifl~l~ii~~~~~~ 26 (235)
T COG4758 10 IIAIFLALMIIQIVIET 26 (235)
T ss_pred HHHHHHHHHHHHHHHHH
Confidence 33444444455555554
No 155
>COG5521 Predicted integral membrane protein [Function unknown]
Probab=21.47 E-value=2.3e+02 Score=23.55 Aligned_cols=14 Identities=29% Similarity=0.309 Sum_probs=6.8
Q ss_pred HHHHHHHHHHHHHH
Q psy6892 50 NYFLMSLAITDLMV 63 (190)
Q Consensus 50 ~~fl~nLAvaDLl~ 63 (190)
++.+++||+--++.
T Consensus 221 ~l~lnclalP~laa 234 (275)
T COG5521 221 SLALNCLALPALAA 234 (275)
T ss_pred HHHHHhhhhHHHHH
Confidence 45555555544433
No 156
>COG2456 Uncharacterized conserved protein [Function unknown]
Probab=21.41 E-value=56 Score=23.56 Aligned_cols=40 Identities=15% Similarity=0.020 Sum_probs=21.4
Q ss_pred cchheeeeeeccCCCCchhHHHHHHHHHHHHHHhhhhccc
Q psy6892 31 GNILVCLAISWEKRLQNVTNYFLMSLAITDLMVAVLVMPL 70 (190)
Q Consensus 31 gN~lvi~vi~~~~~L~~~~~~fl~nLAvaDLl~~l~~lP~ 70 (190)
+-.++++++.|.++-+...+-++.=+++.-+++.....|.
T Consensus 19 ~l~~m~~~~~r~k~~k~s~~~ll~W~~~wv~vlifal~P~ 58 (121)
T COG2456 19 ILFLMIYVFFRLKRRKLSLSDLLFWEAFWVFVLIFALFPE 58 (121)
T ss_pred HHHHHHHHHHHHHHhcCCccchHHHHHHHHHHHHHHhcch
Confidence 3344556666666656555656666665544443334443
No 157
>cd08762 Cyt_b561_CYBASC3 Vertebrate cytochrome b(561), CYBASC3 gene product. Cytochrome b ascorbate-dependent 3, as found in vertebrates, which might act as a ferric-chelate reductase, catalyzing the reduction of Fe(3+) to Fe(2+), such as associated with the transport of iron from the endosome to the cytoplasm. It is assumed that this protein uses ascorbate as the electron donor. Belongs to the cytochrome b(561) family, which are secretory vesicle-specific electron transport proteins. Cytochromes b(561) are integral membrane proteins that bind two heme groups non-covalently, and may have six alpha-helical trans-membrane segments.
Probab=21.29 E-value=1.8e+02 Score=22.77 Aligned_cols=20 Identities=5% Similarity=0.076 Sum_probs=9.3
Q ss_pred CchhHHHHHHHHHHHHHHhh
Q psy6892 46 QNVTNYFLMSLAITDLMVAV 65 (190)
Q Consensus 46 ~~~~~~fl~nLAvaDLl~~l 65 (190)
|+-.-+..+.|-....+.+.
T Consensus 108 HSWlGl~t~~Lf~lQ~~~Gf 127 (179)
T cd08762 108 HSWVGICTVALFTCQWVMGF 127 (179)
T ss_pred HHHHHHHHHHHHHHHHHHHH
Confidence 44444444444444444443
No 158
>TIGR00815 sulP high affinity sulphate transporter 1. (2) SO42- (out) + nHCO3- (in) SO42- (in) + nHCO3- (out).
Probab=21.16 E-value=87 Score=28.76 Aligned_cols=54 Identities=11% Similarity=0.126 Sum_probs=35.9
Q ss_pred HHHHHHHHHHhhhccchheeeeeeccCCCCchhHHHHHHHHHHHHHHhhhh-ccc
Q psy6892 17 WAVVTLILVFCTATGNILVCLAISWEKRLQNVTNYFLMSLAITDLMVAVLV-MPL 70 (190)
Q Consensus 17 ~~v~~~ii~vlgi~gN~lvi~vi~~~~~L~~~~~~fl~nLAvaDLl~~l~~-lP~ 70 (190)
++...+.+.+++++.++.+.-.+......+.-.|--++.+.+++++.+++. +|.
T Consensus 261 l~~~a~~ia~v~~~e~l~~a~~~~~~~~~~~d~n~El~a~G~~N~~~~~fg~~p~ 315 (563)
T TIGR00815 261 LAPDAIAIAIVGLIESIAIARSFARMTGYKIDANQELVAQGIANIVGSFFSCYPA 315 (563)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCcHHHHHhhHHHHHHHHhCccCC
Confidence 334445566667777766665554444445567888899999999988774 444
No 159
>PF14798 Ca_hom_mod: Calcium homeostasis modulator
Probab=20.96 E-value=3.4e+02 Score=22.38 Aligned_cols=12 Identities=33% Similarity=0.293 Sum_probs=4.7
Q ss_pred HHHHHHhhhccc
Q psy6892 21 TLILVFCTATGN 32 (190)
Q Consensus 21 ~~ii~vlgi~gN 32 (190)
++++.++++.|-
T Consensus 20 ngi~~llt~g~e 31 (251)
T PF14798_consen 20 NGICALLTAGGE 31 (251)
T ss_pred HHHHHHHHHHHH
Confidence 333333344333
No 160
>KOG2450|consensus
Probab=20.89 E-value=1.3e+02 Score=27.51 Aligned_cols=42 Identities=26% Similarity=0.318 Sum_probs=26.7
Q ss_pred hccchheeeeeeccCCCCchhHHHHHHHHHHHHHHhhhhcccceeeeccccc
Q psy6892 29 ATGNILVCLAISWEKRLQNVTNYFLMSLAITDLMVAVLVMPLGILTLVRVRT 80 (190)
Q Consensus 29 i~gN~lvi~vi~~~~~L~~~~~~fl~nLAvaDLl~~l~~lP~~l~~~l~~~~ 80 (190)
..||++|+ .+..++| +..|.++.++- -..+|-.+++++.+.-
T Consensus 184 a~GNtvV~-----Kpae~tp----lsal~~~~l~~-eaG~P~GVvNii~G~G 225 (501)
T KOG2450|consen 184 AAGNTVVL-----KPAEQTP----LTALYLASLCK-EAGFPPGVVNIVPGSG 225 (501)
T ss_pred hhCcEEEe-----cCCCCCC----chHHHHHHHhH-HhcCCcceEEEccCCC
Confidence 46887764 3445555 34444455544 5688889999888754
No 161
>KOG3145|consensus
Probab=20.85 E-value=2.2e+02 Score=24.51 Aligned_cols=26 Identities=8% Similarity=0.058 Sum_probs=17.0
Q ss_pred ccccccccceeeccchhhhhhhhhcc
Q psy6892 79 RTNLSGAVLRLKKNILCENQLFKAGA 104 (190)
Q Consensus 79 ~~~~s~~lC~i~~~i~c~~slfSl~s 104 (190)
.|+++..+..+.|......++.....
T Consensus 294 gwsIgnIlLDfTGG~~slLQMilQ~~ 319 (372)
T KOG3145|consen 294 GWSIGNILLDFTGGTASLLQMILQSS 319 (372)
T ss_pred ccccccEEEEecccHHHHHHHHHHHh
Confidence 47777777777776666666655443
No 162
>PHA02910 hypothetical protein; Provisional
Probab=20.80 E-value=43 Score=24.52 Aligned_cols=23 Identities=13% Similarity=0.079 Sum_probs=16.1
Q ss_pred hHHHHHHHHHHHHhhhccchhee
Q psy6892 14 WNIWAVVTLILVFCTATGNILVC 36 (190)
Q Consensus 14 ~~~~~v~~~ii~vlgi~gN~lvi 36 (190)
...+..+.+.+|++||+|..+|-
T Consensus 23 kisitsicvslfligiiggyivg 45 (171)
T PHA02910 23 KISITSICVSLFLIGIIGGYIVG 45 (171)
T ss_pred EeehhhhHHHHHHHhhhccEEEE
Confidence 34455667778888988876653
No 163
>PRK05771 V-type ATP synthase subunit I; Validated
Probab=20.74 E-value=1.4e+02 Score=27.85 Aligned_cols=43 Identities=21% Similarity=0.157 Sum_probs=22.3
Q ss_pred HHHhhhccchheeeee---eccC--CCC---chhHHHHHHHHHHHHHHhhh
Q psy6892 24 LVFCTATGNILVCLAI---SWEK--RLQ---NVTNYFLMSLAITDLMVAVL 66 (190)
Q Consensus 24 i~vlgi~gN~lvi~vi---~~~~--~L~---~~~~~fl~nLAvaDLl~~l~ 66 (190)
-+++|=+|.+++++++ .+.+ ++. ...-.++..++++-++.+++
T Consensus 359 GmM~gD~GyGLil~l~~~~l~~~~~k~~~~~~~~~~il~~~gi~sii~G~l 409 (646)
T PRK05771 359 GMMLGDAGYGLLLLLIGLLLSFKLKKKSEGLKRLLKILIYLGISTIIWGLL 409 (646)
T ss_pred HHHHHhHHHHHHHHHHHHHHHHhcccccHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3355666776665442 1221 111 22345556777777777655
No 164
>PF12597 DUF3767: Protein of unknown function (DUF3767); InterPro: IPR022533 This group of proteins includes mitochodrial cytochrome c oxidase proteins [], and some transmembrane domain-containing proteins of unknown function known as FAM36A. Proteins in this family are typically between 112 and 199 amino acids in length.
Probab=20.72 E-value=1.1e+02 Score=22.25 Aligned_cols=16 Identities=13% Similarity=0.179 Sum_probs=8.1
Q ss_pred CCCchhHHHHHHHHHH
Q psy6892 44 RLQNVTNYFLMSLAIT 59 (190)
Q Consensus 44 ~L~~~~~~fl~nLAva 59 (190)
+.++..|+-+...++.
T Consensus 64 ~~~~A~nwavgsF~l~ 79 (118)
T PF12597_consen 64 NPRKAANWAVGSFFLG 79 (118)
T ss_pred CCccchhhhhHHHHHH
Confidence 4455555555544433
No 165
>TIGR01297 CDF cation diffusion facilitator family transporter. This model describes a broadly distributed family of transporters, a number of which have been shown to transport divalent cations of cobalt, cadmium and/or zinc. The family has six predicted transmembrane domains. Members of the family are variable in length because of variably sized inserts, often containing low-complexity sequence.
Probab=20.55 E-value=30 Score=28.06 Aligned_cols=42 Identities=14% Similarity=0.137 Sum_probs=17.0
Q ss_pred HHhhhccchheeeeeec-cCCCCchhHHHHHHHHHHHHHHhhh
Q psy6892 25 VFCTATGNILVCLAISW-EKRLQNVTNYFLMSLAITDLMVAVL 66 (190)
Q Consensus 25 ~vlgi~gN~lvi~vi~~-~~~L~~~~~~fl~nLAvaDLl~~l~ 66 (190)
.++|++.|+++.+...+ .++.+++.-.-..--..+|++.++.
T Consensus 100 ~~~~~~v~~~~~~~~~~~~~~~~s~~l~a~~~~~~~D~~~s~~ 142 (268)
T TIGR01297 100 AIVGLIVNLILALYLHRVGHRLGSLALRAAALHVLSDALSSVG 142 (268)
T ss_pred HHHHHHHHHHHHHHHHHhCccCCCHHHHHHHHHHHHHHHHHHH
Confidence 33444555544333322 2233344333333333456555443
No 166
>KOG1734|consensus
Probab=20.41 E-value=1e+02 Score=25.96 Aligned_cols=34 Identities=9% Similarity=0.064 Sum_probs=21.0
Q ss_pred CCCCCchhhHHHHHHHHHHHHhhhccchheeeee
Q psy6892 6 SFGLDAASWNIWAVVTLILVFCTATGNILVCLAI 39 (190)
Q Consensus 6 ~~~~~~~~~~~~~v~~~ii~vlgi~gN~lvi~vi 39 (190)
+-..+.-.+.++..+|.+-+++|++|-++++.+.
T Consensus 129 ~g~tpRlVYkwFl~lyklSy~~g~vGyl~im~~~ 162 (328)
T KOG1734|consen 129 SGDTPRLVYKWFLFLYKLSYLLGVVGYLAIMFAQ 162 (328)
T ss_pred CCCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3333344455666777777888888866655443
No 167
>KOG4452|consensus
Probab=20.27 E-value=2.4e+02 Score=18.53 Aligned_cols=23 Identities=22% Similarity=0.322 Sum_probs=12.1
Q ss_pred eeeeccCCCCchhHHHHHHHHHH
Q psy6892 37 LAISWEKRLQNVTNYFLMSLAIT 59 (190)
Q Consensus 37 ~vi~~~~~L~~~~~~fl~nLAva 59 (190)
+.+...++-|+...-++++|..+
T Consensus 39 ~~VtStKy~r~l~KELlIsl~aS 61 (79)
T KOG4452|consen 39 IQVTSTKYNRNLLKELLISLTAS 61 (79)
T ss_pred eeEecchhhHHHHHHHHHHHHHH
Confidence 33345555556555566665533
No 168
>TIGR00834 ae anion exchange protein. They preferentially catalyze anion exchange (antiport) reactions, typically acting as HCO3-:Cl- antiporters, but also transporting a range of other inorganic and organic anions. Additionally, renal Na+:HCO3- cotransporters have been found to be members of the AE family. They catalyze the reabsorption of HCO3- in the renal proximal tubule.
Probab=20.10 E-value=1.7e+02 Score=28.93 Aligned_cols=42 Identities=14% Similarity=0.114 Sum_probs=29.7
Q ss_pred cchheeeeeeccCCCCchh--HHHHHHHHHHHHHHhhhhcccce
Q psy6892 31 GNILVCLAISWEKRLQNVT--NYFLMSLAITDLMVAVLVMPLGI 72 (190)
Q Consensus 31 gN~lvi~vi~~~~~L~~~~--~~fl~nLAvaDLl~~l~~lP~~l 72 (190)
-|+..+++....++|++++ |+=++-|++..++++++++|+..
T Consensus 672 ~nIts~iv~~~e~kLkKgsgyH~Dllllg~~~~v~sllGLPw~~ 715 (900)
T TIGR00834 672 QQITTLIVSKKERKLKKGSGFHLDLLLVVGMGGVAALFGLPWLS 715 (900)
T ss_pred hhhHHHHhcCccccCCCCcccchHHHHHHHHHHHHHhcCCCccc
Confidence 4555566666667788765 56666677788888888888654
Done!