Query psy12432
Match_columns 229
No_of_seqs 157 out of 1495
Neff 5.7
Searched_HMMs 46136
Date Fri Aug 16 19:08:19 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy12432.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/12432hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1219|consensus 99.4 5E-13 1.1E-17 141.9 10.7 170 22-208 3884-4102(4289)
2 PHA03099 epidermal growth fact 99.3 1.4E-12 3.1E-17 104.7 5.8 90 24-119 41-130 (139)
3 PF02158 Neuregulin: Neureguli 98.8 1.4E-09 3E-14 101.2 0.1 83 79-166 5-106 (404)
4 PHA02887 EGF-like protein; Pro 98.4 1.5E-07 3.3E-12 74.8 3.3 60 8-71 66-125 (126)
5 KOG1219|consensus 98.2 1.7E-06 3.7E-11 93.9 6.0 96 16-119 3916-4025(4289)
6 KOG4289|consensus 97.9 1.5E-05 3.2E-10 84.2 5.0 53 21-77 1219-1283(2531)
7 PF00008 EGF: EGF-like domain 97.7 3E-05 6.4E-10 48.1 2.9 29 35-66 4-32 (32)
8 smart00179 EGF_CA Calcium-bind 96.8 0.0018 4E-08 40.1 3.4 30 35-68 9-39 (39)
9 cd00054 EGF_CA Calcium-binding 96.5 0.0037 8.1E-08 38.0 3.3 30 35-68 9-38 (38)
10 KOG4289|consensus 96.2 0.0048 1E-07 66.1 4.3 52 16-70 1252-1318(2531)
11 cd00053 EGF Epidermal growth f 96.1 0.008 1.7E-07 35.9 3.4 30 35-68 6-36 (36)
12 PF12955 DUF3844: Domain of un 96.0 0.018 3.9E-07 45.2 5.7 38 35-72 13-64 (103)
13 PF02439 Adeno_E3_CR2: Adenovi 95.8 0.014 2.9E-07 37.9 3.4 21 86-106 4-24 (38)
14 smart00181 EGF Epidermal growt 95.2 0.035 7.5E-07 33.8 3.7 29 35-68 6-35 (35)
15 PF14575 EphA2_TM: Ephrin type 95.2 0.0094 2E-07 44.0 1.3 12 152-163 54-65 (75)
16 PF01102 Glycophorin_A: Glycop 94.8 0.036 7.7E-07 44.7 3.7 29 87-117 66-94 (122)
17 PF07974 EGF_2: EGF-like domai 94.7 0.049 1.1E-06 33.8 3.4 27 35-67 6-32 (32)
18 PF12661 hEGF: Human growth fa 94.5 0.017 3.7E-07 29.1 0.8 13 55-67 1-13 (13)
19 PF14991 MLANA: Protein melan- 94.0 0.0097 2.1E-07 47.4 -1.1 31 91-121 26-56 (118)
20 PF12273 RCR: Chitin synthesis 94.0 0.052 1.1E-06 43.6 3.1 6 171-176 78-83 (130)
21 PF05454 DAG1: Dystroglycan (D 93.4 0.022 4.7E-07 52.2 0.0 10 141-150 186-195 (290)
22 PHA03099 epidermal growth fact 93.1 0.17 3.6E-06 41.3 4.6 22 19-40 62-83 (139)
23 KOG1214|consensus 93.0 0.096 2.1E-06 54.1 3.7 46 21-70 806-866 (1289)
24 PF13908 Shisa: Wnt and FGF in 92.6 0.26 5.7E-06 41.4 5.4 14 88-101 78-91 (179)
25 PF01034 Syndecan: Syndecan do 91.3 0.051 1.1E-06 39.1 -0.3 19 88-106 12-30 (64)
26 PF07645 EGF_CA: Calcium-bindi 90.0 0.37 8.1E-06 31.1 2.9 31 27-63 4-34 (42)
27 PF05393 Hum_adeno_E3A: Human 89.2 0.5 1.1E-05 36.2 3.5 15 96-110 41-55 (94)
28 PF15069 FAM163: FAM163 family 89.0 1.4 3E-05 36.5 6.2 24 88-111 6-29 (143)
29 PF04478 Mid2: Mid2 like cell 88.8 0.025 5.5E-07 47.2 -4.1 26 91-118 55-80 (154)
30 PF07204 Orthoreo_P10: Orthore 88.7 0.42 9.2E-06 36.9 2.8 32 87-119 42-73 (98)
31 KOG2052|consensus 88.5 1.3 2.7E-05 43.3 6.5 32 16-47 27-60 (513)
32 PF12662 cEGF: Complement Clr- 88.4 0.35 7.5E-06 28.3 1.7 21 23-45 1-21 (24)
33 PF08374 Protocadherin: Protoc 87.4 1.4 3E-05 38.9 5.5 10 183-192 146-155 (221)
34 KOG3516|consensus 87.0 0.42 9E-06 51.0 2.5 40 23-70 543-583 (1306)
35 PF12877 DUF3827: Domain of un 86.2 0.44 9.4E-06 48.0 2.0 29 90-118 271-299 (684)
36 KOG3514|consensus 86.2 0.41 8.9E-06 50.7 1.9 34 34-71 628-662 (1591)
37 PF00558 Vpu: Vpu protein; In 85.4 0.97 2.1E-05 34.0 3.1 26 95-120 13-38 (81)
38 KOG3607|consensus 84.8 1.4 2.9E-05 45.3 4.8 29 37-72 632-660 (716)
39 PF15330 SIT: SHP2-interacting 84.7 1.1 2.5E-05 35.2 3.4 7 107-113 19-25 (107)
40 PF05545 FixQ: Cbb3-type cytoc 84.2 2.6 5.7E-05 28.1 4.5 22 93-114 12-33 (49)
41 KOG1225|consensus 83.3 1.3 2.7E-05 43.9 3.7 38 25-70 297-343 (525)
42 PF08693 SKG6: Transmembrane a 82.9 0.11 2.4E-06 34.1 -2.5 18 88-105 11-28 (40)
43 KOG1225|consensus 81.9 1.8 4E-05 42.8 4.3 14 55-68 297-310 (525)
44 KOG1217|consensus 80.7 2.4 5.3E-05 38.8 4.5 42 22-67 250-306 (487)
45 PF01299 Lamp: Lysosome-associ 80.4 0.79 1.7E-05 41.8 1.1 28 87-116 272-299 (306)
46 PF05568 ASFV_J13L: African sw 79.9 3.2 6.8E-05 34.9 4.4 15 105-119 46-60 (189)
47 KOG0994|consensus 79.7 1.5 3.2E-05 47.1 2.9 50 18-69 879-949 (1758)
48 PF12191 stn_TNFRSF12A: Tumour 79.1 0.61 1.3E-05 37.9 0.0 27 88-114 79-105 (129)
49 PF07213 DAP10: DAP10 membrane 79.0 2.5 5.4E-05 31.7 3.2 18 101-118 48-65 (79)
50 PF14584 DUF4446: Protein of u 78.8 3.4 7.5E-05 34.4 4.4 25 91-115 4-28 (151)
51 cd01324 cbb3_Oxidase_CcoQ Cyto 78.5 3.2 6.9E-05 28.1 3.3 32 88-119 8-39 (48)
52 PF12947 EGF_3: EGF domain; I 78.3 1.8 4E-05 27.4 2.0 27 35-65 6-32 (36)
53 KOG1094|consensus 78.2 18 0.00039 37.0 9.7 32 88-119 390-421 (807)
54 TIGR01478 STEVOR variant surfa 78.0 2.7 5.9E-05 38.6 3.8 7 24-30 143-149 (295)
55 PTZ00370 STEVOR; Provisional 77.6 2.7 5.9E-05 38.6 3.7 9 24-32 143-151 (296)
56 PF12301 CD99L2: CD99 antigen 77.5 2.5 5.4E-05 35.9 3.2 29 88-116 114-142 (169)
57 KOG3637|consensus 77.4 1 2.2E-05 48.0 0.9 33 85-117 976-1010(1030)
58 PF08374 Protocadherin: Protoc 77.2 1.1 2.4E-05 39.5 1.0 21 88-108 41-61 (221)
59 PF06697 DUF1191: Protein of u 76.6 0.9 2E-05 41.5 0.3 13 87-99 212-224 (278)
60 PF03302 VSP: Giardia variant- 76.6 0.65 1.4E-05 44.2 -0.7 12 85-97 364-375 (397)
61 PF02009 Rifin_STEVOR: Rifin/s 76.5 4.4 9.5E-05 37.4 4.8 12 110-121 276-287 (299)
62 KOG1217|consensus 75.7 3.3 7.2E-05 37.9 3.8 45 22-70 150-208 (487)
63 PF08114 PMP1_2: ATPase proteo 74.4 4.6 0.0001 26.7 3.1 23 96-118 17-39 (43)
64 PF02038 ATP1G1_PLM_MAT8: ATP1 74.1 4.4 9.6E-05 27.8 3.1 24 96-119 22-45 (50)
65 PF06305 DUF1049: Protein of u 73.6 10 0.00022 26.4 5.0 26 90-115 24-49 (68)
66 KOG1836|consensus 72.0 2.5 5.5E-05 47.1 2.4 45 26-72 759-816 (1705)
67 PRK00523 hypothetical protein; 71.5 11 0.00024 27.7 4.9 17 90-106 6-22 (72)
68 PF06084 Cytomega_TRL10: Cytom 71.5 2.5 5.5E-05 34.1 1.7 19 152-170 118-136 (150)
69 COG4736 CcoQ Cbb3-type cytochr 70.8 8.9 0.00019 27.3 4.1 35 95-129 14-51 (60)
70 PF11770 GAPT: GRB2-binding ad 70.7 1.2 2.5E-05 37.3 -0.4 16 88-103 10-25 (158)
71 smart00051 DSL delta serrate l 70.4 3.5 7.6E-05 29.3 2.1 41 24-67 17-63 (63)
72 PF15102 TMEM154: TMEM154 prot 69.7 1.2 2.5E-05 37.1 -0.6 6 114-119 83-88 (146)
73 PF12768 Rax2: Cortical protei 68.6 4.4 9.5E-05 36.9 2.8 8 92-99 234-241 (281)
74 PF06365 CD34_antigen: CD34/Po 68.5 3.6 7.7E-05 36.0 2.1 20 98-117 111-130 (202)
75 PF15065 NCU-G1: Lysosomal tra 68.0 2 4.4E-05 40.4 0.5 28 91-118 322-349 (350)
76 COG1862 YajC Preprotein transl 66.8 5.9 0.00013 30.7 2.8 19 100-118 15-33 (97)
77 PF15345 TMEM51: Transmembrane 65.6 8.7 0.00019 34.3 4.0 25 142-166 120-144 (233)
78 KOG3516|consensus 65.1 7.3 0.00016 42.0 3.9 33 35-71 961-994 (1306)
79 PF10577 UPF0560: Uncharacteri 65.1 6.1 0.00013 40.9 3.3 11 36-46 220-230 (807)
80 PF15050 SCIMP: SCIMP protein 64.9 12 0.00025 30.4 4.2 9 107-115 26-34 (133)
81 KOG1226|consensus 64.8 7.5 0.00016 40.1 3.8 49 18-70 571-622 (783)
82 PF01299 Lamp: Lysosome-associ 64.7 4 8.7E-05 37.2 1.8 32 89-121 270-301 (306)
83 PF14610 DUF4448: Protein of u 64.6 1.2 2.5E-05 37.9 -1.6 22 89-110 159-180 (189)
84 PRK11901 hypothetical protein; 64.4 7.2 0.00016 36.5 3.3 19 89-107 37-55 (327)
85 PF11980 DUF3481: Domain of un 64.1 7.4 0.00016 29.6 2.8 29 87-115 16-45 (87)
86 PF04971 Lysis_S: Lysis protei 63.1 12 0.00026 27.3 3.6 15 91-105 35-49 (68)
87 PF10873 DUF2668: Protein of u 62.9 4 8.6E-05 34.0 1.2 29 86-114 62-90 (155)
88 PHA03265 envelope glycoprotein 62.2 3.8 8.3E-05 38.7 1.2 15 108-122 368-382 (402)
89 KOG0196|consensus 61.5 7.8 0.00017 40.5 3.3 17 147-163 602-618 (996)
90 PF05568 ASFV_J13L: African sw 60.9 18 0.0004 30.4 4.8 36 86-121 30-65 (189)
91 PF06143 Baculo_11_kDa: Baculo 60.6 27 0.00059 26.5 5.3 27 85-111 34-60 (84)
92 PF02480 Herpes_gE: Alphaherpe 59.9 3 6.4E-05 40.4 0.0 31 91-121 354-384 (439)
93 PF14851 FAM176: FAM176 family 59.7 7 0.00015 32.8 2.2 20 88-107 24-43 (153)
94 PTZ00046 rifin; Provisional 59.6 16 0.00035 34.6 4.8 16 93-108 323-338 (358)
95 PRK12705 hypothetical protein; 59.2 12 0.00026 37.0 4.0 29 91-119 5-33 (508)
96 PF15099 PIRT: Phosphoinositid 59.0 8.4 0.00018 31.4 2.4 11 101-111 93-103 (129)
97 PF14914 LRRC37AB_C: LRRC37A/B 58.9 12 0.00025 31.4 3.3 27 90-116 123-149 (154)
98 KOG3488|consensus 58.8 16 0.00035 27.0 3.6 21 88-108 48-68 (81)
99 TIGR02976 phageshock_pspB phag 58.2 16 0.00035 27.0 3.7 12 104-115 18-29 (75)
100 PF12597 DUF3767: Protein of u 58.0 24 0.00052 28.2 4.9 27 95-121 72-98 (118)
101 PHA02902 putative IMV membrane 57.6 15 0.00032 26.7 3.2 22 97-118 9-30 (70)
102 TIGR01477 RIFIN variant surfac 57.5 18 0.00038 34.3 4.7 14 108-121 328-341 (353)
103 PF03302 VSP: Giardia variant- 57.2 2.9 6.4E-05 39.7 -0.5 27 86-112 368-395 (397)
104 PF05083 LST1: LST-1 protein; 56.9 11 0.00023 27.8 2.5 9 108-116 16-24 (74)
105 PF11884 DUF3404: Domain of un 56.9 22 0.00049 32.3 5.1 10 105-114 248-257 (262)
106 PF10873 DUF2668: Protein of u 56.6 25 0.00054 29.4 4.8 27 88-114 60-86 (155)
107 PF04478 Mid2: Mid2 like cell 56.4 0.96 2.1E-05 37.9 -3.4 31 90-121 50-80 (154)
108 PF12259 DUF3609: Protein of u 56.4 8.1 0.00018 36.5 2.3 17 100-116 311-327 (361)
109 PF15099 PIRT: Phosphoinositid 55.2 11 0.00024 30.6 2.6 29 91-119 87-115 (129)
110 PF15330 SIT: SHP2-interacting 54.8 19 0.00042 28.3 3.8 21 100-120 9-29 (107)
111 PRK10905 cell division protein 53.4 8.3 0.00018 36.0 1.8 13 93-105 3-15 (328)
112 PF00053 Laminin_EGF: Laminin 53.1 9.7 0.00021 25.0 1.7 23 41-69 11-33 (49)
113 PHA03265 envelope glycoprotein 53.1 6.9 0.00015 37.1 1.2 39 83-121 346-384 (402)
114 PF05399 EVI2A: Ectropic viral 52.8 31 0.00068 30.6 5.1 10 62-71 116-125 (227)
115 PF01102 Glycophorin_A: Glycop 51.4 21 0.00045 28.8 3.6 29 92-121 67-95 (122)
116 KOG1214|consensus 51.1 15 0.00032 38.7 3.3 41 24-65 757-820 (1289)
117 PF06247 Plasmod_Pvs28: Plasmo 51.1 8.5 0.00018 33.5 1.4 42 23-64 19-80 (197)
118 TIGR00847 ccoS cytochrome oxid 50.5 22 0.00048 24.4 3.1 21 94-114 8-28 (51)
119 PTZ00045 apical membrane antig 50.4 48 0.001 33.4 6.6 29 23-64 459-487 (595)
120 PF07423 DUF1510: Protein of u 50.4 12 0.00026 33.1 2.2 12 94-105 20-31 (217)
121 PRK00523 hypothetical protein; 50.2 32 0.0007 25.3 4.1 34 91-124 3-37 (72)
122 PF15176 LRR19-TM: Leucine-ric 49.9 26 0.00057 27.5 3.8 23 86-108 16-38 (102)
123 COG4499 Predicted membrane pro 49.6 20 0.00043 34.5 3.6 29 92-120 223-251 (434)
124 PF15048 OSTbeta: Organic solu 48.6 16 0.00035 29.6 2.6 32 87-118 35-66 (125)
125 PRK09458 pspB phage shock prot 48.2 18 0.0004 26.8 2.6 17 98-114 12-28 (75)
126 cd00055 EGF_Lam Laminin-type e 47.3 15 0.00033 24.3 1.9 17 53-69 18-34 (50)
127 PRK14748 kdpF potassium-transp 46.6 39 0.00085 20.6 3.3 14 91-104 6-19 (29)
128 PF02439 Adeno_E3_CR2: Adenovi 46.3 42 0.0009 21.8 3.7 16 92-107 6-21 (38)
129 PF15347 PAG: Phosphoprotein a 46.3 26 0.00057 33.5 3.9 26 96-121 24-49 (428)
130 PF02699 YajC: Preprotein tran 46.2 40 0.00086 24.9 4.2 10 112-121 24-33 (82)
131 COG3763 Uncharacterized protei 45.8 57 0.0012 24.0 4.8 14 92-105 8-21 (71)
132 PF05808 Podoplanin: Podoplani 45.7 6.9 0.00015 33.1 0.0 12 88-99 132-143 (162)
133 PTZ00208 65 kDa invariant surf 45.7 12 0.00027 36.0 1.7 31 86-116 384-414 (436)
134 PF05510 Sarcoglycan_2: Sarcog 45.5 20 0.00043 34.4 3.0 25 91-115 290-314 (386)
135 KOG4260|consensus 45.1 18 0.00038 33.5 2.5 43 27-71 143-185 (350)
136 PF06305 DUF1049: Protein of u 44.4 70 0.0015 22.0 5.1 7 92-98 30-36 (68)
137 PHA02681 ORF089 virion membran 44.0 37 0.0008 25.9 3.7 18 98-115 10-27 (92)
138 PF05283 MGC-24: Multi-glycosy 43.9 21 0.00045 30.9 2.6 29 84-113 157-185 (186)
139 PF14575 EphA2_TM: Ephrin type 43.2 11 0.00024 27.6 0.8 28 91-118 3-30 (75)
140 PF05393 Hum_adeno_E3A: Human 42.7 35 0.00076 26.2 3.4 28 93-120 34-62 (94)
141 KOG3653|consensus 42.7 1.2E+02 0.0025 30.3 7.8 6 181-186 286-291 (534)
142 PRK13855 type IV secretion sys 42.1 68 0.0015 30.7 6.0 9 100-108 39-47 (376)
143 KOG1226|consensus 42.1 25 0.00054 36.4 3.3 20 25-44 606-626 (783)
144 PF03597 CcoS: Cytochrome oxid 41.7 39 0.00084 22.5 3.2 20 95-114 8-27 (45)
145 PF07010 Endomucin: Endomucin; 40.8 35 0.00076 30.6 3.6 6 108-113 210-215 (259)
146 PF11694 DUF3290: Protein of u 40.8 33 0.00072 28.5 3.3 19 95-113 23-41 (149)
147 PF04863 EGF_alliinase: Alliin 40.8 10 0.00023 26.6 0.3 36 36-71 18-53 (56)
148 PF04689 S1FA: DNA binding pro 40.6 23 0.0005 25.7 2.0 14 100-113 26-39 (69)
149 PF00558 Vpu: Vpu protein; In 40.4 37 0.0008 25.6 3.2 24 98-121 19-42 (81)
150 PF09835 DUF2062: Uncharacteri 40.4 67 0.0015 25.9 5.1 19 91-109 122-140 (154)
151 KOG1024|consensus 40.4 20 0.00044 35.0 2.2 29 91-119 188-216 (563)
152 PRK01658 holin-like protein; V 40.4 59 0.0013 26.0 4.6 22 87-108 88-109 (122)
153 KOG4482|consensus 39.9 31 0.00068 33.1 3.4 28 91-118 302-329 (449)
154 TIGR01195 oadG_fam sodium pump 39.9 67 0.0014 23.9 4.6 10 91-100 12-21 (82)
155 PF07271 Cytadhesin_P30: Cytad 39.3 15 0.00032 33.6 1.1 9 140-148 148-156 (279)
156 PF14991 MLANA: Protein melan- 39.2 6 0.00013 31.7 -1.2 33 86-119 25-57 (118)
157 PF10883 DUF2681: Protein of u 39.1 52 0.0011 25.1 3.9 8 107-114 20-27 (87)
158 PRK01844 hypothetical protein; 38.9 79 0.0017 23.3 4.7 25 100-124 12-36 (72)
159 KOG3653|consensus 38.2 1E+02 0.0022 30.7 6.7 9 53-61 115-123 (534)
160 PF11157 DUF2937: Protein of u 38.1 51 0.0011 27.7 4.1 17 85-101 131-147 (167)
161 PF03908 Sec20: Sec20; InterP 38.0 38 0.00082 25.3 3.0 15 100-114 77-91 (92)
162 PF10883 DUF2681: Protein of u 37.4 85 0.0018 23.9 4.8 26 90-115 6-31 (87)
163 PF07297 DPM2: Dolichol phosph 37.3 59 0.0013 24.3 3.9 28 89-116 47-75 (78)
164 PF09472 MtrF: Tetrahydrometha 37.0 41 0.00089 24.2 2.9 20 87-106 42-61 (64)
165 PF09835 DUF2062: Uncharacteri 36.3 72 0.0016 25.7 4.7 12 88-99 123-134 (154)
166 smart00180 EGF_Lam Laminin-typ 36.3 27 0.00059 22.8 1.8 16 54-69 18-33 (46)
167 PF01414 DSL: Delta serrate li 34.7 19 0.0004 25.6 0.8 41 24-67 17-63 (63)
168 PF03554 Herpes_UL73: UL73 vir 34.7 1.2E+02 0.0025 22.9 5.1 29 86-114 47-75 (82)
169 PF12768 Rax2: Cortical protei 34.5 41 0.00089 30.7 3.2 12 88-99 234-245 (281)
170 PHA02887 EGF-like protein; Pro 34.5 32 0.00069 27.9 2.2 21 19-39 103-123 (126)
171 PTZ00233 variable surface prot 34.4 28 0.00061 34.5 2.3 27 95-121 443-469 (509)
172 KOG3054|consensus 34.3 41 0.00089 30.6 3.1 18 165-184 78-96 (299)
173 PRK13275 mtrF tetrahydromethan 34.2 78 0.0017 23.0 4.0 20 87-106 42-61 (67)
174 PF02430 AMA-1: Apical membran 32.6 15 0.00032 35.9 0.0 31 87-117 429-459 (471)
175 PF14670 FXa_inhibition: Coagu 32.4 45 0.00097 21.0 2.2 18 42-63 11-28 (36)
176 PF11395 DUF2873: Protein of u 32.4 61 0.0013 21.1 2.8 11 98-108 18-28 (43)
177 PRK11486 flagellar biosynthesi 32.3 59 0.0013 26.3 3.4 19 97-115 24-42 (124)
178 PF13124 DUF3963: Protein of u 31.7 84 0.0018 20.3 3.3 21 87-107 18-38 (40)
179 COG3115 ZipA Cell division pro 31.7 1.6E+02 0.0035 27.5 6.5 10 208-217 133-142 (324)
180 PF01034 Syndecan: Syndecan do 31.7 17 0.00036 26.3 0.1 17 89-105 17-33 (64)
181 PF14584 DUF4446: Protein of u 31.6 1E+02 0.0023 25.5 4.9 23 93-115 9-31 (151)
182 PF12158 DUF3592: Protein of u 31.5 70 0.0015 24.9 3.8 9 183-191 101-109 (148)
183 cd01475 vWA_Matrilin VWA_Matri 31.2 50 0.0011 28.2 3.1 36 22-65 180-219 (224)
184 PF15339 Afaf: Acrosome format 31.0 1E+02 0.0022 26.7 4.7 6 91-96 133-138 (200)
185 PF13703 PepSY_TM_2: PepSY-ass 31.0 80 0.0017 23.2 3.8 15 91-105 19-33 (88)
186 PF05356 Phage_Coat_B: Phage C 30.9 91 0.002 23.3 3.9 23 91-113 59-81 (83)
187 PF13268 DUF4059: Protein of u 30.7 74 0.0016 23.5 3.4 19 100-118 20-38 (72)
188 PF12606 RELT: Tumour necrosis 30.7 66 0.0014 22.0 3.0 17 99-115 10-26 (50)
189 TIGR00859 ENaC sodium channel 30.0 3.5E+02 0.0076 27.2 9.2 22 178-200 564-585 (595)
190 PRK13707 conjugal transfer pil 29.7 1.2E+02 0.0025 23.5 4.6 14 106-119 53-66 (101)
191 KOG3054|consensus 29.6 52 0.0011 30.0 2.9 8 110-117 21-28 (299)
192 PF13131 DUF3951: Protein of u 29.5 58 0.0013 22.6 2.5 19 96-114 11-29 (53)
193 TIGR02736 cbb3_Q_epsi cytochro 28.5 74 0.0016 22.4 2.9 19 100-118 11-29 (56)
194 PRK04125 murein hydrolase regu 28.4 1.2E+02 0.0027 24.9 4.8 21 87-107 91-111 (141)
195 CHL00008 petG cytochrome b6/f 28.4 1.1E+02 0.0023 19.7 3.4 24 91-114 6-31 (37)
196 PF07271 Cytadhesin_P30: Cytad 28.3 41 0.00088 30.8 2.1 13 93-105 78-90 (279)
197 COG4218 MtrF Tetrahydromethano 28.3 60 0.0013 23.9 2.5 19 87-105 51-69 (73)
198 PF07178 TraL: TraL protein; 28.0 1.3E+02 0.0027 22.7 4.5 10 108-117 49-58 (95)
199 PHA03283 envelope glycoprotein 27.9 36 0.00079 33.9 1.8 19 99-117 412-430 (542)
200 PF09064 Tme5_EGF_like: Thromb 27.8 32 0.00068 21.8 0.9 10 23-32 17-26 (34)
201 PF11353 DUF3153: Protein of u 27.7 59 0.0013 28.0 2.9 14 103-116 195-208 (209)
202 PF04415 DUF515: Protein of un 27.7 66 0.0014 31.2 3.5 26 90-115 32-57 (416)
203 PHA03163 hypothetical protein; 27.6 2.4E+02 0.0052 21.7 5.8 29 86-114 56-84 (92)
204 TIGR01432 QOXA cytochrome aa3 27.5 1.3E+02 0.0029 26.0 5.1 10 106-115 42-51 (217)
205 PRK00665 petG cytochrome b6-f 27.2 1.1E+02 0.0025 19.6 3.4 24 91-114 6-31 (37)
206 PRK09400 secE preprotein trans 27.2 1.4E+02 0.003 21.1 4.2 26 85-110 30-55 (61)
207 KOG3514|consensus 27.0 47 0.001 36.1 2.5 25 87-111 1514-1538(1591)
208 PF02404 SCF: Stem cell factor 27.0 21 0.00046 32.5 0.0 20 96-115 223-242 (273)
209 PF03988 DUF347: Repeat of Unk 26.9 92 0.002 21.3 3.2 10 101-110 38-47 (55)
210 PF11431 Transport_MerF: Membr 26.7 77 0.0017 21.4 2.7 18 100-117 28-45 (46)
211 PF10669 Phage_Gp23: Protein g 26.4 1.8E+02 0.0038 22.9 5.0 12 95-106 23-34 (121)
212 TIGR00383 corA magnesium Mg(2+ 26.3 72 0.0016 28.6 3.4 14 103-116 303-316 (318)
213 PF04156 IncA: IncA protein; 26.3 96 0.0021 25.8 3.9 11 100-110 50-60 (191)
214 PHA03231 glycoprotein BALF4; P 26.2 1E+02 0.0023 32.4 4.8 13 106-118 718-730 (829)
215 PF11174 DUF2970: Protein of u 26.1 1.1E+02 0.0024 21.3 3.5 19 87-105 31-49 (56)
216 PF11743 DUF3301: Protein of u 26.0 90 0.002 23.8 3.4 17 102-118 8-24 (97)
217 PF13179 DUF4006: Family of un 25.9 1.4E+02 0.003 21.7 4.1 18 101-118 24-41 (66)
218 COG3088 CcmH Uncharacterized p 25.7 1.1E+02 0.0024 25.7 4.0 18 91-108 108-125 (153)
219 TIGR00327 secE_euk_arch protei 25.5 1.8E+02 0.0039 20.6 4.6 26 85-110 26-51 (61)
220 PF03408 Foamy_virus_ENV: Foam 25.5 42 0.00091 35.2 1.8 9 93-101 957-965 (981)
221 PF14163 SieB: Superinfection 25.4 1.7E+02 0.0036 23.7 5.0 21 87-107 35-55 (151)
222 PHA03185 UL14 tegument protein 25.2 88 0.0019 27.5 3.5 31 181-211 171-201 (214)
223 PF00954 S_locus_glycop: S-loc 25.0 61 0.0013 24.6 2.3 26 35-65 84-109 (110)
224 PF11157 DUF2937: Protein of u 25.0 85 0.0018 26.4 3.3 18 87-104 137-154 (167)
225 PF12877 DUF3827: Domain of un 24.6 55 0.0012 33.5 2.4 27 87-114 272-298 (684)
226 PF11359 gpUL132: Glycoprotein 24.5 64 0.0014 28.7 2.5 30 88-118 55-84 (235)
227 PHA03164 hypothetical protein; 24.1 45 0.00097 25.1 1.3 25 89-115 61-85 (88)
228 PRK08455 fliL flagellar basal 24.0 74 0.0016 27.1 2.8 6 106-111 36-41 (182)
229 COG5353 Uncharacterized protei 23.9 45 0.00097 28.1 1.4 21 91-111 8-28 (161)
230 KOG3498|consensus 23.9 1.5E+02 0.0031 21.6 3.8 25 84-108 30-54 (67)
231 PF06809 NPDC1: Neural prolife 23.8 18 0.0004 33.8 -1.0 9 142-150 270-278 (341)
232 PRK01026 tetrahydromethanopter 23.7 1.6E+02 0.0035 22.0 4.1 19 88-106 50-68 (77)
233 PHA03156 hypothetical protein; 23.7 3.1E+02 0.0067 21.1 5.8 29 86-114 55-83 (90)
234 PF02529 PetG: Cytochrome B6-F 23.6 1.7E+02 0.0038 18.8 3.7 25 91-115 6-32 (37)
235 PF04202 Mfp-3: Foot protein 3 23.3 52 0.0011 24.0 1.4 11 96-106 8-18 (71)
236 COG3630 OadG Na+-transporting 23.1 1.8E+02 0.0038 22.1 4.3 20 91-110 15-34 (84)
237 TIGR02507 MtrF tetrahydrometha 23.1 1.4E+02 0.003 21.6 3.6 19 87-105 42-60 (65)
238 cd00930 Cyt_c_Oxidase_VIII Cyt 23.1 1.5E+02 0.0032 19.7 3.5 23 91-113 16-38 (43)
239 PF05337 CSF-1: Macrophage col 23.1 28 0.0006 32.0 0.0 14 103-116 241-254 (285)
240 PHA03286 envelope glycoprotein 23.1 1.2E+02 0.0025 30.0 4.2 9 98-106 401-409 (492)
241 KOG1025|consensus 22.9 1.1E+02 0.0024 32.8 4.2 8 58-65 571-578 (1177)
242 PF13974 YebO: YebO-like prote 22.8 94 0.002 23.4 2.8 13 96-108 5-17 (80)
243 PF07438 DUF1514: Protein of u 22.7 84 0.0018 22.8 2.4 10 92-101 5-14 (66)
244 PRK11901 hypothetical protein; 22.7 67 0.0015 30.2 2.4 21 87-108 39-59 (327)
245 PF01372 Melittin: Melittin; 22.6 1.8E+02 0.0039 17.2 3.4 6 109-114 19-24 (26)
246 PF06692 MNSV_P7B: Melon necro 22.6 93 0.002 22.0 2.5 6 100-105 18-23 (61)
247 PRK05886 yajC preprotein trans 22.5 1.3E+02 0.0028 23.8 3.7 10 112-121 26-35 (109)
248 PRK03814 oxaloacetate decarbox 22.4 1.8E+02 0.0039 21.9 4.3 8 91-98 16-23 (85)
249 PHA03291 envelope glycoprotein 22.1 46 0.00099 31.7 1.2 30 89-118 290-319 (401)
250 COG0598 CorA Mg2+ and Co2+ tra 22.0 87 0.0019 28.6 3.1 20 97-116 301-320 (322)
251 PF13120 DUF3974: Domain of un 21.9 79 0.0017 24.8 2.3 20 98-117 14-33 (126)
252 PF11857 DUF3377: Domain of un 21.9 36 0.00077 25.3 0.4 11 110-120 52-62 (74)
253 PF14017 DUF4233: Protein of u 21.8 2.6E+02 0.0057 21.8 5.3 21 101-121 84-104 (107)
254 PTZ00234 variable surface prot 21.7 57 0.0012 31.7 1.8 14 97-110 370-383 (433)
255 COG4736 CcoQ Cbb3-type cytochr 21.6 1.9E+02 0.004 20.6 4.0 32 91-122 7-38 (60)
256 PF07178 TraL: TraL protein; 21.5 1.4E+02 0.0031 22.5 3.7 23 98-120 42-64 (95)
257 PF11085 YqhR: Conserved membr 21.5 1.7E+02 0.0036 25.1 4.4 28 87-114 67-94 (173)
258 PF05961 Chordopox_A13L: Chord 21.4 1.2E+02 0.0025 22.2 2.9 9 108-116 19-27 (68)
259 TIGR01495 ETRAMP Plasmodium ri 21.4 81 0.0018 23.8 2.2 13 97-109 62-74 (85)
260 PF07226 DUF1422: Protein of u 21.4 99 0.0021 24.9 2.8 20 97-116 96-115 (117)
261 PF05827 ATP-synt_S1: Vacuolar 21.1 1E+02 0.0022 27.4 3.2 23 88-110 257-279 (282)
262 COG3105 Uncharacterized protei 20.8 1.6E+02 0.0035 24.2 4.0 13 87-99 8-20 (138)
263 PF13706 PepSY_TM_3: PepSY-ass 20.8 1.8E+02 0.0039 18.2 3.5 12 92-103 11-22 (37)
264 PRK09546 zntB zinc transporter 20.6 1.1E+02 0.0023 27.9 3.3 10 106-115 312-321 (324)
265 PRK02919 oxaloacetate decarbox 20.6 2.7E+02 0.0058 20.8 4.9 6 91-96 15-20 (82)
266 PRK04778 septation ring format 20.6 1.4E+02 0.003 29.7 4.3 22 91-112 6-27 (569)
267 PF10251 PEN-2: Presenilin enh 20.5 2.8E+02 0.006 21.4 5.1 28 87-114 50-77 (94)
268 PRK11056 hypothetical protein; 20.5 98 0.0021 25.0 2.6 21 97-117 96-116 (120)
269 PF10855 DUF2648: Protein of u 20.4 73 0.0016 20.0 1.5 15 102-116 8-22 (33)
270 PF15069 FAM163: FAM163 family 20.4 1E+02 0.0022 25.6 2.8 27 88-114 10-37 (143)
271 PF02285 COX8: Cytochrome oxid 20.2 2.6E+02 0.0057 18.6 4.2 21 91-111 16-36 (44)
272 PF02937 COX6C: Cytochrome c o 20.1 2.7E+02 0.0058 20.4 4.7 11 95-105 22-32 (73)
273 PF04341 DUF485: Protein of un 20.1 1.6E+02 0.0034 22.1 3.6 20 96-115 60-79 (91)
274 PF13955 Fst_toxin: Toxin Fst, 20.1 1.7E+02 0.0036 16.7 2.7 11 90-100 4-14 (21)
275 PRK10905 cell division protein 20.1 67 0.0014 30.2 1.8 16 92-107 5-20 (328)
No 1
>KOG1219|consensus
Probab=99.43 E-value=5e-13 Score=141.87 Aligned_cols=170 Identities=21% Similarity=0.381 Sum_probs=99.0
Q ss_pred CCceeeCCCCCCC------------CCCCCCCEEEeeccCCCCceEeecCCCCCCCCCcccC----C-------------
Q psy12432 22 TFHTYACPPTYAT------------WYCLNGATCFTVKIGESLLYNCECADGYMGQRCEFKD----L------------- 72 (229)
Q Consensus 22 ~gy~C~C~~gy~g------------npClNGGtC~~~~~~~~~~~sC~C~~Gy~G~rCe~~~----~------------- 72 (229)
.||.|+|+.-|.| +||++||+|.... ++|.|.|+.||+|.|||... .
T Consensus 3884 ggy~CkCpsqysG~~CEi~~epC~snPC~~GgtCip~~----n~f~CnC~~gyTG~~Ce~~Gi~eCs~n~C~~gg~C~n~ 3959 (4289)
T KOG1219|consen 3884 GGYKCKCPSQYSGNHCEIDLEPCASNPCLTGGTCIPFY----NGFLCNCPNGYTGKRCEARGISECSKNVCGTGGQCINI 3959 (4289)
T ss_pred CceEEeCcccccCcccccccccccCCCCCCCCEEEecC----CCeeEeCCCCccCceeecccccccccccccCCceeecc
Confidence 3899999999997 5999999998763 37999999999999999872 1
Q ss_pred CCCcc----cc-c----------chhhhHHHHH--hhhHHHHHHHHHHHHHHHHHHHHHhhhhhhhccccccCCCCCCCC
Q psy12432 73 DGSYL----PS-R----------KQVMLETASI--ASGASIAVFLVVILCFSLYVHCQRRKKQAQAASVCCTDGPGSSLQ 135 (229)
Q Consensus 73 ~~~~~----p~-~----------~~~~~~~~~I--a~gI~V~lLLVlii~vvly~~~kRrk~~k~~~~~~~~d~~~~~~~ 135 (229)
.|+|+ +. . ......|.+. +++|+|+|++ ++++++++|+++|++.+|+-...+-+|--
T Consensus 3960 ~gsf~CncT~g~~gr~c~~~~pni~~~~~~~gkaEli~I~V~l~~-ifilvvlf~~crKk~~rkky~~~~~~~ll----- 4033 (4289)
T KOG1219|consen 3960 PGSFHCNCTPGILGRTCCAEKPNILSTVLWLGKAELIIIIVLLAL-IFILVVLFWKCRKKNSRKKYGAHCPVDLL----- 4033 (4289)
T ss_pred CCceEeccChhHhcccCccccCccccccchhcccceeehhHHHHH-HHHHHHHHHhhhhhccCCcccccCChhHH-----
Confidence 11121 10 0 0112233222 2334444433 33444466666666555544333322210
Q ss_pred CCCCCCccCCCCcceeeeeccccCCCCCCCCCccc--ccCCCCCCcCCCccccCCCCCC-CCCCCCCCCCCCCCCC
Q psy12432 136 RPRMPFERRPSPADFVLTRITTEAPRAADTRTSIT--ITGKGDSVSASQLYHQSSPPLP-HMSHPPSCTPLPPAPS 208 (229)
Q Consensus 136 ~~~~~~~~~~~p~~~~~~~~~~e~~~~~~~~~~ld--~~~~g~~~~~~~~f~~~~~~~~-~~~~~~~~~~~~~~~~ 208 (229)
++++=....|++-----|++...+|||| ..+.+..++||.+|++++..+. .-|-+|--.|++|--|
T Consensus 4034 -------~rn~nVy~~v~PqvplrP~sytsrnNld~n~~s~ssv~~e~~Tf~Pes~~r~vvcSvapnlp~pspc~S 4102 (4289)
T KOG1219|consen 4034 -------RRNTNVYVPVVPQVPLRPRSYTSRNNLDSNFISTSSVPTELVTFRPESQVRPVVCSVAPNLPPPSPCGS 4102 (4289)
T ss_pred -------hcCCCccccCCCCCCcCCcccccccccCcCCcccCCccccccccCCCccccceeeccCCCCCCCCCCCC
Confidence 0111111111110001133444789999 8899999999999999999877 5555555444443333
No 2
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=99.34 E-value=1.4e-12 Score=104.71 Aligned_cols=90 Identities=27% Similarity=0.449 Sum_probs=58.7
Q ss_pred ceeeCCCCCCCCCCCCCCEEEeeccCCCCceEeecCCCCCCCCCcccCCCCCcccccchhhhHHHHHhhhHHHHHHHHHH
Q psy12432 24 HTYACPPTYATWYCLNGATCFTVKIGESLLYNCECADGYMGQRCEFKDLDGSYLPSRKQVMLETASIASGASIAVFLVVI 103 (229)
Q Consensus 24 y~C~C~~gy~gnpClNGGtC~~~~~~~~~~~sC~C~~Gy~G~rCe~~~~~~~~~p~~~~~~~~~~~Ia~gI~V~lLLVli 103 (229)
.+-.|++.|+ +||+|| +|+.+... ..+.|+|..||+|+|||+.++. .|.+..+.... +..|+..+++++|++++
T Consensus 41 ~i~~Cp~ey~-~YClHG-~C~yI~dl--~~~~CrC~~GYtGeRCEh~dLl-~~~~~~k~n~~-t~Yia~~~il~il~~i~ 114 (139)
T PHA03099 41 AIRLCGPEGD-GYCLHG-DCIHARDI--DGMYCRCSHGYTGIRCQHVVLV-DYQRSEKPNTT-TSYIPSPGIVLVLVGII 114 (139)
T ss_pred ccccCChhhC-CEeECC-EEEeeccC--CCceeECCCCcccccccceeee-eeeccccccch-hhhhhhhHHHHHHHHHH
Confidence 3789999996 899997 89988643 3799999999999999998753 34443333322 34555444455555444
Q ss_pred HHHHHHHHHHHhhhhh
Q psy12432 104 LCFSLYVHCQRRKKQA 119 (229)
Q Consensus 104 i~vvly~~~kRrk~~k 119 (229)
+.+++++.++..||+|
T Consensus 115 is~~~~~~yr~~r~~~ 130 (139)
T PHA03099 115 ITCCLLSVYRFTRRTK 130 (139)
T ss_pred HHHHHHhhheeeeccc
Confidence 4444444444433333
No 3
>PF02158 Neuregulin: Neuregulin family; InterPro: IPR002154 Neuregulins are a sub-family of EGF-like molecules that have been shown to play multiple essential roles in vertebrate embryogenesis including: cardiac development, Schwann cell and oligodendrocyte differentiation, some aspects of neuronal development, as well as the formation of neuromuscular synapses [, ]. Included in the family are heregulin; neu differentiation factor; acetylcholine receptor synthesis stimulator; glial growth factor; and sensory and motor-neuron derived factor []. Multiple family members are generated by alternate splicing or by use of several cell type-specific transcription initiation sites. In general, they bind to and activate the erbB family of receptor tyrosine kinases (erbB2 (HER2), erbB3 (HER3), and erbB4 (HER4)), functioning both as heterodimers and homodimers. The transmembrane forms of neuregulin 1 (NRG1) are present within synaptic vesicles, including those containing glutamate []. After exocytosis, NRG1 is in the presynaptic membrane, where the ectodomain of NRG1 may be cleaved off. The ectodomain then migrates across the synaptic cleft and binds to and activates a member of the EGF-receptor family on the postsynaptic membrane. This has been shown to increase the expression of certain glutamate-receptor subunits. NRG1 appears to signal for glutamate-receptor subunit expression, localisation, and /or phosphorylation facilitating subsequent glutamate transmission. The NRG1 gene has been identified as a potential gene determining susceptibility to schizophrenia by a combination of genetic linkage and association approaches []. ; GO: 0005102 receptor binding, 0009790 embryo development; PDB: 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=98.76 E-value=1.4e-09 Score=101.16 Aligned_cols=83 Identities=16% Similarity=0.283 Sum_probs=0.4
Q ss_pred ccchhhhHHHHHhhhHHHHHHHHHHHHHH-HHHHHHHhhhhhhh--cc-------c--cccCCCCCC--CCC-CCCCC--
Q psy12432 79 SRKQVMLETASIASGASIAVFLVVILCFS-LYVHCQRRKKQAQA--AS-------V--CCTDGPGSS--LQR-PRMPF-- 141 (229)
Q Consensus 79 ~~~~~~~~~~~Ia~gI~V~lLLVlii~vv-ly~~~kRrk~~k~~--~~-------~--~~~d~~~~~--~~~-~~~~~-- 141 (229)
++++|++ |.||||+||||+|+|++ +||+.||++|+++. .+ + .+.+||++. ..+ +++||
T Consensus 5 YQkrVLT-----ITgIcvaLlVVGi~Cvv~aYCKTKKQRkklh~hLkqs~~~~~~n~~n~angp~~p~n~pp~e~vQlvn 79 (404)
T PF02158_consen 5 YQKRVLT-----ITGICVALLVVGIVCVVDAYCKTKKQRKKLHEHLKQSQRSKNYNLRNLANGPNHPRNPPPQEEVQLVN 79 (404)
T ss_dssp S-------------------------------------------------------------------------------
T ss_pred hhhhhhh-----hhhhhHHHHHHHHHHHHHHHHHhHHHHHHHHHHHhhhcccchhhhhhhhcCCCCCCCCCccccccccc
Confidence 4566665 88999999999999999 99999999888753 22 2 246777653 444 88886
Q ss_pred --ccCCCCcceeeeeccccCCCCCCCC
Q psy12432 142 --ERRPSPADFVLTRITTEAPRAADTR 166 (229)
Q Consensus 142 --~~~~~p~~~~~~~~~~e~~~~~~~~ 166 (229)
.+|+++++++|++..+|..|.+...
T Consensus 80 ~y~SKn~~~t~~v~~~~tEtSFs~Sh~ 106 (404)
T PF02158_consen 80 QYISKNVRHTEHVIEKETETSFSGSHS 106 (404)
T ss_dssp ---------------------------
T ss_pred cccccCCCCchhhhccccccccCCCCc
Confidence 6899999999999999999977553
No 4
>PHA02887 EGF-like protein; Provisional
Probab=98.43 E-value=1.5e-07 Score=74.81 Aligned_cols=60 Identities=20% Similarity=0.417 Sum_probs=48.0
Q ss_pred CCCCCCCCCcCCCCCCceeeCCCCCCCCCCCCCCEEEeeccCCCCceEeecCCCCCCCCCcccC
Q psy12432 8 PKPRPPSPTPRPNITFHTYACPPTYATWYCLNGATCFTVKIGESLLYNCECADGYMGQRCEFKD 71 (229)
Q Consensus 8 ~~~~~~~~~~~~~~~gy~C~C~~gy~gnpClNGGtC~~~~~~~~~~~sC~C~~Gy~G~rCe~~~ 71 (229)
++-.+-.-+.+-.+.+|+-+|++.|. +||+| |+|...... ..+.|+|..||+|.||+...
T Consensus 66 y~en~~~~~~~rk~~~hf~pC~~eyk-~YCiH-G~C~yI~dL--~epsCrC~~GYtG~RCE~vs 125 (126)
T PHA02887 66 YKENANAQNFKRKNSMFFEKCKNDFN-DFCIN-GECMNIIDL--DEKFCICNKGYTGIRCDEVS 125 (126)
T ss_pred hhccccccchhhccccCccccChHhh-CEeeC-CEEEccccC--CCceeECCCCcccCCCCccc
Confidence 44445555667777899999999995 89997 599987643 37999999999999999763
No 5
>KOG1219|consensus
Probab=98.21 E-value=1.7e-06 Score=93.89 Aligned_cols=96 Identities=22% Similarity=0.352 Sum_probs=65.3
Q ss_pred CcCCCCCCceeeCCCCCCC-------------CCCCCCCEEEeeccCCCCceEeecCCCCCCCCCcccCCCCCcccccch
Q psy12432 16 TPRPNITFHTYACPPTYAT-------------WYCLNGATCFTVKIGESLLYNCECADGYMGQRCEFKDLDGSYLPSRKQ 82 (229)
Q Consensus 16 ~~~~~~~gy~C~C~~gy~g-------------npClNGGtC~~~~~~~~~~~sC~C~~Gy~G~rCe~~~~~~~~~p~~~~ 82 (229)
|+.+...++.|+|+.||.| |.|.|||.|.++. +.|.|.|..||.|+.|...... .+.....
T Consensus 3916 tCip~~n~f~CnC~~gyTG~~Ce~~Gi~eCs~n~C~~gg~C~n~~----gsf~CncT~g~~gr~c~~~~pn-i~~~~~~- 3989 (4289)
T KOG1219|consen 3916 TCIPFYNGFLCNCPNGYTGKRCEARGISECSKNVCGTGGQCINIP----GSFHCNCTPGILGRTCCAEKPN-ILSTVLW- 3989 (4289)
T ss_pred EEEecCCCeeEeCCCCccCceeecccccccccccccCCceeeccC----CceEeccChhHhcccCccccCc-cccccch-
Confidence 6677778899999999996 6899999999975 3699999999999999865432 1111111
Q ss_pred hhh-HHHHHhhhHHHHHHHHHHHHHHHHHHHHHhhhhh
Q psy12432 83 VML-ETASIASGASIAVFLVVILCFSLYVHCQRRKKQA 119 (229)
Q Consensus 83 ~~~-~~~~Ia~gI~V~lLLVlii~vvly~~~kRrk~~k 119 (229)
+++ |..+|++ +++++++++. ++|+|.++.++|+..
T Consensus 3990 ~gkaEli~I~V-~l~~ifilvv-lf~~crKk~~rkky~ 4025 (4289)
T KOG1219|consen 3990 LGKAELIIIIV-LLALIFILVV-LFWKCRKKNSRKKYG 4025 (4289)
T ss_pred hcccceeehhH-HHHHHHHHHH-HHHhhhhhccCCccc
Confidence 122 3444444 4444444444 667788877666554
No 6
>KOG4289|consensus
Probab=97.87 E-value=1.5e-05 Score=84.15 Aligned_cols=53 Identities=28% Similarity=0.654 Sum_probs=44.3
Q ss_pred CCCceeeCCCCCCC------------CCCCCCCEEEeeccCCCCceEeecCCCCCCCCCcccCCCCCcc
Q psy12432 21 ITFHTYACPPTYAT------------WYCLNGATCFTVKIGESLLYNCECADGYMGQRCEFKDLDGSYL 77 (229)
Q Consensus 21 ~~gy~C~C~~gy~g------------npClNGGtC~~~~~~~~~~~sC~C~~Gy~G~rCe~~~~~~~~~ 77 (229)
+.|..|+||+||.| +||-|+|.|...+ ++|+|.|..||+|++||.....+..+
T Consensus 1219 vnglrCrCPpGFTgd~CeTeiDlCYs~pC~nng~C~srE----ggYtCeCrpg~tGehCEvs~~agrCv 1283 (2531)
T KOG4289|consen 1219 VNGLRCRCPPGFTGDYCETEIDLCYSGPCGNNGRCRSRE----GGYTCECRPGFTGEHCEVSARAGRCV 1283 (2531)
T ss_pred cCceeEeCCCCCCcccccchhHhhhcCCCCCCCceEEec----CceeEEecCCccccceeeecccCccc
Confidence 34899999999997 4899999999875 37999999999999999876544433
No 7
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=97.73 E-value=3e-05 Score=48.07 Aligned_cols=29 Identities=45% Similarity=0.953 Sum_probs=25.3
Q ss_pred CCCCCCCEEEeeccCCCCceEeecCCCCCCCC
Q psy12432 35 WYCLNGATCFTVKIGESLLYNCECADGYMGQR 66 (229)
Q Consensus 35 npClNGGtC~~~~~~~~~~~sC~C~~Gy~G~r 66 (229)
+||.|||+|.... ...|.|.|+.||.|.+
T Consensus 4 ~~C~n~g~C~~~~---~~~y~C~C~~G~~G~~ 32 (32)
T PF00008_consen 4 NPCQNGGTCIDLP---GGGYTCECPPGYTGKR 32 (32)
T ss_dssp TSSTTTEEEEEES---TSEEEEEEBTTEESTT
T ss_pred CcCCCCeEEEeCC---CCCEEeECCCCCccCC
Confidence 6999999999875 2379999999999975
No 8
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=96.76 E-value=0.0018 Score=40.10 Aligned_cols=30 Identities=47% Similarity=1.022 Sum_probs=25.7
Q ss_pred CCCCCCCEEEeeccCCCCceEeecCCCCC-CCCCc
Q psy12432 35 WYCLNGATCFTVKIGESLLYNCECADGYM-GQRCE 68 (229)
Q Consensus 35 npClNGGtC~~~~~~~~~~~sC~C~~Gy~-G~rCe 68 (229)
++|.++|+|.+.. ..|.|.|+.||. |.+|+
T Consensus 9 ~~C~~~~~C~~~~----g~~~C~C~~g~~~g~~C~ 39 (39)
T smart00179 9 NPCQNGGTCVNTV----GSYRCECPPGYTDGRNCE 39 (39)
T ss_pred CCcCCCCEeECCC----CCeEeECCCCCccCCcCC
Confidence 5899999999764 369999999999 98885
No 9
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=96.46 E-value=0.0037 Score=38.02 Aligned_cols=30 Identities=43% Similarity=0.985 Sum_probs=25.4
Q ss_pred CCCCCCCEEEeeccCCCCceEeecCCCCCCCCCc
Q psy12432 35 WYCLNGATCFTVKIGESLLYNCECADGYMGQRCE 68 (229)
Q Consensus 35 npClNGGtC~~~~~~~~~~~sC~C~~Gy~G~rCe 68 (229)
.+|.+++.|.... ..|.|.|..+|.|.+|+
T Consensus 9 ~~C~~~~~C~~~~----~~~~C~C~~g~~g~~C~ 38 (38)
T cd00054 9 NPCQNGGTCVNTV----GSYRCSCPPGYTGRNCE 38 (38)
T ss_pred CCcCCCCEeECCC----CCeEeECCCCCcCCcCC
Confidence 5799999998763 36999999999998885
No 10
>KOG4289|consensus
Probab=96.22 E-value=0.0048 Score=66.13 Aligned_cols=52 Identities=29% Similarity=0.577 Sum_probs=44.1
Q ss_pred CcCCCCCCceeeCCCCCCC--------------CCCCCCCEEEeeccCCCCceEeecCCC-CCCCCCccc
Q psy12432 16 TPRPNITFHTYACPPTYAT--------------WYCLNGATCFTVKIGESLLYNCECADG-YMGQRCEFK 70 (229)
Q Consensus 16 ~~~~~~~gy~C~C~~gy~g--------------npClNGGtC~~~~~~~~~~~sC~C~~G-y~G~rCe~~ 70 (229)
+++..-.||.|.|.+||.| .+|.|||+|.+.. ..++-|.|+.| |.+.+|+..
T Consensus 1252 ~C~srEggYtCeCrpg~tGehCEvs~~agrCvpGvC~nggtC~~~~---nggf~c~Cp~ge~e~prC~v~ 1318 (2531)
T KOG4289|consen 1252 RCRSREGGYTCECRPGFTGEHCEVSARAGRCVPGVCKNGGTCVNLL---NGGFCCHCPYGEFEDPRCEVT 1318 (2531)
T ss_pred ceEEecCceeEEecCCccccceeeecccCccccceecCCCEEeecC---CCceeccCCCcccCCCceEEE
Confidence 5666667899999999997 4899999999875 34799999997 999999975
No 11
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=96.14 E-value=0.008 Score=35.86 Aligned_cols=30 Identities=43% Similarity=0.956 Sum_probs=25.4
Q ss_pred CCCCCCCEEEeeccCCCCceEeecCCCCCCC-CCc
Q psy12432 35 WYCLNGATCFTVKIGESLLYNCECADGYMGQ-RCE 68 (229)
Q Consensus 35 npClNGGtC~~~~~~~~~~~sC~C~~Gy~G~-rCe 68 (229)
++|.+++.|.... ..|.|.|+.||.|. .|+
T Consensus 6 ~~C~~~~~C~~~~----~~~~C~C~~g~~g~~~C~ 36 (36)
T cd00053 6 NPCSNGGTCVNTP----GSYRCVCPPGYTGDRSCE 36 (36)
T ss_pred CCCCCCCEEecCC----CCeEeECCCCCcccCCcC
Confidence 6899999999764 36999999999998 774
No 12
>PF12955 DUF3844: Domain of unknown function (DUF3844); InterPro: IPR024382 This presumed domain is found in fungal species. It contains 8 largely conserved cysteine residues. This domain is found in proteins thought to be located in the endoplasmic reticulum.
Probab=96.01 E-value=0.018 Score=45.17 Aligned_cols=38 Identities=21% Similarity=0.490 Sum_probs=28.0
Q ss_pred CCCCCCCEEEeeccCC-CCceEeecCC-------------CCCCCCCcccCC
Q psy12432 35 WYCLNGATCFTVKIGE-SLLYNCECAD-------------GYMGQRCEFKDL 72 (229)
Q Consensus 35 npClNGGtC~~~~~~~-~~~~sC~C~~-------------Gy~G~rCe~~~~ 72 (229)
|-|.+.|.|....... ..=|.|.|.. .|.|.-|+.++.
T Consensus 13 n~CsgHG~C~~~~~~~~~~C~~C~C~~T~~~~~~~~~ktt~W~G~aCqKkDv 64 (103)
T PF12955_consen 13 NNCSGHGSCVKKYGSGGGDCFACKCKPTVVKTGSGKGKTTHWGGPACQKKDV 64 (103)
T ss_pred cCCCCCceEeeccCCCccceEEEEeeccccccccccCceeeecccccccccc
Confidence 6899999998763211 1249999988 488888998875
No 13
>PF02439 Adeno_E3_CR2: Adenovirus E3 region protein CR2; InterPro: IPR003470 Early region 3 (E3) of human adenoviruses (Ads) codes for proteins that appear to control viral interactions with the host []. This region called CR1 (conserved region 1) [] is found three times in Human adenovirus 19 (a subgroup D adenovirus) 49 kDa protein in the E3 region. CR1 is also found in the 20.1 Kd protein of subgroup B adenoviruses. The function of this 80 amino acid region is unknown. This region is probably a divergent immunoglobulin domain.
Probab=95.78 E-value=0.014 Score=37.87 Aligned_cols=21 Identities=19% Similarity=0.403 Sum_probs=11.8
Q ss_pred HHHHHhhhHHHHHHHHHHHHH
Q psy12432 86 ETASIASGASIAVFLVVILCF 106 (229)
Q Consensus 86 ~~~~Ia~gI~V~lLLVlii~v 106 (229)
++.+|++|+.+++.+++|..+
T Consensus 4 s~IaIIv~V~vg~~iiii~~~ 24 (38)
T PF02439_consen 4 STIAIIVAVVVGMAIIIICMF 24 (38)
T ss_pred chhhHHHHHHHHHHHHHHHHH
Confidence 356667666666554444443
No 14
>smart00181 EGF Epidermal growth factor-like domain.
Probab=95.19 E-value=0.035 Score=33.77 Aligned_cols=29 Identities=48% Similarity=1.078 Sum_probs=23.9
Q ss_pred CCCCCCCEEEeeccCCCCceEeecCCCCCC-CCCc
Q psy12432 35 WYCLNGATCFTVKIGESLLYNCECADGYMG-QRCE 68 (229)
Q Consensus 35 npClNGGtC~~~~~~~~~~~sC~C~~Gy~G-~rCe 68 (229)
++|.++ .|.... ..|.|.|..||.| ..|+
T Consensus 6 ~~C~~~-~C~~~~----~~~~C~C~~g~~g~~~C~ 35 (35)
T smart00181 6 GPCSNG-TCINTP----GSYTCSCPPGYTGDKRCE 35 (35)
T ss_pred CCCCCC-EEECCC----CCeEeECCCCCccCCccC
Confidence 589998 998763 3799999999999 7774
No 15
>PF14575 EphA2_TM: Ephrin type-A receptor 2 transmembrane domain; PDB: 3KUL_A 2XVD_A 2VX1_A 2VWV_A 2VX0_A 2VWY_A 2VWZ_A 2VWW_A 2VWU_A 2VWX_A ....
Probab=95.17 E-value=0.0094 Score=43.98 Aligned_cols=12 Identities=33% Similarity=0.382 Sum_probs=7.6
Q ss_pred eeeccccCCCCC
Q psy12432 152 LTRITTEAPRAA 163 (229)
Q Consensus 152 ~~~~~~e~~~~~ 163 (229)
+=+++||||-++
T Consensus 54 IDP~TYEDP~qA 65 (75)
T PF14575_consen 54 IDPHTYEDPNQA 65 (75)
T ss_dssp --GGGSSSHHHH
T ss_pred cCcccccCHHHH
Confidence 557889998543
No 16
>PF01102 Glycophorin_A: Glycophorin A; InterPro: IPR001195 Proteins in this group are responsible for the molecular basis of the blood group antigens, surface markers on the outside of the red blood cell membrane. Most of these markers are proteins, but some are carbohydrates attached to lipids or proteins [Reid M.E., Lomas-Francis C. The Blood Group Antigen FactsBook Academic Press, London / San Diego, (1997)]. Glycophorin A (PAS-2) and glycophorin B (PAS-3) belong to the MNS blood group system and are associated with antigens that include M/N, S/s, U, He, Mi(a), M(c), Vw, Mur, M(g), Vr, M(e), Mt(a), St(a), Ri(a), Cl(a), Ny(a), Hut, Hil, M(v), Far, Mit, Dantu, Hop, Nob, En(a), ENKT, amongst others. Glycophorin A is the major sialoglycoprotein of the erythrocyte membrane []. Structurally, glycophorin A consists of an N-terminal extracellular domain, heavily glycosylated on serine and threonine residues, followed by a transmembrane region and a C-terminal cytoplasmic domain. Other glycophorins in this entry such as Glycophorin B and Glycophorin E represent minor sialoglycoproteins in the erythrocyte membrane.; GO: 0016021 integral to membrane; PDB: 2KPF_B 1AFO_B 2KPE_A.
Probab=94.78 E-value=0.036 Score=44.72 Aligned_cols=29 Identities=28% Similarity=0.250 Sum_probs=13.5
Q ss_pred HHHHhhhHHHHHHHHHHHHHHHHHHHHHhhh
Q psy12432 87 TASIASGASIAVFLVVILCFSLYVHCQRRKK 117 (229)
Q Consensus 87 ~~~Ia~gI~V~lLLVlii~vvly~~~kRrk~ 117 (229)
+++|++|+..++++ ++++++|+.+++|||
T Consensus 66 i~~Ii~gv~aGvIg--~Illi~y~irR~~Kk 94 (122)
T PF01102_consen 66 IIGIIFGVMAGVIG--IILLISYCIRRLRKK 94 (122)
T ss_dssp HHHHHHHHHHHHHH--HHHHHHHHHHHHS--
T ss_pred eeehhHHHHHHHHH--HHHHHHHHHHHHhcc
Confidence 45566666555533 333445554444444
No 17
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=94.69 E-value=0.049 Score=33.83 Aligned_cols=27 Identities=33% Similarity=0.813 Sum_probs=23.2
Q ss_pred CCCCCCCEEEeeccCCCCceEeecCCCCCCCCC
Q psy12432 35 WYCLNGATCFTVKIGESLLYNCECADGYMGQRC 67 (229)
Q Consensus 35 npClNGGtC~~~~~~~~~~~sC~C~~Gy~G~rC 67 (229)
..|.+.|+|... ...|.|..||.|..|
T Consensus 6 ~~C~~~G~C~~~------~g~C~C~~g~~G~~C 32 (32)
T PF07974_consen 6 NICSGHGTCVSP------CGRCVCDSGYTGPDC 32 (32)
T ss_pred CccCCCCEEeCC------CCEEECCCCCcCCCC
Confidence 579999999853 379999999999887
No 18
>PF12661 hEGF: Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=94.49 E-value=0.017 Score=29.06 Aligned_cols=13 Identities=38% Similarity=1.359 Sum_probs=10.0
Q ss_pred EeecCCCCCCCCC
Q psy12432 55 NCECADGYMGQRC 67 (229)
Q Consensus 55 sC~C~~Gy~G~rC 67 (229)
.|.|+.||+|.+|
T Consensus 1 ~C~C~~G~~G~~C 13 (13)
T PF12661_consen 1 TCQCPPGWTGPNC 13 (13)
T ss_dssp EEEE-TTEETTTT
T ss_pred CccCcCCCcCCCC
Confidence 4888999998887
No 19
>PF14991 MLANA: Protein melan-A; PDB: 2GTZ_F 2GT9_F 3MRO_P 2GUO_C 3MRQ_P 2GTW_C 3L6F_C 3MRP_P.
Probab=94.01 E-value=0.0097 Score=47.35 Aligned_cols=31 Identities=26% Similarity=0.346 Sum_probs=2.1
Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHhhhhhhh
Q psy12432 91 ASGASIAVFLVVILCFSLYVHCQRRKKQAQA 121 (229)
Q Consensus 91 a~gI~V~lLLVlii~vvly~~~kRrk~~k~~ 121 (229)
|+||+++++|+.|++++-||++|||..=|.+
T Consensus 26 AaGIGiL~VILgiLLliGCWYckRRSGYk~L 56 (118)
T PF14991_consen 26 AAGIGILIVILGILLLIGCWYCKRRSGYKTL 56 (118)
T ss_dssp --SSS--------------------------
T ss_pred hccceeHHHHHHHHHHHhheeeeecchhhhh
Confidence 6677777777777777777777777555544
No 20
>PF12273 RCR: Chitin synthesis regulation, resistance to Congo red; InterPro: IPR020999 RCR proteins are ER membrane proteins that regulate chitin deposition in fungal cell walls. Although chitin, a linear polymer of beta-1,4-linked N-acetylglucosamine, constitutes only 2% of the cell wall it plays a vital role in the overall protection of the cell wall against stress, noxious chemicals and osmotic pressure changes. Congo red is a cell wall-disrupting benzidine-type dye extensively used in many cell wall mutant studies that specifically targets chitin in yeast cells and inhibits growth. RCR proteins render the yeasts resistant to Congo red by diminishing the content of chitin in the cell wall []. RCR proteins are probably regulating chitin synthase III interact directly with ubiquitin ligase Rsp5, and the VPEY motif is necessary for this, via interaction with the WW domains of Rsp5 [].
Probab=93.98 E-value=0.052 Score=43.55 Aligned_cols=6 Identities=17% Similarity=0.479 Sum_probs=3.1
Q ss_pred ccCCCC
Q psy12432 171 ITGKGD 176 (229)
Q Consensus 171 ~~~~g~ 176 (229)
||..|.
T Consensus 78 Yd~~g~ 83 (130)
T PF12273_consen 78 YDQQGN 83 (130)
T ss_pred CCCCCC
Confidence 555543
No 21
>PF05454 DAG1: Dystroglycan (Dystrophin-associated glycoprotein 1); InterPro: IPR008465 Dystroglycan is one of the dystrophin-associated glycoproteins, which is encoded by a 5.5 kb transcript in Homo sapiens. The protein product is cleaved into two non-covalently associated subunits, [alpha] (N-terminal) and [beta] (C-terminal). In skeletal muscle the dystroglycan complex works as a transmembrane linkage between the extracellular matrix and the cytoskeleton [alpha]-dystroglycan is extracellular and binds to merosin ([alpha]-2 laminin) in the basement membrane, while [beta]-dystroglycan is a transmembrane protein and binds to dystrophin, which is a large rod-like cytoskeletal protein, absent in Duchenne muscular dystrophy patients. Dystrophin binds to intracellular actin cables. In this way, the dystroglycan complex, which links the extracellular matrix to the intracellular actin cables, is thought to provide structural integrity in muscle tissues. The dystroglycan complex is also known to serve as an agrin receptor in muscle, where it may regulate agrin-induced acetylcholine receptor clustering at the neuromuscular junction. There is also evidence which suggests the function of dystroglycan as a part of the signal transduction pathway because it is shown that Grb2, a mediator of the Ras-related signal pathway, can interact with the cytoplasmic domain of dystroglycan. In general, aberrant expression of dystrophin-associated protein complex underlies the pathogenesis of Duchenne muscular dystrophy, Becker muscular dystrophy and severe childhood autosomal recessive muscular dystrophy. Interestingly, no genetic disease has been described for either [alpha]- or [beta]-dystroglycan. Dystroglycan is widely distributed in non-muscle tissues as well as in muscle tissues. During epithelial morphogenesis of kidney, the dystroglycan complex is shown to act as a receptor for the basement membrane. Dystroglycan expression in Mus musculus brain and neural retina has also been reported. However, the physiological role of dystroglycan in non-muscle tissues has remained unclear [].; PDB: 1EG4_P.
Probab=93.43 E-value=0.022 Score=52.17 Aligned_cols=10 Identities=30% Similarity=0.477 Sum_probs=0.0
Q ss_pred CccCCCCcce
Q psy12432 141 FERRPSPADF 150 (229)
Q Consensus 141 ~~~~~~p~~~ 150 (229)
|.+|.+|.-+
T Consensus 186 f~~KGiPvIF 195 (290)
T PF05454_consen 186 FISKGIPVIF 195 (290)
T ss_dssp ----------
T ss_pred HHhcCCceec
Confidence 6666666543
No 22
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=93.14 E-value=0.17 Score=41.31 Aligned_cols=22 Identities=18% Similarity=0.259 Sum_probs=17.7
Q ss_pred CCCCCceeeCCCCCCCCCCCCC
Q psy12432 19 PNITFHTYACPPTYATWYCLNG 40 (229)
Q Consensus 19 ~~~~gy~C~C~~gy~gnpClNG 40 (229)
++...+.|.|..||.|..|++.
T Consensus 62 ~dl~~~~CrC~~GYtGeRCEh~ 83 (139)
T PHA03099 62 RDIDGMYCRCSHGYTGIRCQHV 83 (139)
T ss_pred ccCCCceeECCCCcccccccce
Confidence 3455789999999999888764
No 23
>KOG1214|consensus
Probab=92.96 E-value=0.096 Score=54.07 Aligned_cols=46 Identities=28% Similarity=0.680 Sum_probs=36.8
Q ss_pred CCCceeeCCCCCCC-------------CCCCCCCEEEeeccCCCCceEeecCCCCCCC--CCccc
Q psy12432 21 ITFHTYACPPTYAT-------------WYCLNGATCFTVKIGESLLYNCECADGYMGQ--RCEFK 70 (229)
Q Consensus 21 ~~gy~C~C~~gy~g-------------npClNGGtC~~~~~~~~~~~sC~C~~Gy~G~--rCe~~ 70 (229)
...|+|.|.+||+| +.|.....|++.. +++.|+|..||.|. +|...
T Consensus 806 gs~y~C~CLPGfsGDG~~c~dvDeC~psrChp~A~Cyntp----gsfsC~C~pGy~GDGf~CVP~ 866 (1289)
T KOG1214|consen 806 GSTYSCACLPGFSGDGHQCTDVDECSPSRCHPAATCYNTP----GSFSCRCQPGYYGDGFQCVPD 866 (1289)
T ss_pred CceEEEeecCCccCCccccccccccCccccCCCceEecCC----CcceeecccCccCCCceecCC
Confidence 45589999999997 3677788999864 47999999999987 45544
No 24
>PF13908 Shisa: Wnt and FGF inhibitory regulator
Probab=92.61 E-value=0.26 Score=41.39 Aligned_cols=14 Identities=21% Similarity=0.330 Sum_probs=7.3
Q ss_pred HHHhhhHHHHHHHH
Q psy12432 88 ASIASGASIAVFLV 101 (229)
Q Consensus 88 ~~Ia~gI~V~lLLV 101 (229)
+.|++||+++|++|
T Consensus 78 ~~iivgvi~~Vi~I 91 (179)
T PF13908_consen 78 TGIIVGVICGVIAI 91 (179)
T ss_pred eeeeeehhhHHHHH
Confidence 44566665544433
No 25
>PF01034 Syndecan: Syndecan domain; InterPro: IPR001050 The syndecans are transmembrane proteoglycans which are involved in the organisation of cytoskeleton and/or actin microfilaments, and have important roles as cell surface receptors during cell-cell and/or cell-matrix interactions [, ]. Structurally, these proteins consist of four separate domains: A signal sequence; An extracellular domain (ectodomain) of variable length whose sequence is not evolutionary conserved in the various forms of syndecans. The ectodomain contains the sites of attachment of the heparan sulphate glycosaminoglycan side chains; A transmembrane region; A highly conserved cytoplasmic domain of about 30 to 35 residues, which could interact with cytoskeletal proteins. The proteins known to belong to this family are: Syndecan 1. Syndecan 2 or fibroglycan. Syndecan 3 or neuroglycan or N-syndecan. Syndecan 4 or amphiglycan or ryudocan. Drosophila syndecan. Caenorhabditis elegans probable syndecan (F57C7.3). Syndecan-4, a transmembrane heparan sulphate proteoglycan, is a coreceptor with integrins in cell adhesion. It has been suggested to form a ternary signalling complex with protein kinase Calpha and phosphatidylinositol 4,5-bisphosphate (PIP2). Structural studies have demonstrated that the cytoplasmic domain undergoes a conformational transition and forms a symmetric dimer in the presence of phospholipid activator PIP2, and whose overall structure in solution exhibits a twisted clamp shape having a cavity in the centre of dimeric interface. In addition, it has been observed that the syndecan-4 variable domain interacts, strongly, not only with fatty acyl groups but also the anionic head group of PIP2. These findings indicate that PIP2 promotes oligomerisation of the syndecan-4 cytoplasmic domain for transmembrane signalling and cell-matrix adhesion [, ].; GO: 0008092 cytoskeletal protein binding, 0016020 membrane; PDB: 1EJQ_B 1EJP_B 1YBO_C 1OBY_Q.
Probab=91.33 E-value=0.051 Score=39.11 Aligned_cols=19 Identities=16% Similarity=0.585 Sum_probs=0.0
Q ss_pred HHHhhhHHHHHHHHHHHHH
Q psy12432 88 ASIASGASIAVFLVVILCF 106 (229)
Q Consensus 88 ~~Ia~gI~V~lLLVlii~v 106 (229)
+++++|+++++++++++++
T Consensus 12 aavIaG~Vvgll~ailLIl 30 (64)
T PF01034_consen 12 AAVIAGGVVGLLFAILLIL 30 (64)
T ss_dssp -------------------
T ss_pred HHHHHHHHHHHHHHHHHHH
Confidence 4555555566554444443
No 26
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes []. +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=89.96 E-value=0.37 Score=31.10 Aligned_cols=31 Identities=29% Similarity=0.743 Sum_probs=24.4
Q ss_pred eCCCCCCCCCCCCCCEEEeeccCCCCceEeecCCCCC
Q psy12432 27 ACPPTYATWYCLNGATCFTVKIGESLLYNCECADGYM 63 (229)
Q Consensus 27 ~C~~gy~gnpClNGGtC~~~~~~~~~~~sC~C~~Gy~ 63 (229)
.|..+- +.|.+++.|.+.. ++|.|.|+.||.
T Consensus 4 EC~~~~--~~C~~~~~C~N~~----Gsy~C~C~~Gy~ 34 (42)
T PF07645_consen 4 ECAEGP--HNCPENGTCVNTE----GSYSCSCPPGYE 34 (42)
T ss_dssp TTTTTS--SSSSTTSEEEEET----TEEEEEESTTEE
T ss_pred ccCCCC--CcCCCCCEEEcCC----CCEEeeCCCCcE
Confidence 355532 6898899999985 379999999997
No 27
>PF05393 Hum_adeno_E3A: Human adenovirus early E3A glycoprotein; InterPro: IPR008652 This family consists of several early glycoproteins (E3A), from human adenovirus type 2.; GO: 0016021 integral to membrane
Probab=89.21 E-value=0.5 Score=36.16 Aligned_cols=15 Identities=7% Similarity=0.498 Sum_probs=7.3
Q ss_pred HHHHHHHHHHHHHHH
Q psy12432 96 IAVFLVVILCFSLYV 110 (229)
Q Consensus 96 V~lLLVlii~vvly~ 110 (229)
.++|+++++++++||
T Consensus 41 ~~iFil~VilwfvCC 55 (94)
T PF05393_consen 41 CGIFILLVILWFVCC 55 (94)
T ss_pred HHHHHHHHHHHHHHH
Confidence 344455555554444
No 28
>PF15069 FAM163: FAM163 family
Probab=89.03 E-value=1.4 Score=36.52 Aligned_cols=24 Identities=25% Similarity=0.418 Sum_probs=14.1
Q ss_pred HHHhhhHHHHHHHHHHHHHHHHHH
Q psy12432 88 ASIASGASIAVFLVVILCFSLYVH 111 (229)
Q Consensus 88 ~~Ia~gI~V~lLLVlii~vvly~~ 111 (229)
++|.+||...|+||.||+++.||+
T Consensus 6 vVItGgILAtVILLcIIaVLCYCR 29 (143)
T PF15069_consen 6 VVITGGILATVILLCIIAVLCYCR 29 (143)
T ss_pred EEEechHHHHHHHHHHHHHHHHHh
Confidence 445566655565666666666655
No 29
>PF04478 Mid2: Mid2 like cell wall stress sensor; InterPro: IPR007567 This family represents a region near the C terminus of Mid2, which contains a transmembrane region. The remainder of the protein sequence is serine-rich and of low complexity, and is therefore impossible to align accurately. Mid2 is thought to act as a mechanosensor of cell wall stress. The C-terminal cytoplasmic region of Mid2 is known to interact with Rom2, a guanine nucleotide exchange factor (GEF) for Rho1, which is part of the cell wall integrity signalling pathway [].
Probab=88.76 E-value=0.025 Score=47.20 Aligned_cols=26 Identities=27% Similarity=0.498 Sum_probs=11.3
Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHhhhh
Q psy12432 91 ASGASIAVFLVVILCFSLYVHCQRRKKQ 118 (229)
Q Consensus 91 a~gI~V~lLLVlii~vvly~~~kRrk~~ 118 (229)
++||++++|| ++++++|+++.|+|+.
T Consensus 55 VVGVGg~ill--~il~lvf~~c~r~kkt 80 (154)
T PF04478_consen 55 VVGVGGPILL--GILALVFIFCIRRKKT 80 (154)
T ss_pred EecccHHHHH--HHHHhheeEEEecccC
Confidence 4444444443 3333444444444443
No 30
>PF07204 Orthoreo_P10: Orthoreovirus membrane fusion protein p10; InterPro: IPR009854 This family consists of several Orthoreovirus membrane fusion protein p10 sequences. p10 is thought to be a multifunctional protein that plays a key role in virus-host interaction [].
Probab=88.70 E-value=0.42 Score=36.93 Aligned_cols=32 Identities=13% Similarity=0.364 Sum_probs=13.8
Q ss_pred HHHHhhhHHHHHHHHHHHHHHHHHHHHHhhhhh
Q psy12432 87 TASIASGASIAVFLVVILCFSLYVHCQRRKKQA 119 (229)
Q Consensus 87 ~~~Ia~gI~V~lLLVlii~vvly~~~kRrk~~k 119 (229)
|-+++.|..++| +|+|+++++||+.|+|..++
T Consensus 42 WpyLA~GGG~iL-ilIii~Lv~CC~~K~K~~~~ 73 (98)
T PF07204_consen 42 WPYLAAGGGLIL-ILIIIALVCCCRAKHKTSAA 73 (98)
T ss_pred hHHhhccchhhh-HHHHHHHHHHhhhhhhhHhh
Confidence 445555543333 33334444444444443443
No 31
>KOG2052|consensus
Probab=88.46 E-value=1.3 Score=43.30 Aligned_cols=32 Identities=16% Similarity=0.438 Sum_probs=20.9
Q ss_pred CcCCCCCCceeeCCCCCCC--CCCCCCCEEEeec
Q psy12432 16 TPRPNITFHTYACPPTYAT--WYCLNGATCFTVK 47 (229)
Q Consensus 16 ~~~~~~~gy~C~C~~gy~g--npClNGGtC~~~~ 47 (229)
...++..-+.|.|...... +-|.+||.|+...
T Consensus 27 ~~~~~~~~~~C~C~~~~~~n~~tC~t~g~C~~s~ 60 (513)
T KOG2052|consen 27 SVPPGVQLLKCYCSSCLCSNANTCETDGACFVSV 60 (513)
T ss_pred CCCCCceeEEEEecCCCcCCCCeeeecCeEEEEE
Confidence 3344444688888765443 4688899998653
No 32
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=88.40 E-value=0.35 Score=28.28 Aligned_cols=21 Identities=33% Similarity=0.649 Sum_probs=12.8
Q ss_pred CceeeCCCCCCCCCCCCCCEEEe
Q psy12432 23 FHTYACPPTYATWYCLNGATCFT 45 (229)
Q Consensus 23 gy~C~C~~gy~gnpClNGGtC~~ 45 (229)
+|.|.|++||..++ +|-.|..
T Consensus 1 sy~C~C~~Gy~l~~--d~~~C~D 21 (24)
T PF12662_consen 1 SYTCSCPPGYQLSP--DGRSCED 21 (24)
T ss_pred CEEeeCCCCCcCCC--CCCcccc
Confidence 47788888875332 4555554
No 33
>PF08374 Protocadherin: Protocadherin; InterPro: IPR013585 The structure of protocadherins is similar to that of classic cadherins (IPR002126 from INTERPRO), but they also have some unique features associated with the cytoplasmic domains. They are expressed in a variety of organisms and are found in high concentrations in the brain where they seem to be localised mainly at cell-cell contact sites. Their expression seems to be developmentally regulated [].
Probab=87.40 E-value=1.4 Score=38.93 Aligned_cols=10 Identities=30% Similarity=0.604 Sum_probs=7.1
Q ss_pred ccccCCCCCC
Q psy12432 183 LYHQSSPPLP 192 (229)
Q Consensus 183 ~f~~~~~~~~ 192 (229)
+|.+.+|...
T Consensus 146 ~~~P~sPDLa 155 (221)
T PF08374_consen 146 TFKPDSPDLA 155 (221)
T ss_pred CCCCCCcchh
Confidence 7777777654
No 34
>KOG3516|consensus
Probab=87.02 E-value=0.42 Score=50.96 Aligned_cols=40 Identities=28% Similarity=0.660 Sum_probs=33.0
Q ss_pred CceeeCCCCCCCCCCCCCCEEEeeccCCCCceEeecC-CCCCCCCCccc
Q psy12432 23 FHTYACPPTYATWYCLNGATCFTVKIGESLLYNCECA-DGYMGQRCEFK 70 (229)
Q Consensus 23 gy~C~C~~gy~gnpClNGGtC~~~~~~~~~~~sC~C~-~Gy~G~rCe~~ 70 (229)
+..|+|.+ |+|.|||.|.-. ++ .+.|.|. .||.|..|...
T Consensus 543 ~i~drClP----N~CehgG~C~Qs--~~--~f~C~C~~TGY~GatCHts 583 (1306)
T KOG3516|consen 543 GISDRCLP----NPCEHGGKCSQS--WD--DFECNCELTGYKGATCHTS 583 (1306)
T ss_pred ccccccCC----ccccCCCccccc--cc--ceeEeccccccccccccCC
Confidence 35789988 899999999873 22 6999999 79999988754
No 35
>PF12877 DUF3827: Domain of unknown function (DUF3827); InterPro: IPR024606 The function of the proteins in this entry is not currently known, but one of the human proteins (Q9HCM3 from SWISSPROT) has been implicated in pilocytic astrocytomas [, , ]. In the majority of cases of pilocytic astrocytomas a tandem duplication produces an in-frame fusion of the gene encoding this protein and the BRAF oncogene. The resulting fusion protein has constitutive BRAF kinase activity and is capable of transforming cells.
Probab=86.22 E-value=0.44 Score=47.98 Aligned_cols=29 Identities=28% Similarity=0.508 Sum_probs=18.0
Q ss_pred HhhhHHHHHHHHHHHHHHHHHHHHHhhhh
Q psy12432 90 IASGASIAVFLVVILCFSLYVHCQRRKKQ 118 (229)
Q Consensus 90 Ia~gI~V~lLLVlii~vvly~~~kRrk~~ 118 (229)
|++|+++.|+||++|++++||+.-|++|.
T Consensus 271 II~gVlvPv~vV~~Iiiil~~~LCRk~K~ 299 (684)
T PF12877_consen 271 IIAGVLVPVLVVLLIIIILYWKLCRKNKL 299 (684)
T ss_pred EEehHhHHHHHHHHHHHHHHHHHhccccc
Confidence 34456677777777777777665555443
No 36
>KOG3514|consensus
Probab=86.20 E-value=0.41 Score=50.73 Aligned_cols=34 Identities=29% Similarity=0.700 Sum_probs=28.7
Q ss_pred CCCCCCCCEEEeeccCCCCceEeecCC-CCCCCCCcccC
Q psy12432 34 TWYCLNGATCFTVKIGESLLYNCECAD-GYMGQRCEFKD 71 (229)
Q Consensus 34 gnpClNGGtC~~~~~~~~~~~sC~C~~-Gy~G~rCe~~~ 71 (229)
+|||+|||.|... . +.+.|.|.. +|.|..||...
T Consensus 628 ~nPC~N~g~C~eg--w--NrfiCDCs~T~~~G~~CerE~ 662 (1591)
T KOG3514|consen 628 SNPCQNGGKCSEG--W--NRFICDCSGTGFEGRTCEREA 662 (1591)
T ss_pred CCcccCCCCcccc--c--cccccccccCcccCcccccee
Confidence 4899999999874 3 389999987 79999999753
No 37
>PF00558 Vpu: Vpu protein; InterPro: IPR008187 The Human immunodeficiency virus 1 (HIV-1) Vpu protein acts in the degradation of CD4 in the endoplasmic reticulum and in the enhancement of virion release from the plasma membrane of infected cells [].; GO: 0019076 release of virus from host; PDB: 2JPX_A 1PI8_A 2GOH_A 2GOF_A 1PI7_A 1PJE_A 1VPU_A 2K7Y_A.
Probab=85.42 E-value=0.97 Score=34.03 Aligned_cols=26 Identities=15% Similarity=0.466 Sum_probs=5.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHhhhhhh
Q psy12432 95 SIAVFLVVILCFSLYVHCQRRKKQAQ 120 (229)
Q Consensus 95 ~V~lLLVlii~vvly~~~kRrk~~k~ 120 (229)
+|++++.+++..++|..++|.|++++
T Consensus 13 iv~~iiaIvvW~iv~ieYrk~~rqrk 38 (81)
T PF00558_consen 13 IVALIIAIVVWTIVYIEYRKIKRQRK 38 (81)
T ss_dssp HHHHHHHHHHHHHH------------
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 33333333333344555665555554
No 38
>KOG3607|consensus
Probab=84.81 E-value=1.4 Score=45.26 Aligned_cols=29 Identities=24% Similarity=0.587 Sum_probs=23.3
Q ss_pred CCCCCEEEeeccCCCCceEeecCCCCCCCCCcccCC
Q psy12432 37 CLNGATCFTVKIGESLLYNCECADGYMGQRCEFKDL 72 (229)
Q Consensus 37 ClNGGtC~~~~~~~~~~~sC~C~~Gy~G~rCe~~~~ 72 (229)
|...|+|.+ ...|.|..||.+..|+....
T Consensus 632 C~g~GVCnn-------~~~ChC~~gwapp~C~~~~~ 660 (716)
T KOG3607|consen 632 CNGHGVCNN-------ELNCHCEPGWAPPFCFIFGY 660 (716)
T ss_pred cCCCcccCC-------CcceeeCCCCCCCccccccC
Confidence 666677754 58999999999999998754
No 39
>PF15330 SIT: SHP2-interacting transmembrane adaptor protein, SIT
Probab=84.74 E-value=1.1 Score=35.21 Aligned_cols=7 Identities=14% Similarity=-0.078 Sum_probs=2.7
Q ss_pred HHHHHHH
Q psy12432 107 SLYVHCQ 113 (229)
Q Consensus 107 vly~~~k 113 (229)
++.|+.+
T Consensus 19 l~~wr~~ 25 (107)
T PF15330_consen 19 LLAWRMK 25 (107)
T ss_pred HHHHHHH
Confidence 3334333
No 40
>PF05545 FixQ: Cbb3-type cytochrome oxidase component FixQ; InterPro: IPR008621 This family consists of several Cbb3-type cytochrome oxidase components (FixQ/CcoQ). FixQ is found in nitrogen fixing bacteria. Since nitrogen fixation is an energy-consuming process, effective symbioses depend on operation of a respiratory chain with a high affinity for O2, closely coupled to ATP production. This requirement is fulfilled by a special three-subunit terminal oxidase (cytochrome terminal oxidase cbb3), which was first identified in Bradyrhizobium japonicum as the product of the fixNOQP operon [].
Probab=84.17 E-value=2.6 Score=28.14 Aligned_cols=22 Identities=14% Similarity=0.345 Sum_probs=10.1
Q ss_pred hHHHHHHHHHHHHHHHHHHHHH
Q psy12432 93 GASIAVFLVVILCFSLYVHCQR 114 (229)
Q Consensus 93 gI~V~lLLVlii~vvly~~~kR 114 (229)
++..++++++++.+++|.+++|
T Consensus 12 ~~~~v~~~~~F~gi~~w~~~~~ 33 (49)
T PF05545_consen 12 SIGTVLFFVFFIGIVIWAYRPR 33 (49)
T ss_pred HHHHHHHHHHHHHHHHHHHccc
Confidence 3344444444444445444443
No 41
>KOG1225|consensus
Probab=83.29 E-value=1.3 Score=43.91 Aligned_cols=38 Identities=29% Similarity=0.763 Sum_probs=22.1
Q ss_pred eeeCCCCCCCC---------CCCCCCEEEeeccCCCCceEeecCCCCCCCCCccc
Q psy12432 25 TYACPPTYATW---------YCLNGATCFTVKIGESLLYNCECADGYMGQRCEFK 70 (229)
Q Consensus 25 ~C~C~~gy~gn---------pClNGGtC~~~~~~~~~~~sC~C~~Gy~G~rCe~~ 70 (229)
.|.|++||.|. .|.+.|-|.. -.|.|..||.|.-|+..
T Consensus 297 ~CiC~~g~~G~dCs~~~cpadC~g~G~Ci~--------G~C~C~~Gy~G~~C~~~ 343 (525)
T KOG1225|consen 297 ECICNPGYSGKDCSIRRCPADCSGHGKCID--------GECLCDEGYTGELCIQR 343 (525)
T ss_pred EeecCCCccccccccccCCccCCCCCcccC--------CceEeCCCCcCCccccc
Confidence 56666666552 3455555541 35666666666666664
No 42
>PF08693 SKG6: Transmembrane alpha-helix domain; InterPro: IPR014805 SKG6 and AXL2 are membrane proteins that show polarised intracellular localisation [, ]. This entry represents the highly conserved transmembrane alpha-helical domain found in these proteins [, ]. The full-length AXL2 protein has a negative regulatory function in cytokinesis [].
Probab=82.90 E-value=0.11 Score=34.05 Aligned_cols=18 Identities=28% Similarity=0.486 Sum_probs=7.7
Q ss_pred HHHhhhHHHHHHHHHHHH
Q psy12432 88 ASIASGASIAVFLVVILC 105 (229)
Q Consensus 88 ~~Ia~gI~V~lLLVlii~ 105 (229)
+.|+.|+.+-+.++++++
T Consensus 11 vaIa~~VvVPV~vI~~vl 28 (40)
T PF08693_consen 11 VAIAVGVVVPVGVIIIVL 28 (40)
T ss_pred EEEEEEEEechHHHHHHH
Confidence 344444444444333333
No 43
>KOG1225|consensus
Probab=81.90 E-value=1.8 Score=42.77 Aligned_cols=14 Identities=43% Similarity=1.253 Sum_probs=9.7
Q ss_pred EeecCCCCCCCCCc
Q psy12432 55 NCECADGYMGQRCE 68 (229)
Q Consensus 55 sC~C~~Gy~G~rCe 68 (229)
.|.|..+|.|..|+
T Consensus 297 ~CiC~~g~~G~dCs 310 (525)
T KOG1225|consen 297 ECICNPGYSGKDCS 310 (525)
T ss_pred EeecCCCccccccc
Confidence 67777777777664
No 44
>KOG1217|consensus
Probab=80.73 E-value=2.4 Score=38.80 Aligned_cols=42 Identities=38% Similarity=0.818 Sum_probs=33.2
Q ss_pred CCceeeCCCCCCCC---------------CCCCCCEEEeeccCCCCceEeecCCCCCCCCC
Q psy12432 22 TFHTYACPPTYATW---------------YCLNGATCFTVKIGESLLYNCECADGYMGQRC 67 (229)
Q Consensus 22 ~gy~C~C~~gy~gn---------------pClNGGtC~~~~~~~~~~~sC~C~~Gy~G~rC 67 (229)
.+|.|.|++||.+. +|.|+++|.... ..|.|.|+.+|.|..|
T Consensus 250 ~~~~C~~~~g~~~~~~~~~~~~~~C~~~~~c~~~~~C~~~~----~~~~C~C~~g~~g~~~ 306 (487)
T KOG1217|consen 250 GSYTCRCPEGYTGDACVTCVDVDSCALIASCPNGGTCVNVP----GSYRCTCPPGFTGRLC 306 (487)
T ss_pred CceeeeCCCCccccccceeeeccccCCCCccCCCCeeecCC----CcceeeCCCCCCCCCC
Confidence 34778888777643 388899999864 2499999999999998
No 45
>PF01299 Lamp: Lysosome-associated membrane glycoprotein (Lamp); InterPro: IPR002000 Lysosome-associated membrane glycoproteins (lamp) [] are integral membrane proteins, specific to lysosomes, and whose exact biological function is not yet clear. Structurally, the lamp proteins consist of two internally homologous lysosome-luminal domains separated by a proline-rich hinge region; at the C-terminal extremity there is a transmembrane region (TM) followed by a very short cytoplasmic tail (C). In each of the duplicated domains, there are two conserved disulphide bonds. This structure is schematically represented in the figure below. +-----+ +-----+ +-----+ +-----+ | | | | | | | | xCxxxxxCxxxxxxxxxxxxCxxxxxCxxxxxxxxxCxxxxxCxxxxxxxxxxxxCxxxxxCxxxxxxxx +--------------------------++Hinge++--------------------------++TM++C+ In mammals, there are two closely related types of lamp: lamp-1 and lamp-2, which form major components of the lysosome membrane. In chicken lamp-1 is known as LEP100. Also included in this entry is the macrophage protein CD68 (or macrosialin) [] is a heavily glycosylated integral membrane protein whose structure consists of a mucin-like domain followed by a proline-rich hinge; a single lamp-like domain; a transmembrane region and a short cytoplasmic tail. Similar to CD68, mammalian lamp-3, which is expressed in lymphoid organs, dendritic cells and in lung, contains all the C-terminal regions but lacks the N-terminal lamp-like region []. In a lamp-family protein from nematodes [] only the part C-terminal to the hinge is conserved. ; GO: 0016020 membrane
Probab=80.43 E-value=0.79 Score=41.77 Aligned_cols=28 Identities=29% Similarity=0.522 Sum_probs=13.1
Q ss_pred HHHHhhhHHHHHHHHHHHHHHHHHHHHHhh
Q psy12432 87 TASIASGASIAVFLVVILCFSLYVHCQRRK 116 (229)
Q Consensus 87 ~~~Ia~gI~V~lLLVlii~vvly~~~kRrk 116 (229)
++-||+|++|++|||++ ++.|+..+||.
T Consensus 272 ~vPIaVG~~La~lvliv--LiaYli~Rrr~ 299 (306)
T PF01299_consen 272 LVPIAVGAALAGLVLIV--LIAYLIGRRRS 299 (306)
T ss_pred hHHHHHHHHHHHHHHHH--HHhheeEeccc
Confidence 45556665554443333 33444444433
No 46
>PF05568 ASFV_J13L: African swine fever virus J13L protein; InterPro: IPR008385 This family consists of several African swine fever virus (ASFV) j13L proteins [, , ].
Probab=79.85 E-value=3.2 Score=34.87 Aligned_cols=15 Identities=33% Similarity=0.541 Sum_probs=7.4
Q ss_pred HHHHHHHHHHhhhhh
Q psy12432 105 CFSLYVHCQRRKKQA 119 (229)
Q Consensus 105 ~vvly~~~kRrk~~k 119 (229)
++++|++-.||||+.
T Consensus 46 ivli~lcssRKkKaa 60 (189)
T PF05568_consen 46 IVLIYLCSSRKKKAA 60 (189)
T ss_pred HHHHHHHhhhhHHHH
Confidence 334555555555544
No 47
>KOG0994|consensus
Probab=79.67 E-value=1.5 Score=47.15 Aligned_cols=50 Identities=30% Similarity=0.664 Sum_probs=33.2
Q ss_pred CCCCCCcee-eCCCCCCCC------------CCCCC--------CEEEeeccCCCCceEeecCCCCCCCCCcc
Q psy12432 18 RPNITFHTY-ACPPTYATW------------YCLNG--------ATCFTVKIGESLLYNCECADGYMGQRCEF 69 (229)
Q Consensus 18 ~~~~~gy~C-~C~~gy~gn------------pClNG--------GtC~~~~~~~~~~~sC~C~~Gy~G~rCe~ 69 (229)
+...+||.| +|..||.|+ ||..| ..|+.... +....|.|..||+|.||+.
T Consensus 879 qD~T~G~~CdrCl~GyyGdP~lg~g~~CrPCpCP~gp~Sg~~~A~sC~~d~~--t~~ivC~C~~GY~G~RCe~ 949 (1758)
T KOG0994|consen 879 QDSTTGHSCDRCLDGYYGDPRLGSGIGCRPCPCPDGPASGRQHADSCYLDTR--TQQIVCHCQEGYSGSRCEI 949 (1758)
T ss_pred cccccccchhhhhccccCCcccCCCCCCCCCCCCCCCccchhcccccccccc--ccceeeecccCccccchhh
Confidence 445567665 688888875 33333 34554321 2358999999999999986
No 48
>PF12191 stn_TNFRSF12A: Tumour necrosis factor receptor stn_TNFRSF12A_TNFR domain; InterPro: IPR022316 The tumour necrosis factor (TNF) receptor (TNFR) superfamily comprises more than 20 type-I transmembrane proteins. Family members are defined based on similarity in their extracellular domain - a region that contains many cysteine residues arranged in a specific repetitive pattern []. The cysteines allow formation of an extended rod-like structure, responsible for ligand binding []. Upon receptor activation, different intracellular signalling complexes are assembled for different members of the TNFR superfamily, depending on their intracellular domains and sequences []. Activation of TNFRs can therefore induce a range of disparate effects, including cell proliferation, differentiation, survival, or apoptotic cell death, depending upon the receptor involved []. TNFRs are widely distributed and play important roles in many crucial biological processes, such as lymphoid and neuronal development, innate and adaptive immunity, and maintenance of cellular homeostasis []. Drugs that manipulate their signalling have potential roles in the prevention and treatment of many diseases, such as viral infections, coronary heart disease, transplant rejection, and immune disease []. TNF receptor 12 (also known as TWEAK receptor, and fibroblast growth factor-inducible-14 (Fn14)) has been implicated in endothelial cell growth and migration []. The receptor may also play a role in cell-matrix interactions [].; PDB: 2KN0_A 2RPJ_A 2KMZ_A 2EQP_A.
Probab=79.15 E-value=0.61 Score=37.87 Aligned_cols=27 Identities=22% Similarity=0.101 Sum_probs=0.0
Q ss_pred HHHhhhHHHHHHHHHHHHHHHHHHHHH
Q psy12432 88 ASIASGASIAVFLVVILCFSLYVHCQR 114 (229)
Q Consensus 88 ~~Ia~gI~V~lLLVlii~vvly~~~kR 114 (229)
+.|.+++..++||+.++.+++|+++-|
T Consensus 79 ~pi~~sal~v~lVl~llsg~lv~rrcr 105 (129)
T PF12191_consen 79 WPILGSALSVVLVLALLSGFLVWRRCR 105 (129)
T ss_dssp ---------------------------
T ss_pred hhhhhhHHHHHHHHHHHHHHHHHhhhh
Confidence 334444444444444444444444333
No 49
>PF07213 DAP10: DAP10 membrane protein; InterPro: IPR009861 This family consists of several mammalian DAP10 membrane proteins. In activated mouse natural killer (NK) cells, the NKG2D receptor associates with two intracellular adaptors, DAP10 and DAP12, which trigger phosphatidyl inositol 3 kinase (PI3K) and Syk family protein tyrosine kinases, respectively. It has been suggested that the DAP10-PI3K pathway is sufficient to initiate NKG2D-mediated killing of target cells [].
Probab=79.00 E-value=2.5 Score=31.67 Aligned_cols=18 Identities=22% Similarity=0.429 Sum_probs=7.9
Q ss_pred HHHHHHHHHHHHHHhhhh
Q psy12432 101 VVILCFSLYVHCQRRKKQ 118 (229)
Q Consensus 101 Vlii~vvly~~~kRrk~~ 118 (229)
++|+.+++||-+.|+|++
T Consensus 48 LLIv~~vy~car~r~r~~ 65 (79)
T PF07213_consen 48 LLIVLVVYYCARPRRRPT 65 (79)
T ss_pred HHHHHHHHhhcccccCCc
Confidence 444444454444443333
No 50
>PF14584 DUF4446: Protein of unknown function (DUF4446)
Probab=78.78 E-value=3.4 Score=34.35 Aligned_cols=25 Identities=20% Similarity=0.352 Sum_probs=11.4
Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHh
Q psy12432 91 ASGASIAVFLVVILCFSLYVHCQRR 115 (229)
Q Consensus 91 a~gI~V~lLLVlii~vvly~~~kRr 115 (229)
+++++++++|+++++++++.+.+|.
T Consensus 4 ~i~l~~l~iilli~~~~~~~kl~kl 28 (151)
T PF14584_consen 4 IIGLLVLVIILLILIIILNIKLRKL 28 (151)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4444444444444444454544433
No 51
>cd01324 cbb3_Oxidase_CcoQ Cytochrome cbb oxidase CcoQ. Cytochrome cbb3 oxidase, the terminal oxidase in the respiratory chains of proteobacteria, is a multi-chain transmembrane protein located in the cell membrane. Like other cytochrome oxidases, it catalyzes the reduction of O2 and simultaneously pumps protons across the membrane. Found exclusively in proteobacteria, cbb3 is believed to be a modern enzyme that has evolved independently to perform a specialized function in microaerobic energy metabolism. The cbb3 operon contains four genes (ccoNOQP or fixNOQP), with ccoN coding for subunit I. Instead of a CuA-containing subunit II analogous to other cytochrome oxidases, cbb3 utilizes subunits ccoO and ccoP, which contain one and two hemes, respectively, to transfer electrons to the binuclear center. ccoQ, the fourth subunit, is a single transmembrane helix protein. It has been shown to protect the core complex from proteolytic degradation by serine proteases. See cd00919, cd01322
Probab=78.47 E-value=3.2 Score=28.06 Aligned_cols=32 Identities=16% Similarity=0.201 Sum_probs=19.2
Q ss_pred HHHhhhHHHHHHHHHHHHHHHHHHHHHhhhhh
Q psy12432 88 ASIASGASIAVFLVVILCFSLYVHCQRRKKQA 119 (229)
Q Consensus 88 ~~Ia~gI~V~lLLVlii~vvly~~~kRrk~~k 119 (229)
..++...++++++++++++++|.+..++|++-
T Consensus 8 r~~a~~~~l~~~~~~Figiv~wa~~p~~k~~f 39 (48)
T cd01324 8 RGLADSWGLLYLALFFLGVVVWAFRPGRKKAF 39 (48)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhCCCcchhH
Confidence 34454455556667777777777666655543
No 52
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=78.34 E-value=1.8 Score=27.38 Aligned_cols=27 Identities=37% Similarity=0.719 Sum_probs=19.8
Q ss_pred CCCCCCCEEEeeccCCCCceEeecCCCCCCC
Q psy12432 35 WYCLNGATCFTVKIGESLLYNCECADGYMGQ 65 (229)
Q Consensus 35 npClNGGtC~~~~~~~~~~~sC~C~~Gy~G~ 65 (229)
..|....+|.+.. ..+.|.|..||.|.
T Consensus 6 ~~C~~nA~C~~~~----~~~~C~C~~Gy~Gd 32 (36)
T PF12947_consen 6 GGCHPNATCTNTG----GSYTCTCKPGYEGD 32 (36)
T ss_dssp GGS-TTCEEEE-T----TSEEEEE-CEEECC
T ss_pred CCCCCCcEeecCC----CCEEeECCCCCccC
Confidence 3677889999874 27999999999886
No 53
>KOG1094|consensus
Probab=78.22 E-value=18 Score=36.96 Aligned_cols=32 Identities=28% Similarity=0.312 Sum_probs=16.8
Q ss_pred HHHhhhHHHHHHHHHHHHHHHHHHHHHhhhhh
Q psy12432 88 ASIASGASIAVFLVVILCFSLYVHCQRRKKQA 119 (229)
Q Consensus 88 ~~Ia~gI~V~lLLVlii~vvly~~~kRrk~~k 119 (229)
..|+++|.|+++++++++++++.+++|+.+.|
T Consensus 390 t~~~~~~f~~if~iva~ii~~~L~R~rr~~~k 421 (807)
T KOG1094|consen 390 TAILIIIFVAIFLIVALIIALMLWRWRRLLSK 421 (807)
T ss_pred ceehHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence 34556666666655555554444444444333
No 54
>TIGR01478 STEVOR variant surface antigen, stevor family. This model represents the stevor branch of the rifin/stevor family (pfam02009) of predicted variant surface antigens as found in Plasmodium falciparum. This model is based on a set of stevor sequences kindly provided by Matt Berriman from the Sanger Center. This is a global model and assesses a penalty for incomplete sequence. Additional fragmentary sequences may be found with the fragment model and a cutoff of 8 bits.
Probab=77.99 E-value=2.7 Score=38.59 Aligned_cols=7 Identities=14% Similarity=0.586 Sum_probs=5.1
Q ss_pred ceeeCCC
Q psy12432 24 HTYACPP 30 (229)
Q Consensus 24 y~C~C~~ 30 (229)
..|.|.+
T Consensus 143 s~cectd 149 (295)
T TIGR01478 143 KSCECTN 149 (295)
T ss_pred Cceeeec
Confidence 5788876
No 55
>PTZ00370 STEVOR; Provisional
Probab=77.63 E-value=2.7 Score=38.61 Aligned_cols=9 Identities=22% Similarity=0.397 Sum_probs=5.9
Q ss_pred ceeeCCCCC
Q psy12432 24 HTYACPPTY 32 (229)
Q Consensus 24 y~C~C~~gy 32 (229)
..|.|.+-.
T Consensus 143 s~cectd~n 151 (296)
T PTZ00370 143 STCECTDIN 151 (296)
T ss_pred Cceeeeecc
Confidence 378887643
No 56
>PF12301 CD99L2: CD99 antigen like protein 2; InterPro: IPR022078 This family of proteins is found in eukaryotes. Proteins in this family are typically between 165 and 237 amino acids in length. CD99L2 and CD99 are involved in trans-endothelial migration of neutrophils in vitro and in the recruitment of neutrophils into inflamed peritoneum.
Probab=77.50 E-value=2.5 Score=35.95 Aligned_cols=29 Identities=28% Similarity=0.408 Sum_probs=19.8
Q ss_pred HHHhhhHHHHHHHHHHHHHHHHHHHHHhh
Q psy12432 88 ASIASGASIAVFLVVILCFSLYVHCQRRK 116 (229)
Q Consensus 88 ~~Ia~gI~V~lLLVlii~vvly~~~kRrk 116 (229)
..+|.||+-+|++.|+..+.-|+.++|||
T Consensus 114 ~g~IaGIvsav~valvGAvsSyiaYqkKK 142 (169)
T PF12301_consen 114 AGTIAGIVSAVVVALVGAVSSYIAYQKKK 142 (169)
T ss_pred cchhhhHHHHHHHHHHHHHHHHHHHHhhc
Confidence 44566777677777777777777776655
No 57
>KOG3637|consensus
Probab=77.43 E-value=1 Score=47.98 Aligned_cols=33 Identities=18% Similarity=0.236 Sum_probs=24.9
Q ss_pred hHHHHHhhhHHHHHHHHHHHHHHHHHH--HHHhhh
Q psy12432 85 LETASIASGASIAVFLVVILCFSLYVH--CQRRKK 117 (229)
Q Consensus 85 ~~~~~Ia~gI~V~lLLVlii~vvly~~--~kRrk~ 117 (229)
..+|+|+++++++|||++++.+++|.. +||+|+
T Consensus 976 vp~wiIi~svl~GLLlL~llv~~LwK~GFFKR~r~ 1010 (1030)
T KOG3637|consen 976 VPLWIIILSVLGGLLLLALLVLLLWKCGFFKRNRK 1010 (1030)
T ss_pred cceeeehHHHHHHHHHHHHHHHHHHhcCccccCCC
Confidence 457888888988888888877777664 666665
No 58
>PF08374 Protocadherin: Protocadherin; InterPro: IPR013585 The structure of protocadherins is similar to that of classic cadherins (IPR002126 from INTERPRO), but they also have some unique features associated with the cytoplasmic domains. They are expressed in a variety of organisms and are found in high concentrations in the brain where they seem to be localised mainly at cell-cell contact sites. Their expression seems to be developmentally regulated [].
Probab=77.19 E-value=1.1 Score=39.50 Aligned_cols=21 Identities=14% Similarity=0.384 Sum_probs=8.9
Q ss_pred HHHhhhHHHHHHHHHHHHHHH
Q psy12432 88 ASIASGASIAVFLVVILCFSL 108 (229)
Q Consensus 88 ~~Ia~gI~V~lLLVlii~vvl 108 (229)
++|+.|++.++||+++++++-
T Consensus 41 iaiVAG~~tVILVI~i~v~vR 61 (221)
T PF08374_consen 41 IAIVAGIMTVILVIFIVVLVR 61 (221)
T ss_pred eeeecchhhhHHHHHHHHHHH
Confidence 344444444444444444333
No 59
>PF06697 DUF1191: Protein of unknown function (DUF1191); InterPro: IPR010605 This family contains hypothetical plant proteins of unknown function.
Probab=76.58 E-value=0.9 Score=41.48 Aligned_cols=13 Identities=23% Similarity=0.036 Sum_probs=6.2
Q ss_pred HHHHhhhHHHHHH
Q psy12432 87 TASIASGASIAVF 99 (229)
Q Consensus 87 ~~~Ia~gI~V~lL 99 (229)
+|.|++|+.++++
T Consensus 212 ~W~iv~g~~~G~~ 224 (278)
T PF06697_consen 212 WWKIVVGVVGGVV 224 (278)
T ss_pred eEEEEEEehHHHH
Confidence 4555555444443
No 60
>PF03302 VSP: Giardia variant-specific surface protein; InterPro: IPR005127 During infection, the intestinal protozoan parasite Giardia lamblia virus undergoes continuous antigenic variation which is determined by diversification of the parasite's major surface antigen, named VSP (variant surface protein).
Probab=76.56 E-value=0.65 Score=44.15 Aligned_cols=12 Identities=58% Similarity=0.783 Sum_probs=5.1
Q ss_pred hHHHHHhhhHHHH
Q psy12432 85 LETASIASGASIA 97 (229)
Q Consensus 85 ~~~~~Ia~gI~V~ 97 (229)
+++.+|+ ||.|+
T Consensus 364 LstgaIa-GIsva 375 (397)
T PF03302_consen 364 LSTGAIA-GISVA 375 (397)
T ss_pred cccccee-eeeeh
Confidence 3444443 44444
No 61
>PF02009 Rifin_STEVOR: Rifin/stevor family; InterPro: IPR002858 Malaria is still a major cause of mortality in many areas of the world. Plasmodium falciparum causes the most severe human form of the disease and is responsible for most fatalities. Severe cases of malaria can occur when the parasite invades and then proliferates within red blood cell erythrocytes. The parasite produces many variant antigenic proteins, encoded by multigene families, which are present on the surface of the infected erythrocyte and play important roles in virulence. A crucial survival mechanism for the malaria parasite is its ability to evade the immune response by switching these variant surface antigens. The high virulence of P. falciparum relative to other malarial parasites is in large part due to the fact that in this organism many of these surface antigens mediate the binding of infected erythrocytes to the vascular endothelium (cytoadherence) and non-infected erythrocytes (rosetting). This can lead to the accumulation of infected cells in the vasculature of a variety of organs, blocking the blood flow and reducing the oxygen supply. Clinical symptoms of severe infection can include fever, progressive anaemia, multi-organ dysfunction and coma. For more information see []. Several multicopy gene families have been described in Plasmodium falciparum, including the stevor family of subtelomeric open reading frames and the rif interspersed repetitive elements. Both families contain three predicted transmembrane segments. It has been proposed that stevor and rif are members of a larger superfamily that code for variant surface antigens [].
Probab=76.50 E-value=4.4 Score=37.38 Aligned_cols=12 Identities=8% Similarity=0.127 Sum_probs=6.8
Q ss_pred HHHHHhhhhhhh
Q psy12432 110 VHCQRRKKQAQA 121 (229)
Q Consensus 110 ~~~kRrk~~k~~ 121 (229)
+...|.|++|++
T Consensus 276 YLILRYRRKKKm 287 (299)
T PF02009_consen 276 YLILRYRRKKKM 287 (299)
T ss_pred HHHHHHHHHhhh
Confidence 346666665543
No 62
>KOG1217|consensus
Probab=75.67 E-value=3.3 Score=37.89 Aligned_cols=45 Identities=33% Similarity=0.693 Sum_probs=35.1
Q ss_pred CCceeeCCCCCCCC--------------CCCCCCEEEeeccCCCCceEeecCCCCCCCCCccc
Q psy12432 22 TFHTYACPPTYATW--------------YCLNGATCFTVKIGESLLYNCECADGYMGQRCEFK 70 (229)
Q Consensus 22 ~gy~C~C~~gy~gn--------------pClNGGtC~~~~~~~~~~~sC~C~~Gy~G~rCe~~ 70 (229)
..+.|.|..||.+. +|.|++.|.... ..|.|.|+.+|.|..|+..
T Consensus 150 ~~~~c~C~~g~~~~~~~~~~~~C~~~~~~c~~~~~C~~~~----~~~~C~c~~~~~~~~~~~~ 208 (487)
T KOG1217|consen 150 GPFRCSCTEGYEGEPCETDLDECIQYSSPCQNGGTCVNTG----GSYLCSCPPGYTGSTCETT 208 (487)
T ss_pred CceeeeeCCCcccccccccccccccCCCCcCCCcccccCC----CCeeEeCCCCccCCcCcCC
Confidence 35778888888752 477888888764 2599999999999998865
No 63
>PF08114 PMP1_2: ATPase proteolipid family; InterPro: IPR012589 This family consists of small proteolipids associated with the plasma membrane H+ ATPase. Two proteolipids (PMP1 and PMP2) are associated with the ATPase and both genes are similarly expressed in the wild-type strain of yeast. No modification of the level of transcription of one PMP gene is detected in a strain deleted of the other. Though both proteolipids show similarity with other small proteolipids associated with other cation -transporting ATPases, their functions remain unclear [].
Probab=74.40 E-value=4.6 Score=26.68 Aligned_cols=23 Identities=22% Similarity=0.394 Sum_probs=11.0
Q ss_pred HHHHHHHHHHHHHHHHHHHhhhh
Q psy12432 96 IAVFLVVILCFSLYVHCQRRKKQ 118 (229)
Q Consensus 96 V~lLLVlii~vvly~~~kRrk~~ 118 (229)
|++..++|++.++|-++..||+.
T Consensus 17 Vglv~i~iva~~iYRKw~aRkr~ 39 (43)
T PF08114_consen 17 VGLVGIGIVALFIYRKWQARKRA 39 (43)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Confidence 33444455555555555444443
No 64
>PF02038 ATP1G1_PLM_MAT8: ATP1G1/PLM/MAT8 family; InterPro: IPR000272 The FXYD protein family contains at least seven members in mammals []. Two other family members that are not obvious orthologs of any identified mammalian FXYD protein exist in zebrafish. All these proteins share a signature sequence of six conserved amino acids comprising the FXYD motif in the NH2-terminus, and two glycines and one serine residue in the transmembrane domain. FXYD proteins are widely distributed in mammalian tissues with prominent expression in tissues that perform fluid and solute transport or that are electrically excitable. Initial functional characterisation suggested that FXYD proteins act as channels or as modulators of ion channels however studies have revealed that most FXYD proteins have another specific function and act as tissue-specific regulatory subunits of the Na,K-ATPase. Each of these auxiliary subunits produces a distinct functional effect on the transport characteristics of the Na,K-ATPase that is adjusted to the specific functional demands of the tissue in which the FXYD protein is expressed. FXYD proteins appear to preferentially associate with Na,K-ATPase alpha1-beta isozymes, and affect their function in a way that render them operationally complementary or supplementary to coexisting isozymes.; GO: 0005216 ion channel activity, 0006811 ion transport, 0016020 membrane; PDB: 2JO1_A 2JP3_A 2ZXE_G 3A3Y_G 3N23_E 3B8E_H 3KDP_G 3N2F_E.
Probab=74.08 E-value=4.4 Score=27.82 Aligned_cols=24 Identities=17% Similarity=0.289 Sum_probs=10.5
Q ss_pred HHHHHHHHHHHHHHHHHHHhhhhh
Q psy12432 96 IAVFLVVILCFSLYVHCQRRKKQA 119 (229)
Q Consensus 96 V~lLLVlii~vvly~~~kRrk~~k 119 (229)
.+||.++.+++++.-+++.+.++|
T Consensus 22 A~vlfi~Gi~iils~kckCk~~qk 45 (50)
T PF02038_consen 22 AGVLFILGILIILSGKCKCKFNQK 45 (50)
T ss_dssp HHHHHHHHHHHHCTTHHHHHHSTT
T ss_pred HHHHHHHHHHHHHcCccccCCCCC
Confidence 344444444444444444443333
No 65
>PF06305 DUF1049: Protein of unknown function (DUF1049); InterPro: IPR010445 This entry consists of several hypothetical bacterial proteins of unknown function.
Probab=73.63 E-value=10 Score=26.42 Aligned_cols=26 Identities=19% Similarity=0.230 Sum_probs=12.8
Q ss_pred HhhhHHHHHHHHHHHHHHHHHHHHHh
Q psy12432 90 IASGASIAVFLVVILCFSLYVHCQRR 115 (229)
Q Consensus 90 Ia~gI~V~lLLVlii~vvly~~~kRr 115 (229)
|++...+++++..++.+..+++.+++
T Consensus 24 il~~f~~G~llg~l~~~~~~~~~r~~ 49 (68)
T PF06305_consen 24 ILIAFLLGALLGWLLSLPSRLRLRRR 49 (68)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33444455555555555555544433
No 66
>KOG1836|consensus
Probab=72.01 E-value=2.5 Score=47.15 Aligned_cols=45 Identities=31% Similarity=0.554 Sum_probs=34.0
Q ss_pred eeCCCCCCCC------------CCCCCCEEEeeccCCCCceEee-cCCCCCCCCCcccCC
Q psy12432 26 YACPPTYATW------------YCLNGATCFTVKIGESLLYNCE-CADGYMGQRCEFKDL 72 (229)
Q Consensus 26 C~C~~gy~gn------------pClNGGtC~~~~~~~~~~~sC~-C~~Gy~G~rCe~~~~ 72 (229)
-+|..||.|+ +|.|++.|..... .....|+ |+.||+|.||+.-..
T Consensus 759 ~~C~~GfYg~~~~~~~~dC~~C~Cp~~~~~~~~~~--~~~~iCk~Cp~gytG~rCe~c~d 816 (1705)
T KOG1836|consen 759 AQCVDGFYGLPDLGTSGDCQPCPCPNGGACGQTPE--ILEVVCKNCPPGYTGLRCEECAD 816 (1705)
T ss_pred hhhcCCCCCccccCCCCCCccCCCCCChhhcCcCc--ccceecCCCCCCCcccccccCCC
Confidence 3577777653 6778888887642 3479999 999999999997543
No 67
>PRK00523 hypothetical protein; Provisional
Probab=71.54 E-value=11 Score=27.74 Aligned_cols=17 Identities=24% Similarity=0.235 Sum_probs=7.2
Q ss_pred HhhhHHHHHHHHHHHHH
Q psy12432 90 IASGASIAVFLVVILCF 106 (229)
Q Consensus 90 Ia~gI~V~lLLVlii~v 106 (229)
++++++++.|+++++..
T Consensus 6 l~I~l~i~~li~G~~~G 22 (72)
T PRK00523 6 LALGLGIPLLIVGGIIG 22 (72)
T ss_pred HHHHHHHHHHHHHHHHH
Confidence 34444444444444443
No 68
>PF06084 Cytomega_TRL10: Cytomegalovirus TRL10 protein; InterPro: IPR009284 This family consists of several Cytomegalovirus TRL10 proteins. TRL10 represents a structural component of the virus particle and like the other HCMV envelope glycoproteins, is present in a disulphide-linked complex [].
Probab=71.47 E-value=2.5 Score=34.06 Aligned_cols=19 Identities=26% Similarity=0.317 Sum_probs=11.7
Q ss_pred eeeccccCCCCCCCCCccc
Q psy12432 152 LTRITTEAPRAADTRTSIT 170 (229)
Q Consensus 152 ~~~~~~e~~~~~~~~~~ld 170 (229)
+.||-|--+|+.|....|.
T Consensus 118 yeripyrp~rq~d~~p~~~ 136 (150)
T PF06084_consen 118 YERIPYRPCRQNDNSPPIE 136 (150)
T ss_pred hhhcCCCcccccCCCCccc
Confidence 3456666677766665555
No 69
>COG4736 CcoQ Cbb3-type cytochrome oxidase, subunit 3 [Posttranslational modification, protein turnover, chaperones]
Probab=70.81 E-value=8.9 Score=27.28 Aligned_cols=35 Identities=14% Similarity=0.210 Sum_probs=17.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHhhhhh-hhcc--ccccCC
Q psy12432 95 SIAVFLVVILCFSLYVHCQRRKKQA-QAAS--VCCTDG 129 (229)
Q Consensus 95 ~V~lLLVlii~vvly~~~kRrk~~k-~~~~--~~~~d~ 129 (229)
.+++|.+++++++.+.++.++|++. +.++ ..+.|+
T Consensus 14 ~t~~~~l~fiavi~~ayr~~~K~~~d~aa~~~l~l~Dd 51 (60)
T COG4736 14 GTIAFTLFFIAVIYFAYRPGKKGEFDEAARGILPLNDD 51 (60)
T ss_pred HHHHHHHHHHHHHHHHhcccchhhHHHHhccCCCCCcc
Confidence 3444555555555555555555554 3333 334444
No 70
>PF11770 GAPT: GRB2-binding adapter (GAPT); InterPro: IPR021082 This entry represents a family of transmembrane proteins which bind the growth factor receptor-bound protein 2 (GRB2) in B cells []. In contrast to other transmembrane adaptor proteins, GAPT, which this entry represents, is not phosphorylated upon BCR ligation. It associates with GRB2 constitutively through its proline-rich region [].
Probab=70.72 E-value=1.2 Score=37.33 Aligned_cols=16 Identities=25% Similarity=0.509 Sum_probs=7.6
Q ss_pred HHHhhhHHHHHHHHHH
Q psy12432 88 ASIASGASIAVFLVVI 103 (229)
Q Consensus 88 ~~Ia~gI~V~lLLVli 103 (229)
+.|++||.++||||+.
T Consensus 10 v~i~igi~Ll~lLl~c 25 (158)
T PF11770_consen 10 VAISIGISLLLLLLLC 25 (158)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 3445555555443333
No 71
>smart00051 DSL delta serrate ligand.
Probab=70.45 E-value=3.5 Score=29.28 Aligned_cols=41 Identities=24% Similarity=0.330 Sum_probs=25.2
Q ss_pred ceeeCCCCCCCCCCCCCCEEEeeccCCCCc------eEeecCCCCCCCCC
Q psy12432 24 HTYACPPTYATWYCLNGATCFTVKIGESLL------YNCECADGYMGQRC 67 (229)
Q Consensus 24 y~C~C~~gy~gnpClNGGtC~~~~~~~~~~------~sC~C~~Gy~G~rC 67 (229)
+.-.|.++|.|..|. -.|....+ .... -.|.|.+||.|..|
T Consensus 17 ~rv~C~~~~yG~~C~--~~C~~~~d-~~~~~~Cd~~G~~~C~~Gw~G~~C 63 (63)
T smart00051 17 IRVTCDENYYGEGCN--KFCRPRDD-FFGHYTCDENGNKGCLEGWMGPYC 63 (63)
T ss_pred EEeeCCCCCcCCccC--CEeCcCcc-ccCCccCCcCCCEecCCCCcCCCC
Confidence 467788988887664 23432110 0112 34669999999887
No 72
>PF15102 TMEM154: TMEM154 protein family
Probab=69.73 E-value=1.2 Score=37.12 Aligned_cols=6 Identities=33% Similarity=0.717 Sum_probs=2.3
Q ss_pred Hhhhhh
Q psy12432 114 RRKKQA 119 (229)
Q Consensus 114 Rrk~~k 119 (229)
|||+.|
T Consensus 83 kRkr~K 88 (146)
T PF15102_consen 83 KRKRTK 88 (146)
T ss_pred eecccC
Confidence 333334
No 73
>PF12768 Rax2: Cortical protein marker for cell polarity
Probab=68.62 E-value=4.4 Score=36.93 Aligned_cols=8 Identities=0% Similarity=0.090 Sum_probs=3.1
Q ss_pred hhHHHHHH
Q psy12432 92 SGASIAVF 99 (229)
Q Consensus 92 ~gI~V~lL 99 (229)
++|.++++
T Consensus 234 lAiALG~v 241 (281)
T PF12768_consen 234 LAIALGTV 241 (281)
T ss_pred hHHHHHHH
Confidence 33334433
No 74
>PF06365 CD34_antigen: CD34/Podocalyxin family; InterPro: IPR013836 This family consists of several mammalian CD34 antigen proteins. The CD34 antigen is a human leukocyte membrane protein expressed specifically by lymphohematopoietic progenitor cells. CD34 is a phosphoprotein. Activation of protein kinase C (PKC) has been found to enhance CD34 phosphorylation [, ]. This family contains several eukaryotic podocalyxin proteins. Podocalyxin is a major membrane protein of the glomerular epithelium and is thought to be involved in maintenance of the architecture of the foot processes and filtration slits characteristic of this unique epithelium by virtue of its high negative charge. Podocalyxin functions as an anti-adhesin that maintains an open filtration pathway between neighbouring foot processes in the glomerular epithelium by charge repulsion [].
Probab=68.53 E-value=3.6 Score=35.98 Aligned_cols=20 Identities=35% Similarity=0.333 Sum_probs=11.0
Q ss_pred HHHHHHHHHHHHHHHHHhhh
Q psy12432 98 VFLVVILCFSLYVHCQRRKK 117 (229)
Q Consensus 98 lLLVlii~vvly~~~kRrk~ 117 (229)
+||++++++.+|+.++||..
T Consensus 111 ~lLla~~~~~~Y~~~~Rrs~ 130 (202)
T PF06365_consen 111 FLLLAILLGAGYCCHQRRSW 130 (202)
T ss_pred HHHHHHHHHHHHHhhhhccC
Confidence 34445555556666666543
No 75
>PF15065 NCU-G1: Lysosomal transcription factor, NCU-G1
Probab=68.04 E-value=2 Score=40.41 Aligned_cols=28 Identities=29% Similarity=0.469 Sum_probs=13.2
Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHhhhh
Q psy12432 91 ASGASIAVFLVVILCFSLYVHCQRRKKQ 118 (229)
Q Consensus 91 a~gI~V~lLLVlii~vvly~~~kRrk~~ 118 (229)
+++|++++=++++++..+|+..+|+|++
T Consensus 322 i~~vgLG~P~l~li~Ggl~v~~~r~r~~ 349 (350)
T PF15065_consen 322 IMAVGLGVPLLLLILGGLYVCLRRRRKR 349 (350)
T ss_pred HHHHHhhHHHHHHHHhhheEEEeccccC
Confidence 3444444444444444455555554444
No 76
>COG1862 YajC Preprotein translocase subunit YajC [Intracellular trafficking and secretion]
Probab=66.81 E-value=5.9 Score=30.71 Aligned_cols=19 Identities=21% Similarity=0.475 Sum_probs=7.2
Q ss_pred HHHHHHHHHHHHHHHhhhh
Q psy12432 100 LVVILCFSLYVHCQRRKKQ 118 (229)
Q Consensus 100 LVlii~vvly~~~kRrk~~ 118 (229)
+|++++++++...+.+|++
T Consensus 15 ~vl~~~ifyFli~RPQrKr 33 (97)
T COG1862 15 LVLIFAIFYFLIIRPQRKR 33 (97)
T ss_pred HHHHHHHHHHhhcCHHHHH
Confidence 3344443333333333333
No 77
>PF15345 TMEM51: Transmembrane protein 51
Probab=65.64 E-value=8.7 Score=34.28 Aligned_cols=25 Identities=16% Similarity=0.104 Sum_probs=19.3
Q ss_pred ccCCCCcceeeeeccccCCCCCCCC
Q psy12432 142 ERRPSPADFVLTRITTEAPRAADTR 166 (229)
Q Consensus 142 ~~~~~p~~~~~~~~~~e~~~~~~~~ 166 (229)
.+-.+|.-+.|+...+...++.+.+
T Consensus 120 s~y~vPSYEEvv~s~~~~~r~~~~~ 144 (233)
T PF15345_consen 120 SRYYVPSYEEVVNSGYSPVRQSNQR 144 (233)
T ss_pred ccccCCChHHHHhcCCCCccccccC
Confidence 4566888888998888888877664
No 78
>KOG3516|consensus
Probab=65.11 E-value=7.3 Score=42.00 Aligned_cols=33 Identities=33% Similarity=0.761 Sum_probs=28.0
Q ss_pred CCCCCCCEEEeeccCCCCceEeecCC-CCCCCCCcccC
Q psy12432 35 WYCLNGATCFTVKIGESLLYNCECAD-GYMGQRCEFKD 71 (229)
Q Consensus 35 npClNGGtC~~~~~~~~~~~sC~C~~-Gy~G~rCe~~~ 71 (229)
.+|+|||.|...-. +|+|.|.. .|.|..|....
T Consensus 961 ~~C~NGG~Cvery~----gytCDCs~Tay~Gp~Cs~ei 994 (1306)
T KOG3516|consen 961 YPCLNGGHCVERYD----GYTCDCSRTAYDGPFCSKEI 994 (1306)
T ss_pred ccccCCCEEEEecC----ceeeccccCcCCCCcccccc
Confidence 59999999998642 69999987 59999999764
No 79
>PF10577 UPF0560: Uncharacterised protein family UPF0560; InterPro: IPR018890 This family of proteins has no known function.
Probab=65.05 E-value=6.1 Score=40.94 Aligned_cols=11 Identities=0% Similarity=-0.091 Sum_probs=5.1
Q ss_pred CCCCCCEEEee
Q psy12432 36 YCLNGATCFTV 46 (229)
Q Consensus 36 pClNGGtC~~~ 46 (229)
.+.+.|.-.+.
T Consensus 220 ~Wv~~G~G~Vk 230 (807)
T PF10577_consen 220 AWVKSGLGMVK 230 (807)
T ss_pred eeEecceEEEE
Confidence 34444554443
No 80
>PF15050 SCIMP: SCIMP protein
Probab=64.91 E-value=12 Score=30.44 Aligned_cols=9 Identities=33% Similarity=0.774 Sum_probs=3.9
Q ss_pred HHHHHHHHh
Q psy12432 107 SLYVHCQRR 115 (229)
Q Consensus 107 vly~~~kRr 115 (229)
++||.++++
T Consensus 26 IlyCvcR~~ 34 (133)
T PF15050_consen 26 ILYCVCRWQ 34 (133)
T ss_pred HHHHHHHHH
Confidence 444444433
No 81
>KOG1226|consensus
Probab=64.82 E-value=7.5 Score=40.08 Aligned_cols=49 Identities=29% Similarity=0.506 Sum_probs=28.4
Q ss_pred CCCCCCceeeCCCCCCCCCCC--CCCEEEeeccCCCCceEeecCCC-CCCCCCccc
Q psy12432 18 RPNITFHTYACPPTYATWYCL--NGATCFTVKIGESLLYNCECADG-YMGQRCEFK 70 (229)
Q Consensus 18 ~~~~~gy~C~C~~gy~gnpCl--NGGtC~~~~~~~~~~~sC~C~~G-y~G~rCe~~ 70 (229)
.++|+|..|.|+..- +-|. +|++|.-.. .=.=..|.|... |.|..||.-
T Consensus 571 ~~GwtG~~C~C~~st--d~C~~~~G~iCSGrG--~C~Cg~C~C~~~~~sG~~CE~c 622 (783)
T KOG1226|consen 571 NPGWTGSACNCPLST--DTCESSDGQICSGRG--TCECGRCKCTDPPYSGEFCEKC 622 (783)
T ss_pred CCCCccCCCCCCCCC--ccccCCCCceeCCCc--eeeCCceEcCCCCcCcchhhcC
Confidence 456666666666532 3443 455553221 000146678877 899999995
No 82
>PF01299 Lamp: Lysosome-associated membrane glycoprotein (Lamp); InterPro: IPR002000 Lysosome-associated membrane glycoproteins (lamp) [] are integral membrane proteins, specific to lysosomes, and whose exact biological function is not yet clear. Structurally, the lamp proteins consist of two internally homologous lysosome-luminal domains separated by a proline-rich hinge region; at the C-terminal extremity there is a transmembrane region (TM) followed by a very short cytoplasmic tail (C). In each of the duplicated domains, there are two conserved disulphide bonds. This structure is schematically represented in the figure below. +-----+ +-----+ +-----+ +-----+ | | | | | | | | xCxxxxxCxxxxxxxxxxxxCxxxxxCxxxxxxxxxCxxxxxCxxxxxxxxxxxxCxxxxxCxxxxxxxx +--------------------------++Hinge++--------------------------++TM++C+ In mammals, there are two closely related types of lamp: lamp-1 and lamp-2, which form major components of the lysosome membrane. In chicken lamp-1 is known as LEP100. Also included in this entry is the macrophage protein CD68 (or macrosialin) [] is a heavily glycosylated integral membrane protein whose structure consists of a mucin-like domain followed by a proline-rich hinge; a single lamp-like domain; a transmembrane region and a short cytoplasmic tail. Similar to CD68, mammalian lamp-3, which is expressed in lymphoid organs, dendritic cells and in lung, contains all the C-terminal regions but lacks the N-terminal lamp-like region []. In a lamp-family protein from nematodes [] only the part C-terminal to the hinge is conserved. ; GO: 0016020 membrane
Probab=64.70 E-value=4 Score=37.15 Aligned_cols=32 Identities=13% Similarity=0.113 Sum_probs=17.1
Q ss_pred HHhhhHHHHHHHHHHHHHHHHHHHHHhhhhhhh
Q psy12432 89 SIASGASIAVFLVVILCFSLYVHCQRRKKQAQA 121 (229)
Q Consensus 89 ~Ia~gI~V~lLLVlii~vvly~~~kRrk~~k~~ 121 (229)
.+++.|+|++.| +++++++.+-+.=.|||.+.
T Consensus 270 ~~~vPIaVG~~L-a~lvlivLiaYli~Rrr~~~ 301 (306)
T PF01299_consen 270 SDLVPIAVGAAL-AGLVLIVLIAYLIGRRRSRA 301 (306)
T ss_pred cchHHHHHHHHH-HHHHHHHHHhheeEeccccc
Confidence 455556555543 44455555555555555543
No 83
>PF14610 DUF4448: Protein of unknown function (DUF4448)
Probab=64.61 E-value=1.2 Score=37.86 Aligned_cols=22 Identities=9% Similarity=0.486 Sum_probs=9.7
Q ss_pred HHhhhHHHHHHHHHHHHHHHHH
Q psy12432 89 SIASGASIAVFLVVILCFSLYV 110 (229)
Q Consensus 89 ~Ia~gI~V~lLLVlii~vvly~ 110 (229)
++++++-++|++++++++++++
T Consensus 159 ~laI~lPvvv~~~~~~~~~~~~ 180 (189)
T PF14610_consen 159 ALAIALPVVVVVLALIMYGFFF 180 (189)
T ss_pred eEEEEccHHHHHHHHHHHhhhe
Confidence 4445554444444444433333
No 84
>PRK11901 hypothetical protein; Reviewed
Probab=64.39 E-value=7.2 Score=36.46 Aligned_cols=19 Identities=26% Similarity=0.536 Sum_probs=9.4
Q ss_pred HHhhhHHHHHHHHHHHHHH
Q psy12432 89 SIASGASIAVFLVVILCFS 107 (229)
Q Consensus 89 ~Ia~gI~V~lLLVlii~vv 107 (229)
.+.+||+|+|||||||.+.
T Consensus 37 h~MiGiGilVLlLLIi~Ig 55 (327)
T PRK11901 37 HMMIGIGILVLLLLIIAIG 55 (327)
T ss_pred HHHHHHHHHHHHHHHHHHh
Confidence 3444555555555555443
No 85
>PF11980 DUF3481: Domain of unknown function (DUF3481); InterPro: IPR022579 This domain of unknown function is located in the C terminus of the eukaryotic neuropilin receptor family of proteins. It is found in association with PF00754 from PFAM, PF00431 from PFAM and PF00629 from PFAM. There are two completely conserved residues (Y and E) that may be functionally important.
Probab=64.07 E-value=7.4 Score=29.58 Aligned_cols=29 Identities=21% Similarity=0.154 Sum_probs=13.1
Q ss_pred HHHHhhhHHHHH-HHHHHHHHHHHHHHHHh
Q psy12432 87 TASIASGASIAV-FLVVILCFSLYVHCQRR 115 (229)
Q Consensus 87 ~~~Ia~gI~V~l-LLVlii~vvly~~~kRr 115 (229)
|..|+.+..++| |+.+.+.+++||.+.+-
T Consensus 16 ~yyiiA~gga~llL~~v~l~vvL~C~r~~~ 45 (87)
T PF11980_consen 16 WYYIIAMGGALLLLVAVCLGVVLYCHRFHW 45 (87)
T ss_pred eeHHHhhccHHHHHHHHHHHHHHhhhhhcc
Confidence 344443333333 33333355566655544
No 86
>PF04971 Lysis_S: Lysis protein S ; InterPro: IPR007054 The lysis S protein is a cytotoxic protein forming holes in membranes causing cell lysis. The action of Lysis S is independent of the proportion of acidic phospholipids in the membrane [].
Probab=63.06 E-value=12 Score=27.33 Aligned_cols=15 Identities=20% Similarity=0.293 Sum_probs=6.5
Q ss_pred hhhHHHHHHHHHHHH
Q psy12432 91 ASGASIAVFLVVILC 105 (229)
Q Consensus 91 a~gI~V~lLLVlii~ 105 (229)
++|++.+|++.++..
T Consensus 35 aIGvi~gi~~~~lt~ 49 (68)
T PF04971_consen 35 AIGVIGGIFFGLLTY 49 (68)
T ss_pred hHHHHHHHHHHHHHH
Confidence 334444444444443
No 87
>PF10873 DUF2668: Protein of unknown function (DUF2668); InterPro: IPR022640 Members in this family of proteins are annotated as cysteine and tyrosine-rich protein 1, however currently no function is known [].
Probab=62.90 E-value=4 Score=34.05 Aligned_cols=29 Identities=14% Similarity=0.147 Sum_probs=15.1
Q ss_pred HHHHHhhhHHHHHHHHHHHHHHHHHHHHH
Q psy12432 86 ETASIASGASIAVFLVVILCFSLYVHCQR 114 (229)
Q Consensus 86 ~~~~Ia~gI~V~lLLVlii~vvly~~~kR 114 (229)
.+++|++||++++.+|..+++.++...++
T Consensus 62 AIaGIVfgiVfimgvva~i~icvCmc~kn 90 (155)
T PF10873_consen 62 AIAGIVFGIVFIMGVVAGIAICVCMCMKN 90 (155)
T ss_pred eeeeeehhhHHHHHHHHHHHHHHhhhhhc
Confidence 34566666655555555544444444443
No 88
>PHA03265 envelope glycoprotein D; Provisional
Probab=62.24 E-value=3.8 Score=38.74 Aligned_cols=15 Identities=40% Similarity=0.638 Sum_probs=7.5
Q ss_pred HHHHHHHhhhhhhhc
Q psy12432 108 LYVHCQRRKKQAQAA 122 (229)
Q Consensus 108 ly~~~kRrk~~k~~~ 122 (229)
+|+.++|||..++..
T Consensus 368 l~~~~rr~k~~~k~~ 382 (402)
T PHA03265 368 LYVCLRRKKELKKSA 382 (402)
T ss_pred HHHHhhhhhhhhhhh
Confidence 444455555555443
No 89
>KOG0196|consensus
Probab=61.53 E-value=7.8 Score=40.53 Aligned_cols=17 Identities=24% Similarity=0.196 Sum_probs=12.7
Q ss_pred CcceeeeeccccCCCCC
Q psy12432 147 PADFVLTRITTEAPRAA 163 (229)
Q Consensus 147 p~~~~~~~~~~e~~~~~ 163 (229)
.....+-+++||+|-++
T Consensus 602 G~k~YiDP~TYEDPnqA 618 (996)
T KOG0196|consen 602 GTKTYIDPHTYEDPNQA 618 (996)
T ss_pred CceeecCCccccCccHH
Confidence 45667788999999654
No 90
>PF05568 ASFV_J13L: African swine fever virus J13L protein; InterPro: IPR008385 This family consists of several African swine fever virus (ASFV) j13L proteins [, , ].
Probab=60.87 E-value=18 Score=30.39 Aligned_cols=36 Identities=6% Similarity=0.184 Sum_probs=26.4
Q ss_pred HHHHHhhhHHHHHHHHHHHHHHHHHHHHHhhhhhhh
Q psy12432 86 ETASIASGASIAVFLVVILCFSLYVHCQRRKKQAQA 121 (229)
Q Consensus 86 ~~~~Ia~gI~V~lLLVlii~vvly~~~kRrk~~k~~ 121 (229)
-+..|+++|+|.++++++++.|.--++||+...-..
T Consensus 30 hm~tILiaIvVliiiiivli~lcssRKkKaaAAi~e 65 (189)
T PF05568_consen 30 HMYTILIAIVVLIIIIIVLIYLCSSRKKKAAAAIEE 65 (189)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhhHHHHhhhhh
Confidence 356677788888777777777788888887766543
No 91
>PF06143 Baculo_11_kDa: Baculovirus 11 kDa family; InterPro: IPR009313 This is a family of uncharacterised Baculovirus proteins that are all about 11 kDa in size.
Probab=60.58 E-value=27 Score=26.46 Aligned_cols=27 Identities=7% Similarity=0.133 Sum_probs=13.1
Q ss_pred hHHHHHhhhHHHHHHHHHHHHHHHHHH
Q psy12432 85 LETASIASGASIAVFLVVILCFSLYVH 111 (229)
Q Consensus 85 ~~~~~Ia~gI~V~lLLVlii~vvly~~ 111 (229)
.++..++.|+.|.+++++++.+++.+.
T Consensus 34 rdFvLVic~~lVfVii~lFi~ll~~i~ 60 (84)
T PF06143_consen 34 RDFVLVICCFLVFVIIVLFILLLYNIN 60 (84)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 345555555555555555554444433
No 92
>PF02480 Herpes_gE: Alphaherpesvirus glycoprotein E; InterPro: IPR003404 Glycoprotein E (gE) of Alphaherpesvirus forms a complex with glycoprotein I (gI), functioning as an immunoglobulin G (IgG) Fc binding protein. gE is involved in virus spread but is not essential for propagation [].; GO: 0016020 membrane; PDB: 2GJ7_F 2GIY_B.
Probab=59.85 E-value=3 Score=40.39 Aligned_cols=31 Identities=13% Similarity=0.316 Sum_probs=0.0
Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHhhhhhhh
Q psy12432 91 ASGASIAVFLVVILCFSLYVHCQRRKKQAQA 121 (229)
Q Consensus 91 a~gI~V~lLLVlii~vvly~~~kRrk~~k~~ 121 (229)
++++++++++++++++++++.+.++|++++.
T Consensus 354 ~l~vVlgvavlivVv~viv~vc~~~rrrR~~ 384 (439)
T PF02480_consen 354 LLGVVLGVAVLIVVVGVIVWVCLRCRRRRRQ 384 (439)
T ss_dssp -------------------------------
T ss_pred hHHHHHHHHHHHHHHHHHhheeeeehhcccc
Confidence 3444444445555555566667787777755
No 93
>PF14851 FAM176: FAM176 family
Probab=59.69 E-value=7 Score=32.76 Aligned_cols=20 Identities=20% Similarity=0.268 Sum_probs=13.6
Q ss_pred HHHhhhHHHHHHHHHHHHHH
Q psy12432 88 ASIASGASIAVFLVVILCFS 107 (229)
Q Consensus 88 ~~Ia~gI~V~lLLVlii~vv 107 (229)
.+.+.|||++|||.|.++++
T Consensus 24 LYFv~gVC~GLlLtLcllV~ 43 (153)
T PF14851_consen 24 LYFVSGVCAGLLLTLCLLVI 43 (153)
T ss_pred HHHHHHHHHHHHHHHHHHHh
Confidence 35578888888766655544
No 94
>PTZ00046 rifin; Provisional
Probab=59.56 E-value=16 Score=34.59 Aligned_cols=16 Identities=25% Similarity=0.515 Sum_probs=8.2
Q ss_pred hHHHHHHHHHHHHHHH
Q psy12432 93 GASIAVFLVVILCFSL 108 (229)
Q Consensus 93 gI~V~lLLVlii~vvl 108 (229)
+|+|.||+++|+=+++
T Consensus 323 AIvVIVLIMvIIYLIL 338 (358)
T PTZ00046 323 AIVVIVLIMVIIYLIL 338 (358)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 4445555555555443
No 95
>PRK12705 hypothetical protein; Provisional
Probab=59.24 E-value=12 Score=37.05 Aligned_cols=29 Identities=7% Similarity=0.071 Sum_probs=14.9
Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHhhhhh
Q psy12432 91 ASGASIAVFLVVILCFSLYVHCQRRKKQA 119 (229)
Q Consensus 91 a~gI~V~lLLVlii~vvly~~~kRrk~~k 119 (229)
++++++++|+.++.+|+++|.++|+++.+
T Consensus 5 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 33 (508)
T PRK12705 5 ILLVILLLLIGLLLGVLVVLLKKRQRLAK 33 (508)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33344455555555555555555554443
No 96
>PF15099 PIRT: Phosphoinositide-interacting protein family
Probab=58.95 E-value=8.4 Score=31.39 Aligned_cols=11 Identities=18% Similarity=0.495 Sum_probs=4.2
Q ss_pred HHHHHHHHHHH
Q psy12432 101 VVILCFSLYVH 111 (229)
Q Consensus 101 Vlii~vvly~~ 111 (229)
+++++..++|+
T Consensus 93 mlL~~~alcW~ 103 (129)
T PF15099_consen 93 MLLACSALCWK 103 (129)
T ss_pred HHHHhhhheeh
Confidence 33333334443
No 97
>PF14914 LRRC37AB_C: LRRC37A/B like protein 1 C-terminal domain
Probab=58.92 E-value=12 Score=31.40 Aligned_cols=27 Identities=19% Similarity=0.453 Sum_probs=11.6
Q ss_pred HhhhHHHHHHHHHHHHHHHHHHHHHhh
Q psy12432 90 IASGASIAVFLVVILCFSLYVHCQRRK 116 (229)
Q Consensus 90 Ia~gI~V~lLLVlii~vvly~~~kRrk 116 (229)
+++.+.+++++++|+..++.++.+|+.
T Consensus 123 laisvtvv~~iliii~CLiei~shr~a 149 (154)
T PF14914_consen 123 LAISVTVVVMILIIIFCLIEICSHRRA 149 (154)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHhcccc
Confidence 344444444444443334444444443
No 98
>KOG3488|consensus
Probab=58.83 E-value=16 Score=27.02 Aligned_cols=21 Identities=14% Similarity=0.271 Sum_probs=11.0
Q ss_pred HHHhhhHHHHHHHHHHHHHHH
Q psy12432 88 ASIASGASIAVFLVVILCFSL 108 (229)
Q Consensus 88 ~~Ia~gI~V~lLLVlii~vvl 108 (229)
+.|.+.+.++++++.++..++
T Consensus 48 yAi~iPvaagl~ll~lig~Fi 68 (81)
T KOG3488|consen 48 YAITIPVAAGLFLLCLIGTFI 68 (81)
T ss_pred HHhhhHHHHHHHHHHHHHHHH
Confidence 445555666665555444433
No 99
>TIGR02976 phageshock_pspB phage shock protein B. This model describes the PspB protein of the psp (phage shock protein) operon, as found in Escherichia coli and many related species. Expression of a phage protein called secretin protein IV, and a number of other stresses including ethanol, heat shock, and defects in protein secretion trigger sigma-54-dependent expression of the phage shock regulon. PspB is both a regulator and an effector protein of the phage shock response.
Probab=58.21 E-value=16 Score=26.97 Aligned_cols=12 Identities=17% Similarity=0.251 Sum_probs=4.8
Q ss_pred HHHHHHHHHHHh
Q psy12432 104 LCFSLYVHCQRR 115 (229)
Q Consensus 104 i~vvly~~~kRr 115 (229)
..++++.++.+|
T Consensus 18 ap~wl~lHY~~k 29 (75)
T TIGR02976 18 APLWLILHYRSK 29 (75)
T ss_pred HHHHHHHHHHhh
Confidence 333444444433
No 100
>PF12597 DUF3767: Protein of unknown function (DUF3767); InterPro: IPR022533 This group of proteins includes mitochodrial cytochrome c oxidase proteins [], and some transmembrane domain-containing proteins of unknown function known as FAM36A. Proteins in this family are typically between 112 and 199 amino acids in length.
Probab=58.00 E-value=24 Score=28.16 Aligned_cols=27 Identities=26% Similarity=0.349 Sum_probs=18.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHhhhhhhh
Q psy12432 95 SIAVFLVVILCFSLYVHCQRRKKQAQA 121 (229)
Q Consensus 95 ~V~lLLVlii~vvly~~~kRrk~~k~~ 121 (229)
.|+-|+++-++.|.||+++|++.+.++
T Consensus 72 avgsF~l~s~~~we~Cr~~r~~~~~~~ 98 (118)
T PF12597_consen 72 AVGSFFLGSLGSWEYCRYNRRKERQQM 98 (118)
T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344456666667788888877776654
No 101
>PHA02902 putative IMV membrane protein; Provisional
Probab=57.63 E-value=15 Score=26.69 Aligned_cols=22 Identities=27% Similarity=0.576 Sum_probs=13.9
Q ss_pred HHHHHHHHHHHHHHHHHHhhhh
Q psy12432 97 AVFLVVILCFSLYVHCQRRKKQ 118 (229)
Q Consensus 97 ~lLLVlii~vvly~~~kRrk~~ 118 (229)
.++.|+|+|.++|..++|.|..
T Consensus 9 ~~v~v~Ivclliya~YrR~kci 30 (70)
T PHA02902 9 LAVIVIIFCLLIYAAYKRYKCI 30 (70)
T ss_pred HHHHHHHHHHHHHHHHHHhcCC
Confidence 3445666777777777775443
No 102
>TIGR01477 RIFIN variant surface antigen, rifin family. This model represents the rifin branch of the rifin/stevor family (pfam02009) of predicted variant surface antigens as found in Plasmodium falciparum. This model is based on a set of rifin sequences kindly provided by Matt Berriman from the Sanger Center. This is a global model and assesses a penalty for incomplete sequence. Additional fragmentary sequences may be found with the fragment model and a cutoff of 20 bits.
Probab=57.48 E-value=18 Score=34.26 Aligned_cols=14 Identities=7% Similarity=0.159 Sum_probs=9.3
Q ss_pred HHHHHHHhhhhhhh
Q psy12432 108 LYVHCQRRKKQAQA 121 (229)
Q Consensus 108 ly~~~kRrk~~k~~ 121 (229)
+.+..-|.|++|++
T Consensus 328 IIYLILRYRRKKKM 341 (353)
T TIGR01477 328 IIYLILRYRRKKKM 341 (353)
T ss_pred HHHHHHHhhhcchh
Confidence 33447888888765
No 103
>PF03302 VSP: Giardia variant-specific surface protein; InterPro: IPR005127 During infection, the intestinal protozoan parasite Giardia lamblia virus undergoes continuous antigenic variation which is determined by diversification of the parasite's major surface antigen, named VSP (variant surface protein).
Probab=57.18 E-value=2.9 Score=39.73 Aligned_cols=27 Identities=26% Similarity=0.348 Sum_probs=19.0
Q ss_pred HHHHHhhhHHHHHH-HHHHHHHHHHHHH
Q psy12432 86 ETASIASGASIAVF-LVVILCFSLYVHC 112 (229)
Q Consensus 86 ~~~~Ia~gI~V~lL-LVlii~vvly~~~ 112 (229)
.+++|.++++|+|- ||.|++||+.+|-
T Consensus 368 aIaGIsvavvvvVgglvGfLcWwf~crg 395 (397)
T PF03302_consen 368 AIAGISVAVVVVVGGLVGFLCWWFICRG 395 (397)
T ss_pred ceeeeeehhHHHHHHHHHHHhhheeecc
Confidence 56778777666655 7788888876643
No 104
>PF05083 LST1: LST-1 protein; InterPro: IPR007775 B144/LST1 is a gene encoded in the human major histocompatibility complex that produces multiple forms of alternatively spliced mRNA and encodes peptides fewer than 100 amino acids in length. B144/LST1 is strongly expressed in dendritic cells. Transfection of B144/LST1 into a variety of cells induces morphologic changes including the production of long, thin filopodia []. A possible role in modulating immune responses. Induces morphological changes including production of filopodia and microspikes when overexpressed in a variety of cell types and may be involved in dendritic cell maturation. Isoform 1 and isoform 2 have an inhibitory effect on lymphocyte proliferation [, ]. ; GO: 0000902 cell morphogenesis, 0006955 immune response, 0016020 membrane
Probab=56.90 E-value=11 Score=27.79 Aligned_cols=9 Identities=33% Similarity=0.265 Sum_probs=3.7
Q ss_pred HHHHHHHhh
Q psy12432 108 LYVHCQRRK 116 (229)
Q Consensus 108 ly~~~kRrk 116 (229)
+|+..+|.+
T Consensus 16 lC~lsrRvk 24 (74)
T PF05083_consen 16 LCRLSRRVK 24 (74)
T ss_pred HHHHHhhhh
Confidence 444444433
No 105
>PF11884 DUF3404: Domain of unknown function (DUF3404); InterPro: IPR021821 This domain is functionally uncharacterised. This domain is found in bacteria. This presumed domain is about 260 amino acids in length. This domain is found associated with PF02518 from PFAM, PF00512 from PFAM.
Probab=56.90 E-value=22 Score=32.28 Aligned_cols=10 Identities=40% Similarity=0.548 Sum_probs=3.8
Q ss_pred HHHHHHHHHH
Q psy12432 105 CFSLYVHCQR 114 (229)
Q Consensus 105 ~vvly~~~kR 114 (229)
.+.+|-+.+|
T Consensus 248 gw~~y~~~~k 257 (262)
T PF11884_consen 248 GWSLYRWNQK 257 (262)
T ss_pred HHHHHHHHHH
Confidence 3344433333
No 106
>PF10873 DUF2668: Protein of unknown function (DUF2668); InterPro: IPR022640 Members in this family of proteins are annotated as cysteine and tyrosine-rich protein 1, however currently no function is known [].
Probab=56.56 E-value=25 Score=29.42 Aligned_cols=27 Identities=4% Similarity=0.143 Sum_probs=16.5
Q ss_pred HHHhhhHHHHHHHHHHHHHHHHHHHHH
Q psy12432 88 ASIASGASIAVFLVVILCFSLYVHCQR 114 (229)
Q Consensus 88 ~~Ia~gI~V~lLLVlii~vvly~~~kR 114 (229)
..++.||+++++++++++..+.+....
T Consensus 60 gtAIaGIVfgiVfimgvva~i~icvCm 86 (155)
T PF10873_consen 60 GTAIAGIVFGIVFIMGVVAGIAICVCM 86 (155)
T ss_pred cceeeeeehhhHHHHHHHHHHHHHHhh
Confidence 556677877776666666555444443
No 107
>PF04478 Mid2: Mid2 like cell wall stress sensor; InterPro: IPR007567 This family represents a region near the C terminus of Mid2, which contains a transmembrane region. The remainder of the protein sequence is serine-rich and of low complexity, and is therefore impossible to align accurately. Mid2 is thought to act as a mechanosensor of cell wall stress. The C-terminal cytoplasmic region of Mid2 is known to interact with Rom2, a guanine nucleotide exchange factor (GEF) for Rho1, which is part of the cell wall integrity signalling pathway [].
Probab=56.44 E-value=0.96 Score=37.90 Aligned_cols=31 Identities=19% Similarity=0.027 Sum_probs=16.7
Q ss_pred HhhhHHHHHHHHHHHHHHHHHHHHHhhhhhhh
Q psy12432 90 IASGASIAVFLVVILCFSLYVHCQRRKKQAQA 121 (229)
Q Consensus 90 Ia~gI~V~lLLVlii~vvly~~~kRrk~~k~~ 121 (229)
|++|++|+| -+.|+++++.+.+.-.+++|+.
T Consensus 50 IVIGvVVGV-Gg~ill~il~lvf~~c~r~kkt 80 (154)
T PF04478_consen 50 IVIGVVVGV-GGPILLGILALVFIFCIRRKKT 80 (154)
T ss_pred EEEEEEecc-cHHHHHHHHHhheeEEEecccC
Confidence 355555553 3355555555555555555554
No 108
>PF12259 DUF3609: Protein of unknown function (DUF3609); InterPro: IPR022048 This domain family is found in eukaryotes and viruses, and is typically between 348 and 360 amino acids in length.
Probab=56.41 E-value=8.1 Score=36.50 Aligned_cols=17 Identities=35% Similarity=0.469 Sum_probs=7.3
Q ss_pred HHHHHHHHHHHHHHHhh
Q psy12432 100 LVVILCFSLYVHCQRRK 116 (229)
Q Consensus 100 LVlii~vvly~~~kRrk 116 (229)
+|++.++|+|.++.||+
T Consensus 311 ~vl~~~~~~~~~~~~~~ 327 (361)
T PF12259_consen 311 IVLISLAWLYRTFRRRQ 327 (361)
T ss_pred HHHHHHHhheeehHHHH
Confidence 33444444544444433
No 109
>PF15099 PIRT: Phosphoinositide-interacting protein family
Probab=55.24 E-value=11 Score=30.65 Aligned_cols=29 Identities=10% Similarity=0.353 Sum_probs=17.9
Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHhhhhh
Q psy12432 91 ASGASIAVFLVVILCFSLYVHCQRRKKQA 119 (229)
Q Consensus 91 a~gI~V~lLLVlii~vvly~~~kRrk~~k 119 (229)
...+++.+|+...+||..-.++||+|+++
T Consensus 87 lLs~GLmlL~~~alcW~~~~rkK~~kr~e 115 (129)
T PF15099_consen 87 LLSLGLMLLACSALCWKPIIRKKKKKRRE 115 (129)
T ss_pred HHHHHHHHHHhhhheehhhhHhHHHHhhh
Confidence 34455666666667777766666555555
No 110
>PF15330 SIT: SHP2-interacting transmembrane adaptor protein, SIT
Probab=54.76 E-value=19 Score=28.26 Aligned_cols=21 Identities=19% Similarity=0.362 Sum_probs=11.5
Q ss_pred HHHHHHHHHHHHHHHhhhhhh
Q psy12432 100 LVVILCFSLYVHCQRRKKQAQ 120 (229)
Q Consensus 100 LVlii~vvly~~~kRrk~~k~ 120 (229)
|+|++.+++-+...|.++++.
T Consensus 9 llLll~l~asl~~wr~~~rq~ 29 (107)
T PF15330_consen 9 LLLLLSLAASLLAWRMKQRQK 29 (107)
T ss_pred HHHHHHHHHHHHHHHHHhhhc
Confidence 334444556666666666553
No 111
>PRK10905 cell division protein DamX; Validated
Probab=53.39 E-value=8.3 Score=36.01 Aligned_cols=13 Identities=38% Similarity=0.710 Sum_probs=5.2
Q ss_pred hHHHHHHHHHHHH
Q psy12432 93 GASIAVFLVVILC 105 (229)
Q Consensus 93 gI~V~lLLVlii~ 105 (229)
||+|+|||||||.
T Consensus 3 GiGilVLlLLIig 15 (328)
T PRK10905 3 GVGILVLLLLIIG 15 (328)
T ss_pred chhHHHHHHHHHH
Confidence 3334444434433
No 112
>PF00053 Laminin_EGF: Laminin EGF-like (Domains III and V); InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and three short globular arms. The long arm consist of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below. +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=53.15 E-value=9.7 Score=24.98 Aligned_cols=23 Identities=35% Similarity=0.733 Sum_probs=18.0
Q ss_pred CEEEeeccCCCCceEeecCCCCCCCCCcc
Q psy12432 41 ATCFTVKIGESLLYNCECADGYMGQRCEF 69 (229)
Q Consensus 41 GtC~~~~~~~~~~~sC~C~~Gy~G~rCe~ 69 (229)
..|... ...|.|..+|.|.+|+.
T Consensus 11 ~~C~~~------~G~C~C~~~~~G~~C~~ 33 (49)
T PF00053_consen 11 QTCDPS------TGQCVCKPGTTGPRCDQ 33 (49)
T ss_dssp SSEEET------CEEESBSTTEESTTS-E
T ss_pred CcccCC------CCEEeccccccCCcCcC
Confidence 467653 47999999999999996
No 113
>PHA03265 envelope glycoprotein D; Provisional
Probab=53.11 E-value=6.9 Score=37.10 Aligned_cols=39 Identities=15% Similarity=0.024 Sum_probs=27.9
Q ss_pred hhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHhhhhhhh
Q psy12432 83 VMLETASIASGASIAVFLVVILCFSLYVHCQRRKKQAQA 121 (229)
Q Consensus 83 ~~~~~~~Ia~gI~V~lLLVlii~vvly~~~kRrk~~k~~ 121 (229)
.+....+|+++.+++|++|+.|+.+++-++|..||+.+.
T Consensus 346 ~~~~~g~~ig~~i~glv~vg~il~~~~rr~k~~~k~~~~ 384 (402)
T PHA03265 346 NSTFVGISVGLGIAGLVLVGVILYVCLRRKKELKKSAQN 384 (402)
T ss_pred CCcccceEEccchhhhhhhhHHHHHHhhhhhhhhhhhhc
Confidence 343344455566677889999999999999877766543
No 114
>PF05399 EVI2A: Ectropic viral integration site 2A protein (EVI2A); InterPro: IPR008608 This family contains several mammalian ectropic viral integration site 2A (EVI2A) proteins. The function of this protein is unknown although it is thought to be a membrane protein and may function as an oncogene in retrovirus induced myeloid tumours [, ].; GO: 0016021 integral to membrane
Probab=52.80 E-value=31 Score=30.55 Aligned_cols=10 Identities=20% Similarity=0.537 Sum_probs=5.5
Q ss_pred CCCCCCcccC
Q psy12432 62 YMGQRCEFKD 71 (229)
Q Consensus 62 y~G~rCe~~~ 71 (229)
|.-+.||...
T Consensus 116 ~kk~~CEen~ 125 (227)
T PF05399_consen 116 FKKEICEENN 125 (227)
T ss_pred cchhhhhcCc
Confidence 3345677664
No 115
>PF01102 Glycophorin_A: Glycophorin A; InterPro: IPR001195 Proteins in this group are responsible for the molecular basis of the blood group antigens, surface markers on the outside of the red blood cell membrane. Most of these markers are proteins, but some are carbohydrates attached to lipids or proteins [Reid M.E., Lomas-Francis C. The Blood Group Antigen FactsBook Academic Press, London / San Diego, (1997)]. Glycophorin A (PAS-2) and glycophorin B (PAS-3) belong to the MNS blood group system and are associated with antigens that include M/N, S/s, U, He, Mi(a), M(c), Vw, Mur, M(g), Vr, M(e), Mt(a), St(a), Ri(a), Cl(a), Ny(a), Hut, Hil, M(v), Far, Mit, Dantu, Hop, Nob, En(a), ENKT, amongst others. Glycophorin A is the major sialoglycoprotein of the erythrocyte membrane []. Structurally, glycophorin A consists of an N-terminal extracellular domain, heavily glycosylated on serine and threonine residues, followed by a transmembrane region and a C-terminal cytoplasmic domain. Other glycophorins in this entry such as Glycophorin B and Glycophorin E represent minor sialoglycoproteins in the erythrocyte membrane.; GO: 0016021 integral to membrane; PDB: 2KPF_B 1AFO_B 2KPE_A.
Probab=51.37 E-value=21 Score=28.80 Aligned_cols=29 Identities=14% Similarity=0.236 Sum_probs=18.2
Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHhhhhhhh
Q psy12432 92 SGASIAVFLVVILCFSLYVHCQRRKKQAQA 121 (229)
Q Consensus 92 ~gI~V~lLLVlii~vvly~~~kRrk~~k~~ 121 (229)
++|++++ +.+++++++++.+.-+|++|+.
T Consensus 67 ~~Ii~gv-~aGvIg~Illi~y~irR~~Kk~ 95 (122)
T PF01102_consen 67 IGIIFGV-MAGVIGIILLISYCIRRLRKKS 95 (122)
T ss_dssp HHHHHHH-HHHHHHHHHHHHHHHHHHS---
T ss_pred eehhHHH-HHHHHHHHHHHHHHHHHHhccC
Confidence 4444444 5677777788888888888875
No 116
>KOG1214|consensus
Probab=51.12 E-value=15 Score=38.74 Aligned_cols=41 Identities=27% Similarity=0.622 Sum_probs=28.4
Q ss_pred ceeeCCCCCC----------------CCCCCCC-------CEEEeeccCCCCceEeecCCCCCCC
Q psy12432 24 HTYACPPTYA----------------TWYCLNG-------ATCFTVKIGESLLYNCECADGYMGQ 65 (229)
Q Consensus 24 y~C~C~~gy~----------------gnpClNG-------GtC~~~~~~~~~~~sC~C~~Gy~G~ 65 (229)
|.|.|..+|. -|+|..| |.|..+..+ -+.|+|.|.+||.|.
T Consensus 757 ~rceC~~gy~F~dd~~tCV~i~~pap~n~Ce~g~h~C~i~g~a~c~~hG-gs~y~C~CLPGfsGD 820 (1289)
T KOG1214|consen 757 YRCECRSGYEFADDRHTCVLITPPAPANPCEDGSHTCAIAGQARCVHHG-GSTYSCACLPGFSGD 820 (1289)
T ss_pred eeEEEeecceeccCCcceEEecCCCCCCccccCccccCcCCceEEEecC-CceEEEeecCCccCC
Confidence 6888888886 1578766 444333212 236999999999997
No 117
>PF06247 Plasmod_Pvs28: Plasmodium ookinete surface protein Pvs28; InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=51.09 E-value=8.5 Score=33.45 Aligned_cols=42 Identities=26% Similarity=0.607 Sum_probs=28.4
Q ss_pred CceeeCCCCCCC-------------------CCCCCCCEEEeecc-CCCCceEeecCCCCCC
Q psy12432 23 FHTYACPPTYAT-------------------WYCLNGATCFTVKI-GESLLYNCECADGYMG 64 (229)
Q Consensus 23 gy~C~C~~gy~g-------------------npClNGGtC~~~~~-~~~~~~sC~C~~Gy~G 64 (229)
.+.|.|++||.. .+|-+-+.|..... +.+..|.|.|..||.-
T Consensus 19 HfEC~Cnegfvl~~EntCE~kv~C~~~e~~~K~Cgdya~C~~~~~~~~~~~~~C~C~~gY~~ 80 (197)
T PF06247_consen 19 HFECKCNEGFVLKNENTCEEKVECDKLENVNKPCGDYAKCINQANKGEERAYKCDCINGYIL 80 (197)
T ss_dssp EEEEEESTTEEEEETTEEEE----SG-GGTTSEEETTEEEEE-SSTTSSTSEEEEE-TTEEE
T ss_pred ceEEEcCCCcEEccccccccceecCcccccCccccchhhhhcCCCcccceeEEEecccCcee
Confidence 357999998862 36777788887652 2245799999999754
No 118
>TIGR00847 ccoS cytochrome oxidase maturation protein, cbb3-type. CcoS from Rhodobacter capsulatus has been shown essential for incorporation of redox-active prosthetic groups (heme, Cu) into cytochrome cbb(3) oxidase. FixS of Bradyrhizobium japonicum appears to have the same function. Members of this family are found so far in organisms with a cbb3-type cytochrome oxidase, including Neisseria meningitidis, Helicobacter pylori, Campylobacter jejuni, Caulobacter crescentus, Bradyrhizobium japonicum, and Rhodobacter capsulatus.
Probab=50.54 E-value=22 Score=24.43 Aligned_cols=21 Identities=10% Similarity=0.055 Sum_probs=10.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHH
Q psy12432 94 ASIAVFLVVILCFSLYVHCQR 114 (229)
Q Consensus 94 I~V~lLLVlii~vvly~~~kR 114 (229)
|.+.++++++.+++++|-.|+
T Consensus 8 IpiSl~l~~~~l~~f~Wavk~ 28 (51)
T TIGR00847 8 IPISLLLGGVGLVAFLWSLKS 28 (51)
T ss_pred HHHHHHHHHHHHHHHHHHHcc
Confidence 344555555555555554444
No 119
>PTZ00045 apical membrane antigen 1; Provisional
Probab=50.41 E-value=48 Score=33.40 Aligned_cols=29 Identities=24% Similarity=0.440 Sum_probs=16.0
Q ss_pred CceeeCCCCCCCCCCCCCCEEEeeccCCCCceEeecCCCCCC
Q psy12432 23 FHTYACPPTYATWYCLNGATCFTVKIGESLLYNCECADGYMG 64 (229)
Q Consensus 23 gy~C~C~~gy~gnpClNGGtC~~~~~~~~~~~sC~C~~Gy~G 64 (229)
..+|.|.+... -..+|. .|.|.|...--|
T Consensus 459 ~~~c~~t~~~~-----~~~~C~--------~yvc~~~~~sc~ 487 (595)
T PTZ00045 459 SPFCPATPLKV-----SNSTCS--------FYVCERVEKSCG 487 (595)
T ss_pred cccCccccccc-----cCcccc--------eeEeeeeecccc
Confidence 45777766432 234453 477777765433
No 120
>PF07423 DUF1510: Protein of unknown function (DUF1510); InterPro: IPR009988 This family consists of several hypothetical bacterial proteins of around 200 residues in length. The function of this family is unknown.
Probab=50.37 E-value=12 Score=33.06 Aligned_cols=12 Identities=25% Similarity=0.589 Sum_probs=4.6
Q ss_pred HHHHHHHHHHHH
Q psy12432 94 ASIAVFLVVILC 105 (229)
Q Consensus 94 I~V~lLLVlii~ 105 (229)
|+|++|||+|++
T Consensus 20 I~IV~lLIiiva 31 (217)
T PF07423_consen 20 IGIVSLLIIIVA 31 (217)
T ss_pred HHHHHHHHHHHh
Confidence 333333333333
No 121
>PRK00523 hypothetical protein; Provisional
Probab=50.19 E-value=32 Score=25.34 Aligned_cols=34 Identities=12% Similarity=0.041 Sum_probs=19.4
Q ss_pred hhhHHHHHH-HHHHHHHHHHHHHHHhhhhhhhccc
Q psy12432 91 ASGASIAVF-LVVILCFSLYVHCQRRKKQAQAASV 124 (229)
Q Consensus 91 a~gI~V~lL-LVlii~vvly~~~kRrk~~k~~~~~ 124 (229)
++++.+++. +.+++.+++=++..|+.-+|.+..|
T Consensus 3 ~~~l~I~l~i~~li~G~~~Gffiark~~~k~l~~N 37 (72)
T PRK00523 3 AIGLALGLGIPLLIVGGIIGYFVSKKMFKKQIREN 37 (72)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC
Confidence 455555544 3444445666667777777766433
No 122
>PF15176 LRR19-TM: Leucine-rich repeat family 19 TM domain
Probab=49.93 E-value=26 Score=27.47 Aligned_cols=23 Identities=4% Similarity=0.068 Sum_probs=9.1
Q ss_pred HHHHHhhhHHHHHHHHHHHHHHH
Q psy12432 86 ETASIASGASIAVFLVVILCFSL 108 (229)
Q Consensus 86 ~~~~Ia~gI~V~lLLVlii~vvl 108 (229)
.|..+++.++++|++-++|.+++
T Consensus 16 sW~~LVGVv~~al~~SlLIalaa 38 (102)
T PF15176_consen 16 SWPFLVGVVVTALVTSLLIALAA 38 (102)
T ss_pred ccHhHHHHHHHHHHHHHHHHHHH
Confidence 34444433333444433333333
No 123
>COG4499 Predicted membrane protein [Function unknown]
Probab=49.58 E-value=20 Score=34.53 Aligned_cols=29 Identities=24% Similarity=0.429 Sum_probs=17.3
Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHhhhhhh
Q psy12432 92 SGASIAVFLVVILCFSLYVHCQRRKKQAQ 120 (229)
Q Consensus 92 ~gI~V~lLLVlii~vvly~~~kRrk~~k~ 120 (229)
+||++.+++|++++++.|+++..+.++..
T Consensus 223 ~giGliillvl~li~~~Y~~f~~~p~qea 251 (434)
T COG4499 223 FGIGLIILLVLLLIYFTYYYFSNQPKQEA 251 (434)
T ss_pred HHHhHHHHHHHHHHHHHHHHHHcChhHHH
Confidence 45555566666666666666666555543
No 124
>PF15048 OSTbeta: Organic solute transporter subunit beta protein
Probab=48.63 E-value=16 Score=29.61 Aligned_cols=32 Identities=13% Similarity=0.176 Sum_probs=13.5
Q ss_pred HHHHhhhHHHHHHHHHHHHHHHHHHHHHhhhh
Q psy12432 87 TASIASGASIAVFLVVILCFSLYVHCQRRKKQ 118 (229)
Q Consensus 87 ~~~Ia~gI~V~lLLVlii~vvly~~~kRrk~~ 118 (229)
|-+.+.++.++++++.++++..-+...|+||.
T Consensus 35 WNysiL~Ls~vvlvi~~~LLgrsi~ANRnrK~ 66 (125)
T PF15048_consen 35 WNYSILALSFVVLVISFFLLGRSIQANRNRKM 66 (125)
T ss_pred cchHHHHHHHHHHHHHHHHHHHHhHhcccccc
Confidence 43334444444444444444444444444333
No 125
>PRK09458 pspB phage shock protein B; Provisional
Probab=48.17 E-value=18 Score=26.83 Aligned_cols=17 Identities=18% Similarity=0.530 Sum_probs=6.7
Q ss_pred HHHHHHHHHHHHHHHHH
Q psy12432 98 VFLVVILCFSLYVHCQR 114 (229)
Q Consensus 98 lLLVlii~vvly~~~kR 114 (229)
+|++++.-+|++.+++.
T Consensus 12 iF~ifVaPiWL~LHY~s 28 (75)
T PRK09458 12 IFVLFVAPIWLWLHYRS 28 (75)
T ss_pred HHHHHHHHHHHHHhhcc
Confidence 33333333444444443
No 126
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=47.26 E-value=15 Score=24.32 Aligned_cols=17 Identities=35% Similarity=0.835 Sum_probs=14.8
Q ss_pred ceEeecCCCCCCCCCcc
Q psy12432 53 LYNCECADGYMGQRCEF 69 (229)
Q Consensus 53 ~~sC~C~~Gy~G~rCe~ 69 (229)
...|.|..+|.|.+|+.
T Consensus 18 ~G~C~C~~~~~G~~C~~ 34 (50)
T cd00055 18 TGQCECKPNTTGRRCDR 34 (50)
T ss_pred CCEEeCCCcCCCCCCCC
Confidence 36899999999999985
No 127
>PRK14748 kdpF potassium-transporting ATPase subunit F; Provisional
Probab=46.56 E-value=39 Score=20.60 Aligned_cols=14 Identities=21% Similarity=0.384 Sum_probs=6.4
Q ss_pred hhhHHHHHHHHHHH
Q psy12432 91 ASGASIAVFLVVIL 104 (229)
Q Consensus 91 a~gI~V~lLLVlii 104 (229)
++|++++++|++-+
T Consensus 6 i~G~ilv~lLlgYL 19 (29)
T PRK14748 6 ITGVLLVFLLLGYL 19 (29)
T ss_pred HHHHHHHHHHHHHH
Confidence 44555444444433
No 128
>PF02439 Adeno_E3_CR2: Adenovirus E3 region protein CR2; InterPro: IPR003470 Early region 3 (E3) of human adenoviruses (Ads) codes for proteins that appear to control viral interactions with the host []. This region called CR1 (conserved region 1) [] is found three times in Human adenovirus 19 (a subgroup D adenovirus) 49 kDa protein in the E3 region. CR1 is also found in the 20.1 Kd protein of subgroup B adenoviruses. The function of this 80 amino acid region is unknown. This region is probably a divergent immunoglobulin domain.
Probab=46.29 E-value=42 Score=21.81 Aligned_cols=16 Identities=13% Similarity=0.113 Sum_probs=8.0
Q ss_pred hhHHHHHHHHHHHHHH
Q psy12432 92 SGASIAVFLVVILCFS 107 (229)
Q Consensus 92 ~gI~V~lLLVlii~vv 107 (229)
++|++++++.++++++
T Consensus 6 IaIIv~V~vg~~iiii 21 (38)
T PF02439_consen 6 IAIIVAVVVGMAIIII 21 (38)
T ss_pred hhHHHHHHHHHHHHHH
Confidence 4555665554444433
No 129
>PF15347 PAG: Phosphoprotein associated with glycosphingolipid-enriched
Probab=46.29 E-value=26 Score=33.53 Aligned_cols=26 Identities=31% Similarity=0.572 Sum_probs=14.0
Q ss_pred HHHHHHHHHHHHHHHHHHHhhhhhhh
Q psy12432 96 IAVFLVVILCFSLYVHCQRRKKQAQA 121 (229)
Q Consensus 96 V~lLLVlii~vvly~~~kRrk~~k~~ 121 (229)
|+.||||.+++++|--+.|.||.++.
T Consensus 24 v~~f~lis~LifLCsSC~reKK~~~~ 49 (428)
T PF15347_consen 24 VTTFLLISFLIFLCSSCDREKKPKQQ 49 (428)
T ss_pred HHHHHHHHHHHHHhhcccccccCCCC
Confidence 33444444445555556677766643
No 130
>PF02699 YajC: Preprotein translocase subunit; InterPro: IPR003849 Secretion across the inner membrane in some Gram-negative bacteria occurs via the preprotein translocase pathway. Proteins are produced in the cytoplasm as precursors, and require a chaperone subunit to direct them to the translocase component []. From there, the mature proteins are either targeted to the outer membrane, or remain as periplasmic proteins []. The translocase protein subunits are encoded on the bacterial chromosome. The translocase itself comprises 7 proteins, including a chaperone (SecB), ATPase (SecA), an integral membrane complex (SecY, SecE and SecG), and two additional membrane proteins that promote the release of the mature peptide into the periplasm (SecD and SecF) []. Other cytoplasmic/periplasmic proteins play a part in preprotein translocase activity, namely YidC and YajC []. The latter is bound in a complex to SecD and SecF, and plays a part in stabilising and regulating secretion through the SecYEG integral membrane component via SecA []. Homologues of the YajC gene have been found in a range of pathogenic and commensal microbes. Brucella abortis YajC- and SecD-like proteins were shown to stimulate a Th1 cell-mediated immune response in mice, and conferred protection when challenged with B.abortis []. Therefore, these proteins may have an antigenic role as well as a secretory one in virulent bacteria []. A number of previously uncharacterised "hypothetical" proteins also show similarity to E.coli YajC, suggesting that this family is wider than first thought []. More recently, the precise interactions between the E.coli SecYEG complex, SecD, SecF, YajC and YidC have been studied []. Rather than acting individually, the four proteins form a heterotetrameric complex and associate with the SecYEG heterotrimeric complex []. The SecF and YajC subunits link the complex to the integral membrane translocase. ; PDB: 2RDD_B.
Probab=46.21 E-value=40 Score=24.94 Aligned_cols=10 Identities=10% Similarity=0.272 Sum_probs=6.2
Q ss_pred HHHhhhhhhh
Q psy12432 112 CQRRKKQAQA 121 (229)
Q Consensus 112 ~kRrk~~k~~ 121 (229)
+||+|+++.+
T Consensus 24 kk~~k~~~~m 33 (82)
T PF02699_consen 24 KKQQKEHQEM 33 (82)
T ss_dssp HHHHHHHTTG
T ss_pred HHHHHHHHHH
Confidence 5566677654
No 131
>COG3763 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=45.84 E-value=57 Score=23.97 Aligned_cols=14 Identities=14% Similarity=-0.102 Sum_probs=5.7
Q ss_pred hhHHHHHHHHHHHH
Q psy12432 92 SGASIAVFLVVILC 105 (229)
Q Consensus 92 ~gI~V~lLLVlii~ 105 (229)
+++++++|+-+++.
T Consensus 8 l~ivl~ll~G~~~G 21 (71)
T COG3763 8 LLIVLALLAGLIGG 21 (71)
T ss_pred HHHHHHHHHHHHHH
Confidence 33444444433333
No 132
>PF05808 Podoplanin: Podoplanin; InterPro: IPR008783 This family consists of several mammalian podoplanin-like proteins which are thought to control specifically the unique shape of podocytes [].; GO: 0016021 integral to membrane; PDB: 3IET_X.
Probab=45.67 E-value=6.9 Score=33.11 Aligned_cols=12 Identities=25% Similarity=0.290 Sum_probs=0.0
Q ss_pred HHHhhhHHHHHH
Q psy12432 88 ASIASGASIAVF 99 (229)
Q Consensus 88 ~~Ia~gI~V~lL 99 (229)
++|++|+++++-
T Consensus 132 VGIIVGVLlaIG 143 (162)
T PF05808_consen 132 VGIIVGVLLAIG 143 (162)
T ss_dssp ------------
T ss_pred eeehhhHHHHHH
Confidence 344444444433
No 133
>PTZ00208 65 kDa invariant surface glycoprotein; Provisional
Probab=45.65 E-value=12 Score=35.97 Aligned_cols=31 Identities=19% Similarity=0.471 Sum_probs=16.1
Q ss_pred HHHHHhhhHHHHHHHHHHHHHHHHHHHHHhh
Q psy12432 86 ETASIASGASIAVFLVVILCFSLYVHCQRRK 116 (229)
Q Consensus 86 ~~~~Ia~gI~V~lLLVlii~vvly~~~kRrk 116 (229)
++.+|++.+.|-+++|.|+.++++++.+|||
T Consensus 384 ~~~~i~~avl~p~~il~~~~~~~~~~v~rrr 414 (436)
T PTZ00208 384 RTAMIILAVLVPAIILAIIAVAFFIMVKRRR 414 (436)
T ss_pred hhHHHHHHHHHHHHHHHHHHHHhheeeeecc
Confidence 3455666555555555555544545444444
No 134
>PF05510 Sarcoglycan_2: Sarcoglycan alpha/epsilon; InterPro: IPR008908 Sarcoglycans are a subcomplex of transmembrane proteins which are part of the dystrophin-glycoprotein complex. They are expressed in the skeletal, cardiac and smooth muscle. Although numerous studies have been conducted on the sarcoglycan subcomplex in skeletal and cardiac muscle, the manner of the distribution and localisation of these proteins along the nonjunctional sarcolemma is not clear []. This family contains alpha and epsilon members.; GO: 0016012 sarcoglycan complex
Probab=45.45 E-value=20 Score=34.38 Aligned_cols=25 Identities=20% Similarity=0.313 Sum_probs=11.0
Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHh
Q psy12432 91 ASGASIAVFLVVILCFSLYVHCQRR 115 (229)
Q Consensus 91 a~gI~V~lLLVlii~vvly~~~kRr 115 (229)
++..+|++||++++..++|+++...
T Consensus 290 ~iPl~i~llL~llLs~Imc~rREG~ 314 (386)
T PF05510_consen 290 AIPLIIALLLLLLLSYIMCCRREGV 314 (386)
T ss_pred HHHHHHHHHHHHHHHHHheechHHh
Confidence 3333444444444444454444443
No 135
>KOG4260|consensus
Probab=45.13 E-value=18 Score=33.48 Aligned_cols=43 Identities=23% Similarity=0.620 Sum_probs=29.6
Q ss_pred eCCCCCCCCCCCCCCEEEeeccCCCCceEeecCCCCCCCCCcccC
Q psy12432 27 ACPPTYATWYCLNGATCFTVKIGESLLYNCECADGYMGQRCEFKD 71 (229)
Q Consensus 27 ~C~~gy~gnpClNGGtC~~~~~~~~~~~sC~C~~Gy~G~rCe~~~ 71 (229)
.|+.|- ..+|...|.|.-.. .-.+.-.|+|..||.|..|..=.
T Consensus 143 ~Cpggs-er~C~GnG~C~GdG-sR~GsGkCkC~~GY~Gp~C~~Cg 185 (350)
T KOG4260|consen 143 QCPGGS-ERPCFGNGSCHGDG-SREGSGKCKCETGYTGPLCRYCG 185 (350)
T ss_pred cCCCCC-cCCcCCCCcccCCC-CCCCCCcccccCCCCCccccccc
Confidence 344443 25888888887532 11346899999999999988653
No 136
>PF06305 DUF1049: Protein of unknown function (DUF1049); InterPro: IPR010445 This entry consists of several hypothetical bacterial proteins of unknown function.
Probab=44.40 E-value=70 Score=22.01 Aligned_cols=7 Identities=29% Similarity=0.264 Sum_probs=2.5
Q ss_pred hhHHHHH
Q psy12432 92 SGASIAV 98 (229)
Q Consensus 92 ~gI~V~l 98 (229)
+|++++.
T Consensus 30 ~G~llg~ 36 (68)
T PF06305_consen 30 LGALLGW 36 (68)
T ss_pred HHHHHHH
Confidence 3333333
No 137
>PHA02681 ORF089 virion membrane protein; Provisional
Probab=43.98 E-value=37 Score=25.85 Aligned_cols=18 Identities=22% Similarity=0.663 Sum_probs=11.9
Q ss_pred HHHHHHHHHHHHHHHHHh
Q psy12432 98 VFLVVILCFSLYVHCQRR 115 (229)
Q Consensus 98 lLLVlii~vvly~~~kRr 115 (229)
+++|.|+|.++|..++|.
T Consensus 10 ~V~V~IVclliya~YRR~ 27 (92)
T PHA02681 10 VIVISIVCYIVIMMYRRS 27 (92)
T ss_pred HHHHHHHHHHHHHHHHhc
Confidence 445666677777777765
No 138
>PF05283 MGC-24: Multi-glycosylated core protein 24 (MGC-24); InterPro: IPR007947 CD164 is a mucin-like receptor, or sialomucin, with specificity in receptor/ ligand interactions that depends on the structural characteristics of the mucin-like receptor. Its functions include mediating, or regulating, haematopoietic progenitor cell adhesion and the negative regulation of their growth and/or-differentiation. It exists in the native state as a disulphide- linked homodimer of two 80-85kDa subunits. It is usually expressed by CD34+ and CD341o/- haematopoietic stem cells and associated microenvironmental cells. It contains, in its extracellular region, two mucin domains (I and II) linked by a non-mucin domain, which has been predicted to contain intra- disulphide bridges. This receptor may play a key role in haematopoiesis by facilitating the adhesion of human CD34+ cells to bone marrow stroma and by negatively regulating CD34+ CD341o/- haematopoietic progenitor cell proliferation. These effects involve the CD164 class I and/or II epitopes recognised by the monoclonal antibodies (mAbs) 105A5 and 103B2/9E10. These epitopes are carbohydrate-dependent and are located on the N-terminal mucin domain I [, ]. It has been found that murine MGC-24v and rat endolyn share significant sequence similarities with human CD164. However, CD164 lacks the consensus glycosaminoglycan (GAG)-attachment site found in MGC-24; it is possible that GAG-association is responsible for the high molecular weight of the epithelial-derived MGC-24 glycoprotein []. Genomic structure studies have placed CD164 within the mucin-subgroup that comprises multiple exons, and demonstrate the diverse chromosomal distribution of this family of molecules. Molecules with such multiple exons may have sophisticated regulatory mechanisms that involve not only post-translational modifications of the oligosaccharide side chains, but also differential exon usage. Although differences in the intron and exon sizes are seen between the mouse and human genes, the predicted proteins are similar in size and structure, maintaining functionally important motifs that regulate cell proliferation or subcellular distribution []. CD164 is a gene whose expression depends on differential usage of poly- adenylation sites within the 3'-UTR. The conserved distribution of the 3.2- and 1.2-kb CD164 transcripts between mouse and human suggests that (i) a mechanism may exist to regulate tissue-specific polyadenylation, and (ii) differences in polyadenylation are important for the expression and function of CD164 in different tissues. Two other aspects of the structure of CD164 are of particular interest. First, it shares one of several conserved features of a cytokine-binding pocket - in this respect, it is notable that evidence exists for a class of cell-surface sialomucin modulators that directly interact with growth factor receptors to regulate their response to physiological ligands. Second, its cytoplasmic tail contains a C-terminal YHTL motif found in many endocytic membrane proteins or receptors. These Tyr-based motifs bind to adaptor proteins, which mediate the sorting of membrane proteins into transport vesicles from the plasma membrane to the endosomes, and between intracellular compartments.
Probab=43.90 E-value=21 Score=30.88 Aligned_cols=29 Identities=24% Similarity=0.534 Sum_probs=15.1
Q ss_pred hhHHHHHhhhHHHHHHHHHHHHHHHHHHHH
Q psy12432 84 MLETASIASGASIAVFLVVILCFSLYVHCQ 113 (229)
Q Consensus 84 ~~~~~~Ia~gI~V~lLLVlii~vvly~~~k 113 (229)
.++.+..|+||++.|-| +.|++++|.++|
T Consensus 157 ~FD~~SFiGGIVL~LGv-~aI~ff~~KF~k 185 (186)
T PF05283_consen 157 TFDAASFIGGIVLTLGV-LAIIFFLYKFCK 185 (186)
T ss_pred CCchhhhhhHHHHHHHH-HHHHHHHhhhcc
Confidence 34556667777555443 333444544443
No 139
>PF14575 EphA2_TM: Ephrin type-A receptor 2 transmembrane domain; PDB: 3KUL_A 2XVD_A 2VX1_A 2VWV_A 2VX0_A 2VWY_A 2VWZ_A 2VWW_A 2VWU_A 2VWX_A ....
Probab=43.19 E-value=11 Score=27.62 Aligned_cols=28 Identities=21% Similarity=0.343 Sum_probs=8.9
Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHhhhh
Q psy12432 91 ASGASIAVFLVVILCFSLYVHCQRRKKQ 118 (229)
Q Consensus 91 a~gI~V~lLLVlii~vvly~~~kRrk~~ 118 (229)
++++++++++++++++++.+.++|+++.
T Consensus 3 i~~~~~g~~~ll~~v~~~~~~~rr~~~~ 30 (75)
T PF14575_consen 3 IASIIVGVLLLLVLVIIVIVCFRRCKYS 30 (75)
T ss_dssp HHHHHHHHHHHHHHHHHHHCCCTT----
T ss_pred EehHHHHHHHHHHhheeEEEEEeeEcCC
Confidence 3334444443333333333333333333
No 140
>PF05393 Hum_adeno_E3A: Human adenovirus early E3A glycoprotein; InterPro: IPR008652 This family consists of several early glycoproteins (E3A), from human adenovirus type 2.; GO: 0016021 integral to membrane
Probab=42.72 E-value=35 Score=26.24 Aligned_cols=28 Identities=25% Similarity=0.524 Sum_probs=17.0
Q ss_pred hHHHHHHH-HHHHHHHHHHHHHHhhhhhh
Q psy12432 93 GASIAVFL-VVILCFSLYVHCQRRKKQAQ 120 (229)
Q Consensus 93 gI~V~lLL-Vlii~vvly~~~kRrk~~k~ 120 (229)
|+-.+++. +.++++++|+.+-++||+.+
T Consensus 34 gm~~lvI~~iFil~VilwfvCC~kRkrsR 62 (94)
T PF05393_consen 34 GMWFLVICGIFILLVILWFVCCKKRKRSR 62 (94)
T ss_pred chhHHHHHHHHHHHHHHHHHHHHHhhhcc
Confidence 34445544 44555677777887777654
No 141
>KOG3653|consensus
Probab=42.67 E-value=1.2e+02 Score=30.31 Aligned_cols=6 Identities=17% Similarity=0.396 Sum_probs=3.1
Q ss_pred CCcccc
Q psy12432 181 SQLYHQ 186 (229)
Q Consensus 181 ~~~f~~ 186 (229)
+.+||.
T Consensus 286 Vt~fh~ 291 (534)
T KOG3653|consen 286 VTEFHP 291 (534)
T ss_pred Eeeecc
Confidence 445554
No 142
>PRK13855 type IV secretion system protein VirB10; Provisional
Probab=42.15 E-value=68 Score=30.67 Aligned_cols=9 Identities=0% Similarity=0.032 Sum_probs=3.4
Q ss_pred HHHHHHHHH
Q psy12432 100 LVVILCFSL 108 (229)
Q Consensus 100 LVlii~vvl 108 (229)
+|.++++++
T Consensus 39 ~~~~~~~~~ 47 (376)
T PRK13855 39 VLALSLSLI 47 (376)
T ss_pred HHHHHHHHH
Confidence 333333333
No 143
>KOG1226|consensus
Probab=42.14 E-value=25 Score=36.43 Aligned_cols=20 Identities=25% Similarity=0.609 Sum_probs=11.6
Q ss_pred eeeCCCC-CCCCCCCCCCEEE
Q psy12432 25 TYACPPT-YATWYCLNGATCF 44 (229)
Q Consensus 25 ~C~C~~g-y~gnpClNGGtC~ 44 (229)
.|+|.+. |.|..|..-.+|.
T Consensus 606 ~C~C~~~~~sG~~CE~cptc~ 626 (783)
T KOG1226|consen 606 RCKCTDPPYSGEFCEKCPTCP 626 (783)
T ss_pred ceEcCCCCcCcchhhcCCCCC
Confidence 5666666 6666665444443
No 144
>PF03597 CcoS: Cytochrome oxidase maturation protein cbb3-type; InterPro: IPR004714 Cytochrome cbb3 oxidases are found almost exclusively in Proteobacteria, and represent a distinctive class of proton-pumping respiratory haem-copper oxidases (HCO) that lack many of the key structural features that contribute to the reaction cycle of the intensely studied mitochondrial cytochrome c oxidase (CcO). Expression of cytochrome cbb3 oxidase allows human pathogens to colonise anoxic tissues and agronomically important diazotrophs to sustain nitrogen fixation []. Genes encoding a cytochrome cbb3 oxidase were initially designated fixNOQP (ccoNOQP), the ccoNOQP operon is always found close to a second gene cluster, known as fixGHIS (ccoGHIS) whose expression is necessary for the assembly of a functional cbb3 oxidase. On the basis of their derived amino acid sequences each of the four proteins encoded by the ccoGHIS operon are thought to be membrane-bound. It has been suggested that they may function in concert as a multi-subunit complex, possibly playing a role in the uptake and metabolism of copper required for the assembly of the binuclear centre of cytochrome cbb3 oxidase.
Probab=41.72 E-value=39 Score=22.54 Aligned_cols=20 Identities=10% Similarity=0.167 Sum_probs=9.1
Q ss_pred HHHHHHHHHHHHHHHHHHHH
Q psy12432 95 SIAVFLVVILCFSLYVHCQR 114 (229)
Q Consensus 95 ~V~lLLVlii~vvly~~~kR 114 (229)
.+.++++++.+++++|-.|+
T Consensus 8 p~sl~l~~~~l~~f~Wavk~ 27 (45)
T PF03597_consen 8 PVSLILGLIALAAFLWAVKS 27 (45)
T ss_pred HHHHHHHHHHHHHHHHHHcc
Confidence 34444444444444444443
No 145
>PF07010 Endomucin: Endomucin; InterPro: IPR010740 This family consists of several mammalian endomucin proteins. Endomucin is an early endothelial-specific antigen that is also expressed on putative hematopoietic progenitor cells.
Probab=40.83 E-value=35 Score=30.64 Aligned_cols=6 Identities=50% Similarity=0.960 Sum_probs=2.4
Q ss_pred HHHHHH
Q psy12432 108 LYVHCQ 113 (229)
Q Consensus 108 ly~~~k 113 (229)
+|..++
T Consensus 210 Lyr~C~ 215 (259)
T PF07010_consen 210 LYRMCW 215 (259)
T ss_pred HHHHhh
Confidence 444343
No 146
>PF11694 DUF3290: Protein of unknown function (DUF3290); InterPro: IPR021707 This family of proteins with unknown function appears to be restricted to Firmicutes.
Probab=40.79 E-value=33 Score=28.46 Aligned_cols=19 Identities=32% Similarity=0.572 Sum_probs=8.5
Q ss_pred HHHHHHHHHHHHHHHHHHH
Q psy12432 95 SIAVFLVVILCFSLYVHCQ 113 (229)
Q Consensus 95 ~V~lLLVlii~vvly~~~k 113 (229)
.+++++++++.++.|++.+
T Consensus 23 i~~ll~~l~~~~~~Y~r~r 41 (149)
T PF11694_consen 23 IIILLLVLIFFFIKYLRNR 41 (149)
T ss_pred HHHHHHHHHHHHHHHHHhc
Confidence 3344444444444555443
No 147
>PF04863 EGF_alliinase: Alliinase EGF-like domain; InterPro: IPR006947 Allicin is a thiosulphinate that gives rise to dithiines, allyl sulphides and ajoenes, the three groups of active compounds in Allium species. Allicin is synthesised from sulphoxide cysteine derivatives by alliinase, whose C-S lyase activity cleaves C(beta)-S(gamma) bonds. It is thought that this enzyme forms part of a primitive plant defence system [].; GO: 0016846 carbon-sulfur lyase activity; PDB: 1LK9_B 2HOX_C 2HOR_A.
Probab=40.76 E-value=10 Score=26.55 Aligned_cols=36 Identities=22% Similarity=0.324 Sum_probs=17.9
Q ss_pred CCCCCCEEEeeccCCCCceEeecCCCCCCCCCcccC
Q psy12432 36 YCLNGATCFTVKIGESLLYNCECADGYMGQRCEFKD 71 (229)
Q Consensus 36 pClNGGtC~~~~~~~~~~~sC~C~~Gy~G~rCe~~~ 71 (229)
+|...|..+.......+...|.|..-|.|..|+...
T Consensus 18 ~CSGHGr~flDg~~~dG~p~CECn~Cy~GpdCS~~~ 53 (56)
T PF04863_consen 18 SCSGHGRAFLDGLIADGSPVCECNSCYGGPDCSTLI 53 (56)
T ss_dssp --TTSEE--TTS-EETTEE--EE-TTEESTTS-EE-
T ss_pred CcCCCCeeeeccccccCCccccccCCcCCCCcccCC
Confidence 566666666322111235999999999999999864
No 148
>PF04689 S1FA: DNA binding protein S1FA; InterPro: IPR006779 S1FA is an unusual small plant peptide of only 70 amino acids with a basic domain which contains a nuclear localization signal and a putative DNA binding helix. S1FA is highly conserved between dicotyledonous and monocotyledonous plants and may be a DNA-binding protein that specifically recognises the negative promoter element S1F [].; GO: 0003677 DNA binding, 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus
Probab=40.57 E-value=23 Score=25.65 Aligned_cols=14 Identities=29% Similarity=0.498 Sum_probs=6.6
Q ss_pred HHHHHHHHHHHHHH
Q psy12432 100 LVVILCFSLYVHCQ 113 (229)
Q Consensus 100 LVlii~vvly~~~k 113 (229)
+.++.-+++|.+.+
T Consensus 26 ~flvGnyvlY~Yaq 39 (69)
T PF04689_consen 26 VFLVGNYVLYVYAQ 39 (69)
T ss_pred HHHHHHHHHHHHHh
Confidence 33344455555544
No 149
>PF00558 Vpu: Vpu protein; InterPro: IPR008187 The Human immunodeficiency virus 1 (HIV-1) Vpu protein acts in the degradation of CD4 in the endoplasmic reticulum and in the enhancement of virion release from the plasma membrane of infected cells [].; GO: 0019076 release of virus from host; PDB: 2JPX_A 1PI8_A 2GOH_A 2GOF_A 1PI7_A 1PJE_A 1VPU_A 2K7Y_A.
Probab=40.42 E-value=37 Score=25.58 Aligned_cols=24 Identities=17% Similarity=0.326 Sum_probs=5.9
Q ss_pred HHHHHHHHHHHHHHHHHhhhhhhh
Q psy12432 98 VFLVVILCFSLYVHCQRRKKQAQA 121 (229)
Q Consensus 98 lLLVlii~vvly~~~kRrk~~k~~ 121 (229)
.++|-.+++..|.+.+|+|+..+.
T Consensus 19 aIvvW~iv~ieYrk~~rqrkId~l 42 (81)
T PF00558_consen 19 AIVVWTIVYIEYRKIKRQRKIDRL 42 (81)
T ss_dssp HHHHHHHH------------CHHH
T ss_pred HHHHHHHHHHHHHHHHHHHhHHHH
Confidence 444445555677777766666543
No 150
>PF09835 DUF2062: Uncharacterized protein conserved in bacteria (DUF2062); InterPro: IPR018639 This domain, found in various prokaryotic proteins, has no known function. It is found at the C-terminal of family 2 glycosyltransferase proteins, in addition to proteins of unknown function.
Probab=40.41 E-value=67 Score=25.90 Aligned_cols=19 Identities=21% Similarity=0.401 Sum_probs=7.4
Q ss_pred hhhHHHHHHHHHHHHHHHH
Q psy12432 91 ASGASIAVFLVVILCFSLY 109 (229)
Q Consensus 91 a~gI~V~lLLVlii~vvly 109 (229)
++++++++++.+++-++.+
T Consensus 122 ~G~~i~~~v~~~i~Y~l~~ 140 (154)
T PF09835_consen 122 LGSLILGIVLGIISYFLVY 140 (154)
T ss_pred HHHHHHHHHHHHHHHHHHH
Confidence 3344444443333333333
No 151
>KOG1024|consensus
Probab=40.41 E-value=20 Score=35.03 Aligned_cols=29 Identities=28% Similarity=0.402 Sum_probs=20.3
Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHhhhhh
Q psy12432 91 ASGASIAVFLVVILCFSLYVHCQRRKKQA 119 (229)
Q Consensus 91 a~gI~V~lLLVlii~vvly~~~kRrk~~k 119 (229)
++|++|++.+|.+++.++||.+++.+|+.
T Consensus 188 ~V~v~va~a~v~lva~il~c~~~~s~Kr~ 216 (563)
T KOG1024|consen 188 IVGVIVALALVSLVALILYCAKGPSKKRS 216 (563)
T ss_pred eeeHHHHHHHHHHHHHHHHHHhcccchhh
Confidence 45666777677777777888877776664
No 152
>PRK01658 holin-like protein; Validated
Probab=40.40 E-value=59 Score=26.03 Aligned_cols=22 Identities=27% Similarity=0.238 Sum_probs=10.1
Q ss_pred HHHHhhhHHHHHHHHHHHHHHH
Q psy12432 87 TASIASGASIAVFLVVILCFSL 108 (229)
Q Consensus 87 ~~~Ia~gI~V~lLLVlii~vvl 108 (229)
.+.|++.+.+..++++++..+.
T Consensus 88 ~~~il~~ivvsT~l~l~vtg~~ 109 (122)
T PRK01658 88 GISLFLVVVISTFVVMIVTGYL 109 (122)
T ss_pred HHHHHHHHHHHHHHHHHHHHHH
Confidence 3444444444444444444443
No 153
>KOG4482|consensus
Probab=39.92 E-value=31 Score=33.14 Aligned_cols=28 Identities=25% Similarity=0.412 Sum_probs=14.8
Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHhhhh
Q psy12432 91 ASGASIAVFLVVILCFSLYVHCQRRKKQ 118 (229)
Q Consensus 91 a~gI~V~lLLVlii~vvly~~~kRrk~~ 118 (229)
++...|++||+++++.++|+++.+++++
T Consensus 302 aIpl~Valll~~~La~imc~rrEg~~~r 329 (449)
T KOG4482|consen 302 AIPLGVALLLVLALAYIMCCRREGQKKR 329 (449)
T ss_pred HHHHHHHHHHHHHHHHHHhhhhhccccc
Confidence 3444455555555555565555544444
No 154
>TIGR01195 oadG_fam sodium pump decarboxylases, gamma subunit. Most sequences scoring between the noise and trusted cutoffs are eukaryotic sodium channel proteins.
Probab=39.87 E-value=67 Score=23.89 Aligned_cols=10 Identities=20% Similarity=0.418 Sum_probs=4.4
Q ss_pred hhhHHHHHHH
Q psy12432 91 ASGASIAVFL 100 (229)
Q Consensus 91 a~gI~V~lLL 100 (229)
++|+++++++
T Consensus 12 v~GM~~VF~f 21 (82)
T TIGR01195 12 VLGMGIVFLF 21 (82)
T ss_pred HHHHHHHHHH
Confidence 4444444333
No 155
>PF07271 Cytadhesin_P30: Cytadhesin P30/P32; InterPro: IPR009896 This family consists of several Mycoplasma species specific Cytadhesin P32 and P30 proteins. P30 has been found to be membrane associated and localised on the tip organelle. It is thought that it is important in cytadherence and virulence [].; GO: 0007157 heterophilic cell-cell adhesion, 0009405 pathogenesis, 0016021 integral to membrane
Probab=39.31 E-value=15 Score=33.57 Aligned_cols=9 Identities=22% Similarity=0.479 Sum_probs=3.6
Q ss_pred CCccCCCCc
Q psy12432 140 PFERRPSPA 148 (229)
Q Consensus 140 ~~~~~~~p~ 148 (229)
++...++|.
T Consensus 148 ~v~~~p~p~ 156 (279)
T PF07271_consen 148 QVPVNPQPQ 156 (279)
T ss_pred CCCCCCCCc
Confidence 344444443
No 156
>PF14991 MLANA: Protein melan-A; PDB: 2GTZ_F 2GT9_F 3MRO_P 2GUO_C 3MRQ_P 2GTW_C 3L6F_C 3MRP_P.
Probab=39.16 E-value=6 Score=31.68 Aligned_cols=33 Identities=24% Similarity=0.144 Sum_probs=1.8
Q ss_pred HHHHHhhhHHHHHHHHHHHHHHHHHHHHHhhhhh
Q psy12432 86 ETASIASGASIAVFLVVILCFSLYVHCQRRKKQA 119 (229)
Q Consensus 86 ~~~~Ia~gI~V~lLLVlii~vvly~~~kRrk~~k 119 (229)
|.++|.+-++|+.+|++|.| |+|-++.-.|..+
T Consensus 25 EAaGIGiL~VILgiLLliGC-WYckRRSGYk~L~ 57 (118)
T PF14991_consen 25 EAAGIGILIVILGILLLIGC-WYCKRRSGYKTLR 57 (118)
T ss_dssp ---SSS----------------------------
T ss_pred HhccceeHHHHHHHHHHHhh-eeeeecchhhhhh
Confidence 45566554544444445554 7889887666554
No 157
>PF10883 DUF2681: Protein of unknown function (DUF2681); InterPro: IPR020274 This entry contains membrane proteins with no known function.
Probab=39.06 E-value=52 Score=25.07 Aligned_cols=8 Identities=13% Similarity=0.351 Sum_probs=3.4
Q ss_pred HHHHHHHH
Q psy12432 107 SLYVHCQR 114 (229)
Q Consensus 107 vly~~~kR 114 (229)
.++|+.+|
T Consensus 20 y~~~k~~k 27 (87)
T PF10883_consen 20 YLWWKVKK 27 (87)
T ss_pred HHHHHHHH
Confidence 44444443
No 158
>PRK01844 hypothetical protein; Provisional
Probab=38.87 E-value=79 Score=23.31 Aligned_cols=25 Identities=8% Similarity=0.014 Sum_probs=15.4
Q ss_pred HHHHHHHHHHHHHHHhhhhhhhccc
Q psy12432 100 LVVILCFSLYVHCQRRKKQAQAASV 124 (229)
Q Consensus 100 LVlii~vvly~~~kRrk~~k~~~~~ 124 (229)
|.+++.+++=++..|+.-+|.+..|
T Consensus 12 ~~li~G~~~Gff~ark~~~k~lk~N 36 (72)
T PRK01844 12 VALVAGVALGFFIARKYMMNYLQKN 36 (72)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHC
Confidence 5556666666666677666665443
No 159
>KOG3653|consensus
Probab=38.18 E-value=1e+02 Score=30.67 Aligned_cols=9 Identities=44% Similarity=1.106 Sum_probs=5.1
Q ss_pred ceEeecCCC
Q psy12432 53 LYNCECADG 61 (229)
Q Consensus 53 ~~sC~C~~G 61 (229)
.|.|-|..+
T Consensus 115 ~~~CcCs~~ 123 (534)
T KOG3653|consen 115 LYFCCCSTD 123 (534)
T ss_pred EEEEecCCC
Confidence 466666443
No 160
>PF11157 DUF2937: Protein of unknown function (DUF2937); InterPro: IPR022584 This family of proteins with unknown function appears to be found mainly in Proteobacteria.
Probab=38.07 E-value=51 Score=27.74 Aligned_cols=17 Identities=24% Similarity=0.149 Sum_probs=9.1
Q ss_pred hHHHHHhhhHHHHHHHH
Q psy12432 85 LETASIASGASIAVFLV 101 (229)
Q Consensus 85 ~~~~~Ia~gI~V~lLLV 101 (229)
+...+|+.|++.++++.
T Consensus 131 lt~~gi~~g~vg~l~~~ 147 (167)
T PF11157_consen 131 LTPEGIVFGLVGALLGA 147 (167)
T ss_pred CCHHHHHHHHHHHHHHH
Confidence 34556666665555433
No 161
>PF03908 Sec20: Sec20; InterPro: IPR005606 Sec20 is a membrane glycoprotein associated with secretory pathway.
Probab=37.96 E-value=38 Score=25.32 Aligned_cols=15 Identities=27% Similarity=0.549 Sum_probs=7.1
Q ss_pred HHHHHHHHHHHHHHH
Q psy12432 100 LVVILCFSLYVHCQR 114 (229)
Q Consensus 100 LVlii~vvly~~~kR 114 (229)
+++++++++|+.++|
T Consensus 77 ~~~f~~~v~yI~~rR 91 (92)
T PF03908_consen 77 FLFFLLVVLYILWRR 91 (92)
T ss_pred HHHHHHHHHHHhhhc
Confidence 334444445555554
No 162
>PF10883 DUF2681: Protein of unknown function (DUF2681); InterPro: IPR020274 This entry contains membrane proteins with no known function.
Probab=37.42 E-value=85 Score=23.89 Aligned_cols=26 Identities=15% Similarity=0.268 Sum_probs=15.3
Q ss_pred HhhhHHHHHHHHHHHHHHHHHHHHHh
Q psy12432 90 IASGASIAVFLVVILCFSLYVHCQRR 115 (229)
Q Consensus 90 Ia~gI~V~lLLVlii~vvly~~~kRr 115 (229)
|+++++.++++++.-++|-..+-+++
T Consensus 6 iv~~~~~v~~~i~~y~~~k~~ka~~~ 31 (87)
T PF10883_consen 6 IVGGVGAVVALILAYLWWKVKKAKKQ 31 (87)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34445555555666666666666655
No 163
>PF07297 DPM2: Dolichol phosphate-mannose biosynthesis regulatory protein (DPM2); InterPro: IPR009914 This family consists of several eukaryotic dolichol phosphate-mannose biosynthesis regulatory (DPM2) proteins. Biosynthesis of glycosylphosphatidylinositol and N-glycan precursor is dependent upon a mannosyl donor, dolichol phosphate-mannose (DPM). DPM2, an 84 amino acid membrane protein expressed in the endoplasmic reticulum (ER), makes a complex with DPM1 that is essential for the ER localisation and stable expression of DPM1. Moreover, DPM2 enhances binding of dolichol phosphate, a substrate of DPM synthase. Biosynthesis of DPM in mammalian cells is regulated by DPM2 [].; GO: 0009059 macromolecule biosynthetic process, 0030176 integral to endoplasmic reticulum membrane
Probab=37.27 E-value=59 Score=24.29 Aligned_cols=28 Identities=21% Similarity=0.202 Sum_probs=11.5
Q ss_pred HHhhhHHHHHH-HHHHHHHHHHHHHHHhh
Q psy12432 89 SIASGASIAVF-LVVILCFSLYVHCQRRK 116 (229)
Q Consensus 89 ~Ia~gI~V~lL-LVlii~vvly~~~kRrk 116 (229)
+|.+.+.++++ +.++..++.+...|.+|
T Consensus 47 Ai~lP~~lll~~~~~vg~f~g~vmik~~~ 75 (78)
T PF07297_consen 47 AIILPIFLLLLGLSGVGTFLGYVMIKSKK 75 (78)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhccc
Confidence 34444444443 33333344444444333
No 164
>PF09472 MtrF: Tetrahydromethanopterin S-methyltransferase, F subunit (MtrF); InterPro: IPR013347 Many archaea have evolved energy-yielding pathways marked by one-carbon biochemistry featuring novel cofactors and enzymes. This domain is mostly found in MtrF, where it covers the entire length of the protein. This polypeptide is one of eight subunits of the N5-methyltetrahydromethanopterin: coenzyme M methyltransferase complex found in methanogenic archaea. This is a membrane-associated enzyme complex that uses methyl-transfer reactions to drive a sodium-ion pump []. MtrF itself is involved in the transfer of the methyl group from N5-methyltetrahydromethanopterin to coenzyme M. Subsequently, methane is produced by two-electron reduction of the methyl moiety in methyl-coenzyme M by another enzyme, methyl-coenzyme M reductase. In some organisms this domain is found at the C-terminal region of what appears to be a fusion of the MtrA and MtrF proteins [, ]. The function of these proteins is unknown, though it is likely that they are involved in C1 metabolism.; GO: 0030269 tetrahydromethanopterin S-methyltransferase activity, 0015948 methanogenesis, 0016020 membrane
Probab=36.99 E-value=41 Score=24.20 Aligned_cols=20 Identities=30% Similarity=0.255 Sum_probs=11.2
Q ss_pred HHHHhhhHHHHHHHHHHHHH
Q psy12432 87 TASIASGASIAVFLVVILCF 106 (229)
Q Consensus 87 ~~~Ia~gI~V~lLLVlii~v 106 (229)
..+.++|+.++++++++..+
T Consensus 42 ~~GfaiG~~~AlvLv~ip~~ 61 (64)
T PF09472_consen 42 IKGFAIGFLFALVLVGIPIL 61 (64)
T ss_pred hHHHHHHHHHHHHHHHHHHH
Confidence 34556666666655555443
No 165
>PF09835 DUF2062: Uncharacterized protein conserved in bacteria (DUF2062); InterPro: IPR018639 This domain, found in various prokaryotic proteins, has no known function. It is found at the C-terminal of family 2 glycosyltransferase proteins, in addition to proteins of unknown function.
Probab=36.34 E-value=72 Score=25.74 Aligned_cols=12 Identities=17% Similarity=0.412 Sum_probs=5.5
Q ss_pred HHHhhhHHHHHH
Q psy12432 88 ASIASGASIAVF 99 (229)
Q Consensus 88 ~~Ia~gI~V~lL 99 (229)
..++.|++++++
T Consensus 123 G~~i~~~v~~~i 134 (154)
T PF09835_consen 123 GSLILGIVLGII 134 (154)
T ss_pred HHHHHHHHHHHH
Confidence 344444444444
No 166
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domai.
Probab=36.31 E-value=27 Score=22.81 Aligned_cols=16 Identities=38% Similarity=0.931 Sum_probs=14.2
Q ss_pred eEeecCCCCCCCCCcc
Q psy12432 54 YNCECADGYMGQRCEF 69 (229)
Q Consensus 54 ~sC~C~~Gy~G~rCe~ 69 (229)
..|.|..++.|.+|+.
T Consensus 18 G~C~C~~~~~G~~C~~ 33 (46)
T smart00180 18 GQCECKPNVTGRRCDR 33 (46)
T ss_pred CEEECCCCCCCCCCCc
Confidence 5899999999999984
No 167
>PF01414 DSL: Delta serrate ligand; InterPro: IPR001774 Ligands of the Delta/Serrate/lag-2 (DSL) family and their receptors, members of the lin-12/Notch family, mediate cell-cell interactions that specify cell fate in invertebrates and vertebrates. In Caenorhabditis elegans, two DSL genes, lag-2 and apx-1, influence different cell fate decisions during development []. Molecular interaction between Notch and Serrate, another EGF-homologous transmembrane protein containing a region of striking similarity to Delta, has been shown and the same two EGF repeats of Notch may also constitute a Serrate binding domain [, ].; GO: 0007154 cell communication, 0016020 membrane; PDB: 2VJ2_A.
Probab=34.74 E-value=19 Score=25.57 Aligned_cols=41 Identities=24% Similarity=0.555 Sum_probs=19.1
Q ss_pred ceeeCCCCCCCCCCCCCCEEEeeccCCCCceEee------cCCCCCCCCC
Q psy12432 24 HTYACPPTYATWYCLNGATCFTVKIGESLLYNCE------CADGYMGQRC 67 (229)
Q Consensus 24 y~C~C~~gy~gnpClNGGtC~~~~~~~~~~~sC~------C~~Gy~G~rC 67 (229)
+.-.|.+.|.|..|.. .|....+. ...|.|. |..||.|..|
T Consensus 17 ~rv~C~~nyyG~~C~~--~C~~~~d~-~ghy~Cd~~G~~~C~~Gw~G~~C 63 (63)
T PF01414_consen 17 IRVVCDENYYGPNCSK--FCKPRDDS-FGHYTCDSNGNKVCLPGWTGPNC 63 (63)
T ss_dssp ------TTEETTTT-E--E---EEET-TEEEEE-SS--EEE-TTEESTTS
T ss_pred EEEECCCCCCCccccC--CcCCCcCC-cCCcccCCCCCCCCCCCCcCCCC
Confidence 4678888888877753 45443211 2357774 8888998877
No 168
>PF03554 Herpes_UL73: UL73 viral envelope glycoprotein ; InterPro: IPR005211 This entry represents a conserved region found in a number of viral proteins: BLRF1, U46, 53, and UL73, collectively known as glycoprotein N. These UL73-like envelope glycoproteins, which associate in a high molecular mass complex with their counterpart protein gM, induce neutralizing antibody responses in the host. These glycoproteins are highly polymorphic, particularly in the N-terminal region [].; GO: 0019031 viral envelope
Probab=34.67 E-value=1.2e+02 Score=22.89 Aligned_cols=29 Identities=17% Similarity=0.277 Sum_probs=14.6
Q ss_pred HHHHHhhhHHHHHHHHHHHHHHHHHHHHH
Q psy12432 86 ETASIASGASIAVFLVVILCFSLYVHCQR 114 (229)
Q Consensus 86 ~~~~Ia~gI~V~lLLVlii~vvly~~~kR 114 (229)
.+..|.+-+-+.+|++..++.+.|+.++|
T Consensus 47 SFsSIW~iiN~~il~~A~~vyLry~Cf~~ 75 (82)
T PF03554_consen 47 SFSSIWAIINVVILLCAFCVYLRYLCFQK 75 (82)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 45555543444444445555555555554
No 169
>PF12768 Rax2: Cortical protein marker for cell polarity
Probab=34.54 E-value=41 Score=30.66 Aligned_cols=12 Identities=25% Similarity=0.338 Sum_probs=5.6
Q ss_pred HHHhhhHHHHHH
Q psy12432 88 ASIASGASIAVF 99 (229)
Q Consensus 88 ~~Ia~gI~V~lL 99 (229)
..|++|+++++.
T Consensus 234 lAiALG~v~ll~ 245 (281)
T PF12768_consen 234 LAIALGTVFLLV 245 (281)
T ss_pred hHHHHHHHHHHH
Confidence 345555544443
No 170
>PHA02887 EGF-like protein; Provisional
Probab=34.47 E-value=32 Score=27.85 Aligned_cols=21 Identities=14% Similarity=0.265 Sum_probs=16.6
Q ss_pred CCCCCceeeCCCCCCCCCCCC
Q psy12432 19 PNITFHTYACPPTYATWYCLN 39 (229)
Q Consensus 19 ~~~~gy~C~C~~gy~gnpClN 39 (229)
.+.....|.|..||.|..|..
T Consensus 103 ~dL~epsCrC~~GYtG~RCE~ 123 (126)
T PHA02887 103 IDLDEKFCICNKGYTGIRCDE 123 (126)
T ss_pred ccCCCceeECCCCcccCCCCc
Confidence 345568999999999888853
No 171
>PTZ00233 variable surface protein Vir18; Provisional
Probab=34.36 E-value=28 Score=34.50 Aligned_cols=27 Identities=15% Similarity=0.315 Sum_probs=12.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHhhhhhhh
Q psy12432 95 SIAVFLVVILCFSLYVHCQRRKKQAQA 121 (229)
Q Consensus 95 ~V~lLLVlii~vvly~~~kRrk~~k~~ 121 (229)
+++|||-||+=..-.|+..-||+||+-
T Consensus 443 GIvLLLGLLFKyTPLWRvLTKknRKk~ 469 (509)
T PTZ00233 443 GIALLLGLLFKYTPLWRVLTKKNRKKG 469 (509)
T ss_pred hHHHHHHHhhccchhHHhhhhcccccc
Confidence 344444333334445555555555543
No 172
>KOG3054|consensus
Probab=34.26 E-value=41 Score=30.62 Aligned_cols=18 Identities=22% Similarity=0.171 Sum_probs=9.4
Q ss_pred CCCccc-ccCCCCCCcCCCcc
Q psy12432 165 TRTSIT-ITGKGDSVSASQLY 184 (229)
Q Consensus 165 ~~~~ld-~~~~g~~~~~~~~f 184 (229)
++++-| -++.|++ +-|++
T Consensus 78 ~sd~ee~~~~dg~e--e~~e~ 96 (299)
T KOG3054|consen 78 TSDVEEEGSGDGDE--EEPEA 96 (299)
T ss_pred cccccccccccccc--ccccc
Confidence 345555 5555655 55544
No 173
>PRK13275 mtrF tetrahydromethanopterin S-methyltransferase subunit F; Provisional
Probab=34.22 E-value=78 Score=23.02 Aligned_cols=20 Identities=30% Similarity=0.373 Sum_probs=11.7
Q ss_pred HHHHhhhHHHHHHHHHHHHH
Q psy12432 87 TASIASGASIAVFLVVILCF 106 (229)
Q Consensus 87 ~~~Ia~gI~V~lLLVlii~v 106 (229)
+.++++|..++++++++...
T Consensus 42 ~~G~aiG~~~AlvLv~ip~~ 61 (67)
T PRK13275 42 IIGFAIGFLLALLLVVVPPL 61 (67)
T ss_pred HHHHHHHHHHHHHHHHHHHH
Confidence 45566666666665554443
No 174
>PF02430 AMA-1: Apical membrane antigen 1; InterPro: IPR003298 A novel antigen of Plasmodium falciparum has been cloned that contains a hydrophobic domain typical of an integral membrane protein. The antigen is designated apical membrane antigen 1 (AMA-1) by virtue of appearing to be located in the apical complex []. AMA-1 appears to be transported to the merozoite surface close to the time of schizont rupture. The 66kDa merozoite surface antigen (PK66) of Plasmodium knowlesi, a simian malaria, possesses vaccine-related properties believed to originate from a receptor-like role in parasite invasion of erythrocytes []. The sequence of PK66 is conserved throughout plasmodium, and shows high similarity to P. falciparum AMA-1. Following schizont rupture, the distribution of PK66 changes in a coordinate manner associated with merozoite invasion. Prior to rupture, the protein is concentrated at the apical end, following which it distributes itself entirely across the surface of the free merozoite. Immunofluorescence studies suggest that, during invasion, PK66 is excluded from the erythrocyte at, and behind, the invasion interface []. ; GO: 0009405 pathogenesis, 0016020 membrane; PDB: 2Y8R_D 2X2Z_A 2Y8T_A 2Y8S_D 1HN6_A 2Z8V_B 2Z8W_A 1Z40_E 2Q8B_A 2Q8A_A ....
Probab=32.57 E-value=15 Score=35.92 Aligned_cols=31 Identities=16% Similarity=0.215 Sum_probs=0.0
Q ss_pred HHHHhhhHHHHHHHHHHHHHHHHHHHHHhhh
Q psy12432 87 TASIASGASIAVFLVVILCFSLYVHCQRRKK 117 (229)
Q Consensus 87 ~~~Ia~gI~V~lLLVlii~vvly~~~kRrk~ 117 (229)
+.+|++++.++++||.|++.++|-+.++.|+
T Consensus 429 ~~~~~~~~~~~~~~~~l~~~~~~k~~~~~kk 459 (471)
T PF02430_consen 429 RIVIIISIATGAVLLALAIYWYYKRLWKTKK 459 (471)
T ss_dssp -------------------------------
T ss_pred eeehhhHHHHHHHHHHHHHHHHHhhhhcccc
Confidence 3445555666665555555555554444433
No 175
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=32.44 E-value=45 Score=21.03 Aligned_cols=18 Identities=33% Similarity=0.894 Sum_probs=13.0
Q ss_pred EEEeeccCCCCceEeecCCCCC
Q psy12432 42 TCFTVKIGESLLYNCECADGYM 63 (229)
Q Consensus 42 tC~~~~~~~~~~~sC~C~~Gy~ 63 (229)
.|.+.. ..|.|.|+.||.
T Consensus 11 ~C~~~~----g~~~C~C~~Gy~ 28 (36)
T PF14670_consen 11 ICVNTP----GSYRCSCPPGYK 28 (36)
T ss_dssp EEEEET----TSEEEE-STTEE
T ss_pred CCccCC----CceEeECCCCCE
Confidence 677764 369999999984
No 176
>PF11395 DUF2873: Protein of unknown function (DUF2873); InterPro: IPR021532 This entry is represented by the human SARS coronavirus, Orf7b; it is a family of uncharacterised viral proteins.
Probab=32.39 E-value=61 Score=21.08 Aligned_cols=11 Identities=36% Similarity=0.781 Sum_probs=4.5
Q ss_pred HHHHHHHHHHH
Q psy12432 98 VFLVVILCFSL 108 (229)
Q Consensus 98 lLLVlii~vvl 108 (229)
+++|+|+.+++
T Consensus 18 lflv~imliif 28 (43)
T PF11395_consen 18 LFLVIIMLIIF 28 (43)
T ss_pred HHHHHHHHHHH
Confidence 33444444333
No 177
>PRK11486 flagellar biosynthesis protein FliO; Provisional
Probab=32.26 E-value=59 Score=26.31 Aligned_cols=19 Identities=16% Similarity=0.148 Sum_probs=8.3
Q ss_pred HHHHHHHHHHHHHHHHHHh
Q psy12432 97 AVFLVVILCFSLYVHCQRR 115 (229)
Q Consensus 97 ~lLLVlii~vvly~~~kRr 115 (229)
+|++|+.+++++.|..||-
T Consensus 24 ~L~lVl~lI~~~aWLlkR~ 42 (124)
T PRK11486 24 ALIGIIALILAAAWLVKRL 42 (124)
T ss_pred HHHHHHHHHHHHHHHHHHc
Confidence 3333444444444545553
No 178
>PF13124 DUF3963: Protein of unknown function (DUF3963)
Probab=31.72 E-value=84 Score=20.29 Aligned_cols=21 Identities=14% Similarity=0.099 Sum_probs=13.6
Q ss_pred HHHHhhhHHHHHHHHHHHHHH
Q psy12432 87 TASIASGASIAVFLVVILCFS 107 (229)
Q Consensus 87 ~~~Ia~gI~V~lLLVlii~vv 107 (229)
.+.-.+..|+++|+|.++.+|
T Consensus 18 kwirnit~cfal~vv~lvslw 38 (40)
T PF13124_consen 18 KWIRNITFCFALLVVVLVSLW 38 (40)
T ss_pred HHHHHHHHHHHHHHHHHHHHH
Confidence 344455567888877777655
No 179
>COG3115 ZipA Cell division protein [Cell division and chromosome partitioning]
Probab=31.68 E-value=1.6e+02 Score=27.47 Aligned_cols=10 Identities=30% Similarity=0.155 Sum_probs=5.3
Q ss_pred CCCCchhhhh
Q psy12432 208 SQPPDDIKAD 217 (229)
Q Consensus 208 ~~~~~~~~~~ 217 (229)
++++..++|+
T Consensus 133 ~~~~~~~~a~ 142 (324)
T COG3115 133 RQPVLPQEAP 142 (324)
T ss_pred ccCccccccc
Confidence 4455555555
No 180
>PF01034 Syndecan: Syndecan domain; InterPro: IPR001050 The syndecans are transmembrane proteoglycans which are involved in the organisation of cytoskeleton and/or actin microfilaments, and have important roles as cell surface receptors during cell-cell and/or cell-matrix interactions [, ]. Structurally, these proteins consist of four separate domains: A signal sequence; An extracellular domain (ectodomain) of variable length whose sequence is not evolutionary conserved in the various forms of syndecans. The ectodomain contains the sites of attachment of the heparan sulphate glycosaminoglycan side chains; A transmembrane region; A highly conserved cytoplasmic domain of about 30 to 35 residues, which could interact with cytoskeletal proteins. The proteins known to belong to this family are: Syndecan 1. Syndecan 2 or fibroglycan. Syndecan 3 or neuroglycan or N-syndecan. Syndecan 4 or amphiglycan or ryudocan. Drosophila syndecan. Caenorhabditis elegans probable syndecan (F57C7.3). Syndecan-4, a transmembrane heparan sulphate proteoglycan, is a coreceptor with integrins in cell adhesion. It has been suggested to form a ternary signalling complex with protein kinase Calpha and phosphatidylinositol 4,5-bisphosphate (PIP2). Structural studies have demonstrated that the cytoplasmic domain undergoes a conformational transition and forms a symmetric dimer in the presence of phospholipid activator PIP2, and whose overall structure in solution exhibits a twisted clamp shape having a cavity in the centre of dimeric interface. In addition, it has been observed that the syndecan-4 variable domain interacts, strongly, not only with fatty acyl groups but also the anionic head group of PIP2. These findings indicate that PIP2 promotes oligomerisation of the syndecan-4 cytoplasmic domain for transmembrane signalling and cell-matrix adhesion [, ].; GO: 0008092 cytoskeletal protein binding, 0016020 membrane; PDB: 1EJQ_B 1EJP_B 1YBO_C 1OBY_Q.
Probab=31.66 E-value=17 Score=26.27 Aligned_cols=17 Identities=24% Similarity=0.386 Sum_probs=0.0
Q ss_pred HHhhhHHHHHHHHHHHH
Q psy12432 89 SIASGASIAVFLVVILC 105 (229)
Q Consensus 89 ~Ia~gI~V~lLLVlii~ 105 (229)
++++|++++++|+++++
T Consensus 17 G~Vvgll~ailLIlf~i 33 (64)
T PF01034_consen 17 GGVVGLLFAILLILFLI 33 (64)
T ss_dssp -----------------
T ss_pred HHHHHHHHHHHHHHHHH
Confidence 33444444444444444
No 181
>PF14584 DUF4446: Protein of unknown function (DUF4446)
Probab=31.63 E-value=1e+02 Score=25.53 Aligned_cols=23 Identities=17% Similarity=0.226 Sum_probs=11.5
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHh
Q psy12432 93 GASIAVFLVVILCFSLYVHCQRR 115 (229)
Q Consensus 93 gI~V~lLLVlii~vvly~~~kRr 115 (229)
++++++|+++++...-+-+.+|+
T Consensus 9 ~l~iilli~~~~~~~kl~kl~r~ 31 (151)
T PF14584_consen 9 VLVIILLILIIILNIKLRKLKRR 31 (151)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Confidence 44445555555555555554444
No 182
>PF12158 DUF3592: Protein of unknown function (DUF3592); InterPro: IPR021994 This family of proteins is functionally uncharacterised.This family of proteins is found in bacteria, archaea, eukaryotes and viruses. Proteins in this family are typically between 150 and 242 amino acids in length.
Probab=31.46 E-value=70 Score=24.85 Aligned_cols=9 Identities=22% Similarity=0.552 Sum_probs=4.6
Q ss_pred ccccCCCCC
Q psy12432 183 LYHQSSPPL 191 (229)
Q Consensus 183 ~f~~~~~~~ 191 (229)
-|++.||..
T Consensus 101 ~Y~P~~P~~ 109 (148)
T PF12158_consen 101 YYNPNNPEE 109 (148)
T ss_pred EECCcCCCe
Confidence 455555544
No 183
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=31.21 E-value=50 Score=28.18 Aligned_cols=36 Identities=22% Similarity=0.544 Sum_probs=23.7
Q ss_pred CCcee----eCCCCCCCCCCCCCCEEEeeccCCCCceEeecCCCCCCC
Q psy12432 22 TFHTY----ACPPTYATWYCLNGATCFTVKIGESLLYNCECADGYMGQ 65 (229)
Q Consensus 22 ~gy~C----~C~~gy~gnpClNGGtC~~~~~~~~~~~sC~C~~Gy~G~ 65 (229)
.+..| .|..+ ...|. ..|.+.. +.|.|.|..||+..
T Consensus 180 ~~~~C~~~~~C~~~--~~~c~--~~C~~~~----g~~~c~c~~g~~~~ 219 (224)
T cd01475 180 QGKICVVPDLCATL--SHVCQ--QVCISTP----GSYLCACTEGYALL 219 (224)
T ss_pred ccccCcCchhhcCC--CCCcc--ceEEcCC----CCEEeECCCCccCC
Confidence 34556 47653 24565 3687653 36999999999764
No 184
>PF15339 Afaf: Acrosome formation-associated factor
Probab=31.04 E-value=1e+02 Score=26.69 Aligned_cols=6 Identities=33% Similarity=0.445 Sum_probs=2.5
Q ss_pred hhhHHH
Q psy12432 91 ASGASI 96 (229)
Q Consensus 91 a~gI~V 96 (229)
..||.+
T Consensus 133 mLGIsL 138 (200)
T PF15339_consen 133 MLGISL 138 (200)
T ss_pred HHHHHH
Confidence 344443
No 185
>PF13703 PepSY_TM_2: PepSY-associated TM helix
Probab=30.98 E-value=80 Score=23.18 Aligned_cols=15 Identities=13% Similarity=0.379 Sum_probs=6.0
Q ss_pred hhhHHHHHHHHHHHH
Q psy12432 91 ASGASIAVFLVVILC 105 (229)
Q Consensus 91 a~gI~V~lLLVlii~ 105 (229)
++++..++++++++.
T Consensus 19 iv~~~al~~l~~~is 33 (88)
T PF13703_consen 19 IVGILALLLLLLLIS 33 (88)
T ss_pred HHHHHHHHHHHHHHH
Confidence 444443333333333
No 186
>PF05356 Phage_Coat_B: Phage Coat protein B ; InterPro: IPR008020 The major coat protein in the capsid of filamentous bacteriophage forms a helical assembly of about 7000 identical protomers, with each protomer comprised of 46 amino acids, after the cleavage of the signal peptide. Each protomer forms a slightly curved helix that combines to form a tubular structure that encapsulates the viral DNA [].; PDB: 1QL1_A 2XKM_A 4IFM_A 1QL2_A 1IFM_A 2KLV_A 1IFN_A 2IFN_A 3IFM_A 2KSJ_A ....
Probab=30.89 E-value=91 Score=23.34 Aligned_cols=23 Identities=13% Similarity=0.301 Sum_probs=13.0
Q ss_pred hhhHHHHHHHHHHHHHHHHHHHH
Q psy12432 91 ASGASIAVFLVVILCFSLYVHCQ 113 (229)
Q Consensus 91 a~gI~V~lLLVlii~vvly~~~k 113 (229)
++|.+|++|+++.+..++|-..+
T Consensus 59 vgg~IVgvl~~laVaGlI~~l~R 81 (83)
T PF05356_consen 59 VGGYIVGVLVILAVAGLIYSLLR 81 (83)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHH
T ss_pred hhHHHHHHHHHHHHHHHHHHHHh
Confidence 34556666666666665554433
No 187
>PF13268 DUF4059: Protein of unknown function (DUF4059)
Probab=30.74 E-value=74 Score=23.47 Aligned_cols=19 Identities=11% Similarity=0.254 Sum_probs=9.0
Q ss_pred HHHHHHHHHHHHHHHhhhh
Q psy12432 100 LVVILCFSLYVHCQRRKKQ 118 (229)
Q Consensus 100 LVlii~vvly~~~kRrk~~ 118 (229)
++++.++|+.++..|++.+
T Consensus 20 V~~~~~~wi~~Ra~~~~DK 38 (72)
T PF13268_consen 20 VLLVSGIWILWRALRKKDK 38 (72)
T ss_pred HHHHHHHHHHHHHHHcCCC
Confidence 3444445555555444433
No 188
>PF12606 RELT: Tumour necrosis factor receptor superfamily member 19; InterPro: IPR022248 The members of tumor necrosis factor receptor (TNFR) superfamily have been designated as the "guardians of the immune system" due to their roles in immune cell proliferation, differentiation, activation, and death (apoptosis). RELT (receptor expressed in lymphoid tissues) is a member of the TNFR superfamily. The messenger RNA of RELT is especially abundant in hematologic tissues such as spleen, lymph node, and peripheral blood leukocytes as well as in leukemias and lymphomas. RELT is able to activate the NF-kappaB pathway and selectively binds tumor necrosis factor receptor-associated factor 1 []. RELT like proteins 1 and 2 (RELL1 and RELL2) are two RELT homologues that bind to RELT. The expression of RELL1 at the mRNA level is ubiquitous, whereas expression of RELL2 mRNA is more restricted to particular tissues [].
Probab=30.72 E-value=66 Score=22.02 Aligned_cols=17 Identities=18% Similarity=0.049 Sum_probs=6.1
Q ss_pred HHHHHHHHHHHHHHHHh
Q psy12432 99 FLVVILCFSLYVHCQRR 115 (229)
Q Consensus 99 LLVlii~vvly~~~kRr 115 (229)
|+|++++++..|...|+
T Consensus 10 ~iv~~lLg~~I~~~~K~ 26 (50)
T PF12606_consen 10 FIVMGLLGLSICTTLKA 26 (50)
T ss_pred HHHHHHHHHHHHHHhhc
Confidence 33333333333333333
No 189
>TIGR00859 ENaC sodium channel transporter. This model is designed from the vertebrate members of the ENaC family.
Probab=30.05 E-value=3.5e+02 Score=27.20 Aligned_cols=22 Identities=23% Similarity=0.336 Sum_probs=8.7
Q ss_pred CcCCCccccCCCCCCCCCCCCCC
Q psy12432 178 VSASQLYHQSSPPLPHMSHPPSC 200 (229)
Q Consensus 178 ~~~~~~f~~~~~~~~~~~~~~~~ 200 (229)
+...+.+++. ..-+++.+++-+
T Consensus 564 ~~~~~~~~~~-~~~~~~~~~~~~ 585 (595)
T TIGR00859 564 LDDPPTFPSA-LPLPHASGLSLP 585 (595)
T ss_pred ccCCCCCCcc-cCCCCccCCCCC
Confidence 3334455443 333344444333
No 190
>PRK13707 conjugal transfer pilus assembly protein TraL; Provisional
Probab=29.71 E-value=1.2e+02 Score=23.55 Aligned_cols=14 Identities=14% Similarity=0.199 Sum_probs=6.0
Q ss_pred HHHHHHHHHhhhhh
Q psy12432 106 FSLYVHCQRRKKQA 119 (229)
Q Consensus 106 vvly~~~kRrk~~k 119 (229)
++++...+|-|+.+
T Consensus 53 ~~~~~~~r~lK~g~ 66 (101)
T PRK13707 53 VLVWFGIRKLKKGR 66 (101)
T ss_pred HHHHHHHHHHHcCC
Confidence 34444444444444
No 191
>KOG3054|consensus
Probab=29.60 E-value=52 Score=29.99 Aligned_cols=8 Identities=25% Similarity=0.434 Sum_probs=3.2
Q ss_pred HHHHHhhh
Q psy12432 110 VHCQRRKK 117 (229)
Q Consensus 110 ~~~kRrk~ 117 (229)
+.++|++.
T Consensus 21 ~l~~r~ra 28 (299)
T KOG3054|consen 21 FLWKRRRA 28 (299)
T ss_pred HHHHhhcc
Confidence 33444433
No 192
>PF13131 DUF3951: Protein of unknown function (DUF3951)
Probab=29.49 E-value=58 Score=22.55 Aligned_cols=19 Identities=16% Similarity=0.481 Sum_probs=8.3
Q ss_pred HHHHHHHHHHHHHHHHHHH
Q psy12432 96 IAVFLVVILCFSLYVHCQR 114 (229)
Q Consensus 96 V~lLLVlii~vvly~~~kR 114 (229)
+.+++.+++.++.|..+-+
T Consensus 11 ~~~~I~~lIgfity~mfV~ 29 (53)
T PF13131_consen 11 FTIFIFFLIGFITYKMFVK 29 (53)
T ss_pred HHHHHHHHHHHHHHHhhee
Confidence 3444444444444444443
No 193
>TIGR02736 cbb3_Q_epsi cytochrome c oxidase, cbb3-type, CcoQ subunit, epsilon-Proteobacterial. Members of this protein family are restricted to the epsilon branch of the Proteobacteria. All members are found in operons containing the other three structural subunits of the cbb3 type of cytochrome c oxidase. These small proteins show remote sequence similarity to the CcoQ subunit in other cytochrome c oxidase systems, so this family is assumed to represent the epsilonproteobacterial variant of CcoQ.
Probab=28.46 E-value=74 Score=22.36 Aligned_cols=19 Identities=32% Similarity=0.443 Sum_probs=7.6
Q ss_pred HHHHHHHHHHHHHHHhhhh
Q psy12432 100 LVVILCFSLYVHCQRRKKQ 118 (229)
Q Consensus 100 LVlii~vvly~~~kRrk~~ 118 (229)
|++++-.-+|..++++|+-
T Consensus 11 lvv~LYgY~yhLYrsek~G 29 (56)
T TIGR02736 11 LVIFLYAYIYHLYRSQKKG 29 (56)
T ss_pred HHHHHHHHHHHhhhhhccc
Confidence 3333333344444444433
No 194
>PRK04125 murein hydrolase regulator LrgA; Provisional
Probab=28.44 E-value=1.2e+02 Score=24.95 Aligned_cols=21 Identities=14% Similarity=0.036 Sum_probs=9.8
Q ss_pred HHHHhhhHHHHHHHHHHHHHH
Q psy12432 87 TASIASGASIAVFLVVILCFS 107 (229)
Q Consensus 87 ~~~Ia~gI~V~lLLVlii~vv 107 (229)
++.|++.+.+.-++++++..+
T Consensus 91 ~~~Il~~ivvSTllvl~vtg~ 111 (141)
T PRK04125 91 PVQIIGVIIVATILLLACTGL 111 (141)
T ss_pred HHHHHHHHHHHHHHHHHHHHH
Confidence 444444444444444444433
No 195
>CHL00008 petG cytochrome b6/f complex subunit V
Probab=28.42 E-value=1.1e+02 Score=19.71 Aligned_cols=24 Identities=17% Similarity=0.426 Sum_probs=10.7
Q ss_pred hhhHHHHHHHHHH--HHHHHHHHHHH
Q psy12432 91 ASGASIAVFLVVI--LCFSLYVHCQR 114 (229)
Q Consensus 91 a~gI~V~lLLVli--i~vvly~~~kR 114 (229)
..||+++++-+-+ +.+..|..++|
T Consensus 6 L~GiVLGlipvTl~GlfvaAylQYrR 31 (37)
T CHL00008 6 LFGIVLGLIPITLAGLFVTAYLQYRR 31 (37)
T ss_pred hhhHHHHhHHHHHHHHHHHHHHHHhh
Confidence 3455555542222 22345555554
No 196
>PF07271 Cytadhesin_P30: Cytadhesin P30/P32; InterPro: IPR009896 This family consists of several Mycoplasma species specific Cytadhesin P32 and P30 proteins. P30 has been found to be membrane associated and localised on the tip organelle. It is thought that it is important in cytadherence and virulence [].; GO: 0007157 heterophilic cell-cell adhesion, 0009405 pathogenesis, 0016021 integral to membrane
Probab=28.34 E-value=41 Score=30.80 Aligned_cols=13 Identities=8% Similarity=0.169 Sum_probs=4.8
Q ss_pred hHHHHHHHHHHHH
Q psy12432 93 GASIAVFLVVILC 105 (229)
Q Consensus 93 gI~V~lLLVlii~ 105 (229)
|+++++|||++.+
T Consensus 78 G~~~v~liLgl~i 90 (279)
T PF07271_consen 78 GLLAVALILGLAI 90 (279)
T ss_pred hHHHHHHHHHHhh
Confidence 3333333333333
No 197
>COG4218 MtrF Tetrahydromethanopterin S-methyltransferase, subunit F [Coenzyme metabolism]
Probab=28.26 E-value=60 Score=23.90 Aligned_cols=19 Identities=42% Similarity=0.347 Sum_probs=11.1
Q ss_pred HHHHhhhHHHHHHHHHHHH
Q psy12432 87 TASIASGASIAVFLVVILC 105 (229)
Q Consensus 87 ~~~Ia~gI~V~lLLVlii~ 105 (229)
+.++++|.+++++++++..
T Consensus 51 i~GlaiGfvfA~vLv~ill 69 (73)
T COG4218 51 IAGLAIGFVFAGVLVGILL 69 (73)
T ss_pred hHHHHHHHHHHHHHHHHHH
Confidence 4566666666665555544
No 198
>PF07178 TraL: TraL protein; InterPro: IPR009838 This entry represents bacterial TraL proteins. TraL is a predicted peripheral membrane protein involved in bacterial sex pilus assembly []. TraL is part of the type IV secretion system for conjugative plasmid transfer []. The exact function of TraL is unknown.; GO: 0000746 conjugation, 0019867 outer membrane
Probab=28.00 E-value=1.3e+02 Score=22.73 Aligned_cols=10 Identities=30% Similarity=0.368 Sum_probs=3.6
Q ss_pred HHHHHHHhhh
Q psy12432 108 LYVHCQRRKK 117 (229)
Q Consensus 108 ly~~~kRrk~ 117 (229)
+++.++|-|+
T Consensus 49 ~~~~~~k~K~ 58 (95)
T PF07178_consen 49 LWWGYRKFKK 58 (95)
T ss_pred HHHHHHHHHc
Confidence 3333333333
No 199
>PHA03283 envelope glycoprotein E; Provisional
Probab=27.91 E-value=36 Score=33.86 Aligned_cols=19 Identities=21% Similarity=0.216 Sum_probs=9.4
Q ss_pred HHHHHHHHHHHHHHHHhhh
Q psy12432 99 FLVVILCFSLYVHCQRRKK 117 (229)
Q Consensus 99 LLVlii~vvly~~~kRrk~ 117 (229)
++|+.+++|.|++++|+++
T Consensus 412 ~~~~~l~vw~c~~~r~~~~ 430 (542)
T PHA03283 412 ALLVALVVWGCILYRRSNR 430 (542)
T ss_pred HHHHHHhhhheeeehhhcC
Confidence 3444445556665544333
No 200
>PF09064 Tme5_EGF_like: Thrombomodulin like fifth domain, EGF-like; InterPro: IPR015149 This domain adopts a fold similar to other EGF domains, with a flat major and a twisted minor beta sheet. Disulphide pairing, however, is not of the usual 1-3, 2-4, 5-6 type; rather 1-2, 3-4, 5-6 pairing is found. Its extended major sheet (strands beta-2 and beta-3 and the connecting loop) projects into thrombin's active site groove. This domain is required for interaction of thrombomodulin with thrombin, and subsequent activation of protein-C []. ; GO: 0004888 transmembrane signaling receptor activity, 0016021 integral to membrane
Probab=27.75 E-value=32 Score=21.83 Aligned_cols=10 Identities=30% Similarity=0.577 Sum_probs=8.1
Q ss_pred CceeeCCCCC
Q psy12432 23 FHTYACPPTY 32 (229)
Q Consensus 23 gy~C~C~~gy 32 (229)
...|.||+||
T Consensus 17 ~~~C~CPeGy 26 (34)
T PF09064_consen 17 PGQCFCPEGY 26 (34)
T ss_pred CCceeCCCce
Confidence 4589999987
No 201
>PF11353 DUF3153: Protein of unknown function (DUF3153); InterPro: IPR021499 This family of proteins with unknown function appear to be restricted to Cyanobacteria. Some members are annotated as membrane proteins however this cannot be confirmed.
Probab=27.72 E-value=59 Score=27.97 Aligned_cols=14 Identities=7% Similarity=0.289 Sum_probs=6.4
Q ss_pred HHHHHHHHHHHHhh
Q psy12432 103 ILCFSLYVHCQRRK 116 (229)
Q Consensus 103 ii~vvly~~~kRrk 116 (229)
++++.+|++++|.+
T Consensus 195 l~~~~~~l~~~r~~ 208 (209)
T PF11353_consen 195 LILLGFLLRRRRLP 208 (209)
T ss_pred HHHHHHHHHHhhcC
Confidence 33334555555543
No 202
>PF04415 DUF515: Protein of unknown function (DUF515) ; InterPro: IPR007509 This is a family of hypothetical archaeal proteins.
Probab=27.68 E-value=66 Score=31.17 Aligned_cols=26 Identities=19% Similarity=0.625 Sum_probs=13.4
Q ss_pred HhhhHHHHHHHHHHHHHHHHHHHHHh
Q psy12432 90 IASGASIAVFLVVILCFSLYVHCQRR 115 (229)
Q Consensus 90 Ia~gI~V~lLLVlii~vvly~~~kRr 115 (229)
+++|..|+.++++++.+.+|+..+-+
T Consensus 32 iiiGa~Vl~iIii~~~~~~Y~~~~~~ 57 (416)
T PF04415_consen 32 IIIGAAVLIIIIIFIVYNIYYFLQNQ 57 (416)
T ss_pred hhhhhhhHhHHHHHHHHHHHHHhhhh
Confidence 34555555555555555555554433
No 203
>PHA03163 hypothetical protein; Provisional
Probab=27.61 E-value=2.4e+02 Score=21.69 Aligned_cols=29 Identities=14% Similarity=0.359 Sum_probs=14.0
Q ss_pred HHHHHhhhHHHHHHHHHHHHHHHHHHHHH
Q psy12432 86 ETASIASGASIAVFLVVILCFSLYVHCQR 114 (229)
Q Consensus 86 ~~~~Ia~gI~V~lLLVlii~vvly~~~kR 114 (229)
.+..|.+-+=+.++++..++.+.|+.++|
T Consensus 56 SFSSIWaliNv~Ivl~A~~iyL~y~CF~k 84 (92)
T PHA03163 56 SFSSIWAILNVLIMLIACIIYCIYMCFNK 84 (92)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34444443334444444445555555544
No 204
>TIGR01432 QOXA cytochrome aa3 quinol oxidase, subunit II. This enzyme catalyzes the oxidation of quinol with the concomitant reduction of molecular oxygen to water. This acts as the terminal electron acceptor in the respiratory chain. This subunit contains two transmembrane helices and a large external domain responsible for the binding and oxidation of quinol. QuoX is (presently) only found in gram positive bacteria of the Bacillus/Staphylococcus group. Like CyoA, the ubiquinol oxidase found in proteobacteria, the residues responsible for the ligation of Cu(a) and cytochrome c (found in the related cyt. c oxidases) are absent. Unlike CyoA, QoxA is in complex with a subunit I which contains cytochromes a similar to the cyt. c oxidases (as opposed to cytochromes b).
Probab=27.45 E-value=1.3e+02 Score=25.98 Aligned_cols=10 Identities=20% Similarity=0.198 Sum_probs=3.9
Q ss_pred HHHHHHHHHh
Q psy12432 106 FSLYVHCQRR 115 (229)
Q Consensus 106 vvly~~~kRr 115 (229)
+++.++++++
T Consensus 42 i~~~~kyr~r 51 (217)
T TIGR01432 42 TIFLVKYRYR 51 (217)
T ss_pred HHHHHHHHhc
Confidence 3333444333
No 205
>PRK00665 petG cytochrome b6-f complex subunit PetG; Reviewed
Probab=27.22 E-value=1.1e+02 Score=19.61 Aligned_cols=24 Identities=17% Similarity=0.358 Sum_probs=10.7
Q ss_pred hhhHHHHHHHHHH--HHHHHHHHHHH
Q psy12432 91 ASGASIAVFLVVI--LCFSLYVHCQR 114 (229)
Q Consensus 91 a~gI~V~lLLVli--i~vvly~~~kR 114 (229)
..||+++++-+-+ +.+..|..++|
T Consensus 6 L~GiVLGlipiTl~GlfvaAylQYrR 31 (37)
T PRK00665 6 LCGIVLGLIPVTLAGLFVAAWNQYKR 31 (37)
T ss_pred hhhHHHHhHHHHHHHHHHHHHHHHhc
Confidence 3455555542222 22345555554
No 206
>PRK09400 secE preprotein translocase subunit SecE; Reviewed
Probab=27.21 E-value=1.4e+02 Score=21.13 Aligned_cols=26 Identities=19% Similarity=0.411 Sum_probs=15.3
Q ss_pred hHHHHHhhhHHHHHHHHHHHHHHHHH
Q psy12432 85 LETASIASGASIAVFLVVILCFSLYV 110 (229)
Q Consensus 85 ~~~~~Ia~gI~V~lLLVlii~vvly~ 110 (229)
.|...++-...+++++++++..+++.
T Consensus 30 ~Ef~~ia~~~~iG~~i~G~iGf~Ikl 55 (61)
T PRK09400 30 EEFLLVAKVTGLGILLIGLIGFIIYL 55 (61)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 35555555556666666666655543
No 207
>KOG3514|consensus
Probab=26.99 E-value=47 Score=36.14 Aligned_cols=25 Identities=16% Similarity=0.251 Sum_probs=14.6
Q ss_pred HHHHhhhHHHHHHHHHHHHHHHHHH
Q psy12432 87 TASIASGASIAVFLVVILCFSLYVH 111 (229)
Q Consensus 87 ~~~Ia~gI~V~lLLVlii~vvly~~ 111 (229)
+.++++||.+++.|.+|++.++..+
T Consensus 1514 ttGmVvGIvaAaaLcILilL~am~k 1538 (1591)
T KOG3514|consen 1514 TTGMVVGIVAAAALCILILLYAMYK 1538 (1591)
T ss_pred ccchhhHHHHHHHHHHHHHHhhccc
Confidence 4566777776666555555444443
No 208
>PF02404 SCF: Stem cell factor; InterPro: IPR003452 Stem cell factor (SCF) is a homodimer involved in hematopoiesis. SCF binds to and activates the SCF receptor (SCFR), a receptor tyrosine kinase. SCF stimulates the proliferation of mast cells and is able to augment the proliferation of both myeloid and lymphoid hematopoietic progenitors in bone marrow culture. It also mediates cell-cell adhesion and acts synergistically with other cytokines. SCF is a type I membrane protein, but is also found in a secretable, soluble form. The crystal structure of human SCF has been resolved and a potential receptor-binding site identified [].; GO: 0005173 stem cell factor receptor binding, 0007155 cell adhesion, 0016020 membrane; PDB: 1EXZ_A 1SCF_D 2E9W_C 2O26_A 2O27_A.
Probab=26.98 E-value=21 Score=32.50 Aligned_cols=20 Identities=15% Similarity=0.252 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHHHHHHHHHh
Q psy12432 96 IAVFLVVILCFSLYVHCQRR 115 (229)
Q Consensus 96 V~lLLVlii~vvly~~~kRr 115 (229)
++-||++++++++||+++.+
T Consensus 223 l~SLVIGFvlG~l~WKkkq~ 242 (273)
T PF02404_consen 223 LFSLVIGFVLGALYWKKKQR 242 (273)
T ss_dssp --------------------
T ss_pred HHHHHHHHHHHHHHHHhhcc
Confidence 44456777777777755544
No 209
>PF03988 DUF347: Repeat of Unknown Function (DUF347) ; InterPro: IPR007136 This repeat is found as four tandem repeats in a family of bacterial membrane proteins. Each repeat contains two transmembrane regions and a conserved tryptophan.
Probab=26.87 E-value=92 Score=21.27 Aligned_cols=10 Identities=0% Similarity=-0.074 Sum_probs=3.6
Q ss_pred HHHHHHHHHH
Q psy12432 101 VVILCFSLYV 110 (229)
Q Consensus 101 Vlii~vvly~ 110 (229)
++++++++++
T Consensus 38 ~~l~~~~~~~ 47 (55)
T PF03988_consen 38 ALLAVVLALW 47 (55)
T ss_pred HHHHHHHHHH
Confidence 3333333333
No 210
>PF11431 Transport_MerF: Membrane transport protein MerF; InterPro: IPR021091 This entry represents Mercury ion transport protein MerF, which has a core helix-loop-helix domain. It also has two vicinal pairs of cysteine residues which are involved in the transport of Hg(II) across the membrane and are exposed to the cytoplasm []. ; PDB: 1WAZ_A 2LJ2_A 2H3O_A.
Probab=26.74 E-value=77 Score=21.41 Aligned_cols=18 Identities=33% Similarity=0.693 Sum_probs=10.5
Q ss_pred HHHHHHHHHHHHHHHhhh
Q psy12432 100 LVVILCFSLYVHCQRRKK 117 (229)
Q Consensus 100 LVlii~vvly~~~kRrk~ 117 (229)
+.+++.+.+|-.++|+|+
T Consensus 28 La~fi~lt~yal~r~~~~ 45 (46)
T PF11431_consen 28 LAVFIGLTIYALWRRRRK 45 (46)
T ss_dssp HHHHHHHHHHHHHHHHHS
T ss_pred HHHHHHHHHHHHHHHhcc
Confidence 445555566666666554
No 211
>PF10669 Phage_Gp23: Protein gp23 (Bacteriophage A118); InterPro: IPR018926 This entry is represented by the major tail subunit protein, Gp23 of Listeria phage A118 and prophage found in Bacilli. The function is currently unknown.
Probab=26.36 E-value=1.8e+02 Score=22.94 Aligned_cols=12 Identities=25% Similarity=0.603 Sum_probs=5.2
Q ss_pred HHHHHHHHHHHH
Q psy12432 95 SIAVFLVVILCF 106 (229)
Q Consensus 95 ~V~lLLVlii~v 106 (229)
.|+||++++++.
T Consensus 23 ~i~~FiILLIi~ 34 (121)
T PF10669_consen 23 FIVVFIILLIIT 34 (121)
T ss_pred HHHHHHHHHHHH
Confidence 344444444443
No 212
>TIGR00383 corA magnesium Mg(2+) and cobalt Co(2+) transport protein (corA). The article in Microb Comp Genomics 1998;3(3):151-69 (Medline:98448512) discusses this family and suggests that some members may have functions other than Mg2+ transport.
Probab=26.33 E-value=72 Score=28.57 Aligned_cols=14 Identities=7% Similarity=0.344 Sum_probs=6.4
Q ss_pred HHHHHHHHHHHHhh
Q psy12432 103 ILCFSLYVHCQRRK 116 (229)
Q Consensus 103 ii~vvly~~~kRrk 116 (229)
+++++.|+++|||+
T Consensus 303 ~i~~~~~~~fkrk~ 316 (318)
T TIGR00383 303 VIALGPLIYFRRKG 316 (318)
T ss_pred HHHHHHHHHHHHcC
Confidence 33334455555543
No 213
>PF04156 IncA: IncA protein; InterPro: IPR007285 Chlamydia trachomatis is an obligate intracellular bacterium that develops within a parasitophorous vacuole termed an inclusion. The inclusion is nonfusogenic with lysosomes but intercepts lipids from a host cell exocytic pathway. Initiation of chlamydial development is concurrent with modification of the inclusion membrane by a set of C. trachomatis-encoded proteins collectively designated Incs. One of these Incs, IncA (Inclusion membrane protein A), is functionally associated with the homotypic fusion of inclusions [].
Probab=26.26 E-value=96 Score=25.75 Aligned_cols=11 Identities=18% Similarity=0.129 Sum_probs=4.3
Q ss_pred HHHHHHHHHHH
Q psy12432 100 LVVILCFSLYV 110 (229)
Q Consensus 100 LVlii~vvly~ 110 (229)
+++++.++.++
T Consensus 50 ~vL~~~g~~~~ 60 (191)
T PF04156_consen 50 VVLLSLGLLCL 60 (191)
T ss_pred HHHHHHHHHHH
Confidence 33333333333
No 214
>PHA03231 glycoprotein BALF4; Provisional
Probab=26.21 E-value=1e+02 Score=32.41 Aligned_cols=13 Identities=8% Similarity=0.427 Sum_probs=5.5
Q ss_pred HHHHHHHHHhhhh
Q psy12432 106 FSLYVHCQRRKKQ 118 (229)
Q Consensus 106 vvly~~~kRrk~~ 118 (229)
++++..++|.++.
T Consensus 718 v~v~l~~rr~~~~ 730 (829)
T PHA03231 718 VAVFLAYRRVRRL 730 (829)
T ss_pred hhhhHHHHHHHHH
Confidence 3344444444433
No 215
>PF11174 DUF2970: Protein of unknown function (DUF2970); InterPro: IPR021344 This short family is conserved in Proteobacteria. The function is not known.
Probab=26.15 E-value=1.1e+02 Score=21.32 Aligned_cols=19 Identities=26% Similarity=0.263 Sum_probs=10.3
Q ss_pred HHHHhhhHHHHHHHHHHHH
Q psy12432 87 TASIASGASIAVFLVVILC 105 (229)
Q Consensus 87 ~~~Ia~gI~V~lLLVlii~ 105 (229)
+..|++|++.++++++.+.
T Consensus 31 ~~~Ii~gii~~~~fV~~Lv 49 (56)
T PF11174_consen 31 VHFIIVGIILAALFVAGLV 49 (56)
T ss_pred chHHHHHHHHHHHHHHHHH
Confidence 4556666665555444443
No 216
>PF11743 DUF3301: Protein of unknown function (DUF3301); InterPro: IPR021732 This family is conserved in Proteobacteria, but the function is not known.
Probab=26.03 E-value=90 Score=23.81 Aligned_cols=17 Identities=12% Similarity=0.284 Sum_probs=7.5
Q ss_pred HHHHHHHHHHHHHhhhh
Q psy12432 102 VILCFSLYVHCQRRKKQ 118 (229)
Q Consensus 102 lii~vvly~~~kRrk~~ 118 (229)
+++++++++..+|.++.
T Consensus 8 l~~~~~~~w~~~~~~E~ 24 (97)
T PF11743_consen 8 LALVGWFWWQSRRQRER 24 (97)
T ss_pred HHHHHHHHHHHhhHHHH
Confidence 33334444544444444
No 217
>PF13179 DUF4006: Family of unknown function (DUF4006)
Probab=25.86 E-value=1.4e+02 Score=21.65 Aligned_cols=18 Identities=28% Similarity=0.420 Sum_probs=7.3
Q ss_pred HHHHHHHHHHHHHHhhhh
Q psy12432 101 VVILCFSLYVHCQRRKKQ 118 (229)
Q Consensus 101 Vlii~vvly~~~kRrk~~ 118 (229)
+.|+.++-|+-.+-++..
T Consensus 24 LsIl~~lt~~ai~~Qq~~ 41 (66)
T PF13179_consen 24 LSILAFLTYWAIKVQQEQ 41 (66)
T ss_pred HHHHHHHHHHHHHHHHHH
Confidence 333334444444444433
No 218
>COG3088 CcmH Uncharacterized protein involved in biosynthesis of c-type cytochromes [Posttranslational modification, protein turnover, chaperones]
Probab=25.73 E-value=1.1e+02 Score=25.69 Aligned_cols=18 Identities=11% Similarity=0.102 Sum_probs=8.2
Q ss_pred hhhHHHHHHHHHHHHHHH
Q psy12432 91 ASGASIAVFLVVILCFSL 108 (229)
Q Consensus 91 a~gI~V~lLLVlii~vvl 108 (229)
..+.-+++||+++++++.
T Consensus 108 LW~~Pv~llllG~~~~~~ 125 (153)
T COG3088 108 LWGLPVVLLLLGGVLLVR 125 (153)
T ss_pred HHHhHHHHHHHHHHHHHH
Confidence 344445555444444333
No 219
>TIGR00327 secE_euk_arch protein translocase SEC61 complex gamma subunit, archaeal and eukaryotic. This model describes archaeal SEC61-like and eukaryotic SEC61 but not bacterial secE proteins, for which a Pfam pfam00584 (SecE) has been created.
Probab=25.50 E-value=1.8e+02 Score=20.62 Aligned_cols=26 Identities=19% Similarity=0.316 Sum_probs=15.3
Q ss_pred hHHHHHhhhHHHHHHHHHHHHHHHHH
Q psy12432 85 LETASIASGASIAVFLVVILCFSLYV 110 (229)
Q Consensus 85 ~~~~~Ia~gI~V~lLLVlii~vvly~ 110 (229)
.|...++-...+++++++++..+++.
T Consensus 26 ~Ef~~iak~t~iG~~i~G~IGf~Ikl 51 (61)
T TIGR00327 26 EEYLKVAKVTGIGIIIVGIIGYIIKI 51 (61)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 35555555556666666666665544
No 220
>PF03408 Foamy_virus_ENV: Foamy virus envelope protein ; InterPro: IPR005070 Expression of the envelope (Env) glycoprotein is essential for viral particle egress. This feature is unique to the Spumavirinae, a subclass of the Retroviridae. ; GO: 0019031 viral envelope
Probab=25.47 E-value=42 Score=35.24 Aligned_cols=9 Identities=22% Similarity=0.656 Sum_probs=3.3
Q ss_pred hHHHHHHHH
Q psy12432 93 GASIAVFLV 101 (229)
Q Consensus 93 gI~V~lLLV 101 (229)
||+|.+|++
T Consensus 957 Gi~ii~lii 965 (981)
T PF03408_consen 957 GIVIILLII 965 (981)
T ss_pred HHHHHHHHH
Confidence 333333333
No 221
>PF14163 SieB: Superinfection exclusion protein B
Probab=25.40 E-value=1.7e+02 Score=23.67 Aligned_cols=21 Identities=14% Similarity=0.151 Sum_probs=11.2
Q ss_pred HHHHhhhHHHHHHHHHHHHHH
Q psy12432 87 TASIASGASIAVFLVVILCFS 107 (229)
Q Consensus 87 ~~~Ia~gI~V~lLLVlii~vv 107 (229)
+.+++.-++++.++..++..+
T Consensus 35 ~i~~~fl~s~s~li~~~~~~~ 55 (151)
T PF14163_consen 35 WIGLIFLFSVSYLIAQLLSFI 55 (151)
T ss_pred HHHHHHHHHHHHHHHHHHHHH
Confidence 455555555555555555443
No 222
>PHA03185 UL14 tegument protein; Provisional
Probab=25.22 E-value=88 Score=27.49 Aligned_cols=31 Identities=26% Similarity=0.389 Sum_probs=25.6
Q ss_pred CCccccCCCCCCCCCCCCCCCCCCCCCCCCC
Q psy12432 181 SQLYHQSSPPLPHMSHPPSCTPLPPAPSQPP 211 (229)
Q Consensus 181 ~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~ 211 (229)
.|+|.+.-.-......|++.||+.|+|..|.
T Consensus 171 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 201 (214)
T PHA03185 171 RPTSGPVAGGLPGGGAPAPPTPPRPGPTDPA 201 (214)
T ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCc
Confidence 5788888888888899999999999886553
No 223
>PF00954 S_locus_glycop: S-locus glycoprotein family; InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=25.01 E-value=61 Score=24.63 Aligned_cols=26 Identities=19% Similarity=0.498 Sum_probs=19.3
Q ss_pred CCCCCCCEEEeeccCCCCceEeecCCCCCCC
Q psy12432 35 WYCLNGATCFTVKIGESLLYNCECADGYMGQ 65 (229)
Q Consensus 35 npClNGGtC~~~~~~~~~~~sC~C~~Gy~G~ 65 (229)
+.|-..|.|... ....|.|..||...
T Consensus 84 ~~CG~~g~C~~~-----~~~~C~Cl~GF~P~ 109 (110)
T PF00954_consen 84 GFCGPNGICNSN-----NSPKCSCLPGFEPK 109 (110)
T ss_pred cccCCccEeCCC-----CCCceECCCCcCCC
Confidence 578888999532 25779999999643
No 224
>PF11157 DUF2937: Protein of unknown function (DUF2937); InterPro: IPR022584 This family of proteins with unknown function appears to be found mainly in Proteobacteria.
Probab=25.00 E-value=85 Score=26.38 Aligned_cols=18 Identities=11% Similarity=0.165 Sum_probs=8.1
Q ss_pred HHHHhhhHHHHHHHHHHH
Q psy12432 87 TASIASGASIAVFLVVIL 104 (229)
Q Consensus 87 ~~~Ia~gI~V~lLLVlii 104 (229)
.+++++++.+.+++-+++
T Consensus 137 ~~g~vg~l~~~~l~~~l~ 154 (167)
T PF11157_consen 137 VFGLVGALLGALLVELLL 154 (167)
T ss_pred HHHHHHHHHHHHHHHHHH
Confidence 344455555444443333
No 225
>PF12877 DUF3827: Domain of unknown function (DUF3827); InterPro: IPR024606 The function of the proteins in this entry is not currently known, but one of the human proteins (Q9HCM3 from SWISSPROT) has been implicated in pilocytic astrocytomas [, , ]. In the majority of cases of pilocytic astrocytomas a tandem duplication produces an in-frame fusion of the gene encoding this protein and the BRAF oncogene. The resulting fusion protein has constitutive BRAF kinase activity and is capable of transforming cells.
Probab=24.62 E-value=55 Score=33.48 Aligned_cols=27 Identities=11% Similarity=0.319 Sum_probs=16.5
Q ss_pred HHHHhhhHHHHHHHHHHHHHHHHHHHHH
Q psy12432 87 TASIASGASIAVFLVVILCFSLYVHCQR 114 (229)
Q Consensus 87 ~~~Ia~gI~V~lLLVlii~vvly~~~kR 114 (229)
++++++.+.|++|+|+|+.| .+|++.|
T Consensus 272 I~gVlvPv~vV~~Iiiil~~-~LCRk~K 298 (684)
T PF12877_consen 272 IAGVLVPVLVVLLIIIILYW-KLCRKNK 298 (684)
T ss_pred EehHhHHHHHHHHHHHHHHH-HHhcccc
Confidence 36667777776666665555 5566543
No 226
>PF11359 gpUL132: Glycoprotein UL132; InterPro: IPR021023 Glycoprotein UL132 is a low-abundance structural component of Human herpesvirus 5 []. The function of this protein is not fully understood.
Probab=24.54 E-value=64 Score=28.68 Aligned_cols=30 Identities=30% Similarity=0.499 Sum_probs=14.4
Q ss_pred HHHhhhHHHHHHHHHHHHHHHHHHHHHhhhh
Q psy12432 88 ASIASGASIAVFLVVILCFSLYVHCQRRKKQ 118 (229)
Q Consensus 88 ~~Ia~gI~V~lLLVlii~vvly~~~kRrk~~ 118 (229)
+++++|..++-||++++++ +|.-++|+..+
T Consensus 55 iYcVTg~sllsli~Vtvaa-lYsSC~~~pg~ 84 (235)
T PF11359_consen 55 IYCVTGFSLLSLIVVTVAA-LYSSCCRRPGR 84 (235)
T ss_pred eeeehhHHHHHHHHHHHHH-HHHHHHhCCCc
Confidence 4445555555555555554 44444443333
No 227
>PHA03164 hypothetical protein; Provisional
Probab=24.09 E-value=45 Score=25.06 Aligned_cols=25 Identities=32% Similarity=0.632 Sum_probs=11.4
Q ss_pred HHhhhHHHHHHHHHHHHHHHHHHHHHh
Q psy12432 89 SIASGASIAVFLVVILCFSLYVHCQRR 115 (229)
Q Consensus 89 ~Ia~gI~V~lLLVlii~vvly~~~kRr 115 (229)
.|..|+.++ +++++++++|++.-++
T Consensus 61 lvLtgLaIa--mILfiifvlyvFnVnr 85 (88)
T PHA03164 61 LVLTGLAIA--MILFIIFVLYVFNVNR 85 (88)
T ss_pred HHHHHHHHH--HHHHHHHHHHheeecc
Confidence 334444333 3444445566554443
No 228
>PRK08455 fliL flagellar basal body-associated protein FliL; Reviewed
Probab=23.95 E-value=74 Score=27.10 Aligned_cols=6 Identities=0% Similarity=-0.175 Sum_probs=2.4
Q ss_pred HHHHHH
Q psy12432 106 FSLYVH 111 (229)
Q Consensus 106 vvly~~ 111 (229)
+++|+.
T Consensus 36 ~~~~~~ 41 (182)
T PRK08455 36 VIAMLL 41 (182)
T ss_pred HHHHHH
Confidence 334433
No 229
>COG5353 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=23.88 E-value=45 Score=28.09 Aligned_cols=21 Identities=24% Similarity=0.463 Sum_probs=10.2
Q ss_pred hhhHHHHHHHHHHHHHHHHHH
Q psy12432 91 ASGASIAVFLVVILCFSLYVH 111 (229)
Q Consensus 91 a~gI~V~lLLVlii~vvly~~ 111 (229)
+++|.+++|+++++.+++++.
T Consensus 8 ~i~ii~viflai~~s~~~~~~ 28 (161)
T COG5353 8 IIIIILVIFLAIILSIALFFW 28 (161)
T ss_pred eehhHHHHHHHHHHHHHHHHh
Confidence 444444555555555444443
No 230
>KOG3498|consensus
Probab=23.87 E-value=1.5e+02 Score=21.59 Aligned_cols=25 Identities=20% Similarity=0.276 Sum_probs=16.9
Q ss_pred hhHHHHHhhhHHHHHHHHHHHHHHH
Q psy12432 84 MLETASIASGASIAVFLVVILCFSL 108 (229)
Q Consensus 84 ~~~~~~Ia~gI~V~lLLVlii~vvl 108 (229)
..|+..|+.++.+++++++++..++
T Consensus 30 rKEf~ki~~~~aiGf~~mG~iGf~v 54 (67)
T KOG3498|consen 30 RKEFTKIAKATAIGFVIMGFIGFFV 54 (67)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4566677777777777777776544
No 231
>PF06809 NPDC1: Neural proliferation differentiation control-1 protein (NPDC1); InterPro: IPR009635 This family consists of several neural proliferation differentiation control-1 (NPDC1) proteins. NPDC1 plays a role in the control of neural cell proliferation and differentiation. It has been suggested that NPDC1 may be involved in the development of several secretion glands. This family also contains the C-terminal region of the Caenorhabditis elegans protein CAB-1 (Q93249 from SWISSPROT) which is known to interact with AEX-3 [].; GO: 0016021 integral to membrane
Probab=23.76 E-value=18 Score=33.80 Aligned_cols=9 Identities=11% Similarity=-0.101 Sum_probs=4.7
Q ss_pred ccCCCCcce
Q psy12432 142 ERRPSPADF 150 (229)
Q Consensus 142 ~~~~~p~~~ 150 (229)
.+.-+-+|+
T Consensus 270 QKQQMiamE 278 (341)
T PF06809_consen 270 QKQQMIAME 278 (341)
T ss_pred HHHHHHhhc
Confidence 444555555
No 232
>PRK01026 tetrahydromethanopterin S-methyltransferase subunit G; Provisional
Probab=23.72 E-value=1.6e+02 Score=22.00 Aligned_cols=19 Identities=21% Similarity=0.426 Sum_probs=10.5
Q ss_pred HHHhhhHHHHHHHHHHHHH
Q psy12432 88 ASIASGASIAVFLVVILCF 106 (229)
Q Consensus 88 ~~Ia~gI~V~lLLVlii~v 106 (229)
.+|+-|+++++|++++...
T Consensus 50 iGIlYG~viGlli~~i~~~ 68 (77)
T PRK01026 50 IGILYGLVIGLLIVLVYII 68 (77)
T ss_pred HHHHHHHHHHHHHHHHHHH
Confidence 4566666666665444443
No 233
>PHA03156 hypothetical protein; Provisional
Probab=23.66 E-value=3.1e+02 Score=21.06 Aligned_cols=29 Identities=17% Similarity=0.087 Sum_probs=14.2
Q ss_pred HHHHHhhhHHHHHHHHHHHHHHHHHHHHH
Q psy12432 86 ETASIASGASIAVFLVVILCFSLYVHCQR 114 (229)
Q Consensus 86 ~~~~Ia~gI~V~lLLVlii~vvly~~~kR 114 (229)
.+..|.+-+=+.++++..++.+.|+.++|
T Consensus 55 SFSSIWallN~~i~~~A~~ifL~y~CF~k 83 (90)
T PHA03156 55 TFSSIWAILNGIIFFCASLFFLRHLCFVK 83 (90)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34445443344444455555555555544
No 234
>PF02529 PetG: Cytochrome B6-F complex subunit 5; InterPro: IPR003683 This family consists of cytochrome b6/f complex subunit 5 (PetG). The cytochrome bf complex, found in green plants, eukaryotic algae and cyanobacteria, connects photosystem I to photosystem II in the electron transport chain, functioning as a plastoquinol:plastocyanin/cytochrome c6 oxidoreductase []. The purified complex from the unicellular alga Chlamydomonas reinhardtii contains seven subunits; namely four high molecular weight subunits (cytochrome f, Rieske iron-sulphur protein, cytochrome b6, and subunit IV) and three approximately miniproteins (PetG, PetL, and PetX) []. Stoichiometry measurements are consistent with every subunit being present as two copies per b6/f dimer. The absence of PetG affects either the assembly or stability of the cytochrome bf complex in C. reinhardtii [].; GO: 0009512 cytochrome b6f complex; PDB: 1Q90_G 2ZT9_G 1VF5_G 2D2C_G 2E74_G 2E75_G 2E76_G.
Probab=23.64 E-value=1.7e+02 Score=18.83 Aligned_cols=25 Identities=20% Similarity=0.376 Sum_probs=11.3
Q ss_pred hhhHHHHHHHHHHHH--HHHHHHHHHh
Q psy12432 91 ASGASIAVFLVVILC--FSLYVHCQRR 115 (229)
Q Consensus 91 a~gI~V~lLLVlii~--vvly~~~kRr 115 (229)
..||+++++-+.+.. +..|..++|-
T Consensus 6 L~GiVlGli~vtl~Glfv~Ay~QY~Rg 32 (37)
T PF02529_consen 6 LSGIVLGLIPVTLAGLFVAAYLQYRRG 32 (37)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHCS-
T ss_pred hhhHHHHhHHHHHHHHHHHHHHHHhcc
Confidence 455666655332222 3455555543
No 235
>PF04202 Mfp-3: Foot protein 3; InterPro: IPR007328 Mytilus foot protein-3 (Mfp-3) is a highly polymorphic protein family located in the byssal adhesive plaques of blue mussels.
Probab=23.27 E-value=52 Score=23.98 Aligned_cols=11 Identities=36% Similarity=0.525 Sum_probs=4.8
Q ss_pred HHHHHHHHHHH
Q psy12432 96 IAVFLVVILCF 106 (229)
Q Consensus 96 V~lLLVlii~v 106 (229)
|+|.||||.++
T Consensus 8 VLlaLvLIg~f 18 (71)
T PF04202_consen 8 VLLALVLIGSF 18 (71)
T ss_pred HHHHHHHHhhh
Confidence 34444444443
No 236
>COG3630 OadG Na+-transporting methylmalonyl-CoA/oxaloacetate decarboxylase, gamma subunit [Energy production and conversion]
Probab=23.10 E-value=1.8e+02 Score=22.12 Aligned_cols=20 Identities=15% Similarity=0.353 Sum_probs=9.8
Q ss_pred hhhHHHHHHHHHHHHHHHHH
Q psy12432 91 ASGASIAVFLVVILCFSLYV 110 (229)
Q Consensus 91 a~gI~V~lLLVlii~vvly~ 110 (229)
+.|+.++++++.|++.+++.
T Consensus 15 vlGmg~VflfL~iLi~~~~~ 34 (84)
T COG3630 15 VLGMGFVFLFLSILIYAMRG 34 (84)
T ss_pred HHHHHHHHHHHHHHHHHHHH
Confidence 45555555544444444444
No 237
>TIGR02507 MtrF tetrahydromethanopterin S-methyltransferase, F subunit. coenzyme M methyltransferase in methanogenic archaea. This methyltranferase is membrane-associated enzyme complex that uses methy-transfer reaction to drive sodium-ion pump. Archaea domain, have evolved energy-yielding pathways marked by one-carbon biochemistry featuring novel cofactors and enzymes. This transferase is involved in the transfer of 'methyl' group from N5-methyltetrahydromethanopterin to coenzyme M. In an accompanying reaction, methane is produced by two-electron reduction of the methyl moiety in methyl-coenzyme M by another enzyme methyl-coenzyme M reductase.
Probab=23.09 E-value=1.4e+02 Score=21.62 Aligned_cols=19 Identities=32% Similarity=0.385 Sum_probs=10.8
Q ss_pred HHHHhhhHHHHHHHHHHHH
Q psy12432 87 TASIASGASIAVFLVVILC 105 (229)
Q Consensus 87 ~~~Ia~gI~V~lLLVlii~ 105 (229)
+.++++|..++|++|++-.
T Consensus 42 ~~G~~iG~~~Al~lV~IP~ 60 (65)
T TIGR02507 42 ITGLAYGFLFAVLLVAVPI 60 (65)
T ss_pred hHHHHHHHHHHHHHHHHHH
Confidence 4455666666666544433
No 238
>cd00930 Cyt_c_Oxidase_VIII Cytochrome oxidase c subunit VIII. Cytochrome c oxidase (CcO), the terminal oxidase in the respiratory chains of eukaryotes and most bacteria, is a multi-chain transmembrane protein located in the inner membrane of mitochondria and the cell membrane of prokaryotes. It catalyzes the reduction of O2 and simultaneously pumps protons across the membrane. The number of subunits varies from three to five in bacteria and up to 13 in mammalian mitochondria. Subunits I, II, and III of mammalian CcO are encoded within the mitochondrial genome and the remaining 10 subunits are encoded within the nuclear genome. Found only in eukaryotes, subunit VIII is the smallest of the nuclear-encoded subunits. It exists in muscle-specific and non-muscle-specific isoforms that are differently expressed in different species, suggesting species-specific regulation of energy metabolism.
Probab=23.08 E-value=1.5e+02 Score=19.71 Aligned_cols=23 Identities=17% Similarity=0.103 Sum_probs=10.9
Q ss_pred hhhHHHHHHHHHHHHHHHHHHHH
Q psy12432 91 ASGASIAVFLVVILCFSLYVHCQ 113 (229)
Q Consensus 91 a~gI~V~lLLVlii~vvly~~~k 113 (229)
++|+.+.++-+++...|+....+
T Consensus 16 aigl~~~f~~~L~p~gWVLshL~ 38 (43)
T cd00930 16 AIGLSVFFTTFLLPAGWVLSHLE 38 (43)
T ss_pred HHHHHHHHHHHHhhHHHHHHHHH
Confidence 45555555444444444444433
No 239
>PF05337 CSF-1: Macrophage colony stimulating factor-1 (CSF-1); InterPro: IPR008001 Colony stimulating factor 1 (CSF-1) is a homodimeric polypeptide growth factor whose primary function is to regulate the survival, proliferation, differentiation, and function of cells of the mononuclear phagocytic lineage. This lineage includes mononuclear phagocytic precursors, blood monocytes, tissue macrophages, osteoclasts, and microglia of the brain, all of which possess cell surface receptors for CSF-1. The protein has also been linked with male fertility [] and mutations in the Csf-1 gene have been found to cause osteopetrosis and failure of tooth eruption [].; GO: 0005125 cytokine activity, 0008083 growth factor activity, 0016021 integral to membrane; PDB: 3EJJ_A.
Probab=23.07 E-value=28 Score=32.00 Aligned_cols=14 Identities=21% Similarity=0.218 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHHhh
Q psy12432 103 ILCFSLYVHCQRRK 116 (229)
Q Consensus 103 ii~vvly~~~kRrk 116 (229)
.+..++||++|||.
T Consensus 241 aVGGLLfYr~rrRs 254 (285)
T PF05337_consen 241 AVGGLLFYRRRRRS 254 (285)
T ss_dssp --------------
T ss_pred hccceeeecccccc
Confidence 33344545555444
No 240
>PHA03286 envelope glycoprotein E; Provisional
Probab=23.06 E-value=1.2e+02 Score=29.97 Aligned_cols=9 Identities=11% Similarity=0.718 Sum_probs=3.4
Q ss_pred HHHHHHHHH
Q psy12432 98 VFLVVILCF 106 (229)
Q Consensus 98 lLLVlii~v 106 (229)
++||+++.+
T Consensus 401 ~~~~~~~~~ 409 (492)
T PHA03286 401 ILVVLLFAL 409 (492)
T ss_pred HHHHHHHHH
Confidence 333333333
No 241
>KOG1025|consensus
Probab=22.86 E-value=1.1e+02 Score=32.84 Aligned_cols=8 Identities=50% Similarity=1.157 Sum_probs=4.8
Q ss_pred cCCCCCCC
Q psy12432 58 CADGYMGQ 65 (229)
Q Consensus 58 C~~Gy~G~ 65 (229)
|+.|..|.
T Consensus 571 CP~G~~G~ 578 (1177)
T KOG1025|consen 571 CPDGVTGP 578 (1177)
T ss_pred CCCcccCC
Confidence 66665554
No 242
>PF13974 YebO: YebO-like protein
Probab=22.81 E-value=94 Score=23.36 Aligned_cols=13 Identities=23% Similarity=0.508 Sum_probs=5.5
Q ss_pred HHHHHHHHHHHHH
Q psy12432 96 IAVFLVVILCFSL 108 (229)
Q Consensus 96 V~lLLVlii~vvl 108 (229)
+.++|+++++|++
T Consensus 5 ~~~~lv~livWFF 17 (80)
T PF13974_consen 5 VLVLLVGLIVWFF 17 (80)
T ss_pred HHHHHHHHHHHHH
Confidence 3444444444433
No 243
>PF07438 DUF1514: Protein of unknown function (DUF1514); InterPro: IPR009999 This entry is represented by Bacteriophage phi PVL, Orf60. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This family consists of several Staphylococcus aureus and related bacteriophage proteins of around 65 residues in length. The function of this family is unknown.
Probab=22.68 E-value=84 Score=22.76 Aligned_cols=10 Identities=20% Similarity=0.288 Sum_probs=4.0
Q ss_pred hhHHHHHHHH
Q psy12432 92 SGASIAVFLV 101 (229)
Q Consensus 92 ~gI~V~lLLV 101 (229)
+.|+++++++
T Consensus 5 iSIvLai~lL 14 (66)
T PF07438_consen 5 ISIVLAIALL 14 (66)
T ss_pred HHHHHHHHHH
Confidence 3344444433
No 244
>PRK11901 hypothetical protein; Reviewed
Probab=22.66 E-value=67 Score=30.15 Aligned_cols=21 Identities=10% Similarity=0.025 Sum_probs=13.8
Q ss_pred HHHHhhhHHHHHHHHHHHHHHH
Q psy12432 87 TASIASGASIAVFLVVILCFSL 108 (229)
Q Consensus 87 ~~~Ia~gI~V~lLLVlii~vvl 108 (229)
+++|. .++++||||.|...+-
T Consensus 39 MiGiG-ilVLlLLIi~IgSALk 59 (327)
T PRK11901 39 MIGIG-ILVLLLLIIAIGSALK 59 (327)
T ss_pred HHHHH-HHHHHHHHHHHhhhcc
Confidence 34443 4667788888888774
No 245
>PF01372 Melittin: Melittin; InterPro: IPR002116 Allergies are hypersensitivity reactions of the immune system to specific substances called allergens (such as pollen, stings, drugs, or food) that, in most people, result in no symptoms. A nomenclature system has been established for antigens (allergens) that cause IgE-mediated atopic allergies in humans [WHO/IUIS Allergen Nomenclature Subcommittee King T.P., Hoffmann D., Loewenstein H., Marsh D.G., Platts-Mills T.A.E., Thomas W. Bull. World Health Organ. 72:797-806(1994)]. This nomenclature system is defined by a designation that is composed of the first three letters of the genus; a space; the first letter of the species name; a space and an Arabic number. In the event that two species names have identical designations, they are discriminated from one another by adding one or more letters (as necessary) to each species designation. The allergens in this family include allergens with the following designations: Api m 3. Melittin is the principal protein component of the venom of the honeybee, Apis mellifera. It inhibits protein kinase C, Ca2+/calmodulin-dependent protein kinase II, myosin light chain kinase and Na+/K+-ATPase (synaptosomal membrane) and is a cell membrane lytic factor. Melittin is a small peptide with no disulphide bridge; the N-terminal part of the molecule is predominantly hydrophobic and the C-terminal part is hydrophilic and strongly basic. The molecular mechanisms underlying the various effects of melittin on membranes have not been completely defined and much of the evidence indicates that different molecular mechanisms may underlie different actions of the peptide []. Extensive work with melittin has shown that the venom has multiple effects, probably, as a result of its interaction with negatively changed phospholipids. It inhibits well known transport pumps such as the Na+-K+-ATPase and the H+-K+-ATPase. Melittin increases the permeability of cell membranes to ions, particularly Na+ and indirectly Ca2+, because of the Na+-Ca2+-exchange. This effect results in marked morphological and functional changes, particularly in excitable tissues such as cardiac myocytes. In some other tissues, e.g., cornea, not only Na+ but Cl- permeability is also increased by melittin. Similar effects to melittin on H+-K+-ATPase have been found with the synthetic amphipathic polypeptide Trp-3 []. The study of melittin in model membranes has been useful for the development of methodology for determination of membrane protein structures. A molecular dynamics simulation of melittin in a hydrated dipalmitoylphosphatidylcholine (DPPC) bilayer was carried out. The effect of melittin on the surrounding membrane was localised to its immediate vicinity, and its asymmetry with respect to the two layers may be a result of the fact that it is not fully transmembranal. Melittin's hydrophilic C terminus anchors it at the extracellular interface, leaving the N terminus "loose" in the lower layer of the membrane [].; GO: 0004860 protein kinase inhibitor activity, 0005576 extracellular region; PDB: 3QRX_B 2MLT_A 1BH1_A.
Probab=22.62 E-value=1.8e+02 Score=17.21 Aligned_cols=6 Identities=17% Similarity=0.678 Sum_probs=2.2
Q ss_pred HHHHHH
Q psy12432 109 YVHCQR 114 (229)
Q Consensus 109 y~~~kR 114 (229)
|++.+|
T Consensus 19 WIK~kr 24 (26)
T PF01372_consen 19 WIKNKR 24 (26)
T ss_dssp HHHHHH
T ss_pred HHHHHh
Confidence 333333
No 246
>PF06692 MNSV_P7B: Melon necrotic spot virus P7B protein; InterPro: IPR009575 This family consists of several Melon necrotic spot virus (MNSV) P7B proteins. The function of this family is unknown.
Probab=22.55 E-value=93 Score=21.96 Aligned_cols=6 Identities=33% Similarity=0.877 Sum_probs=2.3
Q ss_pred HHHHHH
Q psy12432 100 LVVILC 105 (229)
Q Consensus 100 LVlii~ 105 (229)
|++|++
T Consensus 18 Liliis 23 (61)
T PF06692_consen 18 LILIIS 23 (61)
T ss_pred HHHHHH
Confidence 333333
No 247
>PRK05886 yajC preprotein translocase subunit YajC; Validated
Probab=22.53 E-value=1.3e+02 Score=23.76 Aligned_cols=10 Identities=0% Similarity=-0.097 Sum_probs=5.9
Q ss_pred HHHhhhhhhh
Q psy12432 112 CQRRKKQAQA 121 (229)
Q Consensus 112 ~kRrk~~k~~ 121 (229)
+||+|+.+.+
T Consensus 26 kKr~K~~~~m 35 (109)
T PRK05886 26 RKAMQATIDL 35 (109)
T ss_pred HHHHHHHHHH
Confidence 4566666654
No 248
>PRK03814 oxaloacetate decarboxylase subunit gamma; Provisional
Probab=22.38 E-value=1.8e+02 Score=21.87 Aligned_cols=8 Identities=13% Similarity=0.414 Sum_probs=3.5
Q ss_pred hhhHHHHH
Q psy12432 91 ASGASIAV 98 (229)
Q Consensus 91 a~gI~V~l 98 (229)
++|.++++
T Consensus 16 ~~GM~~VF 23 (85)
T PRK03814 16 LTGMGVVF 23 (85)
T ss_pred HHHHHHHH
Confidence 44444433
No 249
>PHA03291 envelope glycoprotein I; Provisional
Probab=22.12 E-value=46 Score=31.74 Aligned_cols=30 Identities=27% Similarity=0.285 Sum_probs=13.0
Q ss_pred HHhhhHHHHHHHHHHHHHHHHHHHHHhhhh
Q psy12432 89 SIASGASIAVFLVVILCFSLYVHCQRRKKQ 118 (229)
Q Consensus 89 ~Ia~gI~V~lLLVlii~vvly~~~kRrk~~ 118 (229)
-|++-..++++|+|.-|+....++.||+++
T Consensus 290 QiAIPasii~cV~lGSC~Ccl~R~~rRr~r 319 (401)
T PHA03291 290 QIAIPASIIACVFLGSCACCLHRRCRRRRR 319 (401)
T ss_pred eeccchHHHHHhhhhhhhhhhhhhhhcccC
Confidence 344444444444444444333344443333
No 250
>COG0598 CorA Mg2+ and Co2+ transporters [Inorganic ion transport and metabolism]
Probab=22.02 E-value=87 Score=28.59 Aligned_cols=20 Identities=30% Similarity=0.637 Sum_probs=9.8
Q ss_pred HHHHHHHHHHHHHHHHHHhh
Q psy12432 97 AVFLVVILCFSLYVHCQRRK 116 (229)
Q Consensus 97 ~lLLVlii~vvly~~~kRrk 116 (229)
++++++++++++|++++||+
T Consensus 301 ~l~~m~~~~~~~~~~frrk~ 320 (322)
T COG0598 301 ALILMLLLALLLYLYFRRKG 320 (322)
T ss_pred HHHHHHHHHHHHHHHHHhcC
Confidence 33344444455555555554
No 251
>PF13120 DUF3974: Domain of unknown function (DUF3974)
Probab=21.86 E-value=79 Score=24.84 Aligned_cols=20 Identities=20% Similarity=0.180 Sum_probs=10.2
Q ss_pred HHHHHHHHHHHHHHHHHhhh
Q psy12432 98 VFLVVILCFSLYVHCQRRKK 117 (229)
Q Consensus 98 lLLVlii~vvly~~~kRrk~ 117 (229)
+||+++.++++.+++-|+.+
T Consensus 14 llligftivvl~vyfgrk~y 33 (126)
T PF13120_consen 14 LLLIGFTIVVLLVYFGRKFY 33 (126)
T ss_pred HHHHHHHHHhhhheecceee
Confidence 34455555555555555443
No 252
>PF11857 DUF3377: Domain of unknown function (DUF3377); InterPro: IPR021805 This domain is functionally uncharacterised and found at the C terminus of peptidases belonging to MEROPS peptidase family M10A, membrane-type matrix metallopeptidases (clan MA). ; GO: 0004222 metalloendopeptidase activity
Probab=21.86 E-value=36 Score=25.26 Aligned_cols=11 Identities=9% Similarity=0.277 Sum_probs=4.5
Q ss_pred HHHHHhhhhhh
Q psy12432 110 VHCQRRKKQAQ 120 (229)
Q Consensus 110 ~~~kRrk~~k~ 120 (229)
++++|+--.++
T Consensus 52 ~~fkrkGtPr~ 62 (74)
T PF11857_consen 52 FQFKRKGTPRR 62 (74)
T ss_pred heeeecCCCcE
Confidence 33444433333
No 253
>PF14017 DUF4233: Protein of unknown function (DUF4233)
Probab=21.78 E-value=2.6e+02 Score=21.85 Aligned_cols=21 Identities=14% Similarity=0.195 Sum_probs=12.1
Q ss_pred HHHHHHHHHHHHHHhhhhhhh
Q psy12432 101 VVILCFSLYVHCQRRKKQAQA 121 (229)
Q Consensus 101 Vlii~vvly~~~kRrk~~k~~ 121 (229)
+++..++.|..+.|++-.+++
T Consensus 84 ~iF~~~W~~~l~lg~~i~~~~ 104 (107)
T PF14017_consen 84 VIFAAVWWYALYLGRRIDRRM 104 (107)
T ss_pred HHHHHHHHHHHHHHHHHHHHH
Confidence 444555666666666655543
No 254
>PTZ00234 variable surface protein Vir12; Provisional
Probab=21.75 E-value=57 Score=31.70 Aligned_cols=14 Identities=14% Similarity=0.467 Sum_probs=6.7
Q ss_pred HHHHHHHHHHHHHH
Q psy12432 97 AVFLVVILCFSLYV 110 (229)
Q Consensus 97 ~lLLVlii~vvly~ 110 (229)
++-||++|.+++|+
T Consensus 370 ~~ailGtifFlfyy 383 (433)
T PTZ00234 370 GASIIGVLVFLFFF 383 (433)
T ss_pred HHHHHHHHHHhhhh
Confidence 33344555555554
No 255
>COG4736 CcoQ Cbb3-type cytochrome oxidase, subunit 3 [Posttranslational modification, protein turnover, chaperones]
Probab=21.56 E-value=1.9e+02 Score=20.57 Aligned_cols=32 Identities=13% Similarity=-0.050 Sum_probs=22.5
Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHhhhhhhhc
Q psy12432 91 ASGASIAVFLVVILCFSLYVHCQRRKKQAQAA 122 (229)
Q Consensus 91 a~gI~V~lLLVlii~vvly~~~kRrk~~k~~~ 122 (229)
...+-...+++++++++.++.+.=++.+|+..
T Consensus 7 ~~~a~a~~t~~~~l~fiavi~~ayr~~~K~~~ 38 (60)
T COG4736 7 RGFADAWGTIAFTLFFIAVIYFAYRPGKKGEF 38 (60)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcccchhhH
Confidence 34445566677777788888888877777664
No 256
>PF07178 TraL: TraL protein; InterPro: IPR009838 This entry represents bacterial TraL proteins. TraL is a predicted peripheral membrane protein involved in bacterial sex pilus assembly []. TraL is part of the type IV secretion system for conjugative plasmid transfer []. The exact function of TraL is unknown.; GO: 0000746 conjugation, 0019867 outer membrane
Probab=21.53 E-value=1.4e+02 Score=22.46 Aligned_cols=23 Identities=9% Similarity=0.147 Sum_probs=16.3
Q ss_pred HHHHHHHHHHHHHHHHHhhhhhh
Q psy12432 98 VFLVVILCFSLYVHCQRRKKQAQ 120 (229)
Q Consensus 98 lLLVlii~vvly~~~kRrk~~k~ 120 (229)
.+++.++++..|-+.|+++.+.-
T Consensus 42 g~i~g~~~~~~~~k~K~~~~~g~ 64 (95)
T PF07178_consen 42 GLILGIVLWWGYRKFKKGRGRGF 64 (95)
T ss_pred HHHHHHHHHHHHHHHHccCCccH
Confidence 34566667888888888877763
No 257
>PF11085 YqhR: Conserved membrane protein YqhR; InterPro: IPR024563 This family of proteins is conserved in the Bacillaceae family of the Firmicutes. Their function is not known.
Probab=21.46 E-value=1.7e+02 Score=25.12 Aligned_cols=28 Identities=7% Similarity=0.083 Sum_probs=13.2
Q ss_pred HHHHhhhHHHHHHHHHHHHHHHHHHHHH
Q psy12432 87 TASIASGASIAVFLVVILCFSLYVHCQR 114 (229)
Q Consensus 87 ~~~Ia~gI~V~lLLVlii~vvly~~~kR 114 (229)
+++.++|+++..++-++.+++.|...+|
T Consensus 67 ~~G~~igi~~~gv~Si~aAllY~~~l~k 94 (173)
T PF11085_consen 67 WLGNLIGIVFIGVFSIVAALLYYALLKK 94 (173)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4455555544444444444444444444
No 258
>PF05961 Chordopox_A13L: Chordopoxvirus A13L protein; InterPro: IPR009236 This family consists of A13L proteins from the Chordopoxviruses. A13L or p8 is one of the three most abundant membrane proteins of the intracellular mature Vaccinia virus [].
Probab=21.42 E-value=1.2e+02 Score=22.22 Aligned_cols=9 Identities=44% Similarity=0.634 Sum_probs=3.8
Q ss_pred HHHHHHHhh
Q psy12432 108 LYVHCQRRK 116 (229)
Q Consensus 108 ly~~~kRrk 116 (229)
+|-.+.|++
T Consensus 19 lY~iYnr~~ 27 (68)
T PF05961_consen 19 LYGIYNRKK 27 (68)
T ss_pred HHHHHhccc
Confidence 344444433
No 259
>TIGR01495 ETRAMP Plasmodium ring stage membrane protein ETRAMP. These genes have been shown to be found in the sub-telomeric regions of both P. falciparum and P. yoelii chromosomes.
Probab=21.42 E-value=81 Score=23.81 Aligned_cols=13 Identities=15% Similarity=0.266 Sum_probs=4.7
Q ss_pred HHHHHHHHHHHHH
Q psy12432 97 AVFLVVILCFSLY 109 (229)
Q Consensus 97 ~lLLVlii~vvly 109 (229)
++|+..++.+.+|
T Consensus 62 a~lv~t~~G~g~y 74 (85)
T TIGR01495 62 ALLVGAGVGLGYY 74 (85)
T ss_pred HHHHHHHHHHhhh
Confidence 3333333333333
No 260
>PF07226 DUF1422: Protein of unknown function (DUF1422); InterPro: IPR009867 This family consists of several hypothetical bacterial proteins of around 120 residues in length. The function of this family is unknown.
Probab=21.36 E-value=99 Score=24.87 Aligned_cols=20 Identities=30% Similarity=0.306 Sum_probs=10.1
Q ss_pred HHHHHHHHHHHHHHHHHHhh
Q psy12432 97 AVFLVVILCFSLYVHCQRRK 116 (229)
Q Consensus 97 ~lLLVlii~vvly~~~kRrk 116 (229)
.+++.+++++|++.+.+.+|
T Consensus 96 p~il~l~L~~Wi~~kl~~~k 115 (117)
T PF07226_consen 96 PSILCLILVFWIGYKLGFRK 115 (117)
T ss_pred HHHHHHHHHHHHHHHHhhhc
Confidence 34455555555555554443
No 261
>PF05827 ATP-synt_S1: Vacuolar ATP synthase subunit S1 (ATP6S1); InterPro: IPR024722 This family consists of metazoan vacuolar ATP synthase subunit S1 proteins [] and fungal proteins belonging to the BIG family. In Candida albicans BIG is required for normal beta-1,6-glucan synthesis, hyphal morphogenesis, adhesion and virulence [].
Probab=21.06 E-value=1e+02 Score=27.44 Aligned_cols=23 Identities=30% Similarity=0.390 Sum_probs=14.1
Q ss_pred HHHhhhHHHHHHHHHHHHHHHHH
Q psy12432 88 ASIASGASIAVFLVVILCFSLYV 110 (229)
Q Consensus 88 ~~Ia~gI~V~lLLVlii~vvly~ 110 (229)
-+|.+|++++++|+.|+.+.+.+
T Consensus 257 pgi~mglii~~~ll~IL~~gl~~ 279 (282)
T PF05827_consen 257 PGIWMGLIISLVLLSILYVGLSM 279 (282)
T ss_pred ccHHHHHHHHHHHHHHHHHHHHH
Confidence 45666776666666666655443
No 262
>COG3105 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=20.85 E-value=1.6e+02 Score=24.23 Aligned_cols=13 Identities=8% Similarity=0.077 Sum_probs=6.1
Q ss_pred HHHHhhhHHHHHH
Q psy12432 87 TASIASGASIAVF 99 (229)
Q Consensus 87 ~~~Ia~gI~V~lL 99 (229)
|.++++|++|+|+
T Consensus 8 W~~a~igLvvGi~ 20 (138)
T COG3105 8 WEYALIGLVVGII 20 (138)
T ss_pred HHHHHHHHHHHHH
Confidence 4444554444443
No 263
>PF13706 PepSY_TM_3: PepSY-associated TM helix
Probab=20.79 E-value=1.8e+02 Score=18.15 Aligned_cols=12 Identities=33% Similarity=0.559 Sum_probs=5.1
Q ss_pred hhHHHHHHHHHH
Q psy12432 92 SGASIAVFLVVI 103 (229)
Q Consensus 92 ~gI~V~lLLVli 103 (229)
+|+.++++++++
T Consensus 11 ~Gl~~g~~l~~~ 22 (37)
T PF13706_consen 11 LGLILGLLLFVI 22 (37)
T ss_pred HHHHHHHHHHHH
Confidence 344444443333
No 264
>PRK09546 zntB zinc transporter; Reviewed
Probab=20.63 E-value=1.1e+02 Score=27.92 Aligned_cols=10 Identities=10% Similarity=-0.134 Sum_probs=4.4
Q ss_pred HHHHHHHHHh
Q psy12432 106 FSLYVHCQRR 115 (229)
Q Consensus 106 vvly~~~kRr 115 (229)
+++++++|||
T Consensus 312 ~~~~~~fkrk 321 (324)
T PRK09546 312 GGVAWWLKRS 321 (324)
T ss_pred HHHHHHHHhc
Confidence 3344444444
No 265
>PRK02919 oxaloacetate decarboxylase subunit gamma; Provisional
Probab=20.60 E-value=2.7e+02 Score=20.84 Aligned_cols=6 Identities=17% Similarity=0.163 Sum_probs=2.7
Q ss_pred hhhHHH
Q psy12432 91 ASGASI 96 (229)
Q Consensus 91 a~gI~V 96 (229)
++|.++
T Consensus 15 vlGMg~ 20 (82)
T PRK02919 15 FLGMGF 20 (82)
T ss_pred HHHHHH
Confidence 444444
No 266
>PRK04778 septation ring formation regulator EzrA; Provisional
Probab=20.56 E-value=1.4e+02 Score=29.73 Aligned_cols=22 Identities=5% Similarity=0.054 Sum_probs=9.1
Q ss_pred hhhHHHHHHHHHHHHHHHHHHH
Q psy12432 91 ASGASIAVFLVVILCFSLYVHC 112 (229)
Q Consensus 91 a~gI~V~lLLVlii~vvly~~~ 112 (229)
+++|+|++++++++++++=-+.
T Consensus 6 ii~i~ii~i~~~~~~~~~rr~~ 27 (569)
T PRK04778 6 IIAIVVIIIIAYLAGLILRKRN 27 (569)
T ss_pred HHHHHHHHHHHHHHHHHHHHHH
Confidence 3344444444443444444433
No 267
>PF10251 PEN-2: Presenilin enhancer-2 subunit of gamma secretase; InterPro: IPR019379 This entry is a short, 101 peptide protein, which is the smallest subunit of the gamma-secretase aspartyl protease complex. It catalyses the intra-membrane cleavage of a subset of type I transmembrane proteins. The other active constituents of the complex are presenilin (PS) nicastrin and anterior pharynx defective-1 (APH-1) protein. Presenilin enhancer-2 (PEN-2) adopts a hairpin orientation in the membrane with its N- and C-terminal domains facing the luminal/extracellular space. The C-terminal domain maintains PS stability within the complex [].
Probab=20.54 E-value=2.8e+02 Score=21.43 Aligned_cols=28 Identities=25% Similarity=0.374 Sum_probs=18.6
Q ss_pred HHHHhhhHHHHHHHHHHHHHHHHHHHHH
Q psy12432 87 TASIASGASIAVFLVVILCFSLYVHCQR 114 (229)
Q Consensus 87 ~~~Ia~gI~V~lLLVlii~vvly~~~kR 114 (229)
...+..+|+.++.+++++.+++.+..+|
T Consensus 50 ~YVi~SaiG~~vw~v~l~~W~~~Fq~~R 77 (94)
T PF10251_consen 50 KYVIRSAIGFLVWTVVLISWILIFQINR 77 (94)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4555566666677777777777776644
No 268
>PRK11056 hypothetical protein; Provisional
Probab=20.48 E-value=98 Score=24.99 Aligned_cols=21 Identities=38% Similarity=0.436 Sum_probs=10.6
Q ss_pred HHHHHHHHHHHHHHHHHHhhh
Q psy12432 97 AVFLVVILCFSLYVHCQRRKK 117 (229)
Q Consensus 97 ~lLLVlii~vvly~~~kRrk~ 117 (229)
.+++.+++++|++.+.+.+|+
T Consensus 96 p~il~~~L~~Wi~~kl~~~~~ 116 (120)
T PRK11056 96 PAVLSVILVFWIGRKLRNRKK 116 (120)
T ss_pred HHHHHHHHHHHHHHHHhcccc
Confidence 344555555566555554433
No 269
>PF10855 DUF2648: Protein of unknown function (DUF2648); InterPro: IPR022561 This family of proteins with unknown function appears to be restricted to eubacteia.
Probab=20.40 E-value=73 Score=19.96 Aligned_cols=15 Identities=13% Similarity=0.184 Sum_probs=6.9
Q ss_pred HHHHHHHHHHHHHhh
Q psy12432 102 VILCFSLYVHCQRRK 116 (229)
Q Consensus 102 lii~vvly~~~kRrk 116 (229)
+++....|+-+||.+
T Consensus 8 L~l~ga~f~~fKKyQ 22 (33)
T PF10855_consen 8 LILGGAAFYGFKKYQ 22 (33)
T ss_pred hhhhhHHHHHHHHHH
Confidence 333444555555544
No 270
>PF15069 FAM163: FAM163 family
Probab=20.36 E-value=1e+02 Score=25.64 Aligned_cols=27 Identities=19% Similarity=0.469 Sum_probs=11.5
Q ss_pred HHHhhhHHHHHHHHHHH-HHHHHHHHHH
Q psy12432 88 ASIASGASIAVFLVVIL-CFSLYVHCQR 114 (229)
Q Consensus 88 ~~Ia~gI~V~lLLVlii-~vvly~~~kR 114 (229)
.+|+..++++.+|+++. |=+-|+.+||
T Consensus 10 GgILAtVILLcIIaVLCYCRLQYYCCKK 37 (143)
T PF15069_consen 10 GGILATVILLCIIAVLCYCRLQYYCCKK 37 (143)
T ss_pred chHHHHHHHHHHHHHHHHHhhHHHHhhc
Confidence 44444444444333333 3344444554
No 271
>PF02285 COX8: Cytochrome oxidase c subunit VIII; InterPro: IPR003205 Cytochrome c oxidase (1.9.3.1 from EC) is an oligomeric enzymatic complex which is a component of the respiratory chain complex and is involved in the transfer of electrons from cytochrome c to oxygen []. In eukaryotes this enzyme complex is located in the mitochondrial inner membrane; in aerobic prokaryotes it is found in the plasma membrane. In eukaryotes, in addition to the three large subunits, I, II and III, that form the catalytic centre of the enzyme complex, there are a variable number of small polypeptidic subunits.This family is composed of cytochrome c oxidase subunit VIII. ; GO: 0004129 cytochrome-c oxidase activity; PDB: 3AG3_Z 3ABM_M 1OCC_Z 3ASO_Z 3AG2_Z 3ABL_M 3AG4_M 3AG1_M 3ASN_M 1OCZ_M ....
Probab=20.20 E-value=2.6e+02 Score=18.63 Aligned_cols=21 Identities=14% Similarity=0.064 Sum_probs=9.5
Q ss_pred hhhHHHHHHHHHHHHHHHHHH
Q psy12432 91 ASGASIAVFLVVILCFSLYVH 111 (229)
Q Consensus 91 a~gI~V~lLLVlii~vvly~~ 111 (229)
++|+.+.++-+++-..|+...
T Consensus 16 aigltv~f~~~L~PagWVLsh 36 (44)
T PF02285_consen 16 AIGLTVCFVTFLGPAGWVLSH 36 (44)
T ss_dssp HHHHHHHHHHHHHHHHHHHHT
T ss_pred HHHHHHHHHHHHhhHHHHHHH
Confidence 445555544444444444333
No 272
>PF02937 COX6C: Cytochrome c oxidase subunit VIc; InterPro: IPR004204 Cytochrome c oxidase, a 13 subunit complex, 1.9.3.1 from EC is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit VIc.; GO: 0004129 cytochrome-c oxidase activity; PDB: 3AG4_I 2DYS_V 3ASO_I 2EIK_V 2EIM_I 1OCC_V 1V54_V 1OCO_V 3ASN_V 2EIL_I ....
Probab=20.12 E-value=2.7e+02 Score=20.40 Aligned_cols=11 Identities=18% Similarity=0.223 Sum_probs=4.1
Q ss_pred HHHHHHHHHHH
Q psy12432 95 SIAVFLVVILC 105 (229)
Q Consensus 95 ~V~lLLVlii~ 105 (229)
+++++|.+++.
T Consensus 22 ~~a~~ls~~~~ 32 (73)
T PF02937_consen 22 VVAFVLSLGVA 32 (73)
T ss_dssp HHHHHHHHHHH
T ss_pred HHHHHHHHHHH
Confidence 33333333333
No 273
>PF04341 DUF485: Protein of unknown function, DUF485; InterPro: IPR007436 This family includes several putative integral membrane proteins.
Probab=20.08 E-value=1.6e+02 Score=22.05 Aligned_cols=20 Identities=20% Similarity=0.271 Sum_probs=11.4
Q ss_pred HHHHHHHHHHHHHHHHHHHh
Q psy12432 96 IAVFLVVILCFSLYVHCQRR 115 (229)
Q Consensus 96 V~lLLVlii~vvly~~~kRr 115 (229)
++++++..++.++|.++.++
T Consensus 60 ~~~~~~~~~l~~~Yv~~An~ 79 (91)
T PF04341_consen 60 LGQIVFAWVLTWLYVRRANR 79 (91)
T ss_pred HHHHHHHHHHHHHHHHHHcc
Confidence 34445555556677766655
No 274
>PF13955 Fst_toxin: Toxin Fst, type I toxin-antitoxin system; PDB: 2KV5_A.
Probab=20.07 E-value=1.7e+02 Score=16.65 Aligned_cols=11 Identities=27% Similarity=0.365 Sum_probs=4.8
Q ss_pred HhhhHHHHHHH
Q psy12432 90 IASGASIAVFL 100 (229)
Q Consensus 90 Ia~gI~V~lLL 100 (229)
|+..+.|++++
T Consensus 4 iIaPi~VGvvl 14 (21)
T PF13955_consen 4 IIAPIVVGVVL 14 (21)
T ss_dssp HHHHHHHHHHH
T ss_pred ehhhHHHHHHH
Confidence 34444444443
No 275
>PRK10905 cell division protein DamX; Validated
Probab=20.05 E-value=67 Score=30.16 Aligned_cols=16 Identities=6% Similarity=-0.060 Sum_probs=11.9
Q ss_pred hhHHHHHHHHHHHHHH
Q psy12432 92 SGASIAVFLVVILCFS 107 (229)
Q Consensus 92 ~gI~V~lLLVlii~vv 107 (229)
+.++++||||.|...+
T Consensus 5 GilVLlLLIigIgSAL 20 (328)
T PRK10905 5 GILVLLLLIIGIGSAL 20 (328)
T ss_pred hHHHHHHHHHHHhHhh
Confidence 3467788888888864
Done!