Query psy592
Match_columns 339
No_of_seqs 279 out of 1723
Neff 7.8
Searched_HMMs 46136
Date Fri Aug 16 21:49:32 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy592.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/592hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF01049 Cadherin_C: Cadherin 100.0 7E-35 1.5E-39 246.1 2.8 111 228-338 13-126 (149)
2 KOG3514|consensus 99.8 2.2E-20 4.8E-25 190.0 12.5 166 1-174 587-762 (1591)
3 KOG4289|consensus 99.8 2.3E-20 5.1E-25 193.5 10.6 152 1-162 1469-1621(2531)
4 KOG3516|consensus 99.7 1.2E-16 2.7E-21 165.5 14.3 151 1-162 925-1097(1306)
5 KOG4289|consensus 99.6 4E-16 8.7E-21 162.5 10.0 126 27-162 1233-1401(2531)
6 KOG1219|consensus 99.6 2E-14 4.3E-19 154.7 17.8 137 13-162 3621-3766(4289)
7 KOG3514|consensus 99.6 2.3E-14 5E-19 146.7 11.8 167 2-175 985-1164(1591)
8 KOG3516|consensus 99.2 3.9E-10 8.5E-15 117.9 14.3 75 98-175 810-890 (1306)
9 cd00110 LamG Laminin G domain; 99.1 5.7E-10 1.2E-14 93.6 11.0 81 78-162 2-83 (151)
10 PF00054 Laminin_G_1: Laminin 98.9 3.9E-09 8.5E-14 87.4 8.7 68 105-175 1-73 (131)
11 smart00282 LamG Laminin G doma 98.8 3.4E-08 7.4E-13 81.7 9.7 62 98-162 2-65 (135)
12 KOG1219|consensus 98.7 1.9E-08 4E-13 110.1 5.6 77 1-81 3825-3909(4289)
13 PF02210 Laminin_G_2: Laminin 98.5 6E-07 1.3E-11 72.7 8.9 55 105-162 1-57 (128)
14 PF00008 EGF: EGF-like domain 98.4 1.1E-07 2.4E-12 59.0 0.9 31 36-70 1-32 (32)
15 smart00179 EGF_CA Calcium-bind 98.0 9.7E-06 2.1E-10 51.9 3.9 35 34-72 3-39 (39)
16 cd00054 EGF_CA Calcium-binding 97.8 3.2E-05 6.9E-10 48.9 4.0 35 34-72 3-38 (38)
17 cd00053 EGF Epidermal growth f 97.6 8.3E-05 1.8E-09 46.2 3.8 29 44-72 7-36 (36)
18 smart00181 EGF Epidermal growt 97.5 0.00014 3E-09 45.5 3.7 28 44-72 7-35 (35)
19 PF07974 EGF_2: EGF-like domai 97.2 0.00035 7.6E-09 43.1 3.0 26 44-71 7-32 (32)
20 PF07645 EGF_CA: Calcium-bindi 97.2 0.00017 3.7E-09 47.4 1.5 33 33-67 2-34 (42)
21 PF12661 hEGF: Human growth fa 97.0 0.00021 4.6E-09 34.9 0.3 13 59-71 1-13 (13)
22 PF12947 EGF_3: EGF domain; I 96.0 0.0027 5.9E-08 40.3 1.0 27 43-69 6-32 (36)
23 smart00210 TSPN Thrombospondin 95.9 0.071 1.5E-06 46.6 9.8 73 97-175 52-128 (184)
24 PHA02887 EGF-like protein; Pro 95.5 0.011 2.3E-07 47.2 2.6 30 44-74 93-124 (126)
25 PHA03099 epidermal growth fact 95.2 0.021 4.6E-07 46.3 3.4 30 44-74 52-83 (139)
26 KOG3509|consensus 93.8 0.051 1.1E-06 58.2 3.4 70 1-75 375-444 (964)
27 PF12955 DUF3844: Domain of un 93.3 0.044 9.6E-07 43.1 1.6 45 29-75 1-63 (103)
28 PF12662 cEGF: Complement Clr- 91.4 0.14 3E-06 29.3 1.6 16 57-72 1-20 (24)
29 PF12273 RCR: Chitin synthesis 87.5 0.61 1.3E-05 38.4 3.3 11 248-258 72-82 (130)
30 KOG1214|consensus 87.3 0.49 1.1E-05 49.7 3.1 35 34-72 828-864 (1289)
31 PF00354 Pentaxin: Pentaxin fa 87.0 5 0.00011 35.4 9.1 74 83-162 11-87 (195)
32 PF14670 FXa_inhibition: Coagu 85.3 0.46 9.9E-06 30.0 1.1 19 49-67 10-28 (36)
33 KOG4260|consensus 84.9 0.62 1.3E-05 42.8 2.2 33 43-75 150-185 (350)
34 smart00159 PTX Pentraxin / C-r 84.6 12 0.00026 33.1 10.4 78 78-162 9-93 (206)
35 PF00053 Laminin_EGF: Laminin 82.5 0.82 1.8E-05 30.7 1.5 24 50-75 12-35 (49)
36 PF04863 EGF_alliinase: Alliin 82.4 0.4 8.7E-06 33.0 -0.0 32 44-75 18-53 (56)
37 cd01475 vWA_Matrilin VWA_Matri 82.1 1.3 2.9E-05 39.6 3.3 37 27-68 182-218 (224)
38 KOG1217|consensus 81.4 1.4 3E-05 43.2 3.4 36 33-71 271-306 (487)
39 PF05568 ASFV_J13L: African sw 81.3 4.2 9E-05 33.9 5.5 51 196-249 32-82 (189)
40 PF12946 EGF_MSP1_1: MSP1 EGF 80.1 0.77 1.7E-05 29.1 0.7 24 44-67 6-30 (37)
41 PF00954 S_locus_glycop: S-loc 79.7 2 4.3E-05 34.1 3.1 32 33-68 77-108 (110)
42 PF06679 DUF1180: Protein of u 78.8 3.2 7E-05 35.6 4.2 18 243-260 145-162 (163)
43 KOG1225|consensus 78.4 1.6 3.5E-05 44.2 2.7 27 44-74 317-343 (525)
44 PF15330 SIT: SHP2-interacting 77.2 5.9 0.00013 31.5 5.1 15 208-222 13-27 (107)
45 PF01102 Glycophorin_A: Glycop 77.0 3.8 8.3E-05 33.4 4.0 9 213-221 85-93 (122)
46 cd00152 PTX Pentraxins are pla 76.5 35 0.00076 29.9 10.5 80 77-162 8-93 (201)
47 TIGR01478 STEVOR variant surfa 75.9 3.3 7.1E-05 38.5 3.7 33 191-224 258-290 (295)
48 PF13385 Laminin_G_3: Concanav 75.9 26 0.00057 28.0 9.0 78 80-162 6-88 (157)
49 KOG1217|consensus 75.5 2.8 6E-05 41.1 3.5 39 34-74 170-208 (487)
50 cd00055 EGF_Lam Laminin-type e 75.5 3 6.4E-05 28.1 2.6 19 56-74 17-35 (50)
51 PTZ00370 STEVOR; Provisional 73.9 4 8.6E-05 38.0 3.7 31 192-223 255-285 (296)
52 PF05399 EVI2A: Ectropic viral 72.7 6.6 0.00014 34.9 4.6 14 196-209 128-141 (227)
53 smart00051 DSL delta serrate l 72.3 2.9 6.3E-05 29.9 2.0 24 45-71 40-63 (63)
54 PF02009 Rifin_STEVOR: Rifin/s 72.1 4.1 8.8E-05 38.5 3.5 14 208-221 270-283 (299)
55 PF15102 TMEM154: TMEM154 prot 72.1 1.2 2.6E-05 37.2 -0.0 14 288-301 129-142 (146)
56 KOG1226|consensus 70.4 4.1 9E-05 42.7 3.4 27 44-74 556-582 (783)
57 PF01034 Syndecan: Syndecan do 70.0 1.5 3.3E-05 31.3 0.1 15 209-223 26-40 (64)
58 KOG1225|consensus 66.3 7.4 0.00016 39.6 4.1 27 45-75 256-282 (525)
59 smart00180 EGF_Lam Laminin-typ 65.0 4.8 0.0001 26.7 1.8 18 57-74 17-34 (46)
60 PF08693 SKG6: Transmembrane a 62.6 1.6 3.4E-05 28.3 -0.9 9 211-219 30-38 (40)
61 PF14575 EphA2_TM: Ephrin type 59.7 4.5 9.7E-05 30.0 0.9 13 208-220 14-26 (75)
62 KOG4260|consensus 57.1 6.9 0.00015 36.2 1.8 39 27-68 231-269 (350)
63 KOG3509|consensus 56.9 13 0.00029 40.4 4.2 117 43-162 178-317 (964)
64 PF05923 APC_crr: APC cysteine 56.0 4.1 9E-05 23.7 0.2 21 310-330 2-25 (26)
65 PF15050 SCIMP: SCIMP protein 55.0 20 0.00044 28.9 4.0 11 230-240 53-63 (133)
66 KOG1214|consensus 53.9 7.7 0.00017 41.2 1.8 36 36-73 417-454 (1289)
67 PTZ00046 rifin; Provisional 51.6 14 0.0003 35.7 3.0 15 208-222 329-343 (358)
68 KOG1836|consensus 51.1 12 0.00027 43.3 3.0 32 44-75 781-815 (1705)
69 PF02973 Sialidase: Sialidase, 50.0 1.2E+02 0.0026 26.7 8.5 46 98-143 34-82 (190)
70 TIGR01477 RIFIN variant surfac 49.6 15 0.00033 35.3 3.0 15 208-222 324-338 (353)
71 PF15176 LRR19-TM: Leucine-ric 49.4 63 0.0014 25.3 5.7 10 196-205 17-26 (102)
72 PF08374 Protocadherin: Protoc 48.9 11 0.00024 33.7 1.7 10 196-205 41-50 (221)
73 PF00558 Vpu: Vpu protein; In 44.5 33 0.00072 25.8 3.5 9 210-218 20-28 (81)
74 PF14991 MLANA: Protein melan- 42.9 7 0.00015 31.2 -0.4 12 240-251 72-83 (118)
75 PF12191 stn_TNFRSF12A: Tumour 42.0 8.5 0.00018 31.4 0.0 14 210-223 94-107 (129)
76 PF01299 Lamp: Lysosome-associ 41.9 13 0.00029 35.1 1.3 14 208-221 285-299 (306)
77 PF15330 SIT: SHP2-interacting 41.7 56 0.0012 26.0 4.6 21 203-223 11-31 (107)
78 PF14851 FAM176: FAM176 family 41.3 36 0.00078 28.9 3.7 11 301-311 138-148 (153)
79 PF02439 Adeno_E3_CR2: Adenovi 40.8 45 0.00098 21.3 3.1 6 197-202 7-12 (38)
80 PF05393 Hum_adeno_E3A: Human 40.3 73 0.0016 24.3 4.7 14 209-222 46-60 (94)
81 PF05454 DAG1: Dystroglycan (D 40.0 9.5 0.00021 35.8 0.0 16 128-143 43-58 (290)
82 PF15347 PAG: Phosphoprotein a 39.5 35 0.00076 33.1 3.6 15 288-302 176-190 (428)
83 PHA03049 IMV membrane protein; 39.3 46 0.001 23.9 3.3 23 197-223 7-29 (68)
84 PF10731 Anophelin: Thrombin i 38.6 57 0.0012 23.0 3.6 9 240-248 32-40 (65)
85 KOG3607|consensus 37.0 25 0.00055 37.3 2.5 28 44-74 631-658 (716)
86 PF04272 Phospholamban: Phosph 36.9 31 0.00068 22.8 2.0 9 197-205 37-45 (52)
87 PF09064 Tme5_EGF_like: Thromb 36.3 31 0.00068 21.4 1.8 17 50-67 11-27 (34)
88 PRK00523 hypothetical protein; 35.4 76 0.0016 23.3 4.0 23 195-218 5-27 (72)
89 PF06024 DUF912: Nucleopolyhed 35.4 6.9 0.00015 30.7 -1.5 8 214-221 84-91 (101)
90 TIGR00847 ccoS cytochrome oxid 35.1 1.1E+02 0.0024 20.8 4.6 11 211-221 18-28 (51)
91 PHA02872 EFc gene family prote 34.8 23 0.00051 28.1 1.4 30 308-337 58-87 (124)
92 PF02699 YajC: Preprotein tran 34.7 33 0.00071 25.7 2.2 10 211-220 14-23 (82)
93 PHA03164 hypothetical protein; 34.6 38 0.00082 25.1 2.4 23 196-218 60-82 (88)
94 PF06365 CD34_antigen: CD34/Po 34.2 39 0.00085 30.0 2.9 20 239-260 141-160 (202)
95 PF02468 PsbN: Photosystem II 33.8 42 0.0009 22.1 2.2 15 208-222 15-29 (43)
96 PF03597 CcoS: Cytochrome oxid 33.6 1.3E+02 0.0028 19.9 4.7 12 210-221 16-27 (45)
97 PF14316 DUF4381: Domain of un 33.5 56 0.0012 27.2 3.7 9 289-297 98-106 (146)
98 TIGR01294 P_lamban phospholamb 33.0 39 0.00085 22.3 2.0 9 197-205 37-45 (52)
99 PF01414 DSL: Delta serrate li 32.7 14 0.00031 26.3 -0.0 12 60-71 52-63 (63)
100 TIGR00739 yajC preprotein tran 32.3 25 0.00054 26.6 1.2 8 211-218 15-22 (84)
101 PF10717 ODV-E18: Occlusion-de 31.4 52 0.0011 24.8 2.7 10 196-205 28-37 (85)
102 PF10577 UPF0560: Uncharacteri 31.4 49 0.0011 35.3 3.5 10 196-205 276-285 (807)
103 TIGR02595 PEP_exosort PEP-CTER 30.4 56 0.0012 18.8 2.2 9 213-221 15-23 (26)
104 PF11884 DUF3404: Domain of un 29.0 83 0.0018 29.1 4.2 16 208-223 243-258 (262)
105 PRK05585 yajC preprotein trans 28.8 38 0.00082 26.9 1.7 7 212-218 31-37 (106)
106 PF12606 RELT: Tumour necrosis 28.1 1.2E+02 0.0026 20.7 3.8 7 199-205 6-12 (50)
107 PHA03286 envelope glycoprotein 27.9 91 0.002 31.1 4.5 9 214-222 412-420 (492)
108 PRK03427 cell division protein 27.6 64 0.0014 30.9 3.3 8 242-249 59-66 (333)
109 PF05961 Chordopox_A13L: Chord 26.1 87 0.0019 22.6 3.0 14 210-223 16-29 (68)
110 PF06667 PspB: Phage shock pro 26.0 1.1E+02 0.0024 22.7 3.7 8 214-221 23-30 (75)
111 KOG3054|consensus 26.0 75 0.0016 29.1 3.3 15 208-222 14-28 (299)
112 KOG0994|consensus 25.9 73 0.0016 35.6 3.7 26 50-75 924-951 (1758)
113 COG4367 Uncharacterized protei 25.7 57 0.0012 24.9 2.1 24 287-312 54-77 (97)
114 COG3763 Uncharacterized protei 25.4 1.7E+02 0.0037 21.4 4.4 21 196-217 5-25 (71)
115 PF04689 S1FA: DNA binding pro 25.1 63 0.0014 23.1 2.1 31 192-222 10-40 (69)
116 PF10078 DUF2316: Uncharacteri 25.0 70 0.0015 24.6 2.5 26 285-312 53-78 (89)
117 PF00558 Vpu: Vpu protein; In 24.9 42 0.00091 25.3 1.3 18 206-223 19-36 (81)
118 PRK13183 psbN photosystem II r 24.8 61 0.0013 21.6 1.8 15 208-222 18-32 (46)
119 KOG1094|consensus 24.7 99 0.0021 32.3 4.2 8 211-218 407-414 (807)
120 COG4736 CcoQ Cbb3-type cytochr 24.1 89 0.0019 22.1 2.7 15 208-222 21-35 (60)
121 PRK14584 hmsS hemin storage sy 23.4 2E+02 0.0043 24.5 5.1 25 196-222 63-88 (153)
122 PF02009 Rifin_STEVOR: Rifin/s 23.4 96 0.0021 29.4 3.7 25 196-222 263-287 (299)
123 PHA02681 ORF089 virion membran 23.1 1.8E+02 0.0038 22.0 4.2 14 208-221 15-28 (92)
124 PF07213 DAP10: DAP10 membrane 22.9 1.3E+02 0.0029 22.5 3.6 13 209-221 49-61 (79)
125 PF03908 Sec20: Sec20; InterP 22.8 95 0.0021 23.6 3.0 12 208-219 80-91 (92)
126 PF06716 DUF1201: Protein of u 22.8 1.6E+02 0.0036 19.6 3.6 23 194-218 11-33 (54)
127 TIGR03521 GldG gliding-associa 22.7 92 0.002 32.1 3.7 14 208-221 535-548 (552)
128 PRK01844 hypothetical protein; 22.7 1.8E+02 0.0039 21.4 4.1 23 196-219 5-27 (72)
129 PRK05886 yajC preprotein trans 22.7 94 0.002 24.8 3.0 10 211-220 16-25 (109)
130 PHA02902 putative IMV membrane 22.0 1.4E+02 0.0031 21.3 3.4 14 208-221 15-28 (70)
131 PF12768 Rax2: Cortical protei 21.8 1.1E+02 0.0023 28.7 3.7 24 196-221 234-257 (281)
132 PF15431 TMEM190: Transmembran 21.7 2.2E+02 0.0047 22.8 4.7 45 177-222 36-88 (134)
133 KOG3653|consensus 21.6 1.4E+02 0.0031 30.1 4.6 16 209-224 170-185 (534)
134 PF06247 Plasmod_Pvs28: Plasmo 21.5 28 0.00062 30.5 -0.2 57 10-67 10-79 (197)
135 PF05545 FixQ: Cbb3-type cytoc 21.1 1.1E+02 0.0024 20.3 2.7 7 214-220 27-33 (49)
136 KOG1834|consensus 21.0 4.3E+02 0.0094 28.0 7.8 66 97-162 366-445 (952)
137 PF11770 GAPT: GRB2-binding ad 20.9 24 0.00052 29.7 -0.8 10 196-205 10-19 (158)
138 KOG0994|consensus 20.9 66 0.0014 35.9 2.2 16 59-74 1085-1100(1758)
139 PF04478 Mid2: Mid2 like cell 20.5 15 0.00031 31.1 -2.2 8 214-221 71-78 (154)
140 PF11669 WBP-1: WW domain-bind 20.4 2.8E+02 0.0062 21.6 5.3 13 235-247 57-69 (102)
141 PRK00269 zipA cell division pr 20.1 2.1E+02 0.0046 26.9 5.1 12 212-223 20-31 (293)
No 1
>PF01049 Cadherin_C: Cadherin cytoplasmic region; InterPro: IPR000233 Cadherins are transmembrane glycoproteins vital in calcium-dependent cell-cell adhesion during tissue differentiation []. Cadherins cluster to form foci of homophilic binding units. A key determinant to the strength of the binding that it is mediated by cadherins is the juxtamembrane region of the cadherin. This region induces clustering and also binds to the protein p120ctn []. The cytoplasmic region is highly conserved in sequence and has been shown experimentally to regulate the cell-cell binding function of the extracellular domain of E-cadherin, possibly through interaction with the cytoskeleton []. This domain is found upstream of the cadherin domain IPR002126 from INTERPRO.; GO: 0005509 calcium ion binding, 0007156 homophilic cell adhesion, 0016020 membrane; PDB: 2KOH_B 3IFQ_D 1I7X_D 1I7W_D 3L6Y_F 3L6X_B.
Probab=100.00 E-value=7e-35 Score=246.11 Aligned_cols=111 Identities=50% Similarity=0.801 Sum_probs=34.8
Q ss_pred CCCCchhhcccccccCCCCCccCCCcccccccCCCCCCCC-CCccCCC-C-CCCCCCCCCCCCcHHHHHHHHHHHhcCCC
Q psy592 228 PGPDDDVRENIINYDDEGGGEDDMTAFDITPLQIPIGGPH-PEMNNKI-P-YGLGPMPMGVEPNVGIFIEEHKKRADADP 304 (339)
Q Consensus 228 ~~~~~d~reni~~Y~eEggGE~d~~~yd~~~L~~~~~~~~-~~~~~~~-~-~~~~~~~~~~~~~~~~~i~~~~~~ad~d~ 304 (339)
..++||||||||+|||||||||||+||||++|++|...+. ..+.|.. + +...+.+.....+|++||++||++||.||
T Consensus 13 ~~~~ediReNii~YddEGGGEeD~~aydi~~L~~~~~~~~~~~~~P~~~~~~~~~~~~~~~~~~~~~~i~~kl~~ad~d~ 92 (149)
T PF01049_consen 13 PDEEEDIRENIINYDDEGGGEEDTEAYDISALQNPSQPRSRRDILPLIRALPRYPPPPPPDSPDIGDFISQKLQEADEDP 92 (149)
T ss_dssp -----------------TT---------GGG--------EEEEE--TTSS---------SSGGGHHHHHHHHHHHHHT-T
T ss_pred CCcHHHHhcCceecccCcCCcCCCCccchhhhhCccccccccccccccccccccccccCCCchHHHHHHHHHHHHcCCcc
Confidence 5688999999999999999999999999999999877510 0011100 0 00011122456799999999999999999
Q ss_pred CCCCCCcceeeeecCCCCCcccccccccCCCCCC
Q psy592 305 NAPPFDDLRNYAYEGGGSTAGSLSSLASGSSIKN 338 (339)
Q Consensus 305 ~~~p~Dsl~~y~~EG~gS~agSlssl~s~~~~~~ 338 (339)
++||||+|++|+|||.||+|||||||+|.+++++
T Consensus 93 ~~~p~Dsl~~Y~~EG~gS~agSLSSL~S~s~~~~ 126 (149)
T PF01049_consen 93 SAPPYDSLQVYAYEGQGSSAGSLSSLESCSSDEE 126 (149)
T ss_dssp TSSS-EEEEE----------TT------------
T ss_pred ccccccceeeeccCCCCCcCCCcccccCCCcccc
Confidence 9999999999999999999999999999886543
No 2
>KOG3514|consensus
Probab=99.83 E-value=2.2e-20 Score=189.96 Aligned_cols=166 Identities=17% Similarity=0.391 Sum_probs=143.8
Q ss_pred CCccceeeECCEEeecCCCccCCCC---CCCCCCCCc-ccCCCCCCCCCCCCCeeecCCCCceeeCC-CCCCCCCCCCCC
Q psy592 1 MENILNIVTITYLYDLANPGLHKNS---VPGCPATEA-VCNAHSRLHCYPEHGVCVGSMHTAKCQCL-PGWSGTGCVTPT 75 (339)
Q Consensus 1 ~GCIrnl~vn~~~~dl~~~~~~~~s---~~gC~~~~~-~C~~~~~~~pC~~~g~Cv~~~~~~~C~C~-~g~~G~~Ce~~~ 75 (339)
+||||+|++||+..|+...++..++ .++|..... .|..+ ||+|+|+|...|+.|.|+|. .+|.|+.||.+.
T Consensus 587 vGCirdl~i~G~s~di~q~ae~q~sagvkpsCs~~~~~~C~~n----PC~N~g~C~egwNrfiCDCs~T~~~G~~CerE~ 662 (1591)
T KOG3514|consen 587 VGCIRDLFIDGVSTDIRQEAEAQNSAGVKPSCSLSNEKICESN----PCQNGGKCSEGWNRFICDCSGTGFEGRTCEREA 662 (1591)
T ss_pred hheehhheecceehhhHHHhhhccccccCcccchhhccccCCC----cccCCCCccccccccccccccCcccCcccccee
Confidence 6999999999999999988877664 689987655 89999 99999999999999999999 589999999998
Q ss_pred CCccccCcceEEEeccCCCCccccEEEEEEeecCCceEEEEEecCCCCceEEEEEECCEEEEEEecCCCcccceeEEe-c
Q psy592 76 SPVTFKPQSYIKYALSFEPDKYTTQLQLKFRTREEFGELFRLSDQHNREYAILEIKDSKLRFRYNLNNLKTDEKDIWL-N 154 (339)
Q Consensus 76 ~~~~F~~~s~v~y~~~~~~~~~~~~i~l~frT~~~~GlL~~~~~~~~~~~i~L~L~~G~l~~~~~lG~~~~~~~~l~~-~ 154 (339)
..++|+|.-|+...++.....+..+++++|||..+.|+||..+.....|-+.++|..|.+.+.+++.+ +..+.. .
T Consensus 663 t~ls~nGs~~m~i~L~~~~~tq~E~v~iRF~t~r~~Gll~~Tta~~s~D~l~l~L~~g~vkl~v~ls~----~~nlfag~ 738 (1591)
T KOG3514|consen 663 TALSYNGSMSMKIVLPHTMHTQAEDVSIRFRTQRAYGLLFATTARGSADTLRLELDAGQVKLFVNLSG----PENLFAGQ 738 (1591)
T ss_pred eeEEEcCeeeEEEEecccceeecceEEEEEEecccceeEEEeccCCCCceEEEEEecceEEEEEecCC----Ccceeccc
Confidence 89999999999998886666667899999999999999999888777899999999999999999874 445554 3
Q ss_pred ccccCccc----ccccCCcceeec
Q psy592 155 AIKDSKLR----FRYNLNNLKTDE 174 (339)
Q Consensus 155 ~v~dG~WH----~r~~~~~~l~~~ 174 (339)
.++|.+|| .|.+++..|.++
T Consensus 739 ~LnDN~WHtvrv~Rrg~~L~L~vD 762 (1591)
T KOG3514|consen 739 SLNDNEWHTVRVVRRGKSLLLYVD 762 (1591)
T ss_pred cccCCcceEEEEEEcccceEEEec
Confidence 78999999 566665555554
No 3
>KOG4289|consensus
Probab=99.82 E-value=2.3e-20 Score=193.46 Aligned_cols=152 Identities=20% Similarity=0.344 Sum_probs=131.8
Q ss_pred CCccceeeECCEEeecCCCccCCCCCCCCCCCCcccCCCCCCCCCCCCCeeecCCCCceeeCCCCCCCCCCCCCC-CCcc
Q psy592 1 MENILNIVTITYLYDLANPGLHKNSVPGCPATEAVCNAHSRLHCYPEHGVCVGSMHTAKCQCLPGWSGTGCVTPT-SPVT 79 (339)
Q Consensus 1 ~GCIrnl~vn~~~~dl~~~~~~~~s~~gC~~~~~~C~~~~~~~pC~~~g~Cv~~~~~~~C~C~~g~~G~~Ce~~~-~~~~ 79 (339)
+||||||.|+|+.+|++......+..+||+...+.|... +|.|+|+|++.|++|.|.||.+|.|..|...+ .+-.
T Consensus 1469 vGCmrdLsvD~~~VDma~fianngt~eGC~ark~fCdsg----~C~n~g~CvnrWg~~~C~CP~~fggk~c~~~m~~pq~ 1544 (2531)
T KOG4289|consen 1469 VGCMRDLSVDGRDVDMATFIANNGTHEGCKARKNFCDSG----QCSNGGTCVNRWGGFSCECPLGFGGKGCCQGMAHPQH 1544 (2531)
T ss_pred hhhhhhcccccccccHHHHHhhcCcccCchhhhcccCCC----ccCCCCeeecccCcEeecCccccCCcchhhccCCchh
Confidence 599999999999999999999989999999999999998 99999999999999999999999999999887 5567
Q ss_pred ccCcceEEEeccCCCCccccEEEEEEeecCCceEEEEEecCCCCceEEEEEECCEEEEEEecCCCcccceeEEecccccC
Q psy592 80 FKPQSYIKYALSFEPDKYTTQLQLKFRTREEFGELFRLSDQHNREYAILEIKDSKLRFRYNLNNLKTDEKDIWLNAIKDS 159 (339)
Q Consensus 80 F~~~s~v~y~~~~~~~~~~~~i~l~frT~~~~GlL~~~~~~~~~~~i~L~L~~G~l~~~~~lG~~~~~~~~l~~~~v~dG 159 (339)
|.|+|.+.+.-.+......+.++|+|||+..+|+||.+...+ ..-+.|+|.+|++.+.+ +. ....+..+.++||
T Consensus 1545 frG~sl~sw~~~~~~vSvPwylsl~FRTr~ad~vl~~~~~~~-rst~~lqld~g~l~~~v--~~---s~v~L~~~~vtdg 1618 (2531)
T KOG4289|consen 1545 FRGHSLVSWEGLPSQVSVPWYLSLMFRTRRADGVLMQAEFGG-RSTYNLQLDDGTLKYNV--GD---SSVELPAPRVTDG 1618 (2531)
T ss_pred ccccceeeecCCCcceecceEEEEEEEeeccccEEEEEEeCC-CceEEEEEcCCEEEEEe--cC---ceEEccCccccCC
Confidence 999999998855444555789999999999999999887653 45588999999998875 33 1344444688999
Q ss_pred ccc
Q psy592 160 KLR 162 (339)
Q Consensus 160 ~WH 162 (339)
+||
T Consensus 1619 ~Wh 1621 (2531)
T KOG4289|consen 1619 HWH 1621 (2531)
T ss_pred chh
Confidence 999
No 4
>KOG3516|consensus
Probab=99.70 E-value=1.2e-16 Score=165.46 Aligned_cols=151 Identities=19% Similarity=0.324 Sum_probs=122.9
Q ss_pred CCccceeeECCEEeecCCCcc-CCCCCCCCCCCCcccCCCCCCCCCCCCCeeecCCCCceeeCC-CCCCCCCCCCCCCCc
Q psy592 1 MENILNIVTITYLYDLANPGL-HKNSVPGCPATEAVCNAHSRLHCYPEHGVCVGSMHTAKCQCL-PGWSGTGCVTPTSPV 78 (339)
Q Consensus 1 ~GCIrnl~vn~~~~dl~~~~~-~~~s~~gC~~~~~~C~~~~~~~pC~~~g~Cv~~~~~~~C~C~-~g~~G~~Ce~~~~~~ 78 (339)
.||||+|.+||+++||..-+. ..++.+||. ..|++. +|.|||+|+..|.+|+|+|. ..|+|+.|..++ .+
T Consensus 925 ~GCIRsl~LNGv~ldLe~ra~~~~gv~~GC~---GhCss~----~C~NGG~Cvery~gytCDCs~Tay~Gp~Cs~ei-g~ 996 (1306)
T KOG3516|consen 925 LGCIRSLQLNGVMLDLEYRAYGTAGVSPGCE---GHCSSY----PCLNGGHCVERYDGYTCDCSRTAYDGPFCSKEI-GV 996 (1306)
T ss_pred eeeeeeeeecceeeeehhhhccCCcccCCCc---cccccc----cccCCCEEEEecCceeeccccCcCCCCcccccc-ce
Confidence 499999999999999944443 346789997 599998 99999999999999999999 589999999996 88
Q ss_pred cccCcceEEEeccCCC-----------------CccccEEEEEEeecCCceEEEEEecCCCCceEEEEE-ECCEEEEEEe
Q psy592 79 TFKPQSYIKYALSFEP-----------------DKYTTQLQLKFRTREEFGELFRLSDQHNREYAILEI-KDSKLRFRYN 140 (339)
Q Consensus 79 ~F~~~s~v~y~~~~~~-----------------~~~~~~i~l~frT~~~~GlL~~~~~~~~~~~i~L~L-~~G~l~~~~~ 140 (339)
.|+.+++++|.+...- ......|.|.|+|....++|+|+++. ..+|+.+.+ .+|.+++++.
T Consensus 997 ~fe~gs~i~y~fq~~~~~a~~~~~~~~~~~~~~~~~~e~i~~sftTt~~ps~LLfvssF-~~~y~~V~v~~nGsLq~ry~ 1075 (1306)
T KOG3516|consen 997 FFERGSSIRYNFQKPMRSAVFESSRVKQKLEIEINPNEEINFSFTTTRAPSDLLFVSSF-TDDYLAVLVKDNGSLQTRYM 1075 (1306)
T ss_pred EecCCceEEEeccchHHHhhhhhhhhhhccccccCccceEEEEEEeccCceEEEEeecc-ccceEEEEEeCCCceEEEEe
Confidence 9999999999864321 01246799999999999999999886 366766555 6899999999
Q ss_pred cCCCcccceeEEe--cccccCccc
Q psy592 141 LNNLKTDEKDIWL--NAIKDSKLR 162 (339)
Q Consensus 141 lG~~~~~~~~l~~--~~v~dG~WH 162 (339)
+|.. .+..... .++.+|+.|
T Consensus 1076 lg~~--e~~~~~~~~kn~~~gq~H 1097 (1306)
T KOG3516|consen 1076 LGFR--EPFEYQFKDKNIALGQPH 1097 (1306)
T ss_pred cCCc--CceEEecccccccCCCce
Confidence 9982 3344443 377789999
No 5
>KOG4289|consensus
Probab=99.65 E-value=4e-16 Score=162.54 Aligned_cols=126 Identities=19% Similarity=0.456 Sum_probs=104.6
Q ss_pred CCCCCCCcccCCCCCCCCCCCCCeeecCCCCceeeCCCCCCCCCCCCCC-------------------------------
Q psy592 27 PGCPATEAVCNAHSRLHCYPEHGVCVGSMHTAKCQCLPGWSGTGCVTPT------------------------------- 75 (339)
Q Consensus 27 ~gC~~~~~~C~~~~~~~pC~~~g~Cv~~~~~~~C~C~~g~~G~~Ce~~~------------------------------- 75 (339)
.-|....+.|.+. ||+|+|+|....++|+|.|.+||+|.+||...
T Consensus 1233 d~CeTeiDlCYs~----pC~nng~C~srEggYtCeCrpg~tGehCEvs~~agrCvpGvC~nggtC~~~~nggf~c~Cp~g 1308 (2531)
T KOG4289|consen 1233 DYCETEIDLCYSG----PCGNNGRCRSREGGYTCECRPGFTGEHCEVSARAGRCVPGVCKNGGTCVNLLNGGFCCHCPYG 1308 (2531)
T ss_pred ccccchhHhhhcC----CCCCCCceEEecCceeEEecCCccccceeeecccCccccceecCCCEEeecCCCceeccCCCc
Confidence 4577778999988 99999999999999999999999999999642
Q ss_pred ---------CCccccCcceEEEeccCCCCccccEEEEEEeecCCceEEEEEecCCCCceEEEEEECCEEEEEEecCCCcc
Q psy592 76 ---------SPVTFKPQSYIKYALSFEPDKYTTQLQLKFRTREEFGELFRLSDQHNREYAILEIKDSKLRFRYNLNNLKT 146 (339)
Q Consensus 76 ---------~~~~F~~~s~v~y~~~~~~~~~~~~i~l~frT~~~~GlL~~~~~~~~~~~i~L~L~~G~l~~~~~lG~~~~ 146 (339)
.+.+|.+.||+.|.... .+.++.++|+|.|.+.+|+|+|.+++ ..||+.|++.++.+++.|..|..
T Consensus 1309 e~e~prC~v~trSFp~~sfv~frglr--qRfh~TlslsfaT~~~nGlL~ynGne-khDFvalevVd~qvqltfS~Ges-- 1383 (2531)
T KOG4289|consen 1309 EFEDPRCEVTTRSFPPESFVTFRGLR--QRFHFTLSLSFATIERNGLLLYNGNE-KHDFVALEVVDEQVQLTFSAGES-- 1383 (2531)
T ss_pred ccCCCceEEEeeccCchheEEEeccc--cceEEEEEEEEEEeeecceEEecCCc-ccceEeeeeeeeeEEEEEecccc--
Confidence 24568889999887532 34567788889999999999999965 68999999999999999999863
Q ss_pred cceeEEe---cccccCccc
Q psy592 147 DEKDIWL---NAIKDSKLR 162 (339)
Q Consensus 147 ~~~~l~~---~~v~dG~WH 162 (339)
..++.- ..++||+||
T Consensus 1384 -~t~v~p~Vp~gvsDGqWH 1401 (2531)
T KOG4289|consen 1384 -TTTVSPDVPGGVSDGQWH 1401 (2531)
T ss_pred -cceecCCCCCCcccCcee
Confidence 233332 268999999
No 6
>KOG1219|consensus
Probab=99.61 E-value=2e-14 Score=154.72 Aligned_cols=137 Identities=20% Similarity=0.347 Sum_probs=112.6
Q ss_pred EeecCCCccCCCCC-----CCCCCCCcccCCCCCCCCCCCCCeeecC-CC-CceeeCCCCCCCCCCCCCCCCccccCcce
Q psy592 13 LYDLANPGLHKNSV-----PGCPATEAVCNAHSRLHCYPEHGVCVGS-MH-TAKCQCLPGWSGTGCVTPTSPVTFKPQSY 85 (339)
Q Consensus 13 ~~dl~~~~~~~~s~-----~gC~~~~~~C~~~~~~~pC~~~g~Cv~~-~~-~~~C~C~~g~~G~~Ce~~~~~~~F~~~s~ 85 (339)
.+.+.+|.++++.. .-||...+.|... ||..+..|+.. |+ .|+|.||+|.-| .|... .++++.|+||
T Consensus 3621 rlSfvtPrh~r~a~ClC~~G~Cp~~~~~C~~~----pcp~~~~Cvs~~~~~~~~cVcP~gr~g-~C~g~-~elS~tGnSY 3694 (4289)
T KOG1219|consen 3621 RLSFVTPRHHRTAACLCNRGFCPVETNQCAKS----PCPAGNLCVSSVHNSTYTCVCPIGRFG-FCQGD-FELSSTGNSY 3694 (4289)
T ss_pred heeeecchhhccceeeecCCcCCcccCccccC----CCcccCcccccccccceeEeccCcccc-cCCCc-ceEeecCcee
Confidence 34566676666643 4588888999998 99999999987 44 699999999665 58877 4889999999
Q ss_pred EEEeccCCCCccccEEEEEEeecCCceEEEEEecCCCCceEEEEEECCEEEEEEecCCCcccceeEEe--cccccCccc
Q psy592 86 IKYALSFEPDKYTTQLQLKFRTREEFGELFRLSDQHNREYAILEIKDSKLRFRYNLNNLKTDEKDIWL--NAIKDSKLR 162 (339)
Q Consensus 86 v~y~~~~~~~~~~~~i~l~frT~~~~GlL~~~~~~~~~~~i~L~L~~G~l~~~~~lG~~~~~~~~l~~--~~v~dG~WH 162 (339)
++|+++..... +..+.|++||.+++|++||... .++.+|.|.+|++++.++.|+++ ..+.. ..++||+||
T Consensus 3695 veyrlse~~n~-~~kl~frLkT~~sngIiM~tr~---~d~~iLkLv~G~~~l~~~cgsG~---Givg~q~~~VnDgqWH 3766 (4289)
T KOG1219|consen 3695 VEYRLSENQNT-RMKLGFRLKTLQSNGIIMYTRK---TDLAILKLVGGSPQLLADCGSGP---GIVGSQKRTVNDGQWH 3766 (4289)
T ss_pred EEEEccccccc-ceEEEEEEEecccCcEEEEEcC---CceEEEEecCCcEEEEEecCCCC---CcccccceEeecCcee
Confidence 99999876544 3889999999999999999973 68999999999999999999874 34433 478999999
No 7
>KOG3514|consensus
Probab=99.55 E-value=2.3e-14 Score=146.68 Aligned_cols=167 Identities=21% Similarity=0.343 Sum_probs=135.0
Q ss_pred CccceeeECCEEeecCCCccC--CCCCCCCCCCCcccCCCCCCCCCCCCCeeecCCCCceeeCCC-CCCCCCCCCCCCCc
Q psy592 2 ENILNIVTITYLYDLANPGLH--KNSVPGCPATEAVCNAHSRLHCYPEHGVCVGSMHTAKCQCLP-GWSGTGCVTPTSPV 78 (339)
Q Consensus 2 GCIrnl~vn~~~~dl~~~~~~--~~s~~gC~~~~~~C~~~~~~~pC~~~g~Cv~~~~~~~C~C~~-g~~G~~Ce~~~~~~ 78 (339)
||.-.+-+++.+-+|...... .....+|.-....|..+ .|.|+|.|+..|+++.|+|.. .|+|+.|..+-.++
T Consensus 985 g~~~s~dl~~r~p~L~~~a~~~s~lv~~~~sgpst~c~~~----acanhG~c~q~w~~~~c~csmtS~~Gp~C~d~gtTY 1060 (1591)
T KOG3514|consen 985 GCLASLDLGGRLPDLISDALFESGLVEVGCSGPSTTCSED----ACANHGVCIQQWNGIACDCSMTSYSGPRCNDPGTTY 1060 (1591)
T ss_pred CCcCccCccccchhHHHHhhhhccceeeeccCCCcccchh----hhhccceeeeeecceeeeccccccCCCccCCCceEE
Confidence 666667777776666555433 33457898877889888 899999999999999999996 79999999987788
Q ss_pred cccC-cceEEEeccCCC--CccccEEEEEEeecCCceEEEEEecCC-CCceEEEEEECCEEEEEEecCCCcccceeEEe-
Q psy592 79 TFKP-QSYIKYALSFEP--DKYTTQLQLKFRTREEFGELFRLSDQH-NREYAILEIKDSKLRFRYNLNNLKTDEKDIWL- 153 (339)
Q Consensus 79 ~F~~-~s~v~y~~~~~~--~~~~~~i~l~frT~~~~GlL~~~~~~~-~~~~i~L~L~~G~l~~~~~lG~~~~~~~~l~~- 153 (339)
.|+. ++.|.|..+... .....++.+.|+|.+++|+|+.+.+.. ..||+.|+|..|++-+.||+|.- ...+..
T Consensus 1061 iFgk~gglI~YtwPpNdRpsTr~DrlAvGFsTtq~daVLvRVdSAsglgDYlqLhI~qG~igvvfNiGt~---Diti~E~ 1137 (1591)
T KOG3514|consen 1061 IFGKSGGLITYTWPPNDRPSTRKDRLAVGFSTTQPDAVLVRVDSASGLGDYLQLHINQGKIGVVFNIGTD---DITISEH 1137 (1591)
T ss_pred EECCCCceEEEecCCCCCCCcccceEEEEEEeccCceEEEEEeccCCCCceEEEEEeccEEEEEEeccCc---ccccccc
Confidence 8975 789999987653 334678999999999999999998765 57999999999999999999973 444443
Q ss_pred -cccccCccc----ccccCCcceeecC
Q psy592 154 -NAIKDSKLR----FRYNLNNLKTDEK 175 (339)
Q Consensus 154 -~~v~dG~WH----~r~~~~~~l~~~~ 175 (339)
..|+||++| .|-+-+.|++++.
T Consensus 1138 ~~ivNDgkYHVVRFtR~GGNATLQVD~ 1164 (1591)
T KOG3514|consen 1138 NAIVNDGKYHVVRFTRSGGNATLQVDS 1164 (1591)
T ss_pred cccccCCceEEEEEEecCCceEEEecc
Confidence 378999999 6666667777765
No 8
>KOG3516|consensus
Probab=99.16 E-value=3.9e-10 Score=117.94 Aligned_cols=75 Identities=13% Similarity=0.244 Sum_probs=61.4
Q ss_pred ccEEEEEEeecCCceEEEEEecCCCCceEEEEEECCE-EEEEEecCCCcccceeEEec-ccccCccc----ccccCCcce
Q psy592 98 TTQLQLKFRTREEFGELFRLSDQHNREYAILEIKDSK-LRFRYNLNNLKTDEKDIWLN-AIKDSKLR----FRYNLNNLK 171 (339)
Q Consensus 98 ~~~i~l~frT~~~~GlL~~~~~~~~~~~i~L~L~~G~-l~~~~~lG~~~~~~~~l~~~-~v~dG~WH----~r~~~~~~l 171 (339)
...|+|.|||..++|++|...+. .||+.++|..+. +.|.++.|+++. +.+|..+ .++|++|| +|..++..|
T Consensus 810 saDIsf~FrTt~~~gvflen~g~--~dfir~eL~~~~~vtf~~dvgnGp~-~~~V~s~t~~nD~qWH~V~~Ern~K~a~L 886 (1306)
T KOG3516|consen 810 SADISFFFRTTASSGVFLENHGI--NDFIRLELSSPVEVTFAFDVGNGPS-QLTVRSPTELNDNQWHQVRAERNSKEASL 886 (1306)
T ss_pred cccEEEEEEecCCceEeeeccCC--CceEEEEEcCCCceEEEEEcCCCce-eEEEcCCcccCCCceEEEEEEeccccceE
Confidence 35699999999999999988764 799999998665 889999998753 4555555 78999999 888888888
Q ss_pred eecC
Q psy592 172 TDEK 175 (339)
Q Consensus 172 ~~~~ 175 (339)
++|+
T Consensus 887 qVD~ 890 (1306)
T KOG3516|consen 887 QVDG 890 (1306)
T ss_pred EEcC
Confidence 8877
No 9
>cd00110 LamG Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.
Probab=99.12 E-value=5.7e-10 Score=93.65 Aligned_cols=81 Identities=23% Similarity=0.439 Sum_probs=67.9
Q ss_pred ccccCcceEEEeccCCCCccccEEEEEEeecCCceEEEEEecCCCCceEEEEEECCEEEEEEecCCCcccceeEEec-cc
Q psy592 78 VTFKPQSYIKYALSFEPDKYTTQLQLKFRTREEFGELFRLSDQHNREYAILEIKDSKLRFRYNLNNLKTDEKDIWLN-AI 156 (339)
Q Consensus 78 ~~F~~~s~v~y~~~~~~~~~~~~i~l~frT~~~~GlL~~~~~~~~~~~i~L~L~~G~l~~~~~lG~~~~~~~~l~~~-~v 156 (339)
+.|.+.+|+.|...... .....|+++|||.+++|+||+.+.....+|+.++|.+|++.+.++.|. ....+... .+
T Consensus 2 ~~F~g~~~i~~~~~~~~-~~~~~i~~~frt~~~~g~l~~~~~~~~~~~~~l~l~~g~l~~~~~~g~---~~~~~~~~~~v 77 (151)
T cd00110 2 VSFSGSSYVRLPTLPAP-RTRLSISFSFRTTSPNGLLLYAGSQNGGDFLALELEDGRLVLRYDLGS---GSLVLSSKTPL 77 (151)
T ss_pred eEeCCCceEEecCCCCC-cceeEEEEEEEeCCCCeEEEEecCCCCCCEEEEEEECCEEEEEEcCCc---ccEEEEccCcc
Confidence 67999999999876543 557899999999999999999988656899999999999999999984 24555544 59
Q ss_pred ccCccc
Q psy592 157 KDSKLR 162 (339)
Q Consensus 157 ~dG~WH 162 (339)
+||+||
T Consensus 78 ~dg~Wh 83 (151)
T cd00110 78 NDGQWH 83 (151)
T ss_pred CCCCEE
Confidence 999999
No 10
>PF00054 Laminin_G_1: Laminin G domain; InterPro: IPR012679 Laminins are large heterotrimeric glycoproteins involved in basement membrane function []. The laminin globular (G) domain can be found in one to several copies in various laminin family members, which includes a large number of extracellular proteins. The C terminus of laminin alpha chain contains a tandem repeat of five laminin G domains, which are critical for heparin-binding and cell attachment activity []. Laminin alpha4 is distributed in a variety of tissues including peripheral nerves, dorsal root ganglion, skeletal muscle and capillaries; in the neuromuscular junction, it is required for synaptic specialisation []. The structure of the laminin-G domain has been predicted to resemble that of pentraxin []. Laminin G domains can vary in their function, and a variety of binding functions has been ascribed to different LamG modules. For example, the laminin alpha1 and alpha2 chains each has five C-teminal laminin G domains, where only domains LG4 and LG5 contain binding sites for heparin, sulphatides and the cell surface receptor dystroglycan []. Laminin G-containing proteins appear to have a wide variety of roles in cell adhesion, signalling, migration, assembly and differentiation. This entry represents one subtype of laminin G domains, which is sometimes found in association with thrombospondin-type laminin G domains (IPR012680 from INTERPRO).; PDB: 1OKQ_A 1DYK_A 2C5D_A 1H30_A 1LHW_A 1KDK_A 1LHU_A 1KDM_A 1LHO_A 1D2S_A ....
Probab=98.94 E-value=3.9e-09 Score=87.44 Aligned_cols=68 Identities=31% Similarity=0.484 Sum_probs=53.7
Q ss_pred EeecCCceEEEEEecCCCCceEEEEEECCEEEEEEecCCCcccceeEEe-cccccCccc----ccccCCcceeecC
Q psy592 105 FRTREEFGELFRLSDQHNREYAILEIKDSKLRFRYNLNNLKTDEKDIWL-NAIKDSKLR----FRYNLNNLKTDEK 175 (339)
Q Consensus 105 frT~~~~GlL~~~~~~~~~~~i~L~L~~G~l~~~~~lG~~~~~~~~l~~-~~v~dG~WH----~r~~~~~~l~~~~ 175 (339)
|||.+++|+|||.+.....||++|+|.+|++++++++|.+ +..+.. ..++||+|| .|..+...+++++
T Consensus 1 frT~~~~Gllly~g~~~~~dfial~L~~G~l~~~~~~G~~---~~~~~~~~~i~dg~wh~v~~~r~~~~~~L~Vd~ 73 (131)
T PF00054_consen 1 FRTSEPNGLLLYLGSKDGKDFIALELRDGRLEFRYNLGSG---PASLRSPQKINDGKWHTVSVSRNGRNGSLSVDG 73 (131)
T ss_dssp EEESSSSEEEEEEESSTTSSEEEEEEETTEEEEEEESSSE---EEEEEESSETTSSSEEEEEEEEETTEEEEEETT
T ss_pred CccCCCCceEEECCcCCCCCEEEEEEECCEEEEEEeCCCc---cceecCCCccCCCcceEEEEEEcCcEEEEEECC
Confidence 8999999999999987777999999999999999999985 445543 478999999 4444433444433
No 11
>smart00282 LamG Laminin G domain.
Probab=98.80 E-value=3.4e-08 Score=81.68 Aligned_cols=62 Identities=23% Similarity=0.432 Sum_probs=52.7
Q ss_pred ccEEEEEEeecCCceEEEEEecCCCCceEEEEEECCEEEEEEecCCCcccceeEEe--cccccCccc
Q psy592 98 TTQLQLKFRTREEFGELFRLSDQHNREYAILEIKDSKLRFRYNLNNLKTDEKDIWL--NAIKDSKLR 162 (339)
Q Consensus 98 ~~~i~l~frT~~~~GlL~~~~~~~~~~~i~L~L~~G~l~~~~~lG~~~~~~~~l~~--~~v~dG~WH 162 (339)
..+|+|.|||.+++|+||+..+....+|+.++|.+|++.+.++.|+. +..+.. ..++||+||
T Consensus 2 ~~~i~~~frt~~~~g~l~~~~~~~~~~~l~l~l~~g~l~~~~~~g~~---~~~~~~~~~~~~dg~WH 65 (135)
T smart00282 2 RLSISFSFRTTSPNGLLLYAGSKNGGDYLALELRDGRLVLRYDLGSG---PARLTSDPTPLNDGQWH 65 (135)
T ss_pred ceEEEEEEEeCCCCEEEEEeCCCCCCCEEEEEEECCEEEEEEECCCC---CEEEEECCeEeCCCCEE
Confidence 46899999999999999999875457999999999999999999874 445554 478899999
No 12
>KOG1219|consensus
Probab=98.68 E-value=1.9e-08 Score=110.13 Aligned_cols=77 Identities=22% Similarity=0.382 Sum_probs=66.2
Q ss_pred CCccceeeECCEEeecCCCccC-------CCCCCCCCCCCcccCCCCCCCCCCCCCeeecCC-CCceeeCCCCCCCCCCC
Q psy592 1 MENILNIVTITYLYDLANPGLH-------KNSVPGCPATEAVCNAHSRLHCYPEHGVCVGSM-HTAKCQCLPGWSGTGCV 72 (339)
Q Consensus 1 ~GCIrnl~vn~~~~dl~~~~~~-------~~s~~gC~~~~~~C~~~~~~~pC~~~g~Cv~~~-~~~~C~C~~g~~G~~Ce 72 (339)
.||++.+++||..++|.++..+ ....+||....+.|..+ ||+|+|+|.... ++|+|.|++.|+|.+||
T Consensus 3825 ~GCldsiyLng~el~l~~k~~s~a~~~el~~l~pgC~l~~d~C~~n----pCqhgG~C~~~~~ggy~CkCpsqysG~~CE 3900 (4289)
T KOG1219|consen 3825 DGCLDSIYLNGMELPLTRKGKSVAGLMELFGLQPGCSLLTDPCNDN----PCQHGGTCISQPKGGYKCKCPSQYSGNHCE 3900 (4289)
T ss_pred cceeeeEEEccccccccCCCchhhhhhhhhcccccccccccccccC----cccCCCEecCCCCCceEEeCcccccCcccc
Confidence 4999999999999999887643 24569999888999999 999999999985 58999999999999999
Q ss_pred CCCCCcccc
Q psy592 73 TPTSPVTFK 81 (339)
Q Consensus 73 ~~~~~~~F~ 81 (339)
..+.+-.+.
T Consensus 3901 i~~epC~sn 3909 (4289)
T KOG1219|consen 3901 IDLEPCASN 3909 (4289)
T ss_pred cccccccCC
Confidence 987555543
No 13
>PF02210 Laminin_G_2: Laminin G domain; InterPro: IPR012680 Laminins are large heterotrimeric glycoproteins involved in basement membrane function []. The laminin globular (G) domain can be found in one to several copies in various laminin family members, including a large number of extracellular proteins. The C terminus of the laminin alpha chain contains a tandem repeat of five laminin G domains, which are critical for heparin-binding and cell attachment activity []. Laminin alpha4 is distributed in a variety of tissues including peripheral nerves, dorsal root ganglion, skeletal muscle and capillaries; in the neuromuscular junction, it is required for synaptic specialisation []. The structure of the laminin-G domain has been predicted to resemble that of pentraxin []. Laminin G domains can vary in their function, and a variety of binding functions have been ascribed to different LamG modules. For example, the laminin alpha1 and alpha2 chains each have five C-teminal laminin G domains, where only domains LG4 and LG5 contain binding sites for heparin, sulphatides and the cell surface receptor dystroglycan []. Laminin G-containing proteins appear to have a wide variety of roles in cell adhesion, signalling, migration, assembly and differentiation. This entry represents one subtype of laminin G domains, which is sometimes found in association with thrombospondin-type laminin G domains (IPR012679 from INTERPRO).; PDB: 3POY_A 3QCW_B 3R05_B 3ASI_A 3MW4_B 3MW3_A 1QU0_D 1DYK_A 1OKQ_A 3SH4_A ....
Probab=98.51 E-value=6e-07 Score=72.70 Aligned_cols=55 Identities=31% Similarity=0.496 Sum_probs=45.9
Q ss_pred EeecCCceEEEEEecCCCCceEEEEEECCEEEEEEecCCCcccceeEEe--cccccCccc
Q psy592 105 FRTREEFGELFRLSDQHNREYAILEIKDSKLRFRYNLNNLKTDEKDIWL--NAIKDSKLR 162 (339)
Q Consensus 105 frT~~~~GlL~~~~~~~~~~~i~L~L~~G~l~~~~~lG~~~~~~~~l~~--~~v~dG~WH 162 (339)
|||++++|+|||.+.....+|+.++|.+|++++.+++|.. ...+.. .++++|+||
T Consensus 1 Frt~~~~g~Ll~~~~~~~~~~l~l~l~~g~l~~~~~~g~~---~~~~~~~~~~~~dg~wh 57 (128)
T PF02210_consen 1 FRTRSPNGLLLYIGSEDNGDFLSLELVDGRLVVRYNLGGS---EIVTTFSNSNLNDGQWH 57 (128)
T ss_dssp EEESSSSEEEEEEEESTTSEEEEEEEETTEEEEEEESSSS---EEEEEECSSSSTSSSEE
T ss_pred CccCCCCEeEEEEcCCCCCEEEEEEEECCEEEEEEEcccc---ceeeeccCcccccccee
Confidence 8999999999999987556899999999999999999942 233332 378899999
No 14
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=98.37 E-value=1.1e-07 Score=58.97 Aligned_cols=31 Identities=26% Similarity=0.758 Sum_probs=27.6
Q ss_pred cCCCCCCCCCCCCCeeecCC-CCceeeCCCCCCCCC
Q psy592 36 CNAHSRLHCYPEHGVCVGSM-HTAKCQCLPGWSGTG 70 (339)
Q Consensus 36 C~~~~~~~pC~~~g~Cv~~~-~~~~C~C~~g~~G~~ 70 (339)
|.++ ||.|+|+|+... ++|+|.|++||+|++
T Consensus 1 C~~~----~C~n~g~C~~~~~~~y~C~C~~G~~G~~ 32 (32)
T PF00008_consen 1 CSSN----PCQNGGTCIDLPGGGYTCECPPGYTGKR 32 (32)
T ss_dssp TTTT----SSTTTEEEEEESTSEEEEEEBTTEESTT
T ss_pred CCCC----cCCCCeEEEeCCCCCEEeECCCCCccCC
Confidence 4555 999999999998 899999999999964
No 15
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=97.98 E-value=9.7e-06 Score=51.92 Aligned_cols=35 Identities=29% Similarity=0.868 Sum_probs=30.8
Q ss_pred cccCC-CCCCCCCCCCCeeecCCCCceeeCCCCCC-CCCCC
Q psy592 34 AVCNA-HSRLHCYPEHGVCVGSMHTAKCQCLPGWS-GTGCV 72 (339)
Q Consensus 34 ~~C~~-~~~~~pC~~~g~Cv~~~~~~~C~C~~g~~-G~~Ce 72 (339)
+.|.. . +|.++|+|++..++|.|.|++||. |..|+
T Consensus 3 ~~C~~~~----~C~~~~~C~~~~g~~~C~C~~g~~~g~~C~ 39 (39)
T smart00179 3 DECASGN----PCQNGGTCVNTVGSYRCECPPGYTDGRNCE 39 (39)
T ss_pred ccCcCCC----CcCCCCEeECCCCCeEeECCCCCccCCcCC
Confidence 45665 5 899999999999999999999999 99885
No 16
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=97.80 E-value=3.2e-05 Score=48.89 Aligned_cols=35 Identities=29% Similarity=0.854 Sum_probs=30.5
Q ss_pred cccCC-CCCCCCCCCCCeeecCCCCceeeCCCCCCCCCCC
Q psy592 34 AVCNA-HSRLHCYPEHGVCVGSMHTAKCQCLPGWSGTGCV 72 (339)
Q Consensus 34 ~~C~~-~~~~~pC~~~g~Cv~~~~~~~C~C~~g~~G~~Ce 72 (339)
+.|.. . +|.+++.|+...++|+|.|++||.|..|+
T Consensus 3 ~~C~~~~----~C~~~~~C~~~~~~~~C~C~~g~~g~~C~ 38 (38)
T cd00054 3 DECASGN----PCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38 (38)
T ss_pred ccCCCCC----CcCCCCEeECCCCCeEeECCCCCcCCcCC
Confidence 45665 4 89999999999999999999999999885
No 17
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=97.62 E-value=8.3e-05 Score=46.18 Aligned_cols=29 Identities=31% Similarity=0.844 Sum_probs=26.8
Q ss_pred CCCCCCeeecCCCCceeeCCCCCCCC-CCC
Q psy592 44 CYPEHGVCVGSMHTAKCQCLPGWSGT-GCV 72 (339)
Q Consensus 44 pC~~~g~Cv~~~~~~~C~C~~g~~G~-~Ce 72 (339)
+|.+++.|+..+++|+|.|+.||.|. .|+
T Consensus 7 ~C~~~~~C~~~~~~~~C~C~~g~~g~~~C~ 36 (36)
T cd00053 7 PCSNGGTCVNTPGSYRCVCPPGYTGDRSCE 36 (36)
T ss_pred CCCCCCEEecCCCCeEeECCCCCcccCCcC
Confidence 89999999999999999999999998 774
No 18
>smart00181 EGF Epidermal growth factor-like domain.
Probab=97.51 E-value=0.00014 Score=45.51 Aligned_cols=28 Identities=25% Similarity=0.766 Sum_probs=25.9
Q ss_pred CCCCCCeeecCCCCceeeCCCCCCC-CCCC
Q psy592 44 CYPEHGVCVGSMHTAKCQCLPGWSG-TGCV 72 (339)
Q Consensus 44 pC~~~g~Cv~~~~~~~C~C~~g~~G-~~Ce 72 (339)
+|.++ +|++.+++|+|.|++||.| ..|+
T Consensus 7 ~C~~~-~C~~~~~~~~C~C~~g~~g~~~C~ 35 (35)
T smart00181 7 PCSNG-TCINTPGSYTCSCPPGYTGDKRCE 35 (35)
T ss_pred CCCCC-EEECCCCCeEeECCCCCccCCccC
Confidence 89888 9999999999999999999 8774
No 19
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=97.23 E-value=0.00035 Score=43.11 Aligned_cols=26 Identities=35% Similarity=0.992 Sum_probs=23.7
Q ss_pred CCCCCCeeecCCCCceeeCCCCCCCCCC
Q psy592 44 CYPEHGVCVGSMHTAKCQCLPGWSGTGC 71 (339)
Q Consensus 44 pC~~~g~Cv~~~~~~~C~C~~g~~G~~C 71 (339)
.|.++|+|+.. ..+|.|.+||+|+.|
T Consensus 7 ~C~~~G~C~~~--~g~C~C~~g~~G~~C 32 (32)
T PF07974_consen 7 ICSGHGTCVSP--CGRCVCDSGYTGPDC 32 (32)
T ss_pred ccCCCCEEeCC--CCEEECCCCCcCCCC
Confidence 69999999987 678999999999987
No 20
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes []. +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=97.19 E-value=0.00017 Score=47.41 Aligned_cols=33 Identities=33% Similarity=0.783 Sum_probs=27.7
Q ss_pred CcccCCCCCCCCCCCCCeeecCCCCceeeCCCCCC
Q psy592 33 EAVCNAHSRLHCYPEHGVCVGSMHTAKCQCLPGWS 67 (339)
Q Consensus 33 ~~~C~~~~~~~pC~~~g~Cv~~~~~~~C~C~~g~~ 67 (339)
++.|... .+.|..++.|++..++|+|.|++||.
T Consensus 2 idEC~~~--~~~C~~~~~C~N~~Gsy~C~C~~Gy~ 34 (42)
T PF07645_consen 2 IDECAEG--PHNCPENGTCVNTEGSYSCSCPPGYE 34 (42)
T ss_dssp SSTTTTT--SSSSSTTSEEEEETTEEEEEESTTEE
T ss_pred ccccCCC--CCcCCCCCEEEcCCCCEEeeCCCCcE
Confidence 3567654 35798899999999999999999996
No 21
>PF12661 hEGF: Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=96.98 E-value=0.00021 Score=34.86 Aligned_cols=13 Identities=62% Similarity=1.842 Sum_probs=11.1
Q ss_pred eeeCCCCCCCCCC
Q psy592 59 KCQCLPGWSGTGC 71 (339)
Q Consensus 59 ~C~C~~g~~G~~C 71 (339)
+|.|++||+|++|
T Consensus 1 ~C~C~~G~~G~~C 13 (13)
T PF12661_consen 1 TCQCPPGWTGPNC 13 (13)
T ss_dssp EEEE-TTEETTTT
T ss_pred CccCcCCCcCCCC
Confidence 5999999999988
No 22
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=96.03 E-value=0.0027 Score=40.26 Aligned_cols=27 Identities=22% Similarity=0.619 Sum_probs=21.3
Q ss_pred CCCCCCCeeecCCCCceeeCCCCCCCC
Q psy592 43 HCYPEHGVCVGSMHTAKCQCLPGWSGT 69 (339)
Q Consensus 43 ~pC~~~g~Cv~~~~~~~C~C~~g~~G~ 69 (339)
..|..++.|+....+|+|.|.+||.|.
T Consensus 6 ~~C~~nA~C~~~~~~~~C~C~~Gy~Gd 32 (36)
T PF12947_consen 6 GGCHPNATCTNTGGSYTCTCKPGYEGD 32 (36)
T ss_dssp GGS-TTCEEEE-TTSEEEEE-CEEECC
T ss_pred CCCCCCcEeecCCCCEEeECCCCCccC
Confidence 368888999999999999999999874
No 23
>smart00210 TSPN Thrombospondin N-terminal -like domains. Heparin-binding and cell adhesion domain of thrombospondin
Probab=95.92 E-value=0.071 Score=46.58 Aligned_cols=73 Identities=15% Similarity=0.118 Sum_probs=48.1
Q ss_pred cccEEEEEEeec-CCceEEEEEecCCCCceEEEEEECCEEEEEEec-CCCcccceeEEe--cccccCcccccccCCccee
Q psy592 97 YTTQLQLKFRTR-EEFGELFRLSDQHNREYAILEIKDSKLRFRYNL-NNLKTDEKDIWL--NAIKDSKLRFRYNLNNLKT 172 (339)
Q Consensus 97 ~~~~i~l~frT~-~~~GlL~~~~~~~~~~~i~L~L~~G~l~~~~~l-G~~~~~~~~l~~--~~v~dG~WH~r~~~~~~l~ 172 (339)
....|.+.||+. ...|.||.+.+.++..++.+.+..+...+.+.. +.. .....+.. .++.||+|| .+.++
T Consensus 52 ~~fsi~~~~r~~~~~~g~L~si~~~~~~~~l~v~l~g~~~~~~~~~~~~~-g~~~~~~f~~~~l~dg~WH-----~lal~ 125 (184)
T smart00210 52 EDFSLLTTFRQTPKSRGVLFAIYDAQNVRQFGLEVDGRANTLLLRYQGVD-GKQHTVSFRNLPLADGQWH-----KLALS 125 (184)
T ss_pred CCeEEEEEEEeCCCCCeEEEEEEcCCCcEEEEEEEeCCccEEEEEECCCC-CcEEEEeecCCccccCCce-----EEEEE
Confidence 467888889987 667999999876567788888877665555543 221 11233332 367889999 55555
Q ss_pred ecC
Q psy592 173 DEK 175 (339)
Q Consensus 173 ~~~ 175 (339)
+.+
T Consensus 126 V~~ 128 (184)
T smart00210 126 VSG 128 (184)
T ss_pred EeC
Confidence 554
No 24
>PHA02887 EGF-like protein; Provisional
Probab=95.51 E-value=0.011 Score=47.16 Aligned_cols=30 Identities=23% Similarity=0.631 Sum_probs=24.6
Q ss_pred CCCCCCeeec--CCCCceeeCCCCCCCCCCCCC
Q psy592 44 CYPEHGVCVG--SMHTAKCQCLPGWSGTGCVTP 74 (339)
Q Consensus 44 pC~~~g~Cv~--~~~~~~C~C~~g~~G~~Ce~~ 74 (339)
-|- +|+|.- +...+.|.|+.||+|.+|+..
T Consensus 93 YCi-HG~C~yI~dL~epsCrC~~GYtG~RCE~v 124 (126)
T PHA02887 93 FCI-NGECMNIIDLDEKFCICNKGYTGIRCDEV 124 (126)
T ss_pred Eee-CCEEEccccCCCceeECCCCcccCCCCcc
Confidence 565 579965 456899999999999999864
No 25
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=95.18 E-value=0.021 Score=46.25 Aligned_cols=30 Identities=20% Similarity=0.577 Sum_probs=25.1
Q ss_pred CCCCCCeeec--CCCCceeeCCCCCCCCCCCCC
Q psy592 44 CYPEHGVCVG--SMHTAKCQCLPGWSGTGCVTP 74 (339)
Q Consensus 44 pC~~~g~Cv~--~~~~~~C~C~~g~~G~~Ce~~ 74 (339)
-|-|| +|.- +...+.|.|..||+|.+||..
T Consensus 52 YClHG-~C~yI~dl~~~~CrC~~GYtGeRCEh~ 83 (139)
T PHA03099 52 YCLHG-DCIHARDIDGMYCRCSHGYTGIRCQHV 83 (139)
T ss_pred EeECC-EEEeeccCCCceeECCCCcccccccce
Confidence 68764 9965 457899999999999999865
No 26
>KOG3509|consensus
Probab=93.78 E-value=0.051 Score=58.20 Aligned_cols=70 Identities=19% Similarity=0.347 Sum_probs=54.2
Q ss_pred CCccceeeECCEEeecCCCccCCCCCCCCCCCCcccCCCCCCCCCCCCCeeecCCCCceeeCCCCCCCCCCCCCC
Q psy592 1 MENILNIVTITYLYDLANPGLHKNSVPGCPATEAVCNAHSRLHCYPEHGVCVGSMHTAKCQCLPGWSGTGCVTPT 75 (339)
Q Consensus 1 ~GCIrnl~vn~~~~dl~~~~~~~~s~~gC~~~~~~C~~~~~~~pC~~~g~Cv~~~~~~~C~C~~g~~G~~Ce~~~ 75 (339)
.|||+.++++++.++...-....-...+| ...+.|... ||...+.|.+..-...|.|+++|+|..|+..+
T Consensus 375 ~gci~~~~~~~k~l~~~~~~~~~v~~~~~-c~g~~c~~~----p~~~~g~c~p~~~~~~c~c~~g~~G~~c~d~~ 444 (964)
T KOG3509|consen 375 AGCIRDLVMNLKDLRVTLQRASYVAAQGT-CLGDVCWRI----PCQHDGPCLQTLEGKQCLCPPGYTGDSCEDCM 444 (964)
T ss_pred cceehhhhhhccccccccccccccccccc-cCCCccccc----cCCCCccccccccccceeccccccCchhhccC
Confidence 48999999999988765443322222222 234688888 99999999999999999999999999999765
No 27
>PF12955 DUF3844: Domain of unknown function (DUF3844); InterPro: IPR024382 This presumed domain is found in fungal species. It contains 8 largely conserved cysteine residues. This domain is found in proteins thought to be located in the endoplasmic reticulum.
Probab=93.33 E-value=0.044 Score=43.09 Aligned_cols=45 Identities=27% Similarity=0.742 Sum_probs=32.6
Q ss_pred CCCCCcccCCCCCCCCCCCCCeeecCCC-----CceeeCCC-------------CCCCCCCCCCC
Q psy592 29 CPATEAVCNAHSRLHCYPEHGVCVGSMH-----TAKCQCLP-------------GWSGTGCVTPT 75 (339)
Q Consensus 29 C~~~~~~C~~~~~~~pC~~~g~Cv~~~~-----~~~C~C~~-------------g~~G~~Ce~~~ 75 (339)
|-.+.+.|... .+.|..||.|+.... =|.|.|.+ .|.|+.|++.-
T Consensus 1 Cf~S~~aC~~~--Tn~CsgHG~C~~~~~~~~~~C~~C~C~~T~~~~~~~~~ktt~W~G~aCqKkD 63 (103)
T PF12955_consen 1 CFSSNDACENA--TNNCSGHGSCVKKYGSGGGDCFACKCKPTVVKTGSGKGKTTHWGGPACQKKD 63 (103)
T ss_pred CcCCHHHHHHh--ccCCCCCceEeeccCCCccceEEEEeeccccccccccCceeeeccccccccc
Confidence 33445667644 447999999999742 38999987 57888888763
No 28
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=91.40 E-value=0.14 Score=29.34 Aligned_cols=16 Identities=38% Similarity=1.149 Sum_probs=12.5
Q ss_pred CceeeCCCCCC----CCCCC
Q psy592 57 TAKCQCLPGWS----GTGCV 72 (339)
Q Consensus 57 ~~~C~C~~g~~----G~~Ce 72 (339)
+|+|.|++||. |..|.
T Consensus 1 sy~C~C~~Gy~l~~d~~~C~ 20 (24)
T PF12662_consen 1 SYTCSCPPGYQLSPDGRSCE 20 (24)
T ss_pred CEEeeCCCCCcCCCCCCccc
Confidence 68999999985 55664
No 29
>PF12273 RCR: Chitin synthesis regulation, resistance to Congo red; InterPro: IPR020999 RCR proteins are ER membrane proteins that regulate chitin deposition in fungal cell walls. Although chitin, a linear polymer of beta-1,4-linked N-acetylglucosamine, constitutes only 2% of the cell wall it plays a vital role in the overall protection of the cell wall against stress, noxious chemicals and osmotic pressure changes. Congo red is a cell wall-disrupting benzidine-type dye extensively used in many cell wall mutant studies that specifically targets chitin in yeast cells and inhibits growth. RCR proteins render the yeasts resistant to Congo red by diminishing the content of chitin in the cell wall []. RCR proteins are probably regulating chitin synthase III interact directly with ubiquitin ligase Rsp5, and the VPEY motif is necessary for this, via interaction with the WW domains of Rsp5 [].
Probab=87.45 E-value=0.61 Score=38.36 Aligned_cols=11 Identities=18% Similarity=0.311 Sum_probs=6.4
Q ss_pred ccCCCcccccc
Q psy592 248 EDDMTAFDITP 258 (339)
Q Consensus 248 E~d~~~yd~~~ 258 (339)
+.|..-||-..
T Consensus 72 ~~~~g~Yd~~g 82 (130)
T PF12273_consen 72 DNDPGYYDQQG 82 (130)
T ss_pred CCCCCCCCCCC
Confidence 55666677543
No 30
>KOG1214|consensus
Probab=87.26 E-value=0.49 Score=49.68 Aligned_cols=35 Identities=26% Similarity=0.698 Sum_probs=30.9
Q ss_pred cccCCCCCCCCCCCCCeeecCCCCceeeCCCCCCCC--CCC
Q psy592 34 AVCNAHSRLHCYPEHGVCVGSMHTAKCQCLPGWSGT--GCV 72 (339)
Q Consensus 34 ~~C~~~~~~~pC~~~g~Cv~~~~~~~C~C~~g~~G~--~Ce 72 (339)
+.|..+ .|...++|++..++|+|.|.+||.|. +|-
T Consensus 828 DeC~ps----rChp~A~CyntpgsfsC~C~pGy~GDGf~CV 864 (1289)
T KOG1214|consen 828 DECSPS----RCHPAATCYNTPGSFSCRCQPGYYGDGFQCV 864 (1289)
T ss_pred cccCcc----ccCCCceEecCCCcceeecccCccCCCceec
Confidence 788877 89999999999999999999999865 553
No 31
>PF00354 Pentaxin: Pentaxin family; InterPro: IPR001759 Pentaxins (or pentraxins) [, ] are a family of proteins which show, under electron microscopy, a discoid arrangement of five noncovalently bound subunits. Proteins of the pentaxin family are involved in acute immunological responses []. Three of the principal members of the pentaxin family are serum proteins: namely, C-reactive protein (CRP) [], serum amyloid P component protein (SAP) [], and female protein (FP) []. CRP is expressed during acute phase response to tissue injury or inflammation in mammals. The protein resembles antibody and performs several functions associated with host defence: it promotes agglutination, bacterial capsular swelling and phagocytosis, and activates the classical complement pathway through its calcium-dependent binding to phosphocholine. CRPs have also been sequenced in an invertebrate, Limulus polyphemus (Atlantic horseshoe crab), where they are a normal constituent of the hemolymph. SAP is a vertebrate protein that is a precursor of amyloid component P. It is found in all types of amyloid deposits, in glomerular basement menbrane and in elastic fibres in blood vessels. SAP binds to various lipoprotein ligands in a calcium-dependent manner, and it has been suggested that, in mammals, this may have important implications in atherosclerosis and amyloidosis. FP is a SAP homologue found in Mesocricetus auratus (Golden hamster). The concentration of this plasma protein is altered by sex steroids and stimuli that elicit an acute phase response. Pentaxin proteins expressed in the nervous system are neural pentaxin I (NPI) and II (NPII) []. NPI and NPII are homologous and can exist within one species. It is suggested that both proteins mediate the uptake of synaptic macromolecules and play a role in synaptic plasticity. Apexin, a sperm acrosomal protein, is a homologue of NPII found in Cavia porcellus (Guinea pig) []. PTX3 (or TSG-14) protein is a cytokine-induced protein that is homologous to CRPs and SAPs, but its function is not yet known.; PDB: 2A3W_F 3KQR_C 3D5O_D 2A3X_G 1SAC_D 2W08_B 1GYK_B 1LGN_A 2A3Y_A 1B09_D ....
Probab=86.96 E-value=5 Score=35.37 Aligned_cols=74 Identities=15% Similarity=0.246 Sum_probs=38.3
Q ss_pred cceEEEeccCCCCccccEEEEEEeecCC--ceEEEEEecCCC-CceEEEEEECCEEEEEEecCCCcccceeEEecccccC
Q psy592 83 QSYIKYALSFEPDKYTTQLQLKFRTREE--FGELFRLSDQHN-REYAILEIKDSKLRFRYNLNNLKTDEKDIWLNAIKDS 159 (339)
Q Consensus 83 ~s~v~y~~~~~~~~~~~~i~l~frT~~~--~GlL~~~~~~~~-~~~i~L~L~~G~l~~~~~lG~~~~~~~~l~~~~v~dG 159 (339)
..|+++............+-+++||... .+.||..+.... .+++.+.-..+.+.+. +++. ...+. ..+.+|
T Consensus 11 ~~yv~l~~~~~~pL~~fTvC~w~k~~~~~~~~tifSYat~~~~nell~~~~~~~~~~l~--i~~~---~~~~~-~~~~~~ 84 (195)
T PF00354_consen 11 TDYVRLKPSVPLPLSAFTVCFWVKTDDSSNDGTIFSYATSSQDNELLLFGSSSGSLRLY--INGS---SVSFS-GPIRDG 84 (195)
T ss_dssp SBEEEEEESS-S-BSEEEEEEEEEESGSGS-EEEEEEEETTEEEEEEEEEETTTEEEEE--ETTE---EEEEE-ECS-TS
T ss_pred cceEEEecCCCCCcccEEEEEEEEeccCCCceEEEEEccCCCCccEEEEEeCCceEEEE--ECCe---EeEec-cccCCC
Confidence 4677776543222446777888888765 688886654322 2343333334555444 4431 12222 356789
Q ss_pred ccc
Q psy592 160 KLR 162 (339)
Q Consensus 160 ~WH 162 (339)
+||
T Consensus 85 ~Wh 87 (195)
T PF00354_consen 85 QWH 87 (195)
T ss_dssp S-E
T ss_pred CcE
Confidence 999
No 32
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=85.29 E-value=0.46 Score=30.05 Aligned_cols=19 Identities=32% Similarity=0.981 Sum_probs=15.8
Q ss_pred CeeecCCCCceeeCCCCCC
Q psy592 49 GVCVGSMHTAKCQCLPGWS 67 (339)
Q Consensus 49 g~Cv~~~~~~~C~C~~g~~ 67 (339)
..|++..++|+|.|++||.
T Consensus 10 h~C~~~~g~~~C~C~~Gy~ 28 (36)
T PF14670_consen 10 HICVNTPGSYRCSCPPGYK 28 (36)
T ss_dssp SEEEEETTSEEEE-STTEE
T ss_pred CCCccCCCceEeECCCCCE
Confidence 3799999999999999984
No 33
>KOG4260|consensus
Probab=84.87 E-value=0.62 Score=42.83 Aligned_cols=33 Identities=27% Similarity=0.564 Sum_probs=28.3
Q ss_pred CCCCCCCeeecCC---CCceeeCCCCCCCCCCCCCC
Q psy592 43 HCYPEHGVCVGSM---HTAKCQCLPGWSGTGCVTPT 75 (339)
Q Consensus 43 ~pC~~~g~Cv~~~---~~~~C~C~~g~~G~~Ce~~~ 75 (339)
.||..+|.|..+. ++-+|.|.+||+|+.|.+-.
T Consensus 150 r~C~GnG~C~GdGsR~GsGkCkC~~GY~Gp~C~~Cg 185 (350)
T KOG4260|consen 150 RPCFGNGSCHGDGSREGSGKCKCETGYTGPLCRYCG 185 (350)
T ss_pred CCcCCCCcccCCCCCCCCCcccccCCCCCccccccc
Confidence 4898899999874 57899999999999998764
No 34
>smart00159 PTX Pentraxin / C-reactive protein / pentaxin family. This family form a doscoid pentameric structure. Human serum amyloid P demonstrates calcium-mediated ligand-binding.
Probab=84.64 E-value=12 Score=33.14 Aligned_cols=78 Identities=15% Similarity=0.186 Sum_probs=42.1
Q ss_pred ccccC---cceEEEeccCCCCccccEEEEEEeecC--CceEEEEEecCCC-CceEEEEEECCEEEEEEecCCCcccceeE
Q psy592 78 VTFKP---QSYIKYALSFEPDKYTTQLQLKFRTRE--EFGELFRLSDQHN-REYAILEIKDSKLRFRYNLNNLKTDEKDI 151 (339)
Q Consensus 78 ~~F~~---~s~v~y~~~~~~~~~~~~i~l~frT~~--~~GlL~~~~~~~~-~~~i~L~L~~G~l~~~~~lG~~~~~~~~l 151 (339)
+.|.. ..|+.+............+.+++|+.. ..+.||..+.... .+++...-.+|.+.+. +++ . .+
T Consensus 9 ~~fp~~s~~~yv~l~~~~~~~l~~fTvc~W~k~~~~~~~~~ifSy~~~~~~ne~~~~~~~~~~~~l~--i~g----~-~~ 81 (206)
T smart00159 9 FVFPKESDTSYVKLKPELPKPLQAFTVCLWFYSDLSPRGYSLFSYATKGQDNELLLYKEKQGEYSLY--IGG----K-KV 81 (206)
T ss_pred EECCCCCCCCeEEEccCCCCChhHEEEEEEEEecCCCCceEEEEEeCCCCCCeEEEEEcCCcEEEEE--EcC----e-EE
Confidence 44543 466666544332344677888888864 4567775544332 3333333344555444 333 1 22
Q ss_pred Ee-cccccCccc
Q psy592 152 WL-NAIKDSKLR 162 (339)
Q Consensus 152 ~~-~~v~dG~WH 162 (339)
.. ..+.+|+||
T Consensus 82 ~~~~~~~~g~W~ 93 (206)
T smart00159 82 QFPVPESDGKWH 93 (206)
T ss_pred EecccccCCceE
Confidence 22 367789999
No 35
>PF00053 Laminin_EGF: Laminin EGF-like (Domains III and V); InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and three short globular arms. The long arm consist of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below. +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=82.46 E-value=0.82 Score=30.68 Aligned_cols=24 Identities=33% Similarity=0.753 Sum_probs=18.3
Q ss_pred eeecCCCCceeeCCCCCCCCCCCCCC
Q psy592 50 VCVGSMHTAKCQCLPGWSGTGCVTPT 75 (339)
Q Consensus 50 ~Cv~~~~~~~C~C~~g~~G~~Ce~~~ 75 (339)
.|.. ....|.|.++|+|++|+.-.
T Consensus 12 ~C~~--~~G~C~C~~~~~G~~C~~C~ 35 (49)
T PF00053_consen 12 TCDP--STGQCVCKPGTTGPRCDQCK 35 (49)
T ss_dssp SEEE--TCEEESBSTTEESTTS-EE-
T ss_pred cccC--CCCEEeccccccCCcCcCCC
Confidence 5665 55689999999999999764
No 36
>PF04863 EGF_alliinase: Alliinase EGF-like domain; InterPro: IPR006947 Allicin is a thiosulphinate that gives rise to dithiines, allyl sulphides and ajoenes, the three groups of active compounds in Allium species. Allicin is synthesised from sulphoxide cysteine derivatives by alliinase, whose C-S lyase activity cleaves C(beta)-S(gamma) bonds. It is thought that this enzyme forms part of a primitive plant defence system [].; GO: 0016846 carbon-sulfur lyase activity; PDB: 1LK9_B 2HOX_C 2HOR_A.
Probab=82.38 E-value=0.4 Score=33.03 Aligned_cols=32 Identities=22% Similarity=0.456 Sum_probs=18.6
Q ss_pred CCCCCCeeecC----CCCceeeCCCCCCCCCCCCCC
Q psy592 44 CYPEHGVCVGS----MHTAKCQCLPGWSGTGCVTPT 75 (339)
Q Consensus 44 pC~~~g~Cv~~----~~~~~C~C~~g~~G~~Ce~~~ 75 (339)
+|..||+...+ .+...|.|..-|+|+.|++.+
T Consensus 18 ~CSGHGr~flDg~~~dG~p~CECn~Cy~GpdCS~~~ 53 (56)
T PF04863_consen 18 SCSGHGRAFLDGLIADGSPVCECNSCYGGPDCSTLI 53 (56)
T ss_dssp --TTSEE--TTS-EETTEE--EE-TTEESTTS-EE-
T ss_pred CcCCCCeeeeccccccCCccccccCCcCCCCcccCC
Confidence 78888988754 345889999999999998654
No 37
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=82.07 E-value=1.3 Score=39.61 Aligned_cols=37 Identities=22% Similarity=0.536 Sum_probs=27.2
Q ss_pred CCCCCCCcccCCCCCCCCCCCCCeeecCCCCceeeCCCCCCC
Q psy592 27 PGCPATEAVCNAHSRLHCYPEHGVCVGSMHTAKCQCLPGWSG 68 (339)
Q Consensus 27 ~gC~~~~~~C~~~~~~~pC~~~g~Cv~~~~~~~C~C~~g~~G 68 (339)
.-|. ..+.|... .++|. ..|....++|.|.|++||+.
T Consensus 182 ~~C~-~~~~C~~~--~~~c~--~~C~~~~g~~~c~c~~g~~~ 218 (224)
T cd01475 182 KICV-VPDLCATL--SHVCQ--QVCISTPGSYLCACTEGYAL 218 (224)
T ss_pred ccCc-CchhhcCC--CCCcc--ceEEcCCCCEEeECCCCccC
Confidence 4575 34677643 23675 47999999999999999974
No 38
>KOG1217|consensus
Probab=81.44 E-value=1.4 Score=43.25 Aligned_cols=36 Identities=31% Similarity=0.766 Sum_probs=30.3
Q ss_pred CcccCCCCCCCCCCCCCeeecCCCCceeeCCCCCCCCCC
Q psy592 33 EAVCNAHSRLHCYPEHGVCVGSMHTAKCQCLPGWSGTGC 71 (339)
Q Consensus 33 ~~~C~~~~~~~pC~~~g~Cv~~~~~~~C~C~~g~~G~~C 71 (339)
.+.|... . +|.++++|+...+.|.|.|++||+|..|
T Consensus 271 ~~~C~~~--~-~c~~~~~C~~~~~~~~C~C~~g~~g~~~ 306 (487)
T KOG1217|consen 271 VDSCALI--A-SCPNGGTCVNVPGSYRCTCPPGFTGRLC 306 (487)
T ss_pred ccccCCC--C-ccCCCCeeecCCCcceeeCCCCCCCCCC
Confidence 3556554 1 3999999999999999999999999998
No 39
>PF05568 ASFV_J13L: African swine fever virus J13L protein; InterPro: IPR008385 This family consists of several African swine fever virus (ASFV) j13L proteins [, , ].
Probab=81.27 E-value=4.2 Score=33.91 Aligned_cols=51 Identities=12% Similarity=0.280 Sum_probs=23.3
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCchhhcccccccCCCCCcc
Q psy592 196 SALFVTLLISPNQVLVLVFVVYSRRREAHIKYPGPDDDVRENIINYDDEGGGED 249 (339)
Q Consensus 196 ~~il~~~~~l~~l~~~l~~~~~~~~~~~~~~~~~~~~d~reni~~Y~eEggGE~ 249 (339)
+.||+.|++++ +++++++.++.+||++.. -...++|++ =|=.|+|..-+|-
T Consensus 32 ~tILiaIvVli-iiiivli~lcssRKkKaa-AAi~eediQ-finpyqDqqw~~v 82 (189)
T PF05568_consen 32 YTILIAIVVLI-IIIIVLIYLCSSRKKKAA-AAIEEEDIQ-FINPYQDQQWAGV 82 (189)
T ss_pred HHHHHHHHHHH-HHHHHHHHHHhhhhHHHH-hhhhhhccc-ccCcccchhhccC
Confidence 56676666651 233444445555554321 123344553 2334666433333
No 40
>PF12946 EGF_MSP1_1: MSP1 EGF domain 1; InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=80.14 E-value=0.77 Score=29.13 Aligned_cols=24 Identities=25% Similarity=0.649 Sum_probs=17.7
Q ss_pred CCCCCCeeecCC-CCceeeCCCCCC
Q psy592 44 CYPEHGVCVGSM-HTAKCQCLPGWS 67 (339)
Q Consensus 44 pC~~~g~Cv~~~-~~~~C~C~~g~~ 67 (339)
+|..++.|+... +.+.|.|.+||.
T Consensus 6 ~cP~NA~C~~~~dG~eecrCllgyk 30 (37)
T PF12946_consen 6 KCPANAGCFRYDDGSEECRCLLGYK 30 (37)
T ss_dssp ---TTEEEEEETTSEEEEEE-TTEE
T ss_pred cCCCCcccEEcCCCCEEEEeeCCcc
Confidence 888899999987 679999999984
No 41
>PF00954 S_locus_glycop: S-locus glycoprotein family; InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=79.70 E-value=2 Score=34.07 Aligned_cols=32 Identities=28% Similarity=0.807 Sum_probs=23.9
Q ss_pred CcccCCCCCCCCCCCCCeeecCCCCceeeCCCCCCC
Q psy592 33 EAVCNAHSRLHCYPEHGVCVGSMHTAKCQCLPGWSG 68 (339)
Q Consensus 33 ~~~C~~~~~~~pC~~~g~Cv~~~~~~~C~C~~g~~G 68 (339)
.+.|... ..|+..|.|... ...+|.|.+||.-
T Consensus 77 ~d~Cd~y---~~CG~~g~C~~~-~~~~C~Cl~GF~P 108 (110)
T PF00954_consen 77 KDQCDVY---GFCGPNGICNSN-NSPKCSCLPGFEP 108 (110)
T ss_pred ccCCCCc---cccCCccEeCCC-CCCceECCCCcCC
Confidence 3577653 389999999654 5667999999953
No 42
>PF06679 DUF1180: Protein of unknown function (DUF1180); InterPro: IPR009565 This entry consists of several hypothetical eukaryotic proteins thought to be membrane proteins. Their function is unknown.
Probab=78.77 E-value=3.2 Score=35.58 Aligned_cols=18 Identities=28% Similarity=0.492 Sum_probs=12.7
Q ss_pred CCCCCccCCCcccccccC
Q psy592 243 DEGGGEDDMTAFDITPLQ 260 (339)
Q Consensus 243 eEggGE~d~~~yd~~~L~ 260 (339)
||...|||++=||...+|
T Consensus 145 ~~ddedeD~TvFd~~~~r 162 (163)
T PF06679_consen 145 EEDDEDEDSTVFDANHPR 162 (163)
T ss_pred CCCccccccceeeecccc
Confidence 444567777889987765
No 43
>KOG1225|consensus
Probab=78.44 E-value=1.6 Score=44.25 Aligned_cols=27 Identities=30% Similarity=0.839 Sum_probs=23.8
Q ss_pred CCCCCCeeecCCCCceeeCCCCCCCCCCCCC
Q psy592 44 CYPEHGVCVGSMHTAKCQCLPGWSGTGCVTP 74 (339)
Q Consensus 44 pC~~~g~Cv~~~~~~~C~C~~g~~G~~Ce~~ 74 (339)
.|.++|+|+ .-+|.|.+||+|..|++.
T Consensus 317 dC~g~G~Ci----~G~C~C~~Gy~G~~C~~~ 343 (525)
T KOG1225|consen 317 DCSGHGKCI----DGECLCDEGYTGELCIQR 343 (525)
T ss_pred cCCCCCccc----CCceEeCCCCcCCccccc
Confidence 798999999 347999999999999985
No 44
>PF15330 SIT: SHP2-interacting transmembrane adaptor protein, SIT
Probab=77.18 E-value=5.9 Score=31.53 Aligned_cols=15 Identities=7% Similarity=0.224 Sum_probs=7.0
Q ss_pred HHHHHHHHHHHHHhh
Q psy592 208 QVLVLVFVVYSRRRE 222 (339)
Q Consensus 208 l~~~l~~~~~~~~~~ 222 (339)
|++++-++.|+..++
T Consensus 13 l~l~asl~~wr~~~r 27 (107)
T PF15330_consen 13 LSLAASLLAWRMKQR 27 (107)
T ss_pred HHHHHHHHHHHHHhh
Confidence 334444455554443
No 45
>PF01102 Glycophorin_A: Glycophorin A; InterPro: IPR001195 Proteins in this group are responsible for the molecular basis of the blood group antigens, surface markers on the outside of the red blood cell membrane. Most of these markers are proteins, but some are carbohydrates attached to lipids or proteins [Reid M.E., Lomas-Francis C. The Blood Group Antigen FactsBook Academic Press, London / San Diego, (1997)]. Glycophorin A (PAS-2) and glycophorin B (PAS-3) belong to the MNS blood group system and are associated with antigens that include M/N, S/s, U, He, Mi(a), M(c), Vw, Mur, M(g), Vr, M(e), Mt(a), St(a), Ri(a), Cl(a), Ny(a), Hut, Hil, M(v), Far, Mit, Dantu, Hop, Nob, En(a), ENKT, amongst others. Glycophorin A is the major sialoglycoprotein of the erythrocyte membrane []. Structurally, glycophorin A consists of an N-terminal extracellular domain, heavily glycosylated on serine and threonine residues, followed by a transmembrane region and a C-terminal cytoplasmic domain. Other glycophorins in this entry such as Glycophorin B and Glycophorin E represent minor sialoglycoproteins in the erythrocyte membrane.; GO: 0016021 integral to membrane; PDB: 2KPF_B 1AFO_B 2KPE_A.
Probab=76.95 E-value=3.8 Score=33.38 Aligned_cols=9 Identities=11% Similarity=0.386 Sum_probs=3.0
Q ss_pred HHHHHHHHh
Q psy592 213 VFVVYSRRR 221 (339)
Q Consensus 213 ~~~~~~~~~ 221 (339)
.|+++|+||
T Consensus 85 ~y~irR~~K 93 (122)
T PF01102_consen 85 SYCIRRLRK 93 (122)
T ss_dssp HHHHHHHS-
T ss_pred HHHHHHHhc
Confidence 333333333
No 46
>cd00152 PTX Pentraxins are plasma proteins characterized by their pentameric discoid assembly and their Ca2+ dependent ligand binding, such as Serum amyloid P component (SAP) and C-reactive Protein (CRP), which are cytokine-inducible acute-phase proteins implicated in innate immunity. CRP binds to ligands containing phosphocholine, SAP binds to amyloid fibrils, DNA, chromatin, fibronectin, C4-binding proteins and glycosaminoglycans. "Long" pentraxins have N-terminal extensions to the common pentraxin domain; one group, the neuronal pentraxins, may be involved in synapse formation and remodeling, and they may also be able to form heteromultimers.
Probab=76.46 E-value=35 Score=29.94 Aligned_cols=80 Identities=18% Similarity=0.110 Sum_probs=42.8
Q ss_pred CccccC---cceEEEeccCCCCccccEEEEEEeecC--CceEEEEEecCCCC-ceEEEEEECCEEEEEEecCCCccccee
Q psy592 77 PVTFKP---QSYIKYALSFEPDKYTTQLQLKFRTRE--EFGELFRLSDQHNR-EYAILEIKDSKLRFRYNLNNLKTDEKD 150 (339)
Q Consensus 77 ~~~F~~---~s~v~y~~~~~~~~~~~~i~l~frT~~--~~GlL~~~~~~~~~-~~i~L~L~~G~l~~~~~lG~~~~~~~~ 150 (339)
.+.|.. ..|+.+............+.+++|+.. ..+.||........ +++...-..|.+.+.+ ++. ...
T Consensus 8 ~l~f~~~s~~~yv~l~~~~~~~l~~fTv~~Wv~~~~~~~~~~ifSy~~~~~~~~~~l~~~~~g~~~~~i--~~~---~~~ 82 (201)
T cd00152 8 VFVFPKESDTSYVKLKPELPKPLQAFTLCLWVYTDLSTREYSLFSYATKGQDNELLLYKEKDGGYSLYI--GGK---EVT 82 (201)
T ss_pred EEECCCCCCCceEEEccCCCCChhhEEEEEEEEecCCCCCeEEEEEeCCCCCCeEEEEEcCCCeEEEEE--cCE---EEE
Confidence 345544 356666544332344677888888764 45667755443223 3333333456666654 321 222
Q ss_pred EEecccccCccc
Q psy592 151 IWLNAIKDSKLR 162 (339)
Q Consensus 151 l~~~~v~dG~WH 162 (339)
+. .+..+|+||
T Consensus 83 ~~-~~~~~g~W~ 93 (201)
T cd00152 83 FK-VPESDGAWH 93 (201)
T ss_pred Ee-ccCCCCCEE
Confidence 22 345889999
No 47
>TIGR01478 STEVOR variant surface antigen, stevor family. This model represents the stevor branch of the rifin/stevor family (pfam02009) of predicted variant surface antigens as found in Plasmodium falciparum. This model is based on a set of stevor sequences kindly provided by Matt Berriman from the Sanger Center. This is a global model and assesses a penalty for incomplete sequence. Additional fragmentary sequences may be found with the fragment model and a cutoff of 8 bits.
Probab=75.95 E-value=3.3 Score=38.48 Aligned_cols=33 Identities=24% Similarity=0.552 Sum_probs=15.9
Q ss_pred ccccchHHHHHHHHHHHHHHHHHHHHHHHHhhhc
Q psy592 191 FVKTGSALFVTLLISPNQVLVLVFVVYSRRREAH 224 (339)
Q Consensus 191 ~~~~g~~il~~~~~l~~l~~~l~~~~~~~~~~~~ 224 (339)
+.+-|||+|+.|++. +++++|-+++|||||+.+
T Consensus 258 F~Pcgiaalvllil~-vvliiLYiWlyrrRK~sw 290 (295)
T TIGR01478 258 FLPYGIAALVLIILT-VVLIILYIWLYRRRKKSW 290 (295)
T ss_pred hcccHHHHHHHHHHH-HHHHHHHHHHHHhhcccc
Confidence 344456666554443 123333445566665443
No 48
>PF13385 Laminin_G_3: Concanavalin A-like lectin/glucanases superfamily; PDB: 4DQA_A 1N1Y_A 1MZ6_A 1MZ5_A 1N1S_A 2A75_A 1WCS_A 1N1T_A 1N1V_A 2FHR_A ....
Probab=75.94 E-value=26 Score=28.02 Aligned_cols=78 Identities=17% Similarity=0.092 Sum_probs=37.8
Q ss_pred ccCcceEEEeccCCCCccccEEEEEEeecCCce---EEEEEecCCCCceEEEEEE-CCEEEEEEecCCCcccceeEE-ec
Q psy592 80 FKPQSYIKYALSFEPDKYTTQLQLKFRTREEFG---ELFRLSDQHNREYAILEIK-DSKLRFRYNLNNLKTDEKDIW-LN 154 (339)
Q Consensus 80 F~~~s~v~y~~~~~~~~~~~~i~l~frT~~~~G---lL~~~~~~~~~~~i~L~L~-~G~l~~~~~lG~~~~~~~~l~-~~ 154 (339)
|++++|+.+.....+ .....|++.||...... .++. .....+...+.+. +|.+.+.+..+.. ....+. ..
T Consensus 6 ~g~~~~i~~~~~~~~-~~~fTi~~w~~~~~~~~~~~~~~~--~~~~~~~~~l~~~~~~~l~~~~~~~~~--~~~~~~~~~ 80 (157)
T PF13385_consen 6 DGSNDYISIPNSDFP-SGSFTISFWVKPDSPSSSQSFVFM--DSSGSGGFGLFINNNGRLRFYIGNGGG--GNYSFSSDS 80 (157)
T ss_dssp -STT-EEEEESGGGG-GTEEEEEEEEEESS--SSEEEEEE--SSSSSEEEEEEEETTSEEEEEETTSEE--ESS-EE-BS
T ss_pred CCCCCEEEECCcCCC-CCCEEEEEEEEeCCCCCCceEEEE--ecCCCCEEEEEEECCCEEEEEEeCCCc--eeEEEecCc
Confidence 345788888753222 44678888888765432 3333 2112345555555 5767766554431 112232 34
Q ss_pred ccccCccc
Q psy592 155 AIKDSKLR 162 (339)
Q Consensus 155 ~v~dG~WH 162 (339)
.+..++||
T Consensus 81 ~~~~~~W~ 88 (157)
T PF13385_consen 81 NLPDNKWH 88 (157)
T ss_dssp ---TT-EE
T ss_pred ccCCCCEE
Confidence 67779999
No 49
>KOG1217|consensus
Probab=75.50 E-value=2.8 Score=41.14 Aligned_cols=39 Identities=28% Similarity=0.625 Sum_probs=32.8
Q ss_pred cccCCCCCCCCCCCCCeeecCCCCceeeCCCCCCCCCCCCC
Q psy592 34 AVCNAHSRLHCYPEHGVCVGSMHTAKCQCLPGWSGTGCVTP 74 (339)
Q Consensus 34 ~~C~~~~~~~pC~~~g~Cv~~~~~~~C~C~~g~~G~~Ce~~ 74 (339)
+.|... ..+|.+++.|.+...+|.|.|+++|+|..|+..
T Consensus 170 ~~C~~~--~~~c~~~~~C~~~~~~~~C~c~~~~~~~~~~~~ 208 (487)
T KOG1217|consen 170 DECIQY--SSPCQNGGTCVNTGGSYLCSCPPGYTGSTCETT 208 (487)
T ss_pred cccccC--CCCcCCCcccccCCCCeeEeCCCCccCCcCcCC
Confidence 466633 237999999999999999999999999999865
No 50
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=75.45 E-value=3 Score=28.12 Aligned_cols=19 Identities=32% Similarity=0.744 Sum_probs=16.1
Q ss_pred CCceeeCCCCCCCCCCCCC
Q psy592 56 HTAKCQCLPGWSGTGCVTP 74 (339)
Q Consensus 56 ~~~~C~C~~g~~G~~Ce~~ 74 (339)
..-+|.|.++++|..|++-
T Consensus 17 ~~G~C~C~~~~~G~~C~~C 35 (50)
T cd00055 17 GTGQCECKPNTTGRRCDRC 35 (50)
T ss_pred CCCEEeCCCcCCCCCCCCC
Confidence 3457999999999999976
No 51
>PTZ00370 STEVOR; Provisional
Probab=73.86 E-value=4 Score=38.03 Aligned_cols=31 Identities=26% Similarity=0.495 Sum_probs=14.9
Q ss_pred cccchHHHHHHHHHHHHHHHHHHHHHHHHhhh
Q psy592 192 VKTGSALFVTLLISPNQVLVLVFVVYSRRREA 223 (339)
Q Consensus 192 ~~~g~~il~~~~~l~~l~~~l~~~~~~~~~~~ 223 (339)
.+-|||+|+.+++. +++++|-+++||||++.
T Consensus 255 ~Pygiaalvllil~-vvliilYiwlyrrRK~s 285 (296)
T PTZ00370 255 YPYGIAALVLLILA-VVLIILYIWLYRRRKNS 285 (296)
T ss_pred cccHHHHHHHHHHH-HHHHHHHHHHHHhhcch
Confidence 44456655544333 13334444566666553
No 52
>PF05399 EVI2A: Ectropic viral integration site 2A protein (EVI2A); InterPro: IPR008608 This family contains several mammalian ectropic viral integration site 2A (EVI2A) proteins. The function of this protein is unknown although it is thought to be a membrane protein and may function as an oncogene in retrovirus induced myeloid tumours [, ].; GO: 0016021 integral to membrane
Probab=72.69 E-value=6.6 Score=34.87 Aligned_cols=14 Identities=21% Similarity=0.335 Sum_probs=9.0
Q ss_pred hHHHHHHHHHHHHH
Q psy592 196 SALFVTLLISPNQV 209 (339)
Q Consensus 196 ~~il~~~~~l~~l~ 209 (339)
.|+|+|++++|+|+
T Consensus 128 ~amLIClIIIAVLf 141 (227)
T PF05399_consen 128 MAMLICLIIIAVLF 141 (227)
T ss_pred hhHHHHHHHHHHHH
Confidence 47777777764443
No 53
>smart00051 DSL delta serrate ligand.
Probab=72.32 E-value=2.9 Score=29.91 Aligned_cols=24 Identities=33% Similarity=0.790 Sum_probs=16.1
Q ss_pred CCCCCeeecCCCCceeeCCCCCCCCCC
Q psy592 45 YPEHGVCVGSMHTAKCQCLPGWSGTGC 71 (339)
Q Consensus 45 C~~~g~Cv~~~~~~~C~C~~g~~G~~C 71 (339)
+..+.+|-. .-.|.|.+||+|+.|
T Consensus 40 ~~~~~~Cd~---~G~~~C~~Gw~G~~C 63 (63)
T smart00051 40 FFGHYTCDE---NGNKGCLEGWMGPYC 63 (63)
T ss_pred ccCCccCCc---CCCEecCCCCcCCCC
Confidence 334455632 235779999999988
No 54
>PF02009 Rifin_STEVOR: Rifin/stevor family; InterPro: IPR002858 Malaria is still a major cause of mortality in many areas of the world. Plasmodium falciparum causes the most severe human form of the disease and is responsible for most fatalities. Severe cases of malaria can occur when the parasite invades and then proliferates within red blood cell erythrocytes. The parasite produces many variant antigenic proteins, encoded by multigene families, which are present on the surface of the infected erythrocyte and play important roles in virulence. A crucial survival mechanism for the malaria parasite is its ability to evade the immune response by switching these variant surface antigens. The high virulence of P. falciparum relative to other malarial parasites is in large part due to the fact that in this organism many of these surface antigens mediate the binding of infected erythrocytes to the vascular endothelium (cytoadherence) and non-infected erythrocytes (rosetting). This can lead to the accumulation of infected cells in the vasculature of a variety of organs, blocking the blood flow and reducing the oxygen supply. Clinical symptoms of severe infection can include fever, progressive anaemia, multi-organ dysfunction and coma. For more information see []. Several multicopy gene families have been described in Plasmodium falciparum, including the stevor family of subtelomeric open reading frames and the rif interspersed repetitive elements. Both families contain three predicted transmembrane segments. It has been proposed that stevor and rif are members of a larger superfamily that code for variant surface antigens [].
Probab=72.11 E-value=4.1 Score=38.50 Aligned_cols=14 Identities=21% Similarity=0.738 Sum_probs=6.8
Q ss_pred HHHHHHHHHHHHHh
Q psy592 208 QVLVLVFVVYSRRR 221 (339)
Q Consensus 208 l~~~l~~~~~~~~~ 221 (339)
|+||+++++||.||
T Consensus 270 LIMvIIYLILRYRR 283 (299)
T PF02009_consen 270 LIMVIIYLILRYRR 283 (299)
T ss_pred HHHHHHHHHHHHHH
Confidence 44445555555444
No 55
>PF15102 TMEM154: TMEM154 protein family
Probab=72.06 E-value=1.2 Score=37.23 Aligned_cols=14 Identities=14% Similarity=0.418 Sum_probs=6.2
Q ss_pred cHHHHHHHHHHHhc
Q psy592 288 NVGIFIEEHKKRAD 301 (339)
Q Consensus 288 ~~~~~i~~~~~~ad 301 (339)
++..+++..=+.||
T Consensus 129 eldkwm~s~n~n~~ 142 (146)
T PF15102_consen 129 ELDKWMNSMNRNAD 142 (146)
T ss_pred HHHhHHHhhccCcc
Confidence 34445544433443
No 56
>KOG1226|consensus
Probab=70.44 E-value=4.1 Score=42.69 Aligned_cols=27 Identities=41% Similarity=1.114 Sum_probs=20.0
Q ss_pred CCCCCCeeecCCCCceeeCCCCCCCCCCCCC
Q psy592 44 CYPEHGVCVGSMHTAKCQCLPGWSGTGCVTP 74 (339)
Q Consensus 44 pC~~~g~Cv~~~~~~~C~C~~g~~G~~Ce~~ 74 (339)
-|..+|+|.-. .|.|.+||+|..|+-.
T Consensus 556 lC~g~G~C~CG----~CvC~~GwtG~~C~C~ 582 (783)
T KOG1226|consen 556 LCGGHGRCECG----RCVCNPGWTGSACNCP 582 (783)
T ss_pred ccCCCCeEeCC----cEEcCCCCccCCCCCC
Confidence 46666666543 6999999999988754
No 57
>PF01034 Syndecan: Syndecan domain; InterPro: IPR001050 The syndecans are transmembrane proteoglycans which are involved in the organisation of cytoskeleton and/or actin microfilaments, and have important roles as cell surface receptors during cell-cell and/or cell-matrix interactions [, ]. Structurally, these proteins consist of four separate domains: A signal sequence; An extracellular domain (ectodomain) of variable length whose sequence is not evolutionary conserved in the various forms of syndecans. The ectodomain contains the sites of attachment of the heparan sulphate glycosaminoglycan side chains; A transmembrane region; A highly conserved cytoplasmic domain of about 30 to 35 residues, which could interact with cytoskeletal proteins. The proteins known to belong to this family are: Syndecan 1. Syndecan 2 or fibroglycan. Syndecan 3 or neuroglycan or N-syndecan. Syndecan 4 or amphiglycan or ryudocan. Drosophila syndecan. Caenorhabditis elegans probable syndecan (F57C7.3). Syndecan-4, a transmembrane heparan sulphate proteoglycan, is a coreceptor with integrins in cell adhesion. It has been suggested to form a ternary signalling complex with protein kinase Calpha and phosphatidylinositol 4,5-bisphosphate (PIP2). Structural studies have demonstrated that the cytoplasmic domain undergoes a conformational transition and forms a symmetric dimer in the presence of phospholipid activator PIP2, and whose overall structure in solution exhibits a twisted clamp shape having a cavity in the centre of dimeric interface. In addition, it has been observed that the syndecan-4 variable domain interacts, strongly, not only with fatty acyl groups but also the anionic head group of PIP2. These findings indicate that PIP2 promotes oligomerisation of the syndecan-4 cytoplasmic domain for transmembrane signalling and cell-matrix adhesion [, ].; GO: 0008092 cytoskeletal protein binding, 0016020 membrane; PDB: 1EJQ_B 1EJP_B 1YBO_C 1OBY_Q.
Probab=70.01 E-value=1.5 Score=31.31 Aligned_cols=15 Identities=27% Similarity=0.879 Sum_probs=0.6
Q ss_pred HHHHHHHHHHHHhhh
Q psy592 209 VLVLVFVVYSRRREA 223 (339)
Q Consensus 209 ~~~l~~~~~~~~~~~ 223 (339)
+++++|++||-|++.
T Consensus 26 ilLIlf~iyR~rkkd 40 (64)
T PF01034_consen 26 ILLILFLIYRMRKKD 40 (64)
T ss_dssp -----------S---
T ss_pred HHHHHHHHHHHHhcC
Confidence 344556677655543
No 58
>KOG1225|consensus
Probab=66.31 E-value=7.4 Score=39.59 Aligned_cols=27 Identities=33% Similarity=0.897 Sum_probs=19.5
Q ss_pred CCCCCeeecCCCCceeeCCCCCCCCCCCCCC
Q psy592 45 YPEHGVCVGSMHTAKCQCLPGWSGTGCVTPT 75 (339)
Q Consensus 45 C~~~g~Cv~~~~~~~C~C~~g~~G~~Ce~~~ 75 (339)
|.++|.|++- .|+|++||+|..|++..
T Consensus 256 c~~~g~c~~G----~CIC~~Gf~G~dC~e~~ 282 (525)
T KOG1225|consen 256 CTGRGQCVEG----RCICPPGFTGDDCDELV 282 (525)
T ss_pred CcccceEeCC----eEeCCCCCcCCCCCccc
Confidence 4444555553 69999999999998854
No 59
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domai.
Probab=65.05 E-value=4.8 Score=26.65 Aligned_cols=18 Identities=33% Similarity=0.827 Sum_probs=15.6
Q ss_pred CceeeCCCCCCCCCCCCC
Q psy592 57 TAKCQCLPGWSGTGCVTP 74 (339)
Q Consensus 57 ~~~C~C~~g~~G~~Ce~~ 74 (339)
+-.|.|.++++|++|+.-
T Consensus 17 ~G~C~C~~~~~G~~C~~C 34 (46)
T smart00180 17 TGQCECKPNVTGRRCDRC 34 (46)
T ss_pred CCEEECCCCCCCCCCCcC
Confidence 448999999999999965
No 60
>PF08693 SKG6: Transmembrane alpha-helix domain; InterPro: IPR014805 SKG6 and AXL2 are membrane proteins that show polarised intracellular localisation [, ]. This entry represents the highly conserved transmembrane alpha-helical domain found in these proteins [, ]. The full-length AXL2 protein has a negative regulatory function in cytokinesis [].
Probab=62.58 E-value=1.6 Score=28.25 Aligned_cols=9 Identities=33% Similarity=0.766 Sum_probs=3.7
Q ss_pred HHHHHHHHH
Q psy592 211 VLVFVVYSR 219 (339)
Q Consensus 211 ~l~~~~~~~ 219 (339)
++++++|+|
T Consensus 30 ~~l~~~~rR 38 (40)
T PF08693_consen 30 AFLFFWYRR 38 (40)
T ss_pred HHhheEEec
Confidence 333444444
No 61
>PF14575 EphA2_TM: Ephrin type-A receptor 2 transmembrane domain; PDB: 3KUL_A 2XVD_A 2VX1_A 2VWV_A 2VX0_A 2VWY_A 2VWZ_A 2VWW_A 2VWU_A 2VWX_A ....
Probab=59.70 E-value=4.5 Score=29.98 Aligned_cols=13 Identities=46% Similarity=0.713 Sum_probs=4.9
Q ss_pred HHHHHHHHHHHHH
Q psy592 208 QVLVLVFVVYSRR 220 (339)
Q Consensus 208 l~~~l~~~~~~~~ 220 (339)
|+++++++++.||
T Consensus 14 l~~v~~~~~~~rr 26 (75)
T PF14575_consen 14 LVLVIIVIVCFRR 26 (75)
T ss_dssp HHHHHHHHCCCTT
T ss_pred HHhheeEEEEEee
Confidence 3333333333333
No 62
>KOG4260|consensus
Probab=57.13 E-value=6.9 Score=36.20 Aligned_cols=39 Identities=23% Similarity=0.468 Sum_probs=30.6
Q ss_pred CCCCCCCcccCCCCCCCCCCCCCeeecCCCCceeeCCCCCCC
Q psy592 27 PGCPATEAVCNAHSRLHCYPEHGVCVGSMHTAKCQCLPGWSG 68 (339)
Q Consensus 27 ~gC~~~~~~C~~~~~~~pC~~~g~Cv~~~~~~~C~C~~g~~G 68 (339)
.+|. ..+.|... .+||..+-.|++.-++|+|.+.+||.+
T Consensus 231 ~gCv-DvnEC~~e--p~~c~~~qfCvNteGSf~C~dk~Gy~~ 269 (350)
T KOG4260|consen 231 EGCV-DVNECQNE--PAPCKAHQFCVNTEGSFKCEDKEGYKK 269 (350)
T ss_pred cccc-cHHHHhcC--CCCCChhheeecCCCceEecccccccC
Confidence 4553 35677654 458988889999999999999999865
No 63
>KOG3509|consensus
Probab=56.90 E-value=13 Score=40.44 Aligned_cols=117 Identities=10% Similarity=0.020 Sum_probs=59.0
Q ss_pred CCCCCCCeeecCC-CCceeeCCCCCCCCCCCCCCCCc--cccC-----cceEEEeccCCC------------CccccEEE
Q psy592 43 HCYPEHGVCVGSM-HTAKCQCLPGWSGTGCVTPTSPV--TFKP-----QSYIKYALSFEP------------DKYTTQLQ 102 (339)
Q Consensus 43 ~pC~~~g~Cv~~~-~~~~C~C~~g~~G~~Ce~~~~~~--~F~~-----~s~v~y~~~~~~------------~~~~~~i~ 102 (339)
++|+-+++|.... .+|.|.|+..-.+..|+...... .|.. ...+.|-.+... ...+..-.
T Consensus 178 ~~~q~g~tC~~~~~~~~~~~~~~~~~~~~c~~~~~r~~~~f~~~~~g~~~~~~~~vp~~~e~S~~~~~~~h~~~~~~~~~ 257 (964)
T KOG3509|consen 178 PVCQGGATCEVRNGKGYSLECPDCKVRVVCEACKPRAFCPFEKSVEGCLKCFCFGVPRPSESSLHAFRAIHGATLHVDSL 257 (964)
T ss_pred cccccceeEEecCCcceeeeccccccceehhhccCceecccccccccccceeecCCCccccchhhhHhhhccchhccchh
Confidence 3899999998875 58888888776666677653110 0111 111112111100 00111112
Q ss_pred EEEeecCC--ceEEEEEecCCCCceEEEEEECCEEEEEEecCCCcccceeEEe-cccccCccc
Q psy592 103 LKFRTREE--FGELFRLSDQHNREYAILEIKDSKLRFRYNLNNLKTDEKDIWL-NAIKDSKLR 162 (339)
Q Consensus 103 l~frT~~~--~GlL~~~~~~~~~~~i~L~L~~G~l~~~~~lG~~~~~~~~l~~-~~v~dG~WH 162 (339)
..|++... +.++.+.......+|+.+.+..|.+.+++..+.. ...+.. .+..+|+||
T Consensus 258 ~~~~~~s~~~~~~~~~~~~~~~~~f~~lt~~~g~~g~~~~~~~~---~~~~~~~~~~~~~E~~ 317 (964)
T KOG3509|consen 258 GVFFSDSEHRDILGNFLFSSFKDGFRALTLDGGTDGVRYDCGLP---QREDRLDVTSYIGEWR 317 (964)
T ss_pred eeeccccccccccccccccccccceeeeccCCCCccccccccCc---chhhhhccccccceee
Confidence 22333332 3344444433346788888877777777766542 222222 345567777
No 64
>PF05923 APC_crr: APC cysteine-rich region; InterPro: IPR009223 This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin [].; GO: 0016055 Wnt receptor signaling pathway; PDB: 1T08_C 1TH1_C 1V18_B.
Probab=56.05 E-value=4.1 Score=23.75 Aligned_cols=21 Identities=48% Similarity=0.540 Sum_probs=1.3
Q ss_pred CcceeeeecCCC---CCccccccc
Q psy592 310 DDLRNYAYEGGG---STAGSLSSL 330 (339)
Q Consensus 310 Dsl~~y~~EG~g---S~agSlssl 330 (339)
|.++.|.-|+-= |.|.|||||
T Consensus 2 d~~~~~~vE~TP~~fS~~sSlSsL 25 (26)
T PF05923_consen 2 DSPKKYCVEDTPACFSRASSLSSL 25 (26)
T ss_dssp --------SS--------------
T ss_pred CccccceecCCCccccccccccCC
Confidence 456667766644 677888887
No 65
>PF15050 SCIMP: SCIMP protein
Probab=55.02 E-value=20 Score=28.91 Aligned_cols=11 Identities=27% Similarity=0.670 Sum_probs=5.6
Q ss_pred CCchhhccccc
Q psy592 230 PDDDVRENIIN 240 (339)
Q Consensus 230 ~~~d~reni~~ 240 (339)
++|.+=||+++
T Consensus 53 deEkmYENv~n 63 (133)
T PF15050_consen 53 DEEKMYENVLN 63 (133)
T ss_pred cHHHHHHHhhc
Confidence 34445566653
No 66
>KOG1214|consensus
Probab=53.90 E-value=7.7 Score=41.22 Aligned_cols=36 Identities=25% Similarity=0.686 Sum_probs=29.3
Q ss_pred cCCCCCCCCCCCCCeeecCCCCceeeCCCCCCC--CCCCC
Q psy592 36 CNAHSRLHCYPEHGVCVGSMHTAKCQCLPGWSG--TGCVT 73 (339)
Q Consensus 36 C~~~~~~~pC~~~g~Cv~~~~~~~C~C~~g~~G--~~Ce~ 73 (339)
|+.+ -+.|..++.|.+...+|.|+|..||+| ++|-.
T Consensus 417 C~~n--~hqCs~HA~CtdyatgFcC~C~~gy~GnGr~C~~ 454 (1289)
T KOG1214|consen 417 CEHN--HHQCSRHAFCTDYATGFCCHCQSGYYGNGRHCLP 454 (1289)
T ss_pred hhcc--cccccccceeccCCCceeeeecccccCCCceecc
Confidence 5544 357888899999999999999999975 57854
No 67
>PTZ00046 rifin; Provisional
Probab=51.56 E-value=14 Score=35.75 Aligned_cols=15 Identities=20% Similarity=0.716 Sum_probs=7.5
Q ss_pred HHHHHHHHHHHHHhh
Q psy592 208 QVLVLVFVVYSRRRE 222 (339)
Q Consensus 208 l~~~l~~~~~~~~~~ 222 (339)
|+||++++++|-||+
T Consensus 329 LIMvIIYLILRYRRK 343 (358)
T PTZ00046 329 LIMVIIYLILRYRRK 343 (358)
T ss_pred HHHHHHHHHHHhhhc
Confidence 455555555554443
No 68
>KOG1836|consensus
Probab=51.08 E-value=12 Score=43.29 Aligned_cols=32 Identities=28% Similarity=0.690 Sum_probs=27.2
Q ss_pred CCCCCCeeecCC--CCceee-CCCCCCCCCCCCCC
Q psy592 44 CYPEHGVCVGSM--HTAKCQ-CLPGWSGTGCVTPT 75 (339)
Q Consensus 44 pC~~~g~Cv~~~--~~~~C~-C~~g~~G~~Ce~~~ 75 (339)
||.+++.|.... ....|. |++||+|.+|+...
T Consensus 781 ~Cp~~~~~~~~~~~~~~iCk~Cp~gytG~rCe~c~ 815 (1705)
T KOG1836|consen 781 PCPNGGACGQTPEILEVVCKNCPPGYTGLRCEECA 815 (1705)
T ss_pred CCCCChhhcCcCcccceecCCCCCCCcccccccCC
Confidence 788888898764 578899 99999999999764
No 69
>PF02973 Sialidase: Sialidase, N-terminal domain; InterPro: IPR004124 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Sialidases (GH33 from CAZY) hydrolyse alpha-(2->3)-, alpha-(2->6)-, alpha-(2->8)-glycosidic linkages of terminal sialic residues in oligosaccharides, glycoproteins, glycolipids, colominic acid and synthetic substrates. Sialidases may act as pathogenic factors in microbial infections []. The 1.8 A structure of trans-sialidase from leech (Macrobdella decora, Q27701 from SWISSPROT) in complex with 2-deoxy-2, 3-didehydro-NeuAc was solved. The refined model comprising residues 81-769 has a catalytic beta-propeller domain, a N-terminal lectin-like domain and an irregular beta-stranded domain inserted into the catalytic domain [].; GO: 0004308 exo-alpha-sialidase activity, 0005975 carbohydrate metabolic process; PDB: 2JKB_A 2VW2_A 2VW0_A 2VW1_A 2V73_B 2V72_A 1SLI_A 1SLL_A 2SLI_A 4SLI_A ....
Probab=50.00 E-value=1.2e+02 Score=26.68 Aligned_cols=46 Identities=24% Similarity=0.467 Sum_probs=34.2
Q ss_pred ccEEEEEEeecCCceE--EEEEecC-CCCceEEEEEECCEEEEEEecCC
Q psy592 98 TTQLQLKFRTREEFGE--LFRLSDQ-HNREYAILEIKDSKLRFRYNLNN 143 (339)
Q Consensus 98 ~~~i~l~frT~~~~Gl--L~~~~~~-~~~~~i~L~L~~G~l~~~~~lG~ 143 (339)
.-.|.++||+...+++ ||.+++. ...+|+.|.+.++.+-+.++-..
T Consensus 34 ~gTI~i~Fk~~~~~~~~sLfsiSn~~~~n~YF~lyv~~~~~G~E~R~~~ 82 (190)
T PF02973_consen 34 EGTIVIRFKSDSNSGIQSLFSISNSTKGNEYFSLYVSNNKLGFELRDTK 82 (190)
T ss_dssp SEEEEEEEEESS-SSEEEEEEEE-TSTTSEEEEEEEETTEEEEEEEETT
T ss_pred ccEEEEEEecCCCcceeEEEEecCCCCccceEEEEEECCEEEEEEecCC
Confidence 4579999999666543 7777754 34689999999999888887654
No 70
>TIGR01477 RIFIN variant surface antigen, rifin family. This model represents the rifin branch of the rifin/stevor family (pfam02009) of predicted variant surface antigens as found in Plasmodium falciparum. This model is based on a set of rifin sequences kindly provided by Matt Berriman from the Sanger Center. This is a global model and assesses a penalty for incomplete sequence. Additional fragmentary sequences may be found with the fragment model and a cutoff of 20 bits.
Probab=49.60 E-value=15 Score=35.30 Aligned_cols=15 Identities=20% Similarity=0.716 Sum_probs=7.1
Q ss_pred HHHHHHHHHHHHHhh
Q psy592 208 QVLVLVFVVYSRRRE 222 (339)
Q Consensus 208 l~~~l~~~~~~~~~~ 222 (339)
|+||++++++|-||+
T Consensus 324 LIMvIIYLILRYRRK 338 (353)
T TIGR01477 324 LIMVIIYLILRYRRK 338 (353)
T ss_pred HHHHHHHHHHHhhhc
Confidence 444555554444443
No 71
>PF15176 LRR19-TM: Leucine-rich repeat family 19 TM domain
Probab=49.36 E-value=63 Score=25.33 Aligned_cols=10 Identities=10% Similarity=0.384 Sum_probs=6.3
Q ss_pred hHHHHHHHHH
Q psy592 196 SALFVTLLIS 205 (339)
Q Consensus 196 ~~il~~~~~l 205 (339)
++.|+++++.
T Consensus 17 W~~LVGVv~~ 26 (102)
T PF15176_consen 17 WPFLVGVVVT 26 (102)
T ss_pred cHhHHHHHHH
Confidence 5666666655
No 72
>PF08374 Protocadherin: Protocadherin; InterPro: IPR013585 The structure of protocadherins is similar to that of classic cadherins (IPR002126 from INTERPRO), but they also have some unique features associated with the cytoplasmic domains. They are expressed in a variety of organisms and are found in high concentrations in the brain where they seem to be localised mainly at cell-cell contact sites. Their expression seems to be developmentally regulated [].
Probab=48.95 E-value=11 Score=33.67 Aligned_cols=10 Identities=10% Similarity=0.218 Sum_probs=5.5
Q ss_pred hHHHHHHHHH
Q psy592 196 SALFVTLLIS 205 (339)
Q Consensus 196 ~~il~~~~~l 205 (339)
+||++.++..
T Consensus 41 iaiVAG~~tV 50 (221)
T PF08374_consen 41 IAIVAGIMTV 50 (221)
T ss_pred eeeecchhhh
Confidence 4555555554
No 73
>PF00558 Vpu: Vpu protein; InterPro: IPR008187 The Human immunodeficiency virus 1 (HIV-1) Vpu protein acts in the degradation of CD4 in the endoplasmic reticulum and in the enhancement of virion release from the plasma membrane of infected cells [].; GO: 0019076 release of virus from host; PDB: 2JPX_A 1PI8_A 2GOH_A 2GOF_A 1PI7_A 1PJE_A 1VPU_A 2K7Y_A.
Probab=44.52 E-value=33 Score=25.81 Aligned_cols=9 Identities=33% Similarity=0.582 Sum_probs=2.7
Q ss_pred HHHHHHHHH
Q psy592 210 LVLVFVVYS 218 (339)
Q Consensus 210 ~~l~~~~~~ 218 (339)
+++..++|.
T Consensus 20 IvvW~iv~i 28 (81)
T PF00558_consen 20 IVVWTIVYI 28 (81)
T ss_dssp HHHHHHH--
T ss_pred HHHHHHHHH
Confidence 333334443
No 74
>PF14991 MLANA: Protein melan-A; PDB: 2GTZ_F 2GT9_F 3MRO_P 2GUO_C 3MRQ_P 2GTW_C 3L6F_C 3MRP_P.
Probab=42.94 E-value=7 Score=31.24 Aligned_cols=12 Identities=42% Similarity=0.709 Sum_probs=0.0
Q ss_pred cccCCCCCccCC
Q psy592 240 NYDDEGGGEDDM 251 (339)
Q Consensus 240 ~Y~eEggGE~d~ 251 (339)
.+..|+++-+|.
T Consensus 72 rc~~e~~~~qds 83 (118)
T PF14991_consen 72 RCYSEGGGHQDS 83 (118)
T ss_dssp ------------
T ss_pred cccccccccccc
Confidence 355577777763
No 75
>PF12191 stn_TNFRSF12A: Tumour necrosis factor receptor stn_TNFRSF12A_TNFR domain; InterPro: IPR022316 The tumour necrosis factor (TNF) receptor (TNFR) superfamily comprises more than 20 type-I transmembrane proteins. Family members are defined based on similarity in their extracellular domain - a region that contains many cysteine residues arranged in a specific repetitive pattern []. The cysteines allow formation of an extended rod-like structure, responsible for ligand binding []. Upon receptor activation, different intracellular signalling complexes are assembled for different members of the TNFR superfamily, depending on their intracellular domains and sequences []. Activation of TNFRs can therefore induce a range of disparate effects, including cell proliferation, differentiation, survival, or apoptotic cell death, depending upon the receptor involved []. TNFRs are widely distributed and play important roles in many crucial biological processes, such as lymphoid and neuronal development, innate and adaptive immunity, and maintenance of cellular homeostasis []. Drugs that manipulate their signalling have potential roles in the prevention and treatment of many diseases, such as viral infections, coronary heart disease, transplant rejection, and immune disease []. TNF receptor 12 (also known as TWEAK receptor, and fibroblast growth factor-inducible-14 (Fn14)) has been implicated in endothelial cell growth and migration []. The receptor may also play a role in cell-matrix interactions [].; PDB: 2KN0_A 2RPJ_A 2KMZ_A 2EQP_A.
Probab=42.03 E-value=8.5 Score=31.37 Aligned_cols=14 Identities=36% Similarity=0.560 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHhhh
Q psy592 210 LVLVFVVYSRRREA 223 (339)
Q Consensus 210 ~~l~~~~~~~~~~~ 223 (339)
++.+|++|+|+|++
T Consensus 94 llsg~lv~rrcrrr 107 (129)
T PF12191_consen 94 LLSGFLVWRRCRRR 107 (129)
T ss_dssp --------------
T ss_pred HHHHHHHHhhhhcc
Confidence 33456777776654
No 76
>PF01299 Lamp: Lysosome-associated membrane glycoprotein (Lamp); InterPro: IPR002000 Lysosome-associated membrane glycoproteins (lamp) [] are integral membrane proteins, specific to lysosomes, and whose exact biological function is not yet clear. Structurally, the lamp proteins consist of two internally homologous lysosome-luminal domains separated by a proline-rich hinge region; at the C-terminal extremity there is a transmembrane region (TM) followed by a very short cytoplasmic tail (C). In each of the duplicated domains, there are two conserved disulphide bonds. This structure is schematically represented in the figure below. +-----+ +-----+ +-----+ +-----+ | | | | | | | | xCxxxxxCxxxxxxxxxxxxCxxxxxCxxxxxxxxxCxxxxxCxxxxxxxxxxxxCxxxxxCxxxxxxxx +--------------------------++Hinge++--------------------------++TM++C+ In mammals, there are two closely related types of lamp: lamp-1 and lamp-2, which form major components of the lysosome membrane. In chicken lamp-1 is known as LEP100. Also included in this entry is the macrophage protein CD68 (or macrosialin) [] is a heavily glycosylated integral membrane protein whose structure consists of a mucin-like domain followed by a proline-rich hinge; a single lamp-like domain; a transmembrane region and a short cytoplasmic tail. Similar to CD68, mammalian lamp-3, which is expressed in lymphoid organs, dendritic cells and in lung, contains all the C-terminal regions but lacks the N-terminal lamp-like region []. In a lamp-family protein from nematodes [] only the part C-terminal to the hinge is conserved. ; GO: 0016020 membrane
Probab=41.86 E-value=13 Score=35.07 Aligned_cols=14 Identities=29% Similarity=0.681 Sum_probs=5.9
Q ss_pred HHHHHH-HHHHHHHh
Q psy592 208 QVLVLV-FVVYSRRR 221 (339)
Q Consensus 208 l~~~l~-~~~~~~~~ 221 (339)
+++||+ +++.|||+
T Consensus 285 vlivLiaYli~Rrr~ 299 (306)
T PF01299_consen 285 VLIVLIAYLIGRRRS 299 (306)
T ss_pred HHHHHHhheeEeccc
Confidence 444443 34444443
No 77
>PF15330 SIT: SHP2-interacting transmembrane adaptor protein, SIT
Probab=41.72 E-value=56 Score=25.97 Aligned_cols=21 Identities=19% Similarity=0.313 Sum_probs=10.2
Q ss_pred HHHHHHHHHHHHHHHHHHhhh
Q psy592 203 LISPNQVLVLVFVVYSRRREA 223 (339)
Q Consensus 203 ~~l~~l~~~l~~~~~~~~~~~ 223 (339)
|+|+|.+-++.+...+|+++.
T Consensus 11 Lll~l~asl~~wr~~~rq~k~ 31 (107)
T PF15330_consen 11 LLLSLAASLLAWRMKQRQKKA 31 (107)
T ss_pred HHHHHHHHHHHHHHHhhhccc
Confidence 333445555555555555553
No 78
>PF14851 FAM176: FAM176 family
Probab=41.33 E-value=36 Score=28.87 Aligned_cols=11 Identities=9% Similarity=0.190 Sum_probs=5.6
Q ss_pred cCCCCCCCCCc
Q psy592 301 DADPNAPPFDD 311 (339)
Q Consensus 301 d~d~~~~p~Ds 311 (339)
+..|+.++--+
T Consensus 138 n~~~d~~~~~~ 148 (153)
T PF14851_consen 138 NGQPDILGTRT 148 (153)
T ss_pred hcCCCCCCCCC
Confidence 45565554433
No 79
>PF02439 Adeno_E3_CR2: Adenovirus E3 region protein CR2; InterPro: IPR003470 Early region 3 (E3) of human adenoviruses (Ads) codes for proteins that appear to control viral interactions with the host []. This region called CR1 (conserved region 1) [] is found three times in Human adenovirus 19 (a subgroup D adenovirus) 49 kDa protein in the E3 region. CR1 is also found in the 20.1 Kd protein of subgroup B adenoviruses. The function of this 80 amino acid region is unknown. This region is probably a divergent immunoglobulin domain.
Probab=40.81 E-value=45 Score=21.28 Aligned_cols=6 Identities=33% Similarity=0.667 Sum_probs=2.4
Q ss_pred HHHHHH
Q psy592 197 ALFVTL 202 (339)
Q Consensus 197 ~il~~~ 202 (339)
+|.+.+
T Consensus 7 aIIv~V 12 (38)
T PF02439_consen 7 AIIVAV 12 (38)
T ss_pred hHHHHH
Confidence 444443
No 80
>PF05393 Hum_adeno_E3A: Human adenovirus early E3A glycoprotein; InterPro: IPR008652 This family consists of several early glycoproteins (E3A), from human adenovirus type 2.; GO: 0016021 integral to membrane
Probab=40.31 E-value=73 Score=24.33 Aligned_cols=14 Identities=29% Similarity=0.738 Sum_probs=6.3
Q ss_pred HHHHHHHHHH-HHhh
Q psy592 209 VLVLVFVVYS-RRRE 222 (339)
Q Consensus 209 ~~~l~~~~~~-~~~~ 222 (339)
++|++++++. +||+
T Consensus 46 l~VilwfvCC~kRkr 60 (94)
T PF05393_consen 46 LLVILWFVCCKKRKR 60 (94)
T ss_pred HHHHHHHHHHHHhhh
Confidence 3344555444 4443
No 81
>PF05454 DAG1: Dystroglycan (Dystrophin-associated glycoprotein 1); InterPro: IPR008465 Dystroglycan is one of the dystrophin-associated glycoproteins, which is encoded by a 5.5 kb transcript in Homo sapiens. The protein product is cleaved into two non-covalently associated subunits, [alpha] (N-terminal) and [beta] (C-terminal). In skeletal muscle the dystroglycan complex works as a transmembrane linkage between the extracellular matrix and the cytoskeleton [alpha]-dystroglycan is extracellular and binds to merosin ([alpha]-2 laminin) in the basement membrane, while [beta]-dystroglycan is a transmembrane protein and binds to dystrophin, which is a large rod-like cytoskeletal protein, absent in Duchenne muscular dystrophy patients. Dystrophin binds to intracellular actin cables. In this way, the dystroglycan complex, which links the extracellular matrix to the intracellular actin cables, is thought to provide structural integrity in muscle tissues. The dystroglycan complex is also known to serve as an agrin receptor in muscle, where it may regulate agrin-induced acetylcholine receptor clustering at the neuromuscular junction. There is also evidence which suggests the function of dystroglycan as a part of the signal transduction pathway because it is shown that Grb2, a mediator of the Ras-related signal pathway, can interact with the cytoplasmic domain of dystroglycan. In general, aberrant expression of dystrophin-associated protein complex underlies the pathogenesis of Duchenne muscular dystrophy, Becker muscular dystrophy and severe childhood autosomal recessive muscular dystrophy. Interestingly, no genetic disease has been described for either [alpha]- or [beta]-dystroglycan. Dystroglycan is widely distributed in non-muscle tissues as well as in muscle tissues. During epithelial morphogenesis of kidney, the dystroglycan complex is shown to act as a receptor for the basement membrane. Dystroglycan expression in Mus musculus brain and neural retina has also been reported. However, the physiological role of dystroglycan in non-muscle tissues has remained unclear [].; PDB: 1EG4_P.
Probab=39.98 E-value=9.5 Score=35.84 Aligned_cols=16 Identities=13% Similarity=0.111 Sum_probs=0.0
Q ss_pred EEEECCEEEEEEecCC
Q psy592 128 LEIKDSKLRFRYNLNN 143 (339)
Q Consensus 128 L~L~~G~l~~~~~lG~ 143 (339)
..+.+|...+.|...+
T Consensus 43 ~sIt~gstiVtwtNnT 58 (290)
T PF05454_consen 43 RSITSGSTIVTWTNNT 58 (290)
T ss_dssp ----------------
T ss_pred EEecCCCEEEEEEcCC
Confidence 3556666666665443
No 82
>PF15347 PAG: Phosphoprotein associated with glycosphingolipid-enriched
Probab=39.52 E-value=35 Score=33.07 Aligned_cols=15 Identities=20% Similarity=0.020 Sum_probs=8.2
Q ss_pred cHHHHHHHHHHHhcC
Q psy592 288 NVGIFIEEHKKRADA 302 (339)
Q Consensus 288 ~~~~~i~~~~~~ad~ 302 (339)
.|.|.|.+-+++...
T Consensus 176 ~vEDcLYETVKEiKe 190 (428)
T PF15347_consen 176 MVEDCLYETVKEIKE 190 (428)
T ss_pred hHHHHHHHHHHHHhh
Confidence 345666665555543
No 83
>PHA03049 IMV membrane protein; Provisional
Probab=39.35 E-value=46 Score=23.95 Aligned_cols=23 Identities=13% Similarity=0.478 Sum_probs=12.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhh
Q psy592 197 ALFVTLLISPNQVLVLVFVVYSRRREA 223 (339)
Q Consensus 197 ~il~~~~~l~~l~~~l~~~~~~~~~~~ 223 (339)
.+++|++++ .++++.+|+|.+..
T Consensus 7 l~iICVaIi----~lIvYgiYnkk~~~ 29 (68)
T PHA03049 7 LVIICVVII----GLIVYGIYNKKTTT 29 (68)
T ss_pred HHHHHHHHH----HHHHHHHHhccccc
Confidence 344454444 34456778776653
No 84
>PF10731 Anophelin: Thrombin inhibitor from mosquito; InterPro: IPR018932 Members of this family are all inhibitors of thrombin, the peptidase that is at the end of the blood coagulation cascade and which creates the clot by cleaving fibrinogen. The interaction between thrombin and fibrinogen involves two different areas of contact - via the thrombin active site and via a second substrate-binding site known as an exosite. The inhibitor acts by blocking the exosite, rather than by interacting with the active site. The inhibitors are from mosquitoes that feed on human blood and which, by inhibiting thrombin, prevent the blood from clotting and keep it flowing.
Probab=38.63 E-value=57 Score=23.00 Aligned_cols=9 Identities=22% Similarity=0.807 Sum_probs=4.2
Q ss_pred cccCCCCCc
Q psy592 240 NYDDEGGGE 248 (339)
Q Consensus 240 ~Y~eEggGE 248 (339)
.|||+-+.+
T Consensus 32 ~YDEdd~dd 40 (65)
T PF10731_consen 32 SYDEDDDDD 40 (65)
T ss_pred CcCcccCcc
Confidence 455544333
No 85
>KOG3607|consensus
Probab=37.02 E-value=25 Score=37.34 Aligned_cols=28 Identities=36% Similarity=0.879 Sum_probs=21.0
Q ss_pred CCCCCCeeecCCCCceeeCCCCCCCCCCCCC
Q psy592 44 CYPEHGVCVGSMHTAKCQCLPGWSGTGCVTP 74 (339)
Q Consensus 44 pC~~~g~Cv~~~~~~~C~C~~g~~G~~Ce~~ 74 (339)
.|..+|.|-. .+.|+|.+||.++.|++.
T Consensus 631 ~C~g~GVCnn---~~~ChC~~gwapp~C~~~ 658 (716)
T KOG3607|consen 631 TCNGHGVCNN---ELNCHCEPGWAPPFCFIF 658 (716)
T ss_pred ccCCCcccCC---CcceeeCCCCCCCccccc
Confidence 3555566654 457999999999999765
No 86
>PF04272 Phospholamban: Phospholamban; InterPro: IPR005984 Phospholamban (PLB) is a small protein (52 amino acids) that regulates the affinity of the cardiac sarcoplasmic reticulum Ca2+-ATPase (SERCA2a) for calcium. PLB is present in cardiac myocytes, in slow-twitch and smooth muscle and is expressed also in aorta endothelial cells in which it could play a role in tissue relaxation. The phosphorylation/dephosphorylation of phospholamban removes and restores, respectively, its inhibitory activity on SERCA2a. It has in fact been shown that phospholamban, in its non-phosphorylated form, binds to SERCA2a and inhibits this pump by lowering its affinity for Ca2+, whereas the phosphorylated form does not exert the inhibition. PLB is phosphorylated at two sites, namely at Ser-16 for a cAMP-dependent phosphokinase and at Thr-17 for a Ca2+/calmodulin-dependent phosphokinase, phosphorylation at Ser-16 being a prerequisite for the phosphorylation at Thr-17. The structure of a 36-amino-acid-long N-terminal fragment of human phospholamban phosphorylated at Ser-16 and Thr-17 and Cys36Ser mutated was determined from nuclear magnetic resonance data. The peptide assumes a conformation characterised by two alpha-helices connected by an irregular strand, which comprises the amino acids from Arg-13 to Pro-21. The proline is in a trans conformation. The two phosphate groups on Ser-16 and Thr-17 are shown to interact preferably with the side chains of Arg-14 and Arg-13, respectively [].; GO: 0005246 calcium channel regulator activity, 0042030 ATPase inhibitor activity, 0006816 calcium ion transport, 0016020 membrane; PDB: 1N7L_A 1FJP_A 1FJK_A 2HYN_C 1ZLL_D 1PLP_A 3O7L_I.
Probab=36.88 E-value=31 Score=22.77 Aligned_cols=9 Identities=22% Similarity=0.563 Sum_probs=4.6
Q ss_pred HHHHHHHHH
Q psy592 197 ALFVTLLIS 205 (339)
Q Consensus 197 ~il~~~~~l 205 (339)
.||+|++++
T Consensus 37 lilicllli 45 (52)
T PF04272_consen 37 LILICLLLI 45 (52)
T ss_dssp HHHHHHHHH
T ss_pred HHHHHHHHH
Confidence 455555444
No 87
>PF09064 Tme5_EGF_like: Thrombomodulin like fifth domain, EGF-like; InterPro: IPR015149 This domain adopts a fold similar to other EGF domains, with a flat major and a twisted minor beta sheet. Disulphide pairing, however, is not of the usual 1-3, 2-4, 5-6 type; rather 1-2, 3-4, 5-6 pairing is found. Its extended major sheet (strands beta-2 and beta-3 and the connecting loop) projects into thrombin's active site groove. This domain is required for interaction of thrombomodulin with thrombin, and subsequent activation of protein-C []. ; GO: 0004888 transmembrane signaling receptor activity, 0016021 integral to membrane
Probab=36.27 E-value=31 Score=21.43 Aligned_cols=17 Identities=24% Similarity=0.632 Sum_probs=10.9
Q ss_pred eeecCCCCceeeCCCCCC
Q psy592 50 VCVGSMHTAKCQCLPGWS 67 (339)
Q Consensus 50 ~Cv~~~~~~~C~C~~g~~ 67 (339)
.|.+.. ...|.|+.||.
T Consensus 11 ~CDpn~-~~~C~CPeGyI 27 (34)
T PF09064_consen 11 DCDPNS-PGQCFCPEGYI 27 (34)
T ss_pred ccCCCC-CCceeCCCceE
Confidence 344432 34899999983
No 88
>PRK00523 hypothetical protein; Provisional
Probab=35.41 E-value=76 Score=23.33 Aligned_cols=23 Identities=17% Similarity=0.075 Sum_probs=10.0
Q ss_pred chHHHHHHHHHHHHHHHHHHHHHH
Q psy592 195 GSALFVTLLISPNQVLVLVFVVYS 218 (339)
Q Consensus 195 g~~il~~~~~l~~l~~~l~~~~~~ 218 (339)
|++|++.++.| ++.+++.|++-+
T Consensus 5 ~l~I~l~i~~l-i~G~~~Gffiar 27 (72)
T PRK00523 5 GLALGLGIPLL-IVGGIIGYFVSK 27 (72)
T ss_pred HHHHHHHHHHH-HHHHHHHHHHHH
Confidence 34555444443 133444454433
No 89
>PF06024 DUF912: Nucleopolyhedrovirus protein of unknown function (DUF912); InterPro: IPR009261 This entry is represented by Autographa californica nuclear polyhedrosis virus (AcMNPV), Orf78; it is a family of uncharacterised viral proteins.
Probab=35.37 E-value=6.9 Score=30.73 Aligned_cols=8 Identities=38% Similarity=0.829 Sum_probs=3.1
Q ss_pred HHHHHHHh
Q psy592 214 FVVYSRRR 221 (339)
Q Consensus 214 ~~~~~~~~ 221 (339)
|++.|.|+
T Consensus 84 FVILRer~ 91 (101)
T PF06024_consen 84 FVILRERQ 91 (101)
T ss_pred EEEEeccc
Confidence 34444333
No 90
>TIGR00847 ccoS cytochrome oxidase maturation protein, cbb3-type. CcoS from Rhodobacter capsulatus has been shown essential for incorporation of redox-active prosthetic groups (heme, Cu) into cytochrome cbb(3) oxidase. FixS of Bradyrhizobium japonicum appears to have the same function. Members of this family are found so far in organisms with a cbb3-type cytochrome oxidase, including Neisseria meningitidis, Helicobacter pylori, Campylobacter jejuni, Caulobacter crescentus, Bradyrhizobium japonicum, and Rhodobacter capsulatus.
Probab=35.12 E-value=1.1e+02 Score=20.85 Aligned_cols=11 Identities=27% Similarity=0.313 Sum_probs=5.2
Q ss_pred HHHHHHHHHHh
Q psy592 211 VLVFVVYSRRR 221 (339)
Q Consensus 211 ~l~~~~~~~~~ 221 (339)
.+.+++|.-|+
T Consensus 18 ~l~~f~Wavk~ 28 (51)
T TIGR00847 18 GLVAFLWSLKS 28 (51)
T ss_pred HHHHHHHHHcc
Confidence 33445555444
No 91
>PHA02872 EFc gene family protein; Provisional
Probab=34.76 E-value=23 Score=28.10 Aligned_cols=30 Identities=27% Similarity=0.283 Sum_probs=26.0
Q ss_pred CCCcceeeeecCCCCCcccccccccCCCCC
Q psy592 308 PFDDLRNYAYEGGGSTAGSLSSLASGSSIK 337 (339)
Q Consensus 308 p~Dsl~~y~~EG~gS~agSlssl~s~~~~~ 337 (339)
.-|++.+|-+||.|-..=|.|-.+|.+.++
T Consensus 58 ~gdtvkIYflEGkG~LIfSv~dv~sp~~ee 87 (124)
T PHA02872 58 AGDTVKIYFLEGKGGLIFSVSDVGSPDNEE 87 (124)
T ss_pred CCCeEEEEEEecCCcEEEEEEecCCCCccc
Confidence 347899999999999999999999987754
No 92
>PF02699 YajC: Preprotein translocase subunit; InterPro: IPR003849 Secretion across the inner membrane in some Gram-negative bacteria occurs via the preprotein translocase pathway. Proteins are produced in the cytoplasm as precursors, and require a chaperone subunit to direct them to the translocase component []. From there, the mature proteins are either targeted to the outer membrane, or remain as periplasmic proteins []. The translocase protein subunits are encoded on the bacterial chromosome. The translocase itself comprises 7 proteins, including a chaperone (SecB), ATPase (SecA), an integral membrane complex (SecY, SecE and SecG), and two additional membrane proteins that promote the release of the mature peptide into the periplasm (SecD and SecF) []. Other cytoplasmic/periplasmic proteins play a part in preprotein translocase activity, namely YidC and YajC []. The latter is bound in a complex to SecD and SecF, and plays a part in stabilising and regulating secretion through the SecYEG integral membrane component via SecA []. Homologues of the YajC gene have been found in a range of pathogenic and commensal microbes. Brucella abortis YajC- and SecD-like proteins were shown to stimulate a Th1 cell-mediated immune response in mice, and conferred protection when challenged with B.abortis []. Therefore, these proteins may have an antigenic role as well as a secretory one in virulent bacteria []. A number of previously uncharacterised "hypothetical" proteins also show similarity to E.coli YajC, suggesting that this family is wider than first thought []. More recently, the precise interactions between the E.coli SecYEG complex, SecD, SecF, YajC and YidC have been studied []. Rather than acting individually, the four proteins form a heterotetrameric complex and associate with the SecYEG heterotrimeric complex []. The SecF and YajC subunits link the complex to the integral membrane translocase. ; PDB: 2RDD_B.
Probab=34.69 E-value=33 Score=25.74 Aligned_cols=10 Identities=10% Similarity=0.471 Sum_probs=4.1
Q ss_pred HHHHHHHHHH
Q psy592 211 VLVFVVYSRR 220 (339)
Q Consensus 211 ~l~~~~~~~~ 220 (339)
++.|+.++.+
T Consensus 14 i~yf~~~rpq 23 (82)
T PF02699_consen 14 IFYFLMIRPQ 23 (82)
T ss_dssp HHHHHTHHHH
T ss_pred HHhhheecHH
Confidence 3344444433
No 93
>PHA03164 hypothetical protein; Provisional
Probab=34.62 E-value=38 Score=25.05 Aligned_cols=23 Identities=17% Similarity=0.313 Sum_probs=10.8
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHH
Q psy592 196 SALFVTLLISPNQVLVLVFVVYS 218 (339)
Q Consensus 196 ~~il~~~~~l~~l~~~l~~~~~~ 218 (339)
..||.|+++...|.+++++.++.
T Consensus 60 FlvLtgLaIamILfiifvlyvFn 82 (88)
T PHA03164 60 FLVLTGLAIAMILFIIFVLYVFN 82 (88)
T ss_pred hHHHHHHHHHHHHHHHHHHHhee
Confidence 56666665542233344444443
No 94
>PF06365 CD34_antigen: CD34/Podocalyxin family; InterPro: IPR013836 This family consists of several mammalian CD34 antigen proteins. The CD34 antigen is a human leukocyte membrane protein expressed specifically by lymphohematopoietic progenitor cells. CD34 is a phosphoprotein. Activation of protein kinase C (PKC) has been found to enhance CD34 phosphorylation [, ]. This family contains several eukaryotic podocalyxin proteins. Podocalyxin is a major membrane protein of the glomerular epithelium and is thought to be involved in maintenance of the architecture of the foot processes and filtration slits characteristic of this unique epithelium by virtue of its high negative charge. Podocalyxin functions as an anti-adhesin that maintains an open filtration pathway between neighbouring foot processes in the glomerular epithelium by charge repulsion [].
Probab=34.22 E-value=39 Score=30.02 Aligned_cols=20 Identities=15% Similarity=0.235 Sum_probs=10.7
Q ss_pred ccccCCCCCccCCCcccccccC
Q psy592 239 INYDDEGGGEDDMTAFDITPLQ 260 (339)
Q Consensus 239 ~~Y~eEggGE~d~~~yd~~~L~ 260 (339)
.+|.|+ |-+|...++....+
T Consensus 141 ~~~vEn--g~h~n~~l~v~~~~ 160 (202)
T PF06365_consen 141 LYTVEN--GYHDNPTLSVAESQ 160 (202)
T ss_pred ceeccc--CccCCcccccCCCC
Confidence 445443 34555678877333
No 95
>PF02468 PsbN: Photosystem II reaction centre N protein (psbN); InterPro: IPR003398 Oxygenic photosynthesis uses two multi-subunit photosystems (I and II) located in the cell membranes of cyanobacteria and in the thylakoid membranes of chloroplasts in plants and algae. Photosystem II (PSII) has a P680 reaction centre containing chlorophyll 'a' that uses light energy to carry out the oxidation (splitting) of water molecules, and to produce ATP via a proton pump. Photosystem I (PSI) has a P700 reaction centre containing chlorophyll that takes the electron and associated hydrogen donated from PSII to reduce NADP+ to NADPH. Both ATP and NADPH are subsequently used in the light-independent reactions to convert carbon dioxide to glucose using the hydrogen atom extracted from water by PSII, releasing oxygen as a by-product. PSII is a multisubunit protein-pigment complex containing polypeptides both intrinsic and extrinsic to the photosynthetic membrane [, ]. Within the core of the complex, the chlorophyll and beta-carotene pigments are mainly bound to the antenna proteins CP43 (PsbC) and CP47 (PsbB), which pass the excitation energy on to the reaction centre proteins D1 (Qb, PsbA) and D2 (Qa, PsbD) that bind all the redox-active cofactors involved in the energy conversion process. The PSII oxygen-evolving complex (OEC) oxidises water to provide protons for use by PSI, and consists of OEE1 (PsbO), OEE2 (PsbP) and OEE3 (PsbQ). The remaining subunits in PSII are of low molecular weight (less than 10 kDa), and are involved in PSII assembly, stabilisation, dimerisation, and photo-protection []. This family represents the low molecular weight transmembrane protein PsbN found in PSII. PsbN may have a role in PSII stability, however its actual function unknown. PsbN does not appear to be essential for photoautotrophic growth or normal PSII function.; GO: 0015979 photosynthesis, 0009523 photosystem II, 0009539 photosystem II reaction center, 0016020 membrane
Probab=33.77 E-value=42 Score=22.09 Aligned_cols=15 Identities=13% Similarity=0.235 Sum_probs=8.7
Q ss_pred HHHHHHHHHHHHHhh
Q psy592 208 QVLVLVFVVYSRRRE 222 (339)
Q Consensus 208 l~~~l~~~~~~~~~~ 222 (339)
|+-+..+.+|.....
T Consensus 15 lv~~Tgy~iYtaFGp 29 (43)
T PF02468_consen 15 LVSITGYAIYTAFGP 29 (43)
T ss_pred HHHHHhhhhhheeCC
Confidence 444555667766553
No 96
>PF03597 CcoS: Cytochrome oxidase maturation protein cbb3-type; InterPro: IPR004714 Cytochrome cbb3 oxidases are found almost exclusively in Proteobacteria, and represent a distinctive class of proton-pumping respiratory haem-copper oxidases (HCO) that lack many of the key structural features that contribute to the reaction cycle of the intensely studied mitochondrial cytochrome c oxidase (CcO). Expression of cytochrome cbb3 oxidase allows human pathogens to colonise anoxic tissues and agronomically important diazotrophs to sustain nitrogen fixation []. Genes encoding a cytochrome cbb3 oxidase were initially designated fixNOQP (ccoNOQP), the ccoNOQP operon is always found close to a second gene cluster, known as fixGHIS (ccoGHIS) whose expression is necessary for the assembly of a functional cbb3 oxidase. On the basis of their derived amino acid sequences each of the four proteins encoded by the ccoGHIS operon are thought to be membrane-bound. It has been suggested that they may function in concert as a multi-subunit complex, possibly playing a role in the uptake and metabolism of copper required for the assembly of the binuclear centre of cytochrome cbb3 oxidase.
Probab=33.60 E-value=1.3e+02 Score=19.91 Aligned_cols=12 Identities=8% Similarity=0.343 Sum_probs=5.8
Q ss_pred HHHHHHHHHHHh
Q psy592 210 LVLVFVVYSRRR 221 (339)
Q Consensus 210 ~~l~~~~~~~~~ 221 (339)
+.+.+++|.-|+
T Consensus 16 ~~l~~f~Wavk~ 27 (45)
T PF03597_consen 16 IALAAFLWAVKS 27 (45)
T ss_pred HHHHHHHHHHcc
Confidence 344445555554
No 97
>PF14316 DUF4381: Domain of unknown function (DUF4381)
Probab=33.49 E-value=56 Score=27.17 Aligned_cols=9 Identities=11% Similarity=0.139 Sum_probs=3.9
Q ss_pred HHHHHHHHH
Q psy592 289 VGIFIEEHK 297 (339)
Q Consensus 289 ~~~~i~~~~ 297 (339)
.-+||.+..
T Consensus 98 W~~fL~~~~ 106 (146)
T PF14316_consen 98 WLQFLDQQC 106 (146)
T ss_pred HHHHHHHhC
Confidence 444444443
No 98
>TIGR01294 P_lamban phospholamban. This model represents the short (52 residue) transmembrane phosphoprotein phospholamban. Phospholamban, in its unphosphorylated form, inhibits SERCA2, the cardiac sarcoplasmic reticulum Ca-ATPase.
Probab=33.00 E-value=39 Score=22.31 Aligned_cols=9 Identities=22% Similarity=0.563 Sum_probs=4.7
Q ss_pred HHHHHHHHH
Q psy592 197 ALFVTLLIS 205 (339)
Q Consensus 197 ~il~~~~~l 205 (339)
.||+|++++
T Consensus 37 lilicllli 45 (52)
T TIGR01294 37 LILICLLLI 45 (52)
T ss_pred HHHHHHHHH
Confidence 455565444
No 99
>PF01414 DSL: Delta serrate ligand; InterPro: IPR001774 Ligands of the Delta/Serrate/lag-2 (DSL) family and their receptors, members of the lin-12/Notch family, mediate cell-cell interactions that specify cell fate in invertebrates and vertebrates. In Caenorhabditis elegans, two DSL genes, lag-2 and apx-1, influence different cell fate decisions during development []. Molecular interaction between Notch and Serrate, another EGF-homologous transmembrane protein containing a region of striking similarity to Delta, has been shown and the same two EGF repeats of Notch may also constitute a Serrate binding domain [, ].; GO: 0007154 cell communication, 0016020 membrane; PDB: 2VJ2_A.
Probab=32.73 E-value=14 Score=26.30 Aligned_cols=12 Identities=58% Similarity=1.655 Sum_probs=9.2
Q ss_pred eeCCCCCCCCCC
Q psy592 60 CQCLPGWSGTGC 71 (339)
Q Consensus 60 C~C~~g~~G~~C 71 (339)
=.|.+||+|+.|
T Consensus 52 ~~C~~Gw~G~~C 63 (63)
T PF01414_consen 52 KVCLPGWTGPNC 63 (63)
T ss_dssp EEE-TTEESTTS
T ss_pred CCCCCCCcCCCC
Confidence 368899999988
No 100
>TIGR00739 yajC preprotein translocase, YajC subunit. While this protein is part of the preprotein translocase in Escherichia coli, it is not essential for viability or protein secretion. The N-terminus region contains a predicted membrane-spanning region followed by a region consisting almost entirely of residues with charged (acidic, basic, or zwitterionic) side chains. This small protein is about 100 residues in length, and is restricted to bacteria; however, this protein is absent from some lineages, including spirochetes and Mycoplasmas.
Probab=32.30 E-value=25 Score=26.62 Aligned_cols=8 Identities=13% Similarity=0.625 Sum_probs=3.4
Q ss_pred HHHHHHHH
Q psy592 211 VLVFVVYS 218 (339)
Q Consensus 211 ~l~~~~~~ 218 (339)
++.|+.++
T Consensus 15 i~yf~~~r 22 (84)
T TIGR00739 15 IFYFLIIR 22 (84)
T ss_pred HHHHheec
Confidence 33444443
No 101
>PF10717 ODV-E18: Occlusion-derived virus envelope protein ODV-E18; InterPro: IPR019655 Baculovirus occlusion-derived virus (ODV) derives its envelope from an intranuclear membrane source. Occlusion-derived viral envelope proteins that are detected in viral-induced intranuclear microvesicles, but not detected in the plasma membrane, cytoplasmic membranes, or the nuclear envelope. This entry represents ODV-E18 protein which is encoded by baculovirus late genes with transcription initiating from a TAAG motif. ODV-E18 exists as a dimer in the ODV envelope, which contains a hydrophobic domain that putatively acts as a target or retention signal for intranuclear microvesicles []. ; GO: 0019031 viral envelope
Probab=31.39 E-value=52 Score=24.83 Aligned_cols=10 Identities=20% Similarity=0.504 Sum_probs=4.7
Q ss_pred hHHHHHHHHH
Q psy592 196 SALFVTLLIS 205 (339)
Q Consensus 196 ~~il~~~~~l 205 (339)
..||+.++|+
T Consensus 28 MtILivLVII 37 (85)
T PF10717_consen 28 MTILIVLVII 37 (85)
T ss_pred HHHHHHHHHH
Confidence 3455544444
No 102
>PF10577 UPF0560: Uncharacterised protein family UPF0560; InterPro: IPR018890 This family of proteins has no known function.
Probab=31.37 E-value=49 Score=35.30 Aligned_cols=10 Identities=10% Similarity=-0.024 Sum_probs=6.4
Q ss_pred hHHHHHHHHH
Q psy592 196 SALFVTLLIS 205 (339)
Q Consensus 196 ~~il~~~~~l 205 (339)
++||..+++|
T Consensus 276 l~ILG~~~li 285 (807)
T PF10577_consen 276 LAILGGTALI 285 (807)
T ss_pred HHHHHHHHHH
Confidence 5677766555
No 103
>TIGR02595 PEP_exosort PEP-CTERM putative exosortase interaction domain. This model describes a 25-residue domain that includes a near-invariant Pro-Glu-Pro (PEP) motif, a thirteen residue strongly hydrophobic sequence likely to span the membrane, and a five-residue strongly basic motif that often contains four Arg residues. In nearly every case, this motif is found within nine residues, and usually within five residues, of the extreme C-terminus of the protein. Proteins with this motif typically have signal sequences at the N-terminus. This region appears many times per genome or not at all, and co-occurs in genomes with a proposed protein-sorting integral membrane protein we designate exosortase (see TIGR02602). PEP-CTERM proteins frequently are poorly conserved, Ser/Thr-rich proteins and may become extensively modified proteinaceous constituents of extracellular material in bacterial biofilms.
Probab=30.43 E-value=56 Score=18.85 Aligned_cols=9 Identities=44% Similarity=0.785 Sum_probs=4.1
Q ss_pred HHHHHHHHh
Q psy592 213 VFVVYSRRR 221 (339)
Q Consensus 213 ~~~~~~~~~ 221 (339)
.++..+|||
T Consensus 15 ~~~~~rrrk 23 (26)
T TIGR02595 15 GFLLLRRRR 23 (26)
T ss_pred HHHHHhhcc
Confidence 444444444
No 104
>PF11884 DUF3404: Domain of unknown function (DUF3404); InterPro: IPR021821 This domain is functionally uncharacterised. This domain is found in bacteria. This presumed domain is about 260 amino acids in length. This domain is found associated with PF02518 from PFAM, PF00512 from PFAM.
Probab=28.99 E-value=83 Score=29.13 Aligned_cols=16 Identities=25% Similarity=0.434 Sum_probs=8.5
Q ss_pred HHHHHHHHHHHHHhhh
Q psy592 208 QVLVLVFVVYSRRREA 223 (339)
Q Consensus 208 l~~~l~~~~~~~~~~~ 223 (339)
+++++.-.+|+++++.
T Consensus 243 i~l~~gw~~y~~~~kr 258 (262)
T PF11884_consen 243 ILLVLGWSLYRWNQKR 258 (262)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 4445555666665543
No 105
>PRK05585 yajC preprotein translocase subunit YajC; Validated
Probab=28.77 E-value=38 Score=26.86 Aligned_cols=7 Identities=14% Similarity=0.567 Sum_probs=2.9
Q ss_pred HHHHHHH
Q psy592 212 LVFVVYS 218 (339)
Q Consensus 212 l~~~~~~ 218 (339)
+.|+.+|
T Consensus 31 ~yf~~~R 37 (106)
T PRK05585 31 FYFLIIR 37 (106)
T ss_pred HHHHhcc
Confidence 3344444
No 106
>PF12606 RELT: Tumour necrosis factor receptor superfamily member 19; InterPro: IPR022248 The members of tumor necrosis factor receptor (TNFR) superfamily have been designated as the "guardians of the immune system" due to their roles in immune cell proliferation, differentiation, activation, and death (apoptosis). RELT (receptor expressed in lymphoid tissues) is a member of the TNFR superfamily. The messenger RNA of RELT is especially abundant in hematologic tissues such as spleen, lymph node, and peripheral blood leukocytes as well as in leukemias and lymphomas. RELT is able to activate the NF-kappaB pathway and selectively binds tumor necrosis factor receptor-associated factor 1 []. RELT like proteins 1 and 2 (RELL1 and RELL2) are two RELT homologues that bind to RELT. The expression of RELL1 at the mRNA level is ubiquitous, whereas expression of RELL2 mRNA is more restricted to particular tissues [].
Probab=28.13 E-value=1.2e+02 Score=20.67 Aligned_cols=7 Identities=29% Similarity=0.653 Sum_probs=2.7
Q ss_pred HHHHHHH
Q psy592 199 FVTLLIS 205 (339)
Q Consensus 199 l~~~~~l 205 (339)
++.++++
T Consensus 6 iV~i~iv 12 (50)
T PF12606_consen 6 IVSIFIV 12 (50)
T ss_pred HHHHHHH
Confidence 3344333
No 107
>PHA03286 envelope glycoprotein E; Provisional
Probab=27.85 E-value=91 Score=31.12 Aligned_cols=9 Identities=44% Similarity=0.678 Sum_probs=4.2
Q ss_pred HHHHHHHhh
Q psy592 214 FVVYSRRRE 222 (339)
Q Consensus 214 ~~~~~~~~~ 222 (339)
..+++|||+
T Consensus 412 ~~~~~r~~~ 420 (492)
T PHA03286 412 AGLYRRRRR 420 (492)
T ss_pred HhHhhhhhh
Confidence 344555444
No 108
>PRK03427 cell division protein ZipA; Provisional
Probab=27.57 E-value=64 Score=30.92 Aligned_cols=8 Identities=75% Similarity=0.983 Sum_probs=5.7
Q ss_pred cCCCCCcc
Q psy592 242 DDEGGGED 249 (339)
Q Consensus 242 ~eEggGE~ 249 (339)
|+||-||-
T Consensus 59 d~dGvGev 66 (333)
T PRK03427 59 DDEGVGEV 66 (333)
T ss_pred cccCccce
Confidence 57777775
No 109
>PF05961 Chordopox_A13L: Chordopoxvirus A13L protein; InterPro: IPR009236 This family consists of A13L proteins from the Chordopoxviruses. A13L or p8 is one of the three most abundant membrane proteins of the intracellular mature Vaccinia virus [].
Probab=26.09 E-value=87 Score=22.64 Aligned_cols=14 Identities=14% Similarity=0.624 Sum_probs=8.7
Q ss_pred HHHHHHHHHHHhhh
Q psy592 210 LVLVFVVYSRRREA 223 (339)
Q Consensus 210 ~~l~~~~~~~~~~~ 223 (339)
.++++.+|+|.+..
T Consensus 16 ~lIlY~iYnr~~~~ 29 (68)
T PF05961_consen 16 GLILYGIYNRKKTT 29 (68)
T ss_pred HHHHHHHHhccccc
Confidence 34557778776643
No 110
>PF06667 PspB: Phage shock protein B; InterPro: IPR009554 This family consists of several bacterial phage shock protein B (PspB) sequences. The phage shock protein (psp) operon is induced in response to heat, ethanol, osmotic shock and infection by filamentous bacteriophages []. Expression of the operon requires the alternative sigma factor sigma54 and the transcriptional activator PspF. In addition, PspA plays a negative regulatory role, and the integral-membrane proteins PspB and PspC play a positive one [].; GO: 0006355 regulation of transcription, DNA-dependent, 0009271 phage shock
Probab=26.04 E-value=1.1e+02 Score=22.67 Aligned_cols=8 Identities=13% Similarity=0.389 Sum_probs=3.4
Q ss_pred HHHHHHHh
Q psy592 214 FVVYSRRR 221 (339)
Q Consensus 214 ~~~~~~~~ 221 (339)
++.|+.++
T Consensus 23 ~lHY~sk~ 30 (75)
T PF06667_consen 23 ILHYRSKW 30 (75)
T ss_pred HHHHHHhc
Confidence 34444443
No 111
>KOG3054|consensus
Probab=26.02 E-value=75 Score=29.07 Aligned_cols=15 Identities=33% Similarity=0.685 Sum_probs=7.4
Q ss_pred HHHHHHHHHHHHHhh
Q psy592 208 QVLVLVFVVYSRRRE 222 (339)
Q Consensus 208 l~~~l~~~~~~~~~~ 222 (339)
|+.++++++|.+++.
T Consensus 14 lV~~i~l~l~~r~ra 28 (299)
T KOG3054|consen 14 LVAVILLFLWKRRRA 28 (299)
T ss_pred HHHHHHHHHHHhhcc
Confidence 444445555555543
No 112
>KOG0994|consensus
Probab=25.93 E-value=73 Score=35.56 Aligned_cols=26 Identities=27% Similarity=0.669 Sum_probs=20.3
Q ss_pred eeecCC--CCceeeCCCCCCCCCCCCCC
Q psy592 50 VCVGSM--HTAKCQCLPGWSGTGCVTPT 75 (339)
Q Consensus 50 ~Cv~~~--~~~~C~C~~g~~G~~Ce~~~ 75 (339)
.|..+. ....|+|.+||+|.+|+.-.
T Consensus 924 sC~~d~~t~~ivC~C~~GY~G~RCe~CA 951 (1758)
T KOG0994|consen 924 SCYLDTRTQQIVCHCQEGYSGSRCEICA 951 (1758)
T ss_pred cccccccccceeeecccCccccchhhhc
Confidence 366553 35689999999999999874
No 113
>COG4367 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=25.66 E-value=57 Score=24.95 Aligned_cols=24 Identities=21% Similarity=0.379 Sum_probs=20.8
Q ss_pred CcHHHHHHHHHHHhcCCCCCCCCCcc
Q psy592 287 PNVGIFIEEHKKRADADPNAPPFDDL 312 (339)
Q Consensus 287 ~~~~~~i~~~~~~ad~d~~~~p~Dsl 312 (339)
-++++||.+++++..+-| .||-.|
T Consensus 54 W~lRd~l~~~i~e~GqtP--~PfT~L 77 (97)
T COG4367 54 WRLRDFLVQAIRESGQTP--VPFTAL 77 (97)
T ss_pred HHHHHHHHHHHHhcCCCC--CCeeee
Confidence 488999999999999999 899554
No 114
>COG3763 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=25.37 E-value=1.7e+02 Score=21.39 Aligned_cols=21 Identities=19% Similarity=0.162 Sum_probs=8.6
Q ss_pred hHHHHHHHHHHHHHHHHHHHHH
Q psy592 196 SALFVTLLISPNQVLVLVFVVY 217 (339)
Q Consensus 196 ~~il~~~~~l~~l~~~l~~~~~ 217 (339)
+|||+.++.| |..+++.|++-
T Consensus 5 lail~ivl~l-l~G~~~G~fia 25 (71)
T COG3763 5 LAILLIVLAL-LAGLIGGFFIA 25 (71)
T ss_pred HHHHHHHHHH-HHHHHHHHHHH
Confidence 4554444333 12344444443
No 115
>PF04689 S1FA: DNA binding protein S1FA; InterPro: IPR006779 S1FA is an unusual small plant peptide of only 70 amino acids with a basic domain which contains a nuclear localization signal and a putative DNA binding helix. S1FA is highly conserved between dicotyledonous and monocotyledonous plants and may be a DNA-binding protein that specifically recognises the negative promoter element S1F [].; GO: 0003677 DNA binding, 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus
Probab=25.15 E-value=63 Score=23.11 Aligned_cols=31 Identities=16% Similarity=0.190 Sum_probs=14.9
Q ss_pred cccchHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy592 192 VKTGSALFVTLLISPNQVLVLVFVVYSRRRE 222 (339)
Q Consensus 192 ~~~g~~il~~~~~l~~l~~~l~~~~~~~~~~ 222 (339)
+..|+++|+-+.-++|++++-.+++|.-+++
T Consensus 10 lnPGlIVLlvV~g~ll~flvGnyvlY~Yaqk 40 (69)
T PF04689_consen 10 LNPGLIVLLVVAGLLLVFLVGNYVLYVYAQK 40 (69)
T ss_pred CCCCeEEeehHHHHHHHHHHHHHHHHHHHhh
Confidence 3445444444433333555555666555443
No 116
>PF10078 DUF2316: Uncharacterized protein conserved in bacteria (DUF2316); InterPro: IPR018757 Members of this family of hypothetical bacterial proteins have no known function.
Probab=24.97 E-value=70 Score=24.56 Aligned_cols=26 Identities=19% Similarity=0.319 Sum_probs=22.1
Q ss_pred CCCcHHHHHHHHHHHhcCCCCCCCCCcc
Q psy592 285 VEPNVGIFIEEHKKRADADPNAPPFDDL 312 (339)
Q Consensus 285 ~~~~~~~~i~~~~~~ad~d~~~~p~Dsl 312 (339)
..-++++||.+++++....| .||-.|
T Consensus 53 ~vW~lRdyL~~~i~~~G~eP--~PfS~L 78 (89)
T PF10078_consen 53 DVWILRDYLNDKIKEQGKEP--VPFSAL 78 (89)
T ss_pred cchHHHHHHHHHHHHcCCCC--CCeeee
Confidence 34589999999999999999 898655
No 117
>PF00558 Vpu: Vpu protein; InterPro: IPR008187 The Human immunodeficiency virus 1 (HIV-1) Vpu protein acts in the degradation of CD4 in the endoplasmic reticulum and in the enhancement of virion release from the plasma membrane of infected cells [].; GO: 0019076 release of virus from host; PDB: 2JPX_A 1PI8_A 2GOH_A 2GOF_A 1PI7_A 1PJE_A 1VPU_A 2K7Y_A.
Probab=24.87 E-value=42 Score=25.27 Aligned_cols=18 Identities=17% Similarity=0.409 Sum_probs=4.3
Q ss_pred HHHHHHHHHHHHHHHhhh
Q psy592 206 PNQVLVLVFVVYSRRREA 223 (339)
Q Consensus 206 ~~l~~~l~~~~~~~~~~~ 223 (339)
+..+-.+++.-|++.+++
T Consensus 19 aIvvW~iv~ieYrk~~rq 36 (81)
T PF00558_consen 19 AIVVWTIVYIEYRKIKRQ 36 (81)
T ss_dssp HHHHHHHH----------
T ss_pred HHHHHHHHHHHHHHHHHH
Confidence 335555666666665543
No 118
>PRK13183 psbN photosystem II reaction center protein N; Provisional
Probab=24.80 E-value=61 Score=21.61 Aligned_cols=15 Identities=13% Similarity=0.157 Sum_probs=7.8
Q ss_pred HHHHHHHHHHHHHhh
Q psy592 208 QVLVLVFVVYSRRRE 222 (339)
Q Consensus 208 l~~~l~~~~~~~~~~ 222 (339)
|+-+-.+.+|.....
T Consensus 18 L~~~TgyaiYtaFGp 32 (46)
T PRK13183 18 LLALTGFGIYTAFGP 32 (46)
T ss_pred HHHHhhheeeeccCC
Confidence 444455556655543
No 119
>KOG1094|consensus
Probab=24.74 E-value=99 Score=32.27 Aligned_cols=8 Identities=0% Similarity=0.638 Sum_probs=3.3
Q ss_pred HHHHHHHH
Q psy592 211 VLVFVVYS 218 (339)
Q Consensus 211 ~l~~~~~~ 218 (339)
++++++||
T Consensus 407 ii~~~L~R 414 (807)
T KOG1094|consen 407 IIALMLWR 414 (807)
T ss_pred HHHHHHHH
Confidence 33334444
No 120
>COG4736 CcoQ Cbb3-type cytochrome oxidase, subunit 3 [Posttranslational modification, protein turnover, chaperones]
Probab=24.14 E-value=89 Score=22.13 Aligned_cols=15 Identities=7% Similarity=0.450 Sum_probs=7.5
Q ss_pred HHHHHHHHHHHHHhh
Q psy592 208 QVLVLVFVVYSRRRE 222 (339)
Q Consensus 208 l~~~l~~~~~~~~~~ 222 (339)
+++..++.+++++++
T Consensus 21 ~fiavi~~ayr~~~K 35 (60)
T COG4736 21 FFIAVIYFAYRPGKK 35 (60)
T ss_pred HHHHHHHHHhcccch
Confidence 334444555655554
No 121
>PRK14584 hmsS hemin storage system protein; Provisional
Probab=23.40 E-value=2e+02 Score=24.46 Aligned_cols=25 Identities=20% Similarity=0.449 Sum_probs=12.5
Q ss_pred hHHHHHHHHHHHHH-HHHHHHHHHHHhh
Q psy592 196 SALFVTLLISPNQV-LVLVFVVYSRRRE 222 (339)
Q Consensus 196 ~~il~~~~~l~~l~-~~l~~~~~~~~~~ 222 (339)
+++.++|+++ .. ++++...|.++|-
T Consensus 63 l~~yl~ial~--nAvlLI~WA~YN~~RF 88 (153)
T PRK14584 63 IALYLAIAAF--NAVLLIIWAKYNQVRF 88 (153)
T ss_pred HHHHHHHHHH--HHHHHHHHHHHHHHHh
Confidence 5555555554 33 3333455666554
No 122
>PF02009 Rifin_STEVOR: Rifin/stevor family; InterPro: IPR002858 Malaria is still a major cause of mortality in many areas of the world. Plasmodium falciparum causes the most severe human form of the disease and is responsible for most fatalities. Severe cases of malaria can occur when the parasite invades and then proliferates within red blood cell erythrocytes. The parasite produces many variant antigenic proteins, encoded by multigene families, which are present on the surface of the infected erythrocyte and play important roles in virulence. A crucial survival mechanism for the malaria parasite is its ability to evade the immune response by switching these variant surface antigens. The high virulence of P. falciparum relative to other malarial parasites is in large part due to the fact that in this organism many of these surface antigens mediate the binding of infected erythrocytes to the vascular endothelium (cytoadherence) and non-infected erythrocytes (rosetting). This can lead to the accumulation of infected cells in the vasculature of a variety of organs, blocking the blood flow and reducing the oxygen supply. Clinical symptoms of severe infection can include fever, progressive anaemia, multi-organ dysfunction and coma. For more information see []. Several multicopy gene families have been described in Plasmodium falciparum, including the stevor family of subtelomeric open reading frames and the rif interspersed repetitive elements. Both families contain three predicted transmembrane segments. It has been proposed that stevor and rif are members of a larger superfamily that code for variant surface antigens [].
Probab=23.36 E-value=96 Score=29.36 Aligned_cols=25 Identities=16% Similarity=0.397 Sum_probs=16.6
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy592 196 SALFVTLLISPNQVLVLVFVVYSRRRE 222 (339)
Q Consensus 196 ~~il~~~~~l~~l~~~l~~~~~~~~~~ 222 (339)
+||++.+|++ +++-|++--+|+++-
T Consensus 263 iaIliIVLIM--vIIYLILRYRRKKKm 287 (299)
T PF02009_consen 263 IAILIIVLIM--VIIYLILRYRRKKKM 287 (299)
T ss_pred HHHHHHHHHH--HHHHHHHHHHHHhhh
Confidence 5667666666 666666767776554
No 123
>PHA02681 ORF089 virion membrane protein; Provisional
Probab=23.11 E-value=1.8e+02 Score=22.02 Aligned_cols=14 Identities=29% Similarity=0.458 Sum_probs=8.8
Q ss_pred HHHHHHHHHHHHHh
Q psy592 208 QVLVLVFVVYSRRR 221 (339)
Q Consensus 208 l~~~l~~~~~~~~~ 221 (339)
++.+|++.+|+|-+
T Consensus 15 IVclliya~YRR~~ 28 (92)
T PHA02681 15 IVCYIVIMMYRRSC 28 (92)
T ss_pred HHHHHHHHHHHhcc
Confidence 45556677777654
No 124
>PF07213 DAP10: DAP10 membrane protein; InterPro: IPR009861 This family consists of several mammalian DAP10 membrane proteins. In activated mouse natural killer (NK) cells, the NKG2D receptor associates with two intracellular adaptors, DAP10 and DAP12, which trigger phosphatidyl inositol 3 kinase (PI3K) and Syk family protein tyrosine kinases, respectively. It has been suggested that the DAP10-PI3K pathway is sufficient to initiate NKG2D-mediated killing of target cells [].
Probab=22.86 E-value=1.3e+02 Score=22.47 Aligned_cols=13 Identities=38% Similarity=0.575 Sum_probs=5.4
Q ss_pred HHHHHHHHHHHHh
Q psy592 209 VLVLVFVVYSRRR 221 (339)
Q Consensus 209 ~~~l~~~~~~~~~ 221 (339)
++++..+.+.|.+
T Consensus 49 LIv~~vy~car~r 61 (79)
T PF07213_consen 49 LIVLVVYYCARPR 61 (79)
T ss_pred HHHHHHHhhcccc
Confidence 3344444444433
No 125
>PF03908 Sec20: Sec20; InterPro: IPR005606 Sec20 is a membrane glycoprotein associated with secretory pathway.
Probab=22.83 E-value=95 Score=23.60 Aligned_cols=12 Identities=25% Similarity=0.800 Sum_probs=7.1
Q ss_pred HHHHHHHHHHHH
Q psy592 208 QVLVLVFVVYSR 219 (339)
Q Consensus 208 l~~~l~~~~~~~ 219 (339)
+++++++++|+|
T Consensus 80 f~~~v~yI~~rR 91 (92)
T PF03908_consen 80 FLLVVLYILWRR 91 (92)
T ss_pred HHHHHHHHhhhc
Confidence 345666666654
No 126
>PF06716 DUF1201: Protein of unknown function (DUF1201); InterPro: IPR009591 This entry consists of several Beet yellows virus (BYV) putative membrane-binding proteins of around 54 residues in length. The function of this currently unknown.
Probab=22.81 E-value=1.6e+02 Score=19.59 Aligned_cols=23 Identities=17% Similarity=0.496 Sum_probs=11.9
Q ss_pred cchHHHHHHHHHHHHHHHHHHHHHH
Q psy592 194 TGSALFVTLLISPNQVLVLVFVVYS 218 (339)
Q Consensus 194 ~g~~il~~~~~l~~l~~~l~~~~~~ 218 (339)
.|.+.++|+.+. .+.+++.++++
T Consensus 11 ~~F~~lIC~Fl~--~~~~F~~F~~K 33 (54)
T PF06716_consen 11 LAFGFLICLFLF--CLVVFIWFVYK 33 (54)
T ss_pred HHHHHHHHHHHH--HHHHHHHHHHH
Confidence 346677776665 33333334443
No 127
>TIGR03521 GldG gliding-associated putative ABC transporter substrate-binding component GldG. Members of this protein family are exclusive to the Bacteroidetes phylum (previously Cytophaga-Flavobacteria-Bacteroides). GldG is a protein linked to a type of rapid surface gliding motility found in certain Bacteroidetes, such as Flavobacterium johnsoniae and Cytophaga hutchinsonii. Knockouts of GldG abolish the gliding phenotype. GldG, along with GldA and GldF are believed to compose an ABC transporter and are observed as an operon. Gliding motility appears closely linked to chitin utilization in the model species Flavobacterium johnsoniae. Bacteroidetes with members of this protein family appear to have all of the genes associated with gliding motility.
Probab=22.72 E-value=92 Score=32.06 Aligned_cols=14 Identities=14% Similarity=0.169 Sum_probs=6.8
Q ss_pred HHHHHHHHHHHHHh
Q psy592 208 QVLVLVFVVYSRRR 221 (339)
Q Consensus 208 l~~~l~~~~~~~~~ 221 (339)
+++++.+++|.+||
T Consensus 535 ~~l~~G~~~~~~Rr 548 (552)
T TIGR03521 535 LLLLFGLSFTYIRK 548 (552)
T ss_pred HHHHHHHHHHHHHH
Confidence 34444555554444
No 128
>PRK01844 hypothetical protein; Provisional
Probab=22.71 E-value=1.8e+02 Score=21.38 Aligned_cols=23 Identities=13% Similarity=0.199 Sum_probs=10.1
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHH
Q psy592 196 SALFVTLLISPNQVLVLVFVVYSR 219 (339)
Q Consensus 196 ~~il~~~~~l~~l~~~l~~~~~~~ 219 (339)
++|++.|+.| ++.+++.|++-++
T Consensus 5 ~~I~l~I~~l-i~G~~~Gff~ark 27 (72)
T PRK01844 5 LGILVGVVAL-VAGVALGFFIARK 27 (72)
T ss_pred HHHHHHHHHH-HHHHHHHHHHHHH
Confidence 4455444433 1334555544433
No 129
>PRK05886 yajC preprotein translocase subunit YajC; Validated
Probab=22.70 E-value=94 Score=24.79 Aligned_cols=10 Identities=10% Similarity=0.325 Sum_probs=4.0
Q ss_pred HHHHHHHHHH
Q psy592 211 VLVFVVYSRR 220 (339)
Q Consensus 211 ~l~~~~~~~~ 220 (339)
++.|++.|++
T Consensus 16 i~yF~~iRPQ 25 (109)
T PRK05886 16 GFMYFASRRQ 25 (109)
T ss_pred HHHHHHccHH
Confidence 3334444433
No 130
>PHA02902 putative IMV membrane protein; Provisional
Probab=21.99 E-value=1.4e+02 Score=21.35 Aligned_cols=14 Identities=21% Similarity=0.581 Sum_probs=8.1
Q ss_pred HHHHHHHHHHHHHh
Q psy592 208 QVLVLVFVVYSRRR 221 (339)
Q Consensus 208 l~~~l~~~~~~~~~ 221 (339)
++.++++.+|+|-|
T Consensus 15 Ivclliya~YrR~k 28 (70)
T PHA02902 15 IFCLLIYAAYKRYK 28 (70)
T ss_pred HHHHHHHHHHHHhc
Confidence 34455667776654
No 131
>PF12768 Rax2: Cortical protein marker for cell polarity
Probab=21.81 E-value=1.1e+02 Score=28.69 Aligned_cols=24 Identities=29% Similarity=0.438 Sum_probs=11.0
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy592 196 SALFVTLLISPNQVLVLVFVVYSRRR 221 (339)
Q Consensus 196 ~~il~~~~~l~~l~~~l~~~~~~~~~ 221 (339)
+||.+.+++| |+++-+++.+.+||
T Consensus 234 lAiALG~v~l--l~l~Gii~~~~~r~ 257 (281)
T PF12768_consen 234 LAIALGTVFL--LVLIGIILAYIRRR 257 (281)
T ss_pred hHHHHHHHHH--HHHHHHHHHHHHhh
Confidence 4554444444 44444444444444
No 132
>PF15431 TMEM190: Transmembrane protein 190
Probab=21.71 E-value=2.2e+02 Score=22.79 Aligned_cols=45 Identities=9% Similarity=0.096 Sum_probs=23.3
Q ss_pred EEEccccCCCccccc-ccc-----c--hHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy592 177 IWLNAVAVNDGQWHF-VKT-----G--SALFVTLLISPNQVLVLVFVVYSRRRE 222 (339)
Q Consensus 177 v~v~~~~~~~~~~~~-~~~-----g--~~il~~~~~l~~l~~~l~~~~~~~~~~ 222 (339)
+|+-.|..+|+|+.. ..+ . ..-..|.-+|+ |+..+-++.|.+|+.
T Consensus 36 LrLrCCYrdGvCYhQRpDEnmrrKHmWaL~wtC~gll~-Li~~iclFWWAkRrd 88 (134)
T PF15431_consen 36 LRLRCCYRDGVCYHQRPDENMRRKHMWALGWTCGGLLL-LICSICLFWWAKRRD 88 (134)
T ss_pred eeeeeecccceeeccCcchhHHHHHHHHHHHHHHhHHH-HHHHHHHHHHHHHhc
Confidence 566668889988751 111 1 33456655551 222333455555553
No 133
>KOG3653|consensus
Probab=21.59 E-value=1.4e+02 Score=30.11 Aligned_cols=16 Identities=31% Similarity=0.764 Sum_probs=10.0
Q ss_pred HHHHHHHHHHHHhhhc
Q psy592 209 VLVLVFVVYSRRREAH 224 (339)
Q Consensus 209 ~~~l~~~~~~~~~~~~ 224 (339)
+++++|++|++||.++
T Consensus 170 lvi~~~~~~r~~k~~~ 185 (534)
T KOG3653|consen 170 LVILAFLGYRQRKNAR 185 (534)
T ss_pred HHHHHHHHHHHhhccc
Confidence 3455667787777653
No 134
>PF06247 Plasmod_Pvs28: Plasmodium ookinete surface protein Pvs28; InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=21.47 E-value=28 Score=30.47 Aligned_cols=57 Identities=18% Similarity=0.376 Sum_probs=34.2
Q ss_pred CCEEeecCCCccC-------CCCCCCCCCCCcccCC-CCCCCCCCCCCeeecCC-----CCceeeCCCCCC
Q psy592 10 ITYLYDLANPGLH-------KNSVPGCPATEAVCNA-HSRLHCYPEHGVCVGSM-----HTAKCQCLPGWS 67 (339)
Q Consensus 10 n~~~~dl~~~~~~-------~~s~~gC~~~~~~C~~-~~~~~pC~~~g~Cv~~~-----~~~~C~C~~g~~ 67 (339)
||.++.+++..+- .-++..|.... .|.. .....+|++.++|+... ..|+|.|.+||+
T Consensus 10 NG~LiQMSNHfEC~Cnegfvl~~EntCE~kv-~C~~~e~~~K~Cgdya~C~~~~~~~~~~~~~C~C~~gY~ 79 (197)
T PF06247_consen 10 NGYLIQMSNHFECKCNEGFVLKNENTCEEKV-ECDKLENVNKPCGDYAKCINQANKGEERAYKCDCINGYI 79 (197)
T ss_dssp TEEEEEESSEEEEEESTTEEEEETTEEEE-----SG-GGTTSEEETTEEEEE-SSTTSSTSEEEEE-TTEE
T ss_pred CCEEEEccCceEEEcCCCcEEccccccccce-ecCcccccCccccchhhhhcCCCcccceeEEEecccCce
Confidence 6666666665431 12455665433 5654 22356999999999875 479999999984
No 135
>PF05545 FixQ: Cbb3-type cytochrome oxidase component FixQ; InterPro: IPR008621 This family consists of several Cbb3-type cytochrome oxidase components (FixQ/CcoQ). FixQ is found in nitrogen fixing bacteria. Since nitrogen fixation is an energy-consuming process, effective symbioses depend on operation of a respiratory chain with a high affinity for O2, closely coupled to ATP production. This requirement is fulfilled by a special three-subunit terminal oxidase (cytochrome terminal oxidase cbb3), which was first identified in Bradyrhizobium japonicum as the product of the fixNOQP operon [].
Probab=21.06 E-value=1.1e+02 Score=20.29 Aligned_cols=7 Identities=29% Similarity=0.472 Sum_probs=3.0
Q ss_pred HHHHHHH
Q psy592 214 FVVYSRR 220 (339)
Q Consensus 214 ~~~~~~~ 220 (339)
+.+++++
T Consensus 27 ~w~~~~~ 33 (49)
T PF05545_consen 27 IWAYRPR 33 (49)
T ss_pred HHHHccc
Confidence 3444444
No 136
>KOG1834|consensus
Probab=21.05 E-value=4.3e+02 Score=27.96 Aligned_cols=66 Identities=18% Similarity=0.194 Sum_probs=40.5
Q ss_pred cccEEEEEEeecC-------CceEEEEEecC--CCCceEEEEEECCEEEEEEecC-----CCcccceeEEecccccCccc
Q psy592 97 YTTQLQLKFRTRE-------EFGELFRLSDQ--HNREYAILEIKDSKLRFRYNLN-----NLKTDEKDIWLNAIKDSKLR 162 (339)
Q Consensus 97 ~~~~i~l~frT~~-------~~GlL~~~~~~--~~~~~i~L~L~~G~l~~~~~lG-----~~~~~~~~l~~~~v~dG~WH 162 (339)
.++.|+|++|-.. ..-.|+-.++. -+.++..|.+.+=++.|.++-. +++.-+....+..|.|.+||
T Consensus 366 dhFTlSfwMkHg~~p~~~~~eketIlCnsdk~emnrhHyslyvh~Crl~fllr~d~~~~~~fRpaef~Wkl~qVCD~EWH 445 (952)
T KOG1834|consen 366 DHFTLSFWMKHGPGPKDEQSEKETILCNSDKTEMNRHHYSLYVHGCRLEFLLRRDAGATSDFRPAEFHWKLPQVCDNEWH 445 (952)
T ss_pred CceEEEEeeecCCCCccccccceeEEecccccccccceeEEEEeccEEEEEEccCccccccccchheeccchhhhhhhhh
Confidence 3455666655321 12345544432 2567889999999999998752 22222344446789999999
No 137
>PF11770 GAPT: GRB2-binding adapter (GAPT); InterPro: IPR021082 This entry represents a family of transmembrane proteins which bind the growth factor receptor-bound protein 2 (GRB2) in B cells []. In contrast to other transmembrane adaptor proteins, GAPT, which this entry represents, is not phosphorylated upon BCR ligation. It associates with GRB2 constitutively through its proline-rich region [].
Probab=20.86 E-value=24 Score=29.69 Aligned_cols=10 Identities=10% Similarity=0.162 Sum_probs=4.2
Q ss_pred hHHHHHHHHH
Q psy592 196 SALFVTLLIS 205 (339)
Q Consensus 196 ~~il~~~~~l 205 (339)
++|++.|.+|
T Consensus 10 v~i~igi~Ll 19 (158)
T PF11770_consen 10 VAISIGISLL 19 (158)
T ss_pred HHHHHHHHHH
Confidence 3444444443
No 138
>KOG0994|consensus
Probab=20.86 E-value=66 Score=35.88 Aligned_cols=16 Identities=44% Similarity=1.165 Sum_probs=13.4
Q ss_pred eeeCCCCCCCCCCCCC
Q psy592 59 KCQCLPGWSGTGCVTP 74 (339)
Q Consensus 59 ~C~C~~g~~G~~Ce~~ 74 (339)
.|+|.|||.|..|++-
T Consensus 1085 QCqCkpGfGGR~C~qC 1100 (1758)
T KOG0994|consen 1085 QCQCKPGFGGRTCSQC 1100 (1758)
T ss_pred ceeccCCCCCcchhHH
Confidence 5889999999988765
No 139
>PF04478 Mid2: Mid2 like cell wall stress sensor; InterPro: IPR007567 This family represents a region near the C terminus of Mid2, which contains a transmembrane region. The remainder of the protein sequence is serine-rich and of low complexity, and is therefore impossible to align accurately. Mid2 is thought to act as a mechanosensor of cell wall stress. The C-terminal cytoplasmic region of Mid2 is known to interact with Rom2, a guanine nucleotide exchange factor (GEF) for Rho1, which is part of the cell wall integrity signalling pathway [].
Probab=20.47 E-value=15 Score=31.14 Aligned_cols=8 Identities=38% Similarity=0.833 Sum_probs=3.3
Q ss_pred HHHHHHHh
Q psy592 214 FVVYSRRR 221 (339)
Q Consensus 214 ~~~~~~~~ 221 (339)
|+++.|+|
T Consensus 71 f~~c~r~k 78 (154)
T PF04478_consen 71 FIFCIRRK 78 (154)
T ss_pred eeEEEecc
Confidence 44444444
No 140
>PF11669 WBP-1: WW domain-binding protein 1; InterPro: IPR021684 This family of proteins represents WBP-1, a ligand of the WW domain of Yes-associated protein. This protein has a proline-rich domain. WBP-1 does not bind to the SH3 domain [].
Probab=20.36 E-value=2.8e+02 Score=21.64 Aligned_cols=13 Identities=23% Similarity=0.371 Sum_probs=7.7
Q ss_pred hcccccccCCCCC
Q psy592 235 RENIINYDDEGGG 247 (339)
Q Consensus 235 reni~~Y~eEggG 247 (339)
+-|++.|.+....
T Consensus 57 e~~~~~~~~a~~~ 69 (102)
T PF11669_consen 57 EINLIAYQEAHNY 69 (102)
T ss_pred cccccccccCCCC
Confidence 3466777665544
No 141
>PRK00269 zipA cell division protein ZipA; Reviewed
Probab=20.06 E-value=2.1e+02 Score=26.94 Aligned_cols=12 Identities=25% Similarity=0.465 Sum_probs=7.2
Q ss_pred HHHHHHHHHhhh
Q psy592 212 LVFVVYSRRREA 223 (339)
Q Consensus 212 l~~~~~~~~~~~ 223 (339)
++|--|+|.|..
T Consensus 20 il~~~~~r~r~~ 31 (293)
T PRK00269 20 ILFDGWRRMRGG 31 (293)
T ss_pred HHHHHHHHHhcc
Confidence 455567776543
Done!