Query psy9228
Match_columns 834
No_of_seqs 438 out of 2678
Neff 9.5
Searched_HMMs 46136
Date Fri Aug 16 22:10:10 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy9228.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/9228hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG3514|consensus 100.0 1.2E-63 2.7E-68 542.8 45.8 689 56-822 117-1218(1591)
2 KOG4289|consensus 100.0 3.1E-56 6.7E-61 494.5 32.8 454 62-818 1216-1687(2531)
3 KOG3516|consensus 100.0 4.2E-50 9.1E-55 447.6 45.9 565 218-830 178-952 (1306)
4 KOG3516|consensus 100.0 2.6E-47 5.7E-52 425.3 48.5 606 83-814 484-1170(1306)
5 KOG3514|consensus 100.0 1.1E-41 2.4E-46 371.9 37.2 508 220-830 26-616 (1591)
6 KOG1219|consensus 100.0 8.6E-36 1.9E-40 342.7 25.8 301 160-651 3632-3945(4289)
7 KOG4289|consensus 100.0 1.1E-30 2.4E-35 292.1 16.6 262 532-815 1189-1482(2531)
8 PF00054 Laminin_G_1: Laminin 99.9 2.9E-24 6.3E-29 197.5 17.0 127 682-814 1-129 (131)
9 smart00282 LamG Laminin G doma 99.9 1.6E-22 3.4E-27 188.3 18.3 134 675-813 2-135 (135)
10 cd00110 LamG Laminin G domain; 99.9 1.6E-22 3.5E-27 192.6 18.3 150 654-811 2-151 (151)
11 PF00054 Laminin_G_1: Laminin 99.9 1.6E-20 3.4E-25 172.7 17.8 129 413-548 1-131 (131)
12 PF02210 Laminin_G_2: Laminin 99.8 3.3E-20 7.1E-25 171.2 16.5 127 682-813 1-128 (128)
13 KOG1219|consensus 99.8 4.6E-20 1E-24 214.7 18.2 210 590-820 3631-3842(4289)
14 smart00282 LamG Laminin G doma 99.8 6.6E-18 1.4E-22 157.1 17.6 131 406-545 2-135 (135)
15 cd00110 LamG Laminin G domain; 99.8 2.8E-17 6E-22 156.4 17.7 142 393-543 8-151 (151)
16 PF02210 Laminin_G_2: Laminin 99.7 1E-15 2.2E-20 141.1 15.9 124 413-545 1-128 (128)
17 KOG3509|consensus 99.3 9.9E-11 2.1E-15 134.9 21.9 192 435-638 278-473 (964)
18 smart00210 TSPN Thrombospondin 99.0 1.2E-08 2.5E-13 99.7 17.7 146 653-811 29-181 (184)
19 PF13385 Laminin_G_3: Concanav 98.7 4.3E-07 9.4E-12 86.3 15.3 143 653-814 2-150 (157)
20 KOG1214|consensus 98.6 7.5E-07 1.6E-11 98.4 15.2 68 136-206 693-765 (1289)
21 KOG1217|consensus 98.6 9.9E-07 2.2E-11 101.2 16.0 77 569-648 272-356 (487)
22 PF00008 EGF: EGF-like domain 98.5 4.6E-08 1E-12 64.4 2.0 32 571-604 1-32 (32)
23 smart00159 PTX Pentraxin / C-r 98.5 5.9E-06 1.3E-10 82.2 16.7 149 653-814 8-164 (206)
24 KOG1214|consensus 98.5 5.1E-07 1.1E-11 99.7 9.7 64 144-209 791-859 (1289)
25 PF00008 EGF: EGF-like domain 98.5 6.6E-08 1.4E-12 63.7 1.8 30 143-173 3-32 (32)
26 cd00152 PTX Pentraxins are pla 98.4 1.1E-05 2.3E-10 80.1 16.6 149 653-814 8-164 (201)
27 KOG3509|consensus 98.3 6.1E-06 1.3E-10 96.1 14.3 137 71-213 339-480 (964)
28 smart00210 TSPN Thrombospondin 98.3 1.8E-05 3.8E-10 77.3 15.5 124 404-543 51-181 (184)
29 PF00354 Pentaxin: Pentaxin fa 98.3 1.9E-05 4.2E-10 77.3 14.3 149 654-814 3-158 (195)
30 KOG1225|consensus 98.0 1.4E-05 3E-10 88.0 9.1 108 67-214 233-343 (525)
31 cd05725 Ig3_Robo Third immunog 98.0 7E-06 1.5E-10 65.9 5.1 44 2-45 13-59 (69)
32 KOG1225|consensus 98.0 1.2E-05 2.5E-10 88.6 8.1 107 57-210 258-365 (525)
33 cd05852 Ig5_Contactin-1 Fifth 98.0 1.1E-05 2.3E-10 65.5 4.8 44 2-45 16-63 (73)
34 cd05876 Ig3_L1-CAM Third immun 97.9 1.4E-05 3E-10 64.6 5.1 44 2-45 13-60 (71)
35 smart00179 EGF_CA Calcium-bind 97.9 1.3E-05 2.9E-10 55.9 4.0 36 177-212 1-39 (39)
36 KOG1217|consensus 97.9 4.9E-05 1.1E-09 87.2 10.8 171 32-214 148-355 (487)
37 KOG0994|consensus 97.9 0.00047 1E-08 79.5 17.7 67 580-647 873-950 (1758)
38 smart00179 EGF_CA Calcium-bind 97.9 1.5E-05 3.2E-10 55.7 3.8 36 609-645 2-39 (39)
39 cd05745 Ig3_Peroxidasin Third 97.9 1.7E-05 3.8E-10 64.6 4.6 44 2-45 17-64 (74)
40 cd05863 Ig2_VEGFR-3 Second imm 97.8 1.7E-05 3.7E-10 62.8 4.0 42 2-45 13-55 (67)
41 PF13385 Laminin_G_3: Concanav 97.8 0.00044 9.6E-09 65.4 14.3 139 391-548 8-152 (157)
42 cd05851 Ig3_Contactin-1 Third 97.8 3.2E-05 6.9E-10 65.4 5.2 44 2-45 31-77 (88)
43 cd05864 Ig2_VEGFR-2 Second imm 97.8 2.3E-05 4.9E-10 62.8 4.0 44 2-45 13-58 (70)
44 cd07702 Ig2_VEGFR-1 Second imm 97.8 2.4E-05 5.1E-10 62.9 4.0 42 2-43 13-58 (72)
45 cd05746 Ig4_Peroxidasin Fourth 97.8 3.8E-05 8.2E-10 61.6 5.1 44 2-45 13-60 (69)
46 cd04976 Ig2_VEGFR Second immun 97.7 3.5E-05 7.5E-10 62.2 4.4 44 2-45 13-59 (71)
47 cd05731 Ig3_L1-CAM_like Third 97.7 3.3E-05 7.2E-10 62.3 4.3 44 2-45 13-60 (71)
48 cd05723 Ig4_Neogenin Fourth im 97.7 4.5E-05 9.8E-10 61.5 5.0 44 2-45 14-61 (71)
49 cd05763 Ig_1 Subgroup of the i 97.7 6.3E-05 1.4E-09 61.5 5.8 44 2-45 13-64 (75)
50 PF02973 Sialidase: Sialidase, 97.7 0.00099 2.2E-08 63.7 14.1 133 674-815 33-177 (190)
51 cd05760 Ig2_PTK7 Second immuno 97.7 6.4E-05 1.4E-09 61.8 5.4 44 2-45 13-62 (77)
52 cd00054 EGF_CA Calcium-binding 97.7 4.5E-05 9.6E-10 52.8 3.8 35 136-175 3-38 (38)
53 cd04969 Ig5_Contactin_like Fif 97.7 5.3E-05 1.1E-09 61.5 4.7 44 2-45 16-63 (73)
54 cd05855 Ig_TrkB_d5 Fifth domai 97.7 6.3E-05 1.4E-09 61.6 4.8 44 2-45 13-68 (79)
55 cd05893 Ig_Palladin_C C-termin 97.6 6.2E-05 1.4E-09 61.3 4.5 44 2-45 13-65 (75)
56 cd05867 Ig4_L1-CAM_like Fourth 97.6 7.4E-05 1.6E-09 61.2 4.9 44 2-45 16-65 (76)
57 cd04971 Ig_TrKABC_d5 Fifth dom 97.6 7E-05 1.5E-09 62.1 4.7 44 2-45 13-70 (81)
58 cd05738 Ig2_RPTP_IIa_LAR_like 97.6 8.9E-05 1.9E-09 60.4 5.3 44 2-45 13-62 (74)
59 cd05748 Ig_Titin_like Immunogl 97.6 9.4E-05 2E-09 60.2 5.0 44 2-45 14-64 (74)
60 cd00054 EGF_CA Calcium-binding 97.6 7.5E-05 1.6E-09 51.7 3.8 36 609-645 2-38 (38)
61 cd04973 Ig1_FGFR First immunog 97.6 0.0001 2.2E-09 60.9 5.2 44 2-45 23-69 (79)
62 cd04968 Ig3_Contactin_like Thi 97.6 9.9E-05 2.2E-09 62.5 5.2 44 2-45 31-77 (88)
63 cd05743 Ig_Perlecan_D2_like Im 97.6 7.2E-05 1.6E-09 61.6 4.2 44 2-45 16-66 (78)
64 cd05892 Ig_Myotilin_C C-termin 97.6 0.00011 2.3E-09 59.9 5.0 44 2-45 13-65 (75)
65 PF07645 EGF_CA: Calcium-bindi 97.6 4.5E-05 9.8E-10 54.0 2.3 31 177-207 1-34 (42)
66 cd05873 Ig_Sema4D_like Immunog 97.6 6E-05 1.3E-09 63.0 3.3 43 2-44 25-71 (87)
67 cd00152 PTX Pentraxins are pla 97.5 0.0036 7.8E-08 62.1 16.4 148 391-551 16-169 (201)
68 cd05764 Ig_2 Subgroup of the i 97.5 0.00013 2.8E-09 59.4 4.9 44 2-45 16-64 (74)
69 cd05740 Ig_CEACAM_D4 Fourth im 97.5 0.00013 2.9E-09 62.0 5.0 44 2-45 32-79 (91)
70 smart00159 PTX Pentraxin / C-r 97.5 0.0039 8.5E-08 62.0 16.1 148 392-551 17-169 (206)
71 cd05868 Ig4_NrCAM Fourth immun 97.5 0.00013 2.8E-09 59.8 4.6 44 2-45 16-65 (76)
72 cd05754 Ig3_Perlecan_like Thir 97.5 0.00014 3.1E-09 61.0 5.0 44 2-45 32-76 (85)
73 cd05750 Ig_Pro_neuregulin Immu 97.5 0.0002 4.4E-09 58.4 5.7 44 2-45 14-67 (75)
74 cd05757 Ig2_IL1R_like Second i 97.5 0.00015 3.3E-09 61.6 4.8 44 2-45 30-76 (92)
75 cd05762 Ig8_MLCK Eighth immuno 97.5 0.00014 3E-09 62.9 4.5 44 2-45 30-80 (98)
76 cd05853 Ig6_Contactin-4 Sixth 97.5 0.00011 2.3E-09 61.2 3.6 43 2-44 17-70 (85)
77 cd05734 Ig7_DSCAM Seventh immu 97.4 0.00028 6.1E-09 58.3 5.7 44 2-45 13-68 (79)
78 cd05756 Ig1_IL1R_like First im 97.4 0.00024 5.2E-09 60.8 5.4 44 2-45 34-84 (94)
79 cd04975 Ig4_SCFR_like Fourth i 97.4 0.00014 3.1E-09 63.0 3.9 44 2-45 34-89 (101)
80 cd04978 Ig4_L1-NrCAM_like Four 97.4 0.00027 5.9E-09 57.8 5.0 44 2-45 16-65 (76)
81 cd05744 Ig_Myotilin_C_like Imm 97.4 0.00036 7.8E-09 56.9 5.5 44 2-45 13-65 (75)
82 cd05866 Ig1_NCAM-2 First immun 97.4 0.00024 5.2E-09 60.4 4.6 43 3-45 30-80 (92)
83 cd05728 Ig4_Contactin-2-like F 97.4 0.00022 4.8E-09 59.8 4.3 44 2-45 29-75 (85)
84 smart00181 EGF Epidermal growt 97.3 0.00026 5.6E-09 48.0 3.8 30 143-175 4-35 (35)
85 smart00560 LamGL LamG-like jel 97.3 0.0065 1.4E-07 55.8 14.3 66 740-814 61-129 (133)
86 cd05726 Ig4_Robo Fhird immunog 97.3 0.00028 6.1E-09 60.0 4.8 44 2-45 16-71 (90)
87 KOG3513|consensus 97.3 0.00049 1.1E-08 81.0 8.3 44 2-45 271-318 (1051)
88 cd05733 Ig6_L1-CAM_like Sixth 97.3 0.00046 1E-08 56.6 5.9 44 2-45 13-67 (77)
89 cd05739 Ig3_RPTP_IIa_LAR_like 97.3 0.00035 7.5E-09 55.9 4.9 42 2-45 16-60 (69)
90 cd05859 Ig4_PDGFR-alpha Fourth 97.3 0.00025 5.5E-09 61.5 4.0 44 2-45 33-89 (101)
91 cd00053 EGF Epidermal growth f 97.3 0.00036 7.8E-09 47.4 3.9 33 571-606 2-36 (36)
92 cd05724 Ig2_Robo Second immuno 97.3 0.00045 9.7E-09 58.1 5.2 44 2-45 27-75 (86)
93 cd05895 Ig_Pro_neuregulin-1 Im 97.3 0.00052 1.1E-08 56.1 5.4 44 2-45 14-68 (76)
94 smart00181 EGF Epidermal growt 97.2 0.00039 8.4E-09 47.1 3.7 33 570-606 1-35 (35)
95 cd05856 Ig2_FGFRL1-like Second 97.2 0.00042 9.2E-09 57.6 4.8 44 2-45 24-72 (82)
96 PF07645 EGF_CA: Calcium-bindi 97.2 0.00018 3.9E-09 50.9 2.0 33 608-640 1-34 (42)
97 cd05849 Ig1_Contactin-1 First 97.2 0.00051 1.1E-08 58.6 5.2 44 2-45 34-82 (93)
98 cd05865 Ig1_NCAM-1 First immun 97.2 0.0004 8.6E-09 59.6 4.4 43 2-44 31-83 (96)
99 cd04977 Ig1_NCAM-1_like First 97.2 0.00055 1.2E-08 58.4 5.1 44 2-45 29-81 (92)
100 cd05737 Ig_Myomesin_like_C C-t 97.2 0.00062 1.3E-08 58.1 5.4 44 2-45 31-82 (92)
101 cd04970 Ig6_Contactin_like Six 97.2 0.00038 8.3E-09 58.4 4.0 43 3-45 18-71 (85)
102 cd05894 Ig_C5_MyBP-C C5 immuno 97.2 0.0005 1.1E-08 57.8 4.7 44 2-45 25-76 (86)
103 cd05730 Ig3_NCAM-1_like Third 97.2 0.00056 1.2E-08 58.8 5.1 44 2-45 33-82 (95)
104 cd05854 Ig6_Contactin-2 Sixth 97.2 0.00044 9.5E-09 58.0 4.2 43 3-45 18-71 (85)
105 cd05848 Ig1_Contactin-5 First 97.1 0.00079 1.7E-08 57.7 5.6 44 2-45 34-83 (94)
106 cd05857 Ig2_FGFR Second immuno 97.1 0.00071 1.5E-08 56.7 5.2 44 2-45 24-75 (85)
107 KOG4260|consensus 97.1 0.00044 9.6E-09 67.4 4.2 81 133-213 139-276 (350)
108 cd05897 Ig2_IL1R2_like Second 97.1 0.00045 9.8E-09 58.7 3.9 44 1-44 29-78 (95)
109 cd05850 Ig1_Contactin-2 First 97.1 0.00077 1.7E-08 57.7 5.3 44 2-45 34-83 (94)
110 cd05736 Ig2_Follistatin_like S 97.1 0.001 2.2E-08 54.4 5.7 44 2-45 13-63 (76)
111 cd00053 EGF Epidermal growth f 97.1 0.00062 1.3E-08 46.2 3.7 29 617-645 7-36 (36)
112 PF00047 ig: Immunoglobulin do 97.1 0.00043 9.2E-09 54.4 3.1 39 2-40 17-64 (64)
113 cd05747 Ig5_Titin_like M5, fif 97.1 0.00087 1.9E-08 57.2 5.2 44 2-45 33-83 (92)
114 cd05875 Ig6_hNeurofascin_like 97.1 0.0011 2.3E-08 54.4 5.5 44 2-45 13-67 (77)
115 cd04974 Ig3_FGFR Third immunog 97.0 0.001 2.2E-08 56.5 5.1 26 20-45 54-79 (90)
116 cd05735 Ig8_DSCAM Eight immuno 97.0 0.00075 1.6E-08 57.0 4.1 44 2-45 16-71 (88)
117 smart00408 IGc2 Immunoglobulin 97.0 0.0014 2.9E-08 50.8 5.3 43 2-44 17-62 (63)
118 cd04967 Ig1_Contactin First Ig 97.0 0.0013 2.7E-08 56.0 5.1 44 2-45 34-83 (91)
119 cd04972 Ig_TrkABC_d4 Fourth do 96.9 0.0011 2.5E-08 56.2 4.7 44 2-45 30-80 (90)
120 KOG1226|consensus 96.9 0.0041 8.9E-08 70.2 10.0 71 137-217 548-625 (783)
121 cd05869 Ig5_NCAM-1 Fifth immun 96.9 0.0018 3.9E-08 55.9 5.7 44 2-45 32-87 (97)
122 cd05727 Ig2_Contactin-2-like S 96.9 0.0011 2.4E-08 56.3 4.2 44 2-45 34-84 (96)
123 cd04979 Ig_Semaphorin_C Immuno 96.9 0.0014 3E-08 55.6 4.5 42 2-43 25-73 (89)
124 cd05874 Ig6_NrCAM Sixth immuno 96.8 0.0024 5.3E-08 52.3 5.7 44 2-45 13-67 (77)
125 PF12661 hEGF: Human growth fa 96.8 0.00037 8.1E-09 35.2 0.4 12 163-174 2-13 (13)
126 cd05732 Ig5_NCAM-1_like Fifth 96.8 0.0019 4.1E-08 55.6 5.2 44 2-45 31-86 (96)
127 PF07679 I-set: Immunoglobulin 96.8 0.00073 1.6E-08 57.3 2.4 44 2-45 30-80 (90)
128 cd05749 Ig2_Tyro3_like Second 96.8 0.002 4.4E-08 53.2 4.9 41 3-45 30-71 (81)
129 cd05870 Ig5_NCAM-2 Fifth immun 96.8 0.0022 4.8E-08 55.4 5.0 44 2-45 31-88 (98)
130 cd05742 Ig1_VEGFR_like First i 96.7 0.0016 3.4E-08 54.5 3.8 44 2-45 18-73 (84)
131 cd05845 Ig2_L1-CAM_like Second 96.7 0.0018 3.9E-08 54.9 3.7 43 2-44 34-82 (95)
132 PF00354 Pentaxin: Pentaxin fa 96.7 0.067 1.4E-06 52.5 15.3 136 404-551 24-163 (195)
133 KOG4260|consensus 96.6 0.0023 4.9E-08 62.6 4.7 63 136-206 237-303 (350)
134 cd05858 Ig3_FGFR-2 Third immun 96.6 0.0019 4.1E-08 54.8 3.8 25 21-45 55-79 (90)
135 KOG4194|consensus 96.6 0.0025 5.4E-08 69.7 5.3 60 2-68 642-709 (873)
136 cd05891 Ig_M-protein_C C-termi 96.6 0.004 8.7E-08 53.0 5.6 44 2-45 31-82 (92)
137 PF07974 EGF_2: EGF-like domai 96.6 0.0025 5.5E-08 41.7 3.3 27 144-174 6-32 (32)
138 cd05861 Ig1_PDGFR-alphabeta Fr 96.6 0.0021 4.5E-08 53.7 3.6 43 2-44 16-69 (84)
139 cd05722 Ig1_Neogenin First imm 96.6 0.0039 8.6E-08 53.5 5.4 44 2-45 29-84 (95)
140 PHA02826 IL-1 receptor-like pr 96.6 0.0022 4.9E-08 64.5 4.4 41 2-42 164-209 (227)
141 cd05765 Ig_3 Subgroup of the i 96.6 0.0039 8.4E-08 51.6 5.1 26 20-45 46-71 (81)
142 PF12661 hEGF: Human growth fa 96.5 0.00098 2.1E-08 33.7 0.8 13 593-605 1-13 (13)
143 cd05871 Ig_Semaphorin_classIII 96.5 0.003 6.6E-08 53.6 4.1 41 2-42 25-74 (91)
144 cd05773 Ig8_hNephrin_like Eigh 96.5 0.0042 9.1E-08 54.8 5.1 44 2-45 38-94 (109)
145 cd05896 Ig1_IL1RAPL-1_like Fir 96.5 0.0041 8.8E-08 53.2 4.7 42 3-44 42-93 (104)
146 cd05882 Ig1_Necl-1 First (N-te 96.5 0.0032 7E-08 53.8 4.0 43 2-44 27-85 (95)
147 PHA03099 epidermal growth fact 96.4 0.0024 5.1E-08 55.0 2.7 39 137-176 44-82 (139)
148 cd05753 Ig2_FcgammaR_like Seco 96.4 0.0043 9.4E-08 51.6 4.3 40 3-44 31-71 (83)
149 cd05872 Ig_Sema4B_like Immunog 96.3 0.0041 8.8E-08 51.7 3.8 40 2-41 25-67 (85)
150 PF07974 EGF_2: EGF-like domai 96.3 0.0047 1E-07 40.4 3.3 27 574-605 6-32 (32)
151 cd05752 Ig1_FcgammaR_like Frst 96.3 0.0044 9.6E-08 50.8 3.9 36 3-43 32-68 (78)
152 PF12662 cEGF: Complement Clr- 96.3 0.0034 7.3E-08 37.8 2.3 19 160-179 1-23 (24)
153 KOG1834|consensus 96.2 0.057 1.2E-06 59.6 12.4 157 653-812 344-516 (952)
154 KOG3513|consensus 96.2 0.0051 1.1E-07 72.8 4.6 44 2-45 456-503 (1051)
155 cd05758 Ig5_KIRREL3-like Fifth 95.9 0.011 2.4E-07 51.0 4.7 43 3-45 33-89 (98)
156 PHA02887 EGF-like protein; Pro 95.9 0.0093 2E-07 50.6 3.7 40 136-176 84-123 (126)
157 cd05862 Ig1_VEGFR First immuno 95.7 0.014 3E-07 48.9 4.4 26 20-45 49-74 (86)
158 cd07701 Ig1_Necl-3 First (N-te 95.7 0.014 3E-07 50.0 4.1 24 20-43 61-84 (95)
159 cd05729 Ig2_FGFR_like Second i 95.6 0.016 3.4E-07 48.3 4.3 44 2-45 24-75 (85)
160 cd07690 Ig1_CD4 First immunogl 95.5 0.017 3.7E-07 49.2 4.1 23 22-44 66-88 (94)
161 PF02973 Sialidase: Sialidase, 95.4 0.88 1.9E-05 43.9 15.8 132 406-549 34-179 (190)
162 PF12947 EGF_3: EGF domain; I 95.4 0.01 2.3E-07 40.0 1.9 26 184-209 6-32 (36)
163 cd07693 Ig1_Robo First immunog 95.4 0.023 5E-07 49.0 4.7 44 2-45 31-88 (100)
164 cd05900 Ig_Aggrecan Immunoglob 95.3 0.028 6E-07 49.6 4.9 25 19-43 72-96 (112)
165 PHA02826 IL-1 receptor-like pr 95.3 0.015 3.2E-07 58.6 3.6 44 2-45 63-122 (227)
166 cd05885 Ig2_Necl-4 Second immu 95.2 0.031 6.7E-07 45.8 4.6 45 1-45 14-67 (80)
167 PF12947 EGF_3: EGF domain; I 95.2 0.0086 1.9E-07 40.4 1.1 28 143-172 5-32 (36)
168 smart00560 LamGL LamG-like jel 95.0 0.8 1.7E-05 41.9 13.9 69 468-547 60-130 (133)
169 cd05774 Ig_CEACAM_D1 First imm 94.9 0.042 9E-07 47.9 4.7 26 19-44 68-93 (105)
170 PHA02887 EGF-like protein; Pro 94.6 0.029 6.2E-07 47.7 2.9 36 179-214 84-124 (126)
171 PHA02785 IL-beta-binding prote 94.5 0.041 9E-07 59.1 4.5 44 2-45 61-106 (326)
172 cd05901 Ig_Versican Immunoglob 94.4 0.051 1.1E-06 48.0 4.2 25 20-44 78-102 (117)
173 cd05751 Ig1_LILRB1_like First 94.3 0.057 1.2E-06 45.8 4.3 41 3-43 31-75 (91)
174 PHA03099 epidermal growth fact 94.2 0.035 7.6E-07 48.0 2.6 37 178-214 42-83 (139)
175 PHA02785 IL-beta-binding prote 94.2 0.048 1E-06 58.6 4.3 43 2-45 156-201 (326)
176 smart00410 IG_like Immunoglobu 93.6 0.13 2.7E-06 42.2 5.0 44 2-45 24-75 (86)
177 smart00409 IG Immunoglobulin. 93.6 0.13 2.7E-06 42.2 5.0 44 2-45 24-75 (86)
178 PF13927 Ig_3: Immunoglobulin 93.6 0.019 4.2E-07 46.2 -0.1 24 19-42 52-75 (75)
179 KOG4222|consensus 93.6 0.087 1.9E-06 62.4 5.1 44 2-45 162-211 (1281)
180 smart00051 DSL delta serrate l 93.5 0.11 2.3E-06 40.3 3.9 48 591-644 16-63 (63)
181 PHA02633 hypothetical protein; 93.5 0.079 1.7E-06 40.0 3.0 25 19-43 16-40 (63)
182 cd05879 Ig_P0 Immunoglobulin ( 93.3 0.11 2.4E-06 46.2 4.4 27 19-45 78-104 (116)
183 PF06439 DUF1080: Domain of Un 93.1 0.11 2.5E-06 50.6 4.5 108 674-784 53-171 (185)
184 KOG1836|consensus 93.0 0.13 2.8E-06 65.4 5.7 128 674-814 1556-1685(1705)
185 PF07354 Sp38: Zona-pellucida- 92.7 0.12 2.6E-06 51.5 3.8 43 1-43 12-59 (271)
186 PF13895 Ig_2: Immunoglobulin 92.6 0.07 1.5E-06 43.6 1.9 39 2-44 29-68 (80)
187 cd05898 Ig5_KIRREL3 Fifth immu 92.6 0.21 4.5E-06 42.8 4.8 25 21-45 64-89 (98)
188 KOG1836|consensus 92.5 0.18 3.9E-06 64.1 6.0 116 704-832 1413-1530(1705)
189 cd05886 Ig1_Nectin-1_like Firs 92.5 0.11 2.5E-06 44.6 3.0 24 20-43 62-85 (99)
190 cd05888 Ig1_Nectin-4_like Frst 92.2 0.13 2.8E-06 44.5 3.1 23 20-42 63-85 (100)
191 cd05714 Ig_CSPGs_LP Immunoglob 92.0 0.11 2.5E-06 45.3 2.7 25 20-44 67-91 (106)
192 cd05741 Ig_CEACAM_D1_like Firs 92.0 0.14 3E-06 43.3 3.1 26 19-44 55-80 (92)
193 PF12662 cEGF: Complement Clr- 91.8 0.13 2.8E-06 31.1 1.7 17 197-213 1-21 (24)
194 cd05759 Ig2_KIRREL3-like Secon 91.8 0.28 6E-06 40.5 4.6 43 2-44 15-69 (82)
195 KOG0994|consensus 91.8 0.37 8.1E-06 56.9 6.9 53 160-214 1083-1148(1758)
196 cd05713 Ig_MOG_like Immunoglob 91.6 0.13 2.8E-06 44.4 2.5 25 21-45 65-89 (100)
197 PF06247 Plasmod_Pvs28: Plasmo 91.4 0.2 4.4E-06 47.0 3.5 63 143-207 49-119 (197)
198 cd00096 Ig Immunoglobulin doma 91.4 0.3 6.4E-06 37.8 4.3 43 2-44 13-66 (74)
199 smart00051 DSL delta serrate l 91.3 0.25 5.4E-06 38.3 3.4 45 161-211 17-63 (63)
200 KOG4221|consensus 91.2 0.23 5E-06 59.4 4.6 42 3-44 171-216 (1381)
201 cd05887 Ig1_Nectin-3_like Firs 91.1 0.18 4E-06 43.0 2.9 22 20-41 59-80 (96)
202 KOG4194|consensus 91.1 0.35 7.6E-06 53.6 5.6 62 1-68 534-615 (873)
203 KOG1226|consensus 91.1 0.29 6.3E-06 55.9 5.1 61 144-214 514-582 (783)
204 cd05880 Ig_EVA1 Immunoglobulin 91.1 0.2 4.4E-06 44.5 3.2 27 20-46 78-104 (115)
205 PF14670 FXa_inhibition: Coagu 90.9 0.18 3.8E-06 34.1 2.0 18 190-207 11-28 (36)
206 cd05717 Ig1_Necl-1-3_like Firs 90.8 0.21 4.6E-06 42.6 3.0 24 20-43 61-84 (95)
207 cd05881 Ig1_Necl-2 First (N-te 90.7 0.17 3.7E-06 43.1 2.3 22 22-43 63-84 (95)
208 cd05775 Ig_SLAM-CD84_like_N N- 90.3 0.22 4.7E-06 42.8 2.6 25 20-44 60-84 (97)
209 cd05889 Ig1_DNAM-1_like First 90.2 0.26 5.6E-06 42.1 3.0 23 20-42 59-81 (96)
210 cd05877 Ig_LP_like Immunoglobu 89.9 0.21 4.6E-06 43.6 2.3 26 20-45 67-92 (106)
211 cd05718 Ig1_PVR_like First imm 89.9 0.28 6E-06 42.1 3.0 22 20-41 61-82 (98)
212 cd05715 Ig_P0-like Immunoglobu 89.8 0.27 5.9E-06 43.8 2.9 25 21-45 80-104 (116)
213 cd05878 Ig_Aggrecan_like Immun 89.8 0.27 5.8E-06 43.4 2.8 25 20-44 71-95 (110)
214 KOG4222|consensus 89.6 0.42 9.2E-06 56.9 4.9 43 2-44 255-301 (1281)
215 cd00099 IgV Immunoglobulin var 89.2 0.24 5.2E-06 43.1 2.1 24 21-44 67-90 (105)
216 cd05902 Ig_Neurocan Immunoglob 88.7 0.23 5E-06 43.6 1.6 25 20-44 71-95 (110)
217 cd00055 EGF_Lam Laminin-type e 88.4 0.57 1.2E-05 34.4 3.3 16 160-175 18-33 (50)
218 cd05711 Ig_FcalphaRI Immunoglo 88.3 0.61 1.3E-05 39.7 3.9 39 3-41 30-75 (94)
219 cd05846 Ig1_MRC-OX-2_like Firs 87.8 0.37 7.9E-06 41.4 2.2 23 20-42 60-82 (97)
220 cd04983 IgV_TCR_alpha_like Imm 87.6 0.4 8.6E-06 42.1 2.5 24 20-43 69-92 (109)
221 smart00406 IGv Immunoglobulin 87.5 0.29 6.2E-06 40.0 1.4 19 21-39 62-80 (81)
222 PF00053 Laminin_EGF: Laminin 87.3 0.33 7.2E-06 35.5 1.4 17 159-175 16-32 (49)
223 cd05771 IgC_Tapasin_R Tapasin- 87.2 0.42 9.1E-06 44.2 2.4 24 22-45 2-25 (139)
224 cd04984 IgV_L_lambda Immunoglo 86.2 0.61 1.3E-05 40.0 2.8 22 21-42 63-84 (98)
225 KOG3546|consensus 85.8 2.1 4.4E-05 47.6 7.0 128 674-812 87-224 (1167)
226 KOG4221|consensus 85.6 1 2.2E-05 54.2 5.0 43 2-44 267-314 (1381)
227 cd05899 IgV_TCR_beta Immunoglo 85.6 0.61 1.3E-05 41.0 2.5 22 21-42 72-93 (110)
228 cd07692 Ig_CD3_epsilon Immunog 85.4 1.6 3.5E-05 33.8 4.3 37 2-43 19-56 (65)
229 PF01414 DSL: Delta serrate li 85.1 0.21 4.6E-06 38.6 -0.5 42 591-644 16-63 (63)
230 cd01475 vWA_Matrilin VWA_Matri 85.0 0.85 1.9E-05 46.1 3.6 37 170-208 180-218 (224)
231 smart00180 EGF_Lam Laminin-typ 84.8 0.91 2E-05 32.7 2.6 16 160-175 17-32 (46)
232 cd04980 IgV_L_kappa Immunoglob 84.7 0.61 1.3E-05 40.7 2.1 23 21-43 71-93 (106)
233 PF06247 Plasmod_Pvs28: Plasmo 84.4 0.48 1E-05 44.6 1.3 61 575-640 7-79 (197)
234 PF06439 DUF1080: Domain of Un 84.0 1.9 4.2E-05 41.9 5.5 91 405-497 53-155 (185)
235 PF14670 FXa_inhibition: Coagu 83.8 0.71 1.5E-05 31.2 1.6 18 623-640 11-28 (36)
236 PF01414 DSL: Delta serrate li 83.6 0.4 8.7E-06 37.1 0.4 47 159-211 15-63 (63)
237 PF07686 V-set: Immunoglobulin 83.5 0.7 1.5E-05 40.5 2.0 23 21-43 79-101 (114)
238 cd05712 Ig_Siglec_N Immunoglob 83.3 0.98 2.1E-05 40.4 2.8 22 21-42 80-101 (119)
239 cd05716 Ig_pIgR Immunoglobulin 82.4 0.79 1.7E-05 39.3 1.8 22 21-42 64-85 (98)
240 cd00098 IgC Immunoglobulin Con 81.7 1.7 3.8E-05 36.7 3.7 43 2-44 30-85 (95)
241 cd05720 Ig_CD8_alpha Immunoglo 81.7 1.1 2.3E-05 39.0 2.4 21 22-42 70-90 (104)
242 cd04982 IgV_TCR_gamma Immunogl 80.2 1.1 2.3E-05 39.8 1.9 21 21-41 77-97 (116)
243 PF14099 Polysacc_lyase: Polys 79.8 7.8 0.00017 39.0 8.3 62 705-766 113-183 (224)
244 PF12946 EGF_MSP1_1: MSP1 EGF 78.9 1.2 2.5E-05 30.0 1.2 29 571-601 2-30 (37)
245 cd07700 IgV_CD8_beta Immunoglo 78.8 1.3 2.8E-05 38.7 2.0 20 22-41 73-92 (107)
246 PF12946 EGF_MSP1_1: MSP1 EGF 78.4 1.2 2.6E-05 30.0 1.2 27 181-207 2-30 (37)
247 cd01475 vWA_Matrilin VWA_Matri 77.6 2.2 4.7E-05 43.1 3.5 38 601-641 180-218 (224)
248 KOG1834|consensus 76.4 28 0.00061 39.5 11.5 134 403-543 364-515 (952)
249 cd07706 IgV_TCR_delta Immunogl 76.3 1.6 3.4E-05 38.8 1.8 22 21-42 72-93 (116)
250 cd05860 Ig4_SCFR Fourth immuno 74.4 2.8 6E-05 36.0 2.7 43 3-45 35-89 (101)
251 cd04981 IgV_H Immunoglobulin ( 71.0 4.1 8.8E-05 36.3 3.1 21 22-42 77-97 (117)
252 cd07705 Ig2_Necl-1 Second immu 69.9 7.6 0.00017 32.0 4.3 43 2-44 15-69 (83)
253 cd01951 lectin_L-type legume l 69.6 73 0.0016 31.9 12.3 23 740-762 154-178 (223)
254 PTZ00334 trans-sialidase; Prov 67.4 25 0.00054 41.8 9.2 74 736-814 639-713 (780)
255 cd05883 Ig2_Necl-2 Second immu 67.1 7.2 0.00016 32.1 3.5 44 1-44 14-67 (82)
256 smart00180 EGF_Lam Laminin-typ 66.6 6.6 0.00014 28.2 2.8 18 197-214 17-34 (46)
257 PHA03376 BARF1; Provisional 66.4 4.9 0.00011 38.2 2.6 24 21-44 87-110 (221)
258 PF09264 Sial-lect-inser: Vibr 66.4 83 0.0018 30.1 10.5 85 673-765 31-119 (198)
259 PF00053 Laminin_EGF: Laminin 65.9 4 8.7E-05 29.8 1.6 16 591-606 17-32 (49)
260 cd07694 Ig2_CD4 Second immunog 64.3 12 0.00026 31.1 4.2 40 3-42 32-72 (88)
261 cd02175 GH16_lichenase lichena 63.7 1.2E+02 0.0026 30.1 12.4 29 739-767 137-165 (212)
262 cd05884 Ig2_Necl-3 Second immu 63.7 14 0.0003 30.5 4.7 42 2-43 15-68 (83)
263 cd06899 lectin_legume_LecRK_Ar 62.4 62 0.0013 32.9 10.1 25 737-761 160-186 (236)
264 cd07691 Ig_CD3_gamma_delta Imm 62.0 11 0.00023 29.6 3.4 36 2-42 17-54 (69)
265 PF00954 S_locus_glycop: S-loc 60.6 8.5 0.00018 33.8 3.1 32 610-642 78-109 (110)
266 PF00954 S_locus_glycop: S-loc 59.8 8.6 0.00019 33.7 2.9 31 178-209 77-109 (110)
267 KOG3546|consensus 59.4 24 0.00052 39.6 6.7 125 405-544 87-224 (1167)
268 PF07953 Toxin_R_bind_N: Clost 56.2 1.7E+02 0.0036 28.1 10.6 86 674-767 55-156 (195)
269 PF02057 Glyco_hydro_59: Glyco 55.9 74 0.0016 37.1 10.1 86 707-810 580-665 (669)
270 cd05761 Ig2_Necl-1-4_like Seco 54.6 13 0.00028 30.5 3.0 43 2-44 15-68 (82)
271 PF00139 Lectin_legB: Legume l 54.6 25 0.00054 35.7 5.7 28 734-761 161-190 (236)
272 cd00413 Glyco_hydrolase_16 gly 54.4 2.1E+02 0.0045 28.1 12.3 30 738-767 140-169 (210)
273 cd00055 EGF_Lam Laminin-type e 53.0 13 0.00028 27.2 2.4 16 591-606 18-33 (50)
274 PF12955 DUF3844: Domain of un 53.0 13 0.00028 31.9 2.7 29 143-171 12-43 (103)
275 cd06903 lectin_EMP46_EMP47 EMP 51.5 2E+02 0.0044 28.6 11.4 26 738-763 149-176 (215)
276 PHA02987 Ig domain OX-2-like p 51.0 11 0.00024 36.0 2.3 39 19-65 83-122 (189)
277 KOG1218|consensus 49.6 22 0.00047 37.9 4.7 56 591-646 161-222 (316)
278 smart00407 IGc1 Immunoglobulin 48.7 37 0.0008 27.2 4.8 41 2-42 16-69 (75)
279 PF09064 Tme5_EGF_like: Thromb 46.8 18 0.00038 23.9 2.0 11 591-601 17-27 (34)
280 cd05770 IgC_beta2m Class I maj 46.4 32 0.0007 29.1 4.2 40 2-42 32-81 (93)
281 PF12955 DUF3844: Domain of un 44.4 12 0.00026 32.1 1.2 36 179-214 6-62 (103)
282 cd07308 lectin_leg-like legume 43.2 1.9E+02 0.0041 28.8 10.0 21 740-760 152-172 (218)
283 cd05721 IgV_CTLA-4 Immunoglobu 42.9 17 0.00036 32.0 1.9 23 22-44 75-97 (115)
284 cd05767 IgC_MHC_II_alpha Class 41.5 43 0.00092 28.4 4.2 14 2-15 32-46 (94)
285 PF14099 Polysacc_lyase: Polys 40.3 1.4E+02 0.003 29.9 8.5 59 436-494 112-182 (224)
286 cd02176 GH16_XET Xyloglucan en 40.2 1.1E+02 0.0024 31.5 7.7 76 739-814 120-204 (263)
287 cd02183 GH16_fungal_CRH1_trans 38.4 4E+02 0.0087 26.3 11.8 27 740-766 115-141 (203)
288 PF01683 EB: EB module; Inter 37.9 46 0.001 24.4 3.4 22 143-170 25-46 (52)
289 PF04863 EGF_alliinase: Alliin 33.4 14 0.0003 27.3 -0.0 39 143-181 16-56 (56)
290 PF04863 EGF_alliinase: Alliin 33.4 19 0.00041 26.6 0.6 36 575-610 18-54 (56)
291 KOG3512|consensus 32.3 1.4E+02 0.003 32.9 6.9 31 52-82 278-309 (592)
292 cd07698 IgC_MHC_I_alpha3 Class 30.1 91 0.002 26.1 4.5 41 2-42 31-81 (93)
293 cd06901 lectin_VIP36_VIPL VIP3 28.5 4.7E+02 0.01 26.7 10.1 21 740-760 157-177 (248)
294 cd05847 IgC_CH2_IgE CH2 domain 28.0 92 0.002 26.3 4.1 41 2-42 31-84 (94)
295 KOG3512|consensus 27.6 98 0.0021 33.9 5.0 64 150-214 285-376 (592)
296 cd05771 IgC_Tapasin_R Tapasin- 27.2 86 0.0019 28.6 4.2 10 2-11 70-79 (139)
297 PF07081 DUF1349: Protein of u 26.6 5.9E+02 0.013 24.6 10.1 116 677-812 55-177 (183)
298 cd05766 IgC_MHC_II_beta Class 26.6 1.2E+02 0.0025 25.6 4.5 14 2-15 31-45 (94)
299 cd08023 GH16_laminarinase_like 26.1 6.7E+02 0.015 25.1 11.8 31 737-767 155-185 (235)
300 cd02178 GH16_beta_agarase Beta 25.5 1.4E+02 0.003 30.7 5.8 28 739-766 178-206 (258)
301 PF08787 Alginate_lyase2: Algi 24.9 4.9E+02 0.011 26.3 9.5 52 713-765 135-189 (236)
302 PF02057 Glyco_hydro_59: Glyco 24.4 8.9E+02 0.019 28.6 12.1 58 436-494 577-636 (669)
303 cd00070 GLECT Galectin/galacto 22.9 5.4E+02 0.012 22.9 9.1 33 733-765 70-102 (127)
304 PF09083 DUF1923: Domain of un 22.8 3.2E+02 0.0069 20.1 5.7 30 714-745 19-49 (64)
305 cd07699 IgC_L Immunoglobulin C 22.4 1.2E+02 0.0026 25.9 3.9 14 2-15 34-48 (100)
306 PF04706 Dickkopf_N: Dickkopf 22.0 1.3E+02 0.0029 22.3 3.4 23 170-192 29-51 (52)
307 cd05755 Ig2_ICAM-1_like Second 21.5 1.1E+02 0.0023 26.3 3.3 41 3-44 35-85 (100)
308 KOG1218|consensus 21.4 1.4E+02 0.0031 31.5 5.2 21 162-182 163-183 (316)
309 cd06902 lectin_ERGIC-53_ERGL E 21.2 4.3E+02 0.0093 26.6 8.1 55 706-760 120-175 (225)
310 cd07697 IgC_TCR_gamma T cell r 20.6 1.4E+02 0.003 25.4 3.8 14 2-15 33-47 (96)
311 PF07654 C1-set: Immunoglobuli 20.1 1.1E+02 0.0024 24.9 3.0 37 2-41 25-74 (83)
No 1
>KOG3514|consensus
Probab=100.00 E-value=1.2e-63 Score=542.82 Aligned_cols=689 Identities=23% Similarity=0.408 Sum_probs=517.6
Q ss_pred CceeEEEcCCcceeeeeCCcccCCCcceeeeeCCCCCCCcCC----CCccCCCCccceEEEEEEcCEEEEeecccccccC
Q psy9228 56 VERMMFVDGIGPFSGESQGAFQGLDLSELVYIGAVPDFGEIH----PSAGFSNGFKGCVSRLKYNKTEFELFKMAVERVG 131 (834)
Q Consensus 56 ~~g~~cvd~~~~~~~~~~~~~~g~~~~~~~~~gg~p~~~~~~----~~~~~~~gf~gCi~~~~~~~~~~~~~~~~~~~~~ 131 (834)
+|-.|-||++..+.-.-...|.-.+.-.++||||+|....++ +.....+.|.|-+..|.+...+-.+...+....+
T Consensus 117 e~t~L~vDgv~~~~~~~~~~f~fg~iasdvfVGGlP~~~~la~l~lp~v~yep~frg~~rnl~y~~~p~g~t~~q~l~~~ 196 (1591)
T KOG3514|consen 117 ENTKLEVDGVLVFKILNQRSFVFGNIASDVFVGGLPNMHMLAVLSLPLVRYEPRFRGNVRNLMYRQYPQGVTSPQLLEVG 196 (1591)
T ss_pred ccceEEechhhhhhhhhcceeeeeeeehheeecCCChHHhhhhhcCcccccccccCccceeeeeecCCCCcCChhhhhcc
Confidence 577899999877766666667666666799999999655433 2356677888999988887766554443222222
Q ss_pred CCCCCccCCCCCCCCCCCCEeccCCCCCceEeeCCCCCCCCCCCCCCCCccC-CCCCC-CeeecCCCCeEEeCCCCcccC
Q psy9228 132 VDSCNTCKSSKHNNCINNGLCQDAATRIGYTCICPPGFSGDRCSVLGEPCYP-GACGD-GSCQDVDGAMKCLCPIGTAGK 209 (834)
Q Consensus 132 ~~~c~~C~~~~~~pC~n~g~C~~~~~~~~~~C~C~~g~~G~~Ce~~~~~C~~-~~C~~-g~C~~~~~~~~C~C~~g~~G~ 209 (834)
.. ..|. +.+|... ..|.. .+|++ |.|.....+.+|.|..-+.|.
T Consensus 197 ~d--~~c~---d~~~~~~-----------------------------~~~~~~~~c~~~g~c~s~d~gp~c~c~~~~dgq 242 (1591)
T KOG3514|consen 197 TD--TNCD---DHCKSKS-----------------------------MSSREQFVCLNDGECYSSDDGPHCDCQFDHDGQ 242 (1591)
T ss_pred cC--CCCc---CCCCCcc-----------------------------ccccccceeccCCeEecCCCCCccccccccCcc
Confidence 11 1233 2222221 11221 34554 677777777888888888889
Q ss_pred cccccccccccCCCCcceEEcCCCCCc---ceEEEEEEEcccCCCceEEEeecccCCCCCCeEEEEEecCeEEEEEecCC
Q psy9228 210 RCEQKIKILQPAFKHGSYLAYPTPKTM---RKFKVSLRLNPRDVRDGIILYSGQSDDGLGDFISLAIREKHMEFRFDTGS 286 (834)
Q Consensus 210 ~Ce~~~~~~~~~F~g~~y~~~~~~~~~---~~~~i~l~f~t~~~~~glll~~~~~~~~~~d~~~l~l~~G~v~~~~~~g~ 286 (834)
+||++.....+.|.|.-|+.|....+. ...+|+|.|||. +.+|||||.+.. .||+.|.|++|.|.+.+++++
T Consensus 243 ~cekeK~~~eaTF~G~ef~~YDls~npI~s~~d~itl~FrT~-q~ngllfytG~~----~dYlnlaL~dGaV~l~~~l~~ 317 (1591)
T KOG3514|consen 243 NCEKEKNDGEATFGGDEFVGYDLSQNPIRSKKDNITLTFRTV-QGNGLLFYTGDE----KDYLNLALQDGAVSLSSKLDG 317 (1591)
T ss_pred ccccccCcceEEecCceEEEeeccCCcccccccceEEEEEEe-cCceeEEEccCC----cceeeEeecCCcEEEEEecCC
Confidence 999888788999999999999986543 678899999999 699999999976 599999999999999999998
Q ss_pred Cc--ceecCCCC--Ccccccce-eccCCCeEEEEecCCCC----CCceeeeeecCCCceeccccccccccccccccccCC
Q psy9228 287 AT--PLYSNDAP--AFNPVSTK-EAPYGSKITLTCNNDLE----APVEYTWSKRSNGHVLPFGAFSRENTLTLQEIKNSD 357 (834)
Q Consensus 287 ~~--~~~~~~~~--~~~~~~~~-~~~~~~~v~l~vd~~~~----~~~~~~~~~~~~~~~~~~gg~~~~~~~~~~~~~~~~ 357 (834)
+. .+..+... +.+.||.+ ..|+-..+++.||.... ...+++|... ...+|.|+.+.... ...+.
T Consensus 318 g~~e~~~~p~~~rfdD~~WH~V~v~R~~~m~t~~VDg~~t~~~~~a~~~tmlss--s~~fyvgg~~~~~~-----l~gsr 390 (1591)
T KOG3514|consen 318 GDAEIIRMPNSFRFDDDSWHTVIVERSLQMMTLIVDGRRTEIRQYAPELTMLSS--SDFFYVGGSPNTAD-----LPGSR 390 (1591)
T ss_pred ccceeEEccccccccCCcceEEEEEeeeEEEEEEEccEEecccccccceeEeec--cceEEecCCCCccc-----cCCCc
Confidence 86 23334333 44557755 55666778889987543 3345555332 22345555442211 11112
Q ss_pred CceEEEEeCCC-------------------------CceeEeeEEeeeeccccccCCCcccccccCCccccceeEEEEEE
Q psy9228 358 AGMYVCKVSNK-------------------------DMTVEIPSILLVTDSVPLFTQKPLSYLALPTLTDAHLHFSIELS 412 (834)
Q Consensus 358 ~~~~~C~~~~~-------------------------~~~~~~~~~~~~~~~~~~f~~~~~s~l~~~~~~~~~~~~~i~~~ 412 (834)
..+.+|++... ++...|+.. ..-.|..|..+.||+.+|.|... ..-+|+|.
T Consensus 391 VsF~GClkkV~y~~d~~rl~L~~LAk~g~~~~k~~G~l~y~C~n~---~~~DpvtFtt~es~l~LP~Wnt~-~~gSiSf~ 466 (1591)
T KOG3514|consen 391 VSFMGCLKKVVYKNDDTRLELSRLAKQGDSKMKTEGDLSYSCENV---AQLDPVTFTTPESYLTLPRWNTK-KSGSISFD 466 (1591)
T ss_pred eeeeeeeeeeEeccCceeehhhHHhhcCCceeEeeceEEEecCCC---CccCceeeecccceeeccccccC-CcceeEEE
Confidence 23556665421 111122211 12267788999999999999754 45789999
Q ss_pred EeeCCCCeeEEEeccCCccccCCCCCeEEEEEECcEEEEEEeccc--EEEE-eeeeecCCCeEEEEEEEECCeEEEEECC
Q psy9228 413 FKPTDYNGLIMYTGDSNMKSYKGKGDFVSFGLEDGYPVFRFDVGL--VVLR-SKVTLVPHEWVVVTIIKDFKEGKLSVGG 489 (834)
Q Consensus 413 frt~~~~GlLl~~~~~~~~~~~~~~d~~~l~L~~G~l~~~~~~G~--~~i~-s~~~~~dg~wH~V~v~~~~~~~~l~VD~ 489 (834)
|||+++||||+|..... ....||++++|.||++.+.+++|+ +.++ +..+++||+||+|.+.|+++.+++.||.
T Consensus 467 FRTtepnGlil~~~g~~----~~~~d~~A~ELldghlyl~ldlGSG~iklras~rkv~DGeWhhv~l~R~gR~gsvsVd~ 542 (1591)
T KOG3514|consen 467 FRTTEPNGLILFHGGPQ----ANATDYFAIELLDGHLYLLLDLGSGVIKLRASSRKVNDGEWHHVDLQRDGRTGSVSVDA 542 (1591)
T ss_pred EeecCCCceEEEccCcc----cccccEEEEEEeCCeEEEEEecCCceEEeeeecccccCCceEEEEeeccCccceEEEee
Confidence 99999999999998754 568999999999999999999985 4444 5678999999999999999999999998
Q ss_pred eeeeeeecCCCccccccCCCceeeccccCCCcccC---cCcccccCceeEEeeeEEcCeeeeeccc--ccccCCcc-cCC
Q psy9228 490 EPLIVGSTPGEKLQVLNLRTPLYLGGYNIYHVTPS---LSVEVTEGFHGCISTIDVLGSELDLINS--AVDSANIM-DCS 563 (834)
Q Consensus 490 ~~~~~~~~~g~~~~~l~~~~~l~iGG~~~~~~~~~---~~~~~~~gF~GCi~~v~ing~~~~~~~~--~~~~~~~~-~C~ 563 (834)
..... ..||. ...|+++.++|+|-.+.. ...+ +......||.|||+++.|+|+..++... +..++++. .|.
T Consensus 543 ~~~df-~tpG~-s~iL~ld~~mylG~~~n~-l~~P~~vWta~L~~GyvGCirdl~i~G~s~di~q~ae~q~sagvkpsCs 619 (1591)
T KOG3514|consen 543 IKTDF-STPGD-SEILDLDDPMYLGEVPNN-LVYPSEVWTAALRKGYVGCIRDLFIDGVSTDIRQEAEAQNSAGVKPSCS 619 (1591)
T ss_pred eecCc-cCCCc-ceeEeecCceeeccCCCC-ccCcHHHHHHHHhccchheehhheecceehhhHHHhhhccccccCcccc
Confidence 76654 46777 889999999999966664 2222 2234578999999999999999998764 44556664 677
Q ss_pred CCCCCCCCCCCCCCCCCeecccCCCCCceeeecC-CCCCCCCCccCccCCC-----------------------------
Q psy9228 564 DLESSPVCAPKPCQNYGICYPTDTSERGYNCSCL-TGYSGDHCEKENNMCM----------------------------- 613 (834)
Q Consensus 564 ~~~~~~~C~~~pC~ngg~C~~~~~~~~~~~C~C~-~G~~G~~Ce~~~~~C~----------------------------- 613 (834)
... ...|.+|||+|+|+|.+.|+ .|.|+|. .+|.|+.||.+...=.
T Consensus 620 ~~~-~~~C~~nPC~N~g~C~egwN---rfiCDCs~T~~~G~~CerE~t~ls~nGs~~m~i~L~~~~~tq~E~v~iRF~t~ 695 (1591)
T KOG3514|consen 620 LSN-EKICESNPCQNGGKCSEGWN---RFICDCSGTGFEGRTCEREATALSYNGSMSMKIVLPHTMHTQAEDVSIRFRTQ 695 (1591)
T ss_pred hhh-ccccCCCcccCCCCcccccc---ccccccccCcccCccccceeeeEEEcCeeeEEEEecccceeecceEEEEEEec
Confidence 533 23899999999999999999 9999998 8999999997311000
Q ss_pred --------------------------------------------------------------------------------
Q psy9228 614 -------------------------------------------------------------------------------- 613 (834)
Q Consensus 614 -------------------------------------------------------------------------------- 613 (834)
T Consensus 696 r~~Gll~~Tta~~s~D~l~l~L~~g~vkl~v~ls~~~nlfag~~LnDN~WHtvrv~Rrg~~L~L~vD~~~~~~~~~~g~h 775 (1591)
T KOG3514|consen 696 RAYGLLFATTARGSADTLRLELDAGQVKLFVNLSGPENLFAGQSLNDNEWHTVRVVRRGKSLLLYVDFWSVSIYTMNGIH 775 (1591)
T ss_pred ccceeEEEeccCCCCceEEEEEecceEEEEEecCCCcceeccccccCCcceEEEEEEcccceEEEeccccceeeeecCce
Confidence
Q ss_pred ---------------------------------------------------CC---------------------------
Q psy9228 614 ---------------------------------------------------KG--------------------------- 615 (834)
Q Consensus 614 ---------------------------------------------------~~--------------------------- 615 (834)
..
T Consensus 776 ~~le~~~i~~g~e~~~~s~~~~nFiG~l~~LvFNG~~Yld~~K~~~~~ls~l~a~fkl~~iv~~paTf~sk~Sy~~la~L 855 (1591)
T KOG3514|consen 776 VRLEFHNIETGTESRAPSSVPSNFIGHLSGLVFNGQDYLDKCKMGDIQLSELSARFKLRAIVADPATFKSKSSYVKLATL 855 (1591)
T ss_pred EEEEEeeeccccccccCCCCChhhhhhhhheEECcHHHHHHHhcCCcchhhcchhhCceEEeeccceeeechhhhhhhhh
Confidence 00
Q ss_pred --------------------------------------------------------------------------------
Q psy9228 616 -------------------------------------------------------------------------------- 615 (834)
Q Consensus 616 -------------------------------------------------------------------------------- 615 (834)
T Consensus 856 ~ay~s~~l~Fqfkt~sp~gll~fn~gd~ndfi~velvnG~ihYtfdlg~gp~~~k~~sr~hlnDnrWHnV~I~rd~~~~H 935 (1591)
T KOG3514|consen 856 QAYFSMHLFFQFKTTSPDGLLLFNSGDGNDFIAVELVNGYIHYTFDLGNGPTSMKGPSRQHLNDNRWHNVLIYRDKTNTH 935 (1591)
T ss_pred heeeEEEEEEEEeecCCCeEEEecCCCCCceEEEEEeCcEEEEEEEcCCCcccccCcccCcCccccceeEEEEcCCCCce
Confidence
Q ss_pred --------------------------------------------------------------------------------
Q psy9228 616 -------------------------------------------------------------------------------- 615 (834)
Q Consensus 616 -------------------------------------------------------------------------------- 615 (834)
T Consensus 936 tL~vD~s~~t~~~~g~~~l~l~g~LyiGGv~k~m~~~~p~~~asR~g~~g~~~s~dl~~r~p~L~~~a~~~s~lv~~~~s 1015 (1591)
T KOG3514|consen 936 TLKVDNSSTTQIIDGAVNLDLKGKLYIGGVSKPMYSFLPKLVASRSGFQGCLASLDLGGRLPDLISDALFESGLVEVGCS 1015 (1591)
T ss_pred EEEecCceEEEEecCccccccccceecccccccccccccceeeccCCCCCCcCccCccccchhHHHHhhhhccceeeecc
Confidence
Q ss_pred --------CccCCCceeeecCCCcEEcCCC-CCCCCCcccccccceeeEEcC-Cceeeec--hhhhhhccCcceEEEEEE
Q psy9228 616 --------DVCKNGGMCKVTPDSYECLCSL-GYAPPNCAKRVSIGSEVHFLG-EGYVELK--KELIEERRNEETIAFDFV 683 (834)
Q Consensus 616 --------~pC~ngg~C~~~~~~~~C~C~~-g~~G~~Ce~~~~~~~~~~F~g-~s~~~~~--~~~~~~~~~~~~i~~~fr 683 (834)
+-|.|.|.|+..|++|.|.|.+ .|+|+.|..+- ..+.|++ .|-|.|. ++.. .......|.+.|+
T Consensus 1016 gpst~c~~~acanhG~c~q~w~~~~c~csmtS~~Gp~C~d~g---tTYiFgk~gglI~YtwPpNdR-psTr~DrlAvGFs 1091 (1591)
T KOG3514|consen 1016 GPSTTCSEDACANHGVCIQQWNGIACDCSMTSYSGPRCNDPG---TTYIFGKSGGLITYTWPPNDR-PSTRKDRLAVGFS 1091 (1591)
T ss_pred CCCcccchhhhhccceeeeeecceeeeccccccCCCccCCCc---eEEEECCCCceEEEecCCCCC-CCcccceEEEEEE
Confidence 1389999999999999999998 89999998663 3678987 4566664 2221 2347789999999
Q ss_pred eCCCCeeEEecCCCCCCCCCCcceEEEEEECCEEEEEEEcCCcEEEEEeCCceecCCCcEEEEEEEECCEEEEEEcCeee
Q psy9228 684 TDDKNALLLWNGQPSYKNGIGREFIAVAVVNGYLEYSYDLGDGVVTIKFSKKPVNDGIKHSVNVTRINKFGSLEVDSVIV 763 (834)
Q Consensus 684 T~~~~GlLl~~~~~~~~~~~~~~~~~l~l~~G~l~~~~~~g~~~~~l~~s~~~~nDg~wH~V~i~r~~~~~~l~VD~~~~ 763 (834)
|..++|+|+.+..... -+||+.|.|..|+|-+.||+|+..+.|...+..||||++|.|+|+|.+..++|+||..++
T Consensus 1092 Ttq~daVLvRVdSAsg----lgDYlqLhI~qG~igvvfNiGt~Diti~E~~~ivNDgkYHVVRFtR~GGNATLQVD~wpV 1167 (1591)
T KOG3514|consen 1092 TTQPDAVLVRVDSASG----LGDYLQLHINQGKIGVVFNIGTDDITISEHNAIVNDGKYHVVRFTRSGGNATLQVDSWPV 1167 (1591)
T ss_pred eccCceEEEEEeccCC----CCceEEEEEeccEEEEEEeccCcccccccccccccCCceEEEEEEecCCceEEEecccch
Confidence 9999999999876541 379999999999999999999998888877889999999999999999999999999998
Q ss_pred ecccCCCC-ccceecCCceEEcCcCCCCCCCCCccCCCceEEEEEEEECCceecccCCCC
Q psy9228 764 GKGESPGS-QDVINTRGNIYLGGTPNMDLMTGGRYVHPMSGLMMNIHIQNKHISNIGSSA 822 (834)
Q Consensus 764 ~~~~~~~~-~~~l~~~~~lyiGG~p~~~~~~~~~~~~~F~GCi~~v~in~~~~~~l~~~a 822 (834)
......+. ...+|....|-|||.-.. ..|.|-|..|.+||-.+++|....
T Consensus 1168 ~e~yP~grqlTiFNtqa~I~IGGk~qG---------RpFqGqiSGLyyNgLkVLdlAaE~ 1218 (1591)
T KOG3514|consen 1168 NERYPAGRQLTIFNSQARISIGGKFQG---------RPFQGQISGLYYNGLKVLDLAAEN 1218 (1591)
T ss_pred hhccCCCceeEEeeccceEEecccccC---------CcccceecceEEcceeeeehhhcc
Confidence 77655544 356888899999996522 479999999999999999997654
No 2
>KOG4289|consensus
Probab=100.00 E-value=3.1e-56 Score=494.46 Aligned_cols=454 Identities=24% Similarity=0.487 Sum_probs=353.2
Q ss_pred EcCCcceeeeeCCcccCCCcceeeeeCCCCCCCcCCCCccCCCCccceEEEEEEcCEEEEeecccccccCCCCCCccCCC
Q psy9228 62 VDGIGPFSGESQGAFQGLDLSELVYIGAVPDFGEIHPSAGFSNGFKGCVSRLKYNKTEFELFKMAVERVGVDSCNTCKSS 141 (834)
Q Consensus 62 vd~~~~~~~~~~~~~~g~~~~~~~~~gg~p~~~~~~~~~~~~~gf~gCi~~~~~~~~~~~~~~~~~~~~~~~~c~~C~~~ 141 (834)
+..++.+.|.||+||+|.+|++.+ |.|.
T Consensus 1216 i~pvnglrCrCPpGFTgd~CeTei--------------------------------------------------DlCY-- 1243 (2531)
T KOG4289|consen 1216 IHPVNGLRCRCPPGFTGDYCETEI--------------------------------------------------DLCY-- 1243 (2531)
T ss_pred ccccCceeEeCCCCCCcccccchh--------------------------------------------------Hhhh--
Confidence 567778889999999988887765 5788
Q ss_pred CCCCCCCCCEeccCCCCCceEeeCCCCCCCCCCCCC--CCCccCCCCCC-CeeecCC-CCeEEeCCCC-cccCccccccc
Q psy9228 142 KHNNCINNGLCQDAATRIGYTCICPPGFSGDRCSVL--GEPCYPGACGD-GSCQDVD-GAMKCLCPIG-TAGKRCEQKIK 216 (834)
Q Consensus 142 ~~~pC~n~g~C~~~~~~~~~~C~C~~g~~G~~Ce~~--~~~C~~~~C~~-g~C~~~~-~~~~C~C~~g-~~G~~Ce~~~~ 216 (834)
++||.|+|+|.... ++|+|.|.+||+|++||.+ ...|.+.-|.| |+|++.. ++|.|.||.| |++++||.
T Consensus 1244 -s~pC~nng~C~srE--ggYtCeCrpg~tGehCEvs~~agrCvpGvC~nggtC~~~~nggf~c~Cp~ge~e~prC~v--- 1317 (2531)
T KOG4289|consen 1244 -SGPCGNNGRCRSRE--GGYTCECRPGFTGEHCEVSARAGRCVPGVCKNGGTCVNLLNGGFCCHCPYGEFEDPRCEV--- 1317 (2531)
T ss_pred -cCCCCCCCceEEec--CceeEEecCCccccceeeecccCccccceecCCCEEeecCCCceeccCCCcccCCCceEE---
Confidence 99999999999875 9999999999999999975 46799999997 7999987 8899999999 99999986
Q ss_pred ccccCCCCcceEEcCCCCCcceEEEEEEEcccCCCceEEEeecccCCCCCCeEEEEEecCeEEEEEecCCCcceecCCCC
Q psy9228 217 ILQPAFKHGSYLAYPTPKTMRKFKVSLRLNPRDVRDGIILYSGQSDDGLGDFISLAIREKHMEFRFDTGSATPLYSNDAP 296 (834)
Q Consensus 217 ~~~~~F~g~~y~~~~~~~~~~~~~i~l~f~t~~~~~glll~~~~~~~~~~d~~~l~l~~G~v~~~~~~g~~~~~~~~~~~ 296 (834)
++.+|.+.||+.+.......++.++|+|.|. ..+|||+|+++. ..||++|++.+++++++|..|......+
T Consensus 1318 -~trSFp~~sfv~frglrqRfh~TlslsfaT~-~~nGlL~ynGne---khDFvalevVd~qvqltfS~Ges~t~v~---- 1388 (2531)
T KOG4289|consen 1318 -TTRSFPPESFVTFRGLRQRFHFTLSLSFATI-ERNGLLLYNGNE---KHDFVALEVVDEQVQLTFSAGESTTTVS---- 1388 (2531)
T ss_pred -EeeccCchheEEEeccccceEEEEEEEEEEe-eecceEEecCCc---ccceEeeeeeeeeEEEEEecccccceec----
Confidence 4799999999999998888889999999999 699999999943 5799999999999999999886421000
Q ss_pred CcccccceeccCCCeEEEEecCCCCCCceeeeeecCCCceeccccccccccccccccccCCCceEEEEeCCCCceeEeeE
Q psy9228 297 AFNPVSTKEAPYGSKITLTCNNDLEAPVEYTWSKRSNGHVLPFGAFSRENTLTLQEIKNSDAGMYVCKVSNKDMTVEIPS 376 (834)
Q Consensus 297 ~~~~~~~~~~~~~~~v~l~vd~~~~~~~~~~~~~~~~~~~~~~gg~~~~~~~~~~~~~~~~~~~~~C~~~~~~~~~~~~~ 376 (834)
T Consensus 1389 -------------------------------------------------------------------------------- 1388 (2531)
T KOG4289|consen 1389 -------------------------------------------------------------------------------- 1388 (2531)
T ss_pred --------------------------------------------------------------------------------
Confidence
Q ss_pred EeeeeccccccCCCcccccccCCccccceeEEEEEEEeeCCCCeeEEEeccCCccccCCCCCeEEEEEECcEEEEEEecc
Q psy9228 377 ILLVTDSVPLFTQKPLSYLALPTLTDAHLHFSIELSFKPTDYNGLIMYTGDSNMKSYKGKGDFVSFGLEDGYPVFRFDVG 456 (834)
Q Consensus 377 ~~~~~~~~~~f~~~~~s~l~~~~~~~~~~~~~i~~~frt~~~~GlLl~~~~~~~~~~~~~~d~~~l~L~~G~l~~~~~~G 456 (834)
T Consensus 1389 -------------------------------------------------------------------------------- 1388 (2531)
T KOG4289|consen 1389 -------------------------------------------------------------------------------- 1388 (2531)
T ss_pred --------------------------------------------------------------------------------
Confidence
Q ss_pred cEEEEeeeeecCCCeEEEEEEEECCeEEEEECCeeeeee-------------ecCCCccccccCCCceeeccccCCCccc
Q psy9228 457 LVVLRSKVTLVPHEWVVVTIIKDFKEGKLSVGGEPLIVG-------------STPGEKLQVLNLRTPLYLGGYNIYHVTP 523 (834)
Q Consensus 457 ~~~i~s~~~~~dg~wH~V~v~~~~~~~~l~VD~~~~~~~-------------~~~g~~~~~l~~~~~l~iGG~~~~~~~~ 523 (834)
-.-+..++||+||+|.+.+..+...+.||++..... ...++ -..|++.++|++||+|....
T Consensus 1389 ---p~Vp~gvsDGqWHtV~l~YyNK~av~svDdCdt~~al~fg~~gNCAa~g~q~~s-KKsLDltgpLlLGGvPe~fp-- 1462 (2531)
T KOG4289|consen 1389 ---PDVPGGVSDGQWHTVQLEYYNKVAVVSVDDCDTNVALRFGTIGNCAAQGTQTGS-KKSLDLTGPLLLGGVPETFP-- 1462 (2531)
T ss_pred ---CCCCCCcccCceeEEEEEEeceEEEEEeccccccceeeecCccchHhhhhccCc-ceeeeccCceeecCCCCcch--
Confidence 001246899999999999999999999998764322 12233 45689999999999996431
Q ss_pred CcCcccccCceeEEeeeEEcCeeeeecccccccCCcccCCCCCCCCCCCCCCCCCCCeecccCCCCCceeeecCCCCCCC
Q psy9228 524 SLSVEVTEGFHGCISTIDVLGSELDLINSAVDSANIMDCSDLESSPVCAPKPCQNYGICYPTDTSERGYNCSCLTGYSGD 603 (834)
Q Consensus 524 ~~~~~~~~gF~GCi~~v~ing~~~~~~~~~~~~~~~~~C~~~~~~~~C~~~pC~ngg~C~~~~~~~~~~~C~C~~G~~G~ 603 (834)
...+.|.|||+++.++++.+|+...........+|..
T Consensus 1463 ----v~~k~FvGCmrdLsvD~~~VDma~fianngt~eGC~a--------------------------------------- 1499 (2531)
T KOG4289|consen 1463 ----VIEKQFVGCMRDLSVDGRDVDMATFIANNGTHEGCKA--------------------------------------- 1499 (2531)
T ss_pred ----hhHhHhhhhhhhcccccccccHHHHHhhcCcccCchh---------------------------------------
Confidence 2346799999999999999999876544444455543
Q ss_pred CCccCccCCCCCCccCCCceeeecCCCcEEcCCCCCCCCCcccccccceeeEEcCCceeeechhhhhhccCcceEEEEEE
Q psy9228 604 HCEKENNMCMKGDVCKNGGMCKVTPDSYECLCSLGYAPPNCAKRVSIGSEVHFLGEGYVELKKELIEERRNEETIAFDFV 683 (834)
Q Consensus 604 ~Ce~~~~~C~~~~pC~ngg~C~~~~~~~~C~C~~g~~G~~Ce~~~~~~~~~~F~g~s~~~~~~~~~~~~~~~~~i~~~fr 683 (834)
..+-|.+. +|+|+|+|++.|++|.|.||.+|.|..|+..+. ..-+|.|.|-++.....+ .....+.++|+||
T Consensus 1500 ----rk~fCdsg-~C~n~g~CvnrWg~~~C~CP~~fggk~c~~~m~--~pq~frG~sl~sw~~~~~-~vSvPwylsl~FR 1571 (2531)
T KOG4289|consen 1500 ----RKNFCDSG-QCSNGGTCVNRWGGFSCECPLGFGGKGCCQGMA--HPQHFRGHSLVSWEGLPS-QVSVPWYLSLMFR 1571 (2531)
T ss_pred ----hhcccCCC-ccCCCCeeecccCcEeecCccccCCcchhhccC--CchhccccceeeecCCCc-ceecceEEEEEEE
Confidence 23566666 788888888888888888888888888877664 456899988777653322 2347899999999
Q ss_pred eCCCCeeEEecCCCCCCCCCCcceEEEEEECCEEEEEEEcCCcEEEEEeCCceecCCCcEEEEEEEECCEEEEEEcCeee
Q psy9228 684 TDDKNALLLWNGQPSYKNGIGREFIAVAVVNGYLEYSYDLGDGVVTIKFSKKPVNDGIKHSVNVTRINKFGSLEVDSVIV 763 (834)
Q Consensus 684 T~~~~GlLl~~~~~~~~~~~~~~~~~l~l~~G~l~~~~~~g~~~~~l~~s~~~~nDg~wH~V~i~r~~~~~~l~VD~~~~ 763 (834)
|+..+|+||-+.... ..-+.|.|.+|.|.+.+ +... +..+..+++||+||++.|..... ..+..|...-
T Consensus 1572 Tr~ad~vl~~~~~~~------rst~~lqld~g~l~~~v--~~s~--v~L~~~~vtdg~Wh~~~i~l~~d-~~~t~d~g~~ 1640 (2531)
T KOG4289|consen 1572 TRRADGVLMQAEFGG------RSTYNLQLDDGTLKYNV--GDSS--VELPAPRVTDGHWHHLVIELEAD-SVATLDYGIY 1640 (2531)
T ss_pred eeccccEEEEEEeCC------CceEEEEEcCCEEEEEe--cCce--EEccCccccCCchhheeeeeccC-eEEEEechhh
Confidence 999999999876532 34588999999999876 4433 33468899999999999998864 5566665443
Q ss_pred ecccCCCCccceecCCceEEcCcCCCCCCCCCccCCCceEEEEEEEECCceeccc
Q psy9228 764 GKGESPGSQDVINTRGNIYLGGTPNMDLMTGGRYVHPMSGLMMNIHIQNKHISNI 818 (834)
Q Consensus 764 ~~~~~~~~~~~l~~~~~lyiGG~p~~~~~~~~~~~~~F~GCi~~v~in~~~~~~l 818 (834)
+.+...+ ..-|++ ..||+||+|.. ....+|+|||++|.+.|..++..
T Consensus 1641 ~aea~~g-l~gl~l-~sl~vGgap~~------g~p~gf~GCiqgV~v~g~~~l~~ 1687 (2531)
T KOG4289|consen 1641 QAEAKAG-LSGLNL-ESLYVGGAPAT------GVPRGFRGCIQGVRVGGVSILVP 1687 (2531)
T ss_pred hhhhhcC-CCCcee-eEEEEccccCC------CccccchhhhhceEECCEeeccc
Confidence 3322222 223333 37999999944 35689999999999998765443
No 3
>KOG3516|consensus
Probab=100.00 E-value=4.2e-50 Score=447.57 Aligned_cols=565 Identities=21% Similarity=0.330 Sum_probs=412.4
Q ss_pred cccCCCCcceEEcCCCCCc---ceEEEEEEEcccCCCceEEEeecccCCCCCCeEEEEEecCeEEEEEecCCCc-c-e--
Q psy9228 218 LQPAFKHGSYLAYPTPKTM---RKFKVSLRLNPRDVRDGIILYSGQSDDGLGDFISLAIREKHMEFRFDTGSAT-P-L-- 290 (834)
Q Consensus 218 ~~~~F~g~~y~~~~~~~~~---~~~~i~l~f~t~~~~~glll~~~~~~~~~~d~~~l~l~~G~v~~~~~~g~~~-~-~-- 290 (834)
....|+|.+++.|+..... ....|+|+|||. ..+|+|||... .++||+.|+|+++++.+.+++|+.. + .
T Consensus 178 ~vi~fdg~s~~~yr~~~~~m~s~~d~is~~Fkt~-~sdGvllh~eg---~QGd~itlql~~~kl~l~ld~G~~~~~~s~~ 253 (1306)
T KOG3516|consen 178 PVIYFDGSSSLLYRFHRKLMSSLKDVISLKFKTM-QSDGVLLHGEG---QQGDYITLQLIGGKLVLILDLGNSKLPSSRT 253 (1306)
T ss_pred ceeEECCccceeeeccccccccccceeEEEEEee-ccceeEEEccc---CCCCEEEEEEeCCEEEEEEecCCccCccccC
Confidence 3578999999998875433 456799999999 58899999874 3589999999999999999999653 1 1
Q ss_pred ----ecCCCCC-cccccceeccCCCeEEEEecCCCC---CCceeeeeecCCCceeccccccccccccccccccCCCceEE
Q psy9228 291 ----YSNDAPA-FNPVSTKEAPYGSKITLTCNNDLE---APVEYTWSKRSNGHVLPFGAFSRENTLTLQEIKNSDAGMYV 362 (834)
Q Consensus 291 ----~~~~~~~-~~~~~~~~~~~~~~v~l~vd~~~~---~~~~~~~~~~~~~~~~~~gg~~~~~~~~~~~~~~~~~~~~~ 362 (834)
......+ .+||.++..|.++.++++||..+. +..++. ....+..+++||.|........ ...+-+
T Consensus 254 ~~sis~GslLdD~hWHsV~i~r~~~~vnftvD~~~~~fr~~Ge~~--~Ldld~e~~~GGiP~~~~~~~~-----~~nF~G 326 (1306)
T KOG3516|consen 254 PTSISAGSLLDDQHWHSVRIERQGRQVNFTVDGVVHHFRATGEFD--ALDLDTEISFGGIPNDGKSVGF-----EKNFTG 326 (1306)
T ss_pred cceeecccccCCCcceEEEEEecCcEEEEEEccceEeecccCccc--eeecceEEEECCccCCCcccce-----eeeeee
Confidence 1112223 355667788999999999998753 333332 2223445677887643221100 123445
Q ss_pred EEeCC-------CCceeEee-EE-------ee--eeccccccCCCcccccccCCccccceeEEEEEEEeeCCCCeeEEEe
Q psy9228 363 CKVSN-------KDMTVEIP-SI-------LL--VTDSVPLFTQKPLSYLALPTLTDAHLHFSIELSFKPTDYNGLIMYT 425 (834)
Q Consensus 363 C~~~~-------~~~~~~~~-~~-------~~--~~~~~~~f~~~~~s~l~~~~~~~~~~~~~i~~~frt~~~~GlLl~~ 425 (834)
|+.++ .++.+.-+ .. .. .+..+|++|+.+.||+.+|+.... ..+.++|.|||...+|+|++.
T Consensus 327 Cienly~N~vdiidLa~~~~~~~~~~gnv~f~C~~P~~~pvtF~~sss~~~lpg~~~~-~~l~vSF~FRtw~~~G~ll~~ 405 (1306)
T KOG3516|consen 327 CLENLYYNGVDIIDLAKRRKSQISAMGNVSFSCSDPQIIPVTFGNSSSYLRLPGNPNP-DRLSVSFQFRTWNKTGLLLFS 405 (1306)
T ss_pred eeeeeeecCceeEeeecccccceecccceeEeccCCCCCCeEecccceeEEcCCCCCC-CceeeEEEEEeccccCceeee
Confidence 55443 22222111 00 11 134567888888899999988654 358999999999999999998
Q ss_pred ccCCccccCCCCCeEEEEEECcEEEEEEe-ccc--EEEEeeeeecCCCeEEEEEEEECCeEEEEECCeeeeeeecCCCcc
Q psy9228 426 GDSNMKSYKGKGDFVSFGLEDGYPVFRFD-VGL--VVLRSKVTLVPHEWVVVTIIKDFKEGKLSVGGEPLIVGSTPGEKL 502 (834)
Q Consensus 426 ~~~~~~~~~~~~d~~~l~L~~G~l~~~~~-~G~--~~i~s~~~~~dg~wH~V~v~~~~~~~~l~VD~~~~~~~~~~g~~~ 502 (834)
.- ......+.|.|++|++.+.+. .+. ..+.....+|||+||.|.+.+..+.+.+.||+.+...... .+.
T Consensus 406 ~~------~e~~g~v~~fl~eg~~~~~i~~~~r~~~~~~~g~~lnDG~WHsv~~~ak~n~~~~~iDd~~~~~~~~--~~p 477 (1306)
T KOG3516|consen 406 EL------KEGSGEVLLFLKEGKKFLQITQIGRSKADAYAGLKLNDGAWHSVSFNAKKNRLVLMIDDGEAEIAPD--SKP 477 (1306)
T ss_pred ee------ccCCceEEEEEeCCeEEEEEeccccchhhhcccccCCCCceEEEEEEeecceeEEEEcCcccccccC--Ccc
Confidence 64 356789999999999877763 342 4455678899999999999999999999999977643221 112
Q ss_pred ccccCCCceeeccccCCCcccCcCcccccCceeEEeeeEEcCeeeeeccccc------ccCCcccCCCCCCCCCCCCCCC
Q psy9228 503 QVLNLRTPLYLGGYNIYHVTPSLSVEVTEGFHGCISTIDVLGSELDLINSAV------DSANIMDCSDLESSPVCAPKPC 576 (834)
Q Consensus 503 ~~l~~~~~l~iGG~~~~~~~~~~~~~~~~gF~GCi~~v~ing~~~~~~~~~~------~~~~~~~C~~~~~~~~C~~~pC 576 (834)
..+......|+||.|.... ........++|.||++-|.++++.+++..... ....+..|.. .+.|.+|||
T Consensus 478 ~~V~tg~tY~fgg~~~~~~-~~~~~~~~~~f~GCmrli~vd~~~~~l~~v~~~~~g~~~~v~id~C~i---~drClPN~C 553 (1306)
T KOG3516|consen 478 LQVYTGTTYYFGGCPDKFN-SWQCASPIKGFQGCMRLIKVDGQLKDLIDVKQGSLGNFSDVQIDMCGI---SDRCLPNPC 553 (1306)
T ss_pred EEEEeCCeeEecccccccc-chhhccccccccceeEEEEECCeEeeeeeeeccccccccceeeccccc---ccccCCccc
Confidence 3445567789999988521 22233456799999999999999888765211 1122456665 789999999
Q ss_pred CCCCeecccCCCCCceeeecC-CCCCCCCCccCcc--CCCCC--------------------------------------
Q psy9228 577 QNYGICYPTDTSERGYNCSCL-TGYSGDHCEKENN--MCMKG-------------------------------------- 615 (834)
Q Consensus 577 ~ngg~C~~~~~~~~~~~C~C~-~G~~G~~Ce~~~~--~C~~~-------------------------------------- 615 (834)
+|||.|...|. .|.|.|. +||+|..|+..+. .|+..
T Consensus 554 ehgG~C~Qs~~---~f~C~C~~TGY~GatCHtsi~e~SCeay~~~~~t~~~~~iD~DGsGpl~Pl~v~C~~~ed~awTvv 630 (1306)
T KOG3516|consen 554 EHGGKCSQSWD---DFECNCELTGYKGATCHTSIYELSCEAYKNIGQTSGNFLIDSDGSGPLEPLQVYCNITEDRAWTVV 630 (1306)
T ss_pred cCCCccccccc---ceeEeccccccccccccCCCcchhhHHhhhhccccceEEEccCCCCcccceEEEEecccCceEEEE
Confidence 99999999998 9999999 9999999998654 35321
Q ss_pred --------------------------------------------------------------------------------
Q psy9228 616 -------------------------------------------------------------------------------- 615 (834)
Q Consensus 616 -------------------------------------------------------------------------------- 615 (834)
T Consensus 631 ~H~~~~~t~V~g~n~~g~~~~s~~y~as~eQ~~al~n~se~CeQ~i~y~C~~sRllnt~~g~P~Swwigr~ne~~~yWGG 710 (1306)
T KOG3516|consen 631 QHDNLGTTRVRGSNPEGPVAISLFYAASMEQLQALLNRSEHCEQEIEYSCRESRLLNTPDGTPFSWWIGRSNEGHVYWGG 710 (1306)
T ss_pred EeCCccceEEeccCCCCceeEeeehhccHHHHHHHhhhhhhhheeeeeeeccceeeeCCCCCeeEEEecccCCccceecC
Confidence
Q ss_pred -------CccCCCceeeecCCCcEEcCCCCCC-------------------------------------CCCcccccccc
Q psy9228 616 -------DVCKNGGMCKVTPDSYECLCSLGYA-------------------------------------PPNCAKRVSIG 651 (834)
Q Consensus 616 -------~pC~ngg~C~~~~~~~~C~C~~g~~-------------------------------------G~~Ce~~~~~~ 651 (834)
--|.-.+.|++ ..+.|.|..+.. +.+|+.+....
T Consensus 711 s~Pg~qkC~Cgi~~nC~d--~~~~CNCDa~~~ewt~Dtg~l~~k~hLPVt~vv~gdTg~~~sea~~~lgPLrC~gDr~~w 788 (1306)
T KOG3516|consen 711 SGPGLQKCECGLLGNCLD--PQLYCNCDADEKEWTTDTGCLAYKDHLPVTQVVIGDTGRSQSEAPYVLGPLRCEGDRNFW 788 (1306)
T ss_pred CCCccceeeccccccccC--cceeeeccCCCccccccccccchhhcCCeeEEEEccCCCcccccceeecceEeecccccc
Confidence 01444455543 356677754321 12477777777
Q ss_pred eeeEEcC-CceeeechhhhhhccCcceEEEEEEeCCCCeeEEecCCCCCCCCCCcceEEEEEE-CCEEEEEEEcCCcEEE
Q psy9228 652 SEVHFLG-EGYVELKKELIEERRNEETIAFDFVTDDKNALLLWNGQPSYKNGIGREFIAVAVV-NGYLEYSYDLGDGVVT 729 (834)
Q Consensus 652 ~~~~F~g-~s~~~~~~~~~~~~~~~~~i~~~frT~~~~GlLl~~~~~~~~~~~~~~~~~l~l~-~G~l~~~~~~g~~~~~ 729 (834)
.+++|.+ .+|+.|+... .....+|+|.|||..+.|++|.+-+. .|||.|+|. .-.+.|.++.|+++..
T Consensus 789 nsvSF~~~~syL~fp~f~---~~~saDIsf~FrTt~~~gvflen~g~-------~dfir~eL~~~~~vtf~~dvgnGp~~ 858 (1306)
T KOG3516|consen 789 NSVSFHTGASYLHFPPFH---NELSADISFFFRTTASSGVFLENHGI-------NDFIRLELSSPVEVTFAFDVGNGPSQ 858 (1306)
T ss_pred cceEeecCcceeecCccc---CcccccEEEEEEecCCceEeeeccCC-------CceEEEEEcCCCceEEEEEcCCCcee
Confidence 7899986 6799997543 24788999999999999999988764 699999998 6789999999999876
Q ss_pred EEe-CCceecCCCcEEEEEEEECCEEEEEEcCeeeecccCC-CCccceecCCceEEcCcCCCCCCCCCccCCCceEEEEE
Q psy9228 730 IKF-SKKPVNDGIKHSVNVTRINKFGSLEVDSVIVGKGESP-GSQDVINTRGNIYLGGTPNMDLMTGGRYVHPMSGLMMN 807 (834)
Q Consensus 730 l~~-s~~~~nDg~wH~V~i~r~~~~~~l~VD~~~~~~~~~~-~~~~~l~~~~~lyiGG~p~~~~~~~~~~~~~F~GCi~~ 807 (834)
+.. ++..+||++||+|+++|+.+.++|+||+.+.....++ .....|.+.+++||||... ...+|.||||.
T Consensus 859 ~~V~s~t~~nD~qWH~V~~Ern~K~a~LqVD~~~~~~r~sp~~~~~~L~l~s~l~vGgt~~--------~~~gF~GCIRs 930 (1306)
T KOG3516|consen 859 LTVRSPTELNDNQWHQVRAERNSKEASLQVDGLPKSIRTSPIPGTRLLQLYSSLFVGGTVS--------RQRGFLGCIRS 930 (1306)
T ss_pred EEEcCCcccCCCceEEEEEEeccccceEEEcCcccceecCCCCCEEEEEeccceecccccc--------CcCcceeeeee
Confidence 653 5568999999999999999999999999886655444 3356788999999999543 36799999999
Q ss_pred EEECCceecccCCCCCCCccccc
Q psy9228 808 IHIQNKHISNIGSSASSGLHVMP 830 (834)
Q Consensus 808 v~in~~~~~~l~~~a~~~~nv~~ 830 (834)
|+|||+ .+||+..|.....|.|
T Consensus 931 l~LNGv-~ldLe~ra~~~~gv~~ 952 (1306)
T KOG3516|consen 931 LQLNGV-MLDLEYRAYGTAGVSP 952 (1306)
T ss_pred eeecce-eeeehhhhccCCcccC
Confidence 999997 5688776655555544
No 4
>KOG3516|consensus
Probab=100.00 E-value=2.6e-47 Score=425.29 Aligned_cols=606 Identities=22% Similarity=0.355 Sum_probs=389.2
Q ss_pred eeeeeCCCCCCCcCCCCccCCCCccceEEEEEEcCEEEEeecccccc------cCCCCC---CccCCCCCCCCCCCCEec
Q psy9228 83 ELVYIGAVPDFGEIHPSAGFSNGFKGCVSRLKYNKTEFELFKMAVER------VGVDSC---NTCKSSKHNNCINNGLCQ 153 (834)
Q Consensus 83 ~~~~~gg~p~~~~~~~~~~~~~gf~gCi~~~~~~~~~~~~~~~~~~~------~~~~~c---~~C~~~~~~pC~n~g~C~ 153 (834)
..-|+||.|.......-.....+|.||++.++++++..++......+ ..+..| |.|. +|||+|||.|.
T Consensus 484 ~tY~fgg~~~~~~~~~~~~~~~~f~GCmrli~vd~~~~~l~~v~~~~~g~~~~v~id~C~i~drCl---PN~CehgG~C~ 560 (1306)
T KOG3516|consen 484 TTYYFGGCPDKFNSWQCASPIKGFQGCMRLIKVDGQLKDLIDVKQGSLGNFSDVQIDMCGISDRCL---PNPCEHGGKCS 560 (1306)
T ss_pred CeeEeccccccccchhhccccccccceeEEEEECCeEeeeeeeeccccccccceeecccccccccC---CccccCCCccc
Confidence 45689999886433333456789999999999999998876532222 224456 6899 99999999999
Q ss_pred cCCCCCceEeeCC-CCCCCCCCCCCCCCccCCCCCCCeeecCCCCeEEeCCCCcccCcccccccccccCCCCcce-EEcC
Q psy9228 154 DAATRIGYTCICP-PGFSGDRCSVLGEPCYPGACGDGSCQDVDGAMKCLCPIGTAGKRCEQKIKILQPAFKHGSY-LAYP 231 (834)
Q Consensus 154 ~~~~~~~~~C~C~-~g~~G~~Ce~~~~~C~~~~C~~g~C~~~~~~~~C~C~~g~~G~~Ce~~~~~~~~~F~g~~y-~~~~ 231 (834)
.. |..|.|.|. .||.|..|+..+.| ..||.=. ...+.+.-| +-..
T Consensus 561 Qs--~~~f~C~C~~TGY~GatCHtsi~e----------------------------~SCeay~---~~~~t~~~~~iD~D 607 (1306)
T KOG3516|consen 561 QS--WDDFECNCELTGYKGATCHTSIYE----------------------------LSCEAYK---NIGQTSGNFLIDSD 607 (1306)
T ss_pred cc--ccceeEeccccccccccccCCCcc----------------------------hhhHHhh---hhccccceEEEccC
Confidence 84 589999999 99999999865433 1233211 011111111 1111
Q ss_pred CCCCcceEEEEEEEcccCCCceEEEeecccCCCCCCeEEEEEecCeEEEEEecCCCc-ceecCCCCCcccccceeccCCC
Q psy9228 232 TPKTMRKFKVSLRLNPRDVRDGIILYSGQSDDGLGDFISLAIREKHMEFRFDTGSAT-PLYSNDAPAFNPVSTKEAPYGS 310 (834)
Q Consensus 232 ~~~~~~~~~i~l~f~t~~~~~glll~~~~~~~~~~d~~~l~l~~G~v~~~~~~g~~~-~~~~~~~~~~~~~~~~~~~~~~ 310 (834)
.......+.+...+... ...-++-+.... ..-+.-...+|.+...+..+... .+..... ..+++ ..
T Consensus 608 GsGpl~Pl~v~C~~~ed-~awTvv~H~~~~----~t~V~g~n~~g~~~~s~~y~as~eQ~~al~n-~se~C-------eQ 674 (1306)
T KOG3516|consen 608 GSGPLEPLQVYCNITED-RAWTVVQHDNLG----TTRVRGSNPEGPVAISLFYAASMEQLQALLN-RSEHC-------EQ 674 (1306)
T ss_pred CCCcccceEEEEecccC-ceEEEEEeCCcc----ceEEeccCCCCceeEeeehhccHHHHHHHhh-hhhhh-------he
Confidence 11222334443333111 011111111100 11122222333333222222211 0000000 00111 12
Q ss_pred eEEEEecCCC----CCCceee-eeecCCCceeccccc-cccccccccccccCCCceEEEEeCCCC--------ce---eE
Q psy9228 311 KITLTCNNDL----EAPVEYT-WSKRSNGHVLPFGAF-SRENTLTLQEIKNSDAGMYVCKVSNKD--------MT---VE 373 (834)
Q Consensus 311 ~v~l~vd~~~----~~~~~~~-~~~~~~~~~~~~gg~-~~~~~~~~~~~~~~~~~~~~C~~~~~~--------~~---~~ 373 (834)
.+.+.|-... ..-..++ |..+...+..|.||. |+.....+.-..+.....+.|.|.... +. ..
T Consensus 675 ~i~y~C~~sRllnt~~g~P~Swwigr~ne~~~yWGGs~Pg~qkC~Cgi~~nC~d~~~~CNCDa~~~ewt~Dtg~l~~k~h 754 (1306)
T KOG3516|consen 675 EIEYSCRESRLLNTPDGTPFSWWIGRSNEGHVYWGGSGPGLQKCECGLLGNCLDPQLYCNCDADEKEWTTDTGCLAYKDH 754 (1306)
T ss_pred eeeeeeccceeeeCCCCCeeEEEecccCCccceecCCCCccceeeccccccccCcceeeeccCCCccccccccccchhhc
Confidence 2333332211 0113344 344555556666655 444443332222222333455543211 00 00
Q ss_pred eeEE-------------e--e----eec-----cccccCCCcccccccCCccccceeEEEEEEEeeCCCCeeEEEeccCC
Q psy9228 374 IPSI-------------L--L----VTD-----SVPLFTQKPLSYLALPTLTDAHLHFSIELSFKPTDYNGLIMYTGDSN 429 (834)
Q Consensus 374 ~~~~-------------~--~----~~~-----~~~~f~~~~~s~l~~~~~~~~~~~~~i~~~frt~~~~GlLl~~~~~~ 429 (834)
.|.. . . .++ ..++.|....|||.||++.... ...|+|.|||+.++|++|.+-
T Consensus 755 LPVt~vv~gdTg~~~sea~~~lgPLrC~gDr~~wnsvSF~~~~syL~fp~f~~~~-saDIsf~FrTt~~~gvflen~--- 830 (1306)
T KOG3516|consen 755 LPVTQVVIGDTGRSQSEAPYVLGPLRCEGDRNFWNSVSFHTGASYLHFPPFHNEL-SADISFFFRTTASSGVFLENH--- 830 (1306)
T ss_pred CCeeEEEEccCCCcccccceeecceEeecccccccceEeecCcceeecCcccCcc-cccEEEEEEecCCceEeeecc---
Confidence 1100 0 0 001 1234566778999999886543 588999999999999999984
Q ss_pred ccccCCCCCeEEEEEECc-EEEEEEecc----cEEEEeeeeecCCCeEEEEEEEECCeEEEEECCeeeeeeecCCCcccc
Q psy9228 430 MKSYKGKGDFVSFGLEDG-YPVFRFDVG----LVVLRSKVTLVPHEWVVVTIIKDFKEGKLSVGGEPLIVGSTPGEKLQV 504 (834)
Q Consensus 430 ~~~~~~~~d~~~l~L~~G-~l~~~~~~G----~~~i~s~~~~~dg~wH~V~v~~~~~~~~l~VD~~~~~~~~~~g~~~~~ 504 (834)
+..||+.|+|..+ .+.|.++.| ..+++++..+||++||+|.++|+.+..+|+||+.+....++|-.....
T Consensus 831 -----g~~dfir~eL~~~~~vtf~~dvgnGp~~~~V~s~t~~nD~qWH~V~~Ern~K~a~LqVD~~~~~~r~sp~~~~~~ 905 (1306)
T KOG3516|consen 831 -----GINDFIRLELSSPVEVTFAFDVGNGPSQLTVRSPTELNDNQWHQVRAERNSKEASLQVDGLPKSIRTSPIPGTRL 905 (1306)
T ss_pred -----CCCceEEEEEcCCCceEEEEEcCCCceeEEEcCCcccCCCceEEEEEEeccccceEEEcCcccceecCCCCCEEE
Confidence 4689999999877 588899887 367788889999999999999999999999999888776666655677
Q ss_pred ccCCCceeeccccCCCcccCcCcccccCceeEEeeeEEcCeeeeecccccccCCc-ccCCCCCCCCCCCCCCCCCCCeec
Q psy9228 505 LNLRTPLYLGGYNIYHVTPSLSVEVTEGFHGCISTIDVLGSELDLINSAVDSANI-MDCSDLESSPVCAPKPCQNYGICY 583 (834)
Q Consensus 505 l~~~~~l~iGG~~~~~~~~~~~~~~~~gF~GCi~~v~ing~~~~~~~~~~~~~~~-~~C~~~~~~~~C~~~pC~ngg~C~ 583 (834)
|.+..++|+||.... ..+|.||||.+.+||+.+||...+....++ .+|..
T Consensus 906 L~l~s~l~vGgt~~~----------~~gF~GCIRsl~LNGv~ldLe~ra~~~~gv~~GC~G------------------- 956 (1306)
T KOG3516|consen 906 LQLYSSLFVGGTVSR----------QRGFLGCIRSLQLNGVMLDLEYRAYGTAGVSPGCEG------------------- 956 (1306)
T ss_pred EEeccceeccccccC----------cCcceeeeeeeeecceeeeehhhhccCCcccCCCcc-------------------
Confidence 899999999995542 479999999999999999997665444443 34542
Q ss_pred ccCCCCCceeeecCCCCCCCCCccCccCCCCCCccCCCceeeecCCCcEEcCCC-CCCCCCcccccccceeeEEcCCcee
Q psy9228 584 PTDTSERGYNCSCLTGYSGDHCEKENNMCMKGDVCKNGGMCKVTPDSYECLCSL-GYAPPNCAKRVSIGSEVHFLGEGYV 662 (834)
Q Consensus 584 ~~~~~~~~~~C~C~~G~~G~~Ce~~~~~C~~~~pC~ngg~C~~~~~~~~C~C~~-g~~G~~Ce~~~~~~~~~~F~g~s~~ 662 (834)
.|.+. ||+|||+|+..+.+|+|+|.. .|+|+.|.+++ ++.|...+++
T Consensus 957 ---------------------------hCss~-~C~NGG~Cvery~gytCDCs~Tay~Gp~Cs~ei----g~~fe~gs~i 1004 (1306)
T KOG3516|consen 957 ---------------------------HCSSY-PCLNGGHCVERYDGYTCDCSRTAYDGPFCSKEI----GVFFERGSSI 1004 (1306)
T ss_pred ---------------------------ccccc-cccCCCEEEEecCceeeccccCcCCCCcccccc----ceEecCCceE
Confidence 35555 777777777777788888886 78899998875 5778878888
Q ss_pred eechhhhhh----------------ccCcceEEEEEEeCCCCeeEEecCCCCCCCCCCcceEEEEEE-CCEEEEEEEcCC
Q psy9228 663 ELKKELIEE----------------RRNEETIAFDFVTDDKNALLLWNGQPSYKNGIGREFIAVAVV-NGYLEYSYDLGD 725 (834)
Q Consensus 663 ~~~~~~~~~----------------~~~~~~i~~~frT~~~~GlLl~~~~~~~~~~~~~~~~~l~l~-~G~l~~~~~~g~ 725 (834)
+|+...... ......|.|.|+|+...++|||.+... .+|++|-|. ||.|+++|.+|.
T Consensus 1005 ~y~fq~~~~~a~~~~~~~~~~~~~~~~~~e~i~~sftTt~~ps~LLfvssF~------~~y~~V~v~~nGsLq~ry~lg~ 1078 (1306)
T KOG3516|consen 1005 RYNFQKPMRSAVFESSRVKQKLEIEINPNEEINFSFTTTRAPSDLLFVSSFT------DDYLAVLVKDNGSLQTRYMLGF 1078 (1306)
T ss_pred EEeccchHHHhhhhhhhhhhccccccCccceEEEEEEeccCceEEEEeeccc------cceEEEEEeCCCceEEEEecCC
Confidence 886321110 015678999999999999999987643 689999997 999999999998
Q ss_pred -cEEEEEeCCceecCCCcEEEEEEEECCEEEEEEcCeeeecccCCCCccceecCCceEEcCcCCCCCCC---CCccCCCc
Q psy9228 726 -GVVTIKFSKKPVNDGIKHSVNVTRINKFGSLEVDSVIVGKGESPGSQDVINTRGNIYLGGTPNMDLMT---GGRYVHPM 801 (834)
Q Consensus 726 -~~~~l~~s~~~~nDg~wH~V~i~r~~~~~~l~VD~~~~~~~~~~~~~~~l~~~~~lyiGG~p~~~~~~---~~~~~~~F 801 (834)
.+.++....+++.||+.|+|.|.|.++.+.++||+.......... ...++....+++|-+....... ......+|
T Consensus 1079 ~e~~~~~~~~kn~~~gq~H~i~i~r~~~~~~i~vD~y~~~~y~~~~-~~~~~~~ksl~lg~v~e~~~~d~~~~k~~t~gF 1157 (1306)
T KOG3516|consen 1079 REPFEYQFKDKNIALGQPHDINITRGPRTVFLEVDGYLKVEYTFSI-DVDFQSPKSLTLGPVTETANIDHEISKYNTPGF 1157 (1306)
T ss_pred cCceEEecccccccCCCceEEEEecCCceEEEEecCccceeeeccc-ceeecccchhhccceeeccCCChhHHhhcCCCc
Confidence 666777667789999999999999999999999997755443222 3445666667777655432221 12347889
Q ss_pred eEEEEEEEECCce
Q psy9228 802 SGLMMNIHIQNKH 814 (834)
Q Consensus 802 ~GCi~~v~in~~~ 814 (834)
.||+..|++|...
T Consensus 1158 ~GClS~Vqf~~va 1170 (1306)
T KOG3516|consen 1158 GGCLSRVQFNDVA 1170 (1306)
T ss_pred cceeeEEEccccc
Confidence 9999999998865
No 5
>KOG3514|consensus
Probab=100.00 E-value=1.1e-41 Score=371.92 Aligned_cols=508 Identities=23% Similarity=0.364 Sum_probs=338.9
Q ss_pred cCCCC--cceEEcCCCCCcceEEEEEEEcccCCCceEEEeecccCCCCCCeEEEEEecCeEEEEEecCCCcc-eecCCCC
Q psy9228 220 PAFKH--GSYLAYPTPKTMRKFKVSLRLNPRDVRDGIILYSGQSDDGLGDFISLAIREKHMEFRFDTGSATP-LYSNDAP 296 (834)
Q Consensus 220 ~~F~g--~~y~~~~~~~~~~~~~i~l~f~t~~~~~glll~~~~~~~~~~d~~~l~l~~G~v~~~~~~g~~~~-~~~~~~~ 296 (834)
+.|.| ++|..|+.-.....-.++|+|+|. +++|+|||..+ ++..||+.|.|++|+++++|++|.+.. .+.....
T Consensus 26 ~~l~Ga~~s~ary~kW~~~~~g~ls~e~kt~-q~~glllytDd--Ggt~df~eL~lveG~lrLrf~Lg~~~~~~q~~~~i 102 (1591)
T KOG3514|consen 26 IILTGAPDSYARYPKWAHSFEGSLSMELKTR-QSDGLLLYTDD--GGTHDFYELTLVEGHLRLRFRLGNSNEFGQRRVRI 102 (1591)
T ss_pred eEecCCCcchhhchhhhcccCceeeeeeecc-CCCcEEEEecC--CCceeeeEEEEecceEEEEEEecCCCceeeeccee
Confidence 34444 477777765555566789999999 69999999974 456799999999999999999996543 3333334
Q ss_pred Ccccccc-eeccCCCeEEEEecCCCCCC----ceeeeeecCCCceeccccccccccccc---------------------
Q psy9228 297 AFNPVST-KEAPYGSKITLTCNNDLEAP----VEYTWSKRSNGHVLPFGAFSRENTLTL--------------------- 350 (834)
Q Consensus 297 ~~~~~~~-~~~~~~~~v~l~vd~~~~~~----~~~~~~~~~~~~~~~~gg~~~~~~~~~--------------------- 350 (834)
....||. .+.|..++..|.||....-. .++.. ......+++||.|....+.+
T Consensus 103 ~D~~WH~v~i~r~~e~t~L~vDgv~~~~~~~~~~f~f--g~iasdvfVGGlP~~~~la~l~lp~v~yep~frg~~rnl~y 180 (1591)
T KOG3514|consen 103 DDDKWHTVTIFRSWENTKLEVDGVLVFKILNQRSFVF--GNIASDVFVGGLPNMHMLAVLSLPLVRYEPRFRGNVRNLMY 180 (1591)
T ss_pred cCCceeEEEEEeccccceEEechhhhhhhhhcceeee--eeeehheeecCCChHHhhhhhcCcccccccccCccceeeee
Confidence 4444554 45566678888887633211 11111 11111345555552111000
Q ss_pred ----------cccc-----c--------CCC--ceE------EEEeCCCCceeEeeEEeee-------eccccccCCCc-
Q psy9228 351 ----------QEIK-----N--------SDA--GMY------VCKVSNKDMTVEIPSILLV-------TDSVPLFTQKP- 391 (834)
Q Consensus 351 ----------~~~~-----~--------~~~--~~~------~C~~~~~~~~~~~~~~~~~-------~~~~~~f~~~~- 391 (834)
+.+. + ... .-+ .|..+..+..++|..+... .+...-|.+..
T Consensus 181 ~~~p~g~t~~q~l~~~~d~~c~d~~~~~~~~~~~~~~c~~~g~c~s~d~gp~c~c~~~~dgq~cekeK~~~eaTF~G~ef 260 (1591)
T KOG3514|consen 181 RQYPQGVTSPQLLEVGTDTNCDDHCKSKSMSSREQFVCLNDGECYSSDDGPHCDCQFDHDGQNCEKEKNDGEATFGGDEF 260 (1591)
T ss_pred ecCCCCcCChhhhhcccCCCCcCCCCCccccccccceeccCCeEecCCCCCccccccccCccccccccCcceEEecCceE
Confidence 0000 0 000 001 1222222333344432211 11222343332
Q ss_pred ccc-cccCCccccceeEEEEEEEeeCCCCeeEEEeccCCccccCCCCCeEEEEEECcEEEEEEeccc-----EEEEeeee
Q psy9228 392 LSY-LALPTLTDAHLHFSIELSFKPTDYNGLIMYTGDSNMKSYKGKGDFVSFGLEDGYPVFRFDVGL-----VVLRSKVT 465 (834)
Q Consensus 392 ~s~-l~~~~~~~~~~~~~i~~~frt~~~~GlLl~~~~~~~~~~~~~~d~~~l~L~~G~l~~~~~~G~-----~~i~s~~~ 465 (834)
.+| |...+.. ...-.|+|.|||.+++|||||.+. ..||+.|.|+||.+.+...++. ...-...+
T Consensus 261 ~~YDls~npI~--s~~d~itl~FrT~q~ngllfytG~--------~~dYlnlaL~dGaV~l~~~l~~g~~e~~~~p~~~r 330 (1591)
T KOG3514|consen 261 VGYDLSQNPIR--SKKDNITLTFRTVQGNGLLFYTGD--------EKDYLNLALQDGAVSLSSKLDGGDAEIIRMPNSFR 330 (1591)
T ss_pred EEeeccCCccc--ccccceEEEEEEecCceeEEEccC--------CcceeeEeecCCcEEEEEecCCccceeEEcccccc
Confidence 233 2233332 234679999999999999999976 5799999999999999998862 22335678
Q ss_pred ecCCCeEEEEEEEECCeEEEEECCeeeeeeecCCCccccccCCCceeeccccCCCcccCcCcccccCceeEEeeeEEcCe
Q psy9228 466 LVPHEWVVVTIIKDFKEGKLSVGGEPLIVGSTPGEKLQVLNLRTPLYLGGYNIYHVTPSLSVEVTEGFHGCISTIDVLGS 545 (834)
Q Consensus 466 ~~dg~wH~V~v~~~~~~~~l~VD~~~~~~~~~~g~~~~~l~~~~~l~iGG~~~~~~~~~~~~~~~~gF~GCi~~v~ing~ 545 (834)
++|.+||.|.++|....+++.||+.........+. ...|....-+|+||.+.....+ .....|.||+++|.+...
T Consensus 331 fdD~~WH~V~v~R~~~m~t~~VDg~~t~~~~~a~~-~tmlsss~~fyvgg~~~~~~l~----gsrVsF~GClkkV~y~~d 405 (1591)
T KOG3514|consen 331 FDDDSWHTVIVERSLQMMTLIVDGRRTEIRQYAPE-LTMLSSSDFFYVGGSPNTADLP----GSRVSFMGCLKKVVYKND 405 (1591)
T ss_pred ccCCcceEEEEEeeeEEEEEEEccEEecccccccc-eeEeeccceEEecCCCCccccC----CCceeeeeeeeeeEeccC
Confidence 99999999999999999999999987766555555 6666666679999988743222 223349999999998654
Q ss_pred eeeec--ccccccCCcccCCCCCCCCCCCCCCCCCCCeecccCCCCCceeeecCCCCCCCCCccCccCCCCCCccCCCce
Q psy9228 546 ELDLI--NSAVDSANIMDCSDLESSPVCAPKPCQNYGICYPTDTSERGYNCSCLTGYSGDHCEKENNMCMKGDVCKNGGM 623 (834)
Q Consensus 546 ~~~~~--~~~~~~~~~~~C~~~~~~~~C~~~pC~ngg~C~~~~~~~~~~~C~C~~G~~G~~Ce~~~~~C~~~~pC~ngg~ 623 (834)
.+.+. .-+...... +...
T Consensus 406 ~~rl~L~~LAk~g~~~-----------------------~k~~------------------------------------- 425 (1591)
T KOG3514|consen 406 DTRLELSRLAKQGDSK-----------------------MKTE------------------------------------- 425 (1591)
T ss_pred ceeehhhHHhhcCCce-----------------------eEee-------------------------------------
Confidence 43332 221111000 0000
Q ss_pred eeecCCCcEEcCCCCCCCCCcccccccceeeEEcC-CceeeechhhhhhccCcceEEEEEEeCCCCeeEEecCCCCCCCC
Q psy9228 624 CKVTPDSYECLCSLGYAPPNCAKRVSIGSEVHFLG-EGYVELKKELIEERRNEETIAFDFVTDDKNALLLWNGQPSYKNG 702 (834)
Q Consensus 624 C~~~~~~~~C~C~~g~~G~~Ce~~~~~~~~~~F~g-~s~~~~~~~~~~~~~~~~~i~~~frT~~~~GlLl~~~~~~~~~~ 702 (834)
..-.|.| |... ....+.|.. .+|+.++.|.. ++.-.|+|.|||..++|||||+....+
T Consensus 426 ---G~l~y~C-----------~n~~-~~DpvtFtt~es~l~LP~Wnt---~~~gSiSf~FRTtepnGlil~~~g~~~--- 484 (1591)
T KOG3514|consen 426 ---GDLSYSC-----------ENVA-QLDPVTFTTPESYLTLPRWNT---KKSGSISFDFRTTEPNGLILFHGGPQA--- 484 (1591)
T ss_pred ---ceEEEec-----------CCCC-ccCceeeecccceeecccccc---CCcceeEEEEeecCCCceEEEccCccc---
Confidence 0011211 1111 124678876 89999998864 478899999999999999999987543
Q ss_pred CCcceEEEEEECCEEEEEEEcCCcEEEEEeCCceecCCCcEEEEEEEECCEEEEEEcCeeeecccCCCCccceecCCceE
Q psy9228 703 IGREFIAVAVVNGYLEYSYDLGDGVVTIKFSKKPVNDGIKHSVNVTRINKFGSLEVDSVIVGKGESPGSQDVINTRGNIY 782 (834)
Q Consensus 703 ~~~~~~~l~l~~G~l~~~~~~g~~~~~l~~s~~~~nDg~wH~V~i~r~~~~~~l~VD~~~~~~~~~~~~~~~l~~~~~ly 782 (834)
...||++++|.||+|.+.+++|+|...|+.+..++|||.||+|.+.|.++.+++.||+...-. ..++....|+++++||
T Consensus 485 ~~~d~~A~ELldghlyl~ldlGSG~iklras~rkv~DGeWhhv~l~R~gR~gsvsVd~~~~df-~tpG~s~iL~ld~~my 563 (1591)
T KOG3514|consen 485 NATDYFAIELLDGHLYLLLDLGSGVIKLRASSRKVNDGEWHHVDLQRDGRTGSVSVDAIKTDF-STPGDSEILDLDDPMY 563 (1591)
T ss_pred ccccEEEEEEeCCeEEEEEecCCceEEeeeecccccCCceEEEEeeccCccceEEEeeeecCc-cCCCcceeEeecCcee
Confidence 457999999999999999999999999987889999999999999999999999999976544 3567788999999999
Q ss_pred EcCcCCCCCCCC----CccCCCceEEEEEEEECCceecccCCCC--CCCccccc
Q psy9228 783 LGGTPNMDLMTG----GRYVHPMSGLMMNIHIQNKHISNIGSSA--SSGLHVMP 830 (834)
Q Consensus 783 iGG~p~~~~~~~----~~~~~~F~GCi~~v~in~~~~~~l~~~a--~~~~nv~~ 830 (834)
||-++.....+. .....+|+||||+|.|+|+ -.++.+.| ..+..|.|
T Consensus 564 lG~~~n~l~~P~~vWta~L~~GyvGCirdl~i~G~-s~di~q~ae~q~sagvkp 616 (1591)
T KOG3514|consen 564 LGEVPNNLVYPSEVWTAALRKGYVGCIRDLFIDGV-STDIRQEAEAQNSAGVKP 616 (1591)
T ss_pred eccCCCCccCcHHHHHHHHhccchheehhheecce-ehhhHHHhhhccccccCc
Confidence 997766532222 1458899999999999996 34555443 34455554
No 6
>KOG1219|consensus
Probab=100.00 E-value=8.6e-36 Score=342.68 Aligned_cols=301 Identities=30% Similarity=0.614 Sum_probs=253.8
Q ss_pred ceEeeCCCCCCCCCCCCCCCCccCCCCCCC-eeecCC--CCeEEeCCCCcccCcccccccccccCCCCcceEEcCCCCCc
Q psy9228 160 GYTCICPPGFSGDRCSVLGEPCYPGACGDG-SCQDVD--GAMKCLCPIGTAGKRCEQKIKILQPAFKHGSYLAYPTPKTM 236 (834)
Q Consensus 160 ~~~C~C~~g~~G~~Ce~~~~~C~~~~C~~g-~C~~~~--~~~~C~C~~g~~G~~Ce~~~~~~~~~F~g~~y~~~~~~~~~ 236 (834)
.-.|.|+.|| |+...+.|...||..+ .|+... ..|.|.||.|..| .|+-+ .+.++.|+||+.|..+...
T Consensus 3632 ~a~ClC~~G~----Cp~~~~~C~~~pcp~~~~Cvs~~~~~~~~cVcP~gr~g-~C~g~---~elS~tGnSYveyrlse~~ 3703 (4289)
T KOG1219|consen 3632 TAACLCNRGF----CPVETNQCAKSPCPAGNLCVSSVHNSTYTCVCPIGRFG-FCQGD---FELSSTGNSYVEYRLSENQ 3703 (4289)
T ss_pred cceeeecCCc----CCcccCccccCCCcccCcccccccccceeEeccCcccc-cCCCc---ceEeecCceeEEEEccccc
Confidence 4579999998 9999999999999976 799775 5599999999544 57765 5789999999999987654
Q ss_pred -ceEEEEEEEcccCCCceEEEeecccCCCCCCeEEEEEecCeEEEEEecCCCcceecCCCCCcccccceeccCCCeEEEE
Q psy9228 237 -RKFKVSLRLNPRDVRDGIILYSGQSDDGLGDFISLAIREKHMEFRFDTGSATPLYSNDAPAFNPVSTKEAPYGSKITLT 315 (834)
Q Consensus 237 -~~~~i~l~f~t~~~~~glll~~~~~~~~~~d~~~l~l~~G~v~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~v~l~ 315 (834)
..+++.|+.+|. +.+|++||... .+|..|.|.+|.+++.++.|+|.-
T Consensus 3704 n~~~kl~frLkT~-~sngIiM~tr~-----~d~~iLkLv~G~~~l~~~cgsG~G-------------------------- 3751 (4289)
T KOG1219|consen 3704 NTRMKLGFRLKTL-QSNGIIMYTRK-----TDLAILKLVGGSPQLLADCGSGPG-------------------------- 3751 (4289)
T ss_pred ccceEEEEEEEec-ccCcEEEEEcC-----CceEEEEecCCcEEEEEecCCCCC--------------------------
Confidence 458899999999 69999999873 389999999999999999888631
Q ss_pred ecCCCCCCceeeeeecCCCceeccccccccccccccccccCCCceEEEEeCCCCceeEeeEEeeeeccccccCCCccccc
Q psy9228 316 CNNDLEAPVEYTWSKRSNGHVLPFGAFSRENTLTLQEIKNSDAGMYVCKVSNKDMTVEIPSILLVTDSVPLFTQKPLSYL 395 (834)
Q Consensus 316 vd~~~~~~~~~~~~~~~~~~~~~~gg~~~~~~~~~~~~~~~~~~~~~C~~~~~~~~~~~~~~~~~~~~~~~f~~~~~s~l 395 (834)
T Consensus 3752 -------------------------------------------------------------------------------- 3751 (4289)
T KOG1219|consen 3752 -------------------------------------------------------------------------------- 3751 (4289)
T ss_pred --------------------------------------------------------------------------------
Confidence
Q ss_pred ccCCccccceeEEEEEEEeeCCCCeeEEEeccCCccccCCCCCeEEEEEECcEEEEEEecccEEEE-eeeeecCCCeEEE
Q psy9228 396 ALPTLTDAHLHFSIELSFKPTDYNGLIMYTGDSNMKSYKGKGDFVSFGLEDGYPVFRFDVGLVVLR-SKVTLVPHEWVVV 474 (834)
Q Consensus 396 ~~~~~~~~~~~~~i~~~frt~~~~GlLl~~~~~~~~~~~~~~d~~~l~L~~G~l~~~~~~G~~~i~-s~~~~~dg~wH~V 474 (834)
.+. ....++||+||.|
T Consensus 3752 ---------------------------------------------------------------ivg~q~~~VnDgqWHsi 3768 (4289)
T KOG1219|consen 3752 ---------------------------------------------------------------IVGSQKRTVNDGQWHSI 3768 (4289)
T ss_pred ---------------------------------------------------------------cccccceEeecCceeEE
Confidence 000 1246899999999
Q ss_pred EEEEECCeEEEEECCeeeeeeecCCCccccccCCCceeeccccCCCcccCcCcccccCceeEEeeeEEcCeeeeeccccc
Q psy9228 475 TIIKDFKEGKLSVGGEPLIVGSTPGEKLQVLNLRTPLYLGGYNIYHVTPSLSVEVTEGFHGCISTIDVLGSELDLINSAV 554 (834)
Q Consensus 475 ~v~~~~~~~~l~VD~~~~~~~~~~g~~~~~l~~~~~l~iGG~~~~~~~~~~~~~~~~gF~GCi~~v~ing~~~~~~~~~~ 554 (834)
.+.|.++..+|.+|+........|+. ...|+++..+|+|+.-.. ......+...||.||+..+.+||..+++.....
T Consensus 3769 alerrr~~irlsvDd~~~~~atvPg~-~~tln~d~hiy~Ga~vrl--r~~~~tqvs~Gf~GCldsiyLng~el~l~~k~~ 3845 (4289)
T KOG1219|consen 3769 ALERRRNHIRLSVDDDTYDSATVPGM-KSTLNLDTHIYLGALVRL--RHQRSTQVSYGFDGCLDSIYLNGMELPLTRKGK 3845 (4289)
T ss_pred EeeccCCceEEEEcccCceeeecccc-eeeccccceEEEeeEeee--ccCCCccccccccceeeeEEEccccccccCCCc
Confidence 99999999999999999888899988 888999999999997651 122334667899999999999999999987542
Q ss_pred ------ccCCc-ccCCCCCCCCCCCCCCCCCCCeecccCCCCCceeeecCCCCCCCCCccCccCCCCCCccCCCceeeec
Q psy9228 555 ------DSANI-MDCSDLESSPVCAPKPCQNYGICYPTDTSERGYNCSCLTGYSGDHCEKENNMCMKGDVCKNGGMCKVT 627 (834)
Q Consensus 555 ------~~~~~-~~C~~~~~~~~C~~~pC~ngg~C~~~~~~~~~~~C~C~~G~~G~~Ce~~~~~C~~~~pC~ngg~C~~~ 627 (834)
....+ .+|... .++|..+||||||+|..... ++|.|.|++-|+|++||++..+|.++ ||.+||+|+..
T Consensus 3846 s~a~~~el~~l~pgC~l~--~d~C~~npCqhgG~C~~~~~--ggy~CkCpsqysG~~CEi~~epC~sn-PC~~GgtCip~ 3920 (4289)
T KOG1219|consen 3846 SVAGLMELFGLQPGCSLL--TDPCNDNPCQHGGTCISQPK--GGYKCKCPSQYSGNHCEIDLEPCASN-PCLTGGTCIPF 3920 (4289)
T ss_pred hhhhhhhhhccccccccc--ccccccCcccCCCEecCCCC--CceEEeCcccccCcccccccccccCC-CCCCCCEEEec
Confidence 22233 356542 38999999999999998876 69999999999999999999999999 99999999999
Q ss_pred CCCcEEcCCCCCCCCCcccc-cccc
Q psy9228 628 PDSYECLCSLGYAPPNCAKR-VSIG 651 (834)
Q Consensus 628 ~~~~~C~C~~g~~G~~Ce~~-~~~~ 651 (834)
.++|.|.|+.||+|.+||.+ ++.|
T Consensus 3921 ~n~f~CnC~~gyTG~~Ce~~Gi~eC 3945 (4289)
T KOG1219|consen 3921 YNGFLCNCPNGYTGKRCEARGISEC 3945 (4289)
T ss_pred CCCeeEeCCCCccCceeeccccccc
Confidence 99999999999999999987 5543
No 7
>KOG4289|consensus
Probab=99.97 E-value=1.1e-30 Score=292.06 Aligned_cols=262 Identities=27% Similarity=0.590 Sum_probs=207.8
Q ss_pred CceeEEeeeEEcCeeeeeccc-----ccccCCcccCCCCCC---------CCCCCCCCCCCCCeecccCCCCCceeeecC
Q psy9228 532 GFHGCISTIDVLGSELDLINS-----AVDSANIMDCSDLES---------SPVCAPKPCQNYGICYPTDTSERGYNCSCL 597 (834)
Q Consensus 532 gF~GCi~~v~ing~~~~~~~~-----~~~~~~~~~C~~~~~---------~~~C~~~pC~ngg~C~~~~~~~~~~~C~C~ 597 (834)
.+.-|++-+++.+....+... .+...+...|.++.+ +|.|-++||.|+|+|....+ +|+|.|.
T Consensus 1189 nymkCvsvlrFdssapf~~s~s~lfRpi~pvnglrCrCPpGFTgd~CeTeiDlCYs~pC~nng~C~srEg---gYtCeCr 1265 (2531)
T KOG4289|consen 1189 NYMKCVSVLRFDSSAPFLASDSVLFRPIHPVNGLRCRCPPGFTGDYCETEIDLCYSGPCGNNGRCRSREG---GYTCECR 1265 (2531)
T ss_pred HHHhhhhheeecccCccccccceeeeeccccCceeEeCCCCCCcccccchhHhhhcCCCCCCCceEEecC---ceeEEec
Confidence 455566666666654433332 123334445666655 67888999999999999877 9999999
Q ss_pred CCCCCCCCccCcc--CCCCCCccCCCceeeecC-CCcEEcCCC-CCCCCCcccccccceeeEEcCCceeeechhhhhhcc
Q psy9228 598 TGYSGDHCEKENN--MCMKGDVCKNGGMCKVTP-DSYECLCSL-GYAPPNCAKRVSIGSEVHFLGEGYVELKKELIEERR 673 (834)
Q Consensus 598 ~G~~G~~Ce~~~~--~C~~~~pC~ngg~C~~~~-~~~~C~C~~-g~~G~~Ce~~~~~~~~~~F~g~s~~~~~~~~~~~~~ 673 (834)
+||+|++||.+.. -|.+. -|+|||+|++.. ++|.|.|+. .|++++||.. +.+|.+.||+.|.... .+
T Consensus 1266 pg~tGehCEvs~~agrCvpG-vC~nggtC~~~~nggf~c~Cp~ge~e~prC~v~-----trSFp~~sfv~frglr---qR 1336 (2531)
T KOG4289|consen 1266 PGFTGEHCEVSARAGRCVPG-VCKNGGTCVNLLNGGFCCHCPYGEFEDPRCEVT-----TRSFPPESFVTFRGLR---QR 1336 (2531)
T ss_pred CCccccceeeecccCccccc-eecCCCEEeecCCCceeccCCCcccCCCceEEE-----eeccCchheEEEeccc---cc
Confidence 9999999998754 69888 999999999875 889999999 5999999964 5689999999986433 24
Q ss_pred CcceEEEEEEeCCCCeeEEecCCCCCCCCCCcceEEEEEECCEEEEEEEcCCcEEEEEe-CCceecCCCcEEEEEEEECC
Q psy9228 674 NEETIAFDFVTDDKNALLLWNGQPSYKNGIGREFIAVAVVNGYLEYSYDLGDGVVTIKF-SKKPVNDGIKHSVNVTRINK 752 (834)
Q Consensus 674 ~~~~i~~~frT~~~~GlLl~~~~~~~~~~~~~~~~~l~l~~G~l~~~~~~g~~~~~l~~-s~~~~nDg~wH~V~i~r~~~ 752 (834)
..++++|.|.|-..+|||||.+.+. .||++|++++++|++.|..|....++.. -+..++||+||+|.+....+
T Consensus 1337 fh~TlslsfaT~~~nGlL~ynGnek------hDFvalevVd~qvqltfS~Ges~t~v~p~Vp~gvsDGqWHtV~l~YyNK 1410 (2531)
T KOG4289|consen 1337 FHFTLSLSFATIERNGLLLYNGNEK------HDFVALEVVDEQVQLTFSAGESTTTVSPDVPGGVSDGQWHTVQLEYYNK 1410 (2531)
T ss_pred eEEEEEEEEEEeeecceEEecCCcc------cceEeeeeeeeeEEEEEecccccceecCCCCCCcccCceeEEEEEEece
Confidence 6788999999999999999999653 6999999999999999999976666651 23469999999999999999
Q ss_pred EEEEEEcCeeee-------------cccCCCCccceecCCceEEcCcCCCCCCCCCccCCCceEEEEEEEECCcee
Q psy9228 753 FGSLEVDSVIVG-------------KGESPGSQDVINTRGNIYLGGTPNMDLMTGGRYVHPMSGLMMNIHIQNKHI 815 (834)
Q Consensus 753 ~~~l~VD~~~~~-------------~~~~~~~~~~l~~~~~lyiGG~p~~~~~~~~~~~~~F~GCi~~v~in~~~~ 815 (834)
.+.|.||+=... .++..+..+.|++.+++++||+|+.... ...-|+||||++.++++++
T Consensus 1411 ~av~svDdCdt~~al~fg~~gNCAa~g~q~~sKKsLDltgpLlLGGvPe~fpv----~~k~FvGCmrdLsvD~~~V 1482 (2531)
T KOG4289|consen 1411 VAVVSVDDCDTNVALRFGTIGNCAAQGTQTGSKKSLDLTGPLLLGGVPETFPV----IEKQFVGCMRDLSVDGRDV 1482 (2531)
T ss_pred EEEEEeccccccceeeecCccchHhhhhccCcceeeeccCceeecCCCCcchh----hHhHhhhhhhhcccccccc
Confidence 999999975431 1123345567999999999999965322 3567999999999999754
No 8
>PF00054 Laminin_G_1: Laminin G domain; InterPro: IPR012679 Laminins are large heterotrimeric glycoproteins involved in basement membrane function []. The laminin globular (G) domain can be found in one to several copies in various laminin family members, which includes a large number of extracellular proteins. The C terminus of laminin alpha chain contains a tandem repeat of five laminin G domains, which are critical for heparin-binding and cell attachment activity []. Laminin alpha4 is distributed in a variety of tissues including peripheral nerves, dorsal root ganglion, skeletal muscle and capillaries; in the neuromuscular junction, it is required for synaptic specialisation []. The structure of the laminin-G domain has been predicted to resemble that of pentraxin []. Laminin G domains can vary in their function, and a variety of binding functions has been ascribed to different LamG modules. For example, the laminin alpha1 and alpha2 chains each has five C-teminal laminin G domains, where only domains LG4 and LG5 contain binding sites for heparin, sulphatides and the cell surface receptor dystroglycan []. Laminin G-containing proteins appear to have a wide variety of roles in cell adhesion, signalling, migration, assembly and differentiation. This entry represents one subtype of laminin G domains, which is sometimes found in association with thrombospondin-type laminin G domains (IPR012680 from INTERPRO).; PDB: 1OKQ_A 1DYK_A 2C5D_A 1H30_A 1LHW_A 1KDK_A 1LHU_A 1KDM_A 1LHO_A 1D2S_A ....
Probab=99.92 E-value=2.9e-24 Score=197.47 Aligned_cols=127 Identities=35% Similarity=0.579 Sum_probs=107.3
Q ss_pred EEeCCCCeeEEecCCCCCCCCCCcceEEEEEECCEEEEEEEcCCcEEEEEeCCceecCCCcEEEEEEEECCEEEEEEcCe
Q psy9228 682 FVTDDKNALLLWNGQPSYKNGIGREFIAVAVVNGYLEYSYDLGDGVVTIKFSKKPVNDGIKHSVNVTRINKFGSLEVDSV 761 (834)
Q Consensus 682 frT~~~~GlLl~~~~~~~~~~~~~~~~~l~l~~G~l~~~~~~g~~~~~l~~s~~~~nDg~wH~V~i~r~~~~~~l~VD~~ 761 (834)
|||..++|+|||.+.... .||++|+|.+|+|+|+|++|+++..+. +..+++||+||+|++.|.++.++|+||+.
T Consensus 1 frT~~~~Gllly~g~~~~-----~dfial~L~~G~l~~~~~~G~~~~~~~-~~~~i~dg~wh~v~~~r~~~~~~L~Vd~~ 74 (131)
T PF00054_consen 1 FRTSEPNGLLLYLGSKDG-----KDFIALELRDGRLEFRYNLGSGPASLR-SPQKINDGKWHTVSVSRNGRNGSLSVDGE 74 (131)
T ss_dssp EEESSSSEEEEEEESSTT-----SSEEEEEEETTEEEEEEESSSEEEEEE-ESSETTSSSEEEEEEEEETTEEEEEETTS
T ss_pred CccCCCCceEEECCcCCC-----CCEEEEEEECCEEEEEEeCCCccceec-CCCccCCCcceEEEEEEcCcEEEEEECCc
Confidence 899999999999987643 599999999999999999999988888 66679999999999999999999999998
Q ss_pred eeecccCCCCccc-eecCCceEEcCcCCC-CCCCCCccCCCceEEEEEEEECCce
Q psy9228 762 IVGKGESPGSQDV-INTRGNIYLGGTPNM-DLMTGGRYVHPMSGLMMNIHIQNKH 814 (834)
Q Consensus 762 ~~~~~~~~~~~~~-l~~~~~lyiGG~p~~-~~~~~~~~~~~F~GCi~~v~in~~~ 814 (834)
......++..... ++...+|||||+|.. ..........+|.|||++|.||+++
T Consensus 75 ~~~~~~s~~~~~~~l~~~~~lyvGG~p~~~~~~~~~~~~~~f~GCi~~~~in~~~ 129 (131)
T PF00054_consen 75 EVVTGESPSGATQSLDVDGPLYVGGLPSSSSRPRPLPISPGFKGCIRNLSINGKP 129 (131)
T ss_dssp EEEEEEECSSSSSSCEECSEEEESSSSTTTGCGSSCSCCSB-EEEEEEEEETTEE
T ss_pred cceeeecCCccccccccccCEEEccCCchhhcccccccCCCeeEEEEEeEECCEE
Confidence 8855555544444 888889999999933 2334456788999999999999974
No 9
>smart00282 LamG Laminin G domain.
Probab=99.90 E-value=1.6e-22 Score=188.28 Aligned_cols=134 Identities=33% Similarity=0.538 Sum_probs=112.3
Q ss_pred cceEEEEEEeCCCCeeEEecCCCCCCCCCCcceEEEEEECCEEEEEEEcCCcEEEEEeCCceecCCCcEEEEEEEECCEE
Q psy9228 675 EETIAFDFVTDDKNALLLWNGQPSYKNGIGREFIAVAVVNGYLEYSYDLGDGVVTIKFSKKPVNDGIKHSVNVTRINKFG 754 (834)
Q Consensus 675 ~~~i~~~frT~~~~GlLl~~~~~~~~~~~~~~~~~l~l~~G~l~~~~~~g~~~~~l~~s~~~~nDg~wH~V~i~r~~~~~ 754 (834)
.++|+|+|||.+++|+|||+.... ..+|++|+|.+|+|++.++.+++...++....+++||+||+|.|.|.++.+
T Consensus 2 ~~~i~~~frt~~~~g~l~~~~~~~-----~~~~l~l~l~~g~l~~~~~~g~~~~~~~~~~~~~~dg~WH~v~i~~~~~~~ 76 (135)
T smart00282 2 RLSISFSFRTTSPNGLLLYAGSKN-----GGDYLALELRDGRLVLRYDLGSGPARLTSDPTPLNDGQWHRVAVERNGRRV 76 (135)
T ss_pred ceEEEEEEEeCCCCEEEEEeCCCC-----CCCEEEEEEECCEEEEEEECCCCCEEEEECCeEeCCCCEEEEEEEEeCCEE
Confidence 468999999999999999997632 368999999999999999999888777744489999999999999999999
Q ss_pred EEEEcCeeeecccCCCCccceecCCceEEcCcCCCCCCCCCccCCCceEEEEEEEECCc
Q psy9228 755 SLEVDSVIVGKGESPGSQDVINTRGNIYLGGTPNMDLMTGGRYVHPMSGLMMNIHIQNK 813 (834)
Q Consensus 755 ~l~VD~~~~~~~~~~~~~~~l~~~~~lyiGG~p~~~~~~~~~~~~~F~GCi~~v~in~~ 813 (834)
+|+||+........++....++..+.+||||+|+...........+|+|||++|+||+.
T Consensus 77 ~l~VD~~~~~~~~~~~~~~~l~~~~~l~iGG~p~~~~~~~~~~~~~F~GCi~~v~in~~ 135 (135)
T smart00282 77 TLSVDGENPVSGESPGGLTILNLDGPLYLGGLPEDLKLPPLLVTPGFRGCIRNLKVNGK 135 (135)
T ss_pred EEEECCCccccEECCCCceEEecCCCcEEccCCchhcccccccCCCCeeEeeEEEECCC
Confidence 99999976544444555567788899999999987443334567899999999999873
No 10
>cd00110 LamG Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.
Probab=99.90 E-value=1.6e-22 Score=192.57 Aligned_cols=150 Identities=35% Similarity=0.572 Sum_probs=121.9
Q ss_pred eEEcCCceeeechhhhhhccCcceEEEEEEeCCCCeeEEecCCCCCCCCCCcceEEEEEECCEEEEEEEcCCcEEEEEeC
Q psy9228 654 VHFLGEGYVELKKELIEERRNEETIAFDFVTDDKNALLLWNGQPSYKNGIGREFIAVAVVNGYLEYSYDLGDGVVTIKFS 733 (834)
Q Consensus 654 ~~F~g~s~~~~~~~~~~~~~~~~~i~~~frT~~~~GlLl~~~~~~~~~~~~~~~~~l~l~~G~l~~~~~~g~~~~~l~~s 733 (834)
+.|.|++|+.|+..... ....+|+|+|||.+++|+||+.+... ..+|++|+|.+|+|++.++.|.+...+. +
T Consensus 2 ~~F~g~~~i~~~~~~~~--~~~~~i~~~frt~~~~g~l~~~~~~~-----~~~~~~l~l~~g~l~~~~~~g~~~~~~~-~ 73 (151)
T cd00110 2 VSFSGSSYVRLPTLPAP--RTRLSISFSFRTTSPNGLLLYAGSQN-----GGDFLALELEDGRLVLRYDLGSGSLVLS-S 73 (151)
T ss_pred eEeCCCceEEecCCCCC--cceeEEEEEEEeCCCCeEEEEecCCC-----CCCEEEEEEECCEEEEEEcCCcccEEEE-c
Confidence 67999999999865432 57899999999999999999998752 3699999999999999999986666676 5
Q ss_pred CceecCCCcEEEEEEEECCEEEEEEcCeeeecccCCCCccceecCCceEEcCcCCCCCCCCCccCCCceEEEEEEEEC
Q psy9228 734 KKPVNDGIKHSVNVTRINKFGSLEVDSVIVGKGESPGSQDVINTRGNIYLGGTPNMDLMTGGRYVHPMSGLMMNIHIQ 811 (834)
Q Consensus 734 ~~~~nDg~wH~V~i~r~~~~~~l~VD~~~~~~~~~~~~~~~l~~~~~lyiGG~p~~~~~~~~~~~~~F~GCi~~v~in 811 (834)
..+++||+||+|.+.+.++.++|+||+........+.....++..+.+||||+|+...........+|+|||++|+||
T Consensus 74 ~~~v~dg~Wh~v~i~~~~~~~~l~VD~~~~~~~~~~~~~~~~~~~~~~~iGg~~~~~~~~~~~~~~~F~Gci~~v~in 151 (151)
T cd00110 74 KTPLNDGQWHSVSVERNGRSVTLSVDGERVVESGSPGGSALLNLDGPLYLGGLPEDLKSPGLPVSPGFVGCIRDLKVN 151 (151)
T ss_pred cCccCCCCEEEEEEEECCCEEEEEECCccEEeeeCCCCceeecCCCCeEEcCCCCchhcccccccCCCceEeeEeEeC
Confidence 558999999999999999999999999754433333332246777899999999864333344678999999999997
No 11
>PF00054 Laminin_G_1: Laminin G domain; InterPro: IPR012679 Laminins are large heterotrimeric glycoproteins involved in basement membrane function []. The laminin globular (G) domain can be found in one to several copies in various laminin family members, which includes a large number of extracellular proteins. The C terminus of laminin alpha chain contains a tandem repeat of five laminin G domains, which are critical for heparin-binding and cell attachment activity []. Laminin alpha4 is distributed in a variety of tissues including peripheral nerves, dorsal root ganglion, skeletal muscle and capillaries; in the neuromuscular junction, it is required for synaptic specialisation []. The structure of the laminin-G domain has been predicted to resemble that of pentraxin []. Laminin G domains can vary in their function, and a variety of binding functions has been ascribed to different LamG modules. For example, the laminin alpha1 and alpha2 chains each has five C-teminal laminin G domains, where only domains LG4 and LG5 contain binding sites for heparin, sulphatides and the cell surface receptor dystroglycan []. Laminin G-containing proteins appear to have a wide variety of roles in cell adhesion, signalling, migration, assembly and differentiation. This entry represents one subtype of laminin G domains, which is sometimes found in association with thrombospondin-type laminin G domains (IPR012680 from INTERPRO).; PDB: 1OKQ_A 1DYK_A 2C5D_A 1H30_A 1LHW_A 1KDK_A 1LHU_A 1KDM_A 1LHO_A 1D2S_A ....
Probab=99.86 E-value=1.6e-20 Score=172.68 Aligned_cols=129 Identities=33% Similarity=0.685 Sum_probs=106.7
Q ss_pred EeeCCCCeeEEEeccCCccccCCCCCeEEEEEECcEEEEEEecc--cEEEEeeeeecCCCeEEEEEEEECCeEEEEECCe
Q psy9228 413 FKPTDYNGLIMYTGDSNMKSYKGKGDFVSFGLEDGYPVFRFDVG--LVVLRSKVTLVPHEWVVVTIIKDFKEGKLSVGGE 490 (834)
Q Consensus 413 frt~~~~GlLl~~~~~~~~~~~~~~d~~~l~L~~G~l~~~~~~G--~~~i~s~~~~~dg~wH~V~v~~~~~~~~l~VD~~ 490 (834)
|||.+++|+|||.++. ...||++|+|.+|++++++++| ...+.++..++||+||+|++.|..+.+.|+||+.
T Consensus 1 frT~~~~Gllly~g~~------~~~dfial~L~~G~l~~~~~~G~~~~~~~~~~~i~dg~wh~v~~~r~~~~~~L~Vd~~ 74 (131)
T PF00054_consen 1 FRTSEPNGLLLYLGSK------DGKDFIALELRDGRLEFRYNLGSGPASLRSPQKINDGKWHTVSVSRNGRNGSLSVDGE 74 (131)
T ss_dssp EEESSSSEEEEEEESS------TTSSEEEEEEETTEEEEEEESSSEEEEEEESSETTSSSEEEEEEEEETTEEEEEETTS
T ss_pred CccCCCCceEEECCcC------CCCCEEEEEEECCEEEEEEeCCCccceecCCCccCCCcceEEEEEEcCcEEEEEECCc
Confidence 8999999999999874 3459999999999999999998 4667778889999999999999999999999998
Q ss_pred eeeeeecCCCccccccCCCceeeccccCCCcccCcCcccccCceeEEeeeEEcCeeee
Q psy9228 491 PLIVGSTPGEKLQVLNLRTPLYLGGYNIYHVTPSLSVEVTEGFHGCISTIDVLGSELD 548 (834)
Q Consensus 491 ~~~~~~~~g~~~~~l~~~~~l~iGG~~~~~~~~~~~~~~~~gF~GCi~~v~ing~~~~ 548 (834)
......++......++...++||||+|.. ...........+|.|||+++.+|++++|
T Consensus 75 ~~~~~~s~~~~~~~l~~~~~lyvGG~p~~-~~~~~~~~~~~~f~GCi~~~~in~~~ld 131 (131)
T PF00054_consen 75 EVVTGESPSGATQSLDVDGPLYVGGLPSS-SSRPRPLPISPGFKGCIRNLSINGKPLD 131 (131)
T ss_dssp EEEEEEECSSSSSSCEECSEEEESSSSTT-TGCGSSCSCCSB-EEEEEEEEETTEEC-
T ss_pred cceeeecCCccccccccccCEEEccCCch-hhcccccccCCCeeEEEEEeEECCEECc
Confidence 88666666551334888889999999932 2233445667899999999999998875
No 12
>PF02210 Laminin_G_2: Laminin G domain; InterPro: IPR012680 Laminins are large heterotrimeric glycoproteins involved in basement membrane function []. The laminin globular (G) domain can be found in one to several copies in various laminin family members, including a large number of extracellular proteins. The C terminus of the laminin alpha chain contains a tandem repeat of five laminin G domains, which are critical for heparin-binding and cell attachment activity []. Laminin alpha4 is distributed in a variety of tissues including peripheral nerves, dorsal root ganglion, skeletal muscle and capillaries; in the neuromuscular junction, it is required for synaptic specialisation []. The structure of the laminin-G domain has been predicted to resemble that of pentraxin []. Laminin G domains can vary in their function, and a variety of binding functions have been ascribed to different LamG modules. For example, the laminin alpha1 and alpha2 chains each have five C-teminal laminin G domains, where only domains LG4 and LG5 contain binding sites for heparin, sulphatides and the cell surface receptor dystroglycan []. Laminin G-containing proteins appear to have a wide variety of roles in cell adhesion, signalling, migration, assembly and differentiation. This entry represents one subtype of laminin G domains, which is sometimes found in association with thrombospondin-type laminin G domains (IPR012679 from INTERPRO).; PDB: 3POY_A 3QCW_B 3R05_B 3ASI_A 3MW4_B 3MW3_A 1QU0_D 1DYK_A 1OKQ_A 3SH4_A ....
Probab=99.85 E-value=3.3e-20 Score=171.21 Aligned_cols=127 Identities=28% Similarity=0.510 Sum_probs=106.1
Q ss_pred EEeCCCCeeEEecCCCCCCCCCCcceEEEEEECCEEEEEEEcCCcEEEEEeCCceecCCCcEEEEEEEECCEEEEEEcCe
Q psy9228 682 FVTDDKNALLLWNGQPSYKNGIGREFIAVAVVNGYLEYSYDLGDGVVTIKFSKKPVNDGIKHSVNVTRINKFGSLEVDSV 761 (834)
Q Consensus 682 frT~~~~GlLl~~~~~~~~~~~~~~~~~l~l~~G~l~~~~~~g~~~~~l~~s~~~~nDg~wH~V~i~r~~~~~~l~VD~~ 761 (834)
|||++++|+|||++.... .+|++|+|.||+|++.+++|+....+..++.+++||+||+|.+.|.++.++|.||+.
T Consensus 1 Frt~~~~g~Ll~~~~~~~-----~~~l~l~l~~g~l~~~~~~g~~~~~~~~~~~~~~dg~wh~v~i~~~~~~~~l~Vd~~ 75 (128)
T PF02210_consen 1 FRTRSPNGLLLYIGSEDN-----GDFLSLELVDGRLVVRYNLGGSEIVTTFSNSNLNDGQWHKVSISRDGNRVTLTVDGQ 75 (128)
T ss_dssp EEESSSSEEEEEEEESTT-----SEEEEEEEETTEEEEEEESSSSEEEEEECSSSSTSSSEEEEEEEEETTEEEEEETTS
T ss_pred CccCCCCEeEEEEcCCCC-----CEEEEEEEECCEEEEEEEccccceeeeccCccccccceeEEEEEEeeeeEEEEecCc
Confidence 899999999999987532 589999999999999999996666555588899999999999999999999999999
Q ss_pred eeecccCCCCcc-ceecCCceEEcCcCCCCCCCCCccCCCceEEEEEEEECCc
Q psy9228 762 IVGKGESPGSQD-VINTRGNIYLGGTPNMDLMTGGRYVHPMSGLMMNIHIQNK 813 (834)
Q Consensus 762 ~~~~~~~~~~~~-~l~~~~~lyiGG~p~~~~~~~~~~~~~F~GCi~~v~in~~ 813 (834)
............ .++....+||||.|............+|+|||++|+|||+
T Consensus 76 ~~~~~~~~~~~~~~~~~~~~l~iGg~~~~~~~~~~~~~~~f~Gci~~l~vng~ 128 (128)
T PF02210_consen 76 SVSSESLPSSSSDSLDPDGSLYIGGLPESNQPSGSVDTPGFVGCIRDLRVNGQ 128 (128)
T ss_dssp EEEEEESSSTTHHCBESEEEEEESSTTTTCTCTTSSTTSB-EEEEEEEEETTE
T ss_pred cceEEeccccceecccCCCCEEEecccCccccccccCCCCcEEEcCeEEECCC
Confidence 877665544433 6677778999999987554444448899999999999984
No 13
>KOG1219|consensus
Probab=99.84 E-value=4.6e-20 Score=214.69 Aligned_cols=210 Identities=24% Similarity=0.457 Sum_probs=176.5
Q ss_pred CceeeecCCCCCCCCCccCccCCCCCCccCCCceeeecC--CCcEEcCCCCCCCCCcccccccceeeEEcCCceeeechh
Q psy9228 590 RGYNCSCLTGYSGDHCEKENNMCMKGDVCKNGGMCKVTP--DSYECLCSLGYAPPNCAKRVSIGSEVHFLGEGYVELKKE 667 (834)
Q Consensus 590 ~~~~C~C~~G~~G~~Ce~~~~~C~~~~pC~ngg~C~~~~--~~~~C~C~~g~~G~~Ce~~~~~~~~~~F~g~s~~~~~~~ 667 (834)
..-.|.|..|+ |+.+.+.|... ||..+-.|+... ..|+|.||.|-.| .|... ..+++.|+||++|...
T Consensus 3631 r~a~ClC~~G~----Cp~~~~~C~~~-pcp~~~~Cvs~~~~~~~~cVcP~gr~g-~C~g~----~elS~tGnSYveyrls 3700 (4289)
T KOG1219|consen 3631 RTAACLCNRGF----CPVETNQCAKS-PCPAGNLCVSSVHNSTYTCVCPIGRFG-FCQGD----FELSSTGNSYVEYRLS 3700 (4289)
T ss_pred ccceeeecCCc----CCcccCccccC-CCcccCcccccccccceeEeccCcccc-cCCCc----ceEeecCceeEEEEcc
Confidence 46789999998 99999999999 999999998764 5799999998554 47665 4689999999999754
Q ss_pred hhhhccCcceEEEEEEeCCCCeeEEecCCCCCCCCCCcceEEEEEECCEEEEEEEcCCcEEEEEeCCceecCCCcEEEEE
Q psy9228 668 LIEERRNEETIAFDFVTDDKNALLLWNGQPSYKNGIGREFIAVAVVNGYLEYSYDLGDGVVTIKFSKKPVNDGIKHSVNV 747 (834)
Q Consensus 668 ~~~~~~~~~~i~~~frT~~~~GlLl~~~~~~~~~~~~~~~~~l~l~~G~l~~~~~~g~~~~~l~~s~~~~nDg~wH~V~i 747 (834)
.. ....+.+.|.+||..++|++||... .+|..|.|.+|.+.+.++.|+++-.+......+|||+||.|.+
T Consensus 3701 e~--~n~~~kl~frLkT~~sngIiM~tr~--------~d~~iLkLv~G~~~l~~~cgsG~Givg~q~~~VnDgqWHsial 3770 (4289)
T KOG1219|consen 3701 EN--QNTRMKLGFRLKTLQSNGIIMYTRK--------TDLAILKLVGGSPQLLADCGSGPGIVGSQKRTVNDGQWHSIAL 3770 (4289)
T ss_pred cc--cccceEEEEEEEecccCcEEEEEcC--------CceEEEEecCCcEEEEEecCCCCCcccccceEeecCceeEEEe
Confidence 33 1244899999999999999999974 5899999999999999999999866663347899999999999
Q ss_pred EEECCEEEEEEcCeeeecccCCCCccceecCCceEEcCcCCCCCCCCCccCCCceEEEEEEEECCceecccCC
Q psy9228 748 TRINKFGSLEVDSVIVGKGESPGSQDVINTRGNIYLGGTPNMDLMTGGRYVHPMSGLMMNIHIQNKHISNIGS 820 (834)
Q Consensus 748 ~r~~~~~~l~VD~~~~~~~~~~~~~~~l~~~~~lyiGG~p~~~~~~~~~~~~~F~GCi~~v~in~~~~~~l~~ 820 (834)
.|+++.++|.||.........|+..+.++++..||+||.-.....+......+|.|||+.+.+||..+ +|.+
T Consensus 3771 errr~~irlsvDd~~~~~atvPg~~~tln~d~hiy~Ga~vrlr~~~~tqvs~Gf~GCldsiyLng~el-~l~~ 3842 (4289)
T KOG1219|consen 3771 ERRRNHIRLSVDDDTYDSATVPGMKSTLNLDTHIYLGALVRLRHQRSTQVSYGFDGCLDSIYLNGMEL-PLTR 3842 (4289)
T ss_pred eccCCceEEEEcccCceeeecccceeeccccceEEEeeEeeeccCCCccccccccceeeeEEEccccc-cccC
Confidence 99999999999999988888999888999999999999765222233346889999999999999754 4544
No 14
>smart00282 LamG Laminin G domain.
Probab=99.78 E-value=6.6e-18 Score=157.12 Aligned_cols=131 Identities=39% Similarity=0.683 Sum_probs=108.3
Q ss_pred eEEEEEEEeeCCCCeeEEEeccCCccccCCCCCeEEEEEECcEEEEEEecc--cEEEEee-eeecCCCeEEEEEEEECCe
Q psy9228 406 HFSIELSFKPTDYNGLIMYTGDSNMKSYKGKGDFVSFGLEDGYPVFRFDVG--LVVLRSK-VTLVPHEWVVVTIIKDFKE 482 (834)
Q Consensus 406 ~~~i~~~frt~~~~GlLl~~~~~~~~~~~~~~d~~~l~L~~G~l~~~~~~G--~~~i~s~-~~~~dg~wH~V~v~~~~~~ 482 (834)
.++|+|+|||.+++|+|||..+. ...+|++|+|.+|++.+.++.| ...+... ..++||+||+|.+.+..+.
T Consensus 2 ~~~i~~~frt~~~~g~l~~~~~~------~~~~~l~l~l~~g~l~~~~~~g~~~~~~~~~~~~~~dg~WH~v~i~~~~~~ 75 (135)
T smart00282 2 RLSISFSFRTTSPNGLLLYAGSK------NGGDYLALELRDGRLVLRYDLGSGPARLTSDPTPLNDGQWHRVAVERNGRR 75 (135)
T ss_pred ceEEEEEEEeCCCCEEEEEeCCC------CCCCEEEEEEECCEEEEEEECCCCCEEEEECCeEeCCCCEEEEEEEEeCCE
Confidence 47899999999999999999752 3679999999999999999987 3555665 8999999999999999999
Q ss_pred EEEEECCeeeeeeecCCCccccccCCCceeeccccCCCcccCcCcccccCceeEEeeeEEcCe
Q psy9228 483 GKLSVGGEPLIVGSTPGEKLQVLNLRTPLYLGGYNIYHVTPSLSVEVTEGFHGCISTIDVLGS 545 (834)
Q Consensus 483 ~~l~VD~~~~~~~~~~g~~~~~l~~~~~l~iGG~~~~~~~~~~~~~~~~gF~GCi~~v~ing~ 545 (834)
+.|+||+........++. ...++....+||||+|.... ........+|.|||+++.+|+.
T Consensus 76 ~~l~VD~~~~~~~~~~~~-~~~l~~~~~l~iGG~p~~~~--~~~~~~~~~F~GCi~~v~in~~ 135 (135)
T smart00282 76 VTLSVDGENPVSGESPGG-LTILNLDGPLYLGGLPEDLK--LPPLLVTPGFRGCIRNLKVNGK 135 (135)
T ss_pred EEEEECCCccccEECCCC-ceEEecCCCcEEccCCchhc--ccccccCCCCeeEeeEEEECCC
Confidence 999999976555455554 56778889999999998532 1234456899999999999973
No 15
>cd00110 LamG Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.
Probab=99.76 E-value=2.8e-17 Score=156.44 Aligned_cols=142 Identities=40% Similarity=0.701 Sum_probs=115.2
Q ss_pred cccccCCccccceeEEEEEEEeeCCCCeeEEEeccCCccccCCCCCeEEEEEECcEEEEEEecc--cEEEEeeeeecCCC
Q psy9228 393 SYLALPTLTDAHLHFSIELSFKPTDYNGLIMYTGDSNMKSYKGKGDFVSFGLEDGYPVFRFDVG--LVVLRSKVTLVPHE 470 (834)
Q Consensus 393 s~l~~~~~~~~~~~~~i~~~frt~~~~GlLl~~~~~~~~~~~~~~d~~~l~L~~G~l~~~~~~G--~~~i~s~~~~~dg~ 470 (834)
+|+.++........++|+|+|||..++|+|||.+.. ...+|++|+|.+|++++.++.| ...+.+..+++||+
T Consensus 8 ~~i~~~~~~~~~~~~~i~~~frt~~~~g~l~~~~~~------~~~~~~~l~l~~g~l~~~~~~g~~~~~~~~~~~v~dg~ 81 (151)
T cd00110 8 SYVRLPTLPAPRTRLSISFSFRTTSPNGLLLYAGSQ------NGGDFLALELEDGRLVLRYDLGSGSLVLSSKTPLNDGQ 81 (151)
T ss_pred ceEEecCCCCCcceeEEEEEEEeCCCCeEEEEecCC------CCCCEEEEEEECCEEEEEEcCCcccEEEEccCccCCCC
Confidence 899998876557789999999999999999999763 2589999999999999999997 45666666899999
Q ss_pred eEEEEEEEECCeEEEEECCeeeeeeecCCCccccccCCCceeeccccCCCcccCcCcccccCceeEEeeeEEc
Q psy9228 471 WVVVTIIKDFKEGKLSVGGEPLIVGSTPGEKLQVLNLRTPLYLGGYNIYHVTPSLSVEVTEGFHGCISTIDVL 543 (834)
Q Consensus 471 wH~V~v~~~~~~~~l~VD~~~~~~~~~~g~~~~~l~~~~~l~iGG~~~~~~~~~~~~~~~~gF~GCi~~v~in 543 (834)
||+|.+.+..+.++|.||+........+.. ...+.....+||||.|..... .......+|.|||+++.+|
T Consensus 82 Wh~v~i~~~~~~~~l~VD~~~~~~~~~~~~-~~~~~~~~~~~iGg~~~~~~~--~~~~~~~~F~Gci~~v~in 151 (151)
T cd00110 82 WHSVSVERNGRSVTLSVDGERVVESGSPGG-SALLNLDGPLYLGGLPEDLKS--PGLPVSPGFVGCIRDLKVN 151 (151)
T ss_pred EEEEEEEECCCEEEEEECCccEEeeeCCCC-ceeecCCCCeEEcCCCCchhc--ccccccCCCceEeeEeEeC
Confidence 999999999999999999975444333332 224677889999999974221 2334568999999999986
No 16
>PF02210 Laminin_G_2: Laminin G domain; InterPro: IPR012680 Laminins are large heterotrimeric glycoproteins involved in basement membrane function []. The laminin globular (G) domain can be found in one to several copies in various laminin family members, including a large number of extracellular proteins. The C terminus of the laminin alpha chain contains a tandem repeat of five laminin G domains, which are critical for heparin-binding and cell attachment activity []. Laminin alpha4 is distributed in a variety of tissues including peripheral nerves, dorsal root ganglion, skeletal muscle and capillaries; in the neuromuscular junction, it is required for synaptic specialisation []. The structure of the laminin-G domain has been predicted to resemble that of pentraxin []. Laminin G domains can vary in their function, and a variety of binding functions have been ascribed to different LamG modules. For example, the laminin alpha1 and alpha2 chains each have five C-teminal laminin G domains, where only domains LG4 and LG5 contain binding sites for heparin, sulphatides and the cell surface receptor dystroglycan []. Laminin G-containing proteins appear to have a wide variety of roles in cell adhesion, signalling, migration, assembly and differentiation. This entry represents one subtype of laminin G domains, which is sometimes found in association with thrombospondin-type laminin G domains (IPR012679 from INTERPRO).; PDB: 3POY_A 3QCW_B 3R05_B 3ASI_A 3MW4_B 3MW3_A 1QU0_D 1DYK_A 1OKQ_A 3SH4_A ....
Probab=99.68 E-value=1e-15 Score=141.10 Aligned_cols=124 Identities=33% Similarity=0.557 Sum_probs=100.9
Q ss_pred EeeCCCCeeEEEeccCCccccCCCCCeEEEEEECcEEEEEEecc-c--EEEEeeeeecCCCeEEEEEEEECCeEEEEECC
Q psy9228 413 FKPTDYNGLIMYTGDSNMKSYKGKGDFVSFGLEDGYPVFRFDVG-L--VVLRSKVTLVPHEWVVVTIIKDFKEGKLSVGG 489 (834)
Q Consensus 413 frt~~~~GlLl~~~~~~~~~~~~~~d~~~l~L~~G~l~~~~~~G-~--~~i~s~~~~~dg~wH~V~v~~~~~~~~l~VD~ 489 (834)
|||.+++|+|||.+... ..+|+.|+|.+|++++.+++| . ........++||+||.|.+.+..+.++|.||+
T Consensus 1 Frt~~~~g~Ll~~~~~~------~~~~l~l~l~~g~l~~~~~~g~~~~~~~~~~~~~~dg~wh~v~i~~~~~~~~l~Vd~ 74 (128)
T PF02210_consen 1 FRTRSPNGLLLYIGSED------NGDFLSLELVDGRLVVRYNLGGSEIVTTFSNSNLNDGQWHKVSISRDGNRVTLTVDG 74 (128)
T ss_dssp EEESSSSEEEEEEEEST------TSEEEEEEEETTEEEEEEESSSSEEEEEECSSSSTSSSEEEEEEEEETTEEEEEETT
T ss_pred CccCCCCEeEEEEcCCC------CCEEEEEEEECCEEEEEEEccccceeeeccCccccccceeEEEEEEeeeeEEEEecC
Confidence 89999999999998742 268999999999999999998 3 44556788999999999999999999999999
Q ss_pred eeeeeeecCCCccc-cccCCCceeeccccCCCcccCcCcccccCceeEEeeeEEcCe
Q psy9228 490 EPLIVGSTPGEKLQ-VLNLRTPLYLGGYNIYHVTPSLSVEVTEGFHGCISTIDVLGS 545 (834)
Q Consensus 490 ~~~~~~~~~g~~~~-~l~~~~~l~iGG~~~~~~~~~~~~~~~~gF~GCi~~v~ing~ 545 (834)
........... .. .++....+||||.|....... .....+|.|||+++.+||+
T Consensus 75 ~~~~~~~~~~~-~~~~~~~~~~l~iGg~~~~~~~~~--~~~~~~f~Gci~~l~vng~ 128 (128)
T PF02210_consen 75 QSVSSESLPSS-SSDSLDPDGSLYIGGLPESNQPSG--SVDTPGFVGCIRDLRVNGQ 128 (128)
T ss_dssp SEEEEEESSST-THHCBESEEEEEESSTTTTCTCTT--SSTTSB-EEEEEEEEETTE
T ss_pred ccceEEecccc-ceecccCCCCEEEecccCcccccc--ccCCCCcEEEcCeEEECCC
Confidence 88877665554 22 667777899999998542211 1127899999999999985
No 17
>KOG3509|consensus
Probab=99.33 E-value=9.9e-11 Score=134.85 Aligned_cols=192 Identities=28% Similarity=0.432 Sum_probs=135.5
Q ss_pred CCCCeEEEEEECcEEEEEEecc--cEEEEeeeeecCCCeEEEEEEEECCeEEEEECC-eeeeeeecCCCccccccCCCce
Q psy9228 435 GKGDFVSFGLEDGYPVFRFDVG--LVVLRSKVTLVPHEWVVVTIIKDFKEGKLSVGG-EPLIVGSTPGEKLQVLNLRTPL 511 (834)
Q Consensus 435 ~~~d~~~l~L~~G~l~~~~~~G--~~~i~s~~~~~dg~wH~V~v~~~~~~~~l~VD~-~~~~~~~~~g~~~~~l~~~~~l 511 (834)
...+|+++.+.-|.+.++++.+ ...+......-+++|+.+.+.| .-.+.+++ .....+..++. ...+.....+
T Consensus 278 ~~~~f~~lt~~~g~~g~~~~~~~~~~~~~~~~~~~~~E~~~~~i~r---~s~~~~~g~~~~l~g~~~~~-~~~i~~ee~v 353 (964)
T KOG3509|consen 278 FKDGFRALTLDGGTDGVRYDCGLPQREDRLDVTSYIGEWRFGIIFR---GSGLSVSGHKGVLQGNSNIL-VSRITNEESV 353 (964)
T ss_pred cccceeeeccCCCCccccccccCcchhhhhccccccceeeeeEeee---cccccccCcceeeccccccc-ccceeecccc
Confidence 4678999999999888888776 4556677888999999999998 22333443 22233334443 3344555668
Q ss_pred eeccccCCCcccCcCcccccCceeEEeeeEEcCeeeeecccccccCCc-ccCCCCCCCCCCCCCCCCCCCeecccCCCCC
Q psy9228 512 YLGGYNIYHVTPSLSVEVTEGFHGCISTIDVLGSELDLINSAVDSANI-MDCSDLESSPVCAPKPCQNYGICYPTDTSER 590 (834)
Q Consensus 512 ~iGG~~~~~~~~~~~~~~~~gF~GCi~~v~ing~~~~~~~~~~~~~~~-~~C~~~~~~~~C~~~pC~ngg~C~~~~~~~~ 590 (834)
|+|++-+ ...+........||.|||+++.+++..++........+.. ..|. .+.|...||++.+.|.+...
T Consensus 354 ~lg~i~n-i~~l~~~~~~~eGf~gci~~~~~~~k~l~~~~~~~~~v~~~~~c~----g~~c~~~p~~~~g~c~p~~~--- 425 (964)
T KOG3509|consen 354 FLGGIIN-IETLQHNLPLPEGFAGCIRDLVMNLKDLRVTLQRASYVAAQGTCL----GDVCWRIPCQHDGPCLQTLE--- 425 (964)
T ss_pred cCCceee-eccccccCCCccCccceehhhhhhccccccccccccccccccccC----CCccccccCCCCcccccccc---
Confidence 8888444 2234555667789999999999999988876643322222 2444 48899999999999999987
Q ss_pred ceeeecCCCCCCCCCccCccCCCCCCccCCCceeeecCCCcEEcCCCC
Q psy9228 591 GYNCSCLTGYSGDHCEKENNMCMKGDVCKNGGMCKVTPDSYECLCSLG 638 (834)
Q Consensus 591 ~~~C~C~~G~~G~~Ce~~~~~C~~~~pC~ngg~C~~~~~~~~C~C~~g 638 (834)
...|.|++||+|..|+...+.|....+=.-.++|........+.|.++
T Consensus 426 ~~~c~c~~g~~G~~c~d~~~~~~~~~~g~y~~t~~~~~~~~~~~c~pg 473 (964)
T KOG3509|consen 426 GKQCLCPPGYTGDSCEDCMNGCDRSPNGSYLGTCVPIQGKRCEYCGPG 473 (964)
T ss_pred ccceeccccccCchhhccCccccccCCccccceEeccCCCcceeecCC
Confidence 999999999999999998888776522222456655555556666666
No 18
>smart00210 TSPN Thrombospondin N-terminal -like domains. Heparin-binding and cell adhesion domain of thrombospondin
Probab=99.03 E-value=1.2e-08 Score=99.67 Aligned_cols=146 Identities=14% Similarity=0.126 Sum_probs=94.0
Q ss_pred eeEEcCCceeeechhhh-h-hccCcceEEEEEEeC-CCCeeEEecCCCCCCCCCCcceEEEEEECCEEEEEEEc--C-Cc
Q psy9228 653 EVHFLGEGYVELKKELI-E-ERRNEETIAFDFVTD-DKNALLLWNGQPSYKNGIGREFIAVAVVNGYLEYSYDL--G-DG 726 (834)
Q Consensus 653 ~~~F~g~s~~~~~~~~~-~-~~~~~~~i~~~frT~-~~~GlLl~~~~~~~~~~~~~~~~~l~l~~G~l~~~~~~--g-~~ 726 (834)
++.|..+..+..+...+ + .....+.|.++||+. ...|.||...... +..++.|.|..++..+.+.. . +.
T Consensus 29 Ay~~~~~a~~~~~t~~~~p~~~~~~fsi~~~~r~~~~~~g~L~si~~~~-----~~~~l~v~l~g~~~~~~~~~~~~~g~ 103 (184)
T smart00210 29 AYRLGDPALVPQPTRDLFPSGLPEDFSLLTTFRQTPKSRGVLFAIYDAQ-----NVRQFGLEVDGRANTLLLRYQGVDGK 103 (184)
T ss_pred eEEecCCcccCcchHHhCcCCCCCCeEEEEEEEeCCCCCeEEEEEEcCC-----CcEEEEEEEeCCccEEEEEECCCCCc
Confidence 56666554444332211 1 123778999999997 6788888776532 35689999886664444442 2 22
Q ss_pred EEEEEeCCceecCCCcEEEEEEEECCEEEEEEcCeeeecccCCCCc-cceecCCceEEcCcCCCCCCCCCccCCCceEEE
Q psy9228 727 VVTIKFSKKPVNDGIKHSVNVTRINKFGSLEVDSVIVGKGESPGSQ-DVINTRGNIYLGGTPNMDLMTGGRYVHPMSGLM 805 (834)
Q Consensus 727 ~~~l~~s~~~~nDg~wH~V~i~r~~~~~~l~VD~~~~~~~~~~~~~-~~l~~~~~lyiGG~p~~~~~~~~~~~~~F~GCi 805 (834)
...+...+..+.||+||+|.+...+..++|+||...+......... ..++..+ +.++|.... ....|.|+|
T Consensus 104 ~~~~~f~~~~l~dg~WH~lal~V~~~~v~LyvDC~~~~~~~l~~~~~~~~~~~g-~~~~g~~~~-------~~~~f~G~l 175 (184)
T smart00210 104 QHTVSFRNLPLADGQWHKLALSVSGSSATLYVDCNEIDSRPLDRPGQPPIDTDG-IEVRGAQAA-------DRKPFQGDL 175 (184)
T ss_pred EEEEeecCCccccCCceEEEEEEeCCEEEEEECCccccceecCCcccccccccc-eEEEeeccC-------CCCcceEEe
Confidence 3334434578999999999999999999999999887654322211 1333333 455554322 135799999
Q ss_pred EEEEEC
Q psy9228 806 MNIHIQ 811 (834)
Q Consensus 806 ~~v~in 811 (834)
++|+|-
T Consensus 176 q~l~i~ 181 (184)
T smart00210 176 QQLKIV 181 (184)
T ss_pred EEEEEe
Confidence 999983
No 19
>PF13385 Laminin_G_3: Concanavalin A-like lectin/glucanases superfamily; PDB: 4DQA_A 1N1Y_A 1MZ6_A 1MZ5_A 1N1S_A 2A75_A 1WCS_A 1N1T_A 1N1V_A 2FHR_A ....
Probab=98.70 E-value=4.3e-07 Score=86.35 Aligned_cols=143 Identities=21% Similarity=0.364 Sum_probs=90.1
Q ss_pred eeEEcC-CceeeechhhhhhccCcceEEEEEEeCCCCe---eEEecCCCCCCCCCCcceEEEEEE-CCEEEEEEEcCCc-
Q psy9228 653 EVHFLG-EGYVELKKELIEERRNEETIAFDFVTDDKNA---LLLWNGQPSYKNGIGREFIAVAVV-NGYLEYSYDLGDG- 726 (834)
Q Consensus 653 ~~~F~g-~s~~~~~~~~~~~~~~~~~i~~~frT~~~~G---lLl~~~~~~~~~~~~~~~~~l~l~-~G~l~~~~~~g~~- 726 (834)
++.|.| ++|+.++...+. ...++|+++||...... .++.... ..+.+.|.+. +|++.+.+..+.+
T Consensus 2 a~~f~g~~~~i~~~~~~~~--~~~fTi~~w~~~~~~~~~~~~~~~~~~-------~~~~~~l~~~~~~~l~~~~~~~~~~ 72 (157)
T PF13385_consen 2 ALYFDGSNDYISIPNSDFP--SGSFTISFWVKPDSPSSSQSFVFMDSS-------GSGGFGLFINNNGRLRFYIGNGGGG 72 (157)
T ss_dssp EEEE-STT-EEEEESGGGG--GTEEEEEEEEEESS--SSEEEEEESSS-------SSEEEEEEEETTSEEEEEETTSEEE
T ss_pred EEEECCCCCEEEECCcCCC--CCCEEEEEEEEeCCCCCCceEEEEecC-------CCCEEEEEEECCCEEEEEEeCCCce
Confidence 456665 889999864433 57899999999876433 3333111 1246777776 6888887755432
Q ss_pred EEEEEeCCceecCCCcEEEEEEEECCEEEEEEcCeeeecccCCCCccceecCCceEEcCcCCCCCCCCCccCCCceEEEE
Q psy9228 727 VVTIKFSKKPVNDGIKHSVNVTRINKFGSLEVDSVIVGKGESPGSQDVINTRGNIYLGGTPNMDLMTGGRYVHPMSGLMM 806 (834)
Q Consensus 727 ~~~l~~s~~~~nDg~wH~V~i~r~~~~~~l~VD~~~~~~~~~~~~~~~l~~~~~lyiGG~p~~~~~~~~~~~~~F~GCi~ 806 (834)
...+. +...+.+++||+|.++..++.++|+|||..+........ ........++||+... ....|.|-|.
T Consensus 73 ~~~~~-~~~~~~~~~W~~l~~~~~~~~~~lyvnG~~~~~~~~~~~-~~~~~~~~~~iG~~~~--------~~~~~~g~i~ 142 (157)
T PF13385_consen 73 NYSFS-SDSNLPDNKWHHLALTYDGSTVTLYVNGELVGSSTIPSN-ISLNSNGPLFIGGSGG--------GSSPFNGYID 142 (157)
T ss_dssp SS-EE--BS---TT-EEEEEEEEETTEEEEEETTEEETTCTEESS-SSTTSCCEEEESS-ST--------T--B-EEEEE
T ss_pred eEEEe-cCcccCCCCEEEEEEEEECCeEEEEECCEEEEeEeccCC-cCCCCcceEEEeecCC--------CCCceEEEEE
Confidence 22343 567899999999999999999999999998766533222 1234557899998662 2567999999
Q ss_pred EEEECCce
Q psy9228 807 NIHIQNKH 814 (834)
Q Consensus 807 ~v~in~~~ 814 (834)
+|+|-+..
T Consensus 143 ~~~i~~~a 150 (157)
T PF13385_consen 143 DLRIYNRA 150 (157)
T ss_dssp EEEEESS-
T ss_pred EEEEECcc
Confidence 99997764
No 20
>KOG1214|consensus
Probab=98.60 E-value=7.5e-07 Score=98.43 Aligned_cols=68 Identities=29% Similarity=0.816 Sum_probs=55.4
Q ss_pred CccCCCCCCCCCCCCEeccCCCCCceEeeCCCCCCC--CCCCCCCCCccC--CCCC-CCeeecCCCCeEEeCCCCc
Q psy9228 136 NTCKSSKHNNCINNGLCQDAATRIGYTCICPPGFSG--DRCSVLGEPCYP--GACG-DGSCQDVDGAMKCLCPIGT 206 (834)
Q Consensus 136 ~~C~~~~~~pC~n~g~C~~~~~~~~~~C~C~~g~~G--~~Ce~~~~~C~~--~~C~-~g~C~~~~~~~~C~C~~g~ 206 (834)
+.|. .++.-|.-++.|.... ...|+|.|..||.| .+| .++++|+. ..|. +..|++.++.|+|.|..||
T Consensus 693 npCy-~gsh~cdt~a~C~pg~-~~~~tcecs~g~~gdgr~c-~d~~eca~~~~~CGp~s~Cin~pg~~rceC~~gy 765 (1289)
T KOG1214|consen 693 NPCY-DGSHMCDTTARCHPGT-GVDYTCECSSGYQGDGRNC-VDENECATGFHRCGPNSVCINLPGSYRCECRSGY 765 (1289)
T ss_pred ccce-ecCcccCCCccccCCC-CcceEEEEeeccCCCCCCC-CChhhhccCCCCCCCCceeecCCCceeEEEeecc
Confidence 3554 3478899999999875 46899999999975 689 67779998 4587 6899999999999998765
No 21
>KOG1217|consensus
Probab=98.56 E-value=9.9e-07 Score=101.20 Aligned_cols=77 Identities=30% Similarity=0.780 Sum_probs=62.4
Q ss_pred CCCCCCC-CCCCCeecccCCCCCceeeecCCCCCCCCC--ccCccCCC---CCCccCCCcee--eecCCCcEEcCCCCCC
Q psy9228 569 PVCAPKP-CQNYGICYPTDTSERGYNCSCLTGYSGDHC--EKENNMCM---KGDVCKNGGMC--KVTPDSYECLCSLGYA 640 (834)
Q Consensus 569 ~~C~~~p-C~ngg~C~~~~~~~~~~~C~C~~G~~G~~C--e~~~~~C~---~~~pC~ngg~C--~~~~~~~~C~C~~g~~ 640 (834)
+.|...+ |.|+++|.+..+ .|.|.|++||+|..| ..+...|. ...+|.++++| ......+.|.|..+|.
T Consensus 272 ~~C~~~~~c~~~~~C~~~~~---~~~C~C~~g~~g~~~~~~~~~~~C~~~~~~~~c~~g~~C~~~~~~~~~~C~c~~~~~ 348 (487)
T KOG1217|consen 272 DSCALIASCPNGGTCVNVPG---SYRCTCPPGFTGRLCTECVDVDECSPRNAGGPCANGGTCNTLGSFGGFRCACGPGFT 348 (487)
T ss_pred cccCCCCccCCCCeeecCCC---cceeeCCCCCCCCCCccccccccccccccCCcCCCCcccccCCCCCCCCcCCCCCCC
Confidence 5677764 999999999987 699999999999999 23346784 23389999999 3344688999999999
Q ss_pred CCCccccc
Q psy9228 641 PPNCAKRV 648 (834)
Q Consensus 641 G~~Ce~~~ 648 (834)
|..|+...
T Consensus 349 g~~C~~~~ 356 (487)
T KOG1217|consen 349 GRRCEDSN 356 (487)
T ss_pred CCccccCC
Confidence 99999664
No 22
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=98.52 E-value=4.6e-08 Score=64.41 Aligned_cols=32 Identities=44% Similarity=1.112 Sum_probs=28.6
Q ss_pred CCCCCCCCCCeecccCCCCCceeeecCCCCCCCC
Q psy9228 571 CAPKPCQNYGICYPTDTSERGYNCSCLTGYSGDH 604 (834)
Q Consensus 571 C~~~pC~ngg~C~~~~~~~~~~~C~C~~G~~G~~ 604 (834)
|.++||+|+|+|++... .+|+|.|++||+|++
T Consensus 1 C~~~~C~n~g~C~~~~~--~~y~C~C~~G~~G~~ 32 (32)
T PF00008_consen 1 CSSNPCQNGGTCIDLPG--GGYTCECPPGYTGKR 32 (32)
T ss_dssp TTTTSSTTTEEEEEEST--SEEEEEEBTTEESTT
T ss_pred CCCCcCCCCeEEEeCCC--CCEEeECCCCCccCC
Confidence 67899999999999992 399999999999974
No 23
>smart00159 PTX Pentraxin / C-reactive protein / pentaxin family. This family form a doscoid pentameric structure. Human serum amyloid P demonstrates calcium-mediated ligand-binding.
Probab=98.48 E-value=5.9e-06 Score=82.22 Aligned_cols=149 Identities=19% Similarity=0.171 Sum_probs=94.4
Q ss_pred eeEEcC---CceeeechhhhhhccCcceEEEEEEeCC--CCeeEEecCCCCCCCCCCcceEEEEE-ECCEEEEEEEcCCc
Q psy9228 653 EVHFLG---EGYVELKKELIEERRNEETIAFDFVTDD--KNALLLWNGQPSYKNGIGREFIAVAV-VNGYLEYSYDLGDG 726 (834)
Q Consensus 653 ~~~F~g---~s~~~~~~~~~~~~~~~~~i~~~frT~~--~~GlLl~~~~~~~~~~~~~~~~~l~l-~~G~l~~~~~~g~~ 726 (834)
.+.|.. ..|+++.+.. ......+++.+++|+.. .++.||..+.... ..++ .+.. .++.+.+.+ ++.
T Consensus 8 ~~~fp~~s~~~yv~l~~~~-~~~l~~fTvc~W~k~~~~~~~~~ifSy~~~~~----~ne~-~~~~~~~~~~~l~i--~g~ 79 (206)
T smart00159 8 VFVFPKESDTSYVKLKPEL-PKPLQAFTVCLWFYSDLSPRGYSLFSYATKGQ----DNEL-LLYKEKQGEYSLYI--GGK 79 (206)
T ss_pred EEECCCCCCCCeEEEccCC-CCChhHEEEEEEEEecCCCCceEEEEEeCCCC----CCeE-EEEEcCCcEEEEEE--cCe
Confidence 456764 4688886543 22357899999999965 5566764333211 1233 3333 366666655 333
Q ss_pred EEEEEeCCceecCCCcEEEEEEEEC--CEEEEEEcCeeeecccCCCCccceecCCceEEcCcCCCCCCCCCccCCCceEE
Q psy9228 727 VVTIKFSKKPVNDGIKHSVNVTRIN--KFGSLEVDSVIVGKGESPGSQDVINTRGNIYLGGTPNMDLMTGGRYVHPMSGL 804 (834)
Q Consensus 727 ~~~l~~s~~~~nDg~wH~V~i~r~~--~~~~l~VD~~~~~~~~~~~~~~~l~~~~~lyiGG~p~~~~~~~~~~~~~F~GC 804 (834)
. +. ....+.||+||||.+++.. ++++|+|||.... .........+...+.|.||...+.. -........|.|-
T Consensus 80 ~--~~-~~~~~~~g~W~hvc~tw~~~~g~~~lyvnG~~~~-~~~~~~g~~i~~~G~lvlGq~qd~~-gg~f~~~~~f~G~ 154 (206)
T smart00159 80 K--VQ-FPVPESDGKWHHICTTWESSSGIAELWVDGKPGV-RKGLAKGYTVKPGGSIILGQEQDSY-GGGFDATQSFVGE 154 (206)
T ss_pred E--EE-ecccccCCceEEEEEEEECCCCcEEEEECCEEcc-cccccCCcEECCCCEEEEEecccCC-CCCCCCCcceeEE
Confidence 2 23 3456899999999999984 6789999998862 2222222345667889999755431 1112345689999
Q ss_pred EEEEEECCce
Q psy9228 805 MMNIHIQNKH 814 (834)
Q Consensus 805 i~~v~in~~~ 814 (834)
|.+|+|.+..
T Consensus 155 i~~v~iw~~~ 164 (206)
T smart00159 155 IGDLNMWDSV 164 (206)
T ss_pred EeeeEEeccc
Confidence 9999998864
No 24
>KOG1214|consensus
Probab=98.47 E-value=5.1e-07 Score=99.72 Aligned_cols=64 Identities=33% Similarity=0.816 Sum_probs=50.8
Q ss_pred CCCCCCCEe--ccCCCCCceEeeCCCCCCCC--CCCCCCCCccCCCCC-CCeeecCCCCeEEeCCCCcccC
Q psy9228 144 NNCINNGLC--QDAATRIGYTCICPPGFSGD--RCSVLGEPCYPGACG-DGSCQDVDGAMKCLCPIGTAGK 209 (834)
Q Consensus 144 ~pC~n~g~C--~~~~~~~~~~C~C~~g~~G~--~Ce~~~~~C~~~~C~-~g~C~~~~~~~~C~C~~g~~G~ 209 (834)
.-|.-.|.| +.. ....|+|.|.|||.|. .| .+.|+|.++.|. +++|.+..++|.|.|.+||.|.
T Consensus 791 h~C~i~g~a~c~~h-Ggs~y~C~CLPGfsGDG~~c-~dvDeC~psrChp~A~CyntpgsfsC~C~pGy~GD 859 (1289)
T KOG1214|consen 791 HTCAIAGQARCVHH-GGSTYSCACLPGFSGDGHQC-TDVDECSPSRCHPAATCYNTPGSFSCRCQPGYYGD 859 (1289)
T ss_pred cccCcCCceEEEec-CCceEEEeecCCccCCcccc-ccccccCccccCCCceEecCCCcceeecccCccCC
Confidence 445444444 333 3467999999999975 56 355999999998 7999999999999999999874
No 25
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=98.47 E-value=6.6e-08 Score=63.66 Aligned_cols=30 Identities=53% Similarity=1.188 Sum_probs=26.4
Q ss_pred CCCCCCCCEeccCCCCCceEeeCCCCCCCCC
Q psy9228 143 HNNCINNGLCQDAATRIGYTCICPPGFSGDR 173 (834)
Q Consensus 143 ~~pC~n~g~C~~~~~~~~~~C~C~~g~~G~~ 173 (834)
++||+|+|+|++.. .++|+|.|++||+|++
T Consensus 3 ~~~C~n~g~C~~~~-~~~y~C~C~~G~~G~~ 32 (32)
T PF00008_consen 3 SNPCQNGGTCIDLP-GGGYTCECPPGYTGKR 32 (32)
T ss_dssp TTSSTTTEEEEEES-TSEEEEEEBTTEESTT
T ss_pred CCcCCCCeEEEeCC-CCCEEeECCCCCccCC
Confidence 88999999999876 5789999999999874
No 26
>cd00152 PTX Pentraxins are plasma proteins characterized by their pentameric discoid assembly and their Ca2+ dependent ligand binding, such as Serum amyloid P component (SAP) and C-reactive Protein (CRP), which are cytokine-inducible acute-phase proteins implicated in innate immunity. CRP binds to ligands containing phosphocholine, SAP binds to amyloid fibrils, DNA, chromatin, fibronectin, C4-binding proteins and glycosaminoglycans. "Long" pentraxins have N-terminal extensions to the common pentraxin domain; one group, the neuronal pentraxins, may be involved in synapse formation and remodeling, and they may also be able to form heteromultimers.
Probab=98.40 E-value=1.1e-05 Score=80.11 Aligned_cols=149 Identities=19% Similarity=0.167 Sum_probs=95.3
Q ss_pred eeEEcC---CceeeechhhhhhccCcceEEEEEEeCC--CCeeEEecCCCCCCCCCCcceEEEEEE-CCEEEEEEEcCCc
Q psy9228 653 EVHFLG---EGYVELKKELIEERRNEETIAFDFVTDD--KNALLLWNGQPSYKNGIGREFIAVAVV-NGYLEYSYDLGDG 726 (834)
Q Consensus 653 ~~~F~g---~s~~~~~~~~~~~~~~~~~i~~~frT~~--~~GlLl~~~~~~~~~~~~~~~~~l~l~-~G~l~~~~~~g~~ 726 (834)
.+.|.. ..|+++..... .....+++.+++|+.. ..+.||-.+... . .+.+.+... +|++.|.++ +.
T Consensus 8 ~l~f~~~s~~~yv~l~~~~~-~~l~~fTv~~Wv~~~~~~~~~~ifSy~~~~----~-~~~~~l~~~~~g~~~~~i~--~~ 79 (201)
T cd00152 8 VFVFPKESDTSYVKLKPELP-KPLQAFTLCLWVYTDLSTREYSLFSYATKG----Q-DNELLLYKEKDGGYSLYIG--GK 79 (201)
T ss_pred EEECCCCCCCceEEEccCCC-CChhhEEEEEEEEecCCCCCeEEEEEeCCC----C-CCeEEEEEcCCCeEEEEEc--CE
Confidence 466765 46888865432 2357899999999864 556677333221 1 233444443 678887663 33
Q ss_pred EEEEEeCCceecCCCcEEEEEEEEC--CEEEEEEcCeeeecccCCCCccceecCCceEEcCcCCCCCCCCCccCCCceEE
Q psy9228 727 VVTIKFSKKPVNDGIKHSVNVTRIN--KFGSLEVDSVIVGKGESPGSQDVINTRGNIYLGGTPNMDLMTGGRYVHPMSGL 804 (834)
Q Consensus 727 ~~~l~~s~~~~nDg~wH~V~i~r~~--~~~~l~VD~~~~~~~~~~~~~~~l~~~~~lyiGG~p~~~~~~~~~~~~~F~GC 804 (834)
...+. ....||+||||.+++.. ++++|+|||....... ......+...+.|.||...+... ........|.|-
T Consensus 80 ~~~~~---~~~~~g~W~hv~~t~d~~~g~~~lyvnG~~~~~~~-~~~~~~~~~~g~l~lG~~q~~~g-g~~~~~~~f~G~ 154 (201)
T cd00152 80 EVTFK---VPESDGAWHHICVTWESTSGIAELWVNGKLSVRKS-LKKGYTVGPGGSIILGQEQDSYG-GGFDATQSFVGE 154 (201)
T ss_pred EEEEe---ccCCCCCEEEEEEEEECCCCcEEEEECCEEecccc-ccCCCEECCCCeEEEeecccCCC-CCCCCCcceEEE
Confidence 33332 34599999999999984 6789999999875543 22223455667899996543310 011235689999
Q ss_pred EEEEEECCce
Q psy9228 805 MMNIHIQNKH 814 (834)
Q Consensus 805 i~~v~in~~~ 814 (834)
|.+|+|.+..
T Consensus 155 I~~v~iw~~~ 164 (201)
T cd00152 155 ISDVNMWDSV 164 (201)
T ss_pred EceeEEEccc
Confidence 9999998853
No 27
>KOG3509|consensus
Probab=98.32 E-value=6.1e-06 Score=96.14 Aligned_cols=137 Identities=25% Similarity=0.570 Sum_probs=107.1
Q ss_pred eeCCcccCCCcceeeeeCCCCCCCcCCCCccCCCCccceEEEEEEcCEEEEeecccccccCCC-CC--CccCCCCCCCCC
Q psy9228 71 ESQGAFQGLDLSELVYIGAVPDFGEIHPSAGFSNGFKGCVSRLKYNKTEFELFKMAVERVGVD-SC--NTCKSSKHNNCI 147 (834)
Q Consensus 71 ~~~~~~~g~~~~~~~~~gg~p~~~~~~~~~~~~~gf~gCi~~~~~~~~~~~~~~~~~~~~~~~-~c--~~C~~~~~~pC~ 147 (834)
.-..-+.+...++.+|+|+.-+..-+++......||.||+..+..+.+.+.........+... .| +.|. ..||+
T Consensus 339 ~~~~~~~~i~~ee~v~lg~i~ni~~l~~~~~~~eGf~gci~~~~~~~k~l~~~~~~~~~v~~~~~c~g~~c~---~~p~~ 415 (964)
T KOG3509|consen 339 NSNILVSRITNEESVFLGGIINIETLQHNLPLPEGFAGCIRDLVMNLKDLRVTLQRASYVAAQGTCLGDVCW---RIPCQ 415 (964)
T ss_pred cccccccceeecccccCCceeeeccccccCCCccCccceehhhhhhccccccccccccccccccccCCCccc---cccCC
Confidence 344455666667778899976666677777788999999999999999887654322222222 45 6898 89999
Q ss_pred CCCEeccCCCCCceEeeCCCCCCCCCCCCCCCCccCCCCC--CCeeecCCCCeEEeCCCCcccCcccc
Q psy9228 148 NNGLCQDAATRIGYTCICPPGFSGDRCSVLGEPCYPGACG--DGSCQDVDGAMKCLCPIGTAGKRCEQ 213 (834)
Q Consensus 148 n~g~C~~~~~~~~~~C~C~~g~~G~~Ce~~~~~C~~~~C~--~g~C~~~~~~~~C~C~~g~~G~~Ce~ 213 (834)
+.+.|.... ....|.|++||+|.-|+...+.|...+=. .++|........+.|.+| .|..+..
T Consensus 416 ~~g~c~p~~--~~~~c~c~~g~~G~~c~d~~~~~~~~~~g~y~~t~~~~~~~~~~~c~pg-~g~~~~~ 480 (964)
T KOG3509|consen 416 HDGPCLQTL--EGKQCLCPPGYTGDSCEDCMNGCDRSPNGSYLGTCVPIQGKRCEYCGPG-AGAPTAG 480 (964)
T ss_pred CCccccccc--cccceeccccccCchhhccCccccccCCccccceEeccCCCcceeecCC-CCCccch
Confidence 999999887 78999999999999999988888886654 489998888888999999 7777644
No 28
>smart00210 TSPN Thrombospondin N-terminal -like domains. Heparin-binding and cell adhesion domain of thrombospondin
Probab=98.32 E-value=1.8e-05 Score=77.35 Aligned_cols=124 Identities=11% Similarity=0.167 Sum_probs=83.6
Q ss_pred ceeEEEEEEEeeC-CCCeeEEEeccCCccccCCCCCeEEEEEECcEEEEEEec----cc-EEEE-eeeeecCCCeEEEEE
Q psy9228 404 HLHFSIELSFKPT-DYNGLIMYTGDSNMKSYKGKGDFVSFGLEDGYPVFRFDV----GL-VVLR-SKVTLVPHEWVVVTI 476 (834)
Q Consensus 404 ~~~~~i~~~frt~-~~~GlLl~~~~~~~~~~~~~~d~~~l~L~~G~l~~~~~~----G~-~~i~-s~~~~~dg~wH~V~v 476 (834)
..+|+|.++||+. ...|.||...+. +...++.|.+..++..+.+.. |. ..+. ....+.||+||+|.+
T Consensus 51 ~~~fsi~~~~r~~~~~~g~L~si~~~------~~~~~l~v~l~g~~~~~~~~~~~~~g~~~~~~f~~~~l~dg~WH~lal 124 (184)
T smart00210 51 PEDFSLLTTFRQTPKSRGVLFAIYDA------QNVRQFGLEVDGRANTLLLRYQGVDGKQHTVSFRNLPLADGQWHKLAL 124 (184)
T ss_pred CCCeEEEEEEEeCCCCCeEEEEEEcC------CCcEEEEEEEeCCccEEEEEECCCCCcEEEEeecCCccccCCceEEEE
Confidence 3569999999998 577888876542 356689999987775566543 32 2222 236799999999999
Q ss_pred EEECCeEEEEECCeeeeeeecCCCccccccCCCceeeccccCCCcccCcCcccccCceeEEeeeEEc
Q psy9228 477 IKDFKEGKLSVGGEPLIVGSTPGEKLQVLNLRTPLYLGGYNIYHVTPSLSVEVTEGFHGCISTIDVL 543 (834)
Q Consensus 477 ~~~~~~~~l~VD~~~~~~~~~~g~~~~~l~~~~~l~iGG~~~~~~~~~~~~~~~~gF~GCi~~v~in 543 (834)
...+..++|+||.........+......+...+ ++++|.... ....|.|+|+++.|-
T Consensus 125 ~V~~~~v~LyvDC~~~~~~~l~~~~~~~~~~~g-~~~~g~~~~---------~~~~f~G~lq~l~i~ 181 (184)
T smart00210 125 SVSGSSATLYVDCNEIDSRPLDRPGQPPIDTDG-IEVRGAQAA---------DRKPFQGDLQQLKIV 181 (184)
T ss_pred EEeCCEEEEEECCccccceecCCcccccccccc-eEEEeeccC---------CCCcceEEeEEEEEe
Confidence 999999999999987665443222011233333 344443321 124799999999884
No 29
>PF00354 Pentaxin: Pentaxin family; InterPro: IPR001759 Pentaxins (or pentraxins) [, ] are a family of proteins which show, under electron microscopy, a discoid arrangement of five noncovalently bound subunits. Proteins of the pentaxin family are involved in acute immunological responses []. Three of the principal members of the pentaxin family are serum proteins: namely, C-reactive protein (CRP) [], serum amyloid P component protein (SAP) [], and female protein (FP) []. CRP is expressed during acute phase response to tissue injury or inflammation in mammals. The protein resembles antibody and performs several functions associated with host defence: it promotes agglutination, bacterial capsular swelling and phagocytosis, and activates the classical complement pathway through its calcium-dependent binding to phosphocholine. CRPs have also been sequenced in an invertebrate, Limulus polyphemus (Atlantic horseshoe crab), where they are a normal constituent of the hemolymph. SAP is a vertebrate protein that is a precursor of amyloid component P. It is found in all types of amyloid deposits, in glomerular basement menbrane and in elastic fibres in blood vessels. SAP binds to various lipoprotein ligands in a calcium-dependent manner, and it has been suggested that, in mammals, this may have important implications in atherosclerosis and amyloidosis. FP is a SAP homologue found in Mesocricetus auratus (Golden hamster). The concentration of this plasma protein is altered by sex steroids and stimuli that elicit an acute phase response. Pentaxin proteins expressed in the nervous system are neural pentaxin I (NPI) and II (NPII) []. NPI and NPII are homologous and can exist within one species. It is suggested that both proteins mediate the uptake of synaptic macromolecules and play a role in synaptic plasticity. Apexin, a sperm acrosomal protein, is a homologue of NPII found in Cavia porcellus (Guinea pig) []. PTX3 (or TSG-14) protein is a cytokine-induced protein that is homologous to CRPs and SAPs, but its function is not yet known.; PDB: 2A3W_F 3KQR_C 3D5O_D 2A3X_G 1SAC_D 2W08_B 1GYK_B 1LGN_A 2A3Y_A 1B09_D ....
Probab=98.26 E-value=1.9e-05 Score=77.32 Aligned_cols=149 Identities=22% Similarity=0.249 Sum_probs=86.9
Q ss_pred eEEcC---CceeeechhhhhhccCcceEEEEEEeCCC--CeeEEecCCCCCCCCCCcceEEEEEECCEEEEEEEcCCcEE
Q psy9228 654 VHFLG---EGYVELKKELIEERRNEETIAFDFVTDDK--NALLLWNGQPSYKNGIGREFIAVAVVNGYLEYSYDLGDGVV 728 (834)
Q Consensus 654 ~~F~g---~s~~~~~~~~~~~~~~~~~i~~~frT~~~--~GlLl~~~~~~~~~~~~~~~~~l~l~~G~l~~~~~~g~~~~ 728 (834)
+.|.. ..|+++.+..... ...+++-|++||... .+.||..+... ...+++.+.-..+.+.+.+ ++...
T Consensus 3 ~~FP~~s~~~yv~l~~~~~~p-L~~fTvC~w~k~~~~~~~~tifSYat~~----~~nell~~~~~~~~~~l~i--~~~~~ 75 (195)
T PF00354_consen 3 FHFPTRSTTDYVRLKPSVPLP-LSAFTVCFWVKTDDSSNDGTIFSYATSS----QDNELLLFGSSSGSLRLYI--NGSSV 75 (195)
T ss_dssp EEE-S-BSSBEEEEEESS-S--BSEEEEEEEEEESGSGS-EEEEEEEETT----EEEEEEEEEETTTEEEEEE--TTEEE
T ss_pred EECCCCCCcceEEEecCCCCC-cccEEEEEEEEeccCCCceEEEEEccCC----CCccEEEEEeCCceEEEEE--CCeEe
Confidence 45655 4577776433222 478999999999765 88888444321 1133433322356766655 33333
Q ss_pred EEEeCCceecCCCcEEEEEEEEC--CEEEEEEcCeeeecccCCCCccceecCCceEEcCcCCCCCCCCCccCCCceEEEE
Q psy9228 729 TIKFSKKPVNDGIKHSVNVTRIN--KFGSLEVDSVIVGKGESPGSQDVINTRGNIYLGGTPNMDLMTGGRYVHPMSGLMM 806 (834)
Q Consensus 729 ~l~~s~~~~nDg~wH~V~i~r~~--~~~~l~VD~~~~~~~~~~~~~~~l~~~~~lyiGG~p~~~~~~~~~~~~~F~GCi~ 806 (834)
.+ ...+.||+||||-+++.. +.+.|+|||....... ......+...+.+.||--.+.. .-.......|.|=|.
T Consensus 76 ~~---~~~~~~~~Whh~C~tW~s~~G~~~ly~dG~~~~~~~-~~~g~~i~~gG~~vlGQeQd~~-gG~fd~~q~F~G~i~ 150 (195)
T PF00354_consen 76 SF---SGPIRDGQWHHICVTWDSSTGRWQLYVDGVRLSSTG-LATGHSIPGGGTLVLGQEQDSY-GGGFDESQAFVGEIS 150 (195)
T ss_dssp EE---EECS-TSS-EEEEEEEETTTTEEEEEETTEEEEEEE-SSTT--B-SSEEEEESS-BSBT-TBTCSGGGB--EEEE
T ss_pred Ee---ccccCCCCcEEEEEEEecCCcEEEEEECCEeccccc-ccCCceECCCCEEEECcccccc-CCCcCCccEeeEEEe
Confidence 33 346799999999999986 7899999999543322 2222345566788888655431 112234678999999
Q ss_pred EEEECCce
Q psy9228 807 NIHIQNKH 814 (834)
Q Consensus 807 ~v~in~~~ 814 (834)
+|.|.+..
T Consensus 151 ~~~iWd~v 158 (195)
T PF00354_consen 151 DFNIWDRV 158 (195)
T ss_dssp EEEEESS-
T ss_pred ceEEEeee
Confidence 99999864
No 30
>KOG1225|consensus
Probab=98.04 E-value=1.4e-05 Score=87.97 Aligned_cols=108 Identities=31% Similarity=0.691 Sum_probs=74.3
Q ss_pred ceeeeeCCcccCCCcceeeeeCCCCCCCcCCCCccCCCCccceEEEEEEcCEEEEeecccccccCCCCCCc--cCCCCCC
Q psy9228 67 PFSGESQGAFQGLDLSELVYIGAVPDFGEIHPSAGFSNGFKGCVSRLKYNKTEFELFKMAVERVGVDSCNT--CKSSKHN 144 (834)
Q Consensus 67 ~~~~~~~~~~~g~~~~~~~~~gg~p~~~~~~~~~~~~~gf~gCi~~~~~~~~~~~~~~~~~~~~~~~~c~~--C~~~~~~ 144 (834)
...|.|+.+|.|.+|.....-++. .....+.+|+ ||+.. .+...+|++ |. .
T Consensus 233 ~~ic~c~~~~~g~~c~~~~C~~~c------~~~g~c~~G~--CIC~~---------------Gf~G~dC~e~~Cp----~ 285 (525)
T KOG1225|consen 233 DGICECPEGYFGPLCSTIYCPGGC------TGRGQCVEGR--CICPP---------------GFTGDDCDELVCP----V 285 (525)
T ss_pred CceeecCCceeCCccccccCCCCC------cccceEeCCe--EeCCC---------------CCcCCCCCcccCC----c
Confidence 337999999999999854422211 0001122222 44332 333466764 75 4
Q ss_pred CCCCCCEeccCCCCCceEeeCCCCCCCCCCCCCCCCccCCCCC-CCeeecCCCCeEEeCCCCcccCccccc
Q psy9228 145 NCINNGLCQDAATRIGYTCICPPGFSGDRCSVLGEPCYPGACG-DGSCQDVDGAMKCLCPIGTAGKRCEQK 214 (834)
Q Consensus 145 pC~n~g~C~~~~~~~~~~C~C~~g~~G~~Ce~~~~~C~~~~C~-~g~C~~~~~~~~C~C~~g~~G~~Ce~~ 214 (834)
+|..++.|++. .|.|++||.|+.|+... |. .+|. +|.|+ .-+|.|.+||+|..|++.
T Consensus 286 ~cs~~g~~~~g------~CiC~~g~~G~dCs~~~--cp-adC~g~G~Ci----~G~C~C~~Gy~G~~C~~~ 343 (525)
T KOG1225|consen 286 DCSGGGVCVDG------ECICNPGYSGKDCSIRR--CP-ADCSGHGKCI----DGECLCDEGYTGELCIQR 343 (525)
T ss_pred ccCCCceecCC------EeecCCCcccccccccc--CC-ccCCCCCccc----CCceEeCCCCcCCccccc
Confidence 49999999863 79999999999997654 54 7898 59999 347999999999999875
No 31
>cd05725 Ig3_Robo Third immunoglobulin (Ig)-like domain in Robo (roundabout) receptors. Ig3_Robo: domain similar to the third immunoglobulin (Ig)-like domain in Robo (roundabout) receptors. Robo receptors play a role in the development of the central nervous system (CNS), and are receptors of Slit protein. Slit is a repellant secreted by the neural cells in the midline. Slit acts through Robo to prevent most neurons from crossing the midline from either side. Three mammalian Robo homologs (robo1, -2, and -3), and three mammalian Slit homologs (Slit-1,-2, -3), have been identified. Commissural axons, which cross the midline, express low levels of Robo; longitudinal axons, which avoid the midline, express high levels of Robo. robo1, -2, and -3 are expressed by commissural neurons in the vertebrate spinal cord and Slits 1, -2, -3 are expressed at the ventral midline. Robo-3 is a divergent member of the Robo family which instead of being a positive regulator of slit responsiveness, antagoni
Probab=98.04 E-value=7e-06 Score=65.94 Aligned_cols=44 Identities=32% Similarity=0.645 Sum_probs=37.6
Q ss_pred CcEEEEecCC-CCCcccc--CCCeEEEcccCCCCCeeEEEEEcCCCC
Q psy9228 2 AYIKWSRADG-LPLQRYA--EGNVLRITNARLQDSGKYKCEIQGHDS 45 (834)
Q Consensus 2 ~~~~w~~~~~-~~~~~~~--~~~~l~~~~~~~~d~g~y~c~~~~~~~ 45 (834)
++++|.|.++ +|..+.. .++.|+|.+|+.+|+|.|.|.+.|..+
T Consensus 13 p~v~W~k~~~~~~~~~~~~~~~~~L~I~~v~~~D~G~Y~C~a~N~~G 59 (69)
T cd05725 13 PTVLWRKEDGELPKGRAEILDDKSLKIRNVTAGDEGSYTCEAENMVG 59 (69)
T ss_pred CEEEEEECCccCCCCcEEEeeCCEEEECcCChhHCEEEEEEEEcCCC
Confidence 4799999877 8776544 578999999999999999999998765
No 32
>KOG1225|consensus
Probab=98.03 E-value=1.2e-05 Score=88.58 Aligned_cols=107 Identities=29% Similarity=0.653 Sum_probs=72.3
Q ss_pred ceeEEEcCCcceeeeeCCcccCCCcceeeeeCCCCCCCcCCCCccCCCCccceEEEEEEcCEEEEeecccccccCCCCCC
Q psy9228 57 ERMMFVDGIGPFSGESQGAFQGLDLSELVYIGAVPDFGEIHPSAGFSNGFKGCVSRLKYNKTEFELFKMAVERVGVDSCN 136 (834)
Q Consensus 57 ~g~~cvd~~~~~~~~~~~~~~g~~~~~~~~~gg~p~~~~~~~~~~~~~gf~gCi~~~~~~~~~~~~~~~~~~~~~~~~c~ 136 (834)
+.+.|+++. |.|++||+|.+|+.... |. .++....+.+| -||+.-.+-|+.+.+ .
T Consensus 258 ~~g~c~~G~----CIC~~Gf~G~dC~e~~C----p~--~cs~~g~~~~g--~CiC~~g~~G~dCs~-------------~ 312 (525)
T KOG1225|consen 258 GRGQCVEGR----CICPPGFTGDDCDELVC----PV--DCSGGGVCVDG--ECICNPGYSGKDCSI-------------R 312 (525)
T ss_pred ccceEeCCe----EeCCCCCcCCCCCcccC----Cc--ccCCCceecCC--EeecCCCcccccccc-------------c
Confidence 446789887 99999999999997441 11 11111111222 244433333333322 1
Q ss_pred ccCCCCCCCCCCCCEeccCCCCCceEeeCCCCCCCCCCCCCCCCccCCCCCC-CeeecCCCCeEEeCCCCcccCc
Q psy9228 137 TCKSSKHNNCINNGLCQDAATRIGYTCICPPGFSGDRCSVLGEPCYPGACGD-GSCQDVDGAMKCLCPIGTAGKR 210 (834)
Q Consensus 137 ~C~~~~~~pC~n~g~C~~~~~~~~~~C~C~~g~~G~~Ce~~~~~C~~~~C~~-g~C~~~~~~~~C~C~~g~~G~~ 210 (834)
.| +.+|.++|.|++. +|.|.+||+|..|+.. +|.+ |.|++. |.|..||.|+.
T Consensus 313 ~c----padC~g~G~Ci~G------~C~C~~Gy~G~~C~~~-------~C~~~g~cv~g-----C~C~~Gw~G~d 365 (525)
T KOG1225|consen 313 RC----PADCSGHGKCIDG------ECLCDEGYTGELCIQR-------ACSGGGQCVNG-----CKCKKGWRGPD 365 (525)
T ss_pred cC----CccCCCCCcccCC------ceEeCCCCcCCccccc-------ccCCCceeccC-----ceeccCccCCC
Confidence 24 5789999999953 6999999999999876 3776 678753 99999999998
No 33
>cd05852 Ig5_Contactin-1 Fifth Ig domain of contactin-1. Ig5_Contactin-1: fifth Ig domain of the neural cell adhesion molecule contactin-1. Contactins are comprised of six Ig domains followed by four fibronectin type III (FnIII) domains anchored to the membrane by glycosylphosphatidylinositol. Contactin-1 is differentially expressed in tumor tissues and may through a RhoA mechanism, facilitate invasion and metastasis of human lung adenocarcinoma.
Probab=97.96 E-value=1.1e-05 Score=65.51 Aligned_cols=44 Identities=23% Similarity=0.405 Sum_probs=35.9
Q ss_pred CcEEEEecCC-CCCcccc---CCCeEEEcccCCCCCeeEEEEEcCCCC
Q psy9228 2 AYIKWSRADG-LPLQRYA---EGNVLRITNARLQDSGKYKCEIQGHDS 45 (834)
Q Consensus 2 ~~~~w~~~~~-~~~~~~~---~~~~l~~~~~~~~d~g~y~c~~~~~~~ 45 (834)
++|+|.|.+. |....+. .++.|.|.+|+.+|+|.|.|.++|..+
T Consensus 16 p~v~W~k~~~~l~~~~r~~~~~~g~L~I~~v~~~D~G~Y~C~A~N~~G 63 (73)
T cd05852 16 PKFSWSKGTELLVNNSRISIWDDGSLEILNITKLDEGSYTCFAENNRG 63 (73)
T ss_pred CEEEEEeCCEecccCCCEEEcCCCEEEECcCChhHCEEEEEEEECCCC
Confidence 4899999666 5443333 578999999999999999999998875
No 34
>cd05876 Ig3_L1-CAM Third immunoglobulin (Ig)-like domain of the L1 cell adhesion molecule (CAM). Ig3_L1-CAM: third immunoglobulin (Ig)-like domain of the L1 cell adhesion molecule (CAM). L1 belongs to the L1 subfamily of cell adhesion molecules (CAMs) and is comprised of an extracellular region having six Ig-like domains, five fibronectin type III domains, a transmembrane region and an intracellular domain. L1 is primarily expressed in the nervous system and is involved in its development and function. L1 is associated with an X-linked recessive disorder, X-linked hydrocephalus, MASA syndrome, or spastic paraplegia type 1, that involves abnormalities of axonal growth. This group also contains the chicken neuron-glia cell adhesion molecule, Ng-CAM.
Probab=97.94 E-value=1.4e-05 Score=64.62 Aligned_cols=44 Identities=27% Similarity=0.611 Sum_probs=36.8
Q ss_pred CcEEEEecCC-CCCcccc---CCCeEEEcccCCCCCeeEEEEEcCCCC
Q psy9228 2 AYIKWSRADG-LPLQRYA---EGNVLRITNARLQDSGKYKCEIQGHDS 45 (834)
Q Consensus 2 ~~~~w~~~~~-~~~~~~~---~~~~l~~~~~~~~d~g~y~c~~~~~~~ 45 (834)
++|+|.|.++ ++..+.. .++.|+|++|+.+|+|.|.|.+.|..|
T Consensus 13 P~v~W~k~~~~~~~~~~~~~~~~~~L~I~~v~~~D~G~Y~C~a~N~~G 60 (71)
T cd05876 13 PEVHWDRIDGPLSPNRTKKLNNNKTLQLDNVLESDDGEYVCTAENSEG 60 (71)
T ss_pred CeEEEEECCcCCCCCceeEEcCCCEEEEcccCHHhCEEEEEEEEcCCC
Confidence 4899999887 6654332 578999999999999999999998876
No 35
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=97.91 E-value=1.3e-05 Score=55.94 Aligned_cols=36 Identities=33% Similarity=0.922 Sum_probs=29.4
Q ss_pred CCCCccC-CCCCC-CeeecCCCCeEEeCCCCcc-cCccc
Q psy9228 177 LGEPCYP-GACGD-GSCQDVDGAMKCLCPIGTA-GKRCE 212 (834)
Q Consensus 177 ~~~~C~~-~~C~~-g~C~~~~~~~~C~C~~g~~-G~~Ce 212 (834)
++++|.. .||.+ |+|++..++|.|.|+.||. |..|+
T Consensus 1 d~~~C~~~~~C~~~~~C~~~~g~~~C~C~~g~~~g~~C~ 39 (39)
T smart00179 1 DIDECASGNPCQNGGTCVNTVGSYRCECPPGYTDGRNCE 39 (39)
T ss_pred CcccCcCCCCcCCCCEeECCCCCeEeECCCCCccCCcCC
Confidence 3578887 78885 6899888889999999988 88875
No 36
>KOG1217|consensus
Probab=97.90 E-value=4.9e-05 Score=87.18 Aligned_cols=171 Identities=23% Similarity=0.462 Sum_probs=105.1
Q ss_pred CCeeEEEEEcCCCCccC---C-cEEEEc----CceeEEEcCCcceeeeeCCcccCCCcceeeeeCCCCCCCcCCCCccCC
Q psy9228 32 DSGKYKCEIQGHDSFRG---S-DYVKLN----VERMMFVDGIGPFSGESQGAFQGLDLSELVYIGAVPDFGEIHPSAGFS 103 (834)
Q Consensus 32 d~g~y~c~~~~~~~~~~---~-~~~~~~----~~g~~cvd~~~~~~~~~~~~~~g~~~~~~~~~gg~p~~~~~~~~~~~~ 103 (834)
..+.|.|.+..++.-.. . +.|... .++.+|++....|.|.|+++|.+..++.. -.++..... ......
T Consensus 148 ~~~~~~c~C~~g~~~~~~~~~~~~C~~~~~~c~~~~~C~~~~~~~~C~c~~~~~~~~~~~~-~~~~~c~~~---~~~~~~ 223 (487)
T KOG1217|consen 148 SVGPFRCSCTEGYEGEPCETDLDECIQYSSPCQNGGTCVNTGGSYLCSCPPGYTGSTCETT-GNGGTCVDS---VACSCP 223 (487)
T ss_pred CCCceeeeeCCCcccccccccccccccCCCCcCCCcccccCCCCeeEeCCCCccCCcCcCC-CCCceEecc---eeccCC
Confidence 77888888888864211 1 355422 68899999999999999999999988765 000000000 000001
Q ss_pred CCccc--eEE------------EEEEcCEEEEeecccccccCC-----CCCCccCCCCCC-CCCCCCEeccCCCCCceEe
Q psy9228 104 NGFKG--CVS------------RLKYNKTEFELFKMAVERVGV-----DSCNTCKSSKHN-NCINNGLCQDAATRIGYTC 163 (834)
Q Consensus 104 ~gf~g--Ci~------------~~~~~~~~~~~~~~~~~~~~~-----~~c~~C~~~~~~-pC~n~g~C~~~~~~~~~~C 163 (834)
.|+.+ |.. .....+..+.. .....+. ..-+.|. .. +|.|+++|.+.. +.|.|
T Consensus 224 ~g~~~~~c~~~~~~~~~~~~~c~~~~~~~~C~~---~~g~~~~~~~~~~~~~~C~---~~~~c~~~~~C~~~~--~~~~C 295 (487)
T KOG1217|consen 224 PGARGPECEVSIVECASGDGTCVNTVGSYTCRC---PEGYTGDACVTCVDVDSCA---LIASCPNGGTCVNVP--GSYRC 295 (487)
T ss_pred CCCCCCCcccccccccCCCCcccccCCceeeeC---CCCccccccceeeeccccC---CCCccCCCCeeecCC--Cccee
Confidence 11110 000 00000111111 0001111 1226787 55 399999999976 66999
Q ss_pred eCCCCCCCCCC--CCCCCCc----cCCCCCC-Ceee--cCCCCeEEeCCCCcccCccccc
Q psy9228 164 ICPPGFSGDRC--SVLGEPC----YPGACGD-GSCQ--DVDGAMKCLCPIGTAGKRCEQK 214 (834)
Q Consensus 164 ~C~~g~~G~~C--e~~~~~C----~~~~C~~-g~C~--~~~~~~~C~C~~g~~G~~Ce~~ 214 (834)
.|++||+|..| ..+.++| ...+|.+ ++|. +....+.|.|+.++.|..|+..
T Consensus 296 ~C~~g~~g~~~~~~~~~~~C~~~~~~~~c~~g~~C~~~~~~~~~~C~c~~~~~g~~C~~~ 355 (487)
T KOG1217|consen 296 TCPPGFTGRLCTECVDVDECSPRNAGGPCANGGTCNTLGSFGGFRCACGPGFTGRRCEDS 355 (487)
T ss_pred eCCCCCCCCCCccccccccccccccCCcCCCCcccccCCCCCCCCcCCCCCCCCCccccC
Confidence 99999999999 3355788 4678987 4893 3446889999999999999865
No 37
>KOG0994|consensus
Probab=97.89 E-value=0.00047 Score=79.53 Aligned_cols=67 Identities=30% Similarity=0.667 Sum_probs=44.5
Q ss_pred CeecccCCCCCceeee-cCCCCCCCCCccCccCCCCCCccCCCc--------eeeec--CCCcEEcCCCCCCCCCcccc
Q psy9228 580 GICYPTDTSERGYNCS-CLTGYSGDHCEKENNMCMKGDVCKNGG--------MCKVT--PDSYECLCSLGYAPPNCAKR 647 (834)
Q Consensus 580 g~C~~~~~~~~~~~C~-C~~G~~G~~Ce~~~~~C~~~~pC~ngg--------~C~~~--~~~~~C~C~~g~~G~~Ce~~ 647 (834)
|.|++-.++..++.|+ |..||.|.-=--.-..|.+= ||..+- +|.-. .....|.|.+||+|.+||.-
T Consensus 873 GaCi~CqD~T~G~~CdrCl~GyyGdP~lg~g~~CrPC-pCP~gp~Sg~~~A~sC~~d~~t~~ivC~C~~GY~G~RCe~C 950 (1758)
T KOG0994|consen 873 GACIDCQDSTTGHSCDRCLDGYYGDPRLGSGIGCRPC-PCPDGPASGRQHADSCYLDTRTQQIVCHCQEGYSGSRCEIC 950 (1758)
T ss_pred cccccccccccccchhhhhccccCCcccCCCCCCCCC-CCCCCCccchhccccccccccccceeeecccCccccchhhh
Confidence 6777776666788887 88999874221112234433 675442 35433 25678999999999999864
No 38
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=97.88 E-value=1.5e-05 Score=55.73 Aligned_cols=36 Identities=42% Similarity=0.936 Sum_probs=32.3
Q ss_pred ccCCCC-CCccCCCceeeecCCCcEEcCCCCCC-CCCcc
Q psy9228 609 NNMCMK-GDVCKNGGMCKVTPDSYECLCSLGYA-PPNCA 645 (834)
Q Consensus 609 ~~~C~~-~~pC~ngg~C~~~~~~~~C~C~~g~~-G~~Ce 645 (834)
+++|.. . ||.++|+|++..++|.|.|++||. |..|+
T Consensus 2 ~~~C~~~~-~C~~~~~C~~~~g~~~C~C~~g~~~g~~C~ 39 (39)
T smart00179 2 IDECASGN-PCQNGGTCVNTVGSYRCECPPGYTDGRNCE 39 (39)
T ss_pred cccCcCCC-CcCCCCEeECCCCCeEeECCCCCccCCcCC
Confidence 467876 5 999999999999999999999999 99885
No 39
>cd05745 Ig3_Peroxidasin Third immunoglobulin (Ig)-like domain of peroxidasin. Ig3_Peroxidasin: the third immunoglobulin (Ig)-like domain in peroxidasin. Peroxidasin has a peroxidase domain and interacting extracellular motifs containing four Ig-like domains. It has been suggested that peroxidasin is secreted and has functions related to the stabilization of the extracellular matrix. It may play a part in various other important processes such as removal and destruction of cells which have undergone programmed cell death, and protection of the organism against non-self.
Probab=97.87 E-value=1.7e-05 Score=64.60 Aligned_cols=44 Identities=27% Similarity=0.608 Sum_probs=36.5
Q ss_pred CcEEEEecCC-CCCcccc---CCCeEEEcccCCCCCeeEEEEEcCCCC
Q psy9228 2 AYIKWSRADG-LPLQRYA---EGNVLRITNARLQDSGKYKCEIQGHDS 45 (834)
Q Consensus 2 ~~~~w~~~~~-~~~~~~~---~~~~l~~~~~~~~d~g~y~c~~~~~~~ 45 (834)
++|+|.|.+. |+...+. .++.|+|.+|+.+|+|.|.|.+.|.-+
T Consensus 17 p~i~W~k~g~~~~~~~~~~~~~~~~L~I~~v~~~D~G~Y~C~A~N~~G 64 (74)
T cd05745 17 PVIAWTKGGSQLSVDRRHLVLSSGTLRISRVALHDQGQYECQAVNIVG 64 (74)
T ss_pred CEEEEEECCEECCCCCCeEEccCCeEEEeeCCHHhCEEEEEEEEeCCC
Confidence 4899999666 7665433 578999999999999999999998765
No 40
>cd05863 Ig2_VEGFR-3 Second immunoglobulin (Ig)-like domain of vascular endothelial growth factor receptor 3 (VEGFR-3). Ig2_VEGFR-3: Second immunoglobulin (Ig)-like domain of vascular endothelial growth factor receptor 3 (VEGFR-3). The VEGFRs have an extracellular component with seven Ig-like domains, a transmembrane segment, and an intracellular tyrosine kinase domain interrupted by a kinase-insert domain. VEGFRs bind VEGFs with high affinity at the Ig-like domains. VEGFR-3 (Flt-4) binds two members of the VEGF family (VEGF-C and -D) and is involved in tumor angiogenesis and growth.
Probab=97.85 E-value=1.7e-05 Score=62.76 Aligned_cols=42 Identities=12% Similarity=0.253 Sum_probs=34.7
Q ss_pred CcEEEEecCC-CCCccccCCCeEEEcccCCCCCeeEEEEEcCCCC
Q psy9228 2 AYIKWSRADG-LPLQRYAEGNVLRITNARLQDSGKYKCEIQGHDS 45 (834)
Q Consensus 2 ~~~~w~~~~~-~~~~~~~~~~~l~~~~~~~~d~g~y~c~~~~~~~ 45 (834)
+++.|.|++. |.. +..++.|.|.+|+.+|+|.|.|.++|..+
T Consensus 13 P~v~W~kdg~~l~~--~~~~~~L~i~~v~~~D~G~YtC~a~N~~g 55 (67)
T cd05863 13 PEFQWYKDGKLISG--KHSQHSLQIKDVTEASAGTYTLVLWNSAA 55 (67)
T ss_pred CEEEEEECCEECcc--cCCcCEEEECCCCHHHCEEEEEEEEECCc
Confidence 4799999554 753 33567999999999999999999998865
No 41
>PF13385 Laminin_G_3: Concanavalin A-like lectin/glucanases superfamily; PDB: 4DQA_A 1N1Y_A 1MZ6_A 1MZ5_A 1N1S_A 2A75_A 1WCS_A 1N1T_A 1N1V_A 2FHR_A ....
Probab=97.82 E-value=0.00044 Score=65.40 Aligned_cols=139 Identities=20% Similarity=0.310 Sum_probs=85.1
Q ss_pred cccccccCCccccceeEEEEEEEeeCCCCe--eEEEeccCCccccCCCCCeEEEEEE-CcEEEEEEeccc---EEEEeee
Q psy9228 391 PLSYLALPTLTDAHLHFSIELSFKPTDYNG--LIMYTGDSNMKSYKGKGDFVSFGLE-DGYPVFRFDVGL---VVLRSKV 464 (834)
Q Consensus 391 ~~s~l~~~~~~~~~~~~~i~~~frt~~~~G--lLl~~~~~~~~~~~~~~d~~~l~L~-~G~l~~~~~~G~---~~i~s~~ 464 (834)
..+|+.+|........++|+++||...... .++... ....+.+.+.+. ++++.+.+..+. ..+....
T Consensus 8 ~~~~i~~~~~~~~~~~fTi~~w~~~~~~~~~~~~~~~~-------~~~~~~~~l~~~~~~~l~~~~~~~~~~~~~~~~~~ 80 (157)
T PF13385_consen 8 SNDYISIPNSDFPSGSFTISFWVKPDSPSSSQSFVFMD-------SSGSGGFGLFINNNGRLRFYIGNGGGGNYSFSSDS 80 (157)
T ss_dssp TT-EEEEESGGGGGTEEEEEEEEEESS--SSEEEEEES-------SSSSEEEEEEEETTSEEEEEETTSEEESS-EE-BS
T ss_pred CCCEEEECCcCCCCCCEEEEEEEEeCCCCCCceEEEEe-------cCCCCEEEEEEECCCEEEEEEeCCCceeEEEecCc
Confidence 367777776444467899999999887432 233321 122346667675 477777765542 4556777
Q ss_pred eecCCCeEEEEEEEECCeEEEEECCeeeeeeecCCCccccccCCCceeeccccCCCcccCcCcccccCceeEEeeeEEcC
Q psy9228 465 TLVPHEWVVVTIIKDFKEGKLSVGGEPLIVGSTPGEKLQVLNLRTPLYLGGYNIYHVTPSLSVEVTEGFHGCISTIDVLG 544 (834)
Q Consensus 465 ~~~dg~wH~V~v~~~~~~~~l~VD~~~~~~~~~~g~~~~~l~~~~~l~iGG~~~~~~~~~~~~~~~~gF~GCi~~v~ing 544 (834)
.+.+++||+|.+......++|+||+........+.. ........++||+.... ...|.|-|.++++-.
T Consensus 81 ~~~~~~W~~l~~~~~~~~~~lyvnG~~~~~~~~~~~--~~~~~~~~~~iG~~~~~----------~~~~~g~i~~~~i~~ 148 (157)
T PF13385_consen 81 NLPDNKWHHLALTYDGSTVTLYVNGELVGSSTIPSN--ISLNSNGPLFIGGSGGG----------SSPFNGYIDDLRIYN 148 (157)
T ss_dssp ---TT-EEEEEEEEETTEEEEEETTEEETTCTEESS--SSTTSCCEEEESS-STT------------B-EEEEEEEEEES
T ss_pred ccCCCCEEEEEEEEECCeEEEEECCEEEEeEeccCC--cCCCCcceEEEeecCCC----------CCceEEEEEEEEEEC
Confidence 899999999999999999999999987655433222 12345668899986531 467999999999966
Q ss_pred eeee
Q psy9228 545 SELD 548 (834)
Q Consensus 545 ~~~~ 548 (834)
+.+.
T Consensus 149 ~aLt 152 (157)
T PF13385_consen 149 RALT 152 (157)
T ss_dssp S---
T ss_pred ccCC
Confidence 5543
No 42
>cd05851 Ig3_Contactin-1 Third Ig domain of contactin-1. Ig3_Contactin-1: Third Ig domain of the neural cell adhesion molecule contactin-1. Contactins are comprised of six Ig domains followed by four fibronectin type III (FnIII) domains anchored to the membrane by glycosylphosphatidylinositol. Contactin-1 is differentially expressed in tumor tissues and may through a RhoA mechanism, facilitate invasion and metastasis of human lung adenocarcinoma.
Probab=97.80 E-value=3.2e-05 Score=65.44 Aligned_cols=44 Identities=30% Similarity=0.607 Sum_probs=37.4
Q ss_pred CcEEEEecCC-CCCcccc--CCCeEEEcccCCCCCeeEEEEEcCCCC
Q psy9228 2 AYIKWSRADG-LPLQRYA--EGNVLRITNARLQDSGKYKCEIQGHDS 45 (834)
Q Consensus 2 ~~~~w~~~~~-~~~~~~~--~~~~l~~~~~~~~d~g~y~c~~~~~~~ 45 (834)
++|+|.|.+. ++...+. .++.|.|.+|+.+|+|.|.|.+.|.-|
T Consensus 31 P~v~W~k~~~~~~~~~~~~~~~~~L~I~~v~~~D~G~Y~C~A~N~~G 77 (88)
T cd05851 31 PVIRWRKILEPMPATAEISMSGAVLKIFNIQPEDEGTYECEAENIKG 77 (88)
T ss_pred CEEEEEECCcCCCCCCEEecCCCEEEECcCChhhCEEEEEEEEcCCC
Confidence 4799999777 8775443 577999999999999999999998876
No 43
>cd05864 Ig2_VEGFR-2 Second immunoglobulin (Ig)-like domain of vascular endothelial growth factor receptor 2 (VEGFR-2). Ig2_VEGF-2: Second immunoglobulin (Ig)-like domain of vascular endothelial growth factor receptor 2 (VEGFR-2). The VEGFRs have an extracellular component with seven Ig-like domains, a transmembrane segment, and an intracellular tyrosine kinase domain interrupted by a kinase-insert domain. VEGFRs bind VEGFs with high affinity at the Ig-like domains. VEGFR-2 (KDR/Flk-1) is a major mediator of the mitogenic, angiogenic and microvascular permeability-enhancing effects of VEGF-A; VEGF-A is important to the growth and maintenance of vascular endothelial cells and to the development of new blood- and lymphatic-vessels in physiological and pathological states. VEGF-A also interacts with VEGFR-1, which it binds more strongly than VEGFR-2. VEGFR-2 and -1 may mediate a chemotactic and a survival signal in hematopoietic stem cells or leukemia cells.
Probab=97.79 E-value=2.3e-05 Score=62.85 Aligned_cols=44 Identities=20% Similarity=0.357 Sum_probs=36.5
Q ss_pred CcEEEEecCC-CCCcccc-CCCeEEEcccCCCCCeeEEEEEcCCCC
Q psy9228 2 AYIKWSRADG-LPLQRYA-EGNVLRITNARLQDSGKYKCEIQGHDS 45 (834)
Q Consensus 2 ~~~~w~~~~~-~~~~~~~-~~~~l~~~~~~~~d~g~y~c~~~~~~~ 45 (834)
++++|.|.+. |....+. .+..|+|.+++.+|+|.|.|.++|..|
T Consensus 13 P~v~W~k~g~~l~~~~~~~~~~~L~I~~v~~~D~G~YtC~a~N~~G 58 (70)
T cd05864 13 PEVKWYKNGQLIVLNHTFKRGVHLTIYEVTEKDAGNYTVVLTNPIT 58 (70)
T ss_pred CEEEEEECCEECCCCCEEccCCEEEECcCCHHHCEEEEEEEEECCC
Confidence 4799999666 7665444 466899999999999999999998875
No 44
>cd07702 Ig2_VEGFR-1 Second immunoglobulin (Ig)-like domain of vascular endothelial growth factor receptor 1 (VEGFR-1). Ig2_VEGFR-1: Second immunoglobulin (Ig)-like domain of vascular endothelial growth factor receptor 1 (VEGFR-1). VEGFRs have an extracellular component with seven Ig-like domains, a transmembrane segment, and an intracellular tyrosine kinase domain interrupted by a kinase-insert domain. VEGFRs bind VEGFs with high affinity at the Ig-like domains. VEGFR-1 binds VEGF-A strongly; VEGF-A is important to the growth and maintenance of vascular endothelial cells and to the development of new blood- and lymphatic-vessels in physiological and pathological states. VEGFR-1 may play an inhibitory rolet in the function of VEGFR-2 by binding VEGF-A and interfering with its interaction with VEGFR-2. VEGFR-1 has a signaling role in mediating monocyte chemotaxis and may mediate a chemotactic and a survival signal in hematopoietic stem cells or leukemia cells.
Probab=97.79 E-value=2.4e-05 Score=62.93 Aligned_cols=42 Identities=17% Similarity=0.230 Sum_probs=34.4
Q ss_pred CcEEEEecCC-CCCccc---cCCCeEEEcccCCCCCeeEEEEEcCC
Q psy9228 2 AYIKWSRADG-LPLQRY---AEGNVLRITNARLQDSGKYKCEIQGH 43 (834)
Q Consensus 2 ~~~~w~~~~~-~~~~~~---~~~~~l~~~~~~~~d~g~y~c~~~~~ 43 (834)
+++.|.|++. |+.... ..+..|.|.+|+.+|+|.|.|.+.+.
T Consensus 13 P~v~W~kdg~~l~~~~~~~~~~~~~L~I~~v~~~D~G~YtC~a~~~ 58 (72)
T cd07702 13 PEVIWLKDGLPAAEKCSRYHVDGYSLVIKDVTEEDAGIYTILLGIK 58 (72)
T ss_pred CeEEEEECCEECCCCCcEEEeCCCEEEECcCCHHHCEEEEEEEEcc
Confidence 5899999766 765422 25679999999999999999999876
No 45
>cd05746 Ig4_Peroxidasin Fourth immunoglobulin (Ig)-like domain of peroxidasin. Ig4_Peroxidasin: the fourth immunoglobulin (Ig)-like domain in peroxidasin. Peroxidasin has a peroxidase domain and interacting extracellular motifs containing four Ig-like domains. It has been suggested that peroxidasin is secreted, and has functions related to the stabilization of the extracellular matrix. It may play a part in various other important processes such as removal and destruction of cells, which have undergone programmed cell death, and protection of the organism against non-self.
Probab=97.78 E-value=3.8e-05 Score=61.59 Aligned_cols=44 Identities=18% Similarity=0.373 Sum_probs=35.3
Q ss_pred CcEEEEecCC-CCCcccc---CCCeEEEcccCCCCCeeEEEEEcCCCC
Q psy9228 2 AYIKWSRADG-LPLQRYA---EGNVLRITNARLQDSGKYKCEIQGHDS 45 (834)
Q Consensus 2 ~~~~w~~~~~-~~~~~~~---~~~~l~~~~~~~~d~g~y~c~~~~~~~ 45 (834)
++|+|.|.+. |....+. .++.|.|.+|+.+|+|.|.|.+.|..|
T Consensus 13 p~i~W~k~g~~~~~~~~~~~~~~~~L~I~~~~~~D~G~Y~C~a~N~~G 60 (69)
T cd05746 13 PTITWNKDGVQVTESGKFHISPEGYLAIRDVGVADQGRYECVARNTIG 60 (69)
T ss_pred CEEEEEECCEECCCCCCEEECCCCEEEECcCChhhCEEEEEEEECCCC
Confidence 4799999555 6543222 478999999999999999999998876
No 46
>cd04976 Ig2_VEGFR Second immunoglobulin (Ig)-like domain of vascular endothelial growth factor receptor (VEGFR). Ig2_VEGFR: Second immunoglobulin (Ig)-like domain of vascular endothelial growth factor receptor (VEGFR). The VEGFRs have an extracellular component with seven Ig-like domains, a transmembrane segment, and an intracellular tyrosine kinase domain interrupted by a kinase-insert domain. The VEGFR family consists of three members, VEGFR-1 (Flt-1), VEGFR-2 (KDR/Flk-1) and VEGFR-3 (Flt-4). VEGFRs bind VEGFs with high affinity at the Ig-like domains. VEGF-A is important to the growth and maintenance of vascular endothelial cells and to the development of new blood- and lymphatic-vessels in physiological and pathological states. VEGFR-2 is a major mediator of the mitogenic, angiogenic and microvascular permeability-enhancing effects of VEGF-A. VEGFR-1 may play an inhibitory part in these processes by binding VEGF and interfering with its interaction with VEGFR-2. VEGFR-1 has a signa
Probab=97.75 E-value=3.5e-05 Score=62.19 Aligned_cols=44 Identities=18% Similarity=0.446 Sum_probs=35.7
Q ss_pred CcEEEEecCC-CCCcccc--CCCeEEEcccCCCCCeeEEEEEcCCCC
Q psy9228 2 AYIKWSRADG-LPLQRYA--EGNVLRITNARLQDSGKYKCEIQGHDS 45 (834)
Q Consensus 2 ~~~~w~~~~~-~~~~~~~--~~~~l~~~~~~~~d~g~y~c~~~~~~~ 45 (834)
++++|.|++. |....+. .+..|+|.+|+.+|+|.|.|.+.|..+
T Consensus 13 p~v~W~k~g~~l~~~~~~~~~~~~L~I~~v~~~D~G~YtC~a~N~~g 59 (71)
T cd04976 13 PEIQWYKNGKLISEKNRTKKSGHSLTIKDVTEEDAGNYTVVLTNKQA 59 (71)
T ss_pred CEEEEEECCEECCCCCEEEcCCCEEEECcCCHHHCEEEEEEEEcCCc
Confidence 5799999555 6654333 578999999999999999999998765
No 47
>cd05731 Ig3_L1-CAM_like Third immunoglobulin (Ig)-like domain of the L1 cell adhesion molecule (CAM). Ig3_L1-CAM_like: domain similar to the third immunoglobulin (Ig)-like domain of the L1 cell adhesion molecule (CAM). L1 belongs to the L1 subfamily of cell adhesion molecules (CAMs) and is comprised of an extracellular region having six Ig-like domains and five fibronectin type III domains, a transmembrane region and an intracellular domain. L1 is primarily expressed in the nervous system and is involved in its development and function. L1 is associated with an X-linked recessive disorder, X-linked hydrocephalus, MASA syndrome, or spastic paraplegia type 1, that involves abnormalities of axonal growth. This group also contains the chicken neuron-glia cell adhesion molecule, Ng-CAM and human neurofascin.
Probab=97.75 E-value=3.3e-05 Score=62.32 Aligned_cols=44 Identities=30% Similarity=0.591 Sum_probs=36.5
Q ss_pred CcEEEEecCC-CCCcccc---CCCeEEEcccCCCCCeeEEEEEcCCCC
Q psy9228 2 AYIKWSRADG-LPLQRYA---EGNVLRITNARLQDSGKYKCEIQGHDS 45 (834)
Q Consensus 2 ~~~~w~~~~~-~~~~~~~---~~~~l~~~~~~~~d~g~y~c~~~~~~~ 45 (834)
++++|.|.++ ++..+.. .+..|+|.+|+.+|+|.|.|.+.|..+
T Consensus 13 p~v~W~k~~~~~~~~~~~~~~~~~~L~i~~v~~~D~G~Y~C~a~N~~G 60 (71)
T cd05731 13 PEISWIKIGGELPADRTKFENFNKTLKIDNVSEEDDGEYRCTASNSLG 60 (71)
T ss_pred CeEEEEECCeECCCCceeEecCCCEEEECCCCHHHCEEEEEEEEeCCc
Confidence 4799999888 7664332 467999999999999999999998765
No 48
>cd05723 Ig4_Neogenin Fourth immunoglobulin (Ig)-like domain in neogenin and similar proteins. Ig4_Neogenin: fourth immunoglobulin (Ig)-like domain in neogenin and related proteins. Neogenin is a cell surface protein which is expressed in the developing nervous system of vertebrate embryos in the growing nerve cells. It is also expressed in other embryonic tissues, and may play a general role in developmental processes such as cell migration, cell-cell recognition, and tissue growth regulation. Included in this group is the tumor suppressor protein DCC, which is deleted in colorectal carcinoma . DCC and neogenin each have four Ig-like domains followed by six fibronectin type III domains, a transmembrane domain, and an intracellular domain.
Probab=97.74 E-value=4.5e-05 Score=61.54 Aligned_cols=44 Identities=18% Similarity=0.449 Sum_probs=35.3
Q ss_pred CcEEEEecCC-C-CCcccc--CCCeEEEcccCCCCCeeEEEEEcCCCC
Q psy9228 2 AYIKWSRADG-L-PLQRYA--EGNVLRITNARLQDSGKYKCEIQGHDS 45 (834)
Q Consensus 2 ~~~~w~~~~~-~-~~~~~~--~~~~l~~~~~~~~d~g~y~c~~~~~~~ 45 (834)
++|+|.|.+. + |..+.. .++.|.|.+|+.+|+|.|.|.+.|..|
T Consensus 14 p~v~W~k~g~~~~~~~~~~~~~~~~l~i~~v~~~D~G~Y~C~A~N~~G 61 (71)
T cd05723 14 PTVKWVKNGDMVIPSDYFKIVKEHNLQVLGLVKSDEGFYQCIAENDVG 61 (71)
T ss_pred CEEEEEECCeECCCCCCEEEEecCCEEEEcCCcccCEEEEEEEEcCCC
Confidence 5899999777 4 433322 467899999999999999999998876
No 49
>cd05763 Ig_1 Subgroup of the immunoglobulin (Ig) superfamily. Ig_1: subgroup of the immunoglobulin (Ig) domain found in the Ig superfamily. The Ig superfamily is a heterogenous group of proteins, built on a common fold comprised of a sandwich of two beta sheets. Members of the Ig superfamily are components of immunoglobulin, neuroglia, cell surface glycoproteins, such as T-cell receptors, CD2, CD4, CD8, and membrane glycoproteins, such as butyrophilin and chondroitin sulfate proteoglycan core protein. A predominant feature of most Ig domains is a disulfide bridge connecting the two beta-sheets with a tryptophan residue packed against the disulfide bond.
Probab=97.73 E-value=6.3e-05 Score=61.48 Aligned_cols=44 Identities=30% Similarity=0.677 Sum_probs=35.0
Q ss_pred CcEEEEecCC--CCCccc--c----CCCeEEEcccCCCCCeeEEEEEcCCCC
Q psy9228 2 AYIKWSRADG--LPLQRY--A----EGNVLRITNARLQDSGKYKCEIQGHDS 45 (834)
Q Consensus 2 ~~~~w~~~~~--~~~~~~--~----~~~~l~~~~~~~~d~g~y~c~~~~~~~ 45 (834)
++|+|.|.++ +|.... . .++.|.|.+++.+|+|.|.|.+.|..+
T Consensus 13 P~v~W~k~~~~~~~~~~~~~~~~~~~~~~L~I~~~~~~D~G~Y~C~A~N~~G 64 (75)
T cd05763 13 PQIAWQKDGGTDFPAARERRMHVMPEDDVFFIVDVKIEDTGVYSCTAQNTAG 64 (75)
T ss_pred CEEEEEeCCCccCCcccccceEEecCCCEEEEeeCCcccCEEEEEEEEcCCC
Confidence 4799999775 555322 1 467999999999999999999988765
No 50
>PF02973 Sialidase: Sialidase, N-terminal domain; InterPro: IPR004124 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Sialidases (GH33 from CAZY) hydrolyse alpha-(2->3)-, alpha-(2->6)-, alpha-(2->8)-glycosidic linkages of terminal sialic residues in oligosaccharides, glycoproteins, glycolipids, colominic acid and synthetic substrates. Sialidases may act as pathogenic factors in microbial infections []. The 1.8 A structure of trans-sialidase from leech (Macrobdella decora, Q27701 from SWISSPROT) in complex with 2-deoxy-2, 3-didehydro-NeuAc was solved. The refined model comprising residues 81-769 has a catalytic beta-propeller domain, a N-terminal lectin-like domain and an irregular beta-stranded domain inserted into the catalytic domain [].; GO: 0004308 exo-alpha-sialidase activity, 0005975 carbohydrate metabolic process; PDB: 2JKB_A 2VW2_A 2VW0_A 2VW1_A 2V73_B 2V72_A 1SLI_A 1SLL_A 2SLI_A 4SLI_A ....
Probab=97.70 E-value=0.00099 Score=63.75 Aligned_cols=133 Identities=23% Similarity=0.231 Sum_probs=86.1
Q ss_pred CcceEEEEEEeCCCCe--eEEecCCCCCCCCCCcceEEEEEECCEEEEEEEcCCcEE-EEEeC-----CceecCCCcEEE
Q psy9228 674 NEETIAFDFVTDDKNA--LLLWNGQPSYKNGIGREFIAVAVVNGYLEYSYDLGDGVV-TIKFS-----KKPVNDGIKHSV 745 (834)
Q Consensus 674 ~~~~i~~~frT~~~~G--lLl~~~~~~~~~~~~~~~~~l~l~~G~l~~~~~~g~~~~-~l~~s-----~~~~nDg~wH~V 745 (834)
...+|-++||+....+ -||.++... ....|+.|++.++.+-+.++-..+.. ..... +...++-.||.|
T Consensus 33 ~~gTI~i~Fk~~~~~~~~sLfsiSn~~----~~n~YF~lyv~~~~~G~E~R~~~~~~~y~~~~~~~v~~~~~~~~~~~tv 108 (190)
T PF02973_consen 33 EEGTIVIRFKSDSNSGIQSLFSISNST----KGNEYFSLYVSNNKLGFELRDTKGNQNYNFSRPAKVRGGYKNNVTFNTV 108 (190)
T ss_dssp SSEEEEEEEEESS-SSEEEEEEEE-TS----TTSEEEEEEEETTEEEEEEEETTTTCEEEEEESSE--SEETTEES-EEE
T ss_pred cccEEEEEEecCCCcceeEEEEecCCC----CccceEEEEEECCEEEEEEecCCCCcccccccccEecccccCCceEEEE
Confidence 6679999999977665 366666543 34689999999998888887655522 21112 234567789999
Q ss_pred EEEEE--CCEEEEEEcCeeeecccCCCCcc--ceecCCceEEcCcCCCCCCCCCccCCCceEEEEEEEECCcee
Q psy9228 746 NVTRI--NKFGSLEVDSVIVGKGESPGSQD--VINTRGNIYLGGTPNMDLMTGGRYVHPMSGLMMNIHIQNKHI 815 (834)
Q Consensus 746 ~i~r~--~~~~~l~VD~~~~~~~~~~~~~~--~l~~~~~lyiGG~p~~~~~~~~~~~~~F~GCi~~v~in~~~~ 815 (834)
.+.-. .+..+|+|||+.+.......... .+.-...++|||.... +...-+|.|=|++|+|-+..+
T Consensus 109 a~~ad~~~~~ykly~NG~~v~~~~~~~~~Fis~i~~~n~~~iG~t~R~-----g~~~y~f~G~I~~l~iYn~aL 177 (190)
T PF02973_consen 109 AFVADSKNKGYKLYVNGELVSTLSSKSGNFISDIPGLNSVQIGGTNRA-----GSNAYPFNGTIDNLKIYNRAL 177 (190)
T ss_dssp EEEEETTTTEEEEEETTCEEEEEEECTSS-GGGSTT--EEEESSEEET-----TEEES--EEEEEEEEEESS--
T ss_pred EEEEecCCCeEEEEeCCeeEEEeccccccHhhcCcCCceEEEcceEeC-----CCceecccceEEEEEEEcCcC
Confidence 99998 78999999996665432222211 2222246999996432 335678999999999987644
No 51
>cd05760 Ig2_PTK7 Second immunoglobulin (Ig)-like domain of protein tyrosine kinase (PTK) 7, also known as CCK4. Ig2_PTK7: domain similar to the second immunoglobulin (Ig)-like domain in protein tyrosine kinase (PTK) 7, also known as CCK4. PTK7 is a subfamily of the receptor protein tyrosine kinase family, and is referred to as an RPTK-like molecule. RPTKs transduce extracellular signals across the cell membrane, and play important roles in regulating cell proliferation, migration, and differentiation. PTK7 is organized as an extracellular portion having seven Ig-like domains, a single transmembrane region, and a cytoplasmic tyrosine kinase-like domain. PTK7 is considered a pseudokinase as it has several unusual residues in some of the highly conserved tyrosine kinase (TK) motifs; it is predicted to lack TK activity. PTK7 may function as a cell-adhesion molecule. PTK7 mRNA is expressed at high levels in placenta, melanocytes, liver, lung, pancreas, and kidney. PTK7 is overexpressed in s
Probab=97.70 E-value=6.4e-05 Score=61.76 Aligned_cols=44 Identities=23% Similarity=0.308 Sum_probs=35.2
Q ss_pred CcEEEEecCC-CCCccc---c--CCCeEEEcccCCCCCeeEEEEEcCCCC
Q psy9228 2 AYIKWSRADG-LPLQRY---A--EGNVLRITNARLQDSGKYKCEIQGHDS 45 (834)
Q Consensus 2 ~~~~w~~~~~-~~~~~~---~--~~~~l~~~~~~~~d~g~y~c~~~~~~~ 45 (834)
++|+|.|.+. |+.... . .++.|.|++++.+|+|.|.|.+.|..|
T Consensus 13 p~v~W~k~g~~l~~~~~~~~~~~~~~~L~I~~~~~~D~G~Y~C~a~N~~G 62 (77)
T cd05760 13 PTYQWFRDGTPLSDGQGNYSVSSKERTLTLRSAGPDDSGLYYCCAHNAFG 62 (77)
T ss_pred CcEEEEECCEECCCCCccEEEeCCCCEEEEeeCCcccCEEEEEEEEeCCC
Confidence 4799999666 755421 1 356899999999999999999998765
No 52
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=97.70 E-value=4.5e-05 Score=52.82 Aligned_cols=35 Identities=40% Similarity=1.135 Sum_probs=29.1
Q ss_pred CccCCCCC-CCCCCCCEeccCCCCCceEeeCCCCCCCCCCC
Q psy9228 136 NTCKSSKH-NNCINNGLCQDAATRIGYTCICPPGFSGDRCS 175 (834)
Q Consensus 136 ~~C~~~~~-~pC~n~g~C~~~~~~~~~~C~C~~g~~G~~Ce 175 (834)
++|. . .||.++++|.+.. ++|.|.|++||.|..|+
T Consensus 3 ~~C~---~~~~C~~~~~C~~~~--~~~~C~C~~g~~g~~C~ 38 (38)
T cd00054 3 DECA---SGNPCQNGGTCVNTV--GSYRCSCPPGYTGRNCE 38 (38)
T ss_pred ccCC---CCCCcCCCCEeECCC--CCeEeECCCCCcCCcCC
Confidence 4566 5 7899999999876 78999999999998874
No 53
>cd04969 Ig5_Contactin_like Fifth Ig domain of contactin. Ig5_Contactin_like: Fifth Ig domain of contactins. Contactins are neural cell adhesion molecules and are comprised of six Ig domains followed by four fibronectin type III(FnIII) domains anchored to the membrane by glycosylphosphatidylinositol. The first four Ig domains form the intermolecular binding fragment, which arranges as a compact U-shaped module via contacts between Ig domains 1 and 4, and between Ig domains 2 and 3. Contactin-2 (TAG-1, axonin-1) may play a part in the neuronal processes of neurite outgrowth, axon guidance and fasciculation, and neuronal migration. This group also includes contactin-1 and contactin-5. The different contactins show different expression patterns in the central nervous system. During development and in adulthood, contactin-2 is transiently expressed in subsets of central and peripheral neurons. Contactin-5 is expressed specifically in the rat postnatal nervous system, peaking at about 3 week
Probab=97.69 E-value=5.3e-05 Score=61.54 Aligned_cols=44 Identities=27% Similarity=0.418 Sum_probs=36.1
Q ss_pred CcEEEEecCC-CCCcccc---CCCeEEEcccCCCCCeeEEEEEcCCCC
Q psy9228 2 AYIKWSRADG-LPLQRYA---EGNVLRITNARLQDSGKYKCEIQGHDS 45 (834)
Q Consensus 2 ~~~~w~~~~~-~~~~~~~---~~~~l~~~~~~~~d~g~y~c~~~~~~~ 45 (834)
++++|.|.+. |....+. .++.|.|.+|+.+|+|.|.|.+.|..+
T Consensus 16 p~v~W~k~~~~~~~~~~~~~~~~~~L~i~~v~~~D~G~Y~C~a~N~~G 63 (73)
T cd04969 16 PTISWSKGTELLTNSSRICIWPDGSLEILNVTKSDEGKYTCFAENFFG 63 (73)
T ss_pred CEEEEEECCEEcccCCCEEECCCCeEEEccCChhHCEEEEEEEECCCC
Confidence 5899999766 6544333 467899999999999999999998875
No 54
>cd05855 Ig_TrkB_d5 Fifth domain (immunoglobulin-like) of Trk receptor TrkB. TrkB_d5: the fifth domain of Trk receptor TrkB, this is an immunoglobulin (Ig)-like domain which binds to neurotrophin. The Trk family of receptors are tyrosine kinase receptors, which mediate the trophic effects of the neurotrophin Nerve growth factor (NGF) family. The Trks are activated by dimerization, leading to autophosphorylation of intracellular tyrosine residues, and triggering the signal transduction pathway. TrkB shares significant sequence homology and domain organization with TrkA, and TrkC. The first three domains are leucine-rich domains. The fourth and fifth domains are Ig-like domains playing a part in ligand binding. TrKB is recognized by brain-derived neurotrophic factor (BDNF) and neurotrophin (NT)-4. In some cell systems NT-3 can activate TrkA and TrkB receptors. TrKB transcripts are found throughout multiple structures of the central and peripheral nervous systems.
Probab=97.67 E-value=6.3e-05 Score=61.62 Aligned_cols=44 Identities=16% Similarity=0.300 Sum_probs=36.0
Q ss_pred CcEEEEecCC-CCCcccc-----------CCCeEEEcccCCCCCeeEEEEEcCCCC
Q psy9228 2 AYIKWSRADG-LPLQRYA-----------EGNVLRITNARLQDSGKYKCEIQGHDS 45 (834)
Q Consensus 2 ~~~~w~~~~~-~~~~~~~-----------~~~~l~~~~~~~~d~g~y~c~~~~~~~ 45 (834)
++|+|.|.+. |+...+. ..+.|+|.++..+|+|.|+|.|+|..|
T Consensus 13 Pti~W~kng~~l~~~~~~~~~~~~~~~~~~~~~L~i~~~~~~D~G~YtC~A~N~~G 68 (79)
T cd05855 13 PTLQWFHEGAILNESEYICTKIHVINNTEYHGCLQLDNPTHLNNGIYTLVAKNEYG 68 (79)
T ss_pred CceEEEECCEECCCCcceeeeeEeecccceEEEEEECCCCcccCEEEEEEEEcCCc
Confidence 5899999666 8765542 124799999999999999999999876
No 55
>cd05893 Ig_Palladin_C C-terminal immunoglobulin (Ig)-like domain of palladin. Ig_Palladin_C: C-terminal immunoglobulin (Ig)-like domain of palladin. Palladin belongs to the palladin-myotilin-myopalladin family. Proteins belonging to this family contain multiple Ig-like domains and function as scaffolds, modulating actin cytoskeleton. Palladin binds to alpha-actinin ezrin, vasodilator-stimulated phosphoprotein VASP, SPIN90 (DIP, mDia interacting protein), and Src. Palladin also binds F-actin directly, via its Ig3 domain. Palladin is expressed as several alternatively spliced isoforms, having various combinations of Ig-like domains, in a cell-type-specific manner. It has been suggested that palladin's different Ig-like domains may be specialized for distinct functions.
Probab=97.65 E-value=6.2e-05 Score=61.33 Aligned_cols=44 Identities=20% Similarity=0.361 Sum_probs=34.3
Q ss_pred CcEEEEecCC-CCCc-ccc---CC--C--eEEEcccCCCCCeeEEEEEcCCCC
Q psy9228 2 AYIKWSRADG-LPLQ-RYA---EG--N--VLRITNARLQDSGKYKCEIQGHDS 45 (834)
Q Consensus 2 ~~~~w~~~~~-~~~~-~~~---~~--~--~l~~~~~~~~d~g~y~c~~~~~~~ 45 (834)
++|.|.|.++ |+.. .+. .+ + .|.|.+|+.+|+|.|.|.+.|..|
T Consensus 13 P~v~W~k~~~~l~~~~~~~~~~~~~~~~~~L~I~~v~~~D~G~Y~C~A~N~~G 65 (75)
T cd05893 13 PQIFWKKENESLTHNTDRVSMHQDNCGYICLLIQGATKEDAGWYTVSAKNEAG 65 (75)
T ss_pred CEEEEEECCEECCCCCCeEEEEEcCCCEEEEEECCCCHHHCEEEEEEEEcCCC
Confidence 5899999777 7642 222 12 2 699999999999999999998875
No 56
>cd05867 Ig4_L1-CAM_like Fourth immunoglobulin (Ig)-like domain of the L1 cell adhesion molecule (CAM). Ig4_L1-CAM_like: fourth immunoglobulin (Ig)-like domain of the L1 cell adhesion molecule (CAM). L1 is comprised of an extracellular region having six Ig-like domains and five fibronectin type III domains, a transmembrane region and an intracellular domain. L1 is primarily expressed in the nervous system and is involved in its development and function. L1 is associated with an X-linked recessive disorder, X-linked hydrocephalus, MASA syndrome, or spastic paraplegia type 1, that involves abnormalities of axonal growth. This group also contains the chicken neuron-glia cell adhesion molecule, Ng-CAM.
Probab=97.64 E-value=7.4e-05 Score=61.23 Aligned_cols=44 Identities=23% Similarity=0.451 Sum_probs=34.9
Q ss_pred CcEEEEecCC-CCCc-----cccCCCeEEEcccCCCCCeeEEEEEcCCCC
Q psy9228 2 AYIKWSRADG-LPLQ-----RYAEGNVLRITNARLQDSGKYKCEIQGHDS 45 (834)
Q Consensus 2 ~~~~w~~~~~-~~~~-----~~~~~~~l~~~~~~~~d~g~y~c~~~~~~~ 45 (834)
++|+|.|.+. |... ....++.|.|++++.+|+|.|.|.+.|..|
T Consensus 16 p~i~W~k~g~~i~~~~~~~~~~~~~~~L~I~~~~~~D~G~Y~C~a~N~~G 65 (76)
T cd05867 16 PNITWSINGAPIEGTDPDPRRHVSSGALILTDVQPSDTAVYQCEARNRHG 65 (76)
T ss_pred CeEEEEECCEECCCCCCCceEEeeCCEEEECCCChhhCEEEEEEEECCCC
Confidence 4799999555 6542 122578999999999999999999998765
No 57
>cd04971 Ig_TrKABC_d5 Fifth domain (immunoglobulin-like) of Trk receptors TrkA, TrkB and TrkC. TrkABC_d5: the fifth domain of Trk receptors TrkA, TrkB and TrkC, this is an immunoglobulin (Ig)-like domain which binds to neurotrophin. The Trk family of receptors are tyrosine kinase receptors. They are activated by dimerization, leading to autophosphorylation of intracellular tyrosine residues, and triggering the signal transduction pathway. TrkA, TrkB, and TrkC share significant sequence homology and domain organization. The first three domains are leucine-rich domains. The fourth and fifth domains are Ig-like domains playing a part in ligand binding. TrkA, Band C mediate the trophic effects of the neurotrophin Nerve growth factor (NGF) family. TrkA is recognized by NGF. TrkB is recognized by brain-derived neurotrophic factor (BDNF) and neurotrophin (NT)-4. TrkC is recognized by NT-3. NT-3 is promiscuous as in some cell systems it activates TrkA and TrkB receptors. TrkA is a receptor foun
Probab=97.63 E-value=7e-05 Score=62.06 Aligned_cols=44 Identities=16% Similarity=0.288 Sum_probs=34.7
Q ss_pred CcEEEEecCC-CCCcccc-------------CCCeEEEcccCCCCCeeEEEEEcCCCC
Q psy9228 2 AYIKWSRADG-LPLQRYA-------------EGNVLRITNARLQDSGKYKCEIQGHDS 45 (834)
Q Consensus 2 ~~~~w~~~~~-~~~~~~~-------------~~~~l~~~~~~~~d~g~y~c~~~~~~~ 45 (834)
++|+|.|.+. |....+. ..+.|+|.+++.+|+|.|+|.++|..|
T Consensus 13 P~v~W~k~g~~i~~~~~~~~~~~~~~~~~~~~~~~L~I~~~~~~D~G~YtC~A~N~~G 70 (81)
T cd04971 13 PTLTWYHNGAVLNESDYIRTEIHYEVTTPTEYHGCLQFDNPTHVNNGNYTLVASNEYG 70 (81)
T ss_pred CcEEEEECCEECcCCCceeEEEEeecccccccEEEEEECCCCcccCeEEEEEEEeCCC
Confidence 5899999555 6554322 134799999999999999999999876
No 58
>cd05738 Ig2_RPTP_IIa_LAR_like Second immunoglobulin (Ig)-like domain of the receptor protein tyrosine phosphatase (RPTP)-F, also known as LAR. Ig2_RPTP_IIa_LAR_like: domain similar to the second immunoglobulin (Ig)-like domain found in the receptor protein tyrosine phosphatase (RPTP)-F, also known as LAR. LAR belongs to the RPTP type IIa subfamily. Members of this subfamily are cell adhesion molecule-like proteins involved in central nervous system (CNS) development. They have large extracellular portions, comprised of multiple Ig-like domains and two to nine fibronectin type III (FNIII) domains, and a cytoplasmic portion having two tandem phosphatase domains.
Probab=97.63 E-value=8.9e-05 Score=60.36 Aligned_cols=44 Identities=23% Similarity=0.372 Sum_probs=35.0
Q ss_pred CcEEEEecCC-CC--Ccccc---CCCeEEEcccCCCCCeeEEEEEcCCCC
Q psy9228 2 AYIKWSRADG-LP--LQRYA---EGNVLRITNARLQDSGKYKCEIQGHDS 45 (834)
Q Consensus 2 ~~~~w~~~~~-~~--~~~~~---~~~~l~~~~~~~~d~g~y~c~~~~~~~ 45 (834)
++|.|.|++. |. ...+. .++.|.|.+++.+|+|.|.|.+.|..|
T Consensus 13 p~v~W~k~~~~~~~~~~~~~~~~~~g~L~i~~~~~~D~G~Y~C~a~N~~G 62 (74)
T cd05738 13 PEITWFKDFLPVDTTSNGRIKQLRSGALQIENSEESDQGKYECVATNSAG 62 (74)
T ss_pred CEEEEEECCEECccCCCCCEEEcCCcEEEECCCChhhCEEEEEEEECCCC
Confidence 4799999665 65 22222 467999999999999999999998765
No 59
>cd05748 Ig_Titin_like Immunoglobulin (Ig)-like domain of titin and similar proteins. Ig_Titin_like: immunoglobulin (Ig)-like domain found in titin-like proteins. Titin (also called connectin) is a fibrous sarcomeric protein specifically found in vertebrate striated muscle. Titin is gigantic, depending on isoform composition it ranges from 2970 to 3700 kDa, and is of a length that spans half a sarcomere. Titin largely consists of multiple repeats of Ig-like and fibronectin type 3 (FN-III)-like domains. Titin connects the ends of myosin thick filaments to Z disks and extends along the thick filament to the H zone. It appears to function similarly to an elastic band, keeping the myosin filaments centered in the sarcomere during muscle contraction or stretching. Within the sarcomere, titin is also attached to or is associated with myosin binding protein C (MyBP-C). MyBP-C appears to contribute to the generation of passive tension by titin, and similar to titin has repeated Ig-like and FN-
Probab=97.60 E-value=9.4e-05 Score=60.24 Aligned_cols=44 Identities=30% Similarity=0.469 Sum_probs=35.1
Q ss_pred CcEEEEecCC-CCCcccc------CCCeEEEcccCCCCCeeEEEEEcCCCC
Q psy9228 2 AYIKWSRADG-LPLQRYA------EGNVLRITNARLQDSGKYKCEIQGHDS 45 (834)
Q Consensus 2 ~~~~w~~~~~-~~~~~~~------~~~~l~~~~~~~~d~g~y~c~~~~~~~ 45 (834)
+++.|.|.+. |....+. ....|+|.+++.+|+|.|+|.+.|..|
T Consensus 14 p~v~W~k~g~~l~~~~~~~~~~~~~~~~L~I~~~~~~D~G~Y~C~a~N~~G 64 (74)
T cd05748 14 PTVTWSKDGKPLKLSGRVQIETTASSTSLVIKNAERSDSGKYTLTLKNPAG 64 (74)
T ss_pred CeEEEEECCEEcCCCCeEEEEECCCeEEEEECCCCcCcCEEEEEEEECCCc
Confidence 5899999666 7443322 246899999999999999999998876
No 60
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=97.60 E-value=7.5e-05 Score=51.66 Aligned_cols=36 Identities=42% Similarity=0.943 Sum_probs=31.8
Q ss_pred ccCCCC-CCccCCCceeeecCCCcEEcCCCCCCCCCcc
Q psy9228 609 NNMCMK-GDVCKNGGMCKVTPDSYECLCSLGYAPPNCA 645 (834)
Q Consensus 609 ~~~C~~-~~pC~ngg~C~~~~~~~~C~C~~g~~G~~Ce 645 (834)
+++|.. . ||.+++.|.+..++|.|.|++||.|..|+
T Consensus 2 ~~~C~~~~-~C~~~~~C~~~~~~~~C~C~~g~~g~~C~ 38 (38)
T cd00054 2 IDECASGN-PCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38 (38)
T ss_pred cccCCCCC-CcCCCCEeECCCCCeEeECCCCCcCCcCC
Confidence 356766 5 99999999999999999999999999885
No 61
>cd04973 Ig1_FGFR First immunoglobulin (Ig)-like domain of fibroblast growth factor receptor (FGFR). Ig1_FGFR: The first immunoglobulin (Ig)-like domain of fibroblast growth factor receptor (FGFR). Fibroblast growth factors (FGFs) participate in morphogenesis, development, angiogenesis, and wound healing. These FGF-stimulated processes are mediated by four FGFR tyrosine kinases (FGRF1-4). FGFRs are comprised of an extracellular portion consisting of three Ig-like domains, a transmembrane helix, and a cytoplasmic portion having protein tyrosine kinase activity. The highly conserved Ig-like domains 2 and 3, and the linker region between D2 and D3 define a general binding site for all FGFs.
Probab=97.60 E-value=0.0001 Score=60.92 Aligned_cols=44 Identities=25% Similarity=0.455 Sum_probs=35.1
Q ss_pred CcEEEEecCC-CCCcccc--CCCeEEEcccCCCCCeeEEEEEcCCCC
Q psy9228 2 AYIKWSRADG-LPLQRYA--EGNVLRITNARLQDSGKYKCEIQGHDS 45 (834)
Q Consensus 2 ~~~~w~~~~~-~~~~~~~--~~~~l~~~~~~~~d~g~y~c~~~~~~~ 45 (834)
++|.|.|.+. |+...+. .++.|.|.+++.+|+|.|.|.+.|..+
T Consensus 23 ~~v~W~k~g~~l~~~~~~~~~~~~L~I~~~~~~DsG~Y~C~a~n~~g 69 (79)
T cd04973 23 QSINWTKDGVQLGENNRTRITGEEVQIKDAVPRDSGLYACVTSSPSG 69 (79)
T ss_pred ceEEEeeCCcCCCCCceEEEeCCEEEECCCChhhCEEEEEEEeCCCC
Confidence 3699999655 7654333 577899999999999999999987654
No 62
>cd04968 Ig3_Contactin_like Third Ig domain of contactin. Ig3_Contactin_like: Third Ig domain of contactins. Contactins are neural cell adhesion molecules and are comprised of six Ig domains followed by four fibronectin type III(FnIII) domains anchored to the membrane by glycosylphosphatidylinositol. The first four Ig domains form the intermolecular binding fragment, which arranges as a compact U-shaped module via contacts between Ig domains 1 and 4, and between Ig domains 2 and 3. Contactin-2 (TAG-1, axonin-1) may play a part in the neuronal processes of neurite outgrowth, axon guidance and fasciculation, and neuronal migration. This group also includes contactin-1 and contactin-5. The different contactins show different expression patterns in the central nervous system. During development and in adulthood, contactin-2 is transiently expressed in subsets of central and peripheral neurons. Contactin-5 is expressed specifically in the rat postnatal nervous system, peaking at about 3 week
Probab=97.60 E-value=9.9e-05 Score=62.47 Aligned_cols=44 Identities=36% Similarity=0.780 Sum_probs=36.0
Q ss_pred CcEEEEecCC-CCCccc--cCCCeEEEcccCCCCCeeEEEEEcCCCC
Q psy9228 2 AYIKWSRADG-LPLQRY--AEGNVLRITNARLQDSGKYKCEIQGHDS 45 (834)
Q Consensus 2 ~~~~w~~~~~-~~~~~~--~~~~~l~~~~~~~~d~g~y~c~~~~~~~ 45 (834)
++|+|.|.++ ++.... ..++.|+|.+|+.+|+|.|.|.+.|..|
T Consensus 31 p~v~W~k~~~~~~~~~~~~~~~~~L~i~~v~~~D~G~Y~C~a~N~~G 77 (88)
T cd04968 31 PQIKWRKVDGSMPSSAEISMSGAVLKIPNIQFEDEGTYECEAENIKG 77 (88)
T ss_pred CEEEEEECCCcCCCccEEEEeCCEEEECCCCcccCEEEEEEEEECCC
Confidence 4799999777 544332 2678999999999999999999988776
No 63
>cd05743 Ig_Perlecan_D2_like Immunoglobulin (Ig)-like domain II (D2) of the human basement membrane heparan sulfate proteoglycan perlecan, also known as HSPG2. Ig_Perlecan_D2_like: the immunoglobulin (Ig)-like domain II (D2) of the human basement membrane heparan sulfate proteoglycan perlecan, also known as HSPG2. Perlecan consists of five domains. Domain I has three putative heparan sulfate attachment sites; domain II has four LDL receptor-like repeats, and one Ig-like repeat; domain III resembles the short arm of laminin chains; domain IV has multiple Ig-like repeats (21 repeats in human perlecan); and domain V resembles the globular G domain of the laminin A chain and internal repeats of EGF. Perlecan may participate in a variety of biological functions including cell binding, LDL-metabolism, basement membrane assembly and selective permeability, calcium binding, and growth- and neurite-promoting activities.
Probab=97.59 E-value=7.2e-05 Score=61.64 Aligned_cols=44 Identities=27% Similarity=0.472 Sum_probs=36.1
Q ss_pred CcEEEEecCC-CCCcccc----C--CCeEEEcccCCCCCeeEEEEEcCCCC
Q psy9228 2 AYIKWSRADG-LPLQRYA----E--GNVLRITNARLQDSGKYKCEIQGHDS 45 (834)
Q Consensus 2 ~~~~w~~~~~-~~~~~~~----~--~~~l~~~~~~~~d~g~y~c~~~~~~~ 45 (834)
++|+|.|.++ ++...+. . .+.|+|.+|+.+|+|.|.|.+.|..+
T Consensus 16 p~v~W~~~~~~~~~~~~~~~~~~~~~~~L~I~~v~~~D~G~Y~C~a~N~~G 66 (78)
T cd05743 16 PIINWRLNWGHVPDSARVSITSEGGYGTLTIRDVKESDQGAYTCEAINTRG 66 (78)
T ss_pred CEEEEEECCeECCCCCCEEEEECCCEEEEEECCCChHHCEEEEEEEEecCC
Confidence 5799999777 8765432 2 35899999999999999999998876
No 64
>cd05892 Ig_Myotilin_C C-terminal immunoglobulin (Ig)-like domain of myotilin. Ig_Myotilin_C: C-terminal immunoglobulin (Ig)-like domain of myotilin. Mytolin belongs to the palladin-myotilin-myopalladin family. Proteins belonging to the latter family contain multiple Ig-like domains and function as scaffolds, modulating actin cytoskeleton. Myotilin is most abundant in skeletal and cardiac muscle, and is involved in maintaining sarcomere integrity. It binds to alpha-actinin, filamin and actin. Mutations in myotilin lead to muscle disorders.
Probab=97.59 E-value=0.00011 Score=59.93 Aligned_cols=44 Identities=20% Similarity=0.352 Sum_probs=33.9
Q ss_pred CcEEEEecCC-CCC-cccc----C-CC--eEEEcccCCCCCeeEEEEEcCCCC
Q psy9228 2 AYIKWSRADG-LPL-QRYA----E-GN--VLRITNARLQDSGKYKCEIQGHDS 45 (834)
Q Consensus 2 ~~~~w~~~~~-~~~-~~~~----~-~~--~l~~~~~~~~d~g~y~c~~~~~~~ 45 (834)
++|.|.|.+. |.. ..+. . ++ .|+|.+++.+|+|.|.|.+.|.-|
T Consensus 13 P~i~W~k~~~~i~~~~~r~~~~~~~~g~~~L~I~~~~~~D~G~Y~C~A~N~~G 65 (75)
T cd05892 13 PKIFWKRNNEMVQYNTDRISLYQDNSGRVTLLIKNVNKKDAGWYTVSAVNEAG 65 (75)
T ss_pred CeEEEEECCEECcCCCCeEEEEEcCCCcEEEEECCCChhhCEEEEEEEEcCcC
Confidence 5899999665 653 2222 2 22 799999999999999999998765
No 65
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes []. +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=97.57 E-value=4.5e-05 Score=54.00 Aligned_cols=31 Identities=29% Similarity=0.797 Sum_probs=27.3
Q ss_pred CCCCccC--CCCC-CCeeecCCCCeEEeCCCCcc
Q psy9228 177 LGEPCYP--GACG-DGSCQDVDGAMKCLCPIGTA 207 (834)
Q Consensus 177 ~~~~C~~--~~C~-~g~C~~~~~~~~C~C~~g~~ 207 (834)
|+|||.. ++|. ++.|+|..++|+|.|++||.
T Consensus 1 DidEC~~~~~~C~~~~~C~N~~Gsy~C~C~~Gy~ 34 (42)
T PF07645_consen 1 DIDECAEGPHNCPENGTCVNTEGSYSCSCPPGYE 34 (42)
T ss_dssp ESSTTTTTSSSSSTTSEEEEETTEEEEEESTTEE
T ss_pred CccccCCCCCcCCCCCEEEcCCCCEEeeCCCCcE
Confidence 5789987 4697 58999999999999999997
No 66
>cd05873 Ig_Sema4D_like Immunoglobulin (Ig)-like domain of the class IV semaphorin Sema4D. Ig_Sema4D_like; Immunoglobulin (Ig)-like domain of Sema4D. Sema4D is a Class IV semaphorin. Semaphorins are classified based on structural features additional to the Sema domain. Sema4D has extracellular Sema and Ig domains, a transmembrane domain, and a short cytoplasmic domain. Sema4D plays a part in the development of GABAergic synapses. Sema4D in addition is an immune semaphorin. It is abundant on resting T cells; its expression is weak on resting B cells and antigen presenting cells (APCs), but is upregulated by various stimuli. The receptor used by Sema4D in the immune system is CD72. Sem4D enhances the activation of B cells and DCs through binding CD72, perhaps by reducing CD72s inhibitory signals. The receptor used by Sema4D in the non-lymphatic tissues is plexin-B1. Sem4D is anchored to the cell surface but its extracellular domain can be released from the cell surface by a metalloproteas
Probab=97.56 E-value=6e-05 Score=63.02 Aligned_cols=43 Identities=26% Similarity=0.353 Sum_probs=34.5
Q ss_pred CcEEEEecCC-CCCc-c--ccCCCeEEEcccCCCCCeeEEEEEcCCC
Q psy9228 2 AYIKWSRADG-LPLQ-R--YAEGNVLRITNARLQDSGKYKCEIQGHD 44 (834)
Q Consensus 2 ~~~~w~~~~~-~~~~-~--~~~~~~l~~~~~~~~d~g~y~c~~~~~~ 44 (834)
++|+|.|.+. |+.. . .+.++.|.|.||+.+|+|.|.|.+.+..
T Consensus 25 ~~i~W~~ng~~i~~~~~r~~~~~~~L~I~nv~~~DsG~Y~C~a~e~~ 71 (87)
T cd05873 25 ARVVWKFDGKVLTPESAKYLLYRDGLLIFNASEADAGRYQCLSVEKS 71 (87)
T ss_pred CeEEEEECCEECCCCCceEEEECCcEEEeCCCHHHCEEEEEEEEecc
Confidence 5799999665 7654 1 2257789999999999999999998663
No 67
>cd00152 PTX Pentraxins are plasma proteins characterized by their pentameric discoid assembly and their Ca2+ dependent ligand binding, such as Serum amyloid P component (SAP) and C-reactive Protein (CRP), which are cytokine-inducible acute-phase proteins implicated in innate immunity. CRP binds to ligands containing phosphocholine, SAP binds to amyloid fibrils, DNA, chromatin, fibronectin, C4-binding proteins and glycosaminoglycans. "Long" pentraxins have N-terminal extensions to the common pentraxin domain; one group, the neuronal pentraxins, may be involved in synapse formation and remodeling, and they may also be able to form heteromultimers.
Probab=97.55 E-value=0.0036 Score=62.11 Aligned_cols=148 Identities=16% Similarity=0.128 Sum_probs=88.3
Q ss_pred cccccccCCcc-ccceeEEEEEEEeeCC--CCeeEE-EeccCCccccCCCCCeEEEEEECcEEEEEEecccEEEEeeeee
Q psy9228 391 PLSYLALPTLT-DAHLHFSIELSFKPTD--YNGLIM-YTGDSNMKSYKGKGDFVSFGLEDGYPVFRFDVGLVVLRSKVTL 466 (834)
Q Consensus 391 ~~s~l~~~~~~-~~~~~~~i~~~frt~~--~~GlLl-~~~~~~~~~~~~~~d~~~l~L~~G~l~~~~~~G~~~i~s~~~~ 466 (834)
...|+.++... .....|++.+++|+.. ..+.|| |.... ....++...-.+|++.+.+.-.. .......
T Consensus 16 ~~~yv~l~~~~~~~l~~fTv~~Wv~~~~~~~~~~ifSy~~~~------~~~~~~l~~~~~g~~~~~i~~~~--~~~~~~~ 87 (201)
T cd00152 16 DTSYVKLKPELPKPLQAFTLCLWVYTDLSTREYSLFSYATKG------QDNELLLYKEKDGGYSLYIGGKE--VTFKVPE 87 (201)
T ss_pred CCceEEEccCCCCChhhEEEEEEEEecCCCCCeEEEEEeCCC------CCCeEEEEEcCCCeEEEEEcCEE--EEEeccC
Confidence 34677775433 3557799999999875 445555 33321 12233333224466666553222 2223455
Q ss_pred cCCCeEEEEEEEECC--eEEEEECCeeeeeeecCCCccccccCCCceeeccccCCCcccCcCcccccCceeEEeeeEEcC
Q psy9228 467 VPHEWVVVTIIKDFK--EGKLSVGGEPLIVGSTPGEKLQVLNLRTPLYLGGYNIYHVTPSLSVEVTEGFHGCISTIDVLG 544 (834)
Q Consensus 467 ~dg~wH~V~v~~~~~--~~~l~VD~~~~~~~~~~g~~~~~l~~~~~l~iGG~~~~~~~~~~~~~~~~gF~GCi~~v~ing 544 (834)
.+|+||+|.++++.. .++|+|||....... ... ...+...+.+.||-..... .........|.|-|.++.+-.
T Consensus 88 ~~g~W~hv~~t~d~~~g~~~lyvnG~~~~~~~-~~~-~~~~~~~g~l~lG~~q~~~---gg~~~~~~~f~G~I~~v~iw~ 162 (201)
T cd00152 88 SDGAWHHICVTWESTSGIAELWVNGKLSVRKS-LKK-GYTVGPGGSIILGQEQDSY---GGGFDATQSFVGEISDVNMWD 162 (201)
T ss_pred CCCCEEEEEEEEECCCCcEEEEECCEEecccc-ccC-CCEECCCCeEEEeecccCC---CCCCCCCcceEEEEceeEEEc
Confidence 999999999999854 678999998765433 111 2344556678887543321 111223467999999999977
Q ss_pred eeeeecc
Q psy9228 545 SELDLIN 551 (834)
Q Consensus 545 ~~~~~~~ 551 (834)
+.+...+
T Consensus 163 ~~Ls~~e 169 (201)
T cd00152 163 SVLSPEE 169 (201)
T ss_pred ccCCHHH
Confidence 6655443
No 68
>cd05764 Ig_2 Subgroup of the immunoglobulin (Ig) superfamily. Ig_2: subgroup of the immunoglobulin (Ig) domain found in the Ig superfamily. The Ig superfamily is a heterogenous group of proteins, built on a common fold comprised of a sandwich of two beta sheets. Members of the Ig superfamily are components of immunoglobulin, neuroglia, cell surface glycoproteins, such as T-cell receptors, CD2, CD4, CD8, and membrane glycoproteins, such as butyrophilin and chondroitin sulfate proteoglycan core protein. A predominant feature of most Ig domains is a disulfide bridge connecting the two beta-sheets with a tryptophan residue packed against the disulfide bond.
Probab=97.53 E-value=0.00013 Score=59.43 Aligned_cols=44 Identities=20% Similarity=0.473 Sum_probs=36.0
Q ss_pred CcEEEEecCC--CCCcccc---CCCeEEEcccCCCCCeeEEEEEcCCCC
Q psy9228 2 AYIKWSRADG--LPLQRYA---EGNVLRITNARLQDSGKYKCEIQGHDS 45 (834)
Q Consensus 2 ~~~~w~~~~~--~~~~~~~---~~~~l~~~~~~~~d~g~y~c~~~~~~~ 45 (834)
++++|.|.++ ++...+. .++.|+|.+++.+|+|.|+|.+.|..+
T Consensus 16 p~v~W~~~~~~~~~~~~~~~~~~~~~L~i~~~~~~D~G~Y~C~a~N~~G 64 (74)
T cd05764 16 PAIHWISPDGKLISNSSRTLVYDNGTLDILITTVKDTGSFTCIASNAAG 64 (74)
T ss_pred CEEEEEeCCCEEecCCCeEEEecCCEEEEEECChhhCEEEEEEEECCCC
Confidence 4899998677 5554332 578999999999999999999998875
No 69
>cd05740 Ig_CEACAM_D4 Fourth immunoglobulin (Ig)-like domain of carcinoembryonic antigen (CEA) related cell adhesion molecule (CEACAM). Ig_CEACAM_D4: immunoglobulin (Ig)-like domain 4 in carcinoembryonic antigen (CEA) related cell adhesion molecule (CEACAM) protein subfamily. The CEA family is a group of anchored or secreted glycoproteins, expressed by epithelial cells, leukocytes, endothelial cells and placenta. The CEA family is divided into the CEACAM and pregnancy-specific glycoprotein (PSG) subfamilies. This group represents the CEACAM subfamily. CEACAM1 has many important cellular functions, it is a cell adhesion molecule, and a signaling molecule that regulates the growth of tumor cells, it is an angiogenic factor, and is a receptor for bacterial and viral pathogens, including mouse hepatitis virus (MHV). In mice, four isoforms of CEACAM1 generated by alternative splicing have either two [D1, D4] or four [D1-D4] Ig-like domains on the cell surface. This family corresponds to the
Probab=97.52 E-value=0.00013 Score=61.96 Aligned_cols=44 Identities=30% Similarity=0.565 Sum_probs=36.0
Q ss_pred CcEEEEecCC-CCCcccc---CCCeEEEcccCCCCCeeEEEEEcCCCC
Q psy9228 2 AYIKWSRADG-LPLQRYA---EGNVLRITNARLQDSGKYKCEIQGHDS 45 (834)
Q Consensus 2 ~~~~w~~~~~-~~~~~~~---~~~~l~~~~~~~~d~g~y~c~~~~~~~ 45 (834)
..|.|.|.+. |...+.. .+..|+|.+|+.+|+|.|.|.+.|..|
T Consensus 32 p~i~W~kng~~l~~~~~~~~~~~~~L~I~~v~~~D~G~Y~C~a~N~~G 79 (91)
T cd05740 32 TYIWWVNNGSLLVPPRLQLSNDNRTLTFNNVTRSDTGHYQCEASNEVS 79 (91)
T ss_pred CEEEEEECCEECCCCCEEEeCCCCEEEECcCChhhCEEEEEEEEcCCC
Confidence 5799999666 6655432 457999999999999999999998875
No 70
>smart00159 PTX Pentraxin / C-reactive protein / pentaxin family. This family form a doscoid pentameric structure. Human serum amyloid P demonstrates calcium-mediated ligand-binding.
Probab=97.51 E-value=0.0039 Score=62.05 Aligned_cols=148 Identities=15% Similarity=0.127 Sum_probs=87.2
Q ss_pred ccccccCCcc-ccceeEEEEEEEeeCC--CCeeEEEeccCCccccCCCCCeEEEEEECcEEEEEEecccEEEEeeeeecC
Q psy9228 392 LSYLALPTLT-DAHLHFSIELSFKPTD--YNGLIMYTGDSNMKSYKGKGDFVSFGLEDGYPVFRFDVGLVVLRSKVTLVP 468 (834)
Q Consensus 392 ~s~l~~~~~~-~~~~~~~i~~~frt~~--~~GlLl~~~~~~~~~~~~~~d~~~l~L~~G~l~~~~~~G~~~i~s~~~~~d 468 (834)
..|+.++... ..-..+++.+++|+.. .++.||-..... ....++...-.++.+.+.+ +...+..+..+.+
T Consensus 17 ~~yv~l~~~~~~~l~~fTvc~W~k~~~~~~~~~ifSy~~~~-----~~ne~~~~~~~~~~~~l~i--~g~~~~~~~~~~~ 89 (206)
T smart00159 17 TSYVKLKPELPKPLQAFTVCLWFYSDLSPRGYSLFSYATKG-----QDNELLLYKEKQGEYSLYI--GGKKVQFPVPESD 89 (206)
T ss_pred CCeEEEccCCCCChhHEEEEEEEEecCCCCceEEEEEeCCC-----CCCeEEEEEcCCcEEEEEE--cCeEEEecccccC
Confidence 4677765432 2456799999999975 455555332211 1223332222444444444 3234455567899
Q ss_pred CCeEEEEEEEECC--eEEEEECCeeeeeeecCCCccccccCCCceeeccccCCCcccCcCcccccCceeEEeeeEEcCee
Q psy9228 469 HEWVVVTIIKDFK--EGKLSVGGEPLIVGSTPGEKLQVLNLRTPLYLGGYNIYHVTPSLSVEVTEGFHGCISTIDVLGSE 546 (834)
Q Consensus 469 g~wH~V~v~~~~~--~~~l~VD~~~~~~~~~~g~~~~~l~~~~~l~iGG~~~~~~~~~~~~~~~~gF~GCi~~v~ing~~ 546 (834)
|+||+|.++++.. .++|+|||..... ..... ...+...+.+.||-..+. ..........|.|-|.++.|=.+.
T Consensus 90 g~W~hvc~tw~~~~g~~~lyvnG~~~~~-~~~~~-g~~i~~~G~lvlGq~qd~---~gg~f~~~~~f~G~i~~v~iw~~~ 164 (206)
T smart00159 90 GKWHHICTTWESSSGIAELWVDGKPGVR-KGLAK-GYTVKPGGSIILGQEQDS---YGGGFDATQSFVGEIGDLNMWDSV 164 (206)
T ss_pred CceEEEEEEEECCCCcEEEEECCEEccc-ccccC-CcEECCCCEEEEEecccC---CCCCCCCCcceeEEEeeeEEeccc
Confidence 9999999999854 6789999987521 11111 123445666777764331 111223356799999999997766
Q ss_pred eeecc
Q psy9228 547 LDLIN 551 (834)
Q Consensus 547 ~~~~~ 551 (834)
+...+
T Consensus 165 Ls~~e 169 (206)
T smart00159 165 LSPEE 169 (206)
T ss_pred CCHHH
Confidence 65544
No 71
>cd05868 Ig4_NrCAM Fourth immunoglobulin (Ig)-like domain of NrCAM (NgCAM-related cell adhesion molecule). Ig4_ NrCAM: fourth immunoglobulin (Ig)-like domain of NrCAM (NgCAM-related cell adhesion molecule). NrCAM belongs to the L1 subfamily of cell adhesion molecules (CAMs) and is comprised of an extracellular region having six IG-like domains and five fibronectin type III domains, a transmembrane region and an intracellular domain. NrCAM is primarily expressed in the nervous system.
Probab=97.51 E-value=0.00013 Score=59.78 Aligned_cols=44 Identities=16% Similarity=0.418 Sum_probs=35.1
Q ss_pred CcEEEEecCC-CCCc-----cccCCCeEEEcccCCCCCeeEEEEEcCCCC
Q psy9228 2 AYIKWSRADG-LPLQ-----RYAEGNVLRITNARLQDSGKYKCEIQGHDS 45 (834)
Q Consensus 2 ~~~~w~~~~~-~~~~-----~~~~~~~l~~~~~~~~d~g~y~c~~~~~~~ 45 (834)
++|+|.|.+. |... +..+++.|.|.+|+.+|+|.|.|.++|..|
T Consensus 16 P~i~W~k~g~~i~~~~~~~~~~~~~~~l~i~~v~~~D~G~Y~C~A~N~~G 65 (76)
T cd05868 16 PSISWLTNGVPIEIAPTDPSRKVDGDTIIFSKVQERSSAVYQCNASNEYG 65 (76)
T ss_pred CeEEEEECCEEccccCCCceEEecCCEEEECCCCHhhCEEEEEEEEcCCC
Confidence 5899999555 6432 123567999999999999999999998876
No 72
>cd05754 Ig3_Perlecan_like Third immunoglobulin (Ig)-like domain found in Perlecan and similar proteins. Ig3_Perlecan_like: domain similar to the third immunoglobulin (Ig)-like domain found in Perlecan. Perlecan is a large multi-domain heparin sulfate proteoglycan, important in tissue development and organogenesis. Perlecan can be represented as 5 major portions; its fourth major portion (domain IV) is a tandem repeat of immunoglobulin-like domains (Ig2-Ig15), which can vary in size due to alternative splicing. Perlecan binds many cellular and extracellular ligands. Its domain IV region has many binding sites. Some of these have been mapped at the level of individual Ig-like domains, including a site restricted to the Ig5 domain for heparin/sulfatide, a site restricted to the Ig3 domain for nidogen-1 and nidogen-2, a site restricted to Ig4-5 for fibronectin, and sites restricted to Ig2 and to Ig13-15 for fibulin-2.
Probab=97.51 E-value=0.00014 Score=61.04 Aligned_cols=44 Identities=30% Similarity=0.551 Sum_probs=37.9
Q ss_pred CcEEEEecCC-CCCccccCCCeEEEcccCCCCCeeEEEEEcCCCC
Q psy9228 2 AYIKWSRADG-LPLQRYAEGNVLRITNARLQDSGKYKCEIQGHDS 45 (834)
Q Consensus 2 ~~~~w~~~~~-~~~~~~~~~~~l~~~~~~~~d~g~y~c~~~~~~~ 45 (834)
.+|.|.|.+. +|......++.|.|.+|+.+|+|.|.|.+.|..+
T Consensus 32 ~~i~W~~~~~~~~~~~~~~~~~L~I~~v~~~DsG~Y~C~a~n~~g 76 (85)
T cd05754 32 YTLVWTRVGGGLPSRAMDFNGILTIRNVQLSDAGTYVCTGSNMLD 76 (85)
T ss_pred cEEEEEECCCcCCCcccccCCEEEECCCCHHHCEEEEEEEeccCC
Confidence 3799999777 8876666688999999999999999999987764
No 73
>cd05750 Ig_Pro_neuregulin Immunoglobulin (Ig)-like domain in neuregulins (NRGs). Ig_Pro_neuregulin: immunoglobulin (Ig)-like domain in neuregulins (NRGs). NRGs are signaling molecules, which participate in cell-cell interactions in the nervous system, breast, heart, and other organ systems, and are implicated in the pathology of diseases including schizophrenia, multiple sclerosis, and breast cancer. There are four members of the neuregulin gene family (NRG1, -2, -3, and -4). The NRG-1 protein, binds to and activates the tyrosine kinases receptors ErbB3 and ErbB4, initiating signaling cascades. The other NRGs proteins bind one or the other or both of these ErbBs. NRG-1 has multiple functions; for example, in the brain it regulates various processes such as radial glia formation and neuronal migration, dendritic development, and expression of neurotransmitters receptors; in the peripheral nervous system NRG-1 regulates processes such as target cell differentiation, and Schwann cell surv
Probab=97.50 E-value=0.0002 Score=58.37 Aligned_cols=44 Identities=27% Similarity=0.507 Sum_probs=35.1
Q ss_pred CcEEEEecCC-CCCcccc---------CCCeEEEcccCCCCCeeEEEEEcCCCC
Q psy9228 2 AYIKWSRADG-LPLQRYA---------EGNVLRITNARLQDSGKYKCEIQGHDS 45 (834)
Q Consensus 2 ~~~~w~~~~~-~~~~~~~---------~~~~l~~~~~~~~d~g~y~c~~~~~~~ 45 (834)
++++|.|.+. |+..... ....|+|.+++.+|+|.|.|.++|..+
T Consensus 14 p~~~W~k~g~~l~~~~~~~~~~~~~~~~~~~L~I~~~~~~D~G~Y~C~a~N~~G 67 (75)
T cd05750 14 LRFKWFKDGKELNRKNKPRNIKIRNKKKNSELQINKAKLADSGEYTCVVENILG 67 (75)
T ss_pred ceEEEEcCCeeccccCCcceEEEEecCceEEEEEccCCcccCeEEEEEEEEcCC
Confidence 5899999665 7654321 246899999999999999999998875
No 74
>cd05757 Ig2_IL1R_like Second immunoglobulin (Ig)-like domain of interleukin-1 receptor (IL1R) and similar proteins. Ig2_IL1R_like: domain similar to the second immunoglobulin (Ig)-like domain of interleukin-1 receptor (IL1R). IL-1 alpha and IL-1 beta are cytokines which participate in the regulation of inflammation, immune responses, and hematopoiesis. These cytokines bind to the IL-1 receptor type 1 (IL1R1), which is activated on additional association with an accessory protein, IL1RAP. IL-1 also binds a second receptor designated type II (IL1R2). Mature IL1R1 consists of three IG-like domains, a transmembrane domain, and a large cytoplasmic domain. Mature IL1R2 is organized similarly except that it has a short cytoplasmic domain. The latter does not initiate signal transduction. A naturally occurring cytokine IL-1RA (IL-1 receptor antagonist) is widely expressed and binds to IL-1 receptors, inhibiting the binding of IL-1 alpha and IL-1 beta. This group also contains ILIR-like 1 (IL1
Probab=97.48 E-value=0.00015 Score=61.64 Aligned_cols=44 Identities=25% Similarity=0.471 Sum_probs=36.1
Q ss_pred CcEEEEecCC-CCCcccc--CCCeEEEcccCCCCCeeEEEEEcCCCC
Q psy9228 2 AYIKWSRADG-LPLQRYA--EGNVLRITNARLQDSGKYKCEIQGHDS 45 (834)
Q Consensus 2 ~~~~w~~~~~-~~~~~~~--~~~~l~~~~~~~~d~g~y~c~~~~~~~ 45 (834)
++|.|.|.+. |+..... .++.|.|.+|+.+|+|.|.|.+.+..+
T Consensus 30 p~i~Wyk~~~~i~~~~~~~~~~~~L~I~~v~~~DsG~YtC~~~n~~~ 76 (92)
T cd05757 30 PPVQWYKDCKLLEGDRKRFVKGSKLLIQNVTEEDAGNYTCKLTFTHN 76 (92)
T ss_pred CcEEEeECCEECCCCccEEecCCEEEEeeCChhhCEEEEEEEEecCC
Confidence 4899999666 7654443 678999999999999999999987643
No 75
>cd05762 Ig8_MLCK Eighth immunoglobulin (Ig)-like domain of human myosin light-chain kinase (MLCK). Ig8_MLCK: the eighth immunoglobulin (Ig)-like domain of human myosin light-chain kinase (MLCK). MLCK is a key regulator of different forms of cell motility involving actin and myosin II. Agonist stimulation of smooth muscle cells increases cytosolic Ca2+, which binds calmodulin. This Ca2+-calmodulin complex in turn binds to and activates MLCK. Activated MLCK leads to the phosphorylation of the 20 kDa myosin regulatory light chain (RLC) of myosin II and the stimulation of actin-activated myosin MgATPase activity. MLCK is widely present in vertebrate tissues; it phosphorylates the 20 kDa RLC of both smooth and nonmuscle myosin II. Phosphorylation leads to the activation of the myosin motor domain and altered structural properties of myosin II. In smooth muscle MLCK it is involved in initiating contraction. In nonmuscle cells, MLCK may participate in cell division and cell motility; it has
Probab=97.47 E-value=0.00014 Score=62.86 Aligned_cols=44 Identities=16% Similarity=0.250 Sum_probs=36.3
Q ss_pred CcEEEEecCC-CCCcccc------CCCeEEEcccCCCCCeeEEEEEcCCCC
Q psy9228 2 AYIKWSRADG-LPLQRYA------EGNVLRITNARLQDSGKYKCEIQGHDS 45 (834)
Q Consensus 2 ~~~~w~~~~~-~~~~~~~------~~~~l~~~~~~~~d~g~y~c~~~~~~~ 45 (834)
.+|.|.|++. |..+.+. ....|.|.+++.+|+|.|+|.+.|..|
T Consensus 30 p~v~W~kdg~~l~~~~~~~~~~~~~~s~L~I~~~~~~D~G~Ytc~a~N~~G 80 (98)
T cd05762 30 ITCTWMKFRKQIQEGEGIKIENTENSSKLTITEGQQEHCGCYTLEVENKLG 80 (98)
T ss_pred CceEEEECCEEecCCCcEEEEecCCeeEEEECCCChhhCEEEEEEEEcCCC
Confidence 5899999666 8654333 257899999999999999999999887
No 76
>cd05853 Ig6_Contactin-4 Sixth Ig domain of contactin-4. Ig6_Contactin-4: sixth Ig domain of the neural cell adhesion molecule contactin-4. Contactins are neural cell adhesion molecules, and are comprised of six Ig domains followed by four fibronectin type III (FnIII) domains anchored to the membrane by glycosylphosphatidylinositol. The different contactins show different expression patterns in the central nervous system. Highest expresson of contactin-4 is in testes, thyroid, small intestine, uterus and brain. Contactin-4 plays a role in the response of neuroblastoma cells to differentiating agents, such as retinoids. The contactin 4 gene is associated with cerebellar degeneration in spinocerebellar ataxia type 16.
Probab=97.47 E-value=0.00011 Score=61.23 Aligned_cols=43 Identities=26% Similarity=0.488 Sum_probs=33.5
Q ss_pred CcEEEEecCC-CCCc----cc------cCCCeEEEcccCCCCCeeEEEEEcCCC
Q psy9228 2 AYIKWSRADG-LPLQ----RY------AEGNVLRITNARLQDSGKYKCEIQGHD 44 (834)
Q Consensus 2 ~~~~w~~~~~-~~~~----~~------~~~~~l~~~~~~~~d~g~y~c~~~~~~ 44 (834)
.+|.|.|.+. |... +. ..++.|+|.||+.+|+|.|+|.+++.-
T Consensus 17 ~~~~W~~dg~~i~~~~~~~~~~~~~~~~~~~~L~I~nv~~~dsG~YtC~a~n~~ 70 (85)
T cd05853 17 IVFTWSFNGHLIDFQKDGDHFERVGGQDSAGDLMIRSIQLKHAGKYVCMVQTSV 70 (85)
T ss_pred cEEEEEECCEECcccCCCccEEEeccCCCCCcEEEecCCHHHCEEEEEEEEccc
Confidence 5799999666 7621 21 134689999999999999999998764
No 77
>cd05734 Ig7_DSCAM Seventh immunoglobulin (Ig)-like domain of Down Syndrome Cell Adhesion molecule (DSCAM). Ig7_DSCAM: the seventh immunoglobulin (Ig)-like domain of Down Syndrome Cell Adhesion molecule (DSCAM). DSCAM is a cell adhesion molecule expressed largely in the developing nervous system. The gene encoding DSCAM is located at human chromosome 21q22, the locus associated with the mental retardation phenotype of Down Syndrome. DSCAM is predicted to be the largest member of the IG superfamily. It has been demonstrated that DSCAM can mediate cation-independent homophilic intercellular adhesion.
Probab=97.43 E-value=0.00028 Score=58.28 Aligned_cols=44 Identities=25% Similarity=0.448 Sum_probs=34.4
Q ss_pred CcEEEEecCC--CCC-------ccc---cCCCeEEEcccCCCCCeeEEEEEcCCCC
Q psy9228 2 AYIKWSRADG--LPL-------QRY---AEGNVLRITNARLQDSGKYKCEIQGHDS 45 (834)
Q Consensus 2 ~~~~w~~~~~--~~~-------~~~---~~~~~l~~~~~~~~d~g~y~c~~~~~~~ 45 (834)
++|+|.|.++ .|. ..+ ..++.|.|.+++.+|+|.|.|.+.|..|
T Consensus 13 P~v~W~~~~~~~~~~~~~~~~~~~r~~~~~~~~L~I~~v~~~D~G~Y~C~a~N~~G 68 (79)
T cd05734 13 PTIVWKHSKGRGHPQHTHTCCLAGRIQLLSNGSLLIKHVLEEDSGYYLCKVSNDVG 68 (79)
T ss_pred CEEEEEECCCccccccccccccCCCEEEecCCeEEECcCCcccCEEEEEEEEeCCC
Confidence 5899999765 332 111 1468999999999999999999998865
No 78
>cd05756 Ig1_IL1R_like First immunoglobulin (Ig)-like domain of interleukin-1 receptor (IL1R) and similar proteins. Ig1_IL1R_like: domain similar to the first immunoglobulin (Ig)-like domain of interleukin-1 receptor (IL1R). IL-1 alpha and IL-1 beta are cytokines which participate in the regulation of inflammation, immune responses, and hematopoiesis. These cytokines bind to the IL-1 receptor type 1 (IL1R1), which is activated on additional association with an accessory protein, IL1RAP. IL-1 also binds a second receptor designated type II (IL1R2). Mature IL1R1 consists of three Ig-like domains, a transmembrane domain, and a large cytoplasmic domain. Mature IL1R2 is organized similarly except that it has a short cytoplasmic domain. The latter does not initiate signal transduction. A naturally occurring cytokine IL-1RA (IL-1 receptor antagonist) is widely expressed and binds to IL-1 receptors, inhibiting the binding of IL-1 alpha and IL-1 beta.
Probab=97.42 E-value=0.00024 Score=60.76 Aligned_cols=44 Identities=27% Similarity=0.599 Sum_probs=34.9
Q ss_pred CcEEEEecCC-CCCc----ccc--CCCeEEEcccCCCCCeeEEEEEcCCCC
Q psy9228 2 AYIKWSRADG-LPLQ----RYA--EGNVLRITNARLQDSGKYKCEIQGHDS 45 (834)
Q Consensus 2 ~~~~w~~~~~-~~~~----~~~--~~~~l~~~~~~~~d~g~y~c~~~~~~~ 45 (834)
.+|.|.|.+. ++.. .+. .++.|.|.+++.+|+|.|.|.+.|..+
T Consensus 34 ~~v~Wyk~~~~~~~~~~~~~r~~~~~~~L~I~~~~~~DsG~Y~C~~~N~~g 84 (94)
T cd05756 34 LNLTWYKSDGKTPIPTEERSRMHQQKDLLWFLPAALEDSGLYTCVVRNSTY 84 (94)
T ss_pred ceEEEEEcCCCCcccccccceeeecCCeEEEccCCcccCeEEEEEEcCCCc
Confidence 4799999776 5432 222 478999999999999999999988765
No 79
>cd04975 Ig4_SCFR_like Fourth immunoglobulin (Ig)-like domain of stem cell factor receptor (SCFR) and similar proteins. Ig4_SCFR_like; fourth immunoglobulin (Ig)-like domain of stem cell factor receptor (SCFR). In addition to SCFR this group also includes the fourth Ig domain of platelet-derived growth factor receptors (PDGFR), alpha and beta, the fourth Ig domain of macrophage colony stimulating factor (M-CSF), and the Ig domain of the receptor tyrosine kinase KIT. SCFR and the PDGFR alpha and beta have similar organization: an extracellular component having five Ig-like domains, a transmembrane segment, and a cytoplasmic portion having protein tyrosine kinase activity. SCFR and its ligand SCF are critical for normal hematopoiesis, mast cell development, melanocytes and gametogenesis. SCF binds to the second and third Ig-like domains of SCFR, this fourth Ig-like domain participates in SCFR dimerization, which follows ligand binding. Deletion of this fourth SCFR_Ig-like domain abolishes
Probab=97.41 E-value=0.00014 Score=62.99 Aligned_cols=44 Identities=18% Similarity=0.338 Sum_probs=35.1
Q ss_pred CcEEEEecCC-CCCcccc----C-------CCeEEEcccCCCCCeeEEEEEcCCCC
Q psy9228 2 AYIKWSRADG-LPLQRYA----E-------GNVLRITNARLQDSGKYKCEIQGHDS 45 (834)
Q Consensus 2 ~~~~w~~~~~-~~~~~~~----~-------~~~l~~~~~~~~d~g~y~c~~~~~~~ 45 (834)
++|+|.|++. |+.+... . ...|.|.+|+.+|+|.|+|.++|..+
T Consensus 34 p~v~W~Kdg~~i~~~~~~~~~~~~~~~~~~~~~L~i~~v~~~D~G~Ytc~A~N~~G 89 (101)
T cd04975 34 PHINWTYDNRTLTNKLTEIVTSENESEYRYVSELKLVRLKESEAGTYTFLASNSDA 89 (101)
T ss_pred CccEEEeCCeeCCCCcceeEEEeccCcceEEEEEEEeecCHhhCeeEEEEEECCCc
Confidence 4799999555 8754332 1 36799999999999999999998876
No 80
>cd04978 Ig4_L1-NrCAM_like Fourth immunoglobulin (Ig)-like domain of L1, Ng-CAM (Neuron-glia CAM cell adhesion molecule), and NrCAM (Ng-CAM-related). Ig4_L1-NrCAM_like: fourth immunoglobulin (Ig)-like domain of L1, Ng-CAM (Neuron-glia CAM cell adhesion molecule), and NrCAM (Ng-CAM-related). These proteins belong to the L1 subfamily of cell adhesion molecules (CAMs) and are comprised of an extracellular region having six Ig-like domains and five fibronectin type III domains, a transmembrane region and an intracellular domain. These molecules are primarily expressed in the nervous system. L1 is associated with an X-linked recessive disorder, X-linked hydrocephalus, MASA syndrome, or spastic paraplegia type 1, that involves abnormalities of axonal growth.
Probab=97.38 E-value=0.00027 Score=57.81 Aligned_cols=44 Identities=18% Similarity=0.386 Sum_probs=35.2
Q ss_pred CcEEEEecCC-CCCcc-----ccCCCeEEEcccCCCCCeeEEEEEcCCCC
Q psy9228 2 AYIKWSRADG-LPLQR-----YAEGNVLRITNARLQDSGKYKCEIQGHDS 45 (834)
Q Consensus 2 ~~~~w~~~~~-~~~~~-----~~~~~~l~~~~~~~~d~g~y~c~~~~~~~ 45 (834)
++|+|.|.+. |.... ...++.|.|.+|+.+|+|.|.|.+.|..+
T Consensus 16 p~i~W~~~g~~~~~~~~~~~~~~~~~~L~i~~v~~~D~G~Y~C~A~N~~G 65 (76)
T cd04978 16 PTITWRLNGVPIEELPPDPRRRVDGGTLILSNVQPNDTAVYQCNASNVHG 65 (76)
T ss_pred CEEEEEECCEECCCCCCcceEEccCCEEEECCCChhhCEEEEEEEEccCC
Confidence 4799999655 54432 22578999999999999999999988765
No 81
>cd05744 Ig_Myotilin_C_like Immunoglobulin (Ig)-like domain of myotilin, palladin, and myopalladin. Ig_Myotilin_like_C: immunoglobulin (Ig)-like domain in myotilin, palladin, and myopalladin. Myotilin, palladin, and myopalladin function as scaffolds that regulate actin organization. Myotilin and myopalladin are most abundant in skeletal and cardiac muscle; palladin is ubiquitously expressed in the organs of developing vertebrates and plays a key role in cellular morphogenesis. The three family members each interact with specific molecular partners: all three bind to alpha-actinin; in addition, palladin also binds to vasodilator-stimulated phosphoprotein (VASP) and ezrin, myotilin binds to filamin and actin, and myopalladin also binds to nebulin and cardiac ankyrin repeat protein (CARP).
Probab=97.36 E-value=0.00036 Score=56.93 Aligned_cols=44 Identities=23% Similarity=0.382 Sum_probs=33.9
Q ss_pred CcEEEEecCC-CCCc-ccc----CC-C--eEEEcccCCCCCeeEEEEEcCCCC
Q psy9228 2 AYIKWSRADG-LPLQ-RYA----EG-N--VLRITNARLQDSGKYKCEIQGHDS 45 (834)
Q Consensus 2 ~~~~w~~~~~-~~~~-~~~----~~-~--~l~~~~~~~~d~g~y~c~~~~~~~ 45 (834)
++|.|.|.+. |+.. .+. .+ + .|.|.+++.+|+|.|.|.+.|..|
T Consensus 13 P~v~W~k~~~~i~~~~~r~~~~~~~~~~~~L~I~~~~~~D~G~Y~C~a~N~~G 65 (75)
T cd05744 13 PQIFWKKNNEMLTYNTDRISLYQDNCGRICLLIQNANKEDAGWYTVSAVNEAG 65 (75)
T ss_pred CeEEEEECCEECCCCCCcEEEEEcCCCeEEEEECCCCcccCEEEEEEEEcCCC
Confidence 4799999666 7642 222 21 2 699999999999999999998876
No 82
>cd05866 Ig1_NCAM-2 First immunoglobulin (Ig)-like domain of neural cell adhesion molecule NCAM-2. Ig1_NCAM-2: first immunoglobulin (Ig)-like domain of neural cell adhesion molecule NCAM-2 (OCAM/mamFas II, RNCAM). NCAM-2 is organized similarly to NCAM , including five N-terminal Ig-like domains and two fibronectin type III domains. NCAM-2 is differentially expressed in the developing and mature olfactory epithelium (OE), and may function like NCAM, as an adhesion molecule.
Probab=97.36 E-value=0.00024 Score=60.38 Aligned_cols=43 Identities=30% Similarity=0.618 Sum_probs=33.1
Q ss_pred cEEEEecCC--C-CCcccc--CCC---eEEEcccCCCCCeeEEEEEcCCCC
Q psy9228 3 YIKWSRADG--L-PLQRYA--EGN---VLRITNARLQDSGKYKCEIQGHDS 45 (834)
Q Consensus 3 ~~~w~~~~~--~-~~~~~~--~~~---~l~~~~~~~~d~g~y~c~~~~~~~ 45 (834)
.++|.|.++ + +..+.. .++ .|+|.+|+.+|+|.|.|.++|..|
T Consensus 30 ~v~W~k~~g~~~~~~~r~~~~~~g~~~~L~I~~v~~~D~G~Y~C~A~N~~G 80 (92)
T cd05866 30 SIDWYNPQGEKIVSSQRVVVQKEGVRSRLTIYNANIEDAGIYRCQATDAKG 80 (92)
T ss_pred eEEEEeCCCeEecCCCCEEEEeCCCeeEEEEecCChHHCEEEEEEEEcCCC
Confidence 789998666 3 333322 233 899999999999999999998865
No 83
>cd05728 Ig4_Contactin-2-like Fourth Ig domain of the neural cell adhesion molecule contactin-2 and similar proteins. Ig4_Contactin-2-like: fourth Ig domain of the neural cell adhesion molecule contactin-2. Contactins are comprised of six Ig domains followed by four fibronectin type III (FnIII) domains anchored to the membrane by glycosylphosphatidylinositol. Contactin-2 (aliases TAG-1, axonin-1) facilitates cell adhesion by homophilic binding between molecules in apposed membranes. The first four Ig domains form the intermolecular binding fragment which arranges as a compact U-shaped module by contacts between Ig domains 1 and 4, and domains 2 and 3. It has been proposed that a linear zipper-like array forms, from contactin-2 molecules alternatively provided by the two apposed membranes.
Probab=97.36 E-value=0.00022 Score=59.83 Aligned_cols=44 Identities=30% Similarity=0.527 Sum_probs=36.7
Q ss_pred CcEEEEecCC-CCCcccc--CCCeEEEcccCCCCCeeEEEEEcCCCC
Q psy9228 2 AYIKWSRADG-LPLQRYA--EGNVLRITNARLQDSGKYKCEIQGHDS 45 (834)
Q Consensus 2 ~~~~w~~~~~-~~~~~~~--~~~~l~~~~~~~~d~g~y~c~~~~~~~ 45 (834)
+++.|.|.+. |+...+. .++.|.|.+++.+|+|.|.|.+.|..+
T Consensus 29 p~v~W~k~g~~~~~~~~~~~~~~~L~i~~~~~~D~G~Y~C~a~N~~G 75 (85)
T cd05728 29 PAYRWLKNGQPLASENRIEVEAGDLRITKLSLSDSGMYQCVAENKHG 75 (85)
T ss_pred CEEEEEECCEECCCCCeEEEeCCEEEEeeCCHHHCEEEEEEEECCCC
Confidence 4799999766 7654333 678999999999999999999998766
No 84
>smart00181 EGF Epidermal growth factor-like domain.
Probab=97.35 E-value=0.00026 Score=47.96 Aligned_cols=30 Identities=43% Similarity=1.153 Sum_probs=24.7
Q ss_pred C-CCCCCCCEeccCCCCCceEeeCCCCCCC-CCCC
Q psy9228 143 H-NNCINNGLCQDAATRIGYTCICPPGFSG-DRCS 175 (834)
Q Consensus 143 ~-~pC~n~g~C~~~~~~~~~~C~C~~g~~G-~~Ce 175 (834)
. +||.++ +|++.. ++|+|.|++||.| ..|+
T Consensus 4 ~~~~C~~~-~C~~~~--~~~~C~C~~g~~g~~~C~ 35 (35)
T smart00181 4 SGGPCSNG-TCINTP--GSYTCSCPPGYTGDKRCE 35 (35)
T ss_pred CcCCCCCC-EEECCC--CCeEeECCCCCccCCccC
Confidence 5 689988 999875 8899999999998 7764
No 85
>smart00560 LamGL LamG-like jellyroll fold domain.
Probab=97.34 E-value=0.0065 Score=55.85 Aligned_cols=66 Identities=24% Similarity=0.228 Sum_probs=47.6
Q ss_pred CCcEEEEEEEEC--CEEEEEEcCeeeecccCCCCccceecCCceEEcC-cCCCCCCCCCccCCCceEEEEEEEECCce
Q psy9228 740 GIKHSVNVTRIN--KFGSLEVDSVIVGKGESPGSQDVINTRGNIYLGG-TPNMDLMTGGRYVHPMSGLMMNIHIQNKH 814 (834)
Q Consensus 740 g~wH~V~i~r~~--~~~~l~VD~~~~~~~~~~~~~~~l~~~~~lyiGG-~p~~~~~~~~~~~~~F~GCi~~v~in~~~ 814 (834)
|+||||.++.++ ++++|+|||..+...... .....+++.||. ... .......|.|.|.+|+|-+..
T Consensus 61 ~~W~hva~v~d~~~g~~~lYvnG~~~~~~~~~----~~~~~~~~~iG~~~~~-----~~~~~~~f~G~Idevriy~~a 129 (133)
T smart00560 61 GVWVHLAGVYDGGAGKLSLYVNGVEVATSETQ----PSPSSGNLPQGGRILL-----GGAGGENFSGRLDEVRVYNRA 129 (133)
T ss_pred CCEEEEEEEEECCCCeEEEEECCEEccccccC----CcccCCceEEeeeccC-----CCCCCCCceEEeeEEEEeccc
Confidence 899999999998 799999999876543221 123456888884 211 112456899999999997763
No 86
>cd05726 Ig4_Robo Fhird immunoglobulin (Ig)-like domain in Robo (roundabout) receptors. Ig4_Robo: domain similar to the fhird immunoglobulin (Ig)-like domain in Robo (roundabout) receptors. Robo receptors play a role in the development of the central nervous system (CNS), and are receptors of Slit protein. Slit is a repellant secreted by the neural cells in the midline. Slit acts through Robo to prevent most neurons from crossing the midline from either side. Three mammalian Robo homologs (robo1, -2, and -3), and three mammalian Slit homologs (Slit-1,-2, -3), have been identified. Commissural axons, which cross the midline, express low levels of Robo; longitudinal axons, which avoid the midline, express high levels of Robo. robo1, -2, and -3 are expressed by commissural neurons in the vertebrate spinal cord and Slits 1, -2, -3 are expressed at the ventral midline. Robo-3 is a divergent member of the Robo family which instead of being a positive regulator of slit responsiveness, antagoni
Probab=97.34 E-value=0.00028 Score=59.97 Aligned_cols=44 Identities=25% Similarity=0.420 Sum_probs=33.6
Q ss_pred CcEEEEecCC-CC---------Ccccc--CCCeEEEcccCCCCCeeEEEEEcCCCC
Q psy9228 2 AYIKWSRADG-LP---------LQRYA--EGNVLRITNARLQDSGKYKCEIQGHDS 45 (834)
Q Consensus 2 ~~~~w~~~~~-~~---------~~~~~--~~~~l~~~~~~~~d~g~y~c~~~~~~~ 45 (834)
.+|.|.|.++ +. ..+.. .++.|+|.+|+.+|+|.|.|.+.|..|
T Consensus 16 p~v~W~k~g~~~~~~~~~~~~~~~r~~v~~~~~L~I~~v~~~D~G~Y~C~a~N~~G 71 (90)
T cd05726 16 PAIFWQKEGSQNLLFSYQPPQSSSRFSVSQTGDLTITNVQRSDVGYYICQTLNVAG 71 (90)
T ss_pred CEEEEEeCCCcceeecccCCCCCCeEEECCCCeEEEeeCChhhCEEEEEEEEcCCC
Confidence 4799999765 31 11111 367899999999999999999998765
No 87
>KOG3513|consensus
Probab=97.34 E-value=0.00049 Score=80.97 Aligned_cols=44 Identities=39% Similarity=0.764 Sum_probs=36.9
Q ss_pred CcEEEEecCCCCCcccc----CCCeEEEcccCCCCCeeEEEEEcCCCC
Q psy9228 2 AYIKWSRADGLPLQRYA----EGNVLRITNARLQDSGKYKCEIQGHDS 45 (834)
Q Consensus 2 ~~~~w~~~~~~~~~~~~----~~~~l~~~~~~~~d~g~y~c~~~~~~~ 45 (834)
.+|.|+|.||.|..+++ .+.+|+|+||+.+|+|.|+|.++|..|
T Consensus 271 P~i~W~k~~g~~~~~r~~~~~~~~vL~I~nv~~~D~G~Y~C~AeN~~G 318 (1051)
T KOG3513|consen 271 PQIKWRKVDGKPPPRRATYSNYGKVLKIPNVQYEDAGEYECIAENSRG 318 (1051)
T ss_pred CcEEEEeCCCCCCCcceeeeccccEEEecccCcCCCeEEEEEEecccc
Confidence 37999999993333333 588999999999999999999999987
No 88
>cd05733 Ig6_L1-CAM_like Sixth immunoglobulin (Ig)-like domain of the L1 cell adhesion molecule (CAM) and similar proteins. Ig6_L1-CAM_like: domain similar to the sixth immunoglobulin (Ig)-like domain of the L1 cell adhesion molecule (CAM). L1 belongs to the L1 subfamily of cell adhesion molecules (CAMs) and is comprised of an extracellular region having six Ig-like domains and five fibronectin type III domains, a transmembrane region and an intracellular domain. L1 is primarily expressed in the nervous system and is involved in its development and function. L1 is associated with an X-linked recessive disorder, X-linked hydrocephalus, MASA syndrome, or spastic paraplegia type 1, that involves abnormalities of axonal growth. This group also contains NrCAM [Ng(neuronglia)CAM-related cell adhesion molecule], which is primarily expressed in the nervous system, and human neurofascin.
Probab=97.33 E-value=0.00046 Score=56.62 Aligned_cols=44 Identities=18% Similarity=0.377 Sum_probs=33.8
Q ss_pred CcEEEEecCC-CCCcc--cc----CCCeEEEcccCC----CCCeeEEEEEcCCCC
Q psy9228 2 AYIKWSRADG-LPLQR--YA----EGNVLRITNARL----QDSGKYKCEIQGHDS 45 (834)
Q Consensus 2 ~~~~w~~~~~-~~~~~--~~----~~~~l~~~~~~~----~d~g~y~c~~~~~~~ 45 (834)
++|+|.|++. |+... +. .++.|.|.+++. +|+|.|.|.+.|..|
T Consensus 13 P~v~W~k~g~~l~~~~~~~~~~~~~~g~L~i~~~~~~~~~~d~G~Y~C~A~N~~G 67 (77)
T cd05733 13 PTFSWTRNGTHFDPEKDPRVTMKPDSGTLVIDNMNGGRAEDYEGEYQCYASNELG 67 (77)
T ss_pred CeEEEEECCeECCCCCCCCEEEeCCCCEEEEeccCCCCCcCCCEEEEEEEEcCCC
Confidence 4899999655 65421 11 468999999864 799999999998876
No 89
>cd05739 Ig3_RPTP_IIa_LAR_like Third immunoglobulin (Ig)-like domain of the receptor protein tyrosine phosphatase (RPTP)-F, also known as LAR. Ig3_RPTP_IIa_LAR_like: domain similar to the third immunoglobulin (Ig)-like domain found in the receptor protein tyrosine phosphatase (RPTP)-F, also known as LAR. LAR belongs to the RPTP type IIa subfamily. Members of this subfamily are cell adhesion molecule-like proteins involved in central nervous system (CNS) development. They have large extracellular portions, comprised of multiple IG-like domains and two to nine fibronectin type III (FNIII) domains, and a cytoplasmic portion having two tandem phosphatase domains. Included in this group is Drosophila LAR (DLAR).
Probab=97.32 E-value=0.00035 Score=55.91 Aligned_cols=42 Identities=29% Similarity=0.506 Sum_probs=33.2
Q ss_pred CcEEEEecCC-CCCcccc--CCCeEEEcccCCCCCeeEEEEEcCCCC
Q psy9228 2 AYIKWSRADG-LPLQRYA--EGNVLRITNARLQDSGKYKCEIQGHDS 45 (834)
Q Consensus 2 ~~~~w~~~~~-~~~~~~~--~~~~l~~~~~~~~d~g~y~c~~~~~~~ 45 (834)
++|+|.|.+. |+...+. .+..|+|.+++ |+|.|+|.+.|..+
T Consensus 16 P~v~W~k~g~~l~~~~~~~~~~~~L~i~~~~--d~G~Y~C~A~N~~G 60 (69)
T cd05739 16 PYVKWMKGGEELTKEDEMPVGRNVLELTNIY--ESANYTCVAISSLG 60 (69)
T ss_pred CEEEEEECCEECCCccceecCccEEEEeccc--cCeeEEEEEEeCCC
Confidence 4899999776 7665443 45689999974 89999999998765
No 90
>cd05859 Ig4_PDGFR-alpha Fourth immunoglobulin (Ig)-like domain of platelet-derived growth factor receptor (PDGFR) alpha. IG4_PDGFR-alpha: The fourth immunoglobulin (Ig)-like domain of platelet-derived growth factor receptor (PDGFR) alpha. PDGF is a potent mitogen for connective tissue cells. PDGF-stimulated processes are mediated by three different PDGFs (PDGF-A,-B, and C). PDGFR alpha binds to all three PDGFs, whereas the PDGFR beta (not included in this group) binds only to PDGF-B. PDGF alpha is organized as an extracellular component having five Ig-like domains, a transmembrane segment, and a cytoplasmic portion having protein tyrosine kinase activity. In mice, PDGFR alpha and PDGFR beta are essential for normal development.
Probab=97.29 E-value=0.00025 Score=61.50 Aligned_cols=44 Identities=25% Similarity=0.366 Sum_probs=35.6
Q ss_pred CcEEEEecCC-CCCcccc------------CCCeEEEcccCCCCCeeEEEEEcCCCC
Q psy9228 2 AYIKWSRADG-LPLQRYA------------EGNVLRITNARLQDSGKYKCEIQGHDS 45 (834)
Q Consensus 2 ~~~~w~~~~~-~~~~~~~------------~~~~l~~~~~~~~d~g~y~c~~~~~~~ 45 (834)
++|+|.|++. |+...+. ....|+|.+++.+|+|.|+|.++|..+
T Consensus 33 P~v~W~kdg~~l~~~~~~~~~~~~~~~~~~~~s~L~I~~v~~~D~G~Ytc~A~N~~g 89 (101)
T cd05859 33 PQIRWLKDNRTLIENLTEITTSEHNVQETRYVSKLKLIRAKEEDSGLYTALAQNEDA 89 (101)
T ss_pred CceEEEECCEECcCCcceEEeccccccceeeccEEEEeeCCHHHCEEEEEEEEcCCc
Confidence 4899999666 8765432 135899999999999999999998876
No 91
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=97.27 E-value=0.00036 Score=47.41 Aligned_cols=33 Identities=52% Similarity=1.219 Sum_probs=28.8
Q ss_pred CC-CCCCCCCCeecccCCCCCceeeecCCCCCCC-CCc
Q psy9228 571 CA-PKPCQNYGICYPTDTSERGYNCSCLTGYSGD-HCE 606 (834)
Q Consensus 571 C~-~~pC~ngg~C~~~~~~~~~~~C~C~~G~~G~-~Ce 606 (834)
|. ..+|.+++.|++... .|+|.|+.||.|. .|+
T Consensus 2 C~~~~~C~~~~~C~~~~~---~~~C~C~~g~~g~~~C~ 36 (36)
T cd00053 2 CAASNPCSNGGTCVNTPG---SYRCVCPPGYTGDRSCE 36 (36)
T ss_pred CCCCCCCCCCCEEecCCC---CeEeECCCCCcccCCcC
Confidence 55 789999999999987 8999999999998 663
No 92
>cd05724 Ig2_Robo Second immunoglobulin (Ig)-like domain in Robo (roundabout) receptors. Ig2_Robo: domain similar to the second immunoglobulin (Ig)-like domain in Robo (roundabout) receptors. Robo receptors play a role in the development of the central nervous system (CNS), and are receptors of Slit protein. Slit is a repellant secreted by the neural cells in the midline. Slit acts through Robo to prevent most neurons from crossing the midline from either side. Three mammalian Robo homologs (robo1, -2, and -3), and three mammalian Slit homologs (Slit-1,-2, -3), have been identified. Commissural axons, which cross the midline, express low levels of Robo; longitudinal axons, which avoid the midline, express high levels of Robo. robo1, -2, and -3 are expressed by commissural neurons in the vertebrate spinal cord and Slits 1, -2, -3 are expressed at the ventral midline. Robo-3 is a divergent member of the Robo family which instead of being a positive regulator of slit responsiveness, antago
Probab=97.26 E-value=0.00045 Score=58.07 Aligned_cols=44 Identities=27% Similarity=0.494 Sum_probs=34.8
Q ss_pred CcEEEEecCC-CCC-cccc---CCCeEEEcccCCCCCeeEEEEEcCCCC
Q psy9228 2 AYIKWSRADG-LPL-QRYA---EGNVLRITNARLQDSGKYKCEIQGHDS 45 (834)
Q Consensus 2 ~~~~w~~~~~-~~~-~~~~---~~~~l~~~~~~~~d~g~y~c~~~~~~~ 45 (834)
.+|.|.|.+. +.. ..+. .++.|.|.+++.+|+|.|.|.+.|..|
T Consensus 27 p~i~W~k~g~~~~~~~~~~~~~~~~~L~I~~~~~~D~G~Y~C~a~N~~G 75 (86)
T cd05724 27 PTVSWRKDGQPLNLDNERVRIVDDGNLLIAEARKSDEGTYKCVATNMVG 75 (86)
T ss_pred CEEEEEECCEECCCCCCCEEEccCCEEEEeECCcccCEEEEEEEEeccC
Confidence 4799999655 554 2222 468999999999999999999998765
No 93
>cd05895 Ig_Pro_neuregulin-1 Immunoglobulin (Ig)-like domain found in neuregulin (NRG)-1. Ig_Pro_neuregulin-1: immunoglobulin (Ig)-like domain found in neuregulin (NRG)-1. There are many NRG-1 isoforms which arise from the alternative splicing of mRNA. NRG-1 belongs to the neuregulin gene family, which is comprised of four genes. This group represents NRG-1. NRGs are signaling molecules, which participate in cell-cell interactions in the nervous system, breast, and heart, and other organ systems, and are implicated in the pathology of diseases including schizophrenia, multiple sclerosis, and breast cancer. The NRG-1 protein binds to and activates the tyrosine kinases receptors ErbB3 and ErbB4, initiating signaling cascades. NRG-1 has multiple functions; for example, in the brain it regulates various processes such as radial glia formation and neuronal migration, dendritic development, and expression of neurotransmitters receptors; in the peripheral nervous system NRG-1 regulates process
Probab=97.26 E-value=0.00052 Score=56.14 Aligned_cols=44 Identities=25% Similarity=0.508 Sum_probs=33.9
Q ss_pred CcEEEEecCC-CCCcccc----------CCCeEEEcccCCCCCeeEEEEEcCCCC
Q psy9228 2 AYIKWSRADG-LPLQRYA----------EGNVLRITNARLQDSGKYKCEIQGHDS 45 (834)
Q Consensus 2 ~~~~w~~~~~-~~~~~~~----------~~~~l~~~~~~~~d~g~y~c~~~~~~~ 45 (834)
.+++|.|.+. |+..... ....|.|.+++.+|+|.|.|.++|..+
T Consensus 14 ~~~~W~~~g~~i~~~~~~~~~~~~~~~~~~~~L~I~~~~~~DsG~Y~C~a~N~~g 68 (76)
T cd05895 14 LRFKWFKNGKEIGAKNKPDNKIKIRKKKKSSELQISKASLADNGEYKCMVSSKLG 68 (76)
T ss_pred CceEEEECCcccccCCCCCceEEEEeCCcEEEEEECcCCcccCEEEEEEEEeCCC
Confidence 4799999655 7542111 246799999999999999999998765
No 94
>smart00181 EGF Epidermal growth factor-like domain.
Probab=97.24 E-value=0.00039 Score=47.08 Aligned_cols=33 Identities=48% Similarity=1.118 Sum_probs=28.3
Q ss_pred CCCC-CCCCCCCeecccCCCCCceeeecCCCCCC-CCCc
Q psy9228 570 VCAP-KPCQNYGICYPTDTSERGYNCSCLTGYSG-DHCE 606 (834)
Q Consensus 570 ~C~~-~pC~ngg~C~~~~~~~~~~~C~C~~G~~G-~~Ce 606 (834)
+|.. .||.++ .|++..+ +|+|.|++||.| +.|+
T Consensus 1 ~C~~~~~C~~~-~C~~~~~---~~~C~C~~g~~g~~~C~ 35 (35)
T smart00181 1 ECASGGPCSNG-TCINTPG---SYTCSCPPGYTGDKRCE 35 (35)
T ss_pred CCCCcCCCCCC-EEECCCC---CeEeECCCCCccCCccC
Confidence 3666 699999 9999976 999999999999 7774
No 95
>cd05856 Ig2_FGFRL1-like Second immunoglobulin (Ig)-like domain of fibroblast growth factor (FGF) receptor_like-1(FGFRL1). Ig2_FGFRL1-like: second immunoglobulin (Ig)-like domain of fibroblast growth factor (FGF) receptor_like-1(FGFRL1). FGFRL1 is comprised of a signal peptide, three extracellular Ig-like modules, a transmembrane segment, and a short intracellular domain. FGFRL1 is expressed preferentially in skeletal tissues. Similar to FGF receptors, the expressed protein interacts specifically with heparin and with FGF2. FGFRL1 does not have a protein tyrosine kinase domain at its C terminus; neither does its cytoplasmic domain appear to interact with a signaling partner. It has been suggested that FGFRL1 may not have any direct signaling function, but instead acts as a decoy receptor trapping FGFs and preventing them from binding other receptors.
Probab=97.24 E-value=0.00042 Score=57.56 Aligned_cols=44 Identities=25% Similarity=0.516 Sum_probs=34.6
Q ss_pred CcEEEEecCC-CCCcccc----CCCeEEEcccCCCCCeeEEEEEcCCCC
Q psy9228 2 AYIKWSRADG-LPLQRYA----EGNVLRITNARLQDSGKYKCEIQGHDS 45 (834)
Q Consensus 2 ~~~~w~~~~~-~~~~~~~----~~~~l~~~~~~~~d~g~y~c~~~~~~~ 45 (834)
+++.|.|.+. ++..... .+..|.|.+++.+|+|.|.|.+.|..+
T Consensus 24 p~i~W~k~~~~~~~~~~~~~~~~~~~L~i~~v~~~D~G~Y~C~a~N~~G 72 (82)
T cd05856 24 PDITWLKDNKPLTPTEIGESRKKKWTLSLKNLKPEDSGKYTCHVSNRAG 72 (82)
T ss_pred CcEEEEECCcCCcCCccceecCceEEEEEccCChhhCEEEEEEEEcCCc
Confidence 4799999766 6443222 246899999999999999999988776
No 96
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes []. +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=97.23 E-value=0.00018 Score=50.93 Aligned_cols=33 Identities=36% Similarity=0.892 Sum_probs=28.5
Q ss_pred CccCCCCC-CccCCCceeeecCCCcEEcCCCCCC
Q psy9228 608 ENNMCMKG-DVCKNGGMCKVTPDSYECLCSLGYA 640 (834)
Q Consensus 608 ~~~~C~~~-~pC~ngg~C~~~~~~~~C~C~~g~~ 640 (834)
++|||... ++|..++.|++..++|+|.|++||.
T Consensus 1 DidEC~~~~~~C~~~~~C~N~~Gsy~C~C~~Gy~ 34 (42)
T PF07645_consen 1 DIDECAEGPHNCPENGTCVNTEGSYSCSCPPGYE 34 (42)
T ss_dssp ESSTTTTTSSSSSTTSEEEEETTEEEEEESTTEE
T ss_pred CccccCCCCCcCCCCCEEEcCCCCEEeeCCCCcE
Confidence 36788764 3799999999999999999999987
No 97
>cd05849 Ig1_Contactin-1 First Ig domain of contactin-1. Ig1_Contactin-1: First Ig domain of the neural cell adhesion molecule contactin-1. Contactins are comprised of six Ig domains followed by four fibronectin type III (FnIII) domains anchored to the membrane by glycosylphosphatidylinositol. Contactin-1 is differentially expressed in tumor tissues and may, through a RhoA mechanism, facilitate invasion and metastasis of human lung adenocarcinoma.
Probab=97.22 E-value=0.00051 Score=58.64 Aligned_cols=44 Identities=30% Similarity=0.597 Sum_probs=34.5
Q ss_pred CcEEEEecCC-CCC-c-cc-cCCCeEEEcccC-CCCCeeEEEEEcCCCC
Q psy9228 2 AYIKWSRADG-LPL-Q-RY-AEGNVLRITNAR-LQDSGKYKCEIQGHDS 45 (834)
Q Consensus 2 ~~~~w~~~~~-~~~-~-~~-~~~~~l~~~~~~-~~d~g~y~c~~~~~~~ 45 (834)
.+|+|.|.+. |.. . +. ..++.|.|.++. .+|+|.|.|.++|..|
T Consensus 34 P~i~W~k~~~~i~~~~~~~~~~~g~L~I~~~~~~~D~G~Y~C~A~N~~G 82 (93)
T cd05849 34 PIYKWRKNNLDIDLTNDRYSMVGGNLVINNPDKYKDAGRYVCIVSNIYG 82 (93)
T ss_pred CEEEEEECCEEccCCCCeEEEECCEEEECcCCcCCCCEEEEEEEEeCcc
Confidence 4899999776 642 2 22 257899999985 6999999999998876
No 98
>cd05865 Ig1_NCAM-1 First immunoglobulin (Ig)-like domain of neural cell adhesion molecule NCAM-1. Ig1_NCAM-1: first immunoglobulin (Ig)-like domain of neural cell adhesion molecule NCAM-1. NCAM-1 plays important roles in the development and regeneration of the central nervous system, in synaptogenesis and neural migration. NCAM mediates cell-cell and cell-substratum recognition and adhesion via homophilic (NCAM-NCAM), and heterophilic (NCAM-nonNCAM), interactions. NCAM is expressed as three major isoforms having different intracellular extensions. The extracellular portion of NCAM has five N-terminal Ig-like domains and two fibronectin type III domains. The double zipper adhesion complex model for NCAM homophilic binding involves the Ig1, Ig2, and Ig3 domains. By this model, Ig1 and Ig2 mediate dimerization of NCAM molecules situated on the same cell surface (cis interactions), and Ig3 domains mediate interactions between NCAM molecules expressed on the surface of opposing cells (trans
Probab=97.22 E-value=0.0004 Score=59.64 Aligned_cols=43 Identities=35% Similarity=0.641 Sum_probs=32.9
Q ss_pred CcEEEEecCC--CCC-cccc----C---CCeEEEcccCCCCCeeEEEEEcCCC
Q psy9228 2 AYIKWSRADG--LPL-QRYA----E---GNVLRITNARLQDSGKYKCEIQGHD 44 (834)
Q Consensus 2 ~~~~w~~~~~--~~~-~~~~----~---~~~l~~~~~~~~d~g~y~c~~~~~~ 44 (834)
.+|+|.|.++ |.. .++. . .+.|+|.+|+.+|+|.|+|.++|..
T Consensus 31 p~i~W~~~~g~~l~~~~~r~~v~~~~~~~~~L~I~~v~~~D~G~YtC~A~N~~ 83 (96)
T cd05865 31 KDISWFSPNGEKLTPNQQRISVVRNDDYSSTLTIYNANIDDAGIYKCVVSNED 83 (96)
T ss_pred CEEEEECCCCcCccccCCeEEEEeCCCCceEEEEeccChhhCEEEEEEEEcCC
Confidence 4899998555 443 3222 1 2589999999999999999999875
No 99
>cd04977 Ig1_NCAM-1_like First immunoglobulin (Ig)-like domain of neural cell adhesion molecule NCAM-1 and similar proteins. Ig1_NCAM-1 like: first immunoglobulin (Ig)-like domain of neural cell adhesion molecule NCAM-1. NCAM-1 plays important roles in the development and regeneration of the central nervous system, in synaptogenesis and neural migration. NCAM mediates cell-cell and cell-substratum recognition and adhesion via homophilic (NCAM-NCAM), and heterophilic (NCAM-nonNCAM), interactions. NCAM is expressed as three major isoforms having different intracellular extensions. The extracellular portion of NCAM has five N-terminal Ig-like domains and two fibronectin type III domains. The double zipper adhesion complex model for NCAM homophilic binding involves the Ig1, Ig2, and Ig3 domains. By this model, Ig1 and Ig2 mediate dimerization of NCAM molecules situated on the same cell surface (cis interactions), and Ig3 domains mediate interactions between NCAM molecules expressed on the s
Probab=97.20 E-value=0.00055 Score=58.40 Aligned_cols=44 Identities=32% Similarity=0.573 Sum_probs=32.8
Q ss_pred CcEEEEecCC--CCC-cccc--C----CCeEEEcccCCCCCeeEEEEEcCCCC
Q psy9228 2 AYIKWSRADG--LPL-QRYA--E----GNVLRITNARLQDSGKYKCEIQGHDS 45 (834)
Q Consensus 2 ~~~~w~~~~~--~~~-~~~~--~----~~~l~~~~~~~~d~g~y~c~~~~~~~ 45 (834)
..|.|.|.++ +.. .+.. . ...|+|.+|+.+|+|.|+|.++|..+
T Consensus 29 ~~i~W~~~~~~~~~~~~~~~~~~~~~~~~~L~I~~v~~~D~G~Y~C~A~N~~g 81 (92)
T cd04977 29 KDISWFSPNGEKLVTQQQISVVQNDDVRSTLTIYNANIEDAGIYKCVATDAKG 81 (92)
T ss_pred CeEEEECCCCCEeccCCCEEEEeCCCCEEEEEEecCCcccCEEEEEEEEcCCC
Confidence 4799998666 332 2221 1 24899999999999999999998754
No 100
>cd05737 Ig_Myomesin_like_C C-temrinal immunoglobulin (Ig)-like domain of myomesin and M-protein. Ig_Myomesin_like_C: domain similar to the C-temrinal immunoglobulin (Ig)-like domain of myomesin and M-protein. Myomesin and M-protein are both structural proteins localized to the M-band, a transverse structure in the center of the sarcomere, and are candidates for M-band bridges. Both proteins are modular, consisting mainly of repetitive Ig-like and fibronectin type III (FnIII) domains. Myomesin is expressed in all types of vertebrate striated muscle; M-protein has a muscle-type specific expression pattern. Myomesin is present in both slow and fast fibers; M-protein is present only in fast fibers. It has been suggested that myomesin acts as a molecular spring with alternative splicing as a means of modifying its elasticity.
Probab=97.20 E-value=0.00062 Score=58.10 Aligned_cols=44 Identities=30% Similarity=0.496 Sum_probs=34.2
Q ss_pred CcEEEEecCC-CCCccc--c--C-C--CeEEEcccCCCCCeeEEEEEcCCCC
Q psy9228 2 AYIKWSRADG-LPLQRY--A--E-G--NVLRITNARLQDSGKYKCEIQGHDS 45 (834)
Q Consensus 2 ~~~~w~~~~~-~~~~~~--~--~-~--~~l~~~~~~~~d~g~y~c~~~~~~~ 45 (834)
++|.|.|.+. |....+ + . + ..|.|.+++.+|+|.|.|.++|..|
T Consensus 31 p~v~W~k~g~~l~~~~~~~~~~~~~~~~~L~I~~v~~~D~G~Y~C~a~N~~G 82 (92)
T cd05737 31 PEVSWLKNDQALALSDHYNVKVEQGKYASLTIKGVSSEDSGKYGIVVKNKYG 82 (92)
T ss_pred CeEEEEECCEECccCCCEEEEEcCCCEEEEEEccCChhhCEEEEEEEEECCC
Confidence 4799999766 654322 2 2 2 3799999999999999999998776
No 101
>cd04970 Ig6_Contactin_like Sixth Ig domain of contactin. Ig6_Contactin_like: Sixth Ig domain of contactins. Contactins are neural cell adhesion molecules and are comprised of six Ig domains followed by four fibronectin type III(FnIII) domains anchored to the membrane by glycosylphosphatidylinositol. The first four Ig domains form the intermolecular binding fragment, which arranges as a compact U-shaped module via contacts between Ig domains 1 and 4, and between Ig domains 2 and 3. Contactin-2 (TAG-1, axonin-1) may play a part in the neuronal processes of neurite outgrowth, axon guidance and fasciculation, and neuronal migration. This group also includes contactin-1 and contactin-5. The different contactins show different expression patterns in the central nervous system. During development and in adulthood, contactin-2 is transiently expressed in subsets of central and peripheral neurons. Contactin-5 is expressed specifically in the rat postnatal nervous system, peaking at about 3 week
Probab=97.19 E-value=0.00038 Score=58.39 Aligned_cols=43 Identities=28% Similarity=0.463 Sum_probs=33.1
Q ss_pred cEEEEecCC-CCCcc-------c---cCCCeEEEcccCCCCCeeEEEEEcCCCC
Q psy9228 3 YIKWSRADG-LPLQR-------Y---AEGNVLRITNARLQDSGKYKCEIQGHDS 45 (834)
Q Consensus 3 ~~~w~~~~~-~~~~~-------~---~~~~~l~~~~~~~~d~g~y~c~~~~~~~ 45 (834)
++.|.|.+. |.... . ..++.|+|.+|+.+|+|.|+|.+.|.-+
T Consensus 18 ~~~W~~~g~~i~~~~~~~~~~~~~~~~~~~~L~I~~v~~~D~G~Y~C~a~n~~g 71 (85)
T cd04970 18 TFTWSFNGVPIDFDKDGGHYRRVGGKDSNGDLMIRNAQLKHAGKYTCTAQTVVD 71 (85)
T ss_pred EEEEEECCeEeeccCCCccEEEEecccccceEEEccCCHHhCeeeEEEEecCCC
Confidence 679999555 65421 1 1467899999999999999999987654
No 102
>cd05894 Ig_C5_MyBP-C C5 immunoglobulin (Ig) domain of cardiac myosin binding protein C (MyBP-C). Ig_C5_MyBP_C : the C5 immunoglobulin (Ig) domain of cardiac myosin binding protein C (MyBP-C). MyBP_C consists of repeated domains, Ig and fibronectin type 3, and various linkers. Three isoforms of MYBP_C exist and are included in this group: cardiac(c), and fast and slow skeletal muscle (s) MyBP_C. cMYBP_C has insertions between and inside domains and an additional cardiac-specific Ig domain at the N-terminus. For cMYBP_C an interaction has been demonstrated between this C5 domain and the Ig C8 domain.
Probab=97.19 E-value=0.0005 Score=57.78 Aligned_cols=44 Identities=20% Similarity=0.352 Sum_probs=34.6
Q ss_pred CcEEEEecCC-CCC-cccc------CCCeEEEcccCCCCCeeEEEEEcCCCC
Q psy9228 2 AYIKWSRADG-LPL-QRYA------EGNVLRITNARLQDSGKYKCEIQGHDS 45 (834)
Q Consensus 2 ~~~~w~~~~~-~~~-~~~~------~~~~l~~~~~~~~d~g~y~c~~~~~~~ 45 (834)
++|+|.|.+. |.. ..+. ....|.|.+++.+|+|.|.|.+.|..|
T Consensus 25 P~v~W~k~~~~i~~~~~r~~~~~~~~~~~L~I~~~~~~D~G~Y~c~a~N~~G 76 (86)
T cd05894 25 PTVTWSRGDKAFTETEGRVRVESYKDLSSFVIEGAEREDEGVYTITVTNPVG 76 (86)
T ss_pred CeEEEEECCEECccCCCeEEEEEcCCeEEEEECCCccCcCEEEEEEEEeCCC
Confidence 5899999666 643 2122 236899999999999999999998877
No 103
>cd05730 Ig3_NCAM-1_like Third immunoglobulin (Ig)-like domain of Neural Cell Adhesion Molecule NCAM-1 (NCAM). Ig3_NCAM-1_like: domain similar to the third immunoglobulin (Ig)-like domain of Neural Cell Adhesion Molecule NCAM-1 (NCAM). NCAM plays important roles in the development and regeneration of the central nervous system, in synaptogenesis and neural migration. NCAM mediates cell-cell and cell-substratum recognition and adhesion via homophilic (NCAM-NCAM), and heterophilic (NCAM-non-NCAM), interactions. NCAM is expressed as three major isoforms having different intracellular extensions. The extracellular portion of NCAM has five N-terminal Ig-like domains and two fibronectin type III domains. The double zipper adhesion complex model for NCAM homophilic binding involves Ig1, Ig2, and Ig3. By this model, Ig1,and Ig2 mediate dimerization of NCAM molecules situated on the same cell surface (cis interactions), and Ig3 domains mediate interactions between NCAM molecules expressed on the
Probab=97.19 E-value=0.00056 Score=58.79 Aligned_cols=44 Identities=14% Similarity=0.387 Sum_probs=35.3
Q ss_pred CcEEEEecCC-CCCc-ccc----CCCeEEEcccCCCCCeeEEEEEcCCCC
Q psy9228 2 AYIKWSRADG-LPLQ-RYA----EGNVLRITNARLQDSGKYKCEIQGHDS 45 (834)
Q Consensus 2 ~~~~w~~~~~-~~~~-~~~----~~~~l~~~~~~~~d~g~y~c~~~~~~~ 45 (834)
++|.|.|.+. |+.. .+. .+..|+|.+|+.+|+|.|.|.+.|..|
T Consensus 33 p~v~W~k~g~~~~~~~~~~~~~~~~~~L~I~~v~~~D~G~Y~C~a~N~~G 82 (95)
T cd05730 33 PTMTWTKDGEPIESGEEKYSFNEDGSEMTILDVDKLDEAEYTCIAENKAG 82 (95)
T ss_pred CEEEEEECCEECcCCCCEEEEeCCCCEEEECCCChhhCEEEEEEEEcCCC
Confidence 5799999665 7654 221 356899999999999999999998876
No 104
>cd05854 Ig6_Contactin-2 Sixth Ig domain of contactin-2. Ig6_Contactin-2: Sixth Ig domain of the neural cell adhesion molecule contactin-2-like. Contactins are comprised of six Ig domains followed by four fibronectin type III (FnIII) domains anchored to the membrane by glycosylphosphatidylinositol. Contactin-2 (TAG-1, axonin-1) facilitates cell adhesion by homophilic binding between molecules in apposed membranes. It may play a part in the neuronal processes of neurite outgrowth, axon guidance and fasciculation, and neuronal migration. The first four Ig domains form the intermolecular binding fragment, which arranges as a compact U-shaped module by contacts between IG domains 1 and 4, and domains 2 and 3. The different contactins show different expression patterns in the central nervous system. During development and in adulthood, contactin-2 is transiently expressed in subsets of central and peripheral neurons. Contactin-2 is also expressed in retinal amacrine cells in the developing c
Probab=97.18 E-value=0.00044 Score=57.97 Aligned_cols=43 Identities=30% Similarity=0.420 Sum_probs=33.2
Q ss_pred cEEEEecCC-CCCcc---c---c----CCCeEEEcccCCCCCeeEEEEEcCCCC
Q psy9228 3 YIKWSRADG-LPLQR---Y---A----EGNVLRITNARLQDSGKYKCEIQGHDS 45 (834)
Q Consensus 3 ~~~w~~~~~-~~~~~---~---~----~~~~l~~~~~~~~d~g~y~c~~~~~~~ 45 (834)
+|+|.|.+. |+... + . ..+.|.|.+|+.+|+|.|+|.++|.-+
T Consensus 18 ~v~W~~~g~~i~~~~~~~~~~~~~~~~~~~~L~I~~v~~~D~G~YtC~A~n~~g 71 (85)
T cd05854 18 TFTWSLDDFPIDLDKPNGHYRRMEVKETIGDLVIVNAQLSHAGTYTCTAQTVVD 71 (85)
T ss_pred EEEEEECCeEccccCCCCcEEEEEecceEeEEEEccCChhhCeEEEEEEecCCC
Confidence 689999555 65421 1 1 246899999999999999999988754
No 105
>cd05848 Ig1_Contactin-5 First Ig domain of contactin-5. Ig1_Contactin-5: First Ig domain of the neural cell adhesion molecule contactin-5. Contactins are comprised of six Ig domains followed by four fibronectin type III (FnIII) domains, anchored to the membrane by glycosylphosphatidylinositol. The different contactins show different expression patterns in the central nervous system. In rats, a lack of contactin-5 (NB-2) results in an impairment of the neuronal activity in the auditory system. Contactin-5 is expressed specifically in the postnatal nervous system, peaking at about 3 weeks postnatal. Contactin-5 is highly expressed in the adult human brain in the occipital lobe and in the amygdala; lower levels of expression have been detected in the corpus callosum, caudate nucleus, and spinal cord.
Probab=97.15 E-value=0.00079 Score=57.66 Aligned_cols=44 Identities=23% Similarity=0.419 Sum_probs=34.3
Q ss_pred CcEEEEecCC-CCCcc----ccCCCeEEEcccC-CCCCeeEEEEEcCCCC
Q psy9228 2 AYIKWSRADG-LPLQR----YAEGNVLRITNAR-LQDSGKYKCEIQGHDS 45 (834)
Q Consensus 2 ~~~~w~~~~~-~~~~~----~~~~~~l~~~~~~-~~d~g~y~c~~~~~~~ 45 (834)
.+|+|.|.+. |.... ...++.|.|.++. .+|+|.|.|.++|..|
T Consensus 34 P~i~W~k~g~~l~~~~~~~~~~~~g~L~i~~~~~~~D~G~Y~C~A~N~~G 83 (94)
T cd05848 34 PTYRWLRNGTEIDTESDYRYSLIDGNLIISNPSEVKDSGRYQCLATNSIG 83 (94)
T ss_pred CEEEEEECCeECccCCCceEEeeCCeEEEccCCccCcCEEEEEEEEcCCC
Confidence 4899999665 75422 1257899999985 6999999999998865
No 106
>cd05857 Ig2_FGFR Second immunoglobulin (Ig)-like domain of fibroblast growth factor (FGF) receptor. Ig2_FGFR: second immunoglobulin (Ig)-like domain of fibroblast growth factor (FGF) receptor. FGF receptors bind FGF signaling polypeptides. FGFs participate in multiple processes such as morphogenesis, development, and angiogenesis. FGFs bind to four FGF receptor tyrosine kinases (FGFR1, -2, -3, -4). Receptor diversity is controlled by alternative splicing producing splice variants with different ligand binding characteristics and different expression patterns. FGFRs have an extracellular region comprised of three IG-like domains, a single transmembrane helix, and an intracellular tyrosine kinase domain. Ligand binding and specificity reside in the Ig-like domains 2 and 3, and the linker region that connects these two. FGFR activation and signaling depend on FGF-induced dimerization, a process involving cell surface heparin or heparin sulfate proteoglycans.
Probab=97.14 E-value=0.00071 Score=56.70 Aligned_cols=44 Identities=14% Similarity=0.351 Sum_probs=34.9
Q ss_pred CcEEEEecCC-CCCcccc-------CCCeEEEcccCCCCCeeEEEEEcCCCC
Q psy9228 2 AYIKWSRADG-LPLQRYA-------EGNVLRITNARLQDSGKYKCEIQGHDS 45 (834)
Q Consensus 2 ~~~~w~~~~~-~~~~~~~-------~~~~l~~~~~~~~d~g~y~c~~~~~~~ 45 (834)
++|+|.|.+. |....+. ....|.|.+++.+|+|.|.|.+.|..|
T Consensus 24 p~i~W~k~g~~l~~~~~~~~~~~~~~~~~l~i~~~~~~D~G~Y~C~a~N~~G 75 (85)
T cd05857 24 PTMRWLKNGKEFKQEHRIGGYKVRNQHWSLIMESVVPSDKGNYTCVVENEYG 75 (85)
T ss_pred CEEEEEECCEECCCCCceeeeEEeCCceEEEEccCCcccCEEEEEEEEeCCC
Confidence 4799999666 6554322 345799999999999999999998876
No 107
>KOG4260|consensus
Probab=97.14 E-value=0.00044 Score=67.42 Aligned_cols=81 Identities=26% Similarity=0.660 Sum_probs=60.6
Q ss_pred CCCCccCCCCCCCCCCCCEec-cCCCCCceEeeCCCCCCCCCCCC-----------------------------------
Q psy9228 133 DSCNTCKSSKHNNCINNGLCQ-DAATRIGYTCICPPGFSGDRCSV----------------------------------- 176 (834)
Q Consensus 133 ~~c~~C~~~~~~pC~n~g~C~-~~~~~~~~~C~C~~g~~G~~Ce~----------------------------------- 176 (834)
++|-.|.--...||..+|.|. +....++-.|.|-+||+|+.|..
T Consensus 139 pdCl~Cpggser~C~GnG~C~GdGsR~GsGkCkC~~GY~Gp~C~~Cg~eyfes~Rne~~lvCt~Ch~~C~~~Csg~~~k~ 218 (350)
T KOG4260|consen 139 PDCLQCPGGSERPCFGNGSCHGDGSREGSGKCKCETGYTGPLCRYCGIEYFESSRNEQHLVCTACHEGCLGVCSGESSKG 218 (350)
T ss_pred CccccCCCCCcCCcCCCCcccCCCCCCCCCcccccCCCCCccccccchHHHHhhcccccchhhhhhhhhhcccCCCCCCC
Confidence 567777211157899999998 33334567999999999988742
Q ss_pred ----------------CCCCcc--CCCCCC-CeeecCCCCeEEeCCCCcccC--cccc
Q psy9228 177 ----------------LGEPCY--PGACGD-GSCQDVDGAMKCLCPIGTAGK--RCEQ 213 (834)
Q Consensus 177 ----------------~~~~C~--~~~C~~-g~C~~~~~~~~C~C~~g~~G~--~Ce~ 213 (834)
|+|+|. +.||.. -.|+|..++|.|.+.+||.+. .|+.
T Consensus 219 C~kCkkGW~lde~gCvDvnEC~~ep~~c~~~qfCvNteGSf~C~dk~Gy~~g~d~C~~ 276 (350)
T KOG4260|consen 219 CSKCKKGWKLDEEGCVDVNECQNEPAPCKAHQFCVNTEGSFKCEDKEGYKKGVDECQF 276 (350)
T ss_pred hhhhcccceecccccccHHHHhcCCCCCChhheeecCCCceEecccccccCChHHhhh
Confidence 266675 477884 699999999999999998863 4544
No 108
>cd05897 Ig2_IL1R2_like Second immunoglobulin (Ig)-like domain of interleukin-1 receptor-2 (IL1R2). Ig2_IL1R2_like: domain similar to the second immunoglobulin (Ig)-like domain of interleukin-1 receptor-2 (IL1R2). IL-1 alpha and IL-1 beta are cytokines which participate in the regulation of inflammation, immune responses, and hematopoiesis. These cytokines bind to the IL-1 receptor type 1 (IL1R1), which is activated on additional association with an accessory protein, IL1RAP. IL-1 also binds the type II (IL1R2) represented in this group. Mature IL1R2 consists of three IG-like domains, a transmembrane domain, and a short cytoplasmic domain. It lacks the large cytoplasmic domain of Mature IL1R1, and does not initiate signal transduction. A naturally occurring cytokine IL-1RA (IL-1 receptor antagonist) is widely expressed and binds to IL-1 receptors, inhibiting the binding of IL-1 alpha and IL-1 beta.
Probab=97.14 E-value=0.00045 Score=58.74 Aligned_cols=44 Identities=25% Similarity=0.465 Sum_probs=33.9
Q ss_pred CCcEEEEecCC-CCCcccc-----CCCeEEEcccCCCCCeeEEEEEcCCC
Q psy9228 1 NAYIKWSRADG-LPLQRYA-----EGNVLRITNARLQDSGKYKCEIQGHD 44 (834)
Q Consensus 1 ~~~~~w~~~~~-~~~~~~~-----~~~~l~~~~~~~~d~g~y~c~~~~~~ 44 (834)
+++|+|.|++. |+..... ....|.|.+|..+|+|.|.|.+++..
T Consensus 29 ~~~v~WYKd~~~l~~~~~~~~~~~~~~~L~I~~v~~~D~G~YtC~~~~~~ 78 (95)
T cd05897 29 DVELQWYKDSVLLDKDNEKFYSLKGSTYLHIIDVSLNDSGYYTCKLQFTH 78 (95)
T ss_pred CCcEEEccCCEECcCCCcceEecCCCCEEEEEEcChhhCEEEEEEEEEee
Confidence 35899999555 7643211 34689999999999999999998664
No 109
>cd05850 Ig1_Contactin-2 First Ig domain of contactin-2. Ig1_Contactin-2: First Ig domain of the neural cell adhesion molecule contactin-2-like. Contactins are comprised of six Ig domains followed by four fibronectin type III (FnIII) domains anchored to the membrane by glycosylphosphatidylinositol. Contactin-2 (TAG-1, axonin-1) facilitates cell adhesion by homophilic binding between molecules in apposed membranes. It may play a part in the neuronal processes of neurite outgrowth, axon guidance and fasciculation, and neuronal migration. The first four Ig domains form the intermolecular binding fragment, which arranges as a compact U-shaped module by contacts between IG domains 1 and 4, and domains 2 and 3. The different contactins show different expression patterns in the central nervous system. During development and in adulthood, contactin-2 is transiently expressed in subsets of central and peripheral neurons. Contactin-2 is also expressed in retinal amacrine cells in the developing c
Probab=97.13 E-value=0.00077 Score=57.73 Aligned_cols=44 Identities=20% Similarity=0.373 Sum_probs=33.9
Q ss_pred CcEEEEecCC-CCCcc----ccCCCeEEEcccC-CCCCeeEEEEEcCCCC
Q psy9228 2 AYIKWSRADG-LPLQR----YAEGNVLRITNAR-LQDSGKYKCEIQGHDS 45 (834)
Q Consensus 2 ~~~~w~~~~~-~~~~~----~~~~~~l~~~~~~-~~d~g~y~c~~~~~~~ 45 (834)
++|+|+|.+. |+... ...++.|.|.++. .+|+|.|.|.++|..|
T Consensus 34 p~i~W~~~g~~l~~~~~~~~~~~~g~L~I~~~~~~~D~G~Y~C~A~N~~G 83 (94)
T cd05850 34 ATYRWKMNGTEIKFAPESRYTLVAGNLVINNPQKARDAGSYQCLAINRCG 83 (94)
T ss_pred CEEEEEECCEECccCCCceEEEECCeEEEccCCccCcCEEEEEEEEcCcC
Confidence 5899999665 75322 1257899998865 5999999999998765
No 110
>cd05736 Ig2_Follistatin_like Second immunoglobulin (Ig)-like domain of a follistatin-like molecule encoded by the Mahya gene and similar proteins. Ig2_Follistatin_like: domain similar to the second immunoglobulin (Ig)-like domain found in a follistatin-like molecule encoded by the CNS-related Mahya gene. Mahya genes have been retained in certain Bilaterian branches during evolution. They are conserved in Hymenoptera and Deuterostomes, but are absent from other metazoan species such as fruit fly and nematode. Mahya proteins are secretory, with a follistatin-like domain (Kazal-type serine/threonine protease inhibitor domain and EF-hand calcium-binding domain), two Ig-like domains, and a novel C-terminal domain. Mahya may be involved in learning and memory and in processing of sensory information in Hymenoptera and vertebrates. Follistatin is a secreted, multidomain protein that binds activins with high affinity and antagonizes their signaling.
Probab=97.11 E-value=0.001 Score=54.41 Aligned_cols=44 Identities=23% Similarity=0.508 Sum_probs=34.0
Q ss_pred CcEEEEecCC-CCCccc----c--CCCeEEEcccCCCCCeeEEEEEcCCCC
Q psy9228 2 AYIKWSRADG-LPLQRY----A--EGNVLRITNARLQDSGKYKCEIQGHDS 45 (834)
Q Consensus 2 ~~~~w~~~~~-~~~~~~----~--~~~~l~~~~~~~~d~g~y~c~~~~~~~ 45 (834)
+.|.|.|.+. |..... . .+..|.|.+++.+|+|.|.|.+.|..+
T Consensus 13 p~v~W~k~~~~l~~~~~~~~~~~~~~~~l~I~~~~~~D~G~Y~C~a~N~~G 63 (76)
T cd05736 13 PRLTWLKNGMDITPKLSKQLTLIANGSELHISNVRYEDTGAYTCIAKNEAG 63 (76)
T ss_pred CEEEEEECCEECCCCCCccEEEeCCCCEEEECcCCcccCEEEEEEEEcCCC
Confidence 4799999666 543321 1 345799999999999999999998765
No 111
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=97.10 E-value=0.00062 Score=46.21 Aligned_cols=29 Identities=48% Similarity=1.175 Sum_probs=26.9
Q ss_pred ccCCCceeeecCCCcEEcCCCCCCCC-Ccc
Q psy9228 617 VCKNGGMCKVTPDSYECLCSLGYAPP-NCA 645 (834)
Q Consensus 617 pC~ngg~C~~~~~~~~C~C~~g~~G~-~Ce 645 (834)
+|.+++.|++..++|.|.|+.||.|. .|+
T Consensus 7 ~C~~~~~C~~~~~~~~C~C~~g~~g~~~C~ 36 (36)
T cd00053 7 PCSNGGTCVNTPGSYRCVCPPGYTGDRSCE 36 (36)
T ss_pred CCCCCCEEecCCCCeEeECCCCCcccCCcC
Confidence 89999999999999999999999998 764
No 112
>PF00047 ig: Immunoglobulin domain The Prosite family only concerns antibodies and MHCs.; InterPro: IPR013151 Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involved in protein-protein and protein-ligand interactions. The Pfam alignments do not include the first and last strand of the immunoglobulin-like domain.; PDB: 1B6U_A 3O4O_C 3VH8_H 1BIH_A 2C9A_A 2V5Y_A 1BQH_K 2ATP_A 3B9K_A 1NEZ_H ....
Probab=97.08 E-value=0.00043 Score=54.41 Aligned_cols=39 Identities=31% Similarity=0.606 Sum_probs=30.6
Q ss_pred CcEEEEecCC-CCCcccc----CCCe----EEEcccCCCCCeeEEEEE
Q psy9228 2 AYIKWSRADG-LPLQRYA----EGNV----LRITNARLQDSGKYKCEI 40 (834)
Q Consensus 2 ~~~~w~~~~~-~~~~~~~----~~~~----l~~~~~~~~d~g~y~c~~ 40 (834)
..+.|.|.+. ++..... .... |.|.+|+.+|+|.|.|.+
T Consensus 17 ~~~~W~~~~~~~~~~~~~~~~~~~~~~~~~L~i~~v~~~d~G~Y~C~v 64 (64)
T PF00047_consen 17 TTVTWSKNGQSLPEGSTTNRSTNSVSSTSRLTISNVTPEDSGTYTCVV 64 (64)
T ss_dssp SEEEEEETTTTTSEEEEEEEEETTTEEEEEEEESSCTGGGTEEEEEEE
T ss_pred cEEEEEECCccccCcceeEeecccceeeeEEEEccCCHHHCEEEEEEC
Confidence 4799999887 6665444 2222 999999999999999985
No 113
>cd05747 Ig5_Titin_like M5, fifth immunoglobulin (Ig)-like domain of human titin C terminus and similar proteins. Ig5_Titin_like: domain similar to the M5, fifth immunoglobulin (Ig)-like domain from the human titin C terminus. Titin (also called connectin) is a fibrous sarcomeric protein specifically found in vertebrate striated muscle. Titin is gigantic; depending on isoform composition it ranges from 2970 to 3700 kDa, and is of a length that spans half a sarcomere. Titin largely consists of multiple repeats of Ig-like and fibronectin type 3 (FN-III)-like domains. Titin connects the ends of myosin thick filaments to Z disks and extends along the thick filament to the H zone, and appears to function similar to an elastic band, keeping the myosin filaments centered in the sarcomere during muscle contraction or stretching.
Probab=97.08 E-value=0.00087 Score=57.17 Aligned_cols=44 Identities=14% Similarity=0.403 Sum_probs=34.5
Q ss_pred CcEEEEecCC-CCCcccc------CCCeEEEcccCCCCCeeEEEEEcCCCC
Q psy9228 2 AYIKWSRADG-LPLQRYA------EGNVLRITNARLQDSGKYKCEIQGHDS 45 (834)
Q Consensus 2 ~~~~w~~~~~-~~~~~~~------~~~~l~~~~~~~~d~g~y~c~~~~~~~ 45 (834)
+++.|.|.+. |....+. ....|+|.+++.+|+|.|.|.++|..|
T Consensus 33 p~v~W~k~g~~l~~~~~~~~~~~~~~~~L~i~~~~~~D~G~Y~C~a~N~~G 83 (92)
T cd05747 33 PTVTWMREGQIIVSSQRHQITSTEYKSTFEISKVQMSDEGNYTVVVENSEG 83 (92)
T ss_pred CEEEEEECCEECCCCCcEEEEEcCCeeEEEECCCCcccCEeEEEEEEcCCC
Confidence 4799999766 6543222 235899999999999999999998876
No 114
>cd05875 Ig6_hNeurofascin_like Sixth immunoglobulin (Ig)-like domain of human neurofascin (NF). Ig6_hNeurofascin_like: the sixth immunoglobulin (Ig)-like domain of human neurofascin (NF). NF belongs to the L1 subfamily of cell adhesion molecules (CAMs) and is comprised of an extracellular region having six Ig-like domains and five fibronectin type III domains, a transmembrane region, and a cytoplasmic domain. NF has many alternatively spliced isoforms having different temporal expression patterns during development. NF participates in axon subcellular targeting and synapse formation, however little is known of the functions of the different isoforms.
Probab=97.07 E-value=0.0011 Score=54.38 Aligned_cols=44 Identities=16% Similarity=0.325 Sum_probs=32.5
Q ss_pred CcEEEEecCC-CCC--cccc----CCCeEEEcccCC----CCCeeEEEEEcCCCC
Q psy9228 2 AYIKWSRADG-LPL--QRYA----EGNVLRITNARL----QDSGKYKCEIQGHDS 45 (834)
Q Consensus 2 ~~~~w~~~~~-~~~--~~~~----~~~~l~~~~~~~----~d~g~y~c~~~~~~~ 45 (834)
++|+|.|.+. |+. +.+. .++.|.|.+++. +|+|.|.|.++|..|
T Consensus 13 P~v~W~k~g~~~~~~~~~~~~~~~~~~~L~i~~~~~~~~~~d~G~Y~C~A~N~~G 67 (77)
T cd05875 13 PTFQWTRNGKFFNVAKDPRVSMRRRSGTLVIDFSGGGRPEDYEGEYQCFARNNLG 67 (77)
T ss_pred CEEEEEECCEEccCcCCCcEEEeCCCceEEEeccCCCCCCCCCEEEEEEEEeccc
Confidence 5899999665 642 2111 478999998753 479999999998876
No 115
>cd04974 Ig3_FGFR Third immunoglobulin (Ig)-like domain of fibroblast growth factor receptor (FGFR). Ig3_FGFR: third immunoglobulin (Ig)-like domain of fibroblast growth factor receptor (FGFR). Fibroblast growth factors (FGFs) participate in morphogenesis, development, angiogenesis, and wound healing. These FGF-stimulated processes are mediated by four FGFR tyrosine kinases (FGRF1-4). FGFRs are comprised of an extracellular portion consisting of three Ig-like domains, a transmembrane helix, and a cytoplasmic portion having protein tyrosine kinase activity. The highly conserved Ig-like domains 2 and 3, and the linker region between D2 and D3 define a general binding site for FGFs.
Probab=97.03 E-value=0.001 Score=56.51 Aligned_cols=26 Identities=27% Similarity=0.530 Sum_probs=23.6
Q ss_pred CCeEEEcccCCCCCeeEEEEEcCCCC
Q psy9228 20 GNVLRITNARLQDSGKYKCEIQGHDS 45 (834)
Q Consensus 20 ~~~l~~~~~~~~d~g~y~c~~~~~~~ 45 (834)
...|.|.+|+.+|+|.|.|.+.|..|
T Consensus 54 ~~~L~I~~v~~~D~G~Y~C~A~N~~G 79 (90)
T cd04974 54 SEVLYLRNVSFDDAGEYTCLAGNSIG 79 (90)
T ss_pred cceEEEeccccccCcEEEEEeecccC
Confidence 35899999999999999999998876
No 116
>cd05735 Ig8_DSCAM Eight immunoglobulin (Ig) domain of Down Syndrome Cell Adhesion molecule (DSCAM). Ig8_DSCAM: the eight immunoglobulin (Ig) domain of Down Syndrome Cell Adhesion molecule (DSCAM). DSCAM is a cell adhesion molecule expressed largely in the developing nervous system. The gene encoding DSCAM is located at human chromosome 21q22, the locus associated with the mental retardation phenotype of Down Syndrome. DSCAM is predicted to be the largest member of the IG superfamily. It has been demonstrated that DSCAM can mediate cation-independent homophilic intercellular adhesion.
Probab=97.02 E-value=0.00075 Score=57.01 Aligned_cols=44 Identities=23% Similarity=0.506 Sum_probs=33.9
Q ss_pred CcEEEEecCC-CCCc--ccc-C-----C---CeEEEcccCCCCCeeEEEEEcCCCC
Q psy9228 2 AYIKWSRADG-LPLQ--RYA-E-----G---NVLRITNARLQDSGKYKCEIQGHDS 45 (834)
Q Consensus 2 ~~~~w~~~~~-~~~~--~~~-~-----~---~~l~~~~~~~~d~g~y~c~~~~~~~ 45 (834)
+.|.|.|.+. |... ++. . . ..|+|.+++.+|+|.|+|.+.|..|
T Consensus 16 ~~i~W~k~~~~i~~~~~r~~~~~~~~~~~~~s~L~I~~~~~~D~G~YtC~A~N~~G 71 (88)
T cd05735 16 IIVRWEKEDRIINPEMSRYLVSTKEVGDEVISTLQILPTVREDSGFFSCHAINSYG 71 (88)
T ss_pred CEEEEeeCCEECCCCCCcEEEEEecCCCcEEEEEEECCCCcccCEEEEEEEEcCCC
Confidence 5799999665 6432 221 1 1 5799999999999999999999887
No 117
>smart00408 IGc2 Immunoglobulin C-2 Type.
Probab=97.00 E-value=0.0014 Score=50.76 Aligned_cols=43 Identities=30% Similarity=0.604 Sum_probs=33.6
Q ss_pred CcEEEEecCC-CCCcccc--CCCeEEEcccCCCCCeeEEEEEcCCC
Q psy9228 2 AYIKWSRADG-LPLQRYA--EGNVLRITNARLQDSGKYKCEIQGHD 44 (834)
Q Consensus 2 ~~~~w~~~~~-~~~~~~~--~~~~l~~~~~~~~d~g~y~c~~~~~~ 44 (834)
.++.|.|.+. ++..... .+..|.|.+++.+|+|.|.|.+.+..
T Consensus 17 ~~v~W~~~~~~~~~~~~~~~~~~~L~i~~~~~~d~G~Y~C~~~n~~ 62 (63)
T smart00408 17 PNITWLKDGKPLPESNRLSASGSTLTIKSVSLEDSGEYTCVAENSA 62 (63)
T ss_pred CeEEEEECCEECCCCCEEecCCcEEEEeeCCcccCEEEEEEEecCC
Confidence 4799999666 7732222 46799999999999999999997653
No 118
>cd04967 Ig1_Contactin First Ig domain of contactin. Ig1_Contactin: First Ig domain of contactins. Contactins are neural cell adhesion molecules and are comprised of six Ig domains followed by four fibronectin type III(FnIII) domains anchored to the membrane by glycosylphosphatidylinositol. The first four Ig domains form the intermolecular binding fragment, which arranges as a compact U-shaped module via contacts between Ig domains 1 and 4, and between Ig domains 2 and 3. Contactin-2 (TAG-1, axonin-1) may play a part in the neuronal processes of neurite outgrowth, axon guidance and fasciculation, and neuronal migration. This group also includes contactin-1 and contactin-5. The different contactins show different expression patterns in the central nervous system. During development and in adulthood, contactin-2 is transiently expressed in subsets of central and peripheral neurons. Contactin-5 is expressed specifically in the rat postnatal nervous system, peaking at about 3 weeks postnata
Probab=96.96 E-value=0.0013 Score=56.05 Aligned_cols=44 Identities=20% Similarity=0.359 Sum_probs=33.5
Q ss_pred CcEEEEecCC-CCCcc----ccCCCeEEEcccCC-CCCeeEEEEEcCCCC
Q psy9228 2 AYIKWSRADG-LPLQR----YAEGNVLRITNARL-QDSGKYKCEIQGHDS 45 (834)
Q Consensus 2 ~~~~w~~~~~-~~~~~----~~~~~~l~~~~~~~-~d~g~y~c~~~~~~~ 45 (834)
++|+|.|.+. |.... ...++.|.|.++.. +|+|.|.|.++|..|
T Consensus 34 p~i~W~k~~~~l~~~~~~~~~~~~~~L~i~~~~~~~d~G~Y~C~a~N~~G 83 (91)
T cd04967 34 PTYRWLMNGTEIDDEPDSRYSLVGGNLVISNPSKAKDAGRYQCLASNIVG 83 (91)
T ss_pred CEEEEEECCEECCCCCCCCEEEECCEEEEecCCccCCCEEEEEEEEcCCC
Confidence 5899999555 63321 12468999999874 999999999998765
No 119
>cd04972 Ig_TrkABC_d4 Fourth domain (immunoglobulin-like) of Trk receptors TrkA, TrkB and TrkC. TrkABC_d4: the fourth domain of Trk receptors TrkA, TrkB and TrkC, this is an immunoglobulin (Ig)-like domain which binds to neurotrophin. The Trk family of receptors are tyrosine kinase receptors. They are activated by dimerization, leading to autophosphorylation of intracellular tyrosine residues, and triggering the signal transduction pathway. TrkA, TrkB, and TrkC share significant sequence homology and domain organization. The first three domains are leucine-rich domains. The fourth and fifth domains are Ig-like domains playing a part in ligand binding. TrkA, Band C mediate the trophic effects of the neurotrophin Nerve growth factor (NGF) family. TrkA is recognized by NGF. TrKB is recognized by brain-derived neurotrophic factor (BDNF) and neurotrophin (NT)-4. TrkC is recognized by NT-3. NT-3 is promiscuous as in some cell systems it activates TrkA and TrkB receptors. TrkA is a receptor fo
Probab=96.95 E-value=0.0011 Score=56.19 Aligned_cols=44 Identities=14% Similarity=0.190 Sum_probs=35.2
Q ss_pred CcEEEEecCC-CCCcccc------CCCeEEEcccCCCCCeeEEEEEcCCCC
Q psy9228 2 AYIKWSRADG-LPLQRYA------EGNVLRITNARLQDSGKYKCEIQGHDS 45 (834)
Q Consensus 2 ~~~~w~~~~~-~~~~~~~------~~~~l~~~~~~~~d~g~y~c~~~~~~~ 45 (834)
++|.|.|.+. +...+.. ....|+|.+|+.+|.|.|.|.++|.-|
T Consensus 30 p~v~W~~~g~~~~~~~~~~~~~~~~~~~L~I~~v~~~D~g~Y~C~A~N~~G 80 (90)
T cd04972 30 PKVEWIIAGLIVIQTRTDTLETTVDIYNLQLSNITSETQTTVTCTAENPVG 80 (90)
T ss_pred CeEEEEECCEEccCCccEEEEecCCEEEEEEecCCcccCEEEEEEEECCCC
Confidence 5899999666 6554331 345899999999999999999998876
No 120
>KOG1226|consensus
Probab=96.93 E-value=0.0041 Score=70.19 Aligned_cols=71 Identities=28% Similarity=0.779 Sum_probs=52.3
Q ss_pred ccCCCCCCCCCCCCEeccCCCCCceEeeCCCCCCCCCCC--CCCCCccCC---CCC-CCeeecCCCCeEEeCCCC-cccC
Q psy9228 137 TCKSSKHNNCINNGLCQDAATRIGYTCICPPGFSGDRCS--VLGEPCYPG---ACG-DGSCQDVDGAMKCLCPIG-TAGK 209 (834)
Q Consensus 137 ~C~~~~~~pC~n~g~C~~~~~~~~~~C~C~~g~~G~~Ce--~~~~~C~~~---~C~-~g~C~~~~~~~~C~C~~g-~~G~ 209 (834)
.|.+....-|.++|.|.-. .|.|.+||+|..|+ .+.+.|.+. -|. .|+|.-. +|.|... |.|.
T Consensus 548 sC~r~~g~lC~g~G~C~CG------~CvC~~GwtG~~C~C~~std~C~~~~G~iCSGrG~C~Cg----~C~C~~~~~sG~ 617 (783)
T KOG1226|consen 548 SCERHKGVLCGGHGRCECG------RCVCNPGWTGSACNCPLSTDTCESSDGQICSGRGTCECG----RCKCTDPPYSGE 617 (783)
T ss_pred ccccccCcccCCCCeEeCC------cEEcCCCCccCCCCCCCCCccccCCCCceeCCCceeeCC----ceEcCCCCcCcc
Confidence 4654445679999999864 69999999999885 456777652 354 3666643 6889766 9999
Q ss_pred cccccccc
Q psy9228 210 RCEQKIKI 217 (834)
Q Consensus 210 ~Ce~~~~~ 217 (834)
.||.....
T Consensus 618 ~CE~cptc 625 (783)
T KOG1226|consen 618 FCEKCPTC 625 (783)
T ss_pred hhhcCCCC
Confidence 99987543
No 121
>cd05869 Ig5_NCAM-1 Fifth immunoglobulin (Ig)-like domain of Neural Cell Adhesion Molecule NCAM-1 (NCAM). Ig5_NCAM-1: The fifth immunoglobulin (Ig)-like domain of Neural Cell Adhesion Molecule NCAM-1 (NCAM). NCAM plays important roles in the development and regeneration of the central nervous system, in synaptogenesis and neural migration. NCAM mediates cell-cell and cell-substratum recognition and adhesion via homophilic (NCAM-NCAM) and heterophilic (NCAM-non-NCAM) interactions. NCAM is expressed as three major isoforms having different intracellular extensions. The extracellular portion of NCAM has five N-terminal Ig-like domains and two fibronectin type III domains. The double zipper adhesion complex model for NCAM homophilic binding involves Ig1, Ig2, and Ig3. By this model, Ig1 and Ig2 mediate dimerization of NCAM molecules situated on the same cell surface (cis interactions), and Ig3 domains mediate interactions between NCAM molecules expressed on the surface of opposing cells (tr
Probab=96.91 E-value=0.0018 Score=55.89 Aligned_cols=44 Identities=16% Similarity=0.354 Sum_probs=33.5
Q ss_pred CcEEEEecCC-CCCccc-----c------CCCeEEEcccCCCCCeeEEEEEcCCCC
Q psy9228 2 AYIKWSRADG-LPLQRY-----A------EGNVLRITNARLQDSGKYKCEIQGHDS 45 (834)
Q Consensus 2 ~~~~w~~~~~-~~~~~~-----~------~~~~l~~~~~~~~d~g~y~c~~~~~~~ 45 (834)
++|+|.|.+. +..... . ..+.|.|.+|+.+|+|.|.|.+.|..|
T Consensus 32 P~v~W~~~~~~~~~~~~~~~~~~~v~~~~~~~~L~I~~v~~~D~G~Y~C~A~N~~G 87 (97)
T cd05869 32 PSITWRTSTRNISSEEKTLDGHIVVRSHARVSSLTLKYIQYTDAGEYLCTASNTIG 87 (97)
T ss_pred CEEEEEECCccccCCccccCccEEEEcCccEEEEEEecCccCcCEEEEEEEecCCC
Confidence 4799999655 543211 1 125899999999999999999998876
No 122
>cd05727 Ig2_Contactin-2-like Second Ig domain of the neural cell adhesion molecule contactin-2 and similar proteins. Ig2_Contactin-2-like: second Ig domain of the neural cell adhesion molecule contactin-2. Contactins are comprised of six Ig domains followed by four fibronectin type III (FnIII) domains anchored to the membrane by glycosylphosphatidylinositol. Contactin-2 (aliases TAG-1, axonin-1) facilitates cell adhesion by homophilic binding between molecules in apposed membranes. The first four Ig domains form the intermolecular binding fragment which arranges as a compact U-shaped module by contacts between Ig domains 1 and 4, and domains 2 and 3. It has been proposed that a linear zipper-like array forms, from contactin-2 molecules alternatively provided by the two apposed membranes.
Probab=96.90 E-value=0.0011 Score=56.30 Aligned_cols=44 Identities=20% Similarity=0.337 Sum_probs=33.4
Q ss_pred CcEEEEecCC-C--CCccc--c--CCCeEEEcccCCCCCeeEEEEEcCCCC
Q psy9228 2 AYIKWSRADG-L--PLQRY--A--EGNVLRITNARLQDSGKYKCEIQGHDS 45 (834)
Q Consensus 2 ~~~~w~~~~~-~--~~~~~--~--~~~~l~~~~~~~~d~g~y~c~~~~~~~ 45 (834)
++|.|.|.+- + +.+.+ + .+|.|.|.+|+.+|+|.|.|.+++.-+
T Consensus 34 p~~~W~k~~~~~~~~~d~r~~~~~~~G~L~fs~v~~~D~g~Y~C~A~n~~~ 84 (96)
T cd05727 34 LSYRWLLNEFPNFIPEDGRRFVSQTNGNLYIAKVEASDRGNYSCFVSSPSS 84 (96)
T ss_pred CEEEEEECCcccccccCCCeEEeCCCCcEEEeecCHhhCceeEEEEEeccc
Confidence 4799999544 3 32221 2 478999999999999999999987643
No 123
>cd04979 Ig_Semaphorin_C Immunoglobulin (Ig)-like domain of semaphorin. Ig_Semaphorin_C; Immunoglobulin (Ig)-like domain in semaphorins. Semaphorins are transmembrane protein that have important roles in a variety of tissues. Functionally, semaphorins were initially characterized for their importance in the development of the nervous system and in axonal guidance. Later they have been found to be important for the formation and functioning of the cardiovascular, endocrine, gastrointestinal, hepatic, immune, musculoskeletal, renal, reproductive, and respiratory systems. Semaphorins function through binding to their receptors and transmembrane semaphorins also serves as receptors themselves. Although molecular mechanism of semaphorins is poorly understood, the Ig-like domains may involve in ligand binding or dimerization.
Probab=96.86 E-value=0.0014 Score=55.56 Aligned_cols=42 Identities=26% Similarity=0.449 Sum_probs=33.0
Q ss_pred CcEEEEecCC-CCCc----c--ccCCCeEEEcccCCCCCeeEEEEEcCC
Q psy9228 2 AYIKWSRADG-LPLQ----R--YAEGNVLRITNARLQDSGKYKCEIQGH 43 (834)
Q Consensus 2 ~~~~w~~~~~-~~~~----~--~~~~~~l~~~~~~~~d~g~y~c~~~~~ 43 (834)
++|.|.|.+. ++.. . ....+.|.|.+++.+|+|.|.|.+.+.
T Consensus 25 ~~i~W~k~~~~~~~~~~~~~~~~~~~~~L~I~~~~~~D~G~Y~C~a~~~ 73 (89)
T cd04979 25 ASVVWLFQGGPLQRKEEPEERLLVTEDGLLIRSVSPADAGVYTCQSVEH 73 (89)
T ss_pred ceEEEEECCcccccccCcCceEEEcCCCEEEccCCHHHCEEEEEEEecC
Confidence 5899999776 6543 1 224567999999999999999999754
No 124
>cd05874 Ig6_NrCAM Sixth immunoglobulin (Ig)-like domain of NrCAM (Ng (neuronglia) CAM-related cell adhesion molecule). Ig6_NrCAM: sixth immunoglobulin (Ig)-like domain of NrCAM (Ng (neuronglia) CAM-related cell adhesion molecule). NrCAM belongs to the L1 subfamily of cell adhesion molecules (CAMs) and is comprised of an extracellular region having six Ig-like domains and five fibronectin type III domains, a transmembrane region, and an intracellular domain. NrCAM is primarily expressed in the nervous system.
Probab=96.84 E-value=0.0024 Score=52.27 Aligned_cols=44 Identities=16% Similarity=0.345 Sum_probs=32.5
Q ss_pred CcEEEEecCC-CCCc--cc--c--CCCeEEEcccCCC----CCeeEEEEEcCCCC
Q psy9228 2 AYIKWSRADG-LPLQ--RY--A--EGNVLRITNARLQ----DSGKYKCEIQGHDS 45 (834)
Q Consensus 2 ~~~~w~~~~~-~~~~--~~--~--~~~~l~~~~~~~~----d~g~y~c~~~~~~~ 45 (834)
++|+|.|.+. |... .+ . .++.|.|..++.+ |+|.|.|.++|..|
T Consensus 13 P~i~W~k~g~~l~~~~~~~~~~~~~~g~l~i~~~~~~~~~~d~G~Y~C~A~N~~G 67 (77)
T cd05874 13 PSFSWTRNGTHFDIDKDPKVTMKPNTGTLVINIMNGEKAEAYEGVYQCTARNERG 67 (77)
T ss_pred CeEEEEECCeECCCcCCCCEEEeCCCceEEEeccccCCCCCCCEEEEEEEEcCCC
Confidence 4899999665 6321 11 1 4789999888754 78999999998765
No 125
>PF12661 hEGF: Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=96.83 E-value=0.00037 Score=35.23 Aligned_cols=12 Identities=58% Similarity=1.821 Sum_probs=6.4
Q ss_pred eeCCCCCCCCCC
Q psy9228 163 CICPPGFSGDRC 174 (834)
Q Consensus 163 C~C~~g~~G~~C 174 (834)
|.|++||+|++|
T Consensus 2 C~C~~G~~G~~C 13 (13)
T PF12661_consen 2 CQCPPGWTGPNC 13 (13)
T ss_dssp EEE-TTEETTTT
T ss_pred ccCcCCCcCCCC
Confidence 555566655555
No 126
>cd05732 Ig5_NCAM-1_like Fifth immunoglobulin (Ig)-like domain of Neural Cell Adhesion Molecule NCAM-1 (NCAM) and similar proteins. Ig5_NCAM-1 like: domain similar to the fifth immunoglobulin (Ig)-like domain of Neural Cell Adhesion Molecule NCAM-1 (NCAM). NCAM plays important roles in the development and regeneration of the central nervous system, in synaptogenesis and neural migration. NCAM mediates cell-cell and cell-substratum recognition and adhesion via homophilic (NCAM-NCAM), and heterophilic (NCAM-non-NCAM), interactions. NCAM is expressed as three major isoforms having different intracellular extensions. The extracellular portion of NCAM has five N-terminal Ig-like domains and two fibronectin type III domains. The double zipper adhesion complex model for NCAM homophilic binding involves Ig1, Ig2, and Ig3. By this model, Ig1 and Ig2 mediate dimerization of NCAM molecules situated on the same cell surface (cis interactions), and Ig3 domains mediate interactions between NCAM mole
Probab=96.82 E-value=0.0019 Score=55.55 Aligned_cols=44 Identities=25% Similarity=0.494 Sum_probs=33.6
Q ss_pred CcEEEEecCC-CCCccc-------c----CCCeEEEcccCCCCCeeEEEEEcCCCC
Q psy9228 2 AYIKWSRADG-LPLQRY-------A----EGNVLRITNARLQDSGKYKCEIQGHDS 45 (834)
Q Consensus 2 ~~~~w~~~~~-~~~~~~-------~----~~~~l~~~~~~~~d~g~y~c~~~~~~~ 45 (834)
++|.|.|.+. +..... + ....|+|.+|+.+|+|.|.|.+.|..|
T Consensus 31 p~v~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~I~~v~~~D~G~Y~C~a~N~~G 86 (96)
T cd05732 31 PEITWRRATRNFSEGDKSLDGRIVVRGHARVSSLTLKDVQLTDAGRYDCEASNRIG 86 (96)
T ss_pred CcEEEEECCcccCCCCccccceEEEeCCccEEEEEECcCCcCcCEEeEEEEEeCCC
Confidence 4799999665 543211 1 235899999999999999999998876
No 127
>PF07679 I-set: Immunoglobulin I-set domain; InterPro: IPR013098 The basic structure of immunoglobulin (Ig) molecules is a tetramer of two light chains and two heavy chains linked by disulphide bonds. There are two types of light chains: kappa and lambda, each composed of a constant domain (CL) and a variable domain (VL). There are five types of heavy chains: alpha, delta, epsilon, gamma and mu, all consisting of a variable domain (VH) and three (in alpha, delta and gamma) or four (in epsilon and mu) constant domains (CH1 to CH4). Ig molecules are highly modular proteins, in which the variable and constant domains have clear, conserved sequence patterns. The domains in Ig and Ig-like molecules are grouped into four types: V-set (variable; IPR013106 from INTERPRO), C1-set (constant-1; IPR003597 from INTERPRO), C2-set (constant-2; IPR008424 from INTERPRO) and I-set (intermediate; IPR013098 from INTERPRO) []. Structural studies have shown that these domains share a common core Greek-key beta-sandwich structure, with the types differing in the number of strands in the beta-sheets as well as in their sequence patterns [, ]. Immunoglobulin-like domains that are related in both sequence and structure can be found in several diverse protein families. Ig-like domains are involved in a variety of functions, including cell-cell recognition, cell-surface receptors, muscle structure and the immune system []. This entry represents I-set domains, which are found in several cell adhesion molecules, including vascular (VCAM), intercellular (ICAM), neural (NCAM) and mucosal addressin (MADCAM) cell adhesion molecules, as well as junction adhesion molecules (JAM). I-set domains are also present in several other diverse protein families, including several tyrosine-protein kinase receptors, the hemolymph protein hemolin, the muscle proteins titin, telokin, and twitchin, the neuronal adhesion molecule axonin-1 [], and the signalling molecule semaphorin 4D that is involved in axonal guidance, immune function and angiogenesis [].; PDB: 3MTR_A 2EDK_A 3DMK_B 1KOA_A 3NCM_A 2NCM_A 2V9Q_A 2CR3_A 3QQN_A 3QR2_A ....
Probab=96.81 E-value=0.00073 Score=57.31 Aligned_cols=44 Identities=20% Similarity=0.427 Sum_probs=35.4
Q ss_pred CcEEEEecCC-CCCcccc------CCCeEEEcccCCCCCeeEEEEEcCCCC
Q psy9228 2 AYIKWSRADG-LPLQRYA------EGNVLRITNARLQDSGKYKCEIQGHDS 45 (834)
Q Consensus 2 ~~~~w~~~~~-~~~~~~~------~~~~l~~~~~~~~d~g~y~c~~~~~~~ 45 (834)
++|+|.|.+. |....+. ....|+|.+|+.+|+|.|.|.+++..+
T Consensus 30 ~~v~W~~~~~~l~~~~~~~~~~~~~~~~L~I~~v~~~D~G~Y~C~~~n~~g 80 (90)
T PF07679_consen 30 PTVTWYKNGRPLTSSQRYQIESDGGSSSLTIKNVTREDAGTYTCVASNSSG 80 (90)
T ss_dssp SEEEEEETTEEEESSSSEEEEEETTEEEEEESSESGGGSEEEEEEEEETTE
T ss_pred CcccccccccceeeeeeeeeecccceeEEccCCCChhhCEEEEEEEEECCC
Confidence 4799999755 6654333 356899999999999999999988765
No 128
>cd05749 Ig2_Tyro3_like Second immunoglobulin (Ig)-like domain of Axl/Tyro3 receptor tyrosine kinases (RTKs). Ig2_Tyro3_like: the second immunoglobulin (Ig)-like domain in the Axl/Tyro3 family of receptor tyrosine kinases (RTKs). This family includes Axl (also known as Ark, Ufo, and Tyro7), Tyro3 (also known as Sky, Rse, Brt, Dtk, and Tif), and Mer (also known as Nyk, c-Eyk, and Tyro12). Axl/Tyro3 family receptors have an extracellular portion with two Ig-like domains followed by two fibronectin-types III (FNIII) domains, a membrane-spanning single helix, and a cytoplasmic tyrosine kinase domain. Axl, Tyro3 and Mer are widely expressed in adult tissues, though they show higher expression in the brain, in the lymphatic and vascular systems, and in the testis. Axl, Tyro3, and Mer bind the vitamin K dependent protein Gas6 with high affinity, and in doing so activate their tyrosine kinase activity. Axl/Gas6 signaling may play a part in cell adhesion processes, prevention of apoptosis, and c
Probab=96.81 E-value=0.002 Score=53.23 Aligned_cols=41 Identities=20% Similarity=0.386 Sum_probs=32.1
Q ss_pred cEEEEecCC-CCCccccCCCeEEEcccCCCCCeeEEEEEcCCCC
Q psy9228 3 YIKWSRADG-LPLQRYAEGNVLRITNARLQDSGKYKCEIQGHDS 45 (834)
Q Consensus 3 ~~~w~~~~~-~~~~~~~~~~~l~~~~~~~~d~g~y~c~~~~~~~ 45 (834)
+|+|+|++. |.......+++|.|.++ .|+|.|.|.++|..|
T Consensus 30 ~I~W~k~g~~l~~~~~~~~s~L~i~~~--~d~g~Y~C~A~N~~G 71 (81)
T cd05749 30 EILWWQGGSPLGDPPAPSPSVLNVPGL--NETSKFSCEAHNAKG 71 (81)
T ss_pred EEEEEECCEECCCCCCCCCCEEEEccc--cCCeEEEEEEEeccC
Confidence 699999665 65433345789999997 488999999998765
No 129
>cd05870 Ig5_NCAM-2 Fifth immunoglobulin (Ig)-like domain of Neural Cell Adhesion Molecule NCAM-2 (also known as OCAM/mamFas II and RNCAM). Ig5_NCAM-2: the fifth immunoglobulin (Ig)-like domain of Neural Cell Adhesion Molecule NCAM-2 (also known as OCAM/mamFas II and RNCAM). NCAM-2 is organized similarly to NCAM , including five N-terminal Ig-like domains and two fibronectin type III domains. NCAM-2 is differentially expressed in the developing and mature olfactory epithelium (OE), and may function like NCAM, as an adhesion molecule.
Probab=96.75 E-value=0.0022 Score=55.41 Aligned_cols=44 Identities=32% Similarity=0.588 Sum_probs=32.8
Q ss_pred CcEEEEecC-C--CCCc-----ccc------CCCeEEEcccCCCCCeeEEEEEcCCCC
Q psy9228 2 AYIKWSRAD-G--LPLQ-----RYA------EGNVLRITNARLQDSGKYKCEIQGHDS 45 (834)
Q Consensus 2 ~~~~w~~~~-~--~~~~-----~~~------~~~~l~~~~~~~~d~g~y~c~~~~~~~ 45 (834)
.+|.|.|.+ | ++.. .+. ....|+|.+|+.+|+|.|.|.+.|..|
T Consensus 31 p~i~W~k~~~g~~~~~~~~~~~~r~~v~~~~~~~~L~I~~v~~~D~G~Y~C~A~N~~G 88 (98)
T cd05870 31 PEITWKRASDGHTFSEGDKSPDGRIEVKGQHGESSLHIKDVKLSDSGRYDCEAASRIG 88 (98)
T ss_pred CeEEEEECCCCceeccCCcCcCceEEEeecCCeeEEEEeeCCcCCCEEEEEEEeccCC
Confidence 479999953 3 3321 111 135899999999999999999998876
No 130
>cd05742 Ig1_VEGFR_like First immunoglobulin (Ig)-like domain of vascular endothelial growth factor (VEGF) receptor (R) and similar proteins. Ig1_VEGFR_like: first immunoglobulin (Ig)-like domain of vascular endothelial growth factor (VEGF) receptor(R) related proteins. The VEGFRs have an extracellular component with seven Ig-like domains, a transmembrane segment, and an intracellular tyrosine kinase domain interrupted by a kinase-insert domain. The VEGFR family consists of three members, VEGFR-1 (Flt-1), VEGFR-2 (KDR/Flk-1) and VEGFR-3 (Flt-4). VEGF-A interacts with both VEGFR-1 and VEGFR-2. VEGFR-1 binds strongest to VEGF, VEGF-2 binds more weakly. VEGFR-3 appears not to bind VEGF, but binds other members of the VEGF family (VEGF-C and -D). VEGFRs bind VEGFs with high affinity with the IG-like domains. VEGF-A is important to the growth and maintenance of vascular endothelial cells and to the development of new blood- and lymphatic-vessels in physiological and pathological states. VEGF
Probab=96.73 E-value=0.0016 Score=54.46 Aligned_cols=44 Identities=25% Similarity=0.425 Sum_probs=31.9
Q ss_pred CcEEEEecCC-CCCcc-----cc------CCCeEEEcccCCCCCeeEEEEEcCCCC
Q psy9228 2 AYIKWSRADG-LPLQR-----YA------EGNVLRITNARLQDSGKYKCEIQGHDS 45 (834)
Q Consensus 2 ~~~~w~~~~~-~~~~~-----~~------~~~~l~~~~~~~~d~g~y~c~~~~~~~ 45 (834)
..|.|.+.+. +.... .. ....|+|.+|+.+|+|.|.|.++|..+
T Consensus 18 ~~i~W~~~~~~~~~~~~~~~~~~~~~~~~~~s~L~I~~v~~~DsG~Y~C~a~n~~~ 73 (84)
T cd05742 18 VDFQWTYPGKKRGRGKSMVTRQSLSEATELSSTLTIPNATLKDSGTYTCAASSGTM 73 (84)
T ss_pred EEEEEecCCcccCCceEeecccccccceEEEEEEEECCCChhhCEEEEEEEccCCC
Confidence 3699987554 32221 11 235899999999999999999987754
No 131
>cd05845 Ig2_L1-CAM_like Second immunoglobulin (Ig)-like domain of the L1 cell adhesion molecule (CAM) and similar proteins. Ig2_L1-CAM_like: domain similar to the second immunoglobulin (Ig)-like domain of the L1 cell adhesion molecule (CAM). L1 belongs to the L1 subfamily of cell adhesion molecules (CAMs) and is comprised of an extracellular region having six Ig-like domains, five fibronectin type III domains, a transmembrane region and an intracellular domain. L1 is primarily expressed in the nervous system and is involved in its development and function. L1 is associated with an X-linked recessive disorder, X-linked hydrocephalus, MASA syndrome, or spastic paraplegia type 1, that involves abnormalities of axonal growth.
Probab=96.67 E-value=0.0018 Score=54.94 Aligned_cols=43 Identities=23% Similarity=0.315 Sum_probs=35.0
Q ss_pred CcEEEEecCC--CCCcccc---CCCeEEEcccCCCCCe-eEEEEEcCCC
Q psy9228 2 AYIKWSRADG--LPLQRYA---EGNVLRITNARLQDSG-KYKCEIQGHD 44 (834)
Q Consensus 2 ~~~~w~~~~~--~~~~~~~---~~~~l~~~~~~~~d~g-~y~c~~~~~~ 44 (834)
.+|.|.+.+. +....++ .+|.|.|.+|+.+|+| .|+|.+++.-
T Consensus 34 P~i~W~~~~~~~i~~~~Ri~~~~~GnL~fs~v~~~D~g~~Y~C~a~~~~ 82 (95)
T cd05845 34 LRIYWMNSDLLHITQDERVSMGQNGNLYFANVEEQDSHPDYICHAHFPG 82 (95)
T ss_pred CEEEEECCCCccccccccEEECCCceEEEEEEehhhCCCCeEEEEEccc
Confidence 4799998655 6655555 4799999999999999 7999998664
No 132
>PF00354 Pentaxin: Pentaxin family; InterPro: IPR001759 Pentaxins (or pentraxins) [, ] are a family of proteins which show, under electron microscopy, a discoid arrangement of five noncovalently bound subunits. Proteins of the pentaxin family are involved in acute immunological responses []. Three of the principal members of the pentaxin family are serum proteins: namely, C-reactive protein (CRP) [], serum amyloid P component protein (SAP) [], and female protein (FP) []. CRP is expressed during acute phase response to tissue injury or inflammation in mammals. The protein resembles antibody and performs several functions associated with host defence: it promotes agglutination, bacterial capsular swelling and phagocytosis, and activates the classical complement pathway through its calcium-dependent binding to phosphocholine. CRPs have also been sequenced in an invertebrate, Limulus polyphemus (Atlantic horseshoe crab), where they are a normal constituent of the hemolymph. SAP is a vertebrate protein that is a precursor of amyloid component P. It is found in all types of amyloid deposits, in glomerular basement menbrane and in elastic fibres in blood vessels. SAP binds to various lipoprotein ligands in a calcium-dependent manner, and it has been suggested that, in mammals, this may have important implications in atherosclerosis and amyloidosis. FP is a SAP homologue found in Mesocricetus auratus (Golden hamster). The concentration of this plasma protein is altered by sex steroids and stimuli that elicit an acute phase response. Pentaxin proteins expressed in the nervous system are neural pentaxin I (NPI) and II (NPII) []. NPI and NPII are homologous and can exist within one species. It is suggested that both proteins mediate the uptake of synaptic macromolecules and play a role in synaptic plasticity. Apexin, a sperm acrosomal protein, is a homologue of NPII found in Cavia porcellus (Guinea pig) []. PTX3 (or TSG-14) protein is a cytokine-induced protein that is homologous to CRPs and SAPs, but its function is not yet known.; PDB: 2A3W_F 3KQR_C 3D5O_D 2A3X_G 1SAC_D 2W08_B 1GYK_B 1LGN_A 2A3Y_A 1B09_D ....
Probab=96.66 E-value=0.067 Score=52.55 Aligned_cols=136 Identities=17% Similarity=0.152 Sum_probs=74.5
Q ss_pred ceeEEEEEEEeeCCC--CeeEEEeccCCccccCCCCCeEEEEEECcEEEEEEecccEEEEeeeeecCCCeEEEEEEEEC-
Q psy9228 404 HLHFSIELSFKPTDY--NGLIMYTGDSNMKSYKGKGDFVSFGLEDGYPVFRFDVGLVVLRSKVTLVPHEWVVVTIIKDF- 480 (834)
Q Consensus 404 ~~~~~i~~~frt~~~--~GlLl~~~~~~~~~~~~~~d~~~l~L~~G~l~~~~~~G~~~i~s~~~~~dg~wH~V~v~~~~- 480 (834)
-..+++.+++|+... .+.||...... ....++.+.-..+.+.+.+ +...+.....+.+++||++.++++.
T Consensus 24 L~~fTvC~w~k~~~~~~~~tifSYat~~-----~~nell~~~~~~~~~~l~i--~~~~~~~~~~~~~~~Whh~C~tW~s~ 96 (195)
T PF00354_consen 24 LSAFTVCFWVKTDDSSNDGTIFSYATSS-----QDNELLLFGSSSGSLRLYI--NGSSVSFSGPIRDGQWHHICVTWDSS 96 (195)
T ss_dssp BSEEEEEEEEEESGSGS-EEEEEEEETT-----EEEEEEEEEETTTEEEEEE--TTEEEEEEECS-TSS-EEEEEEEETT
T ss_pred cccEEEEEEEEeccCCCceEEEEEccCC-----CCccEEEEEeCCceEEEEE--CCeEeEeccccCCCCcEEEEEEEecC
Confidence 457999999999875 67777543321 1122322222234554444 3223444556899999999999987
Q ss_pred -CeEEEEECCeeeeeeecCCCccccccCCCceeeccccCCCcccCcCcccccCceeEEeeeEEcCeeeeecc
Q psy9228 481 -KEGKLSVGGEPLIVGSTPGEKLQVLNLRTPLYLGGYNIYHVTPSLSVEVTEGFHGCISTIDVLGSELDLIN 551 (834)
Q Consensus 481 -~~~~l~VD~~~~~~~~~~g~~~~~l~~~~~l~iGG~~~~~~~~~~~~~~~~gF~GCi~~v~ing~~~~~~~ 551 (834)
..+.|++||....... ... ...+...+.+.+|--.+. ..........|.|=|.++.+=.+.+...+
T Consensus 97 ~G~~~ly~dG~~~~~~~-~~~-g~~i~~gG~~vlGQeQd~---~gG~fd~~q~F~G~i~~~~iWd~vLs~~e 163 (195)
T PF00354_consen 97 TGRWQLYVDGVRLSSTG-LAT-GHSIPGGGTLVLGQEQDS---YGGGFDESQAFVGEISDFNIWDRVLSPEE 163 (195)
T ss_dssp TTEEEEEETTEEEEEEE-SST-T--B-SSEEEEESS-BSB---TTBTCSGGGB--EEEEEEEEESS---HHH
T ss_pred CcEEEEEECCEeccccc-ccC-CceECCCCEEEECccccc---cCCCcCCccEeeEEEeceEEEeeeCCHHH
Confidence 4788999998433221 111 234455556666643321 22234456789999999999777666443
No 133
>KOG4260|consensus
Probab=96.65 E-value=0.0023 Score=62.62 Aligned_cols=63 Identities=22% Similarity=0.686 Sum_probs=51.4
Q ss_pred CccCCCCCCCCCCCCEeccCCCCCceEeeCCCCCCCCCCCCCCCCccC--CCCC--CCeeecCCCCeEEeCCCCc
Q psy9228 136 NTCKSSKHNNCINNGLCQDAATRIGYTCICPPGFSGDRCSVLGEPCYP--GACG--DGSCQDVDGAMKCLCPIGT 206 (834)
Q Consensus 136 ~~C~~~~~~pC~n~g~C~~~~~~~~~~C~C~~g~~G~~Ce~~~~~C~~--~~C~--~g~C~~~~~~~~C~C~~g~ 206 (834)
|+|. ..++||...-.|++.. ++|+|.+.+||.+. .|+|.- .-|. |+.|.|..++|+|.|..|+
T Consensus 237 nEC~-~ep~~c~~~qfCvNte--GSf~C~dk~Gy~~g-----~d~C~~~~d~~~~kn~~c~ni~~~~r~v~f~~~ 303 (350)
T KOG4260|consen 237 NECQ-NEPAPCKAHQFCVNTE--GSFKCEDKEGYKKG-----VDECQFCADVCASKNRPCMNIDGQYRCVCFSGL 303 (350)
T ss_pred HHHh-cCCCCCChhheeecCC--CceEecccccccCC-----hHHhhhhhhhcccCCCCcccCCccEEEEecccc
Confidence 6886 4488999999999976 99999999999862 556665 4553 6889999999999998875
No 134
>cd05858 Ig3_FGFR-2 Third immunoglobulin (Ig)-like domain of fibroblast growth factor receptor 2 (FGFR2). Ig3_FGFR-2-like; domain similar to the third immunoglobulin (Ig)-like domain of human fibroblast growth factor receptor 2 (FGFR2). Fibroblast growth factors (FGFs) participate in morphogenesis, development, angiogenesis, and wound healing. These FGF-stimulated processes are mediated by four FGFR tyrosine kinases (FGRF1-4). FGFRs are comprised of an extracellular portion consisting of three Ig-like domains, a transmembrane helix, and a cytoplasmic portion having protein tyrosine kinase activity. The highly conserved Ig-like domains 2 and 3, and the linker region between D2 and D3 define a general binding site for FGFs. FGFR2 is required for male sex determination.
Probab=96.64 E-value=0.0019 Score=54.80 Aligned_cols=25 Identities=28% Similarity=0.557 Sum_probs=23.2
Q ss_pred CeEEEcccCCCCCeeEEEEEcCCCC
Q psy9228 21 NVLRITNARLQDSGKYKCEIQGHDS 45 (834)
Q Consensus 21 ~~l~~~~~~~~d~g~y~c~~~~~~~ 45 (834)
..|+|.+|+.+|+|.|.|.+.|..|
T Consensus 55 ~~L~I~~v~~~D~G~Y~C~A~N~~G 79 (90)
T cd05858 55 EVLYLRNVTFEDAGEYTCLAGNSIG 79 (90)
T ss_pred eEEEEccCCHHHCEEEEEEEEeCCC
Confidence 4799999999999999999998876
No 135
>KOG4194|consensus
Probab=96.63 E-value=0.0025 Score=69.74 Aligned_cols=60 Identities=28% Similarity=0.548 Sum_probs=46.8
Q ss_pred CcEEEEecCC--CCCcccc------CCCeEEEcccCCCCCeeEEEEEcCCCCccCCcEEEEcCceeEEEcCCcce
Q psy9228 2 AYIKWSRADG--LPLQRYA------EGNVLRITNARLQDSGKYKCEIQGHDSFRGSDYVKLNVERMMFVDGIGPF 68 (834)
Q Consensus 2 ~~~~w~~~~~--~~~~~~~------~~~~l~~~~~~~~d~g~y~c~~~~~~~~~~~~~~~~~~~g~~cvd~~~~~ 68 (834)
..|.|.|++| .|+.+.- +++++-|.||+.+|+|+|+|++++.-| .+..|..+.|...+.+
T Consensus 642 PeIawqkdggtdFPAA~eRRl~Vmpedd~f~Itnvk~eD~GiYtC~A~n~AG-------~isanAtL~V~e~p~f 709 (873)
T KOG4194|consen 642 PEIAWQKDGGTDFPAARERRLHVMPEDDVFFITNVKIEDQGIYTCTAQNVAG-------QISANATLTVLETPSF 709 (873)
T ss_pred cceeehhcCCCCCchhhhheeeecCCCCEEEEecccccccceeEEeeecccc-------ceeeceEEEEecCCcc
Confidence 4799999888 8884433 588999999999999999999988765 2224666777665444
No 136
>cd05891 Ig_M-protein_C C-terminal immunoglobulin (Ig)-like domain of M-protein (also known as myomesin-2). Ig_M-protein_C: the C-terminal immunoglobulin (Ig)-like domain of M-protein (also known as myomesin-2). M-protein is a structural protein localized to the M-band, a transverse structure in the center of the sarcomere, and is a candidate for M-band bridges. M-protein is modular consisting mainly of repetitive IG-like and fibronectin type III (FnIII) domains, and has a muscle-type specific expression pattern. M-protein is present in fast fibers.
Probab=96.62 E-value=0.004 Score=53.05 Aligned_cols=44 Identities=27% Similarity=0.485 Sum_probs=34.0
Q ss_pred CcEEEEecCC-CCCccc----cC-C--CeEEEcccCCCCCeeEEEEEcCCCC
Q psy9228 2 AYIKWSRADG-LPLQRY----AE-G--NVLRITNARLQDSGKYKCEIQGHDS 45 (834)
Q Consensus 2 ~~~~w~~~~~-~~~~~~----~~-~--~~l~~~~~~~~d~g~y~c~~~~~~~ 45 (834)
.++.|.|.+. +....+ .. + ..|.|.+++.+|+|.|.|.+.|..|
T Consensus 31 p~i~W~k~~~~~~~~~~~~~~~~~~~~~~L~I~~~~~~D~G~Y~C~a~N~~G 82 (92)
T cd05891 31 PEVIWFKNDQDIELSEHYSVKLEQGKYASLTIKGVTSEDSGKYSINVKNKYG 82 (92)
T ss_pred CeEEEEECCEECCCCCCEEEEEcCCCEEEEEECCCChhhCEEEEEEEEeCCC
Confidence 4789999777 544321 12 2 2799999999999999999998876
No 137
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=96.62 E-value=0.0025 Score=41.66 Aligned_cols=27 Identities=33% Similarity=0.966 Sum_probs=22.1
Q ss_pred CCCCCCCEeccCCCCCceEeeCCCCCCCCCC
Q psy9228 144 NNCINNGLCQDAATRIGYTCICPPGFSGDRC 174 (834)
Q Consensus 144 ~pC~n~g~C~~~~~~~~~~C~C~~g~~G~~C 174 (834)
..|.++|+|+.. ..+|.|.+||+|+.|
T Consensus 6 ~~C~~~G~C~~~----~g~C~C~~g~~G~~C 32 (32)
T PF07974_consen 6 NICSGHGTCVSP----CGRCVCDSGYTGPDC 32 (32)
T ss_pred CccCCCCEEeCC----CCEEECCCCCcCCCC
Confidence 469999999963 468999999999877
No 138
>cd05861 Ig1_PDGFR-alphabeta Frst immunoglobulin (Ig)-like domain of platelet-derived growth factor (PDGF) receptors (R), alpha (CD140a), and beta (CD140b). Ig1_PDGFR-alphabeta: The first immunoglobulin (Ig)-like domain of platelet-derived growth factor (PDGF) receptors (R), alpha (CD140a), and beta (CD140b). PDGF is a potent mitogen for connective tissue cells. PDGF-stimulated processes are mediated by three different PDGFs (PDGF-A,-B, and C). PDGFRalpha binds to all three PDGFs, whereas the PDGFRbeta binds only to PDGF-B. PDGFRs alpha and beta have similar organization: an extracellular component with five Ig-like domains, a transmembrane segment, and a cytoplasmic portion having protein tyrosine kinase activity. In mice, PDGFRalpha and PDGFRbeta are essential for normal development.
Probab=96.60 E-value=0.0021 Score=53.73 Aligned_cols=43 Identities=21% Similarity=0.436 Sum_probs=30.7
Q ss_pred CcEEEEecCC-----CCCc--ccc----CCCeEEEcccCCCCCeeEEEEEcCCC
Q psy9228 2 AYIKWSRADG-----LPLQ--RYA----EGNVLRITNARLQDSGKYKCEIQGHD 44 (834)
Q Consensus 2 ~~~~w~~~~~-----~~~~--~~~----~~~~l~~~~~~~~d~g~y~c~~~~~~ 44 (834)
..+.|.+.++ +|.. ... ....|.|.+++.+|+|.|.|.+++..
T Consensus 16 ~~~~W~~~~~~~~~~~~~~~~~~~~~~~~~s~L~I~~~~~~DsG~Y~C~a~n~~ 69 (84)
T cd05861 16 VDFSWTYPGKDIGKGIPEVEEVKVPATTLRSTLTFPHATVEDSGTYECAAHEST 69 (84)
T ss_pred cEEEEEECCccCCCCceEEEEEecCCcEEEEEEEECCCCcCCCEEEEEEEEECc
Confidence 3689997443 2221 111 24589999999999999999998654
No 139
>cd05722 Ig1_Neogenin First immunoglobulin (Ig)-like domain in neogenin and similar proteins. Ig1_Neogenin: first immunoglobulin (Ig)-like domain in neogenin and related proteins. Neogenin is a cell surface protein which is expressed in the developing nervous system of vertebrate embryos in the growing nerve cells. It is also expressed in other embryonic tissues, and may play a general role in developmental processes such as cell migration, cell-cell recognition, and tissue growth regulation. Included in this group is the tumor suppressor protein DCC, which is deleted in colorectal carcinoma . DCC and neogenin each have four Ig-like domains followed by six fibronectin type III domains, a transmembrane domain, and an intracellular domain.
Probab=96.59 E-value=0.0039 Score=53.46 Aligned_cols=44 Identities=27% Similarity=0.388 Sum_probs=33.4
Q ss_pred CcEEEEecCC-CCCccc-----cCCCeEEEccc-----CCCCCeeEEEEEcCC-CC
Q psy9228 2 AYIKWSRADG-LPLQRY-----AEGNVLRITNA-----RLQDSGKYKCEIQGH-DS 45 (834)
Q Consensus 2 ~~~~w~~~~~-~~~~~~-----~~~~~l~~~~~-----~~~d~g~y~c~~~~~-~~ 45 (834)
++|+|.|++. |..... ..++.|.|.++ ..+|+|.|.|.+.|. .+
T Consensus 29 p~i~W~k~g~~l~~~~~~~~~~~~~~~l~i~~v~~~~~~~~D~G~Y~C~a~N~~~G 84 (95)
T cd05722 29 PKIEWKKDGVLLNLVSDERRQQLPNGSLLITSVVHSKHNKPDEGFYQCVAQNDSLG 84 (95)
T ss_pred CEEEEEECCeECccccCcceEEccCCeEEEeeeeccCCCCCcCEEEEEEEECCccC
Confidence 5899999555 654322 14677888887 589999999999988 54
No 140
>PHA02826 IL-1 receptor-like protein; Provisional
Probab=96.58 E-value=0.0022 Score=64.48 Aligned_cols=41 Identities=22% Similarity=0.422 Sum_probs=33.4
Q ss_pred CcEEEEecCC-CCCcccc----CCCeEEEcccCCCCCeeEEEEEcC
Q psy9228 2 AYIKWSRADG-LPLQRYA----EGNVLRITNARLQDSGKYKCEIQG 42 (834)
Q Consensus 2 ~~~~w~~~~~-~~~~~~~----~~~~l~~~~~~~~d~g~y~c~~~~ 42 (834)
.+|+|.|++. |+...+. .++.|.|.+|+.+|+|.|.|.++.
T Consensus 164 p~I~WyKng~~l~~~~r~~~~~~~~~L~I~~V~~~DsG~YtC~a~~ 209 (227)
T PHA02826 164 YTLTWYKNGNIVLYTDRIQLRNNNSTLVIKSATHDDSGIYTCNLRF 209 (227)
T ss_pred ceEEEEECCEECCCCCCEEEeCCCCEEEECcCCHHhCEEEEEEEEE
Confidence 5899999666 7755443 356899999999999999999964
No 141
>cd05765 Ig_3 Subgroup of the immunoglobulin (Ig) superfamily. Ig_3: subgroup of the immunoglobulin (Ig) domain found in the Ig superfamily. The Ig superfamily is a heterogenous group of proteins, built on a common fold comprised of a sandwich of two beta sheets. Members of the Ig superfamily are components of immunoglobulin, neuroglia, cell surface glycoproteins, such as T-cell receptors, CD2, CD4, CD8, and membrane glycoproteins, such as butyrophilin and chondroitin sulfate proteoglycan core protein. A predominant feature of most Ig domains is a disulfide bridge connecting the two beta-sheets with a tryptophan residue packed against the disulfide bond.
Probab=96.57 E-value=0.0039 Score=51.60 Aligned_cols=26 Identities=35% Similarity=0.483 Sum_probs=23.8
Q ss_pred CCeEEEcccCCCCCeeEEEEEcCCCC
Q psy9228 20 GNVLRITNARLQDSGKYKCEIQGHDS 45 (834)
Q Consensus 20 ~~~l~~~~~~~~d~g~y~c~~~~~~~ 45 (834)
...|.|.+++.+|+|.|.|.+.|..|
T Consensus 46 ~~~L~I~~~~~~D~G~Y~C~a~N~~G 71 (81)
T cd05765 46 IGQLVIYNAQPQDAGLYTCTARNSGG 71 (81)
T ss_pred ccEEEEccCCcccCEEEEEEEecCCc
Confidence 46899999999999999999998876
No 142
>PF12661 hEGF: Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=96.55 E-value=0.00098 Score=33.73 Aligned_cols=13 Identities=38% Similarity=1.339 Sum_probs=8.0
Q ss_pred eeecCCCCCCCCC
Q psy9228 593 NCSCLTGYSGDHC 605 (834)
Q Consensus 593 ~C~C~~G~~G~~C 605 (834)
+|.|++||+|++|
T Consensus 1 ~C~C~~G~~G~~C 13 (13)
T PF12661_consen 1 TCQCPPGWTGPNC 13 (13)
T ss_dssp EEEE-TTEETTTT
T ss_pred CccCcCCCcCCCC
Confidence 4667777777665
No 143
>cd05871 Ig_Semaphorin_classIII Immunoglobulin (Ig)-like domain of class III semaphorin. Ig_Semaphorin_class III; Immunoglobulin (Ig)-like domain of class III semaphorins. Semaphorins are classified into various classes on the basis of structural features additional to the Sema domain. Class III semaphorins are a vertebrate class having a Sema domain, an Ig domain, a short basic domain, and are secreted. They have been shown to be axonal guidance cues and have a part in the regulation of the cardiovascular, immune and respiratory systems. Sema3A, the prototype member of this class III subfamily, induces growth cone collapse and is an inhibitor of axonal sprouting. In perinatal rat cortex as a chemoattractant, it functions to direct, for pyramidal neurons, the orientated extension of apical dendrites. It may play a role, prior to the development of apical dendrites, in signaling the radial migration of newborn cortical neurons towards the upper layers. Sema3A selectively inhibits vascula
Probab=96.51 E-value=0.003 Score=53.59 Aligned_cols=41 Identities=22% Similarity=0.343 Sum_probs=30.4
Q ss_pred CcEEEEecCC-CC------Ccccc--CCCeEEEcccCCCCCeeEEEEEcC
Q psy9228 2 AYIKWSRADG-LP------LQRYA--EGNVLRITNARLQDSGKYKCEIQG 42 (834)
Q Consensus 2 ~~~~w~~~~~-~~------~~~~~--~~~~l~~~~~~~~d~g~y~c~~~~ 42 (834)
++|.|.|++. .+ ...+. .++.|.|.+++.+|+|.|.|.+.+
T Consensus 25 p~v~Wykq~~g~~~~~~~~~~~r~~~~~~~L~I~~~~~~DsG~Y~C~a~~ 74 (91)
T cd05871 25 ASVKWLFQRGGDQRKEEVKTEERLIHTERGLLLRSLQRSDAGVYTCTAVE 74 (91)
T ss_pred ceEEEEEECCCCCccccccccccEEEecCeEEEeeCChhHCEEEEEEEEc
Confidence 4799998543 22 12222 467899999999999999999973
No 144
>cd05773 Ig8_hNephrin_like Eighth immunoglobulin-like domain of nephrin. Ig8_hNephrin_like: domain similar to the eighth immunoglobulin-like domain in human nephrin. Nephrin is an integral component of the slit diaphragm, and is a central component of the glomerular ultrafilter. Nephrin plays a structural role, and has a role in signaling. Nephrin is a transmembrane protein having a short intracellular portion, and an extracellular portion comprised of eight Ig-like domains, and one fibronectin type III-like domain. The extracellular portions of nephrin, from neighboring foot processes of separate podocyte cells, may interact with each other, and in association with other components of the slit diaphragm, form a porous molecular sieve within the slit pore. The intracellular portion of nephrin is associated with linker proteins, which connect nephrin to the actin cytoskeleton. The intracellular portion is tyrosine phosphorylated, and mediates signaling from the slit diaphragm into the p
Probab=96.50 E-value=0.0042 Score=54.80 Aligned_cols=44 Identities=23% Similarity=0.480 Sum_probs=33.3
Q ss_pred CcEEEEecCC-CCC-c-ccc---------CCCeEEEcccC-CCCCeeEEEEEcCCCC
Q psy9228 2 AYIKWSRADG-LPL-Q-RYA---------EGNVLRITNAR-LQDSGKYKCEIQGHDS 45 (834)
Q Consensus 2 ~~~~w~~~~~-~~~-~-~~~---------~~~~l~~~~~~-~~d~g~y~c~~~~~~~ 45 (834)
++++|+|.+. |+. + ++. .+..|+|.+|. .+|.|.|.|.+.|..|
T Consensus 38 p~i~W~k~g~~l~~~~~r~~~~~~~~~~~~~s~L~I~~v~~~~D~G~Y~C~A~N~~G 94 (109)
T cd05773 38 VQFRWAKNGVPLDLGNPRYEETTEHTGTVHTSILTIINVSAALDYALFTCTAHNSLG 94 (109)
T ss_pred CEEEEEECCEECCCCCCeEEEEeeccCccceeEEEECcCCccCCCEEEEEEEEeCCc
Confidence 5899999544 653 2 111 14689999997 6999999999999876
No 145
>cd05896 Ig1_IL1RAPL-1_like First immunoglobulin (Ig)-like domain of X-linked interleukin-1 receptor accessory protein-like 1 (IL1RAPL-1). Ig1_ IL1RAPL-1_like: domain similar to the first immunoglobulin (Ig)-like domain of X-linked interleukin-1 receptor accessory protein-like 1 (IL1RAPL-1). IL-1 alpha and IL-1 beta are cytokines which participates in the regulation of inflammation, immune responses, and hematopoiesis. These cytokines bind to the IL-1 receptor type 1 (IL1R1), which is activated on additional association with an accessory protein, IL1RAP. IL-1 also binds a second receptor designated type II (IL1R2). Mature IL1R1 consists of three Ig-like domains, a transmembrane domain, and a large cytoplasmic domain. Mature IL1R2 is organized similarly except that it has a short cytoplasmic domain. The latter does not initiate signal transduction. A naturally occurring cytokine IL-1RA (IL-1 receptor antagonist) is widely expressed and binds to IL-1 receptors, inhibiting the binding of
Probab=96.49 E-value=0.0041 Score=53.22 Aligned_cols=42 Identities=26% Similarity=0.464 Sum_probs=32.9
Q ss_pred cEEEEecCC-CCCc-------ccc--CCCeEEEcccCCCCCeeEEEEEcCCC
Q psy9228 3 YIKWSRADG-LPLQ-------RYA--EGNVLRITNARLQDSGKYKCEIQGHD 44 (834)
Q Consensus 3 ~~~w~~~~~-~~~~-------~~~--~~~~l~~~~~~~~d~g~y~c~~~~~~ 44 (834)
+++|.|.++ +... .++ .+..|.|..+..+|+|.|.|+..|..
T Consensus 42 ~v~WyK~~~~~~~~~~~~~~~~Ri~~~~~~Lwf~Pa~~eDSG~Y~C~~rN~t 93 (104)
T cd05896 42 SLMWYKSSGDGEEPIEIIFDGVRMSKEEDSIWFRPAELQDSGLYTCVLRNST 93 (104)
T ss_pred eEEEEECCCCCCCCCcccccceeEEEeCCEEEEEeCChhhCeEEEEEECCCC
Confidence 689999754 3221 133 68999999999999999999997654
No 146
>cd05882 Ig1_Necl-1 First (N-terminal) immunoglobulin (Ig)-like domain of nectin-like molcule-1 (Necl-1, also known as cell adhesion molecule3 (CADM3)). Ig1_Necl-1: domain similar to the N-terminal immunoglobulin (Ig)-like domain of nectin-like molecule-1, Necl-1 (also known as celll adhesion molecule 3 (CADM3), SynCAM2, IGSF4). Nectin-like molecules have similar domain structures to those of nectins. At least five nectin-like molecules have been identified (Necl-1 - Necl-5). They all have an extracellular region containing three Ig-like domains, a transmembrane region, and a cytoplasmic region. The N-terminal Ig-like domain of the extracellular region belongs to the V-type subfamily of Ig domains, is essential to cell-cell adhesion, and plays a part in the interaction with the envelope glycoprotein D of various viruses. Necl-1 has Ca(2+)-independent homophilic and heterophilic cell-cell adhesion activity. Necl-1 is specifically expressed in neural tissue, and is important to the format
Probab=96.47 E-value=0.0032 Score=53.84 Aligned_cols=43 Identities=26% Similarity=0.507 Sum_probs=31.2
Q ss_pred CcEEEEecCC-C---------CCcccc------CCCeEEEcccCCCCCeeEEEEEcCCC
Q psy9228 2 AYIKWSRADG-L---------PLQRYA------EGNVLRITNARLQDSGKYKCEIQGHD 44 (834)
Q Consensus 2 ~~~~w~~~~~-~---------~~~~~~------~~~~l~~~~~~~~d~g~y~c~~~~~~ 44 (834)
.+|.|.|.++ + ...+.. .+..|.|.+|+.+|+|.|.|.+...+
T Consensus 27 ~~v~W~k~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~L~I~nV~~~D~G~YtC~~~t~~ 85 (95)
T cd05882 27 SSLQWSNTAQQTLYFGEKRALRDNRIQLVKSTPTELIISISNVQLSDEGEYTCSIFTMP 85 (95)
T ss_pred CeEEEeccCccEEEeCCeEEEeCCeEEEEeCCCceEEEEECCCCcccCEEEEEEEEeec
Confidence 4799998665 2 112211 14699999999999999999997543
No 147
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=96.39 E-value=0.0024 Score=55.05 Aligned_cols=39 Identities=28% Similarity=0.687 Sum_probs=28.5
Q ss_pred ccCCCCCCCCCCCCEeccCCCCCceEeeCCCCCCCCCCCC
Q psy9228 137 TCKSSKHNNCINNGLCQDAATRIGYTCICPPGFSGDRCSV 176 (834)
Q Consensus 137 ~C~~~~~~pC~n~g~C~~~~~~~~~~C~C~~g~~G~~Ce~ 176 (834)
.|.....+-|.|| +|.-.+....+.|.|+.||+|.+||+
T Consensus 44 ~Cp~ey~~YClHG-~C~yI~dl~~~~CrC~~GYtGeRCEh 82 (139)
T PHA03099 44 LCGPEGDGYCLHG-DCIHARDIDGMYCRCSHGYTGIRCQH 82 (139)
T ss_pred cCChhhCCEeECC-EEEeeccCCCceeECCCCcccccccc
Confidence 4543347889997 89866656778888888888888865
No 148
>cd05753 Ig2_FcgammaR_like Second immunoglobulin (Ig)-like domain of Fcgamma-receptors (FcgammaRs) and similar proteins. Ig2_FcgammaR_like: domain similar to the second immunoglobulin (Ig)-like domain of Fcgamma-receptors (FcgammaRs). Interactions between IgG and FcgammaR are important to the initiation of cellular and humoral response. IgG binding to FcgammaR leads to a cascade of signals and ultimately to functions such as antibody-dependent-cellular-cytotoxicity (ADCC), endocytosis, phagocytosis, release of inflammatory mediators, etc. FcgammaR has two Ig-like domains. This group also contains FcepsilonRI, which binds IgE with high affinity.
Probab=96.38 E-value=0.0043 Score=51.62 Aligned_cols=40 Identities=25% Similarity=0.314 Sum_probs=32.7
Q ss_pred cEEEEecCC-CCCccccCCCeEEEcccCCCCCeeEEEEEcCCC
Q psy9228 3 YIKWSRADG-LPLQRYAEGNVLRITNARLQDSGKYKCEIQGHD 44 (834)
Q Consensus 3 ~~~w~~~~~-~~~~~~~~~~~l~~~~~~~~d~g~y~c~~~~~~ 44 (834)
++.|.|.+. ++. ...+..|.|.+++.+|+|.|.|.+.+..
T Consensus 31 ~~~w~k~g~~~~~--~~~~~~l~I~~~~~~dsG~Y~C~~~~~~ 71 (83)
T cd05753 31 KVTYYRDGKAKKY--SHSNSNLSIPQATLSDSGSYHCSGIIGS 71 (83)
T ss_pred EEEEEECCeEccc--cCCCceEEECccCHHHCEEEEEEEEeCC
Confidence 689999666 653 2345789999999999999999998765
No 149
>cd05872 Ig_Sema4B_like Immunoglobulin (Ig)-like domain of the class IV semaphorin Sema4B. Ig_Sema4B_like; Immunoglobulin (Ig)-like domain of Sema4B_like. Sema4B is a Class IV semaphorin. Semaphorins are classified based on structural features additional to the Sema domain. Sema4B has extracellular Sema and Ig domains, a transmembrane domain and a short cytoplasmic domain. Sema4B has been shown to preferentially regulate the development of the postsynaptic specialization at the glutamatergic synapses. This cytoplasmic domain includes a PDZ-binding motif upon which the synaptic localization of Sem4B is dependent. Sema4B is a ligand of CLCP1, CLCP1 was identified in an expression profiling analysis, which compared a highly metastic lung cancer subline with its low metastic parental line. Sema4B was shown to promote CLCP1 endocytosis, and their interaction is a potential target for therapeutic intervention of metastasis.
Probab=96.34 E-value=0.0041 Score=51.67 Aligned_cols=40 Identities=25% Similarity=0.241 Sum_probs=32.6
Q ss_pred CcEEEEecCC-CCCcccc--CCCeEEEcccCCCCCeeEEEEEc
Q psy9228 2 AYIKWSRADG-LPLQRYA--EGNVLRITNARLQDSGKYKCEIQ 41 (834)
Q Consensus 2 ~~~~w~~~~~-~~~~~~~--~~~~l~~~~~~~~d~g~y~c~~~ 41 (834)
|+++|.+.+. |+.+.+. .+.-|.|.+++.+|+|.|.|.+.
T Consensus 25 A~v~W~~ng~~l~~~~r~~~~~~GLlI~~~~~~dsG~Y~C~s~ 67 (85)
T cd05872 25 ASPVWLFNGTPLNAQFSYRVGTDGLLILVTSPEHSGTYRCYSE 67 (85)
T ss_pred ccEEEEECCcccCCCcceEEeCCCCEEEECCHhhCEEEEEEEe
Confidence 6899999777 8765444 34558899999999999999996
No 150
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=96.34 E-value=0.0047 Score=40.39 Aligned_cols=27 Identities=33% Similarity=0.895 Sum_probs=23.1
Q ss_pred CCCCCCCeecccCCCCCceeeecCCCCCCCCC
Q psy9228 574 KPCQNYGICYPTDTSERGYNCSCLTGYSGDHC 605 (834)
Q Consensus 574 ~pC~ngg~C~~~~~~~~~~~C~C~~G~~G~~C 605 (834)
..|.++|+|+.. ..+|.|.+||+|+.|
T Consensus 6 ~~C~~~G~C~~~-----~g~C~C~~g~~G~~C 32 (32)
T PF07974_consen 6 NICSGHGTCVSP-----CGRCVCDSGYTGPDC 32 (32)
T ss_pred CccCCCCEEeCC-----CCEEECCCCCcCCCC
Confidence 479999999876 468999999999876
No 151
>cd05752 Ig1_FcgammaR_like Frst immunoglobulin (Ig)-like domain of Fcgamma-receptors (FcgammaRs) and similar proteins. Ig1_FcgammaR_like: domain similar to the first immunoglobulin (Ig)-like domain of Fcgamma-receptors (FcgammaRs). Interactions between IgG and FcgammaR are important to the initiation of cellular and humoral response. IgG binding to FcgammaR leads to a cascade of signals and ultimately to functions such as antibody-dependent-cellular-cytotoxicity (ADCC), endocytosis, phagocytosis, release of inflammatory mediators, etc. FcgammaR has two Ig-like domains. This group also contains FcepsilonRI, which binds IgE with high affinity.
Probab=96.33 E-value=0.0044 Score=50.81 Aligned_cols=36 Identities=36% Similarity=0.662 Sum_probs=29.8
Q ss_pred cEEEEecCC-CCCccccCCCeEEEcccCCCCCeeEEEEEcCC
Q psy9228 3 YIKWSRADG-LPLQRYAEGNVLRITNARLQDSGKYKCEIQGH 43 (834)
Q Consensus 3 ~~~w~~~~~-~~~~~~~~~~~l~~~~~~~~d~g~y~c~~~~~ 43 (834)
.+.|.|.+. |. ..++.|+|.++ .+|+|.|.|.+.+.
T Consensus 32 ~~~W~kng~~l~----~~~~~l~i~~~-~~dsG~Y~C~a~~~ 68 (78)
T cd05752 32 STQWYHNGKLLE----TTTNSYRIRAA-NNDSGEYRCQTQGS 68 (78)
T ss_pred cEEEEECCEEee----ccCCeEEEeec-ccCCEEeEEECCCC
Confidence 689999554 62 35679999999 99999999999765
No 152
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=96.32 E-value=0.0034 Score=37.77 Aligned_cols=19 Identities=47% Similarity=1.207 Sum_probs=13.0
Q ss_pred ceEeeCCCCCC----CCCCCCCCC
Q psy9228 160 GYTCICPPGFS----GDRCSVLGE 179 (834)
Q Consensus 160 ~~~C~C~~g~~----G~~Ce~~~~ 179 (834)
+|+|.|++||. |..| +++|
T Consensus 1 sy~C~C~~Gy~l~~d~~~C-~DId 23 (24)
T PF12662_consen 1 SYTCSCPPGYQLSPDGRSC-EDID 23 (24)
T ss_pred CEEeeCCCCCcCCCCCCcc-ccCC
Confidence 58888888885 5566 3444
No 153
>KOG1834|consensus
Probab=96.20 E-value=0.057 Score=59.56 Aligned_cols=157 Identities=13% Similarity=0.086 Sum_probs=100.8
Q ss_pred eeEEcCCceeeechh-hhhhccCcceEEEEEEeCC-------CCeeEEecCCCCCCCCCCcceEEEEEECCEEEEEEEcC
Q psy9228 653 EVHFLGEGYVELKKE-LIEERRNEETIAFDFVTDD-------KNALLLWNGQPSYKNGIGREFIAVAVVNGYLEYSYDLG 724 (834)
Q Consensus 653 ~~~F~g~s~~~~~~~-~~~~~~~~~~i~~~frT~~-------~~GlLl~~~~~~~~~~~~~~~~~l~l~~G~l~~~~~~g 724 (834)
-+.|+|..-+.++.. ........++|+||+|-.. ..-.||-..++. ..+....+|++.+=+|.|.+.-.
T Consensus 344 i~eFdG~qgv~vpdg~~~g~l~dhFTlSfwMkHg~~p~~~~~eketIlCnsdk~---emnrhHyslyvh~Crl~fllr~d 420 (952)
T KOG1834|consen 344 IFEFDGTQGVTVPDGNVSGSLPDHFTLSFWMKHGPGPKDEQSEKETILCNSDKT---EMNRHHYSLYVHGCRLEFLLRRD 420 (952)
T ss_pred EEEEcCceeeEccCCCCCCCCCCceEEEEeeecCCCCccccccceeEEeccccc---ccccceeEEEEeccEEEEEEccC
Confidence 377888554555422 1112247789999998532 223566555442 34567899999999999998764
Q ss_pred CcE------EEEEeCCceecCCCcEEEEEEEECCEEEEEEcCeeeecc--cCCCCccceecCCceEEcCcCCCCCCCCCc
Q psy9228 725 DGV------VTIKFSKKPVNDGIKHSVNVTRINKFGSLEVDSVIVGKG--ESPGSQDVINTRGNIYLGGTPNMDLMTGGR 796 (834)
Q Consensus 725 ~~~------~~l~~s~~~~nDg~wH~V~i~r~~~~~~l~VD~~~~~~~--~~~~~~~~l~~~~~lyiGG~p~~~~~~~~~ 796 (834)
.+. .....+-..++|..||+-.+..+.-.++|+|||..-... ........-.....|.||.-..........
T Consensus 421 ~~~~~~fRpaef~Wkl~qVCD~EWH~Y~ln~efp~VtlyvDG~Sfep~~i~ddwplHpsk~~tqLvVGACW~g~~~~~l~ 500 (952)
T KOG1834|consen 421 AGATSDFRPAEFHWKLPQVCDNEWHHYVLNVEFPDVTLYVDGKSFEPPLITDDWPLHPSKIETQLVVGACWQGRQQKPLK 500 (952)
T ss_pred ccccccccchheeccchhhhhhhhheeEEeecCceEEEEEcCcccCCceeccCCccCcccccceeEEeeeccCccccchh
Confidence 332 222224457999999999999999889999999753321 111111122244568888665543333345
Q ss_pred cCCCceEEEEEEEECC
Q psy9228 797 YVHPMSGLMMNIHIQN 812 (834)
Q Consensus 797 ~~~~F~GCi~~v~in~ 812 (834)
....|+|-+..|.+-.
T Consensus 501 ~aqfFrG~Lasltlrs 516 (952)
T KOG1834|consen 501 LAQFFRGQLASLTLRS 516 (952)
T ss_pred HHHHhhcccceeEEec
Confidence 6778999999998843
No 154
>KOG3513|consensus
Probab=96.15 E-value=0.0051 Score=72.79 Aligned_cols=44 Identities=25% Similarity=0.454 Sum_probs=38.1
Q ss_pred CcEEEEecCC-CCCcccc---CCCeEEEcccCCCCCeeEEEEEcCCCC
Q psy9228 2 AYIKWSRADG-LPLQRYA---EGNVLRITNARLQDSGKYKCEIQGHDS 45 (834)
Q Consensus 2 ~~~~w~~~~~-~~~~~~~---~~~~l~~~~~~~~d~g~y~c~~~~~~~ 45 (834)
+.+.|.|.++ |..+.+. .+++|+|.|++.+|+|+|+|.++|..+
T Consensus 456 p~~~W~k~~~~~~~~~r~~i~edGtL~I~n~t~~DaG~YtC~A~N~~G 503 (1051)
T KOG3513|consen 456 PKVSWLKGGEKLLQSGRIRILEDGTLEISNVTRSDAGKYTCVAENKLG 503 (1051)
T ss_pred ceEEEEcCCcccccCceEEECCCCcEEecccCcccCcEEEEEEEcccC
Confidence 4789999888 6665555 689999999999999999999998876
No 155
>cd05758 Ig5_KIRREL3-like Fifth immunoglobulin (Ig)-like domain of Kirrel (kin of irregular chiasm-like) 3 (also known as Neph2) and similar proteins. Ig5_KIRREL3-like: domain similar to the fifth immunoglobulin (Ig)-like domain of Kirrel (kin of irregular chiasm-like) 3 (also known as Neph2). This protein has five Ig-like domains, one transmembrane domain, and a cytoplasmic tail. Included in this group is mammalian Kirrel (Neph1), Kirrel2 (Neph3), and Drosophila RST (irregular chiasm C-roughest) protein. These proteins contain multiple Ig domains, have properties of cell adhesion molecules, and are important in organ development.
Probab=95.93 E-value=0.011 Score=51.02 Aligned_cols=43 Identities=28% Similarity=0.501 Sum_probs=32.5
Q ss_pred cEEEEecCC-CCCc---ccc-C-C-------CeEEEcccCCCC-CeeEEEEEcCCCC
Q psy9228 3 YIKWSRADG-LPLQ---RYA-E-G-------NVLRITNARLQD-SGKYKCEIQGHDS 45 (834)
Q Consensus 3 ~~~w~~~~~-~~~~---~~~-~-~-------~~l~~~~~~~~d-~g~y~c~~~~~~~ 45 (834)
+|.|+|.+. |+.. +.. . . ..|+|.+|+.+| .|.|.|.+.|..|
T Consensus 33 ~v~W~~~~~~i~~~~~~r~~i~~~~~~~~~~s~L~I~~v~~~d~~G~Y~C~A~N~~G 89 (98)
T cd05758 33 RIVWTWKENELESGSSGRYTVETDPSPGGVLSTLTISNTQESDFQTSYNCTAWNSFG 89 (98)
T ss_pred EeEEEECCEEccCCCCCCEEEEEecCCCceEEEEEECCccccccceeEEEEEEcCCC
Confidence 699999766 7643 121 1 1 379999999955 8999999999877
No 156
>PHA02887 EGF-like protein; Provisional
Probab=95.87 E-value=0.0093 Score=50.59 Aligned_cols=40 Identities=35% Similarity=0.779 Sum_probs=31.0
Q ss_pred CccCCCCCCCCCCCCEeccCCCCCceEeeCCCCCCCCCCCC
Q psy9228 136 NTCKSSKHNNCINNGLCQDAATRIGYTCICPPGFSGDRCSV 176 (834)
Q Consensus 136 ~~C~~~~~~pC~n~g~C~~~~~~~~~~C~C~~g~~G~~Ce~ 176 (834)
.+|.+...+-|.| |+|.-........|.|+.||+|.+||+
T Consensus 84 ~pC~~eyk~YCiH-G~C~yI~dL~epsCrC~~GYtG~RCE~ 123 (126)
T PHA02887 84 EKCKNDFNDFCIN-GECMNIIDLDEKFCICNKGYTGIRCDE 123 (126)
T ss_pred cccChHhhCEeeC-CEEEccccCCCceeECCCCcccCCCCc
Confidence 3565434678996 599876666789999999999999975
No 157
>cd05862 Ig1_VEGFR First immunoglobulin (Ig)-like domain of vascular endothelial growth factor (VEGF) receptor(R). IG1_VEGFR: first immunoglobulin (Ig)-like domain of vascular endothelial growth factor (VEGF) receptor(R). The VEGFRs have an extracellular component with seven Ig-like domains, a transmembrane segment, and an intracellular tyrosine kinase domain interrupted by a kinase-insert domain. The VEGFR family consists of three members, VEGFR-1 (Flt-1), VEGFR-2 (KDR/Flk-1) and VEGFR-3 (Flt-4). VEGF_A interacts with both VEGFR-1 and VEGFR-2. VEGFR-1 binds strongest to VEGF, VEGF-2 binds more weakly. VEGFR-3 appears not to bind VEGF, but binds other members of the VEGF family (VEGF-C and -D). VEGFRs bind VEGFs with high affinity with the IG-like domains. VEGF-A is important to the growth and maintenance of vascular endothelial cells and to the development of new blood- and lymphatic-vessels in physiological and pathological states. VEGFR-2 is a major mediator of the mitogenic, angioge
Probab=95.75 E-value=0.014 Score=48.95 Aligned_cols=26 Identities=31% Similarity=0.590 Sum_probs=23.1
Q ss_pred CCeEEEcccCCCCCeeEEEEEcCCCC
Q psy9228 20 GNVLRITNARLQDSGKYKCEIQGHDS 45 (834)
Q Consensus 20 ~~~l~~~~~~~~d~g~y~c~~~~~~~ 45 (834)
...|.|.+++.+|+|.|.|.++|..+
T Consensus 49 ~s~L~I~~~~~~DsG~Y~C~a~n~~~ 74 (86)
T cd05862 49 SSTLTIENVTLSDLGRYTCTASSGQM 74 (86)
T ss_pred eeEEEEecCCcccCEEEEEEEeecce
Confidence 45899999999999999999988654
No 158
>cd07701 Ig1_Necl-3 First (N-terminal) immunoglobulin (Ig)-like domain of nectin-like molecule-3 (Necl-3, also known as cell adhesion molecule 2 (CADM2)). Ig1_Necl-3: domain similar to the N-terminal immunoglobulin (Ig)-like domain of nectin-like molecule-3, Necl-3 (also known as cell adhesion molecule 2 (CADM2), SynCAM2, IGSF4D). Nectin-like molecules have similar domain structures to those of nectins. At least five nectin-like molecules have been identified (Necl-1 - Necl-5). They all have an extracellular region containing three Ig-like domains, a transmembrane region, and a cytoplasmic region. The N-terminal Ig-like domain of the extracellular region, belongs to the V-type subfamily of Ig domains, is essential to cell-cell adhesion, and plays a part in the interaction with the envelope glycoprotein D of various viruses. Necl-3 accumulates in central and peripheral nervous system tissue, and has been shown to selectively interact with oligodendrocytes.
Probab=95.66 E-value=0.014 Score=50.03 Aligned_cols=24 Identities=25% Similarity=0.556 Sum_probs=21.4
Q ss_pred CCeEEEcccCCCCCeeEEEEEcCC
Q psy9228 20 GNVLRITNARLQDSGKYKCEIQGH 43 (834)
Q Consensus 20 ~~~l~~~~~~~~d~g~y~c~~~~~ 43 (834)
+..|+|.+|+.+|+|.|.|.+...
T Consensus 61 ~~sL~I~~v~~~DsG~Y~C~~~t~ 84 (95)
T cd07701 61 ELSISISDVSLSDEGQYTCSLFTM 84 (95)
T ss_pred cEEEEECcCCcccCEEEEEEeEee
Confidence 358999999999999999999754
No 159
>cd05729 Ig2_FGFR_like Second immunoglobulin (Ig)-like domain of fibroblast growth factor (FGF) receptor and similar proteins. Ig2_FGFR_like: domain similar to the second immunoglobulin (Ig)-like domain of fibroblast growth factor (FGF) receptor. FGF receptors bind FGF signaling polypeptides. FGFs participate in multiple processes such as morphogenesis, development, and angiogenesis. FGFs bind to four FGF receptor tyrosine kinases (FGFR1, -2, -3, -4). Receptor diversity is controlled by alternative splicing producing splice variants with different ligand binding characteristics and different expression patterns. FGFRs have an extracellular region comprised of three Ig-like domains, a single transmembrane helix, and an intracellular tyrosine kinase domain. Ligand binding and specificity reside in the Ig-like domains 2 and 3, and the linker region that connects these two. FGFR activation and signaling depend on FGF-induced dimerization, a process involving cell surface heparin or heparin
Probab=95.63 E-value=0.016 Score=48.29 Aligned_cols=44 Identities=20% Similarity=0.376 Sum_probs=32.9
Q ss_pred CcEEEEecCC-CCCcccc-------CCCeEEEcccCCCCCeeEEEEEcCCCC
Q psy9228 2 AYIKWSRADG-LPLQRYA-------EGNVLRITNARLQDSGKYKCEIQGHDS 45 (834)
Q Consensus 2 ~~~~w~~~~~-~~~~~~~-------~~~~l~~~~~~~~d~g~y~c~~~~~~~ 45 (834)
+.+.|.|.+. |...... ....|.|.+|+.+|+|.|.|.+.+..+
T Consensus 24 ~~v~W~k~~~~~~~~~~~~~~~~~~~~~~l~i~~~~~~d~g~Y~C~~~n~~g 75 (85)
T cd05729 24 PTITWLKDGKPFKKEHRIGGYKVRKKKWTLILESVVPSDSGKYTCIVENKYG 75 (85)
T ss_pred CeEEEEECCEECcccCceeEEEccCcEEEEEEeECCcccCEEEEEEEEECCc
Confidence 4799999655 5532111 234799999999999999999987764
No 160
>cd07690 Ig1_CD4 First immunoglobulin (Ig) domain of CD4. Ig1_CD4; first immunoglobulin (Ig) domain of CD4. CD4 and CD8 are the two primary co-receptor proteins found on the surface of T cells, and the presence of either CD4 or CD8 determines the function of the T cell. CD4 is found on helper T cells, where it is required for the binding of MHC (major histocompatibility complex) class II molecules, while CD8 is found on cytotoxic T cells, where it is required for the binding of MHC class I molecules. CD4 contains four immunoglobulin domains, with the first three included in this hierarchy. The fourth domain has a general Ig architecture, but has slight topological changes in the arrangement of beta strands relative to the other structures in this family and is not specifically included in the hierarchy.
Probab=95.51 E-value=0.017 Score=49.18 Aligned_cols=23 Identities=35% Similarity=0.690 Sum_probs=20.9
Q ss_pred eEEEcccCCCCCeeEEEEEcCCC
Q psy9228 22 VLRITNARLQDSGKYKCEIQGHD 44 (834)
Q Consensus 22 ~l~~~~~~~~d~g~y~c~~~~~~ 44 (834)
.|+|.+++.+|+|+|.|.+.+..
T Consensus 66 ~L~I~~l~~sDsgtY~C~v~~~~ 88 (94)
T cd07690 66 PLIIKNLKIEDSDTYICEVEDKK 88 (94)
T ss_pred EEEECCCCHHHCEEEEEEECCcc
Confidence 59999999999999999997654
No 161
>PF02973 Sialidase: Sialidase, N-terminal domain; InterPro: IPR004124 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Sialidases (GH33 from CAZY) hydrolyse alpha-(2->3)-, alpha-(2->6)-, alpha-(2->8)-glycosidic linkages of terminal sialic residues in oligosaccharides, glycoproteins, glycolipids, colominic acid and synthetic substrates. Sialidases may act as pathogenic factors in microbial infections []. The 1.8 A structure of trans-sialidase from leech (Macrobdella decora, Q27701 from SWISSPROT) in complex with 2-deoxy-2, 3-didehydro-NeuAc was solved. The refined model comprising residues 81-769 has a catalytic beta-propeller domain, a N-terminal lectin-like domain and an irregular beta-stranded domain inserted into the catalytic domain [].; GO: 0004308 exo-alpha-sialidase activity, 0005975 carbohydrate metabolic process; PDB: 2JKB_A 2VW2_A 2VW0_A 2VW1_A 2V73_B 2V72_A 1SLI_A 1SLL_A 2SLI_A 4SLI_A ....
Probab=95.43 E-value=0.88 Score=43.87 Aligned_cols=132 Identities=17% Similarity=0.173 Sum_probs=80.1
Q ss_pred eEEEEEEEeeCCCCeeE--EEeccCCccccCCCCCeEEEEEECcEEEEEEecc--c--EEEEee-----eeecCCCeEEE
Q psy9228 406 HFSIELSFKPTDYNGLI--MYTGDSNMKSYKGKGDFVSFGLEDGYPVFRFDVG--L--VVLRSK-----VTLVPHEWVVV 474 (834)
Q Consensus 406 ~~~i~~~frt~~~~GlL--l~~~~~~~~~~~~~~d~~~l~L~~G~l~~~~~~G--~--~~i~s~-----~~~~dg~wH~V 474 (834)
+.+|.++|++...+++- |...+.. ....|+.|.+.++.+-+.+.-. . .....+ ...++-.||.|
T Consensus 34 ~gTI~i~Fk~~~~~~~~sLfsiSn~~-----~~n~YF~lyv~~~~~G~E~R~~~~~~~y~~~~~~~v~~~~~~~~~~~tv 108 (190)
T PF02973_consen 34 EGTIVIRFKSDSNSGIQSLFSISNST-----KGNEYFSLYVSNNKLGFELRDTKGNQNYNFSRPAKVRGGYKNNVTFNTV 108 (190)
T ss_dssp SEEEEEEEEESS-SSEEEEEEEE-TS-----TTSEEEEEEEETTEEEEEEEETTTTCEEEEEESSE--SEETTEES-EEE
T ss_pred ccEEEEEEecCCCcceeEEEEecCCC-----CccceEEEEEECCEEEEEEecCCCCcccccccccEecccccCCceEEEE
Confidence 57899999998776553 3334332 2349999999999887777543 1 122222 23455679999
Q ss_pred EEEEE--CCeEEEEECCeeeeeeecCCCcc-ccccCCCceeeccccCCCcccCcCcccccCceeEEeeeEEcCeeeee
Q psy9228 475 TIIKD--FKEGKLSVGGEPLIVGSTPGEKL-QVLNLRTPLYLGGYNIYHVTPSLSVEVTEGFHGCISTIDVLGSELDL 549 (834)
Q Consensus 475 ~v~~~--~~~~~l~VD~~~~~~~~~~g~~~-~~l~~~~~l~iGG~~~~~~~~~~~~~~~~gF~GCi~~v~ing~~~~~ 549 (834)
.+..+ .+..+|++||........+...+ ..+.--..++||+..... ...-+|.|-|++|.+-.+.++-
T Consensus 109 a~~ad~~~~~ykly~NG~~v~~~~~~~~~Fis~i~~~n~~~iG~t~R~g-------~~~y~f~G~I~~l~iYn~aLsd 179 (190)
T PF02973_consen 109 AFVADSKNKGYKLYVNGELVSTLSSKSGNFISDIPGLNSVQIGGTNRAG-------SNAYPFNGTIDNLKIYNRALSD 179 (190)
T ss_dssp EEEEETTTTEEEEEETTCEEEEEEECTSS-GGGSTT--EEEESSEEETT-------EEES--EEEEEEEEEESS---H
T ss_pred EEEEecCCCeEEEEeCCeeEEEeccccccHhhcCcCCceEEEcceEeCC-------CceecccceEEEEEEEcCcCCH
Confidence 99998 77899999995554433332211 122223468999986532 2346899999999997776654
No 162
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=95.41 E-value=0.01 Score=40.04 Aligned_cols=26 Identities=31% Similarity=0.937 Sum_probs=17.4
Q ss_pred CCCC-CCeeecCCCCeEEeCCCCcccC
Q psy9228 184 GACG-DGSCQDVDGAMKCLCPIGTAGK 209 (834)
Q Consensus 184 ~~C~-~g~C~~~~~~~~C~C~~g~~G~ 209 (834)
..|. |++|++..++|.|.|++||.|.
T Consensus 6 ~~C~~nA~C~~~~~~~~C~C~~Gy~Gd 32 (36)
T PF12947_consen 6 GGCHPNATCTNTGGSYTCTCKPGYEGD 32 (36)
T ss_dssp GGS-TTCEEEE-TTSEEEEE-CEEECC
T ss_pred CCCCCCcEeecCCCCEEeECCCCCccC
Confidence 3565 5788888888888888888764
No 163
>cd07693 Ig1_Robo First immunoglobulin (Ig)-like domain in Robo (roundabout) receptors and similar proteins. Ig1_Robo: domain similar to the first immunoglobulin (Ig)-like domain in Robo (roundabout) receptors. Robo receptors play a role in the development of the central nervous system (CNS), and are receptors of Slit protein. Slit is a repellant secreted by the neural cells in the midline. Slit acts through Robo to prevent most neurons from crossing the midline from either side. Three mammalian Robo homologs (robo1, -2, and -3), and three mammalian Slit homologs (Slit-1,-2, -3), have been identified. Commissural axons, which cross the midline, express low levels of Robo; longitudinal axons, which avoid the midline, express high levels of Robo. robo1, -2, and -3 are expressed by commissural neurons in the vertebrate spinal cord and Slits 1, -2, -3 are expressed at the ventral midline. Robo-3 is a divergent member of the Robo family which instead of being a positive regulator of slit res
Probab=95.39 E-value=0.023 Score=48.96 Aligned_cols=44 Identities=18% Similarity=0.288 Sum_probs=29.1
Q ss_pred CcEEEEecCC-CCCc------ccc--CCCeEEEc-----ccCCCCCeeEEEEEcCCCC
Q psy9228 2 AYIKWSRADG-LPLQ------RYA--EGNVLRIT-----NARLQDSGKYKCEIQGHDS 45 (834)
Q Consensus 2 ~~~~w~~~~~-~~~~------~~~--~~~~l~~~-----~~~~~d~g~y~c~~~~~~~ 45 (834)
++|+|.|.+. |... +.. .++.|.+. +++.+|+|.|.|.++|..|
T Consensus 31 p~i~W~k~g~~l~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~D~G~Y~C~a~N~~G 88 (100)
T cd07693 31 PTIQWLKNGQPLETDKDDPRSHRIVLPSGSLFFLRVVHGRKGRSDEGVYVCVAHNSLG 88 (100)
T ss_pred CEEEEEECCEECccccCCCCcceEEecCCcEEEEEeeccCCCcCcCEEEEEEEEcccc
Confidence 3799999655 6541 111 34443332 3479999999999998876
No 164
>cd05900 Ig_Aggrecan Immunoglobulin (Ig)-like domain of the chondroitin sulfate proteoglycan core protein (CSPG), aggrecan. Ig_Aggrecan: immunoglobulin (Ig)-like domain of the chondroitin sulfate proteoglycan core protein (CSPG), aggrecan. These aggregates contribute to the tissue's load bearing properties. In CSPGs, the Ig-like domain is followed by hyaluronan (HA)-binding tandem repeats, and a C-terminal region with epidermal growth factor-like, lectin-like, and complement regulatory protein-like domains. Separating these N- and C-terminal regions is a nonhomologous glycosaminoglycan attachment region. In cartilage, aggrecan forms cartilage link protein stabilized aggregates with HA. These aggregates contribute to the tissue's load bearing properties. Aggrecan has a wide distribution in connective tissue and extracellular matrices. Members of the vertebrate HPLN (hyaluronan/HA and proteoglycan binding link) protein family are physically linked adjacent to CSPG genes.
Probab=95.30 E-value=0.028 Score=49.64 Aligned_cols=25 Identities=40% Similarity=0.699 Sum_probs=22.5
Q ss_pred CCCeEEEcccCCCCCeeEEEEEcCC
Q psy9228 19 EGNVLRITNARLQDSGKYKCEIQGH 43 (834)
Q Consensus 19 ~~~~l~~~~~~~~d~g~y~c~~~~~ 43 (834)
.+..|+|.|++.+|+|+|.|...++
T Consensus 72 ~~asL~I~nl~~sDsG~Y~C~V~~g 96 (112)
T cd05900 72 SDATLEITELRSNDSGTYRCEVMHG 96 (112)
T ss_pred CCcEEEEeecccccCEEEEEEEecC
Confidence 4679999999999999999999755
No 165
>PHA02826 IL-1 receptor-like protein; Provisional
Probab=95.30 E-value=0.015 Score=58.58 Aligned_cols=44 Identities=30% Similarity=0.436 Sum_probs=32.6
Q ss_pred CcEEEEecCCC-----CCcc---------c--cCCCeEEEcccCCCCCeeEEEEEcCCCC
Q psy9228 2 AYIKWSRADGL-----PLQR---------Y--AEGNVLRITNARLQDSGKYKCEIQGHDS 45 (834)
Q Consensus 2 ~~~~w~~~~~~-----~~~~---------~--~~~~~l~~~~~~~~d~g~y~c~~~~~~~ 45 (834)
.+|+|.|.+.+ +..+ + ..++.|+|.+|+.+|+|.|.|++.|..+
T Consensus 63 ~~V~W~k~~s~~~v~~~~~~~~~~~i~~~~~~~r~~~L~I~~v~~~DsG~Y~C~a~N~~~ 122 (227)
T PHA02826 63 YNVTWSKTDSLAFVRDSGARTKIKKITHNEIGDRSENLWIGNVINIDEGIYICTISSGNI 122 (227)
T ss_pred ccEEEEeCCeEEEEEcCCCccccccccccceecCCCeEEECCCChHHCEEEEEEEEECCc
Confidence 48999996643 1111 1 1357999999999999999999987543
No 166
>cd05885 Ig2_Necl-4 Second immunoglobulin (Ig)-like domain of nectin-like molecule-4 (Necl-4, also known as cell adhesion molecule 4 (CADM4)). Ig2_Necl-4: second immunoglobulin (Ig)-like domain of nectin-like molecule-4 (Necl-4, also known as cell adhesion molecule 4 (CADM4)). Nectin-like molecules have similar domain structures to those of nectins. At least five nectin-like molecules have been identified (Necl-1-Necl-5). These have an extracellular region containing three Ig-like domains, one transmembrane region, and one cytoplasmic region. Ig domains are likely to participate in ligand binding and recognition. Necl-4 is expressed on Schwann cells, and plays a key part in initiating peripheral nervous system (PNS) myelination. In injured peripheral nerve cells, the mRNA signal for both Necl-4 and Necl-5 was observed to be elevated. Necl-4 participates in cell-cell adhesion and is proposed to play a role in tumor suppression.
Probab=95.24 E-value=0.031 Score=45.75 Aligned_cols=45 Identities=24% Similarity=0.332 Sum_probs=34.9
Q ss_pred CCcEEEEecCC-CCCcccc-CC-------CeEEEcccCCCCCeeEEEEEcCCCC
Q psy9228 1 NAYIKWSRADG-LPLQRYA-EG-------NVLRITNARLQDSGKYKCEIQGHDS 45 (834)
Q Consensus 1 ~~~~~w~~~~~-~~~~~~~-~~-------~~l~~~~~~~~d~g~y~c~~~~~~~ 45 (834)
+++|+|.|.+. |+..... .. ++|+|.-+..+|.+.|.|.+.|...
T Consensus 14 ~a~i~W~k~~~~l~~~~~~~~~~~~~t~~s~L~~~~~~~Ddg~~~~C~A~n~a~ 67 (80)
T cd05885 14 AATLRWYRDRKELKGVISQQENGKTVSVSNTIRFPVDRKDDGAILSCEASHPAL 67 (80)
T ss_pred CCeEEEEECCEECCCCcccccCCceEEEEEEEEEEeeeccCCcEEEEEEEChhh
Confidence 46999999666 8764322 12 3799999999999999999987754
No 167
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=95.23 E-value=0.0086 Score=40.43 Aligned_cols=28 Identities=39% Similarity=1.035 Sum_probs=21.7
Q ss_pred CCCCCCCCEeccCCCCCceEeeCCCCCCCC
Q psy9228 143 HNNCINNGLCQDAATRIGYTCICPPGFSGD 172 (834)
Q Consensus 143 ~~pC~n~g~C~~~~~~~~~~C~C~~g~~G~ 172 (834)
...|..+++|++.. +.|+|.|++||.|.
T Consensus 5 ~~~C~~nA~C~~~~--~~~~C~C~~Gy~Gd 32 (36)
T PF12947_consen 5 NGGCHPNATCTNTG--GSYTCTCKPGYEGD 32 (36)
T ss_dssp GGGS-TTCEEEE-T--TSEEEEE-CEEECC
T ss_pred CCCCCCCcEeecCC--CCEEeECCCCCccC
Confidence 45689999999987 69999999999874
No 168
>smart00560 LamGL LamG-like jellyroll fold domain.
Probab=95.00 E-value=0.8 Score=41.92 Aligned_cols=69 Identities=22% Similarity=0.200 Sum_probs=45.2
Q ss_pred CCCeEEEEEEEEC--CeEEEEECCeeeeeeecCCCccccccCCCceeeccccCCCcccCcCcccccCceeEEeeeEEcCe
Q psy9228 468 PHEWVVVTIIKDF--KEGKLSVGGEPLIVGSTPGEKLQVLNLRTPLYLGGYNIYHVTPSLSVEVTEGFHGCISTIDVLGS 545 (834)
Q Consensus 468 dg~wH~V~v~~~~--~~~~l~VD~~~~~~~~~~g~~~~~l~~~~~l~iGG~~~~~~~~~~~~~~~~gF~GCi~~v~ing~ 545 (834)
.++||+|.+..+. ..++|+||+......... ......++.+|..... .......|.|.|.+++|-..
T Consensus 60 ~~~W~hva~v~d~~~g~~~lYvnG~~~~~~~~~-----~~~~~~~~~iG~~~~~------~~~~~~~f~G~Idevriy~~ 128 (133)
T smart00560 60 IGVWVHLAGVYDGGAGKLSLYVNGVEVATSETQ-----PSPSSGNLPQGGRILL------GGAGGENFSGRLDEVRVYNR 128 (133)
T ss_pred CCCEEEEEEEEECCCCeEEEEECCEEccccccC-----CcccCCceEEeeeccC------CCCCCCCceEEeeEEEEecc
Confidence 3999999999998 789999999765432211 1123456778731110 01123579999999999665
Q ss_pred ee
Q psy9228 546 EL 547 (834)
Q Consensus 546 ~~ 547 (834)
.+
T Consensus 129 aL 130 (133)
T smart00560 129 AL 130 (133)
T ss_pred cc
Confidence 43
No 169
>cd05774 Ig_CEACAM_D1 First immunoglobulin (Ig)-like domain of carcinoembryonic antigen (CEA) related cell adhesion molecule (CEACAM). IG_CEACAM_D1: immunoglobulin (Ig)-like domain 1 in carcinoembryonic antigen (CEA) related cell adhesion molecule (CEACAM) protein subfamily. The CEA family is a group of anchored or secreted glycoproteins, expressed by epithelial cells, leukocytes, endothelial cells and placenta. The CEA family is divided into the CEACAM and pregnancy-specific glycoprotein (PSG) subfamilies. This group represents the CEACAM subfamily. CEACAM1 has many important cellular functions, it is a cell adhesion molecule, and a signaling molecule that regulates the growth of tumor cells, it is an angiogenic factor, and is a receptor for bacterial and viral pathogens, including mouse hepatitis virus (MHV). In mice, four isoforms of CEACAM1 generated by alternative splicing have either two [D1, D4] or four [D1-D4] Ig-like domains on the cell surface. This family corresponds to the D
Probab=94.87 E-value=0.042 Score=47.88 Aligned_cols=26 Identities=23% Similarity=0.317 Sum_probs=23.4
Q ss_pred CCCeEEEcccCCCCCeeEEEEEcCCC
Q psy9228 19 EGNVLRITNARLQDSGKYKCEIQGHD 44 (834)
Q Consensus 19 ~~~~l~~~~~~~~d~g~y~c~~~~~~ 44 (834)
.++.|+|.+|+.+|+|.|.|...+..
T Consensus 68 ~ngSL~I~~v~~~D~G~Y~~~v~~~~ 93 (105)
T cd05774 68 PNGSLLIQNVTQKDTGFYTLQTITTN 93 (105)
T ss_pred CCCcEEEecCCcccCEEEEEEEEeCC
Confidence 46899999999999999999997665
No 170
>PHA02887 EGF-like protein; Provisional
Probab=94.62 E-value=0.029 Score=47.72 Aligned_cols=36 Identities=31% Similarity=0.778 Sum_probs=28.6
Q ss_pred CCccC---CCCCCCeeecC--CCCeEEeCCCCcccCccccc
Q psy9228 179 EPCYP---GACGDGSCQDV--DGAMKCLCPIGTAGKRCEQK 214 (834)
Q Consensus 179 ~~C~~---~~C~~g~C~~~--~~~~~C~C~~g~~G~~Ce~~ 214 (834)
.+|.. +-|.||+|.-. .....|.|+.||+|.+||.-
T Consensus 84 ~pC~~eyk~YCiHG~C~yI~dL~epsCrC~~GYtG~RCE~v 124 (126)
T PHA02887 84 EKCKNDFNDFCINGECMNIIDLDEKFCICNKGYTGIRCDEV 124 (126)
T ss_pred cccChHhhCEeeCCEEEccccCCCceeECCCCcccCCCCcc
Confidence 45554 56888999855 47899999999999999863
No 171
>PHA02785 IL-beta-binding protein; Provisional
Probab=94.46 E-value=0.041 Score=59.08 Aligned_cols=44 Identities=23% Similarity=0.308 Sum_probs=33.5
Q ss_pred CcEEEEecCC-CCCccccC-CCeEEEcccCCCCCeeEEEEEcCCCC
Q psy9228 2 AYIKWSRADG-LPLQRYAE-GNVLRITNARLQDSGKYKCEIQGHDS 45 (834)
Q Consensus 2 ~~~~w~~~~~-~~~~~~~~-~~~l~~~~~~~~d~g~y~c~~~~~~~ 45 (834)
.+|+|.|.++ ++...... +..|.|.+++.+|+|.|.|.+.|..+
T Consensus 61 ~~i~W~K~g~~~~~~~~~~~g~~l~i~~~~~~DsG~Y~C~a~N~~g 106 (326)
T PHA02785 61 LDILWEKRGADNDRIIPIDNGSNMLILNPTQSDSGIYICITKNETY 106 (326)
T ss_pred ceEEEEECCCccceEEEccCCceEEEcccCcccCeEEEEEEECCCc
Confidence 3699999765 55433333 44599999999999999999987653
No 172
>cd05901 Ig_Versican Immunoglobulin (Ig)-like domain of the chondroitin sulfate proteoglycan core protein (CSPG), versican. Ig_Versican: immunoglobulin (Ig)-like domain of the chondroitin sulfate proteoglycan core protein (CSPG), versican. In CSPGs, the Ig-like domain is followed by hyaluronan (HA)-binding tandem repeats, and a C-terminal region with epidermal growth factor-like, lectin-like, and complement regulatory protein-like domains. Separating these N- and C-terminal regions is a nonhomologous glycosaminoglycan attachment region. In cartilage, the CSPG aggrecan (not included in this group) forms cartilage link protein stabilized aggregates with HA. These aggregates contribute to the tissue's load bearing properties. Like aggrecan, versican has a wide distribution in connective tissue and extracellular matrices. Aggregates having other CSPGs substituting for aggrecan may contribute to the structural integrity of many different tissues. Members of the vertebrate HPLN (hyaluronan/HA
Probab=94.43 E-value=0.051 Score=47.98 Aligned_cols=25 Identities=32% Similarity=0.557 Sum_probs=22.5
Q ss_pred CCeEEEcccCCCCCeeEEEEEcCCC
Q psy9228 20 GNVLRITNARLQDSGKYKCEIQGHD 44 (834)
Q Consensus 20 ~~~l~~~~~~~~d~g~y~c~~~~~~ 44 (834)
+..|+|.+|+.+|+|+|.|....+.
T Consensus 78 ~asL~i~~v~~sD~G~Y~C~V~~g~ 102 (117)
T cd05901 78 DASLTIVKLRASDAGVYRCEVMHGI 102 (117)
T ss_pred ceEEEEcccccccCEEEEEEEEECC
Confidence 5799999999999999999997654
No 173
>cd05751 Ig1_LILRB1_like First immunoglobulin (Ig)-like domain found in Leukocyte Ig-like receptors (LILR)B1 (also known as LIR-1) and similar proteins. Ig1_LILRB1_like: domain similar to the first immunoglobulin (Ig)-like domain found in Leukocyte Ig-like receptors (LILR)B1 (also known as LIR-1). This group includes, LILRA5 (LIR9), an activating natural cytotoxicity receptor NKp46, and the immune-type receptor glycoprotein VI (GPVI). LILRs are a family of immunoreceptors expressed on expressed on T and B cells, on monocytes, dendritic cells, and subgroups of natural killer (NK) cells. The human LILR family contains nine proteins (LILRA1-3,and 5, and LILRB1-5). From functional assays, and as the cytoplasmic domains of various LILRs, for example LILRB1 (LIR-1), LILRB2 (LIR-2), and LILRB3 (LIR-3) contain immunoreceptor tyrosine-based inhibitory motifs (ITIMs) it is thought that LIR proteins are inhibitory receptors. Of the eight LIR family proteins, only LIR-1(LILRB1), and LIR-2 (LILRB2),
Probab=94.35 E-value=0.057 Score=45.78 Aligned_cols=41 Identities=12% Similarity=0.330 Sum_probs=31.3
Q ss_pred cEEEEecCC-CCCcccc---CCCeEEEcccCCCCCeeEEEEEcCC
Q psy9228 3 YIKWSRADG-LPLQRYA---EGNVLRITNARLQDSGKYKCEIQGH 43 (834)
Q Consensus 3 ~~~w~~~~~-~~~~~~~---~~~~l~~~~~~~~d~g~y~c~~~~~ 43 (834)
.+.|.|++. ++..... ....|.|.+|+.+|+|+|.|.+.+.
T Consensus 31 ~f~l~k~g~~~~~~~~~~~~~~~~f~i~~v~~~~~G~Y~C~~~~~ 75 (91)
T cd05751 31 EYRLYREGSTFAVKKPPEPQNKAKFFIPSMKREHAGRYRCYYRSG 75 (91)
T ss_pred EEEEEECCCCccccccCCCceeEEEEccCCChhHCEEEEEEEECC
Confidence 577888665 5543221 3567999999999999999999765
No 174
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=94.21 E-value=0.035 Score=48.05 Aligned_cols=37 Identities=35% Similarity=0.779 Sum_probs=29.2
Q ss_pred CCCccC---CCCCCCeeecCC--CCeEEeCCCCcccCccccc
Q psy9228 178 GEPCYP---GACGDGSCQDVD--GAMKCLCPIGTAGKRCEQK 214 (834)
Q Consensus 178 ~~~C~~---~~C~~g~C~~~~--~~~~C~C~~g~~G~~Ce~~ 214 (834)
+.+|.+ +-|.||+|.-.. +.+.|.|+.||+|.+||..
T Consensus 42 i~~Cp~ey~~YClHG~C~yI~dl~~~~CrC~~GYtGeRCEh~ 83 (139)
T PHA03099 42 IRLCGPEGDGYCLHGDCIHARDIDGMYCRCSHGYTGIRCQHV 83 (139)
T ss_pred cccCChhhCCEeECCEEEeeccCCCceeECCCCcccccccce
Confidence 335544 568889998554 8899999999999999875
No 175
>PHA02785 IL-beta-binding protein; Provisional
Probab=94.19 E-value=0.048 Score=58.59 Aligned_cols=43 Identities=28% Similarity=0.463 Sum_probs=33.8
Q ss_pred CcEEEEecCC-CCCcccc--CCCeEEEcccCCCCCeeEEEEEcCCCC
Q psy9228 2 AYIKWSRADG-LPLQRYA--EGNVLRITNARLQDSGKYKCEIQGHDS 45 (834)
Q Consensus 2 ~~~~w~~~~~-~~~~~~~--~~~~l~~~~~~~~d~g~y~c~~~~~~~ 45 (834)
++|+|.| +. +...+.. .++.|.|.+|+.+|+|.|.|.+.+..+
T Consensus 156 ~~i~W~k-~~~~~~~r~~~~~~~~L~i~~v~~~d~G~Y~C~~~n~~g 201 (326)
T PHA02785 156 ADIIWSG-HRRLRNKRLKQRTPGIITIEDVRKNDAGYYTCVLKYIYG 201 (326)
T ss_pred ceEEEcc-CCccCCcceEecCCCeEEEeecChhhCeEEEEEEEeccC
Confidence 5799988 45 5444432 467999999999999999999987654
No 176
>smart00410 IG_like Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
Probab=93.61 E-value=0.13 Score=42.19 Aligned_cols=44 Identities=25% Similarity=0.502 Sum_probs=32.4
Q ss_pred CcEEEEecCC--CCCcccc----C--CCeEEEcccCCCCCeeEEEEEcCCCC
Q psy9228 2 AYIKWSRADG--LPLQRYA----E--GNVLRITNARLQDSGKYKCEIQGHDS 45 (834)
Q Consensus 2 ~~~~w~~~~~--~~~~~~~----~--~~~l~~~~~~~~d~g~y~c~~~~~~~ 45 (834)
+++.|.+.++ ++.+.+. . ...|.|.+++.+|+|.|.|...+..+
T Consensus 24 ~~~~W~~~~~~~v~~~~~~~~~~~~~~~~l~i~~~~~~d~G~Y~C~v~~~~~ 75 (86)
T smart00410 24 PEVTWYKQGGKLLAESGRFSVSRSGSNSTLTISNVTPEDSGTYTCAATNSSG 75 (86)
T ss_pred CeEEEEECCCEEcCCCCcEEEEEcCCeeEEEEEeeccccCeEEEEEEEcCCC
Confidence 4789999734 5422222 2 26899999999999999999986554
No 177
>smart00409 IG Immunoglobulin.
Probab=93.61 E-value=0.13 Score=42.19 Aligned_cols=44 Identities=25% Similarity=0.502 Sum_probs=32.4
Q ss_pred CcEEEEecCC--CCCcccc----C--CCeEEEcccCCCCCeeEEEEEcCCCC
Q psy9228 2 AYIKWSRADG--LPLQRYA----E--GNVLRITNARLQDSGKYKCEIQGHDS 45 (834)
Q Consensus 2 ~~~~w~~~~~--~~~~~~~----~--~~~l~~~~~~~~d~g~y~c~~~~~~~ 45 (834)
+++.|.+.++ ++.+.+. . ...|.|.+++.+|+|.|.|...+..+
T Consensus 24 ~~~~W~~~~~~~v~~~~~~~~~~~~~~~~l~i~~~~~~d~G~Y~C~v~~~~~ 75 (86)
T smart00409 24 PEVTWYKQGGKLLAESGRFSVSRSGSNSTLTISNVTPEDSGTYTCAATNSSG 75 (86)
T ss_pred CeEEEEECCCEEcCCCCcEEEEEcCCeeEEEEEeeccccCeEEEEEEEcCCC
Confidence 4789999734 5422222 2 26899999999999999999986554
No 178
>PF13927 Ig_3: Immunoglobulin domain; PDB: 2D3V_A 1G0X_A 1VDG_A 1P7Q_D 3D2U_H 1UFU_A 1UGN_A 3VH8_H 3OQ3_B 4DKD_C ....
Probab=93.60 E-value=0.019 Score=46.20 Aligned_cols=24 Identities=29% Similarity=0.574 Sum_probs=21.9
Q ss_pred CCCeEEEcccCCCCCeeEEEEEcC
Q psy9228 19 EGNVLRITNARLQDSGKYKCEIQG 42 (834)
Q Consensus 19 ~~~~l~~~~~~~~d~g~y~c~~~~ 42 (834)
.+..|.|.+|..+|.|.|.|.++|
T Consensus 52 ~~~~L~i~~v~~~~~g~y~C~a~N 75 (75)
T PF13927_consen 52 SNSTLTISNVTRSDNGTYTCIASN 75 (75)
T ss_dssp EEEEEEESSCCGGGTEEEEEEEEE
T ss_pred eeeEEEEccCCHHhCcEEEEEEEC
Confidence 478999999999999999999965
No 179
>KOG4222|consensus
Probab=93.56 E-value=0.087 Score=62.38 Aligned_cols=44 Identities=25% Similarity=0.474 Sum_probs=34.7
Q ss_pred CcEEEEecCC-CC-----CccccCCCeEEEcccCCCCCeeEEEEEcCCCC
Q psy9228 2 AYIKWSRADG-LP-----LQRYAEGNVLRITNARLQDSGKYKCEIQGHDS 45 (834)
Q Consensus 2 ~~~~w~~~~~-~~-----~~~~~~~~~l~~~~~~~~d~g~y~c~~~~~~~ 45 (834)
+++.|.|++- |- .-+-+.++.|.|.||+++|+|+|.|.++|.-+
T Consensus 162 ptvSw~Kdg~pl~~~~~~~~~lisgG~LlIsnvrksD~GtY~CVatNmvG 211 (1281)
T KOG4222|consen 162 PTVSWVKDGKPLDHYDVPIIALISGGNLLISNVRKSDEGTYACVATNMVG 211 (1281)
T ss_pred CcceeecCCCccccccceeEEEecCCcEEEeccccCCCceeeeeeccccc
Confidence 4799999544 43 11223799999999999999999999998854
No 180
>smart00051 DSL delta serrate ligand.
Probab=93.49 E-value=0.11 Score=40.30 Aligned_cols=48 Identities=25% Similarity=0.505 Sum_probs=34.2
Q ss_pred ceeeecCCCCCCCCCccCccCCCCCCccCCCceeeecCCCcEEcCCCCCCCCCc
Q psy9228 591 GYNCSCLTGYSGDHCEKENNMCMKGDVCKNGGMCKVTPDSYECLCSLGYAPPNC 644 (834)
Q Consensus 591 ~~~C~C~~G~~G~~Ce~~~~~C~~~~pC~ngg~C~~~~~~~~C~C~~g~~G~~C 644 (834)
.+.-.|+++|.|..|+. .|...+-+..+.+|.. .-.|.|.+||+|+.|
T Consensus 16 ~~rv~C~~~~yG~~C~~---~C~~~~d~~~~~~Cd~---~G~~~C~~Gw~G~~C 63 (63)
T smart00051 16 QIRVTCDENYYGEGCNK---FCRPRDDFFGHYTCDE---NGNKGCLEGWMGPYC 63 (63)
T ss_pred EEEeeCCCCCcCCccCC---EeCcCccccCCccCCc---CCCEecCCCCcCCCC
Confidence 56667999999999974 4543223556677732 345789999999987
No 181
>PHA02633 hypothetical protein; Provisional
Probab=93.48 E-value=0.079 Score=40.05 Aligned_cols=25 Identities=32% Similarity=0.462 Sum_probs=22.0
Q ss_pred CCCeEEEcccCCCCCeeEEEEEcCC
Q psy9228 19 EGNVLRITNARLQDSGKYKCEIQGH 43 (834)
Q Consensus 19 ~~~~l~~~~~~~~d~g~y~c~~~~~ 43 (834)
.+..|+|-++..+|+|+|.|+..+.
T Consensus 16 ~~~nLwilpa~~sDSGiYiC~~rn~ 40 (63)
T PHA02633 16 NCNNMLILNPTQSDSGIYMCITKNE 40 (63)
T ss_pred ccccEEEeccccccCcEEEEEEcCC
Confidence 4678999999999999999999644
No 182
>cd05879 Ig_P0 Immunoglobulin (Ig)-like domain of Protein zero (P0). Ig_P0ex: immunoglobulin (Ig) domain of Protein zero (P0). P0 accounts for over 50% of the total protein in peripheral nervous system (PNS) myelin. P0 is a single-pass transmembrane glycoprotein with a highly basic intracellular domain and an Ig domain. The extracellular domain of P0 (P0-ED) is similar to the Ig variable domain, carrying one acceptor sequence for N-linked glycosylation. P0 plays a role in membrane adhesion in the spiral wraps of the myelin sheath. The intracellular domain is thought to mediate membrane apposition of the cytoplasmic faces and may, through electrostatic interactions, interact directly with lipid headgroups. It is thought that homophilic interactions of the P0 extracellular domain mediate membrane juxtaposition in the extracellular space of PNS myelin.
Probab=93.33 E-value=0.11 Score=46.16 Aligned_cols=27 Identities=19% Similarity=0.530 Sum_probs=23.8
Q ss_pred CCCeEEEcccCCCCCeeEEEEEcCCCC
Q psy9228 19 EGNVLRITNARLQDSGKYKCEIQGHDS 45 (834)
Q Consensus 19 ~~~~l~~~~~~~~d~g~y~c~~~~~~~ 45 (834)
.+..|.|.||+.+|+|+|.|...+...
T Consensus 78 ~daSI~I~nv~~sD~G~Y~C~v~n~p~ 104 (116)
T cd05879 78 KDGSIVIHNLDYTDNGTFTCDVKNPPD 104 (116)
T ss_pred CeeEEEEccCCcccCEEEEEEEEcCCC
Confidence 356899999999999999999987754
No 183
>PF06439 DUF1080: Domain of Unknown Function (DUF1080); InterPro: IPR010496 This is a family of proteins of unknown function.; PDB: 3IMM_B 3NMB_A 3S5Q_A 3OSD_A 3HBK_A 3H3L_A 3U1X_A.
Probab=93.06 E-value=0.11 Score=50.63 Aligned_cols=108 Identities=13% Similarity=0.075 Sum_probs=60.3
Q ss_pred CcceEEEEEEe-C-CCCeeEEecCCCCCCCCCCcceEEEEEECCEEEEEEEcCCcEEE------E-EeCCceecCCCcEE
Q psy9228 674 NEETIAFDFVT-D-DKNALLLWNGQPSYKNGIGREFIAVAVVNGYLEYSYDLGDGVVT------I-KFSKKPVNDGIKHS 744 (834)
Q Consensus 674 ~~~~i~~~frT-~-~~~GlLl~~~~~~~~~~~~~~~~~l~l~~G~l~~~~~~g~~~~~------l-~~s~~~~nDg~wH~ 744 (834)
..+.|+++||. . ...|++|....... .........+.|.++.-........+... . ......+..|+||+
T Consensus 53 ~df~l~~d~k~~~~~~sGi~~r~~~~~~-~~~~~~gy~~~i~~~~~~~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~W~~ 131 (185)
T PF06439_consen 53 SDFELEVDFKITPGGNSGIFFRAQSPGD-GQDWNNGYEFQIDNSGGGTGLPNSTGSLYDEPPWQLEPSVNVAIPPGEWNT 131 (185)
T ss_dssp SSEEEEEEEEE-TT-EEEEEEEESSECC-SSGGGTSEEEEEE-TTTCSTTTTSTTSBTTTB-TCB-SSS--S--TTSEEE
T ss_pred ccEEEEEEEEECCCCCeEEEEEeccccC-CCCcceEEEEEEECCCCccCCCCccceEEEeccccccccccccCCCCceEE
Confidence 67888888884 3 34566666651111 12234567777775432211111122221 1 11345688999999
Q ss_pred EEEEEECCEEEEEEcCeeeecccCCCCcccee--cCCceEEc
Q psy9228 745 VNVTRINKFGSLEVDSVIVGKGESPGSQDVIN--TRGNIYLG 784 (834)
Q Consensus 745 V~i~r~~~~~~l~VD~~~~~~~~~~~~~~~l~--~~~~lyiG 784 (834)
++|...+.++++.|||..+.....+... .. ..+.|-|-
T Consensus 132 ~~I~~~g~~i~v~vnG~~v~~~~d~~~~--~~~~~~G~Igl~ 171 (185)
T PF06439_consen 132 VRIVVKGNRITVWVNGKPVADFTDPSFP--YSNPTKGPIGLQ 171 (185)
T ss_dssp EEEEEETTEEEEEETTEEEEEEETTSHH--HHHHSSBEEEEE
T ss_pred EEEEEECCEEEEEECCEEEEEEEcCCCC--CCCCCceEEEEE
Confidence 9999999999999999988765443221 11 45566554
No 184
>KOG1836|consensus
Probab=93.00 E-value=0.13 Score=65.38 Aligned_cols=128 Identities=16% Similarity=0.053 Sum_probs=84.9
Q ss_pred CcceEEEEEEeCCCCeeEEecCCCCCCCCCCcceEEEEEECCEEEEEEEcCCc--EEEEEeCCceecCCCcEEEEEEEEC
Q psy9228 674 NEETIAFDFVTDDKNALLLWNGQPSYKNGIGREFIAVAVVNGYLEYSYDLGDG--VVTIKFSKKPVNDGIKHSVNVTRIN 751 (834)
Q Consensus 674 ~~~~i~~~frT~~~~GlLl~~~~~~~~~~~~~~~~~l~l~~G~l~~~~~~g~~--~~~l~~s~~~~nDg~wH~V~i~r~~ 751 (834)
..+.+.+..+-.+..|.|-...... ..+..+....+.....+..|-. ...+. ...++-++.||+|..++..
T Consensus 1556 ~~~~~~~~~~~~~~~~~l~~~~s~~------~~~~~~~~~~~~~~~~~~~gi~~~~~s~~-~~~~~~~~~~~~~~~~~~~ 1628 (1705)
T KOG1836|consen 1556 PAFALVFSERNVSSTGGLTHHLSKL------GTELLVQENPIGVTEKFESGITDLSTSST-PIVSLLPGGCHSVTSSTDP 1628 (1705)
T ss_pred hhHHhhhcccccccCCCcccccccc------chHHhhhhcccccchhhhhhhhhhhhcch-hhhhhcCCcceeeeeecCC
Confidence 4455666666666555555443322 3466666667776666554432 22333 4567889999999999999
Q ss_pred CEEEEEEcCeeeecccCCCCccceecCCceEEcCcCCCCCCCCCccCCCceEEEEEEEECCce
Q psy9228 752 KFGSLEVDSVIVGKGESPGSQDVINTRGNIYLGGTPNMDLMTGGRYVHPMSGLMMNIHIQNKH 814 (834)
Q Consensus 752 ~~~~l~VD~~~~~~~~~~~~~~~l~~~~~lyiGG~p~~~~~~~~~~~~~F~GCi~~v~in~~~ 814 (834)
..+.+.+|...+. . ......+...++++||+|......-.....+|.||| +.+++.+
T Consensus 1629 ~v~~~~~~~~~~~-~---~~~~~~~~~~p~~~~~~~~s~~~~~~~~~~~~~~~~--~~~~~~~ 1685 (1705)
T KOG1836|consen 1629 GVVQLEDDTYTVG-E---IPPPPADTQEPIKLGGYPSSLTTLRIAVLKSFTGCI--FVVMGIR 1685 (1705)
T ss_pred ccccccccceecc-c---CCCCchhccCCcccCCccccccceeeecccccccce--EEecCCC
Confidence 9999999983222 2 222345677899999999874333345678899999 7777765
No 185
>PF07354 Sp38: Zona-pellucida-binding protein (Sp38); InterPro: IPR010857 This family contains a number of zona-pellucida-binding proteins that seem to be restricted to mammals. These are sperm proteins that bind to the 90 kDa family of zona pellucida glycoproteins in a calcium-dependent manner []. These represent some of the specific molecules that mediate the first steps of gamete interaction, allowing fertilisation to occur [].; GO: 0007339 binding of sperm to zona pellucida, 0005576 extracellular region
Probab=92.70 E-value=0.12 Score=51.47 Aligned_cols=43 Identities=26% Similarity=0.465 Sum_probs=35.4
Q ss_pred CCcEEEEecCC--CCCcccc---CCCeEEEcccCCCCCeeEEEEEcCC
Q psy9228 1 NAYIKWSRADG--LPLQRYA---EGNVLRITNARLQDSGKYKCEIQGH 43 (834)
Q Consensus 1 ~~~~~w~~~~~--~~~~~~~---~~~~l~~~~~~~~d~g~y~c~~~~~ 43 (834)
+.++.|...+| |+.+.++ ..|.|.|.+++++|+|.|+|+-++.
T Consensus 12 DP~y~W~GP~g~~l~gn~~~nIT~TG~L~~~~F~esmSG~YTCtLsYk 59 (271)
T PF07354_consen 12 DPTYLWTGPNGKPLSGNSYVNITETGKLMFKNFQESMSGSYTCTLSYK 59 (271)
T ss_pred CCceEEECCCCcccCCCCeEEEccCceEEeeccccccCCceEEEEEEE
Confidence 46899999888 6555444 5789999999999999999999654
No 186
>PF13895 Ig_2: Immunoglobulin domain; PDB: 2V5R_B 2V5M_A 2V5S_B 2GI7_A 3LAF_A 4DEP_C 3O4O_B 2EC8_A 2E9W_A 1J87_A ....
Probab=92.62 E-value=0.07 Score=43.55 Aligned_cols=39 Identities=26% Similarity=0.526 Sum_probs=29.8
Q ss_pred CcEEEEecCC-CCCccccCCCeEEEcccCCCCCeeEEEEEcCCC
Q psy9228 2 AYIKWSRADG-LPLQRYAEGNVLRITNARLQDSGKYKCEIQGHD 44 (834)
Q Consensus 2 ~~~~w~~~~~-~~~~~~~~~~~l~~~~~~~~d~g~y~c~~~~~~ 44 (834)
.++.|.|.+. |+... +.|.+.++..+|+|.|.|.+.+..
T Consensus 29 ~~~~w~~~~~~~~~~~----~~~~~~~~~~~~~g~y~C~~~~~~ 68 (80)
T PF13895_consen 29 PQVQWYKNGSPINSSQ----NGLFIPNVSPEDSGNYTCRASNGS 68 (80)
T ss_dssp SEEEEEETTEEEEEES----SEEEESSEEGGGTEEEEEEEEETT
T ss_pred eeeeeeeeeeeeeeee----eeeeeeeeccccCEEEEEEEEeCC
Confidence 4789999555 65222 228889999999999999997654
No 187
>cd05898 Ig5_KIRREL3 Fifth immunoglobulin (Ig)-like domain of Kirrel (kin of irregular chiasm-like) 3 protein (also known as Neph2). Ig5_KIRREL3: the fifth immunoglobulin (Ig)-like domain of Kirrel (kin of irregular chiasm-like) 3 protein (also known as Neph2). This protein has five Ig-like domains, one transmembrane domain, and a cytoplasmic tail. Included in this group is mammalian Kirrel (Neph1). These proteins contain multiple Ig domains, have properties of cell adhesion molecules, and are important in organ development. Neph1 and 2 may mediate axonal guidance and synapse formation in certain areas of the CNS. In the kidney, they participate in the formation of the slit diaphragm.
Probab=92.60 E-value=0.21 Score=42.85 Aligned_cols=25 Identities=24% Similarity=0.328 Sum_probs=22.7
Q ss_pred CeEEEcccCCCC-CeeEEEEEcCCCC
Q psy9228 21 NVLRITNARLQD-SGKYKCEIQGHDS 45 (834)
Q Consensus 21 ~~l~~~~~~~~d-~g~y~c~~~~~~~ 45 (834)
..|+|.+++.+| .|.|.|.+.|.+|
T Consensus 64 S~L~I~~~~~~d~~g~Y~C~a~N~~G 89 (98)
T cd05898 64 STLTINNIMEADFQTHYNCTAWNSFG 89 (98)
T ss_pred EEEEECCCccccCCcEEEEEEEeCCc
Confidence 589999998877 7999999999987
No 188
>KOG1836|consensus
Probab=92.49 E-value=0.18 Score=64.14 Aligned_cols=116 Identities=15% Similarity=0.090 Sum_probs=67.0
Q ss_pred CcceEEEEEECCEEEEEEEcCCcEEEEEeCCceecCCCcEEEEEEEECCEEEEEEcCeeeecccCCCCccceecCCceEE
Q psy9228 704 GREFIAVAVVNGYLEYSYDLGDGVVTIKFSKKPVNDGIKHSVNVTRINKFGSLEVDSVIVGKGESPGSQDVINTRGNIYL 783 (834)
Q Consensus 704 ~~~~~~l~l~~G~l~~~~~~g~~~~~l~~s~~~~nDg~wH~V~i~r~~~~~~l~VD~~~~~~~~~~~~~~~l~~~~~lyi 783 (834)
-.+|+.+....+++.........+..+. +...+ |.|.....+..-.|.+|....... ++ ........++|.
T Consensus 1413 ~~~~~~~~~~~~~~~g~~~~~~~~~k~~-a~e~~-----~~~~~~~~~~~~~l~~~~~~~~~~--~~-~~~~~~~~~~~~ 1483 (1705)
T KOG1836|consen 1413 CEDFGGLQLCAGRLLGRACERCAEGKYG-APEAL-----HEVACGAERSAGRLVRDCNCPSSY--EG-NLCWECQAPNYL 1483 (1705)
T ss_pred hhhhhhhhhhhhhhhcchhhhhhhhhcc-Ccccc-----eeeecccccccccccccccCCCCc--cc-cccccccccccc
Confidence 3688888888888877665444333232 22223 777777777777777777632221 11 122334456777
Q ss_pred cCcCCCCCCC--CCccCCCceEEEEEEEECCceecccCCCCCCCccccccc
Q psy9228 784 GGTPNMDLMT--GGRYVHPMSGLMMNIHIQNKHISNIGSSASSGLHVMPWI 832 (834)
Q Consensus 784 GG~p~~~~~~--~~~~~~~F~GCi~~v~in~~~~~~l~~~a~~~~nv~~c~ 832 (834)
|+........ .......|.||+.+..++... +.. ...++.+..|+
T Consensus 1484 ~~~~~G~~~~~~~~~~~~~~~~~~~~~~l~~~~---~~~-~~~~~~~~~c~ 1530 (1705)
T KOG1836|consen 1484 GGPVEGCDECKCQEPDIQSFTGCLCDTQLGQAK---CRS-KSAGRSCDKCL 1530 (1705)
T ss_pred cCCCCCchhhccccCCcccccccccchhccccc---ccc-cccCceecccc
Confidence 7765543221 223456788888888886652 222 23666666665
No 189
>cd05886 Ig1_Nectin-1_like First immunoglobulin (Ig) domain of nectin-1 (also known as poliovirus receptor related protein 1, or as CD111) and similar proteins. Ig1_Nectin-1_like: domain similar to the first immunoglobulin (Ig) domain of nectin-1 (also known as poliovirus receptor related protein 1, or as CD111). Nectin-1 belongs to the nectin family comprised of four transmembrane glycoproteins (nectins-1 through -4). Nectins are synaptic cell adhesion molecules (CAMs) which facilitate adhesion and signaling at various intracellular junctions. Nectins form homophilic cis-dimers, followed by homophilic and heterophilic trans-dimers involved in cell-cell adhesion. In addition nectins heterophilically trans-interact with other CAMs such as nectin-like molecules (Necls), nectin-1 for example, has been shown to trans-interact with Necl-1. Nectins also interact with various other proteins, including the actin filament (F-actin)-binding protein, afadin. Mutation in the human nectin-1 gene is
Probab=92.45 E-value=0.11 Score=44.64 Aligned_cols=24 Identities=25% Similarity=0.595 Sum_probs=22.0
Q ss_pred CCeEEEcccCCCCCeeEEEEEcCC
Q psy9228 20 GNVLRITNARLQDSGKYKCEIQGH 43 (834)
Q Consensus 20 ~~~l~~~~~~~~d~g~y~c~~~~~ 43 (834)
+..|.|.||+.+|+|+|.|...+-
T Consensus 62 daSi~i~nl~~~D~G~Y~C~v~t~ 85 (99)
T cd05886 62 DGTISLSRLELEDEGVYICEFATF 85 (99)
T ss_pred cceEEEcCCccccCEEEEEEEEeC
Confidence 579999999999999999999873
No 190
>cd05888 Ig1_Nectin-4_like Frst immunoglobulin (Ig) domain of nectin-4 (also known as poliovirus receptor related protein 4, or as LNIR receptor) and similar proteins. Ig1_Nectin-4_like: domain similar to the first immunoglobulin (Ig) domain of nectin-4 (also known as poliovirus receptor related protein 4, or as LNIR receptor). Nectin-4 belongs to the nectin family, which is comprised of four transmembrane glycoproteins (nectins-1 through -4). Nectins are synaptic cell adhesion molecules (CAMs) which participate in adhesion and signaling at various intracellular junctions. Nectins form homophilic cis-dimers, followed by homophilic and heterophilic trans-dimers involved in cell-cell adhesion. For example nectin-4 trans-interacts with nectin-1. Nectin-4 has also been shown to interact with the actin filament-binding protein, afadin. Unlike the other nectins, which are widely expressed in adult tissues, nectin-4 is mainly expressed during embryogenesis, and is not detected in normal adult
Probab=92.19 E-value=0.13 Score=44.49 Aligned_cols=23 Identities=39% Similarity=0.651 Sum_probs=21.0
Q ss_pred CCeEEEcccCCCCCeeEEEEEcC
Q psy9228 20 GNVLRITNARLQDSGKYKCEIQG 42 (834)
Q Consensus 20 ~~~l~~~~~~~~d~g~y~c~~~~ 42 (834)
+..|+|.||+.+|+|+|.|...+
T Consensus 63 ~~sL~I~nv~~sD~GtY~C~v~~ 85 (100)
T cd05888 63 DGALILRNAVQADEGKYKCRVIT 85 (100)
T ss_pred ceEEEEecCccccceEEEEEEEe
Confidence 56899999999999999999864
No 191
>cd05714 Ig_CSPGs_LP Immunoglobulin (Ig)-like domain of chondroitin sulfate proteoglycans (CSPGs), human cartilage link protein (LP) and similar proteins. Ig_CSPGs_LP: immunoglobulin (Ig)-like domain similar to that found in chondroitin sulfate proteoglycans (CSPGs) and human cartilage link protein (LP). Included in this group are the CSPGs aggrecan, versican, and neurocan. In CSPGs this Ig-like domain is followed by hyaluronan (HA)-binding tandem repeats, and a C-terminal region with epidermal growth factor-like, lectin-like, and complement regulatory protein-like domains. Separating these N- and C-terminal regions is a nonhomologous glycosaminoglycan attachment region. In cartilage, aggrecan forms cartilage link protein stabilized aggregates with hyaluronan (HA). These aggregates contribute to the tissue's load bearing properties. Aggrecan and versican have a wide distribution in connective tissue and extracellular matrices. Neurocan is localized almost exclusively in nervous tissue.
Probab=92.05 E-value=0.11 Score=45.34 Aligned_cols=25 Identities=44% Similarity=0.755 Sum_probs=22.5
Q ss_pred CCeEEEcccCCCCCeeEEEEEcCCC
Q psy9228 20 GNVLRITNARLQDSGKYKCEIQGHD 44 (834)
Q Consensus 20 ~~~l~~~~~~~~d~g~y~c~~~~~~ 44 (834)
+..|+|.+|+.+|+|+|.|...++.
T Consensus 67 ~~sL~I~~v~~sD~G~Y~C~v~~~~ 91 (106)
T cd05714 67 DASLVITDLRLEDSGRYRCEVIDGI 91 (106)
T ss_pred cceEEECCCCHHHCEEEEEEEEcCC
Confidence 5789999999999999999998654
No 192
>cd05741 Ig_CEACAM_D1_like First immunoglobulin (Ig)-like domain of carcinoembryonic antigen (CEA) related cell adhesion molecule (CEACAM) and similar proteins. Ig_CEACAM_D1_like : immunoglobulin (IG)-like domain 1 in carcinoembryonic antigen (CEA) related cell adhesion molecule (CEACAM) protein subfamily-like. The CEA family is a group of anchored or secreted glycoproteins, expressed by epithelial cells, leukocytes, endothelial cells and placenta. The CEA family is divided into the CEACAM and pregnancy-specific glycoprotein (PSG) subfamilies. This group represents the CEACAM subfamily. CEACAM1 has many important cellular functions, it is a cell adhesion molecule, and a signaling molecule that regulates the growth of tumor cells, it is an angiogenic factor, and is a receptor for bacterial and viral pathogens, including mouse hepatitis virus (MHV). In mice, four isoforms of CEACAM1 generated by alternative splicing have either two [D1, D4] or four [D1-D4] Ig-like domains on the cell surf
Probab=92.02 E-value=0.14 Score=43.34 Aligned_cols=26 Identities=31% Similarity=0.427 Sum_probs=22.7
Q ss_pred CCCeEEEcccCCCCCeeEEEEEcCCC
Q psy9228 19 EGNVLRITNARLQDSGKYKCEIQGHD 44 (834)
Q Consensus 19 ~~~~l~~~~~~~~d~g~y~c~~~~~~ 44 (834)
.++.|+|.+++.+|+|.|.|......
T Consensus 55 ~~~sL~I~~l~~~DsG~Y~c~v~~~~ 80 (92)
T cd05741 55 PNGSLLIQNLTKEDSGTYTLQIISTN 80 (92)
T ss_pred CCceEEEccCCchhcEEEEEEEEcCC
Confidence 46899999999999999999996543
No 193
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=91.78 E-value=0.13 Score=31.10 Aligned_cols=17 Identities=41% Similarity=1.107 Sum_probs=13.9
Q ss_pred CeEEeCCCCcc----cCcccc
Q psy9228 197 AMKCLCPIGTA----GKRCEQ 213 (834)
Q Consensus 197 ~~~C~C~~g~~----G~~Ce~ 213 (834)
+|+|.|++||. |..|+.
T Consensus 1 sy~C~C~~Gy~l~~d~~~C~D 21 (24)
T PF12662_consen 1 SYTCSCPPGYQLSPDGRSCED 21 (24)
T ss_pred CEEeeCCCCCcCCCCCCcccc
Confidence 69999999997 667753
No 194
>cd05759 Ig2_KIRREL3-like Second immunoglobulin (Ig)-like domain of Kirrel (kin of irregular chiasm-like) 3 (also known as Neph2). Ig2_KIRREL3-like: domain similar to the second immunoglobulin (Ig)-like domain of Kirrel (kin of irregular chiasm-like) 3 (also known as Neph2). This protein has five Ig-like domains, one transmembrane domain, and a cytoplasmic tail. Included in this group is mammalian Kirrel (Neph1), Kirrel2 (Neph3), and Drosophila RST (irregular chiasm C-roughest) protein. These proteins contain multiple Ig domains, have properties of cell adhesion molecules, and are important in organ development.
Probab=91.78 E-value=0.28 Score=40.51 Aligned_cols=43 Identities=23% Similarity=0.409 Sum_probs=30.9
Q ss_pred CcEEEEecCC-CCCcccc----CC-------CeEEEcccCCCCCeeEEEEEcCCC
Q psy9228 2 AYIKWSRADG-LPLQRYA----EG-------NVLRITNARLQDSGKYKCEIQGHD 44 (834)
Q Consensus 2 ~~~~w~~~~~-~~~~~~~----~~-------~~l~~~~~~~~d~g~y~c~~~~~~ 44 (834)
++|+|.|.+. |...... .+ ..|.|...+.+|.|.|.|.+.+..
T Consensus 15 ~~v~W~~~g~~l~~~~~~~~~~~~~~~~~~~s~L~i~~~~~~~~~~y~C~a~n~~ 69 (82)
T cd05759 15 AEIIWFRDGEVLDGATYSKELLKDGKRETTVSTLPITPSDHDTGRTFTCRARNEA 69 (82)
T ss_pred CEEEEEECCEEccCceeeeeEecCCCeEEEEEEEEEeCceecCCCEEEEEEcCcc
Confidence 4899999555 7654321 12 267788888889999999997653
No 195
>KOG0994|consensus
Probab=91.78 E-value=0.37 Score=56.88 Aligned_cols=53 Identities=38% Similarity=0.918 Sum_probs=31.9
Q ss_pred ceEeeCCCCCCCCCCCC-------C-CCCccCCCCC-CC----eeecCCCCeEEeCCCCcccCccccc
Q psy9228 160 GYTCICPPGFSGDRCSV-------L-GEPCYPGACG-DG----SCQDVDGAMKCLCPIGTAGKRCEQK 214 (834)
Q Consensus 160 ~~~C~C~~g~~G~~Ce~-------~-~~~C~~~~C~-~g----~C~~~~~~~~C~C~~g~~G~~Ce~~ 214 (834)
.-.|.|.|||.|..|.+ + .-.|..--|- .| .|. ...-.|.|.+|..|++|.+-
T Consensus 1083 tGQCqCkpGfGGR~C~qCqel~WGdP~~~C~aCdCd~rG~~tpQCd--r~tG~C~C~~Gv~G~rCdqC 1148 (1758)
T KOG0994|consen 1083 TGQCQCKPGFGGRTCSQCQELYWGDPNEKCRACDCDPRGIETPQCD--RATGRCVCRPGVGGPRCDQC 1148 (1758)
T ss_pred ccceeccCCCCCcchhHHHHhhcCCCCCCceecCCCCCCCCCCCcc--ccCCceeecCCCCCcchhhh
Confidence 45799999999999853 1 1123222222 12 222 13346888888888888764
No 196
>cd05713 Ig_MOG_like Immunoglobulin (Ig)-like domain of myelin oligodendrocyte glycoprotein (MOG). Ig_MOG_like: immunoglobulin (Ig)-like domain of myelin oligodendrocyte glycoprotein (MOG). MOG, a minor component of the myelin sheath, is an important CNS-specific autoantigen, linked to the pathogenesis of multiple sclerosis (MS) and experimental autoimmune encephalomyelitis (EAE). It is a transmembrane protein having an extracellular Ig domain. MOG is expressed in the CNS on the outermost lamellae of the myelin sheath, and on the surface of oligodendrocytes, and may participate in the completion, compaction, and/or maintenance of myelin. This group also includes butyrophilin (BTN). BTN is the most abundant protein in bovine milk-fat globule membrane (MFGM).
Probab=91.60 E-value=0.13 Score=44.40 Aligned_cols=25 Identities=40% Similarity=0.595 Sum_probs=21.9
Q ss_pred CeEEEcccCCCCCeeEEEEEcCCCC
Q psy9228 21 NVLRITNARLQDSGKYKCEIQGHDS 45 (834)
Q Consensus 21 ~~l~~~~~~~~d~g~y~c~~~~~~~ 45 (834)
..|+|.+|+.+|+|+|.|...+..+
T Consensus 65 ~sL~I~~v~~~D~G~Y~C~v~~~~~ 89 (100)
T cd05713 65 VALRIHNVRASDEGLYTCFFQSDGF 89 (100)
T ss_pred EEEEEEcCChhhCEEEEEEEecCCc
Confidence 4799999999999999999976543
No 197
>PF06247 Plasmod_Pvs28: Plasmodium ookinete surface protein Pvs28; InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=91.36 E-value=0.2 Score=47.00 Aligned_cols=63 Identities=27% Similarity=0.654 Sum_probs=42.2
Q ss_pred CCCCCCCCEeccCCC---CCceEeeCCCCCCCC--CCCCCCCCccCCCCCCCeeecCC---CCeEEeCCCCcc
Q psy9228 143 HNNCINNGLCQDAAT---RIGYTCICPPGFSGD--RCSVLGEPCYPGACGDGSCQDVD---GAMKCLCPIGTA 207 (834)
Q Consensus 143 ~~pC~n~g~C~~~~~---~~~~~C~C~~g~~G~--~Ce~~~~~C~~~~C~~g~C~~~~---~~~~C~C~~g~~ 207 (834)
..||.+.++|.+... ...|+|.|.+||.=. .| ..+.|...-|.+|.|+-.+ ....|.|.-|+.
T Consensus 49 ~K~Cgdya~C~~~~~~~~~~~~~C~C~~gY~~~~~vC--vp~~C~~~~Cg~GKCI~d~~~~~~~~CSC~IGkV 119 (197)
T PF06247_consen 49 NKPCGDYAKCINQANKGEERAYKCDCINGYILKQGVC--VPNKCNNKDCGSGKCILDPDNPNNPTCSCNIGKV 119 (197)
T ss_dssp TSEEETTEEEEE-SSTTSSTSEEEEE-TTEEESSSSE--EEGGGSS---TTEEEEEEEGGGSEEEEEE-TEEE
T ss_pred CccccchhhhhcCCCcccceeEEEecccCceeeCCeE--chhhcCceecCCCeEEecCCCCCCceeEeeeceE
Confidence 688999999997653 467999999999732 34 2345666677778998443 566999999976
No 198
>cd00096 Ig Immunoglobulin domain. Ig: immunoglobulin (Ig) domain found in the Ig superfamily. The Ig superfamily is a heterogenous group of proteins, built on a common fold comprised of a sandwich of two beta sheets. Members of this group are components of immunoglobulin, neuroglia, cell surface glycoproteins, such as, T-cell receptors, CD2, CD4, CD8, and membrane glycoproteins, such as, butyrophilin and chondroitin sulfate proteoglycan core protein. A predominant feature of most Ig domains is a disulfide bridge connecting the two beta-sheets with a tryptophan residue packed against the disulfide bond.
Probab=91.36 E-value=0.3 Score=37.82 Aligned_cols=43 Identities=30% Similarity=0.563 Sum_probs=32.0
Q ss_pred CcEEEEecCC-CCCcc----------ccCCCeEEEcccCCCCCeeEEEEEcCCC
Q psy9228 2 AYIKWSRADG-LPLQR----------YAEGNVLRITNARLQDSGKYKCEIQGHD 44 (834)
Q Consensus 2 ~~~~w~~~~~-~~~~~----------~~~~~~l~~~~~~~~d~g~y~c~~~~~~ 44 (834)
..+.|.|.+. ..... ......|.|++++.+|+|.|.|...+..
T Consensus 13 ~~~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~l~i~~~~~~d~g~y~C~~~~~~ 66 (74)
T cd00096 13 PTITWLKNGKPLPSSVLTRVRSSRGTSSGSSTLTISNVTLEDSGTYTCVASNSA 66 (74)
T ss_pred CcEEEEECCEECCCcccEEeccccCcceeEEEEEECccCcccCcEEEEEEecCc
Confidence 4789998665 44332 1135689999999999999999997543
No 199
>smart00051 DSL delta serrate ligand.
Probab=91.28 E-value=0.25 Score=38.30 Aligned_cols=45 Identities=22% Similarity=0.488 Sum_probs=32.1
Q ss_pred eEeeCCCCCCCCCCCCCCCCccC-CCCCC-CeeecCCCCeEEeCCCCcccCcc
Q psy9228 161 YTCICPPGFSGDRCSVLGEPCYP-GACGD-GSCQDVDGAMKCLCPIGTAGKRC 211 (834)
Q Consensus 161 ~~C~C~~g~~G~~Ce~~~~~C~~-~~C~~-g~C~~~~~~~~C~C~~g~~G~~C 211 (834)
|.=.|+++|.|..|+. .|.+ +.+.+ .+|.. .-.|.|++||+|+.|
T Consensus 17 ~rv~C~~~~yG~~C~~---~C~~~~d~~~~~~Cd~---~G~~~C~~Gw~G~~C 63 (63)
T smart00051 17 IRVTCDENYYGEGCNK---FCRPRDDFFGHYTCDE---NGNKGCLEGWMGPYC 63 (63)
T ss_pred EEeeCCCCCcCCccCC---EeCcCccccCCccCCc---CCCEecCCCCcCCCC
Confidence 4456999999999965 5654 23443 47743 245889999999987
No 200
>KOG4221|consensus
Probab=91.20 E-value=0.23 Score=59.39 Aligned_cols=42 Identities=31% Similarity=0.671 Sum_probs=38.3
Q ss_pred cEEEEecCC-CCCcccc---CCCeEEEcccCCCCCeeEEEEEcCCC
Q psy9228 3 YIKWSRADG-LPLQRYA---EGNVLRITNARLQDSGKYKCEIQGHD 44 (834)
Q Consensus 3 ~~~w~~~~~-~~~~~~~---~~~~l~~~~~~~~d~g~y~c~~~~~~ 44 (834)
+|+|.|.+. ||.+.+. .++.|.|.+++++|.|.|.|.++++-
T Consensus 171 ~i~w~kn~~pl~~~~r~i~lpsG~L~I~~~qp~d~g~yrc~vt~g~ 216 (1381)
T KOG4221|consen 171 TIMWLKNDQPLPLDSRVIVLPSGALEISGLQPSDTGEYRCVVTGGA 216 (1381)
T ss_pred eeEEecccccccCCCcEEEcCCCcEEecccccCCCceEEEEEecCC
Confidence 699999998 9998777 58999999999999999999999874
No 201
>cd05887 Ig1_Nectin-3_like First immunoglobulin (Ig) domain of nectin-3 (also known as poliovirus receptor related protein 3) and similar proteins. Ig1_Nectin-3_like: domain similar to the first immunoglobulin (Ig) domain of nectin-3 (also known as poliovirus receptor related protein 3). Nectin-3 belongs to the nectin family comprised of four transmembrane glycoproteins (nectins-1 through -4). Nectins are synaptic cell adhesion molecules (CAMs) which participate in adhesion and signaling at various intracellular junctions. Nectins form homophilic cis-dimers, followed by homophilic and heterophilic trans-dimers involved in cell-cell adhesion. For example, during spermatid development, the nectin-3,-2 trans-interaction is required for the formation of Sertoli cell-spermatid junctions in testis, and during morphogenesis of the ciliary body, the nectin-3,-1 trans-interaction is important for apex-apex adhesion between the pigment and non-pigment layers of the ciliary epithelia. Nectins also
Probab=91.15 E-value=0.18 Score=42.99 Aligned_cols=22 Identities=23% Similarity=0.495 Sum_probs=20.6
Q ss_pred CCeEEEcccCCCCCeeEEEEEc
Q psy9228 20 GNVLRITNARLQDSGKYKCEIQ 41 (834)
Q Consensus 20 ~~~l~~~~~~~~d~g~y~c~~~ 41 (834)
+..|+|+||+.+|+|.|.|...
T Consensus 59 d~sl~i~nv~~~D~G~Y~C~v~ 80 (96)
T cd05887 59 DATIMLENVGFSDIGVYICKAV 80 (96)
T ss_pred ccEEEEeCCCccccEEEEEEEE
Confidence 7799999999999999999975
No 202
>KOG4194|consensus
Probab=91.10 E-value=0.35 Score=53.64 Aligned_cols=62 Identities=23% Similarity=0.471 Sum_probs=44.5
Q ss_pred CCcEEEEecCC-CCCcccc-------------------CCCeEEEcccCCCCCeeEEEEEcCCCCccCCcEEEEcCceeE
Q psy9228 1 NAYIKWSRADG-LPLQRYA-------------------EGNVLRITNARLQDSGKYKCEIQGHDSFRGSDYVKLNVERMM 60 (834)
Q Consensus 1 ~~~~~w~~~~~-~~~~~~~-------------------~~~~l~~~~~~~~d~g~y~c~~~~~~~~~~~~~~~~~~~g~~ 60 (834)
+.+|.|||+.. +|.-.-. .-+.|+|.||.-.|+|.|.|.++|..| .+|- .+..+
T Consensus 534 plsi~WR~dnevq~rv~ved~a~~~~q~r~~~~~e~~~~~~~L~L~nVt~td~grYQCVvtN~FG---Stys---qk~Kl 607 (873)
T KOG4194|consen 534 PLSIEWRKDNEVQPRVDVEDFATFLSQNRNGTFGEREEYTAILHLDNVTFTDEGRYQCVVTNHFG---STYS---QKAKL 607 (873)
T ss_pred CceeEeeeccccCcccchhcchhhhhhhccCccchhhhhhheeeeeeeecccCceEEEEEecccC---cchh---heeEE
Confidence 35899999777 7653221 025899999999999999999998876 3332 55666
Q ss_pred EEcCCcce
Q psy9228 61 FVDGIGPF 68 (834)
Q Consensus 61 cvd~~~~~ 68 (834)
.|+..+.|
T Consensus 608 tV~~~PsF 615 (873)
T KOG4194|consen 608 TVNQAPSF 615 (873)
T ss_pred EeeccCcc
Confidence 66666555
No 203
>KOG1226|consensus
Probab=91.09 E-value=0.29 Score=55.86 Aligned_cols=61 Identities=28% Similarity=0.683 Sum_probs=48.6
Q ss_pred CCCCCCCEeccCCCCCceEeeCCCCCC----CCCCCCCCCCccCC---CCC-CCeeecCCCCeEEeCCCCcccCccccc
Q psy9228 144 NNCINNGLCQDAATRIGYTCICPPGFS----GDRCSVLGEPCYPG---ACG-DGSCQDVDGAMKCLCPIGTAGKRCEQK 214 (834)
Q Consensus 144 ~pC~n~g~C~~~~~~~~~~C~C~~g~~----G~~Ce~~~~~C~~~---~C~-~g~C~~~~~~~~C~C~~g~~G~~Ce~~ 214 (834)
-+|.+.|.|.-. .|.|.+... |++||-+--.|..+ -|. +|.|.-. .|.|.+||+|..|+-.
T Consensus 514 ~vCSgrG~C~CG------qC~C~~~~~~~i~G~fCECDnfsC~r~~g~lC~g~G~C~CG----~CvC~~GwtG~~C~C~ 582 (783)
T KOG1226|consen 514 PVCSGRGDCVCG------QCVCHKPDNGKIYGKFCECDNFSCERHKGVLCGGHGRCECG----RCVCNPGWTGSACNCP 582 (783)
T ss_pred CCcCCCCcEeCC------ceEecCCCCCceeeeeeeccCcccccccCcccCCCCeEeCC----cEEcCCCCccCCCCCC
Confidence 489999999974 599986655 99998777777664 576 5887754 6999999999999865
No 204
>cd05880 Ig_EVA1 Immunoglobulin (Ig)-like domain of epithelial V-like antigen 1 (EVA). Ig_EVA: immunoglobulin (Ig) domain of epithelial V-like antigen 1 (EVA). EVA is also known as myelin protein zero-like 2. EVA is an adhesion molecule, which may play a role in structural organization of the thymus and early lymphocyte development.
Probab=91.08 E-value=0.2 Score=44.54 Aligned_cols=27 Identities=19% Similarity=0.443 Sum_probs=23.1
Q ss_pred CCeEEEcccCCCCCeeEEEEEcCCCCc
Q psy9228 20 GNVLRITNARLQDSGKYKCEIQGHDSF 46 (834)
Q Consensus 20 ~~~l~~~~~~~~d~g~y~c~~~~~~~~ 46 (834)
+..|+|.+|+.+|+|+|.|...+....
T Consensus 78 ~~sL~I~~v~~~D~G~Y~C~v~~~~~~ 104 (115)
T cd05880 78 DASILIWQLQPTDNGTYTCQVKNPPDV 104 (115)
T ss_pred ceEEEEeeCChhhCEEEEEEEEeCCCC
Confidence 346999999999999999999876543
No 205
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=90.90 E-value=0.18 Score=34.07 Aligned_cols=18 Identities=33% Similarity=0.927 Sum_probs=10.5
Q ss_pred eeecCCCCeEEeCCCCcc
Q psy9228 190 SCQDVDGAMKCLCPIGTA 207 (834)
Q Consensus 190 ~C~~~~~~~~C~C~~g~~ 207 (834)
.|++..++|+|.|++||.
T Consensus 11 ~C~~~~g~~~C~C~~Gy~ 28 (36)
T PF14670_consen 11 ICVNTPGSYRCSCPPGYK 28 (36)
T ss_dssp EEEEETTSEEEE-STTEE
T ss_pred CCccCCCceEeECCCCCE
Confidence 466666666666666654
No 206
>cd05717 Ig1_Necl-1-3_like First (N-terminal) immunoglobulin (Ig)-like domain of the nectin-like molecules Necl-1 - Necl-3 (also known as cell adhesion molecules CADM3, CADM1, and CADM2 respectively). Ig1_Necl-1-3_like: N-terminal immunoglobulin (Ig)-like domain of the nectin-like molecules Necl-1 (also known as cell adhesion molecule 3 (CADM3)), Necl-2 (CADM1), and Necl-3 (CADM2). At least five nectin-like molecules have been identified (Necl-1 - Necl-5). They all have an extracellular region containing three Ig-like domains, a transmembrane region, and a cytoplasmic region. The N-terminal Ig-like domain of the extracellular region belongs to the V-type subfamily of Ig domains, is essential to cell-cell adhesion, and plays a part in the interaction with the envelope glycoprotein D of various viruses. Necl-1, Necl-2, and Necl-3 have Ca(2+)-independent homophilic and heterophilic cell-cell adhesion activity. Necl-1 is specifically expressed in neural tissue, and is important to the form
Probab=90.81 E-value=0.21 Score=42.59 Aligned_cols=24 Identities=29% Similarity=0.607 Sum_probs=21.0
Q ss_pred CCeEEEcccCCCCCeeEEEEEcCC
Q psy9228 20 GNVLRITNARLQDSGKYKCEIQGH 43 (834)
Q Consensus 20 ~~~l~~~~~~~~d~g~y~c~~~~~ 43 (834)
...|+|.+|+.+|+|.|.|.+...
T Consensus 61 ~~~L~I~~v~~~DsG~Y~C~~~~~ 84 (95)
T cd05717 61 ELSISISNVSLSDEGRYTCSLYTM 84 (95)
T ss_pred eeEEEEccCCcccCEEEEEEEecC
Confidence 357999999999999999999644
No 207
>cd05881 Ig1_Necl-2 First (N-terminal) immunoglobulin (Ig)-like domain of nectin-like molecule 2 (also known as cell adhesion molecule 1 (CADM1)). Ig1_Necl-2: domain similar to the N-terminal immunoglobulin (Ig)-like domain of nectin-like molecule-2, Necl-2 (also known as cell adhesion molecule 1 (CADM1), SynCAM1, IGSF4A, Tslc1, sgIGSF, and RA175). Nectin-like molecules have similar domain structures to those of nectins. At least five nectin-like molecules have been identified (Necl-1 - Necl-5). They all have an extracellular region containing three Ig-like domains, a transmembrane region, and a cytoplasmic region. The N-terminal Ig-like domain of the extracellular region, belongs to the V-type subfamily of Ig domains, is essential to cell-cell adhesion, and plays a part in the interaction with the envelope glycoprotein D of various viruses. Necl-2 has Ca(2+)-independent homophilic and heterophilic cell-cell adhesion activity. Necl-2 is expressed in a wide variety of tissues, and is a
Probab=90.75 E-value=0.17 Score=43.09 Aligned_cols=22 Identities=27% Similarity=0.668 Sum_probs=19.8
Q ss_pred eEEEcccCCCCCeeEEEEEcCC
Q psy9228 22 VLRITNARLQDSGKYKCEIQGH 43 (834)
Q Consensus 22 ~l~~~~~~~~d~g~y~c~~~~~ 43 (834)
.|+|.+|+.+|+|.|.|.+...
T Consensus 63 tL~I~~vq~~D~G~Y~Cqv~t~ 84 (95)
T cd05881 63 RVSLSNVSLSDEGRYFCQLYTD 84 (95)
T ss_pred EEEECcCCcccCEEEEEEEEcc
Confidence 6999999999999999999643
No 208
>cd05775 Ig_SLAM-CD84_like_N N-terminal immunoglobulin (Ig)-like domain of the signaling lymphocyte activation molecule (SLAM) family, CD84_like. Ig_SLAM-CD84_like_N: The N-terminal immunoglobulin (Ig)-like domain of the signaling lymphocyte activation molecule (SLAM) family, CD84_like. The SLAM family is a group of immune-cell specific receptors that can regulate both adaptive and innate immune responses. Members of this group include proteins such as CD84, SLAM (CD150), Ly-9 (CD229), NTB-A (ly-108, SLAM6), 19A (CRACC), and SLAMF9. The genes coding for the SLAM family are nested on chromosome 1, in humans at 1q23, and in mice at 1H2. The SLAM family is a subset of the CD2 family, which also includes CD2 and CD58 located on chromosome 1 at 1p13 in humans. In mice, CD2 is located on chromosome 3, and there is no CD58 homolog. The SLAM family proteins are organized as an extracellular domain with either two or four Ig-like domains, a single transmembrane segment, and a cytoplasmic region
Probab=90.34 E-value=0.22 Score=42.81 Aligned_cols=25 Identities=32% Similarity=0.694 Sum_probs=22.1
Q ss_pred CCeEEEcccCCCCCeeEEEEEcCCC
Q psy9228 20 GNVLRITNARLQDSGKYKCEIQGHD 44 (834)
Q Consensus 20 ~~~l~~~~~~~~d~g~y~c~~~~~~ 44 (834)
+..|+|.+++.+|+|+|.|......
T Consensus 60 ~~sL~I~~~~~~DsG~Y~c~v~~~~ 84 (97)
T cd05775 60 DYSLQISNLKMEDAGSYRAEINTKN 84 (97)
T ss_pred ceeEEECCCchHHCEEEEEEEEcCC
Confidence 4789999999999999999997554
No 209
>cd05889 Ig1_DNAM-1_like First immunoglobulin (Ig) domain of DNAX accessory molecule 1 (DNAM-1, also known as CD226) and similar proteins. Ig1_DNAM-1_like: domain similar to the first immunoglobulin (Ig) domain of DNAX accessory molecule 1 (DNAM-1, also known as CD226). DNAM-1 is a transmembrane protein having two Ig-like domains. It is an adhesion molecule which plays a part in tumor-directed cytotoxicity and adhesion in natural killer (NK) cells and T lymphocytes. It has been shown to regulate the NK cell killing of several tumor types, including myeloma cells and ovarian carcinoma cells. DNAM-1 interacts specifically with poliovirus receptor (PVR; CD155) and nectin -2 (CD211), other members of the Ig superfamily. DNAM-1 is expressed in most peripheral T cells, NK cells, monocytes and a subset of B lymphocytes.
Probab=90.22 E-value=0.26 Score=42.08 Aligned_cols=23 Identities=30% Similarity=0.468 Sum_probs=21.0
Q ss_pred CCeEEEcccCCCCCeeEEEEEcC
Q psy9228 20 GNVLRITNARLQDSGKYKCEIQG 42 (834)
Q Consensus 20 ~~~l~~~~~~~~d~g~y~c~~~~ 42 (834)
+..|+|.||+.+|+|.|+|....
T Consensus 59 d~sI~i~nvt~~D~G~Y~C~~~t 81 (96)
T cd05889 59 DMSLSFNNATEEDVGLYCCSLVT 81 (96)
T ss_pred ccEEEEcCCCcccCEEEEEEEEe
Confidence 58999999999999999999853
No 210
>cd05877 Ig_LP_like Immunoglobulin (Ig)-like domain of human cartilage link protein (LP). Ig_LP_like: immunoglobulin (Ig)-like domain similar to that that found in human cartilage link protein (LP). In cartilage, chondroitin-keratan sulfate proteoglycan (CSPG), aggrecan, forms cartilage link protein stabilized aggregates with hyaluronan (HA). These aggregates contribute to the tissue's load bearing properties. Aggregates having other CSPGs substituting for aggrecan may contribute to the structural integrity of many different tissues. Members of the vertebrate HPLN (hyaluronan/HA and proteoglycan binding link) protein family are physically linked adjacent to CSPG genes.
Probab=89.94 E-value=0.21 Score=43.62 Aligned_cols=26 Identities=38% Similarity=0.673 Sum_probs=22.3
Q ss_pred CCeEEEcccCCCCCeeEEEEEcCCCC
Q psy9228 20 GNVLRITNARLQDSGKYKCEIQGHDS 45 (834)
Q Consensus 20 ~~~l~~~~~~~~d~g~y~c~~~~~~~ 45 (834)
+..|+|.+|+.+|+|+|.|...++.+
T Consensus 67 ~~sL~I~~v~~~DsG~Y~C~v~~~~~ 92 (106)
T cd05877 67 DASLVITDLRLEDYGRYRCEVIDGLE 92 (106)
T ss_pred cEEEEEccCChHHCEEEEEEEEeccc
Confidence 34899999999999999999976543
No 211
>cd05718 Ig1_PVR_like First immunoglobulin (Ig) domain of poliovirus receptor (PVR, also known as CD155) and similar proteins. Ig1_PVR_like: domain similar to the first immunoglobulin (Ig) domain of poliovirus receptor (PVR, also known as CD155). Poliovirus (PV) binds to its cellular receptor (PVR/CD155) to initiate infection. CD155 is a membrane-anchored, single-span glycoprotein; its extracellular region has three Ig-like domains. There are four different isotypes of CD155 (referred to as alpha, beta, gamma, and delta), that result from alternate splicing of the CD155 mRNA, and have identical extracellular domains. CD155-beta and - gamma, are secreted, CD155-alpha and delta are membrane-bound and function as PV receptors. The virus recognition site is contained in the amino-terminal domain, D1. Having the virus attachment site on the receptor distal from the plasma membrane, may be important for successful initiation of infection of cells by the virus. CD155 binds in the poliovirus "c
Probab=89.89 E-value=0.28 Score=42.07 Aligned_cols=22 Identities=41% Similarity=0.830 Sum_probs=20.3
Q ss_pred CCeEEEcccCCCCCeeEEEEEc
Q psy9228 20 GNVLRITNARLQDSGKYKCEIQ 41 (834)
Q Consensus 20 ~~~l~~~~~~~~d~g~y~c~~~ 41 (834)
+..|+|.+|+.+|+|.|.|...
T Consensus 61 ~~sL~I~~v~~~D~G~Y~C~v~ 82 (98)
T cd05718 61 DATISISNLRLEDEGNYICEFA 82 (98)
T ss_pred ceEEEEccCCcccCEEEEEEEE
Confidence 4699999999999999999985
No 212
>cd05715 Ig_P0-like Immunoglobulin (Ig)-like domain of Protein zero (P0) and similar proteins. Ig_P0ex-like: domain similar to the immunoglobulin (Ig) domain of Protein zero (P0). P0 accounts for over 50% of the total protein in peripheral nervous system (PNS) myelin. P0 is a single-pass transmembrane glycoprotein with a highly basic intracellular domain and an extracellular Ig domain. The extracellular domain of P0 (P0-ED) is similar to the Ig variable domain, carrying one acceptor sequence for N-linked glycosylation. P0 plays a role in membrane adhesion in the spiral wraps of the myelin sheath. The intracellular domain is thought to mediate membrane apposition of the cytoplasmic faces and may, through electrostatic interactions, interact directly with lipid headgroups. It is thought that homophilic interactions of the P0 extracellular domain mediate membrane juxtaposition in the extracellular space of PNS myelin. This group also contains the Ig domain of Sodium channel subunit beta-2
Probab=89.78 E-value=0.27 Score=43.80 Aligned_cols=25 Identities=24% Similarity=0.632 Sum_probs=22.1
Q ss_pred CeEEEcccCCCCCeeEEEEEcCCCC
Q psy9228 21 NVLRITNARLQDSGKYKCEIQGHDS 45 (834)
Q Consensus 21 ~~l~~~~~~~~d~g~y~c~~~~~~~ 45 (834)
..|+|.+|+.+|+|+|.|...+...
T Consensus 80 ~sL~I~~v~~~D~G~Y~C~v~~~~~ 104 (116)
T cd05715 80 ASIVIHNLQFTDNGTYTCDVKNPPD 104 (116)
T ss_pred eEEEEeeCCcccCEEEEEEEEeCCC
Confidence 5799999999999999999986643
No 213
>cd05878 Ig_Aggrecan_like Immunoglobulin (Ig)-like domain of the aggrecan-like chondroitin sulfate proteoglycan core protein (CSPG). Ig_Aggrecan_like: immunoglobulin (Ig)-like domain of the aggrecan-like chondroitin sulfate proteoglycan core protein (CSPG)s. Included in this group are the Ig domains of other CSPGs: versican, and neurocan. In CSPGs this Ig-like domain is followed by hyaluronan (HA)-binding tandem repeats, and a C-terminal region with epidermal growth factor-like, lectin-like, and complement regulatory protein-like domains. Separating these N- and C-terminal regions is a nonhomologous glycosaminoglycan attachment region. In cartilage, aggrecan forms cartilage link protein stabilized aggregates with hyaluronan (HA). These aggregates contribute to the tissue's load bearing properties. Aggrecan and versican have a wide distribution in connective tissue and extracellular matrices. Neurocan is localized almost exclusively in nervous tissue. Aggregates having other CSPGs substi
Probab=89.78 E-value=0.27 Score=43.38 Aligned_cols=25 Identities=36% Similarity=0.581 Sum_probs=21.8
Q ss_pred CCeEEEcccCCCCCeeEEEEEcCCC
Q psy9228 20 GNVLRITNARLQDSGKYKCEIQGHD 44 (834)
Q Consensus 20 ~~~l~~~~~~~~d~g~y~c~~~~~~ 44 (834)
+..|+|.+|+.+|+|+|.|......
T Consensus 71 ~asL~I~~l~~sD~G~Y~C~v~~~~ 95 (110)
T cd05878 71 DASLEISRLRSSDSGVYRCEVMHGI 95 (110)
T ss_pred cEEEEECCCChhhCEEEEEEEEeCC
Confidence 3579999999999999999997554
No 214
>KOG4222|consensus
Probab=89.58 E-value=0.42 Score=56.92 Aligned_cols=43 Identities=30% Similarity=0.662 Sum_probs=37.6
Q ss_pred CcEEEEecCC-CCCcccc---CCCeEEEcccCCCCCeeEEEEEcCCC
Q psy9228 2 AYIKWSRADG-LPLQRYA---EGNVLRITNARLQDSGKYKCEIQGHD 44 (834)
Q Consensus 2 ~~~~w~~~~~-~~~~~~~---~~~~l~~~~~~~~d~g~y~c~~~~~~ 44 (834)
.+++|+|.++ ||+.++. ++-.|+|.++++.|.|.|.|.+.+.-
T Consensus 255 P~Vrwkk~~~~~P~~ry~i~~d~qtl~i~r~tp~deg~y~C~a~n~v 301 (1281)
T KOG4222|consen 255 PTVRWKKDDPSLPRSRYSILDDTQTLRIDRVTPTDEGTYVCPAENLV 301 (1281)
T ss_pred CceeeeccCCCCCccceeeeeccccccccccCCCcccceeeeccccc
Confidence 4799999888 9998877 34799999999999999999998764
No 215
>cd00099 IgV Immunoglobulin variable domain (IgV). IgV: Immunoglobulin variable domain (IgV). Members of the IgV family are components of immunoglobulin (Ig) and T cell receptors. The basic structure of Ig molecules is a tetramer of two light chains and two heavy chains linked by disulfide bonds. In Ig, each chain is composed of one variable domain (IgV) and one or more constant domains (IgC); these names reflect the fact that the variability in sequences is higher in the variable domain than in the constant domain. Within the variable domain, there are regions of even more variability called the hypervariable or complementarity-determining regions (CDRs) which are responsible for antigen binding. A predominant feature of most Ig domains is the disulfide bridge connecting 2 beta-sheets with a tryptophan residue packed against the disulfide bond.
Probab=89.20 E-value=0.24 Score=43.05 Aligned_cols=24 Identities=25% Similarity=0.397 Sum_probs=21.4
Q ss_pred CeEEEcccCCCCCeeEEEEEcCCC
Q psy9228 21 NVLRITNARLQDSGKYKCEIQGHD 44 (834)
Q Consensus 21 ~~l~~~~~~~~d~g~y~c~~~~~~ 44 (834)
-.|+|.+|+.+|+|+|.|.+....
T Consensus 67 ~sL~I~~v~~~D~G~Y~C~v~~~~ 90 (105)
T cd00099 67 FTLTISSLQPEDSAVYYCAVSLSG 90 (105)
T ss_pred EEEEECCCCHHHCEEEEEEEecCC
Confidence 389999999999999999997654
No 216
>cd05902 Ig_Neurocan Immunoglobulin (Ig)-like domain of the chondroitin sulfate proteoglycan core protein (CSPG), neurocan. Ig_Neurocan: immunoglobulin (Ig)-like domain of the chondroitin sulfate proteoglycan core protein (CSPG), neurocan. In CSPGs, the Ig-like domain is followed by hyaluronan (HA)-binding tandem repeats, and a C-terminal region with epidermal growth factor-like, lectin-like, and complement regulatory protein-like domains. Separating these N- and C-terminal regions is a nonhomologous glycosaminoglycan attachment region. In cartilage, the CSPG aggrecan (not included in this group) forms cartilage link protein stabilized aggregates with HA. These aggregates contribute to the tissue's load bearing properties. Unlike aggrecan which is widely distributed in connective tissue and extracellular matrices, neurocan is localized almost exclusively in nervous tissue. Aggregates having other CSPGs substituting for aggrecan may contribute to the structural integrity of many differen
Probab=88.74 E-value=0.23 Score=43.62 Aligned_cols=25 Identities=32% Similarity=0.542 Sum_probs=22.0
Q ss_pred CCeEEEcccCCCCCeeEEEEEcCCC
Q psy9228 20 GNVLRITNARLQDSGKYKCEIQGHD 44 (834)
Q Consensus 20 ~~~l~~~~~~~~d~g~y~c~~~~~~ 44 (834)
+..|+|.||+.+|+|+|.|....+.
T Consensus 71 ~asL~i~nv~~~D~G~Y~C~v~~g~ 95 (110)
T cd05902 71 NASLVLSRLRYSDSGTYRCEVVLGI 95 (110)
T ss_pred eEEEEECCCCHHHCEEEEEEEEECC
Confidence 4589999999999999999997654
No 217
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=88.41 E-value=0.57 Score=34.45 Aligned_cols=16 Identities=38% Similarity=0.935 Sum_probs=11.9
Q ss_pred ceEeeCCCCCCCCCCC
Q psy9228 160 GYTCICPPGFSGDRCS 175 (834)
Q Consensus 160 ~~~C~C~~g~~G~~Ce 175 (834)
+-.|.|+++|+|.+|+
T Consensus 18 ~G~C~C~~~~~G~~C~ 33 (50)
T cd00055 18 TGQCECKPNTTGRRCD 33 (50)
T ss_pred CCEEeCCCcCCCCCCC
Confidence 3567788888888885
No 218
>cd05711 Ig_FcalphaRI Immunoglobulin (IG)-like domain of of FcalphaRI. IG_FcalphaRI : immunoglobulin (IG)-like domain of of FcalphaRI. FcalphaRI (CD89) is an IgA-specific receptor that is expressed on monocytes, eosinophils, neutrophils and macrophages. FcalphaRI mediates IgA-induced immune effector responses such as phagocytosis, antibody-dependent cell-mediated cytotoxicity and respiratory burst. Both monomeric and dimeric IgA can bind to FcalphaRI, and monomeric or dimeric IgA immune complexes can activate phagocytosis and other immune responses through the clustering of FcalphaRI. The Fc RI ectodomain is comprised of two Ig-like domains oriented at about 90 degree to each another.
Probab=88.32 E-value=0.61 Score=39.70 Aligned_cols=39 Identities=10% Similarity=0.145 Sum_probs=29.0
Q ss_pred cEEEEecCC-CCCcc--c----cCCCeEEEcccCCCCCeeEEEEEc
Q psy9228 3 YIKWSRADG-LPLQR--Y----AEGNVLRITNARLQDSGKYKCEIQ 41 (834)
Q Consensus 3 ~~~w~~~~~-~~~~~--~----~~~~~l~~~~~~~~d~g~y~c~~~ 41 (834)
.+.|.|++. .+... . .....+.|.+++.+|+|+|.|...
T Consensus 30 ~f~l~k~g~~~~~~~~~~~~~~~~~a~f~I~~~~~~~~G~Y~C~~~ 75 (94)
T cd05711 30 RFILYKEGRSKPVLHLYEKHHGGFQASFPLGPVTPAHAGTYRCYGS 75 (94)
T ss_pred EEEEEECCCCCCceecccccCCeEEEEEEecCCCcccCEEEEEEEE
Confidence 588999776 44322 1 123478999999999999999985
No 219
>cd05846 Ig1_MRC-OX-2_like First immunoglobulin (Ig) domain of rat MRC OX-2 antigen (also known as CD200) and similar proteins. Ig1_ MRC-OX-2_like: domain similar to the first immunoglobulin (Ig) domain of rat MRC OX-2 antigen (also known as CD200). MRC OX-2 is a membrane glycoprotein expressed in a variety of lymphoid and non-lymphoid cells in rats. It has a similar broad distribution pattern in humans. MRC OX-2 may regulate myeloid cell activity. The protein has an extracellular portion containing two Ig-like domains, a transmembrane portion, and a cytoplasmic portion.
Probab=87.77 E-value=0.37 Score=41.35 Aligned_cols=23 Identities=35% Similarity=0.602 Sum_probs=20.5
Q ss_pred CCeEEEcccCCCCCeeEEEEEcC
Q psy9228 20 GNVLRITNARLQDSGKYKCEIQG 42 (834)
Q Consensus 20 ~~~l~~~~~~~~d~g~y~c~~~~ 42 (834)
+..|+|.||+.+|+|.|.|....
T Consensus 60 ~~sL~I~nv~~~D~G~Y~C~~~~ 82 (97)
T cd05846 60 STSITIWNVTLEDEGCYKCIFNT 82 (97)
T ss_pred ccEEEEeCCceeeeEEEEEEEEe
Confidence 56799999999999999999863
No 220
>cd04983 IgV_TCR_alpha_like Immunoglobulin (Ig) variable (V) domain of T-cell receptor (TCR) alpha chain and similar proteins. IgV_TCR_alpha: immunoglobulin (Ig) variable domain of the alpha chain of alpha/beta T-cell antigen receptors (TCRs). TCRs mediate antigen recognition by T lymphocytes, and are composed of alpha and beta, or gamma and delta, polypeptide chains with variable (V) and constant (C) regions. This group represents the variable domain of the alpha chain of TCRs and also includes the variable domain of delta chains of TCRs. Alpha/beta TCRs recognize antigen as peptide fragments presented by major histocompatibility complex (MHC) molecules. The variable domain of TCRs is responsible for antigen recognition, and is located at the N-terminus of the receptor. Gamma/delta TCRs recognize intact protein antigens; they recognize proteins antigens directly and without antigen processing, and MHC independently of the bound peptide.
Probab=87.64 E-value=0.4 Score=42.06 Aligned_cols=24 Identities=33% Similarity=0.582 Sum_probs=21.2
Q ss_pred CCeEEEcccCCCCCeeEEEEEcCC
Q psy9228 20 GNVLRITNARLQDSGKYKCEIQGH 43 (834)
Q Consensus 20 ~~~l~~~~~~~~d~g~y~c~~~~~ 43 (834)
.-.|+|.+++.+|+|+|.|.+.+.
T Consensus 69 ~~~L~I~~~~~~DsG~Y~C~~~~~ 92 (109)
T cd04983 69 SSSLHISAAQLSDSAVYFCALSES 92 (109)
T ss_pred EEEEEECCCCHHHCEEEEEEEecC
Confidence 348999999999999999999754
No 221
>smart00406 IGv Immunoglobulin V-Type.
Probab=87.48 E-value=0.29 Score=39.98 Aligned_cols=19 Identities=42% Similarity=0.824 Sum_probs=17.5
Q ss_pred CeEEEcccCCCCCeeEEEE
Q psy9228 21 NVLRITNARLQDSGKYKCE 39 (834)
Q Consensus 21 ~~l~~~~~~~~d~g~y~c~ 39 (834)
-.|+|.+++.+|+|+|.|.
T Consensus 62 ~~L~i~~~~~~D~G~Y~C~ 80 (81)
T smart00406 62 VSLTISNLRVEDTGTYYCA 80 (81)
T ss_pred EEEEEcCCCHHHCEEEEEc
Confidence 4699999999999999996
No 222
>PF00053 Laminin_EGF: Laminin EGF-like (Domains III and V); InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and three short globular arms. The long arm consist of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below. +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=87.32 E-value=0.33 Score=35.54 Aligned_cols=17 Identities=41% Similarity=1.071 Sum_probs=11.5
Q ss_pred CceEeeCCCCCCCCCCC
Q psy9228 159 IGYTCICPPGFSGDRCS 175 (834)
Q Consensus 159 ~~~~C~C~~g~~G~~Ce 175 (834)
....|.|+++|+|++|+
T Consensus 16 ~~G~C~C~~~~~G~~C~ 32 (49)
T PF00053_consen 16 STGQCVCKPGTTGPRCD 32 (49)
T ss_dssp TCEEESBSTTEESTTS-
T ss_pred CCCEEeccccccCCcCc
Confidence 34677777777777775
No 223
>cd05771 IgC_Tapasin_R Tapasin-R immunoglobulin-like domain. IgC_Tapasin_R: Immunoglobulin-like domain on Tapasin-R. Tapasin is a V-C1 (variable-constant) immunoglobulin superfamily molecule present in the endoplasmic reticulum (ER), where it links MHC class I molecules to the transporter associated with antigen processing (TAP). Tapasin-R is a tapasin-related protein that contains similar structural motifs to Tapasin, with some marked differences, especially in the V domain, transmembrane and cytoplasmic regions. The majority of Tapasin-R is located within the ER; however, there may be some expression of Tapasin-R at the cell surface. Tapasin-R lacks an obvious ER retention signal.
Probab=87.21 E-value=0.42 Score=44.18 Aligned_cols=24 Identities=21% Similarity=0.513 Sum_probs=21.1
Q ss_pred eEEEcccCCCCCeeEEEEEcCCCC
Q psy9228 22 VLRITNARLQDSGKYKCEIQGHDS 45 (834)
Q Consensus 22 ~l~~~~~~~~d~g~y~c~~~~~~~ 45 (834)
.|+|++|+.+|+|.|.|.+.+..+
T Consensus 2 sL~i~~v~~~D~G~Y~C~~~~~~~ 25 (139)
T cd05771 2 SLTLPGLTVHDEGTYICSVSTPPH 25 (139)
T ss_pred eEEECCCCHHHCEEEEEEEEccCc
Confidence 699999999999999999976543
No 224
>cd04984 IgV_L_lambda Immunoglobulin (Ig) lambda light chain variable (V) domain. IgV_L_lambda: Immunoglobulin (Ig) light chain, lambda type, variable (V) domain. The basic structure of Ig molecules is a tetramer of two light chains and two heavy chains linked by disulfide bonds. In Ig, each chain is composed of one variable domain (IgV) and one or more constant domains (IgC); these names reflect the fact that the variability in sequences is higher in the variable domain than in the constant domain. There are five types of heavy chains (alpha, gamma, delta, epsilon, and mu), which determine the type of immunoglobulin: IgA, IgG, IgD, IgE, and IgM, respectively. In higher vertebrates, there are two types of light chain, designated kappa and lambda, which seem to be functionally identical, and can associate with any of the heavy chains.
Probab=86.19 E-value=0.61 Score=40.00 Aligned_cols=22 Identities=27% Similarity=0.576 Sum_probs=19.7
Q ss_pred CeEEEcccCCCCCeeEEEEEcC
Q psy9228 21 NVLRITNARLQDSGKYKCEIQG 42 (834)
Q Consensus 21 ~~l~~~~~~~~d~g~y~c~~~~ 42 (834)
..|+|.+++.+|+|+|.|...+
T Consensus 63 ~~L~I~~~~~~Dsg~Y~C~~~~ 84 (98)
T cd04984 63 ASLTISGAQTEDEADYYCQVWD 84 (98)
T ss_pred EEEEECCCChhhCEEEEEEEcc
Confidence 3699999999999999999864
No 225
>KOG3546|consensus
Probab=85.81 E-value=2.1 Score=47.60 Aligned_cols=128 Identities=13% Similarity=0.162 Sum_probs=81.3
Q ss_pred CcceEEEEEEeCC-CCeeEEecCCCCCCCCCCcceEEEEE---ECCEEEEEEEc---CCcEE-EEEeCCceecCCCcEEE
Q psy9228 674 NEETIAFDFVTDD-KNALLLWNGQPSYKNGIGREFIAVAV---VNGYLEYSYDL---GDGVV-TIKFSKKPVNDGIKHSV 745 (834)
Q Consensus 674 ~~~~i~~~frT~~-~~GlLl~~~~~~~~~~~~~~~~~l~l---~~G~l~~~~~~---g~~~~-~l~~s~~~~nDg~wH~V 745 (834)
..+.|.|.+|..+ .-|+||.+.+.. +..-|+.|.| +||+-.+++.. |++.. +.......+-.++|.++
T Consensus 87 rdf~~~~~i~p~s~~~gvlfaitd~~----q~~i~lg~~lsgv~dghq~i~l~ytepg~~~s~~aa~f~~p~~~~~w~~~ 162 (1167)
T KOG3546|consen 87 RDFSLLFHIRPATEGPGVLFAITDSA----QAMVLLGVKLSGVQDGHQDISLLYTEPGAGQTHTAASFRLPAFVGQWTHL 162 (1167)
T ss_pred ccceEEEEeeccCCCCceEEEechhh----hhhheeeeeeeccccCcceeEEEeccCCCCccchhheeccchhhchhhhe
Confidence 5678888888866 456777665432 2345666655 37765554332 33322 11112345678999999
Q ss_pred EEEEECCEEEEEEcCeeeecccCCCCccc--eecCCceEEcCcCCCCCCCCCccCCCceEEEEEEEECC
Q psy9228 746 NVTRINKFGSLEVDSVIVGKGESPGSQDV--INTRGNIYLGGTPNMDLMTGGRYVHPMSGLMMNIHIQN 812 (834)
Q Consensus 746 ~i~r~~~~~~l~VD~~~~~~~~~~~~~~~--l~~~~~lyiGG~p~~~~~~~~~~~~~F~GCi~~v~in~ 812 (834)
.+...+..+.|+||=++.+..-..-+++. +....-||+|-+-.. ..+.|.|-|..|++.-
T Consensus 163 a~~v~g~~v~l~v~cee~~r~p~~rss~~l~~e~~ag~f~~~ag~~-------~~~~f~g~~~~l~v~~ 224 (1167)
T KOG3546|consen 163 ALSVAGGFVALYVDCEEFQRMPLARSSRGLELEPGAGLFVAQAGGA-------DPDKFQGVIAELKVRR 224 (1167)
T ss_pred eeeecCceEEEEechHHhcccchhccccceeecCCcceEEeccCCC-------ChHhhhhhhhheeecC
Confidence 99999999999999876554422222333 344456998865433 3467999999999854
No 226
>KOG4221|consensus
Probab=85.63 E-value=1 Score=54.21 Aligned_cols=43 Identities=35% Similarity=0.677 Sum_probs=33.5
Q ss_pred CcEEEEecCC-CCCcccc----CCCeEEEcccCCCCCeeEEEEEcCCC
Q psy9228 2 AYIKWSRADG-LPLQRYA----EGNVLRITNARLQDSGKYKCEIQGHD 44 (834)
Q Consensus 2 ~~~~w~~~~~-~~~~~~~----~~~~l~~~~~~~~d~g~y~c~~~~~~ 44 (834)
++|+|.|.+. |-.+... ..+.|.|.+++..|+|.|.|.++|.+
T Consensus 267 p~v~Wlrng~~i~~ds~rivl~g~~slliS~a~~~dSg~YtC~Atn~~ 314 (1381)
T KOG4221|consen 267 PSVKWLRNGEKISVDSYRIVLVGGSSLLISNATDDDSGKYTCRATNTN 314 (1381)
T ss_pred CceEEeeCCeeeeecceEEEEecccceEEeccccccCceEEEEecCCC
Confidence 5899999433 5444211 46789999999999999999999854
No 227
>cd05899 IgV_TCR_beta Immunoglobulin (Ig) variable (V) domain of T-cell receptor (TCR) bet a chain. IgV_TCR_beta: immunoglobulin (Ig) variable domain of the beta chain of alpha/beta T-cell antigen receptors (TCRs). TCRs mediate antigen recognition by T lymphocytes, and are composed of alpha and beta, or gamma and delta, polypeptide chains with variable (V) and constant (C) regions. This group includes the variable domain of the alpha chain of alpha/beta TCRs. Alpha/beta TCRs recognize antigen as peptide fragments presented by major histocompatibility complex (MHC) molecules. The variable domain of TCRs is responsible for antigen recognition, and is located at the N-terminus of the receptor. Gamma/delta TCRs recognize intact protein antigens; they recognize proteins antigens directly and without antigen processing, and MHC independently of the bound peptide.
Probab=85.57 E-value=0.61 Score=41.03 Aligned_cols=22 Identities=32% Similarity=0.422 Sum_probs=20.2
Q ss_pred CeEEEcccCCCCCeeEEEEEcC
Q psy9228 21 NVLRITNARLQDSGKYKCEIQG 42 (834)
Q Consensus 21 ~~l~~~~~~~~d~g~y~c~~~~ 42 (834)
..|.|.+++.+|+|+|.|.+..
T Consensus 72 ~~L~I~~~~~~Dsg~Y~Ca~~~ 93 (110)
T cd05899 72 SSLTIKSAEPEDSAVYLCASSL 93 (110)
T ss_pred cEEEECCCChhhCEEEEEEeeC
Confidence 5799999999999999999964
No 228
>cd07692 Ig_CD3_epsilon Immunoglobulin (Ig)-like domain of CD3 epsilon chain. Ig_CD3_epsilon; immunoglobulin (Ig)-like domain of CD3 epsilon chain. CD3 is a T cell surface receptor that is associated with alpha/beta T cell receptors (TCRs). The CD3 complex consists of one gamma, one delta, two epsilon, and two zeta chains. The CD3 subunits form heterodimers as gamma/epsilon, delta/epsilon, and zeta/zeta. The gamma, delta, and epsilon chains each contain an extracellular Ig domain, whereas the extracellular domains of the zeta chains are very small and have unknown structure. The CD3 domain participates in intracellular signalling once the TCR has bound an MHC/antigen complex.
Probab=85.44 E-value=1.6 Score=33.84 Aligned_cols=37 Identities=24% Similarity=0.350 Sum_probs=27.9
Q ss_pred CcEEEEecCC-CCCccccCCCeEEEcccCCCCCeeEEEEEcCC
Q psy9228 2 AYIKWSRADG-LPLQRYAEGNVLRITNARLQDSGKYKCEIQGH 43 (834)
Q Consensus 2 ~~~~w~~~~~-~~~~~~~~~~~l~~~~~~~~d~g~y~c~~~~~ 43 (834)
..|.|.|.+- ++. .+..+.+.+.. +++|.|.|.+.+.
T Consensus 19 ~~i~W~~n~~~~~g----~~~~~~~~~~~-e~~G~Y~C~~~~~ 56 (65)
T cd07692 19 DDIKWKKNDQVSQG----SDEKYLSLNHF-SSSGYYHCTADNA 56 (65)
T ss_pred CCceEEeCCccCCC----CCceEEeeccC-CCCceEEEEeCCC
Confidence 3589999665 433 45677888888 9999999999654
No 229
>PF01414 DSL: Delta serrate ligand; InterPro: IPR001774 Ligands of the Delta/Serrate/lag-2 (DSL) family and their receptors, members of the lin-12/Notch family, mediate cell-cell interactions that specify cell fate in invertebrates and vertebrates. In Caenorhabditis elegans, two DSL genes, lag-2 and apx-1, influence different cell fate decisions during development []. Molecular interaction between Notch and Serrate, another EGF-homologous transmembrane protein containing a region of striking similarity to Delta, has been shown and the same two EGF repeats of Notch may also constitute a Serrate binding domain [, ].; GO: 0007154 cell communication, 0016020 membrane; PDB: 2VJ2_A.
Probab=85.13 E-value=0.21 Score=38.62 Aligned_cols=42 Identities=31% Similarity=0.788 Sum_probs=21.4
Q ss_pred ceeeecCCCCCCCCCccCccCCCCCCccCCCceeeecCCCcEE------cCCCCCCCCCc
Q psy9228 591 GYNCSCLTGYSGDHCEKENNMCMKGDVCKNGGMCKVTPDSYEC------LCSLGYAPPNC 644 (834)
Q Consensus 591 ~~~C~C~~G~~G~~Ce~~~~~C~~~~pC~ngg~C~~~~~~~~C------~C~~g~~G~~C 644 (834)
.+.-.|.+.|.|..|.. .|.+. .+..+.|+| .|.+||+|+.|
T Consensus 16 ~~rv~C~~nyyG~~C~~---~C~~~---------~d~~ghy~Cd~~G~~~C~~Gw~G~~C 63 (63)
T PF01414_consen 16 RIRVVCDENYYGPNCSK---FCKPR---------DDSFGHYTCDSNGNKVCLPGWTGPNC 63 (63)
T ss_dssp -------TTEETTTT-E---E---E---------EETTEEEEE-SS--EEE-TTEESTTS
T ss_pred EEEEECCCCCCCccccC---CcCCC---------cCCcCCcccCCCCCCCCCCCCcCCCC
Confidence 67888999999999984 34322 022355666 58899999987
No 230
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=85.02 E-value=0.85 Score=46.07 Aligned_cols=37 Identities=27% Similarity=0.598 Sum_probs=28.2
Q ss_pred CCCCCCCCCCCccCC--CCCCCeeecCCCCeEEeCCCCccc
Q psy9228 170 SGDRCSVLGEPCYPG--ACGDGSCQDVDGAMKCLCPIGTAG 208 (834)
Q Consensus 170 ~G~~Ce~~~~~C~~~--~C~~g~C~~~~~~~~C~C~~g~~G 208 (834)
.+..|+ +.++|... +|. ..|.+..++|.|.|+.||+.
T Consensus 180 ~~~~C~-~~~~C~~~~~~c~-~~C~~~~g~~~c~c~~g~~~ 218 (224)
T cd01475 180 QGKICV-VPDLCATLSHVCQ-QVCISTPGSYLCACTEGYAL 218 (224)
T ss_pred ccccCc-CchhhcCCCCCcc-ceEEcCCCCEEeECCCCccC
Confidence 455675 56788653 454 47999999999999999974
No 231
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domai.
Probab=84.78 E-value=0.91 Score=32.69 Aligned_cols=16 Identities=38% Similarity=0.982 Sum_probs=12.4
Q ss_pred ceEeeCCCCCCCCCCC
Q psy9228 160 GYTCICPPGFSGDRCS 175 (834)
Q Consensus 160 ~~~C~C~~g~~G~~Ce 175 (834)
+-.|.|+++++|++|+
T Consensus 17 ~G~C~C~~~~~G~~C~ 32 (46)
T smart00180 17 TGQCECKPNVTGRRCD 32 (46)
T ss_pred CCEEECCCCCCCCCCC
Confidence 3478888888888886
No 232
>cd04980 IgV_L_kappa Immunoglobulin (Ig) light chain, kappa type, variable (V) domain. IgV_L_kappa: Immunoglobulin (Ig) light chain, kappa type, variable (V) domain. The basic structure of Ig molecules is a tetramer of two light chains and two heavy chains linked by disulfide bonds. In Ig, each chain is composed of one variable domain (IgV) and one or more constant domains (IgC); these names reflect the fact that the variability in sequences is higher in the variable domain than in the constant domain. There are five types of heavy chains (alpha, gamma, delta, epsilon, and mu), which determine the type of immunoglobulin: IgA, IgG, IgD, IgE, and IgM, respectively. In higher vertebrates, there are two types of light chain, designated kappa and lambda, which seem to be functionally identical, and can associate with any of the heavy chains.
Probab=84.74 E-value=0.61 Score=40.70 Aligned_cols=23 Identities=26% Similarity=0.473 Sum_probs=20.2
Q ss_pred CeEEEcccCCCCCeeEEEEEcCC
Q psy9228 21 NVLRITNARLQDSGKYKCEIQGH 43 (834)
Q Consensus 21 ~~l~~~~~~~~d~g~y~c~~~~~ 43 (834)
..|+|.+++.+|+|+|.|.+.+.
T Consensus 71 ~~L~I~~~~~~Dsg~Y~Ca~~~~ 93 (106)
T cd04980 71 FTLTISRVEPEDAAVYYCQQYGT 93 (106)
T ss_pred EEEEECCCChHHCEEEEEEEeCC
Confidence 36999999999999999999644
No 233
>PF06247 Plasmod_Pvs28: Plasmodium ookinete surface protein Pvs28; InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=84.41 E-value=0.48 Score=44.58 Aligned_cols=61 Identities=28% Similarity=0.718 Sum_probs=43.3
Q ss_pred CCCCCCeecccCCCCCceeeecCCCCC---CCCCccCccCCCC----CCccCCCceeeecC-----CCcEEcCCCCCC
Q psy9228 575 PCQNYGICYPTDTSERGYNCSCLTGYS---GDHCEKENNMCMK----GDVCKNGGMCKVTP-----DSYECLCSLGYA 640 (834)
Q Consensus 575 pC~ngg~C~~~~~~~~~~~C~C~~G~~---G~~Ce~~~~~C~~----~~pC~ngg~C~~~~-----~~~~C~C~~g~~ 640 (834)
.|.| |..++..+ .|.|.|.+||. -.+||... +|.. +.||.+-++|+... ..|.|.|.+||.
T Consensus 7 ~CKN-G~LiQMSN---HfEC~Cnegfvl~~EntCE~kv-~C~~~e~~~K~Cgdya~C~~~~~~~~~~~~~C~C~~gY~ 79 (197)
T PF06247_consen 7 ICKN-GYLIQMSN---HFECKCNEGFVLKNENTCEEKV-ECDKLENVNKPCGDYAKCINQANKGEERAYKCDCINGYI 79 (197)
T ss_dssp --BT-EEEEEESS---EEEEEESTTEEEEETTEEEE-----SG-GGTTSEEETTEEEEE-SSTTSSTSEEEEE-TTEE
T ss_pred cccC-CEEEEccC---ceEEEcCCCcEEccccccccce-ecCcccccCccccchhhhhcCCCcccceeEEEecccCce
Confidence 4554 57777776 99999999996 56788763 6654 24999999999875 589999999986
No 234
>PF06439 DUF1080: Domain of Unknown Function (DUF1080); InterPro: IPR010496 This is a family of proteins of unknown function.; PDB: 3IMM_B 3NMB_A 3S5Q_A 3OSD_A 3HBK_A 3H3L_A 3U1X_A.
Probab=84.00 E-value=1.9 Score=41.88 Aligned_cols=91 Identities=18% Similarity=0.251 Sum_probs=51.6
Q ss_pred eeEEEEEEEee-C-CCCeeEEEeccCCccccCCCCCeEEEEEECcEEE--EEEecccEE--------EEeeeeecCCCeE
Q psy9228 405 LHFSIELSFKP-T-DYNGLIMYTGDSNMKSYKGKGDFVSFGLEDGYPV--FRFDVGLVV--------LRSKVTLVPHEWV 472 (834)
Q Consensus 405 ~~~~i~~~frt-~-~~~GlLl~~~~~~~~~~~~~~d~~~l~L~~G~l~--~~~~~G~~~--------i~s~~~~~dg~wH 472 (834)
.++.+++.||. . ...|++|..... .........+.+.|.+..-. .....|... .........++||
T Consensus 53 ~df~l~~d~k~~~~~~sGi~~r~~~~--~~~~~~~~gy~~~i~~~~~~~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~W~ 130 (185)
T PF06439_consen 53 SDFELEVDFKITPGGNSGIFFRAQSP--GDGQDWNNGYEFQIDNSGGGTGLPNSTGSLYDEPPWQLEPSVNVAIPPGEWN 130 (185)
T ss_dssp SSEEEEEEEEE-TT-EEEEEEEESSE--CCSSGGGTSEEEEEE-TTTCSTTTTSTTSBTTTB-TCB-SSS--S--TTSEE
T ss_pred ccEEEEEEEEECCCCCeEEEEEeccc--cCCCCcceEEEEEEECCCCccCCCCccceEEEeccccccccccccCCCCceE
Confidence 45777777773 3 345666655400 01123455566666665443 011112221 2234567889999
Q ss_pred EEEEEEECCeEEEEECCeeeeeeec
Q psy9228 473 VVTIIKDFKEGKLSVGGEPLIVGST 497 (834)
Q Consensus 473 ~V~v~~~~~~~~l~VD~~~~~~~~~ 497 (834)
+++|...+..+++.||+........
T Consensus 131 ~~~I~~~g~~i~v~vnG~~v~~~~d 155 (185)
T PF06439_consen 131 TVRIVVKGNRITVWVNGKPVADFTD 155 (185)
T ss_dssp EEEEEEETTEEEEEETTEEEEEEET
T ss_pred EEEEEEECCEEEEEECCEEEEEEEc
Confidence 9999999999999999988766543
No 235
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=83.85 E-value=0.71 Score=31.18 Aligned_cols=18 Identities=50% Similarity=1.178 Sum_probs=15.2
Q ss_pred eeeecCCCcEEcCCCCCC
Q psy9228 623 MCKVTPDSYECLCSLGYA 640 (834)
Q Consensus 623 ~C~~~~~~~~C~C~~g~~ 640 (834)
.|++.+++|+|.|++||.
T Consensus 11 ~C~~~~g~~~C~C~~Gy~ 28 (36)
T PF14670_consen 11 ICVNTPGSYRCSCPPGYK 28 (36)
T ss_dssp EEEEETTSEEEE-STTEE
T ss_pred CCccCCCceEeECCCCCE
Confidence 688999999999999874
No 236
>PF01414 DSL: Delta serrate ligand; InterPro: IPR001774 Ligands of the Delta/Serrate/lag-2 (DSL) family and their receptors, members of the lin-12/Notch family, mediate cell-cell interactions that specify cell fate in invertebrates and vertebrates. In Caenorhabditis elegans, two DSL genes, lag-2 and apx-1, influence different cell fate decisions during development []. Molecular interaction between Notch and Serrate, another EGF-homologous transmembrane protein containing a region of striking similarity to Delta, has been shown and the same two EGF repeats of Notch may also constitute a Serrate binding domain [, ].; GO: 0007154 cell communication, 0016020 membrane; PDB: 2VJ2_A.
Probab=83.57 E-value=0.4 Score=37.13 Aligned_cols=47 Identities=26% Similarity=0.602 Sum_probs=20.5
Q ss_pred CceEeeCCCCCCCCCCCCCCCCccCCCCCCC--eeecCCCCeEEeCCCCcccCcc
Q psy9228 159 IGYTCICPPGFSGDRCSVLGEPCYPGACGDG--SCQDVDGAMKCLCPIGTAGKRC 211 (834)
Q Consensus 159 ~~~~C~C~~g~~G~~Ce~~~~~C~~~~C~~g--~C~~~~~~~~C~C~~g~~G~~C 211 (834)
-.+.-.|.+.|.|..|.. .|.+.-=..| +|.. . -.-.|.+||+|+.|
T Consensus 15 ~~~rv~C~~nyyG~~C~~---~C~~~~d~~ghy~Cd~-~--G~~~C~~Gw~G~~C 63 (63)
T PF01414_consen 15 YRIRVVCDENYYGPNCSK---FCKPRDDSFGHYTCDS-N--GNKVCLPGWTGPNC 63 (63)
T ss_dssp --------TTEETTTT-E---E---EEETTEEEEE-S-S----EEE-TTEESTTS
T ss_pred EEEEEECCCCCCCccccC---CcCCCcCCcCCcccCC-C--CCCCCCCCCcCCCC
Confidence 357788999999999976 4444311111 4542 1 12358999999987
No 237
>PF07686 V-set: Immunoglobulin V-set domain; InterPro: IPR013106 The basic structure of immunoglobulin (Ig) molecules is a tetramer of two light chains and two heavy chains linked by disulphide bonds. There are two types of light chains: kappa and lambda, each composed of a constant domain (CL) and a variable domain (VL). There are five types of heavy chains: alpha, delta, epsilon, gamma and mu, all consisting of a variable domain (VH) and three (in alpha, delta and gamma) or four (in epsilon and mu) constant domains (CH1 to CH4). Ig molecules are highly modular proteins, in which the variable and constant domains have clear, conserved sequence patterns. The domains in Ig and Ig-like molecules are grouped into four types: V-set (variable; IPR013106 from INTERPRO), C1-set (constant-1; IPR003597 from INTERPRO), C2-set (constant-2; IPR008424 from INTERPRO) and I-set (intermediate; IPR013098 from INTERPRO) []. Structural studies have shown that these domains share a common core Greek-key beta-sandwich structure, with the types differing in the number of strands in the beta-sheets as well as in their sequence patterns [, ]. Immunoglobulin-like domains that are related in both sequence and structure can be found in several diverse protein families. Ig-like domains are involved in a variety of functions, including cell-cell recognition, cell-surface receptors, muscle structure and the immune system []. This entry represents the V-set domains, which are Ig-like domains resembling the antibody variable domain. V-set domains are found in diverse protein families, including immunoglobulin light and heavy chains; in several T-cell receptors such as CD2 (Cluster of Differentiation 2), CD4, CD80, and CD86; in myelin membrane adhesion molecules; in junction adhesion molecules (JAM); in tyrosine-protein kinase receptors; and in the programmed cell death protein 1 (PD1).; PDB: 1PY9_A 2NXY_D 1U9K_B 3RNK_A 3BP6_A 3BIK_B 3SBW_A 1NPU_A 3BP5_A 3RNQ_A ....
Probab=83.48 E-value=0.7 Score=40.48 Aligned_cols=23 Identities=35% Similarity=0.567 Sum_probs=20.6
Q ss_pred CeEEEcccCCCCCeeEEEEEcCC
Q psy9228 21 NVLRITNARLQDSGKYKCEIQGH 43 (834)
Q Consensus 21 ~~l~~~~~~~~d~g~y~c~~~~~ 43 (834)
..|+|.||+.+|+|.|.|.....
T Consensus 79 ~sL~i~~l~~~DsG~Y~C~v~~~ 101 (114)
T PF07686_consen 79 FSLTIKNLQPSDSGTYFCQVSTS 101 (114)
T ss_dssp EEEEESSESGGGEEEEEEEEEES
T ss_pred EEEEECCCCcCcCEEEEEEEEEC
Confidence 36999999999999999999754
No 238
>cd05712 Ig_Siglec_N Immunoglobulin (Ig) domain at the N terminus of Siglec (sialic acid-binding Ig-like lectins). Ig_Siglec_N: immunoglobulin (Ig) domain at the N terminus of Siglec (sialic acid-binding Ig-like lectins). Siglec refers to a structurally related protein family that specifically recognizes sialic acid in oligosaccharide chains of glycoproteins and glycolipids. Siglecs are type I transmembrane proteins, organized as an extracellular module composed of Ig-like domains (an N-terminal variable set of Ig-like carbohydrate recognition domains, and 1 to 16 constant Ig-like domains), followed by transmembrane and short cytoplasmic domains. Human siglecs are classified into two subgroups, one subgroup is comprised of sialoadhesin (Siglec-1), CD22 (Siglec-2), and MAG, the other subgroup is comprised of CD33-related Siglecs which include CD33 (Siglec-3) and human Siglecs 5-11.
Probab=83.27 E-value=0.98 Score=40.41 Aligned_cols=22 Identities=36% Similarity=0.474 Sum_probs=19.8
Q ss_pred CeEEEcccCCCCCeeEEEEEcC
Q psy9228 21 NVLRITNARLQDSGKYKCEIQG 42 (834)
Q Consensus 21 ~~l~~~~~~~~d~g~y~c~~~~ 42 (834)
..|+|.+++.+|+|+|.|....
T Consensus 80 ~sL~I~~l~~~Dsg~Y~C~~~~ 101 (119)
T cd05712 80 CSLLISDAQPEDSGKYFFRVEL 101 (119)
T ss_pred EEEEEccCChhhCeEEEEEEEc
Confidence 4799999999999999999854
No 239
>cd05716 Ig_pIgR Immunoglobulin (Ig)-like domain in the polymeric Ig receptor (pIgR). Ig_pIgR: Immunoglobulin (Ig)-like domain in the polymeric Ig receptor (pIgR). pIgR delivers dimeric IgA and pentameric IgM to mucosal secretions. Polymeric immunoglobulin (pIgs) are the first defense against pathogens and toxins. IgA and IgM can form polymers via an 18-residue extension at their c-termini referred to as the tailpiece. pIgR transports pIgs across mucosal epithelia into mucosal secretions. Human pIgR is a glycosylated type I transmembrane protein, comprised of a 620 residue extracellular region, a 23 residue transmembrane region, and a 103 residue cytoplasmic tail. The extracellular region contains five domains that share sequence similarity with Ig variable (v) regions.
Probab=82.38 E-value=0.79 Score=39.27 Aligned_cols=22 Identities=23% Similarity=0.481 Sum_probs=19.6
Q ss_pred CeEEEcccCCCCCeeEEEEEcC
Q psy9228 21 NVLRITNARLQDSGKYKCEIQG 42 (834)
Q Consensus 21 ~~l~~~~~~~~d~g~y~c~~~~ 42 (834)
-.|+|.+|+.+|+|.|.|....
T Consensus 64 ~~L~I~~v~~~DsG~Y~C~~~~ 85 (98)
T cd05716 64 FTVTLNQLRKEDAGWYWCGVGD 85 (98)
T ss_pred EEEEEcCCCHHHCEEEEEEccc
Confidence 3799999999999999999853
No 240
>cd00098 IgC Immunoglobulin Constant domain. IgC: Immunoglobulin constant domain (IgC). Members of the IgC family are components of immunoglobulin, T-cell receptors, CD1 cell surface glycoproteins, secretory glycoproteins A/C, and Major Histocompatibility Complex (MHC) class I/II molecules. In immunoglobulins, each chain is composed of one variable domain (IgV) and one or more IgC domains. These names reflect the fact that the variability in sequences is higher in the variable domain than in the constant domain. The IgV domain is responsible for antigen binding, and the IgC domain is involved in oligomerization and molecular interactions.
Probab=81.73 E-value=1.7 Score=36.72 Aligned_cols=43 Identities=12% Similarity=0.312 Sum_probs=30.3
Q ss_pred CcEEEEecCC-CCCccc--c----CCC------eEEEcccCCCCCeeEEEEEcCCC
Q psy9228 2 AYIKWSRADG-LPLQRY--A----EGN------VLRITNARLQDSGKYKCEIQGHD 44 (834)
Q Consensus 2 ~~~~w~~~~~-~~~~~~--~----~~~------~l~~~~~~~~d~g~y~c~~~~~~ 44 (834)
++|+|.|.+. +..... . .++ .|+|...+.+|.+.|.|.+.+..
T Consensus 30 ~~i~W~~~g~~~~~~~~~~~~~~~~~gt~~~~s~l~v~~~~~~~~~~y~C~v~h~~ 85 (95)
T cd00098 30 ITVTWLKNGKELTSGVTTTPPVPNSDGTYSVSSQLTVSPSDWNSGDTYTCVVTHES 85 (95)
T ss_pred cEEEEEECCEECCCceeccccccCCCCCEEEEEEEEECHHHhCCCCCEEEEEEeCC
Confidence 5899999555 766543 1 233 46677777779999999997654
No 241
>cd05720 Ig_CD8_alpha Immunoglobulin (Ig) like domain of CD8 alpha chain. Ig_CD8_alpha: immunoglobulin (Ig)-like domain in CD8 alpha. The CD8 glycoprotein plays an essential role in the control of T-cell selection, maturation and the T-cell receptor (TCR)-mediated response to peptide antigen. CD8 is comprised of alpha and beta subunits and is expressed as either an alphaalpha or alphabeta dimer. Both dimeric isoforms can serve as a coreceptor for T cell activation and differentiation, however they have distinct physiological roles, different cellular distributions, unique binding partners etc. Each CD8 subunit is comprised of an extracellular domain containing a v-type Ig-like domain, a single pass transmembrane portion and a short intracellular domain. The Ig domain of CD8 alpha binds to antibodies.
Probab=81.68 E-value=1.1 Score=39.01 Aligned_cols=21 Identities=29% Similarity=0.621 Sum_probs=19.3
Q ss_pred eEEEcccCCCCCeeEEEEEcC
Q psy9228 22 VLRITNARLQDSGKYKCEIQG 42 (834)
Q Consensus 22 ~l~~~~~~~~d~g~y~c~~~~ 42 (834)
.|+|.+++.+|+|+|.|....
T Consensus 70 ~L~I~~~~~sDsgtY~Ca~~~ 90 (104)
T cd05720 70 VLTLKNFQKENEGYYFCSVAS 90 (104)
T ss_pred EEEECCCCHHHCEEEEEEEcc
Confidence 699999999999999999863
No 242
>cd04982 IgV_TCR_gamma Immunoglobulin (Ig) variable (V) domain of T-cell receptor (TCR) gamma chain. IgV_TCR_gamma: immunoglobulin (Ig) variable (V) domain of the gamma chain of gamma/delta T-cell receptors (TCRs). TCRs mediate antigen recognition by T lymphocytes, and are heterodimers consisting of alpha and beta chains or gamma and delta chains. Each chain contains a variable (V) and a constant (C) region. The majority of T cells contain alpha/beta TCRs but a small subset contain gamma/delta TCRs. Alpha/beta TCRs recognize antigen as peptide fragments presented by major histocompatibility complex (MHC) molecules. Gamma/delta TCRs recognize intact protein antigens; they recognize protein antigens directly and without antigen processing, and MHC independently of the bound peptide. Gamma/delta T cells can also be stimulated by non-peptide antigens such as small phosphate- or amine-containing compounds. The variable domain of gamma/delta TCRs is responsible for antigen recognition and is
Probab=80.16 E-value=1.1 Score=39.83 Aligned_cols=21 Identities=33% Similarity=0.518 Sum_probs=19.3
Q ss_pred CeEEEcccCCCCCeeEEEEEc
Q psy9228 21 NVLRITNARLQDSGKYKCEIQ 41 (834)
Q Consensus 21 ~~l~~~~~~~~d~g~y~c~~~ 41 (834)
..|+|.+++.+|+|+|.|.+.
T Consensus 77 ~~L~I~~~~~~Dsg~Y~C~~~ 97 (116)
T cd04982 77 STLTIQNLEKEDSATYYCAYW 97 (116)
T ss_pred EEEEEEECCHHHCEEEEEEEe
Confidence 379999999999999999995
No 243
>PF14099 Polysacc_lyase: Polysaccharide lyase; PDB: 3ILR_A 3IKW_A 3INA_A 3IMN_A 3IN9_A 2ZZJ_A.
Probab=79.76 E-value=7.8 Score=39.03 Aligned_cols=62 Identities=16% Similarity=0.132 Sum_probs=44.3
Q ss_pred cceEEEEEECCEEEEEEEcCC----cEEEEEeCCceecCCCcEEEEEEEE-----CCEEEEEEcCeeeecc
Q psy9228 705 REFIAVAVVNGYLEYSYDLGD----GVVTIKFSKKPVNDGIKHSVNVTRI-----NKFGSLEVDSVIVGKG 766 (834)
Q Consensus 705 ~~~~~l~l~~G~l~~~~~~g~----~~~~l~~s~~~~nDg~wH~V~i~r~-----~~~~~l~VD~~~~~~~ 766 (834)
...++|.+.+|++.+.+.... ..........++.-|+||+|.|... .+.+.|.+||..+...
T Consensus 113 ~P~~~l~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~G~W~~~~i~~~~s~~~~G~~~vw~nG~~v~~~ 183 (224)
T PF14099_consen 113 SPPFALRIKGGRLYLRVRGDEPSDSGNKAYSVDLGPVERGKWHDFVIHVKWSPDSDGFLEVWLNGKLVVDY 183 (224)
T ss_dssp EECEEEEEETTEEEEEEEEE-TCEEEEEEEEEECCCS-TTSEEEEEEEEEE-CCCTEEEEEEECCEECCEE
T ss_pred CCcEEEEEeCCEEEEEEEcCCCCcccceeEeecCCCcCCCcEEEEEEEEEECCCCCEEEEEEECCEEEEEE
Confidence 568999999999999987755 1112222455677799999988664 3679999999887553
No 244
>PF12946 EGF_MSP1_1: MSP1 EGF domain 1; InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=78.85 E-value=1.2 Score=30.04 Aligned_cols=29 Identities=31% Similarity=0.818 Sum_probs=20.4
Q ss_pred CCCCCCCCCCeecccCCCCCceeeecCCCCC
Q psy9228 571 CAPKPCQNYGICYPTDTSERGYNCSCLTGYS 601 (834)
Q Consensus 571 C~~~pC~ngg~C~~~~~~~~~~~C~C~~G~~ 601 (834)
|...+|.-++.|....+ +.+.|.|..||.
T Consensus 2 C~~~~cP~NA~C~~~~d--G~eecrCllgyk 30 (37)
T PF12946_consen 2 CIDTKCPANAGCFRYDD--GSEECRCLLGYK 30 (37)
T ss_dssp -SSS---TTEEEEEETT--SEEEEEE-TTEE
T ss_pred ccCccCCCCcccEEcCC--CCEEEEeeCCcc
Confidence 66677888899988875 499999999997
No 245
>cd07700 IgV_CD8_beta Immunoglobulin (Ig) like domain of CD8 beta chain. IgV_CD8_beta: immunoglobulin (Ig)-like domain in CD8 beta. The CD8 glycoprotein plays an essential role in the control of T-cell selection, maturation and the T-cell receptor (TCR)-mediated response to peptide antigen. CD8 is comprised of alpha and beta subunits and is expressed as either an alpha/alpha or alpha/beta dimer. Both dimeric isoforms can serve as a coreceptor for T cell activation and differentiation, however they have distinct physiological roles, different cellular distributions, unique binding partners etc. Each CD8 subunit is comprised of an extracellular domain containing a V-type Ig-like domain, a single pass transmembrane portion and a short intracellular domain.
Probab=78.79 E-value=1.3 Score=38.72 Aligned_cols=20 Identities=35% Similarity=0.586 Sum_probs=18.7
Q ss_pred eEEEcccCCCCCeeEEEEEc
Q psy9228 22 VLRITNARLQDSGKYKCEIQ 41 (834)
Q Consensus 22 ~l~~~~~~~~d~g~y~c~~~ 41 (834)
.|+|.+|+.+|+|+|.|...
T Consensus 73 ~L~I~~~~~~Dsg~YyCa~~ 92 (107)
T cd07700 73 RLHINRVKPEDSGTYFCMTV 92 (107)
T ss_pred EEEECCCCHHHCEEEEEeEc
Confidence 59999999999999999985
No 246
>PF12946 EGF_MSP1_1: MSP1 EGF domain 1; InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=78.38 E-value=1.2 Score=29.99 Aligned_cols=27 Identities=30% Similarity=0.818 Sum_probs=16.4
Q ss_pred ccCCCCC-CCeeecCC-CCeEEeCCCCcc
Q psy9228 181 CYPGACG-DGSCQDVD-GAMKCLCPIGTA 207 (834)
Q Consensus 181 C~~~~C~-~g~C~~~~-~~~~C~C~~g~~ 207 (834)
|...+|. |..|++.. +.+.|.|.+||.
T Consensus 2 C~~~~cP~NA~C~~~~dG~eecrCllgyk 30 (37)
T PF12946_consen 2 CIDTKCPANAGCFRYDDGSEECRCLLGYK 30 (37)
T ss_dssp -SSS---TTEEEEEETTSEEEEEE-TTEE
T ss_pred ccCccCCCCcccEEcCCCCEEEEeeCCcc
Confidence 4556665 67888776 677888888875
No 247
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=77.59 E-value=2.2 Score=43.11 Aligned_cols=38 Identities=39% Similarity=0.891 Sum_probs=27.2
Q ss_pred CCCCCccCccCCCCC-CccCCCceeeecCCCcEEcCCCCCCC
Q psy9228 601 SGDHCEKENNMCMKG-DVCKNGGMCKVTPDSYECLCSLGYAP 641 (834)
Q Consensus 601 ~G~~Ce~~~~~C~~~-~pC~ngg~C~~~~~~~~C~C~~g~~G 641 (834)
.+..|+ +.++|... ++|. ..|.+..++|.|.|++||+.
T Consensus 180 ~~~~C~-~~~~C~~~~~~c~--~~C~~~~g~~~c~c~~g~~~ 218 (224)
T cd01475 180 QGKICV-VPDLCATLSHVCQ--QVCISTPGSYLCACTEGYAL 218 (224)
T ss_pred ccccCc-CchhhcCCCCCcc--ceEEcCCCCEEeECCCCccC
Confidence 345554 34667533 2565 47999999999999999975
No 248
>KOG1834|consensus
Probab=76.45 E-value=28 Score=39.46 Aligned_cols=134 Identities=16% Similarity=0.193 Sum_probs=83.7
Q ss_pred cceeEEEEEEEeeCC-------CCeeEEEeccCCccccCCCCCeEEEEEECcEEEEEEec--cc-EEEE------eeeee
Q psy9228 403 AHLHFSIELSFKPTD-------YNGLIMYTGDSNMKSYKGKGDFVSFGLEDGYPVFRFDV--GL-VVLR------SKVTL 466 (834)
Q Consensus 403 ~~~~~~i~~~frt~~-------~~GlLl~~~~~~~~~~~~~~d~~~l~L~~G~l~~~~~~--G~-~~i~------s~~~~ 466 (834)
...+|+|+|+.|--. ..-.||-..+.+ ..+...++|.+..-++.|.+.- |. ...+ .-..+
T Consensus 364 l~dhFTlSfwMkHg~~p~~~~~eketIlCnsdk~----emnrhHyslyvh~Crl~fllr~d~~~~~~fRpaef~Wkl~qV 439 (952)
T KOG1834|consen 364 LPDHFTLSFWMKHGPGPKDEQSEKETILCNSDKT----EMNRHHYSLYVHGCRLEFLLRRDAGATSDFRPAEFHWKLPQV 439 (952)
T ss_pred CCCceEEEEeeecCCCCccccccceeEEeccccc----ccccceeEEEEeccEEEEEEccCccccccccchheeccchhh
Confidence 345799999998533 223455544432 4577889999999999999854 31 1111 33568
Q ss_pred cCCCeEEEEEEEECCeEEEEECCeeeeee--ecCCCccccccCCCceeeccccCCCcccCcCcccccCceeEEeeeEEc
Q psy9228 467 VPHEWVVVTIIKDFKEGKLSVGGEPLIVG--STPGEKLQVLNLRTPLYLGGYNIYHVTPSLSVEVTEGFHGCISTIDVL 543 (834)
Q Consensus 467 ~dg~wH~V~v~~~~~~~~l~VD~~~~~~~--~~~g~~~~~l~~~~~l~iGG~~~~~~~~~~~~~~~~gF~GCi~~v~in 543 (834)
-|.+||+-.+..+.-.++|+|||..-... ..... ...-.....|.||..=.... ........-|.|-+..+.+.
T Consensus 440 CD~EWH~Y~ln~efp~VtlyvDG~Sfep~~i~ddwp-lHpsk~~tqLvVGACW~g~~--~~~l~~aqfFrG~Lasltlr 515 (952)
T KOG1834|consen 440 CDNEWHHYVLNVEFPDVTLYVDGKSFEPPLITDDWP-LHPSKIETQLVVGACWQGRQ--QKPLKLAQFFRGQLASLTLR 515 (952)
T ss_pred hhhhhheeEEeecCceEEEEEcCcccCCceeccCCc-cCcccccceeEEeeeccCcc--ccchhHHHHhhcccceeEEe
Confidence 99999999999999999999999643221 11111 11112455678887544321 11123345577877777663
No 249
>cd07706 IgV_TCR_delta Immunoglobulin (Ig) variable (V) domain of T-cell receptor (TCR) delta chain. IgV_TCR_delta: immunoglobulin (Ig) variable (V) domain of the delta chain of gamma/delta T-cell receptors (TCRs). TCRs mediate antigen recognition by T lymphocytes, and are heterodimers consisting of alpha and beta chains or gamma and delta chains. Each chain contains a variable (V) and a constant (C) region. The majority of T cells contain alpha/beta TCRs but a small subset contain gamma/delta TCRs. Alpha/beta TCRs recognize antigen as peptide fragments presented by major histocompatibility complex (MHC) molecules. Gamma/delta TCRs recognize intact protein antigens; they recognize protein antigens directly and without antigen processing, and MHC independently of the bound peptide. Gamma/delta T cells can also be stimulated by non-peptide antigens such as small phosphate- or amine-containing compounds. The variable domain of gamma/delta TCRs is responsible for antigen recognition and is
Probab=76.27 E-value=1.6 Score=38.85 Aligned_cols=22 Identities=36% Similarity=0.602 Sum_probs=19.9
Q ss_pred CeEEEcccCCCCCeeEEEEEcC
Q psy9228 21 NVLRITNARLQDSGKYKCEIQG 42 (834)
Q Consensus 21 ~~l~~~~~~~~d~g~y~c~~~~ 42 (834)
-.|+|.+++.+|+|+|.|....
T Consensus 72 ~sL~I~~~~~~Dsg~Y~C~~~~ 93 (116)
T cd07706 72 ISLTISALQLEDSAKYFCALST 93 (116)
T ss_pred EEEEECCCCHHHCEEEEEEEEC
Confidence 3699999999999999999973
No 250
>cd05860 Ig4_SCFR Fourth immunoglobulin (Ig)-like domain of stem cell factor receptor (SCFR). Ig4_SCFR: The fourth Immunoglobulin (Ig)-like domain in stem cell factor receptor (SCFR). SCFR is organized as an extracellular component having five IG-like domains, a transmembrane segment, and a cytoplasmic portion having protein tyrosine kinase activity. SCFR and its ligand SCF are critical for normal hematopoiesis, mast cell development, melanocytes and gametogenesis. SCF binds to the second and third Ig-like domains of SCFR. This fourth Ig-like domain participates in SCFR dimerization, which follows ligand binding. Deletion of this fourth domain abolishes the ligand-induced dimerization of SCFR and completely inhibits signal transduction.
Probab=74.43 E-value=2.8 Score=35.98 Aligned_cols=43 Identities=19% Similarity=0.362 Sum_probs=32.0
Q ss_pred cEEEEecCC-CC--Ccccc-----C----CCeEEEcccCCCCCeeEEEEEcCCCC
Q psy9228 3 YIKWSRADG-LP--LQRYA-----E----GNVLRITNARLQDSGKYKCEIQGHDS 45 (834)
Q Consensus 3 ~~~w~~~~~-~~--~~~~~-----~----~~~l~~~~~~~~d~g~y~c~~~~~~~ 45 (834)
.+.|.|... |- .+... . -.+|+|..++.+|+|.|++.+.|.+.
T Consensus 35 ~~~W~~~~~~l~~~~~~~~~~~~~~~~rY~S~L~L~Rlk~~E~G~YTf~a~N~~~ 89 (101)
T cd05860 35 HQQWIYMNRTLTNTSDHYVKSRNESNNRYVSELHLTRLKGTEGGTYTFLVSNSDA 89 (101)
T ss_pred eeEEEcCCcccCccccceeEEeccCceEEEEEEEEeecChhhCcEEEEEEECCCC
Confidence 489999655 32 22111 1 26899999999999999999998874
No 251
>cd04981 IgV_H Immunoglobulin (Ig) heavy chain (H), variable (V) domain. IgV_H: Immunoglobulin (Ig) heavy chain (H), variable (V) domain. The basic structure of Ig molecules is a tetramer of two light chains and two heavy chains linked by disulfide bonds. In Ig, each chain is composed of one variable domain (IgV) and one or more constant domains (IgC); these names reflect the fact that the variability in sequences is higher in the variable domain than in the constant domain. There are five types of heavy chains (alpha, gamma, delta, epsilon, and mu), which determine the type of immunoglobulin: IgA, IgG, IgD, IgE, and IgM, respectively. In higher vertebrates, there are two types of light chain, designated kappa and lambda, which can associate with any of the heavy chains. This family includes alpha, gamma, delta, epsilon, and mu heavy chains.
Probab=70.95 E-value=4.1 Score=36.27 Aligned_cols=21 Identities=19% Similarity=0.335 Sum_probs=19.3
Q ss_pred eEEEcccCCCCCeeEEEEEcC
Q psy9228 22 VLRITNARLQDSGKYKCEIQG 42 (834)
Q Consensus 22 ~l~~~~~~~~d~g~y~c~~~~ 42 (834)
.|+|.+++.+|+|+|-|....
T Consensus 77 ~L~I~~~~~~Dsa~YyCa~~~ 97 (117)
T cd04981 77 YLQLNSLTPEDTAVYYCARGL 97 (117)
T ss_pred EEEECCCCHHHCEEEEEEEEc
Confidence 699999999999999999854
No 252
>cd07705 Ig2_Necl-1 Second immunoglobulin (Ig)-like domain of nectin-like molcule-1 (Necl-1, also known as cell adhesion molecule3 (CADM3)). Ig2_Necl-1: second immunoglobulin (Ig)-like domain of nectin-like molcule-1 (Necl-1, also known as cell adhesion molecule3 (CADM3)). These nectin-like molecules have similar domain structures to those of nectins. At least five nectin-like molecules have been identified (Necl-1 - Necl-5). These have an extracellular region containing three Ig-like domains, one transmembrane region, and one cytoplasmic region. The N-terminal Ig-like domain of the extracellular region belongs to the V-type subfamily of Ig domains, is essential to cell-cell adhesion, and plays a part in the interaction with the envelope glycoprotein D of various viruses. Necl-1 and Necl-2 have Ca(2+)-independent homophilic and heterophilic cell-cell adhesion activity. Necl-1 is specifically expressed in neural tissue and is important to the formation of synapses, axon bundles, and myel
Probab=69.91 E-value=7.6 Score=32.03 Aligned_cols=43 Identities=23% Similarity=0.447 Sum_probs=26.7
Q ss_pred CcEEEEecCC-CCCcccc-----CCC------eEEEcccCCCCCeeEEEEEcCCC
Q psy9228 2 AYIKWSRADG-LPLQRYA-----EGN------VLRITNARLQDSGKYKCEIQGHD 44 (834)
Q Consensus 2 ~~~~w~~~~~-~~~~~~~-----~~~------~l~~~~~~~~d~g~y~c~~~~~~ 44 (834)
++|+|.|.+. |...... .++ .|.|.-...+|...|.|.+.+..
T Consensus 15 ~~ItW~k~g~~l~~~~~~~~~~~~~~t~~~~s~l~~~~~~~d~g~~~tC~v~h~~ 69 (83)
T cd07705 15 ANIKWRKGDQELEGAPTSVLEDGNGKTFTVSSSVEFQVTREDDGAEITCSVGHES 69 (83)
T ss_pred CEeEEEECCEECCCcceeEEECCCCCEEEEEEEEEEEecchhCCCEEEEEEECcc
Confidence 5899999655 7654321 222 34443444567789999997653
No 253
>cd01951 lectin_L-type legume lectins. The L-type (legume-type) lectins are a highly diverse family of carbohydrate binding proteins that generally display no enzymatic activity toward the sugars they bind. This family includes arcelin, concanavalinA, the lectin-like receptor kinases, the ERGIC-53/VIP36/EMP46 type1 transmembrane proteins, and an alpha-amylase inhibitor. L-type lectins have a dome-shaped beta-barrel carbohydrate recognition domain with a curved seven-stranded beta-sheet referred to as the "front face" and a flat six-stranded beta-sheet referred to as the "back face". This domain homodimerizes so that adjacent back sheets form a contiguous 12-stranded sheet and homotetramers occur by a back-to-back association of these homodimers. Though L-type lectins exhibit both sequence and structural similarity to one another, their carbohydrate binding specificities differ widely.
Probab=69.63 E-value=73 Score=31.93 Aligned_cols=23 Identities=22% Similarity=0.182 Sum_probs=20.1
Q ss_pred CCcEEEEEEEEC--CEEEEEEcCee
Q psy9228 740 GIKHSVNVTRIN--KFGSLEVDSVI 762 (834)
Q Consensus 740 g~wH~V~i~r~~--~~~~l~VD~~~ 762 (834)
|+||+|+|.... +.+++.|+...
T Consensus 154 g~~~~v~I~Y~~~~~~L~v~l~~~~ 178 (223)
T cd01951 154 GNEHTVRITYDPTTNTLTVYLDNGS 178 (223)
T ss_pred CCEEEEEEEEeCCCCEEEEEECCCC
Confidence 999999999994 78999998764
No 254
>PTZ00334 trans-sialidase; Provisional
Probab=67.44 E-value=25 Score=41.77 Aligned_cols=74 Identities=19% Similarity=0.164 Sum_probs=45.0
Q ss_pred eecCCCcEEEEEEE-ECCEEEEEEcCeeeecccCCCCccceecCCceEEcCcCCCCCCCCCccCCCceEEEEEEEECCce
Q psy9228 736 PVNDGIKHSVNVTR-INKFGSLEVDSVIVGKGESPGSQDVINTRGNIYLGGTPNMDLMTGGRYVHPMSGLMMNIHIQNKH 814 (834)
Q Consensus 736 ~~nDg~wH~V~i~r-~~~~~~l~VD~~~~~~~~~~~~~~~l~~~~~lyiGG~p~~~~~~~~~~~~~F~GCi~~v~in~~~ 814 (834)
...-++-|+|.|.. ++++++++|||+.+.....+.......-...+||||--.... ...+..==++||.|-+++
T Consensus 639 tWe~~k~yqVal~L~~G~~gsvYVDG~~vg~~~~~l~~~~~~~IshFyiGgdg~~~~-----~~~~~~VTV~NVlLYNRp 713 (780)
T PTZ00334 639 NWEPETTHQVAIVLRNGKQGSAYVDGQRVGDASCELKNTDSKGISHFYIGGDGGSAG-----SKEDVPVTATNVLLYNRP 713 (780)
T ss_pred cccCCCeEEEEEEEeCCCeEEEEECCEEecCcccccCCCCCcccceEEECCCccccc-----cCCCCCEEEeEeEEeCCC
Confidence 46668999999987 567999999999885433222222222335799999542200 011222256777776654
No 255
>cd05883 Ig2_Necl-2 Second immunoglobulin (Ig)-like domain of nectin-like molecule 2 (also known as cell adhesion molecule 1 (CADM1)). Ig2_Necl-2: second immunoglobulin (Ig)-like domain of nectin-like molecule 2 (also known as cell adhesion molecule 1 (CADM1)). Nectin-like molecules (Necls) have similar domain structures to those of nectins. At least five nectin-like molecules have been identified (Necl-1 - Necl-5). These have an extracellular region containing three Ig-like domains, one transmembrane region, and one cytoplasmic region. Necl-2 has Ca(2+)-independent homophilic and heterophilic cell-cell adhesion activity. Necl-1 is expressed in a wide variety of tissues, and is a putative tumour suppressor gene, which is downregulated in aggressive neuroblastoma. Ig domains are likely to participate in ligand binding and recognition.
Probab=67.13 E-value=7.2 Score=32.12 Aligned_cols=44 Identities=16% Similarity=0.249 Sum_probs=28.8
Q ss_pred CCcEEEEecCC-CCCcccc----C-----CCeEEEcccCCCCCeeEEEEEcCCC
Q psy9228 1 NAYIKWSRADG-LPLQRYA----E-----GNVLRITNARLQDSGKYKCEIQGHD 44 (834)
Q Consensus 1 ~~~~~w~~~~~-~~~~~~~----~-----~~~l~~~~~~~~d~g~y~c~~~~~~ 44 (834)
+|+|+|.|.+. |+..... . -.+|.|.--+..|.-.|.|.+.+..
T Consensus 14 ~A~I~W~k~~~~l~~~~~~~~~~~~~~t~~S~L~~~p~~eDdG~~~~C~a~~~~ 67 (82)
T cd05883 14 AATIRWFKGNKELTGKSTVEETWSRMFTVTSQLMLKVTKEDDGVPVICLVDHPA 67 (82)
T ss_pred CCEEEEEECCEECcCcccceeccCCCcEEEEEEEEECchhhCCCEEEEEEcCcc
Confidence 46999999665 8775332 1 2467775555556667789997653
No 256
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domai.
Probab=66.60 E-value=6.6 Score=28.23 Aligned_cols=18 Identities=28% Similarity=0.805 Sum_probs=15.0
Q ss_pred CeEEeCCCCcccCccccc
Q psy9228 197 AMKCLCPIGTAGKRCEQK 214 (834)
Q Consensus 197 ~~~C~C~~g~~G~~Ce~~ 214 (834)
.-.|.|+++++|+.|+.-
T Consensus 17 ~G~C~C~~~~~G~~C~~C 34 (46)
T smart00180 17 TGQCECKPNVTGRRCDRC 34 (46)
T ss_pred CCEEECCCCCCCCCCCcC
Confidence 347999999999999853
No 257
>PHA03376 BARF1; Provisional
Probab=66.38 E-value=4.9 Score=38.25 Aligned_cols=24 Identities=25% Similarity=0.512 Sum_probs=20.6
Q ss_pred CeEEEcccCCCCCeeEEEEEcCCC
Q psy9228 21 NVLRITNARLQDSGKYKCEIQGHD 44 (834)
Q Consensus 21 ~~l~~~~~~~~d~g~y~c~~~~~~ 44 (834)
-.|.|.+++.+|.|+|.|..+.+.
T Consensus 87 vsLvI~~l~lSDdGtY~C~fQkge 110 (221)
T PHA03376 87 FFLVVTAANISHDGNYLCRMKLGE 110 (221)
T ss_pred EEEEEEeeeecCCceEEEEEEcCC
Confidence 378899999999999999996443
No 258
>PF09264 Sial-lect-inser: Vibrio cholerae sialidase, lectin insertion; InterPro: IPR015344 This domain is predominantly found in Vibrio cholerae sialidase, and adopt a beta sandwich structure consisting of 12-14 strands arranged in two beta-sheets. It binds to lectins with high affinity helping to target the protein to sialic acid-rich environments, thereby enhancing the catalytic efficiency of the enzyme []. ; PDB: 1W0P_A 1W0O_A 1KIT_A 2W68_B.
Probab=66.36 E-value=83 Score=30.06 Aligned_cols=85 Identities=8% Similarity=0.098 Sum_probs=54.0
Q ss_pred cCcceEEEEEEeCCCCeeEEecCCCCCCCCCCcceEE-EEEE-CCEEEEEEEcCCcEEEEEeCCceecCCCcEEEEEEEE
Q psy9228 673 RNEETIAFDFVTDDKNALLLWNGQPSYKNGIGREFIA-VAVV-NGYLEYSYDLGDGVVTIKFSKKPVNDGIKHSVNVTRI 750 (834)
Q Consensus 673 ~~~~~i~~~frT~~~~GlLl~~~~~~~~~~~~~~~~~-l~l~-~G~l~~~~~~g~~~~~l~~s~~~~nDg~wH~V~i~r~ 750 (834)
...++++-..|-.+-.-...|.+.-. +.|+. |.|. +|.|+..++-+++...+......+ .-.|...|...
T Consensus 31 ~~gW~ls~~~RV~~G~~n~~yyAnG~------~r~l~~lsvn~sG~LvA~L~g~ss~~~~~~~~~di--~gyH~Y~i~~~ 102 (198)
T PF09264_consen 31 QQGWSLSWESRVVSGGCNTNYYANGS------KRYLPILSVNESGSLVAELEGQSSNTLLATTGADI--HGYHKYEIVFS 102 (198)
T ss_dssp CC-EEEEEEEEEEEES-EEEEEEESS------EEEEEEEEE-TTS-EEEEETTS-S-EEEE-CHHHH--CSEEEEEEEEE
T ss_pred hcCcceeeeEEEecCcceeEEEcCCc------eEEEEEEEEcCCCCEEEEEecCCCcEEEecccccc--cceeEEEEEec
Confidence 36677888888765444444444321 35654 4555 789999998777777776221222 47999999997
Q ss_pred C--CEEEEEEcCeeeec
Q psy9228 751 N--KFGSLEVDSVIVGK 765 (834)
Q Consensus 751 ~--~~~~l~VD~~~~~~ 765 (834)
. ..++++|||..+..
T Consensus 103 p~~~tASfy~DG~lI~t 119 (198)
T PF09264_consen 103 PLTNTASFYFDGTLIAT 119 (198)
T ss_dssp TTTTEEEEEETTEEEEE
T ss_pred CCCCceEEEECCEEEee
Confidence 6 89999999988764
No 259
>PF00053 Laminin_EGF: Laminin EGF-like (Domains III and V); InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and three short globular arms. The long arm consist of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below. +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=65.89 E-value=4 Score=29.75 Aligned_cols=16 Identities=31% Similarity=0.885 Sum_probs=12.5
Q ss_pred ceeeecCCCCCCCCCc
Q psy9228 591 GYNCSCLTGYSGDHCE 606 (834)
Q Consensus 591 ~~~C~C~~G~~G~~Ce 606 (834)
..+|.|+++|+|+.|+
T Consensus 17 ~G~C~C~~~~~G~~C~ 32 (49)
T PF00053_consen 17 TGQCVCKPGTTGPRCD 32 (49)
T ss_dssp CEEESBSTTEESTTS-
T ss_pred CCEEeccccccCCcCc
Confidence 6688888888888887
No 260
>cd07694 Ig2_CD4 Second immunoglobulin (Ig) domain of CD4. Ig2_CD4; second immunoglobulin (Ig) domain of CD4. CD4 and CD8 are the two primary co-receptor proteins found on the surface of T cells, and the presence of either CD4 or CD8 determines the function of the T cell. CD4 is found on helper T cells, where it is required for the binding of MHC (major histocompatibility complex) class II molecules, while CD8 is found on cytotoxic T cells, where it is required for the binding of MHC class I molecules. CD4 contains four immunoglobulin domains, with the first three included in this hierarchy. The fourth domain has a general Ig architecture, but has slight topological changes in the arrangement of beta strands relative to the other structures in this family and is not specifically included in the hierarchy.
Probab=64.32 E-value=12 Score=31.08 Aligned_cols=40 Identities=15% Similarity=0.330 Sum_probs=29.0
Q ss_pred cEEEEecCC-CCCccccCCCeEEEcccCCCCCeeEEEEEcC
Q psy9228 3 YIKWSRADG-LPLQRYAEGNVLRITNARLQDSGKYKCEIQG 42 (834)
Q Consensus 3 ~~~w~~~~~-~~~~~~~~~~~l~~~~~~~~d~g~y~c~~~~ 42 (834)
.|+|.-.++ .-......+..|.+.+|...|+|+..|+.+.
T Consensus 32 ~i~w~~P~n~~~~~~~~~~ktL~~~qv~~qdSG~WtC~V~~ 72 (88)
T cd07694 32 KVEWRGPGNKSKQILNQDKKTLNLVQLGPNDSGTWDCIVSV 72 (88)
T ss_pred cEEEeCCCCccceeccCCccEEEeceeCcccCCEEEEEEEE
Confidence 789986332 1111122567999999999999999999963
No 261
>cd02175 GH16_lichenase lichenase, member of glycosyl hydrolase family 16. Lichenase, also known as 1,3-1,4-beta-glucanase, is a member of glycosyl hydrolase family 16, that specifically cleaves 1,4-beta-D-glucosidic bonds in mixed-linked beta glucans that also contain 1,3-beta-D-glucosidic linkages. Natural substrates of beta-glucanase are beta-glucans from grain endosperm cell walls or lichenan from the Islandic moss, Cetraria islandica. This protein is found not only in bacteria but also in anaerobic fungi. This domain includes two seven-stranded antiparallel beta-sheets that are adjacent to one another forming a compact, jellyroll beta-sandwich structure.
Probab=63.67 E-value=1.2e+02 Score=30.12 Aligned_cols=29 Identities=14% Similarity=-0.050 Sum_probs=25.3
Q ss_pred CCCcEEEEEEEECCEEEEEEcCeeeeccc
Q psy9228 739 DGIKHSVNVTRINKFGSLEVDSVIVGKGE 767 (834)
Q Consensus 739 Dg~wH~V~i~r~~~~~~l~VD~~~~~~~~ 767 (834)
...||+-.|.+..++++.+|||..+....
T Consensus 137 ~~~~H~Y~v~W~~~~i~~yvDg~~v~~~~ 165 (212)
T cd02175 137 SEGFHTYAFEWEPDSIRWYVDGELVHEAT 165 (212)
T ss_pred ccccEEEEEEEeCCEEEEEECCEEEEEEc
Confidence 46899999999999999999998876553
No 262
>cd05884 Ig2_Necl-3 Second immunoglobulin (Ig)-like domain of nectin-like molecule-3 (Necl-3, also known as cell adhesion molecule 2 (CADM2)). Ig2_Necl-3: second immunoglobulin (Ig)-like domain of nectin-like molecule-3 (Necl-3, also known as cell adhesion molecule 2 (CADM2)). Nectin-like molecules have similar domain structures to those of nectins. At least five nectin-like molecules have been identified (Necl-1 - Necl-5). These have an extracellular region containing three Ig-like domains, one transmembrane region, and one cytoplasmic region. Necl-3 has been shown to accumulate in tissues of the central and peripheral nervous system, where it is expressed in ependymal cells and myelinated axons. It is observed at the interface between the axon shaft and the myelin sheath. Ig domains are likely to participate in ligand binding and recognition.
Probab=63.67 E-value=14 Score=30.51 Aligned_cols=42 Identities=21% Similarity=0.393 Sum_probs=27.1
Q ss_pred CcEEEEecCC-CCCcccc-----CCC------eEEEcccCCCCCeeEEEEEcCC
Q psy9228 2 AYIKWSRADG-LPLQRYA-----EGN------VLRITNARLQDSGKYKCEIQGH 43 (834)
Q Consensus 2 ~~~~w~~~~~-~~~~~~~-----~~~------~l~~~~~~~~d~g~y~c~~~~~ 43 (834)
++|+|.|.+. ++..... .++ .|.|.--...|...|.|.+.+.
T Consensus 15 ~~itW~kng~~l~~~~~~~~~~~~~~t~~~~s~L~~~~~~~d~g~~ytC~v~h~ 68 (83)
T cd05884 15 ADIRWFKNDKEIKDVKYLKEEDANRKTFTVSSTLDFRVDRSDDGVAIICRVDHE 68 (83)
T ss_pred CEEEEEECCEECcCceeeEeeCCCCCEEEEEEEEEEEcccccCCCEEEEEEech
Confidence 5899999555 7654322 122 5555554566778999999755
No 263
>cd06899 lectin_legume_LecRK_Arcelin_ConA legume lectins, lectin-like receptor kinases, arcelin, concanavalinA, and alpha-amylase inhibitor. This alignment model includes the legume lectins (also known as agglutinins), the arcelin (also known as phytohemagglutinin-L) family of lectin-like defense proteins, the LecRK family of lectin-like receptor kinases, concanavalinA (ConA), and an alpha-amylase inhibitor. Arcelin is a major seed glycoprotein discovered in kidney beans (Phaseolus vulgaris) that has insecticidal properties and protects the seeds from predation by larvae of various bruchids. Arcelin is devoid of monosaccharide binding properties and lacks a key metal-binding loop that is present in other members of this family. Phytohaemagglutinin (PHA) is a lectin found in plants, especially beans, that affects cell metabolism by inducing mitosis and by altering the permeability of the cell membrane to various proteins. PHA agglutinates most mammalian red blood cell types by bindin
Probab=62.39 E-value=62 Score=32.85 Aligned_cols=25 Identities=12% Similarity=-0.012 Sum_probs=20.7
Q ss_pred ecCCCcEEEEEEEEC--CEEEEEEcCe
Q psy9228 737 VNDGIKHSVNVTRIN--KFGSLEVDSV 761 (834)
Q Consensus 737 ~nDg~wH~V~i~r~~--~~~~l~VD~~ 761 (834)
+.||++|+|.|.... +.+++.|+..
T Consensus 160 l~~g~~~~v~I~Y~~~~~~L~V~l~~~ 186 (236)
T cd06899 160 LKSGKPMQAWIDYDSSSKRLSVTLAYS 186 (236)
T ss_pred ccCCCeEEEEEEEcCCCCEEEEEEEeC
Confidence 579999999999995 6777777754
No 264
>cd07691 Ig_CD3_gamma_delta Immunoglobulin (Ig)-like domain of CD3 gamma and delta chains. Ig_CD3_gamma_delta; immunoglobulin (Ig)-like domain of CD3 gamma and delta chains. CD3 is a T cell surface receptor that is associated with alpha/beta T cell receptors (TCRs). The CD3 complex consists of one gamma, one delta, two epsilon, and two zeta chains. The CD3 subunits form heterodimers as gamma/epsilon, delta/epsilon, and zeta/zeta. The gamma, delta, and epsilon chains each contain an extracellular Ig domain, whereas the extracellular domains of the zeta chains are very small and have unknown structure. The CD3 domain participates in intracellular signalling once the TCR has bound an MHC/antigen complex.
Probab=62.01 E-value=11 Score=29.64 Aligned_cols=36 Identities=19% Similarity=0.481 Sum_probs=29.0
Q ss_pred CcEEEEecCC--CCCccccCCCeEEEcccCCCCCeeEEEEEcC
Q psy9228 2 AYIKWSRADG--LPLQRYAEGNVLRITNARLQDSGKYKCEIQG 42 (834)
Q Consensus 2 ~~~~w~~~~~--~~~~~~~~~~~l~~~~~~~~d~g~y~c~~~~ 42 (834)
.+|+|.| +. .+.. +..|-|-++...=.|+|.|.+++
T Consensus 17 tsi~W~k-G~~~~~~~----~~tlnLGs~~~DPRG~Y~C~~s~ 54 (69)
T cd07691 17 TNITWKK-GKEILEVS----NTLLDLGSRINDPRGTYSCKESE 54 (69)
T ss_pred CcEEEec-Cccccccc----ccEEeccCcccCCCcceEecCcc
Confidence 5799999 44 3222 67899999999999999999965
No 265
>PF00954 S_locus_glycop: S-locus glycoprotein family; InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=60.65 E-value=8.5 Score=33.76 Aligned_cols=32 Identities=28% Similarity=0.743 Sum_probs=24.2
Q ss_pred cCCCCCCccCCCceeeecCCCcEEcCCCCCCCC
Q psy9228 610 NMCMKGDVCKNGGMCKVTPDSYECLCSLGYAPP 642 (834)
Q Consensus 610 ~~C~~~~pC~ngg~C~~~~~~~~C~C~~g~~G~ 642 (834)
+.|.....|...|.|.. .....|.|.+||...
T Consensus 78 d~Cd~y~~CG~~g~C~~-~~~~~C~Cl~GF~P~ 109 (110)
T PF00954_consen 78 DQCDVYGFCGPNGICNS-NNSPKCSCLPGFEPK 109 (110)
T ss_pred cCCCCccccCCccEeCC-CCCCceECCCCcCCC
Confidence 56665458999999954 456779999999764
No 266
>PF00954 S_locus_glycop: S-locus glycoprotein family; InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=59.77 E-value=8.6 Score=33.73 Aligned_cols=31 Identities=35% Similarity=0.827 Sum_probs=19.6
Q ss_pred CCCccC-CCCC-CCeeecCCCCeEEeCCCCcccC
Q psy9228 178 GEPCYP-GACG-DGSCQDVDGAMKCLCPIGTAGK 209 (834)
Q Consensus 178 ~~~C~~-~~C~-~g~C~~~~~~~~C~C~~g~~G~ 209 (834)
.|.|.. ..|+ +|.|.. .....|.|.+||.-+
T Consensus 77 ~d~Cd~y~~CG~~g~C~~-~~~~~C~Cl~GF~P~ 109 (110)
T PF00954_consen 77 KDQCDVYGFCGPNGICNS-NNSPKCSCLPGFEPK 109 (110)
T ss_pred ccCCCCccccCCccEeCC-CCCCceECCCCcCCC
Confidence 345653 6676 578843 345568888888643
No 267
>KOG3546|consensus
Probab=59.42 E-value=24 Score=39.63 Aligned_cols=125 Identities=16% Similarity=0.174 Sum_probs=75.3
Q ss_pred eeEEEEEEEeeCCC-CeeEEEeccCCccccCCCCCeEEEEE---ECcEEEEEEec---cc----EEEEeeeeecCCCeEE
Q psy9228 405 LHFSIELSFKPTDY-NGLIMYTGDSNMKSYKGKGDFVSFGL---EDGYPVFRFDV---GL----VVLRSKVTLVPHEWVV 473 (834)
Q Consensus 405 ~~~~i~~~frt~~~-~GlLl~~~~~~~~~~~~~~d~~~l~L---~~G~l~~~~~~---G~----~~i~s~~~~~dg~wH~ 473 (834)
++|+|.|.+|.... -|+||-.-+. ....-|+-|.| +||+-.+.+-. |. ....-..++-.++|.+
T Consensus 87 rdf~~~~~i~p~s~~~gvlfaitd~-----~q~~i~lg~~lsgv~dghq~i~l~ytepg~~~s~~aa~f~~p~~~~~w~~ 161 (1167)
T KOG3546|consen 87 RDFSLLFHIRPATEGPGVLFAITDS-----AQAMVLLGVKLSGVQDGHQDISLLYTEPGAGQTHTAASFRLPAFVGQWTH 161 (1167)
T ss_pred ccceEEEEeeccCCCCceEEEechh-----hhhhheeeeeeeccccCcceeEEEeccCCCCccchhheeccchhhchhhh
Confidence 46888888998765 4566554433 22334444444 56654444322 21 1112245567799999
Q ss_pred EEEEEECCeEEEEECCeeeeeeecCCCcccc--ccCCCceeeccccCCCcccCcCcccccCceeEEeeeEEcC
Q psy9228 474 VTIIKDFKEGKLSVGGEPLIVGSTPGEKLQV--LNLRTPLYLGGYNIYHVTPSLSVEVTEGFHGCISTIDVLG 544 (834)
Q Consensus 474 V~v~~~~~~~~l~VD~~~~~~~~~~g~~~~~--l~~~~~l~iGG~~~~~~~~~~~~~~~~gF~GCi~~v~ing 544 (834)
+.+...+..+.|.||-.+..+....-+ ++. +.....||++-.-... ...|+|-|+++.+.-
T Consensus 162 ~a~~v~g~~v~l~v~cee~~r~p~~rs-s~~l~~e~~ag~f~~~ag~~~---------~~~f~g~~~~l~v~~ 224 (1167)
T KOG3546|consen 162 LALSVAGGFVALYVDCEEFQRMPLARS-SRGLELEPGAGLFVAQAGGAD---------PDKFQGVIAELKVRR 224 (1167)
T ss_pred eeeeecCceEEEEechHHhcccchhcc-ccceeecCCcceEEeccCCCC---------hHhhhhhhhheeecC
Confidence 999999999999999655443221111 233 3445568887543321 356999999998853
No 268
>PF07953 Toxin_R_bind_N: Clostridium neurotoxin, N-terminal receptor binding; InterPro: IPR012928 The Clostridium neurotoxin family is composed of tetanus neurotoxin and seven serotypes of botulinum neurotoxin. The structure of the botulinum neurotoxin reveals a four domain protein. The N-terminal catalytic domain (IPR000395 from INTERPRO), the central translocation domain and two receptor binding domains []. This domain is the N-terminal receptor binding domain, which is comprised of two seven-stranded beta-sheets sandwiched together to form a jelly role motif []. The role of this domain in receptor binding appears to be indirect. ; GO: 0004222 metalloendopeptidase activity, 0050827 toxin receptor binding, 0009405 pathogenesis, 0051609 inhibition of neurotransmitter uptake, 0005576 extracellular region; PDB: 3RSJ_B 3FUQ_A 3FFZ_B 1G9B_A 1S0F_A 1Z0H_B 1F31_A 1G9D_A 1S0C_A 1S0D_A ....
Probab=56.19 E-value=1.7e+02 Score=28.09 Aligned_cols=86 Identities=12% Similarity=0.125 Sum_probs=57.3
Q ss_pred CcceEEEEEEeCCCCee--------EE-ecCCCCCCCCCCcceEEEEEECCEEEEEEEcCCc-EEEEEe--CC-ceecC-
Q psy9228 674 NEETIAFDFVTDDKNAL--------LL-WNGQPSYKNGIGREFIAVAVVNGYLEYSYDLGDG-VVTIKF--SK-KPVND- 739 (834)
Q Consensus 674 ~~~~i~~~frT~~~~Gl--------Ll-~~~~~~~~~~~~~~~~~l~l~~G~l~~~~~~g~~-~~~l~~--s~-~~~nD- 739 (834)
..++++||+|....+-. |. -... ..=-.++++++.|.+.+.--.| ...|.. +. ..++|
T Consensus 55 ~nFSIsFWlRipk~~~~~~~~neytII~~~~n--------NsGWkI~l~~n~iiwtl~D~ng~~k~i~f~y~~~~~~Sdy 126 (195)
T PF07953_consen 55 NNFSISFWLRIPKYDNNINLNNEYTIINCMKN--------NSGWKISLNNNGIIWTLIDSNGNEKSIYFNYSIMDNISDY 126 (195)
T ss_dssp SEEEEEEEEEEECHHCCHHTTSEEEEEEEEET--------TEEEEEEEETTEEEEEEEETTSEEEEEEEESSSTSSTTSS
T ss_pred cceEEEEEEEccCcccccccCcceEEEEeecC--------CCceEEEEeCCcEEEEEEeCCCCEEEEEEEcccccchhhh
Confidence 78899999998653332 22 2211 2335789999999998765443 333432 21 12233
Q ss_pred -CCcEEEEEEEEC-CEEEEEEcCeeeeccc
Q psy9228 740 -GIKHSVNVTRIN-KFGSLEVDSVIVGKGE 767 (834)
Q Consensus 740 -g~wH~V~i~r~~-~~~~l~VD~~~~~~~~ 767 (834)
++||-|+++-.. +...++++|..+..+.
T Consensus 127 iNkW~fITITnnrL~~~~IyINg~Li~~~~ 156 (195)
T PF07953_consen 127 INKWFFITITNNRLGNSKIYINGNLIDNES 156 (195)
T ss_dssp TTSEEEEEEEEETTSEEEEEETTEEEEEEE
T ss_pred cccEEEEEEEcccCccceEEECCEEEcccc
Confidence 799999999997 7779999998876653
No 269
>PF02057 Glyco_hydro_59: Glycosyl hydrolase family 59; InterPro: IPR001286 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 59 GH59 from CAZY comprises enzymes with only one known activity; galactocerebrosidase (3.2.1.46 from EC). Globoid cell leukodystrophy (Krabbe disease) is a severe, autosomal recessive disorder that results from deficiency of galactocerebrosidase (GALC) activity [, , ]. GALC is responsible for the lysosomal catabolism of certain galactolipids, including galactosylceramide and psychosine [].; GO: 0004336 galactosylceramidase activity, 0006683 galactosylceramide catabolic process; PDB: 3ZR6_A 3ZR5_A.
Probab=55.91 E-value=74 Score=37.07 Aligned_cols=86 Identities=15% Similarity=0.175 Sum_probs=51.3
Q ss_pred eEEEEEECCEEEEEEEcCCcEEEEEeCCceecCCCcEEEEEEEECCEEEEEEcCeeeecccCCCCccceecCCceEEcCc
Q psy9228 707 FIAVAVVNGYLEYSYDLGDGVVTIKFSKKPVNDGIKHSVNVTRINKFGSLEVDSVIVGKGESPGSQDVINTRGNIYLGGT 786 (834)
Q Consensus 707 ~~~l~l~~G~l~~~~~~g~~~~~l~~s~~~~nDg~wH~V~i~r~~~~~~l~VD~~~~~~~~~~~~~~~l~~~~~lyiGG~ 786 (834)
|+.| ..||.-.+.-++.+. .+|..-...+.-++||++.+...+..++-.+||..+..... ......+.+-||-
T Consensus 580 ~f~v-~~~G~w~vt~d~~~~-~~l~~G~~~~~~~~WhtltL~~~g~~~ta~lng~~l~~~~~----~~~p~~G~aaIGT- 652 (669)
T PF02057_consen 580 FFWV-YANGTWSVTSDLAGT-TTLASGTADIGAGKWHTLTLTISGSTATAMLNGTVLWTDVD----SSYPKNGWAAIGT- 652 (669)
T ss_dssp EEEE-ETTTEEEEEEETTS--SEEEEEE-S--TT-EEEEEEEEETTEEEEEETTEEEEEEEE------SS---EEEEEE-
T ss_pred EEEE-EcCCcEEEeccCCCc-EEEeeeeecccCCeEEEEEEEEECCEEEEEECCEEeEEecc----cCCCCCceEEEEc-
Confidence 4444 569998888777653 34442334588899999999999999999999998765322 1122235556662
Q ss_pred CCCCCCCCCccCCCceEEEEEEEE
Q psy9228 787 PNMDLMTGGRYVHPMSGLMMNIHI 810 (834)
Q Consensus 787 p~~~~~~~~~~~~~F~GCi~~v~i 810 (834)
.....++..+|.|
T Consensus 653 -----------~~~~~~QFDNf~V 665 (669)
T PF02057_consen 653 -----------SSFETAQFDNFSV 665 (669)
T ss_dssp -----------SSS--EEEEEEEE
T ss_pred -----------CCCceeEeeeeEE
Confidence 1233467777776
No 270
>cd05761 Ig2_Necl-1-4_like Second immunoglobulin (Ig)-like domain of the nectin-like molecules Necl-1 - Necl-4 (also known as cell adhesion molecules CADM3, CADM1, CADM2, and CADM4, respectively). Ig2_Necl-1-4_like: domain similar to the second immunoglobulin (Ig)-like domain of the nectin-like molecules Necl-1 (also known as cell adhesion molecule 3 (CADM3)), Necl-2 (CADM1), Necl-3 (CADM2) and Necl-4 (CADM4). These nectin-like molecules have similar domain structures to those of nectins. At least five nectin-like molecules have been identified (Necl-1 - Necl-5). These have an extracellular region containing three Ig-like domains, one transmembrane region, and one cytoplasmic region. The N-terminal Ig-like domain of the extracellular region belongs to the V-type subfamily of Ig domains, is essential to cell-cell adhesion, and plays a part in the interaction with the envelope glycoprotein D of various viruses. Necl-1 and Necl-2 have Ca(2+)-independent homophilic and heterophilic cell-cel
Probab=54.61 E-value=13 Score=30.53 Aligned_cols=43 Identities=26% Similarity=0.449 Sum_probs=26.2
Q ss_pred CcEEEEecCC-CCCcccc----CCC------eEEEcccCCCCCeeEEEEEcCCC
Q psy9228 2 AYIKWSRADG-LPLQRYA----EGN------VLRITNARLQDSGKYKCEIQGHD 44 (834)
Q Consensus 2 ~~~~w~~~~~-~~~~~~~----~~~------~l~~~~~~~~d~g~y~c~~~~~~ 44 (834)
++|+|.|.+. |+..... .++ .|.+.-...+|...|.|.+.+..
T Consensus 15 ~~v~W~~~~~~l~~~~~~~~~~~~~~~~~~s~l~~~~~~~d~g~~~~C~v~h~~ 68 (82)
T cd05761 15 ATIRWFKGDKELKGVKLKEEDENGKTFTVTSSLRFQVDREDDGAPIICRVDHPA 68 (82)
T ss_pred CeEEEEECCEEccCceeeEecCCCCEEEEEEEEEEEcchhhCCCEEEEEEeChh
Confidence 4899999655 7643211 222 34444444556679999997654
No 271
>PF00139 Lectin_legB: Legume lectin domain; InterPro: IPR001220 Legume lectins are one of the largest lectin families with more than 70 lectins reported. Leguminous plant lectins resemble each other in their physicochemical properties although they differ in their carbohydrate specificities. They consist of two or four subunits with relative molecular mass of 30 kDa and each subunit has one carbohydrate-binding site. The interaction with sugars requires tightly bound calcium and manganese ions. The structural similarities of these lectins are reported by the primary structural analyses and X-ray crystallographic studies. X-ray studies have shown that the folding of the polypeptide chains in the region of the carbohydrate-binding sites is also similar, despite differences in the primary sequences. The carbohydrate-binding sites of these lectins consist of two conserved amino acids on beta pleated sheets. One of these loops contains transition metals, calcium and manganese, which keep the amino acid residues of the sugar-binding site at the required positions. Amino acid sequences of this loop play an important role in the carbohydrate-binding specificities of these lectins. These lectins bind either glucose/mannose or galactose. The exact function of legume lectins is not known but they may be involved in the attachment of nitrogen-fixing bacteria to legumes and in the protection against pathogens. Some legume lectins are proteolytically processed to produce two chains, beta (which corresponds to the N-terminal) and alpha (C-terminal) (IPR000985 from INTERPRO). The lectin concanavalin A (conA) from jack bean is exceptional in that the two chains are transposed and ligated (by formation of a new peptide bond). The N terminus of mature conA thus corresponds to that of the alpha chain and the C terminus to the beta chain.; GO: 0005488 binding; PDB: 1VLN_B 2GDF_C 2JE9_C 2JEC_C 1DGL_B 2P37_B 2CWM_A 2P34_D 2OW4_A 3IPV_B ....
Probab=54.59 E-value=25 Score=35.73 Aligned_cols=28 Identities=25% Similarity=0.143 Sum_probs=23.6
Q ss_pred CceecCCCcEEEEEEEEC--CEEEEEEcCe
Q psy9228 734 KKPVNDGIKHSVNVTRIN--KFGSLEVDSV 761 (834)
Q Consensus 734 ~~~~nDg~wH~V~i~r~~--~~~~l~VD~~ 761 (834)
...+.||+||+|.|.... +.+++.++..
T Consensus 161 ~~~l~~g~~~~v~I~Yd~~~~~L~V~l~~~ 190 (236)
T PF00139_consen 161 SFSLSDGKWHTVWIDYDASTKRLSVYLDDN 190 (236)
T ss_dssp EHHHGTTSEEEEEEEEETTTTEEEEEEEET
T ss_pred cccccCCcEEEEEEEEcCCccEEEEEEecc
Confidence 456899999999999998 6777777765
No 272
>cd00413 Glyco_hydrolase_16 glycosyl hydrolase family 16. The O-Glycosyl hydrolases are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A glycosyl hydrolase classification system based on sequence similarity has led to the definition of more than 95 different families inlcuding glycosyl hydrolase family 16. Family 16 includes lichenase, xyloglucan endotransglycosylase (XET), beta-agarase, kappa-carrageenase, endo-beta-1,3-glucanase, endo-beta-1,3-1,4-glucanase, and endo-beta-galactosidase, all of which have a conserved jelly roll fold with a deep active site channel harboring the catalytic residues.
Probab=54.37 E-value=2.1e+02 Score=28.15 Aligned_cols=30 Identities=20% Similarity=0.049 Sum_probs=26.6
Q ss_pred cCCCcEEEEEEEECCEEEEEEcCeeeeccc
Q psy9228 738 NDGIKHSVNVTRINKFGSLEVDSVIVGKGE 767 (834)
Q Consensus 738 nDg~wH~V~i~r~~~~~~l~VD~~~~~~~~ 767 (834)
..+.||...|.+....++.+|||..+....
T Consensus 140 ~~~~~H~Y~~~W~~~~i~~yvDG~~~~~~~ 169 (210)
T cd00413 140 PADDFHTYRVDWTPGEITFYVDGVLVATIT 169 (210)
T ss_pred CccCeEEEEEEEeCCEEEEEECCEEEEEEC
Confidence 478999999999999999999999887643
No 273
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=53.01 E-value=13 Score=27.24 Aligned_cols=16 Identities=25% Similarity=0.789 Sum_probs=12.6
Q ss_pred ceeeecCCCCCCCCCc
Q psy9228 591 GYNCSCLTGYSGDHCE 606 (834)
Q Consensus 591 ~~~C~C~~G~~G~~Ce 606 (834)
.-+|.|+++|+|..|+
T Consensus 18 ~G~C~C~~~~~G~~C~ 33 (50)
T cd00055 18 TGQCECKPNTTGRRCD 33 (50)
T ss_pred CCEEeCCCcCCCCCCC
Confidence 4578888888888886
No 274
>PF12955 DUF3844: Domain of unknown function (DUF3844); InterPro: IPR024382 This presumed domain is found in fungal species. It contains 8 largely conserved cysteine residues. This domain is found in proteins thought to be located in the endoplasmic reticulum.
Probab=52.99 E-value=13 Score=31.86 Aligned_cols=29 Identities=28% Similarity=0.626 Sum_probs=19.7
Q ss_pred CCCCCCCCEeccCCCC---CceEeeCCCCCCC
Q psy9228 143 HNNCINNGLCQDAATR---IGYTCICPPGFSG 171 (834)
Q Consensus 143 ~~pC~n~g~C~~~~~~---~~~~C~C~~g~~G 171 (834)
.+-|.++|.|+..... .=|.|.|.+.+..
T Consensus 12 Tn~CsgHG~C~~~~~~~~~~C~~C~C~~T~~~ 43 (103)
T PF12955_consen 12 TNNCSGHGSCVKKYGSGGGDCFACKCKPTVVK 43 (103)
T ss_pred ccCCCCCceEeeccCCCccceEEEEeeccccc
Confidence 6789999999976421 2377888764443
No 275
>cd06903 lectin_EMP46_EMP47 EMP46 and EMP47 type 1 transmembrane proteins, N-terminal lectin domain. EMP46 and EMP47, N-terminal carbohydrate recognition domain. EMP46 and EMP47 are fungal type-I transmembrane proteins that cycle between the endoplasmic reticulum and the golgi apparatus and are thought to function as cargo receptors that transport newly synthesized glycoproteins. EMP47 is a receptor for EMP46 responsible for the selective transport of EMP46 by forming hetero-oligomerization between the two proteins. EMP46 and EMP47 have an N-terminal lectin-like carbohydrate recognition domain (represented by this alignment model) as well as a C-terminal transmembrane domain. EMP46 and EMP47 are 45% sequence-identical to one another and have sequence homology to a class of intracellular lectins defined by ERGIC-53 and VIP36. L-type lectins have a dome-shaped beta-barrel carbohydrate recognition domain with a curved seven-stranded beta-sheet referred to as the "front face" and a flat s
Probab=51.52 E-value=2e+02 Score=28.64 Aligned_cols=26 Identities=12% Similarity=0.312 Sum_probs=21.3
Q ss_pred cCCCcEEEEEEEEC--CEEEEEEcCeee
Q psy9228 738 NDGIKHSVNVTRIN--KFGSLEVDSVIV 763 (834)
Q Consensus 738 nDg~wH~V~i~r~~--~~~~l~VD~~~~ 763 (834)
|.+.-.+++|+..+ ..++|.||+..-
T Consensus 149 n~~~p~~iri~Y~~~~~~l~v~vd~~~C 176 (215)
T cd06903 149 DSGVPSTIRLSYDALNSLFKVQVDNRLC 176 (215)
T ss_pred CCCCCEEEEEEEECCCCEEEEEECCCEE
Confidence 55667889999998 899999998643
No 276
>PHA02987 Ig domain OX-2-like protein; Provisional
Probab=50.97 E-value=11 Score=36.03 Aligned_cols=39 Identities=21% Similarity=0.359 Sum_probs=28.1
Q ss_pred CCCeEEEcccCCCCCeeEEEEEcCC-CCccCCcEEEEcCceeEEEcCC
Q psy9228 19 EGNVLRITNARLQDSGKYKCEIQGH-DSFRGSDYVKLNVERMMFVDGI 65 (834)
Q Consensus 19 ~~~~l~~~~~~~~d~g~y~c~~~~~-~~~~~~~~~~~~~~g~~cvd~~ 65 (834)
.+..|+|.+|+.+|+|-|.|.-..- .+. ++.|.+|+.-.
T Consensus 83 ~~StItIknVt~sDeGcY~C~F~tfp~G~--------~~~gt~CLtVt 122 (189)
T PHA02987 83 NESTILIKNVSLKDNGCYTCIFNTLLSKN--------NEKGVVCLNVT 122 (189)
T ss_pred CcceEEEEeCChhhCeEEEEEEEecCCCC--------CceeEEEEEEE
Confidence 3568999999999999999998522 120 15677776644
No 277
>KOG1218|consensus
Probab=49.64 E-value=22 Score=37.88 Aligned_cols=56 Identities=27% Similarity=0.668 Sum_probs=40.5
Q ss_pred ceeeecCCCCCCCCCccCccCCCCCCccCCCceeeecCCCcEE------cCCCCCCCCCccc
Q psy9228 591 GYNCSCLTGYSGDHCEKENNMCMKGDVCKNGGMCKVTPDSYEC------LCSLGYAPPNCAK 646 (834)
Q Consensus 591 ~~~C~C~~G~~G~~Ce~~~~~C~~~~pC~ngg~C~~~~~~~~C------~C~~g~~G~~Ce~ 646 (834)
.-.|.|++||.|.+|+.....|.....|.+++.|......-.| .|..+|.|..|+.
T Consensus 161 ~~~c~c~~g~~g~~~~~~~~~c~~~~~~~~g~~C~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 222 (316)
T KOG1218|consen 161 NGICTCQPGFVGVFCVESCSGCSPLTACENGAKCNRSTGSCLCYPGPSGACKGGFHGCACLR 222 (316)
T ss_pred CCceeccCCcccccccccCCCcCCCcccCCCCeeeccccccccCCCCcccccCCccCCcCcc
Confidence 5678899999999999887668877799999999876543322 3444455655553
No 278
>smart00407 IGc1 Immunoglobulin C-Type.
Probab=48.68 E-value=37 Score=27.16 Aligned_cols=41 Identities=12% Similarity=0.368 Sum_probs=23.4
Q ss_pred CcEEEEecCC-CCCcccc------CCC------eEEEcccCCCCCeeEEEEEcC
Q psy9228 2 AYIKWSRADG-LPLQRYA------EGN------VLRITNARLQDSGKYKCEIQG 42 (834)
Q Consensus 2 ~~~~w~~~~~-~~~~~~~------~~~------~l~~~~~~~~d~g~y~c~~~~ 42 (834)
++|+|.|.+. +..+... .++ .|+|+....++...|+|.+..
T Consensus 16 i~v~W~k~g~~~~~~~~~~~~~~~~~gt~~~~s~L~v~~~~~~~~~~~tC~V~H 69 (75)
T smart00407 16 ITVTWLRNGQEVTSGVSTTDPLKNSDGTYFLSSYLTVSASTWESGDTYTCQVTH 69 (75)
T ss_pred CEEEEEECCEECCCCEEECceEECCCCCEEEEEEEEEccccCCCCCEEEEEEEE
Confidence 5899999444 4443211 122 344444455667788887753
No 279
>PF09064 Tme5_EGF_like: Thrombomodulin like fifth domain, EGF-like; InterPro: IPR015149 This domain adopts a fold similar to other EGF domains, with a flat major and a twisted minor beta sheet. Disulphide pairing, however, is not of the usual 1-3, 2-4, 5-6 type; rather 1-2, 3-4, 5-6 pairing is found. Its extended major sheet (strands beta-2 and beta-3 and the connecting loop) projects into thrombin's active site groove. This domain is required for interaction of thrombomodulin with thrombin, and subsequent activation of protein-C []. ; GO: 0004888 transmembrane signaling receptor activity, 0016021 integral to membrane
Probab=46.80 E-value=18 Score=23.92 Aligned_cols=11 Identities=36% Similarity=0.773 Sum_probs=9.7
Q ss_pred ceeeecCCCCC
Q psy9228 591 GYNCSCLTGYS 601 (834)
Q Consensus 591 ~~~C~C~~G~~ 601 (834)
...|.||+||.
T Consensus 17 ~~~C~CPeGyI 27 (34)
T PF09064_consen 17 PGQCFCPEGYI 27 (34)
T ss_pred CCceeCCCceE
Confidence 67999999996
No 280
>cd05770 IgC_beta2m Class I major histocompatibility complex (MHC) beta2-microglobulin. IgC_beta2m: Immunoglobulin-like domain in beta2-Microglobulin (beta2m). Beta2m is the non-covalently bound light chain of the human class I major histocompatibility complex (MHC-I). Beta2m is structured as a beta-sandwich domain composed of two facing beta-sheets (four stranded and three stranded), that is typical of the C-type immunoglobulin superfamily. This structure is stabilized by an intramolecular disulfide bridge connecting two Cys residues in the facing beta -sheets. In vivo, MHC-I continuously exposes beta2m on the cell surface, where it may be released to plasmatic fluids, transported to the kidneys, degraded and then excreted.
Probab=46.38 E-value=32 Score=29.10 Aligned_cols=40 Identities=13% Similarity=0.257 Sum_probs=23.0
Q ss_pred CcEEEEecCC-CCCcccc------CCCeEEE---cccCCCCCeeEEEEEcC
Q psy9228 2 AYIKWSRADG-LPLQRYA------EGNVLRI---TNARLQDSGKYKCEIQG 42 (834)
Q Consensus 2 ~~~~w~~~~~-~~~~~~~------~~~~l~~---~~~~~~d~g~y~c~~~~ 42 (834)
++++|.|++. ++ +... .++...+ -.+..++...|.|....
T Consensus 32 i~v~W~~dg~~~~-~~~~~~~~p~~d~tyq~~s~l~~~~~~~~~ysC~V~H 81 (93)
T cd05770 32 IEIRLLKNGVKIP-KVEQSDLSFSKDWTFYLLKSTEFTPTKGDEYACRVRH 81 (93)
T ss_pred CEEEEEECCEECC-CcEECcEEECCCCCEEEEEEEEECCCCCCeEEEEEEE
Confidence 5789999554 76 4432 2333333 12345667777777753
No 281
>PF12955 DUF3844: Domain of unknown function (DUF3844); InterPro: IPR024382 This presumed domain is found in fungal species. It contains 8 largely conserved cysteine residues. This domain is found in proteins thought to be located in the endoplasmic reticulum.
Probab=44.36 E-value=12 Score=32.05 Aligned_cols=36 Identities=28% Similarity=0.709 Sum_probs=25.3
Q ss_pred CCccC--CCCC-CCeeecCC-----CCeEEeCCC-------------CcccCccccc
Q psy9228 179 EPCYP--GACG-DGSCQDVD-----GAMKCLCPI-------------GTAGKRCEQK 214 (834)
Q Consensus 179 ~~C~~--~~C~-~g~C~~~~-----~~~~C~C~~-------------g~~G~~Ce~~ 214 (834)
++|.. +.|. +|.|+... .=|.|.|.+ .|.|..|++.
T Consensus 6 ~aC~~~Tn~CsgHG~C~~~~~~~~~~C~~C~C~~T~~~~~~~~~ktt~W~G~aCqKk 62 (103)
T PF12955_consen 6 DACENATNNCSGHGSCVKKYGSGGGDCFACKCKPTVVKTGSGKGKTTHWGGPACQKK 62 (103)
T ss_pred HHHHHhccCCCCCceEeeccCCCccceEEEEeeccccccccccCceeeecccccccc
Confidence 34543 5676 69999873 448999987 4667788765
No 282
>cd07308 lectin_leg-like legume-like lectins: ERGIC-53, ERGL, VIP36, VIPL, EMP46, and EMP47. The legume-like (leg-like) lectins are eukaryotic intracellular sugar transport proteins with a carbohydrate recognition domain similar to that of the legume lectins. This domain binds high-mannose-type oligosaccharides for transport from the endoplasmic reticulum to the Golgi complex. These leg-like lectins include ERGIC-53, ERGL, VIP36, VIPL, EMP46, EMP47, and the UIP5 (ULP1-interacting protein 5) precursor protein. Leg-like lectins have different intracellular distributions and dynamics in the endoplasmic reticulum-Golgi system of the secretory pathway and interact with N-glycans of glycoproteins in a calcium-dependent manner, suggesting a role in glycoprotein sorting and trafficking. L-type lectins have a dome-shaped beta-barrel carbohydrate recognition domain with a curved seven-stranded beta-sheet referred to as the "front face" and a flat six-stranded beta-sheet referred to as the "ba
Probab=43.17 E-value=1.9e+02 Score=28.81 Aligned_cols=21 Identities=5% Similarity=0.188 Sum_probs=17.9
Q ss_pred CCcEEEEEEEECCEEEEEEcC
Q psy9228 740 GIKHSVNVTRINKFGSLEVDS 760 (834)
Q Consensus 740 g~wH~V~i~r~~~~~~l~VD~ 760 (834)
++-.+++|+...+.++|.|+.
T Consensus 152 ~~~~~~~I~y~~~~l~v~i~~ 172 (218)
T cd07308 152 NAPTTLRISYLNNTLKVDITY 172 (218)
T ss_pred CCCeEEEEEEECCEEEEEEeC
Confidence 578889999998899999964
No 283
>cd05721 IgV_CTLA-4 Immunoglobulin (Ig) domain of cytotoxic T lymphocyte-associated antigen 4 (CTLA-4). IgV_CTLA-4: domain similar to the variable(v)-type immunoglobulin (Ig) domain found in cytotoxic T lymphocyte-associated antigen 4 (CTLA-4). CTLA-4 is involved in the regulation of T cell response, acting as an inhibitor of intracellular signalling. CTLA-4 is similar to CD28, a T cell co-receptor protein that recognizes the B7 proteins (CD80 and CD86). CD28 binding of the B7 proteins occurs after the presentation of antigen to the T cell receptor (TCR) via the peptide-MHC complex on the surface of an antigen presenting cell (APC). CTLA-4 also binds the B7 molecules with a higher affinity than does CD28. The B7/CTLA-4 interaction generates inhibitory signals down-regulating the response, and may prevent T cell activation by weak TCR signals. CD28 and CTLA-4 then elicit opposing signals in the regulation of T cell responsiveness and homeostasis. T cell activation leads to increased
Probab=42.91 E-value=17 Score=32.04 Aligned_cols=23 Identities=22% Similarity=0.349 Sum_probs=19.7
Q ss_pred eEEEcccCCCCCeeEEEEEcCCC
Q psy9228 22 VLRITNARLQDSGKYKCEIQGHD 44 (834)
Q Consensus 22 ~l~~~~~~~~d~g~y~c~~~~~~ 44 (834)
.|+|.+++.+|++.|.|.-.-.+
T Consensus 75 ~L~l~~L~a~DTa~Y~Ca~e~my 97 (115)
T cd05721 75 NFTLQNLRANQTDIYFCKIELMY 97 (115)
T ss_pred EEEEcCCCHHHCeEEEEEeeecc
Confidence 68899999999999999875443
No 284
>cd05767 IgC_MHC_II_alpha Class II major histocompatibility complex (MHC) alpha chain immunoglobulin domain. IgC_MHC_II_alpha: Immunoglobulin (Ig) domain of major histocompatibility complex (MHC) class II alpha chain. MHC class II molecules play a key role in the initiation of the antigen-specific immune reponse. In both humans and in mice these molecules have been shown to be expressed constitutively on the cell surface of professional antigen-presenting cells (APCs), for example on B-lymphocytes, monocytes, and macrophages. The expression of these molecules has been shown to be induced in nonprofessional APCs such as keratinocyctes, and they are expressed on the surface of activated human T cells and on T cells from other species. The MHC II molecules present antigenic peptides to CD4(+) T-lymphocytes. These peptides derive mostly from protelytic processing via the endocytic pathway, of antigens internalized by the APC. These peptides bind to the MHC class II molecules in the endosom
Probab=41.51 E-value=43 Score=28.41 Aligned_cols=14 Identities=7% Similarity=0.211 Sum_probs=10.2
Q ss_pred CcEEEEecCC-CCCc
Q psy9228 2 AYIKWSRADG-LPLQ 15 (834)
Q Consensus 2 ~~~~w~~~~~-~~~~ 15 (834)
++++|.|++. ++.+
T Consensus 32 I~v~W~~~g~~~~~~ 46 (94)
T cd05767 32 LNVTWLKNGVPVTDG 46 (94)
T ss_pred CEEEEEECCeEccCc
Confidence 5789999555 7654
No 285
>PF14099 Polysacc_lyase: Polysaccharide lyase; PDB: 3ILR_A 3IKW_A 3INA_A 3IMN_A 3IN9_A 2ZZJ_A.
Probab=40.26 E-value=1.4e+02 Score=29.89 Aligned_cols=59 Identities=8% Similarity=0.033 Sum_probs=41.5
Q ss_pred CCCeEEEEEECcEEEEEEeccc-------EEEEeeeeecCCCeEEEEEEEEC-----CeEEEEECCeeeee
Q psy9228 436 KGDFVSFGLEDGYPVFRFDVGL-------VVLRSKVTLVPHEWVVVTIIKDF-----KEGKLSVGGEPLIV 494 (834)
Q Consensus 436 ~~d~~~l~L~~G~l~~~~~~G~-------~~i~s~~~~~dg~wH~V~v~~~~-----~~~~l~VD~~~~~~ 494 (834)
....++|.+.++++.+.+.... ........+.-|+||.+.+...- ..+++.+||.....
T Consensus 112 ~~P~~~l~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~G~W~~~~i~~~~s~~~~G~~~vw~nG~~v~~ 182 (224)
T PF14099_consen 112 GSPPFALRIKGGRLYLRVRGDEPSDSGNKAYSVDLGPVERGKWHDFVIHVKWSPDSDGFLEVWLNGKLVVD 182 (224)
T ss_dssp EEECEEEEEETTEEEEEEEEE-TCEEEEEEEEEECCCS-TTSEEEEEEEEEE-CCCTEEEEEEECCEECCE
T ss_pred CCCcEEEEEeCCEEEEEEEcCCCCcccceeEeecCCCcCCCcEEEEEEEEEECCCCCEEEEEEECCEEEEE
Confidence 4567889999999998886543 23334556778999999987532 35778899976654
No 286
>cd02176 GH16_XET Xyloglucan endotransglycosylase, member of glycosyl hydrolase family 16. Xyloglucan endotransglycosylases (XETs) cleave and religate xyloglucan polymers in plant cell walls via a transglycosylation mechanism. Xyloglucan is a soluble hemicellulose with a backbone of beta-1,4-linked glucose units, partially substituted with alpha-1,6-linked xylopyranose branches. It binds noncovalently to cellulose, cross-linking the adjacent cellulose microfibrils, giving it a key structural role as a matrix polymer. Therefore, XET plays an important role in all plant processes that require cell wall remodeling.
Probab=40.17 E-value=1.1e+02 Score=31.54 Aligned_cols=76 Identities=11% Similarity=0.018 Sum_probs=49.7
Q ss_pred CCCcEEEEEEEECCEEEEEEcCeeeecccCC---CC----ccceecCCceEEcCcCCCCC--CCCCccCCCceEEEEEEE
Q psy9228 739 DGIKHSVNVTRINKFGSLEVDSVIVGKGESP---GS----QDVINTRGNIYLGGTPNMDL--MTGGRYVHPMSGLMMNIH 809 (834)
Q Consensus 739 Dg~wH~V~i~r~~~~~~l~VD~~~~~~~~~~---~~----~~~l~~~~~lyiGG~p~~~~--~~~~~~~~~F~GCi~~v~ 809 (834)
...+|+-.|.+....+...|||..+...... +. .+.+.+...|+.||.....- ....-....|+-=++++.
T Consensus 120 t~dFHtY~i~Wtp~~I~fyVDG~~vr~~~~~~~~g~~~P~~~Pm~l~~niW~g~~WAt~gG~~~~d~~~aPf~a~~~~~~ 199 (263)
T cd02176 120 TADFHTYSILWNPHQIVFYVDDVPIRVFKNNEALGVPYPSSQPMGVYASIWDGSDWATQGGRVKIDWSYAPFVASYRDFK 199 (263)
T ss_pred CCCeEEEEEEEccceEEEEECCEEEEEEecccccCCCCCccceEEEEEeeEcCCCcccCCCcccccCCCCCeeEEEeeEE
Confidence 4689999999999999999999887543211 11 12344445677787543211 011113567999999999
Q ss_pred ECCce
Q psy9228 810 IQNKH 814 (834)
Q Consensus 810 in~~~ 814 (834)
+++-.
T Consensus 200 ~~~c~ 204 (263)
T cd02176 200 LDGCV 204 (263)
T ss_pred Eeeee
Confidence 98754
No 287
>cd02183 GH16_fungal_CRH1_transglycosylase glycosylphosphatidylinositol-glucanosyltransferase. Group of fungal GH16 members related to Saccharomyces cerevisiae Crh1p. Chr1p and Crh2p are transglycosylases that are required for the linkage of chitin to beta(1-3)glucose branches of beta(1-6)glucan, an important step in the assembly of new cell wall. Both have been shown to be glycosylphosphatidylinositol (GPI)-anchored. A third homologous protein, Crr1p, functions in the formation of the spore wall. They belongs to the family 16 of glycosyl hydrolases that includes lichenase, xyloglucan endotransglycosylase (XET), beta-agarase, kappa-carrageenase, endo-beta-1,3-glucanase, endo-beta-1,3-1,4-glucanase, and endo-beta-galactosidase, all of which have a conserved jelly roll fold with a deep active site channel harboring the catalytic residues.
Probab=38.37 E-value=4e+02 Score=26.26 Aligned_cols=27 Identities=11% Similarity=0.008 Sum_probs=24.2
Q ss_pred CCcEEEEEEEECCEEEEEEcCeeeecc
Q psy9228 740 GIKHSVNVTRINKFGSLEVDSVIVGKG 766 (834)
Q Consensus 740 g~wH~V~i~r~~~~~~l~VD~~~~~~~ 766 (834)
..||.-.|.+....++.+|||..+...
T Consensus 115 ~dFHtY~veWtpd~I~~yVDG~~v~~~ 141 (203)
T cd02183 115 EEFHTYTIDWTKDRITWYIDGKVVRTL 141 (203)
T ss_pred cCcEEEEEEEecCEEEEEECCEEEEEE
Confidence 689999999999999999999887554
No 288
>PF01683 EB: EB module; InterPro: IPR006149 The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO
Probab=37.87 E-value=46 Score=24.43 Aligned_cols=22 Identities=36% Similarity=1.211 Sum_probs=16.1
Q ss_pred CCCCCCCCEeccCCCCCceEeeCCCCCC
Q psy9228 143 HNNCINNGLCQDAATRIGYTCICPPGFS 170 (834)
Q Consensus 143 ~~pC~n~g~C~~~~~~~~~~C~C~~g~~ 170 (834)
+..|..+..|.+. +|.|++||.
T Consensus 25 ~~qC~~~s~C~~g------~C~C~~g~~ 46 (52)
T PF01683_consen 25 DEQCIGGSVCVNG------RCQCPPGYV 46 (52)
T ss_pred cCCCCCcCEEcCC------EeECCCCCE
Confidence 5667788888653 688888874
No 289
>PF04863 EGF_alliinase: Alliinase EGF-like domain; InterPro: IPR006947 Allicin is a thiosulphinate that gives rise to dithiines, allyl sulphides and ajoenes, the three groups of active compounds in Allium species. Allicin is synthesised from sulphoxide cysteine derivatives by alliinase, whose C-S lyase activity cleaves C(beta)-S(gamma) bonds. It is thought that this enzyme forms part of a primitive plant defence system [].; GO: 0016846 carbon-sulfur lyase activity; PDB: 1LK9_B 2HOX_C 2HOR_A.
Probab=33.43 E-value=14 Score=27.29 Aligned_cols=39 Identities=26% Similarity=0.505 Sum_probs=18.6
Q ss_pred CCCCCCCCEec-cCCC-CCceEeeCCCCCCCCCCCCCCCCc
Q psy9228 143 HNNCINNGLCQ-DAAT-RIGYTCICPPGFSGDRCSVLGEPC 181 (834)
Q Consensus 143 ~~pC~n~g~C~-~~~~-~~~~~C~C~~g~~G~~Ce~~~~~C 181 (834)
.-+|...|... +... .+.-.|.|..-|+|++|++.+..|
T Consensus 16 ai~CSGHGr~flDg~~~dG~p~CECn~Cy~GpdCS~~~~~C 56 (56)
T PF04863_consen 16 AISCSGHGRAFLDGLIADGSPVCECNSCYGGPDCSTLIPNC 56 (56)
T ss_dssp TS--TTSEE--TTS-EETTEE--EE-TTEESTTS-EE-TT-
T ss_pred cCCcCCCCeeeeccccccCCccccccCCcCCCCcccCCCCC
Confidence 45666667665 3322 234689999999999997655443
No 290
>PF04863 EGF_alliinase: Alliinase EGF-like domain; InterPro: IPR006947 Allicin is a thiosulphinate that gives rise to dithiines, allyl sulphides and ajoenes, the three groups of active compounds in Allium species. Allicin is synthesised from sulphoxide cysteine derivatives by alliinase, whose C-S lyase activity cleaves C(beta)-S(gamma) bonds. It is thought that this enzyme forms part of a primitive plant defence system [].; GO: 0016846 carbon-sulfur lyase activity; PDB: 1LK9_B 2HOX_C 2HOR_A.
Probab=33.40 E-value=19 Score=26.65 Aligned_cols=36 Identities=19% Similarity=0.507 Sum_probs=15.0
Q ss_pred CCCCCCeecccC-CCCCceeeecCCCCCCCCCccCcc
Q psy9228 575 PCQNYGICYPTD-TSERGYNCSCLTGYSGDHCEKENN 610 (834)
Q Consensus 575 pC~ngg~C~~~~-~~~~~~~C~C~~G~~G~~Ce~~~~ 610 (834)
+|.-.|+...+. ...+...|+|..-|.|+.|..-+.
T Consensus 18 ~CSGHGr~flDg~~~dG~p~CECn~Cy~GpdCS~~~~ 54 (56)
T PF04863_consen 18 SCSGHGRAFLDGLIADGSPVCECNSCYGGPDCSTLIP 54 (56)
T ss_dssp --TTSEE--TTS-EETTEE--EE-TTEESTTS-EE-T
T ss_pred CcCCCCeeeeccccccCCccccccCCcCCCCcccCCC
Confidence 344445554221 112357788888888877776543
No 291
>KOG3512|consensus
Probab=32.25 E-value=1.4e+02 Score=32.92 Aligned_cols=31 Identities=19% Similarity=0.139 Sum_probs=23.3
Q ss_pred EEEcCceeEEE-cCCcceeeeeCCcccCCCcc
Q psy9228 52 VKLNVERMMFV-DGIGPFSGESQGAFQGLDLS 82 (834)
Q Consensus 52 ~~~~~~g~~cv-d~~~~~~~~~~~~~~g~~~~ 82 (834)
|..|-....|| |....++|.|.-+-.|.+|+
T Consensus 278 CKCNgHAs~Cv~d~~~~ltCdC~HNTaGPdCg 309 (592)
T KOG3512|consen 278 CKCNGHASRCVMDESSHLTCDCEHNTAGPDCG 309 (592)
T ss_pred eeecCccceeeeccCCceEEecccCCCCCCcc
Confidence 44444557786 44555999999999999998
No 292
>cd07698 IgC_MHC_I_alpha3 Class I major histocompatibility complex (MHC) alpha chain immunoglobulin domain. IgC_MHC_I_alpha3; Immunoglobulin (Ig) domain of major histocompatibility complex (MHC) class I alpha chain. Class I MHC proteins bind antigenic peptide fragments and present them to CD8+ T lymphocytes. Class I molecules consist of a transmembrane alpha chain and a small chain called the beta2 microglobulin. The alpha chain contains three extracellular domains, two of which fold together to form the peptide-binding cleft (alpha1 and alpha2), and one which has an Ig fold (alpha3). Peptide binding to class I molecules occurs in the endoplasmic reticulum (ER) and involves both chaperones and dedicated factors to assist in peptide loading. Class I MHC molecules are expressed on most nucleated cells.
Probab=30.10 E-value=91 Score=26.11 Aligned_cols=41 Identities=12% Similarity=0.313 Sum_probs=22.8
Q ss_pred CcEEEEecCC-CCCcccc------CCCeEEE---cccCCCCCeeEEEEEcC
Q psy9228 2 AYIKWSRADG-LPLQRYA------EGNVLRI---TNARLQDSGKYKCEIQG 42 (834)
Q Consensus 2 ~~~~w~~~~~-~~~~~~~------~~~~l~~---~~~~~~d~g~y~c~~~~ 42 (834)
.+++|.|.+. +..+... .++...+ -.|..+|...|+|.+..
T Consensus 31 i~v~W~~~g~~~~~~~~~~~~~~~~d~ty~~~s~l~v~~~~~~~ytC~V~H 81 (93)
T cd07698 31 IEVTWLRDGEDSVDDVESGEILPNGDGTYQLWVTLEVPPEDKARYSCRVEH 81 (93)
T ss_pred cEEEEEECCEECcccccccceEECCCCeEEEEEEEEECCCCCCEEEEEEEe
Confidence 4789999664 3332211 2332222 23445578888888854
No 293
>cd06901 lectin_VIP36_VIPL VIP36 and VIPL type 1 transmembrane proteins, lectin domain. The vesicular integral protein of 36 kDa (VIP36) is a type 1 transmembrane protein of the mammalian early secretory pathway that acts as a cargo receptor transporting high mannose type glycoproteins between the Golgi and the endoplasmic reticulum (ER). Lectins of the early secretory pathway are involved in the selective transport of newly synthesized glycoproteins from the ER to the ER-Golgi intermediate compartment (ERGIC). The most prominent cycling lectin is the mannose-binding type1 membrane protein ERGIC-53, which functions as a cargo receptor to facilitate export of glycoproteins from the ER. L-type lectins have a dome-shaped beta-barrel carbohydrate recognition domain with a curved seven-stranded beta-sheet referred to as the "front face" and a flat six-stranded beta-sheet referred to as the "back face". This domain homodimerizes so that adjacent back sheets form a contiguous 12-stranded she
Probab=28.49 E-value=4.7e+02 Score=26.71 Aligned_cols=21 Identities=10% Similarity=-0.092 Sum_probs=15.3
Q ss_pred CCcEEEEEEEECCEEEEEEcC
Q psy9228 740 GIKHSVNVTRINKFGSLEVDS 760 (834)
Q Consensus 740 g~wH~V~i~r~~~~~~l~VD~ 760 (834)
+.--+++|...++.++|.||-
T Consensus 157 ~~~t~~rI~Y~~~~l~v~vd~ 177 (248)
T cd06901 157 DHDTFVAIRYSKGRLTVMTDI 177 (248)
T ss_pred CCCeEEEEEEECCeEEEEEec
Confidence 333467888888888888873
No 294
>cd05847 IgC_CH2_IgE CH2 domain (second constant Ig domain of the heavy chain) in immunoglobulin E (IgE). IgC_CH2_IgE: The second constant domain of the heavy chain of immunoglobulin E (IgE). The basic structure of immunoglobulin (Ig) molecules is a tetramer of two light chains and two heavy chains linked by disulfide bonds. There are two types of light chains: kappa and lambda; each composed of a constant domain and a variable domain. There are five types of heavy chains: alpha, delta, epsilon, gamma, and mu, all consisting of a variable domain (VH) and three (in alpha, delta, and gamma) or four (in epsilon and mu) constant domains (CH1 to CH4). The different classes of antibodies vary in their heavy chains; the IgE class has the epsilon type. This domain (Cepsilon2) of IgE is in place of the flexible hinge region found in IgG.
Probab=28.05 E-value=92 Score=26.35 Aligned_cols=41 Identities=20% Similarity=0.423 Sum_probs=21.9
Q ss_pred CcEEEEecCC-CCCcccc------CCC------eEEEcccCCCCCeeEEEEEcC
Q psy9228 2 AYIKWSRADG-LPLQRYA------EGN------VLRITNARLQDSGKYKCEIQG 42 (834)
Q Consensus 2 ~~~~w~~~~~-~~~~~~~------~~~------~l~~~~~~~~d~g~y~c~~~~ 42 (834)
++|+|.|.+. ++.+... .++ .|.|..-+-.+...|.|....
T Consensus 31 I~vtW~~dg~~~~~~~~~s~~~~~~~~Ty~~~S~L~v~~~~w~~~~~ytC~V~H 84 (94)
T cd05847 31 IEVEWLVDGQVATLVAASTAPQKEEGSTFSTTSELNVTQEDWKSGKTYTCKVTH 84 (94)
T ss_pred CEEEEEECCEECcCceeecccEECCCCcEEEEEEEEEchHHhcCCCeEEEEEEE
Confidence 5899999444 6544221 233 333332222355678888754
No 295
>KOG3512|consensus
Probab=27.62 E-value=98 Score=33.95 Aligned_cols=64 Identities=25% Similarity=0.544 Sum_probs=44.6
Q ss_pred CEeccCCCCCceEeeCCCCCCCCCCCC----------------CCCCccCCCCC-CC-eeec---------CCCCeEEe-
Q psy9228 150 GLCQDAATRIGYTCICPPGFSGDRCSV----------------LGEPCYPGACG-DG-SCQD---------VDGAMKCL- 201 (834)
Q Consensus 150 g~C~~~~~~~~~~C~C~~g~~G~~Ce~----------------~~~~C~~~~C~-~g-~C~~---------~~~~~~C~- 201 (834)
..|+-.. ...++|.|.-+-+|+.|+. +.++|....|. ++ .|.- .-.+-.|.
T Consensus 285 s~Cv~d~-~~~ltCdC~HNTaGPdCgrCKpfy~dRPW~raT~~~a~~c~ac~Cn~harrcrfn~Ely~lSgr~SggvCln 363 (592)
T KOG3512|consen 285 SRCVMDE-SSHLTCDCEHNTAGPDCGRCKPFYYDRPWGRATALPANECVACNCNGHARRCRFNMELYRLSGRRSGGVCLN 363 (592)
T ss_pred ceeeecc-CCceEEecccCCCCCCcccccccccCCCccccccCCCccccccccchhhhhcccchhhhcccCccccceEee
Confidence 4577443 1449999999999999985 47788877775 44 4541 12445674
Q ss_pred CCCCcccCccccc
Q psy9228 202 CPIGTAGKRCEQK 214 (834)
Q Consensus 202 C~~g~~G~~Ce~~ 214 (834)
|.....|++|..-
T Consensus 364 CrHnTaGrhChyC 376 (592)
T KOG3512|consen 364 CRHNTAGRHCHYC 376 (592)
T ss_pred cccCCCCcccccc
Confidence 8899999999753
No 296
>cd05771 IgC_Tapasin_R Tapasin-R immunoglobulin-like domain. IgC_Tapasin_R: Immunoglobulin-like domain on Tapasin-R. Tapasin is a V-C1 (variable-constant) immunoglobulin superfamily molecule present in the endoplasmic reticulum (ER), where it links MHC class I molecules to the transporter associated with antigen processing (TAP). Tapasin-R is a tapasin-related protein that contains similar structural motifs to Tapasin, with some marked differences, especially in the V domain, transmembrane and cytoplasmic regions. The majority of Tapasin-R is located within the ER; however, there may be some expression of Tapasin-R at the cell surface. Tapasin-R lacks an obvious ER retention signal.
Probab=27.22 E-value=86 Score=28.60 Aligned_cols=10 Identities=20% Similarity=0.694 Sum_probs=7.8
Q ss_pred CcEEEEecCC
Q psy9228 2 AYIKWSRADG 11 (834)
Q Consensus 2 ~~~~w~~~~~ 11 (834)
++|+|.|.+.
T Consensus 70 ~~i~W~~~g~ 79 (139)
T cd05771 70 VQVEWTREPP 79 (139)
T ss_pred eEEEEEECCC
Confidence 4899999654
No 297
>PF07081 DUF1349: Protein of unknown function (DUF1349); InterPro: IPR009784 This family consists of several hypothetical bacterial proteins but contains one sequence (P40893 from SWISSPROT) from Saccharomyces cerevisiae. Members of this family are typically around 200 residues in length. The function of this family is unknown.; PDB: 3MEP_B 3O12_A.
Probab=26.63 E-value=5.9e+02 Score=24.56 Aligned_cols=116 Identities=15% Similarity=0.092 Sum_probs=58.3
Q ss_pred eEEEEEEe-CCCCeeEEecCCCCCCCCCCcceEEEEE---ECCEEEEEEEcCCcEEEEEeCCcee-cCCCcEEEEEEEEC
Q psy9228 677 TIAFDFVT-DDKNALLLWNGQPSYKNGIGREFIAVAV---VNGYLEYSYDLGDGVVTIKFSKKPV-NDGIKHSVNVTRIN 751 (834)
Q Consensus 677 ~i~~~frT-~~~~GlLl~~~~~~~~~~~~~~~~~l~l---~~G~l~~~~~~g~~~~~l~~s~~~~-nDg~wH~V~i~r~~ 751 (834)
.+++.++. .+.-|||+|... ..|+-..+ .+|..+++.-.-.+...- +..++ .++..-.++|.|.+
T Consensus 55 ~v~~~~~~~YDQaGL~v~~~~--------~~WiK~giE~~~~g~~~l~sV~t~~~SDw--s~~~~~~~~~~~~lrv~R~g 124 (183)
T PF07081_consen 55 KVSGDFKEQYDQAGLMVYQDE--------DNWIKAGIEYSNDGTPRLSSVVTNGYSDW--SLSPLPSDGQSVWLRVERRG 124 (183)
T ss_dssp EEEE---STT-EEEEEEEEET--------TEEEEEEEEE-ETTCEEEEEEEEESSEEE--EEEE--SBTTSEEEEEEEET
T ss_pred EEEeCCccceeeEEEEEEECC--------cccEEEEEEEecCCCceEEEEeccCcccc--cccccCCCCCEEEEEEEEeC
Confidence 34444444 346788888865 35776655 478766653221111111 22233 56677789999999
Q ss_pred CEEEEEE--cCeeeecccCCCCccceecCCceEEcCcCCCCCCCCCccCCCceEEEEEEEECC
Q psy9228 752 KFGSLEV--DSVIVGKGESPGSQDVINTRGNIYLGGTPNMDLMTGGRYVHPMSGLMMNIHIQN 812 (834)
Q Consensus 752 ~~~~l~V--D~~~~~~~~~~~~~~~l~~~~~lyiGG~p~~~~~~~~~~~~~F~GCi~~v~in~ 812 (834)
..+.++. ||..-.... ...+.....+.||=+--. +...+|.-...+|+|..
T Consensus 125 ~~~~~~ys~DG~~w~~~R----~~~~~~~~~~~VG~~A~s------P~~~g~~~~F~~f~i~~ 177 (183)
T PF07081_consen 125 DDLWIYYSADGKTWTLLR----IFHFPEDWEVQVGVYACS------PQGEGFEVEFDDFSITP 177 (183)
T ss_dssp TEEEEEEESSSS---EEE----EEE--S-S-EEEEEEEE-------SSSS--EEEEEEEEEE-
T ss_pred CEEEEEEEcCCCEEEEEE----EEECCCCCcEEEEEEEeC------CCCCcEEEEEeEEEEEc
Confidence 9887765 554322111 111233456777733211 23678999999999843
No 298
>cd05766 IgC_MHC_II_beta Class II major histocompatibility complex (MHC) beta chain immunoglobulin domain. IgC_MHC_II_beta: Immunoglobulin (Ig) domain of major histocompatibility complex (MHC) class II beta chain. MHC class II molecules play a key role in the initiation of the antigen-specific immune reponse. In both humans and in mice these molecules have been shown to be expressed constitutively on the cell surface of professional antigen-presenting cells (APCs), for example on B-lymphocytes, monocytes, and macrophages. The expression of these molecules has been shown to be induced in nonprofessional APCs such as keratinocyctes, and they are expressed on the surface of activated human T cells and on T cells from other species. The MHC II molecules present antigenic peptides to CD4(+) T-lymphocytes. These peptides derive mostly from protelytic processing via the endocytic pathway, of antigens internalized by the APC. These peptides bind to the MHC class II molecules in the endosome be
Probab=26.56 E-value=1.2e+02 Score=25.65 Aligned_cols=14 Identities=14% Similarity=0.240 Sum_probs=9.8
Q ss_pred CcEEEEecCC-CCCc
Q psy9228 2 AYIKWSRADG-LPLQ 15 (834)
Q Consensus 2 ~~~~w~~~~~-~~~~ 15 (834)
.+++|.|++. +..+
T Consensus 31 i~v~W~~~g~~~~~~ 45 (94)
T cd05766 31 ITVKWFKNGQEETEG 45 (94)
T ss_pred CEEEEEECCeecCCC
Confidence 5789999555 6554
No 299
>cd08023 GH16_laminarinase_like Laminarinase, member of the glycosyl hydrolase family 16. Laminarinase, also known as glucan endo-1,3-beta-D-glucosidase, is a glycosyl hydrolase family 16 member that hydrolyzes 1,3-beta-D-glucosidic linkages in 1,3-beta-D-glucans such as laminarins, curdlans, paramylons, and pachymans, with very limited action on mixed-link (1,3-1,4-)-beta-D-glucans.
Probab=26.08 E-value=6.7e+02 Score=25.07 Aligned_cols=31 Identities=13% Similarity=-0.140 Sum_probs=26.9
Q ss_pred ecCCCcEEEEEEEECCEEEEEEcCeeeeccc
Q psy9228 737 VNDGIKHSVNVTRINKFGSLEVDSVIVGKGE 767 (834)
Q Consensus 737 ~nDg~wH~V~i~r~~~~~~l~VD~~~~~~~~ 767 (834)
-....||.-.+.+....++.+|||..+....
T Consensus 155 ~~~~~fHtY~~~W~p~~i~~yvDG~~v~~~~ 185 (235)
T cd08023 155 DLSDDFHTYAVEWTPDKITFYVDGKLYFTYT 185 (235)
T ss_pred CcCCCcEEEEEEEECCEEEEEECCEEEEEEc
Confidence 3568999999999999999999999886653
No 300
>cd02178 GH16_beta_agarase Beta-agarase, member of glycosyl hydrolase family 16. Beta-agarase is a glycosyl hydrolase family 16 (GH16) member that hydrolyzes the internal beta-1,4-linkage of agarose, a hydrophilic polysaccharide found in the cell wall of Rhodophyceaea, marine red algae. Agarose is a linear chain of galactose units linked by alternating L-alpha-1,3- and D-beta-1,4-linkages that are additionally modified by a 3,6-anhydro-bridge. Agarose forms thermo-reversible gels that are widely used in the food industry or as a laboratory medium. While beta-agarases are also found in two other families derived from the sequence-based classification of glycosyl hydrolases (GH50, and GH86) the GH16 members are most abundant. This domain adopts a curved beta-sandwich conformation, with a tunnel-shaped active site cavity, referred to as a jellyroll fold.
Probab=25.55 E-value=1.4e+02 Score=30.72 Aligned_cols=28 Identities=18% Similarity=0.007 Sum_probs=24.6
Q ss_pred CCCcEEEEEEEE-CCEEEEEEcCeeeecc
Q psy9228 739 DGIKHSVNVTRI-NKFGSLEVDSVIVGKG 766 (834)
Q Consensus 739 Dg~wH~V~i~r~-~~~~~l~VD~~~~~~~ 766 (834)
...||.-.|.+. ...++.+|||..+...
T Consensus 178 ~~~fHtY~veW~~p~~i~fyvDG~~~~~~ 206 (258)
T cd02178 178 ADDFHVYGVYWKDPDTIRFYIDGVLVRTV 206 (258)
T ss_pred ccCeEEEEEEEcCCCeEEEEECCEEEEEE
Confidence 458999999999 9999999999887654
No 301
>PF08787 Alginate_lyase2: Alginate lyase; InterPro: IPR014895 Alginate lyases are enzymes that degrade the linear polysaccharide alignate. They cleave the glycosidic linkage of alignate through a beta-elimination reaction. This region forms an all beta fold, which is different to the all alpha fold of IPR008397 from INTERPRO. ; PDB: 1VAV_B 1UAI_A 1J1T_A 2Z42_A 2ZAC_A 2ZAB_A 2ZAA_A 2ZA9_A 2CWS_A.
Probab=24.90 E-value=4.9e+02 Score=26.30 Aligned_cols=52 Identities=10% Similarity=-0.090 Sum_probs=37.8
Q ss_pred ECCEEEEEEE---cCCcEEEEEeCCceecCCCcEEEEEEEECCEEEEEEcCeeeec
Q psy9228 713 VNGYLEYSYD---LGDGVVTIKFSKKPVNDGIKHSVNVTRINKFGSLEVDSVIVGK 765 (834)
Q Consensus 713 ~~G~l~~~~~---~g~~~~~l~~s~~~~nDg~wH~V~i~r~~~~~~l~VD~~~~~~ 765 (834)
.+|.|.+.++ ..++..... .-..|.-|+|.++.|...++.+++.|++.....
T Consensus 135 ~~G~l~~~~~~~~~~~~~~~~~-~~~~i~LG~~F~y~I~v~~~~l~V~ing~~~~~ 189 (236)
T PF08787_consen 135 EKGSLYVYVRQSNPDGGDQEYT-IYGGIPLGEWFSYEIEVSGGTLTVTINGEGKTT 189 (236)
T ss_dssp ETTEEEEEEESSTTTTSEEEEE-EEEEEETT-EEEEEEEEETTEEEEEETTEEEEE
T ss_pred cCCeEEEEEeccCCCCCcEEee-eEcceeCCCEEEEEEEEECCEEEEEEECCcceE
Confidence 7899999998 122222221 123677789999999999999999999987654
No 302
>PF02057 Glyco_hydro_59: Glycosyl hydrolase family 59; InterPro: IPR001286 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 59 GH59 from CAZY comprises enzymes with only one known activity; galactocerebrosidase (3.2.1.46 from EC). Globoid cell leukodystrophy (Krabbe disease) is a severe, autosomal recessive disorder that results from deficiency of galactocerebrosidase (GALC) activity [, , ]. GALC is responsible for the lysosomal catabolism of certain galactolipids, including galactosylceramide and psychosine [].; GO: 0004336 galactosylceramidase activity, 0006683 galactosylceramide catabolic process; PDB: 3ZR6_A 3ZR5_A.
Probab=24.40 E-value=8.9e+02 Score=28.60 Aligned_cols=58 Identities=17% Similarity=0.205 Sum_probs=40.5
Q ss_pred CCCeEEEEEECcEEEEEEeccc-EEEEe-eeeecCCCeEEEEEEEECCeEEEEECCeeeee
Q psy9228 436 KGDFVSFGLEDGYPVFRFDVGL-VVLRS-KVTLVPHEWVVVTIIKDFKEGKLSVGGEPLIV 494 (834)
Q Consensus 436 ~~d~~~l~L~~G~l~~~~~~G~-~~i~s-~~~~~dg~wH~V~v~~~~~~~~l~VD~~~~~~ 494 (834)
.+-|+.| ..+|.-.+.-++.. .++.+ ...+..++||++.+...+..+.-.+|+...-.
T Consensus 577 ~G~~f~v-~~~G~w~vt~d~~~~~~l~~G~~~~~~~~WhtltL~~~g~~~ta~lng~~l~~ 636 (669)
T PF02057_consen 577 RGYFFWV-YANGTWSVTSDLAGTTTLASGTADIGAGKWHTLTLTISGSTATAMLNGTVLWT 636 (669)
T ss_dssp EEEEEEE-ETTTEEEEEEETTS-SEEEEEE-S--TT-EEEEEEEEETTEEEEEETTEEEEE
T ss_pred CeEEEEE-EcCCcEEEeccCCCcEEEeeeeecccCCeEEEEEEEEECCEEEEEECCEEeEE
Confidence 4556655 78888888777763 33433 34578899999999999999999999977654
No 303
>cd00070 GLECT Galectin/galactose-binding lectin. This domain exclusively binds beta-galactosides, such as lactose, and does not require metal ions for activity. GLECT domains occur as homodimers or tandemly repeated domains. They are developmentally regulated and may be involved in differentiation, cell-cell interaction and cellular regulation.
Probab=22.94 E-value=5.4e+02 Score=22.89 Aligned_cols=33 Identities=9% Similarity=-0.032 Sum_probs=29.0
Q ss_pred CCceecCCCcEEEEEEEECCEEEEEEcCeeeec
Q psy9228 733 SKKPVNDGIKHSVNVTRINKFGSLEVDSVIVGK 765 (834)
Q Consensus 733 s~~~~nDg~wH~V~i~r~~~~~~l~VD~~~~~~ 765 (834)
...++..|+...|.|........+.|||.....
T Consensus 70 ~~~pf~~g~~F~l~i~~~~~~f~i~vng~~~~~ 102 (127)
T cd00070 70 GGFPFQPGQPFELTILVEEDKFQIFVNGQHFFS 102 (127)
T ss_pred CCCCCCCCCeEEEEEEEcCCEEEEEECCEeEEE
Confidence 356889999999999999999999999987643
No 304
>PF09083 DUF1923: Domain of unknown function (DUF1923); InterPro: IPR015167 This domain is found in maltosyltransferases, adopting a secondary structure that consists of eight antiparallel beta-strands forming an open-sided 'jelly roll' Greek key beta-barrel. Their exact function is, as yet, unknown []. ; PDB: 1GJW_A 1GJU_A.
Probab=22.84 E-value=3.2e+02 Score=20.15 Aligned_cols=30 Identities=20% Similarity=0.184 Sum_probs=21.7
Q ss_pred CC-EEEEEEEcCCcEEEEEeCCceecCCCcEEE
Q psy9228 714 NG-YLEYSYDLGDGVVTIKFSKKPVNDGIKHSV 745 (834)
Q Consensus 714 ~G-~l~~~~~~g~~~~~l~~s~~~~nDg~wH~V 745 (834)
|| +|++..|.|..+..|+ ..++=||+|..-
T Consensus 19 ~g~k~viaanvgke~ke~s--ggrvw~g~w~~~ 49 (64)
T PF09083_consen 19 NGQKIVIAANVGKEPKEIS--GGRVWNGRWSDK 49 (64)
T ss_dssp TTEEEEEEEE-SSS-EEEE--EEEEESSSEEEE
T ss_pred CCcEEEEEeccCCCccccc--CceeecCccccc
Confidence 55 5667788898888875 788999999753
No 305
>cd07699 IgC_L Immunoglobulin Constant domain. IgC_L: Immunoglobulin (Ig) light chain constant (C) domain. The basic structure of Ig molecules is a tetramer of two light chains and two heavy chains linked by disulfide bonds. In Ig, each chain is composed of one variable domain (IgV) and one or more constant domains (IgC); these names reflect the fact that the variability in sequences is higher in the variable domain than in the constant domain. There are five types of heavy chains (alpha, gamma, delta, epsilon, and mu), which determine the type of immunoglobulin: IgA, IgG, IgD, IgE, and IgM, respectively. In higher vertebrates, there are two types of light chain, designated kappa and lambda, which seem to be functionally identical, and can associate with any of the heavy chains.
Probab=22.39 E-value=1.2e+02 Score=25.87 Aligned_cols=14 Identities=14% Similarity=0.401 Sum_probs=9.7
Q ss_pred CcEEEEecCC-CCCc
Q psy9228 2 AYIKWSRADG-LPLQ 15 (834)
Q Consensus 2 ~~~~w~~~~~-~~~~ 15 (834)
++|+|.|++. +..+
T Consensus 34 i~v~W~~~g~~~~~~ 48 (100)
T cd07699 34 ATVQWKVDGATVSSG 48 (100)
T ss_pred CEEEEEECCEECccc
Confidence 5899999554 6543
No 306
>PF04706 Dickkopf_N: Dickkopf N-terminal cysteine-rich region; InterPro: IPR006796 Dickkopf proteins are a class of Wnt antagonists. They possess two conserved cysteine-rich regions. This family represents the N-terminal conserved region []. The C-terminal region has been found to share significant sequence similarity to the colipase fold (IPR001981 from INTERPRO) [].; GO: 0007275 multicellular organismal development, 0030178 negative regulation of Wnt receptor signaling pathway, 0005576 extracellular region
Probab=22.03 E-value=1.3e+02 Score=22.29 Aligned_cols=23 Identities=26% Similarity=0.579 Sum_probs=17.7
Q ss_pred CCCCCCCCCCCccCCCCCCCeee
Q psy9228 170 SGDRCSVLGEPCYPGACGDGSCQ 192 (834)
Q Consensus 170 ~G~~Ce~~~~~C~~~~C~~g~C~ 192 (834)
...+|..+..=|..+.|.||.|+
T Consensus 29 ~~~rC~Rd~~CC~g~~CvnG~C~ 51 (52)
T PF04706_consen 29 RRKRCTRDAMCCPGNLCVNGVCT 51 (52)
T ss_pred CCCCCCCCcccCCCCeeeCCEec
Confidence 56788777766667889999886
No 307
>cd05755 Ig2_ICAM-1_like Second immunoglobulin (Ig)-like domain of intercellular cell adhesion molecule-1 (ICAM-1, CD54) and similar proteins. Ig2_ ICAM-1_like: domain similar to the second immunoglobulin (Ig)-like domain of intercellular cell adhesion molecule-1 (ICAM-1, CD54). During the inflammation process, these molecules recruit leukocytes onto the vascular endothelium before extravasation to the injured tissues. ICAM-1 may be involved in organ targeted tumor metastasis. The interaction of ICAM-1 with leukocyte function-associated antigen-1 (LFA-1) plays a part in leukocyte-endothelial cell recognition. This group also contains ICAM-2, which also interacts with LFA-1. Transmigration of immature dendritic cells across resting endothelium is dependent on the interaction of ICAM-2 with, yet unidentified, ligand(s) on the dendritic cells. ICAM-1 has five Ig-like domains and ICAM-2 has two. ICAM-1 may also act as host receptor for viruses and parasites.
Probab=21.50 E-value=1.1e+02 Score=26.34 Aligned_cols=41 Identities=22% Similarity=0.365 Sum_probs=26.1
Q ss_pred cEEEEecCC-CCCcccc--------CCCeEEEcccCCCCCe-eEEEEEcCCC
Q psy9228 3 YIKWSRADG-LPLQRYA--------EGNVLRITNARLQDSG-KYKCEIQGHD 44 (834)
Q Consensus 3 ~~~w~~~~~-~~~~~~~--------~~~~l~~~~~~~~d~g-~y~c~~~~~~ 44 (834)
+|+|.|.+. |...... ...+|.+. +..+|.| .|+|.+...-
T Consensus 35 ~i~W~rG~~~l~~~~~~~~~~~~~~~~stlt~~-~~r~D~g~~~sC~A~l~l 85 (100)
T cd05755 35 TVVLLRGNETLSRQPFGDNTKSPVNAPATITIT-VDREDHGANFSCETELDL 85 (100)
T ss_pred EEEEeeCCEEcccceeccccCCCceeEEEEEEe-cchhhCCcEEEEEEEecc
Confidence 489999666 7654322 13455654 5566666 8999997553
No 308
>KOG1218|consensus
Probab=21.39 E-value=1.4e+02 Score=31.47 Aligned_cols=21 Identities=38% Similarity=0.873 Sum_probs=9.7
Q ss_pred EeeCCCCCCCCCCCCCCCCcc
Q psy9228 162 TCICPPGFSGDRCSVLGEPCY 182 (834)
Q Consensus 162 ~C~C~~g~~G~~Ce~~~~~C~ 182 (834)
.|.|++||.|.+|+.....|.
T Consensus 163 ~c~c~~g~~g~~~~~~~~~c~ 183 (316)
T KOG1218|consen 163 ICTCQPGFVGVFCVESCSGCS 183 (316)
T ss_pred ceeccCCcccccccccCCCcC
Confidence 344555555555544333344
No 309
>cd06902 lectin_ERGIC-53_ERGL ERGIC-53 and ERGL type 1 transmembrane proteins, N-terminal lectin domain. ERGIC-53 and ERGL, N-terminal carbohydrate recognition domain. ERGIC-53 and ERGL are eukaryotic mannose-binding type 1 transmembrane proteins of the early secretory pathway that transport newly synthesized glycoproteins from the endoplasmic reticulum (ER) to the ER-Golgi intermediate compartment (ERGIC). ERGIC-53 and ERGL have an N-terminal lectin-like carbohydrate recognition domain (represented by this alignment model) as well as a C-terminal transmembrane domain. ERGIC-53 functions as a 'cargo receptor' to facilitate the export of glycoproteins with different characteristics from the ER, while the ERGIC-53-like protein (ERGL) which may act as a regulator of ERGIC-53. In mammals, ERGIC-53 forms a complex with MCFD2 (multi-coagulation factor deficiency 2) which then recruits blood coagulation factors V and VIII. Mutations in either MCFD2 or ERGIC-53 cause a mild form of inherite
Probab=21.23 E-value=4.3e+02 Score=26.56 Aligned_cols=55 Identities=11% Similarity=0.069 Sum_probs=30.0
Q ss_pred ceEEEEEECCEEEEEEEcCCcEEEEEeCCcee-cCCCcEEEEEEEECCEEEEEEcC
Q psy9228 706 EFIAVAVVNGYLEYSYDLGDGVVTIKFSKKPV-NDGIKHSVNVTRINKFGSLEVDS 760 (834)
Q Consensus 706 ~~~~l~l~~G~l~~~~~~g~~~~~l~~s~~~~-nDg~wH~V~i~r~~~~~~l~VD~ 760 (834)
.++.+.+.||...+..........+..=...+ |...-.+++|+..++.++|.||.
T Consensus 120 p~i~~~~NDGt~~yd~~~D~~~~~~~~C~~~~rn~~~p~~~rI~Y~~~~l~V~~d~ 175 (225)
T cd06902 120 PAILVVGNDGTKSYDHQNDGLTQALGSCLRDFRNKPYPVRAKITYYQNVLTVSINN 175 (225)
T ss_pred cEEEEEECCCCeeccccCCCcccccceEEEeccCCCCCeEEEEEEECCeEEEEEeC
Confidence 46666777776655432111111111000112 22355688999999999999985
No 310
>cd07697 IgC_TCR_gamma T cell receptor (TCR) gamma chain constant immunoglobulin domain. IgC_TCR_gamma; immunoglobulin (Ig) constant (C) domain of the gamma chain of gamma-delta T-cell receptors (TCRs). TCRs mediate antigen recognition by T lymphocytes, and are heterodimers consisting of alpha and beta chains or gamma and delta chains. Each chain contains a variable (V) and a constant (C) region. The majority of T cells contain alpha-beta TCRs but a small subset contain gamma-delta TCRs. Alpha-beta TCRs recognize antigen as peptide fragments presented by major histocompatibility complex (MHC) molecules. Gamma-delta TCRs recognize intact protein antigens; they recognize protein antigens directly and without antigen processing, and MHC independently of the bound peptide. Gamma-delta T cells can also be stimulated by non-peptide antigens such as small phosphate- or amine-containing compounds.
Probab=20.63 E-value=1.4e+02 Score=25.40 Aligned_cols=14 Identities=14% Similarity=0.406 Sum_probs=9.3
Q ss_pred CcEEEEecCC-CCCc
Q psy9228 2 AYIKWSRADG-LPLQ 15 (834)
Q Consensus 2 ~~~~w~~~~~-~~~~ 15 (834)
++++|+|.+. +..+
T Consensus 33 I~v~W~kng~~~~~~ 47 (96)
T cd07697 33 IQVHWREGNSPSILG 47 (96)
T ss_pred eEEEEEECCEECcCC
Confidence 5789999444 5543
No 311
>PF07654 C1-set: Immunoglobulin C1-set domain; InterPro: IPR003597 The basic structure of immunoglobulin (Ig) molecules is a tetramer of two light chains and two heavy chains linked by disulphide bonds. There are two types of light chains: kappa and lambda, each composed of a constant domain (CL) and a variable domain (VL). There are five types of heavy chains: alpha, delta, epsilon, gamma and mu, all consisting of a variable domain (VH) and three (in alpha, delta and gamma) or four (in epsilon and mu) constant domains (CH1 to CH4). Ig molecules are highly modular proteins, in which the variable and constant domains have clear, conserved sequence patterns. The domains in Ig and Ig-like molecules are grouped into four types: V-set (variable; IPR013106 from INTERPRO), C1-set (constant-1; IPR003597 from INTERPRO), C2-set (constant-2; IPR008424 from INTERPRO) and I-set (intermediate; IPR013098 from INTERPRO) []. Structural studies have shown that these domains share a common core Greek-key beta-sandwich structure, with the types differing in the number of strands in the beta-sheets as well as in their sequence patterns [, ]. Immunoglobulin-like domains that are related in both sequence and structure can be found in several diverse protein families. Ig-like domains are involved in a variety of functions, including cell-cell recognition, cell-surface receptors, muscle structure and the immune system []. This entry represents C1-set domains, which are classical Ig-like domains resembling the antibody constant domain. C1-set domains are found almost exclusively in molecules involved in the immune system, such as in immunoglobulin light and heavy chains, in the major histocompatibility complex (MHC) class I and II complex molecules [, ], and in various T-cell receptors.; PDB: 3BVN_D 3BXN_A 3PWV_E 3L9R_F 2XFX_B 1BMG_A 1K8I_A 3M1B_G 3M17_C 1EXU_A ....
Probab=20.12 E-value=1.1e+02 Score=24.89 Aligned_cols=37 Identities=14% Similarity=0.382 Sum_probs=20.7
Q ss_pred CcEEEEecCC-CCCcccc------CCC------eEEEcccCCCCCeeEEEEEc
Q psy9228 2 AYIKWSRADG-LPLQRYA------EGN------VLRITNARLQDSGKYKCEIQ 41 (834)
Q Consensus 2 ~~~~w~~~~~-~~~~~~~------~~~------~l~~~~~~~~d~g~y~c~~~ 41 (834)
++|+|.|.+. ++..... .++ .|.|. ..+...|+|.+.
T Consensus 25 i~v~W~~~~~~~~~~~~~~~~~~~~dgty~~~s~l~v~---~~~~~~ysC~V~ 74 (83)
T PF07654_consen 25 ITVTWLKNGKEVTEGVETTPPPPNSDGTYSVTSSLTVT---WNSGDEYSCRVT 74 (83)
T ss_dssp EEEEEEETTEEETTTEEEEEEEEETTSEEEEEEEEEEE---TTTTGGEEEEEE
T ss_pred cEEEEEeccceeeeeeeecccccccccceeeeEEEEec---CCCCCEEEEEEE
Confidence 5799998555 6543322 122 33334 444557777765
Done!