Query psy14533
Match_columns 209
No_of_seqs 139 out of 1266
Neff 11.8
Searched_HMMs 46136
Date Fri Aug 16 22:18:27 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy14533.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/14533hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG4221|consensus 99.9 1.2E-24 2.6E-29 170.8 20.4 190 10-207 571-764 (1381)
2 KOG3513|consensus 99.9 6E-24 1.3E-28 166.7 21.2 189 14-206 675-864 (1051)
3 KOG4221|consensus 99.9 9.3E-23 2E-27 160.3 22.2 176 13-202 483-658 (1381)
4 KOG3513|consensus 99.8 9.9E-19 2.1E-23 137.8 21.0 185 13-203 776-965 (1051)
5 KOG0196|consensus 99.7 1.3E-14 2.8E-19 111.5 16.7 138 11-157 396-537 (996)
6 KOG0196|consensus 99.6 9E-13 2E-17 101.6 17.2 150 51-205 328-486 (996)
7 PF00041 fn3: Fibronectin type 99.4 5.1E-12 1.1E-16 73.8 9.0 83 57-146 2-84 (85)
8 KOG4222|consensus 99.4 8.4E-12 1.8E-16 99.4 11.9 188 11-202 582-791 (1281)
9 KOG4258|consensus 99.2 3.1E-10 6.7E-15 88.3 11.0 184 14-203 567-870 (1025)
10 KOG4258|consensus 99.1 2.6E-09 5.7E-14 83.3 13.6 140 61-204 494-653 (1025)
11 KOG4802|consensus 99.1 5.7E-09 1.2E-13 75.1 12.8 132 4-144 199-340 (516)
12 KOG4222|consensus 99.0 1.5E-08 3.3E-13 81.5 11.7 139 11-154 699-843 (1281)
13 cd00063 FN3 Fibronectin type 3 98.8 3.4E-07 7.4E-12 53.6 11.4 85 57-148 3-87 (93)
14 smart00060 FN3 Fibronectin typ 98.4 1.7E-05 3.6E-10 45.0 10.8 78 58-142 4-81 (83)
15 KOG4802|consensus 98.3 1.8E-05 3.9E-10 57.7 10.9 132 66-202 157-295 (516)
16 PF10179 DUF2369: Uncharacteri 97.9 0.002 4.4E-08 46.0 15.0 139 11-155 126-297 (300)
17 PF10179 DUF2369: Uncharacteri 97.9 0.0032 6.9E-08 45.1 17.9 179 10-204 12-215 (300)
18 PF00041 fn3: Fibronectin type 97.9 4.5E-05 9.7E-10 44.0 5.7 42 162-205 2-43 (85)
19 COG3401 Fibronectin type 3 dom 97.8 0.00062 1.4E-08 48.7 10.1 174 12-204 30-203 (343)
20 PF09294 Interfer-bind: Interf 97.6 0.00089 1.9E-08 40.5 8.4 89 57-154 5-105 (106)
21 KOG3632|consensus 97.5 0.00048 1E-08 55.9 7.0 159 23-200 642-814 (1335)
22 KOG4367|consensus 97.3 0.00046 1E-08 51.0 4.8 81 66-155 450-530 (699)
23 KOG4152|consensus 97.1 0.019 4E-07 44.1 11.1 75 17-93 651-725 (830)
24 cd00063 FN3 Fibronectin type 3 96.9 0.003 6.5E-08 36.4 5.1 33 11-43 55-87 (93)
25 KOG4152|consensus 96.8 0.0044 9.4E-08 47.3 6.1 72 122-197 651-724 (830)
26 PF01108 Tissue_fac: Tissue fa 96.6 0.046 9.9E-07 33.1 8.5 78 56-142 23-103 (107)
27 COG3401 Fibronectin type 3 dom 96.1 0.32 7E-06 35.4 11.5 122 15-153 126-251 (343)
28 smart00060 FN3 Fibronectin typ 95.5 0.047 1E-06 30.3 4.8 26 12-37 56-81 (83)
29 PF09067 EpoR_lig-bind: Erythr 94.9 0.18 4E-06 30.3 6.0 83 54-142 7-94 (104)
30 COG4733 Phage-related protein, 94.9 0.98 2.1E-05 37.3 11.3 126 10-145 657-784 (952)
31 PF07495 Y_Y_Y: Y_Y_Y domain; 93.3 0.25 5.5E-06 26.7 4.2 36 3-40 20-55 (66)
32 KOG3632|consensus 92.9 1.2 2.7E-05 37.5 8.8 127 13-147 718-859 (1335)
33 PF09294 Interfer-bind: Interf 92.3 0.18 3.9E-06 30.3 2.9 24 11-34 65-88 (106)
34 PLN02533 probable purple acid 91.3 5.7 0.00012 30.8 11.1 90 54-156 40-135 (427)
35 KOG4806|consensus 91.1 5 0.00011 29.8 9.4 131 18-154 289-446 (454)
36 PF09240 IL6Ra-bind: Interleuk 90.0 2.7 5.8E-05 25.0 9.7 80 58-142 2-85 (99)
37 KOG4806|consensus 88.8 8.1 0.00018 28.8 11.7 76 122-204 288-368 (454)
38 KOG1225|consensus 88.2 2.5 5.5E-05 33.4 6.5 111 9-135 414-524 (525)
39 TIGR00868 hCaCC calcium-activa 87.3 4.3 9.3E-05 34.4 7.6 82 66-147 766-859 (863)
40 PF01108 Tissue_fac: Tissue fa 86.9 1.9 4E-05 26.1 4.3 36 162-201 24-59 (107)
41 COG4733 Phage-related protein, 83.3 13 0.00029 31.3 8.4 65 117-183 659-725 (952)
42 KOG3515|consensus 82.9 27 0.00059 29.3 12.1 163 13-203 570-732 (741)
43 KOG4367|consensus 80.4 0.65 1.4E-05 35.1 0.4 42 9-50 489-530 (699)
44 PF09067 EpoR_lig-bind: Erythr 79.8 5.4 0.00012 24.1 4.2 42 159-203 7-48 (104)
45 cd05735 Ig8_DSCAM Eight immuno 79.0 4.6 9.9E-05 23.3 3.7 35 13-47 47-81 (88)
46 cd05851 Ig3_Contactin-1 Third 78.5 3.6 7.8E-05 23.7 3.1 28 13-40 53-80 (88)
47 KOG3515|consensus 77.2 4.6 9.9E-05 33.4 4.2 76 14-96 655-730 (741)
48 KOG1948|consensus 75.7 51 0.0011 28.3 13.4 24 12-35 945-968 (1165)
49 cd05870 Ig5_NCAM-2 Fifth immun 75.1 7.6 0.00017 22.8 4.0 28 13-40 64-91 (98)
50 PF07353 Uroplakin_II: Uroplak 73.6 8.6 0.00019 25.0 4.0 24 13-36 102-125 (184)
51 PF09423 PhoD: PhoD-like phosp 73.6 11 0.00023 29.6 5.4 50 118-173 64-113 (453)
52 cd05760 Ig2_PTK7 Second immuno 73.4 13 0.00029 20.6 4.5 27 13-39 38-64 (77)
53 cd05762 Ig8_MLCK Eighth immuno 72.7 12 0.00025 22.2 4.3 31 12-42 55-85 (98)
54 PLN02533 probable purple acid 71.6 9.2 0.0002 29.7 4.6 33 13-51 103-135 (427)
55 PF11344 DUF3146: Protein of u 71.5 3.6 7.7E-05 22.9 1.7 15 19-33 66-80 (80)
56 cd05740 Ig_CEACAM_D4 Fourth im 70.2 11 0.00024 21.9 3.8 30 11-40 53-82 (91)
57 cd05763 Ig_1 Subgroup of the i 68.4 14 0.00031 20.2 3.9 27 13-39 40-66 (75)
58 TIGR03000 plancto_dom_1 Planct 68.3 10 0.00022 21.3 3.0 27 9-35 25-51 (75)
59 PF09240 IL6Ra-bind: Interleuk 67.4 23 0.00051 20.9 6.0 38 163-203 2-39 (99)
60 cd05748 Ig_Titin_like Immunogl 66.6 19 0.00041 19.6 4.2 28 12-39 39-66 (74)
61 cd02848 Chitinase_N_term Chiti 66.2 27 0.00058 21.2 5.6 31 14-46 71-101 (106)
62 cd05854 Ig6_Contactin-2 Sixth 64.7 11 0.00024 21.5 3.0 28 13-40 47-74 (85)
63 cd05894 Ig_C5_MyBP-C C5 immuno 64.5 21 0.00046 20.4 4.2 27 13-39 52-78 (86)
64 cd05726 Ig4_Robo Fhird immunog 63.6 18 0.00038 20.8 3.8 28 13-40 47-74 (90)
65 cd05893 Ig_Palladin_C C-termin 63.3 4.7 0.0001 22.5 1.2 27 14-40 42-68 (75)
66 PF09423 PhoD: PhoD-like phosp 63.2 18 0.0004 28.3 4.8 35 13-50 64-98 (453)
67 TIGR00868 hCaCC calcium-activa 62.3 18 0.00038 31.0 4.7 39 162-201 758-796 (863)
68 cd05879 Ig_P0 Immunoglobulin ( 62.2 32 0.0007 21.2 4.8 24 13-36 80-103 (116)
69 cd05876 Ig3_L1-CAM Third immun 61.9 9.4 0.0002 20.8 2.3 29 12-40 35-63 (71)
70 cd05730 Ig3_NCAM-1_like Third 61.7 18 0.00038 21.0 3.6 28 13-40 58-85 (95)
71 cd05736 Ig2_Follistatin_like S 61.4 25 0.00055 19.2 4.3 28 13-40 39-66 (76)
72 cd05852 Ig5_Contactin-1 Fifth 61.0 12 0.00026 20.7 2.6 28 13-40 39-66 (73)
73 KOG4228|consensus 60.7 7.7 0.00017 33.5 2.4 59 135-202 144-202 (1087)
74 cd04974 Ig3_FGFR Third immunog 60.5 20 0.00044 20.6 3.6 28 13-40 55-82 (90)
75 KOG1225|consensus 60.0 36 0.00077 27.4 5.7 119 59-199 370-489 (525)
76 KOG4228|consensus 59.7 31 0.00068 30.1 5.6 58 30-96 144-201 (1087)
77 cd05765 Ig_3 Subgroup of the i 59.5 12 0.00026 20.8 2.5 27 13-39 47-73 (81)
78 cd05866 Ig1_NCAM-2 First immun 58.9 29 0.00062 20.3 4.1 28 13-40 56-83 (92)
79 cd04972 Ig_TrkABC_d4 Fourth do 57.9 20 0.00043 20.6 3.3 28 13-40 56-83 (90)
80 cd05751 Ig1_LILRB1_like First 57.8 30 0.00065 20.0 4.1 35 11-45 51-86 (91)
81 cd05745 Ig3_Peroxidasin Third 57.2 13 0.00027 20.4 2.3 28 13-40 40-67 (74)
82 cd05734 Ig7_DSCAM Seventh immu 57.0 32 0.0007 19.0 4.5 28 13-40 44-71 (79)
83 cd04971 Ig_TrKABC_d5 Fifth dom 57.0 18 0.00038 20.5 2.9 27 14-40 47-73 (81)
84 cd05858 Ig3_FGFR-2 Third immun 56.0 16 0.00035 21.0 2.7 28 13-40 55-82 (90)
85 cd05725 Ig3_Robo Third immunog 55.9 24 0.00051 18.9 3.2 28 12-39 34-61 (69)
86 cd05743 Ig_Perlecan_D2_like Im 54.6 36 0.00077 18.8 4.1 28 13-40 42-69 (78)
87 cd04969 Ig5_Contactin_like Fif 54.3 14 0.0003 20.1 2.2 28 13-40 39-66 (73)
88 cd05892 Ig_Myotilin_C C-termin 53.8 30 0.00065 19.2 3.5 27 14-40 42-68 (75)
89 cd04970 Ig6_Contactin_like Six 53.1 23 0.00049 20.0 3.0 28 12-39 46-73 (85)
90 cd05867 Ig4_L1-CAM_like Fourth 52.5 27 0.00059 19.2 3.2 27 13-39 41-67 (76)
91 cd05868 Ig4_NrCAM Fourth immun 50.5 19 0.00042 19.9 2.3 28 13-40 41-68 (76)
92 cd04978 Ig4_L1-NrCAM_like Four 50.4 21 0.00045 19.4 2.5 28 12-39 40-67 (76)
93 cd05746 Ig4_Peroxidasin Fourth 50.2 19 0.0004 19.4 2.2 29 12-40 35-63 (69)
94 cd04968 Ig3_Contactin_like Thi 50.2 24 0.00052 20.1 2.8 29 12-40 52-80 (88)
95 cd05731 Ig3_L1-CAM_like Third 50.0 16 0.00035 19.6 2.0 27 13-39 36-62 (71)
96 cd05732 Ig5_NCAM-1_like Fifth 49.8 31 0.00067 19.9 3.3 28 13-40 62-89 (96)
97 PF15417 DUF4624: Domain of un 49.2 20 0.00044 21.8 2.3 30 4-33 80-109 (132)
98 PF08329 ChitinaseA_N: Chitina 49.1 19 0.00041 22.9 2.3 33 14-48 74-106 (133)
99 cd05753 Ig2_FcgammaR_like Seco 49.0 49 0.0011 18.7 4.0 35 12-47 47-81 (83)
100 cd05744 Ig_Myotilin_C_like Imm 48.9 45 0.00098 18.3 3.7 26 14-39 42-67 (75)
101 cd05869 Ig5_NCAM-1 Fifth immun 48.7 36 0.00079 19.8 3.5 27 14-40 64-90 (97)
102 cd05855 Ig_TrkB_d5 Fifth domai 47.9 38 0.00083 19.1 3.3 27 14-40 45-71 (79)
103 PF04775 Bile_Hydr_Trans: Acyl 46.7 51 0.0011 20.7 4.0 25 118-142 5-29 (126)
104 cd04976 Ig2_VEGFR Second immun 46.3 20 0.00044 19.5 2.0 28 12-39 34-61 (71)
105 PF13754 Big_3_4: Bacterial Ig 45.8 43 0.00094 17.2 4.8 28 14-42 15-42 (54)
106 cd05747 Ig5_Titin_like M5, fif 45.7 58 0.0013 18.7 4.3 27 13-39 59-85 (92)
107 cd05733 Ig6_L1-CAM_like Sixth 44.5 55 0.0012 18.0 4.1 28 13-40 39-70 (77)
108 cd04975 Ig4_SCFR_like Fourth i 44.3 25 0.00054 21.0 2.3 28 13-40 65-92 (101)
109 cd05737 Ig_Myomesin_like_C C-t 44.0 30 0.00064 19.9 2.5 27 13-39 58-84 (92)
110 cd05750 Ig_Pro_neuregulin Immu 42.8 55 0.0012 17.6 4.1 27 13-39 43-69 (75)
111 cd04977 Ig1_NCAM-1_like First 42.3 68 0.0015 18.5 4.9 26 13-38 57-82 (92)
112 PF00907 T-box: T-box; InterP 42.2 44 0.00095 22.5 3.4 25 12-36 32-56 (184)
113 cd05723 Ig4_Neogenin Fourth im 42.0 26 0.00055 19.0 2.0 26 14-39 38-63 (71)
114 cd05859 Ig4_PDGFR-alpha Fourth 42.0 26 0.00057 20.8 2.1 28 13-40 65-92 (101)
115 cd05895 Ig_Pro_neuregulin-1 Im 41.9 59 0.0013 17.7 4.5 27 13-39 44-70 (76)
116 cd05865 Ig1_NCAM-1 First immun 41.5 33 0.00072 20.2 2.5 24 13-36 60-83 (96)
117 cd05857 Ig2_FGFR Second immuno 40.7 58 0.0012 18.2 3.4 26 14-39 52-77 (85)
118 cd05891 Ig_M-protein_C C-termi 40.3 48 0.001 19.1 3.0 27 14-40 59-85 (92)
119 cd05886 Ig1_Nectin-1_like Firs 39.5 68 0.0015 19.1 3.6 14 2-15 21-34 (99)
120 cd05724 Ig2_Robo Second immuno 39.4 70 0.0015 17.8 4.5 28 13-40 51-78 (86)
121 cd05856 Ig2_FGFRL1-like Second 39.2 49 0.0011 18.1 3.0 26 13-38 48-73 (82)
122 cd05864 Ig2_VEGFR-2 Second imm 39.0 37 0.0008 18.5 2.3 28 13-40 34-61 (70)
123 cd05758 Ig5_KIRREL3-like Fifth 39.0 57 0.0012 19.0 3.3 27 14-40 65-92 (98)
124 cd05863 Ig2_VEGFR-3 Second imm 38.7 33 0.00072 18.6 2.0 29 11-39 29-57 (67)
125 PF14292 SusE: SusE outer memb 38.3 97 0.0021 19.1 4.9 36 166-203 37-73 (122)
126 cd05742 Ig1_VEGFR_like First i 37.8 35 0.00077 19.1 2.2 28 13-40 49-76 (84)
127 smart00408 IGc2 Immunoglobulin 36.4 59 0.0013 16.1 3.8 25 12-36 38-62 (63)
128 cd05738 Ig2_RPTP_IIa_LAR_like 36.2 75 0.0016 17.3 4.2 27 13-39 38-64 (74)
129 cd05773 Ig8_hNephrin_like Eigh 35.4 81 0.0018 19.0 3.6 26 14-39 70-96 (109)
130 PF14686 fn3_3: Polysaccharide 34.5 72 0.0016 18.9 3.1 21 11-32 48-68 (95)
131 cd05754 Ig3_Perlecan_like Thir 34.5 77 0.0017 17.7 3.3 27 13-39 52-78 (85)
132 cd05853 Ig6_Contactin-4 Sixth 32.8 64 0.0014 18.5 2.7 28 13-40 47-74 (85)
133 PF07867 DUF1654: Protein of u 31.9 93 0.002 17.5 3.0 19 165-183 52-70 (73)
134 KOG3834|consensus 31.5 1.4E+02 0.003 23.4 4.7 71 60-135 67-137 (462)
135 cd05729 Ig2_FGFR_like Second i 31.3 96 0.0021 17.0 3.5 25 14-38 52-76 (85)
136 cd04973 Ig1_FGFR First immunog 30.9 62 0.0014 17.9 2.4 25 14-38 46-70 (79)
137 PF13750 Big_3_3: Bacterial Ig 30.7 75 0.0016 20.9 3.0 23 122-144 116-138 (158)
138 cd05728 Ig4_Contactin-2-like F 30.4 96 0.0021 17.4 3.2 27 13-39 51-77 (85)
139 cd05874 Ig6_NrCAM Sixth immuno 30.0 1E+02 0.0023 17.0 3.5 27 13-39 39-69 (77)
140 PF10333 Pga1: GPI-Mannosyltra 29.1 77 0.0017 21.5 2.9 21 11-31 64-84 (180)
141 PF11811 DUF3331: Domain of un 28.8 1.4E+02 0.0029 17.9 4.1 20 164-183 16-36 (96)
142 KOG0613|consensus 28.8 4E+02 0.0087 24.4 7.4 64 120-183 293-359 (1205)
143 cd00182 TBOX T-box DNA binding 28.3 98 0.0021 21.1 3.3 24 12-35 34-57 (188)
144 cd05756 Ig1_IL1R_like First im 28.0 99 0.0022 18.0 3.0 25 14-38 61-85 (94)
145 cd02856 Glycogen_debranching_e 26.2 87 0.0019 18.6 2.6 19 14-32 48-66 (103)
146 smart00425 TBOX Domain first f 25.9 1.1E+02 0.0024 20.9 3.2 24 12-35 33-56 (190)
147 cd05880 Ig_EVA1 Immunoglobulin 25.9 1.6E+02 0.0035 17.9 5.1 6 2-7 34-39 (115)
148 PF07679 I-set: Immunoglobulin 25.8 1.2E+02 0.0027 16.8 3.2 29 11-39 54-82 (90)
149 cd05898 Ig5_KIRREL3 Fifth immu 24.2 1.7E+02 0.0036 17.4 3.5 28 12-39 63-91 (98)
150 cd05900 Ig_Aggrecan Immunoglob 23.4 1.9E+02 0.004 17.7 3.9 20 14-33 75-94 (112)
151 cd05771 IgC_Tapasin_R Tapasin- 23.4 2E+02 0.0043 18.0 4.9 25 14-38 2-26 (139)
152 KOG1378|consensus 22.4 1.2E+02 0.0027 23.9 3.2 35 12-51 108-142 (452)
153 PF05738 Cna_B: Cna protein B- 22.1 85 0.0019 16.8 1.9 22 10-32 24-45 (70)
154 cd05764 Ig_2 Subgroup of the i 22.1 1.4E+02 0.0031 15.9 3.5 27 13-39 40-66 (74)
155 PF02018 CBM_4_9: Carbohydrate 22.0 1.2E+02 0.0026 18.4 2.8 20 18-37 56-75 (131)
156 cd05715 Ig_P0-like Immunoglobu 22.0 2E+02 0.0043 17.4 5.3 8 2-9 34-41 (116)
157 PF14250 AbrB-like: AbrB-like 21.7 1E+02 0.0023 17.1 2.0 15 122-136 50-64 (71)
158 cd05848 Ig1_Contactin-5 First 21.4 1.8E+02 0.0039 16.8 3.9 26 14-39 59-85 (94)
159 PF07753 DUF1609: Protein of u 20.2 1E+02 0.0022 21.3 2.2 36 164-201 192-228 (230)
No 1
>KOG4221|consensus
Probab=99.94 E-value=1.2e-24 Score=170.78 Aligned_cols=190 Identities=32% Similarity=0.516 Sum_probs=156.9
Q ss_pred CCcceEEEeecCCCceEEEEEEEEcCCCCCCCCCCeeEecCCCCCCCCCCceEEEeecCCeEEEEecCCCCCCCCCccee
Q psy14533 10 GTEQWAVLQDLLPATLYRVRVLAENSLGAGRPSDPLLVHTEAEPPTAEPSGLHAVAISSDSIRVTWSPPPAHLTNGDLLG 89 (209)
Q Consensus 10 ~~~~~~~i~~L~p~~~Y~~~v~a~~~~g~~~~s~~~~~~t~~~~~~~~p~~~~~~~~~~~~~~l~W~~~~~~~~~~~~~~ 89 (209)
.+...++|.+|++.+.|.|+|.|+|..|.+..|..+.++|..+.|.++|.++.+.....++++|+|.+|..+..++.+.+
T Consensus 571 ~n~~e~ti~gL~k~TeY~~~vvA~N~~G~g~sS~~i~V~Tlsd~PsaPP~Nl~lev~sStsVrVsW~pP~~~t~ng~itg 650 (1381)
T KOG4221|consen 571 NNATEYTINGLEKYTEYSIRVVAYNSAGSGVSSADITVRTLSDVPSAPPQNLSLEVVSSTSVRVSWLPPPSETQNGQITG 650 (1381)
T ss_pred cCccEEEeecCCCccceEEEEEEecCCCCCCCCCceEEEeccCCCCCCCcceEEEecCCCeEEEEccCCCcccccceEEE
Confidence 35678999999999999999999999999998999999999999999999999999999999999999987777999999
Q ss_pred EEEEEEEccccCCCCcceEEeecCCCCcceEEecCCCCCcEEEEEEEEEcCCCCccCCccEEEEeccCCCC----CCCcc
Q psy14533 90 YYLGYREQGFGRQNSYNFTTIPNRSDGAGVATLTGLRKYRKYDIVVQAFNEKGPGPMSSEVSVQTLEDVPA----APPLD 165 (209)
Q Consensus 90 y~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~p~~~Y~~~v~a~~~~g~~~~s~~~~~~t~~~~p~----~~p~~ 165 (209)
|.|+|++ ...........+.. ....+.+.+|+|++.|.|+|.|++..|.|+.|..+.+.|....+. .+|..
T Consensus 651 YkIRy~~--~~~~~~~~~t~v~~---n~~~~l~~~Lep~T~Y~vrIsa~t~nGtGpaS~w~~aeT~~~d~~e~vp~~ps~ 725 (1381)
T KOG4221|consen 651 YKIRYRK--LSREDEVNETVVKG---NTTQYLFNGLEPNTQYRVRISAMTVNGTGPASEWVSAETPESDLDERVPGKPSE 725 (1381)
T ss_pred EEEEecc--cCcccccceeeccc---chhhhHhhcCCCCceEEEEEEEeccCCCCCcccceeccCccccccccCCCCCce
Confidence 9999997 22222222333332 247888999999999999999999999999999999988865422 14544
Q ss_pred eEEEEecCCeEEEEEeCCCCccCCceeeEEEEEEEeCCCcCC
Q psy14533 166 ITCSALSSTSLSVTWQPPPLLLQNGEILGYKVYYENMRELPM 207 (209)
Q Consensus 166 ~~~~~~~~~sv~l~W~~p~~~~~~~~i~~Y~i~y~~~~~~~~ 207 (209)
+... ...+++.+.|.+| ...+..+.+|+|.|++..+-++
T Consensus 726 l~~~-~g~~si~vsW~Pp--~~~~~~vrgY~ig~r~g~~~p~ 764 (1381)
T KOG4221|consen 726 LHVH-PGSNSIVVSWTPP--PHPNIVVRGYKIGYRPGSGIPD 764 (1381)
T ss_pred eeec-cCceeEEEEeCCC--CChhhhhcceEEeeecccCCCC
Confidence 4444 4566899999999 5777889999999998766553
No 2
>KOG3513|consensus
Probab=99.93 E-value=6e-24 Score=166.67 Aligned_cols=189 Identities=31% Similarity=0.554 Sum_probs=161.6
Q ss_pred eEEEeecCCCceEEEEEEEEcCCCCCCCCCC-eeEecCCCCCCCCCCceEEEeecCCeEEEEecCCCCCCCCCcceeEEE
Q psy14533 14 WAVLQDLLPATLYRVRVLAENSLGAGRPSDP-LLVHTEAEPPTAEPSGLHAVAISSDSIRVTWSPPPAHLTNGDLLGYYL 92 (209)
Q Consensus 14 ~~~i~~L~p~~~Y~~~v~a~~~~g~~~~s~~-~~~~t~~~~~~~~p~~~~~~~~~~~~~~l~W~~~~~~~~~~~~~~y~v 92 (209)
+..+-+|.|...|+|||.|.|..|.+++|.+ ...+|.+++|...|.++.......+.|.|+|++.+..+.+|+-.+|.|
T Consensus 675 sa~vv~L~Pwv~YeFRV~AvN~iG~gePS~pS~~~rT~ea~P~~~P~nv~g~g~~~~eLvItW~Pl~~~~qNG~gfgY~V 754 (1051)
T KOG3513|consen 675 SATVVNLSPWVEYEFRVVAVNSIGIGEPSPPSEKVRTPEAAPSVNPSNVKGGGGSPTELVITWEPLPEEEQNGPGFGYRV 754 (1051)
T ss_pred ceeEEccCCCcceEEEEEEEcccccCCCCCCccceecCCCCCccCCccccccCCCCceEEEEeccCCHHHccCCCceEEE
Confidence 6889999999999999999999999987755 578899999999999999998899999999999877777899999999
Q ss_pred EEEEccccCCCCcceEEeecCCCCcceEEecCCCCCcEEEEEEEEEcCCCCccCCccEEEEeccCCCCCCCcceEEEEec
Q psy14533 93 GYREQGFGRQNSYNFTTIPNRSDGAGVATLTGLRKYRKYDIVVQAFNEKGPGPMSSEVSVQTLEDVPAAPPLDITCSALS 172 (209)
Q Consensus 93 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~p~~~Y~~~v~a~~~~g~~~~s~~~~~~t~~~~p~~~p~~~~~~~~~ 172 (209)
.|+. ......|....+.........+......|++.|.+.|+++|..|.|+.|..+.+.+.++.|+.+|..+.+...+
T Consensus 755 swr~--~g~~~~W~~~~v~~~d~~~~V~~~~st~~~tpyevKVqa~N~~GeGp~s~~~v~~S~Ed~P~~ap~~~~~~~~s 832 (1051)
T KOG3513|consen 755 SWRP--QGADKEWKEVIVSNQDQPRYVVSNESTEPFTPYEVKVQAINDQGEGPESQVTVGYSGEDEPPVAPTKLSAKPLS 832 (1051)
T ss_pred EEEe--CCCCcccceeEecccCCceEEEcCCCCCCcceeEEEEEEecCCCCCCCCceEEEEcCCCCCCCCCccceeeccc
Confidence 9999 33333555555544321223334445778999999999999999999999999999999999999999999999
Q ss_pred CCeEEEEEeCCCCccCCceeeEEEEEEEeCCCcC
Q psy14533 173 STSLSVTWQPPPLLLQNGEILGYKVYYENMRELP 206 (209)
Q Consensus 173 ~~sv~l~W~~p~~~~~~~~i~~Y~i~y~~~~~~~ 206 (209)
.+.+.|+|++| ...||.+.+|+|.|+..++.+
T Consensus 833 ~s~~~v~W~~~--~~~nG~l~gY~v~Y~~~~~~~ 864 (1051)
T KOG3513|consen 833 SSEVNLSWKPP--LWDNGKLTGYEVKYWKINEKE 864 (1051)
T ss_pred CceEEEEecCc--CccCCccceeEEEEEEcCCCc
Confidence 99999999998 577799999999999997764
No 3
>KOG4221|consensus
Probab=99.92 E-value=9.3e-23 Score=160.33 Aligned_cols=176 Identities=43% Similarity=0.697 Sum_probs=152.9
Q ss_pred ceEEEeecCCCceEEEEEEEEcCCCCCCCCCCeeEecCCCCCCCCCCceEEEeecCCeEEEEecCCCCCCCCCcceeEEE
Q psy14533 13 QWAVLQDLLPATLYRVRVLAENSLGAGRPSDPLLVHTEAEPPTAEPSGLHAVAISSDSIRVTWSPPPAHLTNGDLLGYYL 92 (209)
Q Consensus 13 ~~~~i~~L~p~~~Y~~~v~a~~~~g~~~~s~~~~~~t~~~~~~~~p~~~~~~~~~~~~~~l~W~~~~~~~~~~~~~~y~v 92 (209)
....+.+|.|.+.|.|+|.|.|..|.+..+.++.+.+..+ ++ ..+.........+.+.|++|.. +++++.+|++
T Consensus 483 ~~~tv~nl~p~t~Y~~rv~A~n~~g~g~sS~pLkV~t~pE---gp-~~~~a~ats~~ti~v~WepP~~--~n~~I~~yk~ 556 (1381)
T KOG4221|consen 483 IQVTVQNLSPLTMYFFRVRAKNEAGSGESSAPLKVTTQPE---GP-VQLQAYATSPTTILVTWEPPPF--GNGPITGYKL 556 (1381)
T ss_pred eEEEeeecccceeEEEEEeccCcccCCccCCceEEecCCC---CC-ccccccccCcceEEEEecCCCC--CCCCceEEEE
Confidence 6789999999999999999999999998888898887766 22 3377788889999999999864 6899999999
Q ss_pred EEEEccccCCCCcceEEeecCCCCcceEEecCCCCCcEEEEEEEEEcCCCCccCCccEEEEeccCCCCCCCcceEEEEec
Q psy14533 93 GYREQGFGRQNSYNFTTIPNRSDGAGVATLTGLRKYRKYDIVVQAFNEKGPGPMSSEVSVQTLEDVPAAPPLDITCSALS 172 (209)
Q Consensus 93 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~p~~~Y~~~v~a~~~~g~~~~s~~~~~~t~~~~p~~~p~~~~~~~~~ 172 (209)
.|.. . +... ...+.. +...+.|.+|++++.|.|+|.|.|..|.|..|..+.+.|..+.|+.||.||++...+
T Consensus 557 ~ys~--~-~~~~--~~~~~~---n~~e~ti~gL~k~TeY~~~vvA~N~~G~g~sS~~i~V~Tlsd~PsaPP~Nl~lev~s 628 (1381)
T KOG4221|consen 557 FYSE--D-DTGK--ELRVEN---NATEYTINGLEKYTEYSIRVVAYNSAGSGVSSADITVRTLSDVPSAPPQNLSLEVVS 628 (1381)
T ss_pred EEEc--C-CCCc--eEEEec---CccEEEeecCCCccceEEEEEEecCCCCCCCCCceEEEeccCCCCCCCcceEEEecC
Confidence 9988 2 1222 222222 247899999999999999999999999999999999999999999999999999999
Q ss_pred CCeEEEEEeCCCCccCCceeeEEEEEEEeC
Q psy14533 173 STSLSVTWQPPPLLLQNGEILGYKVYYENM 202 (209)
Q Consensus 173 ~~sv~l~W~~p~~~~~~~~i~~Y~i~y~~~ 202 (209)
.++|++.|.+|+....+|.+.+|+|+|+..
T Consensus 629 StsVrVsW~pP~~~t~ng~itgYkIRy~~~ 658 (1381)
T KOG4221|consen 629 STSVRVSWLPPPSETQNGQITGYKIRYRKL 658 (1381)
T ss_pred CCeEEEEccCCCcccccceEEEEEEEeccc
Confidence 999999999999888999999999999965
No 4
>KOG3513|consensus
Probab=99.84 E-value=9.9e-19 Score=137.84 Aligned_cols=185 Identities=30% Similarity=0.432 Sum_probs=149.9
Q ss_pred ceEEE--eecCCCceEEEEEEEEcCCCCCCCCCCeeEecCCCCCCCCCCceEEEeecCCeEEEEecCCCCCCCCCcceeE
Q psy14533 13 QWAVL--QDLLPATLYRVRVLAENSLGAGRPSDPLLVHTEAEPPTAEPSGLHAVAISSDSIRVTWSPPPAHLTNGDLLGY 90 (209)
Q Consensus 13 ~~~~i--~~L~p~~~Y~~~v~a~~~~g~~~~s~~~~~~t~~~~~~~~p~~~~~~~~~~~~~~l~W~~~~~~~~~~~~~~y 90 (209)
..+.+ ....|.+.|.+.|+++|..|.+..+....+...++.|+.+|..+.+...+.+.+.|+|+++.. .+|.+.+|
T Consensus 776 ~~~V~~~~st~~~tpyevKVqa~N~~GeGp~s~~~v~~S~Ed~P~~ap~~~~~~~~s~s~~~v~W~~~~~--~nG~l~gY 853 (1051)
T KOG3513|consen 776 PRYVVSNESTEPFTPYEVKVQAINDQGEGPESQVTVGYSGEDEPPVAPTKLSAKPLSSSEVNLSWKPPLW--DNGKLTGY 853 (1051)
T ss_pred ceEEEcCCCCCCcceeEEEEEEecCCCCCCCCceEEEEcCCCCCCCCCccceeecccCceEEEEecCcCc--cCCcccee
Confidence 34444 344669999999999999999998988999999999999999999999999999999998865 46999999
Q ss_pred EEEEEEccccCCCCcceEEeecCCCCcceEEecCCCCCcEEEEEEEEEcCCCCccCCccEEEEeccCCCCC---CCcceE
Q psy14533 91 YLGYREQGFGRQNSYNFTTIPNRSDGAGVATLTGLRKYRKYDIVVQAFNEKGPGPMSSEVSVQTLEDVPAA---PPLDIT 167 (209)
Q Consensus 91 ~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~p~~~Y~~~v~a~~~~g~~~~s~~~~~~t~~~~p~~---~p~~~~ 167 (209)
+|.|++. +..........+.. ...+..|.+|++.+.|.|.|+|++..|.|+.|.....++.+..|+. +|....
T Consensus 854 ~v~Y~~~-~~~~~~~~~~~i~~---~~~~~~ltgL~~~T~Y~~~vrA~nsaG~Gp~s~~~~~tt~k~pPs~~~~~p~g~~ 929 (1051)
T KOG3513|consen 854 EVKYWKI-NEKEGSLSRVQIAG---NRTSWRLTGLEPNTKYRFYVRAYTSAGGGPASSEENVTTKKAPPSQVDIAPPGNF 929 (1051)
T ss_pred EEEEEEc-CCCcccccceeecC---CcceEeeeCCCCCceEEEEEEEecCCCCCCCccceeccccCCCCcccccCCCcce
Confidence 9999992 22212222222322 2578899999999999999999999999999999999888888875 234556
Q ss_pred EEEecCCeEEEEEeCCCCccCCceeeEEEEEEEeCC
Q psy14533 168 CSALSSTSLSVTWQPPPLLLQNGEILGYKVYYENMR 203 (209)
Q Consensus 168 ~~~~~~~sv~l~W~~p~~~~~~~~i~~Y~i~y~~~~ 203 (209)
.-.....++.|.|....+.+....+.+|+|.|++..
T Consensus 930 ~~~~~~~~~~l~w~~v~~~~nes~v~gYkV~~~~~~ 965 (1051)
T KOG3513|consen 930 IWKFSASILLLLWLLVSAFENESEVGGYKVLYREDL 965 (1051)
T ss_pred EEeeeeeEEEEEEeeEEEEeecccCcceEEEEeecc
Confidence 666778889999998876666667999999999864
No 5
>KOG0196|consensus
Probab=99.66 E-value=1.3e-14 Score=111.50 Aligned_cols=138 Identities=33% Similarity=0.473 Sum_probs=111.6
Q ss_pred CcceEEEeecCCCceEEEEEEEEcCCCC-C---CCCCCeeEecCCCCCCCCCCceEEEeecCCeEEEEecCCCCCCCCCc
Q psy14533 11 TEQWAVLQDLLPATLYRVRVLAENSLGA-G---RPSDPLLVHTEAEPPTAEPSGLHAVAISSDSIRVTWSPPPAHLTNGD 86 (209)
Q Consensus 11 ~~~~~~i~~L~p~~~Y~~~v~a~~~~g~-~---~~s~~~~~~t~~~~~~~~p~~~~~~~~~~~~~~l~W~~~~~~~~~~~ 86 (209)
.+++..+.+|.|.+.|.|.|.|.|+... + .....++++|...+|+. ...++....+.++|.|+|..|.. +++.
T Consensus 396 t~~~V~v~~L~ah~~YTFeV~AvNgVS~lsp~~~~~a~vnItt~qa~ps~-V~~~r~~~~~~~sitlsW~~p~~--png~ 472 (996)
T KOG0196|consen 396 TETSVTVSDLLAHTNYTFEVEAVNGVSDLSPFPRQFASVNITTNQAAPSP-VSVLRQVSRTSDSITLSWSEPDQ--PNGV 472 (996)
T ss_pred ccceEEEeccccccccEEEEEEeecccccCCCCCcceeEEeeccccCCCc-cceEEEeeeccCceEEecCCCCC--CCCc
Confidence 5678999999999999999999998743 2 22445788888877764 46788888899999999998854 6889
Q ss_pred ceeEEEEEEEccccCCCCcceEEeecCCCCcceEEecCCCCCcEEEEEEEEEcCCCCccCCccEEEEeccC
Q psy14533 87 LLGYYLGYREQGFGRQNSYNFTTIPNRSDGAGVATLTGLRKYRKYDIVVQAFNEKGPGPMSSEVSVQTLED 157 (209)
Q Consensus 87 ~~~y~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~p~~~Y~~~v~a~~~~g~~~~s~~~~~~t~~~ 157 (209)
+..|+|.|.++. ........+... .+..++.+|+|++.|.|+|+|.+..|.|.+|....+.|.+.
T Consensus 473 ildYEvky~ek~---~~e~~~~~~~t~---~~~~ti~gL~p~t~YvfqVRarT~aG~G~~S~~~~fqT~~~ 537 (996)
T KOG0196|consen 473 ILDYEVKYYEKD---EDERSYSTLKTK---TTTATITGLKPGTVYVFQVRARTAAGYGPYSGKHEFQTLPS 537 (996)
T ss_pred ceeEEEEEeecc---ccccceeEEecc---cceEEeeccCCCcEEEEEEEEecccCCCCCCCceeeeecCc
Confidence 999999999832 123334444332 57889999999999999999999999999999999998764
No 6
>KOG0196|consensus
Probab=99.55 E-value=9e-13 Score=101.61 Aligned_cols=150 Identities=25% Similarity=0.373 Sum_probs=110.0
Q ss_pred CCCCCCCCCceEEEeecCCeEEEEecCCCCCCCCCcceeEEEEEEEccccC--CCCc--ceEEeec-CCCCcceEEecCC
Q psy14533 51 AEPPTAEPSGLHAVAISSDSIRVTWSPPPAHLTNGDLLGYYLGYREQGFGR--QNSY--NFTTIPN-RSDGAGVATLTGL 125 (209)
Q Consensus 51 ~~~~~~~p~~~~~~~~~~~~~~l~W~~~~~~~~~~~~~~y~v~~~~~~~~~--~~~~--~~~~~~~-~~~~~~~~~~~~L 125 (209)
+..|+++|.++... +..+++.|.|.+|.+.. +..-..|.|.++...... ...+ .....+. .+....++.+.+|
T Consensus 328 CT~PPSaP~nlis~-vn~Ts~~L~W~~P~d~G-GR~Di~y~v~Ck~c~~~~~~C~~Cg~~V~f~P~q~gLt~~~V~v~~L 405 (996)
T KOG0196|consen 328 CTRPPSAPRNLISN-VNGTSLILEWSPPADTG-GREDITYNVICKKCGGGRGACEPCGDNVRFTPRQRGLTETSVTVSDL 405 (996)
T ss_pred CCCCCCccceeeee-cccceEEEEecCCcccC-CCcceEEEEEeeccCCCCCccccCCCCceECCCCCCcccceEEEecc
Confidence 44677788888765 78899999999987543 444578999888732111 1111 1111111 2334578999999
Q ss_pred CCCcEEEEEEEEEcCCC-Cc---cCCccEEEEeccCCCCCCCcceEEEEecCCeEEEEEeCCCCccCCceeeEEEEEEEe
Q psy14533 126 RKYRKYDIVVQAFNEKG-PG---PMSSEVSVQTLEDVPAAPPLDITCSALSSTSLSVTWQPPPLLLQNGEILGYKVYYEN 201 (209)
Q Consensus 126 ~p~~~Y~~~v~a~~~~g-~~---~~s~~~~~~t~~~~p~~~p~~~~~~~~~~~sv~l~W~~p~~~~~~~~i~~Y~i~y~~ 201 (209)
.+.+.|+|.|.|+|+.. .+ .....+.++|...+|+ +..+++......+++.|+|..| ...||.|..|+|.|.+
T Consensus 406 ~ah~~YTFeV~AvNgVS~lsp~~~~~a~vnItt~qa~ps-~V~~~r~~~~~~~sitlsW~~p--~~png~ildYEvky~e 482 (996)
T KOG0196|consen 406 LAHTNYTFEVEAVNGVSDLSPFPRQFASVNITTNQAAPS-PVSVLRQVSRTSDSITLSWSEP--DQPNGVILDYEVKYYE 482 (996)
T ss_pred ccccccEEEEEEeecccccCCCCCcceeEEeeccccCCC-ccceEEEeeeccCceEEecCCC--CCCCCcceeEEEEEee
Confidence 99999999999998753 22 3345678888888888 5558999999999999999999 7899999999999999
Q ss_pred CCCc
Q psy14533 202 MREL 205 (209)
Q Consensus 202 ~~~~ 205 (209)
.+.-
T Consensus 483 k~~~ 486 (996)
T KOG0196|consen 483 KDED 486 (996)
T ss_pred cccc
Confidence 8653
No 7
>PF00041 fn3: Fibronectin type III domain; InterPro: IPR003961 Fibronectins are multi-domain glycoproteins found in a soluble form in plasma, and in an insoluble form in loose connective tissue and basement membranes. They contain multiple copies of 3 repeat regions (types I, II and III), which bind to a variety of substances including heparin, collagen, DNA, actin, fibrin and fibronectin receptors on cell surfaces. The wide variety of these substances means that fibronectins are involved in a number of important functions: e.g., wound healing; cell adhesion; blood coagulation; cell differentiation and migration; maintenance of the cellular cytoskeleton; and tumour metastasis. The role of fibronectin in cell differentiation is demonstrated by the marked reduction in the expression of its gene when neoplastic transformation occurs. Cell attachment has been found to be mediated by the binding of the tetrapeptide RGDS to integrins on the cell surface, although related sequences can also display cell adhesion activity. Plasma fibronectin occurs as a dimer of 2 different subunits, linked together by 2 disulphide bonds near the C terminus. The difference in the 2 chains occurs in the type III repeat region and is caused by alternative splicing of the mRNA from one gene. The observation that, in a given protein, an individual repeat of one of the 3 types (e.g., the first FnIII repeat) shows much less similarity to its subsequent tandem repeats within that protein than to its equivalent repeat between fibronectins from other species, has suggested that the repeating structure of fibronectin arose at an early stage of evolution. It also seems to suggest that the structure is subject to high selective pressure. The fibronectin type III repeat region is an approximately 100 amino acid domain, different tandem repeats of which contain binding sites for DNA, heparin and the cell surface.
The superfamily of sequences believed to contain FnIII repeats represents 45 different families, the majority of which are involved in cell surface binding in some manner, or are receptor protein tyrosine kinases, or cytokine receptors.; GO: 0005515 protein binding; PDB: 1UEM_A 1TDQ_A 1X5I_A 2IC2_B 2IBG_C 2IBB_A 3R8Q_A 2FNB_A 1FNH_A 2EDB_A ....
Probab=99.40 E-value=5.1e-12 Score=73.77 Aligned_cols=83 Identities=34% Similarity=0.636 Sum_probs=65.1
Q ss_pred CCCceEEEeecCCeEEEEecCCCCCCCCCcceeEEEEEEEccccCCCCcceEEeecCCCCcceEEecCCCCCcEEEEEEE
Q psy14533 57 EPSGLHAVAISSDSIRVTWSPPPAHLTNGDLLGYYLGYREQGFGRQNSYNFTTIPNRSDGAGVATLTGLRKYRKYDIVVQ 136 (209)
Q Consensus 57 ~p~~~~~~~~~~~~~~l~W~~~~~~~~~~~~~~y~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~p~~~Y~~~v~ 136 (209)
+|.++.+.....+++.|.|+.+. ..++.+.+|.|.|.. ...........+... ...+.+.+|.|++.|.|+|+
T Consensus 2 ~P~~l~v~~~~~~sv~v~W~~~~--~~~~~~~~y~v~~~~--~~~~~~~~~~~~~~~---~~~~~i~~L~p~t~Y~~~v~ 74 (85)
T PF00041_consen 2 APENLSVSNISPTSVTVSWKPPS--SGNGPITGYRVEYRS--VNSTSDWQEVTVPGN---ETSYTITGLQPGTTYEFRVR 74 (85)
T ss_dssp SSEEEEEEEECSSEEEEEEEESS--STSSSESEEEEEEEE--TTSSSEEEEEEEETT---SSEEEEESCCTTSEEEEEEE
T ss_pred cCcCeEEEECCCCEEEEEEECCC--CCCCCeeEEEEEEEe--cccceeeeeeeeeee---eeeeeeccCCCCCEEEEEEE
Confidence 57889999999999999999986 247889999999988 222222334444432 45899999999999999999
Q ss_pred EEcCCCCccC
Q psy14533 137 AFNEKGPGPM 146 (209)
Q Consensus 137 a~~~~g~~~~ 146 (209)
|.+..|.|++
T Consensus 75 a~~~~g~g~~ 84 (85)
T PF00041_consen 75 AVNSDGEGPP 84 (85)
T ss_dssp EEETTEEEEE
T ss_pred EEeCCcCcCC
Confidence 9999887754
No 8
>KOG4222|consensus
Probab=99.38 E-value=8.4e-12 Score=99.37 Aligned_cols=188 Identities=26% Similarity=0.404 Sum_probs=135.2
Q ss_pred CcceEEEeecCCCceEEEEEEEEcCCCCCCCCCC-eeEecCCCCCCCC---------------CCceEEEeecCCeEEEE
Q psy14533 11 TEQWAVLQDLLPATLYRVRVLAENSLGAGRPSDP-LLVHTEAEPPTAE---------------PSGLHAVAISSDSIRVT 74 (209)
Q Consensus 11 ~~~~~~i~~L~p~~~Y~~~v~a~~~~g~~~~s~~-~~~~t~~~~~~~~---------------p~~~~~~~~~~~~~~l~ 74 (209)
..+.+.|.+|+|+..|.|.|++.|..|.+.++.. -.++|....+.+. -.......+..++|++.
T Consensus 582 ~~t~~~I~gL~P~~sylf~vRa~n~~Gis~Ps~~S~~vrta~a~~~~a~ad~~k~~~~ls~~l~~l~~~~~L~asslr~~ 661 (1281)
T KOG4222|consen 582 KTTTYAIRGLKPNLSYLFLVRAENEQGISDPSTSSDPVRTAPADAAAAGADHQKVQRELSNELLRLSNPNVLNASSLRLG 661 (1281)
T ss_pred ccceeeecCcCccceeeeeeeccccccccCCcccCCccccCCCChhhhhhhHHHHHHhhcccceeeccccccchhheeee
Confidence 4578999999999999999999999998765432 2333322211100 01112345578899999
Q ss_pred ecCCCCCCCCCcceeEEEEEEEccccCCCCcceEEeecCCCCcceEEecCCCCCcEEEEEEEEEcCCC---CccCCccEE
Q psy14533 75 WSPPPAHLTNGDLLGYYLGYREQGFGRQNSYNFTTIPNRSDGAGVATLTGLRKYRKYDIVVQAFNEKG---PGPMSSEVS 151 (209)
Q Consensus 75 W~~~~~~~~~~~~~~y~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~p~~~Y~~~v~a~~~~g---~~~~s~~~~ 151 (209)
|..... +....+.+|+|.|+.... ....+....+. ......+.+.+|+|++.|++.+.-+...| .+..+....
T Consensus 662 w~~~kq-~~~~~i~g~~I~~r~~~~-~~a~~s~~~v~--~~t~~s~v~~nl~p~t~ye~f~~Pf~~~~~s~~g~pS~sk~ 737 (1281)
T KOG4222|consen 662 WTKDKQ-HGSQYIQGYRISYRSLGS-QLAQWSNAGVT--VPTPESVVVPNLKPGTNYEFFVRPFFPHGYSIQGAPSNSKT 737 (1281)
T ss_pred eeeecc-cCcccccceEEEeccCcc-cccccccccee--ccCCcceeccccCCCccceeeccCccCCCcceecCCccccc
Confidence 987653 235678999999998322 11222222222 22246788999999999999999987744 567788888
Q ss_pred EEeccCCCCCCCcce---EEEEecCCeEEEEEeCCCCccCCceeeEEEEEEEeC
Q psy14533 152 VQTLEDVPAAPPLDI---TCSALSSTSLSVTWQPPPLLLQNGEILGYKVYYENM 202 (209)
Q Consensus 152 ~~t~~~~p~~~p~~~---~~~~~~~~sv~l~W~~p~~~~~~~~i~~Y~i~y~~~ 202 (209)
+.+....|+.+|.++ +....+.+++.|+|.+|+.+..++.+.+|+|.....
T Consensus 738 alt~e~~PSapp~~~~~~s~~~~n~Ta~~Vsw~~pp~d~~ng~~qg~ki~~~~~ 791 (1281)
T KOG4222|consen 738 ALTLEEPPSAPPQGVQHVSKGSYNGTAGSVSWAPPPADVQNGILQGYKIECSGG 791 (1281)
T ss_pred ccccccCCCCCCCCccccccccCCCceeeEEecCCcccccCCcccceeEEeecC
Confidence 999999999999995 455568899999999999888999999999976554
No 9
>KOG4258|consensus
Probab=99.19 E-value=3.1e-10 Score=88.31 Aligned_cols=184 Identities=27% Similarity=0.351 Sum_probs=121.7
Q ss_pred eEEEeecCCCceEEEEEEEEcCCCC--C--CCCCCeeEecCCCCCCCCCCceEEEeecCCeEEEEecCCCCCCCCCccee
Q psy14533 14 WAVLQDLLPATLYRVRVLAENSLGA--G--RPSDPLLVHTEAEPPTAEPSGLHAVAISSDSIRVTWSPPPAHLTNGDLLG 89 (209)
Q Consensus 14 ~~~i~~L~p~~~Y~~~v~a~~~~g~--~--~~s~~~~~~t~~~~~~~~p~~~~~~~~~~~~~~l~W~~~~~~~~~~~~~~ 89 (209)
-+.+.+|+|.+.|.+.|.+...... . ..|....++|....|. +|.........+++|.|+|.+|.. ++|.++.
T Consensus 567 ~~~l~~LkP~TqYAvfVkT~t~t~~~~~~~A~S~I~YvqT~~~~Ps-pPl~~ls~snsSsqi~l~W~pP~~--pNG~lt~ 643 (1025)
T KOG4258|consen 567 GFLLDGLKPWTQYAVFVKTLTVTEAHEAYEAKSKIGYVQTLPDIPS-PPLDVLSKSNSSSQILLKWKPPSQ--PNGNLTH 643 (1025)
T ss_pred ceehhcCCccceeEEEEeeeehhhhccccccccceEEEEecCCCCC-CcchhhhccCcchheeEEecCCCC--CCCceeE
Confidence 5789999999999999998843321 1 2366778889888876 567766666677799999999965 6899999
Q ss_pred EEEEEEEccccCC-------------------CCcceEEee-----------cCC-------C-----------------
Q psy14533 90 YYLGYREQGFGRQ-------------------NSYNFTTIP-----------NRS-------D----------------- 115 (209)
Q Consensus 90 y~v~~~~~~~~~~-------------------~~~~~~~~~-----------~~~-------~----------------- 115 (209)
|.|.|........ ....+.... ... .
T Consensus 644 Ylv~wer~~~~~yl~~~nYC~~~~k~p~~~~~p~~~~ed~d~~~e~e~~~~~Cc~c~~~~~~~~~e~eea~~~~~FEd~L 723 (1025)
T KOG4258|consen 644 YLVVWERQAEDGYLEQRNYCHKGLKLPIRADLPSFDSEDMDPLLEMEGHTGPCCSCPPTESYPQYEDEEASEQKTFEDFL 723 (1025)
T ss_pred EEEEEEeccCCchHHHhccccccccccccccCCCCchhhcchhhhhccCCCCCCCCCcccccCchhhHHHHHHHHHhhhc
Confidence 9999887311000 000000000 000 0
Q ss_pred --------------------------------------------------------CcceEEecCCCCCcEEEEEEEEEc
Q psy14533 116 --------------------------------------------------------GAGVATLTGLRKYRKYDIVVQAFN 139 (209)
Q Consensus 116 --------------------------------------------------------~~~~~~~~~L~p~~~Y~~~v~a~~ 139 (209)
...++.+.+|+.++.|.+.++|++
T Consensus 724 ~n~i~vpr~~~~krk~l~~~~n~t~~~~~~~~~~p~t~~~t~p~ei~e~~p~~~n~n~~~~vi~~Lrh~tlY~i~l~aCn 803 (1025)
T KOG4258|consen 724 HNAIFVPRRPDRKRKSLDDVENCTRLAPTRKAEEPTTPPTTAPTEIEEPKPRLENGNKESYVISGLRHFTLYRIDLQACN 803 (1025)
T ss_pred cceeeecccCcccccccccccceeeccccccccCCCCCCCCCCccccccCcccccccchhhhhhccccchhhhhhHhhhc
Confidence 012566789999999999999998
Q ss_pred CCCC----ccCCccEEEEeccCC-CCCCCcceEEEEe-cCCeEEEEEeCCCCccCCceeeEEEEEEEeCC
Q psy14533 140 EKGP----GPMSSEVSVQTLEDV-PAAPPLDITCSAL-SSTSLSVTWQPPPLLLQNGEILGYKVYYENMR 203 (209)
Q Consensus 140 ~~g~----~~~s~~~~~~t~~~~-p~~~p~~~~~~~~-~~~sv~l~W~~p~~~~~~~~i~~Y~i~y~~~~ 203 (209)
..+. +. ...+..+|.+.. ...-|.-+.-+.. +.+++.|.|.+| .+.||.|..|.|.|+..+
T Consensus 804 h~~~~~~cS~-a~~v~~RT~~~~~aD~i~g~v~we~~~~~~~v~l~w~EP--~~pNGli~~Y~Vk~r~~~ 870 (1025)
T KOG4258|consen 804 HATPKCGCSH-AAFVFARTMPTMGADDIPGPVTWECHIEMNSVILRWLEP--KEPNGLILNYEVKYRRNG 870 (1025)
T ss_pred ccccccccch-hhhhhhccccccccccCCCceeEecccCcceEEEecCCC--CCCCccEEEEEEEEeecc
Confidence 7763 21 112233333221 1112223444444 788999999999 689999999999999654
No 10
>KOG4258|consensus
Probab=99.13 E-value=2.6e-09 Score=83.31 Aligned_cols=140 Identities=30% Similarity=0.489 Sum_probs=99.6
Q ss_pred eEEEeecCCeEEEEecCCCCCCCCCcceeEEEEEEEcc---------c--cCCCCcceEEeecC---C--CCcceEEecC
Q psy14533 61 LHAVAISSDSIRVTWSPPPAHLTNGDLLGYYLGYREQG---------F--GRQNSYNFTTIPNR---S--DGAGVATLTG 124 (209)
Q Consensus 61 ~~~~~~~~~~~~l~W~~~~~~~~~~~~~~y~v~~~~~~---------~--~~~~~~~~~~~~~~---~--~~~~~~~~~~ 124 (209)
+.....+.+++.++|...... ...+..+|.+.|+... . +....|....+... . .....+.+.+
T Consensus 494 ~~~~~~~~dsi~lrW~~~~~~-d~r~llg~~~~yKEaP~qNvT~~dg~~aCg~~~W~~~~v~~~~~~p~~~~~~~~~l~~ 572 (1025)
T KOG4258|consen 494 FSSTVTSADSILLRWERYQPP-DMRDLLGFLLHYKEAPFQNVTEEDGRDACGSNSWNVVDVDPPDLIPNDGTHPGFLLDG 572 (1025)
T ss_pred eeeEEeecceeEEEecccCCc-chhhhheeeEeeccCCccccceecCccccccCcceEEeccCCcCCCccccccceehhc
Confidence 344566789999999875432 1345789999888631 1 11111222222221 1 1224688999
Q ss_pred CCCCcEEEEEEEEEcCC--C--CccCCccEEEEeccCCCCCCCcceEEEEecCCeEEEEEeCCCCccCCceeeEEEEEEE
Q psy14533 125 LRKYRKYDIVVQAFNEK--G--PGPMSSEVSVQTLEDVPAAPPLDITCSALSSTSLSVTWQPPPLLLQNGEILGYKVYYE 200 (209)
Q Consensus 125 L~p~~~Y~~~v~a~~~~--g--~~~~s~~~~~~t~~~~p~~~p~~~~~~~~~~~sv~l~W~~p~~~~~~~~i~~Y~i~y~ 200 (209)
|+|+|+|-+.|.+.... + .-..|....++|..+.|+ +|..+.......++|.|+|.+| .++||.+.+|.+.++
T Consensus 573 LkP~TqYAvfVkT~t~t~~~~~~~A~S~I~YvqT~~~~Ps-pPl~~ls~snsSsqi~l~W~pP--~~pNG~lt~Ylv~we 649 (1025)
T KOG4258|consen 573 LKPWTQYAVFVKTLTVTEAHEAYEAKSKIGYVQTLPDIPS-PPLDVLSKSNSSSQILLKWKPP--SQPNGNLTHYLVVWE 649 (1025)
T ss_pred CCccceeEEEEeeeehhhhccccccccceEEEEecCCCCC-CcchhhhccCcchheeEEecCC--CCCCCceeEEEEEEE
Confidence 99999999999998422 2 224688889999999999 7777777777788999999999 799999999999998
Q ss_pred eCCC
Q psy14533 201 NMRE 204 (209)
Q Consensus 201 ~~~~ 204 (209)
.-++
T Consensus 650 r~~~ 653 (1025)
T KOG4258|consen 650 RQAE 653 (1025)
T ss_pred eccC
Confidence 8644
No 11
>KOG4802|consensus
Probab=99.09 E-value=5.7e-09 Score=75.08 Aligned_cols=132 Identities=20% Similarity=0.357 Sum_probs=87.5
Q ss_pred eEEecCC-CcceEEEeecCCCceEEEEEEEEcCCCCC---CCCCCeeEecCCCCCCCCCCceEEEee---cCCeEEEEec
Q psy14533 4 WKSQNSG-TEQWAVLQDLLPATLYRVRVLAENSLGAG---RPSDPLLVHTEAEPPTAEPSGLHAVAI---SSDSIRVTWS 76 (209)
Q Consensus 4 w~~~~~~-~~~~~~i~~L~p~~~Y~~~v~a~~~~g~~---~~s~~~~~~t~~~~~~~~p~~~~~~~~---~~~~~~l~W~ 76 (209)
|+..... ....++.++++||.-|.|+|.|+|..|.. +++......-...+|+ +|.++.+... +.-.+.|.|.
T Consensus 199 wQtv~~t~~e~~~~~t~~rPgRwyefrvaavn~~G~rGFs~PSkpf~ssk~pkaPp-~P~dl~l~~v~~dG~~~~~v~w~ 277 (516)
T KOG4802|consen 199 WQTVEKTMEENTYIFTDMRPGRWYEFRVAAVNAYGFRGFSEPSKPFPSSKNPKAPP-SPNDLKLIGVQFDGRYMLKVVWC 277 (516)
T ss_pred ceeeeecCCCceeeeeecCcceeEEEEEeeeecccccccCCCCCCCCCCCCCCCCc-CcccceeeeeeecceEEEEEEeC
Confidence 6655443 44589999999999999999999988764 3444444444444444 5778776543 2234667787
Q ss_pred CCCCCCCCCcceeEEEEEEEccccCCCCcc---eEEeecCCCCcceEEecCCCCCcEEEEEEEEEcCCCCc
Q psy14533 77 PPPAHLTNGDLLGYYLGYREQGFGRQNSYN---FTTIPNRSDGAGVATLTGLRKYRKYDIVVQAFNEKGPG 144 (209)
Q Consensus 77 ~~~~~~~~~~~~~y~v~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~L~p~~~Y~~~v~a~~~~g~~ 144 (209)
++.. +-++..|+|.|....+. ....+ ...... ...+.|++|.|+..|.+.|+|....|.+
T Consensus 278 P~~s---dlPv~~Yki~Ws~~v~s-~k~~m~tks~~~k~----thq~si~~L~Pns~Y~VevqAi~y~g~~ 340 (516)
T KOG4802|consen 278 PSKS---DLPVEKYKITWSLYVNS-AKASMITKSSYVKD----THQFSIKELLPNSSYYVEVQAISYLGSR 340 (516)
T ss_pred CCCC---CCcceeeEEEeehhhhh-hhhhcccccceeec----cchhhhhhcCCCCeEEEEEEEEEeccCc
Confidence 7664 67899999999872211 11111 111111 2345599999999999999998876644
No 12
>KOG4222|consensus
Probab=98.96 E-value=1.5e-08 Score=81.50 Aligned_cols=139 Identities=28% Similarity=0.364 Sum_probs=105.6
Q ss_pred CcceEEEeecCCCceEEEEEEEEcCCC---CCCCCCCeeEecCCCCCCCCCCce---EEEeecCCeEEEEecCCCCCCCC
Q psy14533 11 TEQWAVLQDLLPATLYRVRVLAENSLG---AGRPSDPLLVHTEAEPPTAEPSGL---HAVAISSDSIRVTWSPPPAHLTN 84 (209)
Q Consensus 11 ~~~~~~i~~L~p~~~Y~~~v~a~~~~g---~~~~s~~~~~~t~~~~~~~~p~~~---~~~~~~~~~~~l~W~~~~~~~~~ 84 (209)
...++.+.+|+|++.|+|.++.+...+ .+.++.+..+.+.+..|..+|..+ ........++.|+|.+++....+
T Consensus 699 t~~s~v~~nl~p~t~ye~f~~Pf~~~~~s~~g~pS~sk~alt~e~~PSapp~~~~~~s~~~~n~Ta~~Vsw~~pp~d~~n 778 (1281)
T KOG4222|consen 699 TPESVVVPNLKPGTNYEFFVRPFFPHGYSIQGAPSNSKTALTLEEPPSAPPQGVQHVSKGSYNGTAGSVSWAPPPADVQN 778 (1281)
T ss_pred CCcceeccccCCCccceeeccCccCCCcceecCCcccccccccccCCCCCCCCccccccccCCCceeeEEecCCcccccC
Confidence 446789999999999999999998855 356788889999999998888884 33445677899999999877779
Q ss_pred CcceeEEEEEEEccccCCCCcceEEeecCCCCcceEEecCCCCCcEEEEEEEEEcCCCCccCCccEEEEe
Q psy14533 85 GDLLGYYLGYREQGFGRQNSYNFTTIPNRSDGAGVATLTGLRKYRKYDIVVQAFNEKGPGPMSSEVSVQT 154 (209)
Q Consensus 85 ~~~~~y~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~p~~~Y~~~v~a~~~~g~~~~s~~~~~~t 154 (209)
+.+.+|.|++.. ..........+... .....+|.+|.++..|.|++.+.+..|.|..+......+
T Consensus 779 g~~qg~ki~~~~--~e~tr~h~n~t~~a---~~~sv~i~~l~~g~ay~vtv~a~T~aGvG~~s~p~~~~~ 843 (1281)
T KOG4222|consen 779 GILQGYKIECSG--GEKTRIHINKTTNA---RTGSVTIGNLVTGIAYSVTVAARTGAGVGVKSPPQPIVF 843 (1281)
T ss_pred CcccceeEEeec--CccccccccccccC---CCCceEeccccccceEEEEEeeecCCccCCCCCCeeeec
Confidence 999999998876 11011111122222 247889999999999999999999999887665554433
No 13
>cd00063 FN3 Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
Probab=98.80 E-value=3.4e-07 Score=53.61 Aligned_cols=85 Identities=31% Similarity=0.604 Sum_probs=59.3
Q ss_pred CCCceEEEeecCCeEEEEecCCCCCCCCCcceeEEEEEEEccccCCCCcceEEeecCCCCcceEEecCCCCCcEEEEEEE
Q psy14533 57 EPSGLHAVAISSDSIRVTWSPPPAHLTNGDLLGYYLGYREQGFGRQNSYNFTTIPNRSDGAGVATLTGLRKYRKYDIVVQ 136 (209)
Q Consensus 57 ~p~~~~~~~~~~~~~~l~W~~~~~~~~~~~~~~y~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~p~~~Y~~~v~ 136 (209)
+|.++.+.....+++.|.|..+... .+.+..|.|.+.. .. ...+...... ......+.+.+|.|++.|.++|.
T Consensus 3 ~p~~~~~~~~~~~~~~v~W~~~~~~--~~~~~~y~v~~~~--~~-~~~~~~~~~~--~~~~~~~~i~~l~p~~~Y~~~v~ 75 (93)
T cd00063 3 PPTNLRVTDVTSTSVTLSWTPPEDD--GGPITGYVVEYRE--KG-SGDWKEVEVT--PGSETSYTLTGLKPGTEYEFRVR 75 (93)
T ss_pred CCCCcEEEEecCCEEEEEECCCCCC--CCcceeEEEEEee--CC-CCCCEEeecc--CCcccEEEEccccCCCEEEEEEE
Confidence 4556666666789999999987542 2567899999987 21 2222222211 01247889999999999999999
Q ss_pred EEcCCCCccCCc
Q psy14533 137 AFNEKGPGPMSS 148 (209)
Q Consensus 137 a~~~~g~~~~s~ 148 (209)
+.+..|.+..+.
T Consensus 76 a~~~~~~~~~s~ 87 (93)
T cd00063 76 AVNGGGESPPSE 87 (93)
T ss_pred EECCCccCCCcc
Confidence 998877775554
No 14
>smart00060 FN3 Fibronectin type 3 domain. One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
Probab=98.41 E-value=1.7e-05 Score=45.01 Aligned_cols=78 Identities=32% Similarity=0.596 Sum_probs=49.1
Q ss_pred CCceEEEeecCCeEEEEecCCCCCCCCCcceeEEEEEEEccccCCCCcceEEeecCCCCcceEEecCCCCCcEEEEEEEE
Q psy14533 58 PSGLHAVAISSDSIRVTWSPPPAHLTNGDLLGYYLGYREQGFGRQNSYNFTTIPNRSDGAGVATLTGLRKYRKYDIVVQA 137 (209)
Q Consensus 58 p~~~~~~~~~~~~~~l~W~~~~~~~~~~~~~~y~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~p~~~Y~~~v~a 137 (209)
|..+.+......++.|.|.++... .. .+|.+.+........ ..+........ ...+.+.+|.+++.|.|+|.+
T Consensus 4 p~~~~~~~~~~~~~~v~W~~~~~~---~~-~~y~~~~~~~~~~~~--~~~~~~~~~~~-~~~~~i~~L~~~~~Y~v~v~a 76 (83)
T smart00060 4 PSNLRVTDVTSTSVTLSWEPPPDD---GI-TGYIVGYRVEYREEG--SSWKEVNVTPS-STSYTLTGLKPGTEYEFRVRA 76 (83)
T ss_pred CCcEEEEEEeCCEEEEEECCCCCC---CC-CccEEEEEEEEecCC--CccEEEEecCC-ccEEEEeCcCCCCEEEEEEEE
Confidence 345666666677999999976542 12 677777765211111 11222221111 368999999999999999999
Q ss_pred EcCCC
Q psy14533 138 FNEKG 142 (209)
Q Consensus 138 ~~~~g 142 (209)
.+..|
T Consensus 77 ~~~~g 81 (83)
T smart00060 77 VNGAG 81 (83)
T ss_pred EcccC
Confidence 98644
No 15
>KOG4802|consensus
Probab=98.31 E-value=1.8e-05 Score=57.67 Aligned_cols=132 Identities=18% Similarity=0.211 Sum_probs=75.1
Q ss_pred ecCCeEEEEecCCCCCCCCCcceeEEEEEEEc-cccCCCCcceEEeecCCCCcceEEecCCCCCcEEEEEEEEEcCCCC-
Q psy14533 66 ISSDSIRVTWSPPPAHLTNGDLLGYYLGYREQ-GFGRQNSYNFTTIPNRSDGAGVATLTGLRKYRKYDIVVQAFNEKGP- 143 (209)
Q Consensus 66 ~~~~~~~l~W~~~~~~~~~~~~~~y~v~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~L~p~~~Y~~~v~a~~~~g~- 143 (209)
...+++.+.|..............++..|..- .......+.|.++.... ..+.+...++.||..|.|+|.|+|..|.
T Consensus 157 ~~~g~~av~w~~~~~~~v~~~~~~vr~~w~~g~hase~~~thwQtv~~t~-~e~~~~~t~~rPgRwyefrvaavn~~G~r 235 (516)
T KOG4802|consen 157 RSRGSHAVDWKIESSLLVYYVHVEVRSHWGRGFHASELGPTHWQTVEKTM-EENTYIFTDMRPGRWYEFRVAAVNAYGFR 235 (516)
T ss_pred hccCceeeeeeeccccceeeeehhhhhhhcccccccccccccceeeeecC-CCceeeeeecCcceeEEEEEeeeeccccc
Confidence 35567888897643210000001111111110 12233345566655432 2367888999999999999999998873
Q ss_pred --ccCCccEEEEeccCCCCCCCcceEEEEec--CC-eEEEEEeCCCCccCCceeeEEEEEEEeC
Q psy14533 144 --GPMSSEVSVQTLEDVPAAPPLDITCSALS--ST-SLSVTWQPPPLLLQNGEILGYKVYYENM 202 (209)
Q Consensus 144 --~~~s~~~~~~t~~~~p~~~p~~~~~~~~~--~~-sv~l~W~~p~~~~~~~~i~~Y~i~y~~~ 202 (209)
+.+|......-.+.+|+ +|.++...... +. ...|.|.++ ..+-+|..|+|.|...
T Consensus 236 GFs~PSkpf~ssk~pkaPp-~P~dl~l~~v~~dG~~~~~v~w~P~---~sdlPv~~Yki~Ws~~ 295 (516)
T KOG4802|consen 236 GFSEPSKPFPSSKNPKAPP-SPNDLKLIGVQFDGRYMLKVVWCPS---KSDLPVEKYKITWSLY 295 (516)
T ss_pred ccCCCCCCCCCCCCCCCCc-CcccceeeeeeecceEEEEEEeCCC---CCCCcceeeEEEeehh
Confidence 34444433333333454 78888877542 22 234445544 4566799999998875
No 16
>PF10179 DUF2369: Uncharacterised conserved protein (DUF2369); InterPro: IPR019326 This is a proline-rich region of a group of proteins found from plants to fungi. The function is largely unknown, although the entry contains Fibronectin type-III domain-containing protein C4orf31, which promotes matrix assembly and cell adhesiveness.
Probab=97.94 E-value=0.002 Score=46.05 Aligned_cols=139 Identities=16% Similarity=0.165 Sum_probs=79.8
Q ss_pred CcceEEEeecCCCceEEEEEEEEcCCCCCCCCCCee-EecCCCC--CCCCCCceEEEeec----CCeEEEEecCCCCCCC
Q psy14533 11 TEQWAVLQDLLPATLYRVRVLAENSLGAGRPSDPLL-VHTEAEP--PTAEPSGLHAVAIS----SDSIRVTWSPPPAHLT 83 (209)
Q Consensus 11 ~~~~~~i~~L~p~~~Y~~~v~a~~~~g~~~~s~~~~-~~t~~~~--~~~~p~~~~~~~~~----~~~~~l~W~~~~~~~~ 83 (209)
.-..+.+.++.||..|.+++.+.+...... ..-+. +.+.... -|.-|.+..+.... =++++|.|...++
T Consensus 126 ~~~~f~l~~~~~g~~Yliri~~~~~~e~~~-~~kV~aast~~~~~~~P~LP~d~~Ik~f~~lrtC~SvTIAW~~s~d--- 201 (300)
T PF10179_consen 126 GLRHFRLSGVKPGERYLIRIQISNSDEGPS-TFKVQAASTNPSKQPYPQLPDDTSIKEFNKLRTCNSVTIAWLGSPD--- 201 (300)
T ss_pred ceEEEEECCCCCCCeEEEEEEccCCCCCce-EEEEEEecCCcccCCCCCCCCCCceeEEcCCcccceEEEEEecCCC---
Confidence 345789999999999999998776543322 22233 2332222 24456666664432 2789999998654
Q ss_pred CCcceeEEEEEEEcccc-------------C--CCCcceE-Eee---cCCC----Cc---ceEEecCCCCCcEEEEEEEE
Q psy14533 84 NGDLLGYYLGYREQGFG-------------R--QNSYNFT-TIP---NRSD----GA---GVATLTGLRKYRKYDIVVQA 137 (209)
Q Consensus 84 ~~~~~~y~v~~~~~~~~-------------~--~~~~~~~-~~~---~~~~----~~---~~~~~~~L~p~~~Y~~~v~a 137 (209)
.. ..|.|..+..... . ....... ... ...+ .. ...+|.+|+||+.|.|.|.+
T Consensus 202 -~~-~kYCvy~~~~~~~~~~~~~~~~~n~C~~~~sr~k~e~v~Ck~~~~~n~~~~~~~~v~tetI~~L~PG~~Yl~dV~~ 279 (300)
T PF10179_consen 202 -RS-IKYCVYRREEHSNYQERSVSRMPNQCLGPESRKKSEKVLCKYFHSPNSSEDPQRAVTTETIKGLKPGTTYLFDVYV 279 (300)
T ss_pred -CC-ceEEEEEEEecCchhhhhhcccCccCCCCCccccceEEEEEEEcCCccccccccccceeecccCCCCcEEEEEEEE
Confidence 12 5788876642111 0 0000010 111 1101 11 24478999999999999999
Q ss_pred EcCCCCccCCccEEEEec
Q psy14533 138 FNEKGPGPMSSEVSVQTL 155 (209)
Q Consensus 138 ~~~~g~~~~s~~~~~~t~ 155 (209)
....|.+-+-..+.+.|.
T Consensus 280 ~~~~G~sl~Y~s~~VkTr 297 (300)
T PF10179_consen 280 NGPSGQSLPYRSKWVKTR 297 (300)
T ss_pred ecCCCceeecceEEEEec
Confidence 977776544444455443
No 17
>PF10179 DUF2369: Uncharacterised conserved protein (DUF2369); InterPro: IPR019326 This is a proline-rich region of a group of proteins found from plants to fungi. The function is largely unknown, although the entry contains Fibronectin type-III domain-containing protein C4orf31, which promotes matrix assembly and cell adhesiveness.
Probab=97.94 E-value=0.0032 Score=45.11 Aligned_cols=179 Identities=22% Similarity=0.299 Sum_probs=90.3
Q ss_pred CCcceEEEeecCCCceEEEEEEEEcCCCC-CCCCCCeeEecCCCCCCCCCCceE-----EEee-cCCe-EEEEecCCCCC
Q psy14533 10 GTEQWAVLQDLLPATLYRVRVLAENSLGA-GRPSDPLLVHTEAEPPTAEPSGLH-----AVAI-SSDS-IRVTWSPPPAH 81 (209)
Q Consensus 10 ~~~~~~~i~~L~p~~~Y~~~v~a~~~~g~-~~~s~~~~~~t~~~~~~~~p~~~~-----~~~~-~~~~-~~l~W~~~~~~ 81 (209)
|+.+.++|.+|.|++.|.|-|.+.|.... +..-....+.+.+.... .|..+. .... ..+. -......|...
T Consensus 12 g~~t~~t~~~L~p~t~YyfdVF~vn~~~n~ssay~gt~~~t~~~~r~-~~~~Lkdg~l~~~~l~~~~g~~~f~f~vP~~~ 90 (300)
T PF10179_consen 12 GQKTNQTLSGLKPDTTYYFDVFVVNQLTNNSSAYLGTFARTREENRS-KPTRLKDGKLTQVKLKGKGGFKFFSFKVPKRS 90 (300)
T ss_pred CCCceEEeccCCCCCeEEEEEEEEECCCCceeeeeEEEEEEccccCC-CcEEcccCcEEEEEECCcCceeEEEEcCCcCC
Confidence 56688999999999999999999988533 32222234444222221 122221 1111 1222 22334433110
Q ss_pred CCCCcceeEEEEEEEccccCCCCcceEEeecC---------CCCcceEEecCCCCCcEEEEEEEEEcCCCCccCCccEE-
Q psy14533 82 LTNGDLLGYYLGYREQGFGRQNSYNFTTIPNR---------SDGAGVATLTGLRKYRKYDIVVQAFNEKGPGPMSSEVS- 151 (209)
Q Consensus 82 ~~~~~~~~y~v~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~~~~~L~p~~~Y~~~v~a~~~~g~~~~s~~~~- 151 (209)
..+..+. +...+ ++.. ....+... ...-..+.+.++.||..|.+++.+.+... +....-+.
T Consensus 91 ~~~~~~~---l~v~~---C~G~--V~v~i~r~gk~l~~~~~v~~~~~f~l~~~~~g~~Yliri~~~~~~e-~~~~~kV~a 161 (300)
T PF10179_consen 91 STHQSLW---LFVQS---CSGS--VRVEISRNGKILLSQKNVEGLRHFRLSGVKPGERYLIRIQISNSDE-GPSTFKVQA 161 (300)
T ss_pred CCCccEE---EEEEe---CCCe--EEEEEEECCeEEeeeecccceEEEEECCCCCCCeEEEEEEccCCCC-CceEEEEEE
Confidence 0011111 11111 1000 01111100 01125788999999999999998776433 21222222
Q ss_pred EEeccC---CCCCCCcceEEE--E--ecCCeEEEEEeCCCCccCCceeeEEEEEEEeCCC
Q psy14533 152 VQTLED---VPAAPPLDITCS--A--LSSTSLSVTWQPPPLLLQNGEILGYKVYYENMRE 204 (209)
Q Consensus 152 ~~t~~~---~p~~~p~~~~~~--~--~~~~sv~l~W~~p~~~~~~~~i~~Y~i~y~~~~~ 204 (209)
..+... .|. -|.+..+. . .+.++++|.|.+.+ +. . ..|.|+-++..+
T Consensus 162 ast~~~~~~~P~-LP~d~~Ik~f~~lrtC~SvTIAW~~s~--d~--~-~kYCvy~~~~~~ 215 (300)
T PF10179_consen 162 ASTNPSKQPYPQ-LPDDTSIKEFNKLRTCNSVTIAWLGSP--DR--S-IKYCVYRREEHS 215 (300)
T ss_pred ecCCcccCCCCC-CCCCCceeEEcCCcccceEEEEEecCC--CC--C-ceEEEEEEEecC
Confidence 222221 233 45566654 3 56799999999863 22 1 578888887644
No 18
>PF00041 fn3: Fibronectin type III domain; InterPro: IPR003961 Fibronectins are multi-domain glycoproteins found in a soluble form in plasma, and in an insoluble form in loose connective tissue and basement membranes. They contain multiple copies of 3 repeat regions (types I, II and III), which bind to a variety of substances including heparin, collagen, DNA, actin, fibrin and fibronectin receptors on cell surfaces. The wide variety of these substances means that fibronectins are involved in a number of important functions: e.g., wound healing; cell adhesion; blood coagulation; cell differentiation and migration; maintenance of the cellular cytoskeleton; and tumour metastasis. The role of fibronectin in cell differentiation is demonstrated by the marked reduction in the expression of its gene when neoplastic transformation occurs. Cell attachment has been found to be mediated by the binding of the tetrapeptide RGDS to integrins on the cell surface, although related sequences can also display cell adhesion activity. Plasma fibronectin occurs as a dimer of 2 different subunits, linked together by 2 disulphide bonds near the C terminus. The difference in the 2 chains occurs in the type III repeat region and is caused by alternative splicing of the mRNA from one gene. The observation that, in a given protein, an individual repeat of one of the 3 types (e.g., the first FnIII repeat) shows much less similarity to its subsequent tandem repeats within that protein than to its equivalent repeat between fibronectins from other species, has suggested that the repeating structure of fibronectin arose at an early stage of evolution. It also seems to suggest that the structure is subject to high selective pressure. The fibronectin type III repeat region is an approximately 100 amino acid domain, different tandem repeats of which contain binding sites for DNA, heparin and the cell surface. The superfamily of sequences believed to contain FnIII repeats represents 45 different families, the majority of which are involved in cell surface binding in some manner, or are receptor protein tyrosine kinases, or cytokine receptors.; GO: 0005515 protein binding; PDB: 1UEM_A 1TDQ_A 1X5I_A 2IC2_B 2IBG_C 2IBB_A 3R8Q_A 2FNB_A 1FNH_A 2EDB_A ....
Probab=97.92 E-value=4.5e-05 Score=44.04 Aligned_cols=42 Identities=38% Similarity=0.796 Sum_probs=37.1
Q ss_pred CCcceEEEEecCCeEEEEEeCCCCccCCceeeEEEEEEEeCCCc
Q psy14533 162 PPLDITCSALSSTSLSVTWQPPPLLLQNGEILGYKVYYENMREL 205 (209)
Q Consensus 162 ~p~~~~~~~~~~~sv~l~W~~p~~~~~~~~i~~Y~i~y~~~~~~ 205 (209)
+|.++.+...+.+++.|+|..|. ..++.+.+|+|.|+..++-
T Consensus 2 ~P~~l~v~~~~~~sv~v~W~~~~--~~~~~~~~y~v~~~~~~~~ 43 (85)
T PF00041_consen 2 APENLSVSNISPTSVTVSWKPPS--SGNGPITGYRVEYRSVNST 43 (85)
T ss_dssp SSEEEEEEEECSSEEEEEEEESS--STSSSESEEEEEEEETTSS
T ss_pred cCcCeEEEECCCCEEEEEEECCC--CCCCCeeEEEEEEEecccc
Confidence 67899999999999999999984 6778899999999988653
No 19
>COG3401 Fibronectin type 3 domain-containing protein [General function prediction only]
Probab=97.77 E-value=0.00062 Score=48.72 Aligned_cols=174 Identities=15% Similarity=0.083 Sum_probs=104.1
Q ss_pred cceEEEeecCCCceEEEEEEEEcCCCCCCCCCCeeEecCCCCCCCCCCceEEEeecCCeEEEEecCCCCCCCCCcceeEE
Q psy14533 12 EQWAVLQDLLPATLYRVRVLAENSLGAGRPSDPLLVHTEAEPPTAEPSGLHAVAISSDSIRVTWSPPPAHLTNGDLLGYY 91 (209)
Q Consensus 12 ~~~~~i~~L~p~~~Y~~~v~a~~~~g~~~~s~~~~~~t~~~~~~~~p~~~~~~~~~~~~~~l~W~~~~~~~~~~~~~~y~ 91 (209)
++.+....|.++..|.+..-..+....+..++.+...+.-.. +..........+..+.+.|.+.++ -...+|.
T Consensus 30 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~i~---~~~~~~~~~~~~~~~~~~w~~~~d----~~~~~Y~ 102 (343)
T COG3401 30 QTHYYDEGLEGEESYPYQEGTTKVDKISYDSERILVKTSFIE---RVRSVFASLERPKSVKVFWSPHPD----VSVGKYI 102 (343)
T ss_pred hhhhhhccccccCcceeeecccccceeeecCcceEEEeeecc---ccccccchhcCcceeeecccccCC----CCCCeEE
Confidence 456788889999999998877766644555666666655111 112222233456779999988653 4568898
Q ss_pred EEEEEccccCCCCcceEEeecCCCCcceEEecCCCCCcEEEEEEEEEcCCCCccCCccEEEEeccCCCCCCCcceEEEEe
Q psy14533 92 LGYREQGFGRQNSYNFTTIPNRSDGAGVATLTGLRKYRKYDIVVQAFNEKGPGPMSSEVSVQTLEDVPAAPPLDITCSAL 171 (209)
Q Consensus 92 v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~p~~~Y~~~v~a~~~~g~~~~s~~~~~~t~~~~p~~~p~~~~~~~~ 171 (209)
|+... ++.. .........+. ...+...+|.++..|...|.+.+..+.-..+....-.+....|. .-.++++...
T Consensus 103 i~~~~---gD~~-f~r~~~~~n~l-~~~~i~s~~~~~~~~~~~Iia~~f~~~~sfsf~gVE~~~~~~P~-ei~~~~~~~d 176 (343)
T COG3401 103 IQRQN---GDGK-FLRTGLVKNRL-FVEFIDSDLGHNEKYMELIIAADFQMGKSFSFTGVEATPKAEPK-EITNVRVSFD 176 (343)
T ss_pred EEEec---Cchh-hhhhhHHHhcc-chhheecccccccceeeeEEeecccccceeeeeeeecccccCCc-eeeeeeeecC
Confidence 98776 2221 11111111111 14566679999999999999988665322232222222223333 3344556666
Q ss_pred cCCeEEEEEeCCCCccCCceeeEEEEEEEeCCC
Q psy14533 172 SSTSLSVTWQPPPLLLQNGEILGYKVYYENMRE 204 (209)
Q Consensus 172 ~~~sv~l~W~~p~~~~~~~~i~~Y~i~y~~~~~ 204 (209)
..+.+.|+|+.+. . ...|+|+-...++
T Consensus 177 ~~~~i~ls~dg~~--~----~~yy~IY~~~~g~ 203 (343)
T COG3401 177 LGNNIELSEDGSE--A----EDYYRIYASDSGN 203 (343)
T ss_pred CCCcceeeccCcc--c----cceEEEeccCCcc
Confidence 7888999999884 2 2378886655443
No 20
>PF09294 Interfer-bind: Interferon-alpha/beta receptor, fibronectin type III; InterPro: IPR015373 Members of this family adopt a secondary structure consisting of seven beta-strands arranged in an immunoglobulin-like beta-sandwich, in a Greek-key topology. They are required for binding to interferon-alpha.; PDB: 1A21_A 3LQM_B 3ELA_T 1AHW_C 2A2Q_T 1TFH_B 1FAK_T 1WSS_T 1W2K_T 2FIR_T ....
Probab=97.64 E-value=0.00089 Score=40.54 Aligned_cols=89 Identities=26% Similarity=0.330 Sum_probs=52.3
Q ss_pred CCCceEEEeecCCeEEEEecCCCCC----CCC------CcceeEEEEEEEccccCCCCcceEEeecCCCCcceEEecCCC
Q psy14533 57 EPSGLHAVAISSDSIRVTWSPPPAH----LTN------GDLLGYYLGYREQGFGRQNSYNFTTIPNRSDGAGVATLTGLR 126 (209)
Q Consensus 57 ~p~~~~~~~~~~~~~~l~W~~~~~~----~~~------~~~~~y~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~ 126 (209)
||. +.+ ....+.|.|.+..|... ..+ ..-..|.|.++. ++.. ......... ...+.+.+|.
T Consensus 5 PP~-v~v-~~~~~~l~V~i~~P~~~~~~~~~~~~l~~~~~~~~Y~v~~~~--~~~~--~~~~~~~~~---~~~~~l~~L~ 75 (106)
T PF09294_consen 5 PPS-VNV-SSCGGSLHVTIKPPMTPLRAGGKNSSLRDIYPSLSYNVSYWK--NGSN--EKKKEIETK---NSSVTLSDLK 75 (106)
T ss_dssp SSE-EEE-EEETTEEEEEEEESEEEEECSSSEEEHHHHHGG-EEEEEEEE--TTTS--CEEEEEESS---SEEEEEES--
T ss_pred CCE-EEE-EECCCEEEEEEECCCcccccCCCCCcHHHhCCCeEEEEEEEe--CCCc--cceEEEeec---CCEEEEeCCC
Confidence 454 565 45778999999887510 000 012579999988 3222 122222221 3667899999
Q ss_pred CCcEEEEEEEEEcCC--CCccCCccEEEEe
Q psy14533 127 KYRKYDIVVQAFNEK--GPGPMSSEVSVQT 154 (209)
Q Consensus 127 p~~~Y~~~v~a~~~~--g~~~~s~~~~~~t 154 (209)
|++.|.|+|++.... ..|.+|....+.|
T Consensus 76 p~t~YCv~V~~~~~~~~~~s~~S~~~C~~t 105 (106)
T PF09294_consen 76 PGTNYCVSVQAFSPSQNKNSQPSEPQCITT 105 (106)
T ss_dssp TTSEEEEEEEEEECSSTEEEEEBSEEEEE-
T ss_pred CCCCEEEEEEEEeccCCCcCCCCCCEeEeC
Confidence 999999999993322 2566776666654
No 21
>KOG3632|consensus
Probab=97.48 E-value=0.00048 Score=55.91 Aligned_cols=159 Identities=26% Similarity=0.341 Sum_probs=105.7
Q ss_pred CceEEEEEEEEcCCCCCCCCCCeeEecCCCCCCCCCCceEEEeecCCeEEEEecCCCCCCCCCcceeEEEEEEEccccCC
Q psy14533 23 ATLYRVRVLAENSLGAGRPSDPLLVHTEAEPPTAEPSGLHAVAISSDSIRVTWSPPPAHLTNGDLLGYYLGYREQGFGRQ 102 (209)
Q Consensus 23 ~~~Y~~~v~a~~~~g~~~~s~~~~~~t~~~~~~~~p~~~~~~~~~~~~~~l~W~~~~~~~~~~~~~~y~v~~~~~~~~~~ 102 (209)
...|.+.|.+....|..+....+... . .+....|.++++-..+..+..++|-....++ .+ +.|.. ...
T Consensus 642 a~s~~isvq~ltSrGsqd~lrc~Llv-g-g~a~vvpsqlrv~n~tqtSa~itwvp~nsn~-----~H--viyln---~eE 709 (1335)
T KOG3632|consen 642 AHSGYISVQRLTSRGSQDQLRCILLV-G-GAAPVVPSQLRVWNATQTSAMITWVPFNSNF-----LH--VIYLN---AEE 709 (1335)
T ss_pred CCceeeehhhhhccCCCCcceeeEec-c-ccccccchhhhhhhhhchhhheeeeecCCCc-----ce--eeecC---Ccc
Confidence 34567778888888877655444432 2 2333567888888888889999996653211 11 22322 111
Q ss_pred CCcceEEeecCCCCcceEEecCCCCCcEEEEEEEEEcCC-C--------CccCCccEEEEeccCCCCCCCcceEEEEe-c
Q psy14533 103 NSYNFTTIPNRSDGAGVATLTGLRKYRKYDIVVQAFNEK-G--------PGPMSSEVSVQTLEDVPAAPPLDITCSAL-S 172 (209)
Q Consensus 103 ~~~~~~~~~~~~~~~~~~~~~~L~p~~~Y~~~v~a~~~~-g--------~~~~s~~~~~~t~~~~p~~~p~~~~~~~~-~ 172 (209)
. .. ... ....+.+.+|.|++.|.++|-+.... + .+.....+.|.|..+-|+.+|.+|.+... .
T Consensus 710 ~--~p-s~a----~~y~ytf~~lrpgt~y~a~vea~~p~q~pwdl~~v~~etr~atv~f~tLpAGppappldV~vE~g~s 782 (1335)
T KOG3632|consen 710 P--RP-SVA----EMYNYTFMRLRPGTDYWASVEAALPRQEPWDLRMVPMETRQATVLFRTLPAGPPAPPLDVKVETGGS 782 (1335)
T ss_pred C--CC-chh----hhhHHHHhccCCCCccceecccccCcCCCcccccchhhhhccceeeecccCCCCCCchheeeecCCC
Confidence 1 11 111 13677889999999999999887541 1 24456688999998888889999999875 7
Q ss_pred CCeEEEEEeCCCCcc---CCc-eeeEEEEEEE
Q psy14533 173 STSLSVTWQPPPLLL---QNG-EILGYKVYYE 200 (209)
Q Consensus 173 ~~sv~l~W~~p~~~~---~~~-~i~~Y~i~y~ 200 (209)
+..++++|.+|-.+. .|| .+.+|.|+..
T Consensus 783 pg~l~vswrPptldsag~sngv~vtgYavyad 814 (1335)
T KOG3632|consen 783 PGRLEVSWRPPTLDSAGCSNGVAVTGYAVYAD 814 (1335)
T ss_pred CceeeeeccCceeccccccCceeeeeeeeeeC
Confidence 888999999885221 222 3789998764
No 22
>KOG4367|consensus
Probab=97.32 E-value=0.00046 Score=50.97 Aligned_cols=81 Identities=27% Similarity=0.456 Sum_probs=60.8
Q ss_pred ecCCeEEEEecCCCCCCCCCcceeEEEEEEEccccCCCCcceEEeecCCCCcceEEecCCCCCcEEEEEEEEEcCCCCcc
Q psy14533 66 ISSDSIRVTWSPPPAHLTNGDLLGYYLGYREQGFGRQNSYNFTTIPNRSDGAGVATLTGLRKYRKYDIVVQAFNEKGPGP 145 (209)
Q Consensus 66 ~~~~~~~l~W~~~~~~~~~~~~~~y~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~p~~~Y~~~v~a~~~~g~~~ 145 (209)
...+++++.|+.|+.+ ..+..+|.++.-. +.. ..+..+.. + ..+.++++||.-++.|..+|.+.|..|.++
T Consensus 450 t~nns~t~~wkqp~~~--~~~~dg~~leld~---g~~--g~frevy~-g-~etmctvdglhfns~y~arvka~n~tg~s~ 520 (699)
T KOG4367|consen 450 THNNSATLSWKQPPLS--TVPADGYILELDD---GNG--GQFREVYV-G-KETMCTVDGLHFNSTYNARVKAFNKTGVSP 520 (699)
T ss_pred ccCCceEEEeecCCCC--CCCCcceEEEeec---CCC--CceeEEEe-c-CceeEEecceecchhHHHHHHHhhccCCCc
Confidence 3568999999977653 6678999998866 222 23333222 1 147899999999999999999999999999
Q ss_pred CCccEEEEec
Q psy14533 146 MSSEVSVQTL 155 (209)
Q Consensus 146 ~s~~~~~~t~ 155 (209)
+|..+...|.
T Consensus 521 ys~tl~lqts 530 (699)
T KOG4367|consen 521 YSKTLVLQTS 530 (699)
T ss_pred ccceeEeeec
Confidence 9988776555
No 23
>KOG4152|consensus
Probab=97.08 E-value=0.019 Score=44.06 Aligned_cols=75 Identities=29% Similarity=0.446 Sum_probs=55.2
Q ss_pred EeecCCCceEEEEEEEEcCCCCCCCCCCeeEecCCCCCCCCCCceEEEeecCCeEEEEecCCCCCCCCCcceeEEEE
Q psy14533 17 LQDLLPATLYRVRVLAENSLGAGRPSDPLLVHTEAEPPTAEPSGLHAVAISSDSIRVTWSPPPAHLTNGDLLGYYLG 93 (209)
Q Consensus 17 i~~L~p~~~Y~~~v~a~~~~g~~~~s~~~~~~t~~~~~~~~p~~~~~~~~~~~~~~l~W~~~~~~~~~~~~~~y~v~ 93 (209)
=..|-+|..|+|+|.+.|+.|.+.++....++|...--+..|..+... .+-..+.+.|+++... ..+.|..|..+
T Consensus 651 k~~lv~Gq~yrfrV~aIng~G~gp~s~i~~~kTc~pG~P~apS~~ri~-k~~eGi~l~weppt~p-~sg~Iieys~y 725 (830)
T KOG4152|consen 651 KTSLVTGQAYRFRVTAINGKGPGPASTILKLKTCAPGKPTAPSGARIK-KTIEGISLVWEPPTKP-GSGTIIEYSPY 725 (830)
T ss_pred ccccccccceeeeeeeeeccCCCchhhheeeeeccCCCCCCccccccc-ccccceeecccCCCCC-CCcceEEeehh
Confidence 356889999999999999999999998888887654444456555543 2446799999988653 25667777654
No 24
>cd00063 FN3 Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
Probab=96.94 E-value=0.003 Score=36.39 Aligned_cols=33 Identities=39% Similarity=0.471 Sum_probs=28.1
Q ss_pred CcceEEEeecCCCceEEEEEEEEcCCCCCCCCC
Q psy14533 11 TEQWAVLQDLLPATLYRVRVLAENSLGAGRPSD 43 (209)
Q Consensus 11 ~~~~~~i~~L~p~~~Y~~~v~a~~~~g~~~~s~ 43 (209)
+...+.+.+|.|++.|.|+|++.+..+.+.++.
T Consensus 55 ~~~~~~i~~l~p~~~Y~~~v~a~~~~~~~~~s~ 87 (93)
T cd00063 55 SETSYTLTGLKPGTEYEFRVRAVNGGGESPPSE 87 (93)
T ss_pred cccEEEEccccCCCEEEEEEEEECCCccCCCcc
Confidence 567899999999999999999998877766554
No 25
>KOG4152|consensus
Probab=96.83 E-value=0.0044 Score=47.31 Aligned_cols=72 Identities=31% Similarity=0.531 Sum_probs=53.3
Q ss_pred ecCCCCCcEEEEEEEEEcCCCCccCCccEEEEeccCC-CCCCCcceEEEEecCCeEEEEEeCCCCccC-CceeeEEEE
Q psy14533 122 LTGLRKYRKYDIVVQAFNEKGPGPMSSEVSVQTLEDV-PAAPPLDITCSALSSTSLSVTWQPPPLLLQ-NGEILGYKV 197 (209)
Q Consensus 122 ~~~L~p~~~Y~~~v~a~~~~g~~~~s~~~~~~t~~~~-p~~~p~~~~~~~~~~~sv~l~W~~p~~~~~-~~~i~~Y~i 197 (209)
-..|-+|+.|.|+|.+.+..|.++++....+.|...- |. .|..++.... -..+.|.|++|. .. .|.|-.|..
T Consensus 651 k~~lv~Gq~yrfrV~aIng~G~gp~s~i~~~kTc~pG~P~-apS~~ri~k~-~eGi~l~weppt--~p~sg~Iieys~ 724 (830)
T KOG4152|consen 651 KTSLVTGQAYRFRVTAINGKGPGPASTILKLKTCAPGKPT-APSGARIKKT-IEGISLVWEPPT--KPGSGTIIEYSP 724 (830)
T ss_pred ccccccccceeeeeeeeeccCCCchhhheeeeeccCCCCC-Cccccccccc-ccceeecccCCC--CCCCcceEEeeh
Confidence 4579999999999999999999999999988887543 44 5666666654 345899999883 33 444555543
No 26
>PF01108 Tissue_fac: Tissue factor; PDB: 3OG4_B 3OG6_B 1FYH_E 1FG9_D 1JRH_I 3DGC_R 3DLQ_R 1LQS_R 1Y6M_R 1J7V_R ....
Probab=96.59 E-value=0.046 Score=33.15 Aligned_cols=78 Identities=15% Similarity=0.259 Sum_probs=50.4
Q ss_pred CCCCceEEEeecCCeEEEEecCCCCCCCCCcceeEEEEEEEccccCCCCcceEEee-cCCCCcceEEecCCC--CCcEEE
Q psy14533 56 AEPSGLHAVAISSDSIRVTWSPPPAHLTNGDLLGYYLGYREQGFGRQNSYNFTTIP-NRSDGAGVATLTGLR--KYRKYD 132 (209)
Q Consensus 56 ~~p~~~~~~~~~~~~~~l~W~~~~~~~~~~~~~~y~v~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~L~--p~~~Y~ 132 (209)
.+|.++++... .-...|.|+++.. ...-..|.|+|+.. ... .+..+. ........+.+.... +...|.
T Consensus 23 p~P~nv~~~s~-nf~~iL~W~~~~~---~~~~~~ytVq~~~~---~~~--~W~~v~~C~~i~~~~Cdlt~~~~~~~~~Y~ 93 (107)
T PF01108_consen 23 PAPQNVTVDSV-NFKHILRWDPGPG---SPPNVTYTVQYKKY---GSS--SWKDVPGCQNITETSCDLTDETSDPSESYY 93 (107)
T ss_dssp SSCEEEEEEEE-TTEEEEEEEESTT---SSSTEEEEEEEEES---STS--CEEEECCEEEESSSEEECTTCCTTTTSEEE
T ss_pred CCCCeeEEEEE-CCceEEEeCCCCC---CCCCeEEEEEEEec---CCc--ceeeccceecccccceeCcchhhcCcCCEE
Confidence 36788888765 4578999998543 34568999999941 122 233321 111124677777643 788899
Q ss_pred EEEEEEcCCC
Q psy14533 133 IVVQAFNEKG 142 (209)
Q Consensus 133 ~~v~a~~~~g 142 (209)
++|+|..+..
T Consensus 94 ~rV~A~~~~~ 103 (107)
T PF01108_consen 94 ARVRAEVGNQ 103 (107)
T ss_dssp EEEEEEETTE
T ss_pred EEEEEEeCCc
Confidence 9999997543
No 27
>COG3401 Fibronectin type 3 domain-containing protein [General function prediction only]
Probab=96.12 E-value=0.32 Score=35.44 Aligned_cols=122 Identities=15% Similarity=0.067 Sum_probs=68.4
Q ss_pred EEEeecCCCceEEEEEEEEcCCCCCCCCCCeeEecCCCCCCCCC---CceEEEeecCCeEEEEecCCCCCCCCCcceeEE
Q psy14533 15 AVLQDLLPATLYRVRVLAENSLGAGRPSDPLLVHTEAEPPTAEP---SGLHAVAISSDSIRVTWSPPPAHLTNGDLLGYY 91 (209)
Q Consensus 15 ~~i~~L~p~~~Y~~~v~a~~~~g~~~~s~~~~~~t~~~~~~~~p---~~~~~~~~~~~~~~l~W~~~~~~~~~~~~~~y~ 91 (209)
+.-++|..+..|.+.|.+.+..+.-..+ +..-+..+...| ..+.+.....+.+.|+|..+.. ...|.
T Consensus 126 ~i~s~~~~~~~~~~~Iia~~f~~~~sfs----f~gVE~~~~~~P~ei~~~~~~~d~~~~i~ls~dg~~~------~~yy~ 195 (343)
T COG3401 126 FIDSDLGHNEKYMELIIAADFQMGKSFS----FTGVEATPKAEPKEITNVRVSFDLGNNIELSEDGSEA------EDYYR 195 (343)
T ss_pred heecccccccceeeeEEeecccccceee----eeeeecccccCCceeeeeeeecCCCCcceeeccCccc------cceEE
Confidence 3445788999999999998877654433 222222222222 2334455677889999988754 23777
Q ss_pred EEEEEccccCCCCcceEEeecCCCCcceEEecCCCCCcEEEEEEEEEcCCC-CccCCccEEEE
Q psy14533 92 LGYREQGFGRQNSYNFTTIPNRSDGAGVATLTGLRKYRKYDIVVQAFNEKG-PGPMSSEVSVQ 153 (209)
Q Consensus 92 v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~p~~~Y~~~v~a~~~~g-~~~~s~~~~~~ 153 (209)
|+... ............. .+.+....-.-|..|.+.|.++...+ .+..+......
T Consensus 196 IY~~~---~g~e~~~~ia~t~----~n~y~d~~eglga~~~y~VTtVd~~~~es~lp~~~t~~ 251 (343)
T COG3401 196 IYASD---SGNEEYGFIAQTT----ENSYYDVKEGLGAVEYYKVTTVDNTGFESDLPNEPTVG 251 (343)
T ss_pred EeccC---Cccccccceeecc----ccchhhhhhccCceeEEEEEEEcCCcceeccCCccccc
Confidence 75544 2222223222221 23433333333778888888887655 55555444433
No 28
>smart00060 FN3 Fibronectin type 3 domain. One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
Probab=95.54 E-value=0.047 Score=30.26 Aligned_cols=26 Identities=38% Similarity=0.422 Sum_probs=22.9
Q ss_pred cceEEEeecCCCceEEEEEEEEcCCC
Q psy14533 12 EQWAVLQDLLPATLYRVRVLAENSLG 37 (209)
Q Consensus 12 ~~~~~i~~L~p~~~Y~~~v~a~~~~g 37 (209)
...+.|.+|.+++.|.|+|++.+..|
T Consensus 56 ~~~~~i~~L~~~~~Y~v~v~a~~~~g 81 (83)
T smart00060 56 STSYTLTGLKPGTEYEFRVRAVNGAG 81 (83)
T ss_pred ccEEEEeCcCCCCEEEEEEEEEcccC
Confidence 46899999999999999999998644
No 29
>PF09067 EpoR_lig-bind: Erythropoietin receptor, ligand binding; InterPro: IPR015152 Members of this entry include the growth hormone and erythropoietin receptors. The latter interacts with erythropoietin (EPO), with subsequent initiation of the downstream chain of events associated with binding of EPO to the receptor, including EPO-induced erythroblast proliferation and differentiation through induction of the JAK2/STAT5 signalling cascade. The domain adopts a secondary structure composed of a short amino-terminal helix, followed by two beta-sandwich regions.; PDB: 3NCB_B 3NCF_B 3NCE_B 3N0P_B 3NCC_B 1BP3_B 3N06_B 3MZG_B 3D48_R 1F6F_C ....
Probab=94.88 E-value=0.18 Score=30.33 Aligned_cols=83 Identities=18% Similarity=0.277 Sum_probs=50.7
Q ss_pred CCCCCCceEEEeecCCeEEEEecCCCCCCCCCcceeEEEEEEEccccCCCCcceEEeecCCCCc-----ceEEecCCCCC
Q psy14533 54 PTAEPSGLHAVAISSDSIRVTWSPPPAHLTNGDLLGYYLGYREQGFGRQNSYNFTTIPNRSDGA-----GVATLTGLRKY 128 (209)
Q Consensus 54 ~~~~p~~~~~~~~~~~~~~l~W~~~~~~~~~~~~~~y~v~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~L~p~ 128 (209)
++..|.++.+.......++.-|++... .+....|.+.|.. .. ............+... ..|......-+
T Consensus 7 ~p~~P~~~~C~S~~~etftC~W~~g~~---~~l~~~y~L~Y~~--~~-~~~~eCp~~~~~~~ns~~~~~C~F~~~~t~lf 80 (104)
T PF09067_consen 7 PPEKPENLKCFSREMETFTCFWEPGSE---GNLPTNYTLFYKK--EG-EEWKECPDYSTSGPNSTVRHICYFPKSDTSLF 80 (104)
T ss_dssp HCCCCEEEEEEBSSSS-EEEEEEEESS---STSTCEEEEEEEE--TT-SEEEEESESSTTETTEEEEEEEEE-CCGCSSS
T ss_pred CCCCCccCccCCCCCCcEEEEeeCCCC---CCCCCcEEEEEEe--CC-CCCccCCCeEecCCCCceeEEEEcCCCCeEEE
Confidence 345678888888888999999998754 2223449999998 21 1111111111111111 23333478899
Q ss_pred cEEEEEEEEEcCCC
Q psy14533 129 RKYDIVVQAFNEKG 142 (209)
Q Consensus 129 ~~Y~~~v~a~~~~g 142 (209)
+.|.++|.+.+..|
T Consensus 81 ~~y~i~V~a~~~~~ 94 (104)
T PF09067_consen 81 VPYCIQVEATNALG 94 (104)
T ss_dssp SEEEEEEEEEETTE
T ss_pred EEEEEEEEeccCCC
Confidence 99999999998765
No 30
>COG4733 Phage-related protein, tail component [Function unknown]
Probab=94.85 E-value=0.98 Score=37.30 Aligned_cols=126 Identities=12% Similarity=0.020 Sum_probs=65.1
Q ss_pred CCcceEEEeecCCCceEEEEEEEEcCCCCCCCCC-CeeEecCCCCCCCCCCceEEE-eecCCeEEEEecCCCCCCCCCcc
Q psy14533 10 GTEQWAVLQDLLPATLYRVRVLAENSLGAGRPSD-PLLVHTEAEPPTAEPSGLHAV-AISSDSIRVTWSPPPAHLTNGDL 87 (209)
Q Consensus 10 ~~~~~~~i~~L~p~~~Y~~~v~a~~~~g~~~~s~-~~~~~t~~~~~~~~p~~~~~~-~~~~~~~~l~W~~~~~~~~~~~~ 87 (209)
.....+.+.+|.+| .|.++|+|.|..+....-. ...++......+ +|..+... ..-.-.+++.|-.|.. ...+
T Consensus 657 t~~~~~~~~gi~~G-qY~i~VrAiN~~g~~~~~a~s~~f~i~g~~~P-pp~~~t~~a~~it~~~~l~v~dPt~---~~d~ 731 (952)
T COG4733 657 TSAAGFDVEGIPAG-QYAIRVRAINVFEPNSPDATAYEFALNGKKVP-PPKAMIYDAVIITLVIRLVVGDPTG---AVDI 731 (952)
T ss_pred ccccceeecCcCcc-ceEEEEEEeeccCCCCCCcceeEEEecCCCCC-CCcccccceEEEEeeeeEEEecCCc---ceEE
Confidence 45678999999995 6999999999998754222 334433322221 22222211 1122356677755531 1122
Q ss_pred eeEEEEEEEccccCCCCcceEEeecCCCCcceEEecCCCCCcEEEEEEEEEcCCCCcc
Q psy14533 88 LGYYLGYREQGFGRQNSYNFTTIPNRSDGAGVATLTGLRKYRKYDIVVQAFNEKGPGP 145 (209)
Q Consensus 88 ~~y~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~p~~~Y~~~v~a~~~~g~~~ 145 (209)
..-.+++.. ....+......-... .....-.+|+++..|.|++++++..|...
T Consensus 732 ~~sei~~s~-~~d~~~~Ar~LG~~~----~~~~~~~~i~~g~~~~F~~R~Vn~vG~~~ 784 (952)
T COG4733 732 TSTEIRSAV-IADGNFQARSLGNLN----YPGLFSVGIQAGLTFWFRNRNVDLVGNND 784 (952)
T ss_pred eeeeeeeec-cccchhHHhhhhccc----cccccccCcCCCceEEEEeeecccccccc
Confidence 222333322 111111011111000 01111268999999999999999888543
No 31
>PF07495 Y_Y_Y: Y_Y_Y domain; InterPro: IPR011123 This region is mostly found at the end of the beta propellers (IPR011110 from INTERPRO) in a family of two component regulators. However they are also found tandemly repeated in Q891H4 from SWISSPROT without other signal conduction domains being present. It is named after the conserved tyrosines found in the alignment. The exact function is not known.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=93.31 E-value=0.25 Score=26.73 Aligned_cols=36 Identities=25% Similarity=0.354 Sum_probs=23.2
Q ss_pred ceEEecCCCcceEEEeecCCCceEEEEEEEEcCCCCCC
Q psy14533 3 EWKSQNSGTEQWAVLQDLLPATLYRVRVLAENSLGAGR 40 (209)
Q Consensus 3 ~w~~~~~~~~~~~~i~~L~p~~~Y~~~v~a~~~~g~~~ 40 (209)
.|........ .+.+++|.|| .|.|.|+|.+..|...
T Consensus 20 ~W~~~~~~~~-~~~~~~L~~G-~Y~l~V~a~~~~~~~~ 55 (66)
T PF07495_consen 20 EWITLGSYSN-SISYTNLPPG-KYTLEVRAKDNNGKWS 55 (66)
T ss_dssp SEEEESSTS--EEEEES--SE-EEEEEEEEEETTS-B-
T ss_pred eEEECCCCcE-EEEEEeCCCE-EEEEEEEEECCCCCcC
Confidence 3544443322 8999999999 5999999999877543
No 32
>KOG3632|consensus
Probab=92.93 E-value=1.2 Score=37.47 Aligned_cols=127 Identities=19% Similarity=0.219 Sum_probs=76.2
Q ss_pred ceEEEeecCCCceEEEEEEEEcC-CCC--------CCCCCCeeEecCCCCCCCCCCceEEEee-cCCeEEEEecCCCCCC
Q psy14533 13 QWAVLQDLLPATLYRVRVLAENS-LGA--------GRPSDPLLVHTEAEPPTAEPSGLHAVAI-SSDSIRVTWSPPPAHL 82 (209)
Q Consensus 13 ~~~~i~~L~p~~~Y~~~v~a~~~-~g~--------~~~s~~~~~~t~~~~~~~~p~~~~~~~~-~~~~~~l~W~~~~~~~ 82 (209)
..+.+.+|.|++.|..+|.+... .+. +.-...+.++|...-|+.+|..+.+... +++.+.++|.++.-+.
T Consensus 718 y~ytf~~lrpgt~y~a~vea~~p~q~pwdl~~v~~etr~atv~f~tLpAGppappldV~vE~g~spg~l~vswrPptlds 797 (1335)
T KOG3632|consen 718 YNYTFMRLRPGTDYWASVEAALPRQEPWDLRMVPMETRQATVLFRTLPAGPPAPPLDVKVETGGSPGRLEVSWRPPTLDS 797 (1335)
T ss_pred hHHHHhccCCCCccceecccccCcCCCcccccchhhhhccceeeecccCCCCCCchheeeecCCCCceeeeeccCceecc
Confidence 35788999999999999988755 211 1113457888888888888888887643 6788999999875321
Q ss_pred ----CCCcceeEEEEEEEccccCCCCcceEEeecCCCCcceEEecCCCC-CcEEEEEEEEEcCCCCccCC
Q psy14533 83 ----TNGDLLGYYLGYREQGFGRQNSYNFTTIPNRSDGAGVATLTGLRK-YRKYDIVVQAFNEKGPGPMS 147 (209)
Q Consensus 83 ----~~~~~~~y~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~p-~~~Y~~~v~a~~~~g~~~~s 147 (209)
.+-.+.+|-|+... .. ...+.........+.+..|.- ...=.+.|+.....|.+-.|
T Consensus 798 ag~sngv~vtgYavyadg------qk--v~Evafptagst~VelsQlq~~l~~~~V~vRtms~~gesvds 859 (1335)
T KOG3632|consen 798 AGCSNGVAVTGYAVYADG------QK--VEEVAFPTAGSTKVELSQLQDGLYHGAVGVRTMSVPGESVDS 859 (1335)
T ss_pred ccccCceeeeeeeeeeCC------ce--eeeeecccCCceEEEeeeehhhheecceeEEecccCcccccc
Confidence 13346788885542 11 111111111224444445544 23335566666666654333
No 33
>PF09294 Interfer-bind: Interferon-alpha/beta receptor, fibronectin type III; InterPro: IPR015373 Members of this family adopt a secondary structure consisting of seven beta-strands arranged in an immunoglobulin-like beta-sandwich, in a Greek-key topology. They are required for binding to interferon-alpha.; PDB: 1A21_A 3LQM_B 3ELA_T 1AHW_C 2A2Q_T 1TFH_B 1FAK_T 1WSS_T 1W2K_T 2FIR_T ....
Probab=92.30 E-value=0.18 Score=30.33 Aligned_cols=24 Identities=38% Similarity=0.368 Sum_probs=18.3
Q ss_pred CcceEEEeecCCCceEEEEEEEEc
Q psy14533 11 TEQWAVLQDLLPATLYRVRVLAEN 34 (209)
Q Consensus 11 ~~~~~~i~~L~p~~~Y~~~v~a~~ 34 (209)
....+.|.+|.|++.|.|+|++..
T Consensus 65 ~~~~~~l~~L~p~t~YCv~V~~~~ 88 (106)
T PF09294_consen 65 KNSSVTLSDLKPGTNYCVSVQAFS 88 (106)
T ss_dssp SSEEEEEES--TTSEEEEEEEEEE
T ss_pred cCCEEEEeCCCCCCCEEEEEEEEe
Confidence 345679999999999999999943
No 34
>PLN02533 probable purple acid phosphatase
Probab=91.29 E-value=5.7 Score=30.83 Aligned_cols=90 Identities=16% Similarity=0.174 Sum_probs=49.3
Q ss_pred CCCCCCceEEEeecCCeEEEEecCCCCCCCCCcceeEEEEEEEccccCCCCc--ceEEee--c--CCCCcceEEecCCCC
Q psy14533 54 PTAEPSGLHAVAISSDSIRVTWSPPPAHLTNGDLLGYYLGYREQGFGRQNSY--NFTTIP--N--RSDGAGVATLTGLRK 127 (209)
Q Consensus 54 ~~~~p~~~~~~~~~~~~~~l~W~~~~~~~~~~~~~~y~v~~~~~~~~~~~~~--~~~~~~--~--~~~~~~~~~~~~L~p 127 (209)
.+..|..+++...+.+++.|+|.-... ..+. |+|-.......... ...+.. . ...-.....|.+|+|
T Consensus 40 ~~~~P~qvhls~~~~~~m~V~W~T~~~---~~~~----V~yG~~~~~l~~~a~g~~~~~~~~~~~~~g~iH~v~l~~L~p 112 (427)
T PLN02533 40 DPTHPDQVHISLVGPDKMRISWITQDS---IPPS----VVYGTVSGKYEGSANGTSSSYHYLLIYRSGQINDVVIGPLKP 112 (427)
T ss_pred CCCCCceEEEEEcCCCeEEEEEECCCC---CCCE----EEEecCCCCCcceEEEEEEEEeccccccCCeEEEEEeCCCCC
Confidence 344578888887788999999987642 1222 44443111000000 000000 0 000124578999999
Q ss_pred CcEEEEEEEEEcCCCCccCCccEEEEecc
Q psy14533 128 YRKYDIVVQAFNEKGPGPMSSEVSVQTLE 156 (209)
Q Consensus 128 ~~~Y~~~v~a~~~~g~~~~s~~~~~~t~~ 156 (209)
++.|.++|.. ...+....|+|.+
T Consensus 113 ~T~Y~Yrvg~------~~~s~~~~F~T~p 135 (427)
T PLN02533 113 NTVYYYKCGG------PSSTQEFSFRTPP 135 (427)
T ss_pred CCEEEEEECC------CCCccceEEECCC
Confidence 9999999831 1124456777754
No 35
>KOG4806|consensus
Probab=91.10 E-value=5 Score=29.82 Aligned_cols=131 Identities=20% Similarity=0.173 Sum_probs=70.2
Q ss_pred eecCCCceEEEEEEEEcCCCCCCCCCCeeEecCC--CCCCCCCCceEEEee--cCCeEEEEecCCCCCCCCCcceeEEEE
Q psy14533 18 QDLLPATLYRVRVLAENSLGAGRPSDPLLVHTEA--EPPTAEPSGLHAVAI--SSDSIRVTWSPPPAHLTNGDLLGYYLG 93 (209)
Q Consensus 18 ~~L~p~~~Y~~~v~a~~~~g~~~~s~~~~~~t~~--~~~~~~p~~~~~~~~--~~~~~~l~W~~~~~~~~~~~~~~y~v~ 93 (209)
-.+.++..|.++.+..|....-. ..-+.+.+.. ...+..|.+..+... .=++.+|.|...+++ -..|.|+
T Consensus 289 fvv~~~~~~~~rf~~~ndde~~~-~v~V~aSt~~t~~~~P~LP~dTtVk~v~r~CSsAtIaW~gs~d~-----~~kyCIy 362 (454)
T KOG4806|consen 289 FVVLPGERYLMRFEPSNDDEALQ-KVMVAASTEATFRDLPELPQDTTVKNVRRRCSSATIAWNGSPDE-----ELKYCIY 362 (454)
T ss_pred EEecCCCceEEEEEecCCchhhh-eeEeeeecccccCcCCCCCCCceEeeeccccchheeeeccCcch-----heeEEEE
Confidence 45678888999999888764332 1122222221 112334566555543 346788999876542 3667775
Q ss_pred EEEccc---------cCCCC-----cc------eEEee--cCC-CCcceEEecCCCCCcEEEEEEEEEcCCCCccCCccE
Q psy14533 94 YREQGF---------GRQNS-----YN------FTTIP--NRS-DGAGVATLTGLRKYRKYDIVVQAFNEKGPGPMSSEV 150 (209)
Q Consensus 94 ~~~~~~---------~~~~~-----~~------~~~~~--~~~-~~~~~~~~~~L~p~~~Y~~~v~a~~~~g~~~~s~~~ 150 (209)
...... ..... .. +.... ... .....-+|.||.||..|.+.|.|....|..-+-..+
T Consensus 363 ~~~~~~~e~~v~~~~N~C~g~~~~~~s~~v~c~y~hs~~~q~~~~~i~teTI~gL~PgssYlldv~a~~~~g~~lpyqa~ 442 (454)
T KOG4806|consen 363 VFNLPQRERSVVDFTNYCMGFVPKRVSQYVYCEYMHSRERQQSPDNIETETILGLMPGSSYLLDVTANLSMGKPLPYQAL 442 (454)
T ss_pred EecccchhhhhhhhhccccCccccceeEEEeEEEecChhhhcchhhhhhhhhcccccCceEEEEEEEcccCCccccceeE
Confidence 543100 00000 00 00000 000 011344678999999999999998877765444444
Q ss_pred EEEe
Q psy14533 151 SVQT 154 (209)
Q Consensus 151 ~~~t 154 (209)
.+.|
T Consensus 443 ~v~t 446 (454)
T KOG4806|consen 443 TVHT 446 (454)
T ss_pred EEEe
Confidence 4444
No 36
>PF09240 IL6Ra-bind: Interleukin-6 receptor alpha chain, binding; InterPro: IPR015321 Members of this entry adopt a structure consisting of an immunoglobulin-like beta-sandwich, with seven strands in two beta-sheets, in a Greek-key topology. They are required for binding to the cytokine Interleukin-6.; PDB: 1N26_A 1P9M_C 3LB6_C 1PVH_A 3L5H_A 1BQU_A 1I1R_A 3QT2_B 3BPN_C 3BPO_C ....
Probab=90.03 E-value=2.7 Score=24.97 Aligned_cols=80 Identities=15% Similarity=0.212 Sum_probs=46.9
Q ss_pred CCceEEEeecCCeEEEEecCCCCCCCCCcceeEEEEEEEccccCCCCcceEEeecCCCCcceEEecCCCC----CcEEEE
Q psy14533 58 PSGLHAVAISSDSIRVTWSPPPAHLTNGDLLGYYLGYREQGFGRQNSYNFTTIPNRSDGAGVATLTGLRK----YRKYDI 133 (209)
Q Consensus 58 p~~~~~~~~~~~~~~l~W~~~~~~~~~~~~~~y~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~p----~~~Y~~ 133 (209)
|.++.........+..+|.+... ...-..|.+.++.. ....................+.+..... ...|.+
T Consensus 2 ~~nlsC~~~~~~~m~CtW~~g~~---~~~~t~y~L~~~~~--~~~~~~eC~~y~~~~~~~~gC~~~~~~~~~~~~~~~~v 76 (99)
T PF09240_consen 2 PQNLSCFIYNLEYMNCTWEPGKE---APPDTQYTLYYWYS--PLEEEKECPHYSKDSGTRIGCQFPVSEIDSSEFSQYNV 76 (99)
T ss_dssp -EEEEEEEETTTEEEEEEECCTT---CSTTEEEEEEEEET--TSSSEEEESEEEESTSSEEEEEEESCTT-TTTTSEEEE
T ss_pred CeeCEEEEECCEEEEEEECCCCC---CCCcccEEEEEEcC--CCCccccCCCccccCCceeEEEecCCCccccccceEEE
Confidence 46777777788999999987653 22447899988882 2111112222111111124555555444 457889
Q ss_pred EEEEEcCCC
Q psy14533 134 VVQAFNEKG 142 (209)
Q Consensus 134 ~v~a~~~~g 142 (209)
.|.+.+..|
T Consensus 77 ~V~~ss~~~ 85 (99)
T PF09240_consen 77 CVNGSSSAG 85 (99)
T ss_dssp EEEEEETTE
T ss_pred EEEeccCCC
Confidence 988887655
No 37
>KOG4806|consensus
Probab=88.78 E-value=8.1 Score=28.80 Aligned_cols=76 Identities=21% Similarity=0.280 Sum_probs=43.7
Q ss_pred ecCCCCCcEEEEEEEEEcCCCCccCCccEEEEec---cCCCCCCCcceEE--EEecCCeEEEEEeCCCCccCCceeeEEE
Q psy14533 122 LTGLRKYRKYDIVVQAFNEKGPGPMSSEVSVQTL---EDVPAAPPLDITC--SALSSTSLSVTWQPPPLLLQNGEILGYK 196 (209)
Q Consensus 122 ~~~L~p~~~Y~~~v~a~~~~g~~~~s~~~~~~t~---~~~p~~~p~~~~~--~~~~~~sv~l~W~~p~~~~~~~~i~~Y~ 196 (209)
--.+.++..|.+++...|....- .-..+...|. .+.|. -|.+-++ .....++..|.|...+ +. -..|.
T Consensus 288 Qfvv~~~~~~~~rf~~~ndde~~-~~v~V~aSt~~t~~~~P~-LP~dTtVk~v~r~CSsAtIaW~gs~--d~---~~kyC 360 (454)
T KOG4806|consen 288 QFVVLPGERYLMRFEPSNDDEAL-QKVMVAASTEATFRDLPE-LPQDTTVKNVRRRCSSATIAWNGSP--DE---ELKYC 360 (454)
T ss_pred EEEecCCCceEEEEEecCCchhh-heeEeeeecccccCcCCC-CCCCceEeeeccccchheeeeccCc--ch---heeEE
Confidence 34577888888888888765322 1111222111 12233 3444444 4567889999999874 21 35677
Q ss_pred EEEEeCCC
Q psy14533 197 VYYENMRE 204 (209)
Q Consensus 197 i~y~~~~~ 204 (209)
|+-+...+
T Consensus 361 Iy~~~~~~ 368 (454)
T KOG4806|consen 361 IYVFNLPQ 368 (454)
T ss_pred EEEecccc
Confidence 77666654
No 38
>KOG1225|consensus
Probab=88.24 E-value=2.5 Score=33.40 Aligned_cols=111 Identities=19% Similarity=0.160 Sum_probs=69.1
Q ss_pred CCCcceEEEeecCCCceEEEEEEEEcCCCCCCCCCCeeEecCCCCCCCCCCceEEEeecCCeEEEEecCCCCCCCCCcce
Q psy14533 9 SGTEQWAVLQDLLPATLYRVRVLAENSLGAGRPSDPLLVHTEAEPPTAEPSGLHAVAISSDSIRVTWSPPPAHLTNGDLL 88 (209)
Q Consensus 9 ~~~~~~~~i~~L~p~~~Y~~~v~a~~~~g~~~~s~~~~~~t~~~~~~~~p~~~~~~~~~~~~~~l~W~~~~~~~~~~~~~ 88 (209)
.+....+.+..|.|+..|..+++++...-.+.+.. ....+. ...+..+.+......++.+.|..+.. ...
T Consensus 414 ~~~~~~~~~~~~~~g~~~~~~~~~v~~~~~~~~~~-~~~~~~----~~~~g~~~v~~~~~~s~e~~g~~~s~-----~~~ 483 (525)
T KOG1225|consen 414 PGDANSVDIQGLEPGDEYNCSVNTVAANIGSLPKD-KSETTV----LCWNGGLCVDGETESSLEVGGPCPSS-----GTC 483 (525)
T ss_pred ccceeeeeeeeecCCcceeeehhhhhhhhccCCcc-cccceE----eecCCceeeeeeeeccccccCCCCCc-----ccc
Confidence 34567889999999999999999885543332111 111111 11345667778888999999988754 346
Q ss_pred eEEEEEEEccccCCCCcceEEeecCCCCcceEEecCCCCCcEEEEEE
Q psy14533 89 GYYLGYREQGFGRQNSYNFTTIPNRSDGAGVATLTGLRKYRKYDIVV 135 (209)
Q Consensus 89 ~y~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~p~~~Y~~~v 135 (209)
.+.+.+.+ ............ . ....+...+|.+++.|.+.+
T Consensus 484 ~~~~~~~~---~~~~~~~~~~~~--~-~~~~~~~~~l~~c~~~~~~~ 524 (525)
T KOG1225|consen 484 GWEVRCGP---CGNDGGVNAEPP--P-ECTSYDRTGLGPCTEYEVSV 524 (525)
T ss_pred ceEEEeee---cCcccccccCCC--C-CCCCCCccCcccccceeccc
Confidence 67777744 111111111111 1 24677788999999998764
No 39
>TIGR00868 hCaCC calcium-activated chloride channel protein 1.
Probab=87.29 E-value=4.3 Score=34.39 Aligned_cols=82 Identities=22% Similarity=0.336 Sum_probs=47.1
Q ss_pred ecCCeEEEEecCCCCCCCCCcceeEEEEEEEccccCCCCcceEE-e------ecC-CCC-cceEEecC--CCCCcEEEEE
Q psy14533 66 ISSDSIRVTWSPPPAHLTNGDLLGYYLGYREQGFGRQNSYNFTT-I------PNR-SDG-AGVATLTG--LRKYRKYDIV 134 (209)
Q Consensus 66 ~~~~~~~l~W~~~~~~~~~~~~~~y~v~~~~~~~~~~~~~~~~~-~------~~~-~~~-~~~~~~~~--L~p~~~Y~~~ 134 (209)
.....+.|+|..|-.+.-.|....|.|+|......-........ + +.. +.. ...+..+. .+.++.|.|-
T Consensus 766 ~~~~~v~LsWTAPG~d~D~G~a~~y~ir~s~~~~~l~~~f~~a~~vn~~~~~P~~ags~e~~~f~~~~~~~~~~~~~~~a 845 (863)
T TIGR00868 766 FQGDNIILTWTAPGDVLDHGRADRYIIRISTSILDLRDDFNDATQVNTTDLIPKEANSKEVFVFKPEGIPIENGTDLFIA 845 (863)
T ss_pred ecCCEEEEEeeCCCccCCCCccceEEEEecCCHHHHHhhhccccccccCCcCCCCCCceeEEEEeCCcccccCCeEEEEE
Confidence 35566999999997766678889999999863111100101000 0 110 111 12344444 3367789999
Q ss_pred EEEEcCCC-CccCC
Q psy14533 135 VQAFNEKG-PGPMS 147 (209)
Q Consensus 135 v~a~~~~g-~~~~s 147 (209)
|++++..+ .|..|
T Consensus 846 i~a~d~~~~~s~~s 859 (863)
T TIGR00868 846 VQAIDKANLTSEVS 859 (863)
T ss_pred EEEEcccccccccc
Confidence 99998776 34333
No 40
>PF01108 Tissue_fac: Tissue factor; PDB: 3OG4_B 3OG6_B 1FYH_E 1FG9_D 1JRH_I 3DGC_R 3DLQ_R 1LQS_R 1Y6M_R 1J7V_R ....
Probab=86.92 E-value=1.9 Score=26.08 Aligned_cols=36 Identities=22% Similarity=0.532 Sum_probs=27.5
Q ss_pred CCcceEEEEecCCeEEEEEeCCCCccCCceeeEEEEEEEe
Q psy14533 162 PPLDITCSALSSTSLSVTWQPPPLLLQNGEILGYKVYYEN 201 (209)
Q Consensus 162 ~p~~~~~~~~~~~sv~l~W~~p~~~~~~~~i~~Y~i~y~~ 201 (209)
+|.++++...+-. ..|.|++++ ....-..|.+.|+.
T Consensus 24 ~P~nv~~~s~nf~-~iL~W~~~~---~~~~~~~ytVq~~~ 59 (107)
T PF01108_consen 24 APQNVTVDSVNFK-HILRWDPGP---GSPPNVTYTVQYKK 59 (107)
T ss_dssp SCEEEEEEEETTE-EEEEEEEST---TSSSTEEEEEEEEE
T ss_pred CCCeeEEEEECCc-eEEEeCCCC---CCCCCeEEEEEEEe
Confidence 7889999987554 779999853 22345789999993
No 41
>COG4733 Phage-related protein, tail component [Function unknown]
Probab=83.26 E-value=13 Score=31.27 Aligned_cols=65 Identities=18% Similarity=0.270 Sum_probs=39.8
Q ss_pred cceEEecCCCCCcEEEEEEEEEcCCCC-ccCCccEEEE-eccCCCCCCCcceEEEEecCCeEEEEEeCC
Q psy14533 117 AGVATLTGLRKYRKYDIVVQAFNEKGP-GPMSSEVSVQ-TLEDVPAAPPLDITCSALSSTSLSVTWQPP 183 (209)
Q Consensus 117 ~~~~~~~~L~p~~~Y~~~v~a~~~~g~-~~~s~~~~~~-t~~~~p~~~p~~~~~~~~~~~sv~l~W~~p 183 (209)
...+.+.+|.+ .+|.++|+|+|..|. +.+.....+. ..+..|+..+..+.....+- .++|.|--|
T Consensus 659 ~~~~~~~gi~~-GqY~i~VrAiN~~g~~~~~a~s~~f~i~g~~~Ppp~~~t~~a~~it~-~~~l~v~dP 725 (952)
T COG4733 659 AAGFDVEGIPA-GQYAIRVRAINVFEPNSPDATAYEFALNGKKVPPPKAMIYDAVIITL-VIRLVVGDP 725 (952)
T ss_pred ccceeecCcCc-cceEEEEEEeeccCCCCCCcceeEEEecCCCCCCCcccccceEEEEe-eeeEEEecC
Confidence 36788999988 489999999998874 3333233332 22334443333333444433 477888877
No 42
>KOG3515|consensus
Probab=82.86 E-value=27 Score=29.28 Aligned_cols=163 Identities=15% Similarity=0.148 Sum_probs=87.4
Q ss_pred ceEEEeecCCCceEEEEEEEEcCCCCCCCCCCeeEecCCCCCCCCCCceEEEeecCCeEEEEecCCCCCCCCCcceeEEE
Q psy14533 13 QWAVLQDLLPATLYRVRVLAENSLGAGRPSDPLLVHTEAEPPTAEPSGLHAVAISSDSIRVTWSPPPAHLTNGDLLGYYL 92 (209)
Q Consensus 13 ~~~~i~~L~p~~~Y~~~v~a~~~~g~~~~s~~~~~~t~~~~~~~~p~~~~~~~~~~~~~~l~W~~~~~~~~~~~~~~y~v 92 (209)
+.+..+++.+-..|.-++.+..................... ...+ .......-.....|...+.. ..+....|..
T Consensus 570 ~~~~~d~~~~pA~~~~~l~~~s~~~v~~~tp~~~~~~~g~v-~as~---~~~a~~~~~~~~~~~k~p~~-~~~~~~~~~~ 644 (741)
T KOG3515|consen 570 TACVVDGSSPPAGLSLVLKAYSALIVTVRTPNMYTAQEGCV-NASD---DCVASGVPSTSFTTAKTPLQ-TTEGPHIREK 644 (741)
T ss_pred eEEeecCCCCccccceEEEecccccccccCCceeeeccCcc-cccc---cceecccccceeecccCCCc-ccCCceEEEe
Confidence 45566666666666666666655554443333333322222 1111 11222333455566554332 1122222222
Q ss_pred EEEEccccCCCCcceEEeecCCCCcceEEecCCCCCcEEEEEEEEEcCCCCccCCccEEEEeccCCCCCCCcceEEEEec
Q psy14533 93 GYREQGFGRQNSYNFTTIPNRSDGAGVATLTGLRKYRKYDIVVQAFNEKGPGPMSSEVSVQTLEDVPAAPPLDITCSALS 172 (209)
Q Consensus 93 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~p~~~Y~~~v~a~~~~g~~~~s~~~~~~t~~~~p~~~p~~~~~~~~~ 172 (209)
-+.. . ......-.+-+-.++..|.+.+.+.+..|.+..+..+..-+. +..|.|+.+....
T Consensus 645 ~~~~---------------~-~~~s~~~~~~~~~~~~~y~~~c~t~n~lg~~~v~~~~~~~t~----~~~~~n~~~~~~~ 704 (741)
T KOG3515|consen 645 ATQQ---------------G-QTHSSSEWILNHIDGSDYENGCTTQNLLGSDHVSGAIHSGTA----SVGPINLTYDNLT 704 (741)
T ss_pred eeec---------------c-ccccccccccCCcchhhhcceeeecccCCCccccceecCCcC----CCCccceEeeeee
Confidence 2222 0 001122233455567778877888887776655544433332 2367899999999
Q ss_pred CCeEEEEEeCCCCccCCceeeEEEEEEEeCC
Q psy14533 173 STSLSVTWQPPPLLLQNGEILGYKVYYENMR 203 (209)
Q Consensus 173 ~~sv~l~W~~p~~~~~~~~i~~Y~i~y~~~~ 203 (209)
...+.|.|.+- ...+...+|.+.|...+
T Consensus 705 ~~~i~l~~~p~---fdgg~~q~f~~~~~~~~ 732 (741)
T KOG3515|consen 705 YSTISLEWMPG---FDGGLQQRFFLKYYDLG 732 (741)
T ss_pred eeeeceeeeec---cccccccceeeehhhcC
Confidence 99999999975 44566678888877654
No 43
>KOG4367|consensus
Probab=80.43 E-value=0.65 Score=35.14 Aligned_cols=42 Identities=29% Similarity=0.339 Sum_probs=36.5
Q ss_pred CCCcceEEEeecCCCceEEEEEEEEcCCCCCCCCCCeeEecC
Q psy14533 9 SGTEQWAVLQDLLPATLYRVRVLAENSLGAGRPSDPLLVHTE 50 (209)
Q Consensus 9 ~~~~~~~~i~~L~p~~~Y~~~v~a~~~~g~~~~s~~~~~~t~ 50 (209)
.|.++.++|++|.-++.|.-+|.++|..|.++.|..+-..|.
T Consensus 489 ~g~etmctvdglhfns~y~arvka~n~tg~s~ys~tl~lqts 530 (699)
T KOG4367|consen 489 VGKETMCTVDGLHFNSTYNARVKAFNKTGVSPYSKTLVLQTS 530 (699)
T ss_pred ecCceeEEecceecchhHHHHHHHhhccCCCcccceeEeeec
Confidence 356788999999999999999999999999998887766654
No 44
>PF09067 EpoR_lig-bind: Erythropoietin receptor, ligand binding; InterPro: IPR015152 Members of this entry include the growth hormone and erythropoietin receptors. The latter interacts with erythropoietin (EPO), with subsequent initiation of the downstream chain of events associated with binding of EPO to the receptor, including EPO-induced erythroblast proliferation and differentiation through induction of the JAK2/STAT5 signalling cascade. The domain adopts a secondary structure composed of a short amino-terminal helix, followed by two beta-sandwich regions.; PDB: 3NCB_B 3NCF_B 3NCE_B 3N0P_B 3NCC_B 1BP3_B 3N06_B 3MZG_B 3D48_R 1F6F_C ....
Probab=79.76 E-value=5.4 Score=24.08 Aligned_cols=42 Identities=19% Similarity=0.492 Sum_probs=31.5
Q ss_pred CCCCCcceEEEEecCCeEEEEEeCCCCccCCceeeEEEEEEEeCC
Q psy14533 159 PAAPPLDITCSALSSTSLSVTWQPPPLLLQNGEILGYKVYYENMR 203 (209)
Q Consensus 159 p~~~p~~~~~~~~~~~sv~l~W~~p~~~~~~~~i~~Y~i~y~~~~ 203 (209)
|+..|.++.+...+...++..|++. ...+.. ..|.+.|...+
T Consensus 7 ~p~~P~~~~C~S~~~etftC~W~~g--~~~~l~-~~y~L~Y~~~~ 48 (104)
T PF09067_consen 7 PPEKPENLKCFSREMETFTCFWEPG--SEGNLP-TNYTLFYKKEG 48 (104)
T ss_dssp HCCCCEEEEEEBSSSS-EEEEEEEE--SSSTST-CEEEEEEEETT
T ss_pred CCCCCccCccCCCCCCcEEEEeeCC--CCCCCC-CcEEEEEEeCC
Confidence 3447889999999999999999987 344433 33999999874
No 45
>cd05735 Ig8_DSCAM Eighth immunoglobulin (Ig) domain of Down Syndrome Cell Adhesion molecule (DSCAM). Ig8_DSCAM: the eighth immunoglobulin (Ig) domain of Down Syndrome Cell Adhesion molecule (DSCAM). DSCAM is a cell adhesion molecule expressed largely in the developing nervous system. The gene encoding DSCAM is located at human chromosome 21q22, the locus associated with the mental retardation phenotype of Down Syndrome. DSCAM is predicted to be the largest member of the Ig superfamily. It has been demonstrated that DSCAM can mediate cation-independent homophilic intercellular adhesion.
Probab=79.00 E-value=4.6 Score=23.29 Aligned_cols=35 Identities=20% Similarity=-0.011 Sum_probs=24.3
Q ss_pred ceEEEeecCCCceEEEEEEEEcCCCCCCCCCCeeE
Q psy14533 13 QWAVLQDLLPATLYRVRVLAENSLGAGRPSDPLLV 47 (209)
Q Consensus 13 ~~~~i~~L~p~~~Y~~~v~a~~~~g~~~~s~~~~~ 47 (209)
..+.|.++.......|.|.|.|..|.......+.+
T Consensus 47 s~L~I~~~~~~D~G~YtC~A~N~~G~~~~~~~L~V 81 (88)
T cd05735 47 STLQILPTVREDSGFFSCHAINSYGEDRGIIQLTV 81 (88)
T ss_pred EEEEECCCCcccCEEEEEEEEcCCCcceEEEEEEE
Confidence 34667777777888899999999887653333333
No 46
>cd05851 Ig3_Contactin-1 Third Ig domain of contactin-1. Ig3_Contactin-1: Third Ig domain of the neural cell adhesion molecule contactin-1. Contactins are comprised of six Ig domains followed by four fibronectin type III (FnIII) domains anchored to the membrane by glycosylphosphatidylinositol. Contactin-1 is differentially expressed in tumor tissues and may, through a RhoA mechanism, facilitate invasion and metastasis of human lung adenocarcinoma.
Probab=78.51 E-value=3.6 Score=23.72 Aligned_cols=28 Identities=18% Similarity=0.120 Sum_probs=24.1
Q ss_pred ceEEEeecCCCceEEEEEEEEcCCCCCC
Q psy14533 13 QWAVLQDLLPATLYRVRVLAENSLGAGR 40 (209)
Q Consensus 13 ~~~~i~~L~p~~~Y~~~v~a~~~~g~~~ 40 (209)
..+.|.++.+.....|.|.|.|..|...
T Consensus 53 ~~L~I~~v~~~D~G~Y~C~A~N~~G~~~ 80 (88)
T cd05851 53 AVLKIFNIQPEDEGTYECEAENIKGKDK 80 (88)
T ss_pred CEEEECcCChhhCEEEEEEEEcCCCceE
Confidence 4688999999999999999999988654
No 47
>KOG3515|consensus
Probab=77.15 E-value=4.6 Score=33.44 Aligned_cols=76 Identities=17% Similarity=0.291 Sum_probs=54.9
Q ss_pred eEEEeecCCCceEEEEEEEEcCCCCCCCCCCeeEecCCCCCCCCCCceEEEeecCCeEEEEecCCCCCCCCCcceeEEEE
Q psy14533 14 WAVLQDLLPATLYRVRVLAENSLGAGRPSDPLLVHTEAEPPTAEPSGLHAVAISSDSIRVTWSPPPAHLTNGDLLGYYLG 93 (209)
Q Consensus 14 ~~~i~~L~p~~~Y~~~v~a~~~~g~~~~s~~~~~~t~~~~~~~~p~~~~~~~~~~~~~~l~W~~~~~~~~~~~~~~y~v~ 93 (209)
+..+-+.-++..|.+.|.+.|..|....+..+..- ..+..|.++.........+.+.|.+..+ .+....|.+.
T Consensus 655 ~~~~~~~~~~~~y~~~c~t~n~lg~~~v~~~~~~~----t~~~~~~n~~~~~~~~~~i~l~~~p~fd---gg~~q~f~~~ 727 (741)
T KOG3515|consen 655 SEWILNHIDGSDYENGCTTQNLLGSDHVSGAIHSG----TASVGPINLTYDNLTYSTISLEWMPGFD---GGLQQRFFLK 727 (741)
T ss_pred cccccCCcchhhhcceeeecccCCCccccceecCC----cCCCCccceEeeeeeeeeeceeeeeccc---cccccceeee
Confidence 34455556777889999999999877654433322 2233578888888889999999998775 5667788887
Q ss_pred EEE
Q psy14533 94 YRE 96 (209)
Q Consensus 94 ~~~ 96 (209)
|..
T Consensus 728 ~~~ 730 (741)
T KOG3515|consen 728 YYD 730 (741)
T ss_pred hhh
Confidence 766
No 48
>KOG1948|consensus
Probab=75.69 E-value=51 Score=28.32 Aligned_cols=24 Identities=25% Similarity=0.304 Sum_probs=20.9
Q ss_pred cceEEEeecCCCceEEEEEEEEcC
Q psy14533 12 EQWAVLQDLLPATLYRVRVLAENS 35 (209)
Q Consensus 12 ~~~~~i~~L~p~~~Y~~~v~a~~~ 35 (209)
+..|.|.+|.|+..|.+++.+..+
T Consensus 945 nG~yRiRGL~Pdc~Y~V~vk~~~~ 968 (1165)
T KOG1948|consen 945 NGTYRIRGLLPDCEYQVHVKSYAD 968 (1165)
T ss_pred CCcEEEeccCCCceEEEEEeeccC
Confidence 357999999999999999998843
No 49
>cd05870 Ig5_NCAM-2 Fifth immunoglobulin (Ig)-like domain of Neural Cell Adhesion Molecule NCAM-2 (also known as OCAM/mamFas II and RNCAM). Ig5_NCAM-2: the fifth immunoglobulin (Ig)-like domain of Neural Cell Adhesion Molecule NCAM-2 (also known as OCAM/mamFas II and RNCAM). NCAM-2 is organized similarly to NCAM, including five N-terminal Ig-like domains and two fibronectin type III domains. NCAM-2 is differentially expressed in the developing and mature olfactory epithelium (OE), and may function like NCAM, as an adhesion molecule.
Probab=75.13 E-value=7.6 Score=22.81 Aligned_cols=28 Identities=14% Similarity=0.106 Sum_probs=22.7
Q ss_pred ceEEEeecCCCceEEEEEEEEcCCCCCC
Q psy14533 13 QWAVLQDLLPATLYRVRVLAENSLGAGR 40 (209)
Q Consensus 13 ~~~~i~~L~p~~~Y~~~v~a~~~~g~~~ 40 (209)
.++.|.++.+.....|.|.|.|..|...
T Consensus 64 ~~L~I~~v~~~D~G~Y~C~A~N~~G~~~ 91 (98)
T cd05870 64 SSLHIKDVKLSDSGRYDCEAASRIGGHQ 91 (98)
T ss_pred eEEEEeeCCcCCCEEEEEEEeccCCcce
Confidence 3678888888888899999998888543
No 50
>PF07353 Uroplakin_II: Uroplakin II; InterPro: IPR009952 This family contains uroplakin II, which is approximately 180 residues long and seems to be restricted to mammals. Uroplakin II is an integral membrane protein, and is one of the components of the apical plaques of mammalian urothelium formed by the asymmetric unit membrane - this is believed to play a role in strengthening the urothelial apical surface to prevent the cells from rupturing during bladder distension.; GO: 0016044 cellular membrane organization, 0030176 integral to endoplasmic reticulum membrane
Probab=73.64 E-value=8.6 Score=25.04 Aligned_cols=24 Identities=21% Similarity=0.272 Sum_probs=20.0
Q ss_pred ceEEEeecCCCceEEEEEEEEcCC
Q psy14533 13 QWAVLQDLLPATLYRVRVLAENSL 36 (209)
Q Consensus 13 ~~~~i~~L~p~~~Y~~~v~a~~~~ 36 (209)
.-|.+.||.||+.|.|+-...++.
T Consensus 102 saYqVtNL~pGTkY~isY~Vtkgt 125 (184)
T PF07353_consen 102 SAYQVTNLQPGTKYYISYLVTKGT 125 (184)
T ss_pred eeEEeeccCCCcEEEEEEEEecCc
Confidence 468999999999999987776653
No 51
>PF09423 PhoD: PhoD-like phosphatase; InterPro: IPR018946 This entry contains a number of putative proteins as well as Alkaline phosphatase D which catalyses the reaction: A phosphate monoester + H(2)O = an alcohol + phosphate ; PDB: 2YEQ_B.
Probab=73.58 E-value=11 Score=29.60 Aligned_cols=50 Identities=20% Similarity=0.117 Sum_probs=22.0
Q ss_pred ceEEecCCCCCcEEEEEEEEEcCCCCccCCccEEEEeccCCCCCCCcceEEEEecC
Q psy14533 118 GVATLTGLRKYRKYDIVVQAFNEKGPGPMSSEVSVQTLEDVPAAPPLDITCSALSS 173 (209)
Q Consensus 118 ~~~~~~~L~p~~~Y~~~v~a~~~~g~~~~s~~~~~~t~~~~p~~~p~~~~~~~~~~ 173 (209)
..+.+++|+|++.|.+++.... .+..+..-.++|. |...+..+++...++
T Consensus 64 ~~v~v~gL~p~t~Y~Y~~~~~~---~~~~s~~g~~rT~---p~~~~~~~r~a~~SC 113 (453)
T PF09423_consen 64 VKVDVTGLQPGTRYYYRFVVDG---GGQTSPVGRFRTA---PDGDPDPFRFAFGSC 113 (453)
T ss_dssp EEEEE-S--TT-EEEEEEEE-----TTEE---EEEE-----TT-----EEEEEE--
T ss_pred eecccCCCCCCceEEEEEEEec---CCCCCCceEEEcC---CCCCCCceEEEEECC
Confidence 4678999999999999998832 2333445667776 322333466554443
No 52
>cd05760 Ig2_PTK7 Second immunoglobulin (Ig)-like domain of protein tyrosine kinase (PTK) 7, also known as CCK4. Ig2_PTK7: domain similar to the second immunoglobulin (Ig)-like domain in protein tyrosine kinase (PTK) 7, also known as CCK4. PTK7 is a subfamily of the receptor protein tyrosine kinase family, and is referred to as an RPTK-like molecule. RPTKs transduce extracellular signals across the cell membrane, and play important roles in regulating cell proliferation, migration, and differentiation. PTK7 is organized as an extracellular portion having seven Ig-like domains, a single transmembrane region, and a cytoplasmic tyrosine kinase-like domain. PTK7 is considered a pseudokinase as it has several unusual residues in some of the highly conserved tyrosine kinase (TK) motifs; it is predicted to lack TK activity. PTK7 may function as a cell-adhesion molecule. PTK7 mRNA is expressed at high levels in placenta, melanocytes, liver, lung, pancreas, and kidney. PTK7 is overexpressed in s
Probab=73.39 E-value=13 Score=20.58 Aligned_cols=27 Identities=19% Similarity=0.075 Sum_probs=23.1
Q ss_pred ceEEEeecCCCceEEEEEEEEcCCCCC
Q psy14533 13 QWAVLQDLLPATLYRVRVLAENSLGAG 39 (209)
Q Consensus 13 ~~~~i~~L~p~~~Y~~~v~a~~~~g~~ 39 (209)
..+.|.++.+.....|.+.|.|..|..
T Consensus 38 ~~L~I~~~~~~D~G~Y~C~a~N~~G~~ 64 (77)
T cd05760 38 RTLTLRSAGPDDSGLYYCCAHNAFGSV 64 (77)
T ss_pred CEEEEeeCCcccCEEEEEEEEeCCCeE
Confidence 468899999999999999999988754
No 53
>cd05762 Ig8_MLCK Eighth immunoglobulin (Ig)-like domain of human myosin light-chain kinase (MLCK). Ig8_MLCK: the eighth immunoglobulin (Ig)-like domain of human myosin light-chain kinase (MLCK). MLCK is a key regulator of different forms of cell motility involving actin and myosin II. Agonist stimulation of smooth muscle cells increases cytosolic Ca2+, which binds calmodulin. This Ca2+-calmodulin complex in turn binds to and activates MLCK. Activated MLCK leads to the phosphorylation of the 20 kDa myosin regulatory light chain (RLC) of myosin II and the stimulation of actin-activated myosin MgATPase activity. MLCK is widely present in vertebrate tissues; it phosphorylates the 20 kDa RLC of both smooth and nonmuscle myosin II. Phosphorylation leads to the activation of the myosin motor domain and altered structural properties of myosin II. In smooth muscle, MLCK is involved in initiating contraction. In nonmuscle cells, MLCK may participate in cell division and cell motility; it has
Probab=72.70 E-value=12 Score=22.17 Aligned_cols=31 Identities=13% Similarity=0.056 Sum_probs=25.6
Q ss_pred cceEEEeecCCCceEEEEEEEEcCCCCCCCC
Q psy14533 12 EQWAVLQDLLPATLYRVRVLAENSLGAGRPS 42 (209)
Q Consensus 12 ~~~~~i~~L~p~~~Y~~~v~a~~~~g~~~~s 42 (209)
.+.+.|.+........|.|.|.|..|....+
T Consensus 55 ~s~L~I~~~~~~D~G~Ytc~a~N~~G~~~~~ 85 (98)
T cd05762 55 SSKLTITEGQQEHCGCYTLEVENKLGSRQAQ 85 (98)
T ss_pred eeEEEECCCChhhCEEEEEEEEcCCCceeEE
Confidence 4568889999999999999999999876533
No 54
>PLN02533 probable purple acid phosphatase
Probab=71.62 E-value=9.2 Score=29.73 Aligned_cols=33 Identities=18% Similarity=0.211 Sum_probs=23.6
Q ss_pred ceEEEeecCCCceEEEEEEEEcCCCCCCCCCCeeEecCC
Q psy14533 13 QWAVLQDLLPATLYRVRVLAENSLGAGRPSDPLLVHTEA 51 (209)
Q Consensus 13 ~~~~i~~L~p~~~Y~~~v~a~~~~g~~~~s~~~~~~t~~ 51 (209)
..+.|++|+|++.|.|+|-. ...+....++|..
T Consensus 103 H~v~l~~L~p~T~Y~Yrvg~------~~~s~~~~F~T~p 135 (427)
T PLN02533 103 NDVVIGPLKPNTVYYYKCGG------PSSTQEFSFRTPP 135 (427)
T ss_pred EEEEeCCCCCCCEEEEEECC------CCCccceEEECCC
Confidence 45789999999999999842 1224556777654
No 55
>PF11344 DUF3146: Protein of unknown function (DUF3146); InterPro: IPR021492 This family of proteins with unknown function appear to be restricted to Cyanobacteria.
Probab=71.47 E-value=3.6 Score=22.93 Aligned_cols=15 Identities=27% Similarity=0.381 Sum_probs=13.1
Q ss_pred ecCCCceEEEEEEEE
Q psy14533 19 DLLPATLYRVRVLAE 33 (209)
Q Consensus 19 ~L~p~~~Y~~~v~a~ 33 (209)
.|+||..|.|.|+|.
T Consensus 66 ~LEpGgdY~Ftirak 80 (80)
T PF11344_consen 66 QLEPGGDYSFTIRAK 80 (80)
T ss_pred eccCCCceEEEEecC
Confidence 589999999999873
No 56
>cd05740 Ig_CEACAM_D4 Fourth immunoglobulin (Ig)-like domain of carcinoembryonic antigen (CEA) related cell adhesion molecule (CEACAM). Ig_CEACAM_D4: immunoglobulin (Ig)-like domain 4 in carcinoembryonic antigen (CEA) related cell adhesion molecule (CEACAM) protein subfamily. The CEA family is a group of anchored or secreted glycoproteins, expressed by epithelial cells, leukocytes, endothelial cells and placenta. The CEA family is divided into the CEACAM and pregnancy-specific glycoprotein (PSG) subfamilies. This group represents the CEACAM subfamily. CEACAM1 has many important cellular functions: it is a cell adhesion molecule, a signaling molecule that regulates the growth of tumor cells, an angiogenic factor, and a receptor for bacterial and viral pathogens, including mouse hepatitis virus (MHV). In mice, four isoforms of CEACAM1 generated by alternative splicing have either two [D1, D4] or four [D1-D4] Ig-like domains on the cell surface. This family corresponds to the
Probab=70.21 E-value=11 Score=21.89 Aligned_cols=30 Identities=7% Similarity=0.014 Sum_probs=25.3
Q ss_pred CcceEEEeecCCCceEEEEEEEEcCCCCCC
Q psy14533 11 TEQWAVLQDLLPATLYRVRVLAENSLGAGR 40 (209)
Q Consensus 11 ~~~~~~i~~L~p~~~Y~~~v~a~~~~g~~~ 40 (209)
+...+.|.++.......|+|.|.|..|...
T Consensus 53 ~~~~L~I~~v~~~D~G~Y~C~a~N~~G~~~ 82 (91)
T cd05740 53 DNRTLTFNNVTRSDTGHYQCEASNEVSNMT 82 (91)
T ss_pred CCCEEEECcCChhhCEEEEEEEEcCCCCEE
Confidence 345799999999999999999999988644
No 57
>cd05763 Ig_1 Subgroup of the immunoglobulin (Ig) superfamily. Ig_1: subgroup of the immunoglobulin (Ig) domain found in the Ig superfamily. The Ig superfamily is a heterogeneous group of proteins, built on a common fold comprised of a sandwich of two beta sheets. Members of the Ig superfamily are components of immunoglobulin, neuroglia, cell surface glycoproteins, such as T-cell receptors, CD2, CD4, CD8, and membrane glycoproteins, such as butyrophilin and chondroitin sulfate proteoglycan core protein. A predominant feature of most Ig domains is a disulfide bridge connecting the two beta-sheets with a tryptophan residue packed against the disulfide bond.
Probab=68.40 E-value=14 Score=20.22 Aligned_cols=27 Identities=15% Similarity=0.011 Sum_probs=22.3
Q ss_pred ceEEEeecCCCceEEEEEEEEcCCCCC
Q psy14533 13 QWAVLQDLLPATLYRVRVLAENSLGAG 39 (209)
Q Consensus 13 ~~~~i~~L~p~~~Y~~~v~a~~~~g~~ 39 (209)
..+.|.++.+.....|.+.|.|..|..
T Consensus 40 ~~L~I~~~~~~D~G~Y~C~A~N~~G~~ 66 (75)
T cd05763 40 DVFFIVDVKIEDTGVYSCTAQNTAGSI 66 (75)
T ss_pred CEEEEeeCCcccCEEEEEEEEcCCCEE
Confidence 467888998888999999999887753
No 58
>TIGR03000 plancto_dom_1 Planctomycetes uncharacterized domain TIGR03000. Domains described by this model are found, so far, only in the Planctomycetes (Pirellula sp. strain 1 and Gemmata obscuriglobus), in up to six proteins per genome, and may be duplicated within a protein. The function is unknown.
Probab=68.30 E-value=10 Score=21.32 Aligned_cols=27 Identities=26% Similarity=0.195 Sum_probs=22.1
Q ss_pred CCCcceEEEeecCCCceEEEEEEEEcC
Q psy14533 9 SGTEQWAVLQDLLPATLYRVRVLAENS 35 (209)
Q Consensus 9 ~~~~~~~~i~~L~p~~~Y~~~v~a~~~ 35 (209)
.|...+|.-.+|.+|..|.|.|.+-..
T Consensus 25 ~G~~R~F~T~~L~~G~~y~Y~v~a~~~ 51 (75)
T TIGR03000 25 TGTVRTFTTPPLEAGKEYEYTVTAEYD 51 (75)
T ss_pred CccEEEEECCCCCCCCEEEEEEEEEEe
Confidence 345568999999999999999998643
No 59
>PF09240 IL6Ra-bind: Interleukin-6 receptor alpha chain, binding; InterPro: IPR015321 Members of this entry adopt a structure consisting of an immunoglobulin-like beta-sandwich, with seven strands in two beta-sheets, in a Greek-key topology. They are required for binding to the cytokine Interleukin-6. ; PDB: 1N26_A 1P9M_C 3LB6_C 1PVH_A 3L5H_A 1BQU_A 1I1R_A 3QT2_B 3BPN_C 3BPO_C ....
Probab=67.42 E-value=23 Score=20.91 Aligned_cols=38 Identities=21% Similarity=0.447 Sum_probs=28.5
Q ss_pred CcceEEEEecCCeEEEEEeCCCCccCCceeeEEEEEEEeCC
Q psy14533 163 PLDITCSALSSTSLSVTWQPPPLLLQNGEILGYKVYYENMR 203 (209)
Q Consensus 163 p~~~~~~~~~~~sv~l~W~~p~~~~~~~~i~~Y~i~y~~~~ 203 (209)
|++|.+.-.+...+..+|.+-+ .. ..-..|.++|+..+
T Consensus 2 ~~nlsC~~~~~~~m~CtW~~g~--~~-~~~t~y~L~~~~~~ 39 (99)
T PF09240_consen 2 PQNLSCFIYNLEYMNCTWEPGK--EA-PPDTQYTLYYWYSP 39 (99)
T ss_dssp -EEEEEEEETTTEEEEEEECCT--TC-STTEEEEEEEEETT
T ss_pred CeeCEEEEECCEEEEEEECCCC--CC-CCcccEEEEEEcCC
Confidence 5789999888999999998652 11 12468999999875
No 60
>cd05748 Ig_Titin_like Immunoglobulin (Ig)-like domain of titin and similar proteins. Ig_Titin_like: immunoglobulin (Ig)-like domain found in titin-like proteins. Titin (also called connectin) is a fibrous sarcomeric protein specifically found in vertebrate striated muscle. Titin is gigantic; depending on isoform composition, it ranges from 2970 to 3700 kDa and is of a length that spans half a sarcomere. Titin largely consists of multiple repeats of Ig-like and fibronectin type 3 (FN-III)-like domains. Titin connects the ends of myosin thick filaments to Z disks and extends along the thick filament to the H zone. It appears to function similarly to an elastic band, keeping the myosin filaments centered in the sarcomere during muscle contraction or stretching. Within the sarcomere, titin is also attached to or is associated with myosin binding protein C (MyBP-C). MyBP-C appears to contribute to the generation of passive tension by titin, and similar to titin has repeated Ig-like and FN-
Probab=66.64 E-value=19 Score=19.60 Aligned_cols=28 Identities=11% Similarity=0.075 Sum_probs=23.1
Q ss_pred cceEEEeecCCCceEEEEEEEEcCCCCC
Q psy14533 12 EQWAVLQDLLPATLYRVRVLAENSLGAG 39 (209)
Q Consensus 12 ~~~~~i~~L~p~~~Y~~~v~a~~~~g~~ 39 (209)
...+.|.++.+.....|.|.|.|..|..
T Consensus 39 ~~~L~I~~~~~~D~G~Y~C~a~N~~G~~ 66 (74)
T cd05748 39 STSLVIKNAERSDSGKYTLTLKNPAGEK 66 (74)
T ss_pred eEEEEECCCCcCcCEEEEEEEECCCccE
Confidence 3468888999999999999999988753
No 61
>cd02848 Chitinase_N_term Chitinase N-terminus domain. Chitinases hydrolyze the abundant natural biopolymer chitin, producing smaller chito-oligosaccharides. Chitin consists of multiple N-acetyl-D-glucosamine (NAG) residues connected via beta-1,4-glycosidic linkages and is an important structural element of fungal cell wall and arthropod exoskeletons. On the basis of the mode of chitin hydrolysis, chitinases are classified as random, endo-, and exo-chitinases and based on sequence criteria, chitinases belong to families 18 and 19 of glycosyl hydrolases. The N-terminus of chitinase may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitob
Probab=66.24 E-value=27 Score=21.20 Aligned_cols=31 Identities=19% Similarity=0.127 Sum_probs=22.1
Q ss_pred eEEEeecCCCceEEEEEEEEcCCCCCCCCCCee
Q psy14533 14 WAVLQDLLPATLYRVRVLAENSLGAGRPSDPLL 46 (209)
Q Consensus 14 ~~~i~~L~p~~~Y~~~v~a~~~~g~~~~s~~~~ 46 (209)
+..+. ...+..|.++|..++..|++. |.+..
T Consensus 71 ~at~~-v~kgG~y~m~V~lCn~dGCS~-S~~~~ 101 (106)
T cd02848 71 TATFK-VGKGGRYQMQVALCNGDGCST-SAAKE 101 (106)
T ss_pred EEEEE-eCCCCeEEEEEEEECCCCccC-cCCEE
Confidence 44444 456778999999999999776 54443
No 62
>cd05854 Ig6_Contactin-2 Sixth Ig domain of contactin-2. Ig6_Contactin-2: Sixth Ig domain of the neural cell adhesion molecule contactin-2-like. Contactins are comprised of six Ig domains followed by four fibronectin type III (FnIII) domains anchored to the membrane by glycosylphosphatidylinositol. Contactin-2 (TAG-1, axonin-1) facilitates cell adhesion by homophilic binding between molecules in apposed membranes. It may play a part in the neuronal processes of neurite outgrowth, axon guidance and fasciculation, and neuronal migration. The first four Ig domains form the intermolecular binding fragment, which arranges as a compact U-shaped module by contacts between Ig domains 1 and 4, and domains 2 and 3. The different contactins show different expression patterns in the central nervous system. During development and in adulthood, contactin-2 is transiently expressed in subsets of central and peripheral neurons. Contactin-2 is also expressed in retinal amacrine cells in the developing c
Probab=64.74 E-value=11 Score=21.51 Aligned_cols=28 Identities=7% Similarity=-0.067 Sum_probs=23.7
Q ss_pred ceEEEeecCCCceEEEEEEEEcCCCCCC
Q psy14533 13 QWAVLQDLLPATLYRVRVLAENSLGAGR 40 (209)
Q Consensus 13 ~~~~i~~L~p~~~Y~~~v~a~~~~g~~~ 40 (209)
.++.|.+++......|.|.|.|..|...
T Consensus 47 ~~L~I~~v~~~D~G~YtC~A~n~~g~~~ 74 (85)
T cd05854 47 GDLVIVNAQLSHAGTYTCTAQTVVDSAS 74 (85)
T ss_pred eEEEEccCChhhCeEEEEEEecCCCCEE
Confidence 4688999999999999999999887544
No 63
>cd05894 Ig_C5_MyBP-C C5 immunoglobulin (Ig) domain of cardiac myosin binding protein C (MyBP-C). Ig_C5_MyBP_C: the C5 immunoglobulin (Ig) domain of cardiac myosin binding protein C (MyBP-C). MyBP_C consists of repeated domains, Ig and fibronectin type 3, and various linkers. Three isoforms of MyBP_C exist and are included in this group: cardiac (c), and fast and slow skeletal muscle (s) MyBP_C. cMyBP_C has insertions between and inside domains and an additional cardiac-specific Ig domain at the N-terminus. For cMyBP_C an interaction has been demonstrated between this C5 domain and the Ig C8 domain.
Probab=64.45 E-value=21 Score=20.39 Aligned_cols=27 Identities=11% Similarity=0.028 Sum_probs=23.3
Q ss_pred ceEEEeecCCCceEEEEEEEEcCCCCC
Q psy14533 13 QWAVLQDLLPATLYRVRVLAENSLGAG 39 (209)
Q Consensus 13 ~~~~i~~L~p~~~Y~~~v~a~~~~g~~ 39 (209)
..+.|.++.+.....|.|.|.|..|..
T Consensus 52 ~~L~I~~~~~~D~G~Y~c~a~N~~G~~ 78 (86)
T cd05894 52 SSFVIEGAEREDEGVYTITVTNPVGED 78 (86)
T ss_pred EEEEECCCccCcCEEEEEEEEeCCCcE
Confidence 568889999999999999999988854
No 64
>cd05726 Ig4_Robo Fourth immunoglobulin (Ig)-like domain in Robo (roundabout) receptors. Ig4_Robo: domain similar to the fourth immunoglobulin (Ig)-like domain in Robo (roundabout) receptors. Robo receptors play a role in the development of the central nervous system (CNS), and are receptors of Slit protein. Slit is a repellant secreted by the neural cells in the midline. Slit acts through Robo to prevent most neurons from crossing the midline from either side. Three mammalian Robo homologs (robo1, -2, and -3), and three mammalian Slit homologs (Slit-1, -2, -3), have been identified. Commissural axons, which cross the midline, express low levels of Robo; longitudinal axons, which avoid the midline, express high levels of Robo. robo1, -2, and -3 are expressed by commissural neurons in the vertebrate spinal cord and Slits 1, -2, -3 are expressed at the ventral midline. Robo-3 is a divergent member of the Robo family which instead of being a positive regulator of slit responsiveness, antagoni
Probab=63.65 E-value=18 Score=20.78 Aligned_cols=28 Identities=7% Similarity=-0.124 Sum_probs=22.8
Q ss_pred ceEEEeecCCCceEEEEEEEEcCCCCCC
Q psy14533 13 QWAVLQDLLPATLYRVRVLAENSLGAGR 40 (209)
Q Consensus 13 ~~~~i~~L~p~~~Y~~~v~a~~~~g~~~ 40 (209)
..+.|.++.+.....|.|.|.|..|...
T Consensus 47 ~~L~I~~v~~~D~G~Y~C~a~N~~G~~~ 74 (90)
T cd05726 47 GDLTITNVQRSDVGYYICQTLNVAGSIL 74 (90)
T ss_pred CeEEEeeCChhhCEEEEEEEEcCCCceE
Confidence 3578889988888889999998887644
No 65
>cd05893 Ig_Palladin_C C-terminal immunoglobulin (Ig)-like domain of palladin. Ig_Palladin_C: C-terminal immunoglobulin (Ig)-like domain of palladin. Palladin belongs to the palladin-myotilin-myopalladin family. Proteins belonging to this family contain multiple Ig-like domains and function as scaffolds, modulating actin cytoskeleton. Palladin binds to alpha-actinin, ezrin, vasodilator-stimulated phosphoprotein VASP, SPIN90 (DIP, mDia interacting protein), and Src. Palladin also binds F-actin directly, via its Ig3 domain. Palladin is expressed as several alternatively spliced isoforms, having various combinations of Ig-like domains, in a cell-type-specific manner. It has been suggested that palladin's different Ig-like domains may be specialized for distinct functions.
Probab=63.32 E-value=4.7 Score=22.45 Aligned_cols=27 Identities=19% Similarity=0.052 Sum_probs=23.3
Q ss_pred eEEEeecCCCceEEEEEEEEcCCCCCC
Q psy14533 14 WAVLQDLLPATLYRVRVLAENSLGAGR 40 (209)
Q Consensus 14 ~~~i~~L~p~~~Y~~~v~a~~~~g~~~ 40 (209)
.+.|.++.+.....|.|.|.|..|...
T Consensus 42 ~L~I~~v~~~D~G~Y~C~A~N~~G~~~ 68 (75)
T cd05893 42 CLLIQGATKEDAGWYTVSAKNEAGIVS 68 (75)
T ss_pred EEEECCCCHHHCEEEEEEEEcCCCEEE
Confidence 578999999999999999999988643
No 66
>PF09423 PhoD: PhoD-like phosphatase; InterPro: IPR018946 This entry contains a number of putative proteins as well as Alkaline phosphatase D which catalyses the reaction: A phosphate monoester + H(2)O = an alcohol + phosphate ; PDB: 2YEQ_B.
Probab=63.17 E-value=18 Score=28.33 Aligned_cols=35 Identities=23% Similarity=0.235 Sum_probs=17.7
Q ss_pred ceEEEeecCCCceEEEEEEEEcCCCCCCCCCCeeEecC
Q psy14533 13 QWAVLQDLLPATLYRVRVLAENSLGAGRPSDPLLVHTE 50 (209)
Q Consensus 13 ~~~~i~~L~p~~~Y~~~v~a~~~~g~~~~s~~~~~~t~ 50 (209)
..+.+++|+|++.|.|++..... ...+..-.++|.
T Consensus 64 ~~v~v~gL~p~t~Y~Y~~~~~~~---~~~s~~g~~rT~ 98 (453)
T PF09423_consen 64 VKVDVTGLQPGTRYYYRFVVDGG---GQTSPVGRFRTA 98 (453)
T ss_dssp EEEEE-S--TT-EEEEEEEE--T---TEE---EEEE--
T ss_pred eecccCCCCCCceEEEEEEEecC---CCCCCceEEEcC
Confidence 36789999999999999999322 222344567665
No 67
>TIGR00868 hCaCC calcium-activated chloride channel protein 1.
Probab=62.27 E-value=18 Score=30.99 Aligned_cols=39 Identities=18% Similarity=0.351 Sum_probs=28.6
Q ss_pred CCcceEEEEecCCeEEEEEeCCCCccCCceeeEEEEEEEe
Q psy14533 162 PPLDITCSALSSTSLSVTWQPPPLLLQNGEILGYKVYYEN 201 (209)
Q Consensus 162 ~p~~~~~~~~~~~sv~l~W~~p~~~~~~~~i~~Y~i~y~~ 201 (209)
..++|.+.. ....|.|+|-.|-.+...|...+|.|+|..
T Consensus 758 rItDL~~~~-~~~~v~LsWTAPG~d~D~G~a~~y~ir~s~ 796 (863)
T TIGR00868 758 KITDLEAGF-QGDNIILTWTAPGDVLDHGRADRYIIRIST 796 (863)
T ss_pred cceeeEEee-cCCEEEEEeeCCCccCCCCccceEEEEecC
Confidence 455666643 455599999999544477788999999975
No 68
>cd05879 Ig_P0 Immunoglobulin (Ig)-like domain of Protein zero (P0). Ig_P0ex: immunoglobulin (Ig) domain of Protein zero (P0). P0 accounts for over 50% of the total protein in peripheral nervous system (PNS) myelin. P0 is a single-pass transmembrane glycoprotein with a highly basic intracellular domain and an Ig domain. The extracellular domain of P0 (P0-ED) is similar to the Ig variable domain, carrying one acceptor sequence for N-linked glycosylation. P0 plays a role in membrane adhesion in the spiral wraps of the myelin sheath. The intracellular domain is thought to mediate membrane apposition of the cytoplasmic faces and may, through electrostatic interactions, interact directly with lipid headgroups. It is thought that homophilic interactions of the P0 extracellular domain mediate membrane juxtaposition in the extracellular space of PNS myelin.
Probab=62.18 E-value=32 Score=21.18 Aligned_cols=24 Identities=13% Similarity=-0.082 Sum_probs=13.9
Q ss_pred ceEEEeecCCCceEEEEEEEEcCC
Q psy14533 13 QWAVLQDLLPATLYRVRVLAENSL 36 (209)
Q Consensus 13 ~~~~i~~L~p~~~Y~~~v~a~~~~ 36 (209)
.++.|.+|+......|.|...|..
T Consensus 80 aSI~I~nv~~sD~G~Y~C~v~n~p 103 (116)
T cd05879 80 GSIVIHNLDYTDNGTFTCDVKNPP 103 (116)
T ss_pred eEEEEccCCcccCEEEEEEEEcCC
Confidence 455666666666666666655543
No 69
>cd05876 Ig3_L1-CAM Third immunoglobulin (Ig)-like domain of the L1 cell adhesion molecule (CAM). Ig3_L1-CAM: third immunoglobulin (Ig)-like domain of the L1 cell adhesion molecule (CAM). L1 belongs to the L1 subfamily of cell adhesion molecules (CAMs) and is comprised of an extracellular region having six Ig-like domains, five fibronectin type III domains, a transmembrane region and an intracellular domain. L1 is primarily expressed in the nervous system and is involved in its development and function. L1 is associated with an X-linked recessive disorder, X-linked hydrocephalus, MASA syndrome, or spastic paraplegia type 1, that involves abnormalities of axonal growth. This group also contains the chicken neuron-glia cell adhesion molecule, Ng-CAM.
Probab=61.88 E-value=9.4 Score=20.77 Aligned_cols=29 Identities=28% Similarity=0.229 Sum_probs=24.1
Q ss_pred cceEEEeecCCCceEEEEEEEEcCCCCCC
Q psy14533 12 EQWAVLQDLLPATLYRVRVLAENSLGAGR 40 (209)
Q Consensus 12 ~~~~~i~~L~p~~~Y~~~v~a~~~~g~~~ 40 (209)
...+.|.++.+.....|.+.|.|..|...
T Consensus 35 ~~~L~I~~v~~~D~G~Y~C~a~N~~G~~~ 63 (71)
T cd05876 35 NKTLQLDNVLESDDGEYVCTAENSEGSAR 63 (71)
T ss_pred CCEEEEcccCHHhCEEEEEEEEcCCCeEE
Confidence 35788999999998999999999988643
No 70
>cd05730 Ig3_NCAM-1_like Third immunoglobulin (Ig)-like domain of Neural Cell Adhesion Molecule NCAM-1 (NCAM). Ig3_NCAM-1_like: domain similar to the third immunoglobulin (Ig)-like domain of Neural Cell Adhesion Molecule NCAM-1 (NCAM). NCAM plays important roles in the development and regeneration of the central nervous system, in synaptogenesis and neural migration. NCAM mediates cell-cell and cell-substratum recognition and adhesion via homophilic (NCAM-NCAM) and heterophilic (NCAM-non-NCAM) interactions. NCAM is expressed as three major isoforms having different intracellular extensions. The extracellular portion of NCAM has five N-terminal Ig-like domains and two fibronectin type III domains. The double zipper adhesion complex model for NCAM homophilic binding involves Ig1, Ig2, and Ig3. By this model, Ig1 and Ig2 mediate dimerization of NCAM molecules situated on the same cell surface (cis interactions), and Ig3 domains mediate interactions between NCAM molecules expressed on the
Probab=61.67 E-value=18 Score=20.98 Aligned_cols=28 Identities=18% Similarity=0.106 Sum_probs=23.1
Q ss_pred ceEEEeecCCCceEEEEEEEEcCCCCCC
Q psy14533 13 QWAVLQDLLPATLYRVRVLAENSLGAGR 40 (209)
Q Consensus 13 ~~~~i~~L~p~~~Y~~~v~a~~~~g~~~ 40 (209)
..+.|.++.......|.|.|.|..|...
T Consensus 58 ~~L~I~~v~~~D~G~Y~C~a~N~~G~~~ 85 (95)
T cd05730 58 SEMTILDVDKLDEAEYTCIAENKAGEQE 85 (95)
T ss_pred CEEEECCCChhhCEEEEEEEEcCCCeEE
Confidence 4688889988888899999999888643
No 71
>cd05736 Ig2_Follistatin_like Second immunoglobulin (Ig)-like domain of a follistatin-like molecule encoded by the Mahya gene and similar proteins. Ig2_Follistatin_like: domain similar to the second immunoglobulin (Ig)-like domain found in a follistatin-like molecule encoded by the CNS-related Mahya gene. Mahya genes have been retained in certain Bilaterian branches during evolution. They are conserved in Hymenoptera and Deuterostomes, but are absent from other metazoan species such as fruit fly and nematode. Mahya proteins are secretory, with a follistatin-like domain (Kazal-type serine/threonine protease inhibitor domain and EF-hand calcium-binding domain), two Ig-like domains, and a novel C-terminal domain. Mahya may be involved in learning and memory and in processing of sensory information in Hymenoptera and vertebrates. Follistatin is a secreted, multidomain protein that binds activins with high affinity and antagonizes their signaling.
Probab=61.44 E-value=25 Score=19.22 Aligned_cols=28 Identities=11% Similarity=0.045 Sum_probs=23.0
Q ss_pred ceEEEeecCCCceEEEEEEEEcCCCCCC
Q psy14533 13 QWAVLQDLLPATLYRVRVLAENSLGAGR 40 (209)
Q Consensus 13 ~~~~i~~L~p~~~Y~~~v~a~~~~g~~~ 40 (209)
..+.|.++.......|.+.+.|..|...
T Consensus 39 ~~l~I~~~~~~D~G~Y~C~a~N~~G~~~ 66 (76)
T cd05736 39 SELHISNVRYEDTGAYTCIAKNEAGVDE 66 (76)
T ss_pred CEEEECcCCcccCEEEEEEEEcCCCCcc
Confidence 4688888888888889999998887655
No 72
>cd05852 Ig5_Contactin-1 Fifth Ig domain of contactin-1. Ig5_Contactin-1: fifth Ig domain of the neural cell adhesion molecule contactin-1. Contactins are comprised of six Ig domains followed by four fibronectin type III (FnIII) domains anchored to the membrane by glycosylphosphatidylinositol. Contactin-1 is differentially expressed in tumor tissues and may, through a RhoA mechanism, facilitate invasion and metastasis of human lung adenocarcinoma.
Probab=61.00 E-value=12 Score=20.65 Aligned_cols=28 Identities=14% Similarity=0.061 Sum_probs=23.6
Q ss_pred ceEEEeecCCCceEEEEEEEEcCCCCCC
Q psy14533 13 QWAVLQDLLPATLYRVRVLAENSLGAGR 40 (209)
Q Consensus 13 ~~~~i~~L~p~~~Y~~~v~a~~~~g~~~ 40 (209)
.++.|.++.+.....|.+.|.|..|...
T Consensus 39 g~L~I~~v~~~D~G~Y~C~A~N~~G~~~ 66 (73)
T cd05852 39 GSLEILNITKLDEGSYTCFAENNRGKAN 66 (73)
T ss_pred CEEEECcCChhHCEEEEEEEECCCCcee
Confidence 4688889999999999999999887654
No 73
>KOG4228|consensus
Probab=60.73 E-value=7.7 Score=33.53 Aligned_cols=59 Identities=24% Similarity=0.389 Sum_probs=39.1
Q ss_pred EEEEcCCCCccCCccEEEEeccCCCCCCCcceEEEEecCCeEEEEEeCCCCccCCceeeEEEEEEEeC
Q psy14533 135 VQAFNEKGPGPMSSEVSVQTLEDVPAAPPLDITCSALSSTSLSVTWQPPPLLLQNGEILGYKVYYENM 202 (209)
Q Consensus 135 v~a~~~~g~~~~s~~~~~~t~~~~p~~~p~~~~~~~~~~~sv~l~W~~p~~~~~~~~i~~Y~i~y~~~ 202 (209)
+.+....+.++.+..+.+.+....|..+. ...+...++++|.++. ...+..|++.+...
T Consensus 144 ~~~~~~~~~~~~~t~i~i~~~~~~p~~~~-----~~~~~~~it~~w~~~~----~~~~~~ykl~~~~~ 202 (1087)
T KOG4228|consen 144 PLAIASPGLGPPSTYIQITTNANSPIQPG-----EEEEYTTITGSWSPPH----AVSLDTYKLLHLDP 202 (1087)
T ss_pred ccccCCcccCCCCceEEEeccCCCCCCCC-----cceEEEEEEecCCCCC----cccchhhhhhhcCC
Confidence 44444455555566677777777666444 5556778999999884 45577788776654
No 74
>cd04974 Ig3_FGFR Third immunoglobulin (Ig)-like domain of fibroblast growth factor receptor (FGFR). Ig3_FGFR: third immunoglobulin (Ig)-like domain of fibroblast growth factor receptor (FGFR). Fibroblast growth factors (FGFs) participate in morphogenesis, development, angiogenesis, and wound healing. These FGF-stimulated processes are mediated by four FGFR tyrosine kinases (FGFR1-4). FGFRs are comprised of an extracellular portion consisting of three Ig-like domains, a transmembrane helix, and a cytoplasmic portion having protein tyrosine kinase activity. The highly conserved Ig-like domains 2 and 3, and the linker region between D2 and D3 define a general binding site for FGFs.
Probab=60.49 E-value=20 Score=20.61 Aligned_cols=28 Identities=21% Similarity=0.196 Sum_probs=22.9
Q ss_pred ceEEEeecCCCceEEEEEEEEcCCCCCC
Q psy14533 13 QWAVLQDLLPATLYRVRVLAENSLGAGR 40 (209)
Q Consensus 13 ~~~~i~~L~p~~~Y~~~v~a~~~~g~~~ 40 (209)
..+.|.++.+.....|.|.|.|..|...
T Consensus 55 ~~L~I~~v~~~D~G~Y~C~A~N~~G~~~ 82 (90)
T cd04974 55 EVLYLRNVSFDDAGEYTCLAGNSIGPSH 82 (90)
T ss_pred ceEEEeccccccCcEEEEEeecccCccc
Confidence 3577888888888999999999988654
No 75
>KOG1225|consensus
Probab=59.98 E-value=36 Score=27.37 Aligned_cols=119 Identities=18% Similarity=0.126 Sum_probs=70.0
Q ss_pred CceEEEeecCCeEEEEecCCCCCCCCCcceeEEEEEEEccccCCCCcceEEeecCCCCcceEEecCCCCCcEEEEEEEEE
Q psy14533 59 SGLHAVAISSDSIRVTWSPPPAHLTNGDLLGYYLGYREQGFGRQNSYNFTTIPNRSDGAGVATLTGLRKYRKYDIVVQAF 138 (209)
Q Consensus 59 ~~~~~~~~~~~~~~l~W~~~~~~~~~~~~~~y~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~p~~~Y~~~v~a~ 138 (209)
..+....-+...+.+.|... .....+...+.+.... ... ....+.. ....+.+..|+|+..|..++.++
T Consensus 370 ~~~~~~~cs~~~~~~~~~~~------~~~~~~~~~~~~~~~~-~~~-~~~r~~~---~~~~~~~~~~~~g~~~~~~~~~v 438 (525)
T KOG1225|consen 370 SLLLITECSPPSLCIAGVGR------RRVTHCAGTYCPLGES-GGD-LQGRVPG---DANSVDIQGLEPGDEYNCSVNTV 438 (525)
T ss_pred hhhcccccCCCceeeccccc------cccccccccccccccC-CCc-cceeecc---ceeeeeeeeecCCcceeeehhhh
Confidence 33344445667777777621 1234444444441111 111 1222333 25788888999999999999997
Q ss_pred cCC-CCccCCccEEEEeccCCCCCCCcceEEEEecCCeEEEEEeCCCCccCCceeeEEEEEE
Q psy14533 139 NEK-GPGPMSSEVSVQTLEDVPAAPPLDITCSALSSTSLSVTWQPPPLLLQNGEILGYKVYY 199 (209)
Q Consensus 139 ~~~-g~~~~s~~~~~~t~~~~p~~~p~~~~~~~~~~~sv~l~W~~p~~~~~~~~i~~Y~i~y 199 (209)
-.. +..... ....+... -+..+.+.....+++.+.|..|. .....|.+.|
T Consensus 439 ~~~~~~~~~~--~~~~~~~~----~~g~~~v~~~~~~s~e~~g~~~s-----~~~~~~~~~~ 489 (525)
T KOG1225|consen 439 AANIGSLPKD--KSETTVLC----WNGGLCVDGETESSLEVGGPCPS-----SGTCGWEVRC 489 (525)
T ss_pred hhhhccCCcc--cccceEee----cCCceeeeeeeeccccccCCCCC-----ccccceEEEe
Confidence 543 322222 22222222 35578888889999999999883 3356788877
No 76
>KOG4228|consensus
Probab=59.71 E-value=31 Score=30.15 Aligned_cols=58 Identities=31% Similarity=0.359 Sum_probs=39.5
Q ss_pred EEEEcCCCCCCCCCCeeEecCCCCCCCCCCceEEEeecCCeEEEEecCCCCCCCCCcceeEEEEEEE
Q psy14533 30 VLAENSLGAGRPSDPLLVHTEAEPPTAEPSGLHAVAISSDSIRVTWSPPPAHLTNGDLLGYYLGYRE 96 (209)
Q Consensus 30 v~a~~~~g~~~~s~~~~~~t~~~~~~~~p~~~~~~~~~~~~~~l~W~~~~~~~~~~~~~~y~v~~~~ 96 (209)
+.+....+.+..+..+.+.+....|..++ .......+++.|..+. ...+..|++....
T Consensus 144 ~~~~~~~~~~~~~t~i~i~~~~~~p~~~~-----~~~~~~~it~~w~~~~----~~~~~~ykl~~~~ 201 (1087)
T KOG4228|consen 144 PLAIASPGLGPPSTYIQITTNANSPIQPG-----EEEEYTTITGSWSPPH----AVSLDTYKLLHLD 201 (1087)
T ss_pred ccccCCcccCCCCceEEEeccCCCCCCCC-----cceEEEEEEecCCCCC----cccchhhhhhhcC
Confidence 55556666665566677777777776654 3456678999998774 3567788886665
No 77
>cd05765 Ig_3 Subgroup of the immunoglobulin (Ig) superfamily. Ig_3: subgroup of the immunoglobulin (Ig) domain found in the Ig superfamily. The Ig superfamily is a heterogeneous group of proteins, built on a common fold comprised of a sandwich of two beta sheets. Members of the Ig superfamily are components of immunoglobulin, neuroglia, cell surface glycoproteins, such as T-cell receptors, CD2, CD4, CD8, and membrane glycoproteins, such as butyrophilin and chondroitin sulfate proteoglycan core protein. A predominant feature of most Ig domains is a disulfide bridge connecting the two beta-sheets with a tryptophan residue packed against the disulfide bond.
Probab=59.48 E-value=12 Score=20.76 Aligned_cols=27 Identities=22% Similarity=0.025 Sum_probs=22.0
Q ss_pred ceEEEeecCCCceEEEEEEEEcCCCCC
Q psy14533 13 QWAVLQDLLPATLYRVRVLAENSLGAG 39 (209)
Q Consensus 13 ~~~~i~~L~p~~~Y~~~v~a~~~~g~~ 39 (209)
.++.|.++.+.....|.|.|.|..|..
T Consensus 47 ~~L~I~~~~~~D~G~Y~C~a~N~~G~~ 73 (81)
T cd05765 47 GQLVIYNAQPQDAGLYTCTARNSGGLL 73 (81)
T ss_pred cEEEEccCCcccCEEEEEEEecCCceE
Confidence 457888888888888999999888754
No 78
>cd05866 Ig1_NCAM-2 First immunoglobulin (Ig)-like domain of neural cell adhesion molecule NCAM-2. Ig1_NCAM-2: first immunoglobulin (Ig)-like domain of neural cell adhesion molecule NCAM-2 (OCAM/mamFas II, RNCAM). NCAM-2 is organized similarly to NCAM, including five N-terminal Ig-like domains and two fibronectin type III domains. NCAM-2 is differentially expressed in the developing and mature olfactory epithelium (OE), and may function like NCAM, as an adhesion molecule.
Probab=58.92 E-value=29 Score=20.28 Aligned_cols=28 Identities=11% Similarity=-0.014 Sum_probs=23.6
Q ss_pred ceEEEeecCCCceEEEEEEEEcCCCCCC
Q psy14533 13 QWAVLQDLLPATLYRVRVLAENSLGAGR 40 (209)
Q Consensus 13 ~~~~i~~L~p~~~Y~~~v~a~~~~g~~~ 40 (209)
..+.|.++.......|+|.|.|..|...
T Consensus 56 ~~L~I~~v~~~D~G~Y~C~A~N~~G~~~ 83 (92)
T cd05866 56 SRLTIYNANIEDAGIYRCQATDAKGQTQ 83 (92)
T ss_pred eEEEEecCChHHCEEEEEEEEcCCCcEE
Confidence 3788999999999999999999887643
No 79
>cd04972 Ig_TrkABC_d4 Fourth domain (immunoglobulin-like) of Trk receptors TrkA, TrkB and TrkC. TrkABC_d4: the fourth domain of Trk receptors TrkA, TrkB and TrkC; this is an immunoglobulin (Ig)-like domain which binds to neurotrophin. The Trk family of receptors are tyrosine kinase receptors. They are activated by dimerization, leading to autophosphorylation of intracellular tyrosine residues, and triggering the signal transduction pathway. TrkA, TrkB, and TrkC share significant sequence homology and domain organization. The first three domains are leucine-rich domains. The fourth and fifth domains are Ig-like domains playing a part in ligand binding. TrkA, B, and C mediate the trophic effects of the neurotrophin Nerve growth factor (NGF) family. TrkA is recognized by NGF. TrkB is recognized by brain-derived neurotrophic factor (BDNF) and neurotrophin (NT)-4. TrkC is recognized by NT-3. NT-3 is promiscuous as in some cell systems it activates TrkA and TrkB receptors. TrkA is a receptor fo
Probab=57.91 E-value=20 Score=20.58 Aligned_cols=28 Identities=25% Similarity=0.242 Sum_probs=23.0
Q ss_pred ceEEEeecCCCceEEEEEEEEcCCCCCC
Q psy14533 13 QWAVLQDLLPATLYRVRVLAENSLGAGR 40 (209)
Q Consensus 13 ~~~~i~~L~p~~~Y~~~v~a~~~~g~~~ 40 (209)
..+.|.++.+.....|.|.|.|..|...
T Consensus 56 ~~L~I~~v~~~D~g~Y~C~A~N~~G~~~ 83 (90)
T cd04972 56 YNLQLSNITSETQTTVTCTAENPVGQAN 83 (90)
T ss_pred EEEEEecCCcccCEEEEEEEECCCCcee
Confidence 4678889998888899999999887543
No 80
>cd05751 Ig1_LILRB1_like First immunoglobulin (Ig)-like domain found in Leukocyte Ig-like receptors (LILR)B1 (also known as LIR-1) and similar proteins. Ig1_LILRB1_like: domain similar to the first immunoglobulin (Ig)-like domain found in Leukocyte Ig-like receptors (LILR)B1 (also known as LIR-1). This group includes LILRA5 (LIR9), an activating natural cytotoxicity receptor NKp46, and the immune-type receptor glycoprotein VI (GPVI). LILRs are a family of immunoreceptors expressed on T and B cells, on monocytes, dendritic cells, and subgroups of natural killer (NK) cells. The human LILR family contains nine proteins (LILRA1-3 and 5, and LILRB1-5). From functional assays, and as the cytoplasmic domains of various LILRs, for example LILRB1 (LIR-1), LILRB2 (LIR-2), and LILRB3 (LIR-3) contain immunoreceptor tyrosine-based inhibitory motifs (ITIMs) it is thought that LIR proteins are inhibitory receptors. Of the eight LIR family proteins, only LIR-1 (LILRB1), and LIR-2 (LILRB2),
Probab=57.84 E-value=30 Score=19.96 Aligned_cols=35 Identities=20% Similarity=0.180 Sum_probs=23.2
Q ss_pred CcceEEEeecCCCceEEEEEEEEcC-CCCCCCCCCe
Q psy14533 11 TEQWAVLQDLLPATLYRVRVLAENS-LGAGRPSDPL 45 (209)
Q Consensus 11 ~~~~~~i~~L~p~~~Y~~~v~a~~~-~g~~~~s~~~ 45 (209)
....+.|.++.+.....|+|+..+. .+.+..|.++
T Consensus 51 ~~~~f~i~~v~~~~~G~Y~C~~~~~~~~~S~~Sd~l 86 (91)
T cd05751 51 NKAKFFIPSMKREHAGRYRCYYRSGVALWSEPSDPL 86 (91)
T ss_pred eeEEEEccCCChhHCEEEEEEEECCCCccCCCCCcE
Confidence 3456888888888888888888775 3334444443
No 81
>cd05745 Ig3_Peroxidasin Third immunoglobulin (Ig)-like domain of peroxidasin. Ig3_Peroxidasin: the third immunoglobulin (Ig)-like domain in peroxidasin. Peroxidasin has a peroxidase domain and interacting extracellular motifs containing four Ig-like domains. It has been suggested that peroxidasin is secreted and has functions related to the stabilization of the extracellular matrix. It may play a part in various other important processes such as removal and destruction of cells which have undergone programmed cell death, and protection of the organism against non-self.
Probab=57.17 E-value=13 Score=20.42 Aligned_cols=28 Identities=14% Similarity=0.024 Sum_probs=23.5
Q ss_pred ceEEEeecCCCceEEEEEEEEcCCCCCC
Q psy14533 13 QWAVLQDLLPATLYRVRVLAENSLGAGR 40 (209)
Q Consensus 13 ~~~~i~~L~p~~~Y~~~v~a~~~~g~~~ 40 (209)
..+.|.++.+.....|.|.|.|..|...
T Consensus 40 ~~L~I~~v~~~D~G~Y~C~A~N~~G~~~ 67 (74)
T cd05745 40 GTLRISRVALHDQGQYECQAVNIVGSQR 67 (74)
T ss_pred CeEEEeeCCHHhCEEEEEEEEeCCCcee
Confidence 4688999999999999999999887543
No 82
>cd05734 Ig7_DSCAM Seventh immunoglobulin (Ig)-like domain of Down Syndrome Cell Adhesion molecule (DSCAM). Ig7_DSCAM: the seventh immunoglobulin (Ig)-like domain of Down Syndrome Cell Adhesion molecule (DSCAM). DSCAM is a cell adhesion molecule expressed largely in the developing nervous system. The gene encoding DSCAM is located at human chromosome 21q22, the locus associated with the mental retardation phenotype of Down Syndrome. DSCAM is predicted to be the largest member of the Ig superfamily. It has been demonstrated that DSCAM can mediate cation-independent homophilic intercellular adhesion.
Probab=57.03 E-value=32 Score=19.03 Aligned_cols=28 Identities=14% Similarity=0.096 Sum_probs=23.0
Q ss_pred ceEEEeecCCCceEEEEEEEEcCCCCCC
Q psy14533 13 QWAVLQDLLPATLYRVRVLAENSLGAGR 40 (209)
Q Consensus 13 ~~~~i~~L~p~~~Y~~~v~a~~~~g~~~ 40 (209)
.++.|.++.+.....|.+.|.|..|...
T Consensus 44 ~~L~I~~v~~~D~G~Y~C~a~N~~G~~~ 71 (79)
T cd05734 44 GSLLIKHVLEEDSGYYLCKVSNDVGADA 71 (79)
T ss_pred CeEEECcCCcccCEEEEEEEEeCCCCCC
Confidence 4678888888888899999999887643
No 83
>cd04971 Ig_TrKABC_d5 Fifth domain (immunoglobulin-like) of Trk receptors TrkA, TrkB and TrkC. TrkABC_d5: the fifth domain of Trk receptors TrkA, TrkB and TrkC; this is an immunoglobulin (Ig)-like domain which binds to neurotrophin. The Trk family of receptors are tyrosine kinase receptors. They are activated by dimerization, leading to autophosphorylation of intracellular tyrosine residues, and triggering the signal transduction pathway. TrkA, TrkB, and TrkC share significant sequence homology and domain organization. The first three domains are leucine-rich domains. The fourth and fifth domains are Ig-like domains playing a part in ligand binding. TrkA, B, and C mediate the trophic effects of the neurotrophin Nerve growth factor (NGF) family. TrkA is recognized by NGF. TrkB is recognized by brain-derived neurotrophic factor (BDNF) and neurotrophin (NT)-4. TrkC is recognized by NT-3. NT-3 is promiscuous as in some cell systems it activates TrkA and TrkB receptors. TrkA is a receptor foun
Probab=57.02 E-value=18 Score=20.51 Aligned_cols=27 Identities=11% Similarity=0.074 Sum_probs=22.3
Q ss_pred eEEEeecCCCceEEEEEEEEcCCCCCC
Q psy14533 14 WAVLQDLLPATLYRVRVLAENSLGAGR 40 (209)
Q Consensus 14 ~~~i~~L~p~~~Y~~~v~a~~~~g~~~ 40 (209)
.+.|.++.+.....|.|.|.|..|...
T Consensus 47 ~L~I~~~~~~D~G~YtC~A~N~~G~~~ 73 (81)
T cd04971 47 CLQFDNPTHVNNGNYTLVASNEYGQDS 73 (81)
T ss_pred EEEECCCCcccCeEEEEEEEeCCCCee
Confidence 477888888888899999999888654
No 84
>cd05858 Ig3_FGFR-2 Third immunoglobulin (Ig)-like domain of fibroblast growth factor receptor 2 (FGFR2). Ig3_FGFR-2-like; domain similar to the third immunoglobulin (Ig)-like domain of human fibroblast growth factor receptor 2 (FGFR2). Fibroblast growth factors (FGFs) participate in morphogenesis, development, angiogenesis, and wound healing. These FGF-stimulated processes are mediated by four FGFR tyrosine kinases (FGFR1-4). FGFRs are comprised of an extracellular portion consisting of three Ig-like domains, a transmembrane helix, and a cytoplasmic portion having protein tyrosine kinase activity. The highly conserved Ig-like domains 2 and 3, and the linker region between D2 and D3 define a general binding site for FGFs. FGFR2 is required for male sex determination.
Probab=56.03 E-value=16 Score=20.97 Aligned_cols=28 Identities=21% Similarity=0.196 Sum_probs=24.0
Q ss_pred ceEEEeecCCCceEEEEEEEEcCCCCCC
Q psy14533 13 QWAVLQDLLPATLYRVRVLAENSLGAGR 40 (209)
Q Consensus 13 ~~~~i~~L~p~~~Y~~~v~a~~~~g~~~ 40 (209)
..+.|.++.......|+|.|.|..|...
T Consensus 55 ~~L~I~~v~~~D~G~Y~C~A~N~~G~~~ 82 (90)
T cd05858 55 EVLYLRNVTFEDAGEYTCLAGNSIGISH 82 (90)
T ss_pred eEEEEccCCHHHCEEEEEEEEeCCCccc
Confidence 3689999999999999999999988654
No 85
>cd05725 Ig3_Robo Third immunoglobulin (Ig)-like domain in Robo (roundabout) receptors. Ig3_Robo: domain similar to the third immunoglobulin (Ig)-like domain in Robo (roundabout) receptors. Robo receptors play a role in the development of the central nervous system (CNS), and are receptors of Slit protein. Slit is a repellant secreted by the neural cells in the midline. Slit acts through Robo to prevent most neurons from crossing the midline from either side. Three mammalian Robo homologs (robo1, -2, and -3), and three mammalian Slit homologs (Slit-1,-2, -3), have been identified. Commissural axons, which cross the midline, express low levels of Robo; longitudinal axons, which avoid the midline, express high levels of Robo. robo1, -2, and -3 are expressed by commissural neurons in the vertebrate spinal cord and Slits 1, -2, -3 are expressed at the ventral midline. Robo-3 is a divergent member of the Robo family which instead of being a positive regulator of slit responsiveness, antagoni
Probab=55.90 E-value=24 Score=18.90 Aligned_cols=28 Identities=14% Similarity=0.081 Sum_probs=23.4
Q ss_pred cceEEEeecCCCceEEEEEEEEcCCCCC
Q psy14533 12 EQWAVLQDLLPATLYRVRVLAENSLGAG 39 (209)
Q Consensus 12 ~~~~~i~~L~p~~~Y~~~v~a~~~~g~~ 39 (209)
...+.|.++.+.....|.+.|.|..|..
T Consensus 34 ~~~L~I~~v~~~D~G~Y~C~a~N~~G~~ 61 (69)
T cd05725 34 DKSLKIRNVTAGDEGSYTCEAENMVGKI 61 (69)
T ss_pred CCEEEECcCChhHCEEEEEEEEcCCCcE
Confidence 3478899999998899999999988754
No 86
>cd05743 Ig_Perlecan_D2_like Immunoglobulin (Ig)-like domain II (D2) of the human basement membrane heparan sulfate proteoglycan perlecan, also known as HSPG2. Ig_Perlecan_D2_like: the immunoglobulin (Ig)-like domain II (D2) of the human basement membrane heparan sulfate proteoglycan perlecan, also known as HSPG2. Perlecan consists of five domains. Domain I has three putative heparan sulfate attachment sites; domain II has four LDL receptor-like repeats, and one Ig-like repeat; domain III resembles the short arm of laminin chains; domain IV has multiple Ig-like repeats (21 repeats in human perlecan); and domain V resembles the globular G domain of the laminin A chain and internal repeats of EGF. Perlecan may participate in a variety of biological functions including cell binding, LDL-metabolism, basement membrane assembly and selective permeability, calcium binding, and growth- and neurite-promoting activities.
Probab=54.65 E-value=36 Score=18.83 Aligned_cols=28 Identities=14% Similarity=-0.051 Sum_probs=22.2
Q ss_pred ceEEEeecCCCceEEEEEEEEcCCCCCC
Q psy14533 13 QWAVLQDLLPATLYRVRVLAENSLGAGR 40 (209)
Q Consensus 13 ~~~~i~~L~p~~~Y~~~v~a~~~~g~~~ 40 (209)
..+.|.++.......|.+.|.|..|...
T Consensus 42 ~~L~I~~v~~~D~G~Y~C~a~N~~G~~~ 69 (78)
T cd05743 42 GTLTIRDVKESDQGAYTCEAINTRGMVF 69 (78)
T ss_pred EEEEECCCChHHCEEEEEEEEecCCEEE
Confidence 4678888888888888999988887543
No 87
>cd04969 Ig5_Contactin_like Fifth Ig domain of contactin. Ig5_Contactin_like: Fifth Ig domain of contactins. Contactins are neural cell adhesion molecules and are comprised of six Ig domains followed by four fibronectin type III(FnIII) domains anchored to the membrane by glycosylphosphatidylinositol. The first four Ig domains form the intermolecular binding fragment, which arranges as a compact U-shaped module via contacts between Ig domains 1 and 4, and between Ig domains 2 and 3. Contactin-2 (TAG-1, axonin-1) may play a part in the neuronal processes of neurite outgrowth, axon guidance and fasciculation, and neuronal migration. This group also includes contactin-1 and contactin-5. The different contactins show different expression patterns in the central nervous system. During development and in adulthood, contactin-2 is transiently expressed in subsets of central and peripheral neurons. Contactin-5 is expressed specifically in the rat postnatal nervous system, peaking at about 3 week
Probab=54.28 E-value=14 Score=20.14 Aligned_cols=28 Identities=14% Similarity=0.116 Sum_probs=23.3
Q ss_pred ceEEEeecCCCceEEEEEEEEcCCCCCC
Q psy14533 13 QWAVLQDLLPATLYRVRVLAENSLGAGR 40 (209)
Q Consensus 13 ~~~~i~~L~p~~~Y~~~v~a~~~~g~~~ 40 (209)
..+.|.++.+.....|.+.|.|..|...
T Consensus 39 ~~L~i~~v~~~D~G~Y~C~a~N~~G~~~ 66 (73)
T cd04969 39 GSLEILNVTKSDEGKYTCFAENFFGKAN 66 (73)
T ss_pred CeEEEccCChhHCEEEEEEEECCCCceE
Confidence 4688899998888899999999887643
No 88
>cd05892 Ig_Myotilin_C C-terminal immunoglobulin (Ig)-like domain of myotilin. Ig_Myotilin_C: C-terminal immunoglobulin (Ig)-like domain of myotilin. Myotilin belongs to the palladin-myotilin-myopalladin family. Proteins belonging to the latter family contain multiple Ig-like domains and function as scaffolds, modulating actin cytoskeleton. Myotilin is most abundant in skeletal and cardiac muscle, and is involved in maintaining sarcomere integrity. It binds to alpha-actinin, filamin and actin. Mutations in myotilin lead to muscle disorders.
Probab=53.82 E-value=30 Score=19.21 Aligned_cols=27 Identities=15% Similarity=0.063 Sum_probs=22.7
Q ss_pred eEEEeecCCCceEEEEEEEEcCCCCCC
Q psy14533 14 WAVLQDLLPATLYRVRVLAENSLGAGR 40 (209)
Q Consensus 14 ~~~i~~L~p~~~Y~~~v~a~~~~g~~~ 40 (209)
++.|.++.+.....|.+.|.|..|...
T Consensus 42 ~L~I~~~~~~D~G~Y~C~A~N~~G~~~ 68 (75)
T cd05892 42 TLLIKNVNKKDAGWYTVSAVNEAGVAT 68 (75)
T ss_pred EEEECCCChhhCEEEEEEEEcCcCeEE
Confidence 688889999999999999999887543
No 89
>cd04970 Ig6_Contactin_like Sixth Ig domain of contactin. Ig6_Contactin_like: Sixth Ig domain of contactins. Contactins are neural cell adhesion molecules and are comprised of six Ig domains followed by four fibronectin type III(FnIII) domains anchored to the membrane by glycosylphosphatidylinositol. The first four Ig domains form the intermolecular binding fragment, which arranges as a compact U-shaped module via contacts between Ig domains 1 and 4, and between Ig domains 2 and 3. Contactin-2 (TAG-1, axonin-1) may play a part in the neuronal processes of neurite outgrowth, axon guidance and fasciculation, and neuronal migration. This group also includes contactin-1 and contactin-5. The different contactins show different expression patterns in the central nervous system. During development and in adulthood, contactin-2 is transiently expressed in subsets of central and peripheral neurons. Contactin-5 is expressed specifically in the rat postnatal nervous system, peaking at about 3 week
Probab=53.10 E-value=23 Score=20.03 Aligned_cols=28 Identities=4% Similarity=-0.082 Sum_probs=22.3
Q ss_pred cceEEEeecCCCceEEEEEEEEcCCCCC
Q psy14533 12 EQWAVLQDLLPATLYRVRVLAENSLGAG 39 (209)
Q Consensus 12 ~~~~~i~~L~p~~~Y~~~v~a~~~~g~~ 39 (209)
...+.|.+++......|.|.|.|..|..
T Consensus 46 ~~~L~I~~v~~~D~G~Y~C~a~n~~g~~ 73 (85)
T cd04970 46 NGDLMIRNAQLKHAGKYTCTAQTVVDSL 73 (85)
T ss_pred cceEEEccCCHHhCeeeEEEEecCCCcE
Confidence 3468889998888889999998877643
No 90
>cd05867 Ig4_L1-CAM_like Fourth immunoglobulin (Ig)-like domain of the L1 cell adhesion molecule (CAM). Ig4_L1-CAM_like: fourth immunoglobulin (Ig)-like domain of the L1 cell adhesion molecule (CAM). L1 is comprised of an extracellular region having six Ig-like domains and five fibronectin type III domains, a transmembrane region and an intracellular domain. L1 is primarily expressed in the nervous system and is involved in its development and function. L1 is associated with an X-linked recessive disorder, X-linked hydrocephalus, MASA syndrome, or spastic paraplegia type 1, that involves abnormalities of axonal growth. This group also contains the chicken neuron-glia cell adhesion molecule, Ng-CAM.
Probab=52.45 E-value=27 Score=19.17 Aligned_cols=27 Identities=22% Similarity=0.167 Sum_probs=22.5
Q ss_pred ceEEEeecCCCceEEEEEEEEcCCCCC
Q psy14533 13 QWAVLQDLLPATLYRVRVLAENSLGAG 39 (209)
Q Consensus 13 ~~~~i~~L~p~~~Y~~~v~a~~~~g~~ 39 (209)
..+.|.++.+.....|.+.|.|..|..
T Consensus 41 ~~L~I~~~~~~D~G~Y~C~a~N~~G~~ 67 (76)
T cd05867 41 GALILTDVQPSDTAVYQCEARNRHGNL 67 (76)
T ss_pred CEEEECCCChhhCEEEEEEEECCCCeE
Confidence 568889999998999999999987753
No 91
>cd05868 Ig4_NrCAM Fourth immunoglobulin (Ig)-like domain of NrCAM (NgCAM-related cell adhesion molecule). Ig4_NrCAM: fourth immunoglobulin (Ig)-like domain of NrCAM (NgCAM-related cell adhesion molecule). NrCAM belongs to the L1 subfamily of cell adhesion molecules (CAMs) and is comprised of an extracellular region having six Ig-like domains and five fibronectin type III domains, a transmembrane region and an intracellular domain. NrCAM is primarily expressed in the nervous system.
Probab=50.46 E-value=19 Score=19.89 Aligned_cols=28 Identities=11% Similarity=-0.015 Sum_probs=22.3
Q ss_pred ceEEEeecCCCceEEEEEEEEcCCCCCC
Q psy14533 13 QWAVLQDLLPATLYRVRVLAENSLGAGR 40 (209)
Q Consensus 13 ~~~~i~~L~p~~~Y~~~v~a~~~~g~~~ 40 (209)
..+.|.++.+.....|++.|.|..|...
T Consensus 41 ~~l~i~~v~~~D~G~Y~C~A~N~~G~~~ 68 (76)
T cd05868 41 DTIIFSKVQERSSAVYQCNASNEYGYLL 68 (76)
T ss_pred CEEEECCCCHhhCEEEEEEEEcCCCEEE
Confidence 3577888888888889999999887543
No 92
>cd04978 Ig4_L1-NrCAM_like Fourth immunoglobulin (Ig)-like domain of L1, Ng-CAM (Neuron-glia CAM cell adhesion molecule), and NrCAM (Ng-CAM-related). Ig4_L1-NrCAM_like: fourth immunoglobulin (Ig)-like domain of L1, Ng-CAM (Neuron-glia CAM cell adhesion molecule), and NrCAM (Ng-CAM-related). These proteins belong to the L1 subfamily of cell adhesion molecules (CAMs) and are comprised of an extracellular region having six Ig-like domains and five fibronectin type III domains, a transmembrane region and an intracellular domain. These molecules are primarily expressed in the nervous system. L1 is associated with an X-linked recessive disorder, X-linked hydrocephalus, MASA syndrome, or spastic paraplegia type 1, that involves abnormalities of axonal growth.
Probab=50.40 E-value=21 Score=19.44 Aligned_cols=28 Identities=18% Similarity=0.073 Sum_probs=23.0
Q ss_pred cceEEEeecCCCceEEEEEEEEcCCCCC
Q psy14533 12 EQWAVLQDLLPATLYRVRVLAENSLGAG 39 (209)
Q Consensus 12 ~~~~~i~~L~p~~~Y~~~v~a~~~~g~~ 39 (209)
...+.|.++.......|.+.|.|..|..
T Consensus 40 ~~~L~i~~v~~~D~G~Y~C~A~N~~G~~ 67 (76)
T cd04978 40 GGTLILSNVQPNDTAVYQCNASNVHGYL 67 (76)
T ss_pred CCEEEECCCChhhCEEEEEEEEccCCeE
Confidence 3578889998888889999999987754
No 93
>cd05746 Ig4_Peroxidasin Fourth immunoglobulin (Ig)-like domain of peroxidasin. Ig4_Peroxidasin: the fourth immunoglobulin (Ig)-like domain in peroxidasin. Peroxidasin has a peroxidase domain and interacting extracellular motifs containing four Ig-like domains. It has been suggested that peroxidasin is secreted, and has functions related to the stabilization of the extracellular matrix. It may play a part in various other important processes such as removal and destruction of cells, which have undergone programmed cell death, and protection of the organism against non-self.
Probab=50.23 E-value=19 Score=19.40 Aligned_cols=29 Identities=24% Similarity=0.328 Sum_probs=23.7
Q ss_pred cceEEEeecCCCceEEEEEEEEcCCCCCC
Q psy14533 12 EQWAVLQDLLPATLYRVRVLAENSLGAGR 40 (209)
Q Consensus 12 ~~~~~i~~L~p~~~Y~~~v~a~~~~g~~~ 40 (209)
...+.|.++.+.....|.+.|.|..|...
T Consensus 35 ~~~L~I~~~~~~D~G~Y~C~a~N~~G~~~ 63 (69)
T cd05746 35 EGYLAIRDVGVADQGRYECVARNTIGYAS 63 (69)
T ss_pred CCEEEECcCChhhCEEEEEEEECCCCcEE
Confidence 34688889989888999999999887643
No 94
>cd04968 Ig3_Contactin_like Third Ig domain of contactin. Ig3_Contactin_like: Third Ig domain of contactins. Contactins are neural cell adhesion molecules and are comprised of six Ig domains followed by four fibronectin type III(FnIII) domains anchored to the membrane by glycosylphosphatidylinositol. The first four Ig domains form the intermolecular binding fragment, which arranges as a compact U-shaped module via contacts between Ig domains 1 and 4, and between Ig domains 2 and 3. Contactin-2 (TAG-1, axonin-1) may play a part in the neuronal processes of neurite outgrowth, axon guidance and fasciculation, and neuronal migration. This group also includes contactin-1 and contactin-5. The different contactins show different expression patterns in the central nervous system. During development and in adulthood, contactin-2 is transiently expressed in subsets of central and peripheral neurons. Contactin-5 is expressed specifically in the rat postnatal nervous system, peaking at about 3 week
Probab=50.21 E-value=24 Score=20.05 Aligned_cols=29 Identities=14% Similarity=-0.029 Sum_probs=23.8
Q ss_pred cceEEEeecCCCceEEEEEEEEcCCCCCC
Q psy14533 12 EQWAVLQDLLPATLYRVRVLAENSLGAGR 40 (209)
Q Consensus 12 ~~~~~i~~L~p~~~Y~~~v~a~~~~g~~~ 40 (209)
...+.|.++.......|.|.|.|..|...
T Consensus 52 ~~~L~i~~v~~~D~G~Y~C~a~N~~G~~~ 80 (88)
T cd04968 52 GAVLKIPNIQFEDEGTYECEAENIKGKDT 80 (88)
T ss_pred CCEEEECCCCcccCEEEEEEEEECCCcEE
Confidence 35678899998888999999999887643
No 95
>cd05731 Ig3_L1-CAM_like Third immunoglobulin (Ig)-like domain of the L1 cell adhesion molecule (CAM). Ig3_L1-CAM_like: domain similar to the third immunoglobulin (Ig)-like domain of the L1 cell adhesion molecule (CAM). L1 belongs to the L1 subfamily of cell adhesion molecules (CAMs) and is comprised of an extracellular region having six Ig-like domains and five fibronectin type III domains, a transmembrane region and an intracellular domain. L1 is primarily expressed in the nervous system and is involved in its development and function. L1 is associated with an X-linked recessive disorder, X-linked hydrocephalus, MASA syndrome, or spastic paraplegia type 1, that involves abnormalities of axonal growth. This group also contains the chicken neuron-glia cell adhesion molecule, Ng-CAM and human neurofascin.
Probab=50.00 E-value=16 Score=19.56 Aligned_cols=27 Identities=22% Similarity=0.196 Sum_probs=22.3
Q ss_pred ceEEEeecCCCceEEEEEEEEcCCCCC
Q psy14533 13 QWAVLQDLLPATLYRVRVLAENSLGAG 39 (209)
Q Consensus 13 ~~~~i~~L~p~~~Y~~~v~a~~~~g~~ 39 (209)
..+.|.++.......|.+.|.|..|..
T Consensus 36 ~~L~i~~v~~~D~G~Y~C~a~N~~G~~ 62 (71)
T cd05731 36 KTLKIDNVSEEDDGEYRCTASNSLGSA 62 (71)
T ss_pred CEEEECCCCHHHCEEEEEEEEeCCceE
Confidence 468888888888888999999988754
No 96
>cd05732 Ig5_NCAM-1_like Fifth immunoglobulin (Ig)-like domain of Neural Cell Adhesion Molecule NCAM-1 (NCAM) and similar proteins. Ig5_NCAM-1 like: domain similar to the fifth immunoglobulin (Ig)-like domain of Neural Cell Adhesion Molecule NCAM-1 (NCAM). NCAM plays important roles in the development and regeneration of the central nervous system, in synaptogenesis and neural migration. NCAM mediates cell-cell and cell-substratum recognition and adhesion via homophilic (NCAM-NCAM), and heterophilic (NCAM-non-NCAM), interactions. NCAM is expressed as three major isoforms having different intracellular extensions. The extracellular portion of NCAM has five N-terminal Ig-like domains and two fibronectin type III domains. The double zipper adhesion complex model for NCAM homophilic binding involves Ig1, Ig2, and Ig3. By this model, Ig1 and Ig2 mediate dimerization of NCAM molecules situated on the same cell surface (cis interactions), and Ig3 domains mediate interactions between NCAM mole
Probab=49.80 E-value=31 Score=19.89 Aligned_cols=28 Identities=21% Similarity=0.209 Sum_probs=22.1
Q ss_pred ceEEEeecCCCceEEEEEEEEcCCCCCC
Q psy14533 13 QWAVLQDLLPATLYRVRVLAENSLGAGR 40 (209)
Q Consensus 13 ~~~~i~~L~p~~~Y~~~v~a~~~~g~~~ 40 (209)
..+.|.++.......|.|.|.|..|...
T Consensus 62 ~~L~I~~v~~~D~G~Y~C~a~N~~G~~~ 89 (96)
T cd05732 62 SSLTLKDVQLTDAGRYDCEASNRIGGDQ 89 (96)
T ss_pred EEEEECcCCcCcCEEeEEEEEeCCCCcE
Confidence 3678888888888888999998887643
No 97
>PF15417 DUF4624: Domain of unknown function (DUF4624)
Probab=49.24 E-value=20 Score=21.78 Aligned_cols=30 Identities=23% Similarity=0.309 Sum_probs=23.0
Q ss_pred eEEecCCCcceEEEeecCCCceEEEEEEEE
Q psy14533 4 WKSQNSGTEQWAVLQDLLPATLYRVRVLAE 33 (209)
Q Consensus 4 w~~~~~~~~~~~~i~~L~p~~~Y~~~v~a~ 33 (209)
|...+.++.-+..+.+|+.+..|.++.+..
T Consensus 80 ~~~~V~~dt~tisL~nlqk~kEY~V~ftGt 109 (132)
T PF15417_consen 80 WNGKVSGDTFTISLNNLQKEKEYVVCFTGT 109 (132)
T ss_pred cccccccceEEEEhhhcccCceEEEEEecc
Confidence 444566677788999999999998876643
No 98
>PF08329 ChitinaseA_N: Chitinase A, N-terminal domain; InterPro: IPR013540 This domain is found in a number of bacterial chitinases and similar viral proteins. It is organised into a fibronectin III module domain-like fold, comprising only beta strands. Its function is not known, but it may be involved in interaction with the enzyme substrate, chitin. It is separated by a hinge region from the catalytic domain (IPR001223 from INTERPRO); this hinge region is probably mobile, allowing the N-terminal domain to have different relative positions in solution.; GO: 0004568 chitinase activity; PDB: 2WLY_A 1EDQ_A 2WM0_A 1X6N_A 1NH6_A 2WK2_A 1EHN_A 2WLZ_A 1EIB_A 1FFR_A ....
Probab=49.10 E-value=19 Score=22.92 Aligned_cols=33 Identities=24% Similarity=0.255 Sum_probs=20.4
Q ss_pred eEEEeecCCCceEEEEEEEEcCCCCCCCCCCeeEe
Q psy14533 14 WAVLQDLLPATLYRVRVLAENSLGAGRPSDPLLVH 48 (209)
Q Consensus 14 ~~~i~~L~p~~~Y~~~v~a~~~~g~~~~s~~~~~~ 48 (209)
+..+. ...+..|.++|..+|..|++. |.++.+.
T Consensus 74 ~a~~~-~~~gG~y~~~VeLCN~~GCS~-S~~~~V~ 106 (133)
T PF08329_consen 74 SATFT-VTKGGRYQMQVELCNADGCST-SAPVEVV 106 (133)
T ss_dssp EEEEE-E-S-EEEEEEEEEEETTEEEE----EEEE
T ss_pred eEEEE-ecCCCEEEEEEEEECCCCccc-CCCEEEE
Confidence 34444 456778999999999999776 4454443
No 99
>cd05753 Ig2_FcgammaR_like Second immunoglobulin (Ig)-like domain of Fcgamma-receptors (FcgammaRs) and similar proteins. Ig2_FcgammaR_like: domain similar to the second immunoglobulin (Ig)-like domain of Fcgamma-receptors (FcgammaRs). Interactions between IgG and FcgammaR are important to the initiation of cellular and humoral response. IgG binding to FcgammaR leads to a cascade of signals and ultimately to functions such as antibody-dependent-cellular-cytotoxicity (ADCC), endocytosis, phagocytosis, release of inflammatory mediators, etc. FcgammaR has two Ig-like domains. This group also contains FcepsilonRI, which binds IgE with high affinity.
Probab=49.01 E-value=49 Score=18.72 Aligned_cols=35 Identities=6% Similarity=-0.127 Sum_probs=24.8
Q ss_pred cceEEEeecCCCceEEEEEEEEcCCCCCCCCCCeeE
Q psy14533 12 EQWAVLQDLLPATLYRVRVLAENSLGAGRPSDPLLV 47 (209)
Q Consensus 12 ~~~~~i~~L~p~~~Y~~~v~a~~~~g~~~~s~~~~~ 47 (209)
...+.|..+.+.....|.|.+.+..|... |..+.+
T Consensus 47 ~~~l~I~~~~~~dsG~Y~C~~~~~~~~~~-s~~~~i 81 (83)
T cd05753 47 NSNLSIPQATLSDSGSYHCSGIIGSYDYS-SEPVSI 81 (83)
T ss_pred CceEEECccCHHHCEEEEEEEEeCCceec-CCCEEE
Confidence 35688888888888888888888877544 444433
No 100
>cd05744 Ig_Myotilin_C_like Immunoglobulin (Ig)-like domain of myotilin, palladin, and myopalladin. Ig_Myotilin_like_C: immunoglobulin (Ig)-like domain in myotilin, palladin, and myopalladin. Myotilin, palladin, and myopalladin function as scaffolds that regulate actin organization. Myotilin and myopalladin are most abundant in skeletal and cardiac muscle; palladin is ubiquitously expressed in the organs of developing vertebrates and plays a key role in cellular morphogenesis. The three family members each interact with specific molecular partners: all three bind to alpha-actinin; in addition, palladin also binds to vasodilator-stimulated phosphoprotein (VASP) and ezrin, myotilin binds to filamin and actin, and myopalladin also binds to nebulin and cardiac ankyrin repeat protein (CARP).
Probab=48.92 E-value=45 Score=18.32 Aligned_cols=26 Identities=19% Similarity=0.033 Sum_probs=21.9
Q ss_pred eEEEeecCCCceEEEEEEEEcCCCCC
Q psy14533 14 WAVLQDLLPATLYRVRVLAENSLGAG 39 (209)
Q Consensus 14 ~~~i~~L~p~~~Y~~~v~a~~~~g~~ 39 (209)
++.|.++.+.....|.+.|.|..|..
T Consensus 42 ~L~I~~~~~~D~G~Y~C~a~N~~G~~ 67 (75)
T cd05744 42 CLLIQNANKEDAGWYTVSAVNEAGVV 67 (75)
T ss_pred EEEECCCCcccCEEEEEEEEcCCCcE
Confidence 57888888888899999999988754
No 101
>cd05869 Ig5_NCAM-1 Fifth immunoglobulin (Ig)-like domain of Neural Cell Adhesion Molecule NCAM-1 (NCAM). Ig5_NCAM-1: The fifth immunoglobulin (Ig)-like domain of Neural Cell Adhesion Molecule NCAM-1 (NCAM). NCAM plays important roles in the development and regeneration of the central nervous system, in synaptogenesis and neural migration. NCAM mediates cell-cell and cell-substratum recognition and adhesion via homophilic (NCAM-NCAM) and heterophilic (NCAM-non-NCAM) interactions. NCAM is expressed as three major isoforms having different intracellular extensions. The extracellular portion of NCAM has five N-terminal Ig-like domains and two fibronectin type III domains. The double zipper adhesion complex model for NCAM homophilic binding involves Ig1, Ig2, and Ig3. By this model, Ig1 and Ig2 mediate dimerization of NCAM molecules situated on the same cell surface (cis interactions), and Ig3 domains mediate interactions between NCAM molecules expressed on the surface of opposing cells (tr
Probab=48.67 E-value=36 Score=19.81 Aligned_cols=27 Identities=15% Similarity=0.058 Sum_probs=21.8
Q ss_pred eEEEeecCCCceEEEEEEEEcCCCCCC
Q psy14533 14 WAVLQDLLPATLYRVRVLAENSLGAGR 40 (209)
Q Consensus 14 ~~~i~~L~p~~~Y~~~v~a~~~~g~~~ 40 (209)
.+.|.++.......|.|.|.|..|...
T Consensus 64 ~L~I~~v~~~D~G~Y~C~A~N~~G~~~ 90 (97)
T cd05869 64 SLTLKYIQYTDAGEYLCTASNTIGQDS 90 (97)
T ss_pred EEEEecCccCcCEEEEEEEecCCCCee
Confidence 577888888888889999999888543
No 102
>cd05855 Ig_TrkB_d5 Fifth domain (immunoglobulin-like) of Trk receptor TrkB. TrkB_d5: the fifth domain of Trk receptor TrkB, this is an immunoglobulin (Ig)-like domain which binds to neurotrophin. The Trk family of receptors are tyrosine kinase receptors, which mediate the trophic effects of the neurotrophin Nerve growth factor (NGF) family. The Trks are activated by dimerization, leading to autophosphorylation of intracellular tyrosine residues, and triggering the signal transduction pathway. TrkB shares significant sequence homology and domain organization with TrkA and TrkC. The first three domains are leucine-rich domains. The fourth and fifth domains are Ig-like domains playing a part in ligand binding. TrkB is recognized by brain-derived neurotrophic factor (BDNF) and neurotrophin (NT)-4. In some cell systems NT-3 can activate TrkA and TrkB receptors. TrkB transcripts are found throughout multiple structures of the central and peripheral nervous systems.
Probab=47.88 E-value=38 Score=19.14 Aligned_cols=27 Identities=15% Similarity=0.071 Sum_probs=22.1
Q ss_pred eEEEeecCCCceEEEEEEEEcCCCCCC
Q psy14533 14 WAVLQDLLPATLYRVRVLAENSLGAGR 40 (209)
Q Consensus 14 ~~~i~~L~p~~~Y~~~v~a~~~~g~~~ 40 (209)
.+.|.+..+.....|.|.|.|..|...
T Consensus 45 ~L~i~~~~~~D~G~YtC~A~N~~G~~~ 71 (79)
T cd05855 45 CLQLDNPTHLNNGIYTLVAKNEYGEDE 71 (79)
T ss_pred EEEECCCCcccCEEEEEEEEcCCcccc
Confidence 567788888888899999999988643
No 103
>PF04775 Bile_Hydr_Trans: Acyl-CoA thioester hydrolase/BAAT N-terminal region; InterPro: IPR006862 This entry presents the N-termini of acyl-CoA thioester hydrolase and bile acid-CoA:amino acid N-acetyltransferase (BAAT). This region is not thought to contain the active site of either enzyme. Thioesterase isoforms have been identified in peroxisomes, cytoplasm and mitochondria, where they are thought to have distinct functions in lipid metabolism. For example, in peroxisomes, the hydrolase acts on bile-CoA esters.; GO: 0016290 palmitoyl-CoA hydrolase activity, 0006629 lipid metabolic process; PDB: 3HLK_B 3K2I_B.
Probab=46.74 E-value=51 Score=20.69 Aligned_cols=25 Identities=12% Similarity=0.190 Sum_probs=16.9
Q ss_pred ceEEecCCCCCcEEEEEEEEEcCCC
Q psy14533 118 GVATLTGLRKYRKYDIVVQAFNEKG 142 (209)
Q Consensus 118 ~~~~~~~L~p~~~Y~~~v~a~~~~g 142 (209)
..+.+.||.|+..++++.......|
T Consensus 5 ~~I~v~GL~p~~~vtl~a~~~~~~g 29 (126)
T PF04775_consen 5 VDIRVSGLPPGQEVTLRARLTDDNG 29 (126)
T ss_dssp -EEEEES--TT-EEEEEEEEE-TTS
T ss_pred eEEEEeCCCCCCEEEEEEEEEeCCC
Confidence 5678899999999999988887655
No 104
>cd04976 Ig2_VEGFR Second immunoglobulin (Ig)-like domain of vascular endothelial growth factor receptor (VEGFR). Ig2_VEGFR: Second immunoglobulin (Ig)-like domain of vascular endothelial growth factor receptor (VEGFR). The VEGFRs have an extracellular component with seven Ig-like domains, a transmembrane segment, and an intracellular tyrosine kinase domain interrupted by a kinase-insert domain. The VEGFR family consists of three members, VEGFR-1 (Flt-1), VEGFR-2 (KDR/Flk-1) and VEGFR-3 (Flt-4). VEGFRs bind VEGFs with high affinity at the Ig-like domains. VEGF-A is important to the growth and maintenance of vascular endothelial cells and to the development of new blood- and lymphatic-vessels in physiological and pathological states. VEGFR-2 is a major mediator of the mitogenic, angiogenic and microvascular permeability-enhancing effects of VEGF-A. VEGFR-1 may play an inhibitory part in these processes by binding VEGF and interfering with its interaction with VEGFR-2. VEGFR-1 has a signa
Probab=46.28 E-value=20 Score=19.48 Aligned_cols=28 Identities=11% Similarity=0.010 Sum_probs=23.4
Q ss_pred cceEEEeecCCCceEEEEEEEEcCCCCC
Q psy14533 12 EQWAVLQDLLPATLYRVRVLAENSLGAG 39 (209)
Q Consensus 12 ~~~~~i~~L~p~~~Y~~~v~a~~~~g~~ 39 (209)
...+.|.++.+.....|.|.|.|..|..
T Consensus 34 ~~~L~I~~v~~~D~G~YtC~a~N~~g~~ 61 (71)
T cd04976 34 GHSLTIKDVTEEDAGNYTVVLTNKQAKL 61 (71)
T ss_pred CCEEEECcCCHHHCEEEEEEEEcCCccE
Confidence 3578899999999999999999987653
No 105
>PF13754 Big_3_4: Bacterial Ig-like domain (group 3)
Probab=45.79 E-value=43 Score=17.24 Aligned_cols=28 Identities=21% Similarity=0.259 Sum_probs=21.3
Q ss_pred eEEEeecCCCceEEEEEEEEcCCCCCCCC
Q psy14533 14 WAVLQDLLPATLYRVRVLAENSLGAGRPS 42 (209)
Q Consensus 14 ~~~i~~L~p~~~Y~~~v~a~~~~g~~~~s 42 (209)
++.+..+ ....|.+.+.+....|.....
T Consensus 15 s~t~~~~-~dG~y~itv~a~D~AGN~s~~ 42 (54)
T PF13754_consen 15 SFTVPAL-ADGTYTITVTATDAAGNTSTS 42 (54)
T ss_pred EEeCCCC-CCccEEEEEEEEeCCCCCCCc
Confidence 4666666 677899999999999876533
No 106
>cd05747 Ig5_Titin_like M5, fifth immunoglobulin (Ig)-like domain of human titin C terminus and similar proteins. Ig5_Titin_like: domain similar to the M5, fifth immunoglobulin (Ig)-like domain from the human titin C terminus. Titin (also called connectin) is a fibrous sarcomeric protein specifically found in vertebrate striated muscle. Titin is gigantic; depending on isoform composition it ranges from 2970 to 3700 kDa, and is of a length that spans half a sarcomere. Titin largely consists of multiple repeats of Ig-like and fibronectin type 3 (FN-III)-like domains. Titin connects the ends of myosin thick filaments to Z disks and extends along the thick filament to the H zone, and appears to function similar to an elastic band, keeping the myosin filaments centered in the sarcomere during muscle contraction or stretching.
Probab=45.74 E-value=58 Score=18.67 Aligned_cols=27 Identities=19% Similarity=0.057 Sum_probs=21.5
Q ss_pred ceEEEeecCCCceEEEEEEEEcCCCCC
Q psy14533 13 QWAVLQDLLPATLYRVRVLAENSLGAG 39 (209)
Q Consensus 13 ~~~~i~~L~p~~~Y~~~v~a~~~~g~~ 39 (209)
..+.|.++.......|.|.|.|..|..
T Consensus 59 ~~L~i~~~~~~D~G~Y~C~a~N~~G~~ 85 (92)
T cd05747 59 STFEISKVQMSDEGNYTVVVENSEGKQ 85 (92)
T ss_pred eEEEECCCCcccCEeEEEEEEcCCCCE
Confidence 467888888888888888888887753
No 107
>cd05733 Ig6_L1-CAM_like Sixth immunoglobulin (Ig)-like domain of the L1 cell adhesion molecule (CAM) and similar proteins. Ig6_L1-CAM_like: domain similar to the sixth immunoglobulin (Ig)-like domain of the L1 cell adhesion molecule (CAM). L1 belongs to the L1 subfamily of cell adhesion molecules (CAMs) and is comprised of an extracellular region having six Ig-like domains and five fibronectin type III domains, a transmembrane region and an intracellular domain. L1 is primarily expressed in the nervous system and is involved in its development and function. L1 is associated with an X-linked recessive disorder, X-linked hydrocephalus, MASA syndrome, or spastic paraplegia type 1, that involves abnormalities of axonal growth. This group also contains NrCAM [Ng(neuronglia)CAM-related cell adhesion molecule], which is primarily expressed in the nervous system, and human neurofascin.
Probab=44.48 E-value=55 Score=18.04 Aligned_cols=28 Identities=18% Similarity=0.115 Sum_probs=19.1
Q ss_pred ceEEEeecCC----CceEEEEEEEEcCCCCCC
Q psy14533 13 QWAVLQDLLP----ATLYRVRVLAENSLGAGR 40 (209)
Q Consensus 13 ~~~~i~~L~p----~~~Y~~~v~a~~~~g~~~ 40 (209)
..+.|..+.+ .....|++.|.|..|...
T Consensus 39 g~L~i~~~~~~~~~~d~G~Y~C~A~N~~G~~~ 70 (77)
T cd05733 39 GTLVIDNMNGGRAEDYEGEYQCYASNELGTAI 70 (77)
T ss_pred CEEEEeccCCCCCcCCCEEEEEEEEcCCCcEE
Confidence 4566666654 345778899999888643
No 108
>cd04975 Ig4_SCFR_like Fourth immunoglobulin (Ig)-like domain of stem cell factor receptor (SCFR) and similar proteins. Ig4_SCFR_like; fourth immunoglobulin (Ig)-like domain of stem cell factor receptor (SCFR). In addition to SCFR this group also includes the fourth Ig domain of platelet-derived growth factor receptors (PDGFR), alpha and beta, the fourth Ig domain of macrophage colony stimulating factor (M-CSF), and the Ig domain of the receptor tyrosine kinase KIT. SCFR and the PDGFR alpha and beta have similar organization: an extracellular component having five Ig-like domains, a transmembrane segment, and a cytoplasmic portion having protein tyrosine kinase activity. SCFR and its ligand SCF are critical for normal hematopoiesis, mast cell development, melanocytes and gametogenesis. SCF binds to the second and third Ig-like domains of SCFR, this fourth Ig-like domain participates in SCFR dimerization, which follows ligand binding. Deletion of this fourth SCFR_Ig-like domain abolishes
Probab=44.32 E-value=25 Score=20.98 Aligned_cols=28 Identities=21% Similarity=0.001 Sum_probs=22.9
Q ss_pred ceEEEeecCCCceEEEEEEEEcCCCCCC
Q psy14533 13 QWAVLQDLLPATLYRVRVLAENSLGAGR 40 (209)
Q Consensus 13 ~~~~i~~L~p~~~Y~~~v~a~~~~g~~~ 40 (209)
..+.|..+++.....|.|.|.|..|...
T Consensus 65 ~~L~i~~v~~~D~G~Ytc~A~N~~G~~~ 92 (101)
T cd04975 65 SELKLVRLKESEAGTYTFLASNSDASKS 92 (101)
T ss_pred EEEEEeecCHhhCeeEEEEEECCCccEE
Confidence 4578888899888999999999887543
No 109
>cd05737 Ig_Myomesin_like_C C-terminal immunoglobulin (Ig)-like domain of myomesin and M-protein. Ig_Myomesin_like_C: domain similar to the C-terminal immunoglobulin (Ig)-like domain of myomesin and M-protein. Myomesin and M-protein are both structural proteins localized to the M-band, a transverse structure in the center of the sarcomere, and are candidates for M-band bridges. Both proteins are modular, consisting mainly of repetitive Ig-like and fibronectin type III (FnIII) domains. Myomesin is expressed in all types of vertebrate striated muscle; M-protein has a muscle-type specific expression pattern. Myomesin is present in both slow and fast fibers; M-protein is present only in fast fibers. It has been suggested that myomesin acts as a molecular spring with alternative splicing as a means of modifying its elasticity.
Probab=44.00 E-value=30 Score=19.95 Aligned_cols=27 Identities=7% Similarity=0.134 Sum_probs=21.6
Q ss_pred ceEEEeecCCCceEEEEEEEEcCCCCC
Q psy14533 13 QWAVLQDLLPATLYRVRVLAENSLGAG 39 (209)
Q Consensus 13 ~~~~i~~L~p~~~Y~~~v~a~~~~g~~ 39 (209)
.++.|.++.+.....|.|.|.|..|..
T Consensus 58 ~~L~I~~v~~~D~G~Y~C~a~N~~G~~ 84 (92)
T cd05737 58 ASLTIKGVSSEDSGKYGIVVKNKYGGE 84 (92)
T ss_pred EEEEEccCChhhCEEEEEEEEECCCcc
Confidence 367888888888888888888887754
No 110
>cd05750 Ig_Pro_neuregulin Immunoglobulin (Ig)-like domain in neuregulins (NRGs). Ig_Pro_neuregulin: immunoglobulin (Ig)-like domain in neuregulins (NRGs). NRGs are signaling molecules, which participate in cell-cell interactions in the nervous system, breast, heart, and other organ systems, and are implicated in the pathology of diseases including schizophrenia, multiple sclerosis, and breast cancer. There are four members of the neuregulin gene family (NRG1, -2, -3, and -4). The NRG-1 protein binds to and activates the tyrosine kinase receptors ErbB3 and ErbB4, initiating signaling cascades. The other NRG proteins bind one or the other or both of these ErbBs. NRG-1 has multiple functions; for example, in the brain it regulates various processes such as radial glia formation and neuronal migration, dendritic development, and expression of neurotransmitter receptors; in the peripheral nervous system NRG-1 regulates processes such as target cell differentiation, and Schwann cell surv
Probab=42.76 E-value=55 Score=17.59 Aligned_cols=27 Identities=19% Similarity=0.063 Sum_probs=21.4
Q ss_pred ceEEEeecCCCceEEEEEEEEcCCCCC
Q psy14533 13 QWAVLQDLLPATLYRVRVLAENSLGAG 39 (209)
Q Consensus 13 ~~~~i~~L~p~~~Y~~~v~a~~~~g~~ 39 (209)
..+.|.++.+.....|.+.|.|..|..
T Consensus 43 ~~L~I~~~~~~D~G~Y~C~a~N~~G~~ 69 (75)
T cd05750 43 SELQINKAKLADSGEYTCVVENILGND 69 (75)
T ss_pred EEEEEccCCcccCeEEEEEEEEcCCce
Confidence 457788888888888889998888754
No 111
>cd04977 Ig1_NCAM-1_like First immunoglobulin (Ig)-like domain of neural cell adhesion molecule NCAM-1 and similar proteins. Ig1_NCAM-1 like: first immunoglobulin (Ig)-like domain of neural cell adhesion molecule NCAM-1. NCAM-1 plays important roles in the development and regeneration of the central nervous system, in synaptogenesis and neural migration. NCAM mediates cell-cell and cell-substratum recognition and adhesion via homophilic (NCAM-NCAM), and heterophilic (NCAM-nonNCAM), interactions. NCAM is expressed as three major isoforms having different intracellular extensions. The extracellular portion of NCAM has five N-terminal Ig-like domains and two fibronectin type III domains. The double zipper adhesion complex model for NCAM homophilic binding involves the Ig1, Ig2, and Ig3 domains. By this model, Ig1 and Ig2 mediate dimerization of NCAM molecules situated on the same cell surface (cis interactions), and Ig3 domains mediate interactions between NCAM molecules expressed on the s
Probab=42.31 E-value=68 Score=18.49 Aligned_cols=26 Identities=8% Similarity=-0.017 Sum_probs=22.2
Q ss_pred ceEEEeecCCCceEEEEEEEEcCCCC
Q psy14533 13 QWAVLQDLLPATLYRVRVLAENSLGA 38 (209)
Q Consensus 13 ~~~~i~~L~p~~~Y~~~v~a~~~~g~ 38 (209)
..+.|.++.+.....|.|.|.|..|.
T Consensus 57 ~~L~I~~v~~~D~G~Y~C~A~N~~g~ 82 (92)
T cd04977 57 STLTIYNANIEDAGIYKCVATDAKGT 82 (92)
T ss_pred EEEEEecCCcccCEEEEEEEEcCCCC
Confidence 46889999999999999999998654
No 112
>PF00907 T-box: T-box; InterPro: IPR001699 Transcription factors of the T-box family are required both for early cell-fate decisions, such as those necessary for formation of the basic vertebrate body plan, and for differentiation and organogenesis. The T-box is defined as the minimal region within the T-box protein that is both necessary and sufficient for sequence-specific DNA binding; all members of the family so far examined bind to the DNA consensus sequence TCACACCT. The T-box is a relatively large DNA-binding domain, generally comprising about a third of the entire protein (17-26 kDa). These genes were uncovered on the basis of similarity to the DNA binding domain of Mus musculus (Mouse) Brachyury (T) gene product, which similarity is the defining feature of the family. The Brachyury gene is named for its phenotype, which was identified 70 years ago as a mutant mouse strain with a short blunted tail. The gene, and its paralogues, have become a well-studied model for the family, and hence much of what is known about the T-box family is derived from the murine Brachyury gene. Consistent with its nuclear location, Brachyury protein has a sequence-specific DNA-binding activity and can act as a transcriptional regulator. Homozygous mutants for the gene undergo extensive developmental anomalies, thus rendering the mutation lethal. The postulated role of Brachyury is as a transcription factor, regulating the specification and differentiation of posterior mesoderm during gastrulation in a dose-dependent manner. T-box proteins tend to be expressed in specific organs or cell types, especially during development, and they are generally required for the development of those tissues; for example, Brachyury is expressed in posterior mesoderm and in the developing notochord, and it is required for the formation of these cells in mice.
; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus; PDB: 1H6F_B 4A04_A 1XBR_B 2X6V_B 2X6U_A.
Probab=42.20 E-value=44 Score=22.52 Aligned_cols=25 Identities=20% Similarity=0.171 Sum_probs=19.4
Q ss_pred cceEEEeecCCCceEEEEEEEEcCC
Q psy14533 12 EQWAVLQDLLPATLYRVRVLAENSL 36 (209)
Q Consensus 12 ~~~~~i~~L~p~~~Y~~~v~a~~~~ 36 (209)
...|.|+||.|...|.+.+......
T Consensus 32 ~l~y~vsGL~p~~~Y~i~l~~~~~d 56 (184)
T PF00907_consen 32 TLEYSVSGLDPDSLYSISLHFERVD 56 (184)
T ss_dssp -EEEEEESS-TTSEEEEEEEEEESC
T ss_pred ccEEEecCCCCCcceEEEEEEEEec
Confidence 4589999999999999998876543
No 113
>cd05723 Ig4_Neogenin Fourth immunoglobulin (Ig)-like domain in neogenin and similar proteins. Ig4_Neogenin: fourth immunoglobulin (Ig)-like domain in neogenin and related proteins. Neogenin is a cell surface protein which is expressed in the developing nervous system of vertebrate embryos in the growing nerve cells. It is also expressed in other embryonic tissues, and may play a general role in developmental processes such as cell migration, cell-cell recognition, and tissue growth regulation. Included in this group is the tumor suppressor protein DCC, which is deleted in colorectal carcinoma. DCC and neogenin each have four Ig-like domains followed by six fibronectin type III domains, a transmembrane domain, and an intracellular domain.
Probab=42.04 E-value=26 Score=19.00 Aligned_cols=26 Identities=19% Similarity=0.112 Sum_probs=19.8
Q ss_pred eEEEeecCCCceEEEEEEEEcCCCCC
Q psy14533 14 WAVLQDLLPATLYRVRVLAENSLGAG 39 (209)
Q Consensus 14 ~~~i~~L~p~~~Y~~~v~a~~~~g~~ 39 (209)
.+.|.++.+.....|.+.|.|..|..
T Consensus 38 ~l~i~~v~~~D~G~Y~C~A~N~~G~~ 63 (71)
T cd05723 38 NLQVLGLVKSDEGFYQCIAENDVGNV 63 (71)
T ss_pred CEEEEcCCcccCEEEEEEEEcCCCEE
Confidence 46666788888888888888887754
No 114
>cd05859 Ig4_PDGFR-alpha Fourth immunoglobulin (Ig)-like domain of platelet-derived growth factor receptor (PDGFR) alpha. Ig4_PDGFR-alpha: The fourth immunoglobulin (Ig)-like domain of platelet-derived growth factor receptor (PDGFR) alpha. PDGF is a potent mitogen for connective tissue cells. PDGF-stimulated processes are mediated by three different PDGFs (PDGF-A, -B, and -C). PDGFR alpha binds to all three PDGFs, whereas the PDGFR beta (not included in this group) binds only to PDGF-B. PDGFR alpha is organized as an extracellular component having five Ig-like domains, a transmembrane segment, and a cytoplasmic portion having protein tyrosine kinase activity. In mice, PDGFR alpha and PDGFR beta are essential for normal development.
Probab=42.00 E-value=26 Score=20.84 Aligned_cols=28 Identities=14% Similarity=-0.120 Sum_probs=23.0
Q ss_pred ceEEEeecCCCceEEEEEEEEcCCCCCC
Q psy14533 13 QWAVLQDLLPATLYRVRVLAENSLGAGR 40 (209)
Q Consensus 13 ~~~~i~~L~p~~~Y~~~v~a~~~~g~~~ 40 (209)
..+.|..+.+.....|.|.|.|..|...
T Consensus 65 s~L~I~~v~~~D~G~Ytc~A~N~~g~~~ 92 (101)
T cd05859 65 SKLKLIRAKEEDSGLYTALAQNEDAVKS 92 (101)
T ss_pred cEEEEeeCCHHHCEEEEEEEEcCCceEE
Confidence 4688889999998999999999877543
No 115
>cd05895 Ig_Pro_neuregulin-1 Immunoglobulin (Ig)-like domain found in neuregulin (NRG)-1. Ig_Pro_neuregulin-1: immunoglobulin (Ig)-like domain found in neuregulin (NRG)-1. There are many NRG-1 isoforms which arise from the alternative splicing of mRNA. NRG-1 belongs to the neuregulin gene family, which is comprised of four genes. This group represents NRG-1. NRGs are signaling molecules, which participate in cell-cell interactions in the nervous system, breast, heart, and other organ systems, and are implicated in the pathology of diseases including schizophrenia, multiple sclerosis, and breast cancer. The NRG-1 protein binds to and activates the tyrosine kinase receptors ErbB3 and ErbB4, initiating signaling cascades. NRG-1 has multiple functions; for example, in the brain it regulates various processes such as radial glia formation and neuronal migration, dendritic development, and expression of neurotransmitter receptors; in the peripheral nervous system NRG-1 regulates process
Probab=41.95 E-value=59 Score=17.70 Aligned_cols=27 Identities=11% Similarity=0.040 Sum_probs=20.6
Q ss_pred ceEEEeecCCCceEEEEEEEEcCCCCC
Q psy14533 13 QWAVLQDLLPATLYRVRVLAENSLGAG 39 (209)
Q Consensus 13 ~~~~i~~L~p~~~Y~~~v~a~~~~g~~ 39 (209)
.++.|.++.......|.|.+.|..|..
T Consensus 44 ~~L~I~~~~~~DsG~Y~C~a~N~~g~~ 70 (76)
T cd05895 44 SELQISKASLADNGEYKCMVSSKLGND 70 (76)
T ss_pred EEEEECcCCcccCEEEEEEEEeCCCce
Confidence 457788888888888888888877653
No 116
>cd05865 Ig1_NCAM-1 First immunoglobulin (Ig)-like domain of neural cell adhesion molecule NCAM-1. Ig1_NCAM-1: first immunoglobulin (Ig)-like domain of neural cell adhesion molecule NCAM-1. NCAM-1 plays important roles in the development and regeneration of the central nervous system, in synaptogenesis and neural migration. NCAM mediates cell-cell and cell-substratum recognition and adhesion via homophilic (NCAM-NCAM), and heterophilic (NCAM-nonNCAM), interactions. NCAM is expressed as three major isoforms having different intracellular extensions. The extracellular portion of NCAM has five N-terminal Ig-like domains and two fibronectin type III domains. The double zipper adhesion complex model for NCAM homophilic binding involves the Ig1, Ig2, and Ig3 domains. By this model, Ig1 and Ig2 mediate dimerization of NCAM molecules situated on the same cell surface (cis interactions), and Ig3 domains mediate interactions between NCAM molecules expressed on the surface of opposing cells (trans
Probab=41.53 E-value=33 Score=20.15 Aligned_cols=24 Identities=4% Similarity=-0.166 Sum_probs=20.6
Q ss_pred ceEEEeecCCCceEEEEEEEEcCC
Q psy14533 13 QWAVLQDLLPATLYRVRVLAENSL 36 (209)
Q Consensus 13 ~~~~i~~L~p~~~Y~~~v~a~~~~ 36 (209)
..+.|.++++.....|+|.|.|..
T Consensus 60 ~~L~I~~v~~~D~G~YtC~A~N~~ 83 (96)
T cd05865 60 STLTIYNANIDDAGIYKCVVSNED 83 (96)
T ss_pred eEEEEeccChhhCEEEEEEEEcCC
Confidence 357888999999999999999986
No 117
>cd05857 Ig2_FGFR Second immunoglobulin (Ig)-like domain of fibroblast growth factor (FGF) receptor. Ig2_FGFR: second immunoglobulin (Ig)-like domain of fibroblast growth factor (FGF) receptor. FGF receptors bind FGF signaling polypeptides. FGFs participate in multiple processes such as morphogenesis, development, and angiogenesis. FGFs bind to four FGF receptor tyrosine kinases (FGFR1, -2, -3, -4). Receptor diversity is controlled by alternative splicing producing splice variants with different ligand binding characteristics and different expression patterns. FGFRs have an extracellular region comprised of three Ig-like domains, a single transmembrane helix, and an intracellular tyrosine kinase domain. Ligand binding and specificity reside in the Ig-like domains 2 and 3, and the linker region that connects these two. FGFR activation and signaling depend on FGF-induced dimerization, a process involving cell surface heparin or heparan sulfate proteoglycans.
Probab=40.74 E-value=58 Score=18.15 Aligned_cols=26 Identities=15% Similarity=0.277 Sum_probs=20.7
Q ss_pred eEEEeecCCCceEEEEEEEEcCCCCC
Q psy14533 14 WAVLQDLLPATLYRVRVLAENSLGAG 39 (209)
Q Consensus 14 ~~~i~~L~p~~~Y~~~v~a~~~~g~~ 39 (209)
.+.|.++.+.....|.|.+.|..|..
T Consensus 52 ~l~i~~~~~~D~G~Y~C~a~N~~G~~ 77 (85)
T cd05857 52 SLIMESVVPSDKGNYTCVVENEYGSI 77 (85)
T ss_pred EEEEccCCcccCEEEEEEEEeCCCEE
Confidence 46777888888888899999888753
No 118
>cd05891 Ig_M-protein_C C-terminal immunoglobulin (Ig)-like domain of M-protein (also known as myomesin-2). Ig_M-protein_C: the C-terminal immunoglobulin (Ig)-like domain of M-protein (also known as myomesin-2). M-protein is a structural protein localized to the M-band, a transverse structure in the center of the sarcomere, and is a candidate for M-band bridges. M-protein is modular, consisting mainly of repetitive Ig-like and fibronectin type III (FnIII) domains, and has a muscle-type specific expression pattern. M-protein is present in fast fibers.
Probab=40.25 E-value=48 Score=19.13 Aligned_cols=27 Identities=7% Similarity=0.095 Sum_probs=20.5
Q ss_pred eEEEeecCCCceEEEEEEEEcCCCCCC
Q psy14533 14 WAVLQDLLPATLYRVRVLAENSLGAGR 40 (209)
Q Consensus 14 ~~~i~~L~p~~~Y~~~v~a~~~~g~~~ 40 (209)
++.|.++.+.....|.|.|.|..|...
T Consensus 59 ~L~I~~~~~~D~G~Y~C~a~N~~G~~~ 85 (92)
T cd05891 59 SLTIKGVTSEDSGKYSINVKNKYGGET 85 (92)
T ss_pred EEEECCCChhhCEEEEEEEEeCCCcee
Confidence 577888888888888888888776543
No 119
>cd05886 Ig1_Nectin-1_like First immunoglobulin (Ig) domain of nectin-1 (also known as poliovirus receptor related protein 1, or as CD111) and similar proteins. Ig1_Nectin-1_like: domain similar to the first immunoglobulin (Ig) domain of nectin-1 (also known as poliovirus receptor related protein 1, or as CD111). Nectin-1 belongs to the nectin family comprised of four transmembrane glycoproteins (nectins-1 through -4). Nectins are synaptic cell adhesion molecules (CAMs) which facilitate adhesion and signaling at various intercellular junctions. Nectins form homophilic cis-dimers, followed by homophilic and heterophilic trans-dimers involved in cell-cell adhesion. In addition, nectins heterophilically trans-interact with other CAMs such as nectin-like molecules (Necls); nectin-1, for example, has been shown to trans-interact with Necl-1. Nectins also interact with various other proteins, including the actin filament (F-actin)-binding protein, afadin. Mutation in the human nectin-1 gene is
Probab=39.47 E-value=68 Score=19.06 Aligned_cols=14 Identities=29% Similarity=0.653 Sum_probs=7.1
Q ss_pred cceEEecCCCcceE
Q psy14533 2 VEWKSQNSGTEQWA 15 (209)
Q Consensus 2 ~~w~~~~~~~~~~~ 15 (209)
|+|+....+....+
T Consensus 21 V~W~k~~~~~~~~v 34 (99)
T cd05886 21 VTWQKLTNGSKQNV 34 (99)
T ss_pred EEEEECCCCCceEE
Confidence 67865543333333
No 120
>cd05724 Ig2_Robo Second immunoglobulin (Ig)-like domain in Robo (roundabout) receptors. Ig2_Robo: domain similar to the second immunoglobulin (Ig)-like domain in Robo (roundabout) receptors. Robo receptors play a role in the development of the central nervous system (CNS), and are receptors of Slit protein. Slit is a repellant secreted by the neural cells in the midline. Slit acts through Robo to prevent most neurons from crossing the midline from either side. Three mammalian Robo homologs (robo1, -2, and -3), and three mammalian Slit homologs (Slit-1,-2, -3), have been identified. Commissural axons, which cross the midline, express low levels of Robo; longitudinal axons, which avoid the midline, express high levels of Robo. robo1, -2, and -3 are expressed by commissural neurons in the vertebrate spinal cord and Slits 1, -2, -3 are expressed at the ventral midline. Robo-3 is a divergent member of the Robo family which instead of being a positive regulator of slit responsiveness, antago
Probab=39.44 E-value=70 Score=17.79 Aligned_cols=28 Identities=11% Similarity=0.089 Sum_probs=22.8
Q ss_pred ceEEEeecCCCceEEEEEEEEcCCCCCC
Q psy14533 13 QWAVLQDLLPATLYRVRVLAENSLGAGR 40 (209)
Q Consensus 13 ~~~~i~~L~p~~~Y~~~v~a~~~~g~~~ 40 (209)
..+.|.++.+.....|+|.|.|..|...
T Consensus 51 ~~L~I~~~~~~D~G~Y~C~a~N~~G~~~ 78 (86)
T cd05724 51 GNLLIAEARKSDEGTYKCVATNMVGERE 78 (86)
T ss_pred CEEEEeECCcccCEEEEEEEEeccCcee
Confidence 4688888988888899999998877543
No 121
>cd05856 Ig2_FGFRL1-like Second immunoglobulin (Ig)-like domain of fibroblast growth factor (FGF) receptor_like-1(FGFRL1). Ig2_FGFRL1-like: second immunoglobulin (Ig)-like domain of fibroblast growth factor (FGF) receptor_like-1(FGFRL1). FGFRL1 is comprised of a signal peptide, three extracellular Ig-like modules, a transmembrane segment, and a short intracellular domain. FGFRL1 is expressed preferentially in skeletal tissues. Similar to FGF receptors, the expressed protein interacts specifically with heparin and with FGF2. FGFRL1 does not have a protein tyrosine kinase domain at its C terminus; neither does its cytoplasmic domain appear to interact with a signaling partner. It has been suggested that FGFRL1 may not have any direct signaling function, but instead acts as a decoy receptor trapping FGFs and preventing them from binding other receptors.
Probab=39.22 E-value=49 Score=18.11 Aligned_cols=26 Identities=19% Similarity=0.150 Sum_probs=21.8
Q ss_pred ceEEEeecCCCceEEEEEEEEcCCCC
Q psy14533 13 QWAVLQDLLPATLYRVRVLAENSLGA 38 (209)
Q Consensus 13 ~~~~i~~L~p~~~Y~~~v~a~~~~g~ 38 (209)
..+.|.++.+.....|.+.+.|..|.
T Consensus 48 ~~L~i~~v~~~D~G~Y~C~a~N~~G~ 73 (82)
T cd05856 48 WTLSLKNLKPEDSGKYTCHVSNRAGE 73 (82)
T ss_pred EEEEEccCChhhCEEEEEEEEcCCcc
Confidence 46788899988888999999988774
No 122
>cd05864 Ig2_VEGFR-2 Second immunoglobulin (Ig)-like domain of vascular endothelial growth factor receptor 2 (VEGFR-2). Ig2_VEGFR-2: Second immunoglobulin (Ig)-like domain of vascular endothelial growth factor receptor 2 (VEGFR-2). The VEGFRs have an extracellular component with seven Ig-like domains, a transmembrane segment, and an intracellular tyrosine kinase domain interrupted by a kinase-insert domain. VEGFRs bind VEGFs with high affinity at the Ig-like domains. VEGFR-2 (KDR/Flk-1) is a major mediator of the mitogenic, angiogenic and microvascular permeability-enhancing effects of VEGF-A; VEGF-A is important to the growth and maintenance of vascular endothelial cells and to the development of new blood- and lymphatic-vessels in physiological and pathological states. VEGF-A also interacts with VEGFR-1, which it binds more strongly than VEGFR-2. VEGFR-2 and -1 may mediate a chemotactic and a survival signal in hematopoietic stem cells or leukemia cells.
Probab=39.00 E-value=37 Score=18.52 Aligned_cols=28 Identities=7% Similarity=0.035 Sum_probs=23.9
Q ss_pred ceEEEeecCCCceEEEEEEEEcCCCCCC
Q psy14533 13 QWAVLQDLLPATLYRVRVLAENSLGAGR 40 (209)
Q Consensus 13 ~~~~i~~L~p~~~Y~~~v~a~~~~g~~~ 40 (209)
..+.|.++.+.....|.|.|.|..|...
T Consensus 34 ~~L~I~~v~~~D~G~YtC~a~N~~G~~~ 61 (70)
T cd05864 34 VHLTIYEVTEKDAGNYTVVLTNPITKEE 61 (70)
T ss_pred CEEEECcCCHHHCEEEEEEEEECCCcee
Confidence 4688999999999999999999987554
No 123
>cd05758 Ig5_KIRREL3-like Fifth immunoglobulin (Ig)-like domain of Kirrel (kin of irregular chiasm-like) 3 (also known as Neph2) and similar proteins. Ig5_KIRREL3-like: domain similar to the fifth immunoglobulin (Ig)-like domain of Kirrel (kin of irregular chiasm-like) 3 (also known as Neph2). This protein has five Ig-like domains, one transmembrane domain, and a cytoplasmic tail. Included in this group is mammalian Kirrel (Neph1), Kirrel2 (Neph3), and Drosophila RST (irregular chiasm C-roughest) protein. These proteins contain multiple Ig domains, have properties of cell adhesion molecules, and are important in organ development.
Probab=38.99 E-value=57 Score=19.02 Aligned_cols=27 Identities=19% Similarity=0.158 Sum_probs=19.9
Q ss_pred eEEEeecCCCc-eEEEEEEEEcCCCCCC
Q psy14533 14 WAVLQDLLPAT-LYRVRVLAENSLGAGR 40 (209)
Q Consensus 14 ~~~i~~L~p~~-~Y~~~v~a~~~~g~~~ 40 (209)
.+.|.+++... ...|.|.|.|..|...
T Consensus 65 ~L~I~~v~~~d~~G~Y~C~A~N~~G~~~ 92 (98)
T cd05758 65 TLTISNTQESDFQTSYNCTAWNSFGSGT 92 (98)
T ss_pred EEEECCccccccceeEEEEEEcCCCccc
Confidence 57778888733 6778888888887643
No 124
>cd05863 Ig2_VEGFR-3 Second immunoglobulin (Ig)-like domain of vascular endothelial growth factor receptor 3 (VEGFR-3). Ig2_VEGFR-3: Second immunoglobulin (Ig)-like domain of vascular endothelial growth factor receptor 3 (VEGFR-3). The VEGFRs have an extracellular component with seven Ig-like domains, a transmembrane segment, and an intracellular tyrosine kinase domain interrupted by a kinase-insert domain. VEGFRs bind VEGFs with high affinity at the Ig-like domains. VEGFR-3 (Flt-4) binds two members of the VEGF family (VEGF-C and -D) and is involved in tumor angiogenesis and growth.
Probab=38.74 E-value=33 Score=18.59 Aligned_cols=29 Identities=14% Similarity=0.050 Sum_probs=24.5
Q ss_pred CcceEEEeecCCCceEEEEEEEEcCCCCC
Q psy14533 11 TEQWAVLQDLLPATLYRVRVLAENSLGAG 39 (209)
Q Consensus 11 ~~~~~~i~~L~p~~~Y~~~v~a~~~~g~~ 39 (209)
....+.|.++.......|.|.|.|..|..
T Consensus 29 ~~~~L~i~~v~~~D~G~YtC~a~N~~g~~ 57 (67)
T cd05863 29 SQHSLQIKDVTEASAGTYTLVLWNSAAGL 57 (67)
T ss_pred CcCEEEECCCCHHHCEEEEEEEEECCccE
Confidence 34589999999999999999999988743
No 125
>PF14292 SusE: SusE outer membrane protein
Probab=38.34 E-value=97 Score=19.14 Aligned_cols=36 Identities=14% Similarity=0.337 Sum_probs=24.0
Q ss_pred eEEEEecCCeEEEEEeCCCCccCCc-eeeEEEEEEEeCC
Q psy14533 166 ITCSALSSTSLSVTWQPPPLLLQNG-EILGYKVYYENMR 203 (209)
Q Consensus 166 ~~~~~~~~~sv~l~W~~p~~~~~~~-~i~~Y~i~y~~~~ 203 (209)
+.........++++|.++. .... ....|.|.....+
T Consensus 37 i~L~~~~~~a~tftW~~~~--~~~~~a~v~Y~lq~~~~~ 73 (122)
T PF14292_consen 37 IVLDEASDNAVTFTWTAAD--YGGPDAPVTYTLQFDKKG 73 (122)
T ss_pred EEecccCCceEEEEEECCc--cCCCCCceEEEEEEeccC
Confidence 4343345568999999984 3333 4567988888754
No 126
>cd05742 Ig1_VEGFR_like First immunoglobulin (Ig)-like domain of vascular endothelial growth factor (VEGF) receptor (R) and similar proteins. Ig1_VEGFR_like: first immunoglobulin (Ig)-like domain of vascular endothelial growth factor (VEGF) receptor (R) related proteins. The VEGFRs have an extracellular component with seven Ig-like domains, a transmembrane segment, and an intracellular tyrosine kinase domain interrupted by a kinase-insert domain. The VEGFR family consists of three members, VEGFR-1 (Flt-1), VEGFR-2 (KDR/Flk-1) and VEGFR-3 (Flt-4). VEGF-A interacts with both VEGFR-1 and VEGFR-2. VEGFR-1 binds VEGF most strongly; VEGFR-2 binds more weakly. VEGFR-3 appears not to bind VEGF, but binds other members of the VEGF family (VEGF-C and -D). VEGFRs bind VEGFs with high affinity at the Ig-like domains. VEGF-A is important to the growth and maintenance of vascular endothelial cells and to the development of new blood- and lymphatic-vessels in physiological and pathological states. VEGF
Probab=37.77 E-value=35 Score=19.14 Aligned_cols=28 Identities=4% Similarity=-0.154 Sum_probs=21.6
Q ss_pred ceEEEeecCCCceEEEEEEEEcCCCCCC
Q psy14533 13 QWAVLQDLLPATLYRVRVLAENSLGAGR 40 (209)
Q Consensus 13 ~~~~i~~L~p~~~Y~~~v~a~~~~g~~~ 40 (209)
..+.|.++.......|.|.|.|..|...
T Consensus 49 s~L~I~~v~~~DsG~Y~C~a~n~~~~~~ 76 (84)
T cd05742 49 STLTIPNATLKDSGTYTCAASSGTMDQK 76 (84)
T ss_pred EEEEECCCChhhCEEEEEEEccCCCceE
Confidence 3578888888888888888888776543
No 127
>smart00408 IGc2 Immunoglobulin C-2 Type.
Probab=36.40 E-value=59 Score=16.10 Aligned_cols=25 Identities=16% Similarity=0.036 Sum_probs=17.5
Q ss_pred cceEEEeecCCCceEEEEEEEEcCC
Q psy14533 12 EQWAVLQDLLPATLYRVRVLAENSL 36 (209)
Q Consensus 12 ~~~~~i~~L~p~~~Y~~~v~a~~~~ 36 (209)
...+.|.++.......|.+.+.+..
T Consensus 38 ~~~L~i~~~~~~d~G~Y~C~~~n~~ 62 (63)
T smart00408 38 GSTLTIKSVSLEDSGEYTCVAENSA 62 (63)
T ss_pred CcEEEEeeCCcccCEEEEEEEecCC
Confidence 3467777777777777777776654
No 128
>cd05738 Ig2_RPTP_IIa_LAR_like Second immunoglobulin (Ig)-like domain of the receptor protein tyrosine phosphatase (RPTP)-F, also known as LAR. Ig2_RPTP_IIa_LAR_like: domain similar to the second immunoglobulin (Ig)-like domain found in the receptor protein tyrosine phosphatase (RPTP)-F, also known as LAR. LAR belongs to the RPTP type IIa subfamily. Members of this subfamily are cell adhesion molecule-like proteins involved in central nervous system (CNS) development. They have large extracellular portions, comprised of multiple Ig-like domains and two to nine fibronectin type III (FNIII) domains, and a cytoplasmic portion having two tandem phosphatase domains.
Probab=36.24 E-value=75 Score=17.26 Aligned_cols=27 Identities=15% Similarity=0.084 Sum_probs=21.9
Q ss_pred ceEEEeecCCCceEEEEEEEEcCCCCC
Q psy14533 13 QWAVLQDLLPATLYRVRVLAENSLGAG 39 (209)
Q Consensus 13 ~~~~i~~L~p~~~Y~~~v~a~~~~g~~ 39 (209)
..+.|.++.+.....|++.|.|..|..
T Consensus 38 g~L~i~~~~~~D~G~Y~C~a~N~~G~~ 64 (74)
T cd05738 38 GALQIENSEESDQGKYECVATNSAGTR 64 (74)
T ss_pred cEEEECCCChhhCEEEEEEEECCCCce
Confidence 367888888888888999999888763
No 129
>cd05773 Ig8_hNephrin_like Eighth immunoglobulin-like domain of nephrin. Ig8_hNephrin_like: domain similar to the eighth immunoglobulin-like domain in human nephrin. Nephrin is an integral component of the slit diaphragm, and is a central component of the glomerular ultrafilter. Nephrin plays a structural role, and has a role in signaling. Nephrin is a transmembrane protein having a short intracellular portion, and an extracellular portion comprised of eight Ig-like domains, and one fibronectin type III-like domain. The extracellular portions of nephrin, from neighboring foot processes of separate podocyte cells, may interact with each other, and in association with other components of the slit diaphragm, form a porous molecular sieve within the slit pore. The intracellular portion of nephrin is associated with linker proteins, which connect nephrin to the actin cytoskeleton. The intracellular portion is tyrosine phosphorylated, and mediates signaling from the slit diaphragm into the p
Probab=35.40 E-value=81 Score=18.96 Aligned_cols=26 Identities=19% Similarity=0.164 Sum_probs=18.1
Q ss_pred eEEEeecC-CCceEEEEEEEEcCCCCC
Q psy14533 14 WAVLQDLL-PATLYRVRVLAENSLGAG 39 (209)
Q Consensus 14 ~~~i~~L~-p~~~Y~~~v~a~~~~g~~ 39 (209)
.+.|.++. +.....|.|.|.|..|..
T Consensus 70 ~L~I~~v~~~~D~G~Y~C~A~N~~G~~ 96 (109)
T cd05773 70 ILTIINVSAALDYALFTCTAHNSLGED 96 (109)
T ss_pred EEEECcCCccCCCEEEEEEEEeCCccC
Confidence 46677765 455567888888888764
No 130
>PF14686 fn3_3: Polysaccharide lyase family 4, domain II; PDB: 1NKG_A 2XHN_B 3NJX_A 3NJV_A.
Probab=34.53 E-value=72 Score=18.90 Aligned_cols=21 Identities=24% Similarity=0.434 Sum_probs=12.2
Q ss_pred CcceEEEeecCCCceEEEEEEE
Q psy14533 11 TEQWAVLQDLLPATLYRVRVLA 32 (209)
Q Consensus 11 ~~~~~~i~~L~p~~~Y~~~v~a 32 (209)
.+..|.|.+++||+ |.+.+.+
T Consensus 48 ~~G~Fti~~V~pGt-Y~L~ay~ 68 (95)
T PF14686_consen 48 SDGNFTIPNVRPGT-YRLYAYA 68 (95)
T ss_dssp TTSEEE---B-SEE-EEEEEEE
T ss_pred CCCcEEeCCeeCcE-eEEEEEE
Confidence 45689999999997 6666666
No 131
>cd05754 Ig3_Perlecan_like Third immunoglobulin (Ig)-like domain found in Perlecan and similar proteins. Ig3_Perlecan_like: domain similar to the third immunoglobulin (Ig)-like domain found in Perlecan. Perlecan is a large multi-domain heparan sulfate proteoglycan, important in tissue development and organogenesis. Perlecan can be represented as 5 major portions; its fourth major portion (domain IV) is a tandem repeat of immunoglobulin-like domains (Ig2-Ig15), which can vary in size due to alternative splicing. Perlecan binds many cellular and extracellular ligands. Its domain IV region has many binding sites. Some of these have been mapped at the level of individual Ig-like domains, including a site restricted to the Ig5 domain for heparin/sulfatide, a site restricted to the Ig3 domain for nidogen-1 and nidogen-2, a site restricted to Ig4-5 for fibronectin, and sites restricted to Ig2 and to Ig13-15 for fibulin-2.
Probab=34.45 E-value=77 Score=17.73 Aligned_cols=27 Identities=7% Similarity=-0.002 Sum_probs=19.6
Q ss_pred ceEEEeecCCCceEEEEEEEEcCCCCC
Q psy14533 13 QWAVLQDLLPATLYRVRVLAENSLGAG 39 (209)
Q Consensus 13 ~~~~i~~L~p~~~Y~~~v~a~~~~g~~ 39 (209)
..+.|.++.+...-.|.|.|.|..|..
T Consensus 52 ~~L~I~~v~~~DsG~Y~C~a~n~~g~~ 78 (85)
T cd05754 52 GILTIRNVQLSDAGTYVCTGSNMLDTD 78 (85)
T ss_pred CEEEECCCCHHHCEEEEEEEeccCCeE
Confidence 467777787777777888887766543
No 132
>cd05853 Ig6_Contactin-4 Sixth Ig domain of contactin-4. Ig6_Contactin-4: sixth Ig domain of the neural cell adhesion molecule contactin-4. Contactins are neural cell adhesion molecules, and are comprised of six Ig domains followed by four fibronectin type III (FnIII) domains anchored to the membrane by glycosylphosphatidylinositol. The different contactins show different expression patterns in the central nervous system. Highest expression of contactin-4 is in testes, thyroid, small intestine, uterus and brain. Contactin-4 plays a role in the response of neuroblastoma cells to differentiating agents, such as retinoids. The contactin 4 gene is associated with cerebellar degeneration in spinocerebellar ataxia type 16.
Probab=32.81 E-value=64 Score=18.55 Aligned_cols=28 Identities=4% Similarity=-0.047 Sum_probs=22.3
Q ss_pred ceEEEeecCCCceEEEEEEEEcCCCCCC
Q psy14533 13 QWAVLQDLLPATLYRVRVLAENSLGAGR 40 (209)
Q Consensus 13 ~~~~i~~L~p~~~Y~~~v~a~~~~g~~~ 40 (209)
..+.|.++.......|.|.|.|..+...
T Consensus 47 ~~L~I~nv~~~dsG~YtC~a~n~~~~~~ 74 (85)
T cd05853 47 GDLMIRSIQLKHAGKYVCMVQTSVDKLS 74 (85)
T ss_pred CcEEEecCCHHHCEEEEEEEEcccCceE
Confidence 4688888888888889999988776543
No 133
>PF07867 DUF1654: Protein of unknown function (DUF1654); InterPro: IPR012449 This entry is represented by Bacteriophage F116 (Pseudomonas phage F116), Orf28. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This family consists of proteins from the Pseudomonadaceae.
Probab=31.89 E-value=93 Score=17.49 Aligned_cols=19 Identities=16% Similarity=0.501 Sum_probs=16.1
Q ss_pred ceEEEEecCCeEEEEEeCC
Q psy14533 165 DITCSALSSTSLSVTWQPP 183 (209)
Q Consensus 165 ~~~~~~~~~~sv~l~W~~p 183 (209)
.+.+.-..+.+++|.|..+
T Consensus 52 gv~v~~~dDGsv~i~W~~~ 70 (73)
T PF07867_consen 52 GVEVTFNDDGSVRIRWERP 70 (73)
T ss_pred CeEEEEcCCCeEEEEEEcc
Confidence 6777777888999999977
No 134
>KOG3834|consensus
Probab=31.51 E-value=1.4e+02 Score=23.44 Aligned_cols=71 Identities=21% Similarity=0.364 Sum_probs=44.9
Q ss_pred ceEEEeecCCeEEEEecCCCCCCCCCcceeEEEEEEEccccCCCCcceEEeecCCCCcceEEecCCCCCcEEEEEE
Q psy14533 60 GLHAVAISSDSIRVTWSPPPAHLTNGDLLGYYLGYREQGFGRQNSYNFTTIPNRSDGAGVATLTGLRKYRKYDIVV 135 (209)
Q Consensus 60 ~~~~~~~~~~~~~l~W~~~~~~~~~~~~~~y~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~p~~~Y~~~v 135 (209)
.+.+.+.....+++.|-.+.... .|.+.++.|+|+. ........|..+.... ..-..+-+|.+++.|.+-+
T Consensus 67 kltv~n~kt~~~R~v~I~ps~~w-ggqllGvsvrFcs--f~~A~~~vwHvl~V~p--~SPaalAgl~~~~DYivG~ 137 (462)
T KOG3834|consen 67 KLTVYNSKTQEVRIVEIVPSNNW-GGQLLGVSVRFCS--FDGAVESVWHVLSVEP--NSPAALAGLRPYTDYIVGI 137 (462)
T ss_pred EEEEEecccceeEEEEecccccc-cccccceEEEecc--CccchhheeeeeecCC--CCHHHhcccccccceEecc
Confidence 44555666777888887776543 4558999999998 2222233333332221 2345577899999998766
No 135
>cd05729 Ig2_FGFR_like Second immunoglobulin (Ig)-like domain of fibroblast growth factor (FGF) receptor and similar proteins. Ig2_FGFR_like: domain similar to the second immunoglobulin (Ig)-like domain of fibroblast growth factor (FGF) receptor. FGF receptors bind FGF signaling polypeptides. FGFs participate in multiple processes such as morphogenesis, development, and angiogenesis. FGFs bind to four FGF receptor tyrosine kinases (FGFR1, -2, -3, -4). Receptor diversity is controlled by alternative splicing producing splice variants with different ligand binding characteristics and different expression patterns. FGFRs have an extracellular region comprised of three Ig-like domains, a single transmembrane helix, and an intracellular tyrosine kinase domain. Ligand binding and specificity reside in the Ig-like domains 2 and 3, and the linker region that connects these two. FGFR activation and signaling depend on FGF-induced dimerization, a process involving cell surface heparin or heparin
Probab=31.27 E-value=96 Score=16.98 Aligned_cols=25 Identities=20% Similarity=0.405 Sum_probs=19.3
Q ss_pred eEEEeecCCCceEEEEEEEEcCCCC
Q psy14533 14 WAVLQDLLPATLYRVRVLAENSLGA 38 (209)
Q Consensus 14 ~~~i~~L~p~~~Y~~~v~a~~~~g~ 38 (209)
.+.|.++.+.....|.|.+.|..|.
T Consensus 52 ~l~i~~~~~~d~g~Y~C~~~n~~g~ 76 (85)
T cd05729 52 TLILESVVPSDSGKYTCIVENKYGS 76 (85)
T ss_pred EEEEeECCcccCEEEEEEEEECCce
Confidence 4677788888888888888887664
No 136
>cd04973 Ig1_FGFR First immunoglobulin (Ig)-like domain of fibroblast growth factor receptor (FGFR). Ig1_FGFR: The first immunoglobulin (Ig)-like domain of fibroblast growth factor receptor (FGFR). Fibroblast growth factors (FGFs) participate in morphogenesis, development, angiogenesis, and wound healing. These FGF-stimulated processes are mediated by four FGFR tyrosine kinases (FGFR1-4). FGFRs are comprised of an extracellular portion consisting of three Ig-like domains, a transmembrane helix, and a cytoplasmic portion having protein tyrosine kinase activity. The highly conserved Ig-like domains 2 and 3, and the linker region between D2 and D3 define a general binding site for all FGFs.
Probab=30.88 E-value=62 Score=17.92 Aligned_cols=25 Identities=12% Similarity=0.141 Sum_probs=19.0
Q ss_pred eEEEeecCCCceEEEEEEEEcCCCC
Q psy14533 14 WAVLQDLLPATLYRVRVLAENSLGA 38 (209)
Q Consensus 14 ~~~i~~L~p~~~Y~~~v~a~~~~g~ 38 (209)
.+.|.++.+.....|.+.|.|..|.
T Consensus 46 ~L~I~~~~~~DsG~Y~C~a~n~~g~ 70 (79)
T cd04973 46 EVQIKDAVPRDSGLYACVTSSPSGS 70 (79)
T ss_pred EEEECCCChhhCEEEEEEEeCCCCc
Confidence 4677888888888888888776654
No 137
>PF13750 Big_3_3: Bacterial Ig-like domain (group 3)
Probab=30.72 E-value=75 Score=20.92 Aligned_cols=23 Identities=22% Similarity=0.372 Sum_probs=19.6
Q ss_pred ecCCCCCcEEEEEEEEEcCCCCc
Q psy14533 122 LTGLRKYRKYDIVVQAFNEKGPG 144 (209)
Q Consensus 122 ~~~L~p~~~Y~~~v~a~~~~g~~ 144 (209)
+..|+.+..|.+.|.|.+..|..
T Consensus 116 fpsle~~~~YtLtV~a~D~aGN~ 138 (158)
T PF13750_consen 116 FPSLEADDSYTLTVSATDKAGNQ 138 (158)
T ss_pred cCCcCCCCeEEEEEEEEecCCCE
Confidence 46788999999999999988743
No 138
>cd05728 Ig4_Contactin-2-like Fourth Ig domain of the neural cell adhesion molecule contactin-2 and similar proteins. Ig4_Contactin-2-like: fourth Ig domain of the neural cell adhesion molecule contactin-2. Contactins are comprised of six Ig domains followed by four fibronectin type III (FnIII) domains anchored to the membrane by glycosylphosphatidylinositol. Contactin-2 (aliases TAG-1, axonin-1) facilitates cell adhesion by homophilic binding between molecules in apposed membranes. The first four Ig domains form the intermolecular binding fragment which arranges as a compact U-shaped module by contacts between Ig domains 1 and 4, and domains 2 and 3. It has been proposed that a linear zipper-like array forms, from contactin-2 molecules alternatively provided by the two apposed membranes.
Probab=30.38 E-value=96 Score=17.44 Aligned_cols=27 Identities=19% Similarity=-0.001 Sum_probs=21.9
Q ss_pred ceEEEeecCCCceEEEEEEEEcCCCCC
Q psy14533 13 QWAVLQDLLPATLYRVRVLAENSLGAG 39 (209)
Q Consensus 13 ~~~~i~~L~p~~~Y~~~v~a~~~~g~~ 39 (209)
..+.|.++.......|+|.|.|..|..
T Consensus 51 ~~L~i~~~~~~D~G~Y~C~a~N~~G~~ 77 (85)
T cd05728 51 GDLRITKLSLSDSGMYQCVAENKHGTI 77 (85)
T ss_pred CEEEEeeCCHHHCEEEEEEEECCCCeE
Confidence 467888888888888999999887753
No 139
>cd05874 Ig6_NrCAM Sixth immunoglobulin (Ig)-like domain of NrCAM (Ng (neuronglia) CAM-related cell adhesion molecule). Ig6_NrCAM: sixth immunoglobulin (Ig)-like domain of NrCAM (Ng (neuronglia) CAM-related cell adhesion molecule). NrCAM belongs to the L1 subfamily of cell adhesion molecules (CAMs) and is comprised of an extracellular region having six Ig-like domains and five fibronectin type III domains, a transmembrane region, and an intracellular domain. NrCAM is primarily expressed in the nervous system.
Probab=30.01 E-value=1e+02 Score=16.98 Aligned_cols=27 Identities=19% Similarity=-0.021 Sum_probs=17.2
Q ss_pred ceEEEeecCCC----ceEEEEEEEEcCCCCC
Q psy14533 13 QWAVLQDLLPA----TLYRVRVLAENSLGAG 39 (209)
Q Consensus 13 ~~~~i~~L~p~----~~Y~~~v~a~~~~g~~ 39 (209)
.++.|..+.+. ....|+|.|.|..|..
T Consensus 39 g~l~i~~~~~~~~~~d~G~Y~C~A~N~~G~~ 69 (77)
T cd05874 39 GTLVINIMNGEKAEAYEGVYQCTARNERGAA 69 (77)
T ss_pred ceEEEeccccCCCCCCCEEEEEEEEcCCCeE
Confidence 45566666542 3467888888887754
No 140
>PF10333 Pga1: GPI-Mannosyltransferase II co-activator; InterPro: IPR019433 Pga1 is found only in yeasts and not in mammals. It localises in the ER as a glycosylated integral membrane protein. It binds to the GPI-mannosyltransferase II subunit of the GPI and it is responsible for the second mannose addition to GPI precursors. The GPI-anchoring complex is a glycolipid that functions as a membrane anchor for many cell-surface proteins [].
Probab=29.09 E-value=77 Score=21.46 Aligned_cols=21 Identities=24% Similarity=0.583 Sum_probs=14.1
Q ss_pred CcceEEEeecCCCceEEEEEE
Q psy14533 11 TEQWAVLQDLLPATLYRVRVL 31 (209)
Q Consensus 11 ~~~~~~i~~L~p~~~Y~~~v~ 31 (209)
...-..+.+|++|..|.+++.
T Consensus 64 ~t~~V~L~nl~~~e~y~vKiC 84 (180)
T PF10333_consen 64 STTYVELNNLQPGETYQVKIC 84 (180)
T ss_pred ceEEEEeccCCCCCeEEEEEE
Confidence 344567777777777777653
No 141
>PF11811 DUF3331: Domain of unknown function (DUF3331); InterPro: IPR021769 This family of proteins are functionally uncharacterised. This family is only found in bacteria. Proteins in this family vary in length from 96 to 160 amino acids.
Probab=28.82 E-value=1.4e+02 Score=17.91 Aligned_cols=20 Identities=40% Similarity=0.669 Sum_probs=15.7
Q ss_pred cceEEEEe-cCCeEEEEEeCC
Q psy14533 164 LDITCSAL-SSTSLSVTWQPP 183 (209)
Q Consensus 164 ~~~~~~~~-~~~sv~l~W~~p 183 (209)
..+.+... +++++.|.|..|
T Consensus 16 ~~I~vlEr~S~~t~~V~W~D~ 36 (96)
T PF11811_consen 16 VRIRVLERPSDTTLSVSWSDP 36 (96)
T ss_pred CEEEEEEecCCCEEEEEEECC
Confidence 35665555 899999999988
No 142
>KOG0613|consensus
Probab=28.78 E-value=4e+02 Score=24.41 Aligned_cols=64 Identities=16% Similarity=0.102 Sum_probs=39.0
Q ss_pred EEecCCCCCcEEEEEEEEEcCCCCccC---CccEEEEeccCCCCCCCcceEEEEecCCeEEEEEeCC
Q psy14533 120 ATLTGLRKYRKYDIVVQAFNEKGPGPM---SSEVSVQTLEDVPAAPPLDITCSALSSTSLSVTWQPP 183 (209)
Q Consensus 120 ~~~~~L~p~~~Y~~~v~a~~~~g~~~~---s~~~~~~t~~~~p~~~p~~~~~~~~~~~sv~l~W~~p 183 (209)
.....-.++.++.+.+...-..+.+.+ +-...+.....++..-+.++.+.....+.+++.|...
T Consensus 293 ~t~~~~~~~~q~~~~v~~~~~i~~~~~~p~~~~~aaa~~~~~~v~~~~~~~v~a~d~~~v~m~~~~~ 359 (1205)
T KOG0613|consen 293 WTVADKRQGCQDRFKVTEEAPIGKGKPGPESLQAAAARPPEVPVGLVRNLSVTARDNTLVEMPTALS 359 (1205)
T ss_pred eeeecccCCcccceeeeEEeeccccccCchhhhhhccCCccccccccccceeccccCcceeeccccc
Confidence 333345566777888777766554433 3223333333455555667777777888888988865
No 143
>cd00182 TBOX T-box DNA binding domain of the T-box family of transcriptional regulators. The T-box family is an ancient group that appears to play a critical role in development in all animal species. These genes were uncovered on the basis of similarity to the DNA binding domain of murine Brachyury (T) gene product, the defining feature of the family. Common features shared by T-box family members are DNA-binding and transcriptional regulatory activity, a role in development and conserved expression patterns, most of the known genes in all species being expressed in mesoderm or mesoderm precursors.
Probab=28.34 E-value=98 Score=21.15 Aligned_cols=24 Identities=21% Similarity=0.122 Sum_probs=20.3
Q ss_pred cceEEEeecCCCceEEEEEEEEcC
Q psy14533 12 EQWAVLQDLLPATLYRVRVLAENS 35 (209)
Q Consensus 12 ~~~~~i~~L~p~~~Y~~~v~a~~~ 35 (209)
...+.|+||.|...|.+.+.....
T Consensus 34 ~l~~~vsGLdp~~~Y~v~l~~~~~ 57 (188)
T cd00182 34 TLKVKVSGLDPNALYSVLMDLVPV 57 (188)
T ss_pred ceEEEEeCCCcccceEEEEEEEEc
Confidence 457899999999999999876644
No 144
>cd05756 Ig1_IL1R_like First immunoglobulin (Ig)-like domain of interleukin-1 receptor (IL1R) and similar proteins. Ig1_IL1R_like: domain similar to the first immunoglobulin (Ig)-like domain of interleukin-1 receptor (IL1R). IL-1 alpha and IL-1 beta are cytokines which participate in the regulation of inflammation, immune responses, and hematopoiesis. These cytokines bind to the IL-1 receptor type 1 (IL1R1), which is activated on additional association with an accessory protein, IL1RAP. IL-1 also binds a second receptor designated type II (IL1R2). Mature IL1R1 consists of three Ig-like domains, a transmembrane domain, and a large cytoplasmic domain. Mature IL1R2 is organized similarly except that it has a short cytoplasmic domain. The latter does not initiate signal transduction. A naturally occurring cytokine IL-1RA (IL-1 receptor antagonist) is widely expressed and binds to IL-1 receptors, inhibiting the binding of IL-1 alpha and IL-1 beta.
Probab=28.01 E-value=99 Score=18.01 Aligned_cols=25 Identities=8% Similarity=-0.205 Sum_probs=17.2
Q ss_pred eEEEeecCCCceEEEEEEEEcCCCC
Q psy14533 14 WAVLQDLLPATLYRVRVLAENSLGA 38 (209)
Q Consensus 14 ~~~i~~L~p~~~Y~~~v~a~~~~g~ 38 (209)
.+.|.+..+.....|.|.+.|..|.
T Consensus 61 ~L~I~~~~~~DsG~Y~C~~~N~~g~ 85 (94)
T cd05756 61 LLWFLPAALEDSGLYTCVVRNSTYC 85 (94)
T ss_pred eEEEccCCcccCeEEEEEEcCCCcc
Confidence 4567777777777777777776654
No 145
>cd02856 Glycogen_debranching_enzyme_N_term Glycogen_debranching_enzyme N-terminal domain. Glycogen debranching enzymes have both 4-alpha-glucanotransferase and amylo-1,6-glucosidase activities. As a transferase it transfers a segment of a 1,4-alpha-D-glucan to a new 4-position in an acceptor, which may be glucose or another 1,4-alpha-D-glucan. As a glucosidase it catalyzes the endohydrolysis of 1,6-alpha-D-glucoside linkages at points of branching in chains of 1,4-linked alpha-D-glucose residues. The N-terminus of the glycogen debranching enzyme may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitobiase, and chitinase.
Probab=26.22 E-value=87 Score=18.60 Aligned_cols=19 Identities=21% Similarity=0.293 Sum_probs=15.3
Q ss_pred eEEEeecCCCceEEEEEEE
Q psy14533 14 WAVLQDLLPATLYRVRVLA 32 (209)
Q Consensus 14 ~~~i~~L~p~~~Y~~~v~a 32 (209)
...+.++.++..|.|+|..
T Consensus 48 ~~~v~~~~~g~~Y~y~i~g 66 (103)
T cd02856 48 HGFLPGIKAGQRYGFRVHG 66 (103)
T ss_pred EEEECCCCCCCEEEEEECC
Confidence 4677788889999999865
No 146
>smart00425 TBOX Domain first found in the mice T locus (Brachyury) protein.
Probab=25.88 E-value=1.1e+02 Score=20.92 Aligned_cols=24 Identities=21% Similarity=0.090 Sum_probs=20.2
Q ss_pred cceEEEeecCCCceEEEEEEEEcC
Q psy14533 12 EQWAVLQDLLPATLYRVRVLAENS 35 (209)
Q Consensus 12 ~~~~~i~~L~p~~~Y~~~v~a~~~ 35 (209)
...+.++||.|...|.+.+.....
T Consensus 33 ~l~~~vsGLdp~~~Y~v~l~~~~~ 56 (190)
T smart00425 33 TLKYKVSGLDPNALYSVLMDLVPV 56 (190)
T ss_pred eeEEEEeCCCccCcEEEEEEEEEc
Confidence 357899999999999999876643
No 147
>cd05880 Ig_EVA1 Immunoglobulin (Ig)-like domain of epithelial V-like antigen 1 (EVA). Ig_EVA: immunoglobulin (Ig) domain of epithelial V-like antigen 1 (EVA). EVA is also known as myelin protein zero-like 2. EVA is an adhesion molecule, which may play a role in structural organization of the thymus and early lymphocyte development.
Probab=25.86 E-value=1.6e+02 Score=17.87 Aligned_cols=6 Identities=17% Similarity=0.927 Sum_probs=3.0
Q ss_pred cceEEe
Q psy14533 2 VEWKSQ 7 (209)
Q Consensus 2 ~~w~~~ 7 (209)
+.|..+
T Consensus 34 i~W~~~ 39 (115)
T cd05880 34 ITWNFR 39 (115)
T ss_pred EEEEEE
Confidence 457443
No 148
>PF07679 I-set: Immunoglobulin I-set domain; InterPro: IPR013098 The basic structure of immunoglobulin (Ig) molecules is a tetramer of two light chains and two heavy chains linked by disulphide bonds. There are two types of light chains: kappa and lambda, each composed of a constant domain (CL) and a variable domain (VL). There are five types of heavy chains: alpha, delta, epsilon, gamma and mu, all consisting of a variable domain (VH) and three (in alpha, delta and gamma) or four (in epsilon and mu) constant domains (CH1 to CH4). Ig molecules are highly modular proteins, in which the variable and constant domains have clear, conserved sequence patterns. The domains in Ig and Ig-like molecules are grouped into four types: V-set (variable; IPR013106 from INTERPRO), C1-set (constant-1; IPR003597 from INTERPRO), C2-set (constant-2; IPR008424 from INTERPRO) and I-set (intermediate; IPR013098 from INTERPRO) []. Structural studies have shown that these domains share a common core Greek-key beta-sandwich structure, with the types differing in the number of strands in the beta-sheets as well as in their sequence patterns [, ]. Immunoglobulin-like domains that are related in both sequence and structure can be found in several diverse protein families. Ig-like domains are involved in a variety of functions, including cell-cell recognition, cell-surface receptors, muscle structure and the immune system []. This entry represents I-set domains, which are found in several cell adhesion molecules, including vascular (VCAM), intercellular (ICAM), neural (NCAM) and mucosal addressin (MADCAM) cell adhesion molecules, as well as junction adhesion molecules (JAM). 
I-set domains are also present in several other diverse protein families, including several tyrosine-protein kinase receptors, the hemolymph protein hemolin, the muscle proteins titin, telokin, and twitchin, the neuronal adhesion molecule axonin-1 [], and the signalling molecule semaphorin 4D that is involved in axonal guidance, immune function and angiogenesis [].; PDB: 3MTR_A 2EDK_A 3DMK_B 1KOA_A 3NCM_A 2NCM_A 2V9Q_A 2CR3_A 3QQN_A 3QR2_A ....
Probab=25.84 E-value=1.2e+02 Score=16.85 Aligned_cols=29 Identities=14% Similarity=0.112 Sum_probs=23.3
Q ss_pred CcceEEEeecCCCceEEEEEEEEcCCCCC
Q psy14533 11 TEQWAVLQDLLPATLYRVRVLAENSLGAG 39 (209)
Q Consensus 11 ~~~~~~i~~L~p~~~Y~~~v~a~~~~g~~ 39 (209)
....+.|.++.......|.|.+.|..|..
T Consensus 54 ~~~~L~I~~v~~~D~G~Y~C~~~n~~g~~ 82 (90)
T PF07679_consen 54 GSSSLTIKNVTREDAGTYTCVASNSSGEA 82 (90)
T ss_dssp TEEEEEESSESGGGSEEEEEEEEETTEEE
T ss_pred ceeEEccCCCChhhCEEEEEEEEECCCEE
Confidence 45678889999888999999998876543
No 149
>cd05898 Ig5_KIRREL3 Fifth immunoglobulin (Ig)-like domain of Kirrel (kin of irregular chiasm-like) 3 protein (also known as Neph2). Ig5_KIRREL3: the fifth immunoglobulin (Ig)-like domain of Kirrel (kin of irregular chiasm-like) 3 protein (also known as Neph2). This protein has five Ig-like domains, one transmembrane domain, and a cytoplasmic tail. Included in this group is mammalian Kirrel (Neph1). These proteins contain multiple Ig domains, have properties of cell adhesion molecules, and are important in organ development. Neph1 and 2 may mediate axonal guidance and synapse formation in certain areas of the CNS. In the kidney, they participate in the formation of the slit diaphragm.
Probab=24.25 E-value=1.7e+02 Score=17.43 Aligned_cols=28 Identities=21% Similarity=0.255 Sum_probs=15.2
Q ss_pred cceEEEeecCCCc-eEEEEEEEEcCCCCC
Q psy14533 12 EQWAVLQDLLPAT-LYRVRVLAENSLGAG 39 (209)
Q Consensus 12 ~~~~~i~~L~p~~-~Y~~~v~a~~~~g~~ 39 (209)
.+.+.|.++.... ...|.|.|.|..|..
T Consensus 63 ~S~L~I~~~~~~d~~g~Y~C~a~N~~G~d 91 (98)
T cd05898 63 LSTLTINNIMEADFQTHYNCTAWNSFGSG 91 (98)
T ss_pred EEEEEECCCccccCCcEEEEEEEeCCccc
Confidence 3456666664322 335666666666643
No 150
>cd05900 Ig_Aggrecan Immunoglobulin (Ig)-like domain of the chondroitin sulfate proteoglycan core protein (CSPG), aggrecan. Ig_Aggrecan: immunoglobulin (Ig)-like domain of the chondroitin sulfate proteoglycan core protein (CSPG), aggrecan. In CSPGs, the Ig-like domain is followed by hyaluronan (HA)-binding tandem repeats, and a C-terminal region with epidermal growth factor-like, lectin-like, and complement regulatory protein-like domains. Separating these N- and C-terminal regions is a nonhomologous glycosaminoglycan attachment region. In cartilage, aggrecan forms cartilage link protein stabilized aggregates with HA. These aggregates contribute to the tissue's load bearing properties. Aggrecan has a wide distribution in connective tissue and extracellular matrices. Members of the vertebrate HPLN (hyaluronan/HA and proteoglycan binding link) protein family are physically linked adjacent to CSPG genes.
Probab=23.43 E-value=1.9e+02 Score=17.70 Aligned_cols=20 Identities=10% Similarity=-0.107 Sum_probs=8.8
Q ss_pred eEEEeecCCCceEEEEEEEE
Q psy14533 14 WAVLQDLLPATLYRVRVLAE 33 (209)
Q Consensus 14 ~~~i~~L~p~~~Y~~~v~a~ 33 (209)
++.|.+|+....-.|+|...
T Consensus 75 sL~I~nl~~sDsG~Y~C~V~ 94 (112)
T cd05900 75 TLEITELRSNDSGTYRCEVM 94 (112)
T ss_pred EEEEeecccccCEEEEEEEe
Confidence 44444444444444444443
No 151
>cd05771 IgC_Tapasin_R Tapasin-R immunoglobulin-like domain. IgC_Tapasin_R: Immunoglobulin-like domain on Tapasin-R. Tapasin is a V-C1 (variable-constant) immunoglobulin superfamily molecule present in the endoplasmic reticulum (ER), where it links MHC class I molecules to the transporter associated with antigen processing (TAP). Tapasin-R is a tapasin-related protein that contains similar structural motifs to Tapasin, with some marked differences, especially in the V domain, transmembrane and cytoplasmic regions. The majority of Tapasin-R is located within the ER; however, there may be some expression of Tapasin-R at the cell surface. Tapasin-R lacks an obvious ER retention signal.
Probab=23.40 E-value=2e+02 Score=18.05 Aligned_cols=25 Identities=8% Similarity=-0.207 Sum_probs=18.7
Q ss_pred eEEEeecCCCceEEEEEEEEcCCCC
Q psy14533 14 WAVLQDLLPATLYRVRVLAENSLGA 38 (209)
Q Consensus 14 ~~~i~~L~p~~~Y~~~v~a~~~~g~ 38 (209)
++.|.+++......|.|.+.+..+.
T Consensus 2 sL~i~~v~~~D~G~Y~C~~~~~~~~ 26 (139)
T cd05771 2 SLTLPGLTVHDEGTYICSVSTPPHQ 26 (139)
T ss_pred eEEECCCCHHHCEEEEEEEEccCcc
Confidence 5678888887778888888776554
No 152
>KOG1378|consensus
Probab=22.43 E-value=1.2e+02 Score=23.95 Aligned_cols=35 Identities=26% Similarity=0.201 Sum_probs=25.9
Q ss_pred cceEEEeecCCCceEEEEEEEEcCCCCCCCCCCeeEecCC
Q psy14533 12 EQWAVLQDLLPATLYRVRVLAENSLGAGRPSDPLLVHTEA 51 (209)
Q Consensus 12 ~~~~~i~~L~p~~~Y~~~v~a~~~~g~~~~s~~~~~~t~~ 51 (209)
...+.+.+|++++.|.|+|-.-.. +|....+++..
T Consensus 108 ih~~~~~~L~~~t~YyY~~Gs~~~-----wS~~f~F~t~p 142 (452)
T KOG1378|consen 108 IHDAVMKNLEPNTRYYYQVGSDLK-----WSEIFSFKTPP 142 (452)
T ss_pred EeeeeecCCCCCceEEEEeCCCCC-----cccceEeECCC
Confidence 346788999999999999854332 56777777655
No 153
>PF05738 Cna_B: Cna protein B-type domain; InterPro: IPR008454 This entry represents a repeated B region domain found in the collagen-binding surface protein Cna in Staphylococcus aureus, as well as other related domains. The B region domain of Cna has a prealbumin-like beta-sandwich fold of seven strands in two sheets with a Greek key topology []. However, this domain does not mediate collagen binding, the IPR008456 from INTERPRO region carries out that function; instead it appears to form a stalk that presents the ligand binding domain away from the bacterial cell surface. Cna is a collagen-binding MSCRAMM (Microbial Surface Component Recognizing Adhesive Matrix Molecules), and is necessary and sufficient for S. aureus cells to adhere to cartilage.; PDB: 2X5P_A 3RKP_A 3KPT_A 1VLF_T 1TI2_F 1TI6_D 1TI4_J 1VLE_V 1VLD_X 3PF2_A ....
Probab=22.12 E-value=85 Score=16.79 Aligned_cols=22 Identities=18% Similarity=0.315 Sum_probs=16.0
Q ss_pred CCcceEEEeecCCCceEEEEEEE
Q psy14533 10 GTEQWAVLQDLLPATLYRVRVLA 32 (209)
Q Consensus 10 ~~~~~~~i~~L~p~~~Y~~~v~a 32 (209)
.....+.+.+|.+|. |.++-..
T Consensus 24 d~~G~~~f~~L~~G~-Y~l~E~~ 45 (70)
T PF05738_consen 24 DENGKYTFKNLPPGT-YTLKETK 45 (70)
T ss_dssp GTTSEEEEEEEESEE-EEEEEEE
T ss_pred CCCCEEEEeecCCeE-EEEEEEE
Confidence 345688999999997 6666544
No 154
>cd05764 Ig_2 Subgroup of the immunoglobulin (Ig) superfamily. Ig_2: subgroup of the immunoglobulin (Ig) domain found in the Ig superfamily. The Ig superfamily is a heterogenous group of proteins, built on a common fold comprised of a sandwich of two beta sheets. Members of the Ig superfamily are components of immunoglobulin, neuroglia, cell surface glycoproteins, such as T-cell receptors, CD2, CD4, CD8, and membrane glycoproteins, such as butyrophilin and chondroitin sulfate proteoglycan core protein. A predominant feature of most Ig domains is a disulfide bridge connecting the two beta-sheets with a tryptophan residue packed against the disulfide bond.
Probab=22.09 E-value=1.4e+02 Score=15.92 Aligned_cols=27 Identities=11% Similarity=-0.064 Sum_probs=20.2
Q ss_pred ceEEEeecCCCceEEEEEEEEcCCCCC
Q psy14533 13 QWAVLQDLLPATLYRVRVLAENSLGAG 39 (209)
Q Consensus 13 ~~~~i~~L~p~~~Y~~~v~a~~~~g~~ 39 (209)
.++.|..+.......|.+.|.|..|..
T Consensus 40 ~~L~i~~~~~~D~G~Y~C~a~N~~G~~ 66 (74)
T cd05764 40 GTLDILITTVKDTGSFTCIASNAAGEA 66 (74)
T ss_pred CEEEEEECChhhCEEEEEEEECCCCeE
Confidence 456777777777888888888877643
No 155
>PF02018 CBM_4_9: Carbohydrate binding domain; InterPro: IPR003305 The 1,4-beta-glucanase CenC from Cellulomonas fimi contains two cellulose-binding domains, CBD(N1) and CBD(N2), arranged in tandem at its N terminus. These homologous CBDs are distinct in their selectivity for binding amorphous and not crystalline cellulose []. Multidimensional heteronuclear nuclear magnetic resonance (NMR) spectroscopy was used to determine the tertiary structure of the 152 amino acid N-terminal cellulose-binding domain from C. fimi 1,4-beta-glucanase CenC (CBDN1) []. The tertiary structure of CBDN1 is strikingly similar to that of the bacterial 1,3-1,4-beta-glucanases, as well as other sugar-binding proteins with jelly-roll folds.; GO: 0016798 hydrolase activity, acting on glycosyl bonds; PDB: 3OEA_B 2ZEX_B 3OEB_A 2ZEY_A 2ZEW_A 1GUI_A 2W5F_A 2WZE_A 2WYS_A 2ZEZ_B ....
Probab=21.98 E-value=1.2e+02 Score=18.39 Aligned_cols=20 Identities=25% Similarity=0.395 Sum_probs=16.3
Q ss_pred eecCCCceEEEEEEEEcCCC
Q psy14533 18 QDLLPATLYRVRVLAENSLG 37 (209)
Q Consensus 18 ~~L~p~~~Y~~~v~a~~~~g 37 (209)
..|++|..|.+.+.+....+
T Consensus 56 ~~l~~G~~Y~~s~~vk~~~~ 75 (131)
T PF02018_consen 56 ISLKPGKTYTVSFWVKADSG 75 (131)
T ss_dssp EEE-TTSEEEEEEEEEESSS
T ss_pred eEecCCCEEEEEEEEEeCCC
Confidence 67999999999999987654
No 156
>cd05715 Ig_P0-like Immunoglobulin (Ig)-like domain of Protein zero (P0) and similar proteins. Ig_P0ex-like: domain similar to the immunoglobulin (Ig) domain of Protein zero (P0). P0 accounts for over 50% of the total protein in peripheral nervous system (PNS) myelin. P0 is a single-pass transmembrane glycoprotein with a highly basic intracellular domain and an extracellular Ig domain. The extracellular domain of P0 (P0-ED) is similar to the Ig variable domain, carrying one acceptor sequence for N-linked glycosylation. P0 plays a role in membrane adhesion in the spiral wraps of the myelin sheath. The intracellular domain is thought to mediate membrane apposition of the cytoplasmic faces and may, through electrostatic interactions, interact directly with lipid headgroups. It is thought that homophilic interactions of the P0 extracellular domain mediate membrane juxtaposition in the extracellular space of PNS myelin. This group also contains the Ig domain of Sodium channel subunit beta-2
Probab=21.97 E-value=2e+02 Score=17.41 Aligned_cols=8 Identities=38% Similarity=0.700 Sum_probs=4.5
Q ss_pred cceEEecC
Q psy14533 2 VEWKSQNS 9 (209)
Q Consensus 2 ~~w~~~~~ 9 (209)
+.|..+..
T Consensus 34 v~W~~~~~ 41 (116)
T cd05715 34 VTWSFQPE 41 (116)
T ss_pred EEEEEecC
Confidence 56855544
No 157
>PF14250 AbrB-like: AbrB-like transcriptional regulator
Probab=21.66 E-value=1e+02 Score=17.08 Aligned_cols=15 Identities=20% Similarity=0.408 Sum_probs=12.2
Q ss_pred ecCCCCCcEEEEEEE
Q psy14533 122 LTGLRKYRKYDIVVQ 136 (209)
Q Consensus 122 ~~~L~p~~~Y~~~v~ 136 (209)
..+|+||.+|++.+-
T Consensus 50 ~m~L~PGdEFeI~Lg 64 (71)
T PF14250_consen 50 QMGLKPGDEFEIKLG 64 (71)
T ss_pred HhCCCCCCEEEEEeC
Confidence 458999999988763
No 158
>cd05848 Ig1_Contactin-5 First Ig domain of contactin-5. Ig1_Contactin-5: First Ig domain of the neural cell adhesion molecule contactin-5. Contactins are comprised of six Ig domains followed by four fibronectin type III (FnIII) domains, anchored to the membrane by glycosylphosphatidylinositol. The different contactins show different expression patterns in the central nervous system. In rats, a lack of contactin-5 (NB-2) results in an impairment of the neuronal activity in the auditory system. Contactin-5 is expressed specifically in the postnatal nervous system, peaking at about 3 weeks postnatal. Contactin-5 is highly expressed in the adult human brain in the occipital lobe and in the amygdala; lower levels of expression have been detected in the corpus callosum, caudate nucleus, and spinal cord.
Probab=21.44 E-value=1.8e+02 Score=16.79 Aligned_cols=26 Identities=23% Similarity=0.175 Sum_probs=17.3
Q ss_pred eEEEeec-CCCceEEEEEEEEcCCCCC
Q psy14533 14 WAVLQDL-LPATLYRVRVLAENSLGAG 39 (209)
Q Consensus 14 ~~~i~~L-~p~~~Y~~~v~a~~~~g~~ 39 (209)
.+.|.++ .+.....|+|.|.|..|..
T Consensus 59 ~L~i~~~~~~~D~G~Y~C~A~N~~G~~ 85 (94)
T cd05848 59 NLIISNPSEVKDSGRYQCLATNSIGSI 85 (94)
T ss_pred eEEEccCCccCcCEEEEEEEEcCCCeE
Confidence 4456565 3466677888888887754
No 159
>PF07753 DUF1609: Protein of unknown function (DUF1609); InterPro: IPR011667 This region is found in a number of hypothetical proteins thought to be expressed by the eukaryote Encephalitozoon cuniculi, an obligate intracellular microsporidial parasite. The proteins are approximately 200 residues long.
Probab=20.21 E-value=1e+02 Score=21.28 Aligned_cols=36 Identities=6% Similarity=0.240 Sum_probs=23.4
Q ss_pred cceEEEEe-cCCeEEEEEeCCCCccCCceeeEEEEEEEe
Q psy14533 164 LDITCSAL-SSTSLSVTWQPPPLLLQNGEILGYKVYYEN 201 (209)
Q Consensus 164 ~~~~~~~~-~~~sv~l~W~~p~~~~~~~~i~~Y~i~y~~ 201 (209)
..++...+ ..+.++|.|..| .+..-.+....|..++
T Consensus 192 kgvR~E~v~~~~~frIvwrnp--~~TSevlr~LTI~~~P 228 (230)
T PF07753_consen 192 KGVRSETVKEGDEFRIVWRNP--KDTSEVLRSLTILRRP 228 (230)
T ss_pred CCceeEEeccCCEEEEEecCC--ccHHHHHhhheeeecC
Confidence 36666654 567899999998 4444445566665544
Done!