Query psy7018
Match_columns 349
No_of_seqs 223 out of 1521
Neff 11.1
Searched_HMMs 46136
Date Sat Aug 17 00:42:09 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy7018.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/7018hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG4221|consensus 100.0 1.1E-31 2.5E-36 245.4 30.3 296 9-308 492-805 (1381)
2 KOG3513|consensus 100.0 6.6E-31 1.4E-35 241.2 33.8 306 11-321 590-920 (1051)
3 KOG4221|consensus 100.0 1.6E-29 3.6E-34 231.4 27.4 301 14-317 395-713 (1381)
4 KOG3513|consensus 99.9 9.7E-24 2.1E-28 194.5 30.7 303 9-313 683-1013(1051)
5 KOG0196|consensus 99.9 5.1E-22 1.1E-26 176.2 17.2 236 3-317 291-537 (996)
6 KOG4222|consensus 99.8 1.8E-16 3.8E-21 146.1 23.0 305 11-315 485-844 (1281)
7 PF00041 fn3: Fibronectin type 99.4 4.7E-13 1E-17 91.3 8.1 83 225-307 2-85 (85)
8 KOG4258|consensus 99.4 5.5E-13 1.2E-17 119.9 10.2 291 9-314 575-922 (1025)
9 KOG0196|consensus 99.1 9.1E-10 2E-14 99.5 13.6 134 88-221 395-537 (996)
10 cd00063 FN3 Fibronectin type 3 99.0 5.3E-09 1.2E-13 72.0 11.4 89 223-312 1-91 (93)
11 KOG4222|consensus 99.0 1.3E-08 2.8E-13 95.3 15.6 208 9-216 593-841 (1281)
12 PF00041 fn3: Fibronectin type 99.0 1.9E-09 4.2E-14 73.3 7.6 71 43-113 2-78 (85)
13 PF10179 DUF2369: Uncharacteri 98.9 3.5E-06 7.7E-11 69.5 24.6 133 179-315 125-297 (300)
14 KOG4802|consensus 98.9 7.7E-08 1.7E-12 80.4 13.8 135 170-305 196-341 (516)
15 smart00060 FN3 Fibronectin typ 98.5 1.2E-06 2.7E-11 58.3 9.5 77 225-303 3-82 (83)
16 KOG4802|consensus 98.4 7.2E-06 1.6E-10 68.9 12.1 104 9-113 218-336 (516)
17 KOG4367|consensus 98.3 5.6E-07 1.2E-11 75.7 4.5 82 234-316 450-531 (699)
18 cd00063 FN3 Fibronectin type 3 98.3 7.8E-06 1.7E-10 55.8 9.1 69 45-113 5-79 (93)
19 KOG4258|consensus 98.2 1.8E-05 4E-10 72.7 11.7 122 143-264 501-652 (1025)
20 COG3401 Fibronectin type 3 dom 98.0 0.0024 5.2E-08 53.1 19.6 279 10-304 41-332 (343)
21 smart00060 FN3 Fibronectin typ 98.0 3.5E-05 7.7E-10 51.0 7.0 65 47-113 7-79 (83)
22 PF10179 DUF2369: Uncharacteri 97.9 0.0042 9.2E-08 51.7 18.3 128 90-218 127-296 (300)
23 KOG4152|consensus 97.9 0.00016 3.6E-09 63.0 10.2 122 186-308 651-790 (830)
24 KOG3632|consensus 97.8 0.00044 9.5E-09 64.8 13.1 242 50-304 587-856 (1335)
25 PF01108 Tissue_fac: Tissue fa 97.5 0.00051 1.1E-08 48.6 6.8 79 223-304 22-105 (107)
26 PF09294 Interfer-bind: Interf 97.4 0.00059 1.3E-08 48.2 6.2 87 225-314 5-105 (106)
27 COG3401 Fibronectin type 3 dom 97.2 0.021 4.5E-07 47.7 13.7 160 146-312 86-250 (343)
28 KOG4152|consensus 97.1 0.00089 1.9E-08 58.6 5.6 68 7-75 654-723 (830)
29 KOG4367|consensus 96.7 0.002 4.4E-08 55.0 3.8 75 145-220 454-531 (699)
30 COG4733 Phage-related protein, 96.6 0.03 6.5E-07 52.4 10.8 132 172-307 650-786 (952)
31 KOG4806|consensus 96.6 0.39 8.4E-06 40.5 16.2 128 184-315 286-447 (454)
32 PF09294 Interfer-bind: Interf 96.5 0.014 3E-07 41.1 6.6 60 158-218 44-105 (106)
33 PF07495 Y_Y_Y: Y_Y_Y domain; 95.3 0.1 2.3E-06 32.9 6.2 37 267-305 19-55 (66)
34 PF01108 Tissue_fac: Tissue fa 95.1 0.24 5.2E-06 34.9 8.3 63 145-208 36-105 (107)
35 KOG4228|consensus 95.1 0.16 3.5E-06 49.5 9.3 112 199-317 144-255 (1087)
36 PF07495 Y_Y_Y: Y_Y_Y domain; 94.6 0.21 4.6E-06 31.4 6.3 48 160-209 8-55 (66)
37 PF09067 EpoR_lig-bind: Erythr 94.1 0.53 1.1E-05 32.8 7.9 78 223-303 8-95 (104)
38 PLN02533 probable purple acid 94.1 0.55 1.2E-05 42.2 10.0 84 222-316 40-135 (427)
39 TIGR00864 PCC polycystin catio 94.1 12 0.00025 41.5 25.3 33 279-312 1661-1693(2740)
40 KOG3632|consensus 93.9 0.51 1.1E-05 45.4 9.5 192 10-211 642-859 (1335)
41 TIGR00868 hCaCC calcium-activa 93.6 7.9 0.00017 38.1 21.9 82 226-308 759-860 (863)
42 TIGR00864 PCC polycystin catio 92.3 22 0.00048 39.6 27.5 104 193-313 1754-1864(2740)
43 KOG1948|consensus 91.7 11 0.00025 36.4 14.8 120 180-300 944-1065(1165)
44 KOG0613|consensus 91.0 9.5 0.0002 38.6 14.1 253 51-308 150-423 (1205)
45 KOG4228|consensus 90.1 0.61 1.3E-05 45.7 5.5 89 15-109 142-234 (1087)
46 KOG4806|consensus 90.1 2.7 5.9E-05 35.7 8.4 101 9-113 293-432 (454)
47 COG4733 Phage-related protein, 89.8 2.5 5.3E-05 40.4 8.9 119 88-210 657-785 (952)
48 KOG1948|consensus 89.6 12 0.00026 36.3 13.0 114 90-203 945-1064(1165)
49 KOG1225|consensus 89.0 1.1 2.5E-05 40.8 6.0 114 174-295 410-524 (525)
50 PF13750 Big_3_3: Bacterial Ig 87.9 10 0.00022 28.8 11.0 31 282-313 116-146 (158)
51 PLN02533 probable purple acid 87.4 4.3 9.4E-05 36.6 8.8 64 40-108 40-120 (427)
52 PF14292 SusE: SusE outer memb 85.0 6.9 0.00015 28.2 7.3 60 52-111 42-121 (122)
53 TIGR00868 hCaCC calcium-activa 83.9 8.3 0.00018 37.9 9.2 40 40-80 755-796 (863)
54 PHA02579 7 baseplate wedge sub 83.1 3 6.6E-05 39.4 5.7 73 44-120 8-94 (1030)
55 cd05735 Ig8_DSCAM Eight immuno 82.8 1.6 3.4E-05 29.4 3.0 34 8-41 55-88 (88)
56 PF08329 ChitinaseA_N: Chitina 82.6 11 0.00025 27.5 7.4 66 144-216 38-105 (133)
57 cd05762 Ig8_MLCK Eighth immuno 82.4 1.5 3.2E-05 30.2 2.8 34 8-41 64-97 (98)
58 PF14292 SusE: SusE outer memb 81.8 17 0.00036 26.2 9.2 65 234-298 42-121 (122)
59 PF09067 EpoR_lig-bind: Erythr 80.5 10 0.00022 26.4 6.4 72 40-112 7-91 (104)
60 PF13754 Big_3_4: Bacterial Ig 79.9 6.8 0.00015 23.4 4.8 28 279-307 15-42 (54)
61 PF09240 IL6Ra-bind: Interleuk 79.4 17 0.00036 25.0 7.3 78 226-305 2-88 (99)
62 COG3979 Uncharacterized protei 78.9 5.3 0.00011 31.2 5.0 87 223-316 5-92 (181)
63 KOG1225|consensus 71.0 15 0.00033 33.8 6.5 150 44-199 369-524 (525)
64 cd02848 Chitinase_N_term Chiti 69.6 35 0.00077 23.8 7.9 61 144-208 34-95 (106)
65 KOG1378|consensus 68.6 37 0.0008 30.6 8.2 87 224-316 43-142 (452)
66 PF11344 DUF3146: Protein of u 68.0 4.1 8.9E-05 26.0 1.7 15 284-298 66-80 (80)
67 PF08329 ChitinaseA_N: Chitina 65.5 30 0.00064 25.4 5.9 72 235-314 36-107 (133)
68 PHA02579 7 baseplate wedge sub 64.9 25 0.00055 33.7 6.7 84 225-314 7-102 (1030)
69 PF09423 PhoD: PhoD-like phosp 61.7 15 0.00032 33.6 4.8 34 279-315 65-98 (453)
70 KOG0613|consensus 60.7 67 0.0014 33.0 9.1 108 8-115 299-417 (1205)
71 cd05762 Ig8_MLCK Eighth immuno 59.3 54 0.0012 22.4 6.8 38 179-216 53-90 (98)
72 PF13754 Big_3_4: Bacterial Ig 57.1 40 0.00086 20.1 5.5 36 182-218 14-49 (54)
73 PF09402 MSC: Man1-Src1p-C-ter 54.1 4.2 9.2E-05 35.3 0.0 24 2-25 49-73 (334)
74 PF07699 GCC2_GCC3: GCC2 and G 53.3 4.4 9.5E-05 23.5 -0.0 19 4-22 10-28 (48)
75 PF07353 Uroplakin_II: Uroplak 51.6 1E+02 0.0022 23.3 8.6 39 277-316 101-139 (184)
76 PF07353 Uroplakin_II: Uroplak 50.9 25 0.00055 26.3 3.5 23 90-112 101-123 (184)
77 PF01422 zf-NF-X1: NF-X1 type 49.8 9.8 0.00021 17.5 0.8 9 2-10 12-20 (20)
78 cd08544 Reeler Reeler, the N-t 49.0 47 0.001 24.3 4.9 22 51-72 92-113 (135)
79 PF10342 GPI-anchored: Ser-Thr 48.7 79 0.0017 21.0 7.2 59 236-297 12-78 (93)
80 cd02848 Chitinase_N_term Chiti 47.1 98 0.0021 21.7 8.6 70 236-312 33-102 (106)
81 PF02010 REJ: REJ domain; Int 46.6 6.5 0.00014 35.7 0.0 113 188-304 268-389 (440)
82 KOG3515|consensus 41.6 3.5E+02 0.0077 26.6 10.3 78 183-264 655-732 (741)
83 cd05854 Ig6_Contactin-2 Sixth 40.6 21 0.00045 23.6 1.7 27 10-36 57-83 (85)
84 PF10576 EndIII_4Fe-2S: Iron-s 39.5 13 0.00029 16.2 0.4 12 2-13 5-16 (17)
85 TIGR03000 plancto_dom_1 Planct 39.2 48 0.001 21.5 3.0 26 275-300 26-51 (75)
86 cd05735 Ig8_DSCAM Eight immuno 37.9 1.2E+02 0.0026 20.1 7.0 35 182-216 47-81 (88)
87 cd04971 Ig_TrKABC_d5 Fifth dom 37.6 33 0.00072 22.5 2.3 23 7-29 53-75 (81)
88 cd05851 Ig3_Contactin-1 Third 37.5 95 0.0021 20.5 4.7 27 182-208 53-79 (88)
89 PHA02637 TNF-alpha-receptor-li 37.0 19 0.00042 25.9 1.1 13 2-14 62-74 (127)
90 cd00146 PKD polycystic kidney 36.1 1.2E+02 0.0026 19.5 7.2 60 236-305 14-73 (81)
91 PF04775 Bile_Hydr_Trans: Acyl 35.3 65 0.0014 23.4 3.7 24 279-302 6-29 (126)
92 KOG3515|consensus 34.6 49 0.0011 32.1 3.6 68 9-80 663-730 (741)
93 cd05773 Ig8_hNephrin_like Eigh 34.0 49 0.0011 23.1 2.9 27 11-37 81-107 (109)
94 COG3979 Uncharacterized protei 30.7 1.4E+02 0.003 23.4 5.1 70 42-113 6-75 (181)
95 smart00469 WIF Wnt-inhibitory 30.3 2.3E+02 0.0049 21.0 6.4 60 53-115 43-102 (136)
96 PRK13211 N-acetylglucosamine-b 30.3 4.5E+02 0.0097 24.4 10.2 35 279-314 370-404 (478)
97 COG4932 Predicted outer membra 29.6 7E+02 0.015 26.4 12.0 197 93-293 1105-1303(1531)
98 PF13205 Big_5: Bacterial Ig-l 29.3 1.9E+02 0.0041 19.7 8.1 20 283-302 70-91 (107)
99 PF02010 REJ: REJ domain; Int 29.1 36 0.00078 30.9 1.9 22 283-304 155-176 (440)
100 PF12245 Big_3_2: Bacterial Ig 28.3 1E+02 0.0022 18.9 3.2 20 11-30 23-42 (60)
101 cd05748 Ig_Titin_like Immunogl 27.7 1.6E+02 0.0034 18.4 5.7 29 179-207 37-65 (74)
102 PF15417 DUF4624: Domain of un 27.4 1.1E+02 0.0023 21.5 3.4 32 78-109 76-107 (132)
103 TIGR03769 P_ac_wall_RPT actino 27.2 1.2E+02 0.0027 16.9 3.3 13 286-299 11-23 (41)
104 cd05893 Ig_Palladin_C C-termin 26.6 28 0.0006 22.4 0.5 19 9-27 50-68 (75)
105 PF09423 PhoD: PhoD-like phosp 26.3 1.1E+02 0.0024 28.0 4.5 37 180-219 62-98 (453)
106 PF05345 He_PIG: Putative Ig d 26.1 84 0.0018 18.3 2.4 13 290-302 37-49 (49)
107 PF00907 T-box: T-box; InterP 26.1 1.2E+02 0.0027 23.6 4.2 25 89-113 31-55 (184)
108 cd05765 Ig_3 Subgroup of the i 24.2 50 0.0011 21.2 1.4 20 8-27 55-74 (81)
109 cd05760 Ig2_PTK7 Second immuno 24.0 91 0.002 19.9 2.6 18 9-26 47-64 (77)
110 cd04974 Ig3_FGFR Third immunog 23.4 93 0.002 20.6 2.7 24 9-32 64-87 (90)
111 cd05723 Ig4_Neogenin Fourth im 22.7 49 0.0011 20.7 1.1 19 9-27 46-64 (71)
112 cd05853 Ig6_Contactin-4 Sixth 22.1 76 0.0016 21.1 2.0 26 11-36 58-83 (85)
113 cd05763 Ig_1 Subgroup of the i 21.6 82 0.0018 19.8 2.1 19 9-27 49-67 (75)
114 cd05733 Ig6_L1-CAM_like Sixth 21.5 1.1E+02 0.0023 19.6 2.6 16 10-25 53-68 (77)
115 cd05855 Ig_TrkB_d5 Fifth domai 20.8 1.1E+02 0.0024 20.0 2.5 19 10-28 54-72 (79)
116 cd04970 Ig6_Contactin_like Six 20.5 64 0.0014 21.0 1.4 26 10-35 57-82 (85)
117 PF00907 T-box: T-box; InterP 20.1 1.3E+02 0.0028 23.5 3.2 24 277-300 32-55 (184)
No 1
>KOG4221|consensus
Probab=100.00 E-value=1.1e-31 Score=245.45 Aligned_cols=296 Identities=22% Similarity=0.282 Sum_probs=239.3
Q ss_pred CCCceEEEEeeecCCCCCCCCcceeeeCCCCCCCCCccccccccCCCeeEEEecCCCCCCCCceeEEEEecC---CCCCc
Q psy7018 9 TDHKVSTEDRAGNSAGWSPFSETSVITTPPGPPGSLTLRKTSVTTPTSLTVSWSEPANHGSPILHYVLETSD---PAQPS 85 (349)
Q Consensus 9 p~~~y~~~v~a~n~~G~~~~s~~~~~~t~~~~P~~~~~~~~~~~~~~si~l~W~~~~~~~~~~~~Y~v~~~~---~~~~~ 85 (349)
|.+.|.|||+|.|..|.+..+.++.+++.+..|.. +++...+..++.+.|++|.-+++++++|.+.|.. +.+..
T Consensus 492 p~t~Y~~rv~A~n~~g~g~sS~pLkV~t~pEgp~~---~~a~ats~~ti~v~WepP~~~n~~I~~yk~~ys~~~~~~~~~ 568 (1381)
T KOG4221|consen 492 PLTMYFFRVRAKNEAGSGESSAPLKVTTQPEGPVQ---LQAYATSPTTILVTWEPPPFGNGPITGYKLFYSEDDTGKELR 568 (1381)
T ss_pred cceeEEEEEeccCcccCCccCCceEEecCCCCCcc---ccccccCcceEEEEecCCCCCCCCceEEEEEEEcCCCCceEE
Confidence 67899999999999999999999999998885543 6666789999999999998888999999998865 34556
Q ss_pred eeecccEEEEcCCCCCCEEEEEEEEEeecCCcccceeeeeeecccccCCCCCCC---ccccceEEEEEEec-----CCCC
Q psy7018 86 ITCDTTSYQLTGLSPHTTYKSVCQVYLISDKHTQKVDLTFESDQLYLSGLPNSE---WPSLRTIILTLYFR-----TAQV 157 (349)
Q Consensus 86 ~~~~~~~~~i~~L~p~t~Y~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~---~~~~~~~~l~w~~~-----~~~~ 157 (349)
...+..+++|.+|.+.++|.|+|.+++..+.|.....+.+.+....|..+|... ..+.++..+.|..+ ++.+
T Consensus 569 ~~~n~~e~ti~gL~k~TeY~~~vvA~N~~G~g~sS~~i~V~Tlsd~PsaPP~Nl~lev~sStsVrVsW~pP~~~t~ng~i 648 (1381)
T KOG4221|consen 569 VENNATEYTINGLEKYTEYSIRVVAYNSAGSGVSSADITVRTLSDVPSAPPQNLSLEVVSSTSVRVSWLPPPSETQNGQI 648 (1381)
T ss_pred EecCccEEEeecCCCccceEEEEEEecCCCCCCCCCceEEEeccCCCCCCCcceEEEecCCCeEEEEccCCCcccccceE
Confidence 677899999999999999999999999888888888888888887777666622 24556778888754 3778
Q ss_pred CeeeEEEeeCCCCCceE-EEecCCceEEEecCCCCCCeEEEEEEEEecccccCCCCcEEEecCCCCC-----CCCCCCeE
Q psy7018 158 PGVPPYVSDKGRDGKYK-SIYNGEAQSCRIQDLKPGTDYAVCVQVHLEEIVGIASEPTLFTTPPCEP-----DQPNPPKL 231 (349)
Q Consensus 158 ~~Y~v~~~~~~~~~~~~-~~~~~~~~~~~i~~L~p~~~Y~~~V~a~~~~g~~~~s~~~~~~t~~~~P-----~~p~~~~~ 231 (349)
.+|.|+|+......... ....++...+.+.+|+|++.|.|+|.|.+..|.|++|++..+.|....+ .+|..+.+
T Consensus 649 tgYkIRy~~~~~~~~~~~t~v~~n~~~~l~~~Lep~T~Y~vrIsa~t~nGtGpaS~w~~aeT~~~d~~e~vp~~ps~l~~ 728 (1381)
T KOG4221|consen 649 TGYKIRYRKLSREDEVNETVVKGNTTQYLFNGLEPNTQYRVRISAMTVNGTGPASEWVSAETPESDLDERVPGKPSELHV 728 (1381)
T ss_pred EEEEEEecccCcccccceeecccchhhhHhhcCCCCceEEEEEEEeccCCCCCcccceeccCccccccccCCCCCceeee
Confidence 99999998755444432 3445688899999999999999999999999999999999998875433 33333332
Q ss_pred EeecCCeEEEEeecCCCCCCcccEEEEEEecCCCCCCcEEe-ecCCcceEEEcCCCCCCeEEEEEEEecCCCCCCCCc
Q psy7018 232 VTKTRTSLALKWNAAVDNGAHVMHYILESDQGNATGDFKEI-SKSKNKNFTLSKLTPSTCFRFRLAAVNQHGKSADAT 308 (349)
Q Consensus 232 ~~~~~~sv~l~W~~~~~~~~~i~~Y~v~~~~~~~~~~~~~~-~~~~~~~~~~~~L~p~t~Y~~~V~A~~~~G~~~~s~ 308 (349)
. ...++|.+.|.++++.+-.+.+|+|.|+...+.+.-..+ .....++|.++.|.++..|.++++|+|..|+|.+..
T Consensus 729 ~-~g~~si~vsW~Pp~~~~~~vrgY~ig~r~g~~~p~~~tIrl~~~~s~y~l~~Le~~~~YvVkL~AfNn~gdG~p~y 805 (1381)
T KOG4221|consen 729 H-PGSNSIVVSWTPPPHPNIVVRGYKIGYRPGSGIPDTGTIRLDEKVSYYNLEQLEPNRDYVVKLRAFNNHGDGNPIY 805 (1381)
T ss_pred c-cCceeEEEEeCCCCChhhhhcceEEeeecccCCCCCccEEecceeeEEEEEecccCceEEEEEEEeccCCCCccee
Confidence 2 346799999999998777888999999765543323333 445667999999999999999999999999997653
No 2
>KOG3513|consensus
Probab=100.00 E-value=6.6e-31 Score=241.22 Aligned_cols=306 Identities=18% Similarity=0.199 Sum_probs=234.1
Q ss_pred CceEEEEeeecCCCCCCCCcceeeeCCCCCCCCCccccccccCCCeeEEEecCCCCCCCCceeEEEEecCCCCCce---e
Q psy7018 11 HKVSTEDRAGNSAGWSPFSETSVITTPPGPPGSLTLRKTSVTTPTSLTVSWSEPANHGSPILHYVLETSDPAQPSI---T 87 (349)
Q Consensus 11 ~~y~~~v~a~n~~G~~~~s~~~~~~t~~~~P~~~~~~~~~~~~~~si~l~W~~~~~~~~~~~~Y~v~~~~~~~~~~---~ 87 (349)
.+|..+++. ..-.......+.++.+|+||. .+.+..+++++++|+|++..++..+|..|.|+.++.....| .
T Consensus 590 G~Y~C~aqT--~~Ds~s~~A~l~V~gpPgpP~---~v~~~~i~~t~~~lsW~~g~dn~SpI~~Y~iq~rt~~~~~W~~v~ 664 (1051)
T KOG3513|consen 590 GKYTCVAQT--ALDSASARADLLVRGPPGPPP---DVHVDDISDTTARLSWSPGSDNNSPIEKYTIQFRTPFPGKWKAVT 664 (1051)
T ss_pred ceEEEEEEE--eecchhcccceEEecCCCCCC---ceeEeeeccceEEEEeecCCCCCCCceEEeEEecCCCCCcceEee
Confidence 345555544 333222344577888888877 44446789999999999999988999999999988533222 1
Q ss_pred -e--cc---cEEEEcCCCCCCEEEEEEEEEe-ecCCcccceeeeeeecccccCCCCCCCc---cccceEEEEEE-----e
Q psy7018 88 -C--DT---TSYQLTGLSPHTTYKSVCQVYL-ISDKHTQKVDLTFESDQLYLSGLPNSEW---PSLRTIILTLY-----F 152 (349)
Q Consensus 88 -~--~~---~~~~i~~L~p~t~Y~~~v~~~~-~~~~~~~~~~~~~~~~~~~~~~~p~~~~---~~~~~~~l~w~-----~ 152 (349)
. .. .+.++.+|.|+.+|+|||.+.+ .+.+..+..+...+|.+..|...|.... ...++..++|+ .
T Consensus 665 ~vp~~~~~~~sa~vv~L~Pwv~YeFRV~AvN~iG~gePS~pS~~~rT~ea~P~~~P~nv~g~g~~~~eLvItW~Pl~~~~ 744 (1051)
T KOG3513|consen 665 TVPGNITGDESATVVNLSPWVEYEFRVVAVNSIGIGEPSPPSEKVRTPEAAPSVNPSNVKGGGGSPTELVITWEPLPEEE 744 (1051)
T ss_pred ECCCcccCccceeEEccCCCcceEEEEEEEcccccCCCCCCccceecCCCCCccCCccccccCCCCceEEEEeccCCHHH
Confidence 1 11 3577889999999999988666 4444466677777887776666665332 34557788887 2
Q ss_pred cCCCCCeeeEEEeeCCCCCceEEEe--cCCceEEEecC--CCCCCeEEEEEEEEecccccCCCCcEEEecCCCCC-CCCC
Q psy7018 153 RTAQVPGVPPYVSDKGRDGKYKSIY--NGEAQSCRIQD--LKPGTDYAVCVQVHLEEIVGIASEPTLFTTPPCEP-DQPN 227 (349)
Q Consensus 153 ~~~~~~~Y~v~~~~~~~~~~~~~~~--~~~~~~~~i~~--L~p~~~Y~~~V~a~~~~g~~~~s~~~~~~t~~~~P-~~p~ 227 (349)
.+|.-.+|.|.|+..+....|.... +.+...+.+.+ ..|.+.|.+.|+|.|..|.|+.+..+.+...++.| .+|.
T Consensus 745 qNG~gfgY~Vswr~~g~~~~W~~~~v~~~d~~~~V~~~~st~~~tpyevKVqa~N~~GeGp~s~~~v~~S~Ed~P~~ap~ 824 (1051)
T KOG3513|consen 745 QNGPGFGYRVSWRPQGADKEWKEVIVSNQDQPRYVVSNESTEPFTPYEVKVQAINDQGEGPESQVTVGYSGEDEPPVAPT 824 (1051)
T ss_pred ccCCCceEEEEEEeCCCCcccceeEecccCCceEEEcCCCCCCcceeEEEEEEecCCCCCCCCceEEEEcCCCCCCCCCc
Confidence 4688899999999988666887543 33334555544 56799999999999999999999999888887766 7888
Q ss_pred CCeEEeecCCeEEEEeecCCCCCCcccEEEEEEecCCCCC--CcEEeecCCcceEEEcCCCCCCeEEEEEEEecCCCCCC
Q psy7018 228 PPKLVTKTRTSLALKWNAAVDNGAHVMHYILESDQGNATG--DFKEISKSKNKNFTLSKLTPSTCFRFRLAAVNQHGKSA 305 (349)
Q Consensus 228 ~~~~~~~~~~sv~l~W~~~~~~~~~i~~Y~v~~~~~~~~~--~~~~~~~~~~~~~~~~~L~p~t~Y~~~V~A~~~~G~~~ 305 (349)
.+.+...+.+.+.|+|+++...+|.+.+|+|+|+..+... .....+.++.+...+.||++.|.|.|.|+|+|.+|.|+
T Consensus 825 ~~~~~~~s~s~~~v~W~~~~~~nG~l~gY~v~Y~~~~~~~~~~~~~~i~~~~~~~~ltgL~~~T~Y~~~vrA~nsaG~Gp 904 (1051)
T KOG3513|consen 825 KLSAKPLSSSEVNLSWKPPLWDNGKLTGYEVKYWKINEKEGSLSRVQIAGNRTSWRLTGLEPNTKYRFYVRAYTSAGGGP 904 (1051)
T ss_pred cceeecccCceEEEEecCcCccCCccceeEEEEEEcCCCcccccceeecCCcceEeeeCCCCCceEEEEEEEecCCCCCC
Confidence 8999999999999999999866799999999998765432 12223447788999999999999999999999999999
Q ss_pred CCceEEeeccCccCCC
Q psy7018 306 DATSVAEWLSYSVYTP 321 (349)
Q Consensus 306 ~s~~~~~~t~~~~p~~ 321 (349)
.|.....+|..++|..
T Consensus 905 ~s~~~~~tt~k~pPs~ 920 (1051)
T KOG3513|consen 905 ASSEENVTTKKAPPSQ 920 (1051)
T ss_pred CccceeccccCCCCcc
Confidence 9999999998888874
No 3
>KOG4221|consensus
Probab=99.97 E-value=1.6e-29 Score=231.44 Aligned_cols=301 Identities=17% Similarity=0.196 Sum_probs=229.6
Q ss_pred EEEEeeecCCCCCCCCcceeeeCCC------CCCCCCccccccccCCCeeEEEecCCCCCCCCceeEEEEecCCCCC---
Q psy7018 14 STEDRAGNSAGWSPFSETSVITTPP------GPPGSLTLRKTSVTTPTSLTVSWSEPANHGSPILHYVLETSDPAQP--- 84 (349)
Q Consensus 14 ~~~v~a~n~~G~~~~s~~~~~~t~~------~~P~~~~~~~~~~~~~~si~l~W~~~~~~~~~~~~Y~v~~~~~~~~--- 84 (349)
.|+|.|.|+.|.-..+..+.+...+ .-|.+|..+.+...+...++++|.+|...++++..|.+.+..+...
T Consensus 395 ~yQcvaen~~G~a~a~a~l~vv~~p~~~~s~~~~sap~~lv~~~~~srfi~~tw~~p~~~~g~i~~~~v~~~~~~~~rer 474 (1381)
T KOG4221|consen 395 FYQCVAENDAGSAQAAAQLEVVPQPASASSDLLPSAPRDLVANLVSSRFIQLTWRPPAQISGNISTYTVFYKVEGDVRER 474 (1381)
T ss_pred hhhhhhccCcccccchhhheeccCCcccccCccccCCcceecccccceeEEEeecCccccCCCcceEEEEEecCCchhhh
Confidence 3578888999876655555443332 2345566666566788999999998887788999999988663221
Q ss_pred --ceeecccEEEEcCCCCCCEEEEEEEEEeecCCcccceeeeeeecccccCCCCCCCccccceEEEEEEec---CCCCCe
Q psy7018 85 --SITCDTTSYQLTGLSPHTTYKSVCQVYLISDKHTQKVDLTFESDQLYLSGLPNSEWPSLRTIILTLYFR---TAQVPG 159 (349)
Q Consensus 85 --~~~~~~~~~~i~~L~p~t~Y~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~l~w~~~---~~~~~~ 159 (349)
.........++.+|+|.+.|.|+|++.+..+.+.....+.+.+..-.+.. -.....+..+..++|+.+ ++++.+
T Consensus 475 ~~~tss~g~~~tv~nl~p~t~Y~~rv~A~n~~g~g~sS~pLkV~t~pEgp~~-~~a~ats~~ti~v~WepP~~~n~~I~~ 553 (1381)
T KOG4221|consen 475 LQNTSSPGIQVTVQNLSPLTMYFFRVRAKNEAGSGESSAPLKVTTQPEGPVQ-LQAYATSPTTILVTWEPPPFGNGPITG 553 (1381)
T ss_pred heeccCCceEEEeeecccceeEEEEEeccCcccCCccCCceEEecCCCCCcc-ccccccCcceEEEEecCCCCCCCCceE
Confidence 11222378899999999999998877776555555555555443321111 112224556888899864 478999
Q ss_pred eeEEEeeCCCCCceEEEecCCceEEEecCCCCCCeEEEEEEEEecccccCCCCcEEEecCCCCCCCCC-CCeEEeecCCe
Q psy7018 160 VPPYVSDKGRDGKYKSIYNGEAQSCRIQDLKPGTDYAVCVQVHLEEIVGIASEPTLFTTPPCEPDQPN-PPKLVTKTRTS 238 (349)
Q Consensus 160 Y~v~~~~~~~~~~~~~~~~~~~~~~~i~~L~p~~~Y~~~V~a~~~~g~~~~s~~~~~~t~~~~P~~p~-~~~~~~~~~~s 238 (349)
|.+.|...+ .+.+..+ ..+.++++|.+|.+.++|.|+|.|.|..|.|..|..+.+.|..+.|++|. ++++...+.++
T Consensus 554 yk~~ys~~~-~~~~~~~-~~n~~e~ti~gL~k~TeY~~~vvA~N~~G~g~sS~~i~V~Tlsd~PsaPP~Nl~lev~sSts 631 (1381)
T KOG4221|consen 554 YKLFYSEDD-TGKELRV-ENNATEYTINGLEKYTEYSIRVVAYNSAGSGVSSADITVRTLSDVPSAPPQNLSLEVVSSTS 631 (1381)
T ss_pred EEEEEEcCC-CCceEEE-ecCccEEEeecCCCccceEEEEEEecCCCCCCCCCceEEEeccCCCCCCCcceEEEecCCCe
Confidence 999998864 3344333 56778999999999999999999999999999999999999999996655 49999999999
Q ss_pred EEEEeecCC--CCCCcccEEEEEEecCCCCCCcE-EeecCCcceEEEcCCCCCCeEEEEEEEecCCCCCCCCceEEeecc
Q psy7018 239 LALKWNAAV--DNGAHVMHYILESDQGNATGDFK-EISKSKNKNFTLSKLTPSTCFRFRLAAVNQHGKSADATSVAEWLS 315 (349)
Q Consensus 239 v~l~W~~~~--~~~~~i~~Y~v~~~~~~~~~~~~-~~~~~~~~~~~~~~L~p~t~Y~~~V~A~~~~G~~~~s~~~~~~t~ 315 (349)
|.++|.+|. ..++.+.+|+|+++......+.. ....++.+++.+.+|+|++.|.|+|.|.+..|.|++|+++.+.|+
T Consensus 632 VrVsW~pP~~~t~ng~itgYkIRy~~~~~~~~~~~t~v~~n~~~~l~~~Lep~T~Y~vrIsa~t~nGtGpaS~w~~aeT~ 711 (1381)
T KOG4221|consen 632 VRVSWLPPPSETQNGQITGYKIRYRKLSREDEVNETVVKGNTTQYLFNGLEPNTQYRVRISAMTVNGTGPASEWVSAETP 711 (1381)
T ss_pred EEEEccCCCcccccceEEEEEEEecccCcccccceeecccchhhhHhhcCCCCceEEEEEEEeccCCCCCcccceeccCc
Confidence 999999998 44788999999998765434332 344557889999999999999999999999999999999999998
Q ss_pred Cc
Q psy7018 316 YS 317 (349)
Q Consensus 316 ~~ 317 (349)
..
T Consensus 712 ~~ 713 (1381)
T KOG4221|consen 712 ES 713 (1381)
T ss_pred cc
Confidence 53
No 4
>KOG3513|consensus
Probab=99.94 E-value=9.7e-24 Score=194.48 Aligned_cols=303 Identities=17% Similarity=0.142 Sum_probs=223.3
Q ss_pred CCCceEEEEeeecCCCCCCCCcce-eeeCCCCCC-CCCccccccccCCCeeEEEecCCCCC--CCCceeEEEEecCCC--
Q psy7018 9 TDHKVSTEDRAGNSAGWSPFSETS-VITTPPGPP-GSLTLRKTSVTTPTSLTVSWSEPANH--GSPILHYVLETSDPA-- 82 (349)
Q Consensus 9 p~~~y~~~v~a~n~~G~~~~s~~~-~~~t~~~~P-~~~~~~~~~~~~~~si~l~W~~~~~~--~~~~~~Y~v~~~~~~-- 82 (349)
|.-.|.|||.|.|..|.|..|.+. .++|.+++| .+|.++.......+.+.|+|++.... +++-.+|.|.|+...
T Consensus 683 Pwv~YeFRV~AvN~iG~gePS~pS~~~rT~ea~P~~~P~nv~g~g~~~~eLvItW~Pl~~~~qNG~gfgY~Vswr~~g~~ 762 (1051)
T KOG3513|consen 683 PWVEYEFRVVAVNSIGIGEPSPPSEKVRTPEAAPSVNPSNVKGGGGSPTELVITWEPLPEEEQNGPGFGYRVSWRPQGAD 762 (1051)
T ss_pred CCcceEEEEEEEcccccCCCCCCccceecCCCCCccCCccccccCCCCceEEEEeccCCHHHccCCCceEEEEEEeCCCC
Confidence 778999999999999999987664 467999999 58999987888899999999987542 678899999997632
Q ss_pred --CCceee---cccEEEEc--CCCCCCEEEEEEEEEeecCCcccceeeeeeecccccCCCCCCCc---cccceEEEEEE-
Q psy7018 83 --QPSITC---DTTSYQLT--GLSPHTTYKSVCQVYLISDKHTQKVDLTFESDQLYLSGLPNSEW---PSLRTIILTLY- 151 (349)
Q Consensus 83 --~~~~~~---~~~~~~i~--~L~p~t~Y~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~---~~~~~~~l~w~- 151 (349)
+..... +...+.+. ...|.+.|+++|++.|..+.|+.......-+.+..|+.+|.... .+.+...|.|.
T Consensus 763 ~~W~~~~v~~~d~~~~V~~~~st~~~tpyevKVqa~N~~GeGp~s~~~v~~S~Ed~P~~ap~~~~~~~~s~s~~~v~W~~ 842 (1051)
T KOG3513|consen 763 KEWKEVIVSNQDQPRYVVSNESTEPFTPYEVKVQAINDQGEGPESQVTVGYSGEDEPPVAPTKLSAKPLSSSEVNLSWKP 842 (1051)
T ss_pred cccceeEecccCCceEEEcCCCCCCcceeEEEEEEecCCCCCCCCceEEEEcCCCCCCCCCccceeecccCceEEEEecC
Confidence 222222 23455554 46779999999998886566666655555555555555555433 45668888994
Q ss_pred --ecCCCCCeeeEEEeeCCCC-C-ceEEEecCCceEEEecCCCCCCeEEEEEEEEecccccCCCCcEEEecCCCCCCC--
Q psy7018 152 --FRTAQVPGVPPYVSDKGRD-G-KYKSIYNGEAQSCRIQDLKPGTDYAVCVQVHLEEIVGIASEPTLFTTPPCEPDQ-- 225 (349)
Q Consensus 152 --~~~~~~~~Y~v~~~~~~~~-~-~~~~~~~~~~~~~~i~~L~p~~~Y~~~V~a~~~~g~~~~s~~~~~~t~~~~P~~-- 225 (349)
|.+|.+.+|+|+|+..+.. . .......++.++..|.+|++.+.|.|.|+|++.+|.|+.|......+...+|..
T Consensus 843 ~~~~nG~l~gY~v~Y~~~~~~~~~~~~~~i~~~~~~~~ltgL~~~T~Y~~~vrA~nsaG~Gp~s~~~~~tt~k~pPs~~~ 922 (1051)
T KOG3513|consen 843 PLWDNGKLTGYEVKYWKINEKEGSLSRVQIAGNRTSWRLTGLEPNTKYRFYVRAYTSAGGGPASSEENVTTKKAPPSQVD 922 (1051)
T ss_pred cCccCCccceeEEEEEEcCCCcccccceeecCCcceEeeeCCCCCceEEEEEEEecCCCCCCCccceeccccCCCCcccc
Confidence 7789999999999875422 2 222333688999999999999999999999999999999999888888877764
Q ss_pred --CCCCeEEeecCCeEEEEeecCC--CCCCcccEEEEEEecCCCCCCcEEeecC-CcceEEEcCCCCCCeEEEEEEEecC
Q psy7018 226 --PNPPKLVTKTRTSLALKWNAAV--DNGAHVMHYILESDQGNATGDFKEISKS-KNKNFTLSKLTPSTCFRFRLAAVNQ 300 (349)
Q Consensus 226 --p~~~~~~~~~~~sv~l~W~~~~--~~~~~i~~Y~v~~~~~~~~~~~~~~~~~-~~~~~~~~~L~p~t~Y~~~V~A~~~ 300 (349)
|....+.......+.|.|.... .+...+.+|.|.++.+............ ....-...++ +..|.++|++...
T Consensus 923 ~~p~g~~~~~~~~~~~~l~w~~v~~~~nes~v~gYkV~~~~~~~~~~~~~~t~~~~~~~~~p~~~--~~~y~i~v~~~~~ 1000 (1051)
T KOG3513|consen 923 IAPPGNFIWKFSASILLLLWLLVSAFENESEVGGYKVLYREDLQNDIEIILTMKLDAEFPEPSDL--DGKYDIKVRGYSP 1000 (1051)
T ss_pred cCCCcceEEeeeeeEEEEEEeeEEEEeecccCcceEEEEeecccCCceeEecccccccccCcccc--CCcceeEeccccC
Confidence 6666777788889999998765 3445588999999887643322222111 1111122333 2689999999999
Q ss_pred CCCCCCCceEEee
Q psy7018 301 HGKSADATSVAEW 313 (349)
Q Consensus 301 ~G~~~~s~~~~~~ 313 (349)
+|.|..++.....
T Consensus 1001 ggdg~~s~~~~~~ 1013 (1051)
T KOG3513|consen 1001 GGDGTISEEISID 1013 (1051)
T ss_pred CCCCCcCCccccc
Confidence 9999777766554
No 5
>KOG0196|consensus
Probab=99.89 E-value=5.1e-22 Score=176.22 Aligned_cols=236 Identities=19% Similarity=0.212 Sum_probs=166.9
Q ss_pred cccccCCCCceEEEEeeecCCCC---CCC-C--cceeeeCCCCCCCCCccccccccCCCeeEEEecCCCCCCC-CceeEE
Q psy7018 3 MVCQSCTDHKVSTEDRAGNSAGW---SPF-S--ETSVITTPPGPPGSLTLRKTSVTTPTSLTVSWSEPANHGS-PILHYV 75 (349)
Q Consensus 3 ~~~~~~p~~~y~~~v~a~n~~G~---~~~-s--~~~~~~t~~~~P~~~~~~~~~~~~~~si~l~W~~~~~~~~-~~~~Y~ 75 (349)
+.|.+||.++++-.-.|..+... ... + ..+.++. ||++|.++. .+++.+++.|.|.+|.+.|+ .-..|.
T Consensus 291 ~~C~~CP~~S~s~~ega~~C~C~~gyyRA~~Dp~~mpCT~---PPSaP~nli-s~vn~Ts~~L~W~~P~d~GGR~Di~y~ 366 (996)
T KOG0196|consen 291 SLCLPCPPNSHSSSEGATSCTCENGYYRADSDPPSMPCTR---PPSAPRNLI-SNVNGTSLILEWSPPADTGGREDITYN 366 (996)
T ss_pred CCCCCCCCCCCCCCCCCCcccccCCcccCCCCCCCCCCCC---CCCccceee-eecccceEEEEecCCcccCCCcceEEE
Confidence 57999999998877777776432 221 2 2233444 455555554 34899999999999987655 223343
Q ss_pred EEecCCCCCceeecccEEEEcCCCCCCEEEEEEEEEeecCCcccceeeeeeecccccCCCCCCCccccceEEEEEEecCC
Q psy7018 76 LETSDPAQPSITCDTTSYQLTGLSPHTTYKSVCQVYLISDKHTQKVDLTFESDQLYLSGLPNSEWPSLRTIILTLYFRTA 155 (349)
Q Consensus 76 v~~~~~~~~~~~~~~~~~~i~~L~p~t~Y~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~l~w~~~~~ 155 (349)
|..+..... .+. .
T Consensus 367 v~Ck~c~~~-------------------------------~~~--C---------------------------------- 379 (996)
T KOG0196|consen 367 VICKKCGGG-------------------------------RGA--C---------------------------------- 379 (996)
T ss_pred EEeeccCCC-------------------------------CCc--c----------------------------------
Confidence 332221100 000 0
Q ss_pred CCCeeeEEEeeCCCCCceEEEecCCceEEEecCCCCCCeEEEEEEEEecccc-cC---CCCcEEEecCCCCCCCCCCCeE
Q psy7018 156 QVPGVPPYVSDKGRDGKYKSIYNGEAQSCRIQDLKPGTDYAVCVQVHLEEIV-GI---ASEPTLFTTPPCEPDQPNPPKL 231 (349)
Q Consensus 156 ~~~~Y~v~~~~~~~~~~~~~~~~~~~~~~~i~~L~p~~~Y~~~V~a~~~~g~-~~---~s~~~~~~t~~~~P~~p~~~~~ 231 (349)
...+-.|.|.+... .-..++..+.+|.+.+.|.|.|.|.|+... ++ ....+.++|....|++...++.
T Consensus 380 ~~Cg~~V~f~P~q~--------gLt~~~V~v~~L~ah~~YTFeV~AvNgVS~lsp~~~~~a~vnItt~qa~ps~V~~~r~ 451 (996)
T KOG0196|consen 380 EPCGDNVRFTPRQR--------GLTETSVTVSDLLAHTNYTFEVEAVNGVSDLSPFPRQFASVNITTNQAAPSPVSVLRQ 451 (996)
T ss_pred ccCCCCceECCCCC--------CcccceEEEeccccccccEEEEEEeecccccCCCCCcceeEEeeccccCCCccceEEE
Confidence 00112223322110 123456889999999999999999997532 22 2456688888888888889999
Q ss_pred EeecCCeEEEEeecCCCCCCcccEEEEEEecCCCCCCcEEeecCCcceEEEcCCCCCCeEEEEEEEecCCCCCCCCceEE
Q psy7018 232 VTKTRTSLALKWNAAVDNGAHVMHYILESDQGNATGDFKEISKSKNKNFTLSKLTPSTCFRFRLAAVNQHGKSADATSVA 311 (349)
Q Consensus 232 ~~~~~~sv~l~W~~~~~~~~~i~~Y~v~~~~~~~~~~~~~~~~~~~~~~~~~~L~p~t~Y~~~V~A~~~~G~~~~s~~~~ 311 (349)
...+.++++|+|..|..+++.|..|+|+|.+.+.++.-........+..++.+|+|++.|.|+|+|++..|.|.+|....
T Consensus 452 ~~~~~~sitlsW~~p~~png~ildYEvky~ek~~~e~~~~~~~t~~~~~ti~gL~p~t~YvfqVRarT~aG~G~~S~~~~ 531 (996)
T KOG0196|consen 452 VSRTSDSITLSWSEPDQPNGVILDYEVKYYEKDEDERSYSTLKTKTTTATITGLKPGTVYVFQVRARTAAGYGPYSGKHE 531 (996)
T ss_pred eeeccCceEEecCCCCCCCCcceeEEEEEeeccccccceeEEecccceEEeeccCCCcEEEEEEEEecccCCCCCCCcee
Confidence 99999999999999998888999999999887644444455667778999999999999999999999999999999999
Q ss_pred eeccCc
Q psy7018 312 EWLSYS 317 (349)
Q Consensus 312 ~~t~~~ 317 (349)
+.|.+.
T Consensus 532 fqT~~~ 537 (996)
T KOG0196|consen 532 FQTLPS 537 (996)
T ss_pred eeecCc
Confidence 999754
No 6
>KOG4222|consensus
Probab=99.76 E-value=1.8e-16 Score=146.08 Aligned_cols=305 Identities=16% Similarity=0.198 Sum_probs=204.5
Q ss_pred CceEEEEeeecCCCCCCCCcceeeeCC------------CCCCCCCccccccccCCCeeEEEecCCCCC-CCCceeEEEE
Q psy7018 11 HKVSTEDRAGNSAGWSPFSETSVITTP------------PGPPGSLTLRKTSVTTPTSLTVSWSEPANH-GSPILHYVLE 77 (349)
Q Consensus 11 ~~y~~~v~a~n~~G~~~~s~~~~~~t~------------~~~P~~~~~~~~~~~~~~si~l~W~~~~~~-~~~~~~Y~v~ 77 (349)
..|.|.+.|.+..|.+.|+..+.+... ...|+.|..-.+..++.++++|.|.+.... ..++.+|.++
T Consensus 485 DtG~YTc~as~~~ges~wSatl~v~~~~~s~q~~r~~D~S~~pS~p~~p~v~~v~~~~v~LsW~~~s~sg~vP~s~yiie 564 (1281)
T KOG4222|consen 485 DTGRYTCIASDESGESTWSATLTVEKAGSSQQFCRCEDPSALPSPPGTPGVVNVSRTSVTLSWQPTSPSGAVPASGYIIE 564 (1281)
T ss_pred CCcceeeeccCcccccccceeeEhhhcCcccccccCCChhhCCCCCCCCccccCCCceEEecccCCCCCCccccchhHHH
Confidence 578999999999999999877766411 112333333333468999999999998764 4588899997
Q ss_pred ecC----CCCCc--eeecccEEEEcCCCCCCEEEEEEEEEeecCCc-ccceeeeeeecccccCCC-------------CC
Q psy7018 78 TSD----PAQPS--ITCDTTSYQLTGLSPHTTYKSVCQVYLISDKH-TQKVDLTFESDQLYLSGL-------------PN 137 (349)
Q Consensus 78 ~~~----~~~~~--~~~~~~~~~i~~L~p~t~Y~~~v~~~~~~~~~-~~~~~~~~~~~~~~~~~~-------------p~ 137 (349)
+-. ..+.. .....+.+.|.+|.|++.|.|.||+.+..+.+ .+..+-..++.+..+... +.
T Consensus 565 afs~~~~etw~~ta~~v~~t~~~I~gL~P~~sylf~vRa~n~~Gis~Ps~~S~~vrta~a~~~~a~ad~~k~~~~ls~~l 644 (1281)
T KOG4222|consen 565 AFSPDLGETWQTTAGRVKTTTYAIRGLKPNLSYLFLVRAENEQGISDPSTSSDPVRTAPADAAAAGADHQKVQRELSNEL 644 (1281)
T ss_pred HhhhhhcccccccccccccceeeecCcCccceeeeeeeccccccccCCcccCCccccCCCChhhhhhhHHHHHHhhcccc
Confidence 733 11222 22347889999999999999998876643332 222222222222111100 00
Q ss_pred -----CCccccceEEEEEEec----CCCCCeeeEEEeeCCC-CCceE--EEecCCceEEEecCCCCCCeEEEEEEEEecc
Q psy7018 138 -----SEWPSLRTIILTLYFR----TAQVPGVPPYVSDKGR-DGKYK--SIYNGEAQSCRIQDLKPGTDYAVCVQVHLEE 205 (349)
Q Consensus 138 -----~~~~~~~~~~l~w~~~----~~~~~~Y~v~~~~~~~-~~~~~--~~~~~~~~~~~i~~L~p~~~Y~~~V~a~~~~ 205 (349)
....+.+...+.|... ...+.+|+|.|+..+. ...|. .+......++.+.+|.|++.|++.+..+...
T Consensus 645 ~~l~~~~~L~asslr~~w~~~kq~~~~~i~g~~I~~r~~~~~~a~~s~~~v~~~t~~s~v~~nl~p~t~ye~f~~Pf~~~ 724 (1281)
T KOG4222|consen 645 LRLSNPNVLNASSLRLGWTKDKQHGSQYIQGYRISYRSLGSQLAQWSNAGVTVPTPESVVVPNLKPGTNYEFFVRPFFPH 724 (1281)
T ss_pred eeeccccccchhheeeeeeeecccCcccccceEEEeccCccccccccccceeccCCcceeccccCCCccceeeccCccCC
Confidence 1112334556666633 2458899999998653 23332 2234556678999999999999999998774
Q ss_pred c---ccCCCCcEEEecCCCCCCCC-CCC---eEEeecCCeEEEEeecCC--CCCCcccEEEEEEecCCCCC-CcEEeecC
Q psy7018 206 I---VGIASEPTLFTTPPCEPDQP-NPP---KLVTKTRTSLALKWNAAV--DNGAHVMHYILESDQGNATG-DFKEISKS 275 (349)
Q Consensus 206 g---~~~~s~~~~~~t~~~~P~~p-~~~---~~~~~~~~sv~l~W~~~~--~~~~~i~~Y~v~~~~~~~~~-~~~~~~~~ 275 (349)
| .|..+....+.+...+|.+| .++ ..-....+++.|+|.+++ ..++.+.+|.|.+...+... .....+..
T Consensus 725 ~~s~~g~pS~sk~alt~e~~PSapp~~~~~~s~~~~n~Ta~~Vsw~~pp~d~~ng~~qg~ki~~~~~e~tr~h~n~t~~a 804 (1281)
T KOG4222|consen 725 GYSIQGAPSNSKTALTLEEPPSAPPQGVQHVSKGSYNGTAGSVSWAPPPADVQNGILQGYKIECSGGEKTRIHINKTTNA 804 (1281)
T ss_pred CcceecCCcccccccccccCCCCCCCCccccccccCCCceeeEEecCCcccccCCcccceeEEeecCccccccccccccC
Confidence 3 56667788888888888544 553 344456889999999997 33688899999886654211 11222334
Q ss_pred CcceEEEcCCCCCCeEEEEEEEecCCCCCCCCceEEeecc
Q psy7018 276 KNKNFTLSKLTPSTCFRFRLAAVNQHGKSADATSVAEWLS 315 (349)
Q Consensus 276 ~~~~~~~~~L~p~t~Y~~~V~A~~~~G~~~~s~~~~~~t~ 315 (349)
...+.++.+|.+|..|.|+|.|.+.+|.|..++...+.+.
T Consensus 805 ~~~sv~i~~l~~g~ay~vtv~a~T~aGvG~~s~p~~~~~~ 844 (1281)
T KOG4222|consen 805 RTGSVTIGNLVTGIAYSVTVAARTGAGVGVKSPPQPIVFD 844 (1281)
T ss_pred CCCceEeccccccceEEEEEeeecCCccCCCCCCeeeecC
Confidence 4558899999999999999999999999988877665443
No 7
>PF00041 fn3: Fibronectin type III domain; InterPro: IPR003961 Fibronectins are multi-domain glycoproteins found in a soluble form in plasma, and in an insoluble form in loose connective tissue and basement membranes []. They contain multiple copies of 3 repeat regions (types I, II and III), which bind to a variety of substances including heparin, collagen, DNA, actin, fibrin and fibronectin receptors on cell surfaces. The wide variety of these substances means that fibronectins are involved in a number of important functions: e.g., wound healing; cell adhesion; blood coagulation; cell differentiation and migration; maintenance of the cellular cytoskeleton; and tumour metastasis []. The role of fibronectin in cell differentiation is demonstrated by the marked reduction in the expression of its gene when neoplastic transformation occurs. Cell attachment has been found to be mediated by the binding of the tetrapeptide RGDS to integrins on the cell surface [], although related sequences can also display cell adhesion activity. Plasma fibronectin occurs as a dimer of 2 different subunits, linked together by 2 disulphide bonds near the C terminus. The difference in the 2 chains occurs in the type III repeat region and is caused by alternative splicing of the mRNA from one gene []. The observation that, in a given protein, an individual repeat of one of the 3 types (e.g., the first FnIII repeat) shows much less similarity to its subsequent tandem repeats within that protein than to its equivalent repeat between fibronectins from other species, has suggested that the repeating structure of fibronectin arose at an early stage of evolution. It also seems to suggest that the structure is subject to high selective pressure []. The fibronectin type III repeat region is an approximately 100 amino acid domain, different tandem repeats of which contain binding sites for DNA, heparin and the cell surface []. The superfamily of sequences believed to contain FnIII repeats represents 45 different families, the majority of which are involved in cell surface binding in some manner, or are receptor protein tyrosine kinases, or cytokine receptors.; GO: 0005515 protein binding; PDB: 1UEM_A 1TDQ_A 1X5I_A 2IC2_B 2IBG_C 2IBB_A 3R8Q_A 2FNB_A 1FNH_A 2EDB_A ....
Probab=99.45 E-value=4.7e-13 Score=91.32 Aligned_cols=83 Identities=24% Similarity=0.499 Sum_probs=69.7
Q ss_pred CCCCCeEEeecCCeEEEEeecCCCCCCcccEEEEEEecCCCCCCcEEe-ecCCcceEEEcCCCCCCeEEEEEEEecCCCC
Q psy7018 225 QPNPPKLVTKTRTSLALKWNAAVDNGAHVMHYILESDQGNATGDFKEI-SKSKNKNFTLSKLTPSTCFRFRLAAVNQHGK 303 (349)
Q Consensus 225 ~p~~~~~~~~~~~sv~l~W~~~~~~~~~i~~Y~v~~~~~~~~~~~~~~-~~~~~~~~~~~~L~p~t~Y~~~V~A~~~~G~ 303 (349)
+|.++++...+.+++.|+|+.+...++.+.+|.|+++.......+... .......+.+.+|+|++.|.|+|+|.+..|.
T Consensus 2 ~P~~l~v~~~~~~sv~v~W~~~~~~~~~~~~y~v~~~~~~~~~~~~~~~~~~~~~~~~i~~L~p~t~Y~~~v~a~~~~g~ 81 (85)
T PF00041_consen 2 APENLSVSNISPTSVTVSWKPPSSGNGPITGYRVEYRSVNSTSDWQEVTVPGNETSYTITGLQPGTTYEFRVRAVNSDGE 81 (85)
T ss_dssp SSEEEEEEEECSSEEEEEEEESSSTSSSESEEEEEEEETTSSSEEEEEEEETTSSEEEEESCCTTSEEEEEEEEEETTEE
T ss_pred cCcCeEEEECCCCEEEEEEECCCCCCCCeeEEEEEEEecccceeeeeeeeeeeeeeeeeccCCCCCEEEEEEEEEeCCcC
Confidence 678899999999999999999986678899999999887654423333 4455669999999999999999999999998
Q ss_pred CCCC
Q psy7018 304 SADA 307 (349)
Q Consensus 304 ~~~s 307 (349)
|++|
T Consensus 82 g~~S 85 (85)
T PF00041_consen 82 GPPS 85 (85)
T ss_dssp EEEE
T ss_pred cCCC
Confidence 8754
No 8
>KOG4258|consensus
Probab=99.44 E-value=5.5e-13 Score=119.94 Aligned_cols=291 Identities=16% Similarity=0.203 Sum_probs=162.2
Q ss_pred CCCceEEEEeeecC--CCCC--CCCcceeeeCCCCCCCCCccccccccCCCeeEEEecCCCCCCCCceeEEEEecCCCCC
Q psy7018 9 TDHKVSTEDRAGNS--AGWS--PFSETSVITTPPGPPGSLTLRKTSVTTPTSLTVSWSEPANHGSPILHYVLETSDPAQP 84 (349)
Q Consensus 9 p~~~y~~~v~a~n~--~G~~--~~s~~~~~~t~~~~P~~~~~~~~~~~~~~si~l~W~~~~~~~~~~~~Y~v~~~~~~~~ 84 (349)
|.+.|.+-|++... .+.. ..|+...++|.++.|+.|...-..+-+++.|.|+|++|..+++.+++|.|.|......
T Consensus 575 P~TqYAvfVkT~t~t~~~~~~~A~S~I~YvqT~~~~PspPl~~ls~snsSsqi~l~W~pP~~pNG~lt~Ylv~wer~~~~ 654 (1025)
T KOG4258|consen 575 PWTQYAVFVKTLTVTEAHEAYEAKSKIGYVQTLPDIPSPPLDVLSKSNSSSQILLKWKPPSQPNGNLTHYLVVWERQAED 654 (1025)
T ss_pred ccceeEEEEeeeehhhhccccccccceEEEEecCCCCCCcchhhhccCcchheeEEecCCCCCCCceeEEEEEEEeccCC
Confidence 78999999999854 2222 3488899999999998888776555567799999999999999999999988542110
Q ss_pred ---------------ceeecccEEEEcCCCCCCEEEEEEEEEeecCCcccceee---------------eeee---cccc
Q psy7018 85 ---------------SITCDTTSYQLTGLSPHTTYKSVCQVYLISDKHTQKVDL---------------TFES---DQLY 131 (349)
Q Consensus 85 ---------------~~~~~~~~~~i~~L~p~t~Y~~~v~~~~~~~~~~~~~~~---------------~~~~---~~~~ 131 (349)
.......++..++..|-.+++= ..+........ .+.. ...-
T Consensus 655 ~yl~~~nYC~~~~k~p~~~~~p~~~~ed~d~~~e~e~------~~~~Cc~c~~~~~~~~~e~eea~~~~~FEd~L~n~i~ 728 (1025)
T KOG4258|consen 655 GYLEQRNYCHKGLKLPIRADLPSFDSEDMDPLLEMEG------HTGPCCSCPPTESYPQYEDEEASEQKTFEDFLHNAIF 728 (1025)
T ss_pred chHHHhccccccccccccccCCCCchhhcchhhhhcc------CCCCCCCCCcccccCchhhHHHHHHHHHhhhccceee
Confidence 0000111222222222222220 00000000000 0000 0000
Q ss_pred cCCCCCCC---ccccceEEEEEE----e-c--CCCCCeeeEEEeeCCCCCceEEEecCCceEEEecCCCCCCeEEEEEEE
Q psy7018 132 LSGLPNSE---WPSLRTIILTLY----F-R--TAQVPGVPPYVSDKGRDGKYKSIYNGEAQSCRIQDLKPGTDYAVCVQV 201 (349)
Q Consensus 132 ~~~~p~~~---~~~~~~~~l~w~----~-~--~~~~~~Y~v~~~~~~~~~~~~~~~~~~~~~~~i~~L~p~~~Y~~~V~a 201 (349)
.+..+... .....+...... . + .+...-|+++-- +...+ .++..++.+.+|+.++.|.+.++|
T Consensus 729 vpr~~~~krk~l~~~~n~t~~~~~~~~~~p~t~~~t~p~ei~e~---~p~~~----n~n~~~~vi~~Lrh~tlY~i~l~a 801 (1025)
T KOG4258|consen 729 VPRRPDRKRKSLDDVENCTRLAPTRKAEEPTTPPTTAPTEIEEP---KPRLE----NGNKESYVISGLRHFTLYRIDLQA 801 (1025)
T ss_pred ecccCcccccccccccceeeccccccccCCCCCCCCCCcccccc---Ccccc----cccchhhhhhccccchhhhhhHhh
Confidence 00000000 000000000000 0 0 001112232210 01111 367778999999999999999999
Q ss_pred Eecccc----cCCCCcEEEecCC--CCCCCCCCCeEEe-ecCCeEEEEeecCCCCCCcccEEEEEEecCCCCCCcEEeec
Q psy7018 202 HLEEIV----GIASEPTLFTTPP--CEPDQPNPPKLVT-KTRTSLALKWNAAVDNGAHVMHYILESDQGNATGDFKEISK 274 (349)
Q Consensus 202 ~~~~g~----~~~s~~~~~~t~~--~~P~~p~~~~~~~-~~~~sv~l~W~~~~~~~~~i~~Y~v~~~~~~~~~~~~~~~~ 274 (349)
+|..+. +..+ .+..+|.. ++-..|..+.-.. ...+++.|+|.+|..+++-|..|.|.++....+....-+..
T Consensus 802 Cnh~~~~~~cS~a~-~v~~RT~~~~~aD~i~g~v~we~~~~~~~v~l~w~EP~~pNGli~~Y~Vk~r~~~~et~v~cvsR 880 (1025)
T KOG4258|consen 802 CNHATPKCGCSHAA-FVFARTMPTMGADDIPGPVTWECHIEMNSVILRWLEPKEPNGLILNYEVKYRRNGDETHVECVSR 880 (1025)
T ss_pred hcccccccccchhh-hhhhccccccccccCCCceeEecccCcceEEEecCCCCCCCccEEEEEEEEeeccCcchhhhhhh
Confidence 998665 2111 11112211 1112222222222 37899999999999999999999999986654322211111
Q ss_pred ---CCcceEEEcCCCCCCeEEEEEEEecCCCCCCCCceEEeec
Q psy7018 275 ---SKNKNFTLSKLTPSTCFRFRLAAVNQHGKSADATSVAEWL 314 (349)
Q Consensus 275 ---~~~~~~~~~~L~p~t~Y~~~V~A~~~~G~~~~s~~~~~~t 314 (349)
.....+.+..|.+| .|.++|+|..-+|.|++.+.+.+..
T Consensus 881 ~~~~k~~gv~l~~l~~G-~y~~~vratSlaGng~~t~~~~~~v 922 (1025)
T KOG4258|consen 881 MDYAKAGGVYLKRLNPG-NYSVRVRATSLAGNGSWTESVIFYV 922 (1025)
T ss_pred hhhhhcCceEEEecCCC-cEEEEEEEEeeccCCCCcCCcceEE
Confidence 12337788999998 8999999999999999888766543
No 9
>KOG0196|consensus
Probab=99.15 E-value=9.1e-10 Score=99.48 Aligned_cols=134 Identities=22% Similarity=0.190 Sum_probs=101.2
Q ss_pred ecccEEEEcCCCCCCEEEEEEEEEe-ecCCc---ccceeeeeeecccccCCCCCCCccc--cceEEEEEEe---cCCCCC
Q psy7018 88 CDTTSYQLTGLSPHTTYKSVCQVYL-ISDKH---TQKVDLTFESDQLYLSGLPNSEWPS--LRTIILTLYF---RTAQVP 158 (349)
Q Consensus 88 ~~~~~~~i~~L~p~t~Y~~~v~~~~-~~~~~---~~~~~~~~~~~~~~~~~~p~~~~~~--~~~~~l~w~~---~~~~~~ 158 (349)
..+.++++.+|.|.++|+|.|.+.| .++.. .....+.+.+.+..++..+...... .++..|.|.. +++.+.
T Consensus 395 Lt~~~V~v~~L~ah~~YTFeV~AvNgVS~lsp~~~~~a~vnItt~qa~ps~V~~~r~~~~~~~sitlsW~~p~~png~il 474 (996)
T KOG0196|consen 395 LTETSVTVSDLLAHTNYTFEVEAVNGVSDLSPFPRQFASVNITTNQAAPSPVSVLRQVSRTSDSITLSWSEPDQPNGVIL 474 (996)
T ss_pred cccceEEEeccccccccEEEEEEeecccccCCCCCcceeEEeeccccCCCccceEEEeeeccCceEEecCCCCCCCCcce
Confidence 4578899999999999999988655 33322 3446667777777666655554433 3466677764 346789
Q ss_pred eeeEEEeeCCCCCceEEEecCCceEEEecCCCCCCeEEEEEEEEecccccCCCCcEEEecCCC
Q psy7018 159 GVPPYVSDKGRDGKYKSIYNGEAQSCRIQDLKPGTDYAVCVQVHLEEIVGIASEPTLFTTPPC 221 (349)
Q Consensus 159 ~Y~v~~~~~~~~~~~~~~~~~~~~~~~i~~L~p~~~Y~~~V~a~~~~g~~~~s~~~~~~t~~~ 221 (349)
.|+|+|.+.+.+...........+..++.+|+|++.|.|+|+|...+|.|..|....|.|...
T Consensus 475 dYEvky~ek~~~e~~~~~~~t~~~~~ti~gL~p~t~YvfqVRarT~aG~G~~S~~~~fqT~~~ 537 (996)
T KOG0196|consen 475 DYEVKYYEKDEDERSYSTLKTKTTTATITGLKPGTVYVFQVRARTAAGYGPYSGKHEFQTLPS 537 (996)
T ss_pred eEEEEEeeccccccceeEEecccceEEeeccCCCcEEEEEEEEecccCCCCCCCceeeeecCc
Confidence 999999987644443444466778899999999999999999999999999999999998763
No 10
>cd00063 FN3 Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
Probab=99.04 E-value=5.3e-09 Score=71.97 Aligned_cols=89 Identities=33% Similarity=0.615 Sum_probs=69.2
Q ss_pred CCCCCCCeEEeecCCeEEEEeecCCCCCCcccEEEEEEecCCCCCCcEEeecC--CcceEEEcCCCCCCeEEEEEEEecC
Q psy7018 223 PDQPNPPKLVTKTRTSLALKWNAAVDNGAHVMHYILESDQGNATGDFKEISKS--KNKNFTLSKLTPSTCFRFRLAAVNQ 300 (349)
Q Consensus 223 P~~p~~~~~~~~~~~sv~l~W~~~~~~~~~i~~Y~v~~~~~~~~~~~~~~~~~--~~~~~~~~~L~p~t~Y~~~V~A~~~ 300 (349)
|.+|.++++.....+++.|+|..+....+.+..|.|++.... ...+..+... ....+.+.+|.|++.|.|+|+|.+.
T Consensus 1 p~~p~~~~~~~~~~~~~~v~W~~~~~~~~~~~~y~v~~~~~~-~~~~~~~~~~~~~~~~~~i~~l~p~~~Y~~~v~a~~~ 79 (93)
T cd00063 1 PSPPTNLRVTDVTSTSVTLSWTPPEDDGGPITGYVVEYREKG-SGDWKEVEVTPGSETSYTLTGLKPGTEYEFRVRAVNG 79 (93)
T ss_pred CcCCCCcEEEEecCCEEEEEECCCCCCCCcceeEEEEEeeCC-CCCCEEeeccCCcccEEEEccccCCCEEEEEEEEECC
Confidence 346677888777789999999998754456779999998764 2445555443 6678999999999999999999999
Q ss_pred CCCCCCCceEEe
Q psy7018 301 HGKSADATSVAE 312 (349)
Q Consensus 301 ~G~~~~s~~~~~ 312 (349)
.|.|.++....+
T Consensus 80 ~~~~~~s~~~~~ 91 (93)
T cd00063 80 GGESPPSESVTV 91 (93)
T ss_pred CccCCCcccccc
Confidence 888877764443
No 11
>KOG4222|consensus
Probab=99.01 E-value=1.3e-08 Score=95.30 Aligned_cols=208 Identities=19% Similarity=0.182 Sum_probs=135.4
Q ss_pred CCCceEEEEeeecCCCCCCCCcc-eeeeCCCCCC-C---------------CCccccccccCCCeeEEEecCCCCC-CCC
Q psy7018 9 TDHKVSTEDRAGNSAGWSPFSET-SVITTPPGPP-G---------------SLTLRKTSVTTPTSLTVSWSEPANH-GSP 70 (349)
Q Consensus 9 p~~~y~~~v~a~n~~G~~~~s~~-~~~~t~~~~P-~---------------~~~~~~~~~~~~~si~l~W~~~~~~-~~~ 70 (349)
|++.|.|.|+|.|..|.+..+.. -.++|.+..+ . .+.......++..++++.|...... ...
T Consensus 593 P~~sylf~vRa~n~~Gis~Ps~~S~~vrta~a~~~~a~ad~~k~~~~ls~~l~~l~~~~~L~asslr~~w~~~kq~~~~~ 672 (1281)
T KOG4222|consen 593 PNLSYLFLVRAENEQGISDPSTSSDPVRTAPADAAAAGADHQKVQRELSNELLRLSNPNVLNASSLRLGWTKDKQHGSQY 672 (1281)
T ss_pred ccceeeeeeeccccccccCCcccCCccccCCCChhhhhhhHHHHHHhhcccceeeccccccchhheeeeeeeecccCccc
Confidence 78999999999999998776422 2234433322 1 1112223456789999999988664 457
Q ss_pred ceeEEEEecCCCC-------Cceee-cccEEEEcCCCCCCEEEEEEEEEee-cCC--cccceeeeeeecccccCCCCCCC
Q psy7018 71 ILHYVLETSDPAQ-------PSITC-DTTSYQLTGLSPHTTYKSVCQVYLI-SDK--HTQKVDLTFESDQLYLSGLPNSE 139 (349)
Q Consensus 71 ~~~Y~v~~~~~~~-------~~~~~-~~~~~~i~~L~p~t~Y~~~v~~~~~-~~~--~~~~~~~~~~~~~~~~~~~p~~~ 139 (349)
+.+|.|.|+.... ..+.. ....+.+.+|.|+++|+|.++-+.. +.. +....+..+.+...+|+..|...
T Consensus 673 i~g~~I~~r~~~~~~a~~s~~~v~~~t~~s~v~~nl~p~t~ye~f~~Pf~~~~~s~~g~pS~sk~alt~e~~PSapp~~~ 752 (1281)
T KOG4222|consen 673 IQGYRISYRSLGSQLAQWSNAGVTVPTPESVVVPNLKPGTNYEFFVRPFFPHGYSIQGAPSNSKTALTLEEPPSAPPQGV 752 (1281)
T ss_pred ccceEEEeccCccccccccccceeccCCcceeccccCCCccceeeccCccCCCcceecCCcccccccccccCCCCCCCCc
Confidence 8999999987322 11222 3677889999999999998873332 211 22334445555555555555442
Q ss_pred --c----cccceEEEEEEec-----CCCCCeeeEEEeeCCCC-CceEEEecCCceEEEecCCCCCCeEEEEEEEEecccc
Q psy7018 140 --W----PSLRTIILTLYFR-----TAQVPGVPPYVSDKGRD-GKYKSIYNGEAQSCRIQDLKPGTDYAVCVQVHLEEIV 207 (349)
Q Consensus 140 --~----~~~~~~~l~w~~~-----~~~~~~Y~v~~~~~~~~-~~~~~~~~~~~~~~~i~~L~p~~~Y~~~V~a~~~~g~ 207 (349)
+ .+.+...+.|..+ ++.+.+|.|+.....+. .......+....+.+|.+|.++..|.|+|.+.+..|.
T Consensus 753 ~~~s~~~~n~Ta~~Vsw~~pp~d~~ng~~qg~ki~~~~~e~tr~h~n~t~~a~~~sv~i~~l~~g~ay~vtv~a~T~aGv 832 (1281)
T KOG4222|consen 753 QHVSKGSYNGTAGSVSWAPPPADVQNGILQGYKIECSGGEKTRIHINKTTNARTGSVTIGNLVTGIAYSVTVAARTGAGV 832 (1281)
T ss_pred cccccccCCCceeeEEecCCcccccCCcccceeEEeecCccccccccccccCCCCceEeccccccceEEEEEeeecCCcc
Confidence 1 2345677788743 36678999887653211 1111122466678999999999999999999999999
Q ss_pred cCCCCcEEE
Q psy7018 208 GIASEPTLF 216 (349)
Q Consensus 208 ~~~s~~~~~ 216 (349)
|..+.+...
T Consensus 833 G~~s~p~~~ 841 (1281)
T KOG4222|consen 833 GVKSPPQPI 841 (1281)
T ss_pred CCCCCCeee
Confidence 987766543
No 12
>PF00041 fn3: Fibronectin type III domain; InterPro: IPR003961 Fibronectins are multi-domain glycoproteins found in a soluble form in plasma, and in an insoluble form in loose connective tissue and basement membranes []. They contain multiple copies of 3 repeat regions (types I, II and III), which bind to a variety of substances including heparin, collagen, DNA, actin, fibrin and fibronectin receptors on cell surfaces. The wide variety of these substances means that fibronectins are involved in a number of important functions: e.g., wound healing; cell adhesion; blood coagulation; cell differentiation and migration; maintenance of the cellular cytoskeleton; and tumour metastasis []. The role of fibronectin in cell differentiation is demonstrated by the marked reduction in the expression of its gene when neoplastic transformation occurs. Cell attachment has been found to be mediated by the binding of the tetrapeptide RGDS to integrins on the cell surface [], although related sequences can also display cell adhesion activity. Plasma fibronectin occurs as a dimer of 2 different subunits, linked together by 2 disulphide bonds near the C terminus. The difference in the 2 chains occurs in the type III repeat region and is caused by alternative splicing of the mRNA from one gene []. The observation that, in a given protein, an individual repeat of one of the 3 types (e.g., the first FnIII repeat) shows much less similarity to its subsequent tandem repeats within that protein than to its equivalent repeat between fibronectins from other species, has suggested that the repeating structure of fibronectin arose at an early stage of evolution. It also seems to suggest that the structure is subject to high selective pressure []. The fibronectin type III repeat region is an approximately 100 amino acid domain, different tandem repeats of which contain binding sites for DNA, heparin and the cell surface []. The superfamily of sequences believed to contain FnIII repeats represents 45 different families, the majority of which are involved in cell surface binding in some manner, or are receptor protein tyrosine kinases, or cytokine receptors.; GO: 0005515 protein binding; PDB: 1UEM_A 1TDQ_A 1X5I_A 2IC2_B 2IBG_C 2IBB_A 3R8Q_A 2FNB_A 1FNH_A 2EDB_A ....
Probab=99.00 E-value=1.9e-09 Score=73.26 Aligned_cols=71 Identities=34% Similarity=0.477 Sum_probs=56.4
Q ss_pred CCccccccccCCCeeEEEecCCCCCCCCceeEEEEecCCCCCc------eeecccEEEEcCCCCCCEEEEEEEEEee
Q psy7018 43 SLTLRKTSVTTPTSLTVSWSEPANHGSPILHYVLETSDPAQPS------ITCDTTSYQLTGLSPHTTYKSVCQVYLI 113 (349)
Q Consensus 43 ~~~~~~~~~~~~~si~l~W~~~~~~~~~~~~Y~v~~~~~~~~~------~~~~~~~~~i~~L~p~t~Y~~~v~~~~~ 113 (349)
+|..+.+..++.+++.|+|+++...++.+.+|.|+|....... .......+.+.+|.|++.|.|+|++.+.
T Consensus 2 ~P~~l~v~~~~~~sv~v~W~~~~~~~~~~~~y~v~~~~~~~~~~~~~~~~~~~~~~~~i~~L~p~t~Y~~~v~a~~~ 78 (85)
T PF00041_consen 2 APENLSVSNISPTSVTVSWKPPSSGNGPITGYRVEYRSVNSTSDWQEVTVPGNETSYTITGLQPGTTYEFRVRAVNS 78 (85)
T ss_dssp SSEEEEEEEECSSEEEEEEEESSSTSSSESEEEEEEEETTSSSEEEEEEEETTSSEEEEESCCTTSEEEEEEEEEET
T ss_pred cCcCeEEEECCCCEEEEEEECCCCCCCCeeEEEEEEEecccceeeeeeeeeeeeeeeeeccCCCCCEEEEEEEEEeC
Confidence 4667777788999999999999766789999999997643322 2234568999999999999998887654
No 13
>PF10179 DUF2369: Uncharacterised conserved protein (DUF2369); InterPro: IPR019326 This is a proline-rich region of a group of proteins found from plants to fungi. The function is largely unknown, although the entry contains Fibronectin type-III domain-containing protein C4orf31, which promotes matrix assembly and cell adhesiveness.
Probab=98.91 E-value=3.5e-06 Score=69.48 Aligned_cols=133 Identities=18% Similarity=0.257 Sum_probs=85.5
Q ss_pred CCceEEEecCCCCCCeEEEEEEEEecccccCCCCcEE-EecCC---CCCCCCCCCeEEeec----CCeEEEEeecCCCCC
Q psy7018 179 GEAQSCRIQDLKPGTDYAVCVQVHLEEIVGIASEPTL-FTTPP---CEPDQPNPPKLVTKT----RTSLALKWNAAVDNG 250 (349)
Q Consensus 179 ~~~~~~~i~~L~p~~~Y~~~V~a~~~~g~~~~s~~~~-~~t~~---~~P~~p~~~~~~~~~----~~sv~l~W~~~~~~~ 250 (349)
.....+.+.++.||..|.+++.+.+... +.....+. ..+.. .-|.-|.+..+.... =++++|.|....+..
T Consensus 125 ~~~~~f~l~~~~~g~~Yliri~~~~~~e-~~~~~kV~aast~~~~~~~P~LP~d~~Ik~f~~lrtC~SvTIAW~~s~d~~ 203 (300)
T PF10179_consen 125 EGLRHFRLSGVKPGERYLIRIQISNSDE-GPSTFKVQAASTNPSKQPYPQLPDDTSIKEFNKLRTCNSVTIAWLGSPDRS 203 (300)
T ss_pred cceEEEEECCCCCCCeEEEEEEccCCCC-CceEEEEEEecCCcccCCCCCCCCCCceeEEcCCcccceEEEEEecCCCCC
Confidence 4556799999999999999998766422 22222222 22322 346778888885543 489999999876533
Q ss_pred CcccEEEEEEecCCCC---------------------CCcEEee--cC---------CcceEEEcCCCCCCeEEEEEEEe
Q psy7018 251 AHVMHYILESDQGNAT---------------------GDFKEIS--KS---------KNKNFTLSKLTPSTCFRFRLAAV 298 (349)
Q Consensus 251 ~~i~~Y~v~~~~~~~~---------------------~~~~~~~--~~---------~~~~~~~~~L~p~t~Y~~~V~A~ 298 (349)
..|=|+.++..+. .++..+. .. .....+|.+|+||+.|.|.|.+.
T Consensus 204 ---~kYCvy~~~~~~~~~~~~~~~~~n~C~~~~sr~k~e~v~Ck~~~~~n~~~~~~~~v~tetI~~L~PG~~Yl~dV~~~ 280 (300)
T PF10179_consen 204 ---IKYCVYRREEHSNYQERSVSRMPNQCLGPESRKKSEKVLCKYFHSPNSSEDPQRAVTTETIKGLKPGTTYLFDVYVN 280 (300)
T ss_pred ---ceEEEEEEEecCchhhhhhcccCccCCCCCccccceEEEEEEEcCCccccccccccceeecccCCCCcEEEEEEEEe
Confidence 3677766543321 0122221 11 11255799999999999999999
Q ss_pred cCCCCCCCCceEEeecc
Q psy7018 299 NQHGKSADATSVAEWLS 315 (349)
Q Consensus 299 ~~~G~~~~s~~~~~~t~ 315 (349)
-..|.+-+-..+.+.|.
T Consensus 281 ~~~G~sl~Y~s~~VkTr 297 (300)
T PF10179_consen 281 GPSGQSLPYRSKWVKTR 297 (300)
T ss_pred cCCCceeecceEEEEec
Confidence 87887665555666554
No 14
>KOG4802|consensus
Probab=98.88 E-value=7.7e-08 Score=80.37 Aligned_cols=135 Identities=17% Similarity=0.242 Sum_probs=92.6
Q ss_pred CCceEEEecCC-ceEEEecCCCCCCeEEEEEEEEecccc---cCCCCcEEEecCCCCCCCCCCCeEEeec-CC--eEEEE
Q psy7018 170 DGKYKSIYNGE-AQSCRIQDLKPGTDYAVCVQVHLEEIV---GIASEPTLFTTPPCEPDQPNPPKLVTKT-RT--SLALK 242 (349)
Q Consensus 170 ~~~~~~~~~~~-~~~~~i~~L~p~~~Y~~~V~a~~~~g~---~~~s~~~~~~t~~~~P~~p~~~~~~~~~-~~--sv~l~ 242 (349)
...|+.+.... ...+.+.++.||..|.|+|.|+|..|. +.+|.+....-.+.+|++|.++++.... ++ ...+.
T Consensus 196 ~thwQtv~~t~~e~~~~~t~~rPgRwyefrvaavn~~G~rGFs~PSkpf~ssk~pkaPp~P~dl~l~~v~~dG~~~~~v~ 275 (516)
T KOG4802|consen 196 PTHWQTVEKTMEENTYIFTDMRPGRWYEFRVAAVNAYGFRGFSEPSKPFPSSKNPKAPPSPNDLKLIGVQFDGRYMLKVV 275 (516)
T ss_pred cccceeeeecCCCceeeeeecCcceeEEEEEeeeecccccccCCCCCCCCCCCCCCCCcCcccceeeeeeecceEEEEEE
Confidence 34566654433 347889999999999999999987764 4455565656666778899999986644 22 24566
Q ss_pred eecCCCCCCcccEEEEEEecCCCCC----CcEEeecCCcceEEEcCCCCCCeEEEEEEEecCCCCCC
Q psy7018 243 WNAAVDNGAHVMHYILESDQGNATG----DFKEISKSKNKNFTLSKLTPSTCFRFRLAAVNQHGKSA 305 (349)
Q Consensus 243 W~~~~~~~~~i~~Y~v~~~~~~~~~----~~~~~~~~~~~~~~~~~L~p~t~Y~~~V~A~~~~G~~~ 305 (349)
|.+++ .+-+|..|.|.|.-..+.. .-+..+.-...++.|++|.|+..|.+.|.|+...|.+.
T Consensus 276 w~P~~-sdlPv~~Yki~Ws~~v~s~k~~m~tks~~~k~thq~si~~L~Pns~Y~VevqAi~y~g~~r 341 (516)
T KOG4802|consen 276 WCPSK-SDLPVEKYKITWSLYVNSAKASMITKSSYVKDTHQFSIKELLPNSSYYVEVQAISYLGSRR 341 (516)
T ss_pred eCCCC-CCCcceeeEEEeehhhhhhhhhcccccceeeccchhhhhhcCCCCeEEEEEEEEEeccCcc
Confidence 66554 4567899999996432111 11111111222556999999999999999999888663
No 15
>smart00060 FN3 Fibronectin type 3 domain. One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
Probab=98.53 E-value=1.2e-06 Score=58.27 Aligned_cols=77 Identities=30% Similarity=0.550 Sum_probs=53.0
Q ss_pred CCCCCeEEeecCCeEEEEeecCCCCCCcccEEEEEEecCCCC--CCcEEeecC-CcceEEEcCCCCCCeEEEEEEEecCC
Q psy7018 225 QPNPPKLVTKTRTSLALKWNAAVDNGAHVMHYILESDQGNAT--GDFKEISKS-KNKNFTLSKLTPSTCFRFRLAAVNQH 301 (349)
Q Consensus 225 ~p~~~~~~~~~~~sv~l~W~~~~~~~~~i~~Y~v~~~~~~~~--~~~~~~~~~-~~~~~~~~~L~p~t~Y~~~V~A~~~~ 301 (349)
+|.++.+.....+++.|+|+++...+. .+|.+.+...... ..+...... ....+.+.+|.+++.|.|+|+|.+..
T Consensus 3 ~p~~~~~~~~~~~~~~v~W~~~~~~~~--~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~L~~~~~Y~v~v~a~~~~ 80 (83)
T smart00060 3 PPSNLRVTDVTSTSVTLSWEPPPDDGI--TGYIVGYRVEYREEGSSWKEVNVTPSSTSYTLTGLKPGTEYEFRVRAVNGA 80 (83)
T ss_pred CCCcEEEEEEeCCEEEEEECCCCCCCC--CccEEEEEEEEecCCCccEEEEecCCccEEEEeCcCCCCEEEEEEEEEccc
Confidence 455577777777799999997764332 5677766543221 124333322 35689999999999999999999976
Q ss_pred CC
Q psy7018 302 GK 303 (349)
Q Consensus 302 G~ 303 (349)
|.
T Consensus 81 g~ 82 (83)
T smart00060 81 GE 82 (83)
T ss_pred CC
Confidence 53
No 16
>KOG4802|consensus
Probab=98.39 E-value=7.2e-06 Score=68.92 Aligned_cols=104 Identities=21% Similarity=0.170 Sum_probs=72.1
Q ss_pred CCCceEEEEeeecCCC---CCCCCcceeeeCCCCCCCCCccccccccC---CCeeEEEecCCCCCCCCceeEEEEecCCC
Q psy7018 9 TDHKVSTEDRAGNSAG---WSPFSETSVITTPPGPPGSLTLRKTSVTT---PTSLTVSWSEPANHGSPILHYVLETSDPA 82 (349)
Q Consensus 9 p~~~y~~~v~a~n~~G---~~~~s~~~~~~t~~~~P~~~~~~~~~~~~---~~si~l~W~~~~~~~~~~~~Y~v~~~~~~ 82 (349)
|++.|.|||.|.|..| .+.+|......-.|.+|.+|.++++..+. .-...|-|.|+. ..-++..|+|.|.-.-
T Consensus 218 PgRwyefrvaavn~~G~rGFs~PSkpf~ssk~pkaPp~P~dl~l~~v~~dG~~~~~v~w~P~~-sdlPv~~Yki~Ws~~v 296 (516)
T KOG4802|consen 218 PGRWYEFRVAAVNAYGFRGFSEPSKPFPSSKNPKAPPSPNDLKLIGVQFDGRYMLKVVWCPSK-SDLPVEKYKITWSLYV 296 (516)
T ss_pred cceeEEEEEeeeecccccccCCCCCCCCCCCCCCCCcCcccceeeeeeecceEEEEEEeCCCC-CCCcceeeEEEeehhh
Confidence 7899999999999965 56667777777777788788877655442 223556666654 4568999999995411
Q ss_pred -----CC----ceeecccEEEEcCCCCCCEEEEEEEEEee
Q psy7018 83 -----QP----SITCDTTSYQLTGLSPHTTYKSVCQVYLI 113 (349)
Q Consensus 83 -----~~----~~~~~~~~~~i~~L~p~t~Y~~~v~~~~~ 113 (349)
.. ...-+...+.|++|.|+..|.+.|++...
T Consensus 297 ~s~k~~m~tks~~~k~thq~si~~L~Pns~Y~VevqAi~y 336 (516)
T KOG4802|consen 297 NSAKASMITKSSYVKDTHQFSIKELLPNSSYYVEVQAISY 336 (516)
T ss_pred hhhhhhcccccceeeccchhhhhhcCCCCeEEEEEEEEEe
Confidence 00 11112334449999999999999986653
No 17
>KOG4367|consensus
Probab=98.33 E-value=5.6e-07 Score=75.75 Aligned_cols=82 Identities=27% Similarity=0.367 Sum_probs=73.6
Q ss_pred ecCCeEEEEeecCCCCCCcccEEEEEEecCCCCCCcEEeecCCcceEEEcCCCCCCeEEEEEEEecCCCCCCCCceEEee
Q psy7018 234 KTRTSLALKWNAAVDNGAHVMHYILESDQGNATGDFKEISKSKNKNFTLSKLTPSTCFRFRLAAVNQHGKSADATSVAEW 313 (349)
Q Consensus 234 ~~~~sv~l~W~~~~~~~~~i~~Y~v~~~~~~~~~~~~~~~~~~~~~~~~~~L~p~t~Y~~~V~A~~~~G~~~~s~~~~~~ 313 (349)
...++++|+|+.|+..+.+.++|.++....++ ..++.++.+.++-+++.||.-++.|..+|.|+|..|.+++|+.+...
T Consensus 450 t~nns~t~~wkqp~~~~~~~dg~~leld~g~~-g~frevy~g~etmctvdglhfns~y~arvka~n~tg~s~ys~tl~lq 528 (699)
T KOG4367|consen 450 THNNSATLSWKQPPLSTVPADGYILELDDGNG-GQFREVYVGKETMCTVDGLHFNSTYNARVKAFNKTGVSPYSKTLVLQ 528 (699)
T ss_pred ccCCceEEEeecCCCCCCCCcceEEEeecCCC-CceeEEEecCceeEEecceecchhHHHHHHHhhccCCCcccceeEee
Confidence 34789999999888777889999999987744 66999999999999999999999999999999999999999999888
Q ss_pred ccC
Q psy7018 314 LSY 316 (349)
Q Consensus 314 t~~ 316 (349)
|..
T Consensus 529 ts~ 531 (699)
T KOG4367|consen 529 TSE 531 (699)
T ss_pred ecc
Confidence 863
No 18
>cd00063 FN3 Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
Probab=98.30 E-value=7.8e-06 Score=55.80 Aligned_cols=69 Identities=35% Similarity=0.516 Sum_probs=50.1
Q ss_pred ccccccccCCCeeEEEecCCCCCCCCceeEEEEecCCC---CCcee---ecccEEEEcCCCCCCEEEEEEEEEee
Q psy7018 45 TLRKTSVTTPTSLTVSWSEPANHGSPILHYVLETSDPA---QPSIT---CDTTSYQLTGLSPHTTYKSVCQVYLI 113 (349)
Q Consensus 45 ~~~~~~~~~~~si~l~W~~~~~~~~~~~~Y~v~~~~~~---~~~~~---~~~~~~~i~~L~p~t~Y~~~v~~~~~ 113 (349)
..+.+.....+++.|.|+++....+.+..|.|.+.... ..... .....+.+.+|.|++.|.|+|++...
T Consensus 5 ~~~~~~~~~~~~~~v~W~~~~~~~~~~~~y~v~~~~~~~~~~~~~~~~~~~~~~~~i~~l~p~~~Y~~~v~a~~~ 79 (93)
T cd00063 5 TNLRVTDVTSTSVTLSWTPPEDDGGPITGYVVEYREKGSGDWKEVEVTPGSETSYTLTGLKPGTEYEFRVRAVNG 79 (93)
T ss_pred CCcEEEEecCCEEEEEECCCCCCCCcceeEEEEEeeCCCCCCEEeeccCCcccEEEEccccCCCEEEEEEEEECC
Confidence 33344455689999999998654456789999887632 22222 25778999999999999998887654
No 19
>KOG4258|consensus
Probab=98.22 E-value=1.8e-05 Score=72.71 Aligned_cols=122 Identities=20% Similarity=0.281 Sum_probs=87.5
Q ss_pred cceEEEEEEecCC----CCCeeeEEEeeC---------C----CCCceEEEe---c------CCceEEEecCCCCCCeEE
Q psy7018 143 LRTIILTLYFRTA----QVPGVPPYVSDK---------G----RDGKYKSIY---N------GEAQSCRIQDLKPGTDYA 196 (349)
Q Consensus 143 ~~~~~l~w~~~~~----~~~~Y~v~~~~~---------~----~~~~~~~~~---~------~~~~~~~i~~L~p~~~Y~ 196 (349)
..+..++|..-.. ...+|.++|+.. | ....|.... . +...-+.+.+|+|.|.|.
T Consensus 501 ~dsi~lrW~~~~~~d~r~llg~~~~yKEaP~qNvT~~dg~~aCg~~~W~~~~v~~~~~~p~~~~~~~~~l~~LkP~TqYA 580 (1025)
T KOG4258|consen 501 ADSILLRWERYQPPDMRDLLGFLLHYKEAPFQNVTEEDGRDACGSNSWNVVDVDPPDLIPNDGTHPGFLLDGLKPWTQYA 580 (1025)
T ss_pred cceeEEEecccCCcchhhhheeeEeeccCCccccceecCccccccCcceEEeccCCcCCCccccccceehhcCCccceeE
Confidence 3466677763222 367898888642 1 122344321 1 122357899999999999
Q ss_pred EEEEEEe--ccccc--CCCCcEEEecCCCCCCCCCCCeEEeecCCeEEEEeecCCCCCCcccEEEEEEecCC
Q psy7018 197 VCVQVHL--EEIVG--IASEPTLFTTPPCEPDQPNPPKLVTKTRTSLALKWNAAVDNGAHVMHYILESDQGN 264 (349)
Q Consensus 197 ~~V~a~~--~~g~~--~~s~~~~~~t~~~~P~~p~~~~~~~~~~~sv~l~W~~~~~~~~~i~~Y~v~~~~~~ 264 (349)
+.|.+.. ..+.. ..|...-++|..+.|.+|..+-.....++.|.|+|.+|..++|.+++|.+.|+...
T Consensus 581 vfVkT~t~t~~~~~~~A~S~I~YvqT~~~~PspPl~~ls~snsSsqi~l~W~pP~~pNG~lt~Ylv~wer~~ 652 (1025)
T KOG4258|consen 581 VFVKTLTVTEAHEAYEAKSKIGYVQTLPDIPSPPLDVLSKSNSSSQILLKWKPPSQPNGNLTHYLVVWERQA 652 (1025)
T ss_pred EEEeeeehhhhccccccccceEEEEecCCCCCCcchhhhccCcchheeEEecCCCCCCCceeEEEEEEEecc
Confidence 9999983 22222 23777789999999999988777777788999999999999999999999997654
No 20
>COG3401 Fibronectin type 3 domain-containing protein [General function prediction only]
Probab=98.05 E-value=0.0024 Score=53.09 Aligned_cols=279 Identities=13% Similarity=0.080 Sum_probs=141.2
Q ss_pred CCceEEEEeeecCCCCCCCCcceeeeCCCCCCCCCccccccccCCCeeEEEecCCCCCCCCceeEEEEecCCCCCceeec
Q psy7018 10 DHKVSTEDRAGNSAGWSPFSETSVITTPPGPPGSLTLRKTSVTTPTSLTVSWSEPANHGSPILHYVLETSDPAQPSITCD 89 (349)
Q Consensus 10 ~~~y~~~v~a~n~~G~~~~s~~~~~~t~~~~P~~~~~~~~~~~~~~si~l~W~~~~~~~~~~~~Y~v~~~~~~~~~~~~~ 89 (349)
..+|.|.-...+..-.+--++.+.+.+. .-...+.....-.-+..+.+.|.+..+ -++..|.|+...+......+.
T Consensus 41 ~~~~~~~~~~~~~~~~~~~~e~~~~~~~--~i~~~~~~~~~~~~~~~~~~~w~~~~d--~~~~~Y~i~~~~gD~~f~r~~ 116 (343)
T COG3401 41 EESYPYQEGTTKVDKISYDSERILVKTS--FIERVRSVFASLERPKSVKVFWSPHPD--VSVGKYIIQRQNGDGKFLRTG 116 (343)
T ss_pred cCcceeeecccccceeeecCcceEEEee--eccccccccchhcCcceeeecccccCC--CCCCeEEEEEecCchhhhhhh
Confidence 3455555555444422222555555443 111111111112356779999998764 456788887655333221111
Q ss_pred ------ccEEEEcCCCCCCEEEEEEEEEeecCCcccceeeeeeecccccCCCCCCCc------cccceEEEEEEecCCCC
Q psy7018 90 ------TTSYQLTGLSPHTTYKSVCQVYLISDKHTQKVDLTFESDQLYLSGLPNSEW------PSLRTIILTLYFRTAQV 157 (349)
Q Consensus 90 ------~~~~~i~~L~p~t~Y~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~------~~~~~~~l~w~~~~~~~ 157 (349)
.-.+.-..|.++..|..+|-+......-+ .++...+..+...|.... ...+...++|.|....-
T Consensus 117 ~~~n~l~~~~i~s~~~~~~~~~~~Iia~~f~~~~s----fsf~gVE~~~~~~P~ei~~~~~~~d~~~~i~ls~dg~~~~~ 192 (343)
T COG3401 117 LVKNRLFVEFIDSDLGHNEKYMELIIAADFQMGKS----FSFTGVEATPKAEPKEITNVRVSFDLGNNIELSEDGSEAED 192 (343)
T ss_pred HHHhccchhheecccccccceeeeEEeecccccce----eeeeeeecccccCCceeeeeeeecCCCCcceeeccCccccc
Confidence 33344558999999999765443212111 112222222222332211 23345678888755433
Q ss_pred CeeeEEEeeCCCCCceEEEecCCceEEEecCCCCCCeEEEEEEEEec-ccccCCCCcEEEecCCCCCCCCCCCeEEeecC
Q psy7018 158 PGVPPYVSDKGRDGKYKSIYNGEAQSCRIQDLKPGTDYAVCVQVHLE-EIVGIASEPTLFTTPPCEPDQPNPPKLVTKTR 236 (349)
Q Consensus 158 ~~Y~v~~~~~~~~~~~~~~~~~~~~~~~i~~L~p~~~Y~~~V~a~~~-~g~~~~s~~~~~~t~~~~P~~p~~~~~~~~~~ 236 (349)
.|.|+-.+.+ +.....+.....+.+...--..|..|...|.+... ..++..+...++.-..+.|..|..+.- ....
T Consensus 193 -yy~IY~~~~g-~e~~~~ia~t~~n~y~d~~eglga~~~y~VTtVd~~~~es~lp~~~t~~~~~g~~e~pt~~g~-t~~~ 269 (343)
T COG3401 193 -YYRIYASDSG-NEEYGFIAQTTENSYYDVKEGLGAVEYYKVTTVDNTGFESDLPNEPTVGETGGRYEVPTIPGE-TTEA 269 (343)
T ss_pred -eEEEeccCCc-cccccceeeccccchhhhhhccCceeEEEEEEEcCCcceeccCCccccccccCCcccccCCCc-eeee
Confidence 7888765544 33333333333333332222337788888888764 345555544333222222222221111 1223
Q ss_pred CeEEEEeecCCCCCCcccEEEEEEecCCCCCCcEEeecCCcceEEEcCCCCCCeEEEEEEEecCCCCC
Q psy7018 237 TSLALKWNAAVDNGAHVMHYILESDQGNATGDFKEISKSKNKNFTLSKLTPSTCFRFRLAAVNQHGKS 304 (349)
Q Consensus 237 ~sv~l~W~~~~~~~~~i~~Y~v~~~~~~~~~~~~~~~~~~~~~~~~~~L~p~t~Y~~~V~A~~~~G~~ 304 (349)
+.+.+.|+..+. .....|.++.-+.+... ....-..+...+.++..+..|.+.|.+.+..|.+
T Consensus 270 s~i~V~~E~~~~--~~av~ytvyr~~~g~~~---~~f~it~~d~~d~~~ltg~~~~y~v~~v~~~gls 332 (343)
T COG3401 270 SFIGVAAEQNQL--RQAVTYTVYRVEDGTPT---KFFTITETDGQDNDMLTGVEYRYEVVAVDKAGLS 332 (343)
T ss_pred eeeeEEeccccc--ceeeEEeeeeccCCCcc---eeeeeeeccccchhhccccceeEEEEEecccccc
Confidence 455677776642 22335666554442211 1222234466677888999999999999999854
No 21
>smart00060 FN3 Fibronectin type 3 domain. One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
Probab=97.98 E-value=3.5e-05 Score=51.01 Aligned_cols=65 Identities=29% Similarity=0.445 Sum_probs=42.9
Q ss_pred ccccccCCCeeEEEecCCCCCCCCceeEEEEecC----C--CCCceeec--ccEEEEcCCCCCCEEEEEEEEEee
Q psy7018 47 RKTSVTTPTSLTVSWSEPANHGSPILHYVLETSD----P--AQPSITCD--TTSYQLTGLSPHTTYKSVCQVYLI 113 (349)
Q Consensus 47 ~~~~~~~~~si~l~W~~~~~~~~~~~~Y~v~~~~----~--~~~~~~~~--~~~~~i~~L~p~t~Y~~~v~~~~~ 113 (349)
+.+....++++.|+|+++...+. .+|.+.+.. . ........ ...+.+.+|.|++.|.|+|++...
T Consensus 7 ~~~~~~~~~~~~v~W~~~~~~~~--~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~L~~~~~Y~v~v~a~~~ 79 (83)
T smart00060 7 LRVTDVTSTSVTLSWEPPPDDGI--TGYIVGYRVEYREEGSSWKEVNVTPSSTSYTLTGLKPGTEYEFRVRAVNG 79 (83)
T ss_pred EEEEEEeCCEEEEEECCCCCCCC--CccEEEEEEEEecCCCccEEEEecCCccEEEEeCcCCCCEEEEEEEEEcc
Confidence 44445567799999997754332 566665543 1 12222222 578999999999999998887653
No 22
>PF10179 DUF2369: Uncharacterised conserved protein (DUF2369); InterPro: IPR019326 This is a proline-rich region of a group of proteins found from plants to fungi. The function is largely unknown, although the entry contains Fibronectin type-III domain-containing protein C4orf31, which promotes matrix assembly and cell adhesiveness.
Probab=97.87 E-value=0.0042 Score=51.68 Aligned_cols=128 Identities=14% Similarity=0.156 Sum_probs=68.9
Q ss_pred ccEEEEcCCCCCCEEEEEEEEEeecCCcccceeee-eee--cccccCCCCCCCcc-------ccceEEEEEEecCCCCCe
Q psy7018 90 TTSYQLTGLSPHTTYKSVCQVYLISDKHTQKVDLT-FES--DQLYLSGLPNSEWP-------SLRTIILTLYFRTAQVPG 159 (349)
Q Consensus 90 ~~~~~i~~L~p~t~Y~~~v~~~~~~~~~~~~~~~~-~~~--~~~~~~~~p~~~~~-------~~~~~~l~w~~~~~~~~~ 159 (349)
-..+.+.++.||..|.+++.+.+..+ +.....+. +.+ ...+.|..|..... .-++..+.|......-..
T Consensus 127 ~~~f~l~~~~~g~~Yliri~~~~~~e-~~~~~kV~aast~~~~~~~P~LP~d~~Ik~f~~lrtC~SvTIAW~~s~d~~~k 205 (300)
T PF10179_consen 127 LRHFRLSGVKPGERYLIRIQISNSDE-GPSTFKVQAASTNPSKQPYPQLPDDTSIKEFNKLRTCNSVTIAWLGSPDRSIK 205 (300)
T ss_pred eEEEEECCCCCCCeEEEEEEccCCCC-CceEEEEEEecCCcccCCCCCCCCCCceeEEcCCcccceEEEEEecCCCCCce
Confidence 46788999999999999776544322 22222222 222 12223334433221 113455555533322247
Q ss_pred eeEEEeeCCCC---------------------CceEEE---ecC--------CceEEEecCCCCCCeEEEEEEEEecccc
Q psy7018 160 VPPYVSDKGRD---------------------GKYKSI---YNG--------EAQSCRIQDLKPGTDYAVCVQVHLEEIV 207 (349)
Q Consensus 160 Y~v~~~~~~~~---------------------~~~~~~---~~~--------~~~~~~i~~L~p~~~Y~~~V~a~~~~g~ 207 (349)
|-|+.+....+ .++... ... .....+|.+|+||+.|-|.|.+....|.
T Consensus 206 YCvy~~~~~~~~~~~~~~~~~n~C~~~~sr~k~e~v~Ck~~~~~n~~~~~~~~v~tetI~~L~PG~~Yl~dV~~~~~~G~ 285 (300)
T PF10179_consen 206 YCVYRREEHSNYQERSVSRMPNQCLGPESRKKSEKVLCKYFHSPNSSEDPQRAVTTETIKGLKPGTTYLFDVYVNGPSGQ 285 (300)
T ss_pred EEEEEEEecCchhhhhhcccCccCCCCCccccceEEEEEEEcCCccccccccccceeecccCCCCcEEEEEEEEecCCCc
Confidence 88877643211 011111 111 1124579999999999999999987776
Q ss_pred cCCCCcEEEec
Q psy7018 208 GIASEPTLFTT 218 (349)
Q Consensus 208 ~~~s~~~~~~t 218 (349)
.-.-....++|
T Consensus 286 sl~Y~s~~VkT 296 (300)
T PF10179_consen 286 SLPYRSKWVKT 296 (300)
T ss_pred eeecceEEEEe
Confidence 54433344444
No 23
>KOG4152|consensus
Probab=97.85 E-value=0.00016 Score=62.99 Aligned_cols=122 Identities=19% Similarity=0.224 Sum_probs=83.1
Q ss_pred ecCCCCCCeEEEEEEEEecccccCCCCcEEEecC-CCCCCCCCCCeEEeecCCeEEEEeecCCCC-CCcccEEEEEEecC
Q psy7018 186 IQDLKPGTDYAVCVQVHLEEIVGIASEPTLFTTP-PCEPDQPNPPKLVTKTRTSLALKWNAAVDN-GAHVMHYILESDQG 263 (349)
Q Consensus 186 i~~L~p~~~Y~~~V~a~~~~g~~~~s~~~~~~t~-~~~P~~p~~~~~~~~~~~sv~l~W~~~~~~-~~~i~~Y~v~~~~~ 263 (349)
-.+|-+|+.|.|+|.|+|..|.+++|....++|. ++.|..|...++. .+-..+.|.|++|... .+.|..|..+..-.
T Consensus 651 k~~lv~Gq~yrfrV~aIng~G~gp~s~i~~~kTc~pG~P~apS~~ri~-k~~eGi~l~weppt~p~sg~Iieys~ylAi~ 729 (830)
T KOG4152|consen 651 KTSLVTGQAYRFRVTAINGKGPGPASTILKLKTCAPGKPTAPSGARIK-KTIEGISLVWEPPTKPGSGTIIEYSPYLAII 729 (830)
T ss_pred ccccccccceeeeeeeeeccCCCchhhheeeeeccCCCCCCccccccc-ccccceeecccCCCCCCCcceEEeehhhHhh
Confidence 4578999999999999999999999999988885 4456666665554 2345789999999844 57788887764211
Q ss_pred -------CCCCCcEEeecCCcceEE---------EcCCCCCCeEEEEEEEecCCCCCCCCc
Q psy7018 264 -------NATGDFKEISKSKNKNFT---------LSKLTPSTCFRFRLAAVNQHGKSADAT 308 (349)
Q Consensus 264 -------~~~~~~~~~~~~~~~~~~---------~~~L~p~t~Y~~~V~A~~~~G~~~~s~ 308 (349)
..+-.+..++-+-.+.+. +.+...+-.-.|+|.|.|..|.|+...
T Consensus 730 tq~~~~~~sql~fmr~ycg~~t~c~vt~g~l~tA~~d~t~~paivfri~aknekGyGpatq 790 (830)
T KOG4152|consen 730 TQAGATGVSQLPFMRSYCGIITTCVVTSGKLITAVPDITTGPAIVFRIVAKNEKGYGPATQ 790 (830)
T ss_pred hhhhccCccccceeeeecccceeeEEecccceeeccccccCcceEEEEEecCCCCCCcceE
Confidence 111122333333332222 234455667899999999999887443
No 24
>KOG3632|consensus
Probab=97.83 E-value=0.00044 Score=64.85 Aligned_cols=242 Identities=18% Similarity=0.190 Sum_probs=128.5
Q ss_pred cccCCCeeEEEecCCCCC--CCCceeEEEEecCCCCCceee-cccEEEEc--CCCCCCEEEEEEEEEeecCCcccceeee
Q psy7018 50 SVTTPTSLTVSWSEPANH--GSPILHYVLETSDPAQPSITC-DTTSYQLT--GLSPHTTYKSVCQVYLISDKHTQKVDLT 124 (349)
Q Consensus 50 ~~~~~~si~l~W~~~~~~--~~~~~~Y~v~~~~~~~~~~~~-~~~~~~i~--~L~p~t~Y~~~v~~~~~~~~~~~~~~~~ 124 (349)
......++.|-|++|..+ -+.+..|.+....+....... +.+...+. ++..++ |.+.|.....-+.......+.
T Consensus 587 vkqla~sv~vawepps~pP~~~~~~~~~~~v~~elrq~l~~gs~tka~~E~ld~~a~s-~~isvq~ltSrGsqd~lrc~L 665 (1335)
T KOG3632|consen 587 VKQLAGSVHVAWEPPSSPPFTAPARMIGATVLMELRQTLYAGSLTKAMQESLDNSAHS-GYISVQRLTSRGSQDQLRCIL 665 (1335)
T ss_pred hhhhccceeeeccCCCCCCccccceeeeeecchhhhhhcccccchHHHHhhccccCCc-eeeehhhhhccCCCCcceeeE
Confidence 346788999999998654 346677777665543333222 12222222 233333 445443322212222222222
Q ss_pred eeecccccCCCC-CCCc--cccceEEEEEEecCCCCCeeeEEEeeCCCCCceEEEecCCceEEEecCCCCCCeEEEEEEE
Q psy7018 125 FESDQLYLSGLP-NSEW--PSLRTIILTLYFRTAQVPGVPPYVSDKGRDGKYKSIYNGEAQSCRIQDLKPGTDYAVCVQV 201 (349)
Q Consensus 125 ~~~~~~~~~~~p-~~~~--~~~~~~~l~w~~~~~~~~~Y~v~~~~~~~~~~~~~~~~~~~~~~~i~~L~p~~~Y~~~V~a 201 (349)
...... + ..| .... ...++..++|-..+.... .+.|.+.. ...........+++.+|.|++.|.++|-+
T Consensus 666 lvgg~a-~-vvpsqlrv~n~tqtSa~itwvp~nsn~~--Hviyln~e----E~~ps~a~~y~ytf~~lrpgt~y~a~vea 737 (1335)
T KOG3632|consen 666 LVGGAA-P-VVPSQLRVWNATQTSAMITWVPFNSNFL--HVIYLNAE----EPRPSVAEMYNYTFMRLRPGTDYWASVEA 737 (1335)
T ss_pred eccccc-c-ccchhhhhhhhhchhhheeeeecCCCcc--eeeecCCc----cCCCchhhhhHHHHhccCCCCccceeccc
Confidence 222111 1 122 1111 333466677765443322 12232211 11112334456889999999999999988
Q ss_pred Eeccc---------ccCCCCcEEEecCC-CCCCCCCCCeEEe-ecCCeEEEEeecCC-C-----CCCcccEEEEEEecCC
Q psy7018 202 HLEEI---------VGIASEPTLFTTPP-CEPDQPNPPKLVT-KTRTSLALKWNAAV-D-----NGAHVMHYILESDQGN 264 (349)
Q Consensus 202 ~~~~g---------~~~~s~~~~~~t~~-~~P~~p~~~~~~~-~~~~sv~l~W~~~~-~-----~~~~i~~Y~v~~~~~~ 264 (349)
..... .+.....+.|.|.. ++|.+|.++++.. .+...+.++|.++. + ++-.+++|.++.....
T Consensus 738 ~~p~q~pwdl~~v~~etr~atv~f~tLpAGppappldV~vE~g~spg~l~vswrPptldsag~sngv~vtgYavyadgqk 817 (1335)
T KOG3632|consen 738 ALPRQEPWDLRMVPMETRQATVLFRTLPAGPPAPPLDVKVETGGSPGRLEVSWRPPTLDSAGCSNGVAVTGYAVYADGQK 817 (1335)
T ss_pred ccCcCCCcccccchhhhhccceeeecccCCCCCCchheeeecCCCCceeeeeccCceeccccccCceeeeeeeeeeCCce
Confidence 76411 12224567787764 5668888888854 46888999999986 2 2456889999876332
Q ss_pred CCCCcEEee--cCCcceEEEcCCCCC-CeEEEEEEEecCCCCC
Q psy7018 265 ATGDFKEIS--KSKNKNFTLSKLTPS-TCFRFRLAAVNQHGKS 304 (349)
Q Consensus 265 ~~~~~~~~~--~~~~~~~~~~~L~p~-t~Y~~~V~A~~~~G~~ 304 (349)
+..+. ....+.+.+..|+-+ .-=.+.|+.....|.+
T Consensus 818 ----v~Evafptagst~VelsQlq~~l~~~~V~vRtms~~ges 856 (1335)
T KOG3632|consen 818 ----VEEVAFPTAGSTKVELSQLQDGLYHGAVGVRTMSVPGES 856 (1335)
T ss_pred ----eeeeecccCCceEEEeeeehhhheecceeEEecccCccc
Confidence 22332 223445556666653 2223555555566655
No 25
>PF01108 Tissue_fac: Tissue factor; PDB: 3OG4_B 3OG6_B 1FYH_E 1FG9_D 1JRH_I 3DGC_R 3DLQ_R 1LQS_R 1Y6M_R 1J7V_R ....
Probab=97.49 E-value=0.00051 Score=48.58 Aligned_cols=79 Identities=15% Similarity=0.260 Sum_probs=55.8
Q ss_pred CCCCCCCeEEeecCCeEEEEeecCCCCCCcccEEEEEEecCCCCCCcEEe---ecCCcceEEEcCCC--CCCeEEEEEEE
Q psy7018 223 PDQPNPPKLVTKTRTSLALKWNAAVDNGAHVMHYILESDQGNATGDFKEI---SKSKNKNFTLSKLT--PSTCFRFRLAA 297 (349)
Q Consensus 223 P~~p~~~~~~~~~~~sv~l~W~~~~~~~~~i~~Y~v~~~~~~~~~~~~~~---~~~~~~~~~~~~L~--p~t~Y~~~V~A 297 (349)
.++|.++++...+ -...|+|+++.... .-..|.|+++.. +...|+.+ .+...+.+.+++.. +...|.++|+|
T Consensus 22 lp~P~nv~~~s~n-f~~iL~W~~~~~~~-~~~~ytVq~~~~-~~~~W~~v~~C~~i~~~~Cdlt~~~~~~~~~Y~~rV~A 98 (107)
T PF01108_consen 22 LPAPQNVTVDSVN-FKHILRWDPGPGSP-PNVTYTVQYKKY-GSSSWKDVPGCQNITETSCDLTDETSDPSESYYARVRA 98 (107)
T ss_dssp GSSCEEEEEEEET-TEEEEEEEESTTSS-STEEEEEEEEES-STSCEEEECCEEEESSSEEECTTCCTTTTSEEEEEEEE
T ss_pred CCCCCeeEEEEEC-CceEEEeCCCCCCC-CCeEEEEEEEec-CCcceeeccceecccccceeCcchhhcCcCCEEEEEEE
Confidence 3678888887664 45779999965333 345899999933 23557665 44556788888744 67899999999
Q ss_pred ecCCCCC
Q psy7018 298 VNQHGKS 304 (349)
Q Consensus 298 ~~~~G~~ 304 (349)
..+...|
T Consensus 99 ~~~~~~S 105 (107)
T PF01108_consen 99 EVGNQTS 105 (107)
T ss_dssp EETTEEE
T ss_pred EeCCccC
Confidence 9876544
No 26
>PF09294 Interfer-bind: Interferon-alpha/beta receptor, fibronectin type III; InterPro: IPR015373 Members of this family adopt a secondary structure consisting of seven beta-strands arranged in an immunoglobulin-like beta-sandwich, in a Greek-key topology. They are required for binding to interferon-alpha []. ; PDB: 1A21_A 3LQM_B 3ELA_T 1AHW_C 2A2Q_T 1TFH_B 1FAK_T 1WSS_T 1W2K_T 2FIR_T ....
Probab=97.39 E-value=0.00059 Score=48.21 Aligned_cols=87 Identities=18% Similarity=0.241 Sum_probs=57.3
Q ss_pred CCCCCeEEeecCCeEEEEeecCCC----CCC--------cccEEEEEEecCCCCCCcEEeecCCcceEEEcCCCCCCeEE
Q psy7018 225 QPNPPKLVTKTRTSLALKWNAAVD----NGA--------HVMHYILESDQGNATGDFKEISKSKNKNFTLSKLTPSTCFR 292 (349)
Q Consensus 225 ~p~~~~~~~~~~~sv~l~W~~~~~----~~~--------~i~~Y~v~~~~~~~~~~~~~~~~~~~~~~~~~~L~p~t~Y~ 292 (349)
||. +.+ ...++++.+.+..|.. .+. .-..|.|.+++.... ............+.+.+|.|++.|-
T Consensus 5 PP~-v~v-~~~~~~l~V~i~~P~~~~~~~~~~~~l~~~~~~~~Y~v~~~~~~~~-~~~~~~~~~~~~~~l~~L~p~t~YC 81 (106)
T PF09294_consen 5 PPS-VNV-SSCGGSLHVTIKPPMTPLRAGGKNSSLRDIYPSLSYNVSYWKNGSN-EKKKEIETKNSSVTLSDLKPGTNYC 81 (106)
T ss_dssp SSE-EEE-EEETTEEEEEEEESEEEEECSSSEEEHHHHHGG-EEEEEEEETTTS-CEEEEEESSSEEEEEES--TTSEEE
T ss_pred CCE-EEE-EECCCEEEEEEECCCcccccCCCCCcHHHhCCCeEEEEEEEeCCCc-cceEEEeecCCEEEEeCCCCCCCEE
Confidence 443 666 5678899999998861 111 113699998877643 2344566667788999999999999
Q ss_pred EEEEEecC--CCCCCCCceEEeec
Q psy7018 293 FRLAAVNQ--HGKSADATSVAEWL 314 (349)
Q Consensus 293 ~~V~A~~~--~G~~~~s~~~~~~t 314 (349)
|+|+|... ...|.+|+..-++|
T Consensus 82 v~V~~~~~~~~~~s~~S~~~C~~t 105 (106)
T PF09294_consen 82 VSVQAFSPSQNKNSQPSEPQCITT 105 (106)
T ss_dssp EEEEEEECSSTEEEEEBSEEEEE-
T ss_pred EEEEEEeccCCCcCCCCCCEeEeC
Confidence 99999332 23677777766554
No 27
>COG3401 Fibronectin type 3 domain-containing protein [General function prediction only]
Probab=97.20 E-value=0.021 Score=47.74 Aligned_cols=160 Identities=11% Similarity=0.045 Sum_probs=92.6
Q ss_pred EEEEEEec-CCCCCeeeEEEeeCCCCCceEEE---ecCCceEEEecCCCCCCeEEEEEEEEecccccCCCCcEEEecCCC
Q psy7018 146 IILTLYFR-TAQVPGVPPYVSDKGRDGKYKSI---YNGEAQSCRIQDLKPGTDYAVCVQVHLEEIVGIASEPTLFTTPPC 221 (349)
Q Consensus 146 ~~l~w~~~-~~~~~~Y~v~~~~~~~~~~~~~~---~~~~~~~~~i~~L~p~~~Y~~~V~a~~~~g~~~~s~~~~~~t~~~ 221 (349)
..+-|... +-....|.|+....+.. +... .++-...+.-.+|.++..|...|.+.+..+.-+.+....--+...
T Consensus 86 ~~~~w~~~~d~~~~~Y~i~~~~gD~~--f~r~~~~~n~l~~~~i~s~~~~~~~~~~~Iia~~f~~~~sfsf~gVE~~~~~ 163 (343)
T COG3401 86 VKVFWSPHPDVSVGKYIIQRQNGDGK--FLRTGLVKNRLFVEFIDSDLGHNEKYMELIIAADFQMGKSFSFTGVEATPKA 163 (343)
T ss_pred eeecccccCCCCCCeEEEEEecCchh--hhhhhHHHhccchhheecccccccceeeeEEeecccccceeeeeeeeccccc
Confidence 44445432 24577899988764322 2211 122233455568999999999999988765433331111122222
Q ss_pred CCCCCCCCeEEeecCCeEEEEeecCCCCCCcccEEEEEEecCCCCCCcEEeecCCcceEEEcCCCCCCeEEEEEEEecCC
Q psy7018 222 EPDQPNPPKLVTKTRTSLALKWNAAVDNGAHVMHYILESDQGNATGDFKEISKSKNKNFTLSKLTPSTCFRFRLAAVNQH 301 (349)
Q Consensus 222 ~P~~p~~~~~~~~~~~sv~l~W~~~~~~~~~i~~Y~v~~~~~~~~~~~~~~~~~~~~~~~~~~L~p~t~Y~~~V~A~~~~ 301 (349)
.|..-.++.+...-.+.+.|+|+.++. ..+|.|+...... .++..+.....+.+...--..|..|.+.|.+.+..
T Consensus 164 ~P~ei~~~~~~~d~~~~i~ls~dg~~~----~~yy~IY~~~~g~-e~~~~ia~t~~n~y~d~~eglga~~~y~VTtVd~~ 238 (343)
T COG3401 164 EPKEITNVRVSFDLGNNIELSEDGSEA----EDYYRIYASDSGN-EEYGFIAQTTENSYYDVKEGLGAVEYYKVTTVDNT 238 (343)
T ss_pred CCceeeeeeeecCCCCcceeeccCccc----cceEEEeccCCcc-ccccceeeccccchhhhhhccCceeEEEEEEEcCC
Confidence 233333444455568889999998863 2388888766544 33444444444444333222388999999999965
Q ss_pred C-CCCCCceEEe
Q psy7018 302 G-KSADATSVAE 312 (349)
Q Consensus 302 G-~~~~s~~~~~ 312 (349)
| .+..+.....
T Consensus 239 ~~es~lp~~~t~ 250 (343)
T COG3401 239 GFESDLPNEPTV 250 (343)
T ss_pred cceeccCCcccc
Confidence 5 6665554433
No 28
>KOG4152|consensus
Probab=97.14 E-value=0.00089 Score=58.59 Aligned_cols=68 Identities=24% Similarity=0.361 Sum_probs=53.8
Q ss_pred cCCCCceEEEEeeecCCCCCCCCcceeeeC-CCCCCCCCccccccccCCCeeEEEecCCCCCC-CCceeEE
Q psy7018 7 SCTDHKVSTEDRAGNSAGWSPFSETSVITT-PPGPPGSLTLRKTSVTTPTSLTVSWSEPANHG-SPILHYV 75 (349)
Q Consensus 7 ~~p~~~y~~~v~a~n~~G~~~~s~~~~~~t-~~~~P~~~~~~~~~~~~~~si~l~W~~~~~~~-~~~~~Y~ 75 (349)
--++..|.|||.|.|..|.+++|+...+.| .|+-|.+|+..++. .+-..+.|.|++|..++ +.|..|.
T Consensus 654 lv~Gq~yrfrV~aIng~G~gp~s~i~~~kTc~pG~P~apS~~ri~-k~~eGi~l~weppt~p~sg~Iieys 723 (830)
T KOG4152|consen 654 LVTGQAYRFRVTAINGKGPGPASTILKLKTCAPGKPTAPSGARIK-KTIEGISLVWEPPTKPGSGTIIEYS 723 (830)
T ss_pred cccccceeeeeeeeeccCCCchhhheeeeeccCCCCCCccccccc-ccccceeecccCCCCCCCcceEEee
Confidence 357899999999999999999999999986 68888888877632 36678999999997653 3444443
No 29
>KOG4367|consensus
Probab=96.66 E-value=0.002 Score=54.96 Aligned_cols=75 Identities=17% Similarity=0.284 Sum_probs=63.4
Q ss_pred eEEEEEEecC---CCCCeeeEEEeeCCCCCceEEEecCCceEEEecCCCCCCeEEEEEEEEecccccCCCCcEEEecCC
Q psy7018 145 TIILTLYFRT---AQVPGVPPYVSDKGRDGKYKSIYNGEAQSCRIQDLKPGTDYAVCVQVHLEEIVGIASEPTLFTTPP 220 (349)
Q Consensus 145 ~~~l~w~~~~---~~~~~Y~v~~~~~~~~~~~~~~~~~~~~~~~i~~L~p~~~Y~~~V~a~~~~g~~~~s~~~~~~t~~ 220 (349)
+..+.|..+. .+.++|.++..+. .++.+..+..+..+-+++.+|.-++.|..+|.++|..|.++.|..+...|..
T Consensus 454 s~t~~wkqp~~~~~~~dg~~leld~g-~~g~frevy~g~etmctvdglhfns~y~arvka~n~tg~s~ys~tl~lqts~ 531 (699)
T KOG4367|consen 454 SATLSWKQPPLSTVPADGYILELDDG-NGGQFREVYVGKETMCTVDGLHFNSTYNARVKAFNKTGVSPYSKTLVLQTSE 531 (699)
T ss_pred ceEEEeecCCCCCCCCcceEEEeecC-CCCceeEEEecCceeEEecceecchhHHHHHHHhhccCCCcccceeEeeecc
Confidence 5667777443 4688999998664 4677888889999999999999999999999999999999999888777754
No 30
>COG4733 Phage-related protein, tail component [Function unknown]
Probab=96.56 E-value=0.03 Score=52.45 Aligned_cols=132 Identities=14% Similarity=0.127 Sum_probs=76.0
Q ss_pred ceEEEecCCceEEEecCCCCCCeEEEEEEEEecccccCC-CCcEEEec--CCCCCCCCCCCeEEeecCCeEEEEeecCCC
Q psy7018 172 KYKSIYNGEAQSCRIQDLKPGTDYAVCVQVHLEEIVGIA-SEPTLFTT--PPCEPDQPNPPKLVTKTRTSLALKWNAAVD 248 (349)
Q Consensus 172 ~~~~~~~~~~~~~~i~~L~p~~~Y~~~V~a~~~~g~~~~-s~~~~~~t--~~~~P~~p~~~~~~~~~~~sv~l~W~~~~~ 248 (349)
.|..........+.+.+|.+| .|.++|+|.|..|.... .....|.. ...+|..+..+....++ -.+.|.|-.|.
T Consensus 650 ~~vta~rt~~~~~~~~gi~~G-qY~i~VrAiN~~g~~~~~a~s~~f~i~g~~~Ppp~~~t~~a~~it-~~~~l~v~dPt- 726 (952)
T COG4733 650 NWITAPRTSAAGFDVEGIPAG-QYAIRVRAINVFEPNSPDATAYEFALNGKKVPPPKAMIYDAVIIT-LVIRLVVGDPT- 726 (952)
T ss_pred ceEecccccccceeecCcCcc-ceEEEEEEeeccCCCCCCcceeEEEecCCCCCCCcccccceEEEE-eeeeEEEecCC-
Confidence 555554566778999999995 89999999998775433 22333333 22233333333333222 35678888775
Q ss_pred CCCcccEEEEEEecCCCCCCcEEeecCCcc--eEEEcCCCCCCeEEEEEEEecCCCCCCCC
Q psy7018 249 NGAHVMHYILESDQGNATGDFKEISKSKNK--NFTLSKLTPSTCFRFRLAAVNQHGKSADA 307 (349)
Q Consensus 249 ~~~~i~~Y~v~~~~~~~~~~~~~~~~~~~~--~~~~~~L~p~t~Y~~~V~A~~~~G~~~~s 307 (349)
...++..=++++....+ .+....+.+... ...-.+|++|..|.|++++.|..|.+.+-
T Consensus 727 ~~~d~~~sei~~s~~~d-~~~~Ar~LG~~~~~~~~~~~i~~g~~~~F~~R~Vn~vG~~~~~ 786 (952)
T COG4733 727 GAVDITSTEIRSAVIAD-GNFQARSLGNLNYPGLFSVGIQAGLTFWFRNRNVDLVGNNDKW 786 (952)
T ss_pred cceEEeeeeeeeecccc-chhHHhhhhccccccccccCcCCCceEEEEeeecccccccccc
Confidence 22333333444433221 111111222222 12225899999999999999999976543
No 31
>KOG4806|consensus
Probab=96.56 E-value=0.39 Score=40.51 Aligned_cols=128 Identities=14% Similarity=0.181 Sum_probs=76.6
Q ss_pred EEecCCCCCCeEEEEEEEEecccccCCCCcEEEec---CCCCCCCCCCCeEEee--cCCeEEEEeecCCCCCCcccEEEE
Q psy7018 184 CRIQDLKPGTDYAVCVQVHLEEIVGIASEPTLFTT---PPCEPDQPNPPKLVTK--TRTSLALKWNAAVDNGAHVMHYIL 258 (349)
Q Consensus 184 ~~i~~L~p~~~Y~~~V~a~~~~g~~~~s~~~~~~t---~~~~P~~p~~~~~~~~--~~~sv~l~W~~~~~~~~~i~~Y~v 258 (349)
+.-..+.++..|.+++...|+.-. .....+...| ..+-|.-|.+-.+... +=++.++.|...++. . ..|-|
T Consensus 286 ~~Qfvv~~~~~~~~rf~~~ndde~-~~~v~V~aSt~~t~~~~P~LP~dTtVk~v~r~CSsAtIaW~gs~d~--~-~kyCI 361 (454)
T KOG4806|consen 286 IRQFVVLPGERYLMRFEPSNDDEA-LQKVMVAASTEATFRDLPELPQDTTVKNVRRRCSSATIAWNGSPDE--E-LKYCI 361 (454)
T ss_pred eeEEEecCCCceEEEEEecCCchh-hheeEeeeecccccCcCCCCCCCceEeeeccccchheeeeccCcch--h-eeEEE
Confidence 334457788899999988876321 1111111122 2334677877776554 467899999877642 2 25666
Q ss_pred EEecCCCCCC------------------------cEEeec-----CCcceEEEcCCCCCCeEEEEEEEecCCCCCCCCce
Q psy7018 259 ESDQGNATGD------------------------FKEISK-----SKNKNFTLSKLTPSTCFRFRLAAVNQHGKSADATS 309 (349)
Q Consensus 259 ~~~~~~~~~~------------------------~~~~~~-----~~~~~~~~~~L~p~t~Y~~~V~A~~~~G~~~~s~~ 309 (349)
+.....+.+. |..... .+...-+|.||.||..|.+.|.|.-..|...+-..
T Consensus 362 y~~~~~~~e~~v~~~~N~C~g~~~~~~s~~v~c~y~hs~~~q~~~~~i~teTI~gL~PgssYlldv~a~~~~g~~lpyqa 441 (454)
T KOG4806|consen 362 YVFNLPQRERSVVDFTNYCMGFVPKRVSQYVYCEYMHSRERQQSPDNIETETILGLMPGSSYLLDVTANLSMGKPLPYQA 441 (454)
T ss_pred EEecccchhhhhhhhhccccCccccceeEEEeEEEecChhhhcchhhhhhhhhcccccCceEEEEEEEcccCCcccccee
Confidence 6554432110 111100 11124568899999999999999988887766555
Q ss_pred EEeecc
Q psy7018 310 VAEWLS 315 (349)
Q Consensus 310 ~~~~t~ 315 (349)
+.+.|.
T Consensus 442 ~~v~t~ 447 (454)
T KOG4806|consen 442 LTVHTA 447 (454)
T ss_pred EEEEec
Confidence 666554
No 32
>PF09294 Interfer-bind: Interferon-alpha/beta receptor, fibronectin type III; InterPro: IPR015373 Members of this family adopt a secondary structure consisting of seven beta-strands arranged in an immunoglobulin-like beta-sandwich, in a Greek-key topology. They are required for binding to interferon-alpha []. ; PDB: 1A21_A 3LQM_B 3ELA_T 1AHW_C 2A2Q_T 1TFH_B 1FAK_T 1WSS_T 1W2K_T 2FIR_T ....
Probab=96.46 E-value=0.014 Score=41.10 Aligned_cols=60 Identities=30% Similarity=0.347 Sum_probs=38.5
Q ss_pred CeeeEEEeeCCCCCceEEEecCCceEEEecCCCCCCeEEEEEEEEec--ccccCCCCcEEEec
Q psy7018 158 PGVPPYVSDKGRDGKYKSIYNGEAQSCRIQDLKPGTDYAVCVQVHLE--EIVGIASEPTLFTT 218 (349)
Q Consensus 158 ~~Y~v~~~~~~~~~~~~~~~~~~~~~~~i~~L~p~~~Y~~~V~a~~~--~g~~~~s~~~~~~t 218 (349)
..|.|.|+..+.. ............+.|.+|.|++.|.++|++... ...|.+|....+.|
T Consensus 44 ~~Y~v~~~~~~~~-~~~~~~~~~~~~~~l~~L~p~t~YCv~V~~~~~~~~~~s~~S~~~C~~t 105 (106)
T PF09294_consen 44 LSYNVSYWKNGSN-EKKKEIETKNSSVTLSDLKPGTNYCVSVQAFSPSQNKNSQPSEPQCITT 105 (106)
T ss_dssp -EEEEEEEETTTS-CEEEEEESSSEEEEEES--TTSEEEEEEEEEECSSTEEEEEBSEEEEE-
T ss_pred eEEEEEEEeCCCc-cceEEEeecCCEEEEeCCCCCCCEEEEEEEEeccCCCcCCCCCCEeEeC
Confidence 4699998886644 233334566677899999999999999999432 23455566555443
No 33
>PF07495 Y_Y_Y: Y_Y_Y domain; InterPro: IPR011123 This region is mostly found at the end of the beta propellers (IPR011110 from INTERPRO) in a family of two component regulators. However they are also found tandemly repeated in Q891H4 from SWISSPROT without other signal conduction domains being present. It is named after the conserved tyrosines found in the alignment. The exact function is not known.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=95.26 E-value=0.1 Score=32.88 Aligned_cols=37 Identities=16% Similarity=0.410 Sum_probs=24.7
Q ss_pred CCcEEeecCCcceEEEcCCCCCCeEEEEEEEecCCCCCC
Q psy7018 267 GDFKEISKSKNKNFTLSKLTPSTCFRFRLAAVNQHGKSA 305 (349)
Q Consensus 267 ~~~~~~~~~~~~~~~~~~L~p~t~Y~~~V~A~~~~G~~~ 305 (349)
.+|........ .+.+.+|.|| .|.|.|+|.+..|.-.
T Consensus 19 ~~W~~~~~~~~-~~~~~~L~~G-~Y~l~V~a~~~~~~~~ 55 (66)
T PF07495_consen 19 DEWITLGSYSN-SISYTNLPPG-KYTLEVRAKDNNGKWS 55 (66)
T ss_dssp SSEEEESSTS--EEEEES--SE-EEEEEEEEEETTS-B-
T ss_pred CeEEECCCCcE-EEEEEeCCCE-EEEEEEEEECCCCCcC
Confidence 44655433332 8899999999 8999999999887543
No 34
>PF01108 Tissue_fac: Tissue factor; PDB: 3OG4_B 3OG6_B 1FYH_E 1FG9_D 1JRH_I 3DGC_R 3DLQ_R 1LQS_R 1Y6M_R 1J7V_R ....
Probab=95.13 E-value=0.24 Score=34.87 Aligned_cols=63 Identities=16% Similarity=0.159 Sum_probs=45.9
Q ss_pred eEEEEEEecC--CCCCeeeEEEeeCCCCCceEEE---ecCCceEEEecCCC--CCCeEEEEEEEEeccccc
Q psy7018 145 TIILTLYFRT--AQVPGVPPYVSDKGRDGKYKSI---YNGEAQSCRIQDLK--PGTDYAVCVQVHLEEIVG 208 (349)
Q Consensus 145 ~~~l~w~~~~--~~~~~Y~v~~~~~~~~~~~~~~---~~~~~~~~~i~~L~--p~~~Y~~~V~a~~~~g~~ 208 (349)
...|.|+..+ .....|.|+|+.. ..+.|..+ .....+.+.+++.. +...|.++|+|..+...+
T Consensus 36 ~~iL~W~~~~~~~~~~~ytVq~~~~-~~~~W~~v~~C~~i~~~~Cdlt~~~~~~~~~Y~~rV~A~~~~~~S 105 (107)
T PF01108_consen 36 KHILRWDPGPGSPPNVTYTVQYKKY-GSSSWKDVPGCQNITETSCDLTDETSDPSESYYARVRAEVGNQTS 105 (107)
T ss_dssp EEEEEEEESTTSSSTEEEEEEEEES-STSCEEEECCEEEESSSEEECTTCCTTTTSEEEEEEEEEETTEEE
T ss_pred ceEEEeCCCCCCCCCeEEEEEEEec-CCcceeeccceecccccceeCcchhhcCcCCEEEEEEEEeCCccC
Confidence 6788998743 4567899999932 35677655 34556788888754 688999999999875543
No 35
>KOG4228|consensus
Probab=95.08 E-value=0.16 Score=49.50 Aligned_cols=112 Identities=16% Similarity=0.131 Sum_probs=80.0
Q ss_pred EEEEecccccCCCCcEEEecCCCCCCCCCCCeEEeecCCeEEEEeecCCCCCCcccEEEEEEecCCCCCCcEEeecCCcc
Q psy7018 199 VQVHLEEIVGIASEPTLFTTPPCEPDQPNPPKLVTKTRTSLALKWNAAVDNGAHVMHYILESDQGNATGDFKEISKSKNK 278 (349)
Q Consensus 199 V~a~~~~g~~~~s~~~~~~t~~~~P~~p~~~~~~~~~~~sv~l~W~~~~~~~~~i~~Y~v~~~~~~~~~~~~~~~~~~~~ 278 (349)
+.+....|.++.+..+.+++..+.|..|. ...+...++++|.++. ...+..|++.....+. .....+......
T Consensus 144 ~~~~~~~~~~~~~t~i~i~~~~~~p~~~~----~~~~~~~it~~w~~~~--~~~~~~ykl~~~~~d~-~~~~~v~~~~~~ 216 (1087)
T KOG4228|consen 144 PLAIASPGLGPPSTYIQITTNANSPIQPG----EEEEYTTITGSWSPPH--AVSLDTYKLLHLDPDT-GYEISVTLTPPG 216 (1087)
T ss_pred ccccCCcccCCCCceEEEeccCCCCCCCC----cceEEEEEEecCCCCC--cccchhhhhhhcCCcc-cceeeeeccCCC
Confidence 66667777777777778888877666555 5566789999999886 4567788887776532 223344444555
Q ss_pred eEEEcCCCCCCeEEEEEEEecCCCCCCCCceEEeeccCc
Q psy7018 279 NFTLSKLTPSTCFRFRLAAVNQHGKSADATSVAEWLSYS 317 (349)
Q Consensus 279 ~~~~~~L~p~t~Y~~~V~A~~~~G~~~~s~~~~~~t~~~ 317 (349)
.....++.+...|.++-.+.+..+.+.....+...+..+
T Consensus 217 ~~~t~~~~~~~~~~~~~ae~~~~~~~~~~a~i~~r~~~s 255 (1087)
T KOG4228|consen 217 SGGTGDLGPPSISRFKCAEPDRGPLGSSTAEIEARQLQS 255 (1087)
T ss_pred CCcccCCCCcccccccccCccccccCCCCcceEEEehhh
Confidence 666778888899999999998888777666677666554
No 36
>PF07495 Y_Y_Y: Y_Y_Y domain; InterPro: IPR011123 This region is mostly found at the end of the beta propellers (IPR011110 from INTERPRO) in a family of two component regulators. However they are also found tandemly repeated in Q891H4 from SWISSPROT without other signal conduction domains being present. It is named after the conserved tyrosines found in the alignment. The exact function is not known.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=94.57 E-value=0.21 Score=31.45 Aligned_cols=48 Identities=17% Similarity=0.153 Sum_probs=31.1
Q ss_pred eeEEEeeCCCCCceEEEecCCceEEEecCCCCCCeEEEEEEEEecccccC
Q psy7018 160 VPPYVSDKGRDGKYKSIYNGEAQSCRIQDLKPGTDYAVCVQVHLEEIVGI 209 (349)
Q Consensus 160 Y~v~~~~~~~~~~~~~~~~~~~~~~~i~~L~p~~~Y~~~V~a~~~~g~~~ 209 (349)
...+|+-.+.+..|....... ..+.+.+|.|| .|.|.|+|.+..|...
T Consensus 8 ~~Y~Y~l~g~d~~W~~~~~~~-~~~~~~~L~~G-~Y~l~V~a~~~~~~~~ 55 (66)
T PF07495_consen 8 IRYRYRLEGFDDEWITLGSYS-NSISYTNLPPG-KYTLEVRAKDNNGKWS 55 (66)
T ss_dssp EEEEEEEETTESSEEEESSTS--EEEEES--SE-EEEEEEEEEETTS-B-
T ss_pred eEEEEEEECCCCeEEECCCCc-EEEEEEeCCCE-EEEEEEEEECCCCCcC
Confidence 334454455567787663332 28999999999 8999999999876543
No 37
>PF09067 EpoR_lig-bind: Erythropoietin receptor, ligand binding; InterPro: IPR015152 Members of this entry include the growth hormone and erythropoietin receptors. The latter interacts with erythropoietin (EPO), with subsequent initiation of the downstream chain of events associated with binding of EPO to the receptor, including EPO-induced erythroblast proliferation and differentiation through induction of the JAK2/STAT5 signalling cascade. The domain adopts a secondary structure composed of a short amino-terminal helix, followed by two beta-sandwich regions []. ; PDB: 3NCB_B 3NCF_B 3NCE_B 3N0P_B 3NCC_B 1BP3_B 3N06_B 3MZG_B 3D48_R 1F6F_C ....
Probab=94.15 E-value=0.53 Score=32.84 Aligned_cols=78 Identities=14% Similarity=0.276 Sum_probs=54.4
Q ss_pred CCCCCCCeEEeecCCeEEEEeecCCCCCCcccEEEEEEecCCCCCCcEEeecC---Ccce-----EEEc--CCCCCCeEE
Q psy7018 223 PDQPNPPKLVTKTRTSLALKWNAAVDNGAHVMHYILESDQGNATGDFKEISKS---KNKN-----FTLS--KLTPSTCFR 292 (349)
Q Consensus 223 P~~p~~~~~~~~~~~sv~l~W~~~~~~~~~i~~Y~v~~~~~~~~~~~~~~~~~---~~~~-----~~~~--~L~p~t~Y~ 292 (349)
|..|.+++........++..|++....+.. ..|.+.|.... ..+..+..- ...+ +.+. +..-.+.|.
T Consensus 8 p~~P~~~~C~S~~~etftC~W~~g~~~~l~-~~y~L~Y~~~~--~~~~eCp~~~~~~~ns~~~~~C~F~~~~t~lf~~y~ 84 (104)
T PF09067_consen 8 PEKPENLKCFSREMETFTCFWEPGSEGNLP-TNYTLFYKKEG--EEWKECPDYSTSGPNSTVRHICYFPKSDTSLFVPYC 84 (104)
T ss_dssp CCCCEEEEEEBSSSS-EEEEEEEESSSTST-CEEEEEEEETT--SEEEEESESSTTETTEEEEEEEEE-CCGCSSSSEEE
T ss_pred CCCCccCccCCCCCCcEEEEeeCCCCCCCC-CcEEEEEEeCC--CCCccCCCeEecCCCCceeEEEEcCCCCeEEEEEEE
Confidence 567777777778888999999998754433 34999998765 446565321 1223 4454 778899999
Q ss_pred EEEEEecCCCC
Q psy7018 293 FRLAAVNQHGK 303 (349)
Q Consensus 293 ~~V~A~~~~G~ 303 (349)
++|.|.+..|.
T Consensus 85 i~V~a~~~~~~ 95 (104)
T PF09067_consen 85 IQVEATNALGS 95 (104)
T ss_dssp EEEEEEETTEE
T ss_pred EEEEeccCCCc
Confidence 99999998773
No 38
>PLN02533 probable purple acid phosphatase
Probab=94.12 E-value=0.55 Score=42.23 Aligned_cols=84 Identities=7% Similarity=0.145 Sum_probs=50.8
Q ss_pred CCCCCCCCeEEeecCCeEEEEeecCCCCCCcccEEEEEEecCCCCCCcE-----Eeec-------CCcceEEEcCCCCCC
Q psy7018 222 EPDQPNPPKLVTKTRTSLALKWNAAVDNGAHVMHYILESDQGNATGDFK-----EISK-------SKNKNFTLSKLTPST 289 (349)
Q Consensus 222 ~P~~p~~~~~~~~~~~sv~l~W~~~~~~~~~i~~Y~v~~~~~~~~~~~~-----~~~~-------~~~~~~~~~~L~p~t 289 (349)
.+..|..+++.-...+++.++|....... . .|+|....+.-... ..+. +-...+.|++|+|+|
T Consensus 40 ~~~~P~qvhls~~~~~~m~V~W~T~~~~~-~----~V~yG~~~~~l~~~a~g~~~~~~~~~~~~~g~iH~v~l~~L~p~T 114 (427)
T PLN02533 40 DPTHPDQVHISLVGPDKMRISWITQDSIP-P----SVVYGTVSGKYEGSANGTSSSYHYLLIYRSGQINDVVIGPLKPNT 114 (427)
T ss_pred CCCCCceEEEEEcCCCeEEEEEECCCCCC-C----EEEEecCCCCCcceEEEEEEEEeccccccCCeEEEEEeCCCCCCC
Confidence 35678888887777889999998775322 2 24443322110000 0010 112357899999999
Q ss_pred eEEEEEEEecCCCCCCCCceEEeeccC
Q psy7018 290 CFRFRLAAVNQHGKSADATSVAEWLSY 316 (349)
Q Consensus 290 ~Y~~~V~A~~~~G~~~~s~~~~~~t~~ 316 (349)
.|.++|.. ...|....|+|++
T Consensus 115 ~Y~Yrvg~------~~~s~~~~F~T~p 135 (427)
T PLN02533 115 VYYYKCGG------PSSTQEFSFRTPP 135 (427)
T ss_pred EEEEEECC------CCCccceEEECCC
Confidence 99999831 1235667888854
No 39
>TIGR00864 PCC polycystin cation channel protein. Note: this model has been restricted to the amino half because for technical reasons.
Probab=94.08 E-value=12 Score=41.51 Aligned_cols=33 Identities=18% Similarity=0.168 Sum_probs=23.9
Q ss_pred eEEEcCCCCCCeEEEEEEEecCCCCCCCCceEEe
Q psy7018 279 NFTLSKLTPSTCFRFRLAAVNQHGKSADATSVAE 312 (349)
Q Consensus 279 ~~~~~~L~p~t~Y~~~V~A~~~~G~~~~s~~~~~ 312 (349)
.+.+.-.++| .|.++|.|.|..|...-+..+.+
T Consensus 1661 ~~~vt~~~pG-~Y~VtL~aSN~vG~~t~s~~i~v 1693 (2740)
T TIGR00864 1661 GAKLNPLEAG-PCDIFLQAANLLGQATADCTIDF 1693 (2740)
T ss_pred eeEEecCCCc-eEEEEEEEeecccceeeEEEEEE
Confidence 4555567887 89999999999987654544433
No 40
>KOG3632|consensus
Probab=93.89 E-value=0.51 Score=45.44 Aligned_cols=192 Identities=15% Similarity=0.096 Sum_probs=107.6
Q ss_pred CCceEEEEeeecCCCCCCCCcceeeeCCCCCCCCCccccccccCCCeeEEEecCCCCCCCCceeEEEEecCCCCC-ceee
Q psy7018 10 DHKVSTEDRAGNSAGWSPFSETSVITTPPGPPGSLTLRKTSVTTPTSLTVSWSEPANHGSPILHYVLETSDPAQP-SITC 88 (349)
Q Consensus 10 ~~~y~~~v~a~n~~G~~~~s~~~~~~t~~~~P~~~~~~~~~~~~~~si~l~W~~~~~~~~~~~~Y~v~~~~~~~~-~~~~ 88 (349)
.++|.+.|.+..+.|.......+.. ..-.+|..|..+++..++..+..++|-+...+.. .+.|.+..+. ....
T Consensus 642 a~s~~isvq~ltSrGsqd~lrc~Ll-vgg~a~vvpsqlrv~n~tqtSa~itwvp~nsn~~-----Hviyln~eE~~ps~a 715 (1335)
T KOG3632|consen 642 AHSGYISVQRLTSRGSQDQLRCILL-VGGAAPVVPSQLRVWNATQTSAMITWVPFNSNFL-----HVIYLNAEEPRPSVA 715 (1335)
T ss_pred CCceeeehhhhhccCCCCcceeeEe-ccccccccchhhhhhhhhchhhheeeeecCCCcc-----eeeecCCccCCCchh
Confidence 3567777778888887654333322 2334556777777778999999999988743211 2233333332 2233
Q ss_pred cccEEEEcCCCCCCEEEEEEEEEeecC---------CcccceeeeeeecccccCCCCCCCcc----ccceEEEEEEecC-
Q psy7018 89 DTTSYQLTGLSPHTTYKSVCQVYLISD---------KHTQKVDLTFESDQLYLSGLPNSEWP----SLRTIILTLYFRT- 154 (349)
Q Consensus 89 ~~~~~~i~~L~p~t~Y~~~v~~~~~~~---------~~~~~~~~~~~~~~~~~~~~p~~~~~----~~~~~~l~w~~~~- 154 (349)
....+++-+|.|++.|..+|-+.-.-. .+.....+.+.+....++.+|..... ......+.|..+.
T Consensus 716 ~~y~ytf~~lrpgt~y~a~vea~~p~q~pwdl~~v~~etr~atv~f~tLpAGppappldV~vE~g~spg~l~vswrPptl 795 (1335)
T KOG3632|consen 716 EMYNYTFMRLRPGTDYWASVEAALPRQEPWDLRMVPMETRQATVLFRTLPAGPPAPPLDVKVETGGSPGRLEVSWRPPTL 795 (1335)
T ss_pred hhhHHHHhccCCCCccceecccccCcCCCcccccchhhhhccceeeecccCCCCCCchheeeecCCCCceeeeeccCcee
Confidence 466788899999999998765332201 11223555666666655555543322 2234455554221
Q ss_pred --------CCCCeeeEEEeeCCCCCceEEEe--cCCceEEEecCCCC-CCeEEEEEEEEecccccCCC
Q psy7018 155 --------AQVPGVPPYVSDKGRDGKYKSIY--NGEAQSCRIQDLKP-GTDYAVCVQVHLEEIVGIAS 211 (349)
Q Consensus 155 --------~~~~~Y~v~~~~~~~~~~~~~~~--~~~~~~~~i~~L~p-~~~Y~~~V~a~~~~g~~~~s 211 (349)
-.+.+|.|+.. +..+..+. ....+...+..|+- ...-.+.|+.....|++.-|
T Consensus 796 dsag~sngv~vtgYavyad----gqkv~Evafptagst~VelsQlq~~l~~~~V~vRtms~~gesvds 859 (1335)
T KOG3632|consen 796 DSAGCSNGVAVTGYAVYAD----GQKVEEVAFPTAGSTKVELSQLQDGLYHGAVGVRTMSVPGESVDS 859 (1335)
T ss_pred ccccccCceeeeeeeeeeC----CceeeeeecccCCceEEEeeeehhhheecceeEEecccCcccccc
Confidence 24678998764 23333333 33344455555554 33345666666666654443
No 41
>TIGR00868 hCaCC calcium-activated chloride channel protein 1. distributions. found a row in 1A13.INFO that was not parsed out
Probab=93.64 E-value=7.9 Score=38.07 Aligned_cols=82 Identities=15% Similarity=0.229 Sum_probs=49.2
Q ss_pred CCCCeEEeecCCeEEEEeecCCC--CCCcccEEEEEEecCCCC--CCcEE--ee---------cCCcceEEE--cC--CC
Q psy7018 226 PNPPKLVTKTRTSLALKWNAAVD--NGAHVMHYILESDQGNAT--GDFKE--IS---------KSKNKNFTL--SK--LT 286 (349)
Q Consensus 226 p~~~~~~~~~~~sv~l~W~~~~~--~~~~i~~Y~v~~~~~~~~--~~~~~--~~---------~~~~~~~~~--~~--L~ 286 (349)
..+|++. ..+..|.|+|..|-+ ..+...+|.|.+...-.. ..+.. .+ -++...+.+ .+ .+
T Consensus 759 ItDL~~~-~~~~~v~LsWTAPG~d~D~G~a~~y~ir~s~~~~~l~~~f~~a~~vn~~~~~P~~ags~e~~~f~~~~~~~~ 837 (863)
T TIGR00868 759 ITDLEAG-FQGDNIILTWTAPGDVLDHGRADRYIIRISTSILDLRDDFNDATQVNTTDLIPKEANSKEVFVFKPEGIPIE 837 (863)
T ss_pred ceeeEEe-ecCCEEEEEeeCCCccCCCCccceEEEEecCCHHHHHhhhccccccccCCcCCCCCCceeEEEEeCCccccc
Confidence 3445543 445669999999975 467788999998653100 00100 01 122224444 34 33
Q ss_pred CCCeEEEEEEEecCCC-CCCCCc
Q psy7018 287 PSTCFRFRLAAVNQHG-KSADAT 308 (349)
Q Consensus 287 p~t~Y~~~V~A~~~~G-~~~~s~ 308 (349)
.++.|.|.|+|.|..| .|..|.
T Consensus 838 ~~~~~~~ai~a~d~~~~~s~~sn 860 (863)
T TIGR00868 838 NGTDLFIAVQAIDKANLTSEVSN 860 (863)
T ss_pred CCeEEEEEEEEEccccccccccc
Confidence 6889999999999987 555554
No 42
>TIGR00864 PCC polycystin cation channel protein. Note: this model has been restricted to the amino half because for technical reasons.
Probab=92.26 E-value=22 Score=39.57 Aligned_cols=104 Identities=15% Similarity=0.104 Sum_probs=52.9
Q ss_pred CeEEEEEEEEecccccCCCCcEEEecCCCCCCCCCCCeEEeecC-------CeEEEEeecCCCCCCcccEEEEEEecCCC
Q psy7018 193 TDYAVCVQVHLEEIVGIASEPTLFTTPPCEPDQPNPPKLVTKTR-------TSLALKWNAAVDNGAHVMHYILESDQGNA 265 (349)
Q Consensus 193 ~~Y~~~V~a~~~~g~~~~s~~~~~~t~~~~P~~p~~~~~~~~~~-------~sv~l~W~~~~~~~~~i~~Y~v~~~~~~~ 265 (349)
-.|.++|.|.|..|....+..+.+ .+ +...+.+..... ....+.+......+..+ .|...+....
T Consensus 1754 G~y~VtVta~N~vss~~~s~~V~V--qe----pV~GL~i~~~~~~~~~~~~ag~~v~F~a~vstGsnV-sw~W~f~~g~- 1825 (2740)
T TIGR00864 1754 GLHLVTMKAFNELGSANASEEVDV--QE----PISGLKIRAADAGEQNFFAADSSVCFQGELATGTNV-SWCWAIDGGS- 1825 (2740)
T ss_pred ceEEEEEEEEccccccceeEEEEE--Ee----ccccceEecCCCCcccceecCcEEEEEEEccCCCee-EEEEEeCCCC-
Confidence 389999999999875543333322 22 333444432111 12234443332222222 5555443211
Q ss_pred CCCcEEeecCCcceEEEcCCCCCCeEEEEEEEecCCCCCCCCceEEee
Q psy7018 266 TGDFKEISKSKNKNFTLSKLTPSTCFRFRLAAVNQHGKSADATSVAEW 313 (349)
Q Consensus 266 ~~~~~~~~~~~~~~~~~~~L~p~t~Y~~~V~A~~~~G~~~~s~~~~~~ 313 (349)
.. .+ ......-.++| .|.++|.|.|..+.-..+..+.+.
T Consensus 1826 -s~-----~g--k~v~~Tf~~aG-~ytV~L~AsN~vs~~~~s~~~~VQ 1864 (2740)
T TIGR00864 1826 -SK-----MG--KHACMTFPDAG-TFAIRLNASNAVSGKSASREFFAE 1864 (2740)
T ss_pred -cc-----cc--ceeEEecCCCe-EEEEEEEEEcccCcceeeeeEEEE
Confidence 10 11 23334445666 899999999998755545444443
No 43
>KOG1948|consensus
Probab=91.72 E-value=11 Score=36.41 Aligned_cols=120 Identities=14% Similarity=0.166 Sum_probs=62.6
Q ss_pred CceEEEecCCCCCCeEEEEEEEEecccccCCCCcEEEecCCCCCCCCCCCeEEe-ecCCeEEEEeecCCCCCCcccEE-E
Q psy7018 180 EAQSCRIQDLKPGTDYAVCVQVHLEEIVGIASEPTLFTTPPCEPDQPNPPKLVT-KTRTSLALKWNAAVDNGAHVMHY-I 257 (349)
Q Consensus 180 ~~~~~~i~~L~p~~~Y~~~V~a~~~~g~~~~s~~~~~~t~~~~P~~p~~~~~~~-~~~~sv~l~W~~~~~~~~~i~~Y-~ 257 (349)
..-.|.|.+|.|+..|.++|.+.-+...-+.+.+..+....+. .-..++.+.. .......|+-.-..+.+..+..- .
T Consensus 944 enG~yRiRGL~Pdc~Y~V~vk~~~~n~~iers~P~s~tv~vgn-eDv~glnf~af~q~kttdit~~V~~~~ne~l~sl~v 1022 (1165)
T KOG1948|consen 944 ENGTYRIRGLLPDCEYQVHVKSYADNSPIERSFPRSFTVSVGN-EDVKGLNFMAFIQAKTTDITVEVGMDTNEELQSLRV 1022 (1165)
T ss_pred cCCcEEEeccCCCceEEEEEeeccCCCcccccCCceEEEEecc-cccCCceEEEEeccceEEEEEEEcccccccccceEE
Confidence 3457999999999999999998743212122222222221111 1233343322 22333334443333333444433 3
Q ss_pred EEEecCCCCCCcEEeecCCcceEEEcCCCCCCeEEEEEEEecC
Q psy7018 258 LESDQGNATGDFKEISKSKNKNFTLSKLTPSTCFRFRLAAVNQ 300 (349)
Q Consensus 258 v~~~~~~~~~~~~~~~~~~~~~~~~~~L~p~t~Y~~~V~A~~~ 300 (349)
+.++..........+..+....+...-+.+|..|.+++.+.-.
T Consensus 1023 v~yKs~nddspv~sv~~gql~~ffp~l~~dg~~yvV~l~Stlp 1065 (1165)
T KOG1948|consen 1023 VIYKSNNDDSPVASVVAGQLLHFFPNLPRDGVEYVVRLESTLP 1065 (1165)
T ss_pred EEEecCCCCCcceEEeccceeeeccccCCCCceEEEEEeccCC
Confidence 3444444333344445555555555556788899998877654
No 44
>KOG0613|consensus
Probab=90.95 E-value=9.5 Score=38.56 Aligned_cols=253 Identities=14% Similarity=0.117 Sum_probs=121.2
Q ss_pred ccCCCeeEEEecCCCCCC-CCceeEEEEecCCCC----CceeecccEEEEcCCCCCCEEEEEEEEEeecCCcccceeeee
Q psy7018 51 VTTPTSLTVSWSEPANHG-SPILHYVLETSDPAQ----PSITCDTTSYQLTGLSPHTTYKSVCQVYLISDKHTQKVDLTF 125 (349)
Q Consensus 51 ~~~~~si~l~W~~~~~~~-~~~~~Y~v~~~~~~~----~~~~~~~~~~~i~~L~p~t~Y~~~v~~~~~~~~~~~~~~~~~ 125 (349)
.+...-+...|..+..++ ..+..|.|+.+.... .........-.+..+. |+++|+.....+.......+..
T Consensus 150 ~~~~~~v~~~~~~~~~d~~~k~~~~iie~~~~g~s~~~~~l~~~~~~s~~~~a~----~e~~v~~v~s~gqp~~~~~~~~ 225 (1205)
T KOG0613|consen 150 LVHRARVLPAWRPPREDGGRKREEYIIERRQAGRSPWLKRLTQPTDDSTDTDAH----YESRVRAVVSAGQPEPLERWEI 225 (1205)
T ss_pred eecCceecccccCCccCccceeecceEEEEeccCCCcccccccccCCcceeccc----cceeEEEEEecCCCccchhhhh
Confidence 457788889999887764 466788887644211 1111111111222222 4554443322222222211211
Q ss_pred eecccccCCCCCCCc---cccceEEEEEEecCCC----CCeeeEEEeeCCCCCceEEEe--cCCceEEEecCCCCCCeEE
Q psy7018 126 ESDQLYLSGLPNSEW---PSLRTIILTLYFRTAQ----VPGVPPYVSDKGRDGKYKSIY--NGEAQSCRIQDLKPGTDYA 196 (349)
Q Consensus 126 ~~~~~~~~~~p~~~~---~~~~~~~l~w~~~~~~----~~~Y~v~~~~~~~~~~~~~~~--~~~~~~~~i~~L~p~~~Y~ 196 (349)
......+...+.... .......++|....++ +.+|.+..........+.... .....+.+.-.-..+..|.
T Consensus 226 ~g~~i~~~~~~~~s~t~e~s~~~i~~~l~~~~G~~~~~~~~~l~e~~~~~~~~t~~i~~~~~~~~~s~t~~~~~~~~q~~ 305 (1205)
T KOG0613|consen 226 LGELIAPGSAPSMSVTLEASDRKIQLTLRAPRGPYSAHILGYLIEQRKHKGGNTKTIQNDGPEPELSWTVADKRQGCQDR 305 (1205)
T ss_pred cccccCCCCccccccccchhheeeeEEeecCCCchHHHHhcchhhhcccCCCccccccccCCccccceeeecccCCcccc
Confidence 111111222222111 1122344555544332 334444332211111221111 2223344444556677888
Q ss_pred EEEEEEecccccCCCCcE---EEecCCC-CCCCCCCCeEEeecCCeEEEEeecCCC-CCCcccEEEEEEecCCCCCCcEE
Q psy7018 197 VCVQVHLEEIVGIASEPT---LFTTPPC-EPDQPNPPKLVTKTRTSLALKWNAAVD-NGAHVMHYILESDQGNATGDFKE 271 (349)
Q Consensus 197 ~~V~a~~~~g~~~~s~~~---~~~t~~~-~P~~p~~~~~~~~~~~sv~l~W~~~~~-~~~~i~~Y~v~~~~~~~~~~~~~ 271 (349)
+.+...-..|.+.+.... .+..... ++..+....+.....+.+++.|.-... ....+.+|...-..... ..+..
T Consensus 306 ~~v~~~~~i~~~~~~p~~~~~aaa~~~~~~v~~~~~~~v~a~d~~~v~m~~~~~~~~~p~~~~~~~~~~~~~~~-~~~~~ 384 (1205)
T KOG0613|consen 306 FKVTEEAPIGKGKPGPESLQAAAARPPEVPVGLVRNLSVTARDNTLVEMPTALSGTQKPDEAQGYHGEEVSSES-LGALP 384 (1205)
T ss_pred eeeeEEeeccccccCchhhhhhccCCccccccccccceeccccCcceeecccccCCcCCchheeeccccccccc-ccccc
Confidence 888877666655442222 2222111 123445556666678888999976642 12334455544332221 12223
Q ss_pred eec--CCcceEEEcCCCCCCeEEEEEEEecCCCCCCCCc
Q psy7018 272 ISK--SKNKNFTLSKLTPSTCFRFRLAAVNQHGKSADAT 308 (349)
Q Consensus 272 ~~~--~~~~~~~~~~L~p~t~Y~~~V~A~~~~G~~~~s~ 308 (349)
+.. .....+.+.++-++..|..++.|.|..|......
T Consensus 385 ~~~~~~ts~~~~~~~~~~~e~~~~~~~a~n~~g~~~~~~ 423 (1205)
T KOG0613|consen 385 CPVGTVTSHTYTIKDAGPGEGYFTRVPAVNKGGTEQTIA 423 (1205)
T ss_pred cccccccCcchhhhhcCcccccccccceeccCCcccccc
Confidence 333 2344777889999999999999999998665443
No 45
>KOG4228|consensus
Probab=90.14 E-value=0.61 Score=45.71 Aligned_cols=89 Identities=28% Similarity=0.265 Sum_probs=57.2
Q ss_pred EEEeeecCCCCCCCCcceeeeCCCCCCCCCccccccccCCCeeEEEecCCCCCCCCceeEEEEecCCCC---Ccee-ecc
Q psy7018 15 TEDRAGNSAGWSPFSETSVITTPPGPPGSLTLRKTSVTTPTSLTVSWSEPANHGSPILHYVLETSDPAQ---PSIT-CDT 90 (349)
Q Consensus 15 ~~v~a~n~~G~~~~s~~~~~~t~~~~P~~~~~~~~~~~~~~si~l~W~~~~~~~~~~~~Y~v~~~~~~~---~~~~-~~~ 90 (349)
+.+.+++..|.++.+..+.+++.++.|..+. ...+..+++++|.++.. ..+..|++-..+... .... ...
T Consensus 142 ~~~~~~~~~~~~~~~t~i~i~~~~~~p~~~~----~~~~~~~it~~w~~~~~--~~~~~ykl~~~~~d~~~~~~v~~~~~ 215 (1087)
T KOG4228|consen 142 VGPLAIASPGLGPPSTYIQITTNANSPIQPG----EEEEYTTITGSWSPPHA--VSLDTYKLLHLDPDTGYEISVTLTPP 215 (1087)
T ss_pred ccccccCCcccCCCCceEEEeccCCCCCCCC----cceEEEEEEecCCCCCc--ccchhhhhhhcCCcccceeeeeccCC
Confidence 3377888888888888888888888886666 34688999999998853 567777775543211 1111 123
Q ss_pred cEEEEcCCCCCCEEEEEEE
Q psy7018 91 TSYQLTGLSPHTTYKSVCQ 109 (349)
Q Consensus 91 ~~~~i~~L~p~t~Y~~~v~ 109 (349)
.+....++.|-..|.+++.
T Consensus 216 ~~~~t~~~~~~~~~~~~~a 234 (1087)
T KOG4228|consen 216 GSGGTGDLGPPSISRFKCA 234 (1087)
T ss_pred CCCcccCCCCccccccccc
Confidence 3344445666666666443
No 46
>KOG4806|consensus
Probab=90.06 E-value=2.7 Score=35.70 Aligned_cols=101 Identities=15% Similarity=0.200 Sum_probs=56.1
Q ss_pred CCCceEEEEeeecCCCCCC-----CCcceeeeCCCCCCCCCccccccccCCCeeEEEecCCCCCCCCceeEEEEecC---
Q psy7018 9 TDHKVSTEDRAGNSAGWSP-----FSETSVITTPPGPPGSLTLRKTSVTTPTSLTVSWSEPANHGSPILHYVLETSD--- 80 (349)
Q Consensus 9 p~~~y~~~v~a~n~~G~~~-----~s~~~~~~t~~~~P~~~~~~~~~~~~~~si~l~W~~~~~~~~~~~~Y~v~~~~--- 80 (349)
|..+|.+|++-.|+-..-. .|....++..|..|.+-....+ .-+-+++++.|....+. -..|.|....
T Consensus 293 ~~~~~~~rf~~~ndde~~~~v~V~aSt~~t~~~~P~LP~dTtVk~v-~r~CSsAtIaW~gs~d~---~~kyCIy~~~~~~ 368 (454)
T KOG4806|consen 293 PGERYLMRFEPSNDDEALQKVMVAASTEATFRDLPELPQDTTVKNV-RRRCSSATIAWNGSPDE---ELKYCIYVFNLPQ 368 (454)
T ss_pred CCCceEEEEEecCCchhhheeEeeeecccccCcCCCCCCCceEeee-ccccchheeeeccCcch---heeEEEEEecccc
Confidence 5677888888777643211 1334444455555544333321 23668999999977532 3344442211
Q ss_pred ------------------------------CC-CCceeecccEEEEcCCCCCCEEEEEEEEEee
Q psy7018 81 ------------------------------PA-QPSITCDTTSYQLTGLSPHTTYKSVCQVYLI 113 (349)
Q Consensus 81 ------------------------------~~-~~~~~~~~~~~~i~~L~p~t~Y~~~v~~~~~ 113 (349)
+. ......+-.+-+|.||.||..|.+-|.+...
T Consensus 369 ~e~~v~~~~N~C~g~~~~~~s~~v~c~y~hs~~~q~~~~~i~teTI~gL~PgssYlldv~a~~~ 432 (454)
T KOG4806|consen 369 RERSVVDFTNYCMGFVPKRVSQYVYCEYMHSRERQQSPDNIETETILGLMPGSSYLLDVTANLS 432 (454)
T ss_pred hhhhhhhhhccccCccccceeEEEeEEEecChhhhcchhhhhhhhhcccccCceEEEEEEEccc
Confidence 00 0001112345678899999999997766544
No 47
>COG4733 Phage-related protein, tail component [Function unknown]
Probab=89.84 E-value=2.5 Score=40.42 Aligned_cols=119 Identities=10% Similarity=0.006 Sum_probs=59.9
Q ss_pred ecccEEEEcCCCCCCEEEEEEEEEeecCCcccce-eeeeeec-ccccCCCCCCCcc--ccceEEEEEEecCCCCCeeeEE
Q psy7018 88 CDTTSYQLTGLSPHTTYKSVCQVYLISDKHTQKV-DLTFESD-QLYLSGLPNSEWP--SLRTIILTLYFRTAQVPGVPPY 163 (349)
Q Consensus 88 ~~~~~~~i~~L~p~t~Y~~~v~~~~~~~~~~~~~-~~~~~~~-~~~~~~~p~~~~~--~~~~~~l~w~~~~~~~~~Y~v~ 163 (349)
+....+.+.+|.+| +|.++|++.+..+..+... +..+.-. ...|+..+..... ..-...+.|-.+. ..|.+.
T Consensus 657 t~~~~~~~~gi~~G-qY~i~VrAiN~~g~~~~~a~s~~f~i~g~~~Ppp~~~t~~a~~it~~~~l~v~dPt---~~~d~~ 732 (952)
T COG4733 657 TSAAGFDVEGIPAG-QYAIRVRAINVFEPNSPDATAYEFALNGKKVPPPKAMIYDAVIITLVIRLVVGDPT---GAVDIT 732 (952)
T ss_pred ccccceeecCcCcc-ceEEEEEEeeccCCCCCCcceeEEEecCCCCCCCcccccceEEEEeeeeEEEecCC---cceEEe
Confidence 45788999999995 7999998776555443333 3322221 1111111111111 1113334444433 334443
Q ss_pred EeeCC----CCCce--EEEecCCceEEEecCCCCCCeEEEEEEEEecccccCC
Q psy7018 164 VSDKG----RDGKY--KSIYNGEAQSCRIQDLKPGTDYAVCVQVHLEEIVGIA 210 (349)
Q Consensus 164 ~~~~~----~~~~~--~~~~~~~~~~~~i~~L~p~~~Y~~~V~a~~~~g~~~~ 210 (349)
+.... .+... ...-........-.+|+++..|.|++++++..|...+
T Consensus 733 ~sei~~s~~~d~~~~Ar~LG~~~~~~~~~~~i~~g~~~~F~~R~Vn~vG~~~~ 785 (952)
T COG4733 733 STEIRSAVIADGNFQARSLGNLNYPGLFSVGIQAGLTFWFRNRNVDLVGNNDK 785 (952)
T ss_pred eeeeeeeccccchhHHhhhhccccccccccCcCCCceEEEEeeeccccccccc
Confidence 32210 01111 0111111222233689999999999999998886443
No 48
>KOG1948|consensus
Probab=89.58 E-value=12 Score=36.27 Aligned_cols=114 Identities=16% Similarity=0.225 Sum_probs=54.7
Q ss_pred ccEEEEcCCCCCCEEEEEEEEEeecCC--cccceeeeeeecccccCCCCCCCccccceEEEEEEe---cCCCCCee-eEE
Q psy7018 90 TTSYQLTGLSPHTTYKSVCQVYLISDK--HTQKVDLTFESDQLYLSGLPNSEWPSLRTIILTLYF---RTAQVPGV-PPY 163 (349)
Q Consensus 90 ~~~~~i~~L~p~t~Y~~~v~~~~~~~~--~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~l~w~~---~~~~~~~Y-~v~ 163 (349)
+..|.|.+|.|+..|.+++.....++. .+.+.+.+..-..-...+..-..........++..- .+..+..- .+.
T Consensus 945 nG~yRiRGL~Pdc~Y~V~vk~~~~n~~iers~P~s~tv~vgneDv~glnf~af~q~kttdit~~V~~~~ne~l~sl~vv~ 1024 (1165)
T KOG1948|consen 945 NGTYRIRGLLPDCEYQVHVKSYADNSPIERSFPRSFTVSVGNEDVKGLNFMAFIQAKTTDITVEVGMDTNEELQSLRVVI 1024 (1165)
T ss_pred CCcEEEeccCCCceEEEEEeeccCCCcccccCCceEEEEecccccCCceEEEEeccceEEEEEEEcccccccccceEEEE
Confidence 677999999999999996655432221 122222222211111111111111222333444431 22333333 334
Q ss_pred EeeCCCCCceEEEecCCceEEEecCCCCCCeEEEEEEEEe
Q psy7018 164 VSDKGRDGKYKSIYNGEAQSCRIQDLKPGTDYAVCVQVHL 203 (349)
Q Consensus 164 ~~~~~~~~~~~~~~~~~~~~~~i~~L~p~~~Y~~~V~a~~ 203 (349)
|+....+.....+..+....|.-.-+++|..|-+++.+..
T Consensus 1025 yKs~nddspv~sv~~gql~~ffp~l~~dg~~yvV~l~Stl 1064 (1165)
T KOG1948|consen 1025 YKSNNDDSPVASVVAGQLLHFFPNLPRDGVEYVVRLESTL 1064 (1165)
T ss_pred EecCCCCCcceEEeccceeeeccccCCCCceEEEEEeccC
Confidence 4443323333344344444444455678889998887654
No 49
>KOG1225|consensus
Probab=88.99 E-value=1.1 Score=40.79 Aligned_cols=114 Identities=17% Similarity=0.189 Sum_probs=76.4
Q ss_pred EEEecCCceEEEecCCCCCCeEEEEEEEEeccccc-CCCCcEEEecCCCCCCCCCCCeEEeecCCeEEEEeecCCCCCCc
Q psy7018 174 KSIYNGEAQSCRIQDLKPGTDYAVCVQVHLEEIVG-IASEPTLFTTPPCEPDQPNPPKLVTKTRTSLALKWNAAVDNGAH 252 (349)
Q Consensus 174 ~~~~~~~~~~~~i~~L~p~~~Y~~~V~a~~~~g~~-~~s~~~~~~t~~~~P~~p~~~~~~~~~~~sv~l~W~~~~~~~~~ 252 (349)
..........+.+..|+|+..|..+++++-..-.+ +.....+ +.. .-+..+.+.....+++.+.|..+.. .
T Consensus 410 ~~r~~~~~~~~~~~~~~~g~~~~~~~~~v~~~~~~~~~~~~~~--~~~---~~~g~~~v~~~~~~s~e~~g~~~s~---~ 481 (525)
T KOG1225|consen 410 QGRVPGDANSVDIQGLEPGDEYNCSVNTVAANIGSLPKDKSET--TVL---CWNGGLCVDGETESSLEVGGPCPSS---G 481 (525)
T ss_pred ceeeccceeeeeeeeecCCcceeeehhhhhhhhccCCcccccc--eEe---ecCCceeeeeeeeccccccCCCCCc---c
Confidence 33446777889999999999999999998543322 2222111 111 4566788888899999999998863 3
Q ss_pred ccEEEEEEecCCCCCCcEEeecCCcceEEEcCCCCCCeEEEEE
Q psy7018 253 VMHYILESDQGNATGDFKEISKSKNKNFTLSKLTPSTCFRFRL 295 (349)
Q Consensus 253 i~~Y~v~~~~~~~~~~~~~~~~~~~~~~~~~~L~p~t~Y~~~V 295 (349)
...+.+.+.................+.+...+|.+++.|.+.+
T Consensus 482 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~c~~~~~~~ 524 (525)
T KOG1225|consen 482 TCGWEVRCGPCGNDGGVNAEPPPECTSYDRTGLGPCTEYEVSV 524 (525)
T ss_pred ccceEEEeeecCcccccccCCCCCCCCCCccCcccccceeccc
Confidence 4467777733322222333333456688889999999998765
No 50
>PF13750 Big_3_3: Bacterial Ig-like domain (group 3)
Probab=87.87 E-value=10 Score=28.84 Aligned_cols=31 Identities=13% Similarity=0.332 Sum_probs=23.4
Q ss_pred EcCCCCCCeEEEEEEEecCCCCCCCCceEEee
Q psy7018 282 LSKLTPSTCFRFRLAAVNQHGKSADATSVAEW 313 (349)
Q Consensus 282 ~~~L~p~t~Y~~~V~A~~~~G~~~~s~~~~~~ 313 (349)
+..|+.+..|.+.|.|.|..|... ...+.|.
T Consensus 116 fpsle~~~~YtLtV~a~D~aGN~~-~~si~F~ 146 (158)
T PF13750_consen 116 FPSLEADDSYTLTVSATDKAGNQS-TKSISFS 146 (158)
T ss_pred cCCcCCCCeEEEEEEEEecCCCEE-EEEEEEE
Confidence 457889999999999999998543 2334443
No 51
>PLN02533 probable purple acid phosphatase
Probab=87.42 E-value=4.3 Score=36.60 Aligned_cols=64 Identities=19% Similarity=0.320 Sum_probs=39.1
Q ss_pred CCCCCccccccccCCCeeEEEecCCCCCCCCceeEEEEecCCC-C--Cce--------------eecccEEEEcCCCCCC
Q psy7018 40 PPGSLTLRKTSVTTPTSLTVSWSEPANHGSPILHYVLETSDPA-Q--PSI--------------TCDTTSYQLTGLSPHT 102 (349)
Q Consensus 40 ~P~~~~~~~~~~~~~~si~l~W~~~~~~~~~~~~Y~v~~~~~~-~--~~~--------------~~~~~~~~i~~L~p~t 102 (349)
.|..|.-+.++-.++++++|+|...... .+. |+|+... . ... ..-.....|++|+|+|
T Consensus 40 ~~~~P~qvhls~~~~~~m~V~W~T~~~~-~~~----V~yG~~~~~l~~~a~g~~~~~~~~~~~~~g~iH~v~l~~L~p~T 114 (427)
T PLN02533 40 DPTHPDQVHISLVGPDKMRISWITQDSI-PPS----VVYGTVSGKYEGSANGTSSSYHYLLIYRSGQINDVVIGPLKPNT 114 (427)
T ss_pred CCCCCceEEEEEcCCCeEEEEEECCCCC-CCE----EEEecCCCCCcceEEEEEEEEeccccccCCeEEEEEeCCCCCCC
Confidence 4455666655555789999999987532 233 3343211 0 000 0112457899999999
Q ss_pred EEEEEE
Q psy7018 103 TYKSVC 108 (349)
Q Consensus 103 ~Y~~~v 108 (349)
.|.++|
T Consensus 115 ~Y~Yrv 120 (427)
T PLN02533 115 VYYYKC 120 (427)
T ss_pred EEEEEE
Confidence 999976
No 52
>PF14292 SusE: SusE outer membrane protein
Probab=84.96 E-value=6.9 Score=28.19 Aligned_cols=60 Identities=20% Similarity=0.367 Sum_probs=39.2
Q ss_pred cCCCeeEEEecCCCCCCC-CceeEEEEecCCC---CCceee--c---ccEEEEc-----------CCCCCCEEEEEEEEE
Q psy7018 52 TTPTSLTVSWSEPANHGS-PILHYVLETSDPA---QPSITC--D---TTSYQLT-----------GLSPHTTYKSVCQVY 111 (349)
Q Consensus 52 ~~~~si~l~W~~~~~~~~-~~~~Y~v~~~~~~---~~~~~~--~---~~~~~i~-----------~L~p~t~Y~~~v~~~ 111 (349)
.....++++|+++..++. -...|.|+..... ...... . ..+..++ ++.|+....+.+|+.
T Consensus 42 ~~~~a~tftW~~~~~~~~~a~v~Y~lq~~~~~~~F~~~~~~~~~~~~~~s~~~t~~eLN~~l~~~g~~~~~~~~l~~rV~ 121 (122)
T PF14292_consen 42 ASDNAVTFTWTAADYGGPDAPVTYTLQFDKKGNDFSSPVEIVTTDNGSTSVSITVKELNSILLKLGLEPGEAGTLYWRVK 121 (122)
T ss_pred cCCceEEEEEECCccCCCCCceEEEEEEeccCCCccCcEEEEeecCcceeEEecHHHHHHHHHHcCCCCCceEEEEEEEE
Confidence 355699999999977654 5568999886521 111111 1 2444543 688999988888764
No 53
>TIGR00868 hCaCC calcium-activated chloride channel protein 1. distributions. found a row in 1A13.INFO that was not parsed out
Probab=83.92 E-value=8.3 Score=37.92 Aligned_cols=40 Identities=18% Similarity=0.444 Sum_probs=31.4
Q ss_pred CCCCCccccccccCCCeeEEEecCCCCC--CCCceeEEEEecC
Q psy7018 40 PPGSLTLRKTSVTTPTSLTVSWSEPANH--GSPILHYVLETSD 80 (349)
Q Consensus 40 ~P~~~~~~~~~~~~~~si~l~W~~~~~~--~~~~~~Y~v~~~~ 80 (349)
||..+..+++. .....|.|+|..|.++ .+....|.|++..
T Consensus 755 PP~rItDL~~~-~~~~~v~LsWTAPG~d~D~G~a~~y~ir~s~ 796 (863)
T TIGR00868 755 PPSKITDLEAG-FQGDNIILTWTAPGDVLDHGRADRYIIRIST 796 (863)
T ss_pred CCccceeeEEe-ecCCEEEEEeeCCCccCCCCccceEEEEecC
Confidence 67778888753 4566699999999773 6788999998865
No 54
>PHA02579 7 baseplate wedge subunit; Provisional
Probab=83.09 E-value=3 Score=39.35 Aligned_cols=73 Identities=14% Similarity=0.205 Sum_probs=49.6
Q ss_pred CccccccccCCCeeEEEecCCCCCCCCceeEEEEecCC----------C---CCceee-cccEEEEcCCCCCCEEEEEEE
Q psy7018 44 LTLRKTSVTTPTSLTVSWSEPANHGSPILHYVLETSDP----------A---QPSITC-DTTSYQLTGLSPHTTYKSVCQ 109 (349)
Q Consensus 44 ~~~~~~~~~~~~si~l~W~~~~~~~~~~~~Y~v~~~~~----------~---~~~~~~-~~~~~~i~~L~p~t~Y~~~v~ 109 (349)
+..+++...+.+.+.|+|+.. +.-..|.|+.... . +....- ..+.+--+.|.|.+.|.+||+
T Consensus 8 vtslrI~kLsaN~v~l~WddV----G~NFyY~Ve~a~t~~~~g~~ip~~~~~w~nlg~T~~~~wFed~~~p~t~Yk~Rv~ 83 (1030)
T PHA02579 8 VTSLRIDKLSANQVYLTWDDV----GANFYYFVELAETRDADGELIPDDELRWINLGYTANNEWFEDKLQPNTYYKFRVA 83 (1030)
T ss_pred ccEEEhhhhccceEEEEeecc----CCceEEEEEEEeeccCCCccCCCccccceecCcccchhhhhhccCCcceEEEEEE
Confidence 455666678999999999988 5568899988531 1 111111 233444445999999999999
Q ss_pred EEeecCCcccc
Q psy7018 110 VYLISDKHTQK 120 (349)
Q Consensus 110 ~~~~~~~~~~~ 120 (349)
+...+-..+.+
T Consensus 84 ~~~qGFe~SdW 94 (1030)
T PHA02579 84 VAAQGFEQSDW 94 (1030)
T ss_pred eeccCCCcccc
Confidence 87776555555
No 55
>cd05735 Ig8_DSCAM Eight immunoglobulin (Ig) domain of Down Syndrome Cell Adhesion molecule (DSCAM). Ig8_DSCAM: the eight immunoglobulin (Ig) domain of Down Syndrome Cell Adhesion molecule (DSCAM). DSCAM is a cell adhesion molecule expressed largely in the developing nervous system. The gene encoding DSCAM is located at human chromosome 21q22, the locus associated with the mental retardation phenotype of Down Syndrome. DSCAM is predicted to be the largest member of the IG superfamily. It has been demonstrated that DSCAM can mediate cation-independent homophilic intercellular adhesion.
Probab=82.79 E-value=1.6 Score=29.38 Aligned_cols=34 Identities=24% Similarity=0.175 Sum_probs=26.9
Q ss_pred CCCCceEEEEeeecCCCCCCCCcceeeeCCCCCC
Q psy7018 8 CTDHKVSTEDRAGNSAGWSPFSETSVITTPPGPP 41 (349)
Q Consensus 8 ~p~~~y~~~v~a~n~~G~~~~s~~~~~~t~~~~P 41 (349)
-..+.-.|.|.|.|..|.......+.+...|++|
T Consensus 55 ~~~D~G~YtC~A~N~~G~~~~~~~L~V~~~P~~P 88 (88)
T cd05735 55 VREDSGFFSCHAINSYGEDRGIIQLTVQEPPDPP 88 (88)
T ss_pred CcccCEEEEEEEEcCCCcceEEEEEEEeCCCCCC
Confidence 3566778999999999987777777777776665
No 56
>PF08329 ChitinaseA_N: Chitinase A, N-terminal domain; InterPro: IPR013540 This domain is found in a number of bacterial chitinases and similar viral proteins. It is organised into a fibronectin III module domain-like fold, comprising only beta strands. Its function is not known, but it may be involved in interaction with the enzyme substrate, chitin [, ]. It is separated by a hinge region from the catalytic domain (IPR001223 from INTERPRO); this hinge region is probably mobile, allowing the N-terminal domain to have different relative positions in solution []. ; GO: 0004568 chitinase activity; PDB: 2WLY_A 1EDQ_A 2WM0_A 1X6N_A 1NH6_A 2WK2_A 1EHN_A 2WLZ_A 1EIB_A 1FFR_A ....
Probab=82.57 E-value=11 Score=27.50 Aligned_cols=66 Identities=15% Similarity=0.164 Sum_probs=34.5
Q ss_pred ceEEEEEE-ecCCCCCeeeEEEeeCCCCCceEEEecC-CceEEEecCCCCCCeEEEEEEEEecccccCCCCcEEE
Q psy7018 144 RTIILTLY-FRTAQVPGVPPYVSDKGRDGKYKSIYNG-EAQSCRIQDLKPGTDYAVCVQVHLEEIVGIASEPTLF 216 (349)
Q Consensus 144 ~~~~l~w~-~~~~~~~~Y~v~~~~~~~~~~~~~~~~~-~~~~~~i~~L~p~~~Y~~~V~a~~~~g~~~~s~~~~~ 216 (349)
-...++|. |....-..|.|+. ++.....-.. ...+..+. ...+..|.++|..+|..|.+. |.++.+
T Consensus 38 v~V~VtwN~WsG~~Gd~~kly~-----dG~~V~tG~~~~~~~a~~~-~~~gG~y~~~VeLCN~~GCS~-S~~~~V 105 (133)
T PF08329_consen 38 VDVSVTWNVWSGTNGDTAKLYF-----DGVLVWTGPSPQQKSATFT-VTKGGRYQMQVELCNADGCST-SAPVEV 105 (133)
T ss_dssp EEEEEEEE-SSS---SEEEEEE-----TTEEEEEEE--SEEEEEEE-E-S-EEEEEEEEEEETTEEEE----EEE
T ss_pred eEEEEEEEEecCCCCCEEEEEE-----CCEEEEeCCCccCceEEEE-ecCCCEEEEEEEEECCCCccc-CCCEEE
Confidence 35677887 6666667788875 2222211111 12333333 345669999999999999654 334433
No 57
>cd05762 Ig8_MLCK Eighth immunoglobulin (Ig)-like domain of human myosin light-chain kinase (MLCK). Ig8_MLCK: the eighth immunoglobulin (Ig)-like domain of human myosin light-chain kinase (MLCK). MLCK is a key regulator of different forms of cell motility involving actin and myosin II. Agonist stimulation of smooth muscle cells increases cytosolic Ca2+, which binds calmodulin. This Ca2+-calmodulin complex in turn binds to and activates MLCK. Activated MLCK leads to the phosphorylation of the 20 kDa myosin regulatory light chain (RLC) of myosin II and the stimulation of actin-activated myosin MgATPase activity. MLCK is widely present in vertebrate tissues; it phosphorylates the 20 kDa RLC of both smooth and nonmuscle myosin II. Phosphorylation leads to the activation of the myosin motor domain and altered structural properties of myosin II. In smooth muscle MLCK it is involved in initiating contraction. In nonmuscle cells, MLCK may participate in cell division and cell motility; it has
Probab=82.37 E-value=1.5 Score=30.22 Aligned_cols=34 Identities=18% Similarity=0.192 Sum_probs=28.5
Q ss_pred CCCCceEEEEeeecCCCCCCCCcceeeeCCCCCC
Q psy7018 8 CTDHKVSTEDRAGNSAGWSPFSETSVITTPPGPP 41 (349)
Q Consensus 8 ~p~~~y~~~v~a~n~~G~~~~s~~~~~~t~~~~P 41 (349)
...++-.|.|.|.|..|....+..+.+...|++|
T Consensus 64 ~~~D~G~Ytc~a~N~~G~~~~~~~l~V~~~P~pP 97 (98)
T cd05762 64 QQEHCGCYTLEVENKLGSRQAQVNLTVVDKPDPP 97 (98)
T ss_pred ChhhCEEEEEEEEcCCCceeEEEEEEEecCCCCC
Confidence 3457788999999999987778888888888887
No 58
>PF14292 SusE: SusE outer membrane protein
Probab=81.83 E-value=17 Score=26.19 Aligned_cols=65 Identities=17% Similarity=0.294 Sum_probs=38.4
Q ss_pred ecCCeEEEEeecCCCCCC-cccEEEEEEecCCCC-CCcEEeecCC--cceEEEc-----------CCCCCCeEEEEEEEe
Q psy7018 234 KTRTSLALKWNAAVDNGA-HVMHYILESDQGNAT-GDFKEISKSK--NKNFTLS-----------KLTPSTCFRFRLAAV 298 (349)
Q Consensus 234 ~~~~sv~l~W~~~~~~~~-~i~~Y~v~~~~~~~~-~~~~~~~~~~--~~~~~~~-----------~L~p~t~Y~~~V~A~ 298 (349)
..++.++++|+++..... ....|.|+....++. ..-..+.... .++..++ |+.+|....+.+|..
T Consensus 42 ~~~~a~tftW~~~~~~~~~a~v~Y~lq~~~~~~~F~~~~~~~~~~~~~~s~~~t~~eLN~~l~~~g~~~~~~~~l~~rV~ 121 (122)
T PF14292_consen 42 ASDNAVTFTWTAADYGGPDAPVTYTLQFDKKGNDFSSPVEIVTTDNGSTSVSITVKELNSILLKLGLEPGEAGTLYWRVK 121 (122)
T ss_pred cCCceEEEEEECCccCCCCCceEEEEEEeccCCCccCcEEEEeecCcceeEEecHHHHHHHHHHcCCCCCceEEEEEEEE
Confidence 456789999999974333 345899998764321 1122222221 3344443 778888777776643
No 59
>PF09067 EpoR_lig-bind: Erythropoietin receptor, ligand binding; InterPro: IPR015152 Members of this entry include the growth hormone and erythropoietin receptors. The latter interacts with erythropoietin (EPO), with subsequent initiation of the downstream chain of events associated with binding of EPO to the receptor, including EPO-induced erythroblast proliferation and differentiation through induction of the JAK2/STAT5 signalling cascade. The domain adopts a secondary structure composed of a short amino-terminal helix, followed by two beta-sandwich regions []. ; PDB: 3NCB_B 3NCF_B 3NCE_B 3N0P_B 3NCC_B 1BP3_B 3N06_B 3MZG_B 3D48_R 1F6F_C ....
Probab=80.48 E-value=10 Score=26.44 Aligned_cols=72 Identities=17% Similarity=0.181 Sum_probs=43.7
Q ss_pred CCCCCccccccccCCCeeEEEecCCCCCCCCceeEEEEecCCCCCceee------cccE-----EEEc--CCCCCCEEEE
Q psy7018 40 PPGSLTLRKTSVTTPTSLTVSWSEPANHGSPILHYVLETSDPAQPSITC------DTTS-----YQLT--GLSPHTTYKS 106 (349)
Q Consensus 40 ~P~~~~~~~~~~~~~~si~l~W~~~~~~~~~~~~Y~v~~~~~~~~~~~~------~~~~-----~~i~--~L~p~t~Y~~ 106 (349)
+|..|.++..-+-+-..++-.|++....+.+ ..|.+.|.........+ ..++ +.+. ...-.+.|.|
T Consensus 7 ~p~~P~~~~C~S~~~etftC~W~~g~~~~l~-~~y~L~Y~~~~~~~~eCp~~~~~~~ns~~~~~C~F~~~~t~lf~~y~i 85 (104)
T PF09067_consen 7 PPEKPENLKCFSREMETFTCFWEPGSEGNLP-TNYTLFYKKEGEEWKECPDYSTSGPNSTVRHICYFPKSDTSLFVPYCI 85 (104)
T ss_dssp HCCCCEEEEEEBSSSS-EEEEEEEESSSTST-CEEEEEEEETTSEEEEESESSTTETTEEEEEEEEE-CCGCSSSSEEEE
T ss_pred CCCCCccCccCCCCCCcEEEEeeCCCCCCCC-CcEEEEEEeCCCCCccCCCeEecCCCCceeEEEEcCCCCeEEEEEEEE
Confidence 4566666665566778999999998764433 33888776433211111 1233 4444 7788999999
Q ss_pred EEEEEe
Q psy7018 107 VCQVYL 112 (349)
Q Consensus 107 ~v~~~~ 112 (349)
+|.+.+
T Consensus 86 ~V~a~~ 91 (104)
T PF09067_consen 86 QVEATN 91 (104)
T ss_dssp EEEEEE
T ss_pred EEEecc
Confidence 776554
No 60
>PF13754 Big_3_4: Bacterial Ig-like domain (group 3)
Probab=79.94 E-value=6.8 Score=23.45 Aligned_cols=28 Identities=18% Similarity=0.456 Sum_probs=20.7
Q ss_pred eEEEcCCCCCCeEEEEEEEecCCCCCCCC
Q psy7018 279 NFTLSKLTPSTCFRFRLAAVNQHGKSADA 307 (349)
Q Consensus 279 ~~~~~~L~p~t~Y~~~V~A~~~~G~~~~s 307 (349)
++.+..+ ....|.+.|.|.|..|.....
T Consensus 15 s~t~~~~-~dG~y~itv~a~D~AGN~s~~ 42 (54)
T PF13754_consen 15 SFTVPAL-ADGTYTITVTATDAAGNTSTS 42 (54)
T ss_pred EEeCCCC-CCccEEEEEEEEeCCCCCCCc
Confidence 4455556 466999999999999965444
No 61
>PF09240 IL6Ra-bind: Interleukin-6 receptor alpha chain, binding; InterPro: IPR015321 Members of this entry adopt a structure consisting of an immunoglobulin-like beta-sandwich, with seven strands in two beta-sheets, in a Greek-key topology. They are required for binding to the cytokine Interleukin-6 []. ; PDB: 1N26_A 1P9M_C 3LB6_C 1PVH_A 3L5H_A 1BQU_A 1I1R_A 3QT2_B 3BPN_C 3BPO_C ....
Probab=79.45 E-value=17 Score=25.00 Aligned_cols=78 Identities=9% Similarity=0.051 Sum_probs=48.9
Q ss_pred CCCCeEEeecCCeEEEEeecCCCCCCcccEEEEEEecCCCCCCcEEe--ec---CCcceEEEcCCCC----CCeEEEEEE
Q psy7018 226 PNPPKLVTKTRTSLALKWNAAVDNGAHVMHYILESDQGNATGDFKEI--SK---SKNKNFTLSKLTP----STCFRFRLA 296 (349)
Q Consensus 226 p~~~~~~~~~~~sv~l~W~~~~~~~~~i~~Y~v~~~~~~~~~~~~~~--~~---~~~~~~~~~~L~p----~t~Y~~~V~ 296 (349)
|.++...-.+...+..+|.+-...... ..|.++++...... ...+ +. +....+.+..... ...|.|.|.
T Consensus 2 ~~nlsC~~~~~~~m~CtW~~g~~~~~~-t~y~L~~~~~~~~~-~~eC~~y~~~~~~~~gC~~~~~~~~~~~~~~~~v~V~ 79 (99)
T PF09240_consen 2 PQNLSCFIYNLEYMNCTWEPGKEAPPD-TQYTLYYWYSPLEE-EKECPHYSKDSGTRIGCQFPVSEIDSSEFSQYNVCVN 79 (99)
T ss_dssp -EEEEEEEETTTEEEEEEECCTTCSTT-EEEEEEEEETTSSS-EEEESEEEESTSSEEEEEEESCTT-TTTTSEEEEEEE
T ss_pred CeeCEEEEECCEEEEEEECCCCCCCCc-ccEEEEEEcCCCCc-cccCCCccccCCceeEEEecCCCccccccceEEEEEE
Confidence 456666666788999999876533333 37888887764322 2222 21 1234566666555 247999999
Q ss_pred EecCCCCCC
Q psy7018 297 AVNQHGKSA 305 (349)
Q Consensus 297 A~~~~G~~~ 305 (349)
+.+..|.-.
T Consensus 80 ~ss~~~~i~ 88 (99)
T PF09240_consen 80 GSSSAGSIR 88 (99)
T ss_dssp EEETTEEEE
T ss_pred eccCCCccC
Confidence 999887443
No 62
>COG3979 Uncharacterized protein contain chitin-binding domain type 3 [General function prediction only]
Probab=78.94 E-value=5.3 Score=31.16 Aligned_cols=87 Identities=21% Similarity=0.128 Sum_probs=56.6
Q ss_pred CCCCCCCeEEeecCCeEEEEeecCCCCCCcccEEEEEEecCCCCCCcEEeecCCcceEEEcCCCCCCeEEEEEEEecCCC
Q psy7018 223 PDQPNPPKLVTKTRTSLALKWNAAVDNGAHVMHYILESDQGNATGDFKEISKSKNKNFTLSKLTPSTCFRFRLAAVNQHG 302 (349)
Q Consensus 223 P~~p~~~~~~~~~~~sv~l~W~~~~~~~~~i~~Y~v~~~~~~~~~~~~~~~~~~~~~~~~~~L~p~t~Y~~~V~A~~~~G 302 (349)
|..+..+........++.+.|....+ ...+..|.+..+-+ ..........+++.+|.+.+.|.+.+.+.+..|
T Consensus 5 ~~a~~~~~~~~~~~~s~~~~~~s~~~-~~~~~~~p~~~~~d------~~~~~t~~~~~~~~g~~~et~y~~~~~a~~t~g 77 (181)
T COG3979 5 PPAPKSLMLTNVTTTSASLAASSSSD-NKASDTYPYVYAWD------PKTVYTTGTVVTVEGLAPETEYWTLVEAPDTSG 77 (181)
T ss_pred CcCccceeecchhhhhhhhccccccc-cccccCCCcccccC------ccceecccceEEecCcceeeEEEEeeeccCCCC
Confidence 44556667777778889999987621 23344455111111 012233455788999999999999999999766
Q ss_pred -CCCCCceEEeeccC
Q psy7018 303 -KSADATSVAEWLSY 316 (349)
Q Consensus 303 -~~~~s~~~~~~t~~ 316 (349)
.+.++..+...+..
T Consensus 78 ~~s~~~~~~~~~~~~ 92 (181)
T COG3979 78 NWSAWSRLLTVSTSG 92 (181)
T ss_pred ccccceeeeeeccCc
Confidence 66666666666554
No 63
>KOG1225|consensus
Probab=71.03 E-value=15 Score=33.83 Aligned_cols=150 Identities=17% Similarity=0.131 Sum_probs=78.4
Q ss_pred CccccccccCCCeeEEEecCCCCCCCCceeEEEEec----CC--CCCceeecccEEEEcCCCCCCEEEEEEEEEeecCCc
Q psy7018 44 LTLRKTSVTTPTSLTVSWSEPANHGSPILHYVLETS----DP--AQPSITCDTTSYQLTGLSPHTTYKSVCQVYLISDKH 117 (349)
Q Consensus 44 ~~~~~~~~~~~~si~l~W~~~~~~~~~~~~Y~v~~~----~~--~~~~~~~~~~~~~i~~L~p~t~Y~~~v~~~~~~~~~ 117 (349)
+..+.+...++.++.+.|... .....|...+. .. ....+......+.+..|+|+.+|..++++... ..+
T Consensus 369 ~~~~~~~~cs~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~g~~~~~~~~~v~~-~~~ 443 (525)
T KOG1225|consen 369 PSLLLITECSPPSLCIAGVGR----RRVTHCAGTYCPLGESGGDLQGRVPGDANSVDIQGLEPGDEYNCSVNTVAA-NIG 443 (525)
T ss_pred chhhcccccCCCceeeccccc----cccccccccccccccCCCccceeeccceeeeeeeeecCCcceeeehhhhhh-hhc
Confidence 334444456888888888822 22233333221 11 13344556888999999999999997763321 111
Q ss_pred ccceeeeeeecccccCCCCCCCccccceEEEEEEecCCCCCeeeEEEeeCCCCCceEEEecCCceEEEecCCCCCCeEEE
Q psy7018 118 TQKVDLTFESDQLYLSGLPNSEWPSLRTIILTLYFRTAQVPGVPPYVSDKGRDGKYKSIYNGEAQSCRIQDLKPGTDYAV 197 (349)
Q Consensus 118 ~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~l~w~~~~~~~~~Y~v~~~~~~~~~~~~~~~~~~~~~~~i~~L~p~~~Y~~ 197 (349)
+........+... -+.-.........+..+.|.-+......+.+.+...+.............+.+...+|.|++.|.+
T Consensus 444 ~~~~~~~~~~~~~-~~g~~~v~~~~~~s~e~~g~~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~c~~~~~ 522 (525)
T KOG1225|consen 444 SLPKDKSETTVLC-WNGGLCVDGETESSLEVGGPCPSSGTCGWEVRCGPCGNDGGVNAEPPPECTSYDRTGLGPCTEYEV 522 (525)
T ss_pred cCCcccccceEee-cCCceeeeeeeeccccccCCCCCccccceEEEeeecCcccccccCCCCCCCCCCccCcccccceec
Confidence 1111111111000 000000111223344555555556677888888433222222222233667788899999999987
Q ss_pred EE
Q psy7018 198 CV 199 (349)
Q Consensus 198 ~V 199 (349)
.+
T Consensus 523 ~~ 524 (525)
T KOG1225|consen 523 SV 524 (525)
T ss_pred cc
Confidence 65
No 64
>cd02848 Chitinase_N_term Chitinase N-terminus domain. Chitinases hydrolyze the abundant natural biopolymer chitin, producing smaller chito-oligosaccharides. Chitin consists of multiple N-acetyl-D-glucosamine (NAG) residues connected via beta-1,4-glycosidic linkages and is an important structural element of fungal cell wall and arthropod exoskeletons. On the basis of the mode of chitin hydrolysis, chitinases are classified as random, endo-, and exo-chitinases and based on sequence criteria, chitinases belong to families 18 and 19 of glycosyl hydrolases. The N-terminus of chitinase may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitob
Probab=69.59 E-value=35 Score=23.80 Aligned_cols=61 Identities=8% Similarity=0.127 Sum_probs=34.7
Q ss_pred ceEEEEEE-ecCCCCCeeeEEEeeCCCCCceEEEecCCceEEEecCCCCCCeEEEEEEEEeccccc
Q psy7018 144 RTIILTLY-FRTAQVPGVPPYVSDKGRDGKYKSIYNGEAQSCRIQDLKPGTDYAVCVQVHLEEIVG 208 (349)
Q Consensus 144 ~~~~l~w~-~~~~~~~~Y~v~~~~~~~~~~~~~~~~~~~~~~~i~~L~p~~~Y~~~V~a~~~~g~~ 208 (349)
-...+.|. |....-..|.|.+-.. ..+.-...+.....++. ...|-.|.++|.+|+..|.+
T Consensus 34 v~V~V~wnvWsG~~Gd~a~vl~dg~---~V~~G~~~~~~~~at~~-v~kgG~y~m~V~lCn~dGCS 95 (106)
T cd02848 34 ADVSVKWNAWSGDPGDTYKVLLDGK---EVWSGALTGSSGTATFK-VGKGGRYQMQVALCNGDGCS 95 (106)
T ss_pred eEEEEEEeeecCCCCcEEEEEECCe---EEEcccCCCCccEEEEE-eCCCCeEEEEEEEECCCCcc
Confidence 35667777 6666667787766220 00111111222233333 45577999999999998853
No 65
>KOG1378|consensus
Probab=68.61 E-value=37 Score=30.62 Aligned_cols=87 Identities=10% Similarity=0.132 Sum_probs=48.7
Q ss_pred CCCCCCeEEeec-CCeEEEEeecCCCCCCcccEEEEEEecC-----CCC-CCcEEee------cCCcceEEEcCCCCCCe
Q psy7018 224 DQPNPPKLVTKT-RTSLALKWNAAVDNGAHVMHYILESDQG-----NAT-GDFKEIS------KSKNKNFTLSKLTPSTC 290 (349)
Q Consensus 224 ~~p~~~~~~~~~-~~sv~l~W~~~~~~~~~i~~Y~v~~~~~-----~~~-~~~~~~~------~~~~~~~~~~~L~p~t~ 290 (349)
..|..+.+.... .+.+.++|....... .+..|-..-... ++. ..|...+ .+-...+.+.+|++++.
T Consensus 43 ~~peQvhlS~~~~~~~m~VswvT~~~~~-~~V~Yg~~~~~~~~~~~~~~~~~~~~~y~~~~~~sg~ih~~~~~~L~~~t~ 121 (452)
T KOG1378|consen 43 NSPEQVHLSFTDNLNEMRVSWVTGDGEE-NVVRYGEVKDKLDNSAARGMTEAWTDGYANGWRDSGYIHDAVMKNLEPNTR 121 (452)
T ss_pred CCCCeEEEeccCCCCcEEEEEeCCCCCC-ceEEEeecCCCccccccccceEEEecccccccceeeeEeeeeecCCCCCce
Confidence 456777775443 348999998775312 233333221110 000 0111111 11122678899999999
Q ss_pred EEEEEEEecCCCCCCCCceEEeeccC
Q psy7018 291 FRFRLAAVNQHGKSADATSVAEWLSY 316 (349)
Q Consensus 291 Y~~~V~A~~~~G~~~~s~~~~~~t~~ 316 (349)
|.++|-.- ..+|....|+|++
T Consensus 122 YyY~~Gs~-----~~wS~~f~F~t~p 142 (452)
T KOG1378|consen 122 YYYQVGSD-----LKWSEIFSFKTPP 142 (452)
T ss_pred EEEEeCCC-----CCcccceEeECCC
Confidence 99998332 2378889999865
No 66
>PF11344 DUF3146: Protein of unknown function (DUF3146); InterPro: IPR021492 This family of proteins with unknown function appear to be restricted to Cyanobacteria.
Probab=67.96 E-value=4.1 Score=26.00 Aligned_cols=15 Identities=27% Similarity=0.527 Sum_probs=13.5
Q ss_pred CCCCCCeEEEEEEEe
Q psy7018 284 KLTPSTCFRFRLAAV 298 (349)
Q Consensus 284 ~L~p~t~Y~~~V~A~ 298 (349)
.|+||..|.|.|+|.
T Consensus 66 ~LEpGgdY~Ftirak 80 (80)
T PF11344_consen 66 QLEPGGDYSFTIRAK 80 (80)
T ss_pred eccCCCceEEEEecC
Confidence 689999999999984
No 67
>PF08329 ChitinaseA_N: Chitinase A, N-terminal domain; InterPro: IPR013540 This domain is found in a number of bacterial chitinases and similar viral proteins. It is organised into a fibronectin III module domain-like fold, comprising only beta strands. Its function is not known, but it may be involved in interaction with the enzyme substrate, chitin [, ]. It is separated by a hinge region from the catalytic domain (IPR001223 from INTERPRO); this hinge region is probably mobile, allowing the N-terminal domain to have different relative positions in solution []. ; GO: 0004568 chitinase activity; PDB: 2WLY_A 1EDQ_A 2WM0_A 1X6N_A 1NH6_A 2WK2_A 1EHN_A 2WLZ_A 1EIB_A 1FFR_A ....
Probab=65.46 E-value=30 Score=25.39 Aligned_cols=72 Identities=15% Similarity=0.190 Sum_probs=37.4
Q ss_pred cCCeEEEEeecCCCCCCcccEEEEEEecCCCCCCcEEeecCCcceEEEcCCCCCCeEEEEEEEecCCCCCCCCceEEeec
Q psy7018 235 TRTSLALKWNAAVDNGAHVMHYILESDQGNATGDFKEISKSKNKNFTLSKLTPSTCFRFRLAAVNQHGKSADATSVAEWL 314 (349)
Q Consensus 235 ~~~sv~l~W~~~~~~~~~i~~Y~v~~~~~~~~~~~~~~~~~~~~~~~~~~L~p~t~Y~~~V~A~~~~G~~~~s~~~~~~t 314 (349)
..-.|.|+|+-=...++ ..|+|+.... .-+.- .........+.. ..+..|.++|...|..|.+. |..+.+..
T Consensus 36 ~~v~V~VtwN~WsG~~G--d~~kly~dG~---~V~tG-~~~~~~~a~~~~-~~gG~y~~~VeLCN~~GCS~-S~~~~V~V 107 (133)
T PF08329_consen 36 DQVDVSVTWNVWSGTNG--DTAKLYFDGV---LVWTG-PSPQQKSATFTV-TKGGRYQMQVELCNADGCST-SAPVEVVV 107 (133)
T ss_dssp SSEEEEEEEE-SSS-----SEEEEEETTE---EEEEE-E--SEEEEEEEE--S-EEEEEEEEEEETTEEEE----EEEEE
T ss_pred CceEEEEEEEEecCCCC--CEEEEEECCE---EEEeC-CCccCceEEEEe-cCCCEEEEEEEEECCCCccc-CCCEEEEE
Confidence 34578899987763333 3677766432 11111 111223444444 45569999999999999765 34454433
No 68
>PHA02579 7 baseplate wedge subunit; Provisional
Probab=64.87 E-value=25 Score=33.66 Aligned_cols=84 Identities=19% Similarity=0.316 Sum_probs=52.2
Q ss_pred CCCCCeEEeecCCeEEEEeecCCCCCCcccEEEEEEecCC---CC------CCcEEe-ecCCcceEEEcCCCCCCeEEEE
Q psy7018 225 QPNPPKLVTKTRTSLALKWNAAVDNGAHVMHYILESDQGN---AT------GDFKEI-SKSKNKNFTLSKLTPSTCFRFR 294 (349)
Q Consensus 225 ~p~~~~~~~~~~~sv~l~W~~~~~~~~~i~~Y~v~~~~~~---~~------~~~~~~-~~~~~~~~~~~~L~p~t~Y~~~ 294 (349)
+...+++...+.+.+.|+|+.. +.. ..|.|+..+.. +. ..|... +......+.-. ++|.+.|.+|
T Consensus 7 ~vtslrI~kLsaN~v~l~WddV---G~N-FyY~Ve~a~t~~~~g~~ip~~~~~w~nlg~T~~~~wFed~-~~p~t~Yk~R 81 (1030)
T PHA02579 7 SVTSLRIDKLSANQVYLTWDDV---GAN-FYYFVELAETRDADGELIPDDELRWINLGYTANNEWFEDK-LQPNTYYKFR 81 (1030)
T ss_pred cccEEEhhhhccceEEEEeecc---CCc-eEEEEEEEeeccCCCccCCCccccceecCcccchhhhhhc-cCCcceEEEE
Confidence 4456777778899999999987 333 37888876422 11 124443 22333355444 9999999999
Q ss_pred EEEecCCC--CCCCCceEEeec
Q psy7018 295 LAAVNQHG--KSADATSVAEWL 314 (349)
Q Consensus 295 V~A~~~~G--~~~~s~~~~~~t 314 (349)
|+..- .| .|.|-..-.+.|
T Consensus 82 v~~~~-qGFe~SdWv~teef~T 102 (1030)
T PHA02579 82 VAVAA-QGFEQSDWVYTEEFET 102 (1030)
T ss_pred EEeec-cCCCccccEEeeceEe
Confidence 98774 34 444443334444
No 69
>PF09423 PhoD: PhoD-like phosphatase; InterPro: IPR018946 This entry contains a number of putative proteins as well as Alkaline phosphatase D which catalyses the reaction: A phosphate monoester + H(2)O = an alcohol + phosphate ; PDB: 2YEQ_B.
Probab=61.74 E-value=15 Score=33.62 Aligned_cols=34 Identities=12% Similarity=0.178 Sum_probs=17.9
Q ss_pred eEEEcCCCCCCeEEEEEEEecCCCCCCCCceEEeecc
Q psy7018 279 NFTLSKLTPSTCFRFRLAAVNQHGKSADATSVAEWLS 315 (349)
Q Consensus 279 ~~~~~~L~p~t~Y~~~V~A~~~~G~~~~s~~~~~~t~ 315 (349)
.+.+++|+|++.|.+++..-+. +..+..-.++|.
T Consensus 65 ~v~v~gL~p~t~Y~Y~~~~~~~---~~~s~~g~~rT~ 98 (453)
T PF09423_consen 65 KVDVTGLQPGTRYYYRFVVDGG---GQTSPVGRFRTA 98 (453)
T ss_dssp EEEE-S--TT-EEEEEEEE--T---TEE---EEEE--
T ss_pred ecccCCCCCCceEEEEEEEecC---CCCCCceEEEcC
Confidence 6789999999999999999322 233445567775
No 70
>KOG0613|consensus
Probab=60.75 E-value=67 Score=32.98 Aligned_cols=108 Identities=11% Similarity=0.063 Sum_probs=65.9
Q ss_pred CCCCceEEEEeeecCCCCCCCCcceeeeCCCCCC----CCCccccccccCCCeeEEEecCCCCC-CCCceeEEEEecCCC
Q psy7018 8 CTDHKVSTEDRAGNSAGWSPFSETSVITTPPGPP----GSLTLRKTSVTTPTSLTVSWSEPANH-GSPILHYVLETSDPA 82 (349)
Q Consensus 8 ~p~~~y~~~v~a~n~~G~~~~s~~~~~~t~~~~P----~~~~~~~~~~~~~~si~l~W~~~~~~-~~~~~~Y~v~~~~~~ 82 (349)
|++-.+.+++...-..|++.+.......+.+.+| ..+..+.+..-....+.+.|...... ...+.+|...--...
T Consensus 299 ~~~~q~~~~v~~~~~i~~~~~~p~~~~~aaa~~~~~~v~~~~~~~v~a~d~~~v~m~~~~~~~~~p~~~~~~~~~~~~~~ 378 (1205)
T KOG0613|consen 299 RQGCQDRFKVTEEAPIGKGKPGPESLQAAAARPPEVPVGLVRNLSVTARDNTLVEMPTALSGTQKPDEAQGYHGEEVSSE 378 (1205)
T ss_pred cCCcccceeeeEEeeccccccCchhhhhhccCCccccccccccceeccccCcceeecccccCCcCCchheeecccccccc
Confidence 5555778888888888887775444333333333 34445555555677888888766433 233566665433221
Q ss_pred CC------ceeecccEEEEcCCCCCCEEEEEEEEEeecC
Q psy7018 83 QP------SITCDTTSYQLTGLSPHTTYKSVCQVYLISD 115 (349)
Q Consensus 83 ~~------~~~~~~~~~~i~~L~p~t~Y~~~v~~~~~~~ 115 (349)
.. ........+.+.++-++..|..++.+.+..+
T Consensus 379 ~~~~~~~~~~~~ts~~~~~~~~~~~e~~~~~~~a~n~~g 417 (1205)
T KOG0613|consen 379 SLGALPCPVGTVTSHTYTIKDAGPGEGYFTRVPAVNKGG 417 (1205)
T ss_pred cccccccccccccCcchhhhhcCcccccccccceeccCC
Confidence 11 1123367788889999999998777776543
No 71
>cd05762 Ig8_MLCK Eighth immunoglobulin (Ig)-like domain of human myosin light-chain kinase (MLCK). Ig8_MLCK: the eighth immunoglobulin (Ig)-like domain of human myosin light-chain kinase (MLCK). MLCK is a key regulator of different forms of cell motility involving actin and myosin II. Agonist stimulation of smooth muscle cells increases cytosolic Ca2+, which binds calmodulin. This Ca2+-calmodulin complex in turn binds to and activates MLCK. Activated MLCK leads to the phosphorylation of the 20 kDa myosin regulatory light chain (RLC) of myosin II and the stimulation of actin-activated myosin MgATPase activity. MLCK is widely present in vertebrate tissues; it phosphorylates the 20 kDa RLC of both smooth and nonmuscle myosin II. Phosphorylation leads to the activation of the myosin motor domain and altered structural properties of myosin II. In smooth muscle MLCK it is involved in initiating contraction. In nonmuscle cells, MLCK may participate in cell division and cell motility; it has
Probab=59.31 E-value=54 Score=22.36 Aligned_cols=38 Identities=8% Similarity=-0.123 Sum_probs=27.0
Q ss_pred CCceEEEecCCCCCCeEEEEEEEEecccccCCCCcEEE
Q psy7018 179 GEAQSCRIQDLKPGTDYAVCVQVHLEEIVGIASEPTLF 216 (349)
Q Consensus 179 ~~~~~~~i~~L~p~~~Y~~~V~a~~~~g~~~~s~~~~~ 216 (349)
.....+.|.+......-.+.+.|.|..|....+..+.+
T Consensus 53 ~~~s~L~I~~~~~~D~G~Ytc~a~N~~G~~~~~~~l~V 90 (98)
T cd05762 53 ENSSKLTITEGQQEHCGCYTLEVENKLGSRQAQVNLTV 90 (98)
T ss_pred CCeeEEEECCCChhhCEEEEEEEEcCCCceeEEEEEEE
Confidence 34456888888888888889999999887544433333
No 72
>PF13754 Big_3_4: Bacterial Ig-like domain (group 3)
Probab=57.07 E-value=40 Score=20.10 Aligned_cols=36 Identities=19% Similarity=0.150 Sum_probs=23.9
Q ss_pred eEEEecCCCCCCeEEEEEEEEecccccCCCCcEEEec
Q psy7018 182 QSCRIQDLKPGTDYAVCVQVHLEEIVGIASEPTLFTT 218 (349)
Q Consensus 182 ~~~~i~~L~p~~~Y~~~V~a~~~~g~~~~s~~~~~~t 218 (349)
.++.+..+ ..-.|.+.|.|...+|.........|..
T Consensus 14 Ws~t~~~~-~dG~y~itv~a~D~AGN~s~~~~~~~ti 49 (54)
T PF13754_consen 14 WSFTVPAL-ADGTYTITVTATDAAGNTSTSSSVTFTI 49 (54)
T ss_pred EEEeCCCC-CCccEEEEEEEEeCCCCCCCccceeEEE
Confidence 34555555 5669999999999988755543333433
No 73
>PF09402 MSC: Man1-Src1p-C-terminal domain; InterPro: IPR018996 This entry represents the Inner nuclear membrane proteins MAN1 (also known as LEM domain-containing protein 3) and LEM domain-containing protein 2 (or LEM protein 2). Emerin and MAN1 are LEM domain-containing integral membrane proteins of the vertebrate nuclear envelope []. MAN1 is an integral protein of the inner nuclear membrane which binds to chromatin associated proteins and plays a role in nuclear organisation. The C-terminal nulceoplasmic region forms a DNA binding winged helix and binds to Smad []. LEM protein 2 is an essential protein involved in chromosome segregation and cell division, probably via its interaction with lmn-1, the main component of nuclear lamina. Has some overlapping function with emr-1.; GO: 0005639 integral to nuclear inner membrane; PDB: 2CH0_A.
Probab=54.08 E-value=4.2 Score=35.35 Aligned_cols=24 Identities=17% Similarity=0.573 Sum_probs=0.0
Q ss_pred CcccccCCCCceEEEE-eeecCCCC
Q psy7018 2 SMVCQSCTDHKVSTED-RAGNSAGW 25 (349)
Q Consensus 2 ~~~~~~~p~~~y~~~v-~a~n~~G~ 25 (349)
.|+|+|||.+-+++.= .....-|+
T Consensus 49 ~P~C~pCP~~a~C~~~~~~~C~~~y 73 (334)
T PF09402_consen 49 KPSCEPCPEHAICYPGLKLECEPGY 73 (334)
T ss_dssp -------------------------
T ss_pred ccccccccccccccccccccccccc
Confidence 4799999999988877 55555554
No 74
>PF07699 GCC2_GCC3: GCC2 and GCC3; InterPro: IPR011641 Protein phosphorylation, which plays a key role in most cellular activities, is a reversible process mediated by protein kinases and phosphoprotein phosphatases. Protein kinases catalyse the transfer of the gamma phosphate from nucleotide triphosphates (often ATP) to one or more amino acid residues in a protein substrate side chain, resulting in a conformational change affecting protein function. Phosphoprotein phosphatases catalyse the reverse process. Protein kinases fall into three broad classes, characterised with respect to substrate specificity []: Serine/threonine-protein kinases Tyrosine-protein kinases Dual specific protein kinases (e.g. MEK - phosphorylates both Thr and Tyr on target proteins) Protein kinase function has been evolutionarily conserved from Escherichia coli to human []. Protein kinases play a role in a multitude of cellular processes, including division, proliferation, apoptosis, and differentiation []. Phosphorylation usually results in a functional change of the target protein by changing enzyme activity, cellular location, or association with other proteins. The catalytic subunits of protein kinases are highly conserved, and several structures have been solved [], leading to large screens to develop kinase-specific inhibitors for the treatments of a number of diseases []. Tyrosine-protein kinases can transfer a phosphate group from ATP to a tyrosine residue in a protein. These enzymes can be divided into two main groups []: Receptor tyrosine kinases (RTK), which are transmembrane proteins involved in signal transduction; they play key roles in growth, differentiation, metabolism, adhesion, motility, death and oncogenesis []. RTKs are composed of 3 domains: an extracellular domain (binds ligand), a transmembrane (TM) domain, and an intracellular catalytic domain (phosphorylates substrate). The TM domain plays an important role in the dimerisation process necessary for signal transduction []. Cytoplasmic / non-receptor tyrosine kinases, which act as regulatory proteins, playing key roles in cell differentiation, motility, proliferation, and survival. For example, the Src-family of protein-tyrosine kinases []. This entry represents various ephrin type A and B receptors, which have tyrosine kinase activity.
Probab=53.34 E-value=4.4 Score=23.53 Aligned_cols=19 Identities=21% Similarity=0.489 Sum_probs=10.4
Q ss_pred ccccCCCCceEEEEeeecC
Q psy7018 4 VCQSCTDHKVSTEDRAGNS 22 (349)
Q Consensus 4 ~~~~~p~~~y~~~v~a~n~ 22 (349)
.|++||...|.=.......
T Consensus 10 ~C~~Cp~GtYq~~~g~~~C 28 (48)
T PF07699_consen 10 KCQPCPKGTYQDEEGQTSC 28 (48)
T ss_pred ccCCCCCCccCCccCCccC
Confidence 4666666666544444433
No 75
>PF07353 Uroplakin_II: Uroplakin II; InterPro: IPR009952 This family contains uroplakin II, which is approximately 180 residues long and seems to be restricted to mammals. Uroplakin II is an integral membrane protein, and is one of the components of the apical plaques of mammalian urothelium formed by the asymmetric unit membrane - this is believed to play a role in strengthening the urothelial apical surface to prevent the cells from rupturing during bladder distension [].; GO: 0016044 cellular membrane organization, 0030176 integral to endoplasmic reticulum membrane
Probab=51.62 E-value=1e+02 Score=23.26 Aligned_cols=39 Identities=8% Similarity=0.153 Sum_probs=26.9
Q ss_pred cceEEEcCCCCCCeEEEEEEEecCCCCCCCCceEEeeccC
Q psy7018 277 NKNFTLSKLTPSTCFRFRLAAVNQHGKSADATSVAEWLSY 316 (349)
Q Consensus 277 ~~~~~~~~L~p~t~Y~~~V~A~~~~G~~~~s~~~~~~t~~ 316 (349)
...|.+.+|.||+.|+|+=...++.. ...|..+...|.+
T Consensus 101 lsaYqVtNL~pGTkY~isY~Vtkgts-tESS~~i~msT~n 139 (184)
T PF07353_consen 101 LSAYQVTNLQPGTKYYISYLVTKGTS-TESSNEIPMSTLN 139 (184)
T ss_pred ceeEEeeccCCCcEEEEEEEEecCcc-ceecceecccccc
Confidence 34788999999999999877765532 3334455566543
No 76
>PF07353 Uroplakin_II: Uroplakin II; InterPro: IPR009952 This family contains uroplakin II, which is approximately 180 residues long and seems to be restricted to mammals. Uroplakin II is an integral membrane protein, and is one of the components of the apical plaques of mammalian urothelium formed by the asymmetric unit membrane - this is believed to play a role in strengthening the urothelial apical surface to prevent the cells from rupturing during bladder distension [].; GO: 0016044 cellular membrane organization, 0030176 integral to endoplasmic reticulum membrane
Probab=50.94 E-value=25 Score=26.33 Aligned_cols=23 Identities=35% Similarity=0.431 Sum_probs=18.3
Q ss_pred ccEEEEcCCCCCCEEEEEEEEEe
Q psy7018 90 TTSYQLTGLSPHTTYKSVCQVYL 112 (349)
Q Consensus 90 ~~~~~i~~L~p~t~Y~~~v~~~~ 112 (349)
...|.+.||.||++|.|+-.+..
T Consensus 101 lsaYqVtNL~pGTkY~isY~Vtk 123 (184)
T PF07353_consen 101 LSAYQVTNLQPGTKYYISYLVTK 123 (184)
T ss_pred ceeEEeeccCCCcEEEEEEEEec
Confidence 56688999999999999654543
No 77
>PF01422 zf-NF-X1: NF-X1 type zinc finger; InterPro: IPR000967 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents a domain presumed to be a zinc binding domain. The following pattern describes the zinc finger: C-X(1-6)-H-X-C-X3-C(H/C)-X(3-4)-(H/C)-X(1-10)-C where X can be any amino acid, and numbers in brackets indicate the number of residues. The two position can be either His or Cys. This domain is found in the human transcriptional repressor NK-X1, a repressor of HLA-DRA transcription; the Drosophila shuttle craft protein, which plays an essential role during the late stages of embryonic neurogenesis; and a yeast hypothetical protein YNL023C. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus
Probab=49.81 E-value=9.8 Score=17.48 Aligned_cols=9 Identities=22% Similarity=0.715 Sum_probs=6.7
Q ss_pred CcccccCCC
Q psy7018 2 SMVCQSCTD 10 (349)
Q Consensus 2 ~~~~~~~p~ 10 (349)
.+.|+||+.
T Consensus 12 ~G~C~pC~~ 20 (20)
T PF01422_consen 12 PGPCPPCPQ 20 (20)
T ss_pred CCcCCCCCC
Confidence 467999974
No 78
>cd08544 Reeler Reeler, the N-terminal domain of reelin, F-spondin, and a variety of other proteins. This domain is found at the N-terminus of F-spondin, a protein attached to the extracellular matrix, which plays roles in neuronal development and vascular remodelling. The F-spondin reeler domain has been reported to bind heparin. The reeler domain is also found at the N-terminus of reelin, an extracellular glycoprotein involved in the development of the brain cortex, and in a variety of other eukaryotic proteins with different domain architectures, including the animal ferric-chelate reductase 1 or stromal cell-derived receptor 2, a member of the cytochrome B561 family, which reduces ferric iron before its transport from the endosome to the cytoplasm. Also included is the insect putative defense protein 1, which is expressed upon bacterial infection and appears to contain a single reeler domain.
Probab=48.96 E-value=47 Score=24.34 Aligned_cols=22 Identities=27% Similarity=0.542 Sum_probs=16.2
Q ss_pred ccCCCeeEEEecCCCCCCCCce
Q psy7018 51 VTTPTSLTVSWSEPANHGSPIL 72 (349)
Q Consensus 51 ~~~~~si~l~W~~~~~~~~~~~ 72 (349)
......+.+.|.+|....+++.
T Consensus 92 ~~~k~~v~~~W~AP~~~~g~V~ 113 (135)
T cd08544 92 NSKKTSVTFTWTAPSNGSGCVT 113 (135)
T ss_pred CCCCceeEEEEECCCCCCccEE
Confidence 4456799999999987645553
No 79
>PF10342 GPI-anchored: Ser-Thr-rich glycosyl-phosphatidyl-inositol-anchored membrane family; InterPro: IPR018466 This entry represents glycoproteins involved in cell wall (1-->6)-beta-glucan assembly. In yeast a null mutation leads to severe growth defects, aberrant multi-budded morphology, and mating defects [, ]. The entry includes DRMIP and Hesp-379, which are involved in both fruiting body formation and in host attack respectively. Hesp-379 is a haustorially expressed secreted protein; the haustorium being the small sucker that penetrates host tissue [].
Probab=48.65 E-value=79 Score=21.04 Aligned_cols=59 Identities=14% Similarity=0.236 Sum_probs=34.6
Q ss_pred CCeEEEEeecCCCCCCcccEEEEEEecCCCC--CCcEEee---cC--CcceEEE-cCCCCCCeEEEEEEE
Q psy7018 236 RTSLALKWNAAVDNGAHVMHYILESDQGNAT--GDFKEIS---KS--KNKNFTL-SKLTPSTCFRFRLAA 297 (349)
Q Consensus 236 ~~sv~l~W~~~~~~~~~i~~Y~v~~~~~~~~--~~~~~~~---~~--~~~~~~~-~~L~p~t~Y~~~V~A 297 (349)
...++|+|+.... ....|.|........ .....+. .. ....+.+ .+|.++..|.+++..
T Consensus 12 g~~~~I~W~~~~~---~~~~~~I~L~~g~~~~~~~~~~ia~~v~~~~gs~~~~~p~~l~~~~~Y~i~~~~ 78 (93)
T PF10342_consen 12 GQPITITWTSDGT---DPGNVTIYLCNGNNTNLNFVQTIASNVSNSDGSYTWTIPSDLPSGGDYFIQIVN 78 (93)
T ss_pred CCcEEEEEeCCCC---CCcEEEEEEEcCCCCCcceeEEEEecccCCCCEEEEEcCCCCCCCCcEEEEEEE
Confidence 4679999998742 224677777655431 1222222 11 1224444 469999999999883
No 80
>cd02848 Chitinase_N_term Chitinase N-terminus domain. Chitinases hydrolyze the abundant natural biopolymer chitin, producing smaller chito-oligosaccharides. Chitin consists of multiple N-acetyl-D-glucosamine (NAG) residues connected via beta-1,4-glycosidic linkages and is an important structural element of fungal cell wall and arthropod exoskeletons. On the basis of the mode of chitin hydrolysis, chitinases are classified as random, endo-, and exo-chitinases and based on sequence criteria, chitinases belong to families 18 and 19 of glycosyl hydrolases. The N-terminus of chitinase may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitob
Probab=47.10 E-value=98 Score=21.68 Aligned_cols=70 Identities=16% Similarity=0.224 Sum_probs=39.4
Q ss_pred CCeEEEEeecCCCCCCcccEEEEEEecCCCCCCcEEeecCCcceEEEcCCCCCCeEEEEEEEecCCCCCCCCceEEe
Q psy7018 236 RTSLALKWNAAVDNGAHVMHYILESDQGNATGDFKEISKSKNKNFTLSKLTPSTCFRFRLAAVNQHGKSADATSVAE 312 (349)
Q Consensus 236 ~~sv~l~W~~~~~~~~~i~~Y~v~~~~~~~~~~~~~~~~~~~~~~~~~~L~p~t~Y~~~V~A~~~~G~~~~s~~~~~ 312 (349)
.-.|.++|+-=....+ ..|.|.+.... -|.--..+.....++. ...|..|..+|++.|..|.+. |.++.+
T Consensus 33 ~v~V~V~wnvWsG~~G--d~a~vl~dg~~---V~~G~~~~~~~~at~~-v~kgG~y~m~V~lCn~dGCS~-S~~~~I 102 (106)
T cd02848 33 AADVSVKWNAWSGDPG--DTYKVLLDGKE---VWSGALTGSSGTATFK-VGKGGRYQMQVALCNGDGCST-SAAKEI 102 (106)
T ss_pred ceEEEEEEeeecCCCC--cEEEEEECCeE---EEcccCCCCccEEEEE-eCCCCeEEEEEEEECCCCccC-cCCEEE
Confidence 3467888987753233 36776653211 0111111222344444 355679999999999999665 444444
No 81
>PF02010 REJ: REJ domain; InterPro: IPR002859 The REJ (Receptor for Egg Jelly) domain is found in PKD1 P98161 from SWISSPROT and the sperm receptor for egg jelly Q26627 from SWISSPROT. The exact function of this domain is unknown. The domain is 600 amino acids long so is probably composed of multiple structural domains. There are six completely conserved cysteine residues that may form disulphide bridges. This region contains tandem PKD-like domains. Sequence similarity between a region of the autosomal dominant polycystic kidney disease (ADPKD) protein, polycystin-1 and a sea urchin sperm glycoprotein involved in fertilization, the receptor for egg jelly (suREJ) has been known for some time. The suREJ protein binds the glycoprotein coat of the egg (egg jelly), triggering the acrosome reaction, which transforms the sperm into a fusogenic cell. The sequence similarity and expression pattern suggests that the predicted human PKDREJ protein is a mammalian equivalent of the suREJ protein and therefore may have a central role in human fertilization [].; PDB: 2E7M_A 2YRL_A.
Probab=46.64 E-value=6.5 Score=35.66 Aligned_cols=113 Identities=14% Similarity=0.233 Sum_probs=0.0
Q ss_pred CCCCCCeEEEEEEEEecccccCCCCcEEEecCCCCCCCCCCCeEEeec----CCeEEEEeecCCCCCCcccEEEEEEecC
Q psy7018 188 DLKPGTDYAVCVQVHLEEIVGIASEPTLFTTPPCEPDQPNPPKLVTKT----RTSLALKWNAAVDNGAHVMHYILESDQG 263 (349)
Q Consensus 188 ~L~p~~~Y~~~V~a~~~~g~~~~s~~~~~~t~~~~P~~p~~~~~~~~~----~~sv~l~W~~~~~~~~~i~~Y~v~~~~~ 263 (349)
.|.+|..|.|++.+.+..+... ...+.+.+. .|+....+.+.... .+.+++.-..-.+...+ ..|.+.+...
T Consensus 268 ~l~~g~~Y~~~l~v~~~~~~~~-~a~~~~~~n--~pP~~g~c~v~P~~G~al~T~Ft~~c~~~~d~~~p-L~Y~f~~~~~ 343 (440)
T PF02010_consen 268 VLEPGSTYTFRLTVTDSSGSSG-SASISFTVN--APPSGGSCSVSPSSGYALETTFTITCSGWTDDDQP-LTYQFYYSSD 343 (440)
T ss_dssp --------------------------------------------------------------------------------
T ss_pred cccccccccccccccccccccc-ccccccccc--ccccccccccccccccccccccccccccccccccc-cccccccccc
Confidence 5899999999999998766532 223344441 22222233332211 34444443322222233 3677777222
Q ss_pred CCCCCcEEeecCCcceEEEcCCCCC---C--eEEEEEEEecCCCCC
Q psy7018 264 NATGDFKEISKSKNKNFTLSKLTPS---T--CFRFRLAAVNQHGKS 304 (349)
Q Consensus 264 ~~~~~~~~~~~~~~~~~~~~~L~p~---t--~Y~~~V~A~~~~G~~ 304 (349)
.....+..+..+......-.-|.+| . .-.+.|.+.|..|..
T Consensus 344 ~~~~~~~~L~~~~~~~~~~~~LP~g~~~~~~~~~i~v~V~D~~ga~ 389 (440)
T PF02010_consen 344 DGSGSWNLLYEGSSPSSISTYLPPGDENNDYNVTIYVQVCDSLGAC 389 (440)
T ss_dssp ----------------------------------------------
T ss_pred cccccccccccccccccccccccccccccccccccccccccccccc
Confidence 2223344444443333222234444 2 455666777766643
No 82
>KOG3515|consensus
Probab=41.63 E-value=3.5e+02 Score=26.57 Aligned_cols=78 Identities=18% Similarity=0.121 Sum_probs=56.7
Q ss_pred EEEecCCCCCCeEEEEEEEEecccccCCCCcEEEecCCCCCCCCCCCeEEeecCCeEEEEeecCCCCCCcccEEEEEEec
Q psy7018 183 SCRIQDLKPGTDYAVCVQVHLEEIVGIASEPTLFTTPPCEPDQPNPPKLVTKTRTSLALKWNAAVDNGAHVMHYILESDQ 262 (349)
Q Consensus 183 ~~~i~~L~p~~~Y~~~V~a~~~~g~~~~s~~~~~~t~~~~P~~p~~~~~~~~~~~sv~l~W~~~~~~~~~i~~Y~v~~~~ 262 (349)
+-.+-+--++..|.+.+.+.|..|.+..+..+.--+ .|..|.++.+...+...+.|.|.+--+ ++-...|.+.+..
T Consensus 655 ~~~~~~~~~~~~y~~~c~t~n~lg~~~v~~~~~~~t---~~~~~~n~~~~~~~~~~i~l~~~p~fd-gg~~q~f~~~~~~ 730 (741)
T KOG3515|consen 655 SEWILNHIDGSDYENGCTTQNLLGSDHVSGAIHSGT---ASVGPINLTYDNLTYSTISLEWMPGFD-GGLQQRFFLKYYD 730 (741)
T ss_pred cccccCCcchhhhcceeeecccCCCccccceecCCc---CCCCccceEeeeeeeeeeceeeeeccc-cccccceeeehhh
Confidence 344556677889998888888888766665443322 357888999998999999999998876 3344577777766
Q ss_pred CC
Q psy7018 263 GN 264 (349)
Q Consensus 263 ~~ 264 (349)
.+
T Consensus 731 ~~ 732 (741)
T KOG3515|consen 731 LG 732 (741)
T ss_pred cC
Confidence 54
No 83
>cd05854 Ig6_Contactin-2 Sixth Ig domain of contactin-2. Ig6_Contactin-2: Sixth Ig domain of the neural cell adhesion molecule contactin-2-like. Contactins are comprised of six Ig domains followed by four fibronectin type III (FnIII) domains anchored to the membrane by glycosylphosphatidylinositol. Contactin-2 (TAG-1, axonin-1) facilitates cell adhesion by homophilic binding between molecules in apposed membranes. It may play a part in the neuronal processes of neurite outgrowth, axon guidance and fasciculation, and neuronal migration. The first four Ig domains form the intermolecular binding fragment, which arranges as a compact U-shaped module by contacts between IG domains 1 and 4, and domains 2 and 3. The different contactins show different expression patterns in the central nervous system. During development and in adulthood, contactin-2 is transiently expressed in subsets of central and peripheral neurons. Contactin-2 is also expressed in retinal amacrine cells in the developing c
Probab=40.61 E-value=21 Score=23.63 Aligned_cols=27 Identities=19% Similarity=0.048 Sum_probs=19.7
Q ss_pred CCceEEEEeeecCCCCCCCCcceeeeC
Q psy7018 10 DHKVSTEDRAGNSAGWSPFSETSVITT 36 (349)
Q Consensus 10 ~~~y~~~v~a~n~~G~~~~s~~~~~~t 36 (349)
.+.-.|.|.|.|.+|....+..+.+..
T Consensus 57 ~D~G~YtC~A~n~~g~~~~~~~L~V~~ 83 (85)
T cd05854 57 SHAGTYTCTAQTVVDSASASATLVVRG 83 (85)
T ss_pred hhCeEEEEEEecCCCCEEEEEEEEEEC
Confidence 466789999999999766555555543
No 84
>PF10576 EndIII_4Fe-2S: Iron-sulfur binding domain of endonuclease III; InterPro: IPR003651 Endonuclease III (4.2.99.18 from EC) is a DNA repair enzyme which removes a number of damaged pyrimidines from DNA via its glycosylase activity and also cleaves the phosphodiester backbone at apurinic / apyrimidinic sites via a beta-elimination mechanism [, ]. The structurally related DNA glycosylase MutY recognises and excises the mutational intermediate 8-oxoguanine-adenine mispair []. The 3-D structures of Escherichia coli endonuclease III [] and catalytic domain of MutY [] have been determined. The structures contain two all-alpha domains: a sequence-continuous, six-helix domain (residues 22-132) and a Greek-key, four-helix domain formed by one N-terminal and three C-terminal helices (residues 1-21 and 133-211) together with the [Fe4S4] cluster. The cluster is bound entirely within the C-terminal loop by four cysteine residues with a ligation pattern Cys-(Xaa)6-Cys-(Xaa)2-Cys-(Xaa)5-Cys which is distinct from all other known Fe4S4 proteins. This structural motif is referred to as a [Fe4S4] cluster loop (FCL) []. Two DNA-binding motifs have been proposed, one at either end of the interdomain groove: the helix-hairpin-helix (HhH) and FCL motifs. The primary role of the iron-sulphur cluster appears to involve positioning conserved basic residues for interaction with the DNA phosphate backbone by forming the loop of the FCL motif [, ]. The iron-sulphur cluster loop (FCL) is also found in DNA-(apurinic or apyrimidinic site) lyase, a subfamily of endonuclease III. The enzyme has both apurinic and apyrimidinic endonuclease activity and a DNA N-glycosylase activity. It cuts damaged DNA at cytosines, thymines and guanines, and acts on the damaged strand 5' of the damaged site. The enzyme binds a 4Fe-4S cluster which is not important for the catalytic activity, but is probably involved in the alignment of the enzyme along the DNA strand.; GO: 0004519 endonuclease activity, 0051539 4 iron, 4 sulfur cluster binding; PDB: 1VRL_A 1RRQ_A 3G0Q_A 3FSQ_A 1RRS_A 3FSP_A 2ABK_A 1KG7_A 1KG2_A 1MUN_A ....
Probab=39.54 E-value=13 Score=16.24 Aligned_cols=12 Identities=17% Similarity=0.434 Sum_probs=6.7
Q ss_pred CcccccCCCCce
Q psy7018 2 SMVCQSCTDHKV 13 (349)
Q Consensus 2 ~~~~~~~p~~~y 13 (349)
++.|+-||-..|
T Consensus 5 ~P~C~~Cpl~~~ 16 (17)
T PF10576_consen 5 KPKCEECPLADY 16 (17)
T ss_dssp S--GGG-TTGGG
T ss_pred CCccccCCCccc
Confidence 588999997654
No 85
>TIGR03000 plancto_dom_1 Planctomycetes uncharacterized domain TIGR03000. Domains described by this model are found, so far, only in the Planctomycetes (Pirellula sp. strain 1 and Gemmata obscuriglobus), in up to six proteins per genome, and may be duplicated within a protein. The function is unknown.
Probab=39.20 E-value=48 Score=21.46 Aligned_cols=26 Identities=15% Similarity=0.309 Sum_probs=21.2
Q ss_pred CCcceEEEcCCCCCCeEEEEEEEecC
Q psy7018 275 SKNKNFTLSKLTPSTCFRFRLAAVNQ 300 (349)
Q Consensus 275 ~~~~~~~~~~L~p~t~Y~~~V~A~~~ 300 (349)
+...+|.-.+|.+|..|.|+|+|.-.
T Consensus 26 G~~R~F~T~~L~~G~~y~Y~v~a~~~ 51 (75)
T TIGR03000 26 GTVRTFTTPPLEAGKEYEYTVTAEYD 51 (75)
T ss_pred ccEEEEECCCCCCCCEEEEEEEEEEe
Confidence 34457888899999999999999553
No 86
>cd05735 Ig8_DSCAM Eight immunoglobulin (Ig) domain of Down Syndrome Cell Adhesion molecule (DSCAM). Ig8_DSCAM: the eight immunoglobulin (Ig) domain of Down Syndrome Cell Adhesion molecule (DSCAM). DSCAM is a cell adhesion molecule expressed largely in the developing nervous system. The gene encoding DSCAM is located at human chromosome 21q22, the locus associated with the mental retardation phenotype of Down Syndrome. DSCAM is predicted to be the largest member of the IG superfamily. It has been demonstrated that DSCAM can mediate cation-independent homophilic intercellular adhesion.
Probab=37.88 E-value=1.2e+02 Score=20.05 Aligned_cols=35 Identities=3% Similarity=-0.270 Sum_probs=23.8
Q ss_pred eEEEecCCCCCCeEEEEEEEEecccccCCCCcEEE
Q psy7018 182 QSCRIQDLKPGTDYAVCVQVHLEEIVGIASEPTLF 216 (349)
Q Consensus 182 ~~~~i~~L~p~~~Y~~~V~a~~~~g~~~~s~~~~~ 216 (349)
..+.|.++.....=.+...|.|..|.......+.+
T Consensus 47 s~L~I~~~~~~D~G~YtC~A~N~~G~~~~~~~L~V 81 (88)
T cd05735 47 STLQILPTVREDSGFFSCHAINSYGEDRGIIQLTV 81 (88)
T ss_pred EEEEECCCCcccCEEEEEEEEcCCCcceEEEEEEE
Confidence 45777777776666788888999887544433333
No 87
>cd04971 Ig_TrKABC_d5 Fifth domain (immunoglobulin-like) of Trk receptors TrkA, TrkB and TrkC. TrkABC_d5: the fifth domain of Trk receptors TrkA, TrkB and TrkC, this is an immunoglobulin (Ig)-like domain which binds to neurotrophin. The Trk family of receptors are tyrosine kinase receptors. They are activated by dimerization, leading to autophosphorylation of intracellular tyrosine residues, and triggering the signal transduction pathway. TrkA, TrkB, and TrkC share significant sequence homology and domain organization. The first three domains are leucine-rich domains. The fourth and fifth domains are Ig-like domains playing a part in ligand binding. TrkA, Band C mediate the trophic effects of the neurotrophin Nerve growth factor (NGF) family. TrkA is recognized by NGF. TrkB is recognized by brain-derived neurotrophic factor (BDNF) and neurotrophin (NT)-4. TrkC is recognized by NT-3. NT-3 is promiscuous as in some cell systems it activates TrkA and TrkB receptors. TrkA is a receptor foun
Probab=37.61 E-value=33 Score=22.45 Aligned_cols=23 Identities=17% Similarity=-0.046 Sum_probs=18.2
Q ss_pred cCCCCceEEEEeeecCCCCCCCC
Q psy7018 7 SCTDHKVSTEDRAGNSAGWSPFS 29 (349)
Q Consensus 7 ~~p~~~y~~~v~a~n~~G~~~~s 29 (349)
+.+.+.-.|.|.|.|..|....+
T Consensus 53 ~~~~D~G~YtC~A~N~~G~~~~~ 75 (81)
T cd04971 53 PTHVNNGNYTLVASNEYGQDSKS 75 (81)
T ss_pred CCcccCeEEEEEEEeCCCCeeeE
Confidence 56678889999999999975433
No 88
>cd05851 Ig3_Contactin-1 Third Ig domain of contactin-1. Ig3_Contactin-1: Third Ig domain of the neural cell adhesion molecule contactin-1. Contactins are comprised of six Ig domains followed by four fibronectin type III (FnIII) domains anchored to the membrane by glycosylphosphatidylinositol. Contactin-1 is differentially expressed in tumor tissues and may through a RhoA mechanism, facilitate invasion and metastasis of human lung adenocarcinoma.
Probab=37.54 E-value=95 Score=20.51 Aligned_cols=27 Identities=7% Similarity=0.021 Sum_probs=21.8
Q ss_pred eEEEecCCCCCCeEEEEEEEEeccccc
Q psy7018 182 QSCRIQDLKPGTDYAVCVQVHLEEIVG 208 (349)
Q Consensus 182 ~~~~i~~L~p~~~Y~~~V~a~~~~g~~ 208 (349)
..+.|.++.+...-.+...|.|..|..
T Consensus 53 ~~L~I~~v~~~D~G~Y~C~A~N~~G~~ 79 (88)
T cd05851 53 AVLKIFNIQPEDEGTYECEAENIKGKD 79 (88)
T ss_pred CEEEECcCChhhCEEEEEEEEcCCCce
Confidence 357888988888888888999988764
No 89
>PHA02637 TNF-alpha-receptor-like protein; Provisional
Probab=36.95 E-value=19 Score=25.91 Aligned_cols=13 Identities=15% Similarity=0.562 Sum_probs=10.7
Q ss_pred CcccccCCCCceE
Q psy7018 2 SMVCQSCTDHKVS 14 (349)
Q Consensus 2 ~~~~~~~p~~~y~ 14 (349)
..+|.|||..+|+
T Consensus 62 ~T~C~PCp~GTYT 74 (127)
T PHA02637 62 NTQCTPCGSGTFT 74 (127)
T ss_pred CcccccCCCCCee
Confidence 4689999999874
No 90
>cd00146 PKD polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.
Probab=36.09 E-value=1.2e+02 Score=19.48 Aligned_cols=60 Identities=17% Similarity=0.266 Sum_probs=33.0
Q ss_pred CCeEEEEeecCCCCCCcccEEEEEEecCCCCCCcEEeecCCcceEEEcCCCCCCeEEEEEEEecCCCCCC
Q psy7018 236 RTSLALKWNAAVDNGAHVMHYILESDQGNATGDFKEISKSKNKNFTLSKLTPSTCFRFRLAAVNQHGKSA 305 (349)
Q Consensus 236 ~~sv~l~W~~~~~~~~~i~~Y~v~~~~~~~~~~~~~~~~~~~~~~~~~~L~p~t~Y~~~V~A~~~~G~~~ 305 (349)
...+.+++....+ +.+..|...+.... . .... .......-.++| .|.++|.+.|..|.+.
T Consensus 14 ~~~v~~~~~~~~~--~~~~~~~W~fgdg~--~----~~~~-~~~~~~~y~~~G-~y~v~l~v~d~~g~~~ 73 (81)
T cd00146 14 GASVTFSASDSSG--GSIVSYKWDFGDGE--V----SSSG-EPTVTHTYTKPG-TYTVTLTVTNAVGSSS 73 (81)
T ss_pred CccEEEEEEeCCC--CCEEEEEEEeCCCC--c----cccC-CCceEEEcCCCc-EEEEEEEEEeCCCCEE
Confidence 4566677666643 34445665554320 0 0111 022333344666 8999999999887543
No 91
>PF04775 Bile_Hydr_Trans: Acyl-CoA thioester hydrolase/BAAT N-terminal region; InterPro: IPR006862 This entry presents the N-termini of acyl-CoA thioester hydrolase and bile acid-CoA:amino acid N-acetyltransferase (BAAT) []. This region is not thought to contain the active site of either enzyme. Thioesterase isoforms have been identified in peroxisomes, cytoplasm and mitochondria, where they are thought to have distinct functions in lipid metabolism []. For example, in peroxisomes, the hydrolase acts on bile-CoA esters [].; GO: 0016290 palmitoyl-CoA hydrolase activity, 0006629 lipid metabolic process; PDB: 3HLK_B 3K2I_B.
Probab=35.33 E-value=65 Score=23.37 Aligned_cols=24 Identities=21% Similarity=0.411 Sum_probs=17.5
Q ss_pred eEEEcCCCCCCeEEEEEEEecCCC
Q psy7018 279 NFTLSKLTPSTCFRFRLAAVNQHG 302 (349)
Q Consensus 279 ~~~~~~L~p~t~Y~~~V~A~~~~G 302 (349)
.+.++||.|+..|+++....+..|
T Consensus 6 ~I~v~GL~p~~~vtl~a~~~~~~g 29 (126)
T PF04775_consen 6 DIRVSGLPPGQEVTLRARLTDDNG 29 (126)
T ss_dssp EEEEES--TT-EEEEEEEEE-TTS
T ss_pred EEEEeCCCCCCEEEEEEEEEeCCC
Confidence 678999999999999999998766
No 92
>KOG3515|consensus
Probab=34.58 E-value=49 Score=32.07 Aligned_cols=68 Identities=18% Similarity=0.151 Sum_probs=50.8
Q ss_pred CCCceEEEEeeecCCCCCCCCcceeeeCCCCCCCCCccccccccCCCeeEEEecCCCCCCCCceeEEEEecC
Q psy7018 9 TDHKVSTEDRAGNSAGWSPFSETSVITTPPGPPGSLTLRKTSVTTPTSLTVSWSEPANHGSPILHYVLETSD 80 (349)
Q Consensus 9 p~~~y~~~v~a~n~~G~~~~s~~~~~~t~~~~P~~~~~~~~~~~~~~si~l~W~~~~~~~~~~~~Y~v~~~~ 80 (349)
++..|.+-|.+.|.+|.+.-+..+.--+.+..|.++. .......++.|.|.+..+ ++....+.+++..
T Consensus 663 ~~~~y~~~c~t~n~lg~~~v~~~~~~~t~~~~~~n~~---~~~~~~~~i~l~~~p~fd-gg~~q~f~~~~~~ 730 (741)
T KOG3515|consen 663 DGSDYENGCTTQNLLGSDHVSGAIHSGTASVGPINLT---YDNLTYSTISLEWMPGFD-GGLQQRFFLKYYD 730 (741)
T ss_pred chhhhcceeeecccCCCccccceecCCcCCCCccceE---eeeeeeeeeceeeeeccc-cccccceeeehhh
Confidence 4567889999999999887777776666666555444 567899999999999876 3445677777654
No 93
>cd05773 Ig8_hNephrin_like Eighth immunoglobulin-like domain of nephrin. Ig8_hNephrin_like: domain similar to the eighth immunoglobulin-like domain in human nephrin. Nephrin is an integral component of the slit diaphragm, and is a central component of the glomerular ultrafilter. Nephrin plays a structural role, and has a role in signaling. Nephrin is a transmembrane protein having a short intracellular portion, and an extracellular portion comprised of eight Ig-like domains, and one fibronectin type III-like domain. The extracellular portions of nephrin, from neighboring foot processes of separate podocyte cells, may interact with each other, and in association with other components of the slit diaphragm, form a porous molecular sieve within the slit pore. The intracellular portion of nephrin is associated with linker proteins, which connect nephrin to the actin cytoskeleton. The intracellular portion is tyrosine phosphorylated, and mediates signaling from the slit diaphragm into the p
Probab=33.97 E-value=49 Score=23.12 Aligned_cols=27 Identities=22% Similarity=0.027 Sum_probs=17.9
Q ss_pred CceEEEEeeecCCCCCCCCcceeeeCC
Q psy7018 11 HKVSTEDRAGNSAGWSPFSETSVITTP 37 (349)
Q Consensus 11 ~~y~~~v~a~n~~G~~~~s~~~~~~t~ 37 (349)
+.-.|.|.|.|..|....+..+.+...
T Consensus 81 D~G~Y~C~A~N~~G~~~~~i~L~~~~~ 107 (109)
T cd05773 81 DYALFTCTAHNSLGEDSLDIQLVSTSR 107 (109)
T ss_pred CCEEEEEEEEeCCccCcEEEEEEecCC
Confidence 455799999999998544444444433
No 94
>COG3979 Uncharacterized protein contain chitin-binding domain type 3 [General function prediction only]
Probab=30.67 E-value=1.4e+02 Score=23.37 Aligned_cols=70 Identities=16% Similarity=0.098 Sum_probs=42.8
Q ss_pred CCCccccccccCCCeeEEEecCCCCCCCCceeEEEEecCCCCCceeecccEEEEcCCCCCCEEEEEEEEEee
Q psy7018 42 GSLTLRKTSVTTPTSLTVSWSEPANHGSPILHYVLETSDPAQPSITCDTTSYQLTGLSPHTTYKSVCQVYLI 113 (349)
Q Consensus 42 ~~~~~~~~~~~~~~si~l~W~~~~~~~~~~~~Y~v~~~~~~~~~~~~~~~~~~i~~L~p~t~Y~~~v~~~~~ 113 (349)
.++..+...+....++.+.|+.... ...+..|.+.+ .-............++++|.+.++|.+.+.+...
T Consensus 6 ~a~~~~~~~~~~~~s~~~~~~s~~~-~~~~~~~p~~~-~~d~~~~~t~~~~~~~~g~~~et~y~~~~~a~~t 75 (181)
T COG3979 6 PAPKSLMLTNVTTTSASLAASSSSD-NKASDTYPYVY-AWDPKTVYTTGTVVTVEGLAPETEYWTLVEAPDT 75 (181)
T ss_pred cCccceeecchhhhhhhhccccccc-cccccCCCccc-ccCccceecccceEEecCcceeeEEEEeeeccCC
Confidence 3444444456788899999987622 23344444111 1111233456788999999999999997765544
No 95
>smart00469 WIF Wnt-inhibitory factor-1 like domain. Occurs as extracellular domain in metazoan Ryk receptor tyrosine kinases. C. elegans Ryk is required for cell-cuticle recognition. WIF-1 binds to Wnt and inhibits its activity.
Probab=30.34 E-value=2.3e+02 Score=20.99 Aligned_cols=60 Identities=15% Similarity=0.232 Sum_probs=40.3
Q ss_pred CCCeeEEEecCCCCCCCCceeEEEEecCCCCCceeecccEEEEcCCCCCCEEEEEEEEEeecC
Q psy7018 53 TPTSLTVSWSEPANHGSPILHYVLETSDPAQPSITCDTTSYQLTGLSPHTTYKSVCQVYLISD 115 (349)
Q Consensus 53 ~~~si~l~W~~~~~~~~~~~~Y~v~~~~~~~~~~~~~~~~~~i~~L~p~t~Y~~~v~~~~~~~ 115 (349)
.-+.+.++|+... ..-..|.+...............+....|..|.+.=.|+|...+.|.
T Consensus 43 ~i~~v~FTW~s~~---~~~~~y~~~l~s~D~~vl~~P~lnI~~~G~VP~~~~~f~V~l~CsG~ 102 (136)
T smart00469 43 NIHDLSFTWQALG---QEYVPYSLNVRSDDKEVLPRPIVNISLLGTVPHTLQVFQVELKCSGK 102 (136)
T ss_pred ccceEEEEEEcCC---CeeeeeEEEEEeCCHHHcCCCcccccccccCCCCcEEEEEEEEecCC
Confidence 4568999999873 34466777665544434444556666678888888888777666544
No 96
>PRK13211 N-acetylglucosamine-binding protein A; Reviewed
Probab=30.26 E-value=4.5e+02 Score=24.39 Aligned_cols=35 Identities=20% Similarity=0.234 Sum_probs=26.9
Q ss_pred eEEEcCCCCCCeEEEEEEEecCCCCCCCCceEEeec
Q psy7018 279 NFTLSKLTPSTCFRFRLAAVNQHGKSADATSVAEWL 314 (349)
Q Consensus 279 ~~~~~~L~p~t~Y~~~V~A~~~~G~~~~s~~~~~~t 314 (349)
...+.++++| .|.+.|.+.+..|.-..+..+.|..
T Consensus 370 tL~Ls~~~AG-~y~Lvv~~t~~dG~~~~q~~~~~~v 404 (478)
T PRK13211 370 SLDLSKLKAG-HHMLVVKAKPKDGELIKQQTLDFML 404 (478)
T ss_pred EEecccCCCc-eEEEEEEEEeCCCceeeeeeEEEEE
Confidence 5667799988 8999999999988665455555544
No 97
>COG4932 Predicted outer membrane protein [Cell envelope biogenesis, outer membrane]
Probab=29.65 E-value=7e+02 Score=26.44 Aligned_cols=197 Identities=17% Similarity=0.159 Sum_probs=0.0
Q ss_pred EEEcCCCCCCEEEEEEEEEeecCCcccceeeeeeecccccCCCCCCCccccceEEEEEE--ecCCCCCeeeEEEeeCCCC
Q psy7018 93 YQLTGLSPHTTYKSVCQVYLISDKHTQKVDLTFESDQLYLSGLPNSEWPSLRTIILTLY--FRTAQVPGVPPYVSDKGRD 170 (349)
Q Consensus 93 ~~i~~L~p~t~Y~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~l~w~--~~~~~~~~Y~v~~~~~~~~ 170 (349)
..+++|.||-.-.+...+-..-.....+..++..-.+..+............+..|+=. .....+.+-..+..+.+..
T Consensus 1105 v~itdLaPGDYqfVEtkAPtGY~LdatPV~FtI~eeq~e~~~vtKeN~~~~GsvqLtK~Ds~t~a~LaGA~Fel~d~dG~ 1184 (1531)
T COG4932 1105 VEITDLAPGDYQFVETKAPTGYILDATPVNFTISEEQDEAAKVTKENTLKPGSVQLTKVDSATKATLAGAEFELQDEDGT 1184 (1531)
T ss_pred EEeccccCCceeeEEecCCceeEecCccceeEeeccCCceeEEeecccccccceEEEEecccccccccCcEEEEEcCCCc
Q ss_pred CceEEEecCCceEEEecCCCCCCeEEEEEEEEecccccCCCCcEEEecCCCCCCCCCCCeEEeecCCeEEEEeecCCCCC
Q psy7018 171 GKYKSIYNGEAQSCRIQDLKPGTDYAVCVQVHLEEIVGIASEPTLFTTPPCEPDQPNPPKLVTKTRTSLALKWNAAVDNG 250 (349)
Q Consensus 171 ~~~~~~~~~~~~~~~i~~L~p~~~Y~~~V~a~~~~g~~~~s~~~~~~t~~~~P~~p~~~~~~~~~~~sv~l~W~~~~~~~ 250 (349)
.....+.....-...+.+|.|| .|+| |-..-..|.---..+..|+.....-....-..--..-..++.|.=... +.+
T Consensus 1185 ~VqegLtTD~nG~i~VtdL~PG-dYqF-VETkAP~GY~LdatP~~FtI~~~q~ev~~V~~en~~~pgsv~L~k~d~-~~~ 1261 (1531)
T COG4932 1185 LVQEGLTTDENGKINVTDLAPG-DYQF-VETKAPTGYILDATPTPFTIEFNQEEVVKVVKENTAIPGSVVLTKKDS-DTG 1261 (1531)
T ss_pred EeeccceecCCCcEEecccCCc-ceee-eeecCCcceeeccccceeEEeccccceeEEeeccccCCCCceeeccCC-Ccc
Q ss_pred CcccEEEEEEecCCCCCCcEEeecCCcceEEEcCCCCCCeEEE
Q psy7018 251 AHVMHYILESDQGNATGDFKEISKSKNKNFTLSKLTPSTCFRF 293 (349)
Q Consensus 251 ~~i~~Y~v~~~~~~~~~~~~~~~~~~~~~~~~~~L~p~t~Y~~ 293 (349)
..+.+-.......++...-...+....-.+.+.+|.|| .|.|
T Consensus 1262 ~~l~~a~fkl~~~eg~~vqe~L~td~~Gei~v~dlkpG-dyqf 1303 (1531)
T COG4932 1262 AALSGAEFKLLDAEGTTVQEGLTTDETGEIVVADLKPG-DYQF 1303 (1531)
T ss_pred cccCCCceeeecCCCcEeccCceecCCCcEEecccCCC-cccc
No 98
>PF13205 Big_5: Bacterial Ig-like domain
Probab=29.30 E-value=1.9e+02 Score=19.72 Aligned_cols=20 Identities=20% Similarity=0.448 Sum_probs=14.8
Q ss_pred cCCCCCCeEEEEEEE--ecCCC
Q psy7018 283 SKLTPSTCFRFRLAA--VNQHG 302 (349)
Q Consensus 283 ~~L~p~t~Y~~~V~A--~~~~G 302 (349)
..|.+|+.|.+.|.. .+..|
T Consensus 70 ~~L~~~t~Y~v~i~~~i~d~~G 91 (107)
T PF13205_consen 70 QPLKPGTTYTVTIDSGIKDLAG 91 (107)
T ss_pred CcCCCCCEEEEEECCCCCCCCC
Confidence 369999999999944 34444
No 99
>PF02010 REJ: REJ domain; InterPro: IPR002859 The REJ (Receptor for Egg Jelly) domain is found in PKD1 P98161 from SWISSPROT and the sperm receptor for egg jelly Q26627 from SWISSPROT. The exact function of this domain is unknown. The domain is 600 amino acids long so is probably composed of multiple structural domains. There are six completely conserved cysteine residues that may form disulphide bridges. This region contains tandem PKD-like domains. Sequence similarity between a region of the autosomal dominant polycystic kidney disease (ADPKD) protein, polycystin-1 and a sea urchin sperm glycoprotein involved in fertilization, the receptor for egg jelly (suREJ) has been known for some time. The suREJ protein binds the glycoprotein coat of the egg (egg jelly), triggering the acrosome reaction, which transforms the sperm into a fusogenic cell. The sequence similarity and expression pattern suggests that the predicted human PKDREJ protein is a mammalian equivalent of the suREJ protein and therefore may have a central role in human fertilization [].; PDB: 2E7M_A 2YRL_A.
Probab=29.09 E-value=36 Score=30.87 Aligned_cols=22 Identities=27% Similarity=0.502 Sum_probs=0.0
Q ss_pred cCCCCCCeEEEEEEEecCCCCC
Q psy7018 283 SKLTPSTCFRFRLAAVNQHGKS 304 (349)
Q Consensus 283 ~~L~p~t~Y~~~V~A~~~~G~~ 304 (349)
..|.++..|.|++.+....+..
T Consensus 155 ~~l~~~~~y~f~ltv~k~~r~s 176 (440)
T PF02010_consen 155 STLSPGSTYTFTLTVSKGSRSS 176 (440)
T ss_dssp ----------------------
T ss_pred HHcCCCceEEEEEEEEeCCCCc
Confidence 6799999999999999887633
No 100
>PF12245 Big_3_2: Bacterial Ig-like domain (group 3); InterPro: IPR022038 This family of proteins is found in bacteria. They have two conserved sequence motifs: AGN and GMT.
Probab=28.30 E-value=1e+02 Score=18.86 Aligned_cols=20 Identities=20% Similarity=0.101 Sum_probs=16.1
Q ss_pred CceEEEEeeecCCCCCCCCc
Q psy7018 11 HKVSTEDRAGNSAGWSPFSE 30 (349)
Q Consensus 11 ~~y~~~v~a~n~~G~~~~s~ 30 (349)
..|++.+++.+.+|......
T Consensus 23 g~yt~~v~a~D~AGN~~~~~ 42 (60)
T PF12245_consen 23 GEYTLTVTATDKAGNTSSST 42 (60)
T ss_pred ccEEEEEEEEECCCCEEEee
Confidence 38999999999999755433
No 101
>cd05748 Ig_Titin_like Immunoglobulin (Ig)-like domain of titin and similar proteins. Ig_Titin_like: immunoglobulin (Ig)-like domain found in titin-like proteins. Titin (also called connectin) is a fibrous sarcomeric protein specifically found in vertebrate striated muscle. Titin is gigantic, depending on isoform composition it ranges from 2970 to 3700 kDa, and is of a length that spans half a sarcomere. Titin largely consists of multiple repeats of Ig-like and fibronectin type 3 (FN-III)-like domains. Titin connects the ends of myosin thick filaments to Z disks and extends along the thick filament to the H zone. It appears to function similarly to an elastic band, keeping the myosin filaments centered in the sarcomere during muscle contraction or stretching. Within the sarcomere, titin is also attached to or is associated with myosin binding protein C (MyBP-C). MyBP-C appears to contribute to the generation of passive tension by titin, and similar to titin has repeated Ig-like and FN-
Probab=27.74 E-value=1.6e+02 Score=18.37 Aligned_cols=29 Identities=7% Similarity=-0.038 Sum_probs=23.3
Q ss_pred CCceEEEecCCCCCCeEEEEEEEEecccc
Q psy7018 179 GEAQSCRIQDLKPGTDYAVCVQVHLEEIV 207 (349)
Q Consensus 179 ~~~~~~~i~~L~p~~~Y~~~V~a~~~~g~ 207 (349)
.....+.|.++.+...-.+.+.|.|..|.
T Consensus 37 ~~~~~L~I~~~~~~D~G~Y~C~a~N~~G~ 65 (74)
T cd05748 37 ASSTSLVIKNAERSDSGKYTLTLKNPAGE 65 (74)
T ss_pred CCeEEEEECCCCcCcCEEEEEEEECCCcc
Confidence 34567888899888888888999998775
No 102
>PF15417 DUF4624: Domain of unknown function (DUF4624)
Probab=27.35 E-value=1.1e+02 Score=21.54 Aligned_cols=32 Identities=19% Similarity=0.194 Sum_probs=22.8
Q ss_pred ecCCCCCceeecccEEEEcCCCCCCEEEEEEE
Q psy7018 78 TSDPAQPSITCDTTSYQLTGLSPHTTYKSVCQ 109 (349)
Q Consensus 78 ~~~~~~~~~~~~~~~~~i~~L~p~t~Y~~~v~ 109 (349)
|+..+...+..+..++.+.+|+.+.+|.++.+
T Consensus 76 Wsn~~~~~V~~dt~tisL~nlqk~kEY~V~ft 107 (132)
T PF15417_consen 76 WSNTWNGKVSGDTFTISLNNLQKEKEYVVCFT 107 (132)
T ss_pred eccccccccccceEEEEhhhcccCceEEEEEe
Confidence 33334444556678888999999999998553
No 103
>TIGR03769 P_ac_wall_RPT actinobacterial surface-anchored protein domain. This model describes a repeat domain that one to three times in Actinobacterial proteins, some of which have LPXTG-type sortase recognition motifs for covalent attachment to the Gram-positive cell wall. Where it occurs with duplication in an LPXTG-anchored protein, it tends to be adjacent to the substrate-binding protein of the gene trio of an ABC transporter system, where that substrate-binding protein has a single copy of this same domain. This arrangement suggests a substrate-binding relay system, with the LPXTG protein acting as a substrate receptor.
Probab=27.19 E-value=1.2e+02 Score=16.90 Aligned_cols=13 Identities=15% Similarity=0.462 Sum_probs=9.3
Q ss_pred CCCCeEEEEEEEec
Q psy7018 286 TPSTCFRFRLAAVN 299 (349)
Q Consensus 286 ~p~t~Y~~~V~A~~ 299 (349)
+|| .|.+.++|.-
T Consensus 11 ~PG-~Y~l~~~a~~ 23 (41)
T TIGR03769 11 KPG-TYTLTVQATA 23 (41)
T ss_pred CCe-EEEEEEEEEE
Confidence 566 7888888754
No 104
>cd05893 Ig_Palladin_C C-terminal immunoglobulin (Ig)-like domain of palladin. Ig_Palladin_C: C-terminal immunoglobulin (Ig)-like domain of palladin. Palladin belongs to the palladin-myotilin-myopalladin family. Proteins belonging to this family contain multiple Ig-like domains and function as scaffolds, modulating actin cytoskeleton. Palladin binds to alpha-actinin ezrin, vasodilator-stimulated phosphoprotein VASP, SPIN90 (DIP, mDia interacting protein), and Src. Palladin also binds F-actin directly, via its Ig3 domain. Palladin is expressed as several alternatively spliced isoforms, having various combinations of Ig-like domains, in a cell-type-specific manner. It has been suggested that palladin's different Ig-like domains may be specialized for distinct functions.
Probab=26.62 E-value=28 Score=22.38 Aligned_cols=19 Identities=21% Similarity=0.055 Sum_probs=14.9
Q ss_pred CCCceEEEEeeecCCCCCC
Q psy7018 9 TDHKVSTEDRAGNSAGWSP 27 (349)
Q Consensus 9 p~~~y~~~v~a~n~~G~~~ 27 (349)
+.+...|.|.|.|..|...
T Consensus 50 ~~D~G~Y~C~A~N~~G~~~ 68 (75)
T cd05893 50 KEDAGWYTVSAKNEAGIVS 68 (75)
T ss_pred HHHCEEEEEEEEcCCCEEE
Confidence 3466789999999999743
No 105
>PF09423 PhoD: PhoD-like phosphatase; InterPro: IPR018946 This entry contains a number of putative proteins as well as Alkaline phosphatase D which catalyses the reaction: A phosphate monoester + H(2)O = an alcohol + phosphate ; PDB: 2YEQ_B.
Probab=26.30 E-value=1.1e+02 Score=28.01 Aligned_cols=37 Identities=27% Similarity=0.282 Sum_probs=18.8
Q ss_pred CceEEEecCCCCCCeEEEEEEEEecccccCCCCcEEEecC
Q psy7018 180 EAQSCRIQDLKPGTDYAVCVQVHLEEIVGIASEPTLFTTP 219 (349)
Q Consensus 180 ~~~~~~i~~L~p~~~Y~~~V~a~~~~g~~~~s~~~~~~t~ 219 (349)
....+.+.+|+|++.|.+++..... +..+..-.++|.
T Consensus 62 ~t~~v~v~gL~p~t~Y~Y~~~~~~~---~~~s~~g~~rT~ 98 (453)
T PF09423_consen 62 FTVKVDVTGLQPGTRYYYRFVVDGG---GQTSPVGRFRTA 98 (453)
T ss_dssp TEEEEEE-S--TT-EEEEEEEE--T---TEE---EEEE--
T ss_pred eEeecccCCCCCCceEEEEEEEecC---CCCCCceEEEcC
Confidence 3446889999999999999998332 222444567775
No 106
>PF05345 He_PIG: Putative Ig domain; InterPro: IPR008009 This alignment represents the conserved core region of a ~90 residue repeat found in several haemagglutinins and other cell surface proteins. Sequence similarities to Hyalin (IPR003410 from INTERPRO) and the PKD domain (IPR000601 from INTERPRO) suggest an Ig-like fold so this family may be similar in function to the (IPR003791 from INTERPRO) and (IPR003790 from INTERPRO) protein families.
Probab=26.14 E-value=84 Score=18.31 Aligned_cols=13 Identities=23% Similarity=0.606 Sum_probs=9.5
Q ss_pred eEEEEEEEecCCC
Q psy7018 290 CFRFRLAAVNQHG 302 (349)
Q Consensus 290 ~Y~~~V~A~~~~G 302 (349)
.|.|.|.|.|..|
T Consensus 37 ~y~~~vtatd~~G 49 (49)
T PF05345_consen 37 TYTFTVTATDGSG 49 (49)
T ss_pred EEEEEEEEEcCCC
Confidence 7777877777654
No 107
>PF00907 T-box: T-box; InterPro: IPR001699 Transcription factors of the T-box family are required both for early cell-fate decisions, such as those necessary for formation of the basic vertebrate body plan, and for differentiation and organogenesis []. The T-box is defined as the minimal region within the T-box protein that is both necessary and sufficient for sequence-specific DNA binding, all members of the family so far examined bind to the DNA consensus sequence TCACACCT. The T-box is a relatively large DNA-binding domain, generally comprising about a third of the entire protein (17-26 kDa). These genes were uncovered on the basis of similarity to the DNA binding domain [] of Mus musculus (Mouse) Brachyury (T) gene product, which similarity is the defining feature of the family. The Brachyury gene is named for its phenotype, which was identified 70 years ago as a mutant mouse strain with a short blunted tail. The gene, and its paralogues, have become a well-studied model for the family, and hence much of what is known about the T-box family is derived from the murine Brachyury gene. Consistent with its nuclear location, Brachyury protein has a sequence-specific DNA-binding activity and can act as a transcriptional regulator []. Homozygous mutants for the gene undergo extensive developmental anomalies, thus rendering the mutation lethal []. The postulated role of Brachyury is as a transcription factor, regulating the specification and differentiation of posterior mesoderm during gastrulation in a dose-dependent manner []. T-box proteins tend to be expressed in specific organs or cell types, especially during development, and they are generally required for the development of those tissues, for example, Brachyury is expressed in posterior mesoderm and in the developing notochord, and it is required for the formation of these cells in mice []. ; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus; PDB: 1H6F_B 4A04_A 1XBR_B 2X6V_B 2X6U_A.
Probab=26.08 E-value=1.2e+02 Score=23.62 Aligned_cols=25 Identities=24% Similarity=0.460 Sum_probs=19.0
Q ss_pred cccEEEEcCCCCCCEEEEEEEEEee
Q psy7018 89 DTTSYQLTGLSPHTTYKSVCQVYLI 113 (349)
Q Consensus 89 ~~~~~~i~~L~p~t~Y~~~v~~~~~ 113 (349)
+...|.+.||.|...|.+.++....
T Consensus 31 P~l~y~vsGL~p~~~Y~i~l~~~~~ 55 (184)
T PF00907_consen 31 PTLEYSVSGLDPDSLYSISLHFERV 55 (184)
T ss_dssp S-EEEEEESS-TTSEEEEEEEEEES
T ss_pred cccEEEecCCCCCcceEEEEEEEEe
Confidence 4678999999999999998876543
No 108
>cd05765 Ig_3 Subgroup of the immunoglobulin (Ig) superfamily. Ig_3: subgroup of the immunoglobulin (Ig) domain found in the Ig superfamily. The Ig superfamily is a heterogenous group of proteins, built on a common fold comprised of a sandwich of two beta sheets. Members of the Ig superfamily are components of immunoglobulin, neuroglia, cell surface glycoproteins, such as T-cell receptors, CD2, CD4, CD8, and membrane glycoproteins, such as butyrophilin and chondroitin sulfate proteoglycan core protein. A predominant feature of most Ig domains is a disulfide bridge connecting the two beta-sheets with a tryptophan residue packed against the disulfide bond.
Probab=24.21 E-value=50 Score=21.15 Aligned_cols=20 Identities=20% Similarity=-0.021 Sum_probs=16.2
Q ss_pred CCCCceEEEEeeecCCCCCC
Q psy7018 8 CTDHKVSTEDRAGNSAGWSP 27 (349)
Q Consensus 8 ~p~~~y~~~v~a~n~~G~~~ 27 (349)
-+.+...|.|.|.|..|...
T Consensus 55 ~~~D~G~Y~C~a~N~~G~~~ 74 (81)
T cd05765 55 QPQDAGLYTCTARNSGGLLR 74 (81)
T ss_pred CcccCEEEEEEEecCCceEE
Confidence 45778899999999999643
No 109
>cd05760 Ig2_PTK7 Second immunoglobulin (Ig)-like domain of protein tyrosine kinase (PTK) 7, also known as CCK4. Ig2_PTK7: domain similar to the second immunoglobulin (Ig)-like domain in protein tyrosine kinase (PTK) 7, also known as CCK4. PTK7 is a subfamily of the receptor protein tyrosine kinase family, and is referred to as an RPTK-like molecule. RPTKs transduce extracellular signals across the cell membrane, and play important roles in regulating cell proliferation, migration, and differentiation. PTK7 is organized as an extracellular portion having seven Ig-like domains, a single transmembrane region, and a cytoplasmic tyrosine kinase-like domain. PTK7 is considered a pseudokinase as it has several unusual residues in some of the highly conserved tyrosine kinase (TK) motifs; it is predicted to lack TK activity. PTK7 may function as a cell-adhesion molecule. PTK7 mRNA is expressed at high levels in placenta, melanocytes, liver, lung, pancreas, and kidney. PTK7 is overexpressed in s
Probab=24.02 E-value=91 Score=19.91 Aligned_cols=18 Identities=22% Similarity=-0.075 Sum_probs=14.6
Q ss_pred CCCceEEEEeeecCCCCC
Q psy7018 9 TDHKVSTEDRAGNSAGWS 26 (349)
Q Consensus 9 p~~~y~~~v~a~n~~G~~ 26 (349)
+.+.-.|+|.|.|..|..
T Consensus 47 ~~D~G~Y~C~a~N~~G~~ 64 (77)
T cd05760 47 PDDSGLYYCCAHNAFGSV 64 (77)
T ss_pred cccCEEEEEEEEeCCCeE
Confidence 467788999999999964
No 110
>cd04974 Ig3_FGFR Third immunoglobulin (Ig)-like domain of fibroblast growth factor receptor (FGFR). Ig3_FGFR: third immunoglobulin (Ig)-like domain of fibroblast growth factor receptor (FGFR). Fibroblast growth factors (FGFs) participate in morphogenesis, development, angiogenesis, and wound healing. These FGF-stimulated processes are mediated by four FGFR tyrosine kinases (FGRF1-4). FGFRs are comprised of an extracellular portion consisting of three Ig-like domains, a transmembrane helix, and a cytoplasmic portion having protein tyrosine kinase activity. The highly conserved Ig-like domains 2 and 3, and the linker region between D2 and D3 define a general binding site for FGFs.
Probab=23.39 E-value=93 Score=20.64 Aligned_cols=24 Identities=33% Similarity=0.077 Sum_probs=17.9
Q ss_pred CCCceEEEEeeecCCCCCCCCcce
Q psy7018 9 TDHKVSTEDRAGNSAGWSPFSETS 32 (349)
Q Consensus 9 p~~~y~~~v~a~n~~G~~~~s~~~ 32 (349)
+.+.-.|.|.|.|..|....+..+
T Consensus 64 ~~D~G~Y~C~A~N~~G~~~~sa~L 87 (90)
T cd04974 64 FDDAGEYTCLAGNSIGPSHHSAWL 87 (90)
T ss_pred cccCcEEEEEeecccCcccCCEEE
Confidence 456778999999999976554443
No 111
>cd05723 Ig4_Neogenin Fourth immunoglobulin (Ig)-like domain in neogenin and similar proteins. Ig4_Neogenin: fourth immunoglobulin (Ig)-like domain in neogenin and related proteins. Neogenin is a cell surface protein which is expressed in the developing nervous system of vertebrate embryos in the growing nerve cells. It is also expressed in other embryonic tissues, and may play a general role in developmental processes such as cell migration, cell-cell recognition, and tissue growth regulation. Included in this group is the tumor suppressor protein DCC, which is deleted in colorectal carcinoma . DCC and neogenin each have four Ig-like domains followed by six fibronectin type III domains, a transmembrane domain, and an intracellular domain.
Probab=22.70 E-value=49 Score=20.73 Aligned_cols=19 Identities=16% Similarity=-0.031 Sum_probs=15.1
Q ss_pred CCCceEEEEeeecCCCCCC
Q psy7018 9 TDHKVSTEDRAGNSAGWSP 27 (349)
Q Consensus 9 p~~~y~~~v~a~n~~G~~~ 27 (349)
+.+.-.|+|.|.|..|...
T Consensus 46 ~~D~G~Y~C~A~N~~G~~~ 64 (71)
T cd05723 46 KSDEGFYQCIAENDVGNVQ 64 (71)
T ss_pred cccCEEEEEEEEcCCCEEE
Confidence 4567789999999999743
No 112
>cd05853 Ig6_Contactin-4 Sixth Ig domain of contactin-4. Ig6_Contactin-4: sixth Ig domain of the neural cell adhesion molecule contactin-4. Contactins are neural cell adhesion molecules, and are comprised of six Ig domains followed by four fibronectin type III (FnIII) domains anchored to the membrane by glycosylphosphatidylinositol. The different contactins show different expression patterns in the central nervous system. Highest expresson of contactin-4 is in testes, thyroid, small intestine, uterus and brain. Contactin-4 plays a role in the response of neuroblastoma cells to differentiating agents, such as retinoids. The contactin 4 gene is associated with cerebellar degeneration in spinocerebellar ataxia type 16.
Probab=22.10 E-value=76 Score=21.11 Aligned_cols=26 Identities=8% Similarity=-0.098 Sum_probs=19.4
Q ss_pred CceEEEEeeecCCCCCCCCcceeeeC
Q psy7018 11 HKVSTEDRAGNSAGWSPFSETSVITT 36 (349)
Q Consensus 11 ~~y~~~v~a~n~~G~~~~s~~~~~~t 36 (349)
+.-.|.|.|.|.++....+..+.++-
T Consensus 58 dsG~YtC~a~n~~~~~~a~a~L~V~~ 83 (85)
T cd05853 58 HAGKYVCMVQTSVDKLSAAADLIVRG 83 (85)
T ss_pred HCEEEEEEEEcccCceEEEEEEEEec
Confidence 45688999999998766666666654
No 113
>cd05763 Ig_1 Subgroup of the immunoglobulin (Ig) superfamily. Ig_1: subgroup of the immunoglobulin (Ig) domain found in the Ig superfamily. The Ig superfamily is a heterogenous group of proteins, built on a common fold comprised of a sandwich of two beta sheets. Members of the Ig superfamily are components of immunoglobulin, neuroglia, cell surface glycoproteins, such as T-cell receptors, CD2, CD4, CD8, and membrane glycoproteins, such as butyrophilin and chondroitin sulfate proteoglycan core protein. A predominant feature of most Ig domains is a disulfide bridge connecting the two beta-sheets with a tryptophan residue packed against the disulfide bond.
Probab=21.60 E-value=82 Score=19.84 Aligned_cols=19 Identities=21% Similarity=0.076 Sum_probs=14.5
Q ss_pred CCCceEEEEeeecCCCCCC
Q psy7018 9 TDHKVSTEDRAGNSAGWSP 27 (349)
Q Consensus 9 p~~~y~~~v~a~n~~G~~~ 27 (349)
+.+.-.|.|.|.|..|...
T Consensus 49 ~~D~G~Y~C~A~N~~G~~~ 67 (75)
T cd05763 49 IEDTGVYSCTAQNTAGSIS 67 (75)
T ss_pred cccCEEEEEEEEcCCCEEE
Confidence 3567788999999988643
No 114
>cd05733 Ig6_L1-CAM_like Sixth immunoglobulin (Ig)-like domain of the L1 cell adhesion molecule (CAM) and similar proteins. Ig6_L1-CAM_like: domain similar to the sixth immunoglobulin (Ig)-like domain of the L1 cell adhesion molecule (CAM). L1 belongs to the L1 subfamily of cell adhesion molecules (CAMs) and is comprised of an extracellular region having six Ig-like domains and five fibronectin type III domains, a transmembrane region and an intracellular domain. L1 is primarily expressed in the nervous system and is involved in its development and function. L1 is associated with an X-linked recessive disorder, X-linked hydrocephalus, MASA syndrome, or spastic paraplegia type 1, that involves abnormalities of axonal growth. This group also contains NrCAM [Ng(neuronglia)CAM-related cell adhesion molecule], which is primarily expressed in the nervous system, and human neurofascin.
Probab=21.50 E-value=1.1e+02 Score=19.57 Aligned_cols=16 Identities=25% Similarity=0.088 Sum_probs=13.7
Q ss_pred CCceEEEEeeecCCCC
Q psy7018 10 DHKVSTEDRAGNSAGW 25 (349)
Q Consensus 10 ~~~y~~~v~a~n~~G~ 25 (349)
.+.-.|+|.|.|..|.
T Consensus 53 ~d~G~Y~C~A~N~~G~ 68 (77)
T cd05733 53 DYEGEYQCYASNELGT 68 (77)
T ss_pred CCCEEEEEEEEcCCCc
Confidence 3568899999999996
No 115
>cd05855 Ig_TrkB_d5 Fifth domain (immunoglobulin-like) of Trk receptor TrkB. TrkB_d5: the fifth domain of Trk receptor TrkB, this is an immunoglobulin (Ig)-like domain which binds to neurotrophin. The Trk family of receptors are tyrosine kinase receptors, which mediate the trophic effects of the neurotrophin Nerve growth factor (NGF) family. The Trks are activated by dimerization, leading to autophosphorylation of intracellular tyrosine residues, and triggering the signal transduction pathway. TrkB shares significant sequence homology and domain organization with TrkA, and TrkC. The first three domains are leucine-rich domains. The fourth and fifth domains are Ig-like domains playing a part in ligand binding. TrKB is recognized by brain-derived neurotrophic factor (BDNF) and neurotrophin (NT)-4. In some cell systems NT-3 can activate TrkA and TrkB receptors. TrKB transcripts are found throughout multiple structures of the central and peripheral nervous systems.
Probab=20.84 E-value=1.1e+02 Score=20.00 Aligned_cols=19 Identities=16% Similarity=-0.225 Sum_probs=15.1
Q ss_pred CCceEEEEeeecCCCCCCC
Q psy7018 10 DHKVSTEDRAGNSAGWSPF 28 (349)
Q Consensus 10 ~~~y~~~v~a~n~~G~~~~ 28 (349)
.+.-.|.|.|.|..|....
T Consensus 54 ~D~G~YtC~A~N~~G~~~~ 72 (79)
T cd05855 54 LNNGIYTLVAKNEYGEDEK 72 (79)
T ss_pred ccCEEEEEEEEcCCcccce
Confidence 4677899999999997543
No 116
>cd04970 Ig6_Contactin_like Sixth Ig domain of contactin. Ig6_Contactin_like: Sixth Ig domain of contactins. Contactins are neural cell adhesion molecules and are comprised of six Ig domains followed by four fibronectin type III(FnIII) domains anchored to the membrane by glycosylphosphatidylinositol. The first four Ig domains form the intermolecular binding fragment, which arranges as a compact U-shaped module via contacts between Ig domains 1 and 4, and between Ig domains 2 and 3. Contactin-2 (TAG-1, axonin-1) may play a part in the neuronal processes of neurite outgrowth, axon guidance and fasciculation, and neuronal migration. This group also includes contactin-1 and contactin-5. The different contactins show different expression patterns in the central nervous system. During development and in adulthood, contactin-2 is transiently expressed in subsets of central and peripheral neurons. Contactin-5 is expressed specifically in the rat postnatal nervous system, peaking at about 3 week
Probab=20.53 E-value=64 Score=21.04 Aligned_cols=26 Identities=12% Similarity=-0.031 Sum_probs=17.1
Q ss_pred CCceEEEEeeecCCCCCCCCcceeee
Q psy7018 10 DHKVSTEDRAGNSAGWSPFSETSVIT 35 (349)
Q Consensus 10 ~~~y~~~v~a~n~~G~~~~s~~~~~~ 35 (349)
.++-.|.|.|.|..|....+..+.+.
T Consensus 57 ~D~G~Y~C~a~n~~g~~~~~~~l~V~ 82 (85)
T cd04970 57 KHAGKYTCTAQTVVDSLSASADLIVR 82 (85)
T ss_pred HhCeeeEEEEecCCCcEEEEEEEEEE
Confidence 35678889999988865444444444
No 117
>PF00907 T-box: T-box; InterPro: IPR001699 Transcription factors of the T-box family are required both for early cell-fate decisions, such as those necessary for formation of the basic vertebrate body plan, and for differentiation and organogenesis []. The T-box is defined as the minimal region within the T-box protein that is both necessary and sufficient for sequence-specific DNA binding, all members of the family so far examined bind to the DNA consensus sequence TCACACCT. The T-box is a relatively large DNA-binding domain, generally comprising about a third of the entire protein (17-26 kDa). These genes were uncovered on the basis of similarity to the DNA binding domain [] of Mus musculus (Mouse) Brachyury (T) gene product, which similarity is the defining feature of the family. The Brachyury gene is named for its phenotype, which was identified 70 years ago as a mutant mouse strain with a short blunted tail. The gene, and its paralogues, have become a well-studied model for the family, and hence much of what is known about the T-box family is derived from the murine Brachyury gene. Consistent with its nuclear location, Brachyury protein has a sequence-specific DNA-binding activity and can act as a transcriptional regulator []. Homozygous mutants for the gene undergo extensive developmental anomalies, thus rendering the mutation lethal []. The postulated role of Brachyury is as a transcription factor, regulating the specification and differentiation of posterior mesoderm during gastrulation in a dose-dependent manner []. T-box proteins tend to be expressed in specific organs or cell types, especially during development, and they are generally required for the development of those tissues, for example, Brachyury is expressed in posterior mesoderm and in the developing notochord, and it is required for the formation of these cells in mice []. ; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus; PDB: 1H6F_B 4A04_A 1XBR_B 2X6V_B 2X6U_A.
Probab=20.09 E-value=1.3e+02 Score=23.48 Aligned_cols=24 Identities=17% Similarity=0.340 Sum_probs=18.7
Q ss_pred cceEEEcCCCCCCeEEEEEEEecC
Q psy7018 277 NKNFTLSKLTPSTCFRFRLAAVNQ 300 (349)
Q Consensus 277 ~~~~~~~~L~p~t~Y~~~V~A~~~ 300 (349)
.-.|.++||.|...|.+.|.-.-.
T Consensus 32 ~l~y~vsGL~p~~~Y~i~l~~~~~ 55 (184)
T PF00907_consen 32 TLEYSVSGLDPDSLYSISLHFERV 55 (184)
T ss_dssp -EEEEEESS-TTSEEEEEEEEEES
T ss_pred ccEEEecCCCCCcceEEEEEEEEe
Confidence 338899999999999999877654
Done!