Query psy4956
Match_columns 361
No_of_seqs 203 out of 1867
Neff 11.4
Searched_HMMs 46136
Date Fri Aug 16 22:12:06 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy4956.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/4956hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG4221|consensus 100.0 3.1E-32 6.8E-37 247.6 34.4 340 5-360 438-798 (1381)
2 KOG4221|consensus 100.0 1.3E-30 2.9E-35 237.1 32.6 336 10-358 536-901 (1381)
3 KOG3513|consensus 100.0 6.3E-27 1.4E-31 214.0 33.8 299 53-359 578-899 (1051)
4 KOG3513|consensus 100.0 8.6E-26 1.9E-30 206.7 36.2 346 6-360 626-1001(1051)
5 KOG0196|consensus 99.7 2.3E-14 5.1E-19 127.5 20.4 181 83-269 325-526 (996)
6 KOG0196|consensus 99.6 2.1E-13 4.6E-18 121.5 20.4 174 180-359 329-520 (996)
7 KOG4222|consensus 99.6 1.1E-13 2.3E-18 127.4 18.0 303 52-360 474-830 (1281)
8 KOG4258|consensus 99.4 1.2E-11 2.6E-16 110.9 15.9 261 95-358 493-907 (1025)
9 KOG4222|consensus 99.4 9.2E-12 2E-16 115.0 15.0 257 6-268 536-834 (1281)
10 PF00041 fn3: Fibronectin type 99.3 2E-11 4.4E-16 82.7 9.9 82 91-173 1-84 (85)
11 PF00041 fn3: Fibronectin type 99.2 2.3E-10 5.1E-15 77.4 9.3 77 280-360 2-79 (85)
12 KOG4258|consensus 99.1 2.1E-09 4.5E-14 97.0 14.9 255 9-267 500-912 (1025)
13 PF10179 DUF2369: Uncharacteri 98.9 3.5E-06 7.6E-11 69.1 25.1 216 47-267 10-286 (300)
14 PF10179 DUF2369: Uncharacteri 98.8 1.3E-05 2.9E-10 65.8 24.9 116 240-358 127-281 (300)
15 KOG4802|consensus 98.8 1.6E-07 3.5E-12 78.0 12.4 130 135-265 200-338 (516)
16 cd00063 FN3 Fibronectin type 3 98.6 1E-06 2.2E-11 60.1 11.1 82 90-173 1-85 (93)
17 cd00063 FN3 Fibronectin type 3 98.5 3.1E-06 6.6E-11 57.7 10.8 85 183-270 1-86 (93)
18 KOG4802|consensus 98.4 4E-06 8.6E-11 70.0 11.2 130 41-171 198-340 (516)
19 smart00060 FN3 Fibronectin typ 98.4 1.2E-05 2.5E-10 53.2 10.8 77 92-169 3-81 (83)
20 smart00060 FN3 Fibronectin typ 98.2 1.5E-05 3.2E-10 52.7 9.1 77 185-265 3-81 (83)
21 COG3401 Fibronectin type 3 dom 98.2 0.0012 2.7E-08 54.4 20.1 317 23-360 3-329 (343)
22 KOG3632|consensus 98.1 1.5E-05 3.3E-10 73.7 9.0 201 98-317 586-814 (1335)
23 PF01108 Tissue_fac: Tissue fa 97.8 0.0002 4.3E-09 50.3 8.0 81 277-360 21-102 (107)
24 PF09294 Interfer-bind: Interf 97.4 0.001 2.2E-08 46.7 7.6 71 91-166 4-88 (106)
25 COG3401 Fibronectin type 3 dom 97.4 0.032 6.8E-07 46.4 16.5 241 11-268 84-333 (343)
26 KOG4806|consensus 97.2 0.089 1.9E-06 43.9 17.1 115 150-267 289-436 (454)
27 KOG3632|consensus 97.1 0.0024 5.3E-08 59.9 8.5 241 6-267 589-856 (1335)
28 PF01108 Tissue_fac: Tissue fa 97.0 0.0059 1.3E-07 42.9 8.1 83 87-171 19-105 (107)
29 KOG4806|consensus 96.9 0.14 3.1E-06 42.8 15.5 108 247-357 290-430 (454)
30 PF09294 Interfer-bind: Interf 96.8 0.0054 1.2E-07 43.0 6.7 72 184-262 4-88 (106)
31 KOG1225|consensus 96.6 0.0053 1.1E-07 55.0 5.9 156 186-354 369-524 (525)
32 KOG4367|consensus 96.4 0.0039 8.5E-08 53.0 4.0 72 195-271 451-523 (699)
33 KOG4367|consensus 96.3 0.0048 1E-07 52.5 4.1 71 7-80 451-521 (699)
34 KOG1225|consensus 96.1 0.011 2.4E-07 52.9 5.4 110 49-162 415-524 (525)
35 PF09240 IL6Ra-bind: Interleuk 95.9 0.26 5.7E-06 33.9 10.6 87 185-271 1-91 (99)
36 PF09067 EpoR_lig-bind: Erythr 95.8 0.11 2.4E-06 36.0 8.0 81 182-265 7-94 (104)
37 KOG4152|consensus 95.4 0.48 1E-05 42.1 12.2 68 150-218 652-725 (830)
38 COG4733 Phage-related protein, 95.0 0.19 4.1E-06 47.2 9.1 115 240-358 659-778 (952)
39 KOG4152|consensus 93.2 0.22 4.7E-06 44.1 5.6 67 246-315 652-725 (830)
40 KOG1948|consensus 92.8 9.7 0.00021 36.7 19.9 148 10-168 820-969 (1165)
41 PLN02533 probable purple acid 92.4 1.7 3.6E-05 39.1 10.2 73 183-259 41-121 (427)
42 COG4733 Phage-related protein, 91.7 3 6.4E-05 39.8 11.0 123 50-175 658-787 (952)
43 PF07495 Y_Y_Y: Y_Y_Y domain; 90.6 1 2.2E-05 28.1 5.3 25 53-78 30-54 (66)
44 PF07495 Y_Y_Y: Y_Y_Y domain; 90.3 1.4 3.1E-05 27.4 5.8 22 337-360 30-51 (66)
45 PF09067 EpoR_lig-bind: Erythr 89.6 3 6.6E-05 28.9 7.2 80 89-170 7-95 (104)
46 PF10342 GPI-anchored: Ser-Thr 87.3 3.5 7.6E-05 27.7 6.5 62 294-357 13-79 (93)
47 PLN02533 probable purple acid 84.0 5.8 0.00013 35.7 7.8 72 277-355 40-121 (427)
48 KOG1948|consensus 83.3 47 0.001 32.4 15.6 142 21-165 896-1063(1165)
49 KOG0613|consensus 80.4 35 0.00076 34.7 11.9 244 13-267 158-419 (1205)
50 PF11344 DUF3146: Protein of u 77.1 1.9 4.1E-05 27.2 1.7 14 344-357 67-80 (80)
51 PF09240 IL6Ra-bind: Interleuk 75.6 23 0.00049 24.2 8.8 79 93-171 2-87 (99)
52 KOG0613|consensus 75.5 56 0.0012 33.4 11.7 150 22-172 261-420 (1205)
53 KOG3515|consensus 74.2 75 0.0016 30.8 11.8 75 150-224 659-733 (741)
54 cd05762 Ig8_MLCK Eighth immuno 73.5 26 0.00056 23.9 10.2 84 4-93 11-97 (98)
55 KOG4228|consensus 67.3 29 0.00064 34.7 7.8 75 90-171 25-99 (1087)
56 PF07353 Uroplakin_II: Uroplak 66.7 12 0.00026 27.8 4.0 25 51-75 101-125 (184)
57 PF13754 Big_3_4: Bacterial Ig 65.2 23 0.0005 21.0 4.5 30 52-82 14-43 (54)
58 KOG3515|consensus 64.1 13 0.00028 35.6 4.9 79 51-129 653-732 (741)
59 TIGR00864 PCC polycystin catio 60.9 3.1E+02 0.0067 31.6 27.2 23 248-271 1565-1587(2740)
60 KOG4228|consensus 59.4 1E+02 0.0022 31.2 9.8 68 99-169 173-240 (1087)
61 TIGR00864 PCC polycystin catio 58.4 3.4E+02 0.0073 31.3 27.1 27 243-270 1662-1688(2740)
62 PF14292 SusE: SusE outer memb 57.6 67 0.0015 22.9 9.3 73 283-357 36-121 (122)
63 PF10342 GPI-anchored: Ser-Thr 55.3 59 0.0013 21.6 8.1 62 197-260 13-78 (93)
64 PF13750 Big_3_3: Bacterial Ig 54.8 92 0.002 23.6 14.4 33 245-277 116-148 (158)
65 PHA02579 7 baseplate wedge sub 52.2 80 0.0017 30.5 7.6 76 93-172 8-92 (1030)
66 PF07353 Uroplakin_II: Uroplak 51.1 1E+02 0.0022 23.1 8.8 25 241-265 102-126 (184)
67 PF09423 PhoD: PhoD-like phosp 45.4 45 0.00097 30.4 5.1 48 146-196 65-113 (453)
68 PF13205 Big_5: Bacterial Ig-l 42.3 1.1E+02 0.0024 20.9 6.3 23 333-355 61-83 (107)
69 PHA02579 7 baseplate wedge sub 37.9 1.8E+02 0.0039 28.3 7.5 73 282-360 9-88 (1030)
70 PF04775 Bile_Hydr_Trans: Acyl 35.9 84 0.0018 22.7 4.3 37 241-277 5-41 (126)
71 PF14734 DUF4469: Domain of un 34.6 79 0.0017 21.9 3.8 32 328-360 58-89 (102)
72 PF14054 DUF4249: Domain of un 32.8 2.6E+02 0.0056 23.6 7.6 64 247-317 95-166 (298)
73 PF09423 PhoD: PhoD-like phosp 31.4 1E+02 0.0023 28.1 5.2 49 51-102 63-112 (453)
74 PF14250 AbrB-like: AbrB-like 31.0 47 0.001 20.9 2.0 37 308-354 27-63 (71)
75 PRK13211 N-acetylglucosamine-b 30.1 2.8E+02 0.0061 25.6 7.4 28 51-79 368-395 (478)
76 PF02010 REJ: REJ domain; Int 29.9 17 0.00038 32.8 0.0 25 51-75 147-173 (440)
77 PF04775 Bile_Hydr_Trans: Acyl 29.9 1.1E+02 0.0024 22.1 4.1 25 146-170 6-30 (126)
78 PF00907 T-box: T-box; InterP 29.3 63 0.0014 25.2 3.0 22 51-72 32-53 (184)
79 cd02848 Chitinase_N_term Chiti 29.3 2E+02 0.0044 20.1 7.1 64 8-77 31-94 (106)
80 KOG1378|consensus 28.3 1.5E+02 0.0033 26.9 5.3 75 280-356 44-128 (452)
81 TIGR00868 hCaCC calcium-activa 27.3 6.6E+02 0.014 25.4 22.9 75 282-359 760-851 (863)
82 TIGR00868 hCaCC calcium-activa 26.4 6.9E+02 0.015 25.3 26.0 78 188-266 761-854 (863)
83 PF14686 fn3_3: Polysaccharide 25.6 56 0.0012 22.2 1.9 20 335-356 49-68 (95)
84 PF10333 Pga1: GPI-Mannosyltra 25.2 1E+02 0.0022 24.0 3.4 22 239-260 64-85 (180)
85 KOG1378|consensus 25.1 3.2E+02 0.0069 24.9 6.7 21 240-260 108-128 (452)
86 KOG3834|consensus 24.5 2.2E+02 0.0048 25.5 5.5 61 100-162 72-137 (462)
87 cd05894 Ig_C5_MyBP-C C5 immuno 24.5 2.1E+02 0.0046 18.7 8.7 71 3-78 5-78 (86)
88 PF08329 ChitinaseA_N: Chitina 22.8 3.1E+02 0.0068 20.1 6.4 62 8-76 35-96 (133)
89 PF10333 Pga1: GPI-Mannosyltra 22.4 1.1E+02 0.0024 23.7 3.2 22 333-355 63-84 (180)
90 PF14054 DUF4249: Domain of un 21.3 5E+02 0.011 21.8 8.1 62 58-126 95-166 (298)
91 PF04151 PPC: Bacterial pre-pe 21.2 2.2E+02 0.0047 17.6 5.6 20 51-71 51-70 (70)
92 TIGR03769 P_ac_wall_RPT actino 21.0 1.2E+02 0.0026 16.8 2.3 13 345-358 11-23 (41)
93 PF00907 T-box: T-box; InterP 20.6 1.5E+02 0.0032 23.1 3.6 23 146-168 34-56 (184)
94 TIGR03000 plancto_dom_1 Planct 20.1 2.6E+02 0.0056 18.1 4.9 23 51-73 28-50 (75)
No 1
>KOG4221|consensus
Probab=100.00 E-value=3.1e-32 Score=247.58 Aligned_cols=340 Identities=20% Similarity=0.289 Sum_probs=261.8
Q ss_pred EeCCCceEEEEecCCCCCCCcceEEEEEEcCCCCCCCCeEEeecCccceEEecCCCCCceEEEEEEEecCCcCccceeeE
Q psy4956 5 VHGVHGANLVIKIPENLSSDNSTYRLDYIPAHGHPPPNTTYVSRDIKDNIEFSEGLPGTKYDFYLYYTNSTVHDWLTWTA 84 (361)
Q Consensus 5 ~~~~~~~~l~~~~p~~~~~~~~~Y~v~~~~~~~~~~~~~~~~~~~~~~~~~i~~L~p~t~Y~i~V~a~~~~~~~~~~~~~ 84 (361)
....+-..+.|..|....+++.-|.+.|.. ++... ++.+...+.....++.+|.|.+.|.|+|+|.+..|..-.+..+
T Consensus 438 ~~~srfi~~tw~~p~~~~g~i~~~~v~~~~-~~~~r-er~~~tss~g~~~tv~nl~p~t~Y~~rv~A~n~~g~g~sS~pL 515 (1381)
T KOG4221|consen 438 LVSSRFIQLTWRPPAQISGNISTYTVFYKV-EGDVR-ERLQNTSSPGIQVTVQNLSPLTMYFFRVRAKNEAGSGESSAPL 515 (1381)
T ss_pred cccceeEEEeecCccccCCCcceEEEEEec-CCchh-hhheeccCCceEEEeeecccceeEEEEEeccCcccCCccCCce
Confidence 445566778899899888899999999977 33332 3434444323899999999999999999998877565555667
Q ss_pred EEecCCCCCCccEEEEEeCCEEEEEeeCCC--CCCcccEEEEEEECCCCCCCeEEeecCCCCceEEEcCCCCCcEEEEEE
Q psy4956 85 SITTPPDPPTNLSVNVRSGKTAQIFWSPPI--SGKYSGFKLKVISLSEKTPPRIIGFTENPPAGYSLKDLTPGGSYQVQL 162 (361)
Q Consensus 85 ~~~t~~~~p~~l~~~~~~~~~v~l~W~~p~--~~~~~~y~v~~~~~~~~~~~~~~~~~~~~~~~~~l~~L~p~~~Y~v~v 162 (361)
.+.+.+.+|..+.+...+..++.+.|.+|. ++++.+|++.|... .....+.+..+.. .++|.+|.+.++|.|+|
T Consensus 516 kV~t~pEgp~~~~a~ats~~ti~v~WepP~~~n~~I~~yk~~ys~~---~~~~~~~~~~n~~-e~ti~gL~k~TeY~~~v 591 (1381)
T KOG4221|consen 516 KVTTQPEGPVQLQAYATSPTTILVTWEPPPFGNGPITGYKLFYSED---DTGKELRVENNAT-EYTINGLEKYTEYSIRV 591 (1381)
T ss_pred EEecCCCCCccccccccCcceEEEEecCCCCCCCCceEEEEEEEcC---CCCceEEEecCcc-EEEeecCCCccceEEEE
Confidence 778888788778888888999999999987 77899999988765 2344556667777 99999999999999999
Q ss_pred EEEeCCCCcccceeeeeccC---CC-CCcceEEEeccCCeEEEEeecCCCC---CcceEEEEEEeCCCCCCc--eeEEee
Q psy4956 163 FSVYDSKESVAYTSRNFTTK---PN-TPGKFIVWFRNETTLLVLWQPSYPA---SIYTHYKVSIDPPDAPES--VLYVEK 233 (361)
Q Consensus 163 ~a~~~~~~s~~~~~~~~~t~---p~-~p~~l~~~~~~~~~~~v~W~~~~~~---~~~~~y~v~~~~~~~~~~--~~~~~~ 233 (361)
.|.+..|.+..+....+.|. |+ ||.|+++...+.++|.|+|.+|... +.+.+|+|+|........ .....
T Consensus 592 vA~N~~G~g~sS~~i~V~Tlsd~PsaPP~Nl~lev~sStsVrVsW~pP~~~t~ng~itgYkIRy~~~~~~~~~~~t~v~- 670 (1381)
T KOG4221|consen 592 VAYNSAGSGVSSADITVRTLSDVPSAPPQNLSLEVVSSTSVRVSWLPPPSETQNGQITGYKIRYRKLSREDEVNETVVK- 670 (1381)
T ss_pred EEecCCCCCCCCCceEEEeccCCCCCCCcceEEEecCCCeEEEEccCCCcccccceEEEEEEEecccCcccccceeecc-
Confidence 99999888876555555544 65 4556999999999999999876443 889999999986543322 12222
Q ss_pred ccCCCCCcEEEecCCCCCCeEEEEEEEEeCCCCCCceEE-EEecC--------CCCCccceEEeeecCCceEEEEeeCCC
Q psy4956 234 EGEPPGPAQAAFKGLVPGRAYNISVQTVSEDEISTPTTA-QYRTI--------PLRPLSFTYDKASITSNSLRVVWEPPK 304 (361)
Q Consensus 234 ~~~~~~~~~~~~~~L~p~t~Y~v~V~a~~~~~~s~~~~~-~~~t~--------~~~p~~l~~~~~~~~~~si~l~W~~~~ 304 (361)
+...++.+.+|+|++.|.|+|.|.+..|.++++.. .+.|. |.+|..|.+ ....++|.++|++|.
T Consensus 671 ----~n~~~~l~~~Lep~T~Y~vrIsa~t~nGtGpaS~w~~aeT~~~d~~e~vp~~ps~l~~---~~g~~si~vsW~Pp~ 743 (1381)
T KOG4221|consen 671 ----GNTTQYLFNGLEPNTQYRVRISAMTVNGTGPASEWVSAETPESDLDERVPGKPSELHV---HPGSNSIVVSWTPPP 743 (1381)
T ss_pred ----cchhhhHhhcCCCCceEEEEEEEeccCCCCCcccceeccCccccccccCCCCCceeee---ccCceeEEEEeCCCC
Confidence 23578999999999999999999999988877653 44442 234454444 667789999999975
Q ss_pred CC-CCceeEEEEEEECCCCCCCCceeecCCCCceeEeccCCCCCceEEEEEEEEeCC
Q psy4956 305 GF-SEFDKYQVSINVRRPGASSTPITKSRDEPTQCDMSEGLEPGRTYQVLVKTVSGK 360 (361)
Q Consensus 305 ~~-~~~~~y~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~p~t~Y~v~V~a~~~~ 360 (361)
.. ..+.+|+|.|+...+. .....++++.....|.+.. |.|+..|.|+++|.+..
T Consensus 744 ~~~~~vrgY~ig~r~g~~~-p~~~tIrl~~~~s~y~l~~-Le~~~~YvVkL~AfNn~ 798 (1381)
T KOG4221|consen 744 HPNIVVRGYKIGYRPGSGI-PDTGTIRLDEKVSYYNLEQ-LEPNRDYVVKLRAFNNH 798 (1381)
T ss_pred ChhhhhcceEEeeecccCC-CCCccEEecceeeEEEEEe-cccCceEEEEEEEeccC
Confidence 42 4688999999866542 2344467788888999997 99999999999997643
No 2
>KOG4221|consensus
Probab=100.00 E-value=1.3e-30 Score=237.13 Aligned_cols=336 Identities=18% Similarity=0.265 Sum_probs=247.3
Q ss_pred ceEEEEecCCCCCCCcceEEEEEEcCCCCCCCCeEEeecCccceEEecCCCCCceEEEEEEEecCCcCccceeeEEEec-
Q psy4956 10 GANLVIKIPENLSSDNSTYRLDYIPAHGHPPPNTTYVSRDIKDNIEFSEGLPGTKYDFYLYYTNSTVHDWLTWTASITT- 88 (361)
Q Consensus 10 ~~~l~~~~p~~~~~~~~~Y~v~~~~~~~~~~~~~~~~~~~~~~~~~i~~L~p~t~Y~i~V~a~~~~~~~~~~~~~~~~t- 88 (361)
...+.|+.|.-.+++|.+|++.|...+ ...+..+..+ ..+++|.+|++.|+|.|+|.|.|..|.+..+..+.+.|
T Consensus 536 ti~v~WepP~~~n~~I~~yk~~ys~~~--~~~~~~~~~n--~~e~ti~gL~k~TeY~~~vvA~N~~G~g~sS~~i~V~Tl 611 (1381)
T KOG4221|consen 536 TILVTWEPPPFGNGPITGYKLFYSEDD--TGKELRVENN--ATEYTINGLEKYTEYSIRVVAYNSAGSGVSSADITVRTL 611 (1381)
T ss_pred eEEEEecCCCCCCCCceEEEEEEEcCC--CCceEEEecC--ccEEEeecCCCccceEEEEEEecCCCCCCCCCceEEEec
Confidence 345667778888999999999999862 2225555555 89999999999999999999999876665555566555
Q ss_pred --CCCCC-CccEEEEEeCCEEEEEeeCCC----CCCcccEEEEEEECCCCCCCeEEeecCCCCceEEEcCCCCCcEEEEE
Q psy4956 89 --PPDPP-TNLSVNVRSGKTAQIFWSPPI----SGKYSGFKLKVISLSEKTPPRIIGFTENPPAGYSLKDLTPGGSYQVQ 161 (361)
Q Consensus 89 --~~~~p-~~l~~~~~~~~~v~l~W~~p~----~~~~~~y~v~~~~~~~~~~~~~~~~~~~~~~~~~l~~L~p~~~Y~v~ 161 (361)
.|++| .|+++...++++|.|+|.+|. ++.+.+|+|+|+....+...++..+..+.. .+.+.+|+|++.|.|+
T Consensus 612 sd~PsaPP~Nl~lev~sStsVrVsW~pP~~~t~ng~itgYkIRy~~~~~~~~~~~t~v~~n~~-~~l~~~Lep~T~Y~vr 690 (1381)
T KOG4221|consen 612 SDVPSAPPQNLSLEVVSSTSVRVSWLPPPSETQNGQITGYKIRYRKLSREDEVNETVVKGNTT-QYLFNGLEPNTQYRVR 690 (1381)
T ss_pred cCCCCCCCcceEEEecCCCeEEEEccCCCcccccceEEEEEEEecccCcccccceeecccchh-hhHhhcCCCCceEEEE
Confidence 35555 569999999999999999987 677899999998766655666666665666 8999999999999999
Q ss_pred EEEEeCCCCcccceeeeeccC--------CCCCcceEEEeccCCeEEEEeecCCCC-CcceEEEEEEeCCCCCCceeEEe
Q psy4956 162 LFSVYDSKESVAYTSRNFTTK--------PNTPGKFIVWFRNETTLLVLWQPSYPA-SIYTHYKVSIDPPDAPESVLYVE 232 (361)
Q Consensus 162 v~a~~~~~~s~~~~~~~~~t~--------p~~p~~l~~~~~~~~~~~v~W~~~~~~-~~~~~y~v~~~~~~~~~~~~~~~ 232 (361)
|.|.+..|.+..+.-..+.|. |.+|..|.+. ...+++.+.|+|+... ..+.+|+|-|+...+......+.
T Consensus 691 Isa~t~nGtGpaS~w~~aeT~~~d~~e~vp~~ps~l~~~-~g~~si~vsW~Pp~~~~~~vrgY~ig~r~g~~~p~~~tIr 769 (1381)
T KOG4221|consen 691 ISAMTVNGTGPASEWVSAETPESDLDERVPGKPSELHVH-PGSNSIVVSWTPPPHPNIVVRGYKIGYRPGSGIPDTGTIR 769 (1381)
T ss_pred EEEeccCCCCCcccceeccCccccccccCCCCCceeeec-cCceeEEEEeCCCCChhhhhcceEEeeecccCCCCCccEE
Confidence 999999888876555555543 3455555553 4567999999988544 66789999997654433211111
Q ss_pred eccCCCCCcEEEecCCCCCCeEEEEEEEEeCCCCCCceEEEEecC------------CCCCccceEEeeecCCceEEEEe
Q psy4956 233 KEGEPPGPAQAAFKGLVPGRAYNISVQTVSEDEISTPTTAQYRTI------------PLRPLSFTYDKASITSNSLRVVW 300 (361)
Q Consensus 233 ~~~~~~~~~~~~~~~L~p~t~Y~v~V~a~~~~~~s~~~~~~~~t~------------~~~p~~l~~~~~~~~~~si~l~W 300 (361)
. ......+.|..|.++..|.|++.|+|..|.+.+..-...+. ..+|..+++ ...+.++|++.|
T Consensus 770 l---~~~~s~y~l~~Le~~~~YvVkL~AfNn~gdG~p~y~~~~tR~~~~~~~~v~tp~~ppvgv~A--~~~S~tsI~v~w 844 (1381)
T KOG4221|consen 770 L---DEKVSYYNLEQLEPNRDYVVKLRAFNNHGDGNPIYESVKTRSATDPTSPVDTPMLPPVGVRA--NALSSTSIRVTW 844 (1381)
T ss_pred e---cceeeEEEEEecccCceEEEEEEEeccCCCCcceeeeeeeccCCCcCCcCCCCCCCcccccc--cccccceEEEEE
Confidence 1 22347899999999999999999999999887766444333 123455666 778889999999
Q ss_pred eCC-CCCCCceeEEEEEEECCCCCCCCceeecCCCCceeEeccCCCCCceEEEEEEEEe
Q psy4956 301 EPP-KGFSEFDKYQVSINVRRPGASSTPITKSRDEPTQCDMSEGLEPGRTYQVLVKTVS 358 (361)
Q Consensus 301 ~~~-~~~~~~~~y~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~p~t~Y~v~V~a~~ 358 (361)
..- +.......|.+.|...+-.. ............++.+.+ |+|.+.|+|.|.++.
T Consensus 845 ~~~~~~t~~~~~yTVr~~~~gi~~-~~~~~~~~~t~ls~~v~g-lkpnt~yEfav~~~~ 901 (1381)
T KOG4221|consen 845 ADNKDQTTDNRIYTVRWSLTGIRN-GTLYRYDNSTDLSYLVGG-LKPNTPYEFAVMVVK 901 (1381)
T ss_pred ecCCCccccceEEEEEEeeccccc-ceeEEEecccccceeccC-cCcCChhhhhhhhhh
Confidence 983 22235678999997432110 111122345556888875 999999999998775
No 3
>KOG3513|consensus
Probab=99.97 E-value=6.3e-27 Score=214.01 Aligned_cols=299 Identities=16% Similarity=0.232 Sum_probs=226.5
Q ss_pred eEEec--CCCCCceEEEEEEEecCCcCccceeeEEEecCCCCCCccEEEEEeCCEEEEEeeCCC--CCCcccEEEEEEEC
Q psy4956 53 NIEFS--EGLPGTKYDFYLYYTNSTVHDWLTWTASITTPPDPPTNLSVNVRSGKTAQIFWSPPI--SGKYSGFKLKVISL 128 (361)
Q Consensus 53 ~~~i~--~L~p~t~Y~i~V~a~~~~~~~~~~~~~~~~t~~~~p~~l~~~~~~~~~v~l~W~~p~--~~~~~~y~v~~~~~ 128 (361)
.++|. .|+....|.+.+++..+ ..+..+.+.+..+|.||.++.+..++.+++.|+|++.. +.++..|.|+++..
T Consensus 578 ~L~i~nv~l~~~G~Y~C~aqT~~D--s~s~~A~l~V~gpPgpP~~v~~~~i~~t~~~lsW~~g~dn~SpI~~Y~iq~rt~ 655 (1051)
T KOG3513|consen 578 RLTIANVSLEDSGKYTCVAQTALD--SASARADLLVRGPPGPPPDVHVDDISDTTARLSWSPGSDNNSPIEKYTIQFRTP 655 (1051)
T ss_pred ceEEEeeccccCceEEEEEEEeec--chhcccceEEecCCCCCCceeEeeeccceEEEEeecCCCCCCCceEEeEEecCC
Confidence 34444 47888899999998443 23445568888999999999999999999999999977 46799999999887
Q ss_pred CCCCCCeEEeecCCCC--ceEEEcCCCCCcEEEEEEEEEeCCCCccccee-eeeccCCC----CCcceEEEeccCCeEEE
Q psy4956 129 SEKTPPRIIGFTENPP--AGYSLKDLTPGGSYQVQLFSVYDSKESVAYTS-RNFTTKPN----TPGKFIVWFRNETTLLV 201 (361)
Q Consensus 129 ~~~~~~~~~~~~~~~~--~~~~l~~L~p~~~Y~v~v~a~~~~~~s~~~~~-~~~~t~p~----~p~~l~~~~~~~~~~~v 201 (361)
-...|.....++...+ .+..+.+|.|+..|+|||.|.|..|.+.++.. ...+|.+. .|.++.......+.+.|
T Consensus 656 ~~~~W~~v~~vp~~~~~~~sa~vv~L~Pwv~YeFRV~AvN~iG~gePS~pS~~~rT~ea~P~~~P~nv~g~g~~~~eLvI 735 (1051)
T KOG3513|consen 656 FPGKWKAVTTVPGNITGDESATVVNLSPWVEYEFRVVAVNSIGIGEPSPPSEKVRTPEAAPSVNPSNVKGGGGSPTELVI 735 (1051)
T ss_pred CCCcceEeeECCCcccCccceeEEccCCCcceEEEEEEEcccccCCCCCCccceecCCCCCccCCccccccCCCCceEEE
Confidence 5556665555554433 24778899999999999999999988876653 34456543 46788888888889999
Q ss_pred EeecCCC---CCcceEEEEEEeCCCCC-CceeEEeeccCCCCCcEEEecC--CCCCCeEEEEEEEEeCCCCCCceE-EEE
Q psy4956 202 LWQPSYP---ASIYTHYKVSIDPPDAP-ESVLYVEKEGEPPGPAQAAFKG--LVPGRAYNISVQTVSEDEISTPTT-AQY 274 (361)
Q Consensus 202 ~W~~~~~---~~~~~~y~v~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~--L~p~t~Y~v~V~a~~~~~~s~~~~-~~~ 274 (361)
+|+|-.. .|..-+|+|.|++.+.. .+....-. ..+...+.+.+ ..|.+.|.+.|+|+|..|.++... +.+
T Consensus 736 tW~Pl~~~~qNG~gfgY~Vswr~~g~~~~W~~~~v~---~~d~~~~V~~~~st~~~tpyevKVqa~N~~GeGp~s~~~v~ 812 (1051)
T KOG3513|consen 736 TWEPLPEEEQNGPGFGYRVSWRPQGADKEWKEVIVS---NQDQPRYVVSNESTEPFTPYEVKVQAINDQGEGPESQVTVG 812 (1051)
T ss_pred EeccCCHHHccCCCceEEEEEEeCCCCcccceeEec---ccCCceEEEcCCCCCCcceeEEEEEEecCCCCCCCCceEEE
Confidence 9987532 27778999999998876 54433222 22235566654 667999999999999988885443 344
Q ss_pred ecC----CCCCccceEEeeecCCceEEEEeeCCC-CCCCceeEEEEEEECCCCCCCCceeecCCCCceeEeccCCCCCce
Q psy4956 275 RTI----PLRPLSFTYDKASITSNSLRVVWEPPK-GFSEFDKYQVSINVRRPGASSTPITKSRDEPTQCDMSEGLEPGRT 349 (361)
Q Consensus 275 ~t~----~~~p~~l~~~~~~~~~~si~l~W~~~~-~~~~~~~y~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~p~t~ 349 (361)
.+. +.+|..+.+ ...+.+.+.|+|+++. ..+.+++|+|+|...+..+.........+..++..|++ |+|.|.
T Consensus 813 ~S~Ed~P~~ap~~~~~--~~~s~s~~~v~W~~~~~~nG~l~gY~v~Y~~~~~~~~~~~~~~i~~~~~~~~ltg-L~~~T~ 889 (1051)
T KOG3513|consen 813 YSGEDEPPVAPTKLSA--KPLSSSEVNLSWKPPLWDNGKLTGYEVKYWKINEKEGSLSRVQIAGNRTSWRLTG-LEPNTK 889 (1051)
T ss_pred EcCCCCCCCCCcccee--ecccCceEEEEecCcCccCCccceeEEEEEEcCCCcccccceeecCCcceEeeeC-CCCCce
Confidence 443 335677888 8899999999998752 11489999999999887644444455668889999996 999999
Q ss_pred EEEEEEEEeC
Q psy4956 350 YQVLVKTVSG 359 (361)
Q Consensus 350 Y~v~V~a~~~ 359 (361)
|.|.|+|.+.
T Consensus 890 Y~~~vrA~ns 899 (1051)
T KOG3513|consen 890 YRFYVRAYTS 899 (1051)
T ss_pred EEEEEEEecC
Confidence 9999999874
No 4
>KOG3513|consensus
Probab=99.96 E-value=8.6e-26 Score=206.66 Aligned_cols=346 Identities=16% Similarity=0.154 Sum_probs=240.1
Q ss_pred eCCCceEEEEecCCCCCCCcceEEEEEEcCCCCCCCC-eEEeecC-ccceEEecCCCCCceEEEEEEEecCCcCccc-ee
Q psy4956 6 HGVHGANLVIKIPENLSSDNSTYRLDYIPAHGHPPPN-TTYVSRD-IKDNIEFSEGLPGTKYDFYLYYTNSTVHDWL-TW 82 (361)
Q Consensus 6 ~~~~~~~l~~~~p~~~~~~~~~Y~v~~~~~~~~~~~~-~~~~~~~-~~~~~~i~~L~p~t~Y~i~V~a~~~~~~~~~-~~ 82 (361)
.....+.|+|+....+.++|..|.|+.+..-...+.. .+++... +..+.++.+|.|+..|+|+|+|.|..|.+.. ..
T Consensus 626 i~~t~~~lsW~~g~dn~SpI~~Y~iq~rt~~~~~W~~v~~vp~~~~~~~sa~vv~L~Pwv~YeFRV~AvN~iG~gePS~p 705 (1051)
T KOG3513|consen 626 ISDTTARLSWSPGSDNNSPIEKYTIQFRTPFPGKWKAVTTVPGNITGDESATVVNLSPWVEYEFRVVAVNSIGIGEPSPP 705 (1051)
T ss_pred eccceEEEEeecCCCCCCCceEEeEEecCCCCCcceEeeECCCcccCccceeEEccCCCcceEEEEEEEcccccCCCCCC
Confidence 4567788999998888889999999998864333222 2333331 1136889999999999999999996544322 22
Q ss_pred eEEEecCCCC----CCccEEEEEeCCEEEEEeeCCC----CCCcccEEEEEEECCCC-CCCeEEeecCCCCceEEEc--C
Q psy4956 83 TASITTPPDP----PTNLSVNVRSGKTAQIFWSPPI----SGKYSGFKLKVISLSEK-TPPRIIGFTENPPAGYSLK--D 151 (361)
Q Consensus 83 ~~~~~t~~~~----p~~l~~~~~~~~~v~l~W~~p~----~~~~~~y~v~~~~~~~~-~~~~~~~~~~~~~~~~~l~--~ 151 (361)
+...+|.+.+ |.++.......+.+.++|+|.. +|+-.+|+|.|++.+.. .|........... .+.+. .
T Consensus 706 S~~~rT~ea~P~~~P~nv~g~g~~~~eLvItW~Pl~~~~qNG~gfgY~Vswr~~g~~~~W~~~~v~~~d~~-~~V~~~~s 784 (1051)
T KOG3513|consen 706 SEKVRTPEAAPSVNPSNVKGGGGSPTELVITWEPLPEEEQNGPGFGYRVSWRPQGADKEWKEVIVSNQDQP-RYVVSNES 784 (1051)
T ss_pred ccceecCCCCCccCCccccccCCCCceEEEEeccCCHHHccCCCceEEEEEEeCCCCcccceeEecccCCc-eEEEcCCC
Confidence 5666776555 5778777777899999999966 78888999999998776 4554444433333 55554 4
Q ss_pred CCCCcEEEEEEEEEeCCCCcccceeeeeccC----CCCCcceEEEeccCCeEEEEeecCC-CCCcceEEEEEEeCCCCCC
Q psy4956 152 LTPGGSYQVQLFSVYDSKESVAYTSRNFTTK----PNTPGKFIVWFRNETTLLVLWQPSY-PASIYTHYKVSIDPPDAPE 226 (361)
Q Consensus 152 L~p~~~Y~v~v~a~~~~~~s~~~~~~~~~t~----p~~p~~l~~~~~~~~~~~v~W~~~~-~~~~~~~y~v~~~~~~~~~ 226 (361)
..|++.|.+.|+++|+.|.+..+....+... +.+|..+.+...+.+.+.|+|+++. ..|.+.+|.|.|+..+...
T Consensus 785 t~~~tpyevKVqa~N~~GeGp~s~~~v~~S~Ed~P~~ap~~~~~~~~s~s~~~v~W~~~~~~nG~l~gY~v~Y~~~~~~~ 864 (1051)
T KOG3513|consen 785 TEPFTPYEVKVQAINDQGEGPESQVTVGYSGEDEPPVAPTKLSAKPLSSSEVNLSWKPPLWDNGKLTGYEVKYWKINEKE 864 (1051)
T ss_pred CCCcceeEEEEEEecCCCCCCCCceEEEEcCCCCCCCCCccceeecccCceEEEEecCcCccCCccceeEEEEEEcCCCc
Confidence 6789999999999999998865444444333 5678889999999999999997763 3489999999999877653
Q ss_pred ceeEEeeccCCCCCcEEEecCCCCCCeEEEEEEEEeCCCCCCceE-EEEecCCCCCcc-------ceEEeeecCCceEEE
Q psy4956 227 SVLYVEKEGEPPGPAQAAFKGLVPGRAYNISVQTVSEDEISTPTT-AQYRTIPLRPLS-------FTYDKASITSNSLRV 298 (361)
Q Consensus 227 ~~~~~~~~~~~~~~~~~~~~~L~p~t~Y~v~V~a~~~~~~s~~~~-~~~~t~~~~p~~-------l~~~~~~~~~~si~l 298 (361)
...... ...++.++..|.||++.+.|.|.|+|.+..|.++++. ....+++.+|.. ..+ .......+.|
T Consensus 865 ~~~~~~--~i~~~~~~~~ltgL~~~T~Y~~~vrA~nsaG~Gp~s~~~~~tt~k~pPs~~~~~p~g~~~--~~~~~~~~~l 940 (1051)
T KOG3513|consen 865 GSLSRV--QIAGNRTSWRLTGLEPNTKYRFYVRAYTSAGGGPASSEENVTTKKAPPSQVDIAPPGNFI--WKFSASILLL 940 (1051)
T ss_pred ccccce--eecCCcceEeeeCCCCCceEEEEEEEecCCCCCCCccceeccccCCCCcccccCCCcceE--EeeeeeEEEE
Confidence 221111 1135678999999999999999999999988765543 445666655543 223 3556678899
Q ss_pred EeeCC---CCCCCceeEEEEEEECCCCCCCCceeecCCCCceeEeccCCCCCceEEEEEEEEeCC
Q psy4956 299 VWEPP---KGFSEFDKYQVSINVRRPGASSTPITKSRDEPTQCDMSEGLEPGRTYQVLVKTVSGK 360 (361)
Q Consensus 299 ~W~~~---~~~~~~~~y~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~p~t~Y~v~V~a~~~~ 360 (361)
.|..- ..-+.+.+|.|.++....... .+.........+-+...+ +-.|.+.|++.+.+
T Consensus 941 ~w~~v~~~~nes~v~gYkV~~~~~~~~~~--~~~~t~~~~~~~~~p~~~--~~~y~i~v~~~~~g 1001 (1051)
T KOG3513|consen 941 LWLLVSAFENESEVGGYKVLYREDLQNDI--EIILTMKLDAEFPEPSDL--DGKYDIKVRGYSPG 1001 (1051)
T ss_pred EEeeEEEEeecccCcceEEEEeecccCCc--eeEecccccccccCcccc--CCcceeEeccccCC
Confidence 99873 112468999999998754222 111111111122222224 25899999877653
No 5
>KOG0196|consensus
Probab=99.65 E-value=2.3e-14 Score=127.47 Aligned_cols=181 Identities=20% Similarity=0.321 Sum_probs=127.4
Q ss_pred eEEEecCCCCCCccEEEEEeCCEEEEEeeCCCC-C--CcccEEEEEEECCCCC------CCeEEeecCCCC---ceEEEc
Q psy4956 83 TASITTPPDPPTNLSVNVRSGKTAQIFWSPPIS-G--KYSGFKLKVISLSEKT------PPRIIGFTENPP---AGYSLK 150 (361)
Q Consensus 83 ~~~~~t~~~~p~~l~~~~~~~~~v~l~W~~p~~-~--~~~~y~v~~~~~~~~~------~~~~~~~~~~~~---~~~~l~ 150 (361)
..-.+.+|++|.++... ++.+++.|+|.+|.+ | .-..|.+.++...... .......+.+.. .++.+.
T Consensus 325 ~mpCT~PPSaP~nlis~-vn~Ts~~L~W~~P~d~GGR~Di~y~v~Ck~c~~~~~~C~~Cg~~V~f~P~q~gLt~~~V~v~ 403 (996)
T KOG0196|consen 325 SMPCTRPPSAPRNLISN-VNGTSLILEWSPPADTGGREDITYNVICKKCGGGRGACEPCGDNVRFTPRQRGLTETSVTVS 403 (996)
T ss_pred CCCCCCCCCccceeeee-cccceEEEEecCCcccCCCcceEEEEEeeccCCCCCccccCCCCceECCCCCCcccceEEEe
Confidence 34456678889998766 789999999999872 2 2335777776543221 011111121111 489999
Q ss_pred CCCCCcEEEEEEEEEeCCCCc----ccceeeeeccC---CCCCcceEEEeccCCeEEEEeecC-CCCCcceEEEEEEeCC
Q psy4956 151 DLTPGGSYQVQLFSVYDSKES----VAYTSRNFTTK---PNTPGKFIVWFRNETTLLVLWQPS-YPASIYTHYKVSIDPP 222 (361)
Q Consensus 151 ~L~p~~~Y~v~v~a~~~~~~s----~~~~~~~~~t~---p~~p~~l~~~~~~~~~~~v~W~~~-~~~~~~~~y~v~~~~~ 222 (361)
+|.|.+.|+|.|.|.|+...- .......++|. |++...++....+.+++.|+|..| .+.+.+..|.|.|.++
T Consensus 404 ~L~ah~~YTFeV~AvNgVS~lsp~~~~~a~vnItt~qa~ps~V~~~r~~~~~~~sitlsW~~p~~png~ildYEvky~ek 483 (996)
T KOG0196|consen 404 DLLAHTNYTFEVEAVNGVSDLSPFPRQFASVNITTNQAAPSPVSVLRQVSRTSDSITLSWSEPDQPNGVILDYEVKYYEK 483 (996)
T ss_pred ccccccccEEEEEEeecccccCCCCCcceeEEeeccccCCCccceEEEeeeccCceEEecCCCCCCCCcceeEEEEEeec
Confidence 999999999999999965221 22344555554 666677888899999999999655 4458899999999988
Q ss_pred CCCCce-eEEeeccCCCCCcEEEecCCCCCCeEEEEEEEEeCCCCCCc
Q psy4956 223 DAPESV-LYVEKEGEPPGPAQAAFKGLVPGRAYNISVQTVSEDEISTP 269 (361)
Q Consensus 223 ~~~~~~-~~~~~~~~~~~~~~~~~~~L~p~t~Y~v~V~a~~~~~~s~~ 269 (361)
+..... ...... .+..++.+|+|++.|.|+|+|.+..|.+.-
T Consensus 484 ~~~e~~~~~~~t~-----~~~~ti~gL~p~t~YvfqVRarT~aG~G~~ 526 (996)
T KOG0196|consen 484 DEDERSYSTLKTK-----TTTATITGLKPGTVYVFQVRARTAAGYGPY 526 (996)
T ss_pred cccccceeEEecc-----cceEEeeccCCCcEEEEEEEEecccCCCCC
Confidence 653332 222222 478999999999999999999998887643
No 6
>KOG0196|consensus
Probab=99.60 E-value=2.1e-13 Score=121.49 Aligned_cols=174 Identities=19% Similarity=0.266 Sum_probs=121.3
Q ss_pred ccCCCCCcceEEEeccCCeEEEEeecCCCCC-cc-eEEEEEEeCCCCC-------CceeEEeeccCCCCCcEEEecCCCC
Q psy4956 180 TTKPNTPGKFIVWFRNETTLLVLWQPSYPAS-IY-THYKVSIDPPDAP-------ESVLYVEKEGEPPGPAQAAFKGLVP 250 (361)
Q Consensus 180 ~t~p~~p~~l~~~~~~~~~~~v~W~~~~~~~-~~-~~y~v~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~L~p 250 (361)
+..|++|.+|... ++.+++.|.|.+|...| .- ..|.|.+..-... .....+......-.++.+.+.+|.|
T Consensus 329 T~PPSaP~nlis~-vn~Ts~~L~W~~P~d~GGR~Di~y~v~Ck~c~~~~~~C~~Cg~~V~f~P~q~gLt~~~V~v~~L~a 407 (996)
T KOG0196|consen 329 TRPPSAPRNLISN-VNGTSLILEWSPPADTGGREDITYNVICKKCGGGRGACEPCGDNVRFTPRQRGLTETSVTVSDLLA 407 (996)
T ss_pred CCCCCccceeeee-cccceEEEEecCCcccCCCcceEEEEEeeccCCCCCccccCCCCceECCCCCCcccceEEEecccc
Confidence 3447788888776 88999999999886553 22 2677776643321 1122222222234557899999999
Q ss_pred CCeEEEEEEEEeCCCC-C----CceEEEEecCCCCC---ccceEEeeecCCceEEEEeeCCCC-CCCceeEEEEEEECCC
Q psy4956 251 GRAYNISVQTVSEDEI-S----TPTTAQYRTIPLRP---LSFTYDKASITSNSLRVVWEPPKG-FSEFDKYQVSINVRRP 321 (361)
Q Consensus 251 ~t~Y~v~V~a~~~~~~-s----~~~~~~~~t~~~~p---~~l~~~~~~~~~~si~l~W~~~~~-~~~~~~y~i~~~~~~~ 321 (361)
.+.|+|.|.|+|+... + ..+.+...|....| ..++. ...+.+++.|+|..|+. .+.+..|+|+|.+...
T Consensus 408 h~~YTFeV~AvNgVS~lsp~~~~~a~vnItt~qa~ps~V~~~r~--~~~~~~sitlsW~~p~~png~ildYEvky~ek~~ 485 (996)
T KOG0196|consen 408 HTNYTFEVEAVNGVSDLSPFPRQFASVNITTNQAAPSPVSVLRQ--VSRTSDSITLSWSEPDQPNGVILDYEVKYYEKDE 485 (996)
T ss_pred ccccEEEEEEeecccccCCCCCcceeEEeeccccCCCccceEEE--eeeccCceEEecCCCCCCCCcceeEEEEEeeccc
Confidence 9999999999997432 1 22345666655554 34566 78889999999988642 2378999999998864
Q ss_pred CCCCCceeecCCCCceeEeccCCCCCceEEEEEEEEeC
Q psy4956 322 GASSTPITKSRDEPTQCDMSEGLEPGRTYQVLVKTVSG 359 (361)
Q Consensus 322 ~~~~~~~~~~~~~~~~~~~~~~L~p~t~Y~v~V~a~~~ 359 (361)
.+..... .....+..++.+ |+|++.|-|+|+|+..
T Consensus 486 ~e~~~~~--~~t~~~~~ti~g-L~p~t~YvfqVRarT~ 520 (996)
T KOG0196|consen 486 DERSYST--LKTKTTTATITG-LKPGTVYVFQVRARTA 520 (996)
T ss_pred cccceeE--EecccceEEeec-cCCCcEEEEEEEEecc
Confidence 3333333 345566899986 9999999999999864
No 7
>KOG4222|consensus
Probab=99.59 E-value=1.1e-13 Score=127.40 Aligned_cols=303 Identities=17% Similarity=0.236 Sum_probs=189.8
Q ss_pred ceEEecCCCCCceEEEEEEEecCCcCccceeeEEE---------------ecCCCCCCccEEEEEeCCEEEEEeeCCCC-
Q psy4956 52 DNIEFSEGLPGTKYDFYLYYTNSTVHDWLTWTASI---------------TTPPDPPTNLSVNVRSGKTAQIFWSPPIS- 115 (361)
Q Consensus 52 ~~~~i~~L~p~t~Y~i~V~a~~~~~~~~~~~~~~~---------------~t~~~~p~~l~~~~~~~~~v~l~W~~p~~- 115 (361)
.+..+.+|+-...|.....+....+....+.++.. ...|.+|..-.+..++.+++.|+|.+...
T Consensus 474 ~~l~i~dl~~~DtG~YTc~as~~~ges~wSatl~v~~~~~s~q~~r~~D~S~~pS~p~~p~v~~v~~~~v~LsW~~~s~s 553 (1281)
T KOG4222|consen 474 GSLRIRDLKGPDTGRYTCIASDESGESTWSATLTVEKAGSSQQFCRCEDPSALPSPPGTPGVVNVSRTSVTLSWQPTSPS 553 (1281)
T ss_pred ceeeeeeeecCCCcceeeeccCcccccccceeeEhhhcCcccccccCCChhhCCCCCCCCccccCCCceEEecccCCCCC
Confidence 45666777666666666666544433322222221 11245566667777889999999988663
Q ss_pred --CCcccEEEEEEECCC-CCCCeEEeecCCCCceEEEcCCCCCcEEEEEEEEEeCCCCcccceeee-eccCCCCC-----
Q psy4956 116 --GKYSGFKLKVISLSE-KTPPRIIGFTENPPAGYSLKDLTPGGSYQVQLFSVYDSKESVAYTSRN-FTTKPNTP----- 186 (361)
Q Consensus 116 --~~~~~y~v~~~~~~~-~~~~~~~~~~~~~~~~~~l~~L~p~~~Y~v~v~a~~~~~~s~~~~~~~-~~t~p~~p----- 186 (361)
.++.+|.+++...+- +.|...... ...+ .+.|.+|+|+..|.|.+++.+..|.+.+..... .++.|..+
T Consensus 554 g~vP~s~yiieafs~~~~etw~~ta~~-v~~t-~~~I~gL~P~~sylf~vRa~n~~Gis~Ps~~S~~vrta~a~~~~a~a 631 (1281)
T KOG4222|consen 554 GAVPASGYIIEAFSPDLGETWQTTAGR-VKTT-TYAIRGLKPNLSYLFLVRAENEQGISDPSTSSDPVRTAPADAAAAGA 631 (1281)
T ss_pred CccccchhHHHHhhhhhcccccccccc-cccc-eeeecCcCccceeeeeeeccccccccCCcccCCccccCCCChhhhhh
Confidence 357789998765432 233332222 2223 799999999999999999999988776544332 23332111
Q ss_pred -----------cce---EEEeccCCeEEEEeecCCC--CCcceEEEEEEeCCCCCC-ceeEEeeccCCCCCcEEEecCCC
Q psy4956 187 -----------GKF---IVWFRNETTLLVLWQPSYP--ASIYTHYKVSIDPPDAPE-SVLYVEKEGEPPGPAQAAFKGLV 249 (361)
Q Consensus 187 -----------~~l---~~~~~~~~~~~v~W~~~~~--~~~~~~y~v~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~L~ 249 (361)
..+ .....+.+++.+.|..... ...+.+|+|.|+...... ....... ......++.+.+|.
T Consensus 632 d~~k~~~~ls~~l~~l~~~~~L~asslr~~w~~~kq~~~~~i~g~~I~~r~~~~~~a~~s~~~v--~~~t~~s~v~~nl~ 709 (1281)
T KOG4222|consen 632 DHQKVQRELSNELLRLSNPNVLNASSLRLGWTKDKQHGSQYIQGYRISYRSLGSQLAQWSNAGV--TVPTPESVVVPNLK 709 (1281)
T ss_pred hHHHHHHhhcccceeeccccccchhheeeeeeeecccCcccccceEEEeccCccccccccccce--eccCCcceeccccC
Confidence 011 1223566789999976543 478899999999876521 1111111 13345788999999
Q ss_pred CCCeEEEEEEEEeCCCCC---Cc-eEEEEecC---CC-CCccceE-EeeecCCceEEEEeeCCCC---CCCceeEEEEEE
Q psy4956 250 PGRAYNISVQTVSEDEIS---TP-TTAQYRTI---PL-RPLSFTY-DKASITSNSLRVVWEPPKG---FSEFDKYQVSIN 317 (361)
Q Consensus 250 p~t~Y~v~V~a~~~~~~s---~~-~~~~~~t~---~~-~p~~l~~-~~~~~~~~si~l~W~~~~~---~~~~~~y~i~~~ 317 (361)
|++.|.|.+..+...+.+ .+ ....+.|. |. +|..++. ++.....+++.|+|.++.. .+.+.+|.|.+.
T Consensus 710 p~t~ye~f~~Pf~~~~~s~~g~pS~sk~alt~e~~PSapp~~~~~~s~~~~n~Ta~~Vsw~~pp~d~~ng~~qg~ki~~~ 789 (1281)
T KOG4222|consen 710 PGTNYEFFVRPFFPHGYSIQGAPSNSKTALTLEEPPSAPPQGVQHVSKGSYNGTAGSVSWAPPPADVQNGILQGYKIECS 789 (1281)
T ss_pred CCccceeeccCccCCCcceecCCcccccccccccCCCCCCCCccccccccCCCceeeEEecCCcccccCCcccceeEEee
Confidence 999999999998874432 22 22233332 32 3455432 1145566789999999632 147889999877
Q ss_pred ECCCCCCCCceeecCCCCceeEeccCCCCCceEEEEEEEEeCC
Q psy4956 318 VRRPGASSTPITKSRDEPTQCDMSEGLEPGRTYQVLVKTVSGK 360 (361)
Q Consensus 318 ~~~~~~~~~~~~~~~~~~~~~~~~~~L~p~t~Y~v~V~a~~~~ 360 (361)
.........+ ...+....+.++.+ |.+|..|.|+|.|.++.
T Consensus 790 ~~e~tr~h~n-~t~~a~~~sv~i~~-l~~g~ay~vtv~a~T~a 830 (1281)
T KOG4222|consen 790 GGEKTRIHIN-KTTNARTGSVTIGN-LVTGIAYSVTVAARTGA 830 (1281)
T ss_pred cCcccccccc-ccccCCCCceEecc-ccccceEEEEEeeecCC
Confidence 5542111122 22335667888875 99999999999998864
No 8
>KOG4258|consensus
Probab=99.40 E-value=1.2e-11 Score=110.92 Aligned_cols=261 Identities=21% Similarity=0.296 Sum_probs=160.8
Q ss_pred ccEEEEEeCCEEEEEeeCCC---CCCcccEEEEEEECC--------C-----CCCCeEEeecCC------CC-ceEEEcC
Q psy4956 95 NLSVNVRSGKTAQIFWSPPI---SGKYSGFKLKVISLS--------E-----KTPPRIIGFTEN------PP-AGYSLKD 151 (361)
Q Consensus 95 ~l~~~~~~~~~v~l~W~~p~---~~~~~~y~v~~~~~~--------~-----~~~~~~~~~~~~------~~-~~~~l~~ 151 (361)
++.....+.+++.|+|..-. ...+.+|.+.|+..- + ..++....+... .. -.+.+.+
T Consensus 493 ~~~~~~~~~dsi~lrW~~~~~~d~r~llg~~~~yKEaP~qNvT~~dg~~aCg~~~W~~~~v~~~~~~p~~~~~~~~~l~~ 572 (1025)
T KOG4258|consen 493 QFSSTVTSADSILLRWERYQPPDMRDLLGFLLHYKEAPFQNVTEEDGRDACGSNSWNVVDVDPPDLIPNDGTHPGFLLDG 572 (1025)
T ss_pred eeeeEEeecceeEEEecccCCcchhhhheeeEeeccCCccccceecCccccccCcceEEeccCCcCCCccccccceehhc
Confidence 34444556789999996532 234779999887521 0 012222333222 11 2578899
Q ss_pred CCCCcEEEEEEEEEeCCC--Cc--ccceeeeeccC---CCCCcceEEEeccCCeEEEEeecCC-CCCcceEEEEEEeCCC
Q psy4956 152 LTPGGSYQVQLFSVYDSK--ES--VAYTSRNFTTK---PNTPGKFIVWFRNETTLLVLWQPSY-PASIYTHYKVSIDPPD 223 (361)
Q Consensus 152 L~p~~~Y~v~v~a~~~~~--~s--~~~~~~~~~t~---p~~p~~l~~~~~~~~~~~v~W~~~~-~~~~~~~y~v~~~~~~ 223 (361)
|+|+|.|.+.|++...-. .. ..+...-++|. |.+|..+-.....++.+.|.|+||. +.|.++.|.|.|+...
T Consensus 573 LkP~TqYAvfVkT~t~t~~~~~~~A~S~I~YvqT~~~~PspPl~~ls~snsSsqi~l~W~pP~~pNG~lt~Ylv~wer~~ 652 (1025)
T KOG4258|consen 573 LKPWTQYAVFVKTLTVTEAHEAYEAKSKIGYVQTLPDIPSPPLDVLSKSNSSSQILLKWKPPSQPNGNLTHYLVVWERQA 652 (1025)
T ss_pred CCccceeEEEEeeeehhhhccccccccceEEEEecCCCCCCcchhhhccCcchheeEEecCCCCCCCceeEEEEEEEecc
Confidence 999999999999985221 11 11223334444 6677666666566668999999874 4599999999986421
Q ss_pred CCC------------------------c---------------------------------------------eeEEe--
Q psy4956 224 APE------------------------S---------------------------------------------VLYVE-- 232 (361)
Q Consensus 224 ~~~------------------------~---------------------------------------------~~~~~-- 232 (361)
... . ...++
T Consensus 653 ~~~yl~~~nYC~~~~k~p~~~~~p~~~~ed~d~~~e~e~~~~~Cc~c~~~~~~~~~e~eea~~~~~FEd~L~n~i~vpr~ 732 (1025)
T KOG4258|consen 653 EDGYLEQRNYCHKGLKLPIRADLPSFDSEDMDPLLEMEGHTGPCCSCPPTESYPQYEDEEASEQKTFEDFLHNAIFVPRR 732 (1025)
T ss_pred CCchHHHhccccccccccccccCCCCchhhcchhhhhccCCCCCCCCCcccccCchhhHHHHHHHHHhhhccceeeeccc
Confidence 000 0 00000
Q ss_pred -------------------e-----------------c----cC-CCCCcEEEecCCCCCCeEEEEEEEEeCCCC----C
Q psy4956 233 -------------------K-----------------E----GE-PPGPAQAAFKGLVPGRAYNISVQTVSEDEI----S 267 (361)
Q Consensus 233 -------------------~-----------------~----~~-~~~~~~~~~~~L~p~t~Y~v~V~a~~~~~~----s 267 (361)
. . .. ..+...+.+++|+..+.|.+.++|++.... |
T Consensus 733 ~~~krk~l~~~~n~t~~~~~~~~~~p~t~~~t~p~ei~e~~p~~~n~n~~~~vi~~Lrh~tlY~i~l~aCnh~~~~~~cS 812 (1025)
T KOG4258|consen 733 PDRKRKSLDDVENCTRLAPTRKAEEPTTPPTTAPTEIEEPKPRLENGNKESYVISGLRHFTLYRIDLQACNHATPKCGCS 812 (1025)
T ss_pred CcccccccccccceeeccccccccCCCCCCCCCCccccccCcccccccchhhhhhccccchhhhhhHhhhcccccccccc
Confidence 0 0 00 012235678899999999999999996554 6
Q ss_pred CceEEEEecCCCC-----CccceEEeeecCCceEEEEeeCCCC-CCCceeEEEEEEECCCCCCCCceeecC-CCCceeEe
Q psy4956 268 TPTTAQYRTIPLR-----PLSFTYDKASITSNSLRVVWEPPKG-FSEFDKYQVSINVRRPGASSTPITKSR-DEPTQCDM 340 (361)
Q Consensus 268 ~~~~~~~~t~~~~-----p~~l~~~~~~~~~~si~l~W~~~~~-~~~~~~y~i~~~~~~~~~~~~~~~~~~-~~~~~~~~ 340 (361)
..+.+.++|+|.. |..+.-+ -....+++.|+|.+|+. .+-+..|+|.|+...+......+.+.. .....+.+
T Consensus 813 ~a~~v~~RT~~~~~aD~i~g~v~we-~~~~~~~v~l~w~EP~~pNGli~~Y~Vk~r~~~~et~v~cvsR~~~~k~~gv~l 891 (1025)
T KOG4258|consen 813 HAAFVFARTMPTMGADDIPGPVTWE-CHIEMNSVILRWLEPKEPNGLILNYEVKYRRNGDETHVECVSRMDYAKAGGVYL 891 (1025)
T ss_pred hhhhhhhccccccccccCCCceeEe-cccCcceEEEecCCCCCCCccEEEEEEEEeeccCcchhhhhhhhhhhhcCceEE
Confidence 6666777776632 3333331 13477899999998743 248999999999776533222222211 22235777
Q ss_pred ccCCCCCceEEEEEEEEe
Q psy4956 341 SEGLEPGRTYQVLVKTVS 358 (361)
Q Consensus 341 ~~~L~p~t~Y~v~V~a~~ 358 (361)
.+ |.|| .|.++|+|.|
T Consensus 892 ~~-l~~G-~y~~~vratS 907 (1025)
T KOG4258|consen 892 KR-LNPG-NYSVRVRATS 907 (1025)
T ss_pred Ee-cCCC-cEEEEEEEEe
Confidence 75 9998 9999999976
No 9
>KOG4222|consensus
Probab=99.39 E-value=9.2e-12 Score=115.03 Aligned_cols=257 Identities=16% Similarity=0.178 Sum_probs=161.1
Q ss_pred eCCCceEEEEec--CCC-CCCCcceEEEEEEcCCCCCCCCeEEeecCccceEEecCCCCCceEEEEEEEecCCcCcccee
Q psy4956 6 HGVHGANLVIKI--PEN-LSSDNSTYRLDYIPAHGHPPPNTTYVSRDIKDNIEFSEGLPGTKYDFYLYYTNSTVHDWLTW 82 (361)
Q Consensus 6 ~~~~~~~l~~~~--p~~-~~~~~~~Y~v~~~~~~~~~~~~~~~~~~~~~~~~~i~~L~p~t~Y~i~V~a~~~~~~~~~~~ 82 (361)
..+.+..+...| +.. ...++.+|.+++-..+-... .......-..+.+.|.+|+|++.|.|.|++.+..| -+.+.
T Consensus 536 ~~v~~~~v~LsW~~~s~sg~vP~s~yiieafs~~~~et-w~~ta~~v~~t~~~I~gL~P~~sylf~vRa~n~~G-is~Ps 613 (1281)
T KOG4222|consen 536 VNVSRTSVTLSWQPTSPSGAVPASGYIIEAFSPDLGET-WQTTAGRVKTTTYAIRGLKPNLSYLFLVRAENEQG-ISDPS 613 (1281)
T ss_pred ccCCCceEEecccCCCCCCccccchhHHHHhhhhhccc-ccccccccccceeeecCcCccceeeeeeecccccc-ccCCc
Confidence 344445544444 444 34499999998866442222 22222222268999999999999999999988653 32222
Q ss_pred --eEEEecCCCCC----------------CccE---EEEEeCCEEEEEeeCCCC---CCcccEEEEEEECCCC-CCCeEE
Q psy4956 83 --TASITTPPDPP----------------TNLS---VNVRSGKTAQIFWSPPIS---GKYSGFKLKVISLSEK-TPPRII 137 (361)
Q Consensus 83 --~~~~~t~~~~p----------------~~l~---~~~~~~~~v~l~W~~p~~---~~~~~y~v~~~~~~~~-~~~~~~ 137 (361)
+..+++.+..+ .-++ ...+..+++.+.|..+.. .-+.+|+|.|+..... ......
T Consensus 614 ~~S~~vrta~a~~~~a~ad~~k~~~~ls~~l~~l~~~~~L~asslr~~w~~~kq~~~~~i~g~~I~~r~~~~~~a~~s~~ 693 (1281)
T KOG4222|consen 614 TSSDPVRTAPADAAAAGADHQKVQRELSNELLRLSNPNVLNASSLRLGWTKDKQHGSQYIQGYRISYRSLGSQLAQWSNA 693 (1281)
T ss_pred ccCCccccCCCChhhhhhhHHHHHHhhcccceeeccccccchhheeeeeeeecccCcccccceEEEeccCcccccccccc
Confidence 23333332211 1111 122457899999988662 3478999999987652 222222
Q ss_pred -eecCCCCceEEEcCCCCCcEEEEEEEEEeCCCCc---ccceeeeeccC---CC-CCcce---EEEeccCCeEEEEeecC
Q psy4956 138 -GFTENPPAGYSLKDLTPGGSYQVQLFSVYDSKES---VAYTSRNFTTK---PN-TPGKF---IVWFRNETTLLVLWQPS 206 (361)
Q Consensus 138 -~~~~~~~~~~~l~~L~p~~~Y~v~v~a~~~~~~s---~~~~~~~~~t~---p~-~p~~l---~~~~~~~~~~~v~W~~~ 206 (361)
....... ++.+.+|+|++.|++.++.+...+.+ .++....+.+. |. +|.++ .....+.+++.|+|.++
T Consensus 694 ~v~~~t~~-s~v~~nl~p~t~ye~f~~Pf~~~~~s~~g~pS~sk~alt~e~~PSapp~~~~~~s~~~~n~Ta~~Vsw~~p 772 (1281)
T KOG4222|consen 694 GVTVPTPE-SVVVPNLKPGTNYEFFVRPFFPHGYSIQGAPSNSKTALTLEEPPSAPPQGVQHVSKGSYNGTAGSVSWAPP 772 (1281)
T ss_pred ceeccCCc-ceeccccCCCccceeeccCccCCCcceecCCcccccccccccCCCCCCCCccccccccCCCceeeEEecCC
Confidence 2223344 89999999999999999998764433 23233333222 33 34554 44455678999999877
Q ss_pred CCC---CcceEEEEEEeCCCCCCceeEEeeccCCCCCcEEEecCCCCCCeEEEEEEEEeCCCCCC
Q psy4956 207 YPA---SIYTHYKVSIDPPDAPESVLYVEKEGEPPGPAQAAFKGLVPGRAYNISVQTVSEDEIST 268 (361)
Q Consensus 207 ~~~---~~~~~y~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~p~t~Y~v~V~a~~~~~~s~ 268 (361)
..+ +.+.+|.|.+....... ............+.++.+|.++..|.|+|.+.++.|.+.
T Consensus 773 p~d~~ng~~qg~ki~~~~~e~tr---~h~n~t~~a~~~sv~i~~l~~g~ay~vtv~a~T~aGvG~ 834 (1281)
T KOG4222|consen 773 PADVQNGILQGYKIECSGGEKTR---IHINKTTNARTGSVTIGNLVTGIAYSVTVAARTGAGVGV 834 (1281)
T ss_pred cccccCCcccceeEEeecCcccc---ccccccccCCCCceEeccccccceEEEEEeeecCCccCC
Confidence 333 77889999776433211 111222234457899999999999999999999888663
No 10
>PF00041 fn3: Fibronectin type III domain; InterPro: IPR003961 Fibronectins are multi-domain glycoproteins found in a soluble form in plasma, and in an insoluble form in loose connective tissue and basement membranes []. They contain multiple copies of 3 repeat regions (types I, II and III), which bind to a variety of substances including heparin, collagen, DNA, actin, fibrin and fibronectin receptors on cell surfaces. The wide variety of these substances means that fibronectins are involved in a number of important functions: e.g., wound healing; cell adhesion; blood coagulation; cell differentiation and migration; maintenance of the cellular cytoskeleton; and tumour metastasis []. The role of fibronectin in cell differentiation is demonstrated by the marked reduction in the expression of its gene when neoplastic transformation occurs. Cell attachment has been found to be mediated by the binding of the tetrapeptide RGDS to integrins on the cell surface [], although related sequences can also display cell adhesion activity. Plasma fibronectin occurs as a dimer of 2 different subunits, linked together by 2 disulphide bonds near the C terminus. The difference in the 2 chains occurs in the type III repeat region and is caused by alternative splicing of the mRNA from one gene []. The observation that, in a given protein, an individual repeat of one of the 3 types (e.g., the first FnIII repeat) shows much less similarity to its subsequent tandem repeats within that protein than to its equivalent repeat between fibronectins from other species, has suggested that the repeating structure of fibronectin arose at an early stage of evolution. It also seems to suggest that the structure is subject to high selective pressure []. The fibronectin type III repeat region is an approximately 100 amino acid domain, different tandem repeats of which contain binding sites for DNA, heparin and the cell surface []. The superfamily of sequences believed to contain FnIII repeats represents 45 different families, the majority of which are involved in cell surface binding in some manner, or are receptor protein tyrosine kinases, or cytokine receptors.; GO: 0005515 protein binding; PDB: 1UEM_A 1TDQ_A 1X5I_A 2IC2_B 2IBG_C 2IBB_A 3R8Q_A 2FNB_A 1FNH_A 2EDB_A ....
Probab=99.32 E-value=2e-11 Score=82.68 Aligned_cols=82 Identities=24% Similarity=0.493 Sum_probs=70.3
Q ss_pred CCCCccEEEEEeCCEEEEEeeCCC--CCCcccEEEEEEECCCCCCCeEEeecCCCCceEEEcCCCCCcEEEEEEEEEeCC
Q psy4956 91 DPPTNLSVNVRSGKTAQIFWSPPI--SGKYSGFKLKVISLSEKTPPRIIGFTENPPAGYSLKDLTPGGSYQVQLFSVYDS 168 (361)
Q Consensus 91 ~~p~~l~~~~~~~~~v~l~W~~p~--~~~~~~y~v~~~~~~~~~~~~~~~~~~~~~~~~~l~~L~p~~~Y~v~v~a~~~~ 168 (361)
++|.++.+...+.+++.|+|++|. .+.+.+|.|+|+............+..... .+.|.+|.|++.|.|+|+|.+..
T Consensus 1 s~P~~l~v~~~~~~sv~v~W~~~~~~~~~~~~y~v~~~~~~~~~~~~~~~~~~~~~-~~~i~~L~p~t~Y~~~v~a~~~~ 79 (85)
T PF00041_consen 1 SAPENLSVSNISPTSVTVSWKPPSSGNGPITGYRVEYRSVNSTSDWQEVTVPGNET-SYTITGLQPGTTYEFRVRAVNSD 79 (85)
T ss_dssp SSSEEEEEEEECSSEEEEEEEESSSTSSSESEEEEEEEETTSSSEEEEEEEETTSS-EEEEESCCTTSEEEEEEEEEETT
T ss_pred CcCcCeEEEECCCCEEEEEEECCCCCCCCeeEEEEEEEecccceeeeeeeeeeeee-eeeeccCCCCCEEEEEEEEEeCC
Confidence 468899999999999999999985 678999999999876655456666777777 99999999999999999999998
Q ss_pred CCccc
Q psy4956 169 KESVA 173 (361)
Q Consensus 169 ~~s~~ 173 (361)
|.+.+
T Consensus 80 g~g~~ 84 (85)
T PF00041_consen 80 GEGPP 84 (85)
T ss_dssp EEEEE
T ss_pred cCcCC
Confidence 87653
No 11
>PF00041 fn3: Fibronectin type III domain; InterPro: IPR003961 Fibronectins are multi-domain glycoproteins found in a soluble form in plasma, and in an insoluble form in loose connective tissue and basement membranes []. They contain multiple copies of 3 repeat regions (types I, II and III), which bind to a variety of substances including heparin, collagen, DNA, actin, fibrin and fibronectin receptors on cell surfaces. The wide variety of these substances means that fibronectins are involved in a number of important functions: e.g., wound healing; cell adhesion; blood coagulation; cell differentiation and migration; maintenance of the cellular cytoskeleton; and tumour metastasis []. The role of fibronectin in cell differentiation is demonstrated by the marked reduction in the expression of its gene when neoplastic transformation occurs. Cell attachment has been found to be mediated by the binding of the tetrapeptide RGDS to integrins on the cell surface [], although related sequences can also display cell adhesion activity. Plasma fibronectin occurs as a dimer of 2 different subunits, linked together by 2 disulphide bonds near the C terminus. The difference in the 2 chains occurs in the type III repeat region and is caused by alternative splicing of the mRNA from one gene []. The observation that, in a given protein, an individual repeat of one of the 3 types (e.g., the first FnIII repeat) shows much less similarity to its subsequent tandem repeats within that protein than to its equivalent repeat between fibronectins from other species, has suggested that the repeating structure of fibronectin arose at an early stage of evolution. It also seems to suggest that the structure is subject to high selective pressure []. The fibronectin type III repeat region is an approximately 100 amino acid domain, different tandem repeats of which contain binding sites for DNA, heparin and the cell surface []. The superfamily of sequences believed to contain FnIII repeats represents 45 different families, the majority of which are involved in cell surface binding in some manner, or are receptor protein tyrosine kinases, or cytokine receptors.; GO: 0005515 protein binding; PDB: 1UEM_A 1TDQ_A 1X5I_A 2IC2_B 2IBG_C 2IBB_A 3R8Q_A 2FNB_A 1FNH_A 2EDB_A ....
Probab=99.18 E-value=2.3e-10 Score=77.39 Aligned_cols=77 Identities=23% Similarity=0.460 Sum_probs=62.0
Q ss_pred CCccceEEeeecCCceEEEEeeCCC-CCCCceeEEEEEEECCCCCCCCceeecCCCCceeEeccCCCCCceEEEEEEEEe
Q psy4956 280 RPLSFTYDKASITSNSLRVVWEPPK-GFSEFDKYQVSINVRRPGASSTPITKSRDEPTQCDMSEGLEPGRTYQVLVKTVS 358 (361)
Q Consensus 280 ~p~~l~~~~~~~~~~si~l~W~~~~-~~~~~~~y~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~p~t~Y~v~V~a~~ 358 (361)
+|.+|.+ ..++.+++.|+|+.+. +.+.+++|.|.|...+... .............+++.+ |+|++.|.|+|+|++
T Consensus 2 ~P~~l~v--~~~~~~sv~v~W~~~~~~~~~~~~y~v~~~~~~~~~-~~~~~~~~~~~~~~~i~~-L~p~t~Y~~~v~a~~ 77 (85)
T PF00041_consen 2 APENLSV--SNISPTSVTVSWKPPSSGNGPITGYRVEYRSVNSTS-DWQEVTVPGNETSYTITG-LQPGTTYEFRVRAVN 77 (85)
T ss_dssp SSEEEEE--EEECSSEEEEEEEESSSTSSSESEEEEEEEETTSSS-EEEEEEEETTSSEEEEES-CCTTSEEEEEEEEEE
T ss_pred cCcCeEE--EECCCCEEEEEEECCCCCCCCeeEEEEEEEecccce-eeeeeeeeeeeeeeeecc-CCCCCEEEEEEEEEe
Confidence 6889999 8889999999999983 3358999999999886543 123344455566899986 999999999999998
Q ss_pred CC
Q psy4956 359 GK 360 (361)
Q Consensus 359 ~~ 360 (361)
++
T Consensus 78 ~~ 79 (85)
T PF00041_consen 78 SD 79 (85)
T ss_dssp TT
T ss_pred CC
Confidence 75
No 12
>KOG4258|consensus
Probab=99.13 E-value=2.1e-09 Score=96.96 Aligned_cols=255 Identities=17% Similarity=0.212 Sum_probs=151.1
Q ss_pred CceEEEEecCCCC---CCCcceEEEEEEcCC--------C-----CCCC-CeEEe------ecCccceEEecCCCCCceE
Q psy4956 9 HGANLVIKIPENL---SSDNSTYRLDYIPAH--------G-----HPPP-NTTYV------SRDIKDNIEFSEGLPGTKY 65 (361)
Q Consensus 9 ~~~~l~~~~p~~~---~~~~~~Y~v~~~~~~--------~-----~~~~-~~~~~------~~~~~~~~~i~~L~p~t~Y 65 (361)
..+.+.++|+.-. .-...||.+.|.... | ...+ ...+. ......-+.+.+|+|+|+|
T Consensus 500 ~~dsi~lrW~~~~~~d~r~llg~~~~yKEaP~qNvT~~dg~~aCg~~~W~~~~v~~~~~~p~~~~~~~~~l~~LkP~TqY 579 (1025)
T KOG4258|consen 500 SADSILLRWERYQPPDMRDLLGFLLHYKEAPFQNVTEEDGRDACGSNSWNVVDVDPPDLIPNDGTHPGFLLDGLKPWTQY 579 (1025)
T ss_pred ecceeEEEecccCCcchhhhheeeEeeccCCccccceecCccccccCcceEEeccCCcCCCccccccceehhcCCcccee
Confidence 3344555554422 226889999997642 1 1111 11111 1122346889999999999
Q ss_pred EEEEEEecCCc----CccceeeEEEec---CCCCCCccEEEEEeCCEEEEEeeCCC--CCCcccEEEEEEECCCCCC---
Q psy4956 66 DFYLYYTNSTV----HDWLTWTASITT---PPDPPTNLSVNVRSGKTAQIFWSPPI--SGKYSGFKLKVISLSEKTP--- 133 (361)
Q Consensus 66 ~i~V~a~~~~~----~~~~~~~~~~~t---~~~~p~~l~~~~~~~~~v~l~W~~p~--~~~~~~y~v~~~~~~~~~~--- 133 (361)
.+.|++..... ....+...-+.| .|++|..+.....+++.+.|.|+||. +|.+++|.+.|......+.
T Consensus 580 AvfVkT~t~t~~~~~~~A~S~I~YvqT~~~~PspPl~~ls~snsSsqi~l~W~pP~~pNG~lt~Ylv~wer~~~~~yl~~ 659 (1025)
T KOG4258|consen 580 AVFVKTLTVTEAHEAYEAKSKIGYVQTLPDIPSPPLDVLSKSNSSSQILLKWKPPSQPNGNLTHYLVVWERQAEDGYLEQ 659 (1025)
T ss_pred EEEEeeeehhhhccccccccceEEEEecCCCCCCcchhhhccCcchheeEEecCCCCCCCceeEEEEEEEeccCCchHHH
Confidence 99999863210 111122222334 46777666666666778999999988 7889999998864211000
Q ss_pred --------------------CeE--------------------------------------------Eeec---------
Q psy4956 134 --------------------PRI--------------------------------------------IGFT--------- 140 (361)
Q Consensus 134 --------------------~~~--------------------------------------------~~~~--------- 140 (361)
.+. ..++
T Consensus 660 ~nYC~~~~k~p~~~~~p~~~~ed~d~~~e~e~~~~~Cc~c~~~~~~~~~e~eea~~~~~FEd~L~n~i~vpr~~~~krk~ 739 (1025)
T KOG4258|consen 660 RNYCHKGLKLPIRADLPSFDSEDMDPLLEMEGHTGPCCSCPPTESYPQYEDEEASEQKTFEDFLHNAIFVPRRPDRKRKS 739 (1025)
T ss_pred hccccccccccccccCCCCchhhcchhhhhccCCCCCCCCCcccccCchhhHHHHHHHHHhhhccceeeecccCcccccc
Confidence 000 0000
Q ss_pred ---------------------------------------CCCCceEEEcCCCCCcEEEEEEEEEeCCCC----cccceee
Q psy4956 141 ---------------------------------------ENPPAGYSLKDLTPGGSYQVQLFSVYDSKE----SVAYTSR 177 (361)
Q Consensus 141 ---------------------------------------~~~~~~~~l~~L~p~~~Y~v~v~a~~~~~~----s~~~~~~ 177 (361)
.+. .++.+.+|.+.+.|.+.+.|++..+. |.. ...
T Consensus 740 l~~~~n~t~~~~~~~~~~p~t~~~t~p~ei~e~~p~~~n~n~-~~~vi~~Lrh~tlY~i~l~aCnh~~~~~~cS~a-~~v 817 (1025)
T KOG4258|consen 740 LDDVENCTRLAPTRKAEEPTTPPTTAPTEIEEPKPRLENGNK-ESYVISGLRHFTLYRIDLQACNHATPKCGCSHA-AFV 817 (1025)
T ss_pred cccccceeeccccccccCCCCCCCCCCccccccCcccccccc-hhhhhhccccchhhhhhHhhhcccccccccchh-hhh
Confidence 001 25677889999999999999986654 332 222
Q ss_pred eeccCC----C-CCcceEEEe-ccCCeEEEEeecC-CCCCcceEEEEEEeCCCCCCceeEEeeccCCCCCcEEEecCCCC
Q psy4956 178 NFTTKP----N-TPGKFIVWF-RNETTLLVLWQPS-YPASIYTHYKVSIDPPDAPESVLYVEKEGEPPGPAQAAFKGLVP 250 (361)
Q Consensus 178 ~~~t~p----~-~p~~l~~~~-~~~~~~~v~W~~~-~~~~~~~~y~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~p 250 (361)
..+|.| + -|.-+.-.. ...+.+.+.|..| .+.|.+..|.|.++..+.......+.... ......+.|.+|.|
T Consensus 818 ~~RT~~~~~aD~i~g~v~we~~~~~~~v~l~w~EP~~pNGli~~Y~Vk~r~~~~et~v~cvsR~~-~~k~~gv~l~~l~~ 896 (1025)
T KOG4258|consen 818 FARTMPTMGADDIPGPVTWECHIEMNSVILRWLEPKEPNGLILNYEVKYRRNGDETHVECVSRMD-YAKAGGVYLKRLNP 896 (1025)
T ss_pred hhccccccccccCCCceeEecccCcceEEEecCCCCCCCccEEEEEEEEeeccCcchhhhhhhhh-hhhcCceEEEecCC
Confidence 223332 1 122222222 3677899999655 45599999999999765544332221111 11225678899999
Q ss_pred CCeEEEEEEEEeCCCCC
Q psy4956 251 GRAYNISVQTVSEDEIS 267 (361)
Q Consensus 251 ~t~Y~v~V~a~~~~~~s 267 (361)
| .|.++|+|.+-.|.+
T Consensus 897 G-~y~~~vratSlaGng 912 (1025)
T KOG4258|consen 897 G-NYSVRVRATSLAGNG 912 (1025)
T ss_pred C-cEEEEEEEEeeccCC
Confidence 9 899999998865554
No 13
>PF10179 DUF2369: Uncharacterised conserved protein (DUF2369); InterPro: IPR019326 This is a proline-rich region of a group of proteins found from plants to fungi. The function is largely unknown, although the entry contains Fibronectin type-III domain-containing protein C4orf31, which promotes matrix assembly and cell adhesiveness.
Probab=98.94 E-value=3.5e-06 Score=69.07 Aligned_cols=216 Identities=19% Similarity=0.184 Sum_probs=114.6
Q ss_pred ecCccceEEecCCCCCceEEEEEEEecCCcC-ccceeeEEEec---CCCCCCcc-----EEEEEeC-C-EEEEEeeCCC-
Q psy4956 47 SRDIKDNIEFSEGLPGTKYDFYLYYTNSTVH-DWLTWTASITT---PPDPPTNL-----SVNVRSG-K-TAQIFWSPPI- 114 (361)
Q Consensus 47 ~~~~~~~~~i~~L~p~t~Y~i~V~a~~~~~~-~~~~~~~~~~t---~~~~p~~l-----~~~~~~~-~-~v~l~W~~p~- 114 (361)
..+..+.++|.+|.|+++|.|.|-+.+.... ++.-....+.+ ...+|..| ....+.. . .-..+++.|.
T Consensus 10 Cvg~~t~~t~~~L~p~t~YyfdVF~vn~~~n~ssay~gt~~~t~~~~r~~~~~Lkdg~l~~~~l~~~~g~~~f~f~vP~~ 89 (300)
T PF10179_consen 10 CVGQKTNQTLSGLKPDTTYYFDVFVVNQLTNNSSAYLGTFARTREENRSKPTRLKDGKLTQVKLKGKGGFKFFSFKVPKR 89 (300)
T ss_pred EcCCCceEEeccCCCCCeEEEEEEEEECCCCceeeeeEEEEEEccccCCCcEEcccCcEEEEEECCcCceeEEEEcCCcC
Confidence 3345799999999999999999999876423 22222233333 22233222 2222222 1 2445566441
Q ss_pred CCCcccEEEEEEECCCC--------C--CCeEEeecCCCCceEEEcCCCCCcEEEEEEEEEeCCCCcccceeee-eccC-
Q psy4956 115 SGKYSGFKLKVISLSEK--------T--PPRIIGFTENPPAGYSLKDLTPGGSYQVQLFSVYDSKESVAYTSRN-FTTK- 182 (361)
Q Consensus 115 ~~~~~~y~v~~~~~~~~--------~--~~~~~~~~~~~~~~~~l~~L~p~~~Y~v~v~a~~~~~~s~~~~~~~-~~t~- 182 (361)
......-.+.....++. + ......+ ... ..+.+.++.||..|.+++.+.+....... .-.. .++.
T Consensus 90 ~~~~~~~~l~v~~C~G~V~v~i~r~gk~l~~~~~v-~~~-~~f~l~~~~~g~~Yliri~~~~~~e~~~~-~kV~aast~~ 166 (300)
T PF10179_consen 90 SSTHQSLWLFVQSCSGSVRVEISRNGKILLSQKNV-EGL-RHFRLSGVKPGERYLIRIQISNSDEGPST-FKVQAASTNP 166 (300)
T ss_pred CCCCccEEEEEEeCCCeEEEEEEECCeEEeeeecc-cce-EEEEECCCCCCCeEEEEEEccCCCCCceE-EEEEEecCCc
Confidence 11111111112222211 0 0001011 111 37999999999999999977654332211 1111 2332
Q ss_pred -----CCCCcceEEEe----ccCCeEEEEeecCCCCCcceEEEEEEeCCCCC----------------Cc-----eeEEe
Q psy4956 183 -----PNTPGKFIVWF----RNETTLLVLWQPSYPASIYTHYKVSIDPPDAP----------------ES-----VLYVE 232 (361)
Q Consensus 183 -----p~~p~~l~~~~----~~~~~~~v~W~~~~~~~~~~~y~v~~~~~~~~----------------~~-----~~~~~ 232 (361)
|.-|.+..+.. .+.++++|.|.+.. +.. ..|.|..+..... .. .....
T Consensus 167 ~~~~~P~LP~d~~Ik~f~~lrtC~SvTIAW~~s~-d~~-~kYCvy~~~~~~~~~~~~~~~~~n~C~~~~sr~k~e~v~Ck 244 (300)
T PF10179_consen 167 SKQPYPQLPDDTSIKEFNKLRTCNSVTIAWLGSP-DRS-IKYCVYRREEHSNYQERSVSRMPNQCLGPESRKKSEKVLCK 244 (300)
T ss_pred ccCCCCCCCCCCceeEEcCCcccceEEEEEecCC-CCC-ceEEEEEEEecCchhhhhhcccCccCCCCCccccceEEEEE
Confidence 56677766643 46689999998753 222 4788866532221 00 01110
Q ss_pred -ecc------CCCCCcEEEecCCCCCCeEEEEEEEEeCCCCC
Q psy4956 233 -KEG------EPPGPAQAAFKGLVPGRAYNISVQTVSEDEIS 267 (361)
Q Consensus 233 -~~~------~~~~~~~~~~~~L~p~t~Y~v~V~a~~~~~~s 267 (361)
... ........+|.+|+||+.|.|.|.+.-..|.+
T Consensus 245 ~~~~~n~~~~~~~~v~tetI~~L~PG~~Yl~dV~~~~~~G~s 286 (300)
T PF10179_consen 245 YFHSPNSSEDPQRAVTTETIKGLKPGTTYLFDVYVNGPSGQS 286 (300)
T ss_pred EEcCCccccccccccceeecccCCCCcEEEEEEEEecCCCce
Confidence 010 01122345799999999999999998766654
No 14
>PF10179 DUF2369: Uncharacterised conserved protein (DUF2369); InterPro: IPR019326 This is a proline-rich region of a group of proteins found from plants to fungi. The function is largely unknown, although the entry contains Fibronectin type-III domain-containing protein C4orf31, which promotes matrix assembly and cell adhesiveness.
Probab=98.83 E-value=1.3e-05 Score=65.76 Aligned_cols=116 Identities=23% Similarity=0.256 Sum_probs=72.6
Q ss_pred CcEEEecCCCCCCeEEEEEEEEeCCCCCCceEEE-EecC------CCCCccceEEeee--cCCceEEEEeeCCCCCCCce
Q psy4956 240 PAQAAFKGLVPGRAYNISVQTVSEDEISTPTTAQ-YRTI------PLRPLSFTYDKAS--ITSNSLRVVWEPPKGFSEFD 310 (361)
Q Consensus 240 ~~~~~~~~L~p~t~Y~v~V~a~~~~~~s~~~~~~-~~t~------~~~p~~l~~~~~~--~~~~si~l~W~~~~~~~~~~ 310 (361)
-..+.+.|+.||..|.+++.+.+.........+. +.+. |.-|.++.+.... -+-++++|.|.+..+. . .
T Consensus 127 ~~~f~l~~~~~g~~Yliri~~~~~~e~~~~~kV~aast~~~~~~~P~LP~d~~Ik~f~~lrtC~SvTIAW~~s~d~-~-~ 204 (300)
T PF10179_consen 127 LRHFRLSGVKPGERYLIRIQISNSDEGPSTFKVQAASTNPSKQPYPQLPDDTSIKEFNKLRTCNSVTIAWLGSPDR-S-I 204 (300)
T ss_pred eEEEEECCCCCCCeEEEEEEccCCCCCceEEEEEEecCCcccCCCCCCCCCCceeEEcCCcccceEEEEEecCCCC-C-c
Confidence 3678999999999999999887654433333344 3333 3347777773334 4558999999985441 2 4
Q ss_pred eEEEEEEECCCC-------------C---CCCc---eee-----------cCCCCceeEeccCCCCCceEEEEEEEEe
Q psy4956 311 KYQVSINVRRPG-------------A---SSTP---ITK-----------SRDEPTQCDMSEGLEPGRTYQVLVKTVS 358 (361)
Q Consensus 311 ~y~i~~~~~~~~-------------~---~~~~---~~~-----------~~~~~~~~~~~~~L~p~t~Y~v~V~a~~ 358 (361)
.|.|..++.+.. . .... +.. .....+..++.+ |+||+.|.|.|.+.-
T Consensus 205 kYCvy~~~~~~~~~~~~~~~~~n~C~~~~sr~k~e~v~Ck~~~~~n~~~~~~~~v~tetI~~-L~PG~~Yl~dV~~~~ 281 (300)
T PF10179_consen 205 KYCVYRREEHSNYQERSVSRMPNQCLGPESRKKSEKVLCKYFHSPNSSEDPQRAVTTETIKG-LKPGTTYLFDVYVNG 281 (300)
T ss_pred eEEEEEEEecCchhhhhhcccCccCCCCCccccceEEEEEEEcCCccccccccccceeeccc-CCCCcEEEEEEEEec
Confidence 788876654432 0 0000 100 011223457875 999999999999974
No 15
>KOG4802|consensus
Probab=98.77 E-value=1.6e-07 Score=77.96 Aligned_cols=130 Identities=20% Similarity=0.232 Sum_probs=83.3
Q ss_pred eEEeecCCCCceEEEcCCCCCcEEEEEEEEEeCCC---Ccccceeeeec---cCCCCCcceEEEeccCC---eEEEEeec
Q psy4956 135 RIIGFTENPPAGYSLKDLTPGGSYQVQLFSVYDSK---ESVAYTSRNFT---TKPNTPGKFIVWFRNET---TLLVLWQP 205 (361)
Q Consensus 135 ~~~~~~~~~~~~~~l~~L~p~~~Y~v~v~a~~~~~---~s~~~~~~~~~---t~p~~p~~l~~~~~~~~---~~~v~W~~ 205 (361)
+........+ .+.+.++.||.-|.|+|.|++..| -+.++...... ..|.+|.++.+..+..+ ...|.|-|
T Consensus 200 Qtv~~t~~e~-~~~~t~~rPgRwyefrvaavn~~G~rGFs~PSkpf~ssk~pkaPp~P~dl~l~~v~~dG~~~~~v~w~P 278 (516)
T KOG4802|consen 200 QTVEKTMEEN-TYIFTDMRPGRWYEFRVAAVNAYGFRGFSEPSKPFPSSKNPKAPPSPNDLKLIGVQFDGRYMLKVVWCP 278 (516)
T ss_pred eeeeecCCCc-eeeeeecCcceeEEEEEeeeecccccccCCCCCCCCCCCCCCCCcCcccceeeeeeecceEEEEEEeCC
Confidence 3333334444 788899999999999999998543 33333222222 22557778777654443 46788888
Q ss_pred CCCCCcceEEEEEEeCCCCCCceeEEeeccCCCCCcEEEecCCCCCCeEEEEEEEEeCCC
Q psy4956 206 SYPASIYTHYKVSIDPPDAPESVLYVEKEGEPPGPAQAAFKGLVPGRAYNISVQTVSEDE 265 (361)
Q Consensus 206 ~~~~~~~~~y~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~p~t~Y~v~V~a~~~~~ 265 (361)
+.++-++..|.|.|...-...................+.|.+|.|+..|.|.|+|+.--|
T Consensus 279 ~~sdlPv~~Yki~Ws~~v~s~k~~m~tks~~~k~thq~si~~L~Pns~Y~VevqAi~y~g 338 (516)
T KOG4802|consen 279 SKSDLPVEKYKITWSLYVNSAKASMITKSSYVKDTHQFSIKELLPNSSYYVEVQAISYLG 338 (516)
T ss_pred CCCCCcceeeEEEeehhhhhhhhhcccccceeeccchhhhhhcCCCCeEEEEEEEEEecc
Confidence 877889999999997543211111111111122234566999999999999999998444
No 16
>cd00063 FN3 Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
Probab=98.62 E-value=1e-06 Score=60.11 Aligned_cols=82 Identities=27% Similarity=0.500 Sum_probs=59.7
Q ss_pred CCCCCccEEEEEeCCEEEEEeeCCCC--CCcccEEEEEEECCCCCCCeEEeec-CCCCceEEEcCCCCCcEEEEEEEEEe
Q psy4956 90 PDPPTNLSVNVRSGKTAQIFWSPPIS--GKYSGFKLKVISLSEKTPPRIIGFT-ENPPAGYSLKDLTPGGSYQVQLFSVY 166 (361)
Q Consensus 90 ~~~p~~l~~~~~~~~~v~l~W~~p~~--~~~~~y~v~~~~~~~~~~~~~~~~~-~~~~~~~~l~~L~p~~~Y~v~v~a~~ 166 (361)
|.+|.++.+......++.|.|.++.. +....|.|.+.......+. ..... .... .+.+.+|.|++.|.++|++..
T Consensus 1 p~~p~~~~~~~~~~~~~~v~W~~~~~~~~~~~~y~v~~~~~~~~~~~-~~~~~~~~~~-~~~i~~l~p~~~Y~~~v~a~~ 78 (93)
T cd00063 1 PSPPTNLRVTDVTSTSVTLSWTPPEDDGGPITGYVVEYREKGSGDWK-EVEVTPGSET-SYTLTGLKPGTEYEFRVRAVN 78 (93)
T ss_pred CcCCCCcEEEEecCCEEEEEECCCCCCCCcceeEEEEEeeCCCCCCE-EeeccCCccc-EEEEccccCCCEEEEEEEEEC
Confidence 34677788887778999999998753 3577899998875422222 22221 2444 899999999999999999998
Q ss_pred CCCCccc
Q psy4956 167 DSKESVA 173 (361)
Q Consensus 167 ~~~~s~~ 173 (361)
..+.+..
T Consensus 79 ~~~~~~~ 85 (93)
T cd00063 79 GGGESPP 85 (93)
T ss_pred CCccCCC
Confidence 7766643
No 17
>cd00063 FN3 Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
Probab=98.49 E-value=3.1e-06 Score=57.65 Aligned_cols=85 Identities=25% Similarity=0.371 Sum_probs=60.7
Q ss_pred CCCCcceEEEeccCCeEEEEeecCCCC-CcceEEEEEEeCCCCCCceeEEeeccCCCCCcEEEecCCCCCCeEEEEEEEE
Q psy4956 183 PNTPGKFIVWFRNETTLLVLWQPSYPA-SIYTHYKVSIDPPDAPESVLYVEKEGEPPGPAQAAFKGLVPGRAYNISVQTV 261 (361)
Q Consensus 183 p~~p~~l~~~~~~~~~~~v~W~~~~~~-~~~~~y~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~p~t~Y~v~V~a~ 261 (361)
|.+|.++.+......++.|.|+++... +.+..|.|.+.......... .... ......+.+.+|.|++.|.|+|.+.
T Consensus 1 p~~p~~~~~~~~~~~~~~v~W~~~~~~~~~~~~y~v~~~~~~~~~~~~-~~~~--~~~~~~~~i~~l~p~~~Y~~~v~a~ 77 (93)
T cd00063 1 PSPPTNLRVTDVTSTSVTLSWTPPEDDGGPITGYVVEYREKGSGDWKE-VEVT--PGSETSYTLTGLKPGTEYEFRVRAV 77 (93)
T ss_pred CcCCCCcEEEEecCCEEEEEECCCCCCCCcceeEEEEEeeCCCCCCEE-eecc--CCcccEEEEccccCCCEEEEEEEEE
Confidence 346777888877889999999876432 56789999998764222222 2111 1235789999999999999999999
Q ss_pred eCCCCCCce
Q psy4956 262 SEDEISTPT 270 (361)
Q Consensus 262 ~~~~~s~~~ 270 (361)
+..+.+..+
T Consensus 78 ~~~~~~~~s 86 (93)
T cd00063 78 NGGGESPPS 86 (93)
T ss_pred CCCccCCCc
Confidence 876665443
No 18
>KOG4802|consensus
Probab=98.42 E-value=4e-06 Score=69.98 Aligned_cols=130 Identities=15% Similarity=0.193 Sum_probs=82.9
Q ss_pred CCeEEeecCccceEEecCCCCCceEEEEEEEecCCcCc--ccee-eEEE---ecCCCCCCccEEEEEe---CCEEEEEee
Q psy4956 41 PNTTYVSRDIKDNIEFSEGLPGTKYDFYLYYTNSTVHD--WLTW-TASI---TTPPDPPTNLSVNVRS---GKTAQIFWS 111 (361)
Q Consensus 41 ~~~~~~~~~~~~~~~i~~L~p~t~Y~i~V~a~~~~~~~--~~~~-~~~~---~t~~~~p~~l~~~~~~---~~~v~l~W~ 111 (361)
.+.++........+.+.++.||..|.|+|.|+|.-|.. +.+. -... .-+|.+|.++.+..+. .-...|-|.
T Consensus 198 hwQtv~~t~~e~~~~~t~~rPgRwyefrvaavn~~G~rGFs~PSkpf~ssk~pkaPp~P~dl~l~~v~~dG~~~~~v~w~ 277 (516)
T KOG4802|consen 198 HWQTVEKTMEENTYIFTDMRPGRWYEFRVAAVNAYGFRGFSEPSKPFPSSKNPKAPPSPNDLKLIGVQFDGRYMLKVVWC 277 (516)
T ss_pred cceeeeecCCCceeeeeecCcceeEEEEEeeeecccccccCCCCCCCCCCCCCCCCcCcccceeeeeeecceEEEEEEeC
Confidence 35666666556789999999999999999998743332 2221 1111 2246667888776643 234678888
Q ss_pred CCC-CCCcccEEEEEEECCCC---CCCeEEeecCCCCceEEEcCCCCCcEEEEEEEEEeCCCCc
Q psy4956 112 PPI-SGKYSGFKLKVISLSEK---TPPRIIGFTENPPAGYSLKDLTPGGSYQVQLFSVYDSKES 171 (361)
Q Consensus 112 ~p~-~~~~~~y~v~~~~~~~~---~~~~~~~~~~~~~~~~~l~~L~p~~~Y~v~v~a~~~~~~s 171 (361)
|+. +-++.+|.|.|...-.. ............. .+.|.+|.|+..|.|.|.|....|..
T Consensus 278 P~~sdlPv~~Yki~Ws~~v~s~k~~m~tks~~~k~th-q~si~~L~Pns~Y~VevqAi~y~g~~ 340 (516)
T KOG4802|consen 278 PSKSDLPVEKYKITWSLYVNSAKASMITKSSYVKDTH-QFSIKELLPNSSYYVEVQAISYLGSR 340 (516)
T ss_pred CCCCCCcceeeEEEeehhhhhhhhhcccccceeeccc-hhhhhhcCCCCeEEEEEEEEEeccCc
Confidence 866 45789999998653211 1111111111122 34489999999999999999865543
No 19
>smart00060 FN3 Fibronectin type 3 domain. One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
Probab=98.36 E-value=1.2e-05 Score=53.21 Aligned_cols=77 Identities=22% Similarity=0.449 Sum_probs=52.5
Q ss_pred CCCccEEEEEeCCEEEEEeeCCCCCCcccEEEEEEECCCCC--CCeEEeecCCCCceEEEcCCCCCcEEEEEEEEEeCCC
Q psy4956 92 PPTNLSVNVRSGKTAQIFWSPPISGKYSGFKLKVISLSEKT--PPRIIGFTENPPAGYSLKDLTPGGSYQVQLFSVYDSK 169 (361)
Q Consensus 92 ~p~~l~~~~~~~~~v~l~W~~p~~~~~~~y~v~~~~~~~~~--~~~~~~~~~~~~~~~~l~~L~p~~~Y~v~v~a~~~~~ 169 (361)
+|.++.+......++.|+|++|......+|.+.|....... ............ .+.+.+|.|++.|.|+|++.+..+
T Consensus 3 ~p~~~~~~~~~~~~~~v~W~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~-~~~i~~L~~~~~Y~v~v~a~~~~g 81 (83)
T smart00060 3 PPSNLRVTDVTSTSVTLSWEPPPDDGITGYIVGYRVEYREEGSSWKEVNVTPSST-SYTLTGLKPGTEYEFRVRAVNGAG 81 (83)
T ss_pred CCCcEEEEEEeCCEEEEEECCCCCCCCCccEEEEEEEEecCCCccEEEEecCCcc-EEEEeCcCCCCEEEEEEEEEcccC
Confidence 45567777777779999999765222267888876643321 122232333344 899999999999999999998543
No 20
>smart00060 FN3 Fibronectin type 3 domain. One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
Probab=98.23 E-value=1.5e-05 Score=52.68 Aligned_cols=77 Identities=25% Similarity=0.364 Sum_probs=50.5
Q ss_pred CCcceEEEeccCCeEEEEeecCCCCCcceEEEEEEeCCCCCCc--eeEEeeccCCCCCcEEEecCCCCCCeEEEEEEEEe
Q psy4956 185 TPGKFIVWFRNETTLLVLWQPSYPASIYTHYKVSIDPPDAPES--VLYVEKEGEPPGPAQAAFKGLVPGRAYNISVQTVS 262 (361)
Q Consensus 185 ~p~~l~~~~~~~~~~~v~W~~~~~~~~~~~y~v~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~L~p~t~Y~v~V~a~~ 262 (361)
+|.++.+......++.|+|+++..... .+|.+.|........ ...... ......+.+.+|.|++.|.|+|.|.+
T Consensus 3 ~p~~~~~~~~~~~~~~v~W~~~~~~~~-~~y~~~~~~~~~~~~~~~~~~~~---~~~~~~~~i~~L~~~~~Y~v~v~a~~ 78 (83)
T smart00060 3 PPSNLRVTDVTSTSVTLSWEPPPDDGI-TGYIVGYRVEYREEGSSWKEVNV---TPSSTSYTLTGLKPGTEYEFRVRAVN 78 (83)
T ss_pred CCCcEEEEEEeCCEEEEEECCCCCCCC-CccEEEEEEEEecCCCccEEEEe---cCCccEEEEeCcCCCCEEEEEEEEEc
Confidence 455677777777799999996643332 667777664332211 111111 11147899999999999999999998
Q ss_pred CCC
Q psy4956 263 EDE 265 (361)
Q Consensus 263 ~~~ 265 (361)
..|
T Consensus 79 ~~g 81 (83)
T smart00060 79 GAG 81 (83)
T ss_pred ccC
Confidence 644
No 21
>COG3401 Fibronectin type 3 domain-containing protein [General function prediction only]
Probab=98.17 E-value=0.0012 Score=54.36 Aligned_cols=317 Identities=14% Similarity=0.027 Sum_probs=160.8
Q ss_pred CCcceEEEEEEcCCCCCCCCeEEeecCccceEEecCCCCCceEEEEEEEecCCcCcccee-eEEEec-CCCCCCccEEEE
Q psy4956 23 SDNSTYRLDYIPAHGHPPPNTTYVSRDIKDNIEFSEGLPGTKYDFYLYYTNSTVHDWLTW-TASITT-PPDPPTNLSVNV 100 (361)
Q Consensus 23 ~~~~~Y~v~~~~~~~~~~~~~~~~~~~~~~~~~i~~L~p~t~Y~i~V~a~~~~~~~~~~~-~~~~~t-~~~~p~~l~~~~ 100 (361)
-.+.+|.+.....+-.......++.. -.+++....|.+++.|.+..-..+.. ..+... ..-..+ ...+........
T Consensus 3 ~~~~~~~~~~t~~~~~~~~i~~i~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~e~~~~~~~~i~~~~~~~~~~ 80 (343)
T COG3401 3 YDIDGFVLYRTKKDLKLKRIGTIKNK-YQTHYYDEGLEGEESYPYQEGTTKVD-KISYDSERILVKTSFIERVRSVFASL 80 (343)
T ss_pred ccceeEEEEEEcccchhhhhhccccc-hhhhhhhccccccCcceeeecccccc-eeeecCcceEEEeeeccccccccchh
Confidence 35678877766644332222333322 26889999999999999887665543 222111 111111 222222222222
Q ss_pred EeCCEEEEEeeCCCCCCcccEEEEEEECCCCCCCeEEeecCCCCceEEEcCCCCCcEEEEEEEEEeCCCC-cccceeeee
Q psy4956 101 RSGKTAQIFWSPPISGKYSGFKLKVISLSEKTPPRIIGFTENPPAGYSLKDLTPGGSYQVQLFSVYDSKE-SVAYTSRNF 179 (361)
Q Consensus 101 ~~~~~v~l~W~~p~~~~~~~y~v~~~~~~~~~~~~~~~~~~~~~~~~~l~~L~p~~~Y~v~v~a~~~~~~-s~~~~~~~~ 179 (361)
.-+..+.+-|.+..+-.+..|.|+....+.. ..+.-.+.......+.-.+|.++..|...|.+....+. +.+......
T Consensus 81 ~~~~~~~~~w~~~~d~~~~~Y~i~~~~gD~~-f~r~~~~~n~l~~~~i~s~~~~~~~~~~~Iia~~f~~~~sfsf~gVE~ 159 (343)
T COG3401 81 ERPKSVKVFWSPHPDVSVGKYIIQRQNGDGK-FLRTGLVKNRLFVEFIDSDLGHNEKYMELIIAADFQMGKSFSFTGVEA 159 (343)
T ss_pred cCcceeeecccccCCCCCCeEEEEEecCchh-hhhhhHHHhccchhheecccccccceeeeEEeecccccceeeeeeeec
Confidence 3456799999997777788999986654322 11111111111114445689999999999999886643 323233333
Q ss_pred ccCCCCCc--ceEEEeccCCeEEEEeecCCCCCcceEEEEEEeCCCCCCceeEEeeccCCCCCcEEEecCCCCCCeEEEE
Q psy4956 180 TTKPNTPG--KFIVWFRNETTLLVLWQPSYPASIYTHYKVSIDPPDAPESVLYVEKEGEPPGPAQAAFKGLVPGRAYNIS 257 (361)
Q Consensus 180 ~t~p~~p~--~l~~~~~~~~~~~v~W~~~~~~~~~~~y~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~p~t~Y~v~ 257 (361)
+....|++ ++.+.....+.+.|+|+.+.. .+.|+|.-...+........... .+.......+| |..|...
T Consensus 160 ~~~~~P~ei~~~~~~~d~~~~i~ls~dg~~~---~~yy~IY~~~~g~e~~~~ia~t~---~n~y~d~~egl--ga~~~y~ 231 (343)
T COG3401 160 TPKAEPKEITNVRVSFDLGNNIELSEDGSEA---EDYYRIYASDSGNEEYGFIAQTT---ENSYYDVKEGL--GAVEYYK 231 (343)
T ss_pred ccccCCceeeeeeeecCCCCcceeeccCccc---cceEEEeccCCccccccceeecc---ccchhhhhhcc--CceeEEE
Confidence 33333343 344445566789999986532 23677744333322222222211 11122333445 5566677
Q ss_pred EEEEeCCCCCCc-eE--EEEec--CCCCCccceEEeeecCCceEEEEeeCCCCCCCceeEEEEEEECCCCCCCCceeecC
Q psy4956 258 VQTVSEDEISTP-TT--AQYRT--IPLRPLSFTYDKASITSNSLRVVWEPPKGFSEFDKYQVSINVRRPGASSTPITKSR 332 (361)
Q Consensus 258 V~a~~~~~~s~~-~~--~~~~t--~~~~p~~l~~~~~~~~~~si~l~W~~~~~~~~~~~y~i~~~~~~~~~~~~~~~~~~ 332 (361)
|.++...+.... .. +...+ .+..|.... .....+.+.|.|+..... ....|.+.-...+.. .... .
T Consensus 232 VTtVd~~~~es~lp~~~t~~~~~g~~e~pt~~g---~t~~~s~i~V~~E~~~~~-~av~ytvyr~~~g~~---~~~f--~ 302 (343)
T COG3401 232 VTTVDNTGFESDLPNEPTVGETGGRYEVPTIPG---ETTEASFIGVAAEQNQLR-QAVTYTVYRVEDGTP---TKFF--T 302 (343)
T ss_pred EEEEcCCcceeccCCccccccccCCcccccCCC---ceeeeeeeeEEecccccc-eeeEEeeeeccCCCc---ceee--e
Confidence 777775443211 11 11122 122221111 122334456888775321 233344433333211 1111 1
Q ss_pred CCCceeEeccCCCCCceEEEEEEEEeCC
Q psy4956 333 DEPTQCDMSEGLEPGRTYQVLVKTVSGK 360 (361)
Q Consensus 333 ~~~~~~~~~~~L~p~t~Y~v~V~a~~~~ 360 (361)
-..+...... +.++..|.+.|.+++..
T Consensus 303 it~~d~~d~~-~ltg~~~~y~v~~v~~~ 329 (343)
T COG3401 303 ITETDGQDND-MLTGVEYRYEVVAVDKA 329 (343)
T ss_pred eeeccccchh-hccccceeEEEEEeccc
Confidence 1234566665 89999999999988643
No 22
>KOG3632|consensus
Probab=98.11 E-value=1.5e-05 Score=73.69 Aligned_cols=201 Identities=18% Similarity=0.277 Sum_probs=121.5
Q ss_pred EEEEeCCEEEEEeeCCC----CCCcccEEEEEEECCCCCCCeEEeecCCCCceEEEcCCC-CCcEEEEEEEEEeCCCCcc
Q psy4956 98 VNVRSGKTAQIFWSPPI----SGKYSGFKLKVISLSEKTPPRIIGFTENPPAGYSLKDLT-PGGSYQVQLFSVYDSKESV 172 (361)
Q Consensus 98 ~~~~~~~~v~l~W~~p~----~~~~~~y~v~~~~~~~~~~~~~~~~~~~~~~~~~l~~L~-p~~~Y~v~v~a~~~~~~s~ 172 (361)
+......++.|.|++|. .+.+..|.+..- ....+.+.+. +.+ ...+..|. .+..|.+.|.+....|.+.
T Consensus 586 ~vkqla~sv~vawepps~pP~~~~~~~~~~~v~----~elrq~l~~g-s~t-ka~~E~ld~~a~s~~isvq~ltSrGsqd 659 (1335)
T KOG3632|consen 586 AVKQLAGSVHVAWEPPSSPPFTAPARMIGATVL----MELRQTLYAG-SLT-KAMQESLDNSAHSGYISVQRLTSRGSQD 659 (1335)
T ss_pred hhhhhccceeeeccCCCCCCccccceeeeeecc----hhhhhhcccc-cch-HHHHhhccccCCceeeehhhhhccCCCC
Confidence 33345678999999976 334556666531 1222222221 122 33333332 2345778888888877776
Q ss_pred cceeeeecc--CCCCCcceEEEeccCCeEEEEeecCCCCCcceEEEEEEeCCCCCCceeEEeeccCCCCCcEEEecCCCC
Q psy4956 173 AYTSRNFTT--KPNTPGKFIVWFRNETTLLVLWQPSYPASIYTHYKVSIDPPDAPESVLYVEKEGEPPGPAQAAFKGLVP 250 (361)
Q Consensus 173 ~~~~~~~~t--~p~~p~~l~~~~~~~~~~~v~W~~~~~~~~~~~y~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~p 250 (361)
......... .+..|.++.+...+.++..++|-+..+ . .-++.|......... ......++|.+|.|
T Consensus 660 ~lrc~Llvgg~a~vvpsqlrv~n~tqtSa~itwvp~ns--n--~~Hviyln~eE~~ps--------~a~~y~ytf~~lrp 727 (1335)
T KOG3632|consen 660 QLRCILLVGGAAPVVPSQLRVWNATQTSAMITWVPFNS--N--FLHVIYLNAEEPRPS--------VAEMYNYTFMRLRP 727 (1335)
T ss_pred cceeeEeccccccccchhhhhhhhhchhhheeeeecCC--C--cceeeecCCccCCCc--------hhhhhHHHHhccCC
Confidence 543333332 267899999999999999999987532 1 123444433332221 11236788999999
Q ss_pred CCeEEEEEEEEeCCCCC----------CceEEEEecCC----CCCccceEEeeecCCceEEEEeeCCC----C---CCCc
Q psy4956 251 GRAYNISVQTVSEDEIS----------TPTTAQYRTIP----LRPLSFTYDKASITSNSLRVVWEPPK----G---FSEF 309 (361)
Q Consensus 251 ~t~Y~v~V~a~~~~~~s----------~~~~~~~~t~~----~~p~~l~~~~~~~~~~si~l~W~~~~----~---~~~~ 309 (361)
++.|.++|-+.-..... ....+.+.|.| .||.+++++ ...+...+.++|.++. | ...+
T Consensus 728 gt~y~a~vea~~p~q~pwdl~~v~~etr~atv~f~tLpAGppappldV~vE-~g~spg~l~vswrPptldsag~sngv~v 806 (1335)
T KOG3632|consen 728 GTDYWASVEAALPRQEPWDLRMVPMETRQATVLFRTLPAGPPAPPLDVKVE-TGGSPGRLEVSWRPPTLDSAGCSNGVAV 806 (1335)
T ss_pred CCccceecccccCcCCCcccccchhhhhccceeeecccCCCCCCchheeee-cCCCCceeeeeccCceeccccccCceee
Confidence 99999999887642111 23334555543 456777774 3556788999999962 2 1467
Q ss_pred eeEEEEEE
Q psy4956 310 DKYQVSIN 317 (361)
Q Consensus 310 ~~y~i~~~ 317 (361)
++|-|...
T Consensus 807 tgYavyad 814 (1335)
T KOG3632|consen 807 TGYAVYAD 814 (1335)
T ss_pred eeeeeeeC
Confidence 88988654
No 23
>PF01108 Tissue_fac: Tissue factor; PDB: 3OG4_B 3OG6_B 1FYH_E 1FG9_D 1JRH_I 3DGC_R 3DLQ_R 1LQS_R 1Y6M_R 1J7V_R ....
Probab=97.77 E-value=0.0002 Score=50.34 Aligned_cols=81 Identities=17% Similarity=0.233 Sum_probs=55.8
Q ss_pred CCCCCccceEEeeecCCceEEEEeeCCCCCCCceeEEEEEEECCCCCCCCceeecCCCCceeEeccCCC-CCceEEEEEE
Q psy4956 277 IPLRPLSFTYDKASITSNSLRVVWEPPKGFSEFDKYQVSINVRRPGASSTPITKSRDEPTQCDMSEGLE-PGRTYQVLVK 355 (361)
Q Consensus 277 ~~~~p~~l~~~~~~~~~~si~l~W~~~~~~~~~~~y~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~-p~t~Y~v~V~ 355 (361)
...+|.++++ ...+-...|.|+++.+...-..|.|+|...+.+.+........-..+++.+++.+. +...|.+||+
T Consensus 21 ~lp~P~nv~~---~s~nf~~iL~W~~~~~~~~~~~ytVq~~~~~~~~W~~v~~C~~i~~~~Cdlt~~~~~~~~~Y~~rV~ 97 (107)
T PF01108_consen 21 SLPAPQNVTV---DSVNFKHILRWDPGPGSPPNVTYTVQYKKYGSSSWKDVPGCQNITETSCDLTDETSDPSESYYARVR 97 (107)
T ss_dssp SGSSCEEEEE---EEETTEEEEEEEESTTSSSTEEEEEEEEESSTSCEEEECCEEEESSSEEECTTCCTTTTSEEEEEEE
T ss_pred cCCCCCeeEE---EEECCceEEEeCCCCCCCCCeEEEEEEEecCCcceeeccceecccccceeCcchhhcCcCCEEEEEE
Confidence 4457899998 44456689999996654466889999994444333222112334457899986343 8899999999
Q ss_pred EEeCC
Q psy4956 356 TVSGK 360 (361)
Q Consensus 356 a~~~~ 360 (361)
|..|+
T Consensus 98 A~~~~ 102 (107)
T PF01108_consen 98 AEVGN 102 (107)
T ss_dssp EEETT
T ss_pred EEeCC
Confidence 99886
No 24
>PF09294 Interfer-bind: Interferon-alpha/beta receptor, fibronectin type III; InterPro: IPR015373 Members of this family adopt a secondary structure consisting of seven beta-strands arranged in an immunoglobulin-like beta-sandwich, in a Greek-key topology. They are required for binding to interferon-alpha []. ; PDB: 1A21_A 3LQM_B 3ELA_T 1AHW_C 2A2Q_T 1TFH_B 1FAK_T 1WSS_T 1W2K_T 2FIR_T ....
Probab=97.41 E-value=0.001 Score=46.72 Aligned_cols=71 Identities=21% Similarity=0.420 Sum_probs=47.7
Q ss_pred CCCCccEEEEEeCCEEEEEeeCCC-----CCC---------cccEEEEEEECCCCCCCeEEeecCCCCceEEEcCCCCCc
Q psy4956 91 DPPTNLSVNVRSGKTAQIFWSPPI-----SGK---------YSGFKLKVISLSEKTPPRIIGFTENPPAGYSLKDLTPGG 156 (361)
Q Consensus 91 ~~p~~l~~~~~~~~~v~l~W~~p~-----~~~---------~~~y~v~~~~~~~~~~~~~~~~~~~~~~~~~l~~L~p~~ 156 (361)
.||. +.+ ...++.+.|.+.+|. .+. ...|.|.|+..+.. .+......... .+.|.+|.|++
T Consensus 4 gPP~-v~v-~~~~~~l~V~i~~P~~~~~~~~~~~~l~~~~~~~~Y~v~~~~~~~~--~~~~~~~~~~~-~~~l~~L~p~t 78 (106)
T PF09294_consen 4 GPPS-VNV-SSCGGSLHVTIKPPMTPLRAGGKNSSLRDIYPSLSYNVSYWKNGSN--EKKKEIETKNS-SVTLSDLKPGT 78 (106)
T ss_dssp -SSE-EEE-EEETTEEEEEEEESEEEEECSSSEEEHHHHHGG-EEEEEEEETTTS--CEEEEEESSSE-EEEEES--TTS
T ss_pred cCCE-EEE-EECCCEEEEEEECCCcccccCCCCCcHHHhCCCeEEEEEEEeCCCc--cceEEEeecCC-EEEEeCCCCCC
Confidence 4554 776 557889999998875 111 13599998886543 34444455555 77999999999
Q ss_pred EEEEEEEEEe
Q psy4956 157 SYQVQLFSVY 166 (361)
Q Consensus 157 ~Y~v~v~a~~ 166 (361)
.|.|+|.+..
T Consensus 79 ~YCv~V~~~~ 88 (106)
T PF09294_consen 79 NYCVSVQAFS 88 (106)
T ss_dssp EEEEEEEEEE
T ss_pred CEEEEEEEEe
Confidence 9999999943
No 25
>COG3401 Fibronectin type 3 domain-containing protein [General function prediction only]
Probab=97.38 E-value=0.032 Score=46.41 Aligned_cols=241 Identities=14% Similarity=0.087 Sum_probs=125.1
Q ss_pred eEEEEecCCCCCCCcceEEEEEEcCCCCCCCCeEEeecCccceEEecCCCCCceEEEEEEEecCCcCccceeeEEEecC-
Q psy4956 11 ANLVIKIPENLSSDNSTYRLDYIPAHGHPPPNTTYVSRDIKDNIEFSEGLPGTKYDFYLYYTNSTVHDWLTWTASITTP- 89 (361)
Q Consensus 11 ~~l~~~~p~~~~~~~~~Y~v~~~~~~~~~~~~~~~~~~~~~~~~~i~~L~p~t~Y~i~V~a~~~~~~~~~~~~~~~~t~- 89 (361)
..+-+-|--+.+-++..|.|+.-..++. +....+..+.-.-.+.-.+|.++..|...|.+....+.-+.+....-.++
T Consensus 84 ~~~~~~w~~~~d~~~~~Y~i~~~~gD~~-f~r~~~~~n~l~~~~i~s~~~~~~~~~~~Iia~~f~~~~sfsf~gVE~~~~ 162 (343)
T COG3401 84 KSVKVFWSPHPDVSVGKYIIQRQNGDGK-FLRTGLVKNRLFVEFIDSDLGHNEKYMELIIAADFQMGKSFSFTGVEATPK 162 (343)
T ss_pred ceeeecccccCCCCCCeEEEEEecCchh-hhhhhHHHhccchhheecccccccceeeeEEeecccccceeeeeeeecccc
Confidence 3355556666677899999988776665 32333333333456677799999999999998776544444443332333
Q ss_pred CCCC--CccEEEEEeCCEEEEEeeCCCCCCcccEEEEEEECCCCCCCeEEeecCCCCceEEE--cCCCCCcEEEEEEEEE
Q psy4956 90 PDPP--TNLSVNVRSGKTAQIFWSPPISGKYSGFKLKVISLSEKTPPRIIGFTENPPAGYSL--KDLTPGGSYQVQLFSV 165 (361)
Q Consensus 90 ~~~p--~~l~~~~~~~~~v~l~W~~p~~~~~~~y~v~~~~~~~~~~~~~~~~~~~~~~~~~l--~~L~p~~~Y~v~v~a~ 165 (361)
..|| .++++.....+.+.|+|+.+.. ...|+|.- ...+ ............+ .+.. .+| |..|...|.+.
T Consensus 163 ~~P~ei~~~~~~~d~~~~i~ls~dg~~~--~~yy~IY~-~~~g-~e~~~~ia~t~~n-~y~d~~egl--ga~~~y~VTtV 235 (343)
T COG3401 163 AEPKEITNVRVSFDLGNNIELSEDGSEA--EDYYRIYA-SDSG-NEEYGFIAQTTEN-SYYDVKEGL--GAVEYYKVTTV 235 (343)
T ss_pred cCCceeeeeeeecCCCCcceeeccCccc--cceEEEec-cCCc-cccccceeecccc-chhhhhhcc--CceeEEEEEEE
Confidence 3333 2333334457889999988542 34788853 3222 1222222223333 3333 344 55666666666
Q ss_pred e-CCCCcccceeeee-cc--CCCCCcceEEEeccCCeEEEEeecCCCCCcceEEEEEEeCCCCCCceeEEeeccCCCCCc
Q psy4956 166 Y-DSKESVAYTSRNF-TT--KPNTPGKFIVWFRNETTLLVLWQPSYPASIYTHYKVSIDPPDAPESVLYVEKEGEPPGPA 241 (361)
Q Consensus 166 ~-~~~~s~~~~~~~~-~t--~p~~p~~l~~~~~~~~~~~v~W~~~~~~~~~~~y~v~~~~~~~~~~~~~~~~~~~~~~~~ 241 (361)
. .+..+.-...... .+ .+..|.-. -.....+.+.|.|+...... ...|.+ ++-.++......... ..
T Consensus 236 d~~~~es~lp~~~t~~~~~g~~e~pt~~-g~t~~~s~i~V~~E~~~~~~-av~ytv-yr~~~g~~~~~f~it------~~ 306 (343)
T COG3401 236 DNTGFESDLPNEPTVGETGGRYEVPTIP-GETTEASFIGVAAEQNQLRQ-AVTYTV-YRVEDGTPTKFFTIT------ET 306 (343)
T ss_pred cCCcceeccCCccccccccCCcccccCC-CceeeeeeeeEEecccccce-eeEEee-eeccCCCcceeeeee------ec
Confidence 5 3333321111111 11 12122111 11122234568887642222 223333 121222222111111 25
Q ss_pred EEEecCCCCCCeEEEEEEEEeCCCCCC
Q psy4956 242 QAAFKGLVPGRAYNISVQTVSEDEIST 268 (361)
Q Consensus 242 ~~~~~~L~p~t~Y~v~V~a~~~~~~s~ 268 (361)
......+.++..|.+.|.+++..+.+.
T Consensus 307 d~~d~~~ltg~~~~y~v~~v~~~gls~ 333 (343)
T COG3401 307 DGQDNDMLTGVEYRYEVVAVDKAGLSS 333 (343)
T ss_pred cccchhhccccceeEEEEEeccccccc
Confidence 566778899999999999999877553
No 26
>KOG4806|consensus
Probab=97.20 E-value=0.089 Score=43.90 Aligned_cols=115 Identities=17% Similarity=0.234 Sum_probs=66.1
Q ss_pred cCCCCCcEEEEEEEEEeCCCCcccceeeeec------cCCCCCcceEEEe--ccCCeEEEEeecCCCCCcceEEEEEEeC
Q psy4956 150 KDLTPGGSYQVQLFSVYDSKESVAYTSRNFT------TKPNTPGKFIVWF--RNETTLLVLWQPSYPASIYTHYKVSIDP 221 (361)
Q Consensus 150 ~~L~p~~~Y~v~v~a~~~~~~s~~~~~~~~~------t~p~~p~~l~~~~--~~~~~~~v~W~~~~~~~~~~~y~v~~~~ 221 (361)
-.+.|+..|..++...++...... .....+ .-|+-|.+-.+.. ...++++|.|.... +. -..|.|....
T Consensus 289 fvv~~~~~~~~rf~~~ndde~~~~-v~V~aSt~~t~~~~P~LP~dTtVk~v~r~CSsAtIaW~gs~-d~-~~kyCIy~~~ 365 (454)
T KOG4806|consen 289 FVVLPGERYLMRFEPSNDDEALQK-VMVAASTEATFRDLPELPQDTTVKNVRRRCSSATIAWNGSP-DE-ELKYCIYVFN 365 (454)
T ss_pred EEecCCCceEEEEEecCCchhhhe-eEeeeecccccCcCCCCCCCceEeeeccccchheeeeccCc-ch-heeEEEEEec
Confidence 357788889988888875533221 212222 2266677766654 46678999997542 22 3456665543
Q ss_pred CCCCCc-----------------e--eEE---e---eccCCCCCcEEEecCCCCCCeEEEEEEEEeCCCCC
Q psy4956 222 PDAPES-----------------V--LYV---E---KEGEPPGPAQAAFKGLVPGRAYNISVQTVSEDEIS 267 (361)
Q Consensus 222 ~~~~~~-----------------~--~~~---~---~~~~~~~~~~~~~~~L~p~t~Y~v~V~a~~~~~~s 267 (361)
....+. . ... . .+..+.+...-+|.||.||..|.+.|.|.-..|..
T Consensus 366 ~~~~e~~v~~~~N~C~g~~~~~~s~~v~c~y~hs~~~q~~~~~i~teTI~gL~PgssYlldv~a~~~~g~~ 436 (454)
T KOG4806|consen 366 LPQRERSVVDFTNYCMGFVPKRVSQYVYCEYMHSRERQQSPDNIETETILGLMPGSSYLLDVTANLSMGKP 436 (454)
T ss_pred ccchhhhhhhhhccccCccccceeEEEeEEEecChhhhcchhhhhhhhhcccccCceEEEEEEEcccCCcc
Confidence 221110 0 000 0 01112233456789999999999999997665543
No 27
>KOG3632|consensus
Probab=97.13 E-value=0.0024 Score=59.88 Aligned_cols=241 Identities=16% Similarity=0.117 Sum_probs=132.2
Q ss_pred eCCCceEEEEecCCCCCC--CcceEEEEEEcCCCCCCCCeEEeecCccceEEecCCCC-CceEEEEEEEecCCcCcccee
Q psy4956 6 HGVHGANLVIKIPENLSS--DNSTYRLDYIPAHGHPPPNTTYVSRDIKDNIEFSEGLP-GTKYDFYLYYTNSTVHDWLTW 82 (361)
Q Consensus 6 ~~~~~~~l~~~~p~~~~~--~~~~Y~v~~~~~~~~~~~~~~~~~~~~~~~~~i~~L~p-~t~Y~i~V~a~~~~~~~~~~~ 82 (361)
.-.+...+.|+.|..... ++.+|-+.....-- ...-.++ .+...+..|.- +..|.|.|.+....|......
T Consensus 589 qla~sv~vawepps~pP~~~~~~~~~~~v~~elr----q~l~~gs--~tka~~E~ld~~a~s~~isvq~ltSrGsqd~lr 662 (1335)
T KOG3632|consen 589 QLAGSVHVAWEPPSSPPFTAPARMIGATVLMELR----QTLYAGS--LTKAMQESLDNSAHSGYISVQRLTSRGSQDQLR 662 (1335)
T ss_pred hhccceeeeccCCCCCCccccceeeeeecchhhh----hhccccc--chHHHHhhccccCCceeeehhhhhccCCCCcce
Confidence 345667888998886654 66666665432110 1111111 33344444432 234788887765553332222
Q ss_pred eE--EEecCCCCCCccEEEEEeCCEEEEEeeCCCCCCcccEEEEEEECCCCCCCeEEeecCCCCceEEEcCCCCCcEEEE
Q psy4956 83 TA--SITTPPDPPTNLSVNVRSGKTAQIFWSPPISGKYSGFKLKVISLSEKTPPRIIGFTENPPAGYSLKDLTPGGSYQV 160 (361)
Q Consensus 83 ~~--~~~t~~~~p~~l~~~~~~~~~v~l~W~~p~~~~~~~y~v~~~~~~~~~~~~~~~~~~~~~~~~~l~~L~p~~~Y~v 160 (361)
.+ .....+..|.++++...+.++..++|-+- +..+ -++.|... ..... ...... .++|-+|.|++.|.+
T Consensus 663 c~Llvgg~a~vvpsqlrv~n~tqtSa~itwvp~-nsn~--~Hviyln~----eE~~p-s~a~~y-~ytf~~lrpgt~y~a 733 (1335)
T KOG3632|consen 663 CILLVGGAAPVVPSQLRVWNATQTSAMITWVPF-NSNF--LHVIYLNA----EEPRP-SVAEMY-NYTFMRLRPGTDYWA 733 (1335)
T ss_pred eeEeccccccccchhhhhhhhhchhhheeeeec-CCCc--ceeeecCC----ccCCC-chhhhh-HHHHhccCCCCccce
Confidence 22 22234778899999999999999999873 3222 22333221 11111 112223 788999999999999
Q ss_pred EEEEEeCCCCc---------ccceeeeeccC----CCCCcceEEEe-ccCCeEEEEeecCCCC-------CcceEEEEEE
Q psy4956 161 QLFSVYDSKES---------VAYTSRNFTTK----PNTPGKFIVWF-RNETTLLVLWQPSYPA-------SIYTHYKVSI 219 (361)
Q Consensus 161 ~v~a~~~~~~s---------~~~~~~~~~t~----p~~p~~l~~~~-~~~~~~~v~W~~~~~~-------~~~~~y~v~~ 219 (361)
+|-+......+ .......|+|. |.+|.++.+.. .....+.++|.|+.-+ ..+++|.|..
T Consensus 734 ~vea~~p~q~pwdl~~v~~etr~atv~f~tLpAGppappldV~vE~g~spg~l~vswrPptldsag~sngv~vtgYavya 813 (1335)
T KOG3632|consen 734 SVEAALPRQEPWDLRMVPMETRQATVLFRTLPAGPPAPPLDVKVETGGSPGRLEVSWRPPTLDSAGCSNGVAVTGYAVYA 813 (1335)
T ss_pred ecccccCcCCCcccccchhhhhccceeeecccCCCCCCchheeeecCCCCceeeeeccCceeccccccCceeeeeeeeee
Confidence 99887642111 11233445554 56777888764 3556799999876432 3467888865
Q ss_pred eCCCCCCceeEEeeccCCCCCcEEEecCCCCC-CeEEEEEEEEeCCCCC
Q psy4956 220 DPPDAPESVLYVEKEGEPPGPAQAAFKGLVPG-RAYNISVQTVSEDEIS 267 (361)
Q Consensus 220 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~p~-t~Y~v~V~a~~~~~~s 267 (361)
.. ......... ..+.+.+.+..|+-+ ..-.|.|+.....|++
T Consensus 814 dg--qkv~Evafp----tagst~VelsQlq~~l~~~~V~vRtms~~ges 856 (1335)
T KOG3632|consen 814 DG--QKVEEVAFP----TAGSTKVELSQLQDGLYHGAVGVRTMSVPGES 856 (1335)
T ss_pred CC--ceeeeeecc----cCCceEEEeeeehhhheecceeEEecccCccc
Confidence 41 111222121 112345555555553 2335666666555544
No 28
>PF01108 Tissue_fac: Tissue factor; PDB: 3OG4_B 3OG6_B 1FYH_E 1FG9_D 1JRH_I 3DGC_R 3DLQ_R 1LQS_R 1Y6M_R 1J7V_R ....
Probab=97.04 E-value=0.0059 Score=42.85 Aligned_cols=83 Identities=17% Similarity=0.239 Sum_probs=56.0
Q ss_pred ecCCCCCCccEEEEEeCCEEEEEeeCCC-CCCcccEEEEEEECCCCCCCeEEee-cCCCCceEEEcCCC--CCcEEEEEE
Q psy4956 87 TTPPDPPTNLSVNVRSGKTAQIFWSPPI-SGKYSGFKLKVISLSEKTPPRIIGF-TENPPAGYSLKDLT--PGGSYQVQL 162 (361)
Q Consensus 87 ~t~~~~p~~l~~~~~~~~~v~l~W~~p~-~~~~~~y~v~~~~~~~~~~~~~~~~-~~~~~~~~~l~~L~--p~~~Y~v~v 162 (361)
.....+|.++++... +-...|+|+++. ...-..|.|+|+......|.....+ ..... .+.|+... +...|.++|
T Consensus 19 ~~~lp~P~nv~~~s~-nf~~iL~W~~~~~~~~~~~ytVq~~~~~~~~W~~v~~C~~i~~~-~Cdlt~~~~~~~~~Y~~rV 96 (107)
T PF01108_consen 19 SASLPAPQNVTVDSV-NFKHILRWDPGPGSPPNVTYTVQYKKYGSSSWKDVPGCQNITET-SCDLTDETSDPSESYYARV 96 (107)
T ss_dssp -SSGSSCEEEEEEEE-TTEEEEEEEESTTSSSTEEEEEEEEESSTSCEEEECCEEEESSS-EEECTTCCTTTTSEEEEEE
T ss_pred cccCCCCCeeEEEEE-CCceEEEeCCCCCCCCCeEEEEEEEecCCcceeeccceeccccc-ceeCcchhhcCcCCEEEEE
Confidence 345667899998875 556899999944 3455689999994433333332111 22234 88887644 788999999
Q ss_pred EEEeCCCCc
Q psy4956 163 FSVYDSKES 171 (361)
Q Consensus 163 ~a~~~~~~s 171 (361)
+|..++..|
T Consensus 97 ~A~~~~~~S 105 (107)
T PF01108_consen 97 RAEVGNQTS 105 (107)
T ss_dssp EEEETTEEE
T ss_pred EEEeCCccC
Confidence 999877655
No 29
>KOG4806|consensus
Probab=96.87 E-value=0.14 Score=42.77 Aligned_cols=108 Identities=20% Similarity=0.261 Sum_probs=64.2
Q ss_pred CCCCCCeEEEEEEEEeCCCCCCceEEEEec------CCCCCccceEEeeecCCceEEEEeeCCCCCCCceeEEEEEEECC
Q psy4956 247 GLVPGRAYNISVQTVSEDEISTPTTAQYRT------IPLRPLSFTYDKASITSNSLRVVWEPPKGFSEFDKYQVSINVRR 320 (361)
Q Consensus 247 ~L~p~t~Y~v~V~a~~~~~~s~~~~~~~~t------~~~~p~~l~~~~~~~~~~si~l~W~~~~~~~~~~~y~i~~~~~~ 320 (361)
.+.++..|.+++...|.+..-.-..+-+.| -|.-|.+-.+....-.-++.++.|....+ .-..|.|.....+
T Consensus 290 vv~~~~~~~~rf~~~ndde~~~~v~V~aSt~~t~~~~P~LP~dTtVk~v~r~CSsAtIaW~gs~d--~~~kyCIy~~~~~ 367 (454)
T KOG4806|consen 290 VVLPGERYLMRFEPSNDDEALQKVMVAASTEATFRDLPELPQDTTVKNVRRRCSSATIAWNGSPD--EELKYCIYVFNLP 367 (454)
T ss_pred EecCCCceEEEEEecCCchhhheeEeeeecccccCcCCCCCCCceEeeeccccchheeeeccCcc--hheeEEEEEeccc
Confidence 466788888888888755433222233333 34558887774345566899999987543 3456777655543
Q ss_pred CCCCC-------------C---ce-----------eecCCCCceeEeccCCCCCceEEEEEEEE
Q psy4956 321 PGASS-------------T---PI-----------TKSRDEPTQCDMSEGLEPGRTYQVLVKTV 357 (361)
Q Consensus 321 ~~~~~-------------~---~~-----------~~~~~~~~~~~~~~~L~p~t~Y~v~V~a~ 357 (361)
..+.. . .. .....+...-++.+ |.||..|.+.|.|.
T Consensus 368 ~~e~~v~~~~N~C~g~~~~~~s~~v~c~y~hs~~~q~~~~~i~teTI~g-L~PgssYlldv~a~ 430 (454)
T KOG4806|consen 368 QRERSVVDFTNYCMGFVPKRVSQYVYCEYMHSRERQQSPDNIETETILG-LMPGSSYLLDVTAN 430 (454)
T ss_pred chhhhhhhhhccccCccccceeEEEeEEEecChhhhcchhhhhhhhhcc-cccCceEEEEEEEc
Confidence 21000 0 00 00112334456764 99999999999985
No 30
>PF09294 Interfer-bind: Interferon-alpha/beta receptor, fibronectin type III; InterPro: IPR015373 Members of this family adopt a secondary structure consisting of seven beta-strands arranged in an immunoglobulin-like beta-sandwich, in a Greek-key topology. They are required for binding to interferon-alpha []. ; PDB: 1A21_A 3LQM_B 3ELA_T 1AHW_C 2A2Q_T 1TFH_B 1FAK_T 1WSS_T 1W2K_T 2FIR_T ....
Probab=96.84 E-value=0.0054 Score=43.00 Aligned_cols=72 Identities=25% Similarity=0.273 Sum_probs=46.2
Q ss_pred CCCcceEEEeccCCeEEEEeecCC-----CC--------CcceEEEEEEeCCCCCCceeEEeeccCCCCCcEEEecCCCC
Q psy4956 184 NTPGKFIVWFRNETTLLVLWQPSY-----PA--------SIYTHYKVSIDPPDAPESVLYVEKEGEPPGPAQAAFKGLVP 250 (361)
Q Consensus 184 ~~p~~l~~~~~~~~~~~v~W~~~~-----~~--------~~~~~y~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~p 250 (361)
.||. +.+ .....++.|.+.+|. .. -..-.|.|.++..+........... ...+.|.+|.|
T Consensus 4 gPP~-v~v-~~~~~~l~V~i~~P~~~~~~~~~~~~l~~~~~~~~Y~v~~~~~~~~~~~~~~~~~-----~~~~~l~~L~p 76 (106)
T PF09294_consen 4 GPPS-VNV-SSCGGSLHVTIKPPMTPLRAGGKNSSLRDIYPSLSYNVSYWKNGSNEKKKEIETK-----NSSVTLSDLKP 76 (106)
T ss_dssp -SSE-EEE-EEETTEEEEEEEESEEEEECSSSEEEHHHHHGG-EEEEEEEETTTSCEEEEEESS-----SEEEEEES--T
T ss_pred cCCE-EEE-EECCCEEEEEEECCCcccccCCCCCcHHHhCCCeEEEEEEEeCCCccceEEEeec-----CCEEEEeCCCC
Confidence 3455 777 567789999998764 01 0124699998877665322222222 36679999999
Q ss_pred CCeEEEEEEEEe
Q psy4956 251 GRAYNISVQTVS 262 (361)
Q Consensus 251 ~t~Y~v~V~a~~ 262 (361)
++.|.|+|++..
T Consensus 77 ~t~YCv~V~~~~ 88 (106)
T PF09294_consen 77 GTNYCVSVQAFS 88 (106)
T ss_dssp TSEEEEEEEEEE
T ss_pred CCCEEEEEEEEe
Confidence 999999999944
No 31
>KOG1225|consensus
Probab=96.57 E-value=0.0053 Score=54.97 Aligned_cols=156 Identities=22% Similarity=0.192 Sum_probs=102.4
Q ss_pred CcceEEEeccCCeEEEEeecCCCCCcceEEEEEEeCCCCCCceeEEeeccCCCCCcEEEecCCCCCCeEEEEEEEEeCCC
Q psy4956 186 PGKFIVWFRNETTLLVLWQPSYPASIYTHYKVSIDPPDAPESVLYVEKEGEPPGPAQAAFKGLVPGRAYNISVQTVSEDE 265 (361)
Q Consensus 186 p~~l~~~~~~~~~~~v~W~~~~~~~~~~~y~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~p~t~Y~v~V~a~~~~~ 265 (361)
|..+.........+.+.|.. ....+.|...+.......... ..+.+.....+.+..|.|++.|.++|+++....
T Consensus 369 ~~~~~~~~cs~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~---~~r~~~~~~~~~~~~~~~g~~~~~~~~~v~~~~ 442 (525)
T KOG1225|consen 369 PSLLLITECSPPSLCIAGVG---RRRVTHCAGTYCPLGESGGDL---QGRVPGDANSVDIQGLEPGDEYNCSVNTVAANI 442 (525)
T ss_pred chhhcccccCCCceeecccc---ccccccccccccccccCCCcc---ceeeccceeeeeeeeecCCcceeeehhhhhhhh
Confidence 44444555666678888872 223344555444422111111 112234457888999999999999999998766
Q ss_pred CCCceEEEEecCCCCCccceEEeeecCCceEEEEeeCCCCCCCceeEEEEEEECCCCCCCCceeecCCCCceeEeccCCC
Q psy4956 266 ISTPTTAQYRTIPLRPLSFTYDKASITSNSLRVVWEPPKGFSEFDKYQVSINVRRPGASSTPITKSRDEPTQCDMSEGLE 345 (361)
Q Consensus 266 ~s~~~~~~~~t~~~~p~~l~~~~~~~~~~si~l~W~~~~~~~~~~~y~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~ 345 (361)
.+.+......+...-+..+.+ .....+++.+.|..|.. ....+.+.+..... +.... .......+.+..++ |.
T Consensus 443 ~~~~~~~~~~~~~~~~g~~~v--~~~~~~s~e~~g~~~s~--~~~~~~~~~~~~~~-~~~~~-~~~~~~~~~~~~~~-l~ 515 (525)
T KOG1225|consen 443 GSLPKDKSETTVLCWNGGLCV--DGETESSLEVGGPCPSS--GTCGWEVRCGPCGN-DGGVN-AEPPPECTSYDRTG-LG 515 (525)
T ss_pred ccCCcccccceEeecCCceee--eeeeeccccccCCCCCc--cccceEEEeeecCc-ccccc-cCCCCCCCCCCccC-cc
Confidence 666665555555567788888 88999999999999876 67889999833221 11121 22233556788886 99
Q ss_pred CCceEEEEE
Q psy4956 346 PGRTYQVLV 354 (361)
Q Consensus 346 p~t~Y~v~V 354 (361)
|++.|.+.+
T Consensus 516 ~c~~~~~~~ 524 (525)
T KOG1225|consen 516 PCTEYEVSV 524 (525)
T ss_pred cccceeccc
Confidence 999999875
No 32
>KOG4367|consensus
Probab=96.41 E-value=0.0039 Score=52.97 Aligned_cols=72 Identities=14% Similarity=0.226 Sum_probs=56.3
Q ss_pred cCCeEEEEee-cCCCCCcceEEEEEEeCCCCCCceeEEeeccCCCCCcEEEecCCCCCCeEEEEEEEEeCCCCCCceE
Q psy4956 195 NETTLLVLWQ-PSYPASIYTHYKVSIDPPDAPESVLYVEKEGEPPGPAQAAFKGLVPGRAYNISVQTVSEDEISTPTT 271 (361)
Q Consensus 195 ~~~~~~v~W~-~~~~~~~~~~y~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~p~t~Y~v~V~a~~~~~~s~~~~ 271 (361)
..+++++.|+ |+......++|.++....++.....+.-. ..+.+++.||.-++.|..+|.++|..|.++.+.
T Consensus 451 ~nns~t~~wkqp~~~~~~~dg~~leld~g~~g~frevy~g-----~etmctvdglhfns~y~arvka~n~tg~s~ys~ 523 (699)
T KOG4367|consen 451 HNNSATLSWKQPPLSTVPADGYILELDDGNGGQFREVYVG-----KETMCTVDGLHFNSTYNARVKAFNKTGVSPYSK 523 (699)
T ss_pred cCCceEEEeecCCCCCCCCcceEEEeecCCCCceeEEEec-----CceeEEecceecchhHHHHHHHhhccCCCcccc
Confidence 4568999997 55444778999999998877664443322 238899999999999999999999999886554
No 33
>KOG4367|consensus
Probab=96.33 E-value=0.0048 Score=52.48 Aligned_cols=71 Identities=17% Similarity=0.079 Sum_probs=56.6
Q ss_pred CCCceEEEEecCCCCCCCcceEEEEEEcCCCCCCCCeEEeecCccceEEecCCCCCceEEEEEEEecCCcCccc
Q psy4956 7 GVHGANLVIKIPENLSSDNSTYRLDYIPAHGHPPPNTTYVSRDIKDNIEFSEGLPGTKYDFYLYYTNSTVHDWL 80 (361)
Q Consensus 7 ~~~~~~l~~~~p~~~~~~~~~Y~v~~~~~~~~~~~~~~~~~~~~~~~~~i~~L~p~t~Y~i~V~a~~~~~~~~~ 80 (361)
.+..+.|+|.-|.+...+++||.++....+|..+.++ ..+. ++-+++.+|.-++.|.-+|.++|..|-+..
T Consensus 451 ~nns~t~~wkqp~~~~~~~dg~~leld~g~~g~frev-y~g~--etmctvdglhfns~y~arvka~n~tg~s~y 521 (699)
T KOG4367|consen 451 HNNSATLSWKQPPLSTVPADGYILELDDGNGGQFREV-YVGK--ETMCTVDGLHFNSTYNARVKAFNKTGVSPY 521 (699)
T ss_pred cCCceEEEeecCCCCCCCCcceEEEeecCCCCceeEE-EecC--ceeEEecceecchhHHHHHHHhhccCCCcc
Confidence 4567889999999999999999999987666554222 2223 699999999999999999999998744433
No 34
>KOG1225|consensus
Probab=96.10 E-value=0.011 Score=52.95 Aligned_cols=110 Identities=15% Similarity=0.109 Sum_probs=81.9
Q ss_pred CccceEEecCCCCCceEEEEEEEecCCcCccceeeEEEecCCCCCCccEEEEEeCCEEEEEeeCCCCCCcccEEEEEEEC
Q psy4956 49 DIKDNIEFSEGLPGTKYDFYLYYTNSTVHDWLTWTASITTPPDPPTNLSVNVRSGKTAQIFWSPPISGKYSGFKLKVISL 128 (361)
Q Consensus 49 ~~~~~~~i~~L~p~t~Y~i~V~a~~~~~~~~~~~~~~~~t~~~~p~~l~~~~~~~~~v~l~W~~p~~~~~~~y~v~~~~~ 128 (361)
.+...+.+..|+|++.|.+++++.... ..+........+....+..+.+.....+++.+.|..|. .....+.+.|..
T Consensus 415 ~~~~~~~~~~~~~g~~~~~~~~~v~~~-~~~~~~~~~~~~~~~~~g~~~v~~~~~~s~e~~g~~~s-~~~~~~~~~~~~- 491 (525)
T KOG1225|consen 415 GDANSVDIQGLEPGDEYNCSVNTVAAN-IGSLPKDKSETTVLCWNGGLCVDGETESSLEVGGPCPS-SGTCGWEVRCGP- 491 (525)
T ss_pred cceeeeeeeeecCCcceeeehhhhhhh-hccCCcccccceEeecCCceeeeeeeeccccccCCCCC-ccccceEEEeee-
Confidence 348999999999999999999997766 44445556666677778889999999999999999975 456788888833
Q ss_pred CCCCCCeEEeecCCCCceEEEcCCCCCcEEEEEE
Q psy4956 129 SEKTPPRIIGFTENPPAGYSLKDLTPGGSYQVQL 162 (361)
Q Consensus 129 ~~~~~~~~~~~~~~~~~~~~l~~L~p~~~Y~v~v 162 (361)
......-...+..... .+..++|.|++.|.+.+
T Consensus 492 ~~~~~~~~~~~~~~~~-~~~~~~l~~c~~~~~~~ 524 (525)
T KOG1225|consen 492 CGNDGGVNAEPPPECT-SYDRTGLGPCTEYEVSV 524 (525)
T ss_pred cCcccccccCCCCCCC-CCCccCcccccceeccc
Confidence 2222222233333444 88899999999998764
No 35
>PF09240 IL6Ra-bind: Interleukin-6 receptor alpha chain, binding; InterPro: IPR015321 Members of this entry adopt a structure consisting of an immunoglobulin-like beta-sandwich, with seven strands in two beta-sheets, in a Greek-key topology. They are required for binding to the cytokine Interleukin-6 []. ; PDB: 1N26_A 1P9M_C 3LB6_C 1PVH_A 3L5H_A 1BQU_A 1I1R_A 3QT2_B 3BPN_C 3BPO_C ....
Probab=95.93 E-value=0.26 Score=33.88 Aligned_cols=87 Identities=15% Similarity=0.105 Sum_probs=55.7
Q ss_pred CCcceEEEeccCCeEEEEeecCCCCCcceEEEEEEeCCCCCCceeEEeeccCCCCCcEEEecCCCC----CCeEEEEEEE
Q psy4956 185 TPGKFIVWFRNETTLLVLWQPSYPASIYTHYKVSIDPPDAPESVLYVEKEGEPPGPAQAAFKGLVP----GRAYNISVQT 260 (361)
Q Consensus 185 ~p~~l~~~~~~~~~~~v~W~~~~~~~~~~~y~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~p----~t~Y~v~V~a 260 (361)
+|.+|.....+...|..+|.+......-..|.+.|+.................+....+.|..... ...|.|.|.+
T Consensus 1 ~~~nlsC~~~~~~~m~CtW~~g~~~~~~t~y~L~~~~~~~~~~~eC~~y~~~~~~~~gC~~~~~~~~~~~~~~~~v~V~~ 80 (99)
T PF09240_consen 1 KPQNLSCFIYNLEYMNCTWEPGKEAPPDTQYTLYYWYSPLEEEKECPHYSKDSGTRIGCQFPVSEIDSSEFSQYNVCVNG 80 (99)
T ss_dssp S-EEEEEEEETTTEEEEEEECCTTCSTTEEEEEEEEETTSSSEEEESEEEESTSSEEEEEEESCTT-TTTTSEEEEEEEE
T ss_pred CCeeCEEEEECCEEEEEEECCCCCCCCcccEEEEEEcCCCCccccCCCccccCCceeEEEecCCCccccccceEEEEEEe
Confidence 367888888888999999988644445579999998765433322222222122245566665554 3589999999
Q ss_pred EeCCCCCCceE
Q psy4956 261 VSEDEISTPTT 271 (361)
Q Consensus 261 ~~~~~~s~~~~ 271 (361)
.+..+.-.+..
T Consensus 81 ss~~~~i~~~~ 91 (99)
T PF09240_consen 81 SSSAGSIRSSD 91 (99)
T ss_dssp EETTEEEECEE
T ss_pred ccCCCccCCcc
Confidence 88766444443
No 36
>PF09067 EpoR_lig-bind: Erythropoietin receptor, ligand binding; InterPro: IPR015152 Members of this entry include the growth hormone and erythropoietin receptors. The latter interacts with erythropoietin (EPO), with subsequent initiation of the downstream chain of events associated with binding of EPO to the receptor, including EPO-induced erythroblast proliferation and differentiation through induction of the JAK2/STAT5 signalling cascade. The domain adopts a secondary structure composed of a short amino-terminal helix, followed by two beta-sandwich regions []. ; PDB: 3NCB_B 3NCF_B 3NCE_B 3N0P_B 3NCC_B 1BP3_B 3N06_B 3MZG_B 3D48_R 1F6F_C ....
Probab=95.76 E-value=0.11 Score=35.96 Aligned_cols=81 Identities=19% Similarity=0.292 Sum_probs=54.3
Q ss_pred CCCCCcceEEEeccCCeEEEEeecCCCCCcceEEEEEEeCCCCCCceeEEeeccCCCCCcE-----EEec--CCCCCCeE
Q psy4956 182 KPNTPGKFIVWFRNETTLLVLWQPSYPASIYTHYKVSIDPPDAPESVLYVEKEGEPPGPAQ-----AAFK--GLVPGRAY 254 (361)
Q Consensus 182 ~p~~p~~l~~~~~~~~~~~v~W~~~~~~~~~~~y~v~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~--~L~p~t~Y 254 (361)
.|..|.++.....+...++.-|++....+.-..|.+.|...+ .......... ....++ +.|. +..-.+.|
T Consensus 7 ~p~~P~~~~C~S~~~etftC~W~~g~~~~l~~~y~L~Y~~~~-~~~~eCp~~~--~~~~ns~~~~~C~F~~~~t~lf~~y 83 (104)
T PF09067_consen 7 PPEKPENLKCFSREMETFTCFWEPGSEGNLPTNYTLFYKKEG-EEWKECPDYS--TSGPNSTVRHICYFPKSDTSLFVPY 83 (104)
T ss_dssp HCCCCEEEEEEBSSSS-EEEEEEEESSSTSTCEEEEEEEETT-SEEEEESESS--TTETTEEEEEEEEE-CCGCSSSSEE
T ss_pred CCCCCccCccCCCCCCcEEEEeeCCCCCCCCCcEEEEEEeCC-CCCccCCCeE--ecCCCCceeEEEEcCCCCeEEEEEE
Confidence 367899999999999999999998744333345999998765 2222222111 111233 5554 78889999
Q ss_pred EEEEEEEeCCC
Q psy4956 255 NISVQTVSEDE 265 (361)
Q Consensus 255 ~v~V~a~~~~~ 265 (361)
.|+|.|.+..|
T Consensus 84 ~i~V~a~~~~~ 94 (104)
T PF09067_consen 84 CIQVEATNALG 94 (104)
T ss_dssp EEEEEEEETTE
T ss_pred EEEEEeccCCC
Confidence 99999988765
No 37
>KOG4152|consensus
Probab=95.40 E-value=0.48 Score=42.07 Aligned_cols=68 Identities=16% Similarity=0.307 Sum_probs=46.7
Q ss_pred cCCCCCcEEEEEEEEEeCCCCcccceeeeeccC----CCCCcceEEEeccCCeEEEEeecCCCC--CcceEEEEE
Q psy4956 150 KDLTPGGSYQVQLFSVYDSKESVAYTSRNFTTK----PNTPGKFIVWFRNETTLLVLWQPSYPA--SIYTHYKVS 218 (361)
Q Consensus 150 ~~L~p~~~Y~v~v~a~~~~~~s~~~~~~~~~t~----p~~p~~l~~~~~~~~~~~v~W~~~~~~--~~~~~y~v~ 218 (361)
..|-+|+.|.|+|.+.++.|.+..+....+.|- |.+|..+.+. .+-..+.+.|+++..+ +.|..|...
T Consensus 652 ~~lv~Gq~yrfrV~aIng~G~gp~s~i~~~kTc~pG~P~apS~~ri~-k~~eGi~l~weppt~p~sg~Iieys~y 725 (830)
T KOG4152|consen 652 TSLVTGQAYRFRVTAINGKGPGPASTILKLKTCAPGKPTAPSGARIK-KTIEGISLVWEPPTKPGSGTIIEYSPY 725 (830)
T ss_pred cccccccceeeeeeeeeccCCCchhhheeeeeccCCCCCCccccccc-ccccceeecccCCCCCCCcceEEeehh
Confidence 468899999999999999887765544444433 5566666554 3445799999987544 455555543
No 38
>COG4733 Phage-related protein, tail component [Function unknown]
Probab=94.99 E-value=0.19 Score=47.19 Aligned_cols=115 Identities=18% Similarity=0.232 Sum_probs=67.2
Q ss_pred CcEEEecCCCCCCeEEEEEEEEeCCCCC-Cce-EEEEe-cCCC--CCccceEEeeecCCceEEEEeeCCCCCCCceeEEE
Q psy4956 240 PAQAAFKGLVPGRAYNISVQTVSEDEIS-TPT-TAQYR-TIPL--RPLSFTYDKASITSNSLRVVWEPPKGFSEFDKYQV 314 (361)
Q Consensus 240 ~~~~~~~~L~p~t~Y~v~V~a~~~~~~s-~~~-~~~~~-t~~~--~p~~l~~~~~~~~~~si~l~W~~~~~~~~~~~y~i 314 (361)
...+.+.+|.+| .|.++|+|+|.-+.. .+. ...+. ..+. +|..+......+ .-.+++.|-.|.+.-++.+-++
T Consensus 659 ~~~~~~~gi~~G-qY~i~VrAiN~~g~~~~~a~s~~f~i~g~~~Ppp~~~t~~a~~i-t~~~~l~v~dPt~~~d~~~sei 736 (952)
T COG4733 659 AAGFDVEGIPAG-QYAIRVRAINVFEPNSPDATAYEFALNGKKVPPPKAMIYDAVII-TLVIRLVVGDPTGAVDITSTEI 736 (952)
T ss_pred ccceeecCcCcc-ceEEEEEEeeccCCCCCCcceeEEEecCCCCCCCcccccceEEE-EeeeeEEEecCCcceEEeeeee
Confidence 367899999996 999999999976643 333 22221 2222 233333310222 2357889988866445666667
Q ss_pred EEEECCCCCCCCceeecCCCCceeEeccCCCCCceEEEEEEEEe
Q psy4956 315 SINVRRPGASSTPITKSRDEPTQCDMSEGLEPGRTYQVLVKTVS 358 (361)
Q Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~p~t~Y~v~V~a~~ 358 (361)
++....+++......-. .......-. +|+||..|.|++++++
T Consensus 737 ~~s~~~d~~~~Ar~LG~-~~~~~~~~~-~i~~g~~~~F~~R~Vn 778 (952)
T COG4733 737 RSAVIADGNFQARSLGN-LNYPGLFSV-GIQAGLTFWFRNRNVD 778 (952)
T ss_pred eeeccccchhHHhhhhc-ccccccccc-CcCCCceEEEEeeecc
Confidence 77766654442211111 111111113 4999999999999986
No 39
>KOG4152|consensus
Probab=93.25 E-value=0.22 Score=44.12 Aligned_cols=67 Identities=24% Similarity=0.378 Sum_probs=44.3
Q ss_pred cCCCCCCeEEEEEEEEeCCCCCCceEE--EEecCCC---CCccceEEeeecCCceEEEEeeCCCC--CCCceeEEEE
Q psy4956 246 KGLVPGRAYNISVQTVSEDEISTPTTA--QYRTIPL---RPLSFTYDKASITSNSLRVVWEPPKG--FSEFDKYQVS 315 (361)
Q Consensus 246 ~~L~p~t~Y~v~V~a~~~~~~s~~~~~--~~~t~~~---~p~~l~~~~~~~~~~si~l~W~~~~~--~~~~~~y~i~ 315 (361)
..|.+|+.|.|+|.|+++.|.++...+ ...+.|. +|..+.. ..+-..+.+.|++|.. .+.|..|...
T Consensus 652 ~~lv~Gq~yrfrV~aIng~G~gp~s~i~~~kTc~pG~P~apS~~ri---~k~~eGi~l~weppt~p~sg~Iieys~y 725 (830)
T KOG4152|consen 652 TSLVTGQAYRFRVTAINGKGPGPASTILKLKTCAPGKPTAPSGARI---KKTIEGISLVWEPPTKPGSGTIIEYSPY 725 (830)
T ss_pred cccccccceeeeeeeeeccCCCchhhheeeeeccCCCCCCcccccc---cccccceeecccCCCCCCCcceEEeehh
Confidence 468899999999999999887765543 2233444 4554444 3344579999999743 2355555543
No 40
>KOG1948|consensus
Probab=92.78 E-value=9.7 Score=36.71 Aligned_cols=148 Identities=14% Similarity=0.095 Sum_probs=68.1
Q ss_pred ceEEEEecCCCCCCCcceEEEEEEcCCCCCCCCeEEeecCccceEEecCCCCCceEEEEEEEecCC-cCccceeeEEEec
Q psy4956 10 GANLVIKIPENLSSDNSTYRLDYIPAHGHPPPNTTYVSRDIKDNIEFSEGLPGTKYDFYLYYTNST-VHDWLTWTASITT 88 (361)
Q Consensus 10 ~~~l~~~~p~~~~~~~~~Y~v~~~~~~~~~~~~~~~~~~~~~~~~~i~~L~p~t~Y~i~V~a~~~~-~~~~~~~~~~~~t 88 (361)
-..+++..-.+...++.|--|...... ..+...+... .-...+..|.||.-| + ++.... ..+.....+...
T Consensus 820 l~~vsv~vkdea~q~LpgvLLSLsGg~--~yRsNlvtgd--ng~~nf~sLsPgqyy-l--RpmlKEykFePst~mIevk- 891 (1165)
T KOG1948|consen 820 LSQVSVKVKDEATQPLPGVLLSLSGGK--DYRSNLVTGD--NGHKNFVSLSPGQYY-L--RPMLKEYKFEPSTSMIEVK- 891 (1165)
T ss_pred eeEEEEEEeccCCCcCCcEEEEEecCc--chhhccccCC--CceeEEeecCcchhh-h--hhHHHhcCcCCCceeEEec-
Confidence 345666666666677777777775532 2234444444 567788889998743 2 221111 011111111111
Q ss_pred CCCCCCccEEEEE-eCCEEEEEeeCCCCCCcccEEEEEEECCCCCCCeEEeecCCCCceEEEcCCCCCcEEEEEEEEEeC
Q psy4956 89 PPDPPTNLSVNVR-SGKTAQIFWSPPISGKYSGFKLKVISLSEKTPPRIIGFTENPPAGYSLKDLTPGGSYQVQLFSVYD 167 (361)
Q Consensus 89 ~~~~p~~l~~~~~-~~~~v~l~W~~p~~~~~~~y~v~~~~~~~~~~~~~~~~~~~~~~~~~l~~L~p~~~Y~v~v~a~~~ 167 (361)
...-.++.+... ..-++.=+=......+..+-.|+-...+......+..... .- .|.|.||.|++.|.+++.+..+
T Consensus 892 -eGq~~~vvl~gkRvAySayGtvssLsGdp~~gVaieA~sdn~~~y~eeattde-nG-~yRiRGL~Pdc~Y~V~vk~~~~ 968 (1165)
T KOG1948|consen 892 -EGQHENVVLKGKRVAYSAYGTVSSLSGDPMKGVAIEALSDNCDLYQEEATTDE-NG-TYRIRGLLPDCEYQVHVKSYAD 968 (1165)
T ss_pred -cCceEEEEEEEEEEEEEeeeehhhccCCcccCeEEEEecCCCCcccccccccc-CC-cEEEeccCCCceEEEEEeeccC
Confidence 111111111100 0000000000000122344455544332222222222222 23 6999999999999999999854
Q ss_pred C
Q psy4956 168 S 168 (361)
Q Consensus 168 ~ 168 (361)
+
T Consensus 969 n 969 (1165)
T KOG1948|consen 969 N 969 (1165)
T ss_pred C
Confidence 3
No 41
>PLN02533 probable purple acid phosphatase
Probab=92.35 E-value=1.7 Score=39.09 Aligned_cols=73 Identities=10% Similarity=0.067 Sum_probs=42.9
Q ss_pred CCCCcceEEEeccCCeEEEEeecCCCCCcceEEEEEEeCCCCCCceeE------Eee--ccCCCCCcEEEecCCCCCCeE
Q psy4956 183 PNTPGKFIVWFRNETTLLVLWQPSYPASIYTHYKVSIDPPDAPESVLY------VEK--EGEPPGPAQAAFKGLVPGRAY 254 (361)
Q Consensus 183 p~~p~~l~~~~~~~~~~~v~W~~~~~~~~~~~y~v~~~~~~~~~~~~~------~~~--~~~~~~~~~~~~~~L~p~t~Y 254 (361)
+..|+.+.+.-.+.+++.|+|.-...... .|.|....+...... ... ...++....+.|++|+|++.|
T Consensus 41 ~~~P~qvhls~~~~~~m~V~W~T~~~~~~----~V~yG~~~~~l~~~a~g~~~~~~~~~~~~~g~iH~v~l~~L~p~T~Y 116 (427)
T PLN02533 41 PTHPDQVHISLVGPDKMRISWITQDSIPP----SVVYGTVSGKYEGSANGTSSSYHYLLIYRSGQINDVVIGPLKPNTVY 116 (427)
T ss_pred CCCCceEEEEEcCCCeEEEEEECCCCCCC----EEEEecCCCCCcceEEEEEEEEeccccccCCeEEEEEeCCCCCCCEE
Confidence 55688888877778899999965422222 244543222111100 100 001222356789999999999
Q ss_pred EEEEE
Q psy4956 255 NISVQ 259 (361)
Q Consensus 255 ~v~V~ 259 (361)
..+|.
T Consensus 117 ~Yrvg 121 (427)
T PLN02533 117 YYKCG 121 (427)
T ss_pred EEEEC
Confidence 99984
No 42
>COG4733 Phage-related protein, tail component [Function unknown]
Probab=91.66 E-value=3 Score=39.76 Aligned_cols=123 Identities=7% Similarity=-0.001 Sum_probs=66.6
Q ss_pred ccceEEecCCCCCceEEEEEEEecCCcCcccee-eEEEe-cCCCCC--CccEEEEEe-CCEEEEEeeCCCCCCcc--cEE
Q psy4956 50 IKDNIEFSEGLPGTKYDFYLYYTNSTVHDWLTW-TASIT-TPPDPP--TNLSVNVRS-GKTAQIFWSPPISGKYS--GFK 122 (361)
Q Consensus 50 ~~~~~~i~~L~p~t~Y~i~V~a~~~~~~~~~~~-~~~~~-t~~~~p--~~l~~~~~~-~~~v~l~W~~p~~~~~~--~y~ 122 (361)
.+..+.+.+|.+ .+|.++|+|.|..+..+.+. ...+. ..+..| ..+...... .-.+.+.|-.|. +.++ .-.
T Consensus 658 ~~~~~~~~gi~~-GqY~i~VrAiN~~g~~~~~a~s~~f~i~g~~~Ppp~~~t~~a~~it~~~~l~v~dPt-~~~d~~~se 735 (952)
T COG4733 658 SAAGFDVEGIPA-GQYAIRVRAINVFEPNSPDATAYEFALNGKKVPPPKAMIYDAVIITLVIRLVVGDPT-GAVDITSTE 735 (952)
T ss_pred cccceeecCcCc-cceEEEEEEeeccCCCCCCcceeEEEecCCCCCCCcccccceEEEEeeeeEEEecCC-cceEEeeee
Confidence 378999999999 56999999998765555544 23332 222222 122211111 235678887764 3222 222
Q ss_pred EEEEECCCCCCCeEEeecCCCCceEEEcCCCCCcEEEEEEEEEeCCCCcccce
Q psy4956 123 LKVISLSEKTPPRIIGFTENPPAGYSLKDLTPGGSYQVQLFSVYDSKESVAYT 175 (361)
Q Consensus 123 v~~~~~~~~~~~~~~~~~~~~~~~~~l~~L~p~~~Y~v~v~a~~~~~~s~~~~ 175 (361)
+++... .....+...+..-......-.+|.|+..|.|++++++..|.+.++.
T Consensus 736 i~~s~~-~d~~~~Ar~LG~~~~~~~~~~~i~~g~~~~F~~R~Vn~vG~~~~~e 787 (952)
T COG4733 736 IRSAVI-ADGNFQARSLGNLNYPGLFSVGIQAGLTFWFRNRNVDLVGNNDKWE 787 (952)
T ss_pred eeeecc-ccchhHHhhhhccccccccccCcCCCceEEEEeeecccccccccce
Confidence 222221 1111111111111111222268999999999999999887776543
No 43
>PF07495 Y_Y_Y: Y_Y_Y domain; InterPro: IPR011123 This region is mostly found at the end of the beta propellers (IPR011110 from INTERPRO) in a family of two component regulators. However they are also found tandemly repeated in Q891H4 from SWISSPROT without other signal conduction domains being present. It is named after the conserved tyrosines found in the alignment. The exact function is not known.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=90.64 E-value=1 Score=28.08 Aligned_cols=25 Identities=16% Similarity=0.280 Sum_probs=18.8
Q ss_pred eEEecCCCCCceEEEEEEEecCCcCc
Q psy4956 53 NIEFSEGLPGTKYDFYLYYTNSTVHD 78 (361)
Q Consensus 53 ~~~i~~L~p~t~Y~i~V~a~~~~~~~ 78 (361)
++.+.+|.||+ |.|.|+|.+..+..
T Consensus 30 ~~~~~~L~~G~-Y~l~V~a~~~~~~~ 54 (66)
T PF07495_consen 30 SISYTNLPPGK-YTLEVRAKDNNGKW 54 (66)
T ss_dssp EEEEES--SEE-EEEEEEEEETTS-B
T ss_pred EEEEEeCCCEE-EEEEEEEECCCCCc
Confidence 99999999998 99999998765343
No 44
>PF07495 Y_Y_Y: Y_Y_Y domain; InterPro: IPR011123 This region is mostly found at the end of the beta propellers (IPR011110 from INTERPRO) in a family of two component regulators. However they are also found tandemly repeated in Q891H4 from SWISSPROT without other signal conduction domains being present. It is named after the conserved tyrosines found in the alignment. The exact function is not known.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=90.34 E-value=1.4 Score=27.39 Aligned_cols=22 Identities=23% Similarity=0.502 Sum_probs=17.4
Q ss_pred eeEeccCCCCCceEEEEEEEEeCC
Q psy4956 337 QCDMSEGLEPGRTYQVLVKTVSGK 360 (361)
Q Consensus 337 ~~~~~~~L~p~t~Y~v~V~a~~~~ 360 (361)
.+.+++ |.|| .|.|.|+|.+..
T Consensus 30 ~~~~~~-L~~G-~Y~l~V~a~~~~ 51 (66)
T PF07495_consen 30 SISYTN-LPPG-KYTLEVRAKDNN 51 (66)
T ss_dssp EEEEES---SE-EEEEEEEEEETT
T ss_pred EEEEEe-CCCE-EEEEEEEEECCC
Confidence 899986 9999 999999998753
No 45
>PF09067 EpoR_lig-bind: Erythropoietin receptor, ligand binding; InterPro: IPR015152 Members of this entry include the growth hormone and erythropoietin receptors. The latter interacts with erythropoietin (EPO), with subsequent initiation of the downstream chain of events associated with binding of EPO to the receptor, including EPO-induced erythroblast proliferation and differentiation through induction of the JAK2/STAT5 signalling cascade. The domain adopts a secondary structure composed of a short amino-terminal helix, followed by two beta-sandwich regions []. ; PDB: 3NCB_B 3NCF_B 3NCE_B 3N0P_B 3NCC_B 1BP3_B 3N06_B 3MZG_B 3D48_R 1F6F_C ....
Probab=89.57 E-value=3 Score=28.89 Aligned_cols=80 Identities=20% Similarity=0.299 Sum_probs=52.2
Q ss_pred CCCCCCccEEEEEeCCEEEEEeeCCCCCCc-ccEEEEEEECCCCCCCeEEeec-CCCCce-----EEEc--CCCCCcEEE
Q psy4956 89 PPDPPTNLSVNVRSGKTAQIFWSPPISGKY-SGFKLKVISLSEKTPPRIIGFT-ENPPAG-----YSLK--DLTPGGSYQ 159 (361)
Q Consensus 89 ~~~~p~~l~~~~~~~~~v~l~W~~p~~~~~-~~y~v~~~~~~~~~~~~~~~~~-~~~~~~-----~~l~--~L~p~~~Y~ 159 (361)
.|..|.++.-.......++.-|++...+.. ..|.+.|.... ..+..+.... ...+ + +.+. +..-.+.|.
T Consensus 7 ~p~~P~~~~C~S~~~etftC~W~~g~~~~l~~~y~L~Y~~~~-~~~~eCp~~~~~~~n-s~~~~~C~F~~~~t~lf~~y~ 84 (104)
T PF09067_consen 7 PPEKPENLKCFSREMETFTCFWEPGSEGNLPTNYTLFYKKEG-EEWKECPDYSTSGPN-STVRHICYFPKSDTSLFVPYC 84 (104)
T ss_dssp HCCCCEEEEEEBSSSS-EEEEEEEESSSTSTCEEEEEEEETT-SEEEEESESSTTETT-EEEEEEEEE-CCGCSSSSEEE
T ss_pred CCCCCccCccCCCCCCcEEEEeeCCCCCCCCCcEEEEEEeCC-CCCccCCCeEecCCC-CceeEEEEcCCCCeEEEEEEE
Confidence 356677777766677889999998764442 34999998764 2333332222 1223 5 6665 677789999
Q ss_pred EEEEEEeCCCC
Q psy4956 160 VQLFSVYDSKE 170 (361)
Q Consensus 160 v~v~a~~~~~~ 170 (361)
|+|.+.+..+.
T Consensus 85 i~V~a~~~~~~ 95 (104)
T PF09067_consen 85 IQVEATNALGS 95 (104)
T ss_dssp EEEEEEETTEE
T ss_pred EEEEeccCCCc
Confidence 99999987764
No 46
>PF10342 GPI-anchored: Ser-Thr-rich glycosyl-phosphatidyl-inositol-anchored membrane family; InterPro: IPR018466 This entry represents glycoproteins involved in cell wall (1-->6)-beta-glucan assembly. In yeast a null mutation leads to severe growth defects, aberrant multi-budded morphology, and mating defects [, ]. The entry includes DRMIP and Hesp-379, which are involved in both fruiting body formation and in host attack respectively. Hesp-379 is a haustorially expressed secreted protein; the haustorium being the small sucker that penetrates host tissue [].
Probab=87.26 E-value=3.5 Score=27.72 Aligned_cols=62 Identities=10% Similarity=0.197 Sum_probs=39.4
Q ss_pred ceEEEEeeCCCCCCCceeEEEEEEECCCCCC--CCceee-cC--CCCceeEeccCCCCCceEEEEEEEE
Q psy4956 294 NSLRVVWEPPKGFSEFDKYQVSINVRRPGAS--STPITK-SR--DEPTQCDMSEGLEPGRTYQVLVKTV 357 (361)
Q Consensus 294 ~si~l~W~~~~~~~~~~~y~i~~~~~~~~~~--~~~~~~-~~--~~~~~~~~~~~L~p~t~Y~v~V~a~ 357 (361)
..+.|+|+.... ....|.|.+........ ...+.. .. .....+.+..+|.++..|.|++...
T Consensus 13 ~~~~I~W~~~~~--~~~~~~I~L~~g~~~~~~~~~~ia~~v~~~~gs~~~~~p~~l~~~~~Y~i~~~~~ 79 (93)
T PF10342_consen 13 QPITITWTSDGT--DPGNVTIYLCNGNNTNLNFVQTIASNVSNSDGSYTWTIPSDLPSGGDYFIQIVNS 79 (93)
T ss_pred CcEEEEEeCCCC--CCcEEEEEEEcCCCCCcceeEEEEecccCCCCEEEEEcCCCCCCCCcEEEEEEEC
Confidence 679999998754 44778888777654211 111111 11 2345566644599999999999843
No 47
>PLN02533 probable purple acid phosphatase
Probab=83.99 E-value=5.8 Score=35.70 Aligned_cols=72 Identities=11% Similarity=0.165 Sum_probs=41.2
Q ss_pred CCCCCccceEEeeecCCceEEEEeeCCCCCCCceeEEEEEEECCCCCCCCce-----e-----ecCCCCceeEeccCCCC
Q psy4956 277 IPLRPLSFTYDKASITSNSLRVVWEPPKGFSEFDKYQVSINVRRPGASSTPI-----T-----KSRDEPTQCDMSEGLEP 346 (361)
Q Consensus 277 ~~~~p~~l~~~~~~~~~~si~l~W~~~~~~~~~~~y~i~~~~~~~~~~~~~~-----~-----~~~~~~~~~~~~~~L~p 346 (361)
.+..|..+.+ .-.+.++++|+|.-..... -.|+|-...+....... . ...+..-.+.|++ |+|
T Consensus 40 ~~~~P~qvhl--s~~~~~~m~V~W~T~~~~~----~~V~yG~~~~~l~~~a~g~~~~~~~~~~~~~g~iH~v~l~~-L~p 112 (427)
T PLN02533 40 DPTHPDQVHI--SLVGPDKMRISWITQDSIP----PSVVYGTVSGKYEGSANGTSSSYHYLLIYRSGQINDVVIGP-LKP 112 (427)
T ss_pred CCCCCceEEE--EEcCCCeEEEEEECCCCCC----CEEEEecCCCCCcceEEEEEEEEeccccccCCeEEEEEeCC-CCC
Confidence 3446788888 5556889999998765311 22444332211000000 0 0112223567885 999
Q ss_pred CceEEEEEE
Q psy4956 347 GRTYQVLVK 355 (361)
Q Consensus 347 ~t~Y~v~V~ 355 (361)
+|.|..+|-
T Consensus 113 ~T~Y~Yrvg 121 (427)
T PLN02533 113 NTVYYYKCG 121 (427)
T ss_pred CCEEEEEEC
Confidence 999999985
No 48
>KOG1948|consensus
Probab=83.30 E-value=47 Score=32.43 Aligned_cols=142 Identities=12% Similarity=0.032 Sum_probs=67.7
Q ss_pred CCCCcceEEEEEEc------CCCCCCCCeEEeecC-------------ccceEEecCCCCCceEEEEEEEecCC--cCcc
Q psy4956 21 LSSDNSTYRLDYIP------AHGHPPPNTTYVSRD-------------IKDNIEFSEGLPGTKYDFYLYYTNST--VHDW 79 (361)
Q Consensus 21 ~~~~~~~Y~v~~~~------~~~~~~~~~~~~~~~-------------~~~~~~i~~L~p~t~Y~i~V~a~~~~--~~~~ 79 (361)
...-+.|||+-|+. -+|++..-+.+.... ..-.|.|.+|.|+..|.++|.+..+. ...+
T Consensus 896 ~~vvl~gkRvAySayGtvssLsGdp~~gVaieA~sdn~~~y~eeattdenG~yRiRGL~Pdc~Y~V~vk~~~~n~~iers 975 (1165)
T KOG1948|consen 896 ENVVLKGKRVAYSAYGTVSSLSGDPMKGVAIEALSDNCDLYQEEATTDENGTYRIRGLLPDCEYQVHVKSYADNSPIERS 975 (1165)
T ss_pred EEEEEEEEEEEEEeeeehhhccCCcccCeEEEEecCCCCccccccccccCCcEEEeccCCCceEEEEEeeccCCCccccc
Confidence 34467788888754 234443333333321 13579999999999999999986432 1222
Q ss_pred ceeeEEEecCCCCCCccEEEEEe-CCE--EEEEeeCCCCCCcc-cEEEEEEECCCCCCCeEEeecCCCCceEEEc-CCCC
Q psy4956 80 LTWTASITTPPDPPTNLSVNVRS-GKT--AQIFWSPPISGKYS-GFKLKVISLSEKTPPRIIGFTENPPAGYSLK-DLTP 154 (361)
Q Consensus 80 ~~~~~~~~t~~~~p~~l~~~~~~-~~~--v~l~W~~p~~~~~~-~y~v~~~~~~~~~~~~~~~~~~~~~~~~~l~-~L~p 154 (361)
.+...+....-.-..++.+.... ... +...=..-.+..+. .-.+.|+..++......... ... .+-+. -+.+
T Consensus 976 ~P~s~tv~vgneDv~glnf~af~q~kttdit~~V~~~~ne~l~sl~vv~yKs~nddspv~sv~~--gql-~~ffp~l~~d 1052 (1165)
T KOG1948|consen 976 FPRSFTVSVGNEDVKGLNFMAFIQAKTTDITVEVGMDTNEELQSLRVVIYKSNNDDSPVASVVA--GQL-LHFFPNLPRD 1052 (1165)
T ss_pred CCceEEEEecccccCCceEEEEeccceEEEEEEEcccccccccceEEEEEecCCCCCcceEEec--cce-eeeccccCCC
Confidence 33333333322223333333222 112 22221111122222 23444555433222222222 222 33343 3568
Q ss_pred CcEEEEEEEEE
Q psy4956 155 GGSYQVQLFSV 165 (361)
Q Consensus 155 ~~~Y~v~v~a~ 165 (361)
|..|.+++.+.
T Consensus 1053 g~~yvV~l~St 1063 (1165)
T KOG1948|consen 1053 GVEYVVRLEST 1063 (1165)
T ss_pred CceEEEEEecc
Confidence 88998887664
No 49
>KOG0613|consensus
Probab=80.39 E-value=35 Score=34.72 Aligned_cols=244 Identities=12% Similarity=0.045 Sum_probs=113.1
Q ss_pred EEEecCCCCCC-CcceEEEEEEcCCCCCCCCeEEeecCccceEEecCCCCCceEEEEEEEecCCcCccceeeEEEecCCC
Q psy4956 13 LVIKIPENLSS-DNSTYRLDYIPAHGHPPPNTTYVSRDIKDNIEFSEGLPGTKYDFYLYYTNSTVHDWLTWTASITTPPD 91 (361)
Q Consensus 13 l~~~~p~~~~~-~~~~Y~v~~~~~~~~~~~~~~~~~~~~~~~~~i~~L~p~t~Y~i~V~a~~~~~~~~~~~~~~~~t~~~ 91 (361)
..|.+|....+ .+.+|+|+-+...+... -..+... ...-.+....+..+|.+.+.+.. ..+..........+.
T Consensus 158 ~~~~~~~~d~~~k~~~~iie~~~~g~s~~-~~~l~~~--~~~s~~~~a~~e~~v~~v~s~gq---p~~~~~~~~~g~~i~ 231 (1205)
T KOG0613|consen 158 PAWRPPREDGGRKREEYIIERRQAGRSPW-LKRLTQP--TDDSTDTDAHYESRVRAVVSAGQ---PEPLERWEILGELIA 231 (1205)
T ss_pred ccccCCccCccceeecceEEEEeccCCCc-ccccccc--cCCcceeccccceeEEEEEecCC---CccchhhhhcccccC
Confidence 34556654444 88899998877444322 2222222 22223333333344444443321 111111111112233
Q ss_pred CCCccEEEEEe---CCEEEEEeeCCCCCC----cccEEEEEEE-CCCCCCCeEEeecCCCCceEEEcCCCCCcEEEEEEE
Q psy4956 92 PPTNLSVNVRS---GKTAQIFWSPPISGK----YSGFKLKVIS-LSEKTPPRIIGFTENPPAGYSLKDLTPGGSYQVQLF 163 (361)
Q Consensus 92 ~p~~l~~~~~~---~~~v~l~W~~p~~~~----~~~y~v~~~~-~~~~~~~~~~~~~~~~~~~~~l~~L~p~~~Y~v~v~ 163 (361)
+...+.+.... .--+.+.|..+. |+ +.+|.++-.. .++..+.-.-..+.... +.+...-.++..+.+.+.
T Consensus 232 ~~~~~~~s~t~e~s~~~i~~~l~~~~-G~~~~~~~~~l~e~~~~~~~~t~~i~~~~~~~~~-s~t~~~~~~~~q~~~~v~ 309 (1205)
T KOG0613|consen 232 PGSAPSMSVTLEASDRKIQLTLRAPR-GPYSAHILGYLIEQRKHKGGNTKTIQNDGPEPEL-SWTVADKRQGCQDRFKVT 309 (1205)
T ss_pred CCCccccccccchhheeeeEEeecCC-CchHHHHhcchhhhcccCCCccccccccCCcccc-ceeeecccCCcccceeee
Confidence 33333333322 345677787653 22 3455554332 11111111112222222 455555567778888888
Q ss_pred EEeCCCCccccee---eeeccCC----CCCcceEEEeccCCeEEEEeecCCC--CCcceEEEEEEeCCCCCCceeEEeec
Q psy4956 164 SVYDSKESVAYTS---RNFTTKP----NTPGKFIVWFRNETTLLVLWQPSYP--ASIYTHYKVSIDPPDAPESVLYVEKE 234 (361)
Q Consensus 164 a~~~~~~s~~~~~---~~~~t~p----~~p~~l~~~~~~~~~~~v~W~~~~~--~~~~~~y~v~~~~~~~~~~~~~~~~~ 234 (361)
..-..+.+.+... ..+...| .-+.++.+.....+.+.+.|.-... +..+.+|+..-....+.........
T Consensus 310 ~~~~i~~~~~~p~~~~~aaa~~~~~~v~~~~~~~v~a~d~~~v~m~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~- 388 (1205)
T KOG0613|consen 310 EEAPIGKGKPGPESLQAAAARPPEVPVGLVRNLSVTARDNTLVEMPTALSGTQKPDEAQGYHGEEVSSESLGALPCPVG- 388 (1205)
T ss_pred EEeeccccccCchhhhhhccCCccccccccccceeccccCcceeecccccCCcCCchheeecccccccccccccccccc-
Confidence 7765555432111 1111222 2234556666666778888854311 1223444443222222211111111
Q ss_pred cCCCCCcEEEecCCCCCCeEEEEEEEEeCCCCC
Q psy4956 235 GEPPGPAQAAFKGLVPGRAYNISVQTVSEDEIS 267 (361)
Q Consensus 235 ~~~~~~~~~~~~~L~p~t~Y~v~V~a~~~~~~s 267 (361)
........+.++.++..|..++.|.|..|..
T Consensus 389 --~~ts~~~~~~~~~~~e~~~~~~~a~n~~g~~ 419 (1205)
T KOG0613|consen 389 --TVTSHTYTIKDAGPGEGYFTRVPAVNKGGTE 419 (1205)
T ss_pred --cccCcchhhhhcCcccccccccceeccCCcc
Confidence 1123456778899999999999999876654
No 50
>PF11344 DUF3146: Protein of unknown function (DUF3146); InterPro: IPR021492 This family of proteins with unknown function appear to be restricted to Cyanobacteria.
Probab=77.13 E-value=1.9 Score=27.23 Aligned_cols=14 Identities=36% Similarity=0.769 Sum_probs=12.9
Q ss_pred CCCCceEEEEEEEE
Q psy4956 344 LEPGRTYQVLVKTV 357 (361)
Q Consensus 344 L~p~t~Y~v~V~a~ 357 (361)
|+||.+|.|+|+|.
T Consensus 67 LEpGgdY~Ftirak 80 (80)
T PF11344_consen 67 LEPGGDYSFTIRAK 80 (80)
T ss_pred ccCCCceEEEEecC
Confidence 99999999999983
No 51
>PF09240 IL6Ra-bind: Interleukin-6 receptor alpha chain, binding; InterPro: IPR015321 Members of this entry adopt a structure consisting of an immunoglobulin-like beta-sandwich, with seven strands in two beta-sheets, in a Greek-key topology. They are required for binding to the cytokine Interleukin-6 []. ; PDB: 1N26_A 1P9M_C 3LB6_C 1PVH_A 3L5H_A 1BQU_A 1I1R_A 3QT2_B 3BPN_C 3BPO_C ....
Probab=75.56 E-value=23 Score=24.20 Aligned_cols=79 Identities=13% Similarity=0.113 Sum_probs=49.2
Q ss_pred CCccEEEEEeCCEEEEEeeCCCCC-CcccEEEEEEECCCCCCCeEEeecCCC--CceEEEcCCCC----CcEEEEEEEEE
Q psy4956 93 PTNLSVNVRSGKTAQIFWSPPISG-KYSGFKLKVISLSEKTPPRIIGFTENP--PAGYSLKDLTP----GGSYQVQLFSV 165 (361)
Q Consensus 93 p~~l~~~~~~~~~v~l~W~~p~~~-~~~~y~v~~~~~~~~~~~~~~~~~~~~--~~~~~l~~L~p----~~~Y~v~v~a~ 165 (361)
|.+|.-.-.+...+.-+|.+.... .-..|.+.|+........++....... ...+.+..... ...|.|.|...
T Consensus 2 ~~nlsC~~~~~~~m~CtW~~g~~~~~~t~y~L~~~~~~~~~~~eC~~y~~~~~~~~gC~~~~~~~~~~~~~~~~v~V~~s 81 (99)
T PF09240_consen 2 PQNLSCFIYNLEYMNCTWEPGKEAPPDTQYTLYYWYSPLEEEKECPHYSKDSGTRIGCQFPVSEIDSSEFSQYNVCVNGS 81 (99)
T ss_dssp -EEEEEEEETTTEEEEEEECCTTCSTTEEEEEEEEETTSSSEEEESEEEESTSSEEEEEEESCTT-TTTTSEEEEEEEEE
T ss_pred CeeCEEEEECCEEEEEEECCCCCCCCcccEEEEEEcCCCCccccCCCccccCCceeEEEecCCCccccccceEEEEEEec
Confidence 556666666788999999885433 346899999886533333333222221 23566655544 35799998888
Q ss_pred eCCCCc
Q psy4956 166 YDSKES 171 (361)
Q Consensus 166 ~~~~~s 171 (361)
+..+..
T Consensus 82 s~~~~i 87 (99)
T PF09240_consen 82 SSAGSI 87 (99)
T ss_dssp ETTEEE
T ss_pred cCCCcc
Confidence 766554
No 52
>KOG0613|consensus
Probab=75.49 E-value=56 Score=33.42 Aligned_cols=150 Identities=15% Similarity=0.104 Sum_probs=74.9
Q ss_pred CCCcceEEEEEEcCCCCCCCCeEEeecCccceEEecCCCCCceEEEEEEEecCCcCc-ccee--eEEEecCCCC----CC
Q psy4956 22 SSDNSTYRLDYIPAHGHPPPNTTYVSRDIKDNIEFSEGLPGTKYDFYLYYTNSTVHD-WLTW--TASITTPPDP----PT 94 (361)
Q Consensus 22 ~~~~~~Y~v~~~~~~~~~~~~~~~~~~~~~~~~~i~~L~p~t~Y~i~V~a~~~~~~~-~~~~--~~~~~t~~~~----p~ 94 (361)
...+.+|.++-+..-+....-..-.+......++...-.++.+|.+++......+.. ..+. ...+...+.. +.
T Consensus 261 ~~~~~~~l~e~~~~~~~~t~~i~~~~~~~~~s~t~~~~~~~~q~~~~v~~~~~i~~~~~~p~~~~~aaa~~~~~~v~~~~ 340 (1205)
T KOG0613|consen 261 SAHILGYLIEQRKHKGGNTKTIQNDGPEPELSWTVADKRQGCQDRFKVTEEAPIGKGKPGPESLQAAAARPPEVPVGLVR 340 (1205)
T ss_pred HHHHhcchhhhcccCCCccccccccCCccccceeeecccCCcccceeeeEEeeccccccCchhhhhhccCCccccccccc
Confidence 336666666655522211111111111224566666777788888888764322111 1111 2222222222 34
Q ss_pred ccEEEEEeCCEEEEEeeCCCCC---CcccEEEEEEECCCCCCCeEEeecCCCCceEEEcCCCCCcEEEEEEEEEeCCCCc
Q psy4956 95 NLSVNVRSGKTAQIFWSPPISG---KYSGFKLKVISLSEKTPPRIIGFTENPPAGYSLKDLTPGGSYQVQLFSVYDSKES 171 (361)
Q Consensus 95 ~l~~~~~~~~~v~l~W~~p~~~---~~~~y~v~~~~~~~~~~~~~~~~~~~~~~~~~l~~L~p~~~Y~v~v~a~~~~~~s 171 (361)
++.+.....+.+.+.|.-.... .+.+|...-......++..+..-..... .+.+.++.++..|..++-+.+..+..
T Consensus 341 ~~~v~a~d~~~v~m~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~ts~-~~~~~~~~~~e~~~~~~~a~n~~g~~ 419 (1205)
T KOG0613|consen 341 NLSVTARDNTLVEMPTALSGTQKPDEAQGYHGEEVSSESLGALPCPVGTVTSH-TYTIKDAGPGEGYFTRVPAVNKGGTE 419 (1205)
T ss_pred cceeccccCcceeecccccCCcCCchheeecccccccccccccccccccccCc-chhhhhcCcccccccccceeccCCcc
Confidence 5556666677888888664421 1333433322211112211111111122 56778899999999999999877765
Q ss_pred c
Q psy4956 172 V 172 (361)
Q Consensus 172 ~ 172 (361)
.
T Consensus 420 ~ 420 (1205)
T KOG0613|consen 420 Q 420 (1205)
T ss_pred c
Confidence 4
No 53
>KOG3515|consensus
Probab=74.19 E-value=75 Score=30.79 Aligned_cols=75 Identities=9% Similarity=0.050 Sum_probs=56.3
Q ss_pred cCCCCCcEEEEEEEEEeCCCCcccceeeeeccCCCCCcceEEEeccCCeEEEEeecCCCCCcceEEEEEEeCCCC
Q psy4956 150 KDLTPGGSYQVQLFSVYDSKESVAYTSRNFTTKPNTPGKFIVWFRNETTLLVLWQPSYPASIYTHYKVSIDPPDA 224 (361)
Q Consensus 150 ~~L~p~~~Y~v~v~a~~~~~~s~~~~~~~~~t~p~~p~~l~~~~~~~~~~~v~W~~~~~~~~~~~y~v~~~~~~~ 224 (361)
-+--++..|.+.+.+.+.-|...-......-+.+..|.||.+.......+.+.|.+....+....|.+.|...+.
T Consensus 659 ~~~~~~~~y~~~c~t~n~lg~~~v~~~~~~~t~~~~~~n~~~~~~~~~~i~l~~~p~fdgg~~q~f~~~~~~~~~ 733 (741)
T KOG3515|consen 659 LNHIDGSDYENGCTTQNLLGSDHVSGAIHSGTASVGPINLTYDNLTYSTISLEWMPGFDGGLQQRFFLKYYDLGT 733 (741)
T ss_pred cCCcchhhhcceeeecccCCCccccceecCCcCCCCccceEeeeeeeeeeceeeeeccccccccceeeehhhcCC
Confidence 345567788877777777676654445555677888999999999999999999987655666788888776543
No 54
>cd05762 Ig8_MLCK Eighth immunoglobulin (Ig)-like domain of human myosin light-chain kinase (MLCK). Ig8_MLCK: the eighth immunoglobulin (Ig)-like domain of human myosin light-chain kinase (MLCK). MLCK is a key regulator of different forms of cell motility involving actin and myosin II. Agonist stimulation of smooth muscle cells increases cytosolic Ca2+, which binds calmodulin. This Ca2+-calmodulin complex in turn binds to and activates MLCK. Activated MLCK leads to the phosphorylation of the 20 kDa myosin regulatory light chain (RLC) of myosin II and the stimulation of actin-activated myosin MgATPase activity. MLCK is widely present in vertebrate tissues; it phosphorylates the 20 kDa RLC of both smooth and nonmuscle myosin II. Phosphorylation leads to the activation of the myosin motor domain and altered structural properties of myosin II. In smooth muscle MLCK it is involved in initiating contraction. In nonmuscle cells, MLCK may participate in cell division and cell motility; it has
Probab=73.54 E-value=26 Score=23.86 Aligned_cols=84 Identities=13% Similarity=0.148 Sum_probs=49.4
Q ss_pred EEeCCCceEEEEecCCCCCCCcceEEEEEEcCCCCC---CCCeEEeecCccceEEecCCCCCceEEEEEEEecCCcCccc
Q psy4956 4 IVHGVHGANLVIKIPENLSSDNSTYRLDYIPAHGHP---PPNTTYVSRDIKDNIEFSEGLPGTKYDFYLYYTNSTVHDWL 80 (361)
Q Consensus 4 ~~~~~~~~~l~~~~p~~~~~~~~~Y~v~~~~~~~~~---~~~~~~~~~~~~~~~~i~~L~p~t~Y~i~V~a~~~~~~~~~ 80 (361)
.++....+.|.+..-+..... +.|.. +|.. .....+........++|.+......=.+.+.+.|..|....
T Consensus 11 ~v~~G~~v~l~C~~~G~p~p~-----v~W~k-dg~~l~~~~~~~~~~~~~~s~L~I~~~~~~D~G~Ytc~a~N~~G~~~~ 84 (98)
T cd05762 11 KVRAGESVELFCKVTGTQPIT-----CTWMK-FRKQIQEGEGIKIENTENSSKLTITEGQQEHCGCYTLEVENKLGSRQA 84 (98)
T ss_pred EEECCCEEEEEEEEcccCCCc-----eEEEE-CCEEecCCCcEEEEecCCeeEEEECCCChhhCEEEEEEEEcCCCceeE
Confidence 344555566666665443322 55554 3321 11334444444677889988877766666667777756666
Q ss_pred eeeEEEecCCCCC
Q psy4956 81 TWTASITTPPDPP 93 (361)
Q Consensus 81 ~~~~~~~t~~~~p 93 (361)
...+.+.-.|.+|
T Consensus 85 ~~~l~V~~~P~pP 97 (98)
T cd05762 85 QVNLTVVDKPDPP 97 (98)
T ss_pred EEEEEEecCCCCC
Confidence 6667777777777
No 55
>KOG4228|consensus
Probab=67.28 E-value=29 Score=34.70 Aligned_cols=75 Identities=13% Similarity=0.032 Sum_probs=42.0
Q ss_pred CCCCCccEEEEEeCCEEEEEeeCCCCCCcccEEEEEEECCCCCCCeEEeecCCCCceEEEcCCCCCcEEEEEEEEEeCCC
Q psy4956 90 PDPPTNLSVNVRSGKTAQIFWSPPISGKYSGFKLKVISLSEKTPPRIIGFTENPPAGYSLKDLTPGGSYQVQLFSVYDSK 169 (361)
Q Consensus 90 ~~~p~~l~~~~~~~~~v~l~W~~p~~~~~~~y~v~~~~~~~~~~~~~~~~~~~~~~~~~l~~L~p~~~Y~v~v~a~~~~~ 169 (361)
|..|....++.....++.+.|..... ....|.+.++....+ .+...+... . ......+.+-..+.+ .+.+..+
T Consensus 25 p~~~~~~~vt~~~~~~~~~~~~~~~t-~~~~y~v~~~~~~s~--~~~~~~~~~-~-~~s~~~~~~~~~~~~--~evnt~~ 97 (1087)
T KOG4228|consen 25 PPGPSTPNVTGSTATSWVQAELLGST-WPNYYQVIFKALSSG--YGYIDVDLT-T-RLSYPCLSPPSFLEL--VEVNTIG 97 (1087)
T ss_pred CCCCCCcccccccccceeehhhccCC-ChhhhheeeeEeccC--cceeeccce-e-ecccCCCCCcchhhh--hhccccc
Confidence 44445555666667788888987654 667788877665433 332222211 1 344455666666665 4444444
Q ss_pred Cc
Q psy4956 170 ES 171 (361)
Q Consensus 170 ~s 171 (361)
..
T Consensus 98 ~a 99 (1087)
T KOG4228|consen 98 RA 99 (1087)
T ss_pred cc
Confidence 44
No 56
>PF07353 Uroplakin_II: Uroplakin II; InterPro: IPR009952 This family contains uroplakin II, which is approximately 180 residues long and seems to be restricted to mammals. Uroplakin II is an integral membrane protein, and is one of the components of the apical plaques of mammalian urothelium formed by the asymmetric unit membrane - this is believed to play a role in strengthening the urothelial apical surface to prevent the cells from rupturing during bladder distension [].; GO: 0016044 cellular membrane organization, 0030176 integral to endoplasmic reticulum membrane
Probab=66.68 E-value=12 Score=27.80 Aligned_cols=25 Identities=28% Similarity=0.331 Sum_probs=21.2
Q ss_pred cceEEecCCCCCceEEEEEEEecCC
Q psy4956 51 KDNIEFSEGLPGTKYDFYLYYTNST 75 (361)
Q Consensus 51 ~~~~~i~~L~p~t~Y~i~V~a~~~~ 75 (361)
-..|.+.||.||+.|.|+-...++.
T Consensus 101 lsaYqVtNL~pGTkY~isY~Vtkgt 125 (184)
T PF07353_consen 101 LSAYQVTNLQPGTKYYISYLVTKGT 125 (184)
T ss_pred ceeEEeeccCCCcEEEEEEEEecCc
Confidence 3579999999999999998876665
No 57
>PF13754 Big_3_4: Bacterial Ig-like domain (group 3)
Probab=65.17 E-value=23 Score=21.01 Aligned_cols=30 Identities=7% Similarity=-0.057 Sum_probs=21.0
Q ss_pred ceEEecCCCCCceEEEEEEEecCCcCcccee
Q psy4956 52 DNIEFSEGLPGTKYDFYLYYTNSTVHDWLTW 82 (361)
Q Consensus 52 ~~~~i~~L~p~t~Y~i~V~a~~~~~~~~~~~ 82 (361)
=++++..+ ..-.|.|.|++....|..+...
T Consensus 14 Ws~t~~~~-~dG~y~itv~a~D~AGN~s~~~ 43 (54)
T PF13754_consen 14 WSFTVPAL-ADGTYTITVTATDAAGNTSTSS 43 (54)
T ss_pred EEEeCCCC-CCccEEEEEEEEeCCCCCCCcc
Confidence 34555666 6778999999987766665543
No 58
>KOG3515|consensus
Probab=64.11 E-value=13 Score=35.59 Aligned_cols=79 Identities=20% Similarity=0.191 Sum_probs=60.6
Q ss_pred cceEEecCCCCCceEEEEEEEecCCcCccceeeEEEecCCCCCCccEEEEEeCCEEEEEeeCCCCC-CcccEEEEEEECC
Q psy4956 51 KDNIEFSEGLPGTKYDFYLYYTNSTVHDWLTWTASITTPPDPPTNLSVNVRSGKTAQIFWSPPISG-KYSGFKLKVISLS 129 (361)
Q Consensus 51 ~~~~~i~~L~p~t~Y~i~V~a~~~~~~~~~~~~~~~~t~~~~p~~l~~~~~~~~~v~l~W~~p~~~-~~~~y~v~~~~~~ 129 (361)
..+-.+-+.-++..|.+.+.+.+..|.......+...+.+..|.||.+.......+.|.|.+...+ -...|.+.|...+
T Consensus 653 s~~~~~~~~~~~~~y~~~c~t~n~lg~~~v~~~~~~~t~~~~~~n~~~~~~~~~~i~l~~~p~fdgg~~q~f~~~~~~~~ 732 (741)
T KOG3515|consen 653 SSSEWILNHIDGSDYENGCTTQNLLGSDHVSGAIHSGTASVGPINLTYDNLTYSTISLEWMPGFDGGLQQRFFLKYYDLG 732 (741)
T ss_pred cccccccCCcchhhhcceeeecccCCCccccceecCCcCCCCccceEeeeeeeeeeceeeeeccccccccceeeehhhcC
Confidence 345556667788899988888888766666666777788999999999999999999999996644 3456777775543
No 59
>TIGR00864 PCC polycystin cation channel protein. Note: this model has been restricted to the amino half because for technical reasons.
Probab=60.86 E-value=3.1e+02 Score=31.55 Aligned_cols=23 Identities=17% Similarity=0.245 Sum_probs=17.3
Q ss_pred CCCCCeEEEEEEEEeCCCCCCceE
Q psy4956 248 LVPGRAYNISVQTVSEDEISTPTT 271 (361)
Q Consensus 248 L~p~t~Y~v~V~a~~~~~~s~~~~ 271 (361)
-.|| .|.+++.+.|..|......
T Consensus 1565 ~spG-tYtVtLTvtN~~Gs~~~T~ 1587 (2740)
T TIGR00864 1565 RSVG-TFNIIVTAENDVGAAQASI 1587 (2740)
T ss_pred cCCc-eEEEEEEEecCCCccceeE
Confidence 4566 8999999999888654443
No 60
>KOG4228|consensus
Probab=59.35 E-value=1e+02 Score=31.25 Aligned_cols=68 Identities=16% Similarity=0.129 Sum_probs=39.6
Q ss_pred EEEeCCEEEEEeeCCCCCCcccEEEEEEECCCCCCCeEEeecCCCCceEEEcCCCCCcEEEEEEEEEeCCC
Q psy4956 99 NVRSGKTAQIFWSPPISGKYSGFKLKVISLSEKTPPRIIGFTENPPAGYSLKDLTPGGSYQVQLFSVYDSK 169 (361)
Q Consensus 99 ~~~~~~~v~l~W~~p~~~~~~~y~v~~~~~~~~~~~~~~~~~~~~~~~~~l~~L~p~~~Y~v~v~a~~~~~ 169 (361)
...+.+.++++|.++....+..|.+.+...+ .+....... .... .....++.|-..+.++..+...+.
T Consensus 173 ~~~~~~~it~~w~~~~~~~~~~ykl~~~~~d-~~~~~~v~~-~~~~-~~~t~~~~~~~~~~~~~ae~~~~~ 240 (1087)
T KOG4228|consen 173 EEEEYTTITGSWSPPHAVSLDTYKLLHLDPD-TGYEISVTL-TPPG-SGGTGDLGPPSISRFKCAEPDRGP 240 (1087)
T ss_pred cceEEEEEEecCCCCCcccchhhhhhhcCCc-ccceeeeec-cCCC-CCcccCCCCcccccccccCccccc
Confidence 4556788999999887777888888765542 222222211 1222 344456666666666665554443
No 61
>TIGR00864 PCC polycystin cation channel protein. Note: this model has been restricted to the amino half because for technical reasons.
Probab=58.44 E-value=3.4e+02 Score=31.26 Aligned_cols=27 Identities=22% Similarity=0.198 Sum_probs=19.5
Q ss_pred EEecCCCCCCeEEEEEEEEeCCCCCCce
Q psy4956 243 AAFKGLVPGRAYNISVQTVSEDEISTPT 270 (361)
Q Consensus 243 ~~~~~L~p~t~Y~v~V~a~~~~~~s~~~ 270 (361)
+.+.-..|| .|.|+|.|-|.-|.....
T Consensus 1662 ~~vt~~~pG-~Y~VtL~aSN~vG~~t~s 1688 (2740)
T TIGR00864 1662 AKLNPLEAG-PCDIFLQAANLLGQATAD 1688 (2740)
T ss_pred eEEecCCCc-eEEEEEEEeecccceeeE
Confidence 344446776 899999999987766443
No 62
>PF14292 SusE: SusE outer membrane protein
Probab=57.63 E-value=67 Score=22.92 Aligned_cols=73 Identities=18% Similarity=0.277 Sum_probs=41.3
Q ss_pred cceEEeeecCCceEEEEeeCCCCC-C-CceeEEEEEEECCCCCCCCc-eeecCCCCceeEec----------cCCCCCce
Q psy4956 283 SFTYDKASITSNSLRVVWEPPKGF-S-EFDKYQVSINVRRPGASSTP-ITKSRDEPTQCDMS----------EGLEPGRT 349 (361)
Q Consensus 283 ~l~~~~~~~~~~si~l~W~~~~~~-~-~~~~y~i~~~~~~~~~~~~~-~~~~~~~~~~~~~~----------~~L~p~t~ 349 (361)
.+.+ .....+.++++|++++-. . ....|.|+....+....... +.......++..++ =|+.||..
T Consensus 36 ~i~L--~~~~~~a~tftW~~~~~~~~~a~v~Y~lq~~~~~~~F~~~~~~~~~~~~~~s~~~t~~eLN~~l~~~g~~~~~~ 113 (122)
T PF14292_consen 36 SIVL--DEASDNAVTFTWTAADYGGPDAPVTYTLQFDKKGNDFSSPVEIVTTDNGSTSVSITVKELNSILLKLGLEPGEA 113 (122)
T ss_pred eEEe--cccCCceEEEEEECCccCCCCCceEEEEEEeccCCCccCcEEEEeecCcceeEEecHHHHHHHHHHcCCCCCce
Confidence 3444 334556899999997532 2 34579999887543222221 11111112333322 14899999
Q ss_pred EEEEEEEE
Q psy4956 350 YQVLVKTV 357 (361)
Q Consensus 350 Y~v~V~a~ 357 (361)
..|.+|-.
T Consensus 114 ~~l~~rV~ 121 (122)
T PF14292_consen 114 GTLYWRVK 121 (122)
T ss_pred EEEEEEEE
Confidence 99888764
No 63
>PF10342 GPI-anchored: Ser-Thr-rich glycosyl-phosphatidyl-inositol-anchored membrane family; InterPro: IPR018466 This entry represents glycoproteins involved in cell wall (1-->6)-beta-glucan assembly. In yeast a null mutation leads to severe growth defects, aberrant multi-budded morphology, and mating defects [, ]. The entry includes DRMIP and Hesp-379, which are involved in both fruiting body formation and in host attack respectively. Hesp-379 is a haustorially expressed secreted protein; the haustorium being the small sucker that penetrates host tissue [].
Probab=55.31 E-value=59 Score=21.57 Aligned_cols=62 Identities=10% Similarity=0.147 Sum_probs=35.7
Q ss_pred CeEEEEeecCCCCCcceEEEEEEeCCCCCC---ceeEEeeccCCCCCcEEEe-cCCCCCCeEEEEEEE
Q psy4956 197 TTLLVLWQPSYPASIYTHYKVSIDPPDAPE---SVLYVEKEGEPPGPAQAAF-KGLVPGRAYNISVQT 260 (361)
Q Consensus 197 ~~~~v~W~~~~~~~~~~~y~v~~~~~~~~~---~~~~~~~~~~~~~~~~~~~-~~L~p~t~Y~v~V~a 260 (361)
..+.|+|+... .....|.|.+....... ............+...+.+ .+|.++..|.|++..
T Consensus 13 ~~~~I~W~~~~--~~~~~~~I~L~~g~~~~~~~~~~ia~~v~~~~gs~~~~~p~~l~~~~~Y~i~~~~ 78 (93)
T PF10342_consen 13 QPITITWTSDG--TDPGNVTIYLCNGNNTNLNFVQTIASNVSNSDGSYTWTIPSDLPSGGDYFIQIVN 78 (93)
T ss_pred CcEEEEEeCCC--CCCcEEEEEEEcCCCCCcceeEEEEecccCCCCEEEEEcCCCCCCCCcEEEEEEE
Confidence 67999998752 12266778777655521 1111111111113344555 569999999998884
No 64
>PF13750 Big_3_3: Bacterial Ig-like domain (group 3)
Probab=54.83 E-value=92 Score=23.61 Aligned_cols=33 Identities=12% Similarity=0.335 Sum_probs=25.7
Q ss_pred ecCCCCCCeEEEEEEEEeCCCCCCceEEEEecC
Q psy4956 245 FKGLVPGRAYNISVQTVSEDEISTPTTAQYRTI 277 (361)
Q Consensus 245 ~~~L~p~t~Y~v~V~a~~~~~~s~~~~~~~~t~ 277 (361)
|..|..+..|.++|.|....|......+.|.-.
T Consensus 116 fpsle~~~~YtLtV~a~D~aGN~~~~si~F~y~ 148 (158)
T PF13750_consen 116 FPSLEADDSYTLTVSATDKAGNQSTKSISFSYM 148 (158)
T ss_pred cCCcCCCCeEEEEEEEEecCCCEEEEEEEEEEe
Confidence 456889999999999999888766656655554
No 65
>PHA02579 7 baseplate wedge subunit; Provisional
Probab=52.20 E-value=80 Score=30.46 Aligned_cols=76 Identities=16% Similarity=0.235 Sum_probs=48.9
Q ss_pred CCccEEEEEeCCEEEEEeeCCCCCCcccEEEEEEECCCC-CC--------CeEEeecCCCCceEEEcCCCCCcEEEEEEE
Q psy4956 93 PTNLSVNVRSGKTAQIFWSPPISGKYSGFKLKVISLSEK-TP--------PRIIGFTENPPAGYSLKDLTPGGSYQVQLF 163 (361)
Q Consensus 93 p~~l~~~~~~~~~v~l~W~~p~~~~~~~y~v~~~~~~~~-~~--------~~~~~~~~~~~~~~~l~~L~p~~~Y~v~v~ 163 (361)
.+.|++...+.+.+.|+|+.- |.--.|.|++...-.. +. +..+....... -+. ..+.|.+.|.+||+
T Consensus 8 vtslrI~kLsaN~v~l~WddV--G~NFyY~Ve~a~t~~~~g~~ip~~~~~w~nlg~T~~~~-wFe-d~~~p~t~Yk~Rv~ 83 (1030)
T PHA02579 8 VTSLRIDKLSANQVYLTWDDV--GANFYYFVELAETRDADGELIPDDELRWINLGYTANNE-WFE-DKLQPNTYYKFRVA 83 (1030)
T ss_pred ccEEEhhhhccceEEEEeecc--CCceEEEEEEEeeccCCCccCCCccccceecCcccchh-hhh-hccCCcceEEEEEE
Confidence 367778788899999999983 5556788888753221 11 11122222222 222 34999999999999
Q ss_pred EEeCCCCcc
Q psy4956 164 SVYDSKESV 172 (361)
Q Consensus 164 a~~~~~~s~ 172 (361)
....+-+..
T Consensus 84 ~~~qGFe~S 92 (1030)
T PHA02579 84 VAAQGFEQS 92 (1030)
T ss_pred eeccCCCcc
Confidence 988775443
No 66
>PF07353 Uroplakin_II: Uroplakin II; InterPro: IPR009952 This family contains uroplakin II, which is approximately 180 residues long and seems to be restricted to mammals. Uroplakin II is an integral membrane protein, and is one of the components of the apical plaques of mammalian urothelium formed by the asymmetric unit membrane - this is believed to play a role in strengthening the urothelial apical surface to prevent the cells from rupturing during bladder distension [].; GO: 0016044 cellular membrane organization, 0030176 integral to endoplasmic reticulum membrane
Probab=51.09 E-value=1e+02 Score=23.13 Aligned_cols=25 Identities=24% Similarity=0.329 Sum_probs=20.7
Q ss_pred cEEEecCCCCCCeEEEEEEEEeCCC
Q psy4956 241 AQAAFKGLVPGRAYNISVQTVSEDE 265 (361)
Q Consensus 241 ~~~~~~~L~p~t~Y~v~V~a~~~~~ 265 (361)
..+.+.+|.||+.|.|+-...++..
T Consensus 102 saYqVtNL~pGTkY~isY~Vtkgts 126 (184)
T PF07353_consen 102 SAYQVTNLQPGTKYYISYLVTKGTS 126 (184)
T ss_pred eeEEeeccCCCcEEEEEEEEecCcc
Confidence 5688999999999999887766543
No 67
>PF09423 PhoD: PhoD-like phosphatase; InterPro: IPR018946 This entry contains a number of putative proteins as well as Alkaline phosphatase D which catalyses the reaction: A phosphate monoester + H(2)O = an alcohol + phosphate ; PDB: 2YEQ_B.
Probab=45.44 E-value=45 Score=30.44 Aligned_cols=48 Identities=23% Similarity=0.349 Sum_probs=20.9
Q ss_pred eEEEcCCCCCcEEEEEEEEEeCCCCcccceeeeeccCCCC-CcceEEEeccC
Q psy4956 146 GYSLKDLTPGGSYQVQLFSVYDSKESVAYTSRNFTTKPNT-PGKFIVWFRNE 196 (361)
Q Consensus 146 ~~~l~~L~p~~~Y~v~v~a~~~~~~s~~~~~~~~~t~p~~-p~~l~~~~~~~ 196 (361)
.+.+++|.|++.|.+++........+ ..-.++|.|.. +..+++...++
T Consensus 65 ~v~v~gL~p~t~Y~Y~~~~~~~~~~s---~~g~~rT~p~~~~~~~r~a~~SC 113 (453)
T PF09423_consen 65 KVDVTGLQPGTRYYYRFVVDGGGQTS---PVGRFRTAPDGDPDPFRFAFGSC 113 (453)
T ss_dssp EEEE-S--TT-EEEEEEEE--TTEE------EEEE--TT-----EEEEEE--
T ss_pred ecccCCCCCCceEEEEEEEecCCCCC---CceEEEcCCCCCCCceEEEEECC
Confidence 68889999999999999983332222 23455666533 33455544443
No 68
>PF13205 Big_5: Bacterial Ig-like domain
Probab=42.27 E-value=1.1e+02 Score=20.86 Aligned_cols=23 Identities=30% Similarity=0.491 Sum_probs=16.2
Q ss_pred CCCceeEeccCCCCCceEEEEEE
Q psy4956 333 DEPTQCDMSEGLEPGRTYQVLVK 355 (361)
Q Consensus 333 ~~~~~~~~~~~L~p~t~Y~v~V~ 355 (361)
+....+.....|.+|+.|.|.|.
T Consensus 61 ~~~~~i~p~~~L~~~t~Y~v~i~ 83 (107)
T PF13205_consen 61 GNTLTITPSQPLKPGTTYTVTID 83 (107)
T ss_pred CCEEEEEECCcCCCCCEEEEEEC
Confidence 34344555556999999999983
No 69
>PHA02579 7 baseplate wedge subunit; Provisional
Probab=37.87 E-value=1.8e+02 Score=28.29 Aligned_cols=73 Identities=14% Similarity=0.195 Sum_probs=44.9
Q ss_pred ccceEEeeecCCceEEEEeeCCCCCCCceeEEEEEEECCCCCCC------CceeecC-CCCceeEeccCCCCCceEEEEE
Q psy4956 282 LSFTYDKASITSNSLRVVWEPPKGFSEFDKYQVSINVRRPGASS------TPITKSR-DEPTQCDMSEGLEPGRTYQVLV 354 (361)
Q Consensus 282 ~~l~~~~~~~~~~si~l~W~~~~~~~~~~~y~i~~~~~~~~~~~------~~~~~~~-~~~~~~~~~~~L~p~t~Y~v~V 354 (361)
..|++ ...+.+.+.|+|+.... + -.|.|++......... ....... .....+--. .|.|.+.|.+||
T Consensus 9 tslrI--~kLsaN~v~l~WddVG~--N-FyY~Ve~a~t~~~~g~~ip~~~~~w~nlg~T~~~~wFed-~~~p~t~Yk~Rv 82 (1030)
T PHA02579 9 TSLRI--DKLSANQVYLTWDDVGA--N-FYYFVELAETRDADGELIPDDELRWINLGYTANNEWFED-KLQPNTYYKFRV 82 (1030)
T ss_pred cEEEh--hhhccceEEEEeeccCC--c-eEEEEEEEeeccCCCccCCCccccceecCcccchhhhhh-ccCCcceEEEEE
Confidence 45566 78888999999997643 2 5689998875432111 0111111 111122222 399999999999
Q ss_pred EEEeCC
Q psy4956 355 KTVSGK 360 (361)
Q Consensus 355 ~a~~~~ 360 (361)
+...++
T Consensus 83 ~~~~qG 88 (1030)
T PHA02579 83 AVAAQG 88 (1030)
T ss_pred EeeccC
Confidence 987654
No 70
>PF04775 Bile_Hydr_Trans: Acyl-CoA thioester hydrolase/BAAT N-terminal region; InterPro: IPR006862 This entry presents the N-termini of acyl-CoA thioester hydrolase and bile acid-CoA:amino acid N-acetyltransferase (BAAT) []. This region is not thought to contain the active site of either enzyme. Thioesterase isoforms have been identified in peroxisomes, cytoplasm and mitochondria, where they are thought to have distinct functions in lipid metabolism []. For example, in peroxisomes, the hydrolase acts on bile-CoA esters [].; GO: 0016290 palmitoyl-CoA hydrolase activity, 0006629 lipid metabolic process; PDB: 3HLK_B 3K2I_B.
Probab=35.92 E-value=84 Score=22.70 Aligned_cols=37 Identities=16% Similarity=0.356 Sum_probs=20.7
Q ss_pred cEEEecCCCCCCeEEEEEEEEeCCCCCCceEEEEecC
Q psy4956 241 AQAAFKGLVPGRAYNISVQTVSEDEISTPTTAQYRTI 277 (361)
Q Consensus 241 ~~~~~~~L~p~t~Y~v~V~a~~~~~~s~~~~~~~~t~ 277 (361)
....++||.|++.|+++.......|.--.+...+...
T Consensus 5 ~~I~v~GL~p~~~vtl~a~~~~~~g~~w~S~A~f~Ad 41 (126)
T PF04775_consen 5 VDIRVSGLPPGQEVTLRARLTDDNGVQWQSYATFRAD 41 (126)
T ss_dssp -EEEEES--TT-EEEEEEEEE-TTS-EEEEEEEEE--
T ss_pred eEEEEeCCCCCCEEEEEEEEEeCCCCEEEEEEEEEcC
Confidence 4677899999999999999987655433333344443
No 71
>PF14734 DUF4469: Domain of unknown function (DUF4469) with IG-like fold
Probab=34.60 E-value=79 Score=21.88 Aligned_cols=32 Identities=19% Similarity=0.283 Sum_probs=23.7
Q ss_pred eeecCCCCceeEeccCCCCCceEEEEEEEEeCC
Q psy4956 328 ITKSRDEPTQCDMSEGLEPGRTYQVLVKTVSGK 360 (361)
Q Consensus 328 ~~~~~~~~~~~~~~~~L~p~t~Y~v~V~a~~~~ 360 (361)
+....++...+.+..+|..|. |.+.|+...++
T Consensus 58 i~~N~ps~l~~~lPa~L~~G~-Y~l~V~Tq~~~ 89 (102)
T PF14734_consen 58 IVRNKPSRLIFILPADLAAGE-YTLEVRTQYSG 89 (102)
T ss_pred eEeCCCcEEEEECcCccCceE-EEEEEEEEecC
Confidence 444556667788876688885 99999988764
No 72
>PF14054 DUF4249: Domain of unknown function (DUF4249)
Probab=32.76 E-value=2.6e+02 Score=23.59 Aligned_cols=64 Identities=25% Similarity=0.462 Sum_probs=35.8
Q ss_pred CCCCCCeEEEEEEEEeCCCCCCceEEEEecCCCCCc--cceEEeeec---CCce---EEEEeeCCCCCCCceeEEEEEE
Q psy4956 247 GLVPGRAYNISVQTVSEDEISTPTTAQYRTIPLRPL--SFTYDKASI---TSNS---LRVVWEPPKGFSEFDKYQVSIN 317 (361)
Q Consensus 247 ~L~p~t~Y~v~V~a~~~~~~s~~~~~~~~t~~~~p~--~l~~~~~~~---~~~s---i~l~W~~~~~~~~~~~y~i~~~ 317 (361)
.+.+|..|.++|.+-. +..-.+ ..+.|.+|. .+....... .... +.+.|+.+.+ .-..|.+.+.
T Consensus 95 ~~~~G~~Y~L~V~~~~--~~~~sa---~~~vp~~~~i~~v~~~~~~~~~~~~~~~~~i~~~~~D~~~--~~nyY~~~~~ 166 (298)
T PF14054_consen 95 RGRPGRTYRLEVETPG--GKTYSA---ETTVPPPPPIDSVSYEFKKIGDGDEGEYYRISITFQDPPG--EDNYYRWRVE 166 (298)
T ss_pred cccCCCEEEEEEEECC--CCEEEE---EEEECCCCceeEEEEEEecCCCCcccceeEEEEEEeCCCC--CceEEEEEEE
Confidence 4788999999998852 211111 123444443 333311111 1122 8999998766 5677877766
No 73
>PF09423 PhoD: PhoD-like phosphatase; InterPro: IPR018946 This entry contains a number of putative proteins as well as Alkaline phosphatase D which catalyses the reaction: A phosphate monoester + H(2)O = an alcohol + phosphate ; PDB: 2YEQ_B.
Probab=31.35 E-value=1e+02 Score=28.08 Aligned_cols=49 Identities=18% Similarity=0.312 Sum_probs=21.6
Q ss_pred cceEEecCCCCCceEEEEEEEecCCcCccceeeEEEecCCCC-CCccEEEEEe
Q psy4956 51 KDNIEFSEGLPGTKYDFYLYYTNSTVHDWLTWTASITTPPDP-PTNLSVNVRS 102 (361)
Q Consensus 51 ~~~~~i~~L~p~t~Y~i~V~a~~~~~~~~~~~~~~~~t~~~~-p~~l~~~~~~ 102 (361)
...+.+++|+|++.|.+++....+. ..+. .-.++|.|.. +..+++...+
T Consensus 63 t~~v~v~gL~p~t~Y~Y~~~~~~~~-~~s~--~g~~rT~p~~~~~~~r~a~~S 112 (453)
T PF09423_consen 63 TVKVDVTGLQPGTRYYYRFVVDGGG-QTSP--VGRFRTAPDGDPDPFRFAFGS 112 (453)
T ss_dssp EEEEEE-S--TT-EEEEEEEE--TT-EE-----EEEE--TT-----EEEEEE-
T ss_pred EeecccCCCCCCceEEEEEEEecCC-CCCC--ceEEEcCCCCCCCceEEEEEC
Confidence 3468899999999999999883322 2222 3455565543 3345554433
No 74
>PF14250 AbrB-like: AbrB-like transcriptional regulator
Probab=30.96 E-value=47 Score=20.90 Aligned_cols=37 Identities=22% Similarity=0.416 Sum_probs=23.0
Q ss_pred CceeEEEEEEECCCCCCCCceeecCCCCceeEeccCCCCCceEEEEE
Q psy4956 308 EFDKYQVSINVRRPGASSTPITKSRDEPTQCDMSEGLEPGRTYQVLV 354 (361)
Q Consensus 308 ~~~~y~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~p~t~Y~v~V 354 (361)
.--.|.+.+...+. +. -+......+ +|+||.+++|.+
T Consensus 27 R~~syr~~Vq~NGn------LL--IG~AYT~~m--~L~PGdEFeI~L 63 (71)
T PF14250_consen 27 RKASYRVSVQGNGN------LL--IGSAYTKQM--GLKPGDEFEIKL 63 (71)
T ss_pred cCceEEEEEecCCC------EE--EcHHHHHHh--CCCCCCEEEEEe
Confidence 44568888776543 21 233223334 499999999986
No 75
>PRK13211 N-acetylglucosamine-binding protein A; Reviewed
Probab=30.13 E-value=2.8e+02 Score=25.60 Aligned_cols=28 Identities=7% Similarity=-0.082 Sum_probs=18.4
Q ss_pred cceEEecCCCCCceEEEEEEEecCCcCcc
Q psy4956 51 KDNIEFSEGLPGTKYDFYLYYTNSTVHDW 79 (361)
Q Consensus 51 ~~~~~i~~L~p~t~Y~i~V~a~~~~~~~~ 79 (361)
.-++.|.++++|. |.+.|.+....|...
T Consensus 368 ~vtL~Ls~~~AG~-y~Lvv~~t~~dG~~~ 395 (478)
T PRK13211 368 SVSLDLSKLKAGH-HMLVVKAKPKDGELI 395 (478)
T ss_pred eEEEecccCCCce-EEEEEEEEeCCCcee
Confidence 3456666888776 888888876553433
No 76
>PF02010 REJ: REJ domain; InterPro: IPR002859 The REJ (Receptor for Egg Jelly) domain is found in PKD1 P98161 from SWISSPROT and the sperm receptor for egg jelly Q26627 from SWISSPROT. The exact function of this domain is unknown. The domain is 600 amino acids long so is probably composed of multiple structural domains. There are six completely conserved cysteine residues that may form disulphide bridges. This region contains tandem PKD-like domains. Sequence similarity between a region of the autosomal dominant polycystic kidney disease (ADPKD) protein, polycystin-1 and a sea urchin sperm glycoprotein involved in fertilization, the receptor for egg jelly (suREJ) has been known for some time. The suREJ protein binds the glycoprotein coat of the egg (egg jelly), triggering the acrosome reaction, which transforms the sperm into a fusogenic cell. The sequence similarity and expression pattern suggests that the predicted human PKDREJ protein is a mammalian equivalent of the suREJ protein and therefore may have a central role in human fertilization [].; PDB: 2E7M_A 2YRL_A.
Probab=29.94 E-value=17 Score=32.84 Aligned_cols=25 Identities=24% Similarity=0.464 Sum_probs=0.0
Q ss_pred cceEEe--cCCCCCceEEEEEEEecCC
Q psy4956 51 KDNIEF--SEGLPGTKYDFYLYYTNST 75 (361)
Q Consensus 51 ~~~~~i--~~L~p~t~Y~i~V~a~~~~ 75 (361)
...++| ..|.++..|.|++...++.
T Consensus 147 ~~~l~i~~~~l~~~~~y~f~ltv~k~~ 173 (440)
T PF02010_consen 147 SSSLTIPASTLSPGSTYTFTLTVSKGS 173 (440)
T ss_dssp ---------------------------
T ss_pred CEEEEEEhHHcCCCceEEEEEEEEeCC
Confidence 345555 6799999999999987776
No 77
>PF04775 Bile_Hydr_Trans: Acyl-CoA thioester hydrolase/BAAT N-terminal region; InterPro: IPR006862 This entry presents the N-termini of acyl-CoA thioester hydrolase and bile acid-CoA:amino acid N-acetyltransferase (BAAT) []. This region is not thought to contain the active site of either enzyme. Thioesterase isoforms have been identified in peroxisomes, cytoplasm and mitochondria, where they are thought to have distinct functions in lipid metabolism []. For example, in peroxisomes, the hydrolase acts on bile-CoA esters [].; GO: 0016290 palmitoyl-CoA hydrolase activity, 0006629 lipid metabolic process; PDB: 3HLK_B 3K2I_B.
Probab=29.85 E-value=1.1e+02 Score=22.09 Aligned_cols=25 Identities=16% Similarity=0.191 Sum_probs=17.4
Q ss_pred eEEEcCCCCCcEEEEEEEEEeCCCC
Q psy4956 146 GYSLKDLTPGGSYQVQLFSVYDSKE 170 (361)
Q Consensus 146 ~~~l~~L~p~~~Y~v~v~a~~~~~~ 170 (361)
.+.+.||.|+..++++.......+.
T Consensus 6 ~I~v~GL~p~~~vtl~a~~~~~~g~ 30 (126)
T PF04775_consen 6 DIRVSGLPPGQEVTLRARLTDDNGV 30 (126)
T ss_dssp EEEEES--TT-EEEEEEEEE-TTS-
T ss_pred EEEEeCCCCCCEEEEEEEEEeCCCC
Confidence 6889999999999999999876553
No 78
>PF00907 T-box: T-box; InterPro: IPR001699 Transcription factors of the T-box family are required both for early cell-fate decisions, such as those necessary for formation of the basic vertebrate body plan, and for differentiation and organogenesis []. The T-box is defined as the minimal region within the T-box protein that is both necessary and sufficient for sequence-specific DNA binding, all members of the family so far examined bind to the DNA consensus sequence TCACACCT. The T-box is a relatively large DNA-binding domain, generally comprising about a third of the entire protein (17-26 kDa). These genes were uncovered on the basis of similarity to the DNA binding domain [] of Mus musculus (Mouse) Brachyury (T) gene product, which similarity is the defining feature of the family. The Brachyury gene is named for its phenotype, which was identified 70 years ago as a mutant mouse strain with a short blunted tail. The gene, and its paralogues, have become a well-studied model for the family, and hence much of what is known about the T-box family is derived from the murine Brachyury gene. Consistent with its nuclear location, Brachyury protein has a sequence-specific DNA-binding activity and can act as a transcriptional regulator []. Homozygous mutants for the gene undergo extensive developmental anomalies, thus rendering the mutation lethal []. The postulated role of Brachyury is as a transcription factor, regulating the specification and differentiation of posterior mesoderm during gastrulation in a dose-dependent manner []. T-box proteins tend to be expressed in specific organs or cell types, especially during development, and they are generally required for the development of those tissues, for example, Brachyury is expressed in posterior mesoderm and in the developing notochord, and it is required for the formation of these cells in mice []. ; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus; PDB: 1H6F_B 4A04_A 1XBR_B 2X6V_B 2X6U_A.
Probab=29.34 E-value=63 Score=25.16 Aligned_cols=22 Identities=18% Similarity=0.167 Sum_probs=18.0
Q ss_pred cceEEecCCCCCceEEEEEEEe
Q psy4956 51 KDNIEFSEGLPGTKYDFYLYYT 72 (361)
Q Consensus 51 ~~~~~i~~L~p~t~Y~i~V~a~ 72 (361)
.-.|.+.||.|...|.+.+.-.
T Consensus 32 ~l~y~vsGL~p~~~Y~i~l~~~ 53 (184)
T PF00907_consen 32 TLEYSVSGLDPDSLYSISLHFE 53 (184)
T ss_dssp -EEEEEESS-TTSEEEEEEEEE
T ss_pred ccEEEecCCCCCcceEEEEEEE
Confidence 4689999999999999999863
No 79
>cd02848 Chitinase_N_term Chitinase N-terminus domain. Chitinases hydrolyze the abundant natural biopolymer chitin, producing smaller chito-oligosaccharides. Chitin consists of multiple N-acetyl-D-glucosamine (NAG) residues connected via beta-1,4-glycosidic linkages and is an important structural element of fungal cell wall and arthropod exoskeletons. On the basis of the mode of chitin hydrolysis, chitinases are classified as random, endo-, and exo-chitinases and based on sequence criteria, chitinases belong to families 18 and 19 of glycosyl hydrolases. The N-terminus of chitinase may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitob
Probab=29.25 E-value=2e+02 Score=20.06 Aligned_cols=64 Identities=13% Similarity=0.086 Sum_probs=35.0
Q ss_pred CCceEEEEecCCCCCCCcceEEEEEEcCCCCCCCCeEEeecCccceEEecCCCCCceEEEEEEEecCCcC
Q psy4956 8 VHGANLVIKIPENLSSDNSTYRLDYIPAHGHPPPNTTYVSRDIKDNIEFSEGLPGTKYDFYLYYTNSTVH 77 (361)
Q Consensus 8 ~~~~~l~~~~p~~~~~~~~~Y~v~~~~~~~~~~~~~~~~~~~~~~~~~i~~L~p~t~Y~i~V~a~~~~~~ 77 (361)
...+++.++|..=.+...+.|.|.+... ......+... ..+.++. ..-|.+|..+|+.++..|-
T Consensus 31 ~d~v~V~V~wnvWsG~~Gd~a~vl~dg~---~V~~G~~~~~--~~~at~~-v~kgG~y~m~V~lCn~dGC 94 (106)
T cd02848 31 KDAADVSVKWNAWSGDPGDTYKVLLDGK---EVWSGALTGS--SGTATFK-VGKGGRYQMQVALCNGDGC 94 (106)
T ss_pred ccceEEEEEEeeecCCCCcEEEEEECCe---EEEcccCCCC--ccEEEEE-eCCCCeEEEEEEEECCCCc
Confidence 4456677766665555667777765221 0000011111 1233333 4667889999999887643
No 80
>KOG1378|consensus
Probab=28.29 E-value=1.5e+02 Score=26.87 Aligned_cols=75 Identities=17% Similarity=0.197 Sum_probs=37.5
Q ss_pred CCccceEEeeecCCceEEEEeeCCCCCCCceeEEEEEEECCCC--CCCCce--------eecCCCCceeEeccCCCCCce
Q psy4956 280 RPLSFTYDKASITSNSLRVVWEPPKGFSEFDKYQVSINVRRPG--ASSTPI--------TKSRDEPTQCDMSEGLEPGRT 349 (361)
Q Consensus 280 ~p~~l~~~~~~~~~~si~l~W~~~~~~~~~~~y~i~~~~~~~~--~~~~~~--------~~~~~~~~~~~~~~~L~p~t~ 349 (361)
.|+++.++...... .+.++|-..+....+..|-..-...... ...... ....+..-.+++.+ |+|++.
T Consensus 44 ~peQvhlS~~~~~~-~m~VswvT~~~~~~~V~Yg~~~~~~~~~~~~~~~~~~~~~y~~~~~~sg~ih~~~~~~-L~~~t~ 121 (452)
T KOG1378|consen 44 SPEQVHLSFTDNLN-EMRVSWVTGDGEENVVRYGEVKDKLDNSAARGMTEAWTDGYANGWRDSGYIHDAVMKN-LEPNTR 121 (452)
T ss_pred CCCeEEEeccCCCC-cEEEEEeCCCCCCceEEEeecCCCccccccccceEEEecccccccceeeeEeeeeecC-CCCCce
Confidence 47777773333333 8999998765422223333211110000 000000 01122334567775 999999
Q ss_pred EEEEEEE
Q psy4956 350 YQVLVKT 356 (361)
Q Consensus 350 Y~v~V~a 356 (361)
|..+|-+
T Consensus 122 YyY~~Gs 128 (452)
T KOG1378|consen 122 YYYQVGS 128 (452)
T ss_pred EEEEeCC
Confidence 9998744
No 81
>TIGR00868 hCaCC calcium-activated chloride channel protein 1. distributions. found a row in 1A13.INFO that was not parsed out
Probab=27.33 E-value=6.6e+02 Score=25.41 Aligned_cols=75 Identities=16% Similarity=0.331 Sum_probs=44.0
Q ss_pred ccceEEeeecCCceEEEEeeCCCC---CCCceeEEEEEEECCCC----CC-CCc------eeecCCCCceeEecc-CC--
Q psy4956 282 LSFTYDKASITSNSLRVVWEPPKG---FSEFDKYQVSINVRRPG----AS-STP------ITKSRDEPTQCDMSE-GL-- 344 (361)
Q Consensus 282 ~~l~~~~~~~~~~si~l~W~~~~~---~~~~~~y~i~~~~~~~~----~~-~~~------~~~~~~~~~~~~~~~-~L-- 344 (361)
.+|++ ......|.|+|+.|.. .+....|+|++...-.. .. ... ..+..++...+++.- +.
T Consensus 760 tDL~~---~~~~~~v~LsWTAPG~d~D~G~a~~y~ir~s~~~~~l~~~f~~a~~vn~~~~~P~~ags~e~~~f~~~~~~~ 836 (863)
T TIGR00868 760 TDLEA---GFQGDNIILTWTAPGDVLDHGRADRYIIRISTSILDLRDDFNDATQVNTTDLIPKEANSKEVFVFKPEGIPI 836 (863)
T ss_pred eeeEE---eecCCEEEEEeeCCCccCCCCccceEEEEecCCHHHHHhhhccccccccCCcCCCCCCceeEEEEeCCcccc
Confidence 35555 4555669999999843 35788999998763210 00 000 011224444555541 12
Q ss_pred CCCceEEEEEEEEeC
Q psy4956 345 EPGRTYQVLVKTVSG 359 (361)
Q Consensus 345 ~p~t~Y~v~V~a~~~ 359 (361)
..++.|.|.|+|+..
T Consensus 837 ~~~~~~~~ai~a~d~ 851 (863)
T TIGR00868 837 ENGTDLFIAVQAIDK 851 (863)
T ss_pred cCCeEEEEEEEEEcc
Confidence 258899999999864
No 82
>TIGR00868 hCaCC calcium-activated chloride channel protein 1. distributions. found a row in 1A13.INFO that was not parsed out
Probab=26.37 E-value=6.9e+02 Score=25.29 Aligned_cols=78 Identities=13% Similarity=0.190 Sum_probs=43.7
Q ss_pred ceEEEeccCCeEEEEeecCCCC---CcceEEEEEEeCCCC-----CCceeEEeeccC----CCCCcEEEe--cCC--CCC
Q psy4956 188 KFIVWFRNETTLLVLWQPSYPA---SIYTHYKVSIDPPDA-----PESVLYVEKEGE----PPGPAQAAF--KGL--VPG 251 (361)
Q Consensus 188 ~l~~~~~~~~~~~v~W~~~~~~---~~~~~y~v~~~~~~~-----~~~~~~~~~~~~----~~~~~~~~~--~~L--~p~ 251 (361)
+|++. .....|.|+|..|..+ |...+|.|++...-. ......+..... .+..-.+.| .+. ..+
T Consensus 761 DL~~~-~~~~~v~LsWTAPG~d~D~G~a~~y~ir~s~~~~~l~~~f~~a~~vn~~~~~P~~ags~e~~~f~~~~~~~~~~ 839 (863)
T TIGR00868 761 DLEAG-FQGDNIILTWTAPGDVLDHGRADRYIIRISTSILDLRDDFNDATQVNTTDLIPKEANSKEVFVFKPEGIPIENG 839 (863)
T ss_pred eeEEe-ecCCEEEEEeeCCCccCCCCccceEEEEecCCHHHHHhhhccccccccCCcCCCCCCceeEEEEeCCcccccCC
Confidence 45553 3445699999877543 788899999875311 111111111111 112223333 342 357
Q ss_pred CeEEEEEEEEeCCCC
Q psy4956 252 RAYNISVQTVSEDEI 266 (361)
Q Consensus 252 t~Y~v~V~a~~~~~~ 266 (361)
+.|.|.|+|++..+.
T Consensus 840 ~~~~~ai~a~d~~~~ 854 (863)
T TIGR00868 840 TDLFIAVQAIDKANL 854 (863)
T ss_pred eEEEEEEEEEccccc
Confidence 789999999987654
No 83
>PF14686 fn3_3: Polysaccharide lyase family 4, domain II; PDB: 1NKG_A 2XHN_B 3NJX_A 3NJV_A.
Probab=25.59 E-value=56 Score=22.23 Aligned_cols=20 Identities=15% Similarity=0.464 Sum_probs=13.2
Q ss_pred CceeEeccCCCCCceEEEEEEE
Q psy4956 335 PTQCDMSEGLEPGRTYQVLVKT 356 (361)
Q Consensus 335 ~~~~~~~~~L~p~t~Y~v~V~a 356 (361)
.-.|++.+ ++||+ |++.+.+
T Consensus 49 ~G~Fti~~-V~pGt-Y~L~ay~ 68 (95)
T PF14686_consen 49 DGNFTIPN-VRPGT-YRLYAYA 68 (95)
T ss_dssp TSEEE----B-SEE-EEEEEEE
T ss_pred CCcEEeCC-eeCcE-eEEEEEE
Confidence 34899986 99996 9998887
No 84
>PF10333 Pga1: GPI-Mannosyltransferase II co-activator; InterPro: IPR019433 Pga1 is found only in yeasts and not in mammals. It localises in the ER as a glycosylated integral membrane protein. It binds to the GPI-mannosyltransferase II subunit of the GPI and it is responsible for the second mannose addition to GPI precursors. The GPI-anchoring complex is a glycolipid that functions as a membrane anchor for many cell-surface proteins [].
Probab=25.23 E-value=1e+02 Score=23.99 Aligned_cols=22 Identities=18% Similarity=0.445 Sum_probs=18.3
Q ss_pred CCcEEEecCCCCCCeEEEEEEE
Q psy4956 239 GPAQAAFKGLVPGRAYNISVQT 260 (361)
Q Consensus 239 ~~~~~~~~~L~p~t~Y~v~V~a 260 (361)
....+.+.+|++|..|.++++=
T Consensus 64 ~t~~V~L~nl~~~e~y~vKiCW 85 (180)
T PF10333_consen 64 STTYVELNNLQPGETYQVKICW 85 (180)
T ss_pred ceEEEEeccCCCCCeEEEEEEE
Confidence 3467889999999999988864
No 85
>KOG1378|consensus
Probab=25.10 E-value=3.2e+02 Score=24.91 Aligned_cols=21 Identities=29% Similarity=0.387 Sum_probs=17.3
Q ss_pred CcEEEecCCCCCCeEEEEEEE
Q psy4956 240 PAQAAFKGLVPGRAYNISVQT 260 (361)
Q Consensus 240 ~~~~~~~~L~p~t~Y~v~V~a 260 (361)
.....+.+|.+++.|..+|-.
T Consensus 108 ih~~~~~~L~~~t~YyY~~Gs 128 (452)
T KOG1378|consen 108 IHDAVMKNLEPNTRYYYQVGS 128 (452)
T ss_pred EeeeeecCCCCCceEEEEeCC
Confidence 356788999999999988755
No 86
>KOG3834|consensus
Probab=24.53 E-value=2.2e+02 Score=25.51 Aligned_cols=61 Identities=18% Similarity=0.253 Sum_probs=41.1
Q ss_pred EEeCCEEEEEeeCCC---CCCcccEEEEEEECCCC--CCCeEEeecCCCCceEEEcCCCCCcEEEEEE
Q psy4956 100 VRSGKTAQIFWSPPI---SGKYSGFKLKVISLSEK--TPPRIIGFTENPPAGYSLKDLTPGGSYQVQL 162 (361)
Q Consensus 100 ~~~~~~v~l~W~~p~---~~~~~~y~v~~~~~~~~--~~~~~~~~~~~~~~~~~l~~L~p~~~Y~v~v 162 (361)
+.....++..|-.+. .|.+.++.|.|+..... ..+..+.+.... -..+-+|.|++.|.+-+
T Consensus 72 n~kt~~~R~v~I~ps~~wggqllGvsvrFcsf~~A~~~vwHvl~V~p~S--PaalAgl~~~~DYivG~ 137 (462)
T KOG3834|consen 72 NSKTQEVRIVEIVPSNNWGGQLLGVSVRFCSFDGAVESVWHVLSVEPNS--PAALAGLRPYTDYIVGI 137 (462)
T ss_pred ecccceeEEEEecccccccccccceEEEeccCccchhheeeeeecCCCC--HHHhcccccccceEecc
Confidence 334456777787766 34578999998775432 234445554443 57788999999999877
No 87
>cd05894 Ig_C5_MyBP-C C5 immunoglobulin (Ig) domain of cardiac myosin binding protein C (MyBP-C). Ig_C5_MyBP_C : the C5 immunoglobulin (Ig) domain of cardiac myosin binding protein C (MyBP-C). MyBP_C consists of repeated domains, Ig and fibronectin type 3, and various linkers. Three isoforms of MYBP_C exist and are included in this group: cardiac(c), and fast and slow skeletal muscle (s) MyBP_C. cMYBP_C has insertions between and inside domains and an additional cardiac-specific Ig domain at the N-terminus. For cMYBP_C an interaction has been demonstrated between this C5 domain and the Ig C8 domain.
Probab=24.49 E-value=2.1e+02 Score=18.67 Aligned_cols=71 Identities=11% Similarity=0.017 Sum_probs=41.1
Q ss_pred EEEeCCCceEEEEecCCCCCCCcceEEEEEEcCCCCC---CCCeEEeecCccceEEecCCCCCceEEEEEEEecCCcCc
Q psy4956 3 IIVHGVHGANLVIKIPENLSSDNSTYRLDYIPAHGHP---PPNTTYVSRDIKDNIEFSEGLPGTKYDFYLYYTNSTVHD 78 (361)
Q Consensus 3 ~~~~~~~~~~l~~~~p~~~~~~~~~Y~v~~~~~~~~~---~~~~~~~~~~~~~~~~i~~L~p~t~Y~i~V~a~~~~~~~ 78 (361)
+.+.....+.|.+..-+.+.. .|.|...+... .....+...++...+.|.++.+...-.+.+.|.|..|..
T Consensus 5 ~~v~~G~~v~l~c~~~G~P~P-----~v~W~k~~~~i~~~~~r~~~~~~~~~~~L~I~~~~~~D~G~Y~c~a~N~~G~~ 78 (86)
T cd05894 5 IVVVAGNKLRLDVPISGEPAP-----TVTWSRGDKAFTETEGRVRVESYKDLSSFVIEGAEREDEGVYTITVTNPVGED 78 (86)
T ss_pred EEEEcCCEEEEEeeEeecCCC-----eEEEEECCEECccCCCeEEEEEcCCeEEEEECCCccCcCEEEEEEEEeCCCcE
Confidence 445555556666665545443 45555422111 112334333345789999999888877777888877443
No 88
>PF08329 ChitinaseA_N: Chitinase A, N-terminal domain; InterPro: IPR013540 This domain is found in a number of bacterial chitinases and similar viral proteins. It is organised into a fibronectin III module domain-like fold, comprising only beta strands. Its function is not known, but it may be involved in interaction with the enzyme substrate, chitin [, ]. It is separated by a hinge region from the catalytic domain (IPR001223 from INTERPRO); this hinge region is probably mobile, allowing the N-terminal domain to have different relative positions in solution []. ; GO: 0004568 chitinase activity; PDB: 2WLY_A 1EDQ_A 2WM0_A 1X6N_A 1NH6_A 2WK2_A 1EHN_A 2WLZ_A 1EIB_A 1FFR_A ....
Probab=22.82 E-value=3.1e+02 Score=20.10 Aligned_cols=62 Identities=11% Similarity=0.095 Sum_probs=31.3
Q ss_pred CCceEEEEecCCCCCCCcceEEEEEEcCCCCCCCCeEEeecCccceEEecCCCCCceEEEEEEEecCCc
Q psy4956 8 VHGANLVIKIPENLSSDNSTYRLDYIPAHGHPPPNTTYVSRDIKDNIEFSEGLPGTKYDFYLYYTNSTV 76 (361)
Q Consensus 8 ~~~~~l~~~~p~~~~~~~~~Y~v~~~~~~~~~~~~~~~~~~~~~~~~~i~~L~p~t~Y~i~V~a~~~~~ 76 (361)
...+++.++|..=.+...+.|+|... |......... . ..+..| ....+.+|+.+|..++..|
T Consensus 35 ~~~v~V~VtwN~WsG~~Gd~~kly~d---G~~V~tG~~~-~--~~~a~~-~~~~gG~y~~~VeLCN~~G 96 (133)
T PF08329_consen 35 NDQVDVSVTWNVWSGTNGDTAKLYFD---GVLVWTGPSP-Q--QKSATF-TVTKGGRYQMQVELCNADG 96 (133)
T ss_dssp -SSEEEEEEEE-SSS---SEEEEEET---TEEEEEEE---S--EEEEEE-EE-S-EEEEEEEEEEETTE
T ss_pred cCceEEEEEEEEecCCCCCEEEEEEC---CEEEEeCCCc-c--CceEEE-EecCCCEEEEEEEEECCCC
Confidence 55667777776655667777887763 2211111111 1 223333 3456777999999988774
No 89
>PF10333 Pga1: GPI-Mannosyltransferase II co-activator; InterPro: IPR019433 Pga1 is found only in yeasts and not in mammals. It localises in the ER as a glycosylated integral membrane protein. It binds to the GPI-mannosyltransferase II subunit of the GPI and it is responsible for the second mannose addition to GPI precursors. The GPI-anchoring complex is a glycolipid that functions as a membrane anchor for many cell-surface proteins [].
Probab=22.43 E-value=1.1e+02 Score=23.74 Aligned_cols=22 Identities=36% Similarity=0.658 Sum_probs=17.7
Q ss_pred CCCceeEeccCCCCCceEEEEEE
Q psy4956 333 DEPTQCDMSEGLEPGRTYQVLVK 355 (361)
Q Consensus 333 ~~~~~~~~~~~L~p~t~Y~v~V~ 355 (361)
+....+.+.+ |++|..|.|+++
T Consensus 63 ~~t~~V~L~n-l~~~e~y~vKiC 84 (180)
T PF10333_consen 63 GSTTYVELNN-LQPGETYQVKIC 84 (180)
T ss_pred CceEEEEecc-CCCCCeEEEEEE
Confidence 4556777775 999999999986
No 90
>PF14054 DUF4249: Domain of unknown function (DUF4249)
Probab=21.34 E-value=5e+02 Score=21.85 Aligned_cols=62 Identities=23% Similarity=0.382 Sum_probs=33.5
Q ss_pred CCCCCceEEEEEEEecCCcCccceeeEEEecCCCCCC--ccEEEEEe-----CCE---EEEEeeCCCCCCcccEEEEEE
Q psy4956 58 EGLPGTKYDFYLYYTNSTVHDWLTWTASITTPPDPPT--NLSVNVRS-----GKT---AQIFWSPPISGKYSGFKLKVI 126 (361)
Q Consensus 58 ~L~p~t~Y~i~V~a~~~~~~~~~~~~~~~~t~~~~p~--~l~~~~~~-----~~~---v~l~W~~p~~~~~~~y~v~~~ 126 (361)
.+.+|.+|.++|....+. ...+. .+.|.+|. .+...... ... +.+.|+.|. +.-..|++.+.
T Consensus 95 ~~~~G~~Y~L~V~~~~~~---~~sa~---~~vp~~~~i~~v~~~~~~~~~~~~~~~~~i~~~~~D~~-~~~nyY~~~~~ 166 (298)
T PF14054_consen 95 RGRPGRTYRLEVETPGGK---TYSAE---TTVPPPPPIDSVSYEFKKIGDGDEGEYYRISITFQDPP-GEDNYYRWRVE 166 (298)
T ss_pred cccCCCEEEEEEEECCCC---EEEEE---EEECCCCceeEEEEEEecCCCCcccceeEEEEEEeCCC-CCceEEEEEEE
Confidence 578999999999875222 22222 22333332 22222111 112 788887764 44556777765
No 91
>PF04151 PPC: Bacterial pre-peptidase C-terminal domain; InterPro: IPR007280 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. This domain is normally found at the C terminus of secreted archaeal and bacterial peptidases, the majority of which belong to MEROPS peptidase families M4 (vibriolysin, IPR001570 from INTERPRO), M9A amd M9B (microbial collangenase, IPR002169 from INTERPRO), M28 (aminopeptidase Ap1, IPR007484 from INTERPRO) and S8 (subtilisin family peptidases, IPR000209 from INTERPRO).; GO: 0008233 peptidase activity, 0006508 proteolysis; PDB: 4DY5_B 4DXZ_A 4DY3_B 3JQW_A 3JQX_C 1NQJ_B 1NQD_A 2O8O_A 1WMF_A 1WME_A ....
Probab=21.15 E-value=2.2e+02 Score=17.65 Aligned_cols=20 Identities=30% Similarity=0.549 Sum_probs=13.1
Q ss_pred cceEEecCCCCCceEEEEEEE
Q psy4956 51 KDNIEFSEGLPGTKYDFYLYY 71 (361)
Q Consensus 51 ~~~~~i~~L~p~t~Y~i~V~a 71 (361)
...+.+..+.+|+ |.|+|.+
T Consensus 51 ~~~i~~~~~~~Gt-Yyi~V~~ 70 (70)
T PF04151_consen 51 DESITFTAPAAGT-YYIRVYG 70 (70)
T ss_dssp EEEEEEEESSSEE-EEEEEE-
T ss_pred ccEEEEEcCCCEE-EEEEEEC
Confidence 3556666677776 8888863
No 92
>TIGR03769 P_ac_wall_RPT actinobacterial surface-anchored protein domain. This model describes a repeat domain that one to three times in Actinobacterial proteins, some of which have LPXTG-type sortase recognition motifs for covalent attachment to the Gram-positive cell wall. Where it occurs with duplication in an LPXTG-anchored protein, it tends to be adjacent to the substrate-binding protein of the gene trio of an ABC transporter system, where that substrate-binding protein has a single copy of this same domain. This arrangement suggests a substrate-binding relay system, with the LPXTG protein acting as a substrate receptor.
Probab=21.03 E-value=1.2e+02 Score=16.84 Aligned_cols=13 Identities=38% Similarity=0.841 Sum_probs=10.4
Q ss_pred CCCceEEEEEEEEe
Q psy4956 345 EPGRTYQVLVKTVS 358 (361)
Q Consensus 345 ~p~t~Y~v~V~a~~ 358 (361)
+|| .|.+.++|.-
T Consensus 11 ~PG-~Y~l~~~a~~ 23 (41)
T TIGR03769 11 KPG-TYTLTVQATA 23 (41)
T ss_pred CCe-EEEEEEEEEE
Confidence 677 8999988853
No 93
>PF00907 T-box: T-box; InterPro: IPR001699 Transcription factors of the T-box family are required both for early cell-fate decisions, such as those necessary for formation of the basic vertebrate body plan, and for differentiation and organogenesis []. The T-box is defined as the minimal region within the T-box protein that is both necessary and sufficient for sequence-specific DNA binding, all members of the family so far examined bind to the DNA consensus sequence TCACACCT. The T-box is a relatively large DNA-binding domain, generally comprising about a third of the entire protein (17-26 kDa). These genes were uncovered on the basis of similarity to the DNA binding domain [] of Mus musculus (Mouse) Brachyury (T) gene product, which similarity is the defining feature of the family. The Brachyury gene is named for its phenotype, which was identified 70 years ago as a mutant mouse strain with a short blunted tail. The gene, and its paralogues, have become a well-studied model for the family, and hence much of what is known about the T-box family is derived from the murine Brachyury gene. Consistent with its nuclear location, Brachyury protein has a sequence-specific DNA-binding activity and can act as a transcriptional regulator []. Homozygous mutants for the gene undergo extensive developmental anomalies, thus rendering the mutation lethal []. The postulated role of Brachyury is as a transcription factor, regulating the specification and differentiation of posterior mesoderm during gastrulation in a dose-dependent manner []. T-box proteins tend to be expressed in specific organs or cell types, especially during development, and they are generally required for the development of those tissues, for example, Brachyury is expressed in posterior mesoderm and in the developing notochord, and it is required for the formation of these cells in mice []. ; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus; PDB: 1H6F_B 4A04_A 1XBR_B 2X6V_B 2X6U_A.
Probab=20.57 E-value=1.5e+02 Score=23.12 Aligned_cols=23 Identities=26% Similarity=0.400 Sum_probs=19.3
Q ss_pred eEEEcCCCCCcEEEEEEEEEeCC
Q psy4956 146 GYSLKDLTPGGSYQVQLFSVYDS 168 (361)
Q Consensus 146 ~~~l~~L~p~~~Y~v~v~a~~~~ 168 (361)
.+.+.||.|...|.+.+......
T Consensus 34 ~y~vsGL~p~~~Y~i~l~~~~~d 56 (184)
T PF00907_consen 34 EYSVSGLDPDSLYSISLHFERVD 56 (184)
T ss_dssp EEEEESS-TTSEEEEEEEEEESC
T ss_pred EEEecCCCCCcceEEEEEEEEec
Confidence 89999999999999999887643
No 94
>TIGR03000 plancto_dom_1 Planctomycetes uncharacterized domain TIGR03000. Domains described by this model are found, so far, only in the Planctomycetes (Pirellula sp. strain 1 and Gemmata obscuriglobus), in up to six proteins per genome, and may be duplicated within a protein. The function is unknown.
Probab=20.14 E-value=2.6e+02 Score=18.10 Aligned_cols=23 Identities=9% Similarity=0.118 Sum_probs=19.9
Q ss_pred cceEEecCCCCCceEEEEEEEec
Q psy4956 51 KDNIEFSEGLPGTKYDFYLYYTN 73 (361)
Q Consensus 51 ~~~~~i~~L~p~t~Y~i~V~a~~ 73 (361)
..+|.=.+|.+|..|.++|++.-
T Consensus 28 ~R~F~T~~L~~G~~y~Y~v~a~~ 50 (75)
T TIGR03000 28 VRTFTTPPLEAGKEYEYTVTAEY 50 (75)
T ss_pred EEEEECCCCCCCCEEEEEEEEEE
Confidence 67888889999999999999843
Done!