Query psy17638
Match_columns 160
No_of_seqs 132 out of 1269
Neff 11.3
Searched_HMMs 46136
Date Fri Aug 16 19:09:14 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy17638.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/17638hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG4221|consensus 99.8 6E-19 1.3E-23 133.6 14.4 150 2-160 598-756 (1381)
2 KOG4221|consensus 99.7 2.4E-16 5.3E-21 119.8 15.1 142 2-160 507-655 (1381)
3 KOG3513|consensus 99.7 1.5E-15 3.3E-20 115.1 15.5 150 1-160 801-961 (1051)
4 KOG3513|consensus 99.6 2.7E-14 5.8E-19 108.6 13.4 137 16-160 613-756 (1051)
5 KOG0196|consensus 99.6 1E-13 2.2E-18 102.5 12.9 103 9-117 434-537 (996)
6 PF00041 fn3: Fibronectin type 99.5 3E-13 6.4E-18 75.9 9.5 83 20-107 2-84 (85)
7 KOG0196|consensus 99.4 1.4E-11 3E-16 91.5 14.3 137 17-160 331-480 (996)
8 cd00063 FN3 Fibronectin type 3 99.0 3.9E-08 8.4E-13 55.2 11.2 84 20-108 3-86 (93)
9 KOG4258|consensus 98.8 1.6E-08 3.5E-13 76.0 6.9 53 6-58 598-651 (1025)
10 smart00060 FN3 Fibronectin typ 98.7 9.4E-07 2E-11 48.1 10.5 79 21-104 4-82 (83)
11 KOG4258|consensus 98.5 2E-06 4.4E-11 65.2 11.1 132 23-160 493-648 (1025)
12 KOG4222|consensus 98.5 5.2E-07 1.1E-11 70.1 8.1 126 28-158 652-786 (1281)
13 KOG4802|consensus 98.4 8.5E-06 1.9E-10 57.1 9.7 82 78-160 208-292 (516)
14 KOG4222|consensus 98.2 9.7E-06 2.1E-10 63.4 8.4 101 3-108 729-836 (1281)
15 KOG4152|consensus 97.7 0.00017 3.8E-09 52.5 6.4 71 83-160 651-722 (830)
16 KOG4802|consensus 97.7 0.00038 8.3E-09 49.2 7.8 103 2-107 236-342 (516)
17 KOG4367|consensus 97.2 0.00054 1.2E-08 48.8 3.9 80 29-115 450-530 (699)
18 PF01108 Tissue_fac: Tissue fa 96.7 0.018 3.9E-07 33.6 6.8 81 17-103 21-103 (107)
19 KOG3632|consensus 96.6 0.0037 8.1E-08 49.1 4.6 119 10-149 664-794 (1335)
20 PF09294 Interfer-bind: Interf 95.8 0.094 2E-06 30.3 6.7 70 23-100 7-88 (106)
21 PF00041 fn3: Fibronectin type 95.3 0.027 5.8E-07 30.8 3.1 24 137-160 12-37 (85)
22 PF10179 DUF2369: Uncharacteri 95.0 0.48 1E-05 33.0 9.0 80 78-158 127-208 (300)
23 PF09067 EpoR_lig-bind: Erythr 94.2 0.62 1.3E-05 27.1 7.0 84 17-104 7-95 (104)
24 PF07495 Y_Y_Y: Y_Y_Y domain; 93.1 0.66 1.4E-05 24.1 6.5 27 80-107 30-56 (66)
25 KOG4152|consensus 90.8 1.2 2.7E-05 33.5 6.1 106 1-108 670-789 (830)
26 PLN02533 probable purple acid 89.3 4.4 9.4E-05 30.0 8.1 77 16-97 39-121 (427)
27 PF10179 DUF2369: Uncharacteri 88.0 7.3 0.00016 27.4 12.4 30 79-108 260-289 (300)
28 TIGR00868 hCaCC calcium-activa 87.7 6.5 0.00014 31.9 8.5 31 130-160 760-794 (863)
29 KOG4367|consensus 84.8 1.9 4E-05 31.7 4.0 37 121-159 437-475 (699)
30 PF01108 Tissue_fac: Tissue fa 83.0 2.4 5.3E-05 24.6 3.4 31 130-160 26-57 (107)
31 PF11344 DUF3146: Protein of u 82.8 1 2.3E-05 24.2 1.6 15 85-99 66-80 (80)
32 KOG3632|consensus 80.2 4.1 8.8E-05 33.3 4.5 112 21-149 581-696 (1335)
33 KOG4228|consensus 78.0 4.4 9.6E-05 33.4 4.2 91 2-105 151-242 (1087)
34 COG4733 Phage-related protein, 73.5 16 0.00035 29.5 6.0 68 76-149 657-726 (952)
35 KOG1225|consensus 61.7 20 0.00042 27.5 4.4 118 24-160 372-489 (525)
36 COG3401 Fibronectin type 3 dom 60.6 61 0.0013 23.4 8.9 106 31-149 83-189 (343)
37 KOG3834|consensus 55.8 40 0.00086 25.1 4.9 69 23-96 67-137 (462)
38 PF04775 Bile_Hydr_Trans: Acyl 54.7 30 0.00064 20.9 3.7 25 79-103 5-29 (126)
39 TIGR00868 hCaCC calcium-activa 54.1 35 0.00076 28.1 4.9 41 17-58 755-797 (863)
40 PF00907 T-box: T-box; InterP 51.6 23 0.00051 22.8 3.1 25 78-102 32-56 (184)
41 PF10342 GPI-anchored: Ser-Thr 48.3 46 0.001 18.3 7.5 65 31-98 12-78 (93)
42 PF14292 SusE: SusE outer memb 46.7 61 0.0013 19.2 5.5 30 30-59 43-73 (122)
43 PF09423 PhoD: PhoD-like phosp 46.2 38 0.00081 25.4 3.9 21 79-99 64-84 (453)
44 PF07353 Uroplakin_II: Uroplak 46.1 77 0.0017 20.2 6.0 25 77-101 100-124 (184)
45 KOG4806|consensus 44.6 1.2E+02 0.0027 22.2 7.1 30 79-108 410-439 (454)
46 PF09240 IL6Ra-bind: Interleuk 44.6 59 0.0013 18.4 8.7 84 21-107 2-89 (99)
47 PF11811 DUF3331: Domain of un 42.7 58 0.0013 18.7 3.4 22 21-42 15-37 (96)
48 PF10333 Pga1: GPI-Mannosyltra 40.7 54 0.0012 21.3 3.4 28 75-102 62-89 (180)
49 PF13750 Big_3_3: Bacterial Ig 37.6 89 0.0019 19.7 4.0 33 83-115 116-148 (158)
50 KOG0613|consensus 37.0 2.9E+02 0.0062 24.2 7.9 106 29-149 151-257 (1205)
51 cd02848 Chitinase_N_term Chiti 36.1 94 0.002 18.3 7.8 23 86-108 76-98 (106)
52 PF04734 Ceramidase_alk: Neutr 35.5 51 0.0011 26.3 3.2 34 127-160 617-650 (674)
53 PF14054 DUF4249: Domain of un 33.5 1.6E+02 0.0036 20.4 6.6 15 85-99 95-109 (298)
54 KOG4806|consensus 30.7 90 0.002 22.8 3.5 29 76-104 167-195 (454)
55 cd00182 TBOX T-box DNA binding 29.1 99 0.0021 20.3 3.3 24 78-101 34-57 (188)
56 PHA02579 7 baseplate wedge sub 28.2 3.5E+02 0.0076 22.5 7.2 76 22-101 9-87 (1030)
57 PF08329 ChitinaseA_N: Chitina 28.2 90 0.002 19.2 2.8 26 86-111 79-104 (133)
58 cd08544 Reeler Reeler, the N-t 27.4 1.1E+02 0.0023 18.5 3.2 19 30-48 94-112 (135)
59 smart00425 TBOX Domain first f 26.9 1.1E+02 0.0025 20.0 3.3 23 78-100 33-55 (190)
60 PF13205 Big_5: Bacterial Ig-l 26.3 71 0.0015 18.0 2.2 16 84-99 70-85 (107)
61 PF14686 fn3_3: Polysaccharide 25.2 71 0.0015 18.2 2.0 21 77-98 48-68 (95)
62 PRK14438 acylphosphatase; Prov 25.2 1.1E+02 0.0024 17.2 2.7 21 139-160 71-91 (91)
63 PF02014 Reeler: Reeler domain 24.1 1.1E+02 0.0025 18.3 2.8 18 31-48 99-116 (132)
64 PF05345 He_PIG: Putative Ig d 21.9 1.2E+02 0.0025 14.8 2.4 13 91-103 37-49 (49)
65 cd02856 Glycogen_debranching_e 21.7 1.7E+02 0.0036 16.7 3.1 20 79-98 47-66 (103)
66 PRK14450 acylphosphatase; Prov 20.4 1.6E+02 0.0034 16.5 2.7 21 139-160 71-91 (91)
No 1
>KOG4221|consensus
Probab=99.82 E-value=6e-19 Score=133.55 Aligned_cols=150 Identities=25% Similarity=0.422 Sum_probs=116.5
Q ss_pred CCCCCcc-eEEEcCCCCCCCCCe-eEEEecCCcEEEEEecCCCC--CCCceeeEEEEEEEeCCcccCCceeeceeeeCCc
Q psy17638 2 YLSPPLW-LTLNLEALVPGTVQS-LEAIPLGSSAFYLKWVKPEQ--PNGVLMGYKIKYQSVKGTKVGPLLERLPYISDPT 77 (160)
Q Consensus 2 g~g~~s~-~~~~t~~~~P~~p~~-~~~~~~~~~~~~l~W~~p~~--~~g~i~~y~i~~~~~~~~~~~~~~~~~~~~~~~~ 77 (160)
|.|..|. +.|+|.++.|++|++ +++.....+++.|.|.+|.. .+|.+.+|+|+|......... +..+...+
T Consensus 598 G~g~sS~~i~V~Tlsd~PsaPP~Nl~lev~sStsVrVsW~pP~~~t~ng~itgYkIRy~~~~~~~~~-----~~t~v~~n 672 (1381)
T KOG4221|consen 598 GSGVSSADITVRTLSDVPSAPPQNLSLEVVSSTSVRVSWLPPPSETQNGQITGYKIRYRKLSREDEV-----NETVVKGN 672 (1381)
T ss_pred CCCCCCCceEEEeccCCCCCCCcceEEEecCCCeEEEEccCCCcccccceEEEEEEEecccCccccc-----ceeecccc
Confidence 7888998 999999999976665 88888999999999999853 488999999999876433221 12244557
Q ss_pred ccEEEEcCCCCCCEEEEEEEEEeCCCCCCcc-eeEEeecCCCC--CCCCCCccceeeecccCccCeEEEEEecCCC--CC
Q psy17638 78 TTSAKLAGLEPSTKYRIHLAGYTKAGDGADY-FIEHRTRGTDN--VAPGKPNFKLVRSGTENGYGAFKVIWEPNPE--RP 152 (160)
Q Consensus 78 ~~~~~i~~l~p~~~y~~~v~a~~~~g~g~~~-~~~~~t~~~~~--~~p~~p~~~~~~~~~~~~~~~~~l~W~~p~~--~~ 152 (160)
...+++.+|+|++.|.|+|.|.+..|.|+++ +....|..... .+|.+|.. +.+....++|.|+|.||.+ ..
T Consensus 673 ~~~~l~~~Lep~T~Y~vrIsa~t~nGtGpaS~w~~aeT~~~d~~e~vp~~ps~----l~~~~g~~si~vsW~Pp~~~~~~ 748 (1381)
T KOG4221|consen 673 TTQYLFNGLEPNTQYRVRISAMTVNGTGPASEWVSAETPESDLDERVPGKPSE----LHVHPGSNSIVVSWTPPPHPNIV 748 (1381)
T ss_pred hhhhHhhcCCCCceEEEEEEEeccCCCCCcccceeccCccccccccCCCCCce----eeeccCceeEEEEeCCCCChhhh
Confidence 7889999999999999999999999999998 56666654432 23444432 4444578999999999985 34
Q ss_pred CceeEEeC
Q psy17638 153 GSHFFVKY 160 (160)
Q Consensus 153 ~~~y~v~y 160 (160)
+.+|.|-|
T Consensus 749 vrgY~ig~ 756 (1381)
T KOG4221|consen 749 VRGYKIGY 756 (1381)
T ss_pred hcceEEee
Confidence 56888876
No 2
>KOG4221|consensus
Probab=99.73 E-value=2.4e-16 Score=119.79 Aligned_cols=142 Identities=25% Similarity=0.384 Sum_probs=108.8
Q ss_pred CCCCCcc-eEEEcCCCCCCCCCeeEEEecCCcEEEEEecCCCCCCCceeeEEEEEEEeCCcccCCceeeceeeeCCcccE
Q psy17638 2 YLSPPLW-LTLNLEALVPGTVQSLEAIPLGSSAFYLKWVKPEQPNGVLMGYKIKYQSVKGTKVGPLLERLPYISDPTTTS 80 (160)
Q Consensus 2 g~g~~s~-~~~~t~~~~P~~p~~~~~~~~~~~~~~l~W~~p~~~~g~i~~y~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (160)
|.|..|. +.|.|..+.| ..+.+...+...+.+.|.+|...++.|++|++.|...+...+ .....+..+
T Consensus 507 g~g~sS~pLkV~t~pEgp---~~~~a~ats~~ti~v~WepP~~~n~~I~~yk~~ys~~~~~~~--------~~~~~n~~e 575 (1381)
T KOG4221|consen 507 GSGESSAPLKVTTQPEGP---VQLQAYATSPTTILVTWEPPPFGNGPITGYKLFYSEDDTGKE--------LRVENNATE 575 (1381)
T ss_pred cCCccCCceEEecCCCCC---ccccccccCcceEEEEecCCCCCCCCceEEEEEEEcCCCCce--------EEEecCccE
Confidence 6777777 8888765532 237777788899999999999889999999999876522221 133567789
Q ss_pred EEEcCCCCCCEEEEEEEEEeCCCCCCcc-eeEEeecCCCCCCCCCCccceeeeccc-CccCeEEEEEecCCC----CCCc
Q psy17638 81 AKLAGLEPSTKYRIHLAGYTKAGDGADY-FIEHRTRGTDNVAPGKPNFKLVRSGTE-NGYGAFKVIWEPNPE----RPGS 154 (160)
Q Consensus 81 ~~i~~l~p~~~y~~~v~a~~~~g~g~~~-~~~~~t~~~~~~~p~~p~~~~~~~~~~-~~~~~~~l~W~~p~~----~~~~ 154 (160)
++|.+|++++.|.|+|.|+|..|.|..+ .+.+.|..+ .|++|+. |++.+ .+.++|+|+|.+|.. +-+.
T Consensus 576 ~ti~gL~k~TeY~~~vvA~N~~G~g~sS~~i~V~Tlsd---~PsaPP~---Nl~lev~sStsVrVsW~pP~~~t~ng~it 649 (1381)
T KOG4221|consen 576 YTINGLEKYTEYSIRVVAYNSAGSGVSSADITVRTLSD---VPSAPPQ---NLSLEVVSSTSVRVSWLPPPSETQNGQIT 649 (1381)
T ss_pred EEeecCCCccceEEEEEEecCCCCCCCCCceEEEeccC---CCCCCCc---ceEEEecCCCeEEEEccCCCcccccceEE
Confidence 9999999999999999999999999887 477777665 4555553 24443 388999999999975 4456
Q ss_pred eeEEeC
Q psy17638 155 HFFVKY 160 (160)
Q Consensus 155 ~y~v~y 160 (160)
+|.|+|
T Consensus 650 gYkIRy 655 (1381)
T KOG4221|consen 650 GYKIRY 655 (1381)
T ss_pred EEEEEe
Confidence 888876
No 3
>KOG3513|consensus
Probab=99.70 E-value=1.5e-15 Score=115.14 Aligned_cols=150 Identities=28% Similarity=0.333 Sum_probs=112.1
Q ss_pred CCCCCCcc-eEEEcCCCCC-CCCCeeEEEecCCcEEEEEecCCCCCCCceeeEEEEEEEeCCcccCCceeeceeeeCCcc
Q psy17638 1 MYLSPPLW-LTLNLEALVP-GTVQSLEAIPLGSSAFYLKWVKPEQPNGVLMGYKIKYQSVKGTKVGPLLERLPYISDPTT 78 (160)
Q Consensus 1 ~g~g~~s~-~~~~t~~~~P-~~p~~~~~~~~~~~~~~l~W~~p~~~~g~i~~y~i~~~~~~~~~~~~~~~~~~~~~~~~~ 78 (160)
+|+||.|. +.+.+.|+.| .+|..+.+...+.+.+.|.|++|...+|.+++|+|.|+...... +... ......+.
T Consensus 801 ~GeGp~s~~~v~~S~Ed~P~~ap~~~~~~~~s~s~~~v~W~~~~~~nG~l~gY~v~Y~~~~~~~-~~~~---~~~i~~~~ 876 (1051)
T KOG3513|consen 801 QGEGPESQVTVGYSGEDEPPVAPTKLSAKPLSSSEVNLSWKPPLWDNGKLTGYEVKYWKINEKE-GSLS---RVQIAGNR 876 (1051)
T ss_pred CCCCCCCceEEEEcCCCCCCCCCccceeecccCceEEEEecCcCccCCccceeEEEEEEcCCCc-cccc---ceeecCCc
Confidence 58999999 8899988876 77888888899999999999999877899999999999874432 1111 11335788
Q ss_pred cEEEEcCCCCCCEEEEEEEEEeCCCCCCcce-eEEeecCCCCCCCCC----CccceeeecccCccCeEEEEEecCCC---
Q psy17638 79 TSAKLAGLEPSTKYRIHLAGYTKAGDGADYF-IEHRTRGTDNVAPGK----PNFKLVRSGTENGYGAFKVIWEPNPE--- 150 (160)
Q Consensus 79 ~~~~i~~l~p~~~y~~~v~a~~~~g~g~~~~-~~~~t~~~~~~~p~~----p~~~~~~~~~~~~~~~~~l~W~~p~~--- 150 (160)
+...|.+|.+.+.|.|.|+|+|.+|.|+.+. ... +...++|+. |.. +..-......+.|.|..-..
T Consensus 877 ~~~~ltgL~~~T~Y~~~vrA~nsaG~Gp~s~~~~~---tt~k~pPs~~~~~p~g---~~~~~~~~~~~~l~w~~v~~~~n 950 (1051)
T KOG3513|consen 877 TSWRLTGLEPNTKYRFYVRAYTSAGGGPASSEENV---TTKKAPPSQVDIAPPG---NFIWKFSASILLLLWLLVSAFEN 950 (1051)
T ss_pred ceEeeeCCCCCceEEEEEEEecCCCCCCCccceec---cccCCCCcccccCCCc---ceEEeeeeeEEEEEEeeEEEEee
Confidence 8999999999999999999999999998873 333 233335653 332 12223366788899987643
Q ss_pred -CCCceeEEeC
Q psy17638 151 -RPGSHFFVKY 160 (160)
Q Consensus 151 -~~~~~y~v~y 160 (160)
..+.+|-|.|
T Consensus 951 es~v~gYkV~~ 961 (1051)
T KOG3513|consen 951 ESEVGGYKVLY 961 (1051)
T ss_pred cccCcceEEEE
Confidence 4466777664
No 4
>KOG3513|consensus
Probab=99.61 E-value=2.7e-14 Score=108.56 Aligned_cols=137 Identities=20% Similarity=0.293 Sum_probs=100.3
Q ss_pred CCCCCCCeeEEEecCCcEEEEEecCCCCCCCceeeEEEEEEEeCCcccCCceeeceeeeCCcccEEEEcCCCCCCEEEEE
Q psy17638 16 LVPGTVQSLEAIPLGSSAFYLKWVKPEQPNGVLMGYKIKYQSVKGTKVGPLLERLPYISDPTTTSAKLAGLEPSTKYRIH 95 (160)
Q Consensus 16 ~~P~~p~~~~~~~~~~~~~~l~W~~p~~~~g~i~~y~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~l~p~~~y~~~ 95 (160)
++|++|.++++...+.+++.|+|++..+.+++|..|.|+++......|...... ......+ ...++.+|.|+..|+||
T Consensus 613 gpPgpP~~v~~~~i~~t~~~lsW~~g~dn~SpI~~Y~iq~rt~~~~~W~~v~~v-p~~~~~~-~sa~vv~L~Pwv~YeFR 690 (1051)
T KOG3513|consen 613 GPPGPPPDVHVDDISDTTARLSWSPGSDNNSPIEKYTIQFRTPFPGKWKAVTTV-PGNITGD-ESATVVNLSPWVEYEFR 690 (1051)
T ss_pred cCCCCCCceeEeeeccceEEEEeecCCCCCCCceEEeEEecCCCCCcceEeeEC-CCcccCc-cceeEEccCCCcceEEE
Confidence 567788899999999999999999998888899999999987755555432211 0111112 35778999999999999
Q ss_pred EEEEeCCCCCCcce--eEEeecCCCCCCCCCCccceeeecccC-ccCeEEEEEecCCC----CCCceeEEeC
Q psy17638 96 LAGYTKAGDGADYF--IEHRTRGTDNVAPGKPNFKLVRSGTEN-GYGAFKVIWEPNPE----RPGSHFFVKY 160 (160)
Q Consensus 96 v~a~~~~g~g~~~~--~~~~t~~~~~~~p~~p~~~~~~~~~~~-~~~~~~l~W~~p~~----~~~~~y~v~y 160 (160)
|.|.|..|.|.++. ...+|... +|.... .++.... ..+.+.|+|.|-.. +++.+|+|.|
T Consensus 691 V~AvN~iG~gePS~pS~~~rT~ea---~P~~~P---~nv~g~g~~~~eLvItW~Pl~~~~qNG~gfgY~Vsw 756 (1051)
T KOG3513|consen 691 VVAVNSIGIGEPSPPSEKVRTPEA---APSVNP---SNVKGGGGSPTELVITWEPLPEEEQNGPGFGYRVSW 756 (1051)
T ss_pred EEEEcccccCCCCCCccceecCCC---CCccCC---ccccccCCCCceEEEEeccCCHHHccCCCceEEEEE
Confidence 99999999999883 44444332 343322 2344433 66789999999755 8889999976
No 5
>KOG0196|consensus
Probab=99.57 E-value=1e-13 Score=102.47 Aligned_cols=103 Identities=28% Similarity=0.458 Sum_probs=89.9
Q ss_pred eEEEcCCCCCCCCCeeEEEecCCcEEEEEecCCCCCCCceeeEEEEEEEeCCcccCCceeeceeeeCCcccEEEEcCCCC
Q psy17638 9 LTLNLEALVPGTVQSLEAIPLGSSAFYLKWVKPEQPNGVLMGYKIKYQSVKGTKVGPLLERLPYISDPTTTSAKLAGLEP 88 (160)
Q Consensus 9 ~~~~t~~~~P~~p~~~~~~~~~~~~~~l~W~~p~~~~g~i~~y~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~l~p 88 (160)
++++|.+.+|+....++....+.++++|+|..|.+++|.|..|+|+|..+..+.. .+..+..+.+..+|.+|.|
T Consensus 434 vnItt~qa~ps~V~~~r~~~~~~~sitlsW~~p~~png~ildYEvky~ek~~~e~------~~~~~~t~~~~~ti~gL~p 507 (996)
T KOG0196|consen 434 VNITTNQAAPSPVSVLRQVSRTSDSITLSWSEPDQPNGVILDYEVKYYEKDEDER------SYSTLKTKTTTATITGLKP 507 (996)
T ss_pred EEeeccccCCCccceEEEeeeccCceEEecCCCCCCCCcceeEEEEEeecccccc------ceeEEecccceEEeeccCC
Confidence 9999999999999999999999999999999999999999999999998754322 2335567788999999999
Q ss_pred CCEEEEEEEEEeCCCCCCcc-eeEEeecCC
Q psy17638 89 STKYRIHLAGYTKAGDGADY-FIEHRTRGT 117 (160)
Q Consensus 89 ~~~y~~~v~a~~~~g~g~~~-~~~~~t~~~ 117 (160)
++.|.|+|+|++..|.|.++ ...+.|.+.
T Consensus 508 ~t~YvfqVRarT~aG~G~~S~~~~fqT~~~ 537 (996)
T KOG0196|consen 508 GTVYVFQVRARTAAGYGPYSGKHEFQTLPS 537 (996)
T ss_pred CcEEEEEEEEecccCCCCCCCceeeeecCc
Confidence 99999999999999999998 577777654
No 6
>PF00041 fn3: Fibronectin type III domain; InterPro: IPR003961 Fibronectins are multi-domain glycoproteins found in a soluble form in plasma, and in an insoluble form in loose connective tissue and basement membranes []. They contain multiple copies of 3 repeat regions (types I, II and III), which bind to a variety of substances including heparin, collagen, DNA, actin, fibrin and fibronectin receptors on cell surfaces. The wide variety of these substances means that fibronectins are involved in a number of important functions: e.g., wound healing; cell adhesion; blood coagulation; cell differentiation and migration; maintenance of the cellular cytoskeleton; and tumour metastasis []. The role of fibronectin in cell differentiation is demonstrated by the marked reduction in the expression of its gene when neoplastic transformation occurs. Cell attachment has been found to be mediated by the binding of the tetrapeptide RGDS to integrins on the cell surface [], although related sequences can also display cell adhesion activity. Plasma fibronectin occurs as a dimer of 2 different subunits, linked together by 2 disulphide bonds near the C terminus. The difference in the 2 chains occurs in the type III repeat region and is caused by alternative splicing of the mRNA from one gene []. The observation that, in a given protein, an individual repeat of one of the 3 types (e.g., the first FnIII repeat) shows much less similarity to its subsequent tandem repeats within that protein than to its equivalent repeat between fibronectins from other species, has suggested that the repeating structure of fibronectin arose at an early stage of evolution. It also seems to suggest that the structure is subject to high selective pressure []. The fibronectin type III repeat region is an approximately 100 amino acid domain, different tandem repeats of which contain binding sites for DNA, heparin and the cell surface []. The superfamily of sequences believed to contain FnIII repeats represents 45 different families, the majority of which are involved in cell surface binding in some manner, or are receptor protein tyrosine kinases, or cytokine receptors.; GO: 0005515 protein binding; PDB: 1UEM_A 1TDQ_A 1X5I_A 2IC2_B 2IBG_C 2IBB_A 3R8Q_A 2FNB_A 1FNH_A 2EDB_A ....
Probab=99.52 E-value=3e-13 Score=75.92 Aligned_cols=83 Identities=24% Similarity=0.496 Sum_probs=66.0
Q ss_pred CCCeeEEEecCCcEEEEEecCCCCCCCceeeEEEEEEEeCCcccCCceeeceeeeCCcccEEEEcCCCCCCEEEEEEEEE
Q psy17638 20 TVQSLEAIPLGSSAFYLKWVKPEQPNGVLMGYKIKYQSVKGTKVGPLLERLPYISDPTTTSAKLAGLEPSTKYRIHLAGY 99 (160)
Q Consensus 20 ~p~~~~~~~~~~~~~~l~W~~p~~~~g~i~~y~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~l~p~~~y~~~v~a~ 99 (160)
+|.++.+...+.+++.|.|.+|...++.+.+|.|.|........ .. ..........+++.+|.|++.|.|+|+|.
T Consensus 2 ~P~~l~v~~~~~~sv~v~W~~~~~~~~~~~~y~v~~~~~~~~~~--~~---~~~~~~~~~~~~i~~L~p~t~Y~~~v~a~ 76 (85)
T PF00041_consen 2 APENLSVSNISPTSVTVSWKPPSSGNGPITGYRVEYRSVNSTSD--WQ---EVTVPGNETSYTITGLQPGTTYEFRVRAV 76 (85)
T ss_dssp SSEEEEEEEECSSEEEEEEEESSSTSSSESEEEEEEEETTSSSE--EE---EEEEETTSSEEEEESCCTTSEEEEEEEEE
T ss_pred cCcCeEEEECCCCEEEEEEECCCCCCCCeeEEEEEEEeccccee--ee---eeeeeeeeeeeeeccCCCCCEEEEEEEEE
Confidence 57788888889999999999987557789999999987644431 01 11234445589999999999999999999
Q ss_pred eCCCCCCc
Q psy17638 100 TKAGDGAD 107 (160)
Q Consensus 100 ~~~g~g~~ 107 (160)
+..|.|++
T Consensus 77 ~~~g~g~~ 84 (85)
T PF00041_consen 77 NSDGEGPP 84 (85)
T ss_dssp ETTEEEEE
T ss_pred eCCcCcCC
Confidence 99998765
No 7
>KOG0196|consensus
Probab=99.42 E-value=1.4e-11 Score=91.49 Aligned_cols=137 Identities=16% Similarity=0.254 Sum_probs=90.0
Q ss_pred CCCCCCeeEEEecCCcEEEEEecCCCCCCC-ceeeEEEEEEEeCCcc-----cCCceeeceeeeCCcccEEEEcCCCCCC
Q psy17638 17 VPGTVQSLEAIPLGSSAFYLKWVKPEQPNG-VLMGYKIKYQSVKGTK-----VGPLLERLPYISDPTTTSAKLAGLEPST 90 (160)
Q Consensus 17 ~P~~p~~~~~~~~~~~~~~l~W~~p~~~~g-~i~~y~i~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~i~~l~p~~ 90 (160)
+|++|.++.. ..+.+++.|.|.+|.+.+| .-..|.|.|....... ++......+.-.....++.++.+|.+.+
T Consensus 331 PPSaP~nlis-~vn~Ts~~L~W~~P~d~GGR~Di~y~v~Ck~c~~~~~~C~~Cg~~V~f~P~q~gLt~~~V~v~~L~ah~ 409 (996)
T KOG0196|consen 331 PPSAPRNLIS-NVNGTSLILEWSPPADTGGREDITYNVICKKCGGGRGACEPCGDNVRFTPRQRGLTETSVTVSDLLAHT 409 (996)
T ss_pred CCCccceeee-ecccceEEEEecCCcccCCCcceEEEEEeeccCCCCCccccCCCCceECCCCCCcccceEEEecccccc
Confidence 5677877744 5889999999999988777 4567999887764221 1111111111123346789999999999
Q ss_pred EEEEEEEEEeCCCCC-Cc--c--eeEEeecCCCCCCCCCCccceeeecccCccCeEEEEEecCCC--CCCceeEEeC
Q psy17638 91 KYRIHLAGYTKAGDG-AD--Y--FIEHRTRGTDNVAPGKPNFKLVRSGTENGYGAFKVIWEPNPE--RPGSHFFVKY 160 (160)
Q Consensus 91 ~y~~~v~a~~~~g~g-~~--~--~~~~~t~~~~~~~p~~p~~~~~~~~~~~~~~~~~l~W~~p~~--~~~~~y~v~y 160 (160)
.|.|.|.|.|....- +. . .+.+.| ...+|+..+.+ +. .....++|.|+|..|.. +.+..|.|+|
T Consensus 410 ~YTFeV~AvNgVS~lsp~~~~~a~vnItt--~qa~ps~V~~~--r~--~~~~~~sitlsW~~p~~png~ildYEvky 480 (996)
T KOG0196|consen 410 NYTFEVEAVNGVSDLSPFPRQFASVNITT--NQAAPSPVSVL--RQ--VSRTSDSITLSWSEPDQPNGVILDYEVKY 480 (996)
T ss_pred ccEEEEEEeecccccCCCCCcceeEEeec--cccCCCccceE--EE--eeeccCceEEecCCCCCCCCcceeEEEEE
Confidence 999999999987542 22 1 233333 33334444443 12 23478999999999985 5568999887
No 8
>cd00063 FN3 Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
Probab=98.98 E-value=3.9e-08 Score=55.19 Aligned_cols=84 Identities=23% Similarity=0.436 Sum_probs=59.6
Q ss_pred CCCeeEEEecCCcEEEEEecCCCCCCCceeeEEEEEEEeCCcccCCceeeceeeeCCcccEEEEcCCCCCCEEEEEEEEE
Q psy17638 20 TVQSLEAIPLGSSAFYLKWVKPEQPNGVLMGYKIKYQSVKGTKVGPLLERLPYISDPTTTSAKLAGLEPSTKYRIHLAGY 99 (160)
Q Consensus 20 ~p~~~~~~~~~~~~~~l~W~~p~~~~g~i~~y~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~l~p~~~y~~~v~a~ 99 (160)
.|.++.+......++.|.|..+...++.+..|.|.+.......+.. ..........+.+.+|.|++.|.|+|.+.
T Consensus 3 ~p~~~~~~~~~~~~~~v~W~~~~~~~~~~~~y~v~~~~~~~~~~~~-----~~~~~~~~~~~~i~~l~p~~~Y~~~v~a~ 77 (93)
T cd00063 3 PPTNLRVTDVTSTSVTLSWTPPEDDGGPITGYVVEYREKGSGDWKE-----VEVTPGSETSYTLTGLKPGTEYEFRVRAV 77 (93)
T ss_pred CCCCcEEEEecCCEEEEEECCCCCCCCcceeEEEEEeeCCCCCCEE-----eeccCCcccEEEEccccCCCEEEEEEEEE
Confidence 4455666666678999999988654466789999988753222110 00111356789999999999999999999
Q ss_pred eCCCCCCcc
Q psy17638 100 TKAGDGADY 108 (160)
Q Consensus 100 ~~~g~g~~~ 108 (160)
+..|.|..+
T Consensus 78 ~~~~~~~~s 86 (93)
T cd00063 78 NGGGESPPS 86 (93)
T ss_pred CCCccCCCc
Confidence 987777554
No 9
>KOG4258|consensus
Probab=98.81 E-value=1.6e-08 Score=75.97 Aligned_cols=53 Identities=25% Similarity=0.317 Sum_probs=45.5
Q ss_pred Ccc-eEEEcCCCCCCCCCeeEEEecCCcEEEEEecCCCCCCCceeeEEEEEEEe
Q psy17638 6 PLW-LTLNLEALVPGTVQSLEAIPLGSSAFYLKWVKPEQPNGVLMGYKIKYQSV 58 (160)
Q Consensus 6 ~s~-~~~~t~~~~P~~p~~~~~~~~~~~~~~l~W~~p~~~~g~i~~y~i~~~~~ 58 (160)
.|. ..++|..+.|++|..+-......+.+.|.|.+|.+++|.+..|.|.|...
T Consensus 598 ~S~I~YvqT~~~~PspPl~~ls~snsSsqi~l~W~pP~~pNG~lt~Ylv~wer~ 651 (1025)
T KOG4258|consen 598 KSKIGYVQTLPDIPSPPLDVLSKSNSSSQILLKWKPPSQPNGNLTHYLVVWERQ 651 (1025)
T ss_pred ccceEEEEecCCCCCCcchhhhccCcchheeEEecCCCCCCCceeEEEEEEEec
Confidence 456 88999999999998876665566789999999999999999999998764
No 10
>smart00060 FN3 Fibronectin type 3 domain. One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
Probab=98.70 E-value=9.4e-07 Score=48.11 Aligned_cols=79 Identities=25% Similarity=0.421 Sum_probs=49.3
Q ss_pred CCeeEEEecCCcEEEEEecCCCCCCCceeeEEEEEEEeCCcccCCceeeceeeeCCcccEEEEcCCCCCCEEEEEEEEEe
Q psy17638 21 VQSLEAIPLGSSAFYLKWVKPEQPNGVLMGYKIKYQSVKGTKVGPLLERLPYISDPTTTSAKLAGLEPSTKYRIHLAGYT 100 (160)
Q Consensus 21 p~~~~~~~~~~~~~~l~W~~p~~~~g~i~~y~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~l~p~~~y~~~v~a~~ 100 (160)
|..+.+......++.|.|+++..... .+|.+.+............. .........+.+.+|.|++.|.|+|+|.+
T Consensus 4 p~~~~~~~~~~~~~~v~W~~~~~~~~--~~y~~~~~~~~~~~~~~~~~---~~~~~~~~~~~i~~L~~~~~Y~v~v~a~~ 78 (83)
T smart00060 4 PSNLRVTDVTSTSVTLSWEPPPDDGI--TGYIVGYRVEYREEGSSWKE---VNVTPSSTSYTLTGLKPGTEYEFRVRAVN 78 (83)
T ss_pred CCcEEEEEEeCCEEEEEECCCCCCCC--CccEEEEEEEEecCCCccEE---EEecCCccEEEEeCcCCCCEEEEEEEEEc
Confidence 34455655566699999997744332 56777666542211100010 01122256899999999999999999998
Q ss_pred CCCC
Q psy17638 101 KAGD 104 (160)
Q Consensus 101 ~~g~ 104 (160)
..|.
T Consensus 79 ~~g~ 82 (83)
T smart00060 79 GAGE 82 (83)
T ss_pred ccCC
Confidence 7543
No 11
>KOG4258|consensus
Probab=98.54 E-value=2e-06 Score=65.15 Aligned_cols=132 Identities=18% Similarity=0.265 Sum_probs=77.0
Q ss_pred eeEEEecCCcEEEEEecCCCCC-CCceeeEEEEEEEeCCcc----cC--C--ceeeceeeeC--------CcccEEEEcC
Q psy17638 23 SLEAIPLGSSAFYLKWVKPEQP-NGVLMGYKIKYQSVKGTK----VG--P--LLERLPYISD--------PTTTSAKLAG 85 (160)
Q Consensus 23 ~~~~~~~~~~~~~l~W~~p~~~-~g~i~~y~i~~~~~~~~~----~~--~--~~~~~~~~~~--------~~~~~~~i~~ 85 (160)
.+.....+.+++.|+|..-... .-.+.+|.+.|....-.. .+ . ...++...+. .....+.+.+
T Consensus 493 ~~~~~~~~~dsi~lrW~~~~~~d~r~llg~~~~yKEaP~qNvT~~dg~~aCg~~~W~~~~v~~~~~~p~~~~~~~~~l~~ 572 (1025)
T KOG4258|consen 493 QFSSTVTSADSILLRWERYQPPDMRDLLGFLLHYKEAPFQNVTEEDGRDACGSNSWNVVDVDPPDLIPNDGTHPGFLLDG 572 (1025)
T ss_pred eeeeEEeecceeEEEecccCCcchhhhheeeEeeccCCccccceecCccccccCcceEEeccCCcCCCccccccceehhc
Confidence 3344455678999999754322 225788999886532110 00 0 0111100111 1234678899
Q ss_pred CCCCCEEEEEEEEEeCCCCC--Ccc---eeEEeecCCCCCCCCCCccceeeecccCccCeEEEEEecCCC--CCCceeEE
Q psy17638 86 LEPSTKYRIHLAGYTKAGDG--ADY---FIEHRTRGTDNVAPGKPNFKLVRSGTENGYGAFKVIWEPNPE--RPGSHFFV 158 (160)
Q Consensus 86 l~p~~~y~~~v~a~~~~g~g--~~~---~~~~~t~~~~~~~p~~p~~~~~~~~~~~~~~~~~l~W~~p~~--~~~~~y~v 158 (160)
|+|.|+|.+.|++....-.. ... -..++|..+ .|++|.. .+.....++.|.|+|.||.. +.+.+|.|
T Consensus 573 LkP~TqYAvfVkT~t~t~~~~~~~A~S~I~YvqT~~~---~PspPl~---~ls~snsSsqi~l~W~pP~~pNG~lt~Ylv 646 (1025)
T KOG4258|consen 573 LKPWTQYAVFVKTLTVTEAHEAYEAKSKIGYVQTLPD---IPSPPLD---VLSKSNSSSQILLKWKPPSQPNGNLTHYLV 646 (1025)
T ss_pred CCccceeEEEEeeeehhhhccccccccceEEEEecCC---CCCCcch---hhhccCcchheeEEecCCCCCCCceeEEEE
Confidence 99999999999988543322 211 134455444 5666654 24444566689999999975 55678887
Q ss_pred eC
Q psy17638 159 KY 160 (160)
Q Consensus 159 ~y 160 (160)
.|
T Consensus 647 ~w 648 (1025)
T KOG4258|consen 647 VW 648 (1025)
T ss_pred EE
Confidence 64
No 12
>KOG4222|consensus
Probab=98.53 E-value=5.2e-07 Score=70.07 Aligned_cols=126 Identities=24% Similarity=0.332 Sum_probs=81.9
Q ss_pred ecCCcEEEEEecCCCC-CCCceeeEEEEEEEeCCcccCCceeeceeeeCCcccEEEEcCCCCCCEEEEEEEEEeCCCC--
Q psy17638 28 PLGSSAFYLKWVKPEQ-PNGVLMGYKIKYQSVKGTKVGPLLERLPYISDPTTTSAKLAGLEPSTKYRIHLAGYTKAGD-- 104 (160)
Q Consensus 28 ~~~~~~~~l~W~~p~~-~~g~i~~y~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~l~p~~~y~~~v~a~~~~g~-- 104 (160)
..++++|.+.|..+.+ ....|.+|+|.|+...... .......+.......+++.+|.|++.|+|.+..+...|.
T Consensus 652 ~L~asslr~~w~~~kq~~~~~i~g~~I~~r~~~~~~---a~~s~~~v~~~t~~s~v~~nl~p~t~ye~f~~Pf~~~~~s~ 728 (1281)
T KOG4222|consen 652 VLNASSLRLGWTKDKQHGSQYIQGYRISYRSLGSQL---AQWSNAGVTVPTPESVVVPNLKPGTNYEFFVRPFFPHGYSI 728 (1281)
T ss_pred ccchhheeeeeeeecccCcccccceEEEeccCcccc---ccccccceeccCCcceeccccCCCccceeeccCccCCCcce
Confidence 3567899999998864 3457899999999875411 111112233445567889999999999999999988654
Q ss_pred -CCcceeEEeecCCCCCCCCCCccceeeeccc-CccCeEEEEEecCCC----CCCceeEE
Q psy17638 105 -GADYFIEHRTRGTDNVAPGKPNFKLVRSGTE-NGYGAFKVIWEPNPE----RPGSHFFV 158 (160)
Q Consensus 105 -g~~~~~~~~t~~~~~~~p~~p~~~~~~~~~~-~~~~~~~l~W~~p~~----~~~~~y~v 158 (160)
|..+ ...+......+|++|.-.+..+... .+.+++.|+|.+|.. +...+|-|
T Consensus 729 ~g~pS--~sk~alt~e~~PSapp~~~~~~s~~~~n~Ta~~Vsw~~pp~d~~ng~~qg~ki 786 (1281)
T KOG4222|consen 729 QGAPS--NSKTALTLEEPPSAPPQGVQHVSKGSYNGTAGSVSWAPPPADVQNGILQGYKI 786 (1281)
T ss_pred ecCCc--ccccccccccCCCCCCCCccccccccCCCceeeEEecCCcccccCCcccceeE
Confidence 3333 3333333334566665443334433 378899999999954 34445544
No 13
>KOG4802|consensus
Probab=98.35 E-value=8.5e-06 Score=57.14 Aligned_cols=82 Identities=16% Similarity=0.276 Sum_probs=55.3
Q ss_pred ccEEEEcCCCCCCEEEEEEEEEeCCCCCCcc-e-eEEeecCCCCCCCCCCccceeeecccCccCeEEEEEecCCC-CCCc
Q psy17638 78 TTSAKLAGLEPSTKYRIHLAGYTKAGDGADY-F-IEHRTRGTDNVAPGKPNFKLVRSGTENGYGAFKVIWEPNPE-RPGS 154 (160)
Q Consensus 78 ~~~~~i~~l~p~~~y~~~v~a~~~~g~g~~~-~-~~~~t~~~~~~~p~~p~~~~~~~~~~~~~~~~~l~W~~p~~-~~~~ 154 (160)
...+.+.+++||.-|.|+|+|.|..|...++ . -......++.+||.+-.+++.++. ..+.....|-|-|+.. -|+.
T Consensus 208 e~~~~~t~~rPgRwyefrvaavn~~G~rGFs~PSkpf~ssk~pkaPp~P~dl~l~~v~-~dG~~~~~v~w~P~~sdlPv~ 286 (516)
T KOG4802|consen 208 ENTYIFTDMRPGRWYEFRVAAVNAYGFRGFSEPSKPFPSSKNPKAPPSPNDLKLIGVQ-FDGRYMLKVVWCPSKSDLPVE 286 (516)
T ss_pred CceeeeeecCcceeEEEEEeeeecccccccCCCCCCCCCCCCCCCCcCcccceeeeee-ecceEEEEEEeCCCCCCCcce
Confidence 3467889999999999999999998875444 1 222333444444554455545532 2344557788988876 8889
Q ss_pred eeEEeC
Q psy17638 155 HFFVKY 160 (160)
Q Consensus 155 ~y~v~y 160 (160)
+|-|.|
T Consensus 287 ~Yki~W 292 (516)
T KOG4802|consen 287 KYKITW 292 (516)
T ss_pred eeEEEe
Confidence 998875
No 14
>KOG4222|consensus
Probab=98.22 E-value=9.7e-06 Score=63.39 Aligned_cols=101 Identities=24% Similarity=0.297 Sum_probs=71.7
Q ss_pred CCCCcc-eEEEcCCCCCCCC-Cee---EEEecCCcEEEEEecCCCC--CCCceeeEEEEEEEeCCcccCCceeeceeeeC
Q psy17638 3 LSPPLW-LTLNLEALVPGTV-QSL---EAIPLGSSAFYLKWVKPEQ--PNGVLMGYKIKYQSVKGTKVGPLLERLPYISD 75 (160)
Q Consensus 3 ~g~~s~-~~~~t~~~~P~~p-~~~---~~~~~~~~~~~l~W~~p~~--~~g~i~~y~i~~~~~~~~~~~~~~~~~~~~~~ 75 (160)
+|.++. ..+.|.+++|++| ..+ .....+.++..|+|.+|.. .+|.+.+|.|.+........+. + ....
T Consensus 729 ~g~pS~sk~alt~e~~PSapp~~~~~~s~~~~n~Ta~~Vsw~~pp~d~~ng~~qg~ki~~~~~e~tr~h~----n-~t~~ 803 (1281)
T KOG4222|consen 729 QGAPSNSKTALTLEEPPSAPPQGVQHVSKGSYNGTAGSVSWAPPPADVQNGILQGYKIECSGGEKTRIHI----N-KTTN 803 (1281)
T ss_pred ecCCcccccccccccCCCCCCCCccccccccCCCceeeEEecCCcccccCCcccceeEEeecCccccccc----c-cccc
Confidence 466776 8888999988544 443 3334456889999998742 3788999999776532111111 0 0223
Q ss_pred CcccEEEEcCCCCCCEEEEEEEEEeCCCCCCcc
Q psy17638 76 PTTTSAKLAGLEPSTKYRIHLAGYTKAGDGADY 108 (160)
Q Consensus 76 ~~~~~~~i~~l~p~~~y~~~v~a~~~~g~g~~~ 108 (160)
......+|.+|.++..|.|+|++.+..|.|..+
T Consensus 804 a~~~sv~i~~l~~g~ay~vtv~a~T~aGvG~~s 836 (1281)
T KOG4222|consen 804 ARTGSVTIGNLVTGIAYSVTVAARTGAGVGVKS 836 (1281)
T ss_pred CCCCceEeccccccceEEEEEeeecCCccCCCC
Confidence 456678999999999999999999999999865
No 15
>KOG4152|consensus
Probab=97.68 E-value=0.00017 Score=52.50 Aligned_cols=71 Identities=21% Similarity=0.462 Sum_probs=51.0
Q ss_pred EcCCCCCCEEEEEEEEEeCCCCCCcce-eEEeecCCCCCCCCCCccceeeecccCccCeEEEEEecCCCCCCceeEEeC
Q psy17638 83 LAGLEPSTKYRIHLAGYTKAGDGADYF-IEHRTRGTDNVAPGKPNFKLVRSGTENGYGAFKVIWEPNPERPGSHFFVKY 160 (160)
Q Consensus 83 i~~l~p~~~y~~~v~a~~~~g~g~~~~-~~~~t~~~~~~~p~~p~~~~~~~~~~~~~~~~~l~W~~p~~~~~~~y~v~y 160 (160)
-..|.+|+.|.|+|.+.|..|-|+++. ..++|...- -|.+|.- .....+-..+.|.|.+|. .|+.+-+|+|
T Consensus 651 k~~lv~Gq~yrfrV~aIng~G~gp~s~i~~~kTc~pG--~P~apS~----~ri~k~~eGi~l~weppt-~p~sg~Iiey 722 (830)
T KOG4152|consen 651 KTSLVTGQAYRFRVTAINGKGPGPASTILKLKTCAPG--KPTAPSG----ARIKKTIEGISLVWEPPT-KPGSGTIIEY 722 (830)
T ss_pred ccccccccceeeeeeeeeccCCCchhhheeeeeccCC--CCCCccc----ccccccccceeecccCCC-CCCCcceEEe
Confidence 356889999999999999999999883 555554322 2334422 333345567999999999 7777777776
No 16
>KOG4802|consensus
Probab=97.68 E-value=0.00038 Score=49.20 Aligned_cols=103 Identities=16% Similarity=0.290 Sum_probs=59.9
Q ss_pred CCCCCcc-eEEEcCCCCCCCCCeeEEEecC--CcE-EEEEecCCCCCCCceeeEEEEEEEeCCcccCCceeeceeeeCCc
Q psy17638 2 YLSPPLW-LTLNLEALVPGTVQSLEAIPLG--SSA-FYLKWVKPEQPNGVLMGYKIKYQSVKGTKVGPLLERLPYISDPT 77 (160)
Q Consensus 2 g~g~~s~-~~~~t~~~~P~~p~~~~~~~~~--~~~-~~l~W~~p~~~~g~i~~y~i~~~~~~~~~~~~~~~~~~~~~~~~ 77 (160)
|-..+|. ...+-...+|.+|.+++..... +.. ..|-|. |..++=+|..|+|.|....+.....+..........
T Consensus 236 GFs~PSkpf~ssk~pkaPp~P~dl~l~~v~~dG~~~~~v~w~-P~~sdlPv~~Yki~Ws~~v~s~k~~m~tks~~~k~t- 313 (516)
T KOG4802|consen 236 GFSEPSKPFPSSKNPKAPPSPNDLKLIGVQFDGRYMLKVVWC-PSKSDLPVEKYKITWSLYVNSAKASMITKSSYVKDT- 313 (516)
T ss_pred ccCCCCCCCCCCCCCCCCcCcccceeeeeeecceEEEEEEeC-CCCCCCcceeeEEEeehhhhhhhhhcccccceeecc-
Confidence 4444454 3333344567777777655433 222 345555 444466889999999764321111111111011122
Q ss_pred ccEEEEcCCCCCCEEEEEEEEEeCCCCCCc
Q psy17638 78 TTSAKLAGLEPSTKYRIHLAGYTKAGDGAD 107 (160)
Q Consensus 78 ~~~~~i~~l~p~~~y~~~v~a~~~~g~g~~ 107 (160)
....|++|.|+..|.|.|.|....|.+.-
T Consensus 314 -hq~si~~L~Pns~Y~VevqAi~y~g~~rL 342 (516)
T KOG4802|consen 314 -HQFSIKELLPNSSYYVEVQAISYLGSRRL 342 (516)
T ss_pred -chhhhhhcCCCCeEEEEEEEEEeccCccc
Confidence 23448999999999999999999888754
No 17
>KOG4367|consensus
Probab=97.19 E-value=0.00054 Score=48.82 Aligned_cols=80 Identities=19% Similarity=0.301 Sum_probs=59.5
Q ss_pred cCCcEEEEEecCCCCCCCceeeEEEEEEEeCCcccCCceeeceeeeCCcccEEEEcCCCCCCEEEEEEEEEeCCCCCCcc
Q psy17638 29 LGSSAFYLKWVKPEQPNGVLMGYKIKYQSVKGTKVGPLLERLPYISDPTTTSAKLAGLEPSTKYRIHLAGYTKAGDGADY 108 (160)
Q Consensus 29 ~~~~~~~l~W~~p~~~~g~i~~y~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~l~p~~~y~~~v~a~~~~g~g~~~ 108 (160)
....++++.|..|.....+..||.++.....+ +.+.+ +..+..+--++.+|-=++.|..+|.++|..|.++++
T Consensus 450 t~nns~t~~wkqp~~~~~~~dg~~leld~g~~---g~fre----vy~g~etmctvdglhfns~y~arvka~n~tg~s~ys 522 (699)
T KOG4367|consen 450 THNNSATLSWKQPPLSTVPADGYILELDDGNG---GQFRE----VYVGKETMCTVDGLHFNSTYNARVKAFNKTGVSPYS 522 (699)
T ss_pred ccCCceEEEeecCCCCCCCCcceEEEeecCCC---CceeE----EEecCceeEEecceecchhHHHHHHHhhccCCCccc
Confidence 34578999999776555677899887765422 33332 445666778899999999999999999999999987
Q ss_pred -eeEEeec
Q psy17638 109 -FIEHRTR 115 (160)
Q Consensus 109 -~~~~~t~ 115 (160)
.+-++|.
T Consensus 523 ~tl~lqts 530 (699)
T KOG4367|consen 523 KTLVLQTS 530 (699)
T ss_pred ceeEeeec
Confidence 4555554
No 18
>PF01108 Tissue_fac: Tissue factor; PDB: 3OG4_B 3OG6_B 1FYH_E 1FG9_D 1JRH_I 3DGC_R 3DLQ_R 1LQS_R 1Y6M_R 1J7V_R ....
Probab=96.65 E-value=0.018 Score=33.56 Aligned_cols=81 Identities=15% Similarity=0.128 Sum_probs=49.6
Q ss_pred CCCCCCeeEEEecCCcEEEEEecCCCCCCCceeeEEEEEEEeCCcccCCceeeceeeeCCcccEEEEcC--CCCCCEEEE
Q psy17638 17 VPGTVQSLEAIPLGSSAFYLKWVKPEQPNGVLMGYKIKYQSVKGTKVGPLLERLPYISDPTTTSAKLAG--LEPSTKYRI 94 (160)
Q Consensus 17 ~P~~p~~~~~~~~~~~~~~l~W~~p~~~~g~i~~y~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~--l~p~~~y~~ 94 (160)
...+|.++++...+ -...|.|+++... ..-..|.|++.......|.... .+.. ...+.-.++. ......|.+
T Consensus 21 ~lp~P~nv~~~s~n-f~~iL~W~~~~~~-~~~~~ytVq~~~~~~~~W~~v~--~C~~--i~~~~Cdlt~~~~~~~~~Y~~ 94 (107)
T PF01108_consen 21 SLPAPQNVTVDSVN-FKHILRWDPGPGS-PPNVTYTVQYKKYGSSSWKDVP--GCQN--ITETSCDLTDETSDPSESYYA 94 (107)
T ss_dssp SGSSCEEEEEEEET-TEEEEEEEESTTS-SSTEEEEEEEEESSTSCEEEEC--CEEE--ESSSEEECTTCCTTTTSEEEE
T ss_pred cCCCCCeeEEEEEC-CceEEEeCCCCCC-CCCeEEEEEEEecCCcceeecc--ceec--ccccceeCcchhhcCcCCEEE
Confidence 34678888887665 4588999985332 2346899999854444443221 0111 2224444544 337889999
Q ss_pred EEEEEeCCC
Q psy17638 95 HLAGYTKAG 103 (160)
Q Consensus 95 ~v~a~~~~g 103 (160)
+|+|.....
T Consensus 95 rV~A~~~~~ 103 (107)
T PF01108_consen 95 RVRAEVGNQ 103 (107)
T ss_dssp EEEEEETTE
T ss_pred EEEEEeCCc
Confidence 999987653
No 19
>KOG3632|consensus
Probab=96.63 E-value=0.0037 Score=49.08 Aligned_cols=119 Identities=18% Similarity=0.182 Sum_probs=71.1
Q ss_pred EEEcCCCCCCCCCeeEEEecCCcEEEEEecCCCCCCCceeeEEEEEEEeCCcccCCceeeceeeeCCcccEEEEcCCCCC
Q psy17638 10 TLNLEALVPGTVQSLEAIPLGSSAFYLKWVKPEQPNGVLMGYKIKYQSVKGTKVGPLLERLPYISDPTTTSAKLAGLEPS 89 (160)
Q Consensus 10 ~~~t~~~~P~~p~~~~~~~~~~~~~~l~W~~p~~~~g~i~~y~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~l~p~ 89 (160)
..-+...++..|.++++...+..+..+.|-+. +... ... .|... ... ......+..++|.+|+|+
T Consensus 664 ~Llvgg~a~vvpsqlrv~n~tqtSa~itwvp~---nsn~-~Hv-iyln~--eE~--------~ps~a~~y~ytf~~lrpg 728 (1335)
T KOG3632|consen 664 ILLVGGAAPVVPSQLRVWNATQTSAMITWVPF---NSNF-LHV-IYLNA--EEP--------RPSVAEMYNYTFMRLRPG 728 (1335)
T ss_pred eEeccccccccchhhhhhhhhchhhheeeeec---CCCc-cee-eecCC--ccC--------CCchhhhhHHHHhccCCC
Confidence 33444567788899999999999999999754 2211 122 22221 111 022345667889999999
Q ss_pred CEEEEEEEEEeC-CCC--------CCcc-eeEEeecCCCCCCCCCCccceeeecccC--ccCeEEEEEecCC
Q psy17638 90 TKYRIHLAGYTK-AGD--------GADY-FIEHRTRGTDNVAPGKPNFKLVRSGTEN--GYGAFKVIWEPNP 149 (160)
Q Consensus 90 ~~y~~~v~a~~~-~g~--------g~~~-~~~~~t~~~~~~~p~~p~~~~~~~~~~~--~~~~~~l~W~~p~ 149 (160)
+.|.++|-+.-. .+- +... ...+.|... .+|.+|-. +.++. ..+.+.++|.||.
T Consensus 729 t~y~a~vea~~p~q~pwdl~~v~~etr~atv~f~tLpA--Gppappld----V~vE~g~spg~l~vswrPpt 794 (1335)
T KOG3632|consen 729 TDYWASVEAALPRQEPWDLRMVPMETRQATVLFRTLPA--GPPAPPLD----VKVETGGSPGRLEVSWRPPT 794 (1335)
T ss_pred CccceecccccCcCCCcccccchhhhhccceeeecccC--CCCCCchh----eeeecCCCCceeeeeccCce
Confidence 999999987665 221 1001 123333322 24444422 44444 6778999999995
No 20
>PF09294 Interfer-bind: Interferon-alpha/beta receptor, fibronectin type III; InterPro: IPR015373 Members of this family adopt a secondary structure consisting of seven beta-strands arranged in an immunoglobulin-like beta-sandwich, in a Greek-key topology. They are required for binding to interferon-alpha []. ; PDB: 1A21_A 3LQM_B 3ELA_T 1AHW_C 2A2Q_T 1TFH_B 1FAK_T 1WSS_T 1W2K_T 2FIR_T ....
Probab=95.77 E-value=0.094 Score=30.31 Aligned_cols=70 Identities=14% Similarity=0.267 Sum_probs=43.8
Q ss_pred eeEEEecCCcEEEEEecCCCC----CCC--c------eeeEEEEEEEeCCcccCCceeeceeeeCCcccEEEEcCCCCCC
Q psy17638 23 SLEAIPLGSSAFYLKWVKPEQ----PNG--V------LMGYKIKYQSVKGTKVGPLLERLPYISDPTTTSAKLAGLEPST 90 (160)
Q Consensus 23 ~~~~~~~~~~~~~l~W~~p~~----~~g--~------i~~y~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~l~p~~ 90 (160)
.+.+ ......+.|.+..|.. ... . -..|.|.++....... . .........+.|.+|.|++
T Consensus 7 ~v~v-~~~~~~l~V~i~~P~~~~~~~~~~~~l~~~~~~~~Y~v~~~~~~~~~~---~----~~~~~~~~~~~l~~L~p~t 78 (106)
T PF09294_consen 7 SVNV-SSCGGSLHVTIKPPMTPLRAGGKNSSLRDIYPSLSYNVSYWKNGSNEK---K----KEIETKNSSVTLSDLKPGT 78 (106)
T ss_dssp EEEE-EEETTEEEEEEEESEEEEECSSSEEEHHHHHGG-EEEEEEEETTTSCE---E----EEEESSSEEEEEES--TTS
T ss_pred EEEE-EECCCEEEEEEECCCcccccCCCCCcHHHhCCCeEEEEEEEeCCCccc---e----EEEeecCCEEEEeCCCCCC
Confidence 4655 4566789999988851 111 1 2569999888644411 0 1223445677899999999
Q ss_pred EEEEEEEEEe
Q psy17638 91 KYRIHLAGYT 100 (160)
Q Consensus 91 ~y~~~v~a~~ 100 (160)
.|=|+|.+..
T Consensus 79 ~YCv~V~~~~ 88 (106)
T PF09294_consen 79 NYCVSVQAFS 88 (106)
T ss_dssp EEEEEEEEEE
T ss_pred CEEEEEEEEe
Confidence 9999999943
No 21
>PF00041 fn3: Fibronectin type III domain; InterPro: IPR003961 Fibronectins are multi-domain glycoproteins found in a soluble form in plasma, and in an insoluble form in loose connective tissue and basement membranes []. They contain multiple copies of 3 repeat regions (types I, II and III), which bind to a variety of substances including heparin, collagen, DNA, actin, fibrin and fibronectin receptors on cell surfaces. The wide variety of these substances means that fibronectins are involved in a number of important functions: e.g., wound healing; cell adhesion; blood coagulation; cell differentiation and migration; maintenance of the cellular cytoskeleton; and tumour metastasis []. The role of fibronectin in cell differentiation is demonstrated by the marked reduction in the expression of its gene when neoplastic transformation occurs. Cell attachment has been found to be mediated by the binding of the tetrapeptide RGDS to integrins on the cell surface [], although related sequences can also display cell adhesion activity. Plasma fibronectin occurs as a dimer of 2 different subunits, linked together by 2 disulphide bonds near the C terminus. The difference in the 2 chains occurs in the type III repeat region and is caused by alternative splicing of the mRNA from one gene []. The observation that, in a given protein, an individual repeat of one of the 3 types (e.g., the first FnIII repeat) shows much less similarity to its subsequent tandem repeats within that protein than to its equivalent repeat between fibronectins from other species, has suggested that the repeating structure of fibronectin arose at an early stage of evolution. It also seems to suggest that the structure is subject to high selective pressure []. The fibronectin type III repeat region is an approximately 100 amino acid domain, different tandem repeats of which contain binding sites for DNA, heparin and the cell surface []. The superfamily of sequences believed to contain FnIII repeats represents 45 different families, the majority of which are involved in cell surface binding in some manner, or are receptor protein tyrosine kinases, or cytokine receptors.; GO: 0005515 protein binding; PDB: 1UEM_A 1TDQ_A 1X5I_A 2IC2_B 2IBG_C 2IBB_A 3R8Q_A 2FNB_A 1FNH_A 2EDB_A ....
Probab=95.28 E-value=0.027 Score=30.81 Aligned_cols=24 Identities=25% Similarity=0.542 Sum_probs=20.8
Q ss_pred ccCeEEEEEecCC--CCCCceeEEeC
Q psy17638 137 GYGAFKVIWEPNP--ERPGSHFFVKY 160 (160)
Q Consensus 137 ~~~~~~l~W~~p~--~~~~~~y~v~y 160 (160)
+.+++.|+|.+|. ..++.+|.|+|
T Consensus 12 ~~~sv~v~W~~~~~~~~~~~~y~v~~ 37 (85)
T PF00041_consen 12 SPTSVTVSWKPPSSGNGPITGYRVEY 37 (85)
T ss_dssp CSSEEEEEEEESSSTSSSESEEEEEE
T ss_pred CCCEEEEEEECCCCCCCCeeEEEEEE
Confidence 8899999999995 27888999986
No 22
>PF10179 DUF2369: Uncharacterised conserved protein (DUF2369); InterPro: IPR019326 This is a proline-rich region of a group of proteins found from plants to fungi. The function is largely unknown, although the entry contains Fibronectin type-III domain-containing protein C4orf31, which promotes matrix assembly and cell adhesiveness.
Probab=95.03 E-value=0.48 Score=33.00 Aligned_cols=80 Identities=15% Similarity=0.321 Sum_probs=45.0
Q ss_pred ccEEEEcCCCCCCEEEEEEEEEeCCCCCCcceeE-EeecCCCCCCCCCCcc-ceeeecccCccCeEEEEEecCCCCCCce
Q psy17638 78 TTSAKLAGLEPSTKYRIHLAGYTKAGDGADYFIE-HRTRGTDNVAPGKPNF-KLVRSGTENGYGAFKVIWEPNPERPGSH 155 (160)
Q Consensus 78 ~~~~~i~~l~p~~~y~~~v~a~~~~g~g~~~~~~-~~t~~~~~~~p~~p~~-~~~~~~~~~~~~~~~l~W~~p~~~~~~~ 155 (160)
...+.+.++.||..|.+++...+.........+. ..+......-|..|.- .++......+-+++.|.|.+...+. ..
T Consensus 127 ~~~f~l~~~~~g~~Yliri~~~~~~e~~~~~kV~aast~~~~~~~P~LP~d~~Ik~f~~lrtC~SvTIAW~~s~d~~-~k 205 (300)
T PF10179_consen 127 LRHFRLSGVKPGERYLIRIQISNSDEGPSTFKVQAASTNPSKQPYPQLPDDTSIKEFNKLRTCNSVTIAWLGSPDRS-IK 205 (300)
T ss_pred eEEEEECCCCCCCeEEEEEEccCCCCCceEEEEEEecCCcccCCCCCCCCCCceeEEcCCcccceEEEEEecCCCCC-ce
Confidence 4578899999999999999876644322111122 2222222223444432 1133332246689999999865222 35
Q ss_pred eEE
Q psy17638 156 FFV 158 (160)
Q Consensus 156 y~v 158 (160)
|.|
T Consensus 206 YCv 208 (300)
T PF10179_consen 206 YCV 208 (300)
T ss_pred EEE
Confidence 644
No 23
>PF09067 EpoR_lig-bind: Erythropoietin receptor, ligand binding; InterPro: IPR015152 Members of this entry include the growth hormone and erythropoietin receptors. The latter interacts with erythropoietin (EPO), with subsequent initiation of the downstream chain of events associated with binding of EPO to the receptor, including EPO-induced erythroblast proliferation and differentiation through induction of the JAK2/STAT5 signalling cascade. The domain adopts a secondary structure composed of a short amino-terminal helix, followed by two beta-sandwich regions []. ; PDB: 3NCB_B 3NCF_B 3NCE_B 3N0P_B 3NCC_B 1BP3_B 3N06_B 3MZG_B 3D48_R 1F6F_C ....
Probab=94.15 E-value=0.62 Score=27.10 Aligned_cols=84 Identities=14% Similarity=0.156 Sum_probs=53.0
Q ss_pred CCCCCCeeEEEecCCcEEEEEecCCCCCCCceeeEEEEEEEeCCcccCCceeeceeeeCCcc-----cEEEEcCCCCCCE
Q psy17638 17 VPGTVQSLEAIPLGSSAFYLKWVKPEQPNGVLMGYKIKYQSVKGTKVGPLLERLPYISDPTT-----TSAKLAGLEPSTK 91 (160)
Q Consensus 17 ~P~~p~~~~~~~~~~~~~~l~W~~p~~~~g~i~~y~i~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~i~~l~p~~~ 91 (160)
.|..|.++.+.+.....++=-|++....+.. ..|.+.|.... ..+...... .....+. ..+.-.+..-++.
T Consensus 7 ~p~~P~~~~C~S~~~etftC~W~~g~~~~l~-~~y~L~Y~~~~-~~~~eCp~~--~~~~~ns~~~~~C~F~~~~t~lf~~ 82 (104)
T PF09067_consen 7 PPEKPENLKCFSREMETFTCFWEPGSEGNLP-TNYTLFYKKEG-EEWKECPDY--STSGPNSTVRHICYFPKSDTSLFVP 82 (104)
T ss_dssp HCCCCEEEEEEBSSSS-EEEEEEEESSSTST-CEEEEEEEETT-SEEEEESES--STTETTEEEEEEEEE-CCGCSSSSE
T ss_pred CCCCCccCccCCCCCCcEEEEeeCCCCCCCC-CcEEEEEEeCC-CCCccCCCe--EecCCCCceeEEEEcCCCCeEEEEE
Confidence 4677888888888889999999987543322 33898888764 222221110 0111222 4454456778899
Q ss_pred EEEEEEEEeCCCC
Q psy17638 92 YRIHLAGYTKAGD 104 (160)
Q Consensus 92 y~~~v~a~~~~g~ 104 (160)
|.|+|.+.|..|.
T Consensus 83 y~i~V~a~~~~~~ 95 (104)
T PF09067_consen 83 YCIQVEATNALGS 95 (104)
T ss_dssp EEEEEEEEETTEE
T ss_pred EEEEEEeccCCCc
Confidence 9999999988754
No 24
>PF07495 Y_Y_Y: Y_Y_Y domain; InterPro: IPR011123 This region is mostly found at the end of the beta propellers (IPR011110 from INTERPRO) in a family of two component regulators. However they are also found tandemly repeated in Q891H4 from SWISSPROT without other signal conduction domains being present. It is named after the conserved tyrosines found in the alignment. The exact function is not known.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=93.12 E-value=0.66 Score=24.12 Aligned_cols=27 Identities=22% Similarity=0.406 Sum_probs=20.2
Q ss_pred EEEEcCCCCCCEEEEEEEEEeCCCCCCc
Q psy17638 80 SAKLAGLEPSTKYRIHLAGYTKAGDGAD 107 (160)
Q Consensus 80 ~~~i~~l~p~~~y~~~v~a~~~~g~g~~ 107 (160)
.+.+.+|.|| .|.|.|+|.+..|....
T Consensus 30 ~~~~~~L~~G-~Y~l~V~a~~~~~~~~~ 56 (66)
T PF07495_consen 30 SISYTNLPPG-KYTLEVRAKDNNGKWSS 56 (66)
T ss_dssp EEEEES--SE-EEEEEEEEEETTS-B-S
T ss_pred EEEEEeCCCE-EEEEEEEEECCCCCcCc
Confidence 8889999999 79999999998876543
No 25
>KOG4152|consensus
Probab=90.82 E-value=1.2 Score=33.46 Aligned_cols=106 Identities=14% Similarity=0.188 Sum_probs=59.4
Q ss_pred CCCCCCcc-eEEEcCC-CCCCCCCeeEEEecCCcEEEEEecCCCCC-CCceeeEEEEEEEeCC-----cccCCce-ee--
Q psy17638 1 MYLSPPLW-LTLNLEA-LVPGTVQSLEAIPLGSSAFYLKWVKPEQP-NGVLMGYKIKYQSVKG-----TKVGPLL-ER-- 69 (160)
Q Consensus 1 ~g~g~~s~-~~~~t~~-~~P~~p~~~~~~~~~~~~~~l~W~~p~~~-~g~i~~y~i~~~~~~~-----~~~~~~~-~~-- 69 (160)
+|.|+.|. ..+.|.. +.|..|..+++. -+--.+.+.|.+|..+ .|.|..|........+ .....+. ..
T Consensus 670 ~G~gp~s~i~~~kTc~pG~P~apS~~ri~-k~~eGi~l~weppt~p~sg~Iieys~ylAi~tq~~~~~~sql~fmr~ycg 748 (830)
T KOG4152|consen 670 KGPGPASTILKLKTCAPGKPTAPSGARIK-KTIEGISLVWEPPTKPGSGTIIEYSPYLAIITQAGATGVSQLPFMRSYCG 748 (830)
T ss_pred cCCCchhhheeeeeccCCCCCCccccccc-ccccceeecccCCCCCCCcceEEeehhhHhhhhhhccCccccceeeeecc
Confidence 47788888 8888753 457777766543 2334689999999765 5577777654432211 0000000 00
Q ss_pred ---ceeeeCCcccEEEEcCCCCCCEEEEEEEEEeCCCCCCcc
Q psy17638 70 ---LPYISDPTTTSAKLAGLEPSTKYRIHLAGYTKAGDGADY 108 (160)
Q Consensus 70 ---~~~~~~~~~~~~~i~~l~p~~~y~~~v~a~~~~g~g~~~ 108 (160)
.+.+...+.... +.+...+..-.|+|.|.|..|.|+..
T Consensus 749 ~~t~c~vt~g~l~tA-~~d~t~~paivfri~aknekGyGpat 789 (830)
T KOG4152|consen 749 IITTCVVTSGKLITA-VPDITTGPAIVFRIVAKNEKGYGPAT 789 (830)
T ss_pred cceeeEEecccceee-ccccccCcceEEEEEecCCCCCCcce
Confidence 011111111111 12233355568999999999999865
No 26
>PLN02533 probable purple acid phosphatase
Probab=89.29 E-value=4.4 Score=30.05 Aligned_cols=77 Identities=14% Similarity=0.232 Sum_probs=43.5
Q ss_pred CCCCCCCeeEEEecCCcEEEEEecCCCCCCCceeeEEEEEEEeCCcccCCc--eeecee----eeCCcccEEEEcCCCCC
Q psy17638 16 LVPGTVQSLEAIPLGSSAFYLKWVKPEQPNGVLMGYKIKYQSVKGTKVGPL--LERLPY----ISDPTTTSAKLAGLEPS 89 (160)
Q Consensus 16 ~~P~~p~~~~~~~~~~~~~~l~W~~p~~~~g~i~~y~i~~~~~~~~~~~~~--~~~~~~----~~~~~~~~~~i~~l~p~ 89 (160)
+.+..|..+++.-.+.+++.|+|....... . .|.|....+...... ....+. ..........|.+|+|+
T Consensus 39 ~~~~~P~qvhls~~~~~~m~V~W~T~~~~~-~----~V~yG~~~~~l~~~a~g~~~~~~~~~~~~~g~iH~v~l~~L~p~ 113 (427)
T PLN02533 39 DDPTHPDQVHISLVGPDKMRISWITQDSIP-P----SVVYGTVSGKYEGSANGTSSSYHYLLIYRSGQINDVVIGPLKPN 113 (427)
T ss_pred CCCCCCceEEEEEcCCCeEEEEEECCCCCC-C----EEEEecCCCCCcceEEEEEEEEeccccccCCeEEEEEeCCCCCC
Confidence 345677788877667889999998764322 2 233433211111100 000000 11223456789999999
Q ss_pred CEEEEEEE
Q psy17638 90 TKYRIHLA 97 (160)
Q Consensus 90 ~~y~~~v~ 97 (160)
+.|.-+|-
T Consensus 114 T~Y~Yrvg 121 (427)
T PLN02533 114 TVYYYKCG 121 (427)
T ss_pred CEEEEEEC
Confidence 99999884
No 27
>PF10179 DUF2369: Uncharacterised conserved protein (DUF2369); InterPro: IPR019326 This is a proline-rich region of a group of proteins found from plants to fungi. The function is largely unknown, although the entry contains Fibronectin type-III domain-containing protein C4orf31, which promotes matrix assembly and cell adhesiveness.
Probab=88.04 E-value=7.3 Score=27.39 Aligned_cols=30 Identities=27% Similarity=0.490 Sum_probs=24.2
Q ss_pred cEEEEcCCCCCCEEEEEEEEEeCCCCCCcc
Q psy17638 79 TSAKLAGLEPSTKYRIHLAGYTKAGDGADY 108 (160)
Q Consensus 79 ~~~~i~~l~p~~~y~~~v~a~~~~g~g~~~ 108 (160)
...+|.+|.||+.|-|.|.+.-..|...+.
T Consensus 260 ~tetI~~L~PG~~Yl~dV~~~~~~G~sl~Y 289 (300)
T PF10179_consen 260 TTETIKGLKPGTTYLFDVYVNGPSGQSLPY 289 (300)
T ss_pred ceeecccCCCCcEEEEEEEEecCCCceeec
Confidence 345799999999999999998777665544
No 28
>TIGR00868 hCaCC calcium-activated chloride channel protein 1. distributions. found a row in 1A13.INFO that was not parsed out
Probab=87.74 E-value=6.5 Score=31.92 Aligned_cols=31 Identities=3% Similarity=0.152 Sum_probs=22.5
Q ss_pred eeecccCccCeEEEEEecCCC----CCCceeEEeC
Q psy17638 130 VRSGTENGYGAFKVIWEPNPE----RPGSHFFVKY 160 (160)
Q Consensus 130 ~~~~~~~~~~~~~l~W~~p~~----~~~~~y~v~y 160 (160)
..+.++..+..|.|+|..|.. +....|.|+|
T Consensus 760 tDL~~~~~~~~v~LsWTAPG~d~D~G~a~~y~ir~ 794 (863)
T TIGR00868 760 TDLEAGFQGDNIILTWTAPGDVLDHGRADRYIIRI 794 (863)
T ss_pred eeeEEeecCCEEEEEeeCCCccCCCCccceEEEEe
Confidence 445544566679999999975 4556888876
No 29
>KOG4367|consensus
Probab=84.79 E-value=1.9 Score=31.65 Aligned_cols=37 Identities=14% Similarity=0.368 Sum_probs=24.4
Q ss_pred CCCCCccceeeecccCccCeEEEEEecCCC--CCCceeEEe
Q psy17638 121 APGKPNFKLVRSGTENGYGAFKVIWEPNPE--RPGSHFFVK 159 (160)
Q Consensus 121 ~p~~p~~~~~~~~~~~~~~~~~l~W~~p~~--~~~~~y~v~ 159 (160)
+|..|-+. ...-...++++.|+|+.|.. .|.++|+++
T Consensus 437 vpatpilq--~~ec~t~nns~t~~wkqp~~~~~~~dg~~le 475 (699)
T KOG4367|consen 437 VPATPILQ--LEECCTHNNSATLSWKQPPLSTVPADGYILE 475 (699)
T ss_pred CCCCceee--hhhhhccCCceEEEeecCCCCCCCCcceEEE
Confidence 56666442 11112377899999996654 778899876
No 30
>PF01108 Tissue_fac: Tissue factor; PDB: 3OG4_B 3OG6_B 1FYH_E 1FG9_D 1JRH_I 3DGC_R 3DLQ_R 1LQS_R 1Y6M_R 1J7V_R ....
Probab=83.00 E-value=2.4 Score=24.56 Aligned_cols=31 Identities=19% Similarity=0.434 Sum_probs=22.4
Q ss_pred eeecccCccCeEEEEEecCCC-CCCceeEEeC
Q psy17638 130 VRSGTENGYGAFKVIWEPNPE-RPGSHFFVKY 160 (160)
Q Consensus 130 ~~~~~~~~~~~~~l~W~~p~~-~~~~~y~v~y 160 (160)
.++.....+-...|+|.++.+ .+...|.|+|
T Consensus 26 ~nv~~~s~nf~~iL~W~~~~~~~~~~~ytVq~ 57 (107)
T PF01108_consen 26 QNVTVDSVNFKHILRWDPGPGSPPNVTYTVQY 57 (107)
T ss_dssp EEEEEEEETTEEEEEEEESTTSSSTEEEEEEE
T ss_pred CeeEEEEECCceEEEeCCCCCCCCCeEEEEEE
Confidence 456666666778899999654 5556888886
No 31
>PF11344 DUF3146: Protein of unknown function (DUF3146); InterPro: IPR021492 This family of proteins with unknown function appear to be restricted to Cyanobacteria.
Probab=82.81 E-value=1 Score=24.25 Aligned_cols=15 Identities=27% Similarity=0.510 Sum_probs=13.0
Q ss_pred CCCCCCEEEEEEEEE
Q psy17638 85 GLEPSTKYRIHLAGY 99 (160)
Q Consensus 85 ~l~p~~~y~~~v~a~ 99 (160)
.|+||..|.|+|+|.
T Consensus 66 ~LEpGgdY~Ftirak 80 (80)
T PF11344_consen 66 QLEPGGDYSFTIRAK 80 (80)
T ss_pred eccCCCceEEEEecC
Confidence 488999999999873
No 32
>KOG3632|consensus
Probab=80.17 E-value=4.1 Score=33.32 Aligned_cols=112 Identities=14% Similarity=0.092 Sum_probs=63.1
Q ss_pred CCeeEEEecCCcEEEEEecCCCCC--CCceeeEEEEEEEeCCcccCCceeeceeeeCCcccEEEEcCCCC-CCEEEEEEE
Q psy17638 21 VQSLEAIPLGSSAFYLKWVKPEQP--NGVLMGYKIKYQSVKGTKVGPLLERLPYISDPTTTSAKLAGLEP-STKYRIHLA 97 (160)
Q Consensus 21 p~~~~~~~~~~~~~~l~W~~p~~~--~g~i~~y~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~l~p-~~~y~~~v~ 97 (160)
|..+.+....+.++.|.|++|..+ -+.+.+|.+.....-.. .+...+.++..+..|.- +..|.+.|.
T Consensus 581 pr~~~~vkqla~sv~vawepps~pP~~~~~~~~~~~v~~elrq----------~l~~gs~tka~~E~ld~~a~s~~isvq 650 (1335)
T KOG3632|consen 581 PRTGAAVKQLAGSVHVAWEPPSSPPFTAPARMIGATVLMELRQ----------TLYAGSLTKAMQESLDNSAHSGYISVQ 650 (1335)
T ss_pred ccchhhhhhhccceeeeccCCCCCCccccceeeeeecchhhhh----------hcccccchHHHHhhccccCCceeeehh
Confidence 334444455678999999998644 23456666544321000 02223334444555553 345889999
Q ss_pred EEeCCCCCCcceeEEeecCCCCCCCCCCccceeeeccc-CccCeEEEEEecCC
Q psy17638 98 GYTKAGDGADYFIEHRTRGTDNVAPGKPNFKLVRSGTE-NGYGAFKVIWEPNP 149 (160)
Q Consensus 98 a~~~~g~g~~~~~~~~t~~~~~~~p~~p~~~~~~~~~~-~~~~~~~l~W~~p~ 149 (160)
+..+.|........+.+... ++..|.. +.+- .+.++..++|-+-.
T Consensus 651 ~ltSrGsqd~lrc~Llvgg~---a~vvpsq----lrv~n~tqtSa~itwvp~n 696 (1335)
T KOG3632|consen 651 RLTSRGSQDQLRCILLVGGA---APVVPSQ----LRVWNATQTSAMITWVPFN 696 (1335)
T ss_pred hhhccCCCCcceeeEecccc---ccccchh----hhhhhhhchhhheeeeecC
Confidence 99999988766333333333 3333421 2222 26678889999865
No 33
>KOG4228|consensus
Probab=78.00 E-value=4.4 Score=33.37 Aligned_cols=91 Identities=18% Similarity=0.112 Sum_probs=51.7
Q ss_pred CCCCCcc-eEEEcCCCCCCCCCeeEEEecCCcEEEEEecCCCCCCCceeeEEEEEEEeCCcccCCceeeceeeeCCcccE
Q psy17638 2 YLSPPLW-LTLNLEALVPGTVQSLEAIPLGSSAFYLKWVKPEQPNGVLMGYKIKYQSVKGTKVGPLLERLPYISDPTTTS 80 (160)
Q Consensus 2 g~g~~s~-~~~~t~~~~P~~p~~~~~~~~~~~~~~l~W~~p~~~~g~i~~y~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (160)
|.|+++. +.+.|....|..+. .....+.+++.|.++. ...+..|++.+...+..... .+.......
T Consensus 151 ~~~~~~t~i~i~~~~~~p~~~~----~~~~~~~it~~w~~~~--~~~~~~ykl~~~~~d~~~~~-------~v~~~~~~~ 217 (1087)
T KOG4228|consen 151 GLGPPSTYIQITTNANSPIQPG----EEEEYTTITGSWSPPH--AVSLDTYKLLHLDPDTGYEI-------SVTLTPPGS 217 (1087)
T ss_pred ccCCCCceEEEeccCCCCCCCC----cceEEEEEEecCCCCC--cccchhhhhhhcCCccccee-------eeeccCCCC
Confidence 5677766 99999988887665 4566789999998763 33456677655443111100 011122223
Q ss_pred EEEcCCCCCCEEEEEEEEEeCCCCC
Q psy17638 81 AKLAGLEPSTKYRIHLAGYTKAGDG 105 (160)
Q Consensus 81 ~~i~~l~p~~~y~~~v~a~~~~g~g 105 (160)
....++.|-..+.+++++....+.+
T Consensus 218 ~~t~~~~~~~~~~~~~ae~~~~~~~ 242 (1087)
T KOG4228|consen 218 GGTGDLGPPSISRFKCAEPDRGPLG 242 (1087)
T ss_pred CcccCCCCcccccccccCccccccC
Confidence 3334455555566666655554443
No 34
>COG4733 Phage-related protein, tail component [Function unknown]
Probab=73.47 E-value=16 Score=29.55 Aligned_cols=68 Identities=12% Similarity=0.096 Sum_probs=39.1
Q ss_pred CcccEEEEcCCCCCCEEEEEEEEEeCCCCC-Ccc-eeEEeecCCCCCCCCCCccceeeecccCccCeEEEEEecCC
Q psy17638 76 PTTTSAKLAGLEPSTKYRIHLAGYTKAGDG-ADY-FIEHRTRGTDNVAPGKPNFKLVRSGTENGYGAFKVIWEPNP 149 (160)
Q Consensus 76 ~~~~~~~i~~l~p~~~y~~~v~a~~~~g~g-~~~-~~~~~t~~~~~~~p~~p~~~~~~~~~~~~~~~~~l~W~~p~ 149 (160)
.....+.+.++.+| +|.++|+|.|..|.. .+. +.... ......||..+... ..+ ..-..+.|.|-.|.
T Consensus 657 t~~~~~~~~gi~~G-qY~i~VrAiN~~g~~~~~a~s~~f~-i~g~~~Ppp~~~t~-~a~---~it~~~~l~v~dPt 726 (952)
T COG4733 657 TSAAGFDVEGIPAG-QYAIRVRAINVFEPNSPDATAYEFA-LNGKKVPPPKAMIY-DAV---IITLVIRLVVGDPT 726 (952)
T ss_pred ccccceeecCcCcc-ceEEEEEEeeccCCCCCCcceeEEE-ecCCCCCCCccccc-ceE---EEEeeeeEEEecCC
Confidence 34567889999995 899999999999875 333 22222 22222233332210 111 12345678888885
No 35
>KOG1225|consensus
Probab=61.67 E-value=20 Score=27.53 Aligned_cols=118 Identities=13% Similarity=0.009 Sum_probs=60.4
Q ss_pred eEEEecCCcEEEEEecCCCCCCCceeeEEEEEEEeCCcccCCceeeceeeeCCcccEEEEcCCCCCCEEEEEEEEEeCCC
Q psy17638 24 LEAIPLGSSAFYLKWVKPEQPNGVLMGYKIKYQSVKGTKVGPLLERLPYISDPTTTSAKLAGLEPSTKYRIHLAGYTKAG 103 (160)
Q Consensus 24 ~~~~~~~~~~~~l~W~~p~~~~g~i~~y~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~l~p~~~y~~~v~a~~~~g 103 (160)
+....-+...+.+.|.. ......|...+...... ..... ...........+..|.|+..|..++.+.-..-
T Consensus 372 ~~~~~cs~~~~~~~~~~----~~~~~~~~~~~~~~~~~-~~~~~----~r~~~~~~~~~~~~~~~g~~~~~~~~~v~~~~ 442 (525)
T KOG1225|consen 372 LLITECSPPSLCIAGVG----RRRVTHCAGTYCPLGES-GGDLQ----GRVPGDANSVDIQGLEPGDEYNCSVNTVAANI 442 (525)
T ss_pred hcccccCCCceeecccc----ccccccccccccccccC-CCccc----eeeccceeeeeeeeecCCcceeeehhhhhhhh
Confidence 33444556677777861 22233444444442111 11111 13355667888999999999999998765443
Q ss_pred CCCcceeEEeecCCCCCCCCCCccceeeecccCccCeEEEEEecCCCCCCceeEEeC
Q psy17638 104 DGADYFIEHRTRGTDNVAPGKPNFKLVRSGTENGYGAFKVIWEPNPERPGSHFFVKY 160 (160)
Q Consensus 104 ~g~~~~~~~~t~~~~~~~p~~p~~~~~~~~~~~~~~~~~l~W~~p~~~~~~~y~v~y 160 (160)
.+.+......+... .+.. .-......+++.+.|..|. .-...+.++|
T Consensus 443 ~~~~~~~~~~~~~~---~~g~------~~v~~~~~~s~e~~g~~~s-~~~~~~~~~~ 489 (525)
T KOG1225|consen 443 GSLPKDKSETTVLC---WNGG------LCVDGETESSLEVGGPCPS-SGTCGWEVRC 489 (525)
T ss_pred ccCCcccccceEee---cCCc------eeeeeeeeccccccCCCCC-ccccceEEEe
Confidence 33222111111111 1111 1112235677888888887 4445555543
No 36
>COG3401 Fibronectin type 3 domain-containing protein [General function prediction only]
Probab=60.61 E-value=61 Score=23.36 Aligned_cols=106 Identities=8% Similarity=-0.034 Sum_probs=57.1
Q ss_pred CcEEEEEecCCCCCCCceeeEEEEEEEeCCcccCCceeeceeeeCCcccEEEEcCCCCCCEEEEEEEEEeCCCCCCccee
Q psy17638 31 SSAFYLKWVKPEQPNGVLMGYKIKYQSVKGTKVGPLLERLPYISDPTTTSAKLAGLEPSTKYRIHLAGYTKAGDGADYFI 110 (160)
Q Consensus 31 ~~~~~l~W~~p~~~~g~i~~y~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~l~p~~~y~~~v~a~~~~g~g~~~~~ 110 (160)
...+.+-|.+..+ -....|.|+....+... ..... ...+-.-.+.-..+.++..|...|.+.+..+.-..+..
T Consensus 83 ~~~~~~~w~~~~d--~~~~~Y~i~~~~gD~~f-~r~~~----~~n~l~~~~i~s~~~~~~~~~~~Iia~~f~~~~sfsf~ 155 (343)
T COG3401 83 PKSVKVFWSPHPD--VSVGKYIIQRQNGDGKF-LRTGL----VKNRLFVEFIDSDLGHNEKYMELIIAADFQMGKSFSFT 155 (343)
T ss_pred cceeeecccccCC--CCCCeEEEEEecCchhh-hhhhH----HHhccchhheecccccccceeeeEEeecccccceeeee
Confidence 3468899987543 23457888776543331 11000 11111123334578899999999998887766544321
Q ss_pred EEeecCCCCCCCCCCccceeeeccc-CccCeEEEEEecCC
Q psy17638 111 EHRTRGTDNVAPGKPNFKLVRSGTE-NGYGAFKVIWEPNP 149 (160)
Q Consensus 111 ~~~t~~~~~~~p~~p~~~~~~~~~~-~~~~~~~l~W~~p~ 149 (160)
.+...+... |+. +.++.+. ...+.+.|+|..+.
T Consensus 156 gVE~~~~~~-P~e-----i~~~~~~~d~~~~i~ls~dg~~ 189 (343)
T COG3401 156 GVEATPKAE-PKE-----ITNVRVSFDLGNNIELSEDGSE 189 (343)
T ss_pred eeecccccC-Cce-----eeeeeeecCCCCcceeeccCcc
Confidence 111111111 121 1444443 36677999999976
No 37
>KOG3834|consensus
Probab=55.83 E-value=40 Score=25.15 Aligned_cols=69 Identities=22% Similarity=0.361 Sum_probs=42.6
Q ss_pred eeEEEecCCcEEEEEecCCCCC-CCceeeEEEEEEEeCCcccCCceeeceeeeC-CcccEEEEcCCCCCCEEEEEE
Q psy17638 23 SLEAIPLGSSAFYLKWVKPEQP-NGVLMGYKIKYQSVKGTKVGPLLERLPYISD-PTTTSAKLAGLEPSTKYRIHL 96 (160)
Q Consensus 23 ~~~~~~~~~~~~~l~W~~p~~~-~g~i~~y~i~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~i~~l~p~~~y~~~v 96 (160)
.+.+.+.....+...|-.+.+. +|.++++.|+|+........ .|+++. ....-..+-+|.+++.|.+-+
T Consensus 67 kltv~n~kt~~~R~v~I~ps~~wggqllGvsvrFcsf~~A~~~-----vwHvl~V~p~SPaalAgl~~~~DYivG~ 137 (462)
T KOG3834|consen 67 KLTVYNSKTQEVRIVEIVPSNNWGGQLLGVSVRFCSFDGAVES-----VWHVLSVEPNSPAALAGLRPYTDYIVGI 137 (462)
T ss_pred EEEEEecccceeEEEEecccccccccccceEEEeccCccchhh-----eeeeeecCCCCHHHhcccccccceEecc
Confidence 3445555556778888877654 56789999999876332211 122222 222345567888888887755
No 38
>PF04775 Bile_Hydr_Trans: Acyl-CoA thioester hydrolase/BAAT N-terminal region; InterPro: IPR006862 This entry presents the N-termini of acyl-CoA thioester hydrolase and bile acid-CoA:amino acid N-acetyltransferase (BAAT) []. This region is not thought to contain the active site of either enzyme. Thioesterase isoforms have been identified in peroxisomes, cytoplasm and mitochondria, where they are thought to have distinct functions in lipid metabolism []. For example, in peroxisomes, the hydrolase acts on bile-CoA esters [].; GO: 0016290 palmitoyl-CoA hydrolase activity, 0006629 lipid metabolic process; PDB: 3HLK_B 3K2I_B.
Probab=54.72 E-value=30 Score=20.88 Aligned_cols=25 Identities=16% Similarity=0.340 Sum_probs=16.0
Q ss_pred cEEEEcCCCCCCEEEEEEEEEeCCC
Q psy17638 79 TSAKLAGLEPSTKYRIHLAGYTKAG 103 (160)
Q Consensus 79 ~~~~i~~l~p~~~y~~~v~a~~~~g 103 (160)
..+.+.+|.|+..+.++.......|
T Consensus 5 ~~I~v~GL~p~~~vtl~a~~~~~~g 29 (126)
T PF04775_consen 5 VDIRVSGLPPGQEVTLRARLTDDNG 29 (126)
T ss_dssp -EEEEES--TT-EEEEEEEEE-TTS
T ss_pred eEEEEeCCCCCCEEEEEEEEEeCCC
Confidence 3567899999999998888876544
No 39
>TIGR00868 hCaCC calcium-activated chloride channel protein 1. distributions. found a row in 1A13.INFO that was not parsed out
Probab=54.12 E-value=35 Score=28.05 Aligned_cols=41 Identities=24% Similarity=0.450 Sum_probs=29.0
Q ss_pred CCCCCCeeEEEecCCcEEEEEecCCCC--CCCceeeEEEEEEEe
Q psy17638 17 VPGTVQSLEAIPLGSSAFYLKWVKPEQ--PNGVLMGYKIKYQSV 58 (160)
Q Consensus 17 ~P~~p~~~~~~~~~~~~~~l~W~~p~~--~~g~i~~y~i~~~~~ 58 (160)
+|++..+|++. .....+.|+|..|.+ ..|....|.|++...
T Consensus 755 PP~rItDL~~~-~~~~~v~LsWTAPG~d~D~G~a~~y~ir~s~~ 797 (863)
T TIGR00868 755 PPSKITDLEAG-FQGDNIILTWTAPGDVLDHGRADRYIIRISTS 797 (863)
T ss_pred CCccceeeEEe-ecCCEEEEEeeCCCccCCCCccceEEEEecCC
Confidence 34555566653 455679999999965 467778999988653
No 40
>PF00907 T-box: T-box; InterPro: IPR001699 Transcription factors of the T-box family are required both for early cell-fate decisions, such as those necessary for formation of the basic vertebrate body plan, and for differentiation and organogenesis []. The T-box is defined as the minimal region within the T-box protein that is both necessary and sufficient for sequence-specific DNA binding, all members of the family so far examined bind to the DNA consensus sequence TCACACCT. The T-box is a relatively large DNA-binding domain, generally comprising about a third of the entire protein (17-26 kDa). These genes were uncovered on the basis of similarity to the DNA binding domain [] of Mus musculus (Mouse) Brachyury (T) gene product, which similarity is the defining feature of the family. The Brachyury gene is named for its phenotype, which was identified 70 years ago as a mutant mouse strain with a short blunted tail. The gene, and its paralogues, have become a well-studied model for the family, and hence much of what is known about the T-box family is derived from the murine Brachyury gene. Consistent with its nuclear location, Brachyury protein has a sequence-specific DNA-binding activity and can act as a transcriptional regulator []. Homozygous mutants for the gene undergo extensive developmental anomalies, thus rendering the mutation lethal []. The postulated role of Brachyury is as a transcription factor, regulating the specification and differentiation of posterior mesoderm during gastrulation in a dose-dependent manner []. T-box proteins tend to be expressed in specific organs or cell types, especially during development, and they are generally required for the development of those tissues, for example, Brachyury is expressed in posterior mesoderm and in the developing notochord, and it is required for the formation of these cells in mice []. ; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus; PDB: 1H6F_B 4A04_A 1XBR_B 2X6V_B 2X6U_A.
Probab=51.65 E-value=23 Score=22.81 Aligned_cols=25 Identities=28% Similarity=0.368 Sum_probs=19.0
Q ss_pred ccEEEEcCCCCCCEEEEEEEEEeCC
Q psy17638 78 TTSAKLAGLEPSTKYRIHLAGYTKA 102 (160)
Q Consensus 78 ~~~~~i~~l~p~~~y~~~v~a~~~~ 102 (160)
...+.|.||.|...|.+.|......
T Consensus 32 ~l~y~vsGL~p~~~Y~i~l~~~~~d 56 (184)
T PF00907_consen 32 TLEYSVSGLDPDSLYSISLHFERVD 56 (184)
T ss_dssp -EEEEEESS-TTSEEEEEEEEEESC
T ss_pred ccEEEecCCCCCcceEEEEEEEEec
Confidence 4578899999999999988766543
No 41
>PF10342 GPI-anchored: Ser-Thr-rich glycosyl-phosphatidyl-inositol-anchored membrane family; InterPro: IPR018466 This entry represents glycoproteins involved in cell wall (1-->6)-beta-glucan assembly. In yeast a null mutation leads to severe growth defects, aberrant multi-budded morphology, and mating defects [, ]. The entry includes DRMIP and Hesp-379, which are involved in both fruiting body formation and in host attack respectively. Hesp-379 is a haustorially expressed secreted protein; the haustorium being the small sucker that penetrates host tissue [].
Probab=48.26 E-value=46 Score=18.29 Aligned_cols=65 Identities=11% Similarity=0.141 Sum_probs=32.2
Q ss_pred CcEEEEEecCCCCCCCceeeEEEEEEEeCCcccCCceeeceeeeCC-cccEEEE-cCCCCCCEEEEEEEE
Q psy17638 31 SSAFYLKWVKPEQPNGVLMGYKIKYQSVKGTKVGPLLERLPYISDP-TTTSAKL-AGLEPSTKYRIHLAG 98 (160)
Q Consensus 31 ~~~~~l~W~~p~~~~g~i~~y~i~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~i-~~l~p~~~y~~~v~a 98 (160)
...++|.|..... ....|.|.................-..... ....+.+ .+|.++..|.|++..
T Consensus 12 g~~~~I~W~~~~~---~~~~~~I~L~~g~~~~~~~~~~ia~~v~~~~gs~~~~~p~~l~~~~~Y~i~~~~ 78 (93)
T PF10342_consen 12 GQPITITWTSDGT---DPGNVTIYLCNGNNTNLNFVQTIASNVSNSDGSYTWTIPSDLPSGGDYFIQIVN 78 (93)
T ss_pred CCcEEEEEeCCCC---CCcEEEEEEEcCCCCCcceeEEEEecccCCCCEEEEEcCCCCCCCCcEEEEEEE
Confidence 3679999986531 124566666554331111111000001111 2233444 448888889888883
No 42
>PF14292 SusE: SusE outer membrane protein
Probab=46.68 E-value=61 Score=19.22 Aligned_cols=30 Identities=10% Similarity=0.341 Sum_probs=21.7
Q ss_pred CCcEEEEEecCCCCCCC-ceeeEEEEEEEeC
Q psy17638 30 GSSAFYLKWVKPEQPNG-VLMGYKIKYQSVK 59 (160)
Q Consensus 30 ~~~~~~l~W~~p~~~~g-~i~~y~i~~~~~~ 59 (160)
....+++.|+++..... ....|.++....+
T Consensus 43 ~~~a~tftW~~~~~~~~~a~v~Y~lq~~~~~ 73 (122)
T PF14292_consen 43 SDNAVTFTWTAADYGGPDAPVTYTLQFDKKG 73 (122)
T ss_pred CCceEEEEEECCccCCCCCceEEEEEEeccC
Confidence 44689999999865433 5568999887653
No 43
>PF09423 PhoD: PhoD-like phosphatase; InterPro: IPR018946 This entry contains a number of putative proteins as well as Alkaline phosphatase D which catalyses the reaction: A phosphate monoester + H(2)O = an alcohol + phosphate ; PDB: 2YEQ_B.
Probab=46.24 E-value=38 Score=25.38 Aligned_cols=21 Identities=24% Similarity=0.485 Sum_probs=13.6
Q ss_pred cEEEEcCCCCCCEEEEEEEEE
Q psy17638 79 TSAKLAGLEPSTKYRIHLAGY 99 (160)
Q Consensus 79 ~~~~i~~l~p~~~y~~~v~a~ 99 (160)
..+.+.+|.|++.|..++...
T Consensus 64 ~~v~v~gL~p~t~Y~Y~~~~~ 84 (453)
T PF09423_consen 64 VKVDVTGLQPGTRYYYRFVVD 84 (453)
T ss_dssp EEEEE-S--TT-EEEEEEEE-
T ss_pred eecccCCCCCCceEEEEEEEe
Confidence 357789999999999888883
No 44
>PF07353 Uroplakin_II: Uroplakin II; InterPro: IPR009952 This family contains uroplakin II, which is approximately 180 residues long and seems to be restricted to mammals. Uroplakin II is an integral membrane protein, and is one of the components of the apical plaques of mammalian urothelium formed by the asymmetric unit membrane - this is believed to play a role in strengthening the urothelial apical surface to prevent the cells from rupturing during bladder distension [].; GO: 0016044 cellular membrane organization, 0030176 integral to endoplasmic reticulum membrane
Probab=46.07 E-value=77 Score=20.19 Aligned_cols=25 Identities=24% Similarity=0.335 Sum_probs=20.1
Q ss_pred cccEEEEcCCCCCCEEEEEEEEEeC
Q psy17638 77 TTTSAKLAGLEPSTKYRIHLAGYTK 101 (160)
Q Consensus 77 ~~~~~~i~~l~p~~~y~~~v~a~~~ 101 (160)
....|.+.+|.||+.|.|+-.....
T Consensus 100 rlsaYqVtNL~pGTkY~isY~Vtkg 124 (184)
T PF07353_consen 100 RLSAYQVTNLQPGTKYYISYLVTKG 124 (184)
T ss_pred cceeEEeeccCCCcEEEEEEEEecC
Confidence 3567899999999999987766554
No 45
>KOG4806|consensus
Probab=44.63 E-value=1.2e+02 Score=22.16 Aligned_cols=30 Identities=20% Similarity=0.458 Sum_probs=24.2
Q ss_pred cEEEEcCCCCCCEEEEEEEEEeCCCCCCcc
Q psy17638 79 TSAKLAGLEPSTKYRIHLAGYTKAGDGADY 108 (160)
Q Consensus 79 ~~~~i~~l~p~~~y~~~v~a~~~~g~g~~~ 108 (160)
..-+|.+|.||..|-+-|.|.-..|...+.
T Consensus 410 ~teTI~gL~PgssYlldv~a~~~~g~~lpy 439 (454)
T KOG4806|consen 410 ETETILGLMPGSSYLLDVTANLSMGKPLPY 439 (454)
T ss_pred hhhhhcccccCceEEEEEEEcccCCccccc
Confidence 356789999999999999998877665444
No 46
>PF09240 IL6Ra-bind: Interleukin-6 receptor alpha chain, binding; InterPro: IPR015321 Members of this entry adopt a structure consisting of an immunoglobulin-like beta-sandwich, with seven strands in two beta-sheets, in a Greek-key topology. They are required for binding to the cytokine Interleukin-6 []. ; PDB: 1N26_A 1P9M_C 3LB6_C 1PVH_A 3L5H_A 1BQU_A 1I1R_A 3QT2_B 3BPN_C 3BPO_C ....
Probab=44.56 E-value=59 Score=18.41 Aligned_cols=84 Identities=12% Similarity=0.105 Sum_probs=47.2
Q ss_pred CCeeEEEecCCcEEEEEecCCCCCCCceeeEEEEEEEeCCcccCCceeeceeeeCCcc--cEEEEcCC--CCCCEEEEEE
Q psy17638 21 VQSLEAIPLGSSAFYLKWVKPEQPNGVLMGYKIKYQSVKGTKVGPLLERLPYISDPTT--TSAKLAGL--EPSTKYRIHL 96 (160)
Q Consensus 21 p~~~~~~~~~~~~~~l~W~~p~~~~g~i~~y~i~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~i~~l--~p~~~y~~~v 96 (160)
|.++.+...+...+.=+|.+..... .-..|.+.++........+ ... ........ ..+..... .....|.|.|
T Consensus 2 ~~nlsC~~~~~~~m~CtW~~g~~~~-~~t~y~L~~~~~~~~~~~e-C~~-y~~~~~~~~gC~~~~~~~~~~~~~~~~v~V 78 (99)
T PF09240_consen 2 PQNLSCFIYNLEYMNCTWEPGKEAP-PDTQYTLYYWYSPLEEEKE-CPH-YSKDSGTRIGCQFPVSEIDSSEFSQYNVCV 78 (99)
T ss_dssp -EEEEEEEETTTEEEEEEECCTTCS-TTEEEEEEEEETTSSSEEE-ESE-EEESTSSEEEEEEESCTT-TTTTSEEEEEE
T ss_pred CeeCEEEEECCEEEEEEECCCCCCC-CcccEEEEEEcCCCCcccc-CCC-ccccCCceeEEEecCCCccccccceEEEEE
Confidence 4567777777889999998754322 2357888888754222111 111 11111122 23333333 2346899999
Q ss_pred EEEeCCCCCCc
Q psy17638 97 AGYTKAGDGAD 107 (160)
Q Consensus 97 ~a~~~~g~g~~ 107 (160)
.+.+..|.-..
T Consensus 79 ~~ss~~~~i~~ 89 (99)
T PF09240_consen 79 NGSSSAGSIRS 89 (99)
T ss_dssp EEEETTEEEEC
T ss_pred EeccCCCccCC
Confidence 99888766443
No 47
>PF11811 DUF3331: Domain of unknown function (DUF3331); InterPro: IPR021769 This family of proteins are functionally uncharacterised. This family is only found in bacteria. Proteins in this family vary in length from 96 to 160 amino acids.
Probab=42.74 E-value=58 Score=18.74 Aligned_cols=22 Identities=9% Similarity=0.397 Sum_probs=16.8
Q ss_pred CCeeEEEec-CCcEEEEEecCCC
Q psy17638 21 VQSLEAIPL-GSSAFYLKWVKPE 42 (160)
Q Consensus 21 p~~~~~~~~-~~~~~~l~W~~p~ 42 (160)
+..+.+.+. +.+++.|.|..|.
T Consensus 15 ~~~I~vlEr~S~~t~~V~W~D~~ 37 (96)
T PF11811_consen 15 PVRIRVLERPSDTTLSVSWSDPT 37 (96)
T ss_pred CCEEEEEEecCCCEEEEEEECCC
Confidence 456666666 7899999998774
No 48
>PF10333 Pga1: GPI-Mannosyltransferase II co-activator; InterPro: IPR019433 Pga1 is found only in yeasts and not in mammals. It localises in the ER as a glycosylated integral membrane protein. It binds to the GPI-mannosyltransferase II subunit of the GPI and it is responsible for the second mannose addition to GPI precursors. The GPI-anchoring complex is a glycolipid that functions as a membrane anchor for many cell-surface proteins [].
Probab=40.65 E-value=54 Score=21.34 Aligned_cols=28 Identities=21% Similarity=0.370 Sum_probs=22.0
Q ss_pred CCcccEEEEcCCCCCCEEEEEEEEEeCC
Q psy17638 75 DPTTTSAKLAGLEPSTKYRIHLAGYTKA 102 (160)
Q Consensus 75 ~~~~~~~~i~~l~p~~~y~~~v~a~~~~ 102 (160)
..+.....+.+|+++..|.++++-.-..
T Consensus 62 ~~~t~~V~L~nl~~~e~y~vKiCW~At~ 89 (180)
T PF10333_consen 62 PGSTTYVELNNLQPGETYQVKICWPATD 89 (180)
T ss_pred CCceEEEEeccCCCCCeEEEEEEEeccC
Confidence 3456678899999999999999865443
No 49
>PF13750 Big_3_3: Bacterial Ig-like domain (group 3)
Probab=37.61 E-value=89 Score=19.74 Aligned_cols=33 Identities=21% Similarity=0.332 Sum_probs=24.6
Q ss_pred EcCCCCCCEEEEEEEEEeCCCCCCcceeEEeec
Q psy17638 83 LAGLEPSTKYRIHLAGYTKAGDGADYFIEHRTR 115 (160)
Q Consensus 83 i~~l~p~~~y~~~v~a~~~~g~g~~~~~~~~t~ 115 (160)
+..|..+..|.+.|.|...+|........+..+
T Consensus 116 fpsle~~~~YtLtV~a~D~aGN~~~~si~F~y~ 148 (158)
T PF13750_consen 116 FPSLEADDSYTLTVSATDKAGNQSTKSISFSYM 148 (158)
T ss_pred cCCcCCCCeEEEEEEEEecCCCEEEEEEEEEEe
Confidence 356778999999999999998865444455443
No 50
>KOG0613|consensus
Probab=37.00 E-value=2.9e+02 Score=24.16 Aligned_cols=106 Identities=11% Similarity=0.103 Sum_probs=51.7
Q ss_pred cCCcEEEEEecCCCCCCC-ceeeEEEEEEEeCCcccCCceeeceeeeCCcccEEEEcCCCCCCEEEEEEEEEeCCCCCCc
Q psy17638 29 LGSSAFYLKWVKPEQPNG-VLMGYKIKYQSVKGTKVGPLLERLPYISDPTTTSAKLAGLEPSTKYRIHLAGYTKAGDGAD 107 (160)
Q Consensus 29 ~~~~~~~l~W~~p~~~~g-~i~~y~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~l~p~~~y~~~v~a~~~~g~g~~ 107 (160)
....-+...|..+....+ ....|.|.-+......+.. .+........+..+. |.+++++....|...+
T Consensus 151 ~~~~~v~~~~~~~~~d~~~k~~~~iie~~~~g~s~~~~-------~l~~~~~~s~~~~a~----~e~~v~~v~s~gqp~~ 219 (1205)
T KOG0613|consen 151 VHRARVLPAWRPPREDGGRKREEYIIERRQAGRSPWLK-------RLTQPTDDSTDTDAH----YESRVRAVVSAGQPEP 219 (1205)
T ss_pred ecCceecccccCCccCccceeecceEEEEeccCCCccc-------ccccccCCcceeccc----cceeEEEEEecCCCcc
Confidence 344567788988765444 6777888777664444321 111222222233333 5555555555555443
Q ss_pred ceeEEeecCCCCCCCCCCccceeeecccCccCeEEEEEecCC
Q psy17638 108 YFIEHRTRGTDNVAPGKPNFKLVRSGTENGYGAFKVIWEPNP 149 (160)
Q Consensus 108 ~~~~~~t~~~~~~~p~~p~~~~~~~~~~~~~~~~~l~W~~p~ 149 (160)
.. ..-......++++.+... +..+..+--+.+.|..+.
T Consensus 220 ~~-~~~~~g~~i~~~~~~~~s---~t~e~s~~~i~~~l~~~~ 257 (1205)
T KOG0613|consen 220 LE-RWEILGELIAPGSAPSMS---VTLEASDRKIQLTLRAPR 257 (1205)
T ss_pred ch-hhhhcccccCCCCccccc---cccchhheeeeEEeecCC
Confidence 21 000111112234444331 222335566778898887
No 51
>cd02848 Chitinase_N_term Chitinase N-terminus domain. Chitinases hydrolyze the abundant natural biopolymer chitin, producing smaller chito-oligosaccharides. Chitin consists of multiple N-acetyl-D-glucosamine (NAG) residues connected via beta-1,4-glycosidic linkages and is an important structural element of fungal cell wall and arthropod exoskeletons. On the basis of the mode of chitin hydrolysis, chitinases are classified as random, endo-, and exo-chitinases and based on sequence criteria, chitinases belong to families 18 and 19 of glycosyl hydrolases. The N-terminus of chitinase may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitob
Probab=36.09 E-value=94 Score=18.29 Aligned_cols=23 Identities=13% Similarity=0.241 Sum_probs=18.7
Q ss_pred CCCCCEEEEEEEEEeCCCCCCcc
Q psy17638 86 LEPSTKYRIHLAGYTKAGDGADY 108 (160)
Q Consensus 86 l~p~~~y~~~v~a~~~~g~g~~~ 108 (160)
...+.+|.++|.+++..|.+...
T Consensus 76 v~kgG~y~m~V~lCn~dGCS~S~ 98 (106)
T cd02848 76 VGKGGRYQMQVALCNGDGCSTSA 98 (106)
T ss_pred eCCCCeEEEEEEEECCCCccCcC
Confidence 45677999999999999876543
No 52
>PF04734 Ceramidase_alk: Neutral/alkaline non-lysosomal ceramidase; InterPro: IPR006823 This family represents a group of neutral/alkaline ceramidases found in both bacteria and eukaryotes [, , ]. They hydrolyse the sphingolipid ceramide into sphingosine and free fatty acid.; PDB: 2ZXC_A 2ZWS_A.
Probab=35.54 E-value=51 Score=26.34 Aligned_cols=34 Identities=24% Similarity=0.564 Sum_probs=19.2
Q ss_pred cceeeecccCccCeEEEEEecCCCCCCceeEEeC
Q psy17638 127 FKLVRSGTENGYGAFKVIWEPNPERPGSHFFVKY 160 (160)
Q Consensus 127 ~~~~~~~~~~~~~~~~l~W~~p~~~~~~~y~v~y 160 (160)
+.|++.....+.+.+.|+|..|...+...|+|.|
T Consensus 617 f~W~r~~~~~~~S~~ti~W~ip~~~~~G~YRi~~ 650 (674)
T PF04734_consen 617 FRWKRTGSLLGTSEVTIEWEIPPDTPPGTYRIRH 650 (674)
T ss_dssp EEEEEETTT--EEEEEEEEE--TT--SEEEEEEE
T ss_pred EEEEecCCccccEEEEEEEECCCCCCCCCEEEEE
Confidence 3344443333467899999998865667888876
No 53
>PF14054 DUF4249: Domain of unknown function (DUF4249)
Probab=33.49 E-value=1.6e+02 Score=20.35 Aligned_cols=15 Identities=20% Similarity=0.361 Sum_probs=12.5
Q ss_pred CCCCCCEEEEEEEEE
Q psy17638 85 GLEPSTKYRIHLAGY 99 (160)
Q Consensus 85 ~l~p~~~y~~~v~a~ 99 (160)
...+|..|.++|...
T Consensus 95 ~~~~G~~Y~L~V~~~ 109 (298)
T PF14054_consen 95 RGRPGRTYRLEVETP 109 (298)
T ss_pred cccCCCEEEEEEEEC
Confidence 577899999999774
No 54
>KOG4806|consensus
Probab=30.68 E-value=90 Score=22.81 Aligned_cols=29 Identities=14% Similarity=0.204 Sum_probs=23.7
Q ss_pred CcccEEEEcCCCCCCEEEEEEEEEeCCCC
Q psy17638 76 PTTTSAKLAGLEPSTKYRIHLAGYTKAGD 104 (160)
Q Consensus 76 ~~~~~~~i~~l~p~~~y~~~v~a~~~~g~ 104 (160)
++.+.+++..|.|.+.|.|.|...+....
T Consensus 167 g~~n~~~v~~L~pdt~Y~~dvFvv~~~r~ 195 (454)
T KOG4806|consen 167 GNKNIFTVSDLKPDTQYYFDVFVVNINRN 195 (454)
T ss_pred CCccEEEhhhcCCCceEEEEEEEEecCCC
Confidence 44568889999999999999998886533
No 55
>cd00182 TBOX T-box DNA binding domain of the T-box family of transcriptional regulators. The T-box family is an ancient group that appears to play a critical role in development in all animal species. These genes were uncovered on the basis of similarity to the DNA binding domain of murine Brachyury (T) gene product, the defining feature of the family. Common features shared by T-box family members are DNA-binding and transcriptional regulatory activity, a role in development and conserved expression patterns, most of the known genes in all species being expressed in mesoderm or mesoderm precursors.
Probab=29.14 E-value=99 Score=20.29 Aligned_cols=24 Identities=25% Similarity=0.445 Sum_probs=19.3
Q ss_pred ccEEEEcCCCCCCEEEEEEEEEeC
Q psy17638 78 TTSAKLAGLEPSTKYRIHLAGYTK 101 (160)
Q Consensus 78 ~~~~~i~~l~p~~~y~~~v~a~~~ 101 (160)
...+.+.||.|...|.+.+.....
T Consensus 34 ~l~~~vsGLdp~~~Y~v~l~~~~~ 57 (188)
T cd00182 34 TLKVKVSGLDPNALYSVLMDLVPV 57 (188)
T ss_pred ceEEEEeCCCcccceEEEEEEEEc
Confidence 357889999999999998865543
No 56
>PHA02579 7 baseplate wedge subunit; Provisional
Probab=28.23 E-value=3.5e+02 Score=22.49 Aligned_cols=76 Identities=20% Similarity=0.204 Sum_probs=43.1
Q ss_pred CeeEEEecCCcEEEEEecCCCCCCCceeeEEEEEEEeCCcccC--CceeeceeeeC-CcccEEEEcCCCCCCEEEEEEEE
Q psy17638 22 QSLEAIPLGSSAFYLKWVKPEQPNGVLMGYKIKYQSVKGTKVG--PLLERLPYISD-PTTTSAKLAGLEPSTKYRIHLAG 98 (160)
Q Consensus 22 ~~~~~~~~~~~~~~l~W~~p~~~~g~i~~y~i~~~~~~~~~~~--~~~~~~~~~~~-~~~~~~~i~~l~p~~~y~~~v~a 98 (160)
..+++...++..+.|+|+.. |.=..|.|+.......... ......|..++ .....+--..+.|.+.|.+||+.
T Consensus 9 tslrI~kLsaN~v~l~WddV----G~NFyY~Ve~a~t~~~~g~~ip~~~~~w~nlg~T~~~~wFed~~~p~t~Yk~Rv~~ 84 (1030)
T PHA02579 9 TSLRIDKLSANQVYLTWDDV----GANFYYFVELAETRDADGELIPDDELRWINLGYTANNEWFEDKLQPNTYYKFRVAV 84 (1030)
T ss_pred cEEEhhhhccceEEEEeecc----CCceEEEEEEEeeccCCCccCCCccccceecCcccchhhhhhccCCcceEEEEEEe
Confidence 35667677888999999865 2225688877754322111 00111121212 12223323448999999999987
Q ss_pred EeC
Q psy17638 99 YTK 101 (160)
Q Consensus 99 ~~~ 101 (160)
...
T Consensus 85 ~~q 87 (1030)
T PHA02579 85 AAQ 87 (1030)
T ss_pred ecc
Confidence 553
No 57
>PF08329 ChitinaseA_N: Chitinase A, N-terminal domain; InterPro: IPR013540 This domain is found in a number of bacterial chitinases and similar viral proteins. It is organised into a fibronectin III module domain-like fold, comprising only beta strands. Its function is not known, but it may be involved in interaction with the enzyme substrate, chitin [, ]. It is separated by a hinge region from the catalytic domain (IPR001223 from INTERPRO); this hinge region is probably mobile, allowing the N-terminal domain to have different relative positions in solution []. ; GO: 0004568 chitinase activity; PDB: 2WLY_A 1EDQ_A 2WM0_A 1X6N_A 1NH6_A 2WK2_A 1EHN_A 2WLZ_A 1EIB_A 1FFR_A ....
Probab=28.19 E-value=90 Score=19.21 Aligned_cols=26 Identities=12% Similarity=0.238 Sum_probs=17.2
Q ss_pred CCCCCEEEEEEEEEeCCCCCCcceeE
Q psy17638 86 LEPSTKYRIHLAGYTKAGDGADYFIE 111 (160)
Q Consensus 86 l~p~~~y~~~v~a~~~~g~g~~~~~~ 111 (160)
...+.+|.++|..+|..|+.......
T Consensus 79 ~~~gG~y~~~VeLCN~~GCS~S~~~~ 104 (133)
T PF08329_consen 79 VTKGGRYQMQVELCNADGCSTSAPVE 104 (133)
T ss_dssp E-S-EEEEEEEEEEETTEEEE---EE
T ss_pred ecCCCEEEEEEEEECCCCcccCCCEE
Confidence 35677999999999999976544333
No 58
>cd08544 Reeler Reeler, the N-terminal domain of reelin, F-spondin, and a variety of other proteins. This domain is found at the N-terminus of F-spondin, a protein attached to the extracellular matrix, which plays roles in neuronal development and vascular remodelling. The F-spondin reeler domain has been reported to bind heparin. The reeler domain is also found at the N-terminus of reelin, an extracellular glycoprotein involved in the development of the brain cortex, and in a variety of other eukaryotic proteins with different domain architectures, including the animal ferric-chelate reductase 1 or stromal cell-derived receptor 2, a member of the cytochrome B561 family, which reduces ferric iron before its transport from the endosome to the cytoplasm. Also included is the insect putative defense protein 1, which is expressed upon bacterial infection and appears to contain a single reeler domain.
Probab=27.36 E-value=1.1e+02 Score=18.50 Aligned_cols=19 Identities=16% Similarity=0.560 Sum_probs=14.1
Q ss_pred CCcEEEEEecCCCCCCCce
Q psy17638 30 GSSAFYLKWVKPEQPNGVL 48 (160)
Q Consensus 30 ~~~~~~l~W~~p~~~~g~i 48 (160)
....+.+.|.+|....|.|
T Consensus 94 ~k~~v~~~W~AP~~~~g~V 112 (135)
T cd08544 94 KKTSVTFTWTAPSNGSGCV 112 (135)
T ss_pred CCceeEEEEECCCCCCccE
Confidence 3568999999997655554
No 59
>smart00425 TBOX Domain first found in the mice T locus (Brachyury) protein.
Probab=26.92 E-value=1.1e+02 Score=20.02 Aligned_cols=23 Identities=26% Similarity=0.456 Sum_probs=18.8
Q ss_pred ccEEEEcCCCCCCEEEEEEEEEe
Q psy17638 78 TTSAKLAGLEPSTKYRIHLAGYT 100 (160)
Q Consensus 78 ~~~~~i~~l~p~~~y~~~v~a~~ 100 (160)
...+.+.||.|...|.+.+....
T Consensus 33 ~l~~~vsGLdp~~~Y~v~l~~~~ 55 (190)
T smart00425 33 TLKYKVSGLDPNALYSVLMDLVP 55 (190)
T ss_pred eeEEEEeCCCccCcEEEEEEEEE
Confidence 35788999999999999886554
No 60
>PF13205 Big_5: Bacterial Ig-like domain
Probab=26.25 E-value=71 Score=18.02 Aligned_cols=16 Identities=25% Similarity=0.486 Sum_probs=12.9
Q ss_pred cCCCCCCEEEEEEEEE
Q psy17638 84 AGLEPSTKYRIHLAGY 99 (160)
Q Consensus 84 ~~l~p~~~y~~~v~a~ 99 (160)
..|.+++.|.+.|...
T Consensus 70 ~~L~~~t~Y~v~i~~~ 85 (107)
T PF13205_consen 70 QPLKPGTTYTVTIDSG 85 (107)
T ss_pred CcCCCCCEEEEEECCC
Confidence 4599999999999543
No 61
>PF14686 fn3_3: Polysaccharide lyase family 4, domain II; PDB: 1NKG_A 2XHN_B 3NJX_A 3NJV_A.
Probab=25.25 E-value=71 Score=18.21 Aligned_cols=21 Identities=19% Similarity=0.406 Sum_probs=13.4
Q ss_pred cccEEEEcCCCCCCEEEEEEEE
Q psy17638 77 TTTSAKLAGLEPSTKYRIHLAG 98 (160)
Q Consensus 77 ~~~~~~i~~l~p~~~y~~~v~a 98 (160)
..-.+.|.+++||+ |.+.+.+
T Consensus 48 ~~G~Fti~~V~pGt-Y~L~ay~ 68 (95)
T PF14686_consen 48 SDGNFTIPNVRPGT-YRLYAYA 68 (95)
T ss_dssp TTSEEE---B-SEE-EEEEEEE
T ss_pred CCCcEEeCCeeCcE-eEEEEEE
Confidence 45678999999996 9887777
No 62
>PRK14438 acylphosphatase; Provisional
Probab=25.23 E-value=1.1e+02 Score=17.18 Aligned_cols=21 Identities=19% Similarity=0.120 Sum_probs=15.7
Q ss_pred CeEEEEEecCCCCCCceeEEeC
Q psy17638 139 GAFKVIWEPNPERPGSHFFVKY 160 (160)
Q Consensus 139 ~~~~l~W~~p~~~~~~~y~v~y 160 (160)
..+.+.|.+|. ....+|.|.|
T Consensus 71 ~~v~~~~~~~~-~~~~~F~I~~ 91 (91)
T PRK14438 71 SGVIVEREEFR-GEFDDFDIRY 91 (91)
T ss_pred EEEEEEEcCCC-CCCCCEEEeC
Confidence 45678888886 5457898887
No 63
>PF02014 Reeler: Reeler domain Schematic picture including Reeler domain; InterPro: IPR002861 Extracellular matrix (ECM) proteins play an important role in early cortical development, specifically in the formation of neural connections and in controlling the cyto-architecture of the central nervous system. The product of the reeler gene in mouse is reelin,a large extracellular protein secreted by pioneer neurons that coordinates cell positioning during neurodevelopment []. F-spondin and mindin are a family of matrix-attached adhesion molecules that share structural similarities and overlapping domains of expression. Both F-spondin and mindin promote adhesion and outgrowth of hippocampal embryonic neurons and bind to a putative receptor(s) expressed on both hippocampal and sensory neurons []. This domain of unknown function is found at the N terminus of reelin and F-spondin.; PDB: 2ZOT_B 2ZOU_B 3COO_A.
Probab=24.09 E-value=1.1e+02 Score=18.33 Aligned_cols=18 Identities=17% Similarity=0.571 Sum_probs=10.7
Q ss_pred CcEEEEEecCCCCCCCce
Q psy17638 31 SSAFYLKWVKPEQPNGVL 48 (160)
Q Consensus 31 ~~~~~l~W~~p~~~~g~i 48 (160)
...+++.|.+|....|.|
T Consensus 99 K~~v~~~W~AP~~~~g~V 116 (132)
T PF02014_consen 99 KTSVTFTWTAPADFSGCV 116 (132)
T ss_dssp BSEEEEEEE--STTS-EE
T ss_pred ccEEEEEEcCCCCCCceE
Confidence 468999999996544543
No 64
>PF05345 He_PIG: Putative Ig domain; InterPro: IPR008009 This alignment represents the conserved core region of a ~90 residue repeat found in several haemagglutinins and other cell surface proteins. Sequence similarities to Hyalin (IPR003410 from INTERPRO) and the PKD domain (IPR000601 from INTERPRO) suggest an Ig-like fold so this family may be similar in function to the (IPR003791 from INTERPRO) and (IPR003790 from INTERPRO) protein families.
Probab=21.90 E-value=1.2e+02 Score=14.79 Aligned_cols=13 Identities=15% Similarity=0.411 Sum_probs=10.5
Q ss_pred EEEEEEEEEeCCC
Q psy17638 91 KYRIHLAGYTKAG 103 (160)
Q Consensus 91 ~y~~~v~a~~~~g 103 (160)
.|.|.|.+.+..|
T Consensus 37 ~y~~~vtatd~~G 49 (49)
T PF05345_consen 37 TYTFTVTATDGSG 49 (49)
T ss_pred EEEEEEEEEcCCC
Confidence 7999999887654
No 65
>cd02856 Glycogen_debranching_enzyme_N_term Glycogen_debranching_enzyme N-terminal domain. Glycogen debranching enzymes have both 4-alpha-glucanotransferase and amylo-1,6-glucosidase activities. As a transferase it transfers a segment of a 1,4-alpha-D-glucan to a new 4-position in an acceptor, which may be glucose or another 1,4-alpha-D-glucan. As a glucosidase it catalyzes the endohydrolysis of 1,6-alpha-D-glucoside linkages at points of branching in chains of 1,4-linked alpha-D-glucose residues. The N-terminus of the glycogen debranching enzyme may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitobiase, and chitinase.
Probab=21.73 E-value=1.7e+02 Score=16.67 Aligned_cols=20 Identities=20% Similarity=0.464 Sum_probs=16.0
Q ss_pred cEEEEcCCCCCCEEEEEEEE
Q psy17638 79 TSAKLAGLEPSTKYRIHLAG 98 (160)
Q Consensus 79 ~~~~i~~l~p~~~y~~~v~a 98 (160)
-+..+.++.++..|.++|..
T Consensus 47 W~~~v~~~~~g~~Y~y~i~g 66 (103)
T cd02856 47 WHGFLPGIKAGQRYGFRVHG 66 (103)
T ss_pred EEEEECCCCCCCEEEEEECC
Confidence 34667788899999999965
No 66
>PRK14450 acylphosphatase; Provisional
Probab=20.45 E-value=1.6e+02 Score=16.55 Aligned_cols=21 Identities=29% Similarity=0.494 Sum_probs=14.9
Q ss_pred CeEEEEEecCCCCCCceeEEeC
Q psy17638 139 GAFKVIWEPNPERPGSHFFVKY 160 (160)
Q Consensus 139 ~~~~l~W~~p~~~~~~~y~v~y 160 (160)
..+.+.|.++. .....|.|.|
T Consensus 71 ~~v~~~~~~~~-~~~~~F~I~~ 91 (91)
T PRK14450 71 KEVETSWETAT-ANYSDFRIKY 91 (91)
T ss_pred EEEEEEEcCCC-CCCCCEEEeC
Confidence 34667788876 4457888887
Done!