Query psy9822
Match_columns 380
No_of_seqs 312 out of 1858
Neff 6.9
Searched_HMMs 46136
Date Fri Aug 16 20:35:25 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy9822.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/9822hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG0994|consensus 99.7 4.7E-17 1E-21 173.2 9.2 173 174-379 848-1063(1758)
2 KOG0994|consensus 99.6 4E-16 8.8E-21 166.2 9.8 158 192-379 774-960 (1758)
3 KOG1388|consensus 99.3 1.4E-12 3E-17 118.5 4.5 127 113-369 1-128 (217)
4 KOG3512|consensus 99.2 2.8E-11 6E-16 120.4 5.6 102 259-379 277-386 (592)
5 KOG1836|consensus 99.1 6.2E-10 1.4E-14 127.5 11.3 160 190-379 734-936 (1705)
6 KOG3512|consensus 99.0 4.3E-10 9.3E-15 112.1 6.6 78 274-378 358-439 (592)
7 KOG1836|consensus 98.8 4.6E-09 1E-13 120.5 7.5 108 244-380 696-825 (1705)
8 KOG1225|consensus 98.6 7.7E-08 1.7E-12 99.5 8.3 74 207-290 232-310 (525)
9 KOG1225|consensus 98.6 1.3E-07 2.9E-12 97.7 8.7 99 174-291 239-342 (525)
10 KOG1214|consensus 98.2 5.4E-06 1.2E-10 88.0 8.8 107 179-287 728-859 (1289)
11 KOG4289|consensus 98.1 7E-06 1.5E-10 90.9 7.5 107 226-373 1715-1843(2531)
12 cd00055 EGF_Lam Laminin-type e 97.9 1.2E-05 2.7E-10 57.5 3.9 28 350-378 17-44 (50)
13 KOG1219|consensus 97.9 1.1E-05 2.5E-10 92.4 5.5 101 186-294 3865-3981(4289)
14 PF07645 EGF_CA: Calcium-bindi 97.9 6.5E-06 1.4E-10 56.7 2.1 36 184-220 1-36 (42)
15 PF00053 Laminin_EGF: Laminin 97.9 4.3E-06 9.3E-11 59.5 0.9 29 350-379 16-44 (49)
16 KOG4289|consensus 97.7 2.3E-05 5E-10 87.1 3.9 82 205-290 1218-1316(2531)
17 KOG1219|consensus 97.7 3.6E-05 7.9E-10 88.6 5.5 102 231-371 3869-3981(4289)
18 KOG1214|consensus 97.6 9.9E-05 2.1E-09 78.7 5.8 111 184-301 691-822 (1289)
19 smart00180 EGF_Lam Laminin-typ 97.6 7.8E-05 1.7E-09 52.4 3.3 25 350-375 16-40 (46)
20 KOG1226|consensus 97.5 0.0003 6.4E-09 74.9 8.0 75 210-290 479-580 (783)
21 cd00055 EGF_Lam Laminin-type e 97.0 0.0011 2.3E-08 47.4 4.4 40 260-302 2-45 (50)
22 KOG1226|consensus 97.0 0.0023 5.1E-08 68.3 8.0 87 194-293 515-623 (783)
23 PF12947 EGF_3: EGF domain; I 96.9 0.00034 7.3E-09 46.6 0.8 31 190-221 3-33 (36)
24 KOG3509|consensus 96.8 0.002 4.4E-08 71.1 6.6 116 243-379 718-853 (964)
25 KOG1217|consensus 96.8 0.012 2.6E-07 59.5 11.5 104 174-287 157-263 (487)
26 smart00179 EGF_CA Calcium-bind 96.7 0.0018 3.9E-08 42.8 3.3 36 184-221 1-37 (39)
27 PF00053 Laminin_EGF: Laminin 96.6 0.00089 1.9E-08 47.5 1.1 32 267-301 12-43 (49)
28 smart00180 EGF_Lam Laminin-typ 95.9 0.0081 1.7E-07 42.1 3.2 29 267-298 12-40 (46)
29 cd00054 EGF_CA Calcium-binding 95.8 0.012 2.5E-07 38.2 3.2 35 185-221 2-36 (38)
30 PF01437 PSI: Plexin repeat; 95.7 0.0038 8.3E-08 44.7 0.8 32 43-79 1-32 (51)
31 PF00008 EGF: EGF-like domain 95.4 0.0058 1.3E-07 39.5 0.8 26 194-220 5-31 (32)
32 KOG4260|consensus 95.2 0.019 4E-07 54.8 3.6 66 247-313 132-207 (350)
33 smart00051 DSL delta serrate l 94.7 0.03 6.5E-07 42.1 2.9 39 247-289 21-63 (63)
34 PF07974 EGF_2: EGF-like domai 94.5 0.022 4.9E-07 36.8 1.6 21 236-256 11-32 (32)
35 smart00423 PSI domain found in 94.1 0.034 7.3E-07 38.8 2.0 30 44-78 1-30 (46)
36 KOG4260|consensus 94.1 0.06 1.3E-06 51.4 4.2 63 274-375 128-190 (350)
37 cd00053 EGF Epidermal growth f 94.1 0.054 1.2E-06 34.3 2.8 27 193-220 6-32 (36)
38 KOG1217|consensus 94.0 0.16 3.5E-06 51.2 7.6 102 181-286 267-388 (487)
39 smart00051 DSL delta serrate l 93.7 0.061 1.3E-06 40.4 2.8 41 212-256 20-63 (63)
40 PF01437 PSI: Plexin repeat; 93.7 0.018 3.9E-07 41.1 -0.1 30 105-141 1-32 (51)
41 PF12661 hEGF: Human growth fa 93.5 0.023 4.9E-07 29.3 0.1 12 245-256 2-13 (13)
42 smart00181 EGF Epidermal growt 93.0 0.11 2.4E-06 33.4 2.9 25 193-219 6-30 (35)
43 KOG3509|consensus 92.9 0.12 2.5E-06 57.6 4.6 75 274-378 717-794 (964)
44 PF14670 FXa_inhibition: Coagu 89.6 0.21 4.6E-06 33.2 1.5 22 200-221 10-31 (36)
45 PF12662 cEGF: Complement Clr- 89.0 0.26 5.7E-06 29.8 1.5 20 208-227 1-20 (24)
46 PF07974 EGF_2: EGF-like domai 88.8 0.31 6.7E-06 31.5 1.8 26 261-289 7-32 (32)
47 cd01475 vWA_Matrilin VWA_Matri 84.5 0.83 1.8E-05 42.4 3.0 41 178-221 180-220 (224)
48 KOG1218|consensus 83.9 11 0.00025 36.2 10.8 41 248-291 96-139 (316)
49 smart00179 EGF_CA Calcium-bind 83.6 1.1 2.3E-05 29.1 2.4 27 262-289 11-38 (39)
50 KOG1388|consensus 83.4 1.4 3.1E-05 40.9 3.9 98 201-301 68-169 (217)
51 PF00008 EGF: EGF-like domain 83.4 0.25 5.4E-06 31.7 -0.7 26 261-287 5-31 (32)
52 PF06247 Plasmod_Pvs28: Plasmo 81.5 0.21 4.6E-06 45.3 -2.1 89 194-286 51-161 (197)
53 cd00054 EGF_CA Calcium-binding 79.0 1.9 4E-05 27.4 2.3 27 262-289 11-37 (38)
54 smart00423 PSI domain found in 78.3 1.2 2.6E-05 30.8 1.3 20 106-125 1-22 (46)
55 cd00053 EGF Epidermal growth f 78.1 2.7 5.8E-05 26.1 2.8 26 261-287 7-32 (36)
56 PF01414 DSL: Delta serrate li 78.0 0.68 1.5E-05 34.7 -0.1 10 280-289 54-63 (63)
57 KOG1218|consensus 74.9 23 0.0005 34.0 9.7 13 176-188 98-110 (316)
58 PF01414 DSL: Delta serrate li 74.9 0.61 1.3E-05 35.0 -1.0 45 208-256 16-63 (63)
59 PHA02887 EGF-like protein; Pro 73.1 1.8 4E-05 36.3 1.2 25 234-258 94-123 (126)
60 smart00181 EGF Epidermal growt 71.4 3.8 8.2E-05 26.0 2.2 23 262-286 8-30 (35)
61 PF12946 EGF_MSP1_1: MSP1 EGF 67.7 1.9 4.1E-05 28.9 0.2 24 198-221 9-33 (37)
62 PHA03099 epidermal growth fact 65.5 3 6.6E-05 35.6 1.0 25 234-258 53-82 (139)
63 KOG0196|consensus 55.7 17 0.00037 40.2 4.8 36 274-310 258-297 (996)
64 PTZ00214 high cysteine membran 51.6 98 0.0021 34.7 10.0 133 197-380 504-662 (800)
65 PHA03099 epidermal growth fact 50.6 9.5 0.00021 32.7 1.6 18 350-368 65-82 (139)
66 PF03302 VSP: Giardia variant- 50.6 1.2E+02 0.0026 30.9 9.9 21 288-311 92-112 (397)
67 PF01508 Paramecium_SA: Parame 46.0 16 0.00035 26.5 2.1 34 41-78 27-60 (63)
68 PF01683 EB: EB module; Inter 40.9 37 0.0008 23.7 3.2 29 251-284 17-45 (52)
69 PTZ00382 Variant-specific surf 37.3 76 0.0017 25.7 4.9 22 288-310 5-26 (96)
70 PF00954 S_locus_glycop: S-loc 34.2 38 0.00081 27.7 2.7 33 184-219 76-108 (110)
71 PHA02887 EGF-like protein; Pro 31.8 29 0.00063 29.3 1.6 18 350-368 106-123 (126)
72 smart00187 INB Integrin beta s 31.2 27 0.00059 35.9 1.6 20 111-132 1-21 (423)
73 KOG3848|consensus 30.1 23 0.00049 36.2 0.8 32 42-77 317-348 (516)
74 PF14843 GF_recep_IV: Growth f 29.7 17 0.00038 31.0 -0.0 22 352-373 73-95 (132)
75 PF00362 Integrin_beta: Integr 29.2 25 0.00055 36.3 1.0 19 111-131 1-20 (426)
76 PF05283 MGC-24: Multi-glycosy 24.6 73 0.0016 29.2 3.0 39 40-81 52-92 (186)
77 cd00185 TNFR Tumor necrosis fa 22.4 1.7E+02 0.0038 23.3 4.6 27 287-314 33-60 (98)
No 1
>KOG0994|consensus
Probab=99.69 E-value=4.7e-17 Score=173.20 Aligned_cols=173 Identities=32% Similarity=0.795 Sum_probs=137.0
Q ss_pred CCCccCCCCCCCccccCCCC-CCCCCCCeeec---CCCCccc-ccCCCceecCC---CCCcccc-CCCCCcC-----Ccc
Q psy9822 174 SFSWNYVKCPPENECLNDHH-TCDPQSEQCVD---LADGFEC-VCGRGYNKSEE---GGECVPV-CSQGCVR-----GVC 239 (380)
Q Consensus 174 ~~~w~~~~C~d~deC~~g~~-~C~~~~g~C~n---~~~g~~C-~C~~Gy~G~~~---~~~C~~~-C~~~C~~-----G~C 239 (380)
...|+|+.|... .| |+|. .|++..|.|++ ...|+.| +|..||+|+.. ...|.|. |+.+-.. -.|
T Consensus 848 pG~WgFPeCr~C-qC-NgHA~~Cd~~tGaCi~CqD~T~G~~CdrCl~GyyGdP~lg~g~~CrPCpCP~gp~Sg~~~A~sC 925 (1758)
T KOG0994|consen 848 PGYWGFPECRPC-QC-NGHADTCDPITGACIDCQDSTTGHSCDRCLDGYYGDPRLGSGIGCRPCPCPDGPASGRQHADSC 925 (1758)
T ss_pred CCccCCCcCccc-cc-cCcccccCccccccccccccccccchhhhhccccCCcccCCCCCCCCCCCCCCCccchhccccc
Confidence 356999999763 67 7776 89999998876 6689999 99999999976 5678888 8766322 245
Q ss_pred cC-----CCccccCCCcccccccc----------------cCCCCCCC------cccCCCCCcccccCCCCCCCCCCCCC
Q psy9822 240 SE-----PDKCQCDFGYVGVNCSI----------------QCQCNGHA------DCAGPDKLDVCLRCHNHTKGPQCDKC 292 (380)
Q Consensus 240 ~~-----~~~C~C~~G~~G~~C~~----------------~c~C~~~~------~C~~~~~~g~C~~C~~~~~G~~C~~C 292 (380)
.. .-.|.|..||+|.+|++ .|+|+++. .|. ..++.|++|...++|.||+.|
T Consensus 926 ~~d~~t~~ivC~C~~GY~G~RCe~CA~~~fGnP~~GGtCq~CeC~~NiD~~d~~aCD--~~TG~CLkCL~hTeG~hCe~C 1003 (1758)
T KOG0994|consen 926 YLDTRTQQIVCHCQEGYSGSRCEICADNHFGNPSEGGTCQKCECSNNIDLYDPGACD--VATGACLKCLYHTEGDHCEHC 1003 (1758)
T ss_pred cccccccceeeecccCccccchhhhcccccCCcccCCccccccccCCcCccCCCccc--hhhchhhhhhhcccccchhhc
Confidence 54 23899999999999975 35565542 243 368999999999999999999
Q ss_pred CCCccCCCCCCCCCcCCCCccCCCCCc--ccccccCCCCCCCCCCCcccccccccCCCCCCcccccCCCCCCCCCCCcCC
Q psy9822 293 EPLYVGDPRDNGECVPCSEYCNGHTGL--CINASLASLPFSPESGGTSELAAFLDEGPTTRARCMHCGNHTTGPKCEDCV 370 (380)
Q Consensus 293 ~~g~~G~~~~~~~C~~C~~~Cng~~~~--C~~~~~~~~p~~~~~~~~~~~~~~~~~g~~~~g~C~~C~~g~~G~~Ce~C~ 370 (380)
++||||++.. ..|+.|. ||-.++. |+ +| ..+|+|. |.+|..|.+|++|+
T Consensus 1004 k~Gf~GdA~~-q~CqrC~--Cn~LGTn~~~~-----CD--------------------r~tGQCp-ClpNv~G~~CDqCA 1054 (1758)
T KOG0994|consen 1004 KDGFYGDALR-QNCQRCV--CNFLGTNSTCH-----CD--------------------RFTGQCP-CLPNVQGVRCDQCA 1054 (1758)
T ss_pred cccchhHHHH-hhhhhhe--ccccccCCccc-----cc--------------------cccCcCC-CCcccccccccccc
Confidence 9999999864 5699998 9855432 21 12 2679999 99999999999999
Q ss_pred CCCCCCCCC
Q psy9822 371 EGKENRQSR 379 (380)
Q Consensus 371 ~Gy~g~~~~ 379 (380)
+-+|.+.|+
T Consensus 1055 ~N~w~laSG 1063 (1758)
T KOG0994|consen 1055 ENHWNLASG 1063 (1758)
T ss_pred cchhccccC
Confidence 999998875
No 2
>KOG0994|consensus
Probab=99.64 E-value=4e-16 Score=166.16 Aligned_cols=158 Identities=33% Similarity=0.777 Sum_probs=117.6
Q ss_pred CCCCCCCCCeeecCCC--Cccc-ccCCCceecCCCCCcccc-CCCC-CcCCcccC-CCcccc------------CCCccc
Q psy9822 192 HHTCDPQSEQCVDLAD--GFEC-VCGRGYNKSEEGGECVPV-CSQG-CVRGVCSE-PDKCQC------------DFGYVG 253 (380)
Q Consensus 192 ~~~C~~~~g~C~n~~~--g~~C-~C~~Gy~G~~~~~~C~~~-C~~~-C~~G~C~~-~~~C~C------------~~G~~G 253 (380)
.+.|.+++|.|...|+ |.+| +|.+|++|.+ +..|.+. |... =....|.. .++|+| .+||.|
T Consensus 774 S~vCn~~GGqCqCkPnVVGR~CdqCApGtyGFG-PsGCk~CdC~~~Gs~~~~Cd~~tGQC~C~~g~ygrqCnqCqpG~Wg 852 (1758)
T KOG0994|consen 774 SSVCNPNGGQCQCKPNVVGRRCDQCAPGTYGFG-PSGCKACDCNSIGSLDKYCDKITGQCQCRPGTYGRQCNQCQPGYWG 852 (1758)
T ss_pred cccccCCCceecccCccccccccccCCcccCcC-CccCccccccccccccccccccccceeeccccchhhccccCCCccC
Confidence 3478889999999886 8999 9999999988 4567665 5322 11122322 334444 455444
Q ss_pred -ccccccCCCCCCCc-ccCCCCCcccccCCCCCCCCCCCCCCCCccCCCCC--CCCCcCCCCccCCC-------CCcccc
Q psy9822 254 -VNCSIQCQCNGHAD-CAGPDKLDVCLRCHNHTKGPQCDKCEPLYVGDPRD--NGECVPCSEYCNGH-------TGLCIN 322 (380)
Q Consensus 254 -~~C~~~c~C~~~~~-C~~~~~~g~C~~C~~~~~G~~C~~C~~g~~G~~~~--~~~C~~C~~~Cng~-------~~~C~~ 322 (380)
+.|. +|+||+|+. |.. .++.|+.|++.++|.+|++|..||||+|.- +..|+||+ |... ++.|..
T Consensus 853 FPeCr-~CqCNgHA~~Cd~--~tGaCi~CqD~T~G~~CdrCl~GyyGdP~lg~g~~CrPCp--CP~gp~Sg~~~A~sC~~ 927 (1758)
T KOG0994|consen 853 FPECR-PCQCNGHADTCDP--ITGACIDCQDSTTGHSCDRCLDGYYGDPRLGSGIGCRPCP--CPDGPASGRQHADSCYL 927 (1758)
T ss_pred CCcCc-cccccCcccccCc--cccccccccccccccchhhhhccccCCcccCCCCCCCCCC--CCCCCccchhccccccc
Confidence 3444 599999975 864 689999999999999999999999999975 57899999 8743 223322
Q ss_pred cccCCCCCCCCCCCcccccccccCCCCCCcccccCCCCCCCCCCCcCCCCCCCCCCC
Q psy9822 323 ASLASLPFSPESGGTSELAAFLDEGPTTRARCMHCGNHTTGPKCEDCVEGKENRQSR 379 (380)
Q Consensus 323 ~~~~~~p~~~~~~~~~~~~~~~~~g~~~~g~C~~C~~g~~G~~Ce~C~~Gy~g~~~~ 379 (380)
| .....-+|. |+.||+|.+|+.|+++|||+|++
T Consensus 928 -----d------------------~~t~~ivC~-C~~GY~G~RCe~CA~~~fGnP~~ 960 (1758)
T KOG0994|consen 928 -----D------------------TRTQQIVCH-CQEGYSGSRCEICADNHFGNPSE 960 (1758)
T ss_pred -----c------------------ccccceeee-cccCccccchhhhcccccCCccc
Confidence 1 113568899 99999999999999999999975
No 3
>KOG1388|consensus
Probab=99.31 E-value=1.4e-12 Score=118.50 Aligned_cols=127 Identities=39% Similarity=0.994 Sum_probs=95.9
Q ss_pred hhhcCCCCCccCcCCCCCCCCceeeccCCCCCCCCCCCCCccccccccCCCCcccccccCCCCCccCCCCCCCccccCCC
Q psy9822 113 STCLRHAHCGWCALQRDTTGGMGVCTEGSLNSPSSGPESSTCDILFYQTYPDQVIEVNNNDSFSWNYVKCPPENECLNDH 192 (380)
Q Consensus 113 ~~Cl~~~~CGwC~~~~~~~~g~G~C~~G~~~~~~~~~~~~~C~~~~~~~~~~~~~~~~~~~~~~w~~~~C~d~deC~~g~ 192 (380)
++|+++|.||||+ +++.+|.|+||+|..++|.+... +. -+ ...|.|.+|| ..+|
T Consensus 1 ~~c~~~P~cgw~a--~~~~~G~grcm~g~~~~p~d~~~---~~-------------~~---~~~W~fl~cP-~~~c---- 54 (217)
T KOG1388|consen 1 SHCLRNPACGWCA--DGSNTGDGRCMEGHYRGPLDCVQ---LE-------------QP---REIWRFLFCP-LCQC---- 54 (217)
T ss_pred CCcccCCCccccc--CCCCCCCCeEEECccccccchhh---cc-------------Cc---cchhhhhcCh-HHHh----
Confidence 3689999999999 99999999999999999987200 00 01 4579999998 3344
Q ss_pred CCCCCCCCeeecCCCCcccccCCCceecCCCCCccccCCCCCcCCcccCCCccccCCCcccccccccCCCCCCCcccCCC
Q psy9822 193 HTCDPQSEQCVDLADGFECVCGRGYNKSEEGGECVPVCSQGCVRGVCSEPDKCQCDFGYVGVNCSIQCQCNGHADCAGPD 272 (380)
Q Consensus 193 ~~C~~~~g~C~n~~~g~~C~C~~Gy~G~~~~~~C~~~C~~~C~~G~C~~~~~C~C~~G~~G~~C~~~c~C~~~~~C~~~~ 272 (380)
++|+.|..
T Consensus 55 ----------------------------------------------------------------------NGh~~c~t-- 62 (217)
T KOG1388|consen 55 ----------------------------------------------------------------------NGHSDCNT-- 62 (217)
T ss_pred ----------------------------------------------------------------------cCCCCccc--
Confidence 34444443
Q ss_pred CCcccccCCCCCCCCCCCCCCCCccCCCCCCCCCcCCCCccCCCCCcccccccCCCCCCCCCCCcccccccccCCCCCCc
Q psy9822 273 KLDVCLRCHNHTKGPQCDKCEPLYVGDPRDNGECVPCSEYCNGHTGLCINASLASLPFSPESGGTSELAAFLDEGPTTRA 352 (380)
Q Consensus 273 ~~g~C~~C~~~~~G~~C~~C~~g~~G~~~~~~~C~~C~~~Cng~~~~C~~~~~~~~p~~~~~~~~~~~~~~~~~g~~~~g 352 (380)
.-.|.+|+++++|.+|++|+.+|+|| .+++.|++|. |++....|+. ..+
T Consensus 63 -~~v~~~~~N~~~g~~c~kc~~g~~Gd-tN~g~c~~~~--~~g~~~~~~~---------------------------~~~ 111 (217)
T KOG1388|consen 63 -QHVCWRCENGTTGAHCEKCIVGFYGD-TNGGKCQPCD--CNGGASACVT---------------------------LTG 111 (217)
T ss_pred -ceeeeeccCccccccCCceEEEEEec-CCCCccCHhh--hcCCeeeeec---------------------------cCC
Confidence 33444588999999999999999999 8899999999 9987665542 346
Q ss_pred ccccC-CCCCCCCCCCcC
Q psy9822 353 RCMHC-GNHTTGPKCEDC 369 (380)
Q Consensus 353 ~C~~C-~~g~~G~~Ce~C 369 (380)
+|. | ..++.|++|++|
T Consensus 112 ~c~-c~~kgvvgd~c~~~ 128 (217)
T KOG1388|consen 112 KCF-CTTKGVVGDLCPKC 128 (217)
T ss_pred ccc-cccceEecccCccc
Confidence 676 7 457778777765
No 4
>KOG3512|consensus
Probab=99.16 E-value=2.8e-11 Score=120.42 Aligned_cols=102 Identities=35% Similarity=0.839 Sum_probs=83.8
Q ss_pred cCCCCCCCc-ccCCC-CCcccccCCCCCCCCCCCCCCCCccCCCCC------CCCCcCCCCccCCCCCcccccccCCCCC
Q psy9822 259 QCQCNGHAD-CAGPD-KLDVCLRCHNHTKGPQCDKCEPLYVGDPRD------NGECVPCSEYCNGHTGLCINASLASLPF 330 (380)
Q Consensus 259 ~c~C~~~~~-C~~~~-~~g~C~~C~~~~~G~~C~~C~~g~~G~~~~------~~~C~~C~~~Cng~~~~C~~~~~~~~p~ 330 (380)
.|.||+|++ |+.+. ...+|+ |.+|++|+.|++|++.|+..|.. -..|.+|. ||+|+.+|..+.+
T Consensus 277 RCKCNgHAs~Cv~d~~~~ltCd-C~HNTaGPdCgrCKpfy~dRPW~raT~~~a~~c~ac~--Cn~harrcrfn~E----- 348 (592)
T KOG3512|consen 277 RCKCNGHASRCVMDESSHLTCD-CEHNTAGPDCGRCKPFYYDRPWGRATALPANECVACN--CNGHARRCRFNME----- 348 (592)
T ss_pred eeeecCccceeeeccCCceEEe-cccCCCCCCcccccccccCCCccccccCCCccccccc--cchhhhhcccchh-----
Confidence 588999976 87543 348995 99999999999999999988843 35799999 9999998875442
Q ss_pred CCCCCCcccccccccCCCCCCcccccCCCCCCCCCCCcCCCCCCCCCCC
Q psy9822 331 SPESGGTSELAAFLDEGPTTRARCMHCGNHTTGPKCEDCVEGKENRQSR 379 (380)
Q Consensus 331 ~~~~~~~~~~~~~~~~g~~~~g~C~~C~~g~~G~~Ce~C~~Gy~g~~~~ 379 (380)
.+.++ |+.+.++|++|++++.|+||..|++|||++.+.
T Consensus 349 -ly~lS----------gr~SggvClnCrHnTaGrhChyCreGyyRd~s~ 386 (592)
T KOG3512|consen 349 -LYRLS----------GRRSGGVCLNCRHNTAGRHCHYCREGYYRDGSK 386 (592)
T ss_pred -hhccc----------CccccceEeecccCCCCcccccccCccccCCCC
Confidence 33333 455789999999999999999999999999874
No 5
>KOG1836|consensus
Probab=99.06 E-value=6.2e-10 Score=127.51 Aligned_cols=160 Identities=34% Similarity=0.799 Sum_probs=120.9
Q ss_pred CCC-CCCCCCCCeeecCC--CCccc-ccCCCceecCC---CCCcccc-CCCCCcCCcccC-----CCccc-cCCCccccc
Q psy9822 190 NDH-HTCDPQSEQCVDLA--DGFEC-VCGRGYNKSEE---GGECVPV-CSQGCVRGVCSE-----PDKCQ-CDFGYVGVN 255 (380)
Q Consensus 190 ~g~-~~C~~~~g~C~n~~--~g~~C-~C~~Gy~G~~~---~~~C~~~-C~~~C~~G~C~~-----~~~C~-C~~G~~G~~ 255 (380)
++| ..|++.+|.|...+ .+.+| +|.+||+|... .+.|.+. |++. +.|.. ...|. |+.||+|..
T Consensus 734 ngh~~~Cd~~tG~C~C~~~t~G~~C~~C~~GfYg~~~~~~~~dC~~C~Cp~~---~~~~~~~~~~~~iCk~Cp~gytG~r 810 (1705)
T KOG1836|consen 734 NGHSNICDPRTGQCKCKHNTFGGQCAQCVDGFYGLPDLGTSGDCQPCPCPNG---GACGQTPEILEVVCKNCPPGYTGLR 810 (1705)
T ss_pred CCccccccCCCCceecccCCCCCchhhhcCCCCCccccCCCCCCccCCCCCC---hhhcCcCcccceecCCCCCCCcccc
Confidence 566 48999999887644 68899 99999999876 3348877 7765 33332 45897 999999988
Q ss_pred ccc--------------------cCCCCCCC------cccCCCCCcccccCCCCCCCCCCCCCCCCccCCCCC---CCCC
Q psy9822 256 CSI--------------------QCQCNGHA------DCAGPDKLDVCLRCHNHTKGPQCDKCEPLYVGDPRD---NGEC 306 (380)
Q Consensus 256 C~~--------------------~c~C~~~~------~C~~~~~~g~C~~C~~~~~G~~C~~C~~g~~G~~~~---~~~C 306 (380)
|+. .|+|+++. .|.. .++.|+.|.+++.|.+|+.|+++|+|++.. .+.|
T Consensus 811 Ce~c~dgyfg~p~~~~~~~~~c~~c~c~~n~dp~~~g~c~~--~tg~c~~ci~nT~g~~cd~c~~g~~gd~l~~~p~~~c 888 (1705)
T KOG1836|consen 811 CEECADGYFGNPLGHDGDVRPCQSCQCNFNVDPNAFGNCNR--LTGECLKCIHNTAGEYCDLCKEGYFGDPLAPNPEDKC 888 (1705)
T ss_pred cccCCCccccCCCCCCCCcccCccceeccccCccccccccc--cccceeeccCCcccccccccccCccccccCCCcCCcc
Confidence 863 34565542 3542 578898899999999999999999999865 4678
Q ss_pred cCCCCccCCCCCcccccccCCCCCCCCCCCcccccccccCCCCCCcccccCCCCCCCCCCCcCCCCCCCCCCC
Q psy9822 307 VPCSEYCNGHTGLCINASLASLPFSPESGGTSELAAFLDEGPTTRARCMHCGNHTTGPKCEDCVEGKENRQSR 379 (380)
Q Consensus 307 ~~C~~~Cng~~~~C~~~~~~~~p~~~~~~~~~~~~~~~~~g~~~~g~C~~C~~g~~G~~Ce~C~~Gy~g~~~~ 379 (380)
..|. |+..+.. .| .+ .-+..+|+|. |.++.+|..|..|.+|||++++.
T Consensus 889 ~~c~--c~p~gs~--------~~----~~----------~c~~~tGQce-c~~~v~g~~c~~c~~g~fnl~s~ 936 (1705)
T KOG1836|consen 889 FACG--CVPAGSE--------LP----SL----------TCNPVTGQCE-CKPNVEGRDCLYCFKGFFNLNSG 936 (1705)
T ss_pred cccc--CccCCcc--------cc----cc----------cCCCccccee-ccCCCCccccccccccccccCCC
Confidence 8888 7754321 00 00 1123679999 99999999999999999999854
No 6
>KOG3512|consensus
Probab=99.01 E-value=4.3e-10 Score=112.07 Aligned_cols=78 Identities=28% Similarity=0.648 Sum_probs=63.9
Q ss_pred CcccccCCCCCCCCCCCCCCCCccCCCCC----CCCCcCCCCccCCCCCcccccccCCCCCCCCCCCcccccccccCCCC
Q psy9822 274 LDVCLRCHNHTKGPQCDKCEPLYVGDPRD----NGECVPCSEYCNGHTGLCINASLASLPFSPESGGTSELAAFLDEGPT 349 (380)
Q Consensus 274 ~g~C~~C~~~~~G~~C~~C~~g~~G~~~~----~~~C~~C~~~Cng~~~~C~~~~~~~~p~~~~~~~~~~~~~~~~~g~~ 349 (380)
.++|+.|++++.|.+|..|++|||.|+.. ...|..|+ ||..+.. ++. -++
T Consensus 358 ggvClnCrHnTaGrhChyCreGyyRd~s~pl~hrkaCk~Cd--ChpVGs~---------------------gkt---CNq 411 (592)
T KOG3512|consen 358 GGVCLNCRHNTAGRHCHYCREGYYRDGSKPLTHRKACKACD--CHPVGSA---------------------GKT---CNQ 411 (592)
T ss_pred cceEeecccCCCCcccccccCccccCCCCCCchhhhhhhcC--Ccccccc---------------------ccc---ccc
Confidence 57899999999999999999999999753 46799999 9876421 011 123
Q ss_pred CCcccccCCCCCCCCCCCcCCCCCCCCCC
Q psy9822 350 TRARCMHCGNHTTGPKCEDCVEGKENRQS 378 (380)
Q Consensus 350 ~~g~C~~C~~g~~G~~Ce~C~~Gy~g~~~ 378 (380)
.+|+|. |++|.+|..|++|++||+-..+
T Consensus 412 ~tGqCp-CkeGvtG~tCnrCa~gyqqsrs 439 (592)
T KOG3512|consen 412 TTGQCP-CKEGVTGLTCNRCAPGYQQSRS 439 (592)
T ss_pred cCCccc-CCCCCcccccccccchhhcccC
Confidence 679999 9999999999999999997655
No 7
>KOG1836|consensus
Probab=98.84 E-value=4.6e-09 Score=120.52 Aligned_cols=108 Identities=32% Similarity=0.815 Sum_probs=88.2
Q ss_pred ccccCCCcccccccc-------------------cCCCCCCC-cccCCCCCcccccCCCCCCCCCCCCCCCCccCCCCCC
Q psy9822 244 KCQCDFGYVGVNCSI-------------------QCQCNGHA-DCAGPDKLDVCLRCHNHTKGPQCDKCEPLYVGDPRDN 303 (380)
Q Consensus 244 ~C~C~~G~~G~~C~~-------------------~c~C~~~~-~C~~~~~~g~C~~C~~~~~G~~C~~C~~g~~G~~~~~ 303 (380)
.|.|+.||+|..|+. +|.|++|+ +|.. .++.|. |++++.|.+|++|++||||++..+
T Consensus 696 ~c~C~~g~tG~~Ce~C~~gfrr~~~~~~~~~~c~~C~cngh~~~Cd~--~tG~C~-C~~~t~G~~C~~C~~GfYg~~~~~ 772 (1705)
T KOG1836|consen 696 QCTCPVGYTGQFCESCAPGFRRLSPQLGPFCPCIPCDCNGHSNICDP--RTGQCK-CKHNTFGGQCAQCVDGFYGLPDLG 772 (1705)
T ss_pred hccCCCCcccchhhhcchhhhcccccCCCCCcccccccCCccccccC--CCCcee-cccCCCCCchhhhcCCCCCccccC
Confidence 599999999988863 57788884 5864 789996 999999999999999999999764
Q ss_pred C--CCcCCCCccCCCCCcccccccCCCCCCCCCCCcccccccccCCCCCCcccccCCCCCCCCCCCcCCCCCCCCCCCC
Q psy9822 304 G--ECVPCSEYCNGHTGLCINASLASLPFSPESGGTSELAAFLDEGPTTRARCMHCGNHTTGPKCEDCVEGKENRQSRN 380 (380)
Q Consensus 304 ~--~C~~C~~~Cng~~~~C~~~~~~~~p~~~~~~~~~~~~~~~~~g~~~~g~C~~C~~g~~G~~Ce~C~~Gy~g~~~~~ 380 (380)
. .|++|. |... +.|.... ....++|.+|+++|+|.+|+.|..|||+++..+
T Consensus 773 ~~~dC~~C~--Cp~~-~~~~~~~-----------------------~~~~~iCk~Cp~gytG~rCe~c~dgyfg~p~~~ 825 (1705)
T KOG1836|consen 773 TSGDCQPCP--CPNG-GACGQTP-----------------------EILEVVCKNCPPGYTGLRCEECADGYFGNPLGH 825 (1705)
T ss_pred CCCCCccCC--CCCC-hhhcCcC-----------------------cccceecCCCCCCCcccccccCCCccccCCCCC
Confidence 3 399999 8843 4443211 135789999999999999999999999999764
No 8
>KOG1225|consensus
Probab=98.63 E-value=7.7e-08 Score=99.45 Aligned_cols=74 Identities=32% Similarity=0.891 Sum_probs=54.5
Q ss_pred CCcccccCCCceecCCCCCcccc-CCCCCc-CCcccCCCccccCCCcccccccc-cC--CCCCCCcccCCCCCcccccCC
Q psy9822 207 DGFECVCGRGYNKSEEGGECVPV-CSQGCV-RGVCSEPDKCQCDFGYVGVNCSI-QC--QCNGHADCAGPDKLDVCLRCH 281 (380)
Q Consensus 207 ~g~~C~C~~Gy~G~~~~~~C~~~-C~~~C~-~G~C~~~~~C~C~~G~~G~~C~~-~c--~C~~~~~C~~~~~~g~C~~C~ 281 (380)
....|.|..+|+|.. |... |+..|. +|.|+ +++|+|++||+|.+|+. .| .|++|..+. .++|+ |+
T Consensus 232 ~~~ic~c~~~~~g~~----c~~~~C~~~c~~~g~c~-~G~CIC~~Gf~G~dC~e~~Cp~~cs~~g~~~----~g~Ci-C~ 301 (525)
T KOG1225|consen 232 FDGICECPEGYFGPL----CSTIYCPGGCTGRGQCV-EGRCICPPGFTGDDCDELVCPVDCSGGGVCV----DGECI-CN 301 (525)
T ss_pred cCceeecCCceeCCc----cccccCCCCCcccceEe-CCeEeCCCCCcCCCCCcccCCcccCCCceec----CCEee-cC
Confidence 345788888888876 6655 877776 45554 46888999999988886 33 366777776 45886 88
Q ss_pred CCCCCCCCC
Q psy9822 282 NHTKGPQCD 290 (380)
Q Consensus 282 ~~~~G~~C~ 290 (380)
++|+|..|+
T Consensus 302 ~g~~G~dCs 310 (525)
T KOG1225|consen 302 PGYSGKDCS 310 (525)
T ss_pred CCccccccc
Confidence 888888885
No 9
>KOG1225|consensus
Probab=98.59 E-value=1.3e-07 Score=97.70 Aligned_cols=99 Identities=32% Similarity=0.835 Sum_probs=72.4
Q ss_pred CCCccCCCCCCCccccCCCCCCCCCCCeeecCCCCcccccCCCceecCCCCCcccc-CCCCCc-CCcccCCCccccCCCc
Q psy9822 174 SFSWNYVKCPPENECLNDHHTCDPQSEQCVDLADGFECVCGRGYNKSEEGGECVPV-CSQGCV-RGVCSEPDKCQCDFGY 251 (380)
Q Consensus 174 ~~~w~~~~C~d~deC~~g~~~C~~~~g~C~n~~~g~~C~C~~Gy~G~~~~~~C~~~-C~~~C~-~G~C~~~~~C~C~~G~ 251 (380)
..+|.++.|.. -.|. +.|. .++.|++. +|.|++||+|+. |... |+..|. ++.+++. +|+|++||
T Consensus 239 ~~~~~g~~c~~-~~C~---~~c~-~~g~c~~G----~CIC~~Gf~G~d----C~e~~Cp~~cs~~g~~~~g-~CiC~~g~ 304 (525)
T KOG1225|consen 239 PEGYFGPLCST-IYCP---GGCT-GRGQCVEG----RCICPPGFTGDD----CDELVCPVDCSGGGVCVDG-ECICNPGY 304 (525)
T ss_pred CCceeCCcccc-ccCC---CCCc-ccceEeCC----eEeCCCCCcCCC----CCcccCCcccCCCceecCC-EeecCCCc
Confidence 34566666652 2331 2233 33556544 899999999988 6665 887787 4566554 99999999
Q ss_pred ccccccc---cCCCCCCCcccCCCCCcccccCCCCCCCCCCCC
Q psy9822 252 VGVNCSI---QCQCNGHADCAGPDKLDVCLRCHNHTKGPQCDK 291 (380)
Q Consensus 252 ~G~~C~~---~c~C~~~~~C~~~~~~g~C~~C~~~~~G~~C~~ 291 (380)
+|..|++ +-.|++|+.|+ .+.|. |.++|+|..|..
T Consensus 305 ~G~dCs~~~cpadC~g~G~Ci----~G~C~-C~~Gy~G~~C~~ 342 (525)
T KOG1225|consen 305 SGKDCSIRRCPADCSGHGKCI----DGECL-CDEGYTGELCIQ 342 (525)
T ss_pred cccccccccCCccCCCCCccc----CCceE-eCCCCcCCcccc
Confidence 9999986 35699999998 68995 999999998864
No 10
>KOG1214|consensus
Probab=98.18 E-value=5.4e-06 Score=87.97 Aligned_cols=107 Identities=29% Similarity=0.744 Sum_probs=73.8
Q ss_pred CCCCCCCccccCCCCCCCCCCCeeecCCCCcccccCCCceecCCCCCcccc--------CCC---CCc-CC--cccC---
Q psy9822 179 YVKCPPENECLNDHHTCDPQSEQCVDLADGFECVCGRGYNKSEEGGECVPV--------CSQ---GCV-RG--VCSE--- 241 (380)
Q Consensus 179 ~~~C~d~deC~~g~~~C~~~~g~C~n~~~g~~C~C~~Gy~G~~~~~~C~~~--------C~~---~C~-~G--~C~~--- 241 (380)
+..|.|++||+.+.+.|. .+.+|+|.+++|+|+|..||.-.+....|.++ |.. .|. .| +|+.
T Consensus 728 gr~c~d~~eca~~~~~CG-p~s~Cin~pg~~rceC~~gy~F~dd~~tCV~i~~pap~n~Ce~g~h~C~i~g~a~c~~hGg 806 (1289)
T KOG1214|consen 728 GRNCVDENECATGFHRCG-PNSVCINLPGSYRCECRSGYEFADDRHTCVLITPPAPANPCEDGSHTCAIAGQARCVHHGG 806 (1289)
T ss_pred CCCCCChhhhccCCCCCC-CCceeecCCCceeEEEeecceeccCCcceEEecCCCCCCccccCccccCcCCceEEEecCC
Confidence 467999999999999999 67899999999999999998644433456544 211 133 23 4443
Q ss_pred -CCccccCCCcccc--cccc-----cCCCCCCCcccCCCCCcccccCCCCCCCC
Q psy9822 242 -PDKCQCDFGYVGV--NCSI-----QCQCNGHADCAGPDKLDVCLRCHNHTKGP 287 (380)
Q Consensus 242 -~~~C~C~~G~~G~--~C~~-----~c~C~~~~~C~~~~~~g~C~~C~~~~~G~ 287 (380)
.+.|.|.+||.|. .|.. +..|...+.|.++++.+.| +|+++|.|.
T Consensus 807 s~y~C~CLPGfsGDG~~c~dvDeC~psrChp~A~CyntpgsfsC-~C~pGy~GD 859 (1289)
T KOG1214|consen 807 STYSCACLPGFSGDGHQCTDVDECSPSRCHPAATCYNTPGSFSC-RCQPGYYGD 859 (1289)
T ss_pred ceEEEeecCCccCCccccccccccCccccCCCceEecCCCccee-ecccCccCC
Confidence 3599999999973 2322 3446666777777777777 355555443
No 11
>KOG4289|consensus
Probab=98.08 E-value=7e-06 Score=90.95 Aligned_cols=107 Identities=34% Similarity=0.868 Sum_probs=61.2
Q ss_pred ccccCC-CCCc-CCcccC-----CCccccCCCcccccccc--cCCCC----CCCcccCCCCCcccccCC---------CC
Q psy9822 226 CVPVCS-QGCV-RGVCSE-----PDKCQCDFGYVGVNCSI--QCQCN----GHADCAGPDKLDVCLRCH---------NH 283 (380)
Q Consensus 226 C~~~C~-~~C~-~G~C~~-----~~~C~C~~G~~G~~C~~--~c~C~----~~~~C~~~~~~g~C~~C~---------~~ 283 (380)
|.++|. ++|. .|+|+. .++|+|++||+|++|+. +-+|- ++-+| +.| .|. +-
T Consensus 1715 C~~vC~lnpc~~~g~Cv~sp~a~GY~C~C~~g~~G~~Ce~~~dq~CPrGWWG~P~C------gpC-~CavsKgfdp~CnK 1787 (2531)
T KOG4289|consen 1715 CVDVCSLNPCENQGTCVRSPGAHGYTCECPPGYTGPYCELRADQPCPRGWWGFPTC------GPC-NCAVSKGFDPDCNK 1787 (2531)
T ss_pred ccchhcccccccCceeecCCCCCceeEECCCcccCcchhhhccCCCCCcccCCCCc------cCc-cccccCCCCCCccc
Confidence 666663 4555 477765 35788888888877764 12221 11111 111 111 01
Q ss_pred CCCCCCCCCCCCccCCCCCCCCCcCCCCccCCCCCcccccccCCCCCCCCCCCcccccccccCCCCCCcccccCCCCCCC
Q psy9822 284 TKGPQCDKCEPLYVGDPRDNGECVPCSEYCNGHTGLCINASLASLPFSPESGGTSELAAFLDEGPTTRARCMHCGNHTTG 363 (380)
Q Consensus 284 ~~G~~C~~C~~g~~G~~~~~~~C~~C~~~Cng~~~~C~~~~~~~~p~~~~~~~~~~~~~~~~~g~~~~g~C~~C~~g~~G 363 (380)
+.| .| +|+..+|... +.|.+|+ |. ++.. +. +. ..+|+|. |+++..|
T Consensus 1788 t~G-~C-qCKe~hy~~~---~~Cl~Cd--C~-~Gs~----------------Sr--------~C-~adGqC~-C~pgaiG 1833 (2531)
T KOG4289|consen 1788 TNG-QC-QCKENHYRPI---GSCLPCD--CY-FGSD----------------SR--------EC-DADGQCP-CKPGAIG 1833 (2531)
T ss_pred cCc-ce-eeccccccCC---Ccceeec--cc-cCCC----------------cc--------cc-cCCCcCC-CCCcccc
Confidence 122 23 6777666543 3599999 77 4310 00 01 1579999 9999999
Q ss_pred CCCCcCCCCC
Q psy9822 364 PKCEDCVEGK 373 (380)
Q Consensus 364 ~~Ce~C~~Gy 373 (380)
++|++|..-|
T Consensus 1834 RqCdrCd~pf 1843 (2531)
T KOG4289|consen 1834 RQCDRCDNPF 1843 (2531)
T ss_pred ccccccCChh
Confidence 9999997543
No 12
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=97.92 E-value=1.2e-05 Score=57.46 Aligned_cols=28 Identities=32% Similarity=0.730 Sum_probs=26.2
Q ss_pred CCcccccCCCCCCCCCCCcCCCCCCCCCC
Q psy9822 350 TRARCMHCGNHTTGPKCEDCVEGKENRQS 378 (380)
Q Consensus 350 ~~g~C~~C~~g~~G~~Ce~C~~Gy~g~~~ 378 (380)
.+++|+ |+++|+|++|++|++|||+++.
T Consensus 17 ~~G~C~-C~~~~~G~~C~~C~~g~~~~~~ 44 (50)
T cd00055 17 GTGQCE-CKPNTTGRRCDRCAPGYYGLPS 44 (50)
T ss_pred CCCEEe-CCCcCCCCCCCCCCCCCccCCC
Confidence 579999 9999999999999999999975
No 13
>KOG1219|consensus
Probab=97.91 E-value=1.1e-05 Score=92.45 Aligned_cols=101 Identities=27% Similarity=0.729 Sum_probs=80.0
Q ss_pred ccccCCCCCCCCCCCeeecCC-CCcccccCCCceecCCCCCcccc---C-CCCCc-CCcccC---CCccccCCCcccccc
Q psy9822 186 NECLNDHHTCDPQSEQCVDLA-DGFECVCGRGYNKSEEGGECVPV---C-SQGCV-RGVCSE---PDKCQCDFGYVGVNC 256 (380)
Q Consensus 186 deC~~g~~~C~~~~g~C~n~~-~g~~C~C~~Gy~G~~~~~~C~~~---C-~~~C~-~G~C~~---~~~C~C~~G~~G~~C 256 (380)
+.|. ..+|. ++|+|+-.+ ++|.|.|++-|.|.. |+.- | .++|. -|+|+. .+.|.|+.||||.+|
T Consensus 3865 d~C~--~npCq-hgG~C~~~~~ggy~CkCpsqysG~~----CEi~~epC~snPC~~GgtCip~~n~f~CnC~~gyTG~~C 3937 (4289)
T KOG1219|consen 3865 DPCN--DNPCQ-HGGTCISQPKGGYKCKCPSQYSGNH----CEIDLEPCASNPCLTGGTCIPFYNGFLCNCPNGYTGKRC 3937 (4289)
T ss_pred cccc--cCccc-CCCEecCCCCCceEEeCcccccCcc----cccccccccCCCCCCCCEEEecCCCeeEeCCCCccCcee
Confidence 4552 35787 899999877 699999999999987 6532 5 45676 489986 469999999999999
Q ss_pred ccc----C---CCCCCCcccCCCCCcccccCCCCCCCCCCCCCCC
Q psy9822 257 SIQ----C---QCNGHADCAGPDKLDVCLRCHNHTKGPQCDKCEP 294 (380)
Q Consensus 257 ~~~----c---~C~~~~~C~~~~~~g~C~~C~~~~~G~~C~~C~~ 294 (380)
+.. | .|...+.|++.++.+.|. |.+++.|.+|..-+|
T Consensus 3938 e~~Gi~eCs~n~C~~gg~C~n~~gsf~Cn-cT~g~~gr~c~~~~p 3981 (4289)
T KOG1219|consen 3938 EARGISECSKNVCGTGGQCINIPGSFHCN-CTPGILGRTCCAEKP 3981 (4289)
T ss_pred ecccccccccccccCCceeeccCCceEec-cChhHhcccCccccC
Confidence 863 2 466678899999999994 999999999964443
No 14
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes []. +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=97.90 E-value=6.5e-06 Score=56.69 Aligned_cols=36 Identities=28% Similarity=0.804 Sum_probs=32.5
Q ss_pred CCccccCCCCCCCCCCCeeecCCCCcccccCCCceec
Q psy9822 184 PENECLNDHHTCDPQSEQCVDLADGFECVCGRGYNKS 220 (380)
Q Consensus 184 d~deC~~g~~~C~~~~g~C~n~~~g~~C~C~~Gy~G~ 220 (380)
|+|||..+.+.|. .++.|+|+.++|+|.|++||...
T Consensus 1 DidEC~~~~~~C~-~~~~C~N~~Gsy~C~C~~Gy~~~ 36 (42)
T PF07645_consen 1 DIDECAEGPHNCP-ENGTCVNTEGSYSCSCPPGYELN 36 (42)
T ss_dssp ESSTTTTTSSSSS-TTSEEEEETTEEEEEESTTEEEC
T ss_pred CccccCCCCCcCC-CCCEEEcCCCCEEeeCCCCcEEC
Confidence 6899998888996 89999999999999999999843
No 15
>PF00053 Laminin_EGF: Laminin EGF-like (Domains III and V); InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and three short globular arms. The long arm consist of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below. +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=97.87 E-value=4.3e-06 Score=59.48 Aligned_cols=29 Identities=34% Similarity=0.819 Sum_probs=24.9
Q ss_pred CCcccccCCCCCCCCCCCcCCCCCCCCCCC
Q psy9822 350 TRARCMHCGNHTTGPKCEDCVEGKENRQSR 379 (380)
Q Consensus 350 ~~g~C~~C~~g~~G~~Ce~C~~Gy~g~~~~ 379 (380)
.+++|+ |+++|+|++|++|++|||+++.+
T Consensus 16 ~~G~C~-C~~~~~G~~C~~C~~g~~~~~~~ 44 (49)
T PF00053_consen 16 STGQCV-CKPGTTGPRCDQCKPGYFGLPSD 44 (49)
T ss_dssp TCEEES-BSTTEESTTS-EE-TTEECSTTS
T ss_pred CCCEEe-ccccccCCcCcCCCCccccccCC
Confidence 579999 99999999999999999999764
No 16
>KOG4289|consensus
Probab=97.72 E-value=2.3e-05 Score=87.06 Aligned_cols=82 Identities=24% Similarity=0.713 Sum_probs=62.8
Q ss_pred CCCCcccccCCCceecCCCCCcccc---CC-CCCc-CCcccC---CCccccCCCcccccccc--------cCCCCCCCcc
Q psy9822 205 LADGFECVCGRGYNKSEEGGECVPV---CS-QGCV-RGVCSE---PDKCQCDFGYVGVNCSI--------QCQCNGHADC 268 (380)
Q Consensus 205 ~~~g~~C~C~~Gy~G~~~~~~C~~~---C~-~~C~-~G~C~~---~~~C~C~~G~~G~~C~~--------~c~C~~~~~C 268 (380)
..++++|+|++||+|+. |+.. |. .+|. +|+|.. .++|.|.+||+|..|++ +--|.+.++|
T Consensus 1218 pvnglrCrCPpGFTgd~----CeTeiDlCYs~pC~nng~C~srEggYtCeCrpg~tGehCEvs~~agrCvpGvC~nggtC 1293 (2531)
T KOG4289|consen 1218 PVNGLRCRCPPGFTGDY----CETEIDLCYSGPCGNNGRCRSREGGYTCECRPGFTGEHCEVSARAGRCVPGVCKNGGTC 1293 (2531)
T ss_pred ccCceeEeCCCCCCccc----ccchhHhhhcCCCCCCCceEEecCceeEEecCCccccceeeecccCccccceecCCCEE
Confidence 34789999999999987 7644 75 5688 799987 57999999999999996 2357788889
Q ss_pred cCCCCCcccccCCCC-CCCCCCC
Q psy9822 269 AGPDKLDVCLRCHNH-TKGPQCD 290 (380)
Q Consensus 269 ~~~~~~g~C~~C~~~-~~G~~C~ 290 (380)
.+..+.+.|-.|+.+ +++++|+
T Consensus 1294 ~~~~nggf~c~Cp~ge~e~prC~ 1316 (2531)
T KOG4289|consen 1294 VNLLNGGFCCHCPYGEFEDPRCE 1316 (2531)
T ss_pred eecCCCceeccCCCcccCCCceE
Confidence 875554444358654 5777886
No 17
>KOG1219|consensus
Probab=97.72 E-value=3.6e-05 Score=88.56 Aligned_cols=102 Identities=27% Similarity=0.576 Sum_probs=77.1
Q ss_pred CCCCcC-CcccC----CCccccCCCcccccccc------cCCCCCCCcccCCCCCcccccCCCCCCCCCCCCCCCCccCC
Q psy9822 231 SQGCVR-GVCSE----PDKCQCDFGYVGVNCSI------QCQCNGHADCAGPDKLDVCLRCHNHTKGPQCDKCEPLYVGD 299 (380)
Q Consensus 231 ~~~C~~-G~C~~----~~~C~C~~G~~G~~C~~------~c~C~~~~~C~~~~~~g~C~~C~~~~~G~~C~~C~~g~~G~ 299 (380)
.++|+| |+|+. .++|.|+.-|+|..|++ ..+|..+++|+...+.+.| +|+++|+|.+|+.= |
T Consensus 3869 ~npCqhgG~C~~~~~ggy~CkCpsqysG~~CEi~~epC~snPC~~GgtCip~~n~f~C-nC~~gyTG~~Ce~~--G---- 3941 (4289)
T KOG1219|consen 3869 DNPCQHGGTCISQPKGGYKCKCPSQYSGNHCEIDLEPCASNPCLTGGTCIPFYNGFLC-NCPNGYTGKRCEAR--G---- 3941 (4289)
T ss_pred cCcccCCCEecCCCCCceEEeCcccccCcccccccccccCCCCCCCCEEEecCCCeeE-eCCCCccCceeecc--c----
Confidence 355775 79987 36999999999999997 4578899999987889999 69999999999752 1
Q ss_pred CCCCCCCcCCCCccCCCCCcccccccCCCCCCCCCCCcccccccccCCCCCCcccccCCCCCCCCCCCcCCC
Q psy9822 300 PRDNGECVPCSEYCNGHTGLCINASLASLPFSPESGGTSELAAFLDEGPTTRARCMHCGNHTTGPKCEDCVE 371 (380)
Q Consensus 300 ~~~~~~C~~C~~~Cng~~~~C~~~~~~~~p~~~~~~~~~~~~~~~~~g~~~~g~C~~C~~g~~G~~Ce~C~~ 371 (380)
-..|..-. |. +++.|.+.. +.+.|. |.+++.|++|..=+|
T Consensus 3942 ---i~eCs~n~--C~-~gg~C~n~~-------------------------gsf~Cn-cT~g~~gr~c~~~~p 3981 (4289)
T KOG1219|consen 3942 ---ISECSKNV--CG-TGGQCINIP-------------------------GSFHCN-CTPGILGRTCCAEKP 3981 (4289)
T ss_pred ---cccccccc--cc-CCceeeccC-------------------------CceEec-cChhHhcccCccccC
Confidence 12343222 33 778887421 458999 999999999965443
No 18
>KOG1214|consensus
Probab=97.57 E-value=9.9e-05 Score=78.68 Aligned_cols=111 Identities=29% Similarity=0.705 Sum_probs=72.1
Q ss_pred CCccccCCCCCCCCCCCeeecCC-CCcccccCCCceecCCCCCcccc--C---CCCCc-CCcccC---CCccccCCCcc-
Q psy9822 184 PENECLNDHHTCDPQSEQCVDLA-DGFECVCGRGYNKSEEGGECVPV--C---SQGCV-RGVCSE---PDKCQCDFGYV- 252 (380)
Q Consensus 184 d~deC~~g~~~C~~~~g~C~n~~-~g~~C~C~~Gy~G~~~~~~C~~~--C---~~~C~-~G~C~~---~~~C~C~~G~~- 252 (380)
.++.|..+.+.|+ .++.|.-.+ -.|+|.|..||.|++ ..|... | ...|. +..|++ .++|.|..||.
T Consensus 691 ~~npCy~gsh~cd-t~a~C~pg~~~~~tcecs~g~~gdg--r~c~d~~eca~~~~~CGp~s~Cin~pg~~rceC~~gy~F 767 (1289)
T KOG1214|consen 691 PVNPCYDGSHMCD-TTARCHPGTGVDYTCECSSGYQGDG--RNCVDENECATGFHRCGPNSVCINLPGSYRCECRSGYEF 767 (1289)
T ss_pred ccccceecCcccC-CCccccCCCCcceEEEEeeccCCCC--CCCCChhhhccCCCCCCCCceeecCCCceeEEEeeccee
Confidence 4567777888888 777887655 489999999999997 456554 3 45587 789998 35888887774
Q ss_pred ---cccccc------cCCCCCC-CcccCCCCCcccccCCCCCCCCCCCCCCCCccCCCC
Q psy9822 253 ---GVNCSI------QCQCNGH-ADCAGPDKLDVCLRCHNHTKGPQCDKCEPLYVGDPR 301 (380)
Q Consensus 253 ---G~~C~~------~c~C~~~-~~C~~~~~~g~C~~C~~~~~G~~C~~C~~g~~G~~~ 301 (380)
+.+|-. ...|-.. .+|.. .+++ .|...=.+.+-..|.|||.|++.
T Consensus 768 ~dd~~tCV~i~~pap~n~Ce~g~h~C~i---~g~a-~c~~hGgs~y~C~CLPGfsGDG~ 822 (1289)
T KOG1214|consen 768 ADDRHTCVLITPPAPANPCEDGSHTCAI---AGQA-RCVHHGGSTYSCACLPGFSGDGH 822 (1289)
T ss_pred ccCCcceEEecCCCCCCccccCccccCc---CCce-EEEecCCceEEEeecCCccCCcc
Confidence 245532 2344433 34642 3444 24433333333378889988874
No 19
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domai.
Probab=97.55 E-value=7.8e-05 Score=52.41 Aligned_cols=25 Identities=28% Similarity=0.801 Sum_probs=23.8
Q ss_pred CCcccccCCCCCCCCCCCcCCCCCCC
Q psy9822 350 TRARCMHCGNHTTGPKCEDCVEGKEN 375 (380)
Q Consensus 350 ~~g~C~~C~~g~~G~~Ce~C~~Gy~g 375 (380)
.+++|+ |+++++|++|++|++||||
T Consensus 16 ~~G~C~-C~~~~~G~~C~~C~~g~~g 40 (46)
T smart00180 16 DTGQCE-CKPNVTGRRCDRCAPGYYG 40 (46)
T ss_pred CCCEEE-CCCCCCCCCCCcCCCCcCC
Confidence 468999 9999999999999999999
No 20
>KOG1226|consensus
Probab=97.48 E-value=0.0003 Score=74.94 Aligned_cols=75 Identities=28% Similarity=0.749 Sum_probs=48.7
Q ss_pred ccccCCCceecCC--CCCc------cccCCC-----CCc-CCcccCCCccccCCCcc----cccccc---cC------CC
Q psy9822 210 ECVCGRGYNKSEE--GGEC------VPVCSQ-----GCV-RGVCSEPDKCQCDFGYV----GVNCSI---QC------QC 262 (380)
Q Consensus 210 ~C~C~~Gy~G~~~--~~~C------~~~C~~-----~C~-~G~C~~~~~C~C~~G~~----G~~C~~---~c------~C 262 (380)
.|.|.+||.|..+ .... ++.|.. .|. +|.|.- ++|+|..... |..|+- .| .|
T Consensus 479 ~C~C~~G~~G~~CEC~~~~~ss~~~~~~Cr~~~~~~vCSgrG~C~C-GqC~C~~~~~~~i~G~fCECDnfsC~r~~g~lC 557 (783)
T KOG1226|consen 479 QCRCDEGWLGKKCECSTDELSSSEEEDKCRENSDSPVCSGRGDCVC-GQCVCHKPDNGKIYGKFCECDNFSCERHKGVLC 557 (783)
T ss_pred ceecCCCCCCCcccCCccccCcHhHHhhccCCCCCCCcCCCCcEeC-CceEecCCCCCceeeeeeeccCcccccccCccc
Confidence 4689999999885 0000 111321 243 444432 4788887777 777762 12 37
Q ss_pred CCCCcccCCCCCcccccCCCCCCCCCCC
Q psy9822 263 NGHADCAGPDKLDVCLRCHNHTKGPQCD 290 (380)
Q Consensus 263 ~~~~~C~~~~~~g~C~~C~~~~~G~~C~ 290 (380)
.+|+.|. -++|+ |.+||+|..|+
T Consensus 558 ~g~G~C~----CG~Cv-C~~GwtG~~C~ 580 (783)
T KOG1226|consen 558 GGHGRCE----CGRCV-CNPGWTGSACN 580 (783)
T ss_pred CCCCeEe----CCcEE-cCCCCccCCCC
Confidence 8888886 78995 99999999983
No 21
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=97.03 E-value=0.0011 Score=47.40 Aligned_cols=40 Identities=43% Similarity=1.076 Sum_probs=31.7
Q ss_pred CCCCCCCc----ccCCCCCcccccCCCCCCCCCCCCCCCCccCCCCC
Q psy9822 260 CQCNGHAD----CAGPDKLDVCLRCHNHTKGPQCDKCEPLYVGDPRD 302 (380)
Q Consensus 260 c~C~~~~~----C~~~~~~g~C~~C~~~~~G~~C~~C~~g~~G~~~~ 302 (380)
|.|+.+++ |.. .+++|. |+++++|.+|++|+++|++.+..
T Consensus 2 C~C~~~g~~~~~C~~--~~G~C~-C~~~~~G~~C~~C~~g~~~~~~~ 45 (50)
T cd00055 2 CDCNGHGSLSGQCDP--GTGQCE-CKPNTTGRRCDRCAPGYYGLPSQ 45 (50)
T ss_pred CcCcCCCCCCccccC--CCCEEe-CCCcCCCCCCCCCCCCCccCCCC
Confidence 44555443 753 579995 99999999999999999998854
No 22
>KOG1226|consensus
Probab=96.96 E-value=0.0023 Score=68.31 Aligned_cols=87 Identities=31% Similarity=0.819 Sum_probs=55.3
Q ss_pred CCCCCCCeeecCCCCcccccCCCce----ecCCCCCcccc-CCCC----Cc-CCcccCCCccccCCCcccccccc-----
Q psy9822 194 TCDPQSEQCVDLADGFECVCGRGYN----KSEEGGECVPV-CSQG----CV-RGVCSEPDKCQCDFGYVGVNCSI----- 258 (380)
Q Consensus 194 ~C~~~~g~C~n~~~g~~C~C~~Gy~----G~~~~~~C~~~-C~~~----C~-~G~C~~~~~C~C~~G~~G~~C~~----- 258 (380)
.|. ..|.|... +|.|.+... |..+ +|... |... |. ||+|.- ++|+|.+||+|..|.-
T Consensus 515 vCS-grG~C~CG----qC~C~~~~~~~i~G~fC--ECDnfsC~r~~g~lC~g~G~C~C-G~CvC~~GwtG~~C~C~~std 586 (783)
T KOG1226|consen 515 VCS-GRGDCVCG----QCVCHKPDNGKIYGKFC--ECDNFSCERHKGVLCGGHGRCEC-GRCVCNPGWTGSACNCPLSTD 586 (783)
T ss_pred CcC-CCCcEeCC----ceEecCCCCCceeeeee--eccCcccccccCcccCCCCeEeC-CcEEcCCCCccCCCCCCCCCc
Confidence 455 55556553 566655444 5541 22233 5322 65 777643 6899999999998862
Q ss_pred ------cCCCCCCCcccCCCCCcccccCCCC-CCCCCCCCCC
Q psy9822 259 ------QCQCNGHADCAGPDKLDVCLRCHNH-TKGPQCDKCE 293 (380)
Q Consensus 259 ------~c~C~~~~~C~~~~~~g~C~~C~~~-~~G~~C~~C~ 293 (380)
.-.|+++++|. -++| +|.+. +.|.+||.|.
T Consensus 587 ~C~~~~G~iCSGrG~C~----Cg~C-~C~~~~~sG~~CE~cp 623 (783)
T KOG1226|consen 587 TCESSDGQICSGRGTCE----CGRC-KCTDPPYSGEFCEKCP 623 (783)
T ss_pred cccCCCCceeCCCceee----CCce-EcCCCCcCcchhhcCC
Confidence 11367777775 6778 48766 8999998775
No 23
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=96.91 E-value=0.00034 Score=46.64 Aligned_cols=31 Identities=19% Similarity=0.728 Sum_probs=24.6
Q ss_pred CCCCCCCCCCCeeecCCCCcccccCCCceecC
Q psy9822 190 NDHHTCDPQSEQCVDLADGFECVCGRGYNKSE 221 (380)
Q Consensus 190 ~g~~~C~~~~g~C~n~~~g~~C~C~~Gy~G~~ 221 (380)
...+.|+ .+++|++++++|+|+|++||.|++
T Consensus 3 ~~~~~C~-~nA~C~~~~~~~~C~C~~Gy~GdG 33 (36)
T PF12947_consen 3 ENNGGCH-PNATCTNTGGSYTCTCKPGYEGDG 33 (36)
T ss_dssp TGGGGS--TTCEEEE-TTSEEEEE-CEEECCS
T ss_pred CCCCCCC-CCcEeecCCCCEEeECCCCCccCC
Confidence 3456788 789999999999999999999987
No 24
>KOG3509|consensus
Probab=96.84 E-value=0.002 Score=71.10 Aligned_cols=116 Identities=27% Similarity=0.541 Sum_probs=76.5
Q ss_pred CccccCCCcccccccc-------------------cCCCCCCCc-ccCCCCCcccccCCCCCCCCCCCCCCCCccCCCCC
Q psy9822 243 DKCQCDFGYVGVNCSI-------------------QCQCNGHAD-CAGPDKLDVCLRCHNHTKGPQCDKCEPLYVGDPRD 302 (380)
Q Consensus 243 ~~C~C~~G~~G~~C~~-------------------~c~C~~~~~-C~~~~~~g~C~~C~~~~~G~~C~~C~~g~~G~~~~ 302 (380)
.+|+|+.|+.|..|+. .+.|+.|+. |.. ..+.|..|++++.|.+|+.|.+|+++++..
T Consensus 718 ~~C~c~~g~~G~~ce~c~e~~~ls~t~~~~~~~~~~c~~~~h~~~c~~--~~~~nt~~q~~~~~~~~~~~~~g~~~da~~ 795 (964)
T KOG3509|consen 718 EQCQCPKGLVGTSCEDCAEGYTLSTTGGLYPGLCEDCECNSHISQCED--DLGYNTDCQNNTEGDRCELCSPGTYGDARR 795 (964)
T ss_pred cccccCccccCcccccccccccccccCCcCcccCcccccCCCcccccc--cccccccccccCccceeeecCCCccccCcc
Confidence 3788888888877652 356777764 763 467787899999999999999999999876
Q ss_pred CCCCcCCCCccCCCCCcccccccCCCCCCCCCCCcccccccccCCCCCCcccccCCCCCCCCCCCcCCCCCCCCCCC
Q psy9822 303 NGECVPCSEYCNGHTGLCINASLASLPFSPESGGTSELAAFLDEGPTTRARCMHCGNHTTGPKCEDCVEGKENRQSR 379 (380)
Q Consensus 303 ~~~C~~C~~~Cng~~~~C~~~~~~~~p~~~~~~~~~~~~~~~~~g~~~~g~C~~C~~g~~G~~Ce~C~~Gy~g~~~~ 379 (380)
+....... .+-.. ..++. .+..+.. .......|..|.+|++|.+|+++..||+++.++
T Consensus 796 g~~~D~~p-~~~l~-~~~~~----------~~r~~l~-------~~~~~~~~~~~p~~~~g~~~~~~~~~~~~~atd 853 (964)
T KOG3509|consen 796 GTPEDCRP-ATALT-IQCSC----------NNRSPLS-------CDGFGPGCLLCPHNTEGTTCERVKAGYYGFATD 853 (964)
T ss_pred CCcccCCc-cchhh-hhhhh----------cccCccc-------cccCCCCcccCCCCccccchhhhccccccccCc
Confidence 43321111 01100 01100 0011100 011345777799999999999999999998764
No 25
>KOG1217|consensus
Probab=96.78 E-value=0.012 Score=59.46 Aligned_cols=104 Identities=22% Similarity=0.603 Sum_probs=62.1
Q ss_pred CCCccCCCCCCC-ccccCCCCCCCCCCCeeecCCCCcccccCCCceecCCCCCccccCCCCCcCCcccCCCccccCCCcc
Q psy9822 174 SFSWNYVKCPPE-NECLNDHHTCDPQSEQCVDLADGFECVCGRGYNKSEEGGECVPVCSQGCVRGVCSEPDKCQCDFGYV 252 (380)
Q Consensus 174 ~~~w~~~~C~d~-deC~~g~~~C~~~~g~C~n~~~g~~C~C~~Gy~G~~~~~~C~~~C~~~C~~G~C~~~~~C~C~~G~~ 252 (380)
..+|.+..|... ++|....+.|. +.+.|.+..++|.|.|++||.+... ... .. .+.|+....|.+..|+.
T Consensus 157 ~~g~~~~~~~~~~~~C~~~~~~c~-~~~~C~~~~~~~~C~c~~~~~~~~~----~~~-~~---~~~c~~~~~~~~~~g~~ 227 (487)
T KOG1217|consen 157 TEGYEGEPCETDLDECIQYSSPCQ-NGGTCVNTGGSYLCSCPPGYTGSTC----ETT-GN---GGTCVDSVACSCPPGAR 227 (487)
T ss_pred CCCcccccccccccccccCCCCcC-CCcccccCCCCeeEeCCCCccCCcC----cCC-CC---CceEecceeccCCCCCC
Confidence 567888888644 79976667787 7889999999999999999998863 211 00 12333334456666666
Q ss_pred ccccccc-CCCCCC-CcccCCCCCcccccCCCCCCCC
Q psy9822 253 GVNCSIQ-CQCNGH-ADCAGPDKLDVCLRCHNHTKGP 287 (380)
Q Consensus 253 G~~C~~~-c~C~~~-~~C~~~~~~g~C~~C~~~~~G~ 287 (380)
+..|.+. ..|... ++|.+..+.+.|. |..++++.
T Consensus 228 ~~~c~~~~~~~~~~~~~c~~~~~~~~C~-~~~g~~~~ 263 (487)
T KOG1217|consen 228 GPECEVSIVECASGDGTCVNTVGSYTCR-CPEGYTGD 263 (487)
T ss_pred CCCcccccccccCCCCcccccCCceeee-CCCCcccc
Confidence 6555531 122211 4444433444552 44444444
No 26
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=96.71 E-value=0.0018 Score=42.78 Aligned_cols=36 Identities=25% Similarity=0.766 Sum_probs=28.4
Q ss_pred CCccccCCCCCCCCCCCeeecCCCCcccccCCCce-ecC
Q psy9822 184 PENECLNDHHTCDPQSEQCVDLADGFECVCGRGYN-KSE 221 (380)
Q Consensus 184 d~deC~~g~~~C~~~~g~C~n~~~g~~C~C~~Gy~-G~~ 221 (380)
++|||... +.|. ..++|+++.++|.|.|++||. |..
T Consensus 1 d~~~C~~~-~~C~-~~~~C~~~~g~~~C~C~~g~~~g~~ 37 (39)
T smart00179 1 DIDECASG-NPCQ-NGGTCVNTVGSYRCECPPGYTDGRN 37 (39)
T ss_pred CcccCcCC-CCcC-CCCEeECCCCCeEeECCCCCccCCc
Confidence 35677543 5677 678999999999999999998 543
No 27
>PF00053 Laminin_EGF: Laminin EGF-like (Domains III and V); InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and three short globular arms. The long arm consist of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below. +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=96.59 E-value=0.00089 Score=47.45 Aligned_cols=32 Identities=41% Similarity=1.040 Sum_probs=26.4
Q ss_pred cccCCCCCcccccCCCCCCCCCCCCCCCCccCCCC
Q psy9822 267 DCAGPDKLDVCLRCHNHTKGPQCDKCEPLYVGDPR 301 (380)
Q Consensus 267 ~C~~~~~~g~C~~C~~~~~G~~C~~C~~g~~G~~~ 301 (380)
.|.. .+++|. |+++|+|.+|++|+++||+.+.
T Consensus 12 ~C~~--~~G~C~-C~~~~~G~~C~~C~~g~~~~~~ 43 (49)
T PF00053_consen 12 TCDP--STGQCV-CKPGTTGPRCDQCKPGYFGLPS 43 (49)
T ss_dssp SEEE--TCEEES-BSTTEESTTS-EE-TTEECSTT
T ss_pred cccC--CCCEEe-ccccccCCcCcCCCCccccccC
Confidence 5764 689996 9999999999999999999875
No 28
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domai.
Probab=95.94 E-value=0.0081 Score=42.11 Aligned_cols=29 Identities=34% Similarity=0.950 Sum_probs=25.3
Q ss_pred cccCCCCCcccccCCCCCCCCCCCCCCCCccC
Q psy9822 267 DCAGPDKLDVCLRCHNHTKGPQCDKCEPLYVG 298 (380)
Q Consensus 267 ~C~~~~~~g~C~~C~~~~~G~~C~~C~~g~~G 298 (380)
.|.. .+++|+ |+++++|.+|++|+++|||
T Consensus 12 ~C~~--~~G~C~-C~~~~~G~~C~~C~~g~~g 40 (46)
T smart00180 12 TCDP--DTGQCE-CKPNVTGRRCDRCAPGYYG 40 (46)
T ss_pred cccC--CCCEEE-CCCCCCCCCCCcCCCCcCC
Confidence 4753 478996 9999999999999999999
No 29
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=95.77 E-value=0.012 Score=38.19 Aligned_cols=35 Identities=26% Similarity=0.803 Sum_probs=27.1
Q ss_pred CccccCCCCCCCCCCCeeecCCCCcccccCCCceecC
Q psy9822 185 ENECLNDHHTCDPQSEQCVDLADGFECVCGRGYNKSE 221 (380)
Q Consensus 185 ~deC~~g~~~C~~~~g~C~n~~~g~~C~C~~Gy~G~~ 221 (380)
+++|... ..|. .++.|++..++|.|.|++||.|..
T Consensus 2 ~~~C~~~-~~C~-~~~~C~~~~~~~~C~C~~g~~g~~ 36 (38)
T cd00054 2 IDECASG-NPCQ-NGGTCVNTVGSYRCSCPPGYTGRN 36 (38)
T ss_pred cccCCCC-CCcC-CCCEeECCCCCeEeECCCCCcCCc
Confidence 4566432 4576 678999999999999999998854
No 30
>PF01437 PSI: Plexin repeat; InterPro: IPR002165 This is a cysteine rich repeat found in several different extracellular receptors. The function of the repeat is unknown. Three copies of the repeat are found in plexin (P70206 from SWISSPROT) []. Two copies of the repeat are found in mahogany protein. A related Caenorhabditis elegans protein (Q19981 from SWISSPROT) contains four copies of the repeat, while the Met receptor contains a single copy of the repeat.; GO: 0016020 membrane; PDB: 3NVQ_B 3NVN_B 3OL2_B 3OKT_A 3AL8_A 3OKW_A 3OKY_B 3AFC_B 1OLZ_B 1SHY_B ....
Probab=95.73 E-value=0.0038 Score=44.66 Aligned_cols=32 Identities=44% Similarity=0.917 Sum_probs=27.6
Q ss_pred CCCCCCCchhhhcCCCCCCCccceeecCCCCcccCcc
Q psy9822 43 RCDQRTDCQDCLTSSGGEGGWHECRWSSMLNECISPS 79 (380)
Q Consensus 43 ~C~~~~~C~~Cl~~~~~~~~~~~C~Wc~~~~~C~~~~ 79 (380)
.|+.|++|++||.++. -.|+||..+++|+...
T Consensus 1 ~C~~~~sC~~Cl~~~d-----p~CgWc~~~~~C~~~~ 32 (51)
T PF01437_consen 1 NCSQYTSCSSCLSSRD-----PYCGWCSSENRCVSGS 32 (51)
T ss_dssp -GGGGSSHHHHHHSTC-----TTEEEETTTTEEEECC
T ss_pred CCcCCCcHHHHHcCCC-----cCccccCCCCcccCCc
Confidence 3789999999999873 3799999999999876
No 31
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=95.45 E-value=0.0058 Score=39.45 Aligned_cols=26 Identities=35% Similarity=1.032 Sum_probs=23.1
Q ss_pred CCCCCCCeeecCC-CCcccccCCCceec
Q psy9822 194 TCDPQSEQCVDLA-DGFECVCGRGYNKS 220 (380)
Q Consensus 194 ~C~~~~g~C~n~~-~g~~C~C~~Gy~G~ 220 (380)
+|. ++|+|++.. ++|+|.|++||+|+
T Consensus 5 ~C~-n~g~C~~~~~~~y~C~C~~G~~G~ 31 (32)
T PF00008_consen 5 PCQ-NGGTCIDLPGGGYTCECPPGYTGK 31 (32)
T ss_dssp SST-TTEEEEEESTSEEEEEEBTTEEST
T ss_pred cCC-CCeEEEeCCCCCEEeECCCCCccC
Confidence 566 789999988 99999999999986
No 32
>KOG4260|consensus
Probab=95.19 E-value=0.019 Score=54.77 Aligned_cols=66 Identities=30% Similarity=0.756 Sum_probs=46.1
Q ss_pred cCCCcccccccc-----cCCCCCCCcccC---CCCCcccccCCCCCCCCCCCCCCCCccCCCCC--CCCCcCCCCcc
Q psy9822 247 CDFGYVGVNCSI-----QCQCNGHADCAG---PDKLDVCLRCHNHTKGPQCDKCEPLYVGDPRD--NGECVPCSEYC 313 (380)
Q Consensus 247 C~~G~~G~~C~~-----~c~C~~~~~C~~---~~~~g~C~~C~~~~~G~~C~~C~~g~~G~~~~--~~~C~~C~~~C 313 (380)
|+.|..|++|.. .-+|++++.|.. ..+++.| .|+.+|+|..|..|.++|+-.-.+ .-.|..|.+-|
T Consensus 132 Cp~gtyGpdCl~Cpggser~C~GnG~C~GdGsR~GsGkC-kC~~GY~Gp~C~~Cg~eyfes~Rne~~lvCt~Ch~~C 207 (350)
T KOG4260|consen 132 CPDGTYGPDCLQCPGGSERPCFGNGSCHGDGSREGSGKC-KCETGYTGPLCRYCGIEYFESSRNEQHLVCTACHEGC 207 (350)
T ss_pred cCCCCcCCccccCCCCCcCCcCCCCcccCCCCCCCCCcc-cccCCCCCccccccchHHHHhhcccccchhhhhhhhh
Confidence 555666666653 235888888864 2468999 599999999999999999976554 23566655333
No 33
>smart00051 DSL delta serrate ligand.
Probab=94.70 E-value=0.03 Score=42.06 Aligned_cols=39 Identities=33% Similarity=0.647 Sum_probs=19.7
Q ss_pred cCCCcccccccccCCC----CCCCcccCCCCCcccccCCCCCCCCCC
Q psy9822 247 CDFGYVGVNCSIQCQC----NGHADCAGPDKLDVCLRCHNHTKGPQC 289 (380)
Q Consensus 247 C~~G~~G~~C~~~c~C----~~~~~C~~~~~~g~C~~C~~~~~G~~C 289 (380)
|+.+|.|..|+..|.+ .+|.+|.. .+.++ |.++|+|..|
T Consensus 21 C~~~~yG~~C~~~C~~~~d~~~~~~Cd~---~G~~~-C~~Gw~G~~C 63 (63)
T smart00051 21 CDENYYGEGCNKFCRPRDDFFGHYTCDE---NGNKG-CLEGWMGPYC 63 (63)
T ss_pred CCCCCcCCccCCEeCcCccccCCccCCc---CCCEe-cCCCCcCCCC
Confidence 4444455544443332 44555643 45553 6666666554
No 34
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=94.51 E-value=0.022 Score=36.85 Aligned_cols=21 Identities=48% Similarity=1.341 Sum_probs=15.1
Q ss_pred CCcccCC-CccccCCCcccccc
Q psy9822 236 RGVCSEP-DKCQCDFGYVGVNC 256 (380)
Q Consensus 236 ~G~C~~~-~~C~C~~G~~G~~C 256 (380)
||+|+.+ ++|+|+.||+|+.|
T Consensus 11 ~G~C~~~~g~C~C~~g~~G~~C 32 (32)
T PF07974_consen 11 HGTCVSPCGRCVCDSGYTGPDC 32 (32)
T ss_pred CCEEeCCCCEEECCCCCcCCCC
Confidence 5777765 67778877777764
No 35
>smart00423 PSI domain found in Plexins, Semaphorins and Integrins.
Probab=94.15 E-value=0.034 Score=38.77 Aligned_cols=30 Identities=33% Similarity=0.857 Sum_probs=26.0
Q ss_pred CCCCCCchhhhcCCCCCCCccceeecCCCCcccCc
Q psy9822 44 CDQRTDCQDCLTSSGGEGGWHECRWSSMLNECISP 78 (380)
Q Consensus 44 C~~~~~C~~Cl~~~~~~~~~~~C~Wc~~~~~C~~~ 78 (380)
|+.+++|++||..+. ..|+||..+++|+..
T Consensus 1 C~~~~sC~~C~~~~~-----~~C~Wc~~~~~C~~~ 30 (46)
T smart00423 1 CSKYTSCSECLLARD-----PYCAWCSSQGRCTSG 30 (46)
T ss_pred CCCCCcHHHHHcCCC-----CCCCccCCCCcccCc
Confidence 678899999999873 369999999999986
No 36
>KOG4260|consensus
Probab=94.11 E-value=0.06 Score=51.37 Aligned_cols=63 Identities=33% Similarity=0.701 Sum_probs=44.8
Q ss_pred CcccccCCCCCCCCCCCCCCCCccCCCCCCCCCcCCCCccCCCCCcccccccCCCCCCCCCCCcccccccccCCCCCCcc
Q psy9822 274 LDVCLRCHNHTKGPQCDKCEPLYVGDPRDNGECVPCSEYCNGHTGLCINASLASLPFSPESGGTSELAAFLDEGPTTRAR 353 (380)
Q Consensus 274 ~g~C~~C~~~~~G~~C~~C~~g~~G~~~~~~~C~~C~~~Cng~~~~C~~~~~~~~p~~~~~~~~~~~~~~~~~g~~~~g~ 353 (380)
.-.| |++++.|+.|..|.-+.. ++ |+++ +.|+-+.. +.+.+.
T Consensus 128 LkvC--Cp~gtyGpdCl~Cpggse---------r~----C~Gn-G~C~GdGs----------------------R~GsGk 169 (350)
T KOG4260|consen 128 LKVC--CPDGTYGPDCLQCPGGSE---------RP----CFGN-GSCHGDGS----------------------REGSGK 169 (350)
T ss_pred heec--cCCCCcCCccccCCCCCc---------CC----cCCC-CcccCCCC----------------------CCCCCc
Confidence 4456 999999999988864321 23 4444 34543211 136799
Q ss_pred cccCCCCCCCCCCCcCCCCCCC
Q psy9822 354 CMHCGNHTTGPKCEDCVEGKEN 375 (380)
Q Consensus 354 C~~C~~g~~G~~Ce~C~~Gy~g 375 (380)
|. |.+||+|+.|..|.++||-
T Consensus 170 Ck-C~~GY~Gp~C~~Cg~eyfe 190 (350)
T KOG4260|consen 170 CK-CETGYTGPLCRYCGIEYFE 190 (350)
T ss_pred cc-ccCCCCCccccccchHHHH
Confidence 99 9999999999999999984
No 37
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=94.09 E-value=0.054 Score=34.35 Aligned_cols=27 Identities=30% Similarity=0.912 Sum_probs=22.7
Q ss_pred CCCCCCCCeeecCCCCcccccCCCceec
Q psy9822 193 HTCDPQSEQCVDLADGFECVCGRGYNKS 220 (380)
Q Consensus 193 ~~C~~~~g~C~n~~~g~~C~C~~Gy~G~ 220 (380)
..|. .+++|++.+++|+|.|+.||.|.
T Consensus 6 ~~C~-~~~~C~~~~~~~~C~C~~g~~g~ 32 (36)
T cd00053 6 NPCS-NGGTCVNTPGSYRCVCPPGYTGD 32 (36)
T ss_pred CCCC-CCCEEecCCCCeEeECCCCCccc
Confidence 3555 57899999999999999999886
No 38
>KOG1217|consensus
Probab=94.04 E-value=0.16 Score=51.24 Aligned_cols=102 Identities=26% Similarity=0.691 Sum_probs=68.6
Q ss_pred CCCCCccccCCCCCCCCCCCeeecCCCCcccccCCCceecCCCCCcccc--CC-----CCCcCC-cccC-----CCcccc
Q psy9822 181 KCPPENECLNDHHTCDPQSEQCVDLADGFECVCGRGYNKSEEGGECVPV--CS-----QGCVRG-VCSE-----PDKCQC 247 (380)
Q Consensus 181 ~C~d~deC~~g~~~C~~~~g~C~n~~~g~~C~C~~Gy~G~~~~~~C~~~--C~-----~~C~~G-~C~~-----~~~C~C 247 (380)
.|.++++|....+ |. .+++|++.++.|.|.|++||.|..+ ..+... |. ..|.++ .|.. .+.|.|
T Consensus 267 ~~~~~~~C~~~~~-c~-~~~~C~~~~~~~~C~C~~g~~g~~~-~~~~~~~~C~~~~~~~~c~~g~~C~~~~~~~~~~C~c 343 (487)
T KOG1217|consen 267 TCVDVDSCALIAS-CP-NGGTCVNVPGSYRCTCPPGFTGRLC-TECVDVDECSPRNAGGPCANGGTCNTLGSFGGFRCAC 343 (487)
T ss_pred eeeeccccCCCCc-cC-CCCeeecCCCcceeeCCCCCCCCCC-ccccccccccccccCCcCCCCcccccCCCCCCCCcCC
Confidence 4668899976654 77 7899999999999999999999884 234332 52 226543 7732 247999
Q ss_pred CCCcccccccccC-CCC-----CCCcccC-CCCCcccccCCCCCCC
Q psy9822 248 DFGYVGVNCSIQC-QCN-----GHADCAG-PDKLDVCLRCHNHTKG 286 (380)
Q Consensus 248 ~~G~~G~~C~~~c-~C~-----~~~~C~~-~~~~g~C~~C~~~~~G 286 (380)
..+|+|..|+... .|. ..+.|.+ ..+.+.|. |..++.+
T Consensus 344 ~~~~~g~~C~~~~~~C~~~~~~~~~~c~~~~~~~~~c~-~~~~~~~ 388 (487)
T KOG1217|consen 344 GPGFTGRRCEDSNDECASSPCCPGGTCVNETPGSYRCA-CPAGFAG 388 (487)
T ss_pred CCCCCCCccccCCccccCCccccCCEeccCCCCCeEec-CCCcccc
Confidence 9999999998643 333 3344554 34556663 5555554
No 39
>smart00051 DSL delta serrate ligand.
Probab=93.71 E-value=0.061 Score=40.37 Aligned_cols=41 Identities=24% Similarity=0.606 Sum_probs=30.4
Q ss_pred ccCCCceecCCCCCccccCCCC--Cc-CCcccCCCccccCCCcccccc
Q psy9822 212 VCGRGYNKSEEGGECVPVCSQG--CV-RGVCSEPDKCQCDFGYVGVNC 256 (380)
Q Consensus 212 ~C~~Gy~G~~~~~~C~~~C~~~--C~-~G~C~~~~~C~C~~G~~G~~C 256 (380)
.|.++|+|.. |...|... +. +.+|...+.++|.+||+|++|
T Consensus 20 ~C~~~~yG~~----C~~~C~~~~d~~~~~~Cd~~G~~~C~~Gw~G~~C 63 (63)
T smart00051 20 TCDENYYGEG----CNKFCRPRDDFFGHYTCDENGNKGCLEGWMGPYC 63 (63)
T ss_pred eCCCCCcCCc----cCCEeCcCccccCCccCCcCCCEecCCCCcCCCC
Confidence 7888999987 55555322 33 567866778999999999876
No 40
>PF01437 PSI: Plexin repeat; InterPro: IPR002165 This is a cysteine rich repeat found in several different extracellular receptors. The function of the repeat is unknown. Three copies of the repeat are found in plexin (P70206 from SWISSPROT) []. Two copies of the repeat are found in mahogany protein. A related Caenorhabditis elegans protein (Q19981 from SWISSPROT) contains four copies of the repeat, while the Met receptor contains a single copy of the repeat.; GO: 0016020 membrane; PDB: 3NVQ_B 3NVN_B 3OL2_B 3OKT_A 3AL8_A 3OKW_A 3OKY_B 3AFC_B 1OLZ_B 1SHY_B ....
Probab=93.66 E-value=0.018 Score=41.15 Aligned_cols=30 Identities=50% Similarity=1.244 Sum_probs=24.9
Q ss_pred CCCCcccchhhcCCC--CCccCcCCCCCCCCceeeccCC
Q psy9822 105 SCSAYTQCSTCLRHA--HCGWCALQRDTTGGMGVCTEGS 141 (380)
Q Consensus 105 ~Cs~~~~C~~Cl~~~--~CGwC~~~~~~~~g~G~C~~G~ 141 (380)
.|+.|++|.+||... .||||.. .++|+.+.
T Consensus 1 ~C~~~~sC~~Cl~~~dp~CgWc~~-------~~~C~~~~ 32 (51)
T PF01437_consen 1 NCSQYTSCSSCLSSRDPYCGWCSS-------ENRCVSGS 32 (51)
T ss_dssp -GGGGSSHHHHHHSTCTTEEEETT-------TTEEEECC
T ss_pred CCcCCCcHHHHHcCCCcCccccCC-------CCcccCCc
Confidence 489999999999875 9999992 57888884
No 41
>PF12661 hEGF: Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=93.48 E-value=0.023 Score=29.32 Aligned_cols=12 Identities=58% Similarity=1.442 Sum_probs=6.4
Q ss_pred cccCCCcccccc
Q psy9822 245 CQCDFGYVGVNC 256 (380)
Q Consensus 245 C~C~~G~~G~~C 256 (380)
|+|++||+|.+|
T Consensus 2 C~C~~G~~G~~C 13 (13)
T PF12661_consen 2 CQCPPGWTGPNC 13 (13)
T ss_dssp EEE-TTEETTTT
T ss_pred ccCcCCCcCCCC
Confidence 556666666554
No 42
>smart00181 EGF Epidermal growth factor-like domain.
Probab=93.05 E-value=0.11 Score=33.40 Aligned_cols=25 Identities=24% Similarity=0.885 Sum_probs=20.8
Q ss_pred CCCCCCCCeeecCCCCcccccCCCcee
Q psy9822 193 HTCDPQSEQCVDLADGFECVCGRGYNK 219 (380)
Q Consensus 193 ~~C~~~~g~C~n~~~g~~C~C~~Gy~G 219 (380)
..|. .+ +|++..++|+|.|++||.|
T Consensus 6 ~~C~-~~-~C~~~~~~~~C~C~~g~~g 30 (35)
T smart00181 6 GPCS-NG-TCINTPGSYTCSCPPGYTG 30 (35)
T ss_pred CCCC-CC-EEECCCCCeEeECCCCCcc
Confidence 4565 34 8999999999999999988
No 43
>KOG3509|consensus
Probab=92.92 E-value=0.12 Score=57.62 Aligned_cols=75 Identities=28% Similarity=0.690 Sum_probs=59.4
Q ss_pred CcccccCCCCCCCCCCCCCCCCccCCCCC---CCCCcCCCCccCCCCCcccccccCCCCCCCCCCCcccccccccCCCCC
Q psy9822 274 LDVCLRCHNHTKGPQCDKCEPLYVGDPRD---NGECVPCSEYCNGHTGLCINASLASLPFSPESGGTSELAAFLDEGPTT 350 (380)
Q Consensus 274 ~g~C~~C~~~~~G~~C~~C~~g~~G~~~~---~~~C~~C~~~Cng~~~~C~~~~~~~~p~~~~~~~~~~~~~~~~~g~~~ 350 (380)
.-+| +|..++.|.+|+.|.++|.-..+. .+.+..|. |+.|+..|.. .
T Consensus 717 ~~~C-~c~~g~~G~~ce~c~e~~~ls~t~~~~~~~~~~c~--~~~h~~~c~~---------------------------~ 766 (964)
T KOG3509|consen 717 VEQC-QCPKGLVGTSCEDCAEGYTLSTTGGLYPGLCEDCE--CNSHISQCED---------------------------D 766 (964)
T ss_pred cccc-ccCccccCcccccccccccccccCCcCcccCcccc--cCCCcccccc---------------------------c
Confidence 4578 499999999999999988766532 24566777 8888877753 3
Q ss_pred CcccccCCCCCCCCCCCcCCCCCCCCCC
Q psy9822 351 RARCMHCGNHTTGPKCEDCVEGKENRQS 378 (380)
Q Consensus 351 ~g~C~~C~~g~~G~~Ce~C~~Gy~g~~~ 378 (380)
.++|..|++++.|.+|+.|.+||+++.-
T Consensus 767 ~~~nt~~q~~~~~~~~~~~~~g~~~da~ 794 (964)
T KOG3509|consen 767 LGYNTDCQNNTEGDRCELCSPGTYGDAR 794 (964)
T ss_pred ccccccccccCccceeeecCCCccccCc
Confidence 4566669999999999999999999863
No 44
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=89.61 E-value=0.21 Score=33.19 Aligned_cols=22 Identities=32% Similarity=0.877 Sum_probs=18.0
Q ss_pred CeeecCCCCcccccCCCceecC
Q psy9822 200 EQCVDLADGFECVCGRGYNKSE 221 (380)
Q Consensus 200 g~C~n~~~g~~C~C~~Gy~G~~ 221 (380)
..|++++++|+|.|++||+...
T Consensus 10 h~C~~~~g~~~C~C~~Gy~L~~ 31 (36)
T PF14670_consen 10 HICVNTPGSYRCSCPPGYKLAE 31 (36)
T ss_dssp SEEEEETTSEEEE-STTEEE-T
T ss_pred CCCccCCCceEeECCCCCEECc
Confidence 3799999999999999998765
No 45
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=89.04 E-value=0.26 Score=29.77 Aligned_cols=20 Identities=35% Similarity=0.961 Sum_probs=13.8
Q ss_pred CcccccCCCceecCCCCCcc
Q psy9822 208 GFECVCGRGYNKSEEGGECV 227 (380)
Q Consensus 208 g~~C~C~~Gy~G~~~~~~C~ 227 (380)
+|+|+|++||........|+
T Consensus 1 sy~C~C~~Gy~l~~d~~~C~ 20 (24)
T PF12662_consen 1 SYTCSCPPGYQLSPDGRSCE 20 (24)
T ss_pred CEEeeCCCCCcCCCCCCccc
Confidence 58899999998654333444
No 46
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=88.83 E-value=0.31 Score=31.50 Aligned_cols=26 Identities=35% Similarity=1.039 Sum_probs=21.3
Q ss_pred CCCCCCcccCCCCCcccccCCCCCCCCCC
Q psy9822 261 QCNGHADCAGPDKLDVCLRCHNHTKGPQC 289 (380)
Q Consensus 261 ~C~~~~~C~~~~~~g~C~~C~~~~~G~~C 289 (380)
.|++|++|+. ..++|+ |.++|+|..|
T Consensus 7 ~C~~~G~C~~--~~g~C~-C~~g~~G~~C 32 (32)
T PF07974_consen 7 ICSGHGTCVS--PCGRCV-CDSGYTGPDC 32 (32)
T ss_pred ccCCCCEEeC--CCCEEE-CCCCCcCCCC
Confidence 5889999985 158995 9999999876
No 47
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=84.46 E-value=0.83 Score=42.40 Aligned_cols=41 Identities=24% Similarity=0.632 Sum_probs=34.2
Q ss_pred cCCCCCCCccccCCCCCCCCCCCeeecCCCCcccccCCCceecC
Q psy9822 178 NYVKCPPENECLNDHHTCDPQSEQCVDLADGFECVCGRGYNKSE 221 (380)
Q Consensus 178 ~~~~C~d~deC~~g~~~C~~~~g~C~n~~~g~~C~C~~Gy~G~~ 221 (380)
....|.+.+||....+.|. ..|++++++|.|.|++||....
T Consensus 180 ~~~~C~~~~~C~~~~~~c~---~~C~~~~g~~~c~c~~g~~~~~ 220 (224)
T cd01475 180 QGKICVVPDLCATLSHVCQ---QVCISTPGSYLCACTEGYALLE 220 (224)
T ss_pred ccccCcCchhhcCCCCCcc---ceEEcCCCCEEeECCCCccCCC
Confidence 4566999999987777886 3799999999999999997643
No 48
>KOG1218|consensus
Probab=83.85 E-value=11 Score=36.16 Aligned_cols=41 Identities=37% Similarity=0.873 Sum_probs=25.8
Q ss_pred CCCcccccccccCCCCCC---CcccCCCCCcccccCCCCCCCCCCCC
Q psy9822 248 DFGYVGVNCSIQCQCNGH---ADCAGPDKLDVCLRCHNHTKGPQCDK 291 (380)
Q Consensus 248 ~~G~~G~~C~~~c~C~~~---~~C~~~~~~g~C~~C~~~~~G~~C~~ 291 (380)
..+|.|..|...++|+.+ .+|.+... .| .+...+.+.+|..
T Consensus 96 ~~~~~g~~C~~~~~~~~~c~~~~C~~~~~--~c-~~~~~~~~~~C~~ 139 (316)
T KOG1218|consen 96 LNGYEGPQCESPCPCGDGCAEKTCANPRR--EC-RCGGGYIGEQCGE 139 (316)
T ss_pred CCCCCcccccCCCCcCCcccccccCCCcc--ce-ecCCcCccccccc
Confidence 688888888887777655 56654211 35 2556666666644
No 49
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=83.58 E-value=1.1 Score=29.05 Aligned_cols=27 Identities=22% Similarity=0.566 Sum_probs=15.2
Q ss_pred CCCCCcccCCCCCcccccCCCCCC-CCCC
Q psy9822 262 CNGHADCAGPDKLDVCLRCHNHTK-GPQC 289 (380)
Q Consensus 262 C~~~~~C~~~~~~g~C~~C~~~~~-G~~C 289 (380)
|..++.|.+..+.+.|. |..+|. |.+|
T Consensus 11 C~~~~~C~~~~g~~~C~-C~~g~~~g~~C 38 (39)
T smart00179 11 CQNGGTCVNTVGSYRCE-CPPGYTDGRNC 38 (39)
T ss_pred cCCCCEeECCCCCeEeE-CCCCCccCCcC
Confidence 44445565555566663 666665 5554
No 50
>KOG1388|consensus
Probab=83.42 E-value=1.4 Score=40.86 Aligned_cols=98 Identities=20% Similarity=0.312 Sum_probs=63.3
Q ss_pred eeecCCCCccc-ccCCCceecCCCCCccccCCCCCcCCcccCCCcccc-CCCcccccccccCCCCCCCcccCCCCCcccc
Q psy9822 201 QCVDLADGFEC-VCGRGYNKSEEGGECVPVCSQGCVRGVCSEPDKCQC-DFGYVGVNCSIQCQCNGHADCAGPDKLDVCL 278 (380)
Q Consensus 201 ~C~n~~~g~~C-~C~~Gy~G~~~~~~C~~~C~~~C~~G~C~~~~~C~C-~~G~~G~~C~~~c~C~~~~~C~~~~~~g~C~ 278 (380)
.|.|.+.+++| .|..||+|+.-...|.+..-++.+.+-....++|.| ..|.+|..|..-..++.+..... .+.|.
T Consensus 68 ~~~N~~~g~~c~kc~~g~~GdtN~g~c~~~~~~g~~~~~~~~~~~c~c~~kgvvgd~c~~~e~~N~~r~~~~---kgtcy 144 (217)
T KOG1388|consen 68 RCENGTTGAHCEKCIVGFYGDTNGGKCQPCDCNGGASACVTLTGKCFCTTKGVVGDLCPKCEVPNRYRGKPL---KGTCY 144 (217)
T ss_pred eccCccccccCCceEEEEEecCCCCccCHhhhcCCeeeeeccCCccccccceEecccCcccccccccccCcc---cccee
Confidence 37788899999 599999997434566666333333333444678899 57899988886444555544422 22231
Q ss_pred --cCCCCCCCCCCCCCCCCccCCCC
Q psy9822 279 --RCHNHTKGPQCDKCEPLYVGDPR 301 (380)
Q Consensus 279 --~C~~~~~G~~C~~C~~g~~G~~~ 301 (380)
-..++....+.....++|+....
T Consensus 145 y~l~id~~ftf~l~~~d~~fv~sd~ 169 (217)
T KOG1388|consen 145 YELLIDGQFTFHLLQEDDGFVTSDN 169 (217)
T ss_pred eeeecccccccceeecCCCceeecc
Confidence 14566667777777888886653
No 51
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=83.38 E-value=0.25 Score=31.75 Aligned_cols=26 Identities=19% Similarity=0.411 Sum_probs=13.5
Q ss_pred CCCCCCcccCCC-CCcccccCCCCCCCC
Q psy9822 261 QCNGHADCAGPD-KLDVCLRCHNHTKGP 287 (380)
Q Consensus 261 ~C~~~~~C~~~~-~~g~C~~C~~~~~G~ 287 (380)
+|.++++|+... ..+.|+ |+++|+|.
T Consensus 5 ~C~n~g~C~~~~~~~y~C~-C~~G~~G~ 31 (32)
T PF00008_consen 5 PCQNGGTCIDLPGGGYTCE-CPPGYTGK 31 (32)
T ss_dssp SSTTTEEEEEESTSEEEEE-EBTTEEST
T ss_pred cCCCCeEEEeCCCCCEEeE-CCCCCccC
Confidence 455555555433 455563 55555554
No 52
>PF06247 Plasmod_Pvs28: Plasmodium ookinete surface protein Pvs28; InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=81.53 E-value=0.21 Score=45.30 Aligned_cols=89 Identities=30% Similarity=0.830 Sum_probs=57.8
Q ss_pred CCCCCCCeeecCC-----CCcccccCCCceecCCCCCcccc-CCCC-CcCCcccC-C-----CccccCCCcc---ccccc
Q psy9822 194 TCDPQSEQCVDLA-----DGFECVCGRGYNKSEEGGECVPV-CSQG-CVRGVCSE-P-----DKCQCDFGYV---GVNCS 257 (380)
Q Consensus 194 ~C~~~~g~C~n~~-----~g~~C~C~~Gy~G~~~~~~C~~~-C~~~-C~~G~C~~-~-----~~C~C~~G~~---G~~C~ 257 (380)
.|. +-++|++.+ ..|.|.|.+||.... +.|.|. |.+. |..|.|+. + ..|.|.-|+. ...|.
T Consensus 51 ~Cg-dya~C~~~~~~~~~~~~~C~C~~gY~~~~--~vCvp~~C~~~~Cg~GKCI~d~~~~~~~~CSC~IGkV~~dn~kCt 127 (197)
T PF06247_consen 51 PCG-DYAKCINQANKGEERAYKCDCINGYILKQ--GVCVPNKCNNKDCGSGKCILDPDNPNNPTCSCNIGKVPDDNKKCT 127 (197)
T ss_dssp EEE-TTEEEEE-SSTTSSTSEEEEE-TTEEESS--SSEEEGGGSS---TTEEEEEEEGGGSEEEEEE-TEEETTTTTESE
T ss_pred ccc-chhhhhcCCCcccceeEEEecccCceeeC--CeEchhhcCceecCCCeEEecCCCCCCceeEeeeceEeccCCccc
Confidence 566 667898755 589999999999887 689987 8764 99999986 2 2899999998 22333
Q ss_pred c----c--CCCCCCCcccCCCCCcccccCCCCCCC
Q psy9822 258 I----Q--CQCNGHADCAGPDKLDVCLRCHNHTKG 286 (380)
Q Consensus 258 ~----~--c~C~~~~~C~~~~~~g~C~~C~~~~~G 286 (380)
. . -.|..+-.|...++.|.|. |+.++.+
T Consensus 128 k~G~T~C~LKCk~nE~CK~~~~~Y~C~-~~~~~~~ 161 (197)
T PF06247_consen 128 KTGETKCSLKCKENEECKLVDGYYKCV-CKEGFPG 161 (197)
T ss_dssp EEE--------TTTEEEEEETTEEEEE-E-TT-EE
T ss_pred CCCccceeeecCCCcceeeeCcEEEee-cCCCCCC
Confidence 2 1 2355556687666788884 7666643
No 53
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=79.04 E-value=1.9 Score=27.39 Aligned_cols=27 Identities=22% Similarity=0.575 Sum_probs=14.9
Q ss_pred CCCCCcccCCCCCcccccCCCCCCCCCC
Q psy9822 262 CNGHADCAGPDKLDVCLRCHNHTKGPQC 289 (380)
Q Consensus 262 C~~~~~C~~~~~~g~C~~C~~~~~G~~C 289 (380)
|..++.|.+..+.+.|. |..+|.|.+|
T Consensus 11 C~~~~~C~~~~~~~~C~-C~~g~~g~~C 37 (38)
T cd00054 11 CQNGGTCVNTVGSYRCS-CPPGYTGRNC 37 (38)
T ss_pred cCCCCEeECCCCCeEeE-CCCCCcCCcC
Confidence 44445565555556663 6666666554
No 54
>smart00423 PSI domain found in Plexins, Semaphorins and Integrins.
Probab=78.30 E-value=1.2 Score=30.84 Aligned_cols=20 Identities=55% Similarity=1.472 Sum_probs=17.1
Q ss_pred CCCcccchhhcCCCC--CccCc
Q psy9822 106 CSAYTQCSTCLRHAH--CGWCA 125 (380)
Q Consensus 106 Cs~~~~C~~Cl~~~~--CGwC~ 125 (380)
|+.|++|++||.... |+||.
T Consensus 1 C~~~~sC~~C~~~~~~~C~Wc~ 22 (46)
T smart00423 1 CSKYTSCSECLLARDPYCAWCS 22 (46)
T ss_pred CCCCCcHHHHHcCCCCCCCccC
Confidence 678999999998633 99998
No 55
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=78.10 E-value=2.7 Score=26.15 Aligned_cols=26 Identities=19% Similarity=0.521 Sum_probs=14.9
Q ss_pred CCCCCCcccCCCCCcccccCCCCCCCC
Q psy9822 261 QCNGHADCAGPDKLDVCLRCHNHTKGP 287 (380)
Q Consensus 261 ~C~~~~~C~~~~~~g~C~~C~~~~~G~ 287 (380)
+|..++.|.+..+.+.|. |..+|.|.
T Consensus 7 ~C~~~~~C~~~~~~~~C~-C~~g~~g~ 32 (36)
T cd00053 7 PCSNGGTCVNTPGSYRCV-CPPGYTGD 32 (36)
T ss_pred CCCCCCEEecCCCCeEeE-CCCCCccc
Confidence 444455666555566663 66666655
No 56
>PF01414 DSL: Delta serrate ligand; InterPro: IPR001774 Ligands of the Delta/Serrate/lag-2 (DSL) family and their receptors, members of the lin-12/Notch family, mediate cell-cell interactions that specify cell fate in invertebrates and vertebrates. In Caenorhabditis elegans, two DSL genes, lag-2 and apx-1, influence different cell fate decisions during development []. Molecular interaction between Notch and Serrate, another EGF-homologous transmembrane protein containing a region of striking similarity to Delta, has been shown and the same two EGF repeats of Notch may also constitute a Serrate binding domain [, ].; GO: 0007154 cell communication, 0016020 membrane; PDB: 2VJ2_A.
Probab=77.99 E-value=0.68 Score=34.72 Aligned_cols=10 Identities=40% Similarity=1.009 Sum_probs=5.0
Q ss_pred CCCCCCCCCC
Q psy9822 280 CHNHTKGPQC 289 (380)
Q Consensus 280 C~~~~~G~~C 289 (380)
|.++|+|..|
T Consensus 54 C~~Gw~G~~C 63 (63)
T PF01414_consen 54 CLPGWTGPNC 63 (63)
T ss_dssp E-TTEESTTS
T ss_pred CCCCCcCCCC
Confidence 4556665544
No 57
>KOG1218|consensus
Probab=74.89 E-value=23 Score=33.97 Aligned_cols=13 Identities=15% Similarity=0.544 Sum_probs=7.9
Q ss_pred CccCCCCCCCccc
Q psy9822 176 SWNYVKCPPENEC 188 (380)
Q Consensus 176 ~w~~~~C~d~deC 188 (380)
.|.+..|....+|
T Consensus 98 ~~~g~~C~~~~~~ 110 (316)
T KOG1218|consen 98 GYEGPQCESPCPC 110 (316)
T ss_pred CCCcccccCCCCc
Confidence 4566667665555
No 58
>PF01414 DSL: Delta serrate ligand; InterPro: IPR001774 Ligands of the Delta/Serrate/lag-2 (DSL) family and their receptors, members of the lin-12/Notch family, mediate cell-cell interactions that specify cell fate in invertebrates and vertebrates. In Caenorhabditis elegans, two DSL genes, lag-2 and apx-1, influence different cell fate decisions during development []. Molecular interaction between Notch and Serrate, another EGF-homologous transmembrane protein containing a region of striking similarity to Delta, has been shown and the same two EGF repeats of Notch may also constitute a Serrate binding domain [, ].; GO: 0007154 cell communication, 0016020 membrane; PDB: 2VJ2_A.
Probab=74.85 E-value=0.61 Score=34.96 Aligned_cols=45 Identities=24% Similarity=0.579 Sum_probs=19.0
Q ss_pred CcccccCCCceecCCCCCccccCCCC---CcCCcccCCCccccCCCcccccc
Q psy9822 208 GFECVCGRGYNKSEEGGECVPVCSQG---CVRGVCSEPDKCQCDFGYVGVNC 256 (380)
Q Consensus 208 g~~C~C~~Gy~G~~~~~~C~~~C~~~---C~~G~C~~~~~C~C~~G~~G~~C 256 (380)
.++-.|.+.|+|.. |...|... =.|-+|...+.=+|.+||+|++|
T Consensus 16 ~~rv~C~~nyyG~~----C~~~C~~~~d~~ghy~Cd~~G~~~C~~Gw~G~~C 63 (63)
T PF01414_consen 16 RIRVVCDENYYGPN----CSKFCKPRDDSFGHYTCDSNGNKVCLPGWTGPNC 63 (63)
T ss_dssp -------TTEETTT----T-EE---EEETTEEEEE-SS--EEE-TTEESTTS
T ss_pred EEEEECCCCCCCcc----ccCCcCCCcCCcCCcccCCCCCCCCCCCCcCCCC
Confidence 44567888888877 44444222 11345555556678888888765
No 59
>PHA02887 EGF-like protein; Provisional
Probab=73.09 E-value=1.8 Score=36.34 Aligned_cols=25 Identities=36% Similarity=1.079 Sum_probs=18.5
Q ss_pred CcCCcccC-----CCccccCCCcccccccc
Q psy9822 234 CVRGVCSE-----PDKCQCDFGYVGVNCSI 258 (380)
Q Consensus 234 C~~G~C~~-----~~~C~C~~G~~G~~C~~ 258 (380)
|.||+|.- ...|.|+.||+|.+|+.
T Consensus 94 CiHG~C~yI~dL~epsCrC~~GYtG~RCE~ 123 (126)
T PHA02887 94 CINGECMNIIDLDEKFCICNKGYTGIRCDE 123 (126)
T ss_pred eeCCEEEccccCCCceeECCCCcccCCCCc
Confidence 66777764 24788888888888863
No 60
>smart00181 EGF Epidermal growth factor-like domain.
Probab=71.36 E-value=3.8 Score=25.96 Aligned_cols=23 Identities=22% Similarity=0.582 Sum_probs=12.4
Q ss_pred CCCCCcccCCCCCcccccCCCCCCC
Q psy9822 262 CNGHADCAGPDKLDVCLRCHNHTKG 286 (380)
Q Consensus 262 C~~~~~C~~~~~~g~C~~C~~~~~G 286 (380)
|..+ .|.+..+.+.| .|..+|.|
T Consensus 8 C~~~-~C~~~~~~~~C-~C~~g~~g 30 (35)
T smart00181 8 CSNG-TCINTPGSYTC-SCPPGYTG 30 (35)
T ss_pred CCCC-EEECCCCCeEe-ECCCCCcc
Confidence 3344 55554455666 36666655
No 61
>PF12946 EGF_MSP1_1: MSP1 EGF domain 1; InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=67.69 E-value=1.9 Score=28.85 Aligned_cols=24 Identities=29% Similarity=0.582 Sum_probs=19.9
Q ss_pred CCCeeecCC-CCcccccCCCceecC
Q psy9822 198 QSEQCVDLA-DGFECVCGRGYNKSE 221 (380)
Q Consensus 198 ~~g~C~n~~-~g~~C~C~~Gy~G~~ 221 (380)
.++.|++.. +.++|+|..||..++
T Consensus 9 ~NA~C~~~~dG~eecrCllgyk~~~ 33 (37)
T PF12946_consen 9 ANAGCFRYDDGSEECRCLLGYKKVG 33 (37)
T ss_dssp TTEEEEEETTSEEEEEE-TTEEEET
T ss_pred CCcccEEcCCCCEEEEeeCCccccC
Confidence 678899877 899999999998775
No 62
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=65.46 E-value=3 Score=35.61 Aligned_cols=25 Identities=36% Similarity=1.063 Sum_probs=16.2
Q ss_pred CcCCcccC-----CCccccCCCcccccccc
Q psy9822 234 CVRGVCSE-----PDKCQCDFGYVGVNCSI 258 (380)
Q Consensus 234 C~~G~C~~-----~~~C~C~~G~~G~~C~~ 258 (380)
|.||+|.- ...|.|+.||+|.+|+.
T Consensus 53 ClHG~C~yI~dl~~~~CrC~~GYtGeRCEh 82 (139)
T PHA03099 53 CLHGDCIHARDIDGMYCRCSHGYTGIRCQH 82 (139)
T ss_pred eECCEEEeeccCCCceeECCCCcccccccc
Confidence 55566653 24677777777777763
No 63
>KOG0196|consensus
Probab=55.74 E-value=17 Score=40.23 Aligned_cols=36 Identities=25% Similarity=0.781 Sum_probs=26.4
Q ss_pred CcccccCCCCC----CCCCCCCCCCCccCCCCCCCCCcCCC
Q psy9822 274 LDVCLRCHNHT----KGPQCDKCEPLYVGDPRDNGECVPCS 310 (380)
Q Consensus 274 ~g~C~~C~~~~----~G~~C~~C~~g~~G~~~~~~~C~~C~ 310 (380)
.+.|. |++|| .|..|+.|.+|+|-.......|.+|+
T Consensus 258 iG~C~-C~aGye~~~~~~~C~aCp~G~yK~~~~~~~C~~CP 297 (996)
T KOG0196|consen 258 IGGCV-CKAGYEEAENGKACQACPPGTYKASQGDSLCLPCP 297 (996)
T ss_pred cCcee-ecCCCCcccCCCcceeCCCCcccCCCCCCCCCCCC
Confidence 57784 88887 46788889888886655456677776
No 64
>PTZ00214 high cysteine membrane protein Group 4; Provisional
Probab=51.57 E-value=98 Score=34.71 Aligned_cols=133 Identities=20% Similarity=0.461 Sum_probs=0.0
Q ss_pred CCCCeeecC---------CCCccc-ccCCCceecCCCCCcccc-CCCCCcCCcccC--CCcc-ccCCCcccccccccCCC
Q psy9822 197 PQSEQCVDL---------ADGFEC-VCGRGYNKSEEGGECVPV-CSQGCVRGVCSE--PDKC-QCDFGYVGVNCSIQCQC 262 (380)
Q Consensus 197 ~~~g~C~n~---------~~g~~C-~C~~Gy~G~~~~~~C~~~-C~~~C~~G~C~~--~~~C-~C~~G~~G~~C~~~c~C 262 (380)
|.++.|+.. .....| .|.++|+... ..|... ...+ +-.|.. .++| .|..||.-..=..=-.|
T Consensus 504 p~nG~C~~~~~~~~~C~~~~~g~Ct~C~~gyfL~~--GGCY~~~~~PG--~~~C~~a~~g~Ct~C~~GYyl~~~~~C~~C 579 (800)
T PTZ00214 504 PFNGVCIPNTQRDAYCTSTANGACTTCSGAAFLMN--GGCYTTEHYPG--STICDKQSNGKCTTTKKGYGISPDGKLLEC 579 (800)
T ss_pred CCCCceecccCCCccccCCCCCcccCCCCCcEecC--CceeccCcCCC--CCccccCCCCcccccccCcccCCCCccccc
Q ss_pred CCC-CcccCCCCCcccccCCCCCCCC-----------CCCCCCCCccCCCCCCCCCcCCCCccCCCCCcccccccCCCCC
Q psy9822 263 NGH-ADCAGPDKLDVCLRCHNHTKGP-----------QCDKCEPLYVGDPRDNGECVPCSEYCNGHTGLCINASLASLPF 330 (380)
Q Consensus 263 ~~~-~~C~~~~~~g~C~~C~~~~~G~-----------~C~~C~~g~~G~~~~~~~C~~C~~~Cng~~~~C~~~~~~~~p~ 330 (380)
... .+|. .+....|++|..++.-. .|..|.+|||.+ +..|.+|.
T Consensus 580 d~~C~~C~-~~~~~~C~sC~~g~yL~~~~~~~~g~Cv~~~~C~~GYY~d---~~~C~~C~-------------------- 635 (800)
T PTZ00214 580 DPTCLACT-APGPGRCTRCPSDKLLKRASGAATGSCVDPGACVDGYYAD---GDACLPCA-------------------- 635 (800)
T ss_pred cchhhhcc-CCCCCcceECCCCCEeccccccccccccCcccCCCCcccC---CCccccCC--------------------
Q ss_pred CCCCCCcccccccccCCCCCCcccccCCCCCCCCCCCcCCCCCCCCCCCC
Q psy9822 331 SPESGGTSELAAFLDEGPTTRARCMHCGNHTTGPKCEDCVEGKENRQSRN 380 (380)
Q Consensus 331 ~~~~~~~~~~~~~~~~g~~~~g~C~~C~~g~~G~~Ce~C~~Gy~g~~~~~ 380 (380)
...|..| .....|.+|..+||....++
T Consensus 636 --------------------~~~C~tC---~~~~~C~~C~~~~~ls~~~~ 662 (800)
T PTZ00214 636 --------------------TPGCKTC---GHASFCTECAGELFVSLDGQ 662 (800)
T ss_pred --------------------ccccccc---cCCCCcCcCCCCceecCcCC
No 65
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=50.58 E-value=9.5 Score=32.67 Aligned_cols=18 Identities=28% Similarity=0.720 Sum_probs=16.2
Q ss_pred CCcccccCCCCCCCCCCCc
Q psy9822 350 TRARCMHCGNHTTGPKCED 368 (380)
Q Consensus 350 ~~g~C~~C~~g~~G~~Ce~ 368 (380)
....|. |..||+|.+||.
T Consensus 65 ~~~~Cr-C~~GYtGeRCEh 82 (139)
T PHA03099 65 DGMYCR-CSHGYTGIRCQH 82 (139)
T ss_pred CCceeE-CCCCcccccccc
Confidence 568899 999999999986
No 66
>PF03302 VSP: Giardia variant-specific surface protein; InterPro: IPR005127 During infection, the intestinal protozoan parasite Giardia lamblia virus undergoes continuous antigenic variation which is determined by diversification of the parasite's major surface antigen, named VSP (variant surface protein).
Probab=50.56 E-value=1.2e+02 Score=30.91 Aligned_cols=21 Identities=38% Similarity=0.982 Sum_probs=14.6
Q ss_pred CCCCCCCCccCCCCCCCCCcCCCC
Q psy9822 288 QCDKCEPLYVGDPRDNGECVPCSE 311 (380)
Q Consensus 288 ~C~~C~~g~~G~~~~~~~C~~C~~ 311 (380)
.|..|++|||-+. ..|.+|.+
T Consensus 92 ~c~~C~~G~y~~~---~~C~~C~~ 112 (397)
T PF03302_consen 92 CCSECPDGYYKNG---NKCVPCHE 112 (397)
T ss_pred cccCCCCCccccC---CCCCCCCc
Confidence 5778899998643 46777663
No 67
>PF01508 Paramecium_SA: Paramecium surface antigen domain; InterPro: IPR002895 The G surface protein of Paramecium primaurelia has important internal homologies and a periodic structure, which could be dictated in part by the rigid scaffolding of cysteine residues. The predicted secondary structure shows a quasi absence of alpha-helix and an abundance of beta-pleated sheets and random coils. The monotony of the amino acid sequence is in favour of a structural role for the protein []. This structure is based on the presence of 37 periods of about 75 residues, each period containing eight cysteine residues []. Homologies with other proteins are limited to surface antigens of trypanosomes.
Probab=45.98 E-value=16 Score=26.55 Aligned_cols=34 Identities=24% Similarity=0.612 Sum_probs=28.3
Q ss_pred CCCCCCCCCchhhhcCCCCCCCccceeecCCCCcccCc
Q psy9822 41 PLRCDQRTDCQDCLTSSGGEGGWHECRWSSMLNECISP 78 (380)
Q Consensus 41 p~~C~~~~~C~~Cl~~~~~~~~~~~C~Wc~~~~~C~~~ 78 (380)
|..|+.+++=..|+.... ...|+|-.++..|+..
T Consensus 27 ~~~C~~y~~~~~C~~~~~----~~~C~W~~~~~~C~~~ 60 (63)
T PF01508_consen 27 PSSCSDYKTQEQCTSSKS----GGKCYWDSSTSKCQDK 60 (63)
T ss_pred cccccccchhhcCcccCC----CCEEEEcCCCCcCcCC
Confidence 577999999999999853 2589999999999864
No 68
>PF01683 EB: EB module; InterPro: IPR006149 The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO
Probab=40.86 E-value=37 Score=23.75 Aligned_cols=29 Identities=28% Similarity=0.547 Sum_probs=19.3
Q ss_pred cccccccccCCCCCCCcccCCCCCcccccCCCCC
Q psy9822 251 YVGVNCSIQCQCNGHADCAGPDKLDVCLRCHNHT 284 (380)
Q Consensus 251 ~~G~~C~~~c~C~~~~~C~~~~~~g~C~~C~~~~ 284 (380)
..|..|+..-+|..++.|. .++|. |.+++
T Consensus 17 ~~g~~C~~~~qC~~~s~C~----~g~C~-C~~g~ 45 (52)
T PF01683_consen 17 QPGESCESDEQCIGGSVCV----NGRCQ-CPPGY 45 (52)
T ss_pred CCCCCCCCcCCCCCcCEEc----CCEeE-CCCCC
Confidence 3455666666777777786 57884 77665
No 69
>PTZ00382 Variant-specific surface protein (VSP); Provisional
Probab=37.26 E-value=76 Score=25.71 Aligned_cols=22 Identities=32% Similarity=0.726 Sum_probs=12.5
Q ss_pred CCCCCCCCccCCCCCCCCCcCCC
Q psy9822 288 QCDKCEPLYVGDPRDNGECVPCS 310 (380)
Q Consensus 288 ~C~~C~~g~~G~~~~~~~C~~C~ 310 (380)
.|..|.++|+-+. .+..|.+|.
T Consensus 5 ~Ct~C~~g~~~~~-~~~~C~~C~ 26 (96)
T PTZ00382 5 VCTSCDSDKKPNK-DGSGCVLCS 26 (96)
T ss_pred ccCcCCCCCccCC-CCCcCCcCC
Confidence 4567777776543 233466655
No 70
>PF00954 S_locus_glycop: S-locus glycoprotein family; InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=34.20 E-value=38 Score=27.67 Aligned_cols=33 Identities=21% Similarity=0.636 Sum_probs=22.7
Q ss_pred CCccccCCCCCCCCCCCeeecCCCCcccccCCCcee
Q psy9822 184 PENECLNDHHTCDPQSEQCVDLADGFECVCGRGYNK 219 (380)
Q Consensus 184 d~deC~~g~~~C~~~~g~C~n~~~g~~C~C~~Gy~G 219 (380)
..|+| .....|. ..+.|. ......|.|.+||.-
T Consensus 76 p~d~C-d~y~~CG-~~g~C~-~~~~~~C~Cl~GF~P 108 (110)
T PF00954_consen 76 PKDQC-DVYGFCG-PNGICN-SNNSPKCSCLPGFEP 108 (110)
T ss_pred cccCC-CCccccC-CccEeC-CCCCCceECCCCcCC
Confidence 44567 3456787 678894 345667999999863
No 71
>PHA02887 EGF-like protein; Provisional
Probab=31.78 E-value=29 Score=29.32 Aligned_cols=18 Identities=28% Similarity=0.857 Sum_probs=16.1
Q ss_pred CCcccccCCCCCCCCCCCc
Q psy9822 350 TRARCMHCGNHTTGPKCED 368 (380)
Q Consensus 350 ~~g~C~~C~~g~~G~~Ce~ 368 (380)
....|. |..||+|.+||.
T Consensus 106 ~epsCr-C~~GYtG~RCE~ 123 (126)
T PHA02887 106 DEKFCI-CNKGYTGIRCDE 123 (126)
T ss_pred CCceeE-CCCCcccCCCCc
Confidence 568999 999999999985
No 72
>smart00187 INB Integrin beta subunits (N-terminal portion of extracellular region). Portion of beta integrins that lies N-terminal to their EGF-like repeats. Integrins are cell adhesion molecules that mediate cell-extracellular matrix and cell-cell interactions. They contain both alpha and beta subunits. Beta integrins are proposed to have a von Willebrand factor type-A "insert" or "I" -like domain (although this remains to be confirmed).
Probab=31.19 E-value=27 Score=35.91 Aligned_cols=20 Identities=25% Similarity=1.095 Sum_probs=16.1
Q ss_pred cchhhcC-CCCCccCcCCCCCCC
Q psy9822 111 QCSTCLR-HAHCGWCALQRDTTG 132 (380)
Q Consensus 111 ~C~~Cl~-~~~CGwC~~~~~~~~ 132 (380)
+|.+|++ ++.|+||. +..+.
T Consensus 1 sC~eCi~~~p~CaWC~--~~~f~ 21 (423)
T smart00187 1 SCGECIQSGPNCAWCT--DENFT 21 (423)
T ss_pred ChhHHhCCCCCccccC--CccCC
Confidence 6899997 68999999 55553
No 73
>KOG3848|consensus
Probab=30.06 E-value=23 Score=36.15 Aligned_cols=32 Identities=31% Similarity=0.720 Sum_probs=28.3
Q ss_pred CCCCCCCCchhhhcCCCCCCCccceeecCCCCcccC
Q psy9822 42 LRCDQRTDCQDCLTSSGGEGGWHECRWSSMLNECIS 77 (380)
Q Consensus 42 ~~C~~~~~C~~Cl~~~~~~~~~~~C~Wc~~~~~C~~ 77 (380)
+-|-++.+|..|+++.-. -.|.||.-+|+|-+
T Consensus 317 PTCLqfn~Cd~C~ns~lt----FnClWC~vlqrCSs 348 (516)
T KOG3848|consen 317 PTCLQFNTCDTCTNSTLT----FNCLWCHVLQRCSS 348 (516)
T ss_pred CceeccCCcChhhccCCC----ceeehHhhhhhhcc
Confidence 569999999999998764 68999999999985
No 74
>PF14843 GF_recep_IV: Growth factor receptor domain IV; PDB: 1N8Z_C 1S78_A 3N85_A 2A91_A 3MZW_A 3BE1_A 3U2P_A 2AHX_B 3LTF_A 3I2T_A ....
Probab=29.73 E-value=17 Score=31.05 Aligned_cols=22 Identities=32% Similarity=1.013 Sum_probs=10.4
Q ss_pred cccccCCCCCCCCCC-CcCCCCC
Q psy9822 352 ARCMHCGNHTTGPKC-EDCVEGK 373 (380)
Q Consensus 352 g~C~~C~~g~~G~~C-e~C~~Gy 373 (380)
..|+.|.....+..| +.|+.||
T Consensus 73 ~~C~~C~~~~~~~~Cv~~CP~~~ 95 (132)
T PF14843_consen 73 DQCLSCAHFRDGGQCVSSCPEGT 95 (132)
T ss_dssp TTBSSBSSEEETTEEESS--EEE
T ss_pred chhhhccccCCCCcccCcCCCCC
Confidence 445555554455555 3466664
No 75
>PF00362 Integrin_beta: Integrin, beta chain; InterPro: IPR002369 Integrins are the major metazoan receptors for cell adhesion to extracellular matrix proteins and, in vertebrates, also play important roles in certain cell-cell adhesions, make transmembrane connections to the cytoskeleton and activate many intracellular signalling pathways [, ]. The integrin receptors are composed of alpha and beta subunit heterodimers. Each subunit crosses the membrane once, with most of the polypeptide residing in the extracellular space, and has two short cytoplasmic domains. Some members of this family have EGF repeats at the C terminus and also have a vWA domain inserted within the integrin domain at the N terminus. Most integrins recognise relatively short peptide motifs, and in general require an acidic amino acid to be present. Ligand specificity depends upon both the alpha and beta subunits []. There are at least 18 types of alpha and 8 types of beta subunits recognised in humans []. Each alpha subunit tends to associate only with one type of beta subunit, but there are exceptions to this rule []. Each association of alpha and beta subunits has its own binding specificity and signalling properties. Many integrins require activation on the cell surface before they can bind ligands. Integrins frequently intercommunicate, and binding at one integrin receptor activate or inhibit another. The structure of unliganded alphaV beta3 showed the molecule to be folded, with the head bent over towards the C termini of the legs which would normally be inserted into the membrane []. The head comprises a beta propeller domain at the end terminus of the alphaV subunit and an I/A domain inserted into a loop on the top of the hybrid domain in the beta subunit. The I/A domain consists of a Rossman fold with a core of beta parallel sheets surrounded by amphipathic alpha helices. Integrins are important therapeutic targets in conditions such as atherosclerosis, thrombosis, cancer and asthma []. At the N terminus of the beta subunit is a cysteine-containing domain reminiscent of that found in presenillins and semaphorins, which has hence been termed the PSI domain. C-terminal to the PSI domain is an A-domain, which has been predicted to adopt a Rossmann fold similar to that of the alpha subunit, but with additional loops between the second and third beta strands []. The murine gene Pactolus shares significant similarity with the beta subunit [], but lacks either one or both of the inserted loops. The C-terminal portion of the beta subunit extracellular domain contains an internally disulphide-bonded cysteine-rich region, while the intracellular tail contains putative sites of interaction with a variety of intracellular signalling and cytoskeletal proteins, such as focal adhesion kinase and alpha-actinin respectively []. Integrin cytoplasmic domains are normally less than 50 amino acids in length, with the beta-subunit sequences exhibiting greater homology to each other than the alpha-subunit sequences. This is consistent with current evidence that the beta subunit is the principal site for binding of cytoskeletal and signalling molecules, whereas the alpha subunit has a regulatory role. The first 20 amino acids of the beta-subunit cytoplasmic domain are also alpha helical, but the final 25 residues are disordered and, apart from a turn that follows a conserved NPxY motif, appear to lack defined structure, suggesting that this is adopted on effector binding. The two membrane-proximal helices mediate the link between the subunits via a series of hydrophobic and electrostatic contacts. This entry represents the N-terminal portion of the extracellular region of integrin beta subunits.; GO: 0005488 binding, 0007155 cell adhesion, 0007160 cell-matrix adhesion; PDB: 3VI4_B 3VI3_B 2VDQ_B 3IJE_B 1M1X_B 2VDR_B 3NIF_B 3NID_D 1TYE_F 2Q6W_F ....
Probab=29.18 E-value=25 Score=36.26 Aligned_cols=19 Identities=32% Similarity=1.209 Sum_probs=15.3
Q ss_pred cchhhcCC-CCCccCcCCCCCC
Q psy9822 111 QCSTCLRH-AHCGWCALQRDTT 131 (380)
Q Consensus 111 ~C~~Cl~~-~~CGwC~~~~~~~ 131 (380)
+|.+|++. +.|+||. ++.+
T Consensus 1 SC~eCI~~~p~CaWC~--~~~f 20 (426)
T PF00362_consen 1 SCSECIQSGPSCAWCS--DENF 20 (426)
T ss_dssp SHHHHHTT-TTEEEE---STTC
T ss_pred ChhHHhCCCCCCcccC--CCCc
Confidence 69999996 8999999 6665
No 76
>PF05283 MGC-24: Multi-glycosylated core protein 24 (MGC-24); InterPro: IPR007947 CD164 is a mucin-like receptor, or sialomucin, with specificity in receptor/ ligand interactions that depends on the structural characteristics of the mucin-like receptor. Its functions include mediating, or regulating, haematopoietic progenitor cell adhesion and the negative regulation of their growth and/or-differentiation. It exists in the native state as a disulphide- linked homodimer of two 80-85kDa subunits. It is usually expressed by CD34+ and CD341o/- haematopoietic stem cells and associated microenvironmental cells. It contains, in its extracellular region, two mucin domains (I and II) linked by a non-mucin domain, which has been predicted to contain intra- disulphide bridges. This receptor may play a key role in haematopoiesis by facilitating the adhesion of human CD34+ cells to bone marrow stroma and by negatively regulating CD34+ CD341o/- haematopoietic progenitor cell proliferation. These effects involve the CD164 class I and/or II epitopes recognised by the monoclonal antibodies (mAbs) 105A5 and 103B2/9E10. These epitopes are carbohydrate-dependent and are located on the N-terminal mucin domain I [, ]. It has been found that murine MGC-24v and rat endolyn share significant sequence similarities with human CD164. However, CD164 lacks the consensus glycosaminoglycan (GAG)-attachment site found in MGC-24; it is possible that GAG-association is responsible for the high molecular weight of the epithelial-derived MGC-24 glycoprotein []. Genomic structure studies have placed CD164 within the mucin-subgroup that comprises multiple exons, and demonstrate the diverse chromosomal distribution of this family of molecules. Molecules with such multiple exons may have sophisticated regulatory mechanisms that involve not only post-translational modifications of the oligosaccharide side chains, but also differential exon usage. Although differences in the intron and exon sizes are seen between the mouse and human genes, the predicted proteins are similar in size and structure, maintaining functionally important motifs that regulate cell proliferation or subcellular distribution []. CD164 is a gene whose expression depends on differential usage of poly- adenylation sites within the 3'-UTR. The conserved distribution of the 3.2- and 1.2-kb CD164 transcripts between mouse and human suggests that (i) a mechanism may exist to regulate tissue-specific polyadenylation, and (ii) differences in polyadenylation are important for the expression and function of CD164 in different tissues. Two other aspects of the structure of CD164 are of particular interest. First, it shares one of several conserved features of a cytokine-binding pocket - in this respect, it is notable that evidence exists for a class of cell-surface sialomucin modulators that directly interact with growth factor receptors to regulate their response to physiological ligands. Second, its cytoplasmic tail contains a C-terminal YHTL motif found in many endocytic membrane proteins or receptors. These Tyr-based motifs bind to adaptor proteins, which mediate the sorting of membrane proteins into transport vesicles from the plasma membrane to the endosomes, and between intracellular compartments.
Probab=24.63 E-value=73 Score=29.16 Aligned_cols=39 Identities=23% Similarity=0.651 Sum_probs=27.0
Q ss_pred CCCCCCCCCCchhhhcCCCCCCCccceeec--CCCCcccCccch
Q psy9822 40 CPLRCDQRTDCQDCLTSSGGEGGWHECRWS--SMLNECISPSYE 81 (380)
Q Consensus 40 cp~~C~~~~~C~~Cl~~~~~~~~~~~C~Wc--~~~~~C~~~~~~ 81 (380)
=+.-|.++++|..|++... ....|+|- .....|+..+.+
T Consensus 52 ~~~~C~~~~~C~sCVn~t~---n~s~C~W~~C~~~~~Cv~~stV 92 (186)
T PF05283_consen 52 APETCEQHNSCESCVNATH---NNSTCVWMECKGESYCVNNSTV 92 (186)
T ss_pred CcccCCCCccchhhhCccC---ccCceEeeecCCCCcccCCccc
Confidence 3667889999999997643 24579994 445666655543
No 77
>cd00185 TNFR Tumor necrosis factor receptor (TNFR) domain; superfamily of TNF-like receptor domains. When bound to TNF-like cytokines, TNFRs trigger multiple signal transduction pathways, they are involved in inflammation response, apoptosis, autoimmunity and organogenesis. TNFRs domains are elongated with generally three tandem repeats of cysteine-rich domains (CRDs). They fit in the grooves between protomers within the ligand trimer. Some TNFRs, such as NGFR and HveA, bind ligands with no structural similarity to TNF and do not bind ligand trimers.
Probab=22.36 E-value=1.7e+02 Score=23.33 Aligned_cols=27 Identities=26% Similarity=0.810 Sum_probs=16.9
Q ss_pred CCCCCCCCCccCCCCCC-CCCcCCCCccC
Q psy9822 287 PQCDKCEPLYVGDPRDN-GECVPCSEYCN 314 (380)
Q Consensus 287 ~~C~~C~~g~~G~~~~~-~~C~~C~~~Cn 314 (380)
..|..|.+|+|-+..+. ..|++|. .|.
T Consensus 33 t~C~~C~~g~ys~~~~~~~~C~~c~-~C~ 60 (98)
T cd00185 33 TVCEPCPPGTYTDSWNHLPKCLSCR-TCD 60 (98)
T ss_pred CeecCCCCCCcccCCCCCCcCCcCc-cCC
Confidence 45667888877665543 4577765 354
Done!