Query psy11297
Match_columns 132
No_of_seqs 153 out of 1321
Neff 8.8
Searched_HMMs 46136
Date Fri Aug 16 23:19:09 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy11297.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/11297hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG4289|consensus 99.8 2.1E-19 4.6E-24 150.2 9.2 117 11-132 1220-1371(2531)
2 KOG3514|consensus 99.3 1.7E-11 3.6E-16 101.7 8.5 104 21-132 612-719 (1591)
3 KOG1219|consensus 99.2 1.5E-11 3.3E-16 107.5 7.1 72 17-90 3889-3961(4289)
4 KOG4289|consensus 99.1 6.7E-10 1.5E-14 94.6 10.1 103 21-131 1491-1595(2531)
5 KOG1219|consensus 99.1 2.3E-10 4.9E-15 100.4 5.6 67 22-89 3855-3921(4289)
6 KOG3516|consensus 99.0 1.5E-09 3.3E-14 91.1 9.7 44 84-131 798-841 (1306)
7 KOG3516|consensus 99.0 2.4E-09 5.1E-14 90.0 10.2 96 24-132 951-1066(1306)
8 PF00008 EGF: EGF-like domain 98.9 2.4E-09 5.2E-14 54.1 2.7 32 34-65 1-32 (32)
9 KOG3514|consensus 98.7 4.1E-08 9E-13 82.1 8.1 89 36-132 215-306 (1591)
10 smart00179 EGF_CA Calcium-bind 98.5 2.3E-07 5E-12 48.2 4.1 36 31-67 2-39 (39)
11 cd00054 EGF_CA Calcium-binding 98.3 1.6E-06 3.4E-11 44.5 4.1 36 31-67 2-38 (38)
12 cd00110 LamG Laminin G domain; 98.1 2.1E-05 4.5E-10 52.3 7.2 52 79-132 3-55 (151)
13 cd00053 EGF Epidermal growth f 98.0 1.5E-05 3.3E-10 40.1 4.0 32 34-66 2-35 (36)
14 PF07645 EGF_CA: Calcium-bindi 98.0 1E-05 2.2E-10 43.1 3.2 32 30-62 1-34 (42)
15 smart00181 EGF Epidermal growt 97.9 2.1E-05 4.7E-10 39.8 3.9 32 34-67 2-35 (35)
16 PF12661 hEGF: Human growth fa 97.1 0.00023 5E-09 28.4 0.9 13 54-66 1-13 (13)
17 PF07974 EGF_2: EGF-like domai 97.1 0.001 2.3E-08 33.2 3.3 27 37-66 6-32 (32)
18 PF12947 EGF_3: EGF domain; I 95.8 0.0075 1.6E-07 30.9 1.8 26 38-64 7-32 (36)
19 PHA03099 epidermal growth fact 95.1 0.019 4.1E-07 37.7 2.4 39 31-70 42-84 (139)
20 KOG1214|consensus 95.1 0.03 6.5E-07 47.3 3.9 42 19-63 815-858 (1289)
21 PF12662 cEGF: Complement Clr- 95.0 0.027 5.9E-07 26.2 2.1 16 52-67 1-20 (24)
22 PHA02887 EGF-like protein; Pro 94.8 0.03 6.6E-07 36.2 2.7 36 32-68 84-123 (126)
23 KOG1225|consensus 94.3 0.07 1.5E-06 43.1 4.3 44 18-69 300-343 (525)
24 KOG1217|consensus 94.0 0.088 1.9E-06 40.8 4.4 53 16-69 154-208 (487)
25 PF12946 EGF_MSP1_1: MSP1 EGF 94.0 0.057 1.2E-06 27.8 2.1 29 34-62 2-30 (37)
26 KOG1217|consensus 94.0 0.17 3.6E-06 39.3 5.7 47 19-66 257-306 (487)
27 KOG4260|consensus 93.1 0.11 2.5E-06 38.5 3.2 47 18-68 132-183 (350)
28 smart00051 DSL delta serrate l 92.6 0.19 4E-06 29.0 3.1 42 18-66 21-63 (63)
29 cd01475 vWA_Matrilin VWA_Matri 91.8 0.24 5.2E-06 35.4 3.6 36 24-63 181-218 (224)
30 KOG1225|consensus 91.0 0.59 1.3E-05 37.9 5.3 45 18-70 269-313 (525)
31 PF14670 FXa_inhibition: Coagu 90.7 0.22 4.8E-06 25.4 1.8 18 44-62 11-28 (36)
32 KOG1214|consensus 87.5 1.9 4.1E-05 37.1 5.9 31 43-74 799-831 (1289)
33 PF00053 Laminin_EGF: Laminin 86.5 0.88 1.9E-05 24.5 2.5 23 43-68 11-33 (49)
34 cd00055 EGF_Lam Laminin-type e 84.8 1.1 2.4E-05 24.3 2.4 16 53-68 19-34 (50)
35 PF12955 DUF3844: Domain of un 83.0 0.71 1.5E-05 29.3 1.2 32 37-68 13-61 (103)
36 KOG1226|consensus 81.9 2 4.3E-05 36.3 3.7 40 24-68 539-581 (783)
37 KOG1836|consensus 81.5 3 6.5E-05 38.6 4.9 60 11-71 755-816 (1705)
38 PF01414 DSL: Delta serrate li 79.3 0.98 2.1E-05 26.0 0.9 11 56-66 53-63 (63)
39 KOG3509|consensus 78.0 6.5 0.00014 34.4 5.7 56 15-72 158-213 (964)
40 PF00954 S_locus_glycop: S-loc 75.7 3.4 7.4E-05 26.1 2.8 32 30-63 76-108 (110)
41 smart00180 EGF_Lam Laminin-typ 74.0 2.7 5.9E-05 22.3 1.7 16 53-68 18-33 (46)
42 KOG3509|consensus 71.1 6.7 0.00015 34.3 4.1 42 31-73 406-447 (964)
43 KOG4260|consensus 71.1 2.4 5.2E-05 31.7 1.3 33 30-63 235-269 (350)
44 PF09064 Tme5_EGF_like: Thromb 66.5 5.1 0.00011 20.1 1.5 25 34-62 3-27 (34)
45 PF04863 EGF_alliinase: Alliin 60.0 5.3 0.00011 22.4 1.0 35 37-71 17-54 (56)
46 KOG0994|consensus 52.4 16 0.00034 33.0 3.1 32 33-68 1068-1099(1758)
47 cd01328 FSL_SPARC Follistatin- 50.6 29 0.00063 21.3 3.3 27 33-59 1-27 (86)
48 PHA02885 putative interleukin 43.5 35 0.00076 21.8 2.9 24 106-129 57-80 (135)
49 smart00274 FOLN Follistatin-N- 41.1 41 0.00089 15.6 2.6 23 33-55 1-23 (26)
50 PF12020 TAFA: TAFA family; I 34.4 52 0.0011 20.2 2.6 39 9-47 27-70 (90)
51 KOG1226|consensus 34.3 80 0.0017 27.2 4.4 50 18-70 570-623 (783)
52 KOG0994|consensus 31.2 1.1E+02 0.0025 28.0 4.9 37 33-69 905-950 (1758)
53 PF02973 Sialidase: Sialidase, 30.9 1.8E+02 0.004 20.5 5.3 41 91-132 29-71 (190)
54 KOG3607|consensus 30.6 38 0.00082 28.9 2.1 29 38-70 631-659 (716)
55 KOG1218|consensus 25.5 2.5E+02 0.0055 20.5 5.6 30 53-82 162-191 (316)
56 KOG4004|consensus 22.0 96 0.0021 22.4 2.5 31 29-59 48-78 (259)
57 PF06247 Plasmod_Pvs28: Plasmo 21.9 48 0.001 23.5 1.0 26 37-62 50-79 (197)
58 PF07966 A1_Propeptide: A1 Pro 20.6 32 0.0007 16.4 -0.0 8 2-9 4-11 (29)
No 1
>KOG4289|consensus
Probab=99.80 E-value=2.1e-19 Score=150.20 Aligned_cols=117 Identities=28% Similarity=0.539 Sum_probs=98.0
Q ss_pred CccCCeecCCCCCCCCcccCCCCCCCCCCCCCCeEeeCCCCceEEeCCCCccCCcccccCCC------------------
Q psy11297 11 GCLSASVVQSSQASPNLRLEESPCLSHPCQNGATCQDEEDGLFECLCSPEFTGYLCHTRAPP------------------ 72 (132)
Q Consensus 11 ~~~~~~~cp~~~~g~~C~~~~~~C~~~pC~ngg~C~~~~~~~~~C~C~~g~~G~~C~~~~~~------------------ 72 (132)
+.++.. ||..|+|..|+.++|.|-+.||.|+|+|....+ +|+|.|.+||+|++||.+...
T Consensus 1220 nglrCr-CPpGFTgd~CeTeiDlCYs~pC~nng~C~srEg-gYtCeCrpg~tGehCEvs~~agrCvpGvC~nggtC~~~~ 1297 (2531)
T KOG4289|consen 1220 NGLRCR-CPPGFTGDYCETEIDLCYSGPCGNNGRCRSREG-GYTCECRPGFTGEHCEVSARAGRCVPGVCKNGGTCVNLL 1297 (2531)
T ss_pred CceeEe-CCCCCCcccccchhHhhhcCCCCCCCceEEecC-ceeEEecCCccccceeeecccCccccceecCCCEEeecC
Confidence 334433 455566999999999999999999999999985 599999999999999986532
Q ss_pred -----------------CCCCCCCCCCcceEEecccCCceeEEEEEEEEeCCCCeEEEEeccCCCCCCCeEEEEEcC
Q psy11297 73 -----------------KLYDTPAFNGSSHIVMKTLKAYNKLSIEIEFKTNKNDGILLYNQQNLDGTGDFVSLAIVN 132 (132)
Q Consensus 73 -----------------c~~~~~~~~g~~~~~~~~~~~~~~~~i~~~f~t~~~~g~ll~~~~~~~~~~d~~~l~l~~ 132 (132)
|+....+|.+.+|+.|.+++.+.++.++|.|.|...+|+|+|+|.+ ++||++|++++
T Consensus 1298 nggf~c~Cp~ge~e~prC~v~trSFp~~sfv~frglrqRfh~TlslsfaT~~~nGlL~ynGne---khDFvalevVd 1371 (2531)
T KOG4289|consen 1298 NGGFCCHCPYGEFEDPRCEVTTRSFPPESFVTFRGLRQRFHFTLSLSFATIERNGLLLYNGNE---KHDFVALEVVD 1371 (2531)
T ss_pred CCceeccCCCcccCCCceEEEeeccCchheEEEeccccceEEEEEEEEEEeeecceEEecCCc---ccceEeeeeee
Confidence 1123345777889999999999999999999999999999999954 89999999975
No 2
>KOG3514|consensus
Probab=99.27 E-value=1.7e-11 Score=101.69 Aligned_cols=104 Identities=27% Similarity=0.551 Sum_probs=85.2
Q ss_pred CCCCCCcccCCC-CCCCCCCCCCCeEeeCCCCceEEeC-CCCccCCcccccCCCCCCCCCCCCCcceEEecc--cCCcee
Q psy11297 21 SQASPNLRLEES-PCLSHPCQNGATCQDEEDGLFECLC-SPEFTGYLCHTRAPPKLYDTPAFNGSSHIVMKT--LKAYNK 96 (132)
Q Consensus 21 ~~~g~~C~~~~~-~C~~~pC~ngg~C~~~~~~~~~C~C-~~g~~G~~C~~~~~~c~~~~~~~~g~~~~~~~~--~~~~~~ 96 (132)
..+-+.|..... .|.++||+|+|+|...|+. |.|.| ..+|.|+.|+.+. ....++|.-|+.+-. .+....
T Consensus 612 agvkpsCs~~~~~~C~~nPC~N~g~C~egwNr-fiCDCs~T~~~G~~CerE~-----t~ls~nGs~~m~i~L~~~~~tq~ 685 (1591)
T KOG3514|consen 612 AGVKPSCSLSNEKICESNPCQNGGKCSEGWNR-FICDCSGTGFEGRTCEREA-----TALSYNGSMSMKIVLPHTMHTQA 685 (1591)
T ss_pred cccCcccchhhccccCCCcccCCCCccccccc-cccccccCcccCcccccee-----eeEEEcCeeeEEEEecccceeec
Confidence 445668875544 7999999999999999985 99999 6789999999877 678889988887542 234456
Q ss_pred EEEEEEEEeCCCCeEEEEeccCCCCCCCeEEEEEcC
Q psy11297 97 LSIEIEFKTNKNDGILLYNQQNLDGTGDFVSLAIVN 132 (132)
Q Consensus 97 ~~i~~~f~t~~~~g~ll~~~~~~~~~~d~~~l~l~~ 132 (132)
.+++|+|||..+.|+||-.+++ +..|-|.|+|.+
T Consensus 686 E~v~iRF~t~r~~Gll~~Tta~--~s~D~l~l~L~~ 719 (1591)
T KOG3514|consen 686 EDVSIRFRTQRAYGLLFATTAR--GSADTLRLELDA 719 (1591)
T ss_pred ceEEEEEEecccceeEEEeccC--CCCceEEEEEec
Confidence 7899999999999999988886 358999999864
No 3
>KOG1219|consensus
Probab=99.25 E-value=1.5e-11 Score=107.45 Aligned_cols=72 Identities=26% Similarity=0.510 Sum_probs=65.0
Q ss_pred ecCCCCCCCCcccCCCCCCCCCCCCCCeEeeCCCCceEEeCCCCccCCccccc-CCCCCCCCCCCCCcceEEecc
Q psy11297 17 VVQSSQASPNLRLEESPCLSHPCQNGATCQDEEDGLFECLCSPEFTGYLCHTR-APPKLYDTPAFNGSSHIVMKT 90 (132)
Q Consensus 17 ~cp~~~~g~~C~~~~~~C~~~pC~ngg~C~~~~~~~~~C~C~~g~~G~~C~~~-~~~c~~~~~~~~g~~~~~~~~ 90 (132)
-||..|.|.+||.++.+|.++||.+||+|+...++ |.|.|+.||+|.+||.+ +++|. .++|.+|+.|+..++
T Consensus 3889 kCpsqysG~~CEi~~epC~snPC~~GgtCip~~n~-f~CnC~~gyTG~~Ce~~Gi~eCs-~n~C~~gg~C~n~~g 3961 (4289)
T KOG1219|consen 3889 KCPSQYSGNHCEIDLEPCASNPCLTGGTCIPFYNG-FLCNCPNGYTGKRCEARGISECS-KNVCGTGGQCINIPG 3961 (4289)
T ss_pred eCcccccCcccccccccccCCCCCCCCEEEecCCC-eeEeCCCCccCceeecccccccc-cccccCCceeeccCC
Confidence 46677889999999999999999999999999865 99999999999999998 89997 889999998888764
No 4
>KOG4289|consensus
Probab=99.10 E-value=6.7e-10 Score=94.60 Aligned_cols=103 Identities=22% Similarity=0.401 Sum_probs=80.1
Q ss_pred CCCCCCcccCCCCCCCCCCCCCCeEeeCCCCceEEeCCCCccCCcccccCCCCCCCCCCCCCcceEEecccC--CceeEE
Q psy11297 21 SQASPNLRLEESPCLSHPCQNGATCQDEEDGLFECLCSPEFTGYLCHTRAPPKLYDTPAFNGSSHIVMKTLK--AYNKLS 98 (132)
Q Consensus 21 ~~~g~~C~~~~~~C~~~pC~ngg~C~~~~~~~~~C~C~~g~~G~~C~~~~~~c~~~~~~~~g~~~~~~~~~~--~~~~~~ 98 (132)
..+...|....+-|+..+|.|+|+|++.|+ +|.|.||.+|.|..|++.+. ...-|.|.+-+....+. ....+.
T Consensus 1491 ngt~eGC~ark~fCdsg~C~n~g~CvnrWg-~~~C~CP~~fggk~c~~~m~----~pq~frG~sl~sw~~~~~~vSvPwy 1565 (2531)
T KOG4289|consen 1491 NGTHEGCKARKNFCDSGQCSNGGTCVNRWG-GFSCECPLGFGGKGCCQGMA----HPQHFRGHSLVSWEGLPSQVSVPWY 1565 (2531)
T ss_pred cCcccCchhhhcccCCCccCCCCeeecccC-cEeecCccccCCcchhhccC----CchhccccceeeecCCCcceecceE
Confidence 344678988889999999999999999995 59999999999999998773 55677887777655443 234678
Q ss_pred EEEEEEeCCCCeEEEEeccCCCCCCCeEEEEEc
Q psy11297 99 IEIEFKTNKNDGILLYNQQNLDGTGDFVSLAIV 131 (132)
Q Consensus 99 i~~~f~t~~~~g~ll~~~~~~~~~~d~~~l~l~ 131 (132)
++|+|||+..+|+||-.... ...-+.|+|.
T Consensus 1566 lsl~FRTr~ad~vl~~~~~~---~rst~~lqld 1595 (2531)
T KOG4289|consen 1566 LSLMFRTRRADGVLMQAEFG---GRSTYNLQLD 1595 (2531)
T ss_pred EEEEEEeeccccEEEEEEeC---CCceEEEEEc
Confidence 99999999999999976653 1223555553
No 5
>KOG1219|consensus
Probab=99.05 E-value=2.3e-10 Score=100.40 Aligned_cols=67 Identities=28% Similarity=0.713 Sum_probs=60.6
Q ss_pred CCCCCcccCCCCCCCCCCCCCCeEeeCCCCceEEeCCCCccCCcccccCCCCCCCCCCCCCcceEEec
Q psy11297 22 QASPNLRLEESPCLSHPCQNGATCQDEEDGLFECLCSPEFTGYLCHTRAPPKLYDTPAFNGSSHIVMK 89 (132)
Q Consensus 22 ~~g~~C~~~~~~C~~~pC~ngg~C~~~~~~~~~C~C~~g~~G~~C~~~~~~c~~~~~~~~g~~~~~~~ 89 (132)
..-+.|....++|+.+||+|||+|.....++|.|.|+..|+|.+||.++..|. ++||..|++|+.+.
T Consensus 3855 ~l~pgC~l~~d~C~~npCqhgG~C~~~~~ggy~CkCpsqysG~~CEi~~epC~-snPC~~GgtCip~~ 3921 (4289)
T KOG1219|consen 3855 GLQPGCSLLTDPCNDNPCQHGGTCISQPKGGYKCKCPSQYSGNHCEIDLEPCA-SNPCLTGGTCIPFY 3921 (4289)
T ss_pred cccccccccccccccCcccCCCEecCCCCCceEEeCcccccCccccccccccc-CCCCCCCCEEEecC
Confidence 34567876669999999999999999887889999999999999999999998 99999999999875
No 6
>KOG3516|consensus
Probab=99.04 E-value=1.5e-09 Score=91.11 Aligned_cols=44 Identities=23% Similarity=0.381 Sum_probs=40.1
Q ss_pred ceEEecccCCceeEEEEEEEEeCCCCeEEEEeccCCCCCCCeEEEEEc
Q psy11297 84 SHIVMKTLKAYNKLSIEIEFKTNKNDGILLYNQQNLDGTGDFVSLAIV 131 (132)
Q Consensus 84 ~~~~~~~~~~~~~~~i~~~f~t~~~~g~ll~~~~~~~~~~d~~~l~l~ 131 (132)
+|++|+..+.....+|+|.|||..+.|++|.+.+ ..|||.|+|.
T Consensus 798 syL~fp~f~~~~saDIsf~FrTt~~~gvflen~g----~~dfir~eL~ 841 (1306)
T KOG3516|consen 798 SYLHFPPFHNELSADISFFFRTTASSGVFLENHG----INDFIRLELS 841 (1306)
T ss_pred ceeecCcccCcccccEEEEEEecCCceEeeeccC----CCceEEEEEc
Confidence 5888888888888999999999999999999998 5999999986
No 7
>KOG3516|consensus
Probab=99.02 E-value=2.4e-09 Score=90.01 Aligned_cols=96 Identities=26% Similarity=0.537 Sum_probs=77.3
Q ss_pred CCCcccCCCCCCCCCCCCCCeEeeCCCCceEEeC-CCCccCCcccccCCCCCCCCCCCCCcceEEeccc-----------
Q psy11297 24 SPNLRLEESPCLSHPCQNGATCQDEEDGLFECLC-SPEFTGYLCHTRAPPKLYDTPAFNGSSHIVMKTL----------- 91 (132)
Q Consensus 24 g~~C~~~~~~C~~~pC~ngg~C~~~~~~~~~C~C-~~g~~G~~C~~~~~~c~~~~~~~~g~~~~~~~~~----------- 91 (132)
-+.|+ ..|++.||.|||.|+..+.+ |+|.| ...|.|+.|..++ ...|..+++++|+-.
T Consensus 951 ~~GC~---GhCss~~C~NGG~Cvery~g-ytCDCs~Tay~Gp~Cs~ei------g~~fe~gs~i~y~fq~~~~~a~~~~~ 1020 (1306)
T KOG3516|consen 951 SPGCE---GHCSSYPCLNGGHCVERYDG-YTCDCSRTAYDGPFCSKEI------GVFFERGSSIRYNFQKPMRSAVFESS 1020 (1306)
T ss_pred cCCCc---cccccccccCCCEEEEecCc-eeeccccCcCCCCcccccc------ceEecCCceEEEeccchHHHhhhhhh
Confidence 45665 47999999999999999975 99999 5669999999876 456777888887521
Q ss_pred --------CCceeEEEEEEEEeCCCCeEEEEeccCCCCCCCeEEEEEcC
Q psy11297 92 --------KAYNKLSIEIEFKTNKNDGILLYNQQNLDGTGDFVSLAIVN 132 (132)
Q Consensus 92 --------~~~~~~~i~~~f~t~~~~g~ll~~~~~~~~~~d~~~l~l~~ 132 (132)
.......|.|.|+|+...++|||.++. ..||+++.|.+
T Consensus 1021 ~~~~~~~~~~~~~e~i~~sftTt~~ps~LLfvssF---~~~y~~V~v~~ 1066 (1306)
T KOG3516|consen 1021 RVKQKLEIEINPNEEINFSFTTTRAPSDLLFVSSF---TDDYLAVLVKD 1066 (1306)
T ss_pred hhhhccccccCccceEEEEEEeccCceEEEEeecc---ccceEEEEEeC
Confidence 123467899999999999999999987 57899888754
No 8
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=98.85 E-value=2.4e-09 Score=54.07 Aligned_cols=32 Identities=50% Similarity=1.078 Sum_probs=28.1
Q ss_pred CCCCCCCCCCeEeeCCCCceEEeCCCCccCCc
Q psy11297 34 CLSHPCQNGATCQDEEDGLFECLCSPEFTGYL 65 (132)
Q Consensus 34 C~~~pC~ngg~C~~~~~~~~~C~C~~g~~G~~ 65 (132)
|.++||.|+|+|++...++|+|.|++||+|++
T Consensus 1 C~~~~C~n~g~C~~~~~~~y~C~C~~G~~G~~ 32 (32)
T PF00008_consen 1 CSSNPCQNGGTCIDLPGGGYTCECPPGYTGKR 32 (32)
T ss_dssp TTTTSSTTTEEEEEESTSEEEEEEBTTEESTT
T ss_pred CCCCcCCCCeEEEeCCCCCEEeECCCCCccCC
Confidence 67889999999998874469999999999974
No 9
>KOG3514|consensus
Probab=98.74 E-value=4.1e-08 Score=82.14 Aligned_cols=89 Identities=24% Similarity=0.531 Sum_probs=75.9
Q ss_pred CCCCCCCCeEeeCCCCceEEeCCCCccCCcccccCCCCCCCCCCCCCcceEEecccC---CceeEEEEEEEEeCCCCeEE
Q psy11297 36 SHPCQNGATCQDEEDGLFECLCSPEFTGYLCHTRAPPKLYDTPAFNGSSHIVMKTLK---AYNKLSIEIEFKTNKNDGIL 112 (132)
Q Consensus 36 ~~pC~ngg~C~~~~~~~~~C~C~~g~~G~~C~~~~~~c~~~~~~~~g~~~~~~~~~~---~~~~~~i~~~f~t~~~~g~l 112 (132)
..+|+|++.|.....+ -.|.|...+.|.+|+...++ ....|.|..|+-|..-. .....+|+|.|||...+|+|
T Consensus 215 ~~~c~~~g~c~s~d~g-p~c~c~~~~dgq~cekeK~~---~eaTF~G~ef~~YDls~npI~s~~d~itl~FrT~q~ngll 290 (1591)
T KOG3514|consen 215 QFVCLNDGECYSSDDG-PHCDCQFDHDGQNCEKEKND---GEATFGGDEFVGYDLSQNPIRSKKDNITLTFRTVQGNGLL 290 (1591)
T ss_pred cceeccCCeEecCCCC-CccccccccCccccccccCc---ceEEecCceEEEeeccCCcccccccceEEEEEEecCceeE
Confidence 3679999999998754 99999999999999998865 67889999999886322 23466899999999999999
Q ss_pred EEeccCCCCCCCeEEEEEcC
Q psy11297 113 LYNQQNLDGTGDFVSLAIVN 132 (132)
Q Consensus 113 l~~~~~~~~~~d~~~l~l~~ 132 (132)
||.|. ..||+.|+|+|
T Consensus 291 fytG~----~~dYlnlaL~d 306 (1591)
T KOG3514|consen 291 FYTGD----EKDYLNLALQD 306 (1591)
T ss_pred EEccC----CcceeeEeecC
Confidence 99999 48999999986
No 10
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=98.50 E-value=2.3e-07 Score=48.24 Aligned_cols=36 Identities=42% Similarity=1.005 Sum_probs=31.1
Q ss_pred CCCCCC-CCCCCCCeEeeCCCCceEEeCCCCcc-CCccc
Q psy11297 31 ESPCLS-HPCQNGATCQDEEDGLFECLCSPEFT-GYLCH 67 (132)
Q Consensus 31 ~~~C~~-~pC~ngg~C~~~~~~~~~C~C~~g~~-G~~C~ 67 (132)
+++|.. +||.++++|++... +|.|.|+.||. |..|+
T Consensus 2 ~~~C~~~~~C~~~~~C~~~~g-~~~C~C~~g~~~g~~C~ 39 (39)
T smart00179 2 IDECASGNPCQNGGTCVNTVG-SYRCECPPGYTDGRNCE 39 (39)
T ss_pred cccCcCCCCcCCCCEeECCCC-CeEeECCCCCccCCcCC
Confidence 577887 89999999998874 59999999999 88874
No 11
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=98.29 E-value=1.6e-06 Score=44.47 Aligned_cols=36 Identities=42% Similarity=1.011 Sum_probs=30.7
Q ss_pred CCCCCC-CCCCCCCeEeeCCCCceEEeCCCCccCCccc
Q psy11297 31 ESPCLS-HPCQNGATCQDEEDGLFECLCSPEFTGYLCH 67 (132)
Q Consensus 31 ~~~C~~-~pC~ngg~C~~~~~~~~~C~C~~g~~G~~C~ 67 (132)
+++|.. .+|.+++.|.+... .|.|.|+.+|.|..|+
T Consensus 2 ~~~C~~~~~C~~~~~C~~~~~-~~~C~C~~g~~g~~C~ 38 (38)
T cd00054 2 IDECASGNPCQNGGTCVNTVG-SYRCSCPPGYTGRNCE 38 (38)
T ss_pred cccCCCCCCcCCCCEeECCCC-CeEeECCCCCcCCcCC
Confidence 467877 79999999998874 5999999999998874
No 12
>cd00110 LamG Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.
Probab=98.08 E-value=2.1e-05 Score=52.31 Aligned_cols=52 Identities=38% Similarity=0.676 Sum_probs=43.8
Q ss_pred CCCCcceEEecccCC-ceeEEEEEEEEeCCCCeEEEEeccCCCCCCCeEEEEEcC
Q psy11297 79 AFNGSSHIVMKTLKA-YNKLSIEIEFKTNKNDGILLYNQQNLDGTGDFVSLAIVN 132 (132)
Q Consensus 79 ~~~g~~~~~~~~~~~-~~~~~i~~~f~t~~~~g~ll~~~~~~~~~~d~~~l~l~~ 132 (132)
.|.+.+|+.++.... ...++|+++|||..++|+|||.+... ..+|++|+|.+
T Consensus 3 ~F~g~~~i~~~~~~~~~~~~~i~~~frt~~~~g~l~~~~~~~--~~~~~~l~l~~ 55 (151)
T cd00110 3 SFSGSSYVRLPTLPAPRTRLSISFSFRTTSPNGLLLYAGSQN--GGDFLALELED 55 (151)
T ss_pred EeCCCceEEecCCCCCcceeEEEEEEEeCCCCeEEEEecCCC--CCCEEEEEEEC
Confidence 477789999986654 67889999999999999999999853 48999999874
No 13
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=97.99 E-value=1.5e-05 Score=40.06 Aligned_cols=32 Identities=44% Similarity=1.189 Sum_probs=27.2
Q ss_pred CC-CCCCCCCCeEeeCCCCceEEeCCCCccCC-cc
Q psy11297 34 CL-SHPCQNGATCQDEEDGLFECLCSPEFTGY-LC 66 (132)
Q Consensus 34 C~-~~pC~ngg~C~~~~~~~~~C~C~~g~~G~-~C 66 (132)
|. ..+|.+++.|++... .|.|.|+.||.|. .|
T Consensus 2 C~~~~~C~~~~~C~~~~~-~~~C~C~~g~~g~~~C 35 (36)
T cd00053 2 CAASNPCSNGGTCVNTPG-SYRCVCPPGYTGDRSC 35 (36)
T ss_pred CCCCCCCCCCCEEecCCC-CeEeECCCCCcccCCc
Confidence 44 678999999999884 5999999999998 65
No 14
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes []. +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=97.97 E-value=1e-05 Score=43.15 Aligned_cols=32 Identities=31% Similarity=0.935 Sum_probs=27.2
Q ss_pred CCCCCCC--CCCCCCCeEeeCCCCceEEeCCCCcc
Q psy11297 30 EESPCLS--HPCQNGATCQDEEDGLFECLCSPEFT 62 (132)
Q Consensus 30 ~~~~C~~--~pC~ngg~C~~~~~~~~~C~C~~g~~ 62 (132)
++++|+. ++|..++.|++.. |+|+|.|++||.
T Consensus 1 DidEC~~~~~~C~~~~~C~N~~-Gsy~C~C~~Gy~ 34 (42)
T PF07645_consen 1 DIDECAEGPHNCPENGTCVNTE-GSYSCSCPPGYE 34 (42)
T ss_dssp ESSTTTTTSSSSSTTSEEEEET-TEEEEEESTTEE
T ss_pred CccccCCCCCcCCCCCEEEcCC-CCEEeeCCCCcE
Confidence 4678876 4698899999998 679999999997
No 15
>smart00181 EGF Epidermal growth factor-like domain.
Probab=97.93 E-value=2.1e-05 Score=39.80 Aligned_cols=32 Identities=47% Similarity=1.185 Sum_probs=26.6
Q ss_pred CCC-CCCCCCCeEeeCCCCceEEeCCCCccC-Cccc
Q psy11297 34 CLS-HPCQNGATCQDEEDGLFECLCSPEFTG-YLCH 67 (132)
Q Consensus 34 C~~-~pC~ngg~C~~~~~~~~~C~C~~g~~G-~~C~ 67 (132)
|.. .+|.++ .|++.. ++|+|.|+.||.| ..|+
T Consensus 2 C~~~~~C~~~-~C~~~~-~~~~C~C~~g~~g~~~C~ 35 (35)
T smart00181 2 CASGGPCSNG-TCINTP-GSYTCSCPPGYTGDKRCE 35 (35)
T ss_pred CCCcCCCCCC-EEECCC-CCeEeECCCCCccCCccC
Confidence 556 689998 999886 4699999999999 7664
No 16
>PF12661 hEGF: Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=97.10 E-value=0.00023 Score=28.41 Aligned_cols=13 Identities=46% Similarity=1.193 Sum_probs=10.8
Q ss_pred EEeCCCCccCCcc
Q psy11297 54 ECLCSPEFTGYLC 66 (132)
Q Consensus 54 ~C~C~~g~~G~~C 66 (132)
.|.|++||+|.+|
T Consensus 1 ~C~C~~G~~G~~C 13 (13)
T PF12661_consen 1 TCQCPPGWTGPNC 13 (13)
T ss_dssp EEEE-TTEETTTT
T ss_pred CccCcCCCcCCCC
Confidence 5899999999987
No 17
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=97.07 E-value=0.001 Score=33.25 Aligned_cols=27 Identities=30% Similarity=0.716 Sum_probs=23.3
Q ss_pred CCCCCCCeEeeCCCCceEEeCCCCccCCcc
Q psy11297 37 HPCQNGATCQDEEDGLFECLCSPEFTGYLC 66 (132)
Q Consensus 37 ~pC~ngg~C~~~~~~~~~C~C~~g~~G~~C 66 (132)
..|.++|+|+... .+|.|..||+|+.|
T Consensus 6 ~~C~~~G~C~~~~---g~C~C~~g~~G~~C 32 (32)
T PF07974_consen 6 NICSGHGTCVSPC---GRCVCDSGYTGPDC 32 (32)
T ss_pred CccCCCCEEeCCC---CEEECCCCCcCCCC
Confidence 4689999999772 69999999999887
No 18
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=95.77 E-value=0.0075 Score=30.93 Aligned_cols=26 Identities=31% Similarity=0.880 Sum_probs=18.9
Q ss_pred CCCCCCeEeeCCCCceEEeCCCCccCC
Q psy11297 38 PCQNGATCQDEEDGLFECLCSPEFTGY 64 (132)
Q Consensus 38 pC~ngg~C~~~~~~~~~C~C~~g~~G~ 64 (132)
.|-..++|++... +|+|.|++||.|.
T Consensus 7 ~C~~nA~C~~~~~-~~~C~C~~Gy~Gd 32 (36)
T PF12947_consen 7 GCHPNATCTNTGG-SYTCTCKPGYEGD 32 (36)
T ss_dssp GS-TTCEEEE-TT-SEEEEE-CEEECC
T ss_pred CCCCCcEeecCCC-CEEeECCCCCccC
Confidence 4666789999985 6999999999873
No 19
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=95.13 E-value=0.019 Score=37.68 Aligned_cols=39 Identities=26% Similarity=0.510 Sum_probs=28.0
Q ss_pred CCCCCC---CCCCCCCeEeeCC-CCceEEeCCCCccCCcccccC
Q psy11297 31 ESPCLS---HPCQNGATCQDEE-DGLFECLCSPEFTGYLCHTRA 70 (132)
Q Consensus 31 ~~~C~~---~pC~ngg~C~~~~-~~~~~C~C~~g~~G~~C~~~~ 70 (132)
+.+|.+ +=|.|| +|.-.. ...+.|.|..||+|.+||...
T Consensus 42 i~~Cp~ey~~YClHG-~C~yI~dl~~~~CrC~~GYtGeRCEh~d 84 (139)
T PHA03099 42 IRLCGPEGDGYCLHG-DCIHARDIDGMYCRCSHGYTGIRCQHVV 84 (139)
T ss_pred cccCChhhCCEeECC-EEEeeccCCCceeECCCCccccccccee
Confidence 445653 458887 897331 125999999999999998743
No 20
>KOG1214|consensus
Probab=95.06 E-value=0.03 Score=47.29 Aligned_cols=42 Identities=26% Similarity=0.656 Sum_probs=34.7
Q ss_pred CCCCCCC--CcccCCCCCCCCCCCCCCeEeeCCCCceEEeCCCCccC
Q psy11297 19 QSSQASP--NLRLEESPCLSHPCQNGATCQDEEDGLFECLCSPEFTG 63 (132)
Q Consensus 19 p~~~~g~--~C~~~~~~C~~~pC~ngg~C~~~~~~~~~C~C~~g~~G 63 (132)
|++. |. .| .+.|.|+++.|-..++|.+.. ++|+|.|.+||.|
T Consensus 815 PGfs-GDG~~c-~dvDeC~psrChp~A~Cyntp-gsfsC~C~pGy~G 858 (1289)
T KOG1214|consen 815 PGFS-GDGHQC-TDVDECSPSRCHPAATCYNTP-GSFSCRCQPGYYG 858 (1289)
T ss_pred CCcc-CCcccc-ccccccCccccCCCceEecCC-CcceeecccCccC
Confidence 6655 44 44 467999999999999999998 5699999999976
No 21
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=95.00 E-value=0.027 Score=26.19 Aligned_cols=16 Identities=31% Similarity=1.055 Sum_probs=12.1
Q ss_pred ceEEeCCCCcc----CCccc
Q psy11297 52 LFECLCSPEFT----GYLCH 67 (132)
Q Consensus 52 ~~~C~C~~g~~----G~~C~ 67 (132)
+|+|.|++||. |..|+
T Consensus 1 sy~C~C~~Gy~l~~d~~~C~ 20 (24)
T PF12662_consen 1 SYTCSCPPGYQLSPDGRSCE 20 (24)
T ss_pred CEEeeCCCCCcCCCCCCccc
Confidence 49999999985 45553
No 22
>PHA02887 EGF-like protein; Provisional
Probab=94.82 E-value=0.03 Score=36.16 Aligned_cols=36 Identities=25% Similarity=0.575 Sum_probs=26.1
Q ss_pred CCCCC---CCCCCCCeEeeCC-CCceEEeCCCCccCCcccc
Q psy11297 32 SPCLS---HPCQNGATCQDEE-DGLFECLCSPEFTGYLCHT 68 (132)
Q Consensus 32 ~~C~~---~pC~ngg~C~~~~-~~~~~C~C~~g~~G~~C~~ 68 (132)
.+|.+ +=|.| |+|.-.. .....|.|+.||+|.+|+.
T Consensus 84 ~pC~~eyk~YCiH-G~C~yI~dL~epsCrC~~GYtG~RCE~ 123 (126)
T PHA02887 84 EKCKNDFNDFCIN-GECMNIIDLDEKFCICNKGYTGIRCDE 123 (126)
T ss_pred cccChHhhCEeeC-CEEEccccCCCceeECCCCcccCCCCc
Confidence 45653 45885 5998321 1248999999999999986
No 23
>KOG1225|consensus
Probab=94.33 E-value=0.07 Score=43.08 Aligned_cols=44 Identities=30% Similarity=0.773 Sum_probs=29.8
Q ss_pred cCCCCCCCCcccCCCCCCCCCCCCCCeEeeCCCCceEEeCCCCccCCccccc
Q psy11297 18 VQSSQASPNLRLEESPCLSHPCQNGATCQDEEDGLFECLCSPEFTGYLCHTR 69 (132)
Q Consensus 18 cp~~~~g~~C~~~~~~C~~~pC~ngg~C~~~~~~~~~C~C~~g~~G~~C~~~ 69 (132)
|+..|.|..|+... |. ..|.++|.|+ . + +|.|.+||+|..|++.
T Consensus 300 C~~g~~G~dCs~~~--cp-adC~g~G~Ci--~-G--~C~C~~Gy~G~~C~~~ 343 (525)
T KOG1225|consen 300 CNPGYSGKDCSIRR--CP-ADCSGHGKCI--D-G--ECLCDEGYTGELCIQR 343 (525)
T ss_pred cCCCcccccccccc--CC-ccCCCCCccc--C-C--ceEeCCCCcCCccccc
Confidence 44555577775432 43 5677778887 2 3 7888888888888876
No 24
>KOG1217|consensus
Probab=94.04 E-value=0.088 Score=40.81 Aligned_cols=53 Identities=30% Similarity=0.698 Sum_probs=43.4
Q ss_pred eecCCCCCCCCcccCCCCCC--CCCCCCCCeEeeCCCCceEEeCCCCccCCccccc
Q psy11297 16 SVVQSSQASPNLRLEESPCL--SHPCQNGATCQDEEDGLFECLCSPEFTGYLCHTR 69 (132)
Q Consensus 16 ~~cp~~~~g~~C~~~~~~C~--~~pC~ngg~C~~~~~~~~~C~C~~g~~G~~C~~~ 69 (132)
..|...+.+..|....+.|. ..+|.+++.|.+... .|.|.|+.+|.|..|+..
T Consensus 154 c~C~~g~~~~~~~~~~~~C~~~~~~c~~~~~C~~~~~-~~~C~c~~~~~~~~~~~~ 208 (487)
T KOG1217|consen 154 CSCTEGYEGEPCETDLDECIQYSSPCQNGGTCVNTGG-SYLCSCPPGYTGSTCETT 208 (487)
T ss_pred eeeCCCcccccccccccccccCCCCcCCCcccccCCC-CeeEeCCCCccCCcCcCC
Confidence 45666666888876667898 456999999999885 499999999999998865
No 25
>PF12946 EGF_MSP1_1: MSP1 EGF domain 1; InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=93.96 E-value=0.057 Score=27.78 Aligned_cols=29 Identities=31% Similarity=0.746 Sum_probs=21.0
Q ss_pred CCCCCCCCCCeEeeCCCCceEEeCCCCcc
Q psy11297 34 CLSHPCQNGATCQDEEDGLFECLCSPEFT 62 (132)
Q Consensus 34 C~~~pC~ngg~C~~~~~~~~~C~C~~g~~ 62 (132)
|...+|..++.|....+|++.|.|.+||.
T Consensus 2 C~~~~cP~NA~C~~~~dG~eecrCllgyk 30 (37)
T PF12946_consen 2 CIDTKCPANAGCFRYDDGSEECRCLLGYK 30 (37)
T ss_dssp -SSS---TTEEEEEETTSEEEEEE-TTEE
T ss_pred ccCccCCCCcccEEcCCCCEEEEeeCCcc
Confidence 66678889999998877889999999985
No 26
>KOG1217|consensus
Probab=93.96 E-value=0.17 Score=39.29 Aligned_cols=47 Identities=30% Similarity=0.721 Sum_probs=36.6
Q ss_pred CCCCCCCC--cccCCCCCCCCC-CCCCCeEeeCCCCceEEeCCCCccCCcc
Q psy11297 19 QSSQASPN--LRLEESPCLSHP-CQNGATCQDEEDGLFECLCSPEFTGYLC 66 (132)
Q Consensus 19 p~~~~g~~--C~~~~~~C~~~p-C~ngg~C~~~~~~~~~C~C~~g~~G~~C 66 (132)
+..+.+.. ...+++.|...+ |.++++|....+ .|.|.|+++|.|..|
T Consensus 257 ~~g~~~~~~~~~~~~~~C~~~~~c~~~~~C~~~~~-~~~C~C~~g~~g~~~ 306 (487)
T KOG1217|consen 257 PEGYTGDACVTCVDVDSCALIASCPNGGTCVNVPG-SYRCTCPPGFTGRLC 306 (487)
T ss_pred CCCccccccceeeeccccCCCCccCCCCeeecCCC-cceeeCCCCCCCCCC
Confidence 33343554 234678899874 999999999875 499999999999998
No 27
>KOG4260|consensus
Probab=93.08 E-value=0.11 Score=38.53 Aligned_cols=47 Identities=26% Similarity=0.703 Sum_probs=35.4
Q ss_pred cCCCCCCCCcccCCCCC---CCCCCCCCCeEeeC--CCCceEEeCCCCccCCcccc
Q psy11297 18 VQSSQASPNLRLEESPC---LSHPCQNGATCQDE--EDGLFECLCSPEFTGYLCHT 68 (132)
Q Consensus 18 cp~~~~g~~C~~~~~~C---~~~pC~ngg~C~~~--~~~~~~C~C~~g~~G~~C~~ 68 (132)
||...-|++|. .| +..||..+|.|... +.|+-.|.|.+||+|+.|..
T Consensus 132 Cp~gtyGpdCl----~Cpggser~C~GnG~C~GdGsR~GsGkCkC~~GY~Gp~C~~ 183 (350)
T KOG4260|consen 132 CPDGTYGPDCL----QCPGGSERPCFGNGSCHGDGSREGSGKCKCETGYTGPLCRY 183 (350)
T ss_pred cCCCCcCCccc----cCCCCCcCCcCCCCcccCCCCCCCCCcccccCCCCCccccc
Confidence 45444499997 45 46789999999843 33567999999999999865
No 28
>smart00051 DSL delta serrate ligand.
Probab=92.60 E-value=0.19 Score=29.04 Aligned_cols=42 Identities=17% Similarity=0.311 Sum_probs=27.5
Q ss_pred cCCCCCCCCcccCCCCCCC-CCCCCCCeEeeCCCCceEEeCCCCccCCcc
Q psy11297 18 VQSSQASPNLRLEESPCLS-HPCQNGATCQDEEDGLFECLCSPEFTGYLC 66 (132)
Q Consensus 18 cp~~~~g~~C~~~~~~C~~-~pC~ngg~C~~~~~~~~~C~C~~g~~G~~C 66 (132)
|+..+-|..|.. .|.+ +-+.++.+|... | .|.|++||+|..|
T Consensus 21 C~~~~yG~~C~~---~C~~~~d~~~~~~Cd~~--G--~~~C~~Gw~G~~C 63 (63)
T smart00051 21 CDENYYGEGCNK---FCRPRDDFFGHYTCDEN--G--NKGCLEGWMGPYC 63 (63)
T ss_pred CCCCCcCCccCC---EeCcCccccCCccCCcC--C--CEecCCCCcCCCC
Confidence 343444999963 5643 224567788532 3 6789999999887
No 29
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=91.82 E-value=0.24 Score=35.37 Aligned_cols=36 Identities=22% Similarity=0.546 Sum_probs=26.2
Q ss_pred CCCcccCCCCCCC--CCCCCCCeEeeCCCCceEEeCCCCccC
Q psy11297 24 SPNLRLEESPCLS--HPCQNGATCQDEEDGLFECLCSPEFTG 63 (132)
Q Consensus 24 g~~C~~~~~~C~~--~pC~ngg~C~~~~~~~~~C~C~~g~~G 63 (132)
+..|+ +.++|.. ++|. ..|.+.. |+|.|.|+.||+.
T Consensus 181 ~~~C~-~~~~C~~~~~~c~--~~C~~~~-g~~~c~c~~g~~~ 218 (224)
T cd01475 181 GKICV-VPDLCATLSHVCQ--QVCISTP-GSYLCACTEGYAL 218 (224)
T ss_pred cccCc-CchhhcCCCCCcc--ceEEcCC-CCEEeECCCCccC
Confidence 55665 5667864 4565 5799887 5699999999974
No 30
>KOG1225|consensus
Probab=90.99 E-value=0.59 Score=37.94 Aligned_cols=45 Identities=29% Similarity=0.742 Sum_probs=33.0
Q ss_pred cCCCCCCCCcccCCCCCCCCCCCCCCeEeeCCCCceEEeCCCCccCCcccccC
Q psy11297 18 VQSSQASPNLRLEESPCLSHPCQNGATCQDEEDGLFECLCSPEFTGYLCHTRA 70 (132)
Q Consensus 18 cp~~~~g~~C~~~~~~C~~~pC~ngg~C~~~~~~~~~C~C~~g~~G~~C~~~~ 70 (132)
||..|.|.+|.+ -.|... |.+++.|++ + .|.|+++|.|..|+...
T Consensus 269 C~~Gf~G~dC~e--~~Cp~~-cs~~g~~~~---g--~CiC~~g~~G~dCs~~~ 313 (525)
T KOG1225|consen 269 CPPGFTGDDCDE--LVCPVD-CSGGGVCVD---G--ECICNPGYSGKDCSIRR 313 (525)
T ss_pred CCCCCcCCCCCc--ccCCcc-cCCCceecC---C--EeecCCCcccccccccc
Confidence 666777888875 357665 777676663 2 89999999999997543
No 31
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=90.67 E-value=0.22 Score=25.41 Aligned_cols=18 Identities=28% Similarity=1.005 Sum_probs=14.4
Q ss_pred eEeeCCCCceEEeCCCCcc
Q psy11297 44 TCQDEEDGLFECLCSPEFT 62 (132)
Q Consensus 44 ~C~~~~~~~~~C~C~~g~~ 62 (132)
.|++.. ++|+|.|+.||.
T Consensus 11 ~C~~~~-g~~~C~C~~Gy~ 28 (36)
T PF14670_consen 11 ICVNTP-GSYRCSCPPGYK 28 (36)
T ss_dssp EEEEET-TSEEEE-STTEE
T ss_pred CCccCC-CceEeECCCCCE
Confidence 788887 469999999985
No 32
>KOG1214|consensus
Probab=87.47 E-value=1.9 Score=37.10 Aligned_cols=31 Identities=26% Similarity=0.639 Sum_probs=22.3
Q ss_pred CeEeeCCCCceEEeCCCCccC--CcccccCCCCC
Q psy11297 43 ATCQDEEDGLFECLCSPEFTG--YLCHTRAPPKL 74 (132)
Q Consensus 43 g~C~~~~~~~~~C~C~~g~~G--~~C~~~~~~c~ 74 (132)
..|+....++|+|.|.+||.| ..|. +.++|.
T Consensus 799 a~c~~hGgs~y~C~CLPGfsGDG~~c~-dvDeC~ 831 (1289)
T KOG1214|consen 799 ARCVHHGGSTYSCACLPGFSGDGHQCT-DVDECS 831 (1289)
T ss_pred eEEEecCCceEEEeecCCccCCccccc-cccccC
Confidence 366666656899999999986 4554 347775
No 33
>PF00053 Laminin_EGF: Laminin EGF-like (Domains III and V); InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and three short globular arms. The long arm consist of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below. +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=86.51 E-value=0.88 Score=24.47 Aligned_cols=23 Identities=35% Similarity=0.791 Sum_probs=17.9
Q ss_pred CeEeeCCCCceEEeCCCCccCCcccc
Q psy11297 43 ATCQDEEDGLFECLCSPEFTGYLCHT 68 (132)
Q Consensus 43 g~C~~~~~~~~~C~C~~g~~G~~C~~ 68 (132)
..|... +..|.|.++++|++|++
T Consensus 11 ~~C~~~---~G~C~C~~~~~G~~C~~ 33 (49)
T PF00053_consen 11 QTCDPS---TGQCVCKPGTTGPRCDQ 33 (49)
T ss_dssp SSEEET---CEEESBSTTEESTTS-E
T ss_pred CcccCC---CCEEeccccccCCcCcC
Confidence 467664 26999999999999986
No 34
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=84.85 E-value=1.1 Score=24.31 Aligned_cols=16 Identities=38% Similarity=0.831 Sum_probs=14.3
Q ss_pred eEEeCCCCccCCcccc
Q psy11297 53 FECLCSPEFTGYLCHT 68 (132)
Q Consensus 53 ~~C~C~~g~~G~~C~~ 68 (132)
-.|.|.+++.|..|++
T Consensus 19 G~C~C~~~~~G~~C~~ 34 (50)
T cd00055 19 GQCECKPNTTGRRCDR 34 (50)
T ss_pred CEEeCCCcCCCCCCCC
Confidence 5899999999999974
No 35
>PF12955 DUF3844: Domain of unknown function (DUF3844); InterPro: IPR024382 This presumed domain is found in fungal species. It contains 8 largely conserved cysteine residues. This domain is found in proteins thought to be located in the endoplasmic reticulum.
Probab=82.98 E-value=0.71 Score=29.33 Aligned_cols=32 Identities=25% Similarity=0.774 Sum_probs=22.2
Q ss_pred CCCCCCCeEeeCCC----CceEEeCCC-------------CccCCcccc
Q psy11297 37 HPCQNGATCQDEED----GLFECLCSP-------------EFTGYLCHT 68 (132)
Q Consensus 37 ~pC~ngg~C~~~~~----~~~~C~C~~-------------g~~G~~C~~ 68 (132)
+-|.++|.|+.... .=|.|.|.+ .|.|..|+.
T Consensus 13 n~CsgHG~C~~~~~~~~~~C~~C~C~~T~~~~~~~~~ktt~W~G~aCqK 61 (103)
T PF12955_consen 13 NNCSGHGSCVKKYGSGGGDCFACKCKPTVVKTGSGKGKTTHWGGPACQK 61 (103)
T ss_pred cCCCCCceEeeccCCCccceEEEEeeccccccccccCceeeeccccccc
Confidence 45677899997621 238999976 466777764
No 36
>KOG1226|consensus
Probab=81.90 E-value=2 Score=36.30 Aligned_cols=40 Identities=25% Similarity=0.640 Sum_probs=22.2
Q ss_pred CCCcccCCCCCCCC---CCCCCCeEeeCCCCceEEeCCCCccCCcccc
Q psy11297 24 SPNLRLEESPCLSH---PCQNGATCQDEEDGLFECLCSPEFTGYLCHT 68 (132)
Q Consensus 24 g~~C~~~~~~C~~~---pC~ngg~C~~~~~~~~~C~C~~g~~G~~C~~ 68 (132)
|+.|+-+.-.|..+ .|.++|+|.= -.|.|.+||+|..|+-
T Consensus 539 G~fCECDnfsC~r~~g~lC~g~G~C~C-----G~CvC~~GwtG~~C~C 581 (783)
T KOG1226|consen 539 GKFCECDNFSCERHKGVLCGGHGRCEC-----GRCVCNPGWTGSACNC 581 (783)
T ss_pred eeeeeccCcccccccCcccCCCCeEeC-----CcEEcCCCCccCCCCC
Confidence 66666555555542 4555555541 2566666666665543
No 37
>KOG1836|consensus
Probab=81.47 E-value=3 Score=38.58 Aligned_cols=60 Identities=27% Similarity=0.452 Sum_probs=40.2
Q ss_pred CccCCeecCCCCCCCCcccCCCCCCCCCCCCCCeEeeCC-CCceEEe-CCCCccCCcccccCC
Q psy11297 11 GCLSASVVQSSQASPNLRLEESPCLSHPCQNGATCQDEE-DGLFECL-CSPEFTGYLCHTRAP 71 (132)
Q Consensus 11 ~~~~~~~cp~~~~g~~C~~~~~~C~~~pC~ngg~C~~~~-~~~~~C~-C~~g~~G~~C~~~~~ 71 (132)
|-.-++..+++| |..=......|.+=||.+++.|.... .....|. |+++|+|.+|+...+
T Consensus 755 G~~C~~C~~GfY-g~~~~~~~~dC~~C~Cp~~~~~~~~~~~~~~iCk~Cp~gytG~rCe~c~d 816 (1705)
T KOG1836|consen 755 GGQCAQCVDGFY-GLPDLGTSGDCQPCPCPNGGACGQTPEILEVVCKNCPPGYTGLRCEECAD 816 (1705)
T ss_pred CCchhhhcCCCC-CccccCCCCCCccCCCCCChhhcCcCcccceecCCCCCCCcccccccCCC
Confidence 334444555666 33322222338888999999998654 2357899 999999999998554
No 38
>PF01414 DSL: Delta serrate ligand; InterPro: IPR001774 Ligands of the Delta/Serrate/lag-2 (DSL) family and their receptors, members of the lin-12/Notch family, mediate cell-cell interactions that specify cell fate in invertebrates and vertebrates. In Caenorhabditis elegans, two DSL genes, lag-2 and apx-1, influence different cell fate decisions during development []. Molecular interaction between Notch and Serrate, another EGF-homologous transmembrane protein containing a region of striking similarity to Delta, has been shown and the same two EGF repeats of Notch may also constitute a Serrate binding domain [, ].; GO: 0007154 cell communication, 0016020 membrane; PDB: 2VJ2_A.
Probab=79.34 E-value=0.98 Score=26.02 Aligned_cols=11 Identities=45% Similarity=1.093 Sum_probs=8.0
Q ss_pred eCCCCccCCcc
Q psy11297 56 LCSPEFTGYLC 66 (132)
Q Consensus 56 ~C~~g~~G~~C 66 (132)
.|.+||+|+.|
T Consensus 53 ~C~~Gw~G~~C 63 (63)
T PF01414_consen 53 VCLPGWTGPNC 63 (63)
T ss_dssp EE-TTEESTTS
T ss_pred CCCCCCcCCCC
Confidence 47789999877
No 39
>KOG3509|consensus
Probab=78.01 E-value=6.5 Score=34.36 Aligned_cols=56 Identities=21% Similarity=0.426 Sum_probs=41.4
Q ss_pred CeecCCCCCCCCcccCCCCCCCCCCCCCCeEeeCCCCceEEeCCCCccCCcccccCCC
Q psy11297 15 ASVVQSSQASPNLRLEESPCLSHPCQNGATCQDEEDGLFECLCSPEFTGYLCHTRAPP 72 (132)
Q Consensus 15 ~~~cp~~~~g~~C~~~~~~C~~~pC~ngg~C~~~~~~~~~C~C~~g~~G~~C~~~~~~ 72 (132)
-..||.+..+..++ .-.|...+|+-+++|......+|.+.|+..-.+..|+.....
T Consensus 158 ~~~~~~~~~~~~~~--~~~~~~~~~q~g~tC~~~~~~~~~~~~~~~~~~~~c~~~~~r 213 (964)
T KOG3509|consen 158 KRCCPTSEGGPGTE--PIHCAQPVCQGGATCEVRNGKGYSLECPDCKVRVVCEACKPR 213 (964)
T ss_pred hhcCcccCCCCccc--cccccCcccccceeEEecCCcceeeeccccccceehhhccCc
Confidence 34567777666665 357889999999999977544688888888777778775543
No 40
>PF00954 S_locus_glycop: S-locus glycoprotein family; InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=75.72 E-value=3.4 Score=26.07 Aligned_cols=32 Identities=22% Similarity=0.552 Sum_probs=24.5
Q ss_pred CCCCCCC-CCCCCCCeEeeCCCCceEEeCCCCccC
Q psy11297 30 EESPCLS-HPCQNGATCQDEEDGLFECLCSPEFTG 63 (132)
Q Consensus 30 ~~~~C~~-~pC~ngg~C~~~~~~~~~C~C~~g~~G 63 (132)
..+.|.. ..|...|.|.... ...|.|++||.-
T Consensus 76 p~d~Cd~y~~CG~~g~C~~~~--~~~C~Cl~GF~P 108 (110)
T PF00954_consen 76 PKDQCDVYGFCGPNGICNSNN--SPKCSCLPGFEP 108 (110)
T ss_pred cccCCCCccccCCccEeCCCC--CCceECCCCcCC
Confidence 3467874 8899999997553 367999999963
No 41
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domai.
Probab=74.03 E-value=2.7 Score=22.35 Aligned_cols=16 Identities=38% Similarity=0.879 Sum_probs=14.1
Q ss_pred eEEeCCCCccCCcccc
Q psy11297 53 FECLCSPEFTGYLCHT 68 (132)
Q Consensus 53 ~~C~C~~g~~G~~C~~ 68 (132)
-.|.|+++++|++|++
T Consensus 18 G~C~C~~~~~G~~C~~ 33 (46)
T smart00180 18 GQCECKPNVTGRRCDR 33 (46)
T ss_pred CEEECCCCCCCCCCCc
Confidence 4899999999999974
No 42
>KOG3509|consensus
Probab=71.12 E-value=6.7 Score=34.30 Aligned_cols=42 Identities=31% Similarity=0.737 Sum_probs=35.9
Q ss_pred CCCCCCCCCCCCCeEeeCCCCceEEeCCCCccCCcccccCCCC
Q psy11297 31 ESPCLSHPCQNGATCQDEEDGLFECLCSPEFTGYLCHTRAPPK 73 (132)
Q Consensus 31 ~~~C~~~pC~ngg~C~~~~~~~~~C~C~~g~~G~~C~~~~~~c 73 (132)
.+.|...||.+.+.|.+...+ ..|.|+++|.|..|+...+.|
T Consensus 406 g~~c~~~p~~~~g~c~p~~~~-~~c~c~~g~~G~~c~d~~~~~ 447 (964)
T KOG3509|consen 406 GDVCWRIPCQHDGPCLQTLEG-KQCLCPPGYTGDSCEDCMNGC 447 (964)
T ss_pred CCccccccCCCCccccccccc-cceeccccccCchhhccCccc
Confidence 467889999999999988754 999999999999998876654
No 43
>KOG4260|consensus
Probab=71.09 E-value=2.4 Score=31.71 Aligned_cols=33 Identities=24% Similarity=0.647 Sum_probs=27.6
Q ss_pred CCCCCC--CCCCCCCCeEeeCCCCceEEeCCCCccC
Q psy11297 30 EESPCL--SHPCQNGATCQDEEDGLFECLCSPEFTG 63 (132)
Q Consensus 30 ~~~~C~--~~pC~ngg~C~~~~~~~~~C~C~~g~~G 63 (132)
++++|. +.||.....|++.. |+|+|.+..||.+
T Consensus 235 DvnEC~~ep~~c~~~qfCvNte-GSf~C~dk~Gy~~ 269 (350)
T KOG4260|consen 235 DVNECQNEPAPCKAHQFCVNTE-GSFKCEDKEGYKK 269 (350)
T ss_pred cHHHHhcCCCCCChhheeecCC-CceEecccccccC
Confidence 677775 57899999999987 6799999888865
No 44
>PF09064 Tme5_EGF_like: Thrombomodulin like fifth domain, EGF-like; InterPro: IPR015149 This domain adopts a fold similar to other EGF domains, with a flat major and a twisted minor beta sheet. Disulphide pairing, however, is not of the usual 1-3, 2-4, 5-6 type; rather 1-2, 3-4, 5-6 pairing is found. Its extended major sheet (strands beta-2 and beta-3 and the connecting loop) projects into thrombin's active site groove. This domain is required for interaction of thrombomodulin with thrombin, and subsequent activation of protein-C []. ; GO: 0004888 transmembrane signaling receptor activity, 0016021 integral to membrane
Probab=66.53 E-value=5.1 Score=20.13 Aligned_cols=25 Identities=24% Similarity=0.763 Sum_probs=16.3
Q ss_pred CCCCCCCCCCeEeeCCCCceEEeCCCCcc
Q psy11297 34 CLSHPCQNGATCQDEEDGLFECLCSPEFT 62 (132)
Q Consensus 34 C~~~pC~ngg~C~~~~~~~~~C~C~~g~~ 62 (132)
|....|. +.|.+.. .+.|.||.||.
T Consensus 3 Cn~t~Cp--A~CDpn~--~~~C~CPeGyI 27 (34)
T PF09064_consen 3 CNQTECP--ADCDPNS--PGQCFCPEGYI 27 (34)
T ss_pred cccccCC--CccCCCC--CCceeCCCceE
Confidence 4444554 3666654 37999999984
No 45
>PF04863 EGF_alliinase: Alliinase EGF-like domain; InterPro: IPR006947 Allicin is a thiosulphinate that gives rise to dithiines, allyl sulphides and ajoenes, the three groups of active compounds in Allium species. Allicin is synthesised from sulphoxide cysteine derivatives by alliinase, whose C-S lyase activity cleaves C(beta)-S(gamma) bonds. It is thought that this enzyme forms part of a primitive plant defence system [].; GO: 0016846 carbon-sulfur lyase activity; PDB: 1LK9_B 2HOX_C 2HOR_A.
Probab=59.98 E-value=5.3 Score=22.36 Aligned_cols=35 Identities=26% Similarity=0.474 Sum_probs=17.9
Q ss_pred CCCCCCCeEeeC---CCCceEEeCCCCccCCcccccCC
Q psy11297 37 HPCQNGATCQDE---EDGLFECLCSPEFTGYLCHTRAP 71 (132)
Q Consensus 37 ~pC~ngg~C~~~---~~~~~~C~C~~g~~G~~C~~~~~ 71 (132)
.+|..+|+.... ..|.-.|.|..-|.|+.|.+.+.
T Consensus 17 i~CSGHGr~flDg~~~dG~p~CECn~Cy~GpdCS~~~~ 54 (56)
T PF04863_consen 17 ISCSGHGRAFLDGLIADGSPVCECNSCYGGPDCSTLIP 54 (56)
T ss_dssp S--TTSEE--TTS-EETTEE--EE-TTEESTTS-EE-T
T ss_pred CCcCCCCeeeeccccccCCccccccCCcCCCCcccCCC
Confidence 456666665421 12457899999999999988653
No 46
>KOG0994|consensus
Probab=52.36 E-value=16 Score=32.98 Aligned_cols=32 Identities=34% Similarity=0.895 Sum_probs=20.6
Q ss_pred CCCCCCCCCCCeEeeCCCCceEEeCCCCccCCcccc
Q psy11297 33 PCLSHPCQNGATCQDEEDGLFECLCSPEFTGYLCHT 68 (132)
Q Consensus 33 ~C~~~pC~ngg~C~~~~~~~~~C~C~~g~~G~~C~~ 68 (132)
+|.=+| ..+..|..-. -.|.|.+||.|+.|.+
T Consensus 1068 ~C~Cd~-~~~pqCN~ft---GQCqCkpGfGGR~C~q 1099 (1758)
T KOG0994|consen 1068 PCNCDP-IGGPQCNEFT---GQCQCKPGFGGRTCSQ 1099 (1758)
T ss_pred ccCCCc-cCCccccccc---cceeccCCCCCcchhH
Confidence 444444 2333555443 3799999999999876
No 47
>cd01328 FSL_SPARC Follistatin-like SPARC (secreted protein, acidic, and rich in cysteines) domain; SPARC/BM-40/osteonectin is a multifunctional glycoprotein which modulates cellular interaction with the extracellular matrix by its binding to structural matrix proteins such as collagen and vitronectin. The protein it composed of an N-terminal acidic region, a follistatin (FS) domain and an EF-hand calcium binding domain. The FS domain consists of an N-terminal beta hairpin (FOLN/EGF-like domain) and a small hydrophobic core of alpha/beta structure (Kazal domain) and has five disulfide bonds and a conserved N-glycosylation site. The FSL_SPARC domain is a member of the superfamily of kazal-like proteinase inhibitors and follistatin-like proteins.
Probab=50.58 E-value=29 Score=21.26 Aligned_cols=27 Identities=30% Similarity=0.934 Sum_probs=21.1
Q ss_pred CCCCCCCCCCCeEeeCCCCceEEeCCC
Q psy11297 33 PCLSHPCQNGATCQDEEDGLFECLCSP 59 (132)
Q Consensus 33 ~C~~~pC~ngg~C~~~~~~~~~C~C~~ 59 (132)
+|....|..|.+|+-...+.-.|+|..
T Consensus 1 pC~~v~C~~G~~C~~d~~~~p~CvC~~ 27 (86)
T cd01328 1 PCENHHCGAGKVCEVDDENTPKCVCID 27 (86)
T ss_pred CCCCcCCCCCCEeeECCCCCeEEecCC
Confidence 577888999999986544568999864
No 48
>PHA02885 putative interleukin binding protein; Provisional
Probab=43.49 E-value=35 Score=21.80 Aligned_cols=24 Identities=21% Similarity=0.516 Sum_probs=19.6
Q ss_pred CCCCeEEEEeccCCCCCCCeEEEE
Q psy11297 106 NKNDGILLYNQQNLDGTGDFVSLA 129 (132)
Q Consensus 106 ~~~~g~ll~~~~~~~~~~d~~~l~ 129 (132)
.+.+|.|+|.+..+...+.||.++
T Consensus 57 nsedgyiywi~pnntspgeyif~e 80 (135)
T PHA02885 57 NSEDGYIYWIGPNNTSPGEYIFIE 80 (135)
T ss_pred cCCCceEEEEeCCCCCCcceeeee
Confidence 468899999999877778888765
No 49
>smart00274 FOLN Follistatin-N-terminal domain-like. Follistatin-N-terminal domain-like, EGF-like. Region distinct from the kazal-like sequence
Probab=41.06 E-value=41 Score=15.64 Aligned_cols=23 Identities=26% Similarity=0.683 Sum_probs=14.2
Q ss_pred CCCCCCCCCCCeEeeCCCCceEE
Q psy11297 33 PCLSHPCQNGATCQDEEDGLFEC 55 (132)
Q Consensus 33 ~C~~~pC~ngg~C~~~~~~~~~C 55 (132)
+|...-|..|.+|.....+.-.|
T Consensus 1 ~C~~v~C~~G~~C~~d~~g~p~C 23 (26)
T smart00274 1 SCRNVQCPFGKVCVVDKGGNARC 23 (26)
T ss_pred CCCCEECCCCCEEEeCCCCCEEE
Confidence 35666677777887654344555
No 50
>PF12020 TAFA: TAFA family; InterPro: IPR020350 This entry contains the protein FAM19A2 (also known as TAFA2); it is a secreted, 11 kDa member of the FAM19/TAFA family of chemokine-like proteins [], which are composed of five highly homologous genes that encode small secreted proteins. These proteins contain conserved cysteine residues at fixed positions, and are distantly related to MIP-1alpha, a member of the CC-chemokine family. The FAM19/TAFA proteins are predominantly expressed in specific regions of the brain, and are postulated to function as brain-specific chemokines or neurokines that act as regulators of immune and nervous cells [].
Probab=34.43 E-value=52 Score=20.17 Aligned_cols=39 Identities=26% Similarity=0.571 Sum_probs=20.8
Q ss_pred cCCccCCeecCCCCCCCCccc-----CCCCCCCCCCCCCCeEee
Q psy11297 9 RMGCLSASVVQSSQASPNLRL-----EESPCLSHPCQNGATCQD 47 (132)
Q Consensus 9 ~~~~~~~~~cp~~~~g~~C~~-----~~~~C~~~pC~ngg~C~~ 47 (132)
++.|..+++.=-.-+-+.|-. ...-|.-.||..|..|.-
T Consensus 27 ~C~C~~GqvaGTTrA~Pscvdasiv~~rwWC~MePCl~gEeC~~ 70 (90)
T PF12020_consen 27 RCSCFPGQVAGTTRARPSCVDASIVLQRWWCQMEPCLEGEECKV 70 (90)
T ss_pred EEEecCCeecccccCCCccccceeeeeeeeeecccccCCcceEE
Confidence 456666666543333333421 122366777777777763
No 51
>KOG1226|consensus
Probab=34.25 E-value=80 Score=27.20 Aligned_cols=50 Identities=18% Similarity=0.461 Sum_probs=28.6
Q ss_pred cCCCCCCCCcc--cCCCCCCCCCCCCCCeEeeCCCC-ceEEeCCCC-ccCCcccccC
Q psy11297 18 VQSSQASPNLR--LEESPCLSHPCQNGATCQDEEDG-LFECLCSPE-FTGYLCHTRA 70 (132)
Q Consensus 18 cp~~~~g~~C~--~~~~~C~~~pC~ngg~C~~~~~~-~~~C~C~~g-~~G~~C~~~~ 70 (132)
|...+.|..|+ ...+.|.+. +|+.|.....- =-+|.|... |.|..||...
T Consensus 570 C~~GwtG~~C~C~~std~C~~~---~G~iCSGrG~C~Cg~C~C~~~~~sG~~CE~cp 623 (783)
T KOG1226|consen 570 CNPGWTGSACNCPLSTDTCESS---DGQICSGRGTCECGRCKCTDPPYSGEFCEKCP 623 (783)
T ss_pred cCCCCccCCCCCCCCCccccCC---CCceeCCCceeeCCceEcCCCCcCcchhhcCC
Confidence 34455588775 456667653 44455433100 135667444 9999998854
No 52
>KOG0994|consensus
Probab=31.19 E-value=1.1e+02 Score=27.99 Aligned_cols=37 Identities=24% Similarity=0.713 Sum_probs=24.2
Q ss_pred CCCCCCCCCCC--------eEeeCCC-CceEEeCCCCccCCccccc
Q psy11297 33 PCLSHPCQNGA--------TCQDEED-GLFECLCSPEFTGYLCHTR 69 (132)
Q Consensus 33 ~C~~~pC~ngg--------~C~~~~~-~~~~C~C~~g~~G~~C~~~ 69 (132)
.|.+=||..+. .|..... -...|.|..||.|.+|+.=
T Consensus 905 ~CrPCpCP~gp~Sg~~~A~sC~~d~~t~~ivC~C~~GY~G~RCe~C 950 (1758)
T KOG0994|consen 905 GCRPCPCPDGPASGRQHADSCYLDTRTQQIVCHCQEGYSGSRCEIC 950 (1758)
T ss_pred CCCCCCCCCCCccchhccccccccccccceeeecccCccccchhhh
Confidence 46655666543 4543321 1367999999999999873
No 53
>PF02973 Sialidase: Sialidase, N-terminal domain; InterPro: IPR004124 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Sialidases (GH33 from CAZY) hydrolyse alpha-(2->3)-, alpha-(2->6)-, alpha-(2->8)-glycosidic linkages of terminal sialic residues in oligosaccharides, glycoproteins, glycolipids, colominic acid and synthetic substrates. Sialidases may act as pathogenic factors in microbial infections []. The 1.8 A structure of trans-sialidase from leech (Macrobdella decora, Q27701 from SWISSPROT) in complex with 2-deoxy-2, 3-didehydro-NeuAc was solved. The refined model comprising residues 81-769 has a catalytic beta-propeller domain, a N-terminal lectin-like domain and an irregular beta-stranded domain inserted into the catalytic domain [].; GO: 0004308 exo-alpha-sialidase activity, 0005975 carbohydrate metabolic process; PDB: 2JKB_A 2VW2_A 2VW0_A 2VW1_A 2V73_B 2V72_A 1SLI_A 1SLL_A 2SLI_A 4SLI_A ....
Probab=30.89 E-value=1.8e+02 Score=20.53 Aligned_cols=41 Identities=29% Similarity=0.448 Sum_probs=25.1
Q ss_pred cCCceeEEEEEEEEeCCCCe--EEEEeccCCCCCCCeEEEEEcC
Q psy11297 91 LKAYNKLSIEIEFKTNKNDG--ILLYNQQNLDGTGDFVSLAIVN 132 (132)
Q Consensus 91 ~~~~~~~~i~~~f~t~~~~g--~ll~~~~~~~~~~d~~~l~l~~ 132 (132)
++...+..|.++|++....+ .||....... ...|+.|.+.|
T Consensus 29 lk~L~~gTI~i~Fk~~~~~~~~sLfsiSn~~~-~n~YF~lyv~~ 71 (190)
T PF02973_consen 29 LKKLEEGTIVIRFKSDSNSGIQSLFSISNSTK-GNEYFSLYVSN 71 (190)
T ss_dssp CCT-SSEEEEEEEEESS-SSEEEEEEEE-TST-TSEEEEEEEET
T ss_pred HhcccccEEEEEEecCCCcceeEEEEecCCCC-ccceEEEEEEC
Confidence 44566789999999865544 3454444433 45788887764
No 54
>KOG3607|consensus
Probab=30.60 E-value=38 Score=28.93 Aligned_cols=29 Identities=24% Similarity=0.809 Sum_probs=23.0
Q ss_pred CCCCCCeEeeCCCCceEEeCCCCccCCcccccC
Q psy11297 38 PCQNGATCQDEEDGLFECLCSPEFTGYLCHTRA 70 (132)
Q Consensus 38 pC~ngg~C~~~~~~~~~C~C~~g~~G~~C~~~~ 70 (132)
.|...|.|.+. +.|.|.+||.++.|+..-
T Consensus 631 ~C~g~GVCnn~----~~ChC~~gwapp~C~~~~ 659 (716)
T KOG3607|consen 631 TCNGHGVCNNE----LNCHCEPGWAPPFCFIFG 659 (716)
T ss_pred ccCCCcccCCC----cceeeCCCCCCCcccccc
Confidence 37667777643 789999999999998754
No 55
>KOG1218|consensus
Probab=25.47 E-value=2.5e+02 Score=20.51 Aligned_cols=30 Identities=30% Similarity=0.463 Sum_probs=19.6
Q ss_pred eEEeCCCCccCCcccccCCCCCCCCCCCCC
Q psy11297 53 FECLCSPEFTGYLCHTRAPPKLYDTPAFNG 82 (132)
Q Consensus 53 ~~C~C~~g~~G~~C~~~~~~c~~~~~~~~g 82 (132)
-.|.|++||.|..|......|.....+.++
T Consensus 162 ~~c~c~~g~~g~~~~~~~~~c~~~~~~~~g 191 (316)
T KOG1218|consen 162 GICTCQPGFVGVFCVESCSGCSPLTACENG 191 (316)
T ss_pred CceeccCCcccccccccCCCcCCCcccCCC
Confidence 567899999999988765434323334444
No 56
>KOG4004|consensus
Probab=22.04 E-value=96 Score=22.39 Aligned_cols=31 Identities=26% Similarity=0.692 Sum_probs=24.7
Q ss_pred cCCCCCCCCCCCCCCeEeeCCCCceEEeCCC
Q psy11297 29 LEESPCLSHPCQNGATCQDEEDGLFECLCSP 59 (132)
Q Consensus 29 ~~~~~C~~~pC~ngg~C~~~~~~~~~C~C~~ 59 (132)
...++|+.+.|..|-.|.-...+.-+|+|..
T Consensus 48 ~a~npC~dh~Cg~gk~C~vd~~~~P~Cvc~~ 78 (259)
T KOG4004|consen 48 EARNPCADHKCGPGKNCLVDLQTQPRCVCCR 78 (259)
T ss_pred cccCccccccCCCCceeeecCCCCceeEEec
Confidence 3468999999999999986655668899855
No 57
>PF06247 Plasmod_Pvs28: Plasmodium ookinete surface protein Pvs28; InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=21.86 E-value=48 Score=23.50 Aligned_cols=26 Identities=23% Similarity=0.759 Sum_probs=19.8
Q ss_pred CCCCCCCeEeeCCC----CceEEeCCCCcc
Q psy11297 37 HPCQNGATCQDEED----GLFECLCSPEFT 62 (132)
Q Consensus 37 ~pC~ngg~C~~~~~----~~~~C~C~~g~~ 62 (132)
.+|...+.|+...+ ..|.|.|.+||.
T Consensus 50 K~Cgdya~C~~~~~~~~~~~~~C~C~~gY~ 79 (197)
T PF06247_consen 50 KPCGDYAKCINQANKGEERAYKCDCINGYI 79 (197)
T ss_dssp SEEETTEEEEE-SSTTSSTSEEEEE-TTEE
T ss_pred ccccchhhhhcCCCcccceeEEEecccCce
Confidence 68999999997653 369999999985
No 58
>PF07966 A1_Propeptide: A1 Propeptide ; InterPro: IPR012848 Most eukaryotic endopeptidases (MEROPS peptidase family A1) are synthesised with signal and propeptides. The animal pepsin-like endopeptidase propeptides form a distinct family of propeptides, which contain a conserved motif approximately 30 residues long. In pepsinogen A, the first 11 residues of the mature pepsin sequence are displaced by residues of the propeptide. The propeptide contains two helices that block the active site cleft, in particular the conserved Asp11 residue, in pepsin, hydrogen bonds to a conserved Arg residue in the propeptide. This hydrogen bond stabilises the propeptide conformation and is probably responsible for triggering the conversion of pepsinogen to pepsin under acidic conditions [, ]. ; GO: 0004190 aspartic-type endopeptidase activity, 0006508 proteolysis; PDB: 1AVF_Q 1HTR_P 3PSG_A 2PSG_A 3VCM_Q 1TZS_P.
Probab=20.65 E-value=32 Score=16.38 Aligned_cols=8 Identities=38% Similarity=0.845 Sum_probs=3.3
Q ss_pred CceeEeec
Q psy11297 2 VPLVRMKR 9 (132)
Q Consensus 2 ~~~~~~~~ 9 (132)
|||.|+.+
T Consensus 4 IPL~K~kS 11 (29)
T PF07966_consen 4 IPLKKFKS 11 (29)
T ss_dssp EEEEE---
T ss_pred EeccCCch
Confidence 56766654
Done!