Query psy14262
Match_columns 482
No_of_seqs 447 out of 2860
Neff 7.5
Searched_HMMs 46136
Date Fri Aug 16 23:30:56 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy14262.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/14262hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1215|consensus 99.8 3.5E-20 7.6E-25 211.5 18.4 324 115-481 25-426 (877)
2 KOG1215|consensus 99.2 3.8E-11 8.3E-16 137.6 12.4 166 120-312 142-319 (877)
3 PF00057 Ldl_recept_a: Low-den 98.9 9E-10 1.9E-14 75.0 3.3 36 4-45 2-37 (37)
4 cd00112 LDLa Low Density Lipop 98.9 8.4E-10 1.8E-14 74.3 2.5 35 5-45 1-35 (35)
5 PF00057 Ldl_recept_a: Low-den 98.8 3.6E-09 7.8E-14 72.0 3.3 35 164-199 2-36 (37)
6 cd00112 LDLa Low Density Lipop 98.7 6E-09 1.3E-13 70.1 2.7 34 165-199 1-34 (35)
7 smart00192 LDLa Low-density li 98.7 1.7E-08 3.7E-13 67.0 2.8 33 4-42 1-33 (33)
8 smart00192 LDLa Low-density li 98.5 1.1E-07 2.3E-12 63.1 2.7 32 165-197 2-33 (33)
9 KOG1214|consensus 98.4 3E-07 6.5E-12 99.0 6.4 147 290-455 695-865 (1289)
10 PF12999 PRKCSH-like: Glucosid 98.4 2.2E-07 4.8E-12 84.8 4.7 68 167-237 35-110 (176)
11 PF14670 FXa_inhibition: Coagu 98.4 2.8E-07 6E-12 62.2 3.5 33 421-453 4-36 (36)
12 KOG1219|consensus 98.2 1.8E-06 3.9E-11 100.8 5.7 105 337-455 3864-3978(4289)
13 PF12999 PRKCSH-like: Glucosid 98.1 3E-06 6.4E-11 77.5 4.2 68 207-282 35-110 (176)
14 PF07645 EGF_CA: Calcium-bindi 98.1 5.3E-06 1.2E-10 58.2 4.2 39 415-453 1-42 (42)
15 PF12662 cEGF: Complement Clr- 98.0 3.8E-06 8.2E-11 51.1 2.1 24 435-458 1-24 (24)
16 KOG1219|consensus 97.4 0.00017 3.6E-09 85.2 4.9 102 289-414 3866-3977(4289)
17 KOG1214|consensus 97.3 0.00049 1.1E-08 75.0 6.8 112 344-462 699-833 (1289)
18 KOG4260|consensus 97.2 0.00082 1.8E-08 64.8 6.3 46 401-446 221-269 (350)
19 PF14670 FXa_inhibition: Coagu 97.0 0.00018 3.8E-09 48.6 0.0 30 384-413 5-36 (36)
20 PF12662 cEGF: Complement Clr- 96.8 0.00079 1.7E-08 41.0 1.7 22 397-418 1-24 (24)
21 cd01475 vWA_Matrilin VWA_Matri 96.2 0.0039 8.5E-08 60.0 3.5 44 409-452 180-224 (224)
22 KOG2397|consensus 96.1 0.0055 1.2E-07 63.9 4.3 69 168-239 43-115 (480)
23 smart00179 EGF_CA Calcium-bind 95.9 0.009 2E-07 40.3 3.4 35 416-453 2-38 (39)
24 KOG1217|consensus 95.9 0.21 4.6E-06 52.6 15.2 116 338-461 272-405 (487)
25 KOG4289|consensus 95.4 0.02 4.3E-07 66.2 5.2 88 357-455 1221-1317(2531)
26 KOG1217|consensus 95.3 0.15 3.1E-06 53.8 11.3 150 278-459 158-315 (487)
27 PF07645 EGF_CA: Calcium-bindi 95.0 0.0073 1.6E-07 42.2 0.3 26 381-406 4-33 (42)
28 KOG4260|consensus 94.8 0.06 1.3E-06 52.2 6.0 76 365-447 221-306 (350)
29 KOG3509|consensus 94.3 0.17 3.7E-06 57.8 9.0 104 257-374 2-109 (964)
30 KOG2397|consensus 93.8 0.056 1.2E-06 56.6 3.8 68 209-284 44-115 (480)
31 cd00054 EGF_CA Calcium-binding 93.4 0.099 2.1E-06 34.5 3.3 30 416-445 2-33 (38)
32 smart00181 EGF Epidermal growt 93.4 0.094 2E-06 34.5 3.1 25 423-447 6-31 (35)
33 KOG4289|consensus 93.0 0.087 1.9E-06 61.2 3.7 49 394-445 1218-1269(2531)
34 PF12947 EGF_3: EGF domain; I 92.9 0.099 2.1E-06 35.2 2.6 30 422-453 5-36 (36)
35 KOG3509|consensus 92.9 0.098 2.1E-06 59.7 4.1 103 177-286 2-111 (964)
36 smart00179 EGF_CA Calcium-bind 91.4 0.16 3.6E-06 33.9 2.4 32 381-413 4-38 (39)
37 cd00053 EGF Epidermal growth f 89.3 0.46 1E-05 30.6 3.1 24 423-446 6-31 (36)
38 PF12947 EGF_3: EGF domain; I 89.2 0.087 1.9E-06 35.5 -0.5 25 389-413 12-36 (36)
39 smart00181 EGF Epidermal growt 88.7 0.34 7.3E-06 31.7 2.1 22 385-406 6-28 (35)
40 PF00008 EGF: EGF-like domain 84.6 1 2.2E-05 29.3 2.6 23 423-445 4-29 (32)
41 PF00008 EGF: EGF-like domain 84.2 0.23 4.9E-06 32.4 -0.7 23 384-406 3-28 (32)
42 cd00054 EGF_CA Calcium-binding 82.0 1.2 2.6E-05 29.1 2.2 26 381-406 4-32 (38)
43 cd00053 EGF Epidermal growth f 81.3 1.3 2.8E-05 28.4 2.1 18 389-406 12-29 (36)
44 KOG1225|consensus 80.2 7 0.00015 42.3 8.3 72 210-310 260-331 (525)
45 cd01475 vWA_Matrilin VWA_Matri 78.7 1.4 3E-05 42.2 2.3 23 384-406 194-216 (224)
46 KOG1225|consensus 78.3 9.8 0.00021 41.2 8.7 62 365-445 299-362 (525)
47 PF06247 Plasmod_Pvs28: Plasmo 68.3 2.9 6.3E-05 38.8 1.6 57 392-448 14-82 (197)
48 PF09064 Tme5_EGF_like: Thromb 60.6 8.9 0.00019 25.4 2.3 24 424-448 7-30 (34)
49 PF12661 hEGF: Human growth fa 59.0 7.8 0.00017 20.0 1.5 9 437-445 1-9 (13)
50 PF12946 EGF_MSP1_1: MSP1 EGF 31.0 48 0.001 22.5 2.2 22 426-447 10-32 (37)
51 PF06247 Plasmod_Pvs28: Plasmo 27.1 13 0.00028 34.6 -1.4 98 346-450 51-165 (197)
No 1
>KOG1215|consensus
Probab=99.84 E-value=3.5e-20 Score=211.54 Aligned_cols=324 Identities=32% Similarity=0.614 Sum_probs=250.1
Q ss_pred CCCCcceeeCCceecCceeeCCCCCCCcccCCCCCCccccccCCCC----------------------------------
Q psy14262 115 GENRSPLVTKNIQWPSSLVIDYNQDILYWCDSHLNTVESVRVDGSH---------------------------------- 160 (482)
Q Consensus 115 ~~~~~~~~~~~~ci~~~~~cdg~~dc~dw~d~~~~sdE~~~~~~~c---------------------------------- 160 (482)
|+.+..++-...-||+++..|....+.+|.|++...+|..++++.-
T Consensus 25 g~~~~~~~~~~~~~~ng~~id~~~~~~y~~d~~~~~i~~~~~dg~~r~~l~~~~~~~y~~d~~v~~~~~~sg~~~~~~~~ 104 (877)
T KOG1215|consen 25 GGERKILEKEEFEWPNGLTIDLAWQRIYWADAKNDLIESANYDGSGRRALTLFEDGLYWTDKSVSAANKKTGKDVTRLSQ 104 (877)
T ss_pred CcceEEeeccceeCCCcceecchhheeeeccccCCceEEeccCCccceeeeeeccceeeccchhhhhccCCCCcceeehh
Confidence 3444455555677888888888899999999999888887776631
Q ss_pred -----------------------------CCccCCCCCeecC--CCCceecCCcccCCCCCCCCCCCCCCCCC---CCCC
Q psy14262 161 -----------------------------NAVLCHPNELRCG--SSNQCVALSKFCDGKPDCSNGEDEGPFCN---AECT 206 (482)
Q Consensus 161 -----------------------------~~~~C~~~~f~C~--~~~~Ci~~~~~CDg~~dC~dgsDE~~~C~---~~C~ 206 (482)
+...|...+|+|. +. +||+..|+|||..+|.+|+||.+ |. ....
T Consensus 105 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~c~~~~~-~Cip~~~~cd~~~~C~dg~de~~-~~~~~~~~~ 182 (877)
T KOG1215|consen 105 DSHFPLDIHAYHPSSQPLAPDPCAESGNGPCSHCCLDKFSCRTGSC-KCIPGDWLCDGEADCPDGSDELN-CAVRRCEPR 182 (877)
T ss_pred cCCCCcceeEEecCCCCCCCCcccccCCCCCccccCCCCCCcCccc-cCCCCceeCCCCCccccchhhhc-ccccccCcc
Confidence 0123566789998 44 89999999999999999999998 64 1222
Q ss_pred CCceecCCCCeeccCCCCCCCcCCCCCCCCCCCccccccc---CCccccCCCCceecCCCccCccCCCCCCCCCCCCCCC
Q psy14262 207 PGQFSCGMGKCIPESWKCDGFVDCLEDGSDEPNTCIHQCF---NGQVQCALSKKCIPHNWLCDEEMDCSTDDIVDNTDED 283 (482)
Q Consensus 207 ~~~f~C~~g~Ci~~~~~CDg~~dC~~dgsDE~~C~~~~C~---~~~f~C~~~g~Ci~~~~~CDg~~dC~~~~~~dgsDE~ 283 (482)
...|+| |...++||+..+| .+++|+.......+. ...++|.+.++||...+.|||..+|. +++||.
T Consensus 183 ~~~~~~-----~~~~~~~d~~~~~-~~~~d~~~~~~~~~~~~~~~~~~c~g~~~~i~~~~~~Dg~~dc~-----~~~de~ 251 (877)
T KOG1215|consen 183 GASLDC-----IVAIKVCDIQHDC-ADDYDESEGRIYWTDDSRIEVTRCDGSSRCILISEVCDGPRDCV-----DGPDEG 251 (877)
T ss_pred cccccc-----ceeeeecCccccc-ccccccccCcccccCCcceeEEEecCCCcEEeehhccCCCcccc-----cCCcCc
Confidence 344555 9999999999999 999999887544443 46788985569999999999999999 999994
Q ss_pred --CCCCccccCcccccCCCCccCCCCCCCCCCCCCCCCCcCCCCCCCcCCcCccCcccccCCCcccCCCCcccccccccC
Q psy14262 284 --WDKFHQCFNGQVQCALSKKCIPHNWLCDEEMDCSTDDIVDNTDEDWDKYFSHNAVLCHPNELRCGSSNQCVALSKFCD 361 (482)
Q Consensus 284 --~C~~~~C~~~~f~C~~g~~Ci~~~~~CDg~~DC~~g~~~dgsDE~~~~~C~~~~~~C~~~~f~C~~g~~Ci~~~~~CD 361 (482)
++....|...++.|.++ .|++...+|+|..||+ +|+||.. |.. ..+.+..|.|..+ + |...++|+
T Consensus 252 ~~~~~~~~~~~~e~~~~~~-~~~~~~~~~~g~~d~p-----dg~de~~---~~~--~~~~~~~~d~~~~-~-i~~~~~~~ 318 (877)
T KOG1215|consen 252 VMNCSDATCEAPEIECADG-DCSDRQKLCDGDLDCP-----DGLDEDY---CKK--KLYWSMNVDGSGR-R-ILLSKLCH 318 (877)
T ss_pred eeEeeccccCCcceeecCC-CCccceEEecCccCCC-----Ccccccc---ccc--ceeeeeecccCCc-e-eeecccCc
Confidence 66667788889999876 9999999999999999 9999987 754 2233667888877 8 99999998
Q ss_pred CCCCCCCCCCCCCCCCCCCCcc--CCCCcCCCCCCCCCccccCCCCc--cCCCCCccccccccCCCCCCceee-ecCCce
Q psy14262 362 GKPDCSNGEDEGPFCNNKADCE--NLKCSHKCSPTPRGAMCFCPAGQ--NPNKTQCVDADECLIDGSCDQTCT-NTHLSY 436 (482)
Q Consensus 362 g~~dC~dgsDE~~~C~~~~~C~--~~~C~~~C~~~~~~~~C~C~~G~--~~~~~~C~di~eC~~~~~Cs~~C~-n~~gsy 436 (482)
+ ...+..+ ++. ...+.+.+.+......+.|..++ .....+ ....++..++.|+|.|. +.++.|
T Consensus 319 ~-----~~~~~~~------~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~-~~~~~~~~~g~Csq~C~~~~p~~~ 386 (877)
T KOG1215|consen 319 G-----YWTDGLN------ECAERVLKCSHKCPDVSVGPRCDCMGAKVLPLGART-DSNPCESDNGGCSQLCVPNSPGTF 386 (877)
T ss_pred c-----ccccccc------cchhhcccccCCCCccccCCcccCCccceecccccc-cCCcccccCCccceeccCCCCCce
Confidence 8 1233322 443 45666778888888999999988 222322 12233336799999999 569999
Q ss_pred EEecCCCeEECCCCCccccCCCCCCCCcEEEeccCCceEEEeccC
Q psy14262 437 VCSCTEGYTLRPDGKSCTAVNVPQDEPASIIFSSTVDIRRIHLNG 481 (482)
Q Consensus 437 ~C~C~~Gy~l~~d~~tC~didec~~~p~~l~~~~~~~ir~~~l~~ 481 (482)
+|.|..||.+..++ |...+ ...++|+|+++++||++.|..
T Consensus 387 ~c~c~~g~~~~~~~--c~~~~---~~~~~l~~s~~~~ir~~~~~~ 426 (877)
T KOG1215|consen 387 KCACSPGYELRLDK--CEASD---QPEAFLLFSNRHDIRRISLDC 426 (877)
T ss_pred eEecCCCcEeccCC--ceecC---CCCcEEEEecCccceecccCC
Confidence 99999999998766 66554 334599999999999999875
No 2
>KOG1215|consensus
Probab=99.24 E-value=3.8e-11 Score=137.56 Aligned_cols=166 Identities=27% Similarity=0.421 Sum_probs=136.8
Q ss_pred ceeeC--C-ceecCceeeCCCCCCCcccCCCCCCccccccCCCCCCcc--CCCCCeecCCCCceecCCcccCCCCCCCCC
Q psy14262 120 PLVTK--N-IQWPSSLVIDYNQDILYWCDSHLNTVESVRVDGSHNAVL--CHPNELRCGSSNQCVALSKFCDGKPDCSNG 194 (482)
Q Consensus 120 ~~~~~--~-~ci~~~~~cdg~~dc~dw~d~~~~sdE~~~~~~~c~~~~--C~~~~f~C~~~~~Ci~~~~~CDg~~dC~dg 194 (482)
.|.|. . .|||..|+||+..+|.+ |+||. .+.... .....|.| |...++||+..+|.++
T Consensus 142 ~~~c~~~~~~Cip~~~~cd~~~~C~d------g~de~-----~~~~~~~~~~~~~~~~------~~~~~~~d~~~~~~~~ 204 (877)
T KOG1215|consen 142 KFSCRTGSCKCIPGDWLCDGEADCPD------GSDEL-----NCAVRRCEPRGASLDC------IVAIKVCDIQHDCADD 204 (877)
T ss_pred CCCCcCccccCCCCceeCCCCCcccc------chhhh-----cccccccCcccccccc------ceeeeecCcccccccc
Confidence 45565 4 99999999999999999 99999 454221 22233444 8899999999999999
Q ss_pred CCCCCCCC-CCCC---CCceecCCC-CeeccCCCCCCCcCCCCCCCCCC--CcccccccCCccccCCCCceecCCCccCc
Q psy14262 195 EDEGPFCN-AECT---PGQFSCGMG-KCIPESWKCDGFVDCLEDGSDEP--NTCIHQCFNGQVQCALSKKCIPHNWLCDE 267 (482)
Q Consensus 195 sDE~~~C~-~~C~---~~~f~C~~g-~Ci~~~~~CDg~~dC~~dgsDE~--~C~~~~C~~~~f~C~~~g~Ci~~~~~CDg 267 (482)
+|+.. +. ..+. ...++|..+ +||..++.|||..|| .+++||. ++....|...++.|. ++.|++..++|+|
T Consensus 205 ~d~~~-~~~~~~~~~~~~~~~c~g~~~~i~~~~~~Dg~~dc-~~~~de~~~~~~~~~~~~~e~~~~-~~~~~~~~~~~~g 281 (877)
T KOG1215|consen 205 YDESE-GRIYWTDDSRIEVTRCDGSSRCILISEVCDGPRDC-VDGPDEGVMNCSDATCEAPEIECA-DGDCSDRQKLCDG 281 (877)
T ss_pred ccccc-CcccccCCcceeEEEecCCCcEEeehhccCCCccc-ccCCcCceeEeeccccCCcceeec-CCCCccceEEecC
Confidence 99987 43 1122 467899875 999999999999999 9999994 676667888899998 9999999999999
Q ss_pred cCCCCCCCCCCCCCCCCCCCccccCcccccCCCCccCCCCCCCCC
Q psy14262 268 EMDCSTDDIVDNTDEDWDKFHQCFNGQVQCALSKKCIPHNWLCDE 312 (482)
Q Consensus 268 ~~dC~~~~~~dgsDE~~C~~~~C~~~~f~C~~g~~Ci~~~~~CDg 312 (482)
..+|+ +|+||..|....+....|.|..+ + |+...+|++
T Consensus 282 ~~d~p-----dg~de~~~~~~~~~~~~~d~~~~-~-i~~~~~~~~ 319 (877)
T KOG1215|consen 282 DLDCP-----DGLDEDYCKKKLYWSMNVDGSGR-R-ILLSKLCHG 319 (877)
T ss_pred ccCCC-----CcccccccccceeeeeecccCCc-e-eeecccCcc
Confidence 99999 99999998866665778899866 8 999998876
No 3
>PF00057 Ldl_recept_a: Low-density lipoprotein receptor domain class A This prints entry is specific to LDL receptor; InterPro: IPR002172 The low-density lipoprotein receptor (LDLR) is the major cholesterol-carrying lipoprotein of plasma, acting to regulate cholesterol homeostasis in mammalian cells. The LDL receptor binds LDL and transports it into cells by acidic endocytosis. In order to be internalized, the receptor-ligand complex must first cluster into clathrin-coated pits. Once inside the cell, the LDLR separates from its ligand, which is degraded in the lysosomes, while the receptor returns to the cell surface []. The internal dissociation of the LDLR with its ligand is mediated by proton pumps within the walls of the endosome that lower the pH. The LDLR is a multi-domain protein, containing: The ligand-binding domain contains seven or eight 40-amino acid LDLR class A (cysteine-rich) repeats, each of which contains a coordinated calcium ion and six cysteine residues involved in disulphide bond formation []. Similar domains have been found in other extracellular and membrane proteins []. The second conserved region contains two EGF repeats, followed by six LDLR class B (YWTD) repeats, and another EGF repeat. The LDLR class B repeats each contain a conserved YWTD motif, and is predicted to form a beta-propeller structure []. This region is critical for ligand release and recycling of the receptor []. The third domain is rich in serine and threonine residues and contains clustered O-linked carbohydrate chains. The fourth domain is the hydrophobic transmembrane region. The fifth domain is the cytoplasmic tail that directs the receptor to clathrin-coated pits. LDLR is closely related in structure to several other receptors, including LRP1, LRP1b, megalin/LRP2, VLDL receptor, lipoprotein receptor, MEGF7/LRP4, and LRP8/apolipoprotein E receptor2); these proteins participate in a wide range of physiological processes, including the regulation of lipid metabolism, protection against atherosclerosis, neurodevelopment, and transport of nutrients and vitamins []. This entry represents the LDLR class A (cyateine-rich) repeat, which contains 6 disulphide-bound cysteines and a highly conserved cluster of negatively charged amino acids, of which many are clustered on one face of the module []. In LDL receptors, the class A domains form the binding site for LDL and calcium. The acidic residues between the fourth and sixth cysteines are important for high-affinity binding of positively charged sequences in LDLR's ligands. The repeat consists of a beta-hairpin structure followed by a series of beta turns. In the absence of calcium, LDL-A domains are unstructured; the bound calcium ion imparts structural integrity. Following these repeats is a 350 residue domain that resembles part of the epidermal growth factor (EGF) precursor. Numerous familial hypercholestorolemia mutations of the LDL receptor alter the calcium coordinating residue of LDL-A domains or other crucial scaffolding residues. ; GO: 0005515 protein binding; PDB: 2I1P_A 3OJY_A 4E0S_B 3T5O_A 4A5W_B 1JRF_A 1K7B_A 1V9U_5 3DPR_E 2KNY_A ....
Probab=98.91 E-value=9e-10 Score=74.98 Aligned_cols=36 Identities=39% Similarity=0.985 Sum_probs=33.2
Q ss_pred ccCCCCeeeCCCCCeecCccccCCCCCCCCCCCCCCCCcccc
Q psy14262 4 QCFNGQVQCALSKKCIPHNWLCDEEMDCSTDDIVDNTDEDWD 45 (482)
Q Consensus 4 ~C~~~~f~C~~~~~Ci~~~~~CDg~~dC~~~~~~d~sDE~~~ 45 (482)
.|.+++|+|. ++.|||..|+|||+.||+ |+|||.+|
T Consensus 2 ~C~~~~f~C~-~~~CI~~~~~CDg~~DC~-----dgsDE~~C 37 (37)
T PF00057_consen 2 TCPPGEFRCG-NGQCIPKSWVCDGIPDCP-----DGSDEQNC 37 (37)
T ss_dssp SSSTTEEEET-TSSEEEGGGTTSSSCSSS-----SSTTTSSH
T ss_pred cCcCCeeEcC-CCCEEChHHcCCCCCCCC-----CCcccccC
Confidence 6899999999 888999999999999998 99999754
No 4
>cd00112 LDLa Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; the receptor protein binds LDL and transports it into cells by endocytosis; 7 successive cysteine-rich repeats of about 40 amino acids are present in the N-terminal of this multidomain membrane protein; other homologous domains occur in related receptors, including the very low-density lipoprotein receptor and the LDL receptor-related protein/alpha 2-macroglobulin receptor, and in proteins which are functionally unrelated, such as the C9 component of complement; the binding of calcium is required for in vitro formation of the native disulfide isomer and is necessary in establishing and maintaining the modular structure
Probab=98.89 E-value=8.4e-10 Score=74.27 Aligned_cols=35 Identities=43% Similarity=1.048 Sum_probs=32.0
Q ss_pred cCCCCeeeCCCCCeecCccccCCCCCCCCCCCCCCCCcccc
Q psy14262 5 CFNGQVQCALSKKCIPHNWLCDEEMDCSTDDIVDNTDEDWD 45 (482)
Q Consensus 5 C~~~~f~C~~~~~Ci~~~~~CDg~~dC~~~~~~d~sDE~~~ 45 (482)
|.+++|+|. +++|||..|+|||+.||+ |+|||..|
T Consensus 1 C~~~~f~C~-~~~Ci~~~~~CDg~~DC~-----dgsDE~~C 35 (35)
T cd00112 1 CPPNEFRCA-NGRCIPSSWVCDGEDDCG-----DGSDEENC 35 (35)
T ss_pred CCCCeEEcC-CCCeeCHHHcCCCccCCC-----CCcccccC
Confidence 678999998 799999999999999998 99999743
No 5
>PF00057 Ldl_recept_a: Low-density lipoprotein receptor domain class A This prints entry is specific to LDL receptor; InterPro: IPR002172 The low-density lipoprotein receptor (LDLR) is the major cholesterol-carrying lipoprotein of plasma, acting to regulate cholesterol homeostasis in mammalian cells. The LDL receptor binds LDL and transports it into cells by acidic endocytosis. In order to be internalized, the receptor-ligand complex must first cluster into clathrin-coated pits. Once inside the cell, the LDLR separates from its ligand, which is degraded in the lysosomes, while the receptor returns to the cell surface []. The internal dissociation of the LDLR with its ligand is mediated by proton pumps within the walls of the endosome that lower the pH. The LDLR is a multi-domain protein, containing: The ligand-binding domain contains seven or eight 40-amino acid LDLR class A (cysteine-rich) repeats, each of which contains a coordinated calcium ion and six cysteine residues involved in disulphide bond formation []. Similar domains have been found in other extracellular and membrane proteins []. The second conserved region contains two EGF repeats, followed by six LDLR class B (YWTD) repeats, and another EGF repeat. The LDLR class B repeats each contain a conserved YWTD motif, and is predicted to form a beta-propeller structure []. This region is critical for ligand release and recycling of the receptor []. The third domain is rich in serine and threonine residues and contains clustered O-linked carbohydrate chains. The fourth domain is the hydrophobic transmembrane region. The fifth domain is the cytoplasmic tail that directs the receptor to clathrin-coated pits. LDLR is closely related in structure to several other receptors, including LRP1, LRP1b, megalin/LRP2, VLDL receptor, lipoprotein receptor, MEGF7/LRP4, and LRP8/apolipoprotein E receptor2); these proteins participate in a wide range of physiological processes, including the regulation of lipid metabolism, protection against atherosclerosis, neurodevelopment, and transport of nutrients and vitamins []. This entry represents the LDLR class A (cyateine-rich) repeat, which contains 6 disulphide-bound cysteines and a highly conserved cluster of negatively charged amino acids, of which many are clustered on one face of the module []. In LDL receptors, the class A domains form the binding site for LDL and calcium. The acidic residues between the fourth and sixth cysteines are important for high-affinity binding of positively charged sequences in LDLR's ligands. The repeat consists of a beta-hairpin structure followed by a series of beta turns. In the absence of calcium, LDL-A domains are unstructured; the bound calcium ion imparts structural integrity. Following these repeats is a 350 residue domain that resembles part of the epidermal growth factor (EGF) precursor. Numerous familial hypercholestorolemia mutations of the LDL receptor alter the calcium coordinating residue of LDL-A domains or other crucial scaffolding residues. ; GO: 0005515 protein binding; PDB: 2I1P_A 3OJY_A 4E0S_B 3T5O_A 4A5W_B 1JRF_A 1K7B_A 1V9U_5 3DPR_E 2KNY_A ....
Probab=98.81 E-value=3.6e-09 Score=72.01 Aligned_cols=35 Identities=51% Similarity=1.155 Sum_probs=18.4
Q ss_pred cCCCCCeecCCCCceecCCcccCCCCCCCCCCCCCC
Q psy14262 164 LCHPNELRCGSSNQCVALSKFCDGKPDCSNGEDEGP 199 (482)
Q Consensus 164 ~C~~~~f~C~~~~~Ci~~~~~CDg~~dC~dgsDE~~ 199 (482)
.|.+.+|+|.++ +||+..|+|||+.||.|||||.+
T Consensus 2 ~C~~~~f~C~~~-~CI~~~~~CDg~~DC~dgsDE~~ 36 (37)
T PF00057_consen 2 TCPPGEFRCGNG-QCIPKSWVCDGIPDCPDGSDEQN 36 (37)
T ss_dssp SSSTTEEEETTS-SEEEGGGTTSSSCSSSSSTTTSS
T ss_pred cCcCCeeEcCCC-CEEChHHcCCCCCCCCCCccccc
Confidence 345555555554 35555555555555555555543
No 6
>cd00112 LDLa Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; the receptor protein binds LDL and transports it into cells by endocytosis; 7 successive cysteine-rich repeats of about 40 amino acids are present in the N-terminal of this multidomain membrane protein; other homologous domains occur in related receptors, including the very low-density lipoprotein receptor and the LDL receptor-related protein/alpha 2-macroglobulin receptor, and in proteins which are functionally unrelated, such as the C9 component of complement; the binding of calcium is required for in vitro formation of the native disulfide isomer and is necessary in establishing and maintaining the modular structure
Probab=98.74 E-value=6e-09 Score=70.11 Aligned_cols=34 Identities=47% Similarity=1.116 Sum_probs=20.9
Q ss_pred CCCCCeecCCCCceecCCcccCCCCCCCCCCCCCC
Q psy14262 165 CHPNELRCGSSNQCVALSKFCDGKPDCSNGEDEGP 199 (482)
Q Consensus 165 C~~~~f~C~~~~~Ci~~~~~CDg~~dC~dgsDE~~ 199 (482)
|.+.+|+|.++ +||+..++|||+.||+|||||.+
T Consensus 1 C~~~~f~C~~~-~Ci~~~~~CDg~~DC~dgsDE~~ 34 (35)
T cd00112 1 CPPNEFRCANG-RCIPSSWVCDGEDDCGDGSDEEN 34 (35)
T ss_pred CCCCeEEcCCC-CeeCHHHcCCCccCCCCCccccc
Confidence 44556666664 66666666666666666666654
No 7
>smart00192 LDLa Low-density lipoprotein receptor domain class A. Cysteine-rich repeat in the low-density lipoprotein (LDL) receptor that plays a central role in mammalian cholesterol metabolism. The N-terminal type A repeats in LDL receptor bind the lipoproteins. Other homologous domains occur in related receptors, including the very low-density lipoprotein receptor and the LDL receptor-related protein/alpha 2-macroglobulin receptor, and in proteins which are functionally unrelated, such as the C9 component of complement. Mutations in the LDL receptor gene cause familial hypercholesterolemia.
Probab=98.65 E-value=1.7e-08 Score=67.00 Aligned_cols=33 Identities=48% Similarity=1.145 Sum_probs=30.2
Q ss_pred ccCCCCeeeCCCCCeecCccccCCCCCCCCCCCCCCCCc
Q psy14262 4 QCFNGQVQCALSKKCIPHNWLCDEEMDCSTDDIVDNTDE 42 (482)
Q Consensus 4 ~C~~~~f~C~~~~~Ci~~~~~CDg~~dC~~~~~~d~sDE 42 (482)
.|.+.+|+|. ++.|||..|+|||++||+ |+|||
T Consensus 1 ~C~~~~f~C~-~~~Ci~~~~~Cdg~~dC~-----dgsDE 33 (33)
T smart00192 1 TCPPGEFQCD-NGRCIPLSWVCDGVDDCS-----DGSDE 33 (33)
T ss_pred CCCCCeEECC-CCCEECchhhCCCcCcCc-----CCCCC
Confidence 3777899998 899999999999999998 88997
No 8
>smart00192 LDLa Low-density lipoprotein receptor domain class A. Cysteine-rich repeat in the low-density lipoprotein (LDL) receptor that plays a central role in mammalian cholesterol metabolism. The N-terminal type A repeats in LDL receptor bind the lipoproteins. Other homologous domains occur in related receptors, including the very low-density lipoprotein receptor and the LDL receptor-related protein/alpha 2-macroglobulin receptor, and in proteins which are functionally unrelated, such as the C9 component of complement. Mutations in the LDL receptor gene cause familial hypercholesterolemia.
Probab=98.47 E-value=1.1e-07 Score=63.14 Aligned_cols=32 Identities=50% Similarity=1.183 Sum_probs=24.6
Q ss_pred CCCCCeecCCCCceecCCcccCCCCCCCCCCCC
Q psy14262 165 CHPNELRCGSSNQCVALSKFCDGKPDCSNGEDE 197 (482)
Q Consensus 165 C~~~~f~C~~~~~Ci~~~~~CDg~~dC~dgsDE 197 (482)
|...+|+|.++ .||+..++|||+.||+|||||
T Consensus 2 C~~~~f~C~~~-~Ci~~~~~Cdg~~dC~dgsDE 33 (33)
T smart00192 2 CPPGEFQCDNG-RCIPLSWVCDGVDDCSDGSDE 33 (33)
T ss_pred CCCCeEECCCC-CEECchhhCCCcCcCcCCCCC
Confidence 44567888766 688888888888888888887
No 9
>KOG1214|consensus
Probab=98.43 E-value=3e-07 Score=99.03 Aligned_cols=147 Identities=32% Similarity=0.762 Sum_probs=100.3
Q ss_pred ccCcccccCCCCccCCC---CCCCCCCCCCCCCCcCCCCCCCcCCcCccCcccccCCCcccCCCCccccc--ccccCCCC
Q psy14262 290 CFNGQVQCALSKKCIPH---NWLCDEEMDCSTDDIVDNTDEDWDKYFSHNAVLCHPNELRCGSSNQCVAL--SKFCDGKP 364 (482)
Q Consensus 290 C~~~~f~C~~g~~Ci~~---~~~CDg~~DC~~g~~~dgsDE~~~~~C~~~~~~C~~~~f~C~~g~~Ci~~--~~~CDg~~ 364 (482)
|..+.-.|..+..|.+. .|.| .|+.++-+|| .+ |.. +..|......|+....||+. +++|
T Consensus 695 Cy~gsh~cdt~a~C~pg~~~~~tc----ecs~g~~gdg---r~---c~d-~~eca~~~~~CGp~s~Cin~pg~~rc---- 759 (1289)
T KOG1214|consen 695 CYDGSHMCDTTARCHPGTGVDYTC----ECSSGYQGDG---RN---CVD-ENECATGFHRCGPNSVCINLPGSYRC---- 759 (1289)
T ss_pred ceecCcccCCCccccCCCCcceEE----EEeeccCCCC---CC---CCC-hhhhccCCCCCCCCceeecCCCceeE----
Confidence 43334445555566653 4555 5664443333 33 443 23566666788887799976 7888
Q ss_pred CCCCC---CCCCCCCC-------CCCCcc--CCCCc----CCCCCC-CCCccccCCCCccCCCCCccccccccCCCCC--
Q psy14262 365 DCSNG---EDEGPFCN-------NKADCE--NLKCS----HKCSPT-PRGAMCFCPAGQNPNKTQCVDADECLIDGSC-- 425 (482)
Q Consensus 365 dC~dg---sDE~~~C~-------~~~~C~--~~~C~----~~C~~~-~~~~~C~C~~G~~~~~~~C~di~eC~~~~~C-- 425 (482)
.|.-| +|+...|. .+ .|. ++.|. .+|+.. .+.|.|.|.+||..++..|.+++||. +..|
T Consensus 760 eC~~gy~F~dd~~tCV~i~~pap~n-~Ce~g~h~C~i~g~a~c~~hGgs~y~C~CLPGfsGDG~~c~dvDeC~-psrChp 837 (1289)
T KOG1214|consen 760 ECRSGYEFADDRHTCVLITPPAPAN-PCEDGSHTCAIAGQARCVHHGGSTYSCACLPGFSGDGHQCTDVDECS-PSRCHP 837 (1289)
T ss_pred EEeecceeccCCcceEEecCCCCCC-ccccCccccCcCCceEEEecCCceEEEeecCCccCCccccccccccC-ccccCC
Confidence 77776 46655576 23 565 34442 145543 34799999999988999999999998 7788
Q ss_pred CceeeecCCceEEecCCCeEECCCCCcccc
Q psy14262 426 DQTCTNTHLSYVCSCTEGYTLRPDGKSCTA 455 (482)
Q Consensus 426 s~~C~n~~gsy~C~C~~Gy~l~~d~~tC~d 455 (482)
..+|+|++|+|.|.|.+||. .|+..|..
T Consensus 838 ~A~CyntpgsfsC~C~pGy~--GDGf~CVP 865 (1289)
T KOG1214|consen 838 AATCYNTPGSFSCRCQPGYY--GDGFQCVP 865 (1289)
T ss_pred CceEecCCCcceeecccCcc--CCCceecC
Confidence 45999999999999999998 45777764
No 10
>PF12999 PRKCSH-like: Glucosidase II beta subunit-like
Probab=98.43 E-value=2.2e-07 Score=84.76 Aligned_cols=68 Identities=35% Similarity=0.688 Sum_probs=60.1
Q ss_pred CCCeecCCCCce-ecCCcccCCCCCCCCCCCCCCCCCCCCCCCceecCCC----CeeccCCCCCCCcC---CCCCCCCC
Q psy14262 167 PNELRCGSSNQC-VALSKFCDGKPDCSNGEDEGPFCNAECTPGQFSCGMG----KCIPESWKCDGFVD---CLEDGSDE 237 (482)
Q Consensus 167 ~~~f~C~~~~~C-i~~~~~CDg~~dC~dgsDE~~~C~~~C~~~~f~C~~g----~Ci~~~~~CDg~~d---C~~dgsDE 237 (482)
.+.|+|.++.+= |+.+.+.|++-||+|||||.++ ..|+...|.|.|. .-||.+++=||+-| | -|||||
T Consensus 35 ~~~f~Cl~~~~~~I~~~~iNDdyCDC~DGSDEPGT--sAC~~~~FyC~N~g~~p~~i~~s~VnDGICDy~~C-CDGSDE 110 (176)
T PF12999_consen 35 NGKFTCLDGSKIVIPFSQINDDYCDCPDGSDEPGT--SACSNGKFYCENKGHIPRYIPSSRVNDGICDYDIC-CDGSDE 110 (176)
T ss_pred CCceEecCCCCceecHHHccCcceeCCCCCCcccc--ccCcCceEeeccCCCCCceeehhhhcCCcCccccc-CCCCCC
Confidence 357999988666 9999999999999999999872 3688789999875 67899999999999 9 999999
No 11
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=98.40 E-value=2.8e-07 Score=62.18 Aligned_cols=33 Identities=48% Similarity=1.095 Sum_probs=28.8
Q ss_pred CCCCCCceeeecCCceEEecCCCeEECCCCCcc
Q psy14262 421 IDGSCDQTCTNTHLSYVCSCTEGYTLRPDGKSC 453 (482)
Q Consensus 421 ~~~~Cs~~C~n~~gsy~C~C~~Gy~l~~d~~tC 453 (482)
.++.|+|+|++++++|+|.|++||+|..|+++|
T Consensus 4 ~NGgC~h~C~~~~g~~~C~C~~Gy~L~~D~~tC 36 (36)
T PF14670_consen 4 NNGGCSHICVNTPGSYRCSCPPGYKLAEDGRTC 36 (36)
T ss_dssp GGGGSSSEEEEETTSEEEE-STTEEE-TTSSSE
T ss_pred CCCCcCCCCccCCCceEeECCCCCEECcCCCCC
Confidence 358899999999999999999999999999987
No 12
>KOG1219|consensus
Probab=98.18 E-value=1.8e-06 Score=100.77 Aligned_cols=105 Identities=28% Similarity=0.726 Sum_probs=86.5
Q ss_pred cccccCCCcccCCCCccccc---ccccCCCCCCCCCCCCCCCCCCC-CCccCCCCcC--CCCCCCCCccccCCCCccCCC
Q psy14262 337 AVLCHPNELRCGSSNQCVAL---SKFCDGKPDCSNGEDEGPFCNNK-ADCENLKCSH--KCSPTPRGAMCFCPAGQNPNK 410 (482)
Q Consensus 337 ~~~C~~~~f~C~~g~~Ci~~---~~~CDg~~dC~dgsDE~~~C~~~-~~C~~~~C~~--~C~~~~~~~~C~C~~G~~~~~ 410 (482)
+..|+ ..+|.+|+.|++. .|.| .|+..+-+.+ |.+. .+|.++||.. +|....++|.|.|+.|| .|
T Consensus 3864 ~d~C~--~npCqhgG~C~~~~~ggy~C----kCpsqysG~~-CEi~~epC~snPC~~GgtCip~~n~f~CnC~~gy--TG 3934 (4289)
T KOG1219|consen 3864 TDPCN--DNPCQHGGTCISQPKGGYKC----KCPSQYSGNH-CEIDLEPCASNPCLTGGTCIPFYNGFLCNCPNGY--TG 3934 (4289)
T ss_pred ccccc--cCcccCCCEecCCCCCceEE----eCcccccCcc-cccccccccCCCCCCCCEEEecCCCeeEeCCCCc--cC
Confidence 34563 4589999999986 7899 9999888888 8732 2788888875 89999999999999999 78
Q ss_pred CCcc--ccccccCCCCC--CceeeecCCceEEecCCCeEECCCCCcccc
Q psy14262 411 TQCV--DADECLIDGSC--DQTCTNTHLSYVCSCTEGYTLRPDGKSCTA 455 (482)
Q Consensus 411 ~~C~--di~eC~~~~~C--s~~C~n~~gsy~C~C~~Gy~l~~d~~tC~d 455 (482)
.+|+ .++||+ ...| .++|+|++|+|.|.|.+||. +++|.+
T Consensus 3935 ~~Ce~~Gi~eCs-~n~C~~gg~C~n~~gsf~CncT~g~~----gr~c~~ 3978 (4289)
T KOG1219|consen 3935 KRCEARGISECS-KNVCGTGGQCINIPGSFHCNCTPGIL----GRTCCA 3978 (4289)
T ss_pred ceeecccccccc-cccccCCceeeccCCceEeccChhHh----cccCcc
Confidence 8885 378998 5677 56999999999999999997 677643
No 13
>PF12999 PRKCSH-like: Glucosidase II beta subunit-like
Probab=98.09 E-value=3e-06 Score=77.46 Aligned_cols=68 Identities=44% Similarity=0.785 Sum_probs=58.4
Q ss_pred CCceecCCC-Ce-eccCCCCCCCcCCCCCCCCCCCcccccccCCccccCCCC---ceecCCCccCccCC---CCCCCCCC
Q psy14262 207 PGQFSCGMG-KC-IPESWKCDGFVDCLEDGSDEPNTCIHQCFNGQVQCALSK---KCIPHNWLCDEEMD---CSTDDIVD 278 (482)
Q Consensus 207 ~~~f~C~~g-~C-i~~~~~CDg~~dC~~dgsDE~~C~~~~C~~~~f~C~~~g---~Ci~~~~~CDg~~d---C~~~~~~d 278 (482)
.+.|.|.+| .= |+.+++.|++.|| +|||||.+- ..|..+.|+|.|.| .-||.++|=||+-| |= |
T Consensus 35 ~~~f~Cl~~~~~~I~~~~iNDdyCDC-~DGSDEPGT--sAC~~~~FyC~N~g~~p~~i~~s~VnDGICDy~~CC-----D 106 (176)
T PF12999_consen 35 NGKFTCLDGSKIVIPFSQINDDYCDC-PDGSDEPGT--SACSNGKFYCENKGHIPRYIPSSRVNDGICDYDICC-----D 106 (176)
T ss_pred CCceEecCCCCceecHHHccCcceeC-CCCCCcccc--ccCcCceEeeccCCCCCceeehhhhcCCcCcccccC-----C
Confidence 457999887 34 8999999999999 999999853 35777899999655 57889999999999 98 9
Q ss_pred CCCC
Q psy14262 279 NTDE 282 (482)
Q Consensus 279 gsDE 282 (482)
||||
T Consensus 107 GSDE 110 (176)
T PF12999_consen 107 GSDE 110 (176)
T ss_pred CCCC
Confidence 9999
No 14
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes []. +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=98.06 E-value=5.3e-06 Score=58.16 Aligned_cols=39 Identities=51% Similarity=1.016 Sum_probs=32.7
Q ss_pred cccccc-CCCCCC--ceeeecCCceEEecCCCeEECCCCCcc
Q psy14262 415 DADECL-IDGSCD--QTCTNTHLSYVCSCTEGYTLRPDGKSC 453 (482)
Q Consensus 415 di~eC~-~~~~Cs--~~C~n~~gsy~C~C~~Gy~l~~d~~tC 453 (482)
||+||. ....|. +.|+|+.|+|+|.|++||++...+.+|
T Consensus 1 DidEC~~~~~~C~~~~~C~N~~Gsy~C~C~~Gy~~~~~~~~C 42 (42)
T PF07645_consen 1 DIDECAEGPHNCPENGTCVNTEGSYSCSCPPGYELNDDGTTC 42 (42)
T ss_dssp ESSTTTTTSSSSSTTSEEEEETTEEEEEESTTEEECTTSSEE
T ss_pred CccccCCCCCcCCCCCEEEcCCCCEEeeCCCCcEECCCCCcC
Confidence 689998 457885 799999999999999999977666554
No 15
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=98.00 E-value=3.8e-06 Score=51.13 Aligned_cols=24 Identities=54% Similarity=1.266 Sum_probs=22.5
Q ss_pred ceEEecCCCeEECCCCCccccCCC
Q psy14262 435 SYVCSCTEGYTLRPDGKSCTAVNV 458 (482)
Q Consensus 435 sy~C~C~~Gy~l~~d~~tC~dide 458 (482)
||+|+|++||+|.+++++|.||||
T Consensus 1 sy~C~C~~Gy~l~~d~~~C~DIdE 24 (24)
T PF12662_consen 1 SYTCSCPPGYQLSPDGRSCEDIDE 24 (24)
T ss_pred CEEeeCCCCCcCCCCCCccccCCC
Confidence 699999999999999999999986
No 16
>KOG1219|consensus
Probab=97.38 E-value=0.00017 Score=85.21 Aligned_cols=102 Identities=23% Similarity=0.651 Sum_probs=81.4
Q ss_pred cccCcccccCCCCccCC---CCCCCCCCCCCCCCCcCCCCCCCcCCcCccCcccccCCCcccCCCCccccc--ccccCCC
Q psy14262 289 QCFNGQVQCALSKKCIP---HNWLCDEEMDCSTDDIVDNTDEDWDKYFSHNAVLCHPNELRCGSSNQCVAL--SKFCDGK 363 (482)
Q Consensus 289 ~C~~~~f~C~~g~~Ci~---~~~~CDg~~DC~~g~~~dgsDE~~~~~C~~~~~~C~~~~f~C~~g~~Ci~~--~~~CDg~ 363 (482)
.|.. ..|.+||+|++ ..|.| .|+ .-....+ |+.....|.++ +|.+|++||+. .++|
T Consensus 3866 ~C~~--npCqhgG~C~~~~~ggy~C----kCp-----sqysG~~---CEi~~epC~sn--PC~~GgtCip~~n~f~C--- 3926 (4289)
T KOG1219|consen 3866 PCND--NPCQHGGTCISQPKGGYKC----KCP-----SQYSGNH---CEIDLEPCASN--PCLTGGTCIPFYNGFLC--- 3926 (4289)
T ss_pred cccc--CcccCCCEecCCCCCceEE----eCc-----ccccCcc---cccccccccCC--CCCCCCEEEecCCCeeE---
Confidence 4554 35989999994 78999 898 5555566 76656678654 89999999986 8899
Q ss_pred CCCCCCCCCCCCCC---CCCCccCCCCcC--CCCCCCCCccccCCCCccCCCCCcc
Q psy14262 364 PDCSNGEDEGPFCN---NKADCENLKCSH--KCSPTPRGAMCFCPAGQNPNKTQCV 414 (482)
Q Consensus 364 ~dC~dgsDE~~~C~---~~~~C~~~~C~~--~C~~~~~~~~C~C~~G~~~~~~~C~ 414 (482)
.|+.|.-+.. |. ++ +|+.+.|.+ .|.|.+++|.|.|.+|| -+++|.
T Consensus 3927 -nC~~gyTG~~-Ce~~Gi~-eCs~n~C~~gg~C~n~~gsf~CncT~g~--~gr~c~ 3977 (4289)
T KOG1219|consen 3927 -NCPNGYTGKR-CEARGIS-ECSKNVCGTGGQCINIPGSFHCNCTPGI--LGRTCC 3977 (4289)
T ss_pred -eCCCCccCce-eeccccc-ccccccccCCceeeccCCceEeccChhH--hcccCc
Confidence 9999998887 87 34 798888876 89999999999999999 555663
No 17
>KOG1214|consensus
Probab=97.28 E-value=0.00049 Score=75.01 Aligned_cols=112 Identities=27% Similarity=0.676 Sum_probs=76.1
Q ss_pred CcccCCCCccccc---ccccCCCCCCCCCC--CCCCCCCCCCCcc--CCCCcC--CCCCCCCCccccCCCCc--cCCCCC
Q psy14262 344 ELRCGSSNQCVAL---SKFCDGKPDCSNGE--DEGPFCNNKADCE--NLKCSH--KCSPTPRGAMCFCPAGQ--NPNKTQ 412 (482)
Q Consensus 344 ~f~C~~g~~Ci~~---~~~CDg~~dC~dgs--DE~~~C~~~~~C~--~~~C~~--~C~~~~~~~~C~C~~G~--~~~~~~ 412 (482)
...|..+..|.+. .|.| .|.-|+ |... |...++|. .+.|.+ .|++.+++|+|.|..|| ..++.+
T Consensus 699 sh~cdt~a~C~pg~~~~~tc----ecs~g~~gdgr~-c~d~~eca~~~~~CGp~s~Cin~pg~~rceC~~gy~F~dd~~t 773 (1289)
T KOG1214|consen 699 SHMCDTTARCHPGTGVDYTC----ECSSGYQGDGRN-CVDENECATGFHRCGPNSVCINLPGSYRCECRSGYEFADDRHT 773 (1289)
T ss_pred CcccCCCccccCCCCcceEE----EEeeccCCCCCC-CCChhhhccCCCCCCCCceeecCCCceeEEEeecceeccCCcc
Confidence 3344444445443 3444 555553 4445 76444777 566754 89999999999999999 667788
Q ss_pred ccc------ccccc-CCCCC----CceeeecCC-ceEEecCCCeEECCCCCccccCCCCCCC
Q psy14262 413 CVD------ADECL-IDGSC----DQTCTNTHL-SYVCSCTEGYTLRPDGKSCTAVNVPQDE 462 (482)
Q Consensus 413 C~d------i~eC~-~~~~C----s~~C~n~~g-sy~C~C~~Gy~l~~d~~tC~didec~~~ 462 (482)
|.- ++.|. ..+.| ...|+...| .|.|.|.+||. .|+..|.++|||+..
T Consensus 774 CV~i~~pap~n~Ce~g~h~C~i~g~a~c~~hGgs~y~C~CLPGfs--GDG~~c~dvDeC~ps 833 (1289)
T KOG1214|consen 774 CVLITPPAPANPCEDGSHTCAIAGQARCVHHGGSTYSCACLPGFS--GDGHQCTDVDECSPS 833 (1289)
T ss_pred eEEecCCCCCCccccCccccCcCCceEEEecCCceEEEeecCCcc--CCccccccccccCcc
Confidence 864 34565 33555 125666654 59999999998 579999999999854
No 18
>KOG4260|consensus
Probab=97.17 E-value=0.00082 Score=64.77 Aligned_cols=46 Identities=37% Similarity=0.733 Sum_probs=31.3
Q ss_pred cCCCCccCCCCCcccccccc-CCCCC--CceeeecCCceEEecCCCeEE
Q psy14262 401 FCPAGQNPNKTQCVDADECL-IDGSC--DQTCTNTHLSYVCSCTEGYTL 446 (482)
Q Consensus 401 ~C~~G~~~~~~~C~di~eC~-~~~~C--s~~C~n~~gsy~C~C~~Gy~l 446 (482)
.|..||..+...|.||+||. .+.+| .|+|+|+.|||+|.+.+||+.
T Consensus 221 kCkkGW~lde~gCvDvnEC~~ep~~c~~~qfCvNteGSf~C~dk~Gy~~ 269 (350)
T KOG4260|consen 221 KCKKGWKLDEEGCVDVNECQNEPAPCKAHQFCVNTEGSFKCEDKEGYKK 269 (350)
T ss_pred hhcccceecccccccHHHHhcCCCCCChhheeecCCCceEecccccccC
Confidence 46777744455677777776 44555 467777777777777777764
No 19
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=96.98 E-value=0.00018 Score=48.57 Aligned_cols=30 Identities=40% Similarity=0.841 Sum_probs=24.8
Q ss_pred CCCCcCCCCCCCCCccccCCCCc--cCCCCCc
Q psy14262 384 NLKCSHKCSPTPRGAMCFCPAGQ--NPNKTQC 413 (482)
Q Consensus 384 ~~~C~~~C~~~~~~~~C~C~~G~--~~~~~~C 413 (482)
+++|+|.|++++++|+|.|++|| .+++++|
T Consensus 5 NGgC~h~C~~~~g~~~C~C~~Gy~L~~D~~tC 36 (36)
T PF14670_consen 5 NGGCSHICVNTPGSYRCSCPPGYKLAEDGRTC 36 (36)
T ss_dssp GGGSSSEEEEETTSEEEE-STTEEE-TTSSSE
T ss_pred CCCcCCCCccCCCceEeECCCCCEECcCCCCC
Confidence 57899999999999999999999 6677765
No 20
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=96.78 E-value=0.00079 Score=41.03 Aligned_cols=22 Identities=41% Similarity=0.939 Sum_probs=19.7
Q ss_pred CccccCCCCc--cCCCCCcccccc
Q psy14262 397 GAMCFCPAGQ--NPNKTQCVDADE 418 (482)
Q Consensus 397 ~~~C~C~~G~--~~~~~~C~di~e 418 (482)
+|+|.|++|| .+++++|.||+|
T Consensus 1 sy~C~C~~Gy~l~~d~~~C~DIdE 24 (24)
T PF12662_consen 1 SYTCSCPPGYQLSPDGRSCEDIDE 24 (24)
T ss_pred CEEeeCCCCCcCCCCCCccccCCC
Confidence 5899999999 578999999986
No 21
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=96.20 E-value=0.0039 Score=59.97 Aligned_cols=44 Identities=43% Similarity=0.872 Sum_probs=38.3
Q ss_pred CCCCcccccccc-CCCCCCceeeecCCceEEecCCCeEECCCCCc
Q psy14262 409 NKTQCVDADECL-IDGSCDQTCTNTHLSYVCSCTEGYTLRPDGKS 452 (482)
Q Consensus 409 ~~~~C~di~eC~-~~~~Cs~~C~n~~gsy~C~C~~Gy~l~~d~~t 452 (482)
.+..|.++++|. .++.|.+.|.+++|+|.|.|++||.|..++++
T Consensus 180 ~~~~C~~~~~C~~~~~~c~~~C~~~~g~~~c~c~~g~~~~~~~~~ 224 (224)
T cd01475 180 QGKICVVPDLCATLSHVCQQVCISTPGSYLCACTEGYALLEDNKT 224 (224)
T ss_pred ccccCcCchhhcCCCCCccceEEcCCCCEEeECCCCccCCCCCCC
Confidence 567788899997 56789999999999999999999999877754
No 22
>KOG2397|consensus
Probab=96.13 E-value=0.0055 Score=63.89 Aligned_cols=69 Identities=36% Similarity=0.649 Sum_probs=60.2
Q ss_pred CCeecCCCCceecCCcccCCCCCCCCCCCCCCCCCCCCCCCceecCCC----CeeccCCCCCCCcCCCCCCCCCCC
Q psy14262 168 NELRCGSSNQCVALSKFCDGKPDCSNGEDEGPFCNAECTPGQFSCGMG----KCIPESWKCDGFVDCLEDGSDEPN 239 (482)
Q Consensus 168 ~~f~C~~~~~Ci~~~~~CDg~~dC~dgsDE~~~C~~~C~~~~f~C~~g----~Ci~~~~~CDg~~dC~~dgsDE~~ 239 (482)
..|+|.++..=|+.+.+=|..-||.||+||..+ ..|+.+.|.|.+. .=|+.+.+=||+.|| -|||||..
T Consensus 43 ~~~~CLdgs~~i~f~qlNDd~CDC~DGsDEPGt--sACpngkF~C~N~G~~p~~i~ssrV~DGICDC-CDgSDE~~ 115 (480)
T KOG2397|consen 43 SMFKCLDGSKTISFSQLNDDSCDCLDGSDEPGT--SACPNGKFYCVNQGHQPKYIPSSRVNDGICDC-CDGSDEYL 115 (480)
T ss_pred cceeeccCCcccCHHHhccccccCCCCCCCCcc--ccCCCCceeeeecCCCceeeechhccCccccc-ccCCCCcc
Confidence 479999988889988888999999999999762 4688999999764 678889999999999 99999974
No 23
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=95.94 E-value=0.009 Score=40.26 Aligned_cols=35 Identities=51% Similarity=1.123 Sum_probs=25.7
Q ss_pred cccccCCCCCCc--eeeecCCceEEecCCCeEECCCCCcc
Q psy14262 416 ADECLIDGSCDQ--TCTNTHLSYVCSCTEGYTLRPDGKSC 453 (482)
Q Consensus 416 i~eC~~~~~Cs~--~C~n~~gsy~C~C~~Gy~l~~d~~tC 453 (482)
+++|....+|.+ .|+++.++|.|.|++||. +++.|
T Consensus 2 ~~~C~~~~~C~~~~~C~~~~g~~~C~C~~g~~---~g~~C 38 (39)
T smart00179 2 IDECASGNPCQNGGTCVNTVGSYRCECPPGYT---DGRNC 38 (39)
T ss_pred cccCcCCCCcCCCCEeECCCCCeEeECCCCCc---cCCcC
Confidence 566752256754 899999999999999987 35554
No 24
>KOG1217|consensus
Probab=95.88 E-value=0.21 Score=52.56 Aligned_cols=116 Identities=28% Similarity=0.606 Sum_probs=79.8
Q ss_pred ccccCCCcccCCCCccccc--ccccCCCCCCCCCCCCCCCC---CCCCCcc----CCCCcC--CC--CCCCCCccccCCC
Q psy14262 338 VLCHPNELRCGSSNQCVAL--SKFCDGKPDCSNGEDEGPFC---NNKADCE----NLKCSH--KC--SPTPRGAMCFCPA 404 (482)
Q Consensus 338 ~~C~~~~f~C~~g~~Ci~~--~~~CDg~~dC~dgsDE~~~C---~~~~~C~----~~~C~~--~C--~~~~~~~~C~C~~ 404 (482)
..|..... |.+++.|+.. .++| .|+.|..... | ....+|. ...|.. .| ......+.|.|..
T Consensus 272 ~~C~~~~~-c~~~~~C~~~~~~~~C----~C~~g~~g~~-~~~~~~~~~C~~~~~~~~c~~g~~C~~~~~~~~~~C~c~~ 345 (487)
T KOG1217|consen 272 DSCALIAS-CPNGGTCVNVPGSYRC----TCPPGFTGRL-CTECVDVDECSPRNAGGPCANGGTCNTLGSFGGFRCACGP 345 (487)
T ss_pred cccCCCCc-cCCCCeeecCCCccee----eCCCCCCCCC-CccccccccccccccCCcCCCCcccccCCCCCCCCcCCCC
Confidence 34555444 8777789887 3788 6998887766 4 1112553 333443 45 3344578899999
Q ss_pred CccCCCCCccccc-cccCC-CCCCceeee-cCCceEEecCCCeEEC--CCCCccccCCCCCC
Q psy14262 405 GQNPNKTQCVDAD-ECLID-GSCDQTCTN-THLSYVCSCTEGYTLR--PDGKSCTAVNVPQD 461 (482)
Q Consensus 405 G~~~~~~~C~di~-eC~~~-~~Cs~~C~n-~~gsy~C~C~~Gy~l~--~d~~tC~didec~~ 461 (482)
|+ .+..|+... +|... ......|++ +.++|.|.|+.+|.+. .+...+.++++|..
T Consensus 346 ~~--~g~~C~~~~~~C~~~~~~~~~~c~~~~~~~~~c~~~~~~~~~~~~~~~~~~~~~~c~~ 405 (487)
T KOG1217|consen 346 GF--TGRRCEDSNDECASSPCCPGGTCVNETPGSYRCACPAGFAGKANGDGVGCEDIDECSG 405 (487)
T ss_pred CC--CCCccccCCccccCCccccCCEeccCCCCCeEecCCCccccCCccccccccccccccC
Confidence 96 778898774 88732 223678999 7999999999999863 57788888888753
No 25
>KOG4289|consensus
Probab=95.40 E-value=0.02 Score=66.15 Aligned_cols=88 Identities=25% Similarity=0.643 Sum_probs=65.5
Q ss_pred ccccCCCCCCCCCCCCCCCCC-CCCCccCCCCcC--CCCCCCCCccccCCCCccCCCCCcccc---ccccCCCCC--Cce
Q psy14262 357 SKFCDGKPDCSNGEDEGPFCN-NKADCENLKCSH--KCSPTPRGAMCFCPAGQNPNKTQCVDA---DECLIDGSC--DQT 428 (482)
Q Consensus 357 ~~~CDg~~dC~dgsDE~~~C~-~~~~C~~~~C~~--~C~~~~~~~~C~C~~G~~~~~~~C~di---~eC~~~~~C--s~~ 428 (482)
.++| .|+.|..... |. ..++|-+++|.+ +|....++|+|.|.+|| .+..|+-. -.|. ++.| ..+
T Consensus 1221 glrC----rCPpGFTgd~-CeTeiDlCYs~pC~nng~C~srEggYtCeCrpg~--tGehCEvs~~agrCv-pGvC~nggt 1292 (2531)
T KOG4289|consen 1221 GLRC----RCPPGFTGDY-CETEIDLCYSGPCGNNGRCRSREGGYTCECRPGF--TGEHCEVSARAGRCV-PGVCKNGGT 1292 (2531)
T ss_pred ceeE----eCCCCCCccc-ccchhHhhhcCCCCCCCceEEecCceeEEecCCc--cccceeeecccCccc-cceecCCCE
Confidence 4677 8999987776 87 223788889976 89988999999999999 77778532 2243 6788 468
Q ss_pred eeecC-CceEEecCCCeEECCCCCcccc
Q psy14262 429 CTNTH-LSYVCSCTEGYTLRPDGKSCTA 455 (482)
Q Consensus 429 C~n~~-gsy~C~C~~Gy~l~~d~~tC~d 455 (482)
|++.+ |+|.|.|+.| . .....|+-
T Consensus 1293 C~~~~nggf~c~Cp~g-e--~e~prC~v 1317 (2531)
T KOG4289|consen 1293 CVNLLNGGFCCHCPYG-E--FEDPRCEV 1317 (2531)
T ss_pred EeecCCCceeccCCCc-c--cCCCceEE
Confidence 98875 7799999999 3 23555653
No 26
>KOG1217|consensus
Probab=95.27 E-value=0.15 Score=53.78 Aligned_cols=150 Identities=26% Similarity=0.599 Sum_probs=86.8
Q ss_pred CCCCCCCCCC--ccccCcccccCCCCccCC--CCCCCCCCCCCCCCCcCCCCCCCcCCcCccCcccccCCCcccCCCCcc
Q psy14262 278 DNTDEDWDKF--HQCFNGQVQCALSKKCIP--HNWLCDEEMDCSTDDIVDNTDEDWDKYFSHNAVLCHPNELRCGSSNQC 353 (482)
Q Consensus 278 dgsDE~~C~~--~~C~~~~f~C~~g~~Ci~--~~~~CDg~~DC~~g~~~dgsDE~~~~~C~~~~~~C~~~~f~C~~g~~C 353 (482)
.+.....+.. ..|......|.+++.|.. ..|.| .|. .+.-... |... .++..|
T Consensus 158 ~g~~~~~~~~~~~~C~~~~~~c~~~~~C~~~~~~~~C----~c~-----~~~~~~~---~~~~-----------~~~~~c 214 (487)
T KOG1217|consen 158 EGYEGEPCETDLDECIQYSSPCQNGGTCVNTGGSYLC----SCP-----PGYTGST---CETT-----------GNGGTC 214 (487)
T ss_pred CCcccccccccccccccCCCCcCCCcccccCCCCeeE----eCC-----CCccCCc---CcCC-----------CCCceE
Confidence 5555544442 356655556776667773 44667 566 4433333 3211 222344
Q ss_pred cccccccCCCCCCCCCCCCCCCCCCC-CCccCCCCcCCCCCCCCCccccCCCCccCCC-CCccccccccCCCCC--Ccee
Q psy14262 354 VALSKFCDGKPDCSNGEDEGPFCNNK-ADCENLKCSHKCSPTPRGAMCFCPAGQNPNK-TQCVDADECLIDGSC--DQTC 429 (482)
Q Consensus 354 i~~~~~CDg~~dC~dgsDE~~~C~~~-~~C~~~~C~~~C~~~~~~~~C~C~~G~~~~~-~~C~di~eC~~~~~C--s~~C 429 (482)
+.. +.| .+..+..... |... .+|.... .+|.+..++|.|.|++||.... ..|.++++|.....| .++|
T Consensus 215 ~~~-~~~----~~~~g~~~~~-c~~~~~~~~~~~--~~c~~~~~~~~C~~~~g~~~~~~~~~~~~~~C~~~~~c~~~~~C 286 (487)
T KOG1217|consen 215 VDS-VAC----SCPPGARGPE-CEVSIVECASGD--GTCVNTVGSYTCRCPEGYTGDACVTCVDVDSCALIASCPNGGTC 286 (487)
T ss_pred ecc-eec----cCCCCCCCCC-cccccccccCCC--CcccccCCceeeeCCCCccccccceeeeccccCCCCccCCCCee
Confidence 443 222 3444443322 3321 1232221 4788888899999999994344 467889999843236 4699
Q ss_pred eecCCceEEecCCCeEECCCCCccccCCCC
Q psy14262 430 TNTHLSYVCSCTEGYTLRPDGKSCTAVNVP 459 (482)
Q Consensus 430 ~n~~gsy~C~C~~Gy~l~~d~~tC~didec 459 (482)
++..+.|.|.|++||..... ..+.++.+|
T Consensus 287 ~~~~~~~~C~C~~g~~g~~~-~~~~~~~~C 315 (487)
T KOG1217|consen 287 VNVPGSYRCTCPPGFTGRLC-TECVDVDEC 315 (487)
T ss_pred ecCCCcceeeCCCCCCCCCC-ccccccccc
Confidence 99999999999999986544 334455555
No 27
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes []. +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=95.01 E-value=0.0073 Score=42.16 Aligned_cols=26 Identities=31% Similarity=0.810 Sum_probs=21.5
Q ss_pred Ccc--CCCCc--CCCCCCCCCccccCCCCc
Q psy14262 381 DCE--NLKCS--HKCSPTPRGAMCFCPAGQ 406 (482)
Q Consensus 381 ~C~--~~~C~--~~C~~~~~~~~C~C~~G~ 406 (482)
||. .+.|. ..|+|+.|+|.|.|++||
T Consensus 4 EC~~~~~~C~~~~~C~N~~Gsy~C~C~~Gy 33 (42)
T PF07645_consen 4 ECAEGPHNCPENGTCVNTEGSYSCSCPPGY 33 (42)
T ss_dssp TTTTTSSSSSTTSEEEEETTEEEEEESTTE
T ss_pred ccCCCCCcCCCCCEEEcCCCCEEeeCCCCc
Confidence 665 34565 489999999999999999
No 28
>KOG4260|consensus
Probab=94.84 E-value=0.06 Score=52.22 Aligned_cols=76 Identities=28% Similarity=0.621 Sum_probs=56.5
Q ss_pred CCCCCC--CCCCCCCCCCCcc--CCCCcC--CCCCCCCCccccCCCCccCCCCCcccccccc-CCCCC---CceeeecCC
Q psy14262 365 DCSNGE--DEGPFCNNKADCE--NLKCSH--KCSPTPRGAMCFCPAGQNPNKTQCVDADECL-IDGSC---DQTCTNTHL 434 (482)
Q Consensus 365 dC~dgs--DE~~~C~~~~~C~--~~~C~~--~C~~~~~~~~C~C~~G~~~~~~~C~di~eC~-~~~~C---s~~C~n~~g 434 (482)
.|..|. ||.. |...+||. ..+|.+ .|+|+.++|.|.+.+||.. ++++|. ....| ...|+|+.+
T Consensus 221 kCkkGW~lde~g-CvDvnEC~~ep~~c~~~qfCvNteGSf~C~dk~Gy~~------g~d~C~~~~d~~~~kn~~c~ni~~ 293 (350)
T KOG4260|consen 221 KCKKGWKLDEEG-CVDVNECQNEPAPCKAHQFCVNTEGSFKCEDKEGYKK------GVDECQFCADVCASKNRPCMNIDG 293 (350)
T ss_pred hhcccceecccc-cccHHHHhcCCCCCChhheeecCCCceEecccccccC------ChHHhhhhhhhcccCCCCcccCCc
Confidence 455563 7777 88545897 566753 8999999999999999932 256665 22444 357999999
Q ss_pred ceEEecCCCeEEC
Q psy14262 435 SYVCSCTEGYTLR 447 (482)
Q Consensus 435 sy~C~C~~Gy~l~ 447 (482)
+|+|.|+.|+...
T Consensus 294 ~~r~v~f~~~~~~ 306 (350)
T KOG4260|consen 294 QYRCVCFSGLIII 306 (350)
T ss_pred cEEEEecccceee
Confidence 9999999998643
No 29
>KOG3509|consensus
Probab=94.30 E-value=0.17 Score=57.75 Aligned_cols=104 Identities=29% Similarity=0.628 Sum_probs=86.0
Q ss_pred ceecCCCccCccCCCCCCCCCCCCCCCCCC--CccccCcccccCCCCccCCCCCCCCCCCCCCCCCcCCCCCCCcCCcCc
Q psy14262 257 KCIPHNWLCDEEMDCSTDDIVDNTDEDWDK--FHQCFNGQVQCALSKKCIPHNWLCDEEMDCSTDDIVDNTDEDWDKYFS 334 (482)
Q Consensus 257 ~Ci~~~~~CDg~~dC~~~~~~dgsDE~~C~--~~~C~~~~f~C~~g~~Ci~~~~~CDg~~DC~~g~~~dgsDE~~~~~C~ 334 (482)
.|....+.|++..++. +.+|+.+.. ...+.+++|.|.++ ++.-..+.+|....+. +++.+.+ |.
T Consensus 2 ~c~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~p~~~~~~~~-~~~~~~~~~~~~~~~~-----~~~~~~~---~~ 67 (964)
T KOG3509|consen 2 ECVKNRYACDRQPDCR-----DRSDVANDPAIGSACSPNEFKCNNP-RCVQPEALLDADSTCG-----PNSTPSG---CN 67 (964)
T ss_pred chhhhhhhhccchhhH-----hhcccCCCccccccCCcchhccCCc-cccCchhhhccccccC-----CCCCcCC---cc
Confidence 4666778899999998 999997765 34688899999887 9999999999999999 7776665 43
Q ss_pred c--CcccccCCCcccCCCCcccccccccCCCCCCCCCCCCCC
Q psy14262 335 H--NAVLCHPNELRCGSSNQCVALSKFCDGKPDCSNGEDEGP 374 (482)
Q Consensus 335 ~--~~~~C~~~~f~C~~g~~Ci~~~~~CDg~~dC~dgsDE~~ 374 (482)
. .+..|.+..+.|.+-.++...+..|+|..+|.++++|..
T Consensus 68 ~~~~~s~~~~~~~~c~~~~~~~~~~~~~~g~~~~~~~~~~~~ 109 (964)
T KOG3509|consen 68 AKPSASDCKPTETQCRDRLRCNPQSFQCDGTNDCKDGSDEVG 109 (964)
T ss_pred ccccccccCCcccccccchhcCCccccccCCCCCCccchhcc
Confidence 2 245677888999887678888999999999999999977
No 30
>KOG2397|consensus
Probab=93.84 E-value=0.056 Score=56.56 Aligned_cols=68 Identities=43% Similarity=0.638 Sum_probs=56.2
Q ss_pred ceecCCC-CeeccCCCCCCCcCCCCCCCCCCCcccccccCCccccCCCC---ceecCCCccCccCCCCCCCCCCCCCCCC
Q psy14262 209 QFSCGMG-KCIPESWKCDGFVDCLEDGSDEPNTCIHQCFNGQVQCALSK---KCIPHNWLCDEEMDCSTDDIVDNTDEDW 284 (482)
Q Consensus 209 ~f~C~~g-~Ci~~~~~CDg~~dC~~dgsDE~~C~~~~C~~~~f~C~~~g---~Ci~~~~~CDg~~dC~~~~~~dgsDE~~ 284 (482)
.|.|.+| .=|+.+++-|...|| .||+||..= ..|..+.|+|.|-| .=|+.+.+=||+.||- |||||..
T Consensus 44 ~~~CLdgs~~i~f~qlNDd~CDC-~DGsDEPGt--sACpngkF~C~N~G~~p~~i~ssrV~DGICDCC-----DgSDE~~ 115 (480)
T KOG2397|consen 44 MFKCLDGSKTISFSQLNDDSCDC-LDGSDEPGT--SACPNGKFYCVNQGHQPKYIPSSRVNDGICDCC-----DGSDEYL 115 (480)
T ss_pred ceeeccCCcccCHHHhccccccC-CCCCCCCcc--ccCCCCceeeeecCCCceeeechhccCcccccc-----cCCCCcc
Confidence 5788877 678888899999999 999999752 35888999998543 4688888999999999 9999953
No 31
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=93.42 E-value=0.099 Score=34.51 Aligned_cols=30 Identities=53% Similarity=1.011 Sum_probs=22.6
Q ss_pred cccccCCCCC--CceeeecCCceEEecCCCeE
Q psy14262 416 ADECLIDGSC--DQTCTNTHLSYVCSCTEGYT 445 (482)
Q Consensus 416 i~eC~~~~~C--s~~C~n~~gsy~C~C~~Gy~ 445 (482)
+++|....+| .+.|+++.++|+|.|++||.
T Consensus 2 ~~~C~~~~~C~~~~~C~~~~~~~~C~C~~g~~ 33 (38)
T cd00054 2 IDECASGNPCQNGGTCVNTVGSYRCSCPPGYT 33 (38)
T ss_pred cccCCCCCCcCCCCEeECCCCCeEeECCCCCc
Confidence 4556522467 46899999999999999987
No 32
>smart00181 EGF Epidermal growth factor-like domain.
Probab=93.42 E-value=0.094 Score=34.45 Aligned_cols=25 Identities=56% Similarity=1.077 Sum_probs=20.5
Q ss_pred CCCCc-eeeecCCceEEecCCCeEEC
Q psy14262 423 GSCDQ-TCTNTHLSYVCSCTEGYTLR 447 (482)
Q Consensus 423 ~~Cs~-~C~n~~gsy~C~C~~Gy~l~ 447 (482)
.+|.+ +|+++.++|.|.|++||.+.
T Consensus 6 ~~C~~~~C~~~~~~~~C~C~~g~~g~ 31 (35)
T smart00181 6 GPCSNGTCINTPGSYTCSCPPGYTGD 31 (35)
T ss_pred CCCCCCEEECCCCCeEeECCCCCccC
Confidence 46755 89999999999999999753
No 33
>KOG4289|consensus
Probab=92.95 E-value=0.087 Score=61.19 Aligned_cols=49 Identities=33% Similarity=0.880 Sum_probs=40.4
Q ss_pred CCCCccccCCCCccCCCCCcc-ccccccCCCCC--CceeeecCCceEEecCCCeE
Q psy14262 394 TPRGAMCFCPAGQNPNKTQCV-DADECLIDGSC--DQTCTNTHLSYVCSCTEGYT 445 (482)
Q Consensus 394 ~~~~~~C~C~~G~~~~~~~C~-di~eC~~~~~C--s~~C~n~~gsy~C~C~~Gy~ 445 (482)
...+++|.||+|| .+..|+ .+++|- .++| ...|....|+|.|.|.+||+
T Consensus 1218 pvnglrCrCPpGF--Tgd~CeTeiDlCY-s~pC~nng~C~srEggYtCeCrpg~t 1269 (2531)
T KOG4289|consen 1218 PVNGLRCRCPPGF--TGDYCETEIDLCY-SGPCGNNGRCRSREGGYTCECRPGFT 1269 (2531)
T ss_pred ccCceeEeCCCCC--CcccccchhHhhh-cCCCCCCCceEEecCceeEEecCCcc
Confidence 4568999999999 445674 578886 5788 46899999999999999997
No 34
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=92.94 E-value=0.099 Score=35.24 Aligned_cols=30 Identities=53% Similarity=1.114 Sum_probs=20.8
Q ss_pred CCCC--CceeeecCCceEEecCCCeEECCCCCcc
Q psy14262 422 DGSC--DQTCTNTHLSYVCSCTEGYTLRPDGKSC 453 (482)
Q Consensus 422 ~~~C--s~~C~n~~gsy~C~C~~Gy~l~~d~~tC 453 (482)
++.| ...|++++++|.|+|.+||.. +|..|
T Consensus 5 ~~~C~~nA~C~~~~~~~~C~C~~Gy~G--dG~~C 36 (36)
T PF12947_consen 5 NGGCHPNATCTNTGGSYTCTCKPGYEG--DGFFC 36 (36)
T ss_dssp GGGS-TTCEEEE-TTSEEEEE-CEEEC--CSTCE
T ss_pred CCCCCCCcEeecCCCCEEeECCCCCcc--CCcCC
Confidence 3556 358999999999999999984 35543
No 35
>KOG3509|consensus
Probab=92.91 E-value=0.098 Score=59.68 Aligned_cols=103 Identities=28% Similarity=0.593 Sum_probs=87.2
Q ss_pred ceecCCcccCCCCCCCCCCCCCCCCC---CCCCCCceecCCCCeeccCCCCCCCcCCCCCCCCCCCcc----cccccCCc
Q psy14262 177 QCVALSKFCDGKPDCSNGEDEGPFCN---AECTPGQFSCGMGKCIPESWKCDGFVDCLEDGSDEPNTC----IHQCFNGQ 249 (482)
Q Consensus 177 ~Ci~~~~~CDg~~dC~dgsDE~~~C~---~~C~~~~f~C~~g~Ci~~~~~CDg~~dC~~dgsDE~~C~----~~~C~~~~ 249 (482)
.|....+.|++..|+.+.+|+.+ .+ ..+++++|.|.++++.-..+.+|....+ ..++++.+|. +..|.+..
T Consensus 2 ~c~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~s~~~~~~ 79 (964)
T KOG3509|consen 2 ECVKNRYACDRQPDCRDRSDVAN-DPAIGSACSPNEFKCNNPRCVQPEALLDADSTC-GPNSTPSGCNAKPSASDCKPTE 79 (964)
T ss_pred chhhhhhhhccchhhHhhcccCC-CccccccCCcchhccCCccccCchhhhcccccc-CCCCCcCCccccccccccCCcc
Confidence 57778899999999999999987 54 6788999999999999999999999999 8888877763 23467778
Q ss_pred cccCCCCceecCCCccCccCCCCCCCCCCCCCCCCCC
Q psy14262 250 VQCALSKKCIPHNWLCDEEMDCSTDDIVDNTDEDWDK 286 (482)
Q Consensus 250 f~C~~~g~Ci~~~~~CDg~~dC~~~~~~dgsDE~~C~ 286 (482)
+.|.+--++.+.+..|+|.++|. ++++|..+.
T Consensus 80 ~~c~~~~~~~~~~~~~~g~~~~~-----~~~~~~~~~ 111 (964)
T KOG3509|consen 80 TQCRDRLRCNPQSFQCDGTNDCK-----DGSDEVGCK 111 (964)
T ss_pred cccccchhcCCccccccCCCCCC-----ccchhcccc
Confidence 88984447888999999999999 999886553
No 36
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=91.45 E-value=0.16 Score=33.89 Aligned_cols=32 Identities=28% Similarity=0.789 Sum_probs=23.9
Q ss_pred CccC-CCCcC--CCCCCCCCccccCCCCccCCCCCc
Q psy14262 381 DCEN-LKCSH--KCSPTPRGAMCFCPAGQNPNKTQC 413 (482)
Q Consensus 381 ~C~~-~~C~~--~C~~~~~~~~C~C~~G~~~~~~~C 413 (482)
+|.. .+|.+ +|+++.++|.|.|++||. .+..|
T Consensus 4 ~C~~~~~C~~~~~C~~~~g~~~C~C~~g~~-~g~~C 38 (39)
T smart00179 4 ECASGNPCQNGGTCVNTVGSYRCECPPGYT-DGRNC 38 (39)
T ss_pred cCcCCCCcCCCCEeECCCCCeEeECCCCCc-cCCcC
Confidence 5554 56665 799999999999999994 35544
No 37
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=89.25 E-value=0.46 Score=30.59 Aligned_cols=24 Identities=50% Similarity=0.953 Sum_probs=19.0
Q ss_pred CCCC--ceeeecCCceEEecCCCeEE
Q psy14262 423 GSCD--QTCTNTHLSYVCSCTEGYTL 446 (482)
Q Consensus 423 ~~Cs--~~C~n~~gsy~C~C~~Gy~l 446 (482)
..|. ++|+++.++|+|.|+.||..
T Consensus 6 ~~C~~~~~C~~~~~~~~C~C~~g~~g 31 (36)
T cd00053 6 NPCSNGGTCVNTPGSYRCVCPPGYTG 31 (36)
T ss_pred CCCCCCCEEecCCCCeEeECCCCCcc
Confidence 4553 68888888999999999874
No 38
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=89.23 E-value=0.087 Score=35.51 Aligned_cols=25 Identities=24% Similarity=0.553 Sum_probs=18.6
Q ss_pred CCCCCCCCCccccCCCCccCCCCCc
Q psy14262 389 HKCSPTPRGAMCFCPAGQNPNKTQC 413 (482)
Q Consensus 389 ~~C~~~~~~~~C~C~~G~~~~~~~C 413 (482)
+.|.+++++|.|.|++||..++..|
T Consensus 12 A~C~~~~~~~~C~C~~Gy~GdG~~C 36 (36)
T PF12947_consen 12 ATCTNTGGSYTCTCKPGYEGDGFFC 36 (36)
T ss_dssp CEEEE-TTSEEEEE-CEEECCSTCE
T ss_pred cEeecCCCCEEeECCCCCccCCcCC
Confidence 3788889999999999996666554
No 39
>smart00181 EGF Epidermal growth factor-like domain.
Probab=88.69 E-value=0.34 Score=31.71 Aligned_cols=22 Identities=41% Similarity=0.886 Sum_probs=18.9
Q ss_pred CCCcC-CCCCCCCCccccCCCCc
Q psy14262 385 LKCSH-KCSPTPRGAMCFCPAGQ 406 (482)
Q Consensus 385 ~~C~~-~C~~~~~~~~C~C~~G~ 406 (482)
.+|.+ +|+++.++|.|.|++||
T Consensus 6 ~~C~~~~C~~~~~~~~C~C~~g~ 28 (35)
T smart00181 6 GPCSNGTCINTPGSYTCSCPPGY 28 (35)
T ss_pred CCCCCCEEECCCCCeEeECCCCC
Confidence 45666 79888999999999999
No 40
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=84.59 E-value=1 Score=29.29 Aligned_cols=23 Identities=39% Similarity=0.998 Sum_probs=19.2
Q ss_pred CCCC--ceeeecC-CceEEecCCCeE
Q psy14262 423 GSCD--QTCTNTH-LSYVCSCTEGYT 445 (482)
Q Consensus 423 ~~Cs--~~C~n~~-gsy~C~C~~Gy~ 445 (482)
.+|. .+|++.. ++|.|.|++||+
T Consensus 4 ~~C~n~g~C~~~~~~~y~C~C~~G~~ 29 (32)
T PF00008_consen 4 NPCQNGGTCIDLPGGGYTCECPPGYT 29 (32)
T ss_dssp TSSTTTEEEEEESTSEEEEEEBTTEE
T ss_pred CcCCCCeEEEeCCCCCEEeECCCCCc
Confidence 3664 4899998 899999999997
No 41
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=84.16 E-value=0.23 Score=32.44 Aligned_cols=23 Identities=35% Similarity=0.873 Sum_probs=18.4
Q ss_pred CCCCcC--CCCCCC-CCccccCCCCc
Q psy14262 384 NLKCSH--KCSPTP-RGAMCFCPAGQ 406 (482)
Q Consensus 384 ~~~C~~--~C~~~~-~~~~C~C~~G~ 406 (482)
..+|++ +|++.. .+|+|.|++||
T Consensus 3 ~~~C~n~g~C~~~~~~~y~C~C~~G~ 28 (32)
T PF00008_consen 3 SNPCQNGGTCIDLPGGGYTCECPPGY 28 (32)
T ss_dssp TTSSTTTEEEEEESTSEEEEEEBTTE
T ss_pred CCcCCCCeEEEeCCCCCEEeECCCCC
Confidence 345655 788777 89999999999
No 42
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=81.98 E-value=1.2 Score=29.10 Aligned_cols=26 Identities=31% Similarity=0.778 Sum_probs=19.8
Q ss_pred CccC-CCCc--CCCCCCCCCccccCCCCc
Q psy14262 381 DCEN-LKCS--HKCSPTPRGAMCFCPAGQ 406 (482)
Q Consensus 381 ~C~~-~~C~--~~C~~~~~~~~C~C~~G~ 406 (482)
+|.. .+|. ..|.+..++|.|.|++||
T Consensus 4 ~C~~~~~C~~~~~C~~~~~~~~C~C~~g~ 32 (38)
T cd00054 4 ECASGNPCQNGGTCVNTVGSYRCSCPPGY 32 (38)
T ss_pred cCCCCCCcCCCCEeECCCCCeEeECCCCC
Confidence 4443 4564 378888999999999999
No 43
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=81.26 E-value=1.3 Score=28.41 Aligned_cols=18 Identities=39% Similarity=0.922 Sum_probs=16.0
Q ss_pred CCCCCCCCCccccCCCCc
Q psy14262 389 HKCSPTPRGAMCFCPAGQ 406 (482)
Q Consensus 389 ~~C~~~~~~~~C~C~~G~ 406 (482)
.+|++..++|.|.|+.||
T Consensus 12 ~~C~~~~~~~~C~C~~g~ 29 (36)
T cd00053 12 GTCVNTPGSYRCVCPPGY 29 (36)
T ss_pred CEEecCCCCeEeECCCCC
Confidence 478888889999999999
No 44
>KOG1225|consensus
Probab=80.21 E-value=7 Score=42.31 Aligned_cols=72 Identities=25% Similarity=0.495 Sum_probs=38.4
Q ss_pred eecCCCCeeccCCCCCCCcCCCCCCCCCCCcccccccCCccccCCCCceecCCCccCccCCCCCCCCCCCCCCCCCCCcc
Q psy14262 210 FSCGMGKCIPESWKCDGFVDCLEDGSDEPNTCIHQCFNGQVQCALSKKCIPHNWLCDEEMDCSTDDIVDNTDEDWDKFHQ 289 (482)
Q Consensus 210 f~C~~g~Ci~~~~~CDg~~dC~~dgsDE~~C~~~~C~~~~f~C~~~g~Ci~~~~~CDg~~dC~~~~~~dgsDE~~C~~~~ 289 (482)
++|..|+|| | ..|+...+|....|... |.+++.+++ |+-.|. ++.-...|....
T Consensus 260 g~c~~G~CI-----------C-~~Gf~G~dC~e~~Cp~~---cs~~g~~~~------g~CiC~-----~g~~G~dCs~~~ 313 (525)
T KOG1225|consen 260 GQCVEGRCI-----------C-PPGFTGDDCDELVCPVD---CSGGGVCVD------GECICN-----PGYSGKDCSIRR 313 (525)
T ss_pred ceEeCCeEe-----------C-CCCCcCCCCCcccCCcc---cCCCceecC------CEeecC-----CCcccccccccc
Confidence 566666666 3 55555555543334332 332344443 223376 666666676555
Q ss_pred ccCcccccCCCCccCCCCCCC
Q psy14262 290 CFNGQVQCALSKKCIPHNWLC 310 (482)
Q Consensus 290 C~~~~f~C~~g~~Ci~~~~~C 310 (482)
|.. .|..+++||+..-+|
T Consensus 314 cpa---dC~g~G~Ci~G~C~C 331 (525)
T KOG1225|consen 314 CPA---DCSGHGKCIDGECLC 331 (525)
T ss_pred CCc---cCCCCCcccCCceEe
Confidence 653 477777888544333
No 45
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=78.70 E-value=1.4 Score=42.20 Aligned_cols=23 Identities=30% Similarity=0.788 Sum_probs=20.4
Q ss_pred CCCCcCCCCCCCCCccccCCCCc
Q psy14262 384 NLKCSHKCSPTPRGAMCFCPAGQ 406 (482)
Q Consensus 384 ~~~C~~~C~~~~~~~~C~C~~G~ 406 (482)
++.|.+.|.+++++|.|.|++||
T Consensus 194 ~~~c~~~C~~~~g~~~c~c~~g~ 216 (224)
T cd01475 194 SHVCQQVCISTPGSYLCACTEGY 216 (224)
T ss_pred CCCccceEEcCCCCEEeECCCCc
Confidence 44677889999999999999999
No 46
>KOG1225|consensus
Probab=78.32 E-value=9.8 Score=41.22 Aligned_cols=62 Identities=29% Similarity=0.790 Sum_probs=35.1
Q ss_pred CCCCCCCCCCCCCCCCCccCCCCcC--CCCCCCCCccccCCCCccCCCCCccccccccCCCCCCceeeecCCceEEecCC
Q psy14262 365 DCSNGEDEGPFCNNKADCENLKCSH--KCSPTPRGAMCFCPAGQNPNKTQCVDADECLIDGSCDQTCTNTHLSYVCSCTE 442 (482)
Q Consensus 365 dC~dgsDE~~~C~~~~~C~~~~C~~--~C~~~~~~~~C~C~~G~~~~~~~C~di~eC~~~~~Cs~~C~n~~gsy~C~C~~ 442 (482)
.|.+|+-... |++- .|. ..|.. +|+ .-+|.|.+|| .+..|.... |... +.|+| | |.|.+
T Consensus 299 iC~~g~~G~d-Cs~~-~cp-adC~g~G~Ci----~G~C~C~~Gy--~G~~C~~~~-C~~~----g~cv~--g---C~C~~ 359 (525)
T KOG1225|consen 299 ICNPGYSGKD-CSIR-RCP-ADCSGHGKCI----DGECLCDEGY--TGELCIQRA-CSGG----GQCVN--G---CKCKK 359 (525)
T ss_pred ecCCCccccc-cccc-cCC-ccCCCCCccc----CCceEeCCCC--cCCcccccc-cCCC----ceecc--C---ceecc
Confidence 6777776655 6644 332 22322 444 3478899998 566665432 4322 24544 2 77778
Q ss_pred CeE
Q psy14262 443 GYT 445 (482)
Q Consensus 443 Gy~ 445 (482)
||+
T Consensus 360 Gw~ 362 (525)
T KOG1225|consen 360 GWR 362 (525)
T ss_pred Ccc
Confidence 876
No 47
>PF06247 Plasmod_Pvs28: Plasmodium ookinete surface protein Pvs28; InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=68.33 E-value=2.9 Score=38.79 Aligned_cols=57 Identities=28% Similarity=0.680 Sum_probs=39.1
Q ss_pred CCCCCCccccCCCCc-cCCCCCcccccccc----CCCCCC--ceeeecC-----CceEEecCCCeEECC
Q psy14262 392 SPTPRGAMCFCPAGQ-NPNKTQCVDADECL----IDGSCD--QTCTNTH-----LSYVCSCTEGYTLRP 448 (482)
Q Consensus 392 ~~~~~~~~C~C~~G~-~~~~~~C~di~eC~----~~~~Cs--~~C~n~~-----gsy~C~C~~Gy~l~~ 448 (482)
..+...|.|.|.+|| .....+|+...+|. ...+|. ..|++.. ..|+|.|.+||.|..
T Consensus 14 iQMSNHfEC~Cnegfvl~~EntCE~kv~C~~~e~~~K~Cgdya~C~~~~~~~~~~~~~C~C~~gY~~~~ 82 (197)
T PF06247_consen 14 IQMSNHFECKCNEGFVLKNENTCEEKVECDKLENVNKPCGDYAKCINQANKGEERAYKCDCINGYILKQ 82 (197)
T ss_dssp EEESSEEEEEESTTEEEEETTEEEE----SG-GGTTSEEETTEEEEE-SSTTSSTSEEEEE-TTEEESS
T ss_pred EEccCceEEEcCCCcEEccccccccceecCcccccCccccchhhhhcCCCcccceeEEEecccCceeeC
Confidence 345667999999999 55677898888886 246673 5788876 469999999999874
No 48
>PF09064 Tme5_EGF_like: Thrombomodulin like fifth domain, EGF-like; InterPro: IPR015149 This domain adopts a fold similar to other EGF domains, with a flat major and a twisted minor beta sheet. Disulphide pairing, however, is not of the usual 1-3, 2-4, 5-6 type; rather 1-2, 3-4, 5-6 pairing is found. Its extended major sheet (strands beta-2 and beta-3 and the connecting loop) projects into thrombin's active site groove. This domain is required for interaction of thrombomodulin with thrombin, and subsequent activation of protein-C []. ; GO: 0004888 transmembrane signaling receptor activity, 0016021 integral to membrane
Probab=60.64 E-value=8.9 Score=25.41 Aligned_cols=24 Identities=33% Similarity=0.798 Sum_probs=14.8
Q ss_pred CCCceeeecCCceEEecCCCeEECC
Q psy14262 424 SCDQTCTNTHLSYVCSCTEGYTLRP 448 (482)
Q Consensus 424 ~Cs~~C~n~~gsy~C~C~~Gy~l~~ 448 (482)
.|...|.... .++|.|++||.|..
T Consensus 7 ~CpA~CDpn~-~~~C~CPeGyIlde 30 (34)
T PF09064_consen 7 ECPADCDPNS-PGQCFCPEGYILDE 30 (34)
T ss_pred cCCCccCCCC-CCceeCCCceEecC
Confidence 3444554322 23788999998864
No 49
>PF12661 hEGF: Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=58.97 E-value=7.8 Score=20.00 Aligned_cols=9 Identities=44% Similarity=1.331 Sum_probs=5.5
Q ss_pred EEecCCCeE
Q psy14262 437 VCSCTEGYT 445 (482)
Q Consensus 437 ~C~C~~Gy~ 445 (482)
.|.|++||+
T Consensus 1 ~C~C~~G~~ 9 (13)
T PF12661_consen 1 TCQCPPGWT 9 (13)
T ss_dssp EEEE-TTEE
T ss_pred CccCcCCCc
Confidence 467777776
No 50
>PF12946 EGF_MSP1_1: MSP1 EGF domain 1; InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=30.98 E-value=48 Score=22.52 Aligned_cols=22 Identities=27% Similarity=0.475 Sum_probs=17.4
Q ss_pred CceeeecC-CceEEecCCCeEEC
Q psy14262 426 DQTCTNTH-LSYVCSCTEGYTLR 447 (482)
Q Consensus 426 s~~C~n~~-gsy~C~C~~Gy~l~ 447 (482)
...|++.. |++.|.|..||+..
T Consensus 10 NA~C~~~~dG~eecrCllgyk~~ 32 (37)
T PF12946_consen 10 NAGCFRYDDGSEECRCLLGYKKV 32 (37)
T ss_dssp TEEEEEETTSEEEEEE-TTEEEE
T ss_pred CcccEEcCCCCEEEEeeCCcccc
Confidence 34788887 89999999999875
No 51
>PF06247 Plasmod_Pvs28: Plasmodium ookinete surface protein Pvs28; InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=27.08 E-value=13 Score=34.64 Aligned_cols=98 Identities=29% Similarity=0.743 Sum_probs=50.3
Q ss_pred ccCCCCccccc-------ccccCCCCCCCCCCCCCC-CCCCCCCccCCCCcC-CCCCC---CCCccccCCCCc-cCCCCC
Q psy14262 346 RCGSSNQCVAL-------SKFCDGKPDCSNGEDEGP-FCNNKADCENLKCSH-KCSPT---PRGAMCFCPAGQ-NPNKTQ 412 (482)
Q Consensus 346 ~C~~g~~Ci~~-------~~~CDg~~dC~dgsDE~~-~C~~~~~C~~~~C~~-~C~~~---~~~~~C~C~~G~-~~~~~~ 412 (482)
+|++.++|+.. .+.| +|..|+.-.. .|... .|.+-.|.. .|+-. +....|+|.-|+ ..+...
T Consensus 51 ~Cgdya~C~~~~~~~~~~~~~C----~C~~gY~~~~~vCvp~-~C~~~~Cg~GKCI~d~~~~~~~~CSC~IGkV~~dn~k 125 (197)
T PF06247_consen 51 PCGDYAKCINQANKGEERAYKC----DCINGYILKQGVCVPN-KCNNKDCGSGKCILDPDNPNNPTCSCNIGKVPDDNKK 125 (197)
T ss_dssp EEETTEEEEE-SSTTSSTSEEE----EE-TTEEESSSSEEEG-GGSS---TTEEEEEEEGGGSEEEEEE-TEEETTTTTE
T ss_pred cccchhhhhcCCCcccceeEEE----ecccCceeeCCeEchh-hcCceecCCCeEEecCCCCCCceeEeeeceEeccCCc
Confidence 45554466643 4566 6777642211 14433 444333332 44322 234589999999 466777
Q ss_pred ccc--cccccCCCCC--CceeeecCCceEEecCCCeEECCCC
Q psy14262 413 CVD--ADECLIDGSC--DQTCTNTHLSYVCSCTEGYTLRPDG 450 (482)
Q Consensus 413 C~d--i~eC~~~~~C--s~~C~n~~gsy~C~C~~Gy~l~~d~ 450 (482)
|.. ..+|+ -.| ...|..+.+-|+|.+..|+.+...+
T Consensus 126 Ctk~G~T~C~--LKCk~nE~CK~~~~~Y~C~~~~~~~~~~~~ 165 (197)
T PF06247_consen 126 CTKTGETKCS--LKCKENEECKLVDGYYKCVCKEGFPGDGEG 165 (197)
T ss_dssp SEEEE----------TTTEEEEEETTEEEEEE-TT-EEETTT
T ss_pred ccCCCcccee--eecCCCcceeeeCcEEEeecCCCCCCCCCc
Confidence 743 34454 344 4589999999999999999876544
Done!