Query psy620
Match_columns 1290
No_of_seqs 902 out of 4255
Neff 6.5
Searched_HMMs 46136
Date Fri Aug 16 20:15:49 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy620.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/620hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF05735 TSP_C: Thrombospondin 100.0 7.5E-43 1.6E-47 365.5 7.9 138 1015-1153 1-138 (201)
2 PF05735 TSP_C: Thrombospondin 100.0 2.2E-39 4.8E-44 339.4 5.6 108 1132-1290 1-112 (201)
3 KOG1214|consensus 99.9 7.1E-21 1.5E-25 225.3 17.9 325 19-377 606-951 (1289)
4 KOG1214|consensus 99.7 5.4E-17 1.2E-21 192.9 18.2 160 242-433 699-865 (1289)
5 KOG4289|consensus 99.7 4.2E-17 9.1E-22 200.6 16.8 105 104-225 1178-1308(2531)
6 KOG4289|consensus 99.6 1.8E-15 3.8E-20 186.6 16.8 111 146-274 1179-1317(2531)
7 KOG0994|consensus 99.6 1.4E-15 3.1E-20 185.4 14.2 312 54-427 763-1095(1758)
8 KOG1219|consensus 99.6 1.1E-15 2.4E-20 193.7 10.7 111 104-234 3863-3977(4289)
9 KOG1217|consensus 99.6 2.2E-13 4.7E-18 165.3 25.4 199 148-379 91-306 (487)
10 KOG1219|consensus 99.5 9.2E-14 2E-18 176.9 11.7 114 142-273 3859-3977(4289)
11 KOG1217|consensus 99.4 1.4E-12 3.1E-17 158.3 18.2 264 106-427 90-389 (487)
12 KOG0994|consensus 99.1 5E-10 1.1E-14 138.0 14.0 221 119-400 877-1117(1758)
13 KOG4260|consensus 98.7 1.6E-08 3.4E-13 109.5 6.5 152 128-326 130-304 (350)
14 PF02412 TSP_3: Thrombospondin 98.6 2E-08 4.3E-13 78.3 1.8 36 829-864 1-36 (36)
15 KOG1836|consensus 98.6 2.1E-06 4.6E-11 115.8 21.9 227 160-428 749-1019(1705)
16 KOG4260|consensus 98.6 9.7E-08 2.1E-12 103.5 6.8 156 172-373 132-304 (350)
17 PF02412 TSP_3: Thrombospondin 98.5 2.5E-08 5.4E-13 77.7 1.5 35 927-961 2-36 (36)
18 KOG1225|consensus 98.4 8.1E-07 1.7E-11 107.6 11.5 132 127-329 234-365 (525)
19 KOG1225|consensus 98.4 1.7E-06 3.7E-11 104.9 12.0 131 217-428 234-365 (525)
20 KOG1836|consensus 97.8 0.00039 8.4E-09 94.7 16.9 108 260-384 697-816 (1705)
21 PF07645 EGF_CA: Calcium-bindi 97.6 3E-05 6.5E-10 63.1 1.9 39 290-331 1-39 (42)
22 PF12947 EGF_3: EGF domain; I 97.5 9.3E-05 2E-09 58.1 2.9 31 348-379 6-36 (36)
23 KOG1226|consensus 97.4 0.001 2.3E-08 82.5 11.7 99 259-380 479-580 (783)
24 KOG1226|consensus 97.1 0.0024 5.1E-08 79.5 10.7 137 168-357 479-636 (783)
25 PF00008 EGF: EGF-like domain 97.0 0.00018 4E-09 55.0 0.2 28 593-621 3-31 (32)
26 PF07645 EGF_CA: Calcium-bindi 97.0 0.00096 2.1E-08 54.3 4.2 32 341-373 2-34 (42)
27 smart00179 EGF_CA Calcium-bind 96.7 0.0014 3E-08 51.6 3.3 35 146-183 2-38 (39)
28 PF00008 EGF: EGF-like domain 96.6 0.00061 1.3E-08 52.1 0.4 29 108-138 1-30 (32)
29 smart00179 EGF_CA Calcium-bind 96.6 0.0022 4.8E-08 50.5 3.5 37 105-144 2-39 (39)
30 PF12947 EGF_3: EGF domain; I 96.5 0.00082 1.8E-08 52.8 0.7 32 111-143 5-36 (36)
31 PF06247 Plasmod_Pvs28: Plasmo 96.5 0.00065 1.4E-08 71.5 -0.3 133 215-376 18-163 (197)
32 PF12662 cEGF: Complement Clr- 96.2 0.0025 5.4E-08 45.4 1.5 22 608-630 1-24 (24)
33 PF06247 Plasmod_Pvs28: Plasmo 96.1 0.00086 1.9E-08 70.6 -1.4 141 159-332 12-166 (197)
34 cd00054 EGF_CA Calcium-binding 96.0 0.0069 1.5E-07 47.0 3.2 35 146-183 2-37 (38)
35 cd00054 EGF_CA Calcium-binding 95.9 0.008 1.7E-07 46.6 3.5 36 105-144 2-38 (38)
36 PF12662 cEGF: Complement Clr- 95.9 0.0041 8.9E-08 44.3 1.6 22 698-720 1-24 (24)
37 PF14670 FXa_inhibition: Coagu 95.8 0.004 8.8E-08 48.9 1.1 36 294-334 1-36 (36)
38 cd00053 EGF Epidermal growth f 94.9 0.026 5.7E-07 43.0 3.2 28 151-179 5-32 (36)
39 cd00053 EGF Epidermal growth f 94.6 0.037 8E-07 42.1 3.4 31 111-143 5-35 (36)
40 smart00181 EGF Epidermal growt 94.4 0.044 9.4E-07 42.2 3.3 26 112-139 6-31 (35)
41 smart00181 EGF Epidermal growt 94.3 0.041 8.9E-07 42.4 3.0 33 148-183 1-34 (35)
42 KOG1218|consensus 91.5 3.1 6.8E-05 48.3 14.7 193 126-374 14-209 (316)
43 PF14670 FXa_inhibition: Coagu 88.4 0.49 1.1E-05 37.4 3.1 29 348-379 6-36 (36)
44 KOG1218|consensus 88.0 11 0.00024 43.7 15.5 102 131-257 93-200 (316)
45 PF12661 hEGF: Human growth fa 87.3 0.36 7.9E-06 29.6 1.3 13 365-379 1-13 (13)
46 PF07974 EGF_2: EGF-like domai 85.4 0.81 1.8E-05 35.2 2.7 27 348-379 6-32 (32)
47 cd01475 vWA_Matrilin VWA_Matri 85.4 0.71 1.5E-05 51.4 3.5 43 284-331 180-222 (224)
48 KOG3514|consensus 83.2 1.8 3.9E-05 56.1 5.9 34 107-144 625-659 (1591)
49 KOG3512|consensus 82.7 2.2 4.8E-05 51.1 6.0 116 250-380 286-428 (592)
50 PF00683 TB: TB domain; Inter 82.6 0.089 1.9E-06 43.0 -3.7 22 465-486 18-39 (42)
51 PF07974 EGF_2: EGF-like domai 82.6 1.2 2.5E-05 34.3 2.5 24 112-138 6-29 (32)
52 KOG3516|consensus 81.5 3.2 6.9E-05 54.9 7.3 35 106-144 956-991 (1306)
53 PF12946 EGF_MSP1_1: MSP1 EGF 81.4 1.1 2.4E-05 35.5 2.0 32 348-379 5-36 (37)
54 smart00051 DSL delta serrate l 79.5 2.3 4.9E-05 38.0 3.6 44 320-379 20-63 (63)
55 smart00682 G2F G2 nidogen doma 77.8 2.1 4.5E-05 47.6 3.6 67 21-96 153-222 (227)
56 cd01475 vWA_Matrilin VWA_Matri 76.9 1.7 3.8E-05 48.2 2.8 38 141-179 182-219 (224)
57 KOG3516|consensus 72.0 9.6 0.00021 50.7 7.8 39 102-144 542-581 (1306)
58 PF01683 EB: EB module; Inter 69.8 7.5 0.00016 33.0 4.3 33 342-379 20-52 (52)
59 PF12946 EGF_MSP1_1: MSP1 EGF 69.0 0.92 2E-05 35.9 -1.3 34 149-183 2-36 (37)
60 KOG3512|consensus 68.8 8.6 0.00019 46.4 5.9 16 164-179 368-383 (592)
61 PTZ00214 high cysteine membran 64.8 1.2E+02 0.0025 40.6 15.6 85 119-227 366-458 (800)
62 smart00051 DSL delta serrate l 61.6 10 0.00022 34.0 3.6 47 217-272 17-63 (63)
63 PF03302 VSP: Giardia variant- 60.9 40 0.00086 41.2 9.8 128 130-271 2-136 (397)
64 cd00255 nidG2 Nidogen, G2 doma 59.9 6.7 0.00015 43.7 2.7 69 19-93 150-222 (224)
65 PHA02887 EGF-like protein; Pro 57.6 9 0.0002 38.0 2.8 31 348-381 92-123 (126)
66 PHA03099 epidermal growth fact 52.2 13 0.00028 37.5 3.0 31 348-381 51-82 (139)
67 KOG3514|consensus 51.8 10 0.00023 49.6 2.8 35 588-626 624-659 (1591)
68 PHA03099 epidermal growth fact 50.3 15 0.00031 37.2 3.0 39 104-146 41-83 (139)
69 PF03302 VSP: Giardia variant- 49.0 2.8E+02 0.0062 33.9 14.4 44 167-228 91-134 (397)
70 PF00954 S_locus_glycop: S-loc 44.3 19 0.00041 35.5 2.8 32 105-138 77-108 (110)
71 PHA02887 EGF-like protein; Pro 43.6 18 0.00038 36.1 2.4 28 206-236 97-124 (126)
72 cd00055 EGF_Lam Laminin-type e 34.6 41 0.00088 28.4 3.0 35 114-179 4-42 (50)
73 PF01683 EB: EB module; Inter 33.8 50 0.0011 27.9 3.4 29 106-139 20-48 (52)
74 PF00954 S_locus_glycop: S-loc 33.6 34 0.00074 33.7 2.8 32 341-374 77-108 (110)
75 smart00210 TSPN Thrombospondin 31.6 79 0.0017 34.1 5.4 33 29-65 112-144 (184)
76 PTZ00214 high cysteine membran 29.9 1.8E+02 0.0039 38.9 9.1 38 314-357 681-722 (800)
77 KOG3509|consensus 28.8 1.1E+02 0.0023 41.3 6.7 117 128-273 719-841 (964)
78 TIGR00648 recU recombination p 25.9 32 0.00069 36.9 1.1 46 1226-1272 101-154 (169)
79 PF00053 Laminin_EGF: Laminin 25.4 61 0.0013 27.1 2.5 22 354-380 11-32 (49)
80 PRK02234 recU Holliday junctio 21.7 47 0.001 36.5 1.4 47 1225-1272 123-177 (195)
81 PF01414 DSL: Delta serrate li 20.0 26 0.00057 31.3 -0.8 48 314-379 16-63 (63)
No 1
>PF05735 TSP_C: Thrombospondin C-terminal region; InterPro: IPR008859 Thrombospondins are multimeric multidomain glycoproteins that function at cell surfaces and in the extracellular matrix milieu. They act as regulators of cell interactions in vertebrates. They are divided into two subfamilies, A and B, according to their overall molecular organisation. The subgroup A proteins TSP-1 and -2 contain an N-terminal domain, a VWFC domain, three TSP1 repeats, three EGF-like domains, TSP3 repeats and a C-terminal domain. They are assembled as trimers. The subgroup B thrombospondins, designated TSP-3, -4, and COMP (cartilage oligomeric matrix protein, also designated TSP-5) are distinct in that they contain unique N-terminal regions, lack the VWFC domain and TSP1 repeats, contain four copies of EGF-like domains, and are assembled as pentamers. EGF, TSP3 repeats and the C-terminal domain are thus the hallmarks of a thrombospondin. The globular C-terminal domain is a beta sandwich of two curved antiparallel beta-sheets. The fold is an elaboration of the jelly roll topology, with strands B3-B7, B11 and B14-B15 forming the eight-stranded jelly roll motif. The function of the C-terminal domain is not yet known.; GO: 0005509 calcium ion binding, 0007155 cell adhesion, 0005576 extracellular region; PDB: 1UX6_A 1YO8_A 2RHP_A 3FBY_C.
Probab=100.00 E-value=7.5e-43 Score=365.48 Aligned_cols=138 Identities=64% Similarity=1.137 Sum_probs=106.7
Q ss_pred CCCCceeeccCCceEEEeecCCCCcccccccccceeeecceeecccCCCCccceEeeeccCCcEEEEeccccceeeeecc
Q psy620 1015 QIDPHWVIYNHGAEILQTMNSDPGLAIGQDKFSGVDFEGTFFVDTDIDDDYAGFVFSYQSSQKFYVMMWKKNSQVYWQTT 1094 (1290)
Q Consensus 1015 ~~d~~~~v~~~g~~~~q~~~~dp~~~~g~~~~~~~d~~g~~~~~~~~d~~~~gfvf~yq~~~~f~~~~~~~~~~~~w~~~ 1094 (1290)
|.||+|+|.++|+||+|++||||+++||.++|.+|||+|||+|++..|||||||||+||+|+||||||||+..|+||+.+
T Consensus 1 q~dP~W~v~~~G~ev~Qt~NsdP~l~ig~~~~~~vdf~GT~~Vnt~~DDDyiGFVFGYQsn~~FYvv~WKq~~Q~y~~~~ 80 (201)
T PF05735_consen 1 QIDPNWVVSNQGAEVVQTLNSDPGLAIGPDNFGGVDFSGTFFVNTTSDDDYIGFVFGYQSNRKFYVVMWKQGNQNYWESS 80 (201)
T ss_dssp S----EEEECCCTEEEE-SS-SSEEEEEEEEESSEEEEEEEEE--SS---EEEEEEEEEETTEEEEEEEESS-EE-S--S
T ss_pred CCCCceEEecCCeEEEEeccCCCeEEEccceecceEEEEEEEEecCCCCCEEEEEEEecCCCeEEEEEeeccccccccCC
Confidence 67999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred ccccccCCccEEEEecCCCCCCcccccccccCCCcccccccCCCcCCCcccccCCCCcc
Q psy620 1095 PFRAVAEPGIQLKVVDSATGPGTMLRNSLWHTGDTENQCDLAEPCDPRVQCTNLFPGYR 1153 (1290)
Q Consensus 1095 ~f~~~~~~g~~~~~~~~~~~~~~~~~~~~w~~~~~~~q~~~~w~~~~~~~~~~~~~~~~ 1153 (1290)
||||.|++|||+|+|+|+||||++|||+|||+++|.+||++||+ +|...+|+....|+
T Consensus 81 p~~~~a~~Gl~iK~V~s~tGpg~~l~nalWh~~~t~~qv~llw~-dp~~~GW~~~t~Y~ 138 (201)
T PF05735_consen 81 PFRATAEPGLQIKLVDSTTGPGEMLRNALWHTGDTTNQVKLLWH-DPGNIGWKDNTAYR 138 (201)
T ss_dssp SS--EE-SEEEEEEEE-SS-TTHHHHHHHHSSS-BTTTEEEEEE--TT-----TT-EEE
T ss_pred CccccccceEEEEEEecCcCCchhhhhhhccCCCccceeEEEEe-CCCcCCCcCCccEE
Confidence 99999999999999999999999999999999999999999999 88777776655554
No 2
>PF05735 TSP_C: Thrombospondin C-terminal region; InterPro: IPR008859 Thrombospondins are multimeric multidomain glycoproteins that function at cell surfaces and in the extracellular matrix milieu. They act as regulators of cell interactions in vertebrates. They are divided into two subfamilies, A and B, according to their overall molecular organisation. The subgroup A proteins TSP-1 and -2 contain an N-terminal domain, a VWFC domain, three TSP1 repeats, three EGF-like domains, TSP3 repeats and a C-terminal domain. They are assembled as trimers. The subgroup B thrombospondins, designated TSP-3, -4, and COMP (cartilage oligomeric matrix protein, also designated TSP-5) are distinct in that they contain unique N-terminal regions, lack the VWFC domain and TSP1 repeats, contain four copies of EGF-like domains, and are assembled as pentamers. EGF, TSP3 repeats and the C-terminal domain are thus the hallmarks of a thrombospondin. The globular C-terminal domain is a beta sandwich of two curved antiparallel beta-sheets. The fold is an elaboration of the jelly roll topology, with strands B3-B7, B11 and B14-B15 forming the eight-stranded jelly roll motif. The function of the C-terminal domain is not yet known.; GO: 0005509 calcium ion binding, 0007155 cell adhesion, 0005576 extracellular region; PDB: 1UX6_A 1YO8_A 2RHP_A 3FBY_C.
Probab=100.00 E-value=2.2e-39 Score=339.41 Aligned_cols=108 Identities=50% Similarity=0.779 Sum_probs=82.4
Q ss_pred ccccCCCc----CCCcccccCCCCcccCCCCCCCcCCCCccceeeeEEEEeeecccccccccCCCCCCccceeeeecccc
Q psy620 1132 QCDLAEPC----DPRVQCTNLFPGYRCDPCPAGFTGSTGVQGVGLEHAVRFRQTCVDIDECADGRNGGSALLLVRINSQA 1207 (1290)
Q Consensus 1132 q~~~~w~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 1207 (1290)
|++|+|++ .+++|++||+||++ +|.+.|.+|||+|||. ||
T Consensus 1 q~dP~W~v~~~G~ev~Qt~NsdP~l~--------ig~~~~~~vdf~GT~~-------------------------Vn--- 44 (201)
T PF05735_consen 1 QIDPNWVVSNQGAEVVQTLNSDPGLA--------IGPDNFGGVDFSGTFF-------------------------VN--- 44 (201)
T ss_dssp S----EEEECCCTEEEE-SS-SSEEE--------EEEEEESSEEEEEEEE-------------------------E----
T ss_pred CCCCceEEecCCeEEEEeccCCCeEE--------EccceecceEEEEEEE-------------------------Ee---
Confidence 67778877 78999999999998 7777999999999985 66
Q ss_pred chhhchhhhhhhhhhcccccceeeEEEEeecCCcEEEEEeeecccccccccCcceecCCccEEEEEeCCCCCchhhhhcc
Q psy620 1208 WTSRELSSWIRILMTTMLDLCSGNVATFYQSSQKFYVMMWKKNSQVYWQTTPFRAVAEPGIQLKVVDSATGPGTMLRNSL 1287 (1290)
Q Consensus 1208 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~yw~~~~~~~~~~~~~~~~~~~~~~~~g~~lrn~l 1287 (1290)
|+.+|||+||| ||||+|++||||||||.+|+||+++||||+|++|||||+|+|+||||++|||||
T Consensus 45 --------------t~~DDDyiGFV-FGYQsn~~FYvv~WKq~~Q~y~~~~p~~~~a~~Gl~iK~V~s~tGpg~~l~nal 109 (201)
T PF05735_consen 45 --------------TTSDDDYIGFV-FGYQSNRKFYVVMWKQGNQNYWESSPFRATAEPGLQIKLVDSTTGPGEMLRNAL 109 (201)
T ss_dssp ---------------SS---EEEEE-EEEEETTEEEEEEEESS-EE-S--SSS--EE-SEEEEEEEE-SS-TTHHHHHHH
T ss_pred --------------cCCCCCEEEEE-EEecCCCeEEEEEeeccccccccCCCccccccceEEEEEEecCcCCchhhhhhh
Confidence 34578899988 899999999999999999999999999999999999999999999999999999
Q ss_pred cCC
Q psy620 1288 WHT 1290 (1290)
Q Consensus 1288 w~~ 1290 (1290)
|||
T Consensus 110 Wh~ 112 (201)
T PF05735_consen 110 WHT 112 (201)
T ss_dssp HSS
T ss_pred ccC
Confidence 997
No 3
>KOG1214|consensus
Probab=99.85 E-value=7.1e-21 Score=225.33 Aligned_cols=325 Identities=24% Similarity=0.492 Sum_probs=228.7
Q ss_pred ceeecccccccccccCccc--eEEEEeeeeccccceeeeecccCCCccc---chhh--hhhhcccccccceeeeeecccc
Q psy620 19 PIVEGWSVKDDLLDDGVIN--GLLLGVKQDIMGARYTLYMDCVDHGTVA---MTQS--LKKMFDSMKNPQMRLRKTDEES 91 (1290)
Q Consensus 19 ~~~~~~s~~~~~~~~~~~~--~l~~~~~~~i~G~~~~ly~~C~~~~~~~---~~~~--~~~~~~~~~~~~~~l~~~~~~~ 91 (1290)
..+...++++|.+..+.+. +.+++++|+|. |..|.+..+.+ ..++ +-++|+.|..++..|+++..+.
T Consensus 606 s~vtstssr~y~~t~ga~~S~~~sy~~hq~it------yq~C~h~~~~p~~p~tqql~vd~vfalyn~ee~~lr~a~Sn~ 679 (1289)
T KOG1214|consen 606 STVTSTSSRDYSLTFGAINSQTWSYRIHQNIT------YQVCRHAPRHPSFPTTQQLNVDRVFALYNDEERVLRFAVSNQ 679 (1289)
T ss_pred ceeecccccceeeecCcccccceeEEEeecce------eEEeecCCCCCCCCCceEeecccceeccCccccchhhhhhhc
Confidence 3444556677888888776 78999999988 88998776654 3333 3499999999999999999999
Q ss_pred cccccCCCcCcccCCCCC-CCCCCCCCCeeecCCC-CcccccCCCCcccCCCCCCCCCCCCC--CCCCCCCeeccCCCCc
Q psy620 92 VDEIELPAIPIVKKPTCA-TDNPCFPGVECRDTRE-GPRCMRCPDGYVGDGIHCKPGVTCNM--RPCFQGVQCFDTVEGY 167 (1290)
Q Consensus 92 ~~~~~~~~~~~~~~d~C~-~~~pC~~gg~C~~~~g-~y~C~~C~~Gy~Gdg~~CedideC~~--~pC~~gg~C~n~~g~y 167 (1290)
+..+.....+ ...++|- .++-|.-++.|....+ .|.| .|..||.|+++.|.++++|+. ..|+.++.|++.+++|
T Consensus 680 igpV~E~S~~-~~~npCy~gsh~cdt~a~C~pg~~~~~tc-ecs~g~~gdgr~c~d~~eca~~~~~CGp~s~Cin~pg~~ 757 (1289)
T KOG1214|consen 680 IGPVKEDSDP-TPVNPCYDGSHMCDTTARCHPGTGVDYTC-ECSSGYQGDGRNCVDENECATGFHRCGPNSVCINLPGSY 757 (1289)
T ss_pred ccceecCCCC-cccccceecCcccCCCccccCCCCcceEE-EEeeccCCCCCCCCChhhhccCCCCCCCCceeecCCCce
Confidence 9888643332 2346665 3788998999998764 5999 999999999999999999998 4599999999999999
Q ss_pred ccccCCCCCC--CCCCCceec-ccCCCCCCCCCCCCCC-cceeeeccCCCCCceecCCCCCCCcCCCCccccCCccCCCC
Q psy620 168 TCGPCPSGYT--GDGERCQRI-GGCSRNPCAQGKLNEK-TRCVRCDDIPEHPYYRCGSCPEGTTGNGTRCHDIDECDLAE 243 (1290)
Q Consensus 168 ~C~~C~~Gy~--Gdg~~C~~i-deC~~~pC~~g~~~~~-~~Cg~C~~~~~~g~y~C~~C~~Gy~Gdg~~C~dideC~~~~ 243 (1290)
+| +|..||. +++.+|..+ .+-..++|..+..... ...+.|+... .+.|.| +|.+||.|+|..|.+++||. ++
T Consensus 758 rc-eC~~gy~F~dd~~tCV~i~~pap~n~Ce~g~h~C~i~g~a~c~~hG-gs~y~C-~CLPGfsGDG~~c~dvDeC~-ps 833 (1289)
T KOG1214|consen 758 RC-ECRSGYEFADDRHTCVLITPPAPANPCEDGSHTCAIAGQARCVHHG-GSTYSC-ACLPGFSGDGHQCTDVDECS-PS 833 (1289)
T ss_pred eE-EEeecceeccCCcceEEecCCCCCCccccCccccCcCCceEEEecC-CceEEE-eecCCccCCccccccccccC-cc
Confidence 99 9999986 777789873 3222345554411110 0115666654 468999 99999999999999999998 89
Q ss_pred CCCCCcccccCCCCeecccCCCCCccCCCCccccccccccCCCCcccCCCCCCCC--CCCCCCCCcccc--CCCCcEEcC
Q psy620 244 PCDPRVQCTNLFPGYRCDPCPAGFTGSTGVQGVGLEHAVRFRQTCVDIDECADGR--NGGCDSNSMCTN--TEGSFTCTS 319 (1290)
Q Consensus 244 pC~~~g~C~n~~gsy~C~~C~~Gy~G~~Ce~~~~~~~~~~~~~~C~dideC~~~~--~g~C~~~g~C~n--~~gsy~C~~ 319 (1290)
.|+..+.|.+++++|.|. |.+||.|+. .+++..+ .....|...+ .-.|+.+..|.. -+.+|.|
T Consensus 834 rChp~A~CyntpgsfsC~-C~pGy~GDG-f~CVP~~---------~~~T~C~~er~hpl~chg~t~~~~~~Dp~~~e~-- 900 (1289)
T KOG1214|consen 834 RCHPAATCYNTPGSFSCR-CQPGYYGDG-FQCVPDT---------SSLTPCEQERFHPLQCHGSTGFCWCVDPDGHEV-- 900 (1289)
T ss_pred ccCCCceEecCCCcceee-cccCccCCC-ceecCCC---------ccCCccccccccceeeccccceeEeeCCCcccC--
Confidence 999999999999999999 999999987 1111110 1122333221 122554443332 2456788
Q ss_pred cCcCCccccCCCCCCCCCCCCCCCCCCCCCCCCCCeEeee--cCCceEEecCCCcccCCC
Q psy620 320 LCRNSYMVRNVSVGCQSQNFGADVCPDGTRCDRNAKCTRI--LGNHYACKCDNGWAGDGQ 377 (1290)
Q Consensus 320 ~C~~Gy~g~~~g~~C~~~~~~id~C~~~~~C~~~g~C~~~--~~gsy~C~C~~Gy~GdG~ 377 (1290)
.|.++-.|.. ...|-... ...| ..|.-+|.|..+ .+.+++|.|.. +||+
T Consensus 901 p~~~~ppG~~-~~~c~~~~--~~~v---p~Cd~hgh~ap~qchG~~~~CwCvd---~dGr 951 (1289)
T KOG1214|consen 901 PGTQTPPGST-PPHCGPSP--EQYV---PQCDDHGHFAPLQCHGKSDFCWCVD---KDGR 951 (1289)
T ss_pred CCCCCCCCCC-CCCCCCcc--cccC---CCccccccccccccCCCcceeEEec---CCCc
Confidence 7777654432 23454311 1112 246666766543 24458999987 5665
No 4
>KOG1214|consensus
Probab=99.73 E-value=5.4e-17 Score=192.88 Aligned_cols=160 Identities=33% Similarity=0.767 Sum_probs=129.4
Q ss_pred CCCCCCCcccccCCC-CeecccCCCCCccCCCCccccccccccCCCCcccCCCCCCCCCCCCCCCCccccCCCCcEEcCc
Q psy620 242 AEPCDPRVQCTNLFP-GYRCDPCPAGFTGSTGVQGVGLEHAVRFRQTCVDIDECADGRNGGCDSNSMCTNTEGSFTCTSL 320 (1290)
Q Consensus 242 ~~pC~~~g~C~n~~g-sy~C~~C~~Gy~G~~Ce~~~~~~~~~~~~~~C~dideC~~~~~g~C~~~g~C~n~~gsy~C~~~ 320 (1290)
+..|..++.|....+ .|+|. |..||.|.. +.|.+++||+.. .+.|.+++.|++.+++|+| +
T Consensus 699 sh~cdt~a~C~pg~~~~~tce-cs~g~~gdg--------------r~c~d~~eca~~-~~~CGp~s~Cin~pg~~rc--e 760 (1289)
T KOG1214|consen 699 SHMCDTTARCHPGTGVDYTCE-CSSGYQGDG--------------RNCVDENECATG-FHRCGPNSVCINLPGSYRC--E 760 (1289)
T ss_pred CcccCCCccccCCCCcceEEE-EeeccCCCC--------------CCCCChhhhccC-CCCCCCCceeecCCCceeE--E
Confidence 345666777876544 58898 999998876 889999999998 7889999999999999999 9
Q ss_pred CcCCccccCCCCCCCCCCC--CCCCCCCC-CCCCCCC--eEeeecCCceEEecCCCcccCCCCcCCcCCCCCCCCCCCCC
Q psy620 321 CRNSYMVRNVSVGCQSQNF--GADVCPDG-TRCDRNA--KCTRILGNHYACKCDNGWAGDGQFCGRDTDLDGWPDYDLAC 395 (1290)
Q Consensus 321 C~~Gy~g~~~g~~C~~~~~--~id~C~~~-~~C~~~g--~C~~~~~gsy~C~C~~Gy~GdG~~Ce~~~d~d~~~~~~~~C 395 (1290)
|..||.....+..|..... .++.|..+ +.|...+ .|+....++|.|.|.+||.|||..|. +.| .|
T Consensus 761 C~~gy~F~dd~~tCV~i~~pap~n~Ce~g~h~C~i~g~a~c~~hGgs~y~C~CLPGfsGDG~~c~---dvD-------eC 830 (1289)
T KOG1214|consen 761 CRSGYEFADDRHTCVLITPPAPANPCEDGSHTCAIAGQARCVHHGGSTYSCACLPGFSGDGHQCT---DVD-------EC 830 (1289)
T ss_pred EeecceeccCCcceEEecCCCCCCccccCccccCcCCceEEEecCCceEEEeecCCccCCccccc---ccc-------cc
Confidence 9999988887888976432 35778877 8898755 56666678899999999999999884 333 56
Q ss_pred CCCCCC-CCccccCCCCCCCCccccCCCCCCCCCCcccc
Q psy620 396 PDRKCR-KDNCVHIPNSGINNHADNCPRNANPDQRMCGH 433 (1290)
Q Consensus 396 ~~~~C~-ng~C~~~~gs~~~~~~C~C~~Gy~G~~~~c~~ 433 (1290)
+++.|. +.+|++.+|+ |.|.|.+||.|++-.|..
T Consensus 831 ~psrChp~A~Cyntpgs----fsC~C~pGy~GDGf~CVP 865 (1289)
T KOG1214|consen 831 SPSRCHPAATCYNTPGS----FSCRCQPGYYGDGFQCVP 865 (1289)
T ss_pred CccccCCCceEecCCCc----ceeecccCccCCCceecC
Confidence 678884 4589999976 679999999999866643
No 5
>KOG4289|consensus
Probab=99.72 E-value=4.2e-17 Score=200.61 Aligned_cols=105 Identities=33% Similarity=0.841 Sum_probs=93.8
Q ss_pred cCCCCCCCCCCCCCCeeec----------------------CCCCcccccCCCCcccCCCCCC-CCCCCCCCCCCCCCee
Q psy620 104 KKPTCATDNPCFPGVECRD----------------------TREGPRCMRCPDGYVGDGIHCK-PGVTCNMRPCFQGVQC 160 (1290)
Q Consensus 104 ~~d~C~~~~pC~~gg~C~~----------------------~~g~y~C~~C~~Gy~Gdg~~Ce-dideC~~~pC~~gg~C 160 (1290)
+.+.|. ..||.|..+|+. ..++++| +||+||+| ..|+ .+++|.+.||.++++|
T Consensus 1178 dDniCl-rEPCenymkCvsvlrFdssapf~~s~s~lfRpi~pvnglrC-rCPpGFTg--d~CeTeiDlCYs~pC~nng~C 1253 (2531)
T KOG4289|consen 1178 DDNICL-REPCENYMKCVSVLRFDSSAPFLASDSVLFRPIHPVNGLRC-RCPPGFTG--DYCETEIDLCYSGPCGNNGRC 1253 (2531)
T ss_pred cCchhh-cchhHHHHhhhhheeecccCccccccceeeeeccccCceeE-eCCCCCCc--ccccchhHhhhcCCCCCCCce
Confidence 457898 999999999975 2357999 99999999 4999 8999999999999999
Q ss_pred ccCCCCcccccCCCCCCCCCCCcee---cccCCCCCCCCCCCCCCcceeeeccCCCCCceecCCCCCC
Q psy620 161 FDTVEGYTCGPCPSGYTGDGERCQR---IGGCSRNPCAQGKLNEKTRCVRCDDIPEHPYYRCGSCPEG 225 (1290)
Q Consensus 161 ~n~~g~y~C~~C~~Gy~Gdg~~C~~---ideC~~~pC~~g~~~~~~~Cg~C~~~~~~g~y~C~~C~~G 225 (1290)
....|+|+| .|.+||+|. +||. ...|.+..|.++ ++|++.. .++|.| .|+.|
T Consensus 1254 ~srEggYtC-eCrpg~tGe--hCEvs~~agrCvpGvC~ng--------gtC~~~~-nggf~c-~Cp~g 1308 (2531)
T KOG4289|consen 1254 RSREGGYTC-ECRPGFTGE--HCEVSARAGRCVPGVCKNG--------GTCVNLL-NGGFCC-HCPYG 1308 (2531)
T ss_pred EEecCceeE-EecCCcccc--ceeeecccCccccceecCC--------CEEeecC-CCceec-cCCCc
Confidence 999999999 999999998 9986 467888999999 9999886 478999 99987
No 6
>KOG4289|consensus
Probab=99.64 E-value=1.8e-15 Score=186.64 Aligned_cols=111 Identities=34% Similarity=0.820 Sum_probs=92.9
Q ss_pred CCCCCCCCCCCCCeecc----------------------CCCCcccccCCCCCCCCCCCcee-cccCCCCCCCCCCCCCC
Q psy620 146 GVTCNMRPCFQGVQCFD----------------------TVEGYTCGPCPSGYTGDGERCQR-IGGCSRNPCAQGKLNEK 202 (1290)
Q Consensus 146 ideC~~~pC~~gg~C~n----------------------~~g~y~C~~C~~Gy~Gdg~~C~~-ideC~~~pC~~g~~~~~ 202 (1290)
-+.|...||.+...|+. ..++++| +||+||+|+ .|+. ++.|-+.||.++
T Consensus 1179 DniClrEPCenymkCvsvlrFdssapf~~s~s~lfRpi~pvnglrC-rCPpGFTgd--~CeTeiDlCYs~pC~nn----- 1250 (2531)
T KOG4289|consen 1179 DNICLREPCENYMKCVSVLRFDSSAPFLASDSVLFRPIHPVNGLRC-RCPPGFTGD--YCETEIDLCYSGPCGNN----- 1250 (2531)
T ss_pred CchhhcchhHHHHhhhhheeecccCccccccceeeeeccccCceeE-eCCCCCCcc--cccchhHhhhcCCCCCC-----
Confidence 35688888888888852 3467899 999999999 9998 999999999999
Q ss_pred cceeeeccCCCCCceecCCCCCCCcCCCCccc---cCCccCCCCCCCCCcccccC-CCCeecccCCCC-CccCCCCc
Q psy620 203 TRCVRCDDIPEHPYYRCGSCPEGTTGNGTRCH---DIDECDLAEPCDPRVQCTNL-FPGYRCDPCPAG-FTGSTGVQ 274 (1290)
Q Consensus 203 ~~Cg~C~~~~~~g~y~C~~C~~Gy~Gdg~~C~---dideC~~~~pC~~~g~C~n~-~gsy~C~~C~~G-y~G~~Ce~ 274 (1290)
++|...+ ++|+| .|.+||+| .+|+ ..-.|. +..|.++++|++. .++|.|. |+.| |+++.|+.
T Consensus 1251 ---g~C~srE--ggYtC-eCrpg~tG--ehCEvs~~agrCv-pGvC~nggtC~~~~nggf~c~-Cp~ge~e~prC~v 1317 (2531)
T KOG4289|consen 1251 ---GRCRSRE--GGYTC-ECRPGFTG--EHCEVSARAGRCV-PGVCKNGGTCVNLLNGGFCCH-CPYGEFEDPRCEV 1317 (2531)
T ss_pred ---CceEEec--CceeE-EecCCccc--cceeeecccCccc-cceecCCCEEeecCCCceecc-CCCcccCCCceEE
Confidence 9999876 69999 99999999 8997 345676 8899999999875 5789998 9987 55666654
No 7
>KOG0994|consensus
Probab=99.63 E-value=1.4e-15 Score=185.36 Aligned_cols=312 Identities=22% Similarity=0.464 Sum_probs=190.7
Q ss_pred eeecccCCCcccchhhhhhhcccccccceeeeeecccccccccCCCcCcccCCCCCC----CCCCCC-CC--eeecCCCC
Q psy620 54 LYMDCVDHGTVAMTQSLKKMFDSMKNPQMRLRKTDEESVDEIELPAIPIVKKPTCAT----DNPCFP-GV--ECRDTREG 126 (1290)
Q Consensus 54 ly~~C~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~d~C~~----~~pC~~-gg--~C~~~~g~ 126 (1290)
+.++|.+.+++.........+.+ .+|+++.|....-.++.|.+.+..|..- +|.. .+-|.. -| .|+...-+
T Consensus 763 ~~CnCnptGSlS~vCn~~GGqCq-CkPnVVGR~CdqCApGtyGFGPsGCk~C-dC~~~Gs~~~~Cd~~tGQC~C~~g~yg 840 (1758)
T KOG0994|consen 763 SMCNCNPTGSLSSVCNPNGGQCQ-CKPNVVGRRCDQCAPGTYGFGPSGCKAC-DCNSIGSLDKYCDKITGQCQCRPGTYG 840 (1758)
T ss_pred cccccCCCccccccccCCCceec-ccCccccccccccCCcccCcCCccCccc-cccccccccccccccccceeeccccch
Confidence 46888888888776666666665 7888888888777777787766654321 2221 122221 12 35665556
Q ss_pred cccccCCCCcccCCCCCC------CCCCCCCC--CCCCCCeeccCCCCcccccCCCCCCCCCCCceecccCCCCCCCCCC
Q psy620 127 PRCMRCPDGYVGDGIHCK------PGVTCNMR--PCFQGVQCFDTVEGYTCGPCPSGYTGDGERCQRIGGCSRNPCAQGK 198 (1290)
Q Consensus 127 y~C~~C~~Gy~Gdg~~Ce------dideC~~~--pC~~gg~C~n~~g~y~C~~C~~Gy~Gdg~~C~~ideC~~~pC~~g~ 198 (1290)
.+|.+|.+||+|. +.|. ..++|.+. .|. .|.+...++.|++|..||+|+++. ..-..|.+.||+.+.
T Consensus 841 rqCnqCqpG~WgF-PeCr~CqCNgHA~~Cd~~tGaCi---~CqD~T~G~~CdrCl~GyyGdP~l-g~g~~CrPCpCP~gp 915 (1758)
T KOG0994|consen 841 RQCNQCQPGYWGF-PECRPCQCNGHADTCDPITGACI---DCQDSTTGHSCDRCLDGYYGDPRL-GSGIGCRPCPCPDGP 915 (1758)
T ss_pred hhccccCCCccCC-CcCccccccCcccccCccccccc---cccccccccchhhhhccccCCccc-CCCCCCCCCCCCCCC
Confidence 7899999999996 4444 23444432 122 367788999999999999998643 123467888888773
Q ss_pred CCCCcceeeeccCCCCCceecCCCCCCCcCCCCccccCCccCC---CCCCCCCcccccCCCCeecccCCCCCccCCCCcc
Q psy620 199 LNEKTRCVRCDDIPEHPYYRCGSCPEGTTGNGTRCHDIDECDL---AEPCDPRVQCTNLFPGYRCDPCPAGFTGSTGVQG 275 (1290)
Q Consensus 199 ~~~~~~Cg~C~~~~~~g~y~C~~C~~Gy~Gdg~~C~dideC~~---~~pC~~~g~C~n~~gsy~C~~C~~Gy~G~~Ce~~ 275 (1290)
......--.|...+......| .|.+||.| .+|+ +|.. .+|=. +++|. .|. |.--.-
T Consensus 916 ~Sg~~~A~sC~~d~~t~~ivC-~C~~GY~G--~RCe---~CA~~~fGnP~~-GGtCq------~Ce-C~~NiD------- 974 (1758)
T KOG0994|consen 916 ASGRQHADSCYLDTRTQQIVC-HCQEGYSG--SRCE---ICADNHFGNPSE-GGTCQ------KCE-CSNNID------- 974 (1758)
T ss_pred ccchhccccccccccccceee-ecccCccc--cchh---hhcccccCCccc-CCccc------ccc-ccCCcC-------
Confidence 222212234544333335678 89999998 6775 3541 12222 34442 233 321100
Q ss_pred ccccccccCCCCcccCCCCCCCCCCCCCCCCccccCCCCcEEcCcCcCCccccCCCCCCCCCCCCCCCCCCCCCCCCCCe
Q psy620 276 VGLEHAVRFRQTCVDIDECADGRNGGCDSNSMCTNTEGSFTCTSLCRNSYMVRNVSVGCQSQNFGADVCPDGTRCDRNAK 355 (1290)
Q Consensus 276 ~~~~~~~~~~~~C~dideC~~~~~g~C~~~g~C~n~~gsy~C~~~C~~Gy~g~~~g~~C~~~~~~id~C~~~~~C~~~g~ 355 (1290)
=.+...|... ++.|. .|.....+-+|+ .|+.||+|..-...|+. -.|.....= +.+.
T Consensus 975 ------------~~d~~aCD~~-TG~CL---kCL~hTeG~hCe-~Ck~Gf~GdA~~q~Cqr-----C~Cn~LGTn-~~~~ 1031 (1758)
T KOG0994|consen 975 ------------LYDPGACDVA-TGACL---KCLYHTEGDHCE-HCKDGFYGDALRQNCQR-----CVCNFLGTN-STCH 1031 (1758)
T ss_pred ------------ccCCCccchh-hchhh---hhhhcccccchh-hccccchhHHHHhhhhh-----heccccccC-Cccc
Confidence 0244567666 67787 688777788997 99999998876667765 123321100 1134
Q ss_pred EeeecCCceEEecCCCcccCCCCcCCcCCCCCCCCCCCCCCCCCCCC-C--ccccCCCCCCCCccccCCCCCCCC
Q psy620 356 CTRILGNHYACKCDNGWAGDGQFCGRDTDLDGWPDYDLACPDRKCRK-D--NCVHIPNSGINNHADNCPRNANPD 427 (1290)
Q Consensus 356 C~~~~~gsy~C~C~~Gy~GdG~~Ce~~~d~d~~~~~~~~C~~~~C~n-g--~C~~~~gs~~~~~~C~C~~Gy~G~ 427 (1290)
|... +.+|.|.+...| .+|......-+-..+...|.+-.|.. + +|.. ...+|.|.+||-|.
T Consensus 1032 CDr~---tGQCpClpNv~G--~~CDqCA~N~w~laSG~GCe~C~Cd~~~~pqCN~------ftGQCqCkpGfGGR 1095 (1758)
T KOG0994|consen 1032 CDRF---TGQCPCLPNVQG--VRCDQCAENHWNLASGEGCEPCNCDPIGGPQCNE------FTGQCQCKPGFGGR 1095 (1758)
T ss_pred cccc---cCcCCCCccccc--ccccccccchhccccCCCCCccCCCccCCccccc------cccceeccCCCCCc
Confidence 5443 459999999999 88865443333345677787766633 1 3432 24469999999443
No 8
>KOG1219|consensus
Probab=99.61 E-value=1.1e-15 Score=193.74 Aligned_cols=111 Identities=40% Similarity=1.015 Sum_probs=103.3
Q ss_pred cCCCCCCCCCCCCCCeeecCC-CCcccccCCCCcccCCCCCC-CCCCCCCCCCCCCCeeccCCCCcccccCCCCCCCCCC
Q psy620 104 KKPTCATDNPCFPGVECRDTR-EGPRCMRCPDGYVGDGIHCK-PGVTCNMRPCFQGVQCFDTVEGYTCGPCPSGYTGDGE 181 (1290)
Q Consensus 104 ~~d~C~~~~pC~~gg~C~~~~-g~y~C~~C~~Gy~Gdg~~Ce-dideC~~~pC~~gg~C~n~~g~y~C~~C~~Gy~Gdg~ 181 (1290)
..+.|. .+||+++|+|...+ ++|.| .|++-|+| .+|| ++..|+++||..|++|+...++|.| .|+.||+|.
T Consensus 3863 ~~d~C~-~npCqhgG~C~~~~~ggy~C-kCpsqysG--~~CEi~~epC~snPC~~GgtCip~~n~f~C-nC~~gyTG~-- 3935 (4289)
T KOG1219|consen 3863 LTDPCN-DNPCQHGGTCISQPKGGYKC-KCPSQYSG--NHCEIDLEPCASNPCLTGGTCIPFYNGFLC-NCPNGYTGK-- 3935 (4289)
T ss_pred cccccc-cCcccCCCEecCCCCCceEE-eCcccccC--cccccccccccCCCCCCCCEEEecCCCeeE-eCCCCccCc--
Confidence 349999 99999999999876 77999 99999999 8999 7889999999999999999999999 999999999
Q ss_pred Ccee--cccCCCCCCCCCCCCCCcceeeeccCCCCCceecCCCCCCCcCCCCccc
Q psy620 182 RCQR--IGGCSRNPCAQGKLNEKTRCVRCDDIPEHPYYRCGSCPEGTTGNGTRCH 234 (1290)
Q Consensus 182 ~C~~--ideC~~~pC~~g~~~~~~~Cg~C~~~~~~g~y~C~~C~~Gy~Gdg~~C~ 234 (1290)
+|+. +++|..++|.++ +.|+++. ++|.| .|.+||.| +.|.
T Consensus 3936 ~Ce~~Gi~eCs~n~C~~g--------g~C~n~~--gsf~C-ncT~g~~g--r~c~ 3977 (4289)
T KOG1219|consen 3936 RCEARGISECSKNVCGTG--------GQCINIP--GSFHC-NCTPGILG--RTCC 3977 (4289)
T ss_pred eeecccccccccccccCC--------ceeeccC--CceEe-ccChhHhc--ccCc
Confidence 9997 899999999999 9999986 68999 99999999 7774
No 9
>KOG1217|consensus
Probab=99.57 E-value=2.2e-13 Score=165.35 Aligned_cols=199 Identities=35% Similarity=0.762 Sum_probs=141.1
Q ss_pred CCCCCCCCCCCeeccCCCCcccccCCCCCCCCCCCceecccCCCCCC--CCCCCCCCcceeeeccCC-CCCceecCCCCC
Q psy620 148 TCNMRPCFQGVQCFDTVEGYTCGPCPSGYTGDGERCQRIGGCSRNPC--AQGKLNEKTRCVRCDDIP-EHPYYRCGSCPE 224 (1290)
Q Consensus 148 eC~~~pC~~gg~C~n~~g~y~C~~C~~Gy~Gdg~~C~~ideC~~~pC--~~g~~~~~~~Cg~C~~~~-~~g~y~C~~C~~ 224 (1290)
.|...+......|......|.| .|++||.+. .|.....|...+. ... +.|.... ....+.| .|..
T Consensus 91 ~~~~~~~~~~~~~~~~~~~~~c-~c~~g~~~~--~~~~~~~C~~~~~~~~~~--------~~c~~~~~~~~~~~c-~C~~ 158 (487)
T KOG1217|consen 91 PCRSPCLLLCGECVDCVGSYEC-TCPPGYQGT--PCEGECECVTGPGVCCID--------GSCSNGPGSVGPFRC-SCTE 158 (487)
T ss_pred cccCCcccCCccccCCCCCcee-eCCCccccC--cCCcceeecCCCCCeeCc--------hhhcCCCCCCCceee-eeCC
Confidence 4444455556677778889999 899999997 5554334665542 222 4566542 1347899 9999
Q ss_pred CCcCCCCccccC-CccCC-CCCCCCCcccccCCCCeecccCCCCCccCCCCccccccccccCCCCccc-----------C
Q psy620 225 GTTGNGTRCHDI-DECDL-AEPCDPRVQCTNLFPGYRCDPCPAGFTGSTGVQGVGLEHAVRFRQTCVD-----------I 291 (1290)
Q Consensus 225 Gy~Gdg~~C~di-deC~~-~~pC~~~g~C~n~~gsy~C~~C~~Gy~G~~Ce~~~~~~~~~~~~~~C~d-----------i 291 (1290)
||.+ ..|... ++|.. ..+|.+++.|.+..++|.|. |++||.+..|+.. .....|.. .
T Consensus 159 g~~~--~~~~~~~~~C~~~~~~c~~~~~C~~~~~~~~C~-c~~~~~~~~~~~~-------~~~~~c~~~~~~~~~~g~~~ 228 (487)
T KOG1217|consen 159 GYEG--EPCETDLDECIQYSSPCQNGGTCVNTGGSYLCS-CPPGYTGSTCETT-------GNGGTCVDSVACSCPPGARG 228 (487)
T ss_pred Cccc--ccccccccccccCCCCcCCCcccccCCCCeeEe-CCCCccCCcCcCC-------CCCceEecceeccCCCCCCC
Confidence 9999 667643 78873 55799999999999999999 9999999997753 01111211 2
Q ss_pred CCCCCCCCCCCCCC-CccccCCCCcEEcCcCcCCccccCCCCCCCCCCCCCCCCCCCCCCCCCCeEeeecCCceEEecCC
Q psy620 292 DECADGRNGGCDSN-SMCTNTEGSFTCTSLCRNSYMVRNVSVGCQSQNFGADVCPDGTRCDRNAKCTRILGNHYACKCDN 370 (1290)
Q Consensus 292 deC~~~~~g~C~~~-g~C~n~~gsy~C~~~C~~Gy~g~~~g~~C~~~~~~id~C~~~~~C~~~g~C~~~~~gsy~C~C~~ 370 (1290)
..|... ...|..+ ++|++..++|+| .|++||.+... ..|.. +++|.....|.++++|++. .+.|.|.|++
T Consensus 229 ~~c~~~-~~~~~~~~~~c~~~~~~~~C--~~~~g~~~~~~-~~~~~----~~~C~~~~~c~~~~~C~~~-~~~~~C~C~~ 299 (487)
T KOG1217|consen 229 PECEVS-IVECASGDGTCVNTVGSYTC--RCPEGYTGDAC-VTCVD----VDSCALIASCPNGGTCVNV-PGSYRCTCPP 299 (487)
T ss_pred CCcccc-cccccCCCCcccccCCceee--eCCCCcccccc-ceeee----ccccCCCCccCCCCeeecC-CCcceeeCCC
Confidence 233332 2234433 899999999999 99999986542 34555 6889865349999999998 6679999999
Q ss_pred CcccCCCCc
Q psy620 371 GWAGDGQFC 379 (1290)
Q Consensus 371 Gy~GdG~~C 379 (1290)
||+| ..|
T Consensus 300 g~~g--~~~ 306 (487)
T KOG1217|consen 300 GFTG--RLC 306 (487)
T ss_pred CCCC--CCC
Confidence 9999 555
No 10
>KOG1219|consensus
Probab=99.48 E-value=9.2e-14 Score=176.88 Aligned_cols=114 Identities=37% Similarity=0.913 Sum_probs=103.3
Q ss_pred CCCC-CCCCCCCCCCCCCeeccCC-CCcccccCCCCCCCCCCCcee-cccCCCCCCCCCCCCCCcceeeeccCCCCCcee
Q psy620 142 HCKP-GVTCNMRPCFQGVQCFDTV-EGYTCGPCPSGYTGDGERCQR-IGGCSRNPCAQGKLNEKTRCVRCDDIPEHPYYR 218 (1290)
Q Consensus 142 ~Ced-ideC~~~pC~~gg~C~n~~-g~y~C~~C~~Gy~Gdg~~C~~-ideC~~~pC~~g~~~~~~~Cg~C~~~~~~g~y~ 218 (1290)
-|.- .+.|..+||+++|+|...+ ++|.| .|++-|+|. .|+. +..|.++||..| ++|+... .+|.
T Consensus 3859 gC~l~~d~C~~npCqhgG~C~~~~~ggy~C-kCpsqysG~--~CEi~~epC~snPC~~G--------gtCip~~--n~f~ 3925 (4289)
T KOG1219|consen 3859 GCSLLTDPCNDNPCQHGGTCISQPKGGYKC-KCPSQYSGN--HCEIDLEPCASNPCLTG--------GTCIPFY--NGFL 3925 (4289)
T ss_pred cccccccccccCcccCCCEecCCCCCceEE-eCcccccCc--ccccccccccCCCCCCC--------CEEEecC--CCee
Confidence 3552 2789999999999999876 68999 999999999 9998 899999999999 9999875 5899
Q ss_pred cCCCCCCCcCCCCccc--cCCccCCCCCCCCCcccccCCCCeecccCCCCCccCCCC
Q psy620 219 CGSCPEGTTGNGTRCH--DIDECDLAEPCDPRVQCTNLFPGYRCDPCPAGFTGSTGV 273 (1290)
Q Consensus 219 C~~C~~Gy~Gdg~~C~--dideC~~~~pC~~~g~C~n~~gsy~C~~C~~Gy~G~~Ce 273 (1290)
| .|+.||+| .+|+ .++||. .++|.+++.|+|..|+|+|. |.+||.|..|.
T Consensus 3926 C-nC~~gyTG--~~Ce~~Gi~eCs-~n~C~~gg~C~n~~gsf~Cn-cT~g~~gr~c~ 3977 (4289)
T KOG1219|consen 3926 C-NCPNGYTG--KRCEARGISECS-KNVCGTGGQCINIPGSFHCN-CTPGILGRTCC 3977 (4289)
T ss_pred E-eCCCCccC--ceeecccccccc-cccccCCceeeccCCceEec-cChhHhcccCc
Confidence 9 99999999 8997 389998 89999999999999999999 99999998863
No 11
>KOG1217|consensus
Probab=99.44 E-value=1.4e-12 Score=158.28 Aligned_cols=264 Identities=31% Similarity=0.755 Sum_probs=191.9
Q ss_pred CCCCCCCCCCCCCeeecCCCCcccccCCCCcccCCCCCCCCCCCCCCC--CCCCCeeccCC---CCcccccCCCCCCCCC
Q psy620 106 PTCATDNPCFPGVECRDTREGPRCMRCPDGYVGDGIHCKPGVTCNMRP--CFQGVQCFDTV---EGYTCGPCPSGYTGDG 180 (1290)
Q Consensus 106 d~C~~~~pC~~gg~C~~~~g~y~C~~C~~Gy~Gdg~~CedideC~~~p--C~~gg~C~n~~---g~y~C~~C~~Gy~Gdg 180 (1290)
+.|. ..+...++.|......+.| .|++||.+ ..|+...+|...+ +...+.|.... ..|+| .|..||.+.
T Consensus 90 ~~~~-~~~~~~~~~~~~~~~~~~c-~c~~g~~~--~~~~~~~~C~~~~~~~~~~~~c~~~~~~~~~~~c-~C~~g~~~~- 163 (487)
T KOG1217|consen 90 PPCR-SPCLLLCGECVDCVGSYEC-TCPPGYQG--TPCEGECECVTGPGVCCIDGSCSNGPGSVGPFRC-SCTEGYEGE- 163 (487)
T ss_pred cccc-CCcccCCccccCCCCCcee-eCCCcccc--CcCCcceeecCCCCCeeCchhhcCCCCCCCceee-eeCCCcccc-
Confidence 4444 4444556677778889999 99999998 5666433677766 35666787754 58999 999999998
Q ss_pred CCceec-ccCCC--CCCCCCCCCCCcceeeeccCCCCCceecCCCCCCCcCCCCcccc--------------------CC
Q psy620 181 ERCQRI-GGCSR--NPCAQGKLNEKTRCVRCDDIPEHPYYRCGSCPEGTTGNGTRCHD--------------------ID 237 (1290)
Q Consensus 181 ~~C~~i-deC~~--~pC~~g~~~~~~~Cg~C~~~~~~g~y~C~~C~~Gy~Gdg~~C~d--------------------id 237 (1290)
.|... ++|.. .+|.++ +.|.+.. ++|.| .|++||.+ ..|+. ..
T Consensus 164 -~~~~~~~~C~~~~~~c~~~--------~~C~~~~--~~~~C-~c~~~~~~--~~~~~~~~~~~c~~~~~~~~~~g~~~~ 229 (487)
T KOG1217|consen 164 -PCETDLDECIQYSSPCQNG--------GTCVNTG--GSYLC-SCPPGYTG--STCETTGNGGTCVDSVACSCPPGARGP 229 (487)
T ss_pred -cccccccccccCCCCcCCC--------cccccCC--CCeeE-eCCCCccC--CcCcCCCCCceEecceeccCCCCCCCC
Confidence 77764 78884 569988 8898875 46999 99999997 33331 12
Q ss_pred ccCC-CCCCCCC-cccccCCCCeecccCCCCCccCCCCccccccccccCCCCcccCCCCCCCCCCCCCCCCccccCCCCc
Q psy620 238 ECDL-AEPCDPR-VQCTNLFPGYRCDPCPAGFTGSTGVQGVGLEHAVRFRQTCVDIDECADGRNGGCDSNSMCTNTEGSF 315 (1290)
Q Consensus 238 eC~~-~~pC~~~-g~C~n~~gsy~C~~C~~Gy~G~~Ce~~~~~~~~~~~~~~C~dideC~~~~~g~C~~~g~C~n~~gsy 315 (1290)
.|.. ...|... ++|++..++|+|. |++||++..+ ..|.++++|... .. |.++++|++..+.|
T Consensus 230 ~c~~~~~~~~~~~~~c~~~~~~~~C~-~~~g~~~~~~-------------~~~~~~~~C~~~-~~-c~~~~~C~~~~~~~ 293 (487)
T KOG1217|consen 230 ECEVSIVECASGDGTCVNTVGSYTCR-CPEGYTGDAC-------------VTCVDVDSCALI-AS-CPNGGTCVNVPGSY 293 (487)
T ss_pred CcccccccccCCCCcccccCCceeee-CCCCcccccc-------------ceeeeccccCCC-Cc-cCCCCeeecCCCcc
Confidence 3321 1234433 8999999999999 9999998862 346889999987 34 99999999999999
Q ss_pred EEcCcCcCCccccCCCCCCCCCCCCCCCCC---CCCCCCCCCeEee-ecCCceEEecCCCcccCCCCcCCcCCCCCCCCC
Q psy620 316 TCTSLCRNSYMVRNVSVGCQSQNFGADVCP---DGTRCDRNAKCTR-ILGNHYACKCDNGWAGDGQFCGRDTDLDGWPDY 391 (1290)
Q Consensus 316 ~C~~~C~~Gy~g~~~g~~C~~~~~~id~C~---~~~~C~~~g~C~~-~~~gsy~C~C~~Gy~GdG~~Ce~~~d~d~~~~~ 391 (1290)
+| .|++||.+..+ ..|.. ..+|. ....|.++++|.. ...+.+.|.|..||.| ..|+...
T Consensus 294 ~C--~C~~g~~g~~~-~~~~~----~~~C~~~~~~~~c~~g~~C~~~~~~~~~~C~c~~~~~g--~~C~~~~-------- 356 (487)
T KOG1217|consen 294 RC--TCPPGFTGRLC-TECVD----VDECSPRNAGGPCANGGTCNTLGSFGGFRCACGPGFTG--RRCEDSN-------- 356 (487)
T ss_pred ee--eCCCCCCCCCC-ccccc----cccccccccCCcCCCCcccccCCCCCCCCcCCCCCCCC--CccccCC--------
Confidence 99 99999987665 33444 46775 2467988889932 2245788999999888 8997432
Q ss_pred CCCCCCCCCCCC-ccccC-CCCCCCCccccCCCCCCCC
Q psy620 392 DLACPDRKCRKD-NCVHI-PNSGINNHADNCPRNANPD 427 (1290)
Q Consensus 392 ~~~C~~~~C~ng-~C~~~-~gs~~~~~~C~C~~Gy~G~ 427 (1290)
..|...+|.++ .|++. .+ .+.|.|+.+|.+.
T Consensus 357 -~~C~~~~~~~~~~c~~~~~~----~~~c~~~~~~~~~ 389 (487)
T KOG1217|consen 357 -DECASSPCCPGGTCVNETPG----SYRCACPAGFAGK 389 (487)
T ss_pred -ccccCCccccCCEeccCCCC----CeEecCCCccccC
Confidence 14555556444 78873 33 5679999999654
No 12
>KOG0994|consensus
Probab=99.11 E-value=5e-10 Score=138.00 Aligned_cols=221 Identities=29% Similarity=0.745 Sum_probs=132.6
Q ss_pred eeecCCCCcccccCCCCcccCCCCCCCCCCCCCCCCCCCC--------eecc--CCCCcccccCCCCCCCCCCCceeccc
Q psy620 119 ECRDTREGPRCMRCPDGYVGDGIHCKPGVTCNMRPCFQGV--------QCFD--TVEGYTCGPCPSGYTGDGERCQRIGG 188 (1290)
Q Consensus 119 ~C~~~~g~y~C~~C~~Gy~Gdg~~CedideC~~~pC~~gg--------~C~n--~~g~y~C~~C~~Gy~Gdg~~C~~ide 188 (1290)
.|.+...++.|.+|..||.|+.+. -....|.+.||..+- .|.- ......| .|.+||+|. +|+.
T Consensus 877 ~CqD~T~G~~CdrCl~GyyGdP~l-g~g~~CrPCpCP~gp~Sg~~~A~sC~~d~~t~~ivC-~C~~GY~G~--RCe~--- 949 (1758)
T KOG0994|consen 877 DCQDSTTGHSCDRCLDGYYGDPRL-GSGIGCRPCPCPDGPASGRQHADSCYLDTRTQQIVC-HCQEGYSGS--RCEI--- 949 (1758)
T ss_pred cccccccccchhhhhccccCCccc-CCCCCCCCCCCCCCCccchhccccccccccccceee-ecccCcccc--chhh---
Confidence 366777899999999999998432 223456666775442 3432 2235678 999999998 8863
Q ss_pred CCCCCCCCCCCCCCcceeeeccCCCCCceecCCCCCCCcCCCCccccCCccCC-CCCCCCCcccccCCCCeecccCCCCC
Q psy620 189 CSRNPCAQGKLNEKTRCVRCDDIPEHPYYRCGSCPEGTTGNGTRCHDIDECDL-AEPCDPRVQCTNLFPGYRCDPCPAGF 267 (1290)
Q Consensus 189 C~~~pC~~g~~~~~~~Cg~C~~~~~~g~y~C~~C~~Gy~Gdg~~C~dideC~~-~~pC~~~g~C~n~~gsy~C~~C~~Gy 267 (1290)
|..+-=.+. .+. ++|. .| .|....-- .+...|.. ...|. +|.....+-+|..|..||
T Consensus 950 CA~~~fGnP--~~G---GtCq--------~C-eC~~NiD~-----~d~~aCD~~TG~CL---kCL~hTeG~hCe~Ck~Gf 1007 (1758)
T KOG0994|consen 950 CADNHFGNP--SEG---GTCQ--------KC-ECSNNIDL-----YDPGACDVATGACL---KCLYHTEGDHCEHCKDGF 1007 (1758)
T ss_pred hcccccCCc--ccC---Cccc--------cc-cccCCcCc-----cCCCccchhhchhh---hhhhcccccchhhccccc
Confidence 544311111 000 3442 33 33322110 01122321 12332 466666777999999999
Q ss_pred ccCCCCccccccccccCCCCcc-------cCCCCCCCCCCCCCCCCccccCCCCcEEcCcCcCCccccCCCCCCCCCCCC
Q psy620 268 TGSTGVQGVGLEHAVRFRQTCV-------DIDECADGRNGGCDSNSMCTNTEGSFTCTSLCRNSYMVRNVSVGCQSQNFG 340 (1290)
Q Consensus 268 ~G~~Ce~~~~~~~~~~~~~~C~-------dideC~~~~~g~C~~~g~C~n~~gsy~C~~~C~~Gy~g~~~g~~C~~~~~~ 340 (1290)
+|.. ...+.+.|+ ..-.|... ++.|. |.+...+.+|. +|.+.++-...|..|+.
T Consensus 1008 ~GdA---------~~q~CqrC~Cn~LGTn~~~~CDr~-tGQCp----ClpNv~G~~CD-qCA~N~w~laSG~GCe~---- 1068 (1758)
T KOG0994|consen 1008 YGDA---------LRQNCQRCVCNFLGTNSTCHCDRF-TGQCP----CLPNVQGVRCD-QCAENHWNLASGEGCEP---- 1068 (1758)
T ss_pred hhHH---------HHhhhhhheccccccCCccccccc-cCcCC----CCccccccccc-ccccchhccccCCCCCc----
Confidence 9976 122223331 11345555 66676 88888889998 99998877777888886
Q ss_pred CCCCCCCCCCCC--CCeEeeecCCceEEecCCCcccCCCCcCCcCCCCCCCCCCCCCCCCCC
Q psy620 341 ADVCPDGTRCDR--NAKCTRILGNHYACKCDNGWAGDGQFCGRDTDLDGWPDYDLACPDRKC 400 (1290)
Q Consensus 341 id~C~~~~~C~~--~g~C~~~~~gsy~C~C~~Gy~GdG~~Ce~~~d~d~~~~~~~~C~~~~C 400 (1290)
|. |+. +-+|... . .+|.|++||-| +.|.+.-+ =.|.+....|..-.|
T Consensus 1069 ---C~----Cd~~~~pqCN~f-t--GQCqCkpGfGG--R~C~qCqe-l~WGdP~~~C~aCdC 1117 (1758)
T KOG0994|consen 1069 ---CN----CDPIGGPQCNEF-T--GQCQCKPGFGG--RTCSQCQE-LYWGDPNEKCRACDC 1117 (1758)
T ss_pred ---cC----CCccCCcccccc-c--cceeccCCCCC--cchhHHHH-hhcCCCCCCceecCC
Confidence 43 332 2367655 2 39999999999 88865433 245555566655444
No 13
>KOG4260|consensus
Probab=98.72 E-value=1.6e-08 Score=109.47 Aligned_cols=152 Identities=32% Similarity=0.836 Sum_probs=110.0
Q ss_pred ccccCCCCcccCCCCCCCCCCC---CCCCCCCCCeecc---CCCCcccccCCCCCCCCCCCceecccCCC----------
Q psy620 128 RCMRCPDGYVGDGIHCKPGVTC---NMRPCFQGVQCFD---TVEGYTCGPCPSGYTGDGERCQRIGGCSR---------- 191 (1290)
Q Consensus 128 ~C~~C~~Gy~Gdg~~CedideC---~~~pC~~gg~C~n---~~g~y~C~~C~~Gy~Gdg~~C~~ideC~~---------- 191 (1290)
.| ||+|-+| +.|. .| +..||..++.|.- ..|+-.| .|.+||+|. .|.. |..
T Consensus 130 vC--Cp~gtyG--pdCl---~Cpggser~C~GnG~C~GdGsR~GsGkC-kC~~GY~Gp--~C~~---Cg~eyfes~Rne~ 196 (350)
T KOG4260|consen 130 VC--CPDGTYG--PDCL---QCPGGSERPCFGNGSCHGDGSREGSGKC-KCETGYTGP--LCRY---CGIEYFESSRNEQ 196 (350)
T ss_pred ec--cCCCCcC--Cccc---cCCCCCcCCcCCCCcccCCCCCCCCCcc-cccCCCCCc--cccc---cchHHHHhhcccc
Confidence 56 9999988 6776 34 2368999999973 3467899 999999998 6643 211
Q ss_pred ----CCCCCCCCCCCccee-eeccCCCCCceecCCCCCCCcCCCCccccCCccCC-CCCCCCCcccccCCCCeecccCCC
Q psy620 192 ----NPCAQGKLNEKTRCV-RCDDIPEHPYYRCGSCPEGTTGNGTRCHDIDECDL-AEPCDPRVQCTNLFPGYRCDPCPA 265 (1290)
Q Consensus 192 ----~pC~~g~~~~~~~Cg-~C~~~~~~g~y~C~~C~~Gy~Gdg~~C~dideC~~-~~pC~~~g~C~n~~gsy~C~~C~~ 265 (1290)
..|+.+ |. .|... ++-.|..|..||..+...|.||+||.. +.+|.....|+|+.|+|.|. +++
T Consensus 197 ~lvCt~Ch~~-------C~~~Csg~---~~k~C~kCkkGW~lde~gCvDvnEC~~ep~~c~~~qfCvNteGSf~C~-dk~ 265 (350)
T KOG4260|consen 197 HLVCTACHEG-------CLGVCSGE---SSKGCSKCKKGWKLDEEGCVDVNECQNEPAPCKAHQFCVNTEGSFKCE-DKE 265 (350)
T ss_pred cchhhhhhhh-------hhcccCCC---CCCChhhhcccceecccccccHHHHhcCCCCCChhheeecCCCceEec-ccc
Confidence 123322 22 45543 244687899999988889999999984 67899999999999999999 999
Q ss_pred CCccCCCCccccccccccCCCCcccCCCCCCCCCCCC-CCCCccccCCCCcEEcCcCcCCcc
Q psy620 266 GFTGSTGVQGVGLEHAVRFRQTCVDIDECADGRNGGC-DSNSMCTNTEGSFTCTSLCRNSYM 326 (1290)
Q Consensus 266 Gy~G~~Ce~~~~~~~~~~~~~~C~dideC~~~~~g~C-~~~g~C~n~~gsy~C~~~C~~Gy~ 326 (1290)
||.+.. ++|..- ...| ..+..|.|+.++|+| .|..|+.
T Consensus 266 Gy~~g~--------------------d~C~~~-~d~~~~kn~~c~ni~~~~r~--v~f~~~~ 304 (350)
T KOG4260|consen 266 GYKKGV--------------------DECQFC-ADVCASKNRPCMNIDGQYRC--VCFSGLI 304 (350)
T ss_pred cccCCh--------------------HHhhhh-hhhcccCCCCcccCCccEEE--Eecccce
Confidence 997632 333320 0012 135678899999999 9999985
No 14
>PF02412 TSP_3: Thrombospondin type 3 repeat; InterPro: IPR003367 Thrombospondins are multimeric multidomain glycoproteins that function at cell surfaces and in the extracellular matrix milieu. They act as regulators of cell interactions in vertebrates. They are divided into two subfamilies, A and B, according to their overall molecular organisation. The subgroup A proteins TSP-1 and -2 contain an N-terminal domain, a VWFC domain, three TSP1 repeats, three EGF-like domains, TSP3 repeats and a C-terminal domain. They are assembled as trimers. The subgroup B thrombospondins, designated TSP-3, -4, and COMP (cartilage oligomeric matrix protein, also designated TSP-5) are distinct in that they contain unique N-terminal regions, lack the VWFC domain and TSP1 repeats, contain four copies of EGF-like domains, and are assembled as pentamers. EGF, TSP3 repeats and the C-terminal domain are thus the hallmark of a thrombospondin. This entry represents the type 3 thrombospondin repeat, and related repeats present in other types of protein.; GO: 0005509 calcium ion binding, 0007155 cell adhesion; PDB: 1UX6_A 3FBY_C 1YO8_A 2RHP_A.
Probab=98.58 E-value=2e-08 Score=78.26 Aligned_cols=36 Identities=61% Similarity=1.142 Sum_probs=30.5
Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCcCCCCCCCCCC
Q psy620 829 TDTDNDGTGDACDNDMDNDGINNHADNCPRNANPDQ 864 (1290)
Q Consensus 829 ~D~D~DG~~D~~d~D~D~DGi~d~~d~c~~~~n~~~ 864 (1290)
+|+|+|||||+|+.|.|+|||+|..|+||+++|+.|
T Consensus 1 ~D~D~dg~GD~C~~D~D~Dgi~d~~DnCP~~~n~~Q 36 (36)
T PF02412_consen 1 EDSDGDGIGDACDDDSDGDGIPDACDNCPNVPNPDQ 36 (36)
T ss_dssp --TTSSSS-GGGSSSTTSSSS-GGGHSSTTSTTTTS
T ss_pred CcccCCCCCcccccCCCCCcccCcccCCCCCCCCCC
Confidence 589999999999999999999999999999999876
No 15
>KOG1836|consensus
Probab=98.58 E-value=2.1e-06 Score=115.83 Aligned_cols=227 Identities=24% Similarity=0.533 Sum_probs=134.9
Q ss_pred eccCCCCcccccCCCCCCCCCCCceecccCCCCCCCCCCCCCCcceeeeccCCCCCceecCCCCCCCcCCCCcccc----
Q psy620 160 CFDTVEGYTCGPCPSGYTGDGERCQRIGGCSRNPCAQGKLNEKTRCVRCDDIPEHPYYRCGSCPEGTTGNGTRCHD---- 235 (1290)
Q Consensus 160 C~n~~g~y~C~~C~~Gy~Gdg~~C~~ideC~~~pC~~g~~~~~~~Cg~C~~~~~~g~y~C~~C~~Gy~Gdg~~C~d---- 235 (1290)
|.....+-+|..|..||+|.... .....|++.+|.++ +.|..+....+..|..|++||+| .+|+.
T Consensus 749 C~~~t~G~~C~~C~~GfYg~~~~-~~~~dC~~C~Cp~~--------~~~~~~~~~~~~iCk~Cp~gytG--~rCe~c~dg 817 (1705)
T KOG1836|consen 749 CKHNTFGGQCAQCVDGFYGLPDL-GTSGDCQPCPCPNG--------GACGQTPEILEVVCKNCPPGYTG--LRCEECADG 817 (1705)
T ss_pred cccCCCCCchhhhcCCCCCcccc-CCCCCCccCCCCCC--------hhhcCcCcccceecCCCCCCCcc--cccccCCCc
Confidence 55556677899999999987321 11233899999998 77877765667889339999998 67751
Q ss_pred -----------CCccCCCCCCCC-------------Cc---ccccCCCCeecccCCCCCccCCCCccccccccccCCCCc
Q psy620 236 -----------IDECDLAEPCDP-------------RV---QCTNLFPGYRCDPCPAGFTGSTGVQGVGLEHAVRFRQTC 288 (1290)
Q Consensus 236 -----------ideC~~~~pC~~-------------~g---~C~n~~gsy~C~~C~~Gy~G~~Ce~~~~~~~~~~~~~~C 288 (1290)
.-.|. +.+|.. .+ +|+....+.+|..|.+||.|..=. . ........|
T Consensus 818 yfg~p~~~~~~~~~c~-~c~c~~n~dp~~~g~c~~~tg~c~~ci~nT~g~~cd~c~~g~~gd~l~-~----~p~~~c~~c 891 (1705)
T KOG1836|consen 818 YFGNPLGHDGDVRPCQ-SCQCNFNVDPNAFGNCNRLTGECLKCIHNTAGEYCDLCKEGYFGDPLA-P----NPEDKCFAC 891 (1705)
T ss_pred cccCCCCCCCCcccCc-cceeccccCccccccccccccceeeccCCcccccccccccCccccccC-C----CcCCccccc
Confidence 11333 222321 11 355555667788899999887611 0 000011111
Q ss_pred --c------cCCCCCCCCCCCCCCCCccccCCCCcEEcCcCcCCccccCCCCCCCCCCCCCCCCCCCCCCCCCCeEeeec
Q psy620 289 --V------DIDECADGRNGGCDSNSMCTNTEGSFTCTSLCRNSYMVRNVSVGCQSQNFGADVCPDGTRCDRNAKCTRIL 360 (1290)
Q Consensus 289 --~------dideC~~~~~g~C~~~g~C~n~~gsy~C~~~C~~Gy~g~~~g~~C~~~~~~id~C~~~~~C~~~g~C~~~~ 360 (1290)
. ..-.|... ++.|. |.....+-.|. .|.+||.+.+.+..|+. ..|... =+.+..|..
T Consensus 892 ~c~p~gs~~~~~~c~~~-tGQce----c~~~v~g~~c~-~c~~g~fnl~s~~gC~~-----c~c~~~--gs~~~~c~~-- 956 (1705)
T KOG1836|consen 892 GCVPAGSELPSLTCNPV-TGQCE----CKPNVEGRDCL-YCFKGFFNLNSGVGCEP-----CNCDPT--GSESSDCDV-- 956 (1705)
T ss_pred cCccCCcccccccCCCc-cccee----ccCCCCccccc-cccccccccCCCCCccc-----cccccc--ccccccccc--
Confidence 1 12235544 45554 67777778887 89999988877777876 223211 111235653
Q ss_pred CCceEEecCCCcccCCCCcCCcCCCCCCCCCCCCCCCCCC-CCC----ccccCCCCCCCCccccCCCCCCCCC
Q psy620 361 GNHYACKCDNGWAGDGQFCGRDTDLDGWPDYDLACPDRKC-RKD----NCVHIPNSGINNHADNCPRNANPDQ 428 (1290)
Q Consensus 361 ~gsy~C~C~~Gy~GdG~~Ce~~~d~d~~~~~~~~C~~~~C-~ng----~C~~~~gs~~~~~~C~C~~Gy~G~~ 428 (1290)
++.+|.|++|.+| .+|..... ..+......|..-.| .+| .|... ..+|.|.+++.|..
T Consensus 957 -~tGqc~c~~gVtg--qrc~qc~~-~~~~~~~~gc~~c~c~~~Gs~~~qc~~~------~G~c~c~~~~~g~~ 1019 (1705)
T KOG1836|consen 957 -GTGQCYCRPGVTG--QRCDQCET-YHFGFQTEGCGLCECDPLGSRGFQCDPE------DGQCPCRPGFEGRR 1019 (1705)
T ss_pred -cCCceeeecCccc--cccCcccc-CcccccccCCcceecccCCcccceeccc------CCeeeecCCCCCcc
Confidence 3459999999999 88864322 222233345544444 223 24432 34589999986653
No 16
>KOG4260|consensus
Probab=98.56 E-value=9.7e-08 Score=103.49 Aligned_cols=156 Identities=31% Similarity=0.715 Sum_probs=105.1
Q ss_pred CCCCCCCCCCCceecccCCCCCCCCCCCCCCcceeeeccCC-CCCceecCCCCCCCcCCCCccccC---------Cc---
Q psy620 172 CPSGYTGDGERCQRIGGCSRNPCAQGKLNEKTRCVRCDDIP-EHPYYRCGSCPEGTTGNGTRCHDI---------DE--- 238 (1290)
Q Consensus 172 C~~Gy~Gdg~~C~~ideC~~~pC~~g~~~~~~~Cg~C~~~~-~~g~y~C~~C~~Gy~Gdg~~C~di---------de--- 238 (1290)
||.|-+|. .|..-..=...||... +.|..-. ..++-+| .|.+||+| ..|..- ++
T Consensus 132 Cp~gtyGp--dCl~Cpggser~C~Gn--------G~C~GdGsR~GsGkC-kC~~GY~G--p~C~~Cg~eyfes~Rne~~l 198 (350)
T KOG4260|consen 132 CPDGTYGP--DCLQCPGGSERPCFGN--------GSCHGDGSREGSGKC-KCETGYTG--PLCRYCGIEYFESSRNEQHL 198 (350)
T ss_pred cCCCCcCC--ccccCCCCCcCCcCCC--------CcccCCCCCCCCCcc-cccCCCCC--ccccccchHHHHhhcccccc
Confidence 88898887 6653111123567666 7776422 2356789 99999999 666411 11
Q ss_pred -cCC-CCCCCCCcccccCCCCeecccCCCCCccCCCCccccccccccCCCCcccCCCCCCCCCCCCCCCCccccCCCCcE
Q psy620 239 -CDL-AEPCDPRVQCTNLFPGYRCDPCPAGFTGSTGVQGVGLEHAVRFRQTCVDIDECADGRNGGCDSNSMCTNTEGSFT 316 (1290)
Q Consensus 239 -C~~-~~pC~~~g~C~n~~gsy~C~~C~~Gy~G~~Ce~~~~~~~~~~~~~~C~dideC~~~~~g~C~~~g~C~n~~gsy~ 316 (1290)
|.. ..+|. +.|... ++..|..|..||.... ..|.|||||... ..+|.....|+|+.|+|.
T Consensus 199 vCt~Ch~~C~--~~Csg~-~~k~C~kCkkGW~lde--------------~gCvDvnEC~~e-p~~c~~~qfCvNteGSf~ 260 (350)
T KOG4260|consen 199 VCTACHEGCL--GVCSGE-SSKGCSKCKKGWKLDE--------------EGCVDVNECQNE-PAPCKAHQFCVNTEGSFK 260 (350)
T ss_pred hhhhhhhhhh--cccCCC-CCCChhhhcccceecc--------------cccccHHHHhcC-CCCCChhheeecCCCceE
Confidence 210 12332 245432 3346877999998775 669999999988 577999999999999999
Q ss_pred EcCcCcCCccccCCCCCCCCCCCCCCCCCCC-CCC-CCCCeEeeecCCceEEecCCCcc
Q psy620 317 CTSLCRNSYMVRNVSVGCQSQNFGADVCPDG-TRC-DRNAKCTRILGNHYACKCDNGWA 373 (1290)
Q Consensus 317 C~~~C~~Gy~g~~~g~~C~~~~~~id~C~~~-~~C-~~~g~C~~~~~gsy~C~C~~Gy~ 373 (1290)
| .+++||... +++|..- ..| ..+..|.++ .++|+|+|..|+.
T Consensus 261 C--~dk~Gy~~g------------~d~C~~~~d~~~~kn~~c~ni-~~~~r~v~f~~~~ 304 (350)
T KOG4260|consen 261 C--EDKEGYKKG------------VDECQFCADVCASKNRPCMNI-DGQYRCVCFSGLI 304 (350)
T ss_pred e--cccccccCC------------hHHhhhhhhhcccCCCCcccC-CccEEEEecccce
Confidence 9 899999752 2334310 122 245678888 8899999999875
No 17
>PF02412 TSP_3: Thrombospondin type 3 repeat; InterPro: IPR003367 Thrombospondins are multimeric multidomain glycoproteins that function at cell surfaces and in the extracellular matrix milieu. They act as regulators of cell interactions in vertebrates. They are divided into two subfamilies, A and B, according to their overall molecular organisation. The subgroup A proteins TSP-1 and -2 contain an N-terminal domain, a VWFC domain, three TSP1 repeats, three EGF-like domains, TSP3 repeats and a C-terminal domain. They are assembled as trimers. The subgroup B thrombospondins, designated TSP-3, -4, and COMP (cartilage oligomeric matrix protein, also designated TSP-5) are distinct in that they contain unique N-terminal regions, lack the VWFC domain and TSP1 repeats, contain four copies of EGF-like domains, and are assembled as pentamers. EGF, TSP3 repeats and the C-terminal domain are thus the hallmark of a thrombospondin. This entry represents the type 3 thrombospondin repeat, and related repeats present in other types of protein.; GO: 0005509 calcium ion binding, 0007155 cell adhesion; PDB: 1UX6_A 3FBY_C 1YO8_A 2RHP_A.
Probab=98.55 E-value=2.5e-08 Score=77.72 Aligned_cols=35 Identities=57% Similarity=1.011 Sum_probs=21.1
Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCccccCcCCCCC
Q psy620 927 DNDRDGKGDECDPDLDGDGISNDEDNCRLIYNPNQ 961 (1290)
Q Consensus 927 D~D~Dg~~D~~d~D~D~DGi~d~~d~cp~~~n~~~ 961 (1290)
|+|+|||||+|+.|.|+|||+|..||||.++|+.|
T Consensus 2 D~D~dg~GD~C~~D~D~Dgi~d~~DnCP~~~n~~Q 36 (36)
T PF02412_consen 2 DSDGDGIGDACDDDSDGDGIPDACDNCPNVPNPDQ 36 (36)
T ss_dssp -TTSSSS-GGGSSSTTSSSS-GGGHSSTTSTTTTS
T ss_pred cccCCCCCcccccCCCCCcccCcccCCCCCCCCCC
Confidence 56666666666666666666666666666666654
No 18
>KOG1225|consensus
Probab=98.45 E-value=8.1e-07 Score=107.63 Aligned_cols=132 Identities=33% Similarity=0.869 Sum_probs=96.8
Q ss_pred cccccCCCCcccCCCCCCCCCCCCCCCCCCCCeeccCCCCcccccCCCCCCCCCCCceecccCCCCCCCCCCCCCCccee
Q psy620 127 PRCMRCPDGYVGDGIHCKPGVTCNMRPCFQGVQCFDTVEGYTCGPCPSGYTGDGERCQRIGGCSRNPCAQGKLNEKTRCV 206 (1290)
Q Consensus 127 y~C~~C~~Gy~Gdg~~CedideC~~~pC~~gg~C~n~~g~y~C~~C~~Gy~Gdg~~C~~ideC~~~pC~~g~~~~~~~Cg 206 (1290)
..| .|+.+|+| ..|+. -.|. ..|..++.|++. +| .|++||+|. .|.. -.|... |..+ +
T Consensus 234 ~ic-~c~~~~~g--~~c~~-~~C~-~~c~~~g~c~~G----~C-IC~~Gf~G~--dC~e-~~Cp~~-cs~~--------g 291 (525)
T KOG1225|consen 234 GIC-ECPEGYFG--PLCST-IYCP-GGCTGRGQCVEG----RC-ICPPGFTGD--DCDE-LVCPVD-CSGG--------G 291 (525)
T ss_pred cee-ecCCceeC--Ccccc-ccCC-CCCcccceEeCC----eE-eCCCCCcCC--CCCc-ccCCcc-cCCC--------c
Confidence 479 99999998 67772 2343 346666788865 69 999999998 8875 235444 7666 6
Q ss_pred eeccCCCCCceecCCCCCCCcCCCCccccCCccCCCCCCCCCcccccCCCCeecccCCCCCccCCCCccccccccccCCC
Q psy620 207 RCDDIPEHPYYRCGSCPEGTTGNGTRCHDIDECDLAEPCDPRVQCTNLFPGYRCDPCPAGFTGSTGVQGVGLEHAVRFRQ 286 (1290)
Q Consensus 207 ~C~~~~~~g~y~C~~C~~Gy~Gdg~~C~dideC~~~~pC~~~g~C~n~~gsy~C~~C~~Gy~G~~Ce~~~~~~~~~~~~~ 286 (1290)
.|++ + +| .|++||+| +.|+ +-+| ...|..+++|+ .+ +|. |.+||+|..|+.
T Consensus 292 ~~~~-----g-~C-iC~~g~~G--~dCs-~~~c--padC~g~G~Ci--~G--~C~-C~~Gy~G~~C~~------------ 342 (525)
T KOG1225|consen 292 VCVD-----G-EC-ICNPGYSG--KDCS-IRRC--PADCSGHGKCI--DG--ECL-CDEGYTGELCIQ------------ 342 (525)
T ss_pred eecC-----C-Ee-ecCCCccc--cccc-cccC--CccCCCCCccc--CC--ceE-eCCCCcCCcccc------------
Confidence 6654 2 89 99999999 8886 3346 47899999999 33 698 999999998541
Q ss_pred CcccCCCCCCCCCCCCCCCCccccCCCCcEEcCcCcCCccccC
Q psy620 287 TCVDIDECADGRNGGCDSNSMCTNTEGSFTCTSLCRNSYMVRN 329 (1290)
Q Consensus 287 ~C~dideC~~~~~g~C~~~g~C~n~~gsy~C~~~C~~Gy~g~~ 329 (1290)
.. |.+++.|++. | .|..||.|.+
T Consensus 343 ----~~---------C~~~g~cv~g-----C--~C~~Gw~G~d 365 (525)
T KOG1225|consen 343 ----RA---------CSGGGQCVNG-----C--KCKKGWRGPD 365 (525)
T ss_pred ----cc---------cCCCceeccC-----c--eeccCccCCC
Confidence 11 5556677653 7 8999998755
No 19
>KOG1225|consensus
Probab=98.38 E-value=1.7e-06 Score=104.85 Aligned_cols=131 Identities=32% Similarity=0.807 Sum_probs=94.3
Q ss_pred eecCCCCCCCcCCCCccccCCccCCCCCCCCCcccccCCCCeecccCCCCCccCCCCccccccccccCCCCcccCCCCCC
Q psy620 217 YRCGSCPEGTTGNGTRCHDIDECDLAEPCDPRVQCTNLFPGYRCDPCPAGFTGSTGVQGVGLEHAVRFRQTCVDIDECAD 296 (1290)
Q Consensus 217 y~C~~C~~Gy~Gdg~~C~dideC~~~~pC~~~g~C~n~~gsy~C~~C~~Gy~G~~Ce~~~~~~~~~~~~~~C~dideC~~ 296 (1290)
..| .|+.+|+| ..|+ .-.| ...|..++.|+.. +|. |++||+|..|.. -.|..
T Consensus 234 ~ic-~c~~~~~g--~~c~-~~~C--~~~c~~~g~c~~G----~CI-C~~Gf~G~dC~e-----------------~~Cp~ 285 (525)
T KOG1225|consen 234 GIC-ECPEGYFG--PLCS-TIYC--PGGCTGRGQCVEG----RCI-CPPGFTGDDCDE-----------------LVCPV 285 (525)
T ss_pred cee-ecCCceeC--Cccc-cccC--CCCCcccceEeCC----eEe-CCCCCcCCCCCc-----------------ccCCc
Confidence 378 89999999 7776 3345 3566666777765 698 999999999652 12332
Q ss_pred CCCCCCCCCCccccCCCCcEEcCcCcCCccccCCCCCCCCCCCCCCCCCCCCCCCCCCeEeeecCCceEEecCCCcccCC
Q psy620 297 GRNGGCDSNSMCTNTEGSFTCTSLCRNSYMVRNVSVGCQSQNFGADVCPDGTRCDRNAKCTRILGNHYACKCDNGWAGDG 376 (1290)
Q Consensus 297 ~~~g~C~~~g~C~n~~gsy~C~~~C~~Gy~g~~~g~~C~~~~~~id~C~~~~~C~~~g~C~~~~~gsy~C~C~~Gy~GdG 376 (1290)
. |..++.|++. .| .|++||+| +.|+. ..|. ..|+.+|.|+.. +|.|.+||+|
T Consensus 286 ~----cs~~g~~~~g----~C--iC~~g~~G----~dCs~-----~~cp--adC~g~G~Ci~G-----~C~C~~Gy~G-- 337 (525)
T KOG1225|consen 286 D----CSGGGVCVDG----EC--ICNPGYSG----KDCSI-----RRCP--ADCSGHGKCIDG-----ECLCDEGYTG-- 337 (525)
T ss_pred c----cCCCceecCC----Ee--ecCCCccc----ccccc-----ccCC--ccCCCCCcccCC-----ceEeCCCCcC--
Confidence 2 6556666654 69 99999984 55664 4465 689999999843 8999999999
Q ss_pred CCcCCcCCCCCCCCCCCCCCCCCCCCC-ccccCCCCCCCCccccCCCCCCCCC
Q psy620 377 QFCGRDTDLDGWPDYDLACPDRKCRKD-NCVHIPNSGINNHADNCPRNANPDQ 428 (1290)
Q Consensus 377 ~~Ce~~~d~d~~~~~~~~C~~~~C~ng-~C~~~~gs~~~~~~C~C~~Gy~G~~ 428 (1290)
..|+.. .|.++ .|++ . |.|..||.|..
T Consensus 338 ~~C~~~----------------~C~~~g~cv~--------g-C~C~~Gw~G~d 365 (525)
T KOG1225|consen 338 ELCIQR----------------ACSGGGQCVN--------G-CKCKKGWRGPD 365 (525)
T ss_pred Cccccc----------------ccCCCceecc--------C-ceeccCccCCC
Confidence 788521 25555 5664 2 99999997765
No 20
>KOG1836|consensus
Probab=97.75 E-value=0.00039 Score=94.71 Aligned_cols=108 Identities=22% Similarity=0.492 Sum_probs=70.1
Q ss_pred cccCCCCCccCCCCccccc----cccccCCCCcc------cCCCCCCCCCCCCCCCCccccCCCCcEEcCcCcCCccccC
Q psy620 260 CDPCPAGFTGSTGVQGVGL----EHAVRFRQTCV------DIDECADGRNGGCDSNSMCTNTEGSFTCTSLCRNSYMVRN 329 (1290)
Q Consensus 260 C~~C~~Gy~G~~Ce~~~~~----~~~~~~~~~C~------dideC~~~~~g~C~~~g~C~n~~gsy~C~~~C~~Gy~g~~ 329 (1290)
|. |+.||+|..||..... .........|. ..+.|... ++.|. |+....+-+|+ +|..||++..
T Consensus 697 c~-C~~g~tG~~Ce~C~~gfrr~~~~~~~~~~c~~C~cngh~~~Cd~~-tG~C~----C~~~t~G~~C~-~C~~GfYg~~ 769 (1705)
T KOG1836|consen 697 CT-CPVGYTGQFCESCAPGFRRLSPQLGPFCPCIPCDCNGHSNICDPR-TGQCK----CKHNTFGGQCA-QCVDGFYGLP 769 (1705)
T ss_pred cc-CCCCcccchhhhcchhhhcccccCCCCCcccccccCCccccccCC-CCcee----cccCCCCCchh-hhcCCCCCcc
Confidence 88 9999999999875532 11111112221 13567666 56664 77777777898 9999998764
Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCeEeeec-CCceEEe-cCCCcccCCCCcCCcCC
Q psy620 330 VSVGCQSQNFGADVCPDGTRCDRNAKCTRIL-GNHYACK-CDNGWAGDGQFCGRDTD 384 (1290)
Q Consensus 330 ~g~~C~~~~~~id~C~~~~~C~~~g~C~~~~-~gsy~C~-C~~Gy~GdG~~Ce~~~d 384 (1290)
....=. + |. .-+|.+++.|..+. .....|. |++||+| .+|+...+
T Consensus 770 ~~~~~~------d-C~-~C~Cp~~~~~~~~~~~~~~iCk~Cp~gytG--~rCe~c~d 816 (1705)
T KOG1836|consen 770 DLGTSG------D-CQ-PCPCPNGGACGQTPEILEVVCKNCPPGYTG--LRCEECAD 816 (1705)
T ss_pred ccCCCC------C-Cc-cCCCCCChhhcCcCcccceecCCCCCCCcc--cccccCCC
Confidence 322110 1 33 24477777776654 5678999 9999999 99986544
No 21
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site). A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes. Pattern: nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx (disulphide bonds: C1-C3, C2-C4, C5-C6), where 'n': negatively charged or polar residue [DEQN]; 'b': possibly beta-hydroxylated residue [DN]; 'a': aromatic amino acid; 'C': cysteine, involved in disulphide bond; 'x': any amino acid.; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=97.60 E-value=3e-05 Score=63.06 Aligned_cols=39 Identities=46% Similarity=0.997 Sum_probs=33.7
Q ss_pred cCCCCCCCCCCCCCCCCccccCCCCcEEcCcCcCCccccCCC
Q psy620 290 DIDECADGRNGGCDSNSMCTNTEGSFTCTSLCRNSYMVRNVS 331 (1290)
Q Consensus 290 dideC~~~~~g~C~~~g~C~n~~gsy~C~~~C~~Gy~g~~~g 331 (1290)
|||||+.. .+.|..+++|+|+.|+|+| .|++||+....+
T Consensus 1 DidEC~~~-~~~C~~~~~C~N~~Gsy~C--~C~~Gy~~~~~~ 39 (42)
T PF07645_consen 1 DIDECAEG-PHNCPENGTCVNTEGSYSC--SCPPGYELNDDG 39 (42)
T ss_dssp ESSTTTTT-SSSSSTTSEEEEETTEEEE--EESTTEEECTTS
T ss_pred CccccCCC-CCcCCCCCEEEcCCCCEEe--eCCCCcEECCCC
Confidence 68999998 5789999999999999999 999999744433
No 22
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the C terminus of malarial parasite merozoite surface protein 1, as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=97.45 E-value=9.3e-05 Score=58.08 Aligned_cols=31 Identities=48% Similarity=1.195 Sum_probs=25.7
Q ss_pred CCCCCCCeEeeecCCceEEecCCCcccCCCCc
Q psy620 348 TRCDRNAKCTRILGNHYACKCDNGWAGDGQFC 379 (1290)
Q Consensus 348 ~~C~~~g~C~~~~~gsy~C~C~~Gy~GdG~~C 379 (1290)
+.|+.+|+|+++ .++|+|+|++||.|||..|
T Consensus 6 ~~C~~nA~C~~~-~~~~~C~C~~Gy~GdG~~C 36 (36)
T PF12947_consen 6 GGCHPNATCTNT-GGSYTCTCKPGYEGDGFFC 36 (36)
T ss_dssp GGS-TTCEEEE--TTSEEEEE-CEEECCSTCE
T ss_pred CCCCCCcEeecC-CCCEEeECCCCCccCCcCC
Confidence 579999999999 6799999999999999876
No 23
>KOG1226|consensus
Probab=97.36 E-value=0.001 Score=82.48 Aligned_cols=99 Identities=20% Similarity=0.559 Sum_probs=64.2
Q ss_pred ecccCCCCCccCCCCccccccccccCCCCcccCCCCCCCC-CCCCCCCCccccCCCCcEEcCcCcCCccccCCCCCCCCC
Q psy620 259 RCDPCPAGFTGSTGVQGVGLEHAVRFRQTCVDIDECADGR-NGGCDSNSMCTNTEGSFTCTSLCRNSYMVRNVSVGCQSQ 337 (1290)
Q Consensus 259 ~C~~C~~Gy~G~~Ce~~~~~~~~~~~~~~C~dideC~~~~-~g~C~~~g~C~n~~gsy~C~~~C~~Gy~g~~~g~~C~~~ 337 (1290)
.|. |.+||.|..||-........ ...+.|.... ...|...|.|+=. +| +|.+...+.--|+.|+-.
T Consensus 479 ~C~-C~~G~~G~~CEC~~~~~ss~------~~~~~Cr~~~~~~vCSgrG~C~CG----qC--~C~~~~~~~i~G~fCECD 545 (783)
T KOG1226|consen 479 QCR-CDEGWLGKKCECSTDELSSS------EEEDKCRENSDSPVCSGRGDCVCG----QC--VCHKPDNGKIYGKFCECD 545 (783)
T ss_pred cee-cCCCCCCCcccCCccccCcH------hHHhhccCCCCCCCcCCCCcEeCC----ce--EecCCCCCceeeeeeecc
Confidence 477 99999999998433211110 1124454331 1258877888643 47 888766554456777753
Q ss_pred CCCCCCCCCC--CCCCCCCeEeeecCCceEEecCCCcccCCCCcC
Q psy620 338 NFGADVCPDG--TRCDRNAKCTRILGNHYACKCDNGWAGDGQFCG 380 (1290)
Q Consensus 338 ~~~id~C~~~--~~C~~~g~C~~~~~gsy~C~C~~Gy~GdG~~Ce 380 (1290)
.-.|... ..|..+|.|.-. +|+|.+||+| ..|+
T Consensus 546 ---nfsC~r~~g~lC~g~G~C~CG-----~CvC~~GwtG--~~C~ 580 (783)
T KOG1226|consen 546 ---NFSCERHKGVLCGGHGRCECG-----RCVCNPGWTG--SACN 580 (783)
T ss_pred ---CcccccccCcccCCCCeEeCC-----cEEcCCCCcc--CCCC
Confidence 2235432 679999999764 8999999999 7775
No 24
>KOG1226|consensus
Probab=97.07 E-value=0.0024 Score=79.46 Aligned_cols=137 Identities=28% Similarity=0.724 Sum_probs=82.6
Q ss_pred ccccCCCCCCCCCCCceec----------ccCCC----CCCCCCCCCCCcceeeeccCCCCCceecCCCCCCCcC--CCC
Q psy620 168 TCGPCPSGYTGDGERCQRI----------GGCSR----NPCAQGKLNEKTRCVRCDDIPEHPYYRCGSCPEGTTG--NGT 231 (1290)
Q Consensus 168 ~C~~C~~Gy~Gdg~~C~~i----------deC~~----~pC~~g~~~~~~~Cg~C~~~~~~g~y~C~~C~~Gy~G--dg~ 231 (1290)
.| .|.+||.|. .|+-. +.|.. .+|... |.|.-. +| +|.+...+ .|.
T Consensus 479 ~C-~C~~G~~G~--~CEC~~~~~ss~~~~~~Cr~~~~~~vCSgr--------G~C~CG------qC-~C~~~~~~~i~G~ 540 (783)
T KOG1226|consen 479 QC-RCDEGWLGK--KCECSTDELSSSEEEDKCRENSDSPVCSGR--------GDCVCG------QC-VCHKPDNGKIYGK 540 (783)
T ss_pred ce-ecCCCCCCC--cccCCccccCcHhHHhhccCCCCCCCcCCC--------CcEeCC------ce-EecCCCCCceeee
Confidence 47 899999998 66531 23332 145444 555432 56 67766552 137
Q ss_pred ccc-cCCccCC--CCCCCCCcccccCCCCeecccCCCCCccCCCCccccccccccCCCCc-ccCCCCCCCCCCCCCCCCc
Q psy620 232 RCH-DIDECDL--AEPCDPRVQCTNLFPGYRCDPCPAGFTGSTGVQGVGLEHAVRFRQTC-VDIDECADGRNGGCDSNSM 307 (1290)
Q Consensus 232 ~C~-dideC~~--~~pC~~~g~C~n~~gsy~C~~C~~Gy~G~~Ce~~~~~~~~~~~~~~C-~dideC~~~~~g~C~~~g~ 307 (1290)
.|+ +--.|.. ...|..+++|.=. +|. |.+||+|..|+ | .+.+.|....-..|...|+
T Consensus 541 fCECDnfsC~r~~g~lC~g~G~C~CG----~Cv-C~~GwtG~~C~--------------C~~std~C~~~~G~iCSGrG~ 601 (783)
T KOG1226|consen 541 FCECDNFSCERHKGVLCGGHGRCECG----RCV-CNPGWTGSACN--------------CPLSTDTCESSDGQICSGRGT 601 (783)
T ss_pred eeeccCcccccccCcccCCCCeEeCC----cEE-cCCCCccCCCC--------------CCCCCccccCCCCceeCCCce
Confidence 886 2223432 2358888887543 698 99999999976 3 4566676652234766677
Q ss_pred cccCCCCcEEcCcCcCC-ccccCCCCCCCCCCCCCCCCCCCCCCCCCCeEe
Q psy620 308 CTNTEGSFTCTSLCRNS-YMVRNVSVGCQSQNFGADVCPDGTRCDRNAKCT 357 (1290)
Q Consensus 308 C~n~~gsy~C~~~C~~G-y~g~~~g~~C~~~~~~id~C~~~~~C~~~g~C~ 357 (1290)
|.-. +| +|... |. +..|+. -..|. ++|..+..|+
T Consensus 602 C~Cg----~C--~C~~~~~s----G~~CE~----cptc~--~~C~~~~~Cv 636 (783)
T KOG1226|consen 602 CECG----RC--KCTDPPYS----GEFCEK----CPTCP--DPCAENKSCV 636 (783)
T ss_pred eeCC----ce--EcCCCCcC----cchhhc----CCCCC--Ccccccccch
Confidence 7543 46 77665 75 566765 23344 4566665554
No 25
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=97.00 E-value=0.00018 Score=54.95 Aligned_cols=28 Identities=46% Similarity=1.058 Sum_probs=26.1
Q ss_pred CCCCCCCceeccCC-CCCccccCCCCccCC
Q psy620 593 DNPCFPGVECRDTR-EGPRCMRCPDGYVGD 621 (1290)
Q Consensus 593 ~nPC~~g~~C~~~~-~g~~Cg~Cp~G~~Gd 621 (1290)
++||.|+++|++.. .+|+| .|++||+|.
T Consensus 3 ~~~C~n~g~C~~~~~~~y~C-~C~~G~~G~ 31 (32)
T PF00008_consen 3 SNPCQNGGTCIDLPGGGYTC-ECPPGYTGK 31 (32)
T ss_dssp TTSSTTTEEEEEESTSEEEE-EEBTTEEST
T ss_pred CCcCCCCeEEEeCCCCCEEe-ECCCCCccC
Confidence 68999999999999 89999 999999983
No 26
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site). A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes. Pattern: nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx (disulphide bonds: C1-C3, C2-C4, C5-C6), where 'n': negatively charged or polar residue [DEQN]; 'b': possibly beta-hydroxylated residue [DN]; 'a': aromatic amino acid; 'C': cysteine, involved in disulphide bond; 'x': any amino acid.; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=96.97 E-value=0.00096 Score=54.28 Aligned_cols=32 Identities=31% Similarity=0.941 Sum_probs=29.4
Q ss_pred CCCCCCC-CCCCCCCeEeeecCCceEEecCCCcc
Q psy620 341 ADVCPDG-TRCDRNAKCTRILGNHYACKCDNGWA 373 (1290)
Q Consensus 341 id~C~~~-~~C~~~g~C~~~~~gsy~C~C~~Gy~ 373 (1290)
|+||... +.|..++.|+++ .|+|+|.|++||.
T Consensus 2 idEC~~~~~~C~~~~~C~N~-~Gsy~C~C~~Gy~ 34 (42)
T PF07645_consen 2 IDECAEGPHNCPENGTCVNT-EGSYSCSCPPGYE 34 (42)
T ss_dssp SSTTTTTSSSSSTTSEEEEE-TTEEEEEESTTEE
T ss_pred ccccCCCCCcCCCCCEEEcC-CCCEEeeCCCCcE
Confidence 7899876 789989999999 9999999999998
No 27
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=96.73 E-value=0.0014 Score=51.61 Aligned_cols=35 Identities=46% Similarity=1.099 Sum_probs=30.2
Q ss_pred CCCCCC-CCCCCCCeeccCCCCcccccCCCCCC-CCCCCc
Q psy620 146 GVTCNM-RPCFQGVQCFDTVEGYTCGPCPSGYT-GDGERC 183 (1290)
Q Consensus 146 ideC~~-~pC~~gg~C~n~~g~y~C~~C~~Gy~-Gdg~~C 183 (1290)
+++|.. .+|.++++|+++.++|.| .|++||. |. .|
T Consensus 2 ~~~C~~~~~C~~~~~C~~~~g~~~C-~C~~g~~~g~--~C 38 (39)
T smart00179 2 IDECASGNPCQNGGTCVNTVGSYRC-ECPPGYTDGR--NC 38 (39)
T ss_pred cccCcCCCCcCCCCEeECCCCCeEe-ECCCCCccCC--cC
Confidence 678887 789999999999999999 9999998 54 65
No 28
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=96.60 E-value=0.00061 Score=52.13 Aligned_cols=29 Identities=48% Similarity=1.161 Sum_probs=24.2
Q ss_pred CCCCCCCCCCCeeecCC-CCcccccCCCCccc
Q psy620 108 CATDNPCFPGVECRDTR-EGPRCMRCPDGYVG 138 (1290)
Q Consensus 108 C~~~~pC~~gg~C~~~~-g~y~C~~C~~Gy~G 138 (1290)
|. ++||+++|+|++.. .+|+| .|++||+|
T Consensus 1 C~-~~~C~n~g~C~~~~~~~y~C-~C~~G~~G 30 (32)
T PF00008_consen 1 CS-SNPCQNGGTCIDLPGGGYTC-ECPPGYTG 30 (32)
T ss_dssp TT-TTSSTTTEEEEEESTSEEEE-EEBTTEES
T ss_pred CC-CCcCCCCeEEEeCCCCCEEe-ECCCCCcc
Confidence 44 67888888998887 88889 89999888
No 29
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=96.58 E-value=0.0022 Score=50.45 Aligned_cols=37 Identities=43% Similarity=1.003 Sum_probs=31.4
Q ss_pred CCCCCCCCCCCCCCeeecCCCCcccccCCCCcc-cCCCCCC
Q psy620 105 KPTCATDNPCFPGVECRDTREGPRCMRCPDGYV-GDGIHCK 144 (1290)
Q Consensus 105 ~d~C~~~~pC~~gg~C~~~~g~y~C~~C~~Gy~-Gdg~~Ce 144 (1290)
+++|....+|.++++|++..++|.| .|++||+ | ..|+
T Consensus 2 ~~~C~~~~~C~~~~~C~~~~g~~~C-~C~~g~~~g--~~C~ 39 (39)
T smart00179 2 IDECASGNPCQNGGTCVNTVGSYRC-ECPPGYTDG--RNCE 39 (39)
T ss_pred cccCcCCCCcCCCCEeECCCCCeEe-ECCCCCccC--CcCC
Confidence 5788822799999999999999999 9999999 6 6664
No 30
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=96.53 E-value=0.00082 Score=52.83 Aligned_cols=32 Identities=38% Similarity=0.999 Sum_probs=22.4
Q ss_pred CCCCCCCCeeecCCCCcccccCCCCcccCCCCC
Q psy620 111 DNPCFPGVECRDTREGPRCMRCPDGYVGDGIHC 143 (1290)
Q Consensus 111 ~~pC~~gg~C~~~~g~y~C~~C~~Gy~Gdg~~C 143 (1290)
...|+.+++|+++.++|.| .|++||+|+|..|
T Consensus 5 ~~~C~~nA~C~~~~~~~~C-~C~~Gy~GdG~~C 36 (36)
T PF12947_consen 5 NGGCHPNATCTNTGGSYTC-TCKPGYEGDGFFC 36 (36)
T ss_dssp GGGS-TTCEEEE-TTSEEE-EE-CEEECCSTCE
T ss_pred CCCCCCCcEeecCCCCEEe-ECCCCCccCCcCC
Confidence 4567888888888888888 8888888877655
No 31
>PF06247 Plasmod_Pvs28: Plasmodium ookinete surface protein Pvs28; InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=96.47 E-value=0.00065 Score=71.46 Aligned_cols=133 Identities=29% Similarity=0.776 Sum_probs=84.9
Q ss_pred CceecCCCCCCCcC-CCCccccCCccCC----CCCCCCCcccccCC-----CCeecccCCCCCccCCCCccccccccccC
Q psy620 215 PYYRCGSCPEGTTG-NGTRCHDIDECDL----AEPCDPRVQCTNLF-----PGYRCDPCPAGFTGSTGVQGVGLEHAVRF 284 (1290)
Q Consensus 215 g~y~C~~C~~Gy~G-dg~~C~dideC~~----~~pC~~~g~C~n~~-----gsy~C~~C~~Gy~G~~Ce~~~~~~~~~~~ 284 (1290)
..|.| .|.+||.. +..+|+...+|.. ..+|...++|++.. ..|.|. |.+||....
T Consensus 18 NHfEC-~Cnegfvl~~EntCE~kv~C~~~e~~~K~Cgdya~C~~~~~~~~~~~~~C~-C~~gY~~~~------------- 82 (197)
T PF06247_consen 18 NHFEC-KCNEGFVLKNENTCEEKVECDKLENVNKPCGDYAKCINQANKGEERAYKCD-CINGYILKQ------------- 82 (197)
T ss_dssp SEEEE-EESTTEEEEETTEEEE----SG-GGTTSEEETTEEEEE-SSTTSSTSEEEE-E-TTEEESS-------------
T ss_pred CceEE-EcCCCcEEccccccccceecCcccccCccccchhhhhcCCCcccceeEEEe-cccCceeeC-------------
Confidence 36899 99999983 4578988888863 35799999998765 569999 999999876
Q ss_pred CCCcccCCCCCCCCCCCCCCCCccccC---CCCcEEcCcCcCCccccCCCCCCCCCCCCCCCCCCCCCCCCCCeEeeecC
Q psy620 285 RQTCVDIDECADGRNGGCDSNSMCTNT---EGSFTCTSLCRNSYMVRNVSVGCQSQNFGADVCPDGTRCDRNAKCTRILG 361 (1290)
Q Consensus 285 ~~~C~dideC~~~~~g~C~~~g~C~n~---~gsy~C~~~C~~Gy~g~~~g~~C~~~~~~id~C~~~~~C~~~g~C~~~~~ 361 (1290)
..|.. .+|... .|. .|.|+-. +....| +|.-|+. ..+...|... +.-.|+ -.|..+-+|... .
T Consensus 83 -~vCvp-~~C~~~---~Cg-~GKCI~d~~~~~~~~C--SC~IGkV-~~dn~kCtk~--G~T~C~--LKCk~nE~CK~~-~ 148 (197)
T PF06247_consen 83 -GVCVP-NKCNNK---DCG-SGKCILDPDNPNNPTC--SCNIGKV-PDDNKKCTKT--GETKCS--LKCKENEECKLV-D 148 (197)
T ss_dssp -SSEEE-GGGSS------T-TEEEEEEEGGGSEEEE--EE-TEEE-TTTTTESEEE--E----------TTTEEEEEE-T
T ss_pred -CeEch-hhcCce---ecC-CCeEEecCCCCCCcee--EeeeceE-eccCCcccCC--Ccccee--eecCCCcceeee-C
Confidence 34533 456644 387 6789743 334599 9999997 4455667652 234576 468888899998 8
Q ss_pred CceEEecCCCcccCC
Q psy620 362 NHYACKCDNGWAGDG 376 (1290)
Q Consensus 362 gsy~C~C~~Gy~GdG 376 (1290)
+-|+|.|..||.+++
T Consensus 149 ~~Y~C~~~~~~~~~~ 163 (197)
T PF06247_consen 149 GYYKCVCKEGFPGDG 163 (197)
T ss_dssp TEEEEEE-TT-EEET
T ss_pred cEEEeecCCCCCCCC
Confidence 899999999998743
No 32
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=96.19 E-value=0.0025 Score=45.38 Aligned_cols=22 Identities=36% Similarity=0.777 Sum_probs=19.8
Q ss_pred CCccccCCCCcc--CCCccccCCCC
Q psy620 608 GPRCMRCPDGYV--GDGIHCKPGVT 630 (1290)
Q Consensus 608 g~~Cg~Cp~G~~--Gdg~~C~dide 630 (1290)
+|+| .|++||. .+|.+|+||||
T Consensus 1 sy~C-~C~~Gy~l~~d~~~C~DIdE 24 (24)
T PF12662_consen 1 SYTC-SCPPGYQLSPDGRSCEDIDE 24 (24)
T ss_pred CEEe-eCCCCCcCCCCCCccccCCC
Confidence 5889 8999998 68999999997
No 33
>PF06247 Plasmod_Pvs28: Plasmodium ookinete surface protein Pvs28; InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=96.15 E-value=0.00086 Score=70.56 Aligned_cols=141 Identities=19% Similarity=0.567 Sum_probs=87.4
Q ss_pred eeccCCCCcccccCCCCCCC-CCCCceecccCCC-----CCCCCCCCCCCcceeeeccCCC---CCceecCCCCCCCcCC
Q psy620 159 QCFDTVEGYTCGPCPSGYTG-DGERCQRIGGCSR-----NPCAQGKLNEKTRCVRCDDIPE---HPYYRCGSCPEGTTGN 229 (1290)
Q Consensus 159 ~C~n~~g~y~C~~C~~Gy~G-dg~~C~~ideC~~-----~pC~~g~~~~~~~Cg~C~~~~~---~g~y~C~~C~~Gy~Gd 229 (1290)
..+...+.|.| .|.+||.- +..+|+...+|.. .+|..- ++|+.... ...|+| .|.+||...
T Consensus 12 ~LiQMSNHfEC-~Cnegfvl~~EntCE~kv~C~~~e~~~K~Cgdy--------a~C~~~~~~~~~~~~~C-~C~~gY~~~ 81 (197)
T PF06247_consen 12 YLIQMSNHFEC-KCNEGFVLKNENTCEEKVECDKLENVNKPCGDY--------AKCINQANKGEERAYKC-DCINGYILK 81 (197)
T ss_dssp EEEEESSEEEE-EESTTEEEEETTEEEE----SG-GGTTSEEETT--------EEEEE-SSTTSSTSEEE-EE-TTEEES
T ss_pred EEEEccCceEE-EcCCCcEEccccccccceecCcccccCccccch--------hhhhcCCCcccceeEEE-ecccCceee
Confidence 55555678999 99999973 2337998777865 478887 88987652 357999 999999965
Q ss_pred CCccccCCccCCCCCCCCCcccccC---CCCeecccCCCCCccCCCCccccccccccCCCCcc--cCCCCCCCCCCCCCC
Q psy620 230 GTRCHDIDECDLAEPCDPRVQCTNL---FPGYRCDPCPAGFTGSTGVQGVGLEHAVRFRQTCV--DIDECADGRNGGCDS 304 (1290)
Q Consensus 230 g~~C~dideC~~~~pC~~~g~C~n~---~gsy~C~~C~~Gy~G~~Ce~~~~~~~~~~~~~~C~--dideC~~~~~g~C~~ 304 (1290)
...|. ..+|. .-.|. .+.|+-. .....|+ |.-|+... +...|+ -..+|+.. |..
T Consensus 82 ~~vCv-p~~C~-~~~Cg-~GKCI~d~~~~~~~~CS-C~IGkV~~-------------dn~kCtk~G~T~C~LK----Ck~ 140 (197)
T PF06247_consen 82 QGVCV-PNKCN-NKDCG-SGKCILDPDNPNNPTCS-CNIGKVPD-------------DNKKCTKTGETKCSLK----CKE 140 (197)
T ss_dssp SSSEE-EGGGS-S---T-TEEEEEEEGGGSEEEEE-E-TEEETT-------------TTTESEEEE------------TT
T ss_pred CCeEc-hhhcC-ceecC-CCeEEecCCCCCCceeE-eeeceEec-------------cCCcccCCCccceeee----cCC
Confidence 56776 35676 66787 5889732 2345899 99999822 234563 33567766 888
Q ss_pred CCccccCCCCcEEcCcCcCCccccCCCC
Q psy620 305 NSMCTNTEGSFTCTSLCRNSYMVRNVSV 332 (1290)
Q Consensus 305 ~g~C~n~~gsy~C~~~C~~Gy~g~~~g~ 332 (1290)
+..|....+-|+| .|..||.+...+.
T Consensus 141 nE~CK~~~~~Y~C--~~~~~~~~~~~~~ 166 (197)
T PF06247_consen 141 NEECKLVDGYYKC--VCKEGFPGDGEGE 166 (197)
T ss_dssp TEEEEEETTEEEE--EE-TT-EEETTT-
T ss_pred CcceeeeCcEEEe--ecCCCCCCCCCcc
Confidence 8999999999999 9999997665443
No 34
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=95.96 E-value=0.0069 Score=47.01 Aligned_cols=35 Identities=46% Similarity=1.107 Sum_probs=29.5
Q ss_pred CCCCCC-CCCCCCCeeccCCCCcccccCCCCCCCCCCCc
Q psy620 146 GVTCNM-RPCFQGVQCFDTVEGYTCGPCPSGYTGDGERC 183 (1290)
Q Consensus 146 ideC~~-~pC~~gg~C~n~~g~y~C~~C~~Gy~Gdg~~C 183 (1290)
+++|.. .+|.+++.|++..++|+| .|++||.|. +|
T Consensus 2 ~~~C~~~~~C~~~~~C~~~~~~~~C-~C~~g~~g~--~C 37 (38)
T cd00054 2 IDECASGNPCQNGGTCVNTVGSYRC-SCPPGYTGR--NC 37 (38)
T ss_pred cccCCCCCCcCCCCEeECCCCCeEe-ECCCCCcCC--cC
Confidence 567877 789888899999999999 899999886 55
No 35
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=95.93 E-value=0.008 Score=46.64 Aligned_cols=36 Identities=44% Similarity=1.015 Sum_probs=31.1
Q ss_pred CCCCCCC-CCCCCCCeeecCCCCcccccCCCCcccCCCCCC
Q psy620 105 KPTCATD-NPCFPGVECRDTREGPRCMRCPDGYVGDGIHCK 144 (1290)
Q Consensus 105 ~d~C~~~-~pC~~gg~C~~~~g~y~C~~C~~Gy~Gdg~~Ce 144 (1290)
+++|. . .+|.+++.|++..++|+| .|++||+| ..|+
T Consensus 2 ~~~C~-~~~~C~~~~~C~~~~~~~~C-~C~~g~~g--~~C~ 38 (38)
T cd00054 2 IDECA-SGNPCQNGGTCVNTVGSYRC-SCPPGYTG--RNCE 38 (38)
T ss_pred cccCC-CCCCcCCCCEeECCCCCeEe-ECCCCCcC--CcCC
Confidence 47787 5 799999999999999999 99999998 5663
No 36
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=95.92 E-value=0.0041 Score=44.31 Aligned_cols=22 Identities=50% Similarity=1.159 Sum_probs=19.3
Q ss_pred CCccCCCCCCCC--CCCCccCCCCC
Q psy620 698 YYRCGSCPEGTT--GNGTRCHDIDE 720 (1290)
Q Consensus 698 ~y~C~~C~~Gy~--Gng~~C~~~~~ 720 (1290)
||+|. |++||. .+|.+|.||+|
T Consensus 1 sy~C~-C~~Gy~l~~d~~~C~DIdE 24 (24)
T PF12662_consen 1 SYTCS-CPPGYQLSPDGRSCEDIDE 24 (24)
T ss_pred CEEee-CCCCCcCCCCCCccccCCC
Confidence 69997 999997 68888999886
No 37
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=95.77 E-value=0.004 Score=48.94 Aligned_cols=36 Identities=39% Similarity=0.983 Sum_probs=27.9
Q ss_pred CCCCCCCCCCCCCccccCCCCcEEcCcCcCCccccCCCCCC
Q psy620 294 CADGRNGGCDSNSMCTNTEGSFTCTSLCRNSYMVRNVSVGC 334 (1290)
Q Consensus 294 C~~~~~g~C~~~g~C~n~~gsy~C~~~C~~Gy~g~~~g~~C 334 (1290)
|+.. ++.|. ..|++++++|+| .|++||++..++++|
T Consensus 1 C~~~-NGgC~--h~C~~~~g~~~C--~C~~Gy~L~~D~~tC 36 (36)
T PF14670_consen 1 CSVN-NGGCS--HICVNTPGSYRC--SCPPGYKLAEDGRTC 36 (36)
T ss_dssp CTTG-GGGSS--SEEEEETTSEEE--E-STTEEE-TTSSSE
T ss_pred CCCC-CCCcC--CCCccCCCceEe--ECCCCCEECcCCCCC
Confidence 3444 66787 689999999999 999999998887765
No 38
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) and present in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=94.89 E-value=0.026 Score=42.96 Aligned_cols=28 Identities=50% Similarity=1.169 Sum_probs=19.8
Q ss_pred CCCCCCCCeeccCCCCcccccCCCCCCCC
Q psy620 151 MRPCFQGVQCFDTVEGYTCGPCPSGYTGD 179 (1290)
Q Consensus 151 ~~pC~~gg~C~n~~g~y~C~~C~~Gy~Gd 179 (1290)
..+|.++++|++..++|+| .|+.||.|.
T Consensus 5 ~~~C~~~~~C~~~~~~~~C-~C~~g~~g~ 32 (36)
T cd00053 5 SNPCSNGGTCVNTPGSYRC-VCPPGYTGD 32 (36)
T ss_pred CCCCCCCCEEecCCCCeEe-ECCCCCccc
Confidence 4567666777777777777 777777665
No 39
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) and present in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=94.61 E-value=0.037 Score=42.13 Aligned_cols=31 Identities=48% Similarity=1.073 Sum_probs=26.7
Q ss_pred CCCCCCCCeeecCCCCcccccCCCCcccCCCCC
Q psy620 111 DNPCFPGVECRDTREGPRCMRCPDGYVGDGIHC 143 (1290)
Q Consensus 111 ~~pC~~gg~C~~~~g~y~C~~C~~Gy~Gdg~~C 143 (1290)
..+|.++++|++..++|.| .|+.||.|. ..|
T Consensus 5 ~~~C~~~~~C~~~~~~~~C-~C~~g~~g~-~~C 35 (36)
T cd00053 5 SNPCSNGGTCVNTPGSYRC-VCPPGYTGD-RSC 35 (36)
T ss_pred CCCCCCCCEEecCCCCeEe-ECCCCCccc-CCc
Confidence 5689999999999999999 999999994 244
No 40
>smart00181 EGF Epidermal growth factor-like domain.
Probab=94.38 E-value=0.044 Score=42.24 Aligned_cols=26 Identities=46% Similarity=1.113 Sum_probs=19.6
Q ss_pred CCCCCCCeeecCCCCcccccCCCCcccC
Q psy620 112 NPCFPGVECRDTREGPRCMRCPDGYVGD 139 (1290)
Q Consensus 112 ~pC~~gg~C~~~~g~y~C~~C~~Gy~Gd 139 (1290)
.+|.++ +|++..++|+| .|++||+|.
T Consensus 6 ~~C~~~-~C~~~~~~~~C-~C~~g~~g~ 31 (35)
T smart00181 6 GPCSNG-TCINTPGSYTC-SCPPGYTGD 31 (35)
T ss_pred CCCCCC-EEECCCCCeEe-ECCCCCccC
Confidence 577777 78777777888 788888773
No 41
>smart00181 EGF Epidermal growth factor-like domain.
Probab=94.28 E-value=0.041 Score=42.38 Aligned_cols=33 Identities=55% Similarity=1.358 Sum_probs=27.6
Q ss_pred CCCC-CCCCCCCeeccCCCCcccccCCCCCCCCCCCc
Q psy620 148 TCNM-RPCFQGVQCFDTVEGYTCGPCPSGYTGDGERC 183 (1290)
Q Consensus 148 eC~~-~pC~~gg~C~n~~g~y~C~~C~~Gy~Gdg~~C 183 (1290)
+|.. .+|.++ +|++..++|+| .|++||.|. ..|
T Consensus 1 ~C~~~~~C~~~-~C~~~~~~~~C-~C~~g~~g~-~~C 34 (35)
T smart00181 1 ECASGGPCSNG-TCINTPGSYTC-SCPPGYTGD-KRC 34 (35)
T ss_pred CCCCcCCCCCC-EEECCCCCeEe-ECCCCCccC-Ccc
Confidence 3566 689998 99999999999 999999994 255
No 42
>KOG1218|consensus
Probab=91.54 E-value=3.1 Score=48.29 Aligned_cols=193 Identities=24% Similarity=0.538 Sum_probs=95.4
Q ss_pred CcccccCCCCcccCCCCCCCCCCCCCCCCCCCCeeccCCCCcccccCCCCCCCCCCCceecccCC--CCCCCCCCCCCCc
Q psy620 126 GPRCMRCPDGYVGDGIHCKPGVTCNMRPCFQGVQCFDTVEGYTCGPCPSGYTGDGERCQRIGGCS--RNPCAQGKLNEKT 203 (1290)
Q Consensus 126 ~y~C~~C~~Gy~Gdg~~CedideC~~~pC~~gg~C~n~~g~y~C~~C~~Gy~Gdg~~C~~ideC~--~~pC~~g~~~~~~ 203 (1290)
...| .|.++|+|. ..+.....+. +|... |........| .+..+|.+. .|.....+. ...|...
T Consensus 14 ~~~c-~c~~~~~g~-~~~~~~~~~~--~~~~~--~~~~~~~~~~-~~~~~~~~~--~c~~~~~~~~~~~~c~~~------ 78 (316)
T KOG1218|consen 14 SGQC-FCDPGYTGR-LQCEHQAVTS--ACSGI--CPCEVNSGEC-GLGYGFVGS--VCRIECVCGNAGGGCSQP------ 78 (316)
T ss_pred CCce-ecCCCcccc-ccccCCCCCc--ccccc--CCccCCceeE-ecccccCCC--ccccccccCCCCCcccCc------
Confidence 4578 899999993 2222111111 11111 1112234567 778888877 555422221 1223333
Q ss_pred ceeeeccCCCCCceecCCC-CCCCcCCCCccccCCccCCCCCCCCCcccccCCCCeecccCCCCCccCCCCccccccccc
Q psy620 204 RCVRCDDIPEHPYYRCGSC-PEGTTGNGTRCHDIDECDLAEPCDPRVQCTNLFPGYRCDPCPAGFTGSTGVQGVGLEHAV 282 (1290)
Q Consensus 204 ~Cg~C~~~~~~g~y~C~~C-~~Gy~Gdg~~C~dideC~~~~pC~~~g~C~n~~gsy~C~~C~~Gy~G~~Ce~~~~~~~~~ 282 (1290)
..|........+.. .| ..+|.+ ..|+...+|... |.. .+|.+... .|. |..+|.+..|.. ..
T Consensus 79 --~~c~~~~~~~~~~~-~~~~~~~~g--~~C~~~~~~~~~--c~~-~~C~~~~~--~c~-~~~~~~~~~C~~------~~ 141 (316)
T KOG1218|consen 79 --CRCKNGGTCVSSTG-YCHLNGYEG--PQCESPCPCGDG--CAE-KTCANPRR--ECR-CGGGYIGEQCGE------EN 141 (316)
T ss_pred --cccCCCCcccCCCC-cccCCCCCc--ccccCCCCcCCc--ccc-cccCCCcc--cee-cCCcCccccccc------cC
Confidence 23333221122333 45 577777 778766666422 443 45655443 576 888888877653 01
Q ss_pred cCCCCcccCCCCCCCCCCCCCCCCccccCCCCcEEcCcCcCCccccCCCCCCCCCCCCCCCCCCCCCCCCCCeEeeecCC
Q psy620 283 RFRQTCVDIDECADGRNGGCDSNSMCTNTEGSFTCTSLCRNSYMVRNVSVGCQSQNFGADVCPDGTRCDRNAKCTRILGN 362 (1290)
Q Consensus 283 ~~~~~C~dideC~~~~~g~C~~~g~C~n~~gsy~C~~~C~~Gy~g~~~g~~C~~~~~~id~C~~~~~C~~~g~C~~~~~g 362 (1290)
..+..|....++ . ..+... .-.| .|++||.+.. |... ...|.....|.+++.|... .
T Consensus 142 ~~g~~C~~~c~~--------~--~~~~~~--~~~c--~c~~g~~g~~----~~~~---~~~c~~~~~~~~g~~C~~~-~- 198 (316)
T KOG1218|consen 142 LVGLKCQRDCQC--------T--GGCDCK--NGIC--TCQPGFVGVF----CVES---CSGCSPLTACENGAKCNRS-T- 198 (316)
T ss_pred CCCCCccCCCCC--------c--cccCCC--CCce--eccCCccccc----cccc---CCCcCCCcccCCCCeeecc-c-
Confidence 112223222111 1 111111 2257 8999998544 4331 1116655678888888765 2
Q ss_pred ceEEecCCCccc
Q psy620 363 HYACKCDNGWAG 374 (1290)
Q Consensus 363 sy~C~C~~Gy~G 374 (1290)
..+.+.+++.+
T Consensus 199 -~~~~~~~~~~~ 209 (316)
T KOG1218|consen 199 -GSCLCYPGPSG 209 (316)
T ss_pred -cccccCCCCcc
Confidence 25566665543
No 43
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=88.43 E-value=0.49 Score=37.37 Aligned_cols=29 Identities=31% Similarity=0.860 Sum_probs=21.2
Q ss_pred CCCCCCCeEeeecCCceEEecCCCccc--CCCCc
Q psy620 348 TRCDRNAKCTRILGNHYACKCDNGWAG--DGQFC 379 (1290)
Q Consensus 348 ~~C~~~g~C~~~~~gsy~C~C~~Gy~G--dG~~C 379 (1290)
..|++ .|+++ .++|+|.|++||.- |+++|
T Consensus 6 GgC~h--~C~~~-~g~~~C~C~~Gy~L~~D~~tC 36 (36)
T PF14670_consen 6 GGCSH--ICVNT-PGSYRCSCPPGYKLAEDGRTC 36 (36)
T ss_dssp GGSSS--EEEEE-TTSEEEE-STTEEE-TTSSSE
T ss_pred CCcCC--CCccC-CCceEeECCCCCEECcCCCCC
Confidence 34655 79998 78999999999975 44544
No 44
>KOG1218|consensus
Probab=87.97 E-value=11 Score=43.72 Aligned_cols=102 Identities=26% Similarity=0.673 Sum_probs=57.3
Q ss_pred cC-CCCcccCCCCCCCCCCCCCCCCCCCCeeccCCCCcccccCCCCCCCCCCCcee----cccCCCCCCCCCCCCCCcce
Q psy620 131 RC-PDGYVGDGIHCKPGVTCNMRPCFQGVQCFDTVEGYTCGPCPSGYTGDGERCQR----IGGCSRNPCAQGKLNEKTRC 205 (1290)
Q Consensus 131 ~C-~~Gy~Gdg~~CedideC~~~pC~~gg~C~n~~g~y~C~~C~~Gy~Gdg~~C~~----ideC~~~pC~~g~~~~~~~C 205 (1290)
.| ..+|.| ..|+...+|... |.. -+|.+... .| .|..+|.+. .|.. ...|... |...
T Consensus 93 ~~~~~~~~g--~~C~~~~~~~~~-c~~-~~C~~~~~--~c-~~~~~~~~~--~C~~~~~~g~~C~~~-c~~~-------- 154 (316)
T KOG1218|consen 93 YCHLNGYEG--PQCESPCPCGDG-CAE-KTCANPRR--EC-RCGGGYIGE--QCGEENLVGLKCQRD-CQCT-------- 154 (316)
T ss_pred cccCCCCCc--ccccCCCCcCCc-ccc-cccCCCcc--ce-ecCCcCccc--cccccCCCCCCccCC-CCCc--------
Confidence 55 678888 788866666544 333 45554432 46 666666655 5543 1112111 1111
Q ss_pred eeeccCCCCCceecCCCCCCCcCCCCccccCCc-cCCCCCCCCCcccccCCCC
Q psy620 206 VRCDDIPEHPYYRCGSCPEGTTGNGTRCHDIDE-CDLAEPCDPRVQCTNLFPG 257 (1290)
Q Consensus 206 g~C~~~~~~g~y~C~~C~~Gy~Gdg~~C~dide-C~~~~pC~~~g~C~n~~gs 257 (1290)
..+... .-.| .|.+||.+ ..|+.... |.....|.+++.|....+.
T Consensus 155 ~~~~~~----~~~c-~c~~g~~g--~~~~~~~~~c~~~~~~~~g~~C~~~~~~ 200 (316)
T KOG1218|consen 155 GGCDCK----NGIC-TCQPGFVG--VFCVESCSGCSPLTACENGAKCNRSTGS 200 (316)
T ss_pred cccCCC----CCce-eccCCccc--ccccccCCCcCCCcccCCCCeeeccccc
Confidence 112211 2368 89999999 67754332 6655678887788776664
No 45
>PF12661 hEGF: Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=87.30 E-value=0.36 Score=29.56 Aligned_cols=13 Identities=46% Similarity=1.528 Sum_probs=10.2
Q ss_pred EEecCCCcccCCCCc
Q psy620 365 ACKCDNGWAGDGQFC 379 (1290)
Q Consensus 365 ~C~C~~Gy~GdG~~C 379 (1290)
+|+|++||+| .+|
T Consensus 1 ~C~C~~G~~G--~~C 13 (13)
T PF12661_consen 1 TCQCPPGWTG--PNC 13 (13)
T ss_dssp EEEE-TTEET--TTT
T ss_pred CccCcCCCcC--CCC
Confidence 5999999999 665
No 46
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=85.44 E-value=0.81 Score=35.19 Aligned_cols=27 Identities=30% Similarity=0.806 Sum_probs=21.8
Q ss_pred CCCCCCCeEeeecCCceEEecCCCcccCCCCc
Q psy620 348 TRCDRNAKCTRILGNHYACKCDNGWAGDGQFC 379 (1290)
Q Consensus 348 ~~C~~~g~C~~~~~gsy~C~C~~Gy~GdG~~C 379 (1290)
..|.++|+|+.. ..+|+|.+||+| ..|
T Consensus 6 ~~C~~~G~C~~~---~g~C~C~~g~~G--~~C 32 (32)
T PF07974_consen 6 NICSGHGTCVSP---CGRCVCDSGYTG--PDC 32 (32)
T ss_pred CccCCCCEEeCC---CCEEECCCCCcC--CCC
Confidence 468899999864 359999999999 554
No 47
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coil domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=85.37 E-value=0.71 Score=51.36 Aligned_cols=43 Identities=30% Similarity=0.647 Sum_probs=34.7
Q ss_pred CCCCcccCCCCCCCCCCCCCCCCccccCCCCcEEcCcCcCCccccCCC
Q psy620 284 FRQTCVDIDECADGRNGGCDSNSMCTNTEGSFTCTSLCRNSYMVRNVS 331 (1290)
Q Consensus 284 ~~~~C~dideC~~~~~g~C~~~g~C~n~~gsy~C~~~C~~Gy~g~~~g 331 (1290)
....|.++++|... ++.|. ..|.++.|+|.| .|++||++...+
T Consensus 180 ~~~~C~~~~~C~~~-~~~c~--~~C~~~~g~~~c--~c~~g~~~~~~~ 222 (224)
T cd01475 180 QGKICVVPDLCATL-SHVCQ--QVCISTPGSYLC--ACTEGYALLEDN 222 (224)
T ss_pred ccccCcCchhhcCC-CCCcc--ceEEcCCCCEEe--ECCCCccCCCCC
Confidence 35668888999876 56787 589999999999 999999865433
No 48
>KOG3514|consensus
Probab=83.22 E-value=1.8 Score=56.08 Aligned_cols=34 Identities=32% Similarity=0.880 Sum_probs=30.7
Q ss_pred CCCCCCCCCCCCeeecCCCCcccccCC-CCcccCCCCCC
Q psy620 107 TCATDNPCFPGVECRDTREGPRCMRCP-DGYVGDGIHCK 144 (1290)
Q Consensus 107 ~C~~~~pC~~gg~C~~~~g~y~C~~C~-~Gy~Gdg~~Ce 144 (1290)
.|. ++||+|+|+|......|.| .|. .||.| +.|+
T Consensus 625 ~C~-~nPC~N~g~C~egwNrfiC-DCs~T~~~G--~~Ce 659 (1591)
T KOG3514|consen 625 ICE-SNPCQNGGKCSEGWNRFIC-DCSGTGFEG--RTCE 659 (1591)
T ss_pred ccC-CCcccCCCCcccccccccc-ccccCcccC--cccc
Confidence 788 9999999999999999999 896 67888 8888
No 49
>KOG3512|consensus
Probab=82.69 E-value=2.2 Score=51.11 Aligned_cols=116 Identities=20% Similarity=0.391 Sum_probs=60.5
Q ss_pred ccccCCCC-eecccCCCCCccCCCCcccccccccc-------------------CCCCcccCCCCCCCCCCCCCCCCccc
Q psy620 250 QCTNLFPG-YRCDPCPAGFTGSTGVQGVGLEHAVR-------------------FRQTCVDIDECADGRNGGCDSNSMCT 309 (1290)
Q Consensus 250 ~C~n~~gs-y~C~~C~~Gy~G~~Ce~~~~~~~~~~-------------------~~~~C~dideC~~~~~g~C~~~g~C~ 309 (1290)
.|+-...+ ++|. |...-+|+.|+.+...-..++ ..+.|.---|+-.. .+.+. +++|+
T Consensus 286 ~Cv~d~~~~ltCd-C~HNTaGPdCgrCKpfy~dRPW~raT~~~a~~c~ac~Cn~harrcrfn~Ely~l-Sgr~S-ggvCl 362 (592)
T KOG3512|consen 286 RCVMDESSHLTCD-CEHNTAGPDCGRCKPFYYDRPWGRATALPANECVACNCNGHARRCRFNMELYRL-SGRRS-GGVCL 362 (592)
T ss_pred eeeeccCCceEEe-cccCCCCCCcccccccccCCCccccccCCCccccccccchhhhhcccchhhhcc-cCccc-cceEe
Confidence 56544444 7888 888888888776554322111 11111111122221 22233 45565
Q ss_pred c---CCCCcEEcCcCcCCccccCCCCCCCCCCCCCCCCCCCCCCCC----CCeEeeecCCceEEecCCCcccCCCCcC
Q psy620 310 N---TEGSFTCTSLCRNSYMVRNVSVGCQSQNFGADVCPDGTRCDR----NAKCTRILGNHYACKCDNGWAGDGQFCG 380 (1290)
Q Consensus 310 n---~~gsy~C~~~C~~Gy~g~~~g~~C~~~~~~id~C~~~~~C~~----~g~C~~~~~gsy~C~C~~Gy~GdG~~Ce 380 (1290)
| ...+..|. .|++||+-. .++.=.. ...|. .-.|++ +-+|..+ +.+|.|++|.+| .+|.
T Consensus 363 nCrHnTaGrhCh-yCreGyyRd-~s~pl~h----rkaCk-~CdChpVGs~gktCNq~---tGqCpCkeGvtG--~tCn 428 (592)
T KOG3512|consen 363 NCRHNTAGRHCH-YCREGYYRD-GSKPLTH----RKACK-ACDCHPVGSAGKTCNQT---TGQCPCKEGVTG--LTCN 428 (592)
T ss_pred ecccCCCCcccc-cccCccccC-CCCCCch----hhhhh-hcCCccccccccccccc---CCcccCCCCCcc--cccc
Confidence 4 34456786 899999633 2221111 11222 112443 4478755 349999999999 8884
No 50
>PF00683 TB: TB domain; InterPro: IPR002212 Transforming growth factor beta (TGF-beta)-binding protein-like (TB) domain comes from human fibrillin-1[]. This domain is found in fibrillins and latent TGF-beta-binding proteins (LTBPs) which are localized to fibrillar structures in the extracellular matrix [].; GO: 0005488 binding; PDB: 2W86_A 1UZJ_B 1UZQ_A 1UZK_A 1UZP_A 1APJ_A 1KSQ_A.
Probab=82.61 E-value=0.089 Score=42.97 Aligned_cols=22 Identities=27% Similarity=0.477 Sum_probs=15.4
Q ss_pred cCCCCCCCCCCCCCCCCCCCCc
Q psy620 465 DSDHDGIGDACDNCPRVSNPEQ 486 (1290)
Q Consensus 465 c~~~~~~G~~C~~Cp~~~~~~~ 486 (1290)
|+.+.+||.+|+.||...+.+|
T Consensus 18 Cs~G~aWG~~Ce~CP~~~t~ef 39 (42)
T PF00683_consen 18 CSVGRAWGSPCEPCPPPGTDEF 39 (42)
T ss_dssp TTT-SEETTTTEE---TTSHHH
T ss_pred CCCCCcCCCccccCCCCCChHH
Confidence 7889999999999999877655
No 51
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=82.59 E-value=1.2 Score=34.34 Aligned_cols=24 Identities=38% Similarity=0.785 Sum_probs=19.3
Q ss_pred CCCCCCCeeecCCCCcccccCCCCccc
Q psy620 112 NPCFPGVECRDTREGPRCMRCPDGYVG 138 (1290)
Q Consensus 112 ~pC~~gg~C~~~~g~y~C~~C~~Gy~G 138 (1290)
..|.++|+|+.. ..+| .|.+||+|
T Consensus 6 ~~C~~~G~C~~~--~g~C-~C~~g~~G 29 (32)
T PF07974_consen 6 NICSGHGTCVSP--CGRC-VCDSGYTG 29 (32)
T ss_pred CccCCCCEEeCC--CCEE-ECCCCCcC
Confidence 468888999866 3489 99999998
No 52
>KOG3516|consensus
Probab=81.55 E-value=3.2 Score=54.93 Aligned_cols=35 Identities=31% Similarity=0.894 Sum_probs=27.9
Q ss_pred CCCCCCCCCCCCCeeecCCCCcccccCC-CCcccCCCCCC
Q psy620 106 PTCATDNPCFPGVECRDTREGPRCMRCP-DGYVGDGIHCK 144 (1290)
Q Consensus 106 d~C~~~~pC~~gg~C~~~~g~y~C~~C~-~Gy~Gdg~~Ce 144 (1290)
--|+ +.+|.|||+|+....+|.| -|. ..|.| ..|.
T Consensus 956 GhCs-s~~C~NGG~Cvery~gytC-DCs~Tay~G--p~Cs 991 (1306)
T KOG3516|consen 956 GHCS-SYPCLNGGHCVERYDGYTC-DCSRTAYDG--PFCS 991 (1306)
T ss_pred cccc-cccccCCCEEEEecCceee-ccccCcCCC--Cccc
Confidence 4577 7799999999999999999 886 44666 5665
No 53
>PF12946 EGF_MSP1_1: MSP1 EGF domain 1; InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=81.40 E-value=1.1 Score=35.54 Aligned_cols=32 Identities=34% Similarity=0.690 Sum_probs=23.8
Q ss_pred CCCCCCCeEeeecCCceEEecCCCcccCCCCc
Q psy620 348 TRCDRNAKCTRILGNHYACKCDNGWAGDGQFC 379 (1290)
Q Consensus 348 ~~C~~~g~C~~~~~gsy~C~C~~Gy~GdG~~C 379 (1290)
..|-.+|.|.+...|+++|+|..||..+|..|
T Consensus 5 ~~cP~NA~C~~~~dG~eecrCllgyk~~~~~C 36 (37)
T PF12946_consen 5 TKCPANAGCFRYDDGSEECRCLLGYKKVGGKC 36 (37)
T ss_dssp S---TTEEEEEETTSEEEEEE-TTEEEETTEE
T ss_pred ccCCCCcccEEcCCCCEEEEeeCCccccCCCc
Confidence 56778999999867999999999998766555
No 54
>smart00051 DSL delta serrate ligand.
Probab=79.47 E-value=2.3 Score=38.03 Aligned_cols=44 Identities=20% Similarity=0.517 Sum_probs=30.4
Q ss_pred cCcCCccccCCCCCCCCCCCCCCCCCCCCCCCCCCeEeeecCCceEEecCCCcccCCCCc
Q psy620 320 LCRNSYMVRNVSVGCQSQNFGADVCPDGTRCDRNAKCTRILGNHYACKCDNGWAGDGQFC 379 (1290)
Q Consensus 320 ~C~~Gy~g~~~g~~C~~~~~~id~C~~~~~C~~~g~C~~~~~gsy~C~C~~Gy~GdG~~C 379 (1290)
.|.++|.|..+.+.|.. .+.+..+.+|... ..|+|.+||+| ..|
T Consensus 20 ~C~~~~yG~~C~~~C~~----------~~d~~~~~~Cd~~----G~~~C~~Gw~G--~~C 63 (63)
T smart00051 20 TCDENYYGEGCNKFCRP----------RDDFFGHYTCDEN----GNKGCLEGWMG--PYC 63 (63)
T ss_pred eCCCCCcCCccCCEeCc----------CccccCCccCCcC----CCEecCCCCcC--CCC
Confidence 79999998776666654 1234456667432 37899999999 554
No 55
>smart00682 G2F G2 nidogen domain and fibulin.
Probab=77.85 E-value=2.1 Score=47.58 Aligned_cols=67 Identities=15% Similarity=0.171 Sum_probs=50.8
Q ss_pred eecccccccccccCccceEEEEeeeeccccceeeeecccCCCcccc---hhhhhhhcccccccceeeeeeccccccccc
Q psy620 21 VEGWSVKDDLLDDGVINGLLLGVKQDIMGARYTLYMDCVDHGTVAM---TQSLKKMFDSMKNPQMRLRKTDEESVDEIE 96 (1290)
Q Consensus 21 ~~~~s~~~~~~~~~~~~~l~~~~~~~i~G~~~~ly~~C~~~~~~~~---~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~ 96 (1290)
...++.+.+.+ ....++|+++|+|+ |..|++...... ..++.++|+.|...+..||+++.+.+.+..
T Consensus 153 l~s~str~~~v---~~~~~~y~~~Q~I~------y~~C~~~~~~~~~~~~l~Vs~I~~~Y~~~e~~LR~a~~n~i~~~~ 222 (227)
T smart00682 153 LTTSSTREYTV---DNQTHSYTVDQTIT------FEECQHRDAFPPTTQQLHVSSVFVDYNDEERVLRFAAHNSVGPGD 222 (227)
T ss_pred EEEEEeeEEEE---ccEEEeEEEeEEEE------ecccCCCCCCCCcceEEEEEEEEEEecCchhheeeeeeeeecCCC
Confidence 34556666655 45688999999988 999998875532 233459999999999999999988776654
No 56
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coil domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=76.95 E-value=1.7 Score=48.24 Aligned_cols=38 Identities=24% Similarity=0.432 Sum_probs=30.4
Q ss_pred CCCCCCCCCCCCCCCCCCeeccCCCCcccccCCCCCCCC
Q psy620 141 IHCKPGVTCNMRPCFQGVQCFDTVEGYTCGPCPSGYTGD 179 (1290)
Q Consensus 141 ~~CedideC~~~pC~~gg~C~n~~g~y~C~~C~~Gy~Gd 179 (1290)
..|+++++|...+......|.++.|+|.| .|++||+..
T Consensus 182 ~~C~~~~~C~~~~~~c~~~C~~~~g~~~c-~c~~g~~~~ 219 (224)
T cd01475 182 KICVVPDLCATLSHVCQQVCISTPGSYLC-ACTEGYALL 219 (224)
T ss_pred ccCcCchhhcCCCCCccceEEcCCCCEEe-ECCCCccCC
Confidence 67888889976443333589999999999 999999854
No 57
>KOG3516|consensus
Probab=71.96 E-value=9.6 Score=50.73 Aligned_cols=39 Identities=31% Similarity=0.763 Sum_probs=34.1
Q ss_pred cccCCCCCCCCCCCCCCeeecCCCCcccccCC-CCcccCCCCCC
Q psy620 102 IVKKPTCATDNPCFPGVECRDTREGPRCMRCP-DGYVGDGIHCK 144 (1290)
Q Consensus 102 ~~~~d~C~~~~pC~~gg~C~~~~g~y~C~~C~-~Gy~Gdg~~Ce 144 (1290)
|.-++.|. ++||+++|.|..+...|.| .|. .||+| .+|+
T Consensus 542 C~i~drCl-PN~CehgG~C~Qs~~~f~C-~C~~TGY~G--atCH 581 (1306)
T KOG3516|consen 542 CGISDRCL-PNPCEHGGKCSQSWDDFEC-NCELTGYKG--ATCH 581 (1306)
T ss_pred cccccccC-CccccCCCcccccccceeE-ecccccccc--cccc
Confidence 44567888 9999999999998899999 999 99999 6777
No 58
>PF01683 EB: EB module; InterPro: IPR006149 The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO
Probab=69.80 E-value=7.5 Score=32.97 Aligned_cols=33 Identities=24% Similarity=0.725 Sum_probs=23.2
Q ss_pred CCCCCCCCCCCCCeEeeecCCceEEecCCCcccCCCCc
Q psy620 342 DVCPDGTRCDRNAKCTRILGNHYACKCDNGWAGDGQFC 379 (1290)
Q Consensus 342 d~C~~~~~C~~~g~C~~~~~gsy~C~C~~Gy~GdG~~C 379 (1290)
..|.....|..++.|++. +|.|++||+-.+.+|
T Consensus 20 ~~C~~~~qC~~~s~C~~g-----~C~C~~g~~~~~~~C 52 (52)
T PF01683_consen 20 ESCESDEQCIGGSVCVNG-----RCQCPPGYVEVGGRC 52 (52)
T ss_pred CCCCCcCCCCCcCEEcCC-----EeECCCCCEecCCCC
Confidence 346555677788899654 999999997643443
No 59
>PF12946 EGF_MSP1_1: MSP1 EGF domain 1; InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=68.98 E-value=0.92 Score=35.94 Aligned_cols=34 Identities=29% Similarity=0.689 Sum_probs=19.6
Q ss_pred CCCCCCCCCCeeccCC-CCcccccCCCCCCCCCCCc
Q psy620 149 CNMRPCFQGVQCFDTV-EGYTCGPCPSGYTGDGERC 183 (1290)
Q Consensus 149 C~~~pC~~gg~C~n~~-g~y~C~~C~~Gy~Gdg~~C 183 (1290)
|...+|..++.|.+.. |++.| +|..||...+..|
T Consensus 2 C~~~~cP~NA~C~~~~dG~eec-rCllgyk~~~~~C 36 (37)
T PF12946_consen 2 CIDTKCPANAGCFRYDDGSEEC-RCLLGYKKVGGKC 36 (37)
T ss_dssp -SSS---TTEEEEEETTSEEEE-EE-TTEEEETTEE
T ss_pred ccCccCCCCcccEEcCCCCEEE-EeeCCccccCCCc
Confidence 4556677777887765 77888 8888886443344
No 60
>KOG3512|consensus
Probab=68.78 E-value=8.6 Score=46.37 Aligned_cols=16 Identities=38% Similarity=0.802 Sum_probs=11.9
Q ss_pred CCCcccccCCCCCCCC
Q psy620 164 VEGYTCGPCPSGYTGD 179 (1290)
Q Consensus 164 ~g~y~C~~C~~Gy~Gd 179 (1290)
..+-.|..|.+||+-+
T Consensus 368 TaGrhChyCreGyyRd 383 (592)
T KOG3512|consen 368 TAGRHCHYCREGYYRD 383 (592)
T ss_pred CCCcccccccCccccC
Confidence 4456787899999855
No 61
>PTZ00214 high cysteine membrane protein Group 4; Provisional
Probab=64.80 E-value=1.2e+02 Score=40.56 Aligned_cols=85 Identities=26% Similarity=0.693 Sum_probs=43.1
Q ss_pred eeecCCCCcccccCCCCcccC-CCCCCCCCCCCCCCCCCCCeeccC-------CCCcccccCCCCCCCCCCCceecccCC
Q psy620 119 ECRDTREGPRCMRCPDGYVGD-GIHCKPGVTCNMRPCFQGVQCFDT-------VEGYTCGPCPSGYTGDGERCQRIGGCS 190 (1290)
Q Consensus 119 ~C~~~~g~y~C~~C~~Gy~Gd-g~~CedideC~~~pC~~gg~C~n~-------~g~y~C~~C~~Gy~Gdg~~C~~ideC~ 190 (1290)
+|....+...|..|..||... +..|. ..|.... .+.|..- .++=.| .|++||+.....|.
T Consensus 366 tC~~~~~~~tCt~C~~gyl~~~g~sC~--~~C~~~~---~~~Ct~c~~g~~~~~~~C~c-~C~~G~y~~~g~C~------ 433 (800)
T PTZ00214 366 TCGYNSGAVTCTRCSAGYLGVDGKSCS--ESCSGDT---RGVCTKVAEGSESTEVSCRC-VCKPTFYNSSGTCT------ 433 (800)
T ss_pred cccCCCCCcccccccCCcCcCCCCccc--ccCCCCC---CCcccccccccccccCcccc-cCCCCcccCCCCcc------
Confidence 444333335688888888642 23453 2332211 1223211 112245 68999885433453
Q ss_pred CCCCCCCCCCCCcceeeeccCCCCCceecCCCCCCCc
Q psy620 191 RNPCAQGKLNEKTRCVRCDDIPEHPYYRCGSCPEGTT 227 (1290)
Q Consensus 191 ~~pC~~g~~~~~~~Cg~C~~~~~~g~y~C~~C~~Gy~ 227 (1290)
+|+.. |.+|... ....|..|++||.
T Consensus 434 --~C~~s-------Ca~C~~~---~~~~CtsC~~g~~ 458 (800)
T PTZ00214 434 --PCTDS-------CAVCKDG---TPTGCQQCSPGKI 458 (800)
T ss_pred --CCCCc-------ccccCCC---CcCcCccCCCCcE
Confidence 34332 4666543 2446878999985
No 62
>smart00051 DSL delta serrate ligand.
Probab=61.63 E-value=10 Score=33.95 Aligned_cols=47 Identities=21% Similarity=0.437 Sum_probs=32.2
Q ss_pred eecCCCCCCCcCCCCccccCCccCCCCCCCCCcccccCCCCeecccCCCCCccCCC
Q psy620 217 YRCGSCPEGTTGNGTRCHDIDECDLAEPCDPRVQCTNLFPGYRCDPCPAGFTGSTG 272 (1290)
Q Consensus 217 y~C~~C~~Gy~Gdg~~C~dideC~~~~pC~~~g~C~n~~gsy~C~~C~~Gy~G~~C 272 (1290)
++- .|.++|.| ..|. ..|...+.+..+.+|.. .| .|. |.+||+|..|
T Consensus 17 ~rv-~C~~~~yG--~~C~--~~C~~~~d~~~~~~Cd~-~G--~~~-C~~Gw~G~~C 63 (63)
T smart00051 17 IRV-TCDENYYG--EGCN--KFCRPRDDFFGHYTCDE-NG--NKG-CLEGWMGPYC 63 (63)
T ss_pred EEe-eCCCCCcC--CccC--CEeCcCccccCCccCCc-CC--CEe-cCCCCcCCCC
Confidence 344 79999999 7775 34542344566777854 34 577 9999999863
No 63
>PF03302 VSP: Giardia variant-specific surface protein; InterPro: IPR005127 During infection, the intestinal protozoan parasite Giardia lamblia undergoes continuous antigenic variation which is determined by diversification of the parasite's major surface antigen, named VSP (variant surface protein).
Probab=60.88 E-value=40 Score=41.16 Aligned_cols=128 Identities=27% Similarity=0.627 Sum_probs=68.0
Q ss_pred ccCCCCccc--CCCCCCCCCCCCCCCCCCCCeeccCCCCcccccCCCCCCCCC-CCcee-cccCCCCCCCCCCCCCCcce
Q psy620 130 MRCPDGYVG--DGIHCKPGVTCNMRPCFQGVQCFDTVEGYTCGPCPSGYTGDG-ERCQR-IGGCSRNPCAQGKLNEKTRC 205 (1290)
Q Consensus 130 ~~C~~Gy~G--dg~~CedideC~~~pC~~gg~C~n~~g~y~C~~C~~Gy~Gdg-~~C~~-ideC~~~pC~~g~~~~~~~C 205 (1290)
..|..||.- +...|....+|....|. +|.+... -.|..|..+|+... +.|.. -.+|....+... ......|
T Consensus 2 ~~C~~gy~~~~~~t~C~~~~~C~~~~C~---~Cs~~~~-~~Ct~C~~~~~lt~t~~Ci~~C~~c~~~~~~t~-~~~~~~C 76 (397)
T PF03302_consen 2 TECTSGYKLSTDKTSCVSASECKTPNCK---TCSNDKK-EVCTECNSGYYLTPTNQCIEDCAKCSNYYCSTC-GNDKKTC 76 (397)
T ss_pred ccccCCceECCCCCcccccCCCCCCCCc---cccCCCC-CccCcCCCCCcCCCCCccccCcccccccccccc-ccccccc
Confidence 468889873 44577766677766664 4655433 56878999987542 23432 111222111111 0012234
Q ss_pred eeeccCC---CCCceecCCCCCCCcCCCCccccCCccCCCCCCCCCcccccCCCCeecccCCCCCccCC
Q psy620 206 VRCDDIP---EHPYYRCGSCPEGTTGNGTRCHDIDECDLAEPCDPRVQCTNLFPGYRCDPCPAGFTGST 271 (1290)
Q Consensus 206 g~C~~~~---~~g~y~C~~C~~Gy~Gdg~~C~dideC~~~~pC~~~g~C~n~~gsy~C~~C~~Gy~G~~ 271 (1290)
..|.... -.+.-.|..|+.||+-++..|. .|. ..| ..|... ....|..|++||....
T Consensus 77 ~~C~~~~~~~~~~~~~c~~C~~G~y~~~~~C~---~C~--~~C---~~C~~~-~~~~Ct~C~~g~~L~~ 136 (397)
T PF03302_consen 77 KKCSIGNCLTCSGDACCSECPDGYYKNGNKCV---PCH--ESC---ATCSGG-APNQCTSCKPGKVLKY 136 (397)
T ss_pred cccccccccccccCccccCCCCCccccCCCCC---CCC--ccc---cccCCC-CCCCCcccCCCccccc
Confidence 4444211 0122356689999996556664 221 223 234432 3457888999997665
No 64
>cd00255 nidG2 Nidogen, G2 domain; Nidogen is an important component of the basement membrane, an extracellular sheet-like matrix. Nidogen is a multifunctional protein that interacts with many other basement membrane proteins, like collagen, perlecan, laminin, and has a potential role in the assembly and connection of networks. Nidogen consists of 3 globular domains (G1-G3), G3 is the laminin-binding domain, while G2 binds collagen IV and perlecan. Also found in hemicentin, a protein which functions at various cell-cell and cell-matrix junctions and might assist in refining broad regions of cell contact into oriented, line-shaped junctions. Nidogen G2 consists of an N-terminal EGF-like domain (excluded from this alignment model) and an 11-stranded beta-barrel with a central helix, a topology that exhibits high structural similarity to the green fluorescent proteins of Cnidaria.
Probab=59.86 E-value=6.7 Score=43.68 Aligned_cols=69 Identities=17% Similarity=0.243 Sum_probs=49.6
Q ss_pred ceeeccccccccccc-CccceEEEEeeeeccccceeeeecccCCCcc---cchhhhhhhcccccccceeeeeecccccc
Q psy620 19 PIVEGWSVKDDLLDD-GVINGLLLGVKQDIMGARYTLYMDCVDHGTV---AMTQSLKKMFDSMKNPQMRLRKTDEESVD 93 (1290)
Q Consensus 19 ~~~~~~s~~~~~~~~-~~~~~l~~~~~~~i~G~~~~ly~~C~~~~~~---~~~~~~~~~~~~~~~~~~~l~~~~~~~~~ 93 (1290)
......+.|.+++.. +...+++|+++|+|+ |..|.+.... .....+.++|..|...+..||+++.+.+.
T Consensus 150 g~l~s~str~~~v~~~~~~~~~~y~~~Q~I~------y~~c~~~~~~~p~~~~l~v~~i~~~Y~~~e~~lrf~~~~~i~ 222 (224)
T cd00255 150 GVLTSSSTREYTVDEGGESQTLSYQWNQTIT------YEECPHDDEAAPDLQQLLVARIFALYNPEEEILRFAITNSIG 222 (224)
T ss_pred CEEEEEEeeEEEEecCCCceEEeEEeeeEEE------EeecCCCCcCCCceEEEEEEEEEEEecChHHheeeeeeeccc
Confidence 445556667676654 335589999999988 9999985422 22333449999999999999998776654
No 65
>PHA02887 EGF-like protein; Provisional
Probab=57.60 E-value=9 Score=38.03 Aligned_cols=31 Identities=29% Similarity=0.758 Sum_probs=24.2
Q ss_pred CCCCCCCeEeeec-CCceEEecCCCcccCCCCcCC
Q psy620 348 TRCDRNAKCTRIL-GNHYACKCDNGWAGDGQFCGR 381 (1290)
Q Consensus 348 ~~C~~~g~C~~~~-~gsy~C~C~~Gy~GdG~~Ce~ 381 (1290)
+.|- +|+|.... ...+.|.|..||+| .+|+.
T Consensus 92 ~YCi-HG~C~yI~dL~epsCrC~~GYtG--~RCE~ 123 (126)
T PHA02887 92 DFCI-NGECMNIIDLDEKFCICNKGYTG--IRCDE 123 (126)
T ss_pred CEee-CCEEEccccCCCceeECCCCccc--CCCCc
Confidence 5677 47997542 45689999999999 89973
No 66
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=52.19 E-value=13 Score=37.53 Aligned_cols=31 Identities=23% Similarity=0.643 Sum_probs=24.5
Q ss_pred CCCCCCCeEeeec-CCceEEecCCCcccCCCCcCC
Q psy620 348 TRCDRNAKCTRIL-GNHYACKCDNGWAGDGQFCGR 381 (1290)
Q Consensus 348 ~~C~~~g~C~~~~-~gsy~C~C~~Gy~GdG~~Ce~ 381 (1290)
+.|-++ +|.... ...+.|.|..||+| .+||.
T Consensus 51 ~YClHG-~C~yI~dl~~~~CrC~~GYtG--eRCEh 82 (139)
T PHA03099 51 GYCLHG-DCIHARDIDGMYCRCSHGYTG--IRCQH 82 (139)
T ss_pred CEeECC-EEEeeccCCCceeECCCCccc--ccccc
Confidence 567764 897642 46799999999999 89974
No 67
>KOG3514|consensus
Probab=51.82 E-value=10 Score=49.59 Aligned_cols=35 Identities=31% Similarity=0.830 Sum_probs=30.1
Q ss_pred CCCCCCCCCCCCceeccCCCCCccccC-CCCccCCCcccc
Q psy620 588 PTCATDNPCFPGVECRDTREGPRCMRC-PDGYVGDGIHCK 626 (1290)
Q Consensus 588 ~~C~~~nPC~~g~~C~~~~~g~~Cg~C-p~G~~Gdg~~C~ 626 (1290)
..|. +|||.||++|.+.-+.|.| .| ..||.| +.|+
T Consensus 624 ~~C~-~nPC~N~g~C~egwNrfiC-DCs~T~~~G--~~Ce 659 (1591)
T KOG3514|consen 624 KICE-SNPCQNGGKCSEGWNRFIC-DCSGTGFEG--RTCE 659 (1591)
T ss_pred cccC-CCcccCCCCcccccccccc-ccccCcccC--cccc
Confidence 3797 8999999999999999999 68 567877 6776
No 68
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=50.28 E-value=15 Score=37.21 Aligned_cols=39 Identities=38% Similarity=0.971 Sum_probs=27.9
Q ss_pred cCCCCCC--CCCCCCCCeeecCC--CCcccccCCCCcccCCCCCCCC
Q psy620 104 KKPTCAT--DNPCFPGVECRDTR--EGPRCMRCPDGYVGDGIHCKPG 146 (1290)
Q Consensus 104 ~~d~C~~--~~pC~~gg~C~~~~--g~y~C~~C~~Gy~Gdg~~Cedi 146 (1290)
++-+|.. .+=|.|| +|.--. ..+.| .|..||+| .+||..
T Consensus 41 ~i~~Cp~ey~~YClHG-~C~yI~dl~~~~C-rC~~GYtG--eRCEh~ 83 (139)
T PHA03099 41 AIRLCGPEGDGYCLHG-DCIHARDIDGMYC-RCSHGYTG--IRCQHV 83 (139)
T ss_pred ccccCChhhCCEeECC-EEEeeccCCCcee-ECCCCccc--ccccce
Confidence 4556663 5668876 886543 67889 99999999 688743
No 69
>PF03302 VSP: Giardia variant-specific surface protein; InterPro: IPR005127 During infection, the intestinal protozoan parasite Giardia lamblia undergoes continuous antigenic variation which is determined by diversification of the parasite's major surface antigen, named VSP (variant surface protein).
Probab=49.03 E-value=2.8e+02 Score=33.88 Aligned_cols=44 Identities=34% Similarity=0.939 Sum_probs=28.3
Q ss_pred cccccCCCCCCCCCCCceecccCCCCCCCCCCCCCCcceeeeccCCCCCceecCCCCCCCcC
Q psy620 167 YTCGPCPSGYTGDGERCQRIGGCSRNPCAQGKLNEKTRCVRCDDIPEHPYYRCGSCPEGTTG 228 (1290)
Q Consensus 167 y~C~~C~~Gy~Gdg~~C~~ideC~~~pC~~g~~~~~~~Cg~C~~~~~~g~y~C~~C~~Gy~G 228 (1290)
-.|..|+.||+-++..|. ||+.. |.+|... ....|..|++||..
T Consensus 91 ~~c~~C~~G~y~~~~~C~--------~C~~~-------C~~C~~~---~~~~Ct~C~~g~~L 134 (397)
T PF03302_consen 91 ACCSECPDGYYKNGNKCV--------PCHES-------CATCSGG---APNQCTSCKPGKVL 134 (397)
T ss_pred ccccCCCCCccccCCCCC--------CCCcc-------ccccCCC---CCCCCcccCCCccc
Confidence 356689999986544553 44443 4566543 24578889999874
No 70
>PF00954 S_locus_glycop: S-locus glycoprotein family; InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=44.29 E-value=19 Score=35.51 Aligned_cols=32 Identities=28% Similarity=0.710 Sum_probs=26.2
Q ss_pred CCCCCCCCCCCCCCeeecCCCCcccccCCCCccc
Q psy620 105 KPTCATDNPCFPGVECRDTREGPRCMRCPDGYVG 138 (1290)
Q Consensus 105 ~d~C~~~~pC~~gg~C~~~~g~y~C~~C~~Gy~G 138 (1290)
.+.|.....|.+.+.|... ....| .|.+||.-
T Consensus 77 ~d~Cd~y~~CG~~g~C~~~-~~~~C-~Cl~GF~P 108 (110)
T PF00954_consen 77 KDQCDVYGFCGPNGICNSN-NSPKC-SCLPGFEP 108 (110)
T ss_pred ccCCCCccccCCccEeCCC-CCCce-ECCCCcCC
Confidence 4789878999999999543 56689 99999974
No 71
>PHA02887 EGF-like protein; Provisional
Probab=43.61 E-value=18 Score=36.06 Aligned_cols=28 Identities=32% Similarity=0.695 Sum_probs=22.6
Q ss_pred eeeccCCCCCceecCCCCCCCcCCCCccccC
Q psy620 206 VRCDDIPEHPYYRCGSCPEGTTGNGTRCHDI 236 (1290)
Q Consensus 206 g~C~~~~~~g~y~C~~C~~Gy~Gdg~~C~di 236 (1290)
|+|.-+.+...+.| .|..||+| .+|+.+
T Consensus 97 G~C~yI~dL~epsC-rC~~GYtG--~RCE~v 124 (126)
T PHA02887 97 GECMNIIDLDEKFC-ICNKGYTG--IRCDEV 124 (126)
T ss_pred CEEEccccCCCcee-ECCCCccc--CCCCcc
Confidence 47877766677899 99999999 888743
No 72
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=34.57 E-value=41 Score=28.39 Aligned_cols=35 Identities=29% Similarity=0.724 Sum_probs=0.0
Q ss_pred CCCCCe----eecCCCCcccccCCCCcccCCCCCCCCCCCCCCCCCCCCeeccCCCCcccccCCCCCCCC
Q psy620 114 CFPGVE----CRDTREGPRCMRCPDGYVGDGIHCKPGVTCNMRPCFQGVQCFDTVEGYTCGPCPSGYTGD 179 (1290)
Q Consensus 114 C~~gg~----C~~~~g~y~C~~C~~Gy~Gdg~~CedideC~~~pC~~gg~C~n~~g~y~C~~C~~Gy~Gd 179 (1290)
|.+++. |....+ +| .|+++|+| ..|+ .|++||++.
T Consensus 4 C~~~g~~~~~C~~~~G--~C-~C~~~~~G--~~C~--------------------------~C~~g~~~~ 42 (50)
T cd00055 4 CNGHGSLSGQCDPGTG--QC-ECKPNTTG--RRCD--------------------------RCAPGYYGL 42 (50)
T ss_pred CcCCCCCCccccCCCC--EE-eCCCcCCC--CCCC--------------------------CCCCCCccC
No 73
>PF01683 EB: EB module; InterPro: IPR006149 The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO
Probab=33.84 E-value=50 Score=27.93 Aligned_cols=29 Identities=41% Similarity=1.088 Sum_probs=20.3
Q ss_pred CCCCCCCCCCCCCeeecCCCCcccccCCCCcccC
Q psy620 106 PTCATDNPCFPGVECRDTREGPRCMRCPDGYVGD 139 (1290)
Q Consensus 106 d~C~~~~pC~~gg~C~~~~g~y~C~~C~~Gy~Gd 139 (1290)
..|.....|..++.|++. +| .|++||+-.
T Consensus 20 ~~C~~~~qC~~~s~C~~g----~C-~C~~g~~~~ 48 (52)
T PF01683_consen 20 ESCESDEQCIGGSVCVNG----RC-QCPPGYVEV 48 (52)
T ss_pred CCCCCcCCCCCcCEEcCC----Ee-ECCCCCEec
Confidence 456666677777788653 78 888888743
No 74
>PF00954 S_locus_glycop: S-locus glycoprotein family; InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=33.62 E-value=34 Score=33.67 Aligned_cols=32 Identities=28% Similarity=0.567 Sum_probs=25.6
Q ss_pred CCCCCCCCCCCCCCeEeeecCCceEEecCCCccc
Q psy620 341 ADVCPDGTRCDRNAKCTRILGNHYACKCDNGWAG 374 (1290)
Q Consensus 341 id~C~~~~~C~~~g~C~~~~~gsy~C~C~~Gy~G 374 (1290)
.+.|.....|..+|.|.. ..+..|.|.+||.-
T Consensus 77 ~d~Cd~y~~CG~~g~C~~--~~~~~C~Cl~GF~P 108 (110)
T PF00954_consen 77 KDQCDVYGFCGPNGICNS--NNSPKCSCLPGFEP 108 (110)
T ss_pred ccCCCCccccCCccEeCC--CCCCceECCCCcCC
Confidence 467886789999999954 34568999999974
No 75
>smart00210 TSPN Thrombospondin N-terminal -like domains. Heparin-binding and cell adhesion domain of thrombospondin
Probab=31.65 E-value=79 Score=34.14 Aligned_cols=33 Identities=33% Similarity=0.441 Sum_probs=27.5
Q ss_pred cccccCccceEEEEeeeeccccceeeeecccCCCccc
Q psy620 29 DLLDDGVINGLLLGVKQDIMGARYTLYMDCVDHGTVA 65 (1290)
Q Consensus 29 ~~~~~~~~~~l~~~~~~~i~G~~~~ly~~C~~~~~~~ 65 (1290)
..+.+++||++.+.+... .++||++|.......
T Consensus 112 ~~l~dg~WH~lal~V~~~----~v~LyvDC~~~~~~~ 144 (184)
T smart00210 112 LPLADGQWHKLALSVSGS----SATLYVDCNEIDSRP 144 (184)
T ss_pred CccccCCceEEEEEEeCC----EEEEEECCcccccee
Confidence 457899999999998776 579999999877664
No 76
>PTZ00214 high cysteine membrane protein Group 4; Provisional
Probab=29.90 E-value=1.8e+02 Score=38.86 Aligned_cols=38 Identities=24% Similarity=0.699 Sum_probs=24.4
Q ss_pred CcEEcCcCcCCccccCCCCCCCCCCCCCCCCCCC----CCCCCCCeEe
Q psy620 314 SFTCTSLCRNSYMVRNVSVGCQSQNFGADVCPDG----TRCDRNAKCT 357 (1290)
Q Consensus 314 sy~C~~~C~~Gy~g~~~g~~C~~~~~~id~C~~~----~~C~~~g~C~ 357 (1290)
..+| +|..||.....+..|.. ...|... ..|...++|+
T Consensus 681 ~~~C--~C~~g~~p~~~~~~C~~----~~~C~~~~~gC~~C~~~g~C~ 722 (800)
T PTZ00214 681 VRRC--WCERGFLPALDRSGCVL----PTECPPDMPSCAACDESGRCL 722 (800)
T ss_pred ccee--EecCCcccccCCCcccc----ccCCCcccccccccCCCCcee
Confidence 4589 99999987777778876 2345421 2455555554
No 77
>KOG3509|consensus
Probab=28.77 E-value=1.1e+02 Score=41.29 Aligned_cols=117 Identities=25% Similarity=0.432 Sum_probs=0.0
Q ss_pred ccccCCCCcccCCCCCCCCCCCCCCCCCCCCeeccCCCCcccccCCCCCCCCCCCcee-cccCCCCCCCCCCCCCCccee
Q psy620 128 RCMRCPDGYVGDGIHCKPGVTCNMRPCFQGVQCFDTVEGYTCGPCPSGYTGDGERCQR-IGGCSRNPCAQGKLNEKTRCV 206 (1290)
Q Consensus 128 ~C~~C~~Gy~Gdg~~CedideC~~~pC~~gg~C~n~~g~y~C~~C~~Gy~Gdg~~C~~-ideC~~~pC~~g~~~~~~~Cg 206 (1290)
.| .|++||.| ..|++-.++...++. +.|.... .+ .|.-+.... .|+. .-.| .
T Consensus 719 ~C-~c~~g~~G--~~ce~c~e~~~ls~t--~~~~~~~---~~-~c~~~~h~~--~c~~~~~~n----------------t 771 (964)
T KOG3509|consen 719 QC-QCPKGLVG--TSCEDCAEGYTLSTT--GGLYPGL---CE-DCECNSHIS--QCEDDLGYN----------------T 771 (964)
T ss_pred cc-ccCccccC--ccccccccccccccc--CCcCccc---Cc-ccccCCCcc--ccccccccc----------------c
Q ss_pred eeccCCCCCceecCCCCCCCcCC---CCccccCCccCCCCC--CCCCcccccCCCCeecccCCCCCccCCCC
Q psy620 207 RCDDIPEHPYYRCGSCPEGTTGN---GTRCHDIDECDLAEP--CDPRVQCTNLFPGYRCDPCPAGFTGSTGV 273 (1290)
Q Consensus 207 ~C~~~~~~g~y~C~~C~~Gy~Gd---g~~C~dideC~~~~p--C~~~g~C~n~~gsy~C~~C~~Gy~G~~Ce 273 (1290)
.|.+... +++|..|++||.++ +..+.....|.+..+ +.+...-.-...++.|..|+++++|..|+
T Consensus 772 ~~q~~~~--~~~~~~~~~g~~~da~~g~~~D~~p~~~l~~~~~~~~r~~l~~~~~~~~~~~~p~~~~g~~~~ 841 (964)
T KOG3509|consen 772 DCQNNTE--GDRCELCSPGTYGDARRGTPEDCRPATALTIQCSCNNRSPLSCDGFGPGCLLCPHNTEGTTCE 841 (964)
T ss_pred cccccCc--cceeeecCCCccccCccCCcccCCccchhhhhhhhcccCccccccCCCCcccCCCCccccchh
No 78
>TIGR00648 recU recombination protein U. The Bacillus protein has been shown to be required for DNA recombination and repair. RJD 11/20/00
Probab=25.88 E-value=32 Score=36.88 Aligned_cols=46 Identities=11% Similarity=0.172 Sum_probs=36.6
Q ss_pred ccceeeEEEEeecCCcEEEEEeeecccccc--------cccCcceecCCccEEEE
Q psy620 1226 DLCSGNVATFYQSSQKFYVMMWKKNSQVYW--------QTTPFRAVAEPGIQLKV 1272 (1290)
Q Consensus 1226 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~yw--------~~~~~~~~~~~~~~~~~ 1272 (1290)
-...+||++.+....+||+|.|++..+ || .+-|+.-..+-|++|++
T Consensus 101 ~gGiaF~iI~F~~~~e~y~v~~~~l~~-~w~~~~~~GrKSi~~~~i~~~g~~i~~ 154 (169)
T TIGR00648 101 QDGICFLIISFQTFDQVYFLEADKLFY-FWKRKEKNGRKSIRKDEIEETAYPIPL 154 (169)
T ss_pred CCCEEEEEEEEeecCeEEEEEHHHHHH-HHHHHhhCCCCcccHHHHHHhCEEecc
Confidence 456789999999999999999988754 78 44577777788888865
No 79
>PF00053 Laminin_EGF: Laminin EGF-like (Domains III and V); InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and three short globular arms. The long arm consist of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below. +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=25.40 E-value=61 Score=27.07 Aligned_cols=22 Identities=27% Similarity=0.661 Sum_probs=16.9
Q ss_pred CeEeeecCCceEEecCCCcccCCCCcC
Q psy620 354 AKCTRILGNHYACKCDNGWAGDGQFCG 380 (1290)
Q Consensus 354 g~C~~~~~gsy~C~C~~Gy~GdG~~Ce 380 (1290)
.+|... ..+|.|+++|+| .+|+
T Consensus 11 ~~C~~~---~G~C~C~~~~~G--~~C~ 32 (49)
T PF00053_consen 11 QTCDPS---TGQCVCKPGTTG--PRCD 32 (49)
T ss_dssp SSEEET---CEEESBSTTEES--TTS-
T ss_pred CcccCC---CCEEeccccccC--CcCc
Confidence 367653 469999999999 8885
No 80
>PRK02234 recU Holliday junction-specific endonuclease; Reviewed
Probab=21.73 E-value=47 Score=36.46 Aligned_cols=47 Identities=13% Similarity=0.274 Sum_probs=35.1
Q ss_pred cccceeeEEEEeecCCcEEEEEeeecccccc--------cccCcceecCCccEEEE
Q psy620 1225 LDLCSGNVATFYQSSQKFYVMMWKKNSQVYW--------QTTPFRAVAEPGIQLKV 1272 (1290)
Q Consensus 1225 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~yw--------~~~~~~~~~~~~~~~~~ 1272 (1290)
.-...+||++.+-.-.+||+|.|++.. .|| .+-|+.-..+-|++|++
T Consensus 123 ~~gGiaF~iI~F~~~~e~y~vp~~~l~-~~w~~~~~~grKSI~~e~i~~~~~~i~~ 177 (195)
T PRK02234 123 KQGGICFVIIRFSTLDETYLLPASKLI-KFWERQKDGGRKSIPLEEIKKNGYEIPL 177 (195)
T ss_pred HCCCEEEEEEEEEeCCeEEEEEHHHHH-HHHHHHHhCCCCcccHHHHHHcCEEecc
Confidence 345688999999999999999999874 488 34455555666777754
No 81
>PF01414 DSL: Delta serrate ligand; InterPro: IPR001774 Ligands of the Delta/Serrate/lag-2 (DSL) family and their receptors, members of the lin-12/Notch family, mediate cell-cell interactions that specify cell fate in invertebrates and vertebrates. In Caenorhabditis elegans, two DSL genes, lag-2 and apx-1, influence different cell fate decisions during development []. Molecular interaction between Notch and Serrate, another EGF-homologous transmembrane protein containing a region of striking similarity to Delta, has been shown and the same two EGF repeats of Notch may also constitute a Serrate binding domain [, ].; GO: 0007154 cell communication, 0016020 membrane; PDB: 2VJ2_A.
Probab=20.02 E-value=26 Score=31.35 Aligned_cols=48 Identities=25% Similarity=0.523 Sum_probs=19.7
Q ss_pred CcEEcCcCcCCccccCCCCCCCCCCCCCCCCCCCCCCCCCCeEeeecCCceEEecCCCcccCCCCc
Q psy620 314 SFTCTSLCRNSYMVRNVSVGCQSQNFGADVCPDGTRCDRNAKCTRILGNHYACKCDNGWAGDGQFC 379 (1290)
Q Consensus 314 sy~C~~~C~~Gy~g~~~g~~C~~~~~~id~C~~~~~C~~~g~C~~~~~gsy~C~C~~Gy~GdG~~C 379 (1290)
.++- .|.+.|.|..+...|.+. +.- ..+-+|... | .=+|.+||+| ..|
T Consensus 16 ~~rv--~C~~nyyG~~C~~~C~~~----~d~------~ghy~Cd~~--G--~~~C~~Gw~G--~~C 63 (63)
T PF01414_consen 16 RIRV--VCDENYYGPNCSKFCKPR----DDS------FGHYTCDSN--G--NKVCLPGWTG--PNC 63 (63)
T ss_dssp ---------TTEETTTT-EE---E----EET------TEEEEE-SS------EEE-TTEES--TTS
T ss_pred EEEE--ECCCCCCCccccCCcCCC----cCC------cCCcccCCC--C--CCCCCCCCcC--CCC
Confidence 4566 899999987776666551 110 112245432 2 4579999999 555
Done!