Query psy2974
Match_columns 186
No_of_seqs 243 out of 2059
Neff 9.3
Searched_HMMs 46136
Date Fri Aug 16 22:19:39 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy2974.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/2974hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF00431 CUB: CUB domain CUB d 99.4 4.6E-13 1E-17 90.6 6.9 72 81-152 1-110 (110)
2 cd00041 CUB CUB domain; extrac 99.4 2.3E-12 5E-17 87.5 9.3 74 81-154 1-113 (113)
3 smart00042 CUB Domain first fo 99.3 2.8E-11 6E-16 80.9 7.5 63 90-152 1-102 (102)
4 PF14670 FXa_inhibition: Coagu 98.9 9.5E-10 2.1E-14 59.2 2.6 36 41-76 1-36 (36)
5 PF07645 EGF_CA: Calcium-bindi 98.8 4.5E-09 9.8E-14 58.9 3.2 40 37-76 1-42 (42)
6 KOG4586|consensus 98.8 2.4E-08 5.3E-13 68.1 6.2 70 87-156 47-155 (156)
7 KOG4260|consensus 98.2 5.4E-07 1.2E-11 69.1 1.8 99 7-109 195-312 (350)
8 cd01475 vWA_Matrilin VWA_Matri 98.0 4.9E-06 1.1E-10 63.6 2.9 44 31-74 180-223 (224)
9 PF12662 cEGF: Complement Clr- 97.7 2.2E-05 4.8E-10 38.1 1.5 21 58-78 1-21 (24)
10 smart00179 EGF_CA Calcium-bind 97.5 0.00013 2.7E-09 39.6 2.8 31 37-68 1-33 (39)
11 KOG1214|consensus 97.5 0.00015 3.3E-09 63.7 4.5 48 29-76 725-774 (1289)
12 PF12662 cEGF: Complement Clr- 97.3 7.7E-05 1.7E-09 36.2 0.5 16 25-40 9-24 (24)
13 KOG4289|consensus 97.1 0.0028 6.1E-08 59.0 8.2 86 23-113 1227-1321(2531)
14 PF12947 EGF_3: EGF domain; I 97.0 0.00063 1.4E-08 36.5 2.4 29 41-69 1-31 (36)
15 KOG1214|consensus 96.8 0.00083 1.8E-08 59.2 2.5 48 25-78 816-865 (1289)
16 KOG1219|consensus 96.6 0.0029 6.4E-08 61.3 5.1 91 5-100 3865-3971(4289)
17 cd00054 EGF_CA Calcium-binding 96.5 0.0029 6.3E-08 33.6 2.8 30 38-68 2-33 (38)
18 KOG4292|consensus 96.4 0.0045 9.7E-08 51.4 4.1 45 79-123 290-334 (454)
19 smart00181 EGF Epidermal growt 96.3 0.0043 9.4E-08 32.6 2.5 24 46-69 6-30 (35)
20 PF00008 EGF: EGF-like domain 95.7 0.013 2.8E-07 30.4 2.4 19 50-68 10-29 (32)
21 cd00053 EGF Epidermal growth f 95.3 0.02 4.4E-07 29.7 2.5 24 46-69 6-31 (36)
22 KOG1219|consensus 94.9 0.016 3.5E-07 56.6 2.1 40 25-68 3931-3972(4289)
23 PF02408 CUB_2: CUB-like domai 93.3 0.42 9E-06 32.8 6.2 25 100-124 37-61 (120)
24 KOG4292|consensus 92.6 0.21 4.5E-06 41.8 4.5 71 83-153 171-248 (454)
25 PF09064 Tme5_EGF_like: Thromb 91.9 0.16 3.5E-06 26.5 2.0 26 46-72 6-31 (34)
26 PF12661 hEGF: Human growth fa 91.1 0.2 4.3E-06 20.4 1.5 9 60-68 1-9 (13)
27 PF12946 EGF_MSP1_1: MSP1 EGF 76.5 3.7 8.1E-05 22.0 2.5 21 50-70 11-32 (37)
28 KOG4289|consensus 67.4 4.6 9.9E-05 39.1 2.5 49 54-102 1217-1270(2531)
29 PF06247 Plasmod_Pvs28: Plasmo 62.1 7.6 0.00016 28.9 2.4 47 25-72 27-83 (197)
30 KOG4260|consensus 61.5 5.9 0.00013 31.2 1.9 33 38-70 271-306 (350)
31 KOG1217|consensus 60.0 24 0.00052 29.3 5.5 52 50-101 243-302 (487)
32 PF07974 EGF_2: EGF-like domai 54.3 15 0.00032 18.8 2.1 17 50-68 12-28 (32)
33 KOG1215|consensus 53.7 12 0.00027 34.5 2.9 45 34-78 666-710 (877)
34 KOG1217|consensus 52.7 11 0.00023 31.4 2.2 38 33-71 266-305 (487)
35 KOG1215|consensus 52.3 45 0.00097 30.9 6.3 36 38-73 364-400 (877)
36 PF05428 CRF-BP: Corticotropin 46.1 1.4E+02 0.003 24.0 7.2 31 126-158 150-180 (311)
37 PHA03099 epidermal growth fact 45.3 16 0.00034 25.4 1.7 36 38-77 42-81 (139)
38 KOG0196|consensus 43.7 1.5E+02 0.0033 27.5 7.8 91 60-158 260-371 (996)
39 PF00954 S_locus_glycop: S-loc 29.9 41 0.00089 22.3 1.8 19 50-69 90-108 (110)
40 PF01826 TIL: Trypsin Inhibito 27.8 46 0.001 18.9 1.6 16 61-77 35-50 (55)
41 PF02014 Reeler: Reeler domain 25.5 1.3E+02 0.0029 20.5 3.9 42 125-166 28-72 (132)
42 PHA02887 EGF-like protein; Pro 22.5 68 0.0015 22.0 1.8 24 50-77 97-122 (126)
43 PF13897 GOLD_2: Golgi-dynamic 22.3 1.5E+02 0.0033 20.8 3.5 30 129-158 16-45 (136)
44 PF07754 DUF1610: Domain of un 21.8 41 0.00089 16.1 0.5 10 7-16 14-23 (24)
45 PTZ00214 high cysteine membran 20.8 1.3E+02 0.0028 27.8 3.7 46 25-77 654-700 (800)
No 1
>PF00431 CUB: CUB domain CUB domain entry Spermadhesins family entry Link to schematic domain picture by Peer Bork. ; InterPro: IPR000859 The CUB domain (for complement C1r/C1s, Uegf, Bmp1) is a structural motif of approximately 110 residues found almost exclusively in extracellular and plasma membrane-associated proteins, many of which are developmentally regulated [, ]. These proteins are involved in a diverse range of functions, including complement activation, developmental patterning, tissue repair, axon guidance and angiogenesis, cell signalling, fertilisation, haemostasis, inflammation, neurotransmission, receptor-mediated endocytosis, and tumour suppression [, ]. Many CUB-containing proteins are peptidases belonging to MEROPS peptidase families M12A (astacin) and S1A (chymotrypsin). Proteins containing a CUB domain include: Mammalian complement subcomponents C1s/C1r, which form the calcium-dependent complex C1, the first component of the classical pathway of the complement system. Cricetidae sp. (Hamster) serine protease Casp, which degrades type I and IV collagen and fibronectin in the presence of calcium. Mammalian complement-activating component of Ra-reactive factor (RARF), a protease that cleaves the C4 component of complement. Vertebrate enteropeptidase (3.4.21.9 from EC), a type II membrane protein of the intestinal brush border, which activates trypsinogen. Vertebrate bone morphogenic protein 1 (BMP-1), a protein which induces cartilage and bone formation and expresses metalloendopeptidase activity. Sea urchin blastula proteins BP10 and SpAN. Caenorhabditis elegans hypothetical proteins F42A10.8 and R151.5. Neuropilin (A5 antigen), a calcium-independent cell adhesion molecule that functions during the formation of certain neuronal circuits. Fibropellins I and III from Strongylocentrotus purpuratus (Purple sea urchin). Mammalian hyaluronate-binding protein TSG-6 (or PS4), a serum and growth factor induced protein. Mammalian spermadhesins. Xenopus laevis embryonic protein UVS.2, which is expressed during dorsoanterior development. Several of the above proteins consist of a catalytic domain together with several CUB domains interspersed by calcium-binding EGF domains. Some CUB domains appear to be involved in oligomerisation and/or recognition of substrates and binding partners. For example, in the complement proteases, the CUB domains mediate dimerisation and binding to collagen-like regions of target proteins (e.g. C1q for C1r/C1s). The structure of CUB domains consists of a beta-sandwich with a jelly-roll fold. Almost all CUB domains contain four conserved cysteines that probably form two disulphide bridges (C1-C2, C3-C4). The CUB1 domains of C1s and Map19 have calcium-binding sites [].; PDB: 1SFP_A 3KQ4_B 2WNO_A 2QQK_A 2QQL_A 2QQO_B 2QQM_A 3POJ_A 3POB_A 3POG_B ....
Probab=99.43 E-value=4.6e-13 Score=90.60 Aligned_cols=72 Identities=44% Similarity=0.888 Sum_probs=64.7
Q ss_pred cceeecCCCCcccCCCCCCCCCCCCceEEEEEcCCCCEEEeccC------------------------------------
Q psy2974 81 CKYEITAPNGVIKTPNHPDYYPSKRECIWHFTTTPGHRIKLNPT------------------------------------ 124 (186)
Q Consensus 81 C~~~~~~~~G~i~sp~~p~~y~~~~~C~w~i~~~~g~~i~l~~~------------------------------------ 124 (186)
|++.+....|.|.||+||..|+.+..|.|.|.+++++.|.|.|.
T Consensus 1 Cg~~~~~~~g~i~Sp~yp~~y~~~~~C~w~i~~~~~~~I~l~f~~~~~~~~~~c~~d~l~v~~g~~~~~~~~~~~cg~~~ 80 (110)
T PF00431_consen 1 CGGRLTNNSGIISSPNYPSNYPSNSDCTWTITAPPGHRIRLTFLSFDLESSDSCCQDYLEVYDGNDESSPLLGRFCGSSP 80 (110)
T ss_dssp SEEEECSSEEEEESTTTTS-SSSSEEEEEEEE-STTEEEEEEEEEEEB--TTTSTSSEEEEESSSSTTSEEEEEESSSSC
T ss_pred CcCEEECCeEEEECCCCCCCCCCCCcEeEEEEecccceeeeccccccceeeeeecccceeEEeeccccceeeeeccCCcC
Confidence 77888889999999999999999999999999999999999853
Q ss_pred --ceeeeCCeEEEEEEeCCCCCCCCcEEEE
Q psy2974 125 --DIISISEGLLVRFRSDDTVVGKGFSASY 152 (186)
Q Consensus 125 --~~~s~~~~l~i~f~sd~~~~~~GF~~~y 152 (186)
.+++.++.|+|.|+++......||+|+|
T Consensus 81 ~~~i~s~~~~l~i~f~s~~~~~~~gF~~~y 110 (110)
T PF00431_consen 81 PPSIISSSNSLFIRFHSDSSNSSRGFKATY 110 (110)
T ss_dssp CEEEEESSSEEEEEEEESSSSTTSEEEEEE
T ss_pred CccEEECCCEEEEEEEECCCCCCccEEEEC
Confidence 3688899999999999999999999987
No 2
>cd00041 CUB CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.
Probab=99.41 E-value=2.3e-12 Score=87.48 Aligned_cols=74 Identities=46% Similarity=0.876 Sum_probs=65.7
Q ss_pred cceeecCC-CCcccCCCCCCCCCCCCceEEEEEcCCCCEEEeccC-----------------------------------
Q psy2974 81 CKYEITAP-NGVIKTPNHPDYYPSKRECIWHFTTTPGHRIKLNPT----------------------------------- 124 (186)
Q Consensus 81 C~~~~~~~-~G~i~sp~~p~~y~~~~~C~w~i~~~~g~~i~l~~~----------------------------------- 124 (186)
|+..+... .|.|.||+||..|+++..|.|.|.++++.+|.|.|.
T Consensus 1 C~~~~~~~~~g~i~Sp~~p~~~~~~~~C~w~i~~~~g~~i~l~f~~~~l~~~~~C~~d~l~i~~g~~~~~~~~~~~Cg~~ 80 (113)
T cd00041 1 CGGTLTASTSGTISSPNYPNNYPNNLNCVWTIEAPPGYRIRLTFEDFDLESSPNCSYDYLEIYDGPSTSSPLLGRFCGST 80 (113)
T ss_pred CCCEEccCCCeEEECCCCCCCCCCCCcEEEEEEcCCCCEEEEEEeCcccccCCCCCCcEEEEEcCCCCccccceeeECCC
Confidence 66677776 899999999999999999999999999999988853
Q ss_pred ---ceeeeCCeEEEEEEeCCCCCCCCcEEEEEE
Q psy2974 125 ---DIISISEGLLVRFRSDDTVVGKGFSASYIA 154 (186)
Q Consensus 125 ---~~~s~~~~l~i~f~sd~~~~~~GF~~~y~~ 154 (186)
.++|.++.|.|.|++|......||+++|++
T Consensus 81 ~~~~~~s~~~~~~i~f~s~~~~~~~GF~~~y~~ 113 (113)
T cd00041 81 LPPPIISSGNSLTVRFRSDSSVTGRGFKATYSA 113 (113)
T ss_pred CCCCEEecCCEEEEEEEeCCCCCCCCEEEEEEC
Confidence 267888999999999999999999999973
No 3
>smart00042 CUB Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein. This domain is found mostly among developmentally-regulated proteins. Spermadhesins contain only this domain.
Probab=99.25 E-value=2.8e-11 Score=80.93 Aligned_cols=63 Identities=48% Similarity=0.850 Sum_probs=54.5
Q ss_pred CcccCCCCCCCCCCCCceEEEEEcCCCCEEEeccCc---------------------------------------eeeeC
Q psy2974 90 GVIKTPNHPDYYPSKRECIWHFTTTPGHRIKLNPTD---------------------------------------IISIS 130 (186)
Q Consensus 90 G~i~sp~~p~~y~~~~~C~w~i~~~~g~~i~l~~~~---------------------------------------~~s~~ 130 (186)
|.|.||+||..|+.+..|.|.|.+++++.|.|.|.. +.|.+
T Consensus 1 G~i~Sp~yP~~y~~~~~C~w~i~~~~g~~i~l~f~~~~l~~~~~C~~d~l~i~~g~~~~~~~~~~~Cg~~~~~~~~~s~~ 80 (102)
T smart00042 1 GTITSPNYPQSYPNNLDCVWTIRAPPGYRIELQFTDFDLESSDNCEYDYVEIYDGPSASSPLLGRFCGSELPPPVISSSS 80 (102)
T ss_pred CEEeCCCCCcCCCCCCcEEEEEECCCCeEEEEEEEEEeccCCCCeeEeEEEEEeCCCCCCceeEEEecCcCCCCeEEcCC
Confidence 678999999999999999999999999999887531 34558
Q ss_pred CeEEEEEEeCCCCCCCCcEEEE
Q psy2974 131 EGLLVRFRSDDTVVGKGFSASY 152 (186)
Q Consensus 131 ~~l~i~f~sd~~~~~~GF~~~y 152 (186)
+.|+|.|+++......||+++|
T Consensus 81 n~~~i~f~s~~~~~~~GF~~~y 102 (102)
T smart00042 81 NSLTVTFVSDSSVQKRGFSARY 102 (102)
T ss_pred CEEEEEEEeCCCCCCCCeEEEC
Confidence 8999999999988889999876
No 4
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=98.91 E-value=9.5e-10 Score=59.17 Aligned_cols=36 Identities=61% Similarity=1.279 Sum_probs=30.6
Q ss_pred ccccccCcccccccCCCceEEEecCCeeeecccccc
Q psy2974 41 CMTNNGGCQHECRNTIGSYICSCHNGYTLLENGHDC 76 (186)
Q Consensus 41 C~~~~~~C~~~C~n~~g~~~C~C~~G~~l~~~~~~C 76 (186)
|..+++.|+|.|++.+++|+|.|++||.|..|+++|
T Consensus 1 C~~~NGgC~h~C~~~~g~~~C~C~~Gy~L~~D~~tC 36 (36)
T PF14670_consen 1 CSVNNGGCSHICVNTPGSYRCSCPPGYKLAEDGRTC 36 (36)
T ss_dssp CTTGGGGSSSEEEEETTSEEEE-STTEEE-TTSSSE
T ss_pred CCCCCCCcCCCCccCCCceEeECCCCCEECcCCCCC
Confidence 455678999999999999999999999999999876
No 5
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes []. +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=98.81 E-value=4.5e-09 Score=58.94 Aligned_cols=40 Identities=48% Similarity=0.991 Sum_probs=34.0
Q ss_pred ccccccccccCcc--cccccCCCceEEEecCCeeeecccccc
Q psy2974 37 DKDECMTNNGGCQ--HECRNTIGSYICSCHNGYTLLENGHDC 76 (186)
Q Consensus 37 d~~eC~~~~~~C~--~~C~n~~g~~~C~C~~G~~l~~~~~~C 76 (186)
||+||......|. +.|+|+.|+|.|.|++||.....+..|
T Consensus 1 DidEC~~~~~~C~~~~~C~N~~Gsy~C~C~~Gy~~~~~~~~C 42 (42)
T PF07645_consen 1 DIDECAEGPHNCPENGTCVNTEGSYSCSCPPGYELNDDGTTC 42 (42)
T ss_dssp ESSTTTTTSSSSSTTSEEEEETTEEEEEESTTEEECTTSSEE
T ss_pred CccccCCCCCcCCCCCEEEcCCCCEEeeCCCCcEECCCCCcC
Confidence 7899999877886 799999999999999999966555544
No 6
>KOG4586|consensus
Probab=98.76 E-value=2.4e-08 Score=68.06 Aligned_cols=70 Identities=30% Similarity=0.618 Sum_probs=61.1
Q ss_pred CCCCcccCCCCCCCCCCCCceEEEEEcCCCCEEEeccC---------------------------------------cee
Q psy2974 87 APNGVIKTPNHPDYYPSKRECIWHFTTTPGHRIKLNPT---------------------------------------DII 127 (186)
Q Consensus 87 ~~~G~i~sp~~p~~y~~~~~C~w~i~~~~g~~i~l~~~---------------------------------------~~~ 127 (186)
...+.+++|++|..|+++.+|...|.+.+-+.|++.|. .+.
T Consensus 47 ~n~~Iftspn~ps~ypp~r~cv~vi~~~p~~~ve~~Fde~y~IEps~EC~fD~iEvrDGpfGFSPlI~rfCG~~nPp~Ir 126 (156)
T KOG4586|consen 47 QNFNIFTSPNFPSRYPPNRDCVRVIHSRPQHDVEVKFDEVYHIEPSYECPFDFIEVRDGPFGFSPLIARFCGDRNPPEIR 126 (156)
T ss_pred cccceEecCCccccCCCCcceEEeEecccccceEEeeeeeEEecccccCCCCcccccCCCcCccHHHHHHhccCCChhhe
Confidence 44567899999999999999999999998888877643 368
Q ss_pred eeCCeEEEEEEeCCCCCCCCcEEEEEEEe
Q psy2974 128 SISEGLLVRFRSDDTVVGKGFSASYIAID 156 (186)
Q Consensus 128 s~~~~l~i~f~sd~~~~~~GF~~~y~~~~ 156 (186)
|.+..|||+|.||......||.|.|..++
T Consensus 127 s~grFlWIkF~sD~ele~~gfsa~y~~~p 155 (156)
T KOG4586|consen 127 SVGRFLWIKFRSDSELEYQGFSAEYAIVP 155 (156)
T ss_pred ecCcEEEEEEcccchhhhcccceeeeccC
Confidence 99999999999999999999999997654
No 7
>KOG4260|consensus
Probab=98.22 E-value=5.4e-07 Score=69.09 Aligned_cols=99 Identities=20% Similarity=0.380 Sum_probs=66.3
Q ss_pred ccCcCCCCCCCCCCCCCC------------cEEEcCCCCCccccccccccccCcc--cccccCCCceEEEecCCeeeecc
Q psy2974 7 RIEDKCNNCNESPHDTLT------------VFSKIGDSFKSQDKDECMTNNGGCQ--HECRNTIGSYICSCHNGYTLLEN 72 (186)
Q Consensus 7 ~~~~~C~~c~~~~~~~~~------------Gy~~~~d~~~C~d~~eC~~~~~~C~--~~C~n~~g~~~C~C~~G~~l~~~ 72 (186)
...-+|..|+.+|.+.+. ||.+. -..|.|||||...+..|. +.|+|+.|+|.|.+.+||...
T Consensus 195 e~~lvCt~Ch~~C~~~Csg~~~k~C~kCkkGW~ld--e~gCvDvnEC~~ep~~c~~~qfCvNteGSf~C~dk~Gy~~g-- 270 (350)
T KOG4260|consen 195 EQHLVCTACHEGCLGVCSGESSKGCSKCKKGWKLD--EEGCVDVNECQNEPAPCKAHQFCVNTEGSFKCEDKEGYKKG-- 270 (350)
T ss_pred cccchhhhhhhhhhcccCCCCCCChhhhcccceec--ccccccHHHHhcCCCCCChhheeecCCCceEecccccccCC--
Confidence 456688889998885433 89887 345999999998877774 699999999999999999853
Q ss_pred cccccc--CCcc---eeecCCCCcccCCCCCCCCCCCCceEE
Q psy2974 73 GHDCKE--GGCK---YEITAPNGVIKTPNHPDYYPSKRECIW 109 (186)
Q Consensus 73 ~~~C~~--~~C~---~~~~~~~G~i~sp~~p~~y~~~~~C~w 109 (186)
...|.- ..|. ..+.+..|.+.+-++.+..-....|+|
T Consensus 271 ~d~C~~~~d~~~~kn~~c~ni~~~~r~v~f~~~~~~~g~cV~ 312 (350)
T KOG4260|consen 271 VDECQFCADVCASKNRPCMNIDGQYRCVCFSGLIIIEGFCVW 312 (350)
T ss_pred hHHhhhhhhhcccCCCCcccCCccEEEEecccceeeeeeeec
Confidence 334431 2221 224455555555444443333445555
No 8
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=97.97 E-value=4.9e-06 Score=63.59 Aligned_cols=44 Identities=39% Similarity=0.733 Sum_probs=39.4
Q ss_pred CCCCccccccccccccCcccccccCCCceEEEecCCeeeecccc
Q psy2974 31 DSFKSQDKDECMTNNGGCQHECRNTIGSYICSCHNGYTLLENGH 74 (186)
Q Consensus 31 d~~~C~d~~eC~~~~~~C~~~C~n~~g~~~C~C~~G~~l~~~~~ 74 (186)
++..|.++++|...++.|.+.|.+..|+|.|.|++||.+..+++
T Consensus 180 ~~~~C~~~~~C~~~~~~c~~~C~~~~g~~~c~c~~g~~~~~~~~ 223 (224)
T cd01475 180 QGKICVVPDLCATLSHVCQQVCISTPGSYLCACTEGYALLEDNK 223 (224)
T ss_pred ccccCcCchhhcCCCCCccceEEcCCCCEEeECCCCccCCCCCC
Confidence 56789999999988788999999999999999999999877654
No 9
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=97.70 E-value=2.2e-05 Score=38.06 Aligned_cols=21 Identities=48% Similarity=1.168 Sum_probs=19.1
Q ss_pred ceEEEecCCeeeecccccccc
Q psy2974 58 SYICSCHNGYTLLENGHDCKE 78 (186)
Q Consensus 58 ~~~C~C~~G~~l~~~~~~C~~ 78 (186)
||.|.|++||.+..++++|.+
T Consensus 1 sy~C~C~~Gy~l~~d~~~C~D 21 (24)
T PF12662_consen 1 SYTCSCPPGYQLSPDGRSCED 21 (24)
T ss_pred CEEeeCCCCCcCCCCCCcccc
Confidence 689999999999999999974
No 10
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=97.47 E-value=0.00013 Score=39.57 Aligned_cols=31 Identities=55% Similarity=1.170 Sum_probs=24.8
Q ss_pred ccccccccccCccc--ccccCCCceEEEecCCee
Q psy2974 37 DKDECMTNNGGCQH--ECRNTIGSYICSCHNGYT 68 (186)
Q Consensus 37 d~~eC~~~~~~C~~--~C~n~~g~~~C~C~~G~~ 68 (186)
++++|... ..|.+ .|++..++|.|.|++||.
T Consensus 1 d~~~C~~~-~~C~~~~~C~~~~g~~~C~C~~g~~ 33 (39)
T smart00179 1 DIDECASG-NPCQNGGTCVNTVGSYRCECPPGYT 33 (39)
T ss_pred CcccCcCC-CCcCCCCEeECCCCCeEeECCCCCc
Confidence 46778763 35654 899999999999999996
No 11
>KOG1214|consensus
Probab=97.46 E-value=0.00015 Score=63.66 Aligned_cols=48 Identities=38% Similarity=0.824 Sum_probs=40.7
Q ss_pred cCCCCCccccccccccccCcc--cccccCCCceEEEecCCeeeecccccc
Q psy2974 29 IGDSFKSQDKDECMTNNGGCQ--HECRNTIGSYICSCHNGYTLLENGHDC 76 (186)
Q Consensus 29 ~~d~~~C~d~~eC~~~~~~C~--~~C~n~~g~~~C~C~~G~~l~~~~~~C 76 (186)
.+|++.|.|.+||...+..|. ..|.|.+++|.|.|..||....++.+|
T Consensus 725 ~gdgr~c~d~~eca~~~~~CGp~s~Cin~pg~~rceC~~gy~F~dd~~tC 774 (1289)
T KOG1214|consen 725 QGDGRNCVDENECATGFHRCGPNSVCINLPGSYRCECRSGYEFADDRHTC 774 (1289)
T ss_pred CCCCCCCCChhhhccCCCCCCCCceeecCCCceeEEEeecceeccCCcce
Confidence 368899999999998777885 589999999999999999877766555
No 12
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=97.30 E-value=7.7e-05 Score=36.16 Aligned_cols=16 Identities=25% Similarity=0.235 Sum_probs=15.4
Q ss_pred cEEEcCCCCCcccccc
Q psy2974 25 VFSKIGDSFKSQDKDE 40 (186)
Q Consensus 25 Gy~~~~d~~~C~d~~e 40 (186)
||++.+|++.|.||||
T Consensus 9 Gy~l~~d~~~C~DIdE 24 (24)
T PF12662_consen 9 GYQLSPDGRSCEDIDE 24 (24)
T ss_pred CCcCCCCCCccccCCC
Confidence 9999999999999997
No 13
>KOG4289|consensus
Probab=97.06 E-value=0.0028 Score=58.96 Aligned_cols=86 Identities=21% Similarity=0.367 Sum_probs=57.3
Q ss_pred CCcEEEcCCCCCcc-ccccccccccCcccccccCCCceEEEecCCeeeec-----cccccccCCc--ceeecC-CCCccc
Q psy2974 23 LTVFSKIGDSFKSQ-DKDECMTNNGGCQHECRNTIGSYICSCHNGYTLLE-----NGHDCKEGGC--KYEITA-PNGVIK 93 (186)
Q Consensus 23 ~~Gy~~~~d~~~C~-d~~eC~~~~~~C~~~C~n~~g~~~C~C~~G~~l~~-----~~~~C~~~~C--~~~~~~-~~G~i~ 93 (186)
++||. +.+|+ ++++|-..+....+.|+...|+|.|.|.+||.... ....|.++.| ++.+.+ ..|.+.
T Consensus 1227 PpGFT----gd~CeTeiDlCYs~pC~nng~C~srEggYtCeCrpg~tGehCEvs~~agrCvpGvC~nggtC~~~~nggf~ 1302 (2531)
T KOG4289|consen 1227 PPGFT----GDYCETEIDLCYSGPCGNNGRCRSREGGYTCECRPGFTGEHCEVSARAGRCVPGVCKNGGTCVNLLNGGFC 1302 (2531)
T ss_pred CCCCC----cccccchhHhhhcCCCCCCCceEEecCceeEEecCCccccceeeecccCccccceecCCCEEeecCCCcee
Confidence 34887 56884 78999886544468999999999999999997211 1123445555 345543 447788
Q ss_pred CCCCCCCCCCCCceEEEEEc
Q psy2974 94 TPNHPDYYPSKRECIWHFTT 113 (186)
Q Consensus 94 sp~~p~~y~~~~~C~w~i~~ 113 (186)
+.+.++.| ....|.-.-..
T Consensus 1303 c~Cp~ge~-e~prC~v~trS 1321 (2531)
T KOG4289|consen 1303 CHCPYGEF-EDPRCEVTTRS 1321 (2531)
T ss_pred ccCCCccc-CCCceEEEeec
Confidence 88877754 34566655444
No 14
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=97.02 E-value=0.00063 Score=36.49 Aligned_cols=29 Identities=55% Similarity=1.263 Sum_probs=20.9
Q ss_pred ccccccCcc--cccccCCCceEEEecCCeee
Q psy2974 41 CMTNNGGCQ--HECRNTIGSYICSCHNGYTL 69 (186)
Q Consensus 41 C~~~~~~C~--~~C~n~~g~~~C~C~~G~~l 69 (186)
|...++.|. ..|.++.++|.|.|.+||..
T Consensus 1 C~~~~~~C~~nA~C~~~~~~~~C~C~~Gy~G 31 (36)
T PF12947_consen 1 CLENNGGCHPNATCTNTGGSYTCTCKPGYEG 31 (36)
T ss_dssp TTTGGGGS-TTCEEEE-TTSEEEEE-CEEEC
T ss_pred CCCCCCCCCCCcEeecCCCCEEeECCCCCcc
Confidence 344556674 68999999999999999984
No 15
>KOG1214|consensus
Probab=96.79 E-value=0.00083 Score=59.22 Aligned_cols=48 Identities=40% Similarity=0.881 Sum_probs=39.4
Q ss_pred cEEEcCCCCCccccccccccccCcc--cccccCCCceEEEecCCeeeecccccccc
Q psy2974 25 VFSKIGDSFKSQDKDECMTNNGGCQ--HECRNTIGSYICSCHNGYTLLENGHDCKE 78 (186)
Q Consensus 25 Gy~~~~d~~~C~d~~eC~~~~~~C~--~~C~n~~g~~~C~C~~G~~l~~~~~~C~~ 78 (186)
|| ++||..|.|++||.. ..|. ..|.|++++|.|.|.+||. .++..|.+
T Consensus 816 Gf--sGDG~~c~dvDeC~p--srChp~A~CyntpgsfsC~C~pGy~--GDGf~CVP 865 (1289)
T KOG1214|consen 816 GF--SGDGHQCTDVDECSP--SRCHPAATCYNTPGSFSCRCQPGYY--GDGFQCVP 865 (1289)
T ss_pred Cc--cCCccccccccccCc--cccCCCceEecCCCcceeecccCcc--CCCceecC
Confidence 65 478999999999985 4564 6999999999999999997 45667764
No 16
>KOG1219|consensus
Probab=96.65 E-value=0.0029 Score=61.27 Aligned_cols=91 Identities=20% Similarity=0.406 Sum_probs=62.6
Q ss_pred CCccCcCCCCCCCCCCCCCC-cEEE----cCCCCCcc-ccccccccccCcccccccCCCceEEEecCCeeeecccccccc
Q psy2974 5 DPRIEDKCNNCNESPHDTLT-VFSK----IGDSFKSQ-DKDECMTNNGGCQHECRNTIGSYICSCHNGYTLLENGHDCKE 78 (186)
Q Consensus 5 ~~~~~~~C~~c~~~~~~~~~-Gy~~----~~d~~~C~-d~~eC~~~~~~C~~~C~n~~g~~~C~C~~G~~l~~~~~~C~~ 78 (186)
|++-.--|-+ .+.|..++. ||.- .-.|+.|+ +++.|..+|..-.++|....++|.|.|+.||. |..|+.
T Consensus 3865 d~C~~npCqh-gG~C~~~~~ggy~CkCpsqysG~~CEi~~epC~snPC~~GgtCip~~n~f~CnC~~gyT----G~~Ce~ 3939 (4289)
T KOG1219|consen 3865 DPCNDNPCQH-GGTCISQPKGGYKCKCPSQYSGNHCEIDLEPCASNPCLTGGTCIPFYNGFLCNCPNGYT----GKRCEA 3939 (4289)
T ss_pred cccccCcccC-CCEecCCCCCceEEeCcccccCcccccccccccCCCCCCCCEEEecCCCeeEeCCCCcc----Cceeec
Confidence 4455555632 456666655 6622 22377784 67779887655578999999999999999997 556653
Q ss_pred --------CCc--ceeecCCCCcccCCCCCCC
Q psy2974 79 --------GGC--KYEITAPNGVIKTPNHPDY 100 (186)
Q Consensus 79 --------~~C--~~~~~~~~G~i~sp~~p~~ 100 (186)
..| ++.|.+..|++.|-+-|++
T Consensus 3940 ~Gi~eCs~n~C~~gg~C~n~~gsf~CncT~g~ 3971 (4289)
T KOG1219|consen 3940 RGISECSKNVCGTGGQCINIPGSFHCNCTPGI 3971 (4289)
T ss_pred ccccccccccccCCceeeccCCceEeccChhH
Confidence 334 3567888999988777764
No 17
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=96.54 E-value=0.0029 Score=33.57 Aligned_cols=30 Identities=53% Similarity=1.109 Sum_probs=23.3
Q ss_pred cccccccccCc--ccccccCCCceEEEecCCee
Q psy2974 38 KDECMTNNGGC--QHECRNTIGSYICSCHNGYT 68 (186)
Q Consensus 38 ~~eC~~~~~~C--~~~C~n~~g~~~C~C~~G~~ 68 (186)
+++|... ..| .+.|.+..++|.|.|+.||.
T Consensus 2 ~~~C~~~-~~C~~~~~C~~~~~~~~C~C~~g~~ 33 (38)
T cd00054 2 IDECASG-NPCQNGGTCVNTVGSYRCSCPPGYT 33 (38)
T ss_pred cccCCCC-CCcCCCCEeECCCCCeEeECCCCCc
Confidence 5667652 245 46899999999999999986
No 18
>KOG4292|consensus
Probab=96.37 E-value=0.0045 Score=51.38 Aligned_cols=45 Identities=24% Similarity=0.525 Sum_probs=40.3
Q ss_pred CCcceeecCCCCcccCCCCCCCCCCCCceEEEEEcCCCCEEEecc
Q psy2974 79 GGCKYEITAPNGVIKTPNHPDYYPSKRECIWHFTTTPGHRIKLNP 123 (186)
Q Consensus 79 ~~C~~~~~~~~G~i~sp~~p~~y~~~~~C~w~i~~~~g~~i~l~~ 123 (186)
..|+...+...|.+.||+||..|+++..|+|.+.++.++.|.+.|
T Consensus 290 ~~C~~~~~~~~G~~~Sp~yp~~y~~~l~Ct~~l~~~~g~~l~v~f 334 (454)
T KOG4292|consen 290 SDCNRTYHGKFGNLTSPGYPQNYPNNLDCTTHLTVPGGHTLVVFF 334 (454)
T ss_pred cccccceecceeEEcCCCCcccCCCCCcceEEEEcCCCCEEEEEe
Confidence 457777777789999999999999999999999999999998775
No 19
>smart00181 EGF Epidermal growth factor-like domain.
Probab=96.29 E-value=0.0043 Score=32.59 Aligned_cols=24 Identities=58% Similarity=1.142 Sum_probs=20.5
Q ss_pred cCccc-ccccCCCceEEEecCCeee
Q psy2974 46 GGCQH-ECRNTIGSYICSCHNGYTL 69 (186)
Q Consensus 46 ~~C~~-~C~n~~g~~~C~C~~G~~l 69 (186)
..|.+ .|++..++|.|.|++||.+
T Consensus 6 ~~C~~~~C~~~~~~~~C~C~~g~~g 30 (35)
T smart00181 6 GPCSNGTCINTPGSYTCSCPPGYTG 30 (35)
T ss_pred CCCCCCEEECCCCCeEeECCCCCcc
Confidence 35665 8999999999999999975
No 20
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=95.66 E-value=0.013 Score=30.44 Aligned_cols=19 Identities=42% Similarity=1.020 Sum_probs=17.3
Q ss_pred cccccCC-CceEEEecCCee
Q psy2974 50 HECRNTI-GSYICSCHNGYT 68 (186)
Q Consensus 50 ~~C~n~~-g~~~C~C~~G~~ 68 (186)
++|+... ++|.|.|++||.
T Consensus 10 g~C~~~~~~~y~C~C~~G~~ 29 (32)
T PF00008_consen 10 GTCIDLPGGGYTCECPPGYT 29 (32)
T ss_dssp EEEEEESTSEEEEEEBTTEE
T ss_pred eEEEeCCCCCEEeECCCCCc
Confidence 5899888 999999999996
No 21
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=95.27 E-value=0.02 Score=29.65 Aligned_cols=24 Identities=50% Similarity=0.956 Sum_probs=19.8
Q ss_pred cCcc--cccccCCCceEEEecCCeee
Q psy2974 46 GGCQ--HECRNTIGSYICSCHNGYTL 69 (186)
Q Consensus 46 ~~C~--~~C~n~~g~~~C~C~~G~~l 69 (186)
..|. +.|++..++|.|.|+.||..
T Consensus 6 ~~C~~~~~C~~~~~~~~C~C~~g~~g 31 (36)
T cd00053 6 NPCSNGGTCVNTPGSYRCVCPPGYTG 31 (36)
T ss_pred CCCCCCCEEecCCCCeEeECCCCCcc
Confidence 3454 68999999999999999974
No 22
>KOG1219|consensus
Probab=94.86 E-value=0.016 Score=56.58 Aligned_cols=40 Identities=28% Similarity=0.591 Sum_probs=32.8
Q ss_pred cEEEcCCCCCcc--ccccccccccCcccccccCCCceEEEecCCee
Q psy2974 25 VFSKIGDSFKSQ--DKDECMTNNGGCQHECRNTIGSYICSCHNGYT 68 (186)
Q Consensus 25 Gy~~~~d~~~C~--d~~eC~~~~~~C~~~C~n~~g~~~C~C~~G~~ 68 (186)
||. |..|+ +++||..+...-.+.|.|.+|+|+|.|-+||.
T Consensus 3931 gyT----G~~Ce~~Gi~eCs~n~C~~gg~C~n~~gsf~CncT~g~~ 3972 (4289)
T KOG1219|consen 3931 GYT----GKRCEARGISECSKNVCGTGGQCINIPGSFHCNCTPGIL 3972 (4289)
T ss_pred Ccc----CceeecccccccccccccCCceeeccCCceEeccChhHh
Confidence 565 77785 48899986544468999999999999999997
No 23
>PF02408 CUB_2: CUB-like domain; InterPro: IPR003366 This domain is found in a family of hypothetical Caenorhabditis elegans proteins. The aligned region has no known function nor do any of the proteins which possess it. However, this domain is related to the CUB domain (IPR000859 from INTERPRO). The aligned region is approximately 130 amino acids long and contains two conserved cysteine residues.
Probab=93.29 E-value=0.42 Score=32.79 Aligned_cols=25 Identities=16% Similarity=0.513 Sum_probs=21.6
Q ss_pred CCCCCCceEEEEEcCCCCEEEeccC
Q psy2974 100 YYPSKRECIWHFTTTPGHRIKLNPT 124 (186)
Q Consensus 100 ~y~~~~~C~w~i~~~~g~~i~l~~~ 124 (186)
.++.+..|.|.|.+|.|..++|.+.
T Consensus 37 ~~p~n~~C~y~i~iP~G~~a~v~~~ 61 (120)
T PF02408_consen 37 QFPANQNCTYQINIPKGYYAKVTLS 61 (120)
T ss_pred ccCCCCceEEEEEcCCceEEEEEEE
Confidence 3678899999999999999888754
No 24
>KOG4292|consensus
Probab=92.63 E-value=0.21 Score=41.79 Aligned_cols=71 Identities=27% Similarity=0.417 Sum_probs=49.2
Q ss_pred eeec-CCCCcccCCCCCCCCCCCCceEEEEEcCCCCEEEeccCce------eeeCCeEEEEEEeCCCCCCCCcEEEEE
Q psy2974 83 YEIT-APNGVIKTPNHPDYYPSKRECIWHFTTTPGHRIKLNPTDI------ISISEGLLVRFRSDDTVVGKGFSASYI 153 (186)
Q Consensus 83 ~~~~-~~~G~i~sp~~p~~y~~~~~C~w~i~~~~g~~i~l~~~~~------~s~~~~l~i~f~sd~~~~~~GF~~~y~ 153 (186)
.++. +..|.+.+|++|..|.....|.|.+.+++++++.|++-.+ ..-.......|.++.+....++...+.
T Consensus 171 ~t~~~q~sg~~~~~~~p~~~v~s~~c~Wli~t~p~q~l~irv~~~~~~pt~n~c~s~~L~i~m~~~s~~~~~~~~~~~ 248 (454)
T KOG4292|consen 171 ITFTTQNSGSIPSPNDPDTYVASSICNWLIDTPPGQRLYIRVVHLHLAPTINECESASLTIFMSYKSEKYGGHGNSRP 248 (454)
T ss_pred eeecCcccccccCCCCcccccccchhhhhhhcCCCcceeEEEEeccCCcccccccceeEEEEeccCCcccccccCcCh
Confidence 3443 4459999999999999999999999999999998885432 111223444566666666655544433
No 25
>PF09064 Tme5_EGF_like: Thrombomodulin like fifth domain, EGF-like; InterPro: IPR015149 This domain adopts a fold similar to other EGF domains, with a flat major and a twisted minor beta sheet. Disulphide pairing, however, is not of the usual 1-3, 2-4, 5-6 type; rather 1-2, 3-4, 5-6 pairing is found. Its extended major sheet (strands beta-2 and beta-3 and the connecting loop) projects into thrombin's active site groove. This domain is required for interaction of thrombomodulin with thrombin, and subsequent activation of protein-C []. ; GO: 0004888 transmembrane signaling receptor activity, 0016021 integral to membrane
Probab=91.92 E-value=0.16 Score=26.50 Aligned_cols=26 Identities=31% Similarity=0.703 Sum_probs=17.9
Q ss_pred cCcccccccCCCceEEEecCCeeeecc
Q psy2974 46 GGCQHECRNTIGSYICSCHNGYTLLEN 72 (186)
Q Consensus 46 ~~C~~~C~n~~g~~~C~C~~G~~l~~~ 72 (186)
..|.+.|..- ....|.||.||.+..+
T Consensus 6 t~CpA~CDpn-~~~~C~CPeGyIlde~ 31 (34)
T PF09064_consen 6 TECPADCDPN-SPGQCFCPEGYILDEG 31 (34)
T ss_pred ccCCCccCCC-CCCceeCCCceEecCC
Confidence 3567777532 2337999999998754
No 26
>PF12661 hEGF: Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=91.15 E-value=0.2 Score=20.43 Aligned_cols=9 Identities=44% Similarity=1.250 Sum_probs=6.8
Q ss_pred EEEecCCee
Q psy2974 60 ICSCHNGYT 68 (186)
Q Consensus 60 ~C~C~~G~~ 68 (186)
.|.|++||.
T Consensus 1 ~C~C~~G~~ 9 (13)
T PF12661_consen 1 TCQCPPGWT 9 (13)
T ss_dssp EEEE-TTEE
T ss_pred CccCcCCCc
Confidence 489999997
No 27
>PF12946 EGF_MSP1_1: MSP1 EGF domain 1; InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=76.47 E-value=3.7 Score=21.96 Aligned_cols=21 Identities=33% Similarity=0.585 Sum_probs=16.6
Q ss_pred cccccCC-CceEEEecCCeeee
Q psy2974 50 HECRNTI-GSYICSCHNGYTLL 70 (186)
Q Consensus 50 ~~C~n~~-g~~~C~C~~G~~l~ 70 (186)
..|.+.. |+..|.|.+||...
T Consensus 11 A~C~~~~dG~eecrCllgyk~~ 32 (37)
T PF12946_consen 11 AGCFRYDDGSEECRCLLGYKKV 32 (37)
T ss_dssp EEEEEETTSEEEEEE-TTEEEE
T ss_pred cccEEcCCCCEEEEeeCCcccc
Confidence 5787766 99999999999865
No 28
>KOG4289|consensus
Probab=67.37 E-value=4.6 Score=39.09 Aligned_cols=49 Identities=18% Similarity=0.352 Sum_probs=34.0
Q ss_pred cCCCceEEEecCCeeeeccc---cccccCCcc--eeecCCCCcccCCCCCCCCC
Q psy2974 54 NTIGSYICSCHNGYTLLENG---HDCKEGGCK--YEITAPNGVIKTPNHPDYYP 102 (186)
Q Consensus 54 n~~g~~~C~C~~G~~l~~~~---~~C~~~~C~--~~~~~~~G~i~sp~~p~~y~ 102 (186)
+..+++.|.||+||....-. ..|-.+.|+ +.+....|.++|.++|++-.
T Consensus 1217 ~pvnglrCrCPpGFTgd~CeTeiDlCYs~pC~nng~C~srEggYtCeCrpg~tG 1270 (2531)
T KOG4289|consen 1217 HPVNGLRCRCPPGFTGDYCETEIDLCYSGPCGNNGRCRSREGGYTCECRPGFTG 1270 (2531)
T ss_pred cccCceeEeCCCCCCcccccchhHhhhcCCCCCCCceEEecCceeEEecCCccc
Confidence 44588999999999843211 223345665 44777889999999988754
No 29
>PF06247 Plasmod_Pvs28: Plasmodium ookinete surface protein Pvs28; InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=62.08 E-value=7.6 Score=28.86 Aligned_cols=47 Identities=32% Similarity=0.606 Sum_probs=29.2
Q ss_pred cEEEcCCCCCcccccccccc---ccCcc--cccccCC-----CceEEEecCCeeeecc
Q psy2974 25 VFSKIGDSFKSQDKDECMTN---NGGCQ--HECRNTI-----GSYICSCHNGYTLLEN 72 (186)
Q Consensus 25 Gy~~~~d~~~C~d~~eC~~~---~~~C~--~~C~n~~-----g~~~C~C~~G~~l~~~ 72 (186)
||++... .+|+...+|... .-.|. ..|.+.. ..|.|.|.+||.+..+
T Consensus 27 gfvl~~E-ntCE~kv~C~~~e~~~K~Cgdya~C~~~~~~~~~~~~~C~C~~gY~~~~~ 83 (197)
T PF06247_consen 27 GFVLKNE-NTCEEKVECDKLENVNKPCGDYAKCINQANKGEERAYKCDCINGYILKQG 83 (197)
T ss_dssp TEEEEET-TEEEE----SG-GGTTSEEETTEEEEE-SSTTSSTSEEEEE-TTEEESSS
T ss_pred CcEEccc-cccccceecCcccccCccccchhhhhcCCCcccceeEEEecccCceeeCC
Confidence 8888744 469999998762 23564 5777655 5799999999998754
No 30
>KOG4260|consensus
Probab=61.49 E-value=5.9 Score=31.21 Aligned_cols=33 Identities=33% Similarity=0.776 Sum_probs=24.5
Q ss_pred cccccccccCcc---cccccCCCceEEEecCCeeee
Q psy2974 38 KDECMTNNGGCQ---HECRNTIGSYICSCHNGYTLL 70 (186)
Q Consensus 38 ~~eC~~~~~~C~---~~C~n~~g~~~C~C~~G~~l~ 70 (186)
.++|..-...|. ..|.|+.++|.|.|..|+...
T Consensus 271 ~d~C~~~~d~~~~kn~~c~ni~~~~r~v~f~~~~~~ 306 (350)
T KOG4260|consen 271 VDECQFCADVCASKNRPCMNIDGQYRCVCFSGLIII 306 (350)
T ss_pred hHHhhhhhhhcccCCCCcccCCccEEEEecccceee
Confidence 677765223442 579999999999999998754
No 31
>KOG1217|consensus
Probab=59.97 E-value=24 Score=29.28 Aligned_cols=52 Identities=29% Similarity=0.555 Sum_probs=29.6
Q ss_pred cccccCCCceEEEecCCeeeec-----cccccccCC-cc--eeecCCCCcccCCCCCCCC
Q psy2974 50 HECRNTIGSYICSCHNGYTLLE-----NGHDCKEGG-CK--YEITAPNGVIKTPNHPDYY 101 (186)
Q Consensus 50 ~~C~n~~g~~~C~C~~G~~l~~-----~~~~C~~~~-C~--~~~~~~~G~i~sp~~p~~y 101 (186)
+.|.+..++|.|.|++||.+.. +...|.... |. +.+....+.+.+-+.|++.
T Consensus 243 ~~c~~~~~~~~C~~~~g~~~~~~~~~~~~~~C~~~~~c~~~~~C~~~~~~~~C~C~~g~~ 302 (487)
T KOG1217|consen 243 GTCVNTVGSYTCRCPEGYTGDACVTCVDVDSCALIASCPNGGTCVNVPGSYRCTCPPGFT 302 (487)
T ss_pred CcccccCCceeeeCCCCccccccceeeeccccCCCCccCCCCeeecCCCcceeeCCCCCC
Confidence 5677777888888888887665 334454322 43 2444444444444444443
No 32
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=54.28 E-value=15 Score=18.83 Aligned_cols=17 Identities=35% Similarity=0.841 Sum_probs=12.0
Q ss_pred cccccCCCceEEEecCCee
Q psy2974 50 HECRNTIGSYICSCHNGYT 68 (186)
Q Consensus 50 ~~C~n~~g~~~C~C~~G~~ 68 (186)
++|... ...|.|.+||.
T Consensus 12 G~C~~~--~g~C~C~~g~~ 28 (32)
T PF07974_consen 12 GTCVSP--CGRCVCDSGYT 28 (32)
T ss_pred CEEeCC--CCEEECCCCCc
Confidence 456543 55799999986
No 33
>KOG1215|consensus
Probab=53.66 E-value=12 Score=34.48 Aligned_cols=45 Identities=29% Similarity=0.784 Sum_probs=37.4
Q ss_pred CccccccccccccCcccccccCCCceEEEecCCeeeecccccccc
Q psy2974 34 KSQDKDECMTNNGGCQHECRNTIGSYICSCHNGYTLLENGHDCKE 78 (186)
Q Consensus 34 ~C~d~~eC~~~~~~C~~~C~n~~g~~~C~C~~G~~l~~~~~~C~~ 78 (186)
.....+.|...++.|.+.|...+....|.|+.|+.+..+++.|..
T Consensus 666 ~~~~~n~C~~~n~~c~~KOG~~p~~~~c~c~~~~~l~~~~~~C~~ 710 (877)
T KOG1215|consen 666 RPTGVNPCESSNGGCSQLCLPRPQGSTCACPEGYRLSPDGKSCSS 710 (877)
T ss_pred CCCCCCcccccCCCCCeeeecCCCCCeeeCCCCCeecCCCCeecC
Confidence 345667787766789999998887779999999999999998864
No 34
>KOG1217|consensus
Probab=52.67 E-value=11 Score=31.39 Aligned_cols=38 Identities=34% Similarity=0.656 Sum_probs=30.6
Q ss_pred CCccccccccccccCc--ccccccCCCceEEEecCCeeeec
Q psy2974 33 FKSQDKDECMTNNGGC--QHECRNTIGSYICSCHNGYTLLE 71 (186)
Q Consensus 33 ~~C~d~~eC~~~~~~C--~~~C~n~~g~~~C~C~~G~~l~~ 71 (186)
..|.++++|..... | .+.|++..+.|.|.|++||....
T Consensus 266 ~~~~~~~~C~~~~~-c~~~~~C~~~~~~~~C~C~~g~~g~~ 305 (487)
T KOG1217|consen 266 VTCVDVDSCALIAS-CPNGGTCVNVPGSYRCTCPPGFTGRL 305 (487)
T ss_pred ceeeeccccCCCCc-cCCCCeeecCCCcceeeCCCCCCCCC
Confidence 35778999988653 5 46999999999999999998554
No 35
>KOG1215|consensus
Probab=52.31 E-value=45 Score=30.87 Aligned_cols=36 Identities=36% Similarity=0.951 Sum_probs=29.5
Q ss_pred cccccccccCcccccc-cCCCceEEEecCCeeeeccc
Q psy2974 38 KDECMTNNGGCQHECR-NTIGSYICSCHNGYTLLENG 73 (186)
Q Consensus 38 ~~eC~~~~~~C~~~C~-n~~g~~~C~C~~G~~l~~~~ 73 (186)
.+.|...++.|.+.|. +.++.+.|.|..||.+..+.
T Consensus 364 ~~~~~~~~g~Csq~C~~~~p~~~~c~c~~g~~~~~~~ 400 (877)
T KOG1215|consen 364 SNPCESDNGGCSQLCVPNSPGTFKCACSPGYELRLDK 400 (877)
T ss_pred CCcccccCCccceeccCCCCCceeEecCCCcEeccCC
Confidence 3455666789999999 67899999999999987666
No 36
>PF05428 CRF-BP: Corticotropin-releasing factor binding protein (CRF-BP); InterPro: IPR008435 This family consists of several eukaryotic corticotropin-releasing factor binding proteins (CRF-BP or CRH-BP). Corticotropin-releasing hormone (CRH) plays multiple roles in vertebrate species. In mammals, it is the major hypothalamic releasing factor for pituitary adrenocorticotropin secretion, and is a neurotransmitter or neuromodulator at other sites in the central nervous system. In non-mammalian vertebrates, CRH not only acts as a neurotransmitter and hypophysiotropin, it also acts as a potent thyrotropin-releasing factor, allowing CRH to regulate both the adrenal and thyroid axes, especially in development. CRH-BP is thought to play an inhibitory role in which it binds CRH and other CRH-like ligands and prevents the activation of CRH receptors. There is however evidence that CRH-BP may also exhibit diverse extra and intracellular roles in a cell specific fashion and at specific times in development [].
Probab=46.10 E-value=1.4e+02 Score=24.04 Aligned_cols=31 Identities=23% Similarity=0.329 Sum_probs=23.6
Q ss_pred eeeeCCeEEEEEEeCCCCCCCCcEEEEEEEeCC
Q psy2974 126 IISISEGLLVRFRSDDTVVGKGFSASYIAIDTQ 158 (186)
Q Consensus 126 ~~s~~~~l~i~f~sd~~~~~~GF~~~y~~~~~~ 158 (186)
+.|+.|...|.|+-. ..+.||.++.+.....
T Consensus 150 ~rSSQNvAmi~fRip--~~GsgFt~~vR~~~Np 180 (311)
T PF05428_consen 150 FRSSQNVAMIQFRIP--SPGSGFTLTVRFIKNP 180 (311)
T ss_pred eEeccceEEEEEEec--CCCCceEEEEEeCCCC
Confidence 688899999999643 4678899888876544
No 37
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=45.29 E-value=16 Score=25.42 Aligned_cols=36 Identities=31% Similarity=0.819 Sum_probs=23.1
Q ss_pred ccccccc-ccCc-ccccccCC--CceEEEecCCeeeeccccccc
Q psy2974 38 KDECMTN-NGGC-QHECRNTI--GSYICSCHNGYTLLENGHDCK 77 (186)
Q Consensus 38 ~~eC~~~-~~~C-~~~C~n~~--g~~~C~C~~G~~l~~~~~~C~ 77 (186)
+..|... .+.| .+.|.-.+ ..+.|.|..||. |..|+
T Consensus 42 i~~Cp~ey~~YClHG~C~yI~dl~~~~CrC~~GYt----GeRCE 81 (139)
T PHA03099 42 IRLCGPEGDGYCLHGDCIHARDIDGMYCRCSHGYT----GIRCQ 81 (139)
T ss_pred cccCChhhCCEeECCEEEeeccCCCceeECCCCcc----ccccc
Confidence 4445432 2345 34676554 688999999997 66675
No 38
>KOG0196|consensus
Probab=43.75 E-value=1.5e+02 Score=27.52 Aligned_cols=91 Identities=21% Similarity=0.423 Sum_probs=44.4
Q ss_pred EEEecCCeeeecccccccc------------CC---ccee-ecCCCCcccCCCCCCCCC-----CCCceEEEEEcCCCCE
Q psy2974 60 ICSCHNGYTLLENGHDCKE------------GG---CKYE-ITAPNGVIKTPNHPDYYP-----SKRECIWHFTTTPGHR 118 (186)
Q Consensus 60 ~C~C~~G~~l~~~~~~C~~------------~~---C~~~-~~~~~G~i~sp~~p~~y~-----~~~~C~w~i~~~~g~~ 118 (186)
.|.|.+||.-...+..|+. .. |-.. .....|.-.|.|-.++|. +...|+ .|+...
T Consensus 260 ~C~C~aGye~~~~~~~C~aCp~G~yK~~~~~~~C~~CP~~S~s~~ega~~C~C~~gyyRA~~Dp~~mpCT----~PPSaP 335 (996)
T KOG0196|consen 260 GCVCKAGYEEAENGKACQACPPGTYKASQGDSLCLPCPPNSHSSSEGATSCTCENGYYRADSDPPSMPCT----RPPSAP 335 (996)
T ss_pred ceeecCCCCcccCCCcceeCCCCcccCCCCCCCCCCCCCCCCCCCCCCCcccccCCcccCCCCCCCCCCC----CCCCcc
Confidence 4888888887666667752 11 2111 223445556666666662 334454 333333
Q ss_pred EEeccCceeeeCCeEEEEEEeCCCCCCCCcEEEEEEEeCC
Q psy2974 119 IKLNPTDIISISEGLLVRFRSDDTVVGKGFSASYIAIDTQ 158 (186)
Q Consensus 119 i~l~~~~~~s~~~~l~i~f~sd~~~~~~GF~~~y~~~~~~ 158 (186)
..|.+ ...+..+.|.+..-....++. -++|..+-..
T Consensus 336 ~nlis---~vn~Ts~~L~W~~P~d~GGR~-Di~y~v~Ck~ 371 (996)
T KOG0196|consen 336 RNLIS---NVNGTSLILEWSPPADTGGRE-DITYNVICKK 371 (996)
T ss_pred ceeee---ecccceEEEEecCCcccCCCc-ceEEEEEeec
Confidence 33321 123334555554444444433 4555554433
No 39
>PF00954 S_locus_glycop: S-locus glycoprotein family; InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=29.91 E-value=41 Score=22.29 Aligned_cols=19 Identities=32% Similarity=0.623 Sum_probs=13.0
Q ss_pred cccccCCCceEEEecCCeee
Q psy2974 50 HECRNTIGSYICSCHNGYTL 69 (186)
Q Consensus 50 ~~C~n~~g~~~C~C~~G~~l 69 (186)
+.|. ....-.|.|.+||..
T Consensus 90 g~C~-~~~~~~C~Cl~GF~P 108 (110)
T PF00954_consen 90 GICN-SNNSPKCSCLPGFEP 108 (110)
T ss_pred cEeC-CCCCCceECCCCcCC
Confidence 4663 344556999999963
No 40
>PF01826 TIL: Trypsin Inhibitor like cysteine rich domain; InterPro: IPR002919 This domain is found in proteinase inhibitors as well as in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that form the disulphide bonds are 1-7, 2-6, 3-5, 4-10 and 8-9. This inhibitor domain belongs to MEROPS inhibitor family I8 (clan IA). Proteins containing this domain inhibit peptidases belonging to families S1 (IPR001254 from INTERPRO), S8 (IPR000209 from INTERPRO), and M4 (IPR001570 from INTERPRO) [] and are restricted to the chordata, nematoda, arthropoda and echinodermata. Examples of proteins containing this domain are: chymotrypsin/elastase inhibitor from Ascaris suum (pig roundworm) Acp62F protein from Drosophila melanogaster Bombina trypsin inhibitor from Bombina maxima (large-webbed bell toad) Bombyx subtilisin inhibitor from Bombyx mori (silk moth) von Willebrand factor ; PDB: 2P3F_N 1HX2_A 1CCV_A 1EAI_D 2H9E_C 1COU_A 1ATE_A 1ATB_A 1ATD_A 1ATA_A ....
Probab=27.82 E-value=46 Score=18.91 Aligned_cols=16 Identities=38% Similarity=0.874 Sum_probs=12.3
Q ss_pred EEecCCeeeeccccccc
Q psy2974 61 CSCHNGYTLLENGHDCK 77 (186)
Q Consensus 61 C~C~~G~~l~~~~~~C~ 77 (186)
|.|++||.+..+ ..|.
T Consensus 35 C~C~~G~v~~~~-~~CV 50 (55)
T PF01826_consen 35 CFCPPGYVRNDN-GRCV 50 (55)
T ss_dssp EEETTTEEEETT-SEEE
T ss_pred CCCCCCeeEcCC-CCEE
Confidence 999999987755 4564
No 41
>PF02014 Reeler: Reeler domain Schematic picture including Reeler domain; InterPro: IPR002861 Extracellular matrix (ECM) proteins play an important role in early cortical development, specifically in the formation of neural connections and in controlling the cyto-architecture of the central nervous system. The product of the reeler gene in mouse is reelin,a large extracellular protein secreted by pioneer neurons that coordinates cell positioning during neurodevelopment []. F-spondin and mindin are a family of matrix-attached adhesion molecules that share structural similarities and overlapping domains of expression. Both F-spondin and mindin promote adhesion and outgrowth of hippocampal embryonic neurons and bind to a putative receptor(s) expressed on both hippocampal and sensory neurons []. This domain of unknown function is found at the N terminus of reelin and F-spondin.; PDB: 2ZOT_B 2ZOU_B 3COO_A.
Probab=25.51 E-value=1.3e+02 Score=20.53 Aligned_cols=42 Identities=12% Similarity=0.064 Sum_probs=27.6
Q ss_pred ceeeeCCeEEEEEEeCCCCCCCCcEEEEEEEeCCC---Cceeeec
Q psy2974 125 DIISISEGLLVRFRSDDTVVGKGFSASYIAIDTQG---SKEFSEI 166 (186)
Q Consensus 125 ~~~s~~~~l~i~f~sd~~~~~~GF~~~y~~~~~~~---~~~~~~~ 166 (186)
.....+..+.|...+......+||.++.+...... .+.|.-.
T Consensus 28 ~~y~pg~~~~Vtl~~~~~~~F~GFllqAr~~~~~~~~~vG~f~~~ 72 (132)
T PF02014_consen 28 SSYEPGQTYTVTLSSSGSSSFRGFLLQARDANNSSPAPVGTFQLP 72 (132)
T ss_dssp SSB-TTBEEEEEEEETTTEEBSEEEEEEEETT--SSSBSSEEEES
T ss_pred CeEcCCCEEEEEEECCCCCceeEEEEEEEeCCCCCcccccccCcC
Confidence 34556666777777788888999999988766653 4666444
No 42
>PHA02887 EGF-like protein; Provisional
Probab=22.52 E-value=68 Score=21.97 Aligned_cols=24 Identities=46% Similarity=1.035 Sum_probs=16.7
Q ss_pred cccccCC--CceEEEecCCeeeeccccccc
Q psy2974 50 HECRNTI--GSYICSCHNGYTLLENGHDCK 77 (186)
Q Consensus 50 ~~C~n~~--g~~~C~C~~G~~l~~~~~~C~ 77 (186)
+.|.-.. ....|.|+.||. |..|.
T Consensus 97 G~C~yI~dL~epsCrC~~GYt----G~RCE 122 (126)
T PHA02887 97 GECMNIIDLDEKFCICNKGYT----GIRCD 122 (126)
T ss_pred CEEEccccCCCceeECCCCcc----cCCCC
Confidence 4665444 567899999997 65564
No 43
>PF13897 GOLD_2: Golgi-dynamics membrane-trafficking
Probab=22.35 E-value=1.5e+02 Score=20.81 Aligned_cols=30 Identities=17% Similarity=0.241 Sum_probs=18.6
Q ss_pred eCCeEEEEEEeCCCCCCCCcEEEEEEEeCC
Q psy2974 129 ISEGLLVRFRSDDTVVGKGFSASYIAIDTQ 158 (186)
Q Consensus 129 ~~~~l~i~f~sd~~~~~~GF~~~y~~~~~~ 158 (186)
.+..|...|.+|.-.-+-|....|.....+
T Consensus 16 ~G~~l~WeFaTd~yDIgFG~~few~~~~s~ 45 (136)
T PF13897_consen 16 EGKCLFWEFATDSYDIGFGVYFEWTPPTSN 45 (136)
T ss_pred CCCEEEEEEeeCCCCceEEEEEEecCCCCC
Confidence 345677778887766666655555554433
No 44
>PF07754 DUF1610: Domain of unknown function (DUF1610); InterPro: IPR011668 This domain is found in archaeal species. It is likely to bind zinc via its four well-conserved cysteine residues.
Probab=21.76 E-value=41 Score=16.10 Aligned_cols=10 Identities=30% Similarity=0.773 Sum_probs=7.6
Q ss_pred ccCcCCCCCC
Q psy2974 7 RIEDKCNNCN 16 (186)
Q Consensus 7 ~~~~~C~~c~ 16 (186)
..+|.||+|-
T Consensus 14 ~v~f~CPnCG 23 (24)
T PF07754_consen 14 AVPFPCPNCG 23 (24)
T ss_pred CceEeCCCCC
Confidence 4678999874
No 45
>PTZ00214 high cysteine membrane protein Group 4; Provisional
Probab=20.82 E-value=1.3e+02 Score=27.83 Aligned_cols=46 Identities=17% Similarity=0.254 Sum_probs=28.2
Q ss_pred cEEEcCCCCCccccccccccccCcccccccC-CCceEEEecCCeeeeccccccc
Q psy2974 25 VFSKIGDSFKSQDKDECMTNNGGCQHECRNT-IGSYICSCHNGYTLLENGHDCK 77 (186)
Q Consensus 25 Gy~~~~d~~~C~d~~eC~~~~~~C~~~C~n~-~g~~~C~C~~G~~l~~~~~~C~ 77 (186)
+|.+.+++..|. .+|...... . .. .+...|.|..||....++..|.
T Consensus 654 ~~~ls~~~~~C~--~~C~~~~~~--~---~~~~~~~~C~C~~g~~p~~~~~~C~ 700 (800)
T PTZ00214 654 ELFVSLDGQSCL--EECTGDKVV--G---EVSGGVRRCWCERGFLPALDRSGCV 700 (800)
T ss_pred CceecCcCCchh--hcCCCcccc--c---cccCCcceeEecCCcccccCCCccc
Confidence 777778887775 355432100 0 01 1245799999999877777774
Done!