Query         psy11294
Match_columns 111
No_of_seqs    159 out of 807
Neff          6.0 
Searched_HMMs 46136
Date          Fri Aug 16 23:12:42 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy11294.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/11294hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 cd01328 FSL_SPARC Follistatin-  99.8 2.8E-20 6.1E-25  124.1   5.8   62   30-94      1-62  (86)
  2 smart00280 KAZAL Kazal type se  99.5   1E-14 2.2E-19   85.9   4.1   42   54-100     1-46  (46)
  3 cd01327 KAZAL_PSTI Kazal-type   99.5   4E-14 8.7E-19   83.8   4.1   39   57-100     3-45  (45)
  4 PF07648 Kazal_2:  Kazal-type s  99.3 1.7E-12 3.6E-17   74.2   2.3   32   69-100     7-42  (42)
  5 cd00104 KAZAL_FS Kazal type se  99.2 6.3E-12 1.4E-16   71.9   3.1   37   59-100     1-41  (41)
  6 PF00050 Kazal_1:  Kazal-type s  99.1 3.8E-11 8.3E-16   71.7   2.8   37   59-100     8-48  (48)
  7 KOG4004|consensus               99.1 4.7E-11   1E-15   91.1   3.8   75   26-102    48-132 (259)
  8 KOG3555|consensus               98.7 6.5E-09 1.4E-13   84.6   3.2   80   26-111    85-194 (434)
  9 KOG4578|consensus               98.3   4E-07 8.6E-12   74.0   2.0   43   55-102    34-79  (421)
 10 smart00057 FIMAC factor I memb  98.2 1.9E-06 4.1E-11   55.3   4.5   61   31-102     2-69  (69)
 11 cd01330 KAZAL_SLC21 The kazal-  97.5 6.8E-05 1.5E-09   46.0   2.2   18   69-86     17-34  (54)
 12 smart00274 FOLN Follistatin-N-  96.5  0.0017 3.6E-08   34.3   1.5   24   30-54      1-24  (26)
 13 PF09289 FOLN:  Follistatin/Ost  94.9  0.0091   2E-07   30.4   0.3   22   31-53      1-22  (22)
 14 TIGR00805 oat sodium-independe  94.0   0.033 7.2E-07   48.0   2.0   18   68-85    460-477 (633)
 15 cd01327 KAZAL_PSTI Kazal-type   93.1   0.027 5.8E-07   32.9   0.0   13   12-24     16-28  (45)
 16 KOG3626|consensus               88.8    0.28 6.1E-06   43.8   2.2   18   68-85    525-542 (735)
 17 KOG4597|consensus               78.9     2.1 4.6E-05   36.8   3.1   57   30-94    105-162 (560)
 18 PF00008 EGF:  EGF-like domain   64.9     2.7 5.8E-05   22.4   0.4   26   31-57      1-26  (32)
 19 KOG3509|consensus               48.0      27 0.00059   32.5   4.1   61   29-94    173-250 (964)
 20 KOG1219|consensus               41.8      31 0.00067   35.9   3.6   28   28-57   3903-3930(4289)
 21 cd00053 EGF Epidermal growth f  37.6      37  0.0008   16.7   2.1   22   33-56      5-26  (36)
 22 smart00179 EGF_CA Calcium-bind  32.4      55  0.0012   16.7   2.3   26   30-57      4-30  (39)
 23 smart00181 EGF Epidermal growt  28.5      65  0.0014   16.3   2.1   21   34-57      6-26  (35)
 24 PF12946 EGF_MSP1_1:  MSP1 EGF   26.5      21 0.00045   20.2  -0.1   25   31-56      2-26  (37)
 25 cd00054 EGF_CA Calcium-binding  24.4      95  0.0021   15.4   2.3   26   30-57      4-30  (38)
 26 PF00954 S_locus_glycop:  S-loc  22.5      87  0.0019   20.6   2.4   28   27-57     76-104 (110)

No 1  
>cd01328 FSL_SPARC Follistatin-like SPARC (secreted protein, acidic, and rich in cysteines) domain; SPARC/BM-40/osteonectin is a multifunctional glycoprotein which modulates cellular interaction with the extracellular matrix by its binding to structural matrix proteins such as collagen and vitronectin. The protein it composed of an N-terminal acidic region, a follistatin (FS) domain and an EF-hand calcium binding domain. The FS domain consists of an N-terminal beta hairpin (FOLN/EGF-like domain) and a small hydrophobic core of alpha/beta structure (Kazal domain) and has five disulfide bonds and a conserved N-glycosylation site. The FSL_SPARC domain is a member of the superfamily of kazal-like proteinase inhibitors and follistatin-like proteins.
Probab=99.81  E-value=2.8e-20  Score=124.06  Aligned_cols=62  Identities=31%  Similarity=0.726  Sum_probs=55.2

Q ss_pred             CCCCCCCCCCCeeccCCCCCCccCcCCCCCCCCCCCCCCCCeecCCCceecCHhHHHHHHhhcCC
Q psy11294         30 PCQENTCQYGAKCIPSEDGTSYKCECPQECPNYGDHTGSRPVCGSNGVDYKDLCEFQRASCSTKM   94 (111)
Q Consensus        30 pC~~~~C~~g~~C~~~~~g~~~~C~C~~~C~~~~d~~~~~pVCGSDG~TY~n~C~L~~aaC~~~~   94 (111)
                      ||+++.|+.|++|+++. .++|+|+|+..||...+  +.++||||||+||+|+|+|++++|..++
T Consensus         1 pC~~v~C~~G~~C~~d~-~~~p~CvC~~~Cp~~~~--~~~~VCGsDg~TY~s~C~L~r~~C~~~~   62 (86)
T cd01328           1 PCENHHCGAGKVCEVDD-ENTPKCVCIDPCPEEVD--DRRKVCTNDNETFDSDCELYRTRCLCKG   62 (86)
T ss_pred             CCCCcCCCCCCEeeECC-CCCeEEecCCcCCCCCC--CCCceeCCCCCCcccHhHHhhhHhccCC
Confidence            79999999999999975 45799999999999763  3478999999999999999999998664


No 2  
>smart00280 KAZAL Kazal type serine protease inhibitors. Kazal type serine protease inhibitors and follistatin-like domains.
Probab=99.53  E-value=1e-14  Score=85.88  Aligned_cols=42  Identities=38%  Similarity=0.903  Sum_probs=35.9

Q ss_pred             cCCCCCCCCCCCCCCCCeecCCCceecCHhHHHHHHhhcCC----CccccC
Q psy11294         54 ECPQECPNYGDHTGSRPVCGSNGVDYKDLCEFQRASCSTKM----LSRGFL  100 (111)
Q Consensus        54 ~C~~~C~~~~d~~~~~pVCGSDG~TY~n~C~L~~aaC~~~~----~~~G~C  100 (111)
                      +|+..|+..+     .|||||||+||.|+|+|++++|..+.    .+.|+|
T Consensus         1 ~C~~~C~~~~-----~pVCgsdg~TY~N~C~l~~~~c~~~~~i~~~~~G~C   46 (46)
T smart00280        1 DCPEACPREY-----DPVCGSDGVTYSNECHLCKAACESGKSIEVKHDGPC   46 (46)
T ss_pred             CCCCCCCCCC-----CccCCCCCCEeCCHhHHHHHHhcCCCCeEEeecCCC
Confidence            3778899887     89999999999999999999998554    466766


No 3  
>cd01327 KAZAL_PSTI Kazal-type pancreatic secretory trypsin inhibitors (PSTI) and related proteins, including the second domain of the ovomucoid turkey inhibitor and the C-terminal domain of the esophagus cancer-related gene-2 protein (ECRG-2), are members of the superfamily of kazal-type proteinase inhibitors and follistatin-like proteins.
Probab=99.48  E-value=4e-14  Score=83.80  Aligned_cols=39  Identities=28%  Similarity=0.565  Sum_probs=33.8

Q ss_pred             CCCCCCCCCCCCCCeecCCCceecCHhHHHHHHhhcCC----CccccC
Q psy11294         57 QECPNYGDHTGSRPVCGSNGVDYKDLCEFQRASCSTKM----LSRGFL  100 (111)
Q Consensus        57 ~~C~~~~d~~~~~pVCGSDG~TY~n~C~L~~aaC~~~~----~~~G~C  100 (111)
                      ..||..+     .|||||||+||.|+|+|+.++|..+.    +++|+|
T Consensus         3 ~~Cp~~~-----~PVCGsDg~TY~N~C~l~~~~c~~~~~i~~~~~G~C   45 (45)
T cd01327           3 FGCPKDY-----DPVCGTDGVTYSNECLLCAENLKRQTNIRIKHDGEC   45 (45)
T ss_pred             CCCCCCC-----CceeCCCCCEeCCHhHHHHHHhccCCCeEEeecCCC
Confidence            4678887     79999999999999999999998776    567776


No 4  
>PF07648 Kazal_2:  Kazal-type serine protease inhibitor domain;  InterPro: IPR011497 This domain is usually indicative of serine protease inhibitors that belong to Merops inhibitor families: I1, I2, I17 and I31. However, kazal-like domains are also seen in the extracellular part of agrins, which are not known to be protease inhibitors. Kazal domains often occur in tandem arrays and have a central alpha-helix, a short two-stranded antiparallel beta-sheet and several disulphide bonds [, , ]. The amino terminal segment of this domain binds to the active site of its target proteases, thus inhibiting their function.; PDB: 2KMP_A 1LDT_L 2KMR_A 2KMO_A 2KMQ_A 1AN1_I 4E0S_B 3T5O_A 4A5W_B 1LR7_A ....
Probab=99.28  E-value=1.7e-12  Score=74.20  Aligned_cols=32  Identities=41%  Similarity=0.854  Sum_probs=28.0

Q ss_pred             CCeecCCCceecCHhHHHHHHhhcCC----CccccC
Q psy11294         69 RPVCGSNGVDYKDLCEFQRASCSTKM----LSRGFL  100 (111)
Q Consensus        69 ~pVCGSDG~TY~n~C~L~~aaC~~~~----~~~G~C  100 (111)
                      .||||+||+||.|+|+|++++|..+.    .++|+|
T Consensus         7 ~PVCg~dg~ty~n~C~l~~~~c~~~~~~~~~~~g~C   42 (42)
T PF07648_consen    7 SPVCGSDGKTYSNECELRCANCRTNSKIQIVHDGSC   42 (42)
T ss_dssp             STEEETTSEEESSHHHHHHHHHHHTTTCEEEEESSS
T ss_pred             CCEEcCCCCEEhhHHHHHHHHHhcCCCeEEEeCCCC
Confidence            49999999999999999999999765    456766


No 5  
>cd00104 KAZAL_FS Kazal type serine protease inhibitors and follistatin-like domains. Kazal inhibitors inhibit serine proteases, such as, trypsin, chyomotrypsin, avian ovomucoids, and elastases. The inhibitory domain has one reactive site peptide bond, which serves the cognate enzyme as substrate. The reactive site peptide bond is a combining loop which has an identical conformation in all Kazal inhibitors and in all enzyme/inhibitor complexes. These Kazal domains (small hydrophobic core of alpha/beta structure with 3 to 4 disulfide bonds) often occur in tandem arrays. Similar domains are also present in follistatin (FS) and follistatin-like family members, which play an important role in tissue specific regulation. The FS domain consists of an N-terminal beta hairpin (FOLN/EGF-like domain) and a Kazal-like domain and has five disulfide bonds. Although the  Kazal-like FS substructure is similar to Kazal proteinase inhibitors, no FS domain has yet been shown to be a proteinase inhibitor.
Probab=99.24  E-value=6.3e-12  Score=71.90  Aligned_cols=37  Identities=35%  Similarity=0.748  Sum_probs=30.1

Q ss_pred             CCCCCCCCCCCCeecCCCceecCHhHHHHHHhhcCC----CccccC
Q psy11294         59 CPNYGDHTGSRPVCGSNGVDYKDLCEFQRASCSTKM----LSRGFL  100 (111)
Q Consensus        59 C~~~~d~~~~~pVCGSDG~TY~n~C~L~~aaC~~~~----~~~G~C  100 (111)
                      ||..+     .||||+||+||.|+|+|+.++|..+.    .+.|+|
T Consensus         1 C~~~~-----~pVCgsdg~tY~n~C~~~~~~c~~~~~~~~~~~g~C   41 (41)
T cd00104           1 CPKEY-----DPVCGSDGKTYSNECHLGCAACRSGRSITVAHNGPC   41 (41)
T ss_pred             CCCCC-----CccCCCCCCEECCHhhhhHHhhcCCCCeEEEeccCC
Confidence            55555     79999999999999999999998554    455655


No 6  
>PF00050 Kazal_1:  Kazal-type serine protease inhibitor domain;  InterPro: IPR002350 Peptide proteinase inhibitors can be found as single domain proteins or as single or multiple domains within proteins; these are referred to as either simple or compound inhibitors, respectively. In many cases they are synthesised as part of a larger precursor protein, either as a prepropeptide or as an N-terminal domain associated with an inactive peptidase or zymogen. This domain prevents access of the substrate to the active site. Removal of the N-terminal inhibitor domain either by interaction with a second peptidase or by autocatalytic cleavage activates the zymogen. Other inhibitors interact direct with proteinases using a simple noncovalent lock and key mechanism; while yet others use a conformational change-based trapping mechanism that depends on their structural and thermodynamic properties.  This family of Kazal inhibitors, belongs to MEROPS inhibitor family I1, clan IA. They inhibit serine peptidases of the S1 family (IPR001254 from INTERPRO) []. The members are primarily metazoan, but includes exceptions in the alveolata (apicomplexa), stramenopiles, higher plants and bacteria. Kazal inhibitors, which inhibit a number of serine proteases (such as trypsin and elastase), belong to family of proteins that includes pancreatic secretory trypsin inhibitor; avian ovomucoid; acrosin inhibitor; and elastase inhibitor. These proteins contain between 1 and 7 Kazal-type inhibitor repeats [, ]. The structure of the Kazal repeat includes a large quantity of extended chain, 2 short alpha-helices and a 3-stranded anti-parallel beta sheet [].The inhibitor makes 11 contacts with its enzyme substrate: unusually, 8 of these important residues are hypervariable []. Altering the enzyme-contact residues, and especially that of the active site bond, affects the the strength of inhibition and specificity of the inhibitor for particular serine proteases [, ]. The presence of this Pfam domain is usually indicative of serine protease inhibitors, however, Kazal-like domains are also seen in the extracellular part of agrins which are not known to be proteinase inhibitors.; GO: 0005515 protein binding; PDB: 1HPT_A 1CGI_I 1CGJ_I 2P6A_D 3HH2_C 2B0U_D 2BUS_A 1BUS_A 1OVO_A 3OVO_A ....
Probab=99.12  E-value=3.8e-11  Score=71.66  Aligned_cols=37  Identities=38%  Similarity=0.692  Sum_probs=25.6

Q ss_pred             CCCCCCCCCCCCeecCCCceecCHhHHHHHHhhcCC----CccccC
Q psy11294         59 CPNYGDHTGSRPVCGSNGVDYKDLCEFQRASCSTKM----LSRGFL  100 (111)
Q Consensus        59 C~~~~d~~~~~pVCGSDG~TY~n~C~L~~aaC~~~~----~~~G~C  100 (111)
                      |+..+     .|||||||+||.|+|+|-.+.-+.+.    ++.|+|
T Consensus         8 C~~~~-----~PVCGsdg~TY~N~C~lC~~~~~~~~~i~~~~~G~C   48 (48)
T PF00050_consen    8 CPREY-----DPVCGSDGVTYSNECELCAANRRSGKNIRIAHDGPC   48 (48)
T ss_dssp             EESSC-----S-EEETTSEEESSHHHHHHHHHHTTTSS-EEEESS-
T ss_pred             CCCCC-----CceECCCCCCccccchhhhhhcccCCCeEEeecCCC
Confidence            66665     89999999999999999323334443    467776


No 7  
>KOG4004|consensus
Probab=99.12  E-value=4.7e-11  Score=91.09  Aligned_cols=75  Identities=27%  Similarity=0.693  Sum_probs=61.6

Q ss_pred             CCCCCCCCCCCCCCCeeccCCCCCCccCcCCC-CCCCCCCCCCCCCeecCCCceecCHhHHHHHHhhcCCC---------
Q psy11294         26 ELTNPCQENTCQYGAKCIPSEDGTSYKCECPQ-ECPNYGDHTGSRPVCGSNGVDYKDLCEFQRASCSTKML---------   95 (111)
Q Consensus        26 ~~~~pC~~~~C~~g~~C~~~~~g~~~~C~C~~-~C~~~~d~~~~~pVCGSDG~TY~n~C~L~~aaC~~~~~---------   95 (111)
                      +.++||++..|++|..|+++..+ .|+|+|.. .||... ..+...|||.|++||.|.|+|++.+|.....         
T Consensus        48 ~a~npC~dh~Cg~gk~C~vd~~~-~P~Cvc~~~kCP~~~-~~p~~KVC~nnNqTf~S~C~~~~~~C~~~~~~k~~k~hl~  125 (259)
T KOG4004|consen   48 EARNPCADHKCGPGKNCLVDLQT-QPRCVCCRYKCPRKQ-QRPVHKVCGNNNQTFNSWCEMHKHSCESRYFIKVHKLHLK  125 (259)
T ss_pred             cccCccccccCCCCceeeecCCC-CceeEEecCCCCccc-CCchhhhhcCCCcchhHHHHHHHhhhhhhcccccceeehh
Confidence            45899999999999999998665 79999885 999842 1123569999999999999999999998763         


Q ss_pred             ccccCCC
Q psy11294         96 SRGFLLD  102 (111)
Q Consensus        96 ~~G~C~~  102 (111)
                      +-|+|+-
T Consensus       126 y~G~Ck~  132 (259)
T KOG4004|consen  126 YQGSCKY  132 (259)
T ss_pred             ccccccc
Confidence            4567764


No 8  
>KOG3555|consensus
Probab=98.74  E-value=6.5e-09  Score=84.58  Aligned_cols=80  Identities=26%  Similarity=0.563  Sum_probs=62.7

Q ss_pred             CCCCCCCCCCCCCCCeeccCCCCCCccCc--------------------------CCCCCCCCCCCCCCCCeecCCCcee
Q psy11294         26 ELTNPCQENTCQYGAKCIPSEDGTSYKCE--------------------------CPQECPNYGDHTGSRPVCGSNGVDY   79 (111)
Q Consensus        26 ~~~~pC~~~~C~~g~~C~~~~~g~~~~C~--------------------------C~~~C~~~~d~~~~~pVCGSDG~TY   79 (111)
                      ...+||..+.|.+..+|+.. +.+++.|+                          |. .||...    ..+||||||.||
T Consensus        85 ~~kdpc~kvkcs~hkvci~Q-d~Q~A~cis~k~l~~r~k~a~v~~~q~~d~~l~~CK-pCpvaq----~a~vCGsDghtY  158 (434)
T KOG3555|consen   85 PIKDPCLKVKCSRHKVCIAQ-DYQTAGCISRKQLQHRQKAAGVSVIQWDDPELDNCK-PCPVAQ----PAFVCGSDGHTY  158 (434)
T ss_pred             cccChHhhhcccccceeecc-ccchhhhHHHHHHhhhccCCCcceecccCcccccCc-cCCccc----ccceecCCCCee
Confidence            45789999999999999874 33344442                          32 688776    488999999999


Q ss_pred             cCHhHHHHHHhhcCC----CccccCCCccCCCCCCC
Q psy11294         80 KDLCEFQRASCSTKM----LSRGFLLDIWIPPSPKH  111 (111)
Q Consensus        80 ~n~C~L~~aaC~~~~----~~~G~C~~i~~pp~~~~  111 (111)
                      ++.|.|...||...+    ..+|+|--...++.|+|
T Consensus       159 ss~ckLe~~aC~~sksiav~c~g~cpcps~~d~~~~  194 (434)
T KOG3555|consen  159 SSRCKLEYHACHVSKSIAVICEGPCPCPSEADKPVH  194 (434)
T ss_pred             hhhhhHHHHhhhhhhhhhhhhCCCCCCCCCCCcchh
Confidence            999999999999887    46888876666666654


No 9  
>KOG4578|consensus
Probab=98.26  E-value=4e-07  Score=73.96  Aligned_cols=43  Identities=40%  Similarity=0.792  Sum_probs=37.4

Q ss_pred             CCCCCCCCCCCCCCCCeecCCCceecCHhHHHHHHhhcCC---CccccCCC
Q psy11294         55 CPQECPNYGDHTGSRPVCGSNGVDYKDLCEFQRASCSTKM---LSRGFLLD  102 (111)
Q Consensus        55 C~~~C~~~~d~~~~~pVCGSDG~TY~n~C~L~~aaC~~~~---~~~G~C~~  102 (111)
                      |+..|....     .|||||||+||.+.|+|++|.|...+   +++|.|+.
T Consensus        34 c~l~C~~~~-----kPvCasDGrty~srCe~qRAkC~dpql~~~yrG~Ck~   79 (421)
T KOG4578|consen   34 CQLECDDNE-----KPVCASDGRTYPSRCELQRAKCGDPQLSLKYRGSCKA   79 (421)
T ss_pred             hccccCCCC-----CCccccCCccchhHHHHHHhhcCCCceeEEecCcHHH
Confidence            777777764     89999999999999999999999887   57888864


No 10 
>smart00057 FIMAC factor I membrane attack complex.
Probab=98.24  E-value=1.9e-06  Score=55.32  Aligned_cols=61  Identities=30%  Similarity=0.585  Sum_probs=51.5

Q ss_pred             CCCCCCCCCCeeccCCCCCCccCcCC--CCCCCCCCCCCCCCeecCCCcee--cCHhHHHHHHhhcCC---CccccCCC
Q psy11294         31 CQENTCQYGAKCIPSEDGTSYKCECP--QECPNYGDHTGSRPVCGSNGVDY--KDLCEFQRASCSTKM---LSRGFLLD  102 (111)
Q Consensus        31 C~~~~C~~g~~C~~~~~g~~~~C~C~--~~C~~~~d~~~~~pVCGSDG~TY--~n~C~L~~aaC~~~~---~~~G~C~~  102 (111)
                      |..+.|.+|+.|+.      .+|+|.  .+||...     ..||..|+..|  .|+|+|....|+.++   .+.|+|.+
T Consensus         2 c~c~~C~pWekc~~------~~CvCk~P~qC~~~~-----~~vCv~~~~~~~t~S~C~~~a~~C~g~~~~~~~~g~C~~   69 (69)
T smart00057        2 CAKVFCQPWQKCSA------STCVCKLPYECPKAG-----TDVCVEDGRSEKTLTYCKQKSLECLNPKYKFLHIGSCTA   69 (69)
T ss_pred             CcCccCCCcccccC------CeeEeCCHhHCCCCC-----CCeeEecCceeeeecHHHHHHHHhcCCCcEEeccCCCCC
Confidence            56678999999986      459988  6899864     58999999999  999999999999876   57888863


No 11 
>cd01330 KAZAL_SLC21 The kazal-type serine protease inhibitor domain has been detected in an extracellular loop region of solute carrier 21 (SLC21) family members (organic anion transporters) , which may regulate the specificity of anion uptake. The KAZAL_SLC21 domain is a member of the superfamily of kazal-like proteinase inhibitors and follistatin-like proteins.
Probab=97.51  E-value=6.8e-05  Score=45.97  Aligned_cols=18  Identities=44%  Similarity=0.997  Sum_probs=16.7

Q ss_pred             CCeecCCCceecCHhHHH
Q psy11294         69 RPVCGSNGVDYKDLCEFQ   86 (111)
Q Consensus        69 ~pVCGSDG~TY~n~C~L~   86 (111)
                      .||||+||+||.|.|+..
T Consensus        17 ~PVCg~~g~tY~SpC~AG   34 (54)
T cd01330          17 SPVCGENGITYFSPCHAG   34 (54)
T ss_pred             CCccCCCCCEEcCHhHcC
Confidence            799999999999999865


No 12 
>smart00274 FOLN Follistatin-N-terminal domain-like. Follistatin-N-terminal domain-like, EGF-like. Region distinct from the kazal-like sequence
Probab=96.49  E-value=0.0017  Score=34.28  Aligned_cols=24  Identities=21%  Similarity=0.798  Sum_probs=20.0

Q ss_pred             CCCCCCCCCCCeeccCCCCCCccCc
Q psy11294         30 PCQENTCQYGAKCIPSEDGTSYKCE   54 (111)
Q Consensus        30 pC~~~~C~~g~~C~~~~~g~~~~C~   54 (111)
                      +|+++.|++|..|++++. ++|+|+
T Consensus         1 ~C~~v~C~~G~~C~~d~~-g~p~Cv   24 (26)
T smart00274        1 SCRNVQCPFGKVCVVDKG-GNARCV   24 (26)
T ss_pred             CCCCEECCCCCEEEeCCC-CCEEEe
Confidence            589999999999999844 468775


No 13 
>PF09289 FOLN:  Follistatin/Osteonectin-like EGF domain;  InterPro: IPR015369 This domain is predominantly found in osteonectin and follistatin. They adopt an EGF-like structure [, ]. Follistatin is involved in diverse activities from embryonic development to cell secretion. ; GO: 0005515 protein binding; PDB: 1LR7_A 1LR8_A 1LR9_A 2ARP_F 3B4V_H 2KCX_A 3SEK_C 2P6A_D 3HH2_C 2B0U_D ....
Probab=94.88  E-value=0.0091  Score=30.41  Aligned_cols=22  Identities=32%  Similarity=0.816  Sum_probs=14.9

Q ss_pred             CCCCCCCCCCeeccCCCCCCccC
Q psy11294         31 CQENTCQYGAKCIPSEDGTSYKC   53 (111)
Q Consensus        31 C~~~~C~~g~~C~~~~~g~~~~C   53 (111)
                      |.++.|++|.+|.++.++ +|.|
T Consensus         1 C~n~~Ck~GKvC~~d~~~-~P~C   22 (22)
T PF09289_consen    1 CDNFHCKRGKVCKVDEQG-KPHC   22 (22)
T ss_dssp             STT---BTTEEEEEETTT-CEEE
T ss_pred             CCCcccCCCCEeeeCCCC-CcCC
Confidence            788999999999997654 5654


No 14 
>TIGR00805 oat sodium-independent organic anion transporter. Proteins of the OAT family catalyze the Na+-independent facilitated transport of organic anions such as bromosulfobromophthalein and prostaglandins as well as conjugated and unconjugated bile acids (taurocholate and cholate, respectively). These transporters have been characterized in mammals, but homologues are present in C. elegans and A. thaliana. Some of the mammalian proteins exhibit a high degree of tissue specificity. For example, the rat OAT is found at high levels in liver and kidney and at lower levels in other tissues. These proteins possess 10-12 putative a-helical transmembrane spanners. They may catalyze electrogenic anion uniport or anion exchange.
Probab=93.97  E-value=0.033  Score=47.99  Aligned_cols=18  Identities=44%  Similarity=0.960  Sum_probs=16.3

Q ss_pred             CCCeecCCCceecCHhHH
Q psy11294         68 SRPVCGSNGVDYKDLCEF   85 (111)
Q Consensus        68 ~~pVCGSDG~TY~n~C~L   85 (111)
                      ..||||.||+||-|.|+-
T Consensus       460 ~~PVCg~~~~tY~SpC~A  477 (633)
T TIGR00805       460 FDPVCGDNGLAYLSPCHA  477 (633)
T ss_pred             cccccCCCCCEEECcccc
Confidence            389999999999999974


No 15 
>cd01327 KAZAL_PSTI Kazal-type pancreatic secretory trypsin inhibitors (PSTI) and related proteins, including the second domain of the ovomucoid turkey inhibitor and the C-terminal domain of the esophagus cancer-related gene-2 protein (ECRG-2), are members of the superfamily of kazal-type proteinase inhibitors and follistatin-like proteins.
Probab=93.13  E-value=0.027  Score=32.91  Aligned_cols=13  Identities=8%  Similarity=0.225  Sum_probs=11.4

Q ss_pred             ecceeCCCcccCC
Q psy11294         12 LILDFLYACYIFP   24 (111)
Q Consensus        12 ~g~TY~NeC~L~~   24 (111)
                      +|+||.|+|+|+.
T Consensus        16 Dg~TY~N~C~l~~   28 (45)
T cd01327          16 DGVTYSNECLLCA   28 (45)
T ss_pred             CCCEeCCHhHHHH
Confidence            6999999999963


No 16 
>KOG3626|consensus
Probab=88.83  E-value=0.28  Score=43.80  Aligned_cols=18  Identities=44%  Similarity=0.992  Sum_probs=16.2

Q ss_pred             CCCeecCCCceecCHhHH
Q psy11294         68 SRPVCGSNGVDYKDLCEF   85 (111)
Q Consensus        68 ~~pVCGSDG~TY~n~C~L   85 (111)
                      -.||||.||.||-|.|+-
T Consensus       525 ~~PVCg~~G~tY~SpChA  542 (735)
T KOG3626|consen  525 YEPVCGENGITYFSPCHA  542 (735)
T ss_pred             cCcccCCCCCEEeChhhh
Confidence            389999999999999973


No 17 
>KOG4597|consensus
Probab=78.87  E-value=2.1  Score=36.85  Aligned_cols=57  Identities=21%  Similarity=0.632  Sum_probs=44.3

Q ss_pred             CCCCCCCC-CCCeeccCCCCCCccCcCCCCCCCCCCCCCCCCeecCCCceecCHhHHHHHHhhcCC
Q psy11294         30 PCQENTCQ-YGAKCIPSEDGTSYKCECPQECPNYGDHTGSRPVCGSNGVDYKDLCEFQRASCSTKM   94 (111)
Q Consensus        30 pC~~~~C~-~g~~C~~~~~g~~~~C~C~~~C~~~~d~~~~~pVCGSDG~TY~n~C~L~~aaC~~~~   94 (111)
                      .|+...|. .+.+|..-.  +.++|.|...|..+.     .--|.+||-+|.| |.|-..+|..+.
T Consensus       105 ~~~~~~~~qq~s~~dif~--~~~r~~~~~~~~~eP-----~~~~~d~~~k~~n-~t~cs~aCgKG~  162 (560)
T KOG4597|consen  105 LCAQFPCSQQGSVCDIFD--GQPRCTCIDRCEKEP-----SFTCADDGLKYYN-CTMCSEACGKGV  162 (560)
T ss_pred             hhccCccccccccccccC--CCCCcccccccccCC-----chhhhhcCceecc-eEehhhhhcCCc
Confidence            46666665 455666542  248899999898875     5679999999999 999999998875


No 18 
>PF00008 EGF:  EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry;  InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=64.91  E-value=2.7  Score=22.43  Aligned_cols=26  Identities=50%  Similarity=1.227  Sum_probs=19.2

Q ss_pred             CCCCCCCCCCeeccCCCCCCccCcCCC
Q psy11294         31 CQENTCQYGAKCIPSEDGTSYKCECPQ   57 (111)
Q Consensus        31 C~~~~C~~g~~C~~~~~g~~~~C~C~~   57 (111)
                      |....|..++.|+... .+...|+|+.
T Consensus         1 C~~~~C~n~g~C~~~~-~~~y~C~C~~   26 (32)
T PF00008_consen    1 CSSNPCQNGGTCIDLP-GGGYTCECPP   26 (32)
T ss_dssp             TTTTSSTTTEEEEEES-TSEEEEEEBT
T ss_pred             CCCCcCCCCeEEEeCC-CCCEEeECCC
Confidence            4455788899999864 3468899884


No 19 
>KOG3509|consensus
Probab=47.96  E-value=27  Score=32.48  Aligned_cols=61  Identities=21%  Similarity=0.261  Sum_probs=44.6

Q ss_pred             CCCCCCCCCCCCeeccCCCCC-----------------CccCcCCCCCCCCCCCCCCCCeecCCCceecCHhHHHHHHhh
Q psy11294         29 NPCQENTCQYGAKCIPSEDGT-----------------SYKCECPQECPNYGDHTGSRPVCGSNGVDYKDLCEFQRASCS   91 (111)
Q Consensus        29 ~pC~~~~C~~g~~C~~~~~g~-----------------~~~C~C~~~C~~~~d~~~~~pVCGSDG~TY~n~C~L~~aaC~   91 (111)
                      -+|....|.+|..|.+..+-+                 .+.|.|+..+..+.     .++|..+|++|..++-++..++.
T Consensus       173 ~~~~~~~~q~g~tC~~~~~~~~~~~~~~~~~~~~c~~~~~r~~~~f~~~~~g-----~~~~~~~~vp~~~e~S~~~~~~~  247 (964)
T KOG3509|consen  173 IHCAQPVCQGGATCEVRNGKGYSLECPDCKVRVVCEACKPRAFCPFEKSVEG-----CLKCFCFGVPRPSESSLHAFRAI  247 (964)
T ss_pred             ccccCcccccceeEEecCCcceeeeccccccceehhhccCceeccccccccc-----ccceeecCCCccccchhhhHhhh
Confidence            358888999999999864311                 12233444444443     78999999999999999998888


Q ss_pred             cCC
Q psy11294         92 TKM   94 (111)
Q Consensus        92 ~~~   94 (111)
                      .+.
T Consensus       248 h~~  250 (964)
T KOG3509|consen  248 HGA  250 (964)
T ss_pred             ccc
Confidence            765


No 20 
>KOG1219|consensus
Probab=41.77  E-value=31  Score=35.93  Aligned_cols=28  Identities=39%  Similarity=1.088  Sum_probs=21.7

Q ss_pred             CCCCCCCCCCCCCeeccCCCCCCccCcCCC
Q psy11294         28 TNPCQENTCQYGAKCIPSEDGTSYKCECPQ   57 (111)
Q Consensus        28 ~~pC~~~~C~~g~~C~~~~~g~~~~C~C~~   57 (111)
                      ..||...+|..|+.|++..  +...|.|+.
T Consensus      3903 ~epC~snPC~~GgtCip~~--n~f~CnC~~ 3930 (4289)
T KOG1219|consen 3903 LEPCASNPCLTGGTCIPFY--NGFLCNCPN 3930 (4289)
T ss_pred             cccccCCCCCCCCEEEecC--CCeeEeCCC
Confidence            4688888899999999864  347888873


No 21 
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at  least  one  is  present  in  most EGF-like domains; a subset of these bind calcium.
Probab=37.64  E-value=37  Score=16.67  Aligned_cols=22  Identities=41%  Similarity=1.159  Sum_probs=15.4

Q ss_pred             CCCCCCCCeeccCCCCCCccCcCC
Q psy11294         33 ENTCQYGAKCIPSEDGTSYKCECP   56 (111)
Q Consensus        33 ~~~C~~g~~C~~~~~g~~~~C~C~   56 (111)
                      ...|..++.|+...  +...|.|+
T Consensus         5 ~~~C~~~~~C~~~~--~~~~C~C~   26 (36)
T cd00053           5 SNPCSNGGTCVNTP--GSYRCVCP   26 (36)
T ss_pred             CCCCCCCCEEecCC--CCeEeECC
Confidence            44677788998754  34788877


No 22 
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=32.42  E-value=55  Score=16.71  Aligned_cols=26  Identities=50%  Similarity=1.299  Sum_probs=15.8

Q ss_pred             CCCC-CCCCCCCeeccCCCCCCccCcCCC
Q psy11294         30 PCQE-NTCQYGAKCIPSEDGTSYKCECPQ   57 (111)
Q Consensus        30 pC~~-~~C~~g~~C~~~~~g~~~~C~C~~   57 (111)
                      .|.. ..|..++.|+... + ...|.|+.
T Consensus         4 ~C~~~~~C~~~~~C~~~~-g-~~~C~C~~   30 (39)
T smart00179        4 ECASGNPCQNGGTCVNTV-G-SYRCECPP   30 (39)
T ss_pred             cCcCCCCcCCCCEeECCC-C-CeEeECCC
Confidence            3444 4567777898642 2 46787763


No 23 
>smart00181 EGF Epidermal growth factor-like domain.
Probab=28.45  E-value=65  Score=16.31  Aligned_cols=21  Identities=43%  Similarity=1.135  Sum_probs=14.5

Q ss_pred             CCCCCCCeeccCCCCCCccCcCCC
Q psy11294         34 NTCQYGAKCIPSEDGTSYKCECPQ   57 (111)
Q Consensus        34 ~~C~~g~~C~~~~~g~~~~C~C~~   57 (111)
                      ..|..+ .|+..  .+...|.|+.
T Consensus         6 ~~C~~~-~C~~~--~~~~~C~C~~   26 (35)
T smart00181        6 GPCSNG-TCINT--PGSYTCSCPP   26 (35)
T ss_pred             CCCCCC-EEECC--CCCeEeECCC
Confidence            467778 89875  3457887774


No 24 
>PF12946 EGF_MSP1_1:  MSP1 EGF domain 1;  InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=26.53  E-value=21  Score=20.22  Aligned_cols=25  Identities=36%  Similarity=0.922  Sum_probs=16.1

Q ss_pred             CCCCCCCCCCeeccCCCCCCccCcCC
Q psy11294         31 CQENTCQYGAKCIPSEDGTSYKCECP   56 (111)
Q Consensus        31 C~~~~C~~g~~C~~~~~g~~~~C~C~   56 (111)
                      |.+..|+..+.|+...+| +..|.|.
T Consensus         2 C~~~~cP~NA~C~~~~dG-~eecrCl   26 (37)
T PF12946_consen    2 CIDTKCPANAGCFRYDDG-SEECRCL   26 (37)
T ss_dssp             -SSS---TTEEEEEETTS-EEEEEE-
T ss_pred             ccCccCCCCcccEEcCCC-CEEEEee
Confidence            778899999999987555 6778876


No 25 
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=24.40  E-value=95  Score=15.37  Aligned_cols=26  Identities=42%  Similarity=1.158  Sum_probs=15.5

Q ss_pred             CCCC-CCCCCCCeeccCCCCCCccCcCCC
Q psy11294         30 PCQE-NTCQYGAKCIPSEDGTSYKCECPQ   57 (111)
Q Consensus        30 pC~~-~~C~~g~~C~~~~~g~~~~C~C~~   57 (111)
                      .|.. ..|..++.|+...  +...|.|+.
T Consensus         4 ~C~~~~~C~~~~~C~~~~--~~~~C~C~~   30 (38)
T cd00054           4 ECASGNPCQNGGTCVNTV--GSYRCSCPP   30 (38)
T ss_pred             cCCCCCCcCCCCEeECCC--CCeEeECCC
Confidence            4444 4566677887643  236687763


No 26 
>PF00954 S_locus_glycop:  S-locus glycoprotein family;  InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=22.53  E-value=87  Score=20.61  Aligned_cols=28  Identities=25%  Similarity=0.574  Sum_probs=21.3

Q ss_pred             CCCCCCC-CCCCCCCeeccCCCCCCccCcCCC
Q psy11294         27 LTNPCQE-NTCQYGAKCIPSEDGTSYKCECPQ   57 (111)
Q Consensus        27 ~~~pC~~-~~C~~g~~C~~~~~g~~~~C~C~~   57 (111)
                      ..+.|+. ..|+..+.|..+   ..+.|.|..
T Consensus        76 p~d~Cd~y~~CG~~g~C~~~---~~~~C~Cl~  104 (110)
T PF00954_consen   76 PKDQCDVYGFCGPNGICNSN---NSPKCSCLP  104 (110)
T ss_pred             cccCCCCccccCCccEeCCC---CCCceECCC
Confidence            4567886 689999999663   357899884


Done!