Query         psy8617
Match_columns 184
No_of_seqs    216 out of 1217
Neff          8.9 
Searched_HMMs 46136
Date          Fri Aug 16 23:23:44 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy8617.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/8617hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 cd00031 CA Cadherin repeat dom 100.0 1.4E-31   3E-36  202.7  22.4  159    1-182     9-169 (199)
  2 KOG4289|consensus              100.0 1.9E-32 4.1E-37  243.2  17.8  158    2-182   282-441 (2531)
  3 KOG4289|consensus              100.0 4.9E-30 1.1E-34  228.1  13.7  158    2-181   796-955 (2531)
  4 KOG1219|consensus              100.0 1.7E-28 3.7E-33  224.1  18.0  158    2-182   966-1125(4289)
  5 KOG1219|consensus              100.0 4.8E-27   1E-31  214.8  18.5  158    2-182   653-810 (4289)
  6 PF00028 Cadherin:  Cadherin do  99.8 6.5E-19 1.4E-23  118.2  13.0   84    2-108     9-93  (93)
  7 smart00112 CA Cadherin repeats  99.8   3E-19 6.6E-24  116.3  10.9   79   14-115     1-79  (79)
  8 KOG1834|consensus               99.6 1.4E-14   3E-19  123.0  13.4  129   49-179    73-211 (952)
  9 cd00031 CA Cadherin repeat dom  99.5 7.2E-13 1.6E-17  100.0  12.7   85    2-109   115-199 (199)
 10 PF00028 Cadherin:  Cadherin do  99.4 1.3E-12 2.9E-17   87.4   8.7   62  122-183     1-64  (93)
 11 smart00112 CA Cadherin repeats  98.1 6.8E-06 1.5E-10   52.8   4.6   41  142-182     1-43  (79)
 12 KOG1834|consensus               98.1 2.6E-05 5.6E-10   67.6   9.2   79    6-110   165-245 (952)
 13 PF08266 Cadherin_2:  Cadherin-  97.7 3.8E-05 8.3E-10   50.3   3.7   23   57-79     44-66  (84)
 14 PF08266 Cadherin_2:  Cadherin-  97.2 0.00088 1.9E-08   43.9   5.1   60  122-182     3-66  (84)
 15 smart00736 CADG Dystroglycan-t  96.7    0.05 1.1E-06   36.3  10.3   52   57-112    45-96  (97)
 16 PF08758 Cadherin_pro:  Cadheri  96.5  0.0077 1.7E-07   39.9   5.0   62  113-179     2-63  (90)
 17 PF08758 Cadherin_pro:  Cadheri  95.0    0.35 7.7E-06   32.0   8.4   62    3-95     20-81  (90)
 18 TIGR01965 VCBS_repeat VCBS rep  95.0    0.41 8.8E-06   32.3   8.7   88    9-129     2-97  (99)
 19 PF05345 He_PIG:  Putative Ig d  86.1     4.8  0.0001   23.3   5.9   35   57-93     13-48  (49)
 20 TIGR00845 caca sodium/calcium   81.3      49  0.0011   31.3  14.4   56  100-159   514-569 (928)
 21 PF07495 Y_Y_Y:  Y_Y_Y domain;   81.3     9.3  0.0002   22.9   6.6   29   79-107    37-65  (66)
 22 TIGR03660 T1SS_rpt_143 T1SS-14  73.4      31 0.00066   24.7  11.7   67   58-130    56-128 (137)
 23 KOG3597|consensus               69.9      22 0.00049   30.5   7.0   60   99-159    24-83  (442)
 24 KOG4221|consensus               51.6 2.5E+02  0.0054   27.7  12.3   69   45-116   552-620 (1381)
 25 COG2706 3-carboxymuconate cycl  47.8 1.6E+02  0.0035   24.5   9.8   72  103-179   213-287 (346)
 26 PF03160 Calx-beta:  Calx-beta   45.1      84  0.0018   20.4   7.1   52  104-159     2-53  (100)
 27 PF03413 PepSY:  Peptidase prop  43.3      44 0.00095   19.5   3.4   12  163-174    51-62  (64)
 28 PF00778 DIX:  DIX domain;  Int  43.1      79  0.0017   20.5   4.7   56   57-119    16-74  (84)
 29 PF12245 Big_3_2:  Bacterial Ig  42.1      75  0.0016   18.9   5.3   30   80-111    22-51  (60)
 30 PF14157 YmzC:  YmzC-like prote  37.2      60  0.0013   19.8   3.2   18  161-178    42-59  (63)
 31 smart00089 PKD Repeats in poly  37.0   1E+02  0.0022   18.9   4.6   29   76-107    50-78  (79)
 32 PF13750 Big_3_3:  Bacterial Ig  34.5 1.8E+02  0.0039   21.1  10.0   31   76-108   118-148 (158)
 33 cd00146 PKD polycystic kidney   33.1      89  0.0019   19.2   3.8   19   76-94     52-70  (81)
 34 PF02494 HYR:  HYR domain;  Int  30.4   1E+02  0.0023   19.2   3.8   25   81-107    57-81  (81)
 35 PF07861 WND:  WisP family N-Te  28.7 1.9E+02  0.0041   22.0   5.3   28  148-175   201-228 (263)
 36 cd02848 Chitinase_N_term Chiti  25.1 2.3E+02   0.005   19.3   4.9   33   74-108    73-105 (106)
 37 PF12461 DUF3688:  Protein of u  22.6      85  0.0018   20.7   2.3   24  151-174    61-86  (91)
 38 PF09100 Qn_am_d_aIV:  Quinohem  22.4 1.6E+02  0.0035   20.8   3.7   34   81-115    98-132 (133)
 39 PF00635 Motile_Sperm:  MSP (Ma  20.6 1.7E+02  0.0036   19.1   3.5   25  150-174    32-57  (109)
 40 KOG4680|consensus               20.5 2.7E+02  0.0059   20.0   4.5   34  123-156   109-142 (153)

No 1  
>cd00031 CA Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here
Probab=100.00  E-value=1.4e-31  Score=202.72  Aligned_cols=159  Identities=39%  Similarity=0.612  Sum_probs=149.2

Q ss_pred             CcCCCeEEEEEEEEeCCCCCCcEEEEEEecCCCcccccCCCcceEEEEEeeCCcccccceEEeCcccEEEECCCCCcccc
Q psy8617           1 MVPCLFQILQVTAVDLDTGNNARITYKLIQDDTAVDLDTGNNARITYKLIQDDKLVHDTFHIMPNTGWLSLKQPMDRETR   80 (184)
Q Consensus         1 ~~p~Gt~v~~v~A~D~D~g~n~~i~Y~i~~~~~~~d~d~~~~~~~~~~~~~~~~~~~~~F~I~~~tG~i~~~~~lD~E~~   80 (184)
                      .+++|+.|+++.|+|+|.+.++.++|+|.++..                       ..+|.|++.+|.|++++.||||..
T Consensus         9 n~~~g~~v~~~~a~D~D~~~~~~~~y~i~~~~~-----------------------~~~F~i~~~tG~l~~~~~lD~e~~   65 (199)
T cd00031           9 NAPPGTVVGTVSATDPDSGENGRVTYSILGGNE-----------------------DGLFSIDPNTGVITTTKPLDREEQ   65 (199)
T ss_pred             CCCCCCEEEEEEEECCCCCCCceEEEEEeCCCC-----------------------cccEEEeCCCCEEEECCCCCCcCC
Confidence            367899999999999999888999999999875                       469999999999999999999999


Q ss_pred             cEEEEEEEEEeCCCCcccceEEEEEEEecCCCCCCccCCCceEEEEeCCCCCCcEEEEEEEEeCCCCCCceEEEEEecCC
Q psy8617          81 DKFLLKIAACDNGTPVSTAYATVAINVLDANDNDPKFSKQDYEFSIEENLPRGTVVGSVKATDEDLASNAALRYSLIPSN  160 (184)
Q Consensus        81 ~~~~l~v~a~d~~~p~~~~~~~v~I~v~d~Nd~~P~f~~~~~~~~v~E~~~~gt~v~~v~a~D~D~~~n~~i~y~i~~~~  160 (184)
                      ..|.|.|.|+|.+.|.++....++|.|.|+||++|.|....|.+.|.|+.++|+.++++.|+|+|.+.++.++|+|.++.
T Consensus        66 ~~~~l~v~a~D~g~~~~~~~~~v~I~V~d~Nd~~P~~~~~~~~~~v~e~~~~~~~i~~~~a~D~D~~~~~~~~y~l~~~~  145 (199)
T cd00031          66 SEYTLTVVASDGGGPPLSSTATVTVTVLDVNDNPPVFEQSSYEASVPENAPPGTVVGTVTATDADSGENAKLTYSILSGN  145 (199)
T ss_pred             ceEEEEEEEEECCcCcceeEEEEEEEEccCCCCCCcccccceEEEEeCCCCCCCEEEEEEEEcCCCCCCccEEEEEeCCC
Confidence            99999999999888877789999999999999999999999999999999999999999999999988999999999987


Q ss_pred             --CcEEEEcCccEEEEcccccccc
Q psy8617         161 --TSFQINPFSGKKSILYLHSVIH  182 (184)
Q Consensus       161 --~~F~Id~~tG~i~~~~~~~~~~  182 (184)
                        .+|.|++.+|.|++.+.++.|.
T Consensus       146 ~~~~f~i~~~~G~i~~~~~ld~e~  169 (199)
T cd00031         146 DKELFSIDPNTGIITLAKPLDREE  169 (199)
T ss_pred             CCCEEEEeCCceEEEeCCccCCcc
Confidence              7999999999999998887664


No 2  
>KOG4289|consensus
Probab=100.00  E-value=1.9e-32  Score=243.19  Aligned_cols=158  Identities=32%  Similarity=0.552  Sum_probs=152.7

Q ss_pred             cCCCeEEEEEEEEeCCCCCCcEEEEEEecCCCcccccCCCcceEEEEEeeCCcccccceEEeCcccEEEECCCCCccccc
Q psy8617           2 VPCLFQILQVTAVDLDTGNNARITYKLIQDDTAVDLDTGNNARITYKLIQDDKLVHDTFHIMPNTGWLSLKQPMDRETRD   81 (184)
Q Consensus         2 ~p~Gt~v~~v~A~D~D~g~n~~i~Y~i~~~~~~~d~d~~~~~~~~~~~~~~~~~~~~~F~I~~~tG~i~~~~~lD~E~~~   81 (184)
                      .++|+.|++|+|+|.|.++|+.|+|++.+++.                       .+.|.|++.+|.|++..+||||+..
T Consensus       282 ~evGy~vLtvrAtD~Dsp~Nani~Yrl~eg~~-----------------------~~~f~in~rSGvI~T~a~lDRE~~~  338 (2531)
T KOG4289|consen  282 LEVGYEVLTVRATDGDSPPNANIRYRLLEGNA-----------------------KNVFEINPRSGVISTRAPLDREELE  338 (2531)
T ss_pred             cccCceEEEEEeccCCCCCCCceEEEecCCCc-----------------------cceeEEcCccceeeccCccCHHhhh
Confidence            47899999999999999999999999999977                       7899999999999999999999999


Q ss_pred             EEEEEEEEEeCCCCcccceEEEEEEEecCCCCCCccCCCceEEEEeCCCCCCcEEEEEEEEeCCCCCCceEEEEEecCC-
Q psy8617          82 KFLLKIAACDNGTPVSTAYATVAINVLDANDNDPKFSKQDYEFSIEENLPRGTVVGSVKATDEDLASNAALRYSLIPSN-  160 (184)
Q Consensus        82 ~~~l~v~a~d~~~p~~~~~~~v~I~v~d~Nd~~P~f~~~~~~~~v~E~~~~gt~v~~v~a~D~D~~~n~~i~y~i~~~~-  160 (184)
                      .|.|.|.|.|.|.|+...++.|.|+|.|+|||+|+|....|.+.|.|+..++++|++|+|+|+|.|.|+.+.|+|.+++ 
T Consensus       339 ~y~L~VeAsDqG~~pgp~Ta~V~itV~D~NDNaPqFse~~Yvvqv~Edvt~~avvlrV~AtDrD~g~Ng~VHYsi~Sgn~  418 (2531)
T KOG4289|consen  339 SYQLDVEASDQGRPPGPRTAMVEITVEDENDNAPQFSEKRYVVQVREDVTPPAVVLRVTATDRDKGTNGKVHYSIASGNG  418 (2531)
T ss_pred             heEEEEEeccCCCCCCCceEEEEEEEEecCCCCccccccceEEEecccCCCCceEEEEEecccCCCcCceEEEEeeccCc
Confidence            9999999999999888889999999999999999999999999999999999999999999999999999999999998 


Q ss_pred             -CcEEEEcCccEEEEcccccccc
Q psy8617         161 -TSFQINPFSGKKSILYLHSVIH  182 (184)
Q Consensus       161 -~~F~Id~~tG~i~~~~~~~~~~  182 (184)
                       +.|.||..||+|.+..+++.|.
T Consensus       419 ~G~f~id~~tGel~vv~plD~e~  441 (2531)
T KOG4289|consen  419 RGQFYIDSLTGELDVVEPLDFEN  441 (2531)
T ss_pred             cccEEEecccceEEEeccccccC
Confidence             9999999999999999999875


No 3  
>KOG4289|consensus
Probab=99.97  E-value=4.9e-30  Score=228.05  Aligned_cols=158  Identities=35%  Similarity=0.547  Sum_probs=150.2

Q ss_pred             cCCCeEEEEEEEEeCCCCCCcEEEEEEecCCCcccccCCCcceEEEEEeeCCcccccceEEeCcccEEEECCCCCccccc
Q psy8617           2 VPCLFQILQVTAVDLDTGNNARITYKLIQDDTAVDLDTGNNARITYKLIQDDKLVHDTFHIMPNTGWLSLKQPMDRETRD   81 (184)
Q Consensus         2 ~p~Gt~v~~v~A~D~D~g~n~~i~Y~i~~~~~~~d~d~~~~~~~~~~~~~~~~~~~~~F~I~~~tG~i~~~~~lD~E~~~   81 (184)
                      +|++|.|++|.|||+|.|.|+++.|.+.++..                      ..+.|.|++.+|.|++.+.||||...
T Consensus       796 ~Pv~TsvlQVSatDaD~g~Ng~v~y~~qg~~d----------------------~p~~F~IEptSGviRtl~rLdRE~~a  853 (2531)
T KOG4289|consen  796 APVFTSVLQVSATDADSGPNGRVYYTFQGGDD----------------------GPGDFYIEPTSGVIRTLRRLDRENVA  853 (2531)
T ss_pred             CCCcceEEEEEEeccCCCCCceEEEEecCCCC----------------------CCCceEEccCcceeehhhhhcchhee
Confidence            69999999999999999999999999987765                      14799999999999999999999999


Q ss_pred             EEEEEEEEEeCCCCcccceEEEEEEEecCCCCCCccCCCceEEEEeCCCCCCcEEEEEEEEeCCCCCCceEEEEEecCC-
Q psy8617          82 KFLLKIAACDNGTPVSTAYATVAINVLDANDNDPKFSKQDYEFSIEENLPRGTVVGSVKATDEDLASNAALRYSLIPSN-  160 (184)
Q Consensus        82 ~~~l~v~a~d~~~p~~~~~~~v~I~v~d~Nd~~P~f~~~~~~~~v~E~~~~gt~v~~v~a~D~D~~~n~~i~y~i~~~~-  160 (184)
                      .|.|.+.|+|.|.|++++.++|+|+|.|+|||||.|++..|.+.|.|+.++|..+++++|.|+|+|+|+.|.|+|.+++ 
T Consensus       854 vy~L~a~avDrg~p~ls~~~eItvtvldvNDnaPvfe~~e~e~~I~enspvgs~va~i~a~dpdEG~NA~IsYqIvgg~d  933 (2531)
T KOG4289|consen  854 VYVLAAYAVDRGNPPLSAPVEITVTVLDVNDNAPVFEQDELELFIEENSPVGSVVALITADDPDEGPNAHISYQIVGGND  933 (2531)
T ss_pred             EEEEEEEEeeCCCCCcCCceEEEEEEEecCCCCCCCCCcceeeEEeecCccceeeEEEEccCCCcCCcceEEEeeccCcc
Confidence            9999999999999999999999999999999999999999999999999999999999999999999999999999997 


Q ss_pred             -CcEEEEcCccEEEEccccccc
Q psy8617         161 -TSFQINPFSGKKSILYLHSVI  181 (184)
Q Consensus       161 -~~F~Id~~tG~i~~~~~~~~~  181 (184)
                       ..|.++...|+|....-|+.|
T Consensus       934 ~~~fq~de~~~~lla~~efdye  955 (2531)
T KOG4289|consen  934 PELFQLDEFSGELLALVEFDYE  955 (2531)
T ss_pred             HHHHHHHHhhhhhhhheeehhh
Confidence             999999999998888777765


No 4  
>KOG1219|consensus
Probab=99.96  E-value=1.7e-28  Score=224.14  Aligned_cols=158  Identities=30%  Similarity=0.424  Sum_probs=153.2

Q ss_pred             cCCCeEEEEEEEEeCCCCCCcEEEEEEecCCCcccccCCCcceEEEEEeeCCcccccceEEeCcccEEEECCCCCccccc
Q psy8617           2 VPCLFQILQVTAVDLDTGNNARITYKLIQDDTAVDLDTGNNARITYKLIQDDKLVHDTFHIMPNTGWLSLKQPMDRETRD   81 (184)
Q Consensus         2 ~p~Gt~v~~v~A~D~D~g~n~~i~Y~i~~~~~~~d~d~~~~~~~~~~~~~~~~~~~~~F~I~~~tG~i~~~~~lD~E~~~   81 (184)
                      +|+||.|++|.|.|.|.|..+.++|+|..+..                       .+.|+|+..+|.|++.+.||||...
T Consensus       966 apiGT~vi~i~A~dedsgldg~l~Y~I~~gdg-----------------------~g~FsId~~tG~irTl~~lDrE~ks 1022 (4289)
T KOG1219|consen  966 APIGTIVIRIQARDEDSGLDGELSYKIRTGDG-----------------------DGIFSIDSTTGSIRTLKALDREKKS 1022 (4289)
T ss_pred             CCcceEEEEEEEecCCCCccceEEEEEEcCCc-----------------------ceeEEecCCcceEeechhhchhhcc
Confidence            79999999999999999999999999999987                       7899999999999999999999999


Q ss_pred             EEEEEEEEEeCCCCcccceEEEEEEEecCCCCCCccCCCceEEEEeCCCCCCcEEEEEEEEeCCCCCCceEEEEEecCC-
Q psy8617          82 KFLLKIAACDNGTPVSTAYATVAINVLDANDNDPKFSKQDYEFSIEENLPRGTVVGSVKATDEDLASNAALRYSLIPSN-  160 (184)
Q Consensus        82 ~~~l~v~a~d~~~p~~~~~~~v~I~v~d~Nd~~P~f~~~~~~~~v~E~~~~gt~v~~v~a~D~D~~~n~~i~y~i~~~~-  160 (184)
                      .|-|+|.|+|.|.+++++.+.+.|.|.|+|||+|+|.++.|..+|.|+++.+..|.++.|.|+|...|+++.|.|.+++ 
T Consensus      1023 ~YwltveA~D~gt~~~ssv~~vyI~ieDvNDn~Pq~s~pvy~asI~enSp~~vsivq~ea~D~Dsssn~kLmykI~sGny 1102 (4289)
T KOG1219|consen 1023 SYWLTVEAKDLGTVPLSSVCEVYIEIEDVNDNVPQFSSPVYYASISENSPETVSIVQAEANDPDSSSNQKLMYKITSGNY 1102 (4289)
T ss_pred             eEEEEEEEEecCCCccccceeEEEEEEecCCCCcccCCceEeeeeccCCCCceEEEEeccCCCCcccCcceEEEEccCCc
Confidence            9999999999999999999999999999999999999999999999999999999999999999888999999999997 


Q ss_pred             -CcEEEEcCccEEEEcccccccc
Q psy8617         161 -TSFQINPFSGKKSILYLHSVIH  182 (184)
Q Consensus       161 -~~F~Id~~tG~i~~~~~~~~~~  182 (184)
                       ++|.|+|.||.|++.+-+++|.
T Consensus      1103 q~FF~Id~~TG~iTt~r~LDRE~ 1125 (4289)
T KOG1219|consen 1103 QGFFQIDPETGLITTIRRLDREK 1125 (4289)
T ss_pred             cceEEEccccceeeeehhhcccc
Confidence             9999999999999999998875


No 5  
>KOG1219|consensus
Probab=99.95  E-value=4.8e-27  Score=214.84  Aligned_cols=158  Identities=27%  Similarity=0.471  Sum_probs=153.4

Q ss_pred             cCCCeEEEEEEEEeCCCCCCcEEEEEEecCCCcccccCCCcceEEEEEeeCCcccccceEEeCcccEEEECCCCCccccc
Q psy8617           2 VPCLFQILQVTAVDLDTGNNARITYKLIQDDTAVDLDTGNNARITYKLIQDDKLVHDTFHIMPNTGWLSLKQPMDRETRD   81 (184)
Q Consensus         2 ~p~Gt~v~~v~A~D~D~g~n~~i~Y~i~~~~~~~d~d~~~~~~~~~~~~~~~~~~~~~F~I~~~tG~i~~~~~lD~E~~~   81 (184)
                      +|+||.|+++.|+|.|.|-|+.++|.|.+++.                       ...|.|+.++|.|++.++||+|...
T Consensus       653 vpigt~la~L~atD~Dtgfng~l~yvI~dgne-----------------------~~~~~Id~qsg~itvas~ld~~~t~  709 (4289)
T KOG1219|consen  653 VPIGTTLAILSATDSDTGFNGKLVYVIEDGNE-----------------------SICFLIDRQSGNITVASPLDNENTE  709 (4289)
T ss_pred             CCCCceEEEEeccCCCCCcCceEEEEEeCCcc-----------------------ceEEEEecccceEEEecchhhhhhh
Confidence            68999999999999999999999999999987                       7899999999999999999999999


Q ss_pred             EEEEEEEEEeCCCCcccceEEEEEEEecCCCCCCccCCCceEEEEeCCCCCCcEEEEEEEEeCCCCCCceEEEEEecCCC
Q psy8617          82 KFLLKIAACDNGTPVSTAYATVAINVLDANDNDPKFSKQDYEFSIEENLPRGTVVGSVKATDEDLASNAALRYSLIPSNT  161 (184)
Q Consensus        82 ~~~l~v~a~d~~~p~~~~~~~v~I~v~d~Nd~~P~f~~~~~~~~v~E~~~~gt~v~~v~a~D~D~~~n~~i~y~i~~~~~  161 (184)
                      .|.|.|.|.|.|.|..++...+.|.|.|.|||+|.|.+..|.+.|+|+..+|+.|++|.+.|.|.|.||+++|+|.....
T Consensus       710 ~yiLnvta~D~gtPqkss~r~l~v~vkd~ndn~p~f~e~sy~vtvsedtepgs~Ia~vetnd~D~g~NG~v~fsL~n~sd  789 (4289)
T KOG1219|consen  710 QYILNVTAYDLGTPQKSSWRLLLVFVKDYNDNTPIFVERSYHVTVSEDTEPGSFIAHVETNDTDGGNNGMVSFSLLNKSD  789 (4289)
T ss_pred             eeEEEEEEecCCCchhhceeeEEEEEEecccCCccccccceEEEEecCCCCCceEEEEEecccCCCCCceEEEEecCCcc
Confidence            99999999999999999999999999999999999999999999999999999999999999999999999999999899


Q ss_pred             cEEEEcCccEEEEcccccccc
Q psy8617         162 SFQINPFSGKKSILYLHSVIH  182 (184)
Q Consensus       162 ~F~Id~~tG~i~~~~~~~~~~  182 (184)
                      .|+|||.||.|.+.+.+++|.
T Consensus       790 vfsIdp~tGivv~~~sLdrE~  810 (4289)
T KOG1219|consen  790 VFSIDPFTGIVVTSKSLDREG  810 (4289)
T ss_pred             eEEecCcccEEEeccccCccc
Confidence            999999999999999998874


No 6  
>PF00028 Cadherin:  Cadherin domain;  InterPro: IPR002126 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion.; GO: 0005509 calcium ion binding, 0007156 homophilic cell adhesion, 0016020 membrane; PDB: 2A4E_A 2A4C_B 2O72_A 2QVI_A 1NCJ_A 3Q2W_A 3Q2N_A 3LNH_B 3LNI_A 3Q2L_A ....
Probab=99.82  E-value=6.5e-19  Score=118.21  Aligned_cols=84  Identities=45%  Similarity=0.665  Sum_probs=79.8

Q ss_pred             cCCCeEEEEEEEEeCCCCCCcEEEEEEecCCCcccccCCCcceEEEEEeeCCcccccceEEeCcccEEEECCCCCccccc
Q psy8617           2 VPCLFQILQVTAVDLDTGNNARITYKLIQDDTAVDLDTGNNARITYKLIQDDKLVHDTFHIMPNTGWLSLKQPMDRETRD   81 (184)
Q Consensus         2 ~p~Gt~v~~v~A~D~D~g~n~~i~Y~i~~~~~~~d~d~~~~~~~~~~~~~~~~~~~~~F~I~~~tG~i~~~~~lD~E~~~   81 (184)
                      +|+|+.|+++.|+|+|.+.|+.+.|+|.++..                       .++|.|++.+|.|++++.||||...
T Consensus         9 ~~~g~~v~~v~a~D~D~~~n~~i~y~i~~~~~-----------------------~~~F~I~~~tg~i~~~~~LD~E~~~   65 (93)
T PF00028_consen    9 APPGTVVGQVTATDPDSGPNSQITYSILGGNP-----------------------DGLFSIDPNTGEISLKKPLDRETQS   65 (93)
T ss_dssp             GSTSSEEEEEEEEESSTSTTSSEEEEEEETTS-----------------------TTSEEEETTTTEEEESSSSCTTTTS
T ss_pred             CCCCCEEEEEEEEeCCCCCCceEEEEEecCcc-----------------------cCceEEeeeeeccccceecCcccCC
Confidence            68899999999999999999999999999986                       6799999999999999999999999


Q ss_pred             EEEEEEEEEeC-CCCcccceEEEEEEEe
Q psy8617          82 KFLLKIAACDN-GTPVSTAYATVAINVL  108 (184)
Q Consensus        82 ~~~l~v~a~d~-~~p~~~~~~~v~I~v~  108 (184)
                      .|.|.|.|+|. +.|++++++.|+|+|.
T Consensus        66 ~y~l~v~a~D~~~~~~~~~~~~V~I~V~   93 (93)
T PF00028_consen   66 SYQLTVRATDSGGSPPLSSTATVTINVL   93 (93)
T ss_dssp             EEEEEEEEEETTTSSEEEEEEEEEEEEE
T ss_pred             EEEEEEEEEECCCCCCCEEEEEEEEEEC
Confidence            99999999999 8899999999999884


No 7  
>smart00112 CA Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
Probab=99.81  E-value=3e-19  Score=116.28  Aligned_cols=79  Identities=38%  Similarity=0.709  Sum_probs=72.6

Q ss_pred             EeCCCCCCcEEEEEEecCCCcccccCCCcceEEEEEeeCCcccccceEEeCcccEEEECCCCCcccccEEEEEEEEEeCC
Q psy8617          14 VDLDTGNNARITYKLIQDDTAVDLDTGNNARITYKLIQDDKLVHDTFHIMPNTGWLSLKQPMDRETRDKFLLKIAACDNG   93 (184)
Q Consensus        14 ~D~D~g~n~~i~Y~i~~~~~~~d~d~~~~~~~~~~~~~~~~~~~~~F~I~~~tG~i~~~~~lD~E~~~~~~l~v~a~d~~   93 (184)
                      +|+|.|.|+.++|+|.++..                       ..+|.|++.+|.|+++++||||....|.|.|.|.|.+
T Consensus         1 ~D~D~g~n~~i~Y~i~~~~~-----------------------~~~F~i~~~tg~i~~~~~LD~e~~~~y~l~v~a~D~~   57 (79)
T smart00112        1 TDADSGENGKVTYSILSGNE-----------------------DGLFSIDPETGEITTTKPLDREEQPEYTLTVEATDGG   57 (79)
T ss_pred             CCCCCCcCcEEEEEEecCCC-----------------------CCEEEEeCCccEEEeCCccCeeCCCeEEEEEEEEECC
Confidence            47888888888888887765                       4799999999999999999999999999999999999


Q ss_pred             CCcccceEEEEEEEecCCCCCC
Q psy8617          94 TPVSTAYATVAINVLDANDNDP  115 (184)
Q Consensus        94 ~p~~~~~~~v~I~v~d~Nd~~P  115 (184)
                      .|++++.+.|+|.|.|+|||+|
T Consensus        58 ~~~~~~~~~v~I~V~D~Nd~~P   79 (79)
T smart00112       58 GPPLSSTATVTVTVLDVNDNAP   79 (79)
T ss_pred             CCCcccEEEEEEEEEECCCCCC
Confidence            9999999999999999999998


No 8  
>KOG1834|consensus
Probab=99.61  E-value=1.4e-14  Score=123.01  Aligned_cols=129  Identities=25%  Similarity=0.356  Sum_probs=106.9

Q ss_pred             EeeCCcccccceEEeCcc--cEEEECCCCCcccccEEEEEEEEEeCCC-C-----cccceEEEEEEEecCCCCCCccCCC
Q psy8617          49 LIQDDKLVHDTFHIMPNT--GWLSLKQPMDRETRDKFLLKIAACDNGT-P-----VSTAYATVAINVLDANDNDPKFSKQ  120 (184)
Q Consensus        49 ~~~~~~~~~~~F~I~~~t--G~i~~~~~lD~E~~~~~~l~v~a~d~~~-p-----~~~~~~~v~I~v~d~Nd~~P~f~~~  120 (184)
                      .+++.+.+...--+|..|  |.|+.+.+||.|.++.|+|+|+|.|+|. |     ..+..++|.|.|.|+|+++|.|..+
T Consensus        73 klhgq~vPFdavVvdK~TGegvlRaK~~lDCelqkeytf~iQAydCg~gpdgtn~kKShkatvhIrVkDvNe~AP~f~ep  152 (952)
T KOG1834|consen   73 KLHGQPVPFDAVVVDKYTGEGVLRAKEPLDCELQKEYTFTIQAYDCGNGPDGTNTKKSHKATVHIRVKDVNEFAPVFKEP  152 (952)
T ss_pred             EecCCCCCceEEEEeccCCceEEeecCcccccccccceEEEEEEecCCCCCccccccccceEEEEEeccccccCchhccc
Confidence            334444444445556655  5799999999999999999999999874 4     3677789999999999999999999


Q ss_pred             ceEEEEeCCCCCCcEEEEEEEEeCCCC-CCceE-EEEEecCCCcEEEEcCccEEEEccccc
Q psy8617         121 DYEFSIEENLPRGTVVGSVKATDEDLA-SNAAL-RYSLIPSNTSFQINPFSGKKSILYLHS  179 (184)
Q Consensus       121 ~~~~~v~E~~~~gt~v~~v~a~D~D~~-~n~~i-~y~i~~~~~~F~Id~~tG~i~~~~~~~  179 (184)
                      .|.+.|.|. +.-..|++|.|.|.|-+ ++++| .|.|...+-+|.||. .|.|+.+..+.
T Consensus       153 ~Yka~V~EG-K~yd~il~veAiD~DCspq~sqIC~YEI~t~d~PFaIdn-~G~irnTekLn  211 (952)
T KOG1834|consen  153 WYKAHVTEG-KVYDSILRVEAIDKDCSPQYSQICEYEITTPDVPFAIDN-DGNIRNTEKLN  211 (952)
T ss_pred             ceeeEEecc-eeeeeeEEEEeecCCCCCcccceeEEEecCCCCceEEcC-CCccccccccc
Confidence            999999998 56777999999999986 67877 899999999999984 88888877554


No 9  
>cd00031 CA Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here
Probab=99.49  E-value=7.2e-13  Score=100.03  Aligned_cols=85  Identities=39%  Similarity=0.653  Sum_probs=78.3

Q ss_pred             cCCCeEEEEEEEEeCCCCCCcEEEEEEecCCCcccccCCCcceEEEEEeeCCcccccceEEeCcccEEEECCCCCccccc
Q psy8617           2 VPCLFQILQVTAVDLDTGNNARITYKLIQDDTAVDLDTGNNARITYKLIQDDKLVHDTFHIMPNTGWLSLKQPMDRETRD   81 (184)
Q Consensus         2 ~p~Gt~v~~v~A~D~D~g~n~~i~Y~i~~~~~~~d~d~~~~~~~~~~~~~~~~~~~~~F~I~~~tG~i~~~~~lD~E~~~   81 (184)
                      .|+|+.|+++.|+|+|.+.++.++|+|.++..                       .++|.|++.+|.|++.+.||+|...
T Consensus       115 ~~~~~~i~~~~a~D~D~~~~~~~~y~l~~~~~-----------------------~~~f~i~~~~G~i~~~~~ld~e~~~  171 (199)
T cd00031         115 APPGTVVGTVTATDADSGENAKLTYSILSGND-----------------------KELFSIDPNTGIITLAKPLDREEKS  171 (199)
T ss_pred             CCCCCEEEEEEEEcCCCCCCccEEEEEeCCCC-----------------------CCEEEEeCCceEEEeCCccCCccCc
Confidence            57899999999999999889999999988875                       4699999999999999999999999


Q ss_pred             EEEEEEEEEeCCCCcccceEEEEEEEec
Q psy8617          82 KFLLKIAACDNGTPVSTAYATVAINVLD  109 (184)
Q Consensus        82 ~~~l~v~a~d~~~p~~~~~~~v~I~v~d  109 (184)
                      .|.+.|.|+|.+.|.++.++.++|.|.|
T Consensus       172 ~~~l~v~a~D~~~~~~~~~~~i~i~v~d  199 (199)
T cd00031         172 SYELTVVATDGGGPPLSSTATVTVTVLD  199 (199)
T ss_pred             eEEEEEEEEECCCCCceeEEEEEEEEEC
Confidence            9999999999988778888999998875


No 10 
>PF00028 Cadherin:  Cadherin domain;  InterPro: IPR002126 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion.; GO: 0005509 calcium ion binding, 0007156 homophilic cell adhesion, 0016020 membrane; PDB: 2A4E_A 2A4C_B 2O72_A 2QVI_A 1NCJ_A 3Q2W_A 3Q2N_A 3LNH_B 3LNI_A 3Q2L_A ....
Probab=99.42  E-value=1.3e-12  Score=87.37  Aligned_cols=62  Identities=40%  Similarity=0.592  Sum_probs=59.0

Q ss_pred             eEEEEeCCCCCCcEEEEEEEEeCCCCCCceEEEEEecCC--CcEEEEcCccEEEEccccccccc
Q psy8617         122 YEFSIEENLPRGTVVGSVKATDEDLASNAALRYSLIPSN--TSFQINPFSGKKSILYLHSVIHM  183 (184)
Q Consensus       122 ~~~~v~E~~~~gt~v~~v~a~D~D~~~n~~i~y~i~~~~--~~F~Id~~tG~i~~~~~~~~~~~  183 (184)
                      |.+.|+|++++|+.++++.|.|+|.+.|+.+.|+|.+++  .+|.|++.+|.|++.+.++.|..
T Consensus         1 Y~~~v~E~~~~g~~v~~v~a~D~D~~~n~~i~y~i~~~~~~~~F~I~~~tg~i~~~~~LD~E~~   64 (93)
T PF00028_consen    1 YSFSVPENAPPGTVVGQVTATDPDSGPNSQITYSILGGNPDGLFSIDPNTGEISLKKPLDRETQ   64 (93)
T ss_dssp             EEEEEETTGSTSSEEEEEEEEESSTSTTSSEEEEEEETTSTTSEEEETTTTEEEESSSSCTTTT
T ss_pred             CEEEEECCCCCCCEEEEEEEEeCCCCCCceEEEEEecCcccCceEEeeeeeccccceecCcccC
Confidence            789999999999999999999999999999999999987  99999999999999999998753


No 11 
>smart00112 CA Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
Probab=98.08  E-value=6.8e-06  Score=52.84  Aligned_cols=41  Identities=27%  Similarity=0.382  Sum_probs=36.9

Q ss_pred             EeCCCCCCceEEEEEecCC--CcEEEEcCccEEEEcccccccc
Q psy8617         142 TDEDLASNAALRYSLIPSN--TSFQINPFSGKKSILYLHSVIH  182 (184)
Q Consensus       142 ~D~D~~~n~~i~y~i~~~~--~~F~Id~~tG~i~~~~~~~~~~  182 (184)
                      +|+|.|.|+.++|+|.+++  .+|.|++.+|.|++.+.++.|.
T Consensus         1 ~D~D~g~n~~i~Y~i~~~~~~~~F~i~~~tg~i~~~~~LD~e~   43 (79)
T smart00112        1 TDADSGENGKVTYSILSGNEDGLFSIDPETGEITTTKPLDREE   43 (79)
T ss_pred             CCCCCCcCcEEEEEEecCCCCCEEEEeCCccEEEeCCccCeeC
Confidence            4889989999999999887  8999999999999998888774


No 12 
>KOG1834|consensus
Probab=98.07  E-value=2.6e-05  Score=67.56  Aligned_cols=79  Identities=22%  Similarity=0.340  Sum_probs=65.8

Q ss_pred             eEEEEEEEEeCCCCC-CcE-EEEEEecCCCcccccCCCcceEEEEEeeCCcccccceEEeCcccEEEECCCCCcccccEE
Q psy8617           6 FQILQVTAVDLDTGN-NAR-ITYKLIQDDTAVDLDTGNNARITYKLIQDDKLVHDTFHIMPNTGWLSLKQPMDRETRDKF   83 (184)
Q Consensus         6 t~v~~v~A~D~D~g~-n~~-i~Y~i~~~~~~~d~d~~~~~~~~~~~~~~~~~~~~~F~I~~~tG~i~~~~~lD~E~~~~~   83 (184)
                      ..|++|.|.|.|.++ +++ ..|.|...+-                         +|.||. .|.|+.+++|.|.+..+|
T Consensus       165 d~il~veAiD~DCspq~sqIC~YEI~t~d~-------------------------PFaIdn-~G~irnTekLny~ke~~Y  218 (952)
T KOG1834|consen  165 DSILRVEAIDKDCSPQYSQICEYEITTPDV-------------------------PFAIDN-DGNIRNTEKLNYTKEHQY  218 (952)
T ss_pred             eeeEEEEeecCCCCCcccceeEEEecCCCC-------------------------ceEEcC-CCccccccccccccceeE
Confidence            458999999999764 555 4788887654                         899976 599999999999999999


Q ss_pred             EEEEEEEeCCCCcccceEEEEEEEecC
Q psy8617          84 LLKIAACDNGTPVSTAYATVAINVLDA  110 (184)
Q Consensus        84 ~l~v~a~d~~~p~~~~~~~v~I~v~d~  110 (184)
                      .|+|.|.|+|.....+.+.|+|.|...
T Consensus       219 ~ltVtAyDCg~kraa~d~lV~v~Vkp~  245 (952)
T KOG1834|consen  219 KLTVTAYDCGKKRAASDSLVTVHVKPT  245 (952)
T ss_pred             EEEEEEEecccccccCcceEEEEecCc
Confidence            999999999976656667888888765


No 13 
>PF08266 Cadherin_2:  Cadherin-like;  InterPro: IPR013164 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion. This entry represents a cadherin domain that is usually found at the N terminus of cadherin proteins.; PDB: 1WUZ_A 1WYJ_A.
Probab=97.74  E-value=3.8e-05  Score=50.30  Aligned_cols=23  Identities=30%  Similarity=0.448  Sum_probs=18.2

Q ss_pred             ccceEEeCcccEEEECCCCCccc
Q psy8617          57 HDTFHIMPNTGWLSLKQPMDRET   79 (184)
Q Consensus        57 ~~~F~I~~~tG~i~~~~~lD~E~   79 (184)
                      ..+|.++..+|.|+++..+|||+
T Consensus        44 ~~~~~v~~~tG~L~v~~rIDRE~   66 (84)
T PF08266_consen   44 SQYFRVNEKTGDLFVSERIDREE   66 (84)
T ss_dssp             S-SEEE-TTTSEEEESS--SCCC
T ss_pred             cceeEecCCceeEEeCCccCHHH
Confidence            67999999999999999999998


No 14 
>PF08266 Cadherin_2:  Cadherin-like;  InterPro: IPR013164 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion. This entry represents a cadherin domain that is usually found at the N terminus of cadherin proteins.; PDB: 1WUZ_A 1WYJ_A.
Probab=97.23  E-value=0.00088  Score=43.86  Aligned_cols=60  Identities=22%  Similarity=0.398  Sum_probs=37.0

Q ss_pred             eEEEEeCCCCCCcEEEEEEEEeCCCCC--CceEEEEEecC--CCcEEEEcCccEEEEcccccccc
Q psy8617         122 YEFSIEENLPRGTVVGSVKATDEDLAS--NAALRYSLIPS--NTSFQINPFSGKKSILYLHSVIH  182 (184)
Q Consensus       122 ~~~~v~E~~~~gt~v~~v~a~D~D~~~--n~~i~y~i~~~--~~~F~Id~~tG~i~~~~~~~~~~  182 (184)
                      ..+.|+|..++|+.|+.| |.|.....  ...-.|+|.+.  ..+|.+++.||.|++..-.|+|.
T Consensus         3 i~YsV~EE~~~Gt~IGni-a~dL~l~~~~l~~~~~ri~s~~~~~~~~v~~~tG~L~v~~rIDRE~   66 (84)
T PF08266_consen    3 IRYSVPEEMPPGTVIGNI-AKDLGLDPQSLSSRNFRIVSEGNSQYFRVNEKTGDLFVSERIDREE   66 (84)
T ss_dssp             EEEEEESS--TT-EEEEC-CCCCT--HHHHCCTTBEEE-SSSS-SEEE-TTTSEEEESS--SCCC
T ss_pred             eEEEeecCCCCCCEEEEh-HHhhCCCcccccccceEEeecCCcceeEecCCceeEEeCCccCHHH
Confidence            467899999999999998 55543221  11234665554  39999999999999999888874


No 15 
>smart00736 CADG Dystroglycan-type cadherin-like domains. Cadherin-homologous domains present in metazoan dystroglycans and alpha/epsilon sarcoglycans, yeast Axl2p and in a very large protein from magnetotactic bacteria. Likely to bind calcium ions.
Probab=96.70  E-value=0.05  Score=36.34  Aligned_cols=52  Identities=29%  Similarity=0.388  Sum_probs=40.9

Q ss_pred             ccceEEeCcccEEEECCCCCcccccEEEEEEEEEeCCCCcccceEEEEEEEecCCC
Q psy8617          57 HDTFHIMPNTGWLSLKQPMDRETRDKFLLKIAACDNGTPVSTAYATVAINVLDAND  112 (184)
Q Consensus        57 ~~~F~I~~~tG~i~~~~~lD~E~~~~~~l~v~a~d~~~p~~~~~~~v~I~v~d~Nd  112 (184)
                      .+|...++.++.+.=. +.... ...|.+.|.|+|..+  .+....++|.|.+.|+
T Consensus        45 P~Wl~fd~~~~~~~Gt-P~~~~-~g~~~i~v~a~D~~g--~~~~~~f~i~V~~~~~   96 (97)
T smart00736       45 PSWLSFDSDTGTLSGT-PTNSD-VGSLSLKVTATDSSG--ASASDTFTITVVNTND   96 (97)
T ss_pred             CCeEEEeCCCCEEEEE-CCCCC-CcEEEEEEEEEECCC--CEEEEEEEEEEeCCCC
Confidence            4699999999888763 43333 456999999999876  5667789999999886


No 16 
>PF08758 Cadherin_pro:  Cadherin prodomain like;  InterPro: IPR014868 Cadherins are a group of proteins that mediate calcium dependent cell-cell adhesion. They are activated through cleavage of a prosequence in the late Golgi. This protein corresponds to the folded region of the prosequence, and is termed the prodomain. The prodomain shows structural resemblance to the cadherin domain, but lacks all the features known to be important for cadherin-cadherin interactions []. ; GO: 0007155 cell adhesion, 0016021 integral to membrane; PDB: 1OP4_A.
Probab=96.48  E-value=0.0077  Score=39.94  Aligned_cols=62  Identities=21%  Similarity=0.321  Sum_probs=30.9

Q ss_pred             CCCccCCCceEEEEeCCCCCCcEEEEEEEEeCCCCCCceEEEEEecCCCcEEEEcCccEEEEccccc
Q psy8617         113 NDPKFSKQDYEFSIEENLPRGTVVGSVKATDEDLASNAALRYSLIPSNTSFQINPFSGKKSILYLHS  179 (184)
Q Consensus       113 ~~P~f~~~~~~~~v~E~~~~gt~v~~v~a~D~D~~~n~~i~y~i~~~~~~F~Id~~tG~i~~~~~~~  179 (184)
                      .-|-|.+..|.+.|+.+...|..|++|.-.|...  +..+.|.-.  +..|.|.+ .|.|++++.+.
T Consensus         2 C~pGF~~~~~~~~Vp~~l~~g~~lg~V~f~dC~~--~~~~~~~ss--DpdF~V~~-DGsVy~~r~v~   63 (90)
T PF08758_consen    2 CRPGFSQKKYTFEVPSNLEAGQPLGKVNFEDCTG--RRRVIFESS--DPDFRVLE-DGSVYAKRPVQ   63 (90)
T ss_dssp             ---B--S-EEEE----SS-SS--EEE---B--SS-----EEEE-----SEEEEET-TTEEEEES--S
T ss_pred             CcCCcccceEEEEcCchhhCCcEEEEEEeccCCC--CCceEEecC--CCCEEEcC-CCeEEEeeeEe
Confidence            3588999999999999999999999999999853  456777654  45999986 89999988665


No 17 
>PF08758 Cadherin_pro:  Cadherin prodomain like;  InterPro: IPR014868 Cadherins are a group of proteins that mediate calcium dependent cell-cell adhesion. They are activated through cleavage of a prosequence in the late Golgi. This protein corresponds to the folded region of the prosequence, and is termed the prodomain. The prodomain shows structural resemblance to the cadherin domain, but lacks all the features known to be important for cadherin-cadherin interactions []. ; GO: 0007155 cell adhesion, 0016021 integral to membrane; PDB: 1OP4_A.
Probab=95.04  E-value=0.35  Score=31.97  Aligned_cols=62  Identities=16%  Similarity=0.245  Sum_probs=33.6

Q ss_pred             CCCeEEEEEEEEeCCCCCCcEEEEEEecCCCcccccCCCcceEEEEEeeCCcccccceEEeCcccEEEECCCCCcccccE
Q psy8617           3 PCLFQILQVTAVDLDTGNNARITYKLIQDDTAVDLDTGNNARITYKLIQDDKLVHDTFHIMPNTGWLSLKQPMDRETRDK   82 (184)
Q Consensus         3 p~Gt~v~~v~A~D~D~g~n~~i~Y~i~~~~~~~d~d~~~~~~~~~~~~~~~~~~~~~F~I~~~tG~i~~~~~lD~E~~~~   82 (184)
                      ..|+.|++|.-.|..  .+..+.|.-.                           ...|.|.++ |.|++++.+.....+ 
T Consensus        20 ~~g~~lg~V~f~dC~--~~~~~~~~ss---------------------------DpdF~V~~D-GsVy~~r~v~l~~~~-   68 (90)
T PF08758_consen   20 EAGQPLGKVNFEDCT--GRRRVIFESS---------------------------DPDFRVLED-GSVYAKRPVQLSSEQ-   68 (90)
T ss_dssp             -SS--EEE---B--S--S---EEEE------------------------------SEEEEETT-TEEEEES--S-SSS--
T ss_pred             hCCcEEEEEEeccCC--CCCceEEecC---------------------------CCCEEEcCC-CeEEEeeeEecCCCc-
Confidence            358889999988874  3456777642                           237999775 999999998775443 


Q ss_pred             EEEEEEEEeCCCC
Q psy8617          83 FLLKIAACDNGTP   95 (184)
Q Consensus        83 ~~l~v~a~d~~~p   95 (184)
                      -.|.|.|.|..+.
T Consensus        69 ~~F~V~a~D~~~~   81 (90)
T PF08758_consen   69 RSFTVHAWDSQTQ   81 (90)
T ss_dssp             EEEEEEEEETTTT
T ss_pred             eEEEEEEECCCCC
Confidence            4688999998663


No 18 
>TIGR01965 VCBS_repeat VCBS repeat. This domain of about 100 residues is found multiple (up to 35) copies in long proteins from several species of Vibrio, Colwellia, Bradyrhizobium, and Shewanella (hence the name VCBS) and in smaller copy numbers in proteins from several other bacteria. The large protein size and repeat copy numbers, species distribution, and suggested activities of several member proteins suggests a role for this domain in adhesion.
Probab=95.04  E-value=0.41  Score=32.27  Aligned_cols=88  Identities=22%  Similarity=0.199  Sum_probs=51.8

Q ss_pred             EEEEEEeCCCCCCcEEEEEEecCCCcccccCCCcceEEEEEeeCCcccccceEEeCcccEEEEC--------CCCCcccc
Q psy8617           9 LQVTAVDLDTGNNARITYKLIQDDTAVDLDTGNNARITYKLIQDDKLVHDTFHIMPNTGWLSLK--------QPMDRETR   80 (184)
Q Consensus         9 ~~v~A~D~D~g~n~~i~Y~i~~~~~~~d~d~~~~~~~~~~~~~~~~~~~~~F~I~~~tG~i~~~--------~~lD~E~~   80 (184)
                      +++.++|+|.+.  ...+++.....                      ..+.|.|++ +|...-.        +.|..-+.
T Consensus         2 G~Lt~sD~D~gd--~~~~s~~~~~g----------------------~yGtlti~~-~G~wtYtl~n~~~avq~L~~Ge~   56 (99)
T TIGR01965         2 GQLTISDADAGQ--AHFIAQTDAAG----------------------QYGTFSIDA-DGQWTYQADNSQTAVQALKAGET   56 (99)
T ss_pred             CceEEeCCCCCC--ceEEecccccC----------------------CcEEEEECC-CCcEEEEeCCCcHHHHhhcCCCE
Confidence            468899999754  34555543321                      145677766 4533221        22333334


Q ss_pred             cEEEEEEEEEeCCCCcccceEEEEEEEecCCCCCCccCCCceEEEEeCC
Q psy8617          81 DKFLLKIAACDNGTPVSTAYATVAINVLDANDNDPKFSKQDYEFSIEEN  129 (184)
Q Consensus        81 ~~~~l~v~a~d~~~p~~~~~~~v~I~v~d~Nd~~P~f~~~~~~~~v~E~  129 (184)
                      -.-.|.+.+.|+      .+..|+|.|.-.|| +|..... -...+.|+
T Consensus        57 ~tdsFtvtv~DG------tt~~vtItI~GtND-apvi~~~-~~g~v~ED   97 (99)
T TIGR01965        57 LTDTFTVTSADG------TSQTVTITITGAND-AAVIGGA-DTGSVTED   97 (99)
T ss_pred             EEEEEEEEEeCC------CeEEEEEEEEccCC-CCEEecc-cceeEecC
Confidence            345677778885      27789999999999 6655332 23455554


No 19 
>PF05345 He_PIG:  Putative Ig domain;  InterPro: IPR008009 This alignment represents the conserved core region of a ~90 residue repeat found in several haemagglutinins and other cell surface proteins. Sequence similarities to Hyalin (IPR003410 from INTERPRO) and the PKD domain (IPR000601 from INTERPRO) suggest an Ig-like fold so this family may be similar in function to the (IPR003791 from INTERPRO) and (IPR003790 from INTERPRO) protein families.
Probab=86.10  E-value=4.8  Score=23.26  Aligned_cols=35  Identities=17%  Similarity=0.274  Sum_probs=26.7

Q ss_pred             ccceEEeCcccEEEECCCCCcc-cccEEEEEEEEEeCC
Q psy8617          57 HDTFHIMPNTGWLSLKQPMDRE-TRDKFLLKIAACDNG   93 (184)
Q Consensus        57 ~~~F~I~~~tG~i~~~~~lD~E-~~~~~~l~v~a~d~~   93 (184)
                      ..+..||+.+|.|+=.  .+.+ ....|.+.|.|+|..
T Consensus        13 P~gLs~d~~tG~isGt--p~~~~~~G~y~~~vtatd~~   48 (49)
T PF05345_consen   13 PSGLSLDPSTGTISGT--PTSSVQPGTYTFTVTATDGS   48 (49)
T ss_pred             CCcEEEeCCCCEEEee--cCCCccccEEEEEEEEEcCC
Confidence            5799999999999865  2233 235899999999874


No 20 
>TIGR00845 caca sodium/calcium exchanger 1. This model is specific for the eukaryotic sodium ion/calcium ion exchangers of the Caca family
Probab=81.35  E-value=49  Score=31.25  Aligned_cols=56  Identities=20%  Similarity=0.329  Sum_probs=31.3

Q ss_pred             eEEEEEEEecCCCCCCccCCCceEEEEeCCCCCCcEEEEEEEEeCCCCCCceEEEEEecC
Q psy8617         100 YATVAINVLDANDNDPKFSKQDYEFSIEENLPRGTVVGSVKATDEDLASNAALRYSLIPS  159 (184)
Q Consensus       100 ~~~v~I~v~d~Nd~~P~f~~~~~~~~v~E~~~~gt~v~~v~a~D~D~~~n~~i~y~i~~~  159 (184)
                      ....+|+|.| ||++|.|.-..-.+.|.|+.  |+.-.+|.-.-.-.+ .-.+.|.-..+
T Consensus       514 ps~ATVTIlD-DD~aGIfsFe~~~~sV~Es~--G~vtvtV~RtsGa~G-~VtV~Y~T~dG  569 (928)
T TIGR00845       514 PNTATVTILD-DDHAGIFTFEEDVFHVSESI--GIMEVKVLRTSGARG-TVIVPYRTVEG  569 (928)
T ss_pred             CceEEEEEec-CcccCcccccCceEEEEcCC--CEEEEEEEEcCCCCe-eEEEEEEeecC
Confidence            3456777787 68899876555567788874  554444333211111 22355666554


No 21 
>PF07495 Y_Y_Y:  Y_Y_Y domain;  InterPro: IPR011123 This region is mostly found at the end of the beta propellers (IPR011110 from INTERPRO) in a family of two component regulators. However they are also found tandemly repeated in Q891H4 from SWISSPROT without other signal conduction domains being present. It is named after the conserved tyrosines found in the alignment. The exact function is not known.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=81.31  E-value=9.3  Score=22.89  Aligned_cols=29  Identities=24%  Similarity=0.346  Sum_probs=18.6

Q ss_pred             cccEEEEEEEEEeCCCCcccceEEEEEEE
Q psy8617          79 TRDKFLLKIAACDNGTPVSTAYATVAINV  107 (184)
Q Consensus        79 ~~~~~~l~v~a~d~~~p~~~~~~~v~I~v  107 (184)
                      ....|.|.|.|.|..+........+.|.|
T Consensus        37 ~~G~Y~l~V~a~~~~~~~~~~~~~l~i~I   65 (66)
T PF07495_consen   37 PPGKYTLEVRAKDNNGKWSSDEKSLTITI   65 (66)
T ss_dssp             -SEEEEEEEEEEETTS-B-SS-EEEEEEE
T ss_pred             CCEEEEEEEEEECCCCCcCcccEEEEEEE
Confidence            35679999999997764444335666665


No 22 
>TIGR03660 T1SS_rpt_143 T1SS-143 repeat domain. This model represents a domain of about 143 amino acids that may occur singly or in up to 23 tandem repeats in very large proteins in the genus Vibrio, and in related species such as Legionella pneumophila, Photobacterium profundum, Rhodopseudomonas palustris, Shewanella pealeana, and Aeromonas hydrophila. Proteins with these domains represent a subset of a broader set of proteins with a particular signal for type 1 secretion, consisting of several glycine-rich repeats modeled by pfam00353, followed by a C-terminal domain modeled by TIGR03661. Proteins with this domain tend to share several properties with the RtxA (Repeats in Toxin) protein of Vibrio cholerae, including a large size often containing tandemly repeated domains and a C-terminal signal for type 1 secretion.
Probab=73.36  E-value=31  Score=24.66  Aligned_cols=67  Identities=22%  Similarity=0.389  Sum_probs=41.3

Q ss_pred             cceEEeC-ccc--EEEECCCCCccc---ccEEEEEEEEEeCCCCcccceEEEEEEEecCCCCCCccCCCceEEEEeCCC
Q psy8617          58 DTFHIMP-NTG--WLSLKQPMDRET---RDKFLLKIAACDNGTPVSTAYATVAINVLDANDNDPKFSKQDYEFSIEENL  130 (184)
Q Consensus        58 ~~F~I~~-~tG--~i~~~~~lD~E~---~~~~~l~v~a~d~~~p~~~~~~~v~I~v~d~Nd~~P~f~~~~~~~~v~E~~  130 (184)
                      ..|+|.- ..|  ...+.++||+..   .-...|.|.|+|..+...  ...+.|.|.|  | .|...... .+.|.|..
T Consensus        56 ~Vftvtl~~~GsYtftL~~~lDH~~g~d~l~l~~~v~a~D~DGD~s--~~~l~VtI~D--D-~P~~~~~~-~~~V~E~~  128 (137)
T TIGR03660        56 PVFTLTLNADGSYEFTLEGPLDHAAGSDELTLNFPIIATDFDGDTS--SITLPVTIVD--D-VPTITDVD-ALTVDEDD  128 (137)
T ss_pred             EEEEEEECCCccEEEEEcccccCCCCCceEEEeeeEEEEeCCCCcc--ccEEEEEEEC--C-CCeecccc-ceEEeccc
Confidence            3555531 234  455668898854   446788899998766433  3477888877  4 46654433 36777743


No 23 
>KOG3597|consensus
Probab=69.94  E-value=22  Score=30.50  Aligned_cols=60  Identities=18%  Similarity=0.118  Sum_probs=45.2

Q ss_pred             ceEEEEEEEecCCCCCCccCCCceEEEEeCCCCCCcEEEEEEEEeCCCCCCceEEEEEecC
Q psy8617          99 AYATVAINVLDANDNDPKFSKQDYEFSIEENLPRGTVVGSVKATDEDLASNAALRYSLIPS  159 (184)
Q Consensus        99 ~~~~v~I~v~d~Nd~~P~f~~~~~~~~v~E~~~~gt~v~~v~a~D~D~~~n~~i~y~i~~~  159 (184)
                      .+...+|.|..+||.+..+-...+.+.+.|+...-.....+.+.|+|.+ -..+.|++.+.
T Consensus        24 ~~~~~~i~v~pvndpp~~~~~~~~~l~~~~~~~k~l~~~~l~~~d~d~~-~~~l~f~v~~t   83 (442)
T KOG3597|consen   24 QTDVLRIHVNPVNDPPSLIFPSGSLLVILEGGQKVLDPELLTAADPDSA-PLPLEFQVLGT   83 (442)
T ss_pred             EEeeecccccccCCCcceeecccceEEeecCCceeccceEeeccCCCCC-ccceEEEEccC
Confidence            4556889999999977777666677888887665555667889999876 34677777764


No 24 
>KOG4221|consensus
Probab=51.58  E-value=2.5e+02  Score=27.70  Aligned_cols=69  Identities=12%  Similarity=0.201  Sum_probs=49.7

Q ss_pred             EEEEEeeCCcccccceEEeCcccEEEECCCCCcccccEEEEEEEEEeCCCCcccceEEEEEEEecCCCCCCc
Q psy8617          45 ITYKLIQDDKLVHDTFHIMPNTGWLSLKQPMDRETRDKFLLKIAACDNGTPVSTAYATVAINVLDANDNDPK  116 (184)
Q Consensus        45 ~~~~~~~~~~~~~~~F~I~~~tG~i~~~~~lD~E~~~~~~l~v~a~d~~~p~~~~~~~v~I~v~d~Nd~~P~  116 (184)
                      ..|++.-.......++.++.++-++++.   +.|....|.+.|.|....++..++...-.++..|+=+.||.
T Consensus       552 ~~yk~~ys~~~~~~~~~~~~n~~e~ti~---gL~k~TeY~~~vvA~N~~G~g~sS~~i~V~Tlsd~PsaPP~  620 (1381)
T KOG4221|consen  552 TGYKLFYSEDDTGKELRVENNATEYTIN---GLEKYTEYSIRVVAYNSAGSGVSSADITVRTLSDVPSAPPQ  620 (1381)
T ss_pred             eEEEEEEEcCCCCceEEEecCccEEEee---cCCCccceEEEEEEecCCCCCCCCCceEEEeccCCCCCCCc
Confidence            5666655444446788888877777766   46777889999999998887766655555667777776775


No 25 
>COG2706 3-carboxymuconate cyclase [Carbohydrate transport and metabolism]
Probab=47.82  E-value=1.6e+02  Score=24.46  Aligned_cols=72  Identities=15%  Similarity=0.273  Sum_probs=43.6

Q ss_pred             EEEEEecCCCCCCccCCCceEEEEeCCCCCCcEEEEEEEEeCCCCCCceEEEEEecCC---CcEEEEcCccEEEEccccc
Q psy8617         103 VAINVLDANDNDPKFSKQDYEFSIEENLPRGTVVGSVKATDEDLASNAALRYSLIPSN---TSFQINPFSGKKSILYLHS  179 (184)
Q Consensus       103 v~I~v~d~Nd~~P~f~~~~~~~~v~E~~~~gt~v~~v~a~D~D~~~n~~i~y~i~~~~---~~F~Id~~tG~i~~~~~~~  179 (184)
                      -+|.|...|...-.|..-+....++|+-.-..-.+-|+. .+    ++++-|---.+.   ..|.|++.+|.+.+...-.
T Consensus       213 stV~v~~y~~~~g~~~~lQ~i~tlP~dF~g~~~~aaIhi-s~----dGrFLYasNRg~dsI~~f~V~~~~g~L~~~~~~~  287 (346)
T COG2706         213 STVDVLEYNPAVGKFEELQTIDTLPEDFTGTNWAAAIHI-SP----DGRFLYASNRGHDSIAVFSVDPDGGKLELVGITP  287 (346)
T ss_pred             CEEEEEEEcCCCceEEEeeeeccCccccCCCCceeEEEE-CC----CCCEEEEecCCCCeEEEEEEcCCCCEEEEEEEec
Confidence            346677776655667666666667776322222222222 23    467777776653   7788888888887766444


No 26 
>PF03160 Calx-beta:  Calx-beta domain;  InterPro: IPR003644 The calx-beta motif is present as a tandem repeat in the cytoplasmic domains of Calx Na-Ca exchangers, which are used to expel calcium from cells. This motif overlaps domains used for calcium binding and regulation. The calx-beta motif is also present in the cytoplasmic tail of mammalian integrin-beta4, which mediates the bi-directional transfer of signals across the plasma membrane, as well as in some cyanobacterial proteins. This motif contains a series of beta-strands and turns that form a self-contained beta-sheet [, ].; GO: 0007154 cell communication, 0016021 integral to membrane; PDB: 3H6A_B 3FSO_A 3FQ4_B 2DPK_A 2QVM_A 3GIN_B 2QVK_A 2FWU_A 2FWS_A 3E9U_A ....
Probab=45.11  E-value=84  Score=20.37  Aligned_cols=52  Identities=23%  Similarity=0.409  Sum_probs=29.6

Q ss_pred             EEEEecCCCCCCccCCCceEEEEeCCCCCCcEEEEEEEEeCCCCCCceEEEEEecC
Q psy8617         104 AINVLDANDNDPKFSKQDYEFSIEENLPRGTVVGSVKATDEDLASNAALRYSLIPS  159 (184)
Q Consensus       104 ~I~v~d~Nd~~P~f~~~~~~~~v~E~~~~gt~v~~v~a~D~D~~~n~~i~y~i~~~  159 (184)
                      +|.|.| ||.+ .+.-..-...+.|+.  |..-..|.....+....-.+.|...++
T Consensus         2 tvtI~d-~d~~-~v~f~~~~~~v~E~~--~~~~v~V~~~~~~~~~~v~v~~~~~~g   53 (100)
T PF03160_consen    2 TVTILD-DDDP-TVSFSSPSYTVSEGD--GTVTVTVTRSGGSLDGPVTVNYSTVDG   53 (100)
T ss_dssp             EEEEE--TTSE-EEEESSSEEEEETTS--SEEEEEEEEESS-TSSEEEEEEEEEES
T ss_pred             EEEEEC-CCCC-EEEEeCCEEEEEeCC--CEEEEEEEEcccCCCcceEEEEEEeCC
Confidence            467778 5655 665444456778875  445555555544433344677777665


No 27 
>PF03413 PepSY:  Peptidase propeptide and YPEB domain This Prosite motif covers only the active site. This is family M4 in the peptidase classification. ;  InterPro: IPR005075  This signature, PepSY, is found in the propeptide of members of the MEROPS peptidase family M4 (clan MA(E)), which contains the thermostable thermolysins (3.4.24.27 from EC), and related thermolabile neutral proteases (bacillolysins) (3.4.24.28 from EC) from various species of Bacillus. It is also in many non-peptidase proteins, including Bacillus subtilis YpeB protein - a regulator of SleB spore cortex lytic enzyme - and a large number of eubacterial and archaeal cell wall-associated and secreted proteins which are mostly annotated as 'hypothetical protein'. Many extracellular bacterial proteases are produced as proenzymes. The propeptides usually have a dual function, i.e. they function as an intramolecular chaperone required for the folding of the polypeptide and as an inhibitor preventing premature activation of the enzyme. Analysis of the propeptide region of the M4 family of peptidases reveals two regions of conservation, the PepSY domain and a second domain, proximate to the N terminus, the FTP domain (IPR011096 from INTERPRO), which is also found in isolation in the propeptide of eukaryotic peptidases belong to MEROPS peptidase family M36.  Propeptide domain swapping experiments, for example swapping the propeptide domain of PA protease with that of vibrolysin, both propeptides contain the FTP and PepSY domains, allows the PA protease domain to fold correctly and inhibits the C-terminal autoprocessing activity. However, swapping the propeptide of PA protease for the thermolysin propeptide, does not facilitate the correct folding nor the processing of the chimaeric protein into an active peptidase []. Mutational analysis of the Pseudomonas aeruginosa elastase gene revealed two mutations in the propeptide which resulted in the loss of inhibitory activity but not chaperone activity: A-15V and T-153I (where +1 is defined as the first residue of the mature peptidase). Both mutations resulted in peptidase activity, the T-153V mutation being much less effective than the A-15I mutation [] in activating peptidase activity. The T-153V mutation lies N-terminal to the FTP domain while the A-15I mutation is C-terminal to the PepSY domain.  Given the diverse range of other proteins, both domains occur in in isolation, the exact function of each is still unclear; though it has been proposed that the PepSY domain primarily has inhibitory activity and in conjunction with the FTP domain in chaperone activity. ; GO: 0008237 metallopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0005576 extracellular region; PDB: 2GU3_A 3NQZ_A 3NQY_A 2KGY_A.
Probab=43.25  E-value=44  Score=19.45  Aligned_cols=12  Identities=8%  Similarity=0.346  Sum_probs=7.5

Q ss_pred             EEEEcCccEEEE
Q psy8617         163 FQINPFSGKKSI  174 (184)
Q Consensus       163 F~Id~~tG~i~~  174 (184)
                      +.||+.||+|.-
T Consensus        51 v~VDa~tG~Il~   62 (64)
T PF03413_consen   51 VYVDAYTGEILS   62 (64)
T ss_dssp             EEEETTT--EEE
T ss_pred             EEEECCCCeEEE
Confidence            349999999864


No 28 
>PF00778 DIX:  DIX domain;  InterPro: IPR001158 Dishevelled (Dsh) protein is an important component of the Wnt signal-transduction pathway. It has three relatively conserved domains: DIX, PDZ and DEP. The DIX domain of Dvl-1 (a mammalian Dishevelled homologue) shares 37% identity with the C-terminal region of Axin. Dsh can interact with the Axin/APC/GSK3/beta-catenin complex, and may thus modulate its activity []. The Wnt signalling pathway is conserved in various species from Caenorhabditis elegans to mammals, and plays important roles in development, cellular proliferation, and differentiation. The molecular mechanisms by which the Wnt signal regulates cellular functions are becoming increasingly well understood. Wnt stabilises cytoplasmic beta-catenin, which stimulates the expression of genes including c-myc, c-jun, fra-1, and cyclin D1. Axin and its homologue Axil are components of the Wnt signalling pathway that negatively regulate this pathway. Other components of the Wnt signalling pathway, including Dvl, glycogen synthase kinase-3beta (GSK-3beta), beta-catenin, and adenomatous polyposis coli (APC), interact with Axin, and the phosphorylation and stability of beta-catenin are regulated in the Axin complex. Axil has similar functions to Axin. Thus, Axin and Axil act as scaffold proteins in the Wnt signalling pathway, thereby modulating the Wnt-dependent cellular functions [].; GO: 0004871 signal transducer activity, 0007275 multicellular organismal development, 0005622 intracellular; PDB: 1WSP_B 2D5G_A 3PZ7_A 3PZ8_H.
Probab=43.12  E-value=79  Score=20.54  Aligned_cols=56  Identities=18%  Similarity=0.214  Sum_probs=32.1

Q ss_pred             ccceEEeCcccEEEEC---CCCCcccccEEEEEEEEEeCCCCcccceEEEEEEEecCCCCCCccCC
Q psy8617          57 HDTFHIMPNTGWLSLK---QPMDRETRDKFLLKIAACDNGTPVSTAYATVAINVLDANDNDPKFSK  119 (184)
Q Consensus        57 ~~~F~I~~~tG~i~~~---~~lD~E~~~~~~l~v~a~d~~~p~~~~~~~v~I~v~d~Nd~~P~f~~  119 (184)
                      .-.+.|....+.|++.   +.|.+.  ..|.+..++.|...   ..  .|...|.|-++.-|.|+.
T Consensus        16 Py~~~ip~~~~~iTL~dFK~~l~r~--G~yrffFK~~~~d~---~~--~V~eEi~dD~~~LP~~eG   74 (84)
T PF00778_consen   16 PYVVKIPKPPGEITLGDFKEVLPRP--GNYRFFFKSLDPDF---GC--VVKEEITDDDDILPLFEG   74 (84)
T ss_dssp             -EEEEESSSSTT-BHHHHHHTCTS---SGEEEEEEEEETTT---ET--EEEEEE-STTSB---BTT
T ss_pred             ceEEEccCCCCccCHHHHHHhhcCC--CCceEEEEEeCCCC---Ce--eeEEEEcCCcccccccCC
Confidence            4577777666788876   457555  34555556666532   11  688888998888888864


No 29 
>PF12245 Big_3_2:  Bacterial Ig-like domain (group 3);  InterPro: IPR022038  This family of proteins is found in bacteria. They have two conserved sequence motifs: AGN and GMT. 
Probab=42.11  E-value=75  Score=18.93  Aligned_cols=30  Identities=23%  Similarity=0.392  Sum_probs=20.7

Q ss_pred             ccEEEEEEEEEeCCCCcccceEEEEEEEecCC
Q psy8617          80 RDKFLLKIAACDNGTPVSTAYATVAINVLDAN  111 (184)
Q Consensus        80 ~~~~~l~v~a~d~~~p~~~~~~~v~I~v~d~N  111 (184)
                      ...|.+.+.|+|..+.  .......+.+.|..
T Consensus        22 dg~yt~~v~a~D~AGN--~~~~~~~~~i~d~~   51 (60)
T PF12245_consen   22 DGEYTLTVTATDKAGN--TSSSTTQIVIVDNT   51 (60)
T ss_pred             CccEEEEEEEEECCCC--EEEeeeEEEEEcCC
Confidence            5679999999998874  34445555555553


No 30 
>PF14157 YmzC:  YmzC-like protein; PDB: 3KVP_E.
Probab=37.19  E-value=60  Score=19.85  Aligned_cols=18  Identities=6%  Similarity=0.030  Sum_probs=15.5

Q ss_pred             CcEEEEcCccEEEEcccc
Q psy8617         161 TSFQINPFSGKKSILYLH  178 (184)
Q Consensus       161 ~~F~Id~~tG~i~~~~~~  178 (184)
                      ..|.-||.|++|++.+-+
T Consensus        42 KIfkyd~~tNei~L~KE~   59 (63)
T PF14157_consen   42 KIFKYDEDTNEITLKKEF   59 (63)
T ss_dssp             EEEEEETTTTEEEEEEEE
T ss_pred             EEEEeCCCCCeEEEEEee
Confidence            678889999999998855


No 31 
>smart00089 PKD Repeats in polycystic kidney disease 1 (PKD1) and other proteins. Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.
Probab=36.99  E-value=1e+02  Score=18.87  Aligned_cols=29  Identities=17%  Similarity=0.219  Sum_probs=20.5

Q ss_pred             CcccccEEEEEEEEEeCCCCcccceEEEEEEE
Q psy8617          76 DRETRDKFLLKIAACDNGTPVSTAYATVAINV  107 (184)
Q Consensus        76 D~E~~~~~~l~v~a~d~~~p~~~~~~~v~I~v  107 (184)
                      -|+....|.+++.+.|..+   +.+..+.|.|
T Consensus        50 ~y~~~G~y~v~l~v~n~~g---~~~~~~~i~v   78 (79)
T smart00089       50 TYTKPGTYTVTLTVTNAVG---SASATVTVVV   78 (79)
T ss_pred             EeCCCcEEEEEEEEEcCCC---cEEEEEEEEE
Confidence            4566778999999998766   4455555554


No 32 
>PF13750 Big_3_3:  Bacterial Ig-like domain (group 3)
Probab=34.48  E-value=1.8e+02  Score=21.12  Aligned_cols=31  Identities=16%  Similarity=0.357  Sum_probs=21.8

Q ss_pred             CcccccEEEEEEEEEeCCCCcccceEEEEEEEe
Q psy8617          76 DRETRDKFLLKIAACDNGTPVSTAYATVAINVL  108 (184)
Q Consensus        76 D~E~~~~~~l~v~a~d~~~p~~~~~~~v~I~v~  108 (184)
                      ..|....|.|+|.|.|..+.  ..+..+.+...
T Consensus       118 sle~~~~YtLtV~a~D~aGN--~~~~si~F~y~  148 (158)
T PF13750_consen  118 SLEADDSYTLTVSATDKAGN--QSTKSISFSYM  148 (158)
T ss_pred             CcCCCCeEEEEEEEEecCCC--EEEEEEEEEEe
Confidence            34778899999999998774  44444555443


No 33 
>cd00146 PKD polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.
Probab=33.10  E-value=89  Score=19.20  Aligned_cols=19  Identities=0%  Similarity=0.067  Sum_probs=15.3

Q ss_pred             CcccccEEEEEEEEEeCCC
Q psy8617          76 DRETRDKFLLKIAACDNGT   94 (184)
Q Consensus        76 D~E~~~~~~l~v~a~d~~~   94 (184)
                      .+.+...|.+++.++|..+
T Consensus        52 ~y~~~G~y~v~l~v~d~~g   70 (81)
T cd00146          52 TYTKPGTYTVTLTVTNAVG   70 (81)
T ss_pred             EcCCCcEEEEEEEEEeCCC
Confidence            3667788999999999854


No 34 
>PF02494 HYR:  HYR domain;  InterPro: IPR003410 This domain is known as the HYR (Hyalin Repeat) domain, after the protein hyalin that is composed exclusively of this repeat. This domain probably corresponds to a new superfamily in the immunoglobulin fold. The function of this domain is uncertain it may be involved in cell adhesion. In the Sushi repeat-containing protein (SrpX), this domain is found between two sushi repeats.
Probab=30.40  E-value=1e+02  Score=19.18  Aligned_cols=25  Identities=16%  Similarity=0.226  Sum_probs=19.0

Q ss_pred             cEEEEEEEEEeCCCCcccceEEEEEEE
Q psy8617          81 DKFLLKIAACDNGTPVSTAYATVAINV  107 (184)
Q Consensus        81 ~~~~l~v~a~d~~~p~~~~~~~v~I~v  107 (184)
                      ..+.+...|+|..+  ..+.+.+.|.|
T Consensus        57 G~t~V~ytA~D~~G--N~a~C~f~V~V   81 (81)
T PF02494_consen   57 GTTTVTYTATDAAG--NSATCSFTVTV   81 (81)
T ss_pred             ceEEEEEEEEECCC--CEEEEEEEEEC
Confidence            46889999999866  46677777764


No 35 
>PF07861 WND:  WisP family N-Terminal Region;  InterPro: IPR012503 This family is found at the N terminus of the Tropheryma whipplei WisP family proteins []. 
Probab=28.67  E-value=1.9e+02  Score=22.04  Aligned_cols=28  Identities=18%  Similarity=0.239  Sum_probs=24.0

Q ss_pred             CCceEEEEEecCCCcEEEEcCccEEEEc
Q psy8617         148 SNAALRYSLIPSNTSFQINPFSGKKSIL  175 (184)
Q Consensus       148 ~n~~i~y~i~~~~~~F~Id~~tG~i~~~  175 (184)
                      .|..++|+|...+.-..||..||.|...
T Consensus       201 R~S~~T~SLs~P~~~v~lD~~TG~l~~S  228 (263)
T PF07861_consen  201 RGSPFTYSLSTPVAGVRLDANTGALSGS  228 (263)
T ss_pred             cCCcceEEeccCCCceEEecccceeeee
Confidence            4678899999888999999999998764


No 36 
>cd02848 Chitinase_N_term Chitinase N-terminus domain. Chitinases hydrolyze the abundant natural biopolymer chitin, producing smaller chito-oligosaccharides. Chitin consists of multiple N-acetyl-D-glucosamine (NAG) residues connected via beta-1,4-glycosidic linkages and is an important structural element of fungal cell wall and arthropod exoskeletons. On the basis of the mode of chitin hydrolysis, chitinases are classified as random, endo-, and exo-chitinases and based on sequence criteria, chitinases belong to families 18 and 19 of glycosyl hydrolases.  The N-terminus of chitinase may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at  either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitob
Probab=25.11  E-value=2.3e+02  Score=19.32  Aligned_cols=33  Identities=15%  Similarity=0.312  Sum_probs=22.5

Q ss_pred             CCCcccccEEEEEEEEEeCCCCcccceEEEEEEEe
Q psy8617          74 PMDRETRDKFLLKIAACDNGTPVSTAYATVAINVL  108 (184)
Q Consensus        74 ~lD~E~~~~~~l~v~a~d~~~p~~~~~~~v~I~v~  108 (184)
                      .+++.+...|.+.|++.|..+...++  .+.|.|-
T Consensus        73 t~~v~kgG~y~m~V~lCn~dGCS~S~--~~~I~VA  105 (106)
T cd02848          73 TFKVGKGGRYQMQVALCNGDGCSTSA--AKEIVVA  105 (106)
T ss_pred             EEEeCCCCeEEEEEEEECCCCccCcC--CEEEEec
Confidence            35567778899999999887744444  4444444


No 37 
>PF12461 DUF3688:  Protein of unknown function (DUF3688) ;  InterPro: IPR022160 This entry is represented by Spiroplasma phage 1-C74, Orf1. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches.  This domain family is found in bacteria and viruses, and is typically between 79 and 104 amino acids in length. There is a conserved YRW sequence motif. There is a single completely conserved residue Y that may be functionally important. 
Probab=22.61  E-value=85  Score=20.68  Aligned_cols=24  Identities=13%  Similarity=0.018  Sum_probs=18.0

Q ss_pred             eEEEEEecCC--CcEEEEcCccEEEE
Q psy8617         151 ALRYSLIPSN--TSFQINPFSGKKSI  174 (184)
Q Consensus       151 ~i~y~i~~~~--~~F~Id~~tG~i~~  174 (184)
                      .-.|+..+..  ....||+.||+|+.
T Consensus        61 KsvYRWdG~gEPq~P~ID~ntG~Itd   86 (91)
T PF12461_consen   61 KSVYRWDGVGEPQTPTIDKNTGNITD   86 (91)
T ss_pred             EEEEEecCCCCccCceEcCCCCeEeE
Confidence            4467776654  57889999999974


No 38 
>PF09100 Qn_am_d_aIV:  Quinohemoprotein amine dehydrogenase, alpha subunit domain IV;  InterPro: IPR015184 This domain is predominantly found in the prokaryotic protein quinohemoprotein amine dehydrogenase, adopting an immunoglobulin-like beta-sandwich fold, with seven strands arranged into two beta sheets; the fold is possibly related to the immunoglobulin and/or fibronectin type III superfamilies. The precise function of this domain has not, as yet, been defined []. ; PDB: 1JMZ_A 1JMX_A 1PBY_A 1JJU_A.
Probab=22.45  E-value=1.6e+02  Score=20.78  Aligned_cols=34  Identities=26%  Similarity=0.270  Sum_probs=17.2

Q ss_pred             cEEEEEEEEE-eCCCCcccceEEEEEEEecCCCCCC
Q psy8617          81 DKFLLKIAAC-DNGTPVSTAYATVAINVLDANDNDP  115 (184)
Q Consensus        81 ~~~~l~v~a~-d~~~p~~~~~~~v~I~v~d~Nd~~P  115 (184)
                      ..=.|.|.|+ ..++.+++....+.|+|..-|+ +|
T Consensus        98 N~Gnl~VvAtv~d~~~~l~~e~~liVtVqr~~~-pp  132 (133)
T PF09100_consen   98 NAGNLKVVATVKDGGKPLTGEAHLIVTVQRWNN-PP  132 (133)
T ss_dssp             S-EEEEEEEEETTTT---EEEEEEEEE---S----S
T ss_pred             CcccEEEEEEEccCCcccceeEeEEEEeecccC-CC
Confidence            3446777776 3344568888999999988775 44


No 39 
>PF00635 Motile_Sperm:  MSP (Major sperm protein) domain;  InterPro: IPR000535 Major sperm proteins (MSP) are central components in molecular interactions underlying sperm motility in Caenorhabditis elegans, whose sperm employ an amoebae-like crawling motion using a MSP-containing lamellipod, rather than the flagellar-based swimming motion associated with other sperm. These proteins oligomerise to form an extensive filament system that extends from sperm villipoda, along the leading edge of the pseudopod. About 30 MSP isoforms may exist in C. elegans. MSPs form a fibrous network, whereby MSP dimers form helical subfilaments that coil around one another to produce filaments, which in turn form supercoils to produce bundles. The crystal structure of MSP from C. elegans reveals an immunoglobulin (Ig)-like seven-stranded beta sandwich fold []. ; GO: 0005198 structural molecule activity; PDB: 1MSP_A 3MSP_B 2BVU_B 2MSP_C 1Z9O_F 1Z9L_A 3IKK_A 1WIC_A 2CRI_A 2RR3_A ....
Probab=20.64  E-value=1.7e+02  Score=19.07  Aligned_cols=25  Identities=12%  Similarity=0.363  Sum_probs=19.1

Q ss_pred             ceEEEEEecCC-CcEEEEcCccEEEE
Q psy8617         150 AALRYSLIPSN-TSFQINPFSGKKSI  174 (184)
Q Consensus       150 ~~i~y~i~~~~-~~F~Id~~tG~i~~  174 (184)
                      ..+.|.+...+ ..|.|.|..|.|.-
T Consensus        32 ~~i~fKiktt~~~~y~v~P~~G~i~p   57 (109)
T PF00635_consen   32 KPIAFKIKTTNPNRYRVKPSYGIIEP   57 (109)
T ss_dssp             SEEEEEEEES-TTTEEEESSEEEE-T
T ss_pred             CcEEEEEEcCCCceEEecCCCEEECC
Confidence            47889998775 78999999998743


No 40 
>KOG4680|consensus
Probab=20.51  E-value=2.7e+02  Score=20.03  Aligned_cols=34  Identities=18%  Similarity=0.208  Sum_probs=26.2

Q ss_pred             EEEEeCCCCCCcEEEEEEEEeCCCCCCceEEEEE
Q psy8617         123 EFSIEENLPRGTVVGSVKATDEDLASNAALRYSL  156 (184)
Q Consensus       123 ~~~v~E~~~~gt~v~~v~a~D~D~~~n~~i~y~i  156 (184)
                      ...++--.+||+.+.+.+|.|.+...--.+++++
T Consensus       109 sq~LPg~tPPG~Y~lkm~~~d~~~~~LTCisfsf  142 (153)
T KOG4680|consen  109 SQVLPGYTPPGSYVLKMTAYDAKGKELTCISFSF  142 (153)
T ss_pred             eEeccCcCCCceEEEEEEeecCCCCEEEEEEEEE
Confidence            4557778899999999999999876555555554


Done!