Query         psy8640
Match_columns 303
No_of_seqs    200 out of 1806
Neff          8.1 
Searched_HMMs 46136
Date          Sat Aug 17 00:01:23 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy8640.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/8640hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG4289|consensus              100.0 2.3E-40   5E-45  331.3  27.1  266   15-303   277-609 (2531)
  2 KOG4289|consensus              100.0 3.4E-39 7.5E-44  322.9  19.6  269   14-303   171-507 (2531)
  3 KOG1219|consensus              100.0 3.2E-34   7E-39  295.2  28.7  269   11-303   644-1087(4289)
  4 KOG1219|consensus              100.0 1.2E-32 2.7E-37  283.6  28.0  266   14-303  2583-2912(4289)
  5 cd00031 CA Cadherin repeat dom 100.0   5E-28 1.1E-32  208.2  27.1  194   12-265     2-199 (199)
  6 cd00031 CA Cadherin repeat dom  99.9 2.9E-21 6.2E-26  166.0  18.9  128  125-303     1-131 (199)
  7 PF00028 Cadherin:  Cadherin do  99.8 5.4E-17 1.2E-21  123.1  15.0   90  126-264     1-93  (93)
  8 KOG1834|consensus               99.7 4.5E-16 9.8E-21  149.0  18.8  151   59-266    84-245 (952)
  9 smart00112 CA Cadherin repeats  99.6 1.9E-15 4.2E-20  111.0  10.6   77  146-271     1-79  (79)
 10 smart00112 CA Cadherin repeats  99.5 2.4E-13 5.2E-18   99.8   9.8   58   57-119    22-79  (79)
 11 PF00028 Cadherin:  Cadherin do  99.3 5.3E-11 1.2E-15   90.0  11.9   89   13-112     2-93  (93)
 12 KOG1834|consensus               99.2 2.5E-10 5.3E-15  110.1  12.6  142  109-302    21-176 (952)
 13 PF08758 Cadherin_pro:  Cadheri  97.6 0.00067 1.4E-08   51.0   9.5   66  118-199     3-68  (90)
 14 PF08266 Cadherin_2:  Cadherin-  97.3 0.00019   4E-09   53.3   3.1   63  126-198     3-67  (84)
 15 smart00736 CADG Dystroglycan-t  95.9    0.26 5.7E-06   37.2  11.6   73  145-268    24-96  (97)
 16 PF08266 Cadherin_2:  Cadherin-  95.4   0.015 3.3E-07   43.1   3.1   65   12-79      3-67  (84)
 17 TIGR01965 VCBS_repeat VCBS rep  95.1    0.22 4.7E-06   38.0   8.6   30  241-275    59-88  (99)
 18 PF13750 Big_3_3:  Bacterial Ig  93.3     4.2 9.1E-05   33.8  15.5   29  237-265   121-149 (158)
 19 smart00736 CADG Dystroglycan-t  92.5       2 4.3E-05   32.3   9.6   52   56-116    45-96  (97)
 20 TIGR03660 T1SS_rpt_143 T1SS-14  70.7      24 0.00053   28.5   7.3   46  241-291    87-133 (137)
 21 PF12245 Big_3_2:  Bacterial Ig  67.8      20 0.00043   24.4   5.4   31  238-268    22-52  (60)
 22 TIGR00845 caca sodium/calcium   66.3 1.4E+02  0.0029   32.1  13.3   50  114-165   395-444 (928)
 23 cd00146 PKD polycystic kidney   63.7      15 0.00032   26.1   4.3   28  236-263    54-81  (81)
 24 PF05345 He_PIG:  Putative Ig d  61.2      32  0.0007   22.4   5.2   38   54-93     11-48  (49)
 25 smart00089 PKD Repeats in poly  60.2      20 0.00043   25.2   4.5   27  236-263    52-78  (79)
 26 PF07495 Y_Y_Y:  Y_Y_Y domain;   58.7      22 0.00048   24.1   4.4   26  239-264    39-66  (66)
 27 TIGR03660 T1SS_rpt_143 T1SS-14  58.3      55  0.0012   26.5   7.1   67   56-133    55-127 (137)
 28 PF08758 Cadherin_pro:  Cadheri  55.5      56  0.0012   24.4   6.3   35   57-94     46-80  (90)
 29 cd02848 Chitinase_N_term Chiti  51.3      20 0.00044   27.7   3.3   33  233-265    74-106 (106)
 30 PF08329 ChitinaseA_N:  Chitina  49.9      34 0.00073   27.6   4.5   37  232-268    76-112 (133)
 31 KOG3597|consensus               46.4 2.9E+02  0.0063   27.0  11.1   60  104-164    25-84  (442)
 32 cd02848 Chitinase_N_term Chiti  41.6 1.2E+02  0.0025   23.5   6.2   34   73-113    73-106 (106)
 33 PF03160 Calx-beta:  Calx-beta   35.4 1.8E+02  0.0039   21.3  10.4   53  108-164     2-54  (100)
 34 KOG4221|consensus               33.1 7.3E+02   0.016   27.7  14.5   74   23-120   547-620 (1381)
 35 PF05895 DUF859:  Siphovirus pr  32.2 5.7E+02   0.012   26.2  13.3   32   83-121   300-331 (624)
 36 TIGR00845 caca sodium/calcium   27.9   8E+02   0.017   26.5  19.1   57  105-165   515-571 (928)
 37 PF00778 DIX:  DIX domain;  Int  26.9 1.7E+02  0.0036   21.6   4.7   42  241-300    43-84  (84)
 38 PF13754 Big_3_4:  Bacterial Ig  26.3      92   0.002   20.5   3.1   15  239-253    24-38  (54)
 39 PF13750 Big_3_3:  Bacterial Ig  26.0 3.7E+02   0.008   22.1   7.3   36  239-275    15-51  (158)

No 1  
>KOG4289|consensus
Probab=100.00  E-value=2.3e-40  Score=331.29  Aligned_cols=266  Identities=24%  Similarity=0.354  Sum_probs=228.2

Q ss_pred             EeeccccccccceEEEEEEe--cccCceeeeecccccccCCCCcccEEEeCCccEEEEcccCCCCCCCceEEEEEEEEEC
Q psy8640          15 LIMRRGVVHRENIQVYLVWY--GSTSRLCRRTCGTIPLSQQQRLAIQAVDPLTGVLSVQQPLDYDTGSSEHLLILKVEDG   92 (303)
Q Consensus        15 ~i~e~~~~~~~~~~~~~i~~--~~~~~~~~~~~~~~~~~~~~~~~~F~Id~~tG~l~~~~~LD~E~~~~~~~l~i~a~D~   92 (303)
                      .+-||..+|.++.+..+-.-  ..++.+.|+.      ..++..+.|.||+++|.|.+..+||||+.+. |.|.+.|+|.
T Consensus       277 ~lREn~evGy~vLtvrAtD~Dsp~Nani~Yrl------~eg~~~~~f~in~rSGvI~T~a~lDRE~~~~-y~L~VeAsDq  349 (2531)
T KOG4289|consen  277 ELRENLEVGYEVLTVRATDGDSPPNANIRYRL------LEGNAKNVFEINPRSGVISTRAPLDREELES-YQLDVEASDQ  349 (2531)
T ss_pred             HHhhccccCceEEEEEeccCCCCCCCceEEEe------cCCCccceeEEcCccceeeccCccCHHhhhh-eEEEEEeccC
Confidence            45677778888777766443  5677777754      3446778999999999999999999999999 9999999999


Q ss_pred             CCCCCCCCCeeEEEEEEEEEcCCCCCCcCCCCceEEEEeCCCCCCeEEEEEEEEeCCCCCCceEEEEEeeCCCCCCcccC
Q psy8640          93 GKPPLDSPLSSITHLKITLQDENDNAPKFPITEYLEFVGENEPIGSSVFTARATDMDKGDYGKLNYSIISAAASGYTDVD  172 (303)
Q Consensus        93 g~~~~~~~~s~~~~v~I~V~DvNDn~P~f~~~~~~~~V~E~~~~gt~v~~v~A~D~D~g~n~~i~Y~i~~~~~~~~~~~~  172 (303)
                      |.    ++...++.|.|.|.|+|||+|+|....|.+.|+|+..+++.|++|+|+|.|.|.|+++.|+|.+++..      
T Consensus       350 G~----~pgp~Ta~V~itV~D~NDNaPqFse~~Yvvqv~Edvt~~avvlrV~AtDrD~g~Ng~VHYsi~Sgn~~------  419 (2531)
T KOG4289|consen  350 GR----PPGPRTAMVEITVEDENDNAPQFSEKRYVVQVREDVTPPAVVLRVTATDRDKGTNGKVHYSIASGNGR------  419 (2531)
T ss_pred             CC----CCCCceEEEEEEEEecCCCCccccccceEEEecccCCCCceEEEEEecccCCCcCceEEEEeeccCcc------
Confidence            98    44455999999999999999999999999999999999999999999999999999999999998754      


Q ss_pred             CCcCcEEEECCCcEEEEcccCcccccceEEEEEEEEE------------EEeecccccCCC----------------ceE
Q psy8640         173 DSWKLFRVDALTGLVTTNAVFDYEARSRYAFTLLSTY------------SVWTRSLVWSRP----------------MQC  224 (303)
Q Consensus       173 ~~~~~F~Id~~tG~I~~~~~LD~e~~~~~~l~v~a~~------------~~~~~~~~~~~~----------------~~~  224 (303)
                         +.|.||..||+|.+..+||||.. .|++.|+|.+            .+..-.++.+.|                ...
T Consensus       420 ---G~f~id~~tGel~vv~plD~e~~-~ytl~IrAqDggrPpLsn~sgl~iqVlDINDhaPifvstpfq~tvlEnv~lg~  495 (2531)
T KOG4289|consen  420 ---GQFYIDSLTGELDVVEPLDFENS-EYTLRIRAQDGGRPPLSNTSGLVIQVLDINDHAPIFVSTPFQATVLENVPLGY  495 (2531)
T ss_pred             ---ccEEEecccceEEEeccccccCC-eeEEEEEcccCCCCCccCCCceEEEEEecCCCCceeEechhhhhhhhcccccc
Confidence               88999999999999999999998 8999999862            123333333333                222


Q ss_pred             EE----------------------------------eecCceeeeeeecEEEEEEEEEECCCC--eEEEEEEEEEeecCC
Q psy8640         225 LI----------------------------------MRRGVVFDYEARSRYAFTLLSTDSGGR--TCKVKVRVEIESRDE  268 (303)
Q Consensus       225 ~i----------------------------------~~~~~~lD~E~~~~y~l~V~a~D~g~~--s~~~~v~I~V~DvNd  268 (303)
                      .+                                  +...+.||||+.+.|.|.|.|+|.|.+  ++++.|.|.|+|+||
T Consensus       496 ~v~~vqaidadsg~na~l~y~laG~~pf~I~~~SG~Itvtk~ldrEt~~~ysl~V~ard~gtp~l~tstsI~Vtv~dvnd  575 (2531)
T KOG4289|consen  496 LVCHVQAIDADSGENARLHYSLAGVGPFQINNGSGWITVTKELDRETVEHYSLGVEARDHGTPPLSTSTSISVTVLDVND  575 (2531)
T ss_pred             eEEEEecccCCCCcccceeeeeccCCCeeEecCCceEEEeecccccccceEEEEEEEcCCCCCcccccceEEEEecccCC
Confidence            11                                  122367999999999999999999986  778899999999999


Q ss_pred             CCCeeCCCeeEEE-ecCCCCCCCcEEEEEEEEeCCC
Q psy8640         269 FHPQFTERTFKFI-LSGTDLPVGYVVGHLRLVRFRS  303 (303)
Q Consensus       269 n~P~F~~~~y~~~-~e~~~~p~gt~v~~v~A~D~D~  303 (303)
                      |.|+|+++.|++. .|++  |.|+.|.+|+|+|.|.
T Consensus       576 ndP~Ft~~eytl~inED~--pvgsSI~tvtAvD~d~  609 (2531)
T KOG4289|consen  576 NDPTFTQKEYTLRINEDA--PVGSSIVTVTAVDRDA  609 (2531)
T ss_pred             CCCccccCceEEEecCCc--cccceEEEEEEecccc
Confidence            9999999999999 9999  7999999999999983


No 2  
>KOG4289|consensus
Probab=100.00  E-value=3.4e-39  Score=322.91  Aligned_cols=269  Identities=23%  Similarity=0.373  Sum_probs=233.7

Q ss_pred             eEeeccccccccceEEEEEEecccCceeeeecccccccCCCCcccEEEeCCccEEEEcccCCCCCCCceEEEEEEEEECC
Q psy8640          14 CLIMRRGVVHRENIQVYLVWYGSTSRLCRRTCGTIPLSQQQRLAIQAVDPLTGVLSVQQPLDYDTGSSEHLLILKVEDGG   93 (303)
Q Consensus        14 ~~i~e~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~F~Id~~tG~l~~~~~LD~E~~~~~~~l~i~a~D~g   93 (303)
                      ..++||.++++-..+..+.. -+++++.|...+   ..+....++|+||+.+|.|++++.||||+++. +.|.+.|.|.+
T Consensus       171 ~~lpEn~pagT~iasv~A~~-~~a~rl~Ysm~a---l~dsRS~~lFslD~~sG~irta~~lDREt~e~-HvlrVtA~d~~  245 (2531)
T KOG4289|consen  171 KELPENEPAGTIIASVKASD-PDAGRLYYSMVA---LFDSRSQNLFSLDPMSGAIRTAKSLDRETKET-HVLRVTAQDHG  245 (2531)
T ss_pred             ccCcCCCCCCceeEEEEecC-CCcCceEEEeee---ccchhccccEeeccccccchhhhhhhhhhhhe-eEEEEEeeecC
Confidence            68999999999988777654 448888886544   44556678999999999999999999999999 99999999999


Q ss_pred             CCCCCCCCeeEEEEEEEEEcCCCCCCcCCCCceEEEEeCCCCCCeEEEEEEEEeCCCCCCceEEEEEeeCCCCCCcccCC
Q psy8640          94 KPPLDSPLSSITHLKITLQDENDNAPKFPITEYLEFVGENEPIGSSVFTARATDMDKGDYGKLNYSIISAAASGYTDVDD  173 (303)
Q Consensus        94 ~~~~~~~~s~~~~v~I~V~DvNDn~P~f~~~~~~~~V~E~~~~gt~v~~v~A~D~D~g~n~~i~Y~i~~~~~~~~~~~~~  173 (303)
                      .    |.++++++++|.|.|.|||.|+|.+..|..+++||.++|+.|++|+|+|.|.++|+.|+|++.++++.       
T Consensus       246 ~----P~~SAtttv~V~V~D~nDhsPvFEq~~Y~e~lREn~evGy~vLtvrAtD~Dsp~Nani~Yrl~eg~~~-------  314 (2531)
T KOG4289|consen  246 D----PRRSATTTVTVLVLDTNDHSPVFEQDEYREELRENLEVGYEVLTVRATDGDSPPNANIRYRLLEGNAK-------  314 (2531)
T ss_pred             C----CcccceeEEEEEEeecCCCCcccchhHHHHHHhhccccCceEEEEEeccCCCCCCCceEEEecCCCcc-------
Confidence            8    89999999999999999999999999999999999999999999999999999999999999998654       


Q ss_pred             CcCcEEEECCCcEEEEcccCcccccceEEEEEEEEE------------EEeecccccCCC--------------------
Q psy8640         174 SWKLFRVDALTGLVTTNAVFDYEARSRYAFTLLSTY------------SVWTRSLVWSRP--------------------  221 (303)
Q Consensus       174 ~~~~F~Id~~tG~I~~~~~LD~e~~~~~~l~v~a~~------------~~~~~~~~~~~~--------------------  221 (303)
                        ..|.||+.+|.|.+.++||+|+...|.|.|+|++            .+.....+.|.|                    
T Consensus       315 --~~f~in~rSGvI~T~a~lDRE~~~~y~L~VeAsDqG~~pgp~Ta~V~itV~D~NDNaPqFse~~Yvvqv~Edvt~~av  392 (2531)
T KOG4289|consen  315 --NVFEINPRSGVISTRAPLDREELESYQLDVEASDQGRPPGPRTAMVEITVEDENDNAPQFSEKRYVVQVREDVTPPAV  392 (2531)
T ss_pred             --ceeEEcCccceeeccCccCHHhhhheEEEEEeccCCCCCCCceEEEEEEEEecCCCCccccccceEEEecccCCCCce
Confidence              8999999999999999999999999999999972            122222233333                    


Q ss_pred             ----------------ceEEEeec------------C-----ceeeeeeecEEEEEEEEEECCCC--eEEEEEEEEEeec
Q psy8640         222 ----------------MQCLIMRR------------G-----VVFDYEARSRYAFTLLSTDSGGR--TCKVKVRVEIESR  266 (303)
Q Consensus       222 ----------------~~~~i~~~------------~-----~~lD~E~~~~y~l~V~a~D~g~~--s~~~~v~I~V~Dv  266 (303)
                                      +.+.|++.            |     .+||+|.. .|.+.|+|+|.|.|  +.+.-+.|+|+|+
T Consensus       393 vlrV~AtDrD~g~Ng~VHYsi~Sgn~~G~f~id~~tGel~vv~plD~e~~-~ytl~IrAqDggrPpLsn~sgl~iqVlDI  471 (2531)
T KOG4289|consen  393 VLRVTATDRDKGTNGKVHYSIASGNGRGQFYIDSLTGELDVVEPLDFENS-EYTLRIRAQDGGRPPLSNTSGLVIQVLDI  471 (2531)
T ss_pred             EEEEEecccCCCcCceEEEEeeccCccccEEEecccceEEEeccccccCC-eeEEEEEcccCCCCCccCCCceEEEEEec
Confidence                            22333322            1     36999999 99999999997774  6677777999999


Q ss_pred             CCCCCeeCCCeeEEE-ecCCCCCCCcEEEEEEEEeCCC
Q psy8640         267 DEFHPQFTERTFKFI-LSGTDLPVGYVVGHLRLVRFRS  303 (303)
Q Consensus       267 Ndn~P~F~~~~y~~~-~e~~~~p~gt~v~~v~A~D~D~  303 (303)
                      |||+|+|....+.+. .|++  |.|..++.+.|.|+|+
T Consensus       472 NDhaPifvstpfq~tvlEnv--~lg~~v~~vqaidads  507 (2531)
T KOG4289|consen  472 NDHAPIFVSTPFQATVLENV--PLGYLVCHVQAIDADS  507 (2531)
T ss_pred             CCCCceeEechhhhhhhhcc--cccceEEEEecccCCC
Confidence            999999999999999 9999  7999999999999996


No 3  
>KOG1219|consensus
Probab=100.00  E-value=3.2e-34  Score=295.17  Aligned_cols=269  Identities=22%  Similarity=0.310  Sum_probs=229.8

Q ss_pred             cceeEeeccccccccceEEEEEEe--cccCceeeeecccccccCCCCcccEEEeCCccEEEEcccCCCCCCCceEEEEEE
Q psy8640          11 PMQCLIMRRGVVHRENIQVYLVWY--GSTSRLCRRTCGTIPLSQQQRLAIQAVDPLTGVLSVQQPLDYDTGSSEHLLILK   88 (303)
Q Consensus        11 p~~~~i~e~~~~~~~~~~~~~i~~--~~~~~~~~~~~~~~~~~~~~~~~~F~Id~~tG~l~~~~~LD~E~~~~~~~l~i~   88 (303)
                      |-...++|+.++|+..+.-.+-.-  |.+|.+.|      -+.+++....|.|+.++|.|+++++||+|..++ |.|.++
T Consensus       644 P~iI~v~Edvpigt~la~L~atD~Dtgfng~l~y------vI~dgne~~~~~Id~qsg~itvas~ld~~~t~~-yiLnvt  716 (4289)
T KOG1219|consen  644 PFIIVVPEDVPIGTTLAILSATDSDTGFNGKLVY------VIEDGNESICFLIDRQSGNITVASPLDNENTEQ-YILNVT  716 (4289)
T ss_pred             CceEEccccCCCCceEEEEeccCCCCCcCceEEE------EEeCCccceEEEEecccceEEEecchhhhhhhe-eEEEEE
Confidence            445678888888888665444322  66677777      577888899999999999999999999999999 999999


Q ss_pred             EEECCCCCCCCCCeeEEEEEEEEEcCCCCCCcCCCCceEEEEeCCCCCCeEEEEEEEEeCCCCCCceEEEEEeeCCCCCC
Q psy8640          89 VEDGGKPPLDSPLSSITHLKITLQDENDNAPKFPITEYLEFVGENEPIGSSVFTARATDMDKGDYGKLNYSIISAAASGY  168 (303)
Q Consensus        89 a~D~g~~~~~~~~s~~~~v~I~V~DvNDn~P~f~~~~~~~~V~E~~~~gt~v~~v~A~D~D~g~n~~i~Y~i~~~~~~~~  168 (303)
                      |.|.|.    |..+++..+.|.|.|.|||+|.|.+..|.++|.|+..+|+.++.|.|.|.|.|.||+++|+|....    
T Consensus       717 a~D~gt----Pqkss~r~l~v~vkd~ndn~p~f~e~sy~vtvsedtepgs~Ia~vetnd~D~g~NG~v~fsL~n~s----  788 (4289)
T KOG1219|consen  717 AYDLGT----PQKSSWRLLLVFVKDYNDNTPIFVERSYHVTVSEDTEPGSFIAHVETNDTDGGNNGMVSFSLLNKS----  788 (4289)
T ss_pred             EecCCC----chhhceeeEEEEEEecccCCccccccceEEEEecCCCCCceEEEEEecccCCCCCceEEEEecCCc----
Confidence            999998    888999999999999999999999999999999999999999999999999999999999999865    


Q ss_pred             cccCCCcCcEEEECCCcEEEEcccCcccccceEEEEEEEEEE------------Eeec----------------------
Q psy8640         169 TDVDDSWKLFRVDALTGLVTTNAVFDYEARSRYAFTLLSTYS------------VWTR----------------------  214 (303)
Q Consensus       169 ~~~~~~~~~F~Id~~tG~I~~~~~LD~e~~~~~~l~v~a~~~------------~~~~----------------------  214 (303)
                             +.|+||+.||.+.+.++||+|.+..|+|.++|.+.            +...                      
T Consensus       789 -------dvfsIdp~tGivv~~~sLdrE~q~~y~l~I~a~dqp~pq~~svv~l~vsvedVndnpPkci~~hsr~kipedl  861 (4289)
T KOG1219|consen  789 -------DVFSIDPFTGIVVTSKSLDREGQTSYHLKIEARDQPPPQLFSVVELDVSVEDVNDNPPKCIIRHSRSKIPEDL  861 (4289)
T ss_pred             -------ceEEecCcccEEEeccccCcccCceeEEEEEEcCCCCCceEEEEEEEEEEeeccCCCCccccccccccCcccC
Confidence                   78999999999999999999999999999998610            0000                      


Q ss_pred             --------------------------------------------------------------------------------
Q psy8640         215 --------------------------------------------------------------------------------  214 (303)
Q Consensus       215 --------------------------------------------------------------------------------  214 (303)
                                                                                                      
T Consensus       862 p~gt~~~~l~A~d~diGq~~kvry~l~~~~v~~rvd~~sGavfi~~~LDf~k~~fynLsv~a~d~g~p~lss~chl~Vev  941 (4289)
T KOG1219|consen  862 PYGTVTWQLVALDPDIGQLGKVRYYLTDDTVGERVDFPSGAVFIGKPLDFEKSDFYNLSVTAVDRGTPILSSICHLEVEV  941 (4289)
T ss_pred             CCceEEEEhhhcCcccCcCceeEEEEecCccccccccccccEEEecccccccccceEEEEEEecCCCcceeeeEEEEEEE
Confidence                                                                                            


Q ss_pred             -------------------ccccCCCceEEE-------------------------------------eecCceeeeeee
Q psy8640         215 -------------------SLVWSRPMQCLI-------------------------------------MRRGVVFDYEAR  238 (303)
Q Consensus       215 -------------------~~~~~~~~~~~i-------------------------------------~~~~~~lD~E~~  238 (303)
                                         ++.+|.|....+                                     ++....||||..
T Consensus       942 ldv~enlhpp~F~~~v~e~~V~EnapiGT~vi~i~A~dedsgldg~l~Y~I~~gdg~g~FsId~~tG~irTl~~lDrE~k 1021 (4289)
T KOG1219|consen  942 LDVNENLHPPEFISFVTEGHVLENAPIGTIVIRIQARDEDSGLDGELSYKIRTGDGDGIFSIDSTTGSIRTLKALDREKK 1021 (4289)
T ss_pred             eccCCCCCCcchheeeeeeeEeecCCcceEEEEEEEecCCCCccceEEEEEEcCCcceeEEecCCcceEeechhhchhhc
Confidence                               011122222211                                     111246999999


Q ss_pred             cEEEEEEEEEECCCC--eEEEEEEEEEeecCCCCCeeCCCeeEEE-ecCCCCCCCcEEEEEEEEeCCC
Q psy8640         239 SRYAFTLLSTDSGGR--TCKVKVRVEIESRDEFHPQFTERTFKFI-LSGTDLPVGYVVGHLRLVRFRS  303 (303)
Q Consensus       239 ~~y~l~V~a~D~g~~--s~~~~v~I~V~DvNdn~P~F~~~~y~~~-~e~~~~p~gt~v~~v~A~D~D~  303 (303)
                      ..|-|+|.|+|.|..  ++-..+.|.|+|+|||+|+|.++.|... .|++  |.+..|.++.|.|+||
T Consensus      1022 s~YwltveA~D~gt~~~ssv~~vyI~ieDvNDn~Pq~s~pvy~asI~enS--p~~vsivq~ea~D~Ds 1087 (4289)
T KOG1219|consen 1022 SSYWLTVEAKDLGTVPLSSVCEVYIEIEDVNDNVPQFSSPVYYASISENS--PETVSIVQAEANDPDS 1087 (4289)
T ss_pred             ceEEEEEEEEecCCCccccceeEEEEEEecCCCCcccCCceEeeeeccCC--CCceEEEEeccCCCCc
Confidence            999999999999983  7788999999999999999999999999 9999  8999999999999995


No 4  
>KOG1219|consensus
Probab=100.00  E-value=1.2e-32  Score=283.63  Aligned_cols=266  Identities=23%  Similarity=0.270  Sum_probs=221.6

Q ss_pred             eEeeccccccccceEEEEEEecccCceeeeecccccccCCCCcccEEEeCCccEEEEcccCCCCCCCceEEEEEEEEECC
Q psy8640          14 CLIMRRGVVHRENIQVYLVWYGSTSRLCRRTCGTIPLSQQQRLAIQAVDPLTGVLSVQQPLDYDTGSSEHLLILKVEDGG   93 (303)
Q Consensus        14 ~~i~e~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~F~Id~~tG~l~~~~~LD~E~~~~~~~l~i~a~D~g   93 (303)
                      +++||.-++|.+..+..+..-..  ..-|++.-- -.-.+++...|.||++||.|++.++||+|.... |.+.|.|.+++
T Consensus      2583 fsvpEDv~vG~~Ig~v~a~~a~~--~~i~~~v~~-gt~Esn~d~~Fsvdr~TG~i~v~ksLD~E~kk~-yqi~v~a~~~~ 2658 (4289)
T KOG1219|consen 2583 FSVPEDVPVGEEIGQVSASDADE--HVIYSLVLG-GTPESNPDLPFSVDRNTGMIKVNKSLDHEKKKS-YQIKVKATCGQ 2658 (4289)
T ss_pred             EeccccCCCCCeeeEEeecccCC--ceEEEEEeC-CCCCCCCCCceEEcCCCceEEeccccchhhhce-EEEEEEeecCC
Confidence            78999999999998888765322  223322100 011334556799999999999999999999999 99999999876


Q ss_pred             CCCCCCCCeeEEEEEEEEEcCCCCCCcCCCCceEEEEeCCCCCCeEEEEEEEEeCCCCCCceEEEEEeeCCCCCCcccCC
Q psy8640          94 KPPLDSPLSSITHLKITLQDENDNAPKFPITEYLEFVGENEPIGSSVFTARATDMDKGDYGKLNYSIISAAASGYTDVDD  173 (303)
Q Consensus        94 ~~~~~~~~s~~~~v~I~V~DvNDn~P~f~~~~~~~~V~E~~~~gt~v~~v~A~D~D~g~n~~i~Y~i~~~~~~~~~~~~~  173 (303)
                      .      .-+.++|.|.|.|+|||+|+|....|.+.+.||+|.|+.|++++|.|+|.|.||+|+|+|....         
T Consensus      2659 ~------vva~tsv~vqVkDvNDNaPvFe~d~y~f~i~En~pvGtsV~qf~AsD~Ds~~nGqirysl~~~v--------- 2723 (4289)
T KOG1219|consen 2659 W------VVAETSVFVQVKDVNDNAPVFEKDPYLFIIEENSPVGTSVIQFHASDMDSGNNGQIRYSLTSPV--------- 2723 (4289)
T ss_pred             c------eEEEEEEEEEeecccCCCccccCCceeEEEeccCCCCceEEEEEeeccCCCCCceEEEEEcCCc---------
Confidence            4      5688899999999999999999999999999999999999999999999999999999999975         


Q ss_pred             CcCcEEEECCCcEEEEcccCcccccceEEEEEEEEE----------EEeeccc---------------------------
Q psy8640         174 SWKLFRVDALTGLVTTNAVFDYEARSRYAFTLLSTY----------SVWTRSL---------------------------  216 (303)
Q Consensus       174 ~~~~F~Id~~tG~I~~~~~LD~e~~~~~~l~v~a~~----------~~~~~~~---------------------------  216 (303)
                        .+|.|+++||+|++.+.||+|++..|.|.|.|.+          -+.....                           
T Consensus      2724 --~yF~In~etGwlTt~~eld~ek~d~y~lkv~AtDhG~~ssq~~v~v~vtDvndspprf~~eiy~gtvv~d~p~~~~ia 2801 (4289)
T KOG1219|consen 2724 --PYFAINPETGWLTTLFELDLEKQDLYSLKVVATDHGVPSSQATVLVHVTDVNDSPPRFQREIYEGTVVEDVPGGKIIA 2801 (4289)
T ss_pred             --ceEEEcCCCCeeeehhhhccccCCceEEEEEEecCCcccccceEEEEEEecCCCcchhhhHhhccceeccCCCCceee
Confidence              5899999999999999999999999999999972          1111111                           


Q ss_pred             ---------ccCCCceEEEe-----------ec------CceeeeeeecEEEEEEEEEECCCCeEEEEEEEEEeecCCCC
Q psy8640         217 ---------VWSRPMQCLIM-----------RR------GVVFDYEARSRYAFTLLSTDSGGRTCKVKVRVEIESRDEFH  270 (303)
Q Consensus       217 ---------~~~~~~~~~i~-----------~~------~~~lD~E~~~~y~l~V~a~D~g~~s~~~~v~I~V~DvNdn~  270 (303)
                               .+.+-+++.|.           .+      -+++|||...+|.|+|.|+| |-..++..|.|.|+|.|||.
T Consensus      2802 ~~si~d~D~s~~nq~t~fI~~gd~~gqF~~i~ne~~~~~kKt~~~E~t~ny~Ltvtatd-g~f~~s~~vkv~v~~s~dn~ 2880 (4289)
T KOG1219|consen 2802 GLSIFDADVSEVNQVTGFITLGDPLGQFWIIENEWIYEFKKTLDRESTKNYLLTVTATD-GIFMNSDNVKVLVLDSNDNS 2880 (4289)
T ss_pred             eeEeccccccccceeEEEEeCCCccceEEEEcCcceEEEecchhhhcccceEEEEEEec-cceeccceEEEEeeccccCC
Confidence                     11121222221           11      15799999999999999999 88888999999999999999


Q ss_pred             CeeCCCeeEEE-ecCCCCCCCcEEEEEEEEeCCC
Q psy8640         271 PQFTERTFKFI-LSGTDLPVGYVVGHLRLVRFRS  303 (303)
Q Consensus       271 P~F~~~~y~~~-~e~~~~p~gt~v~~v~A~D~D~  303 (303)
                      |...+..|+.. .|+.-  .|..++.+.|+|.|.
T Consensus      2881 ~~c~~~~~t~i~~ed~f--~gk~ilkisal~~dn 2912 (4289)
T KOG1219|consen 2881 PFCGNQLYTKIQCEDVF--PGKQILKISALDVDN 2912 (4289)
T ss_pred             ccCcchhccceeccccC--CceeEEEeeeeccCc
Confidence            99999999999 99996  999999999999873


No 5  
>cd00031 CA Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here
Probab=99.97  E-value=5e-28  Score=208.15  Aligned_cols=194  Identities=32%  Similarity=0.426  Sum_probs=165.9

Q ss_pred             ceeEeeccccccccceEEEEEEecc--cCceeeeecccccccCCCCcccEEEeCCccEEEEcccCCCCCCCceEEEEEEE
Q psy8640          12 MQCLIMRRGVVHRENIQVYLVWYGS--TSRLCRRTCGTIPLSQQQRLAIQAVDPLTGVLSVQQPLDYDTGSSEHLLILKV   89 (303)
Q Consensus        12 ~~~~i~e~~~~~~~~~~~~~i~~~~--~~~~~~~~~~~~~~~~~~~~~~F~Id~~tG~l~~~~~LD~E~~~~~~~l~i~a   89 (303)
                      ..+.|.||...+.......+..-..  ++.+.|..      .......+|.|++.+|.|++.+.||||.... |.|.|+|
T Consensus         2 ~~~~i~En~~~g~~v~~~~a~D~D~~~~~~~~y~i------~~~~~~~~F~i~~~tG~l~~~~~lD~e~~~~-~~l~v~a   74 (199)
T cd00031           2 YSVSVPENAPPGTVVGTVSATDPDSGENGRVTYSI------LGGNEDGLFSIDPNTGVITTTKPLDREEQSE-YTLTVVA   74 (199)
T ss_pred             eEEEEeCCCCCCCEEEEEEEECCCCCCCceEEEEE------eCCCCcccEEEeCCCCEEEECCCCCCcCCce-EEEEEEE
Confidence            4578899999877766555444322  24455533      3333337999999999999999999999999 9999999


Q ss_pred             EECCCCCCCCCCeeEEEEEEEEEcCCCCCCcCCCCceEEEEeCCCCCCeEEEEEEEEeCCCCCCceEEEEEeeCCCCCCc
Q psy8640          90 EDGGKPPLDSPLSSITHLKITLQDENDNAPKFPITEYLEFVGENEPIGSSVFTARATDMDKGDYGKLNYSIISAAASGYT  169 (303)
Q Consensus        90 ~D~g~~~~~~~~s~~~~v~I~V~DvNDn~P~f~~~~~~~~V~E~~~~gt~v~~v~A~D~D~g~n~~i~Y~i~~~~~~~~~  169 (303)
                      .|.|.    +.++....+.|.|.|+|||+|.|....|.+.|.|+.++|+.++++.|+|+|.+.++.++|+|..+..    
T Consensus        75 ~D~g~----~~~~~~~~v~I~V~d~Nd~~P~~~~~~~~~~v~e~~~~~~~i~~~~a~D~D~~~~~~~~y~l~~~~~----  146 (199)
T cd00031          75 SDGGG----PPLSSTATVTVTVLDVNDNPPVFEQSSYEASVPENAPPGTVVGTVTATDADSGENAKLTYSILSGND----  146 (199)
T ss_pred             EECCc----CcceeEEEEEEEEccCCCCCCcccccceEEEEeCCCCCCCEEEEEEEEcCCCCCCccEEEEEeCCCC----
Confidence            99876    5667999999999999999999998899999999999999999999999999989999999999763    


Q ss_pred             ccCCCcCcEEEECCCcEEEEcccCcccccceEEEEEEEEEEEeecccccCCCceEEEeecCceeeeeeecEEEEEEEEEE
Q psy8640         170 DVDDSWKLFRVDALTGLVTTNAVFDYEARSRYAFTLLSTYSVWTRSLVWSRPMQCLIMRRGVVFDYEARSRYAFTLLSTD  249 (303)
Q Consensus       170 ~~~~~~~~F~Id~~tG~I~~~~~LD~e~~~~~~l~v~a~~~~~~~~~~~~~~~~~~i~~~~~~lD~E~~~~y~l~V~a~D  249 (303)
                           .+.|.|++.+|.|++.+.||||....                                        |.|.|.|+|
T Consensus       147 -----~~~f~i~~~~G~i~~~~~ld~e~~~~----------------------------------------~~l~v~a~D  181 (199)
T cd00031         147 -----KELFSIDPNTGIITLAKPLDREEKSS----------------------------------------YELTVVATD  181 (199)
T ss_pred             -----CCEEEEeCCceEEEeCCccCCccCce----------------------------------------EEEEEEEEE
Confidence                 26899999999999999999999888                                        999999999


Q ss_pred             CC--CCeEEEEEEEEEee
Q psy8640         250 SG--GRTCKVKVRVEIES  265 (303)
Q Consensus       250 ~g--~~s~~~~v~I~V~D  265 (303)
                      .+  ..++++.++|.|.|
T Consensus       182 ~~~~~~~~~~~i~i~v~d  199 (199)
T cd00031         182 GGGPPLSSTATVTVTVLD  199 (199)
T ss_pred             CCCCCceeEEEEEEEEEC
Confidence            88  56888899998875


No 6  
>cd00031 CA Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here
Probab=99.88  E-value=2.9e-21  Score=166.02  Aligned_cols=128  Identities=33%  Similarity=0.523  Sum_probs=118.6

Q ss_pred             ceEEEEeCCCCCCeEEEEEEEEeCCCCCCceEEEEEeeCCCCCCcccCCCcCcEEEECCCcEEEEcccCcccccceEEEE
Q psy8640         125 EYLEFVGENEPIGSSVFTARATDMDKGDYGKLNYSIISAAASGYTDVDDSWKLFRVDALTGLVTTNAVFDYEARSRYAFT  204 (303)
Q Consensus       125 ~~~~~V~E~~~~gt~v~~v~A~D~D~g~n~~i~Y~i~~~~~~~~~~~~~~~~~F~Id~~tG~I~~~~~LD~e~~~~~~l~  204 (303)
                      .|.+.|.|+.+.|+.++++.|.|+|.+.++.+.|+|..+...         .+|.|++.+|.|++.+.||||....    
T Consensus         1 ~~~~~i~En~~~g~~v~~~~a~D~D~~~~~~~~y~i~~~~~~---------~~F~i~~~tG~l~~~~~lD~e~~~~----   67 (199)
T cd00031           1 SYSVSVPENAPPGTVVGTVSATDPDSGENGRVTYSILGGNED---------GLFSIDPNTGVITTTKPLDREEQSE----   67 (199)
T ss_pred             CeEEEEeCCCCCCCEEEEEEEECCCCCCCceEEEEEeCCCCc---------ccEEEeCCCCEEEECCCCCCcCCce----
Confidence            378999999999999999999999999889999999997632         6899999999999999999999988    


Q ss_pred             EEEEEEEeecccccCCCceEEEeecCceeeeeeecEEEEEEEEEECCCCe--EEEEEEEEEeecCCCCCeeCCCeeEEE-
Q psy8640         205 LLSTYSVWTRSLVWSRPMQCLIMRRGVVFDYEARSRYAFTLLSTDSGGRT--CKVKVRVEIESRDEFHPQFTERTFKFI-  281 (303)
Q Consensus       205 v~a~~~~~~~~~~~~~~~~~~i~~~~~~lD~E~~~~y~l~V~a~D~g~~s--~~~~v~I~V~DvNdn~P~F~~~~y~~~-  281 (303)
                                                          |.|.|.|+|.|.+.  +...++|.|.|+|||+|.|....|.+. 
T Consensus        68 ------------------------------------~~l~v~a~D~g~~~~~~~~~v~I~V~d~Nd~~P~~~~~~~~~~v  111 (199)
T cd00031          68 ------------------------------------YTLTVVASDGGGPPLSSTATVTVTVLDVNDNPPVFEQSSYEASV  111 (199)
T ss_pred             ------------------------------------EEEEEEEEECCcCcceeEEEEEEEEccCCCCCCcccccceEEEE
Confidence                                                99999999987765  899999999999999999999999999 


Q ss_pred             ecCCCCCCCcEEEEEEEEeCCC
Q psy8640         282 LSGTDLPVGYVVGHLRLVRFRS  303 (303)
Q Consensus       282 ~e~~~~p~gt~v~~v~A~D~D~  303 (303)
                      .|+.  |.|+.++++.|+|+|.
T Consensus       112 ~e~~--~~~~~i~~~~a~D~D~  131 (199)
T cd00031         112 PENA--PPGTVVGTVTATDADS  131 (199)
T ss_pred             eCCC--CCCCEEEEEEEEcCCC
Confidence            9998  5999999999999994


No 7  
>PF00028 Cadherin:  Cadherin domain;  InterPro: IPR002126 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion.; GO: 0005509 calcium ion binding, 0007156 homophilic cell adhesion, 0016020 membrane; PDB: 2A4E_A 2A4C_B 2O72_A 2QVI_A 1NCJ_A 3Q2W_A 3Q2N_A 3LNH_B 3LNI_A 3Q2L_A ....
Probab=99.75  E-value=5.4e-17  Score=123.14  Aligned_cols=90  Identities=33%  Similarity=0.488  Sum_probs=82.6

Q ss_pred             eEEEEeCCCCCCeEEEEEEEEeCCCCCCceEEEEEeeCCCCCCcccCCCcCcEEEECCCcEEEEcccCcccccceEEEEE
Q psy8640         126 YLEFVGENEPIGSSVFTARATDMDKGDYGKLNYSIISAAASGYTDVDDSWKLFRVDALTGLVTTNAVFDYEARSRYAFTL  205 (303)
Q Consensus       126 ~~~~V~E~~~~gt~v~~v~A~D~D~g~n~~i~Y~i~~~~~~~~~~~~~~~~~F~Id~~tG~I~~~~~LD~e~~~~~~l~v  205 (303)
                      |++.|+|++++|+.++++.|.|+|.+.|+.+.|+|..+.+.         +.|.|++.+|.|++.++||||....     
T Consensus         1 Y~~~v~E~~~~g~~v~~v~a~D~D~~~n~~i~y~i~~~~~~---------~~F~I~~~tg~i~~~~~LD~E~~~~-----   66 (93)
T PF00028_consen    1 YSFSVPENAPPGTVVGQVTATDPDSGPNSQITYSILGGNPD---------GLFSIDPNTGEISLKKPLDRETQSS-----   66 (93)
T ss_dssp             EEEEEETTGSTSSEEEEEEEEESSTSTTSSEEEEEEETTST---------TSEEEETTTTEEEESSSSCTTTTSE-----
T ss_pred             CEEEEECCCCCCCEEEEEEEEeCCCCCCceEEEEEecCccc---------CceEEeeeeeccccceecCcccCCE-----
Confidence            78999999999999999999999999999999999998753         8899999999999999999999999     


Q ss_pred             EEEEEEeecccccCCCceEEEeecCceeeeeeecEEEEEEEEEEC-CCC--eEEEEEEEEEe
Q psy8640         206 LSTYSVWTRSLVWSRPMQCLIMRRGVVFDYEARSRYAFTLLSTDS-GGR--TCKVKVRVEIE  264 (303)
Q Consensus       206 ~a~~~~~~~~~~~~~~~~~~i~~~~~~lD~E~~~~y~l~V~a~D~-g~~--s~~~~v~I~V~  264 (303)
                                                         |.|.|.|+|. |.+  +++++|+|+|+
T Consensus        67 -----------------------------------y~l~v~a~D~~~~~~~~~~~~V~I~V~   93 (93)
T PF00028_consen   67 -----------------------------------YQLTVRATDSGGSPPLSSTATVTINVL   93 (93)
T ss_dssp             -----------------------------------EEEEEEEEETTTSSEEEEEEEEEEEEE
T ss_pred             -----------------------------------EEEEEEEEECCCCCCCEEEEEEEEEEC
Confidence                                               9999999999 554  67778888774


No 8  
>KOG1834|consensus
Probab=99.71  E-value=4.5e-16  Score=148.95  Aligned_cols=151  Identities=25%  Similarity=0.420  Sum_probs=129.9

Q ss_pred             EEEeCCcc--EEEEcccCCCCCCCceEEEEEEEEECCCCCCCC-----CCeeEEEEEEEEEcCCCCCCcCCCCceEEEEe
Q psy8640          59 QAVDPLTG--VLSVQQPLDYDTGSSEHLLILKVEDGGKPPLDS-----PLSSITHLKITLQDENDNAPKFPITEYLEFVG  131 (303)
Q Consensus        59 F~Id~~tG--~l~~~~~LD~E~~~~~~~l~i~a~D~g~~~~~~-----~~s~~~~v~I~V~DvNDn~P~f~~~~~~~~V~  131 (303)
                      -.+|+.||  .|+.+.+||-|.++. |+|+|+|.|.|.   +|     ..+-.+++.|.|.|+|+++|.|....|.+.|.
T Consensus        84 vVvdK~TGegvlRaK~~lDCelqke-ytf~iQAydCg~---gpdgtn~kKShkatvhIrVkDvNe~AP~f~ep~Yka~V~  159 (952)
T KOG1834|consen   84 VVVDKYTGEGVLRAKEPLDCELQKE-YTFTIQAYDCGN---GPDGTNTKKSHKATVHIRVKDVNEFAPVFKEPWYKAHVT  159 (952)
T ss_pred             EEEeccCCceEEeecCccccccccc-ceEEEEEEecCC---CCCccccccccceEEEEEeccccccCchhcccceeeEEe
Confidence            34588875  688999999999999 999999999986   23     56778899999999999999999999999999


Q ss_pred             CCCCCCeEEEEEEEEeCCCC-CCceE-EEEEeeCCCCCCcccCCCcCcEEEECCCcEEEEcccCcccccceEEEEEEEEE
Q psy8640         132 ENEPIGSSVFTARATDMDKG-DYGKL-NYSIISAAASGYTDVDDSWKLFRVDALTGLVTTNAVFDYEARSRYAFTLLSTY  209 (303)
Q Consensus       132 E~~~~gt~v~~v~A~D~D~g-~n~~i-~Y~i~~~~~~~~~~~~~~~~~F~Id~~tG~I~~~~~LD~e~~~~~~l~v~a~~  209 (303)
                      |. +.-..|++|.|.|.|.+ .+++| .|.|...+           -.|.||. .|.|+...+|+|.....         
T Consensus       160 EG-K~yd~il~veAiD~DCspq~sqIC~YEI~t~d-----------~PFaIdn-~G~irnTekLny~ke~~---------  217 (952)
T KOG1834|consen  160 EG-KVYDSILRVEAIDKDCSPQYSQICEYEITTPD-----------VPFAIDN-DGNIRNTEKLNYTKEHQ---------  217 (952)
T ss_pred             cc-eeeeeeEEEEeecCCCCCcccceeEEEecCCC-----------CceEEcC-CCcccccccccccccee---------
Confidence            98 56788999999999987 47777 88998866           6799995 69999999999999988         


Q ss_pred             EEeecccccCCCceEEEeecCceeeeeeecEEEEEEEEEECCCC--eEEEEEEEEEeec
Q psy8640         210 SVWTRSLVWSRPMQCLIMRRGVVFDYEARSRYAFTLLSTDSGGR--TCKVKVRVEIESR  266 (303)
Q Consensus       210 ~~~~~~~~~~~~~~~~i~~~~~~lD~E~~~~y~l~V~a~D~g~~--s~~~~v~I~V~Dv  266 (303)
                                                     |.|+|.|.|+|..  .+.+.|+|.|...
T Consensus       218 -------------------------------Y~ltVtAyDCg~kraa~d~lV~v~Vkp~  245 (952)
T KOG1834|consen  218 -------------------------------YKLTVTAYDCGKKRAASDSLVTVHVKPT  245 (952)
T ss_pred             -------------------------------EEEEEEEEecccccccCcceEEEEecCc
Confidence                                           9999999999985  2336777777543


No 9  
>smart00112 CA Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
Probab=99.64  E-value=1.9e-15  Score=111.05  Aligned_cols=77  Identities=36%  Similarity=0.593  Sum_probs=70.2

Q ss_pred             EeCCCCCCceEEEEEeeCCCCCCcccCCCcCcEEEECCCcEEEEcccCcccccceEEEEEEEEEEEeecccccCCCceEE
Q psy8640         146 TDMDKGDYGKLNYSIISAAASGYTDVDDSWKLFRVDALTGLVTTNAVFDYEARSRYAFTLLSTYSVWTRSLVWSRPMQCL  225 (303)
Q Consensus       146 ~D~D~g~n~~i~Y~i~~~~~~~~~~~~~~~~~F~Id~~tG~I~~~~~LD~e~~~~~~l~v~a~~~~~~~~~~~~~~~~~~  225 (303)
                      +|+|.|.|+.++|+|..+...         .+|.|++.+|.|++.++||||....                         
T Consensus         1 ~D~D~g~n~~i~Y~i~~~~~~---------~~F~i~~~tg~i~~~~~LD~e~~~~-------------------------   46 (79)
T smart00112        1 TDADSGENGKVTYSILSGNED---------GLFSIDPETGEITTTKPLDREEQPE-------------------------   46 (79)
T ss_pred             CCCCCCcCcEEEEEEecCCCC---------CEEEEeCCccEEEeCCccCeeCCCe-------------------------
Confidence            488999999999999987632         7899999999999999999999888                         


Q ss_pred             EeecCceeeeeeecEEEEEEEEEECCC--CeEEEEEEEEEeecCCCCC
Q psy8640         226 IMRRGVVFDYEARSRYAFTLLSTDSGG--RTCKVKVRVEIESRDEFHP  271 (303)
Q Consensus       226 i~~~~~~lD~E~~~~y~l~V~a~D~g~--~s~~~~v~I~V~DvNdn~P  271 (303)
                                     |.|.|.|+|.|.  +++.+.|+|.|.|+|||+|
T Consensus        47 ---------------y~l~v~a~D~~~~~~~~~~~v~I~V~D~Nd~~P   79 (79)
T smart00112       47 ---------------YTLTVEATDGGGPPLSSTATVTVTVLDVNDNAP   79 (79)
T ss_pred             ---------------EEEEEEEEECCCCCcccEEEEEEEEEECCCCCC
Confidence                           999999999987  5889999999999999998


No 10 
>smart00112 CA Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
Probab=99.49  E-value=2.4e-13  Score=99.81  Aligned_cols=58  Identities=38%  Similarity=0.648  Sum_probs=55.7

Q ss_pred             ccEEEeCCccEEEEcccCCCCCCCceEEEEEEEEECCCCCCCCCCeeEEEEEEEEEcCCCCCC
Q psy8640          57 AIQAVDPLTGVLSVQQPLDYDTGSSEHLLILKVEDGGKPPLDSPLSSITHLKITLQDENDNAP  119 (303)
Q Consensus        57 ~~F~Id~~tG~l~~~~~LD~E~~~~~~~l~i~a~D~g~~~~~~~~s~~~~v~I~V~DvNDn~P  119 (303)
                      .+|.|++.+|.|++.++||||..+. |.|.|.|.|.|.    +++++.+.|.|.|.|+|||+|
T Consensus        22 ~~F~i~~~tg~i~~~~~LD~e~~~~-y~l~v~a~D~~~----~~~~~~~~v~I~V~D~Nd~~P   79 (79)
T smart00112       22 GLFSIDPETGEITTTKPLDREEQPE-YTLTVEATDGGG----PPLSSTATVTVTVLDVNDNAP   79 (79)
T ss_pred             CEEEEeCCccEEEeCCccCeeCCCe-EEEEEEEEECCC----CCcccEEEEEEEEEECCCCCC
Confidence            8999999999999999999999999 999999999987    688999999999999999998


No 11 
>PF00028 Cadherin:  Cadherin domain;  InterPro: IPR002126 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion.; GO: 0005509 calcium ion binding, 0007156 homophilic cell adhesion, 0016020 membrane; PDB: 2A4E_A 2A4C_B 2O72_A 2QVI_A 1NCJ_A 3Q2W_A 3Q2N_A 3LNH_B 3LNI_A 3Q2L_A ....
Probab=99.29  E-value=5.3e-11  Score=90.02  Aligned_cols=89  Identities=26%  Similarity=0.297  Sum_probs=76.9

Q ss_pred             eeEeeccccccccceEEEEEEe--cccCceeeeecccccccCCCCcccEEEeCCccEEEEcccCCCCCCCceEEEEEEEE
Q psy8640          13 QCLIMRRGVVHRENIQVYLVWY--GSTSRLCRRTCGTIPLSQQQRLAIQAVDPLTGVLSVQQPLDYDTGSSEHLLILKVE   90 (303)
Q Consensus        13 ~~~i~e~~~~~~~~~~~~~i~~--~~~~~~~~~~~~~~~~~~~~~~~~F~Id~~tG~l~~~~~LD~E~~~~~~~l~i~a~   90 (303)
                      .+.|+||..+|+...+..+..-  +.++.+.|.      +..++...+|.|++.+|.|++.++||||.... |.|.|.|+
T Consensus         2 ~~~v~E~~~~g~~v~~v~a~D~D~~~n~~i~y~------i~~~~~~~~F~I~~~tg~i~~~~~LD~E~~~~-y~l~v~a~   74 (93)
T PF00028_consen    2 SFSVPENAPPGTVVGQVTATDPDSGPNSQITYS------ILGGNPDGLFSIDPNTGEISLKKPLDRETQSS-YQLTVRAT   74 (93)
T ss_dssp             EEEEETTGSTSSEEEEEEEEESSTSTTSSEEEE------EEETTSTTSEEEETTTTEEEESSSSCTTTTSE-EEEEEEEE
T ss_pred             EEEEECCCCCCCEEEEEEEEeCCCCCCceEEEE------EecCcccCceEEeeeeeccccceecCcccCCE-EEEEEEEE
Confidence            3679999999999888888766  467778884      45555579999999999999999999999999 99999999


Q ss_pred             EC-CCCCCCCCCeeEEEEEEEEE
Q psy8640          91 DG-GKPPLDSPLSSITHLKITLQ  112 (303)
Q Consensus        91 D~-g~~~~~~~~s~~~~v~I~V~  112 (303)
                      |. |.    +++++.++|.|.|.
T Consensus        75 D~~~~----~~~~~~~~V~I~V~   93 (93)
T PF00028_consen   75 DSGGS----PPLSSTATVTINVL   93 (93)
T ss_dssp             ETTTS----SEEEEEEEEEEEEE
T ss_pred             ECCCC----CCCEEEEEEEEEEC
Confidence            99 66    88999999999884


No 12 
>KOG1834|consensus
Probab=99.18  E-value=2.5e-10  Score=110.15  Aligned_cols=142  Identities=23%  Similarity=0.305  Sum_probs=108.4

Q ss_pred             EEEEcCCCCCCcCCCCceEEEEeCCCCCCeEEEEEEEEeCCCCC--CceE-EEEEeeCCCCCCcccCCCcCcEEEECCCc
Q psy8640         109 ITLQDENDNAPKFPITEYLEFVGENEPIGSSVFTARATDMDKGD--YGKL-NYSIISAAASGYTDVDDSWKLFRVDALTG  185 (303)
Q Consensus       109 I~V~DvNDn~P~f~~~~~~~~V~E~~~~gt~v~~v~A~D~D~g~--n~~i-~Y~i~~~~~~~~~~~~~~~~~F~Id~~tG  185 (303)
                      ....-+|-+.|.. ...|..-|.||-..=...--+.|.|.|.+-  .|.| .|.|.+..-.        .+..-+|..||
T Consensus        21 ~~aarankhkpwi-e~ey~gvV~Endntvll~Ppl~aLdkdaplr~ageiC~fklhgq~vP--------FdavVvdK~TG   91 (952)
T KOG1834|consen   21 HHAARANKHKPWI-EEEYHGVVTENDNTVLLDPPLAALDKDAPLRYAGEICGFKLHGQPVP--------FDAVVVDKYTG   91 (952)
T ss_pred             cccccccccCccc-ccceeEEEEeCCceEEeCCCeeeecCCCCcccccccceeEecCCCCC--------ceEEEEeccCC
Confidence            3445577777765 455999999985322223457788888752  3444 7787775421        02333488775


Q ss_pred             --EEEEcccCcccccceEEEEEEEEEEEeecccccCCCceEEEeecCceeeeeeecEEEEEEEEEECCC-C-------eE
Q psy8640         186 --LVTTNAVFDYEARSRYAFTLLSTYSVWTRSLVWSRPMQCLIMRRGVVFDYEARSRYAFTLLSTDSGG-R-------TC  255 (303)
Q Consensus       186 --~I~~~~~LD~e~~~~~~l~v~a~~~~~~~~~~~~~~~~~~i~~~~~~lD~E~~~~y~l~V~a~D~g~-~-------s~  255 (303)
                        .|+.+.+||.|.++.                                        |+|+|.|.|+|. +       +-
T Consensus        92 egvlRaK~~lDCelqke----------------------------------------ytf~iQAydCg~gpdgtn~kKSh  131 (952)
T KOG1834|consen   92 EGVLRAKEPLDCELQKE----------------------------------------YTFTIQAYDCGNGPDGTNTKKSH  131 (952)
T ss_pred             ceEEeecCccccccccc----------------------------------------ceEEEEEEecCCCCCcccccccc
Confidence              577788999998888                                        999999999875 2       66


Q ss_pred             EEEEEEEEeecCCCCCeeCCCeeEEE-ecCCCCCCCcEEEEEEEEeCC
Q psy8640         256 KVKVRVEIESRDEFHPQFTERTFKFI-LSGTDLPVGYVVGHLRLVRFR  302 (303)
Q Consensus       256 ~~~v~I~V~DvNdn~P~F~~~~y~~~-~e~~~~p~gt~v~~v~A~D~D  302 (303)
                      .++|+|.|.|+|+.+|+|..+.|++. .|..   .-..|++|.|.|.|
T Consensus       132 katvhIrVkDvNe~AP~f~ep~Yka~V~EGK---~yd~il~veAiD~D  176 (952)
T KOG1834|consen  132 KATVHIRVKDVNEFAPVFKEPWYKAHVTEGK---VYDSILRVEAIDKD  176 (952)
T ss_pred             ceEEEEEeccccccCchhcccceeeEEecce---eeeeeEEEEeecCC
Confidence            89999999999999999999999999 8887   77899999999998


No 13 
>PF08758 Cadherin_pro:  Cadherin prodomain like;  InterPro: IPR014868 Cadherins are a group of proteins that mediate calcium dependent cell-cell adhesion. They are activated through cleavage of a prosequence in the late Golgi. This protein corresponds to the folded region of the prosequence, and is termed the prodomain. The prodomain shows structural resemblance to the cadherin domain, but lacks all the features known to be important for cadherin-cadherin interactions []. ; GO: 0007155 cell adhesion, 0016021 integral to membrane; PDB: 1OP4_A.
Probab=97.63  E-value=0.00067  Score=51.00  Aligned_cols=66  Identities=18%  Similarity=0.170  Sum_probs=32.7

Q ss_pred             CCcCCCCceEEEEeCCCCCCeEEEEEEEEeCCCCCCceEEEEEeeCCCCCCcccCCCcCcEEEECCCcEEEEcccCcccc
Q psy8640         118 APKFPITEYLEFVGENEPIGSSVFTARATDMDKGDYGKLNYSIISAAASGYTDVDDSWKLFRVDALTGLVTTNAVFDYEA  197 (303)
Q Consensus       118 ~P~f~~~~~~~~V~E~~~~gt~v~~v~A~D~D~g~n~~i~Y~i~~~~~~~~~~~~~~~~~F~Id~~tG~I~~~~~LD~e~  197 (303)
                      -|-|.+..|.+.|+.+...|..+++|.-.|...  +..+.|.-..             ..|.|.+ .|.|++++++....
T Consensus         3 ~pGF~~~~~~~~Vp~~l~~g~~lg~V~f~dC~~--~~~~~~~ssD-------------pdF~V~~-DGsVy~~r~v~l~~   66 (90)
T PF08758_consen    3 RPGFSQKKYTFEVPSNLEAGQPLGKVNFEDCTG--RRRVIFESSD-------------PDFRVLE-DGSVYAKRPVQLSS   66 (90)
T ss_dssp             --B--S-EEEE----SS-SS--EEE---B--SS-----EEEE----------------SEEEEET-TTEEEEES--S-SS
T ss_pred             cCCcccceEEEEcCchhhCCcEEEEEEeccCCC--CCceEEecCC-------------CCEEEcC-CCeEEEeeeEecCC
Confidence            588999999999999999999999999998743  4567876655             4599986 69999999988765


Q ss_pred             cc
Q psy8640         198 RS  199 (303)
Q Consensus       198 ~~  199 (303)
                      ..
T Consensus        67 ~~   68 (90)
T PF08758_consen   67 EQ   68 (90)
T ss_dssp             S-
T ss_pred             Cc
Confidence            44


No 14 
>PF08266 Cadherin_2:  Cadherin-like;  InterPro: IPR013164 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion. This entry represents a cadherin domain that is usually found at the N terminus of cadherin proteins.; PDB: 1WUZ_A 1WYJ_A.
Probab=97.32  E-value=0.00019  Score=53.30  Aligned_cols=63  Identities=27%  Similarity=0.305  Sum_probs=39.9

Q ss_pred             eEEEEeCCCCCCeEEEEEEEEeCCCCC--CceEEEEEeeCCCCCCcccCCCcCcEEEECCCcEEEEcccCccccc
Q psy8640         126 YLEFVGENEPIGSSVFTARATDMDKGD--YGKLNYSIISAAASGYTDVDDSWKLFRVDALTGLVTTNAVFDYEAR  198 (303)
Q Consensus       126 ~~~~V~E~~~~gt~v~~v~A~D~D~g~--n~~i~Y~i~~~~~~~~~~~~~~~~~F~Id~~tG~I~~~~~LD~e~~  198 (303)
                      ..++|+|..+.|+.||.+ |.|.-...  -....|++......         .+|.+++.+|.|+++..+|||.-
T Consensus         3 i~YsV~EE~~~Gt~IGni-a~dL~l~~~~l~~~~~ri~s~~~~---------~~~~v~~~tG~L~v~~rIDRE~L   67 (84)
T PF08266_consen    3 IRYSVPEEMPPGTVIGNI-AKDLGLDPQSLSSRNFRIVSEGNS---------QYFRVNEKTGDLFVSERIDREEL   67 (84)
T ss_dssp             EEEEEESS--TT-EEEEC-CCCCT--HHHHCCTTBEEE-SSSS----------SEEE-TTTSEEEESS--SCCCC
T ss_pred             eEEEeecCCCCCCEEEEh-HHhhCCCcccccccceEEeecCCc---------ceeEecCCceeEEeCCccCHHHH
Confidence            467899999999999998 44543221  11246777665432         89999999999999999999965


No 15 
>smart00736 CADG Dystroglycan-type cadherin-like domains. Cadherin-homologous domains present in metazoan dystroglycans and alpha/epsilon sarcoglycans, yeast Axl2p and in a very large protein from magnetotactic bacteria. Likely to bind calcium ions.
Probab=95.86  E-value=0.26  Score=37.23  Aligned_cols=73  Identities=19%  Similarity=0.256  Sum_probs=52.0

Q ss_pred             EEeCCCCCCceEEEEEeeCCCCCCcccCCCcCcEEEECCCcEEEEcccCcccccceEEEEEEEEEEEeecccccCCCceE
Q psy8640         145 ATDMDKGDYGKLNYSIISAAASGYTDVDDSWKLFRVDALTGLVTTNAVFDYEARSRYAFTLLSTYSVWTRSLVWSRPMQC  224 (303)
Q Consensus       145 A~D~D~g~n~~i~Y~i~~~~~~~~~~~~~~~~~F~Id~~tG~I~~~~~LD~e~~~~~~l~v~a~~~~~~~~~~~~~~~~~  224 (303)
                      ..|.|   ...+.|++...+...    .+  .....|+.++.+.-. +.... ...                        
T Consensus        24 F~d~d---~~~lty~~~~~~~~~----lP--~Wl~fd~~~~~~~Gt-P~~~~-~g~------------------------   68 (97)
T smart00736       24 FTDAD---GDTLTYSATLSDGSA----LP--SWLSFDSDTGTLSGT-PTNSD-VGS------------------------   68 (97)
T ss_pred             eECCC---CCeEEEEEEeCCCCC----CC--CeEEEeCCCCEEEEE-CCCCC-CcE------------------------
Confidence            46666   337888887543100    22  458889888887763 43322 244                        


Q ss_pred             EEeecCceeeeeeecEEEEEEEEEECCCCeEEEEEEEEEeecCC
Q psy8640         225 LIMRRGVVFDYEARSRYAFTLLSTDSGGRTCKVKVRVEIESRDE  268 (303)
Q Consensus       225 ~i~~~~~~lD~E~~~~y~l~V~a~D~g~~s~~~~v~I~V~DvNd  268 (303)
                                      |.|.|.|+|..+.+....++|.|.+.||
T Consensus        69 ----------------~~i~v~a~D~~g~~~~~~f~i~V~~~~~   96 (97)
T smart00736       69 ----------------LSLKVTATDSSGASASDTFTITVVNTND   96 (97)
T ss_pred             ----------------EEEEEEEEECCCCEEEEEEEEEEeCCCC
Confidence                            9999999998888889999999999987


No 16 
>PF08266 Cadherin_2:  Cadherin-like;  InterPro: IPR013164 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion. This entry represents a cadherin domain that is usually found at the N terminus of cadherin proteins.; PDB: 1WUZ_A 1WYJ_A.
Probab=95.42  E-value=0.015  Score=43.05  Aligned_cols=65  Identities=9%  Similarity=-0.056  Sum_probs=34.1

Q ss_pred             ceeEeeccccccccceEEEEEEecccCceeeeecccccccCCCCcccEEEeCCccEEEEcccCCCCCC
Q psy8640          12 MQCLIMRRGVVHRENIQVYLVWYGSTSRLCRRTCGTIPLSQQQRLAIQAVDPLTGVLSVQQPLDYDTG   79 (303)
Q Consensus        12 ~~~~i~e~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~F~Id~~tG~l~~~~~LD~E~~   79 (303)
                      +.++|+|....|+.......   ...-...--.-.++.+.......+|.+++.+|.|++...+|||.-
T Consensus         3 i~YsV~EE~~~Gt~IGnia~---dL~l~~~~l~~~~~ri~s~~~~~~~~v~~~tG~L~v~~rIDRE~L   67 (84)
T PF08266_consen    3 IRYSVPEEMPPGTVIGNIAK---DLGLDPQSLSSRNFRIVSEGNSQYFRVNEKTGDLFVSERIDREEL   67 (84)
T ss_dssp             EEEEEESS--TT-EEEECCC---CCT--HHHHCCTTBEEE-SSSS-SEEE-TTTSEEEESS--SCCCC
T ss_pred             eEEEeecCCCCCCEEEEhHH---hhCCCcccccccceEEeecCCcceeEecCCceeEEeCCccCHHHH
Confidence            45788888887766543321   111100000112233344456789999999999999999999985


No 17 
>TIGR01965 VCBS_repeat VCBS repeat. This domain of about 100 residues is found multiple (up to 35) copies in long proteins from several species of Vibrio, Colwellia, Bradyrhizobium, and Shewanella (hence the name VCBS) and in smaller copy numbers in proteins from several other bacteria. The large protein size and repeat copy numbers, species distribution, and suggested activities of several member proteins suggests a role for this domain in adhesion.
Probab=95.11  E-value=0.22  Score=38.02  Aligned_cols=30  Identities=23%  Similarity=0.357  Sum_probs=24.5

Q ss_pred             EEEEEEEEECCCCeEEEEEEEEEeecCCCCCeeCC
Q psy8640         241 YAFTLLSTDSGGRTCKVKVRVEIESRDEFHPQFTE  275 (303)
Q Consensus       241 y~l~V~a~D~g~~s~~~~v~I~V~DvNdn~P~F~~  275 (303)
                      -.|++.+.| |   .+++|+|+|.-.|| +|+..-
T Consensus        59 dsFtvtv~D-G---tt~~vtItI~GtND-apvi~~   88 (99)
T TIGR01965        59 DTFTVTSAD-G---TSQTVTITITGAND-AAVIGG   88 (99)
T ss_pred             EEEEEEEeC-C---CeEEEEEEEEccCC-CCEEec
Confidence            678888888 4   38899999999999 887643


No 18 
>PF13750 Big_3_3:  Bacterial Ig-like domain (group 3)
Probab=93.25  E-value=4.2  Score=33.76  Aligned_cols=29  Identities=21%  Similarity=0.350  Sum_probs=22.8

Q ss_pred             eecEEEEEEEEEECCCCeEEEEEEEEEee
Q psy8640         237 ARSRYAFTLLSTDSGGRTCKVKVRVEIES  265 (303)
Q Consensus       237 ~~~~y~l~V~a~D~g~~s~~~~v~I~V~D  265 (303)
                      ..+.|.|+|.|.|..+-..+..+......
T Consensus       121 ~~~~YtLtV~a~D~aGN~~~~si~F~y~P  149 (158)
T PF13750_consen  121 ADDSYTLTVSATDKAGNQSTKSISFSYMP  149 (158)
T ss_pred             CCCeEEEEEEEEecCCCEEEEEEEEEEeC
Confidence            34449999999999888877877776653


No 19 
>smart00736 CADG Dystroglycan-type cadherin-like domains. Cadherin-homologous domains present in metazoan dystroglycans and alpha/epsilon sarcoglycans, yeast Axl2p and in a very large protein from magnetotactic bacteria. Likely to bind calcium ions.
Probab=92.46  E-value=2  Score=32.35  Aligned_cols=52  Identities=29%  Similarity=0.284  Sum_probs=41.5

Q ss_pred             cccEEEeCCccEEEEcccCCCCCCCceEEEEEEEEECCCCCCCCCCeeEEEEEEEEEcCCC
Q psy8640          56 LAIQAVDPLTGVLSVQQPLDYDTGSSEHLLILKVEDGGKPPLDSPLSSITHLKITLQDEND  116 (303)
Q Consensus        56 ~~~F~Id~~tG~l~~~~~LD~E~~~~~~~l~i~a~D~g~~~~~~~~s~~~~v~I~V~DvND  116 (303)
                      ..|...|+.++.++-. +...+ ... |.+.+.|+|+..      .+....+.|.|.+.|+
T Consensus        45 P~Wl~fd~~~~~~~Gt-P~~~~-~g~-~~i~v~a~D~~g------~~~~~~f~i~V~~~~~   96 (97)
T smart00736       45 PSWLSFDSDTGTLSGT-PTNSD-VGS-LSLKVTATDSSG------ASASDTFTITVVNTND   96 (97)
T ss_pred             CCeEEEeCCCCEEEEE-CCCCC-CcE-EEEEEEEEECCC------CEEEEEEEEEEeCCCC
Confidence            4699999999988884 44333 346 999999999753      5788899999999987


No 20 
>TIGR03660 T1SS_rpt_143 T1SS-143 repeat domain. This model represents a domain of about 143 amino acids that may occur singly or in up to 23 tandem repeats in very large proteins in the genus Vibrio, and in related species such as Legionella pneumophila, Photobacterium profundum, Rhodopseudomonas palustris, Shewanella pealeana, and Aeromonas hydrophila. Proteins with these domains represent a subset of a broader set of proteins with a particular signal for type 1 secretion, consisting of several glycine-rich repeats modeled by pfam00353, followed by a C-terminal domain modeled by TIGR03661. Proteins with this domain tend to share several properties with the RtxA (Repeats in Toxin) protein of Vibrio cholerae, including a large size often containing tandemly repeated domains and a C-terminal signal for type 1 secretion.
Probab=70.71  E-value=24  Score=28.53  Aligned_cols=46  Identities=24%  Similarity=0.363  Sum_probs=31.3

Q ss_pred             EEEEEEEEECCCCeEEEEEEEEEeecCCCCCeeCCCeeEEE-ecCCCCCCCc
Q psy8640         241 YAFTLLSTDSGGRTCKVKVRVEIESRDEFHPQFTERTFKFI-LSGTDLPVGY  291 (303)
Q Consensus       241 y~l~V~a~D~g~~s~~~~v~I~V~DvNdn~P~F~~~~y~~~-~e~~~~p~gt  291 (303)
                      ..|.|.|+|..+-.++..+.|+|.|  | .|......- +. .|+.- +.|+
T Consensus        87 l~~~v~a~D~DGD~s~~~l~VtI~D--D-~P~~~~~~~-~~V~E~~L-~~g~  133 (137)
T TIGR03660        87 LNFPIIATDFDGDTSSITLPVTIVD--D-VPTITDVDA-LTVDEDDL-PGGS  133 (137)
T ss_pred             EeeeEEEEeCCCCccccEEEEEEEC--C-CCeeccccc-eEEecccc-ccCC
Confidence            6788999998876666788888887  6 577765542 33 45432 3443


No 21 
>PF12245 Big_3_2:  Bacterial Ig-like domain (group 3);  InterPro: IPR022038  This family of proteins is found in bacteria. They have two conserved sequence motifs: AGN and GMT. 
Probab=67.83  E-value=20  Score=24.43  Aligned_cols=31  Identities=23%  Similarity=0.400  Sum_probs=23.7

Q ss_pred             ecEEEEEEEEEECCCCeEEEEEEEEEeecCC
Q psy8640         238 RSRYAFTLLSTDSGGRTCKVKVRVEIESRDE  268 (303)
Q Consensus       238 ~~~y~l~V~a~D~g~~s~~~~v~I~V~DvNd  268 (303)
                      .+.|.|.+.|.|..|..++....+.+.|..-
T Consensus        22 dg~yt~~v~a~D~AGN~~~~~~~~~i~d~~~   52 (60)
T PF12245_consen   22 DGEYTLTVTATDKAGNTSSSTTQIVIVDNTA   52 (60)
T ss_pred             CccEEEEEEEEECCCCEEEeeeEEEEEcCCC
Confidence            4569999999999887776766666666543


No 22 
>TIGR00845 caca sodium/calcium exchanger 1. This model is specific for the eukaryotic sodium ion/calcium ion exchangers of the Caca family
Probab=66.35  E-value=1.4e+02  Score=32.08  Aligned_cols=50  Identities=22%  Similarity=0.109  Sum_probs=33.2

Q ss_pred             CCCCCCcCCCCceEEEEeCCCCCCeEEEEEEEEeCCCCCCceEEEEEeeCCC
Q psy8640         114 ENDNAPKFPITEYLEFVGENEPIGSSVFTARATDMDKGDYGKLNYSIISAAA  165 (303)
Q Consensus       114 vNDn~P~f~~~~~~~~V~E~~~~gt~v~~v~A~D~D~g~n~~i~Y~i~~~~~  165 (303)
                      .||..+.|....-...|.|+.  |+.-..|.-...|.+....+.|+...+..
T Consensus       395 ~dd~~s~i~Fe~~~Y~V~En~--GtV~VtV~R~GGdl~~tVsVdY~T~DGTA  444 (928)
T TIGR00845       395 ENDPVSKIFFEPGHYTCLENC--GTVALTVVRRGGDLTNTVYVDYRTEDGTA  444 (928)
T ss_pred             ccCCcceEEecCCeEEEeecC--cEEEEEEEEccCCCCceEEEEEEccCCcc
Confidence            566666666666677889984  77666665444354445678999887754


No 23 
>cd00146 PKD polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.
Probab=63.67  E-value=15  Score=26.08  Aligned_cols=28  Identities=21%  Similarity=0.298  Sum_probs=19.7

Q ss_pred             eeecEEEEEEEEEECCCCeEEEEEEEEE
Q psy8640         236 EARSRYAFTLLSTDSGGRTCKVKVRVEI  263 (303)
Q Consensus       236 E~~~~y~l~V~a~D~g~~s~~~~v~I~V  263 (303)
                      ...+.|.+++.|+|..+.+.+.++.|.|
T Consensus        54 ~~~G~y~v~l~v~d~~g~~~~~~~~V~V   81 (81)
T cd00146          54 TKPGTYTVTLTVTNAVGSSSTKTTTVVV   81 (81)
T ss_pred             CCCcEEEEEEEEEeCCCCEEEEEEEEEC
Confidence            3344499999999986666665666654


No 24 
>PF05345 He_PIG:  Putative Ig domain;  InterPro: IPR008009 This alignment represents the conserved core region of a ~90 residue repeat found in several haemagglutinins and other cell surface proteins. Sequence similarities to Hyalin (IPR003410 from INTERPRO) and the PKD domain (IPR000601 from INTERPRO) suggest an Ig-like fold so this family may be similar in function to the (IPR003791 from INTERPRO) and (IPR003790 from INTERPRO) protein families.
Probab=61.23  E-value=32  Score=22.40  Aligned_cols=38  Identities=18%  Similarity=0.221  Sum_probs=27.2

Q ss_pred             CCcccEEEeCCccEEEEcccCCCCCCCceEEEEEEEEECC
Q psy8640          54 QRLAIQAVDPLTGVLSVQQPLDYDTGSSEHLLILKVEDGG   93 (303)
Q Consensus        54 ~~~~~F~Id~~tG~l~~~~~LD~E~~~~~~~l~i~a~D~g   93 (303)
                      .-..+..+|+.+|.|+-.-.-.-+. -. |.+.|.|+|..
T Consensus        11 ~LP~gLs~d~~tG~isGtp~~~~~~-G~-y~~~vtatd~~   48 (49)
T PF05345_consen   11 GLPSGLSLDPSTGTISGTPTSSVQP-GT-YTFTVTATDGS   48 (49)
T ss_pred             CCCCcEEEeCCCCEEEeecCCCccc-cE-EEEEEEEEcCC
Confidence            3447889999999998773322111 35 99999999864


No 25 
>smart00089 PKD Repeats in polycystic kidney disease 1 (PKD1) and other proteins. Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.
Probab=60.22  E-value=20  Score=25.22  Aligned_cols=27  Identities=26%  Similarity=0.380  Sum_probs=20.2

Q ss_pred             eeecEEEEEEEEEECCCCeEEEEEEEEE
Q psy8640         236 EARSRYAFTLLSTDSGGRTCKVKVRVEI  263 (303)
Q Consensus       236 E~~~~y~l~V~a~D~g~~s~~~~v~I~V  263 (303)
                      +..+.|.+++.+.|..+ ++++.++|.|
T Consensus        52 ~~~G~y~v~l~v~n~~g-~~~~~~~i~v   78 (79)
T smart00089       52 TKPGTYTVTLTVTNAVG-SASATVTVVV   78 (79)
T ss_pred             CCCcEEEEEEEEEcCCC-cEEEEEEEEE
Confidence            34444999999999766 6677777766


No 26 
>PF07495 Y_Y_Y:  Y_Y_Y domain;  InterPro: IPR011123 This region is mostly found at the end of the beta propellers (IPR011110 from INTERPRO) in a family of two component regulators. However they are also found tandemly repeated in Q891H4 from SWISSPROT without other signal conduction domains being present. It is named after the conserved tyrosines found in the alignment. The exact function is not known.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=58.73  E-value=22  Score=24.06  Aligned_cols=26  Identities=15%  Similarity=0.427  Sum_probs=17.7

Q ss_pred             cEEEEEEEEEECCCC--eEEEEEEEEEe
Q psy8640         239 SRYAFTLLSTDSGGR--TCKVKVRVEIE  264 (303)
Q Consensus       239 ~~y~l~V~a~D~g~~--s~~~~v~I~V~  264 (303)
                      +.|.|.|.|.|..+.  ....++.|.|+
T Consensus        39 G~Y~l~V~a~~~~~~~~~~~~~l~i~I~   66 (66)
T PF07495_consen   39 GKYTLEVRAKDNNGKWSSDEKSLTITIL   66 (66)
T ss_dssp             EEEEEEEEEEETTS-B-SS-EEEEEEEE
T ss_pred             EEEEEEEEEECCCCCcCcccEEEEEEEC
Confidence            459999999997764  22266777663


No 27 
>TIGR03660 T1SS_rpt_143 T1SS-143 repeat domain. This model represents a domain of about 143 amino acids that may occur singly or in up to 23 tandem repeats in very large proteins in the genus Vibrio, and in related species such as Legionella pneumophila, Photobacterium profundum, Rhodopseudomonas palustris, Shewanella pealeana, and Aeromonas hydrophila. Proteins with these domains represent a subset of a broader set of proteins with a particular signal for type 1 secretion, consisting of several glycine-rich repeats modeled by pfam00353, followed by a C-terminal domain modeled by TIGR03661. Proteins with this domain tend to share several properties with the RtxA (Repeats in Toxin) protein of Vibrio cholerae, including a large size often containing tandemly repeated domains and a C-terminal signal for type 1 secretion.
Probab=58.34  E-value=55  Score=26.50  Aligned_cols=67  Identities=21%  Similarity=0.262  Sum_probs=42.1

Q ss_pred             cccEEEeC-Ccc--EEEEcccCCCCCC---CceEEEEEEEEECCCCCCCCCCeeEEEEEEEEEcCCCCCCcCCCCceEEE
Q psy8640          56 LAIQAVDP-LTG--VLSVQQPLDYDTG---SSEHLLILKVEDGGKPPLDSPLSSITHLKITLQDENDNAPKFPITEYLEF  129 (303)
Q Consensus        56 ~~~F~Id~-~tG--~l~~~~~LD~E~~---~~~~~l~i~a~D~g~~~~~~~~s~~~~v~I~V~DvNDn~P~f~~~~~~~~  129 (303)
                      ...|+|.- ..|  ...+.++||+...   -. ..|.|.|+|...     . .++..+.|.|.|  | .|...... ...
T Consensus        55 ~~Vftvtl~~~GsYtftL~~~lDH~~g~d~l~-l~~~v~a~D~DG-----D-~s~~~l~VtI~D--D-~P~~~~~~-~~~  123 (137)
T TIGR03660        55 NPVFTLTLNADGSYEFTLEGPLDHAAGSDELT-LNFPIIATDFDG-----D-TSSITLPVTIVD--D-VPTITDVD-ALT  123 (137)
T ss_pred             cEEEEEEECCCccEEEEEcccccCCCCCceEE-EeeeEEEEeCCC-----C-ccccEEEEEEEC--C-CCeecccc-ceE
Confidence            35676621 245  4666789998542   23 678888998743     1 223488888887  6 47665443 368


Q ss_pred             EeCC
Q psy8640         130 VGEN  133 (303)
Q Consensus       130 V~E~  133 (303)
                      |.|+
T Consensus       124 V~E~  127 (137)
T TIGR03660       124 VDED  127 (137)
T ss_pred             Eecc
Confidence            8775


No 28 
>PF08758 Cadherin_pro:  Cadherin prodomain like;  InterPro: IPR014868 Cadherins are a group of proteins that mediate calcium dependent cell-cell adhesion. They are activated through cleavage of a prosequence in the late Golgi. This protein corresponds to the folded region of the prosequence, and is termed the prodomain. The prodomain shows structural resemblance to the cadherin domain, but lacks all the features known to be important for cadherin-cadherin interactions []. ; GO: 0007155 cell adhesion, 0016021 integral to membrane; PDB: 1OP4_A.
Probab=55.51  E-value=56  Score=24.35  Aligned_cols=35  Identities=11%  Similarity=0.121  Sum_probs=25.9

Q ss_pred             ccEEEeCCccEEEEcccCCCCCCCceEEEEEEEEECCC
Q psy8640          57 AIQAVDPLTGVLSVQQPLDYDTGSSEHLLILKVEDGGK   94 (303)
Q Consensus        57 ~~F~Id~~tG~l~~~~~LD~E~~~~~~~l~i~a~D~g~   94 (303)
                      ..|.|+ ..|.|++++++.....+  -+|.|.|.|...
T Consensus        46 pdF~V~-~DGsVy~~r~v~l~~~~--~~F~V~a~D~~~   80 (90)
T PF08758_consen   46 PDFRVL-EDGSVYAKRPVQLSSEQ--RSFTVHAWDSQT   80 (90)
T ss_dssp             SEEEEE-TTTEEEEES--S-SSS---EEEEEEEEETTT
T ss_pred             CCEEEc-CCCeEEEeeeEecCCCc--eEEEEEEECCCC
Confidence            489999 78999999999876543  579999999764


No 29 
>cd02848 Chitinase_N_term Chitinase N-terminus domain. Chitinases hydrolyze the abundant natural biopolymer chitin, producing smaller chito-oligosaccharides. Chitin consists of multiple N-acetyl-D-glucosamine (NAG) residues connected via beta-1,4-glycosidic linkages and is an important structural element of fungal cell wall and arthropod exoskeletons. On the basis of the mode of chitin hydrolysis, chitinases are classified as random, endo-, and exo-chitinases and based on sequence criteria, chitinases belong to families 18 and 19 of glycosyl hydrolases.  The N-terminus of chitinase may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at  either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitob
Probab=51.25  E-value=20  Score=27.66  Aligned_cols=33  Identities=12%  Similarity=0.283  Sum_probs=24.4

Q ss_pred             eeeeeecEEEEEEEEEECCCCeEEEEEEEEEee
Q psy8640         233 FDYEARSRYAFTLLSTDSGGRTCKVKVRVEIES  265 (303)
Q Consensus       233 lD~E~~~~y~l~V~a~D~g~~s~~~~v~I~V~D  265 (303)
                      +++.+-+.|.++|.++|..+-+.++.+.|.|-|
T Consensus        74 ~~v~kgG~y~m~V~lCn~dGCS~S~~~~I~VAD  106 (106)
T cd02848          74 FKVGKGGRYQMQVALCNGDGCSTSAAKEIVVAD  106 (106)
T ss_pred             EEeCCCCeEEEEEEEECCCCccCcCCEEEEecC
Confidence            445555669999999998776666677777755


No 30 
>PF08329 ChitinaseA_N:  Chitinase A, N-terminal domain;  InterPro: IPR013540 This domain is found in a number of bacterial chitinases and similar viral proteins. It is organised into a fibronectin III module domain-like fold, comprising only beta strands. Its function is not known, but it may be involved in interaction with the enzyme substrate, chitin [, ]. It is separated by a hinge region from the catalytic domain (IPR001223 from INTERPRO); this hinge region is probably mobile, allowing the N-terminal domain to have different relative positions in solution []. ; GO: 0004568 chitinase activity; PDB: 2WLY_A 1EDQ_A 2WM0_A 1X6N_A 1NH6_A 2WK2_A 1EHN_A 2WLZ_A 1EIB_A 1FFR_A ....
Probab=49.86  E-value=34  Score=27.59  Aligned_cols=37  Identities=19%  Similarity=0.327  Sum_probs=25.3

Q ss_pred             eeeeeeecEEEEEEEEEECCCCeEEEEEEEEEeecCC
Q psy8640         232 VFDYEARSRYAFTLLSTDSGGRTCKVKVRVEIESRDE  268 (303)
Q Consensus       232 ~lD~E~~~~y~l~V~a~D~g~~s~~~~v~I~V~DvNd  268 (303)
                      .+.....+.|.+.|+++|..+-+.+..+.|.|.|-+-
T Consensus        76 ~~~~~~gG~y~~~VeLCN~~GCS~S~~~~V~VaDTDG  112 (133)
T PF08329_consen   76 TFTVTKGGRYQMQVELCNADGCSTSAPVEVVVADTDG  112 (133)
T ss_dssp             EEEE-S-EEEEEEEEEEETTEEEE---EEEEEE-TTS
T ss_pred             EEEecCCCEEEEEEEEECCCCcccCCCEEEEEeCCCc
Confidence            3455566779999999998887777899999999874


No 31 
>KOG3597|consensus
Probab=46.43  E-value=2.9e+02  Score=26.98  Aligned_cols=60  Identities=17%  Similarity=0.102  Sum_probs=42.4

Q ss_pred             EEEEEEEEEcCCCCCCcCCCCceEEEEeCCCCCCeEEEEEEEEeCCCCCCceEEEEEeeCC
Q psy8640         104 ITHLKITLQDENDNAPKFPITEYLEFVGENEPIGSSVFTARATDMDKGDYGKLNYSIISAA  164 (303)
Q Consensus       104 ~~~v~I~V~DvNDn~P~f~~~~~~~~V~E~~~~gt~v~~v~A~D~D~g~n~~i~Y~i~~~~  164 (303)
                      +....|.|.-+||.+..+....+.+-+.|+...-.....+.+.|+|.+. ..+.|++....
T Consensus        25 ~~~~~i~v~pvndpp~~~~~~~~~l~~~~~~~k~l~~~~l~~~d~d~~~-~~l~f~v~~t~   84 (442)
T KOG3597|consen   25 TDVLRIHVNPVNDPPSLIFPSGSLLVILEGGQKVLDPELLTAADPDSAP-LPLEFQVLGTS   84 (442)
T ss_pred             EeeecccccccCCCcceeecccceEEeecCCceeccceEeeccCCCCCc-cceEEEEccCC
Confidence            3468899999999777776666678888875543334567888888753 45777777654


No 32 
>cd02848 Chitinase_N_term Chitinase N-terminus domain. Chitinases hydrolyze the abundant natural biopolymer chitin, producing smaller chito-oligosaccharides. Chitin consists of multiple N-acetyl-D-glucosamine (NAG) residues connected via beta-1,4-glycosidic linkages and is an important structural element of fungal cell wall and arthropod exoskeletons. On the basis of the mode of chitin hydrolysis, chitinases are classified as random, endo-, and exo-chitinases and based on sequence criteria, chitinases belong to families 18 and 19 of glycosyl hydrolases.  The N-terminus of chitinase may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at  either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitob
Probab=41.59  E-value=1.2e+02  Score=23.48  Aligned_cols=34  Identities=15%  Similarity=0.229  Sum_probs=22.3

Q ss_pred             cCCCCCCCceEEEEEEEEECCCCCCCCCCeeEEEEEEEEEc
Q psy8640          73 PLDYDTGSSEHLLILKVEDGGKPPLDSPLSSITHLKITLQD  113 (303)
Q Consensus        73 ~LD~E~~~~~~~l~i~a~D~g~~~~~~~~s~~~~v~I~V~D  113 (303)
                      .+++.+.-. |.++|++.|...      .+.+..+.|.|.|
T Consensus        73 t~~v~kgG~-y~m~V~lCn~dG------CS~S~~~~I~VAD  106 (106)
T cd02848          73 TFKVGKGGR-YQMQVALCNGDG------CSTSAAKEIVVAD  106 (106)
T ss_pred             EEEeCCCCe-EEEEEEEECCCC------ccCcCCEEEEecC
Confidence            345666667 999999998753      3444556666554


No 33 
>PF03160 Calx-beta:  Calx-beta domain;  InterPro: IPR003644 The calx-beta motif is present as a tandem repeat in the cytoplasmic domains of Calx Na-Ca exchangers, which are used to expel calcium from cells. This motif overlaps domains used for calcium binding and regulation. The calx-beta motif is also present in the cytoplasmic tail of mammalian integrin-beta4, which mediates the bi-directional transfer of signals across the plasma membrane, as well as in some cyanobacterial proteins. This motif contains a series of beta-strands and turns that form a self-contained beta-sheet [, ].; GO: 0007154 cell communication, 0016021 integral to membrane; PDB: 3H6A_B 3FSO_A 3FQ4_B 2DPK_A 2QVM_A 3GIN_B 2QVK_A 2FWU_A 2FWS_A 3E9U_A ....
Probab=35.36  E-value=1.8e+02  Score=21.33  Aligned_cols=53  Identities=19%  Similarity=0.277  Sum_probs=31.6

Q ss_pred             EEEEEcCCCCCCcCCCCceEEEEeCCCCCCeEEEEEEEEeCCCCCCceEEEEEeeCC
Q psy8640         108 KITLQDENDNAPKFPITEYLEFVGENEPIGSSVFTARATDMDKGDYGKLNYSIISAA  164 (303)
Q Consensus       108 ~I~V~DvNDn~P~f~~~~~~~~V~E~~~~gt~v~~v~A~D~D~g~n~~i~Y~i~~~~  164 (303)
                      +|.|.| ||.+ .+....-...+.|+.  |.....|.-...+....-.+.|....+.
T Consensus         2 tvtI~d-~d~~-~v~f~~~~~~v~E~~--~~~~v~V~~~~~~~~~~v~v~~~~~~gt   54 (100)
T PF03160_consen    2 TVTILD-DDDP-TVSFSSPSYTVSEGD--GTVTVTVTRSGGSLDGPVTVNYSTVDGT   54 (100)
T ss_dssp             EEEEE--TTSE-EEEESSSEEEEETTS--SEEEEEEEEESS-TSSEEEEEEEEEESS
T ss_pred             EEEEEC-CCCC-EEEEeCCEEEEEeCC--CEEEEEEEEcccCCCcceEEEEEEeCCc
Confidence            567778 7755 776666677889874  5445555555443333446677776654


No 34 
>KOG4221|consensus
Probab=33.12  E-value=7.3e+02  Score=27.71  Aligned_cols=74  Identities=15%  Similarity=0.121  Sum_probs=47.4

Q ss_pred             cccceEEEEEEecccCceeeeecccccccCCCCcccEEEeCCccEEEEcccCCCCCCCceEEEEEEEEECCCCCCCCCCe
Q psy8640          23 HRENIQVYLVWYGSTSRLCRRTCGTIPLSQQQRLAIQAVDPLTGVLSVQQPLDYDTGSSEHLLILKVEDGGKPPLDSPLS  102 (303)
Q Consensus        23 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~F~Id~~tG~l~~~~~LD~E~~~~~~~l~i~a~D~g~~~~~~~~s  102 (303)
                      +-.-++.|.+.|+..                +....+.++..+-++++.   +.|.... |.+.|.|.....    .+.+
T Consensus       547 ~n~~I~~yk~~ys~~----------------~~~~~~~~~~n~~e~ti~---gL~k~Te-Y~~~vvA~N~~G----~g~s  602 (1381)
T KOG4221|consen  547 GNGPITGYKLFYSED----------------DTGKELRVENNATEYTIN---GLEKYTE-YSIRVVAYNSAG----SGVS  602 (1381)
T ss_pred             CCCCceEEEEEEEcC----------------CCCceEEEecCccEEEee---cCCCccc-eEEEEEEecCCC----CCCC
Confidence            444566676666542                344678888777777766   4455666 999999998754    3444


Q ss_pred             eEEEEEEEEEcCCCCCCc
Q psy8640         103 SITHLKITLQDENDNAPK  120 (303)
Q Consensus       103 ~~~~v~I~V~DvNDn~P~  120 (303)
                      +..--.+...|+=+-||.
T Consensus       603 S~~i~V~Tlsd~PsaPP~  620 (1381)
T KOG4221|consen  603 SADITVRTLSDVPSAPPQ  620 (1381)
T ss_pred             CCceEEEeccCCCCCCCc
Confidence            444444556677776664


No 35 
>PF05895 DUF859:  Siphovirus protein of unknown function (DUF859);  InterPro: IPR008577 This entry is represented by Streptococcus phage 7201, Orf39. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This family consists of several uncharacterised proteins from a number of the Siphoviruses as well as some bacterial proteins from Streptococcus species. Some of the members of this family are described as putative minor structural proteins.
Probab=32.15  E-value=5.7e+02  Score=26.21  Aligned_cols=32  Identities=19%  Similarity=0.177  Sum_probs=24.3

Q ss_pred             EEEEEEEEECCCCCCCCCCeeEEEEEEEEEcCCCCCCcC
Q psy8640          83 HLLILKVEDGGKPPLDSPLSSITHLKITLQDENDNAPKF  121 (303)
Q Consensus        83 ~~l~i~a~D~g~~~~~~~~s~~~~v~I~V~DvNDn~P~f  121 (303)
                      .++++.++|...     ..+...+..|.|.+-.  +|.+
T Consensus       300 ~Ti~atVtDSRG-----r~S~~~~~tItVl~Y~--~P~l  331 (624)
T PF05895_consen  300 ATIRATVTDSRG-----RTSDPKTKTITVLEYS--PPTL  331 (624)
T ss_pred             EEEEEEEEECCC-----ccCCceEEEEEEEEcC--CCcE
Confidence            889999999863     3456778899998776  5665


No 36 
>TIGR00845 caca sodium/calcium exchanger 1. This model is specific for the eukaryotic sodium ion/calcium ion exchangers of the Caca family
Probab=27.88  E-value=8e+02  Score=26.52  Aligned_cols=57  Identities=21%  Similarity=0.223  Sum_probs=34.5

Q ss_pred             EEEEEEEEcCCCCCCcCCCCceEEEEeCCCCCCeEEEEEEEEeCCCCCCceEEEEEeeCCC
Q psy8640         105 THLKITLQDENDNAPKFPITEYLEFVGENEPIGSSVFTARATDMDKGDYGKLNYSIISAAA  165 (303)
Q Consensus       105 ~~v~I~V~DvNDn~P~f~~~~~~~~V~E~~~~gt~v~~v~A~D~D~g~n~~i~Y~i~~~~~  165 (303)
                      ...+|.|.| ||++|.|....-...|.|+.  |..-.+|.-+- +....-.+.|.-..+..
T Consensus       515 s~ATVTIlD-DD~aGIfsFe~~~~sV~Es~--G~vtvtV~Rts-Ga~G~VtV~Y~T~dGTA  571 (928)
T TIGR00845       515 NTATVTILD-DDHAGIFTFEEDVFHVSESI--GIMEVKVLRTS-GARGTVIVPYRTVEGTA  571 (928)
T ss_pred             ceEEEEEec-CcccCcccccCceEEEEcCC--CEEEEEEEEcC-CCCeeEEEEEEeecCcc
Confidence            456677777 88899887776678899974  55444432221 11112346787777653


No 37 
>PF00778 DIX:  DIX domain;  InterPro: IPR001158 Dishevelled (Dsh) protein is an important component of the Wnt signal-transduction pathway. It has three relatively conserved domains: DIX, PDZ and DEP. The DIX domain of Dvl-1 (a mammalian Dishevelled homologue) shares 37% identity with the C-terminal region of Axin. Dsh can interact with the Axin/APC/GSK3/beta-catenin complex, and may thus modulate its activity []. The Wnt signalling pathway is conserved in various species from Caenorhabditis elegans to mammals, and plays important roles in development, cellular proliferation, and differentiation. The molecular mechanisms by which the Wnt signal regulates cellular functions are becoming increasingly well understood. Wnt stabilises cytoplasmic beta-catenin, which stimulates the expression of genes including c-myc, c-jun, fra-1, and cyclin D1. Axin and its homologue Axil are components of the Wnt signalling pathway that negatively regulate this pathway. Other components of the Wnt signalling pathway, including Dvl, glycogen synthase kinase-3beta (GSK-3beta), beta-catenin, and adenomatous polyposis coli (APC), interact with Axin, and the phosphorylation and stability of beta-catenin are regulated in the Axin complex. Axil has similar functions to Axin. Thus, Axin and Axil act as scaffold proteins in the Wnt signalling pathway, thereby modulating the Wnt-dependent cellular functions [].; GO: 0004871 signal transducer activity, 0007275 multicellular organismal development, 0005622 intracellular; PDB: 1WSP_B 2D5G_A 3PZ7_A 3PZ8_H.
Probab=26.85  E-value=1.7e+02  Score=21.58  Aligned_cols=42  Identities=33%  Similarity=0.528  Sum_probs=23.2

Q ss_pred             EEEEEEEEECCCCeEEEEEEEEEeecCCCCCeeCCCeeEEEecCCCCCCCcEEEEEEEEe
Q psy8640         241 YAFTLLSTDSGGRTCKVKVRVEIESRDEFHPQFTERTFKFILSGTDLPVGYVVGHLRLVR  300 (303)
Q Consensus       241 y~l~V~a~D~g~~s~~~~v~I~V~DvNdn~P~F~~~~y~~~~e~~~~p~gt~v~~v~A~D  300 (303)
                      |.+-..+.|...- +  .|...|.|-++.-|.|               .|.+++.|...|
T Consensus        43 yrffFK~~~~d~~-~--~V~eEi~dD~~~LP~~---------------eGkI~~~v~~~d   84 (84)
T PF00778_consen   43 YRFFFKSLDPDFG-C--VVKEEITDDDDILPLF---------------EGKIVAKVESVD   84 (84)
T ss_dssp             EEEEEEEEETTTE-T--EEEEEE-STTSB---B---------------TTEEEEEEEE--
T ss_pred             ceEEEEEeCCCCC-e--eeEEEEcCCccccccc---------------CCEEEEEEEeCC
Confidence            5555565553322 1  6788889999988988               556666665554


No 38 
>PF13754 Big_3_4:  Bacterial Ig-like domain (group 3)
Probab=26.26  E-value=92  Score=20.53  Aligned_cols=15  Identities=33%  Similarity=0.654  Sum_probs=12.4

Q ss_pred             cEEEEEEEEEECCCC
Q psy8640         239 SRYAFTLLSTDSGGR  253 (303)
Q Consensus       239 ~~y~l~V~a~D~g~~  253 (303)
                      +.|.|++.|+|..+.
T Consensus        24 G~y~itv~a~D~AGN   38 (54)
T PF13754_consen   24 GTYTITVTATDAAGN   38 (54)
T ss_pred             ccEEEEEEEEeCCCC
Confidence            349999999998764


No 39 
>PF13750 Big_3_3:  Bacterial Ig-like domain (group 3)
Probab=26.02  E-value=3.7e+02  Score=22.09  Aligned_cols=36  Identities=25%  Similarity=0.386  Sum_probs=25.2

Q ss_pred             cEEEEEE-EEEECCCCeEEEEEEEEEeecCCCCCeeCC
Q psy8640         239 SRYAFTL-LSTDSGGRTCKVKVRVEIESRDEFHPQFTE  275 (303)
Q Consensus       239 ~~y~l~V-~a~D~g~~s~~~~v~I~V~DvNdn~P~F~~  275 (303)
                      +.|.+.+ .|.|..+...+..+...+. ++-.+|.+.-
T Consensus        15 G~Y~l~~~~a~D~agN~~~~~~~~~~~-iD~T~Ptisi   51 (158)
T PF13750_consen   15 GSYTLTVVTATDAAGNTSTSTVSETFT-IDNTPPTISI   51 (158)
T ss_pred             ccEEEEEEEEEecCCCEEEEEEeeEEE-EcCCCCEEEE
Confidence            5699999 7999877766666554444 3445888755


Done!