Query         psy2920
Match_columns 325
No_of_seqs    175 out of 1490
Neff          8.7 
Searched_HMMs 46136
Date          Fri Aug 16 21:00:01 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy2920.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/2920hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG4289|consensus              100.0 1.9E-42 4.1E-47  344.0  25.4  269    1-322   174-456 (2531)
  2 KOG4289|consensus              100.0 8.7E-42 1.9E-46  339.4  28.6  272    2-323   280-559 (2531)
  3 KOG1219|consensus              100.0 1.6E-36 3.4E-41  310.4  32.4  281    1-322   650-1036(4289)
  4 KOG1219|consensus              100.0   4E-36 8.7E-41  307.4  30.0  268    2-322   964-1240(4289)
  5 cd00031 CA Cadherin repeat dom 100.0 5.3E-30 1.1E-34  220.9  27.1  186    1-213     7-199 (199)
  6 cd00031 CA Cadherin repeat dom  99.9 9.5E-24 2.1E-28  181.8  24.0  174  115-321     2-184 (199)
  7 PF00028 Cadherin:  Cadherin do  99.6 3.8E-15 8.3E-20  112.6  12.9   86  115-212     1-93  (93)
  8 KOG1834|consensus               99.6 2.5E-14 5.5E-19  136.1  20.0  174   13-211    54-242 (952)
  9 PF00028 Cadherin:  Cadherin do  99.6 5.9E-15 1.3E-19  111.6  12.8   86    1-101     6-93  (93)
 10 smart00112 CA Cadherin repeats  99.6 2.2E-14 4.7E-19  105.0  10.8   77   17-108     2-79  (79)
 11 KOG1834|consensus               99.5 3.5E-13 7.6E-18  128.4  17.4  191   98-321    21-229 (952)
 12 smart00112 CA Cadherin repeats  99.4 3.5E-12 7.6E-17   93.2   9.9   72  146-219     2-79  (79)
 13 PF08266 Cadherin_2:  Cadherin-  97.2 0.00019 4.2E-09   52.7   2.2   61    1-75      8-68  (84)
 14 PF08758 Cadherin_pro:  Cadheri  96.5    0.02 4.4E-07   42.6   8.1   78  106-196     2-80  (90)
 15 smart00736 CADG Dystroglycan-t  96.1    0.13 2.7E-06   38.8  10.5   68  144-216    24-96  (97)
 16 PF08266 Cadherin_2:  Cadherin-  96.0   0.018 3.9E-07   42.3   5.5   57  114-181     2-66  (84)
 17 smart00736 CADG Dystroglycan-t  94.4    0.83 1.8E-05   34.3  10.4   64   26-105    30-96  (97)
 18 PF13750 Big_3_3:  Bacterial Ig  93.7     2.8 6.1E-05   34.6  13.1  122  181-323    13-138 (158)
 19 PF08758 Cadherin_pro:  Cadheri  92.0     2.2 4.8E-05   31.7   9.1   66    1-89     16-81  (90)
 20 TIGR01965 VCBS_repeat VCBS rep  91.8     4.4 9.5E-05   30.6  10.5   89   11-122     2-97  (99)
 21 PF05345 He_PIG:  Putative Ig d  89.3     1.9 4.2E-05   28.0   5.9   35  161-195    14-48  (49)
 22 KOG3597|consensus               88.7      13 0.00027   36.0  13.3  156   92-274    24-199 (442)
 23 PF13750 Big_3_3:  Bacterial Ig  87.6      15 0.00032   30.3  15.7  111   74-196    14-136 (158)
 24 TIGR01965 VCBS_repeat VCBS rep  87.5       6 0.00013   29.9   8.3   71  140-219     2-85  (99)
 25 PF07495 Y_Y_Y:  Y_Y_Y domain;   84.5      10 0.00022   25.7   8.6   56  151-211     9-65  (66)
 26 TIGR00845 caca sodium/calcium   84.1      62  0.0014   34.4  18.3   29   94-123   515-543 (928)
 27 PF05895 DUF859:  Siphovirus pr  81.9      38 0.00083   34.4  13.5  113   73-195   296-424 (624)
 28 TIGR03660 T1SS_rpt_143 T1SS-14  76.2      39 0.00084   27.2  11.1   61   60-125    67-130 (137)
 29 KOG4221|consensus               75.3 1.3E+02  0.0029   32.9  22.6   66   28-109   552-620 (1381)
 30 PF05345 He_PIG:  Putative Ig d  65.3      26 0.00057   22.6   5.3   36   38-88     13-49  (49)
 31 PF13754 Big_3_4:  Bacterial Ig  53.7      40 0.00088   22.1   4.7   17  308-324    24-40  (54)
 32 KOG4221|consensus               52.5 3.6E+02  0.0079   29.8  22.6  121   74-219   493-619 (1381)
 33 PF07495 Y_Y_Y:  Y_Y_Y domain;   49.8      34 0.00073   23.0   4.1   29   73-101    37-66  (66)
 34 PF12245 Big_3_2:  Bacterial Ig  49.7      71  0.0015   21.5   5.6   31   73-104    21-51  (60)
 35 PF13860 FlgD_ig:  FlgD Ig-like  39.6 1.2E+02  0.0026   21.6   5.8   13  308-320    69-81  (81)
 36 smart00089 PKD Repeats in poly  38.5      80  0.0017   21.9   4.7   30   69-100    49-78  (79)
 37 PF02494 HYR:  HYR domain;  Int  37.9      75  0.0016   22.5   4.5   27   73-100    55-81  (81)
 38 smart00089 PKD Repeats in poly  34.3      90  0.0019   21.6   4.4   29  178-211    50-78  (79)
 39 PF10365 DUF2436:  Domain of un  29.9   3E+02  0.0065   22.2   7.3  126   62-195    27-156 (161)
 40 PF02010 REJ:  REJ domain;  Int  28.6      90  0.0019   29.9   4.7   42  167-212   145-186 (440)
 41 cd00146 PKD polycystic kidney   28.6 1.4E+02  0.0031   20.7   4.7   19   70-88     52-70  (81)
 42 cd02848 Chitinase_N_term Chiti  26.9 1.8E+02  0.0039   22.3   5.0   34   68-102    73-106 (106)
 43 KOG3513|consensus               26.5 8.8E+02   0.019   26.5  20.5  121   71-219   488-619 (1051)
 44 cd05760 Ig2_PTK7 Second immuno  21.6 2.8E+02  0.0061   19.0   6.2   47  162-213    30-76  (77)
 45 KOG0196|consensus               21.0   1E+03   0.022   25.4  12.2   38  178-219   404-444 (996)

No 1  
>KOG4289|consensus
Probab=100.00  E-value=1.9e-42  Score=344.04  Aligned_cols=269  Identities=25%  Similarity=0.318  Sum_probs=242.9

Q ss_pred             CCCCCCCceEEeeEEEecCCCCCCCceEEEEEeeC---CCCceEEEeCCCCccccccccCCcceEEEecccCCcccccEE
Q psy2920           1 MENSKSGTGVKGLPILVASSHKHPHQLVRYSITQG---NDNHTFALKNPTTHEIRNSTESKSGVLLVTSGDIDREEVAQY   77 (325)
Q Consensus         1 ~En~~~Gt~v~~~~a~d~D~~~~~~~~i~y~i~~~---~~~~~F~i~~~~g~~~~~~~~~~~~~~l~~~~~LDrE~~~~~   77 (325)
                      |||.|.||.|+.+.|.|+|+     +++.|++...   ...+.|+||+.+|             .|++.+.||||....|
T Consensus       174 pEn~pagT~iasv~A~~~~a-----~rl~Ysm~al~dsRS~~lFslD~~sG-------------~irta~~lDREt~e~H  235 (2531)
T KOG4289|consen  174 PENEPAGTIIASVKASDPDA-----GRLYYSMVALFDSRSQNLFSLDPMSG-------------AIRTAKSLDRETKETH  235 (2531)
T ss_pred             cCCCCCCceeEEEEecCCCc-----CceEEEeeeccchhccccEeeccccc-------------cchhhhhhhhhhhhee
Confidence            89999999999999999983     6799998753   2347999999999             9999999999999999


Q ss_pred             EEEEEEEeCCCCC-eeEEEEEEEEecCCCCCCccCcceeEEEEeccCCCCCCCcccccceeEEEEEEEEeCCC---CeEE
Q psy2920          78 SLVITATDVKGIN-KAETHVTVDVLDENDNSPVFSQKVYRFIVGDLKSSLNSNVSHWKKFSSIGSVSATDADG---DKVA  153 (325)
Q Consensus        78 ~l~v~a~D~~~~~-~~~~~v~v~V~DvNdn~P~f~~~~~~~~v~E~~~~g~~~~~~~~~~~~v~~v~A~D~D~---~~v~  153 (325)
                      .|+|+|.|.+.|. +++++|+|.|+|+|||.|+|.+..|.-.+.||.+.|          ..|.+|+|+|.|.   +.|.
T Consensus       236 vlrVtA~d~~~P~~SAtttv~V~V~D~nDhsPvFEq~~Y~e~lREn~evG----------y~vLtvrAtD~Dsp~Nani~  305 (2531)
T KOG4289|consen  236 VLRVTAQDHGDPRRSATTTVTVLVLDTNDHSPVFEQDEYREELRENLEVG----------YEVLTVRATDGDSPPNANIR  305 (2531)
T ss_pred             EEEEEeeecCCCcccceeEEEEEEeecCCCCcccchhHHHHHHhhccccC----------ceEEEEEeccCCCCCCCceE
Confidence            9999999999887 899999999999999999999999999999999999          9999999999998   5899


Q ss_pred             EEEecC--CCcEEEeCCceEEEEccC-CCCCCceEEEEEEEEeCCCCCCcceecEEEEEEEec-CCCCCceeeeccccCC
Q psy2920         154 YKLAVP--SNLVVIVPQTGELLLTGD-PPSDDVEYELTVEAHDLQSPSRYAREPAQVWLQFNS-PVLQDEQLLQMIDRKS  229 (325)
Q Consensus       154 Y~i~~~--~~~F~i~~~tG~i~l~~~-d~e~~~~y~l~V~a~D~g~p~~~s~~t~~v~I~v~d-~~~~P~~~~~~~~~~~  229 (325)
                      |++.++  .+.|+|++.+|.|.+..+ |+|....|.|.|+|+|.|.|+..+  |+.|.|+|+| |||+|. |..      
T Consensus       306 Yrl~eg~~~~~f~in~rSGvI~T~a~lDRE~~~~y~L~VeAsDqG~~pgp~--Ta~V~itV~D~NDNaPq-Fse------  376 (2531)
T KOG4289|consen  306 YRLLEGNAKNVFEINPRSGVISTRAPLDREELESYQLDVEASDQGRPPGPR--TAMVEITVEDENDNAPQ-FSE------  376 (2531)
T ss_pred             EEecCCCccceeEEcCccceeeccCccCHHhhhheEEEEEeccCCCCCCCc--eEEEEEEEEecCCCCcc-ccc------
Confidence            999986  578999999999999999 999999999999999999988888  9999999999 667777 322      


Q ss_pred             CCccceeeeeEeeccCC---ceEEEEEeCCCCccceEEEEEEeCCcccceEeeCCCCceEeCCCeeEEEceeCccccCCC
Q psy2920         230 EAPHRIEKRRVTRAVRP---TKRIDFSETDGEMEGRIVFALEKETEREKFKIRDENPWITVDENGDVRVKKRWDFEELGP  306 (325)
Q Consensus       230 ~~~~~~~~~~v~~~~~~---~~~v~a~d~D~~~~~~i~y~i~~~~~~~~F~i~~~~~~f~i~~tG~i~~~~~lD~E~~~~  306 (325)
                          ..|..++.|++.+   +.+|+|+|.|.|.|+.+.|+|.+|+..+.|.|+.        .+|+|.+..+||+|..  
T Consensus       377 ----~~Yvvqv~Edvt~~avvlrV~AtDrD~g~Ng~VHYsi~Sgn~~G~f~id~--------~tGel~vv~plD~e~~--  442 (2531)
T KOG4289|consen  377 ----KRYVVQVREDVTPPAVVLRVTATDRDKGTNGKVHYSIASGNGRGQFYIDS--------LTGELDVVEPLDFENS--  442 (2531)
T ss_pred             ----cceEEEecccCCCCceEEEEEecccCCCcCceEEEEeeccCccccEEEec--------ccceEEEeccccccCC--
Confidence                2234555555554   6889999999999999999999999999999987        6999999999999994  


Q ss_pred             CceEEEEEEEecCCCC
Q psy2920         307 EKTIDFWVTINNQATN  322 (325)
Q Consensus       307 ~~~~~l~V~a~D~~~~  322 (325)
                        .|.+.|.|.|.|-+
T Consensus       443 --~ytl~IrAqDggrP  456 (2531)
T KOG4289|consen  443 --EYTLRIRAQDGGRP  456 (2531)
T ss_pred             --eeEEEEEcccCCCC
Confidence              89999999999865


No 2  
>KOG4289|consensus
Probab=100.00  E-value=8.7e-42  Score=339.35  Aligned_cols=272  Identities=24%  Similarity=0.295  Sum_probs=239.7

Q ss_pred             CCCCCCceEEeeEEEecCCCCCCCceEEEEEeeCCCCceEEEeCCCCccccccccCCcceEEEecccCCcccccEEEEEE
Q psy2920           2 ENSKSGTGVKGLPILVASSHKHPHQLVRYSITQGNDNHTFALKNPTTHEIRNSTESKSGVLLVTSGDIDREEVAQYSLVI   81 (325)
Q Consensus         2 En~~~Gt~v~~~~a~d~D~~~~~~~~i~y~i~~~~~~~~F~i~~~~g~~~~~~~~~~~~~~l~~~~~LDrE~~~~~~l~v   81 (325)
                      ||.++|+.|.+++|+|.|++.+  +.|.|++.+++....|.|++.+|             .|.+..+||||+...|+|.|
T Consensus       280 En~evGy~vLtvrAtD~Dsp~N--ani~Yrl~eg~~~~~f~in~rSG-------------vI~T~a~lDRE~~~~y~L~V  344 (2531)
T KOG4289|consen  280 ENLEVGYEVLTVRATDGDSPPN--ANIRYRLLEGNAKNVFEINPRSG-------------VISTRAPLDREELESYQLDV  344 (2531)
T ss_pred             hccccCceEEEEEeccCCCCCC--CceEEEecCCCccceeEEcCccc-------------eeeccCccCHHhhhheEEEE
Confidence            8999999999999999999875  99999999998889999999999             99999999999999999999


Q ss_pred             EEEeCCCCC-eeEEEEEEEEecCCCCCCccCcceeEEEEeccCCCCCCCcccccceeEEEEEEEEeCCC---CeEEEEEe
Q psy2920          82 TATDVKGIN-KAETHVTVDVLDENDNSPVFSQKVYRFIVGDLKSSLNSNVSHWKKFSSIGSVSATDADG---DKVAYKLA  157 (325)
Q Consensus        82 ~a~D~~~~~-~~~~~v~v~V~DvNdn~P~f~~~~~~~~v~E~~~~g~~~~~~~~~~~~v~~v~A~D~D~---~~v~Y~i~  157 (325)
                      .|.|.|.++ ..++.|.|+|.|+|||+|+|....|.++|.|+..++          +.|.+|.|+|.|.   +.|+|+|.
T Consensus       345 eAsDqG~~pgp~Ta~V~itV~D~NDNaPqFse~~Yvvqv~Edvt~~----------avvlrV~AtDrD~g~Ng~VHYsi~  414 (2531)
T KOG4289|consen  345 EASDQGRPPGPRTAMVEITVEDENDNAPQFSEKRYVVQVREDVTPP----------AVVLRVTATDRDKGTNGKVHYSIA  414 (2531)
T ss_pred             EeccCCCCCCCceEEEEEEEEecCCCCccccccceEEEecccCCCC----------ceEEEEEecccCCCcCceEEEEee
Confidence            999998765 569999999999999999999999999999999998          9999999999997   58999999


Q ss_pred             cCC--CcEEEeCCceEEEEccC-CCCCCceEEEEEEEEeCCCCCCcceecEEEEEEEecC-CCCCceeeeccccCCCCcc
Q psy2920         158 VPS--NLVVIVPQTGELLLTGD-PPSDDVEYELTVEAHDLQSPSRYAREPAQVWLQFNSP-VLQDEQLLQMIDRKSEAPH  233 (325)
Q Consensus       158 ~~~--~~F~i~~~tG~i~l~~~-d~e~~~~y~l~V~a~D~g~p~~~s~~t~~v~I~v~d~-~~~P~~~~~~~~~~~~~~~  233 (325)
                      +++  +.|.||..||+|.+..+ |+|.. .|.|.|+|.|+|-|++++  +.-+.|+|.|+ |++|. |...   .+...+
T Consensus       415 Sgn~~G~f~id~~tGel~vv~plD~e~~-~ytl~IrAqDggrPpLsn--~sgl~iqVlDINDhaPi-fvst---pfq~tv  487 (2531)
T KOG4289|consen  415 SGNGRGQFYIDSLTGELDVVEPLDFENS-EYTLRIRAQDGGRPPLSN--TSGLVIQVLDINDHAPI-FVST---PFQATV  487 (2531)
T ss_pred             ccCccccEEEecccceEEEeccccccCC-eeEEEEEcccCCCCCccC--CCceEEEEEecCCCCce-eEec---hhhhhh
Confidence            865  68999999999999999 99987 999999999999999998  66777999995 55555 4432   233444


Q ss_pred             ceeeeeEeeccCCceEEEEEeCCCCccceEEEEEEeCCcccceEeeCCCCceEeCCCeeEEEceeCccccCCCCceEEEE
Q psy2920         234 RIEKRRVTRAVRPTKRIDFSETDGEMEGRIVFALEKETEREKFKIRDENPWITVDENGDVRVKKRWDFEELGPEKTIDFW  313 (325)
Q Consensus       234 ~~~~~~v~~~~~~~~~v~a~d~D~~~~~~i~y~i~~~~~~~~F~i~~~~~~f~i~~tG~i~~~~~lD~E~~~~~~~~~l~  313 (325)
                      .|+ .+++..   +..+.|.|.|.|.|+++.|++.+..   .|.|+.        .+|.|++.|.||||+.   ..|.|.
T Consensus       488 lEn-v~lg~~---v~~vqaidadsg~na~l~y~laG~~---pf~I~~--------~SG~Itvtk~ldrEt~---~~ysl~  549 (2531)
T KOG4289|consen  488 LEN-VPLGYL---VCHVQAIDADSGENARLHYSLAGVG---PFQINN--------GSGWITVTKELDRETV---EHYSLG  549 (2531)
T ss_pred             hhc-ccccce---EEEEecccCCCCcccceeeeeccCC---CeeEec--------CCceEEEeeccccccc---ceEEEE
Confidence            442 233333   4779999999999999999997654   588875        5999999999999999   899999


Q ss_pred             EEEecCCCCC
Q psy2920         314 VTINNQATNG  323 (325)
Q Consensus       314 V~a~D~~~~~  323 (325)
                      |+|+|.|+++
T Consensus       550 V~ard~gtp~  559 (2531)
T KOG4289|consen  550 VEARDHGTPP  559 (2531)
T ss_pred             EEEcCCCCCc
Confidence            9999999764


No 3  
>KOG1219|consensus
Probab=100.00  E-value=1.6e-36  Score=310.44  Aligned_cols=281  Identities=21%  Similarity=0.283  Sum_probs=248.4

Q ss_pred             CCCCCCCceEEeeEEEecCCCCCCCceEEEEEeeCCCCceEEEeCCCCccccccccCCcceEEEecccCCcccccEEEEE
Q psy2920           1 MENSKSGTGVKGLPILVASSHKHPHQLVRYSITQGNDNHTFALKNPTTHEIRNSTESKSGVLLVTSGDIDREEVAQYSLV   80 (325)
Q Consensus         1 ~En~~~Gt~v~~~~a~d~D~~~~~~~~i~y~i~~~~~~~~F~i~~~~g~~~~~~~~~~~~~~l~~~~~LDrE~~~~~~l~   80 (325)
                      +|+.|+|+.++.+.|+|.|.+.+  +.+.|-|..++.+..|.|+..+|             .|.+..+||+|....|.|.
T Consensus       650 ~Edvpigt~la~L~atD~Dtgfn--g~l~yvI~dgne~~~~~Id~qsg-------------~itvas~ld~~~t~~yiLn  714 (4289)
T KOG1219|consen  650 PEDVPIGTTLAILSATDSDTGFN--GKLVYVIEDGNESICFLIDRQSG-------------NITVASPLDNENTEQYILN  714 (4289)
T ss_pred             cccCCCCceEEEEeccCCCCCcC--ceEEEEEeCCccceEEEEecccc-------------eEEEecchhhhhhheeEEE
Confidence            69999999999999999998875  99999999998889999999999             9999999999999999999


Q ss_pred             EEEEeCCCCC-eeEEEEEEEEecCCCCCCccCcceeEEEEeccCCCCCCCcccccceeEEEEEEEEeCCC---CeEEEEE
Q psy2920          81 ITATDVKGIN-KAETHVTVDVLDENDNSPVFSQKVYRFIVGDLKSSLNSNVSHWKKFSSIGSVSATDADG---DKVAYKL  156 (325)
Q Consensus        81 v~a~D~~~~~-~~~~~v~v~V~DvNdn~P~f~~~~~~~~v~E~~~~g~~~~~~~~~~~~v~~v~A~D~D~---~~v~Y~i  156 (325)
                      ++|.|.|.|. ++...+.|.|.|+|||+|+|.+..|.+.|+|+..+|          +.|+.|.|.|.|.   |+++|+|
T Consensus       715 vta~D~gtPqkss~r~l~v~vkd~ndn~p~f~e~sy~vtvsedtepg----------s~Ia~vetnd~D~g~NG~v~fsL  784 (4289)
T KOG1219|consen  715 VTAYDLGTPQKSSWRLLLVFVKDYNDNTPIFVERSYHVTVSEDTEPG----------SFIAHVETNDTDGGNNGMVSFSL  784 (4289)
T ss_pred             EEEecCCCchhhceeeEEEEEEecccCCccccccceEEEEecCCCCC----------ceEEEEEecccCCCCCceEEEEe
Confidence            9999999887 677788899999999999999999999999999999          9999999999997   6999999


Q ss_pred             ecCCCcEEEeCCceEEEEccC-CCCCCceEEEEEEEEeCCCCCCcceecEEEEEEEecCCCCCceeeeccccCCCCccce
Q psy2920         157 AVPSNLVVIVPQTGELLLTGD-PPSDDVEYELTVEAHDLQSPSRYAREPAQVWLQFNSPVLQDEQLLQMIDRKSEAPHRI  235 (325)
Q Consensus       157 ~~~~~~F~i~~~tG~i~l~~~-d~e~~~~y~l~V~a~D~g~p~~~s~~t~~v~I~v~d~~~~P~~~~~~~~~~~~~~~~~  235 (325)
                      ...+..|+||+.||++.+.+. |+|.+..|.|.|+|+|.+.|.+.+  .+.+.|.++|+|++||.+..+..   ...+++
T Consensus       785 ~n~sdvfsIdp~tGivv~~~sLdrE~q~~y~l~I~a~dqp~pq~~s--vv~l~vsvedVndnpPkci~~hs---r~kipe  859 (4289)
T KOG1219|consen  785 LNKSDVFSIDPFTGIVVTSKSLDREGQTSYHLKIEARDQPPPQLFS--VVELDVSVEDVNDNPPKCIIRHS---RSKIPE  859 (4289)
T ss_pred             cCCcceEEecCcccEEEeccccCcccCceeEEEEEEcCCCCCceEE--EEEEEEEEeeccCCCCccccccc---cccCcc
Confidence            999999999999999999999 999999999999999999888877  88888999998776664543321   111111


Q ss_pred             --------------------------------------------------------------------------------
Q psy2920         236 --------------------------------------------------------------------------------  235 (325)
Q Consensus       236 --------------------------------------------------------------------------------  235 (325)
                                                                                                      
T Consensus       860 dlp~gt~~~~l~A~d~diGq~~kvry~l~~~~v~~rvd~~sGavfi~~~LDf~k~~fynLsv~a~d~g~p~lss~chl~V  939 (4289)
T KOG1219|consen  860 DLPYGTVTWQLVALDPDIGQLGKVRYYLTDDTVGERVDFPSGAVFIGKPLDFEKSDFYNLSVTAVDRGTPILSSICHLEV  939 (4289)
T ss_pred             cCCCceEEEEhhhcCcccCcCceeEEEEecCccccccccccccEEEecccccccccceEEEEEEecCCCcceeeeEEEEE
Confidence                                                                                            


Q ss_pred             ------------------eeeeEeeccCC---ceEEEEEeCCCCccceEEEEEEeCCcccceEeeCCCCceEeCCCeeEE
Q psy2920         236 ------------------EKRRVTRAVRP---TKRIDFSETDGEMEGRIVFALEKETEREKFKIRDENPWITVDENGDVR  294 (325)
Q Consensus       236 ------------------~~~~v~~~~~~---~~~v~a~d~D~~~~~~i~y~i~~~~~~~~F~i~~~~~~f~i~~tG~i~  294 (325)
                                        ....|.|+.+.   +..|.|.|.|.|..+.+.|+|..|+.-+.|+|+.        .+|.|.
T Consensus       940 evldv~enlhpp~F~~~v~e~~V~EnapiGT~vi~i~A~dedsgldg~l~Y~I~~gdg~g~FsId~--------~tG~ir 1011 (4289)
T KOG1219|consen  940 EVLDVNENLHPPEFISFVTEGHVLENAPIGTIVIRIQARDEDSGLDGELSYKIRTGDGDGIFSIDS--------TTGSIR 1011 (4289)
T ss_pred             EEeccCCCCCCcchheeeeeeeEeecCCcceEEEEEEEecCCCCccceEEEEEEcCCcceeEEecC--------CcceEe
Confidence                              12334444444   4558999999999999999999999889999986        699999


Q ss_pred             EceeCccccCCCCceEEEEEEEecCCCC
Q psy2920         295 VKKRWDFEELGPEKTIDFWVTINNQATN  322 (325)
Q Consensus       295 ~~~~lD~E~~~~~~~~~l~V~a~D~~~~  322 (325)
                      +.+.||||..   ..|-|+|.|+|.|+.
T Consensus      1012 Tl~~lDrE~k---s~YwltveA~D~gt~ 1036 (4289)
T KOG1219|consen 1012 TLKALDREKK---SSYWLTVEAKDLGTV 1036 (4289)
T ss_pred             echhhchhhc---ceEEEEEEEEecCCC
Confidence            9999999999   999999999999975


No 4  
>KOG1219|consensus
Probab=100.00  E-value=4e-36  Score=307.45  Aligned_cols=268  Identities=24%  Similarity=0.297  Sum_probs=235.5

Q ss_pred             CCCCCCceEEeeEEEecCCCCCCCceEEEEEeeCCCCceEEEeCCCCccccccccCCcceEEEecccCCcccccEEEEEE
Q psy2920           2 ENSKSGTGVKGLPILVASSHKHPHQLVRYSITQGNDNHTFALKNPTTHEIRNSTESKSGVLLVTSGDIDREEVAQYSLVI   81 (325)
Q Consensus         2 En~~~Gt~v~~~~a~d~D~~~~~~~~i~y~i~~~~~~~~F~i~~~~g~~~~~~~~~~~~~~l~~~~~LDrE~~~~~~l~v   81 (325)
                      ||+|+||.|+.+.|.|.|++..  +.+.|+|..++..+.|.|+..+|             .|++.+.||||....|-|++
T Consensus       964 EnapiGT~vi~i~A~dedsgld--g~l~Y~I~~gdg~g~FsId~~tG-------------~irTl~~lDrE~ks~Ywltv 1028 (4289)
T KOG1219|consen  964 ENAPIGTIVIRIQARDEDSGLD--GELSYKIRTGDGDGIFSIDSTTG-------------SIRTLKALDREKKSSYWLTV 1028 (4289)
T ss_pred             ecCCcceEEEEEEEecCCCCcc--ceEEEEEEcCCcceeEEecCCcc-------------eEeechhhchhhcceEEEEE
Confidence            8999999999999999998765  99999999987778999999999             99999999999999999999


Q ss_pred             EEEeCCCCC-eeEEEEEEEEecCCCCCCccCcceeEEEEeccCCCCCCCcccccceeEEEEEEEEeCCC---CeEEEEEe
Q psy2920          82 TATDVKGIN-KAETHVTVDVLDENDNSPVFSQKVYRFIVGDLKSSLNSNVSHWKKFSSIGSVSATDADG---DKVAYKLA  157 (325)
Q Consensus        82 ~a~D~~~~~-~~~~~v~v~V~DvNdn~P~f~~~~~~~~v~E~~~~g~~~~~~~~~~~~v~~v~A~D~D~---~~v~Y~i~  157 (325)
                      .|+|.|.++ ++.+.+.|.|.|+|||+|+|.++.|..+|.|+.+.+          ..|.++.|+|+|.   +++.|.|.
T Consensus      1029 eA~D~gt~~~ssv~~vyI~ieDvNDn~Pq~s~pvy~asI~enSp~~----------vsivq~ea~D~Dsssn~kLmykI~ 1098 (4289)
T KOG1219|consen 1029 EAKDLGTVPLSSVCEVYIEIEDVNDNVPQFSSPVYYASISENSPET----------VSIVQAEANDPDSSSNQKLMYKIT 1098 (4289)
T ss_pred             EEEecCCCccccceeEEEEEEecCCCCcccCCceEeeeeccCCCCc----------eEEEEeccCCCCcccCcceEEEEc
Confidence            999998877 788999999999999999999999999999999998          9999999999995   58999999


Q ss_pred             cC--CCcEEEeCCceEEEEccC-CCCCCceEEEEEEEEeCCCCCCcceecEEEEEEEecCCCCCceeeeccccCCCCccc
Q psy2920         158 VP--SNLVVIVPQTGELLLTGD-PPSDDVEYELTVEAHDLQSPSRYAREPAQVWLQFNSPVLQDEQLLQMIDRKSEAPHR  234 (325)
Q Consensus       158 ~~--~~~F~i~~~tG~i~l~~~-d~e~~~~y~l~V~a~D~g~p~~~s~~t~~v~I~v~d~~~~P~~~~~~~~~~~~~~~~  234 (325)
                      +|  .++|.|++.||.|+..+. |||++++|.|.|.+.|.|.|++.+  +..|.|.|+|+|+.+|.|.+.+.        
T Consensus      1099 sGnyq~FF~Id~~TG~iTt~r~LDRE~qdEHiLeVTi~D~gep~l~s--~~rviV~IldvNdnsp~Flqk~~-------- 1168 (4289)
T KOG1219|consen 1099 SGNYQGFFQIDPETGLITTIRRLDREKQDEHILEVTIQDNGEPWLCS--NQRVIVSILDVNDNSPRFLQKKT-------- 1168 (4289)
T ss_pred             cCCccceEEEccccceeeeehhhcccccccceEEEEEecCCCCcccc--ceEEEEEEeeccCCchhhhhhee--------
Confidence            75  589999999999997777 999999999999999999999999  88999999998888777777532        


Q ss_pred             eeeeeEeeccC-CceEEEEEeCCCCccceEEEEEEeCCcccceEeeCCCCceEeCCCeeEEEceeCccccCCCCceEEEE
Q psy2920         235 IEKRRVTRAVR-PTKRIDFSETDGEMEGRIVFALEKETEREKFKIRDENPWITVDENGDVRVKKRWDFEELGPEKTIDFW  313 (325)
Q Consensus       235 ~~~~~v~~~~~-~~~~v~a~d~D~~~~~~i~y~i~~~~~~~~F~i~~~~~~f~i~~tG~i~~~~~lD~E~~~~~~~~~l~  313 (325)
                        ...+++... +.-++.|.|.|+|.|++|+|+|..|++.  |.|+.        .+| +...+.++-..     .|.+.
T Consensus      1169 --~~~v~~r~s~plyRl~a~d~DeG~narityniedgde~--FsID~--------~t~-vVsss~~~~~~-----eydi~ 1230 (4289)
T KOG1219|consen 1169 --FLRVPERSSPPLYRLAAQDNDEGNNARITYNIEDGDEV--FSIDI--------ATG-VVSSSTLDPAG-----EYDIL 1230 (4289)
T ss_pred             --EEEeeeccCCceeEEEEEecCCCcceEEEEecccCceE--EEEec--------cCc-eEEeeeccCCc-----ceeEe
Confidence              234444333 3668999999999999999999999876  99986        577 55556666643     57666


Q ss_pred             -EEEecCCCC
Q psy2920         314 -VTINNQATN  322 (325)
Q Consensus       314 -V~a~D~~~~  322 (325)
                       |.|+|.|.+
T Consensus      1231 ~Ikatd~g~p 1240 (4289)
T KOG1219|consen 1231 GIKATDRGAP 1240 (4289)
T ss_pred             eEEEecCCCC
Confidence             999998865


No 5  
>cd00031 CA Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here
Probab=99.98  E-value=5.3e-30  Score=220.88  Aligned_cols=186  Identities=34%  Similarity=0.459  Sum_probs=169.5

Q ss_pred             CCCCCCCceEEeeEEEecCCCCCCCceEEEEEeeCCCCceEEEeCCCCccccccccCCcceEEEecccCCcccccEEEEE
Q psy2920           1 MENSKSGTGVKGLPILVASSHKHPHQLVRYSITQGNDNHTFALKNPTTHEIRNSTESKSGVLLVTSGDIDREEVAQYSLV   80 (325)
Q Consensus         1 ~En~~~Gt~v~~~~a~d~D~~~~~~~~i~y~i~~~~~~~~F~i~~~~g~~~~~~~~~~~~~~l~~~~~LDrE~~~~~~l~   80 (325)
                      +||+++|+.|+++.|.|.|.+.  ++.+.|+|.++....+|.|++.+|             .|.+.+.||||....|.|.
T Consensus         7 ~En~~~g~~v~~~~a~D~D~~~--~~~~~y~i~~~~~~~~F~i~~~tG-------------~l~~~~~lD~e~~~~~~l~   71 (199)
T cd00031           7 PENAPPGTVVGTVSATDPDSGE--NGRVTYSILGGNEDGLFSIDPNTG-------------VITTTKPLDREEQSEYTLT   71 (199)
T ss_pred             eCCCCCCCEEEEEEEECCCCCC--CceEEEEEeCCCCcccEEEeCCCC-------------EEEECCCCCCcCCceEEEE
Confidence            6999999999999999999876  589999999876557999999998             8999999999999999999


Q ss_pred             EEEEeCCCCC-eeEEEEEEEEecCCCCCCccCcceeEEEEeccCCCCCCCcccccceeEEEEEEEEeCCC---CeEEEEE
Q psy2920          81 ITATDVKGIN-KAETHVTVDVLDENDNSPVFSQKVYRFIVGDLKSSLNSNVSHWKKFSSIGSVSATDADG---DKVAYKL  156 (325)
Q Consensus        81 v~a~D~~~~~-~~~~~v~v~V~DvNdn~P~f~~~~~~~~v~E~~~~g~~~~~~~~~~~~v~~v~A~D~D~---~~v~Y~i  156 (325)
                      |.|+|.+++. ++...+.|.|.|+|||+|.|....|.+.+.|+.++|          +.++++.|+|+|.   +.+.|+|
T Consensus        72 v~a~D~g~~~~~~~~~v~I~V~d~Nd~~P~~~~~~~~~~v~e~~~~~----------~~i~~~~a~D~D~~~~~~~~y~l  141 (199)
T cd00031          72 VVASDGGGPPLSSTATVTVTVLDVNDNPPVFEQSSYEASVPENAPPG----------TVVGTVTATDADSGENAKLTYSI  141 (199)
T ss_pred             EEEEECCcCcceeEEEEEEEEccCCCCCCcccccceEEEEeCCCCCC----------CEEEEEEEEcCCCCCCccEEEEE
Confidence            9999986655 489999999999999999999899999999999998          9999999999997   7899999


Q ss_pred             ecCC--CcEEEeCCceEEEEccC-CCCCCceEEEEEEEEeCCCCCCcceecEEEEEEEec
Q psy2920         157 AVPS--NLVVIVPQTGELLLTGD-PPSDDVEYELTVEAHDLQSPSRYAREPAQVWLQFNS  213 (325)
Q Consensus       157 ~~~~--~~F~i~~~tG~i~l~~~-d~e~~~~y~l~V~a~D~g~p~~~s~~t~~v~I~v~d  213 (325)
                      .++.  ++|.|++.+|.|.+.+. |+|....|.|.|.|+|.+.+..++  +++++|.+.|
T Consensus       142 ~~~~~~~~f~i~~~~G~i~~~~~ld~e~~~~~~l~v~a~D~~~~~~~~--~~~i~i~v~d  199 (199)
T cd00031         142 LSGNDKELFSIDPNTGIITLAKPLDREEKSSYELTVVATDGGGPPLSS--TATVTVTVLD  199 (199)
T ss_pred             eCCCCCCEEEEeCCceEEEeCCccCCccCceEEEEEEEEECCCCCcee--EEEEEEEEEC
Confidence            9876  79999999999999988 999999999999999999876666  7777777754


No 6  
>cd00031 CA Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here
Probab=99.93  E-value=9.5e-24  Score=181.81  Aligned_cols=174  Identities=24%  Similarity=0.306  Sum_probs=148.9

Q ss_pred             eEEEEeccCCCCCCCcccccceeEEEEEEEEeCCCC---eEEEEEecCC--CcEEEeCCceEEEEccC-CCCCCceEEEE
Q psy2920         115 YRFIVGDLKSSLNSNVSHWKKFSSIGSVSATDADGD---KVAYKLAVPS--NLVVIVPQTGELLLTGD-PPSDDVEYELT  188 (325)
Q Consensus       115 ~~~~v~E~~~~g~~~~~~~~~~~~v~~v~A~D~D~~---~v~Y~i~~~~--~~F~i~~~tG~i~l~~~-d~e~~~~y~l~  188 (325)
                      |.+.+.|+.+.|          +.++++.|.|+|.+   .+.|+|.++.  .+|.|++.+|.|++.+. |||....|.|.
T Consensus         2 ~~~~i~En~~~g----------~~v~~~~a~D~D~~~~~~~~y~i~~~~~~~~F~i~~~tG~l~~~~~lD~e~~~~~~l~   71 (199)
T cd00031           2 YSVSVPENAPPG----------TVVGTVSATDPDSGENGRVTYSILGGNEDGLFSIDPNTGVITTTKPLDREEQSEYTLT   71 (199)
T ss_pred             eEEEEeCCCCCC----------CEEEEEEEECCCCCCCceEEEEEeCCCCcccEEEeCCCCEEEECCCCCCcCCceEEEE
Confidence            788999999998          99999999999984   7999999865  59999999999999998 99999999999


Q ss_pred             EEEEeCCCCCCcceecEEEEEEEecCCCCCceeeeccccCCCCccceeeeeEeeccCC---ceEEEEEeCCCCccceEEE
Q psy2920         189 VEAHDLQSPSRYAREPAQVWLQFNSPVLQDEQLLQMIDRKSEAPHRIEKRRVTRAVRP---TKRIDFSETDGEMEGRIVF  265 (325)
Q Consensus       189 V~a~D~g~p~~~s~~t~~v~I~v~d~~~~P~~~~~~~~~~~~~~~~~~~~~v~~~~~~---~~~v~a~d~D~~~~~~i~y  265 (325)
                      |+|+|+|.|..++  ++.|+|.|.|+|+.||.|...   .|.       ..+.++.++   +..+.|+|.|.+.++.++|
T Consensus        72 v~a~D~g~~~~~~--~~~v~I~V~d~Nd~~P~~~~~---~~~-------~~v~e~~~~~~~i~~~~a~D~D~~~~~~~~y  139 (199)
T cd00031          72 VVASDGGGPPLSS--TATVTVTVLDVNDNPPVFEQS---SYE-------ASVPENAPPGTVVGTVTATDADSGENAKLTY  139 (199)
T ss_pred             EEEEECCcCccee--EEEEEEEEccCCCCCCccccc---ceE-------EEEeCCCCCCCEEEEEEEEcCCCCCCccEEE
Confidence            9999988877655  889999999988887766642   233       344444322   5678999999988899999


Q ss_pred             EEEeCCcccceEeeCCCCceEeCCCeeEEEceeCccccCCCCceEEEEEEEecCCC
Q psy2920         266 ALEKETEREKFKIRDENPWITVDENGDVRVKKRWDFEELGPEKTIDFWVTINNQAT  321 (325)
Q Consensus       266 ~i~~~~~~~~F~i~~~~~~f~i~~tG~i~~~~~lD~E~~~~~~~~~l~V~a~D~~~  321 (325)
                      +|.++.....|.|+.        .+|.|++.+.||||..   ..|+|.|.|+|.+.
T Consensus       140 ~l~~~~~~~~f~i~~--------~~G~i~~~~~ld~e~~---~~~~l~v~a~D~~~  184 (199)
T cd00031         140 SILSGNDKELFSIDP--------NTGIITLAKPLDREEK---SSYELTVVATDGGG  184 (199)
T ss_pred             EEeCCCCCCEEEEeC--------CceEEEeCCccCCccC---ceEEEEEEEEECCC
Confidence            999987656788876        5999999999999998   89999999999974


No 7  
>PF00028 Cadherin:  Cadherin domain;  InterPro: IPR002126 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion.; GO: 0005509 calcium ion binding, 0007156 homophilic cell adhesion, 0016020 membrane; PDB: 2A4E_A 2A4C_B 2O72_A 2QVI_A 1NCJ_A 3Q2W_A 3Q2N_A 3LNH_B 3LNI_A 3Q2L_A ....
Probab=99.65  E-value=3.8e-15  Score=112.60  Aligned_cols=86  Identities=31%  Similarity=0.438  Sum_probs=78.8

Q ss_pred             eEEEEeccCCCCCCCcccccceeEEEEEEEEeCCC---CeEEEEEecCC--CcEEEeCCceEEEEccC-CCCCCceEEEE
Q psy2920         115 YRFIVGDLKSSLNSNVSHWKKFSSIGSVSATDADG---DKVAYKLAVPS--NLVVIVPQTGELLLTGD-PPSDDVEYELT  188 (325)
Q Consensus       115 ~~~~v~E~~~~g~~~~~~~~~~~~v~~v~A~D~D~---~~v~Y~i~~~~--~~F~i~~~tG~i~l~~~-d~e~~~~y~l~  188 (325)
                      |.++|+|+.++|          +.++++.|.|+|.   +.+.|+|..+.  ++|.|++.+|.|++.+. |||....|.|.
T Consensus         1 Y~~~v~E~~~~g----------~~v~~v~a~D~D~~~n~~i~y~i~~~~~~~~F~I~~~tg~i~~~~~LD~E~~~~y~l~   70 (93)
T PF00028_consen    1 YSFSVPENAPPG----------TVVGQVTATDPDSGPNSQITYSILGGNPDGLFSIDPNTGEISLKKPLDRETQSSYQLT   70 (93)
T ss_dssp             EEEEEETTGSTS----------SEEEEEEEEESSTSTTSSEEEEEEETTSTTSEEEETTTTEEEESSSSCTTTTSEEEEE
T ss_pred             CEEEEECCCCCC----------CEEEEEEEEeCCCCCCceEEEEEecCcccCceEEeeeeeccccceecCcccCCEEEEE
Confidence            789999999999          9999999999995   58999999865  89999999999999999 99999999999


Q ss_pred             EEEEeC-CCCCCcceecEEEEEEEe
Q psy2920         189 VEAHDL-QSPSRYAREPAQVWLQFN  212 (325)
Q Consensus       189 V~a~D~-g~p~~~s~~t~~v~I~v~  212 (325)
                      |.|+|. +.|++++  +++|+|+|+
T Consensus        71 v~a~D~~~~~~~~~--~~~V~I~V~   93 (93)
T PF00028_consen   71 VRATDSGGSPPLSS--TATVTINVL   93 (93)
T ss_dssp             EEEEETTTSSEEEE--EEEEEEEEE
T ss_pred             EEEEECCCCCCCEE--EEEEEEEEC
Confidence            999999 7888888  777877763


No 8  
>KOG1834|consensus
Probab=99.64  E-value=2.5e-14  Score=136.12  Aligned_cols=174  Identities=22%  Similarity=0.231  Sum_probs=138.9

Q ss_pred             eEEEecCCCCCCCceE-EEEEeeCCCC-ceEEEeCCCCccccccccCCcceEEEecccCCcccccEEEEEEEEEeCCCCC
Q psy2920          13 LPILVASSHKHPHQLV-RYSITQGNDN-HTFALKNPTTHEIRNSTESKSGVLLVTSGDIDREEVAQYSLVITATDVKGIN   90 (325)
Q Consensus        13 ~~a~d~D~~~~~~~~i-~y~i~~~~~~-~~F~i~~~~g~~~~~~~~~~~~~~l~~~~~LDrE~~~~~~l~v~a~D~~~~~   90 (325)
                      +-|.|.|.+-..-+.| .|.|.+.+.. +.--+|..+|+           ..|+...+||.|.++.|+|+|+|.|.|..+
T Consensus        54 l~aLdkdaplr~ageiC~fklhgq~vPFdavVvdK~TGe-----------gvlRaK~~lDCelqkeytf~iQAydCg~gp  122 (952)
T KOG1834|consen   54 LAALDKDAPLRYAGEICGFKLHGQPVPFDAVVVDKYTGE-----------GVLRAKEPLDCELQKEYTFTIQAYDCGNGP  122 (952)
T ss_pred             eeeecCCCCcccccccceeEecCCCCCceEEEEeccCCc-----------eEEeecCcccccccccceEEEEEEecCCCC
Confidence            5566777653222344 7788765432 23345677762           279999999999999999999999976433


Q ss_pred             -------eeEEEEEEEEecCCCCCCccCcceeEEEEeccCCCCCCCcccccceeEEEEEEEEeCCC----C-eEEEEEec
Q psy2920          91 -------KAETHVTVDVLDENDNSPVFSQKVYRFIVGDLKSSLNSNVSHWKKFSSIGSVSATDADG----D-KVAYKLAV  158 (325)
Q Consensus        91 -------~~~~~v~v~V~DvNdn~P~f~~~~~~~~v~E~~~~g~~~~~~~~~~~~v~~v~A~D~D~----~-~v~Y~i~~  158 (325)
                             +..++|.|+|.|+|+.+|+|..+.|.+.|.|.+.-           ..|++|.|.|.|-    + ...|.|..
T Consensus       123 dgtn~kKShkatvhIrVkDvNe~AP~f~ep~Yka~V~EGK~y-----------d~il~veAiD~DCspq~sqIC~YEI~t  191 (952)
T KOG1834|consen  123 DGTNTKKSHKATVHIRVKDVNEFAPVFKEPWYKAHVTEGKVY-----------DSILRVEAIDKDCSPQYSQICEYEITT  191 (952)
T ss_pred             CccccccccceEEEEEeccccccCchhcccceeeEEecceee-----------eeeEEEEeecCCCCCcccceeEEEecC
Confidence                   35689999999999999999999999999998776           6899999999996    2 46899999


Q ss_pred             CCCcEEEeCCceEEEEccC-CCCCCceEEEEEEEEeCCCCCCcceecEEEEEEE
Q psy2920         159 PSNLVVIVPQTGELLLTGD-PPSDDVEYELTVEAHDLQSPSRYAREPAQVWLQF  211 (325)
Q Consensus       159 ~~~~F~i~~~tG~i~l~~~-d~e~~~~y~l~V~a~D~g~p~~~s~~t~~v~I~v  211 (325)
                      +.-.|.|| +.|.|+.+.+ .|....+|.|+|.|.|.|.-...+  .+.|+|.|
T Consensus       192 ~d~PFaId-n~G~irnTekLny~ke~~Y~ltVtAyDCg~kraa~--d~lV~v~V  242 (952)
T KOG1834|consen  192 PDVPFAID-NDGNIRNTEKLNYTKEHQYKLTVTAYDCGKKRAAS--DSLVTVHV  242 (952)
T ss_pred             CCCceEEc-CCCccccccccccccceeEEEEEEEEecccccccC--cceEEEEe
Confidence            88899996 7899999888 898999999999999999744333  56777777


No 9  
>PF00028 Cadherin:  Cadherin domain;  InterPro: IPR002126 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion.; GO: 0005509 calcium ion binding, 0007156 homophilic cell adhesion, 0016020 membrane; PDB: 2A4E_A 2A4C_B 2O72_A 2QVI_A 1NCJ_A 3Q2W_A 3Q2N_A 3LNH_B 3LNI_A 3Q2L_A ....
Probab=99.63  E-value=5.9e-15  Score=111.56  Aligned_cols=86  Identities=29%  Similarity=0.375  Sum_probs=77.8

Q ss_pred             CCCCCCCceEEeeEEEecCCCCCCCceEEEEEeeCCCCceEEEeCCCCccccccccCCcceEEEecccCCcccccEEEEE
Q psy2920           1 MENSKSGTGVKGLPILVASSHKHPHQLVRYSITQGNDNHTFALKNPTTHEIRNSTESKSGVLLVTSGDIDREEVAQYSLV   80 (325)
Q Consensus         1 ~En~~~Gt~v~~~~a~d~D~~~~~~~~i~y~i~~~~~~~~F~i~~~~g~~~~~~~~~~~~~~l~~~~~LDrE~~~~~~l~   80 (325)
                      +|++++|+.|+++.|.|.|.+.  |+.+.|+|.++++...|.|++.+|             .|.+.++||||....|.|.
T Consensus         6 ~E~~~~g~~v~~v~a~D~D~~~--n~~i~y~i~~~~~~~~F~I~~~tg-------------~i~~~~~LD~E~~~~y~l~   70 (93)
T PF00028_consen    6 PENAPPGTVVGQVTATDPDSGP--NSQITYSILGGNPDGLFSIDPNTG-------------EISLKKPLDRETQSSYQLT   70 (93)
T ss_dssp             ETTGSTSSEEEEEEEEESSTST--TSSEEEEEEETTSTTSEEEETTTT-------------EEEESSSSCTTTTSEEEEE
T ss_pred             ECCCCCCCEEEEEEEEeCCCCC--CceEEEEEecCcccCceEEeeeee-------------ccccceecCcccCCEEEEE
Confidence            6999999999999999999554  599999999988778999999999             8999999999999999999


Q ss_pred             EEEEeC-CCCC-eeEEEEEEEEe
Q psy2920          81 ITATDV-KGIN-KAETHVTVDVL  101 (325)
Q Consensus        81 v~a~D~-~~~~-~~~~~v~v~V~  101 (325)
                      |.|+|. +.++ ++.+.|+|+|+
T Consensus        71 v~a~D~~~~~~~~~~~~V~I~V~   93 (93)
T PF00028_consen   71 VRATDSGGSPPLSSTATVTINVL   93 (93)
T ss_dssp             EEEEETTTSSEEEEEEEEEEEEE
T ss_pred             EEEEECCCCCCCEEEEEEEEEEC
Confidence            999999 5555 67888888874


No 10 
>smart00112 CA Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
Probab=99.58  E-value=2.2e-14  Score=105.04  Aligned_cols=77  Identities=40%  Similarity=0.551  Sum_probs=68.3

Q ss_pred             ecCCCCCCCceEEEEEeeCCCCceEEEeCCCCccccccccCCcceEEEecccCCcccccEEEEEEEEEeCCCCC-eeEEE
Q psy2920          17 VASSHKHPHQLVRYSITQGNDNHTFALKNPTTHEIRNSTESKSGVLLVTSGDIDREEVAQYSLVITATDVKGIN-KAETH   95 (325)
Q Consensus        17 d~D~~~~~~~~i~y~i~~~~~~~~F~i~~~~g~~~~~~~~~~~~~~l~~~~~LDrE~~~~~~l~v~a~D~~~~~-~~~~~   95 (325)
                      |.|.+  .|+.+.|+|.++.....|.|++.+|             .|.+.++||||....|.|.|.|.|.+++. ++.+.
T Consensus         2 D~D~g--~n~~i~Y~i~~~~~~~~F~i~~~tg-------------~i~~~~~LD~e~~~~y~l~v~a~D~~~~~~~~~~~   66 (79)
T smart00112        2 DADSG--ENGKVTYSILSGNEDGLFSIDPETG-------------EITTTKPLDREEQPEYTLTVEATDGGGPPLSSTAT   66 (79)
T ss_pred             CCCCC--cCcEEEEEEecCCCCCEEEEeCCcc-------------EEEeCCccCeeCCCeEEEEEEEEECCCCCcccEEE
Confidence            45554  4688999999876558999999998             89999999999999999999999998765 78899


Q ss_pred             EEEEEecCCCCCC
Q psy2920          96 VTVDVLDENDNSP  108 (325)
Q Consensus        96 v~v~V~DvNdn~P  108 (325)
                      |.|.|.|+|||+|
T Consensus        67 v~I~V~D~Nd~~P   79 (79)
T smart00112       67 VTVTVLDVNDNAP   79 (79)
T ss_pred             EEEEEEECCCCCC
Confidence            9999999999998


No 11 
>KOG1834|consensus
Probab=99.53  E-value=3.5e-13  Score=128.44  Aligned_cols=191  Identities=16%  Similarity=0.154  Sum_probs=137.3

Q ss_pred             EEEecCCCCCCccCcceeEEEEeccCCCCCCCcccccceeEEEEEEEEeCCC-----C-eEEEEEecCCCcEEE---eCC
Q psy2920          98 VDVLDENDNSPVFSQKVYRFIVGDLKSSLNSNVSHWKKFSSIGSVSATDADG-----D-KVAYKLAVPSNLVVI---VPQ  168 (325)
Q Consensus        98 v~V~DvNdn~P~f~~~~~~~~v~E~~~~g~~~~~~~~~~~~v~~v~A~D~D~-----~-~v~Y~i~~~~~~F~i---~~~  168 (325)
                      ....-+|-+.|+. ...|...|.||-..-          ...-.+.|-|.|.     | ..-|.|.+..-.|.+   |+.
T Consensus        21 ~~aarankhkpwi-e~ey~gvV~Endntv----------ll~Ppl~aLdkdaplr~ageiC~fklhgq~vPFdavVvdK~   89 (952)
T KOG1834|consen   21 HHAARANKHKPWI-EEEYHGVVTENDNTV----------LLDPPLAALDKDAPLRYAGEICGFKLHGQPVPFDAVVVDKY   89 (952)
T ss_pred             cccccccccCccc-ccceeEEEEeCCceE----------EeCCCeeeecCCCCcccccccceeEecCCCCCceEEEEecc
Confidence            4455677788865 468999999986532          2233588999997     3 467888886556755   788


Q ss_pred             ceE--EEEccC-CCCCCceEEEEEEEEeCCCCC-Cc---ceecEEEEEEEecCCCCCceeeeccccCCCCccceeeeeEe
Q psy2920         169 TGE--LLLTGD-PPSDDVEYELTVEAHDLQSPS-RY---AREPAQVWLQFNSPVLQDEQLLQMIDRKSEAPHRIEKRRVT  241 (325)
Q Consensus       169 tG~--i~l~~~-d~e~~~~y~l~V~a~D~g~p~-~~---s~~t~~v~I~v~d~~~~P~~~~~~~~~~~~~~~~~~~~~v~  241 (325)
                      ||+  |+.+.+ |.|.+..|+++|+|.|.|.-+ ..   .+.-++|+|+|.|+|+..|+|..+   .|.+-+.|  -.+.
T Consensus        90 TGegvlRaK~~lDCelqkeytf~iQAydCg~gpdgtn~kKShkatvhIrVkDvNe~AP~f~ep---~Yka~V~E--GK~y  164 (952)
T KOG1834|consen   90 TGEGVLRAKEPLDCELQKEYTFTIQAYDCGNGPDGTNTKKSHKATVHIRVKDVNEFAPVFKEP---WYKAHVTE--GKVY  164 (952)
T ss_pred             CCceEEeecCcccccccccceEEEEEEecCCCCCccccccccceEEEEEeccccccCchhccc---ceeeEEec--ceee
Confidence            876  555555 999999999999999998533 11   123689999999977666657665   45555554  1222


Q ss_pred             eccCCceEEEEEeCCCCc-cceE-EEEEEeCCcccceEeeCCCCceEeCCCeeEEEceeCccccCCCCceEEEEEEEecC
Q psy2920         242 RAVRPTKRIDFSETDGEM-EGRI-VFALEKETEREKFKIRDENPWITVDENGDVRVKKRWDFEELGPEKTIDFWVTINNQ  319 (325)
Q Consensus       242 ~~~~~~~~v~a~d~D~~~-~~~i-~y~i~~~~~~~~F~i~~~~~~f~i~~tG~i~~~~~lD~E~~~~~~~~~l~V~a~D~  319 (325)
                      ..   +.++.|+|.|.++ ++.| .|.|.           +.+-+|.|++.|.|+.+.+|.|...   +.|.|+|+|-|+
T Consensus       165 d~---il~veAiD~DCspq~sqIC~YEI~-----------t~d~PFaIdn~G~irnTekLny~ke---~~Y~ltVtAyDC  227 (952)
T KOG1834|consen  165 DS---ILRVEAIDKDCSPQYSQICEYEIT-----------TPDVPFAIDNDGNIRNTEKLNYTKE---HQYKLTVTAYDC  227 (952)
T ss_pred             ee---eEEEEeecCCCCCcccceeEEEec-----------CCCCceEEcCCCccccccccccccc---eeEEEEEEEEec
Confidence            22   4789999999765 4444 45554           3334666678999999999999998   999999999998


Q ss_pred             CC
Q psy2920         320 AT  321 (325)
Q Consensus       320 ~~  321 (325)
                      |-
T Consensus       228 g~  229 (952)
T KOG1834|consen  228 GK  229 (952)
T ss_pred             cc
Confidence            74


No 12 
>smart00112 CA Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
Probab=99.39  E-value=3.5e-12  Score=93.25  Aligned_cols=72  Identities=33%  Similarity=0.374  Sum_probs=64.1

Q ss_pred             eCCC---CeEEEEEecCC--CcEEEeCCceEEEEccC-CCCCCceEEEEEEEEeCCCCCCcceecEEEEEEEecCCCCCc
Q psy2920         146 DADG---DKVAYKLAVPS--NLVVIVPQTGELLLTGD-PPSDDVEYELTVEAHDLQSPSRYAREPAQVWLQFNSPVLQDE  219 (325)
Q Consensus       146 D~D~---~~v~Y~i~~~~--~~F~i~~~tG~i~l~~~-d~e~~~~y~l~V~a~D~g~p~~~s~~t~~v~I~v~d~~~~P~  219 (325)
                      |+|.   +.+.|+|.++.  .+|+|++.+|.|++.+. |||....|.|.|+|+|.+.|++++  +++|+|+|.|+|++||
T Consensus         2 D~D~g~n~~i~Y~i~~~~~~~~F~i~~~tg~i~~~~~LD~e~~~~y~l~v~a~D~~~~~~~~--~~~v~I~V~D~Nd~~P   79 (79)
T smart00112        2 DADSGENGKVTYSILSGNEDGLFSIDPETGEITTTKPLDREEQPEYTLTVEATDGGGPPLSS--TATVTVTVLDVNDNAP   79 (79)
T ss_pred             CCCCCcCcEEEEEEecCCCCCEEEEeCCccEEEeCCccCeeCCCeEEEEEEEEECCCCCccc--EEEEEEEEEECCCCCC
Confidence            6676   47999999865  79999999999988888 999999999999999999987777  8999999999888775


No 13 
>PF08266 Cadherin_2:  Cadherin-like;  InterPro: IPR013164 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion. This entry represents a cadherin domain that is usually found at the N terminus of cadherin proteins.; PDB: 1WUZ_A 1WYJ_A.
Probab=97.23  E-value=0.00019  Score=52.74  Aligned_cols=61  Identities=21%  Similarity=0.251  Sum_probs=36.5

Q ss_pred             CCCCCCCceEEeeEEEecCCCCCCCceEEEEEeeCCCCceEEEeCCCCccccccccCCcceEEEecccCCccccc
Q psy2920           1 MENSKSGTGVKGLPILVASSHKHPHQLVRYSITQGNDNHTFALKNPTTHEIRNSTESKSGVLLVTSGDIDREEVA   75 (325)
Q Consensus         1 ~En~~~Gt~v~~~~a~d~D~~~~~~~~i~y~i~~~~~~~~F~i~~~~g~~~~~~~~~~~~~~l~~~~~LDrE~~~   75 (325)
                      +|..++||.||.|. .|.-.+........++|.+.....+|.++..+|             .|.+...+|||.-+
T Consensus         8 ~EE~~~Gt~IGnia-~dL~l~~~~l~~~~~ri~s~~~~~~~~v~~~tG-------------~L~v~~rIDRE~LC   68 (84)
T PF08266_consen    8 PEEMPPGTVIGNIA-KDLGLDPQSLSSRNFRIVSEGNSQYFRVNEKTG-------------DLFVSERIDREELC   68 (84)
T ss_dssp             ESS--TT-EEEECC-CCCT--HHHHCCTTBEEE-SSSS-SEEE-TTTS-------------EEEESS--SCCCC-
T ss_pred             ecCCCCCCEEEEhH-HhhCCCcccccccceEEeecCCcceeEecCCce-------------eEEeCCccCHHHHC
Confidence            58999999999983 333221111123467777766678999999999             99999999999753


No 14 
>PF08758 Cadherin_pro:  Cadherin prodomain like;  InterPro: IPR014868 Cadherins are a group of proteins that mediate calcium dependent cell-cell adhesion. They are activated through cleavage of a prosequence in the late Golgi. This protein corresponds to the folded region of the prosequence, and is termed the prodomain. The prodomain shows structural resemblance to the cadherin domain, but lacks all the features known to be important for cadherin-cadherin interactions []. ; GO: 0007155 cell adhesion, 0016021 integral to membrane; PDB: 1OP4_A.
Probab=96.53  E-value=0.02  Score=42.59  Aligned_cols=78  Identities=26%  Similarity=0.338  Sum_probs=41.9

Q ss_pred             CCCccCcceeEEEEeccCCCCCCCcccccceeEEEEEEEEeCCCC-eEEEEEecCCCcEEEeCCceEEEEccCCCCCCce
Q psy2920         106 NSPVFSQKVYRFIVGDLKSSLNSNVSHWKKFSSIGSVSATDADGD-KVAYKLAVPSNLVVIVPQTGELLLTGDPPSDDVE  184 (325)
Q Consensus       106 n~P~f~~~~~~~~v~E~~~~g~~~~~~~~~~~~v~~v~A~D~D~~-~v~Y~i~~~~~~F~i~~~tG~i~l~~~d~e~~~~  184 (325)
                      ..|-|....|.+.|+.+...|          ..|++|.-.|-..+ .+.|.-.++  .|+|. ..|.|+++..-......
T Consensus         2 C~pGF~~~~~~~~Vp~~l~~g----------~~lg~V~f~dC~~~~~~~~~ssDp--dF~V~-~DGsVy~~r~v~l~~~~   68 (90)
T PF08758_consen    2 CRPGFSQKKYTFEVPSNLEAG----------QPLGKVNFEDCTGRRRVIFESSDP--DFRVL-EDGSVYAKRPVQLSSEQ   68 (90)
T ss_dssp             ---B--S-EEEE----SS-SS------------EEE---B--SS---EEEE---S--EEEEE-TTTEEEEES--S-SSS-
T ss_pred             CcCCcccceEEEEcCchhhCC----------cEEEEEEeccCCCCCceEEecCCC--CEEEc-CCCeEEEeeeEecCCCc
Confidence            358899999999999998888          99999999999774 688776654  89996 68999998882222333


Q ss_pred             EEEEEEEEeCCC
Q psy2920         185 YELTVEAHDLQS  196 (325)
Q Consensus       185 y~l~V~a~D~g~  196 (325)
                      -.+.|.|+|..+
T Consensus        69 ~~F~V~a~D~~~   80 (90)
T PF08758_consen   69 RSFTVHAWDSQT   80 (90)
T ss_dssp             EEEEEEEEETTT
T ss_pred             eEEEEEEECCCC
Confidence            579999999775


No 15 
>smart00736 CADG Dystroglycan-type cadherin-like domains. Cadherin-homologous domains present in metazoan dystroglycans and alpha/epsilon sarcoglycans, yeast Axl2p and in a very large protein from magnetotactic bacteria. Likely to bind calcium ions.
Probab=96.06  E-value=0.13  Score=38.80  Aligned_cols=68  Identities=26%  Similarity=0.240  Sum_probs=50.4

Q ss_pred             EEeCCCCeEEEEEecC-----CCcEEEeCCceEEEEccCCCCCCceEEEEEEEEeCCCCCCcceecEEEEEEEecCCC
Q psy2920         144 ATDADGDKVAYKLAVP-----SNLVVIVPQTGELLLTGDPPSDDVEYELTVEAHDLQSPSRYAREPAQVWLQFNSPVL  216 (325)
Q Consensus       144 A~D~D~~~v~Y~i~~~-----~~~F~i~~~tG~i~l~~~d~e~~~~y~l~V~a~D~g~p~~~s~~t~~v~I~v~d~~~  216 (325)
                      ..|+|...++|++...     ..|...++.++.+.=... .+....|.+.|.|+|+.+  .+.  ...+.|.|.+.++
T Consensus        24 F~d~d~~~lty~~~~~~~~~lP~Wl~fd~~~~~~~GtP~-~~~~g~~~i~v~a~D~~g--~~~--~~~f~i~V~~~~~   96 (97)
T smart00736       24 FTDADGDTLTYSATLSDGSALPSWLSFDSDTGTLSGTPT-NSDVGSLSLKVTATDSSG--ASA--SDTFTITVVNTND   96 (97)
T ss_pred             eECCCCCeEEEEEEeCCCCCCCCeEEEeCCCCEEEEECC-CCCCcEEEEEEEEEECCC--CEE--EEEEEEEEeCCCC
Confidence            4688877999999742     358999999988877544 434577999999999876  333  5678888876554


No 16 
>PF08266 Cadherin_2:  Cadherin-like;  InterPro: IPR013164 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion. This entry represents a cadherin domain that is usually found at the N terminus of cadherin proteins.; PDB: 1WUZ_A 1WYJ_A.
Probab=96.03  E-value=0.018  Score=42.28  Aligned_cols=57  Identities=18%  Similarity=0.297  Sum_probs=37.8

Q ss_pred             eeEEEEeccCCCCCCCcccccceeEEEEEEEEeCCC-----CeEEEEEec--CCCcEEEeCCceEEEEccC-CCCC
Q psy2920         114 VYRFIVGDLKSSLNSNVSHWKKFSSIGSVSATDADG-----DKVAYKLAV--PSNLVVIVPQTGELLLTGD-PPSD  181 (325)
Q Consensus       114 ~~~~~v~E~~~~g~~~~~~~~~~~~v~~v~A~D~D~-----~~v~Y~i~~--~~~~F~i~~~tG~i~l~~~-d~e~  181 (325)
                      +..++|+|..++|          +.|+.+ |.|.-.     ..-.|+|.+  ...+|.++..+|.|++... |||.
T Consensus         2 qi~YsV~EE~~~G----------t~IGni-a~dL~l~~~~l~~~~~ri~s~~~~~~~~v~~~tG~L~v~~rIDRE~   66 (84)
T PF08266_consen    2 QIRYSVPEEMPPG----------TVIGNI-AKDLGLDPQSLSSRNFRIVSEGNSQYFRVNEKTGDLFVSERIDREE   66 (84)
T ss_dssp             EEEEEEESS--TT-----------EEEEC-CCCCT--HHHHCCTTBEEE-SSSS-SEEE-TTTSEEEESS--SCCC
T ss_pred             CeEEEeecCCCCC----------CEEEEh-HHhhCCCcccccccceEEeecCCcceeEecCCceeEEeCCccCHHH
Confidence            3467899999999          999998 444432     134566665  4579999999999999988 9986


No 17 
>smart00736 CADG Dystroglycan-type cadherin-like domains. Cadherin-homologous domains present in metazoan dystroglycans and alpha/epsilon sarcoglycans, yeast Axl2p and in a very large protein from magnetotactic bacteria. Likely to bind calcium ions.
Probab=94.44  E-value=0.83  Score=34.27  Aligned_cols=64  Identities=28%  Similarity=0.360  Sum_probs=48.1

Q ss_pred             ceEEEEEeeCC---CCceEEEeCCCCccccccccCCcceEEEecccCCcccccEEEEEEEEEeCCCCCeeEEEEEEEEec
Q psy2920          26 QLVRYSITQGN---DNHTFALKNPTTHEIRNSTESKSGVLLVTSGDIDREEVAQYSLVITATDVKGINKAETHVTVDVLD  102 (325)
Q Consensus        26 ~~i~y~i~~~~---~~~~F~i~~~~g~~~~~~~~~~~~~~l~~~~~LDrE~~~~~~l~v~a~D~~~~~~~~~~v~v~V~D  102 (325)
                      ..++|++...+   .+.+.++++.++             .+.- .+.. +....|.+++.|+|..| .+....++|.|.+
T Consensus        30 ~~lty~~~~~~~~~lP~Wl~fd~~~~-------------~~~G-tP~~-~~~g~~~i~v~a~D~~g-~~~~~~f~i~V~~   93 (97)
T smart00736       30 DTLTYSATLSDGSALPSWLSFDSDTG-------------TLSG-TPTN-SDVGSLSLKVTATDSSG-ASASDTFTITVVN   93 (97)
T ss_pred             CeEEEEEEeCCCCCCCCeEEEeCCCC-------------EEEE-ECCC-CCCcEEEEEEEEEECCC-CEEEEEEEEEEeC
Confidence            67999987543   246889998876             5554 3333 33567999999999866 5677889999999


Q ss_pred             CCC
Q psy2920         103 END  105 (325)
Q Consensus       103 vNd  105 (325)
                      .|+
T Consensus        94 ~~~   96 (97)
T smart00736       94 TND   96 (97)
T ss_pred             CCC
Confidence            987


No 18 
>PF13750 Big_3_3:  Bacterial Ig-like domain (group 3)
Probab=93.74  E-value=2.8  Score=34.60  Aligned_cols=122  Identities=16%  Similarity=0.140  Sum_probs=60.3

Q ss_pred             CCceEEEEE-EEEeCCCCCCcceecEEEEEEEecCCCCCceeeeccccCCCCccceeeeeEeeccCCceEEEEEeCCCCc
Q psy2920         181 DDVEYELTV-EAHDLQSPSRYAREPAQVWLQFNSPVLQDEQLLQMIDRKSEAPHRIEKRRVTRAVRPTKRIDFSETDGEM  259 (325)
Q Consensus       181 ~~~~y~l~V-~a~D~g~p~~~s~~t~~v~I~v~d~~~~P~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~v~a~d~D~~~  259 (325)
                      ..+.|.|.+ .|.|..+-.  +  +..+..++ .+|..||.+.-.....    . .+...+....  ...+.++|.-.+.
T Consensus        13 ~dG~Y~l~~~~a~D~agN~--~--~~~~~~~~-~iD~T~Ptisi~~~~~----~-~~g~~v~~~~--~i~i~~tD~~~~~   80 (158)
T PF13750_consen   13 PDGSYTLTVVTATDAAGNT--S--TSTVSETF-TIDNTPPTISISDGAS----V-ANGSTVYGLV--NISINVTDNSDDS   80 (158)
T ss_pred             CCccEEEEEEEEEecCCCE--E--EEEEeeEE-EEcCCCCEEEEecCCc----c-CCCcccccee--eeEEEEEeCCCCc
Confidence            347899999 799977632  2  22333333 2466666443311100    0 0011111111  1235666655443


Q ss_pred             cceEEEEEEeCCcccceEeeCCCCceEeCCCeeEEEc--eeC-ccccCCCCceEEEEEEEecCCCCC
Q psy2920         260 EGRIVFALEKETEREKFKIRDENPWITVDENGDVRVK--KRW-DFEELGPEKTIDFWVTINNQATNG  323 (325)
Q Consensus       260 ~~~i~y~i~~~~~~~~F~i~~~~~~f~i~~tG~i~~~--~~l-D~E~~~~~~~~~l~V~a~D~~~~~  323 (325)
                       .-...+|.+|.-...-.+...    .. ..|...+.  +.+ ..|..   ..|.|+|.|+|.+++.
T Consensus        81 -~i~sv~l~Gg~~~d~v~ls~~----~~-~~~~~~~~yp~~fpsle~~---~~YtLtV~a~D~aGN~  138 (158)
T PF13750_consen   81 -KITSVSLTGGPASDSVSLSWT----NK-GNGVYTLEYPRIFPSLEAD---DSYTLTVSATDKAGNQ  138 (158)
T ss_pred             -eEEEEEEECCcccceEEEeeE----ec-cCceEEeecccccCCcCCC---CeEEEEEEEEecCCCE
Confidence             445678877653332222210    01 23433332  111 12455   7999999999998875


No 19 
>PF08758 Cadherin_pro:  Cadherin prodomain like;  InterPro: IPR014868 Cadherins are a group of proteins that mediate calcium dependent cell-cell adhesion. They are activated through cleavage of a prosequence in the late Golgi. This protein corresponds to the folded region of the prosequence, and is termed the prodomain. The prodomain shows structural resemblance to the cadherin domain, but lacks all the features known to be important for cadherin-cadherin interactions []. ; GO: 0007155 cell adhesion, 0016021 integral to membrane; PDB: 1OP4_A.
Probab=91.96  E-value=2.2  Score=31.65  Aligned_cols=66  Identities=11%  Similarity=0.103  Sum_probs=34.9

Q ss_pred             CCCCCCCceEEeeEEEecCCCCCCCceEEEEEeeCCCCceEEEeCCCCccccccccCCcceEEEecccCCcccccEEEEE
Q psy2920           1 MENSKSGTGVKGLPILVASSHKHPHQLVRYSITQGNDNHTFALKNPTTHEIRNSTESKSGVLLVTSGDIDREEVAQYSLV   80 (325)
Q Consensus         1 ~En~~~Gt~v~~~~a~d~D~~~~~~~~i~y~i~~~~~~~~F~i~~~~g~~~~~~~~~~~~~~l~~~~~LDrE~~~~~~l~   80 (325)
                      |.+...|..|+.+...|-...    ..+.|.-..    ..|+|...+              .|.+.+++...... -.+.
T Consensus        16 p~~l~~g~~lg~V~f~dC~~~----~~~~~~ssD----pdF~V~~DG--------------sVy~~r~v~l~~~~-~~F~   72 (90)
T PF08758_consen   16 PSNLEAGQPLGKVNFEDCTGR----RRVIFESSD----PDFRVLEDG--------------SVYAKRPVQLSSEQ-RSFT   72 (90)
T ss_dssp             --SS-SS--EEE---B--SS-------EEEE-------SEEEEETTT--------------EEEEES--S-SSS--EEEE
T ss_pred             CchhhCCcEEEEEEeccCCCC----CceEEecCC----CCEEEcCCC--------------eEEEeeeEecCCCc-eEEE
Confidence            455678999999988777432    457776643    379999865              78999988765443 4789


Q ss_pred             EEEEeCCCC
Q psy2920          81 ITATDVKGI   89 (325)
Q Consensus        81 v~a~D~~~~   89 (325)
                      |.|.|..+.
T Consensus        73 V~a~D~~~~   81 (90)
T PF08758_consen   73 VHAWDSQTQ   81 (90)
T ss_dssp             EEEEETTTT
T ss_pred             EEEECCCCC
Confidence            999998663


No 20 
>TIGR01965 VCBS_repeat VCBS repeat. This domain of about 100 residues is found multiple (up to 35) copies in long proteins from several species of Vibrio, Colwellia, Bradyrhizobium, and Shewanella (hence the name VCBS) and in smaller copy numbers in proteins from several other bacteria. The large protein size and repeat copy numbers, species distribution, and suggested activities of several member proteins suggests a role for this domain in adhesion.
Probab=91.75  E-value=4.4  Score=30.64  Aligned_cols=89  Identities=17%  Similarity=0.135  Sum_probs=55.1

Q ss_pred             EeeEEEecCCCCCCCceEEEEEee-CCCCceEEEeCCCCccccccccCCcceEEEec------ccCCcccccEEEEEEEE
Q psy2920          11 KGLPILVASSHKHPHQLVRYSITQ-GNDNHTFALKNPTTHEIRNSTESKSGVLLVTS------GDIDREEVAQYSLVITA   83 (325)
Q Consensus        11 ~~~~a~d~D~~~~~~~~i~y~i~~-~~~~~~F~i~~~~g~~~~~~~~~~~~~~l~~~------~~LDrE~~~~~~l~v~a   83 (325)
                      ++|.+.|+|.++.    .++++.. .+.-+.|.|+....            ....+.      +.|.-...-.-.|++.+
T Consensus         2 G~Lt~sD~D~gd~----~~~s~~~~~g~yGtlti~~~G~------------wtYtl~n~~~avq~L~~Ge~~tdsFtvtv   65 (99)
T TIGR01965         2 GQLTISDADAGQA----HFIAQTDAAGQYGTFSIDADGQ------------WTYQADNSQTAVQALKAGETLTDTFTVTS   65 (99)
T ss_pred             CceEEeCCCCCCc----eEEecccccCCcEEEEECCCCc------------EEEEeCCCcHHHHhhcCCCEEEEEEEEEE
Confidence            4678889987754    5666642 22346788887322            122222      22444444456777888


Q ss_pred             EeCCCCCeeEEEEEEEEecCCCCCCccCcceeEEEEecc
Q psy2920          84 TDVKGINKAETHVTVDVLDENDNSPVFSQKVYRFIVGDL  122 (325)
Q Consensus        84 ~D~~~~~~~~~~v~v~V~DvNdn~P~f~~~~~~~~v~E~  122 (325)
                      .|.     .+..|.|+|.-.|| +|+.... -...+.|+
T Consensus        66 ~DG-----tt~~vtItI~GtND-apvi~~~-~~g~v~ED   97 (99)
T TIGR01965        66 ADG-----TSQTVTITITGAND-AAVIGGA-DTGSVTED   97 (99)
T ss_pred             eCC-----CeEEEEEEEEccCC-CCEEecc-cceeEecC
Confidence            884     38889999999999 7755432 23556554


No 21 
>PF05345 He_PIG:  Putative Ig domain;  InterPro: IPR008009 This alignment represents the conserved core region of a ~90 residue repeat found in several haemagglutinins and other cell surface proteins. Sequence similarities to Hyalin (IPR003410 from INTERPRO) and the PKD domain (IPR000601 from INTERPRO) suggest an Ig-like fold so this family may be similar in function to the (IPR003791 from INTERPRO) and (IPR003790 from INTERPRO) protein families.
Probab=89.27  E-value=1.9  Score=27.99  Aligned_cols=35  Identities=26%  Similarity=0.248  Sum_probs=27.5

Q ss_pred             CcEEEeCCceEEEEccCCCCCCceEEEEEEEEeCC
Q psy2920         161 NLVVIVPQTGELLLTGDPPSDDVEYELTVEAHDLQ  195 (325)
Q Consensus       161 ~~F~i~~~tG~i~l~~~d~e~~~~y~l~V~a~D~g  195 (325)
                      .+..||+.+|.|.=....-.....|.+.|.|+|..
T Consensus        14 ~gLs~d~~tG~isGtp~~~~~~G~y~~~vtatd~~   48 (49)
T PF05345_consen   14 SGLSLDPSTGTISGTPTSSVQPGTYTFTVTATDGS   48 (49)
T ss_pred             CcEEEeCCCCEEEeecCCCccccEEEEEEEEEcCC
Confidence            57889999999987765222346899999999965


No 22 
>KOG3597|consensus
Probab=88.68  E-value=13  Score=36.03  Aligned_cols=156  Identities=10%  Similarity=0.068  Sum_probs=85.0

Q ss_pred             eEEEEEEEEecCCCCCCccCcceeEEEEeccCCCCCCCcccccceeEEEEEEEEeCCCC--eEEEEEecCCC------cE
Q psy2920          92 AETHVTVDVLDENDNSPVFSQKVYRFIVGDLKSSLNSNVSHWKKFSSIGSVSATDADGD--KVAYKLAVPSN------LV  163 (325)
Q Consensus        92 ~~~~v~v~V~DvNdn~P~f~~~~~~~~v~E~~~~g~~~~~~~~~~~~v~~v~A~D~D~~--~v~Y~i~~~~~------~F  163 (325)
                      .+..+.|.|..+||.+..+....+.+.+.|....-          .....+.+.|+|.+  .+.|++....+      .|
T Consensus        24 ~~~~~~i~v~pvndpp~~~~~~~~~l~~~~~~~k~----------l~~~~l~~~d~d~~~~~l~f~v~~t~~~~~~~~~~   93 (442)
T KOG3597|consen   24 QTDVLRIHVNPVNDPPSLIFPSGSLLVILEGGQKV----------LDPELLTAADPDSAPLPLEFQVLGTSSVPLPVLKF   93 (442)
T ss_pred             EEeeecccccccCCCcceeecccceEEeecCCcee----------ccceEeeccCCCCCccceEEEEccCCCCCCcccee
Confidence            35567899999999777777666778888776543          55567899999984  68888886211      13


Q ss_pred             EEe-----------CCceEEEEccCCCCCCceEEEEEEEEeCCCCCCcceecEEEEEEEecCCCCCceeeeccccCCCCc
Q psy2920         164 VIV-----------PQTGELLLTGDPPSDDVEYELTVEAHDLQSPSRYAREPAQVWLQFNSPVLQDEQLLQMIDRKSEAP  232 (325)
Q Consensus       164 ~i~-----------~~tG~i~l~~~d~e~~~~y~l~V~a~D~g~p~~~s~~t~~v~I~v~d~~~~P~~~~~~~~~~~~~~  232 (325)
                      ..-           -..|.+.+.-. -..-..+.+..+++|+-.   .+  .  ..|.  -+-..++.+....  .....
T Consensus        94 ~~~g~~~~~Fs~~~v~~g~~~yvh~-g~el~~~~~~~~~SDg~~---~S--~--~~i~--~~~~~~~~~~~~~--~~gL~  161 (442)
T KOG3597|consen   94 DVPGAPATEFSYEEVEDGSLSYVHS-GTELRESELQLRVSDGLL---VS--E--RAIL--KVEATGPAPHLAR--NTGLK  161 (442)
T ss_pred             eccCCcccceEehHhhcCceeEEec-CcccccceEEEEeecceE---ee--e--eEEe--cccCCCcceeeec--ccceE
Confidence            331           22333333222 112456778888888552   11  1  1111  1222222111110  10000


Q ss_pred             cce-eeeeEeeccCCceEEEEEeCCCCccceEEEEEEeCCccc
Q psy2920         233 HRI-EKRRVTRAVRPTKRIDFSETDGEMEGRIVFALEKETERE  274 (325)
Q Consensus       233 ~~~-~~~~v~~~~~~~~~v~a~d~D~~~~~~i~y~i~~~~~~~  274 (325)
                      +.. ..+.+..     .-+.+.|.|.+..-.+.|.|......|
T Consensus       162 v~~gS~~~IT~-----~~L~ved~d~~~d~~v~~~i~~~P~~G  199 (442)
T KOG3597|consen  162 VLQGSTAPITP-----SNLSVEDNDSSPDDEVRYDITPPPQHG  199 (442)
T ss_pred             EccCccccccH-----hHceeecCCCCCCcEEEEEecCCCCCC
Confidence            000 0122222     337888888666678899988765544


No 23 
>PF13750 Big_3_3:  Bacterial Ig-like domain (group 3)
Probab=87.57  E-value=15  Score=30.35  Aligned_cols=111  Identities=26%  Similarity=0.220  Sum_probs=58.8

Q ss_pred             ccEEEEEE-EEEeCCCCCeeEEEEEEEEecCCCCCCccCcceeEEEEeccCCCCCCCcccccceeEEEEEEEEeCCCC--
Q psy2920          74 VAQYSLVI-TATDVKGINKAETHVTVDVLDENDNSPVFSQKVYRFIVGDLKSSLNSNVSHWKKFSSIGSVSATDADGD--  150 (325)
Q Consensus        74 ~~~~~l~v-~a~D~~~~~~~~~~v~v~V~DvNdn~P~f~~~~~~~~v~E~~~~g~~~~~~~~~~~~v~~v~A~D~D~~--  150 (325)
                      -..|.+.+ .|.|.-| +..+..+...+. +...+|.+.-       .....+...   .+-.|..=.++.++|....  
T Consensus        14 dG~Y~l~~~~a~D~ag-N~~~~~~~~~~~-iD~T~Ptisi-------~~~~~~~~g---~~v~~~~~i~i~~tD~~~~~~   81 (158)
T PF13750_consen   14 DGSYTLTVVTATDAAG-NTSTSTVSETFT-IDNTPPTISI-------SDGASVANG---STVYGLVNISINVTDNSDDSK   81 (158)
T ss_pred             CccEEEEEEEEEecCC-CEEEEEEeeEEE-EcCCCCEEEE-------ecCCccCCC---ccccceeeeEEEEEeCCCCce
Confidence            45799999 7999866 333444433333 3445775532       111111100   0011234456778777763  


Q ss_pred             eEEEEEecCC--CcE--EE-eCCceEEEEccC----CCCCCceEEEEEEEEeCCC
Q psy2920         151 KVAYKLAVPS--NLV--VI-VPQTGELLLTGD----PPSDDVEYELTVEAHDLQS  196 (325)
Q Consensus       151 ~v~Y~i~~~~--~~F--~i-~~~tG~i~l~~~----d~e~~~~y~l~V~a~D~g~  196 (325)
                      -...+|.++.  ..-  .. ....|...+.-+    ..|..+.|.|+|.|+|..+
T Consensus        82 i~sv~l~Gg~~~d~v~ls~~~~~~~~~~~~yp~~fpsle~~~~YtLtV~a~D~aG  136 (158)
T PF13750_consen   82 ITSVSLTGGPASDSVSLSWTNKGNGVYTLEYPRIFPSLEADDSYTLTVSATDKAG  136 (158)
T ss_pred             EEEEEEECCcccceEEEeeEeccCceEEeecccccCCcCCCCeEEEEEEEEecCC
Confidence            3456666532  221  11 123444444322    3467789999999999876


No 24 
>TIGR01965 VCBS_repeat VCBS repeat. This domain of about 100 residues is found multiple (up to 35) copies in long proteins from several species of Vibrio, Colwellia, Bradyrhizobium, and Shewanella (hence the name VCBS) and in smaller copy numbers in proteins from several other bacteria. The large protein size and repeat copy numbers, species distribution, and suggested activities of several member proteins suggests a role for this domain in adhesion.
Probab=87.50  E-value=6  Score=29.92  Aligned_cols=71  Identities=15%  Similarity=0.149  Sum_probs=46.3

Q ss_pred             EEEEEEeCCCC-eEEEEEec---CCCcEEEeCCceEEEEccC-C-----C--C-CCceEEEEEEEEeCCCCCCcceecEE
Q psy2920         140 GSVSATDADGD-KVAYKLAV---PSNLVVIVPQTGELLLTGD-P-----P--S-DDVEYELTVEAHDLQSPSRYAREPAQ  206 (325)
Q Consensus       140 ~~v~A~D~D~~-~v~Y~i~~---~~~~F~i~~~tG~i~l~~~-d-----~--e-~~~~y~l~V~a~D~g~p~~~s~~t~~  206 (325)
                      +++.++|+|.+ ...+++..   ..+.|.|++ +|.-..... .     +  + ..-.-.++|.+.|+      .  +.+
T Consensus         2 G~Lt~sD~D~gd~~~~s~~~~~g~yGtlti~~-~G~wtYtl~n~~~avq~L~~Ge~~tdsFtvtv~DG------t--t~~   72 (99)
T TIGR01965         2 GQLTISDADAGQAHFIAQTDAAGQYGTFSIDA-DGQWTYQADNSQTAVQALKAGETLTDTFTVTSADG------T--SQT   72 (99)
T ss_pred             CceEEeCCCCCCceEEecccccCCcEEEEECC-CCcEEEEeCCCcHHHHhhcCCCEEEEEEEEEEeCC------C--eEE
Confidence            46789999985 45666643   235688865 676544433 1     1  1 11235677788884      2  678


Q ss_pred             EEEEEecCCCCCc
Q psy2920         207 VWLQFNSPVLQDE  219 (325)
Q Consensus       207 v~I~v~d~~~~P~  219 (325)
                      |+|+|.-.|++|.
T Consensus        73 vtItI~GtNDapv   85 (99)
T TIGR01965        73 VTITITGANDAAV   85 (99)
T ss_pred             EEEEEEccCCCCE
Confidence            8899888999998


No 25 
>PF07495 Y_Y_Y:  Y_Y_Y domain;  InterPro: IPR011123 This region is mostly found at the end of the beta propellers (IPR011110 from INTERPRO) in a family of two component regulators. However they are also found tandemly repeated in Q891H4 from SWISSPROT without other signal conduction domains being present. It is named after the conserved tyrosines found in the alignment. The exact function is not known.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=84.48  E-value=10  Score=25.67  Aligned_cols=56  Identities=14%  Similarity=0.150  Sum_probs=31.7

Q ss_pred             eEEEEEecC-CCcEEEeCCceEEEEccCCCCCCceEEEEEEEEeCCCCCCcceecEEEEEEE
Q psy2920         151 KVAYKLAVP-SNLVVIVPQTGELLLTGDPPSDDVEYELTVEAHDLQSPSRYAREPAQVWLQF  211 (325)
Q Consensus       151 ~v~Y~i~~~-~~~F~i~~~tG~i~l~~~d~e~~~~y~l~V~a~D~g~p~~~s~~t~~v~I~v  211 (325)
                      ...|+|.+. ..+..+...+-.+.....   ..+.|+|.|+|.|..+.....  ..++.|.|
T Consensus         9 ~Y~Y~l~g~d~~W~~~~~~~~~~~~~~L---~~G~Y~l~V~a~~~~~~~~~~--~~~l~i~I   65 (66)
T PF07495_consen    9 RYRYRLEGFDDEWITLGSYSNSISYTNL---PPGKYTLEVRAKDNNGKWSSD--EKSLTITI   65 (66)
T ss_dssp             EEEEEEETTESSEEEESSTS-EEEEES-----SEEEEEEEEEEETTS-B-SS---EEEEEEE
T ss_pred             EEEEEEECCCCeEEECCCCcEEEEEEeC---CCEEEEEEEEEECCCCCcCcc--cEEEEEEE
Confidence            466777763 456666433325555443   668999999999977643333  24555554


No 26 
>TIGR00845 caca sodium/calcium exchanger 1. This model is specific for the eukaryotic sodium ion/calcium ion exchangers of the Caca family
Probab=84.12  E-value=62  Score=34.42  Aligned_cols=29  Identities=28%  Similarity=0.417  Sum_probs=19.8

Q ss_pred             EEEEEEEecCCCCCCccCcceeEEEEeccC
Q psy2920          94 THVTVDVLDENDNSPVFSQKVYRFIVGDLK  123 (325)
Q Consensus        94 ~~v~v~V~DvNdn~P~f~~~~~~~~v~E~~  123 (325)
                      ...+|+|.| ||++|.|....-...|.|+.
T Consensus       515 s~ATVTIlD-DD~aGIfsFe~~~~sV~Es~  543 (928)
T TIGR00845       515 NTATVTILD-DDHAGIFTFEEDVFHVSESI  543 (928)
T ss_pred             ceEEEEEec-CcccCcccccCceEEEEcCC
Confidence            355677777 77788776555556788864


No 27 
>PF05895 DUF859:  Siphovirus protein of unknown function (DUF859);  InterPro: IPR008577 This entry is represented by Streptococcus phage 7201, Orf39. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This family consists of several uncharacterised proteins from a number of the Siphoviruses as well as some bacterial proteins from Streptococcus species. Some of the members of this family are described as putative minor structural proteins.
Probab=81.95  E-value=38  Score=34.37  Aligned_cols=113  Identities=19%  Similarity=0.211  Sum_probs=62.2

Q ss_pred             cccEEEEEEEEEeCCCCCeeEEEEEEEEecCCCCCCccCcceeEEEEeccCCCCCCCcccccceeEEEEEEE--EeCCCC
Q psy2920          73 EVAQYSLVITATDVKGINKAETHVTVDVLDENDNSPVFSQKVYRFIVGDLKSSLNSNVSHWKKFSSIGSVSA--TDADGD  150 (325)
Q Consensus        73 ~~~~~~l~v~a~D~~~~~~~~~~v~v~V~DvNdn~P~f~~~~~~~~v~E~~~~g~~~~~~~~~~~~v~~v~A--~D~D~~  150 (325)
                      ......+++.++|.+|..+......|+|++.  ++|.+.     +++.-...-+..  +.-.+...|..+..  ..+..-
T Consensus       296 ~~G~~Ti~atVtDSRGr~S~~~~~tItVl~Y--~~P~ls-----fsv~R~~~~~~~--~~v~~~a~Iapl~v~g~qKN~~  366 (624)
T PF05895_consen  296 FSGSATIRATVTDSRGRTSDPKTKTITVLEY--SPPTLS-----FSVYRCGSSGNT--LTVTRNAKIAPLTVNGVQKNTM  366 (624)
T ss_pred             cCceEEEEEEEEECCCccCCceEEEEEEEEc--CCCcEE-----EEEEEeCCCCcE--EEEEEEEEEeEEEEcccccceE
Confidence            3567899999999998667778899999998  567663     222222111100  00000011221111  111112


Q ss_pred             eEEEEEec-CCCcEEEeCC--ce-----------EEEEccCCCCCCceEEEEEEEEeCC
Q psy2920         151 KVAYKLAV-PSNLVVIVPQ--TG-----------ELLLTGDPPSDDVEYELTVEAHDLQ  195 (325)
Q Consensus       151 ~v~Y~i~~-~~~~F~i~~~--tG-----------~i~l~~~d~e~~~~y~l~V~a~D~g  195 (325)
                      .+.|+... +...|.++..  .|           ...|.. .+.....|.+++.++|.-
T Consensus       367 ~lt~~~a~~gt~~~t~d~~~a~~~~s~~s~~~~~~~~L~g-~y~~~kSy~V~~~l~D~F  424 (624)
T PF05895_consen  367 TLTFKVAPLGTGTFTTDNGSASGTWSSISELTNSSANLGG-TYDAEKSYDVRGTLSDKF  424 (624)
T ss_pred             EEEEEEEEcCcceEEEEccccccceeeeeeecccceeecc-ccCCCceEEEEEEEEEEe
Confidence            57777765 5566766521  11           122222 466778999999999954


No 28 
>TIGR03660 T1SS_rpt_143 T1SS-143 repeat domain. This model represents a domain of about 143 amino acids that may occur singly or in up to 23 tandem repeats in very large proteins in the genus Vibrio, and in related species such as Legionella pneumophila, Photobacterium profundum, Rhodopseudomonas palustris, Shewanella pealeana, and Aeromonas hydrophila. Proteins with these domains represent a subset of a broader set of proteins with a particular signal for type 1 secretion, consisting of several glycine-rich repeats modeled by pfam00353, followed by a C-terminal domain modeled by TIGR03661. Proteins with this domain tend to share several properties with the RtxA (Repeats in Toxin) protein of Vibrio cholerae, including a large size often containing tandemly repeated domains and a C-terminal signal for type 1 secretion.
Probab=76.24  E-value=39  Score=27.19  Aligned_cols=61  Identities=20%  Similarity=0.288  Sum_probs=43.2

Q ss_pred             ceEEEecccCCccc---ccEEEEEEEEEeCCCCCeeEEEEEEEEecCCCCCCccCcceeEEEEeccCCC
Q psy2920          60 GVLLVTSGDIDREE---VAQYSLVITATDVKGINKAETHVTVDVLDENDNSPVFSQKVYRFIVGDLKSS  125 (325)
Q Consensus        60 ~~~l~~~~~LDrE~---~~~~~l~v~a~D~~~~~~~~~~v~v~V~DvNdn~P~f~~~~~~~~v~E~~~~  125 (325)
                      .+...+..+||...   .-...|.|.|+|..|..+ ...+.|+|.|  | .|...... .+.|.|...+
T Consensus        67 sYtftL~~~lDH~~g~d~l~l~~~v~a~D~DGD~s-~~~l~VtI~D--D-~P~~~~~~-~~~V~E~~L~  130 (137)
T TIGR03660        67 SYEFTLEGPLDHAAGSDELTLNFPIIATDFDGDTS-SITLPVTIVD--D-VPTITDVD-ALTVDEDDLP  130 (137)
T ss_pred             cEEEEEcccccCCCCCceEEEeeeEEEEeCCCCcc-ccEEEEEEEC--C-CCeecccc-ceEEeccccc
Confidence            45788999998843   446788899999877543 3588888887  6 47665443 4788886544


No 29 
>KOG4221|consensus
Probab=75.28  E-value=1.3e+02  Score=32.91  Aligned_cols=66  Identities=18%  Similarity=0.302  Sum_probs=43.0

Q ss_pred             EEEEEeeC--CCCceEEEeCCCCccccccccCCcceEEEecccCCcccccEEEEEEEEEeCCCCCeeEEEEEE-EEecCC
Q psy2920          28 VRYSITQG--NDNHTFALKNPTTHEIRNSTESKSGVLLVTSGDIDREEVAQYSLVITATDVKGINKAETHVTV-DVLDEN  104 (325)
Q Consensus        28 i~y~i~~~--~~~~~F~i~~~~g~~~~~~~~~~~~~~l~~~~~LDrE~~~~~~l~v~a~D~~~~~~~~~~v~v-~V~DvN  104 (325)
                      +.|++.-.  +....+.++..+.             .+++.   +.|....|.+.|.|....|...++..+.| +..|+-
T Consensus       552 ~~yk~~ys~~~~~~~~~~~~n~~-------------e~ti~---gL~k~TeY~~~vvA~N~~G~g~sS~~i~V~Tlsd~P  615 (1381)
T KOG4221|consen  552 TGYKLFYSEDDTGKELRVENNAT-------------EYTIN---GLEKYTEYSIRVVAYNSAGSGVSSADITVRTLSDVP  615 (1381)
T ss_pred             eEEEEEEEcCCCCceEEEecCcc-------------EEEee---cCCCccceEEEEEEecCCCCCCCCCceEEEeccCCC
Confidence            35665432  2335678877665             55555   45778899999999998887644444433 455777


Q ss_pred             CCCCc
Q psy2920         105 DNSPV  109 (325)
Q Consensus       105 dn~P~  109 (325)
                      +-||.
T Consensus       616 saPP~  620 (1381)
T KOG4221|consen  616 SAPPQ  620 (1381)
T ss_pred             CCCCc
Confidence            76665


No 30 
>PF05345 He_PIG:  Putative Ig domain;  InterPro: IPR008009 This alignment represents the conserved core region of a ~90 residue repeat found in several haemagglutinins and other cell surface proteins. Sequence similarities to Hyalin (IPR003410 from INTERPRO) and the PKD domain (IPR000601 from INTERPRO) suggest an Ig-like fold so this family may be similar in function to the (IPR003791 from INTERPRO) and (IPR003790 from INTERPRO) protein families.
Probab=65.33  E-value=26  Score=22.62  Aligned_cols=36  Identities=22%  Similarity=0.325  Sum_probs=25.3

Q ss_pred             CceEEEeCCCCccccccccCCcceEEEecccCCcc-cccEEEEEEEEEeCCC
Q psy2920          38 NHTFALKNPTTHEIRNSTESKSGVLLVTSGDIDRE-EVAQYSLVITATDVKG   88 (325)
Q Consensus        38 ~~~F~i~~~~g~~~~~~~~~~~~~~l~~~~~LDrE-~~~~~~l~v~a~D~~~   88 (325)
                      .....+|+.+|             .|.-.  .+.. ....|.+.|.|+|..|
T Consensus        13 P~gLs~d~~tG-------------~isGt--p~~~~~~G~y~~~vtatd~~G   49 (49)
T PF05345_consen   13 PSGLSLDPSTG-------------TISGT--PTSSVQPGTYTFTVTATDGSG   49 (49)
T ss_pred             CCcEEEeCCCC-------------EEEee--cCCCccccEEEEEEEEEcCCC
Confidence            45788999998             55543  3334 3368999999999753


No 31 
>PF13754 Big_3_4:  Bacterial Ig-like domain (group 3)
Probab=53.68  E-value=40  Score=22.11  Aligned_cols=17  Identities=29%  Similarity=0.202  Sum_probs=14.7

Q ss_pred             ceEEEEEEEecCCCCCC
Q psy2920         308 KTIDFWVTINNQATNGG  324 (325)
Q Consensus       308 ~~~~l~V~a~D~~~~~g  324 (325)
                      ..|.|.|.|+|.+++..
T Consensus        24 G~y~itv~a~D~AGN~s   40 (54)
T PF13754_consen   24 GTYTITVTATDAAGNTS   40 (54)
T ss_pred             ccEEEEEEEEeCCCCCC
Confidence            68999999999998764


No 32 
>KOG4221|consensus
Probab=52.52  E-value=3.6e+02  Score=29.82  Aligned_cols=121  Identities=15%  Similarity=0.171  Sum_probs=70.4

Q ss_pred             ccEEEEEEEEEeCCCCCeeEEEEEEEEecCCCCCCccCcce-----eEEEEeccCCCCCCCcccccceeEEEEEEEEeCC
Q psy2920          74 VAQYSLVITATDVKGINKAETHVTVDVLDENDNSPVFSQKV-----YRFIVGDLKSSLNSNVSHWKKFSSIGSVSATDAD  148 (325)
Q Consensus        74 ~~~~~l~v~a~D~~~~~~~~~~v~v~V~DvNdn~P~f~~~~-----~~~~v~E~~~~g~~~~~~~~~~~~v~~v~A~D~D  148 (325)
                      ...|.+++.|....|...+...+.|.-...   +|.- ...     ..+.|.-..++-.++        .|..       
T Consensus       493 ~t~Y~~rv~A~n~~g~g~sS~pLkV~t~pE---gp~~-~~a~ats~~ti~v~WepP~~~n~--------~I~~-------  553 (1381)
T KOG4221|consen  493 LTMYFFRVRAKNEAGSGESSAPLKVTTQPE---GPVQ-LQAYATSPTTILVTWEPPPFGNG--------PITG-------  553 (1381)
T ss_pred             ceeEEEEEeccCcccCCccCCceEEecCCC---CCcc-ccccccCcceEEEEecCCCCCCC--------CceE-------
Confidence            457999999988777665566665543333   3322 112     222222222220010        0000       


Q ss_pred             CCeEEEEEecCCCcEEEeCCceEEEEccCCCCCCceEEEEEEEEeCCCCCCcceecEEEEE-EEecCCCCCc
Q psy2920         149 GDKVAYKLAVPSNLVVIVPQTGELLLTGDPPSDDVEYELTVEAHDLQSPSRYAREPAQVWL-QFNSPVLQDE  219 (325)
Q Consensus       149 ~~~v~Y~i~~~~~~F~i~~~tG~i~l~~~d~e~~~~y~l~V~a~D~g~p~~~s~~t~~v~I-~v~d~~~~P~  219 (325)
                       =++.|+..+....+.++.++-+.++..  .+....|.+.|+|..+.++..++   +.|+| +..|+-.+||
T Consensus       554 -yk~~ys~~~~~~~~~~~~n~~e~ti~g--L~k~TeY~~~vvA~N~~G~g~sS---~~i~V~Tlsd~PsaPP  619 (1381)
T KOG4221|consen  554 -YKLFYSEDDTGKELRVENNATEYTING--LEKYTEYSIRVVAYNSAGSGVSS---ADITVRTLSDVPSAPP  619 (1381)
T ss_pred             -EEEEEEcCCCCceEEEecCccEEEeec--CCCccceEEEEEEecCCCCCCCC---CceEEEeccCCCCCCC
Confidence             045555555567888877776766664  56778899999999999888776   23333 3456666776


No 33 
>PF07495 Y_Y_Y:  Y_Y_Y domain;  InterPro: IPR011123 This region is mostly found at the end of the beta propellers (IPR011110 from INTERPRO) in a family of two component regulators. However they are also found tandemly repeated in Q891H4 from SWISSPROT without other signal conduction domains being present. It is named after the conserved tyrosines found in the alignment. The exact function is not known.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=49.76  E-value=34  Score=22.99  Aligned_cols=29  Identities=28%  Similarity=0.444  Sum_probs=19.4

Q ss_pred             cccEEEEEEEEEeCCCCCe-eEEEEEEEEe
Q psy2920          73 EVAQYSLVITATDVKGINK-AETHVTVDVL  101 (325)
Q Consensus        73 ~~~~~~l~v~a~D~~~~~~-~~~~v~v~V~  101 (325)
                      ....|.|.|.|.|..+... ....+.|.|+
T Consensus        37 ~~G~Y~l~V~a~~~~~~~~~~~~~l~i~I~   66 (66)
T PF07495_consen   37 PPGKYTLEVRAKDNNGKWSSDEKSLTITIL   66 (66)
T ss_dssp             -SEEEEEEEEEEETTS-B-SS-EEEEEEEE
T ss_pred             CCEEEEEEEEEECCCCCcCcccEEEEEEEC
Confidence            4678999999999876543 2366776663


No 34 
>PF12245 Big_3_2:  Bacterial Ig-like domain (group 3);  InterPro: IPR022038  This family of proteins is found in bacteria. They have two conserved sequence motifs: AGN and GMT. 
Probab=49.65  E-value=71  Score=21.46  Aligned_cols=31  Identities=29%  Similarity=0.537  Sum_probs=20.8

Q ss_pred             cccEEEEEEEEEeCCCCCeeEEEEEEEEecCC
Q psy2920          73 EVAQYSLVITATDVKGINKAETHVTVDVLDEN  104 (325)
Q Consensus        73 ~~~~~~l~v~a~D~~~~~~~~~~v~v~V~DvN  104 (325)
                      ....|.+.+.++|.-| +.+.......+.|..
T Consensus        21 ~dg~yt~~v~a~D~AG-N~~~~~~~~~i~d~~   51 (60)
T PF12245_consen   21 ADGEYTLTVTATDKAG-NTSSSTTQIVIVDNT   51 (60)
T ss_pred             CCccEEEEEEEEECCC-CEEEeeeEEEEEcCC
Confidence            3567999999999877 444445555555443


No 35 
>PF13860 FlgD_ig:  FlgD Ig-like domain; PDB: 3C12_A 3OSV_A.
Probab=39.61  E-value=1.2e+02  Score=21.59  Aligned_cols=13  Identities=31%  Similarity=0.345  Sum_probs=11.2

Q ss_pred             ceEEEEEEEecCC
Q psy2920         308 KTIDFWVTINNQA  320 (325)
Q Consensus       308 ~~~~l~V~a~D~~  320 (325)
                      ..|.|.|.|+|.|
T Consensus        69 G~Y~~~v~a~~~g   81 (81)
T PF13860_consen   69 GTYTFRVTATDGG   81 (81)
T ss_dssp             EEEEEEEEEEET-
T ss_pred             CCEEEEEEEEeCC
Confidence            7999999999975


No 36 
>smart00089 PKD Repeats in polycystic kidney disease 1 (PKD1) and other proteins. Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.
Probab=38.53  E-value=80  Score=21.92  Aligned_cols=30  Identities=30%  Similarity=0.388  Sum_probs=21.8

Q ss_pred             CCcccccEEEEEEEEEeCCCCCeeEEEEEEEE
Q psy2920          69 IDREEVAQYSLVITATDVKGINKAETHVTVDV  100 (325)
Q Consensus        69 LDrE~~~~~~l~v~a~D~~~~~~~~~~v~v~V  100 (325)
                      .-|.....|.+++.+.|..|  +....++|.|
T Consensus        49 ~~y~~~G~y~v~l~v~n~~g--~~~~~~~i~v   78 (79)
T smart00089       49 HTYTKPGTYTVTLTVTNAVG--SASATVTVVV   78 (79)
T ss_pred             EEeCCCcEEEEEEEEEcCCC--cEEEEEEEEE
Confidence            34667789999999999866  4555555554


No 37 
>PF02494 HYR:  HYR domain;  InterPro: IPR003410 This domain is known as the HYR (Hyalin Repeat) domain, after the protein hyalin that is composed exclusively of this repeat. This domain probably corresponds to a new superfamily in the immunoglobulin fold. The function of this domain is uncertain it may be involved in cell adhesion. In the Sushi repeat-containing protein (SrpX), this domain is found between two sushi repeats.
Probab=37.93  E-value=75  Score=22.51  Aligned_cols=27  Identities=41%  Similarity=0.463  Sum_probs=21.5

Q ss_pred             cccEEEEEEEEEeCCCCCeeEEEEEEEE
Q psy2920          73 EVAQYSLVITATDVKGINKAETHVTVDV  100 (325)
Q Consensus        73 ~~~~~~l~v~a~D~~~~~~~~~~v~v~V  100 (325)
                      ....|.++..|+|..| +++.+.+.|+|
T Consensus        55 ~~G~t~V~ytA~D~~G-N~a~C~f~V~V   81 (81)
T PF02494_consen   55 PVGTTTVTYTATDAAG-NSATCSFTVTV   81 (81)
T ss_pred             eeceEEEEEEEEECCC-CEEEEEEEEEC
Confidence            3568999999999865 66788888875


No 38 
>smart00089 PKD Repeats in polycystic kidney disease 1 (PKD1) and other proteins. Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.
Probab=34.30  E-value=90  Score=21.65  Aligned_cols=29  Identities=17%  Similarity=0.139  Sum_probs=21.0

Q ss_pred             CCCCCceEEEEEEEEeCCCCCCcceecEEEEEEE
Q psy2920         178 PPSDDVEYELTVEAHDLQSPSRYAREPAQVWLQF  211 (325)
Q Consensus       178 d~e~~~~y~l~V~a~D~g~p~~~s~~t~~v~I~v  211 (325)
                      .|.....|.+++.|+|..+   ++  ++++.|.|
T Consensus        50 ~y~~~G~y~v~l~v~n~~g---~~--~~~~~i~v   78 (79)
T smart00089       50 TYTKPGTYTVTLTVTNAVG---SA--SATVTVVV   78 (79)
T ss_pred             EeCCCcEEEEEEEEEcCCC---cE--EEEEEEEE
Confidence            3667789999999999765   33  55555554


No 39 
>PF10365 DUF2436:  Domain of unknown function (DUF2436);  InterPro: IPR018832  Gingipains R and K are endopeptidases with specificity for arginyl and lysyl bonds, respectively. Like other cysteine peptidases, they require reducing conditions for activity. They are maximally active at approximately neutral pH. Gingipains R and K are secreted by the bacterium Porphyromonas gingivalis (Bacteroides gingivalis). The bacterium is a major pathogen in periodontal disease, and the many ways in which the activities of the gingipains may contribute to the disease processes have been reviewed []. These enzymes are also involved in the hemagglutinating activity of the organisms.  This entry represents a central region found in gingipain K peptidases, active on lysyl bonds; they belong to the MEROPS peptidase family C25 (gingipain family, clan CD).  
Probab=29.85  E-value=3e+02  Score=22.22  Aligned_cols=126  Identities=13%  Similarity=0.142  Sum_probs=66.2

Q ss_pred             EEEecccCCcccccEEEEEEEEEeCCCCCeeEEEEEEEEecCCCCCCccCcceeEEEEeccCCCCCCCcccccceeEEEE
Q psy2920          62 LLVTSGDIDREEVAQYSLVITATDVKGINKAETHVTVDVLDENDNSPVFSQKVYRFIVGDLKSSLNSNVSHWKKFSSIGS  141 (325)
Q Consensus        62 ~l~~~~~LDrE~~~~~~l~v~a~D~~~~~~~~~~v~v~V~DvNdn~P~f~~~~~~~~v~E~~~~g~~~~~~~~~~~~v~~  141 (325)
                      .|.+...=..+...-|++.+-| |+..-. ....-+=-+.+ |+++|.=.-..|+-.|++|+-+.-     ++....+--
T Consensus        27 ~IILeah~vW~DgsGyQ~LlDa-DHn~yG-~~iP~~g~l~~-n~~~pa~ly~~FEYkiP~NADps~-----tpq~mv~dG   98 (161)
T PF10365_consen   27 RIILEAHNVWGDGSGYQMLLDA-DHNTYG-DVIPDSGPLWD-NCNVPANLYDPFEYKIPANADPST-----TPQNMVVDG   98 (161)
T ss_pred             EEEEeccccccCCcceEEEEcC-Cccccc-ccccccCcccc-CCCCChhhcccceEeccCCCCCcc-----CcceEEecC
Confidence            5666555566667777776654 221100 00111112233 778887666778888999987641     111122111


Q ss_pred             EEEEeCCCCeEEEEEecC--CCcEEEeCCceEEEEccCC--CCCCceEEEEEEEEeCC
Q psy2920         142 VSATDADGDKVAYKLAVP--SNLVVIVPQTGELLLTGDP--PSDDVEYELTVEAHDLQ  195 (325)
Q Consensus       142 v~A~D~D~~~v~Y~i~~~--~~~F~i~~~tG~i~l~~~d--~e~~~~y~l~V~a~D~g  195 (325)
                      ---.+.-.|..-|-|..+  +....|-..-|.=..+..|  +|.-..|++++++..++
T Consensus        99 ~~~i~IPaG~YDy~I~~P~~~~kiwIaGd~g~~~tr~dDy~fEAGKtY~ftm~~~g~g  156 (161)
T PF10365_consen   99 EASIDIPAGTYDYCIAAPQPGGKIWIAGDGGDGPTRGDDYVFEAGKTYRFTMKRVGSG  156 (161)
T ss_pred             ceEEEecCceeEEEEecCCCCCeEEEecCCCCCCccccceEEecCCEEEEEEEeccCC
Confidence            111122225667777764  3445553333333333333  46778999999987765


No 40 
>PF02010 REJ:  REJ domain;  InterPro: IPR002859 The REJ (Receptor for Egg Jelly) domain is found in PKD1 P98161 from SWISSPROT and the sperm receptor for egg jelly Q26627 from SWISSPROT. The exact function of this domain is unknown. The domain is 600 amino acids long so is probably composed of multiple structural domains. There are six completely conserved cysteine residues that may form disulphide bridges. This region contains tandem PKD-like domains. Sequence similarity between a region of the autosomal dominant polycystic kidney disease (ADPKD) protein, polycystin-1 and a sea urchin sperm glycoprotein involved in fertilization, the receptor for egg jelly (suREJ) has been known for some time. The suREJ protein binds the glycoprotein coat of the egg (egg jelly), triggering the acrosome reaction, which transforms the sperm into a fusogenic cell. The sequence similarity and expression pattern suggests that the predicted human PKDREJ protein is a mammalian equivalent of the suREJ protein and therefore may have a central role in human fertilization [].; PDB: 2E7M_A 2YRL_A.
Probab=28.65  E-value=90  Score=29.92  Aligned_cols=42  Identities=12%  Similarity=0.115  Sum_probs=0.0

Q ss_pred             CCceEEEEccCCCCCCceEEEEEEEEeCCCCCCcceecEEEEEEEe
Q psy2920         167 PQTGELLLTGDPPSDDVEYELTVEAHDLQSPSRYAREPAQVWLQFN  212 (325)
Q Consensus       167 ~~tG~i~l~~~d~e~~~~y~l~V~a~D~g~p~~~s~~t~~v~I~v~  212 (325)
                      ...+.+.+..........|.+++.+++.+.  .++  .++..|.+.
T Consensus       145 ~~~~~l~i~~~~l~~~~~y~f~ltv~k~~r--~s~--s~~~~v~v~  186 (440)
T PF02010_consen  145 SSSSSLTIPASTLSPGSTYTFTLTVSKGSR--SSS--SASQTVTVV  186 (440)
T ss_dssp             ----------------------------------------------
T ss_pred             CCCEEEEEEhHHcCCCceEEEEEEEEeCCC--Cce--eeEEEEEec
Confidence            444555554443333444999999998773  133  455555553


No 41 
>cd00146 PKD polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.
Probab=28.64  E-value=1.4e+02  Score=20.67  Aligned_cols=19  Identities=21%  Similarity=0.426  Sum_probs=15.9

Q ss_pred             CcccccEEEEEEEEEeCCC
Q psy2920          70 DREEVAQYSLVITATDVKG   88 (325)
Q Consensus        70 DrE~~~~~~l~v~a~D~~~   88 (325)
                      .|.....|.+++.++|..+
T Consensus        52 ~y~~~G~y~v~l~v~d~~g   70 (81)
T cd00146          52 TYTKPGTYTVTLTVTNAVG   70 (81)
T ss_pred             EcCCCcEEEEEEEEEeCCC
Confidence            4677889999999999854


No 42 
>cd02848 Chitinase_N_term Chitinase N-terminus domain. Chitinases hydrolyze the abundant natural biopolymer chitin, producing smaller chito-oligosaccharides. Chitin consists of multiple N-acetyl-D-glucosamine (NAG) residues connected via beta-1,4-glycosidic linkages and is an important structural element of fungal cell wall and arthropod exoskeletons. On the basis of the mode of chitin hydrolysis, chitinases are classified as random, endo-, and exo-chitinases and based on sequence criteria, chitinases belong to families 18 and 19 of glycosyl hydrolases.  The N-terminus of chitinase may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at  either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitob
Probab=26.86  E-value=1.8e+02  Score=22.29  Aligned_cols=34  Identities=12%  Similarity=0.214  Sum_probs=22.5

Q ss_pred             cCCcccccEEEEEEEEEeCCCCCeeEEEEEEEEec
Q psy2920          68 DIDREEVAQYSLVITATDVKGINKAETHVTVDVLD  102 (325)
Q Consensus        68 ~LDrE~~~~~~l~v~a~D~~~~~~~~~~v~v~V~D  102 (325)
                      .+++..-..|.++|.++|..|.. .+..+.|.|-|
T Consensus        73 t~~v~kgG~y~m~V~lCn~dGCS-~S~~~~I~VAD  106 (106)
T cd02848          73 TFKVGKGGRYQMQVALCNGDGCS-TSAAKEIVVAD  106 (106)
T ss_pred             EEEeCCCCeEEEEEEEECCCCcc-CcCCEEEEecC
Confidence            34566677899999999987754 22344454443


No 43 
>KOG3513|consensus
Probab=26.53  E-value=8.8e+02  Score=26.55  Aligned_cols=121  Identities=20%  Similarity=0.177  Sum_probs=62.0

Q ss_pred             cccccEEEEEEEEEeCCCCCeeEEEEEEEEecCCCCCCccCcceeEEEEeccCCCCCCCcccccceeEEEEEEEE-eCCC
Q psy2920          71 REEVAQYSLVITATDVKGINKAETHVTVDVLDENDNSPVFSQKVYRFIVGDLKSSLNSNVSHWKKFSSIGSVSAT-DADG  149 (325)
Q Consensus        71 rE~~~~~~l~v~a~D~~~~~~~~~~v~v~V~DvNdn~P~f~~~~~~~~v~E~~~~g~~~~~~~~~~~~v~~v~A~-D~D~  149 (325)
                      ++....|+.  .|...-|  ++..+..+.|.|    ++++....-...+    ..|.         ....+-.|. |+-.
T Consensus       488 ~~DaG~YtC--~A~N~~G--~a~~~~~L~Vkd----~tri~~~P~~~~v----~~g~---------~v~l~Ce~shD~~l  546 (1051)
T KOG3513|consen  488 RSDAGKYTC--VAENKLG--KAESTGNLIVKD----ATRITLAPSNTDV----KVGE---------SVTLTCEASHDPSL  546 (1051)
T ss_pred             cccCcEEEE--EEEcccC--ccceEEEEEEec----CceEEeccchhhh----ccCc---------eEEEEeecccCCCc
Confidence            555666654  4444323  455556666664    4555443222222    2220         334444444 4443


Q ss_pred             C-eEEEEEec-------CCCcEEEe--CCceEEEEccCCCCCCceEEEEEEEEeCCCCCCcceecEEEEEEEecCCCCCc
Q psy2920         150 D-KVAYKLAV-------PSNLVVIV--PQTGELLLTGDPPSDDVEYELTVEAHDLQSPSRYAREPAQVWLQFNSPVLQDE  219 (325)
Q Consensus       150 ~-~v~Y~i~~-------~~~~F~i~--~~tG~i~l~~~d~e~~~~y~l~V~a~D~g~p~~~s~~t~~v~I~v~d~~~~P~  219 (325)
                      . .+.+++.+       ....|.++  ..+|.++....+.+..+.|...++..   .-..++  .+.+.|  ..+-.+|+
T Consensus       547 d~~f~W~~nG~~id~~~~~~~~~~~~~~~~g~L~i~nv~l~~~G~Y~C~aqT~---~Ds~s~--~A~l~V--~gpPgpP~  619 (1051)
T KOG3513|consen  547 DITFTWKKNGRPIDFNPDGDHFEINDGSDSGRLTIANVSLEDSGKYTCVAQTA---LDSASA--RADLLV--RGPPGPPP  619 (1051)
T ss_pred             ceEEEEEECCEEhhccCCCCceEEeCCcCccceEEEeeccccCceEEEEEEEe---ecchhc--ccceEE--ecCCCCCC
Confidence            3 34444432       13566664  33477777666788889999998873   111222  445544  45656665


No 44 
>cd05760 Ig2_PTK7 Second immunoglobulin (Ig)-like domain of protein tyrosine kinase (PTK) 7, also known as CCK4. Ig2_PTK7: domain similar to the second immunoglobulin (Ig)-like domain in protein tyrosine kinase (PTK) 7, also known as CCK4. PTK7 is a subfamily of the receptor protein tyrosine kinase family, and is referred to as an RPTK-like molecule. RPTKs transduce extracellular signals across the cell membrane, and play important roles in regulating cell proliferation, migration, and differentiation. PTK7 is organized as an extracellular portion having seven Ig-like domains, a single transmembrane region, and a cytoplasmic tyrosine kinase-like domain. PTK7 is considered a pseudokinase as it has several unusual residues in some of the highly conserved tyrosine kinase (TK) motifs; it is predicted to lack TK activity. PTK7 may function as a cell-adhesion molecule. PTK7 mRNA is expressed at high levels in placenta, melanocytes, liver, lung, pancreas, and kidney. PTK7 is overexpressed in s
Probab=21.57  E-value=2.8e+02  Score=19.03  Aligned_cols=47  Identities=15%  Similarity=0.132  Sum_probs=27.1

Q ss_pred             cEEEeCCceEEEEccCCCCCCceEEEEEEEEeCCCCCCcceecEEEEEEEec
Q psy2920         162 LVVIVPQTGELLLTGDPPSDDVEYELTVEAHDLQSPSRYAREPAQVWLQFNS  213 (325)
Q Consensus       162 ~F~i~~~tG~i~l~~~d~e~~~~y~l~V~a~D~g~p~~~s~~t~~v~I~v~d  213 (325)
                      .+.+....+.|.+.....++...|....  .+..+.. .+  .+.++|.|.|
T Consensus        30 ~~~~~~~~~~L~I~~~~~~D~G~Y~C~a--~N~~G~~-~s--~~~~~l~v~d   76 (77)
T cd05760          30 NYSVSSKERTLTLRSAGPDDSGLYYCCA--HNAFGSV-CS--SQNFTLSIID   76 (77)
T ss_pred             cEEEeCCCCEEEEeeCCcccCEEEEEEE--EeCCCeE-ee--CceEEEEEec
Confidence            3445444556666555667788898865  5544321 12  4566776654


No 45 
>KOG0196|consensus
Probab=21.00  E-value=1e+03  Score=25.38  Aligned_cols=38  Identities=21%  Similarity=0.091  Sum_probs=23.8

Q ss_pred             CCCCCceEEEEEEEEeCCC---CCCcceecEEEEEEEecCCCCCc
Q psy2920         178 PPSDDVEYELTVEAHDLQS---PSRYAREPAQVWLQFNSPVLQDE  219 (325)
Q Consensus       178 d~e~~~~y~l~V~a~D~g~---p~~~s~~t~~v~I~v~d~~~~P~  219 (325)
                      +.+....|++.|+|.++-.   |...  ..+.|.|+.  +...|+
T Consensus       404 ~L~ah~~YTFeV~AvNgVS~lsp~~~--~~a~vnItt--~qa~ps  444 (996)
T KOG0196|consen  404 DLLAHTNYTFEVEAVNGVSDLSPFPR--QFASVNITT--NQAAPS  444 (996)
T ss_pred             ccccccccEEEEEEeecccccCCCCC--cceeEEeec--cccCCC
Confidence            6677889999999998643   2222  245666654  444443


Done!