Query         psy669
Match_columns 248
No_of_seqs    268 out of 1705
Neff          9.3 
Searched_HMMs 46136
Date          Sat Aug 17 00:33:29 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy669.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/669hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG4289|consensus              100.0 8.3E-45 1.8E-49  334.9  21.6  229    2-235   221-480 (2531)
  2 KOG4289|consensus              100.0 8.8E-42 1.9E-46  315.1  22.5  233    2-239   326-586 (2531)
  3 KOG1219|consensus              100.0 3.2E-38 6.9E-43  299.6  25.2  231    2-236   903-1166(4289)
  4 KOG1219|consensus              100.0 5.3E-37 1.1E-41  291.4  22.6  225    2-239  2634-2889(4289)
  5 cd00031 CA Cadherin repeat dom 100.0 1.4E-26   3E-31  184.9  22.8  131   54-186     1-131 (199)
  6 cd00031 CA Cadherin repeat dom  99.9 7.9E-26 1.7E-30  180.5  21.3  147    1-149    53-199 (199)
  7 PF00028 Cadherin:  Cadherin do  99.8 3.2E-19   7E-24  125.2  13.5   92   55-148     1-93  (93)
  8 smart00112 CA Cadherin repeats  99.7   3E-17 6.5E-22  111.6  11.1   79   75-155     1-79  (79)
  9 KOG1834|consensus               99.7 6.1E-17 1.3E-21  142.7  15.5  143    2-150    95-245 (952)
 10 KOG1834|consensus               99.6 9.6E-14 2.1E-18  122.8  16.9  180   39-223    22-242 (952)
 11 smart00112 CA Cadherin repeats  99.2 9.8E-11 2.1E-15   79.3   6.6   47    2-48     33-79  (79)
 12 PF00028 Cadherin:  Cadherin do  98.6 1.4E-07 3.1E-12   65.7   6.8   61  162-223     1-92  (93)
 13 PF08266 Cadherin_2:  Cadherin-  97.9 2.7E-05 5.8E-10   53.0   4.5   63   55-119     3-67  (84)
 14 PF08758 Cadherin_pro:  Cadheri  97.4  0.0034 7.5E-08   43.2   9.4   78   47-133     3-80  (90)
 15 smart00736 CADG Dystroglycan-t  96.4    0.08 1.7E-06   36.9  10.2   70   74-152    24-96  (97)
 16 TIGR01965 VCBS_repeat VCBS rep  94.0     1.4 3.1E-05   30.8  10.6   88   70-169     2-97  (99)
 17 TIGR00845 caca sodium/calcium   92.9      10 0.00022   37.2  17.8  122   43-169   395-542 (928)
 18 PF13750 Big_3_3:  Bacterial Ig  92.5     3.8 8.3E-05   31.3  15.9  126   14-148    15-148 (158)
 19 smart00736 CADG Dystroglycan-t  83.5     6.2 0.00014   27.2   6.4   31   13-45     66-96  (97)
 20 PF05345 He_PIG:  Putative Ig d  82.2     7.3 0.00016   23.3   5.5   35   95-132    13-48  (49)
 21 TIGR00845 caca sodium/calcium   79.6      68  0.0015   31.8  14.0   57   33-93    514-570 (928)
 22 TIGR03660 T1SS_rpt_143 T1SS-14  77.8      14  0.0003   27.6   6.8   56    2-63     70-128 (137)
 23 PF07495 Y_Y_Y:  Y_Y_Y domain;   75.0      18 0.00038   22.6   7.9   60   82-148     7-66  (66)
 24 TIGR03660 T1SS_rpt_143 T1SS-14  72.7      23 0.00051   26.4   6.9   54  109-169    71-127 (137)
 25 PF07495 Y_Y_Y:  Y_Y_Y domain;   72.4     9.4  0.0002   23.9   4.3   29   13-41     38-66  (66)
 26 KOG3597|consensus               66.6      88  0.0019   28.3  10.3   59   32-91     24-82  (442)
 27 PF13750 Big_3_3:  Bacterial Ig  64.6      61  0.0013   24.7  11.0   14  119-132    14-28  (158)
 28 PF03160 Calx-beta:  Calx-beta   63.5      45 0.00097   22.8   9.6   52   37-92      2-53  (100)
 29 PF10365 DUF2436:  Domain of un  55.4      80  0.0017   23.6   6.7   88   44-132    66-156 (161)
 30 cd02848 Chitinase_N_term Chiti  52.7      42  0.0009   23.8   4.8   33  113-148    73-105 (106)
 31 PF12245 Big_3_2:  Bacterial Ig  52.3      55  0.0012   20.3   5.7   30   13-44     22-51  (60)
 32 PF08758 Cadherin_pro:  Cadheri  49.5      13 0.00028   25.5   1.8   33  154-186     3-35  (90)
 33 smart00089 PKD Repeats in poly  47.2      58  0.0012   21.0   4.8   28   10-40     51-78  (79)
 34 cd00146 PKD polycystic kidney   44.6      54  0.0012   21.2   4.3   30  114-146    51-80  (81)
 35 cd00146 PKD polycystic kidney   44.2      66  0.0014   20.8   4.7   28   10-39     53-80  (81)
 36 KOG4221|consensus               44.1 3.6E+02  0.0079   27.9  11.1   40   10-49    581-620 (1381)
 37 PF12245 Big_3_2:  Bacterial Ig  42.6      82  0.0018   19.5   5.4   30  119-151    22-51  (60)
 38 TIGR01965 VCBS_repeat VCBS rep  35.9   1E+02  0.0022   21.6   4.6   41   14-62     57-97  (99)
 39 cd02848 Chitinase_N_term Chiti  35.2      56  0.0012   23.2   3.2   30    7-36     73-102 (106)
 40 PF05895 DUF859:  Siphovirus pr  34.5 4.2E+02   0.009   25.3  12.5  110   13-132   297-424 (624)
 41 KOG3597|consensus               32.9      66  0.0014   29.0   4.1   52  139-190    24-75  (442)
 42 PF02494 HYR:  HYR domain;  Int  32.9      97  0.0021   20.3   4.1   25  120-147    57-81  (81)
 43 PF08329 ChitinaseA_N:  Chitina  30.3 2.3E+02   0.005   21.0   6.5   37  113-152    76-112 (133)
 44 COG4288 Uncharacterized protei  27.5 1.9E+02  0.0041   20.6   4.7   41   32-75     66-106 (124)
 45 PF09100 Qn_am_d_aIV:  Quinohem  27.0 1.5E+02  0.0033   21.7   4.3   35  120-155    98-132 (133)
 46 KOG4221|consensus               25.5 7.9E+02   0.017   25.7  12.4   67   85-156   552-620 (1381)
 47 PF13754 Big_3_4:  Bacterial Ig  20.6 1.9E+02  0.0041   17.4   3.4   18   12-29     22-39  (54)
 48 PF03160 Calx-beta:  Calx-beta   20.5 2.8E+02  0.0061   18.7   8.4   35  144-182     2-36  (100)

No 1  
>KOG4289|consensus
Probab=100.00  E-value=8.3e-45  Score=334.92  Aligned_cols=229  Identities=34%  Similarity=0.422  Sum_probs=215.9

Q ss_pred             cEEccCCCCccCcEEEEEEEEEECCCCCCeeeEEEEEEEEecCCCCCccCCCceEEEEECCCCCCeeEEEEEEEeCCCCC
Q psy669            2 IRTSTILDVETKTHYWLTVFAQDHGVVPLHARTDVYIQVQNVNDNVPMTLDPVYYAQIVENQSGILPIVQLAASDGDLDP   81 (248)
Q Consensus         2 i~~~~~LD~E~~~~y~l~v~a~D~~~~~~~~~~~v~i~V~DvNDn~P~f~~~~~~~~v~E~~~~g~~v~~v~a~D~D~~~   81 (248)
                      |++++.||||..+.+.|+|.|.|.+.|+++++++|+|.|.|+|||.|+|.+..|..++.||.+.|+.|++++|+|.|.++
T Consensus       221 irta~~lDREt~e~HvlrVtA~d~~~P~~SAtttv~V~V~D~nDhsPvFEq~~Y~e~lREn~evGy~vLtvrAtD~Dsp~  300 (2531)
T KOG4289|consen  221 IRTAKSLDRETKETHVLRVTAQDHGDPRRSATTTVTVLVLDTNDHSPVFEQDEYREELRENLEVGYEVLTVRATDGDSPP  300 (2531)
T ss_pred             chhhhhhhhhhhheeEEEEEeeecCCCcccceeEEEEEEeecCCCCcccchhHHHHHHhhccccCceEEEEEeccCCCCC
Confidence            67899999999999999999999999999999999999999999999999999999999999999999999999999999


Q ss_pred             CceEEEEEecCCCCCcEEEeCCCeEEEEecCCCCccccceeEEEEEEEeCCCCCCCceEEEEEEEEEecCCCCCeeccCe
Q psy669           82 DQRISYKISAGNPESYFNIDIGSGLISTTGRKLDRENQAEHVLEVTVTDDGAPAPLSSTTRVIISVEDVNDNAPEFEQNF  161 (248)
Q Consensus        82 ~~~i~y~i~~~~~~~~F~id~~tg~i~~~~~~ld~e~~~~~~l~v~a~d~~~p~~~~~~~~v~I~V~d~Nd~~P~f~~~~  161 (248)
                      |+.++|++..|+....|.|++.+|.| .+..+||||+...|.|.|.|.|.|.| +...++.|.|+|.|+|||+|+|....
T Consensus       301 Nani~Yrl~eg~~~~~f~in~rSGvI-~T~a~lDRE~~~~y~L~VeAsDqG~~-pgp~Ta~V~itV~D~NDNaPqFse~~  378 (2531)
T KOG4289|consen  301 NANIRYRLLEGNAKNVFEINPRSGVI-STRAPLDREELESYQLDVEASDQGRP-PGPRTAMVEITVEDENDNAPQFSEKR  378 (2531)
T ss_pred             CCceEEEecCCCccceeEEcCcccee-eccCccCHHhhhheEEEEEeccCCCC-CCCceEEEEEEEEecCCCCccccccc
Confidence            99999999999888999999999999 89999999999999999999999988 66679999999999999999999999


Q ss_pred             eEEEecCCCCCCeeEEEEEEEeCCC-------------------------------CCCCccCceEEEEEEccCCCCcee
Q psy669          162 YHVNIPETRHRDEPLSQVSLISTTG-------------------------------RKLDRENQAEHVLETGSGESEATL  210 (248)
Q Consensus       162 y~~~v~E~~~~g~~v~~v~A~D~D~-------------------------------~~ld~e~~~~y~~~v~~~~~~~~~  210 (248)
                      |.++|.|+..++++|++|+|+|.|-                               .+||+|.. +|++.+++.|+..  
T Consensus       379 Yvvqv~Edvt~~avvlrV~AtDrD~g~Ng~VHYsi~Sgn~~G~f~id~~tGel~vv~plD~e~~-~ytl~IrAqDggr--  455 (2531)
T KOG4289|consen  379 YVVQVREDVTPPAVVLRVTATDRDKGTNGKVHYSIASGNGRGQFYIDSLTGELDVVEPLDFENS-EYTLRIRAQDGGR--  455 (2531)
T ss_pred             eEEEecccCCCCceEEEEEecccCCCcCceEEEEeeccCccccEEEecccceEEEeccccccCC-eeEEEEEcccCCC--
Confidence            9999999999999999999999993                               68999999 9999999888755  


Q ss_pred             ecCCceEEEEEEEEeeeCCcccccC
Q psy669          211 ETGSGESEATLEVLFENSTWENFHE  235 (248)
Q Consensus       211 ~~~~~t~~~~i~v~d~nd~~P~f~~  235 (248)
                      +..+.+.-++|.|+|+|||.|.|..
T Consensus       456 PpLsn~sgl~iqVlDINDhaPifvs  480 (2531)
T KOG4289|consen  456 PPLSNTSGLVIQVLDINDHAPIFVS  480 (2531)
T ss_pred             CCccCCCceEEEEEecCCCCceeEe
Confidence            4455566677999999999999943


No 2  
>KOG4289|consensus
Probab=100.00  E-value=8.8e-42  Score=315.06  Aligned_cols=233  Identities=29%  Similarity=0.383  Sum_probs=218.4

Q ss_pred             cEEccCCCCccCcEEEEEEEEEECCCCCCeeeEEEEEEEEecCCCCCccCCCceEEEEECCCCCCeeEEEEEEEeCCCCC
Q psy669            2 IRTSTILDVETKTHYWLTVFAQDHGVVPLHARTDVYIQVQNVNDNVPMTLDPVYYAQIVENQSGILPIVQLAASDGDLDP   81 (248)
Q Consensus         2 i~~~~~LD~E~~~~y~l~v~a~D~~~~~~~~~~~v~i~V~DvNDn~P~f~~~~~~~~v~E~~~~g~~v~~v~a~D~D~~~   81 (248)
                      |.+..+||||+.++|.|.|.|.|.|.++...+++|.|+|.|+|||+|+|..+.|.++|.|+..+++.|++|+|+|.|.|.
T Consensus       326 I~T~a~lDRE~~~~y~L~VeAsDqG~~pgp~Ta~V~itV~D~NDNaPqFse~~Yvvqv~Edvt~~avvlrV~AtDrD~g~  405 (2531)
T KOG4289|consen  326 ISTRAPLDREELESYQLDVEASDQGRPPGPRTAMVEITVEDENDNAPQFSEKRYVVQVREDVTPPAVVLRVTATDRDKGT  405 (2531)
T ss_pred             eeccCccCHHhhhheEEEEEeccCCCCCCCceEEEEEEEEecCCCCccccccceEEEecccCCCCceEEEEEecccCCCc
Confidence            56788999999999999999999998888889999999999999999999999999999999999999999999999999


Q ss_pred             CceEEEEEecCCCCCcEEEeCCCeEEEEecCCCCccccceeEEEEEEEeCCCCCCCceEEEEEEEEEecCCCCCeeccCe
Q psy669           82 DQRISYKISAGNPESYFNIDIGSGLISTTGRKLDRENQAEHVLEVTVTDDGAPAPLSSTTRVIISVEDVNDNAPEFEQNF  161 (248)
Q Consensus        82 ~~~i~y~i~~~~~~~~F~id~~tg~i~~~~~~ld~e~~~~~~l~v~a~d~~~p~~~~~~~~v~I~V~d~Nd~~P~f~~~~  161 (248)
                      |+.+.|+|.+|+..+.|.||..||.+ .+..+||+|.. .|.+.|.|.|+|.| +++.+..+.|+|+|+|||+|.|....
T Consensus       406 Ng~VHYsi~Sgn~~G~f~id~~tGel-~vv~plD~e~~-~ytl~IrAqDggrP-pLsn~sgl~iqVlDINDhaPifvstp  482 (2531)
T KOG4289|consen  406 NGKVHYSIASGNGRGQFYIDSLTGEL-DVVEPLDFENS-EYTLRIRAQDGGRP-PLSNTSGLVIQVLDINDHAPIFVSTP  482 (2531)
T ss_pred             CceEEEEeeccCccccEEEecccceE-EEeccccccCC-eeEEEEEcccCCCC-CccCCCceEEEEEecCCCCceeEech
Confidence            99999999999999999999999999 88999999988 89999999999999 99999999999999999999999999


Q ss_pred             eEEEecCCCCCCeeEEEEEEEeCCC----------------------------CCCCccCceEEEEEEccCCCCceeecC
Q psy669          162 YHVNIPETRHRDEPLSQVSLISTTG----------------------------RKLDRENQAEHVLETGSGESEATLETG  213 (248)
Q Consensus       162 y~~~v~E~~~~g~~v~~v~A~D~D~----------------------------~~ld~e~~~~y~~~v~~~~~~~~~~~~  213 (248)
                      +.++|.|+.+.|..++.++|.|+|.                            +.||||....|+|.+.+.+...  ..-
T Consensus       483 fq~tvlEnv~lg~~v~~vqaidadsg~na~l~y~laG~~pf~I~~~SG~Itvtk~ldrEt~~~ysl~V~ard~gt--p~l  560 (2531)
T KOG4289|consen  483 FQATVLENVPLGYLVCHVQAIDADSGENARLHYSLAGVGPFQINNGSGWITVTKELDRETVEHYSLGVEARDHGT--PPL  560 (2531)
T ss_pred             hhhhhhhcccccceEEEEecccCCCCcccceeeeeccCCCeeEecCCceEEEeecccccccceEEEEEEEcCCCC--Ccc
Confidence            9999999999999999999999994                            7799999999999998877644  444


Q ss_pred             CceEEEEEEEEeeeCCcccccCCCCc
Q psy669          214 SGESEATLEVLFENSTWENFHETNIT  239 (248)
Q Consensus       214 ~~t~~~~i~v~d~nd~~P~f~~~~~~  239 (248)
                      +.++.+.|.++|+|||.|.|....|.
T Consensus       561 ~tstsI~Vtv~dvndndP~Ft~~eyt  586 (2531)
T KOG4289|consen  561 STSTSISVTVLDVNDNDPTFTQKEYT  586 (2531)
T ss_pred             cccceEEEEecccCCCCCccccCceE
Confidence            55679999999999999999666554


No 3  
>KOG1219|consensus
Probab=100.00  E-value=3.2e-38  Score=299.60  Aligned_cols=231  Identities=31%  Similarity=0.419  Sum_probs=216.9

Q ss_pred             cEEccCCCCccCcEEEEEEEEEECCCCCCeeeEEEEEEEEecCCC--CCccCCCceEEEEECCCCCCeeEEEEEEEeCCC
Q psy669            2 IRTSTILDVETKTHYWLTVFAQDHGVVPLHARTDVYIQVQNVNDN--VPMTLDPVYYAQIVENQSGILPIVQLAASDGDL   79 (248)
Q Consensus         2 i~~~~~LD~E~~~~y~l~v~a~D~~~~~~~~~~~v~i~V~DvNDn--~P~f~~~~~~~~v~E~~~~g~~v~~v~a~D~D~   79 (248)
                      |++.++||+|+...|.|.|.|.|.|.|.+++.+.+.|.++|+|.|  ||.|....-.++|.||+|+|+.+++|.|.|.|.
T Consensus       903 vfi~~~LDf~k~~fynLsv~a~d~g~p~lss~chl~Vevldv~enlhpp~F~~~v~e~~V~EnapiGT~vi~i~A~deds  982 (4289)
T KOG1219|consen  903 VFIGKPLDFEKSDFYNLSVTAVDRGTPILSSICHLEVEVLDVNENLHPPEFISFVTEGHVLENAPIGTIVIRIQARDEDS  982 (4289)
T ss_pred             EEEecccccccccceEEEEEEecCCCcceeeeEEEEEEEeccCCCCCCcchheeeeeeeEeecCCcceEEEEEEEecCCC
Confidence            567889999999999999999999999899999999999999987  999999999999999999999999999999999


Q ss_pred             CCCceEEEEEecCCCCCcEEEeCCCeEEEEecCCCCccccceeEEEEEEEeCCCCCCCceEEEEEEEEEecCCCCCeecc
Q psy669           80 DPDQRISYKISAGNPESYFNIDIGSGLISTTGRKLDRENQAEHVLEVTVTDDGAPAPLSSTTRVIISVEDVNDNAPEFEQ  159 (248)
Q Consensus        80 ~~~~~i~y~i~~~~~~~~F~id~~tg~i~~~~~~ld~e~~~~~~l~v~a~d~~~p~~~~~~~~v~I~V~d~Nd~~P~f~~  159 (248)
                      |..+.++|+|..|+..+.|+|+..+|.| .+...||||....|.|+|.|+|.|.+ ++++.+.+.|.|.|+|||+|+|.+
T Consensus       983 gldg~l~Y~I~~gdg~g~FsId~~tG~i-rTl~~lDrE~ks~YwltveA~D~gt~-~~ssv~~vyI~ieDvNDn~Pq~s~ 1060 (4289)
T KOG1219|consen  983 GLDGELSYKIRTGDGDGIFSIDSTTGSI-RTLKALDREKKSSYWLTVEAKDLGTV-PLSSVCEVYIEIEDVNDNVPQFSS 1060 (4289)
T ss_pred             CccceEEEEEEcCCcceeEEecCCcceE-eechhhchhhcceEEEEEEEEecCCC-ccccceeEEEEEEecCCCCcccCC
Confidence            9999999999999888999999999999 78889999999999999999999999 999999999999999999999999


Q ss_pred             CeeEEEecCCCCCCeeEEEEEEEeCCC-------------------------------CCCCccCceEEEEEEccCCCCc
Q psy669          160 NFYHVNIPETRHRDEPLSQVSLISTTG-------------------------------RKLDRENQAEHVLETGSGESEA  208 (248)
Q Consensus       160 ~~y~~~v~E~~~~g~~v~~v~A~D~D~-------------------------------~~ld~e~~~~y~~~v~~~~~~~  208 (248)
                      +.|..+|.|+++.+..|.++.|.|+|.                               ..||||.+.++.|.+.-.|++-
T Consensus      1061 pvy~asI~enSp~~vsivq~ea~D~Dsssn~kLmykI~sGnyq~FF~Id~~TG~iTt~r~LDRE~qdEHiLeVTi~D~ge 1140 (4289)
T KOG1219|consen 1061 PVYYASISENSPETVSIVQAEANDPDSSSNQKLMYKITSGNYQGFFQIDPETGLITTIRRLDREKQDEHILEVTIQDNGE 1140 (4289)
T ss_pred             ceEeeeeccCCCCceEEEEeccCCCCcccCcceEEEEccCCccceEEEccccceeeeehhhcccccccceEEEEEecCCC
Confidence            999999999999999999999999993                               6799999999999987777633


Q ss_pred             eeecCCceEEEEEEEEeeeCCcccccCC
Q psy669          209 TLETGSGESEATLEVLFENSTWENFHET  236 (248)
Q Consensus       209 ~~~~~~~t~~~~i~v~d~nd~~P~f~~~  236 (248)
                        ........+.|.|+|+|||+|.|.+.
T Consensus      1141 --p~l~s~~rviV~IldvNdnsp~Flqk 1166 (4289)
T KOG1219|consen 1141 --PWLCSNQRVIVSILDVNDNSPRFLQK 1166 (4289)
T ss_pred             --CccccceEEEEEEeeccCCchhhhhh
Confidence              55566789999999999999999554


No 4  
>KOG1219|consensus
Probab=100.00  E-value=5.3e-37  Score=291.41  Aligned_cols=225  Identities=30%  Similarity=0.396  Sum_probs=209.8

Q ss_pred             cEEccCCCCccCcEEEEEEEEEECCCCCCeeeEEEEEEEEecCCCCCccCCCceEEEEECCCCCCeeEEEEEEEeCCCCC
Q psy669            2 IRTSTILDVETKTHYWLTVFAQDHGVVPLHARTDVYIQVQNVNDNVPMTLDPVYYAQIVENQSGILPIVQLAASDGDLDP   81 (248)
Q Consensus         2 i~~~~~LD~E~~~~y~l~v~a~D~~~~~~~~~~~v~i~V~DvNDn~P~f~~~~~~~~v~E~~~~g~~v~~v~a~D~D~~~   81 (248)
                      |.+.++||+|+.++|.++|+|.+.++  .-+.+.|.|.|.|+|||+|.|..+.|.+.+.||+|.|+.|++++|.|.|.+.
T Consensus      2634 i~v~ksLD~E~kk~yqi~v~a~~~~~--vva~tsv~vqVkDvNDNaPvFe~d~y~f~i~En~pvGtsV~qf~AsD~Ds~~ 2711 (4289)
T KOG1219|consen 2634 IKVNKSLDHEKKKSYQIKVKATCGQW--VVAETSVFVQVKDVNDNAPVFEKDPYLFIIEENSPVGTSVIQFHASDMDSGN 2711 (4289)
T ss_pred             EEeccccchhhhceEEEEEEeecCCc--eEEEEEEEEEeecccCCCccccCCceeEEEeccCCCCceEEEEEeeccCCCC
Confidence            78999999999999999999998874  6789999999999999999999999999999999999999999999999999


Q ss_pred             CceEEEEEecCCCCCcEEEeCCCeEEEEecCCCCccccceeEEEEEEEeCCCCCCCceEEEEEEEEEecCCCCCeeccCe
Q psy669           82 DQRISYKISAGNPESYFNIDIGSGLISTTGRKLDRENQAEHVLEVTVTDDGAPAPLSSTTRVIISVEDVNDNAPEFEQNF  161 (248)
Q Consensus        82 ~~~i~y~i~~~~~~~~F~id~~tg~i~~~~~~ld~e~~~~~~l~v~a~d~~~p~~~~~~~~v~I~V~d~Nd~~P~f~~~~  161 (248)
                      ||+++|++.+..+  +|.|+++||+| ++...||+|+...|.|.|.|+|+|.|   ++.++|.|+|.|+||++|.|+...
T Consensus      2712 nGqirysl~~~v~--yF~In~etGwl-Tt~~eld~ek~d~y~lkv~AtDhG~~---ssq~~v~v~vtDvndspprf~~ei 2785 (4289)
T KOG1219|consen 2712 NGQIRYSLTSPVP--YFAINPETGWL-TTLFELDLEKQDLYSLKVVATDHGVP---SSQATVLVHVTDVNDSPPRFQREI 2785 (4289)
T ss_pred             CceEEEEEcCCcc--eEEEcCCCCee-eehhhhccccCCceEEEEEEecCCcc---cccceEEEEEEecCCCcchhhhHh
Confidence            9999999987655  99999999999 88899999999999999999999966   678999999999999999999999


Q ss_pred             eEEEecCCCCCCeeEEEEEEEeCCC-------------------------------CCCCccCceEEEEEEccCCCCcee
Q psy669          162 YHVNIPETRHRDEPLSQVSLISTTG-------------------------------RKLDRENQAEHVLETGSGESEATL  210 (248)
Q Consensus       162 y~~~v~E~~~~g~~v~~v~A~D~D~-------------------------------~~ld~e~~~~y~~~v~~~~~~~~~  210 (248)
                      |..++.|+.+.|.+++.++-.|+|.                               +++|+|....|.|++.+.++.|. 
T Consensus      2786 y~gtvv~d~p~~~~ia~~si~d~D~s~~nq~t~fI~~gd~~gqF~~i~ne~~~~~kKt~~~E~t~ny~Ltvtatdg~f~- 2864 (4289)
T KOG1219|consen 2786 YEGTVVEDVPGGKIIAGLSIFDADVSEVNQVTGFITLGDPLGQFWIIENEWIYEFKKTLDRESTKNYLLTVTATDGIFM- 2864 (4289)
T ss_pred             hccceeccCCCCceeeeeEeccccccccceeEEEEeCCCccceEEEEcCcceEEEecchhhhcccceEEEEEEecccee-
Confidence            9999999999999999999999994                               78999999999999999988663 


Q ss_pred             ecCCceEEEEEEEEeeeCCcccccCCCCc
Q psy669          211 ETGSGESEATLEVLFENSTWENFHETNIT  239 (248)
Q Consensus       211 ~~~~~t~~~~i~v~d~nd~~P~f~~~~~~  239 (248)
                          .+..+.|.++|.|||.|.+.+.+|.
T Consensus      2865 ----~s~~vkv~v~~s~dn~~~c~~~~~t 2889 (4289)
T KOG1219|consen 2865 ----NSDNVKVLVLDSNDNSPFCGNQLYT 2889 (4289)
T ss_pred             ----ccceEEEEeeccccCCccCcchhcc
Confidence                3568999999999999999666554


No 5  
>cd00031 CA Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here
Probab=99.96  E-value=1.4e-26  Score=184.92  Aligned_cols=131  Identities=37%  Similarity=0.579  Sum_probs=123.1

Q ss_pred             ceEEEEECCCCCCeeEEEEEEEeCCCCCCceEEEEEecCCCCCcEEEeCCCeEEEEecCCCCccccceeEEEEEEEeCCC
Q psy669           54 VYYAQIVENQSGILPIVQLAASDGDLDPDQRISYKISAGNPESYFNIDIGSGLISTTGRKLDRENQAEHVLEVTVTDDGA  133 (248)
Q Consensus        54 ~~~~~v~E~~~~g~~v~~v~a~D~D~~~~~~i~y~i~~~~~~~~F~id~~tg~i~~~~~~ld~e~~~~~~l~v~a~d~~~  133 (248)
                      .|.+.|.|+.+.|+.++++.|.|+|.+.++.++|+|.++....+|.|++.+|.| .+.+.||||....|.|.|.|+|.|.
T Consensus         1 ~~~~~i~En~~~g~~v~~~~a~D~D~~~~~~~~y~i~~~~~~~~F~i~~~tG~l-~~~~~lD~e~~~~~~l~v~a~D~g~   79 (199)
T cd00031           1 SYSVSVPENAPPGTVVGTVSATDPDSGENGRVTYSILGGNEDGLFSIDPNTGVI-TTTKPLDREEQSEYTLTVVASDGGG   79 (199)
T ss_pred             CeEEEEeCCCCCCCEEEEEEEECCCCCCCceEEEEEeCCCCcccEEEeCCCCEE-EECCCCCCcCCceEEEEEEEEECCc
Confidence            378999999999999999999999998889999999987766899999999999 7888999999999999999999888


Q ss_pred             CCCCceEEEEEEEEEecCCCCCeeccCeeEEEecCCCCCCeeEEEEEEEeCCC
Q psy669          134 PAPLSSTTRVIISVEDVNDNAPEFEQNFYHVNIPETRHRDEPLSQVSLISTTG  186 (248)
Q Consensus       134 p~~~~~~~~v~I~V~d~Nd~~P~f~~~~y~~~v~E~~~~g~~v~~v~A~D~D~  186 (248)
                      | .++++..++|.|.|+||++|.|....|.+.+.|+.+.|+.++++.|+|+|.
T Consensus        80 ~-~~~~~~~v~I~V~d~Nd~~P~~~~~~~~~~v~e~~~~~~~i~~~~a~D~D~  131 (199)
T cd00031          80 P-PLSSTATVTVTVLDVNDNPPVFEQSSYEASVPENAPPGTVVGTVTATDADS  131 (199)
T ss_pred             C-cceeEEEEEEEEccCCCCCCcccccceEEEEeCCCCCCCEEEEEEEEcCCC
Confidence            7 667899999999999999999998999999999999999999999999995


No 6  
>cd00031 CA Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here
Probab=99.95  E-value=7.9e-26  Score=180.51  Aligned_cols=147  Identities=38%  Similarity=0.547  Sum_probs=136.6

Q ss_pred             CcEEccCCCCccCcEEEEEEEEEECCCCCCeeeEEEEEEEEecCCCCCccCCCceEEEEECCCCCCeeEEEEEEEeCCCC
Q psy669            1 NIRTSTILDVETKTHYWLTVFAQDHGVVPLHARTDVYIQVQNVNDNVPMTLDPVYYAQIVENQSGILPIVQLAASDGDLD   80 (248)
Q Consensus         1 ~i~~~~~LD~E~~~~y~l~v~a~D~~~~~~~~~~~v~i~V~DvNDn~P~f~~~~~~~~v~E~~~~g~~v~~v~a~D~D~~   80 (248)
                      +|++++.||||....|.|.|+|.|.|.+.+++...+.|.|.|+||++|.|....|.+.|.|+.++|+.++++.|+|+|.+
T Consensus        53 ~l~~~~~lD~e~~~~~~l~v~a~D~g~~~~~~~~~v~I~V~d~Nd~~P~~~~~~~~~~v~e~~~~~~~i~~~~a~D~D~~  132 (199)
T cd00031          53 VITTTKPLDREEQSEYTLTVVASDGGGPPLSSTATVTVTVLDVNDNPPVFEQSSYEASVPENAPPGTVVGTVTATDADSG  132 (199)
T ss_pred             EEEECCCCCCcCCceEEEEEEEEECCcCcceeEEEEEEEEccCCCCCCcccccceEEEEeCCCCCCCEEEEEEEEcCCCC
Confidence            36788899999999999999999988777779999999999999999999988999999999999999999999999998


Q ss_pred             CCceEEEEEecCCCCCcEEEeCCCeEEEEecCCCCccccceeEEEEEEEeCCCCCCCceEEEEEEEEEe
Q psy669           81 PDQRISYKISAGNPESYFNIDIGSGLISTTGRKLDRENQAEHVLEVTVTDDGAPAPLSSTTRVIISVED  149 (248)
Q Consensus        81 ~~~~i~y~i~~~~~~~~F~id~~tg~i~~~~~~ld~e~~~~~~l~v~a~d~~~p~~~~~~~~v~I~V~d  149 (248)
                      .++.++|+|.++....+|.|++.+|.| .+.+.||||....|.|.|.|+|.+.| .+++++.++|.|.|
T Consensus       133 ~~~~~~y~l~~~~~~~~f~i~~~~G~i-~~~~~ld~e~~~~~~l~v~a~D~~~~-~~~~~~~i~i~v~d  199 (199)
T cd00031         133 ENAKLTYSILSGNDKELFSIDPNTGII-TLAKPLDREEKSSYELTVVATDGGGP-PLSSTATVTVTVLD  199 (199)
T ss_pred             CCccEEEEEeCCCCCCEEEEeCCceEE-EeCCccCCccCceEEEEEEEEECCCC-CceeEEEEEEEEEC
Confidence            889999999977655799999999999 78889999999999999999999887 88899999999875


No 7  
>PF00028 Cadherin:  Cadherin domain;  InterPro: IPR002126 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion.; GO: 0005509 calcium ion binding, 0007156 homophilic cell adhesion, 0016020 membrane; PDB: 2A4E_A 2A4C_B 2O72_A 2QVI_A 1NCJ_A 3Q2W_A 3Q2N_A 3LNH_B 3LNI_A 3Q2L_A ....
Probab=99.82  E-value=3.2e-19  Score=125.24  Aligned_cols=92  Identities=41%  Similarity=0.604  Sum_probs=87.7

Q ss_pred             eEEEEECCCCCCeeEEEEEEEeCCCCCCceEEEEEecCCCCCcEEEeCCCeEEEEecCCCCccccceeEEEEEEEeC-CC
Q psy669           55 YYAQIVENQSGILPIVQLAASDGDLDPDQRISYKISAGNPESYFNIDIGSGLISTTGRKLDRENQAEHVLEVTVTDD-GA  133 (248)
Q Consensus        55 ~~~~v~E~~~~g~~v~~v~a~D~D~~~~~~i~y~i~~~~~~~~F~id~~tg~i~~~~~~ld~e~~~~~~l~v~a~d~-~~  133 (248)
                      |.++|+|+.++|+.++++.|.|+|.+.|+.+.|+|..+++.++|.|++.+|.| .+.+.||||....|.|.|.|+|. +.
T Consensus         1 Y~~~v~E~~~~g~~v~~v~a~D~D~~~n~~i~y~i~~~~~~~~F~I~~~tg~i-~~~~~LD~E~~~~y~l~v~a~D~~~~   79 (93)
T PF00028_consen    1 YSFSVPENAPPGTVVGQVTATDPDSGPNSQITYSILGGNPDGLFSIDPNTGEI-SLKKPLDRETQSSYQLTVRATDSGGS   79 (93)
T ss_dssp             EEEEEETTGSTSSEEEEEEEEESSTSTTSSEEEEEEETTSTTSEEEETTTTEE-EESSSSCTTTTSEEEEEEEEEETTTS
T ss_pred             CEEEEECCCCCCCEEEEEEEEeCCCCCCceEEEEEecCcccCceEEeeeeecc-ccceecCcccCCEEEEEEEEEECCCC
Confidence            78999999999999999999999998899999999998888999999999999 89999999999999999999999 88


Q ss_pred             CCCCceEEEEEEEEE
Q psy669          134 PAPLSSTTRVIISVE  148 (248)
Q Consensus       134 p~~~~~~~~v~I~V~  148 (248)
                      | ++++++.|+|+|+
T Consensus        80 ~-~~~~~~~V~I~V~   93 (93)
T PF00028_consen   80 P-PLSSTATVTINVL   93 (93)
T ss_dssp             S-EEEEEEEEEEEEE
T ss_pred             C-CCEEEEEEEEEEC
Confidence            8 8999999999885


No 8  
>smart00112 CA Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
Probab=99.74  E-value=3e-17  Score=111.64  Aligned_cols=79  Identities=49%  Similarity=0.754  Sum_probs=72.9

Q ss_pred             EeCCCCCCceEEEEEecCCCCCcEEEeCCCeEEEEecCCCCccccceeEEEEEEEeCCCCCCCceEEEEEEEEEecCCCC
Q psy669           75 SDGDLDPDQRISYKISAGNPESYFNIDIGSGLISTTGRKLDRENQAEHVLEVTVTDDGAPAPLSSTTRVIISVEDVNDNA  154 (248)
Q Consensus        75 ~D~D~~~~~~i~y~i~~~~~~~~F~id~~tg~i~~~~~~ld~e~~~~~~l~v~a~d~~~p~~~~~~~~v~I~V~d~Nd~~  154 (248)
                      +|+|.|.|+.++|+|.++....+|.|++.+|.| .+.++||||....|.|.|.|+|.|.| ++++.+.|+|+|.|+|||+
T Consensus         1 ~D~D~g~n~~i~Y~i~~~~~~~~F~i~~~tg~i-~~~~~LD~e~~~~y~l~v~a~D~~~~-~~~~~~~v~I~V~D~Nd~~   78 (79)
T smart00112        1 TDADSGENGKVTYSILSGNEDGLFSIDPETGEI-TTTKPLDREEQPEYTLTVEATDGGGP-PLSSTATVTVTVLDVNDNA   78 (79)
T ss_pred             CCCCCCcCcEEEEEEecCCCCCEEEEeCCccEE-EeCCccCeeCCCeEEEEEEEEECCCC-CcccEEEEEEEEEECCCCC
Confidence            478888899999999987755899999999988 77779999999999999999999998 8999999999999999999


Q ss_pred             C
Q psy669          155 P  155 (248)
Q Consensus       155 P  155 (248)
                      |
T Consensus        79 P   79 (79)
T smart00112       79 P   79 (79)
T ss_pred             C
Confidence            8


No 9  
>KOG1834|consensus
Probab=99.74  E-value=6.1e-17  Score=142.68  Aligned_cols=143  Identities=29%  Similarity=0.394  Sum_probs=123.3

Q ss_pred             cEEccCCCCccCcEEEEEEEEEECCCC------CCeeeEEEEEEEEecCCCCCccCCCceEEEEECCCCCCeeEEEEEEE
Q psy669            2 IRTSTILDVETKTHYWLTVFAQDHGVV------PLHARTDVYIQVQNVNDNVPMTLDPVYYAQIVENQSGILPIVQLAAS   75 (248)
Q Consensus         2 i~~~~~LD~E~~~~y~l~v~a~D~~~~------~~~~~~~v~i~V~DvNDn~P~f~~~~~~~~v~E~~~~g~~v~~v~a~   75 (248)
                      |+++.+||.|.++.|+|+|+|.|+|..      ..+..++|+|+|.|+|..+|.|..+.|.+.|.|.. .-..|+++.|.
T Consensus        95 lRaK~~lDCelqkeytf~iQAydCg~gpdgtn~kKShkatvhIrVkDvNe~AP~f~ep~Yka~V~EGK-~yd~il~veAi  173 (952)
T KOG1834|consen   95 LRAKEPLDCELQKEYTFTIQAYDCGNGPDGTNTKKSHKATVHIRVKDVNEFAPVFKEPWYKAHVTEGK-VYDSILRVEAI  173 (952)
T ss_pred             EeecCcccccccccceEEEEEEecCCCCCccccccccceEEEEEeccccccCchhcccceeeEEecce-eeeeeEEEEee
Confidence            678899999999999999999997743      36778999999999999999999999999999984 45779999999


Q ss_pred             eCCCCC-Cc-eEEEEEecCCCCCcEEEeCCCeEEEEecCCCCccccceeEEEEEEEeCCCCCCCceEEEEEEEEEec
Q psy669           76 DGDLDP-DQ-RISYKISAGNPESYFNIDIGSGLISTTGRKLDRENQAEHVLEVTVTDDGAPAPLSSTTRVIISVEDV  150 (248)
Q Consensus        76 D~D~~~-~~-~i~y~i~~~~~~~~F~id~~tg~i~~~~~~ld~e~~~~~~l~v~a~d~~~p~~~~~~~~v~I~V~d~  150 (248)
                      |.|.++ ++ ...|.|..  +.-.|.||.. |.| ....+|.|....+|.|+|.|.|.|.. +..+.+.|+|.|...
T Consensus       174 D~DCspq~sqIC~YEI~t--~d~PFaIdn~-G~i-rnTekLny~ke~~Y~ltVtAyDCg~k-raa~d~lV~v~Vkp~  245 (952)
T KOG1834|consen  174 DKDCSPQYSQICEYEITT--PDVPFAIDND-GNI-RNTEKLNYTKEHQYKLTVTAYDCGKK-RAASDSLVTVHVKPT  245 (952)
T ss_pred             cCCCCCcccceeEEEecC--CCCceEEcCC-Ccc-ccccccccccceeEEEEEEEEecccc-cccCcceEEEEecCc
Confidence            999976 55 46899974  5568999987 999 77889999999999999999999976 445557788888654


No 10 
>KOG1834|consensus
Probab=99.58  E-value=9.6e-14  Score=122.77  Aligned_cols=180  Identities=23%  Similarity=0.283  Sum_probs=135.8

Q ss_pred             EEEecCCCCCccCCCceEEEEECCCCCCeeEEEEEEEeCCCCC---CceEEEEEecCCC-CCcEEEeCCC--eEEEEecC
Q psy669           39 QVQNVNDNVPMTLDPVYYAQIVENQSGILPIVQLAASDGDLDP---DQRISYKISAGNP-ESYFNIDIGS--GLISTTGR  112 (248)
Q Consensus        39 ~V~DvNDn~P~f~~~~~~~~v~E~~~~g~~v~~v~a~D~D~~~---~~~i~y~i~~~~~-~~~F~id~~t--g~i~~~~~  112 (248)
                      ...-+|-+.|.... .|.+-|.||...-.....+.|.|.|.+.   ....-|.|.+..- ...--+|..|  |.| ..+.
T Consensus        22 ~aarankhkpwie~-ey~gvV~Endntvll~Ppl~aLdkdaplr~ageiC~fklhgq~vPFdavVvdK~TGegvl-RaK~   99 (952)
T KOG1834|consen   22 HAARANKHKPWIEE-EYHGVVTENDNTVLLDPPLAALDKDAPLRYAGEICGFKLHGQPVPFDAVVVDKYTGEGVL-RAKE   99 (952)
T ss_pred             ccccccccCccccc-ceeEEEEeCCceEEeCCCeeeecCCCCcccccccceeEecCCCCCceEEEEeccCCceEE-eecC
Confidence            34456667776654 4999999997543334468899999754   2346788875332 1233447776  467 7889


Q ss_pred             CCCccccceeEEEEEEEeCCCCC-----CCceEEEEEEEEEecCCCCCeeccCeeEEEecCCCCCCeeEEEEEEEeCCC-
Q psy669          113 KLDRENQAEHVLEVTVTDDGAPA-----PLSSTTRVIISVEDVNDNAPEFEQNFYHVNIPETRHRDEPLSQVSLISTTG-  186 (248)
Q Consensus       113 ~ld~e~~~~~~l~v~a~d~~~p~-----~~~~~~~v~I~V~d~Nd~~P~f~~~~y~~~v~E~~~~g~~v~~v~A~D~D~-  186 (248)
                      +||+|.++.|+|+|+|.|.|.-.     ..+..++|.|+|.|+|+++|+|..+.|.+.|.|+ +.-..|++|.|.|.|- 
T Consensus       100 ~lDCelqkeytf~iQAydCg~gpdgtn~kKShkatvhIrVkDvNe~AP~f~ep~Yka~V~EG-K~yd~il~veAiD~DCs  178 (952)
T KOG1834|consen  100 PLDCELQKEYTFTIQAYDCGNGPDGTNTKKSHKATVHIRVKDVNEFAPVFKEPWYKAHVTEG-KVYDSILRVEAIDKDCS  178 (952)
T ss_pred             cccccccccceEEEEEEecCCCCCccccccccceEEEEEeccccccCchhcccceeeEEecc-eeeeeeEEEEeecCCCC
Confidence            99999999999999999977541     4577899999999999999999999999999999 7788899999999983 


Q ss_pred             -----------------------------CCCCccCceEEEEEEccCCCCceeecCCceEEEEEEE
Q psy669          187 -----------------------------RKLDRENQAEHVLETGSGESEATLETGSGESEATLEV  223 (248)
Q Consensus       187 -----------------------------~~ld~e~~~~y~~~v~~~~~~~~~~~~~~t~~~~i~v  223 (248)
                                                   .+|.|..+..|.|++.+-+-.-.  ....-..|+|+|
T Consensus       179 pq~sqIC~YEI~t~d~PFaIdn~G~irnTekLny~ke~~Y~ltVtAyDCg~k--raa~d~lV~v~V  242 (952)
T KOG1834|consen  179 PQYSQICEYEITTPDVPFAIDNDGNIRNTEKLNYTKEHQYKLTVTAYDCGKK--RAASDSLVTVHV  242 (952)
T ss_pred             CcccceeEEEecCCCCceEEcCCCccccccccccccceeEEEEEEEEecccc--cccCcceEEEEe
Confidence                                         56888889999999977765332  222234666666


No 11 
>smart00112 CA Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
Probab=99.16  E-value=9.8e-11  Score=79.30  Aligned_cols=47  Identities=45%  Similarity=0.567  Sum_probs=44.9

Q ss_pred             cEEccCCCCccCcEEEEEEEEEECCCCCCeeeEEEEEEEEecCCCCC
Q psy669            2 IRTSTILDVETKTHYWLTVFAQDHGVVPLHARTDVYIQVQNVNDNVP   48 (248)
Q Consensus         2 i~~~~~LD~E~~~~y~l~v~a~D~~~~~~~~~~~v~i~V~DvNDn~P   48 (248)
                      |++.++||||..+.|.|.|+|.|.+.+++++.+.|+|.|.|+|||+|
T Consensus        33 i~~~~~LD~e~~~~y~l~v~a~D~~~~~~~~~~~v~I~V~D~Nd~~P   79 (79)
T smart00112       33 ITTTKPLDREEQPEYTLTVEATDGGGPPLSSTATVTVTVLDVNDNAP   79 (79)
T ss_pred             EEeCCccCeeCCCeEEEEEEEEECCCCCcccEEEEEEEEEECCCCCC
Confidence            67788999999999999999999999899999999999999999998


No 12 
>PF00028 Cadherin:  Cadherin domain;  InterPro: IPR002126 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion.; GO: 0005509 calcium ion binding, 0007156 homophilic cell adhesion, 0016020 membrane; PDB: 2A4E_A 2A4C_B 2O72_A 2QVI_A 1NCJ_A 3Q2W_A 3Q2N_A 3LNH_B 3LNI_A 3Q2L_A ....
Probab=98.62  E-value=1.4e-07  Score=65.71  Aligned_cols=61  Identities=25%  Similarity=0.337  Sum_probs=44.6

Q ss_pred             eEEEecCCCCCCeeEEEEEEEeCCC-------------------------------CCCCccCceEEEEEEccCCCCcee
Q psy669          162 YHVNIPETRHRDEPLSQVSLISTTG-------------------------------RKLDRENQAEHVLETGSGESEATL  210 (248)
Q Consensus       162 y~~~v~E~~~~g~~v~~v~A~D~D~-------------------------------~~ld~e~~~~y~~~v~~~~~~~~~  210 (248)
                      |++.|+|++++|+.|+++.|.|+|.                               ++||||....|.|.+.+.+. ...
T Consensus         1 Y~~~v~E~~~~g~~v~~v~a~D~D~~~n~~i~y~i~~~~~~~~F~I~~~tg~i~~~~~LD~E~~~~y~l~v~a~D~-~~~   79 (93)
T PF00028_consen    1 YSFSVPENAPPGTVVGQVTATDPDSGPNSQITYSILGGNPDGLFSIDPNTGEISLKKPLDRETQSSYQLTVRATDS-GGS   79 (93)
T ss_dssp             EEEEEETTGSTSSEEEEEEEEESSTSTTSSEEEEEEETTSTTSEEEETTTTEEEESSSSCTTTTSEEEEEEEEEET-TTS
T ss_pred             CEEEEECCCCCCCEEEEEEEEeCCCCCCceEEEEEecCcccCceEEeeeeeccccceecCcccCCEEEEEEEEEEC-CCC
Confidence            7899999999999999999999993                               56777777777777766665 111


Q ss_pred             ecCCceEEEEEEE
Q psy669          211 ETGSGESEATLEV  223 (248)
Q Consensus       211 ~~~~~t~~~~i~v  223 (248)
                      .....+++++|+|
T Consensus        80 ~~~~~~~~V~I~V   92 (93)
T PF00028_consen   80 PPLSSTATVTINV   92 (93)
T ss_dssp             SEEEEEEEEEEEE
T ss_pred             CCCEEEEEEEEEE
Confidence            2333455555555


No 13 
>PF08266 Cadherin_2:  Cadherin-like;  InterPro: IPR013164 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion. This entry represents a cadherin domain that is usually found at the N terminus of cadherin proteins.; PDB: 1WUZ_A 1WYJ_A.
Probab=97.86  E-value=2.7e-05  Score=53.01  Aligned_cols=63  Identities=22%  Similarity=0.346  Sum_probs=40.7

Q ss_pred             eEEEEECCCCCCeeEEEEEEEeCCCCCC--ceEEEEEecCCCCCcEEEeCCCeEEEEecCCCCcccc
Q psy669           55 YYAQIVENQSGILPIVQLAASDGDLDPD--QRISYKISAGNPESYFNIDIGSGLISTTGRKLDRENQ  119 (248)
Q Consensus        55 ~~~~v~E~~~~g~~v~~v~a~D~D~~~~--~~i~y~i~~~~~~~~F~id~~tg~i~~~~~~ld~e~~  119 (248)
                      ...+|+|..++|+.|+.| |.|......  ....|++.+.....+|.++..+|.+ .+...+|||..
T Consensus         3 i~YsV~EE~~~Gt~IGni-a~dL~l~~~~l~~~~~ri~s~~~~~~~~v~~~tG~L-~v~~rIDRE~L   67 (84)
T PF08266_consen    3 IRYSVPEEMPPGTVIGNI-AKDLGLDPQSLSSRNFRIVSEGNSQYFRVNEKTGDL-FVSERIDREEL   67 (84)
T ss_dssp             EEEEEESS--TT-EEEEC-CCCCT--HHHHCCTTBEEE-SSSS-SEEE-TTTSEE-EESS--SCCCC
T ss_pred             eEEEeecCCCCCCEEEEh-HHhhCCCcccccccceEEeecCCcceeEecCCceeE-EeCCccCHHHH
Confidence            467899999999999998 445433211  1236777776667899999999999 88899999963


No 14 
>PF08758 Cadherin_pro:  Cadherin prodomain like;  InterPro: IPR014868 Cadherins are a group of proteins that mediate calcium dependent cell-cell adhesion. They are activated through cleavage of a prosequence in the late Golgi. This protein corresponds to the folded region of the prosequence, and is termed the prodomain. The prodomain shows structural resemblance to the cadherin domain, but lacks all the features known to be important for cadherin-cadherin interactions []. ; GO: 0007155 cell adhesion, 0016021 integral to membrane; PDB: 1OP4_A.
Probab=97.35  E-value=0.0034  Score=43.24  Aligned_cols=78  Identities=17%  Similarity=0.214  Sum_probs=41.4

Q ss_pred             CCccCCCceEEEEECCCCCCeeEEEEEEEeCCCCCCceEEEEEecCCCCCcEEEeCCCeEEEEecCCCCccccceeEEEE
Q psy669           47 VPMTLDPVYYAQIVENQSGILPIVQLAASDGDLDPDQRISYKISAGNPESYFNIDIGSGLISTTGRKLDRENQAEHVLEV  126 (248)
Q Consensus        47 ~P~f~~~~~~~~v~E~~~~g~~v~~v~a~D~D~~~~~~i~y~i~~~~~~~~F~id~~tg~i~~~~~~ld~e~~~~~~l~v  126 (248)
                      .|-|....|.+.|+.+...|..|++|.-.|...  +..+.|...  ++  .|.|..+ |.| .+.+.+.... ..-.|.|
T Consensus         3 ~pGF~~~~~~~~Vp~~l~~g~~lg~V~f~dC~~--~~~~~~~ss--Dp--dF~V~~D-GsV-y~~r~v~l~~-~~~~F~V   73 (90)
T PF08758_consen    3 RPGFSQKKYTFEVPSNLEAGQPLGKVNFEDCTG--RRRVIFESS--DP--DFRVLED-GSV-YAKRPVQLSS-EQRSFTV   73 (90)
T ss_dssp             --B--S-EEEE----SS-SS--EEE---B--SS-----EEEE-----S--EEEEETT-TEE-EEES--S-SS-S-EEEEE
T ss_pred             cCCcccceEEEEcCchhhCCcEEEEEEeccCCC--CCceEEecC--CC--CEEEcCC-CeE-EEeeeEecCC-CceEEEE
Confidence            588999999999999999999999999988753  556777663  44  7999988 999 7777776543 3358999


Q ss_pred             EEEeCCC
Q psy669          127 TVTDDGA  133 (248)
Q Consensus       127 ~a~d~~~  133 (248)
                      .|.|...
T Consensus        74 ~a~D~~~   80 (90)
T PF08758_consen   74 HAWDSQT   80 (90)
T ss_dssp             EEEETTT
T ss_pred             EEECCCC
Confidence            9999764


No 15 
>smart00736 CADG Dystroglycan-type cadherin-like domains. Cadherin-homologous domains present in metazoan dystroglycans and alpha/epsilon sarcoglycans, yeast Axl2p and in a very large protein from magnetotactic bacteria. Likely to bind calcium ions.
Probab=96.41  E-value=0.08  Score=36.93  Aligned_cols=70  Identities=24%  Similarity=0.394  Sum_probs=50.7

Q ss_pred             EEeCCCCCCceEEEEEecCC---CCCcEEEeCCCeEEEEecCCCCccccceeEEEEEEEeCCCCCCCceEEEEEEEEEec
Q psy669           74 ASDGDLDPDQRISYKISAGN---PESYFNIDIGSGLISTTGRKLDRENQAEHVLEVTVTDDGAPAPLSSTTRVIISVEDV  150 (248)
Q Consensus        74 a~D~D~~~~~~i~y~i~~~~---~~~~F~id~~tg~i~~~~~~ld~e~~~~~~l~v~a~d~~~p~~~~~~~~v~I~V~d~  150 (248)
                      ..|+|   +..++|++...+   -..|...++.++.+.-  .+.... ...|.+.|.|+|..   +.+....++|.|.+.
T Consensus        24 F~d~d---~~~lty~~~~~~~~~lP~Wl~fd~~~~~~~G--tP~~~~-~g~~~i~v~a~D~~---g~~~~~~f~i~V~~~   94 (97)
T smart00736       24 FTDAD---GDTLTYSATLSDGSALPSWLSFDSDTGTLSG--TPTNSD-VGSLSLKVTATDSS---GASASDTFTITVVNT   94 (97)
T ss_pred             eECCC---CCeEEEEEEeCCCCCCCCeEEEeCCCCEEEE--ECCCCC-CcEEEEEEEEEECC---CCEEEEEEEEEEeCC
Confidence            45666   457888886432   2468999999888732  334333 45699999999986   456788899999998


Q ss_pred             CC
Q psy669          151 ND  152 (248)
Q Consensus       151 Nd  152 (248)
                      |+
T Consensus        95 ~~   96 (97)
T smart00736       95 ND   96 (97)
T ss_pred             CC
Confidence            86


No 16 
>TIGR01965 VCBS_repeat VCBS repeat. This domain of about 100 residues is found multiple (up to 35) copies in long proteins from several species of Vibrio, Colwellia, Bradyrhizobium, and Shewanella (hence the name VCBS) and in smaller copy numbers in proteins from several other bacteria. The large protein size and repeat copy numbers, species distribution, and suggested activities of several member proteins suggests a role for this domain in adhesion.
Probab=94.03  E-value=1.4  Score=30.82  Aligned_cols=88  Identities=22%  Similarity=0.237  Sum_probs=52.7

Q ss_pred             EEEEEEeCCCCCCceEEEEEec-CCCCCcEEEeCCCeEEEEe-c------CCCCccccceeEEEEEEEeCCCCCCCceEE
Q psy669           70 VQLAASDGDLDPDQRISYKISA-GNPESYFNIDIGSGLISTT-G------RKLDRENQAEHVLEVTVTDDGAPAPLSSTT  141 (248)
Q Consensus        70 ~~v~a~D~D~~~~~~i~y~i~~-~~~~~~F~id~~tg~i~~~-~------~~ld~e~~~~~~l~v~a~d~~~p~~~~~~~  141 (248)
                      +++.++|+|.+.  ...+++.. ...-+.|.|+.. |..... .      +.|.....-.-.|++.+.|+       .+.
T Consensus         2 G~Lt~sD~D~gd--~~~~s~~~~~g~yGtlti~~~-G~wtYtl~n~~~avq~L~~Ge~~tdsFtvtv~DG-------tt~   71 (99)
T TIGR01965         2 GQLTISDADAGQ--AHFIAQTDAAGQYGTFSIDAD-GQWTYQADNSQTAVQALKAGETLTDTFTVTSADG-------TSQ   71 (99)
T ss_pred             CceEEeCCCCCC--ceEEecccccCCcEEEEECCC-CcEEEEeCCCcHHHHhhcCCCEEEEEEEEEEeCC-------CeE
Confidence            467888888753  34555532 122356888774 544221 1      12332233345777888875       278


Q ss_pred             EEEEEEEecCCCCCeeccCeeEEEecCC
Q psy669          142 RVIISVEDVNDNAPEFEQNFYHVNIPET  169 (248)
Q Consensus       142 ~v~I~V~d~Nd~~P~f~~~~y~~~v~E~  169 (248)
                      .|+|.|.-.|| +|+.... -...+.|+
T Consensus        72 ~vtItI~GtND-apvi~~~-~~g~v~ED   97 (99)
T TIGR01965        72 TVTITITGAND-AAVIGGA-DTGSVTED   97 (99)
T ss_pred             EEEEEEEccCC-CCEEecc-cceeEecC
Confidence            89999999999 7766533 25666665


No 17 
>TIGR00845 caca sodium/calcium exchanger 1. This model is specific for the eukaryotic sodium ion/calcium ion exchangers of the Caca family
Probab=92.90  E-value=10  Score=37.24  Aligned_cols=122  Identities=19%  Similarity=0.208  Sum_probs=63.1

Q ss_pred             cCCCCCccCCCceEEEEECCCCCCeeEEEEEEEeCCCCCCceEEEEEecCCCC--CcEEEeCCCeEEEEe---------c
Q psy669           43 VNDNVPMTLDPVYYAQIVENQSGILPIVQLAASDGDLDPDQRISYKISAGNPE--SYFNIDIGSGLISTT---------G  111 (248)
Q Consensus        43 vNDn~P~f~~~~~~~~v~E~~~~g~~v~~v~a~D~D~~~~~~i~y~i~~~~~~--~~F~id~~tg~i~~~---------~  111 (248)
                      .||..+.|....-...|.|+.  |+.-..|.-...|.+..-.+.|+..+|...  .-|.  +.+|.+...         -
T Consensus       395 ~dd~~s~i~Fe~~~Y~V~En~--GtV~VtV~R~GGdl~~tVsVdY~T~DGTA~AG~DY~--~~sGTLtF~PGEt~KtItV  470 (928)
T TIGR00845       395 ENDPVSKIFFEPGHYTCLENC--GTVALTVVRRGGDLTNTVYVDYRTEDGTANAGSDYE--FTEGTLVFKPGETQKEFRI  470 (928)
T ss_pred             ccCCcceEEecCCeEEEeecC--cEEEEEEEEccCCCCceEEEEEEccCCccCCCCCcc--ccCceEEECCCceEEEEEE
Confidence            455566665555677889985  666666655544544345688988765432  1122  223433110         0


Q ss_pred             CCCC---ccccceeEEEEEEEeCCC------------CCCCceEEEEEEEEEecCCCCCeeccCeeEEEecCC
Q psy669          112 RKLD---RENQAEHVLEVTVTDDGA------------PAPLSSTTRVIISVEDVNDNAPEFEQNFYHVNIPET  169 (248)
Q Consensus       112 ~~ld---~e~~~~~~l~v~a~d~~~------------p~~~~~~~~v~I~V~d~Nd~~P~f~~~~y~~~v~E~  169 (248)
                      ..+|   +|..+.|.+.+.--..+.            +..+......+|+|.| ||++|.|.-..-...|.|+
T Consensus       471 ~IIDDdi~E~DE~F~V~LSNp~~g~~~G~~~~~~~~~~A~Lg~ps~ATVTIlD-DD~aGIfsFe~~~~sV~Es  542 (928)
T TIGR00845       471 GIIDDDIFEEDEHFYVRLSNLRVGSEDGILEANHVSAVAQLASPNTATVTILD-DDHAGIFTFEEDVFHVSES  542 (928)
T ss_pred             EEccCCCCCCCceEEEEEeCCCCCCcccccccccccccceecCCceEEEEEec-CcccCcccccCceEEEEcC
Confidence            1122   344455555543321110            0012233456677776 6789986654446777786


No 18 
>PF13750 Big_3_3:  Bacterial Ig-like domain (group 3)
Probab=92.48  E-value=3.8  Score=31.32  Aligned_cols=126  Identities=21%  Similarity=0.202  Sum_probs=66.6

Q ss_pred             cEEEEEE-EEEECCCCCCeeeEEEEEEEEecCCCCCccCCCceEEEEECCCCC-CeeEEEEEEEeCCCCCCceEEEEEec
Q psy669           14 THYWLTV-FAQDHGVVPLHARTDVYIQVQNVNDNVPMTLDPVYYAQIVENQSG-ILPIVQLAASDGDLDPDQRISYKISA   91 (248)
Q Consensus        14 ~~y~l~v-~a~D~~~~~~~~~~~v~i~V~DvNDn~P~f~~~~~~~~v~E~~~~-g~~v~~v~a~D~D~~~~~~i~y~i~~   91 (248)
                      ..|.+++ .|.|..+.  ..+..+..++. +...+|...- .....+..+... |..=..+.++|.-.+. ..-+.+|.+
T Consensus        15 G~Y~l~~~~a~D~agN--~~~~~~~~~~~-iD~T~Ptisi-~~~~~~~~g~~v~~~~~i~i~~tD~~~~~-~i~sv~l~G   89 (158)
T PF13750_consen   15 GSYTLTVVTATDAAGN--TSTSTVSETFT-IDNTPPTISI-SDGASVANGSTVYGLVNISINVTDNSDDS-KITSVSLTG   89 (158)
T ss_pred             ccEEEEEEEEEecCCC--EEEEEEeeEEE-EcCCCCEEEE-ecCCccCCCccccceeeeEEEEEeCCCCc-eEEEEEEEC
Confidence            4599999 79998744  34444432322 2235776644 111223233222 2233567777766533 344677765


Q ss_pred             CCCCCcEEE--eC-CCeEEEEec--CCC-CccccceeEEEEEEEeCCCCCCCceEEEEEEEEE
Q psy669           92 GNPESYFNI--DI-GSGLISTTG--RKL-DRENQAEHVLEVTVTDDGAPAPLSSTTRVIISVE  148 (248)
Q Consensus        92 ~~~~~~F~i--d~-~tg~i~~~~--~~l-d~e~~~~~~l~v~a~d~~~p~~~~~~~~v~I~V~  148 (248)
                      |.....-.+  .. ..|.. ...  +.+ ..|....|.|+|.|+|..   +..++..+.....
T Consensus        90 g~~~d~v~ls~~~~~~~~~-~~~yp~~fpsle~~~~YtLtV~a~D~a---GN~~~~si~F~y~  148 (158)
T PF13750_consen   90 GPASDSVSLSWTNKGNGVY-TLEYPRIFPSLEADDSYTLTVSATDKA---GNQSTKSISFSYM  148 (158)
T ss_pred             CcccceEEEeeEeccCceE-EeecccccCCcCCCCeEEEEEEEEecC---CCEEEEEEEEEEe
Confidence            443322222  11 12333 221  111 236788899999999976   4556666666554


No 19 
>smart00736 CADG Dystroglycan-type cadherin-like domains. Cadherin-homologous domains present in metazoan dystroglycans and alpha/epsilon sarcoglycans, yeast Axl2p and in a very large protein from magnetotactic bacteria. Likely to bind calcium ions.
Probab=83.46  E-value=6.2  Score=27.25  Aligned_cols=31  Identities=32%  Similarity=0.219  Sum_probs=26.2

Q ss_pred             CcEEEEEEEEEECCCCCCeeeEEEEEEEEecCC
Q psy669           13 KTHYWLTVFAQDHGVVPLHARTDVYIQVQNVND   45 (248)
Q Consensus        13 ~~~y~l~v~a~D~~~~~~~~~~~v~i~V~DvND   45 (248)
                      ...|.++|.|+|..+  .+....+.|.|.+.|+
T Consensus        66 ~g~~~i~v~a~D~~g--~~~~~~f~i~V~~~~~   96 (97)
T smart00736       66 VGSLSLKVTATDSSG--ASASDTFTITVVNTND   96 (97)
T ss_pred             CcEEEEEEEEEECCC--CEEEEEEEEEEeCCCC
Confidence            456999999999874  6788889999999886


No 20 
>PF05345 He_PIG:  Putative Ig domain;  InterPro: IPR008009 This alignment represents the conserved core region of a ~90 residue repeat found in several haemagglutinins and other cell surface proteins. Sequence similarities to Hyalin (IPR003410 from INTERPRO) and the PKD domain (IPR000601 from INTERPRO) suggest an Ig-like fold so this family may be similar in function to the (IPR003791 from INTERPRO) and (IPR003790 from INTERPRO) protein families.
Probab=82.23  E-value=7.3  Score=23.34  Aligned_cols=35  Identities=23%  Similarity=0.280  Sum_probs=24.9

Q ss_pred             CCcEEEeCCCeEEEEecCCCCcc-ccceeEEEEEEEeCC
Q psy669           95 ESYFNIDIGSGLISTTGRKLDRE-NQAEHVLEVTVTDDG  132 (248)
Q Consensus        95 ~~~F~id~~tg~i~~~~~~ld~e-~~~~~~l~v~a~d~~  132 (248)
                      .....+|+.+|.| .=  ..+.. ....|.+.|.|+|..
T Consensus        13 P~gLs~d~~tG~i-sG--tp~~~~~~G~y~~~vtatd~~   48 (49)
T PF05345_consen   13 PSGLSLDPSTGTI-SG--TPTSSVQPGTYTFTVTATDGS   48 (49)
T ss_pred             CCcEEEeCCCCEE-Ee--ecCCCccccEEEEEEEEEcCC
Confidence            4678999999999 32  23333 335899999999863


No 21 
>TIGR00845 caca sodium/calcium exchanger 1. This model is specific for the eukaryotic sodium ion/calcium ion exchangers of the Caca family
Probab=79.57  E-value=68  Score=31.80  Aligned_cols=57  Identities=9%  Similarity=0.094  Sum_probs=33.1

Q ss_pred             eEEEEEEEEecCCCCCccCCCceEEEEECCCCCCeeEEEEEEEeCCCCCCceEEEEEecCC
Q psy669           33 RTDVYIQVQNVNDNVPMTLDPVYYAQIVENQSGILPIVQLAASDGDLDPDQRISYKISAGN   93 (248)
Q Consensus        33 ~~~v~i~V~DvNDn~P~f~~~~~~~~v~E~~~~g~~v~~v~a~D~D~~~~~~i~y~i~~~~   93 (248)
                      ....+|+|.| ||++|.|....-..+|.|+.  |..-.+|.-+- +....-.+.|+...|.
T Consensus       514 ps~ATVTIlD-DD~aGIfsFe~~~~sV~Es~--G~vtvtV~Rts-Ga~G~VtV~Y~T~dGT  570 (928)
T TIGR00845       514 PNTATVTILD-DDHAGIFTFEEDVFHVSESI--GIMEVKVLRTS-GARGTVIVPYRTVEGT  570 (928)
T ss_pred             CceEEEEEec-CcccCcccccCceEEEEcCC--CEEEEEEEEcC-CCCeeEEEEEEeecCc
Confidence            4466777777 68899887766677888884  44433332221 1111224567666543


No 22 
>TIGR03660 T1SS_rpt_143 T1SS-143 repeat domain. This model represents a domain of about 143 amino acids that may occur singly or in up to 23 tandem repeats in very large proteins in the genus Vibrio, and in related species such as Legionella pneumophila, Photobacterium profundum, Rhodopseudomonas palustris, Shewanella pealeana, and Aeromonas hydrophila. Proteins with these domains represent a subset of a broader set of proteins with a particular signal for type 1 secretion, consisting of several glycine-rich repeats modeled by pfam00353, followed by a C-terminal domain modeled by TIGR03661. Proteins with this domain tend to share several properties with the RtxA (Repeats in Toxin) protein of Vibrio cholerae, including a large size often containing tandemly repeated domains and a C-terminal signal for type 1 secretion.
Probab=77.85  E-value=14  Score=27.59  Aligned_cols=56  Identities=16%  Similarity=0.156  Sum_probs=36.9

Q ss_pred             cEEccCCCCcc---CcEEEEEEEEEECCCCCCeeeEEEEEEEEecCCCCCccCCCceEEEEECCC
Q psy669            2 IRTSTILDVET---KTHYWLTVFAQDHGVVPLHARTDVYIQVQNVNDNVPMTLDPVYYAQIVENQ   63 (248)
Q Consensus         2 i~~~~~LD~E~---~~~y~l~v~a~D~~~~~~~~~~~v~i~V~DvNDn~P~f~~~~~~~~v~E~~   63 (248)
                      ..+.++||+..   .-...|.|.|+|..+...  ...+.|+|.|  | .|...... ..+|.|..
T Consensus        70 ftL~~~lDH~~g~d~l~l~~~v~a~D~DGD~s--~~~l~VtI~D--D-~P~~~~~~-~~~V~E~~  128 (137)
T TIGR03660        70 FTLEGPLDHAAGSDELTLNFPIIATDFDGDTS--SITLPVTIVD--D-VPTITDVD-ALTVDEDD  128 (137)
T ss_pred             EEEcccccCCCCCceEEEeeeEEEEeCCCCcc--ccEEEEEEEC--C-CCeecccc-ceEEeccc
Confidence            35678888854   334788899999775433  4577787777  4 46665543 36787754


No 23 
>PF07495 Y_Y_Y:  Y_Y_Y domain;  InterPro: IPR011123 This region is mostly found at the end of the beta propellers (IPR011110 from INTERPRO) in a family of two component regulators. However they are also found tandemly repeated in Q891H4 from SWISSPROT without other signal conduction domains being present. It is named after the conserved tyrosines found in the alignment. The exact function is not known.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=74.98  E-value=18  Score=22.61  Aligned_cols=60  Identities=18%  Similarity=0.271  Sum_probs=33.4

Q ss_pred             CceEEEEEecCCCCCcEEEeCCCeEEEEecCCCCccccceeEEEEEEEeCCCCCCCceEEEEEEEEE
Q psy669           82 DQRISYKISAGNPESYFNIDIGSGLISTTGRKLDRENQAEHVLEVTVTDDGAPAPLSSTTRVIISVE  148 (248)
Q Consensus        82 ~~~i~y~i~~~~~~~~F~id~~tg~i~~~~~~ld~e~~~~~~l~v~a~d~~~p~~~~~~~~v~I~V~  148 (248)
                      +-..+|+|.+.+ ..+..+...+-.+ .. ..|   ....|.|.|.|.|.... .......+.|.|.
T Consensus         7 ~~~Y~Y~l~g~d-~~W~~~~~~~~~~-~~-~~L---~~G~Y~l~V~a~~~~~~-~~~~~~~l~i~I~   66 (66)
T PF07495_consen    7 NIRYRYRLEGFD-DEWITLGSYSNSI-SY-TNL---PPGKYTLEVRAKDNNGK-WSSDEKSLTITIL   66 (66)
T ss_dssp             TEEEEEEEETTE-SSEEEESSTS-EE-EE-ES-----SEEEEEEEEEEETTS--B-SS-EEEEEEEE
T ss_pred             ceEEEEEEECCC-CeEEECCCCcEEE-EE-EeC---CCEEEEEEEEEECCCCC-cCcccEEEEEEEC
Confidence            456778886433 5566665553244 11 223   57789999999996543 2222266666653


No 24 
>TIGR03660 T1SS_rpt_143 T1SS-143 repeat domain. This model represents a domain of about 143 amino acids that may occur singly or in up to 23 tandem repeats in very large proteins in the genus Vibrio, and in related species such as Legionella pneumophila, Photobacterium profundum, Rhodopseudomonas palustris, Shewanella pealeana, and Aeromonas hydrophila. Proteins with these domains represent a subset of a broader set of proteins with a particular signal for type 1 secretion, consisting of several glycine-rich repeats modeled by pfam00353, followed by a C-terminal domain modeled by TIGR03661. Proteins with this domain tend to share several properties with the RtxA (Repeats in Toxin) protein of Vibrio cholerae, including a large size often containing tandemly repeated domains and a C-terminal signal for type 1 secretion.
Probab=72.74  E-value=23  Score=26.36  Aligned_cols=54  Identities=19%  Similarity=0.227  Sum_probs=34.8

Q ss_pred             EecCCCCccc---cceeEEEEEEEeCCCCCCCceEEEEEEEEEecCCCCCeeccCeeEEEecCC
Q psy669          109 TTGRKLDREN---QAEHVLEVTVTDDGAPAPLSSTTRVIISVEDVNDNAPEFEQNFYHVNIPET  169 (248)
Q Consensus       109 ~~~~~ld~e~---~~~~~l~v~a~d~~~p~~~~~~~~v~I~V~d~Nd~~P~f~~~~y~~~v~E~  169 (248)
                      ++..+||+..   .-...|.|.|+|...   -.+...+.|+|.|  | .|+..... ...|.|+
T Consensus        71 tL~~~lDH~~g~d~l~l~~~v~a~D~DG---D~s~~~l~VtI~D--D-~P~~~~~~-~~~V~E~  127 (137)
T TIGR03660        71 TLEGPLDHAAGSDELTLNFPIIATDFDG---DTSSITLPVTIVD--D-VPTITDVD-ALTVDED  127 (137)
T ss_pred             EEcccccCCCCCceEEEeeeEEEEeCCC---CccccEEEEEEEC--C-CCeecccc-ceEEecc
Confidence            5556677643   334678899999652   2334588888887  4 57765443 4678775


No 25 
>PF07495 Y_Y_Y:  Y_Y_Y domain;  InterPro: IPR011123 This region is mostly found at the end of the beta propellers (IPR011110 from INTERPRO) in a family of two component regulators. However they are also found tandemly repeated in Q891H4 from SWISSPROT without other signal conduction domains being present. It is named after the conserved tyrosines found in the alignment. The exact function is not known.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=72.36  E-value=9.4  Score=23.92  Aligned_cols=29  Identities=21%  Similarity=0.132  Sum_probs=18.7

Q ss_pred             CcEEEEEEEEEECCCCCCeeeEEEEEEEE
Q psy669           13 KTHYWLTVFAQDHGVVPLHARTDVYIQVQ   41 (248)
Q Consensus        13 ~~~y~l~v~a~D~~~~~~~~~~~v~i~V~   41 (248)
                      ...|.|.|+|.|..+.......++.|.|.
T Consensus        38 ~G~Y~l~V~a~~~~~~~~~~~~~l~i~I~   66 (66)
T PF07495_consen   38 PGKYTLEVRAKDNNGKWSSDEKSLTITIL   66 (66)
T ss_dssp             SEEEEEEEEEEETTS-B-SS-EEEEEEEE
T ss_pred             CEEEEEEEEEECCCCCcCcccEEEEEEEC
Confidence            35699999999977554444467777663


No 26 
>KOG3597|consensus
Probab=66.61  E-value=88  Score=28.27  Aligned_cols=59  Identities=20%  Similarity=0.229  Sum_probs=44.5

Q ss_pred             eeEEEEEEEEecCCCCCccCCCceEEEEECCCCCCeeEEEEEEEeCCCCCCceEEEEEec
Q psy669           32 ARTDVYIQVQNVNDNVPMTLDPVYYAQIVENQSGILPIVQLAASDGDLDPDQRISYKISA   91 (248)
Q Consensus        32 ~~~~v~i~V~DvNDn~P~f~~~~~~~~v~E~~~~g~~v~~v~a~D~D~~~~~~i~y~i~~   91 (248)
                      .+..+.|.|..+||.+..+....+..-+.|+...-.....+.+.|+|.. -..+.|++..
T Consensus        24 ~~~~~~i~v~pvndpp~~~~~~~~~l~~~~~~~k~l~~~~l~~~d~d~~-~~~l~f~v~~   82 (442)
T KOG3597|consen   24 QTDVLRIHVNPVNDPPSLIFPSGSLLVILEGGQKVLDPELLTAADPDSA-PLPLEFQVLG   82 (442)
T ss_pred             EEeeecccccccCCCcceeecccceEEeecCCceeccceEeeccCCCCC-ccceEEEEcc
Confidence            4666789999999999888777777888888655444567888888865 3457788864


No 27 
>PF13750 Big_3_3:  Bacterial Ig-like domain (group 3)
Probab=64.63  E-value=61  Score=24.71  Aligned_cols=14  Identities=36%  Similarity=0.399  Sum_probs=9.5

Q ss_pred             cceeEEEE-EEEeCC
Q psy669          119 QAEHVLEV-TVTDDG  132 (248)
Q Consensus       119 ~~~~~l~v-~a~d~~  132 (248)
                      ...|.+.+ .|.|..
T Consensus        14 dG~Y~l~~~~a~D~a   28 (158)
T PF13750_consen   14 DGSYTLTVVTATDAA   28 (158)
T ss_pred             CccEEEEEEEEEecC
Confidence            45577777 677754


No 28 
>PF03160 Calx-beta:  Calx-beta domain;  InterPro: IPR003644 The calx-beta motif is present as a tandem repeat in the cytoplasmic domains of Calx Na-Ca exchangers, which are used to expel calcium from cells. This motif overlaps domains used for calcium binding and regulation. The calx-beta motif is also present in the cytoplasmic tail of mammalian integrin-beta4, which mediates the bi-directional transfer of signals across the plasma membrane, as well as in some cyanobacterial proteins. This motif contains a series of beta-strands and turns that form a self-contained beta-sheet [, ].; GO: 0007154 cell communication, 0016021 integral to membrane; PDB: 3H6A_B 3FSO_A 3FQ4_B 2DPK_A 2QVM_A 3GIN_B 2QVK_A 2FWU_A 2FWS_A 3E9U_A ....
Probab=63.54  E-value=45  Score=22.79  Aligned_cols=52  Identities=17%  Similarity=0.206  Sum_probs=28.6

Q ss_pred             EEEEEecCCCCCccCCCceEEEEECCCCCCeeEEEEEEEeCCCCCCceEEEEEecC
Q psy669           37 YIQVQNVNDNVPMTLDPVYYAQIVENQSGILPIVQLAASDGDLDPDQRISYKISAG   92 (248)
Q Consensus        37 ~i~V~DvNDn~P~f~~~~~~~~v~E~~~~g~~v~~v~a~D~D~~~~~~i~y~i~~~   92 (248)
                      +|+|.| ||.+ .+....-...+.|+.  |..-..|.-...+....-.+.|....+
T Consensus         2 tvtI~d-~d~~-~v~f~~~~~~v~E~~--~~~~v~V~~~~~~~~~~v~v~~~~~~g   53 (100)
T PF03160_consen    2 TVTILD-DDDP-TVSFSSPSYTVSEGD--GTVTVTVTRSGGSLDGPVTVNYSTVDG   53 (100)
T ss_dssp             EEEEE--TTSE-EEEESSSEEEEETTS--SEEEEEEEEESS-TSSEEEEEEEEEES
T ss_pred             EEEEEC-CCCC-EEEEeCCEEEEEeCC--CEEEEEEEEcccCCCcceEEEEEEeCC
Confidence            467777 5555 766666667888885  334444444433322234556666544


No 29 
>PF10365 DUF2436:  Domain of unknown function (DUF2436);  InterPro: IPR018832  Gingipains R and K are endopeptidases with specificity for arginyl and lysyl bonds, respectively. Like other cysteine peptidases, they require reducing conditions for activity. They are maximally active at approximately neutral pH. Gingipains R and K are secreted by the bacterium Porphyromonas gingivalis (Bacteroides gingivalis). The bacterium is a major pathogen in periodontal disease, and the many ways in which the activities of the gingipains may contribute to the disease processes have been reviewed []. These enzymes are also involved in the hemagglutinating activity of the organisms.  This entry represents a central region found in gingipain K peptidases, active on lysyl bonds; they belong to the MEROPS peptidase family C25 (gingipain family, clan CD).  
Probab=55.38  E-value=80  Score=23.57  Aligned_cols=88  Identities=23%  Similarity=0.200  Sum_probs=50.1

Q ss_pred             CCCCCccCCCceEEEEECCCCCCeeEEEEEEEeCCC---CCCceEEEEEecCCCCCcEEEeCCCeEEEEecCCCCccccc
Q psy669           44 NDNVPMTLDPVYYAQIVENQSGILPIVQLAASDGDL---DPDQRISYKISAGNPESYFNIDIGSGLISTTGRKLDRENQA  120 (248)
Q Consensus        44 NDn~P~f~~~~~~~~v~E~~~~g~~v~~v~a~D~D~---~~~~~i~y~i~~~~~~~~F~id~~tg~i~~~~~~ld~e~~~  120 (248)
                      |+++|.=.-..|+..|++|+-+-..-..+ ..|...   -+.+..-|.|....+.....|-...|.=..-....-+|..+
T Consensus        66 n~~~pa~ly~~FEYkiP~NADps~tpq~m-v~dG~~~i~IPaG~YDy~I~~P~~~~kiwIaGd~g~~~tr~dDy~fEAGK  144 (161)
T PF10365_consen   66 NCNVPANLYDPFEYKIPANADPSTTPQNM-VVDGEASIDIPAGTYDYCIAAPQPGGKIWIAGDGGDGPTRGDDYVFEAGK  144 (161)
T ss_pred             CCCCChhhcccceEeccCCCCCccCcceE-EecCceEEEecCceeEEEEecCCCCCeEEEecCCCCCCccccceEEecCC
Confidence            56677555567788889887653221111 111111   12456678887666656666655543220111123458888


Q ss_pred             eeEEEEEEEeCC
Q psy669          121 EHVLEVTVTDDG  132 (248)
Q Consensus       121 ~~~l~v~a~d~~  132 (248)
                      .|.+++.....|
T Consensus       145 tY~ftm~~~g~g  156 (161)
T PF10365_consen  145 TYRFTMKRVGSG  156 (161)
T ss_pred             EEEEEEEeccCC
Confidence            999999887655


No 30 
>cd02848 Chitinase_N_term Chitinase N-terminus domain. Chitinases hydrolyze the abundant natural biopolymer chitin, producing smaller chito-oligosaccharides. Chitin consists of multiple N-acetyl-D-glucosamine (NAG) residues connected via beta-1,4-glycosidic linkages and is an important structural element of fungal cell wall and arthropod exoskeletons. On the basis of the mode of chitin hydrolysis, chitinases are classified as random, endo-, and exo-chitinases and based on sequence criteria, chitinases belong to families 18 and 19 of glycosyl hydrolases.  The N-terminus of chitinase may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at  either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitob
Probab=52.71  E-value=42  Score=23.81  Aligned_cols=33  Identities=12%  Similarity=0.156  Sum_probs=22.0

Q ss_pred             CCCccccceeEEEEEEEeCCCCCCCceEEEEEEEEE
Q psy669          113 KLDRENQAEHVLEVTVTDDGAPAPLSSTTRVIISVE  148 (248)
Q Consensus       113 ~ld~e~~~~~~l~v~a~d~~~p~~~~~~~~v~I~V~  148 (248)
                      .+++.+...|.+.|.++|...   -+..+.+.|.|-
T Consensus        73 t~~v~kgG~y~m~V~lCn~dG---CS~S~~~~I~VA  105 (106)
T cd02848          73 TFKVGKGGRYQMQVALCNGDG---CSTSAAKEIVVA  105 (106)
T ss_pred             EEEeCCCCeEEEEEEEECCCC---ccCcCCEEEEec
Confidence            355777889999999998653   233445555554


No 31 
>PF12245 Big_3_2:  Bacterial Ig-like domain (group 3);  InterPro: IPR022038  This family of proteins is found in bacteria. They have two conserved sequence motifs: AGN and GMT. 
Probab=52.34  E-value=55  Score=20.34  Aligned_cols=30  Identities=23%  Similarity=0.210  Sum_probs=20.6

Q ss_pred             CcEEEEEEEEEECCCCCCeeeEEEEEEEEecC
Q psy669           13 KTHYWLTVFAQDHGVVPLHARTDVYIQVQNVN   44 (248)
Q Consensus        13 ~~~y~l~v~a~D~~~~~~~~~~~v~i~V~DvN   44 (248)
                      ...|.|.+.|.|..+  ..+.......+.|..
T Consensus        22 dg~yt~~v~a~D~AG--N~~~~~~~~~i~d~~   51 (60)
T PF12245_consen   22 DGEYTLTVTATDKAG--NTSSSTTQIVIVDNT   51 (60)
T ss_pred             CccEEEEEEEEECCC--CEEEeeeEEEEEcCC
Confidence            567999999999874  445555555555553


No 32 
>PF08758 Cadherin_pro:  Cadherin prodomain like;  InterPro: IPR014868 Cadherins are a group of proteins that mediate calcium dependent cell-cell adhesion. They are activated through cleavage of a prosequence in the late Golgi. This protein corresponds to the folded region of the prosequence, and is termed the prodomain. The prodomain shows structural resemblance to the cadherin domain, but lacks all the features known to be important for cadherin-cadherin interactions []. ; GO: 0007155 cell adhesion, 0016021 integral to membrane; PDB: 1OP4_A.
Probab=49.52  E-value=13  Score=25.51  Aligned_cols=33  Identities=30%  Similarity=0.597  Sum_probs=14.2

Q ss_pred             CCeeccCeeEEEecCCCCCCeeEEEEEEEeCCC
Q psy669          154 APEFEQNFYHVNIPETRHRDEPLSQVSLISTTG  186 (248)
Q Consensus       154 ~P~f~~~~y~~~v~E~~~~g~~v~~v~A~D~D~  186 (248)
                      .|-|.+..|.+.|+.+...|..|++|.-.|=.+
T Consensus         3 ~pGF~~~~~~~~Vp~~l~~g~~lg~V~f~dC~~   35 (90)
T PF08758_consen    3 RPGFSQKKYTFEVPSNLEAGQPLGKVNFEDCTG   35 (90)
T ss_dssp             --B--S-EEEE----SS-SS--EEE---B--SS
T ss_pred             cCCcccceEEEEcCchhhCCcEEEEEEeccCCC
Confidence            478999999999999999999999998777654


No 33 
>smart00089 PKD Repeats in polycystic kidney disease 1 (PKD1) and other proteins. Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.
Probab=47.24  E-value=58  Score=20.96  Aligned_cols=28  Identities=18%  Similarity=0.111  Sum_probs=19.3

Q ss_pred             CccCcEEEEEEEEEECCCCCCeeeEEEEEEE
Q psy669           10 VETKTHYWLTVFAQDHGVVPLHARTDVYIQV   40 (248)
Q Consensus        10 ~E~~~~y~l~v~a~D~~~~~~~~~~~v~i~V   40 (248)
                      |+....|.+++.+.|..+   +.+..+.|.|
T Consensus        51 y~~~G~y~v~l~v~n~~g---~~~~~~~i~v   78 (79)
T smart00089       51 YTKPGTYTVTLTVTNAVG---SASATVTVVV   78 (79)
T ss_pred             eCCCcEEEEEEEEEcCCC---cEEEEEEEEE
Confidence            455667899999988764   4555665554


No 34 
>cd00146 PKD polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.
Probab=44.64  E-value=54  Score=21.22  Aligned_cols=30  Identities=20%  Similarity=0.114  Sum_probs=20.0

Q ss_pred             CCccccceeEEEEEEEeCCCCCCCceEEEEEEE
Q psy669          114 LDRENQAEHVLEVTVTDDGAPAPLSSTTRVIIS  146 (248)
Q Consensus       114 ld~e~~~~~~l~v~a~d~~~p~~~~~~~~v~I~  146 (248)
                      ..|.....|.+++.++|..   ..+....+.|.
T Consensus        51 ~~y~~~G~y~v~l~v~d~~---g~~~~~~~~V~   80 (81)
T cd00146          51 HTYTKPGTYTVTLTVTNAV---GSSSTKTTTVV   80 (81)
T ss_pred             EEcCCCcEEEEEEEEEeCC---CCEEEEEEEEE
Confidence            3466788899999999974   23344444444


No 35 
>cd00146 PKD polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.
Probab=44.19  E-value=66  Score=20.78  Aligned_cols=28  Identities=7%  Similarity=0.004  Sum_probs=18.0

Q ss_pred             CccCcEEEEEEEEEECCCCCCeeeEEEEEE
Q psy669           10 VETKTHYWLTVFAQDHGVVPLHARTDVYIQ   39 (248)
Q Consensus        10 ~E~~~~y~l~v~a~D~~~~~~~~~~~v~i~   39 (248)
                      |.+...|.+++.+.|..  ..+.+..+.|.
T Consensus        53 y~~~G~y~v~l~v~d~~--g~~~~~~~~V~   80 (81)
T cd00146          53 YTKPGTYTVTLTVTNAV--GSSSTKTTTVV   80 (81)
T ss_pred             cCCCcEEEEEEEEEeCC--CCEEEEEEEEE
Confidence            55667799999999875  23444444443


No 36 
>KOG4221|consensus
Probab=44.11  E-value=3.6e+02  Score=27.89  Aligned_cols=40  Identities=18%  Similarity=0.174  Sum_probs=30.3

Q ss_pred             CccCcEEEEEEEEEECCCCCCeeeEEEEEEEEecCCCCCc
Q psy669           10 VETKTHYWLTVFAQDHGVVPLHARTDVYIQVQNVNDNVPM   49 (248)
Q Consensus        10 ~E~~~~y~l~v~a~D~~~~~~~~~~~v~i~V~DvNDn~P~   49 (248)
                      .|....|.+.|.|....+.+.++...-.++..|+=+-+|.
T Consensus       581 L~k~TeY~~~vvA~N~~G~g~sS~~i~V~Tlsd~PsaPP~  620 (1381)
T KOG4221|consen  581 LEKYTEYSIRVVAYNSAGSGVSSADITVRTLSDVPSAPPQ  620 (1381)
T ss_pred             CCCccceEEEEEEecCCCCCCCCCceEEEeccCCCCCCCc
Confidence            4566779999999998888776666556677777766665


No 37 
>PF12245 Big_3_2:  Bacterial Ig-like domain (group 3);  InterPro: IPR022038  This family of proteins is found in bacteria. They have two conserved sequence motifs: AGN and GMT. 
Probab=42.65  E-value=82  Score=19.54  Aligned_cols=30  Identities=30%  Similarity=0.314  Sum_probs=20.8

Q ss_pred             cceeEEEEEEEeCCCCCCCceEEEEEEEEEecC
Q psy669          119 QAEHVLEVTVTDDGAPAPLSSTTRVIISVEDVN  151 (248)
Q Consensus       119 ~~~~~l~v~a~d~~~p~~~~~~~~v~I~V~d~N  151 (248)
                      ...|.+.+.|+|..   +..+.......+.|..
T Consensus        22 dg~yt~~v~a~D~A---GN~~~~~~~~~i~d~~   51 (60)
T PF12245_consen   22 DGEYTLTVTATDKA---GNTSSSTTQIVIVDNT   51 (60)
T ss_pred             CccEEEEEEEEECC---CCEEEeeeEEEEEcCC
Confidence            56799999999976   4455555555555543


No 38 
>TIGR01965 VCBS_repeat VCBS repeat. This domain of about 100 residues is found multiple (up to 35) copies in long proteins from several species of Vibrio, Colwellia, Bradyrhizobium, and Shewanella (hence the name VCBS) and in smaller copy numbers in proteins from several other bacteria. The large protein size and repeat copy numbers, species distribution, and suggested activities of several member proteins suggests a role for this domain in adhesion.
Probab=35.89  E-value=1e+02  Score=21.60  Aligned_cols=41  Identities=20%  Similarity=0.183  Sum_probs=27.5

Q ss_pred             cEEEEEEEEEECCCCCCeeeEEEEEEEEecCCCCCccCCCceEEEEECC
Q psy669           14 THYWLTVFAQDHGVVPLHARTDVYIQVQNVNDNVPMTLDPVYYAQIVEN   62 (248)
Q Consensus        14 ~~y~l~v~a~D~~~~~~~~~~~v~i~V~DvNDn~P~f~~~~~~~~v~E~   62 (248)
                      ..-+|++.+.|+      .+.+|.|+|.-.|| +|+..-. -...+.|+
T Consensus        57 ~tdsFtvtv~DG------tt~~vtItI~GtND-apvi~~~-~~g~v~ED   97 (99)
T TIGR01965        57 LTDTFTVTSADG------TSQTVTITITGAND-AAVIGGA-DTGSVTED   97 (99)
T ss_pred             EEEEEEEEEeCC------CeEEEEEEEEccCC-CCEEecc-cceeEecC
Confidence            345677778875      28889999999997 5544332 24566665


No 39 
>cd02848 Chitinase_N_term Chitinase N-terminus domain. Chitinases hydrolyze the abundant natural biopolymer chitin, producing smaller chito-oligosaccharides. Chitin consists of multiple N-acetyl-D-glucosamine (NAG) residues connected via beta-1,4-glycosidic linkages and is an important structural element of fungal cell wall and arthropod exoskeletons. On the basis of the mode of chitin hydrolysis, chitinases are classified as random, endo-, and exo-chitinases and based on sequence criteria, chitinases belong to families 18 and 19 of glycosyl hydrolases.  The N-terminus of chitinase may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at  either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitob
Probab=35.20  E-value=56  Score=23.19  Aligned_cols=30  Identities=13%  Similarity=0.142  Sum_probs=20.3

Q ss_pred             CCCCccCcEEEEEEEEEECCCCCCeeeEEE
Q psy669            7 ILDVETKTHYWLTVFAQDHGVVPLHARTDV   36 (248)
Q Consensus         7 ~LD~E~~~~y~l~v~a~D~~~~~~~~~~~v   36 (248)
                      .+++.+...|.++|+++|..+...++...|
T Consensus        73 t~~v~kgG~y~m~V~lCn~dGCS~S~~~~I  102 (106)
T cd02848          73 TFKVGKGGRYQMQVALCNGDGCSTSAAKEI  102 (106)
T ss_pred             EEEeCCCCeEEEEEEEECCCCccCcCCEEE
Confidence            355666677999999999876544444443


No 40 
>PF05895 DUF859:  Siphovirus protein of unknown function (DUF859);  InterPro: IPR008577 This entry is represented by Streptococcus phage 7201, Orf39. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This family consists of several uncharacterised proteins from a number of the Siphoviruses as well as some bacterial proteins from Streptococcus species. Some of the members of this family are described as putative minor structural proteins.
Probab=34.49  E-value=4.2e+02  Score=25.30  Aligned_cols=110  Identities=17%  Similarity=0.130  Sum_probs=57.8

Q ss_pred             CcEEEEEEEEEECCCCCCeeeEEEEEEEEecCCCCCccCCCceEEEEECCCCCCeeEEEEEEE------eCCCCCCceEE
Q psy669           13 KTHYWLTVFAQDHGVVPLHARTDVYIQVQNVNDNVPMTLDPVYYAQIVENQSGILPIVQLAAS------DGDLDPDQRIS   86 (248)
Q Consensus        13 ~~~y~l~v~a~D~~~~~~~~~~~v~i~V~DvNDn~P~f~~~~~~~~v~E~~~~g~~v~~v~a~------D~D~~~~~~i~   86 (248)
                      .....+++.++|..+ ..+.+.+..|+|++=  ++|.+....++..-.++    .......|.      +.-......+.
T Consensus       297 ~G~~Ti~atVtDSRG-r~S~~~~~tItVl~Y--~~P~lsfsv~R~~~~~~----~~~v~~~a~Iapl~v~g~qKN~~~lt  369 (624)
T PF05895_consen  297 SGSATIRATVTDSRG-RTSDPKTKTITVLEY--SPPTLSFSVYRCGSSGN----TLTVTRNAKIAPLTVNGVQKNTMTLT  369 (624)
T ss_pred             CceEEEEEEEEECCC-ccCCceEEEEEEEEc--CCCcEEEEEEEeCCCCc----EEEEEEEEEEeEEEEcccccceEEEE
Confidence            345889999999763 224567889999988  57877433333322222    111111221      11111123566


Q ss_pred             EEEecCCCCCcEEEeCCC--eEE----------EEecCCCCccccceeEEEEEEEeCC
Q psy669           87 YKISAGNPESYFNIDIGS--GLI----------STTGRKLDRENQAEHVLEVTVTDDG  132 (248)
Q Consensus        87 y~i~~~~~~~~F~id~~t--g~i----------~~~~~~ld~e~~~~~~l~v~a~d~~  132 (248)
                      |+...-. ...|.+|...  |..          ..+...  +.....|.+.+.++|.-
T Consensus       370 ~~~a~~g-t~~~t~d~~~a~~~~s~~s~~~~~~~~L~g~--y~~~kSy~V~~~l~D~F  424 (624)
T PF05895_consen  370 FKVAPLG-TGTFTTDNGSASGTWSSISELTNSSANLGGT--YDAEKSYDVRGTLSDKF  424 (624)
T ss_pred             EEEEEcC-cceEEEEccccccceeeeeeecccceeeccc--cCCCceEEEEEEEEEEe
Confidence            7665422 3456665431  111          022233  34556799999999964


No 41 
>KOG3597|consensus
Probab=32.93  E-value=66  Score=29.01  Aligned_cols=52  Identities=17%  Similarity=0.042  Sum_probs=38.6

Q ss_pred             eEEEEEEEEEecCCCCCeeccCeeEEEecCCCCCCeeEEEEEEEeCCCCCCC
Q psy669          139 STTRVIISVEDVNDNAPEFEQNFYHVNIPETRHRDEPLSQVSLISTTGRKLD  190 (248)
Q Consensus       139 ~~~~v~I~V~d~Nd~~P~f~~~~y~~~v~E~~~~g~~v~~v~A~D~D~~~ld  190 (248)
                      .+....|.|.-+||.+..+....+.+-+.|+...-.....+.+.|+|.+++.
T Consensus        24 ~~~~~~i~v~pvndpp~~~~~~~~~l~~~~~~~k~l~~~~l~~~d~d~~~~~   75 (442)
T KOG3597|consen   24 QTDVLRIHVNPVNDPPSLIFPSGSLLVILEGGQKVLDPELLTAADPDSAPLP   75 (442)
T ss_pred             EEeeecccccccCCCcceeecccceEEeecCCceeccceEeeccCCCCCccc
Confidence            3566788999999988777777777888887555555567888999975444


No 42 
>PF02494 HYR:  HYR domain;  InterPro: IPR003410 This domain is known as the HYR (Hyalin Repeat) domain, after the protein hyalin that is composed exclusively of this repeat. This domain probably corresponds to a new superfamily in the immunoglobulin fold. The function of this domain is uncertain it may be involved in cell adhesion. In the Sushi repeat-containing protein (SrpX), this domain is found between two sushi repeats.
Probab=32.90  E-value=97  Score=20.25  Aligned_cols=25  Identities=24%  Similarity=0.251  Sum_probs=19.5

Q ss_pred             ceeEEEEEEEeCCCCCCCceEEEEEEEE
Q psy669          120 AEHVLEVTVTDDGAPAPLSSTTRVIISV  147 (248)
Q Consensus       120 ~~~~l~v~a~d~~~p~~~~~~~~v~I~V  147 (248)
                      ..|.++..|+|..   +..+++.+.|+|
T Consensus        57 G~t~V~ytA~D~~---GN~a~C~f~V~V   81 (81)
T PF02494_consen   57 GTTTVTYTATDAA---GNSATCSFTVTV   81 (81)
T ss_pred             ceEEEEEEEEECC---CCEEEEEEEEEC
Confidence            4588999999975   566788888765


No 43 
>PF08329 ChitinaseA_N:  Chitinase A, N-terminal domain;  InterPro: IPR013540 This domain is found in a number of bacterial chitinases and similar viral proteins. It is organised into a fibronectin III module domain-like fold, comprising only beta strands. Its function is not known, but it may be involved in interaction with the enzyme substrate, chitin [, ]. It is separated by a hinge region from the catalytic domain (IPR001223 from INTERPRO); this hinge region is probably mobile, allowing the N-terminal domain to have different relative positions in solution []. ; GO: 0004568 chitinase activity; PDB: 2WLY_A 1EDQ_A 2WM0_A 1X6N_A 1NH6_A 2WK2_A 1EHN_A 2WLZ_A 1EIB_A 1FFR_A ....
Probab=30.35  E-value=2.3e+02  Score=21.03  Aligned_cols=37  Identities=14%  Similarity=0.210  Sum_probs=23.6

Q ss_pred             CCCccccceeEEEEEEEeCCCCCCCceEEEEEEEEEecCC
Q psy669          113 KLDRENQAEHVLEVTVTDDGAPAPLSSTTRVIISVEDVND  152 (248)
Q Consensus       113 ~ld~e~~~~~~l~v~a~d~~~p~~~~~~~~v~I~V~d~Nd  152 (248)
                      .+.......|.+.|.+++..   +-+....+.|.|.|..-
T Consensus        76 ~~~~~~gG~y~~~VeLCN~~---GCS~S~~~~V~VaDTDG  112 (133)
T PF08329_consen   76 TFTVTKGGRYQMQVELCNAD---GCSTSAPVEVVVADTDG  112 (133)
T ss_dssp             EEEE-S-EEEEEEEEEEETT---EEEE---EEEEEE-TTS
T ss_pred             EEEecCCCEEEEEEEEECCC---CcccCCCEEEEEeCCCc
Confidence            45566788899999999965   33556688889998754


No 44 
>COG4288 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=27.49  E-value=1.9e+02  Score=20.56  Aligned_cols=41  Identities=24%  Similarity=0.324  Sum_probs=27.7

Q ss_pred             eeEEEEEEEEecCCCCCccCCCceEEEEECCCCCCeeEEEEEEE
Q psy669           32 ARTDVYIQVQNVNDNVPMTLDPVYYAQIVENQSGILPIVQLAAS   75 (248)
Q Consensus        32 ~~~~v~i~V~DvNDn~P~f~~~~~~~~v~E~~~~g~~v~~v~a~   75 (248)
                      ...+++++..|.+|-.+.-....|..-+.-|.|+   |.|..-.
T Consensus        66 arrt~hlRfndL~dpe~iP~d~~yasviesnvPv---VvQ~tRL  106 (124)
T COG4288          66 ARRTLHLRFNDLGDPEAIPKDTPYASVIESNVPV---VVQLTRL  106 (124)
T ss_pred             cceeEEEEecccCCcccCCCCCchhhheecCCce---EEEEEEe
Confidence            4567888888998877777777777766666653   4444433


No 45 
>PF09100 Qn_am_d_aIV:  Quinohemoprotein amine dehydrogenase, alpha subunit domain IV;  InterPro: IPR015184 This domain is predominantly found in the prokaryotic protein quinohemoprotein amine dehydrogenase, adopting an immunoglobulin-like beta-sandwich fold, with seven strands arranged into two beta sheets; the fold is possibly related to the immunoglobulin and/or fibronectin type III superfamilies. The precise function of this domain has not, as yet, been defined []. ; PDB: 1JMZ_A 1JMX_A 1PBY_A 1JJU_A.
Probab=26.95  E-value=1.5e+02  Score=21.73  Aligned_cols=35  Identities=26%  Similarity=0.426  Sum_probs=17.9

Q ss_pred             ceeEEEEEEEeCCCCCCCceEEEEEEEEEecCCCCC
Q psy669          120 AEHVLEVTVTDDGAPAPLSSTTRVIISVEDVNDNAP  155 (248)
Q Consensus       120 ~~~~l~v~a~d~~~p~~~~~~~~v~I~V~d~Nd~~P  155 (248)
                      ..=+|.|.|+-...-..++..+.+.|+|.+-|+ +|
T Consensus        98 N~Gnl~VvAtv~d~~~~l~~e~~liVtVqr~~~-pp  132 (133)
T PF09100_consen   98 NAGNLKVVATVKDGGKPLTGEAHLIVTVQRWNN-PP  132 (133)
T ss_dssp             S-EEEEEEEEETTTT---EEEEEEEEE---S----S
T ss_pred             CcccEEEEEEEccCCcccceeEeEEEEeecccC-CC
Confidence            344677777654332267899999999988876 55


No 46 
>KOG4221|consensus
Probab=25.54  E-value=7.9e+02  Score=25.67  Aligned_cols=67  Identities=16%  Similarity=0.234  Sum_probs=42.6

Q ss_pred             EEEEEec--CCCCCcEEEeCCCeEEEEecCCCCccccceeEEEEEEEeCCCCCCCceEEEEEEEEEecCCCCCe
Q psy669           85 ISYKISA--GNPESYFNIDIGSGLISTTGRKLDRENQAEHVLEVTVTDDGAPAPLSSTTRVIISVEDVNDNAPE  156 (248)
Q Consensus        85 i~y~i~~--~~~~~~F~id~~tg~i~~~~~~ld~e~~~~~~l~v~a~d~~~p~~~~~~~~v~I~V~d~Nd~~P~  156 (248)
                      ..|++..  ++.+..+.++.++-.. ++.   +.+....|.+.|.|.....+ +.++...-.++..|+-+-||.
T Consensus       552 ~~yk~~ys~~~~~~~~~~~~n~~e~-ti~---gL~k~TeY~~~vvA~N~~G~-g~sS~~i~V~Tlsd~PsaPP~  620 (1381)
T KOG4221|consen  552 TGYKLFYSEDDTGKELRVENNATEY-TIN---GLEKYTEYSIRVVAYNSAGS-GVSSADITVRTLSDVPSAPPQ  620 (1381)
T ss_pred             eEEEEEEEcCCCCceEEEecCccEE-Eee---cCCCccceEEEEEEecCCCC-CCCCCceEEEeccCCCCCCCc
Confidence            3455542  2445688888776665 443   34667789999999997666 444433333455577776776


No 47 
>PF13754 Big_3_4:  Bacterial Ig-like domain (group 3)
Probab=20.60  E-value=1.9e+02  Score=17.38  Aligned_cols=18  Identities=28%  Similarity=0.222  Sum_probs=13.6

Q ss_pred             cCcEEEEEEEEEECCCCC
Q psy669           12 TKTHYWLTVFAQDHGVVP   29 (248)
Q Consensus        12 ~~~~y~l~v~a~D~~~~~   29 (248)
                      ....|.+++.|+|..+..
T Consensus        22 ~dG~y~itv~a~D~AGN~   39 (54)
T PF13754_consen   22 ADGTYTITVTATDAAGNT   39 (54)
T ss_pred             CCccEEEEEEEEeCCCCC
Confidence            356799999999976543


No 48 
>PF03160 Calx-beta:  Calx-beta domain;  InterPro: IPR003644 The calx-beta motif is present as a tandem repeat in the cytoplasmic domains of Calx Na-Ca exchangers, which are used to expel calcium from cells. This motif overlaps domains used for calcium binding and regulation. The calx-beta motif is also present in the cytoplasmic tail of mammalian integrin-beta4, which mediates the bi-directional transfer of signals across the plasma membrane, as well as in some cyanobacterial proteins. This motif contains a series of beta-strands and turns that form a self-contained beta-sheet [, ].; GO: 0007154 cell communication, 0016021 integral to membrane; PDB: 3H6A_B 3FSO_A 3FQ4_B 2DPK_A 2QVM_A 3GIN_B 2QVK_A 2FWU_A 2FWS_A 3E9U_A ....
Probab=20.47  E-value=2.8e+02  Score=18.65  Aligned_cols=35  Identities=11%  Similarity=0.123  Sum_probs=18.2

Q ss_pred             EEEEEecCCCCCeeccCeeEEEecCCCCCCeeEEEEEEE
Q psy669          144 IISVEDVNDNAPEFEQNFYHVNIPETRHRDEPLSQVSLI  182 (248)
Q Consensus       144 ~I~V~d~Nd~~P~f~~~~y~~~v~E~~~~g~~v~~v~A~  182 (248)
                      +|.|.| ||.+ .+.-..-...+.|+.  |..-..|.-.
T Consensus         2 tvtI~d-~d~~-~v~f~~~~~~v~E~~--~~~~v~V~~~   36 (100)
T PF03160_consen    2 TVTILD-DDDP-TVSFSSPSYTVSEGD--GTVTVTVTRS   36 (100)
T ss_dssp             EEEEE--TTSE-EEEESSSEEEEETTS--SEEEEEEEEE
T ss_pred             EEEEEC-CCCC-EEEEeCCEEEEEeCC--CEEEEEEEEc
Confidence            466777 5644 655444456677763  4344444433


Done!