Query         psy11061
Match_columns 212
No_of_seqs    141 out of 1430
Neff          9.7 
Searched_HMMs 46136
Date          Fri Aug 16 17:05:12 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy11061.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/11061hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG4289|consensus              100.0 5.3E-39 1.1E-43  282.8  18.4  207    2-210   196-408 (2531)
  2 KOG4289|consensus              100.0 4.6E-38   1E-42  276.9  21.6  207    1-210   303-512 (2531)
  3 KOG1219|consensus              100.0 5.3E-34 1.2E-38  258.4  22.0  207    1-212   987-1197(4289)
  4 KOG1219|consensus              100.0 5.1E-33 1.1E-37  252.1  22.3  204    1-211  2109-2316(4289)
  5 cd00031 CA Cadherin repeat dom 100.0 2.2E-28 4.7E-33  185.9  24.4  165    1-168    31-199 (199)
  6 cd00031 CA Cadherin repeat dom  99.9 2.3E-23 5.1E-28  158.1  19.2  132   76-209     2-135 (199)
  7 PF00028 Cadherin:  Cadherin do  99.8 3.8E-17 8.2E-22  109.3  14.4   90   76-167     1-93  (93)
  8 KOG1834|consensus               99.8 6.6E-17 1.4E-21  135.8  17.1  158    3-169    71-245 (952)
  9 smart00112 CA Cadherin repeats  99.7 1.1E-15 2.3E-20   99.0  10.8   77   96-174     1-79  (79)
 10 smart00112 CA Cadherin repeats  99.6 4.1E-15 8.9E-20   96.3   9.7   68    1-69     10-79  (79)
 11 KOG1834|consensus               99.4 6.3E-12 1.4E-16  106.3  13.5  148   58-211    20-184 (952)
 12 PF00028 Cadherin:  Cadherin do  99.2 1.1E-10 2.4E-15   77.8   9.4   61    1-62     30-93  (93)
 13 PF08758 Cadherin_pro:  Cadheri  97.5  0.0025 5.4E-08   41.9   9.9   79   68-155     3-82  (90)
 14 smart00736 CADG Dystroglycan-t  97.2   0.015 3.3E-07   38.7  11.0   69   96-171    25-96  (97)
 15 PF08266 Cadherin_2:  Cadherin-  97.2  0.0007 1.5E-08   43.9   4.2   60   75-137     2-63  (84)
 16 TIGR01965 VCBS_repeat VCBS rep  97.1   0.016 3.5E-07   38.6  10.2   88   91-189     2-98  (99)
 17 PF13750 Big_3_3:  Bacterial Ig  97.0   0.075 1.6E-06   38.8  15.9  127   36-167    14-148 (158)
 18 smart00736 CADG Dystroglycan-t  97.0   0.013 2.9E-07   39.0   9.3   63    2-66     32-96  (97)
 19 PF08266 Cadherin_2:  Cadherin-  96.2   0.016 3.4E-07   37.6   5.0   31    4-35     36-66  (84)
 20 PF05345 He_PIG:  Putative Ig d  94.2    0.35 7.6E-06   27.8   6.1   37  115-153    12-49  (49)
 21 PF07495 Y_Y_Y:  Y_Y_Y domain;   93.8    0.86 1.9E-05   27.5   8.4   59  103-166     7-65  (66)
 22 PF08758 Cadherin_pro:  Cadheri  92.4       1 2.2E-05   29.5   6.8   37   14-52     46-82  (90)
 23 PF05345 He_PIG:  Putative Ig d  92.0     1.1 2.4E-05   25.7   5.9   46    3-50      1-49  (49)
 24 TIGR01965 VCBS_repeat VCBS rep  88.1       6 0.00013   26.4   8.2   65   12-83     25-97  (99)
 25 TIGR00845 caca sodium/calcium   86.9      20 0.00044   33.7  12.7   49   64-114   395-443 (928)
 26 TIGR03660 T1SS_rpt_143 T1SS-14  86.2     5.2 0.00011   28.5   6.9   56   24-83     69-127 (137)
 27 PF07495 Y_Y_Y:  Y_Y_Y domain;   84.6     6.5 0.00014   23.5   6.7   53    3-61     11-65  (66)
 28 PF12245 Big_3_2:  Bacterial Ig  82.5     7.5 0.00016   23.2   5.6   31   36-66     22-52  (60)
 29 PF05895 DUF859:  Siphovirus pr  79.7      47   0.001   30.0  13.1  109   36-153   297-425 (624)
 30 KOG4221|consensus               79.4      21 0.00045   34.5   9.6   56   12-70    563-620 (1381)
 31 KOG3597|consensus               76.4      49  0.0011   28.5  14.4   97   53-152    24-135 (442)
 32 TIGR00845 caca sodium/calcium   75.5      75  0.0016   30.1  16.4   56   55-114   515-570 (928)
 33 cd02848 Chitinase_N_term Chiti  75.5     6.1 0.00013   26.7   4.0   33   31-63     74-106 (106)
 34 PF13754 Big_3_4:  Bacterial Ig  75.0      14  0.0003   21.5   5.4   19  138-156    22-40  (54)
 35 PF08329 ChitinaseA_N:  Chitina  74.4      28  0.0006   24.6   7.3   32  138-171    81-112 (133)
 36 PF12245 Big_3_2:  Bacterial Ig  73.1      17 0.00036   21.7   5.2   31  138-170    21-51  (60)
 37 PF03160 Calx-beta:  Calx-beta   69.4      29 0.00062   22.7   8.3   53   58-114     2-54  (100)
 38 PF13753 SWM_repeat:  Putative   68.7      62  0.0013   26.3  12.4  110   36-157    11-128 (317)
 39 cd00146 PKD polycystic kidney   68.7      15 0.00033   22.8   4.6   30   31-60     51-80  (81)
 40 TIGR03660 T1SS_rpt_143 T1SS-14  66.1      45 0.00097   23.7   7.2   43  141-189    86-128 (137)
 41 PF02494 HYR:  HYR domain;  Int  65.2      16 0.00035   22.9   4.2   26   36-61     56-81  (81)
 42 cd02848 Chitinase_N_term Chiti  65.1      27 0.00059   23.6   5.3   29  138-168    78-106 (106)
 43 smart00089 PKD Repeats in poly  62.6      26 0.00055   21.6   4.8   30   31-61     49-78  (79)
 44 PF13750 Big_3_3:  Bacterial Ig  62.4      25 0.00054   25.6   5.2   30   33-62    119-148 (158)
 45 smart00089 PKD Repeats in poly  61.2      25 0.00054   21.7   4.6   28  136-166    51-78  (79)
 46 cd00146 PKD polycystic kidney   56.9      26 0.00056   21.7   4.1   20  136-155    53-72  (81)
 47 KOG4221|consensus               56.8 2.1E+02  0.0045   28.3  12.8   67  106-175   552-620 (1381)
 48 PF08329 ChitinaseA_N:  Chitina  47.4      25 0.00054   24.9   3.0   37   30-66     76-112 (133)
 49 TIGR00864 PCC polycystin catio  39.9 5.4E+02   0.012   28.2  14.4   30   32-62   1479-1508(2740)
 50 COG4288 Uncharacterized protei  33.6 1.5E+02  0.0033   20.1   5.8   60   36-98     42-108 (124)
 51 KOG3597|consensus               28.0   1E+02  0.0022   26.6   4.2   50  158-207    24-73  (442)
 52 PF14157 YmzC:  YmzC-like prote  27.8      44 0.00096   20.2   1.5   18   13-30     41-58  (63)
 53 PF13860 FlgD_ig:  FlgD Ig-like  26.5 1.7E+02  0.0037   18.3   4.8   15  138-152    67-81  (81)
 54 COG1470 Predicted membrane pro  25.0 4.8E+02    0.01   22.9  13.8   72  125-203   386-470 (513)
 55 cd04046 C2_Calpain C2 domain p  24.7 2.3E+02   0.005   19.2   6.6   17  186-202   107-123 (126)
 56 PF09100 Qn_am_d_aIV:  Quinohem  23.3 1.3E+02  0.0028   21.1   3.3   32   37-69    100-132 (133)
 57 PF07145 PAM2:  Ataxin-2 C-term  23.0      51  0.0011   14.6   0.9   11   64-74      6-16  (18)
 58 TIGR03769 P_ac_wall_RPT actino  23.0 1.4E+02  0.0031   16.2   3.7   14  136-149     9-22  (41)
 59 cd01758 PLAT_LPL PLAT/ LH2 dom  21.2 3.1E+02  0.0067   19.4   8.3   66   38-108     2-68  (137)
 60 PF15418 DUF4625:  Domain of un  21.2 3.1E+02  0.0066   19.3   8.7   19  138-156   105-123 (132)
 61 KOG3865|consensus               20.9 4.5E+02  0.0097   21.8   6.3   63   14-80    186-251 (402)

No 1  
>KOG4289|consensus
Probab=100.00  E-value=5.3e-39  Score=282.79  Aligned_cols=207  Identities=29%  Similarity=0.453  Sum_probs=196.2

Q ss_pred             eEEEEcC--CCCCcCceEEeCCceEEEECCcCCcCCCCeEEEEEEEEeCCC--CceEEEEEEEEeecCCCCCccCCCceE
Q psy11061          2 VNYTLGE--SPSRTNHFYMKSVSGEICIAQDLDFESRSSYEFPVVATDRGG--LSTTAMVRIQITDVNDNEPIFNPVEYN   77 (212)
Q Consensus         2 i~Y~i~~--~~~~~~~F~id~~tg~i~~~~~lD~E~~~~~~l~i~a~d~~~--~~~~~~v~i~V~d~nd~~P~f~~~~~~   77 (212)
                      +.|++..  ++.+.++|+||+.+|.|++++.||||..+.+.|.|.|+|.+.  .+++++++|.|.|.|||+|+|.+..|.
T Consensus       196 l~Ysm~al~dsRS~~lFslD~~sG~irta~~lDREt~e~HvlrVtA~d~~~P~~SAtttv~V~V~D~nDhsPvFEq~~Y~  275 (2531)
T KOG4289|consen  196 LYYSMVALFDSRSQNLFSLDPMSGAIRTAKSLDRETKETHVLRVTAQDHGDPRRSATTTVTVLVLDTNDHSPVFEQDEYR  275 (2531)
T ss_pred             eEEEeeeccchhccccEeeccccccchhhhhhhhhhhheeEEEEEeeecCCCcccceeEEEEEEeecCCCCcccchhHHH
Confidence            6788833  335679999999999999999999999999999999999983  789999999999999999999999999


Q ss_pred             EEEecCCCCCcEEEEEEEEeCCCCCCceEEEEEEcCCCCCcEEEeCCccEEEEeeecCCC--CCCeEEEEEEEEECCCCC
Q psy11061         78 VSLRDDIQTTTAFAVVLATDRDSDRFGTISYKIVTGKDANLFRIDRSSGELFVTRGNFLS--RSNSYHINVSAMDGGGNK  155 (212)
Q Consensus        78 ~~v~e~~~~g~~v~~~~a~D~D~~~~~~i~y~i~~~~~~~~F~id~~~G~i~~~~~~~ld--~~~~~~l~v~a~D~~~~~  155 (212)
                      -++.|+.++|..|+++.|+|.|.++|+.|.|++..|++...|.|++.+|.|++..+  +|  ....|.|.|.|+|+|.++
T Consensus       276 e~lREn~evGy~vLtvrAtD~Dsp~Nani~Yrl~eg~~~~~f~in~rSGvI~T~a~--lDRE~~~~y~L~VeAsDqG~~p  353 (2531)
T KOG4289|consen  276 EELRENLEVGYEVLTVRATDGDSPPNANIRYRLLEGNAKNVFEINPRSGVISTRAP--LDREELESYQLDVEASDQGRPP  353 (2531)
T ss_pred             HHHhhccccCceEEEEEeccCCCCCCCceEEEecCCCccceeEEcCccceeeccCc--cCHHhhhheEEEEEeccCCCCC
Confidence            99999999999999999999999999999999999988899999999999999999  88  899999999999999887


Q ss_pred             eeeeEEEEEEEeeCCCCCCeEecCceEEEEeCCCCCCeEEEEEEEEeCCCCCCee
Q psy11061        156 CSQDAQVNINIINSHMPIPLFQQSTYSFVVPEDVFKNSIVGTIKAATSDSGKFTE  210 (212)
Q Consensus       156 ~~~~~~v~I~v~d~nd~~P~f~~~~~~~~v~e~~~~g~~v~~v~a~D~D~~~~~~  210 (212)
                      ...+++|.|+|.|+|||+|+|....|.+.|.|+..++++|++|+|+|+|-|.|+.
T Consensus       354 gp~Ta~V~itV~D~NDNaPqFse~~Yvvqv~Edvt~~avvlrV~AtDrD~g~Ng~  408 (2531)
T KOG4289|consen  354 GPRTAMVEITVEDENDNAPQFSEKRYVVQVREDVTPPAVVLRVTATDRDKGTNGK  408 (2531)
T ss_pred             CCceEEEEEEEEecCCCCccccccceEEEecccCCCCceEEEEEecccCCCcCce
Confidence            7779999999999999999999999999999999999999999999999999884


No 2  
>KOG4289|consensus
Probab=100.00  E-value=4.6e-38  Score=276.90  Aligned_cols=207  Identities=32%  Similarity=0.479  Sum_probs=196.7

Q ss_pred             CeEEEEcCCCCCcCceEEeCCceEEEECCcCCcCCCCeEEEEEEEEeCCC--CceEEEEEEEEeecCCCCCccCCCceEE
Q psy11061          1 MVNYTLGESPSRTNHFYMKSVSGEICIAQDLDFESRSSYEFPVVATDRGG--LSTTAMVRIQITDVNDNEPIFNPVEYNV   78 (212)
Q Consensus         1 ~i~Y~i~~~~~~~~~F~id~~tg~i~~~~~lD~E~~~~~~l~i~a~d~~~--~~~~~~v~i~V~d~nd~~P~f~~~~~~~   78 (212)
                      .|+|++.+++ ..+.|+|++.+|.|++..+||||+.+.|.|.|.|+|.|.  ...+++|.|.|.|.|||+|+|....|.+
T Consensus       303 ni~Yrl~eg~-~~~~f~in~rSGvI~T~a~lDRE~~~~y~L~VeAsDqG~~pgp~Ta~V~itV~D~NDNaPqFse~~Yvv  381 (2531)
T KOG4289|consen  303 NIRYRLLEGN-AKNVFEINPRSGVISTRAPLDREELESYQLDVEASDQGRPPGPRTAMVEITVEDENDNAPQFSEKRYVV  381 (2531)
T ss_pred             ceEEEecCCC-ccceeEEcCccceeeccCccCHHhhhheEEEEEeccCCCCCCCceEEEEEEEEecCCCCccccccceEE
Confidence            4899998886 568999999999999999999999999999999999983  3458999999999999999999999999


Q ss_pred             EEecCCCCCcEEEEEEEEeCCCCCCceEEEEEEcCCCCCcEEEeCCccEEEEeeecCCC-CCCeEEEEEEEEECCCCCee
Q psy11061         79 SLRDDIQTTTAFAVVLATDRDSDRFGTISYKIVTGKDANLFRIDRSSGELFVTRGNFLS-RSNSYHINVSAMDGGGNKCS  157 (212)
Q Consensus        79 ~v~e~~~~g~~v~~~~a~D~D~~~~~~i~y~i~~~~~~~~F~id~~~G~i~~~~~~~ld-~~~~~~l~v~a~D~~~~~~~  157 (212)
                      .|.|+..+++.|++++|+|.|.+.|+.+.|+|.+|+.-+.|.||..+|++.+..+  || +...|+++|+|.|++-|+.+
T Consensus       382 qv~Edvt~~avvlrV~AtDrD~g~Ng~VHYsi~Sgn~~G~f~id~~tGel~vv~p--lD~e~~~ytl~IrAqDggrPpLs  459 (2531)
T KOG4289|consen  382 QVREDVTPPAVVLRVTATDRDKGTNGKVHYSIASGNGRGQFYIDSLTGELDVVEP--LDFENSEYTLRIRAQDGGRPPLS  459 (2531)
T ss_pred             EecccCCCCceEEEEEecccCCCcCceEEEEeeccCccccEEEecccceEEEecc--ccccCCeeEEEEEcccCCCCCcc
Confidence            9999999999999999999999999999999999999999999999999999999  99 77799999999999999988


Q ss_pred             eeEEEEEEEeeCCCCCCeEecCceEEEEeCCCCCCeEEEEEEEEeCCCCCCee
Q psy11061        158 QDAQVNINIINSHMPIPLFQQSTYSFVVPEDVFKNSIVGTIKAATSDSGKFTE  210 (212)
Q Consensus       158 ~~~~v~I~v~d~nd~~P~f~~~~~~~~v~e~~~~g~~v~~v~a~D~D~~~~~~  210 (212)
                      .+.-+.|+|.|+|||+|.|...++..+|-|+.+.|..++.+.|.|.|+|.++.
T Consensus       460 n~sgl~iqVlDINDhaPifvstpfq~tvlEnv~lg~~v~~vqaidadsg~na~  512 (2531)
T KOG4289|consen  460 NTSGLVIQVLDINDHAPIFVSTPFQATVLENVPLGYLVCHVQAIDADSGENAR  512 (2531)
T ss_pred             CCCceEEEEEecCCCCceeEechhhhhhhhcccccceEEEEecccCCCCcccc
Confidence            88888899999999999999999999999999999999999999999999874


No 3  
>KOG1219|consensus
Probab=100.00  E-value=5.3e-34  Score=258.39  Aligned_cols=207  Identities=28%  Similarity=0.393  Sum_probs=195.6

Q ss_pred             CeEEEEcCCCCCcCceEEeCCceEEEECCcCCcCCCCeEEEEEEEEeCCC--CceEEEEEEEEeecCCCCCccCCCceEE
Q psy11061          1 MVNYTLGESPSRTNHFYMKSVSGEICIAQDLDFESRSSYEFPVVATDRGG--LSTTAMVRIQITDVNDNEPIFNPVEYNV   78 (212)
Q Consensus         1 ~i~Y~i~~~~~~~~~F~id~~tg~i~~~~~lD~E~~~~~~l~i~a~d~~~--~~~~~~v~i~V~d~nd~~P~f~~~~~~~   78 (212)
                      .|.|+|..++ ..+.|.|+..+|.|++.++||||....|-|++.|.|.|.  +++.+.+.|.|.|+|||+|+|....|..
T Consensus       987 ~l~Y~I~~gd-g~g~FsId~~tG~irTl~~lDrE~ks~YwltveA~D~gt~~~ssv~~vyI~ieDvNDn~Pq~s~pvy~a 1065 (4289)
T KOG1219|consen  987 ELSYKIRTGD-GDGIFSIDSTTGSIRTLKALDREKKSSYWLTVEAKDLGTVPLSSVCEVYIEIEDVNDNVPQFSSPVYYA 1065 (4289)
T ss_pred             eEEEEEEcCC-cceeEEecCCcceEeechhhchhhcceEEEEEEEEecCCCccccceeEEEEEEecCCCCcccCCceEee
Confidence            3789997775 468999999999999999999999999999999999984  7889999999999999999999999999


Q ss_pred             EEecCCCCCcEEEEEEEEeCCCCCCceEEEEEEcCCCCCcEEEeCCccEEEEeeecCCC--CCCeEEEEEEEEECCCCCe
Q psy11061         79 SLRDDIQTTTAFAVVLATDRDSDRFGTISYKIVTGKDANLFRIDRSSGELFVTRGNFLS--RSNSYHINVSAMDGGGNKC  156 (212)
Q Consensus        79 ~v~e~~~~g~~v~~~~a~D~D~~~~~~i~y~i~~~~~~~~F~id~~~G~i~~~~~~~ld--~~~~~~l~v~a~D~~~~~~  156 (212)
                      +|.|+++.+..+.++.|.|+|...++++.|+|.+|++.+.|.||+.+|.|+..+.  ||  ++..+.|.|.+.|.|.|+.
T Consensus      1066 sI~enSp~~vsivq~ea~D~Dsssn~kLmykI~sGnyq~FF~Id~~TG~iTt~r~--LDRE~qdEHiLeVTi~D~gep~l 1143 (4289)
T KOG1219|consen 1066 SISENSPETVSIVQAEANDPDSSSNQKLMYKITSGNYQGFFQIDPETGLITTIRR--LDREKQDEHILEVTIQDNGEPWL 1143 (4289)
T ss_pred             eeccCCCCceEEEEeccCCCCcccCcceEEEEccCCccceEEEccccceeeeehh--hcccccccceEEEEEecCCCCcc
Confidence            9999999999999999999998889999999999999999999999999998888  88  8999999999999999999


Q ss_pred             eeeEEEEEEEeeCCCCCCeEecCceEEEEeCCCCCCeEEEEEEEEeCCCCCCeecC
Q psy11061        157 SQDAQVNINIINSHMPIPLFQQSTYSFVVPEDVFKNSIVGTIKAATSDSGKFTEFS  212 (212)
Q Consensus       157 ~~~~~v~I~v~d~nd~~P~f~~~~~~~~v~e~~~~g~~v~~v~a~D~D~~~~~~~s  212 (212)
                      ++.+.|.|.|.|+|||+|.|.+..+...|+|...+  .++++.|.|.|+|.++.++
T Consensus      1144 ~s~~rviV~IldvNdnsp~Flqk~~~~~v~~r~s~--plyRl~a~d~DeG~narit 1197 (4289)
T KOG1219|consen 1144 CSNQRVIVSILDVNDNSPRFLQKKTFLRVPERSSP--PLYRLAAQDNDEGNNARIT 1197 (4289)
T ss_pred             ccceEEEEEEeeccCCchhhhhheeEEEeeeccCC--ceeEEEEEecCCCcceEEE
Confidence            99999999999999999999999999999998766  8999999999999998764


No 4  
>KOG1219|consensus
Probab=100.00  E-value=5.1e-33  Score=252.14  Aligned_cols=204  Identities=29%  Similarity=0.421  Sum_probs=190.8

Q ss_pred             CeEEEEcCCCCCcCceEEeCCceEEEECCcCCcCCCCeEEEEEEEEeCC-CCceEEEEEEEEeecCCCCCccCCCceEEE
Q psy11061          1 MVNYTLGESPSRTNHFYMKSVSGEICIAQDLDFESRSSYEFPVVATDRG-GLSTTAMVRIQITDVNDNEPIFNPVEYNVS   79 (212)
Q Consensus         1 ~i~Y~i~~~~~~~~~F~id~~tg~i~~~~~lD~E~~~~~~l~i~a~d~~-~~~~~~~v~i~V~d~nd~~P~f~~~~~~~~   79 (212)
                      +|.|+|.++.  -.+|.++-+||.|.+..+||||....|.|.|+|+|.+ ++.+++.|.|.|.|+|||+|+|.+..|...
T Consensus      2109 kv~YsIldg~--~slFtvnf~TG~i~v~~pLd~ea~t~h~l~ieAtd~~~p~~Aea~VeIiV~dIndn~PvFeqlsYt~s 2186 (4289)
T KOG1219|consen 2109 KVTYSILDGN--TSLFTVNFTTGVILVLIPLDREASTLHELLIEATDAGIPLSAEAKVEIIVGDINDNPPVFEQLSYTIS 2186 (4289)
T ss_pred             ceEEEEecCC--cceEEEecccceEEeccccccccccceEEEEEEeccCCCcceeeEEEEEecccCCCCchhheeeEEEE
Confidence            4799999987  6899999999999999999999999999999999987 689999999999999999999999999999


Q ss_pred             EecCCCCCcEEEEEEEEeCCCCCCceEEEEEEcCC-CCCcEEEeCCccEEEEeeecCCC--CCCeEEEEEEEEECCCCCe
Q psy11061         80 LRDDIQTTTAFAVVLATDRDSDRFGTISYKIVTGK-DANLFRIDRSSGELFVTRGNFLS--RSNSYHINVSAMDGGGNKC  156 (212)
Q Consensus        80 v~e~~~~g~~v~~~~a~D~D~~~~~~i~y~i~~~~-~~~~F~id~~~G~i~~~~~~~ld--~~~~~~l~v~a~D~~~~~~  156 (212)
                      +.|+++.|+.+.++.|+|.|.  |..+.|++..+. ....|+|+..||.|++.+.  ||  +.+.|.+.|+|+|+|.+ .
T Consensus      2187 isE~s~igt~viqilATdsDs--n~~isYsl~g~s~~sk~f~In~sTG~it~~g~--ldyE~~q~f~~fvratdggk~-l 2261 (4289)
T KOG1219|consen 2187 ISENSKIGTKVIQILATDSDS--NREISYSLEGNSEISKPFRINVSTGWITVAGK--LDYEENQEFRFFVRATDGGKP-L 2261 (4289)
T ss_pred             ccCCCccCceEEEEEeccCCC--CCceEEEeecCCccccceEEecccceEEEeee--cChhhcceEEEEEEEccCCCc-c
Confidence            999999999999999999997  788999998743 3568999999999999999  88  89999999999998877 6


Q ss_pred             eeeEEEEEEEeeCCCCCCeEecCceEEEEeCCCCCCeEEEEEEEEeCCCCCCeec
Q psy11061        157 SQDAQVNINIINSHMPIPLFQQSTYSFVVPEDVFKNSIVGTIKAATSDSGKFTEF  211 (212)
Q Consensus       157 ~~~~~v~I~v~d~nd~~P~f~~~~~~~~v~e~~~~g~~v~~v~a~D~D~~~~~~~  211 (212)
                      ++.+.|.|.|.|+|||+|.|.+..|.+.|+|.+..|..|..+.|.|.|.-+...+
T Consensus      2262 Sseviv~V~VeD~Ndn~Pef~q~~~ea~vsd~a~~g~fit~v~a~D~Dssd~lk~ 2316 (4289)
T KOG1219|consen 2262 SSEVIVEVHVEDFNDNPPEFNQRNYEAFVSDPARSGHFITVVNAHDLDSSDHLKL 2316 (4289)
T ss_pred             cccEEEEEEehhcCCCCchhccccceeecCCCccceeEEEEEEeccCCccchhhh
Confidence            6789999999999999999999999999999999999999999999998876643


No 5  
>cd00031 CA Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here
Probab=99.97  E-value=2.2e-28  Score=185.86  Aligned_cols=165  Identities=34%  Similarity=0.576  Sum_probs=148.9

Q ss_pred             CeEEEEcCCCCCcCceEEeCCceEEEECCcCCcCCCCeEEEEEEEEeCCCCc--eEEEEEEEEeecCCCCCccCCCceEE
Q psy11061          1 MVNYTLGESPSRTNHFYMKSVSGEICIAQDLDFESRSSYEFPVVATDRGGLS--TTAMVRIQITDVNDNEPIFNPVEYNV   78 (212)
Q Consensus         1 ~i~Y~i~~~~~~~~~F~id~~tg~i~~~~~lD~E~~~~~~l~i~a~d~~~~~--~~~~v~i~V~d~nd~~P~f~~~~~~~   78 (212)
                      +++|+|.++.. ..+|.|++.+|.|+++++||||....|.|.|+|.|.|...  ....++|.|.|+||++|.|....|.+
T Consensus        31 ~~~y~i~~~~~-~~~F~i~~~tG~l~~~~~lD~e~~~~~~l~v~a~D~g~~~~~~~~~v~I~V~d~Nd~~P~~~~~~~~~  109 (199)
T cd00031          31 RVTYSILGGNE-DGLFSIDPNTGVITTTKPLDREEQSEYTLTVVASDGGGPPLSSTATVTVTVLDVNDNPPVFEQSSYEA  109 (199)
T ss_pred             eEEEEEeCCCC-cccEEEeCCCCEEEECCCCCCcCCceEEEEEEEEECCcCcceeEEEEEEEEccCCCCCCcccccceEE
Confidence            37899988762 2799999999999999999999999999999999976544  89999999999999999999889999


Q ss_pred             EEecCCCCCcEEEEEEEEeCCCCCCceEEEEEEcCCCCCcEEEeCCccEEEEeeecCCC--CCCeEEEEEEEEECCCCCe
Q psy11061         79 SLRDDIQTTTAFAVVLATDRDSDRFGTISYKIVTGKDANLFRIDRSSGELFVTRGNFLS--RSNSYHINVSAMDGGGNKC  156 (212)
Q Consensus        79 ~v~e~~~~g~~v~~~~a~D~D~~~~~~i~y~i~~~~~~~~F~id~~~G~i~~~~~~~ld--~~~~~~l~v~a~D~~~~~~  156 (212)
                      .+.|+.+.|+.++++.|+|+|.+.++.+.|++..+...++|.|++.+|.|.+...  +|  ....|.+.|.|+|.+++..
T Consensus       110 ~v~e~~~~~~~i~~~~a~D~D~~~~~~~~y~l~~~~~~~~f~i~~~~G~i~~~~~--ld~e~~~~~~l~v~a~D~~~~~~  187 (199)
T cd00031         110 SVPENAPPGTVVGTVTATDADSGENAKLTYSILSGNDKELFSIDPNTGIITLAKP--LDREEKSSYELTVVATDGGGPPL  187 (199)
T ss_pred             EEeCCCCCCCEEEEEEEEcCCCCCCccEEEEEeCCCCCCEEEEeCCceEEEeCCc--cCCccCceEEEEEEEEECCCCCc
Confidence            9999999999999999999999888899999987765589999999999999988  77  6779999999999987667


Q ss_pred             eeeEEEEEEEee
Q psy11061        157 SQDAQVNINIIN  168 (212)
Q Consensus       157 ~~~~~v~I~v~d  168 (212)
                      +.++.++|.|.|
T Consensus       188 ~~~~~i~i~v~d  199 (199)
T cd00031         188 SSTATVTVTVLD  199 (199)
T ss_pred             eeEEEEEEEEEC
Confidence            778888888865


No 6  
>cd00031 CA Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here
Probab=99.92  E-value=2.3e-23  Score=158.13  Aligned_cols=132  Identities=35%  Similarity=0.591  Sum_probs=122.2

Q ss_pred             eEEEEecCCCCCcEEEEEEEEeCCCCCCceEEEEEEcCCCCCcEEEeCCccEEEEeeecCCC--CCCeEEEEEEEEECCC
Q psy11061         76 YNVSLRDDIQTTTAFAVVLATDRDSDRFGTISYKIVTGKDANLFRIDRSSGELFVTRGNFLS--RSNSYHINVSAMDGGG  153 (212)
Q Consensus        76 ~~~~v~e~~~~g~~v~~~~a~D~D~~~~~~i~y~i~~~~~~~~F~id~~~G~i~~~~~~~ld--~~~~~~l~v~a~D~~~  153 (212)
                      |.+.+.|+.+.|+.++++.|.|+|.+.++.+.|+|..+...++|.|++.+|.|++.+.  ||  ....|.|.|.|+|.+.
T Consensus         2 ~~~~i~En~~~g~~v~~~~a~D~D~~~~~~~~y~i~~~~~~~~F~i~~~tG~l~~~~~--lD~e~~~~~~l~v~a~D~g~   79 (199)
T cd00031           2 YSVSVPENAPPGTVVGTVSATDPDSGENGRVTYSILGGNEDGLFSIDPNTGVITTTKP--LDREEQSEYTLTVVASDGGG   79 (199)
T ss_pred             eEEEEeCCCCCCCEEEEEEEECCCCCCCceEEEEEeCCCCcccEEEeCCCCEEEECCC--CCCcCCceEEEEEEEEECCc
Confidence            7889999999999999999999999888899999998766589999999999999998  77  7899999999999887


Q ss_pred             CCeeeeEEEEEEEeeCCCCCCeEecCceEEEEeCCCCCCeEEEEEEEEeCCCCCCe
Q psy11061        154 NKCSQDAQVNINIINSHMPIPLFQQSTYSFVVPEDVFKNSIVGTIKAATSDSGKFT  209 (212)
Q Consensus       154 ~~~~~~~~v~I~v~d~nd~~P~f~~~~~~~~v~e~~~~g~~v~~v~a~D~D~~~~~  209 (212)
                      +.++....++|.|.|+||++|.|....|.+.+.|+.++|+.++++.|+|+|.+.+.
T Consensus        80 ~~~~~~~~v~I~V~d~Nd~~P~~~~~~~~~~v~e~~~~~~~i~~~~a~D~D~~~~~  135 (199)
T cd00031          80 PPLSSTATVTVTVLDVNDNPPVFEQSSYEASVPENAPPGTVVGTVTATDADSGENA  135 (199)
T ss_pred             CcceeEEEEEEEEccCCCCCCcccccceEEEEeCCCCCCCEEEEEEEEcCCCCCCc
Confidence            77778899999999999999999988999999999999999999999999997444


No 7  
>PF00028 Cadherin:  Cadherin domain;  InterPro: IPR002126 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion.; GO: 0005509 calcium ion binding, 0007156 homophilic cell adhesion, 0016020 membrane; PDB: 2A4E_A 2A4C_B 2O72_A 2QVI_A 1NCJ_A 3Q2W_A 3Q2N_A 3LNH_B 3LNI_A 3Q2L_A ....
Probab=99.77  E-value=3.8e-17  Score=109.27  Aligned_cols=90  Identities=34%  Similarity=0.542  Sum_probs=84.7

Q ss_pred             eEEEEecCCCCCcEEEEEEEEeCCCCCCceEEEEEEcCCCCCcEEEeCCccEEEEeeecCCC--CCCeEEEEEEEEEC-C
Q psy11061         76 YNVSLRDDIQTTTAFAVVLATDRDSDRFGTISYKIVTGKDANLFRIDRSSGELFVTRGNFLS--RSNSYHINVSAMDG-G  152 (212)
Q Consensus        76 ~~~~v~e~~~~g~~v~~~~a~D~D~~~~~~i~y~i~~~~~~~~F~id~~~G~i~~~~~~~ld--~~~~~~l~v~a~D~-~  152 (212)
                      |.+.|+|+.+.|+.++++.|.|+|.+.++.+.|+|..++..++|.|++.+|.|.+.+.  ||  ....|.|.|.|+|. +
T Consensus         1 Y~~~v~E~~~~g~~v~~v~a~D~D~~~n~~i~y~i~~~~~~~~F~I~~~tg~i~~~~~--LD~E~~~~y~l~v~a~D~~~   78 (93)
T PF00028_consen    1 YSFSVPENAPPGTVVGQVTATDPDSGPNSQITYSILGGNPDGLFSIDPNTGEISLKKP--LDRETQSSYQLTVRATDSGG   78 (93)
T ss_dssp             EEEEEETTGSTSSEEEEEEEEESSTSTTSSEEEEEEETTSTTSEEEETTTTEEEESSS--SCTTTTSEEEEEEEEEETTT
T ss_pred             CEEEEECCCCCCCEEEEEEEEeCCCCCCceEEEEEecCcccCceEEeeeeecccccee--cCcccCCEEEEEEEEEECCC
Confidence            7899999999999999999999999899999999999887899999999999999999  88  88999999999999 7


Q ss_pred             CCCeeeeEEEEEEEe
Q psy11061        153 GNKCSQDAQVNINII  167 (212)
Q Consensus       153 ~~~~~~~~~v~I~v~  167 (212)
                      .++++++++|.|+|.
T Consensus        79 ~~~~~~~~~V~I~V~   93 (93)
T PF00028_consen   79 SPPLSSTATVTINVL   93 (93)
T ss_dssp             SSEEEEEEEEEEEEE
T ss_pred             CCCCEEEEEEEEEEC
Confidence            888889999999884


No 8  
>KOG1834|consensus
Probab=99.76  E-value=6.6e-17  Score=135.82  Aligned_cols=158  Identities=27%  Similarity=0.464  Sum_probs=128.5

Q ss_pred             EEEEcCCCCCcCceEE---eCCce--EEEECCcCCcCCCCeEEEEEEEEeCCC--------CceEEEEEEEEeecCCCCC
Q psy11061          3 NYTLGESPSRTNHFYM---KSVSG--EICIAQDLDFESRSSYEFPVVATDRGG--------LSTTAMVRIQITDVNDNEP   69 (212)
Q Consensus         3 ~Y~i~~~~~~~~~F~i---d~~tg--~i~~~~~lD~E~~~~~~l~i~a~d~~~--------~~~~~~v~i~V~d~nd~~P   69 (212)
                      -|.|.+++   -.|.+   |..||  .|+.+.+||.|.++.|+|+|+|.|+|.        .+..++++|.|.|+|+.+|
T Consensus        71 ~fklhgq~---vPFdavVvdK~TGegvlRaK~~lDCelqkeytf~iQAydCg~gpdgtn~kKShkatvhIrVkDvNe~AP  147 (952)
T KOG1834|consen   71 GFKLHGQP---VPFDAVVVDKYTGEGVLRAKEPLDCELQKEYTFTIQAYDCGNGPDGTNTKKSHKATVHIRVKDVNEFAP  147 (952)
T ss_pred             eeEecCCC---CCceEEEEeccCCceEEeecCcccccccccceEEEEEEecCCCCCccccccccceEEEEEeccccccCc
Confidence            36664443   34543   55555  699999999999999999999999872        4567899999999999999


Q ss_pred             ccCCCceEEEEecCCCCCcEEEEEEEEeCCCC-CCceE-EEEEEcCCCCCcEEEeCCccEEEEeeecCCC--CCCeEEEE
Q psy11061         70 IFNPVEYNVSLRDDIQTTTAFAVVLATDRDSD-RFGTI-SYKIVTGKDANLFRIDRSSGELFVTRGNFLS--RSNSYHIN  145 (212)
Q Consensus        70 ~f~~~~~~~~v~e~~~~g~~v~~~~a~D~D~~-~~~~i-~y~i~~~~~~~~F~id~~~G~i~~~~~~~ld--~~~~~~l~  145 (212)
                      +|....|.+.|.|+. .-..++++.|.|.|.+ +++.| .|.|+.  ++-.|.|| +.|.|+.+..  |.  +..+|.|+
T Consensus       148 ~f~ep~Yka~V~EGK-~yd~il~veAiD~DCspq~sqIC~YEI~t--~d~PFaId-n~G~irnTek--Lny~ke~~Y~lt  221 (952)
T KOG1834|consen  148 VFKEPWYKAHVTEGK-VYDSILRVEAIDKDCSPQYSQICEYEITT--PDVPFAID-NDGNIRNTEK--LNYTKEHQYKLT  221 (952)
T ss_pred             hhcccceeeEEecce-eeeeeEEEEeecCCCCCcccceeEEEecC--CCCceEEc-CCCccccccc--cccccceeEEEE
Confidence            999999999999984 4577899999999986 56666 788875  45689998 5699999999  77  89999999


Q ss_pred             EEEEECCCCCeeeeEEEEEEEeeC
Q psy11061        146 VSAMDGGGNKCSQDAQVNINIINS  169 (212)
Q Consensus       146 v~a~D~~~~~~~~~~~v~I~v~d~  169 (212)
                      |.|.|.+..+....+-|+|.|...
T Consensus       222 VtAyDCg~kraa~d~lV~v~Vkp~  245 (952)
T KOG1834|consen  222 VTAYDCGKKRAASDSLVTVHVKPT  245 (952)
T ss_pred             EEEEecccccccCcceEEEEecCc
Confidence            999998865544556788887654


No 9  
>smart00112 CA Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
Probab=99.67  E-value=1.1e-15  Score=99.04  Aligned_cols=77  Identities=35%  Similarity=0.641  Sum_probs=70.2

Q ss_pred             EeCCCCCCceEEEEEEcCCCCCcEEEeCCccEEEEeeecCCC--CCCeEEEEEEEEECCCCCeeeeEEEEEEEeeCCCCC
Q psy11061         96 TDRDSDRFGTISYKIVTGKDANLFRIDRSSGELFVTRGNFLS--RSNSYHINVSAMDGGGNKCSQDAQVNINIINSHMPI  173 (212)
Q Consensus        96 ~D~D~~~~~~i~y~i~~~~~~~~F~id~~~G~i~~~~~~~ld--~~~~~~l~v~a~D~~~~~~~~~~~v~I~v~d~nd~~  173 (212)
                      +|+|.+.++.+.|+|..+...++|.|++.+|.|++.+.  ||  ....|.|.|.|+|.+++.+++.++|+|+|.|+||++
T Consensus         1 ~D~D~g~n~~i~Y~i~~~~~~~~F~i~~~tg~i~~~~~--LD~e~~~~y~l~v~a~D~~~~~~~~~~~v~I~V~D~Nd~~   78 (79)
T smart00112        1 TDADSGENGKVTYSILSGNEDGLFSIDPETGEITTTKP--LDREEQPEYTLTVEATDGGGPPLSSTATVTVTVLDVNDNA   78 (79)
T ss_pred             CCCCCCcCcEEEEEEecCCCCCEEEEeCCccEEEeCCc--cCeeCCCeEEEEEEEEECCCCCcccEEEEEEEEEECCCCC
Confidence            47888888999999998766589999999999988888  88  789999999999999888889999999999999999


Q ss_pred             C
Q psy11061        174 P  174 (212)
Q Consensus       174 P  174 (212)
                      |
T Consensus        79 P   79 (79)
T smart00112       79 P   79 (79)
T ss_pred             C
Confidence            8


No 10 
>smart00112 CA Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
Probab=99.63  E-value=4.1e-15  Score=96.30  Aligned_cols=68  Identities=40%  Similarity=0.607  Sum_probs=62.2

Q ss_pred             CeEEEEcCCCCCcCceEEeCCceEEEECCcCCcCCCCeEEEEEEEEeCCC--CceEEEEEEEEeecCCCCC
Q psy11061          1 MVNYTLGESPSRTNHFYMKSVSGEICIAQDLDFESRSSYEFPVVATDRGG--LSTTAMVRIQITDVNDNEP   69 (212)
Q Consensus         1 ~i~Y~i~~~~~~~~~F~id~~tg~i~~~~~lD~E~~~~~~l~i~a~d~~~--~~~~~~v~i~V~d~nd~~P   69 (212)
                      +|+|+|..+.. ..+|.|++.+|.|+++++||||....|.|.|+|.|.++  +++.+.|+|.|.|+|||+|
T Consensus        10 ~i~Y~i~~~~~-~~~F~i~~~tg~i~~~~~LD~e~~~~y~l~v~a~D~~~~~~~~~~~v~I~V~D~Nd~~P   79 (79)
T smart00112       10 KVTYSILSGNE-DGLFSIDPETGEITTTKPLDREEQPEYTLTVEATDGGGPPLSSTATVTVTVLDVNDNAP   79 (79)
T ss_pred             EEEEEEecCCC-CCEEEEeCCccEEEeCCccCeeCCCeEEEEEEEEECCCCCcccEEEEEEEEEECCCCCC
Confidence            37899987762 28999999999999999999999999999999999985  7899999999999999998


No 11 
>KOG1834|consensus
Probab=99.40  E-value=6.3e-12  Score=106.32  Aligned_cols=148  Identities=23%  Similarity=0.327  Sum_probs=114.2

Q ss_pred             EEEEeecCCCCCccCCCceEEEEecCCCCCcEEEEEEEEeCCCC--CCceE-EEEEEcCCCCCcEEE---eCCc--cEEE
Q psy11061         58 RIQITDVNDNEPIFNPVEYNVSLRDDIQTTTAFAVVLATDRDSD--RFGTI-SYKIVTGKDANLFRI---DRSS--GELF  129 (212)
Q Consensus        58 ~i~V~d~nd~~P~f~~~~~~~~v~e~~~~g~~v~~~~a~D~D~~--~~~~i-~y~i~~~~~~~~F~i---d~~~--G~i~  129 (212)
                      +....-+|-+.|.... .|...|.||...-.....+.|-|.|.+  -.+.| -|+|.+.+  -.|.+   |..+  |.|+
T Consensus        20 ~~~aarankhkpwie~-ey~gvV~Endntvll~Ppl~aLdkdaplr~ageiC~fklhgq~--vPFdavVvdK~TGegvlR   96 (952)
T KOG1834|consen   20 HHHAARANKHKPWIEE-EYHGVVTENDNTVLLDPPLAALDKDAPLRYAGEICGFKLHGQP--VPFDAVVVDKYTGEGVLR   96 (952)
T ss_pred             ccccccccccCccccc-ceeEEEEeCCceEEeCCCeeeecCCCCcccccccceeEecCCC--CCceEEEEeccCCceEEe
Confidence            4556677888887766 599999999765444457888898875  22334 56665432  34554   5555  5688


Q ss_pred             EeeecCCC--CCCeEEEEEEEEECC-CC-----CeeeeEEEEEEEeeCCCCCCeEecCceEEEEeCCCCCCeEEEEEEEE
Q psy11061        130 VTRGNFLS--RSNSYHINVSAMDGG-GN-----KCSQDAQVNINIINSHMPIPLFQQSTYSFVVPEDVFKNSIVGTIKAA  201 (212)
Q Consensus       130 ~~~~~~ld--~~~~~~l~v~a~D~~-~~-----~~~~~~~v~I~v~d~nd~~P~f~~~~~~~~v~e~~~~g~~v~~v~a~  201 (212)
                      .+.+  +|  .+..|+|+|+|.|.+ +|     .++..++|+|.|.|+|.++|+|..+.|.+.|.|. .....|++|.|.
T Consensus        97 aK~~--lDCelqkeytf~iQAydCg~gpdgtn~kKShkatvhIrVkDvNe~AP~f~ep~Yka~V~EG-K~yd~il~veAi  173 (952)
T KOG1834|consen   97 AKEP--LDCELQKEYTFTIQAYDCGNGPDGTNTKKSHKATVHIRVKDVNEFAPVFKEPWYKAHVTEG-KVYDSILRVEAI  173 (952)
T ss_pred             ecCc--ccccccccceEEEEEEecCCCCCccccccccceEEEEEeccccccCchhcccceeeEEecc-eeeeeeEEEEee
Confidence            8888  99  699999999999977 44     4577899999999999999999999999999998 677899999999


Q ss_pred             eCC-CCCCeec
Q psy11061        202 TSD-SGKFTEF  211 (212)
Q Consensus       202 D~D-~~~~~~~  211 (212)
                      |.| +++++++
T Consensus       174 D~DCspq~sqI  184 (952)
T KOG1834|consen  174 DKDCSPQYSQI  184 (952)
T ss_pred             cCCCCCcccce
Confidence            999 5555553


No 12 
>PF00028 Cadherin:  Cadherin domain;  InterPro: IPR002126 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion.; GO: 0005509 calcium ion binding, 0007156 homophilic cell adhesion, 0016020 membrane; PDB: 2A4E_A 2A4C_B 2O72_A 2QVI_A 1NCJ_A 3Q2W_A 3Q2N_A 3LNH_B 3LNI_A 3Q2L_A ....
Probab=99.23  E-value=1.1e-10  Score=77.79  Aligned_cols=61  Identities=36%  Similarity=0.544  Sum_probs=53.2

Q ss_pred             CeEEEEcCCCCCcCceEEeCCceEEEECCcCCcCCCCeEEEEEEEEeC-CC--CceEEEEEEEEe
Q psy11061          1 MVNYTLGESPSRTNHFYMKSVSGEICIAQDLDFESRSSYEFPVVATDR-GG--LSTTAMVRIQIT   62 (212)
Q Consensus         1 ~i~Y~i~~~~~~~~~F~id~~tg~i~~~~~lD~E~~~~~~l~i~a~d~-~~--~~~~~~v~i~V~   62 (212)
                      ++.|+|..+. +.++|.|++.+|.|+++++||||....|.|.|.|+|. ++  +++.+.|+|+|.
T Consensus        30 ~i~y~i~~~~-~~~~F~I~~~tg~i~~~~~LD~E~~~~y~l~v~a~D~~~~~~~~~~~~V~I~V~   93 (93)
T PF00028_consen   30 QITYSILGGN-PDGLFSIDPNTGEISLKKPLDRETQSSYQLTVRATDSGGSPPLSSTATVTINVL   93 (93)
T ss_dssp             SEEEEEEETT-STTSEEEETTTTEEEESSSSCTTTTSEEEEEEEEEETTTSSEEEEEEEEEEEEE
T ss_pred             eEEEEEecCc-ccCceEEeeeeeccccceecCcccCCEEEEEEEEEECCCCCCCEEEEEEEEEEC
Confidence            4789997776 3689999999999999999999999999999999999 53  567778887764


No 13 
>PF08758 Cadherin_pro:  Cadherin prodomain like;  InterPro: IPR014868 Cadherins are a group of proteins that mediate calcium dependent cell-cell adhesion. They are activated through cleavage of a prosequence in the late Golgi. This protein corresponds to the folded region of the prosequence, and is termed the prodomain. The prodomain shows structural resemblance to the cadherin domain, but lacks all the features known to be important for cadherin-cadherin interactions []. ; GO: 0007155 cell adhesion, 0016021 integral to membrane; PDB: 1OP4_A.
Probab=97.53  E-value=0.0025  Score=41.89  Aligned_cols=79  Identities=16%  Similarity=0.286  Sum_probs=44.4

Q ss_pred             CCccCCCceEEEEecCCCCCcEEEEEEEEeCCCCCCceEEEEEEcCCCCCcEEEeCCccEEEEeeecCCC-CCCeEEEEE
Q psy11061         68 EPIFNPVEYNVSLRDDIQTTTAFAVVLATDRDSDRFGTISYKIVTGKDANLFRIDRSSGELFVTRGNFLS-RSNSYHINV  146 (212)
Q Consensus        68 ~P~f~~~~~~~~v~e~~~~g~~v~~~~a~D~D~~~~~~i~y~i~~~~~~~~F~id~~~G~i~~~~~~~ld-~~~~~~l~v  146 (212)
                      .|-|....|.+.|+.+...|..++++.-.|...  +..+.|...  +  ..|+|. ..|.|.+..+  +. ..+.-.|.|
T Consensus         3 ~pGF~~~~~~~~Vp~~l~~g~~lg~V~f~dC~~--~~~~~~~ss--D--pdF~V~-~DGsVy~~r~--v~l~~~~~~F~V   73 (90)
T PF08758_consen    3 RPGFSQKKYTFEVPSNLEAGQPLGKVNFEDCTG--RRRVIFESS--D--PDFRVL-EDGSVYAKRP--VQLSSEQRSFTV   73 (90)
T ss_dssp             --B--S-EEEE----SS-SS--EEE---B--SS-----EEEE-------SEEEEE-TTTEEEEES----S-SSS-EEEEE
T ss_pred             cCCcccceEEEEcCchhhCCcEEEEEEeccCCC--CCceEEecC--C--CCEEEc-CCCeEEEeee--EecCCCceEEEE
Confidence            488999999999999999999999999988843  345777753  2  389997 5699999999  77 666678999


Q ss_pred             EEEECCCCC
Q psy11061        147 SAMDGGGNK  155 (212)
Q Consensus       147 ~a~D~~~~~  155 (212)
                      .|.|..+..
T Consensus        74 ~a~D~~~~~   82 (90)
T PF08758_consen   74 HAWDSQTQE   82 (90)
T ss_dssp             EEEETTTTE
T ss_pred             EEECCCCCe
Confidence            999987654


No 14 
>smart00736 CADG Dystroglycan-type cadherin-like domains. Cadherin-homologous domains present in metazoan dystroglycans and alpha/epsilon sarcoglycans, yeast Axl2p and in a very large protein from magnetotactic bacteria. Likely to bind calcium ions.
Probab=97.20  E-value=0.015  Score=38.71  Aligned_cols=69  Identities=26%  Similarity=0.342  Sum_probs=50.9

Q ss_pred             EeCCCCCCceEEEEEEcCC---CCCcEEEeCCccEEEEeeecCCCCCCeEEEEEEEEECCCCCeeeeEEEEEEEeeCCC
Q psy11061         96 TDRDSDRFGTISYKIVTGK---DANLFRIDRSSGELFVTRGNFLSRSNSYHINVSAMDGGGNKCSQDAQVNINIINSHM  171 (212)
Q Consensus        96 ~D~D~~~~~~i~y~i~~~~---~~~~F~id~~~G~i~~~~~~~ld~~~~~~l~v~a~D~~~~~~~~~~~v~I~v~d~nd  171 (212)
                      .|.|   ...+.|++...+   .+.|...|+.++.+.-...  ....+.|.++|.|+|..+  .+....++|.|.+.|+
T Consensus        25 ~d~d---~~~lty~~~~~~~~~lP~Wl~fd~~~~~~~GtP~--~~~~g~~~i~v~a~D~~g--~~~~~~f~i~V~~~~~   96 (97)
T smart00736       25 TDAD---GDTLTYSATLSDGSALPSWLSFDSDTGTLSGTPT--NSDVGSLSLKVTATDSSG--ASASDTFTITVVNTND   96 (97)
T ss_pred             ECCC---CCeEEEEEEeCCCCCCCCeEEEeCCCCEEEEECC--CCCCcEEEEEEEEEECCC--CEEEEEEEEEEeCCCC
Confidence            5665   335888886432   3579999998888876443  335678999999999876  4467789999998885


No 15 
>PF08266 Cadherin_2:  Cadherin-like;  InterPro: IPR013164 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion. This entry represents a cadherin domain that is usually found at the N terminus of cadherin proteins.; PDB: 1WUZ_A 1WYJ_A.
Probab=97.18  E-value=0.0007  Score=43.90  Aligned_cols=60  Identities=20%  Similarity=0.455  Sum_probs=37.9

Q ss_pred             ceEEEEecCCCCCcEEEEEEEEeCCCCCC--ceEEEEEEcCCCCCcEEEeCCccEEEEeeecCCC
Q psy11061         75 EYNVSLRDDIQTTTAFAVVLATDRDSDRF--GTISYKIVTGKDANLFRIDRSSGELFVTRGNFLS  137 (212)
Q Consensus        75 ~~~~~v~e~~~~g~~v~~~~a~D~D~~~~--~~i~y~i~~~~~~~~F~id~~~G~i~~~~~~~ld  137 (212)
                      .....|+|..+.|+.|+.+. .|......  ....|++.+.....+|.++..+|.+.+...  +|
T Consensus         2 qi~YsV~EE~~~Gt~IGnia-~dL~l~~~~l~~~~~ri~s~~~~~~~~v~~~tG~L~v~~r--ID   63 (84)
T PF08266_consen    2 QIRYSVPEEMPPGTVIGNIA-KDLGLDPQSLSSRNFRIVSEGNSQYFRVNEKTGDLFVSER--ID   63 (84)
T ss_dssp             EEEEEEESS--TT-EEEECC-CCCT--HHHHCCTTBEEE-SSSS-SEEE-TTTSEEEESS----S
T ss_pred             CeEEEeecCCCCCCEEEEhH-HhhCCCcccccccceEEeecCCcceeEecCCceeEEeCCc--cC
Confidence            35688999999999999984 44433211  124677777666789999999999999988  77


No 16 
>TIGR01965 VCBS_repeat VCBS repeat. This domain of about 100 residues is found multiple (up to 35) copies in long proteins from several species of Vibrio, Colwellia, Bradyrhizobium, and Shewanella (hence the name VCBS) and in smaller copy numbers in proteins from several other bacteria. The large protein size and repeat copy numbers, species distribution, and suggested activities of several member proteins suggests a role for this domain in adhesion.
Probab=97.11  E-value=0.016  Score=38.63  Aligned_cols=88  Identities=18%  Similarity=0.195  Sum_probs=57.8

Q ss_pred             EEEEEEeCCCCCCceEEEEEEc-CCCCCcEEEeCCccEEEEeeecCC------C--CCCeEEEEEEEEECCCCCeeeeEE
Q psy11061         91 AVVLATDRDSDRFGTISYKIVT-GKDANLFRIDRSSGELFVTRGNFL------S--RSNSYHINVSAMDGGGNKCSQDAQ  161 (212)
Q Consensus        91 ~~~~a~D~D~~~~~~i~y~i~~-~~~~~~F~id~~~G~i~~~~~~~l------d--~~~~~~l~v~a~D~~~~~~~~~~~  161 (212)
                      +++.++|+|.+..  ..+.... ....+.|.|+. +|.+.....+..      .  ..-.-.|.+.+.|+.      +.+
T Consensus         2 G~Lt~sD~D~gd~--~~~s~~~~~g~yGtlti~~-~G~wtYtl~n~~~avq~L~~Ge~~tdsFtvtv~DGt------t~~   72 (99)
T TIGR01965         2 GQLTISDADAGQA--HFIAQTDAAGQYGTFSIDA-DGQWTYQADNSQTAVQALKAGETLTDTFTVTSADGT------SQT   72 (99)
T ss_pred             CceEEeCCCCCCc--eEEecccccCCcEEEEECC-CCcEEEEeCCCcHHHHhhcCCCEEEEEEEEEEeCCC------eEE
Confidence            4688899997654  4555532 23467899986 688877654321      1  223456888888842      788


Q ss_pred             EEEEEeeCCCCCCeEecCceEEEEeCCC
Q psy11061        162 VNINIINSHMPIPLFQQSTYSFVVPEDV  189 (212)
Q Consensus       162 v~I~v~d~nd~~P~f~~~~~~~~v~e~~  189 (212)
                      |+|+|...|| +|+..... ...+.|+.
T Consensus        73 vtItI~GtND-apvi~~~~-~g~v~ED~   98 (99)
T TIGR01965        73 VTITITGAND-AAVIGGAD-TGSVTEDS   98 (99)
T ss_pred             EEEEEEccCC-CCEEeccc-ceeEecCC
Confidence            9999999997 66655433 46666663


No 17 
>PF13750 Big_3_3:  Bacterial Ig-like domain (group 3)
Probab=97.02  E-value=0.075  Score=38.81  Aligned_cols=127  Identities=20%  Similarity=0.331  Sum_probs=73.8

Q ss_pred             CCeEEEEE-EEEeCCCCceEEEEEEEEeecCCCCCccCCCceEEEEecCCCC-CcEEEEEEEEeCCCCCCceEEEEEEcC
Q psy11061         36 RSSYEFPV-VATDRGGLSTTAMVRIQITDVNDNEPIFNPVEYNVSLRDDIQT-TTAFAVVLATDRDSDRFGTISYKIVTG  113 (212)
Q Consensus        36 ~~~~~l~i-~a~d~~~~~~~~~v~i~V~d~nd~~P~f~~~~~~~~v~e~~~~-g~~v~~~~a~D~D~~~~~~i~y~i~~~  113 (212)
                      ...|.+.+ .|.|.-|...+..+...+. +...+|.+.- .....+..+..+ +..-..+.++|...+. .--...+.+|
T Consensus        14 dG~Y~l~~~~a~D~agN~~~~~~~~~~~-iD~T~Ptisi-~~~~~~~~g~~v~~~~~i~i~~tD~~~~~-~i~sv~l~Gg   90 (158)
T PF13750_consen   14 DGSYTLTVVTATDAAGNTSTSTVSETFT-IDNTPPTISI-SDGASVANGSTVYGLVNISINVTDNSDDS-KITSVSLTGG   90 (158)
T ss_pred             CccEEEEEEEEEecCCCEEEEEEeeEEE-EcCCCCEEEE-ecCCccCCCccccceeeeEEEEEeCCCCc-eEEEEEEECC
Confidence            46899999 7999988766666654444 3556787744 111222222222 2333567788876554 3346677655


Q ss_pred             CCCCcEEEe---CCccEEEEeeecCC---CCCCeEEEEEEEEECCCCCeeeeEEEEEEEe
Q psy11061        114 KDANLFRID---RSSGELFVTRGNFL---SRSNSYHINVSAMDGGGNKCSQDAQVNINII  167 (212)
Q Consensus       114 ~~~~~F~id---~~~G~i~~~~~~~l---d~~~~~~l~v~a~D~~~~~~~~~~~v~I~v~  167 (212)
                      .....-.+.   ...|...+.-+..+   ...+.|+|.|.|.|..|..  .+.++.....
T Consensus        91 ~~~d~v~ls~~~~~~~~~~~~yp~~fpsle~~~~YtLtV~a~D~aGN~--~~~si~F~y~  148 (158)
T PF13750_consen   91 PASDSVSLSWTNKGNGVYTLEYPRIFPSLEADDSYTLTVSATDKAGNQ--STKSISFSYM  148 (158)
T ss_pred             cccceEEEeeEeccCceEEeecccccCCcCCCCeEEEEEEEEecCCCE--EEEEEEEEEe
Confidence            433332221   12345444433223   3688999999999998865  3445555544


No 18 
>smart00736 CADG Dystroglycan-type cadherin-like domains. Cadherin-homologous domains present in metazoan dystroglycans and alpha/epsilon sarcoglycans, yeast Axl2p and in a very large protein from magnetotactic bacteria. Likely to bind calcium ions.
Probab=97.01  E-value=0.013  Score=39.00  Aligned_cols=63  Identities=21%  Similarity=0.291  Sum_probs=48.8

Q ss_pred             eEEEEcC--CCCCcCceEEeCCceEEEECCcCCcCCCCeEEEEEEEEeCCCCceEEEEEEEEeecCC
Q psy11061          2 VNYTLGE--SPSRTNHFYMKSVSGEICIAQDLDFESRSSYEFPVVATDRGGLSTTAMVRIQITDVND   66 (212)
Q Consensus         2 i~Y~i~~--~~~~~~~F~id~~tg~i~~~~~lD~E~~~~~~l~i~a~d~~~~~~~~~v~i~V~d~nd   66 (212)
                      |+|++..  +..-..|.+.++.++.+.-. +... ....|.+.+.|+|..|.+....++|.|.+.|+
T Consensus        32 lty~~~~~~~~~lP~Wl~fd~~~~~~~Gt-P~~~-~~g~~~i~v~a~D~~g~~~~~~f~i~V~~~~~   96 (97)
T smart00736       32 LTYSATLSDGSALPSWLSFDSDTGTLSGT-PTNS-DVGSLSLKVTATDSSGASASDTFTITVVNTND   96 (97)
T ss_pred             EEEEEEeCCCCCCCCeEEEeCCCCEEEEE-CCCC-CCcEEEEEEEEEECCCCEEEEEEEEEEeCCCC
Confidence            6788743  22234799999999888874 4332 34679999999999888899999999999886


No 19 
>PF08266 Cadherin_2:  Cadherin-like;  InterPro: IPR013164 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion. This entry represents a cadherin domain that is usually found at the N terminus of cadherin proteins.; PDB: 1WUZ_A 1WYJ_A.
Probab=96.15  E-value=0.016  Score=37.59  Aligned_cols=31  Identities=13%  Similarity=0.389  Sum_probs=20.9

Q ss_pred             EEEcCCCCCcCceEEeCCceEEEECCcCCcCC
Q psy11061          4 YTLGESPSRTNHFYMKSVSGEICIAQDLDFES   35 (212)
Q Consensus         4 Y~i~~~~~~~~~F~id~~tg~i~~~~~lD~E~   35 (212)
                      |+|.+.. ...+|.+++.+|.|+++..+|||.
T Consensus        36 ~ri~s~~-~~~~~~v~~~tG~L~v~~rIDRE~   66 (84)
T PF08266_consen   36 FRIVSEG-NSQYFRVNEKTGDLFVSERIDREE   66 (84)
T ss_dssp             BEEE-SS-SS-SEEE-TTTSEEEESS--SCCC
T ss_pred             eEEeecC-CcceeEecCCceeEEeCCccCHHH
Confidence            4543332 358999999999999999999998


No 20 
>PF05345 He_PIG:  Putative Ig domain;  InterPro: IPR008009 This alignment represents the conserved core region of a ~90 residue repeat found in several haemagglutinins and other cell surface proteins. Sequence similarities to Hyalin (IPR003410 from INTERPRO) and the PKD domain (IPR000601 from INTERPRO) suggest an Ig-like fold so this family may be similar in function to the (IPR003791 from INTERPRO) and (IPR003790 from INTERPRO) protein families.
Probab=94.24  E-value=0.35  Score=27.80  Aligned_cols=37  Identities=30%  Similarity=0.420  Sum_probs=29.2

Q ss_pred             CCCcEEEeCCccEEEEeeecCCC-CCCeEEEEEEEEECCC
Q psy11061        115 DANLFRIDRSSGELFVTRGNFLS-RSNSYHINVSAMDGGG  153 (212)
Q Consensus       115 ~~~~F~id~~~G~i~~~~~~~ld-~~~~~~l~v~a~D~~~  153 (212)
                      -+....+|+.+|.|.-...  -. ..+.|.+.|.|+|..+
T Consensus        12 LP~gLs~d~~tG~isGtp~--~~~~~G~y~~~vtatd~~G   49 (49)
T PF05345_consen   12 LPSGLSLDPSTGTISGTPT--SSVQPGTYTFTVTATDGSG   49 (49)
T ss_pred             CCCcEEEeCCCCEEEeecC--CCccccEEEEEEEEEcCCC
Confidence            3568899999999987755  33 4579999999999753


No 21 
>PF07495 Y_Y_Y:  Y_Y_Y domain;  InterPro: IPR011123 This region is mostly found at the end of the beta propellers (IPR011110 from INTERPRO) in a family of two component regulators. However they are also found tandemly repeated in Q891H4 from SWISSPROT without other signal conduction domains being present. It is named after the conserved tyrosines found in the alignment. The exact function is not known.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=93.81  E-value=0.86  Score=27.52  Aligned_cols=59  Identities=20%  Similarity=0.261  Sum_probs=34.4

Q ss_pred             CceEEEEEEcCCCCCcEEEeCCccEEEEeeecCCCCCCeEEEEEEEEECCCCCeeeeEEEEEEE
Q psy11061        103 FGTISYKIVTGKDANLFRIDRSSGELFVTRGNFLSRSNSYHINVSAMDGGGNKCSQDAQVNINI  166 (212)
Q Consensus       103 ~~~i~y~i~~~~~~~~F~id~~~G~i~~~~~~~ld~~~~~~l~v~a~D~~~~~~~~~~~v~I~v  166 (212)
                      +-...|++.. ....+..+...+-.+....-    ..+.|+|.|.|.|..+.......++.|+|
T Consensus         7 ~~~Y~Y~l~g-~d~~W~~~~~~~~~~~~~~L----~~G~Y~l~V~a~~~~~~~~~~~~~l~i~I   65 (66)
T PF07495_consen    7 NIRYRYRLEG-FDDEWITLGSYSNSISYTNL----PPGKYTLEVRAKDNNGKWSSDEKSLTITI   65 (66)
T ss_dssp             TEEEEEEEET-TESSEEEESSTS-EEEEES------SEEEEEEEEEEETTS-B-SS-EEEEEEE
T ss_pred             ceEEEEEEEC-CCCeEEECCCCcEEEEEEeC----CCEEEEEEEEEECCCCCcCcccEEEEEEE
Confidence            3356777753 44466666433224443332    68999999999998876654446666666


No 22 
>PF08758 Cadherin_pro:  Cadherin prodomain like;  InterPro: IPR014868 Cadherins are a group of proteins that mediate calcium dependent cell-cell adhesion. They are activated through cleavage of a prosequence in the late Golgi. This protein corresponds to the folded region of the prosequence, and is termed the prodomain. The prodomain shows structural resemblance to the cadherin domain, but lacks all the features known to be important for cadherin-cadherin interactions []. ; GO: 0007155 cell adhesion, 0016021 integral to membrane; PDB: 1OP4_A.
Probab=92.42  E-value=1  Score=29.50  Aligned_cols=37  Identities=19%  Similarity=0.256  Sum_probs=27.6

Q ss_pred             CceEEeCCceEEEECCcCCcCCCCeEEEEEEEEeCCCCc
Q psy11061         14 NHFYMKSVSGEICIAQDLDFESRSSYEFPVVATDRGGLS   52 (212)
Q Consensus        14 ~~F~id~~tg~i~~~~~lD~E~~~~~~l~i~a~d~~~~~   52 (212)
                      ..|.|.+ .|.|.+++++.... ..-+|.|.|+|..+..
T Consensus        46 pdF~V~~-DGsVy~~r~v~l~~-~~~~F~V~a~D~~~~~   82 (90)
T PF08758_consen   46 PDFRVLE-DGSVYAKRPVQLSS-EQRSFTVHAWDSQTQE   82 (90)
T ss_dssp             SEEEEET-TTEEEEES--S-SS-S-EEEEEEEEETTTTE
T ss_pred             CCEEEcC-CCeEEEeeeEecCC-CceEEEEEEECCCCCe
Confidence            4799998 99999999987654 4468999999998754


No 23 
>PF05345 He_PIG:  Putative Ig domain;  InterPro: IPR008009 This alignment represents the conserved core region of a ~90 residue repeat found in several haemagglutinins and other cell surface proteins. Sequence similarities to Hyalin (IPR003410 from INTERPRO) and the PKD domain (IPR000601 from INTERPRO) suggest an Ig-like fold so this family may be similar in function to the (IPR003791 from INTERPRO) and (IPR003790 from INTERPRO) protein families.
Probab=92.01  E-value=1.1  Score=25.69  Aligned_cols=46  Identities=24%  Similarity=0.381  Sum_probs=31.7

Q ss_pred             EEEEc--CCCCCcCceEEeCCceEEEECCcCCcC-CCCeEEEEEEEEeCCC
Q psy11061          3 NYTLG--ESPSRTNHFYMKSVSGEICIAQDLDFE-SRSSYEFPVVATDRGG   50 (212)
Q Consensus         3 ~Y~i~--~~~~~~~~F~id~~tg~i~~~~~lD~E-~~~~~~l~i~a~d~~~   50 (212)
                      +|++.  ....-+.+..||+.+|.|+-.-.  .+ ....|.+.|.|.|..|
T Consensus         1 Tys~~~~~~~~LP~gLs~d~~tG~isGtp~--~~~~~G~y~~~vtatd~~G   49 (49)
T PF05345_consen    1 TYSLTTPTGGGLPSGLSLDPSTGTISGTPT--SSVQPGTYTFTVTATDGSG   49 (49)
T ss_pred             CEEEEcCCCCCCCCcEEEeCCCCEEEeecC--CCccccEEEEEEEEEcCCC
Confidence            47761  12234578899999999987633  22 2368999999999753


No 24 
>TIGR01965 VCBS_repeat VCBS repeat. This domain of about 100 residues is found multiple (up to 35) copies in long proteins from several species of Vibrio, Colwellia, Bradyrhizobium, and Shewanella (hence the name VCBS) and in smaller copy numbers in proteins from several other bacteria. The large protein size and repeat copy numbers, species distribution, and suggested activities of several member proteins suggests a role for this domain in adhesion.
Probab=88.10  E-value=6  Score=26.44  Aligned_cols=65  Identities=25%  Similarity=0.336  Sum_probs=42.4

Q ss_pred             CcCceEEeCCceEEEEC--------CcCCcCCCCeEEEEEEEEeCCCCceEEEEEEEEeecCCCCCccCCCceEEEEecC
Q psy11061         12 RTNHFYMKSVSGEICIA--------QDLDFESRSSYEFPVVATDRGGLSTTAMVRIQITDVNDNEPIFNPVEYNVSLRDD   83 (212)
Q Consensus        12 ~~~~F~id~~tg~i~~~--------~~lD~E~~~~~~l~i~a~d~~~~~~~~~v~i~V~d~nd~~P~f~~~~~~~~v~e~   83 (212)
                      ..+.|.|++ +|.....        +.|.--+...-+|++.+.|.    .+..|+|.|...|| +|+..-. -...+.|+
T Consensus        25 ~yGtlti~~-~G~wtYtl~n~~~avq~L~~Ge~~tdsFtvtv~DG----tt~~vtItI~GtND-apvi~~~-~~g~v~ED   97 (99)
T TIGR01965        25 QYGTFSIDA-DGQWTYQADNSQTAVQALKAGETLTDTFTVTSADG----TSQTVTITITGAND-AAVIGGA-DTGSVTED   97 (99)
T ss_pred             CcEEEEECC-CCcEEEEeCCCcHHHHhhcCCCEEEEEEEEEEeCC----CeEEEEEEEEccCC-CCEEecc-cceeEecC
Confidence            356788888 7765533        22333334456788888885    38899999999999 7866432 23445544


No 25 
>TIGR00845 caca sodium/calcium exchanger 1. This model is specific for the eukaryotic sodium ion/calcium ion exchangers of the Caca family
Probab=86.93  E-value=20  Score=33.72  Aligned_cols=49  Identities=14%  Similarity=0.053  Sum_probs=31.3

Q ss_pred             cCCCCCccCCCceEEEEecCCCCCcEEEEEEEEeCCCCCCceEEEEEEcCC
Q psy11061         64 VNDNEPIFNPVEYNVSLRDDIQTTTAFAVVLATDRDSDRFGTISYKIVTGK  114 (212)
Q Consensus        64 ~nd~~P~f~~~~~~~~v~e~~~~g~~v~~~~a~D~D~~~~~~i~y~i~~~~  114 (212)
                      .||..+.+...+-...|.|+  .|+.-..+.-...|......+.|+..+|.
T Consensus       395 ~dd~~s~i~Fe~~~Y~V~En--~GtV~VtV~R~GGdl~~tVsVdY~T~DGT  443 (928)
T TIGR00845       395 ENDPVSKIFFEPGHYTCLEN--CGTVALTVVRRGGDLTNTVYVDYRTEDGT  443 (928)
T ss_pred             ccCCcceEEecCCeEEEeec--CcEEEEEEEEccCCCCceEEEEEEccCCc
Confidence            44545555555567778887  46666666665545544456889887764


No 26 
>TIGR03660 T1SS_rpt_143 T1SS-143 repeat domain. This model represents a domain of about 143 amino acids that may occur singly or in up to 23 tandem repeats in very large proteins in the genus Vibrio, and in related species such as Legionella pneumophila, Photobacterium profundum, Rhodopseudomonas palustris, Shewanella pealeana, and Aeromonas hydrophila. Proteins with these domains represent a subset of a broader set of proteins with a particular signal for type 1 secretion, consisting of several glycine-rich repeats modeled by pfam00353, followed by a C-terminal domain modeled by TIGR03661. Proteins with this domain tend to share several properties with the RtxA (Repeats in Toxin) protein of Vibrio cholerae, including a large size often containing tandemly repeated domains and a C-terminal signal for type 1 secretion.
Probab=86.25  E-value=5.2  Score=28.46  Aligned_cols=56  Identities=27%  Similarity=0.536  Sum_probs=39.1

Q ss_pred             EEEECCcCCcCC---CCeEEEEEEEEeCCCCceEEEEEEEEeecCCCCCccCCCceEEEEecC
Q psy11061         24 EICIAQDLDFES---RSSYEFPVVATDRGGLSTTAMVRIQITDVNDNEPIFNPVEYNVSLRDD   83 (212)
Q Consensus        24 ~i~~~~~lD~E~---~~~~~l~i~a~d~~~~~~~~~v~i~V~d~nd~~P~f~~~~~~~~v~e~   83 (212)
                      .+.+.++||...   .-...|.|.|+|..|-.....+.|.|.|  | .|...... ...|.|.
T Consensus        69 tftL~~~lDH~~g~d~l~l~~~v~a~D~DGD~s~~~l~VtI~D--D-~P~~~~~~-~~~V~E~  127 (137)
T TIGR03660        69 EFTLEGPLDHAAGSDELTLNFPIIATDFDGDTSSITLPVTIVD--D-VPTITDVD-ALTVDED  127 (137)
T ss_pred             EEEEcccccCCCCCceEEEeeeEEEEeCCCCccccEEEEEEEC--C-CCeecccc-ceEEecc
Confidence            355778888854   3467888999998876566788888887  5 57765543 3667664


No 27 
>PF07495 Y_Y_Y:  Y_Y_Y domain;  InterPro: IPR011123 This region is mostly found at the end of the beta propellers (IPR011110 from INTERPRO) in a family of two component regulators. However they are also found tandemly repeated in Q891H4 from SWISSPROT without other signal conduction domains being present. It is named after the conserved tyrosines found in the alignment. The exact function is not known.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=84.61  E-value=6.5  Score=23.50  Aligned_cols=53  Identities=23%  Similarity=0.251  Sum_probs=30.1

Q ss_pred             EEEEcCCCCCcCceEEeCCceEEEECCcCCcCCCCeEEEEEEEEeCCCCc--eEEEEEEEE
Q psy11061          3 NYTLGESPSRTNHFYMKSVSGEICIAQDLDFESRSSYEFPVVATDRGGLS--TTAMVRIQI   61 (212)
Q Consensus         3 ~Y~i~~~~~~~~~F~id~~tg~i~~~~~lD~E~~~~~~l~i~a~d~~~~~--~~~~v~i~V   61 (212)
                      .|+|...+  ..|..+...+-.+..+..    ....|+|.|+|.|..+..  ....+.|.|
T Consensus        11 ~Y~l~g~d--~~W~~~~~~~~~~~~~~L----~~G~Y~l~V~a~~~~~~~~~~~~~l~i~I   65 (66)
T PF07495_consen   11 RYRLEGFD--DEWITLGSYSNSISYTNL----PPGKYTLEVRAKDNNGKWSSDEKSLTITI   65 (66)
T ss_dssp             EEEEETTE--SSEEEESSTS-EEEEES------SEEEEEEEEEEETTS-B-SS-EEEEEEE
T ss_pred             EEEEECCC--CeEEECCCCcEEEEEEeC----CCEEEEEEEEEECCCCCcCcccEEEEEEE
Confidence            46564433  467666664424443322    357899999999987522  225666655


No 28 
>PF12245 Big_3_2:  Bacterial Ig-like domain (group 3);  InterPro: IPR022038  This family of proteins is found in bacteria. They have two conserved sequence motifs: AGN and GMT. 
Probab=82.53  E-value=7.5  Score=23.20  Aligned_cols=31  Identities=29%  Similarity=0.491  Sum_probs=24.4

Q ss_pred             CCeEEEEEEEEeCCCCceEEEEEEEEeecCC
Q psy11061         36 RSSYEFPVVATDRGGLSTTAMVRIQITDVND   66 (212)
Q Consensus        36 ~~~~~l~i~a~d~~~~~~~~~v~i~V~d~nd   66 (212)
                      ...|++.+.|.|.-|........+.+.|..-
T Consensus        22 dg~yt~~v~a~D~AGN~~~~~~~~~i~d~~~   52 (60)
T PF12245_consen   22 DGEYTLTVTATDKAGNTSSSTTQIVIVDNTA   52 (60)
T ss_pred             CccEEEEEEEEECCCCEEEeeeEEEEEcCCC
Confidence            5789999999999888777777776666543


No 29 
>PF05895 DUF859:  Siphovirus protein of unknown function (DUF859);  InterPro: IPR008577 This entry is represented by Streptococcus phage 7201, Orf39. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This family consists of several uncharacterised proteins from a number of the Siphoviruses as well as some bacterial proteins from Streptococcus species. Some of the members of this family are described as putative minor structural proteins.
Probab=79.71  E-value=47  Score=30.00  Aligned_cols=109  Identities=15%  Similarity=0.135  Sum_probs=61.0

Q ss_pred             CCeEEEEEEEEeCCCCc-eEEEEEEEEeecCCCCCccCCCceEEEEecCCCCCcEEEEEEE------EeCCCCCCceEEE
Q psy11061         36 RSSYEFPVVATDRGGLS-TTAMVRIQITDVNDNEPIFNPVEYNVSLRDDIQTTTAFAVVLA------TDRDSDRFGTISY  108 (212)
Q Consensus        36 ~~~~~l~i~a~d~~~~~-~~~~v~i~V~d~nd~~P~f~~~~~~~~v~e~~~~g~~v~~~~a------~D~D~~~~~~i~y  108 (212)
                      ....++++.|+|..|+. .....+|+|.+...  |.+....+...  +..  ........|      .+.-.....++.|
T Consensus       297 ~G~~Ti~atVtDSRGr~S~~~~~tItVl~Y~~--P~lsfsv~R~~--~~~--~~~~v~~~a~Iapl~v~g~qKN~~~lt~  370 (624)
T PF05895_consen  297 SGSATIRATVTDSRGRTSDPKTKTITVLEYSP--PTLSFSVYRCG--SSG--NTLTVTRNAKIAPLTVNGVQKNTMTLTF  370 (624)
T ss_pred             CceEEEEEEEEECCCccCCceEEEEEEEEcCC--CcEEEEEEEeC--CCC--cEEEEEEEEEEeEEEEcccccceEEEEE
Confidence            46899999999998864 57889999998754  76632211111  111  111111111      1221112225666


Q ss_pred             EEEcCCCCCcEEEeCC-------------ccEEEEeeecCCCCCCeEEEEEEEEECCC
Q psy11061        109 KIVTGKDANLFRIDRS-------------SGELFVTRGNFLSRSNSYHINVSAMDGGG  153 (212)
Q Consensus       109 ~i~~~~~~~~F~id~~-------------~G~i~~~~~~~ld~~~~~~l~v~a~D~~~  153 (212)
                      +...-+ ...|.+|..             .+...+...  ++....|.+++.++|.-.
T Consensus       371 ~~a~~g-t~~~t~d~~~a~~~~s~~s~~~~~~~~L~g~--y~~~kSy~V~~~l~D~F~  425 (624)
T PF05895_consen  371 KVAPLG-TGTFTTDNGSASGTWSSISELTNSSANLGGT--YDAEKSYDVRGTLSDKFT  425 (624)
T ss_pred             EEEEcC-cceEEEEccccccceeeeeeecccceeeccc--cCCCceEEEEEEEEEEee
Confidence            665422 245555521             112333344  668899999999999763


No 30 
>KOG4221|consensus
Probab=79.40  E-value=21  Score=34.54  Aligned_cols=56  Identities=21%  Similarity=0.359  Sum_probs=37.5

Q ss_pred             CcCceEEeCCceEEEECCcCCcCCCCeEEEEEEEEeCCCC-ceEEEEEE-EEeecCCCCCc
Q psy11061         12 RTNHFYMKSVSGEICIAQDLDFESRSSYEFPVVATDRGGL-STTAMVRI-QITDVNDNEPI   70 (212)
Q Consensus        12 ~~~~F~id~~tg~i~~~~~lD~E~~~~~~l~i~a~d~~~~-~~~~~v~i-~V~d~nd~~P~   70 (212)
                      .+.++.++.++-+.++.   +.|....|.+.+.|..+.|. .++..++| ...|+=+.+|.
T Consensus       563 ~~~~~~~~~n~~e~ti~---gL~k~TeY~~~vvA~N~~G~g~sS~~i~V~Tlsd~PsaPP~  620 (1381)
T KOG4221|consen  563 TGKELRVENNATEYTIN---GLEKYTEYSIRVVAYNSAGSGVSSADITVRTLSDVPSAPPQ  620 (1381)
T ss_pred             CCceEEEecCccEEEee---cCCCccceEEEEEEecCCCCCCCCCceEEEeccCCCCCCCc
Confidence            56899999887777776   34566789999999998652 23344444 34455444443


No 31 
>KOG3597|consensus
Probab=76.38  E-value=49  Score=28.47  Aligned_cols=97  Identities=18%  Similarity=0.256  Sum_probs=61.6

Q ss_pred             eEEEEEEEEeecCCCCCccCCCceEEEEecCCCCCcEEEEEEEEeCCCCCCceEEEEEEcCCC----CCcEEEeC-----
Q psy11061         53 TTAMVRIQITDVNDNEPIFNPVEYNVSLRDDIQTTTAFAVVLATDRDSDRFGTISYKIVTGKD----ANLFRIDR-----  123 (212)
Q Consensus        53 ~~~~v~i~V~d~nd~~P~f~~~~~~~~v~e~~~~g~~v~~~~a~D~D~~~~~~i~y~i~~~~~----~~~F~id~-----  123 (212)
                      .+....|.|.-+||.+..+....+.+.+.|+...-.....+.+.|+|.++.. +.|++.....    .+.|..-.     
T Consensus        24 ~~~~~~i~v~pvndpp~~~~~~~~~l~~~~~~~k~l~~~~l~~~d~d~~~~~-l~f~v~~t~~~~~~~~~~~~~g~~~~~  102 (442)
T KOG3597|consen   24 QTDVLRIHVNPVNDPPSLIFPSGSLLVILEGGQKVLDPELLTAADPDSAPLP-LEFQVLGTSSVPLPVLKFDVPGAPATE  102 (442)
T ss_pred             EEeeecccccccCCCcceeecccceEEeecCCceeccceEeeccCCCCCccc-eEEEEccCCCCCCccceeeccCCcccc
Confidence            5566789999999977766666667888888665455567888898887543 7777765311    11233321     


Q ss_pred             ------CccEEEEeeecCCCCCCeEEEEEEEEECC
Q psy11061        124 ------SSGELFVTRGNFLSRSNSYHINVSAMDGG  152 (212)
Q Consensus       124 ------~~G~i~~~~~~~ld~~~~~~l~v~a~D~~  152 (212)
                            ..|.+.+...+.  ......++..++|+-
T Consensus       103 Fs~~~v~~g~~~yvh~g~--el~~~~~~~~~SDg~  135 (442)
T KOG3597|consen  103 FSYEEVEDGSLSYVHSGT--ELRESELQLRVSDGL  135 (442)
T ss_pred             eEehHhhcCceeEEecCc--ccccceEEEEeecce
Confidence                  234554444321  456777888888854


No 32 
>TIGR00845 caca sodium/calcium exchanger 1. This model is specific for the eukaryotic sodium ion/calcium ion exchangers of the Caca family
Probab=75.50  E-value=75  Score=30.15  Aligned_cols=56  Identities=23%  Similarity=0.282  Sum_probs=32.9

Q ss_pred             EEEEEEEeecCCCCCccCCCceEEEEecCCCCCcEEEEEEEEeCCCCCCceEEEEEEcCC
Q psy11061         55 AMVRIQITDVNDNEPIFNPVEYNVSLRDDIQTTTAFAVVLATDRDSDRFGTISYKIVTGK  114 (212)
Q Consensus        55 ~~v~i~V~d~nd~~P~f~~~~~~~~v~e~~~~g~~v~~~~a~D~D~~~~~~i~y~i~~~~  114 (212)
                      ...+|+|.| ||++|.|....-...|.|+.  |..-..+.-+-. ....-.+.|+...|.
T Consensus       515 s~ATVTIlD-DD~aGIfsFe~~~~sV~Es~--G~vtvtV~RtsG-a~G~VtV~Y~T~dGT  570 (928)
T TIGR00845       515 NTATVTILD-DDHAGIFTFEEDVFHVSESI--GIMEVKVLRTSG-ARGTVIVPYRTVEGT  570 (928)
T ss_pred             ceEEEEEec-CcccCcccccCceEEEEcCC--CEEEEEEEEcCC-CCeeEEEEEEeecCc
Confidence            355666676 77788877666677888873  444444433322 111235678777664


No 33 
>cd02848 Chitinase_N_term Chitinase N-terminus domain. Chitinases hydrolyze the abundant natural biopolymer chitin, producing smaller chito-oligosaccharides. Chitin consists of multiple N-acetyl-D-glucosamine (NAG) residues connected via beta-1,4-glycosidic linkages and is an important structural element of fungal cell wall and arthropod exoskeletons. On the basis of the mode of chitin hydrolysis, chitinases are classified as random, endo-, and exo-chitinases and based on sequence criteria, chitinases belong to families 18 and 19 of glycosyl hydrolases.  The N-terminus of chitinase may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at  either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitob
Probab=75.49  E-value=6.1  Score=26.67  Aligned_cols=33  Identities=24%  Similarity=0.382  Sum_probs=27.1

Q ss_pred             CCcCCCCeEEEEEEEEeCCCCceEEEEEEEEee
Q psy11061         31 LDFESRSSYEFPVVATDRGGLSTTAMVRIQITD   63 (212)
Q Consensus        31 lD~E~~~~~~l~i~a~d~~~~~~~~~v~i~V~d   63 (212)
                      +++.+...|.+.|++.|..|.+.+..+.|.|-|
T Consensus        74 ~~v~kgG~y~m~V~lCn~dGCS~S~~~~I~VAD  106 (106)
T cd02848          74 FKVGKGGRYQMQVALCNGDGCSTSAAKEIVVAD  106 (106)
T ss_pred             EEeCCCCeEEEEEEEECCCCccCcCCEEEEecC
Confidence            556677899999999999888888888887765


No 34 
>PF13754 Big_3_4:  Bacterial Ig-like domain (group 3)
Probab=75.01  E-value=14  Score=21.46  Aligned_cols=19  Identities=37%  Similarity=0.646  Sum_probs=16.4

Q ss_pred             CCCeEEEEEEEEECCCCCe
Q psy11061        138 RSNSYHINVSAMDGGGNKC  156 (212)
Q Consensus       138 ~~~~~~l~v~a~D~~~~~~  156 (212)
                      ..+.|.+.+.|+|..|...
T Consensus        22 ~dG~y~itv~a~D~AGN~s   40 (54)
T PF13754_consen   22 ADGTYTITVTATDAAGNTS   40 (54)
T ss_pred             CCccEEEEEEEEeCCCCCC
Confidence            6788999999999987764


No 35 
>PF08329 ChitinaseA_N:  Chitinase A, N-terminal domain;  InterPro: IPR013540 This domain is found in a number of bacterial chitinases and similar viral proteins. It is organised into a fibronectin III module domain-like fold, comprising only beta strands. Its function is not known, but it may be involved in interaction with the enzyme substrate, chitin [, ]. It is separated by a hinge region from the catalytic domain (IPR001223 from INTERPRO); this hinge region is probably mobile, allowing the N-terminal domain to have different relative positions in solution []. ; GO: 0004568 chitinase activity; PDB: 2WLY_A 1EDQ_A 2WM0_A 1X6N_A 1NH6_A 2WK2_A 1EHN_A 2WLZ_A 1EIB_A 1FFR_A ....
Probab=74.44  E-value=28  Score=24.64  Aligned_cols=32  Identities=22%  Similarity=0.531  Sum_probs=23.0

Q ss_pred             CCCeEEEEEEEEECCCCCeeeeEEEEEEEeeCCC
Q psy11061        138 RSNSYHINVSAMDGGGNKCSQDAQVNINIINSHM  171 (212)
Q Consensus       138 ~~~~~~l~v~a~D~~~~~~~~~~~v~I~v~d~nd  171 (212)
                      ..+.|.++|+++|..|  ++....+.|.|.|...
T Consensus        81 ~gG~y~~~VeLCN~~G--CS~S~~~~V~VaDTDG  112 (133)
T PF08329_consen   81 KGGRYQMQVELCNADG--CSTSAPVEVVVADTDG  112 (133)
T ss_dssp             S-EEEEEEEEEEETTE--EEE---EEEEEE-TTS
T ss_pred             CCCEEEEEEEEECCCC--cccCCCEEEEEeCCCc
Confidence            7899999999999887  5666788888888764


No 36 
>PF12245 Big_3_2:  Bacterial Ig-like domain (group 3);  InterPro: IPR022038  This family of proteins is found in bacteria. They have two conserved sequence motifs: AGN and GMT. 
Probab=73.11  E-value=17  Score=21.66  Aligned_cols=31  Identities=26%  Similarity=0.503  Sum_probs=21.7

Q ss_pred             CCCeEEEEEEEEECCCCCeeeeEEEEEEEeeCC
Q psy11061        138 RSNSYHINVSAMDGGGNKCSQDAQVNINIINSH  170 (212)
Q Consensus       138 ~~~~~~l~v~a~D~~~~~~~~~~~v~I~v~d~n  170 (212)
                      ..+.|.+.+.|+|..|...  .....+.+.|..
T Consensus        21 ~dg~yt~~v~a~D~AGN~~--~~~~~~~i~d~~   51 (60)
T PF12245_consen   21 ADGEYTLTVTATDKAGNTS--SSTTQIVIVDNT   51 (60)
T ss_pred             CCccEEEEEEEEECCCCEE--EeeeEEEEEcCC
Confidence            3678999999999998763  344455555543


No 37 
>PF03160 Calx-beta:  Calx-beta domain;  InterPro: IPR003644 The calx-beta motif is present as a tandem repeat in the cytoplasmic domains of Calx Na-Ca exchangers, which are used to expel calcium from cells. This motif overlaps domains used for calcium binding and regulation. The calx-beta motif is also present in the cytoplasmic tail of mammalian integrin-beta4, which mediates the bi-directional transfer of signals across the plasma membrane, as well as in some cyanobacterial proteins. This motif contains a series of beta-strands and turns that form a self-contained beta-sheet [, ].; GO: 0007154 cell communication, 0016021 integral to membrane; PDB: 3H6A_B 3FSO_A 3FQ4_B 2DPK_A 2QVM_A 3GIN_B 2QVK_A 2FWU_A 2FWS_A 3E9U_A ....
Probab=69.45  E-value=29  Score=22.68  Aligned_cols=53  Identities=21%  Similarity=0.266  Sum_probs=30.0

Q ss_pred             EEEEeecCCCCCccCCCceEEEEecCCCCCcEEEEEEEEeCCCCCCceEEEEEEcCC
Q psy11061         58 RIQITDVNDNEPIFNPVEYNVSLRDDIQTTTAFAVVLATDRDSDRFGTISYKIVTGK  114 (212)
Q Consensus        58 ~i~V~d~nd~~P~f~~~~~~~~v~e~~~~g~~v~~~~a~D~D~~~~~~i~y~i~~~~  114 (212)
                      +|.|.| ||. |.+....-...+.|+.  |.....+....++....-.+.|....+.
T Consensus         2 tvtI~d-~d~-~~v~f~~~~~~v~E~~--~~~~v~V~~~~~~~~~~v~v~~~~~~gt   54 (100)
T PF03160_consen    2 TVTILD-DDD-PTVSFSSPSYTVSEGD--GTVTVTVTRSGGSLDGPVTVNYSTVDGT   54 (100)
T ss_dssp             EEEEE--TTS-EEEEESSSEEEEETTS--SEEEEEEEEESS-TSSEEEEEEEEEESS
T ss_pred             EEEEEC-CCC-CEEEEeCCEEEEEeCC--CEEEEEEEEcccCCCcceEEEEEEeCCc
Confidence            466777 664 4766555667788874  3444555555443233345677776653


No 38 
>PF13753 SWM_repeat:  Putative flagellar system-associated repeat
Probab=68.70  E-value=62  Score=26.27  Aligned_cols=110  Identities=25%  Similarity=0.319  Sum_probs=58.3

Q ss_pred             CCeEEEEEEEEeCCCCceEEEEEEEEeecCCCCCccCCCceEEEEecCC------CCCcEEEEEEEEeCCCCCCceEEEE
Q psy11061         36 RSSYEFPVVATDRGGLSTTAMVRIQITDVNDNEPIFNPVEYNVSLRDDI------QTTTAFAVVLATDRDSDRFGTISYK  109 (212)
Q Consensus        36 ~~~~~l~i~a~d~~~~~~~~~v~i~V~d~nd~~P~f~~~~~~~~v~e~~------~~g~~v~~~~a~D~D~~~~~~i~y~  109 (212)
                      ...|.+.+.+.|..|........+.|--   .+|...-.    .+..+.      ......+...+.+.+.  +..+.+.
T Consensus        11 d~~~~v~vt~tD~aGN~~~~t~~~~vDt---~~P~v~i~----~~~~~~~~~~~~~~~~~t~s~tvs~~~~--g~~v~v~   81 (317)
T PF13753_consen   11 DGTYTVSVTVTDAAGNTSTATQSITVDT---TAPTVTIT----SIADDDIINGDEATNTVTFSGTVSGAEP--GSTVTVT   81 (317)
T ss_pred             CCcEEEEEEEEeCCCCeeeeeEEEEEec---CCCceeee----cccCCCccccceeeeeeEEEEEecCCCC--CCEEEEE
Confidence            4679999999999887666655555332   26644322    111111      1122234444444333  3345555


Q ss_pred             EEcCCCCCcEEEeCCccEEEEeeecCCC-CCCeEEEEEE-EEECCCCCee
Q psy11061        110 IVTGKDANLFRIDRSSGELFVTRGNFLS-RSNSYHINVS-AMDGGGNKCS  157 (212)
Q Consensus       110 i~~~~~~~~F~id~~~G~i~~~~~~~ld-~~~~~~l~v~-a~D~~~~~~~  157 (212)
                      + .+.. ..+..+ ..|.+.......-. ..+.|.+.+. ++|..|....
T Consensus        82 ~-~g~~-~t~~~~-~~G~ws~t~~~~~~l~~g~~ti~v~~~tD~aGN~~t  128 (317)
T PF13753_consen   82 I-NGTT-GTLTAD-ADGNWSVTVTPSDDLPDGDYTITVTTVTDAAGNTST  128 (317)
T ss_pred             E-CCEE-EEEEEe-cCCcEEEeeccccccccCcceeEEEEEEccCCcccc
Confidence            5 2221 333343 45655444431002 5678999999 9998876543


No 39 
>cd00146 PKD polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.
Probab=68.68  E-value=15  Score=22.85  Aligned_cols=30  Identities=17%  Similarity=0.289  Sum_probs=22.3

Q ss_pred             CCcCCCCeEEEEEEEEeCCCCceEEEEEEE
Q psy11061         31 LDFESRSSYEFPVVATDRGGLSTTAMVRIQ   60 (212)
Q Consensus        31 lD~E~~~~~~l~i~a~d~~~~~~~~~v~i~   60 (212)
                      ..|.....|.+++.|.|..+.+......|.
T Consensus        51 ~~y~~~G~y~v~l~v~d~~g~~~~~~~~V~   80 (81)
T cd00146          51 HTYTKPGTYTVTLTVTNAVGSSSTKTTTVV   80 (81)
T ss_pred             EEcCCCcEEEEEEEEEeCCCCEEEEEEEEE
Confidence            446677899999999998776665565554


No 40 
>TIGR03660 T1SS_rpt_143 T1SS-143 repeat domain. This model represents a domain of about 143 amino acids that may occur singly or in up to 23 tandem repeats in very large proteins in the genus Vibrio, and in related species such as Legionella pneumophila, Photobacterium profundum, Rhodopseudomonas palustris, Shewanella pealeana, and Aeromonas hydrophila. Proteins with these domains represent a subset of a broader set of proteins with a particular signal for type 1 secretion, consisting of several glycine-rich repeats modeled by pfam00353, followed by a C-terminal domain modeled by TIGR03661. Proteins with this domain tend to share several properties with the RtxA (Repeats in Toxin) protein of Vibrio cholerae, including a large size often containing tandemly repeated domains and a C-terminal signal for type 1 secretion.
Probab=66.12  E-value=45  Score=23.71  Aligned_cols=43  Identities=19%  Similarity=0.362  Sum_probs=29.5

Q ss_pred             eEEEEEEEEECCCCCeeeeEEEEEEEeeCCCCCCeEecCceEEEEeCCC
Q psy11061        141 SYHINVSAMDGGGNKCSQDAQVNINIINSHMPIPLFQQSTYSFVVPEDV  189 (212)
Q Consensus       141 ~~~l~v~a~D~~~~~~~~~~~v~I~v~d~nd~~P~f~~~~~~~~v~e~~  189 (212)
                      ...|.|.|+|..+...  ...+.|+|.|  | .|...... ..+|.|+.
T Consensus        86 ~l~~~v~a~D~DGD~s--~~~l~VtI~D--D-~P~~~~~~-~~~V~E~~  128 (137)
T TIGR03660        86 TLNFPIIATDFDGDTS--SITLPVTIVD--D-VPTITDVD-ALTVDEDD  128 (137)
T ss_pred             EEeeeEEEEeCCCCcc--ccEEEEEEEC--C-CCeecccc-ceEEeccc
Confidence            4567899999887653  4578888887  3 47765544 36777753


No 41 
>PF02494 HYR:  HYR domain;  InterPro: IPR003410 This domain is known as the HYR (Hyalin Repeat) domain, after the protein hyalin that is composed exclusively of this repeat. This domain probably corresponds to a new superfamily in the immunoglobulin fold. The function of this domain is uncertain it may be involved in cell adhesion. In the Sushi repeat-containing protein (SrpX), this domain is found between two sushi repeats.
Probab=65.19  E-value=16  Score=22.94  Aligned_cols=26  Identities=23%  Similarity=0.266  Sum_probs=22.4

Q ss_pred             CCeEEEEEEEEeCCCCceEEEEEEEE
Q psy11061         36 RSSYEFPVVATDRGGLSTTAMVRIQI   61 (212)
Q Consensus        36 ~~~~~l~i~a~d~~~~~~~~~v~i~V   61 (212)
                      ...+.++..|.|..|+.+.+.+.|.|
T Consensus        56 ~G~t~V~ytA~D~~GN~a~C~f~V~V   81 (81)
T PF02494_consen   56 VGTTTVTYTATDAAGNSATCSFTVTV   81 (81)
T ss_pred             eceEEEEEEEEECCCCEEEEEEEEEC
Confidence            45899999999998888899988875


No 42 
>cd02848 Chitinase_N_term Chitinase N-terminus domain. Chitinases hydrolyze the abundant natural biopolymer chitin, producing smaller chito-oligosaccharides. Chitin consists of multiple N-acetyl-D-glucosamine (NAG) residues connected via beta-1,4-glycosidic linkages and is an important structural element of fungal cell wall and arthropod exoskeletons. On the basis of the mode of chitin hydrolysis, chitinases are classified as random, endo-, and exo-chitinases and based on sequence criteria, chitinases belong to families 18 and 19 of glycosyl hydrolases.  The N-terminus of chitinase may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at  either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitob
Probab=65.05  E-value=27  Score=23.62  Aligned_cols=29  Identities=28%  Similarity=0.636  Sum_probs=21.1

Q ss_pred             CCCeEEEEEEEEECCCCCeeeeEEEEEEEee
Q psy11061        138 RSNSYHINVSAMDGGGNKCSQDAQVNINIIN  168 (212)
Q Consensus       138 ~~~~~~l~v~a~D~~~~~~~~~~~v~I~v~d  168 (212)
                      +.+.|.++|.++|..+  ++..+.+.|.|-|
T Consensus        78 kgG~y~m~V~lCn~dG--CS~S~~~~I~VAD  106 (106)
T cd02848          78 KGGRYQMQVALCNGDG--CSTSAAKEIVVAD  106 (106)
T ss_pred             CCCeEEEEEEEECCCC--ccCcCCEEEEecC
Confidence            6899999999999877  3444556665543


No 43 
>smart00089 PKD Repeats in polycystic kidney disease 1 (PKD1) and other proteins. Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.
Probab=62.64  E-value=26  Score=21.64  Aligned_cols=30  Identities=20%  Similarity=0.395  Sum_probs=22.5

Q ss_pred             CCcCCCCeEEEEEEEEeCCCCceEEEEEEEE
Q psy11061         31 LDFESRSSYEFPVVATDRGGLSTTAMVRIQI   61 (212)
Q Consensus        31 lD~E~~~~~~l~i~a~d~~~~~~~~~v~i~V   61 (212)
                      .-|+....|.+++.+.|..| +.+..+.|.|
T Consensus        49 ~~y~~~G~y~v~l~v~n~~g-~~~~~~~i~v   78 (79)
T smart00089       49 HTYTKPGTYTVTLTVTNAVG-SASATVTVVV   78 (79)
T ss_pred             EEeCCCcEEEEEEEEEcCCC-cEEEEEEEEE
Confidence            45566789999999999877 5566666655


No 44 
>PF13750 Big_3_3:  Bacterial Ig-like domain (group 3)
Probab=62.37  E-value=25  Score=25.60  Aligned_cols=30  Identities=30%  Similarity=0.478  Sum_probs=25.3

Q ss_pred             cCCCCeEEEEEEEEeCCCCceEEEEEEEEe
Q psy11061         33 FESRSSYEFPVVATDRGGLSTTAMVRIQIT   62 (212)
Q Consensus        33 ~E~~~~~~l~i~a~d~~~~~~~~~v~i~V~   62 (212)
                      .|..+.|+|++.|.|..|...+..+.+...
T Consensus       119 le~~~~YtLtV~a~D~aGN~~~~si~F~y~  148 (158)
T PF13750_consen  119 LEADDSYTLTVSATDKAGNQSTKSISFSYM  148 (158)
T ss_pred             cCCCCeEEEEEEEEecCCCEEEEEEEEEEe
Confidence            367789999999999998888888877665


No 45 
>smart00089 PKD Repeats in polycystic kidney disease 1 (PKD1) and other proteins. Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.
Probab=61.24  E-value=25  Score=21.67  Aligned_cols=28  Identities=18%  Similarity=0.468  Sum_probs=20.7

Q ss_pred             CCCCCeEEEEEEEEECCCCCeeeeEEEEEEE
Q psy11061        136 LSRSNSYHINVSAMDGGGNKCSQDAQVNINI  166 (212)
Q Consensus       136 ld~~~~~~l~v~a~D~~~~~~~~~~~v~I~v  166 (212)
                      +...+.|.+++.++|..+   +..+++.|.|
T Consensus        51 y~~~G~y~v~l~v~n~~g---~~~~~~~i~v   78 (79)
T smart00089       51 YTKPGTYTVTLTVTNAVG---SASATVTVVV   78 (79)
T ss_pred             eCCCcEEEEEEEEEcCCC---cEEEEEEEEE
Confidence            337799999999999876   3455666655


No 46 
>cd00146 PKD polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.
Probab=56.95  E-value=26  Score=21.75  Aligned_cols=20  Identities=10%  Similarity=0.458  Sum_probs=16.0

Q ss_pred             CCCCCeEEEEEEEEECCCCC
Q psy11061        136 LSRSNSYHINVSAMDGGGNK  155 (212)
Q Consensus       136 ld~~~~~~l~v~a~D~~~~~  155 (212)
                      +...+.|.+++.|+|..+..
T Consensus        53 y~~~G~y~v~l~v~d~~g~~   72 (81)
T cd00146          53 YTKPGTYTVTLTVTNAVGSS   72 (81)
T ss_pred             cCCCcEEEEEEEEEeCCCCE
Confidence            44789999999999986543


No 47 
>KOG4221|consensus
Probab=56.77  E-value=2.1e+02  Score=28.28  Aligned_cols=67  Identities=19%  Similarity=0.335  Sum_probs=46.9

Q ss_pred             EEEEEEc--CCCCCcEEEeCCccEEEEeeecCCCCCCeEEEEEEEEECCCCCeeeeEEEEEEEeeCCCCCCe
Q psy11061        106 ISYKIVT--GKDANLFRIDRSSGELFVTRGNFLSRSNSYHINVSAMDGGGNKCSQDAQVNINIINSHMPIPL  175 (212)
Q Consensus       106 i~y~i~~--~~~~~~F~id~~~G~i~~~~~~~ld~~~~~~l~v~a~D~~~~~~~~~~~v~I~v~d~nd~~P~  175 (212)
                      ..|++..  ++....+.++.++-+.++. .  |++...|.+.|.|.+..|...++...-.++..|+-+.||.
T Consensus       552 ~~yk~~ys~~~~~~~~~~~~n~~e~ti~-g--L~k~TeY~~~vvA~N~~G~g~sS~~i~V~Tlsd~PsaPP~  620 (1381)
T KOG4221|consen  552 TGYKLFYSEDDTGKELRVENNATEYTIN-G--LEKYTEYSIRVVAYNSAGSGVSSADITVRTLSDVPSAPPQ  620 (1381)
T ss_pred             eEEEEEEEcCCCCceEEEecCccEEEee-c--CCCccceEEEEEEecCCCCCCCCCceEEEeccCCCCCCCc
Confidence            3454433  2455688888766666555 3  7789999999999998887655444445567788888887


No 48 
>PF08329 ChitinaseA_N:  Chitinase A, N-terminal domain;  InterPro: IPR013540 This domain is found in a number of bacterial chitinases and similar viral proteins. It is organised into a fibronectin III module domain-like fold, comprising only beta strands. Its function is not known, but it may be involved in interaction with the enzyme substrate, chitin [, ]. It is separated by a hinge region from the catalytic domain (IPR001223 from INTERPRO); this hinge region is probably mobile, allowing the N-terminal domain to have different relative positions in solution []. ; GO: 0004568 chitinase activity; PDB: 2WLY_A 1EDQ_A 2WM0_A 1X6N_A 1NH6_A 2WK2_A 1EHN_A 2WLZ_A 1EIB_A 1FFR_A ....
Probab=47.38  E-value=25  Score=24.86  Aligned_cols=37  Identities=22%  Similarity=0.372  Sum_probs=25.8

Q ss_pred             cCCcCCCCeEEEEEEEEeCCCCceEEEEEEEEeecCC
Q psy11061         30 DLDFESRSSYEFPVVATDRGGLSTTAMVRIQITDVND   66 (212)
Q Consensus        30 ~lD~E~~~~~~l~i~a~d~~~~~~~~~v~i~V~d~nd   66 (212)
                      .+.......|++.+++.+..|.+.+..+.|.|.|..-
T Consensus        76 ~~~~~~gG~y~~~VeLCN~~GCS~S~~~~V~VaDTDG  112 (133)
T PF08329_consen   76 TFTVTKGGRYQMQVELCNADGCSTSAPVEVVVADTDG  112 (133)
T ss_dssp             EEEE-S-EEEEEEEEEEETTEEEE---EEEEEE-TTS
T ss_pred             EEEecCCCEEEEEEEEECCCCcccCCCEEEEEeCCCc
Confidence            3555567899999999999888888899999998765


No 49 
>TIGR00864 PCC polycystin cation channel protein. Note: this model has been restricted to the amino half because for technical reasons.
Probab=39.91  E-value=5.4e+02  Score=28.16  Aligned_cols=30  Identities=17%  Similarity=0.390  Sum_probs=21.1

Q ss_pred             CcCCCCeEEEEEEEEeCCCCceEEEEEEEEe
Q psy11061         32 DFESRSSYEFPVVATDRGGLSTTAMVRIQIT   62 (212)
Q Consensus        32 D~E~~~~~~l~i~a~d~~~~~~~~~v~i~V~   62 (212)
                      -|.....|.+++.|.+..|. .+..+.|.|.
T Consensus      1479 tY~~~GtYtVtLTvtN~~Gs-st~T~~VtV~ 1508 (2740)
T TIGR00864      1479 AFNSPGDFNIRLAAANEVGK-NEATLNVAVK 1508 (2740)
T ss_pred             EcCCCceEEEEEEEECCCCc-eEEEEEEEEe
Confidence            45567789999999998764 3445566665


No 50 
>COG4288 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=33.62  E-value=1.5e+02  Score=20.06  Aligned_cols=60  Identities=13%  Similarity=0.294  Sum_probs=36.5

Q ss_pred             CCeEEEEEEEEeCCC-------CceEEEEEEEEeecCCCCCccCCCceEEEEecCCCCCcEEEEEEEEeC
Q psy11061         36 RSSYEFPVVATDRGG-------LSTTAMVRIQITDVNDNEPIFNPVEYNVSLRDDIQTTTAFAVVLATDR   98 (212)
Q Consensus        36 ~~~~~l~i~a~d~~~-------~~~~~~v~i~V~d~nd~~P~f~~~~~~~~v~e~~~~g~~v~~~~a~D~   98 (212)
                      .....+++.-.|+.+       ..+..++++...|.+|-.+.-....|..-+..+.+   .|.+++--|.
T Consensus        42 ~AkvrI~i~fsDr~pvgpyevevaarrt~hlRfndL~dpe~iP~d~~yasviesnvP---vVvQ~tRLDs  108 (124)
T COG4288          42 PAKVRITILFSDREPVGPYEVEVAARRTLHLRFNDLGDPEAIPKDTPYASVIESNVP---VVVQLTRLDS  108 (124)
T ss_pred             CceEEEEEEeccCCCCCceEEEeecceeEEEEecccCCcccCCCCCchhhheecCCc---eEEEEEEecc
Confidence            345556666666542       13456788889999986666666667776666654   3444444443


No 51 
>KOG3597|consensus
Probab=27.99  E-value=1e+02  Score=26.61  Aligned_cols=50  Identities=24%  Similarity=0.133  Sum_probs=35.3

Q ss_pred             eeEEEEEEEeeCCCCCCeEecCceEEEEeCCCCCCeEEEEEEEEeCCCCC
Q psy11061        158 QDAQVNINIINSHMPIPLFQQSTYSFVVPEDVFKNSIVGTIKAATSDSGK  207 (212)
Q Consensus       158 ~~~~v~I~v~d~nd~~P~f~~~~~~~~v~e~~~~g~~v~~v~a~D~D~~~  207 (212)
                      .+....|.|..+||.+..+....+.+.+.|+...=.-...+.+.|+|...
T Consensus        24 ~~~~~~i~v~pvndpp~~~~~~~~~l~~~~~~~k~l~~~~l~~~d~d~~~   73 (442)
T KOG3597|consen   24 QTDVLRIHVNPVNDPPSLIFPSGSLLVILEGGQKVLDPELLTAADPDSAP   73 (442)
T ss_pred             EEeeecccccccCCCcceeecccceEEeecCCceeccceEeeccCCCCCc
Confidence            35668899999998877777666667777765444444677788888654


No 52 
>PF14157 YmzC:  YmzC-like protein; PDB: 3KVP_E.
Probab=27.80  E-value=44  Score=20.18  Aligned_cols=18  Identities=17%  Similarity=0.300  Sum_probs=13.1

Q ss_pred             cCceEEeCCceEEEECCc
Q psy11061         13 TNHFYMKSVSGEICIAQD   30 (212)
Q Consensus        13 ~~~F~id~~tg~i~~~~~   30 (212)
                      -..|..|+.+++|.+.+.
T Consensus        41 iKIfkyd~~tNei~L~KE   58 (63)
T PF14157_consen   41 IKIFKYDEDTNEITLKKE   58 (63)
T ss_dssp             EEEEEEETTTTEEEEEEE
T ss_pred             EEEEEeCCCCCeEEEEEe
Confidence            457777888888877654


No 53 
>PF13860 FlgD_ig:  FlgD Ig-like domain; PDB: 3C12_A 3OSV_A.
Probab=26.49  E-value=1.7e+02  Score=18.29  Aligned_cols=15  Identities=40%  Similarity=0.775  Sum_probs=11.3

Q ss_pred             CCCeEEEEEEEEECC
Q psy11061        138 RSNSYHINVSAMDGG  152 (212)
Q Consensus       138 ~~~~~~l~v~a~D~~  152 (212)
                      ..+.|.+.|.++|.+
T Consensus        67 ~~G~Y~~~v~a~~~g   81 (81)
T PF13860_consen   67 PDGTYTFRVTATDGG   81 (81)
T ss_dssp             -SEEEEEEEEEEET-
T ss_pred             CCCCEEEEEEEEeCC
Confidence            578899999998864


No 54 
>COG1470 Predicted membrane protein [Function unknown]
Probab=24.98  E-value=4.8e+02  Score=22.93  Aligned_cols=72  Identities=19%  Similarity=0.280  Sum_probs=42.3

Q ss_pred             ccEEEEeeecCCCCCCeEEEEEEEEECCCCCeeeeEEEEEEEeeCCC--------CCCeEecC-----ceEEEEeCCCCC
Q psy11061        125 SGELFVTRGNFLSRSNSYHINVSAMDGGGNKCSQDAQVNINIINSHM--------PIPLFQQS-----TYSFVVPEDVFK  191 (212)
Q Consensus       125 ~G~i~~~~~~~ld~~~~~~l~v~a~D~~~~~~~~~~~v~I~v~d~nd--------~~P~f~~~-----~~~~~v~e~~~~  191 (212)
                      +|.++++..    ....-.+.+...+.|+.+.   .-|.+.|.+..+        -.|...+.     ..+..+++++..
T Consensus       386 ~g~~~lt~t----aGee~~i~i~I~NsGna~L---tdIkl~v~~PqgWei~Vd~~~I~sL~pge~~tV~ltI~vP~~a~a  458 (513)
T COG1470         386 NGPYRLTIT----AGEEKTIRISIENSGNAPL---TDIKLTVNGPQGWEIEVDESTIPSLEPGESKTVSLTITVPEDAGA  458 (513)
T ss_pred             CCcEEEEec----CCccceEEEEEEecCCCcc---ceeeEEecCCccceEEECcccccccCCCCcceEEEEEEcCCCCCC
Confidence            344444433    4455666777777765443   345555543322        22333222     346789999999


Q ss_pred             CeEEEEEEEEeC
Q psy11061        192 NSIVGTIKAATS  203 (212)
Q Consensus       192 g~~v~~v~a~D~  203 (212)
                      |....+++|+-.
T Consensus       459 GdY~i~i~~ksD  470 (513)
T COG1470         459 GDYRITITAKSD  470 (513)
T ss_pred             CcEEEEEEEeec
Confidence            999999988643


No 55 
>cd04046 C2_Calpain C2 domain present in Calpain proteins. A single C2 domain is found in calpains (EC 3.4.22.52, EC 3.4.22.53), calcium-dependent, non-lysosomal cysteine proteases.  Caplains are classified as belonging to Clan CA by MEROPS and include six families: C1, C2, C10, C12, C28, and C47.  C2 domains fold into an 8-standed beta-sandwich that can adopt 2 structural arrangements: Type I and Type II, distinguished by a circular permutation involving their N- and C-terminal beta strands. Many C2 domains are Ca2+-dependent membrane-targeting modules that bind a wide variety of substances including bind phospholipids, inositol polyphosphates, and intracellular proteins.  Most C2 domain proteins are either signal transduction enzymes that contain a single C2 domain, such as protein kinase C, or membrane trafficking proteins which contain at least two C2 domains, such as synaptotagmin 1.  However, there are a few exceptions to this including RIM isoforms and some splice variants of pic
Probab=24.72  E-value=2.3e+02  Score=19.22  Aligned_cols=17  Identities=6%  Similarity=0.102  Sum_probs=13.0

Q ss_pred             eCCCCCCeEEEEEEEEe
Q psy11061        186 PEDVFKNSIVGTIKAAT  202 (212)
Q Consensus       186 ~e~~~~g~~v~~v~a~D  202 (212)
                      ..+...|++..++.+.|
T Consensus       107 ~~~~~~G~i~~~~~~~~  123 (126)
T cd04046         107 AAGEVPGTISVKVTSSD  123 (126)
T ss_pred             CCCCCCCEEEEEEEEcc
Confidence            56777888888887776


No 56 
>PF09100 Qn_am_d_aIV:  Quinohemoprotein amine dehydrogenase, alpha subunit domain IV;  InterPro: IPR015184 This domain is predominantly found in the prokaryotic protein quinohemoprotein amine dehydrogenase, adopting an immunoglobulin-like beta-sandwich fold, with seven strands arranged into two beta sheets; the fold is possibly related to the immunoglobulin and/or fibronectin type III superfamilies. The precise function of this domain has not, as yet, been defined []. ; PDB: 1JMZ_A 1JMX_A 1PBY_A 1JJU_A.
Probab=23.30  E-value=1.3e+02  Score=21.08  Aligned_cols=32  Identities=19%  Similarity=0.324  Sum_probs=16.4

Q ss_pred             CeEEEEEEEEeCC-CCceEEEEEEEEeecCCCCC
Q psy11061         37 SSYEFPVVATDRG-GLSTTAMVRIQITDVNDNEP   69 (212)
Q Consensus        37 ~~~~l~i~a~d~~-~~~~~~~v~i~V~d~nd~~P   69 (212)
                      ....+...+.|.+ .++....+.|.|+.-|+ +|
T Consensus       100 Gnl~VvAtv~d~~~~l~~e~~liVtVqr~~~-pp  132 (133)
T PF09100_consen  100 GNLKVVATVKDGGKPLTGEAHLIVTVQRWNN-PP  132 (133)
T ss_dssp             EEEEEEEEETTTT---EEEEEEEEE---S----S
T ss_pred             ccEEEEEEEccCCcccceeEeEEEEeecccC-CC
Confidence            3455555555554 37889999999998887 55


No 57 
>PF07145 PAM2:  Ataxin-2 C-terminal region;  InterPro: IPR009818 This entry represents a conserved region approximately 250 residues long located towards the C terminus of eukaryotic ataxin-2. Ataxin-2 is a protein of unknown function, within which expansion of a polyglutamine tract (due to expansion of unstable CAG repeats in the coding region of the SCA2 gene) causes spinocerebellar ataxia type 2 (SCA2), a late-onset neurodegenerative disorder []. The expanded polyglutamine repeat in ataxin-2 causes disruption of the normal morphology of the Golgi complex and increased incidence of cell death []. Ataxin-2 is predicted to consist of mostly non-globular domains [].; PDB: 3NTW_B 1JH4_B 3KTR_B 3KUJ_B 3KUT_D 3KUS_D 1JGN_B 2RQG_A 2RQH_A.
Probab=23.05  E-value=51  Score=14.59  Aligned_cols=11  Identities=45%  Similarity=0.757  Sum_probs=7.6

Q ss_pred             cCCCCCccCCC
Q psy11061         64 VNDNEPIFNPV   74 (212)
Q Consensus        64 ~nd~~P~f~~~   74 (212)
                      .|-|+|+|.+.
T Consensus         6 LNp~A~eFvP~   16 (18)
T PF07145_consen    6 LNPNAPEFVPS   16 (18)
T ss_dssp             SSTTSSSS-TT
T ss_pred             cCCCCccccCC
Confidence            57888888654


No 58 
>TIGR03769 P_ac_wall_RPT actinobacterial surface-anchored protein domain. This model describes a repeat domain that one to three times in Actinobacterial proteins, some of which have LPXTG-type sortase recognition motifs for covalent attachment to the Gram-positive cell wall. Where it occurs with duplication in an LPXTG-anchored protein, it tends to be adjacent to the substrate-binding protein of the gene trio of an ABC transporter system, where that substrate-binding protein has a single copy of this same domain. This arrangement suggests a substrate-binding relay system, with the LPXTG protein acting as a substrate receptor.
Probab=22.96  E-value=1.4e+02  Score=16.20  Aligned_cols=14  Identities=21%  Similarity=0.589  Sum_probs=10.4

Q ss_pred             CCCCCeEEEEEEEE
Q psy11061        136 LSRSNSYHINVSAM  149 (212)
Q Consensus       136 ld~~~~~~l~v~a~  149 (212)
                      +.+.+.|.|.++++
T Consensus         9 FT~PG~Y~l~~~a~   22 (41)
T TIGR03769         9 FTKPGTYTLTVQAT   22 (41)
T ss_pred             eCCCeEEEEEEEEE
Confidence            44678898888874


No 59 
>cd01758 PLAT_LPL PLAT/ LH2 domain present in lipoprotein lipase (LPL).  LPL is a key enzyme in catabolism of plasma lipoprotein triglycerides (TGs) and has therefeore has a profound influence on triglyceride and high-density lipoprotein (HDL) cholesterol levels in the blood. In general, PLAT/LH2 domain's proposed function is to mediate interaction with lipids or membrane bound proteins.
Probab=21.24  E-value=3.1e+02  Score=19.42  Aligned_cols=66  Identities=11%  Similarity=0.187  Sum_probs=39.6

Q ss_pred             eEEEEEEEEeCCCCc-eEEEEEEEEeecCCCCCccCCCceEEEEecCCCCCcEEEEEEEEeCCCCCCceEEE
Q psy11061         38 SYEFPVVATDRGGLS-TTAMVRIQITDVNDNEPIFNPVEYNVSLRDDIQTTTAFAVVLATDRDSDRFGTISY  108 (212)
Q Consensus        38 ~~~l~i~a~d~~~~~-~~~~v~i~V~d~nd~~P~f~~~~~~~~v~e~~~~g~~v~~~~a~D~D~~~~~~i~y  108 (212)
                      .|.++|......... ....+.|.+...+.+...+     .+...+....|..-.-+..+|.|.|+-..+.+
T Consensus         2 hYqVtV~~~~~~~~~~t~~~v~i~L~G~~g~S~~~-----~l~~~~~~~~G~t~sfLi~t~~dlG~L~~vk~   68 (137)
T cd01758           2 HYQLKIHFFNQTNRIETDPTFTISLYGTLGESENL-----PLTLPEGITGNKTNSFLITTEKDIGDLLMLKL   68 (137)
T ss_pred             eEEEEEEEecccCCCcccceEEEEEEcCCCcccCE-----EEecCcccCCCCeEEEEEECCCCcCCEEEEEE
Confidence            578888877654332 5567777777776655433     22222322456666666777888776555555


No 60 
>PF15418 DUF4625:  Domain of unknown function (DUF4625)
Probab=21.17  E-value=3.1e+02  Score=19.32  Aligned_cols=19  Identities=26%  Similarity=0.613  Sum_probs=16.8

Q ss_pred             CCCeEEEEEEEEECCCCCe
Q psy11061        138 RSNSYHINVSAMDGGGNKC  156 (212)
Q Consensus       138 ~~~~~~l~v~a~D~~~~~~  156 (212)
                      ..+.|.|.+.|+|..|...
T Consensus       105 ~~G~YH~~i~VtD~~Gn~~  123 (132)
T PF15418_consen  105 PAGDYHFMITVTDAAGNQT  123 (132)
T ss_pred             CCcceEEEEEEEECCCCEE
Confidence            7899999999999988654


No 61 
>KOG3865|consensus
Probab=20.87  E-value=4.5e+02  Score=21.80  Aligned_cols=63  Identities=22%  Similarity=0.395  Sum_probs=42.3

Q ss_pred             CceEEeCCceEEEECCcCCcCC---CCeEEEEEEEEeCCCCceEEEEEEEEeecCCCCCccCCCceEEEE
Q psy11061         14 NHFYMKSVSGEICIAQDLDFES---RSSYEFPVVATDRGGLSTTAMVRIQITDVNDNEPIFNPVEYNVSL   80 (212)
Q Consensus        14 ~~F~id~~tg~i~~~~~lD~E~---~~~~~l~i~a~d~~~~~~~~~v~i~V~d~nd~~P~f~~~~~~~~v   80 (212)
                      ..|-+.  .|.|.+...||+|-   -+...+.|.++++.. .....+.+.|.-.-| -=.|....|...|
T Consensus       186 k~FlmS--~~~lhLevsLDkEiYyHGE~isvnV~V~NNsn-KtVKkIK~~V~Q~ad-i~Lfs~aqy~~~V  251 (402)
T KOG3865|consen  186 KQFLMS--DGPLHLEVSLDKEIYYHGEPISVNVHVTNNSN-KTVKKIKISVRQVAD-ICLFSTAQYKKPV  251 (402)
T ss_pred             HhhccC--CCceEEEEEecchheecCCceeEEEEEecCCc-ceeeeeEEEeEeece-EEEEeccccccee
Confidence            355555  48899999999998   467888888888743 355566666666555 3445555565444


Done!