Query         psy9046
Match_columns 215
No_of_seqs    290 out of 1336
Neff          9.3 
Searched_HMMs 46136
Date          Fri Aug 16 17:34:21 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy9046.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/9046hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG4289|consensus              100.0 4.7E-38   1E-42  282.4  18.4  191    1-215   252-452 (2531)
  2 KOG4289|consensus              100.0   4E-34 8.8E-39  257.3  16.1  190    1-214   766-968 (2531)
  3 KOG1219|consensus              100.0   6E-32 1.3E-36  250.1  21.6  191    1-215   934-1137(4289)
  4 cd00031 CA Cadherin repeat dom 100.0 4.1E-30 8.8E-35  198.9  24.0  170   28-215     1-181 (199)
  5 KOG1219|consensus              100.0 8.5E-31 1.8E-35  242.6  22.8  189    1-215  2559-2757(4289)
  6 KOG1834|consensus               99.8 5.1E-19 1.1E-23  151.4  18.0  180   10-215    25-226 (952)
  7 cd00031 CA Cadherin repeat dom  99.7 2.1E-16 4.6E-21  122.0  14.4  108    1-130    85-199 (199)
  8 PF00028 Cadherin:  Cadherin do  99.7 2.6E-15 5.6E-20  102.3  11.3   85   29-129     1-93  (93)
  9 PF00028 Cadherin:  Cadherin do  99.6 5.2E-15 1.1E-19  100.7  11.1   71  143-215     1-75  (93)
 10 smart00112 CA Cadherin repeats  99.5 2.3E-13 4.9E-18   89.8   8.5   72   49-136     1-79  (79)
 11 KOG1834|consensus               99.1 1.2E-09 2.6E-14   94.7  11.1  106    2-131   133-245 (952)
 12 smart00112 CA Cadherin repeats  98.8   2E-08 4.4E-13   65.9   6.4   51  165-215     1-55  (79)
 13 PF08758 Cadherin_pro:  Cadheri  97.9 7.4E-05 1.6E-09   50.1   6.7   75  135-215     3-77  (90)
 14 PF08266 Cadherin_2:  Cadherin-  97.1 0.00057 1.2E-08   45.2   3.3   58  143-203     3-65  (84)
 15 PF08266 Cadherin_2:  Cadherin-  97.0  0.0008 1.7E-08   44.5   3.4   56   29-88      3-65  (84)
 16 PF13750 Big_3_3:  Bacterial Ig  96.9   0.057 1.2E-06   40.2  12.6  114   99-215    10-133 (158)
 17 PF08758 Cadherin_pro:  Cadheri  96.6   0.034 7.4E-07   37.3   9.0   78   15-118     3-81  (90)
 18 smart00736 CADG Dystroglycan-t  96.2    0.11 2.3E-06   35.3   9.5   68   48-133    24-96  (97)
 19 TIGR01965 VCBS_repeat VCBS rep  95.8    0.15 3.2E-06   34.8   8.6   86   44-150     2-97  (99)
 20 PF05345 He_PIG:  Putative Ig d  93.6    0.37   8E-06   28.3   5.4   34  182-215    12-46  (49)
 21 smart00736 CADG Dystroglycan-t  93.1    0.78 1.7E-05   31.0   7.3   51  163-215    23-77  (97)
 22 PF07495 Y_Y_Y:  Y_Y_Y domain;   84.2     2.6 5.5E-05   25.9   4.2   30  100-129    35-66  (66)
 23 KOG3597|consensus               82.6      23 0.00049   31.0  10.5   55    1-61     25-81  (442)
 24 PF12245 Big_3_2:  Bacterial Ig  80.4      10 0.00022   23.1   5.8   35   98-132    17-51  (60)
 25 TIGR01965 VCBS_repeat VCBS rep  79.4     9.6 0.00021   26.0   5.9   37  160-196     2-39  (99)
 26 PF15418 DUF4625:  Domain of un  74.2      11 0.00023   27.2   5.3   36   93-128    96-131 (132)
 27 PF02494 HYR:  HYR domain;  Int  70.0      10 0.00022   24.4   4.1   25  104-128    57-81  (81)
 28 KOG3597|consensus               69.5      33 0.00072   30.0   8.1   59  120-180    24-83  (442)
 29 PF13754 Big_3_4:  Bacterial Ig  59.7      28 0.00061   20.6   4.3   17  102-118    22-38  (54)
 30 cd00146 PKD polycystic kidney   59.2      27 0.00058   22.1   4.5   27  102-128    55-81  (81)
 31 PF05895 DUF859:  Siphovirus pr  58.9 1.5E+02  0.0033   27.3  11.1   34  104-139   298-332 (624)
 32 PF13753 SWM_repeat:  Putative   56.4 1.1E+02  0.0025   25.1  10.5  104  103-215    11-121 (317)
 33 PF03160 Calx-beta:  Calx-beta   55.8      57  0.0012   21.6   6.1   50  125-180     2-53  (100)
 34 TIGR00845 caca sodium/calcium   54.7 2.1E+02  0.0046   27.8  17.3   54  122-180   515-569 (928)
 35 smart00089 PKD Repeats in poly  54.3      42 0.00092   21.0   4.9   26  102-128    53-78  (79)
 36 TIGR03660 T1SS_rpt_143 T1SS-14  52.4      88  0.0019   22.7  11.6   43  106-152    87-129 (137)
 37 PF13750 Big_3_3:  Bacterial Ig  47.6      58  0.0013   24.1   5.2   32  100-131   119-150 (158)
 38 PRK14081 triple tyrosine motif  47.4 2.4E+02  0.0052   26.3  16.2   32  101-132   155-189 (667)
 39 TIGR00845 caca sodium/calcium   43.5 3.2E+02   0.007   26.6  11.1   28    3-37    516-543 (928)
 40 PRK13211 N-acetylglucosamine-b  37.7   3E+02  0.0064   24.6  14.1   33   99-131   374-407 (478)
 41 cd02848 Chitinase_N_term Chiti  36.1      61  0.0013   22.4   3.5   29  102-130    78-106 (106)
 42 PF08329 ChitinaseA_N:  Chitina  35.0      62  0.0013   23.4   3.5   31  102-132    81-111 (133)
 43 PF07550 DUF1533:  Protein of u  25.7 1.4E+02   0.003   18.3   3.6   33  182-214    24-56  (65)

No 1  
>KOG4289|consensus
Probab=100.00  E-value=4.7e-38  Score=282.41  Aligned_cols=191  Identities=29%  Similarity=0.430  Sum_probs=178.1

Q ss_pred             CeEEEEEEEecCCCCCccCCCCCCCCCceEEEEecCCCCCcEEEEEEEEeCCCCce---eEEEecCC--CcEEEeCCccc
Q psy9046           1 MGTLIISIIDVNDIPPEFEPPWTRDEPNYLLEIEEEQPIGSLLGKFTALDKDSSIA---LYVLEQDG--DMFSVNNITVC   75 (215)
Q Consensus         1 t~~v~I~V~DvND~~P~f~~~~~~~~~~~~~~V~E~~~~g~~v~~v~a~D~D~~~~---~y~i~~~~--~~F~id~~tg~   75 (215)
                      |++|+|.|+|+|||.|+|      ++..|.-++.||.++|..|.++.|+|.|.+.+   .|++..|+  ..|.|++.+|.
T Consensus       252 tttv~V~V~D~nDhsPvF------Eq~~Y~e~lREn~evGy~vLtvrAtD~Dsp~Nani~Yrl~eg~~~~~f~in~rSGv  325 (2531)
T KOG4289|consen  252 TTTVTVLVLDTNDHSPVF------EQDEYREELRENLEVGYEVLTVRATDGDSPPNANIRYRLLEGNAKNVFEINPRSGV  325 (2531)
T ss_pred             eeEEEEEEeecCCCCccc------chhHHHHHHhhccccCceEEEEEeccCCCCCCCceEEEecCCCccceeEEcCccce
Confidence            689999999999999999      79999999999999999999999999999876   59997764  79999999998


Q ss_pred             eEEEEEEEecCCCcceeeEEEEecCCCCCeEEEEEEEEeCCCC--ceEEEEEEEEeeCCCCCCccCCCceEEEEecCCCC
Q psy9046          76 EFQLSVDLAAADNFQSEYNATIKENSPGDTKVVLVSAEDASTP--CSHFRVNIKVLDVNDNRPEFKSKTYRTSIQENIII  153 (215)
Q Consensus        76 ~~~~~~~~~ld~e~~~~~~~~~~e~~~~~~~~l~v~a~D~~~~--~~~~~v~I~V~DvNdn~P~f~~~~~~~~v~E~~~~  153 (215)
                      |.+-.   ++|||....|+             |.|.|+|.|.+  ..++.|.|+|.|+|||+|+|..+.|.++|.|+..+
T Consensus       326 I~T~a---~lDRE~~~~y~-------------L~VeAsDqG~~pgp~Ta~V~itV~D~NDNaPqFse~~Yvvqv~Edvt~  389 (2531)
T KOG4289|consen  326 ISTRA---PLDREELESYQ-------------LDVEASDQGRPPGPRTAMVEITVEDENDNAPQFSEKRYVVQVREDVTP  389 (2531)
T ss_pred             eeccC---ccCHHhhhheE-------------EEEEeccCCCCCCCceEEEEEEEEecCCCCccccccceEEEecccCCC
Confidence            55443   79999999999             79999999876  56999999999999999999999999999999986


Q ss_pred             CCCCEEEEEEEEeCCC--CCeEEEEEeeCCCCCcEEEcC-CeeEEEcccCCCCccEEEEEEEEEC
Q psy9046         154 TPSPTLLTLEATDSDE--NPHLTYKILSGNDDDVFAVDD-TGILTIQGPLAKGKTVYNLLVQVSD  215 (215)
Q Consensus       154 g~~~~i~~v~a~D~D~--~~~i~y~i~~~~~~~~F~Id~-~G~i~~~~~ld~~~~~y~l~V~a~D  215 (215)
                      +  +.|++|+|+|.|.  |+.++|+|.+|+..+.|.||. +|+|.+..+||.+...|++.|+|+|
T Consensus       390 ~--avvlrV~AtDrD~g~Ng~VHYsi~Sgn~~G~f~id~~tGel~vv~plD~e~~~ytl~IrAqD  452 (2531)
T KOG4289|consen  390 P--AVVLRVTATDRDKGTNGKVHYSIASGNGRGQFYIDSLTGELDVVEPLDFENSEYTLRIRAQD  452 (2531)
T ss_pred             C--ceEEEEEecccCCCcCceEEEEeeccCccccEEEecccceEEEeccccccCCeeEEEEEccc
Confidence            6  9999999999999  999999999999999999999 9999999999988779999999987


No 2  
>KOG4289|consensus
Probab=100.00  E-value=4e-34  Score=257.32  Aligned_cols=190  Identities=27%  Similarity=0.403  Sum_probs=174.4

Q ss_pred             CeEEEEEEEecCCCCCccCCCCCCCCCceEEEEecCCCCCcEEEEEEEEeCCCCcee---EEEe---cCCCcEEEeCCcc
Q psy9046           1 MGTLIISIIDVNDIPPEFEPPWTRDEPNYLLEIEEEQPIGSLLGKFTALDKDSSIAL---YVLE---QDGDMFSVNNITV   74 (215)
Q Consensus         1 t~~v~I~V~DvND~~P~f~~~~~~~~~~~~~~V~E~~~~g~~v~~v~a~D~D~~~~~---y~i~---~~~~~F~id~~tg   74 (215)
                      |++|.|.|.|+|||+|+|      ..+.|.++|.|+.|+++.+++|+|+|+|.|.++   |.+.   ++.+.|.|++.+|
T Consensus       766 tttveV~v~diNDnaPqf------~assyt~sV~Ed~Pv~TsvlQVSatDaD~g~Ng~v~y~~qg~~d~p~~F~IEptSG  839 (2531)
T KOG4289|consen  766 TTTVEVLVNDINDNAPQF------LASSYTGSVFEDAPVFTSVLQVSATDADSGPNGRVYYTFQGGDDGPGDFYIEPTSG  839 (2531)
T ss_pred             cEEEEEEeecccccCccc------chhhceeEeecCCCCcceEEEEEEeccCCCCCceEEEEecCCCCCCCceEEccCcc
Confidence            678999999999999999      589999999999999999999999999998874   5553   3458999999999


Q ss_pred             ceEEEEEEEecCCCcceeeEEEEecCCCCCeEEEEEEEEeCCCC--ceEEEEEEEEeeCCCCCCccCCCceEEEEecCCC
Q psy9046          75 CEFQLSVDLAAADNFQSEYNATIKENSPGDTKVVLVSAEDASTP--CSHFRVNIKVLDVNDNRPEFKSKTYRTSIQENII  152 (215)
Q Consensus        75 ~~~~~~~~~~ld~e~~~~~~~~~~e~~~~~~~~l~v~a~D~~~~--~~~~~v~I~V~DvNdn~P~f~~~~~~~~v~E~~~  152 (215)
                      .+.+++   +||||....|.             |.+.|.|.|.|  ++.+.|+|+|+|+|||||+|.+..|...|.||.+
T Consensus       840 viRtl~---rLdRE~~avy~-------------L~a~avDrg~p~ls~~~eItvtvldvNDnaPvfe~~e~e~~I~ensp  903 (2531)
T KOG4289|consen  840 VIRTLR---RLDRENVAVYV-------------LAAYAVDRGNPPLSAPVEITVTVLDVNDNAPVFEQDELELFIEENSP  903 (2531)
T ss_pred             eeehhh---hhcchheeEEE-------------EEEEEeeCCCCCcCCceEEEEEEEecCCCCCCCCCcceeeEEeecCc
Confidence            987777   69999999998             78999999987  8889999999999999999999999999999999


Q ss_pred             CCCCCEEEEEEEEeCCC--CCeEEEEEeeCCCCCcEEEcC-CeeEEEcccCCCC--ccEEEEEEEEE
Q psy9046         153 ITPSPTLLTLEATDSDE--NPHLTYKILSGNDDDVFAVDD-TGILTIQGPLAKG--KTVYNLLVQVS  214 (215)
Q Consensus       153 ~g~~~~i~~v~a~D~D~--~~~i~y~i~~~~~~~~F~Id~-~G~i~~~~~ld~~--~~~y~l~V~a~  214 (215)
                      +|  ..+++++|.|+|+  |++|.|+|++|+....|.++. .|+|...-.||++  ...|.+.++|+
T Consensus       904 vg--s~va~i~a~dpdEG~NA~IsYqIvgg~d~~~fq~de~~~~lla~~efdyef~~~eyv~~~qat  968 (2531)
T KOG4289|consen  904 VG--SVVALITADDPDEGPNAHISYQIVGGNDPELFQLDEFSGELLALVEFDYEFTRVEYVLVVQAT  968 (2531)
T ss_pred             cc--eeeEEEEccCCCcCCcceEEEeeccCccHHHHHHHHhhhhhhhheeehhhhccceeeEEeecc
Confidence            98  9999999999999  999999999999899999999 9999999999965  67888888774


No 3  
>KOG1219|consensus
Probab=100.00  E-value=6e-32  Score=250.11  Aligned_cols=191  Identities=35%  Similarity=0.463  Sum_probs=175.5

Q ss_pred             CeEEEEEEEecCCC--CCccCCCCCCCCCceEEEEecCCCCCcEEEEEEEEeCCCCc---eeEEEecC--CCcEEEeCCc
Q psy9046           1 MGTLIISIIDVNDI--PPEFEPPWTRDEPNYLLEIEEEQPIGSLLGKFTALDKDSSI---ALYVLEQD--GDMFSVNNIT   73 (215)
Q Consensus         1 t~~v~I~V~DvND~--~P~f~~~~~~~~~~~~~~V~E~~~~g~~v~~v~a~D~D~~~---~~y~i~~~--~~~F~id~~t   73 (215)
                      +|.+.|.|+|+|.|  ||.|      ..-.-.++|.||+|+|+.++++.|.|.|.|.   ..|+|..+  .+.|+|+..+
T Consensus       934 ~chl~Vevldv~enlhpp~F------~~~v~e~~V~EnapiGT~vi~i~A~dedsgldg~l~Y~I~~gdg~g~FsId~~t 1007 (4289)
T KOG1219|consen  934 ICHLEVEVLDVNENLHPPEF------ISFVTEGHVLENAPIGTIVIRIQARDEDSGLDGELSYKIRTGDGDGIFSIDSTT 1007 (4289)
T ss_pred             eEEEEEEEeccCCCCCCcch------heeeeeeeEeecCCcceEEEEEEEecCCCCccceEEEEEEcCCcceeEEecCCc
Confidence            47899999999876  9999      4667789999999999999999999999864   46999654  5799999999


Q ss_pred             cceEEEEEEEecCCCcceeeEEEEecCCCCCeEEEEEEEEeCCCC--ceEEEEEEEEeeCCCCCCccCCCceEEEEecCC
Q psy9046          74 VCEFQLSVDLAAADNFQSEYNATIKENSPGDTKVVLVSAEDASTP--CSHFRVNIKVLDVNDNRPEFKSKTYRTSIQENI  151 (215)
Q Consensus        74 g~~~~~~~~~~ld~e~~~~~~~~~~e~~~~~~~~l~v~a~D~~~~--~~~~~v~I~V~DvNdn~P~f~~~~~~~~v~E~~  151 (215)
                      |.+.++.   .||||.++.|.             |+|.|+|.|..  ++.+.+.|.|.|+|||+|+|.++.|..+|.|++
T Consensus      1008 G~irTl~---~lDrE~ks~Yw-------------ltveA~D~gt~~~ssv~~vyI~ieDvNDn~Pq~s~pvy~asI~enS 1071 (4289)
T KOG1219|consen 1008 GSIRTLK---ALDREKKSSYW-------------LTVEAKDLGTVPLSSVCEVYIEIEDVNDNVPQFSSPVYYASISENS 1071 (4289)
T ss_pred             ceEeech---hhchhhcceEE-------------EEEEEEecCCCccccceeEEEEEEecCCCCcccCCceEeeeeccCC
Confidence            9988887   59999999999             79999999976  889999999999999999999999999999999


Q ss_pred             CCCCCCEEEEEEEEeCCC--CCeEEEEEeeCCCCCcEEEcC-CeeEEEcccCCCC-ccEEEEEEEEEC
Q psy9046         152 IITPSPTLLTLEATDSDE--NPHLTYKILSGNDDDVFAVDD-TGILTIQGPLAKG-KTVYNLLVQVSD  215 (215)
Q Consensus       152 ~~g~~~~i~~v~a~D~D~--~~~i~y~i~~~~~~~~F~Id~-~G~i~~~~~ld~~-~~~y~l~V~a~D  215 (215)
                      +.+  ..|.++.|.|+|.  |++++|.|.+|+..++|.|++ +|.|+..+.||+| +..|.|.|.++|
T Consensus      1072 p~~--vsivq~ea~D~Dsssn~kLmykI~sGnyq~FF~Id~~TG~iTt~r~LDRE~qdEHiLeVTi~D 1137 (4289)
T KOG1219|consen 1072 PET--VSIVQAEANDPDSSSNQKLMYKITSGNYQGFFQIDPETGLITTIRRLDREKQDEHILEVTIQD 1137 (4289)
T ss_pred             CCc--eEEEEeccCCCCcccCcceEEEEccCCccceEEEccccceeeeehhhcccccccceEEEEEec
Confidence            988  9999999999996  999999999999999999999 9999999999965 889999999887


No 4  
>cd00031 CA Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here
Probab=99.98  E-value=4.1e-30  Score=198.86  Aligned_cols=170  Identities=38%  Similarity=0.570  Sum_probs=153.2

Q ss_pred             ceEEEEecCCCCCcEEEEEEEEeCCCCc---eeEEEecCC--CcEEEeCCccceEEEEEEEecCCCcceeeEEEEecCCC
Q psy9046          28 NYLLEIEEEQPIGSLLGKFTALDKDSSI---ALYVLEQDG--DMFSVNNITVCEFQLSVDLAAADNFQSEYNATIKENSP  102 (215)
Q Consensus        28 ~~~~~V~E~~~~g~~v~~v~a~D~D~~~---~~y~i~~~~--~~F~id~~tg~~~~~~~~~~ld~e~~~~~~~~~~e~~~  102 (215)
                      .|.+.|+|+++.|+.++++.|.|+|.+.   ..|+|.++.  .+|.|++.+|.+...+   .||||....|.        
T Consensus         1 ~~~~~i~En~~~g~~v~~~~a~D~D~~~~~~~~y~i~~~~~~~~F~i~~~tG~l~~~~---~lD~e~~~~~~--------   69 (199)
T cd00031           1 SYSVSVPENAPPGTVVGTVSATDPDSGENGRVTYSILGGNEDGLFSIDPNTGVITTTK---PLDREEQSEYT--------   69 (199)
T ss_pred             CeEEEEeCCCCCCCEEEEEEEECCCCCCCceEEEEEeCCCCcccEEEeCCCCEEEECC---CCCCcCCceEE--------
Confidence            3789999999999999999999999974   479998766  5999999999865544   69999999988        


Q ss_pred             CCeEEEEEEEEeCCCCc--eEEEEEEEEeeCCCCCCccCCCceEEEEecCCCCCCCCEEEEEEEEeCCC--CCeEEEEEe
Q psy9046         103 GDTKVVLVSAEDASTPC--SHFRVNIKVLDVNDNRPEFKSKTYRTSIQENIIITPSPTLLTLEATDSDE--NPHLTYKIL  178 (215)
Q Consensus       103 ~~~~~l~v~a~D~~~~~--~~~~v~I~V~DvNdn~P~f~~~~~~~~v~E~~~~g~~~~i~~v~a~D~D~--~~~i~y~i~  178 (215)
                           |.|+|+|.+.+.  .+..++|.|.|+|||+|.|....|.+.+.|+.+.|  +.++++.|+|+|.  ++.++|+|.
T Consensus        70 -----l~v~a~D~g~~~~~~~~~v~I~V~d~Nd~~P~~~~~~~~~~v~e~~~~~--~~i~~~~a~D~D~~~~~~~~y~l~  142 (199)
T cd00031          70 -----LTVVASDGGGPPLSSTATVTVTVLDVNDNPPVFEQSSYEASVPENAPPG--TVVGTVTATDADSGENAKLTYSIL  142 (199)
T ss_pred             -----EEEEEEECCcCcceeEEEEEEEEccCCCCCCcccccceEEEEeCCCCCC--CEEEEEEEEcCCCCCCccEEEEEe
Confidence                 789999987775  89999999999999999999889999999999977  9999999999998  799999999


Q ss_pred             eCCCCCcEEEcC-CeeEEEcccCCCC-ccEEEEEEEEEC
Q psy9046         179 SGNDDDVFAVDD-TGILTIQGPLAKG-KTVYNLLVQVSD  215 (215)
Q Consensus       179 ~~~~~~~F~Id~-~G~i~~~~~ld~~-~~~y~l~V~a~D  215 (215)
                      ++...+.|.|++ +|.|++.+.||++ ...|.|.|.|+|
T Consensus       143 ~~~~~~~f~i~~~~G~i~~~~~ld~e~~~~~~l~v~a~D  181 (199)
T cd00031         143 SGNDKELFSIDPNTGIITLAKPLDREEKSSYELTVVATD  181 (199)
T ss_pred             CCCCCCEEEEeCCceEEEeCCccCCccCceEEEEEEEEE
Confidence            876568999999 9999999999965 569999999987


No 5  
>KOG1219|consensus
Probab=99.98  E-value=8.5e-31  Score=242.59  Aligned_cols=189  Identities=27%  Similarity=0.374  Sum_probs=168.6

Q ss_pred             CeEEEEEEEecCCCCCccCCCCCCCCCceEEEEecCCCCCcEEEEEEEEeCCCCceeEEEecC------CCcEEEeCCcc
Q psy9046           1 MGTLIISIIDVNDIPPEFEPPWTRDEPNYLLEIEEEQPIGSLLGKFTALDKDSSIALYVLEQD------GDMFSVNNITV   74 (215)
Q Consensus         1 t~~v~I~V~DvND~~P~f~~~~~~~~~~~~~~V~E~~~~g~~v~~v~a~D~D~~~~~y~i~~~------~~~F~id~~tg   74 (215)
                      +++|.|+|++..++.|+|      +.+.|.++|+|+.++|+.|++++|.|.|...+...+.++      ...|.+|+.||
T Consensus      2559 ~ttV~v~vl~e~v~lPrF------Sep~y~fsvpEDv~vG~~Ig~v~a~~a~~~~i~~~v~~gt~Esn~d~~Fsvdr~TG 2632 (4289)
T KOG1219|consen 2559 NTTVIVTVLPEDVNLPRF------SEPIYTFSVPEDVPVGEEIGQVSASDADEHVIYSLVLGGTPESNPDLPFSVDRNTG 2632 (4289)
T ss_pred             ceEEEEEecCcccCcccc------cCceEEEeccccCCCCCeeeEEeecccCCceEEEEEeCCCCCCCCCCceEEcCCCc
Confidence            478999999999999999      688999999999999999999999999976653333222      25799999999


Q ss_pred             ceEEEEEEEecCCCcceeeEEEEecCCCCCeEEEEEEEEeCCCCceEEEEEEEEeeCCCCCCccCCCceEEEEecCCCCC
Q psy9046          75 CEFQLSVDLAAADNFQSEYNATIKENSPGDTKVVLVSAEDASTPCSHFRVNIKVLDVNDNRPEFKSKTYRTSIQENIIIT  154 (215)
Q Consensus        75 ~~~~~~~~~~ld~e~~~~~~~~~~e~~~~~~~~l~v~a~D~~~~~~~~~v~I~V~DvNdn~P~f~~~~~~~~v~E~~~~g  154 (215)
                      .+..-+   +||+|.+.+|+             +.|+|++.++--+..+|.|.|.|+|||+|+|+.+.|.+.+.||+++|
T Consensus      2633 ~i~v~k---sLD~E~kk~yq-------------i~v~a~~~~~vva~tsv~vqVkDvNDNaPvFe~d~y~f~i~En~pvG 2696 (4289)
T KOG1219|consen 2633 MIKVNK---SLDHEKKKSYQ-------------IKVKATCGQWVVAETSVFVQVKDVNDNAPVFEKDPYLFIIEENSPVG 2696 (4289)
T ss_pred             eEEecc---ccchhhhceEE-------------EEEEeecCCceEEEEEEEEEeecccCCCccccCCceeEEEeccCCCC
Confidence            865444   79999999999             68999998876788899999999999999999999999999999998


Q ss_pred             CCCEEEEEEEEeCCC--CCeEEEEEeeCCCCCcEEEcC-CeeEEEcccCCCC-ccEEEEEEEEEC
Q psy9046         155 PSPTLLTLEATDSDE--NPHLTYKILSGNDDDVFAVDD-TGILTIQGPLAKG-KTVYNLLVQVSD  215 (215)
Q Consensus       155 ~~~~i~~v~a~D~D~--~~~i~y~i~~~~~~~~F~Id~-~G~i~~~~~ld~~-~~~y~l~V~a~D  215 (215)
                        +.|++++|.|.|.  |++|+|++...  ...|.|++ +|.|++...||.+ ...|.|.|.|+|
T Consensus      2697 --tsV~qf~AsD~Ds~~nGqirysl~~~--v~yF~In~etGwlTt~~eld~ek~d~y~lkv~AtD 2757 (4289)
T KOG1219|consen 2697 --TSVIQFHASDMDSGNNGQIRYSLTSP--VPYFAINPETGWLTTLFELDLEKQDLYSLKVVATD 2757 (4289)
T ss_pred             --ceEEEEEeeccCCCCCceEEEEEcCC--cceEEEcCCCCeeeehhhhccccCCceEEEEEEec
Confidence              9999999999999  89999999875  33899999 9999999999955 899999999998


No 6  
>KOG1834|consensus
Probab=99.83  E-value=5.1e-19  Score=151.38  Aligned_cols=180  Identities=27%  Similarity=0.328  Sum_probs=150.8

Q ss_pred             ecCCCCCccCCCCCCCCCceEEEEecCCCCCcEEEEEEEEeCCCC------ceeEEEecCCCcEEE---eCCccceEEEE
Q psy9046          10 DVNDIPPEFEPPWTRDEPNYLLEIEEEQPIGSLLGKFTALDKDSS------IALYVLEQDGDMFSV---NNITVCEFQLS   80 (215)
Q Consensus        10 DvND~~P~f~~~~~~~~~~~~~~V~E~~~~g~~v~~v~a~D~D~~------~~~y~i~~~~~~F~i---d~~tg~~~~~~   80 (215)
                      -+|-+.|..       ...|...|.||...=.....+-|.|.|.+      .-+|.|.+..-.|.+   |..||+ -.++
T Consensus        25 rankhkpwi-------e~ey~gvV~Endntvll~Ppl~aLdkdaplr~ageiC~fklhgq~vPFdavVvdK~TGe-gvlR   96 (952)
T KOG1834|consen   25 RANKHKPWI-------EEEYHGVVTENDNTVLLDPPLAALDKDAPLRYAGEICGFKLHGQPVPFDAVVVDKYTGE-GVLR   96 (952)
T ss_pred             cccccCccc-------ccceeEEEEeCCceEEeCCCeeeecCCCCcccccccceeEecCCCCCceEEEEeccCCc-eEEe
Confidence            356667776       78899999999764444456889999984      336998776666654   888997 5666


Q ss_pred             EEEecCCCcceeeEEEEecCCCCCeEEEEEEEEeCCCC--------ceEEEEEEEEeeCCCCCCccCCCceEEEEecCCC
Q psy9046          81 VDLAAADNFQSEYNATIKENSPGDTKVVLVSAEDASTP--------CSHFRVNIKVLDVNDNRPEFKSKTYRTSIQENII  152 (215)
Q Consensus        81 ~~~~ld~e~~~~~~~~~~e~~~~~~~~l~v~a~D~~~~--------~~~~~v~I~V~DvNdn~P~f~~~~~~~~v~E~~~  152 (215)
                      ++.+||-|.++.|+             ++|+|.|.|..        +..++|+|+|.|+|+++|+|..+.|.+.|.|+..
T Consensus        97 aK~~lDCelqkeyt-------------f~iQAydCg~gpdgtn~kKShkatvhIrVkDvNe~AP~f~ep~Yka~V~EGK~  163 (952)
T KOG1834|consen   97 AKEPLDCELQKEYT-------------FTIQAYDCGNGPDGTNTKKSHKATVHIRVKDVNEFAPVFKEPWYKAHVTEGKV  163 (952)
T ss_pred             ecCcccccccccce-------------EEEEEEecCCCCCccccccccceEEEEEeccccccCchhcccceeeEEeccee
Confidence            66699999999999             68999998754        6789999999999999999999999999999987


Q ss_pred             CCCCCEEEEEEEEeCCC---CCe-EEEEEeeCCCCCcEEEcCCeeEEEcccCCCC-ccEEEEEEEEEC
Q psy9046         153 ITPSPTLLTLEATDSDE---NPH-LTYKILSGNDDDVFAVDDTGILTIQGPLAKG-KTVYNLLVQVSD  215 (215)
Q Consensus       153 ~g~~~~i~~v~a~D~D~---~~~-i~y~i~~~~~~~~F~Id~~G~i~~~~~ld~~-~~~y~l~V~a~D  215 (215)
                      .   ..|++|.|.|.|.   +++ ..|.|..  .+-.|.||..|.|+.+.+|.+. ..+|.|+|+|-|
T Consensus       164 y---d~il~veAiD~DCspq~sqIC~YEI~t--~d~PFaIdn~G~irnTekLny~ke~~Y~ltVtAyD  226 (952)
T KOG1834|consen  164 Y---DSILRVEAIDKDCSPQYSQICEYEITT--PDVPFAIDNDGNIRNTEKLNYTKEHQYKLTVTAYD  226 (952)
T ss_pred             e---eeeEEEEeecCCCCCcccceeEEEecC--CCCceEEcCCCccccccccccccceeEEEEEEEEe
Confidence            6   5899999999998   555 5889985  5668999999999999999974 789999999987


No 7  
>cd00031 CA Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here
Probab=99.72  E-value=2.1e-16  Score=121.98  Aligned_cols=108  Identities=30%  Similarity=0.462  Sum_probs=93.4

Q ss_pred             CeEEEEEEEecCCCCCccCCCCCCCCCceEEEEecCCCCCcEEEEEEEEeCCC---CceeEEEecCC--CcEEEeCCccc
Q psy9046           1 MGTLIISIIDVNDIPPEFEPPWTRDEPNYLLEIEEEQPIGSLLGKFTALDKDS---SIALYVLEQDG--DMFSVNNITVC   75 (215)
Q Consensus         1 t~~v~I~V~DvND~~P~f~~~~~~~~~~~~~~V~E~~~~g~~v~~v~a~D~D~---~~~~y~i~~~~--~~F~id~~tg~   75 (215)
                      +++|+|.|.|+||++|.|      ....|.+.|.|+.++|+.++++.|+|+|.   +...|+|..+.  .+|.|++.+|.
T Consensus        85 ~~~v~I~V~d~Nd~~P~~------~~~~~~~~v~e~~~~~~~i~~~~a~D~D~~~~~~~~y~l~~~~~~~~f~i~~~~G~  158 (199)
T cd00031          85 TATVTVTVLDVNDNPPVF------EQSSYEASVPENAPPGTVVGTVTATDADSGENAKLTYSILSGNDKELFSIDPNTGI  158 (199)
T ss_pred             EEEEEEEEccCCCCCCcc------cccceEEEEeCCCCCCCEEEEEEEEcCCCCCCccEEEEEeCCCCCCEEEEeCCceE
Confidence            367999999999999999      46899999999999999999999999998   45579998776  79999999998


Q ss_pred             eEEEEEEEecCCCcceeeEEEEecCCCCCeEEEEEEEEeCC--CCceEEEEEEEEee
Q psy9046          76 EFQLSVDLAAADNFQSEYNATIKENSPGDTKVVLVSAEDAS--TPCSHFRVNIKVLD  130 (215)
Q Consensus        76 ~~~~~~~~~ld~e~~~~~~~~~~e~~~~~~~~l~v~a~D~~--~~~~~~~v~I~V~D  130 (215)
                      +....   .||+|....|.             +.|.|+|.+  ..+.++.++|.|.|
T Consensus       159 i~~~~---~ld~e~~~~~~-------------l~v~a~D~~~~~~~~~~~i~i~v~d  199 (199)
T cd00031         159 ITLAK---PLDREEKSSYE-------------LTVVATDGGGPPLSSTATVTVTVLD  199 (199)
T ss_pred             EEeCC---ccCCccCceEE-------------EEEEEEECCCCCceeEEEEEEEEEC
Confidence            65544   69999988888             789999998  44778888888765


No 8  
>PF00028 Cadherin:  Cadherin domain;  InterPro: IPR002126 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion.; GO: 0005509 calcium ion binding, 0007156 homophilic cell adhesion, 0016020 membrane; PDB: 2A4E_A 2A4C_B 2O72_A 2QVI_A 1NCJ_A 3Q2W_A 3Q2N_A 3LNH_B 3LNI_A 3Q2L_A ....
Probab=99.65  E-value=2.6e-15  Score=102.26  Aligned_cols=85  Identities=31%  Similarity=0.414  Sum_probs=72.9

Q ss_pred             eEEEEecCCCCCcEEEEEEEEeCCCCce---eEEEecCC--CcEEEeCCccceEEEEEEEecCCCcceeeEEEEecCCCC
Q psy9046          29 YLLEIEEEQPIGSLLGKFTALDKDSSIA---LYVLEQDG--DMFSVNNITVCEFQLSVDLAAADNFQSEYNATIKENSPG  103 (215)
Q Consensus        29 ~~~~V~E~~~~g~~v~~v~a~D~D~~~~---~y~i~~~~--~~F~id~~tg~~~~~~~~~~ld~e~~~~~~~~~~e~~~~  103 (215)
                      |.++|+|++++|+.++++.|.|+|.+.+   .|+|..++  .+|.|++.+|.+.+.+   +||||....|.         
T Consensus         1 Y~~~v~E~~~~g~~v~~v~a~D~D~~~n~~i~y~i~~~~~~~~F~I~~~tg~i~~~~---~LD~E~~~~y~---------   68 (93)
T PF00028_consen    1 YSFSVPENAPPGTVVGQVTATDPDSGPNSQITYSILGGNPDGLFSIDPNTGEISLKK---PLDRETQSSYQ---------   68 (93)
T ss_dssp             EEEEEETTGSTSSEEEEEEEEESSTSTTSSEEEEEEETTSTTSEEEETTTTEEEESS---SSCTTTTSEEE---------
T ss_pred             CEEEEECCCCCCCEEEEEEEEeCCCCCCceEEEEEecCcccCceEEeeeeeccccce---ecCcccCCEEE---------
Confidence            7899999999999999999999997554   69998766  8999999999855544   79999999999         


Q ss_pred             CeEEEEEEEEeC-CCC--ceEEEEEEEEe
Q psy9046         104 DTKVVLVSAEDA-STP--CSHFRVNIKVL  129 (215)
Q Consensus       104 ~~~~l~v~a~D~-~~~--~~~~~v~I~V~  129 (215)
                          |.|.|+|. +.|  +++++|.|+|+
T Consensus        69 ----l~v~a~D~~~~~~~~~~~~V~I~V~   93 (93)
T PF00028_consen   69 ----LTVRATDSGGSPPLSSTATVTINVL   93 (93)
T ss_dssp             ----EEEEEEETTTSSEEEEEEEEEEEEE
T ss_pred             ----EEEEEEECCCCCCCEEEEEEEEEEC
Confidence                79999999 666  67777777764


No 9  
>PF00028 Cadherin:  Cadherin domain;  InterPro: IPR002126 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion.; GO: 0005509 calcium ion binding, 0007156 homophilic cell adhesion, 0016020 membrane; PDB: 2A4E_A 2A4C_B 2O72_A 2QVI_A 1NCJ_A 3Q2W_A 3Q2N_A 3LNH_B 3LNI_A 3Q2L_A ....
Probab=99.63  E-value=5.2e-15  Score=100.75  Aligned_cols=71  Identities=37%  Similarity=0.634  Sum_probs=66.9

Q ss_pred             eEEEEecCCCCCCCCEEEEEEEEeCCC--CCeEEEEEeeCCCCCcEEEcC-CeeEEEcccCCCC-ccEEEEEEEEEC
Q psy9046         143 YRTSIQENIIITPSPTLLTLEATDSDE--NPHLTYKILSGNDDDVFAVDD-TGILTIQGPLAKG-KTVYNLLVQVSD  215 (215)
Q Consensus       143 ~~~~v~E~~~~g~~~~i~~v~a~D~D~--~~~i~y~i~~~~~~~~F~Id~-~G~i~~~~~ld~~-~~~y~l~V~a~D  215 (215)
                      |++.|+|++++|  +.++++.|.|+|.  |+.+.|+|..++..++|.|++ +|.|++.++||++ .+.|.|.|.|+|
T Consensus         1 Y~~~v~E~~~~g--~~v~~v~a~D~D~~~n~~i~y~i~~~~~~~~F~I~~~tg~i~~~~~LD~E~~~~y~l~v~a~D   75 (93)
T PF00028_consen    1 YSFSVPENAPPG--TVVGQVTATDPDSGPNSQITYSILGGNPDGLFSIDPNTGEISLKKPLDRETQSSYQLTVRATD   75 (93)
T ss_dssp             EEEEEETTGSTS--SEEEEEEEEESSTSTTSSEEEEEEETTSTTSEEEETTTTEEEESSSSCTTTTSEEEEEEEEEE
T ss_pred             CEEEEECCCCCC--CEEEEEEEEeCCCCCCceEEEEEecCcccCceEEeeeeeccccceecCcccCCEEEEEEEEEE
Confidence            789999999987  9999999999996  999999999988889999999 9999999999964 899999999987


No 10 
>smart00112 CA Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
Probab=99.48  E-value=2.3e-13  Score=89.77  Aligned_cols=72  Identities=33%  Similarity=0.408  Sum_probs=60.1

Q ss_pred             EeCCCCce---eEEEecCC--CcEEEeCCccceEEEEEEEecCCCcceeeEEEEecCCCCCeEEEEEEEEeCCCC--ceE
Q psy9046          49 LDKDSSIA---LYVLEQDG--DMFSVNNITVCEFQLSVDLAAADNFQSEYNATIKENSPGDTKVVLVSAEDASTP--CSH  121 (215)
Q Consensus        49 ~D~D~~~~---~y~i~~~~--~~F~id~~tg~~~~~~~~~~ld~e~~~~~~~~~~e~~~~~~~~l~v~a~D~~~~--~~~  121 (215)
                      +|+|.|.+   .|+|..+.  .+|.|++.+|.+...+   .||||....|.             |.|.|.|.+.+  ++.
T Consensus         1 ~D~D~g~n~~i~Y~i~~~~~~~~F~i~~~tg~i~~~~---~LD~e~~~~y~-------------l~v~a~D~~~~~~~~~   64 (79)
T smart00112        1 TDADSGENGKVTYSILSGNEDGLFSIDPETGEITTTK---PLDREEQPEYT-------------LTVEATDGGGPPLSST   64 (79)
T ss_pred             CCCCCCcCcEEEEEEecCCCCCEEEEeCCccEEEeCC---ccCeeCCCeEE-------------EEEEEEECCCCCcccE
Confidence            37787643   69997765  8999999999644333   79999999998             78999999874  889


Q ss_pred             EEEEEEEeeCCCCCC
Q psy9046         122 FRVNIKVLDVNDNRP  136 (215)
Q Consensus       122 ~~v~I~V~DvNdn~P  136 (215)
                      ++|+|+|.|+|||+|
T Consensus        65 ~~v~I~V~D~Nd~~P   79 (79)
T smart00112       65 ATVTVTVLDVNDNAP   79 (79)
T ss_pred             EEEEEEEEECCCCCC
Confidence            999999999999998


No 11 
>KOG1834|consensus
Probab=99.08  E-value=1.2e-09  Score=94.66  Aligned_cols=106  Identities=25%  Similarity=0.287  Sum_probs=86.6

Q ss_pred             eEEEEEEEecCCCCCccCCCCCCCCCceEEEEecCCCCCcEEEEEEEEeCCCCc-----eeEEEecCCCcEEEeCCccce
Q psy9046           2 GTLIISIIDVNDIPPEFEPPWTRDEPNYLLEIEEEQPIGSLLGKFTALDKDSSI-----ALYVLEQDGDMFSVNNITVCE   76 (215)
Q Consensus         2 ~~v~I~V~DvND~~P~f~~~~~~~~~~~~~~V~E~~~~g~~v~~v~a~D~D~~~-----~~y~i~~~~~~F~id~~tg~~   76 (215)
                      ++|.|+|.|+|.++|+|      ..+.|.+.|.|.-- -..|+++.|.|.|.+.     -.|.|..++-.|.||.. |.+
T Consensus       133 atvhIrVkDvNe~AP~f------~ep~Yka~V~EGK~-yd~il~veAiD~DCspq~sqIC~YEI~t~d~PFaIdn~-G~i  204 (952)
T KOG1834|consen  133 ATVHIRVKDVNEFAPVF------KEPWYKAHVTEGKV-YDSILRVEAIDKDCSPQYSQICEYEITTPDVPFAIDND-GNI  204 (952)
T ss_pred             eEEEEEeccccccCchh------cccceeeEEeccee-eeeeEEEEeecCCCCCcccceeEEEecCCCCceEEcCC-Ccc
Confidence            68999999999999999      68999999999854 5568999999999853     25999999999999984 665


Q ss_pred             EEEEEEEecCCCcceeeEEEEecCCCCCeEEEEEEEEeCCCC--ceEEEEEEEEeeC
Q psy9046          77 FQLSVDLAAADNFQSEYNATIKENSPGDTKVVLVSAEDASTP--CSHFRVNIKVLDV  131 (215)
Q Consensus        77 ~~~~~~~~ld~e~~~~~~~~~~e~~~~~~~~l~v~a~D~~~~--~~~~~v~I~V~Dv  131 (215)
                      ....   +|+|.+...|.             |+|.|.|.|..  ...+.|+|+|...
T Consensus       205 rnTe---kLny~ke~~Y~-------------ltVtAyDCg~kraa~d~lV~v~Vkp~  245 (952)
T KOG1834|consen  205 RNTE---KLNYTKEHQYK-------------LTVTAYDCGKKRAASDSLVTVHVKPT  245 (952)
T ss_pred             cccc---ccccccceeEE-------------EEEEEEecccccccCcceEEEEecCc
Confidence            5444   69998888888             79999999865  3335677777543


No 12 
>smart00112 CA Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
Probab=98.79  E-value=2e-08  Score=65.88  Aligned_cols=51  Identities=45%  Similarity=0.775  Sum_probs=44.6

Q ss_pred             EeCCC--CCeEEEEEeeCCCCCcEEEcC-CeeEEEcccCCCC-ccEEEEEEEEEC
Q psy9046         165 TDSDE--NPHLTYKILSGNDDDVFAVDD-TGILTIQGPLAKG-KTVYNLLVQVSD  215 (215)
Q Consensus       165 ~D~D~--~~~i~y~i~~~~~~~~F~Id~-~G~i~~~~~ld~~-~~~y~l~V~a~D  215 (215)
                      +|+|.  |+.++|+|.+++....|.|++ +|.|++.++||++ ...|.|.|+|+|
T Consensus         1 ~D~D~g~n~~i~Y~i~~~~~~~~F~i~~~tg~i~~~~~LD~e~~~~y~l~v~a~D   55 (79)
T smart00112        1 TDADSGENGKVTYSILSGNEDGLFSIDPETGEITTTKPLDREEQPEYTLTVEATD   55 (79)
T ss_pred             CCCCCCcCcEEEEEEecCCCCCEEEEeCCccEEEeCCccCeeCCCeEEEEEEEEE
Confidence            36776  789999999876668999999 9999999999965 789999999987


No 13 
>PF08758 Cadherin_pro:  Cadherin prodomain like;  InterPro: IPR014868 Cadherins are a group of proteins that mediate calcium dependent cell-cell adhesion. They are activated through cleavage of a prosequence in the late Golgi. This protein corresponds to the folded region of the prosequence, and is termed the prodomain. The prodomain shows structural resemblance to the cadherin domain, but lacks all the features known to be important for cadherin-cadherin interactions []. ; GO: 0007155 cell adhesion, 0016021 integral to membrane; PDB: 1OP4_A.
Probab=97.87  E-value=7.4e-05  Score=50.14  Aligned_cols=75  Identities=20%  Similarity=0.296  Sum_probs=40.1

Q ss_pred             CCccCCCceEEEEecCCCCCCCCEEEEEEEEeCCCCCeEEEEEeeCCCCCcEEEcCCeeEEEcccCCCCccEEEEEEEEE
Q psy9046         135 RPEFKSKTYRTSIQENIIITPSPTLLTLEATDSDENPHLTYKILSGNDDDVFAVDDTGILTIQGPLAKGKTVYNLLVQVS  214 (215)
Q Consensus       135 ~P~f~~~~~~~~v~E~~~~g~~~~i~~v~a~D~D~~~~i~y~i~~~~~~~~F~Id~~G~i~~~~~ld~~~~~y~l~V~a~  214 (215)
                      .|-|.+..|.+.|+.+...|  ..|++|.-.|-..+..+.|.-.    +..|.|..+|.|++++++.-....-.+.|.|.
T Consensus         3 ~pGF~~~~~~~~Vp~~l~~g--~~lg~V~f~dC~~~~~~~~~ss----DpdF~V~~DGsVy~~r~v~l~~~~~~F~V~a~   76 (90)
T PF08758_consen    3 RPGFSQKKYTFEVPSNLEAG--QPLGKVNFEDCTGRRRVIFESS----DPDFRVLEDGSVYAKRPVQLSSEQRSFTVHAW   76 (90)
T ss_dssp             --B--S-EEEE----SS-SS----EEE---B--SS---EEEE-------SEEEEETTTEEEEES--S-SSS-EEEEEEEE
T ss_pred             cCCcccceEEEEcCchhhCC--cEEEEEEeccCCCCCceEEecC----CCCEEEcCCCeEEEeeeEecCCCceEEEEEEE
Confidence            58899999999999998766  9999999999876778998764    23799999999999999985444456777776


Q ss_pred             C
Q psy9046         215 D  215 (215)
Q Consensus       215 D  215 (215)
                      |
T Consensus        77 D   77 (90)
T PF08758_consen   77 D   77 (90)
T ss_dssp             E
T ss_pred             C
Confidence            5


No 14 
>PF08266 Cadherin_2:  Cadherin-like;  InterPro: IPR013164 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion. This entry represents a cadherin domain that is usually found at the N terminus of cadherin proteins.; PDB: 1WUZ_A 1WYJ_A.
Probab=97.11  E-value=0.00057  Score=45.24  Aligned_cols=58  Identities=21%  Similarity=0.308  Sum_probs=38.3

Q ss_pred             eEEEEecCCCCCCCCEEEEEEEEeCCC----CCeEEEEEeeCCCCCcEEEcC-CeeEEEcccCCCC
Q psy9046         143 YRTSIQENIIITPSPTLLTLEATDSDE----NPHLTYKILSGNDDDVFAVDD-TGILTIQGPLAKG  203 (215)
Q Consensus       143 ~~~~v~E~~~~g~~~~i~~v~a~D~D~----~~~i~y~i~~~~~~~~F~Id~-~G~i~~~~~ld~~  203 (215)
                      ...+|.|..+.|  +.|+.| |.|.-.    -..-.|+|.+.....+|.++. +|.|++...+|+|
T Consensus         3 i~YsV~EE~~~G--t~IGni-a~dL~l~~~~l~~~~~ri~s~~~~~~~~v~~~tG~L~v~~rIDRE   65 (84)
T PF08266_consen    3 IRYSVPEEMPPG--TVIGNI-AKDLGLDPQSLSSRNFRIVSEGNSQYFRVNEKTGDLFVSERIDRE   65 (84)
T ss_dssp             EEEEEESS--TT---EEEEC-CCCCT--HHHHCCTTBEEE-SSSS-SEEE-TTTSEEEESS--SCC
T ss_pred             eEEEeecCCCCC--CEEEEh-HHhhCCCcccccccceEEeecCCcceeEecCCceeEEeCCccCHH
Confidence            367899999977  999998 555544    123478888777788999999 9999999999964


No 15 
>PF08266 Cadherin_2:  Cadherin-like;  InterPro: IPR013164 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion. This entry represents a cadherin domain that is usually found at the N terminus of cadherin proteins.; PDB: 1WUZ_A 1WYJ_A.
Probab=97.03  E-value=0.0008  Score=44.54  Aligned_cols=56  Identities=21%  Similarity=0.260  Sum_probs=35.0

Q ss_pred             eEEEEecCCCCCcEEEEEEEEeCCCCc-----eeEEEec--CCCcEEEeCCccceEEEEEEEecCCC
Q psy9046          29 YLLEIEEEQPIGSLLGKFTALDKDSSI-----ALYVLEQ--DGDMFSVNNITVCEFQLSVDLAAADN   88 (215)
Q Consensus        29 ~~~~V~E~~~~g~~v~~v~a~D~D~~~-----~~y~i~~--~~~~F~id~~tg~~~~~~~~~~ld~e   88 (215)
                      ...+|+|..+.|+.||.+ |.|.-...     ..|++..  ...+|.++..+|.+++..   ++|||
T Consensus         3 i~YsV~EE~~~Gt~IGni-a~dL~l~~~~l~~~~~ri~s~~~~~~~~v~~~tG~L~v~~---rIDRE   65 (84)
T PF08266_consen    3 IRYSVPEEMPPGTVIGNI-AKDLGLDPQSLSSRNFRIVSEGNSQYFRVNEKTGDLFVSE---RIDRE   65 (84)
T ss_dssp             EEEEEESS--TT-EEEEC-CCCCT--HHHHCCTTBEEE-SSSS-SEEE-TTTSEEEESS-----SCC
T ss_pred             eEEEeecCCCCCCEEEEh-HHhhCCCcccccccceEEeecCCcceeEecCCceeEEeCC---ccCHH
Confidence            357899999999999999 55554321     2477754  358999999999866543   79999


No 16 
>PF13750 Big_3_3:  Bacterial Ig-like domain (group 3)
Probab=96.88  E-value=0.057  Score=40.20  Aligned_cols=114  Identities=17%  Similarity=0.301  Sum_probs=65.2

Q ss_pred             cCCCCCeEEEEE-EEEeCCCCceEEEEEEEEeeCCCCCCccCCCceEEEEecCCCCCCCCEEEEEEEEeCCC-CCeEEEE
Q psy9046          99 ENSPGDTKVVLV-SAEDASTPCSHFRVNIKVLDVNDNRPEFKSKTYRTSIQENIIITPSPTLLTLEATDSDE-NPHLTYK  176 (215)
Q Consensus        99 e~~~~~~~~l~v-~a~D~~~~~~~~~v~I~V~DvNdn~P~f~~~~~~~~v~E~~~~g~~~~i~~v~a~D~D~-~~~i~y~  176 (215)
                      ...+.|.|.+.+ .|.|..+...+..+...+. +...+|.+.- .....+..+..+. +..=..+.++|... ..--..+
T Consensus        10 ~~l~dG~Y~l~~~~a~D~agN~~~~~~~~~~~-iD~T~Ptisi-~~~~~~~~g~~v~-~~~~i~i~~tD~~~~~~i~sv~   86 (158)
T PF13750_consen   10 STLPDGSYTLTVVTATDAAGNTSTSTVSETFT-IDNTPPTISI-SDGASVANGSTVY-GLVNISINVTDNSDDSKITSVS   86 (158)
T ss_pred             CcCCCccEEEEEEEEEecCCCEEEEEEeeEEE-EcCCCCEEEE-ecCCccCCCcccc-ceeeeEEEEEeCCCCceEEEEE
Confidence            456778888999 7999887766666554443 3444776533 1111222332221 12335678887765 3345777


Q ss_pred             EeeCCCCCcEEE--cC--CeeEEEc--ccCC--CCccEEEEEEEEEC
Q psy9046         177 ILSGNDDDVFAV--DD--TGILTIQ--GPLA--KGKTVYNLLVQVSD  215 (215)
Q Consensus       177 i~~~~~~~~F~I--d~--~G~i~~~--~~ld--~~~~~y~l~V~a~D  215 (215)
                      |.+|.....-.+  ..  .|...+.  +.+.  +....|+|+|.|+|
T Consensus        87 l~Gg~~~d~v~ls~~~~~~~~~~~~yp~~fpsle~~~~YtLtV~a~D  133 (158)
T PF13750_consen   87 LTGGPASDSVSLSWTNKGNGVYTLEYPRIFPSLEADDSYTLTVSATD  133 (158)
T ss_pred             EECCcccceEEEeeEeccCceEEeecccccCCcCCCCeEEEEEEEEe
Confidence            776654443333  22  4544443  2222  45789999999998


No 17 
>PF08758 Cadherin_pro:  Cadherin prodomain like;  InterPro: IPR014868 Cadherins are a group of proteins that mediate calcium dependent cell-cell adhesion. They are activated through cleavage of a prosequence in the late Golgi. This protein corresponds to the folded region of the prosequence, and is termed the prodomain. The prodomain shows structural resemblance to the cadherin domain, but lacks all the features known to be important for cadherin-cadherin interactions []. ; GO: 0007155 cell adhesion, 0016021 integral to membrane; PDB: 1OP4_A.
Probab=96.64  E-value=0.034  Score=37.27  Aligned_cols=78  Identities=22%  Similarity=0.208  Sum_probs=36.8

Q ss_pred             CCccCCCCCCCCCceEEEEecCCCCCcEEEEEEEEeCCCCceeEEEecCCCcEEEeCCccceEEEEEEEecCCC-cceee
Q psy9046          15 PPEFEPPWTRDEPNYLLEIEEEQPIGSLLGKFTALDKDSSIALYVLEQDGDMFSVNNITVCEFQLSVDLAAADN-FQSEY   93 (215)
Q Consensus        15 ~P~f~~~~~~~~~~~~~~V~E~~~~g~~v~~v~a~D~D~~~~~y~i~~~~~~F~id~~tg~~~~~~~~~~ld~e-~~~~~   93 (215)
                      -|=|      .+..|.+.|+.+...|..|++|.-.|-.... ...+.+.+..|.|..+ |.+++.+.   +... .+.. 
T Consensus         3 ~pGF------~~~~~~~~Vp~~l~~g~~lg~V~f~dC~~~~-~~~~~ssDpdF~V~~D-GsVy~~r~---v~l~~~~~~-   70 (90)
T PF08758_consen    3 RPGF------SQKKYTFEVPSNLEAGQPLGKVNFEDCTGRR-RVIFESSDPDFRVLED-GSVYAKRP---VQLSSEQRS-   70 (90)
T ss_dssp             --B--------S-EEEE----SS-SS--EEE---B--SS----EEEE---SEEEEETT-TEEEEES-----S-SSS-EE-
T ss_pred             cCCc------ccceEEEEcCchhhCCcEEEEEEeccCCCCC-ceEEecCCCCEEEcCC-CeEEEeee---EecCCCceE-
Confidence            4677      5899999999999999999999998885433 2333344558999984 87766653   3221 2222 


Q ss_pred             EEEEecCCCCCeEEEEEEEEeCCCC
Q psy9046          94 NATIKENSPGDTKVVLVSAEDASTP  118 (215)
Q Consensus        94 ~~~~~e~~~~~~~~l~v~a~D~~~~  118 (215)
                                    +.|.|.|....
T Consensus        71 --------------F~V~a~D~~~~   81 (90)
T PF08758_consen   71 --------------FTVHAWDSQTQ   81 (90)
T ss_dssp             --------------EEEEEEETTTT
T ss_pred             --------------EEEEEECCCCC
Confidence                          57899998764


No 18 
>smart00736 CADG Dystroglycan-type cadherin-like domains. Cadherin-homologous domains present in metazoan dystroglycans and alpha/epsilon sarcoglycans, yeast Axl2p and in a very large protein from magnetotactic bacteria. Likely to bind calcium ions.
Probab=96.17  E-value=0.11  Score=35.25  Aligned_cols=68  Identities=19%  Similarity=0.161  Sum_probs=47.1

Q ss_pred             EEeCCCCceeEEEec-----CCCcEEEeCCccceEEEEEEEecCCCcceeeEEEEecCCCCCeEEEEEEEEeCCCCceEE
Q psy9046          48 ALDKDSSIALYVLEQ-----DGDMFSVNNITVCEFQLSVDLAAADNFQSEYNATIKENSPGDTKVVLVSAEDASTPCSHF  122 (215)
Q Consensus        48 a~D~D~~~~~y~i~~-----~~~~F~id~~tg~~~~~~~~~~ld~e~~~~~~~~~~e~~~~~~~~l~v~a~D~~~~~~~~  122 (215)
                      ..|+|....+|++..     -..|...++.++.+.- .   +.... ...|.             +.|.|+|+.+.+...
T Consensus        24 F~d~d~~~lty~~~~~~~~~lP~Wl~fd~~~~~~~G-t---P~~~~-~g~~~-------------i~v~a~D~~g~~~~~   85 (97)
T smart00736       24 FTDADGDTLTYSATLSDGSALPSWLSFDSDTGTLSG-T---PTNSD-VGSLS-------------LKVTATDSSGASASD   85 (97)
T ss_pred             eECCCCCeEEEEEEeCCCCCCCCeEEEeCCCCEEEE-E---CCCCC-CcEEE-------------EEEEEEECCCCEEEE
Confidence            467886666788742     1368899888765322 2   22222 23355             799999998778899


Q ss_pred             EEEEEEeeCCC
Q psy9046         123 RVNIKVLDVND  133 (215)
Q Consensus       123 ~v~I~V~DvNd  133 (215)
                      .+.|.|.+.|+
T Consensus        86 ~f~i~V~~~~~   96 (97)
T smart00736       86 TFTITVVNTND   96 (97)
T ss_pred             EEEEEEeCCCC
Confidence            99999999987


No 19 
>TIGR01965 VCBS_repeat VCBS repeat. This domain of about 100 residues is found multiple (up to 35) copies in long proteins from several species of Vibrio, Colwellia, Bradyrhizobium, and Shewanella (hence the name VCBS) and in smaller copy numbers in proteins from several other bacteria. The large protein size and repeat copy numbers, species distribution, and suggested activities of several member proteins suggests a role for this domain in adhesion.
Probab=95.79  E-value=0.15  Score=34.78  Aligned_cols=86  Identities=20%  Similarity=0.221  Sum_probs=51.4

Q ss_pred             EEEEEEeCCCCcee-EEEecC---CCcEEEeCCccceEEEEEEE------ecCCCcceeeEEEEecCCCCCeEEEEEEEE
Q psy9046          44 GKFTALDKDSSIAL-YVLEQD---GDMFSVNNITVCEFQLSVDL------AAADNFQSEYNATIKENSPGDTKVVLVSAE  113 (215)
Q Consensus        44 ~~v~a~D~D~~~~~-y~i~~~---~~~F~id~~tg~~~~~~~~~------~ld~e~~~~~~~~~~e~~~~~~~~l~v~a~  113 (215)
                      +++.++|+|.+... +++...   -+.|.|++ .|. +.-....      .|...+...             -.+++.+.
T Consensus         2 G~Lt~sD~D~gd~~~~s~~~~~g~yGtlti~~-~G~-wtYtl~n~~~avq~L~~Ge~~t-------------dsFtvtv~   66 (99)
T TIGR01965         2 GQLTISDADAGQAHFIAQTDAAGQYGTFSIDA-DGQ-WTYQADNSQTAVQALKAGETLT-------------DTFTVTSA   66 (99)
T ss_pred             CceEEeCCCCCCceEEecccccCCcEEEEECC-CCc-EEEEeCCCcHHHHhhcCCCEEE-------------EEEEEEEe
Confidence            46889999987653 444222   25688877 564 2222100      111111111             22678888


Q ss_pred             eCCCCceEEEEEEEEeeCCCCCCccCCCceEEEEecC
Q psy9046         114 DASTPCSHFRVNIKVLDVNDNRPEFKSKTYRTSIQEN  150 (215)
Q Consensus       114 D~~~~~~~~~v~I~V~DvNdn~P~f~~~~~~~~v~E~  150 (215)
                      |+    .+.+|.|.|.-.|| +|+..... ...+.|+
T Consensus        67 DG----tt~~vtItI~GtND-apvi~~~~-~g~v~ED   97 (99)
T TIGR01965        67 DG----TSQTVTITITGAND-AAVIGGAD-TGSVTED   97 (99)
T ss_pred             CC----CeEEEEEEEEccCC-CCEEeccc-ceeEecC
Confidence            86    38899999999999 78665432 5666665


No 20 
>PF05345 He_PIG:  Putative Ig domain;  InterPro: IPR008009 This alignment represents the conserved core region of a ~90 residue repeat found in several haemagglutinins and other cell surface proteins. Sequence similarities to Hyalin (IPR003410 from INTERPRO) and the PKD domain (IPR000601 from INTERPRO) suggest an Ig-like fold so this family may be similar in function to the (IPR003791 from INTERPRO) and (IPR003790 from INTERPRO) protein families.
Probab=93.55  E-value=0.37  Score=28.30  Aligned_cols=34  Identities=18%  Similarity=0.213  Sum_probs=26.2

Q ss_pred             CCCcEEEcC-CeeEEEcccCCCCccEEEEEEEEEC
Q psy9046         182 DDDVFAVDD-TGILTIQGPLAKGKTVYNLLVQVSD  215 (215)
Q Consensus       182 ~~~~F~Id~-~G~i~~~~~ld~~~~~y~l~V~a~D  215 (215)
                      -.....||+ +|.|.-...-..+.+.|.+.|.|+|
T Consensus        12 LP~gLs~d~~tG~isGtp~~~~~~G~y~~~vtatd   46 (49)
T PF05345_consen   12 LPSGLSLDPSTGTISGTPTSSVQPGTYTFTVTATD   46 (49)
T ss_pred             CCCcEEEeCCCCEEEeecCCCccccEEEEEEEEEc
Confidence            356688999 9999877444434579999999987


No 21 
>smart00736 CADG Dystroglycan-type cadherin-like domains. Cadherin-homologous domains present in metazoan dystroglycans and alpha/epsilon sarcoglycans, yeast Axl2p and in a very large protein from magnetotactic bacteria. Likely to bind calcium ions.
Probab=93.05  E-value=0.78  Score=30.95  Aligned_cols=51  Identities=27%  Similarity=0.313  Sum_probs=37.9

Q ss_pred             EEEeCCCCCeEEEEEeeCC---CCCcEEEcC-CeeEEEcccCCCCccEEEEEEEEEC
Q psy9046         163 EATDSDENPHLTYKILSGN---DDDVFAVDD-TGILTIQGPLAKGKTVYNLLVQVSD  215 (215)
Q Consensus       163 ~a~D~D~~~~i~y~i~~~~---~~~~F~Id~-~G~i~~~~~ld~~~~~y~l~V~a~D  215 (215)
                      ...|+| +..++|++...+   ...|...|+ ++.+.= .+.....+.|.+.|.|+|
T Consensus        23 tF~d~d-~~~lty~~~~~~~~~lP~Wl~fd~~~~~~~G-tP~~~~~g~~~i~v~a~D   77 (97)
T smart00736       23 TFTDAD-GDTLTYSATLSDGSALPSWLSFDSDTGTLSG-TPTNSDVGSLSLKVTATD   77 (97)
T ss_pred             ceECCC-CCeEEEEEEeCCCCCCCCeEEEeCCCCEEEE-ECCCCCCcEEEEEEEEEE
Confidence            457777 779999987532   246888898 887766 455544677999999987


No 22 
>PF07495 Y_Y_Y:  Y_Y_Y domain;  InterPro: IPR011123 This region is mostly found at the end of the beta propellers (IPR011110 from INTERPRO) in a family of two component regulators. However they are also found tandemly repeated in Q891H4 from SWISSPROT without other signal conduction domains being present. It is named after the conserved tyrosines found in the alignment. The exact function is not known.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=84.18  E-value=2.6  Score=25.86  Aligned_cols=30  Identities=23%  Similarity=0.293  Sum_probs=18.5

Q ss_pred             CCCCCeEEEEEEEEeCCCC--ceEEEEEEEEe
Q psy9046         100 NSPGDTKVVLVSAEDASTP--CSHFRVNIKVL  129 (215)
Q Consensus       100 ~~~~~~~~l~v~a~D~~~~--~~~~~v~I~V~  129 (215)
                      +.++|.|.|.|+|.|..+.  ....++.|+|+
T Consensus        35 ~L~~G~Y~l~V~a~~~~~~~~~~~~~l~i~I~   66 (66)
T PF07495_consen   35 NLPPGKYTLEVRAKDNNGKWSSDEKSLTITIL   66 (66)
T ss_dssp             S--SEEEEEEEEEEETTS-B-SS-EEEEEEEE
T ss_pred             eCCCEEEEEEEEEECCCCCcCcccEEEEEEEC
Confidence            3466777799999997655  22266776663


No 23 
>KOG3597|consensus
Probab=82.58  E-value=23  Score=30.98  Aligned_cols=55  Identities=27%  Similarity=0.126  Sum_probs=38.1

Q ss_pred             CeEEEEEEEecCCCCCccCCCCCCCCCceEEEEecCCCCCcEEEEEEEEeCCCCce--eEEEe
Q psy9046           1 MGTLIISIIDVNDIPPEFEPPWTRDEPNYLLEIEEEQPIGSLLGKFTALDKDSSIA--LYVLE   61 (215)
Q Consensus         1 t~~v~I~V~DvND~~P~f~~~~~~~~~~~~~~V~E~~~~g~~v~~v~a~D~D~~~~--~y~i~   61 (215)
                      |....|.|.-+||.+..+.      ...+.+.+.|+...-.....+.+.|+|.+..  .|++.
T Consensus        25 ~~~~~i~v~pvndpp~~~~------~~~~~l~~~~~~~k~l~~~~l~~~d~d~~~~~l~f~v~   81 (442)
T KOG3597|consen   25 TDVLRIHVNPVNDPPSLIF------PSGSLLVILEGGQKVLDPELLTAADPDSAPLPLEFQVL   81 (442)
T ss_pred             EeeecccccccCCCcceee------cccceEEeecCCceeccceEeeccCCCCCccceEEEEc
Confidence            3567889999999777663      4445588888866444456688888888543  46664


No 24 
>PF12245 Big_3_2:  Bacterial Ig-like domain (group 3);  InterPro: IPR022038  This family of proteins is found in bacteria. They have two conserved sequence motifs: AGN and GMT. 
Probab=80.37  E-value=10  Score=23.09  Aligned_cols=35  Identities=23%  Similarity=0.250  Sum_probs=24.6

Q ss_pred             ecCCCCCeEEEEEEEEeCCCCceEEEEEEEEeeCC
Q psy9046          98 KENSPGDTKVVLVSAEDASTPCSHFRVNIKVLDVN  132 (215)
Q Consensus        98 ~e~~~~~~~~l~v~a~D~~~~~~~~~v~I~V~DvN  132 (215)
                      ++....|.|.+.+.+.|..+........+.+.|..
T Consensus        17 P~~~~dg~yt~~v~a~D~AGN~~~~~~~~~i~d~~   51 (60)
T PF12245_consen   17 PENDADGEYTLTVTATDKAGNTSSSTTQIVIVDNT   51 (60)
T ss_pred             cCccCCccEEEEEEEEECCCCEEEeeeEEEEEcCC
Confidence            34444556779999999988767666666666654


No 25 
>TIGR01965 VCBS_repeat VCBS repeat. This domain of about 100 residues is found multiple (up to 35) copies in long proteins from several species of Vibrio, Colwellia, Bradyrhizobium, and Shewanella (hence the name VCBS) and in smaller copy numbers in proteins from several other bacteria. The large protein size and repeat copy numbers, species distribution, and suggested activities of several member proteins suggests a role for this domain in adhesion.
Probab=79.40  E-value=9.6  Score=25.97  Aligned_cols=37  Identities=19%  Similarity=0.206  Sum_probs=24.1

Q ss_pred             EEEEEEeCCCCCeEEEEEeeC-CCCCcEEEcCCeeEEE
Q psy9046         160 LTLEATDSDENPHLTYKILSG-NDDDVFAVDDTGILTI  196 (215)
Q Consensus       160 ~~v~a~D~D~~~~i~y~i~~~-~~~~~F~Id~~G~i~~  196 (215)
                      +++.++|+|.+....+++... ...+.|.|+++|....
T Consensus         2 G~Lt~sD~D~gd~~~~s~~~~~g~yGtlti~~~G~wtY   39 (99)
T TIGR01965         2 GQLTISDADAGQAHFIAQTDAAGQYGTFSIDADGQWTY   39 (99)
T ss_pred             CceEEeCCCCCCceEEecccccCCcEEEEECCCCcEEE
Confidence            467888999766666666432 2346688887775544


No 26 
>PF15418 DUF4625:  Domain of unknown function (DUF4625)
Probab=74.24  E-value=11  Score=27.19  Aligned_cols=36  Identities=17%  Similarity=0.255  Sum_probs=29.6

Q ss_pred             eEEEEecCCCCCeEEEEEEEEeCCCCceEEEEEEEE
Q psy9046          93 YNATIKENSPGDTKVVLVSAEDASTPCSHFRVNIKV  128 (215)
Q Consensus        93 ~~~~~~e~~~~~~~~l~v~a~D~~~~~~~~~v~I~V  128 (215)
                      ..+.+++..+.|.|.+.+.++|..+........|+|
T Consensus        96 ~~i~IPa~a~~G~YH~~i~VtD~~Gn~~~~~~~i~I  131 (132)
T PF15418_consen   96 EHIDIPADAPAGDYHFMITVTDAAGNQTEEERSIKI  131 (132)
T ss_pred             EeeeCCCCCCCcceEEEEEEEECCCCEEEEEEEEEE
Confidence            356788999999999999999998876666666665


No 27 
>PF02494 HYR:  HYR domain;  InterPro: IPR003410 This domain is known as the HYR (Hyalin Repeat) domain, after the protein hyalin that is composed exclusively of this repeat. This domain probably corresponds to a new superfamily in the immunoglobulin fold. The function of this domain is uncertain it may be involved in cell adhesion. In the Sushi repeat-containing protein (SrpX), this domain is found between two sushi repeats.
Probab=70.00  E-value=10  Score=24.37  Aligned_cols=25  Identities=24%  Similarity=0.279  Sum_probs=19.9

Q ss_pred             CeEEEEEEEEeCCCCceEEEEEEEE
Q psy9046         104 DTKVVLVSAEDASTPCSHFRVNIKV  128 (215)
Q Consensus       104 ~~~~l~v~a~D~~~~~~~~~v~I~V  128 (215)
                      |.+.++..|+|..+..+++++.|+|
T Consensus        57 G~t~V~ytA~D~~GN~a~C~f~V~V   81 (81)
T PF02494_consen   57 GTTTVTYTATDAAGNSATCSFTVTV   81 (81)
T ss_pred             ceEEEEEEEEECCCCEEEEEEEEEC
Confidence            4455889999988778888888875


No 28 
>KOG3597|consensus
Probab=69.55  E-value=33  Score=29.99  Aligned_cols=59  Identities=22%  Similarity=0.098  Sum_probs=41.8

Q ss_pred             eEEEEEEEEeeCCCCCCccCCCceEEEEecCCCCCCCCEEEEEEEEeCCC-CCeEEEEEeeC
Q psy9046         120 SHFRVNIKVLDVNDNRPEFKSKTYRTSIQENIIITPSPTLLTLEATDSDE-NPHLTYKILSG  180 (215)
Q Consensus       120 ~~~~v~I~V~DvNdn~P~f~~~~~~~~v~E~~~~g~~~~i~~v~a~D~D~-~~~i~y~i~~~  180 (215)
                      .+....|+|.-+||.+..+...-+.+-+.|+..--  ..--.+++.|+|. -..+.|++.+.
T Consensus        24 ~~~~~~i~v~pvndpp~~~~~~~~~l~~~~~~~k~--l~~~~l~~~d~d~~~~~l~f~v~~t   83 (442)
T KOG3597|consen   24 QTDVLRIHVNPVNDPPSLIFPSGSLLVILEGGQKV--LDPELLTAADPDSAPLPLEFQVLGT   83 (442)
T ss_pred             EEeeecccccccCCCcceeecccceEEeecCCcee--ccceEeeccCCCCCccceEEEEccC
Confidence            56678999999999666666655666666654311  2234688899998 77899999863


No 29 
>PF13754 Big_3_4:  Bacterial Ig-like domain (group 3)
Probab=59.74  E-value=28  Score=20.56  Aligned_cols=17  Identities=29%  Similarity=0.317  Sum_probs=13.3

Q ss_pred             CCCeEEEEEEEEeCCCC
Q psy9046         102 PGDTKVVLVSAEDASTP  118 (215)
Q Consensus       102 ~~~~~~l~v~a~D~~~~  118 (215)
                      ..|.|.+++.|+|..+.
T Consensus        22 ~dG~y~itv~a~D~AGN   38 (54)
T PF13754_consen   22 ADGTYTITVTATDAAGN   38 (54)
T ss_pred             CCccEEEEEEEEeCCCC
Confidence            45667789999998765


No 30 
>cd00146 PKD polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.
Probab=59.17  E-value=27  Score=22.09  Aligned_cols=27  Identities=19%  Similarity=0.130  Sum_probs=17.3

Q ss_pred             CCCeEEEEEEEEeCCCCceEEEEEEEE
Q psy9046         102 PGDTKVVLVSAEDASTPCSHFRVNIKV  128 (215)
Q Consensus       102 ~~~~~~l~v~a~D~~~~~~~~~v~I~V  128 (215)
                      ..|.|.+++.++|..+.+...++.|.|
T Consensus        55 ~~G~y~v~l~v~d~~g~~~~~~~~V~V   81 (81)
T cd00146          55 KPGTYTVTLTVTNAVGSSSTKTTTVVV   81 (81)
T ss_pred             CCcEEEEEEEEEeCCCCEEEEEEEEEC
Confidence            345555899999986545554665543


No 31 
>PF05895 DUF859:  Siphovirus protein of unknown function (DUF859);  InterPro: IPR008577 This entry is represented by Streptococcus phage 7201, Orf39. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This family consists of several uncharacterised proteins from a number of the Siphoviruses as well as some bacterial proteins from Streptococcus species. Some of the members of this family are described as putative minor structural proteins.
Probab=58.91  E-value=1.5e+02  Score=27.34  Aligned_cols=34  Identities=15%  Similarity=0.245  Sum_probs=26.9

Q ss_pred             CeEEEEEEEEeCCCC-ceEEEEEEEEeeCCCCCCccC
Q psy9046         104 DTKVVLVSAEDASTP-CSHFRVNIKVLDVNDNRPEFK  139 (215)
Q Consensus       104 ~~~~l~v~a~D~~~~-~~~~~v~I~V~DvNdn~P~f~  139 (215)
                      |...+++.++|..+. +......|+|++-.  +|.+.
T Consensus       298 G~~Ti~atVtDSRGr~S~~~~~tItVl~Y~--~P~ls  332 (624)
T PF05895_consen  298 GSATIRATVTDSRGRTSDPKTKTITVLEYS--PPTLS  332 (624)
T ss_pred             ceEEEEEEEEECCCccCCceEEEEEEEEcC--CCcEE
Confidence            566789999998876 45678999999876  77773


No 32 
>PF13753 SWM_repeat:  Putative flagellar system-associated repeat
Probab=56.40  E-value=1.1e+02  Score=25.14  Aligned_cols=104  Identities=21%  Similarity=0.252  Sum_probs=51.3

Q ss_pred             CCeEEEEEEEEeCCCCceEEEEEEEEeeCCCCCCccCCCceEEEEecCCCC-C---CCCEEEEEEEEeCCCCCeEEEEEe
Q psy9046         103 GDTKVVLVSAEDASTPCSHFRVNIKVLDVNDNRPEFKSKTYRTSIQENIII-T---PSPTLLTLEATDSDENPHLTYKIL  178 (215)
Q Consensus       103 ~~~~~l~v~a~D~~~~~~~~~v~I~V~DvNdn~P~f~~~~~~~~v~E~~~~-g---~~~~i~~v~a~D~D~~~~i~y~i~  178 (215)
                      .+.|.+.+.++|..+.....+..|.|--.   +|...-.    .+.++... +   .....+...+++.+.+..+...+ 
T Consensus        11 d~~~~v~vt~tD~aGN~~~~t~~~~vDt~---~P~v~i~----~~~~~~~~~~~~~~~~~t~s~tvs~~~~g~~v~v~~-   82 (317)
T PF13753_consen   11 DGTYTVSVTVTDAAGNTSTATQSITVDTT---APTVTIT----SIADDDIINGDEATNTVTFSGTVSGAEPGSTVTVTI-   82 (317)
T ss_pred             CCcEEEEEEEEeCCCCeeeeeEEEEEecC---CCceeee----cccCCCccccceeeeeeEEEEEecCCCCCCEEEEEE-
Confidence            45566899999987765555655553322   6643321    11111110 0   01234555555555455666666 


Q ss_pred             eCCCCCcEEEcCCeeEEEcccC--CCCccEEEEEEE-EEC
Q psy9046         179 SGNDDDVFAVDDTGILTIQGPL--AKGKTVYNLLVQ-VSD  215 (215)
Q Consensus       179 ~~~~~~~F~Id~~G~i~~~~~l--d~~~~~y~l~V~-a~D  215 (215)
                      .+ ....+..+.+|...+.-..  .-..+.|.+.+. ++|
T Consensus        83 ~g-~~~t~~~~~~G~ws~t~~~~~~l~~g~~ti~v~~~tD  121 (317)
T PF13753_consen   83 NG-TTGTLTADADGNWSVTVTPSDDLPDGDYTITVTTVTD  121 (317)
T ss_pred             CC-EEEEEEEecCCcEEEeeccccccccCcceeEEEEEEc
Confidence            21 1122333335543322111  123567888887 776


No 33 
>PF03160 Calx-beta:  Calx-beta domain;  InterPro: IPR003644 The calx-beta motif is present as a tandem repeat in the cytoplasmic domains of Calx Na-Ca exchangers, which are used to expel calcium from cells. This motif overlaps domains used for calcium binding and regulation. The calx-beta motif is also present in the cytoplasmic tail of mammalian integrin-beta4, which mediates the bi-directional transfer of signals across the plasma membrane, as well as in some cyanobacterial proteins. This motif contains a series of beta-strands and turns that form a self-contained beta-sheet [, ].; GO: 0007154 cell communication, 0016021 integral to membrane; PDB: 3H6A_B 3FSO_A 3FQ4_B 2DPK_A 2QVM_A 3GIN_B 2QVK_A 2FWU_A 2FWS_A 3E9U_A ....
Probab=55.84  E-value=57  Score=21.57  Aligned_cols=50  Identities=14%  Similarity=0.250  Sum_probs=29.2

Q ss_pred             EEEEeeCCCCCCccCCCceEEEEecCCCCCCCCEEEEEEEEeCCC--CCeEEEEEeeC
Q psy9046         125 NIKVLDVNDNRPEFKSKTYRTSIQENIIITPSPTLLTLEATDSDE--NPHLTYKILSG  180 (215)
Q Consensus       125 ~I~V~DvNdn~P~f~~~~~~~~v~E~~~~g~~~~i~~v~a~D~D~--~~~i~y~i~~~  180 (215)
                      +|.|.| ||.+ .+.-..-...+.|+..    ..-..|.-..++.  .-.+.|+..++
T Consensus         2 tvtI~d-~d~~-~v~f~~~~~~v~E~~~----~~~v~V~~~~~~~~~~v~v~~~~~~g   53 (100)
T PF03160_consen    2 TVTILD-DDDP-TVSFSSPSYTVSEGDG----TVTVTVTRSGGSLDGPVTVNYSTVDG   53 (100)
T ss_dssp             EEEEE--TTSE-EEEESSSEEEEETTSS----EEEEEEEEESS-TSSEEEEEEEEEES
T ss_pred             EEEEEC-CCCC-EEEEeCCEEEEEeCCC----EEEEEEEEcccCCCcceEEEEEEeCC
Confidence            567778 6644 6665544677888752    4445555554543  44677777765


No 34 
>TIGR00845 caca sodium/calcium exchanger 1. This model is specific for the eukaryotic sodium ion/calcium ion exchangers of the Caca family
Probab=54.75  E-value=2.1e+02  Score=27.77  Aligned_cols=54  Identities=15%  Similarity=0.258  Sum_probs=31.9

Q ss_pred             EEEEEEEeeCCCCCCccCCCceEEEEecCCCCCCCCEEEEE-EEEeCCCCCeEEEEEeeC
Q psy9046         122 FRVNIKVLDVNDNRPEFKSKTYRTSIQENIIITPSPTLLTL-EATDSDENPHLTYKILSG  180 (215)
Q Consensus       122 ~~v~I~V~DvNdn~P~f~~~~~~~~v~E~~~~g~~~~i~~v-~a~D~D~~~~i~y~i~~~  180 (215)
                      ...+|+|.| ||++|.|.-..-...|.|+.  |  ..-.+| +..+.+..-.+.|+-.+|
T Consensus       515 s~ATVTIlD-DD~aGIfsFe~~~~sV~Es~--G--~vtvtV~RtsGa~G~VtV~Y~T~dG  569 (928)
T TIGR00845       515 NTATVTILD-DDHAGIFTFEEDVFHVSESI--G--IMEVKVLRTSGARGTVIVPYRTVEG  569 (928)
T ss_pred             ceEEEEEec-CcccCcccccCceEEEEcCC--C--EEEEEEEEcCCCCeeEEEEEEeecC
Confidence            345666677 78899877655577888874  3  433333 222222133567877765


No 35 
>smart00089 PKD Repeats in polycystic kidney disease 1 (PKD1) and other proteins. Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.
Probab=54.29  E-value=42  Score=20.99  Aligned_cols=26  Identities=19%  Similarity=0.174  Sum_probs=18.6

Q ss_pred             CCCeEEEEEEEEeCCCCceEEEEEEEE
Q psy9046         102 PGDTKVVLVSAEDASTPCSHFRVNIKV  128 (215)
Q Consensus       102 ~~~~~~l~v~a~D~~~~~~~~~v~I~V  128 (215)
                      ..|.|.+++.+.|..+ +.++.+.|.|
T Consensus        53 ~~G~y~v~l~v~n~~g-~~~~~~~i~v   78 (79)
T smart00089       53 KPGTYTVTLTVTNAVG-SASATVTVVV   78 (79)
T ss_pred             CCcEEEEEEEEEcCCC-cEEEEEEEEE
Confidence            3455558999999776 6777777766


No 36 
>TIGR03660 T1SS_rpt_143 T1SS-143 repeat domain. This model represents a domain of about 143 amino acids that may occur singly or in up to 23 tandem repeats in very large proteins in the genus Vibrio, and in related species such as Legionella pneumophila, Photobacterium profundum, Rhodopseudomonas palustris, Shewanella pealeana, and Aeromonas hydrophila. Proteins with these domains represent a subset of a broader set of proteins with a particular signal for type 1 secretion, consisting of several glycine-rich repeats modeled by pfam00353, followed by a C-terminal domain modeled by TIGR03661. Proteins with this domain tend to share several properties with the RtxA (Repeats in Toxin) protein of Vibrio cholerae, including a large size often containing tandemly repeated domains and a C-terminal signal for type 1 secretion.
Probab=52.35  E-value=88  Score=22.65  Aligned_cols=43  Identities=16%  Similarity=0.255  Sum_probs=31.1

Q ss_pred             EEEEEEEEeCCCCceEEEEEEEEeeCCCCCCccCCCceEEEEecCCC
Q psy9046         106 KVVLVSAEDASTPCSHFRVNIKVLDVNDNRPEFKSKTYRTSIQENII  152 (215)
Q Consensus       106 ~~l~v~a~D~~~~~~~~~v~I~V~DvNdn~P~f~~~~~~~~v~E~~~  152 (215)
                      +.|.|.|+|..+-.....+.|+|.|  | .|...... ...|.|...
T Consensus        87 l~~~v~a~D~DGD~s~~~l~VtI~D--D-~P~~~~~~-~~~V~E~~L  129 (137)
T TIGR03660        87 LNFPIIATDFDGDTSSITLPVTIVD--D-VPTITDVD-ALTVDEDDL  129 (137)
T ss_pred             EeeeEEEEeCCCCccccEEEEEEEC--C-CCeecccc-ceEEecccc
Confidence            4478999998776666788888887  6 57776543 478888654


No 37 
>PF13750 Big_3_3:  Bacterial Ig-like domain (group 3)
Probab=47.56  E-value=58  Score=24.13  Aligned_cols=32  Identities=19%  Similarity=0.146  Sum_probs=24.9

Q ss_pred             CCCCCeEEEEEEEEeCCCCceEEEEEEEEeeC
Q psy9046         100 NSPGDTKVVLVSAEDASTPCSHFRVNIKVLDV  131 (215)
Q Consensus       100 ~~~~~~~~l~v~a~D~~~~~~~~~v~I~V~Dv  131 (215)
                      -+..+.|.|+|.|.|..+...+.++.+.....
T Consensus       119 le~~~~YtLtV~a~D~aGN~~~~si~F~y~P~  150 (158)
T PF13750_consen  119 LEADDSYTLTVSATDKAGNQSTKSISFSYMPP  150 (158)
T ss_pred             cCCCCeEEEEEEEEecCCCEEEEEEEEEEeCC
Confidence            34556677999999998888888888877643


No 38 
>PRK14081 triple tyrosine motif-containing protein; Provisional
Probab=47.39  E-value=2.4e+02  Score=26.27  Aligned_cols=32  Identities=16%  Similarity=0.224  Sum_probs=22.1

Q ss_pred             CCCCeEEEEEEEEeCCCC---ceEEEEEEEEeeCC
Q psy9046         101 SPGDTKVVLVSAEDASTP---CSHFRVNIKVLDVN  132 (215)
Q Consensus       101 ~~~~~~~l~v~a~D~~~~---~~~~~v~I~V~DvN  132 (215)
                      .+.|.|.|.+.|.|....   -..+.+...|....
T Consensus       155 ~~~G~Y~ll~~~Kd~~S~~~fDD~~~v~y~Vk~~~  189 (667)
T PRK14081        155 NKPGKYELLVECKRIDSTKDFDDFKKVKFKVKEID  189 (667)
T ss_pred             cCCCcEEEEEEEecCCCccccCcceEEEEEcccCc
Confidence            366888888999987665   45566666665554


No 39 
>TIGR00845 caca sodium/calcium exchanger 1. This model is specific for the eukaryotic sodium ion/calcium ion exchangers of the Caca family
Probab=43.49  E-value=3.2e+02  Score=26.60  Aligned_cols=28  Identities=21%  Similarity=0.329  Sum_probs=18.5

Q ss_pred             EEEEEEEecCCCCCccCCCCCCCCCceEEEEecCC
Q psy9046           3 TLIISIIDVNDIPPEFEPPWTRDEPNYLLEIEEEQ   37 (215)
Q Consensus         3 ~v~I~V~DvND~~P~f~~~~~~~~~~~~~~V~E~~   37 (215)
                      +.+|+|.| ||++|.|.      -..-..+|.|+.
T Consensus       516 ~ATVTIlD-DD~aGIfs------Fe~~~~sV~Es~  543 (928)
T TIGR00845       516 TATVTILD-DDHAGIFT------FEEDVFHVSESI  543 (928)
T ss_pred             eEEEEEec-CcccCccc------ccCceEEEEcCC
Confidence            45677777 67888873      333456788874


No 40 
>PRK13211 N-acetylglucosamine-binding protein A; Reviewed
Probab=37.73  E-value=3e+02  Score=24.56  Aligned_cols=33  Identities=6%  Similarity=0.076  Sum_probs=24.5

Q ss_pred             cCCCCCeEEEEEEEEeCCCC-ceEEEEEEEEeeC
Q psy9046          99 ENSPGDTKVVLVSAEDASTP-CSHFRVNIKVLDV  131 (215)
Q Consensus        99 e~~~~~~~~l~v~a~D~~~~-~~~~~v~I~V~Dv  131 (215)
                      .+...|.|.|.++++|..+. ....++.+.|...
T Consensus       374 s~~~AG~y~Lvv~~t~~dG~~~~q~~~~~~v~~~  407 (478)
T PRK13211        374 SKLKAGHHMLVVKAKPKDGELIKQQTLDFMLEAK  407 (478)
T ss_pred             ccCCCceEEEEEEEEeCCCceeeeeeEEEEEEec
Confidence            55667999999999987655 3467788888643


No 41 
>cd02848 Chitinase_N_term Chitinase N-terminus domain. Chitinases hydrolyze the abundant natural biopolymer chitin, producing smaller chito-oligosaccharides. Chitin consists of multiple N-acetyl-D-glucosamine (NAG) residues connected via beta-1,4-glycosidic linkages and is an important structural element of fungal cell wall and arthropod exoskeletons. On the basis of the mode of chitin hydrolysis, chitinases are classified as random, endo-, and exo-chitinases and based on sequence criteria, chitinases belong to families 18 and 19 of glycosyl hydrolases.  The N-terminus of chitinase may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at  either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitob
Probab=36.14  E-value=61  Score=22.36  Aligned_cols=29  Identities=17%  Similarity=0.100  Sum_probs=20.7

Q ss_pred             CCCeEEEEEEEEeCCCCceEEEEEEEEee
Q psy9046         102 PGDTKVVLVSAEDASTPCSHFRVNIKVLD  130 (215)
Q Consensus       102 ~~~~~~l~v~a~D~~~~~~~~~v~I~V~D  130 (215)
                      +.|.|..+|++.|..+-+....+.|.|-|
T Consensus        78 kgG~y~m~V~lCn~dGCS~S~~~~I~VAD  106 (106)
T cd02848          78 KGGRYQMQVALCNGDGCSTSAAKEIVVAD  106 (106)
T ss_pred             CCCeEEEEEEEECCCCccCcCCEEEEecC
Confidence            44556689999998776666677777654


No 42 
>PF08329 ChitinaseA_N:  Chitinase A, N-terminal domain;  InterPro: IPR013540 This domain is found in a number of bacterial chitinases and similar viral proteins. It is organised into a fibronectin III module domain-like fold, comprising only beta strands. Its function is not known, but it may be involved in interaction with the enzyme substrate, chitin [, ]. It is separated by a hinge region from the catalytic domain (IPR001223 from INTERPRO); this hinge region is probably mobile, allowing the N-terminal domain to have different relative positions in solution []. ; GO: 0004568 chitinase activity; PDB: 2WLY_A 1EDQ_A 2WM0_A 1X6N_A 1NH6_A 2WK2_A 1EHN_A 2WLZ_A 1EIB_A 1FFR_A ....
Probab=35.01  E-value=62  Score=23.35  Aligned_cols=31  Identities=19%  Similarity=0.170  Sum_probs=20.6

Q ss_pred             CCCeEEEEEEEEeCCCCceEEEEEEEEeeCC
Q psy9046         102 PGDTKVVLVSAEDASTPCSHFRVNIKVLDVN  132 (215)
Q Consensus       102 ~~~~~~l~v~a~D~~~~~~~~~v~I~V~DvN  132 (215)
                      +.|.|..+|++.+..+-+....+.|.|.|..
T Consensus        81 ~gG~y~~~VeLCN~~GCS~S~~~~V~VaDTD  111 (133)
T PF08329_consen   81 KGGRYQMQVELCNADGCSTSAPVEVVVADTD  111 (133)
T ss_dssp             S-EEEEEEEEEEETTEEEE---EEEEEE-TT
T ss_pred             CCCEEEEEEEEECCCCcccCCCEEEEEeCCC
Confidence            4566668999999877667778888888875


No 43 
>PF07550 DUF1533:  Protein of unknown function (DUF1533);  InterPro: IPR011432 This domain is found duplicated in proteins of unknown function. The proteins typically also contain leucine-rich repeats.
Probab=25.70  E-value=1.4e+02  Score=18.29  Aligned_cols=33  Identities=12%  Similarity=0.351  Sum_probs=22.5

Q ss_pred             CCCcEEEcCCeeEEEcccCCCCccEEEEEEEEE
Q psy9046         182 DDDVFAVDDTGILTIQGPLAKGKTVYNLLVQVS  214 (215)
Q Consensus       182 ~~~~F~Id~~G~i~~~~~ld~~~~~y~l~V~a~  214 (215)
                      ....|.++..|.|.+...+-.+.+.|.|.|.|.
T Consensus        24 ~~~~y~~~~~~~l~i~~~~f~~~G~~~I~I~A~   56 (65)
T PF07550_consen   24 GNDKYSISSKGSLKIKASAFNKDGENTIVIKAT   56 (65)
T ss_pred             ccccEEeccCCcEEEcHHHcCcCCceEEEEEeC
Confidence            345677766444777766554567899999874


Done!