Query         psy8618
Match_columns 232
No_of_seqs    201 out of 1431
Neff          10.0
Searched_HMMs 46136
Date          Fri Aug 16 23:25:15 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy8618.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/8618hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG4289|consensus              100.0 9.6E-45 2.1E-49  321.2  21.0  225    1-231   220-457 (2531)
  2 KOG4289|consensus              100.0 7.2E-42 1.6E-46  303.1  22.9  222    2-232   326-560 (2531)
  3 KOG1219|consensus              100.0 3.4E-37 7.3E-42  281.2  26.0  225    2-231   903-1142(4289)
  4 KOG1219|consensus              100.0 1.8E-36 3.8E-41  276.6  24.1  221    2-232  1010-1242(4289)
  5 cd00031 CA Cadherin repeat dom 100.0 3.1E-30 6.8E-35  198.2  24.3  171   56-230     2-185 (199)
  6 cd00031 CA Cadherin repeat dom  99.9 7.2E-26 1.6E-30  173.9  21.3  147    1-151    53-199 (199)
  7 KOG1834|consensus               99.8 2.3E-19 4.9E-24  151.4  17.7  187   37-231    20-231 (952)
  8 PF00028 Cadherin:  Cadherin do  99.8   3E-19 6.5E-24  120.5  13.7   92   56-150     1-93  (93)
  9 smart00112 CA Cadherin repeats  99.8 1.3E-17 2.9E-22  108.9  11.0   79   76-157     1-79  (79)
 10 KOG1834|consensus               99.7 7.7E-17 1.7E-21  136.3  15.0  143    2-152    95-245 (952)
 11 PF00028 Cadherin:  Cadherin do  99.4 4.8E-12   1E-16   85.1   8.5   69  162-231    12-81  (93)
 12 smart00112 CA Cadherin repeats  99.2 7.6E-11 1.6E-15   76.7   7.8   60  171-231     1-60  (79)
 13 PF08266 Cadherin_2:  Cadherin-  98.0 1.3E-05 2.7E-10   52.3   4.2   61   57-121     4-66  (84)
 14 PF08758 Cadherin_pro:  Cadheri  97.4   0.003 6.4E-08   41.8   8.8   78   47-136     3-80  (90)
 15 smart00736 CADG Dystroglycan-t  97.2   0.012 2.5E-07   39.6  10.9   73   75-154    24-96  (97)
 16 TIGR01965 VCBS_repeat VCBS rep  96.6   0.035 7.7E-07   37.2   8.9   87   71-170     2-96  (99)
 17 PF08266 Cadherin_2:  Cadherin-  96.1   0.011 2.4E-07   38.5   4.1   32  182-214    35-66  (84)
 18 PF08758 Cadherin_pro:  Cadheri  94.9    0.17 3.6E-06   33.5   6.5   69  156-231     3-82  (90)
 19 PF13750 Big_3_3:  Bacterial Ig  92.9     2.8 6.2E-05   30.8  15.9  129   13-150    14-148 (158)
 20 PF05345 He_PIG:  Putative Ig d  92.3     1.2 2.6E-05   25.7   6.2   37   97-135    11-48  (49)
 21 TIGR00845 caca sodium/calcium   91.6      13 0.00027   35.4  16.1  121   43-170   395-541 (928)
 22 smart00736 CADG Dystroglycan-t  89.5     1.6 3.6E-05   29.0   5.9   31   13-45     66-96  (97)
 23 TIGR01965 VCBS_repeat VCBS rep  88.2     2.6 5.7E-05   28.3   6.0   48    8-64     51-98  (99)
 24 TIGR03660 T1SS_rpt_143 T1SS-14  87.1     4.1 8.8E-05   29.2   6.8   58    2-66     70-130 (137)
 25 PF05345 He_PIG:  Putative Ig d  85.3     5.2 0.00011   23.0   6.1   37  190-228    11-48  (49)
 26 PF13750 Big_3_3:  Bacterial Ig  84.4      14  0.0003   27.2  14.0  103  122-229    14-136 (158)
 27 TIGR03660 T1SS_rpt_143 T1SS-14  77.6      23  0.0005   25.4   7.6   59  109-173    68-129 (137)
 28 PF07495 Y_Y_Y:  Y_Y_Y domain;   77.3      13 0.00028   22.3   8.6   59   83-149     7-65  (66)
 29 PF07495 Y_Y_Y:  Y_Y_Y domain;   72.4      10 0.00022   22.8   4.2   27   14-40     39-65  (66)
 30 TIGR00864 PCC polycystin catio  69.0 1.8E+02  0.0039   31.7  21.0   98  119-227  1564-1681(2740)
 31 KOG4221|consensus               69.0 1.3E+02  0.0027   29.9  17.3  136   85-226   551-695 (1381)
 32 TIGR00845 caca sodium/calcium   67.5 1.2E+02  0.0026   29.2  13.7   56   34-94    515-570 (928)
 33 KOG3597|consensus               64.5      97  0.0021   27.0  16.1   60   32-93     24-83  (442)
 34 TIGR00864 PCC polycystin catio  54.6 3.3E+02  0.0071   29.9  18.1   31  118-151  1479-1509(2740)
 35 smart00089 PKD Repeats in poly  54.1      43 0.00094   20.8   4.8   30  117-149    49-78  (79)
 36 smart00089 PKD Repeats in poly  49.2      60  0.0013   20.1   4.9   28   10-40     51-78  (79)
 37 PF12245 Big_3_2:  Bacterial Ig  48.9      57  0.0012   19.5   5.8   29   13-43     22-50  (60)
 38 cd00146 PKD polycystic kidney   48.4      58  0.0013   20.3   4.8   18   10-27     53-70  (81)
 39 PF02494 HYR:  HYR domain;  Int  47.3      52  0.0011   20.8   4.4   24   15-40     58-81  (81)
 40 PF03160 Calx-beta:  Calx-beta   46.3      83  0.0018   20.6  12.1   52   37-93      2-53  (100)
 41 PF10365 DUF2436:  Domain of un  40.4 1.2E+02  0.0027   21.7   5.5   49  180-229   108-157 (161)
 42 cd02848 Chitinase_N_term Chiti  38.5 1.2E+02  0.0027   20.6   5.1   33  116-150    73-105 (106)
 43 PF05895 DUF859:  Siphovirus pr  37.1 3.4E+02  0.0073   25.0  13.3  120   15-149   299-433 (624)
 44 PF13860 FlgD_ig:  FlgD Ig-like  34.5 1.2E+02  0.0026   19.1   4.9   14  215-228    68-81  (81)
 45 PF09100 Qn_am_d_aIV:  Quinohem  32.1 1.4E+02  0.0031   21.0   4.7   31   17-48    101-132 (133)
 46 PF00801 PKD:  PKD domain;  Int  29.2 1.3E+02  0.0029   18.0   5.2   18  212-229    48-65  (69)
 47 PF13754 Big_3_4:  Bacterial Ig  26.8 1.4E+02   0.003   17.3   5.0   17  121-137    22-38  (54)
 48 COG3354 FlaG Putative archaeal  26.2 2.6E+02  0.0056   20.3   7.9   54  131-187    55-111 (154)
 49 PF14302 DUF4377:  Domain of un  26.1 1.2E+02  0.0026   19.4   3.4   25  208-232    40-64  (80)
 50 PF08329 ChitinaseA_N:  Chitina  25.0 1.6E+02  0.0036   21.0   4.2   35    9-45     78-112 (133)
 51 PF03413 PepSY:  Peptidase prop  24.6 1.4E+02  0.0031   17.2   3.5   14  100-113    49-62  (64)
 52 PF13753 SWM_repeat:  Putative   23.8 4.1E+02  0.0088   21.7  16.4  107   13-137    11-125 (317)

No 1  
>KOG4289|consensus
Probab=100.00  E-value=9.6e-45  Score=321.18  Aligned_cols=225  Identities=36%  Similarity=0.615  Sum_probs=214.7

Q ss_pred             CeEecCCCCcCCCCeEEEEEEEEeCCCCCceeEEEEEEEEEeCCCCCCeecCCCceEEEEecCCCCCcEEEEEEEEeCCC
Q psy8618           1 EISTREPLDREARSSHELVAEARDQGSPPRSSRVAVKISVTDVNDNAPELVDPQEDVISVREEQPPGTEVVRARAVDKDD   80 (232)
Q Consensus         1 ~i~~~~~lD~E~~~~~~l~v~a~d~~~~~~~~~~~v~v~V~dvNd~~P~f~~~~~~~~~v~E~~~~g~~v~~v~a~D~D~   80 (232)
                      .|++++.||||.++.+.|.|.|.|.+.|+++++++|+|.|.|+|||.|+|.+. .|.-++.|+.+.|+.|.++.|+|.|+
T Consensus       220 ~irta~~lDREt~e~HvlrVtA~d~~~P~~SAtttv~V~V~D~nDhsPvFEq~-~Y~e~lREn~evGy~vLtvrAtD~Ds  298 (2531)
T KOG4289|consen  220 AIRTAKSLDRETKETHVLRVTAQDHGDPRRSATTTVTVLVLDTNDHSPVFEQD-EYREELRENLEVGYEVLTVRATDGDS  298 (2531)
T ss_pred             cchhhhhhhhhhhheeEEEEEeeecCCCcccceeEEEEEEeecCCCCcccchh-HHHHHHhhccccCceEEEEEeccCCC
Confidence            36789999999999999999999999999999999999999999999999998 78899999999999999999999999


Q ss_pred             CCCcEEEEEEeeCCCCCCcccEEEeCCceEEEECccCCccCCCEEEEEEEEEeCCCCCcceeEEEEEEEEecCCCCCeee
Q psy8618          81 GSNATISYSILKGRDTDGYGVFTIDPISGVIRTKSVLDHEDRSIYRIAVAATDNGFPSRQTVRLLRVEVLDLNDNRPTFT  160 (232)
Q Consensus        81 ~~~~~v~y~i~~~~~~~~~~~f~id~~tG~i~~~~~ld~e~~~~~~l~v~a~d~~~~~~~~~~~v~i~v~d~Nd~~P~f~  160 (232)
                      ++|+.++|+++.|   +....|.|++.+|.|++..+||||....|++.|.|.|.|.++...++.|.|+|.|.|||+|+|.
T Consensus       299 p~Nani~Yrl~eg---~~~~~f~in~rSGvI~T~a~lDRE~~~~y~L~VeAsDqG~~pgp~Ta~V~itV~D~NDNaPqFs  375 (2531)
T KOG4289|consen  299 PPNANIRYRLLEG---NAKNVFEINPRSGVISTRAPLDREELESYQLDVEASDQGRPPGPRTAMVEITVEDENDNAPQFS  375 (2531)
T ss_pred             CCCCceEEEecCC---CccceeEEcCccceeeccCccCHHhhhheEEEEEeccCCCCCCCceEEEEEEEEecCCCCcccc
Confidence            9999999999988   6788999999999999999999999999999999999999988889999999999999999999


Q ss_pred             CCceEEE-------------EEeecCCCCCCeEEEEEEccCCCCCCcEEEECCceEEEEeecCCCccccEEEEEEEEEEC
Q psy8618         161 SSSLLFK-------------VSASDPDCGVNAMVNYTLGESPSRTNHFYMKSVSGEICISQDLDFESRSSYEFPVVATDR  227 (232)
Q Consensus       161 ~~~~~~~-------------v~~~D~d~~~~~~i~y~l~~~~~~~~~f~i~~~~G~i~~~~~ld~e~~~~y~l~V~a~D~  227 (232)
                      ...|.++             |+++|.|.+.|+.|.|+|.++ +..+.|.|+..+|+|.+..+||+|.. .|.+.|+|.|+
T Consensus       376 e~~Yvvqv~Edvt~~avvlrV~AtDrD~g~Ng~VHYsi~Sg-n~~G~f~id~~tGel~vv~plD~e~~-~ytl~IrAqDg  453 (2531)
T KOG4289|consen  376 EKRYVVQVREDVTPPAVVLRVTATDRDKGTNGKVHYSIASG-NGRGQFYIDSLTGELDVVEPLDFENS-EYTLRIRAQDG  453 (2531)
T ss_pred             ccceEEEecccCCCCceEEEEEecccCCCcCceEEEEeecc-CccccEEEecccceEEEeccccccCC-eeEEEEEcccC
Confidence            9998665             556999999999999999998 88999999999999999999999999 89999999999


Q ss_pred             CccC
Q psy8618         228 GKET  231 (232)
Q Consensus       228 g~~~  231 (232)
                      |.|+
T Consensus       454 grPp  457 (2531)
T KOG4289|consen  454 GRPP  457 (2531)
T ss_pred             CCCC
Confidence            9997


No 2  
>KOG4289|consensus
Probab=100.00  E-value=7.2e-42  Score=303.08  Aligned_cols=222  Identities=39%  Similarity=0.620  Sum_probs=208.8

Q ss_pred             eEecCCCCcCCCCeEEEEEEEEeCCCCCceeEEEEEEEEEeCCCCCCeecCCCceEEEEecCCCCCcEEEEEEEEeCCCC
Q psy8618           2 ISTREPLDREARSSHELVAEARDQGSPPRSSRVAVKISVTDVNDNAPELVDPQEDVISVREEQPPGTEVVRARAVDKDDG   81 (232)
Q Consensus         2 i~~~~~lD~E~~~~~~l~v~a~d~~~~~~~~~~~v~v~V~dvNd~~P~f~~~~~~~~~v~E~~~~g~~v~~v~a~D~D~~   81 (232)
                      |.+..+||||+...|.|.|.|+|.|.++...++.|.|.|.|+|||+|.|... .|.++|.|+..+++.|.++.|+|.|.|
T Consensus       326 I~T~a~lDRE~~~~y~L~VeAsDqG~~pgp~Ta~V~itV~D~NDNaPqFse~-~Yvvqv~Edvt~~avvlrV~AtDrD~g  404 (2531)
T KOG4289|consen  326 ISTRAPLDREELESYQLDVEASDQGRPPGPRTAMVEITVEDENDNAPQFSEK-RYVVQVREDVTPPAVVLRVTATDRDKG  404 (2531)
T ss_pred             eeccCccCHHhhhheEEEEEeccCCCCCCCceEEEEEEEEecCCCCcccccc-ceEEEecccCCCCceEEEEEecccCCC
Confidence            6788999999999999999999999988878999999999999999999998 799999999999999999999999999


Q ss_pred             CCcEEEEEEeeCCCCCCcccEEEeCCceEEEECccCCccCCCEEEEEEEEEeCCCCCcceeEEEEEEEEecCCCCCeeeC
Q psy8618          82 SNATISYSILKGRDTDGYGVFTIDPISGVIRTKSVLDHEDRSIYRIAVAATDNGFPSRQTVRLLRVEVLDLNDNRPTFTS  161 (232)
Q Consensus        82 ~~~~v~y~i~~~~~~~~~~~f~id~~tG~i~~~~~ld~e~~~~~~l~v~a~d~~~~~~~~~~~v~i~v~d~Nd~~P~f~~  161 (232)
                      .|+.+.|+|.+|   +..+-|.||..||+|.+..+||+|.. .|++.|.|.|+|.|+++.+.-+.|+|.|+|||+|.|..
T Consensus       405 ~Ng~VHYsi~Sg---n~~G~f~id~~tGel~vv~plD~e~~-~ytl~IrAqDggrPpLsn~sgl~iqVlDINDhaPifvs  480 (2531)
T KOG4289|consen  405 TNGKVHYSIASG---NGRGQFYIDSLTGELDVVEPLDFENS-EYTLRIRAQDGGRPPLSNTSGLVIQVLDINDHAPIFVS  480 (2531)
T ss_pred             cCceEEEEeecc---CccccEEEecccceEEEeccccccCC-eeEEEEEcccCCCCCccCCCceEEEEEecCCCCceeEe
Confidence            999999999999   77789999999999999999999988 99999999999999999998888999999999999987


Q ss_pred             Cce-------------EEEEEeecCCCCCCeEEEEEEccCCCCCCcEEEECCceEEEEeecCCCccccEEEEEEEEEECC
Q psy8618         162 SSL-------------LFKVSASDPDCGVNAMVNYTLGESPSRTNHFYMKSVSGEICISQDLDFESRSSYEFPVVATDRG  228 (232)
Q Consensus       162 ~~~-------------~~~v~~~D~d~~~~~~i~y~l~~~~~~~~~f~i~~~~G~i~~~~~ld~e~~~~y~l~V~a~D~g  228 (232)
                      ..+             ...+.+.|.|.+.++.+.|++.+    .++|.|+..+|.|.+.|.||||+...|.|.|+|+|+|
T Consensus       481 tpfq~tvlEnv~lg~~v~~vqaidadsg~na~l~y~laG----~~pf~I~~~SG~Itvtk~ldrEt~~~ysl~V~ard~g  556 (2531)
T KOG4289|consen  481 TPFQATVLENVPLGYLVCHVQAIDADSGENARLHYSLAG----VGPFQINNGSGWITVTKELDRETVEHYSLGVEARDHG  556 (2531)
T ss_pred             chhhhhhhhcccccceEEEEecccCCCCcccceeeeecc----CCCeeEecCCceEEEeecccccccceEEEEEEEcCCC
Confidence            654             45677799999999999999976    4599999999999999999999999999999999999


Q ss_pred             ccCC
Q psy8618         229 KETQ  232 (232)
Q Consensus       229 ~~~q  232 (232)
                      .|+.
T Consensus       557 tp~l  560 (2531)
T KOG4289|consen  557 TPPL  560 (2531)
T ss_pred             CCcc
Confidence            9973


No 3  
>KOG1219|consensus
Probab=100.00  E-value=3.4e-37  Score=281.22  Aligned_cols=225  Identities=34%  Similarity=0.485  Sum_probs=211.5

Q ss_pred             eEecCCCCcCCCCeEEEEEEEEeCCCCCceeEEEEEEEEEeCCCC--CCeecCCCceEEEEecCCCCCcEEEEEEEEeCC
Q psy8618           2 ISTREPLDREARSSHELVAEARDQGSPPRSSRVAVKISVTDVNDN--APELVDPQEDVISVREEQPPGTEVVRARAVDKD   79 (232)
Q Consensus         2 i~~~~~lD~E~~~~~~l~v~a~d~~~~~~~~~~~v~v~V~dvNd~--~P~f~~~~~~~~~v~E~~~~g~~v~~v~a~D~D   79 (232)
                      +.+-++||+|+.+.|.|.|.|.|.|.|.+++.+.+.|.|.|+|+|  ||.|..- ....+|.||+|.|+.++.+.|.|.|
T Consensus       903 vfi~~~LDf~k~~fynLsv~a~d~g~p~lss~chl~Vevldv~enlhpp~F~~~-v~e~~V~EnapiGT~vi~i~A~ded  981 (4289)
T KOG1219|consen  903 VFIGKPLDFEKSDFYNLSVTAVDRGTPILSSICHLEVEVLDVNENLHPPEFISF-VTEGHVLENAPIGTIVIRIQARDED  981 (4289)
T ss_pred             EEEecccccccccceEEEEEEecCCCcceeeeEEEEEEEeccCCCCCCcchhee-eeeeeEeecCCcceEEEEEEEecCC
Confidence            567889999999999999999999999899999999999999887  8999866 5679999999999999999999999


Q ss_pred             CCCCcEEEEEEeeCCCCCCcccEEEeCCceEEEECccCCccCCCEEEEEEEEEeCCCCCcceeEEEEEEEEecCCCCCee
Q psy8618          80 DGSNATISYSILKGRDTDGYGVFTIDPISGVIRTKSVLDHEDRSIYRIAVAATDNGFPSRQTVRLLRVEVLDLNDNRPTF  159 (232)
Q Consensus        80 ~~~~~~v~y~i~~~~~~~~~~~f~id~~tG~i~~~~~ld~e~~~~~~l~v~a~d~~~~~~~~~~~v~i~v~d~Nd~~P~f  159 (232)
                      .|-.+.+.|+|..|   +..+.|+|+..+|.|++.+.||||....|.|+|.|.|.|.+++++.+.+.|.|.|+|||+|+|
T Consensus       982 sgldg~l~Y~I~~g---dg~g~FsId~~tG~irTl~~lDrE~ks~YwltveA~D~gt~~~ssv~~vyI~ieDvNDn~Pq~ 1058 (4289)
T KOG1219|consen  982 SGLDGELSYKIRTG---DGDGIFSIDSTTGSIRTLKALDREKKSSYWLTVEAKDLGTVPLSSVCEVYIEIEDVNDNVPQF 1058 (4289)
T ss_pred             CCccceEEEEEEcC---CcceeEEecCCcceEeechhhchhhcceEEEEEEEEecCCCccccceeEEEEEEecCCCCccc
Confidence            99999999999988   778899999999999999999999999999999999999999999999999999999999999


Q ss_pred             eCCceEE-------------EEEeecCCCCCCeEEEEEEccCCCCCCcEEEECCceEEEEeecCCCccccEEEEEEEEEE
Q psy8618         160 TSSSLLF-------------KVSASDPDCGVNAMVNYTLGESPSRTNHFYMKSVSGEICISQDLDFESRSSYEFPVVATD  226 (232)
Q Consensus       160 ~~~~~~~-------------~v~~~D~d~~~~~~i~y~l~~~~~~~~~f~i~~~~G~i~~~~~ld~e~~~~y~l~V~a~D  226 (232)
                      ..+.|..             ++.+.|+|...++.+.|.|..| +..+.|.|++.+|.|...+.||+|++..+.|.|.++|
T Consensus      1059 s~pvy~asI~enSp~~vsivq~ea~D~Dsssn~kLmykI~sG-nyq~FF~Id~~TG~iTt~r~LDRE~qdEHiLeVTi~D 1137 (4289)
T KOG1219|consen 1059 SSPVYYASISENSPETVSIVQAEANDPDSSSNQKLMYKITSG-NYQGFFQIDPETGLITTIRRLDREKQDEHILEVTIQD 1137 (4289)
T ss_pred             CCceEeeeeccCCCCceEEEEeccCCCCcccCcceEEEEccC-CccceEEEccccceeeeehhhcccccccceEEEEEec
Confidence            9888754             4455888988899999999998 8999999999999999999999999999999999999


Q ss_pred             CCccC
Q psy8618         227 RGKET  231 (232)
Q Consensus       227 ~g~~~  231 (232)
                      +|.|.
T Consensus      1138 ~gep~ 1142 (4289)
T KOG1219|consen 1138 NGEPW 1142 (4289)
T ss_pred             CCCCc
Confidence            99996


No 4  
>KOG1219|consensus
Probab=100.00  E-value=1.8e-36  Score=276.60  Aligned_cols=221  Identities=36%  Similarity=0.533  Sum_probs=201.6

Q ss_pred             eEecCCCCcCCCCeEEEEEEEEeCCCCCceeEEEEEEEEEeCCCCCCeecCCCceEEEEecCCCCCcEEEEEEEEeCCCC
Q psy8618           2 ISTREPLDREARSSHELVAEARDQGSPPRSSRVAVKISVTDVNDNAPELVDPQEDVISVREEQPPGTEVVRARAVDKDDG   81 (232)
Q Consensus         2 i~~~~~lD~E~~~~~~l~v~a~d~~~~~~~~~~~v~v~V~dvNd~~P~f~~~~~~~~~v~E~~~~g~~v~~v~a~D~D~~   81 (232)
                      |.+.++||||.+..|-|++.|+|.|.+++++.+.+.|.|.|+|||+|.|..+ .|..+|.|++|.+..|.++.|.|+|..
T Consensus      1010 irTl~~lDrE~ks~YwltveA~D~gt~~~ssv~~vyI~ieDvNDn~Pq~s~p-vy~asI~enSp~~vsivq~ea~D~Dss 1088 (4289)
T KOG1219|consen 1010 IRTLKALDREKKSSYWLTVEAKDLGTVPLSSVCEVYIEIEDVNDNVPQFSSP-VYYASISENSPETVSIVQAEANDPDSS 1088 (4289)
T ss_pred             EeechhhchhhcceEEEEEEEEecCCCccccceeEEEEEEecCCCCcccCCc-eEeeeeccCCCCceEEEEeccCCCCcc
Confidence            6788999999999999999999999999999999999999999999999998 889999999999999999999999988


Q ss_pred             CCcEEEEEEeeCCCCCCcccEEEeCCceEEEECccCCccCCCEEEEEEEEEeCCCCCcceeEEEEEEEEecCCCCCeeeC
Q psy8618          82 SNATISYSILKGRDTDGYGVFTIDPISGVIRTKSVLDHEDRSIYRIAVAATDNGFPSRQTVRLLRVEVLDLNDNRPTFTS  161 (232)
Q Consensus        82 ~~~~v~y~i~~~~~~~~~~~f~id~~tG~i~~~~~ld~e~~~~~~l~v~a~d~~~~~~~~~~~v~i~v~d~Nd~~P~f~~  161 (232)
                      .++++.|.|.+|   +..++|.|++.||.|+..+.||||.+..|.|.|+++|.|.|.+++...|.|.|.|+|||+|+|.+
T Consensus      1089 sn~kLmykI~sG---nyq~FF~Id~~TG~iTt~r~LDRE~qdEHiLeVTi~D~gep~l~s~~rviV~IldvNdnsp~Flq 1165 (4289)
T KOG1219|consen 1089 SNQKLMYKITSG---NYQGFFQIDPETGLITTIRRLDREKQDEHILEVTIQDNGEPWLCSNQRVIVSILDVNDNSPRFLQ 1165 (4289)
T ss_pred             cCcceEEEEccC---CccceEEEccccceeeeehhhcccccccceEEEEEecCCCCccccceEEEEEEeeccCCchhhhh
Confidence            899999999999   78899999999999999999999999999999999999999999999999999999999999998


Q ss_pred             CceEEEEE-----------eecCCCCCCeEEEEEEccCCCCCCcEEEECCceEEEEeecCCCccccEEEEE-EEEEECCc
Q psy8618         162 SSLLFKVS-----------ASDPDCGVNAMVNYTLGESPSRTNHFYMKSVSGEICISQDLDFESRSSYEFP-VVATDRGK  229 (232)
Q Consensus       162 ~~~~~~v~-----------~~D~d~~~~~~i~y~l~~~~~~~~~f~i~~~~G~i~~~~~ld~e~~~~y~l~-V~a~D~g~  229 (232)
                      ..+.++++           +.|.|.+++++|+|+|..| +  ..|.|+..+|.+.. +.++  ..+.|.+. |.|+|+|.
T Consensus      1166 k~~~~~v~~r~s~plyRl~a~d~DeG~narityniedg-d--e~FsID~~t~vVss-s~~~--~~~eydi~~Ikatd~g~ 1239 (4289)
T KOG1219|consen 1166 KKTFLRVPERSSPPLYRLAAQDNDEGNNARITYNIEDG-D--EVFSIDIATGVVSS-STLD--PAGEYDILGIKATDRGA 1239 (4289)
T ss_pred             heeEEEeeeccCCceeEEEEEecCCCcceEEEEecccC-c--eEEEEeccCceEEe-eecc--CCcceeEeeEEEecCCC
Confidence            88766544           5899999999999999976 3  33999999995544 4454  34468887 99999999


Q ss_pred             cCC
Q psy8618         230 ETQ  232 (232)
Q Consensus       230 ~~q  232 (232)
                      |.|
T Consensus      1240 pq~ 1242 (4289)
T KOG1219|consen 1240 PQA 1242 (4289)
T ss_pred             Ccc
Confidence            964


No 5  
>cd00031 CA Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here
Probab=100.00  E-value=3.1e-30  Score=198.18  Aligned_cols=171  Identities=42%  Similarity=0.640  Sum_probs=153.7

Q ss_pred             eEEEEecCCCCCcEEEEEEEEeCCCCCCcEEEEEEeeCCCCCCcccEEEeCCceEEEECccCCccCCCEEEEEEEEEeCC
Q psy8618          56 DVISVREEQPPGTEVVRARAVDKDDGSNATISYSILKGRDTDGYGVFTIDPISGVIRTKSVLDHEDRSIYRIAVAATDNG  135 (232)
Q Consensus        56 ~~~~v~E~~~~g~~v~~v~a~D~D~~~~~~v~y~i~~~~~~~~~~~f~id~~tG~i~~~~~ld~e~~~~~~l~v~a~d~~  135 (232)
                      |.+.++|+.+.|+.++++.|.|+|.+.++.++|+|.++   ....+|.|++.+|.|++.+.||||....|.|.|.|+|.+
T Consensus         2 ~~~~i~En~~~g~~v~~~~a~D~D~~~~~~~~y~i~~~---~~~~~F~i~~~tG~l~~~~~lD~e~~~~~~l~v~a~D~g   78 (199)
T cd00031           2 YSVSVPENAPPGTVVGTVSATDPDSGENGRVTYSILGG---NEDGLFSIDPNTGVITTTKPLDREEQSEYTLTVVASDGG   78 (199)
T ss_pred             eEEEEeCCCCCCCEEEEEEEECCCCCCCceEEEEEeCC---CCcccEEEeCCCCEEEECCCCCCcCCceEEEEEEEEECC
Confidence            57899999999999999999999988888999999876   333799999999999999999999999999999999988


Q ss_pred             CCCcceeEEEEEEEEecCCCCCeeeCCce-------------EEEEEeecCCCCCCeEEEEEEccCCCCCCcEEEECCce
Q psy8618         136 FPSRQTVRLLRVEVLDLNDNRPTFTSSSL-------------LFKVSASDPDCGVNAMVNYTLGESPSRTNHFYMKSVSG  202 (232)
Q Consensus       136 ~~~~~~~~~v~i~v~d~Nd~~P~f~~~~~-------------~~~v~~~D~d~~~~~~i~y~l~~~~~~~~~f~i~~~~G  202 (232)
                      .+.+++...+.|.|.|+||++|.|....|             .+++.+.|+|.+.++.++|+|..+ .....|.|++.+|
T Consensus        79 ~~~~~~~~~v~I~V~d~Nd~~P~~~~~~~~~~v~e~~~~~~~i~~~~a~D~D~~~~~~~~y~l~~~-~~~~~f~i~~~~G  157 (199)
T cd00031          79 GPPLSSTATVTVTVLDVNDNPPVFEQSSYEASVPENAPPGTVVGTVTATDADSGENAKLTYSILSG-NDKELFSIDPNTG  157 (199)
T ss_pred             cCcceeEEEEEEEEccCCCCCCcccccceEEEEeCCCCCCCEEEEEEEEcCCCCCCccEEEEEeCC-CCCCEEEEeCCce
Confidence            88777889999999999999999985554             445666899988889999999985 3358999999999


Q ss_pred             EEEEeecCCCccccEEEEEEEEEECCcc
Q psy8618         203 EICISQDLDFESRSSYEFPVVATDRGKE  230 (232)
Q Consensus       203 ~i~~~~~ld~e~~~~y~l~V~a~D~g~~  230 (232)
                      .|.+.+.||+|....|.|.|.|+|+|.|
T Consensus       158 ~i~~~~~ld~e~~~~~~l~v~a~D~~~~  185 (199)
T cd00031         158 IITLAKPLDREEKSSYELTVVATDGGGP  185 (199)
T ss_pred             EEEeCCccCCccCceEEEEEEEEECCCC
Confidence            9999999999999999999999999854


No 6  
>cd00031 CA Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here
Probab=99.95  E-value=7.2e-26  Score=173.92  Aligned_cols=147  Identities=44%  Similarity=0.651  Sum_probs=134.5

Q ss_pred             CeEecCCCCcCCCCeEEEEEEEEeCCCCCceeEEEEEEEEEeCCCCCCeecCCCceEEEEecCCCCCcEEEEEEEEeCCC
Q psy8618           1 EISTREPLDREARSSHELVAEARDQGSPPRSSRVAVKISVTDVNDNAPELVDPQEDVISVREEQPPGTEVVRARAVDKDD   80 (232)
Q Consensus         1 ~i~~~~~lD~E~~~~~~l~v~a~d~~~~~~~~~~~v~v~V~dvNd~~P~f~~~~~~~~~v~E~~~~g~~v~~v~a~D~D~   80 (232)
                      +|.+.+.||||....|.|.|.|.|.|.+..++...++|.|.|+|||+|.|... .|.+.+.|+.+.|+.++++.|+|+|.
T Consensus        53 ~l~~~~~lD~e~~~~~~l~v~a~D~g~~~~~~~~~v~I~V~d~Nd~~P~~~~~-~~~~~v~e~~~~~~~i~~~~a~D~D~  131 (199)
T cd00031          53 VITTTKPLDREEQSEYTLTVVASDGGGPPLSSTATVTVTVLDVNDNPPVFEQS-SYEASVPENAPPGTVVGTVTATDADS  131 (199)
T ss_pred             EEEECCCCCCcCCceEEEEEEEEECCcCcceeEEEEEEEEccCCCCCCccccc-ceEEEEeCCCCCCCEEEEEEEEcCCC
Confidence            37788999999999999999999988877779999999999999999999976 68899999999999999999999998


Q ss_pred             CCCcEEEEEEeeCCCCCCcccEEEeCCceEEEECccCCccCCCEEEEEEEEEeCCCCCcceeEEEEEEEEe
Q psy8618          81 GSNATISYSILKGRDTDGYGVFTIDPISGVIRTKSVLDHEDRSIYRIAVAATDNGFPSRQTVRLLRVEVLD  151 (232)
Q Consensus        81 ~~~~~v~y~i~~~~~~~~~~~f~id~~tG~i~~~~~ld~e~~~~~~l~v~a~d~~~~~~~~~~~v~i~v~d  151 (232)
                      +.++.++|+|.++   ....+|.|++.+|.|++.+.||+|....|.+.|.|+|.+.+.+++.+.+.|.|.|
T Consensus       132 ~~~~~~~y~l~~~---~~~~~f~i~~~~G~i~~~~~ld~e~~~~~~l~v~a~D~~~~~~~~~~~i~i~v~d  199 (199)
T cd00031         132 GENAKLTYSILSG---NDKELFSIDPNTGIITLAKPLDREEKSSYELTVVATDGGGPPLSSTATVTVTVLD  199 (199)
T ss_pred             CCCccEEEEEeCC---CCCCEEEEeCCceEEEeCCccCCccCceEEEEEEEEECCCCCceeEEEEEEEEEC
Confidence            8889999999766   3357999999999999999999999999999999999988778888888888865


No 7  
>KOG1834|consensus
Probab=99.84  E-value=2.3e-19  Score=151.42  Aligned_cols=187  Identities=22%  Similarity=0.329  Sum_probs=151.1

Q ss_pred             EEEEEeCCCCCCeecCCCceEEEEecCCCCCcEEEEEEEEeCCCC---CCcEEEEEEeeCCCCCCcccEEEeCCc--eEE
Q psy8618          37 KISVTDVNDNAPELVDPQEDVISVREEQPPGTEVVRARAVDKDDG---SNATISYSILKGRDTDGYGVFTIDPIS--GVI  111 (232)
Q Consensus        37 ~v~V~dvNd~~P~f~~~~~~~~~v~E~~~~g~~v~~v~a~D~D~~---~~~~v~y~i~~~~~~~~~~~f~id~~t--G~i  111 (232)
                      .....-+|-+.|+...  .|...|.||...-.....+.|.|.|.+   .+..+.|.|.+. + -......+|..|  |.|
T Consensus        20 ~~~aarankhkpwie~--ey~gvV~Endntvll~Ppl~aLdkdaplr~ageiC~fklhgq-~-vPFdavVvdK~TGegvl   95 (952)
T KOG1834|consen   20 HHHAARANKHKPWIEE--EYHGVVTENDNTVLLDPPLAALDKDAPLRYAGEICGFKLHGQ-P-VPFDAVVVDKYTGEGVL   95 (952)
T ss_pred             ccccccccccCccccc--ceeEEEEeCCceEEeCCCeeeecCCCCcccccccceeEecCC-C-CCceEEEEeccCCceEE
Confidence            3445667888997654  578999999986666677899999874   234467777543 2 233444557666  589


Q ss_pred             EECccCCccCCCEEEEEEEEEeCCCCC------cceeEEEEEEEEecCCCCCeeeCCceEE------------EEEeecC
Q psy8618         112 RTKSVLDHEDRSIYRIAVAATDNGFPS------RQTVRLLRVEVLDLNDNRPTFTSSSLLF------------KVSASDP  173 (232)
Q Consensus       112 ~~~~~ld~e~~~~~~l~v~a~d~~~~~------~~~~~~v~i~v~d~Nd~~P~f~~~~~~~------------~v~~~D~  173 (232)
                      +.+.+||+|.+..|+|+|+|.|+|..+      ++..++|.|+|.|+|+++|.|..+.|.+            .|.+.|.
T Consensus        96 RaK~~lDCelqkeytf~iQAydCg~gpdgtn~kKShkatvhIrVkDvNe~AP~f~ep~Yka~V~EGK~yd~il~veAiD~  175 (952)
T KOG1834|consen   96 RAKEPLDCELQKEYTFTIQAYDCGNGPDGTNTKKSHKATVHIRVKDVNEFAPVFKEPWYKAHVTEGKVYDSILRVEAIDK  175 (952)
T ss_pred             eecCcccccccccceEEEEEEecCCCCCccccccccceEEEEEeccccccCchhcccceeeEEecceeeeeeEEEEeecC
Confidence            999999999999999999999987643      5677899999999999999999988855            4566899


Q ss_pred             CCCC--CeEEEEEEccCCCCCCcEEEECCceEEEEeecCCCccccEEEEEEEEEECCccC
Q psy8618         174 DCGV--NAMVNYTLGESPSRTNHFYMKSVSGEICISQDLDFESRSSYEFPVVATDRGKET  231 (232)
Q Consensus       174 d~~~--~~~i~y~l~~~~~~~~~f~i~~~~G~i~~~~~ld~e~~~~y~l~V~a~D~g~~~  231 (232)
                      |+++  +..+.|.|.   ...-+|.|+ +.|.|+.+.+|+|.....|.|+|+|.|+|+.+
T Consensus       176 DCspq~sqIC~YEI~---t~d~PFaId-n~G~irnTekLny~ke~~Y~ltVtAyDCg~kr  231 (952)
T KOG1834|consen  176 DCSPQYSQICEYEIT---TPDVPFAID-NDGNIRNTEKLNYTKEHQYKLTVTAYDCGKKR  231 (952)
T ss_pred             CCCCcccceeEEEec---CCCCceEEc-CCCccccccccccccceeEEEEEEEEeccccc
Confidence            9987  466799998   468899999 89999999999999999999999999999753


No 8  
>PF00028 Cadherin:  Cadherin domain;  InterPro: IPR002126 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion.; GO: 0005509 calcium ion binding, 0007156 homophilic cell adhesion, 0016020 membrane; PDB: 2A4E_A 2A4C_B 2O72_A 2QVI_A 1NCJ_A 3Q2W_A 3Q2N_A 3LNH_B 3LNI_A 3Q2L_A ....
Probab=99.83  E-value=3e-19  Score=120.48  Aligned_cols=92  Identities=43%  Similarity=0.668  Sum_probs=86.9

Q ss_pred             eEEEEecCCCCCcEEEEEEEEeCCCCCCcEEEEEEeeCCCCCCcccEEEeCCceEEEECccCCccCCCEEEEEEEEEeC-
Q psy8618          56 DVISVREEQPPGTEVVRARAVDKDDGSNATISYSILKGRDTDGYGVFTIDPISGVIRTKSVLDHEDRSIYRIAVAATDN-  134 (232)
Q Consensus        56 ~~~~v~E~~~~g~~v~~v~a~D~D~~~~~~v~y~i~~~~~~~~~~~f~id~~tG~i~~~~~ld~e~~~~~~l~v~a~d~-  134 (232)
                      |.+.|+|+.++|+.++++.|.|+|.+.++.+.|+|..+   +..++|.|++.+|.|++.+.||||....|.|.|.|+|. 
T Consensus         1 Y~~~v~E~~~~g~~v~~v~a~D~D~~~n~~i~y~i~~~---~~~~~F~I~~~tg~i~~~~~LD~E~~~~y~l~v~a~D~~   77 (93)
T PF00028_consen    1 YSFSVPENAPPGTVVGQVTATDPDSGPNSQITYSILGG---NPDGLFSIDPNTGEISLKKPLDRETQSSYQLTVRATDSG   77 (93)
T ss_dssp             EEEEEETTGSTSSEEEEEEEEESSTSTTSSEEEEEEET---TSTTSEEEETTTTEEEESSSSCTTTTSEEEEEEEEEETT
T ss_pred             CEEEEECCCCCCCEEEEEEEEeCCCCCCceEEEEEecC---cccCceEEeeeeeccccceecCcccCCEEEEEEEEEECC
Confidence            67999999999999999999999999999999999987   55889999999999999999999999999999999999 


Q ss_pred             CCCCcceeEEEEEEEE
Q psy8618         135 GFPSRQTVRLLRVEVL  150 (232)
Q Consensus       135 ~~~~~~~~~~v~i~v~  150 (232)
                      +.|++++.+.|.|+|.
T Consensus        78 ~~~~~~~~~~V~I~V~   93 (93)
T PF00028_consen   78 GSPPLSSTATVTINVL   93 (93)
T ss_dssp             TSSEEEEEEEEEEEEE
T ss_pred             CCCCCEEEEEEEEEEC
Confidence            8889999999999874


No 9  
>smart00112 CA Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
Probab=99.76  E-value=1.3e-17  Score=108.90  Aligned_cols=79  Identities=44%  Similarity=0.736  Sum_probs=72.6

Q ss_pred             EeCCCCCCcEEEEEEeeCCCCCCcccEEEeCCceEEEECccCCccCCCEEEEEEEEEeCCCCCcceeEEEEEEEEecCCC
Q psy8618          76 VDKDDGSNATISYSILKGRDTDGYGVFTIDPISGVIRTKSVLDHEDRSIYRIAVAATDNGFPSRQTVRLLRVEVLDLNDN  155 (232)
Q Consensus        76 ~D~D~~~~~~v~y~i~~~~~~~~~~~f~id~~tG~i~~~~~ld~e~~~~~~l~v~a~d~~~~~~~~~~~v~i~v~d~Nd~  155 (232)
                      +|+|.+.++.++|+|.++   ....+|.|++.+|.|++.++||||....|.|.|.|.|.+.|++++.+.|.|+|.|+|||
T Consensus         1 ~D~D~g~n~~i~Y~i~~~---~~~~~F~i~~~tg~i~~~~~LD~e~~~~y~l~v~a~D~~~~~~~~~~~v~I~V~D~Nd~   77 (79)
T smart00112        1 TDADSGENGKVTYSILSG---NEDGLFSIDPETGEITTTKPLDREEQPEYTLTVEATDGGGPPLSSTATVTVTVLDVNDN   77 (79)
T ss_pred             CCCCCCcCcEEEEEEecC---CCCCEEEEeCCccEEEeCCccCeeCCCeEEEEEEEEECCCCCcccEEEEEEEEEECCCC
Confidence            478888889999999876   33389999999999999999999999999999999999998899999999999999999


Q ss_pred             CC
Q psy8618         156 RP  157 (232)
Q Consensus       156 ~P  157 (232)
                      +|
T Consensus        78 ~P   79 (79)
T smart00112       78 AP   79 (79)
T ss_pred             CC
Confidence            98


No 10 
>KOG1834|consensus
Probab=99.74  E-value=7.7e-17  Score=136.35  Aligned_cols=143  Identities=29%  Similarity=0.389  Sum_probs=121.7

Q ss_pred             eEecCCCCcCCCCeEEEEEEEEeCCCC------CceeEEEEEEEEEeCCCCCCeecCCCceEEEEecCCCCCcEEEEEEE
Q psy8618           2 ISTREPLDREARSSHELVAEARDQGSP------PRSSRVAVKISVTDVNDNAPELVDPQEDVISVREEQPPGTEVVRARA   75 (232)
Q Consensus         2 i~~~~~lD~E~~~~~~l~v~a~d~~~~------~~~~~~~v~v~V~dvNd~~P~f~~~~~~~~~v~E~~~~g~~v~~v~a   75 (232)
                      |+.+.+||-|.++.|+|+|+|.|-|..      ..+-.++|+|+|.|+|+++|+|..+ .|.+.|.|.-.. ..|+++.|
T Consensus        95 lRaK~~lDCelqkeytf~iQAydCg~gpdgtn~kKShkatvhIrVkDvNe~AP~f~ep-~Yka~V~EGK~y-d~il~veA  172 (952)
T KOG1834|consen   95 LRAKEPLDCELQKEYTFTIQAYDCGNGPDGTNTKKSHKATVHIRVKDVNEFAPVFKEP-WYKAHVTEGKVY-DSILRVEA  172 (952)
T ss_pred             EeecCcccccccccceEEEEEEecCCCCCccccccccceEEEEEeccccccCchhccc-ceeeEEecceee-eeeEEEEe
Confidence            678899999999999999999996543      3567899999999999999999998 788999998654 55899999


Q ss_pred             EeCCCCC--CcEEEEEEeeCCCCCCcccEEEeCCceEEEECccCCccCCCEEEEEEEEEeCCCCCcceeEEEEEEEEec
Q psy8618          76 VDKDDGS--NATISYSILKGRDTDGYGVFTIDPISGVIRTKSVLDHEDRSIYRIAVAATDNGFPSRQTVRLLRVEVLDL  152 (232)
Q Consensus        76 ~D~D~~~--~~~v~y~i~~~~~~~~~~~f~id~~tG~i~~~~~ld~e~~~~~~l~v~a~d~~~~~~~~~~~v~i~v~d~  152 (232)
                      .|.|-++  ...+.|.|+.     ..-.|.||. .|.|+.+.+|.|.....|.|+|+|.|+|.....+.+.|+|+|...
T Consensus       173 iD~DCspq~sqIC~YEI~t-----~d~PFaIdn-~G~irnTekLny~ke~~Y~ltVtAyDCg~kraa~d~lV~v~Vkp~  245 (952)
T KOG1834|consen  173 IDKDCSPQYSQICEYEITT-----PDVPFAIDN-DGNIRNTEKLNYTKEHQYKLTVTAYDCGKKRAASDSLVTVHVKPT  245 (952)
T ss_pred             ecCCCCCcccceeEEEecC-----CCCceEEcC-CCccccccccccccceeEEEEEEEEecccccccCcceEEEEecCc
Confidence            9999754  4557889974     455699997 799999999999999999999999999986655657788888654


No 11 
>PF00028 Cadherin:  Cadherin domain;  InterPro: IPR002126 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion.; GO: 0005509 calcium ion binding, 0007156 homophilic cell adhesion, 0016020 membrane; PDB: 2A4E_A 2A4C_B 2O72_A 2QVI_A 1NCJ_A 3Q2W_A 3Q2N_A 3LNH_B 3LNI_A 3Q2L_A ....
Probab=99.36  E-value=4.8e-12  Score=85.10  Aligned_cols=69  Identities=33%  Similarity=0.565  Sum_probs=63.3

Q ss_pred             CceEEEEEeecCCCCCCeEEEEEEccCCCCCCcEEEECCceEEEEeecCCCccccEEEEEEEEEEC-CccC
Q psy8618         162 SSLLFKVSASDPDCGVNAMVNYTLGESPSRTNHFYMKSVSGEICISQDLDFESRSSYEFPVVATDR-GKET  231 (232)
Q Consensus       162 ~~~~~~v~~~D~d~~~~~~i~y~l~~~~~~~~~f~i~~~~G~i~~~~~ld~e~~~~y~l~V~a~D~-g~~~  231 (232)
                      ....+.+.+.|+|.+.++.+.|+|..+ +....|.|++.+|.|.+.+.||||..+.|.|.|.|+|. |.|+
T Consensus        12 g~~v~~v~a~D~D~~~n~~i~y~i~~~-~~~~~F~I~~~tg~i~~~~~LD~E~~~~y~l~v~a~D~~~~~~   81 (93)
T PF00028_consen   12 GTVVGQVTATDPDSGPNSQITYSILGG-NPDGLFSIDPNTGEISLKKPLDRETQSSYQLTVRATDSGGSPP   81 (93)
T ss_dssp             SSEEEEEEEEESSTSTTSSEEEEEEET-TSTTSEEEETTTTEEEESSSSCTTTTSEEEEEEEEEETTTSSE
T ss_pred             CCEEEEEEEEeCCCCCCceEEEEEecC-cccCceEEeeeeeccccceecCcccCCEEEEEEEEEECCCCCC
Confidence            456788999999999999999999987 55899999999999999999999999999999999999 7775


No 12 
>smart00112 CA Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
Probab=99.21  E-value=7.6e-11  Score=76.73  Aligned_cols=60  Identities=32%  Similarity=0.505  Sum_probs=53.6

Q ss_pred             ecCCCCCCeEEEEEEccCCCCCCcEEEECCceEEEEeecCCCccccEEEEEEEEEECCccC
Q psy8618         171 SDPDCGVNAMVNYTLGESPSRTNHFYMKSVSGEICISQDLDFESRSSYEFPVVATDRGKET  231 (232)
Q Consensus       171 ~D~d~~~~~~i~y~l~~~~~~~~~f~i~~~~G~i~~~~~ld~e~~~~y~l~V~a~D~g~~~  231 (232)
                      +|+|.+.++.+.|+|..+ .....|.|++.+|.|.+.++||||....|.|.|.|+|+|.|+
T Consensus         1 ~D~D~g~n~~i~Y~i~~~-~~~~~F~i~~~tg~i~~~~~LD~e~~~~y~l~v~a~D~~~~~   60 (79)
T smart00112        1 TDADSGENGKVTYSILSG-NEDGLFSIDPETGEITTTKPLDREEQPEYTLTVEATDGGGPP   60 (79)
T ss_pred             CCCCCCcCcEEEEEEecC-CCCCEEEEeCCccEEEeCCccCeeCCCeEEEEEEEEECCCCC
Confidence            377888889999999976 334899999999999999999999999999999999999864


No 13 
>PF08266 Cadherin_2:  Cadherin-like;  InterPro: IPR013164 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion. This entry represents a cadherin domain that is usually found at the N terminus of cadherin proteins.; PDB: 1WUZ_A 1WYJ_A.
Probab=97.96  E-value=1.3e-05  Score=52.34  Aligned_cols=61  Identities=25%  Similarity=0.357  Sum_probs=39.5

Q ss_pred             EEEEecCCCCCcEEEEEEEEeCCCCCC--cEEEEEEeeCCCCCCcccEEEeCCceEEEECccCCccC
Q psy8618          57 VISVREEQPPGTEVVRARAVDKDDGSN--ATISYSILKGRDTDGYGVFTIDPISGVIRTKSVLDHED  121 (232)
Q Consensus        57 ~~~v~E~~~~g~~v~~v~a~D~D~~~~--~~v~y~i~~~~~~~~~~~f~id~~tG~i~~~~~ld~e~  121 (232)
                      ...|+|+.++|+.|+.+ |.|......  ..-.|++++.   ....+|.++..+|.|.++..+|||.
T Consensus         4 ~YsV~EE~~~Gt~IGni-a~dL~l~~~~l~~~~~ri~s~---~~~~~~~v~~~tG~L~v~~rIDRE~   66 (84)
T PF08266_consen    4 RYSVPEEMPPGTVIGNI-AKDLGLDPQSLSSRNFRIVSE---GNSQYFRVNEKTGDLFVSERIDREE   66 (84)
T ss_dssp             EEEEESS--TT-EEEEC-CCCCT--HHHHCCTTBEEE-S---SSS-SEEE-TTTSEEEESS--SCCC
T ss_pred             EEEeecCCCCCCEEEEh-HHhhCCCcccccccceEEeec---CCcceeEecCCceeEEeCCccCHHH
Confidence            57899999999999998 545433210  1135777665   5678999999999999999999996


No 14 
>PF08758 Cadherin_pro:  Cadherin prodomain like;  InterPro: IPR014868 Cadherins are a group of proteins that mediate calcium dependent cell-cell adhesion. They are activated through cleavage of a prosequence in the late Golgi. This protein corresponds to the folded region of the prosequence, and is termed the prodomain. The prodomain shows structural resemblance to the cadherin domain, but lacks all the features known to be important for cadherin-cadherin interactions []. ; GO: 0007155 cell adhesion, 0016021 integral to membrane; PDB: 1OP4_A.
Probab=97.36  E-value=0.003  Score=41.76  Aligned_cols=78  Identities=13%  Similarity=0.123  Sum_probs=41.4

Q ss_pred             CCeecCCCceEEEEecCCCCCcEEEEEEEEeCCCCCCcEEEEEEeeCCCCCCcccEEEeCCceEEEECccCCccCCCEEE
Q psy8618          47 APELVDPQEDVISVREEQPPGTEVVRARAVDKDDGSNATISYSILKGRDTDGYGVFTIDPISGVIRTKSVLDHEDRSIYR  126 (232)
Q Consensus        47 ~P~f~~~~~~~~~v~E~~~~g~~v~~v~a~D~D~~~~~~v~y~i~~~~~~~~~~~f~id~~tG~i~~~~~ld~e~~~~~~  126 (232)
                      .|-|... .|.+.|+.+...|..|++|.-.|..  .+..+.|.-.      +. .|.|.+ .|.|.+++++... ...-.
T Consensus         3 ~pGF~~~-~~~~~Vp~~l~~g~~lg~V~f~dC~--~~~~~~~~ss------Dp-dF~V~~-DGsVy~~r~v~l~-~~~~~   70 (90)
T PF08758_consen    3 RPGFSQK-KYTFEVPSNLEAGQPLGKVNFEDCT--GRRRVIFESS------DP-DFRVLE-DGSVYAKRPVQLS-SEQRS   70 (90)
T ss_dssp             --B--S--EEEE----SS-SS--EEE---B--S--S---EEEE---------S-EEEEET-TTEEEEES--S-S-SS-EE
T ss_pred             cCCcccc-eEEEEcCchhhCCcEEEEEEeccCC--CCCceEEecC------CC-CEEEcC-CCeEEEeeeEecC-CCceE
Confidence            4778877 7889999999999999999999885  3445777652      22 699998 7999999998654 33457


Q ss_pred             EEEEEEeCCC
Q psy8618         127 IAVAATDNGF  136 (232)
Q Consensus       127 l~v~a~d~~~  136 (232)
                      |.|.|.|...
T Consensus        71 F~V~a~D~~~   80 (90)
T PF08758_consen   71 FTVHAWDSQT   80 (90)
T ss_dssp             EEEEEEETTT
T ss_pred             EEEEEECCCC
Confidence            9999999865


No 15 
>smart00736 CADG Dystroglycan-type cadherin-like domains. Cadherin-homologous domains present in metazoan dystroglycans and alpha/epsilon sarcoglycans, yeast Axl2p and in a very large protein from magnetotactic bacteria. Likely to bind calcium ions.
Probab=97.24  E-value=0.012  Score=39.59  Aligned_cols=73  Identities=21%  Similarity=0.260  Sum_probs=52.7

Q ss_pred             EEeCCCCCCcEEEEEEeeCCCCCCcccEEEeCCceEEEECccCCccCCCEEEEEEEEEeCCCCCcceeEEEEEEEEecCC
Q psy8618          75 AVDKDDGSNATISYSILKGRDTDGYGVFTIDPISGVIRTKSVLDHEDRSIYRIAVAATDNGFPSRQTVRLLRVEVLDLND  154 (232)
Q Consensus        75 a~D~D~~~~~~v~y~i~~~~~~~~~~~f~id~~tG~i~~~~~ld~e~~~~~~l~v~a~d~~~~~~~~~~~v~i~v~d~Nd  154 (232)
                      ..|.|   +..++|++..........|...++.++.+.- .+. .+..+.|.+.|.|+|+..  .+....+.|.|.+.|+
T Consensus        24 F~d~d---~~~lty~~~~~~~~~lP~Wl~fd~~~~~~~G-tP~-~~~~g~~~i~v~a~D~~g--~~~~~~f~i~V~~~~~   96 (97)
T smart00736       24 FTDAD---GDTLTYSATLSDGSALPSWLSFDSDTGTLSG-TPT-NSDVGSLSLKVTATDSSG--ASASDTFTITVVNTND   96 (97)
T ss_pred             eECCC---CCeEEEEEEeCCCCCCCCeEEEeCCCCEEEE-ECC-CCCCcEEEEEEEEEECCC--CEEEEEEEEEEeCCCC
Confidence            46665   4468998864432244679999998888776 344 334577999999999864  5566779999988876


No 16 
>TIGR01965 VCBS_repeat VCBS repeat. This domain of about 100 residues is found multiple (up to 35) copies in long proteins from several species of Vibrio, Colwellia, Bradyrhizobium, and Shewanella (hence the name VCBS) and in smaller copy numbers in proteins from several other bacteria. The large protein size and repeat copy numbers, species distribution, and suggested activities of several member proteins suggests a role for this domain in adhesion.
Probab=96.62  E-value=0.035  Score=37.23  Aligned_cols=87  Identities=18%  Similarity=0.259  Sum_probs=55.6

Q ss_pred             EEEEEEeCCCCCCcEEEEEEeeCCCCCCcccEEEeCCceEEEECc--------cCCccCCCEEEEEEEEEeCCCCCccee
Q psy8618          71 VRARAVDKDDGSNATISYSILKGRDTDGYGVFTIDPISGVIRTKS--------VLDHEDRSIYRIAVAATDNGFPSRQTV  142 (232)
Q Consensus        71 ~~v~a~D~D~~~~~~v~y~i~~~~~~~~~~~f~id~~tG~i~~~~--------~ld~e~~~~~~l~v~a~d~~~~~~~~~  142 (232)
                      +++.++|+|.+..  ..++....  ....+.|.|++ +|.....-        .|...+.-.-+|++.+.|+      ..
T Consensus         2 G~Lt~sD~D~gd~--~~~s~~~~--~g~yGtlti~~-~G~wtYtl~n~~~avq~L~~Ge~~tdsFtvtv~DG------tt   70 (99)
T TIGR01965         2 GQLTISDADAGQA--HFIAQTDA--AGQYGTFSIDA-DGQWTYQADNSQTAVQALKAGETLTDTFTVTSADG------TS   70 (99)
T ss_pred             CceEEeCCCCCCc--eEEecccc--cCCcEEEEECC-CCcEEEEeCCCcHHHHhhcCCCEEEEEEEEEEeCC------Ce
Confidence            4688999996543  45555322  14577899987 67654431        2332234457788889996      26


Q ss_pred             EEEEEEEEecCCCCCeeeCCceEEEEEe
Q psy8618         143 RLLRVEVLDLNDNRPTFTSSSLLFKVSA  170 (232)
Q Consensus       143 ~~v~i~v~d~Nd~~P~f~~~~~~~~v~~  170 (232)
                      ..|.|+|.-.|| +|...... ...+.|
T Consensus        71 ~~vtItI~GtND-apvi~~~~-~g~v~E   96 (99)
T TIGR01965        71 QTVTITITGAND-AAVIGGAD-TGSVTE   96 (99)
T ss_pred             EEEEEEEEccCC-CCEEeccc-ceeEec
Confidence            789999999999 77654322 245554


No 17 
>PF08266 Cadherin_2:  Cadherin-like;  InterPro: IPR013164 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion. This entry represents a cadherin domain that is usually found at the N terminus of cadherin proteins.; PDB: 1WUZ_A 1WYJ_A.
Probab=96.09  E-value=0.011  Score=38.50  Aligned_cols=32  Identities=19%  Similarity=0.449  Sum_probs=23.1

Q ss_pred             EEEEccCCCCCCcEEEECCceEEEEeecCCCcc
Q psy8618         182 NYTLGESPSRTNHFYMKSVSGEICISQDLDFES  214 (232)
Q Consensus       182 ~y~l~~~~~~~~~f~i~~~~G~i~~~~~ld~e~  214 (232)
                      .|.+... ....+|.++..+|.|.+...+|+|.
T Consensus        35 ~~ri~s~-~~~~~~~v~~~tG~L~v~~rIDRE~   66 (84)
T PF08266_consen   35 NFRIVSE-GNSQYFRVNEKTGDLFVSERIDREE   66 (84)
T ss_dssp             TBEEE-S-SSS-SEEE-TTTSEEEESS--SCCC
T ss_pred             ceEEeec-CCcceeEecCCceeEEeCCccCHHH
Confidence            6666654 5689999999999999999999986


No 18 
>PF08758 Cadherin_pro:  Cadherin prodomain like;  InterPro: IPR014868 Cadherins are a group of proteins that mediate calcium dependent cell-cell adhesion. They are activated through cleavage of a prosequence in the late Golgi. This protein corresponds to the folded region of the prosequence, and is termed the prodomain. The prodomain shows structural resemblance to the cadherin domain, but lacks all the features known to be important for cadherin-cadherin interactions []. ; GO: 0007155 cell adhesion, 0016021 integral to membrane; PDB: 1OP4_A.
Probab=94.89  E-value=0.17  Score=33.46  Aligned_cols=69  Identities=22%  Similarity=0.349  Sum_probs=34.3

Q ss_pred             CCeeeCCceEEEEEee-----------cCCCCCCeEEEEEEccCCCCCCcEEEECCceEEEEeecCCCccccEEEEEEEE
Q psy8618         156 RPTFTSSSLLFKVSAS-----------DPDCGVNAMVNYTLGESPSRTNHFYMKSVSGEICISQDLDFESRSSYEFPVVA  224 (232)
Q Consensus       156 ~P~f~~~~~~~~v~~~-----------D~d~~~~~~i~y~l~~~~~~~~~f~i~~~~G~i~~~~~ld~e~~~~y~l~V~a  224 (232)
                      .|-|....|.+.|+..           =.|+.....+.|..     .+..|.|. ..|.|++++++..... .-.|.|.|
T Consensus         3 ~pGF~~~~~~~~Vp~~l~~g~~lg~V~f~dC~~~~~~~~~s-----sDpdF~V~-~DGsVy~~r~v~l~~~-~~~F~V~a   75 (90)
T PF08758_consen    3 RPGFSQKKYTFEVPSNLEAGQPLGKVNFEDCTGRRRVIFES-----SDPDFRVL-EDGSVYAKRPVQLSSE-QRSFTVHA   75 (90)
T ss_dssp             --B--S-EEEE----SS-SS--EEE---B--SS---EEEE--------SEEEEE-TTTEEEEES--S-SSS--EEEEEEE
T ss_pred             cCCcccceEEEEcCchhhCCcEEEEEEeccCCCCCceEEec-----CCCCEEEc-CCCeEEEeeeEecCCC-ceEEEEEE
Confidence            4667777777777651           01232233345553     24589999 6999999999887654 35799999


Q ss_pred             EECCccC
Q psy8618         225 TDRGKET  231 (232)
Q Consensus       225 ~D~g~~~  231 (232)
                      .|.....
T Consensus        76 ~D~~~~~   82 (90)
T PF08758_consen   76 WDSQTQE   82 (90)
T ss_dssp             EETTTTE
T ss_pred             ECCCCCe
Confidence            9987653


No 19 
>PF13750 Big_3_3:  Bacterial Ig-like domain (group 3)
Probab=92.89  E-value=2.8  Score=30.81  Aligned_cols=129  Identities=20%  Similarity=0.257  Sum_probs=65.2

Q ss_pred             CCeEEEEE-EEEeCCCCCceeEEEEEEEEEeCCCCCCeecCCCceEEEEecCCCC-CcEEEEEEEEeCCCCCCcEEEEEE
Q psy8618          13 RSSHELVA-EARDQGSPPRSSRVAVKISVTDVNDNAPELVDPQEDVISVREEQPP-GTEVVRARAVDKDDGSNATISYSI   90 (232)
Q Consensus        13 ~~~~~l~v-~a~d~~~~~~~~~~~v~v~V~dvNd~~P~f~~~~~~~~~v~E~~~~-g~~v~~v~a~D~D~~~~~~v~y~i   90 (232)
                      .-.|.+++ .|.|..+...+.....++.   ++..+|.+.-.  ....+..+... +..-..+.++|.-.+. .....++
T Consensus        14 dG~Y~l~~~~a~D~agN~~~~~~~~~~~---iD~T~Ptisi~--~~~~~~~g~~v~~~~~i~i~~tD~~~~~-~i~sv~l   87 (158)
T PF13750_consen   14 DGSYTLTVVTATDAAGNTSTSTVSETFT---IDNTPPTISIS--DGASVANGSTVYGLVNISINVTDNSDDS-KITSVSL   87 (158)
T ss_pred             CccEEEEEEEEEecCCCEEEEEEeeEEE---EcCCCCEEEEe--cCCccCCCccccceeeeEEEEEeCCCCc-eEEEEEE
Confidence            35899999 7999877433333332333   34446755430  00122222222 3333557777765433 3446666


Q ss_pred             eeCCCCCCcccEEEe-CCceEEEEC--ccC-CccCCCEEEEEEEEEeCCCCCcceeEEEEEEEE
Q psy8618          91 LKGRDTDGYGVFTID-PISGVIRTK--SVL-DHEDRSIYRIAVAATDNGFPSRQTVRLLRVEVL  150 (232)
Q Consensus        91 ~~~~~~~~~~~f~id-~~tG~i~~~--~~l-d~e~~~~~~l~v~a~d~~~~~~~~~~~v~i~v~  150 (232)
                      ..|.. ...-..... ...|...+.  +.+ ..|....|+|+|.|.|..+  ..++..+.+...
T Consensus        88 ~Gg~~-~d~v~ls~~~~~~~~~~~~yp~~fpsle~~~~YtLtV~a~D~aG--N~~~~si~F~y~  148 (158)
T PF13750_consen   88 TGGPA-SDSVSLSWTNKGNGVYTLEYPRIFPSLEADDSYTLTVSATDKAG--NQSTKSISFSYM  148 (158)
T ss_pred             ECCcc-cceEEEeeEeccCceEEeecccccCCcCCCCeEEEEEEEEecCC--CEEEEEEEEEEe
Confidence            44421 122112222 123444443  211 3466789999999999864  344445555544


No 20 
>PF05345 He_PIG:  Putative Ig domain;  InterPro: IPR008009 This alignment represents the conserved core region of a ~90 residue repeat found in several haemagglutinins and other cell surface proteins. Sequence similarities to Hyalin (IPR003410 from INTERPRO) and the PKD domain (IPR000601 from INTERPRO) suggest an Ig-like fold so this family may be similar in function to the (IPR003791 from INTERPRO) and (IPR003790 from INTERPRO) protein families.
Probab=92.26  E-value=1.2  Score=25.68  Aligned_cols=37  Identities=24%  Similarity=0.348  Sum_probs=28.3

Q ss_pred             CCcccEEEeCCceEEEECccCCcc-CCCEEEEEEEEEeCC
Q psy8618          97 DGYGVFTIDPISGVIRTKSVLDHE-DRSIYRIAVAATDNG  135 (232)
Q Consensus        97 ~~~~~f~id~~tG~i~~~~~ld~e-~~~~~~l~v~a~d~~  135 (232)
                      ....+..+|+.+|.|+-.  .+.. ..+.|.|.|.|+|..
T Consensus        11 ~LP~gLs~d~~tG~isGt--p~~~~~~G~y~~~vtatd~~   48 (49)
T PF05345_consen   11 GLPSGLSLDPSTGTISGT--PTSSVQPGTYTFTVTATDGS   48 (49)
T ss_pred             CCCCcEEEeCCCCEEEee--cCCCccccEEEEEEEEEcCC
Confidence            556789999999998875  2233 346899999999964


No 21 
>TIGR00845 caca sodium/calcium exchanger 1. This model is specific for the eukaryotic sodium ion/calcium ion exchangers of the Caca family
Probab=91.62  E-value=13  Score=35.38  Aligned_cols=121  Identities=19%  Similarity=0.255  Sum_probs=61.6

Q ss_pred             CCCCCCeecCCCceEEEEecCCCCCcEEEEEEEEeCCCCCCcEEEEEEeeCCCCCCcccEEEeCCceEEEEC--------
Q psy8618          43 VNDNAPELVDPQEDVISVREEQPPGTEVVRARAVDKDDGSNATISYSILKGRDTDGYGVFTIDPISGVIRTK--------  114 (232)
Q Consensus        43 vNd~~P~f~~~~~~~~~v~E~~~~g~~v~~v~a~D~D~~~~~~v~y~i~~~~~~~~~~~f~id~~tG~i~~~--------  114 (232)
                      .||..+.|... .-..++.|+.  |+.-..|.-...|.+..-.|.|+...|+.....+ |.  +.+|.|...        
T Consensus       395 ~dd~~s~i~Fe-~~~Y~V~En~--GtV~VtV~R~GGdl~~tVsVdY~T~DGTA~AG~D-Y~--~~sGTLtF~PGEt~KtI  468 (928)
T TIGR00845       395 ENDPVSKIFFE-PGHYTCLENC--GTVALTVVRRGGDLTNTVYVDYRTEDGTANAGSD-YE--FTEGTLVFKPGETQKEF  468 (928)
T ss_pred             ccCCcceEEec-CCeEEEeecC--cEEEEEEEEccCCCCceEEEEEEccCCccCCCCC-cc--ccCceEEECCCceEEEE
Confidence            45555555554 3467899986  5655665544445444566899887775422111 21  223433322        


Q ss_pred             --ccCC---ccCCCEEEEEEEEEeCCC-------------CCcceeEEEEEEEEecCCCCCeeeCCceEEEEEe
Q psy8618         115 --SVLD---HEDRSIYRIAVAATDNGF-------------PSRQTVRLLRVEVLDLNDNRPTFTSSSLLFKVSA  170 (232)
Q Consensus       115 --~~ld---~e~~~~~~l~v~a~d~~~-------------~~~~~~~~v~i~v~d~Nd~~P~f~~~~~~~~v~~  170 (232)
                        .-+|   +|....|.+.+.-...+.             .........+|+|.| ||+++.|....-.+.|.|
T Consensus       469 tV~IIDDdi~E~DE~F~V~LSNp~~g~~~G~~~~~~~~~~A~Lg~ps~ATVTIlD-DD~aGIfsFe~~~~sV~E  541 (928)
T TIGR00845       469 RIGIIDDDIFEEDEHFYVRLSNLRVGSEDGILEANHVSAVAQLASPNTATVTILD-DDHAGIFTFEEDVFHVSE  541 (928)
T ss_pred             EEEEccCCCCCCCceEEEEEeCCCCCCcccccccccccccceecCCceEEEEEec-CcccCcccccCceEEEEc
Confidence              1122   333444555444322110             012223356777787 888988765443444544


No 22 
>smart00736 CADG Dystroglycan-type cadherin-like domains. Cadherin-homologous domains present in metazoan dystroglycans and alpha/epsilon sarcoglycans, yeast Axl2p and in a very large protein from magnetotactic bacteria. Likely to bind calcium ions.
Probab=89.50  E-value=1.6  Score=29.03  Aligned_cols=31  Identities=29%  Similarity=0.314  Sum_probs=26.7

Q ss_pred             CCeEEEEEEEEeCCCCCceeEEEEEEEEEeCCC
Q psy8618          13 RSSHELVAEARDQGSPPRSSRVAVKISVTDVND   45 (232)
Q Consensus        13 ~~~~~l~v~a~d~~~~~~~~~~~v~v~V~dvNd   45 (232)
                      ...|.+.|.|+|+.+  .++...++|.|.+.|+
T Consensus        66 ~g~~~i~v~a~D~~g--~~~~~~f~i~V~~~~~   96 (97)
T smart00736       66 VGSLSLKVTATDSSG--ASASDTFTITVVNTND   96 (97)
T ss_pred             CcEEEEEEEEEECCC--CEEEEEEEEEEeCCCC
Confidence            457999999999876  6788899999999887


No 23 
>TIGR01965 VCBS_repeat VCBS repeat. This domain of about 100 residues is found multiple (up to 35) copies in long proteins from several species of Vibrio, Colwellia, Bradyrhizobium, and Shewanella (hence the name VCBS) and in smaller copy numbers in proteins from several other bacteria. The large protein size and repeat copy numbers, species distribution, and suggested activities of several member proteins suggests a role for this domain in adhesion.
Probab=88.19  E-value=2.6  Score=28.30  Aligned_cols=48  Identities=25%  Similarity=0.316  Sum_probs=33.0

Q ss_pred             CCcCCCCeEEEEEEEEeCCCCCceeEEEEEEEEEeCCCCCCeecCCCceEEEEecCC
Q psy8618           8 LDREARSSHELVAEARDQGSPPRSSRVAVKISVTDVNDNAPELVDPQEDVISVREEQ   64 (232)
Q Consensus         8 lD~E~~~~~~l~v~a~d~~~~~~~~~~~v~v~V~dvNd~~P~f~~~~~~~~~v~E~~   64 (232)
                      |.--+...-++++.+.|+      ....|+|.|...|| +|+....  -...+.|+.
T Consensus        51 L~~Ge~~tdsFtvtv~DG------tt~~vtItI~GtND-apvi~~~--~~g~v~ED~   98 (99)
T TIGR01965        51 LKAGETLTDTFTVTSADG------TSQTVTITITGAND-AAVIGGA--DTGSVTEDS   98 (99)
T ss_pred             hcCCCEEEEEEEEEEeCC------CeEEEEEEEEccCC-CCEEecc--cceeEecCC
Confidence            333334455677788885      27889999999999 8877654  246666653


No 24 
>TIGR03660 T1SS_rpt_143 T1SS-143 repeat domain. This model represents a domain of about 143 amino acids that may occur singly or in up to 23 tandem repeats in very large proteins in the genus Vibrio, and in related species such as Legionella pneumophila, Photobacterium profundum, Rhodopseudomonas palustris, Shewanella pealeana, and Aeromonas hydrophila. Proteins with these domains represent a subset of a broader set of proteins with a particular signal for type 1 secretion, consisting of several glycine-rich repeats modeled by pfam00353, followed by a C-terminal domain modeled by TIGR03661. Proteins with this domain tend to share several properties with the RtxA (Repeats in Toxin) protein of Vibrio cholerae, including a large size often containing tandemly repeated domains and a C-terminal signal for type 1 secretion.
Probab=87.05  E-value=4.1  Score=29.22  Aligned_cols=58  Identities=22%  Similarity=0.446  Sum_probs=38.5

Q ss_pred             eEecCCCCcCC---CCeEEEEEEEEeCCCCCceeEEEEEEEEEeCCCCCCeecCCCceEEEEecCCCC
Q psy8618           2 ISTREPLDREA---RSSHELVAEARDQGSPPRSSRVAVKISVTDVNDNAPELVDPQEDVISVREEQPP   66 (232)
Q Consensus         2 i~~~~~lD~E~---~~~~~l~v~a~d~~~~~~~~~~~v~v~V~dvNd~~P~f~~~~~~~~~v~E~~~~   66 (232)
                      ..+.++||...   .-...|.|.|+|..+-..  ...+.|.|.|  | .|..... . .+.|.|+...
T Consensus        70 ftL~~~lDH~~g~d~l~l~~~v~a~D~DGD~s--~~~l~VtI~D--D-~P~~~~~-~-~~~V~E~~L~  130 (137)
T TIGR03660        70 FTLEGPLDHAAGSDELTLNFPIIATDFDGDTS--SITLPVTIVD--D-VPTITDV-D-ALTVDEDDLP  130 (137)
T ss_pred             EEEcccccCCCCCceEEEeeeEEEEeCCCCcc--ccEEEEEEEC--C-CCeeccc-c-ceEEeccccc
Confidence            35678888743   446778899999776433  3477777776  6 5877654 2 4788886543


No 25 
>PF05345 He_PIG:  Putative Ig domain;  InterPro: IPR008009 This alignment represents the conserved core region of a ~90 residue repeat found in several haemagglutinins and other cell surface proteins. Sequence similarities to Hyalin (IPR003410 from INTERPRO) and the PKD domain (IPR000601 from INTERPRO) suggest an Ig-like fold so this family may be similar in function to the (IPR003791 from INTERPRO) and (IPR003790 from INTERPRO) protein families.
Probab=85.28  E-value=5.2  Score=22.98  Aligned_cols=37  Identities=22%  Similarity=0.319  Sum_probs=28.5

Q ss_pred             CCCCcEEEECCceEEEEeecCCCc-cccEEEEEEEEEECC
Q psy8618         190 SRTNHFYMKSVSGEICISQDLDFE-SRSSYEFPVVATDRG  228 (232)
Q Consensus       190 ~~~~~f~i~~~~G~i~~~~~ld~e-~~~~y~l~V~a~D~g  228 (232)
                      .-.....||+.+|.|.-.-  +.. ..+.|.+.|.|+|..
T Consensus        11 ~LP~gLs~d~~tG~isGtp--~~~~~~G~y~~~vtatd~~   48 (49)
T PF05345_consen   11 GLPSGLSLDPSTGTISGTP--TSSVQPGTYTFTVTATDGS   48 (49)
T ss_pred             CCCCcEEEeCCCCEEEeec--CCCccccEEEEEEEEEcCC
Confidence            5567889999999998663  333 335899999999975


No 26 
>PF13750 Big_3_3:  Bacterial Ig-like domain (group 3)
Probab=84.45  E-value=14  Score=27.17  Aligned_cols=103  Identities=18%  Similarity=0.307  Sum_probs=53.7

Q ss_pred             CCEEEEEE-EEEeCCCCCcceeEEEEEEEEecCCCCCeeeC---Cc----------eEEEEEeecCCCCCCeEEEEEEcc
Q psy8618         122 RSIYRIAV-AATDNGFPSRQTVRLLRVEVLDLNDNRPTFTS---SS----------LLFKVSASDPDCGVNAMVNYTLGE  187 (232)
Q Consensus       122 ~~~~~l~v-~a~d~~~~~~~~~~~v~i~v~d~Nd~~P~f~~---~~----------~~~~v~~~D~d~~~~~~i~y~l~~  187 (232)
                      -+.|.+.+ .|.|..+..........+.|   ...+|.+.-   ..          ..+.|..+|..... ..-..+|.+
T Consensus        14 dG~Y~l~~~~a~D~agN~~~~~~~~~~~i---D~T~Ptisi~~~~~~~~g~~v~~~~~i~i~~tD~~~~~-~i~sv~l~G   89 (158)
T PF13750_consen   14 DGSYTLTVVTATDAAGNTSTSTVSETFTI---DNTPPTISISDGASVANGSTVYGLVNISINVTDNSDDS-KITSVSLTG   89 (158)
T ss_pred             CccEEEEEEEEEecCCCEEEEEEeeEEEE---cCCCCEEEEecCCccCCCccccceeeeEEEEEeCCCCc-eEEEEEEEC
Confidence            46799999 79997653322222222332   334777654   00          23344444444333 555777776


Q ss_pred             CCCCCCcEEE--E-CCceEEEEe--ecC-CCccccEEEEEEEEEECCc
Q psy8618         188 SPSRTNHFYM--K-SVSGEICIS--QDL-DFESRSSYEFPVVATDRGK  229 (232)
Q Consensus       188 ~~~~~~~f~i--~-~~~G~i~~~--~~l-d~e~~~~y~l~V~a~D~g~  229 (232)
                      ++ ....-.+  . ...|...+.  +.+ ..|....|.|.|.|.|...
T Consensus        90 g~-~~d~v~ls~~~~~~~~~~~~yp~~fpsle~~~~YtLtV~a~D~aG  136 (158)
T PF13750_consen   90 GP-ASDSVSLSWTNKGNGVYTLEYPRIFPSLEADDSYTLTVSATDKAG  136 (158)
T ss_pred             Cc-ccceEEEeeEeccCceEEeecccccCCcCCCCeEEEEEEEEecCC
Confidence            52 2222222  1 123333332  111 3367789999999999753


No 27 
>TIGR03660 T1SS_rpt_143 T1SS-143 repeat domain. This model represents a domain of about 143 amino acids that may occur singly or in up to 23 tandem repeats in very large proteins in the genus Vibrio, and in related species such as Legionella pneumophila, Photobacterium profundum, Rhodopseudomonas palustris, Shewanella pealeana, and Aeromonas hydrophila. Proteins with these domains represent a subset of a broader set of proteins with a particular signal for type 1 secretion, consisting of several glycine-rich repeats modeled by pfam00353, followed by a C-terminal domain modeled by TIGR03661. Proteins with this domain tend to share several properties with the RtxA (Repeats in Toxin) protein of Vibrio cholerae, including a large size often containing tandemly repeated domains and a C-terminal signal for type 1 secretion.
Probab=77.57  E-value=23  Score=25.36  Aligned_cols=59  Identities=25%  Similarity=0.266  Sum_probs=38.2

Q ss_pred             eEEEECccCCccC---CCEEEEEEEEEeCCCCCcceeEEEEEEEEecCCCCCeeeCCceEEEEEeecC
Q psy8618         109 GVIRTKSVLDHED---RSIYRIAVAATDNGFPSRQTVRLLRVEVLDLNDNRPTFTSSSLLFKVSASDP  173 (232)
Q Consensus       109 G~i~~~~~ld~e~---~~~~~l~v~a~d~~~~~~~~~~~v~i~v~d~Nd~~P~f~~~~~~~~v~~~D~  173 (232)
                      -.+++.++||+..   .-...|.|.|+|.+.-..  ...+.|+|.|  | .|...... .++|.|.+.
T Consensus        68 YtftL~~~lDH~~g~d~l~l~~~v~a~D~DGD~s--~~~l~VtI~D--D-~P~~~~~~-~~~V~E~~L  129 (137)
T TIGR03660        68 YEFTLEGPLDHAAGSDELTLNFPIIATDFDGDTS--SITLPVTIVD--D-VPTITDVD-ALTVDEDDL  129 (137)
T ss_pred             EEEEEcccccCCCCCceEEEeeeEEEEeCCCCcc--ccEEEEEEEC--C-CCeecccc-ceEEecccc
Confidence            4566777887643   335778899999765332  3467777776  5 57765544 368887443


No 28 
>PF07495 Y_Y_Y:  Y_Y_Y domain;  InterPro: IPR011123 This region is mostly found at the end of the beta propellers (IPR011110 from INTERPRO) in a family of two component regulators. However they are also found tandemly repeated in Q891H4 from SWISSPROT without other signal conduction domains being present. It is named after the conserved tyrosines found in the alignment. The exact function is not known.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=77.28  E-value=13  Score=22.30  Aligned_cols=59  Identities=22%  Similarity=0.215  Sum_probs=32.7

Q ss_pred             CcEEEEEEeeCCCCCCcccEEEeCCceEEEECccCCccCCCEEEEEEEEEeCCCCCcceeEEEEEEE
Q psy8618          83 NATISYSILKGRDTDGYGVFTIDPISGVIRTKSVLDHEDRSIYRIAVAATDNGFPSRQTVRLLRVEV  149 (232)
Q Consensus        83 ~~~v~y~i~~~~~~~~~~~f~id~~tG~i~~~~~ld~e~~~~~~l~v~a~d~~~~~~~~~~~v~i~v  149 (232)
                      +-..+|++. |   -...|..+...+-.+... .|   ..+.|+|.|.|.|...........+.|+|
T Consensus         7 ~~~Y~Y~l~-g---~d~~W~~~~~~~~~~~~~-~L---~~G~Y~l~V~a~~~~~~~~~~~~~l~i~I   65 (66)
T PF07495_consen    7 NIRYRYRLE-G---FDDEWITLGSYSNSISYT-NL---PPGKYTLEVRAKDNNGKWSSDEKSLTITI   65 (66)
T ss_dssp             TEEEEEEEE-T---TESSEEEESSTS-EEEEE-S-----SEEEEEEEEEEETTS-B-SS-EEEEEEE
T ss_pred             ceEEEEEEE-C---CCCeEEECCCCcEEEEEE-eC---CCEEEEEEEEEECCCCCcCcccEEEEEEE
Confidence            334566664 4   345667766543243332 22   56889999999997654333335566665


No 29 
>PF07495 Y_Y_Y:  Y_Y_Y domain;  InterPro: IPR011123 This region is mostly found at the end of the beta propellers (IPR011110 from INTERPRO) in a family of two component regulators. However they are also found tandemly repeated in Q891H4 from SWISSPROT without other signal conduction domains being present. It is named after the conserved tyrosines found in the alignment. The exact function is not known.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=72.36  E-value=10  Score=22.85  Aligned_cols=27  Identities=19%  Similarity=0.240  Sum_probs=17.2

Q ss_pred             CeEEEEEEEEeCCCCCceeEEEEEEEE
Q psy8618          14 SSHELVAEARDQGSPPRSSRVAVKISV   40 (232)
Q Consensus        14 ~~~~l~v~a~d~~~~~~~~~~~v~v~V   40 (232)
                      -.|.|.|.|.|..+........+.|.|
T Consensus        39 G~Y~l~V~a~~~~~~~~~~~~~l~i~I   65 (66)
T PF07495_consen   39 GKYTLEVRAKDNNGKWSSDEKSLTITI   65 (66)
T ss_dssp             EEEEEEEEEEETTS-B-SS-EEEEEEE
T ss_pred             EEEEEEEEEECCCCCcCcccEEEEEEE
Confidence            489999999997765433335666655


No 30 
>TIGR00864 PCC polycystin cation channel protein. Note: this model has been restricted to the amino half because for technical reasons.
Probab=69.03  E-value=1.8e+02  Score=31.70  Aligned_cols=98  Identities=10%  Similarity=0.078  Sum_probs=49.6

Q ss_pred             ccCCCEEEEEEEEEeCCCCCcceeEEEEEEEEecCCC------------------CCeeeCCceEEEEEeecCCCCCCeE
Q psy8618         119 HEDRSIYRIAVAATDNGFPSRQTVRLLRVEVLDLNDN------------------RPTFTSSSLLFKVSASDPDCGVNAM  180 (232)
Q Consensus       119 ~e~~~~~~l~v~a~d~~~~~~~~~~~v~i~v~d~Nd~------------------~P~f~~~~~~~~v~~~D~d~~~~~~  180 (232)
                      |...+.|++.++|++....   ...++.|.|.+....                  -..+....-.+...+   ....|+.
T Consensus      1564 Y~spGtYtVtLTvtN~~Gs---~~~T~~i~V~~~I~GlqI~~~~~~gg~~~~~~~~~~~~~t~~~v~l~~---~~~~GTn 1637 (2740)
T TIGR00864      1564 FRSVGTFNIIVTAENDVGA---AQASIFLFVLQEIEGLQILGETAEGGGGGVQELDGCYFETNHTVQFHA---GFKDGTN 1637 (2740)
T ss_pred             ecCCceEEEEEEEecCCCc---cceeEEEEEeecccceEEEeecccccCCcccccccceeccCCEEEeee---ccCCCCe
Confidence            6678999999999997642   233445555432110                  011222222344444   3344667


Q ss_pred             EEEEEccCCC-C-CCcEEEECCceEEEEeecCCCccccEEEEEEEEEEC
Q psy8618         181 VNYTLGESPS-R-TNHFYMKSVSGEICISQDLDFESRSSYEFPVVATDR  227 (232)
Q Consensus       181 i~y~l~~~~~-~-~~~f~i~~~~G~i~~~~~ld~e~~~~y~l~V~a~D~  227 (232)
                      ++|+..-+.+ . .+...  ..+|...   .+.|...+.|.+.+.|++.
T Consensus      1638 vsysW~~~~~n~~dg~~~--~~tGk~~---~vt~~~pG~Y~VtL~aSN~ 1681 (2740)
T TIGR00864      1638 LSFSWNAILDNEPDGPAF--AGSGKGA---KLNPLEAGPCDIFLQAANL 1681 (2740)
T ss_pred             eEEEEEEecCCCCCCccc--cccCcee---EEecCCCceEEEEEEEeec
Confidence            7776643201 0 11111  1344321   2344566788888888764


No 31 
>KOG4221|consensus
Probab=68.96  E-value=1.3e+02  Score=29.92  Aligned_cols=136  Identities=13%  Similarity=0.145  Sum_probs=73.3

Q ss_pred             EEEEEEeeCCCCCCcccEEEeCCceEEEECccCCccCCCEEEEEEEEEeCCCCCcceeEEEEEEEEecCCCCCe-ee---
Q psy8618          85 TISYSILKGRDTDGYGVFTIDPISGVIRTKSVLDHEDRSIYRIAVAATDNGFPSRQTVRLLRVEVLDLNDNRPT-FT---  160 (232)
Q Consensus        85 ~v~y~i~~~~~~~~~~~f~id~~tG~i~~~~~ld~e~~~~~~l~v~a~d~~~~~~~~~~~v~i~v~d~Nd~~P~-f~---  160 (232)
                      ...|.+.-... +...++.++.++-..++.   +-+....|.+.|.|.....++.++...-.++..|+-+.+|. ..   
T Consensus       551 I~~yk~~ys~~-~~~~~~~~~~n~~e~ti~---gL~k~TeY~~~vvA~N~~G~g~sS~~i~V~Tlsd~PsaPP~Nl~lev  626 (1381)
T KOG4221|consen  551 ITGYKLFYSED-DTGKELRVENNATEYTIN---GLEKYTEYSIRVVAYNSAGSGVSSADITVRTLSDVPSAPPQNLSLEV  626 (1381)
T ss_pred             ceEEEEEEEcC-CCCceEEEecCccEEEee---cCCCccceEEEEEEecCCCCCCCCCceEEEeccCCCCCCCcceEEEe
Confidence            34555543322 456778888776666665   34566789999999998877766655445667777777776 21   


Q ss_pred             CCceEEEEEee--cCCCCCCeEEEEEEccC--CCCCCc-EEEECCceEEEEeecCCCccccEEEEEEEEEE
Q psy8618         161 SSSLLFKVSAS--DPDCGVNAMVNYTLGES--PSRTNH-FYMKSVSGEICISQDLDFESRSSYEFPVVATD  226 (232)
Q Consensus       161 ~~~~~~~v~~~--D~d~~~~~~i~y~l~~~--~~~~~~-f~i~~~~G~i~~~~~ld~e~~~~y~l~V~a~D  226 (232)
                      .+.-.+.|.=.  -.....|...-|.|...  ...... ..+...+-.-.+..  +.|....|.|.|.|..
T Consensus       627 ~sStsVrVsW~pP~~~t~ng~itgYkIRy~~~~~~~~~~~t~v~~n~~~~l~~--~Lep~T~Y~vrIsa~t  695 (1381)
T KOG4221|consen  627 VSSTSVRVSWLPPPSETQNGQITGYKIRYRKLSREDEVNETVVKGNTTQYLFN--GLEPNTQYRVRISAMT  695 (1381)
T ss_pred             cCCCeEEEEccCCCcccccceEEEEEEEecccCcccccceeecccchhhhHhh--cCCCCceEEEEEEEec
Confidence            12223444321  11222344446666543  011111 22221011111111  2455667999998865


No 32 
>TIGR00845 caca sodium/calcium exchanger 1. This model is specific for the eukaryotic sodium ion/calcium ion exchangers of the Caca family
Probab=67.45  E-value=1.2e+02  Score=29.16  Aligned_cols=56  Identities=21%  Similarity=0.332  Sum_probs=32.2

Q ss_pred             EEEEEEEEeCCCCCCeecCCCceEEEEecCCCCCcEEEEEEEEeCCCCCCcEEEEEEeeCC
Q psy8618          34 VAVKISVTDVNDNAPELVDPQEDVISVREEQPPGTEVVRARAVDKDDGSNATISYSILKGR   94 (232)
Q Consensus        34 ~~v~v~V~dvNd~~P~f~~~~~~~~~v~E~~~~g~~v~~v~a~D~D~~~~~~v~y~i~~~~   94 (232)
                      ...+|+|.| ||++|.|... .-...|.|+.  |+.-.++.-+- +....-.+.|....|+
T Consensus       515 s~ATVTIlD-DD~aGIfsFe-~~~~sV~Es~--G~vtvtV~Rts-Ga~G~VtV~Y~T~dGT  570 (928)
T TIGR00845       515 NTATVTILD-DDHAGIFTFE-EDVFHVSESI--GIMEVKVLRTS-GARGTVIVPYRTVEGT  570 (928)
T ss_pred             ceEEEEEec-CcccCccccc-CceEEEEcCC--CEEEEEEEEcC-CCCeeEEEEEEeecCc
Confidence            455667777 7789988766 3458889975  44433332221 1112233668776664


No 33 
>KOG3597|consensus
Probab=64.51  E-value=97  Score=27.01  Aligned_cols=60  Identities=17%  Similarity=0.230  Sum_probs=44.3

Q ss_pred             eEEEEEEEEEeCCCCCCeecCCCceEEEEecCCCCCcEEEEEEEEeCCCCCCcEEEEEEeeC
Q psy8618          32 SRVAVKISVTDVNDNAPELVDPQEDVISVREEQPPGTEVVRARAVDKDDGSNATISYSILKG   93 (232)
Q Consensus        32 ~~~~v~v~V~dvNd~~P~f~~~~~~~~~v~E~~~~g~~v~~v~a~D~D~~~~~~v~y~i~~~   93 (232)
                      .+....|.|..+||.+..+... -+.+.+.|+...-.....+.+.|+|.+. ..+.|+++..
T Consensus        24 ~~~~~~i~v~pvndpp~~~~~~-~~~l~~~~~~~k~l~~~~l~~~d~d~~~-~~l~f~v~~t   83 (442)
T KOG3597|consen   24 QTDVLRIHVNPVNDPPSLIFPS-GSLLVILEGGQKVLDPELLTAADPDSAP-LPLEFQVLGT   83 (442)
T ss_pred             EEeeecccccccCCCcceeecc-cceEEeecCCceeccceEeeccCCCCCc-cceEEEEccC
Confidence            4667889999999966656555 4458888888766666778899998754 3578888654


No 34 
>TIGR00864 PCC polycystin cation channel protein. Note: this model has been restricted to the amino half because for technical reasons.
Probab=54.58  E-value=3.3e+02  Score=29.92  Aligned_cols=31  Identities=19%  Similarity=0.132  Sum_probs=21.8

Q ss_pred             CccCCCEEEEEEEEEeCCCCCcceeEEEEEEEEe
Q psy8618         118 DHEDRSIYRIAVAATDNGFPSRQTVRLLRVEVLD  151 (232)
Q Consensus       118 d~e~~~~~~l~v~a~d~~~~~~~~~~~v~i~v~d  151 (232)
                      -|...+.|+++++|++....   ...++.|.|..
T Consensus      1479 tY~~~GtYtVtLTvtN~~Gs---st~T~~VtV~~ 1509 (2740)
T TIGR00864      1479 AFNSPGDFNIRLAAANEVGK---NEATLNVAVKA 1509 (2740)
T ss_pred             EcCCCceEEEEEEEECCCCc---eEEEEEEEEec
Confidence            36678999999999998642   34455666653


No 35 
>smart00089 PKD Repeats in polycystic kidney disease 1 (PKD1) and other proteins. Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.
Probab=54.06  E-value=43  Score=20.75  Aligned_cols=30  Identities=13%  Similarity=0.106  Sum_probs=21.5

Q ss_pred             CCccCCCEEEEEEEEEeCCCCCcceeEEEEEEE
Q psy8618         117 LDHEDRSIYRIAVAATDNGFPSRQTVRLLRVEV  149 (232)
Q Consensus       117 ld~e~~~~~~l~v~a~d~~~~~~~~~~~v~i~v  149 (232)
                      .-|...+.|.+++.++|...   +..+.+.|.|
T Consensus        49 ~~y~~~G~y~v~l~v~n~~g---~~~~~~~i~v   78 (79)
T smart00089       49 HTYTKPGTYTVTLTVTNAVG---SASATVTVVV   78 (79)
T ss_pred             EEeCCCcEEEEEEEEEcCCC---cEEEEEEEEE
Confidence            34667889999999999764   4455555554


No 36 
>smart00089 PKD Repeats in polycystic kidney disease 1 (PKD1) and other proteins. Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.
Probab=49.17  E-value=60  Score=20.08  Aligned_cols=28  Identities=11%  Similarity=0.106  Sum_probs=17.4

Q ss_pred             cCCCCeEEEEEEEEeCCCCCceeEEEEEEEE
Q psy8618          10 REARSSHELVAEARDQGSPPRSSRVAVKISV   40 (232)
Q Consensus        10 ~E~~~~~~l~v~a~d~~~~~~~~~~~v~v~V   40 (232)
                      |+....|.+++.+.|..+   ++.+.++|.|
T Consensus        51 y~~~G~y~v~l~v~n~~g---~~~~~~~i~v   78 (79)
T smart00089       51 YTKPGTYTVTLTVTNAVG---SASATVTVVV   78 (79)
T ss_pred             eCCCcEEEEEEEEEcCCC---cEEEEEEEEE
Confidence            445567888888887665   4444555543


No 37 
>PF12245 Big_3_2:  Bacterial Ig-like domain (group 3);  InterPro: IPR022038  This family of proteins is found in bacteria. They have two conserved sequence motifs: AGN and GMT. 
Probab=48.89  E-value=57  Score=19.46  Aligned_cols=29  Identities=21%  Similarity=0.257  Sum_probs=18.7

Q ss_pred             CCeEEEEEEEEeCCCCCceeEEEEEEEEEeC
Q psy8618          13 RSSHELVAEARDQGSPPRSSRVAVKISVTDV   43 (232)
Q Consensus        13 ~~~~~l~v~a~d~~~~~~~~~~~v~v~V~dv   43 (232)
                      ...|.|.+.|+|..|..  +.......+.|.
T Consensus        22 dg~yt~~v~a~D~AGN~--~~~~~~~~i~d~   50 (60)
T PF12245_consen   22 DGEYTLTVTATDKAGNT--SSSTTQIVIVDN   50 (60)
T ss_pred             CccEEEEEEEEECCCCE--EEeeeEEEEEcC
Confidence            56899999999987743  333444444443


No 38 
>cd00146 PKD polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.
Probab=48.35  E-value=58  Score=20.25  Aligned_cols=18  Identities=0%  Similarity=-0.073  Sum_probs=13.0

Q ss_pred             cCCCCeEEEEEEEEeCCC
Q psy8618          10 REARSSHELVAEARDQGS   27 (232)
Q Consensus        10 ~E~~~~~~l~v~a~d~~~   27 (232)
                      |.....|.+++.++|..+
T Consensus        53 y~~~G~y~v~l~v~d~~g   70 (81)
T cd00146          53 YTKPGTYTVTLTVTNAVG   70 (81)
T ss_pred             cCCCcEEEEEEEEEeCCC
Confidence            455667888888888754


No 39 
>PF02494 HYR:  HYR domain;  InterPro: IPR003410 This domain is known as the HYR (Hyalin Repeat) domain, after the protein hyalin that is composed exclusively of this repeat. This domain probably corresponds to a new superfamily in the immunoglobulin fold. The function of this domain is uncertain it may be involved in cell adhesion. In the Sushi repeat-containing protein (SrpX), this domain is found between two sushi repeats.
Probab=47.30  E-value=52  Score=20.76  Aligned_cols=24  Identities=17%  Similarity=0.292  Sum_probs=13.7

Q ss_pred             eEEEEEEEEeCCCCCceeEEEEEEEE
Q psy8618          15 SHELVAEARDQGSPPRSSRVAVKISV   40 (232)
Q Consensus        15 ~~~l~v~a~d~~~~~~~~~~~v~v~V   40 (232)
                      .+.++..|+|..+  .++.+.+.|.|
T Consensus        58 ~t~V~ytA~D~~G--N~a~C~f~V~V   81 (81)
T PF02494_consen   58 TTTVTYTATDAAG--NSATCSFTVTV   81 (81)
T ss_pred             eEEEEEEEEECCC--CEEEEEEEEEC
Confidence            5566666666544  44555555543


No 40 
>PF03160 Calx-beta:  Calx-beta domain;  InterPro: IPR003644 The calx-beta motif is present as a tandem repeat in the cytoplasmic domains of Calx Na-Ca exchangers, which are used to expel calcium from cells. This motif overlaps domains used for calcium binding and regulation. The calx-beta motif is also present in the cytoplasmic tail of mammalian integrin-beta4, which mediates the bi-directional transfer of signals across the plasma membrane, as well as in some cyanobacterial proteins. This motif contains a series of beta-strands and turns that form a self-contained beta-sheet [, ].; GO: 0007154 cell communication, 0016021 integral to membrane; PDB: 3H6A_B 3FSO_A 3FQ4_B 2DPK_A 2QVM_A 3GIN_B 2QVK_A 2FWU_A 2FWS_A 3E9U_A ....
Probab=46.28  E-value=83  Score=20.60  Aligned_cols=52  Identities=19%  Similarity=0.283  Sum_probs=28.3

Q ss_pred             EEEEEeCCCCCCeecCCCceEEEEecCCCCCcEEEEEEEEeCCCCCCcEEEEEEeeC
Q psy8618          37 KISVTDVNDNAPELVDPQEDVISVREEQPPGTEVVRARAVDKDDGSNATISYSILKG   93 (232)
Q Consensus        37 ~v~V~dvNd~~P~f~~~~~~~~~v~E~~~~g~~v~~v~a~D~D~~~~~~v~y~i~~~   93 (232)
                      +|.|.| ||.+ .+... .....+.|+..  ..-..+.-...+....-.+.|....+
T Consensus         2 tvtI~d-~d~~-~v~f~-~~~~~v~E~~~--~~~v~V~~~~~~~~~~v~v~~~~~~g   53 (100)
T PF03160_consen    2 TVTILD-DDDP-TVSFS-SPSYTVSEGDG--TVTVTVTRSGGSLDGPVTVNYSTVDG   53 (100)
T ss_dssp             EEEEE--TTSE-EEEES-SSEEEEETTSS--EEEEEEEEESS-TSSEEEEEEEEEES
T ss_pred             EEEEEC-CCCC-EEEEe-CCEEEEEeCCC--EEEEEEEEcccCCCcceEEEEEEeCC
Confidence            466777 6644 77665 34678888874  34444444433322334466666555


No 41 
>PF10365 DUF2436:  Domain of unknown function (DUF2436);  InterPro: IPR018832  Gingipains R and K are endopeptidases with specificity for arginyl and lysyl bonds, respectively. Like other cysteine peptidases, they require reducing conditions for activity. They are maximally active at approximately neutral pH. Gingipains R and K are secreted by the bacterium Porphyromonas gingivalis (Bacteroides gingivalis). The bacterium is a major pathogen in periodontal disease, and the many ways in which the activities of the gingipains may contribute to the disease processes have been reviewed []. These enzymes are also involved in the hemagglutinating activity of the organisms.  This entry represents a central region found in gingipain K peptidases, active on lysyl bonds; they belong to the MEROPS peptidase family C25 (gingipain family, clan CD).  
Probab=40.39  E-value=1.2e+02  Score=21.73  Aligned_cols=49  Identities=16%  Similarity=0.351  Sum_probs=27.0

Q ss_pred             EEEEEEccCCCCCCcEEEECCceEEEEee-cCCCccccEEEEEEEEEECCc
Q psy8618         180 MVNYTLGESPSRTNHFYMKSVSGEICISQ-DLDFESRSSYEFPVVATDRGK  229 (232)
Q Consensus       180 ~i~y~l~~~~~~~~~f~i~~~~G~i~~~~-~ld~e~~~~y~l~V~a~D~g~  229 (232)
                      ...|.|... .......|....|.=..+. ..-+|...+|++++.+..+|-
T Consensus       108 ~YDy~I~~P-~~~~kiwIaGd~g~~~tr~dDy~fEAGKtY~ftm~~~g~gD  157 (161)
T PF10365_consen  108 TYDYCIAAP-QPGGKIWIAGDGGDGPTRGDDYVFEAGKTYRFTMKRVGSGD  157 (161)
T ss_pred             eeEEEEecC-CCCCeEEEecCCCCCCccccceEEecCCEEEEEEEeccCCC
Confidence            335666654 3344555543333222211 133477789999999887763


No 42 
>cd02848 Chitinase_N_term Chitinase N-terminus domain. Chitinases hydrolyze the abundant natural biopolymer chitin, producing smaller chito-oligosaccharides. Chitin consists of multiple N-acetyl-D-glucosamine (NAG) residues connected via beta-1,4-glycosidic linkages and is an important structural element of fungal cell wall and arthropod exoskeletons. On the basis of the mode of chitin hydrolysis, chitinases are classified as random, endo-, and exo-chitinases and based on sequence criteria, chitinases belong to families 18 and 19 of glycosyl hydrolases.  The N-terminus of chitinase may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at  either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitob
Probab=38.46  E-value=1.2e+02  Score=20.62  Aligned_cols=33  Identities=15%  Similarity=0.122  Sum_probs=22.3

Q ss_pred             cCCccCCCEEEEEEEEEeCCCCCcceeEEEEEEEE
Q psy8618         116 VLDHEDRSIYRIAVAATDNGFPSRQTVRLLRVEVL  150 (232)
Q Consensus       116 ~ld~e~~~~~~l~v~a~d~~~~~~~~~~~v~i~v~  150 (232)
                      .+++.+.+.|...|.++|...-..++.  +.|.|-
T Consensus        73 t~~v~kgG~y~m~V~lCn~dGCS~S~~--~~I~VA  105 (106)
T cd02848          73 TFKVGKGGRYQMQVALCNGDGCSTSAA--KEIVVA  105 (106)
T ss_pred             EEEeCCCCeEEEEEEEECCCCccCcCC--EEEEec
Confidence            355667788999999999876444443  444443


No 43 
>PF05895 DUF859:  Siphovirus protein of unknown function (DUF859);  InterPro: IPR008577 This entry is represented by Streptococcus phage 7201, Orf39. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This family consists of several uncharacterised proteins from a number of the Siphoviruses as well as some bacterial proteins from Streptococcus species. Some of the members of this family are described as putative minor structural proteins.
Probab=37.10  E-value=3.4e+02  Score=25.02  Aligned_cols=120  Identities=19%  Similarity=0.181  Sum_probs=0.0

Q ss_pred             eEEEEEEEEeCCCCCceeEEEEEEEEEeCCCCCCeecCCCceEEEEecCCCCCcEEEEEE--EEeCCCCCCcEEEEEEee
Q psy8618          15 SHELVAEARDQGSPPRSSRVAVKISVTDVNDNAPELVDPQEDVISVREEQPPGTEVVRAR--AVDKDDGSNATISYSILK   92 (232)
Q Consensus        15 ~~~l~v~a~d~~~~~~~~~~~v~v~V~dvNd~~P~f~~~~~~~~~v~E~~~~g~~v~~v~--a~D~D~~~~~~v~y~i~~   92 (232)
                      ...+++.++|..+ ..+...+.+|+|++-  ++|.+... .+...-.++...-+.-+.+.  ..+.-......+.|+...
T Consensus       299 ~~Ti~atVtDSRG-r~S~~~~~tItVl~Y--~~P~lsfs-v~R~~~~~~~~~v~~~a~Iapl~v~g~qKN~~~lt~~~a~  374 (624)
T PF05895_consen  299 SATIRATVTDSRG-RTSDPKTKTITVLEY--SPPTLSFS-VYRCGSSGNTLTVTRNAKIAPLTVNGVQKNTMTLTFKVAP  374 (624)
T ss_pred             eEEEEEEEEECCC-ccCCceEEEEEEEEc--CCCcEEEE-EEEeCCCCcEEEEEEEEEEeEEEEcccccceEEEEEEEEE


Q ss_pred             CCCCCCcccEEEeCCceE-------------EEECccCCccCCCEEEEEEEEEeCCCCCcceeEEEEEEE
Q psy8618          93 GRDTDGYGVFTIDPISGV-------------IRTKSVLDHEDRSIYRIAVAATDNGFPSRQTVRLLRVEV  149 (232)
Q Consensus        93 ~~~~~~~~~f~id~~tG~-------------i~~~~~ld~e~~~~~~l~v~a~d~~~~~~~~~~~v~i~v  149 (232)
                          -....|.+|.....             ..+...+|  ....|.+.+.++|.     -....+...|
T Consensus       375 ----~gt~~~t~d~~~a~~~~s~~s~~~~~~~~L~g~y~--~~kSy~V~~~l~D~-----F~s~t~~~~V  433 (624)
T PF05895_consen  375 ----LGTGTFTTDNGSASGTWSSISELTNSSANLGGTYD--AEKSYDVRGTLSDK-----FTSTTFTVTV  433 (624)
T ss_pred             ----cCcceEEEEccccccceeeeeeecccceeeccccC--CCceEEEEEEEEEE-----eeeEEEEEEc


No 44 
>PF13860 FlgD_ig:  FlgD Ig-like domain; PDB: 3C12_A 3OSV_A.
Probab=34.53  E-value=1.2e+02  Score=19.14  Aligned_cols=14  Identities=50%  Similarity=0.809  Sum_probs=11.6

Q ss_pred             ccEEEEEEEEEECC
Q psy8618         215 RSSYEFPVVATDRG  228 (232)
Q Consensus       215 ~~~y~l~V~a~D~g  228 (232)
                      .+.|.+.|.|+|.|
T Consensus        68 ~G~Y~~~v~a~~~g   81 (81)
T PF13860_consen   68 DGTYTFRVTATDGG   81 (81)
T ss_dssp             SEEEEEEEEEEET-
T ss_pred             CCCEEEEEEEEeCC
Confidence            46899999999987


No 45 
>PF09100 Qn_am_d_aIV:  Quinohemoprotein amine dehydrogenase, alpha subunit domain IV;  InterPro: IPR015184 This domain is predominantly found in the prokaryotic protein quinohemoprotein amine dehydrogenase, adopting an immunoglobulin-like beta-sandwich fold, with seven strands arranged into two beta sheets; the fold is possibly related to the immunoglobulin and/or fibronectin type III superfamilies. The precise function of this domain has not, as yet, been defined []. ; PDB: 1JMZ_A 1JMX_A 1PBY_A 1JJU_A.
Probab=32.15  E-value=1.4e+02  Score=21.03  Aligned_cols=31  Identities=23%  Similarity=0.364  Sum_probs=15.4

Q ss_pred             EEEEEEEe-CCCCCceeEEEEEEEEEeCCCCCC
Q psy8618          17 ELVAEARD-QGSPPRSSRVAVKISVTDVNDNAP   48 (232)
Q Consensus        17 ~l~v~a~d-~~~~~~~~~~~v~v~V~dvNd~~P   48 (232)
                      .|.|.|+= .++.+++....+.|+|..-|+ +|
T Consensus       101 nl~VvAtv~d~~~~l~~e~~liVtVqr~~~-pp  132 (133)
T PF09100_consen  101 NLKVVATVKDGGKPLTGEAHLIVTVQRWNN-PP  132 (133)
T ss_dssp             EEEEEEEETTTT---EEEEEEEEE---S----S
T ss_pred             cEEEEEEEccCCcccceeEeEEEEeecccC-CC
Confidence            45566653 234578899999999998888 66


No 46 
>PF00801 PKD:  PKD domain;  InterPro: IPR000601 The PKD (Polycystic Kidney Disease) domain was first identified in the Polycystic Kidney Disease protein, polycystin-1 (PDK1 gene), and contains an Ig-like fold consisting of a beta-sandwich of seven strands in two sheets with a Greek key topology, although some members have additional strands []. Polycystin-1 is a large cell-surface glycoprotein involved in adhesive protein-protein and protein-carbohydrate interactions; however it is not clear if the PKD domain mediates any of these interactions.  PKD domains are also found in other proteins, usually in the extracellular parts of proteins involved in interactions with other proteins. For example, domains with a PKD-type fold are found in archaeal surface layer proteins that protect the cell from extreme environments [], and in the human VPS10 domain-containing receptor SorCS2 [].; PDB: 1B4R_A 2KZW_A 2C4X_A 2C26_A 2Y72_B 3JQU_A 3JS7_B 1WGO_A 1L0Q_A.
Probab=29.22  E-value=1.3e+02  Score=17.99  Aligned_cols=18  Identities=17%  Similarity=0.327  Sum_probs=14.5

Q ss_pred             CccccEEEEEEEEEECCc
Q psy8618         212 FESRSSYEFPVVATDRGK  229 (232)
Q Consensus       212 ~e~~~~y~l~V~a~D~g~  229 (232)
                      |...+.|.+.|.|+|.-.
T Consensus        48 y~~~G~y~V~ltv~n~~g   65 (69)
T PF00801_consen   48 YSSPGTYTVTLTVTNGVG   65 (69)
T ss_dssp             ESSSEEEEEEEEEEETTS
T ss_pred             cCCCeEEEEEEEEEECCC
Confidence            456789999999999753


No 47 
>PF13754 Big_3_4:  Bacterial Ig-like domain (group 3)
Probab=26.80  E-value=1.4e+02  Score=17.29  Aligned_cols=17  Identities=35%  Similarity=0.376  Sum_probs=13.7

Q ss_pred             CCCEEEEEEEEEeCCCC
Q psy8618         121 DRSIYRIAVAATDNGFP  137 (232)
Q Consensus       121 ~~~~~~l~v~a~d~~~~  137 (232)
                      ..+.|.+.+.|+|....
T Consensus        22 ~dG~y~itv~a~D~AGN   38 (54)
T PF13754_consen   22 ADGTYTITVTATDAAGN   38 (54)
T ss_pred             CCccEEEEEEEEeCCCC
Confidence            45789999999997553


No 48 
>COG3354 FlaG Putative archaeal flagellar protein G [Cell motility and secretion]
Probab=26.16  E-value=2.6e+02  Score=20.29  Aligned_cols=54  Identities=13%  Similarity=0.109  Sum_probs=32.2

Q ss_pred             EEeCCCCCcce---eEEEEEEEEecCCCCCeeeCCceEEEEEeecCCCCCCeEEEEEEcc
Q psy8618         131 ATDNGFPSRQT---VRLLRVEVLDLNDNRPTFTSSSLLFKVSASDPDCGVNAMVNYTLGE  187 (232)
Q Consensus       131 a~d~~~~~~~~---~~~v~i~v~d~Nd~~P~f~~~~~~~~v~~~D~d~~~~~~i~y~l~~  187 (232)
                      ..|.+.++...   ..++++.|.+.-+-++.|......+-|..   .......+.|.-..
T Consensus        55 IndPg~i~~~~~~g~~t~t~yiKNtG~~~~~fd~~sitVliDG---~iv~~a~~~~~~~~  111 (154)
T COG3354          55 INDPGQIPYVGTDGPYTYTFYIKNTGSDSIAFDNTSITVLIDG---NIVTPAYVTFTSVN  111 (154)
T ss_pred             ecCCCCCccccCCCceEEEEEEecCCCcccccCCCeEEEEEcC---cEeccceEEEEecC
Confidence            34555554433   34677888888777888887775555532   33445555665543


No 49 
>PF14302 DUF4377:  Domain of unknown function (DUF4377)
Probab=26.12  E-value=1.2e+02  Score=19.39  Aligned_cols=25  Identities=16%  Similarity=0.288  Sum_probs=19.7

Q ss_pred             ecCCCccccEEEEEEEEEECCccCC
Q psy8618         208 QDLDFESRSSYEFPVVATDRGKETQ  232 (232)
Q Consensus       208 ~~ld~e~~~~y~l~V~a~D~g~~~q  232 (232)
                      ..++||..-.|.|.|..+.--+|||
T Consensus        40 eGF~yE~Gy~Y~L~Vk~~~~~nppa   64 (80)
T PF14302_consen   40 EGFEYEPGYEYVLRVKRTPVANPPA   64 (80)
T ss_pred             cCcCcCCCcEEEEEEEEEECCCCCC
Confidence            4568888888999988887777765


No 50 
>PF08329 ChitinaseA_N:  Chitinase A, N-terminal domain;  InterPro: IPR013540 This domain is found in a number of bacterial chitinases and similar viral proteins. It is organised into a fibronectin III module domain-like fold, comprising only beta strands. Its function is not known, but it may be involved in interaction with the enzyme substrate, chitin [, ]. It is separated by a hinge region from the catalytic domain (IPR001223 from INTERPRO); this hinge region is probably mobile, allowing the N-terminal domain to have different relative positions in solution []. ; GO: 0004568 chitinase activity; PDB: 2WLY_A 1EDQ_A 2WM0_A 1X6N_A 1NH6_A 2WK2_A 1EHN_A 2WLZ_A 1EIB_A 1FFR_A ....
Probab=25.00  E-value=1.6e+02  Score=20.96  Aligned_cols=35  Identities=14%  Similarity=0.190  Sum_probs=21.2

Q ss_pred             CcCCCCeEEEEEEEEeCCCCCceeEEEEEEEEEeCCC
Q psy8618           9 DREARSSHELVAEARDQGSPPRSSRVAVKISVTDVND   45 (232)
Q Consensus         9 D~E~~~~~~l~v~a~d~~~~~~~~~~~v~v~V~dvNd   45 (232)
                      ....--.|.++|.+.+..+  -+....+.|.|.|.+=
T Consensus        78 ~~~~gG~y~~~VeLCN~~G--CS~S~~~~V~VaDTDG  112 (133)
T PF08329_consen   78 TVTKGGRYQMQVELCNADG--CSTSAPVEVVVADTDG  112 (133)
T ss_dssp             EE-S-EEEEEEEEEEETTE--EEE---EEEEEE-TTS
T ss_pred             EecCCCEEEEEEEEECCCC--cccCCCEEEEEeCCCc
Confidence            3445568999999998776  3445577778887754


No 51 
>PF03413 PepSY:  Peptidase propeptide and YPEB domain This Prosite motif covers only the active site. This is family M4 in the peptidase classification. ;  InterPro: IPR005075  This signature, PepSY, is found in the propeptide of members of the MEROPS peptidase family M4 (clan MA(E)), which contains the thermostable thermolysins (3.4.24.27 from EC), and related thermolabile neutral proteases (bacillolysins) (3.4.24.28 from EC) from various species of Bacillus. It is also in many non-peptidase proteins, including Bacillus subtilis YpeB protein - a regulator of SleB spore cortex lytic enzyme - and a large number of eubacterial and archaeal cell wall-associated and secreted proteins which are mostly annotated as 'hypothetical protein'. Many extracellular bacterial proteases are produced as proenzymes. The propeptides usually have a dual function, i.e. they function as an intramolecular chaperone required for the folding of the polypeptide and as an inhibitor preventing premature activation of the enzyme. Analysis of the propeptide region of the M4 family of peptidases reveals two regions of conservation, the PepSY domain and a second domain, proximate to the N terminus, the FTP domain (IPR011096 from INTERPRO), which is also found in isolation in the propeptide of eukaryotic peptidases belong to MEROPS peptidase family M36.  Propeptide domain swapping experiments, for example swapping the propeptide domain of PA protease with that of vibrolysin, both propeptides contain the FTP and PepSY domains, allows the PA protease domain to fold correctly and inhibits the C-terminal autoprocessing activity. However, swapping the propeptide of PA protease for the thermolysin propeptide, does not facilitate the correct folding nor the processing of the chimaeric protein into an active peptidase []. Mutational analysis of the Pseudomonas aeruginosa elastase gene revealed two mutations in the propeptide which resulted in the loss of inhibitory activity but not chaperone activity: A-15V and T-153I (where +1 is defined as the first residue of the mature peptidase). Both mutations resulted in peptidase activity, the T-153V mutation being much less effective than the A-15I mutation [] in activating peptidase activity. The T-153V mutation lies N-terminal to the FTP domain while the A-15I mutation is C-terminal to the PepSY domain.  Given the diverse range of other proteins, both domains occur in in isolation, the exact function of each is still unclear; though it has been proposed that the PepSY domain primarily has inhibitory activity and in conjunction with the FTP domain in chaperone activity. ; GO: 0008237 metallopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0005576 extracellular region; PDB: 2GU3_A 3NQZ_A 3NQY_A 2KGY_A.
Probab=24.64  E-value=1.4e+02  Score=17.19  Aligned_cols=14  Identities=21%  Similarity=0.221  Sum_probs=8.4

Q ss_pred             ccEEEeCCceEEEE
Q psy8618         100 GVFTIDPISGVIRT  113 (232)
Q Consensus       100 ~~f~id~~tG~i~~  113 (232)
                      ..+.||+.||.|.-
T Consensus        49 ~~v~VDa~tG~Il~   62 (64)
T PF03413_consen   49 YEVYVDAYTGEILS   62 (64)
T ss_dssp             EEEEEETTT--EEE
T ss_pred             EEEEEECCCCeEEE
Confidence            44559999998753


No 52 
>PF13753 SWM_repeat:  Putative flagellar system-associated repeat
Probab=23.77  E-value=4.1e+02  Score=21.74  Aligned_cols=107  Identities=19%  Similarity=0.254  Sum_probs=51.9

Q ss_pred             CCeEEEEEEEEeCCCCCceeEEEEEEEEEeCCCCCCeecCCCceEEEEecCCC------CCcEEEEEEEEeCCCCCCcEE
Q psy8618          13 RSSHELVAEARDQGSPPRSSRVAVKISVTDVNDNAPELVDPQEDVISVREEQP------PGTEVVRARAVDKDDGSNATI   86 (232)
Q Consensus        13 ~~~~~l~v~a~d~~~~~~~~~~~v~v~V~dvNd~~P~f~~~~~~~~~v~E~~~------~g~~v~~v~a~D~D~~~~~~v   86 (232)
                      -..|.+.+.++|..+....  ....+.|-   -.+|.....     .+.++..      .........+.+.+  .+..+
T Consensus        11 d~~~~v~vt~tD~aGN~~~--~t~~~~vD---t~~P~v~i~-----~~~~~~~~~~~~~~~~~t~s~tvs~~~--~g~~v   78 (317)
T PF13753_consen   11 DGTYTVSVTVTDAAGNTST--ATQSITVD---TTAPTVTIT-----SIADDDIINGDEATNTVTFSGTVSGAE--PGSTV   78 (317)
T ss_pred             CCcEEEEEEEEeCCCCeee--eeEEEEEe---cCCCceeee-----cccCCCccccceeeeeeEEEEEecCCC--CCCEE
Confidence            4678999999998775443  33333322   226633221     1111111      12223444444443  34456


Q ss_pred             EEEEeeCCCCCCcccEEEeCCceEEEECccC-CccCCCEEEEEEE-EEeCCCC
Q psy8618          87 SYSILKGRDTDGYGVFTIDPISGVIRTKSVL-DHEDRSIYRIAVA-ATDNGFP  137 (232)
Q Consensus        87 ~y~i~~~~~~~~~~~f~id~~tG~i~~~~~l-d~e~~~~~~l~v~-a~d~~~~  137 (232)
                      ...+ .+    ....+..+ ..|.....-.. +.-..+.|.+.+. ++|....
T Consensus        79 ~v~~-~g----~~~t~~~~-~~G~ws~t~~~~~~l~~g~~ti~v~~~tD~aGN  125 (317)
T PF13753_consen   79 TVTI-NG----TTGTLTAD-ADGNWSVTVTPSDDLPDGDYTITVTTVTDAAGN  125 (317)
T ss_pred             EEEE-CC----EEEEEEEe-cCCcEEEeeccccccccCcceeEEEEEEccCCc
Confidence            6666 22    12223333 35543332221 1223457889998 9997553


Done!