Query         psy6838
Match_columns 442
No_of_seqs    311 out of 1620
Neff          9.6 
Searched_HMMs 46136
Date          Fri Aug 16 19:42:46 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy6838.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/6838hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG4289|consensus              100.0 1.3E-74 2.9E-79  582.7  40.7  433    1-439   259-933 (2531)
  2 KOG4289|consensus              100.0 5.7E-67 1.2E-71  527.6  42.5  425    6-439   159-827 (2531)
  3 KOG1219|consensus              100.0 3.4E-64 7.4E-69  521.8  47.1  437    1-442   407-1106(4289)
  4 KOG1219|consensus              100.0 5.5E-62 1.2E-66  505.5  45.8  429    1-442  2168-2829(4289)
  5 cd00031 CA Cadherin repeat dom 100.0 9.1E-31   2E-35  233.7  27.9  197  162-390     2-199 (199)
  6 cd00031 CA Cadherin repeat dom 100.0 1.5E-30 3.2E-35  232.3  28.8  196   16-253     2-199 (199)
  7 KOG1834|consensus               99.8 1.5E-17 3.2E-22  160.3  18.9  210  145-391    20-245 (952)
  8 PF00028 Cadherin:  Cadherin do  99.8 1.4E-17   3E-22  129.1  14.3   92  298-389     1-93  (93)
  9 PF00028 Cadherin:  Cadherin do  99.7 1.2E-16 2.6E-21  123.8  12.5   91  162-252     1-93  (93)
 10 smart00112 CA Cadherin repeats  99.7 4.4E-16 9.6E-21  116.7  11.0   79  318-396     1-79  (79)
 11 KOG1834|consensus               99.6 2.1E-14 4.5E-19  138.9  20.3  152  100-254    86-245 (952)
 12 smart00112 CA Cadherin repeats  99.6 2.2E-15 4.8E-20  112.9  10.6   78  182-259     1-79  (79)
 13 PF08758 Cadherin_pro:  Cadheri  97.5  0.0012 2.5E-08   49.9   9.7   78  290-375     3-80  (90)
 14 PF08266 Cadherin_2:  Cadherin-  97.4 8.7E-05 1.9E-09   55.3   2.5   63   15-78      2-66  (84)
 15 PF08266 Cadherin_2:  Cadherin-  97.4 9.9E-05 2.2E-09   55.0   2.5   62  298-360     3-66  (84)
 16 PF08758 Cadherin_pro:  Cadheri  97.1  0.0084 1.8E-07   45.3  10.1   79    8-133     3-81  (90)
 17 smart00736 CADG Dystroglycan-t  97.0   0.014 3.1E-07   45.0  11.0   69  318-393    25-96  (97)
 18 TIGR01965 VCBS_repeat VCBS rep  97.0  0.0084 1.8E-07   45.9   9.1   87  313-410     2-97  (99)
 19 TIGR01965 VCBS_repeat VCBS rep  96.8   0.015 3.1E-07   44.6   9.2   50  111-170    49-98  (99)
 20 smart00736 CADG Dystroglycan-t  96.7   0.035 7.7E-07   42.8  11.1   31  118-153    66-96  (97)
 21 PF13750 Big_3_3:  Bacterial Ig  95.9    0.86 1.9E-05   38.5  16.3  126  118-252    14-148 (158)
 22 PF13750 Big_3_3:  Bacterial Ig  95.6     1.2 2.5E-05   37.8  17.3   76  314-392    70-151 (158)
 23 TIGR00845 caca sodium/calcium   93.0      16 0.00035   39.7  21.0   48  151-200   395-443 (928)
 24 PF05345 He_PIG:  Putative Ig d  92.4    0.52 1.1E-05   31.1   5.5   37  336-374    11-48  (49)
 25 TIGR00845 caca sodium/calcium   90.7     8.1 0.00018   41.9  15.0   53  382-437   515-569 (928)
 26 TIGR03660 T1SS_rpt_143 T1SS-14  88.6      12 0.00027   30.6  11.8   57  106-170    69-128 (137)
 27 TIGR00864 PCC polycystin catio  88.1      75  0.0016   39.0  30.0  112   25-151  1482-1594(2740)
 28 PF07495 Y_Y_Y:  Y_Y_Y domain;   87.9     6.1 0.00013   27.5   8.4   60  188-251     6-65  (66)
 29 PF07495 Y_Y_Y:  Y_Y_Y domain;   87.1       8 0.00017   26.9   8.6   60  325-389     7-66  (66)
 30 TIGR03660 T1SS_rpt_143 T1SS-14  79.5      23  0.0005   29.1   9.3   56  349-410    69-127 (137)
 31 PF02010 REJ:  REJ domain;  Int  79.5     3.2 6.9E-05   41.6   5.3   69  359-436   270-340 (440)
 32 PF05345 He_PIG:  Putative Ig d  78.8      11 0.00025   24.6   6.0   35   96-132    14-49  (49)
 33 KOG4221|consensus               77.0 1.3E+02  0.0029   33.8  18.1   59  336-397   562-620 (1381)
 34 PF12245 Big_3_2:  Bacterial Ig  58.5      45 0.00098   22.8   5.8   30  361-392    22-51  (60)
 35 PF13753 SWM_repeat:  Putative   55.8 1.9E+02   0.004   27.5  18.1  111  117-239    10-125 (317)
 36 cd02848 Chitinase_N_term Chiti  55.0      25 0.00053   27.4   4.3   34  355-390    73-106 (106)
 37 cd00146 PKD polycystic kidney   54.7      30 0.00065   25.0   4.8   31  356-388    51-81  (81)
 38 TIGR00864 PCC polycystin catio  54.7 5.3E+02   0.012   32.4  35.3  194   25-254  1398-1594(2740)
 39 PF12245 Big_3_2:  Bacterial Ig  52.8      59  0.0013   22.2   5.6   30  118-152    22-51  (60)
 40 smart00089 PKD Repeats in poly  51.7      36 0.00079   24.4   4.8   31  355-388    48-78  (79)
 41 PF13754 Big_3_4:  Bacterial Ig  49.0      63  0.0014   21.5   5.2   16  360-375    22-37  (54)
 42 COG2706 3-carboxymuconate cycl  47.6 1.5E+02  0.0032   28.4   8.9   34   43-76    254-288 (346)
 43 PF02494 HYR:  HYR domain;  Int  46.8      42 0.00091   24.4   4.4   25  362-388    57-81  (81)
 44 PF08329 ChitinaseA_N:  Chitina  45.6      72  0.0016   26.0   5.8   51  355-411    76-126 (133)
 45 KOG4221|consensus               41.5 5.9E+02   0.013   29.1  22.5  137  104-260   481-620 (1381)
 46 cd00146 PKD polycystic kidney   40.7      67  0.0015   23.1   4.7   29  114-147    52-80  (81)
 47 smart00089 PKD Repeats in poly  38.0      82  0.0018   22.5   4.8   30  219-251    49-78  (79)
 48 PF03160 Calx-beta:  Calx-beta   36.6 1.8E+02   0.004   21.8  10.7   24  248-273     2-25  (100)
 49 PF02494 HYR:  HYR domain;  Int  32.3      95  0.0021   22.5   4.4   24  120-148    58-81  (81)
 50 cd02848 Chitinase_N_term Chiti  31.7 1.1E+02  0.0023   23.9   4.5   34  112-150    73-106 (106)
 51 PF13753 SWM_repeat:  Putative   27.1 5.4E+02   0.012   24.3  12.7   26  223-250    10-35  (317)
 52 PF03174 CHB_HEX_C:  Chitobiase  24.3 1.6E+02  0.0034   21.0   4.2   58   66-133    10-67  (75)
 53 PF08329 ChitinaseA_N:  Chitina  23.9 2.8E+02  0.0061   22.6   5.9   36  112-152    76-111 (133)
 54 PF13290 CHB_HEX_C_1:  Chitobia  22.6 2.8E+02  0.0061   19.4   5.2   44   80-133    18-61  (67)
 55 TIGR03769 P_ac_wall_RPT actino  20.1   2E+02  0.0044   18.0   3.5   14  358-371     9-22  (41)

No 1  
>KOG4289|consensus
Probab=100.00  E-value=1.3e-74  Score=582.66  Aligned_cols=433  Identities=37%  Similarity=0.594  Sum_probs=416.9

Q ss_pred             CCcCCCCCCccCCCeEEEEEecCCCCCcEEEEEEeeeCCCCCCcEEEEEEEeCCCCCcEEEcCCccEEEEeeeccccccc
Q psy6838           1 VSDTNDNSPVFSEPVYAFDVLEDLPRGSRVGQVFTTDADEGVNGVVTYSVVSDWANDVFSINPQSGIFTLTAKLDYEDVS   80 (442)
Q Consensus         1 V~DvNDn~P~f~~~~~~~~v~E~~~~g~~v~~~~a~D~D~~~~~~~~y~l~~~~~~~~f~id~~tg~~~~~~~ld~~~~~   80 (442)
                      |.|+|||.|+|.+.+|..++.||.++|..|.++.|+|.|.++|+.+.|++..+++...|.||+++|.|+..+.||+|...
T Consensus       259 V~D~nDhsPvFEq~~Y~e~lREn~evGy~vLtvrAtD~Dsp~Nani~Yrl~eg~~~~~f~in~rSGvI~T~a~lDRE~~~  338 (2531)
T KOG4289|consen  259 VLDTNDHSPVFEQDEYREELRENLEVGYEVLTVRATDGDSPPNANIRYRLLEGNAKNVFEINPRSGVISTRAPLDREELE  338 (2531)
T ss_pred             EeecCCCCcccchhHHHHHHhhccccCceEEEEEeccCCCCCCCceEEEecCCCccceeEEcCccceeeccCccCHHhhh
Confidence            67999999999999999999999999999999999999999999999999999999999999999999999999998875


Q ss_pred             ------------------------------------------------------------------CCCcEEEEEEeCCC
Q psy6838          81 ------------------------------------------------------------------DNGRLEYSIVSGDD   94 (442)
Q Consensus        81 ------------------------------------------------------------------~~~~i~y~i~~~~~   94 (442)
                                                                                        .|+.+.|+|.+|..
T Consensus       339 ~y~L~VeAsDqG~~pgp~Ta~V~itV~D~NDNaPqFse~~Yvvqv~Edvt~~avvlrV~AtDrD~g~Ng~VHYsi~Sgn~  418 (2531)
T KOG4289|consen  339 SYQLDVEASDQGRPPGPRTAMVEITVEDENDNAPQFSEKRYVVQVREDVTPPAVVLRVTATDRDKGTNGKVHYSIASGNG  418 (2531)
T ss_pred             heEEEEEeccCCCCCCCceEEEEEEEEecCCCCccccccceEEEecccCCCCceEEEEEecccCCCcCceEEEEeeccCc
Confidence                                                                              78899999999999


Q ss_pred             CCcEEEeCCccEEEEcccCCcccCcEEEEEEEEEeCCCCCCCCceeEEEEEEEEEeccCCCCeeccC-CeeEeeCCCCCC
Q psy6838          95 EEIFTINPSNGTLSNLKHLDRETKSSYNLVVMATDMAKPPQKRLSSTTQVSIVIRDVNDNAPEFITA-NETTIPENSPLN  173 (442)
Q Consensus        95 ~~~F~i~~~~g~l~~~~~lD~E~~~~~~l~v~a~D~~~~~~~~~s~~~~v~I~V~dvNd~~P~f~~~-~~~~v~E~~~~g  173 (442)
                      -+.|.||..+|+|.+..+||+|.. .|+++|+|.|++.|+   ++.+.-+.|+|.|+|||+|.|... ...+|.|+.+.|
T Consensus       419 ~G~f~id~~tGel~vv~plD~e~~-~ytl~IrAqDggrPp---Lsn~sgl~iqVlDINDhaPifvstpfq~tvlEnv~lg  494 (2531)
T KOG4289|consen  419 RGQFYIDSLTGELDVVEPLDFENS-EYTLRIRAQDGGRPP---LSNTSGLVIQVLDINDHAPIFVSTPFQATVLENVPLG  494 (2531)
T ss_pred             cccEEEecccceEEEeccccccCC-eeEEEEEcccCCCCC---ccCCCceEEEEEecCCCCceeEechhhhhhhhccccc
Confidence            999999999999999999999998 999999999999987   888888889999999999999876 889999999999


Q ss_pred             cEEEEEEEEeCCCCCCceEEEEEecCCCCCeEecCcceEEEEcccCCcccccEEEEEEEEEeCCCCCCceeEEEEEEEEe
Q psy6838         174 TVVAALKAIDRDEGQNSYVEYSIINSEATPFSLGPVDGLLRVAQKLDREIKSNYTLEIKAKDRGEPSKSSRTKFFIRLLD  253 (442)
Q Consensus       174 ~~v~~v~a~D~D~~~~~~~~y~i~~~~~~~F~id~~tG~i~~~~~lD~e~~~~~~l~V~a~D~~~~~~~~~~~v~I~V~d  253 (442)
                      ..++.+.|.|.|.|.|+.+.|++.+-.  +|.|+..+|+|++.+.||||+.+.|.|.|.|+|.|.|++++.+.|.|.+.|
T Consensus       495 ~~v~~vqaidadsg~na~l~y~laG~~--pf~I~~~SG~Itvtk~ldrEt~~~ysl~V~ard~gtp~l~tstsI~Vtv~d  572 (2531)
T KOG4289|consen  495 YLVCHVQAIDADSGENARLHYSLAGVG--PFQINNGSGWITVTKELDRETVEHYSLGVEARDHGTPPLSTSTSISVTVLD  572 (2531)
T ss_pred             ceEEEEecccCCCCcccceeeeeccCC--CeeEecCCceEEEeecccccccceEEEEEEEcCCCCCcccccceEEEEecc
Confidence            999999999999999999999998864  899999999999999999999999999999999999999999999999999


Q ss_pred             CCCCCCeeeccceeeeccccccccceEEE---------------------------------------------------
Q psy6838         254 ENDNNPIFDSKSYSASVPENISVGATVLQ---------------------------------------------------  282 (442)
Q Consensus       254 ~nd~~P~f~~~~~~~~v~e~~~~g~~v~~---------------------------------------------------  282 (442)
                      +|||.|.|++..|++.+.|+.+.|+.|++                                                   
T Consensus       573 vndndP~Ft~~eytl~inED~pvgsSI~tvtAvD~d~~s~ityqi~g~ntrn~Fsi~si~g~Glitlalp~dkKqe~~~v  652 (2531)
T KOG4289|consen  573 VNDNDPTFTQKEYTLRINEDAPVGSSIVTVTAVDRDANSVITYQITGGNTRNRFSISSIGGGGLITLALPLDKKQERQYV  652 (2531)
T ss_pred             cCCCCCccccCceEEEecCCccccceEEEEEEeccccccceEEEecCCcccccceeeccCCcceEEeecchhhcccceEE
Confidence            99999999999999999999999998876                                                   


Q ss_pred             --------------------------------------------------------------------------------
Q psy6838         283 --------------------------------------------------------------------------------  282 (442)
Q Consensus       283 --------------------------------------------------------------------------------  282 (442)
                                                                                                      
T Consensus       653 l~vtAtDg~l~d~~~V~v~I~danThrpvFqs~pfTvsI~e~rP~G~tvvtlsasd~D~geNARI~y~led~~Frid~ds  732 (2531)
T KOG4289|consen  653 LAVTATDGTLQDTCSVNVNITDANTHRPVFQSSPFTVSINEDRPLGTTVVTLSASDEDTGENARITYILEDEAFRIDPDS  732 (2531)
T ss_pred             EEEEecCCccccceEEEEEeeecccCCcccccCCeeEeeccCCcCCceeEEEecccCCCCccceEEEEecccceeecCCC
Confidence                                                                                            


Q ss_pred             ----------------------------------------EeecCCCCCccCCcceeEEEecCCCCCcEEEEEEEEeCCC
Q psy6838         283 ----------------------------------------VSDTNDNSPVFSEPVYAFDVLEDLPRGSRVGQVFTTDADE  322 (442)
Q Consensus       283 ----------------------------------------v~~~n~~~p~~~~~~~~~~v~E~~~~g~~v~~v~a~D~D~  322 (442)
                                                              |.++||++|.|..+.|+++|.|+.|+++.+++|+|+|.|.
T Consensus       733 g~i~t~~~ld~edqvtytl~itA~D~~~pq~adtttveV~v~diNDnaPqf~assyt~sV~Ed~Pv~TsvlQVSatDaD~  812 (2531)
T KOG4289|consen  733 GAIYTQAELDYEDQVTYTLAITARDNGIPQKADTTTVEVLVNDINDNAPQFLASSYTGSVFEDAPVFTSVLQVSATDADS  812 (2531)
T ss_pred             CceEEeeeeecccceeeEeeeeecCCCCCCcCccEEEEEEeecccccCcccchhhceeEeecCCCCcceEEEEEEeccCC
Confidence                                                    6789999999999999999999999999999999999999


Q ss_pred             CCCeEEEEEEEeC-CCCCCeEEeCCccEEEEcccCCCccCcEEEEEEEEEECCCCCceEEEEEEEEEeeCCCCCCccCCC
Q psy6838         323 GVNGVVTYSVVSD-WANDVFSINPQSGIFTLTAKLDYEDVHHYIFVVQAQDTGKPMMSSTVTVYFNVVDLNDNPPLFDPM  401 (442)
Q Consensus       323 ~~~~~i~y~l~~~-~~~~~f~id~~tG~l~~~~~ld~e~~~~~~l~V~a~D~~~~~~~~~~~v~I~V~dvNd~~P~f~~~  401 (442)
                      +.|+.+.|.+.++ +..+.|.|++++|.|++.+.||||....|.|.+.|.|.|.|++++.+.|+|+|.|+|||||.|.+.
T Consensus       813 g~Ng~v~y~~qg~~d~p~~F~IEptSGviRtl~rLdRE~~avy~L~a~avDrg~p~ls~~~eItvtvldvNDnaPvfe~~  892 (2531)
T KOG4289|consen  813 GPNGRVYYTFQGGDDGPGDFYIEPTSGVIRTLRRLDRENVAVYVLAAYAVDRGNPPLSAPVEITVTVLDVNDNAPVFEQD  892 (2531)
T ss_pred             CCCceEEEEecCCCCCCCceEEccCcceeehhhhhcchheeEEEEEEEEeeCCCCCcCCceEEEEEEEecCCCCCCCCCc
Confidence            9999999999755 345799999999999999999999999999999999999999999999999999999999999999


Q ss_pred             CCeeEEecCCCCCceEEEEEEEeCCCC---eeEEEEeeCCC
Q psy6838         402 PPNIPVYENVTLGTGLASVHAVDLDSG---KLPFLLPIGDF  439 (442)
Q Consensus       402 ~~~~~v~E~~~~g~~v~~v~a~D~D~~---~~~y~i~~~~~  439 (442)
                      +++..|.||.++|..+++|+|.|||+|   .|+|+|.+|+.
T Consensus       893 e~e~~I~enspvgs~va~i~a~dpdEG~NA~IsYqIvgg~d  933 (2531)
T KOG4289|consen  893 ELELFIEENSPVGSVVALITADDPDEGPNAHISYQIVGGND  933 (2531)
T ss_pred             ceeeEEeecCccceeeEEEEccCCCcCCcceEEEeeccCcc
Confidence            999999999999999999999999999   89999999975


No 2  
>KOG4289|consensus
Probab=100.00  E-value=5.7e-67  Score=527.65  Aligned_cols=425  Identities=36%  Similarity=0.577  Sum_probs=393.4

Q ss_pred             CCCCccCCCeEEEEEecCCCCCcEEEEEEeeeCCCCCCcEEEEEEEeCC---CCCcEEEcCCccEEEEeeeccccccc--
Q psy6838           6 DNSPVFSEPVYAFDVLEDLPRGSRVGQVFTTDADEGVNGVVTYSVVSDW---ANDVFSINPQSGIFTLTAKLDYEDVS--   80 (442)
Q Consensus         6 Dn~P~f~~~~~~~~v~E~~~~g~~v~~~~a~D~D~~~~~~~~y~l~~~~---~~~~f~id~~tg~~~~~~~ld~~~~~--   80 (442)
                      -|+|.|.+..|+..++||.|+||.|+++.|.|+|.   +.+.|++..--   ..++|+||+.+|++++...||+|.+.  
T Consensus       159 ~~~~~Fqq~~Yq~~lpEn~pagT~iasv~A~~~~a---~rl~Ysm~al~dsRS~~lFslD~~sG~irta~~lDREt~e~H  235 (2531)
T KOG4289|consen  159 ANAVQFQQPNYQKELPENEPAGTIIASVKASDPDA---GRLYYSMVALFDSRSQNLFSLDPMSGAIRTAKSLDRETKETH  235 (2531)
T ss_pred             CCCccCCCcchhccCcCCCCCCceeEEEEecCCCc---CceEEEeeeccchhccccEeeccccccchhhhhhhhhhhhee
Confidence            46899999999999999999999999999999994   47999997542   35899999999999999999999876  


Q ss_pred             ----------------------------------------------------------------CCCcEEEEEEeCCCCC
Q psy6838          81 ----------------------------------------------------------------DNGRLEYSIVSGDDEE   96 (442)
Q Consensus        81 ----------------------------------------------------------------~~~~i~y~i~~~~~~~   96 (442)
                                                                                      +|..|+|++.+|.+..
T Consensus       236 vlrVtA~d~~~P~~SAtttv~V~V~D~nDhsPvFEq~~Y~e~lREn~evGy~vLtvrAtD~Dsp~Nani~Yrl~eg~~~~  315 (2531)
T KOG4289|consen  236 VLRVTAQDHGDPRRSATTTVTVLVLDTNDHSPVFEQDEYREELRENLEVGYEVLTVRATDGDSPPNANIRYRLLEGNAKN  315 (2531)
T ss_pred             EEEEEeeecCCCcccceeEEEEEEeecCCCCcccchhHHHHHHhhccccCceEEEEEeccCCCCCCCceEEEecCCCccc
Confidence                                                                            7899999999999999


Q ss_pred             cEEEeCCccEEEEcccCCcccCcEEEEEEEEEeCCCCCCCCceeEEEEEEEEEeccCCCCeeccC-CeeEeeCCCCCCcE
Q psy6838          97 IFTINPSNGTLSNLKHLDRETKSSYNLVVMATDMAKPPQKRLSSTTQVSIVIRDVNDNAPEFITA-NETTIPENSPLNTV  175 (442)
Q Consensus        97 ~F~i~~~~g~l~~~~~lD~E~~~~~~l~v~a~D~~~~~~~~~s~~~~v~I~V~dvNd~~P~f~~~-~~~~v~E~~~~g~~  175 (442)
                      .|.|++.+|.|.+..+||||+...|.|.|.|+|.|.++..   .++.|.|+|.|+|||+|+|... |.+.|.|+..++++
T Consensus       316 ~f~in~rSGvI~T~a~lDRE~~~~y~L~VeAsDqG~~pgp---~Ta~V~itV~D~NDNaPqFse~~Yvvqv~Edvt~~av  392 (2531)
T KOG4289|consen  316 VFEINPRSGVISTRAPLDREELESYQLDVEASDQGRPPGP---RTAMVEITVEDENDNAPQFSEKRYVVQVREDVTPPAV  392 (2531)
T ss_pred             eeEEcCccceeeccCccCHHhhhheEEEEEeccCCCCCCC---ceEEEEEEEEecCCCCccccccceEEEecccCCCCce
Confidence            9999999999999999999999999999999999998754   3899999999999999999987 99999999999999


Q ss_pred             EEEEEEEeCCCCCCceEEEEEecCC-CCCeEecCcceEEEEcccCCcccccEEEEEEEEEeCCCCCCceeEEEEEEEEeC
Q psy6838         176 VAALKAIDRDEGQNSYVEYSIINSE-ATPFSLGPVDGLLRVAQKLDREIKSNYTLEIKAKDRGEPSKSSRTKFFIRLLDE  254 (442)
Q Consensus       176 v~~v~a~D~D~~~~~~~~y~i~~~~-~~~F~id~~tG~i~~~~~lD~e~~~~~~l~V~a~D~~~~~~~~~~~v~I~V~d~  254 (442)
                      |.+|+|+|.|.|.|+.+.|+|.+++ .+.|.||..||+|.+..+||+|.. .|++.|+|.|+|.|+++.+.-+.|.|.|+
T Consensus       393 vlrV~AtDrD~g~Ng~VHYsi~Sgn~~G~f~id~~tGel~vv~plD~e~~-~ytl~IrAqDggrPpLsn~sgl~iqVlDI  471 (2531)
T KOG4289|consen  393 VLRVTATDRDKGTNGKVHYSIASGNGRGQFYIDSLTGELDVVEPLDFENS-EYTLRIRAQDGGRPPLSNTSGLVIQVLDI  471 (2531)
T ss_pred             EEEEEecccCCCcCceEEEEeeccCccccEEEecccceEEEeccccccCC-eeEEEEEcccCCCCCccCCCceEEEEEec
Confidence            9999999999999999999999998 678999999999999999999998 99999999999999999999999999999


Q ss_pred             CCCCCeeeccceeeeccccccccceEEE----------------------------------------------------
Q psy6838         255 NDNNPIFDSKSYSASVPENISVGATVLQ----------------------------------------------------  282 (442)
Q Consensus       255 nd~~P~f~~~~~~~~v~e~~~~g~~v~~----------------------------------------------------  282 (442)
                      |||+|+|...++++++.|+.+.|..++.                                                    
T Consensus       472 NDhaPifvstpfq~tvlEnv~lg~~v~~vqaidadsg~na~l~y~laG~~pf~I~~~SG~Itvtk~ldrEt~~~ysl~V~  551 (2531)
T KOG4289|consen  472 NDHAPIFVSTPFQATVLENVPLGYLVCHVQAIDADSGENARLHYSLAGVGPFQINNGSGWITVTKELDRETVEHYSLGVE  551 (2531)
T ss_pred             CCCCceeEechhhhhhhhcccccceEEEEecccCCCCcccceeeeeccCCCeeEecCCceEEEeecccccccceEEEEEE
Confidence            9999999999999999999999988765                                                    


Q ss_pred             ------------------EeecCCCCCccCCcceeEEEecCCCCCcEEEEEEEEeCCCCCCeEEEEEEEeCCCC------
Q psy6838         283 ------------------VSDTNDNSPVFSEPVYAFDVLEDLPRGSRVGQVFTTDADEGVNGVVTYSVVSDWAN------  338 (442)
Q Consensus       283 ------------------v~~~n~~~p~~~~~~~~~~v~E~~~~g~~v~~v~a~D~D~~~~~~i~y~l~~~~~~------  338 (442)
                                        +.++||+.|.|.+..|.+++.|++|.|+.|.+++|.|.|.  +..++|.|.+++..      
T Consensus       552 ard~gtp~l~tstsI~Vtv~dvndndP~Ft~~eytl~inED~pvgsSI~tvtAvD~d~--~s~ityqi~g~ntrn~Fsi~  629 (2531)
T KOG4289|consen  552 ARDHGTPPLSTSTSISVTVLDVNDNDPTFTQKEYTLRINEDAPVGSSIVTVTAVDRDA--NSVITYQITGGNTRNRFSIS  629 (2531)
T ss_pred             EcCCCCCcccccceEEEEecccCCCCCccccCceEEEecCCccccceEEEEEEecccc--ccceEEEecCCcccccceee
Confidence                              5678999999999999999999999999999999999995  56688877653310      


Q ss_pred             --------------------------------------------------------------------------------
Q psy6838         339 --------------------------------------------------------------------------------  338 (442)
Q Consensus       339 --------------------------------------------------------------------------------  338 (442)
                                                                                                      
T Consensus       630 si~g~Glitlalp~dkKqe~~~vl~vtAtDg~l~d~~~V~v~I~danThrpvFqs~pfTvsI~e~rP~G~tvvtlsasd~  709 (2531)
T KOG4289|consen  630 SIGGGGLITLALPLDKKQERQYVLAVTATDGTLQDTCSVNVNITDANTHRPVFQSSPFTVSINEDRPLGTTVVTLSASDE  709 (2531)
T ss_pred             ccCCcceEEeecchhhcccceEEEEEEecCCccccceEEEEEeeecccCCcccccCCeeEeeccCCcCCceeEEEecccC
Confidence                                                                                            


Q ss_pred             --------------CCeEEeCCccEEEEcccCCCccCcEEEEEEEEEECCCCCceEEEEEEEEEeeCCCCCCccCCCCCe
Q psy6838         339 --------------DVFSINPQSGIFTLTAKLDYEDVHHYIFVVQAQDTGKPMMSSTVTVYFNVVDLNDNPPLFDPMPPN  404 (442)
Q Consensus       339 --------------~~f~id~~tG~l~~~~~ld~e~~~~~~l~V~a~D~~~~~~~~~~~v~I~V~dvNd~~P~f~~~~~~  404 (442)
                                    ..|+||+++|.+++...||||.+-.|++.|+|.|.+.|.+.++.+|.|.|.|+|||+|+|..+.|.
T Consensus       710 D~geNARI~y~led~~Frid~dsg~i~t~~~ld~edqvtytl~itA~D~~~pq~adtttveV~v~diNDnaPqf~assyt  789 (2531)
T KOG4289|consen  710 DTGENARITYILEDEAFRIDPDSGAIYTQAELDYEDQVTYTLAITARDNGIPQKADTTTVEVLVNDINDNAPQFLASSYT  789 (2531)
T ss_pred             CCCccceEEEEecccceeecCCCCceEEeeeeecccceeeEeeeeecCCCCCCcCccEEEEEEeecccccCcccchhhce
Confidence                          127999999999999999999999999999999999999999999999999999999999999999


Q ss_pred             eEEecCCCCCceEEEEEEEeCCCC---eeEEEEeeCCC
Q psy6838         405 IPVYENVTLGTGLASVHAVDLDSG---KLPFLLPIGDF  439 (442)
Q Consensus       405 ~~v~E~~~~g~~v~~v~a~D~D~~---~~~y~i~~~~~  439 (442)
                      ++|.|++++++.+++|+|+|+|.+   .+.|.+.+|+.
T Consensus       790 ~sV~Ed~Pv~TsvlQVSatDaD~g~Ng~v~y~~qg~~d  827 (2531)
T KOG4289|consen  790 GSVFEDAPVFTSVLQVSATDADSGPNGRVYYTFQGGDD  827 (2531)
T ss_pred             eEeecCCCCcceEEEEEEeccCCCCCceEEEEecCCCC
Confidence            999999999999999999999999   56666666543


No 3  
>KOG1219|consensus
Probab=100.00  E-value=3.4e-64  Score=521.81  Aligned_cols=437  Identities=32%  Similarity=0.507  Sum_probs=410.7

Q ss_pred             CCcCCCCCCccCCCeEEEEEecCCCCCcEEEEEEeeeCCCCCCcEEEEEEEeCCCCCcEEEcCCccEEEEeeeccccccc
Q psy6838           1 VSDTNDNSPVFSEPVYAFDVLEDLPRGSRVGQVFTTDADEGVNGVVTYSVVSDWANDVFSINPQSGIFTLTAKLDYEDVS   80 (442)
Q Consensus         1 V~DvNDn~P~f~~~~~~~~v~E~~~~g~~v~~~~a~D~D~~~~~~~~y~l~~~~~~~~f~id~~tg~~~~~~~ld~~~~~   80 (442)
                      |.|+|||+|.|....|.++++|+.|+|+.+....|+|+|.|.||.++|+|.. .....|+|++.+|.+++.++||||...
T Consensus       407 vld~n~n~pif~r~~~~ve~penvpig~~vl~~satDpdegengyvtysia~-~~~lPFaI~~~~GilsvS~kldrel~r  485 (4289)
T KOG1219|consen  407 VLDVNDNSPIFPRDVYRVEIPENVPIGTRVLISSATDPDEGENGYVTYSIAD-DTMLPFAIDQSDGILSVSGKLDRELRR  485 (4289)
T ss_pred             EeccCCCCCcceeeeeeeecCCCCCcceEEEEEeccCCCcCcCceEEEEecC-CccCceEeccccceEEeccccCccccc
Confidence            5799999999999999999999999999999999999999999999999986 345789999999999999999999532


Q ss_pred             --------------------------------------------------------------------------------
Q psy6838          81 --------------------------------------------------------------------------------   80 (442)
Q Consensus        81 --------------------------------------------------------------------------------   80 (442)
                                                                                                      
T Consensus       486 vYtfRv~Asd~G~per~~e~~~~I~ildlNDn~P~F~~~n~t~t~~~~~~vg~~l~tvsAtD~De~ellky~i~~~nel~  565 (4289)
T KOG1219|consen  486 VYTFRVRASDWGVPERESEVHLNILILDLNDNPPNFEIRNCTGTINGDPKVGTKLFTVSATDLDELELLKYRILPGNELS  565 (4289)
T ss_pred             eEEEEEEEeccCCcchhceeeEEEEEeccCCCCCcceeeecccccccCCCCCcEEEEeeccccCcccceeEEEEeCCcCc
Confidence                                                                                            


Q ss_pred             --------------------------------------------------------------------------------
Q psy6838          81 --------------------------------------------------------------------------------   80 (442)
Q Consensus        81 --------------------------------------------------------------------------------   80 (442)
                                                                                                      
T Consensus       566 feln~nSgeisLvr~n~t~~~~s~~slv~a~d~G~p~as~t~lni~~~k~~Tgv~~~~~p~Ilq~~e~~~fPqf~s~fP~  645 (4289)
T KOG1219|consen  566 FELNSNSGEISLVRQNNTECLQSCESLVIAADDGVPPASPTLLNITVMKYGTGVGNEHEPNILQRFENKHFPQFPSDFPF  645 (4289)
T ss_pred             eeeccCCCeEEEEEccccccccccceEEEehhcCCCcCCceeeEEEEEecccccccccChhHhhhhccccCccccccCCc
Confidence                                                                                            


Q ss_pred             -------------------------CCCcEEEEEEeCCCCCcEEEeCCccEEEEcccCCcccCcEEEEEEEEEeCCCCCC
Q psy6838          81 -------------------------DNGRLEYSIVSGDDEEIFTINPSNGTLSNLKHLDRETKSSYNLVVMATDMAKPPQ  135 (442)
Q Consensus        81 -------------------------~~~~i~y~i~~~~~~~~F~i~~~~g~l~~~~~lD~E~~~~~~l~v~a~D~~~~~~  135 (442)
                                               -|+.++|-|..|..+..|.|+..+|.|.+.++||+|...+|.|.|+|.|.|.|. 
T Consensus       646 iI~v~Edvpigt~la~L~atD~Dtgfng~l~yvI~dgne~~~~~Id~qsg~itvas~ld~~~t~~yiLnvta~D~gtPq-  724 (4289)
T KOG1219|consen  646 IIVVPEDVPIGTTLAILSATDSDTGFNGKLVYVIEDGNESICFLIDRQSGNITVASPLDNENTEQYILNVTAYDLGTPQ-  724 (4289)
T ss_pred             eEEccccCCCCceEEEEeccCCCCCcCceEEEEEeCCccceEEEEecccceEEEecchhhhhhheeEEEEEEecCCCch-
Confidence                                     578899999999988999999999999999999999999999999999999875 


Q ss_pred             CCceeEEEEEEEEEeccCCCCeeccC-CeeEeeCCCCCCcEEEEEEEEeCCCCCCceEEEEEecCCCCCeEecCcceEEE
Q psy6838         136 KRLSSTTQVSIVIRDVNDNAPEFITA-NETTIPENSPLNTVVAALKAIDRDEGQNSYVEYSIINSEATPFSLGPVDGLLR  214 (442)
Q Consensus       136 ~~~s~~~~v~I~V~dvNd~~P~f~~~-~~~~v~E~~~~g~~v~~v~a~D~D~~~~~~~~y~i~~~~~~~F~id~~tG~i~  214 (442)
                        .++.....|.|.|.|||+|+|... |.++|.|+..+|+.|+++.|.|.|.|+|+.++|+|.... ..|+||+.+|.+.
T Consensus       725 --kss~r~l~v~vkd~ndn~p~f~e~sy~vtvsedtepgs~Ia~vetnd~D~g~NG~v~fsL~n~s-dvfsIdp~tGivv  801 (4289)
T KOG1219|consen  725 --KSSWRLLLVFVKDYNDNTPIFVERSYHVTVSEDTEPGSFIAHVETNDTDGGNNGMVSFSLLNKS-DVFSIDPFTGIVV  801 (4289)
T ss_pred             --hhceeeEEEEEEecccCCccccccceEEEEecCCCCCceEEEEEecccCCCCCceEEEEecCCc-ceEEecCcccEEE
Confidence              467788889999999999999876 999999999999999999999999999999999999954 6799999999999


Q ss_pred             EcccCCcccccEEEEEEEEEeCCCCCCceeEEEEEEEEeCCCCCCeeeccceeeeccccccccceEEE------------
Q psy6838         215 VAQKLDREIKSNYTLEIKAKDRGEPSKSSRTKFFIRLLDENDNNPIFDSKSYSASVPENISVGATVLQ------------  282 (442)
Q Consensus       215 ~~~~lD~e~~~~~~l~V~a~D~~~~~~~~~~~v~I~V~d~nd~~P~f~~~~~~~~v~e~~~~g~~v~~------------  282 (442)
                      +.++||||.+..|+|.|+|+|.+.|.+.+...+.|.|.|+|||+|.|....++..++|+.+.|+.+..            
T Consensus       802 ~~~sLdrE~q~~y~l~I~a~dqp~pq~~svv~l~vsvedVndnpPkci~~hsr~kipedlp~gt~~~~l~A~d~diGq~~  881 (4289)
T KOG1219|consen  802 TSKSLDREGQTSYHLKIEARDQPPPQLFSVVELDVSVEDVNDNPPKCIIRHSRSKIPEDLPYGTVTWQLVALDPDIGQLG  881 (4289)
T ss_pred             eccccCcccCceeEEEEEEcCCCCCceEEEEEEEEEEeeccCCCCccccccccccCcccCCCceEEEEhhhcCcccCcCc
Confidence            99999999999999999999999888999999999999999999999999999999999999887644            


Q ss_pred             -----------------------------------------------------------Eeec--CCCCCccCCcceeEE
Q psy6838         283 -----------------------------------------------------------VSDT--NDNSPVFSEPVYAFD  301 (442)
Q Consensus       283 -----------------------------------------------------------v~~~--n~~~p~~~~~~~~~~  301 (442)
                                                                                 +.++  |-++|+|..-.-..+
T Consensus       882 kvry~l~~~~v~~rvd~~sGavfi~~~LDf~k~~fynLsv~a~d~g~p~lss~chl~Vevldv~enlhpp~F~~~v~e~~  961 (4289)
T KOG1219|consen  882 KVRYYLTDDTVGERVDFPSGAVFIGKPLDFEKSDFYNLSVTAVDRGTPILSSICHLEVEVLDVNENLHPPEFISFVTEGH  961 (4289)
T ss_pred             eeEEEEecCccccccccccccEEEecccccccccceEEEEEEecCCCcceeeeEEEEEEEeccCCCCCCcchheeeeeee
Confidence                                                                       2223  457999999999999


Q ss_pred             EecCCCCCcEEEEEEEEeCCCCCCeEEEEEEEeCCCCCCeEEeCCccEEEEcccCCCccCcEEEEEEEEEECCCCCceEE
Q psy6838         302 VLEDLPRGSRVGQVFTTDADEGVNGVVTYSVVSDWANDVFSINPQSGIFTLTAKLDYEDVHHYIFVVQAQDTGKPMMSST  381 (442)
Q Consensus       302 v~E~~~~g~~v~~v~a~D~D~~~~~~i~y~l~~~~~~~~f~id~~tG~l~~~~~ld~e~~~~~~l~V~a~D~~~~~~~~~  381 (442)
                      |.||+|.|+.+++|.|.|.|.|..+.+.|+|..|+.-+.|+|+.++|.|++.+.||+|....|.|+|.|.|.|.+++++.
T Consensus       962 V~EnapiGT~vi~i~A~dedsgldg~l~Y~I~~gdg~g~FsId~~tG~irTl~~lDrE~ks~YwltveA~D~gt~~~ssv 1041 (4289)
T KOG1219|consen  962 VLENAPIGTIVIRIQARDEDSGLDGELSYKIRTGDGDGIFSIDSTTGSIRTLKALDREKKSSYWLTVEAKDLGTVPLSSV 1041 (4289)
T ss_pred             EeecCCcceEEEEEEEecCCCCccceEEEEEEcCCcceeEEecCCcceEeechhhchhhcceEEEEEEEEecCCCccccc
Confidence            99999999999999999999999999999999998889999999999999999999999999999999999999999999


Q ss_pred             EEEEEEEeeCCCCCCccCCCCCeeEEecCCCCCceEEEEEEEeCCCC---eeEEEEeeCCC-CCC
Q psy6838         382 VTVYFNVVDLNDNPPLFDPMPPNIPVYENVTLGTGLASVHAVDLDSG---KLPFLLPIGDF-DLL  442 (442)
Q Consensus       382 ~~v~I~V~dvNd~~P~f~~~~~~~~v~E~~~~g~~v~~v~a~D~D~~---~~~y~i~~~~~-~~f  442 (442)
                      +.+.|.|.|+|||+|+|.+.-|.++|.||++.+..|.++.|.|+|..   .|.|.|.+||. |+|
T Consensus      1042 ~~vyI~ieDvNDn~Pq~s~pvy~asI~enSp~~vsivq~ea~D~Dsssn~kLmykI~sGnyq~FF 1106 (4289)
T KOG1219|consen 1042 CEVYIEIEDVNDNVPQFSSPVYYASISENSPETVSIVQAEANDPDSSSNQKLMYKITSGNYQGFF 1106 (4289)
T ss_pred             eeEEEEEEecCCCCcccCCceEeeeeccCCCCceEEEEeccCCCCcccCcceEEEEccCCccceE
Confidence            99999999999999999999999999999999999999999999955   99999999998 555


No 4  
>KOG1219|consensus
Probab=100.00  E-value=5.5e-62  Score=505.50  Aligned_cols=429  Identities=35%  Similarity=0.535  Sum_probs=398.1

Q ss_pred             CCcCCCCCCccCCCeEEEEEecCCCCCcEEEEEEeeeCCCCCCcEEEEEEEeCCC-CCcEEEcCCccEEEEeeecccccc
Q psy6838           1 VSDTNDNSPVFSEPVYAFDVLEDLPRGSRVGQVFTTDADEGVNGVVTYSVVSDWA-NDVFSINPQSGIFTLTAKLDYEDV   79 (442)
Q Consensus         1 V~DvNDn~P~f~~~~~~~~v~E~~~~g~~v~~~~a~D~D~~~~~~~~y~l~~~~~-~~~f~id~~tg~~~~~~~ld~~~~   79 (442)
                      |.|||||+|+|.+..|.++++|++++|+.|.++.|+|.|.  |..+.|+|.+++. ...|+|+..||.+++.++||||..
T Consensus      2168 V~dIndn~PvFeqlsYt~sisE~s~igt~viqilATdsDs--n~~isYsl~g~s~~sk~f~In~sTG~it~~g~ldyE~~ 2245 (4289)
T KOG1219|consen 2168 VGDINDNPPVFEQLSYTISISENSKIGTKVIQILATDSDS--NREISYSLEGNSEISKPFRINVSTGWITVAGKLDYEEN 2245 (4289)
T ss_pred             ecccCCCCchhheeeEEEEccCCCccCceEEEEEeccCCC--CCceEEEeecCCccccceEEecccceEEEeeecChhhc
Confidence            6799999999999999999999999999999999999997  8999999998654 578999999999999999999987


Q ss_pred             c-------------------------------------------------------------------------------
Q psy6838          80 S-------------------------------------------------------------------------------   80 (442)
Q Consensus        80 ~-------------------------------------------------------------------------------   80 (442)
                      .                                                                               
T Consensus      2246 q~f~~fvratdggk~lSseviv~V~VeD~Ndn~Pef~q~~~ea~vsd~a~~g~fit~v~a~D~Dssd~lk~ey~~~~~l~ 2325 (4289)
T KOG1219|consen 2246 QEFRFFVRATDGGKPLSSEVIVEVHVEDFNDNPPEFNQRNYEAFVSDPARSGHFITVVNAHDLDSSDHLKLEYNSNHFLI 2325 (4289)
T ss_pred             ceEEEEEEEccCCCcccccEEEEEEehhcCCCCchhccccceeecCCCccceeEEEEEEeccCCccchhhhhhcccceee
Confidence            6                                                                               


Q ss_pred             ------------------------------------------------------------------------CCCcEEEE
Q psy6838          81 ------------------------------------------------------------------------DNGRLEYS   88 (442)
Q Consensus        81 ------------------------------------------------------------------------~~~~i~y~   88 (442)
                                                                                              .++.+.|.
T Consensus      2326 ~s~~G~iTlfNl~k~~l~~s~~lrv~vsD~v~~at~~vl~~~~~~n~~~~lveka~l~Tv~~~~~~~~~~f~~~gt~~~~ 2405 (4289)
T KOG1219|consen 2326 LSENGIITLFNLLKSPLQTSYPLRVTVSDGVFRATMEVLFHPHSRNHFSELVEKADLVTVVEHDEQEDADFGAYGTSIYY 2405 (4289)
T ss_pred             eccCceEEehhhcccccccccceeeeeccCcceeeeEEEEEecCcccchhhhhccceeEEEEecCccccccccCCceeee
Confidence                                                                                    55667777


Q ss_pred             EEeCCCCCcEEEeCCccEEEEcccCCcccCcEEE--EEEEEEeCCCCCCCCceeEEEEEEEEEeccCCCCeeccC-CeeE
Q psy6838          89 IVSGDDEEIFTINPSNGTLSNLKHLDRETKSSYN--LVVMATDMAKPPQKRLSSTTQVSIVIRDVNDNAPEFITA-NETT  165 (442)
Q Consensus        89 i~~~~~~~~F~i~~~~g~l~~~~~lD~E~~~~~~--l~v~a~D~~~~~~~~~s~~~~v~I~V~dvNd~~P~f~~~-~~~~  165 (442)
                      +....++..|.|++. |.|.+.+.||||....+.  +.+.|.|++|     +.+-++++|.+.|+|||+|+|... |.+.
T Consensus      2406 si~s~~sd~~~in~~-GqI~t~~kld~e~s~~~vi~i~v~a~Da~g-----r~af~tvti~ltDiNDnpPqF~a~~Y~~n 2479 (4289)
T KOG1219|consen 2406 SINSRASDHFEINKS-GQIKTLSKLDREYSEELVIIIAVMAFDAGG-----RVAFCTVTIILTDINDNPPQFDAQLYRVN 2479 (4289)
T ss_pred             eechhccCceeECCC-ccEEeeehhhhccCceEEEEEEEEEecCCC-----eEEEEEEEEEEEecCCCCccccceeEEEE
Confidence            777778899999977 999999999999866554  5566679887     588999999999999999999987 9999


Q ss_pred             eeCCCCCCcEEEEEEEEeCCCCCCceEEEEEecCC--CCCeEecCcceEEEEcccCCcccccEEEEEEEEEeCCCCCCce
Q psy6838         166 IPENSPLNTVVAALKAIDRDEGQNSYVEYSIINSE--ATPFSLGPVDGLLRVAQKLDREIKSNYTLEIKAKDRGEPSKSS  243 (442)
Q Consensus       166 v~E~~~~g~~v~~v~a~D~D~~~~~~~~y~i~~~~--~~~F~id~~tG~i~~~~~lD~e~~~~~~l~V~a~D~~~~~~~~  243 (442)
                      |.|++..|..|+.+.|+|.|.+.|+.++|.+.++.  ...|.|++ +|.|++++.|++++...|+|.|+|.|.|.|.+.+
T Consensus      2480 I~enaskg~~V~~v~A~D~De~snadvty~i~~e~~~~~v~~in~-sG~Itv~~sL~~~en~tl~l~vkA~D~g~P~~~s 2558 (4289)
T KOG1219|consen 2480 ITENASKGKLVGHVIARDADEGSNADVTYEIVGESDVKHVFEINE-SGVITVKRSLDGLENSTLHLFVKAIDDGKPRRRS 2558 (4289)
T ss_pred             eecccCCCceEEEEEEecCCCCCcccEEEEecCchhhhheeeecC-CceEEeehhhhcccCcEEEEEEEeccCCCCCccc
Confidence            99999999999999999999999999999999885  45688988 9999999999999999999999999999999999


Q ss_pred             eEEEEEEEEeCCCCCCeeeccceeeeccccccccceEEE-----------------------------------------
Q psy6838         244 RTKFFIRLLDENDNNPIFDSKSYSASVPENISVGATVLQ-----------------------------------------  282 (442)
Q Consensus       244 ~~~v~I~V~d~nd~~P~f~~~~~~~~v~e~~~~g~~v~~-----------------------------------------  282 (442)
                      .++|.|+|.+...+.|.|..+.|.++++|+.+.|..|.+                                         
T Consensus      2559 ~ttV~v~vl~e~v~lPrFSep~y~fsvpEDv~vG~~Ig~v~a~~a~~~~i~~~v~~gt~Esn~d~~Fsvdr~TG~i~v~k 2638 (4289)
T KOG1219|consen 2559 NTTVIVTVLPEDVNLPRFSEPIYTFSVPEDVPVGEEIGQVSASDADEHVIYSLVLGGTPESNPDLPFSVDRNTGMIKVNK 2638 (4289)
T ss_pred             ceEEEEEecCcccCcccccCceEEEeccccCCCCCeeeEEeecccCCceEEEEEeCCCCCCCCCCceEEcCCCceEEecc
Confidence            999999999999999999999999999999999998866                                         


Q ss_pred             -------------------------------EeecCCCCCccCCcceeEEEecCCCCCcEEEEEEEEeCCCCCCeEEEEE
Q psy6838         283 -------------------------------VSDTNDNSPVFSEPVYAFDVLEDLPRGSRVGQVFTTDADEGVNGVVTYS  331 (442)
Q Consensus       283 -------------------------------v~~~n~~~p~~~~~~~~~~v~E~~~~g~~v~~v~a~D~D~~~~~~i~y~  331 (442)
                                                     |.++||++|+|....|.+.+.||+|.|+.|++++|.|.|.+.+++++|+
T Consensus      2639 sLD~E~kk~yqi~v~a~~~~~vva~tsv~vqVkDvNDNaPvFe~d~y~f~i~En~pvGtsV~qf~AsD~Ds~~nGqirys 2718 (4289)
T KOG1219|consen 2639 SLDHEKKKSYQIKVKATCGQWVVAETSVFVQVKDVNDNAPVFEKDPYLFIIEENSPVGTSVIQFHASDMDSGNNGQIRYS 2718 (4289)
T ss_pred             ccchhhhceEEEEEEeecCCceEEEEEEEEEeecccCCCccccCCceeEEEeccCCCCceEEEEEeeccCCCCCceEEEE
Confidence                                           6688999999999999999999999999999999999999999999999


Q ss_pred             EEeCCCCCCeEEeCCccEEEEcccCCCccCcEEEEEEEEEECCCCCceEEEEEEEEEeeCCCCCCccCCCCCeeEEecCC
Q psy6838         332 VVSDWANDVFSINPQSGIFTLTAKLDYEDVHHYIFVVQAQDTGKPMMSSTVTVYFNVVDLNDNPPLFDPMPPNIPVYENV  411 (442)
Q Consensus       332 l~~~~~~~~f~id~~tG~l~~~~~ld~e~~~~~~l~V~a~D~~~~~~~~~~~v~I~V~dvNd~~P~f~~~~~~~~v~E~~  411 (442)
                      |...  ..+|.|+++||.|++...||.|++..|.|.|+|+|.|.+  ++.++|.|+|.|+||+||.|....|..++.|+.
T Consensus      2719 l~~~--v~yF~In~etGwlTt~~eld~ek~d~y~lkv~AtDhG~~--ssq~~v~v~vtDvndspprf~~eiy~gtvv~d~ 2794 (4289)
T KOG1219|consen 2719 LTSP--VPYFAINPETGWLTTLFELDLEKQDLYSLKVVATDHGVP--SSQATVLVHVTDVNDSPPRFQREIYEGTVVEDV 2794 (4289)
T ss_pred             EcCC--cceEEEcCCCCeeeehhhhccccCCceEEEEEEecCCcc--cccceEEEEEEecCCCcchhhhHhhccceeccC
Confidence            9854  339999999999999999999999999999999999995  788999999999999999999999999999999


Q ss_pred             CCCceEEEEEEEeCCCC---eeEEEEeeCCC-CCC
Q psy6838         412 TLGTGLASVHAVDLDSG---KLPFLLPIGDF-DLL  442 (442)
Q Consensus       412 ~~g~~v~~v~a~D~D~~---~~~y~i~~~~~-~~f  442 (442)
                      +.|..++.++..|.|..   ..+|-|..|++ |-|
T Consensus      2795 p~~~~ia~~si~d~D~s~~nq~t~fI~~gd~~gqF 2829 (4289)
T KOG1219|consen 2795 PGGKIIAGLSIFDADVSEVNQVTGFITLGDPLGQF 2829 (4289)
T ss_pred             CCCceeeeeEeccccccccceeEEEEeCCCccceE
Confidence            99999999999999999   58899999876 543


No 5  
>cd00031 CA Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here
Probab=100.00  E-value=9.1e-31  Score=233.73  Aligned_cols=197  Identities=43%  Similarity=0.665  Sum_probs=178.5

Q ss_pred             CeeEeeCCCCCCcEEEEEEEEeCCCCCCceEEEEEecCCC-CCeEecCcceEEEEcccCCcccccEEEEEEEEEeCCCCC
Q psy6838         162 NETTIPENSPLNTVVAALKAIDRDEGQNSYVEYSIINSEA-TPFSLGPVDGLLRVAQKLDREIKSNYTLEIKAKDRGEPS  240 (442)
Q Consensus       162 ~~~~v~E~~~~g~~v~~v~a~D~D~~~~~~~~y~i~~~~~-~~F~id~~tG~i~~~~~lD~e~~~~~~l~V~a~D~~~~~  240 (442)
                      +.+.+.|+++.|+.++++.|.|+|.+.++.++|+|.++.. .+|.|++.+|.|++.+.||||....|.|.|+|+|.|.+.
T Consensus         2 ~~~~i~En~~~g~~v~~~~a~D~D~~~~~~~~y~i~~~~~~~~F~i~~~tG~l~~~~~lD~e~~~~~~l~v~a~D~g~~~   81 (199)
T cd00031           2 YSVSVPENAPPGTVVGTVSATDPDSGENGRVTYSILGGNEDGLFSIDPNTGVITTTKPLDREEQSEYTLTVVASDGGGPP   81 (199)
T ss_pred             eEEEEeCCCCCCCEEEEEEEECCCCCCCceEEEEEeCCCCcccEEEeCCCCEEEECCCCCCcCCceEEEEEEEEECCcCc
Confidence            5688999999999999999999999888889999999874 799999999999999999999999999999999988877


Q ss_pred             CceeEEEEEEEEeCCCCCCeeeccceeeeccccccccceEEEEeecCCCCCccCCcceeEEEecCCCCCcEEEEEEEEeC
Q psy6838         241 KSSRTKFFIRLLDENDNNPIFDSKSYSASVPENISVGATVLQVSDTNDNSPVFSEPVYAFDVLEDLPRGSRVGQVFTTDA  320 (442)
Q Consensus       241 ~~~~~~v~I~V~d~nd~~P~f~~~~~~~~v~e~~~~g~~v~~v~~~n~~~p~~~~~~~~~~v~E~~~~g~~v~~v~a~D~  320 (442)
                      +++...++|.|.|+||++|.|....|                                .+.|.|+.+.|+.++++.|.|+
T Consensus        82 ~~~~~~v~I~V~d~Nd~~P~~~~~~~--------------------------------~~~v~e~~~~~~~i~~~~a~D~  129 (199)
T cd00031          82 LSSTATVTVTVLDVNDNPPVFEQSSY--------------------------------EASVPENAPPGTVVGTVTATDA  129 (199)
T ss_pred             ceeEEEEEEEEccCCCCCCcccccce--------------------------------EEEEeCCCCCCCEEEEEEEEcC
Confidence            77999999999999999999876544                                4556677777888888888999


Q ss_pred             CCCCCeEEEEEEEeCCCCCCeEEeCCccEEEEcccCCCccCcEEEEEEEEEECCCCCceEEEEEEEEEee
Q psy6838         321 DEGVNGVVTYSVVSDWANDVFSINPQSGIFTLTAKLDYEDVHHYIFVVQAQDTGKPMMSSTVTVYFNVVD  390 (442)
Q Consensus       321 D~~~~~~i~y~l~~~~~~~~f~id~~tG~l~~~~~ld~e~~~~~~l~V~a~D~~~~~~~~~~~v~I~V~d  390 (442)
                      |.+.++.++|+|.++....+|.|++.+|.|++.+.||||....|.|.|.|+|.+.+.+++++.++|.|.|
T Consensus       130 D~~~~~~~~y~l~~~~~~~~f~i~~~~G~i~~~~~ld~e~~~~~~l~v~a~D~~~~~~~~~~~i~i~v~d  199 (199)
T cd00031         130 DSGENAKLTYSILSGNDKELFSIDPNTGIITLAKPLDREEKSSYELTVVATDGGGPPLSSTATVTVTVLD  199 (199)
T ss_pred             CCCCCccEEEEEeCCCCCCEEEEeCCceEEEeCCccCCccCceEEEEEEEEECCCCCceeEEEEEEEEEC
Confidence            9988899999999765558999999999999999999999999999999999987778999999999875


No 6  
>cd00031 CA Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here
Probab=100.00  E-value=1.5e-30  Score=232.33  Aligned_cols=196  Identities=43%  Similarity=0.726  Sum_probs=177.4

Q ss_pred             EEEEEecCCCCCcEEEEEEeeeCCCCCCcEEEEEEEeCCCCCcEEEcCCccEEEEeeecccccccCCCcEEEEEEeCCCC
Q psy6838          16 YAFDVLEDLPRGSRVGQVFTTDADEGVNGVVTYSVVSDWANDVFSINPQSGIFTLTAKLDYEDVSDNGRLEYSIVSGDDE   95 (442)
Q Consensus        16 ~~~~v~E~~~~g~~v~~~~a~D~D~~~~~~~~y~l~~~~~~~~f~id~~tg~~~~~~~ld~~~~~~~~~i~y~i~~~~~~   95 (442)
                      |.+.|+|++++|+.|+++.|.|+|.+.++.++|+|.++                                       ...
T Consensus         2 ~~~~i~En~~~g~~v~~~~a~D~D~~~~~~~~y~i~~~---------------------------------------~~~   42 (199)
T cd00031           2 YSVSVPENAPPGTVVGTVSATDPDSGENGRVTYSILGG---------------------------------------NED   42 (199)
T ss_pred             eEEEEeCCCCCCCEEEEEEEECCCCCCCceEEEEEeCC---------------------------------------CCc
Confidence            68899999999999999999999987777788877663                                       223


Q ss_pred             CcEEEeCCccEEEEcccCCcccCcEEEEEEEEEeCCCCCCCCceeEEEEEEEEEeccCCCCeec-cCCeeEeeCCCCCCc
Q psy6838          96 EIFTINPSNGTLSNLKHLDRETKSSYNLVVMATDMAKPPQKRLSSTTQVSIVIRDVNDNAPEFI-TANETTIPENSPLNT  174 (442)
Q Consensus        96 ~~F~i~~~~g~l~~~~~lD~E~~~~~~l~v~a~D~~~~~~~~~s~~~~v~I~V~dvNd~~P~f~-~~~~~~v~E~~~~g~  174 (442)
                      .+|.|++.+|.|++.+.||||....|.|.|+|+|.+.+.   ++++..+.|.|.|+||++|.|. ..+.+.+.|+.++|+
T Consensus        43 ~~F~i~~~tG~l~~~~~lD~e~~~~~~l~v~a~D~g~~~---~~~~~~v~I~V~d~Nd~~P~~~~~~~~~~v~e~~~~~~  119 (199)
T cd00031          43 GLFSIDPNTGVITTTKPLDREEQSEYTLTVVASDGGGPP---LSSTATVTVTVLDVNDNPPVFEQSSYEASVPENAPPGT  119 (199)
T ss_pred             ccEEEeCCCCEEEECCCCCCcCCceEEEEEEEEECCcCc---ceeEEEEEEEEccCCCCCCcccccceEEEEeCCCCCCC
Confidence            578888788999999999999999999999999987653   4689999999999999999999 559999999999999


Q ss_pred             EEEEEEEEeCCCCCCceEEEEEecCCC-CCeEecCcceEEEEcccCCcccccEEEEEEEEEeCCCCCCceeEEEEEEEEe
Q psy6838         175 VVAALKAIDRDEGQNSYVEYSIINSEA-TPFSLGPVDGLLRVAQKLDREIKSNYTLEIKAKDRGEPSKSSRTKFFIRLLD  253 (442)
Q Consensus       175 ~v~~v~a~D~D~~~~~~~~y~i~~~~~-~~F~id~~tG~i~~~~~lD~e~~~~~~l~V~a~D~~~~~~~~~~~v~I~V~d  253 (442)
                      .++++.|+|+|.+.++.++|+|.++.. .+|.|++.+|.|++.+.||||....|.|.|.|+|.+.+.++....++|.|.|
T Consensus       120 ~i~~~~a~D~D~~~~~~~~y~l~~~~~~~~f~i~~~~G~i~~~~~ld~e~~~~~~l~v~a~D~~~~~~~~~~~i~i~v~d  199 (199)
T cd00031         120 VVGTVTATDADSGENAKLTYSILSGNDKELFSIDPNTGIITLAKPLDREEKSSYELTVVATDGGGPPLSSTATVTVTVLD  199 (199)
T ss_pred             EEEEEEEEcCCCCCCccEEEEEeCCCCCCEEEEeCCceEEEeCCccCCccCceEEEEEEEEECCCCCceeEEEEEEEEEC
Confidence            999999999999888889999999874 7999999999999999999999999999999999988888899999998875


No 7  
>KOG1834|consensus
Probab=99.78  E-value=1.5e-17  Score=160.29  Aligned_cols=210  Identities=26%  Similarity=0.375  Sum_probs=168.9

Q ss_pred             EEEEEeccCCCCeeccCCeeEeeCCCCCCcEEEEEEEEeCCCCC---CceEEEEEecCCCCCeE---ecCcce--EEEEc
Q psy6838         145 SIVIRDVNDNAPEFITANETTIPENSPLNTVVAALKAIDRDEGQ---NSYVEYSIINSEATPFS---LGPVDG--LLRVA  216 (442)
Q Consensus       145 ~I~V~dvNd~~P~f~~~~~~~v~E~~~~g~~v~~v~a~D~D~~~---~~~~~y~i~~~~~~~F~---id~~tG--~i~~~  216 (442)
                      .....-+|-|.|+....|...|.||...-.....+.|.|.|.+-   ...+-|.|.+.+ -+|.   +|..||  .|+.+
T Consensus        20 ~~~aarankhkpwie~ey~gvV~Endntvll~Ppl~aLdkdaplr~ageiC~fklhgq~-vPFdavVvdK~TGegvlRaK   98 (952)
T KOG1834|consen   20 HHHAARANKHKPWIEEEYHGVVTENDNTVLLDPPLAALDKDAPLRYAGEICGFKLHGQP-VPFDAVVVDKYTGEGVLRAK   98 (952)
T ss_pred             ccccccccccCcccccceeEEEEeCCceEEeCCCeeeecCCCCcccccccceeEecCCC-CCceEEEEeccCCceEEeec
Confidence            34566788899999999999999998655556678899999841   233578888765 4554   466665  68888


Q ss_pred             ccCCcccccEEEEEEEEEeCCCCC------CceeEEEEEEEEeCCCCCCeeeccceeeeccccccccceEEEEeecCCCC
Q psy6838         217 QKLDREIKSNYTLEIKAKDRGEPS------KSSRTKFFIRLLDENDNNPIFDSKSYSASVPENISVGATVLQVSDTNDNS  290 (442)
Q Consensus       217 ~~lD~e~~~~~~l~V~a~D~~~~~------~~~~~~v~I~V~d~nd~~P~f~~~~~~~~v~e~~~~g~~v~~v~~~n~~~  290 (442)
                      .+||.|.++.|+|+|+|.|.|..+      .+-.++|.|+|.|+|+.+|.|..+.|.+.|.|...               
T Consensus        99 ~~lDCelqkeytf~iQAydCg~gpdgtn~kKShkatvhIrVkDvNe~AP~f~ep~Yka~V~EGK~---------------  163 (952)
T KOG1834|consen   99 EPLDCELQKEYTFTIQAYDCGNGPDGTNTKKSHKATVHIRVKDVNEFAPVFKEPWYKAHVTEGKV---------------  163 (952)
T ss_pred             CcccccccccceEEEEEEecCCCCCccccccccceEEEEEeccccccCchhcccceeeEEeccee---------------
Confidence            999999999999999999987543      56678999999999999999999988888776522               


Q ss_pred             CccCCcceeEEEecCCCCCcEEEEEEEEeCCCC--CCeEEEEEEEeCCCCCCeEEeCCccEEEEcccCCCccCcEEEEEE
Q psy6838         291 PVFSEPVYAFDVLEDLPRGSRVGQVFTTDADEG--VNGVVTYSVVSDWANDVFSINPQSGIFTLTAKLDYEDVHHYIFVV  368 (442)
Q Consensus       291 p~~~~~~~~~~v~E~~~~g~~v~~v~a~D~D~~--~~~~i~y~l~~~~~~~~f~id~~tG~l~~~~~ld~e~~~~~~l~V  368 (442)
                                        ...|+++.|.|.|.+  ....+.|.|..  ..-+|.||.. |.|+.+.+|+|...+.|.|+|
T Consensus       164 ------------------yd~il~veAiD~DCspq~sqIC~YEI~t--~d~PFaIdn~-G~irnTekLny~ke~~Y~ltV  222 (952)
T KOG1834|consen  164 ------------------YDSILRVEAIDKDCSPQYSQICEYEITT--PDVPFAIDND-GNIRNTEKLNYTKEHQYKLTV  222 (952)
T ss_pred             ------------------eeeeEEEEeecCCCCCcccceeEEEecC--CCCceEEcCC-CccccccccccccceeEEEEE
Confidence                              235677888999974  34567899984  6778999975 999999999999999999999


Q ss_pred             EEEECCCCCceEEEEEEEEEeeC
Q psy6838         369 QAQDTGKPMMSSTVTVYFNVVDL  391 (442)
Q Consensus       369 ~a~D~~~~~~~~~~~v~I~V~dv  391 (442)
                      +|.|+|......-+.|+|+|...
T Consensus       223 tAyDCg~kraa~d~lV~v~Vkp~  245 (952)
T KOG1834|consen  223 TAYDCGKKRAASDSLVTVHVKPT  245 (952)
T ss_pred             EEEecccccccCcceEEEEecCc
Confidence            99999985555557888888765


No 8  
>PF00028 Cadherin:  Cadherin domain;  InterPro: IPR002126 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion.; GO: 0005509 calcium ion binding, 0007156 homophilic cell adhesion, 0016020 membrane; PDB: 2A4E_A 2A4C_B 2O72_A 2QVI_A 1NCJ_A 3Q2W_A 3Q2N_A 3LNH_B 3LNI_A 3Q2L_A ....
Probab=99.77  E-value=1.4e-17  Score=129.11  Aligned_cols=92  Identities=43%  Similarity=0.694  Sum_probs=88.1

Q ss_pred             eeEEEecCCCCCcEEEEEEEEeCCCCCCeEEEEEEEeCCCCCCeEEeCCccEEEEcccCCCccCcEEEEEEEEEEC-CCC
Q psy6838         298 YAFDVLEDLPRGSRVGQVFTTDADEGVNGVVTYSVVSDWANDVFSINPQSGIFTLTAKLDYEDVHHYIFVVQAQDT-GKP  376 (442)
Q Consensus       298 ~~~~v~E~~~~g~~v~~v~a~D~D~~~~~~i~y~l~~~~~~~~f~id~~tG~l~~~~~ld~e~~~~~~l~V~a~D~-~~~  376 (442)
                      |.+.|.|++++|+.++++.|.|+|.+.++.+.|+|..++..++|.|++.+|.|++.+.||||....|.|.|.|+|. +.|
T Consensus         1 Y~~~v~E~~~~g~~v~~v~a~D~D~~~n~~i~y~i~~~~~~~~F~I~~~tg~i~~~~~LD~E~~~~y~l~v~a~D~~~~~   80 (93)
T PF00028_consen    1 YSFSVPENAPPGTVVGQVTATDPDSGPNSQITYSILGGNPDGLFSIDPNTGEISLKKPLDRETQSSYQLTVRATDSGGSP   80 (93)
T ss_dssp             EEEEEETTGSTSSEEEEEEEEESSTSTTSSEEEEEEETTSTTSEEEETTTTEEEESSSSCTTTTSEEEEEEEEEETTTSS
T ss_pred             CEEEEECCCCCCCEEEEEEEEeCCCCCCceEEEEEecCcccCceEEeeeeeccccceecCcccCCEEEEEEEEEECCCCC
Confidence            6789999999999999999999999999999999998888899999999999999999999999999999999999 889


Q ss_pred             CceEEEEEEEEEe
Q psy6838         377 MMSSTVTVYFNVV  389 (442)
Q Consensus       377 ~~~~~~~v~I~V~  389 (442)
                      +++++++|+|+|+
T Consensus        81 ~~~~~~~V~I~V~   93 (93)
T PF00028_consen   81 PLSSTATVTINVL   93 (93)
T ss_dssp             EEEEEEEEEEEEE
T ss_pred             CCEEEEEEEEEEC
Confidence            9999999999985


No 9  
>PF00028 Cadherin:  Cadherin domain;  InterPro: IPR002126 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion.; GO: 0005509 calcium ion binding, 0007156 homophilic cell adhesion, 0016020 membrane; PDB: 2A4E_A 2A4C_B 2O72_A 2QVI_A 1NCJ_A 3Q2W_A 3Q2N_A 3LNH_B 3LNI_A 3Q2L_A ....
Probab=99.72  E-value=1.2e-16  Score=123.82  Aligned_cols=91  Identities=38%  Similarity=0.626  Sum_probs=86.8

Q ss_pred             CeeEeeCCCCCCcEEEEEEEEeCCCCCCceEEEEEecCC-CCCeEecCcceEEEEcccCCcccccEEEEEEEEEeC-CCC
Q psy6838         162 NETTIPENSPLNTVVAALKAIDRDEGQNSYVEYSIINSE-ATPFSLGPVDGLLRVAQKLDREIKSNYTLEIKAKDR-GEP  239 (442)
Q Consensus       162 ~~~~v~E~~~~g~~v~~v~a~D~D~~~~~~~~y~i~~~~-~~~F~id~~tG~i~~~~~lD~e~~~~~~l~V~a~D~-~~~  239 (442)
                      |.+.|+|++++|+.++++.|.|+|.+.++.+.|+|..+. ..+|.|++.+|.|++.++||||....|.|.|.|+|. +.|
T Consensus         1 Y~~~v~E~~~~g~~v~~v~a~D~D~~~n~~i~y~i~~~~~~~~F~I~~~tg~i~~~~~LD~E~~~~y~l~v~a~D~~~~~   80 (93)
T PF00028_consen    1 YSFSVPENAPPGTVVGQVTATDPDSGPNSQITYSILGGNPDGLFSIDPNTGEISLKKPLDRETQSSYQLTVRATDSGGSP   80 (93)
T ss_dssp             EEEEEETTGSTSSEEEEEEEEESSTSTTSSEEEEEEETTSTTSEEEETTTTEEEESSSSCTTTTSEEEEEEEEEETTTSS
T ss_pred             CEEEEECCCCCCCEEEEEEEEeCCCCCCceEEEEEecCcccCceEEeeeeeccccceecCcccCCEEEEEEEEEECCCCC
Confidence            578999999999999999999999999999999999998 789999999999999999999999999999999999 888


Q ss_pred             CCceeEEEEEEEE
Q psy6838         240 SKSSRTKFFIRLL  252 (442)
Q Consensus       240 ~~~~~~~v~I~V~  252 (442)
                      ++++++.|+|.|.
T Consensus        81 ~~~~~~~V~I~V~   93 (93)
T PF00028_consen   81 PLSSTATVTINVL   93 (93)
T ss_dssp             EEEEEEEEEEEEE
T ss_pred             CCEEEEEEEEEEC
Confidence            9999999999884


No 10 
>smart00112 CA Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
Probab=99.68  E-value=4.4e-16  Score=116.70  Aligned_cols=79  Identities=49%  Similarity=0.733  Sum_probs=73.9

Q ss_pred             EeCCCCCCeEEEEEEEeCCCCCCeEEeCCccEEEEcccCCCccCcEEEEEEEEEECCCCCceEEEEEEEEEeeCCCCCC
Q psy6838         318 TDADEGVNGVVTYSVVSDWANDVFSINPQSGIFTLTAKLDYEDVHHYIFVVQAQDTGKPMMSSTVTVYFNVVDLNDNPP  396 (442)
Q Consensus       318 ~D~D~~~~~~i~y~l~~~~~~~~f~id~~tG~l~~~~~ld~e~~~~~~l~V~a~D~~~~~~~~~~~v~I~V~dvNd~~P  396 (442)
                      +|+|.+.++.++|+|.++....+|.|++.+|.|++.++||||....|.|.|+|+|.+.|++++.++|+|+|.|+|||+|
T Consensus         1 ~D~D~g~n~~i~Y~i~~~~~~~~F~i~~~tg~i~~~~~LD~e~~~~y~l~v~a~D~~~~~~~~~~~v~I~V~D~Nd~~P   79 (79)
T smart00112        1 TDADSGENGKVTYSILSGNEDGLFSIDPETGEITTTKPLDREEQPEYTLTVEATDGGGPPLSSTATVTVTVLDVNDNAP   79 (79)
T ss_pred             CCCCCCcCcEEEEEEecCCCCCEEEEeCCccEEEeCCccCeeCCCeEEEEEEEEECCCCCcccEEEEEEEEEECCCCCC
Confidence            3788888999999999776668999999999999999999999999999999999999999999999999999999998


No 11 
>KOG1834|consensus
Probab=99.64  E-value=2.1e-14  Score=138.91  Aligned_cols=152  Identities=26%  Similarity=0.430  Sum_probs=126.5

Q ss_pred             EeCCc--cEEEEcccCCcccCcEEEEEEEEEeCCCCCCC---CceeEEEEEEEEEeccCCCCeeccC-CeeEeeCCCCCC
Q psy6838         100 INPSN--GTLSNLKHLDRETKSSYNLVVMATDMAKPPQK---RLSSTTQVSIVIRDVNDNAPEFITA-NETTIPENSPLN  173 (442)
Q Consensus       100 i~~~~--g~l~~~~~lD~E~~~~~~l~v~a~D~~~~~~~---~~s~~~~v~I~V~dvNd~~P~f~~~-~~~~v~E~~~~g  173 (442)
                      +|..+  |.|+.+.+||-|.+..|+|+|+|.|++..|..   ..|..++|.|.|.|+|+++|+|..+ |.+.|.|+.. -
T Consensus        86 vdK~TGegvlRaK~~lDCelqkeytf~iQAydCg~gpdgtn~kKShkatvhIrVkDvNe~AP~f~ep~Yka~V~EGK~-y  164 (952)
T KOG1834|consen   86 VDKYTGEGVLRAKEPLDCELQKEYTFTIQAYDCGNGPDGTNTKKSHKATVHIRVKDVNEFAPVFKEPWYKAHVTEGKV-Y  164 (952)
T ss_pred             EeccCCceEEeecCcccccccccceEEEEEEecCCCCCccccccccceEEEEEeccccccCchhcccceeeEEeccee-e
Confidence            45555  57889999999999999999999999875521   2467889999999999999999988 8888888864 5


Q ss_pred             cEEEEEEEEeCCCC-CC-ceEEEEEecCCCCCeEecCcceEEEEcccCCcccccEEEEEEEEEeCCCCCCceeEEEEEEE
Q psy6838         174 TVVAALKAIDRDEG-QN-SYVEYSIINSEATPFSLGPVDGLLRVAQKLDREIKSNYTLEIKAKDRGEPSKSSRTKFFIRL  251 (442)
Q Consensus       174 ~~v~~v~a~D~D~~-~~-~~~~y~i~~~~~~~F~id~~tG~i~~~~~lD~e~~~~~~l~V~a~D~~~~~~~~~~~v~I~V  251 (442)
                      ..|+++.|.|.|-+ ++ ..+.|.|...+ -+|.||. .|.|+.+.+|+|.....|.|+|.|.|.|.........|+|.|
T Consensus       165 d~il~veAiD~DCspq~sqIC~YEI~t~d-~PFaIdn-~G~irnTekLny~ke~~Y~ltVtAyDCg~kraa~d~lV~v~V  242 (952)
T KOG1834|consen  165 DSILRVEAIDKDCSPQYSQICEYEITTPD-VPFAIDN-DGNIRNTEKLNYTKEHQYKLTVTAYDCGKKRAASDSLVTVHV  242 (952)
T ss_pred             eeeEEEEeecCCCCCcccceeEEEecCCC-CceEEcC-CCccccccccccccceeEEEEEEEEecccccccCcceEEEEe
Confidence            57899999999985 33 34689999865 7899997 999999999999999999999999999875555556777777


Q ss_pred             EeC
Q psy6838         252 LDE  254 (442)
Q Consensus       252 ~d~  254 (442)
                      ...
T Consensus       243 kp~  245 (952)
T KOG1834|consen  243 KPT  245 (952)
T ss_pred             cCc
Confidence            543


No 12 
>smart00112 CA Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
Probab=99.64  E-value=2.2e-15  Score=112.86  Aligned_cols=78  Identities=40%  Similarity=0.645  Sum_probs=73.6

Q ss_pred             EeCCCCCCceEEEEEecCCC-CCeEecCcceEEEEcccCCcccccEEEEEEEEEeCCCCCCceeEEEEEEEEeCCCCCC
Q psy6838         182 IDRDEGQNSYVEYSIINSEA-TPFSLGPVDGLLRVAQKLDREIKSNYTLEIKAKDRGEPSKSSRTKFFIRLLDENDNNP  259 (442)
Q Consensus       182 ~D~D~~~~~~~~y~i~~~~~-~~F~id~~tG~i~~~~~lD~e~~~~~~l~V~a~D~~~~~~~~~~~v~I~V~d~nd~~P  259 (442)
                      +|+|.+.++.++|+|.++.. .+|.|++.+|.|++.++||||....|.|.|+|+|.+.|++++.+.|+|.|.|+|||+|
T Consensus         1 ~D~D~g~n~~i~Y~i~~~~~~~~F~i~~~tg~i~~~~~LD~e~~~~y~l~v~a~D~~~~~~~~~~~v~I~V~D~Nd~~P   79 (79)
T smart00112        1 TDADSGENGKVTYSILSGNEDGLFSIDPETGEITTTKPLDREEQPEYTLTVEATDGGGPPLSSTATVTVTVLDVNDNAP   79 (79)
T ss_pred             CCCCCCcCcEEEEEEecCCCCCEEEEeCCccEEEeCCccCeeCCCeEEEEEEEEECCCCCcccEEEEEEEEEECCCCCC
Confidence            47899889999999998875 8999999999999999999999999999999999999889999999999999999998


No 13 
>PF08758 Cadherin_pro:  Cadherin prodomain like;  InterPro: IPR014868 Cadherins are a group of proteins that mediate calcium dependent cell-cell adhesion. They are activated through cleavage of a prosequence in the late Golgi. This protein corresponds to the folded region of the prosequence, and is termed the prodomain. The prodomain shows structural resemblance to the cadherin domain, but lacks all the features known to be important for cadherin-cadherin interactions []. ; GO: 0007155 cell adhesion, 0016021 integral to membrane; PDB: 1OP4_A.
Probab=97.54  E-value=0.0012  Score=49.94  Aligned_cols=78  Identities=23%  Similarity=0.358  Sum_probs=42.4

Q ss_pred             CCccCCcceeEEEecCCCCCcEEEEEEEEeCCCCCCeEEEEEEEeCCCCCCeEEeCCccEEEEcccCCCccCcEEEEEEE
Q psy6838         290 SPVFSEPVYAFDVLEDLPRGSRVGQVFTTDADEGVNGVVTYSVVSDWANDVFSINPQSGIFTLTAKLDYEDVHHYIFVVQ  369 (442)
Q Consensus       290 ~p~~~~~~~~~~v~E~~~~g~~v~~v~a~D~D~~~~~~i~y~l~~~~~~~~f~id~~tG~l~~~~~ld~e~~~~~~l~V~  369 (442)
                      .|-|.+..|.+.|..+...|..||+|.-.|...  +..+.|.-.    ...|.|.++ |.|++++++.... ..-.|+|.
T Consensus         3 ~pGF~~~~~~~~Vp~~l~~g~~lg~V~f~dC~~--~~~~~~~ss----DpdF~V~~D-GsVy~~r~v~l~~-~~~~F~V~   74 (90)
T PF08758_consen    3 RPGFSQKKYTFEVPSNLEAGQPLGKVNFEDCTG--RRRVIFESS----DPDFRVLED-GSVYAKRPVQLSS-EQRSFTVH   74 (90)
T ss_dssp             --B--S-EEEE----SS-SS--EEE---B--SS-----EEEE-------SEEEEETT-TEEEEES--S-SS-S-EEEEEE
T ss_pred             cCCcccceEEEEcCchhhCCcEEEEEEeccCCC--CCceEEecC----CCCEEEcCC-CeEEEeeeEecCC-CceEEEEE
Confidence            367888888899999999999999999999853  455777654    238999987 9999999987654 34689999


Q ss_pred             EEECCC
Q psy6838         370 AQDTGK  375 (442)
Q Consensus       370 a~D~~~  375 (442)
                      |.|..+
T Consensus        75 a~D~~~   80 (90)
T PF08758_consen   75 AWDSQT   80 (90)
T ss_dssp             EEETTT
T ss_pred             EECCCC
Confidence            999876


No 14 
>PF08266 Cadherin_2:  Cadherin-like;  InterPro: IPR013164 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion. This entry represents a cadherin domain that is usually found at the N terminus of cadherin proteins.; PDB: 1WUZ_A 1WYJ_A.
Probab=97.45  E-value=8.7e-05  Score=55.25  Aligned_cols=63  Identities=21%  Similarity=0.462  Sum_probs=29.9

Q ss_pred             eEEEEEecCCCCCcEEEEEEeeeCCCCCC--cEEEEEEEeCCCCCcEEEcCCccEEEEeeeccccc
Q psy6838          15 VYAFDVLEDLPRGSRVGQVFTTDADEGVN--GVVTYSVVSDWANDVFSINPQSGIFTLTAKLDYED   78 (442)
Q Consensus        15 ~~~~~v~E~~~~g~~v~~~~a~D~D~~~~--~~~~y~l~~~~~~~~f~id~~tg~~~~~~~ld~~~   78 (442)
                      ....+|+|..+.|+.||.+ |.|.-....  ....+++.+.....||.+++++|.|++..++|+|.
T Consensus         2 qi~YsV~EE~~~Gt~IGni-a~dL~l~~~~l~~~~~ri~s~~~~~~~~v~~~tG~L~v~~rIDRE~   66 (84)
T PF08266_consen    2 QIRYSVPEEMPPGTVIGNI-AKDLGLDPQSLSSRNFRIVSEGNSQYFRVNEKTGDLFVSERIDREE   66 (84)
T ss_dssp             EEEEEEESS--TT-EEEEC-CCCCT--HHHHCCTTBEEE-SSSS-SEEE-TTTSEEEESS--SCCC
T ss_pred             CeEEEeecCCCCCCEEEEh-HHhhCCCcccccccceEEeecCCcceeEecCCceeEEeCCccCHHH
Confidence            3578999999999999999 555432110  01234444433444555555555555544455444


No 15 
>PF08266 Cadherin_2:  Cadherin-like;  InterPro: IPR013164 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion. This entry represents a cadherin domain that is usually found at the N terminus of cadherin proteins.; PDB: 1WUZ_A 1WYJ_A.
Probab=97.41  E-value=9.9e-05  Score=54.96  Aligned_cols=62  Identities=21%  Similarity=0.478  Sum_probs=40.3

Q ss_pred             eeEEEecCCCCCcEEEEEEEEeCCCCCC--eEEEEEEEeCCCCCCeEEeCCccEEEEcccCCCcc
Q psy6838         298 YAFDVLEDLPRGSRVGQVFTTDADEGVN--GVVTYSVVSDWANDVFSINPQSGIFTLTAKLDYED  360 (442)
Q Consensus       298 ~~~~v~E~~~~g~~v~~v~a~D~D~~~~--~~i~y~l~~~~~~~~f~id~~tG~l~~~~~ld~e~  360 (442)
                      ..++|.|..++|+.||.+ |.|--....  ....|+|.......+|.++..+|.|+++..+|||.
T Consensus         3 i~YsV~EE~~~Gt~IGni-a~dL~l~~~~l~~~~~ri~s~~~~~~~~v~~~tG~L~v~~rIDRE~   66 (84)
T PF08266_consen    3 IRYSVPEEMPPGTVIGNI-AKDLGLDPQSLSSRNFRIVSEGNSQYFRVNEKTGDLFVSERIDREE   66 (84)
T ss_dssp             EEEEEESS--TT-EEEEC-CCCCT--HHHHCCTTBEEE-SSSS-SEEE-TTTSEEEESS--SCCC
T ss_pred             eEEEeecCCCCCCEEEEh-HHhhCCCcccccccceEEeecCCcceeEecCCceeEEeCCccCHHH
Confidence            457899999999999998 344432111  12367877777788999999999999999999996


No 16 
>PF08758 Cadherin_pro:  Cadherin prodomain like;  InterPro: IPR014868 Cadherins are a group of proteins that mediate calcium dependent cell-cell adhesion. They are activated through cleavage of a prosequence in the late Golgi. This protein corresponds to the folded region of the prosequence, and is termed the prodomain. The prodomain shows structural resemblance to the cadherin domain, but lacks all the features known to be important for cadherin-cadherin interactions []. ; GO: 0007155 cell adhesion, 0016021 integral to membrane; PDB: 1OP4_A.
Probab=97.11  E-value=0.0084  Score=45.30  Aligned_cols=79  Identities=22%  Similarity=0.340  Sum_probs=41.2

Q ss_pred             CCccCCCeEEEEEecCCCCCcEEEEEEeeeCCCCCCcEEEEEEEeCCCCCcEEEcCCccEEEEeeecccccccCCCcEEE
Q psy6838           8 SPVFSEPVYAFDVLEDLPRGSRVGQVFTTDADEGVNGVVTYSVVSDWANDVFSINPQSGIFTLTAKLDYEDVSDNGRLEY   87 (442)
Q Consensus         8 ~P~f~~~~~~~~v~E~~~~g~~v~~~~a~D~D~~~~~~~~y~l~~~~~~~~f~id~~tg~~~~~~~ld~~~~~~~~~i~y   87 (442)
                      -|-|....|.+.|+.+...|..|++|.-.|...  +..+.|+..+                                   
T Consensus         3 ~pGF~~~~~~~~Vp~~l~~g~~lg~V~f~dC~~--~~~~~~~ssD-----------------------------------   45 (90)
T PF08758_consen    3 RPGFSQKKYTFEVPSNLEAGQPLGKVNFEDCTG--RRRVIFESSD-----------------------------------   45 (90)
T ss_dssp             --B--S-EEEE----SS-SS--EEE---B--SS-----EEEE--------------------------------------
T ss_pred             cCCcccceEEEEcCchhhCCcEEEEEEeccCCC--CCceEEecCC-----------------------------------
Confidence            588999999999999999999999999999853  4456665422                                   


Q ss_pred             EEEeCCCCCcEEEeCCccEEEEcccCCcccCcEEEEEEEEEeCCCC
Q psy6838          88 SIVSGDDEEIFTINPSNGTLSNLKHLDRETKSSYNLVVMATDMAKP  133 (442)
Q Consensus        88 ~i~~~~~~~~F~i~~~~g~l~~~~~lD~E~~~~~~l~v~a~D~~~~  133 (442)
                              ..|.|. ..|.|.+++++..... .-.|.|.|.|..+.
T Consensus        46 --------pdF~V~-~DGsVy~~r~v~l~~~-~~~F~V~a~D~~~~   81 (90)
T PF08758_consen   46 --------PDFRVL-EDGSVYAKRPVQLSSE-QRSFTVHAWDSQTQ   81 (90)
T ss_dssp             --------SEEEEE-TTTEEEEES--S-SSS--EEEEEEEEETTTT
T ss_pred             --------CCEEEc-CCCeEEEeeeEecCCC-ceEEEEEEECCCCC
Confidence                    367776 6889999988876433 35799999998863


No 17 
>smart00736 CADG Dystroglycan-type cadherin-like domains. Cadherin-homologous domains present in metazoan dystroglycans and alpha/epsilon sarcoglycans, yeast Axl2p and in a very large protein from magnetotactic bacteria. Likely to bind calcium ions.
Probab=97.02  E-value=0.014  Score=44.99  Aligned_cols=69  Identities=30%  Similarity=0.342  Sum_probs=53.8

Q ss_pred             EeCCCCCCeEEEEEEEeC---CCCCCeEEeCCccEEEEcccCCCccCcEEEEEEEEEECCCCCceEEEEEEEEEeeCCC
Q psy6838         318 TDADEGVNGVVTYSVVSD---WANDVFSINPQSGIFTLTAKLDYEDVHHYIFVVQAQDTGKPMMSSTVTVYFNVVDLND  393 (442)
Q Consensus       318 ~D~D~~~~~~i~y~l~~~---~~~~~f~id~~tG~l~~~~~ld~e~~~~~~l~V~a~D~~~~~~~~~~~v~I~V~dvNd  393 (442)
                      .|+|   +..++|++...   ....|..+++.++.++-. +. .+..+.|.+.|+|+|+.+  .++...++|.|.+.|+
T Consensus        25 ~d~d---~~~lty~~~~~~~~~lP~Wl~fd~~~~~~~Gt-P~-~~~~g~~~i~v~a~D~~g--~~~~~~f~i~V~~~~~   96 (97)
T smart00736       25 TDAD---GDTLTYSATLSDGSALPSWLSFDSDTGTLSGT-PT-NSDVGSLSLKVTATDSSG--ASASDTFTITVVNTND   96 (97)
T ss_pred             ECCC---CCeEEEEEEeCCCCCCCCeEEEeCCCCEEEEE-CC-CCCCcEEEEEEEEEECCC--CEEEEEEEEEEeCCCC
Confidence            6776   46799988743   235799999998888763 33 334577999999999876  6788999999999987


No 18 
>TIGR01965 VCBS_repeat VCBS repeat. This domain of about 100 residues is found multiple (up to 35) copies in long proteins from several species of Vibrio, Colwellia, Bradyrhizobium, and Shewanella (hence the name VCBS) and in smaller copy numbers in proteins from several other bacteria. The large protein size and repeat copy numbers, species distribution, and suggested activities of several member proteins suggests a role for this domain in adhesion.
Probab=96.98  E-value=0.0084  Score=45.88  Aligned_cols=87  Identities=24%  Similarity=0.303  Sum_probs=58.2

Q ss_pred             EEEEEEeCCCCCCeEEEEEEEe-CCCCCCeEEeCCccEEEEc--------ccCCCccCcEEEEEEEEEECCCCCceEEEE
Q psy6838         313 GQVFTTDADEGVNGVVTYSVVS-DWANDVFSINPQSGIFTLT--------AKLDYEDVHHYIFVVQAQDTGKPMMSSTVT  383 (442)
Q Consensus       313 ~~v~a~D~D~~~~~~i~y~l~~-~~~~~~f~id~~tG~l~~~--------~~ld~e~~~~~~l~V~a~D~~~~~~~~~~~  383 (442)
                      |++.++|+|.+.  ..++++.. ...-+.|.|++ .|..+..        +.|...+...-+|+|++.|+      .+++
T Consensus         2 G~Lt~sD~D~gd--~~~~s~~~~~g~yGtlti~~-~G~wtYtl~n~~~avq~L~~Ge~~tdsFtvtv~DG------tt~~   72 (99)
T TIGR01965         2 GQLTISDADAGQ--AHFIAQTDAAGQYGTFSIDA-DGQWTYQADNSQTAVQALKAGETLTDTFTVTSADG------TSQT   72 (99)
T ss_pred             CceEEeCCCCCC--ceEEecccccCCcEEEEECC-CCcEEEEeCCCcHHHHhhcCCCEEEEEEEEEEeCC------CeEE
Confidence            567789999754  45666642 22346788887 4755443        12332234467899999995      2899


Q ss_pred             EEEEEeeCCCCCCccCCCCCeeEEecC
Q psy6838         384 VYFNVVDLNDNPPLFDPMPPNIPVYEN  410 (442)
Q Consensus       384 v~I~V~dvNd~~P~f~~~~~~~~v~E~  410 (442)
                      |+|+|...|| +|.... .-...+.|+
T Consensus        73 vtItI~GtND-apvi~~-~~~g~v~ED   97 (99)
T TIGR01965        73 VTITITGAND-AAVIGG-ADTGSVTED   97 (99)
T ss_pred             EEEEEEccCC-CCEEec-ccceeEecC
Confidence            9999999999 887653 345677776


No 19 
>TIGR01965 VCBS_repeat VCBS repeat. This domain of about 100 residues is found multiple (up to 35) copies in long proteins from several species of Vibrio, Colwellia, Bradyrhizobium, and Shewanella (hence the name VCBS) and in smaller copy numbers in proteins from several other bacteria. The large protein size and repeat copy numbers, species distribution, and suggested activities of several member proteins suggests a role for this domain in adhesion.
Probab=96.82  E-value=0.015  Score=44.59  Aligned_cols=50  Identities=24%  Similarity=0.298  Sum_probs=36.7

Q ss_pred             ccCCcccCcEEEEEEEEEeCCCCCCCCceeEEEEEEEEEeccCCCCeeccCCeeEeeCCC
Q psy6838         111 KHLDRETKSSYNLVVMATDMAKPPQKRLSSTTQVSIVIRDVNDNAPEFITANETTIPENS  170 (442)
Q Consensus       111 ~~lD~E~~~~~~l~v~a~D~~~~~~~~~s~~~~v~I~V~dvNd~~P~f~~~~~~~v~E~~  170 (442)
                      +.|...+.-.-+|++.+.|+.         +.+|.|+|...|| +|++.......+.|+.
T Consensus        49 q~L~~Ge~~tdsFtvtv~DGt---------t~~vtItI~GtND-apvi~~~~~g~v~ED~   98 (99)
T TIGR01965        49 QALKAGETLTDTFTVTSADGT---------SQTVTITITGAND-AAVIGGADTGSVTEDS   98 (99)
T ss_pred             HhhcCCCEEEEEEEEEEeCCC---------eEEEEEEEEccCC-CCEEecccceeEecCC
Confidence            344444445567888888842         5789999999999 8888777777777764


No 20 
>smart00736 CADG Dystroglycan-type cadherin-like domains. Cadherin-homologous domains present in metazoan dystroglycans and alpha/epsilon sarcoglycans, yeast Axl2p and in a very large protein from magnetotactic bacteria. Likely to bind calcium ions.
Probab=96.73  E-value=0.035  Score=42.79  Aligned_cols=31  Identities=32%  Similarity=0.389  Sum_probs=26.5

Q ss_pred             CcEEEEEEEEEeCCCCCCCCceeEEEEEEEEEeccC
Q psy6838         118 KSSYNLVVMATDMAKPPQKRLSSTTQVSIVIRDVND  153 (442)
Q Consensus       118 ~~~~~l~v~a~D~~~~~~~~~s~~~~v~I~V~dvNd  153 (442)
                      ...|.++|.|+|..+     .+....++|.|.+.|+
T Consensus        66 ~g~~~i~v~a~D~~g-----~~~~~~f~i~V~~~~~   96 (97)
T smart00736       66 VGSLSLKVTATDSSG-----ASASDTFTITVVNTND   96 (97)
T ss_pred             CcEEEEEEEEEECCC-----CEEEEEEEEEEeCCCC
Confidence            356999999999886     4788899999999886


No 21 
>PF13750 Big_3_3:  Bacterial Ig-like domain (group 3)
Probab=95.95  E-value=0.86  Score=38.54  Aligned_cols=126  Identities=21%  Similarity=0.345  Sum_probs=72.5

Q ss_pred             CcEEEEEE-EEEeCCCCCCCCceeEEEEEEEEEeccCCCCeeccCCeeEeeCCCCC-CcEEEEEEEEeCCCCCCceEEEE
Q psy6838         118 KSSYNLVV-MATDMAKPPQKRLSSTTQVSIVIRDVNDNAPEFITANETTIPENSPL-NTVVAALKAIDRDEGQNSYVEYS  195 (442)
Q Consensus       118 ~~~~~l~v-~a~D~~~~~~~~~s~~~~v~I~V~dvNd~~P~f~~~~~~~v~E~~~~-g~~v~~v~a~D~D~~~~~~~~y~  195 (442)
                      ...|++++ .|.|..+.     ..+..+..++. +...+|.+.-.....+..+... +..-..+.++|.-.+. ..-+.+
T Consensus        14 dG~Y~l~~~~a~D~agN-----~~~~~~~~~~~-iD~T~Ptisi~~~~~~~~g~~v~~~~~i~i~~tD~~~~~-~i~sv~   86 (158)
T PF13750_consen   14 DGSYTLTVVTATDAAGN-----TSTSTVSETFT-IDNTPPTISISDGASVANGSTVYGLVNISINVTDNSDDS-KITSVS   86 (158)
T ss_pred             CccEEEEEEEEEecCCC-----EEEEEEeeEEE-EcCCCCEEEEecCCccCCCccccceeeeEEEEEeCCCCc-eEEEEE
Confidence            46899999 79999874     44444443333 3456898866322233333332 3344557777766544 335677


Q ss_pred             EecCC-CCCe--EecC-cceEEEEc--ccC-CcccccEEEEEEEEEeCCCCCCceeEEEEEEEE
Q psy6838         196 IINSE-ATPF--SLGP-VDGLLRVA--QKL-DREIKSNYTLEIKAKDRGEPSKSSRTKFFIRLL  252 (442)
Q Consensus       196 i~~~~-~~~F--~id~-~tG~i~~~--~~l-D~e~~~~~~l~V~a~D~~~~~~~~~~~v~I~V~  252 (442)
                      |.++. ...-  .... ..|...+.  +.+ ..|....|+|+|.|+|..  +...+..+.....
T Consensus        87 l~Gg~~~d~v~ls~~~~~~~~~~~~yp~~fpsle~~~~YtLtV~a~D~a--GN~~~~si~F~y~  148 (158)
T PF13750_consen   87 LTGGPASDSVSLSWTNKGNGVYTLEYPRIFPSLEADDSYTLTVSATDKA--GNQSTKSISFSYM  148 (158)
T ss_pred             EECCcccceEEEeeEeccCceEEeecccccCCcCCCCeEEEEEEEEecC--CCEEEEEEEEEEe
Confidence            77665 2221  1111 13333332  111 236788999999999987  4566666666655


No 22 
>PF13750 Big_3_3:  Bacterial Ig-like domain (group 3)
Probab=95.64  E-value=1.2  Score=37.75  Aligned_cols=76  Identities=24%  Similarity=0.330  Sum_probs=42.5

Q ss_pred             EEEEEeCCCCCCeEEEEEEEeCCCCCCeEE--eCC-ccEEEEc--cc-CCCccCcEEEEEEEEEECCCCCceEEEEEEEE
Q psy6838         314 QVFTTDADEGVNGVVTYSVVSDWANDVFSI--NPQ-SGIFTLT--AK-LDYEDVHHYIFVVQAQDTGKPMMSSTVTVYFN  387 (442)
Q Consensus       314 ~v~a~D~D~~~~~~i~y~l~~~~~~~~f~i--d~~-tG~l~~~--~~-ld~e~~~~~~l~V~a~D~~~~~~~~~~~v~I~  387 (442)
                      .+.++|.-... .....+|.+|.......+  ... .|...+.  +. ...|....|+|+|.|.|..+  ...+..+.+.
T Consensus        70 ~i~~tD~~~~~-~i~sv~l~Gg~~~d~v~ls~~~~~~~~~~~~yp~~fpsle~~~~YtLtV~a~D~aG--N~~~~si~F~  146 (158)
T PF13750_consen   70 SINVTDNSDDS-KITSVSLTGGPASDSVSLSWTNKGNGVYTLEYPRIFPSLEADDSYTLTVSATDKAG--NQSTKSISFS  146 (158)
T ss_pred             EEEEEeCCCCc-eEEEEEEECCcccceEEEeeEeccCceEEeecccccCCcCCCCeEEEEEEEEecCC--CEEEEEEEEE
Confidence            34445554332 344566665443333222  111 2333332  11 12366789999999999877  6778888887


Q ss_pred             EeeCC
Q psy6838         388 VVDLN  392 (442)
Q Consensus       388 V~dvN  392 (442)
                      ....|
T Consensus       147 y~P~~  151 (158)
T PF13750_consen  147 YMPPN  151 (158)
T ss_pred             EeCCe
Confidence            76543


No 23 
>TIGR00845 caca sodium/calcium exchanger 1. This model is specific for the eukaryotic sodium ion/calcium ion exchangers of the Caca family
Probab=92.98  E-value=16  Score=39.73  Aligned_cols=48  Identities=23%  Similarity=0.255  Sum_probs=30.5

Q ss_pred             ccCCCCeeccC-CeeEeeCCCCCCcEEEEEEEEeCCCCCCceEEEEEecCC
Q psy6838         151 VNDNAPEFITA-NETTIPENSPLNTVVAALKAIDRDEGQNSYVEYSIINSE  200 (442)
Q Consensus       151 vNd~~P~f~~~-~~~~v~E~~~~g~~v~~v~a~D~D~~~~~~~~y~i~~~~  200 (442)
                      .||..+.+... ....+.|+.  |+.-..|.-...|.+....+.|+..++.
T Consensus       395 ~dd~~s~i~Fe~~~Y~V~En~--GtV~VtV~R~GGdl~~tVsVdY~T~DGT  443 (928)
T TIGR00845       395 ENDPVSKIFFEPGHYTCLENC--GTVALTVVRRGGDLTNTVYVDYRTEDGT  443 (928)
T ss_pred             ccCCcceEEecCCeEEEeecC--cEEEEEEEEccCCCCceEEEEEEccCCc
Confidence            34444455433 677899986  6666666554445544566889888776


No 24 
>PF05345 He_PIG:  Putative Ig domain;  InterPro: IPR008009 This alignment represents the conserved core region of a ~90 residue repeat found in several haemagglutinins and other cell surface proteins. Sequence similarities to Hyalin (IPR003410 from INTERPRO) and the PKD domain (IPR000601 from INTERPRO) suggest an Ig-like fold so this family may be similar in function to the (IPR003791 from INTERPRO) and (IPR003790 from INTERPRO) protein families.
Probab=92.43  E-value=0.52  Score=31.08  Aligned_cols=37  Identities=24%  Similarity=0.304  Sum_probs=28.6

Q ss_pred             CCCCCeEEeCCccEEEEcccCCCc-cCcEEEEEEEEEECC
Q psy6838         336 WANDVFSINPQSGIFTLTAKLDYE-DVHHYIFVVQAQDTG  374 (442)
Q Consensus       336 ~~~~~f~id~~tG~l~~~~~ld~e-~~~~~~l~V~a~D~~  374 (442)
                      ....+..||+.+|.|+-.  .+.. ....|.|+|+|+|..
T Consensus        11 ~LP~gLs~d~~tG~isGt--p~~~~~~G~y~~~vtatd~~   48 (49)
T PF05345_consen   11 GLPSGLSLDPSTGTISGT--PTSSVQPGTYTFTVTATDGS   48 (49)
T ss_pred             CCCCcEEEeCCCCEEEee--cCCCccccEEEEEEEEEcCC
Confidence            456789999999999765  3333 336999999999975


No 25 
>TIGR00845 caca sodium/calcium exchanger 1. This model is specific for the eukaryotic sodium ion/calcium ion exchangers of the Caca family
Probab=90.75  E-value=8.1  Score=41.92  Aligned_cols=53  Identities=17%  Similarity=0.226  Sum_probs=34.2

Q ss_pred             EEEEEEEeeCCCCCCccCCCCCeeEEecCCCCCceEEE-EEEEeCCCC-eeEEEEeeC
Q psy6838         382 VTVYFNVVDLNDNPPLFDPMPPNIPVYENVTLGTGLAS-VHAVDLDSG-KLPFLLPIG  437 (442)
Q Consensus       382 ~~v~I~V~dvNd~~P~f~~~~~~~~v~E~~~~g~~v~~-v~a~D~D~~-~~~y~i~~~  437 (442)
                      .+.+|+|.| ||++|.|........|.|+  .|..-.+ ++..+.++. .+.|.-.+|
T Consensus       515 s~ATVTIlD-DD~aGIfsFe~~~~sV~Es--~G~vtvtV~RtsGa~G~VtV~Y~T~dG  569 (928)
T TIGR00845       515 NTATVTILD-DDHAGIFTFEEDVFHVSES--IGIMEVKVLRTSGARGTVIVPYRTVEG  569 (928)
T ss_pred             ceEEEEEec-CcccCcccccCceEEEEcC--CCEEEEEEEEcCCCCeeEEEEEEeecC
Confidence            456677777 6789988766666788898  4554333 334444444 777877665


No 26 
>TIGR03660 T1SS_rpt_143 T1SS-143 repeat domain. This model represents a domain of about 143 amino acids that may occur singly or in up to 23 tandem repeats in very large proteins in the genus Vibrio, and in related species such as Legionella pneumophila, Photobacterium profundum, Rhodopseudomonas palustris, Shewanella pealeana, and Aeromonas hydrophila. Proteins with these domains represent a subset of a broader set of proteins with a particular signal for type 1 secretion, consisting of several glycine-rich repeats modeled by pfam00353, followed by a C-terminal domain modeled by TIGR03661. Proteins with this domain tend to share several properties with the RtxA (Repeats in Toxin) protein of Vibrio cholerae, including a large size often containing tandemly repeated domains and a C-terminal signal for type 1 secretion.
Probab=88.56  E-value=12  Score=30.65  Aligned_cols=57  Identities=23%  Similarity=0.388  Sum_probs=40.2

Q ss_pred             EEEEcccCCccc---CcEEEEEEEEEeCCCCCCCCceeEEEEEEEEEeccCCCCeeccCCeeEeeCCC
Q psy6838         106 TLSNLKHLDRET---KSSYNLVVMATDMAKPPQKRLSSTTQVSIVIRDVNDNAPEFITANETTIPENS  170 (442)
Q Consensus       106 ~l~~~~~lD~E~---~~~~~l~v~a~D~~~~~~~~~s~~~~v~I~V~dvNd~~P~f~~~~~~~v~E~~  170 (442)
                      .+.+..+||+..   .-...|.|.|+|..+.     .++..+.|+|.|  | .|.........|.|+.
T Consensus        69 tftL~~~lDH~~g~d~l~l~~~v~a~D~DGD-----~s~~~l~VtI~D--D-~P~~~~~~~~~V~E~~  128 (137)
T TIGR03660        69 EFTLEGPLDHAAGSDELTLNFPIIATDFDGD-----TSSITLPVTIVD--D-VPTITDVDALTVDEDD  128 (137)
T ss_pred             EEEEcccccCCCCCceEEEeeeEEEEeCCCC-----ccccEEEEEEEC--C-CCeeccccceEEeccc
Confidence            344567888743   4467888999998874     234578888877  5 6887666667888853


No 27 
>TIGR00864 PCC polycystin cation channel protein. Note: this model has been restricted to the amino half because for technical reasons.
Probab=88.08  E-value=75  Score=39.05  Aligned_cols=112  Identities=13%  Similarity=0.142  Sum_probs=61.5

Q ss_pred             CCCcEEEEEEeeeCCCCCCcEEEEEEEeCCCCCcEEEcCCccEEEEeeeccccc-ccCCCcEEEEEEeCCCCCcEEEeCC
Q psy6838          25 PRGSRVGQVFTTDADEGVNGVVTYSVVSDWANDVFSINPQSGIFTLTAKLDYED-VSDNGRLEYSIVSGDDEEIFTINPS  103 (442)
Q Consensus        25 ~~g~~v~~~~a~D~D~~~~~~~~y~l~~~~~~~~f~id~~tg~~~~~~~ld~~~-~~~~~~i~y~i~~~~~~~~F~i~~~  103 (442)
                      .+|+...+++|++.. |.+ ..+-.+.=..+-..|.+++..+.+.+...+.+.. ...+..+.|+..-++....+.-++.
T Consensus      1482 ~~GtYtVtLTvtN~~-Gss-t~T~~VtV~~pV~~~tinas~~~vpl~~sV~Fta~~s~Gs~v~ysWdFGDg~ts~~~npt 1559 (2740)
T TIGR00864      1482 SPGDFNIRLAAANEV-GKN-EATLNVAVKARVRGLTINASLTNVPLNGSVHFEAHLDAGDDVRFSWILCDHCTPIFGGNT 1559 (2740)
T ss_pred             CCceEEEEEEEECCC-Cce-EEEEEEEEeccccceEEcCCCccccccceEEEEEEccCCCceeEEEEeCCCCccccCCCc
Confidence            357777777777664 222 1111221112334577777777665554454443 2245567777776654322221110


Q ss_pred             ccEEEEcccCCcccCcEEEEEEEEEeCCCCCCCCceeEEEEEEEEEec
Q psy6838         104 NGTLSNLKHLDRETKSSYNLVVMATDMAKPPQKRLSSTTQVSIVIRDV  151 (442)
Q Consensus       104 ~g~l~~~~~lD~E~~~~~~l~v~a~D~~~~~~~~~s~~~~v~I~V~dv  151 (442)
                             ..--|.+...|++++.|++..|.      ...++.|.|.+.
T Consensus      1560 -------~~yTY~spGtYtVtLTvtN~~Gs------~~~T~~i~V~~~ 1594 (2740)
T TIGR00864      1560 -------IFYTFRSVGTFNIIVTAENDVGA------AQASIFLFVLQE 1594 (2740)
T ss_pred             -------eEEeecCCceEEEEEEEecCCCc------cceeEEEEEeec
Confidence                   01135677899999999998873      345556666643


No 28 
>PF07495 Y_Y_Y:  Y_Y_Y domain;  InterPro: IPR011123 This region is mostly found at the end of the beta propellers (IPR011110 from INTERPRO) in a family of two component regulators. However they are also found tandemly repeated in Q891H4 from SWISSPROT without other signal conduction domains being present. It is named after the conserved tyrosines found in the alignment. The exact function is not known.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=87.93  E-value=6.1  Score=27.48  Aligned_cols=60  Identities=23%  Similarity=0.333  Sum_probs=36.4

Q ss_pred             CCceEEEEEecCCCCCeEecCcceEEEEcccCCcccccEEEEEEEEEeCCCCCCceeEEEEEEE
Q psy6838         188 QNSYVEYSIINSEATPFSLGPVDGLLRVAQKLDREIKSNYTLEIKAKDRGEPSKSSRTKFFIRL  251 (442)
Q Consensus       188 ~~~~~~y~i~~~~~~~F~id~~tG~i~~~~~lD~e~~~~~~l~V~a~D~~~~~~~~~~~v~I~V  251 (442)
                      .+-...|+|.+-+..|..+...+-.+... .|   ..+.|+|.|+|.|..+.......++.|+|
T Consensus         6 ~~~~Y~Y~l~g~d~~W~~~~~~~~~~~~~-~L---~~G~Y~l~V~a~~~~~~~~~~~~~l~i~I   65 (66)
T PF07495_consen    6 ENIRYRYRLEGFDDEWITLGSYSNSISYT-NL---PPGKYTLEVRAKDNNGKWSSDEKSLTITI   65 (66)
T ss_dssp             TTEEEEEEEETTESSEEEESSTS-EEEEE-S-----SEEEEEEEEEEETTS-B-SS-EEEEEEE
T ss_pred             CceEEEEEEECCCCeEEECCCCcEEEEEE-eC---CCEEEEEEEEEECCCCCcCcccEEEEEEE
Confidence            34457788888777777766533244332 22   46899999999997754333336666665


No 29 
>PF07495 Y_Y_Y:  Y_Y_Y domain;  InterPro: IPR011123 This region is mostly found at the end of the beta propellers (IPR011110 from INTERPRO) in a family of two component regulators. However they are also found tandemly repeated in Q891H4 from SWISSPROT without other signal conduction domains being present. It is named after the conserved tyrosines found in the alignment. The exact function is not known.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=87.14  E-value=8  Score=26.85  Aligned_cols=60  Identities=17%  Similarity=0.264  Sum_probs=35.0

Q ss_pred             CeEEEEEEEeCCCCCCeEEeCCccEEEEcccCCCccCcEEEEEEEEEECCCCCceEEEEEEEEEe
Q psy6838         325 NGVVTYSVVSDWANDVFSINPQSGIFTLTAKLDYEDVHHYIFVVQAQDTGKPMMSSTVTVYFNVV  389 (442)
Q Consensus       325 ~~~i~y~l~~~~~~~~f~id~~tG~l~~~~~ld~e~~~~~~l~V~a~D~~~~~~~~~~~v~I~V~  389 (442)
                      +-...|+|. |-...|..+...+-.+..+ .|   ..+.|+|.|+|.|..+.......++.|+|.
T Consensus         7 ~~~Y~Y~l~-g~d~~W~~~~~~~~~~~~~-~L---~~G~Y~l~V~a~~~~~~~~~~~~~l~i~I~   66 (66)
T PF07495_consen    7 NIRYRYRLE-GFDDEWITLGSYSNSISYT-NL---PPGKYTLEVRAKDNNGKWSSDEKSLTITIL   66 (66)
T ss_dssp             TEEEEEEEE-TTESSEEEESSTS-EEEEE-S-----SEEEEEEEEEEETTS-B-SS-EEEEEEEE
T ss_pred             ceEEEEEEE-CCCCeEEECCCCcEEEEEE-eC---CCEEEEEEEEEECCCCCcCcccEEEEEEEC
Confidence            445677777 4445666665543244433 22   568999999999977644333367776663


No 30 
>TIGR03660 T1SS_rpt_143 T1SS-143 repeat domain. This model represents a domain of about 143 amino acids that may occur singly or in up to 23 tandem repeats in very large proteins in the genus Vibrio, and in related species such as Legionella pneumophila, Photobacterium profundum, Rhodopseudomonas palustris, Shewanella pealeana, and Aeromonas hydrophila. Proteins with these domains represent a subset of a broader set of proteins with a particular signal for type 1 secretion, consisting of several glycine-rich repeats modeled by pfam00353, followed by a C-terminal domain modeled by TIGR03661. Proteins with this domain tend to share several properties with the RtxA (Repeats in Toxin) protein of Vibrio cholerae, including a large size often containing tandemly repeated domains and a C-terminal signal for type 1 secretion.
Probab=79.53  E-value=23  Score=29.08  Aligned_cols=56  Identities=29%  Similarity=0.360  Sum_probs=36.6

Q ss_pred             EEEEcccCCCcc---CcEEEEEEEEEECCCCCceEEEEEEEEEeeCCCCCCccCCCCCeeEEecC
Q psy6838         349 IFTLTAKLDYED---VHHYIFVVQAQDTGKPMMSSTVTVYFNVVDLNDNPPLFDPMPPNIPVYEN  410 (442)
Q Consensus       349 ~l~~~~~ld~e~---~~~~~l~V~a~D~~~~~~~~~~~v~I~V~dvNd~~P~f~~~~~~~~v~E~  410 (442)
                      .+++.++||...   .-...|.|.|+|..+.  .+..++.|+|.|  | .|...... ...|.|+
T Consensus        69 tftL~~~lDH~~g~d~l~l~~~v~a~D~DGD--~s~~~l~VtI~D--D-~P~~~~~~-~~~V~E~  127 (137)
T TIGR03660        69 EFTLEGPLDHAAGSDELTLNFPIIATDFDGD--TSSITLPVTIVD--D-VPTITDVD-ALTVDED  127 (137)
T ss_pred             EEEEcccccCCCCCceEEEeeeEEEEeCCCC--ccccEEEEEEEC--C-CCeecccc-ceEEecc
Confidence            444556666532   3367889999997763  334688888888  5 57765433 3678775


No 31 
>PF02010 REJ:  REJ domain;  InterPro: IPR002859 The REJ (Receptor for Egg Jelly) domain is found in PKD1 P98161 from SWISSPROT and the sperm receptor for egg jelly Q26627 from SWISSPROT. The exact function of this domain is unknown. The domain is 600 amino acids long so is probably composed of multiple structural domains. There are six completely conserved cysteine residues that may form disulphide bridges. This region contains tandem PKD-like domains. Sequence similarity between a region of the autosomal dominant polycystic kidney disease (ADPKD) protein, polycystin-1 and a sea urchin sperm glycoprotein involved in fertilization, the receptor for egg jelly (suREJ) has been known for some time. The suREJ protein binds the glycoprotein coat of the egg (egg jelly), triggering the acrosome reaction, which transforms the sperm into a fusogenic cell. The sequence similarity and expression pattern suggests that the predicted human PKDREJ protein is a mammalian equivalent of the suREJ protein and therefore may have a central role in human fertilization [].; PDB: 2E7M_A 2YRL_A.
Probab=79.51  E-value=3.2  Score=41.59  Aligned_cols=69  Identities=17%  Similarity=0.236  Sum_probs=0.0

Q ss_pred             ccCcEEEEEEEEEECCCCCceEEEEEEEEEeeCCCCCCccCCCCCeeEEecCCCCCceEEEEEEEeC-CCC-eeEEEEee
Q psy6838         359 EDVHHYIFVVQAQDTGKPMMSSTVTVYFNVVDLNDNPPLFDPMPPNIPVYENVTLGTGLASVHAVDL-DSG-KLPFLLPI  436 (442)
Q Consensus       359 e~~~~~~l~V~a~D~~~~~~~~~~~v~I~V~dvNd~~P~f~~~~~~~~v~E~~~~g~~v~~v~a~D~-D~~-~~~y~i~~  436 (442)
                      +....|.|++.|+|.+..  ...+.+.+.+   |- ||.-.  ...+.=.+. ......+++.+.+- |.. +|+|++.-
T Consensus       270 ~~g~~Y~~~l~v~~~~~~--~~~a~~~~~~---n~-pP~~g--~c~v~P~~G-~al~T~Ft~~c~~~~d~~~pL~Y~f~~  340 (440)
T PF02010_consen  270 EPGSTYTFRLTVTDSSGS--SGSASISFTV---NA-PPSGG--SCSVSPSSG-YALETTFTITCSGWTDDDQPLTYQFYY  340 (440)
T ss_dssp             --------------------------------------------------------------------------------
T ss_pred             cccccccccccccccccc--cccccccccc---cc-ccccc--ccccccccc-ccccccccccccccccccccccccccc
Confidence            456789999999998662  2256666666   43 66421  111111111 12223556666665 444 66666544


No 32 
>PF05345 He_PIG:  Putative Ig domain;  InterPro: IPR008009 This alignment represents the conserved core region of a ~90 residue repeat found in several haemagglutinins and other cell surface proteins. Sequence similarities to Hyalin (IPR003410 from INTERPRO) and the PKD domain (IPR000601 from INTERPRO) suggest an Ig-like fold so this family may be similar in function to the (IPR003791 from INTERPRO) and (IPR003790 from INTERPRO) protein families.
Probab=78.79  E-value=11  Score=24.65  Aligned_cols=35  Identities=29%  Similarity=0.454  Sum_probs=20.6

Q ss_pred             CcEEEeCCccEEEEcccCCcc-cCcEEEEEEEEEeCCC
Q psy6838          96 EIFTINPSNGTLSNLKHLDRE-TKSSYNLVVMATDMAK  132 (442)
Q Consensus        96 ~~F~i~~~~g~l~~~~~lD~E-~~~~~~l~v~a~D~~~  132 (442)
                      ....+|+.+|.|.=.-  +.. ....|.++|+|+|..+
T Consensus        14 ~gLs~d~~tG~isGtp--~~~~~~G~y~~~vtatd~~G   49 (49)
T PF05345_consen   14 SGLSLDPSTGTISGTP--TSSVQPGTYTFTVTATDGSG   49 (49)
T ss_pred             CcEEEeCCCCEEEeec--CCCccccEEEEEEEEEcCCC
Confidence            3455555555554331  122 2258999999999754


No 33 
>KOG4221|consensus
Probab=77.02  E-value=1.3e+02  Score=33.79  Aligned_cols=59  Identities=17%  Similarity=0.219  Sum_probs=46.5

Q ss_pred             CCCCCeEEeCCccEEEEcccCCCccCcEEEEEEEEEECCCCCceEEEEEEEEEeeCCCCCCc
Q psy6838         336 WANDVFSINPQSGIFTLTAKLDYEDVHHYIFVVQAQDTGKPMMSSTVTVYFNVVDLNDNPPL  397 (442)
Q Consensus       336 ~~~~~f~id~~tG~l~~~~~ld~e~~~~~~l~V~a~D~~~~~~~~~~~v~I~V~dvNd~~P~  397 (442)
                      +.+.++.++.++-.+++.   +.+....|.+.|.|....+++.++...-.++..|+-+.||.
T Consensus       562 ~~~~~~~~~~n~~e~ti~---gL~k~TeY~~~vvA~N~~G~g~sS~~i~V~Tlsd~PsaPP~  620 (1381)
T KOG4221|consen  562 DTGKELRVENNATEYTIN---GLEKYTEYSIRVVAYNSAGSGVSSADITVRTLSDVPSAPPQ  620 (1381)
T ss_pred             CCCceEEEecCccEEEee---cCCCccceEEEEEEecCCCCCCCCCceEEEeccCCCCCCCc
Confidence            456788888877777765   34667789999999998887777666666788889888887


No 34 
>PF12245 Big_3_2:  Bacterial Ig-like domain (group 3);  InterPro: IPR022038  This family of proteins is found in bacteria. They have two conserved sequence motifs: AGN and GMT. 
Probab=58.47  E-value=45  Score=22.83  Aligned_cols=30  Identities=27%  Similarity=0.286  Sum_probs=21.9

Q ss_pred             CcEEEEEEEEEECCCCCceEEEEEEEEEeeCC
Q psy6838         361 VHHYIFVVQAQDTGKPMMSSTVTVYFNVVDLN  392 (442)
Q Consensus       361 ~~~~~l~V~a~D~~~~~~~~~~~v~I~V~dvN  392 (442)
                      .+.|++.++++|..+  ..+.....+.+.|..
T Consensus        22 dg~yt~~v~a~D~AG--N~~~~~~~~~i~d~~   51 (60)
T PF12245_consen   22 DGEYTLTVTATDKAG--NTSSSTTQIVIVDNT   51 (60)
T ss_pred             CccEEEEEEEEECCC--CEEEeeeEEEEEcCC
Confidence            568999999999877  455556666666654


No 35 
>PF13753 SWM_repeat:  Putative flagellar system-associated repeat
Probab=55.78  E-value=1.9e+02  Score=27.46  Aligned_cols=111  Identities=23%  Similarity=0.304  Sum_probs=52.1

Q ss_pred             cCcEEEEEEEEEeCCCCCCCCceeEEEEEEEEEeccCCCCeeccC---CeeEeeCCCCCCcEEEEEEEEeCCCCCCceEE
Q psy6838         117 TKSSYNLVVMATDMAKPPQKRLSSTTQVSIVIRDVNDNAPEFITA---NETTIPENSPLNTVVAALKAIDRDEGQNSYVE  193 (442)
Q Consensus       117 ~~~~~~l~v~a~D~~~~~~~~~s~~~~v~I~V~dvNd~~P~f~~~---~~~~v~E~~~~g~~v~~v~a~D~D~~~~~~~~  193 (442)
                      ....|.+.+.++|..|..     +.....+.|--   .+|.....   ....+.-............+.+.+.  ...+.
T Consensus        10 ~d~~~~v~vt~tD~aGN~-----~~~t~~~~vDt---~~P~v~i~~~~~~~~~~~~~~~~~~t~s~tvs~~~~--g~~v~   79 (317)
T PF13753_consen   10 ADGTYTVSVTVTDAAGNT-----STATQSITVDT---TAPTVTITSIADDDIINGDEATNTVTFSGTVSGAEP--GSTVT   79 (317)
T ss_pred             CCCcEEEEEEEEeCCCCe-----eeeeEEEEEec---CCCceeeecccCCCccccceeeeeeEEEEEecCCCC--CCEEE
Confidence            346789999999999853     33445554322   26644332   0001100001122233444444343  33355


Q ss_pred             EEEecCCCCCeEecCcceEEEEcccC-CcccccEEEEEEE-EEeCCCC
Q psy6838         194 YSIINSEATPFSLGPVDGLLRVAQKL-DREIKSNYTLEIK-AKDRGEP  239 (442)
Q Consensus       194 y~i~~~~~~~F~id~~tG~i~~~~~l-D~e~~~~~~l~V~-a~D~~~~  239 (442)
                      +.+ .+....+..+ ..|.....-.. +.-....|.+.+. ++|..+.
T Consensus        80 v~~-~g~~~t~~~~-~~G~ws~t~~~~~~l~~g~~ti~v~~~tD~aGN  125 (317)
T PF13753_consen   80 VTI-NGTTGTLTAD-ADGNWSVTVTPSDDLPDGDYTITVTTVTDAAGN  125 (317)
T ss_pred             EEE-CCEEEEEEEe-cCCcEEEeeccccccccCcceeEEEEEEccCCc
Confidence            555 2222233333 24432222111 1123458889998 9997753


No 36 
>cd02848 Chitinase_N_term Chitinase N-terminus domain. Chitinases hydrolyze the abundant natural biopolymer chitin, producing smaller chito-oligosaccharides. Chitin consists of multiple N-acetyl-D-glucosamine (NAG) residues connected via beta-1,4-glycosidic linkages and is an important structural element of fungal cell wall and arthropod exoskeletons. On the basis of the mode of chitin hydrolysis, chitinases are classified as random, endo-, and exo-chitinases and based on sequence criteria, chitinases belong to families 18 and 19 of glycosyl hydrolases.  The N-terminus of chitinase may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at  either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitob
Probab=55.03  E-value=25  Score=27.36  Aligned_cols=34  Identities=15%  Similarity=0.104  Sum_probs=23.1

Q ss_pred             cCCCccCcEEEEEEEEEECCCCCceEEEEEEEEEee
Q psy6838         355 KLDYEDVHHYIFVVQAQDTGKPMMSSTVTVYFNVVD  390 (442)
Q Consensus       355 ~ld~e~~~~~~l~V~a~D~~~~~~~~~~~v~I~V~d  390 (442)
                      .+++.+.+.|.++|++.|..+  -+.++.+.|.|.|
T Consensus        73 t~~v~kgG~y~m~V~lCn~dG--CS~S~~~~I~VAD  106 (106)
T cd02848          73 TFKVGKGGRYQMQVALCNGDG--CSTSAAKEIVVAD  106 (106)
T ss_pred             EEEeCCCCeEEEEEEEECCCC--ccCcCCEEEEecC
Confidence            455667789999999999877  3444455555543


No 37 
>cd00146 PKD polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.
Probab=54.73  E-value=30  Score=24.98  Aligned_cols=31  Identities=23%  Similarity=0.175  Sum_probs=22.0

Q ss_pred             CCCccCcEEEEEEEEEECCCCCceEEEEEEEEE
Q psy6838         356 LDYEDVHHYIFVVQAQDTGKPMMSSTVTVYFNV  388 (442)
Q Consensus       356 ld~e~~~~~~l~V~a~D~~~~~~~~~~~v~I~V  388 (442)
                      ..|...+.|.++++++|..+  .+...++.|.|
T Consensus        51 ~~y~~~G~y~v~l~v~d~~g--~~~~~~~~V~V   81 (81)
T cd00146          51 HTYTKPGTYTVTLTVTNAVG--SSSTKTTTVVV   81 (81)
T ss_pred             EEcCCCcEEEEEEEEEeCCC--CEEEEEEEEEC
Confidence            44678899999999999864  34444555543


No 38 
>TIGR00864 PCC polycystin cation channel protein. Note: this model has been restricted to the amino half because for technical reasons.
Probab=54.72  E-value=5.3e+02  Score=32.38  Aligned_cols=194  Identities=16%  Similarity=0.215  Sum_probs=88.1

Q ss_pred             CCCcEEEEEEeeeCCCCCCcEEEEEEEeCCCCCcEEEcCCccE-EEEeeecccccccCCCcEEEEEEeCCCCCcEEEeCC
Q psy6838          25 PRGSRVGQVFTTDADEGVNGVVTYSVVSDWANDVFSINPQSGI-FTLTAKLDYEDVSDNGRLEYSIVSGDDEEIFTINPS  103 (442)
Q Consensus        25 ~~g~~v~~~~a~D~D~~~~~~~~y~l~~~~~~~~f~id~~tg~-~~~~~~ld~~~~~~~~~i~y~i~~~~~~~~F~i~~~  103 (442)
                      .+|+....++|++..........-......+...|..+...|. +.+.....+.+...+..+.|...-|++..      .
T Consensus      1398 ~pGtYTVTLTVtN~~gs~T~s~~I~V~~Pv~~~~ft~ngT~g~~ltlg~pv~Ftd~StGt~~Sy~WdFGDGst------s 1471 (2740)
T TIGR00864      1398 DPGCYLVTVAASNNISAANDSALIEVLEPVGATSFKHNGSHGNNLELGQPYLFSAFGRARNASYLWDFGDGGL------L 1471 (2740)
T ss_pred             CCceEEEEEEEECCCcceeeeeEEEEecCCCcceeeecCCcccccccCcceEEeecCCCCceEEEEECCCCCc------c
Confidence            3566666777765432111111111221111223444443332 22222222333445566777777665421      0


Q ss_pred             ccEEEEcccCCcccCcEEEEEEEEEeCCCCCCCCceeEEEEEEEEEeccCCCCeeccC-CeeEeeCCCCCCcEEEEEEEE
Q psy6838         104 NGTLSNLKHLDRETKSSYNLVVMATDMAKPPQKRLSSTTQVSIVIRDVNDNAPEFITA-NETTIPENSPLNTVVAALKAI  182 (442)
Q Consensus       104 ~g~l~~~~~lD~E~~~~~~l~v~a~D~~~~~~~~~s~~~~v~I~V~dvNd~~P~f~~~-~~~~v~E~~~~g~~v~~v~a~  182 (442)
                      ++.   .-..-|.....|++++.|++..|.      .+.++.|+|..-   ..-|... ....    .+.+.. ..+.+ 
T Consensus      1472 t~q---nptHtY~~~GtYtVtLTvtN~~Gs------st~T~~VtV~~p---V~~~tinas~~~----vpl~~s-V~Fta- 1533 (2740)
T TIGR00864      1472 EGP---EILHAFNSPGDFNIRLAAANEVGK------NEATLNVAVKAR---VRGLTINASLTN----VPLNGS-VHFEA- 1533 (2740)
T ss_pred             Ccc---ceEEEcCCCceEEEEEEEECCCCc------eEEEEEEEEecc---ccceEEcCCCcc----ccccce-EEEEE-
Confidence            111   111236778899999999998873      344556666431   1111111 0111    111111 12222 


Q ss_pred             eCCCCCCceEEEEEecCCC-CCeEecCcceEEEEcccCCcccccEEEEEEEEEeCCCCCCceeEEEEEEEEeC
Q psy6838         183 DRDEGQNSYVEYSIINSEA-TPFSLGPVDGLLRVAQKLDREIKSNYTLEIKAKDRGEPSKSSRTKFFIRLLDE  254 (442)
Q Consensus       183 D~D~~~~~~~~y~i~~~~~-~~F~id~~tG~i~~~~~lD~e~~~~~~l~V~a~D~~~~~~~~~~~v~I~V~d~  254 (442)
                      +.+.|..  ++|+-.=++. ..+.=++ + .     .-.|.....|++++.|++..+   +...++.|.|.+.
T Consensus      1534 ~~s~Gs~--v~ysWdFGDg~ts~~~np-t-~-----~yTY~spGtYtVtLTvtN~~G---s~~~T~~i~V~~~ 1594 (2740)
T TIGR00864      1534 HLDAGDD--VRFSWILCDHCTPIFGGN-T-I-----FYTFRSVGTFNIIVTAENDVG---AAQASIFLFVLQE 1594 (2740)
T ss_pred             EccCCCc--eeEEEEeCCCCccccCCC-c-e-----EEeecCCceEEEEEEEecCCC---ccceeEEEEEeec
Confidence            2333333  3444333321 1111011 0 0     123567899999999999864   3355566666653


No 39 
>PF12245 Big_3_2:  Bacterial Ig-like domain (group 3);  InterPro: IPR022038  This family of proteins is found in bacteria. They have two conserved sequence motifs: AGN and GMT. 
Probab=52.79  E-value=59  Score=22.24  Aligned_cols=30  Identities=40%  Similarity=0.491  Sum_probs=21.4

Q ss_pred             CcEEEEEEEEEeCCCCCCCCceeEEEEEEEEEecc
Q psy6838         118 KSSYNLVVMATDMAKPPQKRLSSTTQVSIVIRDVN  152 (442)
Q Consensus       118 ~~~~~l~v~a~D~~~~~~~~~s~~~~v~I~V~dvN  152 (442)
                      ...|+|.++|+|..|.     .........+.|..
T Consensus        22 dg~yt~~v~a~D~AGN-----~~~~~~~~~i~d~~   51 (60)
T PF12245_consen   22 DGEYTLTVTATDKAGN-----TSSSTTQIVIVDNT   51 (60)
T ss_pred             CccEEEEEEEEECCCC-----EEEeeeEEEEEcCC
Confidence            5689999999999984     44555555555543


No 40 
>smart00089 PKD Repeats in polycystic kidney disease 1 (PKD1) and other proteins. Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.
Probab=51.66  E-value=36  Score=24.38  Aligned_cols=31  Identities=19%  Similarity=0.224  Sum_probs=23.5

Q ss_pred             cCCCccCcEEEEEEEEEECCCCCceEEEEEEEEE
Q psy6838         355 KLDYEDVHHYIFVVQAQDTGKPMMSSTVTVYFNV  388 (442)
Q Consensus       355 ~ld~e~~~~~~l~V~a~D~~~~~~~~~~~v~I~V  388 (442)
                      ...|...+.|.+++.++|..+   +.++++.|.|
T Consensus        48 ~~~y~~~G~y~v~l~v~n~~g---~~~~~~~i~v   78 (79)
T smart00089       48 THTYTKPGTYTVTLTVTNAVG---SASATVTVVV   78 (79)
T ss_pred             EEEeCCCcEEEEEEEEEcCCC---cEEEEEEEEE
Confidence            345677889999999999766   5566666665


No 41 
>PF13754 Big_3_4:  Bacterial Ig-like domain (group 3)
Probab=49.04  E-value=63  Score=21.53  Aligned_cols=16  Identities=25%  Similarity=0.289  Sum_probs=13.5

Q ss_pred             cCcEEEEEEEEEECCC
Q psy6838         360 DVHHYIFVVQAQDTGK  375 (442)
Q Consensus       360 ~~~~~~l~V~a~D~~~  375 (442)
                      ..+.|.++|+|+|..+
T Consensus        22 ~dG~y~itv~a~D~AG   37 (54)
T PF13754_consen   22 ADGTYTITVTATDAAG   37 (54)
T ss_pred             CCccEEEEEEEEeCCC
Confidence            4678999999999765


No 42 
>COG2706 3-carboxymuconate cyclase [Carbohydrate transport and metabolism]
Probab=47.65  E-value=1.5e+02  Score=28.43  Aligned_cols=34  Identities=24%  Similarity=0.381  Sum_probs=22.4

Q ss_pred             CcEEEEEEEeCCC-CCcEEEcCCccEEEEeeeccc
Q psy6838          43 NGVVTYSVVSDWA-NDVFSINPQSGIFTLTAKLDY   76 (442)
Q Consensus        43 ~~~~~y~l~~~~~-~~~f~id~~tg~~~~~~~ld~   76 (442)
                      .|.+.|-...+.. -.-|+|++.+|.|.+...-.-
T Consensus       254 dGrFLYasNRg~dsI~~f~V~~~~g~L~~~~~~~t  288 (346)
T COG2706         254 DGRFLYASNRGHDSIAVFSVDPDGGKLELVGITPT  288 (346)
T ss_pred             CCCEEEEecCCCCeEEEEEEcCCCCEEEEEEEecc
Confidence            4556665554432 256999999999887765433


No 43 
>PF02494 HYR:  HYR domain;  InterPro: IPR003410 This domain is known as the HYR (Hyalin Repeat) domain, after the protein hyalin that is composed exclusively of this repeat. This domain probably corresponds to a new superfamily in the immunoglobulin fold. The function of this domain is uncertain it may be involved in cell adhesion. In the Sushi repeat-containing protein (SrpX), this domain is found between two sushi repeats.
Probab=46.82  E-value=42  Score=24.43  Aligned_cols=25  Identities=20%  Similarity=0.125  Sum_probs=21.0

Q ss_pred             cEEEEEEEEEECCCCCceEEEEEEEEE
Q psy6838         362 HHYIFVVQAQDTGKPMMSSTVTVYFNV  388 (442)
Q Consensus       362 ~~~~l~V~a~D~~~~~~~~~~~v~I~V  388 (442)
                      +.+.++.+|+|..+  ..+++++.|+|
T Consensus        57 G~t~V~ytA~D~~G--N~a~C~f~V~V   81 (81)
T PF02494_consen   57 GTTTVTYTATDAAG--NSATCSFTVTV   81 (81)
T ss_pred             ceEEEEEEEEECCC--CEEEEEEEEEC
Confidence            47899999999866  67899998875


No 44 
>PF08329 ChitinaseA_N:  Chitinase A, N-terminal domain;  InterPro: IPR013540 This domain is found in a number of bacterial chitinases and similar viral proteins. It is organised into a fibronectin III module domain-like fold, comprising only beta strands. Its function is not known, but it may be involved in interaction with the enzyme substrate, chitin [, ]. It is separated by a hinge region from the catalytic domain (IPR001223 from INTERPRO); this hinge region is probably mobile, allowing the N-terminal domain to have different relative positions in solution []. ; GO: 0004568 chitinase activity; PDB: 2WLY_A 1EDQ_A 2WM0_A 1X6N_A 1NH6_A 2WK2_A 1EHN_A 2WLZ_A 1EIB_A 1FFR_A ....
Probab=45.60  E-value=72  Score=26.02  Aligned_cols=51  Identities=18%  Similarity=0.144  Sum_probs=27.0

Q ss_pred             cCCCccCcEEEEEEEEEECCCCCceEEEEEEEEEeeCCCCCCccCCCCCeeEEecCC
Q psy6838         355 KLDYEDVHHYIFVVQAQDTGKPMMSSTVTVYFNVVDLNDNPPLFDPMPPNIPVYENV  411 (442)
Q Consensus       355 ~ld~e~~~~~~l~V~a~D~~~~~~~~~~~v~I~V~dvNd~~P~f~~~~~~~~v~E~~  411 (442)
                      .+.+...+.|+++|+++|..+  -+....+.|.|.|..-  -.+.  +....+.||-
T Consensus        76 ~~~~~~gG~y~~~VeLCN~~G--CS~S~~~~V~VaDTDG--sHl~--pL~~~~~~nN  126 (133)
T PF08329_consen   76 TFTVTKGGRYQMQVELCNADG--CSTSAPVEVVVADTDG--SHLA--PLPYNWDENN  126 (133)
T ss_dssp             EEEE-S-EEEEEEEEEEETTE--EEE---EEEEEE-TTS--TTS----------TTS
T ss_pred             EEEecCCCEEEEEEEEECCCC--cccCCCEEEEEeCCCc--cccc--cccCcccccC
Confidence            455556789999999999877  6677789999999843  2222  2334566653


No 45 
>KOG4221|consensus
Probab=41.47  E-value=5.9e+02  Score=29.10  Aligned_cols=137  Identities=14%  Similarity=0.151  Sum_probs=76.5

Q ss_pred             ccEEEEcccCCcccCcEEEEEEEEEeCCCCCCCCceeEEEEEEEEEeccCCCCeeccCCeeEeeCCCCCCcEEEEEEEEe
Q psy6838         104 NGTLSNLKHLDRETKSSYNLVVMATDMAKPPQKRLSSTTQVSIVIRDVNDNAPEFITANETTIPENSPLNTVVAALKAID  183 (442)
Q Consensus       104 ~g~l~~~~~lD~E~~~~~~l~v~a~D~~~~~~~~~s~~~~v~I~V~dvNd~~P~f~~~~~~~v~E~~~~g~~v~~v~a~D  183 (442)
                      .|.+.....|-.  ...|.+.++|....|..    .+++.+.|.-.  -| +|...+.+        ..+..-..+...-
T Consensus       481 ~g~~~tv~nl~p--~t~Y~~rv~A~n~~g~g----~sS~pLkV~t~--pE-gp~~~~a~--------ats~~ti~v~Wep  543 (1381)
T KOG4221|consen  481 PGIQVTVQNLSP--LTMYFFRVRAKNEAGSG----ESSAPLKVTTQ--PE-GPVQLQAY--------ATSPTTILVTWEP  543 (1381)
T ss_pred             CceEEEeeeccc--ceeEEEEEeccCcccCC----ccCCceEEecC--CC-CCcccccc--------ccCcceEEEEecC
Confidence            453333444433  35799999998877642    34455555322  22 22221111        1111122233222


Q ss_pred             CCCCCCceEEEEEec---CCCCCeEecCcceEEEEcccCCcccccEEEEEEEEEeCCCCCCceeEEEEEEEEeCCCCCCe
Q psy6838         184 RDEGQNSYVEYSIIN---SEATPFSLGPVDGLLRVAQKLDREIKSNYTLEIKAKDRGEPSKSSRTKFFIRLLDENDNNPI  260 (442)
Q Consensus       184 ~D~~~~~~~~y~i~~---~~~~~F~id~~tG~i~~~~~lD~e~~~~~~l~V~a~D~~~~~~~~~~~v~I~V~d~nd~~P~  260 (442)
                      +-.+......|++.-   +...++.++..+-++++.   +.|....|.+.|.|.+..+++.++..--.++..|+-+.||.
T Consensus       544 P~~~n~~I~~yk~~ys~~~~~~~~~~~~n~~e~ti~---gL~k~TeY~~~vvA~N~~G~g~sS~~i~V~Tlsd~PsaPP~  620 (1381)
T KOG4221|consen  544 PPFGNGPITGYKLFYSEDDTGKELRVENNATEYTIN---GLEKYTEYSIRVVAYNSAGSGVSSADITVRTLSDVPSAPPQ  620 (1381)
T ss_pred             CCCCCCCceEEEEEEEcCCCCceEEEecCccEEEee---cCCCccceEEEEEEecCCCCCCCCCceEEEeccCCCCCCCc
Confidence            222222223343322   234678888777777665   44667889999999999888877666556777788887775


No 46 
>cd00146 PKD polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.
Probab=40.66  E-value=67  Score=23.07  Aligned_cols=29  Identities=21%  Similarity=0.310  Sum_probs=20.5

Q ss_pred             CcccCcEEEEEEEEEeCCCCCCCCceeEEEEEEE
Q psy6838         114 DRETKSSYNLVVMATDMAKPPQKRLSSTTQVSIV  147 (442)
Q Consensus       114 D~E~~~~~~l~v~a~D~~~~~~~~~s~~~~v~I~  147 (442)
                      -|.....|.+++.++|..+     .+.+..+.|.
T Consensus        52 ~y~~~G~y~v~l~v~d~~g-----~~~~~~~~V~   80 (81)
T cd00146          52 TYTKPGTYTVTLTVTNAVG-----SSSTKTTTVV   80 (81)
T ss_pred             EcCCCcEEEEEEEEEeCCC-----CEEEEEEEEE
Confidence            3667789999999999865     2444455554


No 47 
>smart00089 PKD Repeats in polycystic kidney disease 1 (PKD1) and other proteins. Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.
Probab=38.03  E-value=82  Score=22.46  Aligned_cols=30  Identities=10%  Similarity=0.131  Sum_probs=22.3

Q ss_pred             CCcccccEEEEEEEEEeCCCCCCceeEEEEEEE
Q psy6838         219 LDREIKSNYTLEIKAKDRGEPSKSSRTKFFIRL  251 (442)
Q Consensus       219 lD~e~~~~~~l~V~a~D~~~~~~~~~~~v~I~V  251 (442)
                      .-|.....|.+++.+.|..+   +...+++|.|
T Consensus        49 ~~y~~~G~y~v~l~v~n~~g---~~~~~~~i~v   78 (79)
T smart00089       49 HTYTKPGTYTVTLTVTNAVG---SASATVTVVV   78 (79)
T ss_pred             EEeCCCcEEEEEEEEEcCCC---cEEEEEEEEE
Confidence            34667889999999999865   5566666655


No 48 
>PF03160 Calx-beta:  Calx-beta domain;  InterPro: IPR003644 The calx-beta motif is present as a tandem repeat in the cytoplasmic domains of Calx Na-Ca exchangers, which are used to expel calcium from cells. This motif overlaps domains used for calcium binding and regulation. The calx-beta motif is also present in the cytoplasmic tail of mammalian integrin-beta4, which mediates the bi-directional transfer of signals across the plasma membrane, as well as in some cyanobacterial proteins. This motif contains a series of beta-strands and turns that form a self-contained beta-sheet [, ].; GO: 0007154 cell communication, 0016021 integral to membrane; PDB: 3H6A_B 3FSO_A 3FQ4_B 2DPK_A 2QVM_A 3GIN_B 2QVK_A 2FWU_A 2FWS_A 3E9U_A ....
Probab=36.59  E-value=1.8e+02  Score=21.82  Aligned_cols=24  Identities=29%  Similarity=0.364  Sum_probs=12.1

Q ss_pred             EEEEEeCCCCCCeeeccceeeecccc
Q psy6838         248 FIRLLDENDNNPIFDSKSYSASVPEN  273 (442)
Q Consensus       248 ~I~V~d~nd~~P~f~~~~~~~~v~e~  273 (442)
                      +|+|.| ||.+ .+.-......+.|+
T Consensus         2 tvtI~d-~d~~-~v~f~~~~~~v~E~   25 (100)
T PF03160_consen    2 TVTILD-DDDP-TVSFSSPSYTVSEG   25 (100)
T ss_dssp             EEEEE--TTSE-EEEESSSEEEEETT
T ss_pred             EEEEEC-CCCC-EEEEeCCEEEEEeC
Confidence            466677 5544 65544444455554


No 49 
>PF02494 HYR:  HYR domain;  InterPro: IPR003410 This domain is known as the HYR (Hyalin Repeat) domain, after the protein hyalin that is composed exclusively of this repeat. This domain probably corresponds to a new superfamily in the immunoglobulin fold. The function of this domain is uncertain it may be involved in cell adhesion. In the Sushi repeat-containing protein (SrpX), this domain is found between two sushi repeats.
Probab=32.32  E-value=95  Score=22.50  Aligned_cols=24  Identities=25%  Similarity=0.289  Sum_probs=20.2

Q ss_pred             EEEEEEEEEeCCCCCCCCceeEEEEEEEE
Q psy6838         120 SYNLVVMATDMAKPPQKRLSSTTQVSIVI  148 (442)
Q Consensus       120 ~~~l~v~a~D~~~~~~~~~s~~~~v~I~V  148 (442)
                      .|.++..|+|..|     .++++.+.|+|
T Consensus        58 ~t~V~ytA~D~~G-----N~a~C~f~V~V   81 (81)
T PF02494_consen   58 TTTVTYTATDAAG-----NSATCSFTVTV   81 (81)
T ss_pred             eEEEEEEEEECCC-----CEEEEEEEEEC
Confidence            6899999999877     47788888875


No 50 
>cd02848 Chitinase_N_term Chitinase N-terminus domain. Chitinases hydrolyze the abundant natural biopolymer chitin, producing smaller chito-oligosaccharides. Chitin consists of multiple N-acetyl-D-glucosamine (NAG) residues connected via beta-1,4-glycosidic linkages and is an important structural element of fungal cell wall and arthropod exoskeletons. On the basis of the mode of chitin hydrolysis, chitinases are classified as random, endo-, and exo-chitinases and based on sequence criteria, chitinases belong to families 18 and 19 of glycosyl hydrolases.  The N-terminus of chitinase may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at  either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitob
Probab=31.73  E-value=1.1e+02  Score=23.90  Aligned_cols=34  Identities=18%  Similarity=0.220  Sum_probs=24.0

Q ss_pred             cCCcccCcEEEEEEEEEeCCCCCCCCceeEEEEEEEEEe
Q psy6838         112 HLDRETKSSYNLVVMATDMAKPPQKRLSSTTQVSIVIRD  150 (442)
Q Consensus       112 ~lD~E~~~~~~l~v~a~D~~~~~~~~~s~~~~v~I~V~d  150 (442)
                      .+++.+...|.++|++.|..|.     +.+..+.|.|.|
T Consensus        73 t~~v~kgG~y~m~V~lCn~dGC-----S~S~~~~I~VAD  106 (106)
T cd02848          73 TFKVGKGGRYQMQVALCNGDGC-----STSAAKEIVVAD  106 (106)
T ss_pred             EEEeCCCCeEEEEEEEECCCCc-----cCcCCEEEEecC
Confidence            3556677899999999999874     444555565543


No 51 
>PF13753 SWM_repeat:  Putative flagellar system-associated repeat
Probab=27.07  E-value=5.4e+02  Score=24.28  Aligned_cols=26  Identities=15%  Similarity=0.254  Sum_probs=17.4

Q ss_pred             cccEEEEEEEEEeCCCCCCceeEEEEEE
Q psy6838         223 IKSNYTLEIKAKDRGEPSKSSRTKFFIR  250 (442)
Q Consensus       223 ~~~~~~l~V~a~D~~~~~~~~~~~v~I~  250 (442)
                      ....|.+.+.++|..+...  .....+.
T Consensus        10 ~d~~~~v~vt~tD~aGN~~--~~t~~~~   35 (317)
T PF13753_consen   10 ADGTYTVSVTVTDAAGNTS--TATQSIT   35 (317)
T ss_pred             CCCcEEEEEEEEeCCCCee--eeeEEEE
Confidence            3567899999999875333  3444554


No 52 
>PF03174 CHB_HEX_C:  Chitobiase/beta-hexosaminidase C-terminal domain;  InterPro: IPR004867 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. This short domain is found in members of the glycoside hydrolase family 20 (GH20 from CAZY) and represents the C-terminal domain in chitobiases and beta-hexosaminidases 3.2.1.52 from EC. It is composed of a beta sandwich structure []. The function of this domain is unknown. ; GO: 0004563 beta-N-acetylhexosaminidase activity, 0005975 carbohydrate metabolic process; PDB: 1C7T_A 1QBA_A 1QBB_A 1C7S_A.
Probab=24.29  E-value=1.6e+02  Score=21.03  Aligned_cols=58  Identities=7%  Similarity=0.089  Sum_probs=24.5

Q ss_pred             cEEEEeeecccccccCCCcEEEEEEeCCCCCcEEEeCCccEEEEcccCCcccCcEEEEEEEEEeCCCC
Q psy6838          66 GIFTLTAKLDYEDVSDNGRLEYSIVSGDDEEIFTINPSNGTLSNLKHLDRETKSSYNLVVMATDMAKP  133 (442)
Q Consensus        66 g~~~~~~~ld~~~~~~~~~i~y~i~~~~~~~~F~i~~~~g~l~~~~~lD~E~~~~~~l~v~a~D~~~~  133 (442)
                      |.......+......++..|.|.+.+..+.        ........+|-..+...  |+++|.+.++.
T Consensus        10 G~~~~~~~v~l~~~~~~~~I~YT~DGs~Pt--------~~s~~Y~~Pi~i~~~~~--vka~a~~~~g~   67 (75)
T PF03174_consen   10 GTYDEGQTVTLSSDTPGATIYYTLDGSEPT--------KSSPLYTGPITIPESGT--VKARAFDNGGN   67 (75)
T ss_dssp             EEEEETTEEEEE-SSTTSEEEEESSSSSS------------EE--CCC--B--S----EEEEE-TTS-
T ss_pred             CcEecCeEEEEEeCCCCCEEEEEcCCCccc--------ccCcccCcCEEeCCCcE--EEEEEEcCCCC
Confidence            333333333333334677899999765541        11112233444433333  88888887753


No 53 
>PF08329 ChitinaseA_N:  Chitinase A, N-terminal domain;  InterPro: IPR013540 This domain is found in a number of bacterial chitinases and similar viral proteins. It is organised into a fibronectin III module domain-like fold, comprising only beta strands. Its function is not known, but it may be involved in interaction with the enzyme substrate, chitin [, ]. It is separated by a hinge region from the catalytic domain (IPR001223 from INTERPRO); this hinge region is probably mobile, allowing the N-terminal domain to have different relative positions in solution []. ; GO: 0004568 chitinase activity; PDB: 2WLY_A 1EDQ_A 2WM0_A 1X6N_A 1NH6_A 2WK2_A 1EHN_A 2WLZ_A 1EIB_A 1FFR_A ....
Probab=23.89  E-value=2.8e+02  Score=22.64  Aligned_cols=36  Identities=17%  Similarity=0.298  Sum_probs=23.7

Q ss_pred             cCCcccCcEEEEEEEEEeCCCCCCCCceeEEEEEEEEEecc
Q psy6838         112 HLDRETKSSYNLVVMATDMAKPPQKRLSSTTQVSIVIRDVN  152 (442)
Q Consensus       112 ~lD~E~~~~~~l~v~a~D~~~~~~~~~s~~~~v~I~V~dvN  152 (442)
                      .+.......|.++|++.+..|     -+.+..+.|.|.|-+
T Consensus        76 ~~~~~~gG~y~~~VeLCN~~G-----CS~S~~~~V~VaDTD  111 (133)
T PF08329_consen   76 TFTVTKGGRYQMQVELCNADG-----CSTSAPVEVVVADTD  111 (133)
T ss_dssp             EEEE-S-EEEEEEEEEEETTE-----EEE---EEEEEE-TT
T ss_pred             EEEecCCCEEEEEEEEECCCC-----cccCCCEEEEEeCCC
Confidence            455566788999999999887     366678888888864


No 54 
>PF13290 CHB_HEX_C_1:  Chitobiase/beta-hexosaminidase C-terminal domain
Probab=22.55  E-value=2.8e+02  Score=19.44  Aligned_cols=44  Identities=7%  Similarity=0.016  Sum_probs=23.0

Q ss_pred             cCCCcEEEEEEeCCCCCcEEEeCCccEEEEcccCCcccCcEEEEEEEEEeCCCC
Q psy6838          80 SDNGRLEYSIVSGDDEEIFTINPSNGTLSNLKHLDRETKSSYNLVVMATDMAKP  133 (442)
Q Consensus        80 ~~~~~i~y~i~~~~~~~~F~i~~~~g~l~~~~~lD~E~~~~~~l~v~a~D~~~~  133 (442)
                      ..+..|.|.+.+..+...       .. ...++|-.  ....+++.+|.|.++.
T Consensus        18 ~~~~~IyYT~DGs~Pt~~-------S~-~Y~~Pi~i--~~~ttVka~a~~~~g~   61 (67)
T PF13290_consen   18 DPDATIYYTTDGSEPTPS-------SP-LYTGPITI--TGTTTVKARAFDPDGN   61 (67)
T ss_pred             CCCCEEEEEcCCCccccC-------CC-eeccCEEe--cCCEEEEEEEEcCCCc
Confidence            356788888875543210       00 01122211  1346788888887763


No 55 
>TIGR03769 P_ac_wall_RPT actinobacterial surface-anchored protein domain. This model describes a repeat domain that one to three times in Actinobacterial proteins, some of which have LPXTG-type sortase recognition motifs for covalent attachment to the Gram-positive cell wall. Where it occurs with duplication in an LPXTG-anchored protein, it tends to be adjacent to the substrate-binding protein of the gene trio of an ABC transporter system, where that substrate-binding protein has a single copy of this same domain. This arrangement suggests a substrate-binding relay system, with the LPXTG protein acting as a substrate receptor.
Probab=20.13  E-value=2e+02  Score=18.01  Aligned_cols=14  Identities=29%  Similarity=0.470  Sum_probs=10.7

Q ss_pred             CccCcEEEEEEEEE
Q psy6838         358 YEDVHHYIFVVQAQ  371 (442)
Q Consensus       358 ~e~~~~~~l~V~a~  371 (442)
                      |.+.+.|.|+++++
T Consensus         9 FT~PG~Y~l~~~a~   22 (41)
T TIGR03769         9 FTKPGTYTLTVQAT   22 (41)
T ss_pred             eCCCeEEEEEEEEE
Confidence            45677898888885


Done!