Query         psy8629
Match_columns 483
No_of_seqs    287 out of 2003
Neff          8.9 
Searched_HMMs 46136
Date          Fri Aug 16 23:39:25 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy8629.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/8629hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG4289|consensus              100.0 1.3E-70 2.8E-75  573.8  39.4  426   12-483   161-616 (2531)
  2 KOG4289|consensus              100.0 6.5E-70 1.4E-74  568.6  40.4  419   12-478   266-920 (2531)
  3 KOG1219|consensus              100.0 8.2E-61 1.8E-65  514.2  48.2  447   12-478  2175-2812(4289)
  4 KOG1219|consensus              100.0 4.6E-60   1E-64  508.5  45.0  416   18-482   643-1095(4289)
  5 cd00031 CA Cadherin repeat dom 100.0 3.7E-29 7.9E-34  231.0  29.7  198  156-439     2-199 (199)
  6 cd00031 CA Cadherin repeat dom 100.0 5.4E-28 1.2E-32  223.2  26.4  196   19-255     1-199 (199)
  7 PF00028 Cadherin:  Cadherin do  99.7 3.5E-16 7.7E-21  125.8  13.8   89  348-438     4-93  (93)
  8 PF00028 Cadherin:  Cadherin do  99.7 5.9E-16 1.3E-20  124.5  15.0   92  156-254     1-93  (93)
  9 KOG1834|consensus               99.7 4.4E-15 9.6E-20  148.9  20.6  212  141-439    23-244 (952)
 10 smart00112 CA Cadherin repeats  99.7 1.1E-15 2.5E-20  118.9  11.2   79  365-445     1-79  (79)
 11 smart00112 CA Cadherin repeats  99.6 2.1E-14 4.6E-19  111.7  10.9   79  176-261     1-79  (79)
 12 KOG1834|consensus               99.5 8.3E-13 1.8E-17  132.8  19.3  204   15-256    32-245 (952)
 13 PF08266 Cadherin_2:  Cadherin-  98.3 1.4E-06 3.1E-11   67.6   6.8   62   19-84      2-68  (84)
 14 PF08758 Cadherin_pro:  Cadheri  97.4  0.0016 3.5E-08   51.2   9.0   64   12-83      3-67  (90)
 15 smart00736 CADG Dystroglycan-t  96.6   0.054 1.2E-06   43.4  11.6   72  364-442    24-96  (97)
 16 PF08266 Cadherin_2:  Cadherin-  96.5  0.0018 3.8E-08   50.4   2.4   60  348-410     6-67  (84)
 17 TIGR01965 VCBS_repeat VCBS rep  96.3   0.038 8.3E-07   44.1   9.0   92  171-276     2-98  (99)
 18 TIGR01965 VCBS_repeat VCBS rep  96.1   0.079 1.7E-06   42.3   9.7   89  360-460     2-98  (99)
 19 PF08758 Cadherin_pro:  Cadheri  96.0     0.1 2.2E-06   41.1   9.8   79  148-241     3-81  (90)
 20 smart00736 CADG Dystroglycan-t  95.3    0.29 6.4E-06   39.1  10.4   70  175-258    24-96  (97)
 21 PF13750 Big_3_3:  Bacterial Ig  94.3     3.5 7.5E-05   36.2  16.1   82  171-257    69-151 (158)
 22 KOG3597|consensus               93.8     6.1 0.00013   40.5  18.0  154  133-304    24-194 (442)
 23 TIGR00845 caca sodium/calcium   91.9      15 0.00033   41.3  19.1  154  144-304   395-568 (928)
 24 PF05345 He_PIG:  Putative Ig d  89.1     2.1 4.6E-05   29.4   6.2   39  383-423     9-48  (49)
 25 TIGR00845 caca sodium/calcium   83.5      20 0.00044   40.3  13.4   47  256-304   395-441 (928)
 26 TIGR03660 T1SS_rpt_143 T1SS-14  81.9      20 0.00044   30.6  10.1   69  197-275    56-127 (137)
 27 PF07495 Y_Y_Y:  Y_Y_Y domain;   79.3      10 0.00022   27.4   6.7   61  371-438     6-66  (66)
 28 COG2706 3-carboxymuconate cycl  75.3      90  0.0019   30.9  26.9   75  250-330   214-288 (346)
 29 KOG3597|consensus               70.5      38 0.00083   34.9  10.2   63  245-308    24-86  (442)
 30 TIGR03660 T1SS_rpt_143 T1SS-14  69.7      23 0.00049   30.3   7.2   44  411-460    85-128 (137)
 31 PF06586 TraK:  TraK protein;    52.3   2E+02  0.0043   26.8  11.7   95   18-140    19-119 (234)
 32 PF05345 He_PIG:  Putative Ig d  48.6      85  0.0018   21.4   6.5   45   49-95      2-47  (49)
 33 PF03160 Calx-beta:  Calx-beta   48.1 1.3E+02  0.0029   23.5  10.8   51  250-304     2-52  (100)
 34 PF07495 Y_Y_Y:  Y_Y_Y domain;   46.5      39 0.00085   24.2   4.2   30  225-254    37-66  (66)
 35 TIGR00864 PCC polycystin catio  43.8 8.5E+02   0.019   31.6  34.5  189  222-439  1563-1779(2740)
 36 cd00146 PKD polycystic kidney   42.7      56  0.0012   24.5   4.7   30  405-436    51-80  (81)
 37 smart00089 PKD Repeats in poly  42.4      59  0.0013   24.2   4.8   31  404-437    48-78  (79)
 38 PF12245 Big_3_2:  Bacterial Ig  40.8 1.1E+02  0.0024   21.8   5.7   31  409-441    21-51  (60)
 39 PF03160 Calx-beta:  Calx-beta   39.7 1.8E+02   0.004   22.7  10.1   53  138-194     2-54  (100)
 40 PF12245 Big_3_2:  Bacterial Ig  39.1 1.2E+02  0.0027   21.6   5.7   30  226-257    22-51  (60)
 41 PF13956 Ibs_toxin:  Toxin Ibs,  34.7      25 0.00054   18.5   1.0   16    1-16      2-17  (19)
 42 cd00146 PKD polycystic kidney   34.1      92   0.002   23.2   4.7   29  222-252    52-80  (81)
 43 PF12988 DUF3872:  Domain of un  33.9 2.8E+02  0.0061   23.6   7.7   42   24-75     40-83  (137)
 44 smart00089 PKD Repeats in poly  33.8      99  0.0021   22.9   4.9   29  222-253    50-78  (79)
 45 PF10365 DUF2436:  Domain of un  32.3 3.2E+02  0.0069   23.3   9.4   90  141-240    63-157 (161)
 46 PF13754 Big_3_4:  Bacterial Ig  30.8 1.7E+02  0.0036   20.3   5.2   16  409-424    22-37  (54)
 47 PF02494 HYR:  HYR domain;  Int  24.3 1.5E+02  0.0032   22.3   4.4   26  410-437    56-81  (81)
 48 PF09100 Qn_am_d_aIV:  Quinohem  22.0   2E+02  0.0043   24.1   4.7   31  414-445   101-132 (133)
 49 PF02494 HYR:  HYR domain;  Int  21.6 3.5E+02  0.0076   20.2   9.7   24  228-253    58-81  (81)

No 1  
>KOG4289|consensus
Probab=100.00  E-value=1.3e-70  Score=573.77  Aligned_cols=426  Identities=34%  Similarity=0.464  Sum_probs=398.2

Q ss_pred             HhhhcCceEEEEEeCCCCCCeEEEEEEccCCCCCCCCEEEEecCCCCCcccEEEeCCccEEEeCccCCccCCceeeEEEE
Q psy8629          12 AVAADEHLRDLEISEGVPVGTRVGFIGDGFVGDSGPPYLIVPVPGSAVDTDLSIEQNTGEIRTKVPLDRETRSSYSLVAI   91 (483)
Q Consensus        12 ~~~~~~~~~~~~v~En~~~gt~V~~v~a~d~d~~~~~y~i~~~~~~~~~~~F~Id~~tG~i~~~~~LD~E~~~~y~l~v~   91 (483)
                      ++.|.+..|...++||.|+||.|.+++|.|++..++.|++.....+...++|+||+.+|.|++++.||||....+-|+|.
T Consensus       161 ~~~Fqq~~Yq~~lpEn~pagT~iasv~A~~~~a~rl~Ysm~al~dsRS~~lFslD~~sG~irta~~lDREt~e~HvlrVt  240 (2531)
T KOG4289|consen  161 AVQFQQPNYQKELPENEPAGTIIASVKASDPDAGRLYYSMVALFDSRSQNLFSLDPMSGAIRTAKSLDRETKETHVLRVT  240 (2531)
T ss_pred             CccCCCcchhccCcCCCCCCceeEEEEecCCCcCceEEEeeeccchhccccEeeccccccchhhhhhhhhhhheeEEEEE
Confidence            45899999999999999999999999999999888999998766666778999999999999999999999998989888


Q ss_pred             ecCCcceeEEEeecCcEEEEcccCCccccceEEEEEEeCCCCeEEEEEEEecCCCCCCccCCCceEEEEeCCCCCCeEEE
Q psy8629          92 PLSGENVRVVSFYEPGEIRTKVPLDRETRSSYSLVAIPLSGENVRVVVRVIDENDNAPTFPSPFMSIEFPENTPRDVKRT  171 (483)
Q Consensus        92 a~d~~~~~~~~~~~~g~l~~~~~ld~e~~~~~~~~~~~~~~~~~~v~I~V~DvND~~P~f~~~~~~~~V~E~~~~gt~v~  171 (483)
                      |.|.+.                               +.-+.+++|+|.|.|+|||.|+|.+.+|...+.||.++|..|.
T Consensus       241 A~d~~~-------------------------------P~~SAtttv~V~V~D~nDhsPvFEq~~Y~e~lREn~evGy~vL  289 (2531)
T KOG4289|consen  241 AQDHGD-------------------------------PRRSATTTVTVLVLDTNDHSPVFEQDEYREELRENLEVGYEVL  289 (2531)
T ss_pred             eeecCC-------------------------------CcccceeEEEEEEeecCCCCcccchhHHHHHHhhccccCceEE
Confidence            877543                               3334588999999999999999999999999999999999999


Q ss_pred             EEEEEeCCCCCCceEEEEEEeCCCCCceEEecccCCCceEEEEEEeccCCCCCCCCEEEEEEEEEECCCCCeeeEEEEEE
Q psy8629         172 LNAAKDRDLGIYNTQRYNIVSGNTNNAFRLSSHREKDGVLYLDLQINGVLDRETIPEYSLVIEALDGGSPPLRGSMIVNV  251 (483)
Q Consensus       172 ~v~a~D~D~~~~~~~~y~l~~~~~~~~F~i~~~~~~~g~~~~~l~~~~~LD~E~~~~~~l~v~a~D~g~p~~~~~~~v~I  251 (483)
                      +++|+|.|.++|+.+.|++..|++.+.|.|++.   +|.    |++.++||||+...|+|.|.|+|.|.|+...++.|.|
T Consensus       290 tvrAtD~Dsp~Nani~Yrl~eg~~~~~f~in~r---SGv----I~T~a~lDRE~~~~y~L~VeAsDqG~~pgp~Ta~V~i  362 (2531)
T KOG4289|consen  290 TVRATDGDSPPNANIRYRLLEGNAKNVFEINPR---SGV----ISTRAPLDREELESYQLDVEASDQGRPPGPRTAMVEI  362 (2531)
T ss_pred             EEEeccCCCCCCCceEEEecCCCccceeEEcCc---cce----eeccCccCHHhhhheEEEEEeccCCCCCCCceEEEEE
Confidence            999999999999999999999988999999994   567    8899999999999999999999999998888999999


Q ss_pred             EEEecCCCCCeecCCceEEEecCCCCCCcEEEEEEEEeCCCCCccEEEEEEEecCCCCCcceEEeCCccEEEEcccCCCc
Q psy8629         252 TIQDVNDNQPIFNQSRYFATVPENATVGTSIFQVFATDADAGSNGEVEYSINRRQSDKDGLFNIDSKSGLIFVNKALDFE  331 (483)
Q Consensus       252 ~V~dvNd~~P~f~~~~~~~~v~E~~~~gt~v~~v~a~D~D~~~~~~i~y~i~~~~~~~~~~F~Id~~tG~l~~~~~LD~E  331 (483)
                      +|.|+|||+|+|....|...|.|+..++++|++|+|+|.|.|.|+.+.|+|.  +++..+.|.||..||+|.+..+||||
T Consensus       363 tV~D~NDNaPqFse~~Yvvqv~Edvt~~avvlrV~AtDrD~g~Ng~VHYsi~--Sgn~~G~f~id~~tGel~vv~plD~e  440 (2531)
T KOG4289|consen  363 TVEDENDNAPQFSEKRYVVQVREDVTPPAVVLRVTATDRDKGTNGKVHYSIA--SGNGRGQFYIDSLTGELDVVEPLDFE  440 (2531)
T ss_pred             EEEecCCCCccccccceEEEecccCCCCceEEEEEecccCCCcCceEEEEee--ccCccccEEEecccceEEEecccccc
Confidence            9999999999999999999999999999999999999999999999999995  57889999999999999999999999


Q ss_pred             ccceeeeEeE--------------EEE---Eee-------------eeeccCCCCeEEEEEEEEeCCCCCCceEEEEEee
Q psy8629         332 TKEVHDLVPR--------------FAV---EIS-------------SFDEIAEPGTSVFQISAFDQDEGINSVITYSLRE  381 (483)
Q Consensus       332 ~~~~~~l~v~--------------~~~---~v~-------------~v~E~~~~gt~v~~v~a~D~D~~~~~~v~y~l~~  381 (483)
                      .. .|.+.|+              +++   .|+             ++.||.+.|..++.+.|.|.|.|.|+.++|++. 
T Consensus       441 ~~-~ytl~IrAqDggrPpLsn~sgl~iqVlDINDhaPifvstpfq~tvlEnv~lg~~v~~vqaidadsg~na~l~y~la-  518 (2531)
T KOG4289|consen  441 NS-EYTLRIRAQDGGRPPLSNTSGLVIQVLDINDHAPIFVSTPFQATVLENVPLGYLVCHVQAIDADSGENARLHYSLA-  518 (2531)
T ss_pred             CC-eeEEEEEcccCCCCCccCCCceEEEEEecCCCCceeEechhhhhhhhcccccceEEEEecccCCCCcccceeeeec-
Confidence            98 9999988              222   222             688999999999999999999999999999997 


Q ss_pred             CCCCCCCcEEEeCCCcEEEEcccCCCCCCCeeEEEEEEEECCcCCceEEEEEEEEEEecCCCCCeecCCceEEEEcCCCC
Q psy8629         382 NTDDHPSWFQIDSKSGLITMKSNVDCETDPVPKLTVIATDNGVPALSSSATVLVTIHDVNDNEPIFDQSFYNVTIPENKP  461 (483)
Q Consensus       382 ~~~~~~~~F~id~~tG~i~~~~~lD~E~~~~y~l~V~a~D~g~p~~s~~~~v~I~V~dvNd~~P~f~~~~~~~~V~E~~~  461 (483)
                          ..+.|.|+..+|.|++.+.||||+.+.|.|-|.|+|.|.|++++.+.|.|.+.|+|||.|.|.+..|+..+.|++|
T Consensus       519 ----G~~pf~I~~~SG~Itvtk~ldrEt~~~ysl~V~ard~gtp~l~tstsI~Vtv~dvndndP~Ft~~eytl~inED~p  594 (2531)
T KOG4289|consen  519 ----GVGPFQINNGSGWITVTKELDRETVEHYSLGVEARDHGTPPLSTSTSISVTVLDVNDNDPTFTQKEYTLRINEDAP  594 (2531)
T ss_pred             ----cCCCeeEecCCceEEEeecccccccceEEEEEEEcCCCCCcccccceEEEEecccCCCCCccccCceEEEecCCcc
Confidence                3458999999999999999999999999999999999999999999999999999999999999999999999999


Q ss_pred             CCcEEEEEEEEeCCCCCCCcCC
Q psy8629         462 KGSCILKSIFYDIFNISEHSFK  483 (483)
Q Consensus       462 ~g~~v~~v~a~D~D~~~~~~~~  483 (483)
                      .|+.|.+|+|.|.|.++.++|.
T Consensus       595 vgsSI~tvtAvD~d~~s~ityq  616 (2531)
T KOG4289|consen  595 VGSSIVTVTAVDRDANSVITYQ  616 (2531)
T ss_pred             ccceEEEEEEeccccccceEEE
Confidence            9999999999999999988873


No 2  
>KOG4289|consensus
Probab=100.00  E-value=6.5e-70  Score=568.57  Aligned_cols=419  Identities=34%  Similarity=0.439  Sum_probs=385.5

Q ss_pred             HhhhcCceEEEEEeCCCCCCeEEEEEEccCCC---CCCCCEEEEecCCCCCcccEEEeCCccEEEeCccCCccCCceeeE
Q psy8629          12 AVAADEHLRDLEISEGVPVGTRVGFIGDGFVG---DSGPPYLIVPVPGSAVDTDLSIEQNTGEIRTKVPLDRETRSSYSL   88 (483)
Q Consensus        12 ~~~~~~~~~~~~v~En~~~gt~V~~v~a~d~d---~~~~~y~i~~~~~~~~~~~F~Id~~tG~i~~~~~LD~E~~~~y~l   88 (483)
                      ++-|+|++|.-++.||.++|..|.+++|+|.|   +..+.|++..+   .+.+.|+||+.+|.|+++.+||||+..+|.|
T Consensus       266 sPvFEq~~Y~e~lREn~evGy~vLtvrAtD~Dsp~Nani~Yrl~eg---~~~~~f~in~rSGvI~T~a~lDRE~~~~y~L  342 (2531)
T KOG4289|consen  266 SPVFEQDEYREELRENLEVGYEVLTVRATDGDSPPNANIRYRLLEG---NAKNVFEINPRSGVISTRAPLDREELESYQL  342 (2531)
T ss_pred             CcccchhHHHHHHhhccccCceEEEEEeccCCCCCCCceEEEecCC---CccceeEEcCccceeeccCccCHHhhhheEE
Confidence            46799999999999999999999999999965   66789999854   3456899999999999999999999999999


Q ss_pred             EEEecCCcceeEEEeecCcEEEEcccCCccccceEEEEEEeCCCCeEEEEEEEecCCCCCCccCCCceEEEEeCCCCCCe
Q psy8629          89 VAIPLSGENVRVVSFYEPGEIRTKVPLDRETRSSYSLVAIPLSGENVRVVVRVIDENDNAPTFPSPFMSIEFPENTPRDV  168 (483)
Q Consensus        89 ~v~a~d~~~~~~~~~~~~g~l~~~~~ld~e~~~~~~~~~~~~~~~~~~v~I~V~DvND~~P~f~~~~~~~~V~E~~~~gt  168 (483)
                      .|.|+|.+.                               +....++.|.|+|.|+|||+|+|....|..+|.|+..+++
T Consensus       343 ~VeAsDqG~-------------------------------~pgp~Ta~V~itV~D~NDNaPqFse~~Yvvqv~Edvt~~a  391 (2531)
T KOG4289|consen  343 DVEASDQGR-------------------------------PPGPRTAMVEITVEDENDNAPQFSEKRYVVQVREDVTPPA  391 (2531)
T ss_pred             EEEeccCCC-------------------------------CCCCceEEEEEEEEecCCCCccccccceEEEecccCCCCc
Confidence            999988542                               1112278999999999999999999999999999999999


Q ss_pred             EEEEEEEEeCCCCCCceEEEEEEeCCCCCceEEecccCCCceEEEEEEeccCCCCCCCCEEEEEEEEEECCCCCeeeEEE
Q psy8629         169 KRTLNAAKDRDLGIYNTQRYNIVSGNTNNAFRLSSHREKDGVLYLDLQINGVLDRETIPEYSLVIEALDGGSPPLRGSMI  248 (483)
Q Consensus       169 ~v~~v~a~D~D~~~~~~~~y~l~~~~~~~~F~i~~~~~~~g~~~~~l~~~~~LD~E~~~~~~l~v~a~D~g~p~~~~~~~  248 (483)
                      .|.+|+|+|.|.|.|+.+.|+|.+|+..+.|.|+..+   |.    +.+..+||+|.. .|++.|+|.|+|.|+++.+.-
T Consensus       392 vvlrV~AtDrD~g~Ng~VHYsi~Sgn~~G~f~id~~t---Ge----l~vv~plD~e~~-~ytl~IrAqDggrPpLsn~sg  463 (2531)
T KOG4289|consen  392 VVLRVTATDRDKGTNGKVHYSIASGNGRGQFYIDSLT---GE----LDVVEPLDFENS-EYTLRIRAQDGGRPPLSNTSG  463 (2531)
T ss_pred             eEEEEEecccCCCcCceEEEEeeccCccccEEEeccc---ce----EEEeccccccCC-eeEEEEEcccCCCCCccCCCc
Confidence            9999999999999999999999999999999999954   55    788899999998 999999999999999999999


Q ss_pred             EEEEEEecCCCCCeecCCceEEEecCCCCCCcEEEEEEEEeCCCCCccEEEEEEEecCCCCCcceEEeCCccEEEEcccC
Q psy8629         249 VNVTIQDVNDNQPIFNQSRYFATVPENATVGTSIFQVFATDADAGSNGEVEYSINRRQSDKDGLFNIDSKSGLIFVNKAL  328 (483)
Q Consensus       249 v~I~V~dvNd~~P~f~~~~~~~~v~E~~~~gt~v~~v~a~D~D~~~~~~i~y~i~~~~~~~~~~F~Id~~tG~l~~~~~L  328 (483)
                      +.|+|+|+|||+|.|....+..+|.|+.+.|..++.+.|.|.|.|+|+.+.|++.+     .+.|.|+..+|+|.+.+.|
T Consensus       464 l~iqVlDINDhaPifvstpfq~tvlEnv~lg~~v~~vqaidadsg~na~l~y~laG-----~~pf~I~~~SG~Itvtk~l  538 (2531)
T KOG4289|consen  464 LVIQVLDINDHAPIFVSTPFQATVLENVPLGYLVCHVQAIDADSGENARLHYSLAG-----VGPFQINNGSGWITVTKEL  538 (2531)
T ss_pred             eEEEEEecCCCCceeEechhhhhhhhcccccceEEEEecccCCCCcccceeeeecc-----CCCeeEecCCceEEEeecc
Confidence            99999999999999999999999999999999999999999999999999999954     4589999999999999999


Q ss_pred             CCcccceeeeEeE-------------------------------------------------------------------
Q psy8629         329 DFETKEVHDLVPR-------------------------------------------------------------------  341 (483)
Q Consensus       329 D~E~~~~~~l~v~-------------------------------------------------------------------  341 (483)
                      |||+.+.|.|.|+                                                                   
T Consensus       539 drEt~~~ysl~V~ard~gtp~l~tstsI~Vtv~dvndndP~Ft~~eytl~inED~pvgsSI~tvtAvD~d~~s~ityqi~  618 (2531)
T KOG4289|consen  539 DRETVEHYSLGVEARDHGTPPLSTSTSISVTVLDVNDNDPTFTQKEYTLRINEDAPVGSSIVTVTAVDRDANSVITYQIT  618 (2531)
T ss_pred             cccccceEEEEEEEcCCCCCcccccceEEEEecccCCCCCccccCceEEEecCCccccceEEEEEEeccccccceEEEec
Confidence            9999999998887                                                                   


Q ss_pred             --------------------------------------------------------------------EEEEee------
Q psy8629         342 --------------------------------------------------------------------FAVEIS------  347 (483)
Q Consensus       342 --------------------------------------------------------------------~~~~v~------  347 (483)
                                                                                          |+.+|+      
T Consensus       619 g~ntrn~Fsi~si~g~Glitlalp~dkKqe~~~vl~vtAtDg~l~d~~~V~v~I~danThrpvFqs~pfTvsI~e~rP~G  698 (2531)
T KOG4289|consen  619 GGNTRNRFSISSIGGGGLITLALPLDKKQERQYVLAVTATDGTLQDTCSVNVNITDANTHRPVFQSSPFTVSINEDRPLG  698 (2531)
T ss_pred             CCcccccceeeccCCcceEEeecchhhcccceEEEEEEecCCccccceEEEEEeeecccCCcccccCCeeEeeccCCcCC
Confidence                                                                                000000      


Q ss_pred             --------------------------------------------------------------------------------
Q psy8629         348 --------------------------------------------------------------------------------  347 (483)
Q Consensus       348 --------------------------------------------------------------------------------  347 (483)
                                                                                                      
T Consensus       699 ~tvvtlsasd~D~geNARI~y~led~~Frid~dsg~i~t~~~ld~edqvtytl~itA~D~~~pq~adtttveV~v~diND  778 (2531)
T KOG4289|consen  699 TTVVTLSASDEDTGENARITYILEDEAFRIDPDSGAIYTQAELDYEDQVTYTLAITARDNGIPQKADTTTVEVLVNDIND  778 (2531)
T ss_pred             ceeEEEecccCCCCccceEEEEecccceeecCCCCceEEeeeeecccceeeEeeeeecCCCCCCcCccEEEEEEeecccc
Confidence                                                                                            


Q ss_pred             ------------eeeccCCCCeEEEEEEEEeCCCCCCceEEEEEeeCCCCCCCcEEEeCCCcEEEEcccCCCCCCCeeEE
Q psy8629         348 ------------SFDEIAEPGTSVFQISAFDQDEGINSVITYSLRENTDDHPSWFQIDSKSGLITMKSNVDCETDPVPKL  415 (483)
Q Consensus       348 ------------~v~E~~~~gt~v~~v~a~D~D~~~~~~v~y~l~~~~~~~~~~F~id~~tG~i~~~~~lD~E~~~~y~l  415 (483)
                                  +|.|++|++|.|++|+|+|.|.|.|+.+.|.+..+. +..+.|.|++++|.|++.+.||||....|.|
T Consensus       779 naPqf~assyt~sV~Ed~Pv~TsvlQVSatDaD~g~Ng~v~y~~qg~~-d~p~~F~IEptSGviRtl~rLdRE~~avy~L  857 (2531)
T KOG4289|consen  779 NAPQFLASSYTGSVFEDAPVFTSVLQVSATDADSGPNGRVYYTFQGGD-DGPGDFYIEPTSGVIRTLRRLDRENVAVYVL  857 (2531)
T ss_pred             cCcccchhhceeEeecCCCCcceEEEEEEeccCCCCCceEEEEecCCC-CCCCceEEccCcceeehhhhhcchheeEEEE
Confidence                        788999999999999999999999999999887543 3458899999999999999999999999999


Q ss_pred             EEEEEECCcCCceEEEEEEEEEEecCCCCCeecCCceEEEEcCCCCCCcEEEEEEEEeCCCCC
Q psy8629         416 TVIATDNGVPALSSSATVLVTIHDVNDNEPIFDQSFYNVTIPENKPKGSCILKSIFYDIFNIS  478 (483)
Q Consensus       416 ~V~a~D~g~p~~s~~~~v~I~V~dvNd~~P~f~~~~~~~~V~E~~~~g~~v~~v~a~D~D~~~  478 (483)
                      .+.|+|.|.|++++++.|+|+|+|+|||||+|.+..|+..|.||.++|+.+++++|.|||.+.
T Consensus       858 ~a~avDrg~p~ls~~~eItvtvldvNDnaPvfe~~e~e~~I~enspvgs~va~i~a~dpdEG~  920 (2531)
T KOG4289|consen  858 AAYAVDRGNPPLSAPVEITVTVLDVNDNAPVFEQDELELFIEENSPVGSVVALITADDPDEGP  920 (2531)
T ss_pred             EEEEeeCCCCCcCCceEEEEEEEecCCCCCCCCCcceeeEEeecCccceeeEEEEccCCCcCC
Confidence            999999999999999999999999999999999999999999999999999999999999643


No 3  
>KOG1219|consensus
Probab=100.00  E-value=8.2e-61  Score=514.24  Aligned_cols=447  Identities=31%  Similarity=0.421  Sum_probs=395.3

Q ss_pred             HhhhcCceEEEEEeCCCCCCeEEEEEEccCCC-CCCCCEEEEecCCCCCcccEEEeCCccEEEeCccCCccCCceeeEEE
Q psy8629          12 AVAADEHLRDLEISEGVPVGTRVGFIGDGFVG-DSGPPYLIVPVPGSAVDTDLSIEQNTGEIRTKVPLDRETRSSYSLVA   90 (483)
Q Consensus        12 ~~~~~~~~~~~~v~En~~~gt~V~~v~a~d~d-~~~~~y~i~~~~~~~~~~~F~Id~~tG~i~~~~~LD~E~~~~y~l~v   90 (483)
                      ++.|.+..|.++++|+++.|+.|.++.|+|.| +..+.|+|..  ++.....|+|++.||+|++.+.||||+...|.+.+
T Consensus      2175 ~PvFeqlsYt~sisE~s~igt~viqilATdsDsn~~isYsl~g--~s~~sk~f~In~sTG~it~~g~ldyE~~q~f~~fv 2252 (4289)
T KOG1219|consen 2175 PPVFEQLSYTISISENSKIGTKVIQILATDSDSNREISYSLEG--NSEISKPFRINVSTGWITVAGKLDYEENQEFRFFV 2252 (4289)
T ss_pred             CchhheeeEEEEccCCCccCceEEEEEeccCCCCCceEEEeec--CCccccceEEecccceEEEeeecChhhcceEEEEE
Confidence            46799999999999999999999999999988 6678999986  44556789999999999999999999999999999


Q ss_pred             EecCCccee----------------------------------------EE-----------------------------
Q psy8629          91 IPLSGENVR----------------------------------------VV-----------------------------  101 (483)
Q Consensus        91 ~a~d~~~~~----------------------------------------~~-----------------------------  101 (483)
                      .|.||+...                                        ++                             
T Consensus      2253 ratdggk~lSseviv~V~VeD~Ndn~Pef~q~~~ea~vsd~a~~g~fit~v~a~D~Dssd~lk~ey~~~~~l~~s~~G~i 2332 (4289)
T KOG1219|consen 2253 RATDGGKPLSSEVIVEVHVEDFNDNPPEFNQRNYEAFVSDPARSGHFITVVNAHDLDSSDHLKLEYNSNHFLILSENGII 2332 (4289)
T ss_pred             EEccCCCcccccEEEEEEehhcCCCCchhccccceeecCCCccceeEEEEEEeccCCccchhhhhhcccceeeeccCceE
Confidence            999884210                                        00                             


Q ss_pred             --------------------------------------------------------------------------------
Q psy8629         102 --------------------------------------------------------------------------------  101 (483)
Q Consensus       102 --------------------------------------------------------------------------------  101 (483)
                                                                                                      
T Consensus      2333 TlfNl~k~~l~~s~~lrv~vsD~v~~at~~vl~~~~~~n~~~~lveka~l~Tv~~~~~~~~~~f~~~gt~~~~si~s~~s 2412 (4289)
T KOG1219|consen 2333 TLFNLLKSPLQTSYPLRVTVSDGVFRATMEVLFHPHSRNHFSELVEKADLVTVVEHDEQEDADFGAYGTSIYYSINSRAS 2412 (4289)
T ss_pred             EehhhcccccccccceeeeeccCcceeeeEEEEEecCcccchhhhhccceeEEEEecCccccccccCCceeeeeechhcc
Confidence                                                                                            


Q ss_pred             ---EeecCcEEEEcccCCccccceEEEEEEeC--CCC----eEEEEEEEecCCCCCCccCCCceEEEEeCCCCCCeEEEE
Q psy8629         102 ---SFYEPGEIRTKVPLDRETRSSYSLVAIPL--SGE----NVRVVVRVIDENDNAPTFPSPFMSIEFPENTPRDVKRTL  172 (483)
Q Consensus       102 ---~~~~~g~l~~~~~ld~e~~~~~~~~~~~~--~~~----~~~v~I~V~DvND~~P~f~~~~~~~~V~E~~~~gt~v~~  172 (483)
                         .+.+.|.|.+...||||....+.+.+..+  |++    -++++|.+.|+|||||.|....|++.|.|++..|..|++
T Consensus      2413 d~~~in~~GqI~t~~kld~e~s~~~vi~i~v~a~Da~gr~af~tvti~ltDiNDnpPqF~a~~Y~~nI~enaskg~~V~~ 2492 (4289)
T KOG1219|consen 2413 DHFEINKSGQIKTLSKLDREYSEELVIIIAVMAFDAGGRVAFCTVTIILTDINDNPPQFDAQLYRVNITENASKGKLVGH 2492 (4289)
T ss_pred             CceeECCCccEEeeehhhhccCceEEEEEEEEEecCCCeEEEEEEEEEEEecCCCCccccceeEEEEeecccCCCceEEE
Confidence               01136888889999999998888876654  654    678999999999999999999999999999999999999


Q ss_pred             EEEEeCCCCCCceEEEEEEeC-CCCCceEEecccCCCceEEEEEEeccCCCCCCCCEEEEEEEEEECCCCCeeeEEEEEE
Q psy8629         173 NAAKDRDLGIYNTQRYNIVSG-NTNNAFRLSSHREKDGVLYLDLQINGVLDRETIPEYSLVIEALDGGSPPLRGSMIVNV  251 (483)
Q Consensus       173 v~a~D~D~~~~~~~~y~l~~~-~~~~~F~i~~~~~~~g~~~~~l~~~~~LD~E~~~~~~l~v~a~D~g~p~~~~~~~v~I  251 (483)
                      +.|+|.|.+.|..++|.+.++ .....|.|++    +|.    |.+.+.|++++...|.|.|+|.|+|.|++.+..+|.+
T Consensus      2493 v~A~D~De~snadvty~i~~e~~~~~v~~in~----sG~----Itv~~sL~~~en~tl~l~vkA~D~g~P~~~s~ttV~v 2564 (4289)
T KOG1219|consen 2493 VIARDADEGSNADVTYEIVGESDVKHVFEINE----SGV----ITVKRSLDGLENSTLHLFVKAIDDGKPRRRSNTTVIV 2564 (4289)
T ss_pred             EEEecCCCCCcccEEEEecCchhhhheeeecC----Cce----EEeehhhhcccCcEEEEEEEeccCCCCCcccceEEEE
Confidence            999999999999999999876 2234566653    577    7889999999999999999999999999999999999


Q ss_pred             EEEecCCCCCeecCCceEEEecCCCCCCcEEEEEEEEeCCCCCccEEEEEEEecCCCC---CcceEEeCCccEEEEcccC
Q psy8629         252 TIQDVNDNQPIFNQSRYFATVPENATVGTSIFQVFATDADAGSNGEVEYSINRRQSDK---DGLFNIDSKSGLIFVNKAL  328 (483)
Q Consensus       252 ~V~dvNd~~P~f~~~~~~~~v~E~~~~gt~v~~v~a~D~D~~~~~~i~y~i~~~~~~~---~~~F~Id~~tG~l~~~~~L  328 (483)
                      +|++..++.|.|+.+.|.++|+|+.++|+.|++++|.|.|.    ..-|++...++++   ...|.+|++||.|.++++|
T Consensus      2565 ~vl~e~v~lPrFSep~y~fsvpEDv~vG~~Ig~v~a~~a~~----~~i~~~v~~gt~Esn~d~~Fsvdr~TG~i~v~ksL 2640 (4289)
T KOG1219|consen 2565 TVLPEDVNLPRFSEPIYTFSVPEDVPVGEEIGQVSASDADE----HVIYSLVLGGTPESNPDLPFSVDRNTGMIKVNKSL 2640 (4289)
T ss_pred             EecCcccCcccccCceEEEeccccCCCCCeeeEEeecccCC----ceEEEEEeCCCCCCCCCCceEEcCCCceEEecccc
Confidence            99999999999999999999999999999999999999985    3566666544444   4459999999999999999


Q ss_pred             CCcccceeeeEeEE----------EEEee------------------eeeccCCCCeEEEEEEEEeCCCCCCceEEEEEe
Q psy8629         329 DFETKEVHDLVPRF----------AVEIS------------------SFDEIAEPGTSVFQISAFDQDEGINSVITYSLR  380 (483)
Q Consensus       329 D~E~~~~~~l~v~~----------~~~v~------------------~v~E~~~~gt~v~~v~a~D~D~~~~~~v~y~l~  380 (483)
                      |+|...+|.+.|..          .+.|.                  .+.||.+.|+.|++++|+|.|.|.|++|+|+|.
T Consensus      2641 D~E~kk~yqi~v~a~~~~~vva~tsv~vqVkDvNDNaPvFe~d~y~f~i~En~pvGtsV~qf~AsD~Ds~~nGqirysl~ 2720 (4289)
T KOG1219|consen 2641 DHEKKKSYQIKVKATCGQWVVAETSVFVQVKDVNDNAPVFEKDPYLFIIEENSPVGTSVIQFHASDMDSGNNGQIRYSLT 2720 (4289)
T ss_pred             chhhhceEEEEEEeecCCceEEEEEEEEEeecccCCCccccCCceeEEEeccCCCCceEEEEEeeccCCCCCceEEEEEc
Confidence            99999999999881          11111                  567999999999999999999999999999998


Q ss_pred             eCCCCCCCcEEEeCCCcEEEEcccCCCCCCCeeEEEEEEEECCcCCceEEEEEEEEEEecCCCCCeecCCceEEEEcCCC
Q psy8629         381 ENTDDHPSWFQIDSKSGLITMKSNVDCETDPVPKLTVIATDNGVPALSSSATVLVTIHDVNDNEPIFDQSFYNVTIPENK  460 (483)
Q Consensus       381 ~~~~~~~~~F~id~~tG~i~~~~~lD~E~~~~y~l~V~a~D~g~p~~s~~~~v~I~V~dvNd~~P~f~~~~~~~~V~E~~  460 (483)
                      ..    ..+|.|++.||+|++...||+|++..|.|.|+|+|.|.|  ++.++|.|.|.|+||+||+|+...|.++|.|+.
T Consensus      2721 ~~----v~yF~In~etGwlTt~~eld~ek~d~y~lkv~AtDhG~~--ssq~~v~v~vtDvndspprf~~eiy~gtvv~d~ 2794 (4289)
T KOG1219|consen 2721 SP----VPYFAINPETGWLTTLFELDLEKQDLYSLKVVATDHGVP--SSQATVLVHVTDVNDSPPRFQREIYEGTVVEDV 2794 (4289)
T ss_pred             CC----cceEEEcCCCCeeeehhhhccccCCceEEEEEEecCCcc--cccceEEEEEEecCCCcchhhhHhhccceeccC
Confidence            42    339999999999999999999999999999999999986  788999999999999999999999999999999


Q ss_pred             CCCcEEEEEEEEeCCCCC
Q psy8629         461 PKGSCILKSIFYDIFNIS  478 (483)
Q Consensus       461 ~~g~~v~~v~a~D~D~~~  478 (483)
                      +.|.+++.++..|.|-..
T Consensus      2795 p~~~~ia~~si~d~D~s~ 2812 (4289)
T KOG1219|consen 2795 PGGKIIAGLSIFDADVSE 2812 (4289)
T ss_pred             CCCceeeeeEeccccccc
Confidence            999999999999999543


No 4  
>KOG1219|consensus
Probab=100.00  E-value=4.6e-60  Score=508.46  Aligned_cols=416  Identities=29%  Similarity=0.400  Sum_probs=380.9

Q ss_pred             ceEEEEEeCCCCCCeEEEEEEccCCC---CCCCCEEEEecCCCCCcccEEEeCCccEEEeCccCCccCCceeeEEEEecC
Q psy8629          18 HLRDLEISEGVPVGTRVGFIGDGFVG---DSGPPYLIVPVPGSAVDTDLSIEQNTGEIRTKVPLDRETRSSYSLVAIPLS   94 (483)
Q Consensus        18 ~~~~~~v~En~~~gt~V~~v~a~d~d---~~~~~y~i~~~~~~~~~~~F~Id~~tG~i~~~~~LD~E~~~~y~l~v~a~d   94 (483)
                      -..-+.|+|+.|.|+.++.++|+|.|   +..+.|-|..+.   .+..|.|+..+|.|++.++||+|....|.|.+.|.|
T Consensus       643 fP~iI~v~Edvpigt~la~L~atD~Dtgfng~l~yvI~dgn---e~~~~~Id~qsg~itvas~ld~~~t~~yiLnvta~D  719 (4289)
T KOG1219|consen  643 FPFIIVVPEDVPIGTTLAILSATDSDTGFNGKLVYVIEDGN---ESICFLIDRQSGNITVASPLDNENTEQYILNVTAYD  719 (4289)
T ss_pred             CCceEEccccCCCCceEEEEeccCCCCCcCceEEEEEeCCc---cceEEEEecccceEEEecchhhhhhheeEEEEEEec
Confidence            45678999999999999999999977   556789998643   456899999999999999999999999999999998


Q ss_pred             CcceeEEEeecCcEEEEcccCCccccceEEEEEEeCCCCeEEEEEEEecCCCCCCccCCCceEEEEeCCCCCCeEEEEEE
Q psy8629          95 GENVRVVSFYEPGEIRTKVPLDRETRSSYSLVAIPLSGENVRVVVRVIDENDNAPTFPSPFMSIEFPENTPRDVKRTLNA  174 (483)
Q Consensus        95 ~~~~~~~~~~~~g~l~~~~~ld~e~~~~~~~~~~~~~~~~~~v~I~V~DvND~~P~f~~~~~~~~V~E~~~~gt~v~~v~  174 (483)
                      ++.                               +.++....+.|.|.|.|||+|+|.+..|.++|+|+..+|+.|++|.
T Consensus       720 ~gt-------------------------------Pqkss~r~l~v~vkd~ndn~p~f~e~sy~vtvsedtepgs~Ia~ve  768 (4289)
T KOG1219|consen  720 LGT-------------------------------PQKSSWRLLLVFVKDYNDNTPIFVERSYHVTVSEDTEPGSFIAHVE  768 (4289)
T ss_pred             CCC-------------------------------chhhceeeEEEEEEecccCCccccccceEEEEecCCCCCceEEEEE
Confidence            763                               4455567788999999999999999999999999999999999999


Q ss_pred             EEeCCCCCCceEEEEEEeCCCCCceEEecccCCCceEEEEEEeccCCCCCCCCEEEEEEEEEECCCCCeeeEEEEEEEEE
Q psy8629         175 AKDRDLGIYNTQRYNIVSGNTNNAFRLSSHREKDGVLYLDLQINGVLDRETIPEYSLVIEALDGGSPPLRGSMIVNVTIQ  254 (483)
Q Consensus       175 a~D~D~~~~~~~~y~l~~~~~~~~F~i~~~~~~~g~~~~~l~~~~~LD~E~~~~~~l~v~a~D~g~p~~~~~~~v~I~V~  254 (483)
                      |.|.|.|.||.++|+|..  +...|+|++.   +|.    +.+.++||||.+..|.|.|+|+|.+.|.+.+...+.|.|.
T Consensus       769 tnd~D~g~NG~v~fsL~n--~sdvfsIdp~---tGi----vv~~~sLdrE~q~~y~l~I~a~dqp~pq~~svv~l~vsve  839 (4289)
T KOG1219|consen  769 TNDTDGGNNGMVSFSLLN--KSDVFSIDPF---TGI----VVTSKSLDREGQTSYHLKIEARDQPPPQLFSVVELDVSVE  839 (4289)
T ss_pred             ecccCCCCCceEEEEecC--CcceEEecCc---ccE----EEeccccCcccCceeEEEEEEcCCCCCceEEEEEEEEEEe
Confidence            999999999999999976  5679999995   466    7889999999999999999999999999999999999999


Q ss_pred             ecCCCCCeecCCceEEEecCCCCCCcEEEEEEEEeCCCCCccEEEEEEEecCCCCCcceEEeCCccEEEEcccCCCcccc
Q psy8629         255 DVNDNQPIFNQSRYFATVPENATVGTSIFQVFATDADAGSNGEVEYSINRRQSDKDGLFNIDSKSGLIFVNKALDFETKE  334 (483)
Q Consensus       255 dvNd~~P~f~~~~~~~~v~E~~~~gt~v~~v~a~D~D~~~~~~i~y~i~~~~~~~~~~F~Id~~tG~l~~~~~LD~E~~~  334 (483)
                      |+|||+|.|....+...|+|+.|.|+++..+.|.|+|.|+.++++|.+..    ....|.+|--+|.+++.++||||...
T Consensus       840 dVndnpPkci~~hsr~kipedlp~gt~~~~l~A~d~diGq~~kvry~l~~----~~v~~rvd~~sGavfi~~~LDf~k~~  915 (4289)
T KOG1219|consen  840 DVNDNPPKCIIRHSRSKIPEDLPYGTVTWQLVALDPDIGQLGKVRYYLTD----DTVGERVDFPSGAVFIGKPLDFEKSD  915 (4289)
T ss_pred             eccCCCCccccccccccCcccCCCceEEEEhhhcCcccCcCceeEEEEec----CccccccccccccEEEeccccccccc
Confidence            99999999999999999999999999999999999999999999999964    23445999999999999999999999


Q ss_pred             eeeeEeEEE------------EEee--------------------eeeccCCCCeEEEEEEEEeCCCCCCceEEEEEeeC
Q psy8629         335 VHDLVPRFA------------VEIS--------------------SFDEIAEPGTSVFQISAFDQDEGINSVITYSLREN  382 (483)
Q Consensus       335 ~~~l~v~~~------------~~v~--------------------~v~E~~~~gt~v~~v~a~D~D~~~~~~v~y~l~~~  382 (483)
                      .|+|.|++.            ..|.                    +|.||++.|+.++.+.|.|.|.|..+.+.|+|.  
T Consensus       916 fynLsv~a~d~g~p~lss~chl~Vevldv~enlhpp~F~~~v~e~~V~EnapiGT~vi~i~A~dedsgldg~l~Y~I~--  993 (4289)
T KOG1219|consen  916 FYNLSVTAVDRGTPILSSICHLEVEVLDVNENLHPPEFISFVTEGHVLENAPIGTIVIRIQARDEDSGLDGELSYKIR--  993 (4289)
T ss_pred             ceEEEEEEecCCCcceeeeEEEEEEEeccCCCCCCcchheeeeeeeEeecCCcceEEEEEEEecCCCCccceEEEEEE--
Confidence            999998811            1111                    789999999999999999999999999999999  


Q ss_pred             CCCCCCcEEEeCCCcEEEEcccCCCCCCCeeEEEEEEEECCcCCceEEEEEEEEEEecCCCCCeecCCceEEEEcCCCCC
Q psy8629         383 TDDHPSWFQIDSKSGLITMKSNVDCETDPVPKLTVIATDNGVPALSSSATVLVTIHDVNDNEPIFDQSFYNVTIPENKPK  462 (483)
Q Consensus       383 ~~~~~~~F~id~~tG~i~~~~~lD~E~~~~y~l~V~a~D~g~p~~s~~~~v~I~V~dvNd~~P~f~~~~~~~~V~E~~~~  462 (483)
                      .|...+.|.||.++|.|++.+.||||....|-|+|.|+|.|.+++++.+.+.|.|+|+|||+|+|.++.|..+|.||+|.
T Consensus       994 ~gdg~g~FsId~~tG~irTl~~lDrE~ks~YwltveA~D~gt~~~ssv~~vyI~ieDvNDn~Pq~s~pvy~asI~enSp~ 1073 (4289)
T KOG1219|consen  994 TGDGDGIFSIDSTTGSIRTLKALDREKKSSYWLTVEAKDLGTVPLSSVCEVYIEIEDVNDNVPQFSSPVYYASISENSPE 1073 (4289)
T ss_pred             cCCcceeEEecCCcceEeechhhchhhcceEEEEEEEEecCCCccccceeEEEEEEecCCCCcccCCceEeeeeccCCCC
Confidence            56778899999999999999999999999999999999999999999999999999999999999999999999999999


Q ss_pred             CcEEEEEEEEeCCCCC--CCcC
Q psy8629         463 GSCILKSIFYDIFNIS--EHSF  482 (483)
Q Consensus       463 g~~v~~v~a~D~D~~~--~~~~  482 (483)
                      +..|.++.|.|||+.+  .+.|
T Consensus      1074 ~vsivq~ea~D~Dsssn~kLmy 1095 (4289)
T KOG1219|consen 1074 TVSIVQAEANDPDSSSNQKLMY 1095 (4289)
T ss_pred             ceEEEEeccCCCCcccCcceEE
Confidence            9999999999999544  5554


No 5  
>cd00031 CA Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here
Probab=99.97  E-value=3.7e-29  Score=231.03  Aligned_cols=198  Identities=44%  Similarity=0.654  Sum_probs=178.0

Q ss_pred             eEEEEeCCCCCCeEEEEEEEEeCCCCCCceEEEEEEeCCCCCceEEecccCCCceEEEEEEeccCCCCCCCCEEEEEEEE
Q psy8629         156 MSIEFPENTPRDVKRTLNAAKDRDLGIYNTQRYNIVSGNTNNAFRLSSHREKDGVLYLDLQINGVLDRETIPEYSLVIEA  235 (483)
Q Consensus       156 ~~~~V~E~~~~gt~v~~v~a~D~D~~~~~~~~y~l~~~~~~~~F~i~~~~~~~g~~~~~l~~~~~LD~E~~~~~~l~v~a  235 (483)
                      |.+.|+|+++.|+.++++.|.|+|.+.++.++|+|.++....+|.|++.   +|.    |.+.+.||||....|.|.|+|
T Consensus         2 ~~~~i~En~~~g~~v~~~~a~D~D~~~~~~~~y~i~~~~~~~~F~i~~~---tG~----l~~~~~lD~e~~~~~~l~v~a   74 (199)
T cd00031           2 YSVSVPENAPPGTVVGTVSATDPDSGENGRVTYSILGGNEDGLFSIDPN---TGV----ITTTKPLDREEQSEYTLTVVA   74 (199)
T ss_pred             eEEEEeCCCCCCCEEEEEEEECCCCCCCceEEEEEeCCCCcccEEEeCC---CCE----EEECCCCCCcCCceEEEEEEE
Confidence            7899999999999999999999999888899999998765579999985   466    778899999999999999999


Q ss_pred             EECCCCCeeeEEEEEEEEEecCCCCCeecCCceEEEecCCCCCCcEEEEEEEEeCCCCCccEEEEEEEecCCCCCcceEE
Q psy8629         236 LDGGSPPLRGSMIVNVTIQDVNDNQPIFNQSRYFATVPENATVGTSIFQVFATDADAGSNGEVEYSINRRQSDKDGLFNI  315 (483)
Q Consensus       236 ~D~g~p~~~~~~~v~I~V~dvNd~~P~f~~~~~~~~v~E~~~~gt~v~~v~a~D~D~~~~~~i~y~i~~~~~~~~~~F~I  315 (483)
                      +|.|.|.++++..++|.|.|+||++|.|....|.+.+.|+.+.|+.++++.|+|+|.+.++.++|+|..           
T Consensus        75 ~D~g~~~~~~~~~v~I~V~d~Nd~~P~~~~~~~~~~v~e~~~~~~~i~~~~a~D~D~~~~~~~~y~l~~-----------  143 (199)
T cd00031          75 SDGGGPPLSSTATVTVTVLDVNDNPPVFEQSSYEASVPENAPPGTVVGTVTATDADSGENAKLTYSILS-----------  143 (199)
T ss_pred             EECCcCcceeEEEEEEEEccCCCCCCcccccceEEEEeCCCCCCCEEEEEEEEcCCCCCCccEEEEEeC-----------
Confidence            999888888999999999999999999998899999999999999999999999998766777777742           


Q ss_pred             eCCccEEEEcccCCCcccceeeeEeEEEEEeeeeeccCCCCeEEEEEEEEeCCCCCCceEEEEEeeCCCCCCCcEEEeCC
Q psy8629         316 DSKSGLIFVNKALDFETKEVHDLVPRFAVEISSFDEIAEPGTSVFQISAFDQDEGINSVITYSLRENTDDHPSWFQIDSK  395 (483)
Q Consensus       316 d~~tG~l~~~~~LD~E~~~~~~l~v~~~~~v~~v~E~~~~gt~v~~v~a~D~D~~~~~~v~y~l~~~~~~~~~~F~id~~  395 (483)
                                                                                          +...++|.|++.
T Consensus       144 --------------------------------------------------------------------~~~~~~f~i~~~  155 (199)
T cd00031         144 --------------------------------------------------------------------GNDKELFSIDPN  155 (199)
T ss_pred             --------------------------------------------------------------------CCCCCEEEEeCC
Confidence                                                                                111357888888


Q ss_pred             CcEEEEcccCCCCCCCeeEEEEEEEECCcCCceEEEEEEEEEEe
Q psy8629         396 SGLITMKSNVDCETDPVPKLTVIATDNGVPALSSSATVLVTIHD  439 (483)
Q Consensus       396 tG~i~~~~~lD~E~~~~y~l~V~a~D~g~p~~s~~~~v~I~V~d  439 (483)
                      +|.|.+.+.||||....|.|.|.|+|.+.|.+++++.++|.|.|
T Consensus       156 ~G~i~~~~~ld~e~~~~~~l~v~a~D~~~~~~~~~~~i~i~v~d  199 (199)
T cd00031         156 TGIITLAKPLDREEKSSYELTVVATDGGGPPLSSTATVTVTVLD  199 (199)
T ss_pred             ceEEEeCCccCCccCceEEEEEEEEECCCCCceeEEEEEEEEEC
Confidence            99999999999999999999999999988889999999999875


No 6  
>cd00031 CA Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here
Probab=99.96  E-value=5.4e-28  Score=223.19  Aligned_cols=196  Identities=34%  Similarity=0.503  Sum_probs=172.2

Q ss_pred             eEEEEEeCCCCCCeEEEEEEccCCCC---CCCCEEEEecCCCCCcccEEEeCCccEEEeCccCCccCCceeeEEEEecCC
Q psy8629          19 LRDLEISEGVPVGTRVGFIGDGFVGD---SGPPYLIVPVPGSAVDTDLSIEQNTGEIRTKVPLDRETRSSYSLVAIPLSG   95 (483)
Q Consensus        19 ~~~~~v~En~~~gt~V~~v~a~d~d~---~~~~y~i~~~~~~~~~~~F~Id~~tG~i~~~~~LD~E~~~~y~l~v~a~d~   95 (483)
                      .|.++|+||+++|+.|+++.|.|.|.   ..+.|+|.+...   ..+|.|++.+|+|++.++||||....|.|.+.|.|+
T Consensus         1 ~~~~~i~En~~~g~~v~~~~a~D~D~~~~~~~~y~i~~~~~---~~~F~i~~~tG~l~~~~~lD~e~~~~~~l~v~a~D~   77 (199)
T cd00031           1 SYSVSVPENAPPGTVVGTVSATDPDSGENGRVTYSILGGNE---DGLFSIDPNTGVITTTKPLDREEQSEYTLTVVASDG   77 (199)
T ss_pred             CeEEEEeCCCCCCCEEEEEEEECCCCCCCceEEEEEeCCCC---cccEEEeCCCCEEEECCCCCCcCCceEEEEEEEEEC
Confidence            47899999999999999999999874   457899986532   258999999999999999999999999999999884


Q ss_pred             cceeEEEeecCcEEEEcccCCccccceEEEEEEeCCCCeEEEEEEEecCCCCCCccCCCceEEEEeCCCCCCeEEEEEEE
Q psy8629          96 ENVRVVSFYEPGEIRTKVPLDRETRSSYSLVAIPLSGENVRVVVRVIDENDNAPTFPSPFMSIEFPENTPRDVKRTLNAA  175 (483)
Q Consensus        96 ~~~~~~~~~~~g~l~~~~~ld~e~~~~~~~~~~~~~~~~~~v~I~V~DvND~~P~f~~~~~~~~V~E~~~~gt~v~~v~a  175 (483)
                      +..                               ....+..+.|.|.|+|||+|.|....|.+.|.|+.+.|+.++++.|
T Consensus        78 g~~-------------------------------~~~~~~~v~I~V~d~Nd~~P~~~~~~~~~~v~e~~~~~~~i~~~~a  126 (199)
T cd00031          78 GGP-------------------------------PLSSTATVTVTVLDVNDNPPVFEQSSYEASVPENAPPGTVVGTVTA  126 (199)
T ss_pred             CcC-------------------------------cceeEEEEEEEEccCCCCCCcccccceEEEEeCCCCCCCEEEEEEE
Confidence            321                               0113678999999999999999999999999999999999999999


Q ss_pred             EeCCCCCCceEEEEEEeCCCCCceEEecccCCCceEEEEEEeccCCCCCCCCEEEEEEEEEECCCCCeeeEEEEEEEEEe
Q psy8629         176 KDRDLGIYNTQRYNIVSGNTNNAFRLSSHREKDGVLYLDLQINGVLDRETIPEYSLVIEALDGGSPPLRGSMIVNVTIQD  255 (483)
Q Consensus       176 ~D~D~~~~~~~~y~l~~~~~~~~F~i~~~~~~~g~~~~~l~~~~~LD~E~~~~~~l~v~a~D~g~p~~~~~~~v~I~V~d  255 (483)
                      +|+|.+.++.++|+|.++.....|.|++   .+|.    |.+.+.||||....|.|.|.|+|.+.|.+++++.++|.|.|
T Consensus       127 ~D~D~~~~~~~~y~l~~~~~~~~f~i~~---~~G~----i~~~~~ld~e~~~~~~l~v~a~D~~~~~~~~~~~i~i~v~d  199 (199)
T cd00031         127 TDADSGENAKLTYSILSGNDKELFSIDP---NTGI----ITLAKPLDREEKSSYELTVVATDGGGPPLSSTATVTVTVLD  199 (199)
T ss_pred             EcCCCCCCccEEEEEeCCCCCCEEEEeC---CceE----EEeCCccCCccCceEEEEEEEEECCCCCceeEEEEEEEEEC
Confidence            9999988899999999876557999998   4566    77788999999999999999999988889999999999875


No 7  
>PF00028 Cadherin:  Cadherin domain;  InterPro: IPR002126 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion.; GO: 0005509 calcium ion binding, 0007156 homophilic cell adhesion, 0016020 membrane; PDB: 2A4E_A 2A4C_B 2O72_A 2QVI_A 1NCJ_A 3Q2W_A 3Q2N_A 3LNH_B 3LNI_A 3Q2L_A ....
Probab=99.70  E-value=3.5e-16  Score=125.80  Aligned_cols=89  Identities=46%  Similarity=0.583  Sum_probs=84.0

Q ss_pred             eeeccCCCCeEEEEEEEEeCCCCCCceEEEEEeeCCCCCCCcEEEeCCCcEEEEcccCCCCCCCeeEEEEEEEEC-CcCC
Q psy8629         348 SFDEIAEPGTSVFQISAFDQDEGINSVITYSLRENTDDHPSWFQIDSKSGLITMKSNVDCETDPVPKLTVIATDN-GVPA  426 (483)
Q Consensus       348 ~v~E~~~~gt~v~~v~a~D~D~~~~~~v~y~l~~~~~~~~~~F~id~~tG~i~~~~~lD~E~~~~y~l~V~a~D~-g~p~  426 (483)
                      .+.|++++|+.++++.|.|+|.+.|+.+.|+|..  ++...+|.|++.+|.|++.+.||||..+.|.|.|.|+|. |.|+
T Consensus         4 ~v~E~~~~g~~v~~v~a~D~D~~~n~~i~y~i~~--~~~~~~F~I~~~tg~i~~~~~LD~E~~~~y~l~v~a~D~~~~~~   81 (93)
T PF00028_consen    4 SVPENAPPGTVVGQVTATDPDSGPNSQITYSILG--GNPDGLFSIDPNTGEISLKKPLDRETQSSYQLTVRATDSGGSPP   81 (93)
T ss_dssp             EEETTGSTSSEEEEEEEEESSTSTTSSEEEEEEE--TTSTTSEEEETTTTEEEESSSSCTTTTSEEEEEEEEEETTTSSE
T ss_pred             EEECCCCCCCEEEEEEEEeCCCCCCceEEEEEec--CcccCceEEeeeeeccccceecCcccCCEEEEEEEEEECCCCCC
Confidence            5789999999999999999999999999999994  445899999999999999999999999999999999999 8999


Q ss_pred             ceEEEEEEEEEE
Q psy8629         427 LSSSATVLVTIH  438 (483)
Q Consensus       427 ~s~~~~v~I~V~  438 (483)
                      ++++++|.|+|+
T Consensus        82 ~~~~~~V~I~V~   93 (93)
T PF00028_consen   82 LSSTATVTINVL   93 (93)
T ss_dssp             EEEEEEEEEEEE
T ss_pred             CEEEEEEEEEEC
Confidence            999999999985


No 8  
>PF00028 Cadherin:  Cadherin domain;  InterPro: IPR002126 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion.; GO: 0005509 calcium ion binding, 0007156 homophilic cell adhesion, 0016020 membrane; PDB: 2A4E_A 2A4C_B 2O72_A 2QVI_A 1NCJ_A 3Q2W_A 3Q2N_A 3LNH_B 3LNI_A 3Q2L_A ....
Probab=99.70  E-value=5.9e-16  Score=124.51  Aligned_cols=92  Identities=33%  Similarity=0.552  Sum_probs=86.5

Q ss_pred             eEEEEeCCCCCCeEEEEEEEEeCCCCCCceEEEEEEeCCCCCceEEecccCCCceEEEEEEeccCCCCCCCCEEEEEEEE
Q psy8629         156 MSIEFPENTPRDVKRTLNAAKDRDLGIYNTQRYNIVSGNTNNAFRLSSHREKDGVLYLDLQINGVLDRETIPEYSLVIEA  235 (483)
Q Consensus       156 ~~~~V~E~~~~gt~v~~v~a~D~D~~~~~~~~y~l~~~~~~~~F~i~~~~~~~g~~~~~l~~~~~LD~E~~~~~~l~v~a  235 (483)
                      |.+.|+|++++|+.++++.|.|+|.+.|+.+.|+|.+++....|.|++.   +|.    |.+.++||||..+.|.|.|.|
T Consensus         1 Y~~~v~E~~~~g~~v~~v~a~D~D~~~n~~i~y~i~~~~~~~~F~I~~~---tg~----i~~~~~LD~E~~~~y~l~v~a   73 (93)
T PF00028_consen    1 YSFSVPENAPPGTVVGQVTATDPDSGPNSQITYSILGGNPDGLFSIDPN---TGE----ISLKKPLDRETQSSYQLTVRA   73 (93)
T ss_dssp             EEEEEETTGSTSSEEEEEEEEESSTSTTSSEEEEEEETTSTTSEEEETT---TTE----EEESSSSCTTTTSEEEEEEEE
T ss_pred             CEEEEECCCCCCCEEEEEEEEeCCCCCCceEEEEEecCcccCceEEeee---eec----cccceecCcccCCEEEEEEEE
Confidence            7899999999999999999999999999999999999988899999995   455    888999999999999999999


Q ss_pred             EEC-CCCCeeeEEEEEEEEE
Q psy8629         236 LDG-GSPPLRGSMIVNVTIQ  254 (483)
Q Consensus       236 ~D~-g~p~~~~~~~v~I~V~  254 (483)
                      +|. |.|++++++.|+|+|+
T Consensus        74 ~D~~~~~~~~~~~~V~I~V~   93 (93)
T PF00028_consen   74 TDSGGSPPLSSTATVTINVL   93 (93)
T ss_dssp             EETTTSSEEEEEEEEEEEEE
T ss_pred             EECCCCCCCEEEEEEEEEEC
Confidence            999 8999999999999985


No 9  
>KOG1834|consensus
Probab=99.67  E-value=4.4e-15  Score=148.85  Aligned_cols=212  Identities=26%  Similarity=0.300  Sum_probs=161.9

Q ss_pred             EecCCCCCCccCCCceEEEEeCCCCCCeEEEEEEEEeCCCCC---CceEEEEEEeCCCCCceEEecccCCCceEEEEEEe
Q psy8629         141 VIDENDNAPTFPSPFMSIEFPENTPRDVKRTLNAAKDRDLGI---YNTQRYNIVSGNTNNAFRLSSHREKDGVLYLDLQI  217 (483)
Q Consensus       141 V~DvND~~P~f~~~~~~~~V~E~~~~gt~v~~v~a~D~D~~~---~~~~~y~l~~~~~~~~F~i~~~~~~~g~~~~~l~~  217 (483)
                      .--+|-+.| +....|...|.||-..-....-+.|-|.|.+.   .....|.|.+.  .-+|.+......+|.-.  |+.
T Consensus        23 aarankhkp-wie~ey~gvV~Endntvll~Ppl~aLdkdaplr~ageiC~fklhgq--~vPFdavVvdK~TGegv--lRa   97 (952)
T KOG1834|consen   23 AARANKHKP-WIEEEYHGVVTENDNTVLLDPPLAALDKDAPLRYAGEICGFKLHGQ--PVPFDAVVVDKYTGEGV--LRA   97 (952)
T ss_pred             cccccccCc-ccccceeEEEEeCCceEEeCCCeeeecCCCCcccccccceeEecCC--CCCceEEEEeccCCceE--Eee
Confidence            345677777 55677999999997644444457888999742   12346777653  44777655544555433  778


Q ss_pred             ccCCCCCCCCEEEEEEEEEECCC-C-----CeeeEEEEEEEEEecCCCCCeecCCceEEEecCCCCCCcEEEEEEEEeCC
Q psy8629         218 NGVLDRETIPEYSLVIEALDGGS-P-----PLRGSMIVNVTIQDVNDNQPIFNQSRYFATVPENATVGTSIFQVFATDAD  291 (483)
Q Consensus       218 ~~~LD~E~~~~~~l~v~a~D~g~-p-----~~~~~~~v~I~V~dvNd~~P~f~~~~~~~~v~E~~~~gt~v~~v~a~D~D  291 (483)
                      ..+||.|.++.|+|+|+|.|.|. |     ..+..++|.|+|.|+|+.+|+|..+.|.+.|.|. ..-..|++|.|.|.|
T Consensus        98 K~~lDCelqkeytf~iQAydCg~gpdgtn~kKShkatvhIrVkDvNe~AP~f~ep~Yka~V~EG-K~yd~il~veAiD~D  176 (952)
T KOG1834|consen   98 KEPLDCELQKEYTFTIQAYDCGNGPDGTNTKKSHKATVHIRVKDVNEFAPVFKEPWYKAHVTEG-KVYDSILRVEAIDKD  176 (952)
T ss_pred             cCcccccccccceEEEEEEecCCCCCccccccccceEEEEEeccccccCchhcccceeeEEecc-eeeeeeEEEEeecCC
Confidence            89999999999999999999664 3     3567789999999999999999999999999998 567788999999999


Q ss_pred             CCCccEEEEEEEecCCCCCcceEEeCCccEEEEcccCCCcccceeeeEeEEEEEeeeeeccCCCCeEEEEEEEEeCCCCC
Q psy8629         292 AGSNGEVEYSINRRQSDKDGLFNIDSKSGLIFVNKALDFETKEVHDLVPRFAVEISSFDEIAEPGTSVFQISAFDQDEGI  371 (483)
Q Consensus       292 ~~~~~~i~y~i~~~~~~~~~~F~Id~~tG~l~~~~~LD~E~~~~~~l~v~~~~~v~~v~E~~~~gt~v~~v~a~D~D~~~  371 (483)
                      .++                                                                            +
T Consensus       177 Csp----------------------------------------------------------------------------q  180 (952)
T KOG1834|consen  177 CSP----------------------------------------------------------------------------Q  180 (952)
T ss_pred             CCC----------------------------------------------------------------------------c
Confidence            754                                                                            1


Q ss_pred             CceE-EEEEeeCCCCCCCcEEEeCCCcEEEEcccCCCCCCCeeEEEEEEEECCcCCceEEEEEEEEEEe
Q psy8629         372 NSVI-TYSLRENTDDHPSWFQIDSKSGLITMKSNVDCETDPVPKLTVIATDNGVPALSSSATVLVTIHD  439 (483)
Q Consensus       372 ~~~v-~y~l~~~~~~~~~~F~id~~tG~i~~~~~lD~E~~~~y~l~V~a~D~g~p~~s~~~~v~I~V~d  439 (483)
                      +++| .|.|.    +..-.|.||. .|.|+...+|+|.+...|.|+|+|.|.|..+..+.+.|+|.|..
T Consensus       181 ~sqIC~YEI~----t~d~PFaIdn-~G~irnTekLny~ke~~Y~ltVtAyDCg~kraa~d~lV~v~Vkp  244 (952)
T KOG1834|consen  181 YSQICEYEIT----TPDVPFAIDN-DGNIRNTEKLNYTKEHQYKLTVTAYDCGKKRAASDSLVTVHVKP  244 (952)
T ss_pred             ccceeEEEec----CCCCceEEcC-CCccccccccccccceeEEEEEEEEecccccccCcceEEEEecC
Confidence            2222 35555    2456799985 59999999999999999999999999998655555777887764


No 10 
>smart00112 CA Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
Probab=99.65  E-value=1.1e-15  Score=118.86  Aligned_cols=79  Identities=48%  Similarity=0.641  Sum_probs=73.0

Q ss_pred             EeCCCCCCceEEEEEeeCCCCCCCcEEEeCCCcEEEEcccCCCCCCCeeEEEEEEEECCcCCceEEEEEEEEEEecCCCC
Q psy8629         365 FDQDEGINSVITYSLRENTDDHPSWFQIDSKSGLITMKSNVDCETDPVPKLTVIATDNGVPALSSSATVLVTIHDVNDNE  444 (483)
Q Consensus       365 ~D~D~~~~~~v~y~l~~~~~~~~~~F~id~~tG~i~~~~~lD~E~~~~y~l~V~a~D~g~p~~s~~~~v~I~V~dvNd~~  444 (483)
                      +|+|.|.|+.++|+|...  +...+|.|++.+|.|.+.++||||....|.|.|+|+|.|.|+++++++|.|.|.|+|||+
T Consensus         1 ~D~D~g~n~~i~Y~i~~~--~~~~~F~i~~~tg~i~~~~~LD~e~~~~y~l~v~a~D~~~~~~~~~~~v~I~V~D~Nd~~   78 (79)
T smart00112        1 TDADSGENGKVTYSILSG--NEDGLFSIDPETGEITTTKPLDREEQPEYTLTVEATDGGGPPLSSTATVTVTVLDVNDNA   78 (79)
T ss_pred             CCCCCCcCcEEEEEEecC--CCCCEEEEeCCccEEEeCCccCeeCCCeEEEEEEEEECCCCCcccEEEEEEEEEECCCCC
Confidence            488999999999999943  333899999999999999999999999999999999999999999999999999999999


Q ss_pred             C
Q psy8629         445 P  445 (483)
Q Consensus       445 P  445 (483)
                      |
T Consensus        79 P   79 (79)
T smart00112       79 P   79 (79)
T ss_pred             C
Confidence            8


No 11 
>smart00112 CA Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
Probab=99.57  E-value=2.1e-14  Score=111.66  Aligned_cols=79  Identities=44%  Similarity=0.743  Sum_probs=71.7

Q ss_pred             EeCCCCCCceEEEEEEeCCCCCceEEecccCCCceEEEEEEeccCCCCCCCCEEEEEEEEEECCCCCeeeEEEEEEEEEe
Q psy8629         176 KDRDLGIYNTQRYNIVSGNTNNAFRLSSHREKDGVLYLDLQINGVLDRETIPEYSLVIEALDGGSPPLRGSMIVNVTIQD  255 (483)
Q Consensus       176 ~D~D~~~~~~~~y~l~~~~~~~~F~i~~~~~~~g~~~~~l~~~~~LD~E~~~~~~l~v~a~D~g~p~~~~~~~v~I~V~d  255 (483)
                      +|+|.|.|+.+.|+|.+++...+|.|++.+   |.    +.+.++||||..+.|.|.|+|+|.|.|++++.+.|+|+|.|
T Consensus         1 ~D~D~g~n~~i~Y~i~~~~~~~~F~i~~~t---g~----i~~~~~LD~e~~~~y~l~v~a~D~~~~~~~~~~~v~I~V~D   73 (79)
T smart00112        1 TDADSGENGKVTYSILSGNEDGLFSIDPET---GE----ITTTKPLDREEQPEYTLTVEATDGGGPPLSSTATVTVTVLD   73 (79)
T ss_pred             CCCCCCcCcEEEEEEecCCCCCEEEEeCCc---cE----EEeCCccCeeCCCeEEEEEEEEECCCCCcccEEEEEEEEEE
Confidence            488999999999999987755899999854   54    67788999999999999999999999999999999999999


Q ss_pred             cCCCCC
Q psy8629         256 VNDNQP  261 (483)
Q Consensus       256 vNd~~P  261 (483)
                      +|||+|
T Consensus        74 ~Nd~~P   79 (79)
T smart00112       74 VNDNAP   79 (79)
T ss_pred             CCCCCC
Confidence            999998


No 12 
>KOG1834|consensus
Probab=99.52  E-value=8.3e-13  Score=132.80  Aligned_cols=204  Identities=20%  Similarity=0.209  Sum_probs=150.6

Q ss_pred             hcCceEEEEEeCCCCCCeEEEEEEccCCCC------CCCCEEEEecCCCCCcccEEEeCCcc--EEEeCccCCccCCcee
Q psy8629          15 ADEHLRDLEISEGVPVGTRVGFIGDGFVGD------SGPPYLIVPVPGSAVDTDLSIEQNTG--EIRTKVPLDRETRSSY   86 (483)
Q Consensus        15 ~~~~~~~~~v~En~~~gt~V~~v~a~d~d~------~~~~y~i~~~~~~~~~~~F~Id~~tG--~i~~~~~LD~E~~~~y   86 (483)
                      +...+|..-|.||--.=...--+.|.|.|.      .--.|.|-..  +-+-+.--+|+.||  .|+.+.+||-|.++.|
T Consensus        32 wie~ey~gvV~Endntvll~Ppl~aLdkdaplr~ageiC~fklhgq--~vPFdavVvdK~TGegvlRaK~~lDCelqkey  109 (952)
T KOG1834|consen   32 WIEEEYHGVVTENDNTVLLDPPLAALDKDAPLRYAGEICGFKLHGQ--PVPFDAVVVDKYTGEGVLRAKEPLDCELQKEY  109 (952)
T ss_pred             ccccceeEEEEeCCceEEeCCCeeeecCCCCcccccccceeEecCC--CCCceEEEEeccCCceEEeecCcccccccccc
Confidence            556789999999975322223466666542      1124666532  11111233466664  7888889999999999


Q ss_pred             eEEEEecCCcceeEEEeecCcEEEEcccCCccccceEEEEEEeCCCCeEEEEEEEecCCCCCCccCCCceEEEEeCCCCC
Q psy8629          87 SLVAIPLSGENVRVVSFYEPGEIRTKVPLDRETRSSYSLVAIPLSGENVRVVVRVIDENDNAPTFPSPFMSIEFPENTPR  166 (483)
Q Consensus        87 ~l~v~a~d~~~~~~~~~~~~g~l~~~~~ld~e~~~~~~~~~~~~~~~~~~v~I~V~DvND~~P~f~~~~~~~~V~E~~~~  166 (483)
                      +++|+|.|.++.-      +|                   .....+..+++.|+|.|+|+.+|+|..+.|.+.|.|....
T Consensus       110 tf~iQAydCg~gp------dg-------------------tn~kKShkatvhIrVkDvNe~AP~f~ep~Yka~V~EGK~y  164 (952)
T KOG1834|consen  110 TFTIQAYDCGNGP------DG-------------------TNTKKSHKATVHIRVKDVNEFAPVFKEPWYKAHVTEGKVY  164 (952)
T ss_pred             eEEEEEEecCCCC------Cc-------------------cccccccceEEEEEeccccccCchhcccceeeEEecceee
Confidence            9999998865310      00                   0011233689999999999999999999999999998655


Q ss_pred             CeEEEEEEEEeCCCCC-Cce-EEEEEEeCCCCCceEEecccCCCceEEEEEEeccCCCCCCCCEEEEEEEEEECCCCCee
Q psy8629         167 DVKRTLNAAKDRDLGI-YNT-QRYNIVSGNTNNAFRLSSHREKDGVLYLDLQINGVLDRETIPEYSLVIEALDGGSPPLR  244 (483)
Q Consensus       167 gt~v~~v~a~D~D~~~-~~~-~~y~l~~~~~~~~F~i~~~~~~~g~~~~~l~~~~~LD~E~~~~~~l~v~a~D~g~p~~~  244 (483)
                       ..|.++.|.|.|.++ ++. ..|.|..  +.-+|.|+.    .|.    |+...+|+|.+...|.|+|.|.|.|..+..
T Consensus       165 -d~il~veAiD~DCspq~sqIC~YEI~t--~d~PFaIdn----~G~----irnTekLny~ke~~Y~ltVtAyDCg~kraa  233 (952)
T KOG1834|consen  165 -DSILRVEAIDKDCSPQYSQICEYEITT--PDVPFAIDN----DGN----IRNTEKLNYTKEHQYKLTVTAYDCGKKRAA  233 (952)
T ss_pred             -eeeEEEEeecCCCCCcccceeEEEecC--CCCceEEcC----CCc----cccccccccccceeEEEEEEEEeccccccc
Confidence             678899999999874 444 4899987  566999994    566    888999999999999999999999986666


Q ss_pred             eEEEEEEEEEec
Q psy8629         245 GSMIVNVTIQDV  256 (483)
Q Consensus       245 ~~~~v~I~V~dv  256 (483)
                      +...|+|.|...
T Consensus       234 ~d~lV~v~Vkp~  245 (952)
T KOG1834|consen  234 SDSLVTVHVKPT  245 (952)
T ss_pred             CcceEEEEecCc
Confidence            668899998765


No 13 
>PF08266 Cadherin_2:  Cadherin-like;  InterPro: IPR013164 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion. This entry represents a cadherin domain that is usually found at the N terminus of cadherin proteins.; PDB: 1WUZ_A 1WYJ_A.
Probab=98.35  E-value=1.4e-06  Score=67.57  Aligned_cols=62  Identities=24%  Similarity=0.324  Sum_probs=43.3

Q ss_pred             eEEEEEeCCCCCCeEEEEEEccCCC-----CCCCCEEEEecCCCCCcccEEEeCCccEEEeCccCCccCCc
Q psy8629          19 LRDLEISEGVPVGTRVGFIGDGFVG-----DSGPPYLIVPVPGSAVDTDLSIEQNTGEIRTKVPLDRETRS   84 (483)
Q Consensus        19 ~~~~~v~En~~~gt~V~~v~a~d~d-----~~~~~y~i~~~~~~~~~~~F~Id~~tG~i~~~~~LD~E~~~   84 (483)
                      +.+++|+|+.++||+||.+ |.|.+     -....|.+.+..   ...+|.+++.||.|+++.++|||+.+
T Consensus         2 qi~YsV~EE~~~Gt~IGni-a~dL~l~~~~l~~~~~ri~s~~---~~~~~~v~~~tG~L~v~~rIDRE~LC   68 (84)
T PF08266_consen    2 QIRYSVPEEMPPGTVIGNI-AKDLGLDPQSLSSRNFRIVSEG---NSQYFRVNEKTGDLFVSERIDREELC   68 (84)
T ss_dssp             EEEEEEESS--TT-EEEEC-CCCCT--HHHHCCTTBEEE-SS---SS-SEEE-TTTSEEEESS--SCCCC-
T ss_pred             CeEEEeecCCCCCCEEEEh-HHhhCCCcccccccceEEeecC---CcceeEecCCceeEEeCCccCHHHHC
Confidence            6789999999999999999 55643     224578887642   35699999999999999999999976


No 14 
>PF08758 Cadherin_pro:  Cadherin prodomain like;  InterPro: IPR014868 Cadherins are a group of proteins that mediate calcium dependent cell-cell adhesion. They are activated through cleavage of a prosequence in the late Golgi. This protein corresponds to the folded region of the prosequence, and is termed the prodomain. The prodomain shows structural resemblance to the cadherin domain, but lacks all the features known to be important for cadherin-cadherin interactions []. ; GO: 0007155 cell adhesion, 0016021 integral to membrane; PDB: 1OP4_A.
Probab=97.36  E-value=0.0016  Score=51.23  Aligned_cols=64  Identities=11%  Similarity=0.164  Sum_probs=31.9

Q ss_pred             HhhhcCceEEEEEeCCCCCCeEEEEEEccCCCCC-CCCEEEEecCCCCCcccEEEeCCccEEEeCccCCccCC
Q psy8629          12 AVAADEHLRDLEISEGVPVGTRVGFIGDGFVGDS-GPPYLIVPVPGSAVDTDLSIEQNTGEIRTKVPLDRETR   83 (483)
Q Consensus        12 ~~~~~~~~~~~~v~En~~~gt~V~~v~a~d~d~~-~~~y~i~~~~~~~~~~~F~Id~~tG~i~~~~~LD~E~~   83 (483)
                      -++|.++.|.+.|+.+...|..||+|+-.|..++ .+.|...+       ..|+|.+ .|.|++++++.....
T Consensus         3 ~pGF~~~~~~~~Vp~~l~~g~~lg~V~f~dC~~~~~~~~~ssD-------pdF~V~~-DGsVy~~r~v~l~~~   67 (90)
T PF08758_consen    3 RPGFSQKKYTFEVPSNLEAGQPLGKVNFEDCTGRRRVIFESSD-------PDFRVLE-DGSVYAKRPVQLSSE   67 (90)
T ss_dssp             --B--S-EEEE----SS-SS--EEE---B--SS---EEEE----------SEEEEET-TTEEEEES--S-SSS
T ss_pred             cCCcccceEEEEcCchhhCCcEEEEEEeccCCCCCceEEecCC-------CCEEEcC-CCeEEEeeeEecCCC
Confidence            4789999999999999999999999999996543 45666553       2799995 599999998876443


No 15 
>smart00736 CADG Dystroglycan-type cadherin-like domains. Cadherin-homologous domains present in metazoan dystroglycans and alpha/epsilon sarcoglycans, yeast Axl2p and in a very large protein from magnetotactic bacteria. Likely to bind calcium ions.
Probab=96.58  E-value=0.054  Score=43.44  Aligned_cols=72  Identities=31%  Similarity=0.412  Sum_probs=54.6

Q ss_pred             EEeCCCCCCceEEEEEeeCCC-CCCCcEEEeCCCcEEEEcccCCCCCCCeeEEEEEEEECCcCCceEEEEEEEEEEecCC
Q psy8629         364 AFDQDEGINSVITYSLRENTD-DHPSWFQIDSKSGLITMKSNVDCETDPVPKLTVIATDNGVPALSSSATVLVTIHDVND  442 (483)
Q Consensus       364 a~D~D~~~~~~v~y~l~~~~~-~~~~~F~id~~tG~i~~~~~lD~E~~~~y~l~V~a~D~g~p~~s~~~~v~I~V~dvNd  442 (483)
                      ..|+|   +..++|++....+ ....|...|+.++.+.-. +.. +..+.|.++|+|+|+..  .++...++|.|.+.|+
T Consensus        24 F~d~d---~~~lty~~~~~~~~~lP~Wl~fd~~~~~~~Gt-P~~-~~~g~~~i~v~a~D~~g--~~~~~~f~i~V~~~~~   96 (97)
T smart00736       24 FTDAD---GDTLTYSATLSDGSALPSWLSFDSDTGTLSGT-PTN-SDVGSLSLKVTATDSSG--ASASDTFTITVVNTND   96 (97)
T ss_pred             eECCC---CCeEEEEEEeCCCCCCCCeEEEeCCCCEEEEE-CCC-CCCcEEEEEEEEEECCC--CEEEEEEEEEEeCCCC
Confidence            46776   4688999874322 456799999998887763 333 34577999999999864  5788899999999887


No 16 
>PF08266 Cadherin_2:  Cadherin-like;  InterPro: IPR013164 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion. This entry represents a cadherin domain that is usually found at the N terminus of cadherin proteins.; PDB: 1WUZ_A 1WYJ_A.
Probab=96.51  E-value=0.0018  Score=50.35  Aligned_cols=60  Identities=23%  Similarity=0.337  Sum_probs=37.7

Q ss_pred             eeeccCCCCeEEEEEEEEeCCCCCC--ceEEEEEeeCCCCCCCcEEEeCCCcEEEEcccCCCCCC
Q psy8629         348 SFDEIAEPGTSVFQISAFDQDEGIN--SVITYSLRENTDDHPSWFQIDSKSGLITMKSNVDCETD  410 (483)
Q Consensus       348 ~v~E~~~~gt~v~~v~a~D~D~~~~--~~v~y~l~~~~~~~~~~F~id~~tG~i~~~~~lD~E~~  410 (483)
                      +|+|..++|+.||.| |.|--....  ....|++..  .....+|.+++.+|.|.+...+|||.-
T Consensus         6 sV~EE~~~Gt~IGni-a~dL~l~~~~l~~~~~ri~s--~~~~~~~~v~~~tG~L~v~~rIDRE~L   67 (84)
T PF08266_consen    6 SVPEEMPPGTVIGNI-AKDLGLDPQSLSSRNFRIVS--EGNSQYFRVNEKTGDLFVSERIDREEL   67 (84)
T ss_dssp             EEESS--TT-EEEEC-CCCCT--HHHHCCTTBEEE---SSSS-SEEE-TTTSEEEESS--SCCCC
T ss_pred             EeecCCCCCCEEEEh-HHhhCCCcccccccceEEee--cCCcceeEecCCceeEEeCCccCHHHH
Confidence            678999999999999 444432111  123577663  346789999999999999999999973


No 17 
>TIGR01965 VCBS_repeat VCBS repeat. This domain of about 100 residues is found multiple (up to 35) copies in long proteins from several species of Vibrio, Colwellia, Bradyrhizobium, and Shewanella (hence the name VCBS) and in smaller copy numbers in proteins from several other bacteria. The large protein size and repeat copy numbers, species distribution, and suggested activities of several member proteins suggests a role for this domain in adhesion.
Probab=96.31  E-value=0.038  Score=44.12  Aligned_cols=92  Identities=18%  Similarity=0.234  Sum_probs=58.1

Q ss_pred             EEEEEEeCCCCCCceEEEEEEe-CCCCCceEEecccCCCceEEEEEE----eccCCCCCCCCEEEEEEEEEECCCCCeee
Q psy8629         171 TLNAAKDRDLGIYNTQRYNIVS-GNTNNAFRLSSHREKDGVLYLDLQ----INGVLDRETIPEYSLVIEALDGGSPPLRG  245 (483)
Q Consensus       171 ~~v~a~D~D~~~~~~~~y~l~~-~~~~~~F~i~~~~~~~g~~~~~l~----~~~~LD~E~~~~~~l~v~a~D~g~p~~~~  245 (483)
                      +++.++|+|.+..  ..+++.. ...-+.|.|+.    +|...+.+-    ..+.|.--+.-.-.|++++.|+.      
T Consensus         2 G~Lt~sD~D~gd~--~~~s~~~~~g~yGtlti~~----~G~wtYtl~n~~~avq~L~~Ge~~tdsFtvtv~DGt------   69 (99)
T TIGR01965         2 GQLTISDADAGQA--HFIAQTDAAGQYGTFSIDA----DGQWTYQADNSQTAVQALKAGETLTDTFTVTSADGT------   69 (99)
T ss_pred             CceEEeCCCCCCc--eEEecccccCCcEEEEECC----CCcEEEEeCCCcHHHHhhcCCCEEEEEEEEEEeCCC------
Confidence            4688999998753  4555532 12345688875    465554332    12334433444677888889962      


Q ss_pred             EEEEEEEEEecCCCCCeecCCceEEEecCCC
Q psy8629         246 SMIVNVTIQDVNDNQPIFNQSRYFATVPENA  276 (483)
Q Consensus       246 ~~~v~I~V~dvNd~~P~f~~~~~~~~v~E~~  276 (483)
                      +++|+|+|...|| +|+.... -...+.|+.
T Consensus        70 t~~vtItI~GtND-apvi~~~-~~g~v~ED~   98 (99)
T TIGR01965        70 SQTVTITITGAND-AAVIGGA-DTGSVTEDS   98 (99)
T ss_pred             eEEEEEEEEccCC-CCEEecc-cceeEecCC
Confidence            7889999999999 7766433 236777763


No 18 
>TIGR01965 VCBS_repeat VCBS repeat. This domain of about 100 residues is found multiple (up to 35) copies in long proteins from several species of Vibrio, Colwellia, Bradyrhizobium, and Shewanella (hence the name VCBS) and in smaller copy numbers in proteins from several other bacteria. The large protein size and repeat copy numbers, species distribution, and suggested activities of several member proteins suggests a role for this domain in adhesion.
Probab=96.08  E-value=0.079  Score=42.32  Aligned_cols=89  Identities=22%  Similarity=0.281  Sum_probs=57.5

Q ss_pred             EEEEEEeCCCCCCceEEEEEeeCCCCCCCcEEEeCCCcEEE-Ec-------ccCCCCCCCeeEEEEEEEECCcCCceEEE
Q psy8629         360 FQISAFDQDEGINSVITYSLRENTDDHPSWFQIDSKSGLIT-MK-------SNVDCETDPVPKLTVIATDNGVPALSSSA  431 (483)
Q Consensus       360 ~~v~a~D~D~~~~~~v~y~l~~~~~~~~~~F~id~~tG~i~-~~-------~~lD~E~~~~y~l~V~a~D~g~p~~s~~~  431 (483)
                      ++|.++|+|.+  ....+++... ...-+.|.|++ .|.-. ..       +.|.-.+...-.++|.++|+      .+.
T Consensus         2 G~Lt~sD~D~g--d~~~~s~~~~-~g~yGtlti~~-~G~wtYtl~n~~~avq~L~~Ge~~tdsFtvtv~DG------tt~   71 (99)
T TIGR01965         2 GQLTISDADAG--QAHFIAQTDA-AGQYGTFSIDA-DGQWTYQADNSQTAVQALKAGETLTDTFTVTSADG------TSQ   71 (99)
T ss_pred             CceEEeCCCCC--CceEEecccc-cCCcEEEEECC-CCcEEEEeCCCcHHHHhhcCCCEEEEEEEEEEeCC------CeE
Confidence            56899999965  4456666422 23456799987 45322 11       22333334467888999995      278


Q ss_pred             EEEEEEEecCCCCCeecCCceEEEEcCCC
Q psy8629         432 TVLVTIHDVNDNEPIFDQSFYNVTIPENK  460 (483)
Q Consensus       432 ~v~I~V~dvNd~~P~f~~~~~~~~V~E~~  460 (483)
                      +|+|+|...|| +|+..... ...|.|+.
T Consensus        72 ~vtItI~GtND-apvi~~~~-~g~v~ED~   98 (99)
T TIGR01965        72 TVTITITGAND-AAVIGGAD-TGSVTEDS   98 (99)
T ss_pred             EEEEEEEccCC-CCEEeccc-ceeEecCC
Confidence            89999999999 66655432 56777764


No 19 
>PF08758 Cadherin_pro:  Cadherin prodomain like;  InterPro: IPR014868 Cadherins are a group of proteins that mediate calcium dependent cell-cell adhesion. They are activated through cleavage of a prosequence in the late Golgi. This protein corresponds to the folded region of the prosequence, and is termed the prodomain. The prodomain shows structural resemblance to the cadherin domain, but lacks all the features known to be important for cadherin-cadherin interactions []. ; GO: 0007155 cell adhesion, 0016021 integral to membrane; PDB: 1OP4_A.
Probab=95.95  E-value=0.1  Score=41.09  Aligned_cols=79  Identities=16%  Similarity=0.195  Sum_probs=40.8

Q ss_pred             CCccCCCceEEEEeCCCCCCeEEEEEEEEeCCCCCCceEEEEEEeCCCCCceEEecccCCCceEEEEEEeccCCCCCCCC
Q psy8629         148 APTFPSPFMSIEFPENTPRDVKRTLNAAKDRDLGIYNTQRYNIVSGNTNNAFRLSSHREKDGVLYLDLQINGVLDRETIP  227 (483)
Q Consensus       148 ~P~f~~~~~~~~V~E~~~~gt~v~~v~a~D~D~~~~~~~~y~l~~~~~~~~F~i~~~~~~~g~~~~~l~~~~~LD~E~~~  227 (483)
                      .|-|....|.+.|+.+...|..+++|.-.|-...  ..+.|...  +  ..|.|.    .+|.    |.+.+++..... 
T Consensus         3 ~pGF~~~~~~~~Vp~~l~~g~~lg~V~f~dC~~~--~~~~~~ss--D--pdF~V~----~DGs----Vy~~r~v~l~~~-   67 (90)
T PF08758_consen    3 RPGFSQKKYTFEVPSNLEAGQPLGKVNFEDCTGR--RRVIFESS--D--PDFRVL----EDGS----VYAKRPVQLSSE-   67 (90)
T ss_dssp             --B--S-EEEE----SS-SS--EEE---B--SS-----EEEE-------SEEEEE----TTTE----EEEES--S-SSS-
T ss_pred             cCCcccceEEEEcCchhhCCcEEEEEEeccCCCC--CceEEecC--C--CCEEEc----CCCe----EEEeeeEecCCC-
Confidence            5889999999999999999999999999888533  35667653  2  289999    5687    566777766443 


Q ss_pred             EEEEEEEEEECCCC
Q psy8629         228 EYSLVIEALDGGSP  241 (483)
Q Consensus       228 ~~~l~v~a~D~g~p  241 (483)
                      .-.|+|.|.|..+.
T Consensus        68 ~~~F~V~a~D~~~~   81 (90)
T PF08758_consen   68 QRSFTVHAWDSQTQ   81 (90)
T ss_dssp             -EEEEEEEEETTTT
T ss_pred             ceEEEEEEECCCCC
Confidence            35799999997763


No 20 
>smart00736 CADG Dystroglycan-type cadherin-like domains. Cadherin-homologous domains present in metazoan dystroglycans and alpha/epsilon sarcoglycans, yeast Axl2p and in a very large protein from magnetotactic bacteria. Likely to bind calcium ions.
Probab=95.27  E-value=0.29  Score=39.12  Aligned_cols=70  Identities=23%  Similarity=0.328  Sum_probs=48.4

Q ss_pred             EEeCCCCCCceEEEEEEeCC---CCCceEEecccCCCceEEEEEEeccCCCCCCCCEEEEEEEEEECCCCCeeeEEEEEE
Q psy8629         175 AKDRDLGIYNTQRYNIVSGN---TNNAFRLSSHREKDGVLYLDLQINGVLDRETIPEYSLVIEALDGGSPPLRGSMIVNV  251 (483)
Q Consensus       175 a~D~D~~~~~~~~y~l~~~~---~~~~F~i~~~~~~~g~~~~~l~~~~~LD~E~~~~~~l~v~a~D~g~p~~~~~~~v~I  251 (483)
                      ..|+| +  ..++|++...+   -..+.++++.   ++.++     ..+...+ ...|.+.|.|+|+.+  .++...++|
T Consensus        24 F~d~d-~--~~lty~~~~~~~~~lP~Wl~fd~~---~~~~~-----GtP~~~~-~g~~~i~v~a~D~~g--~~~~~~f~i   89 (97)
T smart00736       24 FTDAD-G--DTLTYSATLSDGSALPSWLSFDSD---TGTLS-----GTPTNSD-VGSLSLKVTATDSSG--ASASDTFTI   89 (97)
T ss_pred             eECCC-C--CeEEEEEEeCCCCCCCCeEEEeCC---CCEEE-----EECCCCC-CcEEEEEEEEEECCC--CEEEEEEEE
Confidence            46777 2  37889886432   2457788874   34432     2344333 467999999999876  678889999


Q ss_pred             EEEecCC
Q psy8629         252 TIQDVND  258 (483)
Q Consensus       252 ~V~dvNd  258 (483)
                      .|.+.|+
T Consensus        90 ~V~~~~~   96 (97)
T smart00736       90 TVVNTND   96 (97)
T ss_pred             EEeCCCC
Confidence            9999886


No 21 
>PF13750 Big_3_3:  Bacterial Ig-like domain (group 3)
Probab=94.29  E-value=3.5  Score=36.25  Aligned_cols=82  Identities=16%  Similarity=0.213  Sum_probs=47.5

Q ss_pred             EEEEEEeCCCCCCceEEEEEEeCCCCCceEEecccCCCceEEEEEEeccCC-CCCCCCEEEEEEEEEECCCCCeeeEEEE
Q psy8629         171 TLNAAKDRDLGIYNTQRYNIVSGNTNNAFRLSSHREKDGVLYLDLQINGVL-DRETIPEYSLVIEALDGGSPPLRGSMIV  249 (483)
Q Consensus       171 ~~v~a~D~D~~~~~~~~y~l~~~~~~~~F~i~~~~~~~g~~~~~l~~~~~L-D~E~~~~~~l~v~a~D~g~p~~~~~~~v  249 (483)
                      ..+.++|.-.+. ...+.+|.+|.....-.+.......|...  +...+.+ ..|....|+|+|.|.|..+  ..++..+
T Consensus        69 i~i~~tD~~~~~-~i~sv~l~Gg~~~d~v~ls~~~~~~~~~~--~~yp~~fpsle~~~~YtLtV~a~D~aG--N~~~~si  143 (158)
T PF13750_consen   69 ISINVTDNSDDS-KITSVSLTGGPASDSVSLSWTNKGNGVYT--LEYPRIFPSLEADDSYTLTVSATDKAG--NQSTKSI  143 (158)
T ss_pred             eEEEEEeCCCCc-eEEEEEEECCcccceEEEeeEeccCceEE--eecccccCCcCCCCeEEEEEEEEecCC--CEEEEEE
Confidence            456777766543 34567776654343333332222334422  3222222 3377889999999999766  5777777


Q ss_pred             EEEEEecC
Q psy8629         250 NVTIQDVN  257 (483)
Q Consensus       250 ~I~V~dvN  257 (483)
                      .....+.|
T Consensus       144 ~F~y~P~~  151 (158)
T PF13750_consen  144 SFSYMPPN  151 (158)
T ss_pred             EEEEeCCe
Confidence            77766443


No 22 
>KOG3597|consensus
Probab=93.79  E-value=6.1  Score=40.54  Aligned_cols=154  Identities=18%  Similarity=0.181  Sum_probs=88.2

Q ss_pred             CeEEEEEEEecCCCCCCccCCCceEEEEeCCCCCCeEEEEEEEEeCCCCCCceEEEEEEeCC--------------CCCc
Q psy8629         133 ENVRVVVRVIDENDNAPTFPSPFMSIEFPENTPRDVKRTLNAAKDRDLGIYNTQRYNIVSGN--------------TNNA  198 (483)
Q Consensus       133 ~~~~v~I~V~DvND~~P~f~~~~~~~~V~E~~~~gt~v~~v~a~D~D~~~~~~~~y~l~~~~--------------~~~~  198 (483)
                      .+....|.|.-+||.+..+....+.+.+.|+...-.....+.+.|+|.+. ..+.|++.+..              +.+.
T Consensus        24 ~~~~~~i~v~pvndpp~~~~~~~~~l~~~~~~~k~l~~~~l~~~d~d~~~-~~l~f~v~~t~~~~~~~~~~~~~g~~~~~  102 (442)
T KOG3597|consen   24 QTDVLRIHVNPVNDPPSLIFPSGSLLVILEGGQKVLDPELLTAADPDSAP-LPLEFQVLGTSSVPLPVLKFDVPGAPATE  102 (442)
T ss_pred             EEeeecccccccCCCcceeecccceEEeecCCceeccceEeeccCCCCCc-cceEEEEccCCCCCCccceeeccCCcccc
Confidence            36678899999999888777777778999997665566678899999765 35678876521              1122


Q ss_pred             eEEecccCCCceEEEEEEeccCCCCC--CCCEEEEEEEEEECCCCCeeeEEEEEEEEEecCCCCCeecCCc-eEEEecCC
Q psy8629         199 FRLSSHREKDGVLYLDLQINGVLDRE--TIPEYSLVIEALDGGSPPLRGSMIVNVTIQDVNDNQPIFNQSR-YFATVPEN  275 (483)
Q Consensus       199 F~i~~~~~~~g~~~~~l~~~~~LD~E--~~~~~~l~v~a~D~g~p~~~~~~~v~I~V~dvNd~~P~f~~~~-~~~~v~E~  275 (483)
                      |+-...  ..|.+.        +++.  +.....+...++|+-.   .+. . .| .. .-..+|.+.... ....|.-+
T Consensus       103 Fs~~~v--~~g~~~--------yvh~g~el~~~~~~~~~SDg~~---~S~-~-~i-~~-~~~~~~~~~~~~~~gL~v~~g  165 (442)
T KOG3597|consen  103 FSYEEV--EDGSLS--------YVHSGTELRESELQLRVSDGLL---VSE-R-AI-LK-VEATGPAPHLARNTGLKVLQG  165 (442)
T ss_pred             eEehHh--hcCcee--------EEecCcccccceEEEEeecceE---eee-e-EE-ec-ccCCCcceeeecccceEEccC
Confidence            332221  122221        1111  2457788899999763   222 1 11 11 112234332221 11222222


Q ss_pred             CCCCcEEEEEEEEeCCCCCccEEEEEEEe
Q psy8629         276 ATVGTSIFQVFATDADAGSNGEVEYSINR  304 (483)
Q Consensus       276 ~~~gt~v~~v~a~D~D~~~~~~i~y~i~~  304 (483)
                      +..--.-.-+.+.|.|.++...+.|.|..
T Consensus       166 S~~~IT~~~L~ved~d~~~d~~v~~~i~~  194 (442)
T KOG3597|consen  166 STAPITPSNLSVEDNDSSPDDEVRYDITP  194 (442)
T ss_pred             ccccccHhHceeecCCCCCCcEEEEEecC
Confidence            21111113488889987667789999964


No 23 
>TIGR00845 caca sodium/calcium exchanger 1. This model is specific for the eukaryotic sodium ion/calcium ion exchangers of the Caca family
Probab=91.94  E-value=15  Score=41.29  Aligned_cols=154  Identities=19%  Similarity=0.138  Sum_probs=80.8

Q ss_pred             CCCCCCccCCCceEEEEeCCCCCCeEEEEEEEEeCCCCCCceEEEEEEeCCCCC--ceEE-----ecccCCCceEEEEEE
Q psy8629         144 ENDNAPTFPSPFMSIEFPENTPRDVKRTLNAAKDRDLGIYNTQRYNIVSGNTNN--AFRL-----SSHREKDGVLYLDLQ  216 (483)
Q Consensus       144 vND~~P~f~~~~~~~~V~E~~~~gt~v~~v~a~D~D~~~~~~~~y~l~~~~~~~--~F~i-----~~~~~~~g~~~~~l~  216 (483)
                      .||.++.|....-..+|.|+.  |+.-..|.-...|.+..-.+.|+..+|....  -|.-     .-..|++- -++.+.
T Consensus       395 ~dd~~s~i~Fe~~~Y~V~En~--GtV~VtV~R~GGdl~~tVsVdY~T~DGTA~AG~DY~~~sGTLtF~PGEt~-KtItV~  471 (928)
T TIGR00845       395 ENDPVSKIFFEPGHYTCLENC--GTVALTVVRRGGDLTNTVYVDYRTEDGTANAGSDYEFTEGTLVFKPGETQ-KEFRIG  471 (928)
T ss_pred             ccCCcceEEecCCeEEEeecC--cEEEEEEEEccCCCCceEEEEEEccCCccCCCCCccccCceEEECCCceE-EEEEEE
Confidence            566666666666678999986  4555555444435544456889887764321  1221     11111111 122243


Q ss_pred             eccCCCCCCCCEEEEEEEEEECC-------------CCCeeeEEEEEEEEEecCCCCCeecCCceEEEecCCCCCCcEEE
Q psy8629         217 INGVLDRETIPEYSLVIEALDGG-------------SPPLRGSMIVNVTIQDVNDNQPIFNQSRYFATVPENATVGTSIF  283 (483)
Q Consensus       217 ~~~~LD~E~~~~~~l~v~a~D~g-------------~p~~~~~~~v~I~V~dvNd~~P~f~~~~~~~~v~E~~~~gt~v~  283 (483)
                      +...=-+|....|.+.+.--..+             ...+......+|+|.| ||++|.|.-..-...|.|+.  |+.-.
T Consensus       472 IIDDdi~E~DE~F~V~LSNp~~g~~~G~~~~~~~~~~A~Lg~ps~ATVTIlD-DD~aGIfsFe~~~~sV~Es~--G~vtv  548 (928)
T TIGR00845       472 IIDDDIFEEDEHFYVRLSNLRVGSEDGILEANHVSAVAQLASPNTATVTILD-DDHAGIFTFEEDVFHVSESI--GIMEV  548 (928)
T ss_pred             EccCCCCCCCceEEEEEeCCCCCCcccccccccccccceecCCceEEEEEec-CcccCcccccCceEEEEcCC--CEEEE
Confidence            33433466667777766543222             0112233456677777 78899877655567889974  55544


Q ss_pred             EEEEEeCCCCCccEEEEEEEe
Q psy8629         284 QVFATDADAGSNGEVEYSINR  304 (483)
Q Consensus       284 ~v~a~D~D~~~~~~i~y~i~~  304 (483)
                      +|.-+-.-.| .-.+.|...+
T Consensus       549 tV~RtsGa~G-~VtV~Y~T~d  568 (928)
T TIGR00845       549 KVLRTSGARG-TVIVPYRTVE  568 (928)
T ss_pred             EEEEcCCCCe-eEEEEEEeec
Confidence            4432211111 2345676543


No 24 
>PF05345 He_PIG:  Putative Ig domain;  InterPro: IPR008009 This alignment represents the conserved core region of a ~90 residue repeat found in several haemagglutinins and other cell surface proteins. Sequence similarities to Hyalin (IPR003410 from INTERPRO) and the PKD domain (IPR000601 from INTERPRO) suggest an Ig-like fold so this family may be similar in function to the (IPR003791 from INTERPRO) and (IPR003790 from INTERPRO) protein families.
Probab=89.08  E-value=2.1  Score=29.42  Aligned_cols=39  Identities=26%  Similarity=0.284  Sum_probs=30.5

Q ss_pred             CCCCCCcEEEeCCCcEEEEcccCCCC-CCCeeEEEEEEEECC
Q psy8629         383 TDDHPSWFQIDSKSGLITMKSNVDCE-TDPVPKLTVIATDNG  423 (483)
Q Consensus       383 ~~~~~~~F~id~~tG~i~~~~~lD~E-~~~~y~l~V~a~D~g  423 (483)
                      .+....+..+|+.+|.|.-.  .+.. ....|.++|.|+|..
T Consensus         9 ~~~LP~gLs~d~~tG~isGt--p~~~~~~G~y~~~vtatd~~   48 (49)
T PF05345_consen    9 GGGLPSGLSLDPSTGTISGT--PTSSVQPGTYTFTVTATDGS   48 (49)
T ss_pred             CCCCCCcEEEeCCCCEEEee--cCCCccccEEEEEEEEEcCC
Confidence            34677889999999999766  4444 346999999999974


No 25 
>TIGR00845 caca sodium/calcium exchanger 1. This model is specific for the eukaryotic sodium ion/calcium ion exchangers of the Caca family
Probab=83.46  E-value=20  Score=40.34  Aligned_cols=47  Identities=23%  Similarity=0.165  Sum_probs=28.8

Q ss_pred             cCCCCCeecCCceEEEecCCCCCCcEEEEEEEEeCCCCCccEEEEEEEe
Q psy8629         256 VNDNQPIFNQSRYFATVPENATVGTSIFQVFATDADAGSNGEVEYSINR  304 (483)
Q Consensus       256 vNd~~P~f~~~~~~~~v~E~~~~gt~v~~v~a~D~D~~~~~~i~y~i~~  304 (483)
                      .||.++.|.-..-...|.|++  |++-.+|.-...|.+....+.|...+
T Consensus       395 ~dd~~s~i~Fe~~~Y~V~En~--GtV~VtV~R~GGdl~~tVsVdY~T~D  441 (928)
T TIGR00845       395 ENDPVSKIFFEPGHYTCLENC--GTVALTVVRRGGDLTNTVYVDYRTED  441 (928)
T ss_pred             ccCCcceEEecCCeEEEeecC--cEEEEEEEEccCCCCceEEEEEEccC
Confidence            455555554444456789984  77777776555455445567787643


No 26 
>TIGR03660 T1SS_rpt_143 T1SS-143 repeat domain. This model represents a domain of about 143 amino acids that may occur singly or in up to 23 tandem repeats in very large proteins in the genus Vibrio, and in related species such as Legionella pneumophila, Photobacterium profundum, Rhodopseudomonas palustris, Shewanella pealeana, and Aeromonas hydrophila. Proteins with these domains represent a subset of a broader set of proteins with a particular signal for type 1 secretion, consisting of several glycine-rich repeats modeled by pfam00353, followed by a C-terminal domain modeled by TIGR03661. Proteins with this domain tend to share several properties with the RtxA (Repeats in Toxin) protein of Vibrio cholerae, including a large size often containing tandemly repeated domains and a C-terminal signal for type 1 secretion.
Probab=81.87  E-value=20  Score=30.59  Aligned_cols=69  Identities=29%  Similarity=0.353  Sum_probs=42.7

Q ss_pred             CceEEecccCCCceEEEEEEeccCCCCCC---CCEEEEEEEEEECCCCCeeeEEEEEEEEEecCCCCCeecCCceEEEec
Q psy8629         197 NAFRLSSHREKDGVLYLDLQINGVLDRET---IPEYSLVIEALDGGSPPLRGSMIVNVTIQDVNDNQPIFNQSRYFATVP  273 (483)
Q Consensus       197 ~~F~i~~~~~~~g~~~~~l~~~~~LD~E~---~~~~~l~v~a~D~g~p~~~~~~~v~I~V~dvNd~~P~f~~~~~~~~v~  273 (483)
                      ..|.|.-..  +|...  +.+.++||...   .-...|.|.|+|..+-  .+...+.|+|.|  | .|...... ..+|.
T Consensus        56 ~Vftvtl~~--~GsYt--ftL~~~lDH~~g~d~l~l~~~v~a~D~DGD--~s~~~l~VtI~D--D-~P~~~~~~-~~~V~  125 (137)
T TIGR03660        56 PVFTLTLNA--DGSYE--FTLEGPLDHAAGSDELTLNFPIIATDFDGD--TSSITLPVTIVD--D-VPTITDVD-ALTVD  125 (137)
T ss_pred             EEEEEEECC--CccEE--EEEcccccCCCCCceEEEeeeEEEEeCCCC--ccccEEEEEEEC--C-CCeecccc-ceEEe
Confidence            345554332  34332  66678887743   3367888999997663  334588888887  5 57664443 36777


Q ss_pred             CC
Q psy8629         274 EN  275 (483)
Q Consensus       274 E~  275 (483)
                      |+
T Consensus       126 E~  127 (137)
T TIGR03660       126 ED  127 (137)
T ss_pred             cc
Confidence            74


No 27 
>PF07495 Y_Y_Y:  Y_Y_Y domain;  InterPro: IPR011123 This region is mostly found at the end of the beta propellers (IPR011110 from INTERPRO) in a family of two component regulators. However they are also found tandemly repeated in Q891H4 from SWISSPROT without other signal conduction domains being present. It is named after the conserved tyrosines found in the alignment. The exact function is not known.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=79.34  E-value=10  Score=27.40  Aligned_cols=61  Identities=26%  Similarity=0.251  Sum_probs=34.6

Q ss_pred             CCceEEEEEeeCCCCCCCcEEEeCCCcEEEEcccCCCCCCCeeEEEEEEEECCcCCceEEEEEEEEEE
Q psy8629         371 INSVITYSLRENTDDHPSWFQIDSKSGLITMKSNVDCETDPVPKLTVIATDNGVPALSSSATVLVTIH  438 (483)
Q Consensus       371 ~~~~v~y~l~~~~~~~~~~F~id~~tG~i~~~~~lD~E~~~~y~l~V~a~D~g~p~~s~~~~v~I~V~  438 (483)
                      .+-...|+|.   |-...|..+...+-.+... .|   ..+.|.|.|+|.|...........+.|+|.
T Consensus         6 ~~~~Y~Y~l~---g~d~~W~~~~~~~~~~~~~-~L---~~G~Y~l~V~a~~~~~~~~~~~~~l~i~I~   66 (66)
T PF07495_consen    6 ENIRYRYRLE---GFDDEWITLGSYSNSISYT-NL---PPGKYTLEVRAKDNNGKWSSDEKSLTITIL   66 (66)
T ss_dssp             TTEEEEEEEE---TTESSEEEESSTS-EEEEE-S-----SEEEEEEEEEEETTS-B-SS-EEEEEEEE
T ss_pred             CceEEEEEEE---CCCCeEEECCCCcEEEEEE-eC---CCEEEEEEEEEECCCCCcCcccEEEEEEEC
Confidence            3445677787   4456677665543233222 22   478999999999976543333366666653


No 28 
>COG2706 3-carboxymuconate cyclase [Carbohydrate transport and metabolism]
Probab=75.30  E-value=90  Score=30.91  Aligned_cols=75  Identities=21%  Similarity=0.322  Sum_probs=44.1

Q ss_pred             EEEEEecCCCCCeecCCceEEEecCCCCCCcEEEEEEEEeCCCCCccEEEEEEEecCCCCCcceEEeCCccEEEEcccCC
Q psy8629         250 NVTIQDVNDNQPIFNQSRYFATVPENATVGTSIFQVFATDADAGSNGEVEYSINRRQSDKDGLFNIDSKSGLIFVNKALD  329 (483)
Q Consensus       250 ~I~V~dvNd~~P~f~~~~~~~~v~E~~~~gt~v~~v~a~D~D~~~~~~i~y~i~~~~~~~~~~F~Id~~tG~l~~~~~LD  329 (483)
                      +|.|...|.+.=.|..-....+++|+- .|..=+.--...+    +++.-|.-.+ .-+.-..|.|++.+|.|.+.+.-.
T Consensus       214 tV~v~~y~~~~g~~~~lQ~i~tlP~dF-~g~~~~aaIhis~----dGrFLYasNR-g~dsI~~f~V~~~~g~L~~~~~~~  287 (346)
T COG2706         214 TVDVLEYNPAVGKFEELQTIDTLPEDF-TGTNWAAAIHISP----DGRFLYASNR-GHDSIAVFSVDPDGGKLELVGITP  287 (346)
T ss_pred             EEEEEEEcCCCceEEEeeeeccCcccc-CCCCceeEEEECC----CCCEEEEecC-CCCeEEEEEEcCCCCEEEEEEEec
Confidence            566777766555666666666777763 3332222222223    4567776543 233456899999999987765543


Q ss_pred             C
Q psy8629         330 F  330 (483)
Q Consensus       330 ~  330 (483)
                      -
T Consensus       288 t  288 (346)
T COG2706         288 T  288 (346)
T ss_pred             c
Confidence            3


No 29 
>KOG3597|consensus
Probab=70.48  E-value=38  Score=34.87  Aligned_cols=63  Identities=16%  Similarity=0.162  Sum_probs=46.8

Q ss_pred             eEEEEEEEEEecCCCCCeecCCceEEEecCCCCCCcEEEEEEEEeCCCCCccEEEEEEEecCCC
Q psy8629         245 GSMIVNVTIQDVNDNQPIFNQSRYFATVPENATVGTSIFQVFATDADAGSNGEVEYSINRRQSD  308 (483)
Q Consensus       245 ~~~~v~I~V~dvNd~~P~f~~~~~~~~v~E~~~~gt~v~~v~a~D~D~~~~~~i~y~i~~~~~~  308 (483)
                      .+....|.|..+||.+..+....+..-+.|+...-.....+.+.|+|..+ ..+.|++....+.
T Consensus        24 ~~~~~~i~v~pvndpp~~~~~~~~~l~~~~~~~k~l~~~~l~~~d~d~~~-~~l~f~v~~t~~~   86 (442)
T KOG3597|consen   24 QTDVLRIHVNPVNDPPSLIFPSGSLLVILEGGQKVLDPELLTAADPDSAP-LPLEFQVLGTSSV   86 (442)
T ss_pred             EEeeecccccccCCCcceeecccceEEeecCCceeccceEeeccCCCCCc-cceEEEEccCCCC
Confidence            45667899999999777776666668888887655556678899999764 4588888764333


No 30 
>TIGR03660 T1SS_rpt_143 T1SS-143 repeat domain. This model represents a domain of about 143 amino acids that may occur singly or in up to 23 tandem repeats in very large proteins in the genus Vibrio, and in related species such as Legionella pneumophila, Photobacterium profundum, Rhodopseudomonas palustris, Shewanella pealeana, and Aeromonas hydrophila. Proteins with these domains represent a subset of a broader set of proteins with a particular signal for type 1 secretion, consisting of several glycine-rich repeats modeled by pfam00353, followed by a C-terminal domain modeled by TIGR03661. Proteins with this domain tend to share several properties with the RtxA (Repeats in Toxin) protein of Vibrio cholerae, including a large size often containing tandemly repeated domains and a C-terminal signal for type 1 secretion.
Probab=69.73  E-value=23  Score=30.29  Aligned_cols=44  Identities=34%  Similarity=0.354  Sum_probs=29.6

Q ss_pred             CeeEEEEEEEECCcCCceEEEEEEEEEEecCCCCCeecCCceEEEEcCCC
Q psy8629         411 PVPKLTVIATDNGVPALSSSATVLVTIHDVNDNEPIFDQSFYNVTIPENK  460 (483)
Q Consensus       411 ~~y~l~V~a~D~g~p~~s~~~~v~I~V~dvNd~~P~f~~~~~~~~V~E~~  460 (483)
                      -...|.|.|+|...-  .++.++.|+|.|  | .|...... ...|.|+.
T Consensus        85 l~l~~~v~a~D~DGD--~s~~~l~VtI~D--D-~P~~~~~~-~~~V~E~~  128 (137)
T TIGR03660        85 LTLNFPIIATDFDGD--TSSITLPVTIVD--D-VPTITDVD-ALTVDEDD  128 (137)
T ss_pred             EEEeeeEEEEeCCCC--ccccEEEEEEEC--C-CCeecccc-ceEEeccc
Confidence            356788999997542  334588888887  6 47765443 36787843


No 31 
>PF06586 TraK:  TraK protein;  InterPro: IPR010563 This family consists of several TraK proteins from Escherichia coli, Salmonella typhi and Salmonella typhimurium. TraK is known to be essential for pilus assembly but its exact role in this process is unknown [].
Probab=52.29  E-value=2e+02  Score=26.78  Aligned_cols=95  Identities=14%  Similarity=0.126  Sum_probs=48.8

Q ss_pred             ceEEEEEeCCCCCCeEEEEEEccCCC----CCCCCEEEEecCCCCCcccEEE--eCCccEEEeCccCCccCCceeeEEEE
Q psy8629          18 HLRDLEISEGVPVGTRVGFIGDGFVG----DSGPPYLIVPVPGSAVDTDLSI--EQNTGEIRTKVPLDRETRSSYSLVAI   91 (483)
Q Consensus        18 ~~~~~~v~En~~~gt~V~~v~a~d~d----~~~~~y~i~~~~~~~~~~~F~I--d~~tG~i~~~~~LD~E~~~~y~l~v~   91 (483)
                      .......-...+-++....++..|..    .....-.+....+......|..  +..+|.+.+. .. + ....+++-+.
T Consensus        19 ~~~a~~~v~~~~~~~~~i~lS~~~~NRIv~~~~~i~~V~~~~~~~~~~~~~~~~~~~~G~v~~~-~~-~-~~~p~tl~vt   95 (234)
T PF06586_consen   19 AAFAAQTVTVSPGDTVPIPLSNRDPNRIVVPGDRITSVSFPEGGCITVDIQGENDDVTGSVYVS-PT-Y-SSKPFTLFVT   95 (234)
T ss_pred             hccCccEEEcCCCCeEEEEEcCCCCeEEEccCCceeEEecCCCcccccccccccccccCcEEEE-ec-C-CCCCEEEEEE
Confidence            33444444556677778888888743    1111223332211111112332  3467888877 22 2 2223566665


Q ss_pred             ecCCcceeEEEeecCcEEEEcccCCccccceEEEEEEeCCCCeEEEEEE
Q psy8629          92 PLSGENVRVVSFYEPGEIRTKVPLDRETRSSYSLVAIPLSGENVRVVVR  140 (483)
Q Consensus        92 a~d~~~~~~~~~~~~g~l~~~~~ld~e~~~~~~~~~~~~~~~~~~v~I~  140 (483)
                      ..+|.                         .|+|.+.+.+....++.+.
T Consensus        96 t~~g~-------------------------~~sl~l~P~~~p~~ti~l~  119 (234)
T PF06586_consen   96 TEKGR-------------------------TYSLTLVPKAIPAQTIRLV  119 (234)
T ss_pred             eCCCC-------------------------EEEEEEEeccCCCeEEEEe
Confidence            54443                         6888888877665444444


No 32 
>PF05345 He_PIG:  Putative Ig domain;  InterPro: IPR008009 This alignment represents the conserved core region of a ~90 residue repeat found in several haemagglutinins and other cell surface proteins. Sequence similarities to Hyalin (IPR003410 from INTERPRO) and the PKD domain (IPR000601 from INTERPRO) suggest an Ig-like fold so this family may be similar in function to the (IPR003791 from INTERPRO) and (IPR003790 from INTERPRO) protein families.
Probab=48.57  E-value=85  Score=21.41  Aligned_cols=45  Identities=20%  Similarity=0.426  Sum_probs=27.0

Q ss_pred             EEEEecCCCCCcccEEEeCCccEEEeCccCCcc-CCceeeEEEEecCC
Q psy8629          49 YLIVPVPGSAVDTDLSIEQNTGEIRTKVPLDRE-TRSSYSLVAIPLSG   95 (483)
Q Consensus        49 y~i~~~~~~~~~~~F~Id~~tG~i~~~~~LD~E-~~~~y~l~v~a~d~   95 (483)
                      |++....+.+.-....+|+.+|.|+=.-  +.+ +...|.+.|.+.|+
T Consensus         2 ys~~~~~~~~LP~gLs~d~~tG~isGtp--~~~~~~G~y~~~vtatd~   47 (49)
T PF05345_consen    2 YSLTTPTGGGLPSGLSLDPSTGTISGTP--TSSVQPGTYTFTVTATDG   47 (49)
T ss_pred             EEEEcCCCCCCCCcEEEeCCCCEEEeec--CCCccccEEEEEEEEEcC
Confidence            5553222333445799999999998662  222 22467777766654


No 33 
>PF03160 Calx-beta:  Calx-beta domain;  InterPro: IPR003644 The calx-beta motif is present as a tandem repeat in the cytoplasmic domains of Calx Na-Ca exchangers, which are used to expel calcium from cells. This motif overlaps domains used for calcium binding and regulation. The calx-beta motif is also present in the cytoplasmic tail of mammalian integrin-beta4, which mediates the bi-directional transfer of signals across the plasma membrane, as well as in some cyanobacterial proteins. This motif contains a series of beta-strands and turns that form a self-contained beta-sheet [, ].; GO: 0007154 cell communication, 0016021 integral to membrane; PDB: 3H6A_B 3FSO_A 3FQ4_B 2DPK_A 2QVM_A 3GIN_B 2QVK_A 2FWU_A 2FWS_A 3E9U_A ....
Probab=48.12  E-value=1.3e+02  Score=23.51  Aligned_cols=51  Identities=29%  Similarity=0.270  Sum_probs=30.1

Q ss_pred             EEEEEecCCCCCeecCCceEEEecCCCCCCcEEEEEEEEeCCCCCccEEEEEEEe
Q psy8629         250 NVTIQDVNDNQPIFNQSRYFATVPENATVGTSIFQVFATDADAGSNGEVEYSINR  304 (483)
Q Consensus       250 ~I~V~dvNd~~P~f~~~~~~~~v~E~~~~gt~v~~v~a~D~D~~~~~~i~y~i~~  304 (483)
                      +|+|.| ||.+ .+.-..-..++.|+.  |..-..|.-..++....-.+.|....
T Consensus         2 tvtI~d-~d~~-~v~f~~~~~~v~E~~--~~~~v~V~~~~~~~~~~v~v~~~~~~   52 (100)
T PF03160_consen    2 TVTILD-DDDP-TVSFSSPSYTVSEGD--GTVTVTVTRSGGSLDGPVTVNYSTVD   52 (100)
T ss_dssp             EEEEE--TTSE-EEEESSSEEEEETTS--SEEEEEEEEESS-TSSEEEEEEEEEE
T ss_pred             EEEEEC-CCCC-EEEEeCCEEEEEeCC--CEEEEEEEEcccCCCcceEEEEEEeC
Confidence            577788 6645 776666667888885  44555555554443334556776654


No 34 
>PF07495 Y_Y_Y:  Y_Y_Y domain;  InterPro: IPR011123 This region is mostly found at the end of the beta propellers (IPR011110 from INTERPRO) in a family of two component regulators. However they are also found tandemly repeated in Q891H4 from SWISSPROT without other signal conduction domains being present. It is named after the conserved tyrosines found in the alignment. The exact function is not known.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=46.54  E-value=39  Score=24.18  Aligned_cols=30  Identities=20%  Similarity=0.199  Sum_probs=18.8

Q ss_pred             CCCEEEEEEEEEECCCCCeeeEEEEEEEEE
Q psy8629         225 TIPEYSLVIEALDGGSPPLRGSMIVNVTIQ  254 (483)
Q Consensus       225 ~~~~~~l~v~a~D~g~p~~~~~~~v~I~V~  254 (483)
                      ....|+|.|+|.|..+........+.|.|+
T Consensus        37 ~~G~Y~l~V~a~~~~~~~~~~~~~l~i~I~   66 (66)
T PF07495_consen   37 PPGKYTLEVRAKDNNGKWSSDEKSLTITIL   66 (66)
T ss_dssp             -SEEEEEEEEEEETTS-B-SS-EEEEEEEE
T ss_pred             CCEEEEEEEEEECCCCCcCcccEEEEEEEC
Confidence            356899999999976543333366666653


No 35 
>TIGR00864 PCC polycystin cation channel protein. Note: this model has been restricted to the amino half because for technical reasons.
Probab=43.83  E-value=8.5e+02  Score=31.63  Aligned_cols=189  Identities=14%  Similarity=0.114  Sum_probs=88.9

Q ss_pred             CCCCCCEEEEEEEEEECCCCCeeeEEEEEEEEEecCCCCCeecCCceEEEecC--------------CCCCCcEEEEEEE
Q psy8629         222 DRETIPEYSLVIEALDGGSPPLRGSMIVNVTIQDVNDNQPIFNQSRYFATVPE--------------NATVGTSIFQVFA  287 (483)
Q Consensus       222 D~E~~~~~~l~v~a~D~g~p~~~~~~~v~I~V~dvNd~~P~f~~~~~~~~v~E--------------~~~~gt~v~~v~a  287 (483)
                      .|.+...|++++.|++..+   +...++.|.|.+....        ..+...+              ..+.+ ....+.+
T Consensus      1563 TY~spGtYtVtLTvtN~~G---s~~~T~~i~V~~~I~G--------lqI~~~~~~gg~~~~~~~~~~~~~t~-~~v~l~~ 1630 (2740)
T TIGR00864      1563 TFRSVGTFNIIVTAENDVG---AAQASIFLFVLQEIEG--------LQILGETAEGGGGGVQELDGCYFETN-HTVQFHA 1630 (2740)
T ss_pred             eecCCceEEEEEEEecCCC---ccceeEEEEEeecccc--------eEEEeecccccCCcccccccceeccC-CEEEeee
Confidence            3667789999999999776   4456667777654211        0111100              00111 1244555


Q ss_pred             EeCCCCCccEEEEEEEecCC-C-CCcceEEeCCccEEEEcccCCCcccceeeeEeEEEEEee--------eeeccCCCCe
Q psy8629         288 TDADAGSNGEVEYSINRRQS-D-KDGLFNIDSKSGLIFVNKALDFETKEVHDLVPRFAVEIS--------SFDEIAEPGT  357 (483)
Q Consensus       288 ~D~D~~~~~~i~y~i~~~~~-~-~~~~F~Id~~tG~l~~~~~LD~E~~~~~~l~v~~~~~v~--------~v~E~~~~gt  357 (483)
                      .-.| |  ..++|+-.-... + ....|   ..+|+-   -.+-|.....|.+++..+..+.        .+.|  +.+.
T Consensus      1631 ~~~~-G--TnvsysW~~~~~n~~dg~~~---~~tGk~---~~vt~~~pG~Y~VtL~aSN~vG~~t~s~~i~v~e--pi~~ 1699 (2740)
T TIGR00864      1631 GFKD-G--TNLSFSWNAILDNEPDGPAF---AGSGKG---AKLNPLEAGPCDIFLQAANLLGQATADCTIDFLE--PAGN 1699 (2740)
T ss_pred             ccCC-C--CeeEEEEEEecCCCCCCccc---cccCce---eEEecCCCceEEEEEEEeecccceeeEEEEEEEE--EcCc
Confidence            4444 3  345555422111 1 11112   134432   1234455677887777544433        1222  2222


Q ss_pred             EEEEEEEEeCCCCCCceEEEEEeeCCCCCCCcEEEeCCCcE-EEEccc---CCCCCCCeeEEEEEEEECCcCCceEEEEE
Q psy8629         358 SVFQISAFDQDEGINSVITYSLRENTDDHPSWFQIDSKSGL-ITMKSN---VDCETDPVPKLTVIATDNGVPALSSSATV  433 (483)
Q Consensus       358 ~v~~v~a~D~D~~~~~~v~y~l~~~~~~~~~~F~id~~tG~-i~~~~~---lD~E~~~~y~l~V~a~D~g~p~~s~~~~v  433 (483)
                      .  .+.+.+.-...|..|.|+..-..|. .-.|...-..|. ..+..+   -.|...+.|.++|.|...-.   +...++
T Consensus      1700 L--~L~aS~~~aavn~~V~fs~~lt~GS-~vty~W~~gdG~s~~~s~p~~tHtf~~~G~y~VtVta~N~vs---s~~~s~ 1773 (2740)
T TIGR00864      1700 L--MLQASDNPAAVNALINLSAELAEGS-GLQYRWFLEEGDDLETSEPFMSHSFPSAGLHLVTMKAFNELG---SANASE 1773 (2740)
T ss_pred             e--EEeccCCcccCCCeEEEEEEecCCC-CcEEEEEECCCCCcccCCCceEEEccCCceEEEEEEEEcccc---ccceeE
Confidence            2  1333343334566666665432222 222333211221 111222   24667889999999998754   223455


Q ss_pred             EEEEEe
Q psy8629         434 LVTIHD  439 (483)
Q Consensus       434 ~I~V~d  439 (483)
                      .|.|.+
T Consensus      1774 ~V~Vqe 1779 (2740)
T TIGR00864      1774 EVDVQE 1779 (2740)
T ss_pred             EEEEEe
Confidence            665554


No 36 
>cd00146 PKD polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.
Probab=42.71  E-value=56  Score=24.45  Aligned_cols=30  Identities=20%  Similarity=0.042  Sum_probs=21.7

Q ss_pred             CCCCCCCeeEEEEEEEECCcCCceEEEEEEEE
Q psy8629         405 VDCETDPVPKLTVIATDNGVPALSSSATVLVT  436 (483)
Q Consensus       405 lD~E~~~~y~l~V~a~D~g~p~~s~~~~v~I~  436 (483)
                      ..|...+.|.++++++|...  .+.+.++.|.
T Consensus        51 ~~y~~~G~y~v~l~v~d~~g--~~~~~~~~V~   80 (81)
T cd00146          51 HTYTKPGTYTVTLTVTNAVG--SSSTKTTTVV   80 (81)
T ss_pred             EEcCCCcEEEEEEEEEeCCC--CEEEEEEEEE
Confidence            45778899999999999853  3444455554


No 37 
>smart00089 PKD Repeats in polycystic kidney disease 1 (PKD1) and other proteins. Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.
Probab=42.37  E-value=59  Score=24.16  Aligned_cols=31  Identities=26%  Similarity=0.164  Sum_probs=23.4

Q ss_pred             cCCCCCCCeeEEEEEEEECCcCCceEEEEEEEEE
Q psy8629         404 NVDCETDPVPKLTVIATDNGVPALSSSATVLVTI  437 (483)
Q Consensus       404 ~lD~E~~~~y~l~V~a~D~g~p~~s~~~~v~I~V  437 (483)
                      ..-|...+.|.+++.++|...   +++.+++|.|
T Consensus        48 ~~~y~~~G~y~v~l~v~n~~g---~~~~~~~i~v   78 (79)
T smart00089       48 THTYTKPGTYTVTLTVTNAVG---SASATVTVVV   78 (79)
T ss_pred             EEEeCCCcEEEEEEEEEcCCC---cEEEEEEEEE
Confidence            346778899999999999765   5566666665


No 38 
>PF12245 Big_3_2:  Bacterial Ig-like domain (group 3);  InterPro: IPR022038  This family of proteins is found in bacteria. They have two conserved sequence motifs: AGN and GMT. 
Probab=40.76  E-value=1.1e+02  Score=21.75  Aligned_cols=31  Identities=39%  Similarity=0.310  Sum_probs=21.2

Q ss_pred             CCCeeEEEEEEEECCcCCceEEEEEEEEEEecC
Q psy8629         409 TDPVPKLTVIATDNGVPALSSSATVLVTIHDVN  441 (483)
Q Consensus       409 ~~~~y~l~V~a~D~g~p~~s~~~~v~I~V~dvN  441 (483)
                      ..+.|.+.++|+|..+  ..++....+.+.|..
T Consensus        21 ~dg~yt~~v~a~D~AG--N~~~~~~~~~i~d~~   51 (60)
T PF12245_consen   21 ADGEYTLTVTATDKAG--NTSSSTTQIVIVDNT   51 (60)
T ss_pred             CCccEEEEEEEEECCC--CEEEeeeEEEEEcCC
Confidence            4678999999999854  244555555665643


No 39 
>PF03160 Calx-beta:  Calx-beta domain;  InterPro: IPR003644 The calx-beta motif is present as a tandem repeat in the cytoplasmic domains of Calx Na-Ca exchangers, which are used to expel calcium from cells. This motif overlaps domains used for calcium binding and regulation. The calx-beta motif is also present in the cytoplasmic tail of mammalian integrin-beta4, which mediates the bi-directional transfer of signals across the plasma membrane, as well as in some cyanobacterial proteins. This motif contains a series of beta-strands and turns that form a self-contained beta-sheet [, ].; GO: 0007154 cell communication, 0016021 integral to membrane; PDB: 3H6A_B 3FSO_A 3FQ4_B 2DPK_A 2QVM_A 3GIN_B 2QVK_A 2FWU_A 2FWS_A 3E9U_A ....
Probab=39.69  E-value=1.8e+02  Score=22.69  Aligned_cols=53  Identities=21%  Similarity=0.202  Sum_probs=29.6

Q ss_pred             EEEEecCCCCCCccCCCceEEEEeCCCCCCeEEEEEEEEeCCCCCCceEEEEEEeCC
Q psy8629         138 VVRVIDENDNAPTFPSPFMSIEFPENTPRDVKRTLNAAKDRDLGIYNTQRYNIVSGN  194 (483)
Q Consensus       138 ~I~V~DvND~~P~f~~~~~~~~V~E~~~~gt~v~~v~a~D~D~~~~~~~~y~l~~~~  194 (483)
                      +|+|.| ||.+ .+....-...+.|+..  .....+.-..++....-.+.|...+|.
T Consensus         2 tvtI~d-~d~~-~v~f~~~~~~v~E~~~--~~~v~V~~~~~~~~~~v~v~~~~~~gt   54 (100)
T PF03160_consen    2 TVTILD-DDDP-TVSFSSPSYTVSEGDG--TVTVTVTRSGGSLDGPVTVNYSTVDGT   54 (100)
T ss_dssp             EEEEE--TTSE-EEEESSSEEEEETTSS--EEEEEEEEESS-TSSEEEEEEEEEESS
T ss_pred             EEEEEC-CCCC-EEEEeCCEEEEEeCCC--EEEEEEEEcccCCCcceEEEEEEeCCc
Confidence            577888 7765 7766666688899864  333334333343233345666666543


No 40 
>PF12245 Big_3_2:  Bacterial Ig-like domain (group 3);  InterPro: IPR022038  This family of proteins is found in bacteria. They have two conserved sequence motifs: AGN and GMT. 
Probab=39.10  E-value=1.2e+02  Score=21.55  Aligned_cols=30  Identities=27%  Similarity=0.299  Sum_probs=21.0

Q ss_pred             CCEEEEEEEEEECCCCCeeeEEEEEEEEEecC
Q psy8629         226 IPEYSLVIEALDGGSPPLRGSMIVNVTIQDVN  257 (483)
Q Consensus       226 ~~~~~l~v~a~D~g~p~~~~~~~v~I~V~dvN  257 (483)
                      ...|.+.++|+|..+  ..+.....+.+.|..
T Consensus        22 dg~yt~~v~a~D~AG--N~~~~~~~~~i~d~~   51 (60)
T PF12245_consen   22 DGEYTLTVTATDKAG--NTSSSTTQIVIVDNT   51 (60)
T ss_pred             CccEEEEEEEEECCC--CEEEeeeEEEEEcCC
Confidence            567999999999766  455555556555543


No 41 
>PF13956 Ibs_toxin:  Toxin Ibs, type I toxin-antitoxin system
Probab=34.72  E-value=25  Score=18.54  Aligned_cols=16  Identities=25%  Similarity=0.310  Sum_probs=11.7

Q ss_pred             ChhhHHHHHHHHhhhc
Q psy8629           1 MRLWRLLWLLAAVAAD   16 (483)
Q Consensus         1 ~~~~~~~~~~~~~~~~   16 (483)
                      ||+.+.|.+|+..+|.
T Consensus         2 Mk~vIIlvvLLliSf~   17 (19)
T PF13956_consen    2 MKLVIILVVLLLISFP   17 (19)
T ss_pred             ceehHHHHHHHhcccc
Confidence            6778888877766664


No 42 
>cd00146 PKD polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.
Probab=34.14  E-value=92  Score=23.22  Aligned_cols=29  Identities=7%  Similarity=0.092  Sum_probs=20.4

Q ss_pred             CCCCCCEEEEEEEEEECCCCCeeeEEEEEEE
Q psy8629         222 DRETIPEYSLVIEALDGGSPPLRGSMIVNVT  252 (483)
Q Consensus       222 D~E~~~~~~l~v~a~D~g~p~~~~~~~v~I~  252 (483)
                      .|.....|.+++.++|..+  .+......|.
T Consensus        52 ~y~~~G~y~v~l~v~d~~g--~~~~~~~~V~   80 (81)
T cd00146          52 TYTKPGTYTVTLTVTNAVG--SSSTKTTTVV   80 (81)
T ss_pred             EcCCCcEEEEEEEEEeCCC--CEEEEEEEEE
Confidence            4677889999999999754  3444444544


No 43 
>PF12988 DUF3872:  Domain of unknown function, B. Theta Gene description (DUF3872);  InterPro: IPR024355 This entry represents proteins of unknown function found primarily in Bacteroides species. The Bacteroides thetaiotaomicron gene coding for this protein is located in a conjugate transposon and appears to be upregulated in the presence of host or other bacterial species compared to growth in pure culture [, ].; PDB: 2L3B_A 2L7Q_A.
Probab=33.93  E-value=2.8e+02  Score=23.56  Aligned_cols=42  Identities=17%  Similarity=0.168  Sum_probs=17.2

Q ss_pred             EeCCCCCCeEE-EEEEccCCC-CCCCCEEEEecCCCCCcccEEEeCCccEEEeC
Q psy8629          24 ISEGVPVGTRV-GFIGDGFVG-DSGPPYLIVPVPGSAVDTDLSIEQNTGEIRTK   75 (483)
Q Consensus        24 v~En~~~gt~V-~~v~a~d~d-~~~~~y~i~~~~~~~~~~~F~Id~~tG~i~~~   75 (483)
                      |+.....|..| .++.-.-.+ -..-.|+|.         +|..| ..|+|+..
T Consensus        40 Vpk~I~~GeTvEIR~~l~reG~y~~t~Y~iR---------YFQ~d-GkG~L~~~   83 (137)
T PF12988_consen   40 VPKKIKKGETVEIRCELKREGNYADTRYTIR---------YFQPD-GKGTLRMD   83 (137)
T ss_dssp             --SS--TTEEEEEEEEEEESS--SS---EEE---------EE-SS-S-EEEEET
T ss_pred             cccccCCCCEEEEEEEEecCceecccEEEEE---------EEeec-CCEEEEec
Confidence            56777778776 233222222 223467776         66666 34777654


No 44 
>smart00089 PKD Repeats in polycystic kidney disease 1 (PKD1) and other proteins. Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.
Probab=33.84  E-value=99  Score=22.89  Aligned_cols=29  Identities=14%  Similarity=0.208  Sum_probs=22.0

Q ss_pred             CCCCCCEEEEEEEEEECCCCCeeeEEEEEEEE
Q psy8629         222 DRETIPEYSLVIEALDGGSPPLRGSMIVNVTI  253 (483)
Q Consensus       222 D~E~~~~~~l~v~a~D~g~p~~~~~~~v~I~V  253 (483)
                      -|.....|.+++.+.|..+   ++++.++|.|
T Consensus        50 ~y~~~G~y~v~l~v~n~~g---~~~~~~~i~v   78 (79)
T smart00089       50 TYTKPGTYTVTLTVTNAVG---SASATVTVVV   78 (79)
T ss_pred             EeCCCcEEEEEEEEEcCCC---cEEEEEEEEE
Confidence            4667789999999999776   5566666655


No 45 
>PF10365 DUF2436:  Domain of unknown function (DUF2436);  InterPro: IPR018832  Gingipains R and K are endopeptidases with specificity for arginyl and lysyl bonds, respectively. Like other cysteine peptidases, they require reducing conditions for activity. They are maximally active at approximately neutral pH. Gingipains R and K are secreted by the bacterium Porphyromonas gingivalis (Bacteroides gingivalis). The bacterium is a major pathogen in periodontal disease, and the many ways in which the activities of the gingipains may contribute to the disease processes have been reviewed []. These enzymes are also involved in the hemagglutinating activity of the organisms.  This entry represents a central region found in gingipain K peptidases, active on lysyl bonds; they belong to the MEROPS peptidase family C25 (gingipain family, clan CD).  
Probab=32.31  E-value=3.2e+02  Score=23.31  Aligned_cols=90  Identities=18%  Similarity=0.185  Sum_probs=52.5

Q ss_pred             EecCCCCCCccCCCceEEEEeCCCCCCeEEEEEEEEeCCC---CCCceEEEEEEeCCCCCceEEecccCCCceEEEEEEe
Q psy8629         141 VIDENDNAPTFPSPFMSIEFPENTPRDVKRTLNAAKDRDL---GIYNTQRYNIVSGNTNNAFRLSSHREKDGVLYLDLQI  217 (483)
Q Consensus       141 V~DvND~~P~f~~~~~~~~V~E~~~~gt~v~~v~a~D~D~---~~~~~~~y~l~~~~~~~~F~i~~~~~~~g~~~~~l~~  217 (483)
                      +.+ |+++|.=.-..|...|++|+-+...- +-...|...   -+.+..-|.|....++...-|....| ++.       
T Consensus        63 l~~-n~~~pa~ly~~FEYkiP~NADps~tp-q~mv~dG~~~i~IPaG~YDy~I~~P~~~~kiwIaGd~g-~~~-------  132 (161)
T PF10365_consen   63 LWD-NCNVPANLYDPFEYKIPANADPSTTP-QNMVVDGEASIDIPAGTYDYCIAAPQPGGKIWIAGDGG-DGP-------  132 (161)
T ss_pred             ccc-CCCCChhhcccceEeccCCCCCccCc-ceEEecCceEEEecCceeEEEEecCCCCCeEEEecCCC-CCC-------
Confidence            344 77788766677889999997653322 211222211   12356678887766655555554321 111       


Q ss_pred             ccC--CCCCCCCEEEEEEEEEECCC
Q psy8629         218 NGV--LDRETIPEYSLVIEALDGGS  240 (483)
Q Consensus       218 ~~~--LD~E~~~~~~l~v~a~D~g~  240 (483)
                      .+.  .-+|.-+.|+++++....|.
T Consensus       133 tr~dDy~fEAGKtY~ftm~~~g~gD  157 (161)
T PF10365_consen  133 TRGDDYVFEAGKTYRFTMKRVGSGD  157 (161)
T ss_pred             ccccceEEecCCEEEEEEEeccCCC
Confidence            222  23577889999999887653


No 46 
>PF13754 Big_3_4:  Bacterial Ig-like domain (group 3)
Probab=30.83  E-value=1.7e+02  Score=20.32  Aligned_cols=16  Identities=38%  Similarity=0.320  Sum_probs=13.2

Q ss_pred             CCCeeEEEEEEEECCc
Q psy8629         409 TDPVPKLTVIATDNGV  424 (483)
Q Consensus       409 ~~~~y~l~V~a~D~g~  424 (483)
                      ..+.|.++|+|+|...
T Consensus        22 ~dG~y~itv~a~D~AG   37 (54)
T PF13754_consen   22 ADGTYTITVTATDAAG   37 (54)
T ss_pred             CCccEEEEEEEEeCCC
Confidence            4788999999999743


No 47 
>PF02494 HYR:  HYR domain;  InterPro: IPR003410 This domain is known as the HYR (Hyalin Repeat) domain, after the protein hyalin that is composed exclusively of this repeat. This domain probably corresponds to a new superfamily in the immunoglobulin fold. The function of this domain is uncertain it may be involved in cell adhesion. In the Sushi repeat-containing protein (SrpX), this domain is found between two sushi repeats.
Probab=24.29  E-value=1.5e+02  Score=22.30  Aligned_cols=26  Identities=27%  Similarity=0.212  Sum_probs=20.3

Q ss_pred             CCeeEEEEEEEECCcCCceEEEEEEEEE
Q psy8629         410 DPVPKLTVIATDNGVPALSSSATVLVTI  437 (483)
Q Consensus       410 ~~~y~l~V~a~D~g~p~~s~~~~v~I~V  437 (483)
                      .+.|.++..|+|..+  ..+++.+.|+|
T Consensus        56 ~G~t~V~ytA~D~~G--N~a~C~f~V~V   81 (81)
T PF02494_consen   56 VGTTTVTYTATDAAG--NSATCSFTVTV   81 (81)
T ss_pred             eceEEEEEEEEECCC--CEEEEEEEEEC
Confidence            346899999999744  57788888875


No 48 
>PF09100 Qn_am_d_aIV:  Quinohemoprotein amine dehydrogenase, alpha subunit domain IV;  InterPro: IPR015184 This domain is predominantly found in the prokaryotic protein quinohemoprotein amine dehydrogenase, adopting an immunoglobulin-like beta-sandwich fold, with seven strands arranged into two beta sheets; the fold is possibly related to the immunoglobulin and/or fibronectin type III superfamilies. The precise function of this domain has not, as yet, been defined []. ; PDB: 1JMZ_A 1JMX_A 1PBY_A 1JJU_A.
Probab=22.00  E-value=2e+02  Score=24.08  Aligned_cols=31  Identities=35%  Similarity=0.543  Sum_probs=16.5

Q ss_pred             EEEEEEEEC-CcCCceEEEEEEEEEEecCCCCC
Q psy8629         414 KLTVIATDN-GVPALSSSATVLVTIHDVNDNEP  445 (483)
Q Consensus       414 ~l~V~a~D~-g~p~~s~~~~v~I~V~dvNd~~P  445 (483)
                      +|.|.|+=. +.-+++..+.+.|+|...|+ ||
T Consensus       101 nl~VvAtv~d~~~~l~~e~~liVtVqr~~~-pp  132 (133)
T PF09100_consen  101 NLKVVATVKDGGKPLTGEAHLIVTVQRWNN-PP  132 (133)
T ss_dssp             EEEEEEEETTTT---EEEEEEEEE---S----S
T ss_pred             cEEEEEEEccCCcccceeEeEEEEeecccC-CC
Confidence            666666654 33468999999999999988 55


No 49 
>PF02494 HYR:  HYR domain;  InterPro: IPR003410 This domain is known as the HYR (Hyalin Repeat) domain, after the protein hyalin that is composed exclusively of this repeat. This domain probably corresponds to a new superfamily in the immunoglobulin fold. The function of this domain is uncertain it may be involved in cell adhesion. In the Sushi repeat-containing protein (SrpX), this domain is found between two sushi repeats.
Probab=21.60  E-value=3.5e+02  Score=20.19  Aligned_cols=24  Identities=17%  Similarity=0.188  Sum_probs=20.0

Q ss_pred             EEEEEEEEEECCCCCeeeEEEEEEEE
Q psy8629         228 EYSLVIEALDGGSPPLRGSMIVNVTI  253 (483)
Q Consensus       228 ~~~l~v~a~D~g~p~~~~~~~v~I~V  253 (483)
                      .|.++..|+|..+  .++++.+.|+|
T Consensus        58 ~t~V~ytA~D~~G--N~a~C~f~V~V   81 (81)
T PF02494_consen   58 TTTVTYTATDAAG--NSATCSFTVTV   81 (81)
T ss_pred             eEEEEEEEEECCC--CEEEEEEEEEC
Confidence            6889999999766  68888888875


Done!