Query         psy5884
Match_columns 195
No_of_seqs    193 out of 1323
Neff          9.5 
Searched_HMMs 46136
Date          Fri Aug 16 20:36:06 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy5884.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/5884hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG4289|consensus              100.0 1.7E-35 3.7E-40  258.8  16.6  186    3-193   221-418 (2531)
  2 KOG4289|consensus              100.0 1.4E-33   3E-38  247.0  19.4  186    3-192   735-932 (2531)
  3 KOG1219|consensus              100.0 8.7E-30 1.9E-34  229.6  19.0  179    2-187  2130-2313(4289)
  4 KOG1219|consensus              100.0 7.1E-29 1.5E-33  223.8  21.7  180    3-187   903-1090(4289)
  5 cd00031 CA Cadherin repeat dom  99.9 6.4E-24 1.4E-28  159.5  20.7  139    3-145    54-198 (199)
  6 cd00031 CA Cadherin repeat dom  99.9 6.4E-22 1.4E-26  148.6  18.7  126   57-185     2-132 (199)
  7 PF00028 Cadherin:  Cadherin do  99.7 1.7E-16 3.8E-21  105.2  12.7   87   57-145     1-93  (93)
  8 KOG1834|consensus               99.7 4.4E-16 9.6E-21  130.0  14.6  137    2-145    94-243 (952)
  9 smart00112 CA Cadherin repeats  99.6 5.7E-15 1.2E-19   94.8  10.2   75   76-153     1-79  (79)
 10 smart00112 CA Cadherin repeats  99.2 3.5E-11 7.5E-16   77.1   7.1   47    3-51     33-79  (79)
 11 KOG1834|consensus               99.2 2.1E-10 4.6E-15   96.6  13.0  142   42-186    22-179 (952)
 12 PF00028 Cadherin:  Cadherin do  98.3 3.1E-06 6.7E-11   55.6   6.9   40    3-44     53-93  (93)
 13 PF08758 Cadherin_pro:  Cadheri  97.2  0.0064 1.4E-07   39.7   9.1   77   50-135     3-82  (90)
 14 smart00736 CADG Dystroglycan-t  97.1   0.023 4.9E-07   37.6  11.0   72   75-150    24-96  (97)
 15 PF08266 Cadherin_2:  Cadherin-  97.0 0.00081 1.8E-08   43.3   3.4   59   59-120     5-66  (84)
 16 PF13750 Big_3_3:  Bacterial Ig  97.0    0.07 1.5E-06   38.7  16.1  132   10-145     8-148 (158)
 17 TIGR01965 VCBS_repeat VCBS rep  94.2       1 2.2E-05   29.9   8.9   86   71-167     2-97  (99)
 18 PF05345 He_PIG:  Putative Ig d  90.2     2.2 4.7E-05   24.3   5.9   36   97-132    12-48  (49)
 19 TIGR03660 T1SS_rpt_143 T1SS-14  88.7     2.7 5.9E-05   29.6   6.7   60    1-65     68-128 (137)
 20 smart00736 CADG Dystroglycan-t  88.3     1.8   4E-05   28.3   5.4   31   16-48     66-96  (97)
 21 PF07495 Y_Y_Y:  Y_Y_Y domain;   80.2     4.6 9.9E-05   24.0   4.2   30   15-44     37-66  (66)
 22 PF03160 Calx-beta:  Calx-beta   78.9      15 0.00033   23.8   8.8   50   40-92      2-52  (100)
 23 TIGR01965 VCBS_repeat VCBS rep  77.3      12 0.00026   24.8   5.7   41   16-63     56-96  (99)
 24 PF08758 Cadherin_pro:  Cadheri  77.2       2 4.4E-05   27.9   2.0   35  151-185     2-36  (90)
 25 PF07495 Y_Y_Y:  Y_Y_Y domain;   71.6      18 0.00039   21.3   8.8   58   82-144     6-65  (66)
 26 KOG3597|consensus               70.9      66  0.0014   27.5  13.4   58   35-93     24-83  (442)
 27 TIGR00845 caca sodium/calcium   67.0 1.1E+02  0.0025   28.7  12.6  118   46-168   395-543 (928)
 28 PF12245 Big_3_2:  Bacterial Ig  64.9      27 0.00057   20.6   5.6   27  119-145    22-48  (60)
 29 PF02010 REJ:  REJ domain;  Int  64.0      16 0.00035   30.7   5.1  131   17-155    50-193 (440)
 30 TIGR00845 caca sodium/calcium   63.9 1.3E+02  0.0029   28.3  13.1   56   35-93    513-569 (928)
 31 PF13754 Big_3_4:  Bacterial Ig  59.9      31 0.00068   19.8   5.0   27  118-144    22-49  (54)
 32 TIGR03660 T1SS_rpt_143 T1SS-14  59.4      60  0.0013   22.9   9.0   54  110-168    70-128 (137)
 33 smart00089 PKD Repeats in poly  55.5      41 0.00088   20.5   4.8   25   16-43     54-78  (79)
 34 PF13750 Big_3_3:  Bacterial Ig  53.2      83  0.0018   22.7   7.7   37  119-157    14-51  (158)
 35 cd00146 PKD polycystic kidney   51.7      41  0.0009   20.6   4.4   15   16-30     56-70  (81)
 36 cd00146 PKD polycystic kidney   51.4      55  0.0012   20.0   8.5   31  114-144    51-81  (81)
 37 KOG4221|consensus               49.5 2.5E+02  0.0054   27.5  10.3  105   16-132   584-697 (1381)
 38 PF02494 HYR:  HYR domain;  Int  43.9      57  0.0012   20.2   4.1   24  121-144    58-81  (81)
 39 PF12245 Big_3_2:  Bacterial Ig  36.9      90  0.0019   18.3   5.7   30   16-47     22-51  (60)
 40 PF15418 DUF4625:  Domain of un  34.6 1.1E+02  0.0023   21.5   4.5   26  119-144   106-131 (132)
 41 KOG3597|consensus               33.7 1.3E+02  0.0029   25.7   5.7   50  136-186    24-73  (442)
 42 PF05895 DUF859:  Siphovirus pr  31.6 3.9E+02  0.0085   24.1  13.3  107   16-132   297-424 (624)
 43 PF09100 Qn_am_d_aIV:  Quinohem  28.7 1.8E+02  0.0039   20.3   4.7   31   18-48     99-130 (133)
 44 cd02848 Chitinase_N_term Chiti  28.2      88  0.0019   21.1   3.1   28    9-38     74-101 (106)
 45 PF08329 ChitinaseA_N:  Chitina  26.1 1.2E+02  0.0026   21.3   3.6   28  118-145    81-108 (133)
 46 KOG4221|consensus               26.1 6.5E+02   0.014   24.9  13.0   68   86-154   552-620 (1381)
 47 cd01758 PLAT_LPL PLAT/ LH2 dom  24.8 2.5E+02  0.0055   19.7   7.6   25  121-145     2-27  (137)

No 1  
>KOG4289|consensus
Probab=100.00  E-value=1.7e-35  Score=258.84  Aligned_cols=186  Identities=27%  Similarity=0.419  Sum_probs=172.5

Q ss_pred             EecCCCCCCCCCCCCeEEEEEEEEECCCCCeEeeEEEEEEEEecCCCCCccCCCC-cEEEEcCCC-CceEEEEEEeeCCC
Q psy5884           3 VVSKENLDRDPPNPGKFRFQVIAREKQGTAASVPVSLTVNLNDINDNAPRLPMIP-PIQIQAGEA-RRQIVKIEATDNDL   80 (195)
Q Consensus         3 i~~~~~LD~E~~~~~~~~l~v~a~D~~~~~~~~~~~v~I~V~DvNDn~P~f~~~~-~~~v~e~~~-g~~v~~v~a~D~D~   80 (195)
                      |.+++.||||.  ...+.|.|.|.|.+.|.++++++|+|.|+|.|||.|+|++.. ...+.|+.. |..|.+|.|+|.|.
T Consensus       221 irta~~lDREt--~e~HvlrVtA~d~~~P~~SAtttv~V~V~D~nDhsPvFEq~~Y~e~lREn~evGy~vLtvrAtD~Ds  298 (2531)
T KOG4289|consen  221 IRTAKSLDRET--KETHVLRVTAQDHGDPRRSATTTVTVLVLDTNDHSPVFEQDEYREELRENLEVGYEVLTVRATDGDS  298 (2531)
T ss_pred             chhhhhhhhhh--hheeEEEEEeeecCCCcccceeEEEEEEeecCCCCcccchhHHHHHHhhccccCceEEEEEeccCCC
Confidence            56789999999  789999999999999999999999999999999999999974 578899988 99999999999999


Q ss_pred             CCCceEEEEEEeeCCCCeecEEEeCCCCEEEEceeCCCC--CEEEEEEEEEECCC--CceeEEEEEEEEcCCCCCCCeEe
Q psy5884          81 GVNAEITYSIYHVSNTGLHKFRIDPRSGVIETTGKLIAG--EQYSITVQATDTGG--KSSQTIVEVSVVPGPNTRSPVFQ  156 (195)
Q Consensus        81 ~~n~~~~y~i~~~~~~~~~~F~id~~tG~i~~~~~ld~~--~~~~l~v~a~D~~~--~~~~~~v~i~V~~d~nd~~P~f~  156 (195)
                      ++|+.+.|++..  |.++..|.||+.+|.|++..++|+|  +.|.|.|.|+|.|.  -..+++|.|+|. |+|||+|+|.
T Consensus       299 p~Nani~Yrl~e--g~~~~~f~in~rSGvI~T~a~lDRE~~~~y~L~VeAsDqG~~pgp~Ta~V~itV~-D~NDNaPqFs  375 (2531)
T KOG4289|consen  299 PPNANIRYRLLE--GNAKNVFEINPRSGVISTRAPLDREELESYQLDVEASDQGRPPGPRTAMVEITVE-DENDNAPQFS  375 (2531)
T ss_pred             CCCCceEEEecC--CCccceeEEcCccceeeccCccCHHhhhheEEEEEeccCCCCCCCceEEEEEEEE-ecCCCCcccc
Confidence            999999999976  5678899999999999999999988  47999999999986  234899999999 9999999999


Q ss_pred             cCeeEEEEecCCCCCeEEEEEEEEeCCCCCC------cCcccc
Q psy5884         157 QNTYDVNVSEGASINSTVATITVSRLASGCA------IMNKEE  193 (195)
Q Consensus       157 ~~~~~~~v~E~~~~g~~v~~v~a~D~D~g~~------i~~~~~  193 (195)
                      ...|.+.|+|+..++++|++++|+|.|.|.|      |++||.
T Consensus       376 e~~Yvvqv~Edvt~~avvlrV~AtDrD~g~Ng~VHYsi~Sgn~  418 (2531)
T KOG4289|consen  376 EKRYVVQVREDVTPPAVVLRVTATDRDKGTNGKVHYSIASGNG  418 (2531)
T ss_pred             ccceEEEecccCCCCceEEEEEecccCCCcCceEEEEeeccCc
Confidence            9999999999999999999999999999965      777775


No 2  
>KOG4289|consensus
Probab=100.00  E-value=1.4e-33  Score=247.02  Aligned_cols=186  Identities=27%  Similarity=0.407  Sum_probs=170.1

Q ss_pred             EecCCCCCCCCCCCCeEEEEEEEEECCCCCeEeeEEEEEEEEecCCCCCccCCC-CcEEEEcCCC-CceEEEEEEeeCCC
Q psy5884           3 VVSKENLDRDPPNPGKFRFQVIAREKQGTAASVPVSLTVNLNDINDNAPRLPMI-PPIQIQAGEA-RRQIVKIEATDNDL   80 (195)
Q Consensus         3 i~~~~~LD~E~~~~~~~~l~v~a~D~~~~~~~~~~~v~I~V~DvNDn~P~f~~~-~~~~v~e~~~-g~~v~~v~a~D~D~   80 (195)
                      +.+...||||.  +..|++.+.|+|.+.|+...+.+|.|.|.|+|||+|+|..+ |..+|.|+.| ++.+++|+|+|+|.
T Consensus       735 i~t~~~ld~ed--qvtytl~itA~D~~~pq~adtttveV~v~diNDnaPqf~assyt~sV~Ed~Pv~TsvlQVSatDaD~  812 (2531)
T KOG4289|consen  735 IYTQAELDYED--QVTYTLAITARDNGIPQKADTTTVEVLVNDINDNAPQFLASSYTGSVFEDAPVFTSVLQVSATDADS  812 (2531)
T ss_pred             eEEeeeeeccc--ceeeEeeeeecCCCCCCcCccEEEEEEeecccccCcccchhhceeEeecCCCCcceEEEEEEeccCC
Confidence            45678899999  99999999999999999999999999999999999999987 5789999998 99999999999999


Q ss_pred             CCCceEEEEEEeeCCCCeecEEEeCCCCEEEEceeCCCC--CEEEEEEEEEECCC--CceeEEEEEEEEcCCCCCCCeEe
Q psy5884          81 GVNAEITYSIYHVSNTGLHKFRIDPRSGVIETTGKLIAG--EQYSITVQATDTGG--KSSQTIVEVSVVPGPNTRSPVFQ  156 (195)
Q Consensus        81 ~~n~~~~y~i~~~~~~~~~~F~id~~tG~i~~~~~ld~~--~~~~l~v~a~D~~~--~~~~~~v~i~V~~d~nd~~P~f~  156 (195)
                      |.|+++.|.+..+ ..+.+.|.|++.+|.|++...||+|  .+|.|.+.|.|.|.  ++..+.|+|+|. |+||++|.|.
T Consensus       813 g~Ng~v~y~~qg~-~d~p~~F~IEptSGviRtl~rLdRE~~avy~L~a~avDrg~p~ls~~~eItvtvl-dvNDnaPvfe  890 (2531)
T KOG4289|consen  813 GPNGRVYYTFQGG-DDGPGDFYIEPTSGVIRTLRRLDRENVAVYVLAAYAVDRGNPPLSAPVEITVTVL-DVNDNAPVFE  890 (2531)
T ss_pred             CCCceEEEEecCC-CCCCCceEEccCcceeehhhhhcchheeEEEEEEEEeeCCCCCcCCceEEEEEEE-ecCCCCCCCC
Confidence            9999999988643 3355899999999999999999998  48999999999987  567788888888 9999999999


Q ss_pred             cCeeEEEEecCCCCCeEEEEEEEEeCCCCCC------cCccc
Q psy5884         157 QNTYDVNVSEGASINSTVATITVSRLASGCA------IMNKE  192 (195)
Q Consensus       157 ~~~~~~~v~E~~~~g~~v~~v~a~D~D~g~~------i~~~~  192 (195)
                      +.+|...|.||+++|..++++.|.|||+|.|      |..||
T Consensus       891 ~~e~e~~I~enspvgs~va~i~a~dpdEG~NA~IsYqIvgg~  932 (2531)
T KOG4289|consen  891 QDELELFIEENSPVGSVVALITADDPDEGPNAHISYQIVGGN  932 (2531)
T ss_pred             CcceeeEEeecCccceeeEEEEccCCCcCCcceEEEeeccCc
Confidence            9999999999999999999999999999977      56555


No 3  
>KOG1219|consensus
Probab=99.97  E-value=8.7e-30  Score=229.55  Aligned_cols=179  Identities=30%  Similarity=0.397  Sum_probs=165.1

Q ss_pred             EEecCCCCCCCCCCCCeEEEEEEEEECCCCCeEeeEEEEEEEEecCCCCCccCCC-CcEEEEcCCC-CceEEEEEEeeCC
Q psy5884           2 LVVSKENLDRDPPNPGKFRFQVIAREKQGTAASVPVSLTVNLNDINDNAPRLPMI-PPIQIQAGEA-RRQIVKIEATDND   79 (195)
Q Consensus         2 ~i~~~~~LD~E~~~~~~~~l~v~a~D~~~~~~~~~~~v~I~V~DvNDn~P~f~~~-~~~~v~e~~~-g~~v~~v~a~D~D   79 (195)
                      .|.+..+||||.  ...|.|.|.|+|.+.| +.+++.|.|.|.|+|||||+|.+. |.++++|+.+ |+.+.+|.|+|.|
T Consensus      2130 ~i~v~~pLd~ea--~t~h~l~ieAtd~~~p-~~Aea~VeIiV~dIndn~PvFeqlsYt~sisE~s~igt~viqilATdsD 2206 (4289)
T KOG1219|consen 2130 VILVLIPLDREA--STLHELLIEATDAGIP-LSAEAKVEIIVGDINDNPPVFEQLSYTISISENSKIGTKVIQILATDSD 2206 (4289)
T ss_pred             eEEecccccccc--ccceEEEEEEeccCCC-cceeeEEEEEecccCCCCchhheeeEEEEccCCCccCceEEEEEeccCC
Confidence            367788999999  8899999999999876 999999999999999999999986 6689999998 9999999999999


Q ss_pred             CCCCceEEEEEEeeCCCCeecEEEeCCCCEEEEceeCCCC--CEEEEEEEEEECCCC-ceeEEEEEEEEcCCCCCCCeEe
Q psy5884          80 LGVNAEITYSIYHVSNTGLHKFRIDPRSGVIETTGKLIAG--EQYSITVQATDTGGK-SSQTIVEVSVVPGPNTRSPVFQ  156 (195)
Q Consensus        80 ~~~n~~~~y~i~~~~~~~~~~F~id~~tG~i~~~~~ld~~--~~~~l~v~a~D~~~~-~~~~~v~i~V~~d~nd~~P~f~  156 (195)
                      .  |+.+.|+|.. .......|+|+..||.|++.+.||||  ++|.|.|+|+|+|.+ ++.+.+.|+|. |.||++|.|.
T Consensus      2207 s--n~~isYsl~g-~s~~sk~f~In~sTG~it~~g~ldyE~~q~f~~fvratdggk~lSseviv~V~Ve-D~Ndn~Pef~ 2282 (4289)
T KOG1219|consen 2207 S--NREISYSLEG-NSEISKPFRINVSTGWITVAGKLDYEENQEFRFFVRATDGGKPLSSEVIVEVHVE-DFNDNPPEFN 2282 (4289)
T ss_pred             C--CCceEEEeec-CCccccceEEecccceEEEeeecChhhcceEEEEEEEccCCCcccccEEEEEEeh-hcCCCCchhc
Confidence            9  8899999985 33457899999999999999999998  479999999998875 78899999999 9999999999


Q ss_pred             cCeeEEEEecCCCCCeEEEEEEEEeCCCCCC
Q psy5884         157 QNTYDVNVSEGASINSTVATITVSRLASGCA  187 (195)
Q Consensus       157 ~~~~~~~v~E~~~~g~~v~~v~a~D~D~g~~  187 (195)
                      +..|.+.|+|.+..|..|..+.|.|+|+...
T Consensus      2283 q~~~ea~vsd~a~~g~fit~v~a~D~Dssd~ 2313 (4289)
T KOG1219|consen 2283 QRNYEAFVSDPARSGHFITVVNAHDLDSSDH 2313 (4289)
T ss_pred             cccceeecCCCccceeEEEEEEeccCCccch
Confidence            9999999999999999999999999999854


No 4  
>KOG1219|consensus
Probab=99.97  E-value=7.1e-29  Score=223.78  Aligned_cols=180  Identities=24%  Similarity=0.341  Sum_probs=167.1

Q ss_pred             EecCCCCCCCCCCCCeEEEEEEEEECCCCCeEeeEEEEEEEEecCCC--CCccCCCC-cEEEEcCCC-CceEEEEEEeeC
Q psy5884           3 VVSKENLDRDPPNPGKFRFQVIAREKQGTAASVPVSLTVNLNDINDN--APRLPMIP-PIQIQAGEA-RRQIVKIEATDN   78 (195)
Q Consensus         3 i~~~~~LD~E~~~~~~~~l~v~a~D~~~~~~~~~~~v~I~V~DvNDn--~P~f~~~~-~~~v~e~~~-g~~v~~v~a~D~   78 (195)
                      +++.++||+|+  ...|.|.|.|.|+|.|.+++.+.+.|.++|+|.|  ||.|.... ...|.|+++ |+.+++|.|.|.
T Consensus       903 vfi~~~LDf~k--~~fynLsv~a~d~g~p~lss~chl~Vevldv~enlhpp~F~~~v~e~~V~EnapiGT~vi~i~A~de  980 (4289)
T KOG1219|consen  903 VFIGKPLDFEK--SDFYNLSVTAVDRGTPILSSICHLEVEVLDVNENLHPPEFISFVTEGHVLENAPIGTIVIRIQARDE  980 (4289)
T ss_pred             EEEeccccccc--ccceEEEEEEecCCCcceeeeEEEEEEEeccCCCCCCcchheeeeeeeEeecCCcceEEEEEEEecC
Confidence            45678999999  8999999999999998899999999999999988  99997763 689999999 999999999999


Q ss_pred             CCCCCceEEEEEEeeCCCCeecEEEeCCCCEEEEceeCCCCC--EEEEEEEEEECCC--CceeEEEEEEEEcCCCCCCCe
Q psy5884          79 DLGVNAEITYSIYHVSNTGLHKFRIDPRSGVIETTGKLIAGE--QYSITVQATDTGG--KSSQTIVEVSVVPGPNTRSPV  154 (195)
Q Consensus        79 D~~~n~~~~y~i~~~~~~~~~~F~id~~tG~i~~~~~ld~~~--~~~l~v~a~D~~~--~~~~~~v~i~V~~d~nd~~P~  154 (195)
                      |.|..+.+.|+|..  |++.+.|.||.++|.|++.+.||+|.  .|-|++.|+|.|.  +++.+.+.|.|. |+|||+|+
T Consensus       981 dsgldg~l~Y~I~~--gdg~g~FsId~~tG~irTl~~lDrE~ks~YwltveA~D~gt~~~ssv~~vyI~ie-DvNDn~Pq 1057 (4289)
T KOG1219|consen  981 DSGLDGELSYKIRT--GDGDGIFSIDSTTGSIRTLKALDREKKSSYWLTVEAKDLGTVPLSSVCEVYIEIE-DVNDNVPQ 1057 (4289)
T ss_pred             CCCccceEEEEEEc--CCcceeEEecCCcceEeechhhchhhcceEEEEEEEEecCCCccccceeEEEEEE-ecCCCCcc
Confidence            99999999999986  67889999999999999999999984  7999999999997  677889999999 99999999


Q ss_pred             EecCeeEEEEecCCCCCeEEEEEEEEeCCCCCC
Q psy5884         155 FQQNTYDVNVSEGASINSTVATITVSRLASGCA  187 (195)
Q Consensus       155 f~~~~~~~~v~E~~~~g~~v~~v~a~D~D~g~~  187 (195)
                      |.+..|..+|.|+++.+..|.++.|.|+|+..+
T Consensus      1058 ~s~pvy~asI~enSp~~vsivq~ea~D~Dsssn 1090 (4289)
T KOG1219|consen 1058 FSSPVYYASISENSPETVSIVQAEANDPDSSSN 1090 (4289)
T ss_pred             cCCceEeeeeccCCCCceEEEEeccCCCCcccC
Confidence            999999999999999999999999999996644


No 5  
>cd00031 CA Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here
Probab=99.93  E-value=6.4e-24  Score=159.46  Aligned_cols=139  Identities=32%  Similarity=0.467  Sum_probs=123.3

Q ss_pred             EecCCCCCCCCCCCCeEEEEEEEEECCCCCeEeeEEEEEEEEecCCCCCccCC-CCcEEEEcCCC-CceEEEEEEeeCCC
Q psy5884           3 VVSKENLDRDPPNPGKFRFQVIAREKQGTAASVPVSLTVNLNDINDNAPRLPM-IPPIQIQAGEA-RRQIVKIEATDNDL   80 (195)
Q Consensus         3 i~~~~~LD~E~~~~~~~~l~v~a~D~~~~~~~~~~~v~I~V~DvNDn~P~f~~-~~~~~v~e~~~-g~~v~~v~a~D~D~   80 (195)
                      |.+.+.||||.  ...|.|.|.|+|.|.+.++++..+.|.|.|+||++|.|.. .+.+.+.|+.+ |+.++++.|+|+|.
T Consensus        54 l~~~~~lD~e~--~~~~~l~v~a~D~g~~~~~~~~~v~I~V~d~Nd~~P~~~~~~~~~~v~e~~~~~~~i~~~~a~D~D~  131 (199)
T cd00031          54 ITTTKPLDREE--QSEYTLTVVASDGGGPPLSSTATVTVTVLDVNDNPPVFEQSSYEASVPENAPPGTVVGTVTATDADS  131 (199)
T ss_pred             EEECCCCCCcC--CceEEEEEEEEECCcCcceeEEEEEEEEccCCCCCCcccccceEEEEeCCCCCCCEEEEEEEEcCCC
Confidence            56778999999  8999999999998887777999999999999999999984 46799999987 99999999999999


Q ss_pred             CCCceEEEEEEeeCCCCeecEEEeCCCCEEEEceeCCCC--CEEEEEEEEEECC--CCceeEEEEEEEE
Q psy5884          81 GVNAEITYSIYHVSNTGLHKFRIDPRSGVIETTGKLIAG--EQYSITVQATDTG--GKSSQTIVEVSVV  145 (195)
Q Consensus        81 ~~n~~~~y~i~~~~~~~~~~F~id~~tG~i~~~~~ld~~--~~~~l~v~a~D~~--~~~~~~~v~i~V~  145 (195)
                      +.++.++|+|...  .....|.|++.+|.|.+.+.+|++  ..|.|.|.|+|.+  .++.++++.|.|.
T Consensus       132 ~~~~~~~y~l~~~--~~~~~f~i~~~~G~i~~~~~ld~e~~~~~~l~v~a~D~~~~~~~~~~~i~i~v~  198 (199)
T cd00031         132 GENAKLTYSILSG--NDKELFSIDPNTGIITLAKPLDREEKSSYELTVVATDGGGPPLSSTATVTVTVL  198 (199)
T ss_pred             CCCccEEEEEeCC--CCCCEEEEeCCceEEEeCCccCCccCceEEEEEEEEECCCCCceeEEEEEEEEE
Confidence            9899999999762  224799999999999999999987  3799999999998  5777788888776


No 6  
>cd00031 CA Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here
Probab=99.90  E-value=6.4e-22  Score=148.59  Aligned_cols=126  Identities=36%  Similarity=0.533  Sum_probs=114.2

Q ss_pred             CcEEEEcCCC-CceEEEEEEeeCCCCCCceEEEEEEeeCCCCeecEEEeCCCCEEEEceeCCCC--CEEEEEEEEEECCC
Q psy5884          57 PPIQIQAGEA-RRQIVKIEATDNDLGVNAEITYSIYHVSNTGLHKFRIDPRSGVIETTGKLIAG--EQYSITVQATDTGG  133 (195)
Q Consensus        57 ~~~~v~e~~~-g~~v~~v~a~D~D~~~n~~~~y~i~~~~~~~~~~F~id~~tG~i~~~~~ld~~--~~~~l~v~a~D~~~  133 (195)
                      +.+.+.|+.+ |+.++++.|+|+|.+.++.+.|+|..  +....+|.|++.+|.|++.+.||+|  ..|.|.|.|+|.|.
T Consensus         2 ~~~~i~En~~~g~~v~~~~a~D~D~~~~~~~~y~i~~--~~~~~~F~i~~~tG~l~~~~~lD~e~~~~~~l~v~a~D~g~   79 (199)
T cd00031           2 YSVSVPENAPPGTVVGTVSATDPDSGENGRVTYSILG--GNEDGLFSIDPNTGVITTTKPLDREEQSEYTLTVVASDGGG   79 (199)
T ss_pred             eEEEEeCCCCCCCEEEEEEEECCCCCCCceEEEEEeC--CCCcccEEEeCCCCEEEECCCCCCcCCceEEEEEEEEECCc
Confidence            4578999998 99999999999999888999999986  3333799999999999999999987  48999999999876


Q ss_pred             Cc--eeEEEEEEEEcCCCCCCCeEecCeeEEEEecCCCCCeEEEEEEEEeCCCC
Q psy5884         134 KS--SQTIVEVSVVPGPNTRSPVFQQNTYDVNVSEGASINSTVATITVSRLASG  185 (195)
Q Consensus       134 ~~--~~~~v~i~V~~d~nd~~P~f~~~~~~~~v~E~~~~g~~v~~v~a~D~D~g  185 (195)
                      +.  ....+.|.|. |+|+++|.|....|.+.+.|+.++|+.++++.|.|+|.+
T Consensus        80 ~~~~~~~~v~I~V~-d~Nd~~P~~~~~~~~~~v~e~~~~~~~i~~~~a~D~D~~  132 (199)
T cd00031          80 PPLSSTATVTVTVL-DVNDNPPVFEQSSYEASVPENAPPGTVVGTVTATDADSG  132 (199)
T ss_pred             CcceeEEEEEEEEc-cCCCCCCcccccceEEEEeCCCCCCCEEEEEEEEcCCCC
Confidence            55  7889999999 999999999989999999999999999999999999986


No 7  
>PF00028 Cadherin:  Cadherin domain;  InterPro: IPR002126 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion.; GO: 0005509 calcium ion binding, 0007156 homophilic cell adhesion, 0016020 membrane; PDB: 2A4E_A 2A4C_B 2O72_A 2QVI_A 1NCJ_A 3Q2W_A 3Q2N_A 3LNH_B 3LNI_A 3Q2L_A ....
Probab=99.73  E-value=1.7e-16  Score=105.16  Aligned_cols=87  Identities=33%  Similarity=0.455  Sum_probs=75.2

Q ss_pred             CcEEEEcCCC-CceEEEEEEeeCCCCCCceEEEEEEeeCCCCeecEEEeCCCCEEEEceeCCCC--CEEEEEEEEEEC-C
Q psy5884          57 PPIQIQAGEA-RRQIVKIEATDNDLGVNAEITYSIYHVSNTGLHKFRIDPRSGVIETTGKLIAG--EQYSITVQATDT-G  132 (195)
Q Consensus        57 ~~~~v~e~~~-g~~v~~v~a~D~D~~~n~~~~y~i~~~~~~~~~~F~id~~tG~i~~~~~ld~~--~~~~l~v~a~D~-~  132 (195)
                      |.+.++|+.+ |+.++++.|.|+|.+.|+.+.|+|..  +....+|.|++.+|.|++.++||+|  ..|.|.|.|+|. +
T Consensus         1 Y~~~v~E~~~~g~~v~~v~a~D~D~~~n~~i~y~i~~--~~~~~~F~I~~~tg~i~~~~~LD~E~~~~y~l~v~a~D~~~   78 (93)
T PF00028_consen    1 YSFSVPENAPPGTVVGQVTATDPDSGPNSQITYSILG--GNPDGLFSIDPNTGEISLKKPLDRETQSSYQLTVRATDSGG   78 (93)
T ss_dssp             EEEEEETTGSTSSEEEEEEEEESSTSTTSSEEEEEEE--TTSTTSEEEETTTTEEEESSSSCTTTTSEEEEEEEEEETTT
T ss_pred             CEEEEECCCCCCCEEEEEEEEeCCCCCCceEEEEEec--CcccCceEEeeeeeccccceecCcccCCEEEEEEEEEECCC
Confidence            3578999987 99999999999999999999999987  4446899999999999999999987  479999999999 5


Q ss_pred             C--CceeEEEEEEEE
Q psy5884         133 G--KSSQTIVEVSVV  145 (195)
Q Consensus       133 ~--~~~~~~v~i~V~  145 (195)
                      .  ++++++|.|+|+
T Consensus        79 ~~~~~~~~~V~I~V~   93 (93)
T PF00028_consen   79 SPPLSSTATVTINVL   93 (93)
T ss_dssp             SSEEEEEEEEEEEEE
T ss_pred             CCCCEEEEEEEEEEC
Confidence            5  456667777664


No 8  
>KOG1834|consensus
Probab=99.70  E-value=4.4e-16  Score=130.01  Aligned_cols=137  Identities=28%  Similarity=0.401  Sum_probs=114.5

Q ss_pred             EEecCCCCCCCCCCCCeEEEEEEEEECCC-C-----CeEeeEEEEEEEEecCCCCCccCCC-CcEEEEcCCCCceEEEEE
Q psy5884           2 LVVSKENLDRDPPNPGKFRFQVIAREKQG-T-----AASVPVSLTVNLNDINDNAPRLPMI-PPIQIQAGEARRQIVKIE   74 (195)
Q Consensus         2 ~i~~~~~LD~E~~~~~~~~l~v~a~D~~~-~-----~~~~~~~v~I~V~DvNDn~P~f~~~-~~~~v~e~~~g~~v~~v~   74 (195)
                      +|.+..+||-|.  ++.|+|+|+|.|.|. |     ..+-.++|.|+|.|+|+.+|.|..+ |.+.|.|......|+++.
T Consensus        94 vlRaK~~lDCel--qkeytf~iQAydCg~gpdgtn~kKShkatvhIrVkDvNe~AP~f~ep~Yka~V~EGK~yd~il~ve  171 (952)
T KOG1834|consen   94 VLRAKEPLDCEL--QKEYTFTIQAYDCGNGPDGTNTKKSHKATVHIRVKDVNEFAPVFKEPWYKAHVTEGKVYDSILRVE  171 (952)
T ss_pred             EEeecCcccccc--cccceEEEEEEecCCCCCccccccccceEEEEEeccccccCchhcccceeeEEecceeeeeeEEEE
Confidence            467889999999  899999999999763 3     5677889999999999999999887 668888887788999999


Q ss_pred             EeeCCCC-CCceE-EEEEEeeCCCCeecEEEeCCCCEEEEceeCCC--CCEEEEEEEEEECCCCce--eEEEEEEEE
Q psy5884          75 ATDNDLG-VNAEI-TYSIYHVSNTGLHKFRIDPRSGVIETTGKLIA--GEQYSITVQATDTGGKSS--QTIVEVSVV  145 (195)
Q Consensus        75 a~D~D~~-~n~~~-~y~i~~~~~~~~~~F~id~~tG~i~~~~~ld~--~~~~~l~v~a~D~~~~~~--~~~v~i~V~  145 (195)
                      |.|.|-+ ++++| .|.|..    ..-+|.||. .|.|+.+.+|.|  +.+|.|+|.|.|+|.+.+  .+.|+|+|.
T Consensus       172 AiD~DCspq~sqIC~YEI~t----~d~PFaIdn-~G~irnTekLny~ke~~Y~ltVtAyDCg~kraa~d~lV~v~Vk  243 (952)
T KOG1834|consen  172 AIDKDCSPQYSQICEYEITT----PDVPFAIDN-DGNIRNTEKLNYTKEHQYKLTVTAYDCGKKRAASDSLVTVHVK  243 (952)
T ss_pred             eecCCCCCcccceeEEEecC----CCCceEEcC-CCccccccccccccceeEEEEEEEEecccccccCcceEEEEec
Confidence            9999965 57776 578864    456899977 799999999986  468999999999997433  367777776


No 9  
>smart00112 CA Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
Probab=99.63  E-value=5.7e-15  Score=94.84  Aligned_cols=75  Identities=44%  Similarity=0.609  Sum_probs=66.2

Q ss_pred             eeCCCCCCceEEEEEEeeCCCCeecEEEeCCCCEEEEceeCCCC--CEEEEEEEEEECCC--CceeEEEEEEEEcCCCCC
Q psy5884          76 TDNDLGVNAEITYSIYHVSNTGLHKFRIDPRSGVIETTGKLIAG--EQYSITVQATDTGG--KSSQTIVEVSVVPGPNTR  151 (195)
Q Consensus        76 ~D~D~~~n~~~~y~i~~~~~~~~~~F~id~~tG~i~~~~~ld~~--~~~~l~v~a~D~~~--~~~~~~v~i~V~~d~nd~  151 (195)
                      +|+|.|.|+.+.|+|..  +....+|.|++.+|.|++.++||+|  ..|.|.|.|.|.+.  +++.++|.|+|. |+|++
T Consensus         1 ~D~D~g~n~~i~Y~i~~--~~~~~~F~i~~~tg~i~~~~~LD~e~~~~y~l~v~a~D~~~~~~~~~~~v~I~V~-D~Nd~   77 (79)
T smart00112        1 TDADSGENGKVTYSILS--GNEDGLFSIDPETGEITTTKPLDREEQPEYTLTVEATDGGGPPLSSTATVTVTVL-DVNDN   77 (79)
T ss_pred             CCCCCCcCcEEEEEEec--CCCCCEEEEeCCccEEEeCCccCeeCCCeEEEEEEEEECCCCCcccEEEEEEEEE-ECCCC
Confidence            48899999999999986  3333799999999999999999987  47999999999987  688899999999 99999


Q ss_pred             CC
Q psy5884         152 SP  153 (195)
Q Consensus       152 ~P  153 (195)
                      +|
T Consensus        78 ~P   79 (79)
T smart00112       78 AP   79 (79)
T ss_pred             CC
Confidence            98


No 10 
>smart00112 CA Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
Probab=99.24  E-value=3.5e-11  Score=77.10  Aligned_cols=47  Identities=32%  Similarity=0.541  Sum_probs=44.0

Q ss_pred             EecCCCCCCCCCCCCeEEEEEEEEECCCCCeEeeEEEEEEEEecCCCCC
Q psy5884           3 VVSKENLDRDPPNPGKFRFQVIAREKQGTAASVPVSLTVNLNDINDNAP   51 (195)
Q Consensus         3 i~~~~~LD~E~~~~~~~~l~v~a~D~~~~~~~~~~~v~I~V~DvNDn~P   51 (195)
                      |++.++||||.  ...|.|.|.|+|.++++++++++|.|.|.|+|||+|
T Consensus        33 i~~~~~LD~e~--~~~y~l~v~a~D~~~~~~~~~~~v~I~V~D~Nd~~P   79 (79)
T smart00112       33 ITTTKPLDREE--QPEYTLTVEATDGGGPPLSSTATVTVTVLDVNDNAP   79 (79)
T ss_pred             EEeCCccCeeC--CCeEEEEEEEEECCCCCcccEEEEEEEEEECCCCCC
Confidence            56677999999  889999999999999899999999999999999998


No 11 
>KOG1834|consensus
Probab=99.23  E-value=2.1e-10  Score=96.58  Aligned_cols=142  Identities=20%  Similarity=0.280  Sum_probs=106.9

Q ss_pred             EEEecCCCCCccCCCCcEEEEcCCC-CceEEEEEEeeCCCC--CCceEE-EEEEeeCCCCeecEEEeCCC--CEEEEcee
Q psy5884          42 NLNDINDNAPRLPMIPPIQIQAGEA-RRQIVKIEATDNDLG--VNAEIT-YSIYHVSNTGLHKFRIDPRS--GVIETTGK  115 (195)
Q Consensus        42 ~V~DvNDn~P~f~~~~~~~v~e~~~-g~~v~~v~a~D~D~~--~n~~~~-y~i~~~~~~~~~~F~id~~t--G~i~~~~~  115 (195)
                      ..--+|-+.|.....|...|.|+.. -...--+.|.|.|.+  -.+.+. |.|- +.+-+...--+|..|  |.|+.+.+
T Consensus        22 ~aarankhkpwie~ey~gvV~Endntvll~Ppl~aLdkdaplr~ageiC~fklh-gq~vPFdavVvdK~TGegvlRaK~~  100 (952)
T KOG1834|consen   22 HAARANKHKPWIEEEYHGVVTENDNTVLLDPPLAALDKDAPLRYAGEICGFKLH-GQPVPFDAVVVDKYTGEGVLRAKEP  100 (952)
T ss_pred             ccccccccCcccccceeEEEEeCCceEEeCCCeeeecCCCCcccccccceeEec-CCCCCceEEEEeccCCceEEeecCc
Confidence            4455788899988888888888753 222234778888864  233454 4443 222223333447767  45888899


Q ss_pred             CCCC--CEEEEEEEEEECCC--------CceeEEEEEEEEcCCCCCCCeEecCeeEEEEecCCCCCeEEEEEEEEeCCCC
Q psy5884         116 LIAG--EQYSITVQATDTGG--------KSSQTIVEVSVVPGPNTRSPVFQQNTYDVNVSEGASINSTVATITVSRLASG  185 (195)
Q Consensus       116 ld~~--~~~~l~v~a~D~~~--------~~~~~~v~i~V~~d~nd~~P~f~~~~~~~~v~E~~~~g~~v~~v~a~D~D~g  185 (195)
                      +|=+  +.|+|+|+|.|+|.        ++..++|.|+|. |+|..+|.|....|.+.|.|+. .-..|+++.|.|.|.+
T Consensus       101 lDCelqkeytf~iQAydCg~gpdgtn~kKShkatvhIrVk-DvNe~AP~f~ep~Yka~V~EGK-~yd~il~veAiD~DCs  178 (952)
T KOG1834|consen  101 LDCELQKEYTFTIQAYDCGNGPDGTNTKKSHKATVHIRVK-DVNEFAPVFKEPWYKAHVTEGK-VYDSILRVEAIDKDCS  178 (952)
T ss_pred             ccccccccceEEEEEEecCCCCCccccccccceEEEEEec-cccccCchhcccceeeEEecce-eeeeeEEEEeecCCCC
Confidence            9954  67999999999874        577899999999 9999999999999999999984 6778999999999998


Q ss_pred             C
Q psy5884         186 C  186 (195)
Q Consensus       186 ~  186 (195)
                      .
T Consensus       179 p  179 (952)
T KOG1834|consen  179 P  179 (952)
T ss_pred             C
Confidence            3


No 12 
>PF00028 Cadherin:  Cadherin domain;  InterPro: IPR002126 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion.; GO: 0005509 calcium ion binding, 0007156 homophilic cell adhesion, 0016020 membrane; PDB: 2A4E_A 2A4C_B 2O72_A 2QVI_A 1NCJ_A 3Q2W_A 3Q2N_A 3LNH_B 3LNI_A 3Q2L_A ....
Probab=98.31  E-value=3.1e-06  Score=55.61  Aligned_cols=40  Identities=23%  Similarity=0.385  Sum_probs=37.1

Q ss_pred             EecCCCCCCCCCCCCeEEEEEEEEEC-CCCCeEeeEEEEEEEE
Q psy5884           3 VVSKENLDRDPPNPGKFRFQVIAREK-QGTAASVPVSLTVNLN   44 (195)
Q Consensus         3 i~~~~~LD~E~~~~~~~~l~v~a~D~-~~~~~~~~~~v~I~V~   44 (195)
                      |.+.++||||+  ...|.|.|.|+|. +.|+++++++|+|.|+
T Consensus        53 i~~~~~LD~E~--~~~y~l~v~a~D~~~~~~~~~~~~V~I~V~   93 (93)
T PF00028_consen   53 ISLKKPLDRET--QSSYQLTVRATDSGGSPPLSSTATVTINVL   93 (93)
T ss_dssp             EEESSSSCTTT--TSEEEEEEEEEETTTSSEEEEEEEEEEEEE
T ss_pred             cccceecCccc--CCEEEEEEEEEECCCCCCCEEEEEEEEEEC
Confidence            67889999999  8999999999999 7889999999999885


No 13 
>PF08758 Cadherin_pro:  Cadherin prodomain like;  InterPro: IPR014868 Cadherins are a group of proteins that mediate calcium dependent cell-cell adhesion. They are activated through cleavage of a prosequence in the late Golgi. This protein corresponds to the folded region of the prosequence, and is termed the prodomain. The prodomain shows structural resemblance to the cadherin domain, but lacks all the features known to be important for cadherin-cadherin interactions []. ; GO: 0007155 cell adhesion, 0016021 integral to membrane; PDB: 1OP4_A.
Probab=97.22  E-value=0.0064  Score=39.70  Aligned_cols=77  Identities=17%  Similarity=0.211  Sum_probs=40.5

Q ss_pred             CCccCCC-CcEEEEcCCC-CceEEEEEEeeCCCCCCceEEEEEEeeCCCCeecEEEeCCCCEEEEceeCC-CCCEEEEEE
Q psy5884          50 APRLPMI-PPIQIQAGEA-RRQIVKIEATDNDLGVNAEITYSIYHVSNTGLHKFRIDPRSGVIETTGKLI-AGEQYSITV  126 (195)
Q Consensus        50 ~P~f~~~-~~~~v~e~~~-g~~v~~v~a~D~D~~~n~~~~y~i~~~~~~~~~~F~id~~tG~i~~~~~ld-~~~~~~l~v  126 (195)
                      .|-|... +.+.++.+.. |..+++|.-.|-..  +..+.|.-      ....|+|.+ .|.|.+++++. ..++-.|.|
T Consensus         3 ~pGF~~~~~~~~Vp~~l~~g~~lg~V~f~dC~~--~~~~~~~s------sDpdF~V~~-DGsVy~~r~v~l~~~~~~F~V   73 (90)
T PF08758_consen    3 RPGFSQKKYTFEVPSNLEAGQPLGKVNFEDCTG--RRRVIFES------SDPDFRVLE-DGSVYAKRPVQLSSEQRSFTV   73 (90)
T ss_dssp             --B--S-EEEE----SS-SS--EEE---B--SS-----EEEE---------SEEEEET-TTEEEEES--S-SSS-EEEEE
T ss_pred             cCCcccceEEEEcCchhhCCcEEEEEEeccCCC--CCceEEec------CCCCEEEcC-CCeEEEeeeEecCCCceEEEE
Confidence            4667665 4677887765 99999999988854  45588865      234899977 79999999987 445678999


Q ss_pred             EEEECCCCc
Q psy5884         127 QATDTGGKS  135 (195)
Q Consensus       127 ~a~D~~~~~  135 (195)
                      .|.|..+..
T Consensus        74 ~a~D~~~~~   82 (90)
T PF08758_consen   74 HAWDSQTQE   82 (90)
T ss_dssp             EEEETTTTE
T ss_pred             EEECCCCCe
Confidence            999987654


No 14 
>smart00736 CADG Dystroglycan-type cadherin-like domains. Cadherin-homologous domains present in metazoan dystroglycans and alpha/epsilon sarcoglycans, yeast Axl2p and in a very large protein from magnetotactic bacteria. Likely to bind calcium ions.
Probab=97.10  E-value=0.023  Score=37.56  Aligned_cols=72  Identities=26%  Similarity=0.300  Sum_probs=51.0

Q ss_pred             EeeCCCCCCceEEEEEEeeCC-CCeecEEEeCCCCEEEEceeCCCCCEEEEEEEEEECCCCceeEEEEEEEEcCCCC
Q psy5884          75 ATDNDLGVNAEITYSIYHVSN-TGLHKFRIDPRSGVIETTGKLIAGEQYSITVQATDTGGKSSQTIVEVSVVPGPNT  150 (195)
Q Consensus        75 a~D~D~~~n~~~~y~i~~~~~-~~~~~F~id~~tG~i~~~~~ld~~~~~~l~v~a~D~~~~~~~~~v~i~V~~d~nd  150 (195)
                      ..|.|   ...++|++....+ ....+.+.|+.++.+.=.........|.+.|.|+|..+.+....+.|.|. +.|+
T Consensus        24 F~d~d---~~~lty~~~~~~~~~lP~Wl~fd~~~~~~~GtP~~~~~g~~~i~v~a~D~~g~~~~~~f~i~V~-~~~~   96 (97)
T smart00736       24 FTDAD---GDTLTYSATLSDGSALPSWLSFDSDTGTLSGTPTNSDVGSLSLKVTATDSSGASASDTFTITVV-NTND   96 (97)
T ss_pred             eECCC---CCeEEEEEEeCCCCCCCCeEEEeCCCCEEEEECCCCCCcEEEEEEEEEECCCCEEEEEEEEEEe-CCCC
Confidence            46666   3468998864222 23457788888888766433333567999999999888888889999998 6664


No 15 
>PF08266 Cadherin_2:  Cadherin-like;  InterPro: IPR013164 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion. This entry represents a cadherin domain that is usually found at the N terminus of cadherin proteins.; PDB: 1WUZ_A 1WYJ_A.
Probab=97.03  E-value=0.00081  Score=43.34  Aligned_cols=59  Identities=17%  Similarity=0.240  Sum_probs=36.0

Q ss_pred             EEEEcCCC-CceEEEEEEeeCCCCCC--ceEEEEEEeeCCCCeecEEEeCCCCEEEEceeCCCCC
Q psy5884          59 IQIQAGEA-RRQIVKIEATDNDLGVN--AEITYSIYHVSNTGLHKFRIDPRSGVIETTGKLIAGE  120 (195)
Q Consensus        59 ~~v~e~~~-g~~v~~v~a~D~D~~~n--~~~~y~i~~~~~~~~~~F~id~~tG~i~~~~~ld~~~  120 (195)
                      ++|+|..+ |+.|+.| |.|.-....  ..-.|.+.+  .....+|.++..+|.|.+...+|||.
T Consensus         5 YsV~EE~~~Gt~IGni-a~dL~l~~~~l~~~~~ri~s--~~~~~~~~v~~~tG~L~v~~rIDRE~   66 (84)
T PF08266_consen    5 YSVPEEMPPGTVIGNI-AKDLGLDPQSLSSRNFRIVS--EGNSQYFRVNEKTGDLFVSERIDREE   66 (84)
T ss_dssp             EEEESS--TT-EEEEC-CCCCT--HHHHCCTTBEEE---SSSS-SEEE-TTTSEEEESS--SCCC
T ss_pred             EEeecCCCCCCEEEEh-HHhhCCCcccccccceEEee--cCCcceeEecCCceeEEeCCccCHHH
Confidence            67888876 9999999 445433211  113567665  34568999999999999999999985


No 16 
>PF13750 Big_3_3:  Bacterial Ig-like domain (group 3)
Probab=97.03  E-value=0.07  Score=38.66  Aligned_cols=132  Identities=21%  Similarity=0.295  Sum_probs=71.4

Q ss_pred             CCCCCCCCeEEEEE-EEEECCCCCeEeeEEEEEEEEecCCCCCccCCCCcEEEEcCCC--CceEEEEEEeeCCCCCCceE
Q psy5884          10 DRDPPNPGKFRFQV-IAREKQGTAASVPVSLTVNLNDINDNAPRLPMIPPIQIQAGEA--RRQIVKIEATDNDLGVNAEI   86 (195)
Q Consensus        10 D~E~~~~~~~~l~v-~a~D~~~~~~~~~~~v~I~V~DvNDn~P~f~~~~~~~v~e~~~--g~~v~~v~a~D~D~~~n~~~   86 (195)
                      |....+...|.+.+ .|.|..+...+......+.|   ...+|.+.-.....+..+..  +..-..+.++|.-.+. .--
T Consensus         8 d~~~l~dG~Y~l~~~~a~D~agN~~~~~~~~~~~i---D~T~Ptisi~~~~~~~~g~~v~~~~~i~i~~tD~~~~~-~i~   83 (158)
T PF13750_consen    8 DLSTLPDGSYTLTVVTATDAAGNTSTSTVSETFTI---DNTPPTISISDGASVANGSTVYGLVNISINVTDNSDDS-KIT   83 (158)
T ss_pred             EeCcCCCccEEEEEEEEEecCCCEEEEEEeeEEEE---cCCCCEEEEecCCccCCCccccceeeeEEEEEeCCCCc-eEE
Confidence            44444567899999 79999876443333333433   34477754311111111211  3344567787776543 223


Q ss_pred             EEEEEeeCCCCeecEEEeC-CCCEEEEc-----eeCCCCCEEEEEEEEEECCCCceeEEEEEEEE
Q psy5884          87 TYSIYHVSNTGLHKFRIDP-RSGVIETT-----GKLIAGEQYSITVQATDTGGKSSQTIVEVSVV  145 (195)
Q Consensus        87 ~y~i~~~~~~~~~~F~id~-~tG~i~~~-----~~ld~~~~~~l~v~a~D~~~~~~~~~v~i~V~  145 (195)
                      ...+..+.....-...... ..|...+.     ..+...+.|.|+|.|.|..+...+.++.+...
T Consensus        84 sv~l~Gg~~~d~v~ls~~~~~~~~~~~~yp~~fpsle~~~~YtLtV~a~D~aGN~~~~si~F~y~  148 (158)
T PF13750_consen   84 SVSLTGGPASDSVSLSWTNKGNGVYTLEYPRIFPSLEADDSYTLTVSATDKAGNQSTKSISFSYM  148 (158)
T ss_pred             EEEEECCcccceEEEeeEeccCceEEeecccccCCcCCCCeEEEEEEEEecCCCEEEEEEEEEEe
Confidence            4555432111111122222 23333222     23334568999999999998888888888776


No 17 
>TIGR01965 VCBS_repeat VCBS repeat. This domain of about 100 residues is found multiple (up to 35) copies in long proteins from several species of Vibrio, Colwellia, Bradyrhizobium, and Shewanella (hence the name VCBS) and in smaller copy numbers in proteins from several other bacteria. The large protein size and repeat copy numbers, species distribution, and suggested activities of several member proteins suggests a role for this domain in adhesion.
Probab=94.17  E-value=1  Score=29.92  Aligned_cols=86  Identities=23%  Similarity=0.298  Sum_probs=51.8

Q ss_pred             EEEEEeeCCCCCCceEEEEEEeeCCCCeecEEEeCCCCEEEEc--------eeCCCCC--EEEEEEEEEECCCCceeEEE
Q psy5884          71 VKIEATDNDLGVNAEITYSIYHVSNTGLHKFRIDPRSGVIETT--------GKLIAGE--QYSITVQATDTGGKSSQTIV  140 (195)
Q Consensus        71 ~~v~a~D~D~~~n~~~~y~i~~~~~~~~~~F~id~~tG~i~~~--------~~ld~~~--~~~l~v~a~D~~~~~~~~~v  140 (195)
                      ++|.++|+|.++.  .+++.....+ ..+.|.|++ .|.-...        +.|...+  .-.|++.+.|.    .+.+|
T Consensus         2 G~Lt~sD~D~gd~--~~~s~~~~~g-~yGtlti~~-~G~wtYtl~n~~~avq~L~~Ge~~tdsFtvtv~DG----tt~~v   73 (99)
T TIGR01965         2 GQLTISDADAGQA--HFIAQTDAAG-QYGTFSIDA-DGQWTYQADNSQTAVQALKAGETLTDTFTVTSADG----TSQTV   73 (99)
T ss_pred             CceEEeCCCCCCc--eEEecccccC-CcEEEEECC-CCcEEEEeCCCcHHHHhhcCCCEEEEEEEEEEeCC----CeEEE
Confidence            4688999998654  4555532222 456788877 5643222        1222222  36788888994    38899


Q ss_pred             EEEEEcCCCCCCCeEecCeeEEEEecC
Q psy5884         141 EVSVVPGPNTRSPVFQQNTYDVNVSEG  167 (195)
Q Consensus       141 ~i~V~~d~nd~~P~f~~~~~~~~v~E~  167 (195)
                      .|+|. ..||.| ..... -...+.|+
T Consensus        74 tItI~-GtNDap-vi~~~-~~g~v~ED   97 (99)
T TIGR01965        74 TITIT-GANDAA-VIGGA-DTGSVTED   97 (99)
T ss_pred             EEEEE-ccCCCC-EEecc-cceeEecC
Confidence            99999 778654 44322 23566665


No 18 
>PF05345 He_PIG:  Putative Ig domain;  InterPro: IPR008009 This alignment represents the conserved core region of a ~90 residue repeat found in several haemagglutinins and other cell surface proteins. Sequence similarities to Hyalin (IPR003410 from INTERPRO) and the PKD domain (IPR000601 from INTERPRO) suggest an Ig-like fold so this family may be similar in function to the (IPR003791 from INTERPRO) and (IPR003790 from INTERPRO) protein families.
Probab=90.19  E-value=2.2  Score=24.34  Aligned_cols=36  Identities=31%  Similarity=0.367  Sum_probs=26.7

Q ss_pred             CeecEEEeCCCCEEEEceeCC-CCCEEEEEEEEEECC
Q psy5884          97 GLHKFRIDPRSGVIETTGKLI-AGEQYSITVQATDTG  132 (195)
Q Consensus        97 ~~~~F~id~~tG~i~~~~~ld-~~~~~~l~v~a~D~~  132 (195)
                      ......||+.+|.|.-...-. ....|.|.|.|+|..
T Consensus        12 LP~gLs~d~~tG~isGtp~~~~~~G~y~~~vtatd~~   48 (49)
T PF05345_consen   12 LPSGLSLDPSTGTISGTPTSSVQPGTYTFTVTATDGS   48 (49)
T ss_pred             CCCcEEEeCCCCEEEeecCCCccccEEEEEEEEEcCC
Confidence            445688899999986664433 235899999999964


No 19 
>TIGR03660 T1SS_rpt_143 T1SS-143 repeat domain. This model represents a domain of about 143 amino acids that may occur singly or in up to 23 tandem repeats in very large proteins in the genus Vibrio, and in related species such as Legionella pneumophila, Photobacterium profundum, Rhodopseudomonas palustris, Shewanella pealeana, and Aeromonas hydrophila. Proteins with these domains represent a subset of a broader set of proteins with a particular signal for type 1 secretion, consisting of several glycine-rich repeats modeled by pfam00353, followed by a C-terminal domain modeled by TIGR03661. Proteins with this domain tend to share several properties with the RtxA (Repeats in Toxin) protein of Vibrio cholerae, including a large size often containing tandemly repeated domains and a C-terminal signal for type 1 secretion.
Probab=88.72  E-value=2.7  Score=29.64  Aligned_cols=60  Identities=20%  Similarity=0.411  Sum_probs=40.2

Q ss_pred             CEEecCCCCCCCCCC-CCeEEEEEEEEECCCCCeEeeEEEEEEEEecCCCCCccCCCCcEEEEcCC
Q psy5884           1 YLVVSKENLDRDPPN-PGKFRFQVIAREKQGTAASVPVSLTVNLNDINDNAPRLPMIPPIQIQAGE   65 (195)
Q Consensus         1 ~~i~~~~~LD~E~~~-~~~~~l~v~a~D~~~~~~~~~~~v~I~V~DvNDn~P~f~~~~~~~v~e~~   65 (195)
                      |-..+..+||...-+ .-.+.|.|.|+|..+-..  +..+.|.|.|  |. |.......+.+.|..
T Consensus        68 YtftL~~~lDH~~g~d~l~l~~~v~a~D~DGD~s--~~~l~VtI~D--D~-P~~~~~~~~~V~E~~  128 (137)
T TIGR03660        68 YEFTLEGPLDHAAGSDELTLNFPIIATDFDGDTS--SITLPVTIVD--DV-PTITDVDALTVDEDD  128 (137)
T ss_pred             EEEEEcccccCCCCCceEEEeeeEEEEeCCCCcc--ccEEEEEEEC--CC-CeeccccceEEeccc
Confidence            567788888875311 346788899999877533  3478888887  54 665544457777743


No 20 
>smart00736 CADG Dystroglycan-type cadherin-like domains. Cadherin-homologous domains present in metazoan dystroglycans and alpha/epsilon sarcoglycans, yeast Axl2p and in a very large protein from magnetotactic bacteria. Likely to bind calcium ions.
Probab=88.34  E-value=1.8  Score=28.33  Aligned_cols=31  Identities=29%  Similarity=0.499  Sum_probs=26.3

Q ss_pred             CCeEEEEEEEEECCCCCeEeeEEEEEEEEecCC
Q psy5884          16 PGKFRFQVIAREKQGTAASVPVSLTVNLNDIND   48 (195)
Q Consensus        16 ~~~~~l~v~a~D~~~~~~~~~~~v~I~V~DvND   48 (195)
                      ...|.+.|.|+|+.+  .+....+.|.|.+.||
T Consensus        66 ~g~~~i~v~a~D~~g--~~~~~~f~i~V~~~~~   96 (97)
T smart00736       66 VGSLSLKVTATDSSG--ASASDTFTITVVNTND   96 (97)
T ss_pred             CcEEEEEEEEEECCC--CEEEEEEEEEEeCCCC
Confidence            346999999999976  5677889999999887


No 21 
>PF07495 Y_Y_Y:  Y_Y_Y domain;  InterPro: IPR011123 This region is mostly found at the end of the beta propellers (IPR011110 from INTERPRO) in a family of two component regulators. However they are also found tandemly repeated in Q891H4 from SWISSPROT without other signal conduction domains being present. It is named after the conserved tyrosines found in the alignment. The exact function is not known.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=80.23  E-value=4.6  Score=24.02  Aligned_cols=30  Identities=33%  Similarity=0.543  Sum_probs=19.6

Q ss_pred             CCCeEEEEEEEEECCCCCeEeeEEEEEEEE
Q psy5884          15 NPGKFRFQVIAREKQGTAASVPVSLTVNLN   44 (195)
Q Consensus        15 ~~~~~~l~v~a~D~~~~~~~~~~~v~I~V~   44 (195)
                      +...|.|.|.|.|..+.......++.|.|+
T Consensus        37 ~~G~Y~l~V~a~~~~~~~~~~~~~l~i~I~   66 (66)
T PF07495_consen   37 PPGKYTLEVRAKDNNGKWSSDEKSLTITIL   66 (66)
T ss_dssp             -SEEEEEEEEEEETTS-B-SS-EEEEEEEE
T ss_pred             CCEEEEEEEEEECCCCCcCcccEEEEEEEC
Confidence            367899999999987753333366666663


No 22 
>PF03160 Calx-beta:  Calx-beta domain;  InterPro: IPR003644 The calx-beta motif is present as a tandem repeat in the cytoplasmic domains of Calx Na-Ca exchangers, which are used to expel calcium from cells. This motif overlaps domains used for calcium binding and regulation. The calx-beta motif is also present in the cytoplasmic tail of mammalian integrin-beta4, which mediates the bi-directional transfer of signals across the plasma membrane, as well as in some cyanobacterial proteins. This motif contains a series of beta-strands and turns that form a self-contained beta-sheet [, ].; GO: 0007154 cell communication, 0016021 integral to membrane; PDB: 3H6A_B 3FSO_A 3FQ4_B 2DPK_A 2QVM_A 3GIN_B 2QVK_A 2FWU_A 2FWS_A 3E9U_A ....
Probab=78.86  E-value=15  Score=23.83  Aligned_cols=50  Identities=20%  Similarity=0.397  Sum_probs=25.6

Q ss_pred             EEEEEecCCCCCccCCC-CcEEEEcCCCCceEEEEEEeeCCCCCCceEEEEEEe
Q psy5884          40 TVNLNDINDNAPRLPMI-PPIQIQAGEARRQIVKIEATDNDLGVNAEITYSIYH   92 (195)
Q Consensus        40 ~I~V~DvNDn~P~f~~~-~~~~v~e~~~g~~v~~v~a~D~D~~~n~~~~y~i~~   92 (195)
                      +|.|.| ||.+ .+.-. ....+.|+. |.....|....++....-.+.|....
T Consensus         2 tvtI~d-~d~~-~v~f~~~~~~v~E~~-~~~~v~V~~~~~~~~~~v~v~~~~~~   52 (100)
T PF03160_consen    2 TVTILD-DDDP-TVSFSSPSYTVSEGD-GTVTVTVTRSGGSLDGPVTVNYSTVD   52 (100)
T ss_dssp             EEEEE--TTSE-EEEESSSEEEEETTS-SEEEEEEEEESS-TSSEEEEEEEEEE
T ss_pred             EEEEEC-CCCC-EEEEeCCEEEEEeCC-CEEEEEEEEcccCCCcceEEEEEEeC
Confidence            467788 6666 65443 346777764 43444454444433233446666654


No 23 
>TIGR01965 VCBS_repeat VCBS repeat. This domain of about 100 residues is found multiple (up to 35) copies in long proteins from several species of Vibrio, Colwellia, Bradyrhizobium, and Shewanella (hence the name VCBS) and in smaller copy numbers in proteins from several other bacteria. The large protein size and repeat copy numbers, species distribution, and suggested activities of several member proteins suggests a role for this domain in adhesion.
Probab=77.27  E-value=12  Score=24.83  Aligned_cols=41  Identities=15%  Similarity=0.145  Sum_probs=27.4

Q ss_pred             CCeEEEEEEEEECCCCCeEeeEEEEEEEEecCCCCCccCCCCcEEEEc
Q psy5884          16 PGKFRFQVIAREKQGTAASVPVSLTVNLNDINDNAPRLPMIPPIQIQA   63 (195)
Q Consensus        16 ~~~~~l~v~a~D~~~~~~~~~~~v~I~V~DvNDn~P~f~~~~~~~v~e   63 (195)
                      .-.-+|.+.+.|+      .+.+|.|.|.-.||.| +....-...+.|
T Consensus        56 ~~tdsFtvtv~DG------tt~~vtItI~GtNDap-vi~~~~~g~v~E   96 (99)
T TIGR01965        56 TLTDTFTVTSADG------TSQTVTITITGANDAA-VIGGADTGSVTE   96 (99)
T ss_pred             EEEEEEEEEEeCC------CeEEEEEEEEccCCCC-EEecccceeEec
Confidence            4456778888886      2778999999999954 443333344444


No 24 
>PF08758 Cadherin_pro:  Cadherin prodomain like;  InterPro: IPR014868 Cadherins are a group of proteins that mediate calcium dependent cell-cell adhesion. They are activated through cleavage of a prosequence in the late Golgi. This protein corresponds to the folded region of the prosequence, and is termed the prodomain. The prodomain shows structural resemblance to the cadherin domain, but lacks all the features known to be important for cadherin-cadherin interactions []. ; GO: 0007155 cell adhesion, 0016021 integral to membrane; PDB: 1OP4_A.
Probab=77.21  E-value=2  Score=27.93  Aligned_cols=35  Identities=14%  Similarity=0.281  Sum_probs=14.1

Q ss_pred             CCCeEecCeeEEEEecCCCCCeEEEEEEEEeCCCC
Q psy5884         151 RSPVFQQNTYDVNVSEGASINSTVATITVSRLASG  185 (195)
Q Consensus       151 ~~P~f~~~~~~~~v~E~~~~g~~v~~v~a~D~D~g  185 (195)
                      +.|-|.+..|.+.|+.+...|..|++|.-.|=...
T Consensus         2 C~pGF~~~~~~~~Vp~~l~~g~~lg~V~f~dC~~~   36 (90)
T PF08758_consen    2 CRPGFSQKKYTFEVPSNLEAGQPLGKVNFEDCTGR   36 (90)
T ss_dssp             ---B--S-EEEE----SS-SS--EEE---B--SS-
T ss_pred             CcCCcccceEEEEcCchhhCCcEEEEEEeccCCCC
Confidence            46899999999999999999999999877765443


No 25 
>PF07495 Y_Y_Y:  Y_Y_Y domain;  InterPro: IPR011123 This region is mostly found at the end of the beta propellers (IPR011110 from INTERPRO) in a family of two component regulators. However they are also found tandemly repeated in Q891H4 from SWISSPROT without other signal conduction domains being present. It is named after the conserved tyrosines found in the alignment. The exact function is not known.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=71.63  E-value=18  Score=21.29  Aligned_cols=58  Identities=19%  Similarity=0.300  Sum_probs=30.2

Q ss_pred             CCceEEEEEEeeCCCCeecEEEeCCCCEEEEceeCCCCCEEEEEEEEEECCCC--ceeEEEEEEE
Q psy5884          82 VNAEITYSIYHVSNTGLHKFRIDPRSGVIETTGKLIAGEQYSITVQATDTGGK--SSQTIVEVSV  144 (195)
Q Consensus        82 ~n~~~~y~i~~~~~~~~~~F~id~~tG~i~~~~~ld~~~~~~l~v~a~D~~~~--~~~~~v~i~V  144 (195)
                      .+-...|.|.   |.......+...+-.+..  .....+.|.|.|+|.|..+.  ....++.|.|
T Consensus         6 ~~~~Y~Y~l~---g~d~~W~~~~~~~~~~~~--~~L~~G~Y~l~V~a~~~~~~~~~~~~~l~i~I   65 (66)
T PF07495_consen    6 ENIRYRYRLE---GFDDEWITLGSYSNSISY--TNLPPGKYTLEVRAKDNNGKWSSDEKSLTITI   65 (66)
T ss_dssp             TTEEEEEEEE---TTESSEEEESSTS-EEEE--ES--SEEEEEEEEEEETTS-B-SS-EEEEEEE
T ss_pred             CceEEEEEEE---CCCCeEEECCCCcEEEEE--EeCCCEEEEEEEEEECCCCCcCcccEEEEEEE
Confidence            3445667665   333344444332213222  22345689999999997662  2226666665


No 26 
>KOG3597|consensus
Probab=70.86  E-value=66  Score=27.47  Aligned_cols=58  Identities=16%  Similarity=0.153  Sum_probs=38.7

Q ss_pred             eeEEEEEEEEecCCCCCccCCC-CcEEEEcCCC-CceEEEEEEeeCCCCCCceEEEEEEee
Q psy5884          35 VPVSLTVNLNDINDNAPRLPMI-PPIQIQAGEA-RRQIVKIEATDNDLGVNAEITYSIYHV   93 (195)
Q Consensus        35 ~~~~v~I~V~DvNDn~P~f~~~-~~~~v~e~~~-g~~v~~v~a~D~D~~~n~~~~y~i~~~   93 (195)
                      .+....|.|.-+||.+..+-.. +.+.+.|+.. -.....+.+.|+|.++- .+.|+++..
T Consensus        24 ~~~~~~i~v~pvndpp~~~~~~~~~l~~~~~~~k~l~~~~l~~~d~d~~~~-~l~f~v~~t   83 (442)
T KOG3597|consen   24 QTDVLRIHVNPVNDPPSLIFPSGSLLVILEGGQKVLDPELLTAADPDSAPL-PLEFQVLGT   83 (442)
T ss_pred             EEeeecccccccCCCcceeecccceEEeecCCceeccceEeeccCCCCCcc-ceEEEEccC
Confidence            4667889999999988765444 3455566543 22335688888887543 378888753


No 27 
>TIGR00845 caca sodium/calcium exchanger 1. This model is specific for the eukaryotic sodium ion/calcium ion exchangers of the Caca family
Probab=66.97  E-value=1.1e+02  Score=28.72  Aligned_cols=118  Identities=13%  Similarity=0.150  Sum_probs=58.5

Q ss_pred             cCCCCCccCCC-CcEEEEcCCCCceEEEEEEeeCCCCCCceEEEEEEeeCCCCeecEEEeCCCCEEEEc----------e
Q psy5884          46 INDNAPRLPMI-PPIQIQAGEARRQIVKIEATDNDLGVNAEITYSIYHVSNTGLHKFRIDPRSGVIETT----------G  114 (195)
Q Consensus        46 vNDn~P~f~~~-~~~~v~e~~~g~~v~~v~a~D~D~~~n~~~~y~i~~~~~~~~~~F~id~~tG~i~~~----------~  114 (195)
                      .||.++.|.-. ....+.|+. |+.-..|.-...|.+....+.|+..+++.....-|.  +.+|.|...          .
T Consensus       395 ~dd~~s~i~Fe~~~Y~V~En~-GtV~VtV~R~GGdl~~tVsVdY~T~DGTA~AG~DY~--~~sGTLtF~PGEt~KtItV~  471 (928)
T TIGR00845       395 ENDPVSKIFFEPGHYTCLENC-GTVALTVVRRGGDLTNTVYVDYRTEDGTANAGSDYE--FTEGTLVFKPGETQKEFRIG  471 (928)
T ss_pred             ccCCcceEEecCCeEEEeecC-cEEEEEEEEccCCCCceEEEEEEccCCccCCCCCcc--ccCceEEECCCceEEEEEEE
Confidence            44555554333 346777764 666666655544554455688887654333222232  223443222          1


Q ss_pred             eCC---CC--CEEEEEEEEEECC---------------CCceeEEEEEEEEcCCCCCCCeEecCeeEEEEecCC
Q psy5884         115 KLI---AG--EQYSITVQATDTG---------------GKSSQTIVEVSVVPGPNTRSPVFQQNTYDVNVSEGA  168 (195)
Q Consensus       115 ~ld---~~--~~~~l~v~a~D~~---------------~~~~~~~v~i~V~~d~nd~~P~f~~~~~~~~v~E~~  168 (195)
                      -+|   .|  +.|.+.+.--..+               ......+.+|+|. | ||++|.|.-..-...|.|+.
T Consensus       472 IIDDdi~E~DE~F~V~LSNp~~g~~~G~~~~~~~~~~A~Lg~ps~ATVTIl-D-DD~aGIfsFe~~~~sV~Es~  543 (928)
T TIGR00845       472 IIDDDIFEEDEHFYVRLSNLRVGSEDGILEANHVSAVAQLASPNTATVTIL-D-DDHAGIFTFEEDVFHVSESI  543 (928)
T ss_pred             EccCCCCCCCceEEEEEeCCCCCCcccccccccccccceecCCceEEEEEe-c-CcccCcccccCceEEEEcCC
Confidence            122   23  3454444321111               0112245567777 4 67888766544456677763


No 28 
>PF12245 Big_3_2:  Bacterial Ig-like domain (group 3);  InterPro: IPR022038  This family of proteins is found in bacteria. They have two conserved sequence motifs: AGN and GMT. 
Probab=64.92  E-value=27  Score=20.64  Aligned_cols=27  Identities=33%  Similarity=0.639  Sum_probs=21.1

Q ss_pred             CCEEEEEEEEEECCCCceeEEEEEEEE
Q psy5884         119 GEQYSITVQATDTGGKSSQTIVEVSVV  145 (195)
Q Consensus       119 ~~~~~l~v~a~D~~~~~~~~~v~i~V~  145 (195)
                      ...|.+.+.|.|..+..........+.
T Consensus        22 dg~yt~~v~a~D~AGN~~~~~~~~~i~   48 (60)
T PF12245_consen   22 DGEYTLTVTATDKAGNTSSSTTQIVIV   48 (60)
T ss_pred             CccEEEEEEEEECCCCEEEeeeEEEEE
Confidence            457999999999988766666666665


No 29 
>PF02010 REJ:  REJ domain;  InterPro: IPR002859 The REJ (Receptor for Egg Jelly) domain is found in PKD1 P98161 from SWISSPROT and the sperm receptor for egg jelly Q26627 from SWISSPROT. The exact function of this domain is unknown. The domain is 600 amino acids long so is probably composed of multiple structural domains. There are six completely conserved cysteine residues that may form disulphide bridges. This region contains tandem PKD-like domains. Sequence similarity between a region of the autosomal dominant polycystic kidney disease (ADPKD) protein, polycystin-1 and a sea urchin sperm glycoprotein involved in fertilization, the receptor for egg jelly (suREJ) has been known for some time. The suREJ protein binds the glycoprotein coat of the egg (egg jelly), triggering the acrosome reaction, which transforms the sperm into a fusogenic cell. The sequence similarity and expression pattern suggests that the predicted human PKDREJ protein is a mammalian equivalent of the suREJ protein and therefore may have a central role in human fertilization [].; PDB: 2E7M_A 2YRL_A.
Probab=64.04  E-value=16  Score=30.67  Aligned_cols=131  Identities=17%  Similarity=0.199  Sum_probs=17.4

Q ss_pred             CeEEEEEEEEECCCCCeEeeEEEEEEEEecCCCCCccCCCCcEEEEcCCCCceEEEE---EEeeCCCCC-CceEEEEEEe
Q psy5884          17 GKFRFQVIAREKQGTAASVPVSLTVNLNDINDNAPRLPMIPPIQIQAGEARRQIVKI---EATDNDLGV-NAEITYSIYH   92 (195)
Q Consensus        17 ~~~~l~v~a~D~~~~~~~~~~~v~I~V~DvNDn~P~f~~~~~~~v~e~~~g~~v~~v---~a~D~D~~~-n~~~~y~i~~   92 (195)
                      ..|.+.+.++-..++.......+.|.|.-.. -.|...-.....+..   + ....+   .-.|+|... +..++|....
T Consensus        50 G~y~~~~~Vt~~~~~~~~~~~~~~v~V~~s~-l~~~I~gG~~~~~~~---~-~~i~ldgs~S~Dpd~~~~~~~l~y~W~C  124 (440)
T PF02010_consen   50 GDYTFTLTVTASSNPGLSSTDSVTVTVEPSP-LVAVIKGGSSRTVGY---N-SDITLDGSQSYDPDGPPGDSGLTYSWSC  124 (440)
T ss_dssp             CEEEEEEEEE--BCTTEEEEEEEEEEEE----------------------------------------------------
T ss_pred             CCEEEEEEEEEECCCCceEEEEEEEEEeecc-ceeEEcCCccceeec---C-ceEEEeeEEEecccccccCCceEEEEEE
Confidence            3455655555233346778888888887522 122221111111111   1 11122   235777642 2446776553


Q ss_pred             eCCCCe-e-------cEEEeCCCCEEEEce-eCCCCCEEEEEEEEEECCCCceeEEEEEEEEcCCCCCCCeE
Q psy5884          93 VSNTGL-H-------KFRIDPRSGVIETTG-KLIAGEQYSITVQATDTGGKSSQTIVEVSVVPGPNTRSPVF  155 (195)
Q Consensus        93 ~~~~~~-~-------~F~id~~tG~i~~~~-~ld~~~~~~l~v~a~D~~~~~~~~~v~i~V~~d~nd~~P~f  155 (195)
                      ...... .       ........+.+.+.. .|.....|.|++.+++.+..+..+...|.|.   ...+|..
T Consensus       125 ~~~~~~~~C~~~~~~~~~~~~~~~~l~i~~~~l~~~~~y~f~ltv~k~~r~s~s~~~~v~v~---~~~~p~v  193 (440)
T PF02010_consen  125 TDLSSNSACSTPSTNITLLNSSSSSLTIPASTLSPGSTYTFTLTVSKGSRSSSSASQTVTVV---SGDPPTV  193 (440)
T ss_dssp             ------------------------------------------------------------------------
T ss_pred             eCCceeccccccccccccCCCCCEEEEEEhHHcCCCceEEEEEEEEeCCCCceeeEEEEEec---cCCCCce
Confidence            222211 0       111123334454433 3444445999999999776577777777776   2345544


No 30 
>TIGR00845 caca sodium/calcium exchanger 1. This model is specific for the eukaryotic sodium ion/calcium ion exchangers of the Caca family
Probab=63.90  E-value=1.3e+02  Score=28.34  Aligned_cols=56  Identities=20%  Similarity=0.209  Sum_probs=29.8

Q ss_pred             eeEEEEEEEEecCCCCCccCCCC-cEEEEcCCCCceEEEEEEeeCCCCCCceEEEEEEee
Q psy5884          35 VPVSLTVNLNDINDNAPRLPMIP-PIQIQAGEARRQIVKIEATDNDLGVNAEITYSIYHV   93 (195)
Q Consensus        35 ~~~~v~I~V~DvNDn~P~f~~~~-~~~v~e~~~g~~v~~v~a~D~D~~~n~~~~y~i~~~   93 (195)
                      ...+.+|+|.| ||++|.|.-.. ...+.|+. |..-.+|.-+-.-.| .-.+.|...++
T Consensus       513 ~ps~ATVTIlD-DD~aGIfsFe~~~~sV~Es~-G~vtvtV~RtsGa~G-~VtV~Y~T~dG  569 (928)
T TIGR00845       513 SPNTATVTILD-DDHAGIFTFEEDVFHVSESI-GIMEVKVLRTSGARG-TVIVPYRTVEG  569 (928)
T ss_pred             CCceEEEEEec-CcccCcccccCceEEEEcCC-CEEEEEEEEcCCCCe-eEEEEEEeecC
Confidence            33466777787 78899876543 46777764 443333322211111 22366766543


No 31 
>PF13754 Big_3_4:  Bacterial Ig-like domain (group 3)
Probab=59.87  E-value=31  Score=19.80  Aligned_cols=27  Identities=41%  Similarity=0.623  Sum_probs=18.7

Q ss_pred             CCCEEEEEEEEEECCCCceeE-EEEEEE
Q psy5884         118 AGEQYSITVQATDTGGKSSQT-IVEVSV  144 (195)
Q Consensus       118 ~~~~~~l~v~a~D~~~~~~~~-~v~i~V  144 (195)
                      .+..|.+.+.|+|..+..+.. ...+.|
T Consensus        22 ~dG~y~itv~a~D~AGN~s~~~~~~~ti   49 (54)
T PF13754_consen   22 ADGTYTITVTATDAAGNTSTSSSVTFTI   49 (54)
T ss_pred             CCccEEEEEEEEeCCCCCCCccceeEEE
Confidence            477899999999987744333 444544


No 32 
>TIGR03660 T1SS_rpt_143 T1SS-143 repeat domain. This model represents a domain of about 143 amino acids that may occur singly or in up to 23 tandem repeats in very large proteins in the genus Vibrio, and in related species such as Legionella pneumophila, Photobacterium profundum, Rhodopseudomonas palustris, Shewanella pealeana, and Aeromonas hydrophila. Proteins with these domains represent a subset of a broader set of proteins with a particular signal for type 1 secretion, consisting of several glycine-rich repeats modeled by pfam00353, followed by a C-terminal domain modeled by TIGR03661. Proteins with this domain tend to share several properties with the RtxA (Repeats in Toxin) protein of Vibrio cholerae, including a large size often containing tandemly repeated domains and a C-terminal signal for type 1 secretion.
Probab=59.44  E-value=60  Score=22.89  Aligned_cols=54  Identities=22%  Similarity=0.319  Sum_probs=35.0

Q ss_pred             EEEceeCCCC-----CEEEEEEEEEECCCCceeEEEEEEEEcCCCCCCCeEecCeeEEEEecCC
Q psy5884         110 IETTGKLIAG-----EQYSITVQATDTGGKSSQTIVEVSVVPGPNTRSPVFQQNTYDVNVSEGA  168 (195)
Q Consensus       110 i~~~~~ld~~-----~~~~l~v~a~D~~~~~~~~~v~i~V~~d~nd~~P~f~~~~~~~~v~E~~  168 (195)
                      ..+.++||+.     -...|.|.|+|..+-.+...+.|+|. |  | .|...... ...|.|+.
T Consensus        70 ftL~~~lDH~~g~d~l~l~~~v~a~D~DGD~s~~~l~VtI~-D--D-~P~~~~~~-~~~V~E~~  128 (137)
T TIGR03660        70 FTLEGPLDHAAGSDELTLNFPIIATDFDGDTSSITLPVTIV-D--D-VPTITDVD-ALTVDEDD  128 (137)
T ss_pred             EEEcccccCCCCCceEEEeeeEEEEeCCCCccccEEEEEEE-C--C-CCeecccc-ceEEeccc
Confidence            4455566542     25788999999877555678889888 4  3 36654433 36777753


No 33 
>smart00089 PKD Repeats in polycystic kidney disease 1 (PKD1) and other proteins. Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.
Probab=55.50  E-value=41  Score=20.50  Aligned_cols=25  Identities=24%  Similarity=0.502  Sum_probs=17.3

Q ss_pred             CCeEEEEEEEEECCCCCeEeeEEEEEEE
Q psy5884          16 PGKFRFQVIAREKQGTAASVPVSLTVNL   43 (195)
Q Consensus        16 ~~~~~l~v~a~D~~~~~~~~~~~v~I~V   43 (195)
                      ...|.+.+.+.|..+   +.+.++.|.|
T Consensus        54 ~G~y~v~l~v~n~~g---~~~~~~~i~v   78 (79)
T smart00089       54 PGTYTVTLTVTNAVG---SASATVTVVV   78 (79)
T ss_pred             CcEEEEEEEEEcCCC---cEEEEEEEEE
Confidence            567889999988866   4455555544


No 34 
>PF13750 Big_3_3:  Bacterial Ig-like domain (group 3)
Probab=53.21  E-value=83  Score=22.66  Aligned_cols=37  Identities=27%  Similarity=0.458  Sum_probs=24.3

Q ss_pred             CCEEEEEE-EEEECCCCceeEEEEEEEEcCCCCCCCeEec
Q psy5884         119 GEQYSITV-QATDTGGKSSQTIVEVSVVPGPNTRSPVFQQ  157 (195)
Q Consensus       119 ~~~~~l~v-~a~D~~~~~~~~~v~i~V~~d~nd~~P~f~~  157 (195)
                      +..|.+.+ .|.|..+...+..+..++.  +..-+|....
T Consensus        14 dG~Y~l~~~~a~D~agN~~~~~~~~~~~--iD~T~Ptisi   51 (158)
T PF13750_consen   14 DGSYTLTVVTATDAAGNTSTSTVSETFT--IDNTPPTISI   51 (158)
T ss_pred             CccEEEEEEEEEecCCCEEEEEEeeEEE--EcCCCCEEEE
Confidence            45789998 7999877666665554444  2346777754


No 35 
>cd00146 PKD polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.
Probab=51.75  E-value=41  Score=20.58  Aligned_cols=15  Identities=20%  Similarity=0.578  Sum_probs=12.0

Q ss_pred             CCeEEEEEEEEECCC
Q psy5884          16 PGKFRFQVIAREKQG   30 (195)
Q Consensus        16 ~~~~~l~v~a~D~~~   30 (195)
                      ...|.+.+.++|..+
T Consensus        56 ~G~y~v~l~v~d~~g   70 (81)
T cd00146          56 PGTYTVTLTVTNAVG   70 (81)
T ss_pred             CcEEEEEEEEEeCCC
Confidence            567889999998854


No 36 
>cd00146 PKD polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.
Probab=51.36  E-value=55  Score=20.01  Aligned_cols=31  Identities=26%  Similarity=0.328  Sum_probs=22.1

Q ss_pred             eeCCCCCEEEEEEEEEECCCCceeEEEEEEE
Q psy5884         114 GKLIAGEQYSITVQATDTGGKSSQTIVEVSV  144 (195)
Q Consensus       114 ~~ld~~~~~~l~v~a~D~~~~~~~~~v~i~V  144 (195)
                      ..+.....|.+++.++|..+.+...++.|.|
T Consensus        51 ~~y~~~G~y~v~l~v~d~~g~~~~~~~~V~V   81 (81)
T cd00146          51 HTYTKPGTYTVTLTVTNAVGSSSTKTTTVVV   81 (81)
T ss_pred             EEcCCCcEEEEEEEEEeCCCCEEEEEEEEEC
Confidence            3445567899999999986666655666543


No 37 
>KOG4221|consensus
Probab=49.55  E-value=2.5e+02  Score=27.53  Aligned_cols=105  Identities=15%  Similarity=0.133  Sum_probs=55.2

Q ss_pred             CCeEEEEEEEEECCCCCeEeeEEEEEEEEecCCCCCc-cC----CCCcEEEEcCCC--CceEEEEEEeeCCCCCCceEEE
Q psy5884          16 PGKFRFQVIAREKQGTAASVPVSLTVNLNDINDNAPR-LP----MIPPIQIQAGEA--RRQIVKIEATDNDLGVNAEITY   88 (195)
Q Consensus        16 ~~~~~l~v~a~D~~~~~~~~~~~v~I~V~DvNDn~P~-f~----~~~~~~v~e~~~--g~~v~~v~a~D~D~~~n~~~~y   88 (195)
                      ...|.+.+.|....|.+.++..--.++..|+=+.||. ..    .+..+.|..+.+  .+..+++.            -|
T Consensus       584 ~TeY~~~vvA~N~~G~g~sS~~i~V~Tlsd~PsaPP~Nl~lev~sStsVrVsW~pP~~~t~ng~it------------gY  651 (1381)
T KOG4221|consen  584 YTEYSIRVVAYNSAGSGVSSADITVRTLSDVPSAPPQNLSLEVVSSTSVRVSWLPPPSETQNGQIT------------GY  651 (1381)
T ss_pred             ccceEEEEEEecCCCCCCCCCceEEEeccCCCCCCCcceEEEecCCCeEEEEccCCCcccccceEE------------EE
Confidence            3479999999999887666555445556666666665 21    112355554443  34333332            23


Q ss_pred             EEEee-CCCCeecEEEe-CCCCEEEEceeCCCCCEEEEEEEEEECC
Q psy5884          89 SIYHV-SNTGLHKFRID-PRSGVIETTGKLIAGEQYSITVQATDTG  132 (195)
Q Consensus        89 ~i~~~-~~~~~~~F~id-~~tG~i~~~~~ld~~~~~~l~v~a~D~~  132 (195)
                      .|... .+.....++.- ..+-.-.+...|++..+|.|.|.|....
T Consensus       652 kIRy~~~~~~~~~~~t~v~~n~~~~l~~~Lep~T~Y~vrIsa~t~n  697 (1381)
T KOG4221|consen  652 KIRYRKLSREDEVNETVVKGNTTQYLFNGLEPNTQYRVRISAMTVN  697 (1381)
T ss_pred             EEEecccCcccccceeecccchhhhHhhcCCCCceEEEEEEEeccC
Confidence            33321 11122222221 1111112233467778899999998753


No 38 
>PF02494 HYR:  HYR domain;  InterPro: IPR003410 This domain is known as the HYR (Hyalin Repeat) domain, after the protein hyalin that is composed exclusively of this repeat. This domain probably corresponds to a new superfamily in the immunoglobulin fold. The function of this domain is uncertain it may be involved in cell adhesion. In the Sushi repeat-containing protein (SrpX), this domain is found between two sushi repeats.
Probab=43.92  E-value=57  Score=20.20  Aligned_cols=24  Identities=33%  Similarity=0.439  Sum_probs=20.1

Q ss_pred             EEEEEEEEEECCCCceeEEEEEEE
Q psy5884         121 QYSITVQATDTGGKSSQTIVEVSV  144 (195)
Q Consensus       121 ~~~l~v~a~D~~~~~~~~~v~i~V  144 (195)
                      .+.+...|+|..+..+++++.|+|
T Consensus        58 ~t~V~ytA~D~~GN~a~C~f~V~V   81 (81)
T PF02494_consen   58 TTTVTYTATDAAGNSATCSFTVTV   81 (81)
T ss_pred             eEEEEEEEEECCCCEEEEEEEEEC
Confidence            588999999988888888887765


No 39 
>PF12245 Big_3_2:  Bacterial Ig-like domain (group 3);  InterPro: IPR022038  This family of proteins is found in bacteria. They have two conserved sequence motifs: AGN and GMT. 
Probab=36.88  E-value=90  Score=18.32  Aligned_cols=30  Identities=20%  Similarity=0.412  Sum_probs=19.7

Q ss_pred             CCeEEEEEEEEECCCCCeEeeEEEEEEEEecC
Q psy5884          16 PGKFRFQVIAREKQGTAASVPVSLTVNLNDIN   47 (195)
Q Consensus        16 ~~~~~l~v~a~D~~~~~~~~~~~v~I~V~DvN   47 (195)
                      ...|.+.+.|+|..|...  .....+.+.|..
T Consensus        22 dg~yt~~v~a~D~AGN~~--~~~~~~~i~d~~   51 (60)
T PF12245_consen   22 DGEYTLTVTATDKAGNTS--SSTTQIVIVDNT   51 (60)
T ss_pred             CccEEEEEEEEECCCCEE--EeeeEEEEEcCC
Confidence            347999999999988633  334444444443


No 40 
>PF15418 DUF4625:  Domain of unknown function (DUF4625)
Probab=34.56  E-value=1.1e+02  Score=21.48  Aligned_cols=26  Identities=15%  Similarity=0.419  Sum_probs=20.4

Q ss_pred             CCEEEEEEEEEECCCCceeEEEEEEE
Q psy5884         119 GEQYSITVQATDTGGKSSQTIVEVSV  144 (195)
Q Consensus       119 ~~~~~l~v~a~D~~~~~~~~~v~i~V  144 (195)
                      ...|.|.+.++|..+........|.|
T Consensus       106 ~G~YH~~i~VtD~~Gn~~~~~~~i~I  131 (132)
T PF15418_consen  106 AGDYHFMITVTDAAGNQTEEERSIKI  131 (132)
T ss_pred             CcceEEEEEEEECCCCEEEEEEEEEE
Confidence            45799999999998877766666655


No 41 
>KOG3597|consensus
Probab=33.74  E-value=1.3e+02  Score=25.71  Aligned_cols=50  Identities=12%  Similarity=0.062  Sum_probs=36.0

Q ss_pred             eeEEEEEEEEcCCCCCCCeEecCeeEEEEecCCCCCeEEEEEEEEeCCCCC
Q psy5884         136 SQTIVEVSVVPGPNTRSPVFQQNTYDVNVSEGASINSTVATITVSRLASGC  186 (195)
Q Consensus       136 ~~~~v~i~V~~d~nd~~P~f~~~~~~~~v~E~~~~g~~v~~v~a~D~D~g~  186 (195)
                      .+....|.|. .+|+.+..+....+.+-+.|+..--.--..+.+.|+|.++
T Consensus        24 ~~~~~~i~v~-pvndpp~~~~~~~~~l~~~~~~~k~l~~~~l~~~d~d~~~   73 (442)
T KOG3597|consen   24 QTDVLRIHVN-PVNDPPSLIFPSGSLLVILEGGQKVLDPELLTAADPDSAP   73 (442)
T ss_pred             EEeeeccccc-ccCCCcceeecccceEEeecCCceeccceEeeccCCCCCc
Confidence            4556678887 7898888777777778888876543344577888888874


No 42 
>PF05895 DUF859:  Siphovirus protein of unknown function (DUF859);  InterPro: IPR008577 This entry is represented by Streptococcus phage 7201, Orf39. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This family consists of several uncharacterised proteins from a number of the Siphoviruses as well as some bacterial proteins from Streptococcus species. Some of the members of this family are described as putative minor structural proteins.
Probab=31.56  E-value=3.9e+02  Score=24.12  Aligned_cols=107  Identities=17%  Similarity=0.216  Sum_probs=58.2

Q ss_pred             CCeEEEEEEEEECCCCCeEeeEEEEEEEEecCCCCCccCCCCcEEEEcCCC-Cce-EEEEEEeeC------CCCCCceEE
Q psy5884          16 PGKFRFQVIAREKQGTAASVPVSLTVNLNDINDNAPRLPMIPPIQIQAGEA-RRQ-IVKIEATDN------DLGVNAEIT   87 (195)
Q Consensus        16 ~~~~~l~v~a~D~~~~~~~~~~~v~I~V~DvNDn~P~f~~~~~~~v~e~~~-g~~-v~~v~a~D~------D~~~n~~~~   87 (195)
                      ...+.+.+.++|..|. .+.+.+..|.|++=+  +|.+.    +.+.-... +.. .....|.=+      -.-....++
T Consensus       297 ~G~~Ti~atVtDSRGr-~S~~~~~tItVl~Y~--~P~ls----fsv~R~~~~~~~~~v~~~a~Iapl~v~g~qKN~~~lt  369 (624)
T PF05895_consen  297 SGSATIRATVTDSRGR-TSDPKTKTITVLEYS--PPTLS----FSVYRCGSSGNTLTVTRNAKIAPLTVNGVQKNTMTLT  369 (624)
T ss_pred             CceEEEEEEEEECCCc-cCCceEEEEEEEEcC--CCcEE----EEEEEeCCCCcEEEEEEEEEEeEEEEcccccceEEEE
Confidence            4578899999998773 345677888888764  66652    22221111 111 111122111      111122466


Q ss_pred             EEEEeeCCCCeecEEEeCCC-------------CEEEEceeCCCCCEEEEEEEEEECC
Q psy5884          88 YSIYHVSNTGLHKFRIDPRS-------------GVIETTGKLIAGEQYSITVQATDTG  132 (195)
Q Consensus        88 y~i~~~~~~~~~~F~id~~t-------------G~i~~~~~ld~~~~~~l~v~a~D~~  132 (195)
                      |+...   .+...|.+|...             +...+...++.+..|.+.+.++|.-
T Consensus       370 ~~~a~---~gt~~~t~d~~~a~~~~s~~s~~~~~~~~L~g~y~~~kSy~V~~~l~D~F  424 (624)
T PF05895_consen  370 FKVAP---LGTGTFTTDNGSASGTWSSISELTNSSANLGGTYDAEKSYDVRGTLSDKF  424 (624)
T ss_pred             EEEEE---cCcceEEEEccccccceeeeeeecccceeeccccCCCceEEEEEEEEEEe
Confidence            66543   234556654321             1233445567788999999999953


No 43 
>PF09100 Qn_am_d_aIV:  Quinohemoprotein amine dehydrogenase, alpha subunit domain IV;  InterPro: IPR015184 This domain is predominantly found in the prokaryotic protein quinohemoprotein amine dehydrogenase, adopting an immunoglobulin-like beta-sandwich fold, with seven strands arranged into two beta sheets; the fold is possibly related to the immunoglobulin and/or fibronectin type III superfamilies. The precise function of this domain has not, as yet, been defined []. ; PDB: 1JMZ_A 1JMX_A 1PBY_A 1JJU_A.
Probab=28.68  E-value=1.8e+02  Score=20.30  Aligned_cols=31  Identities=23%  Similarity=0.283  Sum_probs=15.7

Q ss_pred             eEEEEEEEEECC-CCCeEeeEEEEEEEEecCC
Q psy5884          18 KFRFQVIAREKQ-GTAASVPVSLTVNLNDIND   48 (195)
Q Consensus        18 ~~~l~v~a~D~~-~~~~~~~~~v~I~V~DvND   48 (195)
                      .=.|.|.|+=.. +.++++...+.|+|..-|+
T Consensus        99 ~Gnl~VvAtv~d~~~~l~~e~~liVtVqr~~~  130 (133)
T PF09100_consen   99 AGNLKVVATVKDGGKPLTGEAHLIVTVQRWNN  130 (133)
T ss_dssp             -EEEEEEEEETTTT---EEEEEEEEE---S--
T ss_pred             cccEEEEEEEccCCcccceeEeEEEEeecccC
Confidence            345566665433 4589999999999998886


No 44 
>cd02848 Chitinase_N_term Chitinase N-terminus domain. Chitinases hydrolyze the abundant natural biopolymer chitin, producing smaller chito-oligosaccharides. Chitin consists of multiple N-acetyl-D-glucosamine (NAG) residues connected via beta-1,4-glycosidic linkages and is an important structural element of fungal cell wall and arthropod exoskeletons. On the basis of the mode of chitin hydrolysis, chitinases are classified as random, endo-, and exo-chitinases and based on sequence criteria, chitinases belong to families 18 and 19 of glycosyl hydrolases.  The N-terminus of chitinase may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at  either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitob
Probab=28.21  E-value=88  Score=21.05  Aligned_cols=28  Identities=18%  Similarity=0.328  Sum_probs=17.7

Q ss_pred             CCCCCCCCCeEEEEEEEEECCCCCeEeeEE
Q psy5884           9 LDRDPPNPGKFRFQVIAREKQGTAASVPVS   38 (195)
Q Consensus         9 LD~E~~~~~~~~l~v~a~D~~~~~~~~~~~   38 (195)
                      +++.+  ...|.++|.++|..+=..|+...
T Consensus        74 ~~v~k--gG~y~m~V~lCn~dGCS~S~~~~  101 (106)
T cd02848          74 FKVGK--GGRYQMQVALCNGDGCSTSAAKE  101 (106)
T ss_pred             EEeCC--CCeEEEEEEEECCCCccCcCCEE
Confidence            34444  56788899988887644444333


No 45 
>PF08329 ChitinaseA_N:  Chitinase A, N-terminal domain;  InterPro: IPR013540 This domain is found in a number of bacterial chitinases and similar viral proteins. It is organised into a fibronectin III module domain-like fold, comprising only beta strands. Its function is not known, but it may be involved in interaction with the enzyme substrate, chitin [, ]. It is separated by a hinge region from the catalytic domain (IPR001223 from INTERPRO); this hinge region is probably mobile, allowing the N-terminal domain to have different relative positions in solution []. ; GO: 0004568 chitinase activity; PDB: 2WLY_A 1EDQ_A 2WM0_A 1X6N_A 1NH6_A 2WK2_A 1EHN_A 2WLZ_A 1EIB_A 1FFR_A ....
Probab=26.14  E-value=1.2e+02  Score=21.32  Aligned_cols=28  Identities=32%  Similarity=0.504  Sum_probs=20.7

Q ss_pred             CCCEEEEEEEEEECCCCceeEEEEEEEE
Q psy5884         118 AGEQYSITVQATDTGGKSSQTIVEVSVV  145 (195)
Q Consensus       118 ~~~~~~l~v~a~D~~~~~~~~~v~i~V~  145 (195)
                      ....|.+.|++.|..+-+.++.+.|.|.
T Consensus        81 ~gG~y~~~VeLCN~~GCS~S~~~~V~Va  108 (133)
T PF08329_consen   81 KGGRYQMQVELCNADGCSTSAPVEVVVA  108 (133)
T ss_dssp             S-EEEEEEEEEEETTEEEE---EEEEEE
T ss_pred             CCCEEEEEEEEECCCCcccCCCEEEEEe
Confidence            3458999999999888777778889888


No 46 
>KOG4221|consensus
Probab=26.12  E-value=6.5e+02  Score=24.92  Aligned_cols=68  Identities=18%  Similarity=0.269  Sum_probs=44.0

Q ss_pred             EEEEEEeeCCCCeecEEEeCCCCEEEEceeCCCCCEEEEEEEEEECCCC-ceeEEEEEEEEcCCCCCCCe
Q psy5884          86 ITYSIYHVSNTGLHKFRIDPRSGVIETTGKLIAGEQYSITVQATDTGGK-SSQTIVEVSVVPGPNTRSPV  154 (195)
Q Consensus        86 ~~y~i~~~~~~~~~~F~id~~tG~i~~~~~ld~~~~~~l~v~a~D~~~~-~~~~~v~i~V~~d~nd~~P~  154 (195)
                      ..|.+....++....+.++.++-+.++. .|+.+..|.+.|.|.+..+. .+++.+.|.-..|+-+.||.
T Consensus       552 ~~yk~~ys~~~~~~~~~~~~n~~e~ti~-gL~k~TeY~~~vvA~N~~G~g~sS~~i~V~Tlsd~PsaPP~  620 (1381)
T KOG4221|consen  552 TGYKLFYSEDDTGKELRVENNATEYTIN-GLEKYTEYSIRVVAYNSAGSGVSSADITVRTLSDVPSAPPQ  620 (1381)
T ss_pred             eEEEEEEEcCCCCceEEEecCccEEEee-cCCCccceEEEEEEecCCCCCCCCCceEEEeccCCCCCCCc
Confidence            4566554334455678887755555443 66777789999999998763 33455555444477788887


No 47 
>cd01758 PLAT_LPL PLAT/ LH2 domain present in lipoprotein lipase (LPL).  LPL is a key enzyme in catabolism of plasma lipoprotein triglycerides (TGs) and has therefeore has a profound influence on triglyceride and high-density lipoprotein (HDL) cholesterol levels in the blood. In general, PLAT/LH2 domain's proposed function is to mediate interaction with lipids or membrane bound proteins.
Probab=24.84  E-value=2.5e+02  Score=19.70  Aligned_cols=25  Identities=8%  Similarity=0.297  Sum_probs=14.9

Q ss_pred             EEEEEEEEEECCCCc-eeEEEEEEEE
Q psy5884         121 QYSITVQATDTGGKS-SQTIVEVSVV  145 (195)
Q Consensus       121 ~~~l~v~a~D~~~~~-~~~~v~i~V~  145 (195)
                      +|.++|......... ..+++.|.+.
T Consensus         2 hYqVtV~~~~~~~~~~t~~~v~i~L~   27 (137)
T cd01758           2 HYQLKIHFFNQTNRIETDPTFTISLY   27 (137)
T ss_pred             eEEEEEEEecccCCCcccceEEEEEE
Confidence            477777775543322 4566777776


Done!