Query         psy585
Match_columns 111
No_of_seqs    237 out of 1299
Neff          8.6 
Searched_HMMs 46136
Date          Fri Aug 16 19:39:54 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy585.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/585hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG4289|consensus               99.9   1E-21 2.2E-26  159.2  12.5  110    1-110   749-912 (2531)
  2 KOG4289|consensus               99.8 6.5E-21 1.4E-25  154.6   9.0  110    1-110   235-397 (2531)
  3 KOG1219|consensus               99.8 5.7E-20 1.2E-24  153.2  11.2  111    1-111  2145-2305(4289)
  4 KOG1219|consensus               99.8 5.3E-19 1.1E-23  147.6  11.1  110    1-110  2648-2804(4289)
  5 cd00031 CA Cadherin repeat dom  99.3 4.2E-11 9.1E-16   80.7  10.8   47    1-47     68-117 (199)
  6 cd00031 CA Cadherin repeat dom  99.3   1E-10 2.2E-15   78.9  12.1  105    4-110    18-125 (199)
  7 KOG1834|consensus               98.8 4.2E-08 9.1E-13   76.7   9.5   78    1-78    109-243 (952)
  8 smart00112 CA Cadherin repeats  98.6 5.4E-07 1.2E-11   52.6   7.9   66   18-85     11-79  (79)
  9 KOG1834|consensus               98.5 3.9E-07 8.5E-12   71.4   7.7   53   51-103   104-165 (952)
 10 smart00112 CA Cadherin repeats  98.5 3.5E-07 7.6E-12   53.4   4.7   30    1-30     47-79  (79)
 11 PF00028 Cadherin:  Cadherin do  97.6 0.00023 4.9E-09   42.5   5.4   28   51-78     62-93  (93)
 12 TIGR01965 VCBS_repeat VCBS rep  92.6    0.74 1.6E-05   28.3   5.9   42   53-99     56-97  (99)
 13 TIGR03660 T1SS_rpt_143 T1SS-14  92.0     2.3   5E-05   27.6   8.2   66   35-104    66-133 (137)
 14 smart00736 CADG Dystroglycan-t  89.2     1.3 2.8E-05   26.7   4.7   30   53-82     66-96  (97)
 15 PF12245 Big_3_2:  Bacterial Ig  84.3     3.9 8.5E-05   22.5   4.5   26    1-26     25-51  (60)
 16 PF13750 Big_3_3:  Bacterial Ig  82.7      11 0.00025   24.9   9.7   27   52-78    121-148 (158)
 17 PF07495 Y_Y_Y:  Y_Y_Y domain;   80.1       7 0.00015   21.2   4.6   26   53-78     38-66  (66)
 18 TIGR03660 T1SS_rpt_143 T1SS-14  78.6     9.2  0.0002   24.9   5.4   40    2-45     88-128 (137)
 19 PF08758 Cadherin_pro:  Cadheri  75.0      11 0.00025   22.5   4.7   17   29-45      3-19  (90)
 20 smart00089 PKD Repeats in poly  67.2      17 0.00036   20.4   4.2   26   52-77     53-78  (79)
 21 TIGR01965 VCBS_repeat VCBS rep  65.7      18 0.00039   22.2   4.2   38    2-44     60-97  (99)
 22 cd00146 PKD polycystic kidney   60.9      21 0.00045   20.1   3.8   26   52-77     55-81  (81)
 23 PF13860 FlgD_ig:  FlgD Ig-like  59.1      33 0.00071   19.7   5.7   51   15-66     24-81  (81)
 24 PF10365 DUF2436:  Domain of un  56.2      24 0.00052   23.2   3.7   23   26-48     66-88  (161)
 25 PF13754 Big_3_4:  Bacterial Ig  53.7      20 0.00044   19.1   2.7   10    1-10     26-35  (54)
 26 PF03160 Calx-beta:  Calx-beta   52.8      24 0.00051   20.9   3.2   25   74-100     2-26  (100)
 27 PF02494 HYR:  HYR domain;  Int  52.1      43 0.00093   19.0   5.4   32   46-77     49-81  (81)
 28 TIGR00864 PCC polycystin catio  47.1 2.9E+02  0.0062   27.1  10.1   28   52-79   1566-1593(2740)
 29 TIGR00845 caca sodium/calcium   46.6 1.9E+02  0.0042   24.9  10.5   28   72-100   516-543 (928)
 30 PF07145 PAM2:  Ataxin-2 C-term  42.9      11 0.00024   15.8   0.4   12   80-91      6-17  (18)
 31 TIGR00845 caca sodium/calcium   42.2 1.6E+02  0.0035   25.4   7.3   29   16-45    515-543 (928)
 32 cd05762 Ig8_MLCK Eighth immuno  39.2      82  0.0018   18.6   4.5   29   54-85     69-97  (98)
 33 cd01757 PLAT_RAB6IP1 PLAT/LH2   37.2 1.1E+02  0.0023   19.2   6.8   42   55-96      2-47  (114)
 34 PF08329 ChitinaseA_N:  Chitina  35.5      84  0.0018   20.4   3.8   30   53-82     82-112 (133)
 35 PF15418 DUF4625:  Domain of un  31.7 1.4E+02  0.0031   19.1   4.6   25   53-77    106-131 (132)
 36 PF09100 Qn_am_d_aIV:  Quinohem  31.2 1.2E+02  0.0025   19.7   3.9   32   54-86    100-133 (133)
 37 KOG4221|consensus               31.0 1.7E+02  0.0037   26.2   5.8   59   28-86    542-620 (1381)
 38 PF05688 DUF824:  Salmonella re  27.7      88  0.0019   16.5   2.5   16   69-84     13-28  (47)
 39 PF05345 He_PIG:  Putative Ig d  26.0      91   0.002   16.2   2.4   13   54-66     36-48  (49)
 40 cd02848 Chitinase_N_term Chiti  25.9 1.1E+02  0.0024   19.0   3.1   27   53-79     79-106 (106)
 41 COG1470 Predicted membrane pro  24.0 3.8E+02  0.0082   21.4   7.6   10    1-10    352-361 (513)
 42 PF02369 Big_1:  Bacterial Ig-l  20.7      89  0.0019   18.7   1.9   17   68-84     22-38  (100)
 43 PF00801 PKD:  PKD domain;  Int  20.1 1.6E+02  0.0036   15.8   2.9   16   53-68     51-66  (69)

No 1  
>KOG4289|consensus
Probab=99.87  E-value=1e-21  Score=159.18  Aligned_cols=110  Identities=32%  Similarity=0.427  Sum_probs=100.3

Q ss_pred             CEEEEEEEeCC---CccEEEEEEEEeeCCCCCCeecCCcEEEEEEccCCCCCC---------------------------
Q psy585            1 YELKLAASDNL---KENYTTVVIHVKDVNDNPPVFERPTYRTQITEEDDRTLP---------------------------   50 (111)
Q Consensus         1 y~l~v~A~D~~---~~~~~~v~I~V~D~NDn~P~F~~~~y~~~v~E~~~~~~~---------------------------   50 (111)
                      |++.+.|+|.|   +.+++++.|.|.|+|||+|+|..+.|.++|.|+.++++.                           
T Consensus       749 ytl~itA~D~~~pq~adtttveV~v~diNDnaPqf~assyt~sV~Ed~Pv~TsvlQVSatDaD~g~Ng~v~y~~qg~~d~  828 (2531)
T KOG4289|consen  749 YTLAITARDNGIPQKADTTTVEVLVNDINDNAPQFLASSYTGSVFEDAPVFTSVLQVSATDADSGPNGRVYYTFQGGDDG  828 (2531)
T ss_pred             eEeeeeecCCCCCCcCccEEEEEEeecccccCcccchhhceeEeecCCCCcceEEEEEEeccCCCCCceEEEEecCCCCC
Confidence            78999999986   678899999999999999999999999999999776542                           


Q ss_pred             ---------------------ceEeEEEEEEEEecCCcc---ceEEEEEEEeecCCCCCEEecCcceEEEECCCCCCccE
Q psy585           51 ---------------------KRVLQYELTLVASDSLNE---NKTTVVIHINDVNDMPPVFNTSLYPAIMEEELPGPYPH  106 (111)
Q Consensus        51 ---------------------e~~~~y~l~v~a~d~~~~---~~~~v~I~V~D~Ndn~P~F~~~~y~~~v~E~~~~~~~~  106 (111)
                                           |....|.|.+.|.|.|.+   +.+.|+|+|+|+|||||+|.+.+|...|.||.|.|+.+
T Consensus       829 p~~F~IEptSGviRtl~rLdRE~~avy~L~a~avDrg~p~ls~~~eItvtvldvNDnaPvfe~~e~e~~I~enspvgs~v  908 (2531)
T KOG4289|consen  829 PGDFYIEPTSGVIRTLRRLDRENVAVYVLAAYAVDRGNPPLSAPVEITVTVLDVNDNAPVFEQDELELFIEENSPVGSVV  908 (2531)
T ss_pred             CCceEEccCcceeehhhhhcchheeEEEEEEEEeeCCCCCcCCceEEEEEEEecCCCCCCCCCcceeeEEeecCccceee
Confidence                                 667899999999999873   77899999999999999999999999999999999999


Q ss_pred             EEEE
Q psy585          107 SLLK  110 (111)
Q Consensus       107 ~~iq  110 (111)
                      +.|.
T Consensus       909 a~i~  912 (2531)
T KOG4289|consen  909 ALIT  912 (2531)
T ss_pred             EEEE
Confidence            8874


No 2  
>KOG4289|consensus
Probab=99.84  E-value=6.5e-21  Score=154.62  Aligned_cols=110  Identities=29%  Similarity=0.413  Sum_probs=96.8

Q ss_pred             CEEEEEEEeCC---CccEEEEEEEEeeCCCCCCeecCCcEEEEEEccCCCCCC---------------------------
Q psy585            1 YELKLAASDNL---KENYTTVVIHVKDVNDNPPVFERPTYRTQITEEDDRTLP---------------------------   50 (111)
Q Consensus         1 y~l~v~A~D~~---~~~~~~v~I~V~D~NDn~P~F~~~~y~~~v~E~~~~~~~---------------------------   50 (111)
                      +.|.|+|.|.+   .++++++.|.|.|.|||.|.|.+++|..++.||.+.|++                           
T Consensus       235 HvlrVtA~d~~~P~~SAtttv~V~V~D~nDhsPvFEq~~Y~e~lREn~evGy~vLtvrAtD~Dsp~Nani~Yrl~eg~~~  314 (2531)
T KOG4289|consen  235 HVLRVTAQDHGDPRRSATTTVTVLVLDTNDHSPVFEQDEYREELRENLEVGYEVLTVRATDGDSPPNANIRYRLLEGNAK  314 (2531)
T ss_pred             eEEEEEeeecCCCcccceeEEEEEEeecCCCCcccchhHHHHHHhhccccCceEEEEEeccCCCCCCCceEEEecCCCcc
Confidence            46899999985   799999999999999999999999999999998654432                           


Q ss_pred             --------------------ceEeEEEEEEEEecCCc---cceEEEEEEEeecCCCCCEEecCcceEEEECCCCCCccEE
Q psy585           51 --------------------KRVLQYELTLVASDSLN---ENKTTVVIHINDVNDMPPVFNTSLYPAIMEEELPGPYPHS  107 (111)
Q Consensus        51 --------------------e~~~~y~l~v~a~d~~~---~~~~~v~I~V~D~Ndn~P~F~~~~y~~~v~E~~~~~~~~~  107 (111)
                                          |....|.|.|+|+|+|.   +.++.|.|+|.|+|||+|+|....|.+.|.|+..+++.+.
T Consensus       315 ~~f~in~rSGvI~T~a~lDRE~~~~y~L~VeAsDqG~~pgp~Ta~V~itV~D~NDNaPqFse~~Yvvqv~Edvt~~avvl  394 (2531)
T KOG4289|consen  315 NVFEINPRSGVISTRAPLDREELESYQLDVEASDQGRPPGPRTAMVEITVEDENDNAPQFSEKRYVVQVREDVTPPAVVL  394 (2531)
T ss_pred             ceeEEcCccceeeccCccCHHhhhheEEEEEeccCCCCCCCceEEEEEEEEecCCCCccccccceEEEecccCCCCceEE
Confidence                                55678999999999986   5899999999999999999999999999999988877665


Q ss_pred             EEE
Q psy585          108 LLK  110 (111)
Q Consensus       108 ~iq  110 (111)
                      -++
T Consensus       395 rV~  397 (2531)
T KOG4289|consen  395 RVT  397 (2531)
T ss_pred             EEE
Confidence            543


No 3  
>KOG1219|consensus
Probab=99.83  E-value=5.7e-20  Score=153.16  Aligned_cols=111  Identities=29%  Similarity=0.427  Sum_probs=100.8

Q ss_pred             CEEEEEEEeC--CCccEEEEEEEEeeCCCCCCeecCCcEEEEEEccCCCCCC----------------------------
Q psy585            1 YELKLAASDN--LKENYTTVVIHVKDVNDNPPVFERPTYRTQITEEDDRTLP----------------------------   50 (111)
Q Consensus         1 y~l~v~A~D~--~~~~~~~v~I~V~D~NDn~P~F~~~~y~~~v~E~~~~~~~----------------------------   50 (111)
                      |.|.|+|+|.  ++.+.+.|.|.|.|+|||+|.|.+..|.++++|++..|++                            
T Consensus      2145 h~l~ieAtd~~~p~~Aea~VeIiV~dIndn~PvFeqlsYt~sisE~s~igt~viqilATdsDsn~~isYsl~g~s~~sk~ 2224 (4289)
T KOG1219|consen 2145 HELLIEATDAGIPLSAEAKVEIIVGDINDNPPVFEQLSYTISISENSKIGTKVIQILATDSDSNREISYSLEGNSEISKP 2224 (4289)
T ss_pred             eEEEEEEeccCCCcceeeEEEEEecccCCCCchhheeeEEEEccCCCccCceEEEEEeccCCCCCceEEEeecCCccccc
Confidence            7899999996  4789999999999999999999999999999999765543                            


Q ss_pred             ------------------ceEeEEEEEEEEecCCcc--ceEEEEEEEeecCCCCCEEecCcceEEEECCCCCCccEEEEE
Q psy585           51 ------------------KRVLQYELTLVASDSLNE--NKTTVVIHINDVNDMPPVFNTSLYPAIMEEELPGPYPHSLLK  110 (111)
Q Consensus        51 ------------------e~~~~y~l~v~a~d~~~~--~~~~v~I~V~D~Ndn~P~F~~~~y~~~v~E~~~~~~~~~~iq  110 (111)
                                        |...+|.+.+.|+|+|.+  +.+.|.|+|.|+|||+|.|.++.|.+.|+|.++-|..+.+++
T Consensus      2225 f~In~sTG~it~~g~ldyE~~q~f~~fvratdggk~lSseviv~V~VeD~Ndn~Pef~q~~~ea~vsd~a~~g~fit~v~ 2304 (4289)
T KOG1219|consen 2225 FRINVSTGWITVAGKLDYEENQEFRFFVRATDGGKPLSSEVIVEVHVEDFNDNPPEFNQRNYEAFVSDPARSGHFITVVN 2304 (4289)
T ss_pred             eEEecccceEEEeeecChhhcceEEEEEEEccCCCcccccEEEEEEehhcCCCCchhccccceeecCCCccceeEEEEEE
Confidence                              667899999999999986  558899999999999999999999999999999999888876


Q ss_pred             C
Q psy585          111 L  111 (111)
Q Consensus       111 v  111 (111)
                      .
T Consensus      2305 a 2305 (4289)
T KOG1219|consen 2305 A 2305 (4289)
T ss_pred             e
Confidence            3


No 4  
>KOG1219|consensus
Probab=99.80  E-value=5.3e-19  Score=147.58  Aligned_cols=110  Identities=35%  Similarity=0.533  Sum_probs=100.0

Q ss_pred             CEEEEEEEeCCC-ccEEEEEEEEeeCCCCCCeecCCcEEEEEEccCCCCCC-----------------------------
Q psy585            1 YELKLAASDNLK-ENYTTVVIHVKDVNDNPPVFERPTYRTQITEEDDRTLP-----------------------------   50 (111)
Q Consensus         1 y~l~v~A~D~~~-~~~~~v~I~V~D~NDn~P~F~~~~y~~~v~E~~~~~~~-----------------------------   50 (111)
                      |++.|.|++++. -+.+.|.|.|.|+|||+|.|..+.|.+.+.||.+.|+.                             
T Consensus      2648 yqi~v~a~~~~~vva~tsv~vqVkDvNDNaPvFe~d~y~f~i~En~pvGtsV~qf~AsD~Ds~~nGqirysl~~~v~yF~ 2727 (4289)
T KOG1219|consen 2648 YQIKVKATCGQWVVAETSVFVQVKDVNDNAPVFEKDPYLFIIEENSPVGTSVIQFHASDMDSGNNGQIRYSLTSPVPYFA 2727 (4289)
T ss_pred             EEEEEEeecCCceEEEEEEEEEeecccCCCccccCCceeEEEeccCCCCceEEEEEeeccCCCCCceEEEEEcCCcceEE
Confidence            789999999875 78899999999999999999999999999999765542                             


Q ss_pred             ----------------ceEeEEEEEEEEecCCcc-ceEEEEEEEeecCCCCCEEecCcceEEEECCCCCCccEEEEE
Q psy585           51 ----------------KRVLQYELTLVASDSLNE-NKTTVVIHINDVNDMPPVFNTSLYPAIMEEELPGPYPHSLLK  110 (111)
Q Consensus        51 ----------------e~~~~y~l~v~a~d~~~~-~~~~v~I~V~D~Ndn~P~F~~~~y~~~v~E~~~~~~~~~~iq  110 (111)
                                      |++..|.|.+.|+|.|.+ +.++|.++|.|+||+||+|....|.+++.|+.|+|..+..+.
T Consensus      2728 In~etGwlTt~~eld~ek~d~y~lkv~AtDhG~~ssq~~v~v~vtDvndspprf~~eiy~gtvv~d~p~~~~ia~~s 2804 (4289)
T KOG1219|consen 2728 INPETGWLTTLFELDLEKQDLYSLKVVATDHGVPSSQATVLVHVTDVNDSPPRFQREIYEGTVVEDVPGGKIIAGLS 2804 (4289)
T ss_pred             EcCCCCeeeehhhhccccCCceEEEEEEecCCcccccceEEEEEEecCCCcchhhhHhhccceeccCCCCceeeeeE
Confidence                            678899999999999984 789999999999999999999999999999999998887664


No 5  
>cd00031 CA Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here
Probab=99.31  E-value=4.2e-11  Score=80.74  Aligned_cols=47  Identities=43%  Similarity=0.681  Sum_probs=41.4

Q ss_pred             CEEEEEEEeCC-C--ccEEEEEEEEeeCCCCCCeecCCcEEEEEEccCCC
Q psy585            1 YELKLAASDNL-K--ENYTTVVIHVKDVNDNPPVFERPTYRTQITEEDDR   47 (111)
Q Consensus         1 y~l~v~A~D~~-~--~~~~~v~I~V~D~NDn~P~F~~~~y~~~v~E~~~~   47 (111)
                      |.|.|+|+|.| .  ++...+.|.|.|+|||+|.|....|.+.|.|+.+.
T Consensus        68 ~~l~v~a~D~g~~~~~~~~~v~I~V~d~Nd~~P~~~~~~~~~~v~e~~~~  117 (199)
T cd00031          68 YTLTVVASDGGGPPLSSTATVTVTVLDVNDNPPVFEQSSYEASVPENAPP  117 (199)
T ss_pred             EEEEEEEEECCcCcceeEEEEEEEEccCCCCCCcccccceEEEEeCCCCC
Confidence            78999999954 3  38899999999999999999988999999998543


No 6  
>cd00031 CA Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here
Probab=99.30  E-value=1e-10  Score=78.90  Aligned_cols=105  Identities=28%  Similarity=0.332  Sum_probs=72.2

Q ss_pred             EEEEEeCCCccEEEEEEEEeeCCCCCCeecCCcEEEEEEccCCCCCCceEeEEEEEEEEecCCcc---ceEEEEEEEeec
Q psy585            4 KLAASDNLKENYTTVVIHVKDVNDNPPVFERPTYRTQITEEDDRTLPKRVLQYELTLVASDSLNE---NKTTVVIHINDV   80 (111)
Q Consensus         4 ~v~A~D~~~~~~~~v~I~V~D~NDn~P~F~~~~y~~~v~E~~~~~~~e~~~~y~l~v~a~d~~~~---~~~~v~I~V~D~   80 (111)
                      .+.|.|.-......+...+..-++. ..|.-....+.+.-....+ .|....|.|.|.|+|.|.+   ++..+.|.|.|+
T Consensus        18 ~~~a~D~D~~~~~~~~y~i~~~~~~-~~F~i~~~tG~l~~~~~lD-~e~~~~~~l~v~a~D~g~~~~~~~~~v~I~V~d~   95 (199)
T cd00031          18 TVSATDPDSGENGRVTYSILGGNED-GLFSIDPNTGVITTTKPLD-REEQSEYTLTVVASDGGGPPLSSTATVTVTVLDV   95 (199)
T ss_pred             EEEEECCCCCCCceEEEEEeCCCCc-ccEEEeCCCCEEEECCCCC-CcCCceEEEEEEEEECCcCcceeEEEEEEEEccC
Confidence            4557765322134455555555443 4554443333333222111 2555899999999996653   678999999999


Q ss_pred             CCCCCEEecCcceEEEECCCCCCccEEEEE
Q psy585           81 NDMPPVFNTSLYPAIMEEELPGPYPHSLLK  110 (111)
Q Consensus        81 Ndn~P~F~~~~y~~~v~E~~~~~~~~~~iq  110 (111)
                      |||+|.|....|.+.+.|+.++|+.+..++
T Consensus        96 Nd~~P~~~~~~~~~~v~e~~~~~~~i~~~~  125 (199)
T cd00031          96 NDNPPVFEQSSYEASVPENAPPGTVVGTVT  125 (199)
T ss_pred             CCCCCcccccceEEEEeCCCCCCCEEEEEE
Confidence            999999999999999999999988877664


No 7  
>KOG1834|consensus
Probab=98.81  E-value=4.2e-08  Score=76.69  Aligned_cols=78  Identities=37%  Similarity=0.596  Sum_probs=61.8

Q ss_pred             CEEEEEEEeCC---------CccEEEEEEEEeeCCCCCCeecCCcEEEEEEccC-----------C--------------
Q psy585            1 YELKLAASDNL---------KENYTTVVIHVKDVNDNPPVFERPTYRTQITEED-----------D--------------   46 (111)
Q Consensus         1 y~l~v~A~D~~---------~~~~~~v~I~V~D~NDn~P~F~~~~y~~~v~E~~-----------~--------------   46 (111)
                      |+++|+|.|.|         .+..++|.|+|.|+|..+|.|....|.+.|.|..           |              
T Consensus       109 ytf~iQAydCg~gpdgtn~kKShkatvhIrVkDvNe~AP~f~ep~Yka~V~EGK~yd~il~veAiD~DCspq~sqIC~YE  188 (952)
T KOG1834|consen  109 YTFTIQAYDCGNGPDGTNTKKSHKATVHIRVKDVNEFAPVFKEPWYKAHVTEGKVYDSILRVEAIDKDCSPQYSQICEYE  188 (952)
T ss_pred             ceEEEEEEecCCCCCccccccccceEEEEEeccccccCchhcccceeeEEecceeeeeeEEEEeecCCCCCcccceeEEE
Confidence            89999999953         4677899999999999999999999999999962           0              


Q ss_pred             ---CCCC-----------------ceEeEEEEEEEEecCCcc---ceEEEEEEEe
Q psy585           47 ---RTLP-----------------KRVLQYELTLVASDSLNE---NKTTVVIHIN   78 (111)
Q Consensus        47 ---~~~~-----------------e~~~~y~l~v~a~d~~~~---~~~~v~I~V~   78 (111)
                         ++.|                 .+...|.|+|.|.|.|..   +.+.|+|+|.
T Consensus       189 I~t~d~PFaIdn~G~irnTekLny~ke~~Y~ltVtAyDCg~kraa~d~lV~v~Vk  243 (952)
T KOG1834|consen  189 ITTPDVPFAIDNDGNIRNTEKLNYTKEHQYKLTVTAYDCGKKRAASDSLVTVHVK  243 (952)
T ss_pred             ecCCCCceEEcCCCccccccccccccceeEEEEEEEEecccccccCcceEEEEec
Confidence               1111                 245799999999999873   3356777763


No 8  
>smart00112 CA Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
Probab=98.60  E-value=5.4e-07  Score=52.55  Aligned_cols=66  Identities=26%  Similarity=0.319  Sum_probs=43.9

Q ss_pred             EEEEEeeCCCCCCeecCCcEEEEEEccCCCCCCceEeEEEEEEEEecCCc---cceEEEEEEEeecCCCCC
Q psy585           18 VVIHVKDVNDNPPVFERPTYRTQITEEDDRTLPKRVLQYELTLVASDSLN---ENKTTVVIHINDVNDMPP   85 (111)
Q Consensus        18 v~I~V~D~NDn~P~F~~~~y~~~v~E~~~~~~~e~~~~y~l~v~a~d~~~---~~~~~v~I~V~D~Ndn~P   85 (111)
                      +.-.+..-+++ ..|.-..-.+.+.-... ---|....|.|.|.|+|.|.   .+.++|.|+|.|+|||+|
T Consensus        11 i~Y~i~~~~~~-~~F~i~~~tg~i~~~~~-LD~e~~~~y~l~v~a~D~~~~~~~~~~~v~I~V~D~Nd~~P   79 (79)
T smart00112       11 VTYSILSGNED-GLFSIDPETGEITTTKP-LDREEQPEYTLTVEATDGGGPPLSSTATVTVTVLDVNDNAP   79 (79)
T ss_pred             EEEEEecCCCC-CEEEEeCCccEEEeCCc-cCeeCCCeEEEEEEEEECCCCCcccEEEEEEEEEECCCCCC
Confidence            55666655543 45554433332221111 11245689999999999986   478999999999999998


No 9  
>KOG1834|consensus
Probab=98.53  E-value=3.9e-07  Score=71.41  Aligned_cols=53  Identities=34%  Similarity=0.447  Sum_probs=46.0

Q ss_pred             ceEeEEEEEEEEecCCc---------cceEEEEEEEeecCCCCCEEecCcceEEEECCCCCC
Q psy585           51 KRVLQYELTLVASDSLN---------ENKTTVVIHINDVNDMPPVFNTSLYPAIMEEELPGP  103 (111)
Q Consensus        51 e~~~~y~l~v~a~d~~~---------~~~~~v~I~V~D~Ndn~P~F~~~~y~~~v~E~~~~~  103 (111)
                      |.+..|+|+|+|.|.|.         +-.++|.|.|.|+|+.+|+|..+.|.+.|.|+.--.
T Consensus       104 elqkeytf~iQAydCg~gpdgtn~kKShkatvhIrVkDvNe~AP~f~ep~Yka~V~EGK~yd  165 (952)
T KOG1834|consen  104 ELQKEYTFTIQAYDCGNGPDGTNTKKSHKATVHIRVKDVNEFAPVFKEPWYKAHVTEGKVYD  165 (952)
T ss_pred             cccccceEEEEEEecCCCCCccccccccceEEEEEeccccccCchhcccceeeEEecceeee
Confidence            55789999999999764         246899999999999999999999999999985433


No 10 
>smart00112 CA Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
Probab=98.47  E-value=3.5e-07  Score=53.38  Aligned_cols=30  Identities=43%  Similarity=0.603  Sum_probs=28.0

Q ss_pred             CEEEEEEEeCC---CccEEEEEEEEeeCCCCCC
Q psy585            1 YELKLAASDNL---KENYTTVVIHVKDVNDNPP   30 (111)
Q Consensus         1 y~l~v~A~D~~---~~~~~~v~I~V~D~NDn~P   30 (111)
                      |.|.|+|+|.|   +++.+.|.|.|.|+|||+|
T Consensus        47 y~l~v~a~D~~~~~~~~~~~v~I~V~D~Nd~~P   79 (79)
T smart00112       47 YTLTVEATDGGGPPLSSTATVTVTVLDVNDNAP   79 (79)
T ss_pred             EEEEEEEEECCCCCcccEEEEEEEEEECCCCCC
Confidence            78999999986   6899999999999999998


No 11 
>PF00028 Cadherin:  Cadherin domain;  InterPro: IPR002126 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion.; GO: 0005509 calcium ion binding, 0007156 homophilic cell adhesion, 0016020 membrane; PDB: 2A4E_A 2A4C_B 2O72_A 2QVI_A 1NCJ_A 3Q2W_A 3Q2N_A 3LNH_B 3LNI_A 3Q2L_A ....
Probab=97.60  E-value=0.00023  Score=42.54  Aligned_cols=28  Identities=32%  Similarity=0.394  Sum_probs=22.5

Q ss_pred             ceEeEEEEEEEEecC-Ccc---ceEEEEEEEe
Q psy585           51 KRVLQYELTLVASDS-LNE---NKTTVVIHIN   78 (111)
Q Consensus        51 e~~~~y~l~v~a~d~-~~~---~~~~v~I~V~   78 (111)
                      |....|.|.|.|+|. |.+   ++++|+|+|+
T Consensus        62 E~~~~y~l~v~a~D~~~~~~~~~~~~V~I~V~   93 (93)
T PF00028_consen   62 ETQSSYQLTVRATDSGGSPPLSSTATVTINVL   93 (93)
T ss_dssp             TTTSEEEEEEEEEETTTSSEEEEEEEEEEEEE
T ss_pred             ccCCEEEEEEEEEECCCCCCCEEEEEEEEEEC
Confidence            667899999999999 553   6777888774


No 12 
>TIGR01965 VCBS_repeat VCBS repeat. This domain of about 100 residues is found multiple (up to 35) copies in long proteins from several species of Vibrio, Colwellia, Bradyrhizobium, and Shewanella (hence the name VCBS) and in smaller copy numbers in proteins from several other bacteria. The large protein size and repeat copy numbers, species distribution, and suggested activities of several member proteins suggests a role for this domain in adhesion.
Probab=92.64  E-value=0.74  Score=28.34  Aligned_cols=42  Identities=26%  Similarity=0.377  Sum_probs=32.0

Q ss_pred             EeEEEEEEEEecCCccceEEEEEEEeecCCCCCEEecCcceEEEECC
Q psy585           53 VLQYELTLVASDSLNENKTTVVIHINDVNDMPPVFNTSLYPAIMEEE   99 (111)
Q Consensus        53 ~~~y~l~v~a~d~~~~~~~~v~I~V~D~Ndn~P~F~~~~y~~~v~E~   99 (111)
                      ...-.|++.+.|+   .+.+|.|+|.-.|| +|+..-. -...+.|+
T Consensus        56 ~~tdsFtvtv~DG---tt~~vtItI~GtND-apvi~~~-~~g~v~ED   97 (99)
T TIGR01965        56 TLTDTFTVTSADG---TSQTVTITITGAND-AAVIGGA-DTGSVTED   97 (99)
T ss_pred             EEEEEEEEEEeCC---CeEEEEEEEEccCC-CCEEecc-cceeEecC
Confidence            3455677888886   48899999999999 8877643 35777776


No 13 
>TIGR03660 T1SS_rpt_143 T1SS-143 repeat domain. This model represents a domain of about 143 amino acids that may occur singly or in up to 23 tandem repeats in very large proteins in the genus Vibrio, and in related species such as Legionella pneumophila, Photobacterium profundum, Rhodopseudomonas palustris, Shewanella pealeana, and Aeromonas hydrophila. Proteins with these domains represent a subset of a broader set of proteins with a particular signal for type 1 secretion, consisting of several glycine-rich repeats modeled by pfam00353, followed by a C-terminal domain modeled by TIGR03661. Proteins with this domain tend to share several properties with the RtxA (Repeats in Toxin) protein of Vibrio cholerae, including a large size often containing tandemly repeated domains and a C-terminal signal for type 1 secretion.
Probab=91.96  E-value=2.3  Score=27.64  Aligned_cols=66  Identities=21%  Similarity=0.316  Sum_probs=43.4

Q ss_pred             CcEEEEEEccCCCCCCceEeEEEEEEEEecCCc-cceEEEEEEEeecCCCCCEEecCcceEEEECC-CCCCc
Q psy585           35 PTYRTQITEEDDRTLPKRVLQYELTLVASDSLN-ENKTTVVIHINDVNDMPPVFNTSLYPAIMEEE-LPGPY  104 (111)
Q Consensus        35 ~~y~~~v~E~~~~~~~e~~~~y~l~v~a~d~~~-~~~~~v~I~V~D~Ndn~P~F~~~~y~~~v~E~-~~~~~  104 (111)
                      ..|.+++...-+-...+......|.|.|+|..+ .+..++.|+|.|  | .|.-.... ...|.|+ .+.|.
T Consensus        66 GsYtftL~~~lDH~~g~d~l~l~~~v~a~D~DGD~s~~~l~VtI~D--D-~P~~~~~~-~~~V~E~~L~~g~  133 (137)
T TIGR03660        66 GSYEFTLEGPLDHAAGSDELTLNFPIIATDFDGDTSSITLPVTIVD--D-VPTITDVD-ALTVDEDDLPGGS  133 (137)
T ss_pred             ccEEEEEcccccCCCCCceEEEeeeEEEEeCCCCccccEEEEEEEC--C-CCeecccc-ceEEeccccccCC
Confidence            358888776532211244566778899998755 455688888887  6 68876544 4788885 55443


No 14 
>smart00736 CADG Dystroglycan-type cadherin-like domains. Cadherin-homologous domains present in metazoan dystroglycans and alpha/epsilon sarcoglycans, yeast Axl2p and in a very large protein from magnetotactic bacteria. Likely to bind calcium ions.
Probab=89.24  E-value=1.3  Score=26.65  Aligned_cols=30  Identities=30%  Similarity=0.352  Sum_probs=25.3

Q ss_pred             EeEEEEEEEEecCC-ccceEEEEEEEeecCC
Q psy585           53 VLQYELTLVASDSL-NENKTTVVIHINDVND   82 (111)
Q Consensus        53 ~~~y~l~v~a~d~~-~~~~~~v~I~V~D~Nd   82 (111)
                      ...|.++|.|+|+. ......+.|.|.+.||
T Consensus        66 ~g~~~i~v~a~D~~g~~~~~~f~i~V~~~~~   96 (97)
T smart00736       66 VGSLSLKVTATDSSGASASDTFTITVVNTND   96 (97)
T ss_pred             CcEEEEEEEEEECCCCEEEEEEEEEEeCCCC
Confidence            35699999999986 4677889999999887


No 15 
>PF12245 Big_3_2:  Bacterial Ig-like domain (group 3);  InterPro: IPR022038  This family of proteins is found in bacteria. They have two conserved sequence motifs: AGN and GMT. 
Probab=84.26  E-value=3.9  Score=22.48  Aligned_cols=26  Identities=27%  Similarity=0.358  Sum_probs=18.4

Q ss_pred             CEEEEEEEeC-CCccEEEEEEEEeeCC
Q psy585            1 YELKLAASDN-LKENYTTVVIHVKDVN   26 (111)
Q Consensus         1 y~l~v~A~D~-~~~~~~~v~I~V~D~N   26 (111)
                      |+|.++|+|. |..+.....+.+.|..
T Consensus        25 yt~~v~a~D~AGN~~~~~~~~~i~d~~   51 (60)
T PF12245_consen   25 YTLTVTATDKAGNTSSSTTQIVIVDNT   51 (60)
T ss_pred             EEEEEEEEECCCCEEEeeeEEEEEcCC
Confidence            7899999997 6666666666666543


No 16 
>PF13750 Big_3_3:  Bacterial Ig-like domain (group 3)
Probab=82.71  E-value=11  Score=24.86  Aligned_cols=27  Identities=19%  Similarity=0.247  Sum_probs=16.1

Q ss_pred             eEeEEEEEEEEecCCc-cceEEEEEEEe
Q psy585           52 RVLQYELTLVASDSLN-ENKTTVVIHIN   78 (111)
Q Consensus        52 ~~~~y~l~v~a~d~~~-~~~~~v~I~V~   78 (111)
                      ....|+|+|.|.|..+ .++.++.....
T Consensus       121 ~~~~YtLtV~a~D~aGN~~~~si~F~y~  148 (158)
T PF13750_consen  121 ADDSYTLTVSATDKAGNQSTKSISFSYM  148 (158)
T ss_pred             CCCeEEEEEEEEecCCCEEEEEEEEEEe
Confidence            3457777777777644 45555555443


No 17 
>PF07495 Y_Y_Y:  Y_Y_Y domain;  InterPro: IPR011123 This region is mostly found at the end of the beta propellers (IPR011110 from INTERPRO) in a family of two component regulators. However they are also found tandemly repeated in Q891H4 from SWISSPROT without other signal conduction domains being present. It is named after the conserved tyrosines found in the alignment. The exact function is not known.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=80.09  E-value=7  Score=21.24  Aligned_cols=26  Identities=23%  Similarity=0.321  Sum_probs=18.2

Q ss_pred             EeEEEEEEEEecCCcc---ceEEEEEEEe
Q psy585           53 VLQYELTLVASDSLNE---NKTTVVIHIN   78 (111)
Q Consensus        53 ~~~y~l~v~a~d~~~~---~~~~v~I~V~   78 (111)
                      ...|+|.|.|.|..+.   ...++.|.|+
T Consensus        38 ~G~Y~l~V~a~~~~~~~~~~~~~l~i~I~   66 (66)
T PF07495_consen   38 PGKYTLEVRAKDNNGKWSSDEKSLTITIL   66 (66)
T ss_dssp             SEEEEEEEEEEETTS-B-SS-EEEEEEEE
T ss_pred             CEEEEEEEEEECCCCCcCcccEEEEEEEC
Confidence            4689999999997652   2366777663


No 18 
>TIGR03660 T1SS_rpt_143 T1SS-143 repeat domain. This model represents a domain of about 143 amino acids that may occur singly or in up to 23 tandem repeats in very large proteins in the genus Vibrio, and in related species such as Legionella pneumophila, Photobacterium profundum, Rhodopseudomonas palustris, Shewanella pealeana, and Aeromonas hydrophila. Proteins with these domains represent a subset of a broader set of proteins with a particular signal for type 1 secretion, consisting of several glycine-rich repeats modeled by pfam00353, followed by a C-terminal domain modeled by TIGR03661. Proteins with this domain tend to share several properties with the RtxA (Repeats in Toxin) protein of Vibrio cholerae, including a large size often containing tandemly repeated domains and a C-terminal signal for type 1 secretion.
Probab=78.61  E-value=9.2  Score=24.85  Aligned_cols=40  Identities=20%  Similarity=0.311  Sum_probs=25.4

Q ss_pred             EEEEEEEeC-CCccEEEEEEEEeeCCCCCCeecCCcEEEEEEccC
Q psy585            2 ELKLAASDN-LKENYTTVVIHVKDVNDNPPVFERPTYRTQITEED   45 (111)
Q Consensus         2 ~l~v~A~D~-~~~~~~~v~I~V~D~NDn~P~F~~~~y~~~v~E~~   45 (111)
                      .|.|.|+|. |-.+...+.|.|.|  | .|.-.... ...|.|++
T Consensus        88 ~~~v~a~D~DGD~s~~~l~VtI~D--D-~P~~~~~~-~~~V~E~~  128 (137)
T TIGR03660        88 NFPIIATDFDGDTSSITLPVTIVD--D-VPTITDVD-ALTVDEDD  128 (137)
T ss_pred             eeeEEEEeCCCCccccEEEEEEEC--C-CCeecccc-ceEEeccc
Confidence            466788875 44444577788777  5 47665543 36777754


No 19 
>PF08758 Cadherin_pro:  Cadherin prodomain like;  InterPro: IPR014868 Cadherins are a group of proteins that mediate calcium dependent cell-cell adhesion. They are activated through cleavage of a prosequence in the late Golgi. This protein corresponds to the folded region of the prosequence, and is termed the prodomain. The prodomain shows structural resemblance to the cadherin domain, but lacks all the features known to be important for cadherin-cadherin interactions []. ; GO: 0007155 cell adhesion, 0016021 integral to membrane; PDB: 1OP4_A.
Probab=74.98  E-value=11  Score=22.55  Aligned_cols=17  Identities=18%  Similarity=0.374  Sum_probs=5.1

Q ss_pred             CCeecCCcEEEEEEccC
Q psy585           29 PPVFERPTYRTQITEED   45 (111)
Q Consensus        29 ~P~F~~~~y~~~v~E~~   45 (111)
                      .|-|.+..|.+.|+.+-
T Consensus         3 ~pGF~~~~~~~~Vp~~l   19 (90)
T PF08758_consen    3 RPGFSQKKYTFEVPSNL   19 (90)
T ss_dssp             --B--S-EEEE----SS
T ss_pred             cCCcccceEEEEcCchh
Confidence            46666667777666653


No 20 
>smart00089 PKD Repeats in polycystic kidney disease 1 (PKD1) and other proteins. Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.
Probab=67.24  E-value=17  Score=20.36  Aligned_cols=26  Identities=19%  Similarity=0.390  Sum_probs=20.0

Q ss_pred             eEeEEEEEEEEecCCccceEEEEEEE
Q psy585           52 RVLQYELTLVASDSLNENKTTVVIHI   77 (111)
Q Consensus        52 ~~~~y~l~v~a~d~~~~~~~~v~I~V   77 (111)
                      ....|.+++.+.|..+.+++++.|.|
T Consensus        53 ~~G~y~v~l~v~n~~g~~~~~~~i~v   78 (79)
T smart00089       53 KPGTYTVTLTVTNAVGSASATVTVVV   78 (79)
T ss_pred             CCcEEEEEEEEEcCCCcEEEEEEEEE
Confidence            44689999999998776677777765


No 21 
>TIGR01965 VCBS_repeat VCBS repeat. This domain of about 100 residues is found multiple (up to 35) copies in long proteins from several species of Vibrio, Colwellia, Bradyrhizobium, and Shewanella (hence the name VCBS) and in smaller copy numbers in proteins from several other bacteria. The large protein size and repeat copy numbers, species distribution, and suggested activities of several member proteins suggests a role for this domain in adhesion.
Probab=65.65  E-value=18  Score=22.21  Aligned_cols=38  Identities=24%  Similarity=0.384  Sum_probs=26.6

Q ss_pred             EEEEEEEeCCCccEEEEEEEEeeCCCCCCeecCCcEEEEEEcc
Q psy585            2 ELKLAASDNLKENYTTVVIHVKDVNDNPPVFERPTYRTQITEE   44 (111)
Q Consensus         2 ~l~v~A~D~~~~~~~~v~I~V~D~NDn~P~F~~~~y~~~v~E~   44 (111)
                      .+++.+.|+   .+..|.|.|.-.|| +|.... .-.+.+.|+
T Consensus        60 sFtvtv~DG---tt~~vtItI~GtND-apvi~~-~~~g~v~ED   97 (99)
T TIGR01965        60 TFTVTSADG---TSQTVTITITGAND-AAVIGG-ADTGSVTED   97 (99)
T ss_pred             EEEEEEeCC---CeEEEEEEEEccCC-CCEEec-ccceeEecC
Confidence            467778886   37889999999999 454433 234667665


No 22 
>cd00146 PKD polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.
Probab=60.92  E-value=21  Score=20.06  Aligned_cols=26  Identities=15%  Similarity=0.355  Sum_probs=17.4

Q ss_pred             eEeEEEEEEEEecCCc-cceEEEEEEE
Q psy585           52 RVLQYELTLVASDSLN-ENKTTVVIHI   77 (111)
Q Consensus        52 ~~~~y~l~v~a~d~~~-~~~~~v~I~V   77 (111)
                      +...|.+++.+.|..+ ....++.|.|
T Consensus        55 ~~G~y~v~l~v~d~~g~~~~~~~~V~V   81 (81)
T cd00146          55 KPGTYTVTLTVTNAVGSSSTKTTTVVV   81 (81)
T ss_pred             CCcEEEEEEEEEeCCCCEEEEEEEEEC
Confidence            3468999999999844 4444555543


No 23 
>PF13860 FlgD_ig:  FlgD Ig-like domain; PDB: 3C12_A 3OSV_A.
Probab=59.10  E-value=33  Score=19.75  Aligned_cols=51  Identities=20%  Similarity=0.148  Sum_probs=25.2

Q ss_pred             EEEEEEEEeeCCCCCCeec-------CCcEEEEEEccCCCCCCceEeEEEEEEEEecCC
Q psy585           15 YTTVVIHVKDVNDNPPVFE-------RPTYRTQITEEDDRTLPKRVLQYELTLVASDSL   66 (111)
Q Consensus        15 ~~~v~I~V~D~NDn~P~F~-------~~~y~~~v~E~~~~~~~e~~~~y~l~v~a~d~~   66 (111)
                      ...+.|.|.|.+-+. +.+       ...+.+.-.=.++.+.+-....|.+.+.|+|.|
T Consensus        24 a~~v~v~I~d~~G~~-V~t~~~~~~~~G~~~~~WdG~d~~G~~~~~G~Y~~~v~a~~~g   81 (81)
T PF13860_consen   24 ADNVTVTIYDSNGQV-VRTISLGSQSAGEHSFTWDGKDDDGNPVPDGTYTFRVTATDGG   81 (81)
T ss_dssp             CEEEEEEEEETTS-E-EEEEEEEECSSEEEEEEE-SB-TTS-B--SEEEEEEEEEEET-
T ss_pred             ccEEEEEEEcCCCCE-EEEEEcCCcCCceEEEEECCCCCCcCCCCCCCEEEEEEEEeCC
Confidence            446788888886643 111       011122211113444455567899999999875


No 24 
>PF10365 DUF2436:  Domain of unknown function (DUF2436);  InterPro: IPR018832  Gingipains R and K are endopeptidases with specificity for arginyl and lysyl bonds, respectively. Like other cysteine peptidases, they require reducing conditions for activity. They are maximally active at approximately neutral pH. Gingipains R and K are secreted by the bacterium Porphyromonas gingivalis (Bacteroides gingivalis). The bacterium is a major pathogen in periodontal disease, and the many ways in which the activities of the gingipains may contribute to the disease processes have been reviewed []. These enzymes are also involved in the hemagglutinating activity of the organisms.  This entry represents a central region found in gingipain K peptidases, active on lysyl bonds; they belong to the MEROPS peptidase family C25 (gingipain family, clan CD).  
Probab=56.20  E-value=24  Score=23.23  Aligned_cols=23  Identities=22%  Similarity=0.250  Sum_probs=16.0

Q ss_pred             CCCCCeecCCcEEEEEEccCCCC
Q psy585           26 NDNPPVFERPTYRTQITEEDDRT   48 (111)
Q Consensus        26 NDn~P~F~~~~y~~~v~E~~~~~   48 (111)
                      |+|+|.=.-..|+..|++|+++.
T Consensus        66 n~~~pa~ly~~FEYkiP~NADps   88 (161)
T PF10365_consen   66 NCNVPANLYDPFEYKIPANADPS   88 (161)
T ss_pred             CCCCChhhcccceEeccCCCCCc
Confidence            55666544467888899988764


No 25 
>PF13754 Big_3_4:  Bacterial Ig-like domain (group 3)
Probab=53.74  E-value=20  Score=19.07  Aligned_cols=10  Identities=30%  Similarity=0.710  Sum_probs=6.9

Q ss_pred             CEEEEEEEeC
Q psy585            1 YELKLAASDN   10 (111)
Q Consensus         1 y~l~v~A~D~   10 (111)
                      |.++++|+|.
T Consensus        26 y~itv~a~D~   35 (54)
T PF13754_consen   26 YTITVTATDA   35 (54)
T ss_pred             EEEEEEEEeC
Confidence            5677777775


No 26 
>PF03160 Calx-beta:  Calx-beta domain;  InterPro: IPR003644 The calx-beta motif is present as a tandem repeat in the cytoplasmic domains of Calx Na-Ca exchangers, which are used to expel calcium from cells. This motif overlaps domains used for calcium binding and regulation. The calx-beta motif is also present in the cytoplasmic tail of mammalian integrin-beta4, which mediates the bi-directional transfer of signals across the plasma membrane, as well as in some cyanobacterial proteins. This motif contains a series of beta-strands and turns that form a self-contained beta-sheet [, ].; GO: 0007154 cell communication, 0016021 integral to membrane; PDB: 3H6A_B 3FSO_A 3FQ4_B 2DPK_A 2QVM_A 3GIN_B 2QVK_A 2FWU_A 2FWS_A 3E9U_A ....
Probab=52.76  E-value=24  Score=20.86  Aligned_cols=25  Identities=24%  Similarity=0.189  Sum_probs=18.1

Q ss_pred             EEEEeecCCCCCEEecCcceEEEECCC
Q psy585           74 VIHINDVNDMPPVFNTSLYPAIMEEEL  100 (111)
Q Consensus        74 ~I~V~D~Ndn~P~F~~~~y~~~v~E~~  100 (111)
                      +|.|.| ||.+ .+....-...+.|+.
T Consensus         2 tvtI~d-~d~~-~v~f~~~~~~v~E~~   26 (100)
T PF03160_consen    2 TVTILD-DDDP-TVSFSSPSYTVSEGD   26 (100)
T ss_dssp             EEEEE--TTSE-EEEESSSEEEEETTS
T ss_pred             EEEEEC-CCCC-EEEEeCCEEEEEeCC
Confidence            567788 7744 888777778888885


No 27 
>PF02494 HYR:  HYR domain;  InterPro: IPR003410 This domain is known as the HYR (Hyalin Repeat) domain, after the protein hyalin that is composed exclusively of this repeat. This domain probably corresponds to a new superfamily in the immunoglobulin fold. The function of this domain is uncertain it may be involved in cell adhesion. In the Sushi repeat-containing protein (SrpX), this domain is found between two sushi repeats.
Probab=52.13  E-value=43  Score=19.03  Aligned_cols=32  Identities=16%  Similarity=0.046  Sum_probs=22.1

Q ss_pred             CCCCCceEeEEEEEEEEecC-CccceEEEEEEE
Q psy585           46 DRTLPKRVLQYELTLVASDS-LNENKTTVVIHI   77 (111)
Q Consensus        46 ~~~~~e~~~~y~l~v~a~d~-~~~~~~~v~I~V   77 (111)
                      .++..-....|.++..|+|. |..++++..|+|
T Consensus        49 ~~g~~f~~G~t~V~ytA~D~~GN~a~C~f~V~V   81 (81)
T PF02494_consen   49 PPGDLFPVGTTTVTYTATDAAGNSATCSFTVTV   81 (81)
T ss_pred             CCCceEeeceEEEEEEEEECCCCEEEEEEEEEC
Confidence            33333344589999999997 446788887765


No 28 
>TIGR00864 PCC polycystin cation channel protein. Note: this model has been restricted to the amino half because for technical reasons.
Probab=47.12  E-value=2.9e+02  Score=27.06  Aligned_cols=28  Identities=7%  Similarity=0.317  Sum_probs=21.4

Q ss_pred             eEeEEEEEEEEecCCccceEEEEEEEee
Q psy585           52 RVLQYELTLVASDSLNENKTTVVIHIND   79 (111)
Q Consensus        52 ~~~~y~l~v~a~d~~~~~~~~v~I~V~D   79 (111)
                      ....|++++.|.|..+....++.|.|.+
T Consensus      1566 spGtYtVtLTvtN~~Gs~~~T~~i~V~~ 1593 (2740)
T TIGR00864      1566 SVGTFNIIVTAENDVGAAQASIFLFVLQ 1593 (2740)
T ss_pred             CCceEEEEEEEecCCCccceeEEEEEee
Confidence            3468999999999887766666666664


No 29 
>TIGR00845 caca sodium/calcium exchanger 1. This model is specific for the eukaryotic sodium ion/calcium ion exchangers of the Caca family
Probab=46.61  E-value=1.9e+02  Score=24.94  Aligned_cols=28  Identities=21%  Similarity=0.246  Sum_probs=20.7

Q ss_pred             EEEEEEeecCCCCCEEecCcceEEEECCC
Q psy585           72 TVVIHINDVNDMPPVFNTSLYPAIMEEEL  100 (111)
Q Consensus        72 ~v~I~V~D~Ndn~P~F~~~~y~~~v~E~~  100 (111)
                      +.+|+|.| ||++|.|....-..+|.|+.
T Consensus       516 ~ATVTIlD-DD~aGIfsFe~~~~sV~Es~  543 (928)
T TIGR00845       516 TATVTILD-DDHAGIFTFEEDVFHVSESI  543 (928)
T ss_pred             eEEEEEec-CcccCcccccCceEEEEcCC
Confidence            45666677 78899887666667888874


No 30 
>PF07145 PAM2:  Ataxin-2 C-terminal region;  InterPro: IPR009818 This entry represents a conserved region approximately 250 residues long located towards the C terminus of eukaryotic ataxin-2. Ataxin-2 is a protein of unknown function, within which expansion of a polyglutamine tract (due to expansion of unstable CAG repeats in the coding region of the SCA2 gene) causes spinocerebellar ataxia type 2 (SCA2), a late-onset neurodegenerative disorder []. The expanded polyglutamine repeat in ataxin-2 causes disruption of the normal morphology of the Golgi complex and increased incidence of cell death []. Ataxin-2 is predicted to consist of mostly non-globular domains [].; PDB: 3NTW_B 1JH4_B 3KTR_B 3KUJ_B 3KUT_D 3KUS_D 1JGN_B 2RQG_A 2RQH_A.
Probab=42.92  E-value=11  Score=15.84  Aligned_cols=12  Identities=33%  Similarity=0.382  Sum_probs=8.1

Q ss_pred             cCCCCCEEecCc
Q psy585           80 VNDMPPVFNTSL   91 (111)
Q Consensus        80 ~Ndn~P~F~~~~   91 (111)
                      .|-|+|.|.+..
T Consensus         6 LNp~A~eFvP~~   17 (18)
T PF07145_consen    6 LNPNAPEFVPSS   17 (18)
T ss_dssp             SSTTSSSS-TTT
T ss_pred             cCCCCccccCCC
Confidence            577889887654


No 31 
>TIGR00845 caca sodium/calcium exchanger 1. This model is specific for the eukaryotic sodium ion/calcium ion exchangers of the Caca family
Probab=42.15  E-value=1.6e+02  Score=25.36  Aligned_cols=29  Identities=17%  Similarity=0.271  Sum_probs=20.1

Q ss_pred             EEEEEEEeeCCCCCCeecCCcEEEEEEccC
Q psy585           16 TTVVIHVKDVNDNPPVFERPTYRTQITEED   45 (111)
Q Consensus        16 ~~v~I~V~D~NDn~P~F~~~~y~~~v~E~~   45 (111)
                      ...+|+|.| ||++|.|.-..-...|.|+.
T Consensus       515 s~ATVTIlD-DD~aGIfsFe~~~~sV~Es~  543 (928)
T TIGR00845       515 NTATVTILD-DDHAGIFTFEEDVFHVSESI  543 (928)
T ss_pred             ceEEEEEec-CcccCcccccCceEEEEcCC
Confidence            356677777 88999876544456788864


No 32 
>cd05762 Ig8_MLCK Eighth immunoglobulin (Ig)-like domain of human myosin light-chain kinase (MLCK). Ig8_MLCK: the eighth immunoglobulin (Ig)-like domain of human myosin light-chain kinase (MLCK). MLCK is a key regulator of different forms of cell motility involving actin and myosin II.  Agonist stimulation of smooth muscle cells increases cytosolic Ca2+, which binds calmodulin.  This Ca2+-calmodulin complex in turn binds to and activates MLCK. Activated MLCK leads to the phosphorylation of the 20 kDa myosin regulatory light chain (RLC) of myosin II and the stimulation of actin-activated myosin MgATPase activity. MLCK is widely present in vertebrate tissues; it phosphorylates the 20 kDa RLC of both smooth and nonmuscle myosin II. Phosphorylation leads to the activation of the myosin motor domain and altered structural properties of myosin II. In smooth muscle MLCK it is involved in initiating contraction. In nonmuscle cells, MLCK may participate in cell division and cell motility; it has
Probab=39.17  E-value=82  Score=18.55  Aligned_cols=29  Identities=28%  Similarity=0.472  Sum_probs=19.0

Q ss_pred             eEEEEEEEEecCCccceEEEEEEEeecCCCCC
Q psy585           54 LQYELTLVASDSLNENKTTVVIHINDVNDMPP   85 (111)
Q Consensus        54 ~~y~l~v~a~d~~~~~~~~v~I~V~D~Ndn~P   85 (111)
                      ..|++.  |.+..+...+++.+.|.|.-+ ||
T Consensus        69 G~Ytc~--a~N~~G~~~~~~~l~V~~~P~-pP   97 (98)
T cd05762          69 GCYTLE--VENKLGSRQAQVNLTVVDKPD-PP   97 (98)
T ss_pred             EEEEEE--EEcCCCceeEEEEEEEecCCC-CC
Confidence            445443  444445677889999998876 55


No 33 
>cd01757 PLAT_RAB6IP1 PLAT/LH2 domain present in RAB6 interacting protein 1 (Rab6IP1)_like family. PLAT/LH2 domains consists of an eight stranded beta-barrel. In RabIP1 this domain may participate in lipid-mediated modulation of Rab6IP1's function via it's generally proposed function of mediating interaction with lipids or membrane bound proteins.
Probab=37.18  E-value=1.1e+02  Score=19.23  Aligned_cols=42  Identities=12%  Similarity=0.109  Sum_probs=28.3

Q ss_pred             EEEEEEEEecC--CccceEEEEEEEeecCCCCCEEecC--cceEEE
Q psy585           55 QYELTLVASDS--LNENKTTVVIHINDVNDMPPVFNTS--LYPAIM   96 (111)
Q Consensus        55 ~y~l~v~a~d~--~~~~~~~v~I~V~D~Ndn~P~F~~~--~y~~~v   96 (111)
                      .|.++|.--+.  |.-.++.|.|.+...+...+.+...  .+.+.+
T Consensus         2 ~Y~VtV~TG~~~~gAGT~anV~i~L~G~~g~s~~~~L~~~~f~~~v   47 (114)
T cd01757           2 PYHVVIVPSKKLGGSMFTANPWICVSGELGETPPLQIPKNSLEMTF   47 (114)
T ss_pred             eEEEEEEeCCCCCCCCCcceEEEEEEEcCCCcCCEEecCCceEEEE
Confidence            48888877765  4457788999998888766655433  344443


No 34 
>PF08329 ChitinaseA_N:  Chitinase A, N-terminal domain;  InterPro: IPR013540 This domain is found in a number of bacterial chitinases and similar viral proteins. It is organised into a fibronectin III module domain-like fold, comprising only beta strands. Its function is not known, but it may be involved in interaction with the enzyme substrate, chitin [, ]. It is separated by a hinge region from the catalytic domain (IPR001223 from INTERPRO); this hinge region is probably mobile, allowing the N-terminal domain to have different relative positions in solution []. ; GO: 0004568 chitinase activity; PDB: 2WLY_A 1EDQ_A 2WM0_A 1X6N_A 1NH6_A 2WK2_A 1EHN_A 2WLZ_A 1EIB_A 1FFR_A ....
Probab=35.54  E-value=84  Score=20.37  Aligned_cols=30  Identities=10%  Similarity=0.207  Sum_probs=20.1

Q ss_pred             EeEEEEEEEEecCCc-cceEEEEEEEeecCC
Q psy585           53 VLQYELTLVASDSLN-ENKTTVVIHINDVND   82 (111)
Q Consensus        53 ~~~y~l~v~a~d~~~-~~~~~v~I~V~D~Nd   82 (111)
                      ..+|+++|++.|..+ +.++.+.|.|.|-.-
T Consensus        82 gG~y~~~VeLCN~~GCS~S~~~~V~VaDTDG  112 (133)
T PF08329_consen   82 GGRYQMQVELCNADGCSTSAPVEVVVADTDG  112 (133)
T ss_dssp             -EEEEEEEEEEETTEEEE---EEEEEE-TTS
T ss_pred             CCEEEEEEEEECCCCcccCCCEEEEEeCCCc
Confidence            468999999998776 455678888888764


No 35 
>PF15418 DUF4625:  Domain of unknown function (DUF4625)
Probab=31.70  E-value=1.4e+02  Score=19.11  Aligned_cols=25  Identities=16%  Similarity=0.327  Sum_probs=16.4

Q ss_pred             EeEEEEEEEEecCCc-cceEEEEEEE
Q psy585           53 VLQYELTLVASDSLN-ENKTTVVIHI   77 (111)
Q Consensus        53 ~~~y~l~v~a~d~~~-~~~~~v~I~V   77 (111)
                      ...|.|.+.++|..+ .......|.|
T Consensus       106 ~G~YH~~i~VtD~~Gn~~~~~~~i~I  131 (132)
T PF15418_consen  106 AGDYHFMITVTDAAGNQTEEERSIKI  131 (132)
T ss_pred             CcceEEEEEEEECCCCEEEEEEEEEE
Confidence            357999999999765 3444444443


No 36 
>PF09100 Qn_am_d_aIV:  Quinohemoprotein amine dehydrogenase, alpha subunit domain IV;  InterPro: IPR015184 This domain is predominantly found in the prokaryotic protein quinohemoprotein amine dehydrogenase, adopting an immunoglobulin-like beta-sandwich fold, with seven strands arranged into two beta sheets; the fold is possibly related to the immunoglobulin and/or fibronectin type III superfamilies. The precise function of this domain has not, as yet, been defined []. ; PDB: 1JMZ_A 1JMX_A 1PBY_A 1JJU_A.
Probab=31.15  E-value=1.2e+02  Score=19.67  Aligned_cols=32  Identities=13%  Similarity=0.361  Sum_probs=16.4

Q ss_pred             eEEEEEEEEecCCc--cceEEEEEEEeecCCCCCE
Q psy585           54 LQYELTLVASDSLN--ENKTTVVIHINDVNDMPPV   86 (111)
Q Consensus        54 ~~y~l~v~a~d~~~--~~~~~v~I~V~D~Ndn~P~   86 (111)
                      ....+.-.+.|++.  ++.+.+.|+|.+-|+ +|.
T Consensus       100 Gnl~VvAtv~d~~~~l~~e~~liVtVqr~~~-ppi  133 (133)
T PF09100_consen  100 GNLKVVATVKDGGKPLTGEAHLIVTVQRWNN-PPI  133 (133)
T ss_dssp             EEEEEEEEETTTT---EEEEEEEEE---S----S-
T ss_pred             ccEEEEEEEccCCcccceeEeEEEEeecccC-CCC
Confidence            34455555566665  578889999999888 763


No 37 
>KOG4221|consensus
Probab=30.98  E-value=1.7e+02  Score=26.20  Aligned_cols=59  Identities=24%  Similarity=0.485  Sum_probs=35.7

Q ss_pred             CCCeecCC---cEEEEEEccCCCCC--------------CceEeEEEEEEEEecCCcc--ceEEEEE-EEeecCCCCCE
Q psy585           28 NPPVFERP---TYRTQITEEDDRTL--------------PKRVLQYELTLVASDSLNE--NKTTVVI-HINDVNDMPPV   86 (111)
Q Consensus        28 n~P~F~~~---~y~~~v~E~~~~~~--------------~e~~~~y~l~v~a~d~~~~--~~~~v~I-~V~D~Ndn~P~   86 (111)
                      .+|.|...   .|.....|++....              .++...|.+.+.|.+..++  +++.++| +..|+=+-||.
T Consensus       542 epP~~~n~~I~~yk~~ys~~~~~~~~~~~~n~~e~ti~gL~k~TeY~~~vvA~N~~G~g~sS~~i~V~Tlsd~PsaPP~  620 (1381)
T KOG4221|consen  542 EPPPFGNGPITGYKLFYSEDDTGKELRVENNATEYTINGLEKYTEYSIRVVAYNSAGSGVSSADITVRTLSDVPSAPPQ  620 (1381)
T ss_pred             cCCCCCCCCceEEEEEEEcCCCCceEEEecCccEEEeecCCCccceEEEEEEecCCCCCCCCCceEEEeccCCCCCCCc
Confidence            36776544   47777777732211              1566789999999997665  3444433 44555555554


No 38 
>PF05688 DUF824:  Salmonella repeat of unknown function (DUF824);  InterPro: IPR008542 This family consists of a series of repeated sequences (of around 180 residues) which are found in Salmonella typhimurium, Salmonella typhi and Escherichia coli. These repeats are almost always found with this entry. The repeats are associated with RatA and RatB, the coding sequences of which are found in the pathogeneicity island of Salmonella. The sequences may be determinants of pathogenicity [, ].
Probab=27.68  E-value=88  Score=16.46  Aligned_cols=16  Identities=19%  Similarity=0.482  Sum_probs=12.5

Q ss_pred             ceEEEEEEEeecCCCC
Q psy585           69 NKTTVVIHINDVNDMP   84 (111)
Q Consensus        69 ~~~~v~I~V~D~Ndn~   84 (111)
                      .+..++|++.|.|.||
T Consensus        13 e~I~ltVt~kda~G~p   28 (47)
T PF05688_consen   13 ETIPLTVTVKDANGNP   28 (47)
T ss_pred             CeEEEEEEEECCCCCC
Confidence            4678899999997754


No 39 
>PF05345 He_PIG:  Putative Ig domain;  InterPro: IPR008009 This alignment represents the conserved core region of a ~90 residue repeat found in several haemagglutinins and other cell surface proteins. Sequence similarities to Hyalin (IPR003410 from INTERPRO) and the PKD domain (IPR000601 from INTERPRO) suggest an Ig-like fold so this family may be similar in function to the (IPR003791 from INTERPRO) and (IPR003790 from INTERPRO) protein families.
Probab=25.95  E-value=91  Score=16.23  Aligned_cols=13  Identities=31%  Similarity=0.424  Sum_probs=10.9

Q ss_pred             eEEEEEEEEecCC
Q psy585           54 LQYELTLVASDSL   66 (111)
Q Consensus        54 ~~y~l~v~a~d~~   66 (111)
                      ..|.+.|.|+|+.
T Consensus        36 G~y~~~vtatd~~   48 (49)
T PF05345_consen   36 GTYTFTVTATDGS   48 (49)
T ss_pred             cEEEEEEEEEcCC
Confidence            5799999999874


No 40 
>cd02848 Chitinase_N_term Chitinase N-terminus domain. Chitinases hydrolyze the abundant natural biopolymer chitin, producing smaller chito-oligosaccharides. Chitin consists of multiple N-acetyl-D-glucosamine (NAG) residues connected via beta-1,4-glycosidic linkages and is an important structural element of fungal cell wall and arthropod exoskeletons. On the basis of the mode of chitin hydrolysis, chitinases are classified as random, endo-, and exo-chitinases and based on sequence criteria, chitinases belong to families 18 and 19 of glycosyl hydrolases.  The N-terminus of chitinase may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at  either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitob
Probab=25.86  E-value=1.1e+02  Score=19.04  Aligned_cols=27  Identities=11%  Similarity=0.172  Sum_probs=17.6

Q ss_pred             EeEEEEEEEEecCCc-cceEEEEEEEee
Q psy585           53 VLQYELTLVASDSLN-ENKTTVVIHIND   79 (111)
Q Consensus        53 ~~~y~l~v~a~d~~~-~~~~~v~I~V~D   79 (111)
                      ..+|.++|++-|..+ +.++.+.|.|-|
T Consensus        79 gG~y~m~V~lCn~dGCS~S~~~~I~VAD  106 (106)
T cd02848          79 GGRYQMQVALCNGDGCSTSAAKEIVVAD  106 (106)
T ss_pred             CCeEEEEEEEECCCCccCcCCEEEEecC
Confidence            357999999888765 334455565544


No 41 
>COG1470 Predicted membrane protein [Function unknown]
Probab=24.00  E-value=3.8e+02  Score=21.44  Aligned_cols=10  Identities=30%  Similarity=0.667  Sum_probs=5.6

Q ss_pred             CEEEEEEEeC
Q psy585            1 YELKLAASDN   10 (111)
Q Consensus         1 y~l~v~A~D~   10 (111)
                      |.+.|.|.-+
T Consensus       352 Ynv~I~A~s~  361 (513)
T COG1470         352 YNVTITASSS  361 (513)
T ss_pred             eeEEEEEecc
Confidence            5566666543


No 42 
>PF02369 Big_1:  Bacterial Ig-like domain (group 1);  InterPro: IPR003344 Proteins that contain this domain are found in a variety of bacterial and phage surface proteins such as intimins. Intimin is a bacterial cell-adhesion molecule that mediates the intimate bacterial host-cell interaction. It contains three domains; two immunoglobulin-like domains and a C-type lectin-like module implying that carbohydrate recognition may be important in intimin-mediated cell adhesion [].; PDB: 1CWV_A 4E9L_A 1F02_I 1F00_I.
Probab=20.73  E-value=89  Score=18.70  Aligned_cols=17  Identities=24%  Similarity=0.470  Sum_probs=12.6

Q ss_pred             cceEEEEEEEeecCCCC
Q psy585           68 ENKTTVVIHINDVNDMP   84 (111)
Q Consensus        68 ~~~~~v~I~V~D~Ndn~   84 (111)
                      ....++++.|.|.|.||
T Consensus        22 ~~~~tltatV~D~~gnp   38 (100)
T PF02369_consen   22 SDTNTLTATVTDANGNP   38 (100)
T ss_dssp             SS-EEEEEEEEETTSEB
T ss_pred             cCcEEEEEEEEcCCCCC
Confidence            46678888999988855


No 43 
>PF00801 PKD:  PKD domain;  InterPro: IPR000601 The PKD (Polycystic Kidney Disease) domain was first identified in the Polycystic Kidney Disease protein, polycystin-1 (PDK1 gene), and contains an Ig-like fold consisting of a beta-sandwich of seven strands in two sheets with a Greek key topology, although some members have additional strands []. Polycystin-1 is a large cell-surface glycoprotein involved in adhesive protein-protein and protein-carbohydrate interactions; however it is not clear if the PKD domain mediates any of these interactions.  PKD domains are also found in other proteins, usually in the extracellular parts of proteins involved in interactions with other proteins. For example, domains with a PKD-type fold are found in archaeal surface layer proteins that protect the cell from extreme environments [], and in the human VPS10 domain-containing receptor SorCS2 [].; PDB: 1B4R_A 2KZW_A 2C4X_A 2C26_A 2Y72_B 3JQU_A 3JS7_B 1WGO_A 1L0Q_A.
Probab=20.06  E-value=1.6e+02  Score=15.80  Aligned_cols=16  Identities=19%  Similarity=0.343  Sum_probs=11.7

Q ss_pred             EeEEEEEEEEecCCcc
Q psy585           53 VLQYELTLVASDSLNE   68 (111)
Q Consensus        53 ~~~y~l~v~a~d~~~~   68 (111)
                      ...|.+++.|+|..+.
T Consensus        51 ~G~y~V~ltv~n~~g~   66 (69)
T PF00801_consen   51 PGTYTVTLTVTNGVGS   66 (69)
T ss_dssp             SEEEEEEEEEEETTSE
T ss_pred             CeEEEEEEEEEECCCC
Confidence            4678888888887543


Done!