Query         psy7564
Match_columns 115
No_of_seqs    188 out of 1343
Neff          8.3 
Searched_HMMs 46136
Date          Fri Aug 16 21:38:33 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy7564.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/7564hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 cd00031 CA Cadherin repeat dom  99.9 1.9E-20 4.2E-25  129.1  14.2   96   12-114    44-141 (199)
  2 KOG1219|consensus               99.8 2.1E-20 4.5E-25  159.3  13.0   96   13-115  2625-2720(4289)
  3 KOG4289|consensus               99.8 4.4E-20 9.4E-25  153.3  11.0   95   13-114   317-413 (2531)
  4 KOG4289|consensus               99.8   2E-20 4.3E-25  155.3   6.8   96   12-114   211-308 (2531)
  5 KOG1219|consensus               99.7   2E-17 4.3E-22  141.6  10.7   96   13-115  1001-1098(4289)
  6 smart00112 CA Cadherin repeats  99.5 7.5E-14 1.6E-18   83.8   8.3   55   12-66     23-79  (79)
  7 KOG1834|consensus               99.3   3E-11 6.4E-16   95.9   9.8   92   15-114    86-189 (952)
  8 PF00028 Cadherin:  Cadherin do  98.9 9.4E-09   2E-13   63.2   7.8   48   12-59     43-93  (93)
  9 PF00028 Cadherin:  Cadherin do  98.5 3.1E-07 6.8E-12   56.3   5.6   35   73-114     1-35  (93)
 10 cd00031 CA Cadherin repeat dom  98.3 6.5E-06 1.4E-10   56.4   7.9   49   12-60    149-199 (199)
 11 smart00736 CADG Dystroglycan-t  94.8    0.37   8E-06   29.7   7.6   46   16-63     51-96  (97)
 12 PF08266 Cadherin_2:  Cadherin-  93.9   0.058 1.3E-06   32.9   2.4   22   12-33     46-67  (84)
 13 KOG1834|consensus               93.3    0.18   4E-06   41.5   5.0   47   12-59    195-243 (952)
 14 TIGR03660 T1SS_rpt_143 T1SS-14  88.2     5.4 0.00012   26.5   7.5   56   21-80     69-127 (137)
 15 PF05345 He_PIG:  Putative Ig d  87.5     2.4 5.2E-05   23.0   4.6   30   15-46     18-48  (49)
 16 PF12245 Big_3_2:  Bacterial Ig  84.2     5.5 0.00012   22.4   5.8   33   31-63     20-52  (60)
 17 PF08758 Cadherin_pro:  Cadheri  81.7     9.7 0.00021   23.4   6.4   35   13-49     48-82  (90)
 18 cd00146 PKD polycystic kidney   79.0     7.7 0.00017   22.4   4.8   29   29-57     52-80  (81)
 19 PF08758 Cadherin_pro:  Cadheri  77.9     3.1 6.6E-05   25.6   2.9   42   64-105     2-49  (90)
 20 PF13750 Big_3_3:  Bacterial Ig  76.2      21 0.00045   24.1   9.2   38   32-70     13-51  (158)
 21 PF13750 Big_3_3:  Bacterial Ig  74.8      13 0.00027   25.2   5.5   30   30-59    119-148 (158)
 22 smart00089 PKD Repeats in poly  73.4      14 0.00031   21.1   4.9   30   28-58     49-78  (79)
 23 KOG3597|consensus               69.4      30 0.00065   27.5   7.1   56   50-114    24-80  (442)
 24 PF02494 HYR:  HYR domain;  Int  69.4      16 0.00034   21.4   4.5   26   33-58     56-81  (81)
 25 PF07495 Y_Y_Y:  Y_Y_Y domain;   59.9      25 0.00055   19.3   4.0   27   32-58     37-65  (66)
 26 TIGR01965 VCBS_repeat VCBS rep  58.4      43 0.00094   21.0   6.2   40   35-80     58-97  (99)
 27 cd02848 Chitinase_N_term Chiti  56.8      14 0.00031   23.5   2.7   33   28-60     74-106 (106)
 28 PF08329 ChitinaseA_N:  Chitina  54.9      18 0.00039   23.9   3.1   36   28-63     77-112 (133)
 29 PF13754 Big_3_4:  Bacterial Ig  48.8      44 0.00095   18.1   4.5   29   20-50     12-40  (54)
 30 PF15418 DUF4625:  Domain of un  46.6      73  0.0016   20.9   5.0   26   33-58    106-131 (132)
 31 PF03160 Calx-beta:  Calx-beta   46.0      65  0.0014   19.3   6.3   25   55-81      2-26  (100)
 32 TIGR00845 caca sodium/calcium   35.3 2.5E+02  0.0053   24.8   7.5   29   52-81    515-543 (928)
 33 PRK08577 hypothetical protein;  35.3      99  0.0022   19.9   4.3   37   10-47     32-68  (136)
 34 PF05688 DUF824:  Salmonella re  31.5      70  0.0015   17.2   2.6   16   50-65     13-28  (47)
 35 COG1098 VacB Predicted RNA bin  23.0      26 0.00057   23.0  -0.1   33   73-115    42-74  (129)
 36 COG4288 Uncharacterized protei  21.9 2.3E+02   0.005   18.2   5.3   44   52-107    68-111 (124)
 37 PF13290 CHB_HEX_C_1:  Chitobia  21.2 1.1E+02  0.0024   17.5   2.3   16   96-115    13-28  (67)
 38 PF12100 DUF3576:  Domain of un  21.1 2.3E+02   0.005   17.9   5.6   43   14-56     23-66  (103)
 39 cd05775 Ig_SLAM-CD84_like_N N-  20.1 2.1E+02  0.0045   17.0   5.6   32   12-44     53-84  (97)

No 1  
>cd00031 CA Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here
Probab=99.86  E-value=1.9e-20  Score=129.11  Aligned_cols=96  Identities=41%  Similarity=0.553  Sum_probs=89.9

Q ss_pred             ceEEeCCceEEEEcCCCCccCcCEEEEEEEEEeCCCC--eEEEEEEEEEEeCCCCCCeEeccceEEEEecCCCCCcEEEe
Q psy7564          12 KPALHPPYGVLLTKKPLDYEATRIYNLSVTATNMAGA--KAVCSVIVHILDRNDNTPYFTQLNYTGYVSEAAPIGALVLT   89 (115)
Q Consensus        12 ~f~id~~tG~i~~~~~lD~E~~~~~~l~v~a~d~~~~--~~~~~v~I~V~D~Nd~~P~f~~~~y~~~v~E~~~~gt~v~~   89 (115)
                      .|.|++.+|.|++.+.||||....|.+.|.|.|.+..  .+...++|.|.|+||++|.|....|.+.+.|+.++|+.++ 
T Consensus        44 ~F~i~~~tG~l~~~~~lD~e~~~~~~l~v~a~D~g~~~~~~~~~v~I~V~d~Nd~~P~~~~~~~~~~v~e~~~~~~~i~-  122 (199)
T cd00031          44 LFSIDPNTGVITTTKPLDREEQSEYTLTVVASDGGGPPLSSTATVTVTVLDVNDNPPVFEQSSYEASVPENAPPGTVVG-  122 (199)
T ss_pred             cEEEeCCCCEEEECCCCCCcCCceEEEEEEEEECCcCcceeEEEEEEEEccCCCCCCcccccceEEEEeCCCCCCCEEE-
Confidence            4669999999999999999999999999999997544  3889999999999999999998899999999999999999 


Q ss_pred             CCCCCeEEEEEeCCCCCCCeEEEEE
Q psy7564          90 NTSLPLVIRAEDKDSETNALLDGST  114 (115)
Q Consensus        90 ~~~~~l~v~A~D~D~~~n~~i~y~i  114 (115)
                            ++.|+|+|.+.|+.+.|+|
T Consensus       123 ------~~~a~D~D~~~~~~~~y~l  141 (199)
T cd00031         123 ------TVTATDADSGENAKLTYSI  141 (199)
T ss_pred             ------EEEEEcCCCCCCccEEEEE
Confidence                  8999999999899999997


No 2  
>KOG1219|consensus
Probab=99.84  E-value=2.1e-20  Score=159.30  Aligned_cols=96  Identities=31%  Similarity=0.389  Sum_probs=91.9

Q ss_pred             eEEeCCceEEEEcCCCCccCcCEEEEEEEEEeCCCCeEEEEEEEEEEeCCCCCCeEeccceEEEEecCCCCCcEEEeCCC
Q psy7564          13 PALHPPYGVLLTKKPLDYEATRIYNLSVTATNMAGAKAVCSVIVHILDRNDNTPYFTQLNYTGYVSEAAPIGALVLTNTS   92 (115)
Q Consensus        13 f~id~~tG~i~~~~~lD~E~~~~~~l~v~a~d~~~~~~~~~v~I~V~D~Nd~~P~f~~~~y~~~v~E~~~~gt~v~~~~~   92 (115)
                      |++|++||.|.+.++||+|+.++|++.|.|++.+...+.+.|.|+|.|+|||+|.|....|.+.+.||++.|+.|.    
T Consensus      2625 Fsvdr~TG~i~v~ksLD~E~kk~yqi~v~a~~~~~vva~tsv~vqVkDvNDNaPvFe~d~y~f~i~En~pvGtsV~---- 2700 (4289)
T KOG1219|consen 2625 FSVDRNTGMIKVNKSLDHEKKKSYQIKVKATCGQWVVAETSVFVQVKDVNDNAPVFEKDPYLFIIEENSPVGTSVI---- 2700 (4289)
T ss_pred             eEEcCCCceEEeccccchhhhceEEEEEEeecCCceEEEEEEEEEeecccCCCccccCCceeEEEeccCCCCceEE----
Confidence            4599999999999999999999999999999976778899999999999999999999999999999999999999    


Q ss_pred             CCeEEEEEeCCCCCCCeEEEEEC
Q psy7564          93 LPLVIRAEDKDSETNALLDGSTN  115 (115)
Q Consensus        93 ~~l~v~A~D~D~~~n~~i~y~i~  115 (115)
                         +++|.|.|.+.||+|+|+|.
T Consensus      2701 ---qf~AsD~Ds~~nGqirysl~ 2720 (4289)
T KOG1219|consen 2701 ---QFHASDMDSGNNGQIRYSLT 2720 (4289)
T ss_pred             ---EEEeeccCCCCCceEEEEEc
Confidence               89999999999999999984


No 3  
>KOG4289|consensus
Probab=99.83  E-value=4.4e-20  Score=153.34  Aligned_cols=95  Identities=33%  Similarity=0.446  Sum_probs=89.8

Q ss_pred             eEEeCCceEEEEcCCCCccCcCEEEEEEEEEeCC--CCeEEEEEEEEEEeCCCCCCeEeccceEEEEecCCCCCcEEEeC
Q psy7564          13 PALHPPYGVLLTKKPLDYEATRIYNLSVTATNMA--GAKAVCSVIVHILDRNDNTPYFTQLNYTGYVSEAAPIGALVLTN   90 (115)
Q Consensus        13 f~id~~tG~i~~~~~lD~E~~~~~~l~v~a~d~~--~~~~~~~v~I~V~D~Nd~~P~f~~~~y~~~v~E~~~~gt~v~~~   90 (115)
                      |.||+.+|.|.+..+||||+...|+|.|.|+|.|  +...++.|.|+|.|+|||+|+|....|.+.|.|+..+++.|+  
T Consensus       317 f~in~rSGvI~T~a~lDRE~~~~y~L~VeAsDqG~~pgp~Ta~V~itV~D~NDNaPqFse~~Yvvqv~Edvt~~avvl--  394 (2531)
T KOG4289|consen  317 FEINPRSGVISTRAPLDREELESYQLDVEASDQGRPPGPRTAMVEITVEDENDNAPQFSEKRYVVQVREDVTPPAVVL--  394 (2531)
T ss_pred             eEEcCccceeeccCccCHHhhhheEEEEEeccCCCCCCCceEEEEEEEEecCCCCccccccceEEEecccCCCCceEE--
Confidence            4599999999999999999999999999999986  234489999999999999999999999999999999999999  


Q ss_pred             CCCCeEEEEEeCCCCCCCeEEEEE
Q psy7564          91 TSLPLVIRAEDKDSETNALLDGST  114 (115)
Q Consensus        91 ~~~~l~v~A~D~D~~~n~~i~y~i  114 (115)
                           +|.|+|.|.|.||.|+|+|
T Consensus       395 -----rV~AtDrD~g~Ng~VHYsi  413 (2531)
T KOG4289|consen  395 -----RVTATDRDKGTNGKVHYSI  413 (2531)
T ss_pred             -----EEEecccCCCcCceEEEEe
Confidence                 9999999999999999998


No 4  
>KOG4289|consensus
Probab=99.82  E-value=2e-20  Score=155.33  Aligned_cols=96  Identities=35%  Similarity=0.410  Sum_probs=91.9

Q ss_pred             ceEEeCCceEEEEcCCCCccCcCEEEEEEEEEeCC--CCeEEEEEEEEEEeCCCCCCeEeccceEEEEecCCCCCcEEEe
Q psy7564          12 KPALHPPYGVLLTKKPLDYEATRIYNLSVTATNMA--GAKAVCSVIVHILDRNDNTPYFTQLNYTGYVSEAAPIGALVLT   89 (115)
Q Consensus        12 ~f~id~~tG~i~~~~~lD~E~~~~~~l~v~a~d~~--~~~~~~~v~I~V~D~Nd~~P~f~~~~y~~~v~E~~~~gt~v~~   89 (115)
                      .|+||+.+|.|++++.||||....+.++|+|.|.+  .++++++|+|.|.|.|||.|.|.+..|..++.||.++|+.|. 
T Consensus       211 lFslD~~sG~irta~~lDREt~e~HvlrVtA~d~~~P~~SAtttv~V~V~D~nDhsPvFEq~~Y~e~lREn~evGy~vL-  289 (2531)
T KOG4289|consen  211 LFSLDPMSGAIRTAKSLDRETKETHVLRVTAQDHGDPRRSATTTVTVLVLDTNDHSPVFEQDEYREELRENLEVGYEVL-  289 (2531)
T ss_pred             cEeeccccccchhhhhhhhhhhheeEEEEEeeecCCCcccceeEEEEEEeecCCCCcccchhHHHHHHhhccccCceEE-
Confidence            36699999999999999999999999999999985  578899999999999999999999999999999999999999 


Q ss_pred             CCCCCeEEEEEeCCCCCCCeEEEEE
Q psy7564          90 NTSLPLVIRAEDKDSETNALLDGST  114 (115)
Q Consensus        90 ~~~~~l~v~A~D~D~~~n~~i~y~i  114 (115)
                            +|+|+|.|.|+|+.|.|++
T Consensus       290 ------tvrAtD~Dsp~Nani~Yrl  308 (2531)
T KOG4289|consen  290 ------TVRATDGDSPPNANIRYRL  308 (2531)
T ss_pred             ------EEEeccCCCCCCCceEEEe
Confidence                  8999999999999999986


No 5  
>KOG1219|consensus
Probab=99.73  E-value=2e-17  Score=141.57  Aligned_cols=96  Identities=31%  Similarity=0.411  Sum_probs=91.5

Q ss_pred             eEEeCCceEEEEcCCCCccCcCEEEEEEEEEeCC--CCeEEEEEEEEEEeCCCCCCeEeccceEEEEecCCCCCcEEEeC
Q psy7564          13 PALHPPYGVLLTKKPLDYEATRIYNLSVTATNMA--GAKAVCSVIVHILDRNDNTPYFTQLNYTGYVSEAAPIGALVLTN   90 (115)
Q Consensus        13 f~id~~tG~i~~~~~lD~E~~~~~~l~v~a~d~~--~~~~~~~v~I~V~D~Nd~~P~f~~~~y~~~v~E~~~~gt~v~~~   90 (115)
                      |+||..+|.|++.+.||||..+.|-+++.|+|.|  +.++.+.+.|.|.|+|||+|+|.++.|..+|.|+++.+..|.  
T Consensus      1001 FsId~~tG~irTl~~lDrE~ks~YwltveA~D~gt~~~ssv~~vyI~ieDvNDn~Pq~s~pvy~asI~enSp~~vsiv-- 1078 (4289)
T KOG1219|consen 1001 FSIDSTTGSIRTLKALDREKKSSYWLTVEAKDLGTVPLSSVCEVYIEIEDVNDNVPQFSSPVYYASISENSPETVSIV-- 1078 (4289)
T ss_pred             EEecCCcceEeechhhchhhcceEEEEEEEEecCCCccccceeEEEEEEecCCCCcccCCceEeeeeccCCCCceEEE--
Confidence            6699999999999999999999999999999997  567889999999999999999999999999999999999999  


Q ss_pred             CCCCeEEEEEeCCCCCCCeEEEEEC
Q psy7564          91 TSLPLVIRAEDKDSETNALLDGSTN  115 (115)
Q Consensus        91 ~~~~l~v~A~D~D~~~n~~i~y~i~  115 (115)
                           ++.|.|+|...|+++.|.|.
T Consensus      1079 -----q~ea~D~Dsssn~kLmykI~ 1098 (4289)
T KOG1219|consen 1079 -----QAEANDPDSSSNQKLMYKIT 1098 (4289)
T ss_pred             -----EeccCCCCcccCcceEEEEc
Confidence                 89999999989999999984


No 6  
>smart00112 CA Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
Probab=99.53  E-value=7.5e-14  Score=83.81  Aligned_cols=55  Identities=40%  Similarity=0.583  Sum_probs=51.3

Q ss_pred             ceEEeCCceEEEEcCCCCccCcCEEEEEEEEEeCCC--CeEEEEEEEEEEeCCCCCC
Q psy7564          12 KPALHPPYGVLLTKKPLDYEATRIYNLSVTATNMAG--AKAVCSVIVHILDRNDNTP   66 (115)
Q Consensus        12 ~f~id~~tG~i~~~~~lD~E~~~~~~l~v~a~d~~~--~~~~~~v~I~V~D~Nd~~P   66 (115)
                      .|.|++.+|.|++.++||||....|.|.|.|.|.++  +.+.+.|+|+|.|+|||+|
T Consensus        23 ~F~i~~~tg~i~~~~~LD~e~~~~y~l~v~a~D~~~~~~~~~~~v~I~V~D~Nd~~P   79 (79)
T smart00112       23 LFSIDPETGEITTTKPLDREEQPEYTLTVEATDGGGPPLSSTATVTVTVLDVNDNAP   79 (79)
T ss_pred             EEEEeCCccEEEeCCccCeeCCCeEEEEEEEEECCCCCcccEEEEEEEEEECCCCCC
Confidence            466999999999999999999999999999999875  7889999999999999998


No 7  
>KOG1834|consensus
Probab=99.28  E-value=3e-11  Score=95.92  Aligned_cols=92  Identities=25%  Similarity=0.338  Sum_probs=76.8

Q ss_pred             EeCCce--EEEEcCCCCccCcCEEEEEEEEEeCC--C------CeEEEEEEEEEEeCCCCCCeEeccceEEEEecCCCCC
Q psy7564          15 LHPPYG--VLLTKKPLDYEATRIYNLSVTATNMA--G------AKAVCSVIVHILDRNDNTPYFTQLNYTGYVSEAAPIG   84 (115)
Q Consensus        15 id~~tG--~i~~~~~lD~E~~~~~~l~v~a~d~~--~------~~~~~~v~I~V~D~Nd~~P~f~~~~y~~~v~E~~~~g   84 (115)
                      +|..||  .|+.+.+||.|.+..|+++|+|.|.|  +      .+..++|.|+|.|+|+++|.|..+.|.+.|.|.... 
T Consensus        86 vdK~TGegvlRaK~~lDCelqkeytf~iQAydCg~gpdgtn~kKShkatvhIrVkDvNe~AP~f~ep~Yka~V~EGK~y-  164 (952)
T KOG1834|consen   86 VDKYTGEGVLRAKEPLDCELQKEYTFTIQAYDCGNGPDGTNTKKSHKATVHIRVKDVNEFAPVFKEPWYKAHVTEGKVY-  164 (952)
T ss_pred             EeccCCceEEeecCcccccccccceEEEEEEecCCCCCccccccccceEEEEEeccccccCchhcccceeeEEecceee-
Confidence            677776  58888899999999999999999974  1      355789999999999999999999999999999644 


Q ss_pred             cEEEeCCCCCeEEEEEeCCCC-CCCeEE-EEE
Q psy7564          85 ALVLTNTSLPLVIRAEDKDSE-TNALLD-GST  114 (115)
Q Consensus        85 t~v~~~~~~~l~v~A~D~D~~-~n~~i~-y~i  114 (115)
                      ..|+       +|.|.|.|.+ ++++|- |.|
T Consensus       165 d~il-------~veAiD~DCspq~sqIC~YEI  189 (952)
T KOG1834|consen  165 DSIL-------RVEAIDKDCSPQYSQICEYEI  189 (952)
T ss_pred             eeeE-------EEEeecCCCCCcccceeEEEe
Confidence            4455       8999999966 777764 443


No 8  
>PF00028 Cadherin:  Cadherin domain;  InterPro: IPR002126 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion.; GO: 0005509 calcium ion binding, 0007156 homophilic cell adhesion, 0016020 membrane; PDB: 2A4E_A 2A4C_B 2O72_A 2QVI_A 1NCJ_A 3Q2W_A 3Q2N_A 3LNH_B 3LNI_A 3Q2L_A ....
Probab=98.92  E-value=9.4e-09  Score=63.20  Aligned_cols=48  Identities=31%  Similarity=0.476  Sum_probs=41.7

Q ss_pred             ceEEeCCceEEEEcCCCCccCcCEEEEEEEEEeC-C--CCeEEEEEEEEEE
Q psy7564          12 KPALHPPYGVLLTKKPLDYEATRIYNLSVTATNM-A--GAKAVCSVIVHIL   59 (115)
Q Consensus        12 ~f~id~~tG~i~~~~~lD~E~~~~~~l~v~a~d~-~--~~~~~~~v~I~V~   59 (115)
                      .|.|++.+|.|++.++||||....|.|.|.|+|. +  ++.+++.|+|+|.
T Consensus        43 ~F~I~~~tg~i~~~~~LD~E~~~~y~l~v~a~D~~~~~~~~~~~~V~I~V~   93 (93)
T PF00028_consen   43 LFSIDPNTGEISLKKPLDRETQSSYQLTVRATDSGGSPPLSSTATVTINVL   93 (93)
T ss_dssp             SEEEETTTTEEEESSSSCTTTTSEEEEEEEEEETTTSSEEEEEEEEEEEEE
T ss_pred             ceEEeeeeeccccceecCcccCCEEEEEEEEEECCCCCCCEEEEEEEEEEC
Confidence            3559999999999999999999999999999999 4  3567778888774


No 9  
>PF00028 Cadherin:  Cadherin domain;  InterPro: IPR002126 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion.; GO: 0005509 calcium ion binding, 0007156 homophilic cell adhesion, 0016020 membrane; PDB: 2A4E_A 2A4C_B 2O72_A 2QVI_A 1NCJ_A 3Q2W_A 3Q2N_A 3LNH_B 3LNI_A 3Q2L_A ....
Probab=98.52  E-value=3.1e-07  Score=56.33  Aligned_cols=35  Identities=37%  Similarity=0.405  Sum_probs=33.7

Q ss_pred             eEEEEecCCCCCcEEEeCCCCCeEEEEEeCCCCCCCeEEEEE
Q psy7564          73 YTGYVSEAAPIGALVLTNTSLPLVIRAEDKDSETNALLDGST  114 (115)
Q Consensus        73 y~~~v~E~~~~gt~v~~~~~~~l~v~A~D~D~~~n~~i~y~i  114 (115)
                      |.+.|+|++++|+.|+       ++.|.|+|.+.|+.+.|+|
T Consensus         1 Y~~~v~E~~~~g~~v~-------~v~a~D~D~~~n~~i~y~i   35 (93)
T PF00028_consen    1 YSFSVPENAPPGTVVG-------QVTATDPDSGPNSQITYSI   35 (93)
T ss_dssp             EEEEEETTGSTSSEEE-------EEEEEESSTSTTSSEEEEE
T ss_pred             CEEEEECCCCCCCEEE-------EEEEEeCCCCCCceEEEEE
Confidence            7899999999999999       8999999999999999997


No 10 
>cd00031 CA Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here
Probab=98.26  E-value=6.5e-06  Score=56.41  Aligned_cols=49  Identities=33%  Similarity=0.490  Sum_probs=44.3

Q ss_pred             ceEEeCCceEEEEcCCCCccCcCEEEEEEEEEeCC--CCeEEEEEEEEEEe
Q psy7564          12 KPALHPPYGVLLTKKPLDYEATRIYNLSVTATNMA--GAKAVCSVIVHILD   60 (115)
Q Consensus        12 ~f~id~~tG~i~~~~~lD~E~~~~~~l~v~a~d~~--~~~~~~~v~I~V~D   60 (115)
                      .|.|++.+|.|.+.+.||+|....|.+.|.|+|.+  ++.++..++|.|.|
T Consensus       149 ~f~i~~~~G~i~~~~~ld~e~~~~~~l~v~a~D~~~~~~~~~~~i~i~v~d  199 (199)
T cd00031         149 LFSIDPNTGIITLAKPLDREEKSSYELTVVATDGGGPPLSSTATVTVTVLD  199 (199)
T ss_pred             EEEEeCCceEEEeCCccCCccCceEEEEEEEEECCCCCceeEEEEEEEEEC
Confidence            46699999999999999999999999999999987  67888888888875


No 11 
>smart00736 CADG Dystroglycan-type cadherin-like domains. Cadherin-homologous domains present in metazoan dystroglycans and alpha/epsilon sarcoglycans, yeast Axl2p and in a very large protein from magnetotactic bacteria. Likely to bind calcium ions.
Probab=94.77  E-value=0.37  Score=29.66  Aligned_cols=46  Identities=28%  Similarity=0.403  Sum_probs=34.4

Q ss_pred             eCCceEEEEcCCCCccCcCEEEEEEEEEeCCCCeEEEEEEEEEEeCCC
Q psy7564          16 HPPYGVLLTKKPLDYEATRIYNLSVTATNMAGAKAVCSVIVHILDRND   63 (115)
Q Consensus        16 d~~tG~i~~~~~lD~E~~~~~~l~v~a~d~~~~~~~~~v~I~V~D~Nd   63 (115)
                      |+.++.+.=. +.. +....|.+.+.|+|..+......++|.|.+.|+
T Consensus        51 d~~~~~~~Gt-P~~-~~~g~~~i~v~a~D~~g~~~~~~f~i~V~~~~~   96 (97)
T smart00736       51 DSDTGTLSGT-PTN-SDVGSLSLKVTATDSSGASASDTFTITVVNTND   96 (97)
T ss_pred             eCCCCEEEEE-CCC-CCCcEEEEEEEEEECCCCEEEEEEEEEEeCCCC
Confidence            5556555442 333 335679999999998888888899999999887


No 12 
>PF08266 Cadherin_2:  Cadherin-like;  InterPro: IPR013164 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion. This entry represents a cadherin domain that is usually found at the N terminus of cadherin proteins.; PDB: 1WUZ_A 1WYJ_A.
Probab=93.86  E-value=0.058  Score=32.87  Aligned_cols=22  Identities=18%  Similarity=0.178  Sum_probs=16.9

Q ss_pred             ceEEeCCceEEEEcCCCCccCc
Q psy7564          12 KPALHPPYGVLLTKKPLDYEAT   33 (115)
Q Consensus        12 ~f~id~~tG~i~~~~~lD~E~~   33 (115)
                      .|.+++.+|.|++...+|||+-
T Consensus        46 ~~~v~~~tG~L~v~~rIDRE~L   67 (84)
T PF08266_consen   46 YFRVNEKTGDLFVSERIDREEL   67 (84)
T ss_dssp             SEEE-TTTSEEEESS--SCCCC
T ss_pred             eeEecCCceeEEeCCccCHHHH
Confidence            4779999999999999999973


No 13 
>KOG1834|consensus
Probab=93.29  E-value=0.18  Score=41.46  Aligned_cols=47  Identities=28%  Similarity=0.356  Sum_probs=38.8

Q ss_pred             ceEEeCCceEEEEcCCCCccCcCEEEEEEEEEeCCCCeE--EEEEEEEEE
Q psy7564          12 KPALHPPYGVLLTKKPLDYEATRIYNLSVTATNMAGAKA--VCSVIVHIL   59 (115)
Q Consensus        12 ~f~id~~tG~i~~~~~lD~E~~~~~~l~v~a~d~~~~~~--~~~v~I~V~   59 (115)
                      .|.||. .|.|+.+.+|.|.+...|.++|+|.|.|.+.+  ...|+|+|.
T Consensus       195 PFaIdn-~G~irnTekLny~ke~~Y~ltVtAyDCg~kraa~d~lV~v~Vk  243 (952)
T KOG1834|consen  195 PFAIDN-DGNIRNTEKLNYTKEHQYKLTVTAYDCGKKRAASDSLVTVHVK  243 (952)
T ss_pred             ceEEcC-CCccccccccccccceeEEEEEEEEecccccccCcceEEEEec
Confidence            355985 69999999999999999999999999986533  367777774


No 14 
>TIGR03660 T1SS_rpt_143 T1SS-143 repeat domain. This model represents a domain of about 143 amino acids that may occur singly or in up to 23 tandem repeats in very large proteins in the genus Vibrio, and in related species such as Legionella pneumophila, Photobacterium profundum, Rhodopseudomonas palustris, Shewanella pealeana, and Aeromonas hydrophila. Proteins with these domains represent a subset of a broader set of proteins with a particular signal for type 1 secretion, consisting of several glycine-rich repeats modeled by pfam00353, followed by a C-terminal domain modeled by TIGR03661. Proteins with this domain tend to share several properties with the RtxA (Repeats in Toxin) protein of Vibrio cholerae, including a large size often containing tandemly repeated domains and a C-terminal signal for type 1 secretion.
Probab=88.24  E-value=5.4  Score=26.49  Aligned_cols=56  Identities=29%  Similarity=0.408  Sum_probs=38.8

Q ss_pred             EEEEcCCCCccC---cCEEEEEEEEEeCCCCeEEEEEEEEEEeCCCCCCeEeccceEEEEecC
Q psy7564          21 VLLTKKPLDYEA---TRIYNLSVTATNMAGAKAVCSVIVHILDRNDNTPYFTQLNYTGYVSEA   80 (115)
Q Consensus        21 ~i~~~~~lD~E~---~~~~~l~v~a~d~~~~~~~~~v~I~V~D~Nd~~P~f~~~~y~~~v~E~   80 (115)
                      ...+.++||...   .-...+.|.|+|..+-.+...+.|+|.|  | .|...... ...|.|.
T Consensus        69 tftL~~~lDH~~g~d~l~l~~~v~a~D~DGD~s~~~l~VtI~D--D-~P~~~~~~-~~~V~E~  127 (137)
T TIGR03660        69 EFTLEGPLDHAAGSDELTLNFPIIATDFDGDTSSITLPVTIVD--D-VPTITDVD-ALTVDED  127 (137)
T ss_pred             EEEEcccccCCCCCceEEEeeeEEEEeCCCCccccEEEEEEEC--C-CCeecccc-ceEEecc
Confidence            567778898743   3356788999998766556688888877  5 47765433 3677774


No 15 
>PF05345 He_PIG:  Putative Ig domain;  InterPro: IPR008009 This alignment represents the conserved core region of a ~90 residue repeat found in several haemagglutinins and other cell surface proteins. Sequence similarities to Hyalin (IPR003410 from INTERPRO) and the PKD domain (IPR000601 from INTERPRO) suggest an Ig-like fold so this family may be similar in function to the (IPR003791 from INTERPRO) and (IPR003790 from INTERPRO) protein families.
Probab=87.50  E-value=2.4  Score=22.96  Aligned_cols=30  Identities=27%  Similarity=0.392  Sum_probs=22.4

Q ss_pred             EeCCceEEEEcCCCCcc-CcCEEEEEEEEEeCC
Q psy7564          15 LHPPYGVLLTKKPLDYE-ATRIYNLSVTATNMA   46 (115)
Q Consensus        15 id~~tG~i~~~~~lD~E-~~~~~~l~v~a~d~~   46 (115)
                      +|+.+|.|.-.-  +.. ....|.+.|.|+|..
T Consensus        18 ~d~~tG~isGtp--~~~~~~G~y~~~vtatd~~   48 (49)
T PF05345_consen   18 LDPSTGTISGTP--TSSVQPGTYTFTVTATDGS   48 (49)
T ss_pred             EeCCCCEEEeec--CCCccccEEEEEEEEEcCC
Confidence            799999987662  333 336899999999864


No 16 
>PF12245 Big_3_2:  Bacterial Ig-like domain (group 3);  InterPro: IPR022038  This family of proteins is found in bacteria. They have two conserved sequence motifs: AGN and GMT. 
Probab=84.23  E-value=5.5  Score=22.36  Aligned_cols=33  Identities=33%  Similarity=0.481  Sum_probs=25.3

Q ss_pred             cCcCEEEEEEEEEeCCCCeEEEEEEEEEEeCCC
Q psy7564          31 EATRIYNLSVTATNMAGAKAVCSVIVHILDRND   63 (115)
Q Consensus        31 E~~~~~~l~v~a~d~~~~~~~~~v~I~V~D~Nd   63 (115)
                      +....|.+.+.+.|..|..+.....+.+.|..-
T Consensus        20 ~~dg~yt~~v~a~D~AGN~~~~~~~~~i~d~~~   52 (60)
T PF12245_consen   20 DADGEYTLTVTATDKAGNTSSSTTQIVIVDNTA   52 (60)
T ss_pred             cCCccEEEEEEEEECCCCEEEeeeEEEEEcCCC
Confidence            346789999999999888777777666665543


No 17 
>PF08758 Cadherin_pro:  Cadherin prodomain like;  InterPro: IPR014868 Cadherins are a group of proteins that mediate calcium dependent cell-cell adhesion. They are activated through cleavage of a prosequence in the late Golgi. This protein corresponds to the folded region of the prosequence, and is termed the prodomain. The prodomain shows structural resemblance to the cadherin domain, but lacks all the features known to be important for cadherin-cadherin interactions []. ; GO: 0007155 cell adhesion, 0016021 integral to membrane; PDB: 1OP4_A.
Probab=81.74  E-value=9.7  Score=23.36  Aligned_cols=35  Identities=14%  Similarity=0.232  Sum_probs=23.7

Q ss_pred             eEEeCCceEEEEcCCCCccCcCEEEEEEEEEeCCCCe
Q psy7564          13 PALHPPYGVLLTKKPLDYEATRIYNLSVTATNMAGAK   49 (115)
Q Consensus        13 f~id~~tG~i~~~~~lD~E~~~~~~l~v~a~d~~~~~   49 (115)
                      |+|++ .|.|++++++...... -.+.|.|.|..++.
T Consensus        48 F~V~~-DGsVy~~r~v~l~~~~-~~F~V~a~D~~~~~   82 (90)
T PF08758_consen   48 FRVLE-DGSVYAKRPVQLSSEQ-RSFTVHAWDSQTQE   82 (90)
T ss_dssp             EEEET-TTEEEEES--S-SSS--EEEEEEEEETTTTE
T ss_pred             EEEcC-CCeEEEeeeEecCCCc-eEEEEEEECCCCCe
Confidence            66886 6999999988765433 36889999986664


No 18 
>cd00146 PKD polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.
Probab=78.96  E-value=7.7  Score=22.41  Aligned_cols=29  Identities=24%  Similarity=0.414  Sum_probs=20.8

Q ss_pred             CccCcCEEEEEEEEEeCCCCeEEEEEEEE
Q psy7564          29 DYEATRIYNLSVTATNMAGAKAVCSVIVH   57 (115)
Q Consensus        29 D~E~~~~~~l~v~a~d~~~~~~~~~v~I~   57 (115)
                      .|.+...|.+++.++|..+......+.|.
T Consensus        52 ~y~~~G~y~v~l~v~d~~g~~~~~~~~V~   80 (81)
T cd00146          52 TYTKPGTYTVTLTVTNAVGSSSTKTTTVV   80 (81)
T ss_pred             EcCCCcEEEEEEEEEeCCCCEEEEEEEEE
Confidence            35667889999999998666555555554


No 19 
>PF08758 Cadherin_pro:  Cadherin prodomain like;  InterPro: IPR014868 Cadherins are a group of proteins that mediate calcium dependent cell-cell adhesion. They are activated through cleavage of a prosequence in the late Golgi. This protein corresponds to the folded region of the prosequence, and is termed the prodomain. The prodomain shows structural resemblance to the cadherin domain, but lacks all the features known to be important for cadherin-cadherin interactions []. ; GO: 0007155 cell adhesion, 0016021 integral to membrane; PDB: 1OP4_A.
Probab=77.92  E-value=3.1  Score=25.60  Aligned_cols=42  Identities=21%  Similarity=0.206  Sum_probs=15.8

Q ss_pred             CCCeEeccceEEEEecCCCCCcEEEeCC------CCCeEEEEEeCCCC
Q psy7564          64 NTPYFTQLNYTGYVSEAAPIGALVLTNT------SLPLVIRAEDKDSE  105 (115)
Q Consensus        64 ~~P~f~~~~y~~~v~E~~~~gt~v~~~~------~~~l~v~A~D~D~~  105 (115)
                      ..|-|....|.+.|+.+-..|..|+..+      +..+.....|+|..
T Consensus         2 C~pGF~~~~~~~~Vp~~l~~g~~lg~V~f~dC~~~~~~~~~ssDpdF~   49 (90)
T PF08758_consen    2 CRPGFSQKKYTFEVPSNLEAGQPLGKVNFEDCTGRRRVIFESSDPDFR   49 (90)
T ss_dssp             ---B--S-EEEE----SS-SS--EEE---B--SS---EEEE---SEEE
T ss_pred             CcCCcccceEEEEcCchhhCCcEEEEEEeccCCCCCceEEecCCCCEE
Confidence            4688999999999999999999999511      22344455555544


No 20 
>PF13750 Big_3_3:  Bacterial Ig-like domain (group 3)
Probab=76.21  E-value=21  Score=24.09  Aligned_cols=38  Identities=26%  Similarity=0.379  Sum_probs=27.1

Q ss_pred             CcCEEEEEE-EEEeCCCCeEEEEEEEEEEeCCCCCCeEec
Q psy7564          32 ATRIYNLSV-TATNMAGAKAVCSVIVHILDRNDNTPYFTQ   70 (115)
Q Consensus        32 ~~~~~~l~v-~a~d~~~~~~~~~v~I~V~D~Nd~~P~f~~   70 (115)
                      ....|.+.+ .|.|..+...+..+..++. ++..+|....
T Consensus        13 ~dG~Y~l~~~~a~D~agN~~~~~~~~~~~-iD~T~Ptisi   51 (158)
T PF13750_consen   13 PDGSYTLTVVTATDAAGNTSTSTVSETFT-IDNTPPTISI   51 (158)
T ss_pred             CCccEEEEEEEEEecCCCEEEEEEeeEEE-EcCCCCEEEE
Confidence            357899999 7999877766665554444 4667888865


No 21 
>PF13750 Big_3_3:  Bacterial Ig-like domain (group 3)
Probab=74.76  E-value=13  Score=25.18  Aligned_cols=30  Identities=33%  Similarity=0.471  Sum_probs=25.9

Q ss_pred             ccCcCEEEEEEEEEeCCCCeEEEEEEEEEE
Q psy7564          30 YEATRIYNLSVTATNMAGAKAVCSVIVHIL   59 (115)
Q Consensus        30 ~E~~~~~~l~v~a~d~~~~~~~~~v~I~V~   59 (115)
                      .|....|.|+|.|+|..|..++..+.+...
T Consensus       119 le~~~~YtLtV~a~D~aGN~~~~si~F~y~  148 (158)
T PF13750_consen  119 LEADDSYTLTVSATDKAGNQSTKSISFSYM  148 (158)
T ss_pred             cCCCCeEEEEEEEEecCCCEEEEEEEEEEe
Confidence            467789999999999999988888888775


No 22 
>smart00089 PKD Repeats in polycystic kidney disease 1 (PKD1) and other proteins. Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.
Probab=73.41  E-value=14  Score=21.10  Aligned_cols=30  Identities=30%  Similarity=0.479  Sum_probs=21.5

Q ss_pred             CCccCcCEEEEEEEEEeCCCCeEEEEEEEEE
Q psy7564          28 LDYEATRIYNLSVTATNMAGAKAVCSVIVHI   58 (115)
Q Consensus        28 lD~E~~~~~~l~v~a~d~~~~~~~~~v~I~V   58 (115)
                      .-|+....|.+++.+.|..+ .++..++|.|
T Consensus        49 ~~y~~~G~y~v~l~v~n~~g-~~~~~~~i~v   78 (79)
T smart00089       49 HTYTKPGTYTVTLTVTNAVG-SASATVTVVV   78 (79)
T ss_pred             EEeCCCcEEEEEEEEEcCCC-cEEEEEEEEE
Confidence            34566788999999998766 5566666655


No 23 
>KOG3597|consensus
Probab=69.40  E-value=30  Score=27.49  Aligned_cols=56  Identities=16%  Similarity=0.083  Sum_probs=38.1

Q ss_pred             EEEEEEEEEEeCCCCCCeEeccceEEEEecCCC-CCcEEEeCCCCCeEEEEEeCCCCCCCeEEEEE
Q psy7564          50 AVCSVIVHILDRNDNTPYFTQLNYTGYVSEAAP-IGALVLTNTSLPLVIRAEDKDSETNALLDGST  114 (115)
Q Consensus        50 ~~~~v~I~V~D~Nd~~P~f~~~~y~~~v~E~~~-~gt~v~~~~~~~l~v~A~D~D~~~n~~i~y~i  114 (115)
                      .+....|.|..+||.+..+....+.+-+.|+.. +...-        .+.+.|+|... ..+.|++
T Consensus        24 ~~~~~~i~v~pvndpp~~~~~~~~~l~~~~~~~k~l~~~--------~l~~~d~d~~~-~~l~f~v   80 (442)
T KOG3597|consen   24 QTDVLRIHVNPVNDPPSLIFPSGSLLVILEGGQKVLDPE--------LLTAADPDSAP-LPLEFQV   80 (442)
T ss_pred             EEeeecccccccCCCcceeecccceEEeecCCceeccce--------EeeccCCCCCc-cceEEEE
Confidence            455678999999998877777777766667643 22222        58899999764 3455543


No 24 
>PF02494 HYR:  HYR domain;  InterPro: IPR003410 This domain is known as the HYR (Hyalin Repeat) domain, after the protein hyalin that is composed exclusively of this repeat. This domain probably corresponds to a new superfamily in the immunoglobulin fold. The function of this domain is uncertain it may be involved in cell adhesion. In the Sushi repeat-containing protein (SrpX), this domain is found between two sushi repeats.
Probab=69.37  E-value=16  Score=21.38  Aligned_cols=26  Identities=35%  Similarity=0.497  Sum_probs=19.4

Q ss_pred             cCEEEEEEEEEeCCCCeEEEEEEEEE
Q psy7564          33 TRIYNLSVTATNMAGAKAVCSVIVHI   58 (115)
Q Consensus        33 ~~~~~l~v~a~d~~~~~~~~~v~I~V   58 (115)
                      ...|.++..|+|..+..+++.+.|+|
T Consensus        56 ~G~t~V~ytA~D~~GN~a~C~f~V~V   81 (81)
T PF02494_consen   56 VGTTTVTYTATDAAGNSATCSFTVTV   81 (81)
T ss_pred             eceEEEEEEEEECCCCEEEEEEEEEC
Confidence            35677888888887777777777664


No 25 
>PF07495 Y_Y_Y:  Y_Y_Y domain;  InterPro: IPR011123 This region is mostly found at the end of the beta propellers (IPR011110 from INTERPRO) in a family of two component regulators. However they are also found tandemly repeated in Q891H4 from SWISSPROT without other signal conduction domains being present. It is named after the conserved tyrosines found in the alignment. The exact function is not known.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=59.86  E-value=25  Score=19.28  Aligned_cols=27  Identities=26%  Similarity=0.304  Sum_probs=17.1

Q ss_pred             CcCEEEEEEEEEeCCCCeE--EEEEEEEE
Q psy7564          32 ATRIYNLSVTATNMAGAKA--VCSVIVHI   58 (115)
Q Consensus        32 ~~~~~~l~v~a~d~~~~~~--~~~v~I~V   58 (115)
                      ....|+|.|.|.+..+...  ...+.|+|
T Consensus        37 ~~G~Y~l~V~a~~~~~~~~~~~~~l~i~I   65 (66)
T PF07495_consen   37 PPGKYTLEVRAKDNNGKWSSDEKSLTITI   65 (66)
T ss_dssp             -SEEEEEEEEEEETTS-B-SS-EEEEEEE
T ss_pred             CCEEEEEEEEEECCCCCcCcccEEEEEEE
Confidence            3578999999999865322  25555555


No 26 
>TIGR01965 VCBS_repeat VCBS repeat. This domain of about 100 residues is found multiple (up to 35) copies in long proteins from several species of Vibrio, Colwellia, Bradyrhizobium, and Shewanella (hence the name VCBS) and in smaller copy numbers in proteins from several other bacteria. The large protein size and repeat copy numbers, species distribution, and suggested activities of several member proteins suggests a role for this domain in adhesion.
Probab=58.39  E-value=43  Score=20.96  Aligned_cols=40  Identities=25%  Similarity=0.268  Sum_probs=28.0

Q ss_pred             EEEEEEEEEeCCCCeEEEEEEEEEEeCCCCCCeEeccceEEEEecC
Q psy7564          35 IYNLSVTATNMAGAKAVCSVIVHILDRNDNTPYFTQLNYTGYVSEA   80 (115)
Q Consensus        35 ~~~l~v~a~d~~~~~~~~~v~I~V~D~Nd~~P~f~~~~y~~~v~E~   80 (115)
                      .-.+++.+.|.    .+..|+|+|.-.|| +|..... -.+.+.|+
T Consensus        58 tdsFtvtv~DG----tt~~vtItI~GtND-apvi~~~-~~g~v~ED   97 (99)
T TIGR01965        58 TDTFTVTSADG----TSQTVTITITGAND-AAVIGGA-DTGSVTED   97 (99)
T ss_pred             EEEEEEEEeCC----CeEEEEEEEEccCC-CCEEecc-cceeEecC
Confidence            34567777773    37889999999999 7766542 44566665


No 27 
>cd02848 Chitinase_N_term Chitinase N-terminus domain. Chitinases hydrolyze the abundant natural biopolymer chitin, producing smaller chito-oligosaccharides. Chitin consists of multiple N-acetyl-D-glucosamine (NAG) residues connected via beta-1,4-glycosidic linkages and is an important structural element of fungal cell wall and arthropod exoskeletons. On the basis of the mode of chitin hydrolysis, chitinases are classified as random, endo-, and exo-chitinases and based on sequence criteria, chitinases belong to families 18 and 19 of glycosyl hydrolases.  The N-terminus of chitinase may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at  either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitob
Probab=56.82  E-value=14  Score=23.47  Aligned_cols=33  Identities=15%  Similarity=0.216  Sum_probs=23.4

Q ss_pred             CCccCcCEEEEEEEEEeCCCCeEEEEEEEEEEe
Q psy7564          28 LDYEATRIYNLSVTATNMAGAKAVCSVIVHILD   60 (115)
Q Consensus        28 lD~E~~~~~~l~v~a~d~~~~~~~~~v~I~V~D   60 (115)
                      +++.+-..|.++|.+.|..+.+.+..+.|.|-|
T Consensus        74 ~~v~kgG~y~m~V~lCn~dGCS~S~~~~I~VAD  106 (106)
T cd02848          74 FKVGKGGRYQMQVALCNGDGCSTSAAKEIVVAD  106 (106)
T ss_pred             EEeCCCCeEEEEEEEECCCCccCcCCEEEEecC
Confidence            445566788899988888776666666666644


No 28 
>PF08329 ChitinaseA_N:  Chitinase A, N-terminal domain;  InterPro: IPR013540 This domain is found in a number of bacterial chitinases and similar viral proteins. It is organised into a fibronectin III module domain-like fold, comprising only beta strands. Its function is not known, but it may be involved in interaction with the enzyme substrate, chitin [, ]. It is separated by a hinge region from the catalytic domain (IPR001223 from INTERPRO); this hinge region is probably mobile, allowing the N-terminal domain to have different relative positions in solution []. ; GO: 0004568 chitinase activity; PDB: 2WLY_A 1EDQ_A 2WM0_A 1X6N_A 1NH6_A 2WK2_A 1EHN_A 2WLZ_A 1EIB_A 1FFR_A ....
Probab=54.86  E-value=18  Score=23.94  Aligned_cols=36  Identities=19%  Similarity=0.282  Sum_probs=25.3

Q ss_pred             CCccCcCEEEEEEEEEeCCCCeEEEEEEEEEEeCCC
Q psy7564          28 LDYEATRIYNLSVTATNMAGAKAVCSVIVHILDRND   63 (115)
Q Consensus        28 lD~E~~~~~~l~v~a~d~~~~~~~~~v~I~V~D~Nd   63 (115)
                      +...+...|.++|++.+..|.+.+..+.|.|.|.+.
T Consensus        77 ~~~~~gG~y~~~VeLCN~~GCS~S~~~~V~VaDTDG  112 (133)
T PF08329_consen   77 FTVTKGGRYQMQVELCNADGCSTSAPVEVVVADTDG  112 (133)
T ss_dssp             EEE-S-EEEEEEEEEEETTEEEE---EEEEEE-TTS
T ss_pred             EEecCCCEEEEEEEEECCCCcccCCCEEEEEeCCCc
Confidence            555567889999999998888778899999998865


No 29 
>PF13754 Big_3_4:  Bacterial Ig-like domain (group 3)
Probab=48.79  E-value=44  Score=18.13  Aligned_cols=29  Identities=34%  Similarity=0.413  Sum_probs=19.5

Q ss_pred             eEEEEcCCCCccCcCEEEEEEEEEeCCCCeE
Q psy7564          20 GVLLTKKPLDYEATRIYNLSVTATNMAGAKA   50 (115)
Q Consensus        20 G~i~~~~~lD~E~~~~~~l~v~a~d~~~~~~   50 (115)
                      |.-...-+..  ....|.+++.|+|..|..+
T Consensus        12 G~Ws~t~~~~--~dG~y~itv~a~D~AGN~s   40 (54)
T PF13754_consen   12 GNWSFTVPAL--ADGTYTITVTATDAAGNTS   40 (54)
T ss_pred             CcEEEeCCCC--CCccEEEEEEEEeCCCCCC
Confidence            4444443333  5788999999999876544


No 30 
>PF15418 DUF4625:  Domain of unknown function (DUF4625)
Probab=46.55  E-value=73  Score=20.90  Aligned_cols=26  Identities=23%  Similarity=0.323  Sum_probs=19.4

Q ss_pred             cCEEEEEEEEEeCCCCeEEEEEEEEE
Q psy7564          33 TRIYNLSVTATNMAGAKAVCSVIVHI   58 (115)
Q Consensus        33 ~~~~~l~v~a~d~~~~~~~~~v~I~V   58 (115)
                      ...|++.+.++|..|........|.|
T Consensus       106 ~G~YH~~i~VtD~~Gn~~~~~~~i~I  131 (132)
T PF15418_consen  106 AGDYHFMITVTDAAGNQTEEERSIKI  131 (132)
T ss_pred             CcceEEEEEEEECCCCEEEEEEEEEE
Confidence            46788888888888877766666654


No 31 
>PF03160 Calx-beta:  Calx-beta domain;  InterPro: IPR003644 The calx-beta motif is present as a tandem repeat in the cytoplasmic domains of Calx Na-Ca exchangers, which are used to expel calcium from cells. This motif overlaps domains used for calcium binding and regulation. The calx-beta motif is also present in the cytoplasmic tail of mammalian integrin-beta4, which mediates the bi-directional transfer of signals across the plasma membrane, as well as in some cyanobacterial proteins. This motif contains a series of beta-strands and turns that form a self-contained beta-sheet [, ].; GO: 0007154 cell communication, 0016021 integral to membrane; PDB: 3H6A_B 3FSO_A 3FQ4_B 2DPK_A 2QVM_A 3GIN_B 2QVK_A 2FWU_A 2FWS_A 3E9U_A ....
Probab=45.99  E-value=65  Score=19.29  Aligned_cols=25  Identities=32%  Similarity=0.298  Sum_probs=15.3

Q ss_pred             EEEEEeCCCCCCeEeccceEEEEecCC
Q psy7564          55 IVHILDRNDNTPYFTQLNYTGYVSEAA   81 (115)
Q Consensus        55 ~I~V~D~Nd~~P~f~~~~y~~~v~E~~   81 (115)
                      +|.|.| ||.+ .+.-..-...+.|+.
T Consensus         2 tvtI~d-~d~~-~v~f~~~~~~v~E~~   26 (100)
T PF03160_consen    2 TVTILD-DDDP-TVSFSSPSYTVSEGD   26 (100)
T ss_dssp             EEEEE--TTSE-EEEESSSEEEEETTS
T ss_pred             EEEEEC-CCCC-EEEEeCCEEEEEeCC
Confidence            467777 6645 776555556677774


No 32 
>TIGR00845 caca sodium/calcium exchanger 1. This model is specific for the eukaryotic sodium ion/calcium ion exchangers of the Caca family
Probab=35.34  E-value=2.5e+02  Score=24.80  Aligned_cols=29  Identities=34%  Similarity=0.297  Sum_probs=19.6

Q ss_pred             EEEEEEEEeCCCCCCeEeccceEEEEecCC
Q psy7564          52 CSVIVHILDRNDNTPYFTQLNYTGYVSEAA   81 (115)
Q Consensus        52 ~~v~I~V~D~Nd~~P~f~~~~y~~~v~E~~   81 (115)
                      ...+|+|.| ||++|.|.-..-...|.|+.
T Consensus       515 s~ATVTIlD-DD~aGIfsFe~~~~sV~Es~  543 (928)
T TIGR00845       515 NTATVTILD-DDHAGIFTFEEDVFHVSESI  543 (928)
T ss_pred             ceEEEEEec-CcccCcccccCceEEEEcCC
Confidence            356777777 78899877544456677763


No 33 
>PRK08577 hypothetical protein; Provisional
Probab=35.27  E-value=99  Score=19.95  Aligned_cols=37  Identities=5%  Similarity=0.163  Sum_probs=27.4

Q ss_pred             ecceEEeCCceEEEEcCCCCccCcCEEEEEEEEEeCCC
Q psy7564          10 ILKPALHPPYGVLLTKKPLDYEATRIYNLSVTATNMAG   47 (115)
Q Consensus        10 ~~~f~id~~tG~i~~~~~lD~E~~~~~~l~v~a~d~~~   47 (115)
                      .+.|.++...|+|.+. ++-.+....|.+.+.+.|.++
T Consensus        32 ~~~~~~~~~~~~~~~~-~~~~~~k~~~~I~V~~~Dr~G   68 (136)
T PRK08577         32 YVLLIADTDKKEIHLE-PIALPGKKLVEIELVVEDRPG   68 (136)
T ss_pred             EEEEEEECCCCEEEEE-EcCCCCccEEEEEEEEcCCCC
Confidence            3456678888888884 555566678889999988765


No 34 
>PF05688 DUF824:  Salmonella repeat of unknown function (DUF824);  InterPro: IPR008542 This family consists of a series of repeated sequences (of around 180 residues) which are found in Salmonella typhimurium, Salmonella typhi and Escherichia coli. These repeats are almost always found with this entry. The repeats are associated with RatA and RatB, the coding sequences of which are found in the pathogeneicity island of Salmonella. The sequences may be determinants of pathogenicity [, ].
Probab=31.50  E-value=70  Score=17.20  Aligned_cols=16  Identities=25%  Similarity=0.345  Sum_probs=13.0

Q ss_pred             EEEEEEEEEEeCCCCC
Q psy7564          50 AVCSVIVHILDRNDNT   65 (115)
Q Consensus        50 ~~~~v~I~V~D~Nd~~   65 (115)
                      .+..++|++.|.|.+|
T Consensus        13 e~I~ltVt~kda~G~p   28 (47)
T PF05688_consen   13 ETIPLTVTVKDANGNP   28 (47)
T ss_pred             CeEEEEEEEECCCCCC
Confidence            3678999999998755


No 35 
>COG1098 VacB Predicted RNA binding protein (contains ribosomal protein S1 domain) [Translation, ribosomal structure and biogenesis]
Probab=23.05  E-value=26  Score=23.01  Aligned_cols=33  Identities=18%  Similarity=0.262  Sum_probs=23.1

Q ss_pred             eEEEEecCCCCCcEEEeCCCCCeEEEEEeCCCCCCCeEEEEEC
Q psy7564          73 YTGYVSEAAPIGALVLTNTSLPLVIRAEDKDSETNALLDGSTN  115 (115)
Q Consensus        73 y~~~v~E~~~~gt~v~~~~~~~l~v~A~D~D~~~n~~i~y~i~  115 (115)
                      |.-.|.+-...|..|.        +...|-|.  +|.+..|||
T Consensus        42 fVkdI~d~L~vG~eV~--------vKVl~ide--~GKisLSIr   74 (129)
T COG1098          42 FVKDIHDHLKVGQEVK--------VKVLDIDE--NGKISLSIR   74 (129)
T ss_pred             hHHhHHHHhcCCCEEE--------EEEEeecc--CCCcceehH
Confidence            4444555567788885        77777775  788888875


No 36 
>COG4288 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=21.88  E-value=2.3e+02  Score=18.22  Aligned_cols=44  Identities=16%  Similarity=0.258  Sum_probs=23.8

Q ss_pred             EEEEEEEEeCCCCCCeEeccceEEEEecCCCCCcEEEeCCCCCeEEEEEeCCCCCC
Q psy7564          52 CSVIVHILDRNDNTPYFTQLNYTGYVSEAAPIGALVLTNTSLPLVIRAEDKDSETN  107 (115)
Q Consensus        52 ~~v~I~V~D~Nd~~P~f~~~~y~~~v~E~~~~gt~v~~~~~~~l~v~A~D~D~~~n  107 (115)
                      .++.+..-|.+|-.+.-....|..-+.-            +.|..++-+-.|.+.+
T Consensus        68 rt~hlRfndL~dpe~iP~d~~yasvies------------nvPvVvQ~tRLDsgq~  111 (124)
T COG4288          68 RTLHLRFNDLGDPEAIPKDTPYASVIES------------NVPVVVQLTRLDSGQA  111 (124)
T ss_pred             eeEEEEecccCCcccCCCCCchhhheec------------CCceEEEEEEeccCCc
Confidence            3566777777764443333334333332            3334577777887754


No 37 
>PF13290 CHB_HEX_C_1:  Chitobiase/beta-hexosaminidase C-terminal domain
Probab=21.24  E-value=1.1e+02  Score=17.45  Aligned_cols=16  Identities=25%  Similarity=0.320  Sum_probs=9.7

Q ss_pred             EEEEEeCCCCCCCeEEEEEC
Q psy7564          96 VIRAEDKDSETNALLDGSTN  115 (115)
Q Consensus        96 ~v~A~D~D~~~n~~i~y~i~  115 (115)
                      .+.+.++|    +.|+|.++
T Consensus        13 ~l~~~~~~----~~IyYT~D   28 (67)
T PF13290_consen   13 TLSTADPD----ATIYYTTD   28 (67)
T ss_pred             EEEeCCCC----CEEEEEcC
Confidence            45554433    57888764


No 38 
>PF12100 DUF3576:  Domain of unknown function (DUF3576);  InterPro: IPR021959  This presumed domain is functionally uncharacterised. This domain is found in bacteria. This domain is about 100 amino acids in length. This domain has a single completely conserved residue G that may be functionally important. 
Probab=21.12  E-value=2.3e+02  Score=17.92  Aligned_cols=43  Identities=19%  Similarity=0.101  Sum_probs=26.9

Q ss_pred             EEeCCceEEEEcC-CCCccCcCEEEEEEEEEeCCCCeEEEEEEE
Q psy7564          14 ALHPPYGVLLTKK-PLDYEATRIYNLSVTATNMAGAKAVCSVIV   56 (115)
Q Consensus        14 ~id~~tG~i~~~~-~lD~E~~~~~~l~v~a~d~~~~~~~~~v~I   56 (115)
                      +.|+..|.|.+-- ..---....|.++|...|..-......|.+
T Consensus        23 saD~~gGVI~TdWY~~p~~~~er~k~tv~Ild~~Lradal~V~v   66 (103)
T PF12100_consen   23 SADPFGGVIVTDWYSPPPGPNERFKATVYILDRALRADALRVSV   66 (103)
T ss_pred             hcCCCCCEEEeccccCCCCCCeeEEEEEEEECccccCCceEEEE
Confidence            4688899988873 222233466888888888654444444444


No 39 
>cd05775 Ig_SLAM-CD84_like_N N-terminal immunoglobulin (Ig)-like domain of the signaling lymphocyte activation molecule (SLAM) family, CD84_like. Ig_SLAM-CD84_like_N: The N-terminal immunoglobulin (Ig)-like domain of the signaling lymphocyte activation molecule (SLAM) family, CD84_like. The SLAM family is a group of immune-cell specific receptors that can regulate both adaptive and innate immune responses. Members of this group include proteins such as CD84, SLAM (CD150), Ly-9 (CD229), NTB-A (ly-108, SLAM6), 19A (CRACC), and SLAMF9. The genes coding for the SLAM family are nested on chromosome 1, in humans at 1q23, and in mice at 1H2. The SLAM family is a subset of the CD2 family, which also includes CD2 and CD58 located on chromosome 1 at 1p13 in humans. In mice, CD2 is located on chromosome 3, and there is no CD58 homolog. The SLAM family proteins are organized as an extracellular domain with either two or four Ig-like domains, a single transmembrane segment, and a cytoplasmic region 
Probab=20.11  E-value=2.1e+02  Score=17.01  Aligned_cols=32  Identities=16%  Similarity=0.159  Sum_probs=15.5

Q ss_pred             ceEEeCCceEEEEcCCCCccCcCEEEEEEEEEe
Q psy7564          12 KPALHPPYGVLLTKKPLDYEATRIYNLSVTATN   44 (115)
Q Consensus        12 ~f~id~~tG~i~~~~~lD~E~~~~~~l~v~a~d   44 (115)
                      ++.++..++.|++. .+-.+....|...+...+
T Consensus        53 R~~~~~~~~sL~I~-~~~~~DsG~Y~c~v~~~~   84 (97)
T cd05775          53 RVNFSQNDYSLQIS-NLKMEDAGSYRAEINTKN   84 (97)
T ss_pred             eEEecCCceeEEEC-CCchHHCEEEEEEEEcCC
Confidence            34455555555443 334445555655554433


Done!