Query         psy1047
Match_columns 127
No_of_seqs    127 out of 1374
Neff          8.3 
Searched_HMMs 46136
Date          Fri Aug 16 19:00:58 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy1047.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/1047hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 PF00028 Cadherin:  Cadherin do  99.9 5.7E-23 1.2E-27  130.3  14.6   92   11-124     1-93  (93)
  2 KOG4289|consensus               99.8   8E-20 1.7E-24  155.3  11.5  102    1-125   369-470 (2531)
  3 cd00031 CA Cadherin repeat dom  99.8 1.4E-17 3.1E-22  117.6  15.1  102    1-124    97-198 (199)
  4 KOG1219|consensus               99.8 7.5E-18 1.6E-22  147.5  11.8   97    1-123  2675-2771(4289)
  5 KOG4289|consensus               99.8   1E-17 2.2E-22  142.8  11.8  103    1-125   778-881 (2531)
  6 cd00031 CA Cadherin repeat dom  99.7 1.8E-16 3.8E-21  112.0  14.1   93   10-124     1-93  (199)
  7 KOG1219|consensus               99.7 4.9E-17 1.1E-21  142.5  11.1  103    1-125  1053-1155(4289)
  8 smart00112 CA Cadherin repeats  99.7 1.4E-15 3.1E-20   93.4  10.0   74   31-126     1-74  (79)
  9 KOG1834|consensus               99.0 4.3E-09 9.3E-14   85.8  10.6   98    2-124   145-243 (952)
 10 PF08266 Cadherin_2:  Cadherin-  97.7 3.2E-05 6.8E-10   48.4   3.0   63   10-94      2-66  (84)
 11 PF08758 Cadherin_pro:  Cadheri  97.6  0.0014   3E-08   41.5   9.3   81    3-113     3-83  (90)
 12 KOG1834|consensus               97.1  0.0072 1.6E-07   50.4  10.4   98    8-124    34-140 (952)
 13 smart00736 CADG Dystroglycan-t  96.0     0.2 4.4E-06   31.6  10.2   66   30-124    24-92  (97)
 14 PF05345 He_PIG:  Putative Ig d  93.2    0.64 1.4E-05   25.8   5.8   38   50-109    11-48  (49)
 15 PF07495 Y_Y_Y:  Y_Y_Y domain;   88.4     3.2 6.9E-05   23.7   8.0   59   40-124     8-66  (66)
 16 smart00089 PKD Repeats in poly  67.4      20 0.00044   20.9   4.8   29   91-123    50-78  (79)
 17 TIGR03660 T1SS_rpt_143 T1SS-14  60.8      46 0.00099   22.5   5.9   41   80-121    66-109 (137)
 18 TIGR01965 VCBS_repeat VCBS rep  59.9      43 0.00094   21.4   7.4   72   27-123     3-79  (99)
 19 PF13754 Big_3_4:  Bacterial Ig  59.7      27 0.00058   19.4   4.0   28   94-122    22-50  (54)
 20 cd00146 PKD polycystic kidney   59.4      22 0.00048   20.9   3.9   19   91-110    52-70  (81)
 21 PF00870 P53:  P53 DNA-binding   43.7      42 0.00092   24.1   3.8   29   98-127   157-185 (196)
 22 PF15418 DUF4625:  Domain of un  39.3      66  0.0014   21.6   4.0   24   98-121   108-131 (132)
 23 cd02848 Chitinase_N_term Chiti  38.5 1.1E+02  0.0024   19.9   5.3   32   89-121    73-104 (106)
 24 PF10794 DUF2606:  Protein of u  34.6 1.4E+02  0.0031   19.9   5.1   47   11-58     46-93  (131)
 25 cd04022 C2A_MCTP_PRT_plant C2   34.4 1.3E+02  0.0027   19.3   5.0   32    3-35     44-75  (127)
 26 cd04016 C2_Tollip C2 domain pr  30.3 1.6E+02  0.0034   19.2   6.4   54    3-61     46-107 (121)
 27 KOG4221|consensus               28.1   4E+02  0.0086   25.1   7.8   56   38-117   547-605 (1381)
 28 cd08692 C2B_Tac2-N C2 domain s  26.8   2E+02  0.0044   19.3   5.4   35    2-39     63-97  (135)
 29 PF14302 DUF4377:  Domain of un  24.1 1.8E+02  0.0038   17.6   5.5   36   88-124    40-77  (80)
 30 TIGR00845 caca sodium/calcium   22.1   6E+02   0.013   23.0  10.5   59    6-67    402-460 (928)
 31 KOG4680|consensus               20.5   3E+02  0.0065   18.9   4.6   35   11-46    108-142 (153)
 32 cd08389 C2A_Synaptotagmin-14_1  20.3 1.7E+02  0.0037   18.8   3.4   32    2-35     61-93  (124)

No 1  
>PF00028 Cadherin:  Cadherin domain;  InterPro: IPR002126 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion.; GO: 0005509 calcium ion binding, 0007156 homophilic cell adhesion, 0016020 membrane; PDB: 2A4E_A 2A4C_B 2O72_A 2QVI_A 1NCJ_A 3Q2W_A 3Q2N_A 3LNH_B 3LNI_A 3Q2L_A ....
Probab=99.91  E-value=5.7e-23  Score=130.33  Aligned_cols=92  Identities=42%  Similarity=0.613  Sum_probs=89.0

Q ss_pred             eEEEEeCCCCCCeEEEEEEEEeCCCCCCCCcEEEEEEeCCCCCcEEEeCCCCeEEEeeeccccCCCCCCceeeEEEccCC
Q psy1047          11 YQATVKENQPIGTLVITLKAADADPPGMNSKISYSVRGGDGLGYFTIDNEGPLNHFRIVPTKIDPGTKKKEYNIVVLKLL   90 (127)
Q Consensus        11 y~~~v~E~~~~g~~v~~~~a~D~D~~g~n~~i~y~i~~~~~~~~F~i~~~tg~i~~~~~~~~~~~~~~~~~~~l~~~~~l   90 (127)
                      |.+.|+|+.++|+.++++.|.|+|. +.|+.++|+|..+++..+|.|++.+|.|                    .+.++|
T Consensus         1 Y~~~v~E~~~~g~~v~~v~a~D~D~-~~n~~i~y~i~~~~~~~~F~I~~~tg~i--------------------~~~~~L   59 (93)
T PF00028_consen    1 YSFSVPENAPPGTVVGQVTATDPDS-GPNSQITYSILGGNPDGLFSIDPNTGEI--------------------SLKKPL   59 (93)
T ss_dssp             EEEEEETTGSTSSEEEEEEEEESST-STTSSEEEEEEETTSTTSEEEETTTTEE--------------------EESSSS
T ss_pred             CEEEEECCCCCCCEEEEEEEEeCCC-CCCceEEEEEecCcccCceEEeeeeecc--------------------ccceec
Confidence            8899999999999999999999997 9999999999998889999999999999                    999999


Q ss_pred             CCCCCCCeEEEEEEEEEC-CCCCcceeEEEEEEeE
Q psy1047          91 DREIAPLGYNLTLRAVDK-GTPPRETYKATPRNGL  124 (127)
Q Consensus        91 d~E~~~~~y~l~v~a~D~-g~p~~~st~~i~i~~~  124 (127)
                      |||... .|.|.|.|+|. |.|+++++++++|+||
T Consensus        60 D~E~~~-~y~l~v~a~D~~~~~~~~~~~~V~I~V~   93 (93)
T PF00028_consen   60 DRETQS-SYQLTVRATDSGGSPPLSSTATVTINVL   93 (93)
T ss_dssp             CTTTTS-EEEEEEEEEETTTSSEEEEEEEEEEEEE
T ss_pred             CcccCC-EEEEEEEEEECCCCCCCEEEEEEEEEEC
Confidence            999999 99999999999 8999999999999997


No 2  
>KOG4289|consensus
Probab=99.82  E-value=8e-20  Score=155.32  Aligned_cols=102  Identities=35%  Similarity=0.484  Sum_probs=98.6

Q ss_pred             CCCCeeeCCeeEEEEeCCCCCCeEEEEEEEEeCCCCCCCCcEEEEEEeCCCCCcEEEeCCCCeEEEeeeccccCCCCCCc
Q psy1047           1 MNAPVFSDFVYQATVKENQPIGTLVITLKAADADPPGMNSKISYSVRGGDGLGYFTIDNEGPLNHFRIVPTKIDPGTKKK   80 (127)
Q Consensus         1 ~N~P~F~~~~y~~~v~E~~~~g~~v~~~~a~D~D~~g~n~~i~y~i~~~~~~~~F~i~~~tg~i~~~~~~~~~~~~~~~~   80 (127)
                      ||+|+|+...|.++|.|+..++..|++++|+|+|. |.|++|.|+|.+|+..+.|.||..||++                
T Consensus       369 DNaPqFse~~Yvvqv~Edvt~~avvlrV~AtDrD~-g~Ng~VHYsi~Sgn~~G~f~id~~tGel----------------  431 (2531)
T KOG4289|consen  369 DNAPQFSEKRYVVQVREDVTPPAVVLRVTATDRDK-GTNGKVHYSIASGNGRGQFYIDSLTGEL----------------  431 (2531)
T ss_pred             CCCccccccceEEEecccCCCCceEEEEEecccCC-CcCceEEEEeeccCccccEEEecccceE----------------
Confidence            79999999999999999999999999999999999 9999999999999999999999999999                


Q ss_pred             eeeEEEccCCCCCCCCCeEEEEEEEEECCCCCcceeEEEEEEeEe
Q psy1047          81 EYNIVVLKLLDREIAPLGYNLTLRAVDKGTPPRETYKATPRNGLT  125 (127)
Q Consensus        81 ~~~l~~~~~ld~E~~~~~y~l~v~a~D~g~p~~~st~~i~i~~~~  125 (127)
                          .+..+||+|..+  |.+.|+|+|+|.|+++.+..+.|+|+-
T Consensus       432 ----~vv~plD~e~~~--ytl~IrAqDggrPpLsn~sgl~iqVlD  470 (2531)
T KOG4289|consen  432 ----DVVEPLDFENSE--YTLRIRAQDGGRPPLSNTSGLVIQVLD  470 (2531)
T ss_pred             ----EEeccccccCCe--eEEEEEcccCCCCCccCCCceEEEEEe
Confidence                899999999995  999999999999999999999999874


No 3  
>cd00031 CA Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here
Probab=99.78  E-value=1.4e-17  Score=117.61  Aligned_cols=102  Identities=40%  Similarity=0.582  Sum_probs=95.2

Q ss_pred             CCCCeeeCCeeEEEEeCCCCCCeEEEEEEEEeCCCCCCCCcEEEEEEeCCCCCcEEEeCCCCeEEEeeeccccCCCCCCc
Q psy1047           1 MNAPVFSDFVYQATVKENQPIGTLVITLKAADADPPGMNSKISYSVRGGDGLGYFTIDNEGPLNHFRIVPTKIDPGTKKK   80 (127)
Q Consensus         1 ~N~P~F~~~~y~~~v~E~~~~g~~v~~~~a~D~D~~g~n~~i~y~i~~~~~~~~F~i~~~tg~i~~~~~~~~~~~~~~~~   80 (127)
                      +|+|+|....|.+.+.|+.++|..++++.|+|+|. +.++.++|+|..+....+|.|++.+|.|                
T Consensus        97 d~~P~~~~~~~~~~v~e~~~~~~~i~~~~a~D~D~-~~~~~~~y~l~~~~~~~~f~i~~~~G~i----------------  159 (199)
T cd00031          97 DNPPVFEQSSYEASVPENAPPGTVVGTVTATDADS-GENAKLTYSILSGNDKELFSIDPNTGII----------------  159 (199)
T ss_pred             CCCCcccccceEEEEeCCCCCCCEEEEEEEEcCCC-CCCccEEEEEeCCCCCCEEEEeCCceEE----------------
Confidence            58999998999999999999999999999999998 7899999999875555899999999999                


Q ss_pred             eeeEEEccCCCCCCCCCeEEEEEEEEECCCCCcceeEEEEEEeE
Q psy1047          81 EYNIVVLKLLDREIAPLGYNLTLRAVDKGTPPRETYKATPRNGL  124 (127)
Q Consensus        81 ~~~l~~~~~ld~E~~~~~y~l~v~a~D~g~p~~~st~~i~i~~~  124 (127)
                          .+.+.||+|... .|.+.|.|.|.+.|.++++..++|+++
T Consensus       160 ----~~~~~ld~e~~~-~~~l~v~a~D~~~~~~~~~~~i~i~v~  198 (199)
T cd00031         160 ----TLAKPLDREEKS-SYELTVVATDGGGPPLSSTATVTVTVL  198 (199)
T ss_pred             ----EeCCccCCccCc-eEEEEEEEEECCCCCceeEEEEEEEEE
Confidence                888899999999 999999999999999999999999986


No 4  
>KOG1219|consensus
Probab=99.75  E-value=7.5e-18  Score=147.47  Aligned_cols=97  Identities=36%  Similarity=0.491  Sum_probs=87.9

Q ss_pred             CCCCeeeCCeeEEEEeCCCCCCeEEEEEEEEeCCCCCCCCcEEEEEEeCCCCCcEEEeCCCCeEEEeeeccccCCCCCCc
Q psy1047           1 MNAPVFSDFVYQATVKENQPIGTLVITLKAADADPPGMNSKISYSVRGGDGLGYFTIDNEGPLNHFRIVPTKIDPGTKKK   80 (127)
Q Consensus         1 ~N~P~F~~~~y~~~v~E~~~~g~~v~~~~a~D~D~~g~n~~i~y~i~~~~~~~~F~i~~~tg~i~~~~~~~~~~~~~~~~   80 (127)
                      ||+|+|..+.|.+.+.||+++|+.|.+++|.|.|+ |.||+|+|++.+.  ..+|.|++++|+|                
T Consensus      2675 DNaPvFe~d~y~f~i~En~pvGtsV~qf~AsD~Ds-~~nGqirysl~~~--v~yF~In~etGwl---------------- 2735 (4289)
T KOG1219|consen 2675 DNAPVFEKDPYLFIIEENSPVGTSVIQFHASDMDS-GNNGQIRYSLTSP--VPYFAINPETGWL---------------- 2735 (4289)
T ss_pred             CCCccccCCceeEEEeccCCCCceEEEEEeeccCC-CCCceEEEEEcCC--cceEEEcCCCCee----------------
Confidence            79999999999999999999999999999999999 9999999999854  3499999999999                


Q ss_pred             eeeEEEccCCCCCCCCCeEEEEEEEEECCCCCcceeEEEEEEe
Q psy1047          81 EYNIVVLKLLDREIAPLGYNLTLRAVDKGTPPRETYKATPRNG  123 (127)
Q Consensus        81 ~~~l~~~~~ld~E~~~~~y~l~v~a~D~g~p~~~st~~i~i~~  123 (127)
                          .+.+.||+|+++ .|.|.|+|+|+|.|.  +++++.+++
T Consensus      2736 ----Tt~~eld~ek~d-~y~lkv~AtDhG~~s--sq~~v~v~v 2771 (4289)
T KOG1219|consen 2736 ----TTLFELDLEKQD-LYSLKVVATDHGVPS--SQATVLVHV 2771 (4289)
T ss_pred             ----eehhhhccccCC-ceEEEEEEecCCccc--ccceEEEEE
Confidence                889999999999 899999999999994  444454443


No 5  
>KOG4289|consensus
Probab=99.75  E-value=1e-17  Score=142.83  Aligned_cols=103  Identities=37%  Similarity=0.524  Sum_probs=97.0

Q ss_pred             CCCCeeeCCeeEEEEeCCCCCCeEEEEEEEEeCCCCCCCCcEEEEEEeCC-CCCcEEEeCCCCeEEEeeeccccCCCCCC
Q psy1047           1 MNAPVFSDFVYQATVKENQPIGTLVITLKAADADPPGMNSKISYSVRGGD-GLGYFTIDNEGPLNHFRIVPTKIDPGTKK   79 (127)
Q Consensus         1 ~N~P~F~~~~y~~~v~E~~~~g~~v~~~~a~D~D~~g~n~~i~y~i~~~~-~~~~F~i~~~tg~i~~~~~~~~~~~~~~~   79 (127)
                      +|+|+|..+.|.+.|.|+.|+++.++++.|+|+|. |+|+.+.|.+..++ ..+.|.|++.+|.|               
T Consensus       778 DnaPqf~assyt~sV~Ed~Pv~TsvlQVSatDaD~-g~Ng~v~y~~qg~~d~p~~F~IEptSGvi---------------  841 (2531)
T KOG4289|consen  778 DNAPQFLASSYTGSVFEDAPVFTSVLQVSATDADS-GPNGRVYYTFQGGDDGPGDFYIEPTSGVI---------------  841 (2531)
T ss_pred             ccCcccchhhceeEeecCCCCcceEEEEEEeccCC-CCCceEEEEecCCCCCCCceEEccCccee---------------
Confidence            69999999999999999999999999999999999 99999999886643 46899999999999               


Q ss_pred             ceeeEEEccCCCCCCCCCeEEEEEEEEECCCCCcceeEEEEEEeEe
Q psy1047          80 KEYNIVVLKLLDREIAPLGYNLTLRAVDKGTPPRETYKATPRNGLT  125 (127)
Q Consensus        80 ~~~~l~~~~~ld~E~~~~~y~l~v~a~D~g~p~~~st~~i~i~~~~  125 (127)
                           ++.+.||||... .|.|.+.|.|.|+|++++++.|+|-+|-
T Consensus       842 -----Rtl~rLdRE~~a-vy~L~a~avDrg~p~ls~~~eItvtvld  881 (2531)
T KOG4289|consen  842 -----RTLRRLDRENVA-VYVLAAYAVDRGNPPLSAPVEITVTVLD  881 (2531)
T ss_pred             -----ehhhhhcchhee-EEEEEEEEeeCCCCCcCCceEEEEEEEe
Confidence                 999999999999 9999999999999999999999998874


No 6  
>cd00031 CA Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here
Probab=99.73  E-value=1.8e-16  Score=112.03  Aligned_cols=93  Identities=39%  Similarity=0.572  Sum_probs=85.9

Q ss_pred             eeEEEEeCCCCCCeEEEEEEEEeCCCCCCCCcEEEEEEeCCCCCcEEEeCCCCeEEEeeeccccCCCCCCceeeEEEccC
Q psy1047          10 VYQATVKENQPIGTLVITLKAADADPPGMNSKISYSVRGGDGLGYFTIDNEGPLNHFRIVPTKIDPGTKKKEYNIVVLKL   89 (127)
Q Consensus        10 ~y~~~v~E~~~~g~~v~~~~a~D~D~~g~n~~i~y~i~~~~~~~~F~i~~~tg~i~~~~~~~~~~~~~~~~~~~l~~~~~   89 (127)
                      .|.+.++|+.++|+.+.++.|.|+|. +.+..++|+|.++....+|.|++.+|.|                    .+.+.
T Consensus         1 ~~~~~i~En~~~g~~v~~~~a~D~D~-~~~~~~~y~i~~~~~~~~F~i~~~tG~l--------------------~~~~~   59 (199)
T cd00031           1 SYSVSVPENAPPGTVVGTVSATDPDS-GENGRVTYSILGGNEDGLFSIDPNTGVI--------------------TTTKP   59 (199)
T ss_pred             CeEEEEeCCCCCCCEEEEEEEECCCC-CCCceEEEEEeCCCCcccEEEeCCCCEE--------------------EECCC
Confidence            37899999999999999999999998 7789999999876656899999999999                    89999


Q ss_pred             CCCCCCCCeEEEEEEEEECCCCCcceeEEEEEEeE
Q psy1047          90 LDREIAPLGYNLTLRAVDKGTPPRETYKATPRNGL  124 (127)
Q Consensus        90 ld~E~~~~~y~l~v~a~D~g~p~~~st~~i~i~~~  124 (127)
                      ||||... .|.|.|.|.|.|.|+++++..++|+++
T Consensus        60 lD~e~~~-~~~l~v~a~D~g~~~~~~~~~v~I~V~   93 (199)
T cd00031          60 LDREEQS-EYTLTVVASDGGGPPLSSTATVTVTVL   93 (199)
T ss_pred             CCCcCCc-eEEEEEEEEECCcCcceeEEEEEEEEc
Confidence            9999999 999999999999999889999999886


No 7  
>KOG1219|consensus
Probab=99.72  E-value=4.9e-17  Score=142.53  Aligned_cols=103  Identities=31%  Similarity=0.418  Sum_probs=100.0

Q ss_pred             CCCCeeeCCeeEEEEeCCCCCCeEEEEEEEEeCCCCCCCCcEEEEEEeCCCCCcEEEeCCCCeEEEeeeccccCCCCCCc
Q psy1047           1 MNAPVFSDFVYQATVKENQPIGTLVITLKAADADPPGMNSKISYSVRGGDGLGYFTIDNEGPLNHFRIVPTKIDPGTKKK   80 (127)
Q Consensus         1 ~N~P~F~~~~y~~~v~E~~~~g~~v~~~~a~D~D~~g~n~~i~y~i~~~~~~~~F~i~~~tg~i~~~~~~~~~~~~~~~~   80 (127)
                      ||+|+|.+..|..+|.|+++.+..|.++.|+|+|+ ..|+++.|.|.+|++.+.|.|++.||.|                
T Consensus      1053 Dn~Pq~s~pvy~asI~enSp~~vsivq~ea~D~Ds-ssn~kLmykI~sGnyq~FF~Id~~TG~i---------------- 1115 (4289)
T KOG1219|consen 1053 DNVPQFSSPVYYASISENSPETVSIVQAEANDPDS-SSNQKLMYKITSGNYQGFFQIDPETGLI---------------- 1115 (4289)
T ss_pred             CCCcccCCceEeeeeccCCCCceEEEEeccCCCCc-ccCcceEEEEccCCccceEEEcccccee----------------
Confidence            79999999999999999999999999999999998 7899999999999999999999999999                


Q ss_pred             eeeEEEccCCCCCCCCCeEEEEEEEEECCCCCcceeEEEEEEeEe
Q psy1047          81 EYNIVVLKLLDREIAPLGYNLTLRAVDKGTPPRETYKATPRNGLT  125 (127)
Q Consensus        81 ~~~l~~~~~ld~E~~~~~y~l~v~a~D~g~p~~~st~~i~i~~~~  125 (127)
                          .+.+.||||++. ++.|.|.+.|.|.|++.+...+.|++|.
T Consensus      1116 ----Tt~r~LDRE~qd-EHiLeVTi~D~gep~l~s~~rviV~Ild 1155 (4289)
T KOG1219|consen 1116 ----TTIRRLDREKQD-EHILEVTIQDNGEPWLCSNQRVIVSILD 1155 (4289)
T ss_pred             ----eeehhhcccccc-cceEEEEEecCCCCccccceEEEEEEee
Confidence                888999999999 9999999999999999999999999985


No 8  
>smart00112 CA Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
Probab=99.66  E-value=1.4e-15  Score=93.42  Aligned_cols=74  Identities=39%  Similarity=0.560  Sum_probs=67.3

Q ss_pred             EeCCCCCCCCcEEEEEEeCCCCCcEEEeCCCCeEEEeeeccccCCCCCCceeeEEEccCCCCCCCCCeEEEEEEEEECCC
Q psy1047          31 ADADPPGMNSKISYSVRGGDGLGYFTIDNEGPLNHFRIVPTKIDPGTKKKEYNIVVLKLLDREIAPLGYNLTLRAVDKGT  110 (127)
Q Consensus        31 ~D~D~~g~n~~i~y~i~~~~~~~~F~i~~~tg~i~~~~~~~~~~~~~~~~~~~l~~~~~ld~E~~~~~y~l~v~a~D~g~  110 (127)
                      +|+|. |.|+.++|+|.+++...+|.|++.+|.|                    .+.++||||... .|.|.|.|+|.|.
T Consensus         1 ~D~D~-g~n~~i~Y~i~~~~~~~~F~i~~~tg~i--------------------~~~~~LD~e~~~-~y~l~v~a~D~~~   58 (79)
T smart00112        1 TDADS-GENGKVTYSILSGNEDGLFSIDPETGEI--------------------TTTKPLDREEQP-EYTLTVEATDGGG   58 (79)
T ss_pred             CCCCC-CcCcEEEEEEecCCCCCEEEEeCCccEE--------------------EeCCccCeeCCC-eEEEEEEEEECCC
Confidence            48898 8899999999976666899999999988                    888899999999 9999999999999


Q ss_pred             CCcceeEEEEEEeEeC
Q psy1047         111 PPRETYKATPRNGLTC  126 (127)
Q Consensus       111 p~~~st~~i~i~~~~~  126 (127)
                      |+++++++++|+++-+
T Consensus        59 ~~~~~~~~v~I~V~D~   74 (79)
T smart00112       59 PPLSSTATVTVTVLDV   74 (79)
T ss_pred             CCcccEEEEEEEEEEC
Confidence            9999999999999753


No 9  
>KOG1834|consensus
Probab=99.00  E-value=4.3e-09  Score=85.79  Aligned_cols=98  Identities=26%  Similarity=0.252  Sum_probs=84.8

Q ss_pred             CCCeeeCCeeEEEEeCCCCCCeEEEEEEEEeCCCCCCCCcE-EEEEEeCCCCCcEEEeCCCCeEEEeeeccccCCCCCCc
Q psy1047           2 NAPVFSDFVYQATVKENQPIGTLVITLKAADADPPGMNSKI-SYSVRGGDGLGYFTIDNEGPLNHFRIVPTKIDPGTKKK   80 (127)
Q Consensus         2 N~P~F~~~~y~~~v~E~~~~g~~v~~~~a~D~D~~g~n~~i-~y~i~~~~~~~~F~i~~~tg~i~~~~~~~~~~~~~~~~   80 (127)
                      .||+|....|.+.|.|. ++...|+++.|.|.|-...+++| .|.|.  .++-.|.||.. |.|                
T Consensus       145 ~AP~f~ep~Yka~V~EG-K~yd~il~veAiD~DCspq~sqIC~YEI~--t~d~PFaIdn~-G~i----------------  204 (952)
T KOG1834|consen  145 FAPVFKEPWYKAHVTEG-KVYDSILRVEAIDKDCSPQYSQICEYEIT--TPDVPFAIDND-GNI----------------  204 (952)
T ss_pred             cCchhcccceeeEEecc-eeeeeeEEEEeecCCCCCcccceeEEEec--CCCCceEEcCC-Ccc----------------
Confidence            58999999999999998 66899999999999974577888 89997  46778999876 888                


Q ss_pred             eeeEEEccCCCCCCCCCeEEEEEEEEECCCCCcceeEEEEEEeE
Q psy1047          81 EYNIVVLKLLDREIAPLGYNLTLRAVDKGTPPRETYKATPRNGL  124 (127)
Q Consensus        81 ~~~l~~~~~ld~E~~~~~y~l~v~a~D~g~p~~~st~~i~i~~~  124 (127)
                          ..+..|+|.+.. +|.|+|+|.|-|.-...+-+-+.|++-
T Consensus       205 ----rnTekLny~ke~-~Y~ltVtAyDCg~kraa~d~lV~v~Vk  243 (952)
T KOG1834|consen  205 ----RNTEKLNYTKEH-QYKLTVTAYDCGKKRAASDSLVTVHVK  243 (952)
T ss_pred             ----ccccccccccce-eEEEEEEEEecccccccCcceEEEEec
Confidence                999999999999 999999999999877777666666653


No 10 
>PF08266 Cadherin_2:  Cadherin-like;  InterPro: IPR013164 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion. This entry represents a cadherin domain that is usually found at the N terminus of cadherin proteins.; PDB: 1WUZ_A 1WYJ_A.
Probab=97.72  E-value=3.2e-05  Score=48.36  Aligned_cols=63  Identities=22%  Similarity=0.287  Sum_probs=40.4

Q ss_pred             eeEEEEeCCCCCCeEEEEEEEEeCCCCCCC--CcEEEEEEeCCCCCcEEEeCCCCeEEEeeeccccCCCCCCceeeEEEc
Q psy1047          10 VYQATVKENQPIGTLVITLKAADADPPGMN--SKISYSVRGGDGLGYFTIDNEGPLNHFRIVPTKIDPGTKKKEYNIVVL   87 (127)
Q Consensus        10 ~y~~~v~E~~~~g~~v~~~~a~D~D~~g~n--~~i~y~i~~~~~~~~F~i~~~tg~i~~~~~~~~~~~~~~~~~~~l~~~   87 (127)
                      +....|+|..++|+.|+.+ |.|.-. ...  ....|++.+.....+|.++..+|.+                    .+.
T Consensus         2 qi~YsV~EE~~~Gt~IGni-a~dL~l-~~~~l~~~~~ri~s~~~~~~~~v~~~tG~L--------------------~v~   59 (84)
T PF08266_consen    2 QIRYSVPEEMPPGTVIGNI-AKDLGL-DPQSLSSRNFRIVSEGNSQYFRVNEKTGDL--------------------FVS   59 (84)
T ss_dssp             EEEEEEESS--TT-EEEEC-CCCCT---HHHHCCTTBEEE-SSSS-SEEE-TTTSEE--------------------EES
T ss_pred             CeEEEeecCCCCCCEEEEh-HHhhCC-CcccccccceEEeecCCcceeEecCCceeE--------------------EeC
Confidence            3567899999999999998 555432 110  1235676666678899999999999                    899


Q ss_pred             cCCCCCC
Q psy1047          88 KLLDREI   94 (127)
Q Consensus        88 ~~ld~E~   94 (127)
                      ..+|||.
T Consensus        60 ~rIDRE~   66 (84)
T PF08266_consen   60 ERIDREE   66 (84)
T ss_dssp             S--SCCC
T ss_pred             CccCHHH
Confidence            9999997


No 11 
>PF08758 Cadherin_pro:  Cadherin prodomain like;  InterPro: IPR014868 Cadherins are a group of proteins that mediate calcium dependent cell-cell adhesion. They are activated through cleavage of a prosequence in the late Golgi. This protein corresponds to the folded region of the prosequence, and is termed the prodomain. The prodomain shows structural resemblance to the cadherin domain, but lacks all the features known to be important for cadherin-cadherin interactions []. ; GO: 0007155 cell adhesion, 0016021 integral to membrane; PDB: 1OP4_A.
Probab=97.61  E-value=0.0014  Score=41.46  Aligned_cols=81  Identities=17%  Similarity=0.188  Sum_probs=44.2

Q ss_pred             CCeeeCCeeEEEEeCCCCCCeEEEEEEEEeCCCCCCCCcEEEEEEeCCCCCcEEEeCCCCeEEEeeeccccCCCCCCcee
Q psy1047           3 APVFSDFVYQATVKENQPIGTLVITLKAADADPPGMNSKISYSVRGGDGLGYFTIDNEGPLNHFRIVPTKIDPGTKKKEY   82 (127)
Q Consensus         3 ~P~F~~~~y~~~v~E~~~~g~~v~~~~a~D~D~~g~n~~i~y~i~~~~~~~~F~i~~~tg~i~~~~~~~~~~~~~~~~~~   82 (127)
                      .|-|.+..|.+.|+.+...|..|+++.-.|=..   +..+.|.-  +|+  .|.|..+ |.|                  
T Consensus         3 ~pGF~~~~~~~~Vp~~l~~g~~lg~V~f~dC~~---~~~~~~~s--sDp--dF~V~~D-GsV------------------   56 (90)
T PF08758_consen    3 RPGFSQKKYTFEVPSNLEAGQPLGKVNFEDCTG---RRRVIFES--SDP--DFRVLED-GSV------------------   56 (90)
T ss_dssp             --B--S-EEEE----SS-SS--EEE---B--SS------EEEE-----S--EEEEETT-TEE------------------
T ss_pred             cCCcccceEEEEcCchhhCCcEEEEEEeccCCC---CCceEEec--CCC--CEEEcCC-CeE------------------
Confidence            488999999999999999999999998877643   45677864  344  7999987 998                  


Q ss_pred             eEEEccCCCCCCCCCeEEEEEEEEECCCCCc
Q psy1047          83 NIVVLKLLDREIAPLGYNLTLRAVDKGTPPR  113 (127)
Q Consensus        83 ~l~~~~~ld~E~~~~~y~l~v~a~D~g~p~~  113 (127)
                        ++.+++......  -.|.|.|.|......
T Consensus        57 --y~~r~v~l~~~~--~~F~V~a~D~~~~~~   83 (90)
T PF08758_consen   57 --YAKRPVQLSSEQ--RSFTVHAWDSQTQEQ   83 (90)
T ss_dssp             --EEES--S-SSS---EEEEEEEEETTTTEE
T ss_pred             --EEeeeEecCCCc--eEEEEEEECCCCCeE
Confidence              899998876554  589999999766554


No 12 
>KOG1834|consensus
Probab=97.12  E-value=0.0072  Score=50.36  Aligned_cols=98  Identities=22%  Similarity=0.254  Sum_probs=66.2

Q ss_pred             CCeeEEEEeCCCCCCeEEEEEEEEeCCCC-CCCCcE-EEEEEeC-CCCCcEEEeCCCCeEEEeeeccccCCCCCCceeeE
Q psy1047           8 DFVYQATVKENQPIGTLVITLKAADADPP-GMNSKI-SYSVRGG-DGLGYFTIDNEGPLNHFRIVPTKIDPGTKKKEYNI   84 (127)
Q Consensus         8 ~~~y~~~v~E~~~~g~~v~~~~a~D~D~~-g~n~~i-~y~i~~~-~~~~~F~i~~~tg~i~~~~~~~~~~~~~~~~~~~l   84 (127)
                      ...|...|.||-..=...--+.|-|.|.+ ...+.| -|.|-+. -+-....+|+.||+                  +.|
T Consensus        34 e~ey~gvV~Endntvll~Ppl~aLdkdaplr~ageiC~fklhgq~vPFdavVvdK~TGe------------------gvl   95 (952)
T KOG1834|consen   34 EEEYHGVVTENDNTVLLDPPLAALDKDAPLRYAGEICGFKLHGQPVPFDAVVVDKYTGE------------------GVL   95 (952)
T ss_pred             ccceeEEEEeCCceEEeCCCeeeecCCCCcccccccceeEecCCCCCceEEEEeccCCc------------------eEE
Confidence            45799999998553333344677788851 123455 5666431 12344456666654                  456


Q ss_pred             EEccCCCCCCCCCeEEEEEEEEECCCCC------cceeEEEEEEeE
Q psy1047          85 VVLKLLDREIAPLGYNLTLRAVDKGTPP------RETYKATPRNGL  124 (127)
Q Consensus        85 ~~~~~ld~E~~~~~y~l~v~a~D~g~p~------~~st~~i~i~~~  124 (127)
                      +...+||-|.++ +|+|+|+|.|-|.-+      .+.-+++||+|-
T Consensus        96 RaK~~lDCelqk-eytf~iQAydCg~gpdgtn~kKShkatvhIrVk  140 (952)
T KOG1834|consen   96 RAKEPLDCELQK-EYTFTIQAYDCGNGPDGTNTKKSHKATVHIRVK  140 (952)
T ss_pred             eecCcccccccc-cceEEEEEEecCCCCCccccccccceEEEEEec
Confidence            888999999999 999999999988644      556688888764


No 13 
>smart00736 CADG Dystroglycan-type cadherin-like domains. Cadherin-homologous domains present in metazoan dystroglycans and alpha/epsilon sarcoglycans, yeast Axl2p and in a very large protein from magnetotactic bacteria. Likely to bind calcium ions.
Probab=95.98  E-value=0.2  Score=31.57  Aligned_cols=66  Identities=15%  Similarity=0.156  Sum_probs=45.4

Q ss_pred             EEeCCCCCCCCcEEEEEEeCC---CCCcEEEeCCCCeEEEeeeccccCCCCCCceeeEEEccCCCCCCCCCeEEEEEEEE
Q psy1047          30 AADADPPGMNSKISYSVRGGD---GLGYFTIDNEGPLNHFRIVPTKIDPGTKKKEYNIVVLKLLDREIAPLGYNLTLRAV  106 (127)
Q Consensus        30 a~D~D~~g~n~~i~y~i~~~~---~~~~F~i~~~tg~i~~~~~~~~~~~~~~~~~~~l~~~~~ld~E~~~~~y~l~v~a~  106 (127)
                      ..|+|.    ..++|++...+   ...|++++..++.+                    .= .+.+.+ .. .|.+.|.|+
T Consensus        24 F~d~d~----~~lty~~~~~~~~~lP~Wl~fd~~~~~~--------------------~G-tP~~~~-~g-~~~i~v~a~   76 (97)
T smart00736       24 FTDADG----DTLTYSATLSDGSALPSWLSFDSDTGTL--------------------SG-TPTNSD-VG-SLSLKVTAT   76 (97)
T ss_pred             eECCCC----CeEEEEEEeCCCCCCCCeEEEeCCCCEE--------------------EE-ECCCCC-Cc-EEEEEEEEE
Confidence            456663    46889886433   25799999988876                    32 455444 46 799999999


Q ss_pred             ECCCCCcceeEEEEEEeE
Q psy1047         107 DKGTPPRETYKATPRNGL  124 (127)
Q Consensus       107 D~g~p~~~st~~i~i~~~  124 (127)
                      |...  .+....+.|+|-
T Consensus        77 D~~g--~~~~~~f~i~V~   92 (97)
T smart00736       77 DSSG--ASASDTFTITVV   92 (97)
T ss_pred             ECCC--CEEEEEEEEEEe
Confidence            9875  455566766653


No 14 
>PF05345 He_PIG:  Putative Ig domain;  InterPro: IPR008009 This alignment represents the conserved core region of a ~90 residue repeat found in several haemagglutinins and other cell surface proteins. Sequence similarities to Hyalin (IPR003410 from INTERPRO) and the PKD domain (IPR000601 from INTERPRO) suggest an Ig-like fold so this family may be similar in function to the (IPR003791 from INTERPRO) and (IPR003790 from INTERPRO) protein families.
Probab=93.21  E-value=0.64  Score=25.80  Aligned_cols=38  Identities=13%  Similarity=-0.004  Sum_probs=26.5

Q ss_pred             CCCCcEEEeCCCCeEEEeeeccccCCCCCCceeeEEEccCCCCCCCCCeEEEEEEEEECC
Q psy1047          50 DGLGYFTIDNEGPLNHFRIVPTKIDPGTKKKEYNIVVLKLLDREIAPLGYNLTLRAVDKG  109 (127)
Q Consensus        50 ~~~~~F~i~~~tg~i~~~~~~~~~~~~~~~~~~~l~~~~~ld~E~~~~~y~l~v~a~D~g  109 (127)
                      ....++.||+.+|.|                     .-.+-...... .|.+.|.|.|..
T Consensus        11 ~LP~gLs~d~~tG~i---------------------sGtp~~~~~~G-~y~~~vtatd~~   48 (49)
T PF05345_consen   11 GLPSGLSLDPSTGTI---------------------SGTPTSSVQPG-TYTFTVTATDGS   48 (49)
T ss_pred             CCCCcEEEeCCCCEE---------------------EeecCCCcccc-EEEEEEEEEcCC
Confidence            346789999999988                     22222222346 799999999964


No 15 
>PF07495 Y_Y_Y:  Y_Y_Y domain;  InterPro: IPR011123 This region is mostly found at the end of the beta propellers (IPR011110 from INTERPRO) in a family of two component regulators. However they are also found tandemly repeated in Q891H4 from SWISSPROT without other signal conduction domains being present. It is named after the conserved tyrosines found in the alignment. The exact function is not known.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=88.38  E-value=3.2  Score=23.66  Aligned_cols=59  Identities=17%  Similarity=0.089  Sum_probs=34.5

Q ss_pred             CcEEEEEEeCCCCCcEEEeCCCCeEEEeeeccccCCCCCCceeeEEEccCCCCCCCCCeEEEEEEEEECCCCCcceeEEE
Q psy1047          40 SKISYSVRGGDGLGYFTIDNEGPLNHFRIVPTKIDPGTKKKEYNIVVLKLLDREIAPLGYNLTLRAVDKGTPPRETYKAT  119 (127)
Q Consensus        40 ~~i~y~i~~~~~~~~F~i~~~tg~i~~~~~~~~~~~~~~~~~~~l~~~~~ld~E~~~~~y~l~v~a~D~g~p~~~st~~i  119 (127)
                      -..+|+|. |-...|..+...+-.+                     .-..   -... .|+|.|.|.|..........++
T Consensus         8 ~~Y~Y~l~-g~d~~W~~~~~~~~~~---------------------~~~~---L~~G-~Y~l~V~a~~~~~~~~~~~~~l   61 (66)
T PF07495_consen    8 IRYRYRLE-GFDDEWITLGSYSNSI---------------------SYTN---LPPG-KYTLEVRAKDNNGKWSSDEKSL   61 (66)
T ss_dssp             EEEEEEEE-TTESSEEEESSTS-EE---------------------EEES-----SE-EEEEEEEEEETTS-B-SS-EEE
T ss_pred             eEEEEEEE-CCCCeEEECCCCcEEE---------------------EEEe---CCCE-EEEEEEEEECCCCCcCcccEEE
Confidence            34566666 4556777776653244                     1122   2456 8999999999876655554667


Q ss_pred             EEEeE
Q psy1047         120 PRNGL  124 (127)
Q Consensus       120 ~i~~~  124 (127)
                      +|.|+
T Consensus        62 ~i~I~   66 (66)
T PF07495_consen   62 TITIL   66 (66)
T ss_dssp             EEEEE
T ss_pred             EEEEC
Confidence            77664


No 16 
>smart00089 PKD Repeats in polycystic kidney disease 1 (PKD1) and other proteins. Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.
Probab=67.38  E-value=20  Score=20.91  Aligned_cols=29  Identities=10%  Similarity=-0.007  Sum_probs=21.1

Q ss_pred             CCCCCCCeEEEEEEEEECCCCCcceeEEEEEEe
Q psy1047          91 DREIAPLGYNLTLRAVDKGTPPRETYKATPRNG  123 (127)
Q Consensus        91 d~E~~~~~y~l~v~a~D~g~p~~~st~~i~i~~  123 (127)
                      .|+... .|.+++.+.|...   +++.++.|.+
T Consensus        50 ~y~~~G-~y~v~l~v~n~~g---~~~~~~~i~v   78 (79)
T smart00089       50 TYTKPG-TYTVTLTVTNAVG---SASATVTVVV   78 (79)
T ss_pred             EeCCCc-EEEEEEEEEcCCC---cEEEEEEEEE
Confidence            456777 8999999999766   5555555544


No 17 
>TIGR03660 T1SS_rpt_143 T1SS-143 repeat domain. This model represents a domain of about 143 amino acids that may occur singly or in up to 23 tandem repeats in very large proteins in the genus Vibrio, and in related species such as Legionella pneumophila, Photobacterium profundum, Rhodopseudomonas palustris, Shewanella pealeana, and Aeromonas hydrophila. Proteins with these domains represent a subset of a broader set of proteins with a particular signal for type 1 secretion, consisting of several glycine-rich repeats modeled by pfam00353, followed by a C-terminal domain modeled by TIGR03661. Proteins with this domain tend to share several properties with the RtxA (Repeats in Toxin) protein of Vibrio cholerae, including a large size often containing tandemly repeated domains and a C-terminal signal for type 1 secretion.
Probab=60.85  E-value=46  Score=22.54  Aligned_cols=41  Identities=15%  Similarity=0.095  Sum_probs=27.5

Q ss_pred             ceeeEEEccCCCCCC---CCCeEEEEEEEEECCCCCcceeEEEEE
Q psy1047          80 KEYNIVVLKLLDREI---APLGYNLTLRAVDKGTPPRETYKATPR  121 (127)
Q Consensus        80 ~~~~l~~~~~ld~E~---~~~~y~l~v~a~D~g~p~~~st~~i~i  121 (127)
                      +.++..+..+||...   .- ...|.|.|+|...-..+.+.+|+|
T Consensus        66 GsYtftL~~~lDH~~g~d~l-~l~~~v~a~D~DGD~s~~~l~VtI  109 (137)
T TIGR03660        66 GSYEFTLEGPLDHAAGSDEL-TLNFPIIATDFDGDTSSITLPVTI  109 (137)
T ss_pred             ccEEEEEcccccCCCCCceE-EEeeeEEEEeCCCCccccEEEEEE
Confidence            457778888888733   33 577888999876655555444444


No 18 
>TIGR01965 VCBS_repeat VCBS repeat. This domain of about 100 residues is found multiple (up to 35) copies in long proteins from several species of Vibrio, Colwellia, Bradyrhizobium, and Shewanella (hence the name VCBS) and in smaller copy numbers in proteins from several other bacteria. The large protein size and repeat copy numbers, species distribution, and suggested activities of several member proteins suggests a role for this domain in adhesion.
Probab=59.92  E-value=43  Score=21.42  Aligned_cols=72  Identities=19%  Similarity=0.219  Sum_probs=38.3

Q ss_pred             EEEEEeCCCCCCCCcEEEEEEe-CCCCCcEEEeCCCCeEEEeeeccccCCCCCCceeeEE----EccCCCCCCCCCeEEE
Q psy1047          27 TLKAADADPPGMNSKISYSVRG-GDGLGYFTIDNEGPLNHFRIVPTKIDPGTKKKEYNIV----VLKLLDREIAPLGYNL  101 (127)
Q Consensus        27 ~~~a~D~D~~g~n~~i~y~i~~-~~~~~~F~i~~~tg~i~~~~~~~~~~~~~~~~~~~l~----~~~~ld~E~~~~~y~l  101 (127)
                      ++.++|+|. +..  ..+++.. ...-+.|.|+.. |.-+                +.|-    ..+.|..-..- .-.|
T Consensus         3 ~Lt~sD~D~-gd~--~~~s~~~~~g~yGtlti~~~-G~wt----------------Ytl~n~~~avq~L~~Ge~~-tdsF   61 (99)
T TIGR01965         3 QLTISDADA-GQA--HFIAQTDAAGQYGTFSIDAD-GQWT----------------YQADNSQTAVQALKAGETL-TDTF   61 (99)
T ss_pred             ceEEeCCCC-CCc--eEEecccccCCcEEEEECCC-CcEE----------------EEeCCCcHHHHhhcCCCEE-EEEE
Confidence            578899997 432  3454421 123456788763 5431                1110    11233322333 4678


Q ss_pred             EEEEEECCCCCcceeEEEEEEe
Q psy1047         102 TLRAVDKGTPPRETYKATPRNG  123 (127)
Q Consensus       102 ~v~a~D~g~p~~~st~~i~i~~  123 (127)
                      ++.+.|+    .+.+.+|+|++
T Consensus        62 tvtv~DG----tt~~vtItI~G   79 (99)
T TIGR01965        62 TVTSADG----TSQTVTITITG   79 (99)
T ss_pred             EEEEeCC----CeEEEEEEEEc
Confidence            8888885    26666777664


No 19 
>PF13754 Big_3_4:  Bacterial Ig-like domain (group 3)
Probab=59.68  E-value=27  Score=19.42  Aligned_cols=28  Identities=18%  Similarity=0.262  Sum_probs=19.6

Q ss_pred             CCCCeEEEEEEEEEC-CCCCcceeEEEEEE
Q psy1047          94 IAPLGYNLTLRAVDK-GTPPRETYKATPRN  122 (127)
Q Consensus        94 ~~~~~y~l~v~a~D~-g~p~~~st~~i~i~  122 (127)
                      ... .|.+++.|.|. |+....+...++|.
T Consensus        22 ~dG-~y~itv~a~D~AGN~s~~~~~~~tiD   50 (54)
T PF13754_consen   22 ADG-TYTITVTATDAAGNTSTSSSVTFTID   50 (54)
T ss_pred             CCc-cEEEEEEEEeCCCCCCCccceeEEEe
Confidence            355 79999999995 55555545556654


No 20 
>cd00146 PKD polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.
Probab=59.42  E-value=22  Score=20.86  Aligned_cols=19  Identities=16%  Similarity=0.134  Sum_probs=15.5

Q ss_pred             CCCCCCCeEEEEEEEEECCC
Q psy1047          91 DREIAPLGYNLTLRAVDKGT  110 (127)
Q Consensus        91 d~E~~~~~y~l~v~a~D~g~  110 (127)
                      .|.... .|.++|.+.|...
T Consensus        52 ~y~~~G-~y~v~l~v~d~~g   70 (81)
T cd00146          52 TYTKPG-TYTVTLTVTNAVG   70 (81)
T ss_pred             EcCCCc-EEEEEEEEEeCCC
Confidence            467778 8999999999753


No 21 
>PF00870 P53:  P53 DNA-binding domain;  InterPro: IPR011615 This domain is found in p53 transcription factors, where it is responsible for DNA-binding. These transcription factors play diverse roles in the regulation of cellular functions: the p53 tumour suppressor upregulates the expression of genes involved in cell cycle arrest and apoptosis []. The DNA-binding domain acts to clamp, or in the case of TonEBP, encircle the DNA target in order to stabilise the protein-DNA complex []. Protein interactions may also serve to stabilise the protein-DNA complex, for example in the STAT-1 dimer the SH2 (Src homology 2) domain in each monomer is coupled to the DNA-binding domain to increase stability []. The DNA-binding domain consists of a beta-sandwich formed of 9 strands in 2 sheets with a Greek-key topology. This structure is found in many transcription factors, often within the DNA-binding domain.; GO: 0044212 transcription regulatory region DNA binding; PDB: 3US2_A 3QYM_E 3QYN_A 3US0_C 3US1_D 2RMN_A 3Q06_B 2VUK_A 2H1L_M 1KZY_B ....
Probab=43.67  E-value=42  Score=24.11  Aligned_cols=29  Identities=14%  Similarity=0.083  Sum_probs=21.0

Q ss_pred             eEEEEEEEEECCCCCcceeEEEEEEeEeCC
Q psy1047          98 GYNLTLRAVDKGTPPRETYKATPRNGLTCP  127 (127)
Q Consensus        98 ~y~l~v~a~D~g~p~~~st~~i~i~~~~~~  127 (127)
                      ...+.++..|.+. ..-+...+.|+||+||
T Consensus       157 pi~~ifTLE~~~g-~vlGR~~~~vrvCaCP  185 (196)
T PF00870_consen  157 PILLIFTLEDSDG-QVLGRRSFEVRVCACP  185 (196)
T ss_dssp             -EEEEEEEEETTS-CEEEEEEEEEEEESSH
T ss_pred             eEEEEEEEECCCC-CEeeeeccCceEcCCC
Confidence            3566666677433 4458899999999998


No 22 
>PF15418 DUF4625:  Domain of unknown function (DUF4625)
Probab=39.30  E-value=66  Score=21.59  Aligned_cols=24  Identities=13%  Similarity=0.232  Sum_probs=16.6

Q ss_pred             eEEEEEEEEECCCCCcceeEEEEE
Q psy1047          98 GYNLTLRAVDKGTPPRETYKATPR  121 (127)
Q Consensus        98 ~y~l~v~a~D~g~p~~~st~~i~i  121 (127)
                      .|.|.+++.|...-...-...|.|
T Consensus       108 ~YH~~i~VtD~~Gn~~~~~~~i~I  131 (132)
T PF15418_consen  108 DYHFMITVTDAAGNQTEEERSIKI  131 (132)
T ss_pred             ceEEEEEEEECCCCEEEEEEEEEE
Confidence            799999999976555444444444


No 23 
>cd02848 Chitinase_N_term Chitinase N-terminus domain. Chitinases hydrolyze the abundant natural biopolymer chitin, producing smaller chito-oligosaccharides. Chitin consists of multiple N-acetyl-D-glucosamine (NAG) residues connected via beta-1,4-glycosidic linkages and is an important structural element of fungal cell wall and arthropod exoskeletons. On the basis of the mode of chitin hydrolysis, chitinases are classified as random, endo-, and exo-chitinases and based on sequence criteria, chitinases belong to families 18 and 19 of glycosyl hydrolases.  The N-terminus of chitinase may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at  either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitob
Probab=38.50  E-value=1.1e+02  Score=19.86  Aligned_cols=32  Identities=6%  Similarity=0.004  Sum_probs=23.5

Q ss_pred             CCCCCCCCCeEEEEEEEEECCCCCcceeEEEEE
Q psy1047          89 LLDREIAPLGYNLTLRAVDKGTPPRETYKATPR  121 (127)
Q Consensus        89 ~ld~E~~~~~y~l~v~a~D~g~p~~~st~~i~i  121 (127)
                      .+++.+.. .|.++|.+.|...-+.+....|.|
T Consensus        73 t~~v~kgG-~y~m~V~lCn~dGCS~S~~~~I~V  104 (106)
T cd02848          73 TFKVGKGG-RYQMQVALCNGDGCSTSAAKEIVV  104 (106)
T ss_pred             EEEeCCCC-eEEEEEEEECCCCccCcCCEEEEe
Confidence            34466677 899999999987766666665554


No 24 
>PF10794 DUF2606:  Protein of unknown function (DUF2606);  InterPro: IPR019730 This entry represents bacterial proteins with unknown function. 
Probab=34.60  E-value=1.4e+02  Score=19.93  Aligned_cols=47  Identities=32%  Similarity=0.404  Sum_probs=31.7

Q ss_pred             eEEEEeCCCCC-CeEEEEEEEEeCCCCCCCCcEEEEEEeCCCCCcEEEe
Q psy1047          11 YQATVKENQPI-GTLVITLKAADADPPGMNSKISYSVRGGDGLGYFTID   58 (127)
Q Consensus        11 y~~~v~E~~~~-g~~v~~~~a~D~D~~g~n~~i~y~i~~~~~~~~F~i~   58 (127)
                      ++++-.|.+|+ |..|..+.|.|.|. .++.++--+|...|..+.+..+
T Consensus        46 ~hVen~e~~pi~~~ev~lmKa~ds~~-qPs~eig~~IGKTD~~Gki~Wk   93 (131)
T PF10794_consen   46 FHVENAEGQPIKDFEVTLMKAADSDP-QPSKEIGISIGKTDEEGKIIWK   93 (131)
T ss_pred             EEEecCCCCcccceEEEEEeccccCC-CCchhhceeecccCCCCcEEEe
Confidence            44555566665 77888899989887 7777776666545555555444


No 25 
>cd04022 C2A_MCTP_PRT_plant C2 domain first repeat found in Multiple C2 domain and Transmembrane region Proteins (MCTP); plant subset. MCTPs are involved in Ca2+ signaling at the membrane.  Plant-MCTPs are composed of a variable N-terminal sequence, four C2 domains, two transmembrane regions (TMRs), and a short C-terminal sequence.  It is one of four protein classes that are anchored to membranes via a transmembrane region; the others being synaptotagmins, extended synaptotagmins, and ferlins. MCTPs are the only membrane-bound C2 domain proteins that contain two functional TMRs. MCTPs are unique in that they bind Ca2+ but not phospholipids. C2 domains fold into an 8-standed beta-sandwich that can adopt 2 structural arrangements: Type I and Type II, distinguished by a circular permutation involving their N- and C-terminal beta strands. Many C2 domains are Ca2+-dependent membrane-targeting modules that bind a wide variety of substances including bind phospholipids, inositol polyphosphates
Probab=34.35  E-value=1.3e+02  Score=19.27  Aligned_cols=32  Identities=16%  Similarity=0.310  Sum_probs=17.2

Q ss_pred             CCeeeCCeeEEEEeCCCCCCeEEEEEEEEeCCC
Q psy1047           3 APVFSDFVYQATVKENQPIGTLVITLKAADADP   35 (127)
Q Consensus         3 ~P~F~~~~y~~~v~E~~~~g~~v~~~~a~D~D~   35 (127)
                      .|+|.+ .+.+.+.+........+.+...|.|.
T Consensus        44 nP~Wne-~f~f~~~~~~~~~~~~l~~~V~d~~~   75 (127)
T cd04022          44 NPVWNE-KLVFNVSDPSRLSNLVLEVYVYNDRR   75 (127)
T ss_pred             CCccce-EEEEEccCHHHccCCeEEEEEeeCCC
Confidence            477743 56666654332233455566666554


No 26 
>cd04016 C2_Tollip C2 domain present in Toll-interacting protein (Tollip). Tollip is a part of the Interleukin-1 receptor (IL-1R) signaling pathway. Tollip is proposed to link serine/threonine kinase IRAK to IL-1Rs as well as inhibiting phosphorylation of IRAK. There is a single C2 domain present in Tollip. C2 domains fold into an 8-standed beta-sandwich that can adopt 2 structural arrangements: Type I and Type II, distinguished by a circular permutation involving their N- and C-terminal beta strands. Many C2 domains are Ca2+-dependent membrane-targeting modules that bind a wide variety of substances including bind phospholipids, inositol polyphosphates, and intracellular proteins.  Most C2 domain proteins are either signal transduction enzymes that contain a single C2 domain, such as protein kinase C, or membrane trafficking proteins which contain at least two C2 domains, such as synaptotagmin 1.  However, there are a few exceptions to this including RIM isoforms and some splice varian
Probab=30.30  E-value=1.6e+02  Score=19.16  Aligned_cols=54  Identities=15%  Similarity=0.203  Sum_probs=30.2

Q ss_pred             CCeeeCCeeEEEEeCCCCCCeEEEEEEEEeCCCCCCC---CcEEEEEE----eC-CCCCcEEEeCCC
Q psy1047           3 APVFSDFVYQATVKENQPIGTLVITLKAADADPPGMN---SKISYSVR----GG-DGLGYFTIDNEG   61 (127)
Q Consensus         3 ~P~F~~~~y~~~v~E~~~~g~~v~~~~a~D~D~~g~n---~~i~y~i~----~~-~~~~~F~i~~~t   61 (127)
                      .|+|. ..|.+.+.++.    ..+.+...|.|..+.+   |..+..+.    .| ....||.|.+..
T Consensus        46 nP~WN-e~F~f~v~~~~----~~l~~~V~d~d~~~~dd~iG~~~i~l~~~~~~g~~~~~W~~L~~~~  107 (121)
T cd04016          46 NPRWN-KTIQCTLPEGV----DSIYIEIFDERAFTMDERIAWTHITIPESVFNGETLDDWYSLSGKQ  107 (121)
T ss_pred             CCccC-eEEEEEecCCC----cEEEEEEEeCCCCcCCceEEEEEEECchhccCCCCccccEeCcCcc
Confidence            58884 47777776542    2356777777753333   22334332    22 246788887643


No 27 
>KOG4221|consensus
Probab=28.08  E-value=4e+02  Score=25.05  Aligned_cols=56  Identities=14%  Similarity=0.113  Sum_probs=38.3

Q ss_pred             CCCcE-EEEEEe--CCCCCcEEEeCCCCeEEEeeeccccCCCCCCceeeEEEccCCCCCCCCCeEEEEEEEEECCCCCcc
Q psy1047          38 MNSKI-SYSVRG--GDGLGYFTIDNEGPLNHFRIVPTKIDPGTKKKEYNIVVLKLLDREIAPLGYNLTLRAVDKGTPPRE  114 (127)
Q Consensus        38 ~n~~i-~y~i~~--~~~~~~F~i~~~tg~i~~~~~~~~~~~~~~~~~~~l~~~~~ld~E~~~~~y~l~v~a~D~g~p~~~  114 (127)
                      .|+.| .|++-.  ++...++.++.++-+.                    .+.   +.|... +|.+.|+|.+...+..+
T Consensus       547 ~n~~I~~yk~~ys~~~~~~~~~~~~n~~e~--------------------ti~---gL~k~T-eY~~~vvA~N~~G~g~s  602 (1381)
T KOG4221|consen  547 GNGPITGYKLFYSEDDTGKELRVENNATEY--------------------TIN---GLEKYT-EYSIRVVAYNSAGSGVS  602 (1381)
T ss_pred             CCCCceEEEEEEEcCCCCceEEEecCccEE--------------------Eee---cCCCcc-ceEEEEEEecCCCCCCC
Confidence            45555 566633  3467788888776665                    332   457777 89999999998777776


Q ss_pred             eeE
Q psy1047         115 TYK  117 (127)
Q Consensus       115 st~  117 (127)
                      +..
T Consensus       603 S~~  605 (1381)
T KOG4221|consen  603 SAD  605 (1381)
T ss_pred             CCc
Confidence            653


No 28 
>cd08692 C2B_Tac2-N C2 domain second repeat found in Tac2-N (Tandem C2 protein in Nucleus). Tac2-N contains two C2 domains and a short C-terminus including a WHXL motif, which are key in stabilizing transport vesicles to the plasma membrane by binding to a plasma membrane.  However unlike the usual carboxyl-terminal-type (C-type) tandem C2 proteins, it lacks a transmembrane domain, a Slp-homology domain, and a Munc13-1-interacting domain. Homology search analysis indicate that no known protein motifs are located in its N-terminus, making Tac2-N a novel class of Ca2+-independent, C-type tandem C2 proteins. The C2 domain was first identified in PKC. C2 domains fold into an 8-standed beta-sandwich that can adopt 2 structural arrangements: Type I and Type II, distinguished by a circular permutation involving their N- and C-terminal beta strands. Many C2 domains are Ca2+-dependent membrane-targeting modules that bind a wide variety of substances including bind phospholipids, inositol polypho
Probab=26.77  E-value=2e+02  Score=19.28  Aligned_cols=35  Identities=6%  Similarity=0.052  Sum_probs=23.1

Q ss_pred             CCCeeeCCeeEEEEeCCCCCCeEEEEEEEEeCCCCCCC
Q psy1047           2 NAPVFSDFVYQATVKENQPIGTLVITLKAADADPPGMN   39 (127)
Q Consensus         2 N~P~F~~~~y~~~v~E~~~~g~~v~~~~a~D~D~~g~n   39 (127)
                      |+|+|.. .+.|.|++...  ..-+.+...|.|..+.|
T Consensus        63 ~~P~fNE-sF~Fdv~~~~~--~v~l~v~v~d~~~~~~n   97 (135)
T cd08692          63 GQVKWGE-TMIFPVTQQEH--GIQFLIKLYSRSSVRRK   97 (135)
T ss_pred             CCceecc-eEEEeCCchhh--eeEEEEEEEeCCCCcCC
Confidence            6799944 77788876532  35566777788863344


No 29 
>PF14302 DUF4377:  Domain of unknown function (DUF4377)
Probab=24.07  E-value=1.8e+02  Score=17.60  Aligned_cols=36  Identities=17%  Similarity=0.274  Sum_probs=28.2

Q ss_pred             cCCCCCCCCCeEEEEEEEEECCCCCc--ceeEEEEEEeE
Q psy1047          88 KLLDREIAPLGYNLTLRAVDKGTPPR--ETYKATPRNGL  124 (127)
Q Consensus        88 ~~ld~E~~~~~y~l~v~a~D~g~p~~--~st~~i~i~~~  124 (127)
                      ..+++|..- .|.|.|......+|+.  ++..-+.++++
T Consensus        40 eGF~yE~Gy-~Y~L~Vk~~~~~nppaD~ss~~Y~L~~vi   77 (80)
T PF14302_consen   40 EGFEYEPGY-EYVLRVKRTPVANPPADASSYRYVLIKVI   77 (80)
T ss_pred             cCcCcCCCc-EEEEEEEEEECCCCCCCCCceeEEEEEEE
Confidence            677899888 8999999999888886  45555666654


No 30 
>TIGR00845 caca sodium/calcium exchanger 1. This model is specific for the eukaryotic sodium ion/calcium ion exchangers of the Caca family
Probab=22.14  E-value=6e+02  Score=23.05  Aligned_cols=59  Identities=17%  Similarity=0.066  Sum_probs=31.0

Q ss_pred             eeCCeeEEEEeCCCCCCeEEEEEEEEeCCCCCCCCcEEEEEEeCCCCCcEEEeCCCCeEEEe
Q psy1047           6 FSDFVYQATVKENQPIGTLVITLKAADADPPGMNSKISYSVRGGDGLGYFTIDNEGPLNHFR   67 (127)
Q Consensus         6 F~~~~y~~~v~E~~~~g~~v~~~~a~D~D~~g~n~~i~y~i~~~~~~~~F~i~~~tg~i~~~   67 (127)
                      |.-..-...|.|+  .|+.-..+.-...|. +..-.+.|+...|.....-......|.+++.
T Consensus       402 i~Fe~~~Y~V~En--~GtV~VtV~R~GGdl-~~tVsVdY~T~DGTA~AG~DY~~~sGTLtF~  460 (928)
T TIGR00845       402 IFFEPGHYTCLEN--CGTVALTVVRRGGDL-TNTVYVDYRTEDGTANAGSDYEFTEGTLVFK  460 (928)
T ss_pred             EEecCCeEEEeec--CcEEEEEEEEccCCC-CceEEEEEEccCCccCCCCCccccCceEEEC
Confidence            3333445568887  466666665544354 3335678887654432111112345766555


No 31 
>KOG4680|consensus
Probab=20.46  E-value=3e+02  Score=18.94  Aligned_cols=35  Identities=26%  Similarity=0.301  Sum_probs=25.4

Q ss_pred             eEEEEeCCCCCCeEEEEEEEEeCCCCCCCCcEEEEE
Q psy1047          11 YQATVKENQPIGTLVITLKAADADPPGMNSKISYSV   46 (127)
Q Consensus        11 y~~~v~E~~~~g~~v~~~~a~D~D~~g~n~~i~y~i   46 (127)
                      ....++...|+|+-+++++|-|.+. +.-.-|+++.
T Consensus       108 hsq~LPg~tPPG~Y~lkm~~~d~~~-~~LTCisfsf  142 (153)
T KOG4680|consen  108 HSQVLPGYTPPGSYVLKMTAYDAKG-KELTCISFSF  142 (153)
T ss_pred             eeEeccCcCCCceEEEEEEeecCCC-CEEEEEEEEE
Confidence            4566788899999999999999986 3333344443


No 32 
>cd08389 C2A_Synaptotagmin-14_16 C2A domain first repeat present in Synaptotagmins 14 and 16. Synaptotagmin 14 and 16 are membrane-trafficking proteins in specific tissues outside the brain.   Both of these contain C-terminal tandem C2 repeats, but only Synaptotagmin 14 has an N-terminal transmembrane domain and a putative fatty-acylation site. Previously all synaptotagmins were thought to be calcium sensors in the regulation of neurotransmitter release and hormone secretion, but it has been shown that not all of them bind calcium and this is indeed the case here.  Of the 17 identified synaptotagmins only 8 bind calcium (1-3, 5-7, 9, 10).  The function of the two C2 domains that bind calcium are: regulating the fusion step of synaptic vesicle exocytosis (C2A) and  binding to phosphatidyl-inositol-3,4,5-triphosphate (PIP3) in the absence of calcium ions and to phosphatidylinositol bisphosphate (PIP2) in their presence (C2B).  C2B also regulates also the recycling step of synaptic vesicle
Probab=20.27  E-value=1.7e+02  Score=18.77  Aligned_cols=32  Identities=13%  Similarity=0.458  Sum_probs=17.5

Q ss_pred             CCCeeeCCeeEEE-EeCCCCCCeEEEEEEEEeCCC
Q psy1047           2 NAPVFSDFVYQAT-VKENQPIGTLVITLKAADADP   35 (127)
Q Consensus         2 N~P~F~~~~y~~~-v~E~~~~g~~v~~~~a~D~D~   35 (127)
                      ..|+|.. .+.+. +++.. .....+.+...|.|.
T Consensus        61 ~nP~fnE-~F~f~~i~~~~-l~~~~L~~~V~~~~~   93 (124)
T cd08389          61 PNPVFNE-TFTFSRVEPEE-LNNMALRFRLYGVER   93 (124)
T ss_pred             CCCcccC-EEEECCCCHHH-hccCEEEEEEEECCC
Confidence            3577744 56665 55443 234445566666664


Done!