Query psy1047
Match_columns 127
No_of_seqs 127 out of 1374
Neff 8.3
Searched_HMMs 46136
Date Fri Aug 16 19:00:58 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy1047.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/1047hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF00028 Cadherin: Cadherin do 99.9 5.7E-23 1.2E-27 130.3 14.6 92 11-124 1-93 (93)
2 KOG4289|consensus 99.8 8E-20 1.7E-24 155.3 11.5 102 1-125 369-470 (2531)
3 cd00031 CA Cadherin repeat dom 99.8 1.4E-17 3.1E-22 117.6 15.1 102 1-124 97-198 (199)
4 KOG1219|consensus 99.8 7.5E-18 1.6E-22 147.5 11.8 97 1-123 2675-2771(4289)
5 KOG4289|consensus 99.8 1E-17 2.2E-22 142.8 11.8 103 1-125 778-881 (2531)
6 cd00031 CA Cadherin repeat dom 99.7 1.8E-16 3.8E-21 112.0 14.1 93 10-124 1-93 (199)
7 KOG1219|consensus 99.7 4.9E-17 1.1E-21 142.5 11.1 103 1-125 1053-1155(4289)
8 smart00112 CA Cadherin repeats 99.7 1.4E-15 3.1E-20 93.4 10.0 74 31-126 1-74 (79)
9 KOG1834|consensus 99.0 4.3E-09 9.3E-14 85.8 10.6 98 2-124 145-243 (952)
10 PF08266 Cadherin_2: Cadherin- 97.7 3.2E-05 6.8E-10 48.4 3.0 63 10-94 2-66 (84)
11 PF08758 Cadherin_pro: Cadheri 97.6 0.0014 3E-08 41.5 9.3 81 3-113 3-83 (90)
12 KOG1834|consensus 97.1 0.0072 1.6E-07 50.4 10.4 98 8-124 34-140 (952)
13 smart00736 CADG Dystroglycan-t 96.0 0.2 4.4E-06 31.6 10.2 66 30-124 24-92 (97)
14 PF05345 He_PIG: Putative Ig d 93.2 0.64 1.4E-05 25.8 5.8 38 50-109 11-48 (49)
15 PF07495 Y_Y_Y: Y_Y_Y domain; 88.4 3.2 6.9E-05 23.7 8.0 59 40-124 8-66 (66)
16 smart00089 PKD Repeats in poly 67.4 20 0.00044 20.9 4.8 29 91-123 50-78 (79)
17 TIGR03660 T1SS_rpt_143 T1SS-14 60.8 46 0.00099 22.5 5.9 41 80-121 66-109 (137)
18 TIGR01965 VCBS_repeat VCBS rep 59.9 43 0.00094 21.4 7.4 72 27-123 3-79 (99)
19 PF13754 Big_3_4: Bacterial Ig 59.7 27 0.00058 19.4 4.0 28 94-122 22-50 (54)
20 cd00146 PKD polycystic kidney 59.4 22 0.00048 20.9 3.9 19 91-110 52-70 (81)
21 PF00870 P53: P53 DNA-binding 43.7 42 0.00092 24.1 3.8 29 98-127 157-185 (196)
22 PF15418 DUF4625: Domain of un 39.3 66 0.0014 21.6 4.0 24 98-121 108-131 (132)
23 cd02848 Chitinase_N_term Chiti 38.5 1.1E+02 0.0024 19.9 5.3 32 89-121 73-104 (106)
24 PF10794 DUF2606: Protein of u 34.6 1.4E+02 0.0031 19.9 5.1 47 11-58 46-93 (131)
25 cd04022 C2A_MCTP_PRT_plant C2 34.4 1.3E+02 0.0027 19.3 5.0 32 3-35 44-75 (127)
26 cd04016 C2_Tollip C2 domain pr 30.3 1.6E+02 0.0034 19.2 6.4 54 3-61 46-107 (121)
27 KOG4221|consensus 28.1 4E+02 0.0086 25.1 7.8 56 38-117 547-605 (1381)
28 cd08692 C2B_Tac2-N C2 domain s 26.8 2E+02 0.0044 19.3 5.4 35 2-39 63-97 (135)
29 PF14302 DUF4377: Domain of un 24.1 1.8E+02 0.0038 17.6 5.5 36 88-124 40-77 (80)
30 TIGR00845 caca sodium/calcium 22.1 6E+02 0.013 23.0 10.5 59 6-67 402-460 (928)
31 KOG4680|consensus 20.5 3E+02 0.0065 18.9 4.6 35 11-46 108-142 (153)
32 cd08389 C2A_Synaptotagmin-14_1 20.3 1.7E+02 0.0037 18.8 3.4 32 2-35 61-93 (124)
No 1
>PF00028 Cadherin: Cadherin domain; InterPro: IPR002126 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion.; GO: 0005509 calcium ion binding, 0007156 homophilic cell adhesion, 0016020 membrane; PDB: 2A4E_A 2A4C_B 2O72_A 2QVI_A 1NCJ_A 3Q2W_A 3Q2N_A 3LNH_B 3LNI_A 3Q2L_A ....
Probab=99.91 E-value=5.7e-23 Score=130.33 Aligned_cols=92 Identities=42% Similarity=0.613 Sum_probs=89.0
Q ss_pred eEEEEeCCCCCCeEEEEEEEEeCCCCCCCCcEEEEEEeCCCCCcEEEeCCCCeEEEeeeccccCCCCCCceeeEEEccCC
Q psy1047 11 YQATVKENQPIGTLVITLKAADADPPGMNSKISYSVRGGDGLGYFTIDNEGPLNHFRIVPTKIDPGTKKKEYNIVVLKLL 90 (127)
Q Consensus 11 y~~~v~E~~~~g~~v~~~~a~D~D~~g~n~~i~y~i~~~~~~~~F~i~~~tg~i~~~~~~~~~~~~~~~~~~~l~~~~~l 90 (127)
|.+.|+|+.++|+.++++.|.|+|. +.|+.++|+|..+++..+|.|++.+|.| .+.++|
T Consensus 1 Y~~~v~E~~~~g~~v~~v~a~D~D~-~~n~~i~y~i~~~~~~~~F~I~~~tg~i--------------------~~~~~L 59 (93)
T PF00028_consen 1 YSFSVPENAPPGTVVGQVTATDPDS-GPNSQITYSILGGNPDGLFSIDPNTGEI--------------------SLKKPL 59 (93)
T ss_dssp EEEEEETTGSTSSEEEEEEEEESST-STTSSEEEEEEETTSTTSEEEETTTTEE--------------------EESSSS
T ss_pred CEEEEECCCCCCCEEEEEEEEeCCC-CCCceEEEEEecCcccCceEEeeeeecc--------------------ccceec
Confidence 8899999999999999999999997 9999999999998889999999999999 999999
Q ss_pred CCCCCCCeEEEEEEEEEC-CCCCcceeEEEEEEeE
Q psy1047 91 DREIAPLGYNLTLRAVDK-GTPPRETYKATPRNGL 124 (127)
Q Consensus 91 d~E~~~~~y~l~v~a~D~-g~p~~~st~~i~i~~~ 124 (127)
|||... .|.|.|.|+|. |.|+++++++++|+||
T Consensus 60 D~E~~~-~y~l~v~a~D~~~~~~~~~~~~V~I~V~ 93 (93)
T PF00028_consen 60 DRETQS-SYQLTVRATDSGGSPPLSSTATVTINVL 93 (93)
T ss_dssp CTTTTS-EEEEEEEEEETTTSSEEEEEEEEEEEEE
T ss_pred CcccCC-EEEEEEEEEECCCCCCCEEEEEEEEEEC
Confidence 999999 99999999999 8999999999999997
No 2
>KOG4289|consensus
Probab=99.82 E-value=8e-20 Score=155.32 Aligned_cols=102 Identities=35% Similarity=0.484 Sum_probs=98.6
Q ss_pred CCCCeeeCCeeEEEEeCCCCCCeEEEEEEEEeCCCCCCCCcEEEEEEeCCCCCcEEEeCCCCeEEEeeeccccCCCCCCc
Q psy1047 1 MNAPVFSDFVYQATVKENQPIGTLVITLKAADADPPGMNSKISYSVRGGDGLGYFTIDNEGPLNHFRIVPTKIDPGTKKK 80 (127)
Q Consensus 1 ~N~P~F~~~~y~~~v~E~~~~g~~v~~~~a~D~D~~g~n~~i~y~i~~~~~~~~F~i~~~tg~i~~~~~~~~~~~~~~~~ 80 (127)
||+|+|+...|.++|.|+..++..|++++|+|+|. |.|++|.|+|.+|+..+.|.||..||++
T Consensus 369 DNaPqFse~~Yvvqv~Edvt~~avvlrV~AtDrD~-g~Ng~VHYsi~Sgn~~G~f~id~~tGel---------------- 431 (2531)
T KOG4289|consen 369 DNAPQFSEKRYVVQVREDVTPPAVVLRVTATDRDK-GTNGKVHYSIASGNGRGQFYIDSLTGEL---------------- 431 (2531)
T ss_pred CCCccccccceEEEecccCCCCceEEEEEecccCC-CcCceEEEEeeccCccccEEEecccceE----------------
Confidence 79999999999999999999999999999999999 9999999999999999999999999999
Q ss_pred eeeEEEccCCCCCCCCCeEEEEEEEEECCCCCcceeEEEEEEeEe
Q psy1047 81 EYNIVVLKLLDREIAPLGYNLTLRAVDKGTPPRETYKATPRNGLT 125 (127)
Q Consensus 81 ~~~l~~~~~ld~E~~~~~y~l~v~a~D~g~p~~~st~~i~i~~~~ 125 (127)
.+..+||+|..+ |.+.|+|+|+|.|+++.+..+.|+|+-
T Consensus 432 ----~vv~plD~e~~~--ytl~IrAqDggrPpLsn~sgl~iqVlD 470 (2531)
T KOG4289|consen 432 ----DVVEPLDFENSE--YTLRIRAQDGGRPPLSNTSGLVIQVLD 470 (2531)
T ss_pred ----EEeccccccCCe--eEEEEEcccCCCCCccCCCceEEEEEe
Confidence 899999999995 999999999999999999999999874
No 3
>cd00031 CA Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here
Probab=99.78 E-value=1.4e-17 Score=117.61 Aligned_cols=102 Identities=40% Similarity=0.582 Sum_probs=95.2
Q ss_pred CCCCeeeCCeeEEEEeCCCCCCeEEEEEEEEeCCCCCCCCcEEEEEEeCCCCCcEEEeCCCCeEEEeeeccccCCCCCCc
Q psy1047 1 MNAPVFSDFVYQATVKENQPIGTLVITLKAADADPPGMNSKISYSVRGGDGLGYFTIDNEGPLNHFRIVPTKIDPGTKKK 80 (127)
Q Consensus 1 ~N~P~F~~~~y~~~v~E~~~~g~~v~~~~a~D~D~~g~n~~i~y~i~~~~~~~~F~i~~~tg~i~~~~~~~~~~~~~~~~ 80 (127)
+|+|+|....|.+.+.|+.++|..++++.|+|+|. +.++.++|+|..+....+|.|++.+|.|
T Consensus 97 d~~P~~~~~~~~~~v~e~~~~~~~i~~~~a~D~D~-~~~~~~~y~l~~~~~~~~f~i~~~~G~i---------------- 159 (199)
T cd00031 97 DNPPVFEQSSYEASVPENAPPGTVVGTVTATDADS-GENAKLTYSILSGNDKELFSIDPNTGII---------------- 159 (199)
T ss_pred CCCCcccccceEEEEeCCCCCCCEEEEEEEEcCCC-CCCccEEEEEeCCCCCCEEEEeCCceEE----------------
Confidence 58999998999999999999999999999999998 7899999999875555899999999999
Q ss_pred eeeEEEccCCCCCCCCCeEEEEEEEEECCCCCcceeEEEEEEeE
Q psy1047 81 EYNIVVLKLLDREIAPLGYNLTLRAVDKGTPPRETYKATPRNGL 124 (127)
Q Consensus 81 ~~~l~~~~~ld~E~~~~~y~l~v~a~D~g~p~~~st~~i~i~~~ 124 (127)
.+.+.||+|... .|.+.|.|.|.+.|.++++..++|+++
T Consensus 160 ----~~~~~ld~e~~~-~~~l~v~a~D~~~~~~~~~~~i~i~v~ 198 (199)
T cd00031 160 ----TLAKPLDREEKS-SYELTVVATDGGGPPLSSTATVTVTVL 198 (199)
T ss_pred ----EeCCccCCccCc-eEEEEEEEEECCCCCceeEEEEEEEEE
Confidence 888899999999 999999999999999999999999986
No 4
>KOG1219|consensus
Probab=99.75 E-value=7.5e-18 Score=147.47 Aligned_cols=97 Identities=36% Similarity=0.491 Sum_probs=87.9
Q ss_pred CCCCeeeCCeeEEEEeCCCCCCeEEEEEEEEeCCCCCCCCcEEEEEEeCCCCCcEEEeCCCCeEEEeeeccccCCCCCCc
Q psy1047 1 MNAPVFSDFVYQATVKENQPIGTLVITLKAADADPPGMNSKISYSVRGGDGLGYFTIDNEGPLNHFRIVPTKIDPGTKKK 80 (127)
Q Consensus 1 ~N~P~F~~~~y~~~v~E~~~~g~~v~~~~a~D~D~~g~n~~i~y~i~~~~~~~~F~i~~~tg~i~~~~~~~~~~~~~~~~ 80 (127)
||+|+|..+.|.+.+.||+++|+.|.+++|.|.|+ |.||+|+|++.+. ..+|.|++++|+|
T Consensus 2675 DNaPvFe~d~y~f~i~En~pvGtsV~qf~AsD~Ds-~~nGqirysl~~~--v~yF~In~etGwl---------------- 2735 (4289)
T KOG1219|consen 2675 DNAPVFEKDPYLFIIEENSPVGTSVIQFHASDMDS-GNNGQIRYSLTSP--VPYFAINPETGWL---------------- 2735 (4289)
T ss_pred CCCccccCCceeEEEeccCCCCceEEEEEeeccCC-CCCceEEEEEcCC--cceEEEcCCCCee----------------
Confidence 79999999999999999999999999999999999 9999999999854 3499999999999
Q ss_pred eeeEEEccCCCCCCCCCeEEEEEEEEECCCCCcceeEEEEEEe
Q psy1047 81 EYNIVVLKLLDREIAPLGYNLTLRAVDKGTPPRETYKATPRNG 123 (127)
Q Consensus 81 ~~~l~~~~~ld~E~~~~~y~l~v~a~D~g~p~~~st~~i~i~~ 123 (127)
.+.+.||+|+++ .|.|.|+|+|+|.|. +++++.+++
T Consensus 2736 ----Tt~~eld~ek~d-~y~lkv~AtDhG~~s--sq~~v~v~v 2771 (4289)
T KOG1219|consen 2736 ----TTLFELDLEKQD-LYSLKVVATDHGVPS--SQATVLVHV 2771 (4289)
T ss_pred ----eehhhhccccCC-ceEEEEEEecCCccc--ccceEEEEE
Confidence 889999999999 899999999999994 444454443
No 5
>KOG4289|consensus
Probab=99.75 E-value=1e-17 Score=142.83 Aligned_cols=103 Identities=37% Similarity=0.524 Sum_probs=97.0
Q ss_pred CCCCeeeCCeeEEEEeCCCCCCeEEEEEEEEeCCCCCCCCcEEEEEEeCC-CCCcEEEeCCCCeEEEeeeccccCCCCCC
Q psy1047 1 MNAPVFSDFVYQATVKENQPIGTLVITLKAADADPPGMNSKISYSVRGGD-GLGYFTIDNEGPLNHFRIVPTKIDPGTKK 79 (127)
Q Consensus 1 ~N~P~F~~~~y~~~v~E~~~~g~~v~~~~a~D~D~~g~n~~i~y~i~~~~-~~~~F~i~~~tg~i~~~~~~~~~~~~~~~ 79 (127)
+|+|+|..+.|.+.|.|+.|+++.++++.|+|+|. |+|+.+.|.+..++ ..+.|.|++.+|.|
T Consensus 778 DnaPqf~assyt~sV~Ed~Pv~TsvlQVSatDaD~-g~Ng~v~y~~qg~~d~p~~F~IEptSGvi--------------- 841 (2531)
T KOG4289|consen 778 DNAPQFLASSYTGSVFEDAPVFTSVLQVSATDADS-GPNGRVYYTFQGGDDGPGDFYIEPTSGVI--------------- 841 (2531)
T ss_pred ccCcccchhhceeEeecCCCCcceEEEEEEeccCC-CCCceEEEEecCCCCCCCceEEccCccee---------------
Confidence 69999999999999999999999999999999999 99999999886643 46899999999999
Q ss_pred ceeeEEEccCCCCCCCCCeEEEEEEEEECCCCCcceeEEEEEEeEe
Q psy1047 80 KEYNIVVLKLLDREIAPLGYNLTLRAVDKGTPPRETYKATPRNGLT 125 (127)
Q Consensus 80 ~~~~l~~~~~ld~E~~~~~y~l~v~a~D~g~p~~~st~~i~i~~~~ 125 (127)
++.+.||||... .|.|.+.|.|.|+|++++++.|+|-+|-
T Consensus 842 -----Rtl~rLdRE~~a-vy~L~a~avDrg~p~ls~~~eItvtvld 881 (2531)
T KOG4289|consen 842 -----RTLRRLDRENVA-VYVLAAYAVDRGNPPLSAPVEITVTVLD 881 (2531)
T ss_pred -----ehhhhhcchhee-EEEEEEEEeeCCCCCcCCceEEEEEEEe
Confidence 999999999999 9999999999999999999999998874
No 6
>cd00031 CA Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here
Probab=99.73 E-value=1.8e-16 Score=112.03 Aligned_cols=93 Identities=39% Similarity=0.572 Sum_probs=85.9
Q ss_pred eeEEEEeCCCCCCeEEEEEEEEeCCCCCCCCcEEEEEEeCCCCCcEEEeCCCCeEEEeeeccccCCCCCCceeeEEEccC
Q psy1047 10 VYQATVKENQPIGTLVITLKAADADPPGMNSKISYSVRGGDGLGYFTIDNEGPLNHFRIVPTKIDPGTKKKEYNIVVLKL 89 (127)
Q Consensus 10 ~y~~~v~E~~~~g~~v~~~~a~D~D~~g~n~~i~y~i~~~~~~~~F~i~~~tg~i~~~~~~~~~~~~~~~~~~~l~~~~~ 89 (127)
.|.+.++|+.++|+.+.++.|.|+|. +.+..++|+|.++....+|.|++.+|.| .+.+.
T Consensus 1 ~~~~~i~En~~~g~~v~~~~a~D~D~-~~~~~~~y~i~~~~~~~~F~i~~~tG~l--------------------~~~~~ 59 (199)
T cd00031 1 SYSVSVPENAPPGTVVGTVSATDPDS-GENGRVTYSILGGNEDGLFSIDPNTGVI--------------------TTTKP 59 (199)
T ss_pred CeEEEEeCCCCCCCEEEEEEEECCCC-CCCceEEEEEeCCCCcccEEEeCCCCEE--------------------EECCC
Confidence 37899999999999999999999998 7789999999876656899999999999 89999
Q ss_pred CCCCCCCCeEEEEEEEEECCCCCcceeEEEEEEeE
Q psy1047 90 LDREIAPLGYNLTLRAVDKGTPPRETYKATPRNGL 124 (127)
Q Consensus 90 ld~E~~~~~y~l~v~a~D~g~p~~~st~~i~i~~~ 124 (127)
||||... .|.|.|.|.|.|.|+++++..++|+++
T Consensus 60 lD~e~~~-~~~l~v~a~D~g~~~~~~~~~v~I~V~ 93 (199)
T cd00031 60 LDREEQS-EYTLTVVASDGGGPPLSSTATVTVTVL 93 (199)
T ss_pred CCCcCCc-eEEEEEEEEECCcCcceeEEEEEEEEc
Confidence 9999999 999999999999999889999999886
No 7
>KOG1219|consensus
Probab=99.72 E-value=4.9e-17 Score=142.53 Aligned_cols=103 Identities=31% Similarity=0.418 Sum_probs=100.0
Q ss_pred CCCCeeeCCeeEEEEeCCCCCCeEEEEEEEEeCCCCCCCCcEEEEEEeCCCCCcEEEeCCCCeEEEeeeccccCCCCCCc
Q psy1047 1 MNAPVFSDFVYQATVKENQPIGTLVITLKAADADPPGMNSKISYSVRGGDGLGYFTIDNEGPLNHFRIVPTKIDPGTKKK 80 (127)
Q Consensus 1 ~N~P~F~~~~y~~~v~E~~~~g~~v~~~~a~D~D~~g~n~~i~y~i~~~~~~~~F~i~~~tg~i~~~~~~~~~~~~~~~~ 80 (127)
||+|+|.+..|..+|.|+++.+..|.++.|+|+|+ ..|+++.|.|.+|++.+.|.|++.||.|
T Consensus 1053 Dn~Pq~s~pvy~asI~enSp~~vsivq~ea~D~Ds-ssn~kLmykI~sGnyq~FF~Id~~TG~i---------------- 1115 (4289)
T KOG1219|consen 1053 DNVPQFSSPVYYASISENSPETVSIVQAEANDPDS-SSNQKLMYKITSGNYQGFFQIDPETGLI---------------- 1115 (4289)
T ss_pred CCCcccCCceEeeeeccCCCCceEEEEeccCCCCc-ccCcceEEEEccCCccceEEEcccccee----------------
Confidence 79999999999999999999999999999999998 7899999999999999999999999999
Q ss_pred eeeEEEccCCCCCCCCCeEEEEEEEEECCCCCcceeEEEEEEeEe
Q psy1047 81 EYNIVVLKLLDREIAPLGYNLTLRAVDKGTPPRETYKATPRNGLT 125 (127)
Q Consensus 81 ~~~l~~~~~ld~E~~~~~y~l~v~a~D~g~p~~~st~~i~i~~~~ 125 (127)
.+.+.||||++. ++.|.|.+.|.|.|++.+...+.|++|.
T Consensus 1116 ----Tt~r~LDRE~qd-EHiLeVTi~D~gep~l~s~~rviV~Ild 1155 (4289)
T KOG1219|consen 1116 ----TTIRRLDREKQD-EHILEVTIQDNGEPWLCSNQRVIVSILD 1155 (4289)
T ss_pred ----eeehhhcccccc-cceEEEEEecCCCCccccceEEEEEEee
Confidence 888999999999 9999999999999999999999999985
No 8
>smart00112 CA Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
Probab=99.66 E-value=1.4e-15 Score=93.42 Aligned_cols=74 Identities=39% Similarity=0.560 Sum_probs=67.3
Q ss_pred EeCCCCCCCCcEEEEEEeCCCCCcEEEeCCCCeEEEeeeccccCCCCCCceeeEEEccCCCCCCCCCeEEEEEEEEECCC
Q psy1047 31 ADADPPGMNSKISYSVRGGDGLGYFTIDNEGPLNHFRIVPTKIDPGTKKKEYNIVVLKLLDREIAPLGYNLTLRAVDKGT 110 (127)
Q Consensus 31 ~D~D~~g~n~~i~y~i~~~~~~~~F~i~~~tg~i~~~~~~~~~~~~~~~~~~~l~~~~~ld~E~~~~~y~l~v~a~D~g~ 110 (127)
+|+|. |.|+.++|+|.+++...+|.|++.+|.| .+.++||||... .|.|.|.|+|.|.
T Consensus 1 ~D~D~-g~n~~i~Y~i~~~~~~~~F~i~~~tg~i--------------------~~~~~LD~e~~~-~y~l~v~a~D~~~ 58 (79)
T smart00112 1 TDADS-GENGKVTYSILSGNEDGLFSIDPETGEI--------------------TTTKPLDREEQP-EYTLTVEATDGGG 58 (79)
T ss_pred CCCCC-CcCcEEEEEEecCCCCCEEEEeCCccEE--------------------EeCCccCeeCCC-eEEEEEEEEECCC
Confidence 48898 8899999999976666899999999988 888899999999 9999999999999
Q ss_pred CCcceeEEEEEEeEeC
Q psy1047 111 PPRETYKATPRNGLTC 126 (127)
Q Consensus 111 p~~~st~~i~i~~~~~ 126 (127)
|+++++++++|+++-+
T Consensus 59 ~~~~~~~~v~I~V~D~ 74 (79)
T smart00112 59 PPLSSTATVTVTVLDV 74 (79)
T ss_pred CCcccEEEEEEEEEEC
Confidence 9999999999999753
No 9
>KOG1834|consensus
Probab=99.00 E-value=4.3e-09 Score=85.79 Aligned_cols=98 Identities=26% Similarity=0.252 Sum_probs=84.8
Q ss_pred CCCeeeCCeeEEEEeCCCCCCeEEEEEEEEeCCCCCCCCcE-EEEEEeCCCCCcEEEeCCCCeEEEeeeccccCCCCCCc
Q psy1047 2 NAPVFSDFVYQATVKENQPIGTLVITLKAADADPPGMNSKI-SYSVRGGDGLGYFTIDNEGPLNHFRIVPTKIDPGTKKK 80 (127)
Q Consensus 2 N~P~F~~~~y~~~v~E~~~~g~~v~~~~a~D~D~~g~n~~i-~y~i~~~~~~~~F~i~~~tg~i~~~~~~~~~~~~~~~~ 80 (127)
.||+|....|.+.|.|. ++...|+++.|.|.|-...+++| .|.|. .++-.|.||.. |.|
T Consensus 145 ~AP~f~ep~Yka~V~EG-K~yd~il~veAiD~DCspq~sqIC~YEI~--t~d~PFaIdn~-G~i---------------- 204 (952)
T KOG1834|consen 145 FAPVFKEPWYKAHVTEG-KVYDSILRVEAIDKDCSPQYSQICEYEIT--TPDVPFAIDND-GNI---------------- 204 (952)
T ss_pred cCchhcccceeeEEecc-eeeeeeEEEEeecCCCCCcccceeEEEec--CCCCceEEcCC-Ccc----------------
Confidence 58999999999999998 66899999999999974577888 89997 46778999876 888
Q ss_pred eeeEEEccCCCCCCCCCeEEEEEEEEECCCCCcceeEEEEEEeE
Q psy1047 81 EYNIVVLKLLDREIAPLGYNLTLRAVDKGTPPRETYKATPRNGL 124 (127)
Q Consensus 81 ~~~l~~~~~ld~E~~~~~y~l~v~a~D~g~p~~~st~~i~i~~~ 124 (127)
..+..|+|.+.. +|.|+|+|.|-|.-...+-+-+.|++-
T Consensus 205 ----rnTekLny~ke~-~Y~ltVtAyDCg~kraa~d~lV~v~Vk 243 (952)
T KOG1834|consen 205 ----RNTEKLNYTKEH-QYKLTVTAYDCGKKRAASDSLVTVHVK 243 (952)
T ss_pred ----ccccccccccce-eEEEEEEEEecccccccCcceEEEEec
Confidence 999999999999 999999999999877777666666653
No 10
>PF08266 Cadherin_2: Cadherin-like; InterPro: IPR013164 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion. This entry represents a cadherin domain that is usually found at the N terminus of cadherin proteins.; PDB: 1WUZ_A 1WYJ_A.
Probab=97.72 E-value=3.2e-05 Score=48.36 Aligned_cols=63 Identities=22% Similarity=0.287 Sum_probs=40.4
Q ss_pred eeEEEEeCCCCCCeEEEEEEEEeCCCCCCC--CcEEEEEEeCCCCCcEEEeCCCCeEEEeeeccccCCCCCCceeeEEEc
Q psy1047 10 VYQATVKENQPIGTLVITLKAADADPPGMN--SKISYSVRGGDGLGYFTIDNEGPLNHFRIVPTKIDPGTKKKEYNIVVL 87 (127)
Q Consensus 10 ~y~~~v~E~~~~g~~v~~~~a~D~D~~g~n--~~i~y~i~~~~~~~~F~i~~~tg~i~~~~~~~~~~~~~~~~~~~l~~~ 87 (127)
+....|+|..++|+.|+.+ |.|.-. ... ....|++.+.....+|.++..+|.+ .+.
T Consensus 2 qi~YsV~EE~~~Gt~IGni-a~dL~l-~~~~l~~~~~ri~s~~~~~~~~v~~~tG~L--------------------~v~ 59 (84)
T PF08266_consen 2 QIRYSVPEEMPPGTVIGNI-AKDLGL-DPQSLSSRNFRIVSEGNSQYFRVNEKTGDL--------------------FVS 59 (84)
T ss_dssp EEEEEEESS--TT-EEEEC-CCCCT---HHHHCCTTBEEE-SSSS-SEEE-TTTSEE--------------------EES
T ss_pred CeEEEeecCCCCCCEEEEh-HHhhCC-CcccccccceEEeecCCcceeEecCCceeE--------------------EeC
Confidence 3567899999999999998 555432 110 1235676666678899999999999 899
Q ss_pred cCCCCCC
Q psy1047 88 KLLDREI 94 (127)
Q Consensus 88 ~~ld~E~ 94 (127)
..+|||.
T Consensus 60 ~rIDRE~ 66 (84)
T PF08266_consen 60 ERIDREE 66 (84)
T ss_dssp S--SCCC
T ss_pred CccCHHH
Confidence 9999997
No 11
>PF08758 Cadherin_pro: Cadherin prodomain like; InterPro: IPR014868 Cadherins are a group of proteins that mediate calcium dependent cell-cell adhesion. They are activated through cleavage of a prosequence in the late Golgi. This protein corresponds to the folded region of the prosequence, and is termed the prodomain. The prodomain shows structural resemblance to the cadherin domain, but lacks all the features known to be important for cadherin-cadherin interactions []. ; GO: 0007155 cell adhesion, 0016021 integral to membrane; PDB: 1OP4_A.
Probab=97.61 E-value=0.0014 Score=41.46 Aligned_cols=81 Identities=17% Similarity=0.188 Sum_probs=44.2
Q ss_pred CCeeeCCeeEEEEeCCCCCCeEEEEEEEEeCCCCCCCCcEEEEEEeCCCCCcEEEeCCCCeEEEeeeccccCCCCCCcee
Q psy1047 3 APVFSDFVYQATVKENQPIGTLVITLKAADADPPGMNSKISYSVRGGDGLGYFTIDNEGPLNHFRIVPTKIDPGTKKKEY 82 (127)
Q Consensus 3 ~P~F~~~~y~~~v~E~~~~g~~v~~~~a~D~D~~g~n~~i~y~i~~~~~~~~F~i~~~tg~i~~~~~~~~~~~~~~~~~~ 82 (127)
.|-|.+..|.+.|+.+...|..|+++.-.|=.. +..+.|.- +|+ .|.|..+ |.|
T Consensus 3 ~pGF~~~~~~~~Vp~~l~~g~~lg~V~f~dC~~---~~~~~~~s--sDp--dF~V~~D-GsV------------------ 56 (90)
T PF08758_consen 3 RPGFSQKKYTFEVPSNLEAGQPLGKVNFEDCTG---RRRVIFES--SDP--DFRVLED-GSV------------------ 56 (90)
T ss_dssp --B--S-EEEE----SS-SS--EEE---B--SS------EEEE-----S--EEEEETT-TEE------------------
T ss_pred cCCcccceEEEEcCchhhCCcEEEEEEeccCCC---CCceEEec--CCC--CEEEcCC-CeE------------------
Confidence 488999999999999999999999998877643 45677864 344 7999987 998
Q ss_pred eEEEccCCCCCCCCCeEEEEEEEEECCCCCc
Q psy1047 83 NIVVLKLLDREIAPLGYNLTLRAVDKGTPPR 113 (127)
Q Consensus 83 ~l~~~~~ld~E~~~~~y~l~v~a~D~g~p~~ 113 (127)
++.+++...... -.|.|.|.|......
T Consensus 57 --y~~r~v~l~~~~--~~F~V~a~D~~~~~~ 83 (90)
T PF08758_consen 57 --YAKRPVQLSSEQ--RSFTVHAWDSQTQEQ 83 (90)
T ss_dssp --EEES--S-SSS---EEEEEEEEETTTTEE
T ss_pred --EEeeeEecCCCc--eEEEEEEECCCCCeE
Confidence 899998876554 589999999766554
No 12
>KOG1834|consensus
Probab=97.12 E-value=0.0072 Score=50.36 Aligned_cols=98 Identities=22% Similarity=0.254 Sum_probs=66.2
Q ss_pred CCeeEEEEeCCCCCCeEEEEEEEEeCCCC-CCCCcE-EEEEEeC-CCCCcEEEeCCCCeEEEeeeccccCCCCCCceeeE
Q psy1047 8 DFVYQATVKENQPIGTLVITLKAADADPP-GMNSKI-SYSVRGG-DGLGYFTIDNEGPLNHFRIVPTKIDPGTKKKEYNI 84 (127)
Q Consensus 8 ~~~y~~~v~E~~~~g~~v~~~~a~D~D~~-g~n~~i-~y~i~~~-~~~~~F~i~~~tg~i~~~~~~~~~~~~~~~~~~~l 84 (127)
...|...|.||-..=...--+.|-|.|.+ ...+.| -|.|-+. -+-....+|+.||+ +.|
T Consensus 34 e~ey~gvV~Endntvll~Ppl~aLdkdaplr~ageiC~fklhgq~vPFdavVvdK~TGe------------------gvl 95 (952)
T KOG1834|consen 34 EEEYHGVVTENDNTVLLDPPLAALDKDAPLRYAGEICGFKLHGQPVPFDAVVVDKYTGE------------------GVL 95 (952)
T ss_pred ccceeEEEEeCCceEEeCCCeeeecCCCCcccccccceeEecCCCCCceEEEEeccCCc------------------eEE
Confidence 45799999998553333344677788851 123455 5666431 12344456666654 456
Q ss_pred EEccCCCCCCCCCeEEEEEEEEECCCCC------cceeEEEEEEeE
Q psy1047 85 VVLKLLDREIAPLGYNLTLRAVDKGTPP------RETYKATPRNGL 124 (127)
Q Consensus 85 ~~~~~ld~E~~~~~y~l~v~a~D~g~p~------~~st~~i~i~~~ 124 (127)
+...+||-|.++ +|+|+|+|.|-|.-+ .+.-+++||+|-
T Consensus 96 RaK~~lDCelqk-eytf~iQAydCg~gpdgtn~kKShkatvhIrVk 140 (952)
T KOG1834|consen 96 RAKEPLDCELQK-EYTFTIQAYDCGNGPDGTNTKKSHKATVHIRVK 140 (952)
T ss_pred eecCcccccccc-cceEEEEEEecCCCCCccccccccceEEEEEec
Confidence 888999999999 999999999988644 556688888764
No 13
>smart00736 CADG Dystroglycan-type cadherin-like domains. Cadherin-homologous domains present in metazoan dystroglycans and alpha/epsilon sarcoglycans, yeast Axl2p and in a very large protein from magnetotactic bacteria. Likely to bind calcium ions.
Probab=95.98 E-value=0.2 Score=31.57 Aligned_cols=66 Identities=15% Similarity=0.156 Sum_probs=45.4
Q ss_pred EEeCCCCCCCCcEEEEEEeCC---CCCcEEEeCCCCeEEEeeeccccCCCCCCceeeEEEccCCCCCCCCCeEEEEEEEE
Q psy1047 30 AADADPPGMNSKISYSVRGGD---GLGYFTIDNEGPLNHFRIVPTKIDPGTKKKEYNIVVLKLLDREIAPLGYNLTLRAV 106 (127)
Q Consensus 30 a~D~D~~g~n~~i~y~i~~~~---~~~~F~i~~~tg~i~~~~~~~~~~~~~~~~~~~l~~~~~ld~E~~~~~y~l~v~a~ 106 (127)
..|+|. ..++|++...+ ...|++++..++.+ .= .+.+.+ .. .|.+.|.|+
T Consensus 24 F~d~d~----~~lty~~~~~~~~~lP~Wl~fd~~~~~~--------------------~G-tP~~~~-~g-~~~i~v~a~ 76 (97)
T smart00736 24 FTDADG----DTLTYSATLSDGSALPSWLSFDSDTGTL--------------------SG-TPTNSD-VG-SLSLKVTAT 76 (97)
T ss_pred eECCCC----CeEEEEEEeCCCCCCCCeEEEeCCCCEE--------------------EE-ECCCCC-Cc-EEEEEEEEE
Confidence 456663 46889886433 25799999988876 32 455444 46 799999999
Q ss_pred ECCCCCcceeEEEEEEeE
Q psy1047 107 DKGTPPRETYKATPRNGL 124 (127)
Q Consensus 107 D~g~p~~~st~~i~i~~~ 124 (127)
|... .+....+.|+|-
T Consensus 77 D~~g--~~~~~~f~i~V~ 92 (97)
T smart00736 77 DSSG--ASASDTFTITVV 92 (97)
T ss_pred ECCC--CEEEEEEEEEEe
Confidence 9875 455566766653
No 14
>PF05345 He_PIG: Putative Ig domain; InterPro: IPR008009 This alignment represents the conserved core region of a ~90 residue repeat found in several haemagglutinins and other cell surface proteins. Sequence similarities to Hyalin (IPR003410 from INTERPRO) and the PKD domain (IPR000601 from INTERPRO) suggest an Ig-like fold so this family may be similar in function to the (IPR003791 from INTERPRO) and (IPR003790 from INTERPRO) protein families.
Probab=93.21 E-value=0.64 Score=25.80 Aligned_cols=38 Identities=13% Similarity=-0.004 Sum_probs=26.5
Q ss_pred CCCCcEEEeCCCCeEEEeeeccccCCCCCCceeeEEEccCCCCCCCCCeEEEEEEEEECC
Q psy1047 50 DGLGYFTIDNEGPLNHFRIVPTKIDPGTKKKEYNIVVLKLLDREIAPLGYNLTLRAVDKG 109 (127)
Q Consensus 50 ~~~~~F~i~~~tg~i~~~~~~~~~~~~~~~~~~~l~~~~~ld~E~~~~~y~l~v~a~D~g 109 (127)
....++.||+.+|.| .-.+-...... .|.+.|.|.|..
T Consensus 11 ~LP~gLs~d~~tG~i---------------------sGtp~~~~~~G-~y~~~vtatd~~ 48 (49)
T PF05345_consen 11 GLPSGLSLDPSTGTI---------------------SGTPTSSVQPG-TYTFTVTATDGS 48 (49)
T ss_pred CCCCcEEEeCCCCEE---------------------EeecCCCcccc-EEEEEEEEEcCC
Confidence 346789999999988 22222222346 799999999964
No 15
>PF07495 Y_Y_Y: Y_Y_Y domain; InterPro: IPR011123 This region is mostly found at the end of the beta propellers (IPR011110 from INTERPRO) in a family of two component regulators. However they are also found tandemly repeated in Q891H4 from SWISSPROT without other signal conduction domains being present. It is named after the conserved tyrosines found in the alignment. The exact function is not known.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=88.38 E-value=3.2 Score=23.66 Aligned_cols=59 Identities=17% Similarity=0.089 Sum_probs=34.5
Q ss_pred CcEEEEEEeCCCCCcEEEeCCCCeEEEeeeccccCCCCCCceeeEEEccCCCCCCCCCeEEEEEEEEECCCCCcceeEEE
Q psy1047 40 SKISYSVRGGDGLGYFTIDNEGPLNHFRIVPTKIDPGTKKKEYNIVVLKLLDREIAPLGYNLTLRAVDKGTPPRETYKAT 119 (127)
Q Consensus 40 ~~i~y~i~~~~~~~~F~i~~~tg~i~~~~~~~~~~~~~~~~~~~l~~~~~ld~E~~~~~y~l~v~a~D~g~p~~~st~~i 119 (127)
-..+|+|. |-...|..+...+-.+ .-.. -... .|+|.|.|.|..........++
T Consensus 8 ~~Y~Y~l~-g~d~~W~~~~~~~~~~---------------------~~~~---L~~G-~Y~l~V~a~~~~~~~~~~~~~l 61 (66)
T PF07495_consen 8 IRYRYRLE-GFDDEWITLGSYSNSI---------------------SYTN---LPPG-KYTLEVRAKDNNGKWSSDEKSL 61 (66)
T ss_dssp EEEEEEEE-TTESSEEEESSTS-EE---------------------EEES-----SE-EEEEEEEEEETTS-B-SS-EEE
T ss_pred eEEEEEEE-CCCCeEEECCCCcEEE---------------------EEEe---CCCE-EEEEEEEEECCCCCcCcccEEE
Confidence 34566666 4556777776653244 1122 2456 8999999999876655554667
Q ss_pred EEEeE
Q psy1047 120 PRNGL 124 (127)
Q Consensus 120 ~i~~~ 124 (127)
+|.|+
T Consensus 62 ~i~I~ 66 (66)
T PF07495_consen 62 TITIL 66 (66)
T ss_dssp EEEEE
T ss_pred EEEEC
Confidence 77664
No 16
>smart00089 PKD Repeats in polycystic kidney disease 1 (PKD1) and other proteins. Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.
Probab=67.38 E-value=20 Score=20.91 Aligned_cols=29 Identities=10% Similarity=-0.007 Sum_probs=21.1
Q ss_pred CCCCCCCeEEEEEEEEECCCCCcceeEEEEEEe
Q psy1047 91 DREIAPLGYNLTLRAVDKGTPPRETYKATPRNG 123 (127)
Q Consensus 91 d~E~~~~~y~l~v~a~D~g~p~~~st~~i~i~~ 123 (127)
.|+... .|.+++.+.|... +++.++.|.+
T Consensus 50 ~y~~~G-~y~v~l~v~n~~g---~~~~~~~i~v 78 (79)
T smart00089 50 TYTKPG-TYTVTLTVTNAVG---SASATVTVVV 78 (79)
T ss_pred EeCCCc-EEEEEEEEEcCCC---cEEEEEEEEE
Confidence 456777 8999999999766 5555555544
No 17
>TIGR03660 T1SS_rpt_143 T1SS-143 repeat domain. This model represents a domain of about 143 amino acids that may occur singly or in up to 23 tandem repeats in very large proteins in the genus Vibrio, and in related species such as Legionella pneumophila, Photobacterium profundum, Rhodopseudomonas palustris, Shewanella pealeana, and Aeromonas hydrophila. Proteins with these domains represent a subset of a broader set of proteins with a particular signal for type 1 secretion, consisting of several glycine-rich repeats modeled by pfam00353, followed by a C-terminal domain modeled by TIGR03661. Proteins with this domain tend to share several properties with the RtxA (Repeats in Toxin) protein of Vibrio cholerae, including a large size often containing tandemly repeated domains and a C-terminal signal for type 1 secretion.
Probab=60.85 E-value=46 Score=22.54 Aligned_cols=41 Identities=15% Similarity=0.095 Sum_probs=27.5
Q ss_pred ceeeEEEccCCCCCC---CCCeEEEEEEEEECCCCCcceeEEEEE
Q psy1047 80 KEYNIVVLKLLDREI---APLGYNLTLRAVDKGTPPRETYKATPR 121 (127)
Q Consensus 80 ~~~~l~~~~~ld~E~---~~~~y~l~v~a~D~g~p~~~st~~i~i 121 (127)
+.++..+..+||... .- ...|.|.|+|...-..+.+.+|+|
T Consensus 66 GsYtftL~~~lDH~~g~d~l-~l~~~v~a~D~DGD~s~~~l~VtI 109 (137)
T TIGR03660 66 GSYEFTLEGPLDHAAGSDEL-TLNFPIIATDFDGDTSSITLPVTI 109 (137)
T ss_pred ccEEEEEcccccCCCCCceE-EEeeeEEEEeCCCCccccEEEEEE
Confidence 457778888888733 33 577888999876655555444444
No 18
>TIGR01965 VCBS_repeat VCBS repeat. This domain of about 100 residues is found multiple (up to 35) copies in long proteins from several species of Vibrio, Colwellia, Bradyrhizobium, and Shewanella (hence the name VCBS) and in smaller copy numbers in proteins from several other bacteria. The large protein size and repeat copy numbers, species distribution, and suggested activities of several member proteins suggests a role for this domain in adhesion.
Probab=59.92 E-value=43 Score=21.42 Aligned_cols=72 Identities=19% Similarity=0.219 Sum_probs=38.3
Q ss_pred EEEEEeCCCCCCCCcEEEEEEe-CCCCCcEEEeCCCCeEEEeeeccccCCCCCCceeeEE----EccCCCCCCCCCeEEE
Q psy1047 27 TLKAADADPPGMNSKISYSVRG-GDGLGYFTIDNEGPLNHFRIVPTKIDPGTKKKEYNIV----VLKLLDREIAPLGYNL 101 (127)
Q Consensus 27 ~~~a~D~D~~g~n~~i~y~i~~-~~~~~~F~i~~~tg~i~~~~~~~~~~~~~~~~~~~l~----~~~~ld~E~~~~~y~l 101 (127)
++.++|+|. +.. ..+++.. ...-+.|.|+.. |.-+ +.|- ..+.|..-..- .-.|
T Consensus 3 ~Lt~sD~D~-gd~--~~~s~~~~~g~yGtlti~~~-G~wt----------------Ytl~n~~~avq~L~~Ge~~-tdsF 61 (99)
T TIGR01965 3 QLTISDADA-GQA--HFIAQTDAAGQYGTFSIDAD-GQWT----------------YQADNSQTAVQALKAGETL-TDTF 61 (99)
T ss_pred ceEEeCCCC-CCc--eEEecccccCCcEEEEECCC-CcEE----------------EEeCCCcHHHHhhcCCCEE-EEEE
Confidence 578899997 432 3454421 123456788763 5431 1110 11233322333 4678
Q ss_pred EEEEEECCCCCcceeEEEEEEe
Q psy1047 102 TLRAVDKGTPPRETYKATPRNG 123 (127)
Q Consensus 102 ~v~a~D~g~p~~~st~~i~i~~ 123 (127)
++.+.|+ .+.+.+|+|++
T Consensus 62 tvtv~DG----tt~~vtItI~G 79 (99)
T TIGR01965 62 TVTSADG----TSQTVTITITG 79 (99)
T ss_pred EEEEeCC----CeEEEEEEEEc
Confidence 8888885 26666777664
No 19
>PF13754 Big_3_4: Bacterial Ig-like domain (group 3)
Probab=59.68 E-value=27 Score=19.42 Aligned_cols=28 Identities=18% Similarity=0.262 Sum_probs=19.6
Q ss_pred CCCCeEEEEEEEEEC-CCCCcceeEEEEEE
Q psy1047 94 IAPLGYNLTLRAVDK-GTPPRETYKATPRN 122 (127)
Q Consensus 94 ~~~~~y~l~v~a~D~-g~p~~~st~~i~i~ 122 (127)
... .|.+++.|.|. |+....+...++|.
T Consensus 22 ~dG-~y~itv~a~D~AGN~s~~~~~~~tiD 50 (54)
T PF13754_consen 22 ADG-TYTITVTATDAAGNTSTSSSVTFTID 50 (54)
T ss_pred CCc-cEEEEEEEEeCCCCCCCccceeEEEe
Confidence 355 79999999995 55555545556654
No 20
>cd00146 PKD polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.
Probab=59.42 E-value=22 Score=20.86 Aligned_cols=19 Identities=16% Similarity=0.134 Sum_probs=15.5
Q ss_pred CCCCCCCeEEEEEEEEECCC
Q psy1047 91 DREIAPLGYNLTLRAVDKGT 110 (127)
Q Consensus 91 d~E~~~~~y~l~v~a~D~g~ 110 (127)
.|.... .|.++|.+.|...
T Consensus 52 ~y~~~G-~y~v~l~v~d~~g 70 (81)
T cd00146 52 TYTKPG-TYTVTLTVTNAVG 70 (81)
T ss_pred EcCCCc-EEEEEEEEEeCCC
Confidence 467778 8999999999753
No 21
>PF00870 P53: P53 DNA-binding domain; InterPro: IPR011615 This domain is found in p53 transcription factors, where it is responsible for DNA-binding. These transcription factors play diverse roles in the regulation of cellular functions: the p53 tumour suppressor upregulates the expression of genes involved in cell cycle arrest and apoptosis []. The DNA-binding domain acts to clamp, or in the case of TonEBP, encircle the DNA target in order to stabilise the protein-DNA complex []. Protein interactions may also serve to stabilise the protein-DNA complex, for example in the STAT-1 dimer the SH2 (Src homology 2) domain in each monomer is coupled to the DNA-binding domain to increase stability []. The DNA-binding domain consists of a beta-sandwich formed of 9 strands in 2 sheets with a Greek-key topology. This structure is found in many transcription factors, often within the DNA-binding domain.; GO: 0044212 transcription regulatory region DNA binding; PDB: 3US2_A 3QYM_E 3QYN_A 3US0_C 3US1_D 2RMN_A 3Q06_B 2VUK_A 2H1L_M 1KZY_B ....
Probab=43.67 E-value=42 Score=24.11 Aligned_cols=29 Identities=14% Similarity=0.083 Sum_probs=21.0
Q ss_pred eEEEEEEEEECCCCCcceeEEEEEEeEeCC
Q psy1047 98 GYNLTLRAVDKGTPPRETYKATPRNGLTCP 127 (127)
Q Consensus 98 ~y~l~v~a~D~g~p~~~st~~i~i~~~~~~ 127 (127)
...+.++..|.+. ..-+...+.|+||+||
T Consensus 157 pi~~ifTLE~~~g-~vlGR~~~~vrvCaCP 185 (196)
T PF00870_consen 157 PILLIFTLEDSDG-QVLGRRSFEVRVCACP 185 (196)
T ss_dssp -EEEEEEEEETTS-CEEEEEEEEEEEESSH
T ss_pred eEEEEEEEECCCC-CEeeeeccCceEcCCC
Confidence 3566666677433 4458899999999998
No 22
>PF15418 DUF4625: Domain of unknown function (DUF4625)
Probab=39.30 E-value=66 Score=21.59 Aligned_cols=24 Identities=13% Similarity=0.232 Sum_probs=16.6
Q ss_pred eEEEEEEEEECCCCCcceeEEEEE
Q psy1047 98 GYNLTLRAVDKGTPPRETYKATPR 121 (127)
Q Consensus 98 ~y~l~v~a~D~g~p~~~st~~i~i 121 (127)
.|.|.+++.|...-...-...|.|
T Consensus 108 ~YH~~i~VtD~~Gn~~~~~~~i~I 131 (132)
T PF15418_consen 108 DYHFMITVTDAAGNQTEEERSIKI 131 (132)
T ss_pred ceEEEEEEEECCCCEEEEEEEEEE
Confidence 799999999976555444444444
No 23
>cd02848 Chitinase_N_term Chitinase N-terminus domain. Chitinases hydrolyze the abundant natural biopolymer chitin, producing smaller chito-oligosaccharides. Chitin consists of multiple N-acetyl-D-glucosamine (NAG) residues connected via beta-1,4-glycosidic linkages and is an important structural element of fungal cell wall and arthropod exoskeletons. On the basis of the mode of chitin hydrolysis, chitinases are classified as random, endo-, and exo-chitinases and based on sequence criteria, chitinases belong to families 18 and 19 of glycosyl hydrolases. The N-terminus of chitinase may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitob
Probab=38.50 E-value=1.1e+02 Score=19.86 Aligned_cols=32 Identities=6% Similarity=0.004 Sum_probs=23.5
Q ss_pred CCCCCCCCCeEEEEEEEEECCCCCcceeEEEEE
Q psy1047 89 LLDREIAPLGYNLTLRAVDKGTPPRETYKATPR 121 (127)
Q Consensus 89 ~ld~E~~~~~y~l~v~a~D~g~p~~~st~~i~i 121 (127)
.+++.+.. .|.++|.+.|...-+.+....|.|
T Consensus 73 t~~v~kgG-~y~m~V~lCn~dGCS~S~~~~I~V 104 (106)
T cd02848 73 TFKVGKGG-RYQMQVALCNGDGCSTSAAKEIVV 104 (106)
T ss_pred EEEeCCCC-eEEEEEEEECCCCccCcCCEEEEe
Confidence 34466677 899999999987766666665554
No 24
>PF10794 DUF2606: Protein of unknown function (DUF2606); InterPro: IPR019730 This entry represents bacterial proteins with unknown function.
Probab=34.60 E-value=1.4e+02 Score=19.93 Aligned_cols=47 Identities=32% Similarity=0.404 Sum_probs=31.7
Q ss_pred eEEEEeCCCCC-CeEEEEEEEEeCCCCCCCCcEEEEEEeCCCCCcEEEe
Q psy1047 11 YQATVKENQPI-GTLVITLKAADADPPGMNSKISYSVRGGDGLGYFTID 58 (127)
Q Consensus 11 y~~~v~E~~~~-g~~v~~~~a~D~D~~g~n~~i~y~i~~~~~~~~F~i~ 58 (127)
++++-.|.+|+ |..|..+.|.|.|. .++.++--+|...|..+.+..+
T Consensus 46 ~hVen~e~~pi~~~ev~lmKa~ds~~-qPs~eig~~IGKTD~~Gki~Wk 93 (131)
T PF10794_consen 46 FHVENAEGQPIKDFEVTLMKAADSDP-QPSKEIGISIGKTDEEGKIIWK 93 (131)
T ss_pred EEEecCCCCcccceEEEEEeccccCC-CCchhhceeecccCCCCcEEEe
Confidence 44555566665 77888899989887 7777776666545555555444
No 25
>cd04022 C2A_MCTP_PRT_plant C2 domain first repeat found in Multiple C2 domain and Transmembrane region Proteins (MCTP); plant subset. MCTPs are involved in Ca2+ signaling at the membrane. Plant-MCTPs are composed of a variable N-terminal sequence, four C2 domains, two transmembrane regions (TMRs), and a short C-terminal sequence. It is one of four protein classes that are anchored to membranes via a transmembrane region; the others being synaptotagmins, extended synaptotagmins, and ferlins. MCTPs are the only membrane-bound C2 domain proteins that contain two functional TMRs. MCTPs are unique in that they bind Ca2+ but not phospholipids. C2 domains fold into an 8-standed beta-sandwich that can adopt 2 structural arrangements: Type I and Type II, distinguished by a circular permutation involving their N- and C-terminal beta strands. Many C2 domains are Ca2+-dependent membrane-targeting modules that bind a wide variety of substances including bind phospholipids, inositol polyphosphates
Probab=34.35 E-value=1.3e+02 Score=19.27 Aligned_cols=32 Identities=16% Similarity=0.310 Sum_probs=17.2
Q ss_pred CCeeeCCeeEEEEeCCCCCCeEEEEEEEEeCCC
Q psy1047 3 APVFSDFVYQATVKENQPIGTLVITLKAADADP 35 (127)
Q Consensus 3 ~P~F~~~~y~~~v~E~~~~g~~v~~~~a~D~D~ 35 (127)
.|+|.+ .+.+.+.+........+.+...|.|.
T Consensus 44 nP~Wne-~f~f~~~~~~~~~~~~l~~~V~d~~~ 75 (127)
T cd04022 44 NPVWNE-KLVFNVSDPSRLSNLVLEVYVYNDRR 75 (127)
T ss_pred CCccce-EEEEEccCHHHccCCeEEEEEeeCCC
Confidence 477743 56666654332233455566666554
No 26
>cd04016 C2_Tollip C2 domain present in Toll-interacting protein (Tollip). Tollip is a part of the Interleukin-1 receptor (IL-1R) signaling pathway. Tollip is proposed to link serine/threonine kinase IRAK to IL-1Rs as well as inhibiting phosphorylation of IRAK. There is a single C2 domain present in Tollip. C2 domains fold into an 8-standed beta-sandwich that can adopt 2 structural arrangements: Type I and Type II, distinguished by a circular permutation involving their N- and C-terminal beta strands. Many C2 domains are Ca2+-dependent membrane-targeting modules that bind a wide variety of substances including bind phospholipids, inositol polyphosphates, and intracellular proteins. Most C2 domain proteins are either signal transduction enzymes that contain a single C2 domain, such as protein kinase C, or membrane trafficking proteins which contain at least two C2 domains, such as synaptotagmin 1. However, there are a few exceptions to this including RIM isoforms and some splice varian
Probab=30.30 E-value=1.6e+02 Score=19.16 Aligned_cols=54 Identities=15% Similarity=0.203 Sum_probs=30.2
Q ss_pred CCeeeCCeeEEEEeCCCCCCeEEEEEEEEeCCCCCCC---CcEEEEEE----eC-CCCCcEEEeCCC
Q psy1047 3 APVFSDFVYQATVKENQPIGTLVITLKAADADPPGMN---SKISYSVR----GG-DGLGYFTIDNEG 61 (127)
Q Consensus 3 ~P~F~~~~y~~~v~E~~~~g~~v~~~~a~D~D~~g~n---~~i~y~i~----~~-~~~~~F~i~~~t 61 (127)
.|+|. ..|.+.+.++. ..+.+...|.|..+.+ |..+..+. .| ....||.|.+..
T Consensus 46 nP~WN-e~F~f~v~~~~----~~l~~~V~d~d~~~~dd~iG~~~i~l~~~~~~g~~~~~W~~L~~~~ 107 (121)
T cd04016 46 NPRWN-KTIQCTLPEGV----DSIYIEIFDERAFTMDERIAWTHITIPESVFNGETLDDWYSLSGKQ 107 (121)
T ss_pred CCccC-eEEEEEecCCC----cEEEEEEEeCCCCcCCceEEEEEEECchhccCCCCccccEeCcCcc
Confidence 58884 47777776542 2356777777753333 22334332 22 246788887643
No 27
>KOG4221|consensus
Probab=28.08 E-value=4e+02 Score=25.05 Aligned_cols=56 Identities=14% Similarity=0.113 Sum_probs=38.3
Q ss_pred CCCcE-EEEEEe--CCCCCcEEEeCCCCeEEEeeeccccCCCCCCceeeEEEccCCCCCCCCCeEEEEEEEEECCCCCcc
Q psy1047 38 MNSKI-SYSVRG--GDGLGYFTIDNEGPLNHFRIVPTKIDPGTKKKEYNIVVLKLLDREIAPLGYNLTLRAVDKGTPPRE 114 (127)
Q Consensus 38 ~n~~i-~y~i~~--~~~~~~F~i~~~tg~i~~~~~~~~~~~~~~~~~~~l~~~~~ld~E~~~~~y~l~v~a~D~g~p~~~ 114 (127)
.|+.| .|++-. ++...++.++.++-+. .+. +.|... +|.+.|+|.+...+..+
T Consensus 547 ~n~~I~~yk~~ys~~~~~~~~~~~~n~~e~--------------------ti~---gL~k~T-eY~~~vvA~N~~G~g~s 602 (1381)
T KOG4221|consen 547 GNGPITGYKLFYSEDDTGKELRVENNATEY--------------------TIN---GLEKYT-EYSIRVVAYNSAGSGVS 602 (1381)
T ss_pred CCCCceEEEEEEEcCCCCceEEEecCccEE--------------------Eee---cCCCcc-ceEEEEEEecCCCCCCC
Confidence 45555 566633 3467788888776665 332 457777 89999999998777776
Q ss_pred eeE
Q psy1047 115 TYK 117 (127)
Q Consensus 115 st~ 117 (127)
+..
T Consensus 603 S~~ 605 (1381)
T KOG4221|consen 603 SAD 605 (1381)
T ss_pred CCc
Confidence 653
No 28
>cd08692 C2B_Tac2-N C2 domain second repeat found in Tac2-N (Tandem C2 protein in Nucleus). Tac2-N contains two C2 domains and a short C-terminus including a WHXL motif, which are key in stabilizing transport vesicles to the plasma membrane by binding to a plasma membrane. However unlike the usual carboxyl-terminal-type (C-type) tandem C2 proteins, it lacks a transmembrane domain, a Slp-homology domain, and a Munc13-1-interacting domain. Homology search analysis indicate that no known protein motifs are located in its N-terminus, making Tac2-N a novel class of Ca2+-independent, C-type tandem C2 proteins. The C2 domain was first identified in PKC. C2 domains fold into an 8-standed beta-sandwich that can adopt 2 structural arrangements: Type I and Type II, distinguished by a circular permutation involving their N- and C-terminal beta strands. Many C2 domains are Ca2+-dependent membrane-targeting modules that bind a wide variety of substances including bind phospholipids, inositol polypho
Probab=26.77 E-value=2e+02 Score=19.28 Aligned_cols=35 Identities=6% Similarity=0.052 Sum_probs=23.1
Q ss_pred CCCeeeCCeeEEEEeCCCCCCeEEEEEEEEeCCCCCCC
Q psy1047 2 NAPVFSDFVYQATVKENQPIGTLVITLKAADADPPGMN 39 (127)
Q Consensus 2 N~P~F~~~~y~~~v~E~~~~g~~v~~~~a~D~D~~g~n 39 (127)
|+|+|.. .+.|.|++... ..-+.+...|.|..+.|
T Consensus 63 ~~P~fNE-sF~Fdv~~~~~--~v~l~v~v~d~~~~~~n 97 (135)
T cd08692 63 GQVKWGE-TMIFPVTQQEH--GIQFLIKLYSRSSVRRK 97 (135)
T ss_pred CCceecc-eEEEeCCchhh--eeEEEEEEEeCCCCcCC
Confidence 6799944 77788876532 35566777788863344
No 29
>PF14302 DUF4377: Domain of unknown function (DUF4377)
Probab=24.07 E-value=1.8e+02 Score=17.60 Aligned_cols=36 Identities=17% Similarity=0.274 Sum_probs=28.2
Q ss_pred cCCCCCCCCCeEEEEEEEEECCCCCc--ceeEEEEEEeE
Q psy1047 88 KLLDREIAPLGYNLTLRAVDKGTPPR--ETYKATPRNGL 124 (127)
Q Consensus 88 ~~ld~E~~~~~y~l~v~a~D~g~p~~--~st~~i~i~~~ 124 (127)
..+++|..- .|.|.|......+|+. ++..-+.++++
T Consensus 40 eGF~yE~Gy-~Y~L~Vk~~~~~nppaD~ss~~Y~L~~vi 77 (80)
T PF14302_consen 40 EGFEYEPGY-EYVLRVKRTPVANPPADASSYRYVLIKVI 77 (80)
T ss_pred cCcCcCCCc-EEEEEEEEEECCCCCCCCCceeEEEEEEE
Confidence 677899888 8999999999888886 45555666654
No 30
>TIGR00845 caca sodium/calcium exchanger 1. This model is specific for the eukaryotic sodium ion/calcium ion exchangers of the Caca family
Probab=22.14 E-value=6e+02 Score=23.05 Aligned_cols=59 Identities=17% Similarity=0.066 Sum_probs=31.0
Q ss_pred eeCCeeEEEEeCCCCCCeEEEEEEEEeCCCCCCCCcEEEEEEeCCCCCcEEEeCCCCeEEEe
Q psy1047 6 FSDFVYQATVKENQPIGTLVITLKAADADPPGMNSKISYSVRGGDGLGYFTIDNEGPLNHFR 67 (127)
Q Consensus 6 F~~~~y~~~v~E~~~~g~~v~~~~a~D~D~~g~n~~i~y~i~~~~~~~~F~i~~~tg~i~~~ 67 (127)
|.-..-...|.|+ .|+.-..+.-...|. +..-.+.|+...|.....-......|.+++.
T Consensus 402 i~Fe~~~Y~V~En--~GtV~VtV~R~GGdl-~~tVsVdY~T~DGTA~AG~DY~~~sGTLtF~ 460 (928)
T TIGR00845 402 IFFEPGHYTCLEN--CGTVALTVVRRGGDL-TNTVYVDYRTEDGTANAGSDYEFTEGTLVFK 460 (928)
T ss_pred EEecCCeEEEeec--CcEEEEEEEEccCCC-CceEEEEEEccCCccCCCCCccccCceEEEC
Confidence 3333445568887 466666665544354 3335678887654432111112345766555
No 31
>KOG4680|consensus
Probab=20.46 E-value=3e+02 Score=18.94 Aligned_cols=35 Identities=26% Similarity=0.301 Sum_probs=25.4
Q ss_pred eEEEEeCCCCCCeEEEEEEEEeCCCCCCCCcEEEEE
Q psy1047 11 YQATVKENQPIGTLVITLKAADADPPGMNSKISYSV 46 (127)
Q Consensus 11 y~~~v~E~~~~g~~v~~~~a~D~D~~g~n~~i~y~i 46 (127)
....++...|+|+-+++++|-|.+. +.-.-|+++.
T Consensus 108 hsq~LPg~tPPG~Y~lkm~~~d~~~-~~LTCisfsf 142 (153)
T KOG4680|consen 108 HSQVLPGYTPPGSYVLKMTAYDAKG-KELTCISFSF 142 (153)
T ss_pred eeEeccCcCCCceEEEEEEeecCCC-CEEEEEEEEE
Confidence 4566788899999999999999986 3333344443
No 32
>cd08389 C2A_Synaptotagmin-14_16 C2A domain first repeat present in Synaptotagmins 14 and 16. Synaptotagmin 14 and 16 are membrane-trafficking proteins in specific tissues outside the brain. Both of these contain C-terminal tandem C2 repeats, but only Synaptotagmin 14 has an N-terminal transmembrane domain and a putative fatty-acylation site. Previously all synaptotagmins were thought to be calcium sensors in the regulation of neurotransmitter release and hormone secretion, but it has been shown that not all of them bind calcium and this is indeed the case here. Of the 17 identified synaptotagmins only 8 bind calcium (1-3, 5-7, 9, 10). The function of the two C2 domains that bind calcium are: regulating the fusion step of synaptic vesicle exocytosis (C2A) and binding to phosphatidyl-inositol-3,4,5-triphosphate (PIP3) in the absence of calcium ions and to phosphatidylinositol bisphosphate (PIP2) in their presence (C2B). C2B also regulates also the recycling step of synaptic vesicle
Probab=20.27 E-value=1.7e+02 Score=18.77 Aligned_cols=32 Identities=13% Similarity=0.458 Sum_probs=17.5
Q ss_pred CCCeeeCCeeEEE-EeCCCCCCeEEEEEEEEeCCC
Q psy1047 2 NAPVFSDFVYQAT-VKENQPIGTLVITLKAADADP 35 (127)
Q Consensus 2 N~P~F~~~~y~~~-v~E~~~~g~~v~~~~a~D~D~ 35 (127)
..|+|.. .+.+. +++.. .....+.+...|.|.
T Consensus 61 ~nP~fnE-~F~f~~i~~~~-l~~~~L~~~V~~~~~ 93 (124)
T cd08389 61 PNPVFNE-TFTFSRVEPEE-LNNMALRFRLYGVER 93 (124)
T ss_pred CCCcccC-EEEECCCCHHH-hccCEEEEEEEECCC
Confidence 3577744 56665 55443 234445566666664
Done!