Query psy7564
Match_columns 115
No_of_seqs 188 out of 1343
Neff 8.3
Searched_HMMs 46136
Date Fri Aug 16 21:38:33 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy7564.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/7564hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 cd00031 CA Cadherin repeat dom 99.9 1.9E-20 4.2E-25 129.1 14.2 96 12-114 44-141 (199)
2 KOG1219|consensus 99.8 2.1E-20 4.5E-25 159.3 13.0 96 13-115 2625-2720(4289)
3 KOG4289|consensus 99.8 4.4E-20 9.4E-25 153.3 11.0 95 13-114 317-413 (2531)
4 KOG4289|consensus 99.8 2E-20 4.3E-25 155.3 6.8 96 12-114 211-308 (2531)
5 KOG1219|consensus 99.7 2E-17 4.3E-22 141.6 10.7 96 13-115 1001-1098(4289)
6 smart00112 CA Cadherin repeats 99.5 7.5E-14 1.6E-18 83.8 8.3 55 12-66 23-79 (79)
7 KOG1834|consensus 99.3 3E-11 6.4E-16 95.9 9.8 92 15-114 86-189 (952)
8 PF00028 Cadherin: Cadherin do 98.9 9.4E-09 2E-13 63.2 7.8 48 12-59 43-93 (93)
9 PF00028 Cadherin: Cadherin do 98.5 3.1E-07 6.8E-12 56.3 5.6 35 73-114 1-35 (93)
10 cd00031 CA Cadherin repeat dom 98.3 6.5E-06 1.4E-10 56.4 7.9 49 12-60 149-199 (199)
11 smart00736 CADG Dystroglycan-t 94.8 0.37 8E-06 29.7 7.6 46 16-63 51-96 (97)
12 PF08266 Cadherin_2: Cadherin- 93.9 0.058 1.3E-06 32.9 2.4 22 12-33 46-67 (84)
13 KOG1834|consensus 93.3 0.18 4E-06 41.5 5.0 47 12-59 195-243 (952)
14 TIGR03660 T1SS_rpt_143 T1SS-14 88.2 5.4 0.00012 26.5 7.5 56 21-80 69-127 (137)
15 PF05345 He_PIG: Putative Ig d 87.5 2.4 5.2E-05 23.0 4.6 30 15-46 18-48 (49)
16 PF12245 Big_3_2: Bacterial Ig 84.2 5.5 0.00012 22.4 5.8 33 31-63 20-52 (60)
17 PF08758 Cadherin_pro: Cadheri 81.7 9.7 0.00021 23.4 6.4 35 13-49 48-82 (90)
18 cd00146 PKD polycystic kidney 79.0 7.7 0.00017 22.4 4.8 29 29-57 52-80 (81)
19 PF08758 Cadherin_pro: Cadheri 77.9 3.1 6.6E-05 25.6 2.9 42 64-105 2-49 (90)
20 PF13750 Big_3_3: Bacterial Ig 76.2 21 0.00045 24.1 9.2 38 32-70 13-51 (158)
21 PF13750 Big_3_3: Bacterial Ig 74.8 13 0.00027 25.2 5.5 30 30-59 119-148 (158)
22 smart00089 PKD Repeats in poly 73.4 14 0.00031 21.1 4.9 30 28-58 49-78 (79)
23 KOG3597|consensus 69.4 30 0.00065 27.5 7.1 56 50-114 24-80 (442)
24 PF02494 HYR: HYR domain; Int 69.4 16 0.00034 21.4 4.5 26 33-58 56-81 (81)
25 PF07495 Y_Y_Y: Y_Y_Y domain; 59.9 25 0.00055 19.3 4.0 27 32-58 37-65 (66)
26 TIGR01965 VCBS_repeat VCBS rep 58.4 43 0.00094 21.0 6.2 40 35-80 58-97 (99)
27 cd02848 Chitinase_N_term Chiti 56.8 14 0.00031 23.5 2.7 33 28-60 74-106 (106)
28 PF08329 ChitinaseA_N: Chitina 54.9 18 0.00039 23.9 3.1 36 28-63 77-112 (133)
29 PF13754 Big_3_4: Bacterial Ig 48.8 44 0.00095 18.1 4.5 29 20-50 12-40 (54)
30 PF15418 DUF4625: Domain of un 46.6 73 0.0016 20.9 5.0 26 33-58 106-131 (132)
31 PF03160 Calx-beta: Calx-beta 46.0 65 0.0014 19.3 6.3 25 55-81 2-26 (100)
32 TIGR00845 caca sodium/calcium 35.3 2.5E+02 0.0053 24.8 7.5 29 52-81 515-543 (928)
33 PRK08577 hypothetical protein; 35.3 99 0.0022 19.9 4.3 37 10-47 32-68 (136)
34 PF05688 DUF824: Salmonella re 31.5 70 0.0015 17.2 2.6 16 50-65 13-28 (47)
35 COG1098 VacB Predicted RNA bin 23.0 26 0.00057 23.0 -0.1 33 73-115 42-74 (129)
36 COG4288 Uncharacterized protei 21.9 2.3E+02 0.005 18.2 5.3 44 52-107 68-111 (124)
37 PF13290 CHB_HEX_C_1: Chitobia 21.2 1.1E+02 0.0024 17.5 2.3 16 96-115 13-28 (67)
38 PF12100 DUF3576: Domain of un 21.1 2.3E+02 0.005 17.9 5.6 43 14-56 23-66 (103)
39 cd05775 Ig_SLAM-CD84_like_N N- 20.1 2.1E+02 0.0045 17.0 5.6 32 12-44 53-84 (97)
No 1
>cd00031 CA Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here
Probab=99.86 E-value=1.9e-20 Score=129.11 Aligned_cols=96 Identities=41% Similarity=0.553 Sum_probs=89.9
Q ss_pred ceEEeCCceEEEEcCCCCccCcCEEEEEEEEEeCCCC--eEEEEEEEEEEeCCCCCCeEeccceEEEEecCCCCCcEEEe
Q psy7564 12 KPALHPPYGVLLTKKPLDYEATRIYNLSVTATNMAGA--KAVCSVIVHILDRNDNTPYFTQLNYTGYVSEAAPIGALVLT 89 (115)
Q Consensus 12 ~f~id~~tG~i~~~~~lD~E~~~~~~l~v~a~d~~~~--~~~~~v~I~V~D~Nd~~P~f~~~~y~~~v~E~~~~gt~v~~ 89 (115)
.|.|++.+|.|++.+.||||....|.+.|.|.|.+.. .+...++|.|.|+||++|.|....|.+.+.|+.++|+.++
T Consensus 44 ~F~i~~~tG~l~~~~~lD~e~~~~~~l~v~a~D~g~~~~~~~~~v~I~V~d~Nd~~P~~~~~~~~~~v~e~~~~~~~i~- 122 (199)
T cd00031 44 LFSIDPNTGVITTTKPLDREEQSEYTLTVVASDGGGPPLSSTATVTVTVLDVNDNPPVFEQSSYEASVPENAPPGTVVG- 122 (199)
T ss_pred cEEEeCCCCEEEECCCCCCcCCceEEEEEEEEECCcCcceeEEEEEEEEccCCCCCCcccccceEEEEeCCCCCCCEEE-
Confidence 4669999999999999999999999999999997544 3889999999999999999998899999999999999999
Q ss_pred CCCCCeEEEEEeCCCCCCCeEEEEE
Q psy7564 90 NTSLPLVIRAEDKDSETNALLDGST 114 (115)
Q Consensus 90 ~~~~~l~v~A~D~D~~~n~~i~y~i 114 (115)
++.|+|+|.+.|+.+.|+|
T Consensus 123 ------~~~a~D~D~~~~~~~~y~l 141 (199)
T cd00031 123 ------TVTATDADSGENAKLTYSI 141 (199)
T ss_pred ------EEEEEcCCCCCCccEEEEE
Confidence 8999999999899999997
No 2
>KOG1219|consensus
Probab=99.84 E-value=2.1e-20 Score=159.30 Aligned_cols=96 Identities=31% Similarity=0.389 Sum_probs=91.9
Q ss_pred eEEeCCceEEEEcCCCCccCcCEEEEEEEEEeCCCCeEEEEEEEEEEeCCCCCCeEeccceEEEEecCCCCCcEEEeCCC
Q psy7564 13 PALHPPYGVLLTKKPLDYEATRIYNLSVTATNMAGAKAVCSVIVHILDRNDNTPYFTQLNYTGYVSEAAPIGALVLTNTS 92 (115)
Q Consensus 13 f~id~~tG~i~~~~~lD~E~~~~~~l~v~a~d~~~~~~~~~v~I~V~D~Nd~~P~f~~~~y~~~v~E~~~~gt~v~~~~~ 92 (115)
|++|++||.|.+.++||+|+.++|++.|.|++.+...+.+.|.|+|.|+|||+|.|....|.+.+.||++.|+.|.
T Consensus 2625 Fsvdr~TG~i~v~ksLD~E~kk~yqi~v~a~~~~~vva~tsv~vqVkDvNDNaPvFe~d~y~f~i~En~pvGtsV~---- 2700 (4289)
T KOG1219|consen 2625 FSVDRNTGMIKVNKSLDHEKKKSYQIKVKATCGQWVVAETSVFVQVKDVNDNAPVFEKDPYLFIIEENSPVGTSVI---- 2700 (4289)
T ss_pred eEEcCCCceEEeccccchhhhceEEEEEEeecCCceEEEEEEEEEeecccCCCccccCCceeEEEeccCCCCceEE----
Confidence 4599999999999999999999999999999976778899999999999999999999999999999999999999
Q ss_pred CCeEEEEEeCCCCCCCeEEEEEC
Q psy7564 93 LPLVIRAEDKDSETNALLDGSTN 115 (115)
Q Consensus 93 ~~l~v~A~D~D~~~n~~i~y~i~ 115 (115)
+++|.|.|.+.||+|+|+|.
T Consensus 2701 ---qf~AsD~Ds~~nGqirysl~ 2720 (4289)
T KOG1219|consen 2701 ---QFHASDMDSGNNGQIRYSLT 2720 (4289)
T ss_pred ---EEEeeccCCCCCceEEEEEc
Confidence 89999999999999999984
No 3
>KOG4289|consensus
Probab=99.83 E-value=4.4e-20 Score=153.34 Aligned_cols=95 Identities=33% Similarity=0.446 Sum_probs=89.8
Q ss_pred eEEeCCceEEEEcCCCCccCcCEEEEEEEEEeCC--CCeEEEEEEEEEEeCCCCCCeEeccceEEEEecCCCCCcEEEeC
Q psy7564 13 PALHPPYGVLLTKKPLDYEATRIYNLSVTATNMA--GAKAVCSVIVHILDRNDNTPYFTQLNYTGYVSEAAPIGALVLTN 90 (115)
Q Consensus 13 f~id~~tG~i~~~~~lD~E~~~~~~l~v~a~d~~--~~~~~~~v~I~V~D~Nd~~P~f~~~~y~~~v~E~~~~gt~v~~~ 90 (115)
|.||+.+|.|.+..+||||+...|+|.|.|+|.| +...++.|.|+|.|+|||+|+|....|.+.|.|+..+++.|+
T Consensus 317 f~in~rSGvI~T~a~lDRE~~~~y~L~VeAsDqG~~pgp~Ta~V~itV~D~NDNaPqFse~~Yvvqv~Edvt~~avvl-- 394 (2531)
T KOG4289|consen 317 FEINPRSGVISTRAPLDREELESYQLDVEASDQGRPPGPRTAMVEITVEDENDNAPQFSEKRYVVQVREDVTPPAVVL-- 394 (2531)
T ss_pred eEEcCccceeeccCccCHHhhhheEEEEEeccCCCCCCCceEEEEEEEEecCCCCccccccceEEEecccCCCCceEE--
Confidence 4599999999999999999999999999999986 234489999999999999999999999999999999999999
Q ss_pred CCCCeEEEEEeCCCCCCCeEEEEE
Q psy7564 91 TSLPLVIRAEDKDSETNALLDGST 114 (115)
Q Consensus 91 ~~~~l~v~A~D~D~~~n~~i~y~i 114 (115)
+|.|+|.|.|.||.|+|+|
T Consensus 395 -----rV~AtDrD~g~Ng~VHYsi 413 (2531)
T KOG4289|consen 395 -----RVTATDRDKGTNGKVHYSI 413 (2531)
T ss_pred -----EEEecccCCCcCceEEEEe
Confidence 9999999999999999998
No 4
>KOG4289|consensus
Probab=99.82 E-value=2e-20 Score=155.33 Aligned_cols=96 Identities=35% Similarity=0.410 Sum_probs=91.9
Q ss_pred ceEEeCCceEEEEcCCCCccCcCEEEEEEEEEeCC--CCeEEEEEEEEEEeCCCCCCeEeccceEEEEecCCCCCcEEEe
Q psy7564 12 KPALHPPYGVLLTKKPLDYEATRIYNLSVTATNMA--GAKAVCSVIVHILDRNDNTPYFTQLNYTGYVSEAAPIGALVLT 89 (115)
Q Consensus 12 ~f~id~~tG~i~~~~~lD~E~~~~~~l~v~a~d~~--~~~~~~~v~I~V~D~Nd~~P~f~~~~y~~~v~E~~~~gt~v~~ 89 (115)
.|+||+.+|.|++++.||||....+.++|+|.|.+ .++++++|+|.|.|.|||.|.|.+..|..++.||.++|+.|.
T Consensus 211 lFslD~~sG~irta~~lDREt~e~HvlrVtA~d~~~P~~SAtttv~V~V~D~nDhsPvFEq~~Y~e~lREn~evGy~vL- 289 (2531)
T KOG4289|consen 211 LFSLDPMSGAIRTAKSLDRETKETHVLRVTAQDHGDPRRSATTTVTVLVLDTNDHSPVFEQDEYREELRENLEVGYEVL- 289 (2531)
T ss_pred cEeeccccccchhhhhhhhhhhheeEEEEEeeecCCCcccceeEEEEEEeecCCCCcccchhHHHHHHhhccccCceEE-
Confidence 36699999999999999999999999999999985 578899999999999999999999999999999999999999
Q ss_pred CCCCCeEEEEEeCCCCCCCeEEEEE
Q psy7564 90 NTSLPLVIRAEDKDSETNALLDGST 114 (115)
Q Consensus 90 ~~~~~l~v~A~D~D~~~n~~i~y~i 114 (115)
+|+|+|.|.|+|+.|.|++
T Consensus 290 ------tvrAtD~Dsp~Nani~Yrl 308 (2531)
T KOG4289|consen 290 ------TVRATDGDSPPNANIRYRL 308 (2531)
T ss_pred ------EEEeccCCCCCCCceEEEe
Confidence 8999999999999999986
No 5
>KOG1219|consensus
Probab=99.73 E-value=2e-17 Score=141.57 Aligned_cols=96 Identities=31% Similarity=0.411 Sum_probs=91.5
Q ss_pred eEEeCCceEEEEcCCCCccCcCEEEEEEEEEeCC--CCeEEEEEEEEEEeCCCCCCeEeccceEEEEecCCCCCcEEEeC
Q psy7564 13 PALHPPYGVLLTKKPLDYEATRIYNLSVTATNMA--GAKAVCSVIVHILDRNDNTPYFTQLNYTGYVSEAAPIGALVLTN 90 (115)
Q Consensus 13 f~id~~tG~i~~~~~lD~E~~~~~~l~v~a~d~~--~~~~~~~v~I~V~D~Nd~~P~f~~~~y~~~v~E~~~~gt~v~~~ 90 (115)
|+||..+|.|++.+.||||..+.|-+++.|+|.| +.++.+.+.|.|.|+|||+|+|.++.|..+|.|+++.+..|.
T Consensus 1001 FsId~~tG~irTl~~lDrE~ks~YwltveA~D~gt~~~ssv~~vyI~ieDvNDn~Pq~s~pvy~asI~enSp~~vsiv-- 1078 (4289)
T KOG1219|consen 1001 FSIDSTTGSIRTLKALDREKKSSYWLTVEAKDLGTVPLSSVCEVYIEIEDVNDNVPQFSSPVYYASISENSPETVSIV-- 1078 (4289)
T ss_pred EEecCCcceEeechhhchhhcceEEEEEEEEecCCCccccceeEEEEEEecCCCCcccCCceEeeeeccCCCCceEEE--
Confidence 6699999999999999999999999999999997 567889999999999999999999999999999999999999
Q ss_pred CCCCeEEEEEeCCCCCCCeEEEEEC
Q psy7564 91 TSLPLVIRAEDKDSETNALLDGSTN 115 (115)
Q Consensus 91 ~~~~l~v~A~D~D~~~n~~i~y~i~ 115 (115)
++.|.|+|...|+++.|.|.
T Consensus 1079 -----q~ea~D~Dsssn~kLmykI~ 1098 (4289)
T KOG1219|consen 1079 -----QAEANDPDSSSNQKLMYKIT 1098 (4289)
T ss_pred -----EeccCCCCcccCcceEEEEc
Confidence 89999999989999999984
No 6
>smart00112 CA Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
Probab=99.53 E-value=7.5e-14 Score=83.81 Aligned_cols=55 Identities=40% Similarity=0.583 Sum_probs=51.3
Q ss_pred ceEEeCCceEEEEcCCCCccCcCEEEEEEEEEeCCC--CeEEEEEEEEEEeCCCCCC
Q psy7564 12 KPALHPPYGVLLTKKPLDYEATRIYNLSVTATNMAG--AKAVCSVIVHILDRNDNTP 66 (115)
Q Consensus 12 ~f~id~~tG~i~~~~~lD~E~~~~~~l~v~a~d~~~--~~~~~~v~I~V~D~Nd~~P 66 (115)
.|.|++.+|.|++.++||||....|.|.|.|.|.++ +.+.+.|+|+|.|+|||+|
T Consensus 23 ~F~i~~~tg~i~~~~~LD~e~~~~y~l~v~a~D~~~~~~~~~~~v~I~V~D~Nd~~P 79 (79)
T smart00112 23 LFSIDPETGEITTTKPLDREEQPEYTLTVEATDGGGPPLSSTATVTVTVLDVNDNAP 79 (79)
T ss_pred EEEEeCCccEEEeCCccCeeCCCeEEEEEEEEECCCCCcccEEEEEEEEEECCCCCC
Confidence 466999999999999999999999999999999875 7889999999999999998
No 7
>KOG1834|consensus
Probab=99.28 E-value=3e-11 Score=95.92 Aligned_cols=92 Identities=25% Similarity=0.338 Sum_probs=76.8
Q ss_pred EeCCce--EEEEcCCCCccCcCEEEEEEEEEeCC--C------CeEEEEEEEEEEeCCCCCCeEeccceEEEEecCCCCC
Q psy7564 15 LHPPYG--VLLTKKPLDYEATRIYNLSVTATNMA--G------AKAVCSVIVHILDRNDNTPYFTQLNYTGYVSEAAPIG 84 (115)
Q Consensus 15 id~~tG--~i~~~~~lD~E~~~~~~l~v~a~d~~--~------~~~~~~v~I~V~D~Nd~~P~f~~~~y~~~v~E~~~~g 84 (115)
+|..|| .|+.+.+||.|.+..|+++|+|.|.| + .+..++|.|+|.|+|+++|.|..+.|.+.|.|....
T Consensus 86 vdK~TGegvlRaK~~lDCelqkeytf~iQAydCg~gpdgtn~kKShkatvhIrVkDvNe~AP~f~ep~Yka~V~EGK~y- 164 (952)
T KOG1834|consen 86 VDKYTGEGVLRAKEPLDCELQKEYTFTIQAYDCGNGPDGTNTKKSHKATVHIRVKDVNEFAPVFKEPWYKAHVTEGKVY- 164 (952)
T ss_pred EeccCCceEEeecCcccccccccceEEEEEEecCCCCCccccccccceEEEEEeccccccCchhcccceeeEEecceee-
Confidence 677776 58888899999999999999999974 1 355789999999999999999999999999999644
Q ss_pred cEEEeCCCCCeEEEEEeCCCC-CCCeEE-EEE
Q psy7564 85 ALVLTNTSLPLVIRAEDKDSE-TNALLD-GST 114 (115)
Q Consensus 85 t~v~~~~~~~l~v~A~D~D~~-~n~~i~-y~i 114 (115)
..|+ +|.|.|.|.+ ++++|- |.|
T Consensus 165 d~il-------~veAiD~DCspq~sqIC~YEI 189 (952)
T KOG1834|consen 165 DSIL-------RVEAIDKDCSPQYSQICEYEI 189 (952)
T ss_pred eeeE-------EEEeecCCCCCcccceeEEEe
Confidence 4455 8999999966 777764 443
No 8
>PF00028 Cadherin: Cadherin domain; InterPro: IPR002126 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion.; GO: 0005509 calcium ion binding, 0007156 homophilic cell adhesion, 0016020 membrane; PDB: 2A4E_A 2A4C_B 2O72_A 2QVI_A 1NCJ_A 3Q2W_A 3Q2N_A 3LNH_B 3LNI_A 3Q2L_A ....
Probab=98.92 E-value=9.4e-09 Score=63.20 Aligned_cols=48 Identities=31% Similarity=0.476 Sum_probs=41.7
Q ss_pred ceEEeCCceEEEEcCCCCccCcCEEEEEEEEEeC-C--CCeEEEEEEEEEE
Q psy7564 12 KPALHPPYGVLLTKKPLDYEATRIYNLSVTATNM-A--GAKAVCSVIVHIL 59 (115)
Q Consensus 12 ~f~id~~tG~i~~~~~lD~E~~~~~~l~v~a~d~-~--~~~~~~~v~I~V~ 59 (115)
.|.|++.+|.|++.++||||....|.|.|.|+|. + ++.+++.|+|+|.
T Consensus 43 ~F~I~~~tg~i~~~~~LD~E~~~~y~l~v~a~D~~~~~~~~~~~~V~I~V~ 93 (93)
T PF00028_consen 43 LFSIDPNTGEISLKKPLDRETQSSYQLTVRATDSGGSPPLSSTATVTINVL 93 (93)
T ss_dssp SEEEETTTTEEEESSSSCTTTTSEEEEEEEEEETTTSSEEEEEEEEEEEEE
T ss_pred ceEEeeeeeccccceecCcccCCEEEEEEEEEECCCCCCCEEEEEEEEEEC
Confidence 3559999999999999999999999999999999 4 3567778888774
No 9
>PF00028 Cadherin: Cadherin domain; InterPro: IPR002126 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion.; GO: 0005509 calcium ion binding, 0007156 homophilic cell adhesion, 0016020 membrane; PDB: 2A4E_A 2A4C_B 2O72_A 2QVI_A 1NCJ_A 3Q2W_A 3Q2N_A 3LNH_B 3LNI_A 3Q2L_A ....
Probab=98.52 E-value=3.1e-07 Score=56.33 Aligned_cols=35 Identities=37% Similarity=0.405 Sum_probs=33.7
Q ss_pred eEEEEecCCCCCcEEEeCCCCCeEEEEEeCCCCCCCeEEEEE
Q psy7564 73 YTGYVSEAAPIGALVLTNTSLPLVIRAEDKDSETNALLDGST 114 (115)
Q Consensus 73 y~~~v~E~~~~gt~v~~~~~~~l~v~A~D~D~~~n~~i~y~i 114 (115)
|.+.|+|++++|+.|+ ++.|.|+|.+.|+.+.|+|
T Consensus 1 Y~~~v~E~~~~g~~v~-------~v~a~D~D~~~n~~i~y~i 35 (93)
T PF00028_consen 1 YSFSVPENAPPGTVVG-------QVTATDPDSGPNSQITYSI 35 (93)
T ss_dssp EEEEEETTGSTSSEEE-------EEEEEESSTSTTSSEEEEE
T ss_pred CEEEEECCCCCCCEEE-------EEEEEeCCCCCCceEEEEE
Confidence 7899999999999999 8999999999999999997
No 10
>cd00031 CA Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here
Probab=98.26 E-value=6.5e-06 Score=56.41 Aligned_cols=49 Identities=33% Similarity=0.490 Sum_probs=44.3
Q ss_pred ceEEeCCceEEEEcCCCCccCcCEEEEEEEEEeCC--CCeEEEEEEEEEEe
Q psy7564 12 KPALHPPYGVLLTKKPLDYEATRIYNLSVTATNMA--GAKAVCSVIVHILD 60 (115)
Q Consensus 12 ~f~id~~tG~i~~~~~lD~E~~~~~~l~v~a~d~~--~~~~~~~v~I~V~D 60 (115)
.|.|++.+|.|.+.+.||+|....|.+.|.|+|.+ ++.++..++|.|.|
T Consensus 149 ~f~i~~~~G~i~~~~~ld~e~~~~~~l~v~a~D~~~~~~~~~~~i~i~v~d 199 (199)
T cd00031 149 LFSIDPNTGIITLAKPLDREEKSSYELTVVATDGGGPPLSSTATVTVTVLD 199 (199)
T ss_pred EEEEeCCceEEEeCCccCCccCceEEEEEEEEECCCCCceeEEEEEEEEEC
Confidence 46699999999999999999999999999999987 67888888888875
No 11
>smart00736 CADG Dystroglycan-type cadherin-like domains. Cadherin-homologous domains present in metazoan dystroglycans and alpha/epsilon sarcoglycans, yeast Axl2p and in a very large protein from magnetotactic bacteria. Likely to bind calcium ions.
Probab=94.77 E-value=0.37 Score=29.66 Aligned_cols=46 Identities=28% Similarity=0.403 Sum_probs=34.4
Q ss_pred eCCceEEEEcCCCCccCcCEEEEEEEEEeCCCCeEEEEEEEEEEeCCC
Q psy7564 16 HPPYGVLLTKKPLDYEATRIYNLSVTATNMAGAKAVCSVIVHILDRND 63 (115)
Q Consensus 16 d~~tG~i~~~~~lD~E~~~~~~l~v~a~d~~~~~~~~~v~I~V~D~Nd 63 (115)
|+.++.+.=. +.. +....|.+.+.|+|..+......++|.|.+.|+
T Consensus 51 d~~~~~~~Gt-P~~-~~~g~~~i~v~a~D~~g~~~~~~f~i~V~~~~~ 96 (97)
T smart00736 51 DSDTGTLSGT-PTN-SDVGSLSLKVTATDSSGASASDTFTITVVNTND 96 (97)
T ss_pred eCCCCEEEEE-CCC-CCCcEEEEEEEEEECCCCEEEEEEEEEEeCCCC
Confidence 5556555442 333 335679999999998888888899999999887
No 12
>PF08266 Cadherin_2: Cadherin-like; InterPro: IPR013164 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion. This entry represents a cadherin domain that is usually found at the N terminus of cadherin proteins.; PDB: 1WUZ_A 1WYJ_A.
Probab=93.86 E-value=0.058 Score=32.87 Aligned_cols=22 Identities=18% Similarity=0.178 Sum_probs=16.9
Q ss_pred ceEEeCCceEEEEcCCCCccCc
Q psy7564 12 KPALHPPYGVLLTKKPLDYEAT 33 (115)
Q Consensus 12 ~f~id~~tG~i~~~~~lD~E~~ 33 (115)
.|.+++.+|.|++...+|||+-
T Consensus 46 ~~~v~~~tG~L~v~~rIDRE~L 67 (84)
T PF08266_consen 46 YFRVNEKTGDLFVSERIDREEL 67 (84)
T ss_dssp SEEE-TTTSEEEESS--SCCCC
T ss_pred eeEecCCceeEEeCCccCHHHH
Confidence 4779999999999999999973
No 13
>KOG1834|consensus
Probab=93.29 E-value=0.18 Score=41.46 Aligned_cols=47 Identities=28% Similarity=0.356 Sum_probs=38.8
Q ss_pred ceEEeCCceEEEEcCCCCccCcCEEEEEEEEEeCCCCeE--EEEEEEEEE
Q psy7564 12 KPALHPPYGVLLTKKPLDYEATRIYNLSVTATNMAGAKA--VCSVIVHIL 59 (115)
Q Consensus 12 ~f~id~~tG~i~~~~~lD~E~~~~~~l~v~a~d~~~~~~--~~~v~I~V~ 59 (115)
.|.||. .|.|+.+.+|.|.+...|.++|+|.|.|.+.+ ...|+|+|.
T Consensus 195 PFaIdn-~G~irnTekLny~ke~~Y~ltVtAyDCg~kraa~d~lV~v~Vk 243 (952)
T KOG1834|consen 195 PFAIDN-DGNIRNTEKLNYTKEHQYKLTVTAYDCGKKRAASDSLVTVHVK 243 (952)
T ss_pred ceEEcC-CCccccccccccccceeEEEEEEEEecccccccCcceEEEEec
Confidence 355985 69999999999999999999999999986533 367777774
No 14
>TIGR03660 T1SS_rpt_143 T1SS-143 repeat domain. This model represents a domain of about 143 amino acids that may occur singly or in up to 23 tandem repeats in very large proteins in the genus Vibrio, and in related species such as Legionella pneumophila, Photobacterium profundum, Rhodopseudomonas palustris, Shewanella pealeana, and Aeromonas hydrophila. Proteins with these domains represent a subset of a broader set of proteins with a particular signal for type 1 secretion, consisting of several glycine-rich repeats modeled by pfam00353, followed by a C-terminal domain modeled by TIGR03661. Proteins with this domain tend to share several properties with the RtxA (Repeats in Toxin) protein of Vibrio cholerae, including a large size often containing tandemly repeated domains and a C-terminal signal for type 1 secretion.
Probab=88.24 E-value=5.4 Score=26.49 Aligned_cols=56 Identities=29% Similarity=0.408 Sum_probs=38.8
Q ss_pred EEEEcCCCCccC---cCEEEEEEEEEeCCCCeEEEEEEEEEEeCCCCCCeEeccceEEEEecC
Q psy7564 21 VLLTKKPLDYEA---TRIYNLSVTATNMAGAKAVCSVIVHILDRNDNTPYFTQLNYTGYVSEA 80 (115)
Q Consensus 21 ~i~~~~~lD~E~---~~~~~l~v~a~d~~~~~~~~~v~I~V~D~Nd~~P~f~~~~y~~~v~E~ 80 (115)
...+.++||... .-...+.|.|+|..+-.+...+.|+|.| | .|...... ...|.|.
T Consensus 69 tftL~~~lDH~~g~d~l~l~~~v~a~D~DGD~s~~~l~VtI~D--D-~P~~~~~~-~~~V~E~ 127 (137)
T TIGR03660 69 EFTLEGPLDHAAGSDELTLNFPIIATDFDGDTSSITLPVTIVD--D-VPTITDVD-ALTVDED 127 (137)
T ss_pred EEEEcccccCCCCCceEEEeeeEEEEeCCCCccccEEEEEEEC--C-CCeecccc-ceEEecc
Confidence 567778898743 3356788999998766556688888877 5 47765433 3677774
No 15
>PF05345 He_PIG: Putative Ig domain; InterPro: IPR008009 This alignment represents the conserved core region of a ~90 residue repeat found in several haemagglutinins and other cell surface proteins. Sequence similarities to Hyalin (IPR003410 from INTERPRO) and the PKD domain (IPR000601 from INTERPRO) suggest an Ig-like fold so this family may be similar in function to the (IPR003791 from INTERPRO) and (IPR003790 from INTERPRO) protein families.
Probab=87.50 E-value=2.4 Score=22.96 Aligned_cols=30 Identities=27% Similarity=0.392 Sum_probs=22.4
Q ss_pred EeCCceEEEEcCCCCcc-CcCEEEEEEEEEeCC
Q psy7564 15 LHPPYGVLLTKKPLDYE-ATRIYNLSVTATNMA 46 (115)
Q Consensus 15 id~~tG~i~~~~~lD~E-~~~~~~l~v~a~d~~ 46 (115)
+|+.+|.|.-.- +.. ....|.+.|.|+|..
T Consensus 18 ~d~~tG~isGtp--~~~~~~G~y~~~vtatd~~ 48 (49)
T PF05345_consen 18 LDPSTGTISGTP--TSSVQPGTYTFTVTATDGS 48 (49)
T ss_pred EeCCCCEEEeec--CCCccccEEEEEEEEEcCC
Confidence 799999987662 333 336899999999864
No 16
>PF12245 Big_3_2: Bacterial Ig-like domain (group 3); InterPro: IPR022038 This family of proteins is found in bacteria. They have two conserved sequence motifs: AGN and GMT.
Probab=84.23 E-value=5.5 Score=22.36 Aligned_cols=33 Identities=33% Similarity=0.481 Sum_probs=25.3
Q ss_pred cCcCEEEEEEEEEeCCCCeEEEEEEEEEEeCCC
Q psy7564 31 EATRIYNLSVTATNMAGAKAVCSVIVHILDRND 63 (115)
Q Consensus 31 E~~~~~~l~v~a~d~~~~~~~~~v~I~V~D~Nd 63 (115)
+....|.+.+.+.|..|..+.....+.+.|..-
T Consensus 20 ~~dg~yt~~v~a~D~AGN~~~~~~~~~i~d~~~ 52 (60)
T PF12245_consen 20 DADGEYTLTVTATDKAGNTSSSTTQIVIVDNTA 52 (60)
T ss_pred cCCccEEEEEEEEECCCCEEEeeeEEEEEcCCC
Confidence 346789999999999888777777666665543
No 17
>PF08758 Cadherin_pro: Cadherin prodomain like; InterPro: IPR014868 Cadherins are a group of proteins that mediate calcium dependent cell-cell adhesion. They are activated through cleavage of a prosequence in the late Golgi. This protein corresponds to the folded region of the prosequence, and is termed the prodomain. The prodomain shows structural resemblance to the cadherin domain, but lacks all the features known to be important for cadherin-cadherin interactions []. ; GO: 0007155 cell adhesion, 0016021 integral to membrane; PDB: 1OP4_A.
Probab=81.74 E-value=9.7 Score=23.36 Aligned_cols=35 Identities=14% Similarity=0.232 Sum_probs=23.7
Q ss_pred eEEeCCceEEEEcCCCCccCcCEEEEEEEEEeCCCCe
Q psy7564 13 PALHPPYGVLLTKKPLDYEATRIYNLSVTATNMAGAK 49 (115)
Q Consensus 13 f~id~~tG~i~~~~~lD~E~~~~~~l~v~a~d~~~~~ 49 (115)
|+|++ .|.|++++++...... -.+.|.|.|..++.
T Consensus 48 F~V~~-DGsVy~~r~v~l~~~~-~~F~V~a~D~~~~~ 82 (90)
T PF08758_consen 48 FRVLE-DGSVYAKRPVQLSSEQ-RSFTVHAWDSQTQE 82 (90)
T ss_dssp EEEET-TTEEEEES--S-SSS--EEEEEEEEETTTTE
T ss_pred EEEcC-CCeEEEeeeEecCCCc-eEEEEEEECCCCCe
Confidence 66886 6999999988765433 36889999986664
No 18
>cd00146 PKD polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.
Probab=78.96 E-value=7.7 Score=22.41 Aligned_cols=29 Identities=24% Similarity=0.414 Sum_probs=20.8
Q ss_pred CccCcCEEEEEEEEEeCCCCeEEEEEEEE
Q psy7564 29 DYEATRIYNLSVTATNMAGAKAVCSVIVH 57 (115)
Q Consensus 29 D~E~~~~~~l~v~a~d~~~~~~~~~v~I~ 57 (115)
.|.+...|.+++.++|..+......+.|.
T Consensus 52 ~y~~~G~y~v~l~v~d~~g~~~~~~~~V~ 80 (81)
T cd00146 52 TYTKPGTYTVTLTVTNAVGSSSTKTTTVV 80 (81)
T ss_pred EcCCCcEEEEEEEEEeCCCCEEEEEEEEE
Confidence 35667889999999998666555555554
No 19
>PF08758 Cadherin_pro: Cadherin prodomain like; InterPro: IPR014868 Cadherins are a group of proteins that mediate calcium dependent cell-cell adhesion. They are activated through cleavage of a prosequence in the late Golgi. This protein corresponds to the folded region of the prosequence, and is termed the prodomain. The prodomain shows structural resemblance to the cadherin domain, but lacks all the features known to be important for cadherin-cadherin interactions []. ; GO: 0007155 cell adhesion, 0016021 integral to membrane; PDB: 1OP4_A.
Probab=77.92 E-value=3.1 Score=25.60 Aligned_cols=42 Identities=21% Similarity=0.206 Sum_probs=15.8
Q ss_pred CCCeEeccceEEEEecCCCCCcEEEeCC------CCCeEEEEEeCCCC
Q psy7564 64 NTPYFTQLNYTGYVSEAAPIGALVLTNT------SLPLVIRAEDKDSE 105 (115)
Q Consensus 64 ~~P~f~~~~y~~~v~E~~~~gt~v~~~~------~~~l~v~A~D~D~~ 105 (115)
..|-|....|.+.|+.+-..|..|+..+ +..+.....|+|..
T Consensus 2 C~pGF~~~~~~~~Vp~~l~~g~~lg~V~f~dC~~~~~~~~~ssDpdF~ 49 (90)
T PF08758_consen 2 CRPGFSQKKYTFEVPSNLEAGQPLGKVNFEDCTGRRRVIFESSDPDFR 49 (90)
T ss_dssp ---B--S-EEEE----SS-SS--EEE---B--SS---EEEE---SEEE
T ss_pred CcCCcccceEEEEcCchhhCCcEEEEEEeccCCCCCceEEecCCCCEE
Confidence 4688999999999999999999999511 22344455555544
No 20
>PF13750 Big_3_3: Bacterial Ig-like domain (group 3)
Probab=76.21 E-value=21 Score=24.09 Aligned_cols=38 Identities=26% Similarity=0.379 Sum_probs=27.1
Q ss_pred CcCEEEEEE-EEEeCCCCeEEEEEEEEEEeCCCCCCeEec
Q psy7564 32 ATRIYNLSV-TATNMAGAKAVCSVIVHILDRNDNTPYFTQ 70 (115)
Q Consensus 32 ~~~~~~l~v-~a~d~~~~~~~~~v~I~V~D~Nd~~P~f~~ 70 (115)
....|.+.+ .|.|..+...+..+..++. ++..+|....
T Consensus 13 ~dG~Y~l~~~~a~D~agN~~~~~~~~~~~-iD~T~Ptisi 51 (158)
T PF13750_consen 13 PDGSYTLTVVTATDAAGNTSTSTVSETFT-IDNTPPTISI 51 (158)
T ss_pred CCccEEEEEEEEEecCCCEEEEEEeeEEE-EcCCCCEEEE
Confidence 357899999 7999877766665554444 4667888865
No 21
>PF13750 Big_3_3: Bacterial Ig-like domain (group 3)
Probab=74.76 E-value=13 Score=25.18 Aligned_cols=30 Identities=33% Similarity=0.471 Sum_probs=25.9
Q ss_pred ccCcCEEEEEEEEEeCCCCeEEEEEEEEEE
Q psy7564 30 YEATRIYNLSVTATNMAGAKAVCSVIVHIL 59 (115)
Q Consensus 30 ~E~~~~~~l~v~a~d~~~~~~~~~v~I~V~ 59 (115)
.|....|.|+|.|+|..|..++..+.+...
T Consensus 119 le~~~~YtLtV~a~D~aGN~~~~si~F~y~ 148 (158)
T PF13750_consen 119 LEADDSYTLTVSATDKAGNQSTKSISFSYM 148 (158)
T ss_pred cCCCCeEEEEEEEEecCCCEEEEEEEEEEe
Confidence 467789999999999999988888888775
No 22
>smart00089 PKD Repeats in polycystic kidney disease 1 (PKD1) and other proteins. Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.
Probab=73.41 E-value=14 Score=21.10 Aligned_cols=30 Identities=30% Similarity=0.479 Sum_probs=21.5
Q ss_pred CCccCcCEEEEEEEEEeCCCCeEEEEEEEEE
Q psy7564 28 LDYEATRIYNLSVTATNMAGAKAVCSVIVHI 58 (115)
Q Consensus 28 lD~E~~~~~~l~v~a~d~~~~~~~~~v~I~V 58 (115)
.-|+....|.+++.+.|..+ .++..++|.|
T Consensus 49 ~~y~~~G~y~v~l~v~n~~g-~~~~~~~i~v 78 (79)
T smart00089 49 HTYTKPGTYTVTLTVTNAVG-SASATVTVVV 78 (79)
T ss_pred EEeCCCcEEEEEEEEEcCCC-cEEEEEEEEE
Confidence 34566788999999998766 5566666655
No 23
>KOG3597|consensus
Probab=69.40 E-value=30 Score=27.49 Aligned_cols=56 Identities=16% Similarity=0.083 Sum_probs=38.1
Q ss_pred EEEEEEEEEEeCCCCCCeEeccceEEEEecCCC-CCcEEEeCCCCCeEEEEEeCCCCCCCeEEEEE
Q psy7564 50 AVCSVIVHILDRNDNTPYFTQLNYTGYVSEAAP-IGALVLTNTSLPLVIRAEDKDSETNALLDGST 114 (115)
Q Consensus 50 ~~~~v~I~V~D~Nd~~P~f~~~~y~~~v~E~~~-~gt~v~~~~~~~l~v~A~D~D~~~n~~i~y~i 114 (115)
.+....|.|..+||.+..+....+.+-+.|+.. +...- .+.+.|+|... ..+.|++
T Consensus 24 ~~~~~~i~v~pvndpp~~~~~~~~~l~~~~~~~k~l~~~--------~l~~~d~d~~~-~~l~f~v 80 (442)
T KOG3597|consen 24 QTDVLRIHVNPVNDPPSLIFPSGSLLVILEGGQKVLDPE--------LLTAADPDSAP-LPLEFQV 80 (442)
T ss_pred EEeeecccccccCCCcceeecccceEEeecCCceeccce--------EeeccCCCCCc-cceEEEE
Confidence 455678999999998877777777766667643 22222 58899999764 3455543
No 24
>PF02494 HYR: HYR domain; InterPro: IPR003410 This domain is known as the HYR (Hyalin Repeat) domain, after the protein hyalin that is composed exclusively of this repeat. This domain probably corresponds to a new superfamily in the immunoglobulin fold. The function of this domain is uncertain it may be involved in cell adhesion. In the Sushi repeat-containing protein (SrpX), this domain is found between two sushi repeats.
Probab=69.37 E-value=16 Score=21.38 Aligned_cols=26 Identities=35% Similarity=0.497 Sum_probs=19.4
Q ss_pred cCEEEEEEEEEeCCCCeEEEEEEEEE
Q psy7564 33 TRIYNLSVTATNMAGAKAVCSVIVHI 58 (115)
Q Consensus 33 ~~~~~l~v~a~d~~~~~~~~~v~I~V 58 (115)
...|.++..|+|..+..+++.+.|+|
T Consensus 56 ~G~t~V~ytA~D~~GN~a~C~f~V~V 81 (81)
T PF02494_consen 56 VGTTTVTYTATDAAGNSATCSFTVTV 81 (81)
T ss_pred eceEEEEEEEEECCCCEEEEEEEEEC
Confidence 35677888888887777777777664
No 25
>PF07495 Y_Y_Y: Y_Y_Y domain; InterPro: IPR011123 This region is mostly found at the end of the beta propellers (IPR011110 from INTERPRO) in a family of two component regulators. However they are also found tandemly repeated in Q891H4 from SWISSPROT without other signal conduction domains being present. It is named after the conserved tyrosines found in the alignment. The exact function is not known.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=59.86 E-value=25 Score=19.28 Aligned_cols=27 Identities=26% Similarity=0.304 Sum_probs=17.1
Q ss_pred CcCEEEEEEEEEeCCCCeE--EEEEEEEE
Q psy7564 32 ATRIYNLSVTATNMAGAKA--VCSVIVHI 58 (115)
Q Consensus 32 ~~~~~~l~v~a~d~~~~~~--~~~v~I~V 58 (115)
....|+|.|.|.+..+... ...+.|+|
T Consensus 37 ~~G~Y~l~V~a~~~~~~~~~~~~~l~i~I 65 (66)
T PF07495_consen 37 PPGKYTLEVRAKDNNGKWSSDEKSLTITI 65 (66)
T ss_dssp -SEEEEEEEEEEETTS-B-SS-EEEEEEE
T ss_pred CCEEEEEEEEEECCCCCcCcccEEEEEEE
Confidence 3578999999999865322 25555555
No 26
>TIGR01965 VCBS_repeat VCBS repeat. This domain of about 100 residues is found multiple (up to 35) copies in long proteins from several species of Vibrio, Colwellia, Bradyrhizobium, and Shewanella (hence the name VCBS) and in smaller copy numbers in proteins from several other bacteria. The large protein size and repeat copy numbers, species distribution, and suggested activities of several member proteins suggests a role for this domain in adhesion.
Probab=58.39 E-value=43 Score=20.96 Aligned_cols=40 Identities=25% Similarity=0.268 Sum_probs=28.0
Q ss_pred EEEEEEEEEeCCCCeEEEEEEEEEEeCCCCCCeEeccceEEEEecC
Q psy7564 35 IYNLSVTATNMAGAKAVCSVIVHILDRNDNTPYFTQLNYTGYVSEA 80 (115)
Q Consensus 35 ~~~l~v~a~d~~~~~~~~~v~I~V~D~Nd~~P~f~~~~y~~~v~E~ 80 (115)
.-.+++.+.|. .+..|+|+|.-.|| +|..... -.+.+.|+
T Consensus 58 tdsFtvtv~DG----tt~~vtItI~GtND-apvi~~~-~~g~v~ED 97 (99)
T TIGR01965 58 TDTFTVTSADG----TSQTVTITITGAND-AAVIGGA-DTGSVTED 97 (99)
T ss_pred EEEEEEEEeCC----CeEEEEEEEEccCC-CCEEecc-cceeEecC
Confidence 34567777773 37889999999999 7766542 44566665
No 27
>cd02848 Chitinase_N_term Chitinase N-terminus domain. Chitinases hydrolyze the abundant natural biopolymer chitin, producing smaller chito-oligosaccharides. Chitin consists of multiple N-acetyl-D-glucosamine (NAG) residues connected via beta-1,4-glycosidic linkages and is an important structural element of fungal cell wall and arthropod exoskeletons. On the basis of the mode of chitin hydrolysis, chitinases are classified as random, endo-, and exo-chitinases and based on sequence criteria, chitinases belong to families 18 and 19 of glycosyl hydrolases. The N-terminus of chitinase may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitob
Probab=56.82 E-value=14 Score=23.47 Aligned_cols=33 Identities=15% Similarity=0.216 Sum_probs=23.4
Q ss_pred CCccCcCEEEEEEEEEeCCCCeEEEEEEEEEEe
Q psy7564 28 LDYEATRIYNLSVTATNMAGAKAVCSVIVHILD 60 (115)
Q Consensus 28 lD~E~~~~~~l~v~a~d~~~~~~~~~v~I~V~D 60 (115)
+++.+-..|.++|.+.|..+.+.+..+.|.|-|
T Consensus 74 ~~v~kgG~y~m~V~lCn~dGCS~S~~~~I~VAD 106 (106)
T cd02848 74 FKVGKGGRYQMQVALCNGDGCSTSAAKEIVVAD 106 (106)
T ss_pred EEeCCCCeEEEEEEEECCCCccCcCCEEEEecC
Confidence 445566788899988888776666666666644
No 28
>PF08329 ChitinaseA_N: Chitinase A, N-terminal domain; InterPro: IPR013540 This domain is found in a number of bacterial chitinases and similar viral proteins. It is organised into a fibronectin III module domain-like fold, comprising only beta strands. Its function is not known, but it may be involved in interaction with the enzyme substrate, chitin [, ]. It is separated by a hinge region from the catalytic domain (IPR001223 from INTERPRO); this hinge region is probably mobile, allowing the N-terminal domain to have different relative positions in solution []. ; GO: 0004568 chitinase activity; PDB: 2WLY_A 1EDQ_A 2WM0_A 1X6N_A 1NH6_A 2WK2_A 1EHN_A 2WLZ_A 1EIB_A 1FFR_A ....
Probab=54.86 E-value=18 Score=23.94 Aligned_cols=36 Identities=19% Similarity=0.282 Sum_probs=25.3
Q ss_pred CCccCcCEEEEEEEEEeCCCCeEEEEEEEEEEeCCC
Q psy7564 28 LDYEATRIYNLSVTATNMAGAKAVCSVIVHILDRND 63 (115)
Q Consensus 28 lD~E~~~~~~l~v~a~d~~~~~~~~~v~I~V~D~Nd 63 (115)
+...+...|.++|++.+..|.+.+..+.|.|.|.+.
T Consensus 77 ~~~~~gG~y~~~VeLCN~~GCS~S~~~~V~VaDTDG 112 (133)
T PF08329_consen 77 FTVTKGGRYQMQVELCNADGCSTSAPVEVVVADTDG 112 (133)
T ss_dssp EEE-S-EEEEEEEEEEETTEEEE---EEEEEE-TTS
T ss_pred EEecCCCEEEEEEEEECCCCcccCCCEEEEEeCCCc
Confidence 555567889999999998888778899999998865
No 29
>PF13754 Big_3_4: Bacterial Ig-like domain (group 3)
Probab=48.79 E-value=44 Score=18.13 Aligned_cols=29 Identities=34% Similarity=0.413 Sum_probs=19.5
Q ss_pred eEEEEcCCCCccCcCEEEEEEEEEeCCCCeE
Q psy7564 20 GVLLTKKPLDYEATRIYNLSVTATNMAGAKA 50 (115)
Q Consensus 20 G~i~~~~~lD~E~~~~~~l~v~a~d~~~~~~ 50 (115)
|.-...-+.. ....|.+++.|+|..|..+
T Consensus 12 G~Ws~t~~~~--~dG~y~itv~a~D~AGN~s 40 (54)
T PF13754_consen 12 GNWSFTVPAL--ADGTYTITVTATDAAGNTS 40 (54)
T ss_pred CcEEEeCCCC--CCccEEEEEEEEeCCCCCC
Confidence 4444443333 5788999999999876544
No 30
>PF15418 DUF4625: Domain of unknown function (DUF4625)
Probab=46.55 E-value=73 Score=20.90 Aligned_cols=26 Identities=23% Similarity=0.323 Sum_probs=19.4
Q ss_pred cCEEEEEEEEEeCCCCeEEEEEEEEE
Q psy7564 33 TRIYNLSVTATNMAGAKAVCSVIVHI 58 (115)
Q Consensus 33 ~~~~~l~v~a~d~~~~~~~~~v~I~V 58 (115)
...|++.+.++|..|........|.|
T Consensus 106 ~G~YH~~i~VtD~~Gn~~~~~~~i~I 131 (132)
T PF15418_consen 106 AGDYHFMITVTDAAGNQTEEERSIKI 131 (132)
T ss_pred CcceEEEEEEEECCCCEEEEEEEEEE
Confidence 46788888888888877766666654
No 31
>PF03160 Calx-beta: Calx-beta domain; InterPro: IPR003644 The calx-beta motif is present as a tandem repeat in the cytoplasmic domains of Calx Na-Ca exchangers, which are used to expel calcium from cells. This motif overlaps domains used for calcium binding and regulation. The calx-beta motif is also present in the cytoplasmic tail of mammalian integrin-beta4, which mediates the bi-directional transfer of signals across the plasma membrane, as well as in some cyanobacterial proteins. This motif contains a series of beta-strands and turns that form a self-contained beta-sheet [, ].; GO: 0007154 cell communication, 0016021 integral to membrane; PDB: 3H6A_B 3FSO_A 3FQ4_B 2DPK_A 2QVM_A 3GIN_B 2QVK_A 2FWU_A 2FWS_A 3E9U_A ....
Probab=45.99 E-value=65 Score=19.29 Aligned_cols=25 Identities=32% Similarity=0.298 Sum_probs=15.3
Q ss_pred EEEEEeCCCCCCeEeccceEEEEecCC
Q psy7564 55 IVHILDRNDNTPYFTQLNYTGYVSEAA 81 (115)
Q Consensus 55 ~I~V~D~Nd~~P~f~~~~y~~~v~E~~ 81 (115)
+|.|.| ||.+ .+.-..-...+.|+.
T Consensus 2 tvtI~d-~d~~-~v~f~~~~~~v~E~~ 26 (100)
T PF03160_consen 2 TVTILD-DDDP-TVSFSSPSYTVSEGD 26 (100)
T ss_dssp EEEEE--TTSE-EEEESSSEEEEETTS
T ss_pred EEEEEC-CCCC-EEEEeCCEEEEEeCC
Confidence 467777 6645 776555556677774
No 32
>TIGR00845 caca sodium/calcium exchanger 1. This model is specific for the eukaryotic sodium ion/calcium ion exchangers of the Caca family
Probab=35.34 E-value=2.5e+02 Score=24.80 Aligned_cols=29 Identities=34% Similarity=0.297 Sum_probs=19.6
Q ss_pred EEEEEEEEeCCCCCCeEeccceEEEEecCC
Q psy7564 52 CSVIVHILDRNDNTPYFTQLNYTGYVSEAA 81 (115)
Q Consensus 52 ~~v~I~V~D~Nd~~P~f~~~~y~~~v~E~~ 81 (115)
...+|+|.| ||++|.|.-..-...|.|+.
T Consensus 515 s~ATVTIlD-DD~aGIfsFe~~~~sV~Es~ 543 (928)
T TIGR00845 515 NTATVTILD-DDHAGIFTFEEDVFHVSESI 543 (928)
T ss_pred ceEEEEEec-CcccCcccccCceEEEEcCC
Confidence 356777777 78899877544456677763
No 33
>PRK08577 hypothetical protein; Provisional
Probab=35.27 E-value=99 Score=19.95 Aligned_cols=37 Identities=5% Similarity=0.163 Sum_probs=27.4
Q ss_pred ecceEEeCCceEEEEcCCCCccCcCEEEEEEEEEeCCC
Q psy7564 10 ILKPALHPPYGVLLTKKPLDYEATRIYNLSVTATNMAG 47 (115)
Q Consensus 10 ~~~f~id~~tG~i~~~~~lD~E~~~~~~l~v~a~d~~~ 47 (115)
.+.|.++...|+|.+. ++-.+....|.+.+.+.|.++
T Consensus 32 ~~~~~~~~~~~~~~~~-~~~~~~k~~~~I~V~~~Dr~G 68 (136)
T PRK08577 32 YVLLIADTDKKEIHLE-PIALPGKKLVEIELVVEDRPG 68 (136)
T ss_pred EEEEEEECCCCEEEEE-EcCCCCccEEEEEEEEcCCCC
Confidence 3456678888888884 555566678889999988765
No 34
>PF05688 DUF824: Salmonella repeat of unknown function (DUF824); InterPro: IPR008542 This family consists of a series of repeated sequences (of around 180 residues) which are found in Salmonella typhimurium, Salmonella typhi and Escherichia coli. These repeats are almost always found with this entry. The repeats are associated with RatA and RatB, the coding sequences of which are found in the pathogeneicity island of Salmonella. The sequences may be determinants of pathogenicity [, ].
Probab=31.50 E-value=70 Score=17.20 Aligned_cols=16 Identities=25% Similarity=0.345 Sum_probs=13.0
Q ss_pred EEEEEEEEEEeCCCCC
Q psy7564 50 AVCSVIVHILDRNDNT 65 (115)
Q Consensus 50 ~~~~v~I~V~D~Nd~~ 65 (115)
.+..++|++.|.|.+|
T Consensus 13 e~I~ltVt~kda~G~p 28 (47)
T PF05688_consen 13 ETIPLTVTVKDANGNP 28 (47)
T ss_pred CeEEEEEEEECCCCCC
Confidence 3678999999998755
No 35
>COG1098 VacB Predicted RNA binding protein (contains ribosomal protein S1 domain) [Translation, ribosomal structure and biogenesis]
Probab=23.05 E-value=26 Score=23.01 Aligned_cols=33 Identities=18% Similarity=0.262 Sum_probs=23.1
Q ss_pred eEEEEecCCCCCcEEEeCCCCCeEEEEEeCCCCCCCeEEEEEC
Q psy7564 73 YTGYVSEAAPIGALVLTNTSLPLVIRAEDKDSETNALLDGSTN 115 (115)
Q Consensus 73 y~~~v~E~~~~gt~v~~~~~~~l~v~A~D~D~~~n~~i~y~i~ 115 (115)
|.-.|.+-...|..|. +...|-|. +|.+..|||
T Consensus 42 fVkdI~d~L~vG~eV~--------vKVl~ide--~GKisLSIr 74 (129)
T COG1098 42 FVKDIHDHLKVGQEVK--------VKVLDIDE--NGKISLSIR 74 (129)
T ss_pred hHHhHHHHhcCCCEEE--------EEEEeecc--CCCcceehH
Confidence 4444555567788885 77777775 788888875
No 36
>COG4288 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=21.88 E-value=2.3e+02 Score=18.22 Aligned_cols=44 Identities=16% Similarity=0.258 Sum_probs=23.8
Q ss_pred EEEEEEEEeCCCCCCeEeccceEEEEecCCCCCcEEEeCCCCCeEEEEEeCCCCCC
Q psy7564 52 CSVIVHILDRNDNTPYFTQLNYTGYVSEAAPIGALVLTNTSLPLVIRAEDKDSETN 107 (115)
Q Consensus 52 ~~v~I~V~D~Nd~~P~f~~~~y~~~v~E~~~~gt~v~~~~~~~l~v~A~D~D~~~n 107 (115)
.++.+..-|.+|-.+.-....|..-+.- +.|..++-+-.|.+.+
T Consensus 68 rt~hlRfndL~dpe~iP~d~~yasvies------------nvPvVvQ~tRLDsgq~ 111 (124)
T COG4288 68 RTLHLRFNDLGDPEAIPKDTPYASVIES------------NVPVVVQLTRLDSGQA 111 (124)
T ss_pred eeEEEEecccCCcccCCCCCchhhheec------------CCceEEEEEEeccCCc
Confidence 3566777777764443333334333332 3334577777887754
No 37
>PF13290 CHB_HEX_C_1: Chitobiase/beta-hexosaminidase C-terminal domain
Probab=21.24 E-value=1.1e+02 Score=17.45 Aligned_cols=16 Identities=25% Similarity=0.320 Sum_probs=9.7
Q ss_pred EEEEEeCCCCCCCeEEEEEC
Q psy7564 96 VIRAEDKDSETNALLDGSTN 115 (115)
Q Consensus 96 ~v~A~D~D~~~n~~i~y~i~ 115 (115)
.+.+.++| +.|+|.++
T Consensus 13 ~l~~~~~~----~~IyYT~D 28 (67)
T PF13290_consen 13 TLSTADPD----ATIYYTTD 28 (67)
T ss_pred EEEeCCCC----CEEEEEcC
Confidence 45554433 57888764
No 38
>PF12100 DUF3576: Domain of unknown function (DUF3576); InterPro: IPR021959 This presumed domain is functionally uncharacterised. This domain is found in bacteria. This domain is about 100 amino acids in length. This domain has a single completely conserved residue G that may be functionally important.
Probab=21.12 E-value=2.3e+02 Score=17.92 Aligned_cols=43 Identities=19% Similarity=0.101 Sum_probs=26.9
Q ss_pred EEeCCceEEEEcC-CCCccCcCEEEEEEEEEeCCCCeEEEEEEE
Q psy7564 14 ALHPPYGVLLTKK-PLDYEATRIYNLSVTATNMAGAKAVCSVIV 56 (115)
Q Consensus 14 ~id~~tG~i~~~~-~lD~E~~~~~~l~v~a~d~~~~~~~~~v~I 56 (115)
+.|+..|.|.+-- ..---....|.++|...|..-......|.+
T Consensus 23 saD~~gGVI~TdWY~~p~~~~er~k~tv~Ild~~Lradal~V~v 66 (103)
T PF12100_consen 23 SADPFGGVIVTDWYSPPPGPNERFKATVYILDRALRADALRVSV 66 (103)
T ss_pred hcCCCCCEEEeccccCCCCCCeeEEEEEEEECccccCCceEEEE
Confidence 4688899988873 222233466888888888654444444444
No 39
>cd05775 Ig_SLAM-CD84_like_N N-terminal immunoglobulin (Ig)-like domain of the signaling lymphocyte activation molecule (SLAM) family, CD84_like. Ig_SLAM-CD84_like_N: The N-terminal immunoglobulin (Ig)-like domain of the signaling lymphocyte activation molecule (SLAM) family, CD84_like. The SLAM family is a group of immune-cell specific receptors that can regulate both adaptive and innate immune responses. Members of this group include proteins such as CD84, SLAM (CD150), Ly-9 (CD229), NTB-A (ly-108, SLAM6), 19A (CRACC), and SLAMF9. The genes coding for the SLAM family are nested on chromosome 1, in humans at 1q23, and in mice at 1H2. The SLAM family is a subset of the CD2 family, which also includes CD2 and CD58 located on chromosome 1 at 1p13 in humans. In mice, CD2 is located on chromosome 3, and there is no CD58 homolog. The SLAM family proteins are organized as an extracellular domain with either two or four Ig-like domains, a single transmembrane segment, and a cytoplasmic region
Probab=20.11 E-value=2.1e+02 Score=17.01 Aligned_cols=32 Identities=16% Similarity=0.159 Sum_probs=15.5
Q ss_pred ceEEeCCceEEEEcCCCCccCcCEEEEEEEEEe
Q psy7564 12 KPALHPPYGVLLTKKPLDYEATRIYNLSVTATN 44 (115)
Q Consensus 12 ~f~id~~tG~i~~~~~lD~E~~~~~~l~v~a~d 44 (115)
++.++..++.|++. .+-.+....|...+...+
T Consensus 53 R~~~~~~~~sL~I~-~~~~~DsG~Y~c~v~~~~ 84 (97)
T cd05775 53 RVNFSQNDYSLQIS-NLKMEDAGSYRAEINTKN 84 (97)
T ss_pred eEEecCCceeEEEC-CCchHHCEEEEEEEEcCC
Confidence 34455555555443 334445555655554433
Done!