Query psy8617
Match_columns 184
No_of_seqs 216 out of 1217
Neff 8.9
Searched_HMMs 46136
Date Fri Aug 16 23:23:44 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy8617.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/8617hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 cd00031 CA Cadherin repeat dom 100.0 1.4E-31 3E-36 202.7 22.4 159 1-182 9-169 (199)
2 KOG4289|consensus 100.0 1.9E-32 4.1E-37 243.2 17.8 158 2-182 282-441 (2531)
3 KOG4289|consensus 100.0 4.9E-30 1.1E-34 228.1 13.7 158 2-181 796-955 (2531)
4 KOG1219|consensus 100.0 1.7E-28 3.7E-33 224.1 18.0 158 2-182 966-1125(4289)
5 KOG1219|consensus 100.0 4.8E-27 1E-31 214.8 18.5 158 2-182 653-810 (4289)
6 PF00028 Cadherin: Cadherin do 99.8 6.5E-19 1.4E-23 118.2 13.0 84 2-108 9-93 (93)
7 smart00112 CA Cadherin repeats 99.8 3E-19 6.6E-24 116.3 10.9 79 14-115 1-79 (79)
8 KOG1834|consensus 99.6 1.4E-14 3E-19 123.0 13.4 129 49-179 73-211 (952)
9 cd00031 CA Cadherin repeat dom 99.5 7.2E-13 1.6E-17 100.0 12.7 85 2-109 115-199 (199)
10 PF00028 Cadherin: Cadherin do 99.4 1.3E-12 2.9E-17 87.4 8.7 62 122-183 1-64 (93)
11 smart00112 CA Cadherin repeats 98.1 6.8E-06 1.5E-10 52.8 4.6 41 142-182 1-43 (79)
12 KOG1834|consensus 98.1 2.6E-05 5.6E-10 67.6 9.2 79 6-110 165-245 (952)
13 PF08266 Cadherin_2: Cadherin- 97.7 3.8E-05 8.3E-10 50.3 3.7 23 57-79 44-66 (84)
14 PF08266 Cadherin_2: Cadherin- 97.2 0.00088 1.9E-08 43.9 5.1 60 122-182 3-66 (84)
15 smart00736 CADG Dystroglycan-t 96.7 0.05 1.1E-06 36.3 10.3 52 57-112 45-96 (97)
16 PF08758 Cadherin_pro: Cadheri 96.5 0.0077 1.7E-07 39.9 5.0 62 113-179 2-63 (90)
17 PF08758 Cadherin_pro: Cadheri 95.0 0.35 7.7E-06 32.0 8.4 62 3-95 20-81 (90)
18 TIGR01965 VCBS_repeat VCBS rep 95.0 0.41 8.8E-06 32.3 8.7 88 9-129 2-97 (99)
19 PF05345 He_PIG: Putative Ig d 86.1 4.8 0.0001 23.3 5.9 35 57-93 13-48 (49)
20 TIGR00845 caca sodium/calcium 81.3 49 0.0011 31.3 14.4 56 100-159 514-569 (928)
21 PF07495 Y_Y_Y: Y_Y_Y domain; 81.3 9.3 0.0002 22.9 6.6 29 79-107 37-65 (66)
22 TIGR03660 T1SS_rpt_143 T1SS-14 73.4 31 0.00066 24.7 11.7 67 58-130 56-128 (137)
23 KOG3597|consensus 69.9 22 0.00049 30.5 7.0 60 99-159 24-83 (442)
24 KOG4221|consensus 51.6 2.5E+02 0.0054 27.7 12.3 69 45-116 552-620 (1381)
25 COG2706 3-carboxymuconate cycl 47.8 1.6E+02 0.0035 24.5 9.8 72 103-179 213-287 (346)
26 PF03160 Calx-beta: Calx-beta 45.1 84 0.0018 20.4 7.1 52 104-159 2-53 (100)
27 PF03413 PepSY: Peptidase prop 43.3 44 0.00095 19.5 3.4 12 163-174 51-62 (64)
28 PF00778 DIX: DIX domain; Int 43.1 79 0.0017 20.5 4.7 56 57-119 16-74 (84)
29 PF12245 Big_3_2: Bacterial Ig 42.1 75 0.0016 18.9 5.3 30 80-111 22-51 (60)
30 PF14157 YmzC: YmzC-like prote 37.2 60 0.0013 19.8 3.2 18 161-178 42-59 (63)
31 smart00089 PKD Repeats in poly 37.0 1E+02 0.0022 18.9 4.6 29 76-107 50-78 (79)
32 PF13750 Big_3_3: Bacterial Ig 34.5 1.8E+02 0.0039 21.1 10.0 31 76-108 118-148 (158)
33 cd00146 PKD polycystic kidney 33.1 89 0.0019 19.2 3.8 19 76-94 52-70 (81)
34 PF02494 HYR: HYR domain; Int 30.4 1E+02 0.0023 19.2 3.8 25 81-107 57-81 (81)
35 PF07861 WND: WisP family N-Te 28.7 1.9E+02 0.0041 22.0 5.3 28 148-175 201-228 (263)
36 cd02848 Chitinase_N_term Chiti 25.1 2.3E+02 0.005 19.3 4.9 33 74-108 73-105 (106)
37 PF12461 DUF3688: Protein of u 22.6 85 0.0018 20.7 2.3 24 151-174 61-86 (91)
38 PF09100 Qn_am_d_aIV: Quinohem 22.4 1.6E+02 0.0035 20.8 3.7 34 81-115 98-132 (133)
39 PF00635 Motile_Sperm: MSP (Ma 20.6 1.7E+02 0.0036 19.1 3.5 25 150-174 32-57 (109)
40 KOG4680|consensus 20.5 2.7E+02 0.0059 20.0 4.5 34 123-156 109-142 (153)
No 1
>cd00031 CA Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here
Probab=100.00 E-value=1.4e-31 Score=202.72 Aligned_cols=159 Identities=39% Similarity=0.612 Sum_probs=149.2
Q ss_pred CcCCCeEEEEEEEEeCCCCCCcEEEEEEecCCCcccccCCCcceEEEEEeeCCcccccceEEeCcccEEEECCCCCcccc
Q psy8617 1 MVPCLFQILQVTAVDLDTGNNARITYKLIQDDTAVDLDTGNNARITYKLIQDDKLVHDTFHIMPNTGWLSLKQPMDRETR 80 (184)
Q Consensus 1 ~~p~Gt~v~~v~A~D~D~g~n~~i~Y~i~~~~~~~d~d~~~~~~~~~~~~~~~~~~~~~F~I~~~tG~i~~~~~lD~E~~ 80 (184)
.+++|+.|+++.|+|+|.+.++.++|+|.++.. ..+|.|++.+|.|++++.||||..
T Consensus 9 n~~~g~~v~~~~a~D~D~~~~~~~~y~i~~~~~-----------------------~~~F~i~~~tG~l~~~~~lD~e~~ 65 (199)
T cd00031 9 NAPPGTVVGTVSATDPDSGENGRVTYSILGGNE-----------------------DGLFSIDPNTGVITTTKPLDREEQ 65 (199)
T ss_pred CCCCCCEEEEEEEECCCCCCCceEEEEEeCCCC-----------------------cccEEEeCCCCEEEECCCCCCcCC
Confidence 367899999999999999888999999999875 469999999999999999999999
Q ss_pred cEEEEEEEEEeCCCCcccceEEEEEEEecCCCCCCccCCCceEEEEeCCCCCCcEEEEEEEEeCCCCCCceEEEEEecCC
Q psy8617 81 DKFLLKIAACDNGTPVSTAYATVAINVLDANDNDPKFSKQDYEFSIEENLPRGTVVGSVKATDEDLASNAALRYSLIPSN 160 (184)
Q Consensus 81 ~~~~l~v~a~d~~~p~~~~~~~v~I~v~d~Nd~~P~f~~~~~~~~v~E~~~~gt~v~~v~a~D~D~~~n~~i~y~i~~~~ 160 (184)
..|.|.|.|+|.+.|.++....++|.|.|+||++|.|....|.+.|.|+.++|+.++++.|+|+|.+.++.++|+|.++.
T Consensus 66 ~~~~l~v~a~D~g~~~~~~~~~v~I~V~d~Nd~~P~~~~~~~~~~v~e~~~~~~~i~~~~a~D~D~~~~~~~~y~l~~~~ 145 (199)
T cd00031 66 SEYTLTVVASDGGGPPLSSTATVTVTVLDVNDNPPVFEQSSYEASVPENAPPGTVVGTVTATDADSGENAKLTYSILSGN 145 (199)
T ss_pred ceEEEEEEEEECCcCcceeEEEEEEEEccCCCCCCcccccceEEEEeCCCCCCCEEEEEEEEcCCCCCCccEEEEEeCCC
Confidence 99999999999888877789999999999999999999999999999999999999999999999988999999999987
Q ss_pred --CcEEEEcCccEEEEcccccccc
Q psy8617 161 --TSFQINPFSGKKSILYLHSVIH 182 (184)
Q Consensus 161 --~~F~Id~~tG~i~~~~~~~~~~ 182 (184)
.+|.|++.+|.|++.+.++.|.
T Consensus 146 ~~~~f~i~~~~G~i~~~~~ld~e~ 169 (199)
T cd00031 146 DKELFSIDPNTGIITLAKPLDREE 169 (199)
T ss_pred CCCEEEEeCCceEEEeCCccCCcc
Confidence 7999999999999998887664
No 2
>KOG4289|consensus
Probab=100.00 E-value=1.9e-32 Score=243.19 Aligned_cols=158 Identities=32% Similarity=0.552 Sum_probs=152.7
Q ss_pred cCCCeEEEEEEEEeCCCCCCcEEEEEEecCCCcccccCCCcceEEEEEeeCCcccccceEEeCcccEEEECCCCCccccc
Q psy8617 2 VPCLFQILQVTAVDLDTGNNARITYKLIQDDTAVDLDTGNNARITYKLIQDDKLVHDTFHIMPNTGWLSLKQPMDRETRD 81 (184)
Q Consensus 2 ~p~Gt~v~~v~A~D~D~g~n~~i~Y~i~~~~~~~d~d~~~~~~~~~~~~~~~~~~~~~F~I~~~tG~i~~~~~lD~E~~~ 81 (184)
.++|+.|++|+|+|.|.++|+.|+|++.+++. .+.|.|++.+|.|++..+||||+..
T Consensus 282 ~evGy~vLtvrAtD~Dsp~Nani~Yrl~eg~~-----------------------~~~f~in~rSGvI~T~a~lDRE~~~ 338 (2531)
T KOG4289|consen 282 LEVGYEVLTVRATDGDSPPNANIRYRLLEGNA-----------------------KNVFEINPRSGVISTRAPLDREELE 338 (2531)
T ss_pred cccCceEEEEEeccCCCCCCCceEEEecCCCc-----------------------cceeEEcCccceeeccCccCHHhhh
Confidence 47899999999999999999999999999977 7899999999999999999999999
Q ss_pred EEEEEEEEEeCCCCcccceEEEEEEEecCCCCCCccCCCceEEEEeCCCCCCcEEEEEEEEeCCCCCCceEEEEEecCC-
Q psy8617 82 KFLLKIAACDNGTPVSTAYATVAINVLDANDNDPKFSKQDYEFSIEENLPRGTVVGSVKATDEDLASNAALRYSLIPSN- 160 (184)
Q Consensus 82 ~~~l~v~a~d~~~p~~~~~~~v~I~v~d~Nd~~P~f~~~~~~~~v~E~~~~gt~v~~v~a~D~D~~~n~~i~y~i~~~~- 160 (184)
.|.|.|.|.|.|.|+...++.|.|+|.|+|||+|+|....|.+.|.|+..++++|++|+|+|+|.|.|+.+.|+|.+++
T Consensus 339 ~y~L~VeAsDqG~~pgp~Ta~V~itV~D~NDNaPqFse~~Yvvqv~Edvt~~avvlrV~AtDrD~g~Ng~VHYsi~Sgn~ 418 (2531)
T KOG4289|consen 339 SYQLDVEASDQGRPPGPRTAMVEITVEDENDNAPQFSEKRYVVQVREDVTPPAVVLRVTATDRDKGTNGKVHYSIASGNG 418 (2531)
T ss_pred heEEEEEeccCCCCCCCceEEEEEEEEecCCCCccccccceEEEecccCCCCceEEEEEecccCCCcCceEEEEeeccCc
Confidence 9999999999999888889999999999999999999999999999999999999999999999999999999999998
Q ss_pred -CcEEEEcCccEEEEcccccccc
Q psy8617 161 -TSFQINPFSGKKSILYLHSVIH 182 (184)
Q Consensus 161 -~~F~Id~~tG~i~~~~~~~~~~ 182 (184)
+.|.||..||+|.+..+++.|.
T Consensus 419 ~G~f~id~~tGel~vv~plD~e~ 441 (2531)
T KOG4289|consen 419 RGQFYIDSLTGELDVVEPLDFEN 441 (2531)
T ss_pred cccEEEecccceEEEeccccccC
Confidence 9999999999999999999875
No 3
>KOG4289|consensus
Probab=99.97 E-value=4.9e-30 Score=228.05 Aligned_cols=158 Identities=35% Similarity=0.547 Sum_probs=150.2
Q ss_pred cCCCeEEEEEEEEeCCCCCCcEEEEEEecCCCcccccCCCcceEEEEEeeCCcccccceEEeCcccEEEECCCCCccccc
Q psy8617 2 VPCLFQILQVTAVDLDTGNNARITYKLIQDDTAVDLDTGNNARITYKLIQDDKLVHDTFHIMPNTGWLSLKQPMDRETRD 81 (184)
Q Consensus 2 ~p~Gt~v~~v~A~D~D~g~n~~i~Y~i~~~~~~~d~d~~~~~~~~~~~~~~~~~~~~~F~I~~~tG~i~~~~~lD~E~~~ 81 (184)
+|++|.|++|.|||+|.|.|+++.|.+.++.. ..+.|.|++.+|.|++.+.||||...
T Consensus 796 ~Pv~TsvlQVSatDaD~g~Ng~v~y~~qg~~d----------------------~p~~F~IEptSGviRtl~rLdRE~~a 853 (2531)
T KOG4289|consen 796 APVFTSVLQVSATDADSGPNGRVYYTFQGGDD----------------------GPGDFYIEPTSGVIRTLRRLDRENVA 853 (2531)
T ss_pred CCCcceEEEEEEeccCCCCCceEEEEecCCCC----------------------CCCceEEccCcceeehhhhhcchhee
Confidence 69999999999999999999999999987765 14799999999999999999999999
Q ss_pred EEEEEEEEEeCCCCcccceEEEEEEEecCCCCCCccCCCceEEEEeCCCCCCcEEEEEEEEeCCCCCCceEEEEEecCC-
Q psy8617 82 KFLLKIAACDNGTPVSTAYATVAINVLDANDNDPKFSKQDYEFSIEENLPRGTVVGSVKATDEDLASNAALRYSLIPSN- 160 (184)
Q Consensus 82 ~~~l~v~a~d~~~p~~~~~~~v~I~v~d~Nd~~P~f~~~~~~~~v~E~~~~gt~v~~v~a~D~D~~~n~~i~y~i~~~~- 160 (184)
.|.|.+.|+|.|.|++++.++|+|+|.|+|||||.|++..|.+.|.|+.++|..+++++|.|+|+|+|+.|.|+|.+++
T Consensus 854 vy~L~a~avDrg~p~ls~~~eItvtvldvNDnaPvfe~~e~e~~I~enspvgs~va~i~a~dpdEG~NA~IsYqIvgg~d 933 (2531)
T KOG4289|consen 854 VYVLAAYAVDRGNPPLSAPVEITVTVLDVNDNAPVFEQDELELFIEENSPVGSVVALITADDPDEGPNAHISYQIVGGND 933 (2531)
T ss_pred EEEEEEEEeeCCCCCcCCceEEEEEEEecCCCCCCCCCcceeeEEeecCccceeeEEEEccCCCcCCcceEEEeeccCcc
Confidence 9999999999999999999999999999999999999999999999999999999999999999999999999999997
Q ss_pred -CcEEEEcCccEEEEccccccc
Q psy8617 161 -TSFQINPFSGKKSILYLHSVI 181 (184)
Q Consensus 161 -~~F~Id~~tG~i~~~~~~~~~ 181 (184)
..|.++...|+|....-|+.|
T Consensus 934 ~~~fq~de~~~~lla~~efdye 955 (2531)
T KOG4289|consen 934 PELFQLDEFSGELLALVEFDYE 955 (2531)
T ss_pred HHHHHHHHhhhhhhhheeehhh
Confidence 999999999998888777765
No 4
>KOG1219|consensus
Probab=99.96 E-value=1.7e-28 Score=224.14 Aligned_cols=158 Identities=30% Similarity=0.424 Sum_probs=153.2
Q ss_pred cCCCeEEEEEEEEeCCCCCCcEEEEEEecCCCcccccCCCcceEEEEEeeCCcccccceEEeCcccEEEECCCCCccccc
Q psy8617 2 VPCLFQILQVTAVDLDTGNNARITYKLIQDDTAVDLDTGNNARITYKLIQDDKLVHDTFHIMPNTGWLSLKQPMDRETRD 81 (184)
Q Consensus 2 ~p~Gt~v~~v~A~D~D~g~n~~i~Y~i~~~~~~~d~d~~~~~~~~~~~~~~~~~~~~~F~I~~~tG~i~~~~~lD~E~~~ 81 (184)
+|+||.|++|.|.|.|.|..+.++|+|..+.. .+.|+|+..+|.|++.+.||||...
T Consensus 966 apiGT~vi~i~A~dedsgldg~l~Y~I~~gdg-----------------------~g~FsId~~tG~irTl~~lDrE~ks 1022 (4289)
T KOG1219|consen 966 APIGTIVIRIQARDEDSGLDGELSYKIRTGDG-----------------------DGIFSIDSTTGSIRTLKALDREKKS 1022 (4289)
T ss_pred CCcceEEEEEEEecCCCCccceEEEEEEcCCc-----------------------ceeEEecCCcceEeechhhchhhcc
Confidence 79999999999999999999999999999987 7899999999999999999999999
Q ss_pred EEEEEEEEEeCCCCcccceEEEEEEEecCCCCCCccCCCceEEEEeCCCCCCcEEEEEEEEeCCCCCCceEEEEEecCC-
Q psy8617 82 KFLLKIAACDNGTPVSTAYATVAINVLDANDNDPKFSKQDYEFSIEENLPRGTVVGSVKATDEDLASNAALRYSLIPSN- 160 (184)
Q Consensus 82 ~~~l~v~a~d~~~p~~~~~~~v~I~v~d~Nd~~P~f~~~~~~~~v~E~~~~gt~v~~v~a~D~D~~~n~~i~y~i~~~~- 160 (184)
.|-|+|.|+|.|.+++++.+.+.|.|.|+|||+|+|.++.|..+|.|+++.+..|.++.|.|+|...|+++.|.|.+++
T Consensus 1023 ~YwltveA~D~gt~~~ssv~~vyI~ieDvNDn~Pq~s~pvy~asI~enSp~~vsivq~ea~D~Dsssn~kLmykI~sGny 1102 (4289)
T KOG1219|consen 1023 SYWLTVEAKDLGTVPLSSVCEVYIEIEDVNDNVPQFSSPVYYASISENSPETVSIVQAEANDPDSSSNQKLMYKITSGNY 1102 (4289)
T ss_pred eEEEEEEEEecCCCccccceeEEEEEEecCCCCcccCCceEeeeeccCCCCceEEEEeccCCCCcccCcceEEEEccCCc
Confidence 9999999999999999999999999999999999999999999999999999999999999999888999999999997
Q ss_pred -CcEEEEcCccEEEEcccccccc
Q psy8617 161 -TSFQINPFSGKKSILYLHSVIH 182 (184)
Q Consensus 161 -~~F~Id~~tG~i~~~~~~~~~~ 182 (184)
++|.|+|.||.|++.+-+++|.
T Consensus 1103 q~FF~Id~~TG~iTt~r~LDRE~ 1125 (4289)
T KOG1219|consen 1103 QGFFQIDPETGLITTIRRLDREK 1125 (4289)
T ss_pred cceEEEccccceeeeehhhcccc
Confidence 9999999999999999998875
No 5
>KOG1219|consensus
Probab=99.95 E-value=4.8e-27 Score=214.84 Aligned_cols=158 Identities=27% Similarity=0.471 Sum_probs=153.4
Q ss_pred cCCCeEEEEEEEEeCCCCCCcEEEEEEecCCCcccccCCCcceEEEEEeeCCcccccceEEeCcccEEEECCCCCccccc
Q psy8617 2 VPCLFQILQVTAVDLDTGNNARITYKLIQDDTAVDLDTGNNARITYKLIQDDKLVHDTFHIMPNTGWLSLKQPMDRETRD 81 (184)
Q Consensus 2 ~p~Gt~v~~v~A~D~D~g~n~~i~Y~i~~~~~~~d~d~~~~~~~~~~~~~~~~~~~~~F~I~~~tG~i~~~~~lD~E~~~ 81 (184)
+|+||.|+++.|+|.|.|-|+.++|.|.+++. ...|.|+.++|.|++.++||+|...
T Consensus 653 vpigt~la~L~atD~Dtgfng~l~yvI~dgne-----------------------~~~~~Id~qsg~itvas~ld~~~t~ 709 (4289)
T KOG1219|consen 653 VPIGTTLAILSATDSDTGFNGKLVYVIEDGNE-----------------------SICFLIDRQSGNITVASPLDNENTE 709 (4289)
T ss_pred CCCCceEEEEeccCCCCCcCceEEEEEeCCcc-----------------------ceEEEEecccceEEEecchhhhhhh
Confidence 68999999999999999999999999999987 7899999999999999999999999
Q ss_pred EEEEEEEEEeCCCCcccceEEEEEEEecCCCCCCccCCCceEEEEeCCCCCCcEEEEEEEEeCCCCCCceEEEEEecCCC
Q psy8617 82 KFLLKIAACDNGTPVSTAYATVAINVLDANDNDPKFSKQDYEFSIEENLPRGTVVGSVKATDEDLASNAALRYSLIPSNT 161 (184)
Q Consensus 82 ~~~l~v~a~d~~~p~~~~~~~v~I~v~d~Nd~~P~f~~~~~~~~v~E~~~~gt~v~~v~a~D~D~~~n~~i~y~i~~~~~ 161 (184)
.|.|.|.|.|.|.|..++...+.|.|.|.|||+|.|.+..|.+.|+|+..+|+.|++|.+.|.|.|.||+++|+|.....
T Consensus 710 ~yiLnvta~D~gtPqkss~r~l~v~vkd~ndn~p~f~e~sy~vtvsedtepgs~Ia~vetnd~D~g~NG~v~fsL~n~sd 789 (4289)
T KOG1219|consen 710 QYILNVTAYDLGTPQKSSWRLLLVFVKDYNDNTPIFVERSYHVTVSEDTEPGSFIAHVETNDTDGGNNGMVSFSLLNKSD 789 (4289)
T ss_pred eeEEEEEEecCCCchhhceeeEEEEEEecccCCccccccceEEEEecCCCCCceEEEEEecccCCCCCceEEEEecCCcc
Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999899
Q ss_pred cEEEEcCccEEEEcccccccc
Q psy8617 162 SFQINPFSGKKSILYLHSVIH 182 (184)
Q Consensus 162 ~F~Id~~tG~i~~~~~~~~~~ 182 (184)
.|+|||.||.|.+.+.+++|.
T Consensus 790 vfsIdp~tGivv~~~sLdrE~ 810 (4289)
T KOG1219|consen 790 VFSIDPFTGIVVTSKSLDREG 810 (4289)
T ss_pred eEEecCcccEEEeccccCccc
Confidence 999999999999999998874
No 6
>PF00028 Cadherin: Cadherin domain; InterPro: IPR002126 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion.; GO: 0005509 calcium ion binding, 0007156 homophilic cell adhesion, 0016020 membrane; PDB: 2A4E_A 2A4C_B 2O72_A 2QVI_A 1NCJ_A 3Q2W_A 3Q2N_A 3LNH_B 3LNI_A 3Q2L_A ....
Probab=99.82 E-value=6.5e-19 Score=118.21 Aligned_cols=84 Identities=45% Similarity=0.665 Sum_probs=79.8
Q ss_pred cCCCeEEEEEEEEeCCCCCCcEEEEEEecCCCcccccCCCcceEEEEEeeCCcccccceEEeCcccEEEECCCCCccccc
Q psy8617 2 VPCLFQILQVTAVDLDTGNNARITYKLIQDDTAVDLDTGNNARITYKLIQDDKLVHDTFHIMPNTGWLSLKQPMDRETRD 81 (184)
Q Consensus 2 ~p~Gt~v~~v~A~D~D~g~n~~i~Y~i~~~~~~~d~d~~~~~~~~~~~~~~~~~~~~~F~I~~~tG~i~~~~~lD~E~~~ 81 (184)
+|+|+.|+++.|+|+|.+.|+.+.|+|.++.. .++|.|++.+|.|++++.||||...
T Consensus 9 ~~~g~~v~~v~a~D~D~~~n~~i~y~i~~~~~-----------------------~~~F~I~~~tg~i~~~~~LD~E~~~ 65 (93)
T PF00028_consen 9 APPGTVVGQVTATDPDSGPNSQITYSILGGNP-----------------------DGLFSIDPNTGEISLKKPLDRETQS 65 (93)
T ss_dssp GSTSSEEEEEEEEESSTSTTSSEEEEEEETTS-----------------------TTSEEEETTTTEEEESSSSCTTTTS
T ss_pred CCCCCEEEEEEEEeCCCCCCceEEEEEecCcc-----------------------cCceEEeeeeeccccceecCcccCC
Confidence 68899999999999999999999999999986 6799999999999999999999999
Q ss_pred EEEEEEEEEeC-CCCcccceEEEEEEEe
Q psy8617 82 KFLLKIAACDN-GTPVSTAYATVAINVL 108 (184)
Q Consensus 82 ~~~l~v~a~d~-~~p~~~~~~~v~I~v~ 108 (184)
.|.|.|.|+|. +.|++++++.|+|+|.
T Consensus 66 ~y~l~v~a~D~~~~~~~~~~~~V~I~V~ 93 (93)
T PF00028_consen 66 SYQLTVRATDSGGSPPLSSTATVTINVL 93 (93)
T ss_dssp EEEEEEEEEETTTSSEEEEEEEEEEEEE
T ss_pred EEEEEEEEEECCCCCCCEEEEEEEEEEC
Confidence 99999999999 8899999999999884
No 7
>smart00112 CA Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
Probab=99.81 E-value=3e-19 Score=116.28 Aligned_cols=79 Identities=38% Similarity=0.709 Sum_probs=72.6
Q ss_pred EeCCCCCCcEEEEEEecCCCcccccCCCcceEEEEEeeCCcccccceEEeCcccEEEECCCCCcccccEEEEEEEEEeCC
Q psy8617 14 VDLDTGNNARITYKLIQDDTAVDLDTGNNARITYKLIQDDKLVHDTFHIMPNTGWLSLKQPMDRETRDKFLLKIAACDNG 93 (184)
Q Consensus 14 ~D~D~g~n~~i~Y~i~~~~~~~d~d~~~~~~~~~~~~~~~~~~~~~F~I~~~tG~i~~~~~lD~E~~~~~~l~v~a~d~~ 93 (184)
+|+|.|.|+.++|+|.++.. ..+|.|++.+|.|+++++||||....|.|.|.|.|.+
T Consensus 1 ~D~D~g~n~~i~Y~i~~~~~-----------------------~~~F~i~~~tg~i~~~~~LD~e~~~~y~l~v~a~D~~ 57 (79)
T smart00112 1 TDADSGENGKVTYSILSGNE-----------------------DGLFSIDPETGEITTTKPLDREEQPEYTLTVEATDGG 57 (79)
T ss_pred CCCCCCcCcEEEEEEecCCC-----------------------CCEEEEeCCccEEEeCCccCeeCCCeEEEEEEEEECC
Confidence 47888888888888887765 4799999999999999999999999999999999999
Q ss_pred CCcccceEEEEEEEecCCCCCC
Q psy8617 94 TPVSTAYATVAINVLDANDNDP 115 (184)
Q Consensus 94 ~p~~~~~~~v~I~v~d~Nd~~P 115 (184)
.|++++.+.|+|.|.|+|||+|
T Consensus 58 ~~~~~~~~~v~I~V~D~Nd~~P 79 (79)
T smart00112 58 GPPLSSTATVTVTVLDVNDNAP 79 (79)
T ss_pred CCCcccEEEEEEEEEECCCCCC
Confidence 9999999999999999999998
No 8
>KOG1834|consensus
Probab=99.61 E-value=1.4e-14 Score=123.01 Aligned_cols=129 Identities=25% Similarity=0.356 Sum_probs=106.9
Q ss_pred EeeCCcccccceEEeCcc--cEEEECCCCCcccccEEEEEEEEEeCCC-C-----cccceEEEEEEEecCCCCCCccCCC
Q psy8617 49 LIQDDKLVHDTFHIMPNT--GWLSLKQPMDRETRDKFLLKIAACDNGT-P-----VSTAYATVAINVLDANDNDPKFSKQ 120 (184)
Q Consensus 49 ~~~~~~~~~~~F~I~~~t--G~i~~~~~lD~E~~~~~~l~v~a~d~~~-p-----~~~~~~~v~I~v~d~Nd~~P~f~~~ 120 (184)
.+++.+.+...--+|..| |.|+.+.+||.|.++.|+|+|+|.|+|. | ..+..++|.|.|.|+|+++|.|..+
T Consensus 73 klhgq~vPFdavVvdK~TGegvlRaK~~lDCelqkeytf~iQAydCg~gpdgtn~kKShkatvhIrVkDvNe~AP~f~ep 152 (952)
T KOG1834|consen 73 KLHGQPVPFDAVVVDKYTGEGVLRAKEPLDCELQKEYTFTIQAYDCGNGPDGTNTKKSHKATVHIRVKDVNEFAPVFKEP 152 (952)
T ss_pred EecCCCCCceEEEEeccCCceEEeecCcccccccccceEEEEEEecCCCCCccccccccceEEEEEeccccccCchhccc
Confidence 334444444445556655 5799999999999999999999999874 4 3677789999999999999999999
Q ss_pred ceEEEEeCCCCCCcEEEEEEEEeCCCC-CCceE-EEEEecCCCcEEEEcCccEEEEccccc
Q psy8617 121 DYEFSIEENLPRGTVVGSVKATDEDLA-SNAAL-RYSLIPSNTSFQINPFSGKKSILYLHS 179 (184)
Q Consensus 121 ~~~~~v~E~~~~gt~v~~v~a~D~D~~-~n~~i-~y~i~~~~~~F~Id~~tG~i~~~~~~~ 179 (184)
.|.+.|.|. +.-..|++|.|.|.|-+ ++++| .|.|...+-+|.||. .|.|+.+..+.
T Consensus 153 ~Yka~V~EG-K~yd~il~veAiD~DCspq~sqIC~YEI~t~d~PFaIdn-~G~irnTekLn 211 (952)
T KOG1834|consen 153 WYKAHVTEG-KVYDSILRVEAIDKDCSPQYSQICEYEITTPDVPFAIDN-DGNIRNTEKLN 211 (952)
T ss_pred ceeeEEecc-eeeeeeEEEEeecCCCCCcccceeEEEecCCCCceEEcC-CCccccccccc
Confidence 999999998 56777999999999986 67877 899999999999984 88888877554
No 9
>cd00031 CA Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here
Probab=99.49 E-value=7.2e-13 Score=100.03 Aligned_cols=85 Identities=39% Similarity=0.653 Sum_probs=78.3
Q ss_pred cCCCeEEEEEEEEeCCCCCCcEEEEEEecCCCcccccCCCcceEEEEEeeCCcccccceEEeCcccEEEECCCCCccccc
Q psy8617 2 VPCLFQILQVTAVDLDTGNNARITYKLIQDDTAVDLDTGNNARITYKLIQDDKLVHDTFHIMPNTGWLSLKQPMDRETRD 81 (184)
Q Consensus 2 ~p~Gt~v~~v~A~D~D~g~n~~i~Y~i~~~~~~~d~d~~~~~~~~~~~~~~~~~~~~~F~I~~~tG~i~~~~~lD~E~~~ 81 (184)
.|+|+.|+++.|+|+|.+.++.++|+|.++.. .++|.|++.+|.|++.+.||+|...
T Consensus 115 ~~~~~~i~~~~a~D~D~~~~~~~~y~l~~~~~-----------------------~~~f~i~~~~G~i~~~~~ld~e~~~ 171 (199)
T cd00031 115 APPGTVVGTVTATDADSGENAKLTYSILSGND-----------------------KELFSIDPNTGIITLAKPLDREEKS 171 (199)
T ss_pred CCCCCEEEEEEEEcCCCCCCccEEEEEeCCCC-----------------------CCEEEEeCCceEEEeCCccCCccCc
Confidence 57899999999999999889999999988875 4699999999999999999999999
Q ss_pred EEEEEEEEEeCCCCcccceEEEEEEEec
Q psy8617 82 KFLLKIAACDNGTPVSTAYATVAINVLD 109 (184)
Q Consensus 82 ~~~l~v~a~d~~~p~~~~~~~v~I~v~d 109 (184)
.|.+.|.|+|.+.|.++.++.++|.|.|
T Consensus 172 ~~~l~v~a~D~~~~~~~~~~~i~i~v~d 199 (199)
T cd00031 172 SYELTVVATDGGGPPLSSTATVTVTVLD 199 (199)
T ss_pred eEEEEEEEEECCCCCceeEEEEEEEEEC
Confidence 9999999999988778888999998875
No 10
>PF00028 Cadherin: Cadherin domain; InterPro: IPR002126 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion.; GO: 0005509 calcium ion binding, 0007156 homophilic cell adhesion, 0016020 membrane; PDB: 2A4E_A 2A4C_B 2O72_A 2QVI_A 1NCJ_A 3Q2W_A 3Q2N_A 3LNH_B 3LNI_A 3Q2L_A ....
Probab=99.42 E-value=1.3e-12 Score=87.37 Aligned_cols=62 Identities=40% Similarity=0.592 Sum_probs=59.0
Q ss_pred eEEEEeCCCCCCcEEEEEEEEeCCCCCCceEEEEEecCC--CcEEEEcCccEEEEccccccccc
Q psy8617 122 YEFSIEENLPRGTVVGSVKATDEDLASNAALRYSLIPSN--TSFQINPFSGKKSILYLHSVIHM 183 (184)
Q Consensus 122 ~~~~v~E~~~~gt~v~~v~a~D~D~~~n~~i~y~i~~~~--~~F~Id~~tG~i~~~~~~~~~~~ 183 (184)
|.+.|+|++++|+.++++.|.|+|.+.|+.+.|+|.+++ .+|.|++.+|.|++.+.++.|..
T Consensus 1 Y~~~v~E~~~~g~~v~~v~a~D~D~~~n~~i~y~i~~~~~~~~F~I~~~tg~i~~~~~LD~E~~ 64 (93)
T PF00028_consen 1 YSFSVPENAPPGTVVGQVTATDPDSGPNSQITYSILGGNPDGLFSIDPNTGEISLKKPLDRETQ 64 (93)
T ss_dssp EEEEEETTGSTSSEEEEEEEEESSTSTTSSEEEEEEETTSTTSEEEETTTTEEEESSSSCTTTT
T ss_pred CEEEEECCCCCCCEEEEEEEEeCCCCCCceEEEEEecCcccCceEEeeeeeccccceecCcccC
Confidence 789999999999999999999999999999999999987 99999999999999999998753
No 11
>smart00112 CA Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
Probab=98.08 E-value=6.8e-06 Score=52.84 Aligned_cols=41 Identities=27% Similarity=0.382 Sum_probs=36.9
Q ss_pred EeCCCCCCceEEEEEecCC--CcEEEEcCccEEEEcccccccc
Q psy8617 142 TDEDLASNAALRYSLIPSN--TSFQINPFSGKKSILYLHSVIH 182 (184)
Q Consensus 142 ~D~D~~~n~~i~y~i~~~~--~~F~Id~~tG~i~~~~~~~~~~ 182 (184)
+|+|.|.|+.++|+|.+++ .+|.|++.+|.|++.+.++.|.
T Consensus 1 ~D~D~g~n~~i~Y~i~~~~~~~~F~i~~~tg~i~~~~~LD~e~ 43 (79)
T smart00112 1 TDADSGENGKVTYSILSGNEDGLFSIDPETGEITTTKPLDREE 43 (79)
T ss_pred CCCCCCcCcEEEEEEecCCCCCEEEEeCCccEEEeCCccCeeC
Confidence 4889989999999999887 8999999999999998888774
No 12
>KOG1834|consensus
Probab=98.07 E-value=2.6e-05 Score=67.56 Aligned_cols=79 Identities=22% Similarity=0.340 Sum_probs=65.8
Q ss_pred eEEEEEEEEeCCCCC-CcE-EEEEEecCCCcccccCCCcceEEEEEeeCCcccccceEEeCcccEEEECCCCCcccccEE
Q psy8617 6 FQILQVTAVDLDTGN-NAR-ITYKLIQDDTAVDLDTGNNARITYKLIQDDKLVHDTFHIMPNTGWLSLKQPMDRETRDKF 83 (184)
Q Consensus 6 t~v~~v~A~D~D~g~-n~~-i~Y~i~~~~~~~d~d~~~~~~~~~~~~~~~~~~~~~F~I~~~tG~i~~~~~lD~E~~~~~ 83 (184)
..|++|.|.|.|.++ +++ ..|.|...+- +|.||. .|.|+.+++|.|.+..+|
T Consensus 165 d~il~veAiD~DCspq~sqIC~YEI~t~d~-------------------------PFaIdn-~G~irnTekLny~ke~~Y 218 (952)
T KOG1834|consen 165 DSILRVEAIDKDCSPQYSQICEYEITTPDV-------------------------PFAIDN-DGNIRNTEKLNYTKEHQY 218 (952)
T ss_pred eeeEEEEeecCCCCCcccceeEEEecCCCC-------------------------ceEEcC-CCccccccccccccceeE
Confidence 458999999999764 555 4788887654 899976 599999999999999999
Q ss_pred EEEEEEEeCCCCcccceEEEEEEEecC
Q psy8617 84 LLKIAACDNGTPVSTAYATVAINVLDA 110 (184)
Q Consensus 84 ~l~v~a~d~~~p~~~~~~~v~I~v~d~ 110 (184)
.|+|.|.|+|.....+.+.|+|.|...
T Consensus 219 ~ltVtAyDCg~kraa~d~lV~v~Vkp~ 245 (952)
T KOG1834|consen 219 KLTVTAYDCGKKRAASDSLVTVHVKPT 245 (952)
T ss_pred EEEEEEEecccccccCcceEEEEecCc
Confidence 999999999976656667888888765
No 13
>PF08266 Cadherin_2: Cadherin-like; InterPro: IPR013164 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion. This entry represents a cadherin domain that is usually found at the N terminus of cadherin proteins.; PDB: 1WUZ_A 1WYJ_A.
Probab=97.74 E-value=3.8e-05 Score=50.30 Aligned_cols=23 Identities=30% Similarity=0.448 Sum_probs=18.2
Q ss_pred ccceEEeCcccEEEECCCCCccc
Q psy8617 57 HDTFHIMPNTGWLSLKQPMDRET 79 (184)
Q Consensus 57 ~~~F~I~~~tG~i~~~~~lD~E~ 79 (184)
..+|.++..+|.|+++..+|||+
T Consensus 44 ~~~~~v~~~tG~L~v~~rIDRE~ 66 (84)
T PF08266_consen 44 SQYFRVNEKTGDLFVSERIDREE 66 (84)
T ss_dssp S-SEEE-TTTSEEEESS--SCCC
T ss_pred cceeEecCCceeEEeCCccCHHH
Confidence 67999999999999999999998
No 14
>PF08266 Cadherin_2: Cadherin-like; InterPro: IPR013164 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion. This entry represents a cadherin domain that is usually found at the N terminus of cadherin proteins.; PDB: 1WUZ_A 1WYJ_A.
Probab=97.23 E-value=0.00088 Score=43.86 Aligned_cols=60 Identities=22% Similarity=0.398 Sum_probs=37.0
Q ss_pred eEEEEeCCCCCCcEEEEEEEEeCCCCC--CceEEEEEecC--CCcEEEEcCccEEEEcccccccc
Q psy8617 122 YEFSIEENLPRGTVVGSVKATDEDLAS--NAALRYSLIPS--NTSFQINPFSGKKSILYLHSVIH 182 (184)
Q Consensus 122 ~~~~v~E~~~~gt~v~~v~a~D~D~~~--n~~i~y~i~~~--~~~F~Id~~tG~i~~~~~~~~~~ 182 (184)
..+.|+|..++|+.|+.| |.|..... ...-.|+|.+. ..+|.+++.||.|++..-.|+|.
T Consensus 3 i~YsV~EE~~~Gt~IGni-a~dL~l~~~~l~~~~~ri~s~~~~~~~~v~~~tG~L~v~~rIDRE~ 66 (84)
T PF08266_consen 3 IRYSVPEEMPPGTVIGNI-AKDLGLDPQSLSSRNFRIVSEGNSQYFRVNEKTGDLFVSERIDREE 66 (84)
T ss_dssp EEEEEESS--TT-EEEEC-CCCCT--HHHHCCTTBEEE-SSSS-SEEE-TTTSEEEESS--SCCC
T ss_pred eEEEeecCCCCCCEEEEh-HHhhCCCcccccccceEEeecCCcceeEecCCceeEEeCCccCHHH
Confidence 467899999999999998 55543221 11234665554 39999999999999999888874
No 15
>smart00736 CADG Dystroglycan-type cadherin-like domains. Cadherin-homologous domains present in metazoan dystroglycans and alpha/epsilon sarcoglycans, yeast Axl2p and in a very large protein from magnetotactic bacteria. Likely to bind calcium ions.
Probab=96.70 E-value=0.05 Score=36.34 Aligned_cols=52 Identities=29% Similarity=0.388 Sum_probs=40.9
Q ss_pred ccceEEeCcccEEEECCCCCcccccEEEEEEEEEeCCCCcccceEEEEEEEecCCC
Q psy8617 57 HDTFHIMPNTGWLSLKQPMDRETRDKFLLKIAACDNGTPVSTAYATVAINVLDAND 112 (184)
Q Consensus 57 ~~~F~I~~~tG~i~~~~~lD~E~~~~~~l~v~a~d~~~p~~~~~~~v~I~v~d~Nd 112 (184)
.+|...++.++.+.=. +.... ...|.+.|.|+|..+ .+....++|.|.+.|+
T Consensus 45 P~Wl~fd~~~~~~~Gt-P~~~~-~g~~~i~v~a~D~~g--~~~~~~f~i~V~~~~~ 96 (97)
T smart00736 45 PSWLSFDSDTGTLSGT-PTNSD-VGSLSLKVTATDSSG--ASASDTFTITVVNTND 96 (97)
T ss_pred CCeEEEeCCCCEEEEE-CCCCC-CcEEEEEEEEEECCC--CEEEEEEEEEEeCCCC
Confidence 4699999999888763 43333 456999999999876 5667789999999886
No 16
>PF08758 Cadherin_pro: Cadherin prodomain like; InterPro: IPR014868 Cadherins are a group of proteins that mediate calcium dependent cell-cell adhesion. They are activated through cleavage of a prosequence in the late Golgi. This protein corresponds to the folded region of the prosequence, and is termed the prodomain. The prodomain shows structural resemblance to the cadherin domain, but lacks all the features known to be important for cadherin-cadherin interactions []. ; GO: 0007155 cell adhesion, 0016021 integral to membrane; PDB: 1OP4_A.
Probab=96.48 E-value=0.0077 Score=39.94 Aligned_cols=62 Identities=21% Similarity=0.321 Sum_probs=30.9
Q ss_pred CCCccCCCceEEEEeCCCCCCcEEEEEEEEeCCCCCCceEEEEEecCCCcEEEEcCccEEEEccccc
Q psy8617 113 NDPKFSKQDYEFSIEENLPRGTVVGSVKATDEDLASNAALRYSLIPSNTSFQINPFSGKKSILYLHS 179 (184)
Q Consensus 113 ~~P~f~~~~~~~~v~E~~~~gt~v~~v~a~D~D~~~n~~i~y~i~~~~~~F~Id~~tG~i~~~~~~~ 179 (184)
.-|-|.+..|.+.|+.+...|..|++|.-.|... +..+.|.-. +..|.|.+ .|.|++++.+.
T Consensus 2 C~pGF~~~~~~~~Vp~~l~~g~~lg~V~f~dC~~--~~~~~~~ss--DpdF~V~~-DGsVy~~r~v~ 63 (90)
T PF08758_consen 2 CRPGFSQKKYTFEVPSNLEAGQPLGKVNFEDCTG--RRRVIFESS--DPDFRVLE-DGSVYAKRPVQ 63 (90)
T ss_dssp ---B--S-EEEE----SS-SS--EEE---B--SS-----EEEE-----SEEEEET-TTEEEEES--S
T ss_pred CcCCcccceEEEEcCchhhCCcEEEEEEeccCCC--CCceEEecC--CCCEEEcC-CCeEEEeeeEe
Confidence 3588999999999999999999999999999853 456777654 45999986 89999988665
No 17
>PF08758 Cadherin_pro: Cadherin prodomain like; InterPro: IPR014868 Cadherins are a group of proteins that mediate calcium dependent cell-cell adhesion. They are activated through cleavage of a prosequence in the late Golgi. This protein corresponds to the folded region of the prosequence, and is termed the prodomain. The prodomain shows structural resemblance to the cadherin domain, but lacks all the features known to be important for cadherin-cadherin interactions []. ; GO: 0007155 cell adhesion, 0016021 integral to membrane; PDB: 1OP4_A.
Probab=95.04 E-value=0.35 Score=31.97 Aligned_cols=62 Identities=16% Similarity=0.245 Sum_probs=33.6
Q ss_pred CCCeEEEEEEEEeCCCCCCcEEEEEEecCCCcccccCCCcceEEEEEeeCCcccccceEEeCcccEEEECCCCCcccccE
Q psy8617 3 PCLFQILQVTAVDLDTGNNARITYKLIQDDTAVDLDTGNNARITYKLIQDDKLVHDTFHIMPNTGWLSLKQPMDRETRDK 82 (184)
Q Consensus 3 p~Gt~v~~v~A~D~D~g~n~~i~Y~i~~~~~~~d~d~~~~~~~~~~~~~~~~~~~~~F~I~~~tG~i~~~~~lD~E~~~~ 82 (184)
..|+.|++|.-.|.. .+..+.|.-. ...|.|.++ |.|++++.+.....+
T Consensus 20 ~~g~~lg~V~f~dC~--~~~~~~~~ss---------------------------DpdF~V~~D-GsVy~~r~v~l~~~~- 68 (90)
T PF08758_consen 20 EAGQPLGKVNFEDCT--GRRRVIFESS---------------------------DPDFRVLED-GSVYAKRPVQLSSEQ- 68 (90)
T ss_dssp -SS--EEE---B--S--S---EEEE------------------------------SEEEEETT-TEEEEES--S-SSS--
T ss_pred hCCcEEEEEEeccCC--CCCceEEecC---------------------------CCCEEEcCC-CeEEEeeeEecCCCc-
Confidence 358889999988874 3456777642 237999775 999999998775443
Q ss_pred EEEEEEEEeCCCC
Q psy8617 83 FLLKIAACDNGTP 95 (184)
Q Consensus 83 ~~l~v~a~d~~~p 95 (184)
-.|.|.|.|..+.
T Consensus 69 ~~F~V~a~D~~~~ 81 (90)
T PF08758_consen 69 RSFTVHAWDSQTQ 81 (90)
T ss_dssp EEEEEEEEETTTT
T ss_pred eEEEEEEECCCCC
Confidence 4688999998663
No 18
>TIGR01965 VCBS_repeat VCBS repeat. This domain of about 100 residues is found multiple (up to 35) copies in long proteins from several species of Vibrio, Colwellia, Bradyrhizobium, and Shewanella (hence the name VCBS) and in smaller copy numbers in proteins from several other bacteria. The large protein size and repeat copy numbers, species distribution, and suggested activities of several member proteins suggests a role for this domain in adhesion.
Probab=95.04 E-value=0.41 Score=32.27 Aligned_cols=88 Identities=22% Similarity=0.199 Sum_probs=51.8
Q ss_pred EEEEEEeCCCCCCcEEEEEEecCCCcccccCCCcceEEEEEeeCCcccccceEEeCcccEEEEC--------CCCCcccc
Q psy8617 9 LQVTAVDLDTGNNARITYKLIQDDTAVDLDTGNNARITYKLIQDDKLVHDTFHIMPNTGWLSLK--------QPMDRETR 80 (184)
Q Consensus 9 ~~v~A~D~D~g~n~~i~Y~i~~~~~~~d~d~~~~~~~~~~~~~~~~~~~~~F~I~~~tG~i~~~--------~~lD~E~~ 80 (184)
+++.++|+|.+. ...+++..... ..+.|.|++ +|...-. +.|..-+.
T Consensus 2 G~Lt~sD~D~gd--~~~~s~~~~~g----------------------~yGtlti~~-~G~wtYtl~n~~~avq~L~~Ge~ 56 (99)
T TIGR01965 2 GQLTISDADAGQ--AHFIAQTDAAG----------------------QYGTFSIDA-DGQWTYQADNSQTAVQALKAGET 56 (99)
T ss_pred CceEEeCCCCCC--ceEEecccccC----------------------CcEEEEECC-CCcEEEEeCCCcHHHHhhcCCCE
Confidence 468899999754 34555543321 145677766 4533221 22333334
Q ss_pred cEEEEEEEEEeCCCCcccceEEEEEEEecCCCCCCccCCCceEEEEeCC
Q psy8617 81 DKFLLKIAACDNGTPVSTAYATVAINVLDANDNDPKFSKQDYEFSIEEN 129 (184)
Q Consensus 81 ~~~~l~v~a~d~~~p~~~~~~~v~I~v~d~Nd~~P~f~~~~~~~~v~E~ 129 (184)
-.-.|.+.+.|+ .+..|+|.|.-.|| +|..... -...+.|+
T Consensus 57 ~tdsFtvtv~DG------tt~~vtItI~GtND-apvi~~~-~~g~v~ED 97 (99)
T TIGR01965 57 LTDTFTVTSADG------TSQTVTITITGAND-AAVIGGA-DTGSVTED 97 (99)
T ss_pred EEEEEEEEEeCC------CeEEEEEEEEccCC-CCEEecc-cceeEecC
Confidence 345677778885 27789999999999 6655332 23455554
No 19
>PF05345 He_PIG: Putative Ig domain; InterPro: IPR008009 This alignment represents the conserved core region of a ~90 residue repeat found in several haemagglutinins and other cell surface proteins. Sequence similarities to Hyalin (IPR003410 from INTERPRO) and the PKD domain (IPR000601 from INTERPRO) suggest an Ig-like fold so this family may be similar in function to the (IPR003791 from INTERPRO) and (IPR003790 from INTERPRO) protein families.
Probab=86.10 E-value=4.8 Score=23.26 Aligned_cols=35 Identities=17% Similarity=0.274 Sum_probs=26.7
Q ss_pred ccceEEeCcccEEEECCCCCcc-cccEEEEEEEEEeCC
Q psy8617 57 HDTFHIMPNTGWLSLKQPMDRE-TRDKFLLKIAACDNG 93 (184)
Q Consensus 57 ~~~F~I~~~tG~i~~~~~lD~E-~~~~~~l~v~a~d~~ 93 (184)
..+..||+.+|.|+=. .+.+ ....|.+.|.|+|..
T Consensus 13 P~gLs~d~~tG~isGt--p~~~~~~G~y~~~vtatd~~ 48 (49)
T PF05345_consen 13 PSGLSLDPSTGTISGT--PTSSVQPGTYTFTVTATDGS 48 (49)
T ss_pred CCcEEEeCCCCEEEee--cCCCccccEEEEEEEEEcCC
Confidence 5799999999999865 2233 235899999999874
No 20
>TIGR00845 caca sodium/calcium exchanger 1. This model is specific for the eukaryotic sodium ion/calcium ion exchangers of the Caca family
Probab=81.35 E-value=49 Score=31.25 Aligned_cols=56 Identities=20% Similarity=0.329 Sum_probs=31.3
Q ss_pred eEEEEEEEecCCCCCCccCCCceEEEEeCCCCCCcEEEEEEEEeCCCCCCceEEEEEecC
Q psy8617 100 YATVAINVLDANDNDPKFSKQDYEFSIEENLPRGTVVGSVKATDEDLASNAALRYSLIPS 159 (184)
Q Consensus 100 ~~~v~I~v~d~Nd~~P~f~~~~~~~~v~E~~~~gt~v~~v~a~D~D~~~n~~i~y~i~~~ 159 (184)
....+|+|.| ||++|.|.-..-.+.|.|+. |+.-.+|.-.-.-.+ .-.+.|.-..+
T Consensus 514 ps~ATVTIlD-DD~aGIfsFe~~~~sV~Es~--G~vtvtV~RtsGa~G-~VtV~Y~T~dG 569 (928)
T TIGR00845 514 PNTATVTILD-DDHAGIFTFEEDVFHVSESI--GIMEVKVLRTSGARG-TVIVPYRTVEG 569 (928)
T ss_pred CceEEEEEec-CcccCcccccCceEEEEcCC--CEEEEEEEEcCCCCe-eEEEEEEeecC
Confidence 3456777787 68899876555567788874 554444333211111 22355666554
No 21
>PF07495 Y_Y_Y: Y_Y_Y domain; InterPro: IPR011123 This region is mostly found at the end of the beta propellers (IPR011110 from INTERPRO) in a family of two component regulators. However they are also found tandemly repeated in Q891H4 from SWISSPROT without other signal conduction domains being present. It is named after the conserved tyrosines found in the alignment. The exact function is not known.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=81.31 E-value=9.3 Score=22.89 Aligned_cols=29 Identities=24% Similarity=0.346 Sum_probs=18.6
Q ss_pred cccEEEEEEEEEeCCCCcccceEEEEEEE
Q psy8617 79 TRDKFLLKIAACDNGTPVSTAYATVAINV 107 (184)
Q Consensus 79 ~~~~~~l~v~a~d~~~p~~~~~~~v~I~v 107 (184)
....|.|.|.|.|..+........+.|.|
T Consensus 37 ~~G~Y~l~V~a~~~~~~~~~~~~~l~i~I 65 (66)
T PF07495_consen 37 PPGKYTLEVRAKDNNGKWSSDEKSLTITI 65 (66)
T ss_dssp -SEEEEEEEEEEETTS-B-SS-EEEEEEE
T ss_pred CCEEEEEEEEEECCCCCcCcccEEEEEEE
Confidence 35679999999997764444335666665
No 22
>TIGR03660 T1SS_rpt_143 T1SS-143 repeat domain. This model represents a domain of about 143 amino acids that may occur singly or in up to 23 tandem repeats in very large proteins in the genus Vibrio, and in related species such as Legionella pneumophila, Photobacterium profundum, Rhodopseudomonas palustris, Shewanella pealeana, and Aeromonas hydrophila. Proteins with these domains represent a subset of a broader set of proteins with a particular signal for type 1 secretion, consisting of several glycine-rich repeats modeled by pfam00353, followed by a C-terminal domain modeled by TIGR03661. Proteins with this domain tend to share several properties with the RtxA (Repeats in Toxin) protein of Vibrio cholerae, including a large size often containing tandemly repeated domains and a C-terminal signal for type 1 secretion.
Probab=73.36 E-value=31 Score=24.66 Aligned_cols=67 Identities=22% Similarity=0.389 Sum_probs=41.3
Q ss_pred cceEEeC-ccc--EEEECCCCCccc---ccEEEEEEEEEeCCCCcccceEEEEEEEecCCCCCCccCCCceEEEEeCCC
Q psy8617 58 DTFHIMP-NTG--WLSLKQPMDRET---RDKFLLKIAACDNGTPVSTAYATVAINVLDANDNDPKFSKQDYEFSIEENL 130 (184)
Q Consensus 58 ~~F~I~~-~tG--~i~~~~~lD~E~---~~~~~l~v~a~d~~~p~~~~~~~v~I~v~d~Nd~~P~f~~~~~~~~v~E~~ 130 (184)
..|+|.- ..| ...+.++||+.. .-...|.|.|+|..+... ...+.|.|.| | .|...... .+.|.|..
T Consensus 56 ~Vftvtl~~~GsYtftL~~~lDH~~g~d~l~l~~~v~a~D~DGD~s--~~~l~VtI~D--D-~P~~~~~~-~~~V~E~~ 128 (137)
T TIGR03660 56 PVFTLTLNADGSYEFTLEGPLDHAAGSDELTLNFPIIATDFDGDTS--SITLPVTIVD--D-VPTITDVD-ALTVDEDD 128 (137)
T ss_pred EEEEEEECCCccEEEEEcccccCCCCCceEEEeeeEEEEeCCCCcc--ccEEEEEEEC--C-CCeecccc-ceEEeccc
Confidence 3555531 234 455668898854 446788899998766433 3477888877 4 46654433 36777743
No 23
>KOG3597|consensus
Probab=69.94 E-value=22 Score=30.50 Aligned_cols=60 Identities=18% Similarity=0.118 Sum_probs=45.2
Q ss_pred ceEEEEEEEecCCCCCCccCCCceEEEEeCCCCCCcEEEEEEEEeCCCCCCceEEEEEecC
Q psy8617 99 AYATVAINVLDANDNDPKFSKQDYEFSIEENLPRGTVVGSVKATDEDLASNAALRYSLIPS 159 (184)
Q Consensus 99 ~~~~v~I~v~d~Nd~~P~f~~~~~~~~v~E~~~~gt~v~~v~a~D~D~~~n~~i~y~i~~~ 159 (184)
.+...+|.|..+||.+..+-...+.+.+.|+...-.....+.+.|+|.+ -..+.|++.+.
T Consensus 24 ~~~~~~i~v~pvndpp~~~~~~~~~l~~~~~~~k~l~~~~l~~~d~d~~-~~~l~f~v~~t 83 (442)
T KOG3597|consen 24 QTDVLRIHVNPVNDPPSLIFPSGSLLVILEGGQKVLDPELLTAADPDSA-PLPLEFQVLGT 83 (442)
T ss_pred EEeeecccccccCCCcceeecccceEEeecCCceeccceEeeccCCCCC-ccceEEEEccC
Confidence 4556889999999977777666677888887665555667889999876 34677777764
No 24
>KOG4221|consensus
Probab=51.58 E-value=2.5e+02 Score=27.70 Aligned_cols=69 Identities=12% Similarity=0.201 Sum_probs=49.7
Q ss_pred EEEEEeeCCcccccceEEeCcccEEEECCCCCcccccEEEEEEEEEeCCCCcccceEEEEEEEecCCCCCCc
Q psy8617 45 ITYKLIQDDKLVHDTFHIMPNTGWLSLKQPMDRETRDKFLLKIAACDNGTPVSTAYATVAINVLDANDNDPK 116 (184)
Q Consensus 45 ~~~~~~~~~~~~~~~F~I~~~tG~i~~~~~lD~E~~~~~~l~v~a~d~~~p~~~~~~~v~I~v~d~Nd~~P~ 116 (184)
..|++.-.......++.++.++-++++. +.|....|.+.|.|....++..++...-.++..|+=+.||.
T Consensus 552 ~~yk~~ys~~~~~~~~~~~~n~~e~ti~---gL~k~TeY~~~vvA~N~~G~g~sS~~i~V~Tlsd~PsaPP~ 620 (1381)
T KOG4221|consen 552 TGYKLFYSEDDTGKELRVENNATEYTIN---GLEKYTEYSIRVVAYNSAGSGVSSADITVRTLSDVPSAPPQ 620 (1381)
T ss_pred eEEEEEEEcCCCCceEEEecCccEEEee---cCCCccceEEEEEEecCCCCCCCCCceEEEeccCCCCCCCc
Confidence 5666655444446788888877777766 46777889999999998887766655555667777776775
No 25
>COG2706 3-carboxymuconate cyclase [Carbohydrate transport and metabolism]
Probab=47.82 E-value=1.6e+02 Score=24.46 Aligned_cols=72 Identities=15% Similarity=0.273 Sum_probs=43.6
Q ss_pred EEEEEecCCCCCCccCCCceEEEEeCCCCCCcEEEEEEEEeCCCCCCceEEEEEecCC---CcEEEEcCccEEEEccccc
Q psy8617 103 VAINVLDANDNDPKFSKQDYEFSIEENLPRGTVVGSVKATDEDLASNAALRYSLIPSN---TSFQINPFSGKKSILYLHS 179 (184)
Q Consensus 103 v~I~v~d~Nd~~P~f~~~~~~~~v~E~~~~gt~v~~v~a~D~D~~~n~~i~y~i~~~~---~~F~Id~~tG~i~~~~~~~ 179 (184)
-+|.|...|...-.|..-+....++|+-.-..-.+-|+. .+ ++++-|---.+. ..|.|++.+|.+.+...-.
T Consensus 213 stV~v~~y~~~~g~~~~lQ~i~tlP~dF~g~~~~aaIhi-s~----dGrFLYasNRg~dsI~~f~V~~~~g~L~~~~~~~ 287 (346)
T COG2706 213 STVDVLEYNPAVGKFEELQTIDTLPEDFTGTNWAAAIHI-SP----DGRFLYASNRGHDSIAVFSVDPDGGKLELVGITP 287 (346)
T ss_pred CEEEEEEEcCCCceEEEeeeeccCccccCCCCceeEEEE-CC----CCCEEEEecCCCCeEEEEEEcCCCCEEEEEEEec
Confidence 346677776655667666666667776322222222222 23 467777776653 7788888888887766444
No 26
>PF03160 Calx-beta: Calx-beta domain; InterPro: IPR003644 The calx-beta motif is present as a tandem repeat in the cytoplasmic domains of Calx Na-Ca exchangers, which are used to expel calcium from cells. This motif overlaps domains used for calcium binding and regulation. The calx-beta motif is also present in the cytoplasmic tail of mammalian integrin-beta4, which mediates the bi-directional transfer of signals across the plasma membrane, as well as in some cyanobacterial proteins. This motif contains a series of beta-strands and turns that form a self-contained beta-sheet [, ].; GO: 0007154 cell communication, 0016021 integral to membrane; PDB: 3H6A_B 3FSO_A 3FQ4_B 2DPK_A 2QVM_A 3GIN_B 2QVK_A 2FWU_A 2FWS_A 3E9U_A ....
Probab=45.11 E-value=84 Score=20.37 Aligned_cols=52 Identities=23% Similarity=0.409 Sum_probs=29.6
Q ss_pred EEEEecCCCCCCccCCCceEEEEeCCCCCCcEEEEEEEEeCCCCCCceEEEEEecC
Q psy8617 104 AINVLDANDNDPKFSKQDYEFSIEENLPRGTVVGSVKATDEDLASNAALRYSLIPS 159 (184)
Q Consensus 104 ~I~v~d~Nd~~P~f~~~~~~~~v~E~~~~gt~v~~v~a~D~D~~~n~~i~y~i~~~ 159 (184)
+|.|.| ||.+ .+.-..-...+.|+. |..-..|.....+....-.+.|...++
T Consensus 2 tvtI~d-~d~~-~v~f~~~~~~v~E~~--~~~~v~V~~~~~~~~~~v~v~~~~~~g 53 (100)
T PF03160_consen 2 TVTILD-DDDP-TVSFSSPSYTVSEGD--GTVTVTVTRSGGSLDGPVTVNYSTVDG 53 (100)
T ss_dssp EEEEE--TTSE-EEEESSSEEEEETTS--SEEEEEEEEESS-TSSEEEEEEEEEES
T ss_pred EEEEEC-CCCC-EEEEeCCEEEEEeCC--CEEEEEEEEcccCCCcceEEEEEEeCC
Confidence 467778 5655 665444456778875 445555555544433344677777665
No 27
>PF03413 PepSY: Peptidase propeptide and YPEB domain This Prosite motif covers only the active site. This is family M4 in the peptidase classification. ; InterPro: IPR005075 This signature, PepSY, is found in the propeptide of members of the MEROPS peptidase family M4 (clan MA(E)), which contains the thermostable thermolysins (3.4.24.27 from EC), and related thermolabile neutral proteases (bacillolysins) (3.4.24.28 from EC) from various species of Bacillus. It is also in many non-peptidase proteins, including Bacillus subtilis YpeB protein - a regulator of SleB spore cortex lytic enzyme - and a large number of eubacterial and archaeal cell wall-associated and secreted proteins which are mostly annotated as 'hypothetical protein'. Many extracellular bacterial proteases are produced as proenzymes. The propeptides usually have a dual function, i.e. they function as an intramolecular chaperone required for the folding of the polypeptide and as an inhibitor preventing premature activation of the enzyme. Analysis of the propeptide region of the M4 family of peptidases reveals two regions of conservation, the PepSY domain and a second domain, proximate to the N terminus, the FTP domain (IPR011096 from INTERPRO), which is also found in isolation in the propeptide of eukaryotic peptidases belong to MEROPS peptidase family M36. Propeptide domain swapping experiments, for example swapping the propeptide domain of PA protease with that of vibrolysin, both propeptides contain the FTP and PepSY domains, allows the PA protease domain to fold correctly and inhibits the C-terminal autoprocessing activity. However, swapping the propeptide of PA protease for the thermolysin propeptide, does not facilitate the correct folding nor the processing of the chimaeric protein into an active peptidase []. Mutational analysis of the Pseudomonas aeruginosa elastase gene revealed two mutations in the propeptide which resulted in the loss of inhibitory activity but not chaperone activity: A-15V and T-153I (where +1 is defined as the first residue of the mature peptidase). Both mutations resulted in peptidase activity, the T-153V mutation being much less effective than the A-15I mutation [] in activating peptidase activity. The T-153V mutation lies N-terminal to the FTP domain while the A-15I mutation is C-terminal to the PepSY domain. Given the diverse range of other proteins, both domains occur in in isolation, the exact function of each is still unclear; though it has been proposed that the PepSY domain primarily has inhibitory activity and in conjunction with the FTP domain in chaperone activity. ; GO: 0008237 metallopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0005576 extracellular region; PDB: 2GU3_A 3NQZ_A 3NQY_A 2KGY_A.
Probab=43.25 E-value=44 Score=19.45 Aligned_cols=12 Identities=8% Similarity=0.346 Sum_probs=7.5
Q ss_pred EEEEcCccEEEE
Q psy8617 163 FQINPFSGKKSI 174 (184)
Q Consensus 163 F~Id~~tG~i~~ 174 (184)
+.||+.||+|.-
T Consensus 51 v~VDa~tG~Il~ 62 (64)
T PF03413_consen 51 VYVDAYTGEILS 62 (64)
T ss_dssp EEEETTT--EEE
T ss_pred EEEECCCCeEEE
Confidence 349999999864
No 28
>PF00778 DIX: DIX domain; InterPro: IPR001158 Dishevelled (Dsh) protein is an important component of the Wnt signal-transduction pathway. It has three relatively conserved domains: DIX, PDZ and DEP. The DIX domain of Dvl-1 (a mammalian Dishevelled homologue) shares 37% identity with the C-terminal region of Axin. Dsh can interact with the Axin/APC/GSK3/beta-catenin complex, and may thus modulate its activity []. The Wnt signalling pathway is conserved in various species from Caenorhabditis elegans to mammals, and plays important roles in development, cellular proliferation, and differentiation. The molecular mechanisms by which the Wnt signal regulates cellular functions are becoming increasingly well understood. Wnt stabilises cytoplasmic beta-catenin, which stimulates the expression of genes including c-myc, c-jun, fra-1, and cyclin D1. Axin and its homologue Axil are components of the Wnt signalling pathway that negatively regulate this pathway. Other components of the Wnt signalling pathway, including Dvl, glycogen synthase kinase-3beta (GSK-3beta), beta-catenin, and adenomatous polyposis coli (APC), interact with Axin, and the phosphorylation and stability of beta-catenin are regulated in the Axin complex. Axil has similar functions to Axin. Thus, Axin and Axil act as scaffold proteins in the Wnt signalling pathway, thereby modulating the Wnt-dependent cellular functions [].; GO: 0004871 signal transducer activity, 0007275 multicellular organismal development, 0005622 intracellular; PDB: 1WSP_B 2D5G_A 3PZ7_A 3PZ8_H.
Probab=43.12 E-value=79 Score=20.54 Aligned_cols=56 Identities=18% Similarity=0.214 Sum_probs=32.1
Q ss_pred ccceEEeCcccEEEEC---CCCCcccccEEEEEEEEEeCCCCcccceEEEEEEEecCCCCCCccCC
Q psy8617 57 HDTFHIMPNTGWLSLK---QPMDRETRDKFLLKIAACDNGTPVSTAYATVAINVLDANDNDPKFSK 119 (184)
Q Consensus 57 ~~~F~I~~~tG~i~~~---~~lD~E~~~~~~l~v~a~d~~~p~~~~~~~v~I~v~d~Nd~~P~f~~ 119 (184)
.-.+.|....+.|++. +.|.+. ..|.+..++.|... .. .|...|.|-++.-|.|+.
T Consensus 16 Py~~~ip~~~~~iTL~dFK~~l~r~--G~yrffFK~~~~d~---~~--~V~eEi~dD~~~LP~~eG 74 (84)
T PF00778_consen 16 PYVVKIPKPPGEITLGDFKEVLPRP--GNYRFFFKSLDPDF---GC--VVKEEITDDDDILPLFEG 74 (84)
T ss_dssp -EEEEESSSSTT-BHHHHHHTCTS---SGEEEEEEEEETTT---ET--EEEEEE-STTSB---BTT
T ss_pred ceEEEccCCCCccCHHHHHHhhcCC--CCceEEEEEeCCCC---Ce--eeEEEEcCCcccccccCC
Confidence 4577777666788876 457555 34555556666532 11 688888998888888864
No 29
>PF12245 Big_3_2: Bacterial Ig-like domain (group 3); InterPro: IPR022038 This family of proteins is found in bacteria. They have two conserved sequence motifs: AGN and GMT.
Probab=42.11 E-value=75 Score=18.93 Aligned_cols=30 Identities=23% Similarity=0.392 Sum_probs=20.7
Q ss_pred ccEEEEEEEEEeCCCCcccceEEEEEEEecCC
Q psy8617 80 RDKFLLKIAACDNGTPVSTAYATVAINVLDAN 111 (184)
Q Consensus 80 ~~~~~l~v~a~d~~~p~~~~~~~v~I~v~d~N 111 (184)
...|.+.+.|+|..+. .......+.+.|..
T Consensus 22 dg~yt~~v~a~D~AGN--~~~~~~~~~i~d~~ 51 (60)
T PF12245_consen 22 DGEYTLTVTATDKAGN--TSSSTTQIVIVDNT 51 (60)
T ss_pred CccEEEEEEEEECCCC--EEEeeeEEEEEcCC
Confidence 5679999999998874 34445555555553
No 30
>PF14157 YmzC: YmzC-like protein; PDB: 3KVP_E.
Probab=37.19 E-value=60 Score=19.85 Aligned_cols=18 Identities=6% Similarity=0.030 Sum_probs=15.5
Q ss_pred CcEEEEcCccEEEEcccc
Q psy8617 161 TSFQINPFSGKKSILYLH 178 (184)
Q Consensus 161 ~~F~Id~~tG~i~~~~~~ 178 (184)
..|.-||.|++|++.+-+
T Consensus 42 KIfkyd~~tNei~L~KE~ 59 (63)
T PF14157_consen 42 KIFKYDEDTNEITLKKEF 59 (63)
T ss_dssp EEEEEETTTTEEEEEEEE
T ss_pred EEEEeCCCCCeEEEEEee
Confidence 678889999999998855
No 31
>smart00089 PKD Repeats in polycystic kidney disease 1 (PKD1) and other proteins. Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.
Probab=36.99 E-value=1e+02 Score=18.87 Aligned_cols=29 Identities=17% Similarity=0.219 Sum_probs=20.5
Q ss_pred CcccccEEEEEEEEEeCCCCcccceEEEEEEE
Q psy8617 76 DRETRDKFLLKIAACDNGTPVSTAYATVAINV 107 (184)
Q Consensus 76 D~E~~~~~~l~v~a~d~~~p~~~~~~~v~I~v 107 (184)
-|+....|.+++.+.|..+ +.+..+.|.|
T Consensus 50 ~y~~~G~y~v~l~v~n~~g---~~~~~~~i~v 78 (79)
T smart00089 50 TYTKPGTYTVTLTVTNAVG---SASATVTVVV 78 (79)
T ss_pred EeCCCcEEEEEEEEEcCCC---cEEEEEEEEE
Confidence 4566778999999998766 4455555554
No 32
>PF13750 Big_3_3: Bacterial Ig-like domain (group 3)
Probab=34.48 E-value=1.8e+02 Score=21.12 Aligned_cols=31 Identities=16% Similarity=0.357 Sum_probs=21.8
Q ss_pred CcccccEEEEEEEEEeCCCCcccceEEEEEEEe
Q psy8617 76 DRETRDKFLLKIAACDNGTPVSTAYATVAINVL 108 (184)
Q Consensus 76 D~E~~~~~~l~v~a~d~~~p~~~~~~~v~I~v~ 108 (184)
..|....|.|+|.|.|..+. ..+..+.+...
T Consensus 118 sle~~~~YtLtV~a~D~aGN--~~~~si~F~y~ 148 (158)
T PF13750_consen 118 SLEADDSYTLTVSATDKAGN--QSTKSISFSYM 148 (158)
T ss_pred CcCCCCeEEEEEEEEecCCC--EEEEEEEEEEe
Confidence 34778899999999998774 44444555443
No 33
>cd00146 PKD polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.
Probab=33.10 E-value=89 Score=19.20 Aligned_cols=19 Identities=0% Similarity=0.067 Sum_probs=15.3
Q ss_pred CcccccEEEEEEEEEeCCC
Q psy8617 76 DRETRDKFLLKIAACDNGT 94 (184)
Q Consensus 76 D~E~~~~~~l~v~a~d~~~ 94 (184)
.+.+...|.+++.++|..+
T Consensus 52 ~y~~~G~y~v~l~v~d~~g 70 (81)
T cd00146 52 TYTKPGTYTVTLTVTNAVG 70 (81)
T ss_pred EcCCCcEEEEEEEEEeCCC
Confidence 3667788999999999854
No 34
>PF02494 HYR: HYR domain; InterPro: IPR003410 This domain is known as the HYR (Hyalin Repeat) domain, after the protein hyalin that is composed exclusively of this repeat. This domain probably corresponds to a new superfamily in the immunoglobulin fold. The function of this domain is uncertain it may be involved in cell adhesion. In the Sushi repeat-containing protein (SrpX), this domain is found between two sushi repeats.
Probab=30.40 E-value=1e+02 Score=19.18 Aligned_cols=25 Identities=16% Similarity=0.226 Sum_probs=19.0
Q ss_pred cEEEEEEEEEeCCCCcccceEEEEEEE
Q psy8617 81 DKFLLKIAACDNGTPVSTAYATVAINV 107 (184)
Q Consensus 81 ~~~~l~v~a~d~~~p~~~~~~~v~I~v 107 (184)
..+.+...|+|..+ ..+.+.+.|.|
T Consensus 57 G~t~V~ytA~D~~G--N~a~C~f~V~V 81 (81)
T PF02494_consen 57 GTTTVTYTATDAAG--NSATCSFTVTV 81 (81)
T ss_pred ceEEEEEEEEECCC--CEEEEEEEEEC
Confidence 46889999999866 46677777764
No 35
>PF07861 WND: WisP family N-Terminal Region; InterPro: IPR012503 This family is found at the N terminus of the Tropheryma whipplei WisP family proteins [].
Probab=28.67 E-value=1.9e+02 Score=22.04 Aligned_cols=28 Identities=18% Similarity=0.239 Sum_probs=24.0
Q ss_pred CCceEEEEEecCCCcEEEEcCccEEEEc
Q psy8617 148 SNAALRYSLIPSNTSFQINPFSGKKSIL 175 (184)
Q Consensus 148 ~n~~i~y~i~~~~~~F~Id~~tG~i~~~ 175 (184)
.|..++|+|...+.-..||..||.|...
T Consensus 201 R~S~~T~SLs~P~~~v~lD~~TG~l~~S 228 (263)
T PF07861_consen 201 RGSPFTYSLSTPVAGVRLDANTGALSGS 228 (263)
T ss_pred cCCcceEEeccCCCceEEecccceeeee
Confidence 4678899999888999999999998764
No 36
>cd02848 Chitinase_N_term Chitinase N-terminus domain. Chitinases hydrolyze the abundant natural biopolymer chitin, producing smaller chito-oligosaccharides. Chitin consists of multiple N-acetyl-D-glucosamine (NAG) residues connected via beta-1,4-glycosidic linkages and is an important structural element of fungal cell wall and arthropod exoskeletons. On the basis of the mode of chitin hydrolysis, chitinases are classified as random, endo-, and exo-chitinases and based on sequence criteria, chitinases belong to families 18 and 19 of glycosyl hydrolases. The N-terminus of chitinase may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitob
Probab=25.11 E-value=2.3e+02 Score=19.32 Aligned_cols=33 Identities=15% Similarity=0.312 Sum_probs=22.5
Q ss_pred CCCcccccEEEEEEEEEeCCCCcccceEEEEEEEe
Q psy8617 74 PMDRETRDKFLLKIAACDNGTPVSTAYATVAINVL 108 (184)
Q Consensus 74 ~lD~E~~~~~~l~v~a~d~~~p~~~~~~~v~I~v~ 108 (184)
.+++.+...|.+.|++.|..+...++ .+.|.|-
T Consensus 73 t~~v~kgG~y~m~V~lCn~dGCS~S~--~~~I~VA 105 (106)
T cd02848 73 TFKVGKGGRYQMQVALCNGDGCSTSA--AKEIVVA 105 (106)
T ss_pred EEEeCCCCeEEEEEEEECCCCccCcC--CEEEEec
Confidence 35567778899999999887744444 4444444
No 37
>PF12461 DUF3688: Protein of unknown function (DUF3688) ; InterPro: IPR022160 This entry is represented by Spiroplasma phage 1-C74, Orf1. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This domain family is found in bacteria and viruses, and is typically between 79 and 104 amino acids in length. There is a conserved YRW sequence motif. There is a single completely conserved residue Y that may be functionally important.
Probab=22.61 E-value=85 Score=20.68 Aligned_cols=24 Identities=13% Similarity=0.018 Sum_probs=18.0
Q ss_pred eEEEEEecCC--CcEEEEcCccEEEE
Q psy8617 151 ALRYSLIPSN--TSFQINPFSGKKSI 174 (184)
Q Consensus 151 ~i~y~i~~~~--~~F~Id~~tG~i~~ 174 (184)
.-.|+..+.. ....||+.||+|+.
T Consensus 61 KsvYRWdG~gEPq~P~ID~ntG~Itd 86 (91)
T PF12461_consen 61 KSVYRWDGVGEPQTPTIDKNTGNITD 86 (91)
T ss_pred EEEEEecCCCCccCceEcCCCCeEeE
Confidence 4467776654 57889999999974
No 38
>PF09100 Qn_am_d_aIV: Quinohemoprotein amine dehydrogenase, alpha subunit domain IV; InterPro: IPR015184 This domain is predominantly found in the prokaryotic protein quinohemoprotein amine dehydrogenase, adopting an immunoglobulin-like beta-sandwich fold, with seven strands arranged into two beta sheets; the fold is possibly related to the immunoglobulin and/or fibronectin type III superfamilies. The precise function of this domain has not, as yet, been defined []. ; PDB: 1JMZ_A 1JMX_A 1PBY_A 1JJU_A.
Probab=22.45 E-value=1.6e+02 Score=20.78 Aligned_cols=34 Identities=26% Similarity=0.270 Sum_probs=17.2
Q ss_pred cEEEEEEEEE-eCCCCcccceEEEEEEEecCCCCCC
Q psy8617 81 DKFLLKIAAC-DNGTPVSTAYATVAINVLDANDNDP 115 (184)
Q Consensus 81 ~~~~l~v~a~-d~~~p~~~~~~~v~I~v~d~Nd~~P 115 (184)
..=.|.|.|+ ..++.+++....+.|+|..-|+ +|
T Consensus 98 N~Gnl~VvAtv~d~~~~l~~e~~liVtVqr~~~-pp 132 (133)
T PF09100_consen 98 NAGNLKVVATVKDGGKPLTGEAHLIVTVQRWNN-PP 132 (133)
T ss_dssp S-EEEEEEEEETTTT---EEEEEEEEE---S----S
T ss_pred CcccEEEEEEEccCCcccceeEeEEEEeecccC-CC
Confidence 3446777776 3344568888999999988775 44
No 39
>PF00635 Motile_Sperm: MSP (Major sperm protein) domain; InterPro: IPR000535 Major sperm proteins (MSP) are central components in molecular interactions underlying sperm motility in Caenorhabditis elegans, whose sperm employ an amoebae-like crawling motion using a MSP-containing lamellipod, rather than the flagellar-based swimming motion associated with other sperm. These proteins oligomerise to form an extensive filament system that extends from sperm villipoda, along the leading edge of the pseudopod. About 30 MSP isoforms may exist in C. elegans. MSPs form a fibrous network, whereby MSP dimers form helical subfilaments that coil around one another to produce filaments, which in turn form supercoils to produce bundles. The crystal structure of MSP from C. elegans reveals an immunoglobulin (Ig)-like seven-stranded beta sandwich fold []. ; GO: 0005198 structural molecule activity; PDB: 1MSP_A 3MSP_B 2BVU_B 2MSP_C 1Z9O_F 1Z9L_A 3IKK_A 1WIC_A 2CRI_A 2RR3_A ....
Probab=20.64 E-value=1.7e+02 Score=19.07 Aligned_cols=25 Identities=12% Similarity=0.363 Sum_probs=19.1
Q ss_pred ceEEEEEecCC-CcEEEEcCccEEEE
Q psy8617 150 AALRYSLIPSN-TSFQINPFSGKKSI 174 (184)
Q Consensus 150 ~~i~y~i~~~~-~~F~Id~~tG~i~~ 174 (184)
..+.|.+...+ ..|.|.|..|.|.-
T Consensus 32 ~~i~fKiktt~~~~y~v~P~~G~i~p 57 (109)
T PF00635_consen 32 KPIAFKIKTTNPNRYRVKPSYGIIEP 57 (109)
T ss_dssp SEEEEEEEES-TTTEEEESSEEEE-T
T ss_pred CcEEEEEEcCCCceEEecCCCEEECC
Confidence 47889998775 78999999998743
No 40
>KOG4680|consensus
Probab=20.51 E-value=2.7e+02 Score=20.03 Aligned_cols=34 Identities=18% Similarity=0.208 Sum_probs=26.2
Q ss_pred EEEEeCCCCCCcEEEEEEEEeCCCCCCceEEEEE
Q psy8617 123 EFSIEENLPRGTVVGSVKATDEDLASNAALRYSL 156 (184)
Q Consensus 123 ~~~v~E~~~~gt~v~~v~a~D~D~~~n~~i~y~i 156 (184)
...++--.+||+.+.+.+|.|.+...--.+++++
T Consensus 109 sq~LPg~tPPG~Y~lkm~~~d~~~~~LTCisfsf 142 (153)
T KOG4680|consen 109 SQVLPGYTPPGSYVLKMTAYDAKGKELTCISFSF 142 (153)
T ss_pred eEeccCcCCCceEEEEEEeecCCCCEEEEEEEEE
Confidence 4557778899999999999999876555555554
Done!