Query psy8618
Match_columns 232
No_of_seqs 201 out of 1431
Neff 10.0
Searched_HMMs 46136
Date Fri Aug 16 23:25:15 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy8618.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/8618hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG4289|consensus 100.0 9.6E-45 2.1E-49 321.2 21.0 225 1-231 220-457 (2531)
2 KOG4289|consensus 100.0 7.2E-42 1.6E-46 303.1 22.9 222 2-232 326-560 (2531)
3 KOG1219|consensus 100.0 3.4E-37 7.3E-42 281.2 26.0 225 2-231 903-1142(4289)
4 KOG1219|consensus 100.0 1.8E-36 3.8E-41 276.6 24.1 221 2-232 1010-1242(4289)
5 cd00031 CA Cadherin repeat dom 100.0 3.1E-30 6.8E-35 198.2 24.3 171 56-230 2-185 (199)
6 cd00031 CA Cadherin repeat dom 99.9 7.2E-26 1.6E-30 173.9 21.3 147 1-151 53-199 (199)
7 KOG1834|consensus 99.8 2.3E-19 4.9E-24 151.4 17.7 187 37-231 20-231 (952)
8 PF00028 Cadherin: Cadherin do 99.8 3E-19 6.5E-24 120.5 13.7 92 56-150 1-93 (93)
9 smart00112 CA Cadherin repeats 99.8 1.3E-17 2.9E-22 108.9 11.0 79 76-157 1-79 (79)
10 KOG1834|consensus 99.7 7.7E-17 1.7E-21 136.3 15.0 143 2-152 95-245 (952)
11 PF00028 Cadherin: Cadherin do 99.4 4.8E-12 1E-16 85.1 8.5 69 162-231 12-81 (93)
12 smart00112 CA Cadherin repeats 99.2 7.6E-11 1.6E-15 76.7 7.8 60 171-231 1-60 (79)
13 PF08266 Cadherin_2: Cadherin- 98.0 1.3E-05 2.7E-10 52.3 4.2 61 57-121 4-66 (84)
14 PF08758 Cadherin_pro: Cadheri 97.4 0.003 6.4E-08 41.8 8.8 78 47-136 3-80 (90)
15 smart00736 CADG Dystroglycan-t 97.2 0.012 2.5E-07 39.6 10.9 73 75-154 24-96 (97)
16 TIGR01965 VCBS_repeat VCBS rep 96.6 0.035 7.7E-07 37.2 8.9 87 71-170 2-96 (99)
17 PF08266 Cadherin_2: Cadherin- 96.1 0.011 2.4E-07 38.5 4.1 32 182-214 35-66 (84)
18 PF08758 Cadherin_pro: Cadheri 94.9 0.17 3.6E-06 33.5 6.5 69 156-231 3-82 (90)
19 PF13750 Big_3_3: Bacterial Ig 92.9 2.8 6.2E-05 30.8 15.9 129 13-150 14-148 (158)
20 PF05345 He_PIG: Putative Ig d 92.3 1.2 2.6E-05 25.7 6.2 37 97-135 11-48 (49)
21 TIGR00845 caca sodium/calcium 91.6 13 0.00027 35.4 16.1 121 43-170 395-541 (928)
22 smart00736 CADG Dystroglycan-t 89.5 1.6 3.6E-05 29.0 5.9 31 13-45 66-96 (97)
23 TIGR01965 VCBS_repeat VCBS rep 88.2 2.6 5.7E-05 28.3 6.0 48 8-64 51-98 (99)
24 TIGR03660 T1SS_rpt_143 T1SS-14 87.1 4.1 8.8E-05 29.2 6.8 58 2-66 70-130 (137)
25 PF05345 He_PIG: Putative Ig d 85.3 5.2 0.00011 23.0 6.1 37 190-228 11-48 (49)
26 PF13750 Big_3_3: Bacterial Ig 84.4 14 0.0003 27.2 14.0 103 122-229 14-136 (158)
27 TIGR03660 T1SS_rpt_143 T1SS-14 77.6 23 0.0005 25.4 7.6 59 109-173 68-129 (137)
28 PF07495 Y_Y_Y: Y_Y_Y domain; 77.3 13 0.00028 22.3 8.6 59 83-149 7-65 (66)
29 PF07495 Y_Y_Y: Y_Y_Y domain; 72.4 10 0.00022 22.8 4.2 27 14-40 39-65 (66)
30 TIGR00864 PCC polycystin catio 69.0 1.8E+02 0.0039 31.7 21.0 98 119-227 1564-1681(2740)
31 KOG4221|consensus 69.0 1.3E+02 0.0027 29.9 17.3 136 85-226 551-695 (1381)
32 TIGR00845 caca sodium/calcium 67.5 1.2E+02 0.0026 29.2 13.7 56 34-94 515-570 (928)
33 KOG3597|consensus 64.5 97 0.0021 27.0 16.1 60 32-93 24-83 (442)
34 TIGR00864 PCC polycystin catio 54.6 3.3E+02 0.0071 29.9 18.1 31 118-151 1479-1509(2740)
35 smart00089 PKD Repeats in poly 54.1 43 0.00094 20.8 4.8 30 117-149 49-78 (79)
36 smart00089 PKD Repeats in poly 49.2 60 0.0013 20.1 4.9 28 10-40 51-78 (79)
37 PF12245 Big_3_2: Bacterial Ig 48.9 57 0.0012 19.5 5.8 29 13-43 22-50 (60)
38 cd00146 PKD polycystic kidney 48.4 58 0.0013 20.3 4.8 18 10-27 53-70 (81)
39 PF02494 HYR: HYR domain; Int 47.3 52 0.0011 20.8 4.4 24 15-40 58-81 (81)
40 PF03160 Calx-beta: Calx-beta 46.3 83 0.0018 20.6 12.1 52 37-93 2-53 (100)
41 PF10365 DUF2436: Domain of un 40.4 1.2E+02 0.0027 21.7 5.5 49 180-229 108-157 (161)
42 cd02848 Chitinase_N_term Chiti 38.5 1.2E+02 0.0027 20.6 5.1 33 116-150 73-105 (106)
43 PF05895 DUF859: Siphovirus pr 37.1 3.4E+02 0.0073 25.0 13.3 120 15-149 299-433 (624)
44 PF13860 FlgD_ig: FlgD Ig-like 34.5 1.2E+02 0.0026 19.1 4.9 14 215-228 68-81 (81)
45 PF09100 Qn_am_d_aIV: Quinohem 32.1 1.4E+02 0.0031 21.0 4.7 31 17-48 101-132 (133)
46 PF00801 PKD: PKD domain; Int 29.2 1.3E+02 0.0029 18.0 5.2 18 212-229 48-65 (69)
47 PF13754 Big_3_4: Bacterial Ig 26.8 1.4E+02 0.003 17.3 5.0 17 121-137 22-38 (54)
48 COG3354 FlaG Putative archaeal 26.2 2.6E+02 0.0056 20.3 7.9 54 131-187 55-111 (154)
49 PF14302 DUF4377: Domain of un 26.1 1.2E+02 0.0026 19.4 3.4 25 208-232 40-64 (80)
50 PF08329 ChitinaseA_N: Chitina 25.0 1.6E+02 0.0036 21.0 4.2 35 9-45 78-112 (133)
51 PF03413 PepSY: Peptidase prop 24.6 1.4E+02 0.0031 17.2 3.5 14 100-113 49-62 (64)
52 PF13753 SWM_repeat: Putative 23.8 4.1E+02 0.0088 21.7 16.4 107 13-137 11-125 (317)
No 1
>KOG4289|consensus
Probab=100.00 E-value=9.6e-45 Score=321.18 Aligned_cols=225 Identities=36% Similarity=0.615 Sum_probs=214.7
Q ss_pred CeEecCCCCcCCCCeEEEEEEEEeCCCCCceeEEEEEEEEEeCCCCCCeecCCCceEEEEecCCCCCcEEEEEEEEeCCC
Q psy8618 1 EISTREPLDREARSSHELVAEARDQGSPPRSSRVAVKISVTDVNDNAPELVDPQEDVISVREEQPPGTEVVRARAVDKDD 80 (232)
Q Consensus 1 ~i~~~~~lD~E~~~~~~l~v~a~d~~~~~~~~~~~v~v~V~dvNd~~P~f~~~~~~~~~v~E~~~~g~~v~~v~a~D~D~ 80 (232)
.|++++.||||.++.+.|.|.|.|.+.|+++++++|+|.|.|+|||.|+|.+. .|.-++.|+.+.|+.|.++.|+|.|+
T Consensus 220 ~irta~~lDREt~e~HvlrVtA~d~~~P~~SAtttv~V~V~D~nDhsPvFEq~-~Y~e~lREn~evGy~vLtvrAtD~Ds 298 (2531)
T KOG4289|consen 220 AIRTAKSLDRETKETHVLRVTAQDHGDPRRSATTTVTVLVLDTNDHSPVFEQD-EYREELRENLEVGYEVLTVRATDGDS 298 (2531)
T ss_pred cchhhhhhhhhhhheeEEEEEeeecCCCcccceeEEEEEEeecCCCCcccchh-HHHHHHhhccccCceEEEEEeccCCC
Confidence 36789999999999999999999999999999999999999999999999998 78899999999999999999999999
Q ss_pred CCCcEEEEEEeeCCCCCCcccEEEeCCceEEEECccCCccCCCEEEEEEEEEeCCCCCcceeEEEEEEEEecCCCCCeee
Q psy8618 81 GSNATISYSILKGRDTDGYGVFTIDPISGVIRTKSVLDHEDRSIYRIAVAATDNGFPSRQTVRLLRVEVLDLNDNRPTFT 160 (232)
Q Consensus 81 ~~~~~v~y~i~~~~~~~~~~~f~id~~tG~i~~~~~ld~e~~~~~~l~v~a~d~~~~~~~~~~~v~i~v~d~Nd~~P~f~ 160 (232)
++|+.++|+++.| +....|.|++.+|.|++..+||||....|++.|.|.|.|.++...++.|.|+|.|.|||+|+|.
T Consensus 299 p~Nani~Yrl~eg---~~~~~f~in~rSGvI~T~a~lDRE~~~~y~L~VeAsDqG~~pgp~Ta~V~itV~D~NDNaPqFs 375 (2531)
T KOG4289|consen 299 PPNANIRYRLLEG---NAKNVFEINPRSGVISTRAPLDREELESYQLDVEASDQGRPPGPRTAMVEITVEDENDNAPQFS 375 (2531)
T ss_pred CCCCceEEEecCC---CccceeEEcCccceeeccCccCHHhhhheEEEEEeccCCCCCCCceEEEEEEEEecCCCCcccc
Confidence 9999999999988 6788999999999999999999999999999999999999988889999999999999999999
Q ss_pred CCceEEE-------------EEeecCCCCCCeEEEEEEccCCCCCCcEEEECCceEEEEeecCCCccccEEEEEEEEEEC
Q psy8618 161 SSSLLFK-------------VSASDPDCGVNAMVNYTLGESPSRTNHFYMKSVSGEICISQDLDFESRSSYEFPVVATDR 227 (232)
Q Consensus 161 ~~~~~~~-------------v~~~D~d~~~~~~i~y~l~~~~~~~~~f~i~~~~G~i~~~~~ld~e~~~~y~l~V~a~D~ 227 (232)
...|.++ |+++|.|.+.|+.|.|+|.++ +..+.|.|+..+|+|.+..+||+|.. .|.+.|+|.|+
T Consensus 376 e~~Yvvqv~Edvt~~avvlrV~AtDrD~g~Ng~VHYsi~Sg-n~~G~f~id~~tGel~vv~plD~e~~-~ytl~IrAqDg 453 (2531)
T KOG4289|consen 376 EKRYVVQVREDVTPPAVVLRVTATDRDKGTNGKVHYSIASG-NGRGQFYIDSLTGELDVVEPLDFENS-EYTLRIRAQDG 453 (2531)
T ss_pred ccceEEEecccCCCCceEEEEEecccCCCcCceEEEEeecc-CccccEEEecccceEEEeccccccCC-eeEEEEEcccC
Confidence 9998665 556999999999999999998 88999999999999999999999999 89999999999
Q ss_pred CccC
Q psy8618 228 GKET 231 (232)
Q Consensus 228 g~~~ 231 (232)
|.|+
T Consensus 454 grPp 457 (2531)
T KOG4289|consen 454 GRPP 457 (2531)
T ss_pred CCCC
Confidence 9997
No 2
>KOG4289|consensus
Probab=100.00 E-value=7.2e-42 Score=303.08 Aligned_cols=222 Identities=39% Similarity=0.620 Sum_probs=208.8
Q ss_pred eEecCCCCcCCCCeEEEEEEEEeCCCCCceeEEEEEEEEEeCCCCCCeecCCCceEEEEecCCCCCcEEEEEEEEeCCCC
Q psy8618 2 ISTREPLDREARSSHELVAEARDQGSPPRSSRVAVKISVTDVNDNAPELVDPQEDVISVREEQPPGTEVVRARAVDKDDG 81 (232)
Q Consensus 2 i~~~~~lD~E~~~~~~l~v~a~d~~~~~~~~~~~v~v~V~dvNd~~P~f~~~~~~~~~v~E~~~~g~~v~~v~a~D~D~~ 81 (232)
|.+..+||||+...|.|.|.|+|.|.++...++.|.|.|.|+|||+|.|... .|.++|.|+..+++.|.++.|+|.|.|
T Consensus 326 I~T~a~lDRE~~~~y~L~VeAsDqG~~pgp~Ta~V~itV~D~NDNaPqFse~-~Yvvqv~Edvt~~avvlrV~AtDrD~g 404 (2531)
T KOG4289|consen 326 ISTRAPLDREELESYQLDVEASDQGRPPGPRTAMVEITVEDENDNAPQFSEK-RYVVQVREDVTPPAVVLRVTATDRDKG 404 (2531)
T ss_pred eeccCccCHHhhhheEEEEEeccCCCCCCCceEEEEEEEEecCCCCcccccc-ceEEEecccCCCCceEEEEEecccCCC
Confidence 6788999999999999999999999988878999999999999999999998 799999999999999999999999999
Q ss_pred CCcEEEEEEeeCCCCCCcccEEEeCCceEEEECccCCccCCCEEEEEEEEEeCCCCCcceeEEEEEEEEecCCCCCeeeC
Q psy8618 82 SNATISYSILKGRDTDGYGVFTIDPISGVIRTKSVLDHEDRSIYRIAVAATDNGFPSRQTVRLLRVEVLDLNDNRPTFTS 161 (232)
Q Consensus 82 ~~~~v~y~i~~~~~~~~~~~f~id~~tG~i~~~~~ld~e~~~~~~l~v~a~d~~~~~~~~~~~v~i~v~d~Nd~~P~f~~ 161 (232)
.|+.+.|+|.+| +..+-|.||..||+|.+..+||+|.. .|++.|.|.|+|.|+++.+.-+.|+|.|+|||+|.|..
T Consensus 405 ~Ng~VHYsi~Sg---n~~G~f~id~~tGel~vv~plD~e~~-~ytl~IrAqDggrPpLsn~sgl~iqVlDINDhaPifvs 480 (2531)
T KOG4289|consen 405 TNGKVHYSIASG---NGRGQFYIDSLTGELDVVEPLDFENS-EYTLRIRAQDGGRPPLSNTSGLVIQVLDINDHAPIFVS 480 (2531)
T ss_pred cCceEEEEeecc---CccccEEEecccceEEEeccccccCC-eeEEEEEcccCCCCCccCCCceEEEEEecCCCCceeEe
Confidence 999999999999 77789999999999999999999988 99999999999999999998888999999999999987
Q ss_pred Cce-------------EEEEEeecCCCCCCeEEEEEEccCCCCCCcEEEECCceEEEEeecCCCccccEEEEEEEEEECC
Q psy8618 162 SSL-------------LFKVSASDPDCGVNAMVNYTLGESPSRTNHFYMKSVSGEICISQDLDFESRSSYEFPVVATDRG 228 (232)
Q Consensus 162 ~~~-------------~~~v~~~D~d~~~~~~i~y~l~~~~~~~~~f~i~~~~G~i~~~~~ld~e~~~~y~l~V~a~D~g 228 (232)
..+ ...+.+.|.|.+.++.+.|++.+ .++|.|+..+|.|.+.|.||||+...|.|.|+|+|+|
T Consensus 481 tpfq~tvlEnv~lg~~v~~vqaidadsg~na~l~y~laG----~~pf~I~~~SG~Itvtk~ldrEt~~~ysl~V~ard~g 556 (2531)
T KOG4289|consen 481 TPFQATVLENVPLGYLVCHVQAIDADSGENARLHYSLAG----VGPFQINNGSGWITVTKELDRETVEHYSLGVEARDHG 556 (2531)
T ss_pred chhhhhhhhcccccceEEEEecccCCCCcccceeeeecc----CCCeeEecCCceEEEeecccccccceEEEEEEEcCCC
Confidence 654 45677799999999999999976 4599999999999999999999999999999999999
Q ss_pred ccCC
Q psy8618 229 KETQ 232 (232)
Q Consensus 229 ~~~q 232 (232)
.|+.
T Consensus 557 tp~l 560 (2531)
T KOG4289|consen 557 TPPL 560 (2531)
T ss_pred CCcc
Confidence 9973
No 3
>KOG1219|consensus
Probab=100.00 E-value=3.4e-37 Score=281.22 Aligned_cols=225 Identities=34% Similarity=0.485 Sum_probs=211.5
Q ss_pred eEecCCCCcCCCCeEEEEEEEEeCCCCCceeEEEEEEEEEeCCCC--CCeecCCCceEEEEecCCCCCcEEEEEEEEeCC
Q psy8618 2 ISTREPLDREARSSHELVAEARDQGSPPRSSRVAVKISVTDVNDN--APELVDPQEDVISVREEQPPGTEVVRARAVDKD 79 (232)
Q Consensus 2 i~~~~~lD~E~~~~~~l~v~a~d~~~~~~~~~~~v~v~V~dvNd~--~P~f~~~~~~~~~v~E~~~~g~~v~~v~a~D~D 79 (232)
+.+-++||+|+.+.|.|.|.|.|.|.|.+++.+.+.|.|.|+|+| ||.|..- ....+|.||+|.|+.++.+.|.|.|
T Consensus 903 vfi~~~LDf~k~~fynLsv~a~d~g~p~lss~chl~Vevldv~enlhpp~F~~~-v~e~~V~EnapiGT~vi~i~A~ded 981 (4289)
T KOG1219|consen 903 VFIGKPLDFEKSDFYNLSVTAVDRGTPILSSICHLEVEVLDVNENLHPPEFISF-VTEGHVLENAPIGTIVIRIQARDED 981 (4289)
T ss_pred EEEecccccccccceEEEEEEecCCCcceeeeEEEEEEEeccCCCCCCcchhee-eeeeeEeecCCcceEEEEEEEecCC
Confidence 567889999999999999999999999899999999999999887 8999866 5679999999999999999999999
Q ss_pred CCCCcEEEEEEeeCCCCCCcccEEEeCCceEEEECccCCccCCCEEEEEEEEEeCCCCCcceeEEEEEEEEecCCCCCee
Q psy8618 80 DGSNATISYSILKGRDTDGYGVFTIDPISGVIRTKSVLDHEDRSIYRIAVAATDNGFPSRQTVRLLRVEVLDLNDNRPTF 159 (232)
Q Consensus 80 ~~~~~~v~y~i~~~~~~~~~~~f~id~~tG~i~~~~~ld~e~~~~~~l~v~a~d~~~~~~~~~~~v~i~v~d~Nd~~P~f 159 (232)
.|-.+.+.|+|..| +..+.|+|+..+|.|++.+.||||....|.|+|.|.|.|.+++++.+.+.|.|.|+|||+|+|
T Consensus 982 sgldg~l~Y~I~~g---dg~g~FsId~~tG~irTl~~lDrE~ks~YwltveA~D~gt~~~ssv~~vyI~ieDvNDn~Pq~ 1058 (4289)
T KOG1219|consen 982 SGLDGELSYKIRTG---DGDGIFSIDSTTGSIRTLKALDREKKSSYWLTVEAKDLGTVPLSSVCEVYIEIEDVNDNVPQF 1058 (4289)
T ss_pred CCccceEEEEEEcC---CcceeEEecCCcceEeechhhchhhcceEEEEEEEEecCCCccccceeEEEEEEecCCCCccc
Confidence 99999999999988 778899999999999999999999999999999999999999999999999999999999999
Q ss_pred eCCceEE-------------EEEeecCCCCCCeEEEEEEccCCCCCCcEEEECCceEEEEeecCCCccccEEEEEEEEEE
Q psy8618 160 TSSSLLF-------------KVSASDPDCGVNAMVNYTLGESPSRTNHFYMKSVSGEICISQDLDFESRSSYEFPVVATD 226 (232)
Q Consensus 160 ~~~~~~~-------------~v~~~D~d~~~~~~i~y~l~~~~~~~~~f~i~~~~G~i~~~~~ld~e~~~~y~l~V~a~D 226 (232)
..+.|.. ++.+.|+|...++.+.|.|..| +..+.|.|++.+|.|...+.||+|++..+.|.|.++|
T Consensus 1059 s~pvy~asI~enSp~~vsivq~ea~D~Dsssn~kLmykI~sG-nyq~FF~Id~~TG~iTt~r~LDRE~qdEHiLeVTi~D 1137 (4289)
T KOG1219|consen 1059 SSPVYYASISENSPETVSIVQAEANDPDSSSNQKLMYKITSG-NYQGFFQIDPETGLITTIRRLDREKQDEHILEVTIQD 1137 (4289)
T ss_pred CCceEeeeeccCCCCceEEEEeccCCCCcccCcceEEEEccC-CccceEEEccccceeeeehhhcccccccceEEEEEec
Confidence 9888754 4455888988899999999998 8999999999999999999999999999999999999
Q ss_pred CCccC
Q psy8618 227 RGKET 231 (232)
Q Consensus 227 ~g~~~ 231 (232)
+|.|.
T Consensus 1138 ~gep~ 1142 (4289)
T KOG1219|consen 1138 NGEPW 1142 (4289)
T ss_pred CCCCc
Confidence 99996
No 4
>KOG1219|consensus
Probab=100.00 E-value=1.8e-36 Score=276.60 Aligned_cols=221 Identities=36% Similarity=0.533 Sum_probs=201.6
Q ss_pred eEecCCCCcCCCCeEEEEEEEEeCCCCCceeEEEEEEEEEeCCCCCCeecCCCceEEEEecCCCCCcEEEEEEEEeCCCC
Q psy8618 2 ISTREPLDREARSSHELVAEARDQGSPPRSSRVAVKISVTDVNDNAPELVDPQEDVISVREEQPPGTEVVRARAVDKDDG 81 (232)
Q Consensus 2 i~~~~~lD~E~~~~~~l~v~a~d~~~~~~~~~~~v~v~V~dvNd~~P~f~~~~~~~~~v~E~~~~g~~v~~v~a~D~D~~ 81 (232)
|.+.++||||.+..|-|++.|+|.|.+++++.+.+.|.|.|+|||+|.|..+ .|..+|.|++|.+..|.++.|.|+|..
T Consensus 1010 irTl~~lDrE~ks~YwltveA~D~gt~~~ssv~~vyI~ieDvNDn~Pq~s~p-vy~asI~enSp~~vsivq~ea~D~Dss 1088 (4289)
T KOG1219|consen 1010 IRTLKALDREKKSSYWLTVEAKDLGTVPLSSVCEVYIEIEDVNDNVPQFSSP-VYYASISENSPETVSIVQAEANDPDSS 1088 (4289)
T ss_pred EeechhhchhhcceEEEEEEEEecCCCccccceeEEEEEEecCCCCcccCCc-eEeeeeccCCCCceEEEEeccCCCCcc
Confidence 6788999999999999999999999999999999999999999999999998 889999999999999999999999988
Q ss_pred CCcEEEEEEeeCCCCCCcccEEEeCCceEEEECccCCccCCCEEEEEEEEEeCCCCCcceeEEEEEEEEecCCCCCeeeC
Q psy8618 82 SNATISYSILKGRDTDGYGVFTIDPISGVIRTKSVLDHEDRSIYRIAVAATDNGFPSRQTVRLLRVEVLDLNDNRPTFTS 161 (232)
Q Consensus 82 ~~~~v~y~i~~~~~~~~~~~f~id~~tG~i~~~~~ld~e~~~~~~l~v~a~d~~~~~~~~~~~v~i~v~d~Nd~~P~f~~ 161 (232)
.++++.|.|.+| +..++|.|++.||.|+..+.||||.+..|.|.|+++|.|.|.+++...|.|.|.|+|||+|+|.+
T Consensus 1089 sn~kLmykI~sG---nyq~FF~Id~~TG~iTt~r~LDRE~qdEHiLeVTi~D~gep~l~s~~rviV~IldvNdnsp~Flq 1165 (4289)
T KOG1219|consen 1089 SNQKLMYKITSG---NYQGFFQIDPETGLITTIRRLDREKQDEHILEVTIQDNGEPWLCSNQRVIVSILDVNDNSPRFLQ 1165 (4289)
T ss_pred cCcceEEEEccC---CccceEEEccccceeeeehhhcccccccceEEEEEecCCCCccccceEEEEEEeeccCCchhhhh
Confidence 899999999999 78899999999999999999999999999999999999999999999999999999999999998
Q ss_pred CceEEEEE-----------eecCCCCCCeEEEEEEccCCCCCCcEEEECCceEEEEeecCCCccccEEEEE-EEEEECCc
Q psy8618 162 SSLLFKVS-----------ASDPDCGVNAMVNYTLGESPSRTNHFYMKSVSGEICISQDLDFESRSSYEFP-VVATDRGK 229 (232)
Q Consensus 162 ~~~~~~v~-----------~~D~d~~~~~~i~y~l~~~~~~~~~f~i~~~~G~i~~~~~ld~e~~~~y~l~-V~a~D~g~ 229 (232)
..+.++++ +.|.|.+++++|+|+|..| + ..|.|+..+|.+.. +.++ ..+.|.+. |.|+|+|.
T Consensus 1166 k~~~~~v~~r~s~plyRl~a~d~DeG~narityniedg-d--e~FsID~~t~vVss-s~~~--~~~eydi~~Ikatd~g~ 1239 (4289)
T KOG1219|consen 1166 KKTFLRVPERSSPPLYRLAAQDNDEGNNARITYNIEDG-D--EVFSIDIATGVVSS-STLD--PAGEYDILGIKATDRGA 1239 (4289)
T ss_pred heeEEEeeeccCCceeEEEEEecCCCcceEEEEecccC-c--eEEEEeccCceEEe-eecc--CCcceeEeeEEEecCCC
Confidence 88766544 5899999999999999976 3 33999999995544 4454 34468887 99999999
Q ss_pred cCC
Q psy8618 230 ETQ 232 (232)
Q Consensus 230 ~~q 232 (232)
|.|
T Consensus 1240 pq~ 1242 (4289)
T KOG1219|consen 1240 PQA 1242 (4289)
T ss_pred Ccc
Confidence 964
No 5
>cd00031 CA Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here
Probab=100.00 E-value=3.1e-30 Score=198.18 Aligned_cols=171 Identities=42% Similarity=0.640 Sum_probs=153.7
Q ss_pred eEEEEecCCCCCcEEEEEEEEeCCCCCCcEEEEEEeeCCCCCCcccEEEeCCceEEEECccCCccCCCEEEEEEEEEeCC
Q psy8618 56 DVISVREEQPPGTEVVRARAVDKDDGSNATISYSILKGRDTDGYGVFTIDPISGVIRTKSVLDHEDRSIYRIAVAATDNG 135 (232)
Q Consensus 56 ~~~~v~E~~~~g~~v~~v~a~D~D~~~~~~v~y~i~~~~~~~~~~~f~id~~tG~i~~~~~ld~e~~~~~~l~v~a~d~~ 135 (232)
|.+.++|+.+.|+.++++.|.|+|.+.++.++|+|.++ ....+|.|++.+|.|++.+.||||....|.|.|.|+|.+
T Consensus 2 ~~~~i~En~~~g~~v~~~~a~D~D~~~~~~~~y~i~~~---~~~~~F~i~~~tG~l~~~~~lD~e~~~~~~l~v~a~D~g 78 (199)
T cd00031 2 YSVSVPENAPPGTVVGTVSATDPDSGENGRVTYSILGG---NEDGLFSIDPNTGVITTTKPLDREEQSEYTLTVVASDGG 78 (199)
T ss_pred eEEEEeCCCCCCCEEEEEEEECCCCCCCceEEEEEeCC---CCcccEEEeCCCCEEEECCCCCCcCCceEEEEEEEEECC
Confidence 57899999999999999999999988888999999876 333799999999999999999999999999999999988
Q ss_pred CCCcceeEEEEEEEEecCCCCCeeeCCce-------------EEEEEeecCCCCCCeEEEEEEccCCCCCCcEEEECCce
Q psy8618 136 FPSRQTVRLLRVEVLDLNDNRPTFTSSSL-------------LFKVSASDPDCGVNAMVNYTLGESPSRTNHFYMKSVSG 202 (232)
Q Consensus 136 ~~~~~~~~~v~i~v~d~Nd~~P~f~~~~~-------------~~~v~~~D~d~~~~~~i~y~l~~~~~~~~~f~i~~~~G 202 (232)
.+.+++...+.|.|.|+||++|.|....| .+++.+.|+|.+.++.++|+|..+ .....|.|++.+|
T Consensus 79 ~~~~~~~~~v~I~V~d~Nd~~P~~~~~~~~~~v~e~~~~~~~i~~~~a~D~D~~~~~~~~y~l~~~-~~~~~f~i~~~~G 157 (199)
T cd00031 79 GPPLSSTATVTVTVLDVNDNPPVFEQSSYEASVPENAPPGTVVGTVTATDADSGENAKLTYSILSG-NDKELFSIDPNTG 157 (199)
T ss_pred cCcceeEEEEEEEEccCCCCCCcccccceEEEEeCCCCCCCEEEEEEEEcCCCCCCccEEEEEeCC-CCCCEEEEeCCce
Confidence 88777889999999999999999985554 445666899988889999999985 3358999999999
Q ss_pred EEEEeecCCCccccEEEEEEEEEECCcc
Q psy8618 203 EICISQDLDFESRSSYEFPVVATDRGKE 230 (232)
Q Consensus 203 ~i~~~~~ld~e~~~~y~l~V~a~D~g~~ 230 (232)
.|.+.+.||+|....|.|.|.|+|+|.|
T Consensus 158 ~i~~~~~ld~e~~~~~~l~v~a~D~~~~ 185 (199)
T cd00031 158 IITLAKPLDREEKSSYELTVVATDGGGP 185 (199)
T ss_pred EEEeCCccCCccCceEEEEEEEEECCCC
Confidence 9999999999999999999999999854
No 6
>cd00031 CA Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here
Probab=99.95 E-value=7.2e-26 Score=173.92 Aligned_cols=147 Identities=44% Similarity=0.651 Sum_probs=134.5
Q ss_pred CeEecCCCCcCCCCeEEEEEEEEeCCCCCceeEEEEEEEEEeCCCCCCeecCCCceEEEEecCCCCCcEEEEEEEEeCCC
Q psy8618 1 EISTREPLDREARSSHELVAEARDQGSPPRSSRVAVKISVTDVNDNAPELVDPQEDVISVREEQPPGTEVVRARAVDKDD 80 (232)
Q Consensus 1 ~i~~~~~lD~E~~~~~~l~v~a~d~~~~~~~~~~~v~v~V~dvNd~~P~f~~~~~~~~~v~E~~~~g~~v~~v~a~D~D~ 80 (232)
+|.+.+.||||....|.|.|.|.|.|.+..++...++|.|.|+|||+|.|... .|.+.+.|+.+.|+.++++.|+|+|.
T Consensus 53 ~l~~~~~lD~e~~~~~~l~v~a~D~g~~~~~~~~~v~I~V~d~Nd~~P~~~~~-~~~~~v~e~~~~~~~i~~~~a~D~D~ 131 (199)
T cd00031 53 VITTTKPLDREEQSEYTLTVVASDGGGPPLSSTATVTVTVLDVNDNPPVFEQS-SYEASVPENAPPGTVVGTVTATDADS 131 (199)
T ss_pred EEEECCCCCCcCCceEEEEEEEEECCcCcceeEEEEEEEEccCCCCCCccccc-ceEEEEeCCCCCCCEEEEEEEEcCCC
Confidence 37788999999999999999999988877779999999999999999999976 68899999999999999999999998
Q ss_pred CCCcEEEEEEeeCCCCCCcccEEEeCCceEEEECccCCccCCCEEEEEEEEEeCCCCCcceeEEEEEEEEe
Q psy8618 81 GSNATISYSILKGRDTDGYGVFTIDPISGVIRTKSVLDHEDRSIYRIAVAATDNGFPSRQTVRLLRVEVLD 151 (232)
Q Consensus 81 ~~~~~v~y~i~~~~~~~~~~~f~id~~tG~i~~~~~ld~e~~~~~~l~v~a~d~~~~~~~~~~~v~i~v~d 151 (232)
+.++.++|+|.++ ....+|.|++.+|.|++.+.||+|....|.+.|.|+|.+.+.+++.+.+.|.|.|
T Consensus 132 ~~~~~~~y~l~~~---~~~~~f~i~~~~G~i~~~~~ld~e~~~~~~l~v~a~D~~~~~~~~~~~i~i~v~d 199 (199)
T cd00031 132 GENAKLTYSILSG---NDKELFSIDPNTGIITLAKPLDREEKSSYELTVVATDGGGPPLSSTATVTVTVLD 199 (199)
T ss_pred CCCccEEEEEeCC---CCCCEEEEeCCceEEEeCCccCCccCceEEEEEEEEECCCCCceeEEEEEEEEEC
Confidence 8889999999766 3357999999999999999999999999999999999988778888888888865
No 7
>KOG1834|consensus
Probab=99.84 E-value=2.3e-19 Score=151.42 Aligned_cols=187 Identities=22% Similarity=0.329 Sum_probs=151.1
Q ss_pred EEEEEeCCCCCCeecCCCceEEEEecCCCCCcEEEEEEEEeCCCC---CCcEEEEEEeeCCCCCCcccEEEeCCc--eEE
Q psy8618 37 KISVTDVNDNAPELVDPQEDVISVREEQPPGTEVVRARAVDKDDG---SNATISYSILKGRDTDGYGVFTIDPIS--GVI 111 (232)
Q Consensus 37 ~v~V~dvNd~~P~f~~~~~~~~~v~E~~~~g~~v~~v~a~D~D~~---~~~~v~y~i~~~~~~~~~~~f~id~~t--G~i 111 (232)
.....-+|-+.|+... .|...|.||...-.....+.|.|.|.+ .+..+.|.|.+. + -......+|..| |.|
T Consensus 20 ~~~aarankhkpwie~--ey~gvV~Endntvll~Ppl~aLdkdaplr~ageiC~fklhgq-~-vPFdavVvdK~TGegvl 95 (952)
T KOG1834|consen 20 HHHAARANKHKPWIEE--EYHGVVTENDNTVLLDPPLAALDKDAPLRYAGEICGFKLHGQ-P-VPFDAVVVDKYTGEGVL 95 (952)
T ss_pred ccccccccccCccccc--ceeEEEEeCCceEEeCCCeeeecCCCCcccccccceeEecCC-C-CCceEEEEeccCCceEE
Confidence 3445667888997654 578999999986666677899999874 234467777543 2 233444557666 589
Q ss_pred EECccCCccCCCEEEEEEEEEeCCCCC------cceeEEEEEEEEecCCCCCeeeCCceEE------------EEEeecC
Q psy8618 112 RTKSVLDHEDRSIYRIAVAATDNGFPS------RQTVRLLRVEVLDLNDNRPTFTSSSLLF------------KVSASDP 173 (232)
Q Consensus 112 ~~~~~ld~e~~~~~~l~v~a~d~~~~~------~~~~~~v~i~v~d~Nd~~P~f~~~~~~~------------~v~~~D~ 173 (232)
+.+.+||+|.+..|+|+|+|.|+|..+ ++..++|.|+|.|+|+++|.|..+.|.+ .|.+.|.
T Consensus 96 RaK~~lDCelqkeytf~iQAydCg~gpdgtn~kKShkatvhIrVkDvNe~AP~f~ep~Yka~V~EGK~yd~il~veAiD~ 175 (952)
T KOG1834|consen 96 RAKEPLDCELQKEYTFTIQAYDCGNGPDGTNTKKSHKATVHIRVKDVNEFAPVFKEPWYKAHVTEGKVYDSILRVEAIDK 175 (952)
T ss_pred eecCcccccccccceEEEEEEecCCCCCccccccccceEEEEEeccccccCchhcccceeeEEecceeeeeeEEEEeecC
Confidence 999999999999999999999987643 5677899999999999999999988855 4566899
Q ss_pred CCCC--CeEEEEEEccCCCCCCcEEEECCceEEEEeecCCCccccEEEEEEEEEECCccC
Q psy8618 174 DCGV--NAMVNYTLGESPSRTNHFYMKSVSGEICISQDLDFESRSSYEFPVVATDRGKET 231 (232)
Q Consensus 174 d~~~--~~~i~y~l~~~~~~~~~f~i~~~~G~i~~~~~ld~e~~~~y~l~V~a~D~g~~~ 231 (232)
|+++ +..+.|.|. ...-+|.|+ +.|.|+.+.+|+|.....|.|+|+|.|+|+.+
T Consensus 176 DCspq~sqIC~YEI~---t~d~PFaId-n~G~irnTekLny~ke~~Y~ltVtAyDCg~kr 231 (952)
T KOG1834|consen 176 DCSPQYSQICEYEIT---TPDVPFAID-NDGNIRNTEKLNYTKEHQYKLTVTAYDCGKKR 231 (952)
T ss_pred CCCCcccceeEEEec---CCCCceEEc-CCCccccccccccccceeEEEEEEEEeccccc
Confidence 9987 466799998 468899999 89999999999999999999999999999753
No 8
>PF00028 Cadherin: Cadherin domain; InterPro: IPR002126 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion.; GO: 0005509 calcium ion binding, 0007156 homophilic cell adhesion, 0016020 membrane; PDB: 2A4E_A 2A4C_B 2O72_A 2QVI_A 1NCJ_A 3Q2W_A 3Q2N_A 3LNH_B 3LNI_A 3Q2L_A ....
Probab=99.83 E-value=3e-19 Score=120.48 Aligned_cols=92 Identities=43% Similarity=0.668 Sum_probs=86.9
Q ss_pred eEEEEecCCCCCcEEEEEEEEeCCCCCCcEEEEEEeeCCCCCCcccEEEeCCceEEEECccCCccCCCEEEEEEEEEeC-
Q psy8618 56 DVISVREEQPPGTEVVRARAVDKDDGSNATISYSILKGRDTDGYGVFTIDPISGVIRTKSVLDHEDRSIYRIAVAATDN- 134 (232)
Q Consensus 56 ~~~~v~E~~~~g~~v~~v~a~D~D~~~~~~v~y~i~~~~~~~~~~~f~id~~tG~i~~~~~ld~e~~~~~~l~v~a~d~- 134 (232)
|.+.|+|+.++|+.++++.|.|+|.+.++.+.|+|..+ +..++|.|++.+|.|++.+.||||....|.|.|.|+|.
T Consensus 1 Y~~~v~E~~~~g~~v~~v~a~D~D~~~n~~i~y~i~~~---~~~~~F~I~~~tg~i~~~~~LD~E~~~~y~l~v~a~D~~ 77 (93)
T PF00028_consen 1 YSFSVPENAPPGTVVGQVTATDPDSGPNSQITYSILGG---NPDGLFSIDPNTGEISLKKPLDRETQSSYQLTVRATDSG 77 (93)
T ss_dssp EEEEEETTGSTSSEEEEEEEEESSTSTTSSEEEEEEET---TSTTSEEEETTTTEEEESSSSCTTTTSEEEEEEEEEETT
T ss_pred CEEEEECCCCCCCEEEEEEEEeCCCCCCceEEEEEecC---cccCceEEeeeeeccccceecCcccCCEEEEEEEEEECC
Confidence 67999999999999999999999999999999999987 55889999999999999999999999999999999999
Q ss_pred CCCCcceeEEEEEEEE
Q psy8618 135 GFPSRQTVRLLRVEVL 150 (232)
Q Consensus 135 ~~~~~~~~~~v~i~v~ 150 (232)
+.|++++.+.|.|+|.
T Consensus 78 ~~~~~~~~~~V~I~V~ 93 (93)
T PF00028_consen 78 GSPPLSSTATVTINVL 93 (93)
T ss_dssp TSSEEEEEEEEEEEEE
T ss_pred CCCCCEEEEEEEEEEC
Confidence 8889999999999874
No 9
>smart00112 CA Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
Probab=99.76 E-value=1.3e-17 Score=108.90 Aligned_cols=79 Identities=44% Similarity=0.736 Sum_probs=72.6
Q ss_pred EeCCCCCCcEEEEEEeeCCCCCCcccEEEeCCceEEEECccCCccCCCEEEEEEEEEeCCCCCcceeEEEEEEEEecCCC
Q psy8618 76 VDKDDGSNATISYSILKGRDTDGYGVFTIDPISGVIRTKSVLDHEDRSIYRIAVAATDNGFPSRQTVRLLRVEVLDLNDN 155 (232)
Q Consensus 76 ~D~D~~~~~~v~y~i~~~~~~~~~~~f~id~~tG~i~~~~~ld~e~~~~~~l~v~a~d~~~~~~~~~~~v~i~v~d~Nd~ 155 (232)
+|+|.+.++.++|+|.++ ....+|.|++.+|.|++.++||||....|.|.|.|.|.+.|++++.+.|.|+|.|+|||
T Consensus 1 ~D~D~g~n~~i~Y~i~~~---~~~~~F~i~~~tg~i~~~~~LD~e~~~~y~l~v~a~D~~~~~~~~~~~v~I~V~D~Nd~ 77 (79)
T smart00112 1 TDADSGENGKVTYSILSG---NEDGLFSIDPETGEITTTKPLDREEQPEYTLTVEATDGGGPPLSSTATVTVTVLDVNDN 77 (79)
T ss_pred CCCCCCcCcEEEEEEecC---CCCCEEEEeCCccEEEeCCccCeeCCCeEEEEEEEEECCCCCcccEEEEEEEEEECCCC
Confidence 478888889999999876 33389999999999999999999999999999999999998899999999999999999
Q ss_pred CC
Q psy8618 156 RP 157 (232)
Q Consensus 156 ~P 157 (232)
+|
T Consensus 78 ~P 79 (79)
T smart00112 78 AP 79 (79)
T ss_pred CC
Confidence 98
No 10
>KOG1834|consensus
Probab=99.74 E-value=7.7e-17 Score=136.35 Aligned_cols=143 Identities=29% Similarity=0.389 Sum_probs=121.7
Q ss_pred eEecCCCCcCCCCeEEEEEEEEeCCCC------CceeEEEEEEEEEeCCCCCCeecCCCceEEEEecCCCCCcEEEEEEE
Q psy8618 2 ISTREPLDREARSSHELVAEARDQGSP------PRSSRVAVKISVTDVNDNAPELVDPQEDVISVREEQPPGTEVVRARA 75 (232)
Q Consensus 2 i~~~~~lD~E~~~~~~l~v~a~d~~~~------~~~~~~~v~v~V~dvNd~~P~f~~~~~~~~~v~E~~~~g~~v~~v~a 75 (232)
|+.+.+||-|.++.|+|+|+|.|-|.. ..+-.++|+|+|.|+|+++|+|..+ .|.+.|.|.-.. ..|+++.|
T Consensus 95 lRaK~~lDCelqkeytf~iQAydCg~gpdgtn~kKShkatvhIrVkDvNe~AP~f~ep-~Yka~V~EGK~y-d~il~veA 172 (952)
T KOG1834|consen 95 LRAKEPLDCELQKEYTFTIQAYDCGNGPDGTNTKKSHKATVHIRVKDVNEFAPVFKEP-WYKAHVTEGKVY-DSILRVEA 172 (952)
T ss_pred EeecCcccccccccceEEEEEEecCCCCCccccccccceEEEEEeccccccCchhccc-ceeeEEecceee-eeeEEEEe
Confidence 678899999999999999999996543 3567899999999999999999998 788999998654 55899999
Q ss_pred EeCCCCC--CcEEEEEEeeCCCCCCcccEEEeCCceEEEECccCCccCCCEEEEEEEEEeCCCCCcceeEEEEEEEEec
Q psy8618 76 VDKDDGS--NATISYSILKGRDTDGYGVFTIDPISGVIRTKSVLDHEDRSIYRIAVAATDNGFPSRQTVRLLRVEVLDL 152 (232)
Q Consensus 76 ~D~D~~~--~~~v~y~i~~~~~~~~~~~f~id~~tG~i~~~~~ld~e~~~~~~l~v~a~d~~~~~~~~~~~v~i~v~d~ 152 (232)
.|.|-++ ...+.|.|+. ..-.|.||. .|.|+.+.+|.|.....|.|+|+|.|+|.....+.+.|+|+|...
T Consensus 173 iD~DCspq~sqIC~YEI~t-----~d~PFaIdn-~G~irnTekLny~ke~~Y~ltVtAyDCg~kraa~d~lV~v~Vkp~ 245 (952)
T KOG1834|consen 173 IDKDCSPQYSQICEYEITT-----PDVPFAIDN-DGNIRNTEKLNYTKEHQYKLTVTAYDCGKKRAASDSLVTVHVKPT 245 (952)
T ss_pred ecCCCCCcccceeEEEecC-----CCCceEEcC-CCccccccccccccceeEEEEEEEEecccccccCcceEEEEecCc
Confidence 9999754 4557889974 455699997 799999999999999999999999999986655657788888654
No 11
>PF00028 Cadherin: Cadherin domain; InterPro: IPR002126 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion.; GO: 0005509 calcium ion binding, 0007156 homophilic cell adhesion, 0016020 membrane; PDB: 2A4E_A 2A4C_B 2O72_A 2QVI_A 1NCJ_A 3Q2W_A 3Q2N_A 3LNH_B 3LNI_A 3Q2L_A ....
Probab=99.36 E-value=4.8e-12 Score=85.10 Aligned_cols=69 Identities=33% Similarity=0.565 Sum_probs=63.3
Q ss_pred CceEEEEEeecCCCCCCeEEEEEEccCCCCCCcEEEECCceEEEEeecCCCccccEEEEEEEEEEC-CccC
Q psy8618 162 SSLLFKVSASDPDCGVNAMVNYTLGESPSRTNHFYMKSVSGEICISQDLDFESRSSYEFPVVATDR-GKET 231 (232)
Q Consensus 162 ~~~~~~v~~~D~d~~~~~~i~y~l~~~~~~~~~f~i~~~~G~i~~~~~ld~e~~~~y~l~V~a~D~-g~~~ 231 (232)
....+.+.+.|+|.+.++.+.|+|..+ +....|.|++.+|.|.+.+.||||..+.|.|.|.|+|. |.|+
T Consensus 12 g~~v~~v~a~D~D~~~n~~i~y~i~~~-~~~~~F~I~~~tg~i~~~~~LD~E~~~~y~l~v~a~D~~~~~~ 81 (93)
T PF00028_consen 12 GTVVGQVTATDPDSGPNSQITYSILGG-NPDGLFSIDPNTGEISLKKPLDRETQSSYQLTVRATDSGGSPP 81 (93)
T ss_dssp SSEEEEEEEEESSTSTTSSEEEEEEET-TSTTSEEEETTTTEEEESSSSCTTTTSEEEEEEEEEETTTSSE
T ss_pred CCEEEEEEEEeCCCCCCceEEEEEecC-cccCceEEeeeeeccccceecCcccCCEEEEEEEEEECCCCCC
Confidence 456788999999999999999999987 55899999999999999999999999999999999999 7775
No 12
>smart00112 CA Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
Probab=99.21 E-value=7.6e-11 Score=76.73 Aligned_cols=60 Identities=32% Similarity=0.505 Sum_probs=53.6
Q ss_pred ecCCCCCCeEEEEEEccCCCCCCcEEEECCceEEEEeecCCCccccEEEEEEEEEECCccC
Q psy8618 171 SDPDCGVNAMVNYTLGESPSRTNHFYMKSVSGEICISQDLDFESRSSYEFPVVATDRGKET 231 (232)
Q Consensus 171 ~D~d~~~~~~i~y~l~~~~~~~~~f~i~~~~G~i~~~~~ld~e~~~~y~l~V~a~D~g~~~ 231 (232)
+|+|.+.++.+.|+|..+ .....|.|++.+|.|.+.++||||....|.|.|.|+|+|.|+
T Consensus 1 ~D~D~g~n~~i~Y~i~~~-~~~~~F~i~~~tg~i~~~~~LD~e~~~~y~l~v~a~D~~~~~ 60 (79)
T smart00112 1 TDADSGENGKVTYSILSG-NEDGLFSIDPETGEITTTKPLDREEQPEYTLTVEATDGGGPP 60 (79)
T ss_pred CCCCCCcCcEEEEEEecC-CCCCEEEEeCCccEEEeCCccCeeCCCeEEEEEEEEECCCCC
Confidence 377888889999999976 334899999999999999999999999999999999999864
No 13
>PF08266 Cadherin_2: Cadherin-like; InterPro: IPR013164 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion. This entry represents a cadherin domain that is usually found at the N terminus of cadherin proteins.; PDB: 1WUZ_A 1WYJ_A.
Probab=97.96 E-value=1.3e-05 Score=52.34 Aligned_cols=61 Identities=25% Similarity=0.357 Sum_probs=39.5
Q ss_pred EEEEecCCCCCcEEEEEEEEeCCCCCC--cEEEEEEeeCCCCCCcccEEEeCCceEEEECccCCccC
Q psy8618 57 VISVREEQPPGTEVVRARAVDKDDGSN--ATISYSILKGRDTDGYGVFTIDPISGVIRTKSVLDHED 121 (232)
Q Consensus 57 ~~~v~E~~~~g~~v~~v~a~D~D~~~~--~~v~y~i~~~~~~~~~~~f~id~~tG~i~~~~~ld~e~ 121 (232)
...|+|+.++|+.|+.+ |.|...... ..-.|++++. ....+|.++..+|.|.++..+|||.
T Consensus 4 ~YsV~EE~~~Gt~IGni-a~dL~l~~~~l~~~~~ri~s~---~~~~~~~v~~~tG~L~v~~rIDRE~ 66 (84)
T PF08266_consen 4 RYSVPEEMPPGTVIGNI-AKDLGLDPQSLSSRNFRIVSE---GNSQYFRVNEKTGDLFVSERIDREE 66 (84)
T ss_dssp EEEEESS--TT-EEEEC-CCCCT--HHHHCCTTBEEE-S---SSS-SEEE-TTTSEEEESS--SCCC
T ss_pred EEEeecCCCCCCEEEEh-HHhhCCCcccccccceEEeec---CCcceeEecCCceeEEeCCccCHHH
Confidence 57899999999999998 545433210 1135777665 5678999999999999999999996
No 14
>PF08758 Cadherin_pro: Cadherin prodomain like; InterPro: IPR014868 Cadherins are a group of proteins that mediate calcium dependent cell-cell adhesion. They are activated through cleavage of a prosequence in the late Golgi. This protein corresponds to the folded region of the prosequence, and is termed the prodomain. The prodomain shows structural resemblance to the cadherin domain, but lacks all the features known to be important for cadherin-cadherin interactions []. ; GO: 0007155 cell adhesion, 0016021 integral to membrane; PDB: 1OP4_A.
Probab=97.36 E-value=0.003 Score=41.76 Aligned_cols=78 Identities=13% Similarity=0.123 Sum_probs=41.4
Q ss_pred CCeecCCCceEEEEecCCCCCcEEEEEEEEeCCCCCCcEEEEEEeeCCCCCCcccEEEeCCceEEEECccCCccCCCEEE
Q psy8618 47 APELVDPQEDVISVREEQPPGTEVVRARAVDKDDGSNATISYSILKGRDTDGYGVFTIDPISGVIRTKSVLDHEDRSIYR 126 (232)
Q Consensus 47 ~P~f~~~~~~~~~v~E~~~~g~~v~~v~a~D~D~~~~~~v~y~i~~~~~~~~~~~f~id~~tG~i~~~~~ld~e~~~~~~ 126 (232)
.|-|... .|.+.|+.+...|..|++|.-.|.. .+..+.|.-. +. .|.|.+ .|.|.+++++... ...-.
T Consensus 3 ~pGF~~~-~~~~~Vp~~l~~g~~lg~V~f~dC~--~~~~~~~~ss------Dp-dF~V~~-DGsVy~~r~v~l~-~~~~~ 70 (90)
T PF08758_consen 3 RPGFSQK-KYTFEVPSNLEAGQPLGKVNFEDCT--GRRRVIFESS------DP-DFRVLE-DGSVYAKRPVQLS-SEQRS 70 (90)
T ss_dssp --B--S--EEEE----SS-SS--EEE---B--S--S---EEEE---------S-EEEEET-TTEEEEES--S-S-SS-EE
T ss_pred cCCcccc-eEEEEcCchhhCCcEEEEEEeccCC--CCCceEEecC------CC-CEEEcC-CCeEEEeeeEecC-CCceE
Confidence 4778877 7889999999999999999999885 3445777652 22 699998 7999999998654 33457
Q ss_pred EEEEEEeCCC
Q psy8618 127 IAVAATDNGF 136 (232)
Q Consensus 127 l~v~a~d~~~ 136 (232)
|.|.|.|...
T Consensus 71 F~V~a~D~~~ 80 (90)
T PF08758_consen 71 FTVHAWDSQT 80 (90)
T ss_dssp EEEEEEETTT
T ss_pred EEEEEECCCC
Confidence 9999999865
No 15
>smart00736 CADG Dystroglycan-type cadherin-like domains. Cadherin-homologous domains present in metazoan dystroglycans and alpha/epsilon sarcoglycans, yeast Axl2p and in a very large protein from magnetotactic bacteria. Likely to bind calcium ions.
Probab=97.24 E-value=0.012 Score=39.59 Aligned_cols=73 Identities=21% Similarity=0.260 Sum_probs=52.7
Q ss_pred EEeCCCCCCcEEEEEEeeCCCCCCcccEEEeCCceEEEECccCCccCCCEEEEEEEEEeCCCCCcceeEEEEEEEEecCC
Q psy8618 75 AVDKDDGSNATISYSILKGRDTDGYGVFTIDPISGVIRTKSVLDHEDRSIYRIAVAATDNGFPSRQTVRLLRVEVLDLND 154 (232)
Q Consensus 75 a~D~D~~~~~~v~y~i~~~~~~~~~~~f~id~~tG~i~~~~~ld~e~~~~~~l~v~a~d~~~~~~~~~~~v~i~v~d~Nd 154 (232)
..|.| +..++|++..........|...++.++.+.- .+. .+..+.|.+.|.|+|+.. .+....+.|.|.+.|+
T Consensus 24 F~d~d---~~~lty~~~~~~~~~lP~Wl~fd~~~~~~~G-tP~-~~~~g~~~i~v~a~D~~g--~~~~~~f~i~V~~~~~ 96 (97)
T smart00736 24 FTDAD---GDTLTYSATLSDGSALPSWLSFDSDTGTLSG-TPT-NSDVGSLSLKVTATDSSG--ASASDTFTITVVNTND 96 (97)
T ss_pred eECCC---CCeEEEEEEeCCCCCCCCeEEEeCCCCEEEE-ECC-CCCCcEEEEEEEEEECCC--CEEEEEEEEEEeCCCC
Confidence 46665 4468998864432244679999998888776 344 334577999999999864 5566779999988876
No 16
>TIGR01965 VCBS_repeat VCBS repeat. This domain of about 100 residues is found multiple (up to 35) copies in long proteins from several species of Vibrio, Colwellia, Bradyrhizobium, and Shewanella (hence the name VCBS) and in smaller copy numbers in proteins from several other bacteria. The large protein size and repeat copy numbers, species distribution, and suggested activities of several member proteins suggests a role for this domain in adhesion.
Probab=96.62 E-value=0.035 Score=37.23 Aligned_cols=87 Identities=18% Similarity=0.259 Sum_probs=55.6
Q ss_pred EEEEEEeCCCCCCcEEEEEEeeCCCCCCcccEEEeCCceEEEECc--------cCCccCCCEEEEEEEEEeCCCCCccee
Q psy8618 71 VRARAVDKDDGSNATISYSILKGRDTDGYGVFTIDPISGVIRTKS--------VLDHEDRSIYRIAVAATDNGFPSRQTV 142 (232)
Q Consensus 71 ~~v~a~D~D~~~~~~v~y~i~~~~~~~~~~~f~id~~tG~i~~~~--------~ld~e~~~~~~l~v~a~d~~~~~~~~~ 142 (232)
+++.++|+|.+.. ..++.... ....+.|.|++ +|.....- .|...+.-.-+|++.+.|+ ..
T Consensus 2 G~Lt~sD~D~gd~--~~~s~~~~--~g~yGtlti~~-~G~wtYtl~n~~~avq~L~~Ge~~tdsFtvtv~DG------tt 70 (99)
T TIGR01965 2 GQLTISDADAGQA--HFIAQTDA--AGQYGTFSIDA-DGQWTYQADNSQTAVQALKAGETLTDTFTVTSADG------TS 70 (99)
T ss_pred CceEEeCCCCCCc--eEEecccc--cCCcEEEEECC-CCcEEEEeCCCcHHHHhhcCCCEEEEEEEEEEeCC------Ce
Confidence 4688999996543 45555322 14577899987 67654431 2332234457788889996 26
Q ss_pred EEEEEEEEecCCCCCeeeCCceEEEEEe
Q psy8618 143 RLLRVEVLDLNDNRPTFTSSSLLFKVSA 170 (232)
Q Consensus 143 ~~v~i~v~d~Nd~~P~f~~~~~~~~v~~ 170 (232)
..|.|+|.-.|| +|...... ...+.|
T Consensus 71 ~~vtItI~GtND-apvi~~~~-~g~v~E 96 (99)
T TIGR01965 71 QTVTITITGAND-AAVIGGAD-TGSVTE 96 (99)
T ss_pred EEEEEEEEccCC-CCEEeccc-ceeEec
Confidence 789999999999 77654322 245554
No 17
>PF08266 Cadherin_2: Cadherin-like; InterPro: IPR013164 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion. This entry represents a cadherin domain that is usually found at the N terminus of cadherin proteins.; PDB: 1WUZ_A 1WYJ_A.
Probab=96.09 E-value=0.011 Score=38.50 Aligned_cols=32 Identities=19% Similarity=0.449 Sum_probs=23.1
Q ss_pred EEEEccCCCCCCcEEEECCceEEEEeecCCCcc
Q psy8618 182 NYTLGESPSRTNHFYMKSVSGEICISQDLDFES 214 (232)
Q Consensus 182 ~y~l~~~~~~~~~f~i~~~~G~i~~~~~ld~e~ 214 (232)
.|.+... ....+|.++..+|.|.+...+|+|.
T Consensus 35 ~~ri~s~-~~~~~~~v~~~tG~L~v~~rIDRE~ 66 (84)
T PF08266_consen 35 NFRIVSE-GNSQYFRVNEKTGDLFVSERIDREE 66 (84)
T ss_dssp TBEEE-S-SSS-SEEE-TTTSEEEESS--SCCC
T ss_pred ceEEeec-CCcceeEecCCceeEEeCCccCHHH
Confidence 6666654 5689999999999999999999986
No 18
>PF08758 Cadherin_pro: Cadherin prodomain like; InterPro: IPR014868 Cadherins are a group of proteins that mediate calcium dependent cell-cell adhesion. They are activated through cleavage of a prosequence in the late Golgi. This protein corresponds to the folded region of the prosequence, and is termed the prodomain. The prodomain shows structural resemblance to the cadherin domain, but lacks all the features known to be important for cadherin-cadherin interactions []. ; GO: 0007155 cell adhesion, 0016021 integral to membrane; PDB: 1OP4_A.
Probab=94.89 E-value=0.17 Score=33.46 Aligned_cols=69 Identities=22% Similarity=0.349 Sum_probs=34.3
Q ss_pred CCeeeCCceEEEEEee-----------cCCCCCCeEEEEEEccCCCCCCcEEEECCceEEEEeecCCCccccEEEEEEEE
Q psy8618 156 RPTFTSSSLLFKVSAS-----------DPDCGVNAMVNYTLGESPSRTNHFYMKSVSGEICISQDLDFESRSSYEFPVVA 224 (232)
Q Consensus 156 ~P~f~~~~~~~~v~~~-----------D~d~~~~~~i~y~l~~~~~~~~~f~i~~~~G~i~~~~~ld~e~~~~y~l~V~a 224 (232)
.|-|....|.+.|+.. =.|+.....+.|.. .+..|.|. ..|.|++++++..... .-.|.|.|
T Consensus 3 ~pGF~~~~~~~~Vp~~l~~g~~lg~V~f~dC~~~~~~~~~s-----sDpdF~V~-~DGsVy~~r~v~l~~~-~~~F~V~a 75 (90)
T PF08758_consen 3 RPGFSQKKYTFEVPSNLEAGQPLGKVNFEDCTGRRRVIFES-----SDPDFRVL-EDGSVYAKRPVQLSSE-QRSFTVHA 75 (90)
T ss_dssp --B--S-EEEE----SS-SS--EEE---B--SS---EEEE--------SEEEEE-TTTEEEEES--S-SSS--EEEEEEE
T ss_pred cCCcccceEEEEcCchhhCCcEEEEEEeccCCCCCceEEec-----CCCCEEEc-CCCeEEEeeeEecCCC-ceEEEEEE
Confidence 4667777777777651 01232233345553 24589999 6999999999887654 35799999
Q ss_pred EECCccC
Q psy8618 225 TDRGKET 231 (232)
Q Consensus 225 ~D~g~~~ 231 (232)
.|.....
T Consensus 76 ~D~~~~~ 82 (90)
T PF08758_consen 76 WDSQTQE 82 (90)
T ss_dssp EETTTTE
T ss_pred ECCCCCe
Confidence 9987653
No 19
>PF13750 Big_3_3: Bacterial Ig-like domain (group 3)
Probab=92.89 E-value=2.8 Score=30.81 Aligned_cols=129 Identities=20% Similarity=0.257 Sum_probs=65.2
Q ss_pred CCeEEEEE-EEEeCCCCCceeEEEEEEEEEeCCCCCCeecCCCceEEEEecCCCC-CcEEEEEEEEeCCCCCCcEEEEEE
Q psy8618 13 RSSHELVA-EARDQGSPPRSSRVAVKISVTDVNDNAPELVDPQEDVISVREEQPP-GTEVVRARAVDKDDGSNATISYSI 90 (232)
Q Consensus 13 ~~~~~l~v-~a~d~~~~~~~~~~~v~v~V~dvNd~~P~f~~~~~~~~~v~E~~~~-g~~v~~v~a~D~D~~~~~~v~y~i 90 (232)
.-.|.+++ .|.|..+...+.....++. ++..+|.+.-. ....+..+... +..-..+.++|.-.+. .....++
T Consensus 14 dG~Y~l~~~~a~D~agN~~~~~~~~~~~---iD~T~Ptisi~--~~~~~~~g~~v~~~~~i~i~~tD~~~~~-~i~sv~l 87 (158)
T PF13750_consen 14 DGSYTLTVVTATDAAGNTSTSTVSETFT---IDNTPPTISIS--DGASVANGSTVYGLVNISINVTDNSDDS-KITSVSL 87 (158)
T ss_pred CccEEEEEEEEEecCCCEEEEEEeeEEE---EcCCCCEEEEe--cCCccCCCccccceeeeEEEEEeCCCCc-eEEEEEE
Confidence 35899999 7999877433333332333 34446755430 00122222222 3333557777765433 3446666
Q ss_pred eeCCCCCCcccEEEe-CCceEEEEC--ccC-CccCCCEEEEEEEEEeCCCCCcceeEEEEEEEE
Q psy8618 91 LKGRDTDGYGVFTID-PISGVIRTK--SVL-DHEDRSIYRIAVAATDNGFPSRQTVRLLRVEVL 150 (232)
Q Consensus 91 ~~~~~~~~~~~f~id-~~tG~i~~~--~~l-d~e~~~~~~l~v~a~d~~~~~~~~~~~v~i~v~ 150 (232)
..|.. ...-..... ...|...+. +.+ ..|....|+|+|.|.|..+ ..++..+.+...
T Consensus 88 ~Gg~~-~d~v~ls~~~~~~~~~~~~yp~~fpsle~~~~YtLtV~a~D~aG--N~~~~si~F~y~ 148 (158)
T PF13750_consen 88 TGGPA-SDSVSLSWTNKGNGVYTLEYPRIFPSLEADDSYTLTVSATDKAG--NQSTKSISFSYM 148 (158)
T ss_pred ECCcc-cceEEEeeEeccCceEEeecccccCCcCCCCeEEEEEEEEecCC--CEEEEEEEEEEe
Confidence 44421 122112222 123444443 211 3466789999999999864 344445555544
No 20
>PF05345 He_PIG: Putative Ig domain; InterPro: IPR008009 This alignment represents the conserved core region of a ~90 residue repeat found in several haemagglutinins and other cell surface proteins. Sequence similarities to Hyalin (IPR003410 from INTERPRO) and the PKD domain (IPR000601 from INTERPRO) suggest an Ig-like fold so this family may be similar in function to the (IPR003791 from INTERPRO) and (IPR003790 from INTERPRO) protein families.
Probab=92.26 E-value=1.2 Score=25.68 Aligned_cols=37 Identities=24% Similarity=0.348 Sum_probs=28.3
Q ss_pred CCcccEEEeCCceEEEECccCCcc-CCCEEEEEEEEEeCC
Q psy8618 97 DGYGVFTIDPISGVIRTKSVLDHE-DRSIYRIAVAATDNG 135 (232)
Q Consensus 97 ~~~~~f~id~~tG~i~~~~~ld~e-~~~~~~l~v~a~d~~ 135 (232)
....+..+|+.+|.|+-. .+.. ..+.|.|.|.|+|..
T Consensus 11 ~LP~gLs~d~~tG~isGt--p~~~~~~G~y~~~vtatd~~ 48 (49)
T PF05345_consen 11 GLPSGLSLDPSTGTISGT--PTSSVQPGTYTFTVTATDGS 48 (49)
T ss_pred CCCCcEEEeCCCCEEEee--cCCCccccEEEEEEEEEcCC
Confidence 556789999999998875 2233 346899999999964
No 21
>TIGR00845 caca sodium/calcium exchanger 1. This model is specific for the eukaryotic sodium ion/calcium ion exchangers of the Caca family
Probab=91.62 E-value=13 Score=35.38 Aligned_cols=121 Identities=19% Similarity=0.255 Sum_probs=61.6
Q ss_pred CCCCCCeecCCCceEEEEecCCCCCcEEEEEEEEeCCCCCCcEEEEEEeeCCCCCCcccEEEeCCceEEEEC--------
Q psy8618 43 VNDNAPELVDPQEDVISVREEQPPGTEVVRARAVDKDDGSNATISYSILKGRDTDGYGVFTIDPISGVIRTK-------- 114 (232)
Q Consensus 43 vNd~~P~f~~~~~~~~~v~E~~~~g~~v~~v~a~D~D~~~~~~v~y~i~~~~~~~~~~~f~id~~tG~i~~~-------- 114 (232)
.||..+.|... .-..++.|+. |+.-..|.-...|.+..-.|.|+...|+.....+ |. +.+|.|...
T Consensus 395 ~dd~~s~i~Fe-~~~Y~V~En~--GtV~VtV~R~GGdl~~tVsVdY~T~DGTA~AG~D-Y~--~~sGTLtF~PGEt~KtI 468 (928)
T TIGR00845 395 ENDPVSKIFFE-PGHYTCLENC--GTVALTVVRRGGDLTNTVYVDYRTEDGTANAGSD-YE--FTEGTLVFKPGETQKEF 468 (928)
T ss_pred ccCCcceEEec-CCeEEEeecC--cEEEEEEEEccCCCCceEEEEEEccCCccCCCCC-cc--ccCceEEECCCceEEEE
Confidence 45555555554 3467899986 5655665544445444566899887775422111 21 223433322
Q ss_pred --ccCC---ccCCCEEEEEEEEEeCCC-------------CCcceeEEEEEEEEecCCCCCeeeCCceEEEEEe
Q psy8618 115 --SVLD---HEDRSIYRIAVAATDNGF-------------PSRQTVRLLRVEVLDLNDNRPTFTSSSLLFKVSA 170 (232)
Q Consensus 115 --~~ld---~e~~~~~~l~v~a~d~~~-------------~~~~~~~~v~i~v~d~Nd~~P~f~~~~~~~~v~~ 170 (232)
.-+| +|....|.+.+.-...+. .........+|+|.| ||+++.|....-.+.|.|
T Consensus 469 tV~IIDDdi~E~DE~F~V~LSNp~~g~~~G~~~~~~~~~~A~Lg~ps~ATVTIlD-DD~aGIfsFe~~~~sV~E 541 (928)
T TIGR00845 469 RIGIIDDDIFEEDEHFYVRLSNLRVGSEDGILEANHVSAVAQLASPNTATVTILD-DDHAGIFTFEEDVFHVSE 541 (928)
T ss_pred EEEEccCCCCCCCceEEEEEeCCCCCCcccccccccccccceecCCceEEEEEec-CcccCcccccCceEEEEc
Confidence 1122 333444555444322110 012223356777787 888988765443444544
No 22
>smart00736 CADG Dystroglycan-type cadherin-like domains. Cadherin-homologous domains present in metazoan dystroglycans and alpha/epsilon sarcoglycans, yeast Axl2p and in a very large protein from magnetotactic bacteria. Likely to bind calcium ions.
Probab=89.50 E-value=1.6 Score=29.03 Aligned_cols=31 Identities=29% Similarity=0.314 Sum_probs=26.7
Q ss_pred CCeEEEEEEEEeCCCCCceeEEEEEEEEEeCCC
Q psy8618 13 RSSHELVAEARDQGSPPRSSRVAVKISVTDVND 45 (232)
Q Consensus 13 ~~~~~l~v~a~d~~~~~~~~~~~v~v~V~dvNd 45 (232)
...|.+.|.|+|+.+ .++...++|.|.+.|+
T Consensus 66 ~g~~~i~v~a~D~~g--~~~~~~f~i~V~~~~~ 96 (97)
T smart00736 66 VGSLSLKVTATDSSG--ASASDTFTITVVNTND 96 (97)
T ss_pred CcEEEEEEEEEECCC--CEEEEEEEEEEeCCCC
Confidence 457999999999876 6788899999999887
No 23
>TIGR01965 VCBS_repeat VCBS repeat. This domain of about 100 residues is found multiple (up to 35) copies in long proteins from several species of Vibrio, Colwellia, Bradyrhizobium, and Shewanella (hence the name VCBS) and in smaller copy numbers in proteins from several other bacteria. The large protein size and repeat copy numbers, species distribution, and suggested activities of several member proteins suggests a role for this domain in adhesion.
Probab=88.19 E-value=2.6 Score=28.30 Aligned_cols=48 Identities=25% Similarity=0.316 Sum_probs=33.0
Q ss_pred CCcCCCCeEEEEEEEEeCCCCCceeEEEEEEEEEeCCCCCCeecCCCceEEEEecCC
Q psy8618 8 LDREARSSHELVAEARDQGSPPRSSRVAVKISVTDVNDNAPELVDPQEDVISVREEQ 64 (232)
Q Consensus 8 lD~E~~~~~~l~v~a~d~~~~~~~~~~~v~v~V~dvNd~~P~f~~~~~~~~~v~E~~ 64 (232)
|.--+...-++++.+.|+ ....|+|.|...|| +|+.... -...+.|+.
T Consensus 51 L~~Ge~~tdsFtvtv~DG------tt~~vtItI~GtND-apvi~~~--~~g~v~ED~ 98 (99)
T TIGR01965 51 LKAGETLTDTFTVTSADG------TSQTVTITITGAND-AAVIGGA--DTGSVTEDS 98 (99)
T ss_pred hcCCCEEEEEEEEEEeCC------CeEEEEEEEEccCC-CCEEecc--cceeEecCC
Confidence 333334455677788885 27889999999999 8877654 246666653
No 24
>TIGR03660 T1SS_rpt_143 T1SS-143 repeat domain. This model represents a domain of about 143 amino acids that may occur singly or in up to 23 tandem repeats in very large proteins in the genus Vibrio, and in related species such as Legionella pneumophila, Photobacterium profundum, Rhodopseudomonas palustris, Shewanella pealeana, and Aeromonas hydrophila. Proteins with these domains represent a subset of a broader set of proteins with a particular signal for type 1 secretion, consisting of several glycine-rich repeats modeled by pfam00353, followed by a C-terminal domain modeled by TIGR03661. Proteins with this domain tend to share several properties with the RtxA (Repeats in Toxin) protein of Vibrio cholerae, including a large size often containing tandemly repeated domains and a C-terminal signal for type 1 secretion.
Probab=87.05 E-value=4.1 Score=29.22 Aligned_cols=58 Identities=22% Similarity=0.446 Sum_probs=38.5
Q ss_pred eEecCCCCcCC---CCeEEEEEEEEeCCCCCceeEEEEEEEEEeCCCCCCeecCCCceEEEEecCCCC
Q psy8618 2 ISTREPLDREA---RSSHELVAEARDQGSPPRSSRVAVKISVTDVNDNAPELVDPQEDVISVREEQPP 66 (232)
Q Consensus 2 i~~~~~lD~E~---~~~~~l~v~a~d~~~~~~~~~~~v~v~V~dvNd~~P~f~~~~~~~~~v~E~~~~ 66 (232)
..+.++||... .-...|.|.|+|..+-.. ...+.|.|.| | .|..... . .+.|.|+...
T Consensus 70 ftL~~~lDH~~g~d~l~l~~~v~a~D~DGD~s--~~~l~VtI~D--D-~P~~~~~-~-~~~V~E~~L~ 130 (137)
T TIGR03660 70 FTLEGPLDHAAGSDELTLNFPIIATDFDGDTS--SITLPVTIVD--D-VPTITDV-D-ALTVDEDDLP 130 (137)
T ss_pred EEEcccccCCCCCceEEEeeeEEEEeCCCCcc--ccEEEEEEEC--C-CCeeccc-c-ceEEeccccc
Confidence 35678888743 446778899999776433 3477777776 6 5877654 2 4788886543
No 25
>PF05345 He_PIG: Putative Ig domain; InterPro: IPR008009 This alignment represents the conserved core region of a ~90 residue repeat found in several haemagglutinins and other cell surface proteins. Sequence similarities to Hyalin (IPR003410 from INTERPRO) and the PKD domain (IPR000601 from INTERPRO) suggest an Ig-like fold so this family may be similar in function to the (IPR003791 from INTERPRO) and (IPR003790 from INTERPRO) protein families.
Probab=85.28 E-value=5.2 Score=22.98 Aligned_cols=37 Identities=22% Similarity=0.319 Sum_probs=28.5
Q ss_pred CCCCcEEEECCceEEEEeecCCCc-cccEEEEEEEEEECC
Q psy8618 190 SRTNHFYMKSVSGEICISQDLDFE-SRSSYEFPVVATDRG 228 (232)
Q Consensus 190 ~~~~~f~i~~~~G~i~~~~~ld~e-~~~~y~l~V~a~D~g 228 (232)
.-.....||+.+|.|.-.- +.. ..+.|.+.|.|+|..
T Consensus 11 ~LP~gLs~d~~tG~isGtp--~~~~~~G~y~~~vtatd~~ 48 (49)
T PF05345_consen 11 GLPSGLSLDPSTGTISGTP--TSSVQPGTYTFTVTATDGS 48 (49)
T ss_pred CCCCcEEEeCCCCEEEeec--CCCccccEEEEEEEEEcCC
Confidence 5567889999999998663 333 335899999999975
No 26
>PF13750 Big_3_3: Bacterial Ig-like domain (group 3)
Probab=84.45 E-value=14 Score=27.17 Aligned_cols=103 Identities=18% Similarity=0.307 Sum_probs=53.7
Q ss_pred CCEEEEEE-EEEeCCCCCcceeEEEEEEEEecCCCCCeeeC---Cc----------eEEEEEeecCCCCCCeEEEEEEcc
Q psy8618 122 RSIYRIAV-AATDNGFPSRQTVRLLRVEVLDLNDNRPTFTS---SS----------LLFKVSASDPDCGVNAMVNYTLGE 187 (232)
Q Consensus 122 ~~~~~l~v-~a~d~~~~~~~~~~~v~i~v~d~Nd~~P~f~~---~~----------~~~~v~~~D~d~~~~~~i~y~l~~ 187 (232)
-+.|.+.+ .|.|..+..........+.| ...+|.+.- .. ..+.|..+|..... ..-..+|.+
T Consensus 14 dG~Y~l~~~~a~D~agN~~~~~~~~~~~i---D~T~Ptisi~~~~~~~~g~~v~~~~~i~i~~tD~~~~~-~i~sv~l~G 89 (158)
T PF13750_consen 14 DGSYTLTVVTATDAAGNTSTSTVSETFTI---DNTPPTISISDGASVANGSTVYGLVNISINVTDNSDDS-KITSVSLTG 89 (158)
T ss_pred CccEEEEEEEEEecCCCEEEEEEeeEEEE---cCCCCEEEEecCCccCCCccccceeeeEEEEEeCCCCc-eEEEEEEEC
Confidence 46799999 79997653322222222332 334777654 00 23344444444333 555777776
Q ss_pred CCCCCCcEEE--E-CCceEEEEe--ecC-CCccccEEEEEEEEEECCc
Q psy8618 188 SPSRTNHFYM--K-SVSGEICIS--QDL-DFESRSSYEFPVVATDRGK 229 (232)
Q Consensus 188 ~~~~~~~f~i--~-~~~G~i~~~--~~l-d~e~~~~y~l~V~a~D~g~ 229 (232)
++ ....-.+ . ...|...+. +.+ ..|....|.|.|.|.|...
T Consensus 90 g~-~~d~v~ls~~~~~~~~~~~~yp~~fpsle~~~~YtLtV~a~D~aG 136 (158)
T PF13750_consen 90 GP-ASDSVSLSWTNKGNGVYTLEYPRIFPSLEADDSYTLTVSATDKAG 136 (158)
T ss_pred Cc-ccceEEEeeEeccCceEEeecccccCCcCCCCeEEEEEEEEecCC
Confidence 52 2222222 1 123333332 111 3367789999999999753
No 27
>TIGR03660 T1SS_rpt_143 T1SS-143 repeat domain. This model represents a domain of about 143 amino acids that may occur singly or in up to 23 tandem repeats in very large proteins in the genus Vibrio, and in related species such as Legionella pneumophila, Photobacterium profundum, Rhodopseudomonas palustris, Shewanella pealeana, and Aeromonas hydrophila. Proteins with these domains represent a subset of a broader set of proteins with a particular signal for type 1 secretion, consisting of several glycine-rich repeats modeled by pfam00353, followed by a C-terminal domain modeled by TIGR03661. Proteins with this domain tend to share several properties with the RtxA (Repeats in Toxin) protein of Vibrio cholerae, including a large size often containing tandemly repeated domains and a C-terminal signal for type 1 secretion.
Probab=77.57 E-value=23 Score=25.36 Aligned_cols=59 Identities=25% Similarity=0.266 Sum_probs=38.2
Q ss_pred eEEEECccCCccC---CCEEEEEEEEEeCCCCCcceeEEEEEEEEecCCCCCeeeCCceEEEEEeecC
Q psy8618 109 GVIRTKSVLDHED---RSIYRIAVAATDNGFPSRQTVRLLRVEVLDLNDNRPTFTSSSLLFKVSASDP 173 (232)
Q Consensus 109 G~i~~~~~ld~e~---~~~~~l~v~a~d~~~~~~~~~~~v~i~v~d~Nd~~P~f~~~~~~~~v~~~D~ 173 (232)
-.+++.++||+.. .-...|.|.|+|.+.-.. ...+.|+|.| | .|...... .++|.|.+.
T Consensus 68 YtftL~~~lDH~~g~d~l~l~~~v~a~D~DGD~s--~~~l~VtI~D--D-~P~~~~~~-~~~V~E~~L 129 (137)
T TIGR03660 68 YEFTLEGPLDHAAGSDELTLNFPIIATDFDGDTS--SITLPVTIVD--D-VPTITDVD-ALTVDEDDL 129 (137)
T ss_pred EEEEEcccccCCCCCceEEEeeeEEEEeCCCCcc--ccEEEEEEEC--C-CCeecccc-ceEEecccc
Confidence 4566777887643 335778899999765332 3467777776 5 57765544 368887443
No 28
>PF07495 Y_Y_Y: Y_Y_Y domain; InterPro: IPR011123 This region is mostly found at the end of the beta propellers (IPR011110 from INTERPRO) in a family of two component regulators. However they are also found tandemly repeated in Q891H4 from SWISSPROT without other signal conduction domains being present. It is named after the conserved tyrosines found in the alignment. The exact function is not known.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=77.28 E-value=13 Score=22.30 Aligned_cols=59 Identities=22% Similarity=0.215 Sum_probs=32.7
Q ss_pred CcEEEEEEeeCCCCCCcccEEEeCCceEEEECccCCccCCCEEEEEEEEEeCCCCCcceeEEEEEEE
Q psy8618 83 NATISYSILKGRDTDGYGVFTIDPISGVIRTKSVLDHEDRSIYRIAVAATDNGFPSRQTVRLLRVEV 149 (232)
Q Consensus 83 ~~~v~y~i~~~~~~~~~~~f~id~~tG~i~~~~~ld~e~~~~~~l~v~a~d~~~~~~~~~~~v~i~v 149 (232)
+-..+|++. | -...|..+...+-.+... .| ..+.|+|.|.|.|...........+.|+|
T Consensus 7 ~~~Y~Y~l~-g---~d~~W~~~~~~~~~~~~~-~L---~~G~Y~l~V~a~~~~~~~~~~~~~l~i~I 65 (66)
T PF07495_consen 7 NIRYRYRLE-G---FDDEWITLGSYSNSISYT-NL---PPGKYTLEVRAKDNNGKWSSDEKSLTITI 65 (66)
T ss_dssp TEEEEEEEE-T---TESSEEEESSTS-EEEEE-S-----SEEEEEEEEEEETTS-B-SS-EEEEEEE
T ss_pred ceEEEEEEE-C---CCCeEEECCCCcEEEEEE-eC---CCEEEEEEEEEECCCCCcCcccEEEEEEE
Confidence 334566664 4 345667766543243332 22 56889999999997654333335566665
No 29
>PF07495 Y_Y_Y: Y_Y_Y domain; InterPro: IPR011123 This region is mostly found at the end of the beta propellers (IPR011110 from INTERPRO) in a family of two component regulators. However they are also found tandemly repeated in Q891H4 from SWISSPROT without other signal conduction domains being present. It is named after the conserved tyrosines found in the alignment. The exact function is not known.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=72.36 E-value=10 Score=22.85 Aligned_cols=27 Identities=19% Similarity=0.240 Sum_probs=17.2
Q ss_pred CeEEEEEEEEeCCCCCceeEEEEEEEE
Q psy8618 14 SSHELVAEARDQGSPPRSSRVAVKISV 40 (232)
Q Consensus 14 ~~~~l~v~a~d~~~~~~~~~~~v~v~V 40 (232)
-.|.|.|.|.|..+........+.|.|
T Consensus 39 G~Y~l~V~a~~~~~~~~~~~~~l~i~I 65 (66)
T PF07495_consen 39 GKYTLEVRAKDNNGKWSSDEKSLTITI 65 (66)
T ss_dssp EEEEEEEEEEETTS-B-SS-EEEEEEE
T ss_pred EEEEEEEEEECCCCCcCcccEEEEEEE
Confidence 489999999997765433335666655
No 30
>TIGR00864 PCC polycystin cation channel protein. Note: this model has been restricted to the amino half because for technical reasons.
Probab=69.03 E-value=1.8e+02 Score=31.70 Aligned_cols=98 Identities=10% Similarity=0.078 Sum_probs=49.6
Q ss_pred ccCCCEEEEEEEEEeCCCCCcceeEEEEEEEEecCCC------------------CCeeeCCceEEEEEeecCCCCCCeE
Q psy8618 119 HEDRSIYRIAVAATDNGFPSRQTVRLLRVEVLDLNDN------------------RPTFTSSSLLFKVSASDPDCGVNAM 180 (232)
Q Consensus 119 ~e~~~~~~l~v~a~d~~~~~~~~~~~v~i~v~d~Nd~------------------~P~f~~~~~~~~v~~~D~d~~~~~~ 180 (232)
|...+.|++.++|++.... ...++.|.|.+.... -..+....-.+...+ ....|+.
T Consensus 1564 Y~spGtYtVtLTvtN~~Gs---~~~T~~i~V~~~I~GlqI~~~~~~gg~~~~~~~~~~~~~t~~~v~l~~---~~~~GTn 1637 (2740)
T TIGR00864 1564 FRSVGTFNIIVTAENDVGA---AQASIFLFVLQEIEGLQILGETAEGGGGGVQELDGCYFETNHTVQFHA---GFKDGTN 1637 (2740)
T ss_pred ecCCceEEEEEEEecCCCc---cceeEEEEEeecccceEEEeecccccCCcccccccceeccCCEEEeee---ccCCCCe
Confidence 6678999999999997642 233445555432110 011222222344444 3344667
Q ss_pred EEEEEccCCC-C-CCcEEEECCceEEEEeecCCCccccEEEEEEEEEEC
Q psy8618 181 VNYTLGESPS-R-TNHFYMKSVSGEICISQDLDFESRSSYEFPVVATDR 227 (232)
Q Consensus 181 i~y~l~~~~~-~-~~~f~i~~~~G~i~~~~~ld~e~~~~y~l~V~a~D~ 227 (232)
++|+..-+.+ . .+... ..+|... .+.|...+.|.+.+.|++.
T Consensus 1638 vsysW~~~~~n~~dg~~~--~~tGk~~---~vt~~~pG~Y~VtL~aSN~ 1681 (2740)
T TIGR00864 1638 LSFSWNAILDNEPDGPAF--AGSGKGA---KLNPLEAGPCDIFLQAANL 1681 (2740)
T ss_pred eEEEEEEecCCCCCCccc--cccCcee---EEecCCCceEEEEEEEeec
Confidence 7776643201 0 11111 1344321 2344566788888888764
No 31
>KOG4221|consensus
Probab=68.96 E-value=1.3e+02 Score=29.92 Aligned_cols=136 Identities=13% Similarity=0.145 Sum_probs=73.3
Q ss_pred EEEEEEeeCCCCCCcccEEEeCCceEEEECccCCccCCCEEEEEEEEEeCCCCCcceeEEEEEEEEecCCCCCe-ee---
Q psy8618 85 TISYSILKGRDTDGYGVFTIDPISGVIRTKSVLDHEDRSIYRIAVAATDNGFPSRQTVRLLRVEVLDLNDNRPT-FT--- 160 (232)
Q Consensus 85 ~v~y~i~~~~~~~~~~~f~id~~tG~i~~~~~ld~e~~~~~~l~v~a~d~~~~~~~~~~~v~i~v~d~Nd~~P~-f~--- 160 (232)
...|.+.-... +...++.++.++-..++. +-+....|.+.|.|.....++.++...-.++..|+-+.+|. ..
T Consensus 551 I~~yk~~ys~~-~~~~~~~~~~n~~e~ti~---gL~k~TeY~~~vvA~N~~G~g~sS~~i~V~Tlsd~PsaPP~Nl~lev 626 (1381)
T KOG4221|consen 551 ITGYKLFYSED-DTGKELRVENNATEYTIN---GLEKYTEYSIRVVAYNSAGSGVSSADITVRTLSDVPSAPPQNLSLEV 626 (1381)
T ss_pred ceEEEEEEEcC-CCCceEEEecCccEEEee---cCCCccceEEEEEEecCCCCCCCCCceEEEeccCCCCCCCcceEEEe
Confidence 34555543322 456778888776666665 34566789999999998877766655445667777777776 21
Q ss_pred CCceEEEEEee--cCCCCCCeEEEEEEccC--CCCCCc-EEEECCceEEEEeecCCCccccEEEEEEEEEE
Q psy8618 161 SSSLLFKVSAS--DPDCGVNAMVNYTLGES--PSRTNH-FYMKSVSGEICISQDLDFESRSSYEFPVVATD 226 (232)
Q Consensus 161 ~~~~~~~v~~~--D~d~~~~~~i~y~l~~~--~~~~~~-f~i~~~~G~i~~~~~ld~e~~~~y~l~V~a~D 226 (232)
.+.-.+.|.=. -.....|...-|.|... ...... ..+...+-.-.+.. +.|....|.|.|.|..
T Consensus 627 ~sStsVrVsW~pP~~~t~ng~itgYkIRy~~~~~~~~~~~t~v~~n~~~~l~~--~Lep~T~Y~vrIsa~t 695 (1381)
T KOG4221|consen 627 VSSTSVRVSWLPPPSETQNGQITGYKIRYRKLSREDEVNETVVKGNTTQYLFN--GLEPNTQYRVRISAMT 695 (1381)
T ss_pred cCCCeEEEEccCCCcccccceEEEEEEEecccCcccccceeecccchhhhHhh--cCCCCceEEEEEEEec
Confidence 12223444321 11222344446666543 011111 22221011111111 2455667999998865
No 32
>TIGR00845 caca sodium/calcium exchanger 1. This model is specific for the eukaryotic sodium ion/calcium ion exchangers of the Caca family
Probab=67.45 E-value=1.2e+02 Score=29.16 Aligned_cols=56 Identities=21% Similarity=0.332 Sum_probs=32.2
Q ss_pred EEEEEEEEeCCCCCCeecCCCceEEEEecCCCCCcEEEEEEEEeCCCCCCcEEEEEEeeCC
Q psy8618 34 VAVKISVTDVNDNAPELVDPQEDVISVREEQPPGTEVVRARAVDKDDGSNATISYSILKGR 94 (232)
Q Consensus 34 ~~v~v~V~dvNd~~P~f~~~~~~~~~v~E~~~~g~~v~~v~a~D~D~~~~~~v~y~i~~~~ 94 (232)
...+|+|.| ||++|.|... .-...|.|+. |+.-.++.-+- +....-.+.|....|+
T Consensus 515 s~ATVTIlD-DD~aGIfsFe-~~~~sV~Es~--G~vtvtV~Rts-Ga~G~VtV~Y~T~dGT 570 (928)
T TIGR00845 515 NTATVTILD-DDHAGIFTFE-EDVFHVSESI--GIMEVKVLRTS-GARGTVIVPYRTVEGT 570 (928)
T ss_pred ceEEEEEec-CcccCccccc-CceEEEEcCC--CEEEEEEEEcC-CCCeeEEEEEEeecCc
Confidence 455667777 7789988766 3458889975 44433332221 1112233668776664
No 33
>KOG3597|consensus
Probab=64.51 E-value=97 Score=27.01 Aligned_cols=60 Identities=17% Similarity=0.230 Sum_probs=44.3
Q ss_pred eEEEEEEEEEeCCCCCCeecCCCceEEEEecCCCCCcEEEEEEEEeCCCCCCcEEEEEEeeC
Q psy8618 32 SRVAVKISVTDVNDNAPELVDPQEDVISVREEQPPGTEVVRARAVDKDDGSNATISYSILKG 93 (232)
Q Consensus 32 ~~~~v~v~V~dvNd~~P~f~~~~~~~~~v~E~~~~g~~v~~v~a~D~D~~~~~~v~y~i~~~ 93 (232)
.+....|.|..+||.+..+... -+.+.+.|+...-.....+.+.|+|.+. ..+.|+++..
T Consensus 24 ~~~~~~i~v~pvndpp~~~~~~-~~~l~~~~~~~k~l~~~~l~~~d~d~~~-~~l~f~v~~t 83 (442)
T KOG3597|consen 24 QTDVLRIHVNPVNDPPSLIFPS-GSLLVILEGGQKVLDPELLTAADPDSAP-LPLEFQVLGT 83 (442)
T ss_pred EEeeecccccccCCCcceeecc-cceEEeecCCceeccceEeeccCCCCCc-cceEEEEccC
Confidence 4667889999999966656555 4458888888766666778899998754 3578888654
No 34
>TIGR00864 PCC polycystin cation channel protein. Note: this model has been restricted to the amino half because for technical reasons.
Probab=54.58 E-value=3.3e+02 Score=29.92 Aligned_cols=31 Identities=19% Similarity=0.132 Sum_probs=21.8
Q ss_pred CccCCCEEEEEEEEEeCCCCCcceeEEEEEEEEe
Q psy8618 118 DHEDRSIYRIAVAATDNGFPSRQTVRLLRVEVLD 151 (232)
Q Consensus 118 d~e~~~~~~l~v~a~d~~~~~~~~~~~v~i~v~d 151 (232)
-|...+.|+++++|++.... ...++.|.|..
T Consensus 1479 tY~~~GtYtVtLTvtN~~Gs---st~T~~VtV~~ 1509 (2740)
T TIGR00864 1479 AFNSPGDFNIRLAAANEVGK---NEATLNVAVKA 1509 (2740)
T ss_pred EcCCCceEEEEEEEECCCCc---eEEEEEEEEec
Confidence 36678999999999998642 34455666653
No 35
>smart00089 PKD Repeats in polycystic kidney disease 1 (PKD1) and other proteins. Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.
Probab=54.06 E-value=43 Score=20.75 Aligned_cols=30 Identities=13% Similarity=0.106 Sum_probs=21.5
Q ss_pred CCccCCCEEEEEEEEEeCCCCCcceeEEEEEEE
Q psy8618 117 LDHEDRSIYRIAVAATDNGFPSRQTVRLLRVEV 149 (232)
Q Consensus 117 ld~e~~~~~~l~v~a~d~~~~~~~~~~~v~i~v 149 (232)
.-|...+.|.+++.++|... +..+.+.|.|
T Consensus 49 ~~y~~~G~y~v~l~v~n~~g---~~~~~~~i~v 78 (79)
T smart00089 49 HTYTKPGTYTVTLTVTNAVG---SASATVTVVV 78 (79)
T ss_pred EEeCCCcEEEEEEEEEcCCC---cEEEEEEEEE
Confidence 34667889999999999764 4455555554
No 36
>smart00089 PKD Repeats in polycystic kidney disease 1 (PKD1) and other proteins. Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.
Probab=49.17 E-value=60 Score=20.08 Aligned_cols=28 Identities=11% Similarity=0.106 Sum_probs=17.4
Q ss_pred cCCCCeEEEEEEEEeCCCCCceeEEEEEEEE
Q psy8618 10 REARSSHELVAEARDQGSPPRSSRVAVKISV 40 (232)
Q Consensus 10 ~E~~~~~~l~v~a~d~~~~~~~~~~~v~v~V 40 (232)
|+....|.+++.+.|..+ ++.+.++|.|
T Consensus 51 y~~~G~y~v~l~v~n~~g---~~~~~~~i~v 78 (79)
T smart00089 51 YTKPGTYTVTLTVTNAVG---SASATVTVVV 78 (79)
T ss_pred eCCCcEEEEEEEEEcCCC---cEEEEEEEEE
Confidence 445567888888887665 4444555543
No 37
>PF12245 Big_3_2: Bacterial Ig-like domain (group 3); InterPro: IPR022038 This family of proteins is found in bacteria. They have two conserved sequence motifs: AGN and GMT.
Probab=48.89 E-value=57 Score=19.46 Aligned_cols=29 Identities=21% Similarity=0.257 Sum_probs=18.7
Q ss_pred CCeEEEEEEEEeCCCCCceeEEEEEEEEEeC
Q psy8618 13 RSSHELVAEARDQGSPPRSSRVAVKISVTDV 43 (232)
Q Consensus 13 ~~~~~l~v~a~d~~~~~~~~~~~v~v~V~dv 43 (232)
...|.|.+.|+|..|.. +.......+.|.
T Consensus 22 dg~yt~~v~a~D~AGN~--~~~~~~~~i~d~ 50 (60)
T PF12245_consen 22 DGEYTLTVTATDKAGNT--SSSTTQIVIVDN 50 (60)
T ss_pred CccEEEEEEEEECCCCE--EEeeeEEEEEcC
Confidence 56899999999987743 333444444443
No 38
>cd00146 PKD polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.
Probab=48.35 E-value=58 Score=20.25 Aligned_cols=18 Identities=0% Similarity=-0.073 Sum_probs=13.0
Q ss_pred cCCCCeEEEEEEEEeCCC
Q psy8618 10 REARSSHELVAEARDQGS 27 (232)
Q Consensus 10 ~E~~~~~~l~v~a~d~~~ 27 (232)
|.....|.+++.++|..+
T Consensus 53 y~~~G~y~v~l~v~d~~g 70 (81)
T cd00146 53 YTKPGTYTVTLTVTNAVG 70 (81)
T ss_pred cCCCcEEEEEEEEEeCCC
Confidence 455667888888888754
No 39
>PF02494 HYR: HYR domain; InterPro: IPR003410 This domain is known as the HYR (Hyalin Repeat) domain, after the protein hyalin that is composed exclusively of this repeat. This domain probably corresponds to a new superfamily in the immunoglobulin fold. The function of this domain is uncertain it may be involved in cell adhesion. In the Sushi repeat-containing protein (SrpX), this domain is found between two sushi repeats.
Probab=47.30 E-value=52 Score=20.76 Aligned_cols=24 Identities=17% Similarity=0.292 Sum_probs=13.7
Q ss_pred eEEEEEEEEeCCCCCceeEEEEEEEE
Q psy8618 15 SHELVAEARDQGSPPRSSRVAVKISV 40 (232)
Q Consensus 15 ~~~l~v~a~d~~~~~~~~~~~v~v~V 40 (232)
.+.++..|+|..+ .++.+.+.|.|
T Consensus 58 ~t~V~ytA~D~~G--N~a~C~f~V~V 81 (81)
T PF02494_consen 58 TTTVTYTATDAAG--NSATCSFTVTV 81 (81)
T ss_pred eEEEEEEEEECCC--CEEEEEEEEEC
Confidence 5566666666544 44555555543
No 40
>PF03160 Calx-beta: Calx-beta domain; InterPro: IPR003644 The calx-beta motif is present as a tandem repeat in the cytoplasmic domains of Calx Na-Ca exchangers, which are used to expel calcium from cells. This motif overlaps domains used for calcium binding and regulation. The calx-beta motif is also present in the cytoplasmic tail of mammalian integrin-beta4, which mediates the bi-directional transfer of signals across the plasma membrane, as well as in some cyanobacterial proteins. This motif contains a series of beta-strands and turns that form a self-contained beta-sheet [, ].; GO: 0007154 cell communication, 0016021 integral to membrane; PDB: 3H6A_B 3FSO_A 3FQ4_B 2DPK_A 2QVM_A 3GIN_B 2QVK_A 2FWU_A 2FWS_A 3E9U_A ....
Probab=46.28 E-value=83 Score=20.60 Aligned_cols=52 Identities=19% Similarity=0.283 Sum_probs=28.3
Q ss_pred EEEEEeCCCCCCeecCCCceEEEEecCCCCCcEEEEEEEEeCCCCCCcEEEEEEeeC
Q psy8618 37 KISVTDVNDNAPELVDPQEDVISVREEQPPGTEVVRARAVDKDDGSNATISYSILKG 93 (232)
Q Consensus 37 ~v~V~dvNd~~P~f~~~~~~~~~v~E~~~~g~~v~~v~a~D~D~~~~~~v~y~i~~~ 93 (232)
+|.|.| ||.+ .+... .....+.|+.. ..-..+.-...+....-.+.|....+
T Consensus 2 tvtI~d-~d~~-~v~f~-~~~~~v~E~~~--~~~v~V~~~~~~~~~~v~v~~~~~~g 53 (100)
T PF03160_consen 2 TVTILD-DDDP-TVSFS-SPSYTVSEGDG--TVTVTVTRSGGSLDGPVTVNYSTVDG 53 (100)
T ss_dssp EEEEE--TTSE-EEEES-SSEEEEETTSS--EEEEEEEEESS-TSSEEEEEEEEEES
T ss_pred EEEEEC-CCCC-EEEEe-CCEEEEEeCCC--EEEEEEEEcccCCCcceEEEEEEeCC
Confidence 466777 6644 77665 34678888874 34444444433322334466666555
No 41
>PF10365 DUF2436: Domain of unknown function (DUF2436); InterPro: IPR018832 Gingipains R and K are endopeptidases with specificity for arginyl and lysyl bonds, respectively. Like other cysteine peptidases, they require reducing conditions for activity. They are maximally active at approximately neutral pH. Gingipains R and K are secreted by the bacterium Porphyromonas gingivalis (Bacteroides gingivalis). The bacterium is a major pathogen in periodontal disease, and the many ways in which the activities of the gingipains may contribute to the disease processes have been reviewed []. These enzymes are also involved in the hemagglutinating activity of the organisms. This entry represents a central region found in gingipain K peptidases, active on lysyl bonds; they belong to the MEROPS peptidase family C25 (gingipain family, clan CD).
Probab=40.39 E-value=1.2e+02 Score=21.73 Aligned_cols=49 Identities=16% Similarity=0.351 Sum_probs=27.0
Q ss_pred EEEEEEccCCCCCCcEEEECCceEEEEee-cCCCccccEEEEEEEEEECCc
Q psy8618 180 MVNYTLGESPSRTNHFYMKSVSGEICISQ-DLDFESRSSYEFPVVATDRGK 229 (232)
Q Consensus 180 ~i~y~l~~~~~~~~~f~i~~~~G~i~~~~-~ld~e~~~~y~l~V~a~D~g~ 229 (232)
...|.|... .......|....|.=..+. ..-+|...+|++++.+..+|-
T Consensus 108 ~YDy~I~~P-~~~~kiwIaGd~g~~~tr~dDy~fEAGKtY~ftm~~~g~gD 157 (161)
T PF10365_consen 108 TYDYCIAAP-QPGGKIWIAGDGGDGPTRGDDYVFEAGKTYRFTMKRVGSGD 157 (161)
T ss_pred eeEEEEecC-CCCCeEEEecCCCCCCccccceEEecCCEEEEEEEeccCCC
Confidence 335666654 3344555543333222211 133477789999999887763
No 42
>cd02848 Chitinase_N_term Chitinase N-terminus domain. Chitinases hydrolyze the abundant natural biopolymer chitin, producing smaller chito-oligosaccharides. Chitin consists of multiple N-acetyl-D-glucosamine (NAG) residues connected via beta-1,4-glycosidic linkages and is an important structural element of fungal cell wall and arthropod exoskeletons. On the basis of the mode of chitin hydrolysis, chitinases are classified as random, endo-, and exo-chitinases and based on sequence criteria, chitinases belong to families 18 and 19 of glycosyl hydrolases. The N-terminus of chitinase may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitob
Probab=38.46 E-value=1.2e+02 Score=20.62 Aligned_cols=33 Identities=15% Similarity=0.122 Sum_probs=22.3
Q ss_pred cCCccCCCEEEEEEEEEeCCCCCcceeEEEEEEEE
Q psy8618 116 VLDHEDRSIYRIAVAATDNGFPSRQTVRLLRVEVL 150 (232)
Q Consensus 116 ~ld~e~~~~~~l~v~a~d~~~~~~~~~~~v~i~v~ 150 (232)
.+++.+.+.|...|.++|...-..++. +.|.|-
T Consensus 73 t~~v~kgG~y~m~V~lCn~dGCS~S~~--~~I~VA 105 (106)
T cd02848 73 TFKVGKGGRYQMQVALCNGDGCSTSAA--KEIVVA 105 (106)
T ss_pred EEEeCCCCeEEEEEEEECCCCccCcCC--EEEEec
Confidence 355667788999999999876444443 444443
No 43
>PF05895 DUF859: Siphovirus protein of unknown function (DUF859); InterPro: IPR008577 This entry is represented by Streptococcus phage 7201, Orf39. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This family consists of several uncharacterised proteins from a number of the Siphoviruses as well as some bacterial proteins from Streptococcus species. Some of the members of this family are described as putative minor structural proteins.
Probab=37.10 E-value=3.4e+02 Score=25.02 Aligned_cols=120 Identities=19% Similarity=0.181 Sum_probs=0.0
Q ss_pred eEEEEEEEEeCCCCCceeEEEEEEEEEeCCCCCCeecCCCceEEEEecCCCCCcEEEEEE--EEeCCCCCCcEEEEEEee
Q psy8618 15 SHELVAEARDQGSPPRSSRVAVKISVTDVNDNAPELVDPQEDVISVREEQPPGTEVVRAR--AVDKDDGSNATISYSILK 92 (232)
Q Consensus 15 ~~~l~v~a~d~~~~~~~~~~~v~v~V~dvNd~~P~f~~~~~~~~~v~E~~~~g~~v~~v~--a~D~D~~~~~~v~y~i~~ 92 (232)
...+++.++|..+ ..+...+.+|+|++- ++|.+... .+...-.++...-+.-+.+. ..+.-......+.|+...
T Consensus 299 ~~Ti~atVtDSRG-r~S~~~~~tItVl~Y--~~P~lsfs-v~R~~~~~~~~~v~~~a~Iapl~v~g~qKN~~~lt~~~a~ 374 (624)
T PF05895_consen 299 SATIRATVTDSRG-RTSDPKTKTITVLEY--SPPTLSFS-VYRCGSSGNTLTVTRNAKIAPLTVNGVQKNTMTLTFKVAP 374 (624)
T ss_pred eEEEEEEEEECCC-ccCCceEEEEEEEEc--CCCcEEEE-EEEeCCCCcEEEEEEEEEEeEEEEcccccceEEEEEEEEE
Q ss_pred CCCCCCcccEEEeCCceE-------------EEECccCCccCCCEEEEEEEEEeCCCCCcceeEEEEEEE
Q psy8618 93 GRDTDGYGVFTIDPISGV-------------IRTKSVLDHEDRSIYRIAVAATDNGFPSRQTVRLLRVEV 149 (232)
Q Consensus 93 ~~~~~~~~~f~id~~tG~-------------i~~~~~ld~e~~~~~~l~v~a~d~~~~~~~~~~~v~i~v 149 (232)
-....|.+|..... ..+...+| ....|.+.+.++|. -....+...|
T Consensus 375 ----~gt~~~t~d~~~a~~~~s~~s~~~~~~~~L~g~y~--~~kSy~V~~~l~D~-----F~s~t~~~~V 433 (624)
T PF05895_consen 375 ----LGTGTFTTDNGSASGTWSSISELTNSSANLGGTYD--AEKSYDVRGTLSDK-----FTSTTFTVTV 433 (624)
T ss_pred ----cCcceEEEEccccccceeeeeeecccceeeccccC--CCceEEEEEEEEEE-----eeeEEEEEEc
No 44
>PF13860 FlgD_ig: FlgD Ig-like domain; PDB: 3C12_A 3OSV_A.
Probab=34.53 E-value=1.2e+02 Score=19.14 Aligned_cols=14 Identities=50% Similarity=0.809 Sum_probs=11.6
Q ss_pred ccEEEEEEEEEECC
Q psy8618 215 RSSYEFPVVATDRG 228 (232)
Q Consensus 215 ~~~y~l~V~a~D~g 228 (232)
.+.|.+.|.|+|.|
T Consensus 68 ~G~Y~~~v~a~~~g 81 (81)
T PF13860_consen 68 DGTYTFRVTATDGG 81 (81)
T ss_dssp SEEEEEEEEEEET-
T ss_pred CCCEEEEEEEEeCC
Confidence 46899999999987
No 45
>PF09100 Qn_am_d_aIV: Quinohemoprotein amine dehydrogenase, alpha subunit domain IV; InterPro: IPR015184 This domain is predominantly found in the prokaryotic protein quinohemoprotein amine dehydrogenase, adopting an immunoglobulin-like beta-sandwich fold, with seven strands arranged into two beta sheets; the fold is possibly related to the immunoglobulin and/or fibronectin type III superfamilies. The precise function of this domain has not, as yet, been defined []. ; PDB: 1JMZ_A 1JMX_A 1PBY_A 1JJU_A.
Probab=32.15 E-value=1.4e+02 Score=21.03 Aligned_cols=31 Identities=23% Similarity=0.364 Sum_probs=15.4
Q ss_pred EEEEEEEe-CCCCCceeEEEEEEEEEeCCCCCC
Q psy8618 17 ELVAEARD-QGSPPRSSRVAVKISVTDVNDNAP 48 (232)
Q Consensus 17 ~l~v~a~d-~~~~~~~~~~~v~v~V~dvNd~~P 48 (232)
.|.|.|+= .++.+++....+.|+|..-|+ +|
T Consensus 101 nl~VvAtv~d~~~~l~~e~~liVtVqr~~~-pp 132 (133)
T PF09100_consen 101 NLKVVATVKDGGKPLTGEAHLIVTVQRWNN-PP 132 (133)
T ss_dssp EEEEEEEETTTT---EEEEEEEEE---S----S
T ss_pred cEEEEEEEccCCcccceeEeEEEEeecccC-CC
Confidence 45566653 234578899999999998888 66
No 46
>PF00801 PKD: PKD domain; InterPro: IPR000601 The PKD (Polycystic Kidney Disease) domain was first identified in the Polycystic Kidney Disease protein, polycystin-1 (PDK1 gene), and contains an Ig-like fold consisting of a beta-sandwich of seven strands in two sheets with a Greek key topology, although some members have additional strands []. Polycystin-1 is a large cell-surface glycoprotein involved in adhesive protein-protein and protein-carbohydrate interactions; however it is not clear if the PKD domain mediates any of these interactions. PKD domains are also found in other proteins, usually in the extracellular parts of proteins involved in interactions with other proteins. For example, domains with a PKD-type fold are found in archaeal surface layer proteins that protect the cell from extreme environments [], and in the human VPS10 domain-containing receptor SorCS2 [].; PDB: 1B4R_A 2KZW_A 2C4X_A 2C26_A 2Y72_B 3JQU_A 3JS7_B 1WGO_A 1L0Q_A.
Probab=29.22 E-value=1.3e+02 Score=17.99 Aligned_cols=18 Identities=17% Similarity=0.327 Sum_probs=14.5
Q ss_pred CccccEEEEEEEEEECCc
Q psy8618 212 FESRSSYEFPVVATDRGK 229 (232)
Q Consensus 212 ~e~~~~y~l~V~a~D~g~ 229 (232)
|...+.|.+.|.|+|.-.
T Consensus 48 y~~~G~y~V~ltv~n~~g 65 (69)
T PF00801_consen 48 YSSPGTYTVTLTVTNGVG 65 (69)
T ss_dssp ESSSEEEEEEEEEEETTS
T ss_pred cCCCeEEEEEEEEEECCC
Confidence 456789999999999753
No 47
>PF13754 Big_3_4: Bacterial Ig-like domain (group 3)
Probab=26.80 E-value=1.4e+02 Score=17.29 Aligned_cols=17 Identities=35% Similarity=0.376 Sum_probs=13.7
Q ss_pred CCCEEEEEEEEEeCCCC
Q psy8618 121 DRSIYRIAVAATDNGFP 137 (232)
Q Consensus 121 ~~~~~~l~v~a~d~~~~ 137 (232)
..+.|.+.+.|+|....
T Consensus 22 ~dG~y~itv~a~D~AGN 38 (54)
T PF13754_consen 22 ADGTYTITVTATDAAGN 38 (54)
T ss_pred CCccEEEEEEEEeCCCC
Confidence 45789999999997553
No 48
>COG3354 FlaG Putative archaeal flagellar protein G [Cell motility and secretion]
Probab=26.16 E-value=2.6e+02 Score=20.29 Aligned_cols=54 Identities=13% Similarity=0.109 Sum_probs=32.2
Q ss_pred EEeCCCCCcce---eEEEEEEEEecCCCCCeeeCCceEEEEEeecCCCCCCeEEEEEEcc
Q psy8618 131 ATDNGFPSRQT---VRLLRVEVLDLNDNRPTFTSSSLLFKVSASDPDCGVNAMVNYTLGE 187 (232)
Q Consensus 131 a~d~~~~~~~~---~~~v~i~v~d~Nd~~P~f~~~~~~~~v~~~D~d~~~~~~i~y~l~~ 187 (232)
..|.+.++... ..++++.|.+.-+-++.|......+-|.. .......+.|.-..
T Consensus 55 IndPg~i~~~~~~g~~t~t~yiKNtG~~~~~fd~~sitVliDG---~iv~~a~~~~~~~~ 111 (154)
T COG3354 55 INDPGQIPYVGTDGPYTYTFYIKNTGSDSIAFDNTSITVLIDG---NIVTPAYVTFTSVN 111 (154)
T ss_pred ecCCCCCccccCCCceEEEEEEecCCCcccccCCCeEEEEEcC---cEeccceEEEEecC
Confidence 34555554433 34677888888777888887775555532 33445555665543
No 49
>PF14302 DUF4377: Domain of unknown function (DUF4377)
Probab=26.12 E-value=1.2e+02 Score=19.39 Aligned_cols=25 Identities=16% Similarity=0.288 Sum_probs=19.7
Q ss_pred ecCCCccccEEEEEEEEEECCccCC
Q psy8618 208 QDLDFESRSSYEFPVVATDRGKETQ 232 (232)
Q Consensus 208 ~~ld~e~~~~y~l~V~a~D~g~~~q 232 (232)
..++||..-.|.|.|..+.--+|||
T Consensus 40 eGF~yE~Gy~Y~L~Vk~~~~~nppa 64 (80)
T PF14302_consen 40 EGFEYEPGYEYVLRVKRTPVANPPA 64 (80)
T ss_pred cCcCcCCCcEEEEEEEEEECCCCCC
Confidence 4568888888999988887777765
No 50
>PF08329 ChitinaseA_N: Chitinase A, N-terminal domain; InterPro: IPR013540 This domain is found in a number of bacterial chitinases and similar viral proteins. It is organised into a fibronectin III module domain-like fold, comprising only beta strands. Its function is not known, but it may be involved in interaction with the enzyme substrate, chitin [, ]. It is separated by a hinge region from the catalytic domain (IPR001223 from INTERPRO); this hinge region is probably mobile, allowing the N-terminal domain to have different relative positions in solution []. ; GO: 0004568 chitinase activity; PDB: 2WLY_A 1EDQ_A 2WM0_A 1X6N_A 1NH6_A 2WK2_A 1EHN_A 2WLZ_A 1EIB_A 1FFR_A ....
Probab=25.00 E-value=1.6e+02 Score=20.96 Aligned_cols=35 Identities=14% Similarity=0.190 Sum_probs=21.2
Q ss_pred CcCCCCeEEEEEEEEeCCCCCceeEEEEEEEEEeCCC
Q psy8618 9 DREARSSHELVAEARDQGSPPRSSRVAVKISVTDVND 45 (232)
Q Consensus 9 D~E~~~~~~l~v~a~d~~~~~~~~~~~v~v~V~dvNd 45 (232)
....--.|.++|.+.+..+ -+....+.|.|.|.+=
T Consensus 78 ~~~~gG~y~~~VeLCN~~G--CS~S~~~~V~VaDTDG 112 (133)
T PF08329_consen 78 TVTKGGRYQMQVELCNADG--CSTSAPVEVVVADTDG 112 (133)
T ss_dssp EE-S-EEEEEEEEEEETTE--EEE---EEEEEE-TTS
T ss_pred EecCCCEEEEEEEEECCCC--cccCCCEEEEEeCCCc
Confidence 3445568999999998776 3445577778887754
No 51
>PF03413 PepSY: Peptidase propeptide and YPEB domain This Prosite motif covers only the active site. This is family M4 in the peptidase classification. ; InterPro: IPR005075 This signature, PepSY, is found in the propeptide of members of the MEROPS peptidase family M4 (clan MA(E)), which contains the thermostable thermolysins (3.4.24.27 from EC), and related thermolabile neutral proteases (bacillolysins) (3.4.24.28 from EC) from various species of Bacillus. It is also in many non-peptidase proteins, including Bacillus subtilis YpeB protein - a regulator of SleB spore cortex lytic enzyme - and a large number of eubacterial and archaeal cell wall-associated and secreted proteins which are mostly annotated as 'hypothetical protein'. Many extracellular bacterial proteases are produced as proenzymes. The propeptides usually have a dual function, i.e. they function as an intramolecular chaperone required for the folding of the polypeptide and as an inhibitor preventing premature activation of the enzyme. Analysis of the propeptide region of the M4 family of peptidases reveals two regions of conservation, the PepSY domain and a second domain, proximate to the N terminus, the FTP domain (IPR011096 from INTERPRO), which is also found in isolation in the propeptide of eukaryotic peptidases belong to MEROPS peptidase family M36. Propeptide domain swapping experiments, for example swapping the propeptide domain of PA protease with that of vibrolysin, both propeptides contain the FTP and PepSY domains, allows the PA protease domain to fold correctly and inhibits the C-terminal autoprocessing activity. However, swapping the propeptide of PA protease for the thermolysin propeptide, does not facilitate the correct folding nor the processing of the chimaeric protein into an active peptidase []. Mutational analysis of the Pseudomonas aeruginosa elastase gene revealed two mutations in the propeptide which resulted in the loss of inhibitory activity but not chaperone activity: A-15V and T-153I (where +1 is defined as the first residue of the mature peptidase). Both mutations resulted in peptidase activity, the T-153V mutation being much less effective than the A-15I mutation [] in activating peptidase activity. The T-153V mutation lies N-terminal to the FTP domain while the A-15I mutation is C-terminal to the PepSY domain. Given the diverse range of other proteins, both domains occur in in isolation, the exact function of each is still unclear; though it has been proposed that the PepSY domain primarily has inhibitory activity and in conjunction with the FTP domain in chaperone activity. ; GO: 0008237 metallopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0005576 extracellular region; PDB: 2GU3_A 3NQZ_A 3NQY_A 2KGY_A.
Probab=24.64 E-value=1.4e+02 Score=17.19 Aligned_cols=14 Identities=21% Similarity=0.221 Sum_probs=8.4
Q ss_pred ccEEEeCCceEEEE
Q psy8618 100 GVFTIDPISGVIRT 113 (232)
Q Consensus 100 ~~f~id~~tG~i~~ 113 (232)
..+.||+.||.|.-
T Consensus 49 ~~v~VDa~tG~Il~ 62 (64)
T PF03413_consen 49 YEVYVDAYTGEILS 62 (64)
T ss_dssp EEEEEETTT--EEE
T ss_pred EEEEEECCCCeEEE
Confidence 44559999998753
No 52
>PF13753 SWM_repeat: Putative flagellar system-associated repeat
Probab=23.77 E-value=4.1e+02 Score=21.74 Aligned_cols=107 Identities=19% Similarity=0.254 Sum_probs=51.9
Q ss_pred CCeEEEEEEEEeCCCCCceeEEEEEEEEEeCCCCCCeecCCCceEEEEecCCC------CCcEEEEEEEEeCCCCCCcEE
Q psy8618 13 RSSHELVAEARDQGSPPRSSRVAVKISVTDVNDNAPELVDPQEDVISVREEQP------PGTEVVRARAVDKDDGSNATI 86 (232)
Q Consensus 13 ~~~~~l~v~a~d~~~~~~~~~~~v~v~V~dvNd~~P~f~~~~~~~~~v~E~~~------~g~~v~~v~a~D~D~~~~~~v 86 (232)
-..|.+.+.++|..+.... ....+.|- -.+|..... .+.++.. .........+.+.+ .+..+
T Consensus 11 d~~~~v~vt~tD~aGN~~~--~t~~~~vD---t~~P~v~i~-----~~~~~~~~~~~~~~~~~t~s~tvs~~~--~g~~v 78 (317)
T PF13753_consen 11 DGTYTVSVTVTDAAGNTST--ATQSITVD---TTAPTVTIT-----SIADDDIINGDEATNTVTFSGTVSGAE--PGSTV 78 (317)
T ss_pred CCcEEEEEEEEeCCCCeee--eeEEEEEe---cCCCceeee-----cccCCCccccceeeeeeEEEEEecCCC--CCCEE
Confidence 4678999999998775443 33333322 226633221 1111111 12223444444443 34456
Q ss_pred EEEEeeCCCCCCcccEEEeCCceEEEECccC-CccCCCEEEEEEE-EEeCCCC
Q psy8618 87 SYSILKGRDTDGYGVFTIDPISGVIRTKSVL-DHEDRSIYRIAVA-ATDNGFP 137 (232)
Q Consensus 87 ~y~i~~~~~~~~~~~f~id~~tG~i~~~~~l-d~e~~~~~~l~v~-a~d~~~~ 137 (232)
...+ .+ ....+..+ ..|.....-.. +.-..+.|.+.+. ++|....
T Consensus 79 ~v~~-~g----~~~t~~~~-~~G~ws~t~~~~~~l~~g~~ti~v~~~tD~aGN 125 (317)
T PF13753_consen 79 TVTI-NG----TTGTLTAD-ADGNWSVTVTPSDDLPDGDYTITVTTVTDAAGN 125 (317)
T ss_pred EEEE-CC----EEEEEEEe-cCCcEEEeeccccccccCcceeEEEEEEccCCc
Confidence 6666 22 12223333 35543332221 1223457889998 9997553
Done!