Query psy11061
Match_columns 212
No_of_seqs 141 out of 1430
Neff 9.7
Searched_HMMs 46136
Date Fri Aug 16 17:05:12 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy11061.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/11061hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG4289|consensus 100.0 5.3E-39 1.1E-43 282.8 18.4 207 2-210 196-408 (2531)
2 KOG4289|consensus 100.0 4.6E-38 1E-42 276.9 21.6 207 1-210 303-512 (2531)
3 KOG1219|consensus 100.0 5.3E-34 1.2E-38 258.4 22.0 207 1-212 987-1197(4289)
4 KOG1219|consensus 100.0 5.1E-33 1.1E-37 252.1 22.3 204 1-211 2109-2316(4289)
5 cd00031 CA Cadherin repeat dom 100.0 2.2E-28 4.7E-33 185.9 24.4 165 1-168 31-199 (199)
6 cd00031 CA Cadherin repeat dom 99.9 2.3E-23 5.1E-28 158.1 19.2 132 76-209 2-135 (199)
7 PF00028 Cadherin: Cadherin do 99.8 3.8E-17 8.2E-22 109.3 14.4 90 76-167 1-93 (93)
8 KOG1834|consensus 99.8 6.6E-17 1.4E-21 135.8 17.1 158 3-169 71-245 (952)
9 smart00112 CA Cadherin repeats 99.7 1.1E-15 2.3E-20 99.0 10.8 77 96-174 1-79 (79)
10 smart00112 CA Cadherin repeats 99.6 4.1E-15 8.9E-20 96.3 9.7 68 1-69 10-79 (79)
11 KOG1834|consensus 99.4 6.3E-12 1.4E-16 106.3 13.5 148 58-211 20-184 (952)
12 PF00028 Cadherin: Cadherin do 99.2 1.1E-10 2.4E-15 77.8 9.4 61 1-62 30-93 (93)
13 PF08758 Cadherin_pro: Cadheri 97.5 0.0025 5.4E-08 41.9 9.9 79 68-155 3-82 (90)
14 smart00736 CADG Dystroglycan-t 97.2 0.015 3.3E-07 38.7 11.0 69 96-171 25-96 (97)
15 PF08266 Cadherin_2: Cadherin- 97.2 0.0007 1.5E-08 43.9 4.2 60 75-137 2-63 (84)
16 TIGR01965 VCBS_repeat VCBS rep 97.1 0.016 3.5E-07 38.6 10.2 88 91-189 2-98 (99)
17 PF13750 Big_3_3: Bacterial Ig 97.0 0.075 1.6E-06 38.8 15.9 127 36-167 14-148 (158)
18 smart00736 CADG Dystroglycan-t 97.0 0.013 2.9E-07 39.0 9.3 63 2-66 32-96 (97)
19 PF08266 Cadherin_2: Cadherin- 96.2 0.016 3.4E-07 37.6 5.0 31 4-35 36-66 (84)
20 PF05345 He_PIG: Putative Ig d 94.2 0.35 7.6E-06 27.8 6.1 37 115-153 12-49 (49)
21 PF07495 Y_Y_Y: Y_Y_Y domain; 93.8 0.86 1.9E-05 27.5 8.4 59 103-166 7-65 (66)
22 PF08758 Cadherin_pro: Cadheri 92.4 1 2.2E-05 29.5 6.8 37 14-52 46-82 (90)
23 PF05345 He_PIG: Putative Ig d 92.0 1.1 2.4E-05 25.7 5.9 46 3-50 1-49 (49)
24 TIGR01965 VCBS_repeat VCBS rep 88.1 6 0.00013 26.4 8.2 65 12-83 25-97 (99)
25 TIGR00845 caca sodium/calcium 86.9 20 0.00044 33.7 12.7 49 64-114 395-443 (928)
26 TIGR03660 T1SS_rpt_143 T1SS-14 86.2 5.2 0.00011 28.5 6.9 56 24-83 69-127 (137)
27 PF07495 Y_Y_Y: Y_Y_Y domain; 84.6 6.5 0.00014 23.5 6.7 53 3-61 11-65 (66)
28 PF12245 Big_3_2: Bacterial Ig 82.5 7.5 0.00016 23.2 5.6 31 36-66 22-52 (60)
29 PF05895 DUF859: Siphovirus pr 79.7 47 0.001 30.0 13.1 109 36-153 297-425 (624)
30 KOG4221|consensus 79.4 21 0.00045 34.5 9.6 56 12-70 563-620 (1381)
31 KOG3597|consensus 76.4 49 0.0011 28.5 14.4 97 53-152 24-135 (442)
32 TIGR00845 caca sodium/calcium 75.5 75 0.0016 30.1 16.4 56 55-114 515-570 (928)
33 cd02848 Chitinase_N_term Chiti 75.5 6.1 0.00013 26.7 4.0 33 31-63 74-106 (106)
34 PF13754 Big_3_4: Bacterial Ig 75.0 14 0.0003 21.5 5.4 19 138-156 22-40 (54)
35 PF08329 ChitinaseA_N: Chitina 74.4 28 0.0006 24.6 7.3 32 138-171 81-112 (133)
36 PF12245 Big_3_2: Bacterial Ig 73.1 17 0.00036 21.7 5.2 31 138-170 21-51 (60)
37 PF03160 Calx-beta: Calx-beta 69.4 29 0.00062 22.7 8.3 53 58-114 2-54 (100)
38 PF13753 SWM_repeat: Putative 68.7 62 0.0013 26.3 12.4 110 36-157 11-128 (317)
39 cd00146 PKD polycystic kidney 68.7 15 0.00033 22.8 4.6 30 31-60 51-80 (81)
40 TIGR03660 T1SS_rpt_143 T1SS-14 66.1 45 0.00097 23.7 7.2 43 141-189 86-128 (137)
41 PF02494 HYR: HYR domain; Int 65.2 16 0.00035 22.9 4.2 26 36-61 56-81 (81)
42 cd02848 Chitinase_N_term Chiti 65.1 27 0.00059 23.6 5.3 29 138-168 78-106 (106)
43 smart00089 PKD Repeats in poly 62.6 26 0.00055 21.6 4.8 30 31-61 49-78 (79)
44 PF13750 Big_3_3: Bacterial Ig 62.4 25 0.00054 25.6 5.2 30 33-62 119-148 (158)
45 smart00089 PKD Repeats in poly 61.2 25 0.00054 21.7 4.6 28 136-166 51-78 (79)
46 cd00146 PKD polycystic kidney 56.9 26 0.00056 21.7 4.1 20 136-155 53-72 (81)
47 KOG4221|consensus 56.8 2.1E+02 0.0045 28.3 12.8 67 106-175 552-620 (1381)
48 PF08329 ChitinaseA_N: Chitina 47.4 25 0.00054 24.9 3.0 37 30-66 76-112 (133)
49 TIGR00864 PCC polycystin catio 39.9 5.4E+02 0.012 28.2 14.4 30 32-62 1479-1508(2740)
50 COG4288 Uncharacterized protei 33.6 1.5E+02 0.0033 20.1 5.8 60 36-98 42-108 (124)
51 KOG3597|consensus 28.0 1E+02 0.0022 26.6 4.2 50 158-207 24-73 (442)
52 PF14157 YmzC: YmzC-like prote 27.8 44 0.00096 20.2 1.5 18 13-30 41-58 (63)
53 PF13860 FlgD_ig: FlgD Ig-like 26.5 1.7E+02 0.0037 18.3 4.8 15 138-152 67-81 (81)
54 COG1470 Predicted membrane pro 25.0 4.8E+02 0.01 22.9 13.8 72 125-203 386-470 (513)
55 cd04046 C2_Calpain C2 domain p 24.7 2.3E+02 0.005 19.2 6.6 17 186-202 107-123 (126)
56 PF09100 Qn_am_d_aIV: Quinohem 23.3 1.3E+02 0.0028 21.1 3.3 32 37-69 100-132 (133)
57 PF07145 PAM2: Ataxin-2 C-term 23.0 51 0.0011 14.6 0.9 11 64-74 6-16 (18)
58 TIGR03769 P_ac_wall_RPT actino 23.0 1.4E+02 0.0031 16.2 3.7 14 136-149 9-22 (41)
59 cd01758 PLAT_LPL PLAT/ LH2 dom 21.2 3.1E+02 0.0067 19.4 8.3 66 38-108 2-68 (137)
60 PF15418 DUF4625: Domain of un 21.2 3.1E+02 0.0066 19.3 8.7 19 138-156 105-123 (132)
61 KOG3865|consensus 20.9 4.5E+02 0.0097 21.8 6.3 63 14-80 186-251 (402)
No 1
>KOG4289|consensus
Probab=100.00 E-value=5.3e-39 Score=282.79 Aligned_cols=207 Identities=29% Similarity=0.453 Sum_probs=196.2
Q ss_pred eEEEEcC--CCCCcCceEEeCCceEEEECCcCCcCCCCeEEEEEEEEeCCC--CceEEEEEEEEeecCCCCCccCCCceE
Q psy11061 2 VNYTLGE--SPSRTNHFYMKSVSGEICIAQDLDFESRSSYEFPVVATDRGG--LSTTAMVRIQITDVNDNEPIFNPVEYN 77 (212)
Q Consensus 2 i~Y~i~~--~~~~~~~F~id~~tg~i~~~~~lD~E~~~~~~l~i~a~d~~~--~~~~~~v~i~V~d~nd~~P~f~~~~~~ 77 (212)
+.|++.. ++.+.++|+||+.+|.|++++.||||..+.+.|.|.|+|.+. .+++++++|.|.|.|||+|+|.+..|.
T Consensus 196 l~Ysm~al~dsRS~~lFslD~~sG~irta~~lDREt~e~HvlrVtA~d~~~P~~SAtttv~V~V~D~nDhsPvFEq~~Y~ 275 (2531)
T KOG4289|consen 196 LYYSMVALFDSRSQNLFSLDPMSGAIRTAKSLDRETKETHVLRVTAQDHGDPRRSATTTVTVLVLDTNDHSPVFEQDEYR 275 (2531)
T ss_pred eEEEeeeccchhccccEeeccccccchhhhhhhhhhhheeEEEEEeeecCCCcccceeEEEEEEeecCCCCcccchhHHH
Confidence 6788833 335679999999999999999999999999999999999983 789999999999999999999999999
Q ss_pred EEEecCCCCCcEEEEEEEEeCCCCCCceEEEEEEcCCCCCcEEEeCCccEEEEeeecCCC--CCCeEEEEEEEEECCCCC
Q psy11061 78 VSLRDDIQTTTAFAVVLATDRDSDRFGTISYKIVTGKDANLFRIDRSSGELFVTRGNFLS--RSNSYHINVSAMDGGGNK 155 (212)
Q Consensus 78 ~~v~e~~~~g~~v~~~~a~D~D~~~~~~i~y~i~~~~~~~~F~id~~~G~i~~~~~~~ld--~~~~~~l~v~a~D~~~~~ 155 (212)
-++.|+.++|..|+++.|+|.|.++|+.|.|++..|++...|.|++.+|.|++..+ +| ....|.|.|.|+|+|.++
T Consensus 276 e~lREn~evGy~vLtvrAtD~Dsp~Nani~Yrl~eg~~~~~f~in~rSGvI~T~a~--lDRE~~~~y~L~VeAsDqG~~p 353 (2531)
T KOG4289|consen 276 EELRENLEVGYEVLTVRATDGDSPPNANIRYRLLEGNAKNVFEINPRSGVISTRAP--LDREELESYQLDVEASDQGRPP 353 (2531)
T ss_pred HHHhhccccCceEEEEEeccCCCCCCCceEEEecCCCccceeEEcCccceeeccCc--cCHHhhhheEEEEEeccCCCCC
Confidence 99999999999999999999999999999999999988899999999999999999 88 899999999999999887
Q ss_pred eeeeEEEEEEEeeCCCCCCeEecCceEEEEeCCCCCCeEEEEEEEEeCCCCCCee
Q psy11061 156 CSQDAQVNINIINSHMPIPLFQQSTYSFVVPEDVFKNSIVGTIKAATSDSGKFTE 210 (212)
Q Consensus 156 ~~~~~~v~I~v~d~nd~~P~f~~~~~~~~v~e~~~~g~~v~~v~a~D~D~~~~~~ 210 (212)
...+++|.|+|.|+|||+|+|....|.+.|.|+..++++|++|+|+|+|-|.|+.
T Consensus 354 gp~Ta~V~itV~D~NDNaPqFse~~Yvvqv~Edvt~~avvlrV~AtDrD~g~Ng~ 408 (2531)
T KOG4289|consen 354 GPRTAMVEITVEDENDNAPQFSEKRYVVQVREDVTPPAVVLRVTATDRDKGTNGK 408 (2531)
T ss_pred CCceEEEEEEEEecCCCCccccccceEEEecccCCCCceEEEEEecccCCCcCce
Confidence 7779999999999999999999999999999999999999999999999999884
No 2
>KOG4289|consensus
Probab=100.00 E-value=4.6e-38 Score=276.90 Aligned_cols=207 Identities=32% Similarity=0.479 Sum_probs=196.7
Q ss_pred CeEEEEcCCCCCcCceEEeCCceEEEECCcCCcCCCCeEEEEEEEEeCCC--CceEEEEEEEEeecCCCCCccCCCceEE
Q psy11061 1 MVNYTLGESPSRTNHFYMKSVSGEICIAQDLDFESRSSYEFPVVATDRGG--LSTTAMVRIQITDVNDNEPIFNPVEYNV 78 (212)
Q Consensus 1 ~i~Y~i~~~~~~~~~F~id~~tg~i~~~~~lD~E~~~~~~l~i~a~d~~~--~~~~~~v~i~V~d~nd~~P~f~~~~~~~ 78 (212)
.|+|++.+++ ..+.|+|++.+|.|++..+||||+.+.|.|.|.|+|.|. ...+++|.|.|.|.|||+|+|....|.+
T Consensus 303 ni~Yrl~eg~-~~~~f~in~rSGvI~T~a~lDRE~~~~y~L~VeAsDqG~~pgp~Ta~V~itV~D~NDNaPqFse~~Yvv 381 (2531)
T KOG4289|consen 303 NIRYRLLEGN-AKNVFEINPRSGVISTRAPLDREELESYQLDVEASDQGRPPGPRTAMVEITVEDENDNAPQFSEKRYVV 381 (2531)
T ss_pred ceEEEecCCC-ccceeEEcCccceeeccCccCHHhhhheEEEEEeccCCCCCCCceEEEEEEEEecCCCCccccccceEE
Confidence 4899998886 568999999999999999999999999999999999983 3458999999999999999999999999
Q ss_pred EEecCCCCCcEEEEEEEEeCCCCCCceEEEEEEcCCCCCcEEEeCCccEEEEeeecCCC-CCCeEEEEEEEEECCCCCee
Q psy11061 79 SLRDDIQTTTAFAVVLATDRDSDRFGTISYKIVTGKDANLFRIDRSSGELFVTRGNFLS-RSNSYHINVSAMDGGGNKCS 157 (212)
Q Consensus 79 ~v~e~~~~g~~v~~~~a~D~D~~~~~~i~y~i~~~~~~~~F~id~~~G~i~~~~~~~ld-~~~~~~l~v~a~D~~~~~~~ 157 (212)
.|.|+..+++.|++++|+|.|.+.|+.+.|+|.+|+.-+.|.||..+|++.+..+ || +...|+++|+|.|++-|+.+
T Consensus 382 qv~Edvt~~avvlrV~AtDrD~g~Ng~VHYsi~Sgn~~G~f~id~~tGel~vv~p--lD~e~~~ytl~IrAqDggrPpLs 459 (2531)
T KOG4289|consen 382 QVREDVTPPAVVLRVTATDRDKGTNGKVHYSIASGNGRGQFYIDSLTGELDVVEP--LDFENSEYTLRIRAQDGGRPPLS 459 (2531)
T ss_pred EecccCCCCceEEEEEecccCCCcCceEEEEeeccCccccEEEecccceEEEecc--ccccCCeeEEEEEcccCCCCCcc
Confidence 9999999999999999999999999999999999999999999999999999999 99 77799999999999999988
Q ss_pred eeEEEEEEEeeCCCCCCeEecCceEEEEeCCCCCCeEEEEEEEEeCCCCCCee
Q psy11061 158 QDAQVNINIINSHMPIPLFQQSTYSFVVPEDVFKNSIVGTIKAATSDSGKFTE 210 (212)
Q Consensus 158 ~~~~v~I~v~d~nd~~P~f~~~~~~~~v~e~~~~g~~v~~v~a~D~D~~~~~~ 210 (212)
.+.-+.|+|.|+|||+|.|...++..+|-|+.+.|..++.+.|.|.|+|.++.
T Consensus 460 n~sgl~iqVlDINDhaPifvstpfq~tvlEnv~lg~~v~~vqaidadsg~na~ 512 (2531)
T KOG4289|consen 460 NTSGLVIQVLDINDHAPIFVSTPFQATVLENVPLGYLVCHVQAIDADSGENAR 512 (2531)
T ss_pred CCCceEEEEEecCCCCceeEechhhhhhhhcccccceEEEEecccCCCCcccc
Confidence 88888899999999999999999999999999999999999999999999874
No 3
>KOG1219|consensus
Probab=100.00 E-value=5.3e-34 Score=258.39 Aligned_cols=207 Identities=28% Similarity=0.393 Sum_probs=195.6
Q ss_pred CeEEEEcCCCCCcCceEEeCCceEEEECCcCCcCCCCeEEEEEEEEeCCC--CceEEEEEEEEeecCCCCCccCCCceEE
Q psy11061 1 MVNYTLGESPSRTNHFYMKSVSGEICIAQDLDFESRSSYEFPVVATDRGG--LSTTAMVRIQITDVNDNEPIFNPVEYNV 78 (212)
Q Consensus 1 ~i~Y~i~~~~~~~~~F~id~~tg~i~~~~~lD~E~~~~~~l~i~a~d~~~--~~~~~~v~i~V~d~nd~~P~f~~~~~~~ 78 (212)
.|.|+|..++ ..+.|.|+..+|.|++.++||||....|-|++.|.|.|. +++.+.+.|.|.|+|||+|+|....|..
T Consensus 987 ~l~Y~I~~gd-g~g~FsId~~tG~irTl~~lDrE~ks~YwltveA~D~gt~~~ssv~~vyI~ieDvNDn~Pq~s~pvy~a 1065 (4289)
T KOG1219|consen 987 ELSYKIRTGD-GDGIFSIDSTTGSIRTLKALDREKKSSYWLTVEAKDLGTVPLSSVCEVYIEIEDVNDNVPQFSSPVYYA 1065 (4289)
T ss_pred eEEEEEEcCC-cceeEEecCCcceEeechhhchhhcceEEEEEEEEecCCCccccceeEEEEEEecCCCCcccCCceEee
Confidence 3789997775 468999999999999999999999999999999999984 7889999999999999999999999999
Q ss_pred EEecCCCCCcEEEEEEEEeCCCCCCceEEEEEEcCCCCCcEEEeCCccEEEEeeecCCC--CCCeEEEEEEEEECCCCCe
Q psy11061 79 SLRDDIQTTTAFAVVLATDRDSDRFGTISYKIVTGKDANLFRIDRSSGELFVTRGNFLS--RSNSYHINVSAMDGGGNKC 156 (212)
Q Consensus 79 ~v~e~~~~g~~v~~~~a~D~D~~~~~~i~y~i~~~~~~~~F~id~~~G~i~~~~~~~ld--~~~~~~l~v~a~D~~~~~~ 156 (212)
+|.|+++.+..+.++.|.|+|...++++.|+|.+|++.+.|.||+.+|.|+..+. || ++..+.|.|.+.|.|.|+.
T Consensus 1066 sI~enSp~~vsivq~ea~D~Dsssn~kLmykI~sGnyq~FF~Id~~TG~iTt~r~--LDRE~qdEHiLeVTi~D~gep~l 1143 (4289)
T KOG1219|consen 1066 SISENSPETVSIVQAEANDPDSSSNQKLMYKITSGNYQGFFQIDPETGLITTIRR--LDREKQDEHILEVTIQDNGEPWL 1143 (4289)
T ss_pred eeccCCCCceEEEEeccCCCCcccCcceEEEEccCCccceEEEccccceeeeehh--hcccccccceEEEEEecCCCCcc
Confidence 9999999999999999999998889999999999999999999999999998888 88 8999999999999999999
Q ss_pred eeeEEEEEEEeeCCCCCCeEecCceEEEEeCCCCCCeEEEEEEEEeCCCCCCeecC
Q psy11061 157 SQDAQVNINIINSHMPIPLFQQSTYSFVVPEDVFKNSIVGTIKAATSDSGKFTEFS 212 (212)
Q Consensus 157 ~~~~~v~I~v~d~nd~~P~f~~~~~~~~v~e~~~~g~~v~~v~a~D~D~~~~~~~s 212 (212)
++.+.|.|.|.|+|||+|.|.+..+...|+|...+ .++++.|.|.|+|.++.++
T Consensus 1144 ~s~~rviV~IldvNdnsp~Flqk~~~~~v~~r~s~--plyRl~a~d~DeG~narit 1197 (4289)
T KOG1219|consen 1144 CSNQRVIVSILDVNDNSPRFLQKKTFLRVPERSSP--PLYRLAAQDNDEGNNARIT 1197 (4289)
T ss_pred ccceEEEEEEeeccCCchhhhhheeEEEeeeccCC--ceeEEEEEecCCCcceEEE
Confidence 99999999999999999999999999999998766 8999999999999998764
No 4
>KOG1219|consensus
Probab=100.00 E-value=5.1e-33 Score=252.14 Aligned_cols=204 Identities=29% Similarity=0.421 Sum_probs=190.8
Q ss_pred CeEEEEcCCCCCcCceEEeCCceEEEECCcCCcCCCCeEEEEEEEEeCC-CCceEEEEEEEEeecCCCCCccCCCceEEE
Q psy11061 1 MVNYTLGESPSRTNHFYMKSVSGEICIAQDLDFESRSSYEFPVVATDRG-GLSTTAMVRIQITDVNDNEPIFNPVEYNVS 79 (212)
Q Consensus 1 ~i~Y~i~~~~~~~~~F~id~~tg~i~~~~~lD~E~~~~~~l~i~a~d~~-~~~~~~~v~i~V~d~nd~~P~f~~~~~~~~ 79 (212)
+|.|+|.++. -.+|.++-+||.|.+..+||||....|.|.|+|+|.+ ++.+++.|.|.|.|+|||+|+|.+..|...
T Consensus 2109 kv~YsIldg~--~slFtvnf~TG~i~v~~pLd~ea~t~h~l~ieAtd~~~p~~Aea~VeIiV~dIndn~PvFeqlsYt~s 2186 (4289)
T KOG1219|consen 2109 KVTYSILDGN--TSLFTVNFTTGVILVLIPLDREASTLHELLIEATDAGIPLSAEAKVEIIVGDINDNPPVFEQLSYTIS 2186 (4289)
T ss_pred ceEEEEecCC--cceEEEecccceEEeccccccccccceEEEEEEeccCCCcceeeEEEEEecccCCCCchhheeeEEEE
Confidence 4799999987 6899999999999999999999999999999999987 689999999999999999999999999999
Q ss_pred EecCCCCCcEEEEEEEEeCCCCCCceEEEEEEcCC-CCCcEEEeCCccEEEEeeecCCC--CCCeEEEEEEEEECCCCCe
Q psy11061 80 LRDDIQTTTAFAVVLATDRDSDRFGTISYKIVTGK-DANLFRIDRSSGELFVTRGNFLS--RSNSYHINVSAMDGGGNKC 156 (212)
Q Consensus 80 v~e~~~~g~~v~~~~a~D~D~~~~~~i~y~i~~~~-~~~~F~id~~~G~i~~~~~~~ld--~~~~~~l~v~a~D~~~~~~ 156 (212)
+.|+++.|+.+.++.|+|.|. |..+.|++..+. ....|+|+..||.|++.+. || +.+.|.+.|+|+|+|.+ .
T Consensus 2187 isE~s~igt~viqilATdsDs--n~~isYsl~g~s~~sk~f~In~sTG~it~~g~--ldyE~~q~f~~fvratdggk~-l 2261 (4289)
T KOG1219|consen 2187 ISENSKIGTKVIQILATDSDS--NREISYSLEGNSEISKPFRINVSTGWITVAGK--LDYEENQEFRFFVRATDGGKP-L 2261 (4289)
T ss_pred ccCCCccCceEEEEEeccCCC--CCceEEEeecCCccccceEEecccceEEEeee--cChhhcceEEEEEEEccCCCc-c
Confidence 999999999999999999997 788999998743 3568999999999999999 88 89999999999998877 6
Q ss_pred eeeEEEEEEEeeCCCCCCeEecCceEEEEeCCCCCCeEEEEEEEEeCCCCCCeec
Q psy11061 157 SQDAQVNINIINSHMPIPLFQQSTYSFVVPEDVFKNSIVGTIKAATSDSGKFTEF 211 (212)
Q Consensus 157 ~~~~~v~I~v~d~nd~~P~f~~~~~~~~v~e~~~~g~~v~~v~a~D~D~~~~~~~ 211 (212)
++.+.|.|.|.|+|||+|.|.+..|.+.|+|.+..|..|..+.|.|.|.-+...+
T Consensus 2262 Sseviv~V~VeD~Ndn~Pef~q~~~ea~vsd~a~~g~fit~v~a~D~Dssd~lk~ 2316 (4289)
T KOG1219|consen 2262 SSEVIVEVHVEDFNDNPPEFNQRNYEAFVSDPARSGHFITVVNAHDLDSSDHLKL 2316 (4289)
T ss_pred cccEEEEEEehhcCCCCchhccccceeecCCCccceeEEEEEEeccCCccchhhh
Confidence 6789999999999999999999999999999999999999999999998876643
No 5
>cd00031 CA Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here
Probab=99.97 E-value=2.2e-28 Score=185.86 Aligned_cols=165 Identities=34% Similarity=0.576 Sum_probs=148.9
Q ss_pred CeEEEEcCCCCCcCceEEeCCceEEEECCcCCcCCCCeEEEEEEEEeCCCCc--eEEEEEEEEeecCCCCCccCCCceEE
Q psy11061 1 MVNYTLGESPSRTNHFYMKSVSGEICIAQDLDFESRSSYEFPVVATDRGGLS--TTAMVRIQITDVNDNEPIFNPVEYNV 78 (212)
Q Consensus 1 ~i~Y~i~~~~~~~~~F~id~~tg~i~~~~~lD~E~~~~~~l~i~a~d~~~~~--~~~~v~i~V~d~nd~~P~f~~~~~~~ 78 (212)
+++|+|.++.. ..+|.|++.+|.|+++++||||....|.|.|+|.|.|... ....++|.|.|+||++|.|....|.+
T Consensus 31 ~~~y~i~~~~~-~~~F~i~~~tG~l~~~~~lD~e~~~~~~l~v~a~D~g~~~~~~~~~v~I~V~d~Nd~~P~~~~~~~~~ 109 (199)
T cd00031 31 RVTYSILGGNE-DGLFSIDPNTGVITTTKPLDREEQSEYTLTVVASDGGGPPLSSTATVTVTVLDVNDNPPVFEQSSYEA 109 (199)
T ss_pred eEEEEEeCCCC-cccEEEeCCCCEEEECCCCCCcCCceEEEEEEEEECCcCcceeEEEEEEEEccCCCCCCcccccceEE
Confidence 37899988762 2799999999999999999999999999999999976544 89999999999999999999889999
Q ss_pred EEecCCCCCcEEEEEEEEeCCCCCCceEEEEEEcCCCCCcEEEeCCccEEEEeeecCCC--CCCeEEEEEEEEECCCCCe
Q psy11061 79 SLRDDIQTTTAFAVVLATDRDSDRFGTISYKIVTGKDANLFRIDRSSGELFVTRGNFLS--RSNSYHINVSAMDGGGNKC 156 (212)
Q Consensus 79 ~v~e~~~~g~~v~~~~a~D~D~~~~~~i~y~i~~~~~~~~F~id~~~G~i~~~~~~~ld--~~~~~~l~v~a~D~~~~~~ 156 (212)
.+.|+.+.|+.++++.|+|+|.+.++.+.|++..+...++|.|++.+|.|.+... +| ....|.+.|.|+|.+++..
T Consensus 110 ~v~e~~~~~~~i~~~~a~D~D~~~~~~~~y~l~~~~~~~~f~i~~~~G~i~~~~~--ld~e~~~~~~l~v~a~D~~~~~~ 187 (199)
T cd00031 110 SVPENAPPGTVVGTVTATDADSGENAKLTYSILSGNDKELFSIDPNTGIITLAKP--LDREEKSSYELTVVATDGGGPPL 187 (199)
T ss_pred EEeCCCCCCCEEEEEEEEcCCCCCCccEEEEEeCCCCCCEEEEeCCceEEEeCCc--cCCccCceEEEEEEEEECCCCCc
Confidence 9999999999999999999999888899999987765589999999999999988 77 6779999999999987667
Q ss_pred eeeEEEEEEEee
Q psy11061 157 SQDAQVNINIIN 168 (212)
Q Consensus 157 ~~~~~v~I~v~d 168 (212)
+.++.++|.|.|
T Consensus 188 ~~~~~i~i~v~d 199 (199)
T cd00031 188 SSTATVTVTVLD 199 (199)
T ss_pred eeEEEEEEEEEC
Confidence 778888888865
No 6
>cd00031 CA Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here
Probab=99.92 E-value=2.3e-23 Score=158.13 Aligned_cols=132 Identities=35% Similarity=0.591 Sum_probs=122.2
Q ss_pred eEEEEecCCCCCcEEEEEEEEeCCCCCCceEEEEEEcCCCCCcEEEeCCccEEEEeeecCCC--CCCeEEEEEEEEECCC
Q psy11061 76 YNVSLRDDIQTTTAFAVVLATDRDSDRFGTISYKIVTGKDANLFRIDRSSGELFVTRGNFLS--RSNSYHINVSAMDGGG 153 (212)
Q Consensus 76 ~~~~v~e~~~~g~~v~~~~a~D~D~~~~~~i~y~i~~~~~~~~F~id~~~G~i~~~~~~~ld--~~~~~~l~v~a~D~~~ 153 (212)
|.+.+.|+.+.|+.++++.|.|+|.+.++.+.|+|..+...++|.|++.+|.|++.+. || ....|.|.|.|+|.+.
T Consensus 2 ~~~~i~En~~~g~~v~~~~a~D~D~~~~~~~~y~i~~~~~~~~F~i~~~tG~l~~~~~--lD~e~~~~~~l~v~a~D~g~ 79 (199)
T cd00031 2 YSVSVPENAPPGTVVGTVSATDPDSGENGRVTYSILGGNEDGLFSIDPNTGVITTTKP--LDREEQSEYTLTVVASDGGG 79 (199)
T ss_pred eEEEEeCCCCCCCEEEEEEEECCCCCCCceEEEEEeCCCCcccEEEeCCCCEEEECCC--CCCcCCceEEEEEEEEECCc
Confidence 7889999999999999999999999888899999998766589999999999999998 77 7899999999999887
Q ss_pred CCeeeeEEEEEEEeeCCCCCCeEecCceEEEEeCCCCCCeEEEEEEEEeCCCCCCe
Q psy11061 154 NKCSQDAQVNINIINSHMPIPLFQQSTYSFVVPEDVFKNSIVGTIKAATSDSGKFT 209 (212)
Q Consensus 154 ~~~~~~~~v~I~v~d~nd~~P~f~~~~~~~~v~e~~~~g~~v~~v~a~D~D~~~~~ 209 (212)
+.++....++|.|.|+||++|.|....|.+.+.|+.++|+.++++.|+|+|.+.+.
T Consensus 80 ~~~~~~~~v~I~V~d~Nd~~P~~~~~~~~~~v~e~~~~~~~i~~~~a~D~D~~~~~ 135 (199)
T cd00031 80 PPLSSTATVTVTVLDVNDNPPVFEQSSYEASVPENAPPGTVVGTVTATDADSGENA 135 (199)
T ss_pred CcceeEEEEEEEEccCCCCCCcccccceEEEEeCCCCCCCEEEEEEEEcCCCCCCc
Confidence 77778899999999999999999988999999999999999999999999997444
No 7
>PF00028 Cadherin: Cadherin domain; InterPro: IPR002126 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion.; GO: 0005509 calcium ion binding, 0007156 homophilic cell adhesion, 0016020 membrane; PDB: 2A4E_A 2A4C_B 2O72_A 2QVI_A 1NCJ_A 3Q2W_A 3Q2N_A 3LNH_B 3LNI_A 3Q2L_A ....
Probab=99.77 E-value=3.8e-17 Score=109.27 Aligned_cols=90 Identities=34% Similarity=0.542 Sum_probs=84.7
Q ss_pred eEEEEecCCCCCcEEEEEEEEeCCCCCCceEEEEEEcCCCCCcEEEeCCccEEEEeeecCCC--CCCeEEEEEEEEEC-C
Q psy11061 76 YNVSLRDDIQTTTAFAVVLATDRDSDRFGTISYKIVTGKDANLFRIDRSSGELFVTRGNFLS--RSNSYHINVSAMDG-G 152 (212)
Q Consensus 76 ~~~~v~e~~~~g~~v~~~~a~D~D~~~~~~i~y~i~~~~~~~~F~id~~~G~i~~~~~~~ld--~~~~~~l~v~a~D~-~ 152 (212)
|.+.|+|+.+.|+.++++.|.|+|.+.++.+.|+|..++..++|.|++.+|.|.+.+. || ....|.|.|.|+|. +
T Consensus 1 Y~~~v~E~~~~g~~v~~v~a~D~D~~~n~~i~y~i~~~~~~~~F~I~~~tg~i~~~~~--LD~E~~~~y~l~v~a~D~~~ 78 (93)
T PF00028_consen 1 YSFSVPENAPPGTVVGQVTATDPDSGPNSQITYSILGGNPDGLFSIDPNTGEISLKKP--LDRETQSSYQLTVRATDSGG 78 (93)
T ss_dssp EEEEEETTGSTSSEEEEEEEEESSTSTTSSEEEEEEETTSTTSEEEETTTTEEEESSS--SCTTTTSEEEEEEEEEETTT
T ss_pred CEEEEECCCCCCCEEEEEEEEeCCCCCCceEEEEEecCcccCceEEeeeeecccccee--cCcccCCEEEEEEEEEECCC
Confidence 7899999999999999999999999899999999999887899999999999999999 88 88999999999999 7
Q ss_pred CCCeeeeEEEEEEEe
Q psy11061 153 GNKCSQDAQVNINII 167 (212)
Q Consensus 153 ~~~~~~~~~v~I~v~ 167 (212)
.++++++++|.|+|.
T Consensus 79 ~~~~~~~~~V~I~V~ 93 (93)
T PF00028_consen 79 SPPLSSTATVTINVL 93 (93)
T ss_dssp SSEEEEEEEEEEEEE
T ss_pred CCCCEEEEEEEEEEC
Confidence 888889999999884
No 8
>KOG1834|consensus
Probab=99.76 E-value=6.6e-17 Score=135.82 Aligned_cols=158 Identities=27% Similarity=0.464 Sum_probs=128.5
Q ss_pred EEEEcCCCCCcCceEE---eCCce--EEEECCcCCcCCCCeEEEEEEEEeCCC--------CceEEEEEEEEeecCCCCC
Q psy11061 3 NYTLGESPSRTNHFYM---KSVSG--EICIAQDLDFESRSSYEFPVVATDRGG--------LSTTAMVRIQITDVNDNEP 69 (212)
Q Consensus 3 ~Y~i~~~~~~~~~F~i---d~~tg--~i~~~~~lD~E~~~~~~l~i~a~d~~~--------~~~~~~v~i~V~d~nd~~P 69 (212)
-|.|.+++ -.|.+ |..|| .|+.+.+||.|.++.|+|+|+|.|+|. .+..++++|.|.|+|+.+|
T Consensus 71 ~fklhgq~---vPFdavVvdK~TGegvlRaK~~lDCelqkeytf~iQAydCg~gpdgtn~kKShkatvhIrVkDvNe~AP 147 (952)
T KOG1834|consen 71 GFKLHGQP---VPFDAVVVDKYTGEGVLRAKEPLDCELQKEYTFTIQAYDCGNGPDGTNTKKSHKATVHIRVKDVNEFAP 147 (952)
T ss_pred eeEecCCC---CCceEEEEeccCCceEEeecCcccccccccceEEEEEEecCCCCCccccccccceEEEEEeccccccCc
Confidence 36664443 34543 55555 699999999999999999999999872 4567899999999999999
Q ss_pred ccCCCceEEEEecCCCCCcEEEEEEEEeCCCC-CCceE-EEEEEcCCCCCcEEEeCCccEEEEeeecCCC--CCCeEEEE
Q psy11061 70 IFNPVEYNVSLRDDIQTTTAFAVVLATDRDSD-RFGTI-SYKIVTGKDANLFRIDRSSGELFVTRGNFLS--RSNSYHIN 145 (212)
Q Consensus 70 ~f~~~~~~~~v~e~~~~g~~v~~~~a~D~D~~-~~~~i-~y~i~~~~~~~~F~id~~~G~i~~~~~~~ld--~~~~~~l~ 145 (212)
+|....|.+.|.|+. .-..++++.|.|.|.+ +++.| .|.|+. ++-.|.|| +.|.|+.+.. |. +..+|.|+
T Consensus 148 ~f~ep~Yka~V~EGK-~yd~il~veAiD~DCspq~sqIC~YEI~t--~d~PFaId-n~G~irnTek--Lny~ke~~Y~lt 221 (952)
T KOG1834|consen 148 VFKEPWYKAHVTEGK-VYDSILRVEAIDKDCSPQYSQICEYEITT--PDVPFAID-NDGNIRNTEK--LNYTKEHQYKLT 221 (952)
T ss_pred hhcccceeeEEecce-eeeeeEEEEeecCCCCCcccceeEEEecC--CCCceEEc-CCCccccccc--cccccceeEEEE
Confidence 999999999999984 4577899999999986 56666 788875 45689998 5699999999 77 89999999
Q ss_pred EEEEECCCCCeeeeEEEEEEEeeC
Q psy11061 146 VSAMDGGGNKCSQDAQVNINIINS 169 (212)
Q Consensus 146 v~a~D~~~~~~~~~~~v~I~v~d~ 169 (212)
|.|.|.+..+....+-|+|.|...
T Consensus 222 VtAyDCg~kraa~d~lV~v~Vkp~ 245 (952)
T KOG1834|consen 222 VTAYDCGKKRAASDSLVTVHVKPT 245 (952)
T ss_pred EEEEecccccccCcceEEEEecCc
Confidence 999998865544556788887654
No 9
>smart00112 CA Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
Probab=99.67 E-value=1.1e-15 Score=99.04 Aligned_cols=77 Identities=35% Similarity=0.641 Sum_probs=70.2
Q ss_pred EeCCCCCCceEEEEEEcCCCCCcEEEeCCccEEEEeeecCCC--CCCeEEEEEEEEECCCCCeeeeEEEEEEEeeCCCCC
Q psy11061 96 TDRDSDRFGTISYKIVTGKDANLFRIDRSSGELFVTRGNFLS--RSNSYHINVSAMDGGGNKCSQDAQVNINIINSHMPI 173 (212)
Q Consensus 96 ~D~D~~~~~~i~y~i~~~~~~~~F~id~~~G~i~~~~~~~ld--~~~~~~l~v~a~D~~~~~~~~~~~v~I~v~d~nd~~ 173 (212)
+|+|.+.++.+.|+|..+...++|.|++.+|.|++.+. || ....|.|.|.|+|.+++.+++.++|+|+|.|+||++
T Consensus 1 ~D~D~g~n~~i~Y~i~~~~~~~~F~i~~~tg~i~~~~~--LD~e~~~~y~l~v~a~D~~~~~~~~~~~v~I~V~D~Nd~~ 78 (79)
T smart00112 1 TDADSGENGKVTYSILSGNEDGLFSIDPETGEITTTKP--LDREEQPEYTLTVEATDGGGPPLSSTATVTVTVLDVNDNA 78 (79)
T ss_pred CCCCCCcCcEEEEEEecCCCCCEEEEeCCccEEEeCCc--cCeeCCCeEEEEEEEEECCCCCcccEEEEEEEEEECCCCC
Confidence 47888888999999998766589999999999988888 88 789999999999999888889999999999999999
Q ss_pred C
Q psy11061 174 P 174 (212)
Q Consensus 174 P 174 (212)
|
T Consensus 79 P 79 (79)
T smart00112 79 P 79 (79)
T ss_pred C
Confidence 8
No 10
>smart00112 CA Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
Probab=99.63 E-value=4.1e-15 Score=96.30 Aligned_cols=68 Identities=40% Similarity=0.607 Sum_probs=62.2
Q ss_pred CeEEEEcCCCCCcCceEEeCCceEEEECCcCCcCCCCeEEEEEEEEeCCC--CceEEEEEEEEeecCCCCC
Q psy11061 1 MVNYTLGESPSRTNHFYMKSVSGEICIAQDLDFESRSSYEFPVVATDRGG--LSTTAMVRIQITDVNDNEP 69 (212)
Q Consensus 1 ~i~Y~i~~~~~~~~~F~id~~tg~i~~~~~lD~E~~~~~~l~i~a~d~~~--~~~~~~v~i~V~d~nd~~P 69 (212)
+|+|+|..+.. ..+|.|++.+|.|+++++||||....|.|.|+|.|.++ +++.+.|+|.|.|+|||+|
T Consensus 10 ~i~Y~i~~~~~-~~~F~i~~~tg~i~~~~~LD~e~~~~y~l~v~a~D~~~~~~~~~~~v~I~V~D~Nd~~P 79 (79)
T smart00112 10 KVTYSILSGNE-DGLFSIDPETGEITTTKPLDREEQPEYTLTVEATDGGGPPLSSTATVTVTVLDVNDNAP 79 (79)
T ss_pred EEEEEEecCCC-CCEEEEeCCccEEEeCCccCeeCCCeEEEEEEEEECCCCCcccEEEEEEEEEECCCCCC
Confidence 37899987762 28999999999999999999999999999999999985 7899999999999999998
No 11
>KOG1834|consensus
Probab=99.40 E-value=6.3e-12 Score=106.32 Aligned_cols=148 Identities=23% Similarity=0.327 Sum_probs=114.2
Q ss_pred EEEEeecCCCCCccCCCceEEEEecCCCCCcEEEEEEEEeCCCC--CCceE-EEEEEcCCCCCcEEE---eCCc--cEEE
Q psy11061 58 RIQITDVNDNEPIFNPVEYNVSLRDDIQTTTAFAVVLATDRDSD--RFGTI-SYKIVTGKDANLFRI---DRSS--GELF 129 (212)
Q Consensus 58 ~i~V~d~nd~~P~f~~~~~~~~v~e~~~~g~~v~~~~a~D~D~~--~~~~i-~y~i~~~~~~~~F~i---d~~~--G~i~ 129 (212)
+....-+|-+.|.... .|...|.||...-.....+.|-|.|.+ -.+.| -|+|.+.+ -.|.+ |..+ |.|+
T Consensus 20 ~~~aarankhkpwie~-ey~gvV~Endntvll~Ppl~aLdkdaplr~ageiC~fklhgq~--vPFdavVvdK~TGegvlR 96 (952)
T KOG1834|consen 20 HHHAARANKHKPWIEE-EYHGVVTENDNTVLLDPPLAALDKDAPLRYAGEICGFKLHGQP--VPFDAVVVDKYTGEGVLR 96 (952)
T ss_pred ccccccccccCccccc-ceeEEEEeCCceEEeCCCeeeecCCCCcccccccceeEecCCC--CCceEEEEeccCCceEEe
Confidence 4556677888887766 599999999765444457888898875 22334 56665432 34554 5555 5688
Q ss_pred EeeecCCC--CCCeEEEEEEEEECC-CC-----CeeeeEEEEEEEeeCCCCCCeEecCceEEEEeCCCCCCeEEEEEEEE
Q psy11061 130 VTRGNFLS--RSNSYHINVSAMDGG-GN-----KCSQDAQVNINIINSHMPIPLFQQSTYSFVVPEDVFKNSIVGTIKAA 201 (212)
Q Consensus 130 ~~~~~~ld--~~~~~~l~v~a~D~~-~~-----~~~~~~~v~I~v~d~nd~~P~f~~~~~~~~v~e~~~~g~~v~~v~a~ 201 (212)
.+.+ +| .+..|+|+|+|.|.+ +| .++..++|+|.|.|+|.++|+|..+.|.+.|.|. .....|++|.|.
T Consensus 97 aK~~--lDCelqkeytf~iQAydCg~gpdgtn~kKShkatvhIrVkDvNe~AP~f~ep~Yka~V~EG-K~yd~il~veAi 173 (952)
T KOG1834|consen 97 AKEP--LDCELQKEYTFTIQAYDCGNGPDGTNTKKSHKATVHIRVKDVNEFAPVFKEPWYKAHVTEG-KVYDSILRVEAI 173 (952)
T ss_pred ecCc--ccccccccceEEEEEEecCCCCCccccccccceEEEEEeccccccCchhcccceeeEEecc-eeeeeeEEEEee
Confidence 8888 99 699999999999977 44 4577899999999999999999999999999998 677899999999
Q ss_pred eCC-CCCCeec
Q psy11061 202 TSD-SGKFTEF 211 (212)
Q Consensus 202 D~D-~~~~~~~ 211 (212)
|.| +++++++
T Consensus 174 D~DCspq~sqI 184 (952)
T KOG1834|consen 174 DKDCSPQYSQI 184 (952)
T ss_pred cCCCCCcccce
Confidence 999 5555553
No 12
>PF00028 Cadherin: Cadherin domain; InterPro: IPR002126 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion.; GO: 0005509 calcium ion binding, 0007156 homophilic cell adhesion, 0016020 membrane; PDB: 2A4E_A 2A4C_B 2O72_A 2QVI_A 1NCJ_A 3Q2W_A 3Q2N_A 3LNH_B 3LNI_A 3Q2L_A ....
Probab=99.23 E-value=1.1e-10 Score=77.79 Aligned_cols=61 Identities=36% Similarity=0.544 Sum_probs=53.2
Q ss_pred CeEEEEcCCCCCcCceEEeCCceEEEECCcCCcCCCCeEEEEEEEEeC-CC--CceEEEEEEEEe
Q psy11061 1 MVNYTLGESPSRTNHFYMKSVSGEICIAQDLDFESRSSYEFPVVATDR-GG--LSTTAMVRIQIT 62 (212)
Q Consensus 1 ~i~Y~i~~~~~~~~~F~id~~tg~i~~~~~lD~E~~~~~~l~i~a~d~-~~--~~~~~~v~i~V~ 62 (212)
++.|+|..+. +.++|.|++.+|.|+++++||||....|.|.|.|+|. ++ +++.+.|+|+|.
T Consensus 30 ~i~y~i~~~~-~~~~F~I~~~tg~i~~~~~LD~E~~~~y~l~v~a~D~~~~~~~~~~~~V~I~V~ 93 (93)
T PF00028_consen 30 QITYSILGGN-PDGLFSIDPNTGEISLKKPLDRETQSSYQLTVRATDSGGSPPLSSTATVTINVL 93 (93)
T ss_dssp SEEEEEEETT-STTSEEEETTTTEEEESSSSCTTTTSEEEEEEEEEETTTSSEEEEEEEEEEEEE
T ss_pred eEEEEEecCc-ccCceEEeeeeeccccceecCcccCCEEEEEEEEEECCCCCCCEEEEEEEEEEC
Confidence 4789997776 3689999999999999999999999999999999999 53 567778887764
No 13
>PF08758 Cadherin_pro: Cadherin prodomain like; InterPro: IPR014868 Cadherins are a group of proteins that mediate calcium dependent cell-cell adhesion. They are activated through cleavage of a prosequence in the late Golgi. This protein corresponds to the folded region of the prosequence, and is termed the prodomain. The prodomain shows structural resemblance to the cadherin domain, but lacks all the features known to be important for cadherin-cadherin interactions []. ; GO: 0007155 cell adhesion, 0016021 integral to membrane; PDB: 1OP4_A.
Probab=97.53 E-value=0.0025 Score=41.89 Aligned_cols=79 Identities=16% Similarity=0.286 Sum_probs=44.4
Q ss_pred CCccCCCceEEEEecCCCCCcEEEEEEEEeCCCCCCceEEEEEEcCCCCCcEEEeCCccEEEEeeecCCC-CCCeEEEEE
Q psy11061 68 EPIFNPVEYNVSLRDDIQTTTAFAVVLATDRDSDRFGTISYKIVTGKDANLFRIDRSSGELFVTRGNFLS-RSNSYHINV 146 (212)
Q Consensus 68 ~P~f~~~~~~~~v~e~~~~g~~v~~~~a~D~D~~~~~~i~y~i~~~~~~~~F~id~~~G~i~~~~~~~ld-~~~~~~l~v 146 (212)
.|-|....|.+.|+.+...|..++++.-.|... +..+.|... + ..|+|. ..|.|.+..+ +. ..+.-.|.|
T Consensus 3 ~pGF~~~~~~~~Vp~~l~~g~~lg~V~f~dC~~--~~~~~~~ss--D--pdF~V~-~DGsVy~~r~--v~l~~~~~~F~V 73 (90)
T PF08758_consen 3 RPGFSQKKYTFEVPSNLEAGQPLGKVNFEDCTG--RRRVIFESS--D--PDFRVL-EDGSVYAKRP--VQLSSEQRSFTV 73 (90)
T ss_dssp --B--S-EEEE----SS-SS--EEE---B--SS-----EEEE-------SEEEEE-TTTEEEEES----S-SSS-EEEEE
T ss_pred cCCcccceEEEEcCchhhCCcEEEEEEeccCCC--CCceEEecC--C--CCEEEc-CCCeEEEeee--EecCCCceEEEE
Confidence 488999999999999999999999999988843 345777753 2 389997 5699999999 77 666678999
Q ss_pred EEEECCCCC
Q psy11061 147 SAMDGGGNK 155 (212)
Q Consensus 147 ~a~D~~~~~ 155 (212)
.|.|..+..
T Consensus 74 ~a~D~~~~~ 82 (90)
T PF08758_consen 74 HAWDSQTQE 82 (90)
T ss_dssp EEEETTTTE
T ss_pred EEECCCCCe
Confidence 999987654
No 14
>smart00736 CADG Dystroglycan-type cadherin-like domains. Cadherin-homologous domains present in metazoan dystroglycans and alpha/epsilon sarcoglycans, yeast Axl2p and in a very large protein from magnetotactic bacteria. Likely to bind calcium ions.
Probab=97.20 E-value=0.015 Score=38.71 Aligned_cols=69 Identities=26% Similarity=0.342 Sum_probs=50.9
Q ss_pred EeCCCCCCceEEEEEEcCC---CCCcEEEeCCccEEEEeeecCCCCCCeEEEEEEEEECCCCCeeeeEEEEEEEeeCCC
Q psy11061 96 TDRDSDRFGTISYKIVTGK---DANLFRIDRSSGELFVTRGNFLSRSNSYHINVSAMDGGGNKCSQDAQVNINIINSHM 171 (212)
Q Consensus 96 ~D~D~~~~~~i~y~i~~~~---~~~~F~id~~~G~i~~~~~~~ld~~~~~~l~v~a~D~~~~~~~~~~~v~I~v~d~nd 171 (212)
.|.| ...+.|++...+ .+.|...|+.++.+.-... ....+.|.++|.|+|..+ .+....++|.|.+.|+
T Consensus 25 ~d~d---~~~lty~~~~~~~~~lP~Wl~fd~~~~~~~GtP~--~~~~g~~~i~v~a~D~~g--~~~~~~f~i~V~~~~~ 96 (97)
T smart00736 25 TDAD---GDTLTYSATLSDGSALPSWLSFDSDTGTLSGTPT--NSDVGSLSLKVTATDSSG--ASASDTFTITVVNTND 96 (97)
T ss_pred ECCC---CCeEEEEEEeCCCCCCCCeEEEeCCCCEEEEECC--CCCCcEEEEEEEEEECCC--CEEEEEEEEEEeCCCC
Confidence 5665 335888886432 3579999998888876443 335678999999999876 4467789999998885
No 15
>PF08266 Cadherin_2: Cadherin-like; InterPro: IPR013164 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion. This entry represents a cadherin domain that is usually found at the N terminus of cadherin proteins.; PDB: 1WUZ_A 1WYJ_A.
Probab=97.18 E-value=0.0007 Score=43.90 Aligned_cols=60 Identities=20% Similarity=0.455 Sum_probs=37.9
Q ss_pred ceEEEEecCCCCCcEEEEEEEEeCCCCCC--ceEEEEEEcCCCCCcEEEeCCccEEEEeeecCCC
Q psy11061 75 EYNVSLRDDIQTTTAFAVVLATDRDSDRF--GTISYKIVTGKDANLFRIDRSSGELFVTRGNFLS 137 (212)
Q Consensus 75 ~~~~~v~e~~~~g~~v~~~~a~D~D~~~~--~~i~y~i~~~~~~~~F~id~~~G~i~~~~~~~ld 137 (212)
.....|+|..+.|+.|+.+. .|...... ....|++.+.....+|.++..+|.+.+... +|
T Consensus 2 qi~YsV~EE~~~Gt~IGnia-~dL~l~~~~l~~~~~ri~s~~~~~~~~v~~~tG~L~v~~r--ID 63 (84)
T PF08266_consen 2 QIRYSVPEEMPPGTVIGNIA-KDLGLDPQSLSSRNFRIVSEGNSQYFRVNEKTGDLFVSER--ID 63 (84)
T ss_dssp EEEEEEESS--TT-EEEECC-CCCT--HHHHCCTTBEEE-SSSS-SEEE-TTTSEEEESS----S
T ss_pred CeEEEeecCCCCCCEEEEhH-HhhCCCcccccccceEEeecCCcceeEecCCceeEEeCCc--cC
Confidence 35688999999999999984 44433211 124677777666789999999999999988 77
No 16
>TIGR01965 VCBS_repeat VCBS repeat. This domain of about 100 residues is found multiple (up to 35) copies in long proteins from several species of Vibrio, Colwellia, Bradyrhizobium, and Shewanella (hence the name VCBS) and in smaller copy numbers in proteins from several other bacteria. The large protein size and repeat copy numbers, species distribution, and suggested activities of several member proteins suggests a role for this domain in adhesion.
Probab=97.11 E-value=0.016 Score=38.63 Aligned_cols=88 Identities=18% Similarity=0.195 Sum_probs=57.8
Q ss_pred EEEEEEeCCCCCCceEEEEEEc-CCCCCcEEEeCCccEEEEeeecCC------C--CCCeEEEEEEEEECCCCCeeeeEE
Q psy11061 91 AVVLATDRDSDRFGTISYKIVT-GKDANLFRIDRSSGELFVTRGNFL------S--RSNSYHINVSAMDGGGNKCSQDAQ 161 (212)
Q Consensus 91 ~~~~a~D~D~~~~~~i~y~i~~-~~~~~~F~id~~~G~i~~~~~~~l------d--~~~~~~l~v~a~D~~~~~~~~~~~ 161 (212)
+++.++|+|.+.. ..+.... ....+.|.|+. +|.+.....+.. . ..-.-.|.+.+.|+. +.+
T Consensus 2 G~Lt~sD~D~gd~--~~~s~~~~~g~yGtlti~~-~G~wtYtl~n~~~avq~L~~Ge~~tdsFtvtv~DGt------t~~ 72 (99)
T TIGR01965 2 GQLTISDADAGQA--HFIAQTDAAGQYGTFSIDA-DGQWTYQADNSQTAVQALKAGETLTDTFTVTSADGT------SQT 72 (99)
T ss_pred CceEEeCCCCCCc--eEEecccccCCcEEEEECC-CCcEEEEeCCCcHHHHhhcCCCEEEEEEEEEEeCCC------eEE
Confidence 4688899997654 4555532 23467899986 688877654321 1 223456888888842 788
Q ss_pred EEEEEeeCCCCCCeEecCceEEEEeCCC
Q psy11061 162 VNINIINSHMPIPLFQQSTYSFVVPEDV 189 (212)
Q Consensus 162 v~I~v~d~nd~~P~f~~~~~~~~v~e~~ 189 (212)
|+|+|...|| +|+..... ...+.|+.
T Consensus 73 vtItI~GtND-apvi~~~~-~g~v~ED~ 98 (99)
T TIGR01965 73 VTITITGAND-AAVIGGAD-TGSVTEDS 98 (99)
T ss_pred EEEEEEccCC-CCEEeccc-ceeEecCC
Confidence 9999999997 66655433 46666663
No 17
>PF13750 Big_3_3: Bacterial Ig-like domain (group 3)
Probab=97.02 E-value=0.075 Score=38.81 Aligned_cols=127 Identities=20% Similarity=0.331 Sum_probs=73.8
Q ss_pred CCeEEEEE-EEEeCCCCceEEEEEEEEeecCCCCCccCCCceEEEEecCCCC-CcEEEEEEEEeCCCCCCceEEEEEEcC
Q psy11061 36 RSSYEFPV-VATDRGGLSTTAMVRIQITDVNDNEPIFNPVEYNVSLRDDIQT-TTAFAVVLATDRDSDRFGTISYKIVTG 113 (212)
Q Consensus 36 ~~~~~l~i-~a~d~~~~~~~~~v~i~V~d~nd~~P~f~~~~~~~~v~e~~~~-g~~v~~~~a~D~D~~~~~~i~y~i~~~ 113 (212)
...|.+.+ .|.|.-|...+..+...+. +...+|.+.- .....+..+..+ +..-..+.++|...+. .--...+.+|
T Consensus 14 dG~Y~l~~~~a~D~agN~~~~~~~~~~~-iD~T~Ptisi-~~~~~~~~g~~v~~~~~i~i~~tD~~~~~-~i~sv~l~Gg 90 (158)
T PF13750_consen 14 DGSYTLTVVTATDAAGNTSTSTVSETFT-IDNTPPTISI-SDGASVANGSTVYGLVNISINVTDNSDDS-KITSVSLTGG 90 (158)
T ss_pred CccEEEEEEEEEecCCCEEEEEEeeEEE-EcCCCCEEEE-ecCCccCCCccccceeeeEEEEEeCCCCc-eEEEEEEECC
Confidence 46899999 7999988766666654444 3556787744 111222222222 2333567788876554 3346677655
Q ss_pred CCCCcEEEe---CCccEEEEeeecCC---CCCCeEEEEEEEEECCCCCeeeeEEEEEEEe
Q psy11061 114 KDANLFRID---RSSGELFVTRGNFL---SRSNSYHINVSAMDGGGNKCSQDAQVNINII 167 (212)
Q Consensus 114 ~~~~~F~id---~~~G~i~~~~~~~l---d~~~~~~l~v~a~D~~~~~~~~~~~v~I~v~ 167 (212)
.....-.+. ...|...+.-+..+ ...+.|+|.|.|.|..|.. .+.++.....
T Consensus 91 ~~~d~v~ls~~~~~~~~~~~~yp~~fpsle~~~~YtLtV~a~D~aGN~--~~~si~F~y~ 148 (158)
T PF13750_consen 91 PASDSVSLSWTNKGNGVYTLEYPRIFPSLEADDSYTLTVSATDKAGNQ--STKSISFSYM 148 (158)
T ss_pred cccceEEEeeEeccCceEEeecccccCCcCCCCeEEEEEEEEecCCCE--EEEEEEEEEe
Confidence 433332221 12345444433223 3688999999999998865 3445555544
No 18
>smart00736 CADG Dystroglycan-type cadherin-like domains. Cadherin-homologous domains present in metazoan dystroglycans and alpha/epsilon sarcoglycans, yeast Axl2p and in a very large protein from magnetotactic bacteria. Likely to bind calcium ions.
Probab=97.01 E-value=0.013 Score=39.00 Aligned_cols=63 Identities=21% Similarity=0.291 Sum_probs=48.8
Q ss_pred eEEEEcC--CCCCcCceEEeCCceEEEECCcCCcCCCCeEEEEEEEEeCCCCceEEEEEEEEeecCC
Q psy11061 2 VNYTLGE--SPSRTNHFYMKSVSGEICIAQDLDFESRSSYEFPVVATDRGGLSTTAMVRIQITDVND 66 (212)
Q Consensus 2 i~Y~i~~--~~~~~~~F~id~~tg~i~~~~~lD~E~~~~~~l~i~a~d~~~~~~~~~v~i~V~d~nd 66 (212)
|+|++.. +..-..|.+.++.++.+.-. +... ....|.+.+.|+|..|.+....++|.|.+.|+
T Consensus 32 lty~~~~~~~~~lP~Wl~fd~~~~~~~Gt-P~~~-~~g~~~i~v~a~D~~g~~~~~~f~i~V~~~~~ 96 (97)
T smart00736 32 LTYSATLSDGSALPSWLSFDSDTGTLSGT-PTNS-DVGSLSLKVTATDSSGASASDTFTITVVNTND 96 (97)
T ss_pred EEEEEEeCCCCCCCCeEEEeCCCCEEEEE-CCCC-CCcEEEEEEEEEECCCCEEEEEEEEEEeCCCC
Confidence 6788743 22234799999999888874 4332 34679999999999888899999999999886
No 19
>PF08266 Cadherin_2: Cadherin-like; InterPro: IPR013164 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion. This entry represents a cadherin domain that is usually found at the N terminus of cadherin proteins.; PDB: 1WUZ_A 1WYJ_A.
Probab=96.15 E-value=0.016 Score=37.59 Aligned_cols=31 Identities=13% Similarity=0.389 Sum_probs=20.9
Q ss_pred EEEcCCCCCcCceEEeCCceEEEECCcCCcCC
Q psy11061 4 YTLGESPSRTNHFYMKSVSGEICIAQDLDFES 35 (212)
Q Consensus 4 Y~i~~~~~~~~~F~id~~tg~i~~~~~lD~E~ 35 (212)
|+|.+.. ...+|.+++.+|.|+++..+|||.
T Consensus 36 ~ri~s~~-~~~~~~v~~~tG~L~v~~rIDRE~ 66 (84)
T PF08266_consen 36 FRIVSEG-NSQYFRVNEKTGDLFVSERIDREE 66 (84)
T ss_dssp BEEE-SS-SS-SEEE-TTTSEEEESS--SCCC
T ss_pred eEEeecC-CcceeEecCCceeEEeCCccCHHH
Confidence 4543332 358999999999999999999998
No 20
>PF05345 He_PIG: Putative Ig domain; InterPro: IPR008009 This alignment represents the conserved core region of a ~90 residue repeat found in several haemagglutinins and other cell surface proteins. Sequence similarities to Hyalin (IPR003410 from INTERPRO) and the PKD domain (IPR000601 from INTERPRO) suggest an Ig-like fold so this family may be similar in function to the (IPR003791 from INTERPRO) and (IPR003790 from INTERPRO) protein families.
Probab=94.24 E-value=0.35 Score=27.80 Aligned_cols=37 Identities=30% Similarity=0.420 Sum_probs=29.2
Q ss_pred CCCcEEEeCCccEEEEeeecCCC-CCCeEEEEEEEEECCC
Q psy11061 115 DANLFRIDRSSGELFVTRGNFLS-RSNSYHINVSAMDGGG 153 (212)
Q Consensus 115 ~~~~F~id~~~G~i~~~~~~~ld-~~~~~~l~v~a~D~~~ 153 (212)
-+....+|+.+|.|.-... -. ..+.|.+.|.|+|..+
T Consensus 12 LP~gLs~d~~tG~isGtp~--~~~~~G~y~~~vtatd~~G 49 (49)
T PF05345_consen 12 LPSGLSLDPSTGTISGTPT--SSVQPGTYTFTVTATDGSG 49 (49)
T ss_pred CCCcEEEeCCCCEEEeecC--CCccccEEEEEEEEEcCCC
Confidence 3568899999999987755 33 4579999999999753
No 21
>PF07495 Y_Y_Y: Y_Y_Y domain; InterPro: IPR011123 This region is mostly found at the end of the beta propellers (IPR011110 from INTERPRO) in a family of two component regulators. However they are also found tandemly repeated in Q891H4 from SWISSPROT without other signal conduction domains being present. It is named after the conserved tyrosines found in the alignment. The exact function is not known.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=93.81 E-value=0.86 Score=27.52 Aligned_cols=59 Identities=20% Similarity=0.261 Sum_probs=34.4
Q ss_pred CceEEEEEEcCCCCCcEEEeCCccEEEEeeecCCCCCCeEEEEEEEEECCCCCeeeeEEEEEEE
Q psy11061 103 FGTISYKIVTGKDANLFRIDRSSGELFVTRGNFLSRSNSYHINVSAMDGGGNKCSQDAQVNINI 166 (212)
Q Consensus 103 ~~~i~y~i~~~~~~~~F~id~~~G~i~~~~~~~ld~~~~~~l~v~a~D~~~~~~~~~~~v~I~v 166 (212)
+-...|++.. ....+..+...+-.+....- ..+.|+|.|.|.|..+.......++.|+|
T Consensus 7 ~~~Y~Y~l~g-~d~~W~~~~~~~~~~~~~~L----~~G~Y~l~V~a~~~~~~~~~~~~~l~i~I 65 (66)
T PF07495_consen 7 NIRYRYRLEG-FDDEWITLGSYSNSISYTNL----PPGKYTLEVRAKDNNGKWSSDEKSLTITI 65 (66)
T ss_dssp TEEEEEEEET-TESSEEEESSTS-EEEEES------SEEEEEEEEEEETTS-B-SS-EEEEEEE
T ss_pred ceEEEEEEEC-CCCeEEECCCCcEEEEEEeC----CCEEEEEEEEEECCCCCcCcccEEEEEEE
Confidence 3356777753 44466666433224443332 68999999999998876654446666666
No 22
>PF08758 Cadherin_pro: Cadherin prodomain like; InterPro: IPR014868 Cadherins are a group of proteins that mediate calcium dependent cell-cell adhesion. They are activated through cleavage of a prosequence in the late Golgi. This protein corresponds to the folded region of the prosequence, and is termed the prodomain. The prodomain shows structural resemblance to the cadherin domain, but lacks all the features known to be important for cadherin-cadherin interactions []. ; GO: 0007155 cell adhesion, 0016021 integral to membrane; PDB: 1OP4_A.
Probab=92.42 E-value=1 Score=29.50 Aligned_cols=37 Identities=19% Similarity=0.256 Sum_probs=27.6
Q ss_pred CceEEeCCceEEEECCcCCcCCCCeEEEEEEEEeCCCCc
Q psy11061 14 NHFYMKSVSGEICIAQDLDFESRSSYEFPVVATDRGGLS 52 (212)
Q Consensus 14 ~~F~id~~tg~i~~~~~lD~E~~~~~~l~i~a~d~~~~~ 52 (212)
..|.|.+ .|.|.+++++.... ..-+|.|.|+|..+..
T Consensus 46 pdF~V~~-DGsVy~~r~v~l~~-~~~~F~V~a~D~~~~~ 82 (90)
T PF08758_consen 46 PDFRVLE-DGSVYAKRPVQLSS-EQRSFTVHAWDSQTQE 82 (90)
T ss_dssp SEEEEET-TTEEEEES--S-SS-S-EEEEEEEEETTTTE
T ss_pred CCEEEcC-CCeEEEeeeEecCC-CceEEEEEEECCCCCe
Confidence 4799998 99999999987654 4468999999998754
No 23
>PF05345 He_PIG: Putative Ig domain; InterPro: IPR008009 This alignment represents the conserved core region of a ~90 residue repeat found in several haemagglutinins and other cell surface proteins. Sequence similarities to Hyalin (IPR003410 from INTERPRO) and the PKD domain (IPR000601 from INTERPRO) suggest an Ig-like fold so this family may be similar in function to the (IPR003791 from INTERPRO) and (IPR003790 from INTERPRO) protein families.
Probab=92.01 E-value=1.1 Score=25.69 Aligned_cols=46 Identities=24% Similarity=0.381 Sum_probs=31.7
Q ss_pred EEEEc--CCCCCcCceEEeCCceEEEECCcCCcC-CCCeEEEEEEEEeCCC
Q psy11061 3 NYTLG--ESPSRTNHFYMKSVSGEICIAQDLDFE-SRSSYEFPVVATDRGG 50 (212)
Q Consensus 3 ~Y~i~--~~~~~~~~F~id~~tg~i~~~~~lD~E-~~~~~~l~i~a~d~~~ 50 (212)
+|++. ....-+.+..||+.+|.|+-.-. .+ ....|.+.|.|.|..|
T Consensus 1 Tys~~~~~~~~LP~gLs~d~~tG~isGtp~--~~~~~G~y~~~vtatd~~G 49 (49)
T PF05345_consen 1 TYSLTTPTGGGLPSGLSLDPSTGTISGTPT--SSVQPGTYTFTVTATDGSG 49 (49)
T ss_pred CEEEEcCCCCCCCCcEEEeCCCCEEEeecC--CCccccEEEEEEEEEcCCC
Confidence 47761 12234578899999999987633 22 2368999999999753
No 24
>TIGR01965 VCBS_repeat VCBS repeat. This domain of about 100 residues is found multiple (up to 35) copies in long proteins from several species of Vibrio, Colwellia, Bradyrhizobium, and Shewanella (hence the name VCBS) and in smaller copy numbers in proteins from several other bacteria. The large protein size and repeat copy numbers, species distribution, and suggested activities of several member proteins suggests a role for this domain in adhesion.
Probab=88.10 E-value=6 Score=26.44 Aligned_cols=65 Identities=25% Similarity=0.336 Sum_probs=42.4
Q ss_pred CcCceEEeCCceEEEEC--------CcCCcCCCCeEEEEEEEEeCCCCceEEEEEEEEeecCCCCCccCCCceEEEEecC
Q psy11061 12 RTNHFYMKSVSGEICIA--------QDLDFESRSSYEFPVVATDRGGLSTTAMVRIQITDVNDNEPIFNPVEYNVSLRDD 83 (212)
Q Consensus 12 ~~~~F~id~~tg~i~~~--------~~lD~E~~~~~~l~i~a~d~~~~~~~~~v~i~V~d~nd~~P~f~~~~~~~~v~e~ 83 (212)
..+.|.|++ +|..... +.|.--+...-+|++.+.|. .+..|+|.|...|| +|+..-. -...+.|+
T Consensus 25 ~yGtlti~~-~G~wtYtl~n~~~avq~L~~Ge~~tdsFtvtv~DG----tt~~vtItI~GtND-apvi~~~-~~g~v~ED 97 (99)
T TIGR01965 25 QYGTFSIDA-DGQWTYQADNSQTAVQALKAGETLTDTFTVTSADG----TSQTVTITITGAND-AAVIGGA-DTGSVTED 97 (99)
T ss_pred CcEEEEECC-CCcEEEEeCCCcHHHHhhcCCCEEEEEEEEEEeCC----CeEEEEEEEEccCC-CCEEecc-cceeEecC
Confidence 356788888 7765533 22333334456788888885 38899999999999 7866432 23445544
No 25
>TIGR00845 caca sodium/calcium exchanger 1. This model is specific for the eukaryotic sodium ion/calcium ion exchangers of the Caca family
Probab=86.93 E-value=20 Score=33.72 Aligned_cols=49 Identities=14% Similarity=0.053 Sum_probs=31.3
Q ss_pred cCCCCCccCCCceEEEEecCCCCCcEEEEEEEEeCCCCCCceEEEEEEcCC
Q psy11061 64 VNDNEPIFNPVEYNVSLRDDIQTTTAFAVVLATDRDSDRFGTISYKIVTGK 114 (212)
Q Consensus 64 ~nd~~P~f~~~~~~~~v~e~~~~g~~v~~~~a~D~D~~~~~~i~y~i~~~~ 114 (212)
.||..+.+...+-...|.|+ .|+.-..+.-...|......+.|+..+|.
T Consensus 395 ~dd~~s~i~Fe~~~Y~V~En--~GtV~VtV~R~GGdl~~tVsVdY~T~DGT 443 (928)
T TIGR00845 395 ENDPVSKIFFEPGHYTCLEN--CGTVALTVVRRGGDLTNTVYVDYRTEDGT 443 (928)
T ss_pred ccCCcceEEecCCeEEEeec--CcEEEEEEEEccCCCCceEEEEEEccCCc
Confidence 44545555555567778887 46666666665545544456889887764
No 26
>TIGR03660 T1SS_rpt_143 T1SS-143 repeat domain. This model represents a domain of about 143 amino acids that may occur singly or in up to 23 tandem repeats in very large proteins in the genus Vibrio, and in related species such as Legionella pneumophila, Photobacterium profundum, Rhodopseudomonas palustris, Shewanella pealeana, and Aeromonas hydrophila. Proteins with these domains represent a subset of a broader set of proteins with a particular signal for type 1 secretion, consisting of several glycine-rich repeats modeled by pfam00353, followed by a C-terminal domain modeled by TIGR03661. Proteins with this domain tend to share several properties with the RtxA (Repeats in Toxin) protein of Vibrio cholerae, including a large size often containing tandemly repeated domains and a C-terminal signal for type 1 secretion.
Probab=86.25 E-value=5.2 Score=28.46 Aligned_cols=56 Identities=27% Similarity=0.536 Sum_probs=39.1
Q ss_pred EEEECCcCCcCC---CCeEEEEEEEEeCCCCceEEEEEEEEeecCCCCCccCCCceEEEEecC
Q psy11061 24 EICIAQDLDFES---RSSYEFPVVATDRGGLSTTAMVRIQITDVNDNEPIFNPVEYNVSLRDD 83 (212)
Q Consensus 24 ~i~~~~~lD~E~---~~~~~l~i~a~d~~~~~~~~~v~i~V~d~nd~~P~f~~~~~~~~v~e~ 83 (212)
.+.+.++||... .-...|.|.|+|..|-.....+.|.|.| | .|...... ...|.|.
T Consensus 69 tftL~~~lDH~~g~d~l~l~~~v~a~D~DGD~s~~~l~VtI~D--D-~P~~~~~~-~~~V~E~ 127 (137)
T TIGR03660 69 EFTLEGPLDHAAGSDELTLNFPIIATDFDGDTSSITLPVTIVD--D-VPTITDVD-ALTVDED 127 (137)
T ss_pred EEEEcccccCCCCCceEEEeeeEEEEeCCCCccccEEEEEEEC--C-CCeecccc-ceEEecc
Confidence 355778888854 3467888999998876566788888887 5 57765543 3667664
No 27
>PF07495 Y_Y_Y: Y_Y_Y domain; InterPro: IPR011123 This region is mostly found at the end of the beta propellers (IPR011110 from INTERPRO) in a family of two component regulators. However they are also found tandemly repeated in Q891H4 from SWISSPROT without other signal conduction domains being present. It is named after the conserved tyrosines found in the alignment. The exact function is not known.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=84.61 E-value=6.5 Score=23.50 Aligned_cols=53 Identities=23% Similarity=0.251 Sum_probs=30.1
Q ss_pred EEEEcCCCCCcCceEEeCCceEEEECCcCCcCCCCeEEEEEEEEeCCCCc--eEEEEEEEE
Q psy11061 3 NYTLGESPSRTNHFYMKSVSGEICIAQDLDFESRSSYEFPVVATDRGGLS--TTAMVRIQI 61 (212)
Q Consensus 3 ~Y~i~~~~~~~~~F~id~~tg~i~~~~~lD~E~~~~~~l~i~a~d~~~~~--~~~~v~i~V 61 (212)
.|+|...+ ..|..+...+-.+..+.. ....|+|.|+|.|..+.. ....+.|.|
T Consensus 11 ~Y~l~g~d--~~W~~~~~~~~~~~~~~L----~~G~Y~l~V~a~~~~~~~~~~~~~l~i~I 65 (66)
T PF07495_consen 11 RYRLEGFD--DEWITLGSYSNSISYTNL----PPGKYTLEVRAKDNNGKWSSDEKSLTITI 65 (66)
T ss_dssp EEEEETTE--SSEEEESSTS-EEEEES------SEEEEEEEEEEETTS-B-SS-EEEEEEE
T ss_pred EEEEECCC--CeEEECCCCcEEEEEEeC----CCEEEEEEEEEECCCCCcCcccEEEEEEE
Confidence 46564433 467666664424443322 357899999999987522 225666655
No 28
>PF12245 Big_3_2: Bacterial Ig-like domain (group 3); InterPro: IPR022038 This family of proteins is found in bacteria. They have two conserved sequence motifs: AGN and GMT.
Probab=82.53 E-value=7.5 Score=23.20 Aligned_cols=31 Identities=29% Similarity=0.491 Sum_probs=24.4
Q ss_pred CCeEEEEEEEEeCCCCceEEEEEEEEeecCC
Q psy11061 36 RSSYEFPVVATDRGGLSTTAMVRIQITDVND 66 (212)
Q Consensus 36 ~~~~~l~i~a~d~~~~~~~~~v~i~V~d~nd 66 (212)
...|++.+.|.|.-|........+.+.|..-
T Consensus 22 dg~yt~~v~a~D~AGN~~~~~~~~~i~d~~~ 52 (60)
T PF12245_consen 22 DGEYTLTVTATDKAGNTSSSTTQIVIVDNTA 52 (60)
T ss_pred CccEEEEEEEEECCCCEEEeeeEEEEEcCCC
Confidence 5789999999999888777777776666543
No 29
>PF05895 DUF859: Siphovirus protein of unknown function (DUF859); InterPro: IPR008577 This entry is represented by Streptococcus phage 7201, Orf39. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This family consists of several uncharacterised proteins from a number of the Siphoviruses as well as some bacterial proteins from Streptococcus species. Some of the members of this family are described as putative minor structural proteins.
Probab=79.71 E-value=47 Score=30.00 Aligned_cols=109 Identities=15% Similarity=0.135 Sum_probs=61.0
Q ss_pred CCeEEEEEEEEeCCCCc-eEEEEEEEEeecCCCCCccCCCceEEEEecCCCCCcEEEEEEE------EeCCCCCCceEEE
Q psy11061 36 RSSYEFPVVATDRGGLS-TTAMVRIQITDVNDNEPIFNPVEYNVSLRDDIQTTTAFAVVLA------TDRDSDRFGTISY 108 (212)
Q Consensus 36 ~~~~~l~i~a~d~~~~~-~~~~v~i~V~d~nd~~P~f~~~~~~~~v~e~~~~g~~v~~~~a------~D~D~~~~~~i~y 108 (212)
....++++.|+|..|+. .....+|+|.+... |.+....+... +.. ........| .+.-.....++.|
T Consensus 297 ~G~~Ti~atVtDSRGr~S~~~~~tItVl~Y~~--P~lsfsv~R~~--~~~--~~~~v~~~a~Iapl~v~g~qKN~~~lt~ 370 (624)
T PF05895_consen 297 SGSATIRATVTDSRGRTSDPKTKTITVLEYSP--PTLSFSVYRCG--SSG--NTLTVTRNAKIAPLTVNGVQKNTMTLTF 370 (624)
T ss_pred CceEEEEEEEEECCCccCCceEEEEEEEEcCC--CcEEEEEEEeC--CCC--cEEEEEEEEEEeEEEEcccccceEEEEE
Confidence 46899999999998864 57889999998754 76632211111 111 111111111 1221112225666
Q ss_pred EEEcCCCCCcEEEeCC-------------ccEEEEeeecCCCCCCeEEEEEEEEECCC
Q psy11061 109 KIVTGKDANLFRIDRS-------------SGELFVTRGNFLSRSNSYHINVSAMDGGG 153 (212)
Q Consensus 109 ~i~~~~~~~~F~id~~-------------~G~i~~~~~~~ld~~~~~~l~v~a~D~~~ 153 (212)
+...-+ ...|.+|.. .+...+... ++....|.+++.++|.-.
T Consensus 371 ~~a~~g-t~~~t~d~~~a~~~~s~~s~~~~~~~~L~g~--y~~~kSy~V~~~l~D~F~ 425 (624)
T PF05895_consen 371 KVAPLG-TGTFTTDNGSASGTWSSISELTNSSANLGGT--YDAEKSYDVRGTLSDKFT 425 (624)
T ss_pred EEEEcC-cceEEEEccccccceeeeeeecccceeeccc--cCCCceEEEEEEEEEEee
Confidence 665422 245555521 112333344 668899999999999763
No 30
>KOG4221|consensus
Probab=79.40 E-value=21 Score=34.54 Aligned_cols=56 Identities=21% Similarity=0.359 Sum_probs=37.5
Q ss_pred CcCceEEeCCceEEEECCcCCcCCCCeEEEEEEEEeCCCC-ceEEEEEE-EEeecCCCCCc
Q psy11061 12 RTNHFYMKSVSGEICIAQDLDFESRSSYEFPVVATDRGGL-STTAMVRI-QITDVNDNEPI 70 (212)
Q Consensus 12 ~~~~F~id~~tg~i~~~~~lD~E~~~~~~l~i~a~d~~~~-~~~~~v~i-~V~d~nd~~P~ 70 (212)
.+.++.++.++-+.++. +.|....|.+.+.|..+.|. .++..++| ...|+=+.+|.
T Consensus 563 ~~~~~~~~~n~~e~ti~---gL~k~TeY~~~vvA~N~~G~g~sS~~i~V~Tlsd~PsaPP~ 620 (1381)
T KOG4221|consen 563 TGKELRVENNATEYTIN---GLEKYTEYSIRVVAYNSAGSGVSSADITVRTLSDVPSAPPQ 620 (1381)
T ss_pred CCceEEEecCccEEEee---cCCCccceEEEEEEecCCCCCCCCCceEEEeccCCCCCCCc
Confidence 56899999887777776 34566789999999998652 23344444 34455444443
No 31
>KOG3597|consensus
Probab=76.38 E-value=49 Score=28.47 Aligned_cols=97 Identities=18% Similarity=0.256 Sum_probs=61.6
Q ss_pred eEEEEEEEEeecCCCCCccCCCceEEEEecCCCCCcEEEEEEEEeCCCCCCceEEEEEEcCCC----CCcEEEeC-----
Q psy11061 53 TTAMVRIQITDVNDNEPIFNPVEYNVSLRDDIQTTTAFAVVLATDRDSDRFGTISYKIVTGKD----ANLFRIDR----- 123 (212)
Q Consensus 53 ~~~~v~i~V~d~nd~~P~f~~~~~~~~v~e~~~~g~~v~~~~a~D~D~~~~~~i~y~i~~~~~----~~~F~id~----- 123 (212)
.+....|.|.-+||.+..+....+.+.+.|+...-.....+.+.|+|.++.. +.|++..... .+.|..-.
T Consensus 24 ~~~~~~i~v~pvndpp~~~~~~~~~l~~~~~~~k~l~~~~l~~~d~d~~~~~-l~f~v~~t~~~~~~~~~~~~~g~~~~~ 102 (442)
T KOG3597|consen 24 QTDVLRIHVNPVNDPPSLIFPSGSLLVILEGGQKVLDPELLTAADPDSAPLP-LEFQVLGTSSVPLPVLKFDVPGAPATE 102 (442)
T ss_pred EEeeecccccccCCCcceeecccceEEeecCCceeccceEeeccCCCCCccc-eEEEEccCCCCCCccceeeccCCcccc
Confidence 5566789999999977766666667888888665455567888898887543 7777765311 11233321
Q ss_pred ------CccEEEEeeecCCCCCCeEEEEEEEEECC
Q psy11061 124 ------SSGELFVTRGNFLSRSNSYHINVSAMDGG 152 (212)
Q Consensus 124 ------~~G~i~~~~~~~ld~~~~~~l~v~a~D~~ 152 (212)
..|.+.+...+. ......++..++|+-
T Consensus 103 Fs~~~v~~g~~~yvh~g~--el~~~~~~~~~SDg~ 135 (442)
T KOG3597|consen 103 FSYEEVEDGSLSYVHSGT--ELRESELQLRVSDGL 135 (442)
T ss_pred eEehHhhcCceeEEecCc--ccccceEEEEeecce
Confidence 234554444321 456777888888854
No 32
>TIGR00845 caca sodium/calcium exchanger 1. This model is specific for the eukaryotic sodium ion/calcium ion exchangers of the Caca family
Probab=75.50 E-value=75 Score=30.15 Aligned_cols=56 Identities=23% Similarity=0.282 Sum_probs=32.9
Q ss_pred EEEEEEEeecCCCCCccCCCceEEEEecCCCCCcEEEEEEEEeCCCCCCceEEEEEEcCC
Q psy11061 55 AMVRIQITDVNDNEPIFNPVEYNVSLRDDIQTTTAFAVVLATDRDSDRFGTISYKIVTGK 114 (212)
Q Consensus 55 ~~v~i~V~d~nd~~P~f~~~~~~~~v~e~~~~g~~v~~~~a~D~D~~~~~~i~y~i~~~~ 114 (212)
...+|+|.| ||++|.|....-...|.|+. |..-..+.-+-. ....-.+.|+...|.
T Consensus 515 s~ATVTIlD-DD~aGIfsFe~~~~sV~Es~--G~vtvtV~RtsG-a~G~VtV~Y~T~dGT 570 (928)
T TIGR00845 515 NTATVTILD-DDHAGIFTFEEDVFHVSESI--GIMEVKVLRTSG-ARGTVIVPYRTVEGT 570 (928)
T ss_pred ceEEEEEec-CcccCcccccCceEEEEcCC--CEEEEEEEEcCC-CCeeEEEEEEeecCc
Confidence 355666676 77788877666677888873 444444433322 111235678777664
No 33
>cd02848 Chitinase_N_term Chitinase N-terminus domain. Chitinases hydrolyze the abundant natural biopolymer chitin, producing smaller chito-oligosaccharides. Chitin consists of multiple N-acetyl-D-glucosamine (NAG) residues connected via beta-1,4-glycosidic linkages and is an important structural element of fungal cell wall and arthropod exoskeletons. On the basis of the mode of chitin hydrolysis, chitinases are classified as random, endo-, and exo-chitinases and based on sequence criteria, chitinases belong to families 18 and 19 of glycosyl hydrolases. The N-terminus of chitinase may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitob
Probab=75.49 E-value=6.1 Score=26.67 Aligned_cols=33 Identities=24% Similarity=0.382 Sum_probs=27.1
Q ss_pred CCcCCCCeEEEEEEEEeCCCCceEEEEEEEEee
Q psy11061 31 LDFESRSSYEFPVVATDRGGLSTTAMVRIQITD 63 (212)
Q Consensus 31 lD~E~~~~~~l~i~a~d~~~~~~~~~v~i~V~d 63 (212)
+++.+...|.+.|++.|..|.+.+..+.|.|-|
T Consensus 74 ~~v~kgG~y~m~V~lCn~dGCS~S~~~~I~VAD 106 (106)
T cd02848 74 FKVGKGGRYQMQVALCNGDGCSTSAAKEIVVAD 106 (106)
T ss_pred EEeCCCCeEEEEEEEECCCCccCcCCEEEEecC
Confidence 556677899999999999888888888887765
No 34
>PF13754 Big_3_4: Bacterial Ig-like domain (group 3)
Probab=75.01 E-value=14 Score=21.46 Aligned_cols=19 Identities=37% Similarity=0.646 Sum_probs=16.4
Q ss_pred CCCeEEEEEEEEECCCCCe
Q psy11061 138 RSNSYHINVSAMDGGGNKC 156 (212)
Q Consensus 138 ~~~~~~l~v~a~D~~~~~~ 156 (212)
..+.|.+.+.|+|..|...
T Consensus 22 ~dG~y~itv~a~D~AGN~s 40 (54)
T PF13754_consen 22 ADGTYTITVTATDAAGNTS 40 (54)
T ss_pred CCccEEEEEEEEeCCCCCC
Confidence 6788999999999987764
No 35
>PF08329 ChitinaseA_N: Chitinase A, N-terminal domain; InterPro: IPR013540 This domain is found in a number of bacterial chitinases and similar viral proteins. It is organised into a fibronectin III module domain-like fold, comprising only beta strands. Its function is not known, but it may be involved in interaction with the enzyme substrate, chitin [, ]. It is separated by a hinge region from the catalytic domain (IPR001223 from INTERPRO); this hinge region is probably mobile, allowing the N-terminal domain to have different relative positions in solution []. ; GO: 0004568 chitinase activity; PDB: 2WLY_A 1EDQ_A 2WM0_A 1X6N_A 1NH6_A 2WK2_A 1EHN_A 2WLZ_A 1EIB_A 1FFR_A ....
Probab=74.44 E-value=28 Score=24.64 Aligned_cols=32 Identities=22% Similarity=0.531 Sum_probs=23.0
Q ss_pred CCCeEEEEEEEEECCCCCeeeeEEEEEEEeeCCC
Q psy11061 138 RSNSYHINVSAMDGGGNKCSQDAQVNINIINSHM 171 (212)
Q Consensus 138 ~~~~~~l~v~a~D~~~~~~~~~~~v~I~v~d~nd 171 (212)
..+.|.++|+++|..| ++....+.|.|.|...
T Consensus 81 ~gG~y~~~VeLCN~~G--CS~S~~~~V~VaDTDG 112 (133)
T PF08329_consen 81 KGGRYQMQVELCNADG--CSTSAPVEVVVADTDG 112 (133)
T ss_dssp S-EEEEEEEEEEETTE--EEE---EEEEEE-TTS
T ss_pred CCCEEEEEEEEECCCC--cccCCCEEEEEeCCCc
Confidence 7899999999999887 5666788888888764
No 36
>PF12245 Big_3_2: Bacterial Ig-like domain (group 3); InterPro: IPR022038 This family of proteins is found in bacteria. They have two conserved sequence motifs: AGN and GMT.
Probab=73.11 E-value=17 Score=21.66 Aligned_cols=31 Identities=26% Similarity=0.503 Sum_probs=21.7
Q ss_pred CCCeEEEEEEEEECCCCCeeeeEEEEEEEeeCC
Q psy11061 138 RSNSYHINVSAMDGGGNKCSQDAQVNINIINSH 170 (212)
Q Consensus 138 ~~~~~~l~v~a~D~~~~~~~~~~~v~I~v~d~n 170 (212)
..+.|.+.+.|+|..|... .....+.+.|..
T Consensus 21 ~dg~yt~~v~a~D~AGN~~--~~~~~~~i~d~~ 51 (60)
T PF12245_consen 21 ADGEYTLTVTATDKAGNTS--SSTTQIVIVDNT 51 (60)
T ss_pred CCccEEEEEEEEECCCCEE--EeeeEEEEEcCC
Confidence 3678999999999998763 344455555543
No 37
>PF03160 Calx-beta: Calx-beta domain; InterPro: IPR003644 The calx-beta motif is present as a tandem repeat in the cytoplasmic domains of Calx Na-Ca exchangers, which are used to expel calcium from cells. This motif overlaps domains used for calcium binding and regulation. The calx-beta motif is also present in the cytoplasmic tail of mammalian integrin-beta4, which mediates the bi-directional transfer of signals across the plasma membrane, as well as in some cyanobacterial proteins. This motif contains a series of beta-strands and turns that form a self-contained beta-sheet [, ].; GO: 0007154 cell communication, 0016021 integral to membrane; PDB: 3H6A_B 3FSO_A 3FQ4_B 2DPK_A 2QVM_A 3GIN_B 2QVK_A 2FWU_A 2FWS_A 3E9U_A ....
Probab=69.45 E-value=29 Score=22.68 Aligned_cols=53 Identities=21% Similarity=0.266 Sum_probs=30.0
Q ss_pred EEEEeecCCCCCccCCCceEEEEecCCCCCcEEEEEEEEeCCCCCCceEEEEEEcCC
Q psy11061 58 RIQITDVNDNEPIFNPVEYNVSLRDDIQTTTAFAVVLATDRDSDRFGTISYKIVTGK 114 (212)
Q Consensus 58 ~i~V~d~nd~~P~f~~~~~~~~v~e~~~~g~~v~~~~a~D~D~~~~~~i~y~i~~~~ 114 (212)
+|.|.| ||. |.+....-...+.|+. |.....+....++....-.+.|....+.
T Consensus 2 tvtI~d-~d~-~~v~f~~~~~~v~E~~--~~~~v~V~~~~~~~~~~v~v~~~~~~gt 54 (100)
T PF03160_consen 2 TVTILD-DDD-PTVSFSSPSYTVSEGD--GTVTVTVTRSGGSLDGPVTVNYSTVDGT 54 (100)
T ss_dssp EEEEE--TTS-EEEEESSSEEEEETTS--SEEEEEEEEESS-TSSEEEEEEEEEESS
T ss_pred EEEEEC-CCC-CEEEEeCCEEEEEeCC--CEEEEEEEEcccCCCcceEEEEEEeCCc
Confidence 466777 664 4766555667788874 3444555555443233345677776653
No 38
>PF13753 SWM_repeat: Putative flagellar system-associated repeat
Probab=68.70 E-value=62 Score=26.27 Aligned_cols=110 Identities=25% Similarity=0.319 Sum_probs=58.3
Q ss_pred CCeEEEEEEEEeCCCCceEEEEEEEEeecCCCCCccCCCceEEEEecCC------CCCcEEEEEEEEeCCCCCCceEEEE
Q psy11061 36 RSSYEFPVVATDRGGLSTTAMVRIQITDVNDNEPIFNPVEYNVSLRDDI------QTTTAFAVVLATDRDSDRFGTISYK 109 (212)
Q Consensus 36 ~~~~~l~i~a~d~~~~~~~~~v~i~V~d~nd~~P~f~~~~~~~~v~e~~------~~g~~v~~~~a~D~D~~~~~~i~y~ 109 (212)
...|.+.+.+.|..|........+.|-- .+|...-. .+..+. ......+...+.+.+. +..+.+.
T Consensus 11 d~~~~v~vt~tD~aGN~~~~t~~~~vDt---~~P~v~i~----~~~~~~~~~~~~~~~~~t~s~tvs~~~~--g~~v~v~ 81 (317)
T PF13753_consen 11 DGTYTVSVTVTDAAGNTSTATQSITVDT---TAPTVTIT----SIADDDIINGDEATNTVTFSGTVSGAEP--GSTVTVT 81 (317)
T ss_pred CCcEEEEEEEEeCCCCeeeeeEEEEEec---CCCceeee----cccCCCccccceeeeeeEEEEEecCCCC--CCEEEEE
Confidence 4679999999999887666655555332 26644322 111111 1122234444444333 3345555
Q ss_pred EEcCCCCCcEEEeCCccEEEEeeecCCC-CCCeEEEEEE-EEECCCCCee
Q psy11061 110 IVTGKDANLFRIDRSSGELFVTRGNFLS-RSNSYHINVS-AMDGGGNKCS 157 (212)
Q Consensus 110 i~~~~~~~~F~id~~~G~i~~~~~~~ld-~~~~~~l~v~-a~D~~~~~~~ 157 (212)
+ .+.. ..+..+ ..|.+.......-. ..+.|.+.+. ++|..|....
T Consensus 82 ~-~g~~-~t~~~~-~~G~ws~t~~~~~~l~~g~~ti~v~~~tD~aGN~~t 128 (317)
T PF13753_consen 82 I-NGTT-GTLTAD-ADGNWSVTVTPSDDLPDGDYTITVTTVTDAAGNTST 128 (317)
T ss_pred E-CCEE-EEEEEe-cCCcEEEeeccccccccCcceeEEEEEEccCCcccc
Confidence 5 2221 333343 45655444431002 5678999999 9998876543
No 39
>cd00146 PKD polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.
Probab=68.68 E-value=15 Score=22.85 Aligned_cols=30 Identities=17% Similarity=0.289 Sum_probs=22.3
Q ss_pred CCcCCCCeEEEEEEEEeCCCCceEEEEEEE
Q psy11061 31 LDFESRSSYEFPVVATDRGGLSTTAMVRIQ 60 (212)
Q Consensus 31 lD~E~~~~~~l~i~a~d~~~~~~~~~v~i~ 60 (212)
..|.....|.+++.|.|..+.+......|.
T Consensus 51 ~~y~~~G~y~v~l~v~d~~g~~~~~~~~V~ 80 (81)
T cd00146 51 HTYTKPGTYTVTLTVTNAVGSSSTKTTTVV 80 (81)
T ss_pred EEcCCCcEEEEEEEEEeCCCCEEEEEEEEE
Confidence 446677899999999998776665565554
No 40
>TIGR03660 T1SS_rpt_143 T1SS-143 repeat domain. This model represents a domain of about 143 amino acids that may occur singly or in up to 23 tandem repeats in very large proteins in the genus Vibrio, and in related species such as Legionella pneumophila, Photobacterium profundum, Rhodopseudomonas palustris, Shewanella pealeana, and Aeromonas hydrophila. Proteins with these domains represent a subset of a broader set of proteins with a particular signal for type 1 secretion, consisting of several glycine-rich repeats modeled by pfam00353, followed by a C-terminal domain modeled by TIGR03661. Proteins with this domain tend to share several properties with the RtxA (Repeats in Toxin) protein of Vibrio cholerae, including a large size often containing tandemly repeated domains and a C-terminal signal for type 1 secretion.
Probab=66.12 E-value=45 Score=23.71 Aligned_cols=43 Identities=19% Similarity=0.362 Sum_probs=29.5
Q ss_pred eEEEEEEEEECCCCCeeeeEEEEEEEeeCCCCCCeEecCceEEEEeCCC
Q psy11061 141 SYHINVSAMDGGGNKCSQDAQVNINIINSHMPIPLFQQSTYSFVVPEDV 189 (212)
Q Consensus 141 ~~~l~v~a~D~~~~~~~~~~~v~I~v~d~nd~~P~f~~~~~~~~v~e~~ 189 (212)
...|.|.|+|..+... ...+.|+|.| | .|...... ..+|.|+.
T Consensus 86 ~l~~~v~a~D~DGD~s--~~~l~VtI~D--D-~P~~~~~~-~~~V~E~~ 128 (137)
T TIGR03660 86 TLNFPIIATDFDGDTS--SITLPVTIVD--D-VPTITDVD-ALTVDEDD 128 (137)
T ss_pred EEeeeEEEEeCCCCcc--ccEEEEEEEC--C-CCeecccc-ceEEeccc
Confidence 4567899999887653 4578888887 3 47765544 36777753
No 41
>PF02494 HYR: HYR domain; InterPro: IPR003410 This domain is known as the HYR (Hyalin Repeat) domain, after the protein hyalin that is composed exclusively of this repeat. This domain probably corresponds to a new superfamily in the immunoglobulin fold. The function of this domain is uncertain it may be involved in cell adhesion. In the Sushi repeat-containing protein (SrpX), this domain is found between two sushi repeats.
Probab=65.19 E-value=16 Score=22.94 Aligned_cols=26 Identities=23% Similarity=0.266 Sum_probs=22.4
Q ss_pred CCeEEEEEEEEeCCCCceEEEEEEEE
Q psy11061 36 RSSYEFPVVATDRGGLSTTAMVRIQI 61 (212)
Q Consensus 36 ~~~~~l~i~a~d~~~~~~~~~v~i~V 61 (212)
...+.++..|.|..|+.+.+.+.|.|
T Consensus 56 ~G~t~V~ytA~D~~GN~a~C~f~V~V 81 (81)
T PF02494_consen 56 VGTTTVTYTATDAAGNSATCSFTVTV 81 (81)
T ss_pred eceEEEEEEEEECCCCEEEEEEEEEC
Confidence 45899999999998888899988875
No 42
>cd02848 Chitinase_N_term Chitinase N-terminus domain. Chitinases hydrolyze the abundant natural biopolymer chitin, producing smaller chito-oligosaccharides. Chitin consists of multiple N-acetyl-D-glucosamine (NAG) residues connected via beta-1,4-glycosidic linkages and is an important structural element of fungal cell wall and arthropod exoskeletons. On the basis of the mode of chitin hydrolysis, chitinases are classified as random, endo-, and exo-chitinases and based on sequence criteria, chitinases belong to families 18 and 19 of glycosyl hydrolases. The N-terminus of chitinase may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitob
Probab=65.05 E-value=27 Score=23.62 Aligned_cols=29 Identities=28% Similarity=0.636 Sum_probs=21.1
Q ss_pred CCCeEEEEEEEEECCCCCeeeeEEEEEEEee
Q psy11061 138 RSNSYHINVSAMDGGGNKCSQDAQVNINIIN 168 (212)
Q Consensus 138 ~~~~~~l~v~a~D~~~~~~~~~~~v~I~v~d 168 (212)
+.+.|.++|.++|..+ ++..+.+.|.|-|
T Consensus 78 kgG~y~m~V~lCn~dG--CS~S~~~~I~VAD 106 (106)
T cd02848 78 KGGRYQMQVALCNGDG--CSTSAAKEIVVAD 106 (106)
T ss_pred CCCeEEEEEEEECCCC--ccCcCCEEEEecC
Confidence 6899999999999877 3444556665543
No 43
>smart00089 PKD Repeats in polycystic kidney disease 1 (PKD1) and other proteins. Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.
Probab=62.64 E-value=26 Score=21.64 Aligned_cols=30 Identities=20% Similarity=0.395 Sum_probs=22.5
Q ss_pred CCcCCCCeEEEEEEEEeCCCCceEEEEEEEE
Q psy11061 31 LDFESRSSYEFPVVATDRGGLSTTAMVRIQI 61 (212)
Q Consensus 31 lD~E~~~~~~l~i~a~d~~~~~~~~~v~i~V 61 (212)
.-|+....|.+++.+.|..| +.+..+.|.|
T Consensus 49 ~~y~~~G~y~v~l~v~n~~g-~~~~~~~i~v 78 (79)
T smart00089 49 HTYTKPGTYTVTLTVTNAVG-SASATVTVVV 78 (79)
T ss_pred EEeCCCcEEEEEEEEEcCCC-cEEEEEEEEE
Confidence 45566789999999999877 5566666655
No 44
>PF13750 Big_3_3: Bacterial Ig-like domain (group 3)
Probab=62.37 E-value=25 Score=25.60 Aligned_cols=30 Identities=30% Similarity=0.478 Sum_probs=25.3
Q ss_pred cCCCCeEEEEEEEEeCCCCceEEEEEEEEe
Q psy11061 33 FESRSSYEFPVVATDRGGLSTTAMVRIQIT 62 (212)
Q Consensus 33 ~E~~~~~~l~i~a~d~~~~~~~~~v~i~V~ 62 (212)
.|..+.|+|++.|.|..|...+..+.+...
T Consensus 119 le~~~~YtLtV~a~D~aGN~~~~si~F~y~ 148 (158)
T PF13750_consen 119 LEADDSYTLTVSATDKAGNQSTKSISFSYM 148 (158)
T ss_pred cCCCCeEEEEEEEEecCCCEEEEEEEEEEe
Confidence 367789999999999998888888877665
No 45
>smart00089 PKD Repeats in polycystic kidney disease 1 (PKD1) and other proteins. Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.
Probab=61.24 E-value=25 Score=21.67 Aligned_cols=28 Identities=18% Similarity=0.468 Sum_probs=20.7
Q ss_pred CCCCCeEEEEEEEEECCCCCeeeeEEEEEEE
Q psy11061 136 LSRSNSYHINVSAMDGGGNKCSQDAQVNINI 166 (212)
Q Consensus 136 ld~~~~~~l~v~a~D~~~~~~~~~~~v~I~v 166 (212)
+...+.|.+++.++|..+ +..+++.|.|
T Consensus 51 y~~~G~y~v~l~v~n~~g---~~~~~~~i~v 78 (79)
T smart00089 51 YTKPGTYTVTLTVTNAVG---SASATVTVVV 78 (79)
T ss_pred eCCCcEEEEEEEEEcCCC---cEEEEEEEEE
Confidence 337799999999999876 3455666655
No 46
>cd00146 PKD polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.
Probab=56.95 E-value=26 Score=21.75 Aligned_cols=20 Identities=10% Similarity=0.458 Sum_probs=16.0
Q ss_pred CCCCCeEEEEEEEEECCCCC
Q psy11061 136 LSRSNSYHINVSAMDGGGNK 155 (212)
Q Consensus 136 ld~~~~~~l~v~a~D~~~~~ 155 (212)
+...+.|.+++.|+|..+..
T Consensus 53 y~~~G~y~v~l~v~d~~g~~ 72 (81)
T cd00146 53 YTKPGTYTVTLTVTNAVGSS 72 (81)
T ss_pred cCCCcEEEEEEEEEeCCCCE
Confidence 44789999999999986543
No 47
>KOG4221|consensus
Probab=56.77 E-value=2.1e+02 Score=28.28 Aligned_cols=67 Identities=19% Similarity=0.335 Sum_probs=46.9
Q ss_pred EEEEEEc--CCCCCcEEEeCCccEEEEeeecCCCCCCeEEEEEEEEECCCCCeeeeEEEEEEEeeCCCCCCe
Q psy11061 106 ISYKIVT--GKDANLFRIDRSSGELFVTRGNFLSRSNSYHINVSAMDGGGNKCSQDAQVNINIINSHMPIPL 175 (212)
Q Consensus 106 i~y~i~~--~~~~~~F~id~~~G~i~~~~~~~ld~~~~~~l~v~a~D~~~~~~~~~~~v~I~v~d~nd~~P~ 175 (212)
..|++.. ++....+.++.++-+.++. . |++...|.+.|.|.+..|...++...-.++..|+-+.||.
T Consensus 552 ~~yk~~ys~~~~~~~~~~~~n~~e~ti~-g--L~k~TeY~~~vvA~N~~G~g~sS~~i~V~Tlsd~PsaPP~ 620 (1381)
T KOG4221|consen 552 TGYKLFYSEDDTGKELRVENNATEYTIN-G--LEKYTEYSIRVVAYNSAGSGVSSADITVRTLSDVPSAPPQ 620 (1381)
T ss_pred eEEEEEEEcCCCCceEEEecCccEEEee-c--CCCccceEEEEEEecCCCCCCCCCceEEEeccCCCCCCCc
Confidence 3454433 2455688888766666555 3 7789999999999998887655444445567788888887
No 48
>PF08329 ChitinaseA_N: Chitinase A, N-terminal domain; InterPro: IPR013540 This domain is found in a number of bacterial chitinases and similar viral proteins. It is organised into a fibronectin III module domain-like fold, comprising only beta strands. Its function is not known, but it may be involved in interaction with the enzyme substrate, chitin [, ]. It is separated by a hinge region from the catalytic domain (IPR001223 from INTERPRO); this hinge region is probably mobile, allowing the N-terminal domain to have different relative positions in solution []. ; GO: 0004568 chitinase activity; PDB: 2WLY_A 1EDQ_A 2WM0_A 1X6N_A 1NH6_A 2WK2_A 1EHN_A 2WLZ_A 1EIB_A 1FFR_A ....
Probab=47.38 E-value=25 Score=24.86 Aligned_cols=37 Identities=22% Similarity=0.372 Sum_probs=25.8
Q ss_pred cCCcCCCCeEEEEEEEEeCCCCceEEEEEEEEeecCC
Q psy11061 30 DLDFESRSSYEFPVVATDRGGLSTTAMVRIQITDVND 66 (212)
Q Consensus 30 ~lD~E~~~~~~l~i~a~d~~~~~~~~~v~i~V~d~nd 66 (212)
.+.......|++.+++.+..|.+.+..+.|.|.|..-
T Consensus 76 ~~~~~~gG~y~~~VeLCN~~GCS~S~~~~V~VaDTDG 112 (133)
T PF08329_consen 76 TFTVTKGGRYQMQVELCNADGCSTSAPVEVVVADTDG 112 (133)
T ss_dssp EEEE-S-EEEEEEEEEEETTEEEE---EEEEEE-TTS
T ss_pred EEEecCCCEEEEEEEEECCCCcccCCCEEEEEeCCCc
Confidence 3555567899999999999888888899999998765
No 49
>TIGR00864 PCC polycystin cation channel protein. Note: this model has been restricted to the amino half because for technical reasons.
Probab=39.91 E-value=5.4e+02 Score=28.16 Aligned_cols=30 Identities=17% Similarity=0.390 Sum_probs=21.1
Q ss_pred CcCCCCeEEEEEEEEeCCCCceEEEEEEEEe
Q psy11061 32 DFESRSSYEFPVVATDRGGLSTTAMVRIQIT 62 (212)
Q Consensus 32 D~E~~~~~~l~i~a~d~~~~~~~~~v~i~V~ 62 (212)
-|.....|.+++.|.+..|. .+..+.|.|.
T Consensus 1479 tY~~~GtYtVtLTvtN~~Gs-st~T~~VtV~ 1508 (2740)
T TIGR00864 1479 AFNSPGDFNIRLAAANEVGK-NEATLNVAVK 1508 (2740)
T ss_pred EcCCCceEEEEEEEECCCCc-eEEEEEEEEe
Confidence 45567789999999998764 3445566665
No 50
>COG4288 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=33.62 E-value=1.5e+02 Score=20.06 Aligned_cols=60 Identities=13% Similarity=0.294 Sum_probs=36.5
Q ss_pred CCeEEEEEEEEeCCC-------CceEEEEEEEEeecCCCCCccCCCceEEEEecCCCCCcEEEEEEEEeC
Q psy11061 36 RSSYEFPVVATDRGG-------LSTTAMVRIQITDVNDNEPIFNPVEYNVSLRDDIQTTTAFAVVLATDR 98 (212)
Q Consensus 36 ~~~~~l~i~a~d~~~-------~~~~~~v~i~V~d~nd~~P~f~~~~~~~~v~e~~~~g~~v~~~~a~D~ 98 (212)
.....+++.-.|+.+ ..+..++++...|.+|-.+.-....|..-+..+.+ .|.+++--|.
T Consensus 42 ~AkvrI~i~fsDr~pvgpyevevaarrt~hlRfndL~dpe~iP~d~~yasviesnvP---vVvQ~tRLDs 108 (124)
T COG4288 42 PAKVRITILFSDREPVGPYEVEVAARRTLHLRFNDLGDPEAIPKDTPYASVIESNVP---VVVQLTRLDS 108 (124)
T ss_pred CceEEEEEEeccCCCCCceEEEeecceeEEEEecccCCcccCCCCCchhhheecCCc---eEEEEEEecc
Confidence 345556666666542 13456788889999986666666667776666654 3444444443
No 51
>KOG3597|consensus
Probab=27.99 E-value=1e+02 Score=26.61 Aligned_cols=50 Identities=24% Similarity=0.133 Sum_probs=35.3
Q ss_pred eeEEEEEEEeeCCCCCCeEecCceEEEEeCCCCCCeEEEEEEEEeCCCCC
Q psy11061 158 QDAQVNINIINSHMPIPLFQQSTYSFVVPEDVFKNSIVGTIKAATSDSGK 207 (212)
Q Consensus 158 ~~~~v~I~v~d~nd~~P~f~~~~~~~~v~e~~~~g~~v~~v~a~D~D~~~ 207 (212)
.+....|.|..+||.+..+....+.+.+.|+...=.-...+.+.|+|...
T Consensus 24 ~~~~~~i~v~pvndpp~~~~~~~~~l~~~~~~~k~l~~~~l~~~d~d~~~ 73 (442)
T KOG3597|consen 24 QTDVLRIHVNPVNDPPSLIFPSGSLLVILEGGQKVLDPELLTAADPDSAP 73 (442)
T ss_pred EEeeecccccccCCCcceeecccceEEeecCCceeccceEeeccCCCCCc
Confidence 35668899999998877777666667777765444444677788888654
No 52
>PF14157 YmzC: YmzC-like protein; PDB: 3KVP_E.
Probab=27.80 E-value=44 Score=20.18 Aligned_cols=18 Identities=17% Similarity=0.300 Sum_probs=13.1
Q ss_pred cCceEEeCCceEEEECCc
Q psy11061 13 TNHFYMKSVSGEICIAQD 30 (212)
Q Consensus 13 ~~~F~id~~tg~i~~~~~ 30 (212)
-..|..|+.+++|.+.+.
T Consensus 41 iKIfkyd~~tNei~L~KE 58 (63)
T PF14157_consen 41 IKIFKYDEDTNEITLKKE 58 (63)
T ss_dssp EEEEEEETTTTEEEEEEE
T ss_pred EEEEEeCCCCCeEEEEEe
Confidence 457777888888877654
No 53
>PF13860 FlgD_ig: FlgD Ig-like domain; PDB: 3C12_A 3OSV_A.
Probab=26.49 E-value=1.7e+02 Score=18.29 Aligned_cols=15 Identities=40% Similarity=0.775 Sum_probs=11.3
Q ss_pred CCCeEEEEEEEEECC
Q psy11061 138 RSNSYHINVSAMDGG 152 (212)
Q Consensus 138 ~~~~~~l~v~a~D~~ 152 (212)
..+.|.+.|.++|.+
T Consensus 67 ~~G~Y~~~v~a~~~g 81 (81)
T PF13860_consen 67 PDGTYTFRVTATDGG 81 (81)
T ss_dssp -SEEEEEEEEEEET-
T ss_pred CCCCEEEEEEEEeCC
Confidence 578899999998864
No 54
>COG1470 Predicted membrane protein [Function unknown]
Probab=24.98 E-value=4.8e+02 Score=22.93 Aligned_cols=72 Identities=19% Similarity=0.280 Sum_probs=42.3
Q ss_pred ccEEEEeeecCCCCCCeEEEEEEEEECCCCCeeeeEEEEEEEeeCCC--------CCCeEecC-----ceEEEEeCCCCC
Q psy11061 125 SGELFVTRGNFLSRSNSYHINVSAMDGGGNKCSQDAQVNINIINSHM--------PIPLFQQS-----TYSFVVPEDVFK 191 (212)
Q Consensus 125 ~G~i~~~~~~~ld~~~~~~l~v~a~D~~~~~~~~~~~v~I~v~d~nd--------~~P~f~~~-----~~~~~v~e~~~~ 191 (212)
+|.++++.. ....-.+.+...+.|+.+. .-|.+.|.+..+ -.|...+. ..+..+++++..
T Consensus 386 ~g~~~lt~t----aGee~~i~i~I~NsGna~L---tdIkl~v~~PqgWei~Vd~~~I~sL~pge~~tV~ltI~vP~~a~a 458 (513)
T COG1470 386 NGPYRLTIT----AGEEKTIRISIENSGNAPL---TDIKLTVNGPQGWEIEVDESTIPSLEPGESKTVSLTITVPEDAGA 458 (513)
T ss_pred CCcEEEEec----CCccceEEEEEEecCCCcc---ceeeEEecCCccceEEECcccccccCCCCcceEEEEEEcCCCCCC
Confidence 344444433 4455666777777765443 345555543322 22333222 346789999999
Q ss_pred CeEEEEEEEEeC
Q psy11061 192 NSIVGTIKAATS 203 (212)
Q Consensus 192 g~~v~~v~a~D~ 203 (212)
|....+++|+-.
T Consensus 459 GdY~i~i~~ksD 470 (513)
T COG1470 459 GDYRITITAKSD 470 (513)
T ss_pred CcEEEEEEEeec
Confidence 999999988643
No 55
>cd04046 C2_Calpain C2 domain present in Calpain proteins. A single C2 domain is found in calpains (EC 3.4.22.52, EC 3.4.22.53), calcium-dependent, non-lysosomal cysteine proteases. Caplains are classified as belonging to Clan CA by MEROPS and include six families: C1, C2, C10, C12, C28, and C47. C2 domains fold into an 8-standed beta-sandwich that can adopt 2 structural arrangements: Type I and Type II, distinguished by a circular permutation involving their N- and C-terminal beta strands. Many C2 domains are Ca2+-dependent membrane-targeting modules that bind a wide variety of substances including bind phospholipids, inositol polyphosphates, and intracellular proteins. Most C2 domain proteins are either signal transduction enzymes that contain a single C2 domain, such as protein kinase C, or membrane trafficking proteins which contain at least two C2 domains, such as synaptotagmin 1. However, there are a few exceptions to this including RIM isoforms and some splice variants of pic
Probab=24.72 E-value=2.3e+02 Score=19.22 Aligned_cols=17 Identities=6% Similarity=0.102 Sum_probs=13.0
Q ss_pred eCCCCCCeEEEEEEEEe
Q psy11061 186 PEDVFKNSIVGTIKAAT 202 (212)
Q Consensus 186 ~e~~~~g~~v~~v~a~D 202 (212)
..+...|++..++.+.|
T Consensus 107 ~~~~~~G~i~~~~~~~~ 123 (126)
T cd04046 107 AAGEVPGTISVKVTSSD 123 (126)
T ss_pred CCCCCCCEEEEEEEEcc
Confidence 56777888888887776
No 56
>PF09100 Qn_am_d_aIV: Quinohemoprotein amine dehydrogenase, alpha subunit domain IV; InterPro: IPR015184 This domain is predominantly found in the prokaryotic protein quinohemoprotein amine dehydrogenase, adopting an immunoglobulin-like beta-sandwich fold, with seven strands arranged into two beta sheets; the fold is possibly related to the immunoglobulin and/or fibronectin type III superfamilies. The precise function of this domain has not, as yet, been defined []. ; PDB: 1JMZ_A 1JMX_A 1PBY_A 1JJU_A.
Probab=23.30 E-value=1.3e+02 Score=21.08 Aligned_cols=32 Identities=19% Similarity=0.324 Sum_probs=16.4
Q ss_pred CeEEEEEEEEeCC-CCceEEEEEEEEeecCCCCC
Q psy11061 37 SSYEFPVVATDRG-GLSTTAMVRIQITDVNDNEP 69 (212)
Q Consensus 37 ~~~~l~i~a~d~~-~~~~~~~v~i~V~d~nd~~P 69 (212)
....+...+.|.+ .++....+.|.|+.-|+ +|
T Consensus 100 Gnl~VvAtv~d~~~~l~~e~~liVtVqr~~~-pp 132 (133)
T PF09100_consen 100 GNLKVVATVKDGGKPLTGEAHLIVTVQRWNN-PP 132 (133)
T ss_dssp EEEEEEEEETTTT---EEEEEEEEE---S----S
T ss_pred ccEEEEEEEccCCcccceeEeEEEEeecccC-CC
Confidence 3455555555554 37889999999998887 55
No 57
>PF07145 PAM2: Ataxin-2 C-terminal region; InterPro: IPR009818 This entry represents a conserved region approximately 250 residues long located towards the C terminus of eukaryotic ataxin-2. Ataxin-2 is a protein of unknown function, within which expansion of a polyglutamine tract (due to expansion of unstable CAG repeats in the coding region of the SCA2 gene) causes spinocerebellar ataxia type 2 (SCA2), a late-onset neurodegenerative disorder []. The expanded polyglutamine repeat in ataxin-2 causes disruption of the normal morphology of the Golgi complex and increased incidence of cell death []. Ataxin-2 is predicted to consist of mostly non-globular domains [].; PDB: 3NTW_B 1JH4_B 3KTR_B 3KUJ_B 3KUT_D 3KUS_D 1JGN_B 2RQG_A 2RQH_A.
Probab=23.05 E-value=51 Score=14.59 Aligned_cols=11 Identities=45% Similarity=0.757 Sum_probs=7.6
Q ss_pred cCCCCCccCCC
Q psy11061 64 VNDNEPIFNPV 74 (212)
Q Consensus 64 ~nd~~P~f~~~ 74 (212)
.|-|+|+|.+.
T Consensus 6 LNp~A~eFvP~ 16 (18)
T PF07145_consen 6 LNPNAPEFVPS 16 (18)
T ss_dssp SSTTSSSS-TT
T ss_pred cCCCCccccCC
Confidence 57888888654
No 58
>TIGR03769 P_ac_wall_RPT actinobacterial surface-anchored protein domain. This model describes a repeat domain that one to three times in Actinobacterial proteins, some of which have LPXTG-type sortase recognition motifs for covalent attachment to the Gram-positive cell wall. Where it occurs with duplication in an LPXTG-anchored protein, it tends to be adjacent to the substrate-binding protein of the gene trio of an ABC transporter system, where that substrate-binding protein has a single copy of this same domain. This arrangement suggests a substrate-binding relay system, with the LPXTG protein acting as a substrate receptor.
Probab=22.96 E-value=1.4e+02 Score=16.20 Aligned_cols=14 Identities=21% Similarity=0.589 Sum_probs=10.4
Q ss_pred CCCCCeEEEEEEEE
Q psy11061 136 LSRSNSYHINVSAM 149 (212)
Q Consensus 136 ld~~~~~~l~v~a~ 149 (212)
+.+.+.|.|.++++
T Consensus 9 FT~PG~Y~l~~~a~ 22 (41)
T TIGR03769 9 FTKPGTYTLTVQAT 22 (41)
T ss_pred eCCCeEEEEEEEEE
Confidence 44678898888874
No 59
>cd01758 PLAT_LPL PLAT/ LH2 domain present in lipoprotein lipase (LPL). LPL is a key enzyme in catabolism of plasma lipoprotein triglycerides (TGs) and has therefeore has a profound influence on triglyceride and high-density lipoprotein (HDL) cholesterol levels in the blood. In general, PLAT/LH2 domain's proposed function is to mediate interaction with lipids or membrane bound proteins.
Probab=21.24 E-value=3.1e+02 Score=19.42 Aligned_cols=66 Identities=11% Similarity=0.187 Sum_probs=39.6
Q ss_pred eEEEEEEEEeCCCCc-eEEEEEEEEeecCCCCCccCCCceEEEEecCCCCCcEEEEEEEEeCCCCCCceEEE
Q psy11061 38 SYEFPVVATDRGGLS-TTAMVRIQITDVNDNEPIFNPVEYNVSLRDDIQTTTAFAVVLATDRDSDRFGTISY 108 (212)
Q Consensus 38 ~~~l~i~a~d~~~~~-~~~~v~i~V~d~nd~~P~f~~~~~~~~v~e~~~~g~~v~~~~a~D~D~~~~~~i~y 108 (212)
.|.++|......... ....+.|.+...+.+...+ .+...+....|..-.-+..+|.|.|+-..+.+
T Consensus 2 hYqVtV~~~~~~~~~~t~~~v~i~L~G~~g~S~~~-----~l~~~~~~~~G~t~sfLi~t~~dlG~L~~vk~ 68 (137)
T cd01758 2 HYQLKIHFFNQTNRIETDPTFTISLYGTLGESENL-----PLTLPEGITGNKTNSFLITTEKDIGDLLMLKL 68 (137)
T ss_pred eEEEEEEEecccCCCcccceEEEEEEcCCCcccCE-----EEecCcccCCCCeEEEEEECCCCcCCEEEEEE
Confidence 578888877654332 5567777777776655433 22222322456666666777888776555555
No 60
>PF15418 DUF4625: Domain of unknown function (DUF4625)
Probab=21.17 E-value=3.1e+02 Score=19.32 Aligned_cols=19 Identities=26% Similarity=0.613 Sum_probs=16.8
Q ss_pred CCCeEEEEEEEEECCCCCe
Q psy11061 138 RSNSYHINVSAMDGGGNKC 156 (212)
Q Consensus 138 ~~~~~~l~v~a~D~~~~~~ 156 (212)
..+.|.|.+.|+|..|...
T Consensus 105 ~~G~YH~~i~VtD~~Gn~~ 123 (132)
T PF15418_consen 105 PAGDYHFMITVTDAAGNQT 123 (132)
T ss_pred CCcceEEEEEEEECCCCEE
Confidence 7899999999999988654
No 61
>KOG3865|consensus
Probab=20.87 E-value=4.5e+02 Score=21.80 Aligned_cols=63 Identities=22% Similarity=0.395 Sum_probs=42.3
Q ss_pred CceEEeCCceEEEECCcCCcCC---CCeEEEEEEEEeCCCCceEEEEEEEEeecCCCCCccCCCceEEEE
Q psy11061 14 NHFYMKSVSGEICIAQDLDFES---RSSYEFPVVATDRGGLSTTAMVRIQITDVNDNEPIFNPVEYNVSL 80 (212)
Q Consensus 14 ~~F~id~~tg~i~~~~~lD~E~---~~~~~l~i~a~d~~~~~~~~~v~i~V~d~nd~~P~f~~~~~~~~v 80 (212)
..|-+. .|.|.+...||+|- -+...+.|.++++.. .....+.+.|.-.-| -=.|....|...|
T Consensus 186 k~FlmS--~~~lhLevsLDkEiYyHGE~isvnV~V~NNsn-KtVKkIK~~V~Q~ad-i~Lfs~aqy~~~V 251 (402)
T KOG3865|consen 186 KQFLMS--DGPLHLEVSLDKEIYYHGEPISVNVHVTNNSN-KTVKKIKISVRQVAD-ICLFSTAQYKKPV 251 (402)
T ss_pred HhhccC--CCceEEEEEecchheecCCceeEEEEEecCCc-ceeeeeEEEeEeece-EEEEeccccccee
Confidence 355555 48899999999998 467888888888743 355566666666555 3445555565444
Done!