Query psy11062
Match_columns 281
No_of_seqs 194 out of 1697
Neff 8.7
Searched_HMMs 46136
Date Fri Aug 16 17:06:21 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy11062.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/11062hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG4289|consensus 100.0 4.4E-41 9.6E-46 326.2 20.1 230 1-235 186-419 (2531)
2 KOG4289|consensus 100.0 5.4E-39 1.2E-43 311.8 22.3 223 2-229 292-517 (2531)
3 KOG1219|consensus 100.0 1.6E-34 3.4E-39 288.6 24.6 236 1-242 975-1213(4289)
4 KOG1219|consensus 100.0 1E-33 2.2E-38 282.9 23.1 215 7-228 2105-2319(4289)
5 cd00031 CA Cadherin repeat dom 100.0 1.7E-29 3.6E-34 211.6 25.2 177 2-180 20-199 (199)
6 cd00031 CA Cadherin repeat dom 99.9 1.5E-25 3.2E-30 187.6 20.0 179 86-268 2-183 (199)
7 PF00028 Cadherin: Cadherin do 99.8 2.4E-18 5.2E-23 126.8 13.3 92 86-179 1-93 (93)
8 smart00112 CA Cadherin repeats 99.7 6.5E-17 1.4E-21 115.5 10.6 79 106-186 1-79 (79)
9 KOG1834|consensus 99.7 1.1E-15 2.5E-20 141.7 17.6 150 26-181 85-245 (952)
10 smart00112 CA Cadherin repeats 99.6 8.8E-15 1.9E-19 104.4 10.5 75 5-79 4-79 (79)
11 KOG1834|consensus 99.6 5.9E-14 1.3E-18 130.4 15.3 191 68-263 20-223 (952)
12 PF00028 Cadherin: Cadherin do 99.3 1.1E-11 2.4E-16 91.1 10.3 72 1-72 18-93 (93)
13 PF08266 Cadherin_2: Cadherin- 97.4 0.00023 5E-09 51.0 3.8 62 86-150 3-66 (84)
14 PF08758 Cadherin_pro: Cadheri 97.1 0.0076 1.6E-07 43.8 9.3 80 77-166 2-81 (90)
15 TIGR01965 VCBS_repeat VCBS rep 96.6 0.042 9E-07 40.5 9.8 89 101-201 2-98 (99)
16 smart00736 CADG Dystroglycan-t 96.5 0.051 1.1E-06 39.9 10.3 69 4-76 25-96 (97)
17 smart00736 CADG Dystroglycan-t 96.5 0.074 1.6E-06 39.0 11.0 72 105-183 24-96 (97)
18 TIGR01965 VCBS_repeat VCBS rep 95.7 0.18 3.8E-06 37.3 9.6 86 1-94 4-98 (99)
19 PF13750 Big_3_3: Bacterial Ig 95.6 0.93 2E-05 36.5 16.5 128 46-179 15-148 (158)
20 PF08266 Cadherin_2: Cadherin- 95.3 0.021 4.6E-07 40.9 3.3 35 12-46 34-68 (84)
21 TIGR00845 caca sodium/calcium 91.5 14 0.0003 38.1 16.7 130 74-210 395-550 (928)
22 PF07495 Y_Y_Y: Y_Y_Y domain; 87.7 4.3 9.2E-05 26.9 7.1 60 112-178 6-65 (66)
23 PF08758 Cadherin_pro: Cadheri 85.5 6.4 0.00014 28.5 7.3 47 9-61 36-82 (90)
24 TIGR03660 T1SS_rpt_143 T1SS-14 81.1 24 0.00053 27.6 10.3 55 140-200 70-127 (137)
25 PF05345 He_PIG: Putative Ig d 79.5 8.6 0.00019 24.3 5.4 37 126-164 11-48 (49)
26 KOG3597|consensus 70.9 96 0.0021 29.3 15.1 162 63-240 24-203 (442)
27 cd02848 Chitinase_N_term Chiti 68.2 35 0.00077 25.4 7.0 34 145-180 73-106 (106)
28 PF07495 Y_Y_Y: Y_Y_Y domain; 54.7 57 0.0012 21.2 6.1 58 10-71 6-65 (66)
29 PF12245 Big_3_2: Bacterial Ig 54.1 60 0.0013 21.3 5.8 29 151-181 22-50 (60)
30 cd00146 PKD polycystic kidney 50.7 41 0.0009 22.9 4.7 30 146-177 51-80 (81)
31 cd05762 Ig8_MLCK Eighth immuno 48.8 98 0.0021 22.2 9.9 37 144-186 61-97 (98)
32 KOG4221|consensus 48.4 3.8E+02 0.0083 28.9 14.2 70 115-187 551-620 (1381)
33 PF03160 Calx-beta: Calx-beta 48.0 1E+02 0.0022 22.0 11.6 53 68-124 2-54 (100)
34 smart00089 PKD Repeats in poly 44.7 62 0.0013 21.9 4.8 31 145-178 48-78 (79)
35 PF05895 DUF859: Siphovirus pr 43.5 3.5E+02 0.0075 27.0 13.4 108 47-165 299-425 (624)
36 PF06650 DUF1162: Protein of u 41.7 2.3E+02 0.005 24.4 18.3 155 10-186 58-228 (277)
37 PF08329 ChitinaseA_N: Chitina 40.3 66 0.0014 25.1 4.7 37 145-183 76-112 (133)
38 TIGR03660 T1SS_rpt_143 T1SS-14 38.9 1.9E+02 0.0041 22.6 10.9 67 23-94 56-128 (137)
39 PF02494 HYR: HYR domain; Int 37.2 80 0.0017 21.7 4.5 25 152-178 57-81 (81)
40 TIGR00845 caca sodium/calcium 35.2 5.5E+02 0.012 26.9 15.8 54 66-123 516-569 (928)
41 PF13750 Big_3_3: Bacterial Ig 31.9 2.7E+02 0.0057 22.2 11.6 38 150-190 13-51 (158)
42 PF13754 Big_3_4: Bacterial Ig 30.8 1.4E+02 0.0031 18.9 4.8 18 150-167 22-39 (54)
43 COG4288 Uncharacterized protei 25.9 1.6E+02 0.0034 22.1 4.3 59 147-222 53-111 (124)
44 cd02848 Chitinase_N_term Chiti 25.6 1.4E+02 0.0031 22.3 4.1 34 39-73 73-106 (106)
45 cd05850 Ig1_Contactin-2 First 24.8 2.5E+02 0.0055 19.7 9.1 80 79-165 2-83 (94)
46 PF09100 Qn_am_d_aIV: Quinohem 21.7 1.7E+02 0.0038 22.5 4.0 31 48-79 100-132 (133)
47 PF13287 Fn3_assoc: Fn3 associ 21.0 1.7E+02 0.0036 19.1 3.4 21 1-21 1-21 (59)
No 1
>KOG4289|consensus
Probab=100.00 E-value=4.4e-41 Score=326.17 Aligned_cols=230 Identities=32% Similarity=0.443 Sum_probs=214.3
Q ss_pred CEEEEecCCCcEEEEEEe---CCCCceEEEECCccEEEECcCCCceeccEEEEEEEEE-eCCCCceeEEEEEEEEEecCC
Q psy11062 1 MKILLLDQSNPIRYTIHS---GDPEGYFTLDSTTGVIRTAATLDHEVTGVLLLSVQAS-SGSPPLYSYAQVKVTIEDRND 76 (281)
Q Consensus 1 ~~~~d~d~~~~l~Y~i~~---g~~~~~F~Id~~~G~i~~~~~LD~E~~~~~~l~v~a~-dg~~~~~~~~~v~i~V~dvNd 76 (281)
++|.++| .++|.|++.. .-..++|+||+.+|.|++++.||||+...+.|.|.|. -+.|+.+++++|+|.|.|.||
T Consensus 186 v~A~~~~-a~rl~Ysm~al~dsRS~~lFslD~~sG~irta~~lDREt~e~HvlrVtA~d~~~P~~SAtttv~V~V~D~nD 264 (2531)
T KOG4289|consen 186 VKASDPD-AGRLYYSMVALFDSRSQNLFSLDPMSGAIRTAKSLDRETKETHVLRVTAQDHGDPRRSATTTVTVLVLDTND 264 (2531)
T ss_pred EEecCCC-cCceEEEeeeccchhccccEeeccccccchhhhhhhhhhhheeEEEEEeeecCCCcccceeEEEEEEeecCC
Confidence 3567777 4789999984 2344899999999999999999999999999999999 478899999999999999999
Q ss_pred CCCeeccCcEEEEEeCCCCCCceEEEEEEEeCCCCCCceEEEEEeeCCCCCceeeEEeCceeEEEEceecCccCcceEEE
Q psy11062 77 NAPEFESTTIRLSVPESAALGLPLYTVVARDKDSGDNGKIRYHLASGNFGDSRLFAVDQTSGHLSLSKHLDYETSQRHTI 156 (281)
Q Consensus 77 ~~P~f~~~~~~~~v~En~~~gt~v~~v~a~D~D~g~~~~i~y~l~~~~~~~~~~F~id~~tG~i~~~~~LD~E~~~~y~l 156 (281)
|.|.|++..|..++.||.++|..|.+++|+|.|+++|+.+.|++.+|+ ....|.||+.+|.|+++.+||||+...|+|
T Consensus 265 hsPvFEq~~Y~e~lREn~evGy~vLtvrAtD~Dsp~Nani~Yrl~eg~--~~~~f~in~rSGvI~T~a~lDRE~~~~y~L 342 (2531)
T KOG4289|consen 265 HSPVFEQDEYREELRENLEVGYEVLTVRATDGDSPPNANIRYRLLEGN--AKNVFEINPRSGVISTRAPLDREELESYQL 342 (2531)
T ss_pred CCcccchhHHHHHHhhccccCceEEEEEeccCCCCCCCceEEEecCCC--ccceeEEcCccceeeccCccCHHhhhheEE
Confidence 999999999999999999999999999999999999999999999974 678999999999999999999999999999
Q ss_pred EEEEEECCCCCCeEEEEEEEEEEeCCCCCCeeeCCceEEEEeCCCCCCcEEEEEEeeEEEeecCccccEEEEEeecCCh
Q psy11062 157 VVSATDQGDPPMSSNLTVLLEVQDVNDNPPVFERSEYFVSVSEALPAKSQVRTTCTPLYVLVEGQCSLVCWSTLRLMTH 235 (281)
Q Consensus 157 ~V~a~D~~~p~~~~~~~v~I~V~DvND~~P~f~~~~y~~~v~E~~~~gt~v~~v~A~~yd~d~g~~~~~~~~~~~~~~~ 235 (281)
.|+|+|.|.++...++.|.|.|.|+|||+|+|....|.+.|.|+..++++|++|+|+ |.|-|.||.+-|++.+....
T Consensus 343 ~VeAsDqG~~pgp~Ta~V~itV~D~NDNaPqFse~~Yvvqv~Edvt~~avvlrV~At--DrD~g~Ng~VHYsi~Sgn~~ 419 (2531)
T KOG4289|consen 343 DVEASDQGRPPGPRTAMVEITVEDENDNAPQFSEKRYVVQVREDVTPPAVVLRVTAT--DRDKGTNGKVHYSIASGNGR 419 (2531)
T ss_pred EEEeccCCCCCCCceEEEEEEEEecCCCCccccccceEEEecccCCCCceEEEEEec--ccCCCcCceEEEEeeccCcc
Confidence 999999999988889999999999999999999999999999999999999999999 99999999999987665443
No 2
>KOG4289|consensus
Probab=100.00 E-value=5.4e-39 Score=311.77 Aligned_cols=223 Identities=33% Similarity=0.518 Sum_probs=210.5
Q ss_pred EEEEec--CCCcEEEEEEeCCCCceEEEECCccEEEECcCCCceeccEEEEEEEEE-eCCCCceeEEEEEEEEEecCCCC
Q psy11062 2 KILLLD--QSNPIRYTIHSGDPEGYFTLDSTTGVIRTAATLDHEVTGVLLLSVQAS-SGSPPLYSYAQVKVTIEDRNDNA 78 (281)
Q Consensus 2 ~~~d~d--~~~~l~Y~i~~g~~~~~F~Id~~~G~i~~~~~LD~E~~~~~~l~v~a~-dg~~~~~~~~~v~i~V~dvNd~~ 78 (281)
+|+|.| +|+.|.|++..|++.+.|.||+.+|.|++..+||||+...|.|.|.|. .|.++...|++|.|.|.|.|||+
T Consensus 292 rAtD~Dsp~Nani~Yrl~eg~~~~~f~in~rSGvI~T~a~lDRE~~~~y~L~VeAsDqG~~pgp~Ta~V~itV~D~NDNa 371 (2531)
T KOG4289|consen 292 RATDGDSPPNANIRYRLLEGNAKNVFEINPRSGVISTRAPLDREELESYQLDVEASDQGRPPGPRTAMVEITVEDENDNA 371 (2531)
T ss_pred EeccCCCCCCCceEEEecCCCccceeEEcCccceeeccCccCHHhhhheEEEEEeccCCCCCCCceEEEEEEEEecCCCC
Confidence 455555 688999999999999999999999999999999999999999999999 46666677999999999999999
Q ss_pred CeeccCcEEEEEeCCCCCCceEEEEEEEeCCCCCCceEEEEEeeCCCCCceeeEEeCceeEEEEceecCccCcceEEEEE
Q psy11062 79 PEFESTTIRLSVPESAALGLPLYTVVARDKDSGDNGKIRYHLASGNFGDSRLFAVDQTSGHLSLSKHLDYETSQRHTIVV 158 (281)
Q Consensus 79 P~f~~~~~~~~v~En~~~gt~v~~v~a~D~D~g~~~~i~y~l~~~~~~~~~~F~id~~tG~i~~~~~LD~E~~~~y~l~V 158 (281)
|.|....|.+.|.|+..++++|.+|+|+|.|.|.|+.+.|+|..|+ ..+.|.||..+|+|.+..+||+|.. .|.+.|
T Consensus 372 PqFse~~Yvvqv~Edvt~~avvlrV~AtDrD~g~Ng~VHYsi~Sgn--~~G~f~id~~tGel~vv~plD~e~~-~ytl~I 448 (2531)
T KOG4289|consen 372 PQFSEKRYVVQVREDVTPPAVVLRVTATDRDKGTNGKVHYSIASGN--GRGQFYIDSLTGELDVVEPLDFENS-EYTLRI 448 (2531)
T ss_pred ccccccceEEEecccCCCCceEEEEEecccCCCcCceEEEEeeccC--ccccEEEecccceEEEeccccccCC-eeEEEE
Confidence 9999999999999999999999999999999999999999999985 5788999999999999999999988 899999
Q ss_pred EEEECCCCCCeEEEEEEEEEEeCCCCCCeeeCCceEEEEeCCCCCCcEEEEEEeeEEEeecCccccEEEEE
Q psy11062 159 SATDQGDPPMSSNLTVLLEVQDVNDNPPVFERSEYFVSVSEALPAKSQVRTTCTPLYVLVEGQCSLVCWST 229 (281)
Q Consensus 159 ~a~D~~~p~~~~~~~v~I~V~DvND~~P~f~~~~y~~~v~E~~~~gt~v~~v~A~~yd~d~g~~~~~~~~~ 229 (281)
+|.|+|.|+++.+..+.|+|+|+|||+|.|....+.++|.|+.+.|..+..+.|. |.|+|.|+.+.|+.
T Consensus 449 rAqDggrPpLsn~sgl~iqVlDINDhaPifvstpfq~tvlEnv~lg~~v~~vqai--dadsg~na~l~y~l 517 (2531)
T KOG4289|consen 449 RAQDGGRPPLSNTSGLVIQVLDINDHAPIFVSTPFQATVLENVPLGYLVCHVQAI--DADSGENARLHYSL 517 (2531)
T ss_pred EcccCCCCCccCCCceEEEEEecCCCCceeEechhhhhhhhcccccceEEEEecc--cCCCCcccceeeee
Confidence 9999999999999999999999999999999999999999999999999999999 99999999998853
No 3
>KOG1219|consensus
Probab=100.00 E-value=1.6e-34 Score=288.60 Aligned_cols=236 Identities=31% Similarity=0.466 Sum_probs=217.9
Q ss_pred CEEEEecC--CCcEEEEEEeCCCCceEEEECCccEEEECcCCCceeccEEEEEEEEE-eCCCCceeEEEEEEEEEecCCC
Q psy11062 1 MKILLLDQ--SNPIRYTIHSGDPEGYFTLDSTTGVIRTAATLDHEVTGVLLLSVQAS-SGSPPLYSYAQVKVTIEDRNDN 77 (281)
Q Consensus 1 ~~~~d~d~--~~~l~Y~i~~g~~~~~F~Id~~~G~i~~~~~LD~E~~~~~~l~v~a~-dg~~~~~~~~~v~i~V~dvNd~ 77 (281)
|+|.|.|. ++.|.|.|..|++.+.|+|+..+|.|++.+.||||....|-|+|.|+ .|.++.++.+.+.|.|.|+|||
T Consensus 975 i~A~dedsgldg~l~Y~I~~gdg~g~FsId~~tG~irTl~~lDrE~ks~YwltveA~D~gt~~~ssv~~vyI~ieDvNDn 1054 (4289)
T KOG1219|consen 975 IQARDEDSGLDGELSYKIRTGDGDGIFSIDSTTGSIRTLKALDREKKSSYWLTVEAKDLGTVPLSSVCEVYIEIEDVNDN 1054 (4289)
T ss_pred EEEecCCCCccceEEEEEEcCCcceeEEecCCcceEeechhhchhhcceEEEEEEEEecCCCccccceeEEEEEEecCCC
Confidence 35666664 46799999999999999999999999999999999999999999999 5778889999999999999999
Q ss_pred CCeeccCcEEEEEeCCCCCCceEEEEEEEeCCCCCCceEEEEEeeCCCCCceeeEEeCceeEEEEceecCccCcceEEEE
Q psy11062 78 APEFESTTIRLSVPESAALGLPLYTVVARDKDSGDNGKIRYHLASGNFGDSRLFAVDQTSGHLSLSKHLDYETSQRHTIV 157 (281)
Q Consensus 78 ~P~f~~~~~~~~v~En~~~gt~v~~v~a~D~D~g~~~~i~y~l~~~~~~~~~~F~id~~tG~i~~~~~LD~E~~~~y~l~ 157 (281)
+|.|.+..|..+|.||+|.+..|.++.|.|+|...++++.|.|..|+ ..+.|.||+.+|.|...+.||||++.++.|.
T Consensus 1055 ~Pq~s~pvy~asI~enSp~~vsivq~ea~D~Dsssn~kLmykI~sGn--yq~FF~Id~~TG~iTt~r~LDRE~qdEHiLe 1132 (4289)
T KOG1219|consen 1055 VPQFSSPVYYASISENSPETVSIVQAEANDPDSSSNQKLMYKITSGN--YQGFFQIDPETGLITTIRRLDREKQDEHILE 1132 (4289)
T ss_pred CcccCCceEeeeeccCCCCceEEEEeccCCCCcccCcceEEEEccCC--ccceEEEccccceeeeehhhcccccccceEE
Confidence 99999999999999999999999999999999888999999999985 6899999999999999999999999999999
Q ss_pred EEEEECCCCCCeEEEEEEEEEEeCCCCCCeeeCCceEEEEeCCCCCCcEEEEEEeeEEEeecCccccEEEEEeecCChhh
Q psy11062 158 VSATDQGDPPMSSNLTVLLEVQDVNDNPPVFERSEYFVSVSEALPAKSQVRTTCTPLYVLVEGQCSLVCWSTLRLMTHST 237 (281)
Q Consensus 158 V~a~D~~~p~~~~~~~v~I~V~DvND~~P~f~~~~y~~~v~E~~~~gt~v~~v~A~~yd~d~g~~~~~~~~~~~~~~~~~ 237 (281)
|.++|.|.|.+++.+.|.|.|+|+|||+|.|.+..+...++|...+ .+.++.|. |.|+|.|++++|.+-...-...
T Consensus 1133 VTi~D~gep~l~s~~rviV~IldvNdnsp~Flqk~~~~~v~~r~s~--plyRl~a~--d~DeG~narityniedgde~Fs 1208 (4289)
T KOG1219|consen 1133 VTIQDNGEPWLCSNQRVIVSILDVNDNSPRFLQKKTFLRVPERSSP--PLYRLAAQ--DNDEGNNARITYNIEDGDEVFS 1208 (4289)
T ss_pred EEEecCCCCccccceEEEEEEeeccCCchhhhhheeEEEeeeccCC--ceeEEEEE--ecCCCcceEEEEecccCceEEE
Confidence 9999999999999999999999999999999999999999998765 88999999 9999999999998776666655
Q ss_pred hhccC
Q psy11062 238 LKSEL 242 (281)
Q Consensus 238 l~~~~ 242 (281)
++..+
T Consensus 1209 ID~~t 1213 (4289)
T KOG1219|consen 1209 IDIAT 1213 (4289)
T ss_pred EeccC
Confidence 55433
No 4
>KOG1219|consensus
Probab=100.00 E-value=1e-33 Score=282.91 Aligned_cols=215 Identities=31% Similarity=0.483 Sum_probs=201.5
Q ss_pred cCCCcEEEEEEeCCCCceEEEECCccEEEECcCCCceeccEEEEEEEEEeCCCCceeEEEEEEEEEecCCCCCeeccCcE
Q psy11062 7 DQSNPIRYTIHSGDPEGYFTLDSTTGVIRTAATLDHEVTGVLLLSVQASSGSPPLYSYAQVKVTIEDRNDNAPEFESTTI 86 (281)
Q Consensus 7 d~~~~l~Y~i~~g~~~~~F~Id~~~G~i~~~~~LD~E~~~~~~l~v~a~dg~~~~~~~~~v~i~V~dvNd~~P~f~~~~~ 86 (281)
+.+.++.|+|++|+ ...|.+|-+||.|.+..+||||....|.|.|+|+|..-++.+++.|.|.|.|+|||+|.|.+..|
T Consensus 2105 ~l~~kv~YsIldg~-~slFtvnf~TG~i~v~~pLd~ea~t~h~l~ieAtd~~~p~~Aea~VeIiV~dIndn~PvFeqlsY 2183 (4289)
T KOG1219|consen 2105 PLGLKVTYSILDGN-TSLFTVNFTTGVILVLIPLDREASTLHELLIEATDAGIPLSAEAKVEIIVGDINDNPPVFEQLSY 2183 (4289)
T ss_pred CcCCceEEEEecCC-cceEEEecccceEEeccccccccccceEEEEEEeccCCCcceeeEEEEEecccCCCCchhheeeE
Confidence 34566999999988 57899999999999999999999999999999997555689999999999999999999999999
Q ss_pred EEEEeCCCCCCceEEEEEEEeCCCCCCceEEEEEeeCCCCCceeeEEeCceeEEEEceecCccCcceEEEEEEEEECCCC
Q psy11062 87 RLSVPESAALGLPLYTVVARDKDSGDNGKIRYHLASGNFGDSRLFAVDQTSGHLSLSKHLDYETSQRHTIVVSATDQGDP 166 (281)
Q Consensus 87 ~~~v~En~~~gt~v~~v~a~D~D~g~~~~i~y~l~~~~~~~~~~F~id~~tG~i~~~~~LD~E~~~~y~l~V~a~D~~~p 166 (281)
..+++|++++|+.+.++.|+|.|. |..+.|+|.++ ......|+|+..||+|.+.+.||||+...|.|.|+|+|+|.|
T Consensus 2184 t~sisE~s~igt~viqilATdsDs--n~~isYsl~g~-s~~sk~f~In~sTG~it~~g~ldyE~~q~f~~fvratdggk~ 2260 (4289)
T KOG1219|consen 2184 TISISENSKIGTKVIQILATDSDS--NREISYSLEGN-SEISKPFRINVSTGWITVAGKLDYEENQEFRFFVRATDGGKP 2260 (4289)
T ss_pred EEEccCCCccCceEEEEEeccCCC--CCceEEEeecC-CccccceEEecccceEEEeeecChhhcceEEEEEEEccCCCc
Confidence 999999999999999999999996 88999999984 367889999999999999999999999999999999999998
Q ss_pred CCeEEEEEEEEEEeCCCCCCeeeCCceEEEEeCCCCCCcEEEEEEeeEEEeecCccccEEEE
Q psy11062 167 PMSSNLTVLLEVQDVNDNPPVFERSEYFVSVSEALPAKSQVRTTCTPLYVLVEGQCSLVCWS 228 (281)
Q Consensus 167 ~~~~~~~v~I~V~DvND~~P~f~~~~y~~~v~E~~~~gt~v~~v~A~~yd~d~g~~~~~~~~ 228 (281)
+++.+.|.|+|.|+|||+|.|.+..|.+.|+|.+.-|..|..+.|. |+|..+..++.|.
T Consensus 2261 -lSseviv~V~VeD~Ndn~Pef~q~~~ea~vsd~a~~g~fit~v~a~--D~Dssd~lk~ey~ 2319 (4289)
T KOG1219|consen 2261 -LSSEVIVEVHVEDFNDNPPEFNQRNYEAFVSDPARSGHFITVVNAH--DLDSSDHLKLEYN 2319 (4289)
T ss_pred -ccccEEEEEEehhcCCCCchhccccceeecCCCccceeEEEEEEec--cCCccchhhhhhc
Confidence 8899999999999999999999999999999999999999999998 9998888777764
No 5
>cd00031 CA Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here
Probab=99.97 E-value=1.7e-29 Score=211.57 Aligned_cols=177 Identities=44% Similarity=0.643 Sum_probs=161.7
Q ss_pred EEEEecCC--CcEEEEEEeCCCCceEEEECCccEEEECcCCCceeccEEEEEEEEEe-CCCCceeEEEEEEEEEecCCCC
Q psy11062 2 KILLLDQS--NPIRYTIHSGDPEGYFTLDSTTGVIRTAATLDHEVTGVLLLSVQASS-GSPPLYSYAQVKVTIEDRNDNA 78 (281)
Q Consensus 2 ~~~d~d~~--~~l~Y~i~~g~~~~~F~Id~~~G~i~~~~~LD~E~~~~~~l~v~a~d-g~~~~~~~~~v~i~V~dvNd~~ 78 (281)
+|.|.|.+ +.+.|+|.++...++|.|++.+|.|++++.||||....|.|.|.|.| |.+.++++..+.|.|.|+||++
T Consensus 20 ~a~D~D~~~~~~~~y~i~~~~~~~~F~i~~~tG~l~~~~~lD~e~~~~~~l~v~a~D~g~~~~~~~~~v~I~V~d~Nd~~ 99 (199)
T cd00031 20 SATDPDSGENGRVTYSILGGNEDGLFSIDPNTGVITTTKPLDREEQSEYTLTVVASDGGGPPLSSTATVTVTVLDVNDNP 99 (199)
T ss_pred EEECCCCCCCceEEEEEeCCCCcccEEEeCCCCEEEECCCCCCcCCceEEEEEEEEECCcCcceeEEEEEEEEccCCCCC
Confidence 46666664 78999999877668999999999999999999999999999999997 6666668999999999999999
Q ss_pred CeeccCcEEEEEeCCCCCCceEEEEEEEeCCCCCCceEEEEEeeCCCCCceeeEEeCceeEEEEceecCccCcceEEEEE
Q psy11062 79 PEFESTTIRLSVPESAALGLPLYTVVARDKDSGDNGKIRYHLASGNFGDSRLFAVDQTSGHLSLSKHLDYETSQRHTIVV 158 (281)
Q Consensus 79 P~f~~~~~~~~v~En~~~gt~v~~v~a~D~D~g~~~~i~y~l~~~~~~~~~~F~id~~tG~i~~~~~LD~E~~~~y~l~V 158 (281)
|.|....|.+.|.|+.++|+.++++.|+|+|.+.++.++|+|.++. ....|.|++.+|.|.+.++||+|....|.|.|
T Consensus 100 P~~~~~~~~~~v~e~~~~~~~i~~~~a~D~D~~~~~~~~y~l~~~~--~~~~f~i~~~~G~i~~~~~ld~e~~~~~~l~v 177 (199)
T cd00031 100 PVFEQSSYEASVPENAPPGTVVGTVTATDADSGENAKLTYSILSGN--DKELFSIDPNTGIITLAKPLDREEKSSYELTV 177 (199)
T ss_pred CcccccceEEEEeCCCCCCCEEEEEEEEcCCCCCCccEEEEEeCCC--CCCEEEEeCCceEEEeCCccCCccCceEEEEE
Confidence 9999889999999999999999999999999988899999999853 24799999999999999999999999999999
Q ss_pred EEEECCCCCCeEEEEEEEEEEe
Q psy11062 159 SATDQGDPPMSSNLTVLLEVQD 180 (281)
Q Consensus 159 ~a~D~~~p~~~~~~~v~I~V~D 180 (281)
.|+|.+.+.+++++.++|.|.|
T Consensus 178 ~a~D~~~~~~~~~~~i~i~v~d 199 (199)
T cd00031 178 VATDGGGPPLSSTATVTVTVLD 199 (199)
T ss_pred EEEECCCCCceeEEEEEEEEEC
Confidence 9999998888999999999876
No 6
>cd00031 CA Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here
Probab=99.94 E-value=1.5e-25 Score=187.60 Aligned_cols=179 Identities=34% Similarity=0.457 Sum_probs=155.3
Q ss_pred EEEEEeCCCCCCceEEEEEEEeCCCCCCceEEEEEeeCCCCCceeeEEeCceeEEEEceecCccCcceEEEEEEEEECCC
Q psy11062 86 IRLSVPESAALGLPLYTVVARDKDSGDNGKIRYHLASGNFGDSRLFAVDQTSGHLSLSKHLDYETSQRHTIVVSATDQGD 165 (281)
Q Consensus 86 ~~~~v~En~~~gt~v~~v~a~D~D~g~~~~i~y~l~~~~~~~~~~F~id~~tG~i~~~~~LD~E~~~~y~l~V~a~D~~~ 165 (281)
|.+.|+||++.|+.++++.|.|+|.+.++.++|+|.+++ ...+|.|++.+|.|++.+.||||....|.|.|+|+|.+.
T Consensus 2 ~~~~i~En~~~g~~v~~~~a~D~D~~~~~~~~y~i~~~~--~~~~F~i~~~tG~l~~~~~lD~e~~~~~~l~v~a~D~g~ 79 (199)
T cd00031 2 YSVSVPENAPPGTVVGTVSATDPDSGENGRVTYSILGGN--EDGLFSIDPNTGVITTTKPLDREEQSEYTLTVVASDGGG 79 (199)
T ss_pred eEEEEeCCCCCCCEEEEEEEECCCCCCCceEEEEEeCCC--CcccEEEeCCCCEEEECCCCCCcCCceEEEEEEEEECCc
Confidence 688999999999999999999999988889999999864 237999999999999999999999999999999999988
Q ss_pred CCCeEEEEEEEEEEeCCCCCCeeeCCceEEEEeCCCCCCcEEEEEEeeEEEeecCccccEEEEEeecCC--hhhhhc-cC
Q psy11062 166 PPMSSNLTVLLEVQDVNDNPPVFERSEYFVSVSEALPAKSQVRTTCTPLYVLVEGQCSLVCWSTLRLMT--HSTLKS-EL 242 (281)
Q Consensus 166 p~~~~~~~v~I~V~DvND~~P~f~~~~y~~~v~E~~~~gt~v~~v~A~~yd~d~g~~~~~~~~~~~~~~--~~~l~~-~~ 242 (281)
|.+++++.++|.|.|+||++|.|....|.+.|.|+.++|+.++++.|+ |.|.+.++.++|.+..... ...++. .|
T Consensus 80 ~~~~~~~~v~I~V~d~Nd~~P~~~~~~~~~~v~e~~~~~~~i~~~~a~--D~D~~~~~~~~y~l~~~~~~~~f~i~~~~G 157 (199)
T cd00031 80 PPLSSTATVTVTVLDVNDNPPVFEQSSYEASVPENAPPGTVVGTVTAT--DADSGENAKLTYSILSGNDKELFSIDPNTG 157 (199)
T ss_pred CcceeEEEEEEEEccCCCCCCcccccceEEEEeCCCCCCCEEEEEEEE--cCCCCCCccEEEEEeCCCCCCEEEEeCCce
Confidence 877899999999999999999999889999999999999999999999 8998889999999888874 555543 45
Q ss_pred CCCCCCCcceEEEEEEEEEEecCcce
Q psy11062 243 NSPLGSNSFQLSILWYSIFLRTNTMQ 268 (281)
Q Consensus 243 ~~~~~~~~~~~~~~~y~~~~~~~~~~ 268 (281)
...+....-.-....|++.++..++.
T Consensus 158 ~i~~~~~ld~e~~~~~~l~v~a~D~~ 183 (199)
T cd00031 158 IITLAKPLDREEKSSYELTVVATDGG 183 (199)
T ss_pred EEEeCCccCCccCceEEEEEEEEECC
Confidence 44444433334444889998888875
No 7
>PF00028 Cadherin: Cadherin domain; InterPro: IPR002126 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion.; GO: 0005509 calcium ion binding, 0007156 homophilic cell adhesion, 0016020 membrane; PDB: 2A4E_A 2A4C_B 2O72_A 2QVI_A 1NCJ_A 3Q2W_A 3Q2N_A 3LNH_B 3LNI_A 3Q2L_A ....
Probab=99.79 E-value=2.4e-18 Score=126.81 Aligned_cols=92 Identities=43% Similarity=0.641 Sum_probs=87.2
Q ss_pred EEEEEeCCCCCCceEEEEEEEeCCCCCCceEEEEEeeCCCCCceeeEEeCceeEEEEceecCccCcceEEEEEEEEEC-C
Q psy11062 86 IRLSVPESAALGLPLYTVVARDKDSGDNGKIRYHLASGNFGDSRLFAVDQTSGHLSLSKHLDYETSQRHTIVVSATDQ-G 164 (281)
Q Consensus 86 ~~~~v~En~~~gt~v~~v~a~D~D~g~~~~i~y~l~~~~~~~~~~F~id~~tG~i~~~~~LD~E~~~~y~l~V~a~D~-~ 164 (281)
|++.|+|+.++|+.++.+.|.|+|.+.|+.+.|+|.+++ ....|.|++.+|.|++.++||||....|.|.|.|+|. +
T Consensus 1 Y~~~v~E~~~~g~~v~~v~a~D~D~~~n~~i~y~i~~~~--~~~~F~I~~~tg~i~~~~~LD~E~~~~y~l~v~a~D~~~ 78 (93)
T PF00028_consen 1 YSFSVPENAPPGTVVGQVTATDPDSGPNSQITYSILGGN--PDGLFSIDPNTGEISLKKPLDRETQSSYQLTVRATDSGG 78 (93)
T ss_dssp EEEEEETTGSTSSEEEEEEEEESSTSTTSSEEEEEEETT--STTSEEEETTTTEEEESSSSCTTTTSEEEEEEEEEETTT
T ss_pred CEEEEECCCCCCCEEEEEEEEeCCCCCCceEEEEEecCc--ccCceEEeeeeeccccceecCcccCCEEEEEEEEEECCC
Confidence 789999999999999999999999999999999999974 3789999999999999999999999999999999999 8
Q ss_pred CCCCeEEEEEEEEEE
Q psy11062 165 DPPMSSNLTVLLEVQ 179 (281)
Q Consensus 165 ~p~~~~~~~v~I~V~ 179 (281)
.|++++++.|+|+|+
T Consensus 79 ~~~~~~~~~V~I~V~ 93 (93)
T PF00028_consen 79 SPPLSSTATVTINVL 93 (93)
T ss_dssp SSEEEEEEEEEEEEE
T ss_pred CCCCEEEEEEEEEEC
Confidence 899999999999985
No 8
>smart00112 CA Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
Probab=99.72 E-value=6.5e-17 Score=115.47 Aligned_cols=79 Identities=48% Similarity=0.759 Sum_probs=73.3
Q ss_pred EeCCCCCCceEEEEEeeCCCCCceeeEEeCceeEEEEceecCccCcceEEEEEEEEECCCCCCeEEEEEEEEEEeCCCCC
Q psy11062 106 RDKDSGDNGKIRYHLASGNFGDSRLFAVDQTSGHLSLSKHLDYETSQRHTIVVSATDQGDPPMSSNLTVLLEVQDVNDNP 185 (281)
Q Consensus 106 ~D~D~g~~~~i~y~l~~~~~~~~~~F~id~~tG~i~~~~~LD~E~~~~y~l~V~a~D~~~p~~~~~~~v~I~V~DvND~~ 185 (281)
+|+|.|.|+.++|+|.+++. ..+|.|++.+|.|++.++||||....|.|.|+|+|.+.|++++.+.|+|.|.|+|||+
T Consensus 1 ~D~D~g~n~~i~Y~i~~~~~--~~~F~i~~~tg~i~~~~~LD~e~~~~y~l~v~a~D~~~~~~~~~~~v~I~V~D~Nd~~ 78 (79)
T smart00112 1 TDADSGENGKVTYSILSGNE--DGLFSIDPETGEITTTKPLDREEQPEYTLTVEATDGGGPPLSSTATVTVTVLDVNDNA 78 (79)
T ss_pred CCCCCCcCcEEEEEEecCCC--CCEEEEeCCccEEEeCCccCeeCCCeEEEEEEEEECCCCCcccEEEEEEEEEECCCCC
Confidence 48898889999999998642 3899999999999999999999999999999999999999999999999999999999
Q ss_pred C
Q psy11062 186 P 186 (281)
Q Consensus 186 P 186 (281)
|
T Consensus 79 P 79 (79)
T smart00112 79 P 79 (79)
T ss_pred C
Confidence 8
No 9
>KOG1834|consensus
Probab=99.69 E-value=1.1e-15 Score=141.68 Aligned_cols=150 Identities=27% Similarity=0.412 Sum_probs=125.5
Q ss_pred EEECCc--cEEEECcCCCceeccEEEEEEEEEe-CCCC------ceeEEEEEEEEEecCCCCCeeccCcEEEEEeCCCCC
Q psy11062 26 TLDSTT--GVIRTAATLDHEVTGVLLLSVQASS-GSPP------LYSYAQVKVTIEDRNDNAPEFESTTIRLSVPESAAL 96 (281)
Q Consensus 26 ~Id~~~--G~i~~~~~LD~E~~~~~~l~v~a~d-g~~~------~~~~~~v~i~V~dvNd~~P~f~~~~~~~~v~En~~~ 96 (281)
-+|+.+ |.|+.+.+||.|.++.|+|+|+|.| |.++ .+..++|+|.|.|+|+++|.|...-|.+.|.|.. .
T Consensus 85 VvdK~TGegvlRaK~~lDCelqkeytf~iQAydCg~gpdgtn~kKShkatvhIrVkDvNe~AP~f~ep~Yka~V~EGK-~ 163 (952)
T KOG1834|consen 85 VVDKYTGEGVLRAKEPLDCELQKEYTFTIQAYDCGNGPDGTNTKKSHKATVHIRVKDVNEFAPVFKEPWYKAHVTEGK-V 163 (952)
T ss_pred EEeccCCceEEeecCcccccccccceEEEEEEecCCCCCccccccccceEEEEEeccccccCchhcccceeeEEecce-e
Confidence 346654 5788899999999999999999995 3322 3456899999999999999999999999999885 4
Q ss_pred CceEEEEEEEeCCC-CCCceE-EEEEeeCCCCCceeeEEeCceeEEEEceecCccCcceEEEEEEEEECCCCCCeEEEEE
Q psy11062 97 GLPLYTVVARDKDS-GDNGKI-RYHLASGNFGDSRLFAVDQTSGHLSLSKHLDYETSQRHTIVVSATDQGDPPMSSNLTV 174 (281)
Q Consensus 97 gt~v~~v~a~D~D~-g~~~~i-~y~l~~~~~~~~~~F~id~~tG~i~~~~~LD~E~~~~y~l~V~a~D~~~p~~~~~~~v 174 (281)
-..|+++.|.|.|- +++++| .|.|.. .+-+|.|| +.|.|+..++|+|.+...|.|+|.|.|+|..+..+-+.|
T Consensus 164 yd~il~veAiD~DCspq~sqIC~YEI~t----~d~PFaId-n~G~irnTekLny~ke~~Y~ltVtAyDCg~kraa~d~lV 238 (952)
T KOG1834|consen 164 YDSILRVEAIDKDCSPQYSQICEYEITT----PDVPFAID-NDGNIRNTEKLNYTKEHQYKLTVTAYDCGKKRAASDSLV 238 (952)
T ss_pred eeeeEEEEeecCCCCCcccceeEEEecC----CCCceEEc-CCCccccccccccccceeEEEEEEEEecccccccCcceE
Confidence 56799999999995 455555 688887 36689998 579999999999999999999999999999766666888
Q ss_pred EEEEEeC
Q psy11062 175 LLEVQDV 181 (281)
Q Consensus 175 ~I~V~Dv 181 (281)
+|+|...
T Consensus 239 ~v~Vkp~ 245 (952)
T KOG1834|consen 239 TVHVKPT 245 (952)
T ss_pred EEEecCc
Confidence 8888653
No 10
>smart00112 CA Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
Probab=99.60 E-value=8.8e-15 Score=104.35 Aligned_cols=75 Identities=43% Similarity=0.638 Sum_probs=67.0
Q ss_pred EecCCCcEEEEEEeCCCCceEEEECCccEEEECcCCCceeccEEEEEEEEEe-CCCCceeEEEEEEEEEecCCCCC
Q psy11062 5 LLDQSNPIRYTIHSGDPEGYFTLDSTTGVIRTAATLDHEVTGVLLLSVQASS-GSPPLYSYAQVKVTIEDRNDNAP 79 (281)
Q Consensus 5 d~d~~~~l~Y~i~~g~~~~~F~Id~~~G~i~~~~~LD~E~~~~~~l~v~a~d-g~~~~~~~~~v~i~V~dvNd~~P 79 (281)
|.+.++.+.|+|.+++..++|+|++.+|.|++.++||||....|.|.|.|.+ |.++..+++.|+|.|.|+|||+|
T Consensus 4 D~g~n~~i~Y~i~~~~~~~~F~i~~~tg~i~~~~~LD~e~~~~y~l~v~a~D~~~~~~~~~~~v~I~V~D~Nd~~P 79 (79)
T smart00112 4 DSGENGKVTYSILSGNEDGLFSIDPETGEITTTKPLDREEQPEYTLTVEATDGGGPPLSSTATVTVTVLDVNDNAP 79 (79)
T ss_pred CCCcCcEEEEEEecCCCCCEEEEeCCccEEEeCCccCeeCCCeEEEEEEEEECCCCCcccEEEEEEEEEECCCCCC
Confidence 3446788999999877668999999999999999999999999999999996 45568899999999999999988
No 11
>KOG1834|consensus
Probab=99.57 E-value=5.9e-14 Score=130.44 Aligned_cols=191 Identities=21% Similarity=0.173 Sum_probs=140.1
Q ss_pred EEEEEecCCCCCeeccCcEEEEEeCCCCCCceEEEEEEEeCCCCC--C-ceEEEEEeeCCCCCceeeEEeCce--eEEEE
Q psy11062 68 KVTIEDRNDNAPEFESTTIRLSVPESAALGLPLYTVVARDKDSGD--N-GKIRYHLASGNFGDSRLFAVDQTS--GHLSL 142 (281)
Q Consensus 68 ~i~V~dvNd~~P~f~~~~~~~~v~En~~~gt~v~~v~a~D~D~g~--~-~~i~y~l~~~~~~~~~~F~id~~t--G~i~~ 142 (281)
....--+|-|.|+. ...|.+.|.||...-.....+.|-|.|.+. . ...-|.|.+.+. .-..--+|+.+ |.|+.
T Consensus 20 ~~~aarankhkpwi-e~ey~gvV~Endntvll~Ppl~aLdkdaplr~ageiC~fklhgq~v-PFdavVvdK~TGegvlRa 97 (952)
T KOG1834|consen 20 HHHAARANKHKPWI-EEEYHGVVTENDNTVLLDPPLAALDKDAPLRYAGEICGFKLHGQPV-PFDAVVVDKYTGEGVLRA 97 (952)
T ss_pred ccccccccccCccc-ccceeEEEEeCCceEEeCCCeeeecCCCCcccccccceeEecCCCC-CceEEEEeccCCceEEee
Confidence 34556688887865 456899999997544444568888999632 2 245688887432 11223347766 57888
Q ss_pred ceecCccCcceEEEEEEEEECCCCC------CeEEEEEEEEEEeCCCCCCeeeCCceEEEEeCCCCCCcEEEEEEeeEEE
Q psy11062 143 SKHLDYETSQRHTIVVSATDQGDPP------MSSNLTVLLEVQDVNDNPPVFERSEYFVSVSEALPAKSQVRTTCTPLYV 216 (281)
Q Consensus 143 ~~~LD~E~~~~y~l~V~a~D~~~p~------~~~~~~v~I~V~DvND~~P~f~~~~y~~~v~E~~~~gt~v~~v~A~~yd 216 (281)
+.+||+|.++.|+|+|+|.|+|.-+ .+..++|+|+|.|+|+.+|+|..+.|.+.|.|+. +-..|++|.|. |
T Consensus 98 K~~lDCelqkeytf~iQAydCg~gpdgtn~kKShkatvhIrVkDvNe~AP~f~ep~Yka~V~EGK-~yd~il~veAi--D 174 (952)
T KOG1834|consen 98 KEPLDCELQKEYTFTIQAYDCGNGPDGTNTKKSHKATVHIRVKDVNEFAPVFKEPWYKAHVTEGK-VYDSILRVEAI--D 174 (952)
T ss_pred cCcccccccccceEEEEEEecCCCCCccccccccceEEEEEeccccccCchhcccceeeEEecce-eeeeeEEEEee--c
Confidence 8999999999999999999998743 6778999999999999999999999999999988 55699999999 7
Q ss_pred eecC-ccccEE-EEEeecCChhhhhccCCCCCCCCcceEEEEEEEEEEe
Q psy11062 217 LVEG-QCSLVC-WSTLRLMTHSTLKSELNSPLGSNSFQLSILWYSIFLR 263 (281)
Q Consensus 217 ~d~g-~~~~~~-~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~y~~~~~ 263 (281)
.|.+ .++.+. |.+....-.+.+++.|.........--...-|.|+|+
T Consensus 175 ~DCspq~sqIC~YEI~t~d~PFaIdn~G~irnTekLny~ke~~Y~ltVt 223 (952)
T KOG1834|consen 175 KDCSPQYSQICEYEITTPDVPFAIDNDGNIRNTEKLNYTKEHQYKLTVT 223 (952)
T ss_pred CCCCCcccceeEEEecCCCCceEEcCCCccccccccccccceeEEEEEE
Confidence 6655 446665 6777766667788777665444443333444666555
No 12
>PF00028 Cadherin: Cadherin domain; InterPro: IPR002126 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion.; GO: 0005509 calcium ion binding, 0007156 homophilic cell adhesion, 0016020 membrane; PDB: 2A4E_A 2A4C_B 2O72_A 2QVI_A 1NCJ_A 3Q2W_A 3Q2N_A 3LNH_B 3LNI_A 3Q2L_A ....
Probab=99.34 E-value=1.1e-11 Score=91.09 Aligned_cols=72 Identities=35% Similarity=0.566 Sum_probs=64.5
Q ss_pred CEEEEec--CCCcEEEEEEeCCCCceEEEECCccEEEECcCCCceeccEEEEEEEEEeC--CCCceeEEEEEEEEE
Q psy11062 1 MKILLLD--QSNPIRYTIHSGDPEGYFTLDSTTGVIRTAATLDHEVTGVLLLSVQASSG--SPPLYSYAQVKVTIE 72 (281)
Q Consensus 1 ~~~~d~d--~~~~l~Y~i~~g~~~~~F~Id~~~G~i~~~~~LD~E~~~~~~l~v~a~dg--~~~~~~~~~v~i~V~ 72 (281)
++|.|.| .|+.+.|+|..+++.++|.|++.+|.|++.++||||....|.|.|.|+|+ .|++++++.|+|.|.
T Consensus 18 v~a~D~D~~~n~~i~y~i~~~~~~~~F~I~~~tg~i~~~~~LD~E~~~~y~l~v~a~D~~~~~~~~~~~~V~I~V~ 93 (93)
T PF00028_consen 18 VTATDPDSGPNSQITYSILGGNPDGLFSIDPNTGEISLKKPLDRETQSSYQLTVRATDSGGSPPLSSTATVTINVL 93 (93)
T ss_dssp EEEEESSTSTTSSEEEEEEETTSTTSEEEETTTTEEEESSSSCTTTTSEEEEEEEEEETTTSSEEEEEEEEEEEEE
T ss_pred EEEEeCCCCCCceEEEEEecCcccCceEEeeeeeccccceecCcccCCEEEEEEEEEECCCCCCCEEEEEEEEEEC
Confidence 4677887 67899999999998899999999999999999999999999999999965 567888899999874
No 13
>PF08266 Cadherin_2: Cadherin-like; InterPro: IPR013164 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion. This entry represents a cadherin domain that is usually found at the N terminus of cadherin proteins.; PDB: 1WUZ_A 1WYJ_A.
Probab=97.36 E-value=0.00023 Score=51.03 Aligned_cols=62 Identities=31% Similarity=0.507 Sum_probs=39.2
Q ss_pred EEEEEeCCCCCCceEEEEEEEeCCCCCC--ceEEEEEeeCCCCCceeeEEeCceeEEEEceecCccC
Q psy11062 86 IRLSVPESAALGLPLYTVVARDKDSGDN--GKIRYHLASGNFGDSRLFAVDQTSGHLSLSKHLDYET 150 (281)
Q Consensus 86 ~~~~v~En~~~gt~v~~v~a~D~D~g~~--~~i~y~l~~~~~~~~~~F~id~~tG~i~~~~~LD~E~ 150 (281)
...+|+|..++|+.|+.+ |.|.-.... ..-.|++... ....+|.++..+|.|.+..++|||+
T Consensus 3 i~YsV~EE~~~Gt~IGni-a~dL~l~~~~l~~~~~ri~s~--~~~~~~~v~~~tG~L~v~~rIDRE~ 66 (84)
T PF08266_consen 3 IRYSVPEEMPPGTVIGNI-AKDLGLDPQSLSSRNFRIVSE--GNSQYFRVNEKTGDLFVSERIDREE 66 (84)
T ss_dssp EEEEEESS--TT-EEEEC-CCCCT--HHHHCCTTBEEE-S--SSS-SEEE-TTTSEEEESS--SCCC
T ss_pred eEEEeecCCCCCCEEEEh-HHhhCCCcccccccceEEeec--CCcceeEecCCceeEEeCCccCHHH
Confidence 467899999999999999 445432211 1235676653 3578999999999999999999996
No 14
>PF08758 Cadherin_pro: Cadherin prodomain like; InterPro: IPR014868 Cadherins are a group of proteins that mediate calcium dependent cell-cell adhesion. They are activated through cleavage of a prosequence in the late Golgi. This protein corresponds to the folded region of the prosequence, and is termed the prodomain. The prodomain shows structural resemblance to the cadherin domain, but lacks all the features known to be important for cadherin-cadherin interactions []. ; GO: 0007155 cell adhesion, 0016021 integral to membrane; PDB: 1OP4_A.
Probab=97.09 E-value=0.0076 Score=43.77 Aligned_cols=80 Identities=20% Similarity=0.297 Sum_probs=42.7
Q ss_pred CCCeeccCcEEEEEeCCCCCCceEEEEEEEeCCCCCCceEEEEEeeCCCCCceeeEEeCceeEEEEceecCccCcceEEE
Q psy11062 77 NAPEFESTTIRLSVPESAALGLPLYTVVARDKDSGDNGKIRYHLASGNFGDSRLFAVDQTSGHLSLSKHLDYETSQRHTI 156 (281)
Q Consensus 77 ~~P~f~~~~~~~~v~En~~~gt~v~~v~a~D~D~g~~~~i~y~l~~~~~~~~~~F~id~~tG~i~~~~~LD~E~~~~y~l 156 (281)
..|-|.+..|.+.|+.+...|..+++|.-.|=. .+..+.|.-.. ..|+|.+ .|.|++++++.....+ -.|
T Consensus 2 C~pGF~~~~~~~~Vp~~l~~g~~lg~V~f~dC~--~~~~~~~~ssD------pdF~V~~-DGsVy~~r~v~l~~~~-~~F 71 (90)
T PF08758_consen 2 CRPGFSQKKYTFEVPSNLEAGQPLGKVNFEDCT--GRRRVIFESSD------PDFRVLE-DGSVYAKRPVQLSSEQ-RSF 71 (90)
T ss_dssp ---B--S-EEEE----SS-SS--EEE---B--S--S---EEEE---------SEEEEET-TTEEEEES--S-SSS--EEE
T ss_pred CcCCcccceEEEEcCchhhCCcEEEEEEeccCC--CCCceEEecCC------CCEEEcC-CCeEEEeeeEecCCCc-eEE
Confidence 358899999999999999999999999887764 24456766544 3799975 6999999999875443 479
Q ss_pred EEEEEECCCC
Q psy11062 157 VVSATDQGDP 166 (281)
Q Consensus 157 ~V~a~D~~~p 166 (281)
.|.|.|....
T Consensus 72 ~V~a~D~~~~ 81 (90)
T PF08758_consen 72 TVHAWDSQTQ 81 (90)
T ss_dssp EEEEEETTTT
T ss_pred EEEEECCCCC
Confidence 9999999873
No 15
>TIGR01965 VCBS_repeat VCBS repeat. This domain of about 100 residues is found multiple (up to 35) copies in long proteins from several species of Vibrio, Colwellia, Bradyrhizobium, and Shewanella (hence the name VCBS) and in smaller copy numbers in proteins from several other bacteria. The large protein size and repeat copy numbers, species distribution, and suggested activities of several member proteins suggests a role for this domain in adhesion.
Probab=96.56 E-value=0.042 Score=40.54 Aligned_cols=89 Identities=21% Similarity=0.287 Sum_probs=56.7
Q ss_pred EEEEEEeCCCCCCceEEEEEeeCCCCCceeeEEeCceeEEEEc--------eecCccCcceEEEEEEEEECCCCCCeEEE
Q psy11062 101 YTVVARDKDSGDNGKIRYHLASGNFGDSRLFAVDQTSGHLSLS--------KHLDYETSQRHTIVVSATDQGDPPMSSNL 172 (281)
Q Consensus 101 ~~v~a~D~D~g~~~~i~y~l~~~~~~~~~~F~id~~tG~i~~~--------~~LD~E~~~~y~l~V~a~D~~~p~~~~~~ 172 (281)
+++.++|+|.+.. ..+.+... ....+.|.|++ .|..... +.|...+...-.|++.+.|+. +.
T Consensus 2 G~Lt~sD~D~gd~--~~~s~~~~-~g~yGtlti~~-~G~wtYtl~n~~~avq~L~~Ge~~tdsFtvtv~DGt------t~ 71 (99)
T TIGR01965 2 GQLTISDADAGQA--HFIAQTDA-AGQYGTFSIDA-DGQWTYQADNSQTAVQALKAGETLTDTFTVTSADGT------SQ 71 (99)
T ss_pred CceEEeCCCCCCc--eEEecccc-cCCcEEEEECC-CCcEEEEeCCCcHHHHhhcCCCEEEEEEEEEEeCCC------eE
Confidence 3578899997653 44555321 13567788876 4643332 223333444567888899962 78
Q ss_pred EEEEEEEeCCCCCCeeeCCceEEEEeCCC
Q psy11062 173 TVLLEVQDVNDNPPVFERSEYFVSVSEAL 201 (281)
Q Consensus 173 ~v~I~V~DvND~~P~f~~~~y~~~v~E~~ 201 (281)
.|+|.|.-.|| +|+..... ...+.|+.
T Consensus 72 ~vtItI~GtND-apvi~~~~-~g~v~ED~ 98 (99)
T TIGR01965 72 TVTITITGAND-AAVIGGAD-TGSVTEDS 98 (99)
T ss_pred EEEEEEEccCC-CCEEeccc-ceeEecCC
Confidence 89999999999 78775442 46666654
No 16
>smart00736 CADG Dystroglycan-type cadherin-like domains. Cadherin-homologous domains present in metazoan dystroglycans and alpha/epsilon sarcoglycans, yeast Axl2p and in a very large protein from magnetotactic bacteria. Likely to bind calcium ions.
Probab=96.52 E-value=0.051 Score=39.88 Aligned_cols=69 Identities=23% Similarity=0.374 Sum_probs=52.3
Q ss_pred EEecCCCcEEEEEEeCC---CCceEEEECCccEEEECcCCCceeccEEEEEEEEEeCCCCceeEEEEEEEEEecCC
Q psy11062 4 LLLDQSNPIRYTIHSGD---PEGYFTLDSTTGVIRTAATLDHEVTGVLLLSVQASSGSPPLYSYAQVKVTIEDRND 76 (281)
Q Consensus 4 ~d~d~~~~l~Y~i~~g~---~~~~F~Id~~~G~i~~~~~LD~E~~~~~~l~v~a~dg~~~~~~~~~v~i~V~dvNd 76 (281)
.|.| +..++|++...+ -..|.++++.++.++-. +.+.+ ...|.+.|.|+|+.+ .+....++|.|.+.|+
T Consensus 25 ~d~d-~~~lty~~~~~~~~~lP~Wl~fd~~~~~~~Gt-P~~~~-~g~~~i~v~a~D~~g-~~~~~~f~i~V~~~~~ 96 (97)
T smart00736 25 TDAD-GDTLTYSATLSDGSALPSWLSFDSDTGTLSGT-PTNSD-VGSLSLKVTATDSSG-ASASDTFTITVVNTND 96 (97)
T ss_pred ECCC-CCeEEEEEEeCCCCCCCCeEEEeCCCCEEEEE-CCCCC-CcEEEEEEEEEECCC-CEEEEEEEEEEeCCCC
Confidence 4555 678999997543 23699999999888874 44433 356999999998774 5677889999998887
No 17
>smart00736 CADG Dystroglycan-type cadherin-like domains. Cadherin-homologous domains present in metazoan dystroglycans and alpha/epsilon sarcoglycans, yeast Axl2p and in a very large protein from magnetotactic bacteria. Likely to bind calcium ions.
Probab=96.50 E-value=0.074 Score=39.04 Aligned_cols=72 Identities=22% Similarity=0.261 Sum_probs=53.0
Q ss_pred EEeCCCCCCceEEEEEeeCC-CCCceeeEEeCceeEEEEceecCccCcceEEEEEEEEECCCCCCeEEEEEEEEEEeCCC
Q psy11062 105 ARDKDSGDNGKIRYHLASGN-FGDSRLFAVDQTSGHLSLSKHLDYETSQRHTIVVSATDQGDPPMSSNLTVLLEVQDVND 183 (281)
Q Consensus 105 a~D~D~g~~~~i~y~l~~~~-~~~~~~F~id~~tG~i~~~~~LD~E~~~~y~l~V~a~D~~~p~~~~~~~v~I~V~DvND 183 (281)
..|+| +..++|++...+ .....|...|+.++.+.= .+... ..+.|.+.|.|+|..+ .+....++|.|.+.||
T Consensus 24 F~d~d---~~~lty~~~~~~~~~lP~Wl~fd~~~~~~~G-tP~~~-~~g~~~i~v~a~D~~g--~~~~~~f~i~V~~~~~ 96 (97)
T smart00736 24 FTDAD---GDTLTYSATLSDGSALPSWLSFDSDTGTLSG-TPTNS-DVGSLSLKVTATDSSG--ASASDTFTITVVNTND 96 (97)
T ss_pred eECCC---CCeEEEEEEeCCCCCCCCeEEEeCCCCEEEE-ECCCC-CCcEEEEEEEEEECCC--CEEEEEEEEEEeCCCC
Confidence 35776 447899887432 124569999988877655 34443 3567999999999986 6788899999999887
No 18
>TIGR01965 VCBS_repeat VCBS repeat. This domain of about 100 residues is found multiple (up to 35) copies in long proteins from several species of Vibrio, Colwellia, Bradyrhizobium, and Shewanella (hence the name VCBS) and in smaller copy numbers in proteins from several other bacteria. The large protein size and repeat copy numbers, species distribution, and suggested activities of several member proteins suggests a role for this domain in adhesion.
Probab=95.73 E-value=0.18 Score=37.25 Aligned_cols=86 Identities=22% Similarity=0.226 Sum_probs=57.8
Q ss_pred CEEEEecCCCcEEEEEEeC-CCCceEEEECCccEEEEC--------cCCCceeccEEEEEEEEEeCCCCceeEEEEEEEE
Q psy11062 1 MKILLLDQSNPIRYTIHSG-DPEGYFTLDSTTGVIRTA--------ATLDHEVTGVLLLSVQASSGSPPLYSYAQVKVTI 71 (281)
Q Consensus 1 ~~~~d~d~~~~l~Y~i~~g-~~~~~F~Id~~~G~i~~~--------~~LD~E~~~~~~l~v~a~dg~~~~~~~~~v~i~V 71 (281)
|.++|.|.+....+++... ...+.|.|++ +|..... ..|.--..-.-+|++.+.||. +..|+|.|
T Consensus 4 Lt~sD~D~gd~~~~s~~~~~g~yGtlti~~-~G~wtYtl~n~~~avq~L~~Ge~~tdsFtvtv~DGt-----t~~vtItI 77 (99)
T TIGR01965 4 LTISDADAGQAHFIAQTDAAGQYGTFSIDA-DGQWTYQADNSQTAVQALKAGETLTDTFTVTSADGT-----SQTVTITI 77 (99)
T ss_pred eEEeCCCCCCceEEecccccCCcEEEEECC-CCcEEEEeCCCcHHHHhhcCCCEEEEEEEEEEeCCC-----eEEEEEEE
Confidence 4678888888888888532 2347899987 5755432 123332333457888888873 77899999
Q ss_pred EecCCCCCeeccCcEEEEEeCCC
Q psy11062 72 EDRNDNAPEFESTTIRLSVPESA 94 (281)
Q Consensus 72 ~dvNd~~P~f~~~~~~~~v~En~ 94 (281)
.-.|| +|...... ...+.|+.
T Consensus 78 ~GtND-apvi~~~~-~g~v~ED~ 98 (99)
T TIGR01965 78 TGAND-AAVIGGAD-TGSVTEDS 98 (99)
T ss_pred EccCC-CCEEeccc-ceeEecCC
Confidence 99999 58765433 46677763
No 19
>PF13750 Big_3_3: Bacterial Ig-like domain (group 3)
Probab=95.62 E-value=0.93 Score=36.49 Aligned_cols=128 Identities=21% Similarity=0.263 Sum_probs=69.3
Q ss_pred cEEEEEE-EEEeCCCCceeEEEEEEEEEecCCCCCeeccCcEEEEEeCCCCC-CceEEEEEEEeCCCCCCceEEEEEeeC
Q psy11062 46 GVLLLSV-QASSGSPPLYSYAQVKVTIEDRNDNAPEFESTTIRLSVPESAAL-GLPLYTVVARDKDSGDNGKIRYHLASG 123 (281)
Q Consensus 46 ~~~~l~v-~a~dg~~~~~~~~~v~i~V~dvNd~~P~f~~~~~~~~v~En~~~-gt~v~~v~a~D~D~g~~~~i~y~l~~~ 123 (281)
..|.+.+ .|.|..+- ..+..+...+. ++..+|.+.. .....+..+... |..-..+.++|...+. ..-..+|.+|
T Consensus 15 G~Y~l~~~~a~D~agN-~~~~~~~~~~~-iD~T~Ptisi-~~~~~~~~g~~v~~~~~i~i~~tD~~~~~-~i~sv~l~Gg 90 (158)
T PF13750_consen 15 GSYTLTVVTATDAAGN-TSTSTVSETFT-IDNTPPTISI-SDGASVANGSTVYGLVNISINVTDNSDDS-KITSVSLTGG 90 (158)
T ss_pred ccEEEEEEEEEecCCC-EEEEEEeeEEE-EcCCCCEEEE-ecCCccCCCccccceeeeEEEEEeCCCCc-eEEEEEEECC
Confidence 3689999 78876532 23333332232 3555787755 112223333322 3344567788877544 3446777766
Q ss_pred CCCCceeeEEeCc-eeEEEEc--eec-CccCcceEEEEEEEEECCCCCCeEEEEEEEEEE
Q psy11062 124 NFGDSRLFAVDQT-SGHLSLS--KHL-DYETSQRHTIVVSATDQGDPPMSSNLTVLLEVQ 179 (281)
Q Consensus 124 ~~~~~~~F~id~~-tG~i~~~--~~L-D~E~~~~y~l~V~a~D~~~p~~~~~~~v~I~V~ 179 (281)
.....-....... .|...+. +.+ ..|....|.|+|.|.|..+ ...+..+.....
T Consensus 91 ~~~d~v~ls~~~~~~~~~~~~yp~~fpsle~~~~YtLtV~a~D~aG--N~~~~si~F~y~ 148 (158)
T PF13750_consen 91 PASDSVSLSWTNKGNGVYTLEYPRIFPSLEADDSYTLTVSATDKAG--NQSTKSISFSYM 148 (158)
T ss_pred cccceEEEeeEeccCceEEeecccccCCcCCCCeEEEEEEEEecCC--CEEEEEEEEEEe
Confidence 4333323332221 2333222 111 3367889999999999887 466666665554
No 20
>PF08266 Cadherin_2: Cadherin-like; InterPro: IPR013164 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion. This entry represents a cadherin domain that is usually found at the N terminus of cadherin proteins.; PDB: 1WUZ_A 1WYJ_A.
Probab=95.27 E-value=0.021 Score=40.87 Aligned_cols=35 Identities=23% Similarity=0.351 Sum_probs=24.9
Q ss_pred EEEEEEeCCCCceEEEECCccEEEECcCCCceecc
Q psy11062 12 IRYTIHSGDPEGYFTLDSTTGVIRTAATLDHEVTG 46 (281)
Q Consensus 12 l~Y~i~~g~~~~~F~Id~~~G~i~~~~~LD~E~~~ 46 (281)
..++++++....+|.+++.+|.|+++..+|||+.+
T Consensus 34 ~~~ri~s~~~~~~~~v~~~tG~L~v~~rIDRE~LC 68 (84)
T PF08266_consen 34 RNFRIVSEGNSQYFRVNEKTGDLFVSERIDREELC 68 (84)
T ss_dssp TTBEEE-SSSS-SEEE-TTTSEEEESS--SCCCC-
T ss_pred cceEEeecCCcceeEecCCceeEEeCCccCHHHHC
Confidence 35677766667899999999999999999999875
No 21
>TIGR00845 caca sodium/calcium exchanger 1. This model is specific for the eukaryotic sodium ion/calcium ion exchangers of the Caca family
Probab=91.50 E-value=14 Score=38.08 Aligned_cols=130 Identities=20% Similarity=0.229 Sum_probs=68.8
Q ss_pred cCCCCCeeccCcEEEEEeCCCCCCceEEEEEEEeCCCCCCceEEEEEeeCCCCCceeeEEeCceeEEEEc----------
Q psy11062 74 RNDNAPEFESTTIRLSVPESAALGLPLYTVVARDKDSGDNGKIRYHLASGNFGDSRLFAVDQTSGHLSLS---------- 143 (281)
Q Consensus 74 vNd~~P~f~~~~~~~~v~En~~~gt~v~~v~a~D~D~g~~~~i~y~l~~~~~~~~~~F~id~~tG~i~~~---------- 143 (281)
.||..+.+.-..-...|.||. |+.-..|.-...|.+....+.|+..+|......-|. ..+|.|.-.
T Consensus 395 ~dd~~s~i~Fe~~~Y~V~En~--GtV~VtV~R~GGdl~~tVsVdY~T~DGTA~AG~DY~--~~sGTLtF~PGEt~KtItV 470 (928)
T TIGR00845 395 ENDPVSKIFFEPGHYTCLENC--GTVALTVVRRGGDLTNTVYVDYRTEDGTANAGSDYE--FTEGTLVFKPGETQKEFRI 470 (928)
T ss_pred ccCCcceEEecCCeEEEeecC--cEEEEEEEEccCCCCceEEEEEEccCCccCCCCCcc--ccCceEEECCCceEEEEEE
Confidence 455555555454466799984 777777766655655556788988866432222222 223433221
Q ss_pred ---eecCccCcceEEEEEEEEECCC-------------CCCeEEEEEEEEEEeCCCCCCeeeCCceEEEEeCCCCCCcEE
Q psy11062 144 ---KHLDYETSQRHTIVVSATDQGD-------------PPMSSNLTVLLEVQDVNDNPPVFERSEYFVSVSEALPAKSQV 207 (281)
Q Consensus 144 ---~~LD~E~~~~y~l~V~a~D~~~-------------p~~~~~~~v~I~V~DvND~~P~f~~~~y~~~v~E~~~~gt~v 207 (281)
..=-+|..+.|.+.+.--..+. .........+|.|.| ||++|.|.-..-...|.|+. |..-
T Consensus 471 ~IIDDdi~E~DE~F~V~LSNp~~g~~~G~~~~~~~~~~A~Lg~ps~ATVTIlD-DD~aGIfsFe~~~~sV~Es~--G~vt 547 (928)
T TIGR00845 471 GIIDDDIFEEDEHFYVRLSNLRVGSEDGILEANHVSAVAQLASPNTATVTILD-DDHAGIFTFEEDVFHVSESI--GIME 547 (928)
T ss_pred EEccCCCCCCCceEEEEEeCCCCCCcccccccccccccceecCCceEEEEEec-CcccCcccccCceEEEEcCC--CEEE
Confidence 1112344555555543221110 011122345666777 68899876544467788874 4443
Q ss_pred EEE
Q psy11062 208 RTT 210 (281)
Q Consensus 208 ~~v 210 (281)
.+|
T Consensus 548 vtV 550 (928)
T TIGR00845 548 VKV 550 (928)
T ss_pred EEE
Confidence 333
No 22
>PF07495 Y_Y_Y: Y_Y_Y domain; InterPro: IPR011123 This region is mostly found at the end of the beta propellers (IPR011110 from INTERPRO) in a family of two component regulators. However they are also found tandemly repeated in Q891H4 from SWISSPROT without other signal conduction domains being present. It is named after the conserved tyrosines found in the alignment. The exact function is not known.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=87.70 E-value=4.3 Score=26.89 Aligned_cols=60 Identities=20% Similarity=0.244 Sum_probs=32.9
Q ss_pred CCceEEEEEeeCCCCCceeeEEeCceeEEEEceecCccCcceEEEEEEEEECCCCCCeEEEEEEEEE
Q psy11062 112 DNGKIRYHLASGNFGDSRLFAVDQTSGHLSLSKHLDYETSQRHTIVVSATDQGDPPMSSNLTVLLEV 178 (281)
Q Consensus 112 ~~~~i~y~l~~~~~~~~~~F~id~~tG~i~~~~~LD~E~~~~y~l~V~a~D~~~p~~~~~~~v~I~V 178 (281)
.+-...|+|.+ ....+..+...+-.+.. . ....+.|.|.|+|.|..+........+.|.|
T Consensus 6 ~~~~Y~Y~l~g---~d~~W~~~~~~~~~~~~-~---~L~~G~Y~l~V~a~~~~~~~~~~~~~l~i~I 65 (66)
T PF07495_consen 6 ENIRYRYRLEG---FDDEWITLGSYSNSISY-T---NLPPGKYTLEVRAKDNNGKWSSDEKSLTITI 65 (66)
T ss_dssp TTEEEEEEEET---TESSEEEESSTS-EEEE-E---S--SEEEEEEEEEEETTS-B-SS-EEEEEEE
T ss_pred CceEEEEEEEC---CCCeEEECCCCcEEEEE-E---eCCCEEEEEEEEEECCCCCcCcccEEEEEEE
Confidence 34566777876 23555655433212222 1 2257899999999997764333335666655
No 23
>PF08758 Cadherin_pro: Cadherin prodomain like; InterPro: IPR014868 Cadherins are a group of proteins that mediate calcium dependent cell-cell adhesion. They are activated through cleavage of a prosequence in the late Golgi. This protein corresponds to the folded region of the prosequence, and is termed the prodomain. The prodomain shows structural resemblance to the cadherin domain, but lacks all the features known to be important for cadherin-cadherin interactions []. ; GO: 0007155 cell adhesion, 0016021 integral to membrane; PDB: 1OP4_A.
Probab=85.51 E-value=6.4 Score=28.48 Aligned_cols=47 Identities=13% Similarity=0.175 Sum_probs=28.2
Q ss_pred CCcEEEEEEeCCCCceEEEECCccEEEECcCCCceeccEEEEEEEEEeCCCCc
Q psy11062 9 SNPIRYTIHSGDPEGYFTLDSTTGVIRTAATLDHEVTGVLLLSVQASSGSPPL 61 (281)
Q Consensus 9 ~~~l~Y~i~~g~~~~~F~Id~~~G~i~~~~~LD~E~~~~~~l~v~a~dg~~~~ 61 (281)
+..+.|+- .++ .|+|.+ .|.|++++++...... -++.|.|.|.....
T Consensus 36 ~~~~~~~s--sDp--dF~V~~-DGsVy~~r~v~l~~~~-~~F~V~a~D~~~~~ 82 (90)
T PF08758_consen 36 RRRVIFES--SDP--DFRVLE-DGSVYAKRPVQLSSEQ-RSFTVHAWDSQTQE 82 (90)
T ss_dssp ---EEEE-----S--EEEEET-TTEEEEES--S-SSS--EEEEEEEEETTTTE
T ss_pred CCceEEec--CCC--CEEEcC-CCeEEEeeeEecCCCc-eEEEEEEECCCCCe
Confidence 34566655 445 699987 5999999998876443 47999999876443
No 24
>TIGR03660 T1SS_rpt_143 T1SS-143 repeat domain. This model represents a domain of about 143 amino acids that may occur singly or in up to 23 tandem repeats in very large proteins in the genus Vibrio, and in related species such as Legionella pneumophila, Photobacterium profundum, Rhodopseudomonas palustris, Shewanella pealeana, and Aeromonas hydrophila. Proteins with these domains represent a subset of a broader set of proteins with a particular signal for type 1 secretion, consisting of several glycine-rich repeats modeled by pfam00353, followed by a C-terminal domain modeled by TIGR03661. Proteins with this domain tend to share several properties with the RtxA (Repeats in Toxin) protein of Vibrio cholerae, including a large size often containing tandemly repeated domains and a C-terminal signal for type 1 secretion.
Probab=81.13 E-value=24 Score=27.61 Aligned_cols=55 Identities=24% Similarity=0.331 Sum_probs=35.5
Q ss_pred EEEceecCcc---CcceEEEEEEEEECCCCCCeEEEEEEEEEEeCCCCCCeeeCCceEEEEeCC
Q psy11062 140 LSLSKHLDYE---TSQRHTIVVSATDQGDPPMSSNLTVLLEVQDVNDNPPVFERSEYFVSVSEA 200 (281)
Q Consensus 140 i~~~~~LD~E---~~~~y~l~V~a~D~~~p~~~~~~~v~I~V~DvND~~P~f~~~~y~~~v~E~ 200 (281)
+.+.++||.. ..-...|.|.|+|..+- .+...+.|.|.| | .|...... ...|.|+
T Consensus 70 ftL~~~lDH~~g~d~l~l~~~v~a~D~DGD--~s~~~l~VtI~D--D-~P~~~~~~-~~~V~E~ 127 (137)
T TIGR03660 70 FTLEGPLDHAAGSDELTLNFPIIATDFDGD--TSSITLPVTIVD--D-VPTITDVD-ALTVDED 127 (137)
T ss_pred EEEcccccCCCCCceEEEeeeEEEEeCCCC--ccccEEEEEEEC--C-CCeecccc-ceEEecc
Confidence 3444555542 23457788999998763 334588888888 6 58775543 3677774
No 25
>PF05345 He_PIG: Putative Ig domain; InterPro: IPR008009 This alignment represents the conserved core region of a ~90 residue repeat found in several haemagglutinins and other cell surface proteins. Sequence similarities to Hyalin (IPR003410 from INTERPRO) and the PKD domain (IPR000601 from INTERPRO) suggest an Ig-like fold so this family may be similar in function to the (IPR003791 from INTERPRO) and (IPR003790 from INTERPRO) protein families.
Probab=79.47 E-value=8.6 Score=24.30 Aligned_cols=37 Identities=24% Similarity=0.268 Sum_probs=27.3
Q ss_pred CCceeeEEeCceeEEEEceecCcc-CcceEEEEEEEEECC
Q psy11062 126 GDSRLFAVDQTSGHLSLSKHLDYE-TSQRHTIVVSATDQG 164 (281)
Q Consensus 126 ~~~~~F~id~~tG~i~~~~~LD~E-~~~~y~l~V~a~D~~ 164 (281)
.......+|+.+|.|.-. .+.. ....|.+.|.|+|..
T Consensus 11 ~LP~gLs~d~~tG~isGt--p~~~~~~G~y~~~vtatd~~ 48 (49)
T PF05345_consen 11 GLPSGLSLDPSTGTISGT--PTSSVQPGTYTFTVTATDGS 48 (49)
T ss_pred CCCCcEEEeCCCCEEEee--cCCCccccEEEEEEEEEcCC
Confidence 356678889999987544 4433 346899999999975
No 26
>KOG3597|consensus
Probab=70.86 E-value=96 Score=29.34 Aligned_cols=162 Identities=14% Similarity=0.073 Sum_probs=85.3
Q ss_pred eEEEEEEEEEecCCCCCeeccCcEEEEEeCCCCCCceEEEEEEEeCCCCCCceEEEEEeeCCCC--CceeeEEeC-----
Q psy11062 63 SYAQVKVTIEDRNDNAPEFESTTIRLSVPESAALGLPLYTVVARDKDSGDNGKIRYHLASGNFG--DSRLFAVDQ----- 135 (281)
Q Consensus 63 ~~~~v~i~V~dvNd~~P~f~~~~~~~~v~En~~~gt~v~~v~a~D~D~g~~~~i~y~l~~~~~~--~~~~F~id~----- 135 (281)
.+....|.|..+||.+..+...-+.+-+.|+...-.....+.+.|+|..+ ..+.|++.+.... ....|....
T Consensus 24 ~~~~~~i~v~pvndpp~~~~~~~~~l~~~~~~~k~l~~~~l~~~d~d~~~-~~l~f~v~~t~~~~~~~~~~~~~g~~~~~ 102 (442)
T KOG3597|consen 24 QTDVLRIHVNPVNDPPSLIFPSGSLLVILEGGQKVLDPELLTAADPDSAP-LPLEFQVLGTSSVPLPVLKFDVPGAPATE 102 (442)
T ss_pred EEeeecccccccCCCcceeecccceEEeecCCceeccceEeeccCCCCCc-cceEEEEccCCCCCCccceeeccCCcccc
Confidence 45568899999999766665555568888886544455678889999754 3578888753211 111133311
Q ss_pred ------ceeEEEEceecCcc--CcceEEEEEEEEECCCCCCeEEEEEEEEEEeCCCCCCeeeCC-ceEEEEeCC--CCCC
Q psy11062 136 ------TSGHLSLSKHLDYE--TSQRHTIVVSATDQGDPPMSSNLTVLLEVQDVNDNPPVFERS-EYFVSVSEA--LPAK 204 (281)
Q Consensus 136 ------~tG~i~~~~~LD~E--~~~~y~l~V~a~D~~~p~~~~~~~v~I~V~DvND~~P~f~~~-~y~~~v~E~--~~~g 204 (281)
..|.+.. ++. +.....+...++|+-. .+. .+ +.-.-..+|.+... .--..|.-+ +++.
T Consensus 103 Fs~~~v~~g~~~y----vh~g~el~~~~~~~~~SDg~~---~S~---~~-i~~~~~~~~~~~~~~~~gL~v~~gS~~~IT 171 (442)
T KOG3597|consen 103 FSYEEVEDGSLSY----VHSGTELRESELQLRVSDGLL---VSE---RA-ILKVEATGPAPHLARNTGLKVLQGSTAPIT 171 (442)
T ss_pred eEehHhhcCceeE----EecCcccccceEEEEeecceE---eee---eE-EecccCCCcceeeecccceEEccCcccccc
Confidence 1222222 222 3566788888998864 121 11 11111223433211 111122111 1111
Q ss_pred cEEEEEEeeEEEeecCccccEEEEEeecCChhhhhc
Q psy11062 205 SQVRTTCTPLYVLVEGQCSLVCWSTLRLMTHSTLKS 240 (281)
Q Consensus 205 t~v~~v~A~~yd~d~g~~~~~~~~~~~~~~~~~l~~ 240 (281)
.. -+++. |.+.+....+.|.+.....+..|.+
T Consensus 172 ~~--~L~ve--d~d~~~d~~v~~~i~~~P~~G~Lq~ 203 (442)
T KOG3597|consen 172 PS--NLSVE--DNDSSPDDEVRYDITPPPQHGHLQR 203 (442)
T ss_pred Hh--Hceee--cCCCCCCcEEEEEecCCCCCChhhe
Confidence 11 12233 4444566678888888877777764
No 27
>cd02848 Chitinase_N_term Chitinase N-terminus domain. Chitinases hydrolyze the abundant natural biopolymer chitin, producing smaller chito-oligosaccharides. Chitin consists of multiple N-acetyl-D-glucosamine (NAG) residues connected via beta-1,4-glycosidic linkages and is an important structural element of fungal cell wall and arthropod exoskeletons. On the basis of the mode of chitin hydrolysis, chitinases are classified as random, endo-, and exo-chitinases and based on sequence criteria, chitinases belong to families 18 and 19 of glycosyl hydrolases. The N-terminus of chitinase may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitob
Probab=68.16 E-value=35 Score=25.43 Aligned_cols=34 Identities=15% Similarity=0.148 Sum_probs=24.4
Q ss_pred ecCccCcceEEEEEEEEECCCCCCeEEEEEEEEEEe
Q psy11062 145 HLDYETSQRHTIVVSATDQGDPPMSSNLTVLLEVQD 180 (281)
Q Consensus 145 ~LD~E~~~~y~l~V~a~D~~~p~~~~~~~v~I~V~D 180 (281)
.+++.+.+.|.++|+++|..+ .+....+.|.|-|
T Consensus 73 t~~v~kgG~y~m~V~lCn~dG--CS~S~~~~I~VAD 106 (106)
T cd02848 73 TFKVGKGGRYQMQVALCNGDG--CSTSAAKEIVVAD 106 (106)
T ss_pred EEEeCCCCeEEEEEEEECCCC--ccCcCCEEEEecC
Confidence 456778899999999999887 3444455555543
No 28
>PF07495 Y_Y_Y: Y_Y_Y domain; InterPro: IPR011123 This region is mostly found at the end of the beta propellers (IPR011110 from INTERPRO) in a family of two component regulators. However they are also found tandemly repeated in Q891H4 from SWISSPROT without other signal conduction domains being present. It is named after the conserved tyrosines found in the alignment. The exact function is not known.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=54.71 E-value=57 Score=21.22 Aligned_cols=58 Identities=26% Similarity=0.302 Sum_probs=29.6
Q ss_pred CcEEEEE-EeCCCCceEEEECCccEEEECcCCCceeccEEEEEEEEEeCCCCcee-EEEEEEEE
Q psy11062 10 NPIRYTI-HSGDPEGYFTLDSTTGVIRTAATLDHEVTGVLLLSVQASSGSPPLYS-YAQVKVTI 71 (281)
Q Consensus 10 ~~l~Y~i-~~g~~~~~F~Id~~~G~i~~~~~LD~E~~~~~~l~v~a~dg~~~~~~-~~~v~i~V 71 (281)
..+.|+. +.|-...|..+...+-.+... .-....|.|.|+|.+....... ...+.|.|
T Consensus 6 ~~~~Y~Y~l~g~d~~W~~~~~~~~~~~~~----~L~~G~Y~l~V~a~~~~~~~~~~~~~l~i~I 65 (66)
T PF07495_consen 6 ENIRYRYRLEGFDDEWITLGSYSNSISYT----NLPPGKYTLEVRAKDNNGKWSSDEKSLTITI 65 (66)
T ss_dssp TTEEEEEEEETTESSEEEESSTS-EEEEE----S--SEEEEEEEEEEETTS-B-SS-EEEEEEE
T ss_pred CceEEEEEEECCCCeEEECCCCcEEEEEE----eCCCEEEEEEEEEECCCCCcCcccEEEEEEE
Confidence 3444444 345555677774432233322 1234589999999965433222 25666665
No 29
>PF12245 Big_3_2: Bacterial Ig-like domain (group 3); InterPro: IPR022038 This family of proteins is found in bacteria. They have two conserved sequence motifs: AGN and GMT.
Probab=54.12 E-value=60 Score=21.27 Aligned_cols=29 Identities=28% Similarity=0.298 Sum_probs=21.1
Q ss_pred cceEEEEEEEEECCCCCCeEEEEEEEEEEeC
Q psy11062 151 SQRHTIVVSATDQGDPPMSSNLTVLLEVQDV 181 (281)
Q Consensus 151 ~~~y~l~V~a~D~~~p~~~~~~~v~I~V~Dv 181 (281)
.+.|.+.+.|.|..+ ..+.....+.+.|.
T Consensus 22 dg~yt~~v~a~D~AG--N~~~~~~~~~i~d~ 50 (60)
T PF12245_consen 22 DGEYTLTVTATDKAG--NTSSSTTQIVIVDN 50 (60)
T ss_pred CccEEEEEEEEECCC--CEEEeeeEEEEEcC
Confidence 567999999999887 34555556666664
No 30
>cd00146 PKD polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.
Probab=50.73 E-value=41 Score=22.92 Aligned_cols=30 Identities=20% Similarity=0.176 Sum_probs=21.6
Q ss_pred cCccCcceEEEEEEEEECCCCCCeEEEEEEEE
Q psy11062 146 LDYETSQRHTIVVSATDQGDPPMSSNLTVLLE 177 (281)
Q Consensus 146 LD~E~~~~y~l~V~a~D~~~p~~~~~~~v~I~ 177 (281)
..|.+...|.+++.|+|..+ .+....+.|.
T Consensus 51 ~~y~~~G~y~v~l~v~d~~g--~~~~~~~~V~ 80 (81)
T cd00146 51 HTYTKPGTYTVTLTVTNAVG--SSSTKTTTVV 80 (81)
T ss_pred EEcCCCcEEEEEEEEEeCCC--CEEEEEEEEE
Confidence 45778899999999999865 3444455554
No 31
>cd05762 Ig8_MLCK Eighth immunoglobulin (Ig)-like domain of human myosin light-chain kinase (MLCK). Ig8_MLCK: the eighth immunoglobulin (Ig)-like domain of human myosin light-chain kinase (MLCK). MLCK is a key regulator of different forms of cell motility involving actin and myosin II. Agonist stimulation of smooth muscle cells increases cytosolic Ca2+, which binds calmodulin. This Ca2+-calmodulin complex in turn binds to and activates MLCK. Activated MLCK leads to the phosphorylation of the 20 kDa myosin regulatory light chain (RLC) of myosin II and the stimulation of actin-activated myosin MgATPase activity. MLCK is widely present in vertebrate tissues; it phosphorylates the 20 kDa RLC of both smooth and nonmuscle myosin II. Phosphorylation leads to the activation of the myosin motor domain and altered structural properties of myosin II. In smooth muscle MLCK it is involved in initiating contraction. In nonmuscle cells, MLCK may participate in cell division and cell motility; it has
Probab=48.84 E-value=98 Score=22.19 Aligned_cols=37 Identities=27% Similarity=0.234 Sum_probs=24.4
Q ss_pred eecCccCcceEEEEEEEEECCCCCCeEEEEEEEEEEeCCCCCC
Q psy11062 144 KHLDYETSQRHTIVVSATDQGDPPMSSNLTVLLEVQDVNDNPP 186 (281)
Q Consensus 144 ~~LD~E~~~~y~l~V~a~D~~~p~~~~~~~v~I~V~DvND~~P 186 (281)
....++..+.|.+. |.+..+ +..+++.+.|.|..+ +|
T Consensus 61 ~~~~~~D~G~Ytc~--a~N~~G---~~~~~~~l~V~~~P~-pP 97 (98)
T cd05762 61 TEGQQEHCGCYTLE--VENKLG---SRQAQVNLTVVDKPD-PP 97 (98)
T ss_pred CCCChhhCEEEEEE--EEcCCC---ceeEEEEEEEecCCC-CC
Confidence 45556666777654 455544 567788888998876 44
No 32
>KOG4221|consensus
Probab=48.40 E-value=3.8e+02 Score=28.92 Aligned_cols=70 Identities=19% Similarity=0.230 Sum_probs=47.0
Q ss_pred eEEEEEeeCCCCCceeeEEeCceeEEEEceecCccCcceEEEEEEEEECCCCCCeEEEEEEEEEEeCCCCCCe
Q psy11062 115 KIRYHLASGNFGDSRLFAVDQTSGHLSLSKHLDYETSQRHTIVVSATDQGDPPMSSNLTVLLEVQDVNDNPPV 187 (281)
Q Consensus 115 ~i~y~l~~~~~~~~~~F~id~~tG~i~~~~~LD~E~~~~y~l~V~a~D~~~p~~~~~~~v~I~V~DvND~~P~ 187 (281)
...|++.....+....+.++.++-+.++. +.|+...|.+.|.|.+..++..++.-.-.+...|+-+-||.
T Consensus 551 I~~yk~~ys~~~~~~~~~~~~n~~e~ti~---gL~k~TeY~~~vvA~N~~G~g~sS~~i~V~Tlsd~PsaPP~ 620 (1381)
T KOG4221|consen 551 ITGYKLFYSEDDTGKELRVENNATEYTIN---GLEKYTEYSIRVVAYNSAGSGVSSADITVRTLSDVPSAPPQ 620 (1381)
T ss_pred ceEEEEEEEcCCCCceEEEecCccEEEee---cCCCccceEEEEEEecCCCCCCCCCceEEEeccCCCCCCCc
Confidence 44577765433456778887666666555 45677889999999999887665543333455677777776
No 33
>PF03160 Calx-beta: Calx-beta domain; InterPro: IPR003644 The calx-beta motif is present as a tandem repeat in the cytoplasmic domains of Calx Na-Ca exchangers, which are used to expel calcium from cells. This motif overlaps domains used for calcium binding and regulation. The calx-beta motif is also present in the cytoplasmic tail of mammalian integrin-beta4, which mediates the bi-directional transfer of signals across the plasma membrane, as well as in some cyanobacterial proteins. This motif contains a series of beta-strands and turns that form a self-contained beta-sheet [, ].; GO: 0007154 cell communication, 0016021 integral to membrane; PDB: 3H6A_B 3FSO_A 3FQ4_B 2DPK_A 2QVM_A 3GIN_B 2QVK_A 2FWU_A 2FWS_A 3E9U_A ....
Probab=48.01 E-value=1e+02 Score=22.05 Aligned_cols=53 Identities=25% Similarity=0.330 Sum_probs=31.0
Q ss_pred EEEEEecCCCCCeeccCcEEEEEeCCCCCCceEEEEEEEeCCCCCCceEEEEEeeCC
Q psy11062 68 KVTIEDRNDNAPEFESTTIRLSVPESAALGLPLYTVVARDKDSGDNGKIRYHLASGN 124 (281)
Q Consensus 68 ~i~V~dvNd~~P~f~~~~~~~~v~En~~~gt~v~~v~a~D~D~g~~~~i~y~l~~~~ 124 (281)
+|.|.| ||. |.+.-..-...+.|+. |..-..+....++....-.+.|....+.
T Consensus 2 tvtI~d-~d~-~~v~f~~~~~~v~E~~--~~~~v~V~~~~~~~~~~v~v~~~~~~gt 54 (100)
T PF03160_consen 2 TVTILD-DDD-PTVSFSSPSYTVSEGD--GTVTVTVTRSGGSLDGPVTVNYSTVDGT 54 (100)
T ss_dssp EEEEE--TTS-EEEEESSSEEEEETTS--SEEEEEEEEESS-TSSEEEEEEEEEESS
T ss_pred EEEEEC-CCC-CEEEEeCCEEEEEeCC--CEEEEEEEEcccCCCcceEEEEEEeCCc
Confidence 466777 675 5776666567788885 4445555555554333445667666643
No 34
>smart00089 PKD Repeats in polycystic kidney disease 1 (PKD1) and other proteins. Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.
Probab=44.75 E-value=62 Score=21.89 Aligned_cols=31 Identities=23% Similarity=0.258 Sum_probs=23.4
Q ss_pred ecCccCcceEEEEEEEEECCCCCCeEEEEEEEEE
Q psy11062 145 HLDYETSQRHTIVVSATDQGDPPMSSNLTVLLEV 178 (281)
Q Consensus 145 ~LD~E~~~~y~l~V~a~D~~~p~~~~~~~v~I~V 178 (281)
...|+....|.+++.|+|..+ +....++|.|
T Consensus 48 ~~~y~~~G~y~v~l~v~n~~g---~~~~~~~i~v 78 (79)
T smart00089 48 THTYTKPGTYTVTLTVTNAVG---SASATVTVVV 78 (79)
T ss_pred EEEeCCCcEEEEEEEEEcCCC---cEEEEEEEEE
Confidence 455678899999999999876 5566666655
No 35
>PF05895 DUF859: Siphovirus protein of unknown function (DUF859); InterPro: IPR008577 This entry is represented by Streptococcus phage 7201, Orf39. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This family consists of several uncharacterised proteins from a number of the Siphoviruses as well as some bacterial proteins from Streptococcus species. Some of the members of this family are described as putative minor structural proteins.
Probab=43.50 E-value=3.5e+02 Score=26.99 Aligned_cols=108 Identities=12% Similarity=0.141 Sum_probs=57.7
Q ss_pred EEEEEEEEEeCCCCceeEEEEEEEEEecCCCCCeeccCcEEEEEeCCCCCCceEEEEEEE------eCCCCCCceEEEEE
Q psy11062 47 VLLLSVQASSGSPPLYSYAQVKVTIEDRNDNAPEFESTTIRLSVPESAALGLPLYTVVAR------DKDSGDNGKIRYHL 120 (281)
Q Consensus 47 ~~~l~v~a~dg~~~~~~~~~v~i~V~dvNd~~P~f~~~~~~~~v~En~~~gt~v~~v~a~------D~D~g~~~~i~y~l 120 (281)
...|++.++|..+..+......|.|.+-+- |.+....++..-.++ .......|. +.-.....+++|+.
T Consensus 299 ~~Ti~atVtDSRGr~S~~~~~tItVl~Y~~--P~lsfsv~R~~~~~~----~~~v~~~a~Iapl~v~g~qKN~~~lt~~~ 372 (624)
T PF05895_consen 299 SATIRATVTDSRGRTSDPKTKTITVLEYSP--PTLSFSVYRCGSSGN----TLTVTRNAKIAPLTVNGVQKNTMTLTFKV 372 (624)
T ss_pred eEEEEEEEEECCCccCCceEEEEEEEEcCC--CcEEEEEEEeCCCCc----EEEEEEEEEEeEEEEcccccceEEEEEEE
Confidence 567777778877666677888999988754 776433222211111 111112221 11111223566666
Q ss_pred eeCCCCCceeeEEeCc--ee-----------EEEEceecCccCcceEEEEEEEEECCC
Q psy11062 121 ASGNFGDSRLFAVDQT--SG-----------HLSLSKHLDYETSQRHTIVVSATDQGD 165 (281)
Q Consensus 121 ~~~~~~~~~~F~id~~--tG-----------~i~~~~~LD~E~~~~y~l~V~a~D~~~ 165 (281)
.. -....|.+|.. +| ...+.+. +.....|.+.+.++|.-.
T Consensus 373 a~---~gt~~~t~d~~~a~~~~s~~s~~~~~~~~L~g~--y~~~kSy~V~~~l~D~F~ 425 (624)
T PF05895_consen 373 AP---LGTGTFTTDNGSASGTWSSISELTNSSANLGGT--YDAEKSYDVRGTLSDKFT 425 (624)
T ss_pred EE---cCcceEEEEccccccceeeeeeecccceeeccc--cCCCceEEEEEEEEEEee
Confidence 55 23455655431 11 1223333 346688999999999854
No 36
>PF06650 DUF1162: Protein of unknown function (DUF1162); InterPro: IPR009543 Proteins in this entry may play a role in the control of protein cycling through the trans-Golgi network. Vacuolar sorting protein is an ATPase required for endosomal trafficking []. Defects in the human protein VPS13A cause chorea-acanthocytosis, an autosomal recessive neurodegenerative disorder characterised by the gradual onset of hyperkinetic movements and abnormal erythrocyte morphology [].; GO: 0008104 protein localization
Probab=41.69 E-value=2.3e+02 Score=24.40 Aligned_cols=155 Identities=19% Similarity=0.269 Sum_probs=74.7
Q ss_pred CcEEEEEEeCCCCceEEEECC--ccEEEECcCCCceeccEEEEEEEEEeCCCCceeEEEEEEEEEecCCCCCeec---cC
Q psy11062 10 NPIRYTIHSGDPEGYFTLDST--TGVIRTAATLDHEVTGVLLLSVQASSGSPPLYSYAQVKVTIEDRNDNAPEFE---ST 84 (281)
Q Consensus 10 ~~l~Y~i~~g~~~~~F~Id~~--~G~i~~~~~LD~E~~~~~~l~v~a~dg~~~~~~~~~v~i~V~dvNd~~P~f~---~~ 84 (281)
..+.-.+-.+..+..|.|+.. .|.|.+...=+. ....|.|.+.+..+......+..|+| .|.|. ..
T Consensus 58 ~~~~i~v~~s~wS~~fsld~vg~~~~v~~~~~~~~-~~~~~~lgv~i~~~~~~~~~TkvVti--------~Pryii~N~t 128 (277)
T PF06650_consen 58 NRARIRVSDSSWSEPFSLDAVGSSGEVVCPSPQKS-DSEVYELGVSIELGPGKFKRTKVVTI--------TPRYIIVNRT 128 (277)
T ss_pred ceEEEEccCCcccCCEeeecCcceeEEEEecCCCC-CceEEEEEEEEEeCCCccccEEEEEE--------ECCEEEEECC
Confidence 345555555556678999854 455665432220 12234566666655444444556666 25441 22
Q ss_pred cEEEEEeCCCC----------CCceEEEEEEEeCCCCCCceEEEEEeeCCCCCceeeEEeCceeEEEEceecCccCcceE
Q psy11062 85 TIRLSVPESAA----------LGLPLYTVVARDKDSGDNGKIRYHLASGNFGDSRLFAVDQTSGHLSLSKHLDYETSQRH 154 (281)
Q Consensus 85 ~~~~~v~En~~----------~gt~v~~v~a~D~D~g~~~~i~y~l~~~~~~~~~~F~id~~tG~i~~~~~LD~E~~~~y 154 (281)
.+.+.+.|... +|..+. +.-.+.. ....+..++...+....+.|.|+.. |.+.+.-.-+ .....
T Consensus 129 ~~~i~i~q~~~~~~~~~~~l~pg~~~p-~~~~~~~--~~~~l~vr~~~~~~~WS~~f~i~~~-g~~~l~l~~~--~~~~~ 202 (277)
T PF06650_consen 129 GFPIRIRQCGSPDDDEWITLPPGESVP-FHWPDSS--KPKLLRVRFPEENWQWSGPFNIDEV-GTTYLKLRHS--DGESV 202 (277)
T ss_pred CCCEEEEECcccCCCceEEecCCCCEE-EEcccCC--CCCEEEEEECCCCCcccCCEEEccc-eEEEEEEecC--CCcEE
Confidence 22333433322 222211 1111111 2234555555333245778888754 6554432222 34445
Q ss_pred EEEEEEEECCCCCCeEEEEEEEEEEeCCCC-CC
Q psy11062 155 TIVVSATDQGDPPMSSNLTVLLEVQDVNDN-PP 186 (281)
Q Consensus 155 ~l~V~a~D~~~p~~~~~~~v~I~V~DvND~-~P 186 (281)
.+.|.+.-.+ +++.|.+.+..+. +|
T Consensus 203 ~~~v~v~~~~-------~t~~i~~~~~~~~~~P 228 (277)
T PF06650_consen 203 LLRVEVRLEG-------ATIFITFSDESDGQPP 228 (277)
T ss_pred EEEEEEEecC-------cEEEEEEEeCCCCCCC
Confidence 5555555332 4566777777665 45
No 37
>PF08329 ChitinaseA_N: Chitinase A, N-terminal domain; InterPro: IPR013540 This domain is found in a number of bacterial chitinases and similar viral proteins. It is organised into a fibronectin III module domain-like fold, comprising only beta strands. Its function is not known, but it may be involved in interaction with the enzyme substrate, chitin [, ]. It is separated by a hinge region from the catalytic domain (IPR001223 from INTERPRO); this hinge region is probably mobile, allowing the N-terminal domain to have different relative positions in solution []. ; GO: 0004568 chitinase activity; PDB: 2WLY_A 1EDQ_A 2WM0_A 1X6N_A 1NH6_A 2WK2_A 1EHN_A 2WLZ_A 1EIB_A 1FFR_A ....
Probab=40.34 E-value=66 Score=25.09 Aligned_cols=37 Identities=16% Similarity=0.190 Sum_probs=25.5
Q ss_pred ecCccCcceEEEEEEEEECCCCCCeEEEEEEEEEEeCCC
Q psy11062 145 HLDYETSQRHTIVVSATDQGDPPMSSNLTVLLEVQDVND 183 (281)
Q Consensus 145 ~LD~E~~~~y~l~V~a~D~~~p~~~~~~~v~I~V~DvND 183 (281)
.+...+.+.|.++|+++|..+ .+....+.|.|.|.+-
T Consensus 76 ~~~~~~gG~y~~~VeLCN~~G--CS~S~~~~V~VaDTDG 112 (133)
T PF08329_consen 76 TFTVTKGGRYQMQVELCNADG--CSTSAPVEVVVADTDG 112 (133)
T ss_dssp EEEE-S-EEEEEEEEEEETTE--EEE---EEEEEE-TTS
T ss_pred EEEecCCCEEEEEEEEECCCC--cccCCCEEEEEeCCCc
Confidence 456667899999999999887 6667789999999844
No 38
>TIGR03660 T1SS_rpt_143 T1SS-143 repeat domain. This model represents a domain of about 143 amino acids that may occur singly or in up to 23 tandem repeats in very large proteins in the genus Vibrio, and in related species such as Legionella pneumophila, Photobacterium profundum, Rhodopseudomonas palustris, Shewanella pealeana, and Aeromonas hydrophila. Proteins with these domains represent a subset of a broader set of proteins with a particular signal for type 1 secretion, consisting of several glycine-rich repeats modeled by pfam00353, followed by a C-terminal domain modeled by TIGR03661. Proteins with this domain tend to share several properties with the RtxA (Repeats in Toxin) protein of Vibrio cholerae, including a large size often containing tandemly repeated domains and a C-terminal signal for type 1 secretion.
Probab=38.86 E-value=1.9e+02 Score=22.62 Aligned_cols=67 Identities=28% Similarity=0.333 Sum_probs=40.3
Q ss_pred ceEEEE-CCcc--EEEECcCCCcee---ccEEEEEEEEEeCCCCceeEEEEEEEEEecCCCCCeeccCcEEEEEeCCC
Q psy11062 23 GYFTLD-STTG--VIRTAATLDHEV---TGVLLLSVQASSGSPPLYSYAQVKVTIEDRNDNAPEFESTTIRLSVPESA 94 (281)
Q Consensus 23 ~~F~Id-~~~G--~i~~~~~LD~E~---~~~~~l~v~a~dg~~~~~~~~~v~i~V~dvNd~~P~f~~~~~~~~v~En~ 94 (281)
..|.|. ...| ...+..+||+.. .-...|.|.|+|..+-. ++..+.|.|.| | .|...... ...|.|+.
T Consensus 56 ~Vftvtl~~~GsYtftL~~~lDH~~g~d~l~l~~~v~a~D~DGD~-s~~~l~VtI~D--D-~P~~~~~~-~~~V~E~~ 128 (137)
T TIGR03660 56 PVFTLTLNADGSYEFTLEGPLDHAAGSDELTLNFPIIATDFDGDT-SSITLPVTIVD--D-VPTITDVD-ALTVDEDD 128 (137)
T ss_pred EEEEEEECCCccEEEEEcccccCCCCCceEEEeeeEEEEeCCCCc-cccEEEEEEEC--C-CCeecccc-ceEEeccc
Confidence 456662 1245 456688998844 22456777788654322 23578888877 5 47765544 36788853
No 39
>PF02494 HYR: HYR domain; InterPro: IPR003410 This domain is known as the HYR (Hyalin Repeat) domain, after the protein hyalin that is composed exclusively of this repeat. This domain probably corresponds to a new superfamily in the immunoglobulin fold. The function of this domain is uncertain it may be involved in cell adhesion. In the Sushi repeat-containing protein (SrpX), this domain is found between two sushi repeats.
Probab=37.20 E-value=80 Score=21.73 Aligned_cols=25 Identities=24% Similarity=0.268 Sum_probs=20.2
Q ss_pred ceEEEEEEEEECCCCCCeEEEEEEEEE
Q psy11062 152 QRHTIVVSATDQGDPPMSSNLTVLLEV 178 (281)
Q Consensus 152 ~~y~l~V~a~D~~~p~~~~~~~v~I~V 178 (281)
+.|.++..|+|..+ ..+++.+.|.|
T Consensus 57 G~t~V~ytA~D~~G--N~a~C~f~V~V 81 (81)
T PF02494_consen 57 GTTTVTYTATDAAG--NSATCSFTVTV 81 (81)
T ss_pred ceEEEEEEEEECCC--CEEEEEEEEEC
Confidence 45889999999866 57788888875
No 40
>TIGR00845 caca sodium/calcium exchanger 1. This model is specific for the eukaryotic sodium ion/calcium ion exchangers of the Caca family
Probab=35.21 E-value=5.5e+02 Score=26.93 Aligned_cols=54 Identities=26% Similarity=0.278 Sum_probs=27.3
Q ss_pred EEEEEEEecCCCCCeeccCcEEEEEeCCCCCCceEEEEEEEeCCCCCCceEEEEEeeC
Q psy11062 66 QVKVTIEDRNDNAPEFESTTIRLSVPESAALGLPLYTVVARDKDSGDNGKIRYHLASG 123 (281)
Q Consensus 66 ~v~i~V~dvNd~~P~f~~~~~~~~v~En~~~gt~v~~v~a~D~D~g~~~~i~y~l~~~ 123 (281)
..+|.|.| ||++|.|.-..-...|.|+. |+.-..|.-+-.-. ..-.+.|+...|
T Consensus 516 ~ATVTIlD-DD~aGIfsFe~~~~sV~Es~--G~vtvtV~RtsGa~-G~VtV~Y~T~dG 569 (928)
T TIGR00845 516 TATVTILD-DDHAGIFTFEEDVFHVSESI--GIMEVKVLRTSGAR-GTVIVPYRTVEG 569 (928)
T ss_pred eEEEEEec-CcccCcccccCceEEEEcCC--CEEEEEEEEcCCCC-eeEEEEEEeecC
Confidence 44555666 66777766554456777773 54433332221111 122455665544
No 41
>PF13750 Big_3_3: Bacterial Ig-like domain (group 3)
Probab=31.89 E-value=2.7e+02 Score=22.25 Aligned_cols=38 Identities=21% Similarity=0.382 Sum_probs=23.5
Q ss_pred CcceEEEEE-EEEECCCCCCeEEEEEEEEEEeCCCCCCeeeC
Q psy11062 150 TSQRHTIVV-SATDQGDPPMSSNLTVLLEVQDVNDNPPVFER 190 (281)
Q Consensus 150 ~~~~y~l~V-~a~D~~~p~~~~~~~v~I~V~DvND~~P~f~~ 190 (281)
..+.|.+.+ .|.|..+...+.....++.| +..+|.+.-
T Consensus 13 ~dG~Y~l~~~~a~D~agN~~~~~~~~~~~i---D~T~Ptisi 51 (158)
T PF13750_consen 13 PDGSYTLTVVTATDAAGNTSTSTVSETFTI---DNTPPTISI 51 (158)
T ss_pred CCccEEEEEEEEEecCCCEEEEEEeeEEEE---cCCCCEEEE
Confidence 357899999 79998874333333323433 345788754
No 42
>PF13754 Big_3_4: Bacterial Ig-like domain (group 3)
Probab=30.80 E-value=1.4e+02 Score=18.88 Aligned_cols=18 Identities=33% Similarity=0.340 Sum_probs=14.5
Q ss_pred CcceEEEEEEEEECCCCC
Q psy11062 150 TSQRHTIVVSATDQGDPP 167 (281)
Q Consensus 150 ~~~~y~l~V~a~D~~~p~ 167 (281)
....|.+++.|+|..+..
T Consensus 22 ~dG~y~itv~a~D~AGN~ 39 (54)
T PF13754_consen 22 ADGTYTITVTATDAAGNT 39 (54)
T ss_pred CCccEEEEEEEEeCCCCC
Confidence 367899999999987653
No 43
>COG4288 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=25.88 E-value=1.6e+02 Score=22.07 Aligned_cols=59 Identities=24% Similarity=0.295 Sum_probs=37.5
Q ss_pred CccCcceEEEEEEEEECCCCCCeEEEEEEEEEEeCCCCCCeeeCCceEEEEeCCCCCCcEEEEEEeeEEEeecCcc
Q psy11062 147 DYETSQRHTIVVSATDQGDPPMSSNLTVLLEVQDVNDNPPVFERSEYFVSVSEALPAKSQVRTTCTPLYVLVEGQC 222 (281)
Q Consensus 147 D~E~~~~y~l~V~a~D~~~p~~~~~~~v~I~V~DvND~~P~f~~~~y~~~v~E~~~~gt~v~~v~A~~yd~d~g~~ 222 (281)
|+|..+.|.++|.| .-+++++..|.+|-.+.-....|...+.-|.+ +|.+.+- +|.|.+
T Consensus 53 Dr~pvgpyevevaa----------rrt~hlRfndL~dpe~iP~d~~yasviesnvP---vVvQ~tR----LDsgq~ 111 (124)
T COG4288 53 DREPVGPYEVEVAA----------RRTLHLRFNDLGDPEAIPKDTPYASVIESNVP---VVVQLTR----LDSGQA 111 (124)
T ss_pred cCCCCCceEEEeec----------ceeEEEEecccCCcccCCCCCchhhheecCCc---eEEEEEE----eccCCc
Confidence 56666667777643 35678899999985555445566666666665 4555543 455544
No 44
>cd02848 Chitinase_N_term Chitinase N-terminus domain. Chitinases hydrolyze the abundant natural biopolymer chitin, producing smaller chito-oligosaccharides. Chitin consists of multiple N-acetyl-D-glucosamine (NAG) residues connected via beta-1,4-glycosidic linkages and is an important structural element of fungal cell wall and arthropod exoskeletons. On the basis of the mode of chitin hydrolysis, chitinases are classified as random, endo-, and exo-chitinases and based on sequence criteria, chitinases belong to families 18 and 19 of glycosyl hydrolases. The N-terminus of chitinase may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitob
Probab=25.61 E-value=1.4e+02 Score=22.26 Aligned_cols=34 Identities=18% Similarity=0.224 Sum_probs=22.0
Q ss_pred CCCceeccEEEEEEEEEeCCCCceeEEEEEEEEEe
Q psy11062 39 TLDHEVTGVLLLSVQASSGSPPLYSYAQVKVTIED 73 (281)
Q Consensus 39 ~LD~E~~~~~~l~v~a~dg~~~~~~~~~v~i~V~d 73 (281)
.+++.+-..|.+.|+++++++ .+..+.+.|.|-|
T Consensus 73 t~~v~kgG~y~m~V~lCn~dG-CS~S~~~~I~VAD 106 (106)
T cd02848 73 TFKVGKGGRYQMQVALCNGDG-CSTSAAKEIVVAD 106 (106)
T ss_pred EEEeCCCCeEEEEEEEECCCC-ccCcCCEEEEecC
Confidence 355666678999999997765 3344455555543
No 45
>cd05850 Ig1_Contactin-2 First Ig domain of contactin-2. Ig1_Contactin-2: First Ig domain of the neural cell adhesion molecule contactin-2-like. Contactins are comprised of six Ig domains followed by four fibronectin type III (FnIII) domains anchored to the membrane by glycosylphosphatidylinositol. Contactin-2 (TAG-1, axonin-1) facilitates cell adhesion by homophilic binding between molecules in apposed membranes. It may play a part in the neuronal processes of neurite outgrowth, axon guidance and fasciculation, and neuronal migration. The first four Ig domains form the intermolecular binding fragment, which arranges as a compact U-shaped module by contacts between IG domains 1 and 4, and domains 2 and 3. The different contactins show different expression patterns in the central nervous system. During development and in adulthood, contactin-2 is transiently expressed in subsets of central and peripheral neurons. Contactin-2 is also expressed in retinal amacrine cells in the developing c
Probab=24.80 E-value=2.5e+02 Score=19.69 Aligned_cols=80 Identities=15% Similarity=0.198 Sum_probs=41.0
Q ss_pred CeeccCcEEEEEeCCCCCCceEEEEEEEeCCCCCCceEEEEEeeCCCC--CceeeEEeCceeEEEEceecCccCcceEEE
Q psy11062 79 PEFESTTIRLSVPESAALGLPLYTVVARDKDSGDNGKIRYHLASGNFG--DSRLFAVDQTSGHLSLSKHLDYETSQRHTI 156 (281)
Q Consensus 79 P~f~~~~~~~~v~En~~~gt~v~~v~a~D~D~g~~~~i~y~l~~~~~~--~~~~F~id~~tG~i~~~~~LD~E~~~~y~l 156 (281)
|.|...+....++|...-++......|... +...+++...+.... ....+.+. .|.|........+..+.|..
T Consensus 2 P~f~~~P~~~~~~~~~~g~~v~l~C~a~g~---P~p~i~W~~~g~~l~~~~~~~~~~~--~g~L~I~~~~~~~D~G~Y~C 76 (94)
T cd05850 2 PVFEEQPSSLLFPEGSPEEKVTLGCRARAS---PPATYRWKMNGTEIKFAPESRYTLV--AGNLVINNPQKARDAGSYQC 76 (94)
T ss_pred CeeEECCCCEEecCCCCCCeEEEEeEceeC---CCCEEEEEECCEECccCCCceEEEE--CCeEEEccCCccCcCEEEEE
Confidence 777777767777777653455556665432 123455554321111 12234442 46666655444455566654
Q ss_pred EEEEEECCC
Q psy11062 157 VVSATDQGD 165 (281)
Q Consensus 157 ~V~a~D~~~ 165 (281)
.|++..+
T Consensus 77 --~A~N~~G 83 (94)
T cd05850 77 --LAINRCG 83 (94)
T ss_pred --EEEcCcC
Confidence 4555544
No 46
>PF09100 Qn_am_d_aIV: Quinohemoprotein amine dehydrogenase, alpha subunit domain IV; InterPro: IPR015184 This domain is predominantly found in the prokaryotic protein quinohemoprotein amine dehydrogenase, adopting an immunoglobulin-like beta-sandwich fold, with seven strands arranged into two beta sheets; the fold is possibly related to the immunoglobulin and/or fibronectin type III superfamilies. The precise function of this domain has not, as yet, been defined []. ; PDB: 1JMZ_A 1JMX_A 1PBY_A 1JJU_A.
Probab=21.74 E-value=1.7e+02 Score=22.54 Aligned_cols=31 Identities=35% Similarity=0.461 Sum_probs=16.6
Q ss_pred EEEEEEEE--eCCCCceeEEEEEEEEEecCCCCC
Q psy11062 48 LLLSVQAS--SGSPPLYSYAQVKVTIEDRNDNAP 79 (281)
Q Consensus 48 ~~l~v~a~--dg~~~~~~~~~v~i~V~dvNd~~P 79 (281)
=.|.|.|+ ++..+++....+.+.|.+-|+ +|
T Consensus 100 Gnl~VvAtv~d~~~~l~~e~~liVtVqr~~~-pp 132 (133)
T PF09100_consen 100 GNLKVVATVKDGGKPLTGEAHLIVTVQRWNN-PP 132 (133)
T ss_dssp EEEEEEEEETTTT---EEEEEEEEE---S----S
T ss_pred ccEEEEEEEccCCcccceeEeEEEEeecccC-CC
Confidence 35666666 777788899999999988887 44
No 47
>PF13287 Fn3_assoc: Fn3 associated
Probab=21.03 E-value=1.7e+02 Score=19.12 Aligned_cols=21 Identities=24% Similarity=0.417 Sum_probs=15.6
Q ss_pred CEEEEecCCCcEEEEEEeCCC
Q psy11062 1 MKILLLDQSNPIRYTIHSGDP 21 (281)
Q Consensus 1 ~~~~d~d~~~~l~Y~i~~g~~ 21 (281)
|++...++++.|.|++-+.+|
T Consensus 1 V~L~s~~~~~~I~YT~DGs~P 21 (59)
T PF13287_consen 1 VTLTSPDPGAKIYYTTDGSEP 21 (59)
T ss_pred CEEECCCCCCEEEEeCCCCCC
Confidence 456656889999999965555
Done!