Query psy2920
Match_columns 325
No_of_seqs 175 out of 1490
Neff 8.7
Searched_HMMs 46136
Date Fri Aug 16 21:00:01 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy2920.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/2920hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG4289|consensus 100.0 1.9E-42 4.1E-47 344.0 25.4 269 1-322 174-456 (2531)
2 KOG4289|consensus 100.0 8.7E-42 1.9E-46 339.4 28.6 272 2-323 280-559 (2531)
3 KOG1219|consensus 100.0 1.6E-36 3.4E-41 310.4 32.4 281 1-322 650-1036(4289)
4 KOG1219|consensus 100.0 4E-36 8.7E-41 307.4 30.0 268 2-322 964-1240(4289)
5 cd00031 CA Cadherin repeat dom 100.0 5.3E-30 1.1E-34 220.9 27.1 186 1-213 7-199 (199)
6 cd00031 CA Cadherin repeat dom 99.9 9.5E-24 2.1E-28 181.8 24.0 174 115-321 2-184 (199)
7 PF00028 Cadherin: Cadherin do 99.6 3.8E-15 8.3E-20 112.6 12.9 86 115-212 1-93 (93)
8 KOG1834|consensus 99.6 2.5E-14 5.5E-19 136.1 20.0 174 13-211 54-242 (952)
9 PF00028 Cadherin: Cadherin do 99.6 5.9E-15 1.3E-19 111.6 12.8 86 1-101 6-93 (93)
10 smart00112 CA Cadherin repeats 99.6 2.2E-14 4.7E-19 105.0 10.8 77 17-108 2-79 (79)
11 KOG1834|consensus 99.5 3.5E-13 7.6E-18 128.4 17.4 191 98-321 21-229 (952)
12 smart00112 CA Cadherin repeats 99.4 3.5E-12 7.6E-17 93.2 9.9 72 146-219 2-79 (79)
13 PF08266 Cadherin_2: Cadherin- 97.2 0.00019 4.2E-09 52.7 2.2 61 1-75 8-68 (84)
14 PF08758 Cadherin_pro: Cadheri 96.5 0.02 4.4E-07 42.6 8.1 78 106-196 2-80 (90)
15 smart00736 CADG Dystroglycan-t 96.1 0.13 2.7E-06 38.8 10.5 68 144-216 24-96 (97)
16 PF08266 Cadherin_2: Cadherin- 96.0 0.018 3.9E-07 42.3 5.5 57 114-181 2-66 (84)
17 smart00736 CADG Dystroglycan-t 94.4 0.83 1.8E-05 34.3 10.4 64 26-105 30-96 (97)
18 PF13750 Big_3_3: Bacterial Ig 93.7 2.8 6.1E-05 34.6 13.1 122 181-323 13-138 (158)
19 PF08758 Cadherin_pro: Cadheri 92.0 2.2 4.8E-05 31.7 9.1 66 1-89 16-81 (90)
20 TIGR01965 VCBS_repeat VCBS rep 91.8 4.4 9.5E-05 30.6 10.5 89 11-122 2-97 (99)
21 PF05345 He_PIG: Putative Ig d 89.3 1.9 4.2E-05 28.0 5.9 35 161-195 14-48 (49)
22 KOG3597|consensus 88.7 13 0.00027 36.0 13.3 156 92-274 24-199 (442)
23 PF13750 Big_3_3: Bacterial Ig 87.6 15 0.00032 30.3 15.7 111 74-196 14-136 (158)
24 TIGR01965 VCBS_repeat VCBS rep 87.5 6 0.00013 29.9 8.3 71 140-219 2-85 (99)
25 PF07495 Y_Y_Y: Y_Y_Y domain; 84.5 10 0.00022 25.7 8.6 56 151-211 9-65 (66)
26 TIGR00845 caca sodium/calcium 84.1 62 0.0014 34.4 18.3 29 94-123 515-543 (928)
27 PF05895 DUF859: Siphovirus pr 81.9 38 0.00083 34.4 13.5 113 73-195 296-424 (624)
28 TIGR03660 T1SS_rpt_143 T1SS-14 76.2 39 0.00084 27.2 11.1 61 60-125 67-130 (137)
29 KOG4221|consensus 75.3 1.3E+02 0.0029 32.9 22.6 66 28-109 552-620 (1381)
30 PF05345 He_PIG: Putative Ig d 65.3 26 0.00057 22.6 5.3 36 38-88 13-49 (49)
31 PF13754 Big_3_4: Bacterial Ig 53.7 40 0.00088 22.1 4.7 17 308-324 24-40 (54)
32 KOG4221|consensus 52.5 3.6E+02 0.0079 29.8 22.6 121 74-219 493-619 (1381)
33 PF07495 Y_Y_Y: Y_Y_Y domain; 49.8 34 0.00073 23.0 4.1 29 73-101 37-66 (66)
34 PF12245 Big_3_2: Bacterial Ig 49.7 71 0.0015 21.5 5.6 31 73-104 21-51 (60)
35 PF13860 FlgD_ig: FlgD Ig-like 39.6 1.2E+02 0.0026 21.6 5.8 13 308-320 69-81 (81)
36 smart00089 PKD Repeats in poly 38.5 80 0.0017 21.9 4.7 30 69-100 49-78 (79)
37 PF02494 HYR: HYR domain; Int 37.9 75 0.0016 22.5 4.5 27 73-100 55-81 (81)
38 smart00089 PKD Repeats in poly 34.3 90 0.0019 21.6 4.4 29 178-211 50-78 (79)
39 PF10365 DUF2436: Domain of un 29.9 3E+02 0.0065 22.2 7.3 126 62-195 27-156 (161)
40 PF02010 REJ: REJ domain; Int 28.6 90 0.0019 29.9 4.7 42 167-212 145-186 (440)
41 cd00146 PKD polycystic kidney 28.6 1.4E+02 0.0031 20.7 4.7 19 70-88 52-70 (81)
42 cd02848 Chitinase_N_term Chiti 26.9 1.8E+02 0.0039 22.3 5.0 34 68-102 73-106 (106)
43 KOG3513|consensus 26.5 8.8E+02 0.019 26.5 20.5 121 71-219 488-619 (1051)
44 cd05760 Ig2_PTK7 Second immuno 21.6 2.8E+02 0.0061 19.0 6.2 47 162-213 30-76 (77)
45 KOG0196|consensus 21.0 1E+03 0.022 25.4 12.2 38 178-219 404-444 (996)
No 1
>KOG4289|consensus
Probab=100.00 E-value=1.9e-42 Score=344.04 Aligned_cols=269 Identities=25% Similarity=0.318 Sum_probs=242.9
Q ss_pred CCCCCCCceEEeeEEEecCCCCCCCceEEEEEeeC---CCCceEEEeCCCCccccccccCCcceEEEecccCCcccccEE
Q psy2920 1 MENSKSGTGVKGLPILVASSHKHPHQLVRYSITQG---NDNHTFALKNPTTHEIRNSTESKSGVLLVTSGDIDREEVAQY 77 (325)
Q Consensus 1 ~En~~~Gt~v~~~~a~d~D~~~~~~~~i~y~i~~~---~~~~~F~i~~~~g~~~~~~~~~~~~~~l~~~~~LDrE~~~~~ 77 (325)
|||.|.||.|+.+.|.|+|+ +++.|++... ...+.|+||+.+| .|++.+.||||....|
T Consensus 174 pEn~pagT~iasv~A~~~~a-----~rl~Ysm~al~dsRS~~lFslD~~sG-------------~irta~~lDREt~e~H 235 (2531)
T KOG4289|consen 174 PENEPAGTIIASVKASDPDA-----GRLYYSMVALFDSRSQNLFSLDPMSG-------------AIRTAKSLDRETKETH 235 (2531)
T ss_pred cCCCCCCceeEEEEecCCCc-----CceEEEeeeccchhccccEeeccccc-------------cchhhhhhhhhhhhee
Confidence 89999999999999999983 6799998753 2347999999999 9999999999999999
Q ss_pred EEEEEEEeCCCCC-eeEEEEEEEEecCCCCCCccCcceeEEEEeccCCCCCCCcccccceeEEEEEEEEeCCC---CeEE
Q psy2920 78 SLVITATDVKGIN-KAETHVTVDVLDENDNSPVFSQKVYRFIVGDLKSSLNSNVSHWKKFSSIGSVSATDADG---DKVA 153 (325)
Q Consensus 78 ~l~v~a~D~~~~~-~~~~~v~v~V~DvNdn~P~f~~~~~~~~v~E~~~~g~~~~~~~~~~~~v~~v~A~D~D~---~~v~ 153 (325)
.|+|+|.|.+.|. +++++|+|.|+|+|||.|+|.+..|.-.+.||.+.| ..|.+|+|+|.|. +.|.
T Consensus 236 vlrVtA~d~~~P~~SAtttv~V~V~D~nDhsPvFEq~~Y~e~lREn~evG----------y~vLtvrAtD~Dsp~Nani~ 305 (2531)
T KOG4289|consen 236 VLRVTAQDHGDPRRSATTTVTVLVLDTNDHSPVFEQDEYREELRENLEVG----------YEVLTVRATDGDSPPNANIR 305 (2531)
T ss_pred EEEEEeeecCCCcccceeEEEEEEeecCCCCcccchhHHHHHHhhccccC----------ceEEEEEeccCCCCCCCceE
Confidence 9999999999887 899999999999999999999999999999999999 9999999999998 5899
Q ss_pred EEEecC--CCcEEEeCCceEEEEccC-CCCCCceEEEEEEEEeCCCCCCcceecEEEEEEEec-CCCCCceeeeccccCC
Q psy2920 154 YKLAVP--SNLVVIVPQTGELLLTGD-PPSDDVEYELTVEAHDLQSPSRYAREPAQVWLQFNS-PVLQDEQLLQMIDRKS 229 (325)
Q Consensus 154 Y~i~~~--~~~F~i~~~tG~i~l~~~-d~e~~~~y~l~V~a~D~g~p~~~s~~t~~v~I~v~d-~~~~P~~~~~~~~~~~ 229 (325)
|++.++ .+.|+|++.+|.|.+..+ |+|....|.|.|+|+|.|.|+..+ |+.|.|+|+| |||+|. |..
T Consensus 306 Yrl~eg~~~~~f~in~rSGvI~T~a~lDRE~~~~y~L~VeAsDqG~~pgp~--Ta~V~itV~D~NDNaPq-Fse------ 376 (2531)
T KOG4289|consen 306 YRLLEGNAKNVFEINPRSGVISTRAPLDREELESYQLDVEASDQGRPPGPR--TAMVEITVEDENDNAPQ-FSE------ 376 (2531)
T ss_pred EEecCCCccceeEEcCccceeeccCccCHHhhhheEEEEEeccCCCCCCCc--eEEEEEEEEecCCCCcc-ccc------
Confidence 999986 578999999999999999 999999999999999999988888 9999999999 667777 322
Q ss_pred CCccceeeeeEeeccCC---ceEEEEEeCCCCccceEEEEEEeCCcccceEeeCCCCceEeCCCeeEEEceeCccccCCC
Q psy2920 230 EAPHRIEKRRVTRAVRP---TKRIDFSETDGEMEGRIVFALEKETEREKFKIRDENPWITVDENGDVRVKKRWDFEELGP 306 (325)
Q Consensus 230 ~~~~~~~~~~v~~~~~~---~~~v~a~d~D~~~~~~i~y~i~~~~~~~~F~i~~~~~~f~i~~tG~i~~~~~lD~E~~~~ 306 (325)
..|..++.|++.+ +.+|+|+|.|.|.|+.+.|+|.+|+..+.|.|+. .+|+|.+..+||+|..
T Consensus 377 ----~~Yvvqv~Edvt~~avvlrV~AtDrD~g~Ng~VHYsi~Sgn~~G~f~id~--------~tGel~vv~plD~e~~-- 442 (2531)
T KOG4289|consen 377 ----KRYVVQVREDVTPPAVVLRVTATDRDKGTNGKVHYSIASGNGRGQFYIDS--------LTGELDVVEPLDFENS-- 442 (2531)
T ss_pred ----cceEEEecccCCCCceEEEEEecccCCCcCceEEEEeeccCccccEEEec--------ccceEEEeccccccCC--
Confidence 2234555555554 6889999999999999999999999999999987 6999999999999994
Q ss_pred CceEEEEEEEecCCCC
Q psy2920 307 EKTIDFWVTINNQATN 322 (325)
Q Consensus 307 ~~~~~l~V~a~D~~~~ 322 (325)
.|.+.|.|.|.|-+
T Consensus 443 --~ytl~IrAqDggrP 456 (2531)
T KOG4289|consen 443 --EYTLRIRAQDGGRP 456 (2531)
T ss_pred --eeEEEEEcccCCCC
Confidence 89999999999865
No 2
>KOG4289|consensus
Probab=100.00 E-value=8.7e-42 Score=339.35 Aligned_cols=272 Identities=24% Similarity=0.295 Sum_probs=239.7
Q ss_pred CCCCCCceEEeeEEEecCCCCCCCceEEEEEeeCCCCceEEEeCCCCccccccccCCcceEEEecccCCcccccEEEEEE
Q psy2920 2 ENSKSGTGVKGLPILVASSHKHPHQLVRYSITQGNDNHTFALKNPTTHEIRNSTESKSGVLLVTSGDIDREEVAQYSLVI 81 (325)
Q Consensus 2 En~~~Gt~v~~~~a~d~D~~~~~~~~i~y~i~~~~~~~~F~i~~~~g~~~~~~~~~~~~~~l~~~~~LDrE~~~~~~l~v 81 (325)
||.++|+.|.+++|+|.|++.+ +.|.|++.+++....|.|++.+| .|.+..+||||+...|+|.|
T Consensus 280 En~evGy~vLtvrAtD~Dsp~N--ani~Yrl~eg~~~~~f~in~rSG-------------vI~T~a~lDRE~~~~y~L~V 344 (2531)
T KOG4289|consen 280 ENLEVGYEVLTVRATDGDSPPN--ANIRYRLLEGNAKNVFEINPRSG-------------VISTRAPLDREELESYQLDV 344 (2531)
T ss_pred hccccCceEEEEEeccCCCCCC--CceEEEecCCCccceeEEcCccc-------------eeeccCccCHHhhhheEEEE
Confidence 8999999999999999999875 99999999998889999999999 99999999999999999999
Q ss_pred EEEeCCCCC-eeEEEEEEEEecCCCCCCccCcceeEEEEeccCCCCCCCcccccceeEEEEEEEEeCCC---CeEEEEEe
Q psy2920 82 TATDVKGIN-KAETHVTVDVLDENDNSPVFSQKVYRFIVGDLKSSLNSNVSHWKKFSSIGSVSATDADG---DKVAYKLA 157 (325)
Q Consensus 82 ~a~D~~~~~-~~~~~v~v~V~DvNdn~P~f~~~~~~~~v~E~~~~g~~~~~~~~~~~~v~~v~A~D~D~---~~v~Y~i~ 157 (325)
.|.|.|.++ ..++.|.|+|.|+|||+|+|....|.++|.|+..++ +.|.+|.|+|.|. +.|+|+|.
T Consensus 345 eAsDqG~~pgp~Ta~V~itV~D~NDNaPqFse~~Yvvqv~Edvt~~----------avvlrV~AtDrD~g~Ng~VHYsi~ 414 (2531)
T KOG4289|consen 345 EASDQGRPPGPRTAMVEITVEDENDNAPQFSEKRYVVQVREDVTPP----------AVVLRVTATDRDKGTNGKVHYSIA 414 (2531)
T ss_pred EeccCCCCCCCceEEEEEEEEecCCCCccccccceEEEecccCCCC----------ceEEEEEecccCCCcCceEEEEee
Confidence 999998765 569999999999999999999999999999999998 9999999999997 58999999
Q ss_pred cCC--CcEEEeCCceEEEEccC-CCCCCceEEEEEEEEeCCCCCCcceecEEEEEEEecC-CCCCceeeeccccCCCCcc
Q psy2920 158 VPS--NLVVIVPQTGELLLTGD-PPSDDVEYELTVEAHDLQSPSRYAREPAQVWLQFNSP-VLQDEQLLQMIDRKSEAPH 233 (325)
Q Consensus 158 ~~~--~~F~i~~~tG~i~l~~~-d~e~~~~y~l~V~a~D~g~p~~~s~~t~~v~I~v~d~-~~~P~~~~~~~~~~~~~~~ 233 (325)
+++ +.|.||..||+|.+..+ |+|.. .|.|.|+|.|+|-|++++ +.-+.|+|.|+ |++|. |... .+...+
T Consensus 415 Sgn~~G~f~id~~tGel~vv~plD~e~~-~ytl~IrAqDggrPpLsn--~sgl~iqVlDINDhaPi-fvst---pfq~tv 487 (2531)
T KOG4289|consen 415 SGNGRGQFYIDSLTGELDVVEPLDFENS-EYTLRIRAQDGGRPPLSN--TSGLVIQVLDINDHAPI-FVST---PFQATV 487 (2531)
T ss_pred ccCccccEEEecccceEEEeccccccCC-eeEEEEEcccCCCCCccC--CCceEEEEEecCCCCce-eEec---hhhhhh
Confidence 865 68999999999999999 99987 999999999999999998 66777999995 55555 4432 233444
Q ss_pred ceeeeeEeeccCCceEEEEEeCCCCccceEEEEEEeCCcccceEeeCCCCceEeCCCeeEEEceeCccccCCCCceEEEE
Q psy2920 234 RIEKRRVTRAVRPTKRIDFSETDGEMEGRIVFALEKETEREKFKIRDENPWITVDENGDVRVKKRWDFEELGPEKTIDFW 313 (325)
Q Consensus 234 ~~~~~~v~~~~~~~~~v~a~d~D~~~~~~i~y~i~~~~~~~~F~i~~~~~~f~i~~tG~i~~~~~lD~E~~~~~~~~~l~ 313 (325)
.|+ .+++.. +..+.|.|.|.|.|+++.|++.+.. .|.|+. .+|.|++.|.||||+. ..|.|.
T Consensus 488 lEn-v~lg~~---v~~vqaidadsg~na~l~y~laG~~---pf~I~~--------~SG~Itvtk~ldrEt~---~~ysl~ 549 (2531)
T KOG4289|consen 488 LEN-VPLGYL---VCHVQAIDADSGENARLHYSLAGVG---PFQINN--------GSGWITVTKELDRETV---EHYSLG 549 (2531)
T ss_pred hhc-ccccce---EEEEecccCCCCcccceeeeeccCC---CeeEec--------CCceEEEeeccccccc---ceEEEE
Confidence 442 233333 4779999999999999999997654 588875 5999999999999999 899999
Q ss_pred EEEecCCCCC
Q psy2920 314 VTINNQATNG 323 (325)
Q Consensus 314 V~a~D~~~~~ 323 (325)
|+|+|.|+++
T Consensus 550 V~ard~gtp~ 559 (2531)
T KOG4289|consen 550 VEARDHGTPP 559 (2531)
T ss_pred EEEcCCCCCc
Confidence 9999999764
No 3
>KOG1219|consensus
Probab=100.00 E-value=1.6e-36 Score=310.44 Aligned_cols=281 Identities=21% Similarity=0.283 Sum_probs=248.4
Q ss_pred CCCCCCCceEEeeEEEecCCCCCCCceEEEEEeeCCCCceEEEeCCCCccccccccCCcceEEEecccCCcccccEEEEE
Q psy2920 1 MENSKSGTGVKGLPILVASSHKHPHQLVRYSITQGNDNHTFALKNPTTHEIRNSTESKSGVLLVTSGDIDREEVAQYSLV 80 (325)
Q Consensus 1 ~En~~~Gt~v~~~~a~d~D~~~~~~~~i~y~i~~~~~~~~F~i~~~~g~~~~~~~~~~~~~~l~~~~~LDrE~~~~~~l~ 80 (325)
+|+.|+|+.++.+.|+|.|.+.+ +.+.|-|..++.+..|.|+..+| .|.+..+||+|....|.|.
T Consensus 650 ~Edvpigt~la~L~atD~Dtgfn--g~l~yvI~dgne~~~~~Id~qsg-------------~itvas~ld~~~t~~yiLn 714 (4289)
T KOG1219|consen 650 PEDVPIGTTLAILSATDSDTGFN--GKLVYVIEDGNESICFLIDRQSG-------------NITVASPLDNENTEQYILN 714 (4289)
T ss_pred cccCCCCceEEEEeccCCCCCcC--ceEEEEEeCCccceEEEEecccc-------------eEEEecchhhhhhheeEEE
Confidence 69999999999999999998875 99999999998889999999999 9999999999999999999
Q ss_pred EEEEeCCCCC-eeEEEEEEEEecCCCCCCccCcceeEEEEeccCCCCCCCcccccceeEEEEEEEEeCCC---CeEEEEE
Q psy2920 81 ITATDVKGIN-KAETHVTVDVLDENDNSPVFSQKVYRFIVGDLKSSLNSNVSHWKKFSSIGSVSATDADG---DKVAYKL 156 (325)
Q Consensus 81 v~a~D~~~~~-~~~~~v~v~V~DvNdn~P~f~~~~~~~~v~E~~~~g~~~~~~~~~~~~v~~v~A~D~D~---~~v~Y~i 156 (325)
++|.|.|.|. ++...+.|.|.|+|||+|+|.+..|.+.|+|+..+| +.|+.|.|.|.|. |+++|+|
T Consensus 715 vta~D~gtPqkss~r~l~v~vkd~ndn~p~f~e~sy~vtvsedtepg----------s~Ia~vetnd~D~g~NG~v~fsL 784 (4289)
T KOG1219|consen 715 VTAYDLGTPQKSSWRLLLVFVKDYNDNTPIFVERSYHVTVSEDTEPG----------SFIAHVETNDTDGGNNGMVSFSL 784 (4289)
T ss_pred EEEecCCCchhhceeeEEEEEEecccCCccccccceEEEEecCCCCC----------ceEEEEEecccCCCCCceEEEEe
Confidence 9999999887 677788899999999999999999999999999999 9999999999997 6999999
Q ss_pred ecCCCcEEEeCCceEEEEccC-CCCCCceEEEEEEEEeCCCCCCcceecEEEEEEEecCCCCCceeeeccccCCCCccce
Q psy2920 157 AVPSNLVVIVPQTGELLLTGD-PPSDDVEYELTVEAHDLQSPSRYAREPAQVWLQFNSPVLQDEQLLQMIDRKSEAPHRI 235 (325)
Q Consensus 157 ~~~~~~F~i~~~tG~i~l~~~-d~e~~~~y~l~V~a~D~g~p~~~s~~t~~v~I~v~d~~~~P~~~~~~~~~~~~~~~~~ 235 (325)
...+..|+||+.||++.+.+. |+|.+..|.|.|+|+|.+.|.+.+ .+.+.|.++|+|++||.+..+.. ...+++
T Consensus 785 ~n~sdvfsIdp~tGivv~~~sLdrE~q~~y~l~I~a~dqp~pq~~s--vv~l~vsvedVndnpPkci~~hs---r~kipe 859 (4289)
T KOG1219|consen 785 LNKSDVFSIDPFTGIVVTSKSLDREGQTSYHLKIEARDQPPPQLFS--VVELDVSVEDVNDNPPKCIIRHS---RSKIPE 859 (4289)
T ss_pred cCCcceEEecCcccEEEeccccCcccCceeEEEEEEcCCCCCceEE--EEEEEEEEeeccCCCCccccccc---cccCcc
Confidence 999999999999999999999 999999999999999999888877 88888999998776664543321 111111
Q ss_pred --------------------------------------------------------------------------------
Q psy2920 236 -------------------------------------------------------------------------------- 235 (325)
Q Consensus 236 -------------------------------------------------------------------------------- 235 (325)
T Consensus 860 dlp~gt~~~~l~A~d~diGq~~kvry~l~~~~v~~rvd~~sGavfi~~~LDf~k~~fynLsv~a~d~g~p~lss~chl~V 939 (4289)
T KOG1219|consen 860 DLPYGTVTWQLVALDPDIGQLGKVRYYLTDDTVGERVDFPSGAVFIGKPLDFEKSDFYNLSVTAVDRGTPILSSICHLEV 939 (4289)
T ss_pred cCCCceEEEEhhhcCcccCcCceeEEEEecCccccccccccccEEEecccccccccceEEEEEEecCCCcceeeeEEEEE
Confidence
Q ss_pred ------------------eeeeEeeccCC---ceEEEEEeCCCCccceEEEEEEeCCcccceEeeCCCCceEeCCCeeEE
Q psy2920 236 ------------------EKRRVTRAVRP---TKRIDFSETDGEMEGRIVFALEKETEREKFKIRDENPWITVDENGDVR 294 (325)
Q Consensus 236 ------------------~~~~v~~~~~~---~~~v~a~d~D~~~~~~i~y~i~~~~~~~~F~i~~~~~~f~i~~tG~i~ 294 (325)
....|.|+.+. +..|.|.|.|.|..+.+.|+|..|+.-+.|+|+. .+|.|.
T Consensus 940 evldv~enlhpp~F~~~v~e~~V~EnapiGT~vi~i~A~dedsgldg~l~Y~I~~gdg~g~FsId~--------~tG~ir 1011 (4289)
T KOG1219|consen 940 EVLDVNENLHPPEFISFVTEGHVLENAPIGTIVIRIQARDEDSGLDGELSYKIRTGDGDGIFSIDS--------TTGSIR 1011 (4289)
T ss_pred EEeccCCCCCCcchheeeeeeeEeecCCcceEEEEEEEecCCCCccceEEEEEEcCCcceeEEecC--------CcceEe
Confidence 12334444444 4558999999999999999999999889999986 699999
Q ss_pred EceeCccccCCCCceEEEEEEEecCCCC
Q psy2920 295 VKKRWDFEELGPEKTIDFWVTINNQATN 322 (325)
Q Consensus 295 ~~~~lD~E~~~~~~~~~l~V~a~D~~~~ 322 (325)
+.+.||||.. ..|-|+|.|+|.|+.
T Consensus 1012 Tl~~lDrE~k---s~YwltveA~D~gt~ 1036 (4289)
T KOG1219|consen 1012 TLKALDREKK---SSYWLTVEAKDLGTV 1036 (4289)
T ss_pred echhhchhhc---ceEEEEEEEEecCCC
Confidence 9999999999 999999999999975
No 4
>KOG1219|consensus
Probab=100.00 E-value=4e-36 Score=307.45 Aligned_cols=268 Identities=24% Similarity=0.297 Sum_probs=235.5
Q ss_pred CCCCCCceEEeeEEEecCCCCCCCceEEEEEeeCCCCceEEEeCCCCccccccccCCcceEEEecccCCcccccEEEEEE
Q psy2920 2 ENSKSGTGVKGLPILVASSHKHPHQLVRYSITQGNDNHTFALKNPTTHEIRNSTESKSGVLLVTSGDIDREEVAQYSLVI 81 (325)
Q Consensus 2 En~~~Gt~v~~~~a~d~D~~~~~~~~i~y~i~~~~~~~~F~i~~~~g~~~~~~~~~~~~~~l~~~~~LDrE~~~~~~l~v 81 (325)
||+|+||.|+.+.|.|.|++.. +.+.|+|..++..+.|.|+..+| .|++.+.||||....|-|++
T Consensus 964 EnapiGT~vi~i~A~dedsgld--g~l~Y~I~~gdg~g~FsId~~tG-------------~irTl~~lDrE~ks~Ywltv 1028 (4289)
T KOG1219|consen 964 ENAPIGTIVIRIQARDEDSGLD--GELSYKIRTGDGDGIFSIDSTTG-------------SIRTLKALDREKKSSYWLTV 1028 (4289)
T ss_pred ecCCcceEEEEEEEecCCCCcc--ceEEEEEEcCCcceeEEecCCcc-------------eEeechhhchhhcceEEEEE
Confidence 8999999999999999998765 99999999987778999999999 99999999999999999999
Q ss_pred EEEeCCCCC-eeEEEEEEEEecCCCCCCccCcceeEEEEeccCCCCCCCcccccceeEEEEEEEEeCCC---CeEEEEEe
Q psy2920 82 TATDVKGIN-KAETHVTVDVLDENDNSPVFSQKVYRFIVGDLKSSLNSNVSHWKKFSSIGSVSATDADG---DKVAYKLA 157 (325)
Q Consensus 82 ~a~D~~~~~-~~~~~v~v~V~DvNdn~P~f~~~~~~~~v~E~~~~g~~~~~~~~~~~~v~~v~A~D~D~---~~v~Y~i~ 157 (325)
.|+|.|.++ ++.+.+.|.|.|+|||+|+|.++.|..+|.|+.+.+ ..|.++.|+|+|. +++.|.|.
T Consensus 1029 eA~D~gt~~~ssv~~vyI~ieDvNDn~Pq~s~pvy~asI~enSp~~----------vsivq~ea~D~Dsssn~kLmykI~ 1098 (4289)
T KOG1219|consen 1029 EAKDLGTVPLSSVCEVYIEIEDVNDNVPQFSSPVYYASISENSPET----------VSIVQAEANDPDSSSNQKLMYKIT 1098 (4289)
T ss_pred EEEecCCCccccceeEEEEEEecCCCCcccCCceEeeeeccCCCCc----------eEEEEeccCCCCcccCcceEEEEc
Confidence 999998877 788999999999999999999999999999999998 9999999999995 58999999
Q ss_pred cC--CCcEEEeCCceEEEEccC-CCCCCceEEEEEEEEeCCCCCCcceecEEEEEEEecCCCCCceeeeccccCCCCccc
Q psy2920 158 VP--SNLVVIVPQTGELLLTGD-PPSDDVEYELTVEAHDLQSPSRYAREPAQVWLQFNSPVLQDEQLLQMIDRKSEAPHR 234 (325)
Q Consensus 158 ~~--~~~F~i~~~tG~i~l~~~-d~e~~~~y~l~V~a~D~g~p~~~s~~t~~v~I~v~d~~~~P~~~~~~~~~~~~~~~~ 234 (325)
+| .++|.|++.||.|+..+. |||++++|.|.|.+.|.|.|++.+ +..|.|.|+|+|+.+|.|.+.+.
T Consensus 1099 sGnyq~FF~Id~~TG~iTt~r~LDRE~qdEHiLeVTi~D~gep~l~s--~~rviV~IldvNdnsp~Flqk~~-------- 1168 (4289)
T KOG1219|consen 1099 SGNYQGFFQIDPETGLITTIRRLDREKQDEHILEVTIQDNGEPWLCS--NQRVIVSILDVNDNSPRFLQKKT-------- 1168 (4289)
T ss_pred cCCccceEEEccccceeeeehhhcccccccceEEEEEecCCCCcccc--ceEEEEEEeeccCCchhhhhhee--------
Confidence 75 589999999999997777 999999999999999999999999 88999999998888777777532
Q ss_pred eeeeeEeeccC-CceEEEEEeCCCCccceEEEEEEeCCcccceEeeCCCCceEeCCCeeEEEceeCccccCCCCceEEEE
Q psy2920 235 IEKRRVTRAVR-PTKRIDFSETDGEMEGRIVFALEKETEREKFKIRDENPWITVDENGDVRVKKRWDFEELGPEKTIDFW 313 (325)
Q Consensus 235 ~~~~~v~~~~~-~~~~v~a~d~D~~~~~~i~y~i~~~~~~~~F~i~~~~~~f~i~~tG~i~~~~~lD~E~~~~~~~~~l~ 313 (325)
...+++... +.-++.|.|.|+|.|++|+|+|..|++. |.|+. .+| +...+.++-.. .|.+.
T Consensus 1169 --~~~v~~r~s~plyRl~a~d~DeG~narityniedgde~--FsID~--------~t~-vVsss~~~~~~-----eydi~ 1230 (4289)
T KOG1219|consen 1169 --FLRVPERSSPPLYRLAAQDNDEGNNARITYNIEDGDEV--FSIDI--------ATG-VVSSSTLDPAG-----EYDIL 1230 (4289)
T ss_pred --EEEeeeccCCceeEEEEEecCCCcceEEEEecccCceE--EEEec--------cCc-eEEeeeccCCc-----ceeEe
Confidence 234444333 3668999999999999999999999876 99986 577 55556666643 57666
Q ss_pred -EEEecCCCC
Q psy2920 314 -VTINNQATN 322 (325)
Q Consensus 314 -V~a~D~~~~ 322 (325)
|.|+|.|.+
T Consensus 1231 ~Ikatd~g~p 1240 (4289)
T KOG1219|consen 1231 GIKATDRGAP 1240 (4289)
T ss_pred eEEEecCCCC
Confidence 999998865
No 5
>cd00031 CA Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here
Probab=99.98 E-value=5.3e-30 Score=220.88 Aligned_cols=186 Identities=34% Similarity=0.459 Sum_probs=169.5
Q ss_pred CCCCCCCceEEeeEEEecCCCCCCCceEEEEEeeCCCCceEEEeCCCCccccccccCCcceEEEecccCCcccccEEEEE
Q psy2920 1 MENSKSGTGVKGLPILVASSHKHPHQLVRYSITQGNDNHTFALKNPTTHEIRNSTESKSGVLLVTSGDIDREEVAQYSLV 80 (325)
Q Consensus 1 ~En~~~Gt~v~~~~a~d~D~~~~~~~~i~y~i~~~~~~~~F~i~~~~g~~~~~~~~~~~~~~l~~~~~LDrE~~~~~~l~ 80 (325)
+||+++|+.|+++.|.|.|.+. ++.+.|+|.++....+|.|++.+| .|.+.+.||||....|.|.
T Consensus 7 ~En~~~g~~v~~~~a~D~D~~~--~~~~~y~i~~~~~~~~F~i~~~tG-------------~l~~~~~lD~e~~~~~~l~ 71 (199)
T cd00031 7 PENAPPGTVVGTVSATDPDSGE--NGRVTYSILGGNEDGLFSIDPNTG-------------VITTTKPLDREEQSEYTLT 71 (199)
T ss_pred eCCCCCCCEEEEEEEECCCCCC--CceEEEEEeCCCCcccEEEeCCCC-------------EEEECCCCCCcCCceEEEE
Confidence 6999999999999999999876 589999999876557999999998 8999999999999999999
Q ss_pred EEEEeCCCCC-eeEEEEEEEEecCCCCCCccCcceeEEEEeccCCCCCCCcccccceeEEEEEEEEeCCC---CeEEEEE
Q psy2920 81 ITATDVKGIN-KAETHVTVDVLDENDNSPVFSQKVYRFIVGDLKSSLNSNVSHWKKFSSIGSVSATDADG---DKVAYKL 156 (325)
Q Consensus 81 v~a~D~~~~~-~~~~~v~v~V~DvNdn~P~f~~~~~~~~v~E~~~~g~~~~~~~~~~~~v~~v~A~D~D~---~~v~Y~i 156 (325)
|.|+|.+++. ++...+.|.|.|+|||+|.|....|.+.+.|+.++| +.++++.|+|+|. +.+.|+|
T Consensus 72 v~a~D~g~~~~~~~~~v~I~V~d~Nd~~P~~~~~~~~~~v~e~~~~~----------~~i~~~~a~D~D~~~~~~~~y~l 141 (199)
T cd00031 72 VVASDGGGPPLSSTATVTVTVLDVNDNPPVFEQSSYEASVPENAPPG----------TVVGTVTATDADSGENAKLTYSI 141 (199)
T ss_pred EEEEECCcCcceeEEEEEEEEccCCCCCCcccccceEEEEeCCCCCC----------CEEEEEEEEcCCCCCCccEEEEE
Confidence 9999986655 489999999999999999999899999999999998 9999999999997 7899999
Q ss_pred ecCC--CcEEEeCCceEEEEccC-CCCCCceEEEEEEEEeCCCCCCcceecEEEEEEEec
Q psy2920 157 AVPS--NLVVIVPQTGELLLTGD-PPSDDVEYELTVEAHDLQSPSRYAREPAQVWLQFNS 213 (325)
Q Consensus 157 ~~~~--~~F~i~~~tG~i~l~~~-d~e~~~~y~l~V~a~D~g~p~~~s~~t~~v~I~v~d 213 (325)
.++. ++|.|++.+|.|.+.+. |+|....|.|.|.|+|.+.+..++ +++++|.+.|
T Consensus 142 ~~~~~~~~f~i~~~~G~i~~~~~ld~e~~~~~~l~v~a~D~~~~~~~~--~~~i~i~v~d 199 (199)
T cd00031 142 LSGNDKELFSIDPNTGIITLAKPLDREEKSSYELTVVATDGGGPPLSS--TATVTVTVLD 199 (199)
T ss_pred eCCCCCCEEEEeCCceEEEeCCccCCccCceEEEEEEEEECCCCCcee--EEEEEEEEEC
Confidence 9876 79999999999999988 999999999999999999876666 7777777754
No 6
>cd00031 CA Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here
Probab=99.93 E-value=9.5e-24 Score=181.81 Aligned_cols=174 Identities=24% Similarity=0.306 Sum_probs=148.9
Q ss_pred eEEEEeccCCCCCCCcccccceeEEEEEEEEeCCCC---eEEEEEecCC--CcEEEeCCceEEEEccC-CCCCCceEEEE
Q psy2920 115 YRFIVGDLKSSLNSNVSHWKKFSSIGSVSATDADGD---KVAYKLAVPS--NLVVIVPQTGELLLTGD-PPSDDVEYELT 188 (325)
Q Consensus 115 ~~~~v~E~~~~g~~~~~~~~~~~~v~~v~A~D~D~~---~v~Y~i~~~~--~~F~i~~~tG~i~l~~~-d~e~~~~y~l~ 188 (325)
|.+.+.|+.+.| +.++++.|.|+|.+ .+.|+|.++. .+|.|++.+|.|++.+. |||....|.|.
T Consensus 2 ~~~~i~En~~~g----------~~v~~~~a~D~D~~~~~~~~y~i~~~~~~~~F~i~~~tG~l~~~~~lD~e~~~~~~l~ 71 (199)
T cd00031 2 YSVSVPENAPPG----------TVVGTVSATDPDSGENGRVTYSILGGNEDGLFSIDPNTGVITTTKPLDREEQSEYTLT 71 (199)
T ss_pred eEEEEeCCCCCC----------CEEEEEEEECCCCCCCceEEEEEeCCCCcccEEEeCCCCEEEECCCCCCcCCceEEEE
Confidence 788999999998 99999999999984 7999999865 59999999999999998 99999999999
Q ss_pred EEEEeCCCCCCcceecEEEEEEEecCCCCCceeeeccccCCCCccceeeeeEeeccCC---ceEEEEEeCCCCccceEEE
Q psy2920 189 VEAHDLQSPSRYAREPAQVWLQFNSPVLQDEQLLQMIDRKSEAPHRIEKRRVTRAVRP---TKRIDFSETDGEMEGRIVF 265 (325)
Q Consensus 189 V~a~D~g~p~~~s~~t~~v~I~v~d~~~~P~~~~~~~~~~~~~~~~~~~~~v~~~~~~---~~~v~a~d~D~~~~~~i~y 265 (325)
|+|+|+|.|..++ ++.|+|.|.|+|+.||.|... .|. ..+.++.++ +..+.|+|.|.+.++.++|
T Consensus 72 v~a~D~g~~~~~~--~~~v~I~V~d~Nd~~P~~~~~---~~~-------~~v~e~~~~~~~i~~~~a~D~D~~~~~~~~y 139 (199)
T cd00031 72 VVASDGGGPPLSS--TATVTVTVLDVNDNPPVFEQS---SYE-------ASVPENAPPGTVVGTVTATDADSGENAKLTY 139 (199)
T ss_pred EEEEECCcCccee--EEEEEEEEccCCCCCCccccc---ceE-------EEEeCCCCCCCEEEEEEEEcCCCCCCccEEE
Confidence 9999988877655 889999999988887766642 233 344444322 5678999999988899999
Q ss_pred EEEeCCcccceEeeCCCCceEeCCCeeEEEceeCccccCCCCceEEEEEEEecCCC
Q psy2920 266 ALEKETEREKFKIRDENPWITVDENGDVRVKKRWDFEELGPEKTIDFWVTINNQAT 321 (325)
Q Consensus 266 ~i~~~~~~~~F~i~~~~~~f~i~~tG~i~~~~~lD~E~~~~~~~~~l~V~a~D~~~ 321 (325)
+|.++.....|.|+. .+|.|++.+.||||.. ..|+|.|.|+|.+.
T Consensus 140 ~l~~~~~~~~f~i~~--------~~G~i~~~~~ld~e~~---~~~~l~v~a~D~~~ 184 (199)
T cd00031 140 SILSGNDKELFSIDP--------NTGIITLAKPLDREEK---SSYELTVVATDGGG 184 (199)
T ss_pred EEeCCCCCCEEEEeC--------CceEEEeCCccCCccC---ceEEEEEEEEECCC
Confidence 999987656788876 5999999999999998 89999999999974
No 7
>PF00028 Cadherin: Cadherin domain; InterPro: IPR002126 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion.; GO: 0005509 calcium ion binding, 0007156 homophilic cell adhesion, 0016020 membrane; PDB: 2A4E_A 2A4C_B 2O72_A 2QVI_A 1NCJ_A 3Q2W_A 3Q2N_A 3LNH_B 3LNI_A 3Q2L_A ....
Probab=99.65 E-value=3.8e-15 Score=112.60 Aligned_cols=86 Identities=31% Similarity=0.438 Sum_probs=78.8
Q ss_pred eEEEEeccCCCCCCCcccccceeEEEEEEEEeCCC---CeEEEEEecCC--CcEEEeCCceEEEEccC-CCCCCceEEEE
Q psy2920 115 YRFIVGDLKSSLNSNVSHWKKFSSIGSVSATDADG---DKVAYKLAVPS--NLVVIVPQTGELLLTGD-PPSDDVEYELT 188 (325)
Q Consensus 115 ~~~~v~E~~~~g~~~~~~~~~~~~v~~v~A~D~D~---~~v~Y~i~~~~--~~F~i~~~tG~i~l~~~-d~e~~~~y~l~ 188 (325)
|.++|+|+.++| +.++++.|.|+|. +.+.|+|..+. ++|.|++.+|.|++.+. |||....|.|.
T Consensus 1 Y~~~v~E~~~~g----------~~v~~v~a~D~D~~~n~~i~y~i~~~~~~~~F~I~~~tg~i~~~~~LD~E~~~~y~l~ 70 (93)
T PF00028_consen 1 YSFSVPENAPPG----------TVVGQVTATDPDSGPNSQITYSILGGNPDGLFSIDPNTGEISLKKPLDRETQSSYQLT 70 (93)
T ss_dssp EEEEEETTGSTS----------SEEEEEEEEESSTSTTSSEEEEEEETTSTTSEEEETTTTEEEESSSSCTTTTSEEEEE
T ss_pred CEEEEECCCCCC----------CEEEEEEEEeCCCCCCceEEEEEecCcccCceEEeeeeeccccceecCcccCCEEEEE
Confidence 789999999999 9999999999995 58999999865 89999999999999999 99999999999
Q ss_pred EEEEeC-CCCCCcceecEEEEEEEe
Q psy2920 189 VEAHDL-QSPSRYAREPAQVWLQFN 212 (325)
Q Consensus 189 V~a~D~-g~p~~~s~~t~~v~I~v~ 212 (325)
|.|+|. +.|++++ +++|+|+|+
T Consensus 71 v~a~D~~~~~~~~~--~~~V~I~V~ 93 (93)
T PF00028_consen 71 VRATDSGGSPPLSS--TATVTINVL 93 (93)
T ss_dssp EEEEETTTSSEEEE--EEEEEEEEE
T ss_pred EEEEECCCCCCCEE--EEEEEEEEC
Confidence 999999 7888888 777877763
No 8
>KOG1834|consensus
Probab=99.64 E-value=2.5e-14 Score=136.12 Aligned_cols=174 Identities=22% Similarity=0.231 Sum_probs=138.9
Q ss_pred eEEEecCCCCCCCceE-EEEEeeCCCC-ceEEEeCCCCccccccccCCcceEEEecccCCcccccEEEEEEEEEeCCCCC
Q psy2920 13 LPILVASSHKHPHQLV-RYSITQGNDN-HTFALKNPTTHEIRNSTESKSGVLLVTSGDIDREEVAQYSLVITATDVKGIN 90 (325)
Q Consensus 13 ~~a~d~D~~~~~~~~i-~y~i~~~~~~-~~F~i~~~~g~~~~~~~~~~~~~~l~~~~~LDrE~~~~~~l~v~a~D~~~~~ 90 (325)
+-|.|.|.+-..-+.| .|.|.+.+.. +.--+|..+|+ ..|+...+||.|.++.|+|+|+|.|.|..+
T Consensus 54 l~aLdkdaplr~ageiC~fklhgq~vPFdavVvdK~TGe-----------gvlRaK~~lDCelqkeytf~iQAydCg~gp 122 (952)
T KOG1834|consen 54 LAALDKDAPLRYAGEICGFKLHGQPVPFDAVVVDKYTGE-----------GVLRAKEPLDCELQKEYTFTIQAYDCGNGP 122 (952)
T ss_pred eeeecCCCCcccccccceeEecCCCCCceEEEEeccCCc-----------eEEeecCcccccccccceEEEEEEecCCCC
Confidence 5566777653222344 7788765432 23345677762 279999999999999999999999976433
Q ss_pred -------eeEEEEEEEEecCCCCCCccCcceeEEEEeccCCCCCCCcccccceeEEEEEEEEeCCC----C-eEEEEEec
Q psy2920 91 -------KAETHVTVDVLDENDNSPVFSQKVYRFIVGDLKSSLNSNVSHWKKFSSIGSVSATDADG----D-KVAYKLAV 158 (325)
Q Consensus 91 -------~~~~~v~v~V~DvNdn~P~f~~~~~~~~v~E~~~~g~~~~~~~~~~~~v~~v~A~D~D~----~-~v~Y~i~~ 158 (325)
+..++|.|+|.|+|+.+|+|..+.|.+.|.|.+.- ..|++|.|.|.|- + ...|.|..
T Consensus 123 dgtn~kKShkatvhIrVkDvNe~AP~f~ep~Yka~V~EGK~y-----------d~il~veAiD~DCspq~sqIC~YEI~t 191 (952)
T KOG1834|consen 123 DGTNTKKSHKATVHIRVKDVNEFAPVFKEPWYKAHVTEGKVY-----------DSILRVEAIDKDCSPQYSQICEYEITT 191 (952)
T ss_pred CccccccccceEEEEEeccccccCchhcccceeeEEecceee-----------eeeEEEEeecCCCCCcccceeEEEecC
Confidence 35689999999999999999999999999998776 6899999999996 2 46899999
Q ss_pred CCCcEEEeCCceEEEEccC-CCCCCceEEEEEEEEeCCCCCCcceecEEEEEEE
Q psy2920 159 PSNLVVIVPQTGELLLTGD-PPSDDVEYELTVEAHDLQSPSRYAREPAQVWLQF 211 (325)
Q Consensus 159 ~~~~F~i~~~tG~i~l~~~-d~e~~~~y~l~V~a~D~g~p~~~s~~t~~v~I~v 211 (325)
+.-.|.|| +.|.|+.+.+ .|....+|.|+|.|.|.|.-...+ .+.|+|.|
T Consensus 192 ~d~PFaId-n~G~irnTekLny~ke~~Y~ltVtAyDCg~kraa~--d~lV~v~V 242 (952)
T KOG1834|consen 192 PDVPFAID-NDGNIRNTEKLNYTKEHQYKLTVTAYDCGKKRAAS--DSLVTVHV 242 (952)
T ss_pred CCCceEEc-CCCccccccccccccceeEEEEEEEEecccccccC--cceEEEEe
Confidence 88899996 7899999888 898999999999999999744333 56777777
No 9
>PF00028 Cadherin: Cadherin domain; InterPro: IPR002126 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion.; GO: 0005509 calcium ion binding, 0007156 homophilic cell adhesion, 0016020 membrane; PDB: 2A4E_A 2A4C_B 2O72_A 2QVI_A 1NCJ_A 3Q2W_A 3Q2N_A 3LNH_B 3LNI_A 3Q2L_A ....
Probab=99.63 E-value=5.9e-15 Score=111.56 Aligned_cols=86 Identities=29% Similarity=0.375 Sum_probs=77.8
Q ss_pred CCCCCCCceEEeeEEEecCCCCCCCceEEEEEeeCCCCceEEEeCCCCccccccccCCcceEEEecccCCcccccEEEEE
Q psy2920 1 MENSKSGTGVKGLPILVASSHKHPHQLVRYSITQGNDNHTFALKNPTTHEIRNSTESKSGVLLVTSGDIDREEVAQYSLV 80 (325)
Q Consensus 1 ~En~~~Gt~v~~~~a~d~D~~~~~~~~i~y~i~~~~~~~~F~i~~~~g~~~~~~~~~~~~~~l~~~~~LDrE~~~~~~l~ 80 (325)
+|++++|+.|+++.|.|.|.+. |+.+.|+|.++++...|.|++.+| .|.+.++||||....|.|.
T Consensus 6 ~E~~~~g~~v~~v~a~D~D~~~--n~~i~y~i~~~~~~~~F~I~~~tg-------------~i~~~~~LD~E~~~~y~l~ 70 (93)
T PF00028_consen 6 PENAPPGTVVGQVTATDPDSGP--NSQITYSILGGNPDGLFSIDPNTG-------------EISLKKPLDRETQSSYQLT 70 (93)
T ss_dssp ETTGSTSSEEEEEEEEESSTST--TSSEEEEEEETTSTTSEEEETTTT-------------EEEESSSSCTTTTSEEEEE
T ss_pred ECCCCCCCEEEEEEEEeCCCCC--CceEEEEEecCcccCceEEeeeee-------------ccccceecCcccCCEEEEE
Confidence 6999999999999999999554 599999999988778999999999 8999999999999999999
Q ss_pred EEEEeC-CCCC-eeEEEEEEEEe
Q psy2920 81 ITATDV-KGIN-KAETHVTVDVL 101 (325)
Q Consensus 81 v~a~D~-~~~~-~~~~~v~v~V~ 101 (325)
|.|+|. +.++ ++.+.|+|+|+
T Consensus 71 v~a~D~~~~~~~~~~~~V~I~V~ 93 (93)
T PF00028_consen 71 VRATDSGGSPPLSSTATVTINVL 93 (93)
T ss_dssp EEEEETTTSSEEEEEEEEEEEEE
T ss_pred EEEEECCCCCCCEEEEEEEEEEC
Confidence 999999 5555 67888888874
No 10
>smart00112 CA Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
Probab=99.58 E-value=2.2e-14 Score=105.04 Aligned_cols=77 Identities=40% Similarity=0.551 Sum_probs=68.3
Q ss_pred ecCCCCCCCceEEEEEeeCCCCceEEEeCCCCccccccccCCcceEEEecccCCcccccEEEEEEEEEeCCCCC-eeEEE
Q psy2920 17 VASSHKHPHQLVRYSITQGNDNHTFALKNPTTHEIRNSTESKSGVLLVTSGDIDREEVAQYSLVITATDVKGIN-KAETH 95 (325)
Q Consensus 17 d~D~~~~~~~~i~y~i~~~~~~~~F~i~~~~g~~~~~~~~~~~~~~l~~~~~LDrE~~~~~~l~v~a~D~~~~~-~~~~~ 95 (325)
|.|.+ .|+.+.|+|.++.....|.|++.+| .|.+.++||||....|.|.|.|.|.+++. ++.+.
T Consensus 2 D~D~g--~n~~i~Y~i~~~~~~~~F~i~~~tg-------------~i~~~~~LD~e~~~~y~l~v~a~D~~~~~~~~~~~ 66 (79)
T smart00112 2 DADSG--ENGKVTYSILSGNEDGLFSIDPETG-------------EITTTKPLDREEQPEYTLTVEATDGGGPPLSSTAT 66 (79)
T ss_pred CCCCC--cCcEEEEEEecCCCCCEEEEeCCcc-------------EEEeCCccCeeCCCeEEEEEEEEECCCCCcccEEE
Confidence 45554 4688999999876558999999998 89999999999999999999999998765 78899
Q ss_pred EEEEEecCCCCCC
Q psy2920 96 VTVDVLDENDNSP 108 (325)
Q Consensus 96 v~v~V~DvNdn~P 108 (325)
|.|.|.|+|||+|
T Consensus 67 v~I~V~D~Nd~~P 79 (79)
T smart00112 67 VTVTVLDVNDNAP 79 (79)
T ss_pred EEEEEEECCCCCC
Confidence 9999999999998
No 11
>KOG1834|consensus
Probab=99.53 E-value=3.5e-13 Score=128.44 Aligned_cols=191 Identities=16% Similarity=0.154 Sum_probs=137.3
Q ss_pred EEEecCCCCCCccCcceeEEEEeccCCCCCCCcccccceeEEEEEEEEeCCC-----C-eEEEEEecCCCcEEE---eCC
Q psy2920 98 VDVLDENDNSPVFSQKVYRFIVGDLKSSLNSNVSHWKKFSSIGSVSATDADG-----D-KVAYKLAVPSNLVVI---VPQ 168 (325)
Q Consensus 98 v~V~DvNdn~P~f~~~~~~~~v~E~~~~g~~~~~~~~~~~~v~~v~A~D~D~-----~-~v~Y~i~~~~~~F~i---~~~ 168 (325)
....-+|-+.|+. ...|...|.||-..- ...-.+.|-|.|. | ..-|.|.+..-.|.+ |+.
T Consensus 21 ~~aarankhkpwi-e~ey~gvV~Endntv----------ll~Ppl~aLdkdaplr~ageiC~fklhgq~vPFdavVvdK~ 89 (952)
T KOG1834|consen 21 HHAARANKHKPWI-EEEYHGVVTENDNTV----------LLDPPLAALDKDAPLRYAGEICGFKLHGQPVPFDAVVVDKY 89 (952)
T ss_pred cccccccccCccc-ccceeEEEEeCCceE----------EeCCCeeeecCCCCcccccccceeEecCCCCCceEEEEecc
Confidence 4455677788865 468999999986532 2233588999997 3 467888886556755 788
Q ss_pred ceE--EEEccC-CCCCCceEEEEEEEEeCCCCC-Cc---ceecEEEEEEEecCCCCCceeeeccccCCCCccceeeeeEe
Q psy2920 169 TGE--LLLTGD-PPSDDVEYELTVEAHDLQSPS-RY---AREPAQVWLQFNSPVLQDEQLLQMIDRKSEAPHRIEKRRVT 241 (325)
Q Consensus 169 tG~--i~l~~~-d~e~~~~y~l~V~a~D~g~p~-~~---s~~t~~v~I~v~d~~~~P~~~~~~~~~~~~~~~~~~~~~v~ 241 (325)
||+ |+.+.+ |.|.+..|+++|+|.|.|.-+ .. .+.-++|+|+|.|+|+..|+|..+ .|.+-+.| -.+.
T Consensus 90 TGegvlRaK~~lDCelqkeytf~iQAydCg~gpdgtn~kKShkatvhIrVkDvNe~AP~f~ep---~Yka~V~E--GK~y 164 (952)
T KOG1834|consen 90 TGEGVLRAKEPLDCELQKEYTFTIQAYDCGNGPDGTNTKKSHKATVHIRVKDVNEFAPVFKEP---WYKAHVTE--GKVY 164 (952)
T ss_pred CCceEEeecCcccccccccceEEEEEEecCCCCCccccccccceEEEEEeccccccCchhccc---ceeeEEec--ceee
Confidence 876 555555 999999999999999998533 11 123689999999977666657665 45555554 1222
Q ss_pred eccCCceEEEEEeCCCCc-cceE-EEEEEeCCcccceEeeCCCCceEeCCCeeEEEceeCccccCCCCceEEEEEEEecC
Q psy2920 242 RAVRPTKRIDFSETDGEM-EGRI-VFALEKETEREKFKIRDENPWITVDENGDVRVKKRWDFEELGPEKTIDFWVTINNQ 319 (325)
Q Consensus 242 ~~~~~~~~v~a~d~D~~~-~~~i-~y~i~~~~~~~~F~i~~~~~~f~i~~tG~i~~~~~lD~E~~~~~~~~~l~V~a~D~ 319 (325)
.. +.++.|+|.|.++ ++.| .|.|. +.+-+|.|++.|.|+.+.+|.|... +.|.|+|+|-|+
T Consensus 165 d~---il~veAiD~DCspq~sqIC~YEI~-----------t~d~PFaIdn~G~irnTekLny~ke---~~Y~ltVtAyDC 227 (952)
T KOG1834|consen 165 DS---ILRVEAIDKDCSPQYSQICEYEIT-----------TPDVPFAIDNDGNIRNTEKLNYTKE---HQYKLTVTAYDC 227 (952)
T ss_pred ee---eEEEEeecCCCCCcccceeEEEec-----------CCCCceEEcCCCccccccccccccc---eeEEEEEEEEec
Confidence 22 4789999999765 4444 45554 3334666678999999999999998 999999999998
Q ss_pred CC
Q psy2920 320 AT 321 (325)
Q Consensus 320 ~~ 321 (325)
|-
T Consensus 228 g~ 229 (952)
T KOG1834|consen 228 GK 229 (952)
T ss_pred cc
Confidence 74
No 12
>smart00112 CA Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
Probab=99.39 E-value=3.5e-12 Score=93.25 Aligned_cols=72 Identities=33% Similarity=0.374 Sum_probs=64.1
Q ss_pred eCCC---CeEEEEEecCC--CcEEEeCCceEEEEccC-CCCCCceEEEEEEEEeCCCCCCcceecEEEEEEEecCCCCCc
Q psy2920 146 DADG---DKVAYKLAVPS--NLVVIVPQTGELLLTGD-PPSDDVEYELTVEAHDLQSPSRYAREPAQVWLQFNSPVLQDE 219 (325)
Q Consensus 146 D~D~---~~v~Y~i~~~~--~~F~i~~~tG~i~l~~~-d~e~~~~y~l~V~a~D~g~p~~~s~~t~~v~I~v~d~~~~P~ 219 (325)
|+|. +.+.|+|.++. .+|+|++.+|.|++.+. |||....|.|.|+|+|.+.|++++ +++|+|+|.|+|++||
T Consensus 2 D~D~g~n~~i~Y~i~~~~~~~~F~i~~~tg~i~~~~~LD~e~~~~y~l~v~a~D~~~~~~~~--~~~v~I~V~D~Nd~~P 79 (79)
T smart00112 2 DADSGENGKVTYSILSGNEDGLFSIDPETGEITTTKPLDREEQPEYTLTVEATDGGGPPLSS--TATVTVTVLDVNDNAP 79 (79)
T ss_pred CCCCCcCcEEEEEEecCCCCCEEEEeCCccEEEeCCccCeeCCCeEEEEEEEEECCCCCccc--EEEEEEEEEECCCCCC
Confidence 6676 47999999865 79999999999988888 999999999999999999987777 8999999999888775
No 13
>PF08266 Cadherin_2: Cadherin-like; InterPro: IPR013164 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion. This entry represents a cadherin domain that is usually found at the N terminus of cadherin proteins.; PDB: 1WUZ_A 1WYJ_A.
Probab=97.23 E-value=0.00019 Score=52.74 Aligned_cols=61 Identities=21% Similarity=0.251 Sum_probs=36.5
Q ss_pred CCCCCCCceEEeeEEEecCCCCCCCceEEEEEeeCCCCceEEEeCCCCccccccccCCcceEEEecccCCccccc
Q psy2920 1 MENSKSGTGVKGLPILVASSHKHPHQLVRYSITQGNDNHTFALKNPTTHEIRNSTESKSGVLLVTSGDIDREEVA 75 (325)
Q Consensus 1 ~En~~~Gt~v~~~~a~d~D~~~~~~~~i~y~i~~~~~~~~F~i~~~~g~~~~~~~~~~~~~~l~~~~~LDrE~~~ 75 (325)
+|..++||.||.|. .|.-.+........++|.+.....+|.++..+| .|.+...+|||.-+
T Consensus 8 ~EE~~~Gt~IGnia-~dL~l~~~~l~~~~~ri~s~~~~~~~~v~~~tG-------------~L~v~~rIDRE~LC 68 (84)
T PF08266_consen 8 PEEMPPGTVIGNIA-KDLGLDPQSLSSRNFRIVSEGNSQYFRVNEKTG-------------DLFVSERIDREELC 68 (84)
T ss_dssp ESS--TT-EEEECC-CCCT--HHHHCCTTBEEE-SSSS-SEEE-TTTS-------------EEEESS--SCCCC-
T ss_pred ecCCCCCCEEEEhH-HhhCCCcccccccceEEeecCCcceeEecCCce-------------eEEeCCccCHHHHC
Confidence 58999999999983 333221111123467777766678999999999 99999999999753
No 14
>PF08758 Cadherin_pro: Cadherin prodomain like; InterPro: IPR014868 Cadherins are a group of proteins that mediate calcium dependent cell-cell adhesion. They are activated through cleavage of a prosequence in the late Golgi. This protein corresponds to the folded region of the prosequence, and is termed the prodomain. The prodomain shows structural resemblance to the cadherin domain, but lacks all the features known to be important for cadherin-cadherin interactions []. ; GO: 0007155 cell adhesion, 0016021 integral to membrane; PDB: 1OP4_A.
Probab=96.53 E-value=0.02 Score=42.59 Aligned_cols=78 Identities=26% Similarity=0.338 Sum_probs=41.9
Q ss_pred CCCccCcceeEEEEeccCCCCCCCcccccceeEEEEEEEEeCCCC-eEEEEEecCCCcEEEeCCceEEEEccCCCCCCce
Q psy2920 106 NSPVFSQKVYRFIVGDLKSSLNSNVSHWKKFSSIGSVSATDADGD-KVAYKLAVPSNLVVIVPQTGELLLTGDPPSDDVE 184 (325)
Q Consensus 106 n~P~f~~~~~~~~v~E~~~~g~~~~~~~~~~~~v~~v~A~D~D~~-~v~Y~i~~~~~~F~i~~~tG~i~l~~~d~e~~~~ 184 (325)
..|-|....|.+.|+.+...| ..|++|.-.|-..+ .+.|.-.++ .|+|. ..|.|+++..-......
T Consensus 2 C~pGF~~~~~~~~Vp~~l~~g----------~~lg~V~f~dC~~~~~~~~~ssDp--dF~V~-~DGsVy~~r~v~l~~~~ 68 (90)
T PF08758_consen 2 CRPGFSQKKYTFEVPSNLEAG----------QPLGKVNFEDCTGRRRVIFESSDP--DFRVL-EDGSVYAKRPVQLSSEQ 68 (90)
T ss_dssp ---B--S-EEEE----SS-SS------------EEE---B--SS---EEEE---S--EEEEE-TTTEEEEES--S-SSS-
T ss_pred CcCCcccceEEEEcCchhhCC----------cEEEEEEeccCCCCCceEEecCCC--CEEEc-CCCeEEEeeeEecCCCc
Confidence 358899999999999998888 99999999999774 688776654 89996 68999998882222333
Q ss_pred EEEEEEEEeCCC
Q psy2920 185 YELTVEAHDLQS 196 (325)
Q Consensus 185 y~l~V~a~D~g~ 196 (325)
-.+.|.|+|..+
T Consensus 69 ~~F~V~a~D~~~ 80 (90)
T PF08758_consen 69 RSFTVHAWDSQT 80 (90)
T ss_dssp EEEEEEEEETTT
T ss_pred eEEEEEEECCCC
Confidence 579999999775
No 15
>smart00736 CADG Dystroglycan-type cadherin-like domains. Cadherin-homologous domains present in metazoan dystroglycans and alpha/epsilon sarcoglycans, yeast Axl2p and in a very large protein from magnetotactic bacteria. Likely to bind calcium ions.
Probab=96.06 E-value=0.13 Score=38.80 Aligned_cols=68 Identities=26% Similarity=0.240 Sum_probs=50.4
Q ss_pred EEeCCCCeEEEEEecC-----CCcEEEeCCceEEEEccCCCCCCceEEEEEEEEeCCCCCCcceecEEEEEEEecCCC
Q psy2920 144 ATDADGDKVAYKLAVP-----SNLVVIVPQTGELLLTGDPPSDDVEYELTVEAHDLQSPSRYAREPAQVWLQFNSPVL 216 (325)
Q Consensus 144 A~D~D~~~v~Y~i~~~-----~~~F~i~~~tG~i~l~~~d~e~~~~y~l~V~a~D~g~p~~~s~~t~~v~I~v~d~~~ 216 (325)
..|+|...++|++... ..|...++.++.+.=... .+....|.+.|.|+|+.+ .+. ...+.|.|.+.++
T Consensus 24 F~d~d~~~lty~~~~~~~~~lP~Wl~fd~~~~~~~GtP~-~~~~g~~~i~v~a~D~~g--~~~--~~~f~i~V~~~~~ 96 (97)
T smart00736 24 FTDADGDTLTYSATLSDGSALPSWLSFDSDTGTLSGTPT-NSDVGSLSLKVTATDSSG--ASA--SDTFTITVVNTND 96 (97)
T ss_pred eECCCCCeEEEEEEeCCCCCCCCeEEEeCCCCEEEEECC-CCCCcEEEEEEEEEECCC--CEE--EEEEEEEEeCCCC
Confidence 4688877999999742 358999999988877544 434577999999999876 333 5678888876554
No 16
>PF08266 Cadherin_2: Cadherin-like; InterPro: IPR013164 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion. This entry represents a cadherin domain that is usually found at the N terminus of cadherin proteins.; PDB: 1WUZ_A 1WYJ_A.
Probab=96.03 E-value=0.018 Score=42.28 Aligned_cols=57 Identities=18% Similarity=0.297 Sum_probs=37.8
Q ss_pred eeEEEEeccCCCCCCCcccccceeEEEEEEEEeCCC-----CeEEEEEec--CCCcEEEeCCceEEEEccC-CCCC
Q psy2920 114 VYRFIVGDLKSSLNSNVSHWKKFSSIGSVSATDADG-----DKVAYKLAV--PSNLVVIVPQTGELLLTGD-PPSD 181 (325)
Q Consensus 114 ~~~~~v~E~~~~g~~~~~~~~~~~~v~~v~A~D~D~-----~~v~Y~i~~--~~~~F~i~~~tG~i~l~~~-d~e~ 181 (325)
+..++|+|..++| +.|+.+ |.|.-. ..-.|+|.+ ...+|.++..+|.|++... |||.
T Consensus 2 qi~YsV~EE~~~G----------t~IGni-a~dL~l~~~~l~~~~~ri~s~~~~~~~~v~~~tG~L~v~~rIDRE~ 66 (84)
T PF08266_consen 2 QIRYSVPEEMPPG----------TVIGNI-AKDLGLDPQSLSSRNFRIVSEGNSQYFRVNEKTGDLFVSERIDREE 66 (84)
T ss_dssp EEEEEEESS--TT-----------EEEEC-CCCCT--HHHHCCTTBEEE-SSSS-SEEE-TTTSEEEESS--SCCC
T ss_pred CeEEEeecCCCCC----------CEEEEh-HHhhCCCcccccccceEEeecCCcceeEecCCceeEEeCCccCHHH
Confidence 3467899999999 999998 444432 134566665 4579999999999999988 9986
No 17
>smart00736 CADG Dystroglycan-type cadherin-like domains. Cadherin-homologous domains present in metazoan dystroglycans and alpha/epsilon sarcoglycans, yeast Axl2p and in a very large protein from magnetotactic bacteria. Likely to bind calcium ions.
Probab=94.44 E-value=0.83 Score=34.27 Aligned_cols=64 Identities=28% Similarity=0.360 Sum_probs=48.1
Q ss_pred ceEEEEEeeCC---CCceEEEeCCCCccccccccCCcceEEEecccCCcccccEEEEEEEEEeCCCCCeeEEEEEEEEec
Q psy2920 26 QLVRYSITQGN---DNHTFALKNPTTHEIRNSTESKSGVLLVTSGDIDREEVAQYSLVITATDVKGINKAETHVTVDVLD 102 (325)
Q Consensus 26 ~~i~y~i~~~~---~~~~F~i~~~~g~~~~~~~~~~~~~~l~~~~~LDrE~~~~~~l~v~a~D~~~~~~~~~~v~v~V~D 102 (325)
..++|++...+ .+.+.++++.++ .+.- .+.. +....|.+++.|+|..| .+....++|.|.+
T Consensus 30 ~~lty~~~~~~~~~lP~Wl~fd~~~~-------------~~~G-tP~~-~~~g~~~i~v~a~D~~g-~~~~~~f~i~V~~ 93 (97)
T smart00736 30 DTLTYSATLSDGSALPSWLSFDSDTG-------------TLSG-TPTN-SDVGSLSLKVTATDSSG-ASASDTFTITVVN 93 (97)
T ss_pred CeEEEEEEeCCCCCCCCeEEEeCCCC-------------EEEE-ECCC-CCCcEEEEEEEEEECCC-CEEEEEEEEEEeC
Confidence 67999987543 246889998876 5554 3333 33567999999999866 5677889999999
Q ss_pred CCC
Q psy2920 103 END 105 (325)
Q Consensus 103 vNd 105 (325)
.|+
T Consensus 94 ~~~ 96 (97)
T smart00736 94 TND 96 (97)
T ss_pred CCC
Confidence 987
No 18
>PF13750 Big_3_3: Bacterial Ig-like domain (group 3)
Probab=93.74 E-value=2.8 Score=34.60 Aligned_cols=122 Identities=16% Similarity=0.140 Sum_probs=60.3
Q ss_pred CCceEEEEE-EEEeCCCCCCcceecEEEEEEEecCCCCCceeeeccccCCCCccceeeeeEeeccCCceEEEEEeCCCCc
Q psy2920 181 DDVEYELTV-EAHDLQSPSRYAREPAQVWLQFNSPVLQDEQLLQMIDRKSEAPHRIEKRRVTRAVRPTKRIDFSETDGEM 259 (325)
Q Consensus 181 ~~~~y~l~V-~a~D~g~p~~~s~~t~~v~I~v~d~~~~P~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~v~a~d~D~~~ 259 (325)
..+.|.|.+ .|.|..+-. + +..+..++ .+|..||.+.-..... . .+...+.... ...+.++|.-.+.
T Consensus 13 ~dG~Y~l~~~~a~D~agN~--~--~~~~~~~~-~iD~T~Ptisi~~~~~----~-~~g~~v~~~~--~i~i~~tD~~~~~ 80 (158)
T PF13750_consen 13 PDGSYTLTVVTATDAAGNT--S--TSTVSETF-TIDNTPPTISISDGAS----V-ANGSTVYGLV--NISINVTDNSDDS 80 (158)
T ss_pred CCccEEEEEEEEEecCCCE--E--EEEEeeEE-EEcCCCCEEEEecCCc----c-CCCcccccee--eeEEEEEeCCCCc
Confidence 347899999 799977632 2 22333333 2466666443311100 0 0011111111 1235666655443
Q ss_pred cceEEEEEEeCCcccceEeeCCCCceEeCCCeeEEEc--eeC-ccccCCCCceEEEEEEEecCCCCC
Q psy2920 260 EGRIVFALEKETEREKFKIRDENPWITVDENGDVRVK--KRW-DFEELGPEKTIDFWVTINNQATNG 323 (325)
Q Consensus 260 ~~~i~y~i~~~~~~~~F~i~~~~~~f~i~~tG~i~~~--~~l-D~E~~~~~~~~~l~V~a~D~~~~~ 323 (325)
.-...+|.+|.-...-.+... .. ..|...+. +.+ ..|.. ..|.|+|.|+|.+++.
T Consensus 81 -~i~sv~l~Gg~~~d~v~ls~~----~~-~~~~~~~~yp~~fpsle~~---~~YtLtV~a~D~aGN~ 138 (158)
T PF13750_consen 81 -KITSVSLTGGPASDSVSLSWT----NK-GNGVYTLEYPRIFPSLEAD---DSYTLTVSATDKAGNQ 138 (158)
T ss_pred -eEEEEEEECCcccceEEEeeE----ec-cCceEEeecccccCCcCCC---CeEEEEEEEEecCCCE
Confidence 445678877653332222210 01 23433332 111 12455 7999999999998875
No 19
>PF08758 Cadherin_pro: Cadherin prodomain like; InterPro: IPR014868 Cadherins are a group of proteins that mediate calcium dependent cell-cell adhesion. They are activated through cleavage of a prosequence in the late Golgi. This protein corresponds to the folded region of the prosequence, and is termed the prodomain. The prodomain shows structural resemblance to the cadherin domain, but lacks all the features known to be important for cadherin-cadherin interactions []. ; GO: 0007155 cell adhesion, 0016021 integral to membrane; PDB: 1OP4_A.
Probab=91.96 E-value=2.2 Score=31.65 Aligned_cols=66 Identities=11% Similarity=0.103 Sum_probs=34.9
Q ss_pred CCCCCCCceEEeeEEEecCCCCCCCceEEEEEeeCCCCceEEEeCCCCccccccccCCcceEEEecccCCcccccEEEEE
Q psy2920 1 MENSKSGTGVKGLPILVASSHKHPHQLVRYSITQGNDNHTFALKNPTTHEIRNSTESKSGVLLVTSGDIDREEVAQYSLV 80 (325)
Q Consensus 1 ~En~~~Gt~v~~~~a~d~D~~~~~~~~i~y~i~~~~~~~~F~i~~~~g~~~~~~~~~~~~~~l~~~~~LDrE~~~~~~l~ 80 (325)
|.+...|..|+.+...|-... ..+.|.-.. ..|+|...+ .|.+.+++...... -.+.
T Consensus 16 p~~l~~g~~lg~V~f~dC~~~----~~~~~~ssD----pdF~V~~DG--------------sVy~~r~v~l~~~~-~~F~ 72 (90)
T PF08758_consen 16 PSNLEAGQPLGKVNFEDCTGR----RRVIFESSD----PDFRVLEDG--------------SVYAKRPVQLSSEQ-RSFT 72 (90)
T ss_dssp --SS-SS--EEE---B--SS-------EEEE-------SEEEEETTT--------------EEEEES--S-SSS--EEEE
T ss_pred CchhhCCcEEEEEEeccCCCC----CceEEecCC----CCEEEcCCC--------------eEEEeeeEecCCCc-eEEE
Confidence 455678999999988777432 457776643 379999865 78999988765443 4789
Q ss_pred EEEEeCCCC
Q psy2920 81 ITATDVKGI 89 (325)
Q Consensus 81 v~a~D~~~~ 89 (325)
|.|.|..+.
T Consensus 73 V~a~D~~~~ 81 (90)
T PF08758_consen 73 VHAWDSQTQ 81 (90)
T ss_dssp EEEEETTTT
T ss_pred EEEECCCCC
Confidence 999998663
No 20
>TIGR01965 VCBS_repeat VCBS repeat. This domain of about 100 residues is found multiple (up to 35) copies in long proteins from several species of Vibrio, Colwellia, Bradyrhizobium, and Shewanella (hence the name VCBS) and in smaller copy numbers in proteins from several other bacteria. The large protein size and repeat copy numbers, species distribution, and suggested activities of several member proteins suggests a role for this domain in adhesion.
Probab=91.75 E-value=4.4 Score=30.64 Aligned_cols=89 Identities=17% Similarity=0.135 Sum_probs=55.1
Q ss_pred EeeEEEecCCCCCCCceEEEEEee-CCCCceEEEeCCCCccccccccCCcceEEEec------ccCCcccccEEEEEEEE
Q psy2920 11 KGLPILVASSHKHPHQLVRYSITQ-GNDNHTFALKNPTTHEIRNSTESKSGVLLVTS------GDIDREEVAQYSLVITA 83 (325)
Q Consensus 11 ~~~~a~d~D~~~~~~~~i~y~i~~-~~~~~~F~i~~~~g~~~~~~~~~~~~~~l~~~------~~LDrE~~~~~~l~v~a 83 (325)
++|.+.|+|.++. .++++.. .+.-+.|.|+.... ....+. +.|.-...-.-.|++.+
T Consensus 2 G~Lt~sD~D~gd~----~~~s~~~~~g~yGtlti~~~G~------------wtYtl~n~~~avq~L~~Ge~~tdsFtvtv 65 (99)
T TIGR01965 2 GQLTISDADAGQA----HFIAQTDAAGQYGTFSIDADGQ------------WTYQADNSQTAVQALKAGETLTDTFTVTS 65 (99)
T ss_pred CceEEeCCCCCCc----eEEecccccCCcEEEEECCCCc------------EEEEeCCCcHHHHhhcCCCEEEEEEEEEE
Confidence 4678889987754 5666642 22346788887322 122222 22444444456777888
Q ss_pred EeCCCCCeeEEEEEEEEecCCCCCCccCcceeEEEEecc
Q psy2920 84 TDVKGINKAETHVTVDVLDENDNSPVFSQKVYRFIVGDL 122 (325)
Q Consensus 84 ~D~~~~~~~~~~v~v~V~DvNdn~P~f~~~~~~~~v~E~ 122 (325)
.|. .+..|.|+|.-.|| +|+.... -...+.|+
T Consensus 66 ~DG-----tt~~vtItI~GtND-apvi~~~-~~g~v~ED 97 (99)
T TIGR01965 66 ADG-----TSQTVTITITGAND-AAVIGGA-DTGSVTED 97 (99)
T ss_pred eCC-----CeEEEEEEEEccCC-CCEEecc-cceeEecC
Confidence 884 38889999999999 7755432 23556554
No 21
>PF05345 He_PIG: Putative Ig domain; InterPro: IPR008009 This alignment represents the conserved core region of a ~90 residue repeat found in several haemagglutinins and other cell surface proteins. Sequence similarities to Hyalin (IPR003410 from INTERPRO) and the PKD domain (IPR000601 from INTERPRO) suggest an Ig-like fold so this family may be similar in function to the (IPR003791 from INTERPRO) and (IPR003790 from INTERPRO) protein families.
Probab=89.27 E-value=1.9 Score=27.99 Aligned_cols=35 Identities=26% Similarity=0.248 Sum_probs=27.5
Q ss_pred CcEEEeCCceEEEEccCCCCCCceEEEEEEEEeCC
Q psy2920 161 NLVVIVPQTGELLLTGDPPSDDVEYELTVEAHDLQ 195 (325)
Q Consensus 161 ~~F~i~~~tG~i~l~~~d~e~~~~y~l~V~a~D~g 195 (325)
.+..||+.+|.|.=....-.....|.+.|.|+|..
T Consensus 14 ~gLs~d~~tG~isGtp~~~~~~G~y~~~vtatd~~ 48 (49)
T PF05345_consen 14 SGLSLDPSTGTISGTPTSSVQPGTYTFTVTATDGS 48 (49)
T ss_pred CcEEEeCCCCEEEeecCCCccccEEEEEEEEEcCC
Confidence 57889999999987765222346899999999965
No 22
>KOG3597|consensus
Probab=88.68 E-value=13 Score=36.03 Aligned_cols=156 Identities=10% Similarity=0.068 Sum_probs=85.0
Q ss_pred eEEEEEEEEecCCCCCCccCcceeEEEEeccCCCCCCCcccccceeEEEEEEEEeCCCC--eEEEEEecCCC------cE
Q psy2920 92 AETHVTVDVLDENDNSPVFSQKVYRFIVGDLKSSLNSNVSHWKKFSSIGSVSATDADGD--KVAYKLAVPSN------LV 163 (325)
Q Consensus 92 ~~~~v~v~V~DvNdn~P~f~~~~~~~~v~E~~~~g~~~~~~~~~~~~v~~v~A~D~D~~--~v~Y~i~~~~~------~F 163 (325)
.+..+.|.|..+||.+..+....+.+.+.|....- .....+.+.|+|.+ .+.|++....+ .|
T Consensus 24 ~~~~~~i~v~pvndpp~~~~~~~~~l~~~~~~~k~----------l~~~~l~~~d~d~~~~~l~f~v~~t~~~~~~~~~~ 93 (442)
T KOG3597|consen 24 QTDVLRIHVNPVNDPPSLIFPSGSLLVILEGGQKV----------LDPELLTAADPDSAPLPLEFQVLGTSSVPLPVLKF 93 (442)
T ss_pred EEeeecccccccCCCcceeecccceEEeecCCcee----------ccceEeeccCCCCCccceEEEEccCCCCCCcccee
Confidence 35567899999999777777666778888776543 55567899999984 68888886211 13
Q ss_pred EEe-----------CCceEEEEccCCCCCCceEEEEEEEEeCCCCCCcceecEEEEEEEecCCCCCceeeeccccCCCCc
Q psy2920 164 VIV-----------PQTGELLLTGDPPSDDVEYELTVEAHDLQSPSRYAREPAQVWLQFNSPVLQDEQLLQMIDRKSEAP 232 (325)
Q Consensus 164 ~i~-----------~~tG~i~l~~~d~e~~~~y~l~V~a~D~g~p~~~s~~t~~v~I~v~d~~~~P~~~~~~~~~~~~~~ 232 (325)
..- -..|.+.+.-. -..-..+.+..+++|+-. .+ . ..|. -+-..++.+.... .....
T Consensus 94 ~~~g~~~~~Fs~~~v~~g~~~yvh~-g~el~~~~~~~~~SDg~~---~S--~--~~i~--~~~~~~~~~~~~~--~~gL~ 161 (442)
T KOG3597|consen 94 DVPGAPATEFSYEEVEDGSLSYVHS-GTELRESELQLRVSDGLL---VS--E--RAIL--KVEATGPAPHLAR--NTGLK 161 (442)
T ss_pred eccCCcccceEehHhhcCceeEEec-CcccccceEEEEeecceE---ee--e--eEEe--cccCCCcceeeec--ccceE
Confidence 331 22333333222 112456778888888552 11 1 1111 1222222111110 10000
Q ss_pred cce-eeeeEeeccCCceEEEEEeCCCCccceEEEEEEeCCccc
Q psy2920 233 HRI-EKRRVTRAVRPTKRIDFSETDGEMEGRIVFALEKETERE 274 (325)
Q Consensus 233 ~~~-~~~~v~~~~~~~~~v~a~d~D~~~~~~i~y~i~~~~~~~ 274 (325)
+.. ..+.+.. .-+.+.|.|.+..-.+.|.|......|
T Consensus 162 v~~gS~~~IT~-----~~L~ved~d~~~d~~v~~~i~~~P~~G 199 (442)
T KOG3597|consen 162 VLQGSTAPITP-----SNLSVEDNDSSPDDEVRYDITPPPQHG 199 (442)
T ss_pred EccCccccccH-----hHceeecCCCCCCcEEEEEecCCCCCC
Confidence 000 0122222 337888888666678899988765544
No 23
>PF13750 Big_3_3: Bacterial Ig-like domain (group 3)
Probab=87.57 E-value=15 Score=30.35 Aligned_cols=111 Identities=26% Similarity=0.220 Sum_probs=58.8
Q ss_pred ccEEEEEE-EEEeCCCCCeeEEEEEEEEecCCCCCCccCcceeEEEEeccCCCCCCCcccccceeEEEEEEEEeCCCC--
Q psy2920 74 VAQYSLVI-TATDVKGINKAETHVTVDVLDENDNSPVFSQKVYRFIVGDLKSSLNSNVSHWKKFSSIGSVSATDADGD-- 150 (325)
Q Consensus 74 ~~~~~l~v-~a~D~~~~~~~~~~v~v~V~DvNdn~P~f~~~~~~~~v~E~~~~g~~~~~~~~~~~~v~~v~A~D~D~~-- 150 (325)
-..|.+.+ .|.|.-| +..+..+...+. +...+|.+.- .....+... .+-.|..=.++.++|....
T Consensus 14 dG~Y~l~~~~a~D~ag-N~~~~~~~~~~~-iD~T~Ptisi-------~~~~~~~~g---~~v~~~~~i~i~~tD~~~~~~ 81 (158)
T PF13750_consen 14 DGSYTLTVVTATDAAG-NTSTSTVSETFT-IDNTPPTISI-------SDGASVANG---STVYGLVNISINVTDNSDDSK 81 (158)
T ss_pred CccEEEEEEEEEecCC-CEEEEEEeeEEE-EcCCCCEEEE-------ecCCccCCC---ccccceeeeEEEEEeCCCCce
Confidence 45799999 7999866 333444433333 3445775532 111111100 0011234456778777763
Q ss_pred eEEEEEecCC--CcE--EE-eCCceEEEEccC----CCCCCceEEEEEEEEeCCC
Q psy2920 151 KVAYKLAVPS--NLV--VI-VPQTGELLLTGD----PPSDDVEYELTVEAHDLQS 196 (325)
Q Consensus 151 ~v~Y~i~~~~--~~F--~i-~~~tG~i~l~~~----d~e~~~~y~l~V~a~D~g~ 196 (325)
-...+|.++. ..- .. ....|...+.-+ ..|..+.|.|+|.|+|..+
T Consensus 82 i~sv~l~Gg~~~d~v~ls~~~~~~~~~~~~yp~~fpsle~~~~YtLtV~a~D~aG 136 (158)
T PF13750_consen 82 ITSVSLTGGPASDSVSLSWTNKGNGVYTLEYPRIFPSLEADDSYTLTVSATDKAG 136 (158)
T ss_pred EEEEEEECCcccceEEEeeEeccCceEEeecccccCCcCCCCeEEEEEEEEecCC
Confidence 3456666532 221 11 123444444322 3467789999999999876
No 24
>TIGR01965 VCBS_repeat VCBS repeat. This domain of about 100 residues is found multiple (up to 35) copies in long proteins from several species of Vibrio, Colwellia, Bradyrhizobium, and Shewanella (hence the name VCBS) and in smaller copy numbers in proteins from several other bacteria. The large protein size and repeat copy numbers, species distribution, and suggested activities of several member proteins suggests a role for this domain in adhesion.
Probab=87.50 E-value=6 Score=29.92 Aligned_cols=71 Identities=15% Similarity=0.149 Sum_probs=46.3
Q ss_pred EEEEEEeCCCC-eEEEEEec---CCCcEEEeCCceEEEEccC-C-----C--C-CCceEEEEEEEEeCCCCCCcceecEE
Q psy2920 140 GSVSATDADGD-KVAYKLAV---PSNLVVIVPQTGELLLTGD-P-----P--S-DDVEYELTVEAHDLQSPSRYAREPAQ 206 (325)
Q Consensus 140 ~~v~A~D~D~~-~v~Y~i~~---~~~~F~i~~~tG~i~l~~~-d-----~--e-~~~~y~l~V~a~D~g~p~~~s~~t~~ 206 (325)
+++.++|+|.+ ...+++.. ..+.|.|++ +|.-..... . + + ..-.-.++|.+.|+ . +.+
T Consensus 2 G~Lt~sD~D~gd~~~~s~~~~~g~yGtlti~~-~G~wtYtl~n~~~avq~L~~Ge~~tdsFtvtv~DG------t--t~~ 72 (99)
T TIGR01965 2 GQLTISDADAGQAHFIAQTDAAGQYGTFSIDA-DGQWTYQADNSQTAVQALKAGETLTDTFTVTSADG------T--SQT 72 (99)
T ss_pred CceEEeCCCCCCceEEecccccCCcEEEEECC-CCcEEEEeCCCcHHHHhhcCCCEEEEEEEEEEeCC------C--eEE
Confidence 46789999985 45666643 235688865 676544433 1 1 1 11235677788884 2 678
Q ss_pred EEEEEecCCCCCc
Q psy2920 207 VWLQFNSPVLQDE 219 (325)
Q Consensus 207 v~I~v~d~~~~P~ 219 (325)
|+|+|.-.|++|.
T Consensus 73 vtItI~GtNDapv 85 (99)
T TIGR01965 73 VTITITGANDAAV 85 (99)
T ss_pred EEEEEEccCCCCE
Confidence 8899888999998
No 25
>PF07495 Y_Y_Y: Y_Y_Y domain; InterPro: IPR011123 This region is mostly found at the end of the beta propellers (IPR011110 from INTERPRO) in a family of two component regulators. However they are also found tandemly repeated in Q891H4 from SWISSPROT without other signal conduction domains being present. It is named after the conserved tyrosines found in the alignment. The exact function is not known.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=84.48 E-value=10 Score=25.67 Aligned_cols=56 Identities=14% Similarity=0.150 Sum_probs=31.7
Q ss_pred eEEEEEecC-CCcEEEeCCceEEEEccCCCCCCceEEEEEEEEeCCCCCCcceecEEEEEEE
Q psy2920 151 KVAYKLAVP-SNLVVIVPQTGELLLTGDPPSDDVEYELTVEAHDLQSPSRYAREPAQVWLQF 211 (325)
Q Consensus 151 ~v~Y~i~~~-~~~F~i~~~tG~i~l~~~d~e~~~~y~l~V~a~D~g~p~~~s~~t~~v~I~v 211 (325)
...|+|.+. ..+..+...+-.+..... ..+.|+|.|+|.|..+..... ..++.|.|
T Consensus 9 ~Y~Y~l~g~d~~W~~~~~~~~~~~~~~L---~~G~Y~l~V~a~~~~~~~~~~--~~~l~i~I 65 (66)
T PF07495_consen 9 RYRYRLEGFDDEWITLGSYSNSISYTNL---PPGKYTLEVRAKDNNGKWSSD--EKSLTITI 65 (66)
T ss_dssp EEEEEEETTESSEEEESSTS-EEEEES-----SEEEEEEEEEEETTS-B-SS---EEEEEEE
T ss_pred EEEEEEECCCCeEEECCCCcEEEEEEeC---CCEEEEEEEEEECCCCCcCcc--cEEEEEEE
Confidence 466777763 456666433325555443 668999999999977643333 24555554
No 26
>TIGR00845 caca sodium/calcium exchanger 1. This model is specific for the eukaryotic sodium ion/calcium ion exchangers of the Caca family
Probab=84.12 E-value=62 Score=34.42 Aligned_cols=29 Identities=28% Similarity=0.417 Sum_probs=19.8
Q ss_pred EEEEEEEecCCCCCCccCcceeEEEEeccC
Q psy2920 94 THVTVDVLDENDNSPVFSQKVYRFIVGDLK 123 (325)
Q Consensus 94 ~~v~v~V~DvNdn~P~f~~~~~~~~v~E~~ 123 (325)
...+|+|.| ||++|.|....-...|.|+.
T Consensus 515 s~ATVTIlD-DD~aGIfsFe~~~~sV~Es~ 543 (928)
T TIGR00845 515 NTATVTILD-DDHAGIFTFEEDVFHVSESI 543 (928)
T ss_pred ceEEEEEec-CcccCcccccCceEEEEcCC
Confidence 355677777 77788776555556788864
No 27
>PF05895 DUF859: Siphovirus protein of unknown function (DUF859); InterPro: IPR008577 This entry is represented by Streptococcus phage 7201, Orf39. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This family consists of several uncharacterised proteins from a number of the Siphoviruses as well as some bacterial proteins from Streptococcus species. Some of the members of this family are described as putative minor structural proteins.
Probab=81.95 E-value=38 Score=34.37 Aligned_cols=113 Identities=19% Similarity=0.211 Sum_probs=62.2
Q ss_pred cccEEEEEEEEEeCCCCCeeEEEEEEEEecCCCCCCccCcceeEEEEeccCCCCCCCcccccceeEEEEEEE--EeCCCC
Q psy2920 73 EVAQYSLVITATDVKGINKAETHVTVDVLDENDNSPVFSQKVYRFIVGDLKSSLNSNVSHWKKFSSIGSVSA--TDADGD 150 (325)
Q Consensus 73 ~~~~~~l~v~a~D~~~~~~~~~~v~v~V~DvNdn~P~f~~~~~~~~v~E~~~~g~~~~~~~~~~~~v~~v~A--~D~D~~ 150 (325)
......+++.++|.+|..+......|+|++. ++|.+. +++.-...-+.. +.-.+...|..+.. ..+..-
T Consensus 296 ~~G~~Ti~atVtDSRGr~S~~~~~tItVl~Y--~~P~ls-----fsv~R~~~~~~~--~~v~~~a~Iapl~v~g~qKN~~ 366 (624)
T PF05895_consen 296 FSGSATIRATVTDSRGRTSDPKTKTITVLEY--SPPTLS-----FSVYRCGSSGNT--LTVTRNAKIAPLTVNGVQKNTM 366 (624)
T ss_pred cCceEEEEEEEEECCCccCCceEEEEEEEEc--CCCcEE-----EEEEEeCCCCcE--EEEEEEEEEeEEEEcccccceE
Confidence 3567899999999998667778899999998 567663 222222111100 00000011221111 111112
Q ss_pred eEEEEEec-CCCcEEEeCC--ce-----------EEEEccCCCCCCceEEEEEEEEeCC
Q psy2920 151 KVAYKLAV-PSNLVVIVPQ--TG-----------ELLLTGDPPSDDVEYELTVEAHDLQ 195 (325)
Q Consensus 151 ~v~Y~i~~-~~~~F~i~~~--tG-----------~i~l~~~d~e~~~~y~l~V~a~D~g 195 (325)
.+.|+... +...|.++.. .| ...|.. .+.....|.+++.++|.-
T Consensus 367 ~lt~~~a~~gt~~~t~d~~~a~~~~s~~s~~~~~~~~L~g-~y~~~kSy~V~~~l~D~F 424 (624)
T PF05895_consen 367 TLTFKVAPLGTGTFTTDNGSASGTWSSISELTNSSANLGG-TYDAEKSYDVRGTLSDKF 424 (624)
T ss_pred EEEEEEEEcCcceEEEEccccccceeeeeeecccceeecc-ccCCCceEEEEEEEEEEe
Confidence 57777765 5566766521 11 122222 466778999999999954
No 28
>TIGR03660 T1SS_rpt_143 T1SS-143 repeat domain. This model represents a domain of about 143 amino acids that may occur singly or in up to 23 tandem repeats in very large proteins in the genus Vibrio, and in related species such as Legionella pneumophila, Photobacterium profundum, Rhodopseudomonas palustris, Shewanella pealeana, and Aeromonas hydrophila. Proteins with these domains represent a subset of a broader set of proteins with a particular signal for type 1 secretion, consisting of several glycine-rich repeats modeled by pfam00353, followed by a C-terminal domain modeled by TIGR03661. Proteins with this domain tend to share several properties with the RtxA (Repeats in Toxin) protein of Vibrio cholerae, including a large size often containing tandemly repeated domains and a C-terminal signal for type 1 secretion.
Probab=76.24 E-value=39 Score=27.19 Aligned_cols=61 Identities=20% Similarity=0.288 Sum_probs=43.2
Q ss_pred ceEEEecccCCccc---ccEEEEEEEEEeCCCCCeeEEEEEEEEecCCCCCCccCcceeEEEEeccCCC
Q psy2920 60 GVLLVTSGDIDREE---VAQYSLVITATDVKGINKAETHVTVDVLDENDNSPVFSQKVYRFIVGDLKSS 125 (325)
Q Consensus 60 ~~~l~~~~~LDrE~---~~~~~l~v~a~D~~~~~~~~~~v~v~V~DvNdn~P~f~~~~~~~~v~E~~~~ 125 (325)
.+...+..+||... .-...|.|.|+|..|..+ ...+.|+|.| | .|...... .+.|.|...+
T Consensus 67 sYtftL~~~lDH~~g~d~l~l~~~v~a~D~DGD~s-~~~l~VtI~D--D-~P~~~~~~-~~~V~E~~L~ 130 (137)
T TIGR03660 67 SYEFTLEGPLDHAAGSDELTLNFPIIATDFDGDTS-SITLPVTIVD--D-VPTITDVD-ALTVDEDDLP 130 (137)
T ss_pred cEEEEEcccccCCCCCceEEEeeeEEEEeCCCCcc-ccEEEEEEEC--C-CCeecccc-ceEEeccccc
Confidence 45788999998843 446788899999877543 3588888887 6 47665443 4788886544
No 29
>KOG4221|consensus
Probab=75.28 E-value=1.3e+02 Score=32.91 Aligned_cols=66 Identities=18% Similarity=0.302 Sum_probs=43.0
Q ss_pred EEEEEeeC--CCCceEEEeCCCCccccccccCCcceEEEecccCCcccccEEEEEEEEEeCCCCCeeEEEEEE-EEecCC
Q psy2920 28 VRYSITQG--NDNHTFALKNPTTHEIRNSTESKSGVLLVTSGDIDREEVAQYSLVITATDVKGINKAETHVTV-DVLDEN 104 (325)
Q Consensus 28 i~y~i~~~--~~~~~F~i~~~~g~~~~~~~~~~~~~~l~~~~~LDrE~~~~~~l~v~a~D~~~~~~~~~~v~v-~V~DvN 104 (325)
+.|++.-. +....+.++..+. .+++. +.|....|.+.|.|....|...++..+.| +..|+-
T Consensus 552 ~~yk~~ys~~~~~~~~~~~~n~~-------------e~ti~---gL~k~TeY~~~vvA~N~~G~g~sS~~i~V~Tlsd~P 615 (1381)
T KOG4221|consen 552 TGYKLFYSEDDTGKELRVENNAT-------------EYTIN---GLEKYTEYSIRVVAYNSAGSGVSSADITVRTLSDVP 615 (1381)
T ss_pred eEEEEEEEcCCCCceEEEecCcc-------------EEEee---cCCCccceEEEEEEecCCCCCCCCCceEEEeccCCC
Confidence 35665432 2335678877665 55555 45778899999999998887644444433 455777
Q ss_pred CCCCc
Q psy2920 105 DNSPV 109 (325)
Q Consensus 105 dn~P~ 109 (325)
+-||.
T Consensus 616 saPP~ 620 (1381)
T KOG4221|consen 616 SAPPQ 620 (1381)
T ss_pred CCCCc
Confidence 76665
No 30
>PF05345 He_PIG: Putative Ig domain; InterPro: IPR008009 This alignment represents the conserved core region of a ~90 residue repeat found in several haemagglutinins and other cell surface proteins. Sequence similarities to Hyalin (IPR003410 from INTERPRO) and the PKD domain (IPR000601 from INTERPRO) suggest an Ig-like fold so this family may be similar in function to the (IPR003791 from INTERPRO) and (IPR003790 from INTERPRO) protein families.
Probab=65.33 E-value=26 Score=22.62 Aligned_cols=36 Identities=22% Similarity=0.325 Sum_probs=25.3
Q ss_pred CceEEEeCCCCccccccccCCcceEEEecccCCcc-cccEEEEEEEEEeCCC
Q psy2920 38 NHTFALKNPTTHEIRNSTESKSGVLLVTSGDIDRE-EVAQYSLVITATDVKG 88 (325)
Q Consensus 38 ~~~F~i~~~~g~~~~~~~~~~~~~~l~~~~~LDrE-~~~~~~l~v~a~D~~~ 88 (325)
.....+|+.+| .|.-. .+.. ....|.+.|.|+|..|
T Consensus 13 P~gLs~d~~tG-------------~isGt--p~~~~~~G~y~~~vtatd~~G 49 (49)
T PF05345_consen 13 PSGLSLDPSTG-------------TISGT--PTSSVQPGTYTFTVTATDGSG 49 (49)
T ss_pred CCcEEEeCCCC-------------EEEee--cCCCccccEEEEEEEEEcCCC
Confidence 45788999998 55543 3334 3368999999999753
No 31
>PF13754 Big_3_4: Bacterial Ig-like domain (group 3)
Probab=53.68 E-value=40 Score=22.11 Aligned_cols=17 Identities=29% Similarity=0.202 Sum_probs=14.7
Q ss_pred ceEEEEEEEecCCCCCC
Q psy2920 308 KTIDFWVTINNQATNGG 324 (325)
Q Consensus 308 ~~~~l~V~a~D~~~~~g 324 (325)
..|.|.|.|+|.+++..
T Consensus 24 G~y~itv~a~D~AGN~s 40 (54)
T PF13754_consen 24 GTYTITVTATDAAGNTS 40 (54)
T ss_pred ccEEEEEEEEeCCCCCC
Confidence 68999999999998764
No 32
>KOG4221|consensus
Probab=52.52 E-value=3.6e+02 Score=29.82 Aligned_cols=121 Identities=15% Similarity=0.171 Sum_probs=70.4
Q ss_pred ccEEEEEEEEEeCCCCCeeEEEEEEEEecCCCCCCccCcce-----eEEEEeccCCCCCCCcccccceeEEEEEEEEeCC
Q psy2920 74 VAQYSLVITATDVKGINKAETHVTVDVLDENDNSPVFSQKV-----YRFIVGDLKSSLNSNVSHWKKFSSIGSVSATDAD 148 (325)
Q Consensus 74 ~~~~~l~v~a~D~~~~~~~~~~v~v~V~DvNdn~P~f~~~~-----~~~~v~E~~~~g~~~~~~~~~~~~v~~v~A~D~D 148 (325)
...|.+++.|....|...+...+.|.-... +|.- ... ..+.|.-..++-.++ .|..
T Consensus 493 ~t~Y~~rv~A~n~~g~g~sS~pLkV~t~pE---gp~~-~~a~ats~~ti~v~WepP~~~n~--------~I~~------- 553 (1381)
T KOG4221|consen 493 LTMYFFRVRAKNEAGSGESSAPLKVTTQPE---GPVQ-LQAYATSPTTILVTWEPPPFGNG--------PITG------- 553 (1381)
T ss_pred ceeEEEEEeccCcccCCccCCceEEecCCC---CCcc-ccccccCcceEEEEecCCCCCCC--------CceE-------
Confidence 457999999988777665566665543333 3322 112 222222222220010 0000
Q ss_pred CCeEEEEEecCCCcEEEeCCceEEEEccCCCCCCceEEEEEEEEeCCCCCCcceecEEEEE-EEecCCCCCc
Q psy2920 149 GDKVAYKLAVPSNLVVIVPQTGELLLTGDPPSDDVEYELTVEAHDLQSPSRYAREPAQVWL-QFNSPVLQDE 219 (325)
Q Consensus 149 ~~~v~Y~i~~~~~~F~i~~~tG~i~l~~~d~e~~~~y~l~V~a~D~g~p~~~s~~t~~v~I-~v~d~~~~P~ 219 (325)
=++.|+..+....+.++.++-+.++.. .+....|.+.|+|..+.++..++ +.|+| +..|+-.+||
T Consensus 554 -yk~~ys~~~~~~~~~~~~n~~e~ti~g--L~k~TeY~~~vvA~N~~G~g~sS---~~i~V~Tlsd~PsaPP 619 (1381)
T KOG4221|consen 554 -YKLFYSEDDTGKELRVENNATEYTING--LEKYTEYSIRVVAYNSAGSGVSS---ADITVRTLSDVPSAPP 619 (1381)
T ss_pred -EEEEEEcCCCCceEEEecCccEEEeec--CCCccceEEEEEEecCCCCCCCC---CceEEEeccCCCCCCC
Confidence 045555555567888877776766664 56778899999999999888776 23333 3456666776
No 33
>PF07495 Y_Y_Y: Y_Y_Y domain; InterPro: IPR011123 This region is mostly found at the end of the beta propellers (IPR011110 from INTERPRO) in a family of two component regulators. However they are also found tandemly repeated in Q891H4 from SWISSPROT without other signal conduction domains being present. It is named after the conserved tyrosines found in the alignment. The exact function is not known.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=49.76 E-value=34 Score=22.99 Aligned_cols=29 Identities=28% Similarity=0.444 Sum_probs=19.4
Q ss_pred cccEEEEEEEEEeCCCCCe-eEEEEEEEEe
Q psy2920 73 EVAQYSLVITATDVKGINK-AETHVTVDVL 101 (325)
Q Consensus 73 ~~~~~~l~v~a~D~~~~~~-~~~~v~v~V~ 101 (325)
....|.|.|.|.|..+... ....+.|.|+
T Consensus 37 ~~G~Y~l~V~a~~~~~~~~~~~~~l~i~I~ 66 (66)
T PF07495_consen 37 PPGKYTLEVRAKDNNGKWSSDEKSLTITIL 66 (66)
T ss_dssp -SEEEEEEEEEEETTS-B-SS-EEEEEEEE
T ss_pred CCEEEEEEEEEECCCCCcCcccEEEEEEEC
Confidence 4678999999999876543 2366776663
No 34
>PF12245 Big_3_2: Bacterial Ig-like domain (group 3); InterPro: IPR022038 This family of proteins is found in bacteria. They have two conserved sequence motifs: AGN and GMT.
Probab=49.65 E-value=71 Score=21.46 Aligned_cols=31 Identities=29% Similarity=0.537 Sum_probs=20.8
Q ss_pred cccEEEEEEEEEeCCCCCeeEEEEEEEEecCC
Q psy2920 73 EVAQYSLVITATDVKGINKAETHVTVDVLDEN 104 (325)
Q Consensus 73 ~~~~~~l~v~a~D~~~~~~~~~~v~v~V~DvN 104 (325)
....|.+.+.++|.-| +.+.......+.|..
T Consensus 21 ~dg~yt~~v~a~D~AG-N~~~~~~~~~i~d~~ 51 (60)
T PF12245_consen 21 ADGEYTLTVTATDKAG-NTSSSTTQIVIVDNT 51 (60)
T ss_pred CCccEEEEEEEEECCC-CEEEeeeEEEEEcCC
Confidence 3567999999999877 444445555555443
No 35
>PF13860 FlgD_ig: FlgD Ig-like domain; PDB: 3C12_A 3OSV_A.
Probab=39.61 E-value=1.2e+02 Score=21.59 Aligned_cols=13 Identities=31% Similarity=0.345 Sum_probs=11.2
Q ss_pred ceEEEEEEEecCC
Q psy2920 308 KTIDFWVTINNQA 320 (325)
Q Consensus 308 ~~~~l~V~a~D~~ 320 (325)
..|.|.|.|+|.|
T Consensus 69 G~Y~~~v~a~~~g 81 (81)
T PF13860_consen 69 GTYTFRVTATDGG 81 (81)
T ss_dssp EEEEEEEEEEET-
T ss_pred CCEEEEEEEEeCC
Confidence 7999999999975
No 36
>smart00089 PKD Repeats in polycystic kidney disease 1 (PKD1) and other proteins. Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.
Probab=38.53 E-value=80 Score=21.92 Aligned_cols=30 Identities=30% Similarity=0.388 Sum_probs=21.8
Q ss_pred CCcccccEEEEEEEEEeCCCCCeeEEEEEEEE
Q psy2920 69 IDREEVAQYSLVITATDVKGINKAETHVTVDV 100 (325)
Q Consensus 69 LDrE~~~~~~l~v~a~D~~~~~~~~~~v~v~V 100 (325)
.-|.....|.+++.+.|..| +....++|.|
T Consensus 49 ~~y~~~G~y~v~l~v~n~~g--~~~~~~~i~v 78 (79)
T smart00089 49 HTYTKPGTYTVTLTVTNAVG--SASATVTVVV 78 (79)
T ss_pred EEeCCCcEEEEEEEEEcCCC--cEEEEEEEEE
Confidence 34667789999999999866 4555555554
No 37
>PF02494 HYR: HYR domain; InterPro: IPR003410 This domain is known as the HYR (Hyalin Repeat) domain, after the protein hyalin that is composed exclusively of this repeat. This domain probably corresponds to a new superfamily in the immunoglobulin fold. The function of this domain is uncertain it may be involved in cell adhesion. In the Sushi repeat-containing protein (SrpX), this domain is found between two sushi repeats.
Probab=37.93 E-value=75 Score=22.51 Aligned_cols=27 Identities=41% Similarity=0.463 Sum_probs=21.5
Q ss_pred cccEEEEEEEEEeCCCCCeeEEEEEEEE
Q psy2920 73 EVAQYSLVITATDVKGINKAETHVTVDV 100 (325)
Q Consensus 73 ~~~~~~l~v~a~D~~~~~~~~~~v~v~V 100 (325)
....|.++..|+|..| +++.+.+.|+|
T Consensus 55 ~~G~t~V~ytA~D~~G-N~a~C~f~V~V 81 (81)
T PF02494_consen 55 PVGTTTVTYTATDAAG-NSATCSFTVTV 81 (81)
T ss_pred eeceEEEEEEEEECCC-CEEEEEEEEEC
Confidence 3568999999999865 66788888875
No 38
>smart00089 PKD Repeats in polycystic kidney disease 1 (PKD1) and other proteins. Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.
Probab=34.30 E-value=90 Score=21.65 Aligned_cols=29 Identities=17% Similarity=0.139 Sum_probs=21.0
Q ss_pred CCCCCceEEEEEEEEeCCCCCCcceecEEEEEEE
Q psy2920 178 PPSDDVEYELTVEAHDLQSPSRYAREPAQVWLQF 211 (325)
Q Consensus 178 d~e~~~~y~l~V~a~D~g~p~~~s~~t~~v~I~v 211 (325)
.|.....|.+++.|+|..+ ++ ++++.|.|
T Consensus 50 ~y~~~G~y~v~l~v~n~~g---~~--~~~~~i~v 78 (79)
T smart00089 50 TYTKPGTYTVTLTVTNAVG---SA--SATVTVVV 78 (79)
T ss_pred EeCCCcEEEEEEEEEcCCC---cE--EEEEEEEE
Confidence 3667789999999999765 33 55555554
No 39
>PF10365 DUF2436: Domain of unknown function (DUF2436); InterPro: IPR018832 Gingipains R and K are endopeptidases with specificity for arginyl and lysyl bonds, respectively. Like other cysteine peptidases, they require reducing conditions for activity. They are maximally active at approximately neutral pH. Gingipains R and K are secreted by the bacterium Porphyromonas gingivalis (Bacteroides gingivalis). The bacterium is a major pathogen in periodontal disease, and the many ways in which the activities of the gingipains may contribute to the disease processes have been reviewed []. These enzymes are also involved in the hemagglutinating activity of the organisms. This entry represents a central region found in gingipain K peptidases, active on lysyl bonds; they belong to the MEROPS peptidase family C25 (gingipain family, clan CD).
Probab=29.85 E-value=3e+02 Score=22.22 Aligned_cols=126 Identities=13% Similarity=0.142 Sum_probs=66.2
Q ss_pred EEEecccCCcccccEEEEEEEEEeCCCCCeeEEEEEEEEecCCCCCCccCcceeEEEEeccCCCCCCCcccccceeEEEE
Q psy2920 62 LLVTSGDIDREEVAQYSLVITATDVKGINKAETHVTVDVLDENDNSPVFSQKVYRFIVGDLKSSLNSNVSHWKKFSSIGS 141 (325)
Q Consensus 62 ~l~~~~~LDrE~~~~~~l~v~a~D~~~~~~~~~~v~v~V~DvNdn~P~f~~~~~~~~v~E~~~~g~~~~~~~~~~~~v~~ 141 (325)
.|.+...=..+...-|++.+-| |+..-. ....-+=-+.+ |+++|.=.-..|+-.|++|+-+.- ++....+--
T Consensus 27 ~IILeah~vW~DgsGyQ~LlDa-DHn~yG-~~iP~~g~l~~-n~~~pa~ly~~FEYkiP~NADps~-----tpq~mv~dG 98 (161)
T PF10365_consen 27 RIILEAHNVWGDGSGYQMLLDA-DHNTYG-DVIPDSGPLWD-NCNVPANLYDPFEYKIPANADPST-----TPQNMVVDG 98 (161)
T ss_pred EEEEeccccccCCcceEEEEcC-Cccccc-ccccccCcccc-CCCCChhhcccceEeccCCCCCcc-----CcceEEecC
Confidence 5666555566667777776654 221100 00111112233 778887666778888999987641 111122111
Q ss_pred EEEEeCCCCeEEEEEecC--CCcEEEeCCceEEEEccCC--CCCCceEEEEEEEEeCC
Q psy2920 142 VSATDADGDKVAYKLAVP--SNLVVIVPQTGELLLTGDP--PSDDVEYELTVEAHDLQ 195 (325)
Q Consensus 142 v~A~D~D~~~v~Y~i~~~--~~~F~i~~~tG~i~l~~~d--~e~~~~y~l~V~a~D~g 195 (325)
---.+.-.|..-|-|..+ +....|-..-|.=..+..| +|.-..|++++++..++
T Consensus 99 ~~~i~IPaG~YDy~I~~P~~~~kiwIaGd~g~~~tr~dDy~fEAGKtY~ftm~~~g~g 156 (161)
T PF10365_consen 99 EASIDIPAGTYDYCIAAPQPGGKIWIAGDGGDGPTRGDDYVFEAGKTYRFTMKRVGSG 156 (161)
T ss_pred ceEEEecCceeEEEEecCCCCCeEEEecCCCCCCccccceEEecCCEEEEEEEeccCC
Confidence 111122225667777764 3445553333333333333 46778999999987765
No 40
>PF02010 REJ: REJ domain; InterPro: IPR002859 The REJ (Receptor for Egg Jelly) domain is found in PKD1 P98161 from SWISSPROT and the sperm receptor for egg jelly Q26627 from SWISSPROT. The exact function of this domain is unknown. The domain is 600 amino acids long so is probably composed of multiple structural domains. There are six completely conserved cysteine residues that may form disulphide bridges. This region contains tandem PKD-like domains. Sequence similarity between a region of the autosomal dominant polycystic kidney disease (ADPKD) protein, polycystin-1 and a sea urchin sperm glycoprotein involved in fertilization, the receptor for egg jelly (suREJ) has been known for some time. The suREJ protein binds the glycoprotein coat of the egg (egg jelly), triggering the acrosome reaction, which transforms the sperm into a fusogenic cell. The sequence similarity and expression pattern suggests that the predicted human PKDREJ protein is a mammalian equivalent of the suREJ protein and therefore may have a central role in human fertilization [].; PDB: 2E7M_A 2YRL_A.
Probab=28.65 E-value=90 Score=29.92 Aligned_cols=42 Identities=12% Similarity=0.115 Sum_probs=0.0
Q ss_pred CCceEEEEccCCCCCCceEEEEEEEEeCCCCCCcceecEEEEEEEe
Q psy2920 167 PQTGELLLTGDPPSDDVEYELTVEAHDLQSPSRYAREPAQVWLQFN 212 (325)
Q Consensus 167 ~~tG~i~l~~~d~e~~~~y~l~V~a~D~g~p~~~s~~t~~v~I~v~ 212 (325)
...+.+.+..........|.+++.+++.+. .++ .++..|.+.
T Consensus 145 ~~~~~l~i~~~~l~~~~~y~f~ltv~k~~r--~s~--s~~~~v~v~ 186 (440)
T PF02010_consen 145 SSSSSLTIPASTLSPGSTYTFTLTVSKGSR--SSS--SASQTVTVV 186 (440)
T ss_dssp ----------------------------------------------
T ss_pred CCCEEEEEEhHHcCCCceEEEEEEEEeCCC--Cce--eeEEEEEec
Confidence 444555554443333444999999998773 133 455555553
No 41
>cd00146 PKD polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.
Probab=28.64 E-value=1.4e+02 Score=20.67 Aligned_cols=19 Identities=21% Similarity=0.426 Sum_probs=15.9
Q ss_pred CcccccEEEEEEEEEeCCC
Q psy2920 70 DREEVAQYSLVITATDVKG 88 (325)
Q Consensus 70 DrE~~~~~~l~v~a~D~~~ 88 (325)
.|.....|.+++.++|..+
T Consensus 52 ~y~~~G~y~v~l~v~d~~g 70 (81)
T cd00146 52 TYTKPGTYTVTLTVTNAVG 70 (81)
T ss_pred EcCCCcEEEEEEEEEeCCC
Confidence 4677889999999999854
No 42
>cd02848 Chitinase_N_term Chitinase N-terminus domain. Chitinases hydrolyze the abundant natural biopolymer chitin, producing smaller chito-oligosaccharides. Chitin consists of multiple N-acetyl-D-glucosamine (NAG) residues connected via beta-1,4-glycosidic linkages and is an important structural element of fungal cell wall and arthropod exoskeletons. On the basis of the mode of chitin hydrolysis, chitinases are classified as random, endo-, and exo-chitinases and based on sequence criteria, chitinases belong to families 18 and 19 of glycosyl hydrolases. The N-terminus of chitinase may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitob
Probab=26.86 E-value=1.8e+02 Score=22.29 Aligned_cols=34 Identities=12% Similarity=0.214 Sum_probs=22.5
Q ss_pred cCCcccccEEEEEEEEEeCCCCCeeEEEEEEEEec
Q psy2920 68 DIDREEVAQYSLVITATDVKGINKAETHVTVDVLD 102 (325)
Q Consensus 68 ~LDrE~~~~~~l~v~a~D~~~~~~~~~~v~v~V~D 102 (325)
.+++..-..|.++|.++|..|.. .+..+.|.|-|
T Consensus 73 t~~v~kgG~y~m~V~lCn~dGCS-~S~~~~I~VAD 106 (106)
T cd02848 73 TFKVGKGGRYQMQVALCNGDGCS-TSAAKEIVVAD 106 (106)
T ss_pred EEEeCCCCeEEEEEEEECCCCcc-CcCCEEEEecC
Confidence 34566677899999999987754 22344454443
No 43
>KOG3513|consensus
Probab=26.53 E-value=8.8e+02 Score=26.55 Aligned_cols=121 Identities=20% Similarity=0.177 Sum_probs=62.0
Q ss_pred cccccEEEEEEEEEeCCCCCeeEEEEEEEEecCCCCCCccCcceeEEEEeccCCCCCCCcccccceeEEEEEEEE-eCCC
Q psy2920 71 REEVAQYSLVITATDVKGINKAETHVTVDVLDENDNSPVFSQKVYRFIVGDLKSSLNSNVSHWKKFSSIGSVSAT-DADG 149 (325)
Q Consensus 71 rE~~~~~~l~v~a~D~~~~~~~~~~v~v~V~DvNdn~P~f~~~~~~~~v~E~~~~g~~~~~~~~~~~~v~~v~A~-D~D~ 149 (325)
++....|+. .|...-| ++..+..+.|.| ++++....-...+ ..|. ....+-.|. |+-.
T Consensus 488 ~~DaG~YtC--~A~N~~G--~a~~~~~L~Vkd----~tri~~~P~~~~v----~~g~---------~v~l~Ce~shD~~l 546 (1051)
T KOG3513|consen 488 RSDAGKYTC--VAENKLG--KAESTGNLIVKD----ATRITLAPSNTDV----KVGE---------SVTLTCEASHDPSL 546 (1051)
T ss_pred cccCcEEEE--EEEcccC--ccceEEEEEEec----CceEEeccchhhh----ccCc---------eEEEEeecccCCCc
Confidence 555666654 4444323 455556666664 4555443222222 2220 334444444 4443
Q ss_pred C-eEEEEEec-------CCCcEEEe--CCceEEEEccCCCCCCceEEEEEEEEeCCCCCCcceecEEEEEEEecCCCCCc
Q psy2920 150 D-KVAYKLAV-------PSNLVVIV--PQTGELLLTGDPPSDDVEYELTVEAHDLQSPSRYAREPAQVWLQFNSPVLQDE 219 (325)
Q Consensus 150 ~-~v~Y~i~~-------~~~~F~i~--~~tG~i~l~~~d~e~~~~y~l~V~a~D~g~p~~~s~~t~~v~I~v~d~~~~P~ 219 (325)
. .+.+++.+ ....|.++ ..+|.++....+.+..+.|...++.. .-..++ .+.+.| ..+-.+|+
T Consensus 547 d~~f~W~~nG~~id~~~~~~~~~~~~~~~~g~L~i~nv~l~~~G~Y~C~aqT~---~Ds~s~--~A~l~V--~gpPgpP~ 619 (1051)
T KOG3513|consen 547 DITFTWKKNGRPIDFNPDGDHFEINDGSDSGRLTIANVSLEDSGKYTCVAQTA---LDSASA--RADLLV--RGPPGPPP 619 (1051)
T ss_pred ceEEEEEECCEEhhccCCCCceEEeCCcCccceEEEeeccccCceEEEEEEEe---ecchhc--ccceEE--ecCCCCCC
Confidence 3 34444432 13566664 33477777666788889999998873 111222 445544 45656665
No 44
>cd05760 Ig2_PTK7 Second immunoglobulin (Ig)-like domain of protein tyrosine kinase (PTK) 7, also known as CCK4. Ig2_PTK7: domain similar to the second immunoglobulin (Ig)-like domain in protein tyrosine kinase (PTK) 7, also known as CCK4. PTK7 is a subfamily of the receptor protein tyrosine kinase family, and is referred to as an RPTK-like molecule. RPTKs transduce extracellular signals across the cell membrane, and play important roles in regulating cell proliferation, migration, and differentiation. PTK7 is organized as an extracellular portion having seven Ig-like domains, a single transmembrane region, and a cytoplasmic tyrosine kinase-like domain. PTK7 is considered a pseudokinase as it has several unusual residues in some of the highly conserved tyrosine kinase (TK) motifs; it is predicted to lack TK activity. PTK7 may function as a cell-adhesion molecule. PTK7 mRNA is expressed at high levels in placenta, melanocytes, liver, lung, pancreas, and kidney. PTK7 is overexpressed in s
Probab=21.57 E-value=2.8e+02 Score=19.03 Aligned_cols=47 Identities=15% Similarity=0.132 Sum_probs=27.1
Q ss_pred cEEEeCCceEEEEccCCCCCCceEEEEEEEEeCCCCCCcceecEEEEEEEec
Q psy2920 162 LVVIVPQTGELLLTGDPPSDDVEYELTVEAHDLQSPSRYAREPAQVWLQFNS 213 (325)
Q Consensus 162 ~F~i~~~tG~i~l~~~d~e~~~~y~l~V~a~D~g~p~~~s~~t~~v~I~v~d 213 (325)
.+.+....+.|.+.....++...|.... .+..+.. .+ .+.++|.|.|
T Consensus 30 ~~~~~~~~~~L~I~~~~~~D~G~Y~C~a--~N~~G~~-~s--~~~~~l~v~d 76 (77)
T cd05760 30 NYSVSSKERTLTLRSAGPDDSGLYYCCA--HNAFGSV-CS--SQNFTLSIID 76 (77)
T ss_pred cEEEeCCCCEEEEeeCCcccCEEEEEEE--EeCCCeE-ee--CceEEEEEec
Confidence 3445444556666555667788898865 5544321 12 4566776654
No 45
>KOG0196|consensus
Probab=21.00 E-value=1e+03 Score=25.38 Aligned_cols=38 Identities=21% Similarity=0.091 Sum_probs=23.8
Q ss_pred CCCCCceEEEEEEEEeCCC---CCCcceecEEEEEEEecCCCCCc
Q psy2920 178 PPSDDVEYELTVEAHDLQS---PSRYAREPAQVWLQFNSPVLQDE 219 (325)
Q Consensus 178 d~e~~~~y~l~V~a~D~g~---p~~~s~~t~~v~I~v~d~~~~P~ 219 (325)
+.+....|++.|+|.++-. |... ..+.|.|+. +...|+
T Consensus 404 ~L~ah~~YTFeV~AvNgVS~lsp~~~--~~a~vnItt--~qa~ps 444 (996)
T KOG0196|consen 404 DLLAHTNYTFEVEAVNGVSDLSPFPR--QFASVNITT--NQAAPS 444 (996)
T ss_pred ccccccccEEEEEEeecccccCCCCC--cceeEEeec--cccCCC
Confidence 6677889999999998643 2222 245666654 444443
Done!