Query psy8640
Match_columns 303
No_of_seqs 200 out of 1806
Neff 8.1
Searched_HMMs 46136
Date Sat Aug 17 00:01:23 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy8640.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/8640hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG4289|consensus 100.0 2.3E-40 5E-45 331.3 27.1 266 15-303 277-609 (2531)
2 KOG4289|consensus 100.0 3.4E-39 7.5E-44 322.9 19.6 269 14-303 171-507 (2531)
3 KOG1219|consensus 100.0 3.2E-34 7E-39 295.2 28.7 269 11-303 644-1087(4289)
4 KOG1219|consensus 100.0 1.2E-32 2.7E-37 283.6 28.0 266 14-303 2583-2912(4289)
5 cd00031 CA Cadherin repeat dom 100.0 5E-28 1.1E-32 208.2 27.1 194 12-265 2-199 (199)
6 cd00031 CA Cadherin repeat dom 99.9 2.9E-21 6.2E-26 166.0 18.9 128 125-303 1-131 (199)
7 PF00028 Cadherin: Cadherin do 99.8 5.4E-17 1.2E-21 123.1 15.0 90 126-264 1-93 (93)
8 KOG1834|consensus 99.7 4.5E-16 9.8E-21 149.0 18.8 151 59-266 84-245 (952)
9 smart00112 CA Cadherin repeats 99.6 1.9E-15 4.2E-20 111.0 10.6 77 146-271 1-79 (79)
10 smart00112 CA Cadherin repeats 99.5 2.4E-13 5.2E-18 99.8 9.8 58 57-119 22-79 (79)
11 PF00028 Cadherin: Cadherin do 99.3 5.3E-11 1.2E-15 90.0 11.9 89 13-112 2-93 (93)
12 KOG1834|consensus 99.2 2.5E-10 5.3E-15 110.1 12.6 142 109-302 21-176 (952)
13 PF08758 Cadherin_pro: Cadheri 97.6 0.00067 1.4E-08 51.0 9.5 66 118-199 3-68 (90)
14 PF08266 Cadherin_2: Cadherin- 97.3 0.00019 4E-09 53.3 3.1 63 126-198 3-67 (84)
15 smart00736 CADG Dystroglycan-t 95.9 0.26 5.7E-06 37.2 11.6 73 145-268 24-96 (97)
16 PF08266 Cadherin_2: Cadherin- 95.4 0.015 3.3E-07 43.1 3.1 65 12-79 3-67 (84)
17 TIGR01965 VCBS_repeat VCBS rep 95.1 0.22 4.7E-06 38.0 8.6 30 241-275 59-88 (99)
18 PF13750 Big_3_3: Bacterial Ig 93.3 4.2 9.1E-05 33.8 15.5 29 237-265 121-149 (158)
19 smart00736 CADG Dystroglycan-t 92.5 2 4.3E-05 32.3 9.6 52 56-116 45-96 (97)
20 TIGR03660 T1SS_rpt_143 T1SS-14 70.7 24 0.00053 28.5 7.3 46 241-291 87-133 (137)
21 PF12245 Big_3_2: Bacterial Ig 67.8 20 0.00043 24.4 5.4 31 238-268 22-52 (60)
22 TIGR00845 caca sodium/calcium 66.3 1.4E+02 0.0029 32.1 13.3 50 114-165 395-444 (928)
23 cd00146 PKD polycystic kidney 63.7 15 0.00032 26.1 4.3 28 236-263 54-81 (81)
24 PF05345 He_PIG: Putative Ig d 61.2 32 0.0007 22.4 5.2 38 54-93 11-48 (49)
25 smart00089 PKD Repeats in poly 60.2 20 0.00043 25.2 4.5 27 236-263 52-78 (79)
26 PF07495 Y_Y_Y: Y_Y_Y domain; 58.7 22 0.00048 24.1 4.4 26 239-264 39-66 (66)
27 TIGR03660 T1SS_rpt_143 T1SS-14 58.3 55 0.0012 26.5 7.1 67 56-133 55-127 (137)
28 PF08758 Cadherin_pro: Cadheri 55.5 56 0.0012 24.4 6.3 35 57-94 46-80 (90)
29 cd02848 Chitinase_N_term Chiti 51.3 20 0.00044 27.7 3.3 33 233-265 74-106 (106)
30 PF08329 ChitinaseA_N: Chitina 49.9 34 0.00073 27.6 4.5 37 232-268 76-112 (133)
31 KOG3597|consensus 46.4 2.9E+02 0.0063 27.0 11.1 60 104-164 25-84 (442)
32 cd02848 Chitinase_N_term Chiti 41.6 1.2E+02 0.0025 23.5 6.2 34 73-113 73-106 (106)
33 PF03160 Calx-beta: Calx-beta 35.4 1.8E+02 0.0039 21.3 10.4 53 108-164 2-54 (100)
34 KOG4221|consensus 33.1 7.3E+02 0.016 27.7 14.5 74 23-120 547-620 (1381)
35 PF05895 DUF859: Siphovirus pr 32.2 5.7E+02 0.012 26.2 13.3 32 83-121 300-331 (624)
36 TIGR00845 caca sodium/calcium 27.9 8E+02 0.017 26.5 19.1 57 105-165 515-571 (928)
37 PF00778 DIX: DIX domain; Int 26.9 1.7E+02 0.0036 21.6 4.7 42 241-300 43-84 (84)
38 PF13754 Big_3_4: Bacterial Ig 26.3 92 0.002 20.5 3.1 15 239-253 24-38 (54)
39 PF13750 Big_3_3: Bacterial Ig 26.0 3.7E+02 0.008 22.1 7.3 36 239-275 15-51 (158)
No 1
>KOG4289|consensus
Probab=100.00 E-value=2.3e-40 Score=331.29 Aligned_cols=266 Identities=24% Similarity=0.354 Sum_probs=228.2
Q ss_pred EeeccccccccceEEEEEEe--cccCceeeeecccccccCCCCcccEEEeCCccEEEEcccCCCCCCCceEEEEEEEEEC
Q psy8640 15 LIMRRGVVHRENIQVYLVWY--GSTSRLCRRTCGTIPLSQQQRLAIQAVDPLTGVLSVQQPLDYDTGSSEHLLILKVEDG 92 (303)
Q Consensus 15 ~i~e~~~~~~~~~~~~~i~~--~~~~~~~~~~~~~~~~~~~~~~~~F~Id~~tG~l~~~~~LD~E~~~~~~~l~i~a~D~ 92 (303)
.+-||..+|.++.+..+-.- ..++.+.|+. ..++..+.|.||+++|.|.+..+||||+.+. |.|.+.|+|.
T Consensus 277 ~lREn~evGy~vLtvrAtD~Dsp~Nani~Yrl------~eg~~~~~f~in~rSGvI~T~a~lDRE~~~~-y~L~VeAsDq 349 (2531)
T KOG4289|consen 277 ELRENLEVGYEVLTVRATDGDSPPNANIRYRL------LEGNAKNVFEINPRSGVISTRAPLDREELES-YQLDVEASDQ 349 (2531)
T ss_pred HHhhccccCceEEEEEeccCCCCCCCceEEEe------cCCCccceeEEcCccceeeccCccCHHhhhh-eEEEEEeccC
Confidence 45677778888777766443 5677777754 3446778999999999999999999999999 9999999999
Q ss_pred CCCCCCCCCeeEEEEEEEEEcCCCCCCcCCCCceEEEEeCCCCCCeEEEEEEEEeCCCCCCceEEEEEeeCCCCCCcccC
Q psy8640 93 GKPPLDSPLSSITHLKITLQDENDNAPKFPITEYLEFVGENEPIGSSVFTARATDMDKGDYGKLNYSIISAAASGYTDVD 172 (303)
Q Consensus 93 g~~~~~~~~s~~~~v~I~V~DvNDn~P~f~~~~~~~~V~E~~~~gt~v~~v~A~D~D~g~n~~i~Y~i~~~~~~~~~~~~ 172 (303)
|. ++...++.|.|.|.|+|||+|+|....|.+.|+|+..+++.|++|+|+|.|.|.|+++.|+|.+++..
T Consensus 350 G~----~pgp~Ta~V~itV~D~NDNaPqFse~~Yvvqv~Edvt~~avvlrV~AtDrD~g~Ng~VHYsi~Sgn~~------ 419 (2531)
T KOG4289|consen 350 GR----PPGPRTAMVEITVEDENDNAPQFSEKRYVVQVREDVTPPAVVLRVTATDRDKGTNGKVHYSIASGNGR------ 419 (2531)
T ss_pred CC----CCCCceEEEEEEEEecCCCCccccccceEEEecccCCCCceEEEEEecccCCCcCceEEEEeeccCcc------
Confidence 98 44455999999999999999999999999999999999999999999999999999999999998754
Q ss_pred CCcCcEEEECCCcEEEEcccCcccccceEEEEEEEEE------------EEeecccccCCC----------------ceE
Q psy8640 173 DSWKLFRVDALTGLVTTNAVFDYEARSRYAFTLLSTY------------SVWTRSLVWSRP----------------MQC 224 (303)
Q Consensus 173 ~~~~~F~Id~~tG~I~~~~~LD~e~~~~~~l~v~a~~------------~~~~~~~~~~~~----------------~~~ 224 (303)
+.|.||..||+|.+..+||||.. .|++.|+|.+ .+..-.++.+.| ...
T Consensus 420 ---G~f~id~~tGel~vv~plD~e~~-~ytl~IrAqDggrPpLsn~sgl~iqVlDINDhaPifvstpfq~tvlEnv~lg~ 495 (2531)
T KOG4289|consen 420 ---GQFYIDSLTGELDVVEPLDFENS-EYTLRIRAQDGGRPPLSNTSGLVIQVLDINDHAPIFVSTPFQATVLENVPLGY 495 (2531)
T ss_pred ---ccEEEecccceEEEeccccccCC-eeEEEEEcccCCCCCccCCCceEEEEEecCCCCceeEechhhhhhhhcccccc
Confidence 88999999999999999999998 8999999862 123333333333 222
Q ss_pred EE----------------------------------eecCceeeeeeecEEEEEEEEEECCCC--eEEEEEEEEEeecCC
Q psy8640 225 LI----------------------------------MRRGVVFDYEARSRYAFTLLSTDSGGR--TCKVKVRVEIESRDE 268 (303)
Q Consensus 225 ~i----------------------------------~~~~~~lD~E~~~~y~l~V~a~D~g~~--s~~~~v~I~V~DvNd 268 (303)
.+ +...+.||||+.+.|.|.|.|+|.|.+ ++++.|.|.|+|+||
T Consensus 496 ~v~~vqaidadsg~na~l~y~laG~~pf~I~~~SG~Itvtk~ldrEt~~~ysl~V~ard~gtp~l~tstsI~Vtv~dvnd 575 (2531)
T KOG4289|consen 496 LVCHVQAIDADSGENARLHYSLAGVGPFQINNGSGWITVTKELDRETVEHYSLGVEARDHGTPPLSTSTSISVTVLDVND 575 (2531)
T ss_pred eEEEEecccCCCCcccceeeeeccCCCeeEecCCceEEEeecccccccceEEEEEEEcCCCCCcccccceEEEEecccCC
Confidence 11 122367999999999999999999986 778899999999999
Q ss_pred CCCeeCCCeeEEE-ecCCCCCCCcEEEEEEEEeCCC
Q psy8640 269 FHPQFTERTFKFI-LSGTDLPVGYVVGHLRLVRFRS 303 (303)
Q Consensus 269 n~P~F~~~~y~~~-~e~~~~p~gt~v~~v~A~D~D~ 303 (303)
|.|+|+++.|++. .|++ |.|+.|.+|+|+|.|.
T Consensus 576 ndP~Ft~~eytl~inED~--pvgsSI~tvtAvD~d~ 609 (2531)
T KOG4289|consen 576 NDPTFTQKEYTLRINEDA--PVGSSIVTVTAVDRDA 609 (2531)
T ss_pred CCCccccCceEEEecCCc--cccceEEEEEEecccc
Confidence 9999999999999 9999 7999999999999983
No 2
>KOG4289|consensus
Probab=100.00 E-value=3.4e-39 Score=322.91 Aligned_cols=269 Identities=23% Similarity=0.373 Sum_probs=233.7
Q ss_pred eEeeccccccccceEEEEEEecccCceeeeecccccccCCCCcccEEEeCCccEEEEcccCCCCCCCceEEEEEEEEECC
Q psy8640 14 CLIMRRGVVHRENIQVYLVWYGSTSRLCRRTCGTIPLSQQQRLAIQAVDPLTGVLSVQQPLDYDTGSSEHLLILKVEDGG 93 (303)
Q Consensus 14 ~~i~e~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~F~Id~~tG~l~~~~~LD~E~~~~~~~l~i~a~D~g 93 (303)
..++||.++++-..+..+.. -+++++.|...+ ..+....++|+||+.+|.|++++.||||+++. +.|.+.|.|.+
T Consensus 171 ~~lpEn~pagT~iasv~A~~-~~a~rl~Ysm~a---l~dsRS~~lFslD~~sG~irta~~lDREt~e~-HvlrVtA~d~~ 245 (2531)
T KOG4289|consen 171 KELPENEPAGTIIASVKASD-PDAGRLYYSMVA---LFDSRSQNLFSLDPMSGAIRTAKSLDRETKET-HVLRVTAQDHG 245 (2531)
T ss_pred ccCcCCCCCCceeEEEEecC-CCcCceEEEeee---ccchhccccEeeccccccchhhhhhhhhhhhe-eEEEEEeeecC
Confidence 68999999999988777654 448888886544 44556678999999999999999999999999 99999999999
Q ss_pred CCCCCCCCeeEEEEEEEEEcCCCCCCcCCCCceEEEEeCCCCCCeEEEEEEEEeCCCCCCceEEEEEeeCCCCCCcccCC
Q psy8640 94 KPPLDSPLSSITHLKITLQDENDNAPKFPITEYLEFVGENEPIGSSVFTARATDMDKGDYGKLNYSIISAAASGYTDVDD 173 (303)
Q Consensus 94 ~~~~~~~~s~~~~v~I~V~DvNDn~P~f~~~~~~~~V~E~~~~gt~v~~v~A~D~D~g~n~~i~Y~i~~~~~~~~~~~~~ 173 (303)
. |.++++++++|.|.|.|||.|+|.+..|..+++||.++|+.|++|+|+|.|.++|+.|+|++.++++.
T Consensus 246 ~----P~~SAtttv~V~V~D~nDhsPvFEq~~Y~e~lREn~evGy~vLtvrAtD~Dsp~Nani~Yrl~eg~~~------- 314 (2531)
T KOG4289|consen 246 D----PRRSATTTVTVLVLDTNDHSPVFEQDEYREELRENLEVGYEVLTVRATDGDSPPNANIRYRLLEGNAK------- 314 (2531)
T ss_pred C----CcccceeEEEEEEeecCCCCcccchhHHHHHHhhccccCceEEEEEeccCCCCCCCceEEEecCCCcc-------
Confidence 8 89999999999999999999999999999999999999999999999999999999999999998654
Q ss_pred CcCcEEEECCCcEEEEcccCcccccceEEEEEEEEE------------EEeecccccCCC--------------------
Q psy8640 174 SWKLFRVDALTGLVTTNAVFDYEARSRYAFTLLSTY------------SVWTRSLVWSRP-------------------- 221 (303)
Q Consensus 174 ~~~~F~Id~~tG~I~~~~~LD~e~~~~~~l~v~a~~------------~~~~~~~~~~~~-------------------- 221 (303)
..|.||+.+|.|.+.++||+|+...|.|.|+|++ .+.....+.|.|
T Consensus 315 --~~f~in~rSGvI~T~a~lDRE~~~~y~L~VeAsDqG~~pgp~Ta~V~itV~D~NDNaPqFse~~Yvvqv~Edvt~~av 392 (2531)
T KOG4289|consen 315 --NVFEINPRSGVISTRAPLDREELESYQLDVEASDQGRPPGPRTAMVEITVEDENDNAPQFSEKRYVVQVREDVTPPAV 392 (2531)
T ss_pred --ceeEEcCccceeeccCccCHHhhhheEEEEEeccCCCCCCCceEEEEEEEEecCCCCccccccceEEEecccCCCCce
Confidence 8999999999999999999999999999999972 122222233333
Q ss_pred ----------------ceEEEeec------------C-----ceeeeeeecEEEEEEEEEECCCC--eEEEEEEEEEeec
Q psy8640 222 ----------------MQCLIMRR------------G-----VVFDYEARSRYAFTLLSTDSGGR--TCKVKVRVEIESR 266 (303)
Q Consensus 222 ----------------~~~~i~~~------------~-----~~lD~E~~~~y~l~V~a~D~g~~--s~~~~v~I~V~Dv 266 (303)
+.+.|++. | .+||+|.. .|.+.|+|+|.|.| +.+.-+.|+|+|+
T Consensus 393 vlrV~AtDrD~g~Ng~VHYsi~Sgn~~G~f~id~~tGel~vv~plD~e~~-~ytl~IrAqDggrPpLsn~sgl~iqVlDI 471 (2531)
T KOG4289|consen 393 VLRVTATDRDKGTNGKVHYSIASGNGRGQFYIDSLTGELDVVEPLDFENS-EYTLRIRAQDGGRPPLSNTSGLVIQVLDI 471 (2531)
T ss_pred EEEEEecccCCCcCceEEEEeeccCccccEEEecccceEEEeccccccCC-eeEEEEEcccCCCCCccCCCceEEEEEec
Confidence 22333322 1 36999999 99999999997774 6677777999999
Q ss_pred CCCCCeeCCCeeEEE-ecCCCCCCCcEEEEEEEEeCCC
Q psy8640 267 DEFHPQFTERTFKFI-LSGTDLPVGYVVGHLRLVRFRS 303 (303)
Q Consensus 267 Ndn~P~F~~~~y~~~-~e~~~~p~gt~v~~v~A~D~D~ 303 (303)
|||+|+|....+.+. .|++ |.|..++.+.|.|+|+
T Consensus 472 NDhaPifvstpfq~tvlEnv--~lg~~v~~vqaidads 507 (2531)
T KOG4289|consen 472 NDHAPIFVSTPFQATVLENV--PLGYLVCHVQAIDADS 507 (2531)
T ss_pred CCCCceeEechhhhhhhhcc--cccceEEEEecccCCC
Confidence 999999999999999 9999 7999999999999996
No 3
>KOG1219|consensus
Probab=100.00 E-value=3.2e-34 Score=295.17 Aligned_cols=269 Identities=22% Similarity=0.310 Sum_probs=229.8
Q ss_pred cceeEeeccccccccceEEEEEEe--cccCceeeeecccccccCCCCcccEEEeCCccEEEEcccCCCCCCCceEEEEEE
Q psy8640 11 PMQCLIMRRGVVHRENIQVYLVWY--GSTSRLCRRTCGTIPLSQQQRLAIQAVDPLTGVLSVQQPLDYDTGSSEHLLILK 88 (303)
Q Consensus 11 p~~~~i~e~~~~~~~~~~~~~i~~--~~~~~~~~~~~~~~~~~~~~~~~~F~Id~~tG~l~~~~~LD~E~~~~~~~l~i~ 88 (303)
|-...++|+.++|+..+.-.+-.- |.+|.+.| -+.+++....|.|+.++|.|+++++||+|..++ |.|.++
T Consensus 644 P~iI~v~Edvpigt~la~L~atD~Dtgfng~l~y------vI~dgne~~~~~Id~qsg~itvas~ld~~~t~~-yiLnvt 716 (4289)
T KOG1219|consen 644 PFIIVVPEDVPIGTTLAILSATDSDTGFNGKLVY------VIEDGNESICFLIDRQSGNITVASPLDNENTEQ-YILNVT 716 (4289)
T ss_pred CceEEccccCCCCceEEEEeccCCCCCcCceEEE------EEeCCccceEEEEecccceEEEecchhhhhhhe-eEEEEE
Confidence 445678888888888665444322 66677777 577888899999999999999999999999999 999999
Q ss_pred EEECCCCCCCCCCeeEEEEEEEEEcCCCCCCcCCCCceEEEEeCCCCCCeEEEEEEEEeCCCCCCceEEEEEeeCCCCCC
Q psy8640 89 VEDGGKPPLDSPLSSITHLKITLQDENDNAPKFPITEYLEFVGENEPIGSSVFTARATDMDKGDYGKLNYSIISAAASGY 168 (303)
Q Consensus 89 a~D~g~~~~~~~~s~~~~v~I~V~DvNDn~P~f~~~~~~~~V~E~~~~gt~v~~v~A~D~D~g~n~~i~Y~i~~~~~~~~ 168 (303)
|.|.|. |..+++..+.|.|.|.|||+|.|.+..|.++|.|+..+|+.++.|.|.|.|.|.||+++|+|....
T Consensus 717 a~D~gt----Pqkss~r~l~v~vkd~ndn~p~f~e~sy~vtvsedtepgs~Ia~vetnd~D~g~NG~v~fsL~n~s---- 788 (4289)
T KOG1219|consen 717 AYDLGT----PQKSSWRLLLVFVKDYNDNTPIFVERSYHVTVSEDTEPGSFIAHVETNDTDGGNNGMVSFSLLNKS---- 788 (4289)
T ss_pred EecCCC----chhhceeeEEEEEEecccCCccccccceEEEEecCCCCCceEEEEEecccCCCCCceEEEEecCCc----
Confidence 999998 888999999999999999999999999999999999999999999999999999999999999865
Q ss_pred cccCCCcCcEEEECCCcEEEEcccCcccccceEEEEEEEEEE------------Eeec----------------------
Q psy8640 169 TDVDDSWKLFRVDALTGLVTTNAVFDYEARSRYAFTLLSTYS------------VWTR---------------------- 214 (303)
Q Consensus 169 ~~~~~~~~~F~Id~~tG~I~~~~~LD~e~~~~~~l~v~a~~~------------~~~~---------------------- 214 (303)
+.|+||+.||.+.+.++||+|.+..|+|.++|.+. +...
T Consensus 789 -------dvfsIdp~tGivv~~~sLdrE~q~~y~l~I~a~dqp~pq~~svv~l~vsvedVndnpPkci~~hsr~kipedl 861 (4289)
T KOG1219|consen 789 -------DVFSIDPFTGIVVTSKSLDREGQTSYHLKIEARDQPPPQLFSVVELDVSVEDVNDNPPKCIIRHSRSKIPEDL 861 (4289)
T ss_pred -------ceEEecCcccEEEeccccCcccCceeEEEEEEcCCCCCceEEEEEEEEEEeeccCCCCccccccccccCcccC
Confidence 78999999999999999999999999999998610 0000
Q ss_pred --------------------------------------------------------------------------------
Q psy8640 215 -------------------------------------------------------------------------------- 214 (303)
Q Consensus 215 -------------------------------------------------------------------------------- 214 (303)
T Consensus 862 p~gt~~~~l~A~d~diGq~~kvry~l~~~~v~~rvd~~sGavfi~~~LDf~k~~fynLsv~a~d~g~p~lss~chl~Vev 941 (4289)
T KOG1219|consen 862 PYGTVTWQLVALDPDIGQLGKVRYYLTDDTVGERVDFPSGAVFIGKPLDFEKSDFYNLSVTAVDRGTPILSSICHLEVEV 941 (4289)
T ss_pred CCceEEEEhhhcCcccCcCceeEEEEecCccccccccccccEEEecccccccccceEEEEEEecCCCcceeeeEEEEEEE
Confidence
Q ss_pred -------------------ccccCCCceEEE-------------------------------------eecCceeeeeee
Q psy8640 215 -------------------SLVWSRPMQCLI-------------------------------------MRRGVVFDYEAR 238 (303)
Q Consensus 215 -------------------~~~~~~~~~~~i-------------------------------------~~~~~~lD~E~~ 238 (303)
++.+|.|....+ ++....||||..
T Consensus 942 ldv~enlhpp~F~~~v~e~~V~EnapiGT~vi~i~A~dedsgldg~l~Y~I~~gdg~g~FsId~~tG~irTl~~lDrE~k 1021 (4289)
T KOG1219|consen 942 LDVNENLHPPEFISFVTEGHVLENAPIGTIVIRIQARDEDSGLDGELSYKIRTGDGDGIFSIDSTTGSIRTLKALDREKK 1021 (4289)
T ss_pred eccCCCCCCcchheeeeeeeEeecCCcceEEEEEEEecCCCCccceEEEEEEcCCcceeEEecCCcceEeechhhchhhc
Confidence 011122222211 111246999999
Q ss_pred cEEEEEEEEEECCCC--eEEEEEEEEEeecCCCCCeeCCCeeEEE-ecCCCCCCCcEEEEEEEEeCCC
Q psy8640 239 SRYAFTLLSTDSGGR--TCKVKVRVEIESRDEFHPQFTERTFKFI-LSGTDLPVGYVVGHLRLVRFRS 303 (303)
Q Consensus 239 ~~y~l~V~a~D~g~~--s~~~~v~I~V~DvNdn~P~F~~~~y~~~-~e~~~~p~gt~v~~v~A~D~D~ 303 (303)
..|-|+|.|+|.|.. ++-..+.|.|+|+|||+|+|.++.|... .|++ |.+..|.++.|.|+||
T Consensus 1022 s~YwltveA~D~gt~~~ssv~~vyI~ieDvNDn~Pq~s~pvy~asI~enS--p~~vsivq~ea~D~Ds 1087 (4289)
T KOG1219|consen 1022 SSYWLTVEAKDLGTVPLSSVCEVYIEIEDVNDNVPQFSSPVYYASISENS--PETVSIVQAEANDPDS 1087 (4289)
T ss_pred ceEEEEEEEEecCCCccccceeEEEEEEecCCCCcccCCceEeeeeccCC--CCceEEEEeccCCCCc
Confidence 999999999999983 7788999999999999999999999999 9999 8999999999999995
No 4
>KOG1219|consensus
Probab=100.00 E-value=1.2e-32 Score=283.63 Aligned_cols=266 Identities=23% Similarity=0.270 Sum_probs=221.6
Q ss_pred eEeeccccccccceEEEEEEecccCceeeeecccccccCCCCcccEEEeCCccEEEEcccCCCCCCCceEEEEEEEEECC
Q psy8640 14 CLIMRRGVVHRENIQVYLVWYGSTSRLCRRTCGTIPLSQQQRLAIQAVDPLTGVLSVQQPLDYDTGSSEHLLILKVEDGG 93 (303)
Q Consensus 14 ~~i~e~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~F~Id~~tG~l~~~~~LD~E~~~~~~~l~i~a~D~g 93 (303)
+++||.-++|.+..+..+..-.. ..-|++.-- -.-.+++...|.||++||.|++.++||+|.... |.+.|.|.+++
T Consensus 2583 fsvpEDv~vG~~Ig~v~a~~a~~--~~i~~~v~~-gt~Esn~d~~Fsvdr~TG~i~v~ksLD~E~kk~-yqi~v~a~~~~ 2658 (4289)
T KOG1219|consen 2583 FSVPEDVPVGEEIGQVSASDADE--HVIYSLVLG-GTPESNPDLPFSVDRNTGMIKVNKSLDHEKKKS-YQIKVKATCGQ 2658 (4289)
T ss_pred EeccccCCCCCeeeEEeecccCC--ceEEEEEeC-CCCCCCCCCceEEcCCCceEEeccccchhhhce-EEEEEEeecCC
Confidence 78999999999998888765322 223322100 011334556799999999999999999999999 99999999876
Q ss_pred CCCCCCCCeeEEEEEEEEEcCCCCCCcCCCCceEEEEeCCCCCCeEEEEEEEEeCCCCCCceEEEEEeeCCCCCCcccCC
Q psy8640 94 KPPLDSPLSSITHLKITLQDENDNAPKFPITEYLEFVGENEPIGSSVFTARATDMDKGDYGKLNYSIISAAASGYTDVDD 173 (303)
Q Consensus 94 ~~~~~~~~s~~~~v~I~V~DvNDn~P~f~~~~~~~~V~E~~~~gt~v~~v~A~D~D~g~n~~i~Y~i~~~~~~~~~~~~~ 173 (303)
. .-+.++|.|.|.|+|||+|+|....|.+.+.||+|.|+.|++++|.|+|.|.||+|+|+|....
T Consensus 2659 ~------vva~tsv~vqVkDvNDNaPvFe~d~y~f~i~En~pvGtsV~qf~AsD~Ds~~nGqirysl~~~v--------- 2723 (4289)
T KOG1219|consen 2659 W------VVAETSVFVQVKDVNDNAPVFEKDPYLFIIEENSPVGTSVIQFHASDMDSGNNGQIRYSLTSPV--------- 2723 (4289)
T ss_pred c------eEEEEEEEEEeecccCCCccccCCceeEEEeccCCCCceEEEEEeeccCCCCCceEEEEEcCCc---------
Confidence 4 5688899999999999999999999999999999999999999999999999999999999975
Q ss_pred CcCcEEEECCCcEEEEcccCcccccceEEEEEEEEE----------EEeeccc---------------------------
Q psy8640 174 SWKLFRVDALTGLVTTNAVFDYEARSRYAFTLLSTY----------SVWTRSL--------------------------- 216 (303)
Q Consensus 174 ~~~~F~Id~~tG~I~~~~~LD~e~~~~~~l~v~a~~----------~~~~~~~--------------------------- 216 (303)
.+|.|+++||+|++.+.||+|++..|.|.|.|.+ -+.....
T Consensus 2724 --~yF~In~etGwlTt~~eld~ek~d~y~lkv~AtDhG~~ssq~~v~v~vtDvndspprf~~eiy~gtvv~d~p~~~~ia 2801 (4289)
T KOG1219|consen 2724 --PYFAINPETGWLTTLFELDLEKQDLYSLKVVATDHGVPSSQATVLVHVTDVNDSPPRFQREIYEGTVVEDVPGGKIIA 2801 (4289)
T ss_pred --ceEEEcCCCCeeeehhhhccccCCceEEEEEEecCCcccccceEEEEEEecCCCcchhhhHhhccceeccCCCCceee
Confidence 5899999999999999999999999999999972 1111111
Q ss_pred ---------ccCCCceEEEe-----------ec------CceeeeeeecEEEEEEEEEECCCCeEEEEEEEEEeecCCCC
Q psy8640 217 ---------VWSRPMQCLIM-----------RR------GVVFDYEARSRYAFTLLSTDSGGRTCKVKVRVEIESRDEFH 270 (303)
Q Consensus 217 ---------~~~~~~~~~i~-----------~~------~~~lD~E~~~~y~l~V~a~D~g~~s~~~~v~I~V~DvNdn~ 270 (303)
.+.+-+++.|. .+ -+++|||...+|.|+|.|+| |-..++..|.|.|+|.|||.
T Consensus 2802 ~~si~d~D~s~~nq~t~fI~~gd~~gqF~~i~ne~~~~~kKt~~~E~t~ny~Ltvtatd-g~f~~s~~vkv~v~~s~dn~ 2880 (4289)
T KOG1219|consen 2802 GLSIFDADVSEVNQVTGFITLGDPLGQFWIIENEWIYEFKKTLDRESTKNYLLTVTATD-GIFMNSDNVKVLVLDSNDNS 2880 (4289)
T ss_pred eeEeccccccccceeEEEEeCCCccceEEEEcCcceEEEecchhhhcccceEEEEEEec-cceeccceEEEEeeccccCC
Confidence 11121222221 11 15799999999999999999 88888999999999999999
Q ss_pred CeeCCCeeEEE-ecCCCCCCCcEEEEEEEEeCCC
Q psy8640 271 PQFTERTFKFI-LSGTDLPVGYVVGHLRLVRFRS 303 (303)
Q Consensus 271 P~F~~~~y~~~-~e~~~~p~gt~v~~v~A~D~D~ 303 (303)
|...+..|+.. .|+.- .|..++.+.|+|.|.
T Consensus 2881 ~~c~~~~~t~i~~ed~f--~gk~ilkisal~~dn 2912 (4289)
T KOG1219|consen 2881 PFCGNQLYTKIQCEDVF--PGKQILKISALDVDN 2912 (4289)
T ss_pred ccCcchhccceeccccC--CceeEEEeeeeccCc
Confidence 99999999999 99996 999999999999873
No 5
>cd00031 CA Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here
Probab=99.97 E-value=5e-28 Score=208.15 Aligned_cols=194 Identities=32% Similarity=0.426 Sum_probs=165.9
Q ss_pred ceeEeeccccccccceEEEEEEecc--cCceeeeecccccccCCCCcccEEEeCCccEEEEcccCCCCCCCceEEEEEEE
Q psy8640 12 MQCLIMRRGVVHRENIQVYLVWYGS--TSRLCRRTCGTIPLSQQQRLAIQAVDPLTGVLSVQQPLDYDTGSSEHLLILKV 89 (303)
Q Consensus 12 ~~~~i~e~~~~~~~~~~~~~i~~~~--~~~~~~~~~~~~~~~~~~~~~~F~Id~~tG~l~~~~~LD~E~~~~~~~l~i~a 89 (303)
..+.|.||...+.......+..-.. ++.+.|.. .......+|.|++.+|.|++.+.||||.... |.|.|+|
T Consensus 2 ~~~~i~En~~~g~~v~~~~a~D~D~~~~~~~~y~i------~~~~~~~~F~i~~~tG~l~~~~~lD~e~~~~-~~l~v~a 74 (199)
T cd00031 2 YSVSVPENAPPGTVVGTVSATDPDSGENGRVTYSI------LGGNEDGLFSIDPNTGVITTTKPLDREEQSE-YTLTVVA 74 (199)
T ss_pred eEEEEeCCCCCCCEEEEEEEECCCCCCCceEEEEE------eCCCCcccEEEeCCCCEEEECCCCCCcCCce-EEEEEEE
Confidence 4578899999877766555444322 24455533 3333337999999999999999999999999 9999999
Q ss_pred EECCCCCCCCCCeeEEEEEEEEEcCCCCCCcCCCCceEEEEeCCCCCCeEEEEEEEEeCCCCCCceEEEEEeeCCCCCCc
Q psy8640 90 EDGGKPPLDSPLSSITHLKITLQDENDNAPKFPITEYLEFVGENEPIGSSVFTARATDMDKGDYGKLNYSIISAAASGYT 169 (303)
Q Consensus 90 ~D~g~~~~~~~~s~~~~v~I~V~DvNDn~P~f~~~~~~~~V~E~~~~gt~v~~v~A~D~D~g~n~~i~Y~i~~~~~~~~~ 169 (303)
.|.|. +.++....+.|.|.|+|||+|.|....|.+.|.|+.++|+.++++.|+|+|.+.++.++|+|..+..
T Consensus 75 ~D~g~----~~~~~~~~v~I~V~d~Nd~~P~~~~~~~~~~v~e~~~~~~~i~~~~a~D~D~~~~~~~~y~l~~~~~---- 146 (199)
T cd00031 75 SDGGG----PPLSSTATVTVTVLDVNDNPPVFEQSSYEASVPENAPPGTVVGTVTATDADSGENAKLTYSILSGND---- 146 (199)
T ss_pred EECCc----CcceeEEEEEEEEccCCCCCCcccccceEEEEeCCCCCCCEEEEEEEEcCCCCCCccEEEEEeCCCC----
Confidence 99876 5667999999999999999999998899999999999999999999999999989999999999763
Q ss_pred ccCCCcCcEEEECCCcEEEEcccCcccccceEEEEEEEEEEEeecccccCCCceEEEeecCceeeeeeecEEEEEEEEEE
Q psy8640 170 DVDDSWKLFRVDALTGLVTTNAVFDYEARSRYAFTLLSTYSVWTRSLVWSRPMQCLIMRRGVVFDYEARSRYAFTLLSTD 249 (303)
Q Consensus 170 ~~~~~~~~F~Id~~tG~I~~~~~LD~e~~~~~~l~v~a~~~~~~~~~~~~~~~~~~i~~~~~~lD~E~~~~y~l~V~a~D 249 (303)
.+.|.|++.+|.|++.+.||||.... |.|.|.|+|
T Consensus 147 -----~~~f~i~~~~G~i~~~~~ld~e~~~~----------------------------------------~~l~v~a~D 181 (199)
T cd00031 147 -----KELFSIDPNTGIITLAKPLDREEKSS----------------------------------------YELTVVATD 181 (199)
T ss_pred -----CCEEEEeCCceEEEeCCccCCccCce----------------------------------------EEEEEEEEE
Confidence 26899999999999999999999888 999999999
Q ss_pred CC--CCeEEEEEEEEEee
Q psy8640 250 SG--GRTCKVKVRVEIES 265 (303)
Q Consensus 250 ~g--~~s~~~~v~I~V~D 265 (303)
.+ ..++++.++|.|.|
T Consensus 182 ~~~~~~~~~~~i~i~v~d 199 (199)
T cd00031 182 GGGPPLSSTATVTVTVLD 199 (199)
T ss_pred CCCCCceeEEEEEEEEEC
Confidence 88 56888899998875
No 6
>cd00031 CA Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here
Probab=99.88 E-value=2.9e-21 Score=166.02 Aligned_cols=128 Identities=33% Similarity=0.523 Sum_probs=118.6
Q ss_pred ceEEEEeCCCCCCeEEEEEEEEeCCCCCCceEEEEEeeCCCCCCcccCCCcCcEEEECCCcEEEEcccCcccccceEEEE
Q psy8640 125 EYLEFVGENEPIGSSVFTARATDMDKGDYGKLNYSIISAAASGYTDVDDSWKLFRVDALTGLVTTNAVFDYEARSRYAFT 204 (303)
Q Consensus 125 ~~~~~V~E~~~~gt~v~~v~A~D~D~g~n~~i~Y~i~~~~~~~~~~~~~~~~~F~Id~~tG~I~~~~~LD~e~~~~~~l~ 204 (303)
.|.+.|.|+.+.|+.++++.|.|+|.+.++.+.|+|..+... .+|.|++.+|.|++.+.||||....
T Consensus 1 ~~~~~i~En~~~g~~v~~~~a~D~D~~~~~~~~y~i~~~~~~---------~~F~i~~~tG~l~~~~~lD~e~~~~---- 67 (199)
T cd00031 1 SYSVSVPENAPPGTVVGTVSATDPDSGENGRVTYSILGGNED---------GLFSIDPNTGVITTTKPLDREEQSE---- 67 (199)
T ss_pred CeEEEEeCCCCCCCEEEEEEEECCCCCCCceEEEEEeCCCCc---------ccEEEeCCCCEEEECCCCCCcCCce----
Confidence 378999999999999999999999999889999999997632 6899999999999999999999988
Q ss_pred EEEEEEEeecccccCCCceEEEeecCceeeeeeecEEEEEEEEEECCCCe--EEEEEEEEEeecCCCCCeeCCCeeEEE-
Q psy8640 205 LLSTYSVWTRSLVWSRPMQCLIMRRGVVFDYEARSRYAFTLLSTDSGGRT--CKVKVRVEIESRDEFHPQFTERTFKFI- 281 (303)
Q Consensus 205 v~a~~~~~~~~~~~~~~~~~~i~~~~~~lD~E~~~~y~l~V~a~D~g~~s--~~~~v~I~V~DvNdn~P~F~~~~y~~~- 281 (303)
|.|.|.|+|.|.+. +...++|.|.|+|||+|.|....|.+.
T Consensus 68 ------------------------------------~~l~v~a~D~g~~~~~~~~~v~I~V~d~Nd~~P~~~~~~~~~~v 111 (199)
T cd00031 68 ------------------------------------YTLTVVASDGGGPPLSSTATVTVTVLDVNDNPPVFEQSSYEASV 111 (199)
T ss_pred ------------------------------------EEEEEEEEECCcCcceeEEEEEEEEccCCCCCCcccccceEEEE
Confidence 99999999987765 899999999999999999999999999
Q ss_pred ecCCCCCCCcEEEEEEEEeCCC
Q psy8640 282 LSGTDLPVGYVVGHLRLVRFRS 303 (303)
Q Consensus 282 ~e~~~~p~gt~v~~v~A~D~D~ 303 (303)
.|+. |.|+.++++.|+|+|.
T Consensus 112 ~e~~--~~~~~i~~~~a~D~D~ 131 (199)
T cd00031 112 PENA--PPGTVVGTVTATDADS 131 (199)
T ss_pred eCCC--CCCCEEEEEEEEcCCC
Confidence 9998 5999999999999994
No 7
>PF00028 Cadherin: Cadherin domain; InterPro: IPR002126 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion.; GO: 0005509 calcium ion binding, 0007156 homophilic cell adhesion, 0016020 membrane; PDB: 2A4E_A 2A4C_B 2O72_A 2QVI_A 1NCJ_A 3Q2W_A 3Q2N_A 3LNH_B 3LNI_A 3Q2L_A ....
Probab=99.75 E-value=5.4e-17 Score=123.14 Aligned_cols=90 Identities=33% Similarity=0.488 Sum_probs=82.6
Q ss_pred eEEEEeCCCCCCeEEEEEEEEeCCCCCCceEEEEEeeCCCCCCcccCCCcCcEEEECCCcEEEEcccCcccccceEEEEE
Q psy8640 126 YLEFVGENEPIGSSVFTARATDMDKGDYGKLNYSIISAAASGYTDVDDSWKLFRVDALTGLVTTNAVFDYEARSRYAFTL 205 (303)
Q Consensus 126 ~~~~V~E~~~~gt~v~~v~A~D~D~g~n~~i~Y~i~~~~~~~~~~~~~~~~~F~Id~~tG~I~~~~~LD~e~~~~~~l~v 205 (303)
|++.|+|++++|+.++++.|.|+|.+.|+.+.|+|..+.+. +.|.|++.+|.|++.++||||....
T Consensus 1 Y~~~v~E~~~~g~~v~~v~a~D~D~~~n~~i~y~i~~~~~~---------~~F~I~~~tg~i~~~~~LD~E~~~~----- 66 (93)
T PF00028_consen 1 YSFSVPENAPPGTVVGQVTATDPDSGPNSQITYSILGGNPD---------GLFSIDPNTGEISLKKPLDRETQSS----- 66 (93)
T ss_dssp EEEEEETTGSTSSEEEEEEEEESSTSTTSSEEEEEEETTST---------TSEEEETTTTEEEESSSSCTTTTSE-----
T ss_pred CEEEEECCCCCCCEEEEEEEEeCCCCCCceEEEEEecCccc---------CceEEeeeeeccccceecCcccCCE-----
Confidence 78999999999999999999999999999999999998753 8899999999999999999999999
Q ss_pred EEEEEEeecccccCCCceEEEeecCceeeeeeecEEEEEEEEEEC-CCC--eEEEEEEEEEe
Q psy8640 206 LSTYSVWTRSLVWSRPMQCLIMRRGVVFDYEARSRYAFTLLSTDS-GGR--TCKVKVRVEIE 264 (303)
Q Consensus 206 ~a~~~~~~~~~~~~~~~~~~i~~~~~~lD~E~~~~y~l~V~a~D~-g~~--s~~~~v~I~V~ 264 (303)
|.|.|.|+|. |.+ +++++|+|+|+
T Consensus 67 -----------------------------------y~l~v~a~D~~~~~~~~~~~~V~I~V~ 93 (93)
T PF00028_consen 67 -----------------------------------YQLTVRATDSGGSPPLSSTATVTINVL 93 (93)
T ss_dssp -----------------------------------EEEEEEEEETTTSSEEEEEEEEEEEEE
T ss_pred -----------------------------------EEEEEEEEECCCCCCCEEEEEEEEEEC
Confidence 9999999999 554 67778888774
No 8
>KOG1834|consensus
Probab=99.71 E-value=4.5e-16 Score=148.95 Aligned_cols=151 Identities=25% Similarity=0.420 Sum_probs=129.9
Q ss_pred EEEeCCcc--EEEEcccCCCCCCCceEEEEEEEEECCCCCCCC-----CCeeEEEEEEEEEcCCCCCCcCCCCceEEEEe
Q psy8640 59 QAVDPLTG--VLSVQQPLDYDTGSSEHLLILKVEDGGKPPLDS-----PLSSITHLKITLQDENDNAPKFPITEYLEFVG 131 (303)
Q Consensus 59 F~Id~~tG--~l~~~~~LD~E~~~~~~~l~i~a~D~g~~~~~~-----~~s~~~~v~I~V~DvNDn~P~f~~~~~~~~V~ 131 (303)
-.+|+.|| .|+.+.+||-|.++. |+|+|+|.|.|. +| ..+-.+++.|.|.|+|+++|.|....|.+.|.
T Consensus 84 vVvdK~TGegvlRaK~~lDCelqke-ytf~iQAydCg~---gpdgtn~kKShkatvhIrVkDvNe~AP~f~ep~Yka~V~ 159 (952)
T KOG1834|consen 84 VVVDKYTGEGVLRAKEPLDCELQKE-YTFTIQAYDCGN---GPDGTNTKKSHKATVHIRVKDVNEFAPVFKEPWYKAHVT 159 (952)
T ss_pred EEEeccCCceEEeecCccccccccc-ceEEEEEEecCC---CCCccccccccceEEEEEeccccccCchhcccceeeEEe
Confidence 34588875 688999999999999 999999999986 23 56778899999999999999999999999999
Q ss_pred CCCCCCeEEEEEEEEeCCCC-CCceE-EEEEeeCCCCCCcccCCCcCcEEEECCCcEEEEcccCcccccceEEEEEEEEE
Q psy8640 132 ENEPIGSSVFTARATDMDKG-DYGKL-NYSIISAAASGYTDVDDSWKLFRVDALTGLVTTNAVFDYEARSRYAFTLLSTY 209 (303)
Q Consensus 132 E~~~~gt~v~~v~A~D~D~g-~n~~i-~Y~i~~~~~~~~~~~~~~~~~F~Id~~tG~I~~~~~LD~e~~~~~~l~v~a~~ 209 (303)
|. +.-..|++|.|.|.|.+ .+++| .|.|...+ -.|.||. .|.|+...+|+|.....
T Consensus 160 EG-K~yd~il~veAiD~DCspq~sqIC~YEI~t~d-----------~PFaIdn-~G~irnTekLny~ke~~--------- 217 (952)
T KOG1834|consen 160 EG-KVYDSILRVEAIDKDCSPQYSQICEYEITTPD-----------VPFAIDN-DGNIRNTEKLNYTKEHQ--------- 217 (952)
T ss_pred cc-eeeeeeEEEEeecCCCCCcccceeEEEecCCC-----------CceEEcC-CCcccccccccccccee---------
Confidence 98 56788999999999987 47777 88998866 6799995 69999999999999988
Q ss_pred EEeecccccCCCceEEEeecCceeeeeeecEEEEEEEEEECCCC--eEEEEEEEEEeec
Q psy8640 210 SVWTRSLVWSRPMQCLIMRRGVVFDYEARSRYAFTLLSTDSGGR--TCKVKVRVEIESR 266 (303)
Q Consensus 210 ~~~~~~~~~~~~~~~~i~~~~~~lD~E~~~~y~l~V~a~D~g~~--s~~~~v~I~V~Dv 266 (303)
|.|+|.|.|+|.. .+.+.|+|.|...
T Consensus 218 -------------------------------Y~ltVtAyDCg~kraa~d~lV~v~Vkp~ 245 (952)
T KOG1834|consen 218 -------------------------------YKLTVTAYDCGKKRAASDSLVTVHVKPT 245 (952)
T ss_pred -------------------------------EEEEEEEEecccccccCcceEEEEecCc
Confidence 9999999999985 2336777777543
No 9
>smart00112 CA Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
Probab=99.64 E-value=1.9e-15 Score=111.05 Aligned_cols=77 Identities=36% Similarity=0.593 Sum_probs=70.2
Q ss_pred EeCCCCCCceEEEEEeeCCCCCCcccCCCcCcEEEECCCcEEEEcccCcccccceEEEEEEEEEEEeecccccCCCceEE
Q psy8640 146 TDMDKGDYGKLNYSIISAAASGYTDVDDSWKLFRVDALTGLVTTNAVFDYEARSRYAFTLLSTYSVWTRSLVWSRPMQCL 225 (303)
Q Consensus 146 ~D~D~g~n~~i~Y~i~~~~~~~~~~~~~~~~~F~Id~~tG~I~~~~~LD~e~~~~~~l~v~a~~~~~~~~~~~~~~~~~~ 225 (303)
+|+|.|.|+.++|+|..+... .+|.|++.+|.|++.++||||....
T Consensus 1 ~D~D~g~n~~i~Y~i~~~~~~---------~~F~i~~~tg~i~~~~~LD~e~~~~------------------------- 46 (79)
T smart00112 1 TDADSGENGKVTYSILSGNED---------GLFSIDPETGEITTTKPLDREEQPE------------------------- 46 (79)
T ss_pred CCCCCCcCcEEEEEEecCCCC---------CEEEEeCCccEEEeCCccCeeCCCe-------------------------
Confidence 488999999999999987632 7899999999999999999999888
Q ss_pred EeecCceeeeeeecEEEEEEEEEECCC--CeEEEEEEEEEeecCCCCC
Q psy8640 226 IMRRGVVFDYEARSRYAFTLLSTDSGG--RTCKVKVRVEIESRDEFHP 271 (303)
Q Consensus 226 i~~~~~~lD~E~~~~y~l~V~a~D~g~--~s~~~~v~I~V~DvNdn~P 271 (303)
|.|.|.|+|.|. +++.+.|+|.|.|+|||+|
T Consensus 47 ---------------y~l~v~a~D~~~~~~~~~~~v~I~V~D~Nd~~P 79 (79)
T smart00112 47 ---------------YTLTVEATDGGGPPLSSTATVTVTVLDVNDNAP 79 (79)
T ss_pred ---------------EEEEEEEEECCCCCcccEEEEEEEEEECCCCCC
Confidence 999999999987 5889999999999999998
No 10
>smart00112 CA Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
Probab=99.49 E-value=2.4e-13 Score=99.81 Aligned_cols=58 Identities=38% Similarity=0.648 Sum_probs=55.7
Q ss_pred ccEEEeCCccEEEEcccCCCCCCCceEEEEEEEEECCCCCCCCCCeeEEEEEEEEEcCCCCCC
Q psy8640 57 AIQAVDPLTGVLSVQQPLDYDTGSSEHLLILKVEDGGKPPLDSPLSSITHLKITLQDENDNAP 119 (303)
Q Consensus 57 ~~F~Id~~tG~l~~~~~LD~E~~~~~~~l~i~a~D~g~~~~~~~~s~~~~v~I~V~DvNDn~P 119 (303)
.+|.|++.+|.|++.++||||..+. |.|.|.|.|.|. +++++.+.|.|.|.|+|||+|
T Consensus 22 ~~F~i~~~tg~i~~~~~LD~e~~~~-y~l~v~a~D~~~----~~~~~~~~v~I~V~D~Nd~~P 79 (79)
T smart00112 22 GLFSIDPETGEITTTKPLDREEQPE-YTLTVEATDGGG----PPLSSTATVTVTVLDVNDNAP 79 (79)
T ss_pred CEEEEeCCccEEEeCCccCeeCCCe-EEEEEEEEECCC----CCcccEEEEEEEEEECCCCCC
Confidence 8999999999999999999999999 999999999987 688999999999999999998
No 11
>PF00028 Cadherin: Cadherin domain; InterPro: IPR002126 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion.; GO: 0005509 calcium ion binding, 0007156 homophilic cell adhesion, 0016020 membrane; PDB: 2A4E_A 2A4C_B 2O72_A 2QVI_A 1NCJ_A 3Q2W_A 3Q2N_A 3LNH_B 3LNI_A 3Q2L_A ....
Probab=99.29 E-value=5.3e-11 Score=90.02 Aligned_cols=89 Identities=26% Similarity=0.297 Sum_probs=76.9
Q ss_pred eeEeeccccccccceEEEEEEe--cccCceeeeecccccccCCCCcccEEEeCCccEEEEcccCCCCCCCceEEEEEEEE
Q psy8640 13 QCLIMRRGVVHRENIQVYLVWY--GSTSRLCRRTCGTIPLSQQQRLAIQAVDPLTGVLSVQQPLDYDTGSSEHLLILKVE 90 (303)
Q Consensus 13 ~~~i~e~~~~~~~~~~~~~i~~--~~~~~~~~~~~~~~~~~~~~~~~~F~Id~~tG~l~~~~~LD~E~~~~~~~l~i~a~ 90 (303)
.+.|+||..+|+...+..+..- +.++.+.|. +..++...+|.|++.+|.|++.++||||.... |.|.|.|+
T Consensus 2 ~~~v~E~~~~g~~v~~v~a~D~D~~~n~~i~y~------i~~~~~~~~F~I~~~tg~i~~~~~LD~E~~~~-y~l~v~a~ 74 (93)
T PF00028_consen 2 SFSVPENAPPGTVVGQVTATDPDSGPNSQITYS------ILGGNPDGLFSIDPNTGEISLKKPLDRETQSS-YQLTVRAT 74 (93)
T ss_dssp EEEEETTGSTSSEEEEEEEEESSTSTTSSEEEE------EEETTSTTSEEEETTTTEEEESSSSCTTTTSE-EEEEEEEE
T ss_pred EEEEECCCCCCCEEEEEEEEeCCCCCCceEEEE------EecCcccCceEEeeeeeccccceecCcccCCE-EEEEEEEE
Confidence 3679999999999888888766 467778884 45555579999999999999999999999999 99999999
Q ss_pred EC-CCCCCCCCCeeEEEEEEEEE
Q psy8640 91 DG-GKPPLDSPLSSITHLKITLQ 112 (303)
Q Consensus 91 D~-g~~~~~~~~s~~~~v~I~V~ 112 (303)
|. |. +++++.++|.|.|.
T Consensus 75 D~~~~----~~~~~~~~V~I~V~ 93 (93)
T PF00028_consen 75 DSGGS----PPLSSTATVTINVL 93 (93)
T ss_dssp ETTTS----SEEEEEEEEEEEEE
T ss_pred ECCCC----CCCEEEEEEEEEEC
Confidence 99 66 88999999999884
No 12
>KOG1834|consensus
Probab=99.18 E-value=2.5e-10 Score=110.15 Aligned_cols=142 Identities=23% Similarity=0.305 Sum_probs=108.4
Q ss_pred EEEEcCCCCCCcCCCCceEEEEeCCCCCCeEEEEEEEEeCCCCC--CceE-EEEEeeCCCCCCcccCCCcCcEEEECCCc
Q psy8640 109 ITLQDENDNAPKFPITEYLEFVGENEPIGSSVFTARATDMDKGD--YGKL-NYSIISAAASGYTDVDDSWKLFRVDALTG 185 (303)
Q Consensus 109 I~V~DvNDn~P~f~~~~~~~~V~E~~~~gt~v~~v~A~D~D~g~--n~~i-~Y~i~~~~~~~~~~~~~~~~~F~Id~~tG 185 (303)
....-+|-+.|.. ...|..-|.||-..=...--+.|.|.|.+- .|.| .|.|.+..-. .+..-+|..||
T Consensus 21 ~~aarankhkpwi-e~ey~gvV~Endntvll~Ppl~aLdkdaplr~ageiC~fklhgq~vP--------FdavVvdK~TG 91 (952)
T KOG1834|consen 21 HHAARANKHKPWI-EEEYHGVVTENDNTVLLDPPLAALDKDAPLRYAGEICGFKLHGQPVP--------FDAVVVDKYTG 91 (952)
T ss_pred cccccccccCccc-ccceeEEEEeCCceEEeCCCeeeecCCCCcccccccceeEecCCCCC--------ceEEEEeccCC
Confidence 3445577777765 455999999985322223457788888752 3444 7787775421 02333488775
Q ss_pred --EEEEcccCcccccceEEEEEEEEEEEeecccccCCCceEEEeecCceeeeeeecEEEEEEEEEECCC-C-------eE
Q psy8640 186 --LVTTNAVFDYEARSRYAFTLLSTYSVWTRSLVWSRPMQCLIMRRGVVFDYEARSRYAFTLLSTDSGG-R-------TC 255 (303)
Q Consensus 186 --~I~~~~~LD~e~~~~~~l~v~a~~~~~~~~~~~~~~~~~~i~~~~~~lD~E~~~~y~l~V~a~D~g~-~-------s~ 255 (303)
.|+.+.+||.|.++. |+|+|.|.|+|. + +-
T Consensus 92 egvlRaK~~lDCelqke----------------------------------------ytf~iQAydCg~gpdgtn~kKSh 131 (952)
T KOG1834|consen 92 EGVLRAKEPLDCELQKE----------------------------------------YTFTIQAYDCGNGPDGTNTKKSH 131 (952)
T ss_pred ceEEeecCccccccccc----------------------------------------ceEEEEEEecCCCCCcccccccc
Confidence 577788999998888 999999999875 2 66
Q ss_pred EEEEEEEEeecCCCCCeeCCCeeEEE-ecCCCCCCCcEEEEEEEEeCC
Q psy8640 256 KVKVRVEIESRDEFHPQFTERTFKFI-LSGTDLPVGYVVGHLRLVRFR 302 (303)
Q Consensus 256 ~~~v~I~V~DvNdn~P~F~~~~y~~~-~e~~~~p~gt~v~~v~A~D~D 302 (303)
.++|+|.|.|+|+.+|+|..+.|++. .|.. .-..|++|.|.|.|
T Consensus 132 katvhIrVkDvNe~AP~f~ep~Yka~V~EGK---~yd~il~veAiD~D 176 (952)
T KOG1834|consen 132 KATVHIRVKDVNEFAPVFKEPWYKAHVTEGK---VYDSILRVEAIDKD 176 (952)
T ss_pred ceEEEEEeccccccCchhcccceeeEEecce---eeeeeEEEEeecCC
Confidence 89999999999999999999999999 8887 77899999999998
No 13
>PF08758 Cadherin_pro: Cadherin prodomain like; InterPro: IPR014868 Cadherins are a group of proteins that mediate calcium dependent cell-cell adhesion. They are activated through cleavage of a prosequence in the late Golgi. This protein corresponds to the folded region of the prosequence, and is termed the prodomain. The prodomain shows structural resemblance to the cadherin domain, but lacks all the features known to be important for cadherin-cadherin interactions []. ; GO: 0007155 cell adhesion, 0016021 integral to membrane; PDB: 1OP4_A.
Probab=97.63 E-value=0.00067 Score=51.00 Aligned_cols=66 Identities=18% Similarity=0.170 Sum_probs=32.7
Q ss_pred CCcCCCCceEEEEeCCCCCCeEEEEEEEEeCCCCCCceEEEEEeeCCCCCCcccCCCcCcEEEECCCcEEEEcccCcccc
Q psy8640 118 APKFPITEYLEFVGENEPIGSSVFTARATDMDKGDYGKLNYSIISAAASGYTDVDDSWKLFRVDALTGLVTTNAVFDYEA 197 (303)
Q Consensus 118 ~P~f~~~~~~~~V~E~~~~gt~v~~v~A~D~D~g~n~~i~Y~i~~~~~~~~~~~~~~~~~F~Id~~tG~I~~~~~LD~e~ 197 (303)
-|-|.+..|.+.|+.+...|..+++|.-.|... +..+.|.-.. ..|.|.+ .|.|++++++....
T Consensus 3 ~pGF~~~~~~~~Vp~~l~~g~~lg~V~f~dC~~--~~~~~~~ssD-------------pdF~V~~-DGsVy~~r~v~l~~ 66 (90)
T PF08758_consen 3 RPGFSQKKYTFEVPSNLEAGQPLGKVNFEDCTG--RRRVIFESSD-------------PDFRVLE-DGSVYAKRPVQLSS 66 (90)
T ss_dssp --B--S-EEEE----SS-SS--EEE---B--SS-----EEEE----------------SEEEEET-TTEEEEES--S-SS
T ss_pred cCCcccceEEEEcCchhhCCcEEEEEEeccCCC--CCceEEecCC-------------CCEEEcC-CCeEEEeeeEecCC
Confidence 588999999999999999999999999998743 4567876655 4599986 69999999988765
Q ss_pred cc
Q psy8640 198 RS 199 (303)
Q Consensus 198 ~~ 199 (303)
..
T Consensus 67 ~~ 68 (90)
T PF08758_consen 67 EQ 68 (90)
T ss_dssp S-
T ss_pred Cc
Confidence 44
No 14
>PF08266 Cadherin_2: Cadherin-like; InterPro: IPR013164 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion. This entry represents a cadherin domain that is usually found at the N terminus of cadherin proteins.; PDB: 1WUZ_A 1WYJ_A.
Probab=97.32 E-value=0.00019 Score=53.30 Aligned_cols=63 Identities=27% Similarity=0.305 Sum_probs=39.9
Q ss_pred eEEEEeCCCCCCeEEEEEEEEeCCCCC--CceEEEEEeeCCCCCCcccCCCcCcEEEECCCcEEEEcccCccccc
Q psy8640 126 YLEFVGENEPIGSSVFTARATDMDKGD--YGKLNYSIISAAASGYTDVDDSWKLFRVDALTGLVTTNAVFDYEAR 198 (303)
Q Consensus 126 ~~~~V~E~~~~gt~v~~v~A~D~D~g~--n~~i~Y~i~~~~~~~~~~~~~~~~~F~Id~~tG~I~~~~~LD~e~~ 198 (303)
..++|+|..+.|+.||.+ |.|.-... -....|++...... .+|.+++.+|.|+++..+|||.-
T Consensus 3 i~YsV~EE~~~Gt~IGni-a~dL~l~~~~l~~~~~ri~s~~~~---------~~~~v~~~tG~L~v~~rIDRE~L 67 (84)
T PF08266_consen 3 IRYSVPEEMPPGTVIGNI-AKDLGLDPQSLSSRNFRIVSEGNS---------QYFRVNEKTGDLFVSERIDREEL 67 (84)
T ss_dssp EEEEEESS--TT-EEEEC-CCCCT--HHHHCCTTBEEE-SSSS----------SEEE-TTTSEEEESS--SCCCC
T ss_pred eEEEeecCCCCCCEEEEh-HHhhCCCcccccccceEEeecCCc---------ceeEecCCceeEEeCCccCHHHH
Confidence 467899999999999998 44543221 11246777665432 89999999999999999999965
No 15
>smart00736 CADG Dystroglycan-type cadherin-like domains. Cadherin-homologous domains present in metazoan dystroglycans and alpha/epsilon sarcoglycans, yeast Axl2p and in a very large protein from magnetotactic bacteria. Likely to bind calcium ions.
Probab=95.86 E-value=0.26 Score=37.23 Aligned_cols=73 Identities=19% Similarity=0.256 Sum_probs=52.0
Q ss_pred EEeCCCCCCceEEEEEeeCCCCCCcccCCCcCcEEEECCCcEEEEcccCcccccceEEEEEEEEEEEeecccccCCCceE
Q psy8640 145 ATDMDKGDYGKLNYSIISAAASGYTDVDDSWKLFRVDALTGLVTTNAVFDYEARSRYAFTLLSTYSVWTRSLVWSRPMQC 224 (303)
Q Consensus 145 A~D~D~g~n~~i~Y~i~~~~~~~~~~~~~~~~~F~Id~~tG~I~~~~~LD~e~~~~~~l~v~a~~~~~~~~~~~~~~~~~ 224 (303)
..|.| ...+.|++...+... .+ .....|+.++.+.-. +.... ...
T Consensus 24 F~d~d---~~~lty~~~~~~~~~----lP--~Wl~fd~~~~~~~Gt-P~~~~-~g~------------------------ 68 (97)
T smart00736 24 FTDAD---GDTLTYSATLSDGSA----LP--SWLSFDSDTGTLSGT-PTNSD-VGS------------------------ 68 (97)
T ss_pred eECCC---CCeEEEEEEeCCCCC----CC--CeEEEeCCCCEEEEE-CCCCC-CcE------------------------
Confidence 46666 337888887543100 22 458889888887763 43322 244
Q ss_pred EEeecCceeeeeeecEEEEEEEEEECCCCeEEEEEEEEEeecCC
Q psy8640 225 LIMRRGVVFDYEARSRYAFTLLSTDSGGRTCKVKVRVEIESRDE 268 (303)
Q Consensus 225 ~i~~~~~~lD~E~~~~y~l~V~a~D~g~~s~~~~v~I~V~DvNd 268 (303)
|.|.|.|+|..+.+....++|.|.+.||
T Consensus 69 ----------------~~i~v~a~D~~g~~~~~~f~i~V~~~~~ 96 (97)
T smart00736 69 ----------------LSLKVTATDSSGASASDTFTITVVNTND 96 (97)
T ss_pred ----------------EEEEEEEEECCCCEEEEEEEEEEeCCCC
Confidence 9999999998888889999999999987
No 16
>PF08266 Cadherin_2: Cadherin-like; InterPro: IPR013164 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion. This entry represents a cadherin domain that is usually found at the N terminus of cadherin proteins.; PDB: 1WUZ_A 1WYJ_A.
Probab=95.42 E-value=0.015 Score=43.05 Aligned_cols=65 Identities=9% Similarity=-0.056 Sum_probs=34.1
Q ss_pred ceeEeeccccccccceEEEEEEecccCceeeeecccccccCCCCcccEEEeCCccEEEEcccCCCCCC
Q psy8640 12 MQCLIMRRGVVHRENIQVYLVWYGSTSRLCRRTCGTIPLSQQQRLAIQAVDPLTGVLSVQQPLDYDTG 79 (303)
Q Consensus 12 ~~~~i~e~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~F~Id~~tG~l~~~~~LD~E~~ 79 (303)
+.++|+|....|+....... ...-...--.-.++.+.......+|.+++.+|.|++...+|||.-
T Consensus 3 i~YsV~EE~~~Gt~IGnia~---dL~l~~~~l~~~~~ri~s~~~~~~~~v~~~tG~L~v~~rIDRE~L 67 (84)
T PF08266_consen 3 IRYSVPEEMPPGTVIGNIAK---DLGLDPQSLSSRNFRIVSEGNSQYFRVNEKTGDLFVSERIDREEL 67 (84)
T ss_dssp EEEEEESS--TT-EEEECCC---CCT--HHHHCCTTBEEE-SSSS-SEEE-TTTSEEEESS--SCCCC
T ss_pred eEEEeecCCCCCCEEEEhHH---hhCCCcccccccceEEeecCCcceeEecCCceeEEeCCccCHHHH
Confidence 45788888887766543321 111100000112233344456789999999999999999999985
No 17
>TIGR01965 VCBS_repeat VCBS repeat. This domain of about 100 residues is found multiple (up to 35) copies in long proteins from several species of Vibrio, Colwellia, Bradyrhizobium, and Shewanella (hence the name VCBS) and in smaller copy numbers in proteins from several other bacteria. The large protein size and repeat copy numbers, species distribution, and suggested activities of several member proteins suggests a role for this domain in adhesion.
Probab=95.11 E-value=0.22 Score=38.02 Aligned_cols=30 Identities=23% Similarity=0.357 Sum_probs=24.5
Q ss_pred EEEEEEEEECCCCeEEEEEEEEEeecCCCCCeeCC
Q psy8640 241 YAFTLLSTDSGGRTCKVKVRVEIESRDEFHPQFTE 275 (303)
Q Consensus 241 y~l~V~a~D~g~~s~~~~v~I~V~DvNdn~P~F~~ 275 (303)
-.|++.+.| | .+++|+|+|.-.|| +|+..-
T Consensus 59 dsFtvtv~D-G---tt~~vtItI~GtND-apvi~~ 88 (99)
T TIGR01965 59 DTFTVTSAD-G---TSQTVTITITGAND-AAVIGG 88 (99)
T ss_pred EEEEEEEeC-C---CeEEEEEEEEccCC-CCEEec
Confidence 678888888 4 38899999999999 887643
No 18
>PF13750 Big_3_3: Bacterial Ig-like domain (group 3)
Probab=93.25 E-value=4.2 Score=33.76 Aligned_cols=29 Identities=21% Similarity=0.350 Sum_probs=22.8
Q ss_pred eecEEEEEEEEEECCCCeEEEEEEEEEee
Q psy8640 237 ARSRYAFTLLSTDSGGRTCKVKVRVEIES 265 (303)
Q Consensus 237 ~~~~y~l~V~a~D~g~~s~~~~v~I~V~D 265 (303)
..+.|.|+|.|.|..+-..+..+......
T Consensus 121 ~~~~YtLtV~a~D~aGN~~~~si~F~y~P 149 (158)
T PF13750_consen 121 ADDSYTLTVSATDKAGNQSTKSISFSYMP 149 (158)
T ss_pred CCCeEEEEEEEEecCCCEEEEEEEEEEeC
Confidence 34449999999999888877877776653
No 19
>smart00736 CADG Dystroglycan-type cadherin-like domains. Cadherin-homologous domains present in metazoan dystroglycans and alpha/epsilon sarcoglycans, yeast Axl2p and in a very large protein from magnetotactic bacteria. Likely to bind calcium ions.
Probab=92.46 E-value=2 Score=32.35 Aligned_cols=52 Identities=29% Similarity=0.284 Sum_probs=41.5
Q ss_pred cccEEEeCCccEEEEcccCCCCCCCceEEEEEEEEECCCCCCCCCCeeEEEEEEEEEcCCC
Q psy8640 56 LAIQAVDPLTGVLSVQQPLDYDTGSSEHLLILKVEDGGKPPLDSPLSSITHLKITLQDEND 116 (303)
Q Consensus 56 ~~~F~Id~~tG~l~~~~~LD~E~~~~~~~l~i~a~D~g~~~~~~~~s~~~~v~I~V~DvND 116 (303)
..|...|+.++.++-. +...+ ... |.+.+.|+|+.. .+....+.|.|.+.|+
T Consensus 45 P~Wl~fd~~~~~~~Gt-P~~~~-~g~-~~i~v~a~D~~g------~~~~~~f~i~V~~~~~ 96 (97)
T smart00736 45 PSWLSFDSDTGTLSGT-PTNSD-VGS-LSLKVTATDSSG------ASASDTFTITVVNTND 96 (97)
T ss_pred CCeEEEeCCCCEEEEE-CCCCC-CcE-EEEEEEEEECCC------CEEEEEEEEEEeCCCC
Confidence 4699999999988884 44333 346 999999999753 5788899999999987
No 20
>TIGR03660 T1SS_rpt_143 T1SS-143 repeat domain. This model represents a domain of about 143 amino acids that may occur singly or in up to 23 tandem repeats in very large proteins in the genus Vibrio, and in related species such as Legionella pneumophila, Photobacterium profundum, Rhodopseudomonas palustris, Shewanella pealeana, and Aeromonas hydrophila. Proteins with these domains represent a subset of a broader set of proteins with a particular signal for type 1 secretion, consisting of several glycine-rich repeats modeled by pfam00353, followed by a C-terminal domain modeled by TIGR03661. Proteins with this domain tend to share several properties with the RtxA (Repeats in Toxin) protein of Vibrio cholerae, including a large size often containing tandemly repeated domains and a C-terminal signal for type 1 secretion.
Probab=70.71 E-value=24 Score=28.53 Aligned_cols=46 Identities=24% Similarity=0.363 Sum_probs=31.3
Q ss_pred EEEEEEEEECCCCeEEEEEEEEEeecCCCCCeeCCCeeEEE-ecCCCCCCCc
Q psy8640 241 YAFTLLSTDSGGRTCKVKVRVEIESRDEFHPQFTERTFKFI-LSGTDLPVGY 291 (303)
Q Consensus 241 y~l~V~a~D~g~~s~~~~v~I~V~DvNdn~P~F~~~~y~~~-~e~~~~p~gt 291 (303)
..|.|.|+|..+-.++..+.|+|.| | .|......- +. .|+.- +.|+
T Consensus 87 l~~~v~a~D~DGD~s~~~l~VtI~D--D-~P~~~~~~~-~~V~E~~L-~~g~ 133 (137)
T TIGR03660 87 LNFPIIATDFDGDTSSITLPVTIVD--D-VPTITDVDA-LTVDEDDL-PGGS 133 (137)
T ss_pred EeeeEEEEeCCCCccccEEEEEEEC--C-CCeeccccc-eEEecccc-ccCC
Confidence 6788999998876666788888887 6 577765542 33 45432 3443
No 21
>PF12245 Big_3_2: Bacterial Ig-like domain (group 3); InterPro: IPR022038 This family of proteins is found in bacteria. They have two conserved sequence motifs: AGN and GMT.
Probab=67.83 E-value=20 Score=24.43 Aligned_cols=31 Identities=23% Similarity=0.400 Sum_probs=23.7
Q ss_pred ecEEEEEEEEEECCCCeEEEEEEEEEeecCC
Q psy8640 238 RSRYAFTLLSTDSGGRTCKVKVRVEIESRDE 268 (303)
Q Consensus 238 ~~~y~l~V~a~D~g~~s~~~~v~I~V~DvNd 268 (303)
.+.|.|.+.|.|..|..++....+.+.|..-
T Consensus 22 dg~yt~~v~a~D~AGN~~~~~~~~~i~d~~~ 52 (60)
T PF12245_consen 22 DGEYTLTVTATDKAGNTSSSTTQIVIVDNTA 52 (60)
T ss_pred CccEEEEEEEEECCCCEEEeeeEEEEEcCCC
Confidence 4569999999999887776766666666543
No 22
>TIGR00845 caca sodium/calcium exchanger 1. This model is specific for the eukaryotic sodium ion/calcium ion exchangers of the Caca family
Probab=66.35 E-value=1.4e+02 Score=32.08 Aligned_cols=50 Identities=22% Similarity=0.109 Sum_probs=33.2
Q ss_pred CCCCCCcCCCCceEEEEeCCCCCCeEEEEEEEEeCCCCCCceEEEEEeeCCC
Q psy8640 114 ENDNAPKFPITEYLEFVGENEPIGSSVFTARATDMDKGDYGKLNYSIISAAA 165 (303)
Q Consensus 114 vNDn~P~f~~~~~~~~V~E~~~~gt~v~~v~A~D~D~g~n~~i~Y~i~~~~~ 165 (303)
.||..+.|....-...|.|+. |+.-..|.-...|.+....+.|+...+..
T Consensus 395 ~dd~~s~i~Fe~~~Y~V~En~--GtV~VtV~R~GGdl~~tVsVdY~T~DGTA 444 (928)
T TIGR00845 395 ENDPVSKIFFEPGHYTCLENC--GTVALTVVRRGGDLTNTVYVDYRTEDGTA 444 (928)
T ss_pred ccCCcceEEecCCeEEEeecC--cEEEEEEEEccCCCCceEEEEEEccCCcc
Confidence 566666666666677889984 77666665444354445678999887754
No 23
>cd00146 PKD polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.
Probab=63.67 E-value=15 Score=26.08 Aligned_cols=28 Identities=21% Similarity=0.298 Sum_probs=19.7
Q ss_pred eeecEEEEEEEEEECCCCeEEEEEEEEE
Q psy8640 236 EARSRYAFTLLSTDSGGRTCKVKVRVEI 263 (303)
Q Consensus 236 E~~~~y~l~V~a~D~g~~s~~~~v~I~V 263 (303)
...+.|.+++.|+|..+.+.+.++.|.|
T Consensus 54 ~~~G~y~v~l~v~d~~g~~~~~~~~V~V 81 (81)
T cd00146 54 TKPGTYTVTLTVTNAVGSSSTKTTTVVV 81 (81)
T ss_pred CCCcEEEEEEEEEeCCCCEEEEEEEEEC
Confidence 3344499999999986666665666654
No 24
>PF05345 He_PIG: Putative Ig domain; InterPro: IPR008009 This alignment represents the conserved core region of a ~90 residue repeat found in several haemagglutinins and other cell surface proteins. Sequence similarities to Hyalin (IPR003410 from INTERPRO) and the PKD domain (IPR000601 from INTERPRO) suggest an Ig-like fold so this family may be similar in function to the (IPR003791 from INTERPRO) and (IPR003790 from INTERPRO) protein families.
Probab=61.23 E-value=32 Score=22.40 Aligned_cols=38 Identities=18% Similarity=0.221 Sum_probs=27.2
Q ss_pred CCcccEEEeCCccEEEEcccCCCCCCCceEEEEEEEEECC
Q psy8640 54 QRLAIQAVDPLTGVLSVQQPLDYDTGSSEHLLILKVEDGG 93 (303)
Q Consensus 54 ~~~~~F~Id~~tG~l~~~~~LD~E~~~~~~~l~i~a~D~g 93 (303)
.-..+..+|+.+|.|+-.-.-.-+. -. |.+.|.|+|..
T Consensus 11 ~LP~gLs~d~~tG~isGtp~~~~~~-G~-y~~~vtatd~~ 48 (49)
T PF05345_consen 11 GLPSGLSLDPSTGTISGTPTSSVQP-GT-YTFTVTATDGS 48 (49)
T ss_pred CCCCcEEEeCCCCEEEeecCCCccc-cE-EEEEEEEEcCC
Confidence 3447889999999998773322111 35 99999999864
No 25
>smart00089 PKD Repeats in polycystic kidney disease 1 (PKD1) and other proteins. Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.
Probab=60.22 E-value=20 Score=25.22 Aligned_cols=27 Identities=26% Similarity=0.380 Sum_probs=20.2
Q ss_pred eeecEEEEEEEEEECCCCeEEEEEEEEE
Q psy8640 236 EARSRYAFTLLSTDSGGRTCKVKVRVEI 263 (303)
Q Consensus 236 E~~~~y~l~V~a~D~g~~s~~~~v~I~V 263 (303)
+..+.|.+++.+.|..+ ++++.++|.|
T Consensus 52 ~~~G~y~v~l~v~n~~g-~~~~~~~i~v 78 (79)
T smart00089 52 TKPGTYTVTLTVTNAVG-SASATVTVVV 78 (79)
T ss_pred CCCcEEEEEEEEEcCCC-cEEEEEEEEE
Confidence 34444999999999766 6677777766
No 26
>PF07495 Y_Y_Y: Y_Y_Y domain; InterPro: IPR011123 This region is mostly found at the end of the beta propellers (IPR011110 from INTERPRO) in a family of two component regulators. However they are also found tandemly repeated in Q891H4 from SWISSPROT without other signal conduction domains being present. It is named after the conserved tyrosines found in the alignment. The exact function is not known.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=58.73 E-value=22 Score=24.06 Aligned_cols=26 Identities=15% Similarity=0.427 Sum_probs=17.7
Q ss_pred cEEEEEEEEEECCCC--eEEEEEEEEEe
Q psy8640 239 SRYAFTLLSTDSGGR--TCKVKVRVEIE 264 (303)
Q Consensus 239 ~~y~l~V~a~D~g~~--s~~~~v~I~V~ 264 (303)
+.|.|.|.|.|..+. ....++.|.|+
T Consensus 39 G~Y~l~V~a~~~~~~~~~~~~~l~i~I~ 66 (66)
T PF07495_consen 39 GKYTLEVRAKDNNGKWSSDEKSLTITIL 66 (66)
T ss_dssp EEEEEEEEEEETTS-B-SS-EEEEEEEE
T ss_pred EEEEEEEEEECCCCCcCcccEEEEEEEC
Confidence 459999999997764 22266777663
No 27
>TIGR03660 T1SS_rpt_143 T1SS-143 repeat domain. This model represents a domain of about 143 amino acids that may occur singly or in up to 23 tandem repeats in very large proteins in the genus Vibrio, and in related species such as Legionella pneumophila, Photobacterium profundum, Rhodopseudomonas palustris, Shewanella pealeana, and Aeromonas hydrophila. Proteins with these domains represent a subset of a broader set of proteins with a particular signal for type 1 secretion, consisting of several glycine-rich repeats modeled by pfam00353, followed by a C-terminal domain modeled by TIGR03661. Proteins with this domain tend to share several properties with the RtxA (Repeats in Toxin) protein of Vibrio cholerae, including a large size often containing tandemly repeated domains and a C-terminal signal for type 1 secretion.
Probab=58.34 E-value=55 Score=26.50 Aligned_cols=67 Identities=21% Similarity=0.262 Sum_probs=42.1
Q ss_pred cccEEEeC-Ccc--EEEEcccCCCCCC---CceEEEEEEEEECCCCCCCCCCeeEEEEEEEEEcCCCCCCcCCCCceEEE
Q psy8640 56 LAIQAVDP-LTG--VLSVQQPLDYDTG---SSEHLLILKVEDGGKPPLDSPLSSITHLKITLQDENDNAPKFPITEYLEF 129 (303)
Q Consensus 56 ~~~F~Id~-~tG--~l~~~~~LD~E~~---~~~~~l~i~a~D~g~~~~~~~~s~~~~v~I~V~DvNDn~P~f~~~~~~~~ 129 (303)
...|+|.- ..| ...+.++||+... -. ..|.|.|+|... . .++..+.|.|.| | .|...... ...
T Consensus 55 ~~Vftvtl~~~GsYtftL~~~lDH~~g~d~l~-l~~~v~a~D~DG-----D-~s~~~l~VtI~D--D-~P~~~~~~-~~~ 123 (137)
T TIGR03660 55 NPVFTLTLNADGSYEFTLEGPLDHAAGSDELT-LNFPIIATDFDG-----D-TSSITLPVTIVD--D-VPTITDVD-ALT 123 (137)
T ss_pred cEEEEEEECCCccEEEEEcccccCCCCCceEE-EeeeEEEEeCCC-----C-ccccEEEEEEEC--C-CCeecccc-ceE
Confidence 35676621 245 4666789998542 23 678888998743 1 223488888887 6 47665443 368
Q ss_pred EeCC
Q psy8640 130 VGEN 133 (303)
Q Consensus 130 V~E~ 133 (303)
|.|+
T Consensus 124 V~E~ 127 (137)
T TIGR03660 124 VDED 127 (137)
T ss_pred Eecc
Confidence 8775
No 28
>PF08758 Cadherin_pro: Cadherin prodomain like; InterPro: IPR014868 Cadherins are a group of proteins that mediate calcium dependent cell-cell adhesion. They are activated through cleavage of a prosequence in the late Golgi. This protein corresponds to the folded region of the prosequence, and is termed the prodomain. The prodomain shows structural resemblance to the cadherin domain, but lacks all the features known to be important for cadherin-cadherin interactions []. ; GO: 0007155 cell adhesion, 0016021 integral to membrane; PDB: 1OP4_A.
Probab=55.51 E-value=56 Score=24.35 Aligned_cols=35 Identities=11% Similarity=0.121 Sum_probs=25.9
Q ss_pred ccEEEeCCccEEEEcccCCCCCCCceEEEEEEEEECCC
Q psy8640 57 AIQAVDPLTGVLSVQQPLDYDTGSSEHLLILKVEDGGK 94 (303)
Q Consensus 57 ~~F~Id~~tG~l~~~~~LD~E~~~~~~~l~i~a~D~g~ 94 (303)
..|.|+ ..|.|++++++.....+ -+|.|.|.|...
T Consensus 46 pdF~V~-~DGsVy~~r~v~l~~~~--~~F~V~a~D~~~ 80 (90)
T PF08758_consen 46 PDFRVL-EDGSVYAKRPVQLSSEQ--RSFTVHAWDSQT 80 (90)
T ss_dssp SEEEEE-TTTEEEEES--S-SSS---EEEEEEEEETTT
T ss_pred CCEEEc-CCCeEEEeeeEecCCCc--eEEEEEEECCCC
Confidence 489999 78999999999876543 579999999764
No 29
>cd02848 Chitinase_N_term Chitinase N-terminus domain. Chitinases hydrolyze the abundant natural biopolymer chitin, producing smaller chito-oligosaccharides. Chitin consists of multiple N-acetyl-D-glucosamine (NAG) residues connected via beta-1,4-glycosidic linkages and is an important structural element of fungal cell wall and arthropod exoskeletons. On the basis of the mode of chitin hydrolysis, chitinases are classified as random, endo-, and exo-chitinases and based on sequence criteria, chitinases belong to families 18 and 19 of glycosyl hydrolases. The N-terminus of chitinase may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitob
Probab=51.25 E-value=20 Score=27.66 Aligned_cols=33 Identities=12% Similarity=0.283 Sum_probs=24.4
Q ss_pred eeeeeecEEEEEEEEEECCCCeEEEEEEEEEee
Q psy8640 233 FDYEARSRYAFTLLSTDSGGRTCKVKVRVEIES 265 (303)
Q Consensus 233 lD~E~~~~y~l~V~a~D~g~~s~~~~v~I~V~D 265 (303)
+++.+-+.|.++|.++|..+-+.++.+.|.|-|
T Consensus 74 ~~v~kgG~y~m~V~lCn~dGCS~S~~~~I~VAD 106 (106)
T cd02848 74 FKVGKGGRYQMQVALCNGDGCSTSAAKEIVVAD 106 (106)
T ss_pred EEeCCCCeEEEEEEEECCCCccCcCCEEEEecC
Confidence 445555669999999998776666677777755
No 30
>PF08329 ChitinaseA_N: Chitinase A, N-terminal domain; InterPro: IPR013540 This domain is found in a number of bacterial chitinases and similar viral proteins. It is organised into a fibronectin III module domain-like fold, comprising only beta strands. Its function is not known, but it may be involved in interaction with the enzyme substrate, chitin [, ]. It is separated by a hinge region from the catalytic domain (IPR001223 from INTERPRO); this hinge region is probably mobile, allowing the N-terminal domain to have different relative positions in solution []. ; GO: 0004568 chitinase activity; PDB: 2WLY_A 1EDQ_A 2WM0_A 1X6N_A 1NH6_A 2WK2_A 1EHN_A 2WLZ_A 1EIB_A 1FFR_A ....
Probab=49.86 E-value=34 Score=27.59 Aligned_cols=37 Identities=19% Similarity=0.327 Sum_probs=25.3
Q ss_pred eeeeeeecEEEEEEEEEECCCCeEEEEEEEEEeecCC
Q psy8640 232 VFDYEARSRYAFTLLSTDSGGRTCKVKVRVEIESRDE 268 (303)
Q Consensus 232 ~lD~E~~~~y~l~V~a~D~g~~s~~~~v~I~V~DvNd 268 (303)
.+.....+.|.+.|+++|..+-+.+..+.|.|.|-+-
T Consensus 76 ~~~~~~gG~y~~~VeLCN~~GCS~S~~~~V~VaDTDG 112 (133)
T PF08329_consen 76 TFTVTKGGRYQMQVELCNADGCSTSAPVEVVVADTDG 112 (133)
T ss_dssp EEEE-S-EEEEEEEEEEETTEEEE---EEEEEE-TTS
T ss_pred EEEecCCCEEEEEEEEECCCCcccCCCEEEEEeCCCc
Confidence 3455566779999999998887777899999999874
No 31
>KOG3597|consensus
Probab=46.43 E-value=2.9e+02 Score=26.98 Aligned_cols=60 Identities=17% Similarity=0.102 Sum_probs=42.4
Q ss_pred EEEEEEEEEcCCCCCCcCCCCceEEEEeCCCCCCeEEEEEEEEeCCCCCCceEEEEEeeCC
Q psy8640 104 ITHLKITLQDENDNAPKFPITEYLEFVGENEPIGSSVFTARATDMDKGDYGKLNYSIISAA 164 (303)
Q Consensus 104 ~~~v~I~V~DvNDn~P~f~~~~~~~~V~E~~~~gt~v~~v~A~D~D~g~n~~i~Y~i~~~~ 164 (303)
+....|.|.-+||.+..+....+.+-+.|+...-.....+.+.|+|.+. ..+.|++....
T Consensus 25 ~~~~~i~v~pvndpp~~~~~~~~~l~~~~~~~k~l~~~~l~~~d~d~~~-~~l~f~v~~t~ 84 (442)
T KOG3597|consen 25 TDVLRIHVNPVNDPPSLIFPSGSLLVILEGGQKVLDPELLTAADPDSAP-LPLEFQVLGTS 84 (442)
T ss_pred EeeecccccccCCCcceeecccceEEeecCCceeccceEeeccCCCCCc-cceEEEEccCC
Confidence 3468899999999777776666678888875543334567888888753 45777777654
No 32
>cd02848 Chitinase_N_term Chitinase N-terminus domain. Chitinases hydrolyze the abundant natural biopolymer chitin, producing smaller chito-oligosaccharides. Chitin consists of multiple N-acetyl-D-glucosamine (NAG) residues connected via beta-1,4-glycosidic linkages and is an important structural element of fungal cell wall and arthropod exoskeletons. On the basis of the mode of chitin hydrolysis, chitinases are classified as random, endo-, and exo-chitinases and based on sequence criteria, chitinases belong to families 18 and 19 of glycosyl hydrolases. The N-terminus of chitinase may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitob
Probab=41.59 E-value=1.2e+02 Score=23.48 Aligned_cols=34 Identities=15% Similarity=0.229 Sum_probs=22.3
Q ss_pred cCCCCCCCceEEEEEEEEECCCCCCCCCCeeEEEEEEEEEc
Q psy8640 73 PLDYDTGSSEHLLILKVEDGGKPPLDSPLSSITHLKITLQD 113 (303)
Q Consensus 73 ~LD~E~~~~~~~l~i~a~D~g~~~~~~~~s~~~~v~I~V~D 113 (303)
.+++.+.-. |.++|++.|... .+.+..+.|.|.|
T Consensus 73 t~~v~kgG~-y~m~V~lCn~dG------CS~S~~~~I~VAD 106 (106)
T cd02848 73 TFKVGKGGR-YQMQVALCNGDG------CSTSAAKEIVVAD 106 (106)
T ss_pred EEEeCCCCe-EEEEEEEECCCC------ccCcCCEEEEecC
Confidence 345666667 999999998753 3444556666554
No 33
>PF03160 Calx-beta: Calx-beta domain; InterPro: IPR003644 The calx-beta motif is present as a tandem repeat in the cytoplasmic domains of Calx Na-Ca exchangers, which are used to expel calcium from cells. This motif overlaps domains used for calcium binding and regulation. The calx-beta motif is also present in the cytoplasmic tail of mammalian integrin-beta4, which mediates the bi-directional transfer of signals across the plasma membrane, as well as in some cyanobacterial proteins. This motif contains a series of beta-strands and turns that form a self-contained beta-sheet [, ].; GO: 0007154 cell communication, 0016021 integral to membrane; PDB: 3H6A_B 3FSO_A 3FQ4_B 2DPK_A 2QVM_A 3GIN_B 2QVK_A 2FWU_A 2FWS_A 3E9U_A ....
Probab=35.36 E-value=1.8e+02 Score=21.33 Aligned_cols=53 Identities=19% Similarity=0.277 Sum_probs=31.6
Q ss_pred EEEEEcCCCCCCcCCCCceEEEEeCCCCCCeEEEEEEEEeCCCCCCceEEEEEeeCC
Q psy8640 108 KITLQDENDNAPKFPITEYLEFVGENEPIGSSVFTARATDMDKGDYGKLNYSIISAA 164 (303)
Q Consensus 108 ~I~V~DvNDn~P~f~~~~~~~~V~E~~~~gt~v~~v~A~D~D~g~n~~i~Y~i~~~~ 164 (303)
+|.|.| ||.+ .+....-...+.|+. |.....|.-...+....-.+.|....+.
T Consensus 2 tvtI~d-~d~~-~v~f~~~~~~v~E~~--~~~~v~V~~~~~~~~~~v~v~~~~~~gt 54 (100)
T PF03160_consen 2 TVTILD-DDDP-TVSFSSPSYTVSEGD--GTVTVTVTRSGGSLDGPVTVNYSTVDGT 54 (100)
T ss_dssp EEEEE--TTSE-EEEESSSEEEEETTS--SEEEEEEEEESS-TSSEEEEEEEEEESS
T ss_pred EEEEEC-CCCC-EEEEeCCEEEEEeCC--CEEEEEEEEcccCCCcceEEEEEEeCCc
Confidence 567778 7755 776666677889874 5445555555443333446677776654
No 34
>KOG4221|consensus
Probab=33.12 E-value=7.3e+02 Score=27.71 Aligned_cols=74 Identities=15% Similarity=0.121 Sum_probs=47.4
Q ss_pred cccceEEEEEEecccCceeeeecccccccCCCCcccEEEeCCccEEEEcccCCCCCCCceEEEEEEEEECCCCCCCCCCe
Q psy8640 23 HRENIQVYLVWYGSTSRLCRRTCGTIPLSQQQRLAIQAVDPLTGVLSVQQPLDYDTGSSEHLLILKVEDGGKPPLDSPLS 102 (303)
Q Consensus 23 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~F~Id~~tG~l~~~~~LD~E~~~~~~~l~i~a~D~g~~~~~~~~s 102 (303)
+-.-++.|.+.|+.. +....+.++..+-++++. +.|.... |.+.|.|..... .+.+
T Consensus 547 ~n~~I~~yk~~ys~~----------------~~~~~~~~~~n~~e~ti~---gL~k~Te-Y~~~vvA~N~~G----~g~s 602 (1381)
T KOG4221|consen 547 GNGPITGYKLFYSED----------------DTGKELRVENNATEYTIN---GLEKYTE-YSIRVVAYNSAG----SGVS 602 (1381)
T ss_pred CCCCceEEEEEEEcC----------------CCCceEEEecCccEEEee---cCCCccc-eEEEEEEecCCC----CCCC
Confidence 444566676666542 344678888777777766 4455666 999999998754 3444
Q ss_pred eEEEEEEEEEcCCCCCCc
Q psy8640 103 SITHLKITLQDENDNAPK 120 (303)
Q Consensus 103 ~~~~v~I~V~DvNDn~P~ 120 (303)
+..--.+...|+=+-||.
T Consensus 603 S~~i~V~Tlsd~PsaPP~ 620 (1381)
T KOG4221|consen 603 SADITVRTLSDVPSAPPQ 620 (1381)
T ss_pred CCceEEEeccCCCCCCCc
Confidence 444444556677776664
No 35
>PF05895 DUF859: Siphovirus protein of unknown function (DUF859); InterPro: IPR008577 This entry is represented by Streptococcus phage 7201, Orf39. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This family consists of several uncharacterised proteins from a number of the Siphoviruses as well as some bacterial proteins from Streptococcus species. Some of the members of this family are described as putative minor structural proteins.
Probab=32.15 E-value=5.7e+02 Score=26.21 Aligned_cols=32 Identities=19% Similarity=0.177 Sum_probs=24.3
Q ss_pred EEEEEEEEECCCCCCCCCCeeEEEEEEEEEcCCCCCCcC
Q psy8640 83 HLLILKVEDGGKPPLDSPLSSITHLKITLQDENDNAPKF 121 (303)
Q Consensus 83 ~~l~i~a~D~g~~~~~~~~s~~~~v~I~V~DvNDn~P~f 121 (303)
.++++.++|... ..+...+..|.|.+-. +|.+
T Consensus 300 ~Ti~atVtDSRG-----r~S~~~~~tItVl~Y~--~P~l 331 (624)
T PF05895_consen 300 ATIRATVTDSRG-----RTSDPKTKTITVLEYS--PPTL 331 (624)
T ss_pred EEEEEEEEECCC-----ccCCceEEEEEEEEcC--CCcE
Confidence 889999999863 3456778899998776 5665
No 36
>TIGR00845 caca sodium/calcium exchanger 1. This model is specific for the eukaryotic sodium ion/calcium ion exchangers of the Caca family
Probab=27.88 E-value=8e+02 Score=26.52 Aligned_cols=57 Identities=21% Similarity=0.223 Sum_probs=34.5
Q ss_pred EEEEEEEEcCCCCCCcCCCCceEEEEeCCCCCCeEEEEEEEEeCCCCCCceEEEEEeeCCC
Q psy8640 105 THLKITLQDENDNAPKFPITEYLEFVGENEPIGSSVFTARATDMDKGDYGKLNYSIISAAA 165 (303)
Q Consensus 105 ~~v~I~V~DvNDn~P~f~~~~~~~~V~E~~~~gt~v~~v~A~D~D~g~n~~i~Y~i~~~~~ 165 (303)
...+|.|.| ||++|.|....-...|.|+. |..-.+|.-+- +....-.+.|.-..+..
T Consensus 515 s~ATVTIlD-DD~aGIfsFe~~~~sV~Es~--G~vtvtV~Rts-Ga~G~VtV~Y~T~dGTA 571 (928)
T TIGR00845 515 NTATVTILD-DDHAGIFTFEEDVFHVSESI--GIMEVKVLRTS-GARGTVIVPYRTVEGTA 571 (928)
T ss_pred ceEEEEEec-CcccCcccccCceEEEEcCC--CEEEEEEEEcC-CCCeeEEEEEEeecCcc
Confidence 456677777 88899887776678899974 55444432221 11112346787777653
No 37
>PF00778 DIX: DIX domain; InterPro: IPR001158 Dishevelled (Dsh) protein is an important component of the Wnt signal-transduction pathway. It has three relatively conserved domains: DIX, PDZ and DEP. The DIX domain of Dvl-1 (a mammalian Dishevelled homologue) shares 37% identity with the C-terminal region of Axin. Dsh can interact with the Axin/APC/GSK3/beta-catenin complex, and may thus modulate its activity []. The Wnt signalling pathway is conserved in various species from Caenorhabditis elegans to mammals, and plays important roles in development, cellular proliferation, and differentiation. The molecular mechanisms by which the Wnt signal regulates cellular functions are becoming increasingly well understood. Wnt stabilises cytoplasmic beta-catenin, which stimulates the expression of genes including c-myc, c-jun, fra-1, and cyclin D1. Axin and its homologue Axil are components of the Wnt signalling pathway that negatively regulate this pathway. Other components of the Wnt signalling pathway, including Dvl, glycogen synthase kinase-3beta (GSK-3beta), beta-catenin, and adenomatous polyposis coli (APC), interact with Axin, and the phosphorylation and stability of beta-catenin are regulated in the Axin complex. Axil has similar functions to Axin. Thus, Axin and Axil act as scaffold proteins in the Wnt signalling pathway, thereby modulating the Wnt-dependent cellular functions [].; GO: 0004871 signal transducer activity, 0007275 multicellular organismal development, 0005622 intracellular; PDB: 1WSP_B 2D5G_A 3PZ7_A 3PZ8_H.
Probab=26.85 E-value=1.7e+02 Score=21.58 Aligned_cols=42 Identities=33% Similarity=0.528 Sum_probs=23.2
Q ss_pred EEEEEEEEECCCCeEEEEEEEEEeecCCCCCeeCCCeeEEEecCCCCCCCcEEEEEEEEe
Q psy8640 241 YAFTLLSTDSGGRTCKVKVRVEIESRDEFHPQFTERTFKFILSGTDLPVGYVVGHLRLVR 300 (303)
Q Consensus 241 y~l~V~a~D~g~~s~~~~v~I~V~DvNdn~P~F~~~~y~~~~e~~~~p~gt~v~~v~A~D 300 (303)
|.+-..+.|...- + .|...|.|-++.-|.| .|.+++.|...|
T Consensus 43 yrffFK~~~~d~~-~--~V~eEi~dD~~~LP~~---------------eGkI~~~v~~~d 84 (84)
T PF00778_consen 43 YRFFFKSLDPDFG-C--VVKEEITDDDDILPLF---------------EGKIVAKVESVD 84 (84)
T ss_dssp EEEEEEEEETTTE-T--EEEEEE-STTSB---B---------------TTEEEEEEEE--
T ss_pred ceEEEEEeCCCCC-e--eeEEEEcCCccccccc---------------CCEEEEEEEeCC
Confidence 5555565553322 1 6788889999988988 556666665554
No 38
>PF13754 Big_3_4: Bacterial Ig-like domain (group 3)
Probab=26.26 E-value=92 Score=20.53 Aligned_cols=15 Identities=33% Similarity=0.654 Sum_probs=12.4
Q ss_pred cEEEEEEEEEECCCC
Q psy8640 239 SRYAFTLLSTDSGGR 253 (303)
Q Consensus 239 ~~y~l~V~a~D~g~~ 253 (303)
+.|.|++.|+|..+.
T Consensus 24 G~y~itv~a~D~AGN 38 (54)
T PF13754_consen 24 GTYTITVTATDAAGN 38 (54)
T ss_pred ccEEEEEEEEeCCCC
Confidence 349999999998764
No 39
>PF13750 Big_3_3: Bacterial Ig-like domain (group 3)
Probab=26.02 E-value=3.7e+02 Score=22.09 Aligned_cols=36 Identities=25% Similarity=0.386 Sum_probs=25.2
Q ss_pred cEEEEEE-EEEECCCCeEEEEEEEEEeecCCCCCeeCC
Q psy8640 239 SRYAFTL-LSTDSGGRTCKVKVRVEIESRDEFHPQFTE 275 (303)
Q Consensus 239 ~~y~l~V-~a~D~g~~s~~~~v~I~V~DvNdn~P~F~~ 275 (303)
+.|.+.+ .|.|..+...+..+...+. ++-.+|.+.-
T Consensus 15 G~Y~l~~~~a~D~agN~~~~~~~~~~~-iD~T~Ptisi 51 (158)
T PF13750_consen 15 GSYTLTVVTATDAAGNTSTSTVSETFT-IDNTPPTISI 51 (158)
T ss_pred ccEEEEEEEEEecCCCEEEEEEeeEEE-EcCCCCEEEE
Confidence 5699999 7999877766666554444 3445888755
Done!