Query psy6838
Match_columns 442
No_of_seqs 311 out of 1620
Neff 9.6
Searched_HMMs 46136
Date Fri Aug 16 19:42:46 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy6838.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/6838hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG4289|consensus 100.0 1.3E-74 2.9E-79 582.7 40.7 433 1-439 259-933 (2531)
2 KOG4289|consensus 100.0 5.7E-67 1.2E-71 527.6 42.5 425 6-439 159-827 (2531)
3 KOG1219|consensus 100.0 3.4E-64 7.4E-69 521.8 47.1 437 1-442 407-1106(4289)
4 KOG1219|consensus 100.0 5.5E-62 1.2E-66 505.5 45.8 429 1-442 2168-2829(4289)
5 cd00031 CA Cadherin repeat dom 100.0 9.1E-31 2E-35 233.7 27.9 197 162-390 2-199 (199)
6 cd00031 CA Cadherin repeat dom 100.0 1.5E-30 3.2E-35 232.3 28.8 196 16-253 2-199 (199)
7 KOG1834|consensus 99.8 1.5E-17 3.2E-22 160.3 18.9 210 145-391 20-245 (952)
8 PF00028 Cadherin: Cadherin do 99.8 1.4E-17 3E-22 129.1 14.3 92 298-389 1-93 (93)
9 PF00028 Cadherin: Cadherin do 99.7 1.2E-16 2.6E-21 123.8 12.5 91 162-252 1-93 (93)
10 smart00112 CA Cadherin repeats 99.7 4.4E-16 9.6E-21 116.7 11.0 79 318-396 1-79 (79)
11 KOG1834|consensus 99.6 2.1E-14 4.5E-19 138.9 20.3 152 100-254 86-245 (952)
12 smart00112 CA Cadherin repeats 99.6 2.2E-15 4.8E-20 112.9 10.6 78 182-259 1-79 (79)
13 PF08758 Cadherin_pro: Cadheri 97.5 0.0012 2.5E-08 49.9 9.7 78 290-375 3-80 (90)
14 PF08266 Cadherin_2: Cadherin- 97.4 8.7E-05 1.9E-09 55.3 2.5 63 15-78 2-66 (84)
15 PF08266 Cadherin_2: Cadherin- 97.4 9.9E-05 2.2E-09 55.0 2.5 62 298-360 3-66 (84)
16 PF08758 Cadherin_pro: Cadheri 97.1 0.0084 1.8E-07 45.3 10.1 79 8-133 3-81 (90)
17 smart00736 CADG Dystroglycan-t 97.0 0.014 3.1E-07 45.0 11.0 69 318-393 25-96 (97)
18 TIGR01965 VCBS_repeat VCBS rep 97.0 0.0084 1.8E-07 45.9 9.1 87 313-410 2-97 (99)
19 TIGR01965 VCBS_repeat VCBS rep 96.8 0.015 3.1E-07 44.6 9.2 50 111-170 49-98 (99)
20 smart00736 CADG Dystroglycan-t 96.7 0.035 7.7E-07 42.8 11.1 31 118-153 66-96 (97)
21 PF13750 Big_3_3: Bacterial Ig 95.9 0.86 1.9E-05 38.5 16.3 126 118-252 14-148 (158)
22 PF13750 Big_3_3: Bacterial Ig 95.6 1.2 2.5E-05 37.8 17.3 76 314-392 70-151 (158)
23 TIGR00845 caca sodium/calcium 93.0 16 0.00035 39.7 21.0 48 151-200 395-443 (928)
24 PF05345 He_PIG: Putative Ig d 92.4 0.52 1.1E-05 31.1 5.5 37 336-374 11-48 (49)
25 TIGR00845 caca sodium/calcium 90.7 8.1 0.00018 41.9 15.0 53 382-437 515-569 (928)
26 TIGR03660 T1SS_rpt_143 T1SS-14 88.6 12 0.00027 30.6 11.8 57 106-170 69-128 (137)
27 TIGR00864 PCC polycystin catio 88.1 75 0.0016 39.0 30.0 112 25-151 1482-1594(2740)
28 PF07495 Y_Y_Y: Y_Y_Y domain; 87.9 6.1 0.00013 27.5 8.4 60 188-251 6-65 (66)
29 PF07495 Y_Y_Y: Y_Y_Y domain; 87.1 8 0.00017 26.9 8.6 60 325-389 7-66 (66)
30 TIGR03660 T1SS_rpt_143 T1SS-14 79.5 23 0.0005 29.1 9.3 56 349-410 69-127 (137)
31 PF02010 REJ: REJ domain; Int 79.5 3.2 6.9E-05 41.6 5.3 69 359-436 270-340 (440)
32 PF05345 He_PIG: Putative Ig d 78.8 11 0.00025 24.6 6.0 35 96-132 14-49 (49)
33 KOG4221|consensus 77.0 1.3E+02 0.0029 33.8 18.1 59 336-397 562-620 (1381)
34 PF12245 Big_3_2: Bacterial Ig 58.5 45 0.00098 22.8 5.8 30 361-392 22-51 (60)
35 PF13753 SWM_repeat: Putative 55.8 1.9E+02 0.004 27.5 18.1 111 117-239 10-125 (317)
36 cd02848 Chitinase_N_term Chiti 55.0 25 0.00053 27.4 4.3 34 355-390 73-106 (106)
37 cd00146 PKD polycystic kidney 54.7 30 0.00065 25.0 4.8 31 356-388 51-81 (81)
38 TIGR00864 PCC polycystin catio 54.7 5.3E+02 0.012 32.4 35.3 194 25-254 1398-1594(2740)
39 PF12245 Big_3_2: Bacterial Ig 52.8 59 0.0013 22.2 5.6 30 118-152 22-51 (60)
40 smart00089 PKD Repeats in poly 51.7 36 0.00079 24.4 4.8 31 355-388 48-78 (79)
41 PF13754 Big_3_4: Bacterial Ig 49.0 63 0.0014 21.5 5.2 16 360-375 22-37 (54)
42 COG2706 3-carboxymuconate cycl 47.6 1.5E+02 0.0032 28.4 8.9 34 43-76 254-288 (346)
43 PF02494 HYR: HYR domain; Int 46.8 42 0.00091 24.4 4.4 25 362-388 57-81 (81)
44 PF08329 ChitinaseA_N: Chitina 45.6 72 0.0016 26.0 5.8 51 355-411 76-126 (133)
45 KOG4221|consensus 41.5 5.9E+02 0.013 29.1 22.5 137 104-260 481-620 (1381)
46 cd00146 PKD polycystic kidney 40.7 67 0.0015 23.1 4.7 29 114-147 52-80 (81)
47 smart00089 PKD Repeats in poly 38.0 82 0.0018 22.5 4.8 30 219-251 49-78 (79)
48 PF03160 Calx-beta: Calx-beta 36.6 1.8E+02 0.004 21.8 10.7 24 248-273 2-25 (100)
49 PF02494 HYR: HYR domain; Int 32.3 95 0.0021 22.5 4.4 24 120-148 58-81 (81)
50 cd02848 Chitinase_N_term Chiti 31.7 1.1E+02 0.0023 23.9 4.5 34 112-150 73-106 (106)
51 PF13753 SWM_repeat: Putative 27.1 5.4E+02 0.012 24.3 12.7 26 223-250 10-35 (317)
52 PF03174 CHB_HEX_C: Chitobiase 24.3 1.6E+02 0.0034 21.0 4.2 58 66-133 10-67 (75)
53 PF08329 ChitinaseA_N: Chitina 23.9 2.8E+02 0.0061 22.6 5.9 36 112-152 76-111 (133)
54 PF13290 CHB_HEX_C_1: Chitobia 22.6 2.8E+02 0.0061 19.4 5.2 44 80-133 18-61 (67)
55 TIGR03769 P_ac_wall_RPT actino 20.1 2E+02 0.0044 18.0 3.5 14 358-371 9-22 (41)
No 1
>KOG4289|consensus
Probab=100.00 E-value=1.3e-74 Score=582.66 Aligned_cols=433 Identities=37% Similarity=0.594 Sum_probs=416.9
Q ss_pred CCcCCCCCCccCCCeEEEEEecCCCCCcEEEEEEeeeCCCCCCcEEEEEEEeCCCCCcEEEcCCccEEEEeeeccccccc
Q psy6838 1 VSDTNDNSPVFSEPVYAFDVLEDLPRGSRVGQVFTTDADEGVNGVVTYSVVSDWANDVFSINPQSGIFTLTAKLDYEDVS 80 (442)
Q Consensus 1 V~DvNDn~P~f~~~~~~~~v~E~~~~g~~v~~~~a~D~D~~~~~~~~y~l~~~~~~~~f~id~~tg~~~~~~~ld~~~~~ 80 (442)
|.|+|||.|+|.+.+|..++.||.++|..|.++.|+|.|.++|+.+.|++..+++...|.||+++|.|+..+.||+|...
T Consensus 259 V~D~nDhsPvFEq~~Y~e~lREn~evGy~vLtvrAtD~Dsp~Nani~Yrl~eg~~~~~f~in~rSGvI~T~a~lDRE~~~ 338 (2531)
T KOG4289|consen 259 VLDTNDHSPVFEQDEYREELRENLEVGYEVLTVRATDGDSPPNANIRYRLLEGNAKNVFEINPRSGVISTRAPLDREELE 338 (2531)
T ss_pred EeecCCCCcccchhHHHHHHhhccccCceEEEEEeccCCCCCCCceEEEecCCCccceeEEcCccceeeccCccCHHhhh
Confidence 67999999999999999999999999999999999999999999999999999999999999999999999999998875
Q ss_pred ------------------------------------------------------------------CCCcEEEEEEeCCC
Q psy6838 81 ------------------------------------------------------------------DNGRLEYSIVSGDD 94 (442)
Q Consensus 81 ------------------------------------------------------------------~~~~i~y~i~~~~~ 94 (442)
.|+.+.|+|.+|..
T Consensus 339 ~y~L~VeAsDqG~~pgp~Ta~V~itV~D~NDNaPqFse~~Yvvqv~Edvt~~avvlrV~AtDrD~g~Ng~VHYsi~Sgn~ 418 (2531)
T KOG4289|consen 339 SYQLDVEASDQGRPPGPRTAMVEITVEDENDNAPQFSEKRYVVQVREDVTPPAVVLRVTATDRDKGTNGKVHYSIASGNG 418 (2531)
T ss_pred heEEEEEeccCCCCCCCceEEEEEEEEecCCCCccccccceEEEecccCCCCceEEEEEecccCCCcCceEEEEeeccCc
Confidence 78899999999999
Q ss_pred CCcEEEeCCccEEEEcccCCcccCcEEEEEEEEEeCCCCCCCCceeEEEEEEEEEeccCCCCeeccC-CeeEeeCCCCCC
Q psy6838 95 EEIFTINPSNGTLSNLKHLDRETKSSYNLVVMATDMAKPPQKRLSSTTQVSIVIRDVNDNAPEFITA-NETTIPENSPLN 173 (442)
Q Consensus 95 ~~~F~i~~~~g~l~~~~~lD~E~~~~~~l~v~a~D~~~~~~~~~s~~~~v~I~V~dvNd~~P~f~~~-~~~~v~E~~~~g 173 (442)
-+.|.||..+|+|.+..+||+|.. .|+++|+|.|++.|+ ++.+.-+.|+|.|+|||+|.|... ...+|.|+.+.|
T Consensus 419 ~G~f~id~~tGel~vv~plD~e~~-~ytl~IrAqDggrPp---Lsn~sgl~iqVlDINDhaPifvstpfq~tvlEnv~lg 494 (2531)
T KOG4289|consen 419 RGQFYIDSLTGELDVVEPLDFENS-EYTLRIRAQDGGRPP---LSNTSGLVIQVLDINDHAPIFVSTPFQATVLENVPLG 494 (2531)
T ss_pred cccEEEecccceEEEeccccccCC-eeEEEEEcccCCCCC---ccCCCceEEEEEecCCCCceeEechhhhhhhhccccc
Confidence 999999999999999999999998 999999999999987 888888889999999999999876 889999999999
Q ss_pred cEEEEEEEEeCCCCCCceEEEEEecCCCCCeEecCcceEEEEcccCCcccccEEEEEEEEEeCCCCCCceeEEEEEEEEe
Q psy6838 174 TVVAALKAIDRDEGQNSYVEYSIINSEATPFSLGPVDGLLRVAQKLDREIKSNYTLEIKAKDRGEPSKSSRTKFFIRLLD 253 (442)
Q Consensus 174 ~~v~~v~a~D~D~~~~~~~~y~i~~~~~~~F~id~~tG~i~~~~~lD~e~~~~~~l~V~a~D~~~~~~~~~~~v~I~V~d 253 (442)
..++.+.|.|.|.|.|+.+.|++.+-. +|.|+..+|+|++.+.||||+.+.|.|.|.|+|.|.|++++.+.|.|.+.|
T Consensus 495 ~~v~~vqaidadsg~na~l~y~laG~~--pf~I~~~SG~Itvtk~ldrEt~~~ysl~V~ard~gtp~l~tstsI~Vtv~d 572 (2531)
T KOG4289|consen 495 YLVCHVQAIDADSGENARLHYSLAGVG--PFQINNGSGWITVTKELDRETVEHYSLGVEARDHGTPPLSTSTSISVTVLD 572 (2531)
T ss_pred ceEEEEecccCCCCcccceeeeeccCC--CeeEecCCceEEEeecccccccceEEEEEEEcCCCCCcccccceEEEEecc
Confidence 999999999999999999999998864 899999999999999999999999999999999999999999999999999
Q ss_pred CCCCCCeeeccceeeeccccccccceEEE---------------------------------------------------
Q psy6838 254 ENDNNPIFDSKSYSASVPENISVGATVLQ--------------------------------------------------- 282 (442)
Q Consensus 254 ~nd~~P~f~~~~~~~~v~e~~~~g~~v~~--------------------------------------------------- 282 (442)
+|||.|.|++..|++.+.|+.+.|+.|++
T Consensus 573 vndndP~Ft~~eytl~inED~pvgsSI~tvtAvD~d~~s~ityqi~g~ntrn~Fsi~si~g~Glitlalp~dkKqe~~~v 652 (2531)
T KOG4289|consen 573 VNDNDPTFTQKEYTLRINEDAPVGSSIVTVTAVDRDANSVITYQITGGNTRNRFSISSIGGGGLITLALPLDKKQERQYV 652 (2531)
T ss_pred cCCCCCccccCceEEEecCCccccceEEEEEEeccccccceEEEecCCcccccceeeccCCcceEEeecchhhcccceEE
Confidence 99999999999999999999999998876
Q ss_pred --------------------------------------------------------------------------------
Q psy6838 283 -------------------------------------------------------------------------------- 282 (442)
Q Consensus 283 -------------------------------------------------------------------------------- 282 (442)
T Consensus 653 l~vtAtDg~l~d~~~V~v~I~danThrpvFqs~pfTvsI~e~rP~G~tvvtlsasd~D~geNARI~y~led~~Frid~ds 732 (2531)
T KOG4289|consen 653 LAVTATDGTLQDTCSVNVNITDANTHRPVFQSSPFTVSINEDRPLGTTVVTLSASDEDTGENARITYILEDEAFRIDPDS 732 (2531)
T ss_pred EEEEecCCccccceEEEEEeeecccCCcccccCCeeEeeccCCcCCceeEEEecccCCCCccceEEEEecccceeecCCC
Confidence
Q ss_pred ----------------------------------------EeecCCCCCccCCcceeEEEecCCCCCcEEEEEEEEeCCC
Q psy6838 283 ----------------------------------------VSDTNDNSPVFSEPVYAFDVLEDLPRGSRVGQVFTTDADE 322 (442)
Q Consensus 283 ----------------------------------------v~~~n~~~p~~~~~~~~~~v~E~~~~g~~v~~v~a~D~D~ 322 (442)
|.++||++|.|..+.|+++|.|+.|+++.+++|+|+|.|.
T Consensus 733 g~i~t~~~ld~edqvtytl~itA~D~~~pq~adtttveV~v~diNDnaPqf~assyt~sV~Ed~Pv~TsvlQVSatDaD~ 812 (2531)
T KOG4289|consen 733 GAIYTQAELDYEDQVTYTLAITARDNGIPQKADTTTVEVLVNDINDNAPQFLASSYTGSVFEDAPVFTSVLQVSATDADS 812 (2531)
T ss_pred CceEEeeeeecccceeeEeeeeecCCCCCCcCccEEEEEEeecccccCcccchhhceeEeecCCCCcceEEEEEEeccCC
Confidence 6789999999999999999999999999999999999999
Q ss_pred CCCeEEEEEEEeC-CCCCCeEEeCCccEEEEcccCCCccCcEEEEEEEEEECCCCCceEEEEEEEEEeeCCCCCCccCCC
Q psy6838 323 GVNGVVTYSVVSD-WANDVFSINPQSGIFTLTAKLDYEDVHHYIFVVQAQDTGKPMMSSTVTVYFNVVDLNDNPPLFDPM 401 (442)
Q Consensus 323 ~~~~~i~y~l~~~-~~~~~f~id~~tG~l~~~~~ld~e~~~~~~l~V~a~D~~~~~~~~~~~v~I~V~dvNd~~P~f~~~ 401 (442)
+.|+.+.|.+.++ +..+.|.|++++|.|++.+.||||....|.|.+.|.|.|.|++++.+.|+|+|.|+|||||.|.+.
T Consensus 813 g~Ng~v~y~~qg~~d~p~~F~IEptSGviRtl~rLdRE~~avy~L~a~avDrg~p~ls~~~eItvtvldvNDnaPvfe~~ 892 (2531)
T KOG4289|consen 813 GPNGRVYYTFQGGDDGPGDFYIEPTSGVIRTLRRLDRENVAVYVLAAYAVDRGNPPLSAPVEITVTVLDVNDNAPVFEQD 892 (2531)
T ss_pred CCCceEEEEecCCCCCCCceEEccCcceeehhhhhcchheeEEEEEEEEeeCCCCCcCCceEEEEEEEecCCCCCCCCCc
Confidence 9999999999755 345799999999999999999999999999999999999999999999999999999999999999
Q ss_pred CCeeEEecCCCCCceEEEEEEEeCCCC---eeEEEEeeCCC
Q psy6838 402 PPNIPVYENVTLGTGLASVHAVDLDSG---KLPFLLPIGDF 439 (442)
Q Consensus 402 ~~~~~v~E~~~~g~~v~~v~a~D~D~~---~~~y~i~~~~~ 439 (442)
+++..|.||.++|..+++|+|.|||+| .|+|+|.+|+.
T Consensus 893 e~e~~I~enspvgs~va~i~a~dpdEG~NA~IsYqIvgg~d 933 (2531)
T KOG4289|consen 893 ELELFIEENSPVGSVVALITADDPDEGPNAHISYQIVGGND 933 (2531)
T ss_pred ceeeEEeecCccceeeEEEEccCCCcCCcceEEEeeccCcc
Confidence 999999999999999999999999999 89999999975
No 2
>KOG4289|consensus
Probab=100.00 E-value=5.7e-67 Score=527.65 Aligned_cols=425 Identities=36% Similarity=0.577 Sum_probs=393.4
Q ss_pred CCCCccCCCeEEEEEecCCCCCcEEEEEEeeeCCCCCCcEEEEEEEeCC---CCCcEEEcCCccEEEEeeeccccccc--
Q psy6838 6 DNSPVFSEPVYAFDVLEDLPRGSRVGQVFTTDADEGVNGVVTYSVVSDW---ANDVFSINPQSGIFTLTAKLDYEDVS-- 80 (442)
Q Consensus 6 Dn~P~f~~~~~~~~v~E~~~~g~~v~~~~a~D~D~~~~~~~~y~l~~~~---~~~~f~id~~tg~~~~~~~ld~~~~~-- 80 (442)
-|+|.|.+..|+..++||.|+||.|+++.|.|+|. +.+.|++..-- ..++|+||+.+|++++...||+|.+.
T Consensus 159 ~~~~~Fqq~~Yq~~lpEn~pagT~iasv~A~~~~a---~rl~Ysm~al~dsRS~~lFslD~~sG~irta~~lDREt~e~H 235 (2531)
T KOG4289|consen 159 ANAVQFQQPNYQKELPENEPAGTIIASVKASDPDA---GRLYYSMVALFDSRSQNLFSLDPMSGAIRTAKSLDRETKETH 235 (2531)
T ss_pred CCCccCCCcchhccCcCCCCCCceeEEEEecCCCc---CceEEEeeeccchhccccEeeccccccchhhhhhhhhhhhee
Confidence 46899999999999999999999999999999994 47999997542 35899999999999999999999876
Q ss_pred ----------------------------------------------------------------CCCcEEEEEEeCCCCC
Q psy6838 81 ----------------------------------------------------------------DNGRLEYSIVSGDDEE 96 (442)
Q Consensus 81 ----------------------------------------------------------------~~~~i~y~i~~~~~~~ 96 (442)
+|..|+|++.+|.+..
T Consensus 236 vlrVtA~d~~~P~~SAtttv~V~V~D~nDhsPvFEq~~Y~e~lREn~evGy~vLtvrAtD~Dsp~Nani~Yrl~eg~~~~ 315 (2531)
T KOG4289|consen 236 VLRVTAQDHGDPRRSATTTVTVLVLDTNDHSPVFEQDEYREELRENLEVGYEVLTVRATDGDSPPNANIRYRLLEGNAKN 315 (2531)
T ss_pred EEEEEeeecCCCcccceeEEEEEEeecCCCCcccchhHHHHHHhhccccCceEEEEEeccCCCCCCCceEEEecCCCccc
Confidence 7899999999999999
Q ss_pred cEEEeCCccEEEEcccCCcccCcEEEEEEEEEeCCCCCCCCceeEEEEEEEEEeccCCCCeeccC-CeeEeeCCCCCCcE
Q psy6838 97 IFTINPSNGTLSNLKHLDRETKSSYNLVVMATDMAKPPQKRLSSTTQVSIVIRDVNDNAPEFITA-NETTIPENSPLNTV 175 (442)
Q Consensus 97 ~F~i~~~~g~l~~~~~lD~E~~~~~~l~v~a~D~~~~~~~~~s~~~~v~I~V~dvNd~~P~f~~~-~~~~v~E~~~~g~~ 175 (442)
.|.|++.+|.|.+..+||||+...|.|.|.|+|.|.++.. .++.|.|+|.|+|||+|+|... |.+.|.|+..++++
T Consensus 316 ~f~in~rSGvI~T~a~lDRE~~~~y~L~VeAsDqG~~pgp---~Ta~V~itV~D~NDNaPqFse~~Yvvqv~Edvt~~av 392 (2531)
T KOG4289|consen 316 VFEINPRSGVISTRAPLDREELESYQLDVEASDQGRPPGP---RTAMVEITVEDENDNAPQFSEKRYVVQVREDVTPPAV 392 (2531)
T ss_pred eeEEcCccceeeccCccCHHhhhheEEEEEeccCCCCCCC---ceEEEEEEEEecCCCCccccccceEEEecccCCCCce
Confidence 9999999999999999999999999999999999998754 3899999999999999999987 99999999999999
Q ss_pred EEEEEEEeCCCCCCceEEEEEecCC-CCCeEecCcceEEEEcccCCcccccEEEEEEEEEeCCCCCCceeEEEEEEEEeC
Q psy6838 176 VAALKAIDRDEGQNSYVEYSIINSE-ATPFSLGPVDGLLRVAQKLDREIKSNYTLEIKAKDRGEPSKSSRTKFFIRLLDE 254 (442)
Q Consensus 176 v~~v~a~D~D~~~~~~~~y~i~~~~-~~~F~id~~tG~i~~~~~lD~e~~~~~~l~V~a~D~~~~~~~~~~~v~I~V~d~ 254 (442)
|.+|+|+|.|.|.|+.+.|+|.+++ .+.|.||..||+|.+..+||+|.. .|++.|+|.|+|.|+++.+.-+.|.|.|+
T Consensus 393 vlrV~AtDrD~g~Ng~VHYsi~Sgn~~G~f~id~~tGel~vv~plD~e~~-~ytl~IrAqDggrPpLsn~sgl~iqVlDI 471 (2531)
T KOG4289|consen 393 VLRVTATDRDKGTNGKVHYSIASGNGRGQFYIDSLTGELDVVEPLDFENS-EYTLRIRAQDGGRPPLSNTSGLVIQVLDI 471 (2531)
T ss_pred EEEEEecccCCCcCceEEEEeeccCccccEEEecccceEEEeccccccCC-eeEEEEEcccCCCCCccCCCceEEEEEec
Confidence 9999999999999999999999998 678999999999999999999998 99999999999999999999999999999
Q ss_pred CCCCCeeeccceeeeccccccccceEEE----------------------------------------------------
Q psy6838 255 NDNNPIFDSKSYSASVPENISVGATVLQ---------------------------------------------------- 282 (442)
Q Consensus 255 nd~~P~f~~~~~~~~v~e~~~~g~~v~~---------------------------------------------------- 282 (442)
|||+|+|...++++++.|+.+.|..++.
T Consensus 472 NDhaPifvstpfq~tvlEnv~lg~~v~~vqaidadsg~na~l~y~laG~~pf~I~~~SG~Itvtk~ldrEt~~~ysl~V~ 551 (2531)
T KOG4289|consen 472 NDHAPIFVSTPFQATVLENVPLGYLVCHVQAIDADSGENARLHYSLAGVGPFQINNGSGWITVTKELDRETVEHYSLGVE 551 (2531)
T ss_pred CCCCceeEechhhhhhhhcccccceEEEEecccCCCCcccceeeeeccCCCeeEecCCceEEEeecccccccceEEEEEE
Confidence 9999999999999999999999988765
Q ss_pred ------------------EeecCCCCCccCCcceeEEEecCCCCCcEEEEEEEEeCCCCCCeEEEEEEEeCCCC------
Q psy6838 283 ------------------VSDTNDNSPVFSEPVYAFDVLEDLPRGSRVGQVFTTDADEGVNGVVTYSVVSDWAN------ 338 (442)
Q Consensus 283 ------------------v~~~n~~~p~~~~~~~~~~v~E~~~~g~~v~~v~a~D~D~~~~~~i~y~l~~~~~~------ 338 (442)
+.++||+.|.|.+..|.+++.|++|.|+.|.+++|.|.|. +..++|.|.+++..
T Consensus 552 ard~gtp~l~tstsI~Vtv~dvndndP~Ft~~eytl~inED~pvgsSI~tvtAvD~d~--~s~ityqi~g~ntrn~Fsi~ 629 (2531)
T KOG4289|consen 552 ARDHGTPPLSTSTSISVTVLDVNDNDPTFTQKEYTLRINEDAPVGSSIVTVTAVDRDA--NSVITYQITGGNTRNRFSIS 629 (2531)
T ss_pred EcCCCCCcccccceEEEEecccCCCCCccccCceEEEecCCccccceEEEEEEecccc--ccceEEEecCCcccccceee
Confidence 5678999999999999999999999999999999999995 56688877653310
Q ss_pred --------------------------------------------------------------------------------
Q psy6838 339 -------------------------------------------------------------------------------- 338 (442)
Q Consensus 339 -------------------------------------------------------------------------------- 338 (442)
T Consensus 630 si~g~Glitlalp~dkKqe~~~vl~vtAtDg~l~d~~~V~v~I~danThrpvFqs~pfTvsI~e~rP~G~tvvtlsasd~ 709 (2531)
T KOG4289|consen 630 SIGGGGLITLALPLDKKQERQYVLAVTATDGTLQDTCSVNVNITDANTHRPVFQSSPFTVSINEDRPLGTTVVTLSASDE 709 (2531)
T ss_pred ccCCcceEEeecchhhcccceEEEEEEecCCccccceEEEEEeeecccCCcccccCCeeEeeccCCcCCceeEEEecccC
Confidence
Q ss_pred --------------CCeEEeCCccEEEEcccCCCccCcEEEEEEEEEECCCCCceEEEEEEEEEeeCCCCCCccCCCCCe
Q psy6838 339 --------------DVFSINPQSGIFTLTAKLDYEDVHHYIFVVQAQDTGKPMMSSTVTVYFNVVDLNDNPPLFDPMPPN 404 (442)
Q Consensus 339 --------------~~f~id~~tG~l~~~~~ld~e~~~~~~l~V~a~D~~~~~~~~~~~v~I~V~dvNd~~P~f~~~~~~ 404 (442)
..|+||+++|.+++...||||.+-.|++.|+|.|.+.|.+.++.+|.|.|.|+|||+|+|..+.|.
T Consensus 710 D~geNARI~y~led~~Frid~dsg~i~t~~~ld~edqvtytl~itA~D~~~pq~adtttveV~v~diNDnaPqf~assyt 789 (2531)
T KOG4289|consen 710 DTGENARITYILEDEAFRIDPDSGAIYTQAELDYEDQVTYTLAITARDNGIPQKADTTTVEVLVNDINDNAPQFLASSYT 789 (2531)
T ss_pred CCCccceEEEEecccceeecCCCCceEEeeeeecccceeeEeeeeecCCCCCCcCccEEEEEEeecccccCcccchhhce
Confidence 127999999999999999999999999999999999999999999999999999999999999999
Q ss_pred eEEecCCCCCceEEEEEEEeCCCC---eeEEEEeeCCC
Q psy6838 405 IPVYENVTLGTGLASVHAVDLDSG---KLPFLLPIGDF 439 (442)
Q Consensus 405 ~~v~E~~~~g~~v~~v~a~D~D~~---~~~y~i~~~~~ 439 (442)
++|.|++++++.+++|+|+|+|.+ .+.|.+.+|+.
T Consensus 790 ~sV~Ed~Pv~TsvlQVSatDaD~g~Ng~v~y~~qg~~d 827 (2531)
T KOG4289|consen 790 GSVFEDAPVFTSVLQVSATDADSGPNGRVYYTFQGGDD 827 (2531)
T ss_pred eEeecCCCCcceEEEEEEeccCCCCCceEEEEecCCCC
Confidence 999999999999999999999999 56666666543
No 3
>KOG1219|consensus
Probab=100.00 E-value=3.4e-64 Score=521.81 Aligned_cols=437 Identities=32% Similarity=0.507 Sum_probs=410.7
Q ss_pred CCcCCCCCCccCCCeEEEEEecCCCCCcEEEEEEeeeCCCCCCcEEEEEEEeCCCCCcEEEcCCccEEEEeeeccccccc
Q psy6838 1 VSDTNDNSPVFSEPVYAFDVLEDLPRGSRVGQVFTTDADEGVNGVVTYSVVSDWANDVFSINPQSGIFTLTAKLDYEDVS 80 (442)
Q Consensus 1 V~DvNDn~P~f~~~~~~~~v~E~~~~g~~v~~~~a~D~D~~~~~~~~y~l~~~~~~~~f~id~~tg~~~~~~~ld~~~~~ 80 (442)
|.|+|||+|.|....|.++++|+.|+|+.+....|+|+|.|.||.++|+|.. .....|+|++.+|.+++.++||||...
T Consensus 407 vld~n~n~pif~r~~~~ve~penvpig~~vl~~satDpdegengyvtysia~-~~~lPFaI~~~~GilsvS~kldrel~r 485 (4289)
T KOG1219|consen 407 VLDVNDNSPIFPRDVYRVEIPENVPIGTRVLISSATDPDEGENGYVTYSIAD-DTMLPFAIDQSDGILSVSGKLDRELRR 485 (4289)
T ss_pred EeccCCCCCcceeeeeeeecCCCCCcceEEEEEeccCCCcCcCceEEEEecC-CccCceEeccccceEEeccccCccccc
Confidence 5799999999999999999999999999999999999999999999999986 345789999999999999999999532
Q ss_pred --------------------------------------------------------------------------------
Q psy6838 81 -------------------------------------------------------------------------------- 80 (442)
Q Consensus 81 -------------------------------------------------------------------------------- 80 (442)
T Consensus 486 vYtfRv~Asd~G~per~~e~~~~I~ildlNDn~P~F~~~n~t~t~~~~~~vg~~l~tvsAtD~De~ellky~i~~~nel~ 565 (4289)
T KOG1219|consen 486 VYTFRVRASDWGVPERESEVHLNILILDLNDNPPNFEIRNCTGTINGDPKVGTKLFTVSATDLDELELLKYRILPGNELS 565 (4289)
T ss_pred eEEEEEEEeccCCcchhceeeEEEEEeccCCCCCcceeeecccccccCCCCCcEEEEeeccccCcccceeEEEEeCCcCc
Confidence
Q ss_pred --------------------------------------------------------------------------------
Q psy6838 81 -------------------------------------------------------------------------------- 80 (442)
Q Consensus 81 -------------------------------------------------------------------------------- 80 (442)
T Consensus 566 feln~nSgeisLvr~n~t~~~~s~~slv~a~d~G~p~as~t~lni~~~k~~Tgv~~~~~p~Ilq~~e~~~fPqf~s~fP~ 645 (4289)
T KOG1219|consen 566 FELNSNSGEISLVRQNNTECLQSCESLVIAADDGVPPASPTLLNITVMKYGTGVGNEHEPNILQRFENKHFPQFPSDFPF 645 (4289)
T ss_pred eeeccCCCeEEEEEccccccccccceEEEehhcCCCcCCceeeEEEEEecccccccccChhHhhhhccccCccccccCCc
Confidence
Q ss_pred -------------------------CCCcEEEEEEeCCCCCcEEEeCCccEEEEcccCCcccCcEEEEEEEEEeCCCCCC
Q psy6838 81 -------------------------DNGRLEYSIVSGDDEEIFTINPSNGTLSNLKHLDRETKSSYNLVVMATDMAKPPQ 135 (442)
Q Consensus 81 -------------------------~~~~i~y~i~~~~~~~~F~i~~~~g~l~~~~~lD~E~~~~~~l~v~a~D~~~~~~ 135 (442)
-|+.++|-|..|..+..|.|+..+|.|.+.++||+|...+|.|.|+|.|.|.|.
T Consensus 646 iI~v~Edvpigt~la~L~atD~Dtgfng~l~yvI~dgne~~~~~Id~qsg~itvas~ld~~~t~~yiLnvta~D~gtPq- 724 (4289)
T KOG1219|consen 646 IIVVPEDVPIGTTLAILSATDSDTGFNGKLVYVIEDGNESICFLIDRQSGNITVASPLDNENTEQYILNVTAYDLGTPQ- 724 (4289)
T ss_pred eEEccccCCCCceEEEEeccCCCCCcCceEEEEEeCCccceEEEEecccceEEEecchhhhhhheeEEEEEEecCCCch-
Confidence 578899999999988999999999999999999999999999999999999875
Q ss_pred CCceeEEEEEEEEEeccCCCCeeccC-CeeEeeCCCCCCcEEEEEEEEeCCCCCCceEEEEEecCCCCCeEecCcceEEE
Q psy6838 136 KRLSSTTQVSIVIRDVNDNAPEFITA-NETTIPENSPLNTVVAALKAIDRDEGQNSYVEYSIINSEATPFSLGPVDGLLR 214 (442)
Q Consensus 136 ~~~s~~~~v~I~V~dvNd~~P~f~~~-~~~~v~E~~~~g~~v~~v~a~D~D~~~~~~~~y~i~~~~~~~F~id~~tG~i~ 214 (442)
.++.....|.|.|.|||+|+|... |.++|.|+..+|+.|+++.|.|.|.|+|+.++|+|.... ..|+||+.+|.+.
T Consensus 725 --kss~r~l~v~vkd~ndn~p~f~e~sy~vtvsedtepgs~Ia~vetnd~D~g~NG~v~fsL~n~s-dvfsIdp~tGivv 801 (4289)
T KOG1219|consen 725 --KSSWRLLLVFVKDYNDNTPIFVERSYHVTVSEDTEPGSFIAHVETNDTDGGNNGMVSFSLLNKS-DVFSIDPFTGIVV 801 (4289)
T ss_pred --hhceeeEEEEEEecccCCccccccceEEEEecCCCCCceEEEEEecccCCCCCceEEEEecCCc-ceEEecCcccEEE
Confidence 467788889999999999999876 999999999999999999999999999999999999954 6799999999999
Q ss_pred EcccCCcccccEEEEEEEEEeCCCCCCceeEEEEEEEEeCCCCCCeeeccceeeeccccccccceEEE------------
Q psy6838 215 VAQKLDREIKSNYTLEIKAKDRGEPSKSSRTKFFIRLLDENDNNPIFDSKSYSASVPENISVGATVLQ------------ 282 (442)
Q Consensus 215 ~~~~lD~e~~~~~~l~V~a~D~~~~~~~~~~~v~I~V~d~nd~~P~f~~~~~~~~v~e~~~~g~~v~~------------ 282 (442)
+.++||||.+..|+|.|+|+|.+.|.+.+...+.|.|.|+|||+|.|....++..++|+.+.|+.+..
T Consensus 802 ~~~sLdrE~q~~y~l~I~a~dqp~pq~~svv~l~vsvedVndnpPkci~~hsr~kipedlp~gt~~~~l~A~d~diGq~~ 881 (4289)
T KOG1219|consen 802 TSKSLDREGQTSYHLKIEARDQPPPQLFSVVELDVSVEDVNDNPPKCIIRHSRSKIPEDLPYGTVTWQLVALDPDIGQLG 881 (4289)
T ss_pred eccccCcccCceeEEEEEEcCCCCCceEEEEEEEEEEeeccCCCCccccccccccCcccCCCceEEEEhhhcCcccCcCc
Confidence 99999999999999999999999888999999999999999999999999999999999999887644
Q ss_pred -----------------------------------------------------------Eeec--CCCCCccCCcceeEE
Q psy6838 283 -----------------------------------------------------------VSDT--NDNSPVFSEPVYAFD 301 (442)
Q Consensus 283 -----------------------------------------------------------v~~~--n~~~p~~~~~~~~~~ 301 (442)
+.++ |-++|+|..-.-..+
T Consensus 882 kvry~l~~~~v~~rvd~~sGavfi~~~LDf~k~~fynLsv~a~d~g~p~lss~chl~Vevldv~enlhpp~F~~~v~e~~ 961 (4289)
T KOG1219|consen 882 KVRYYLTDDTVGERVDFPSGAVFIGKPLDFEKSDFYNLSVTAVDRGTPILSSICHLEVEVLDVNENLHPPEFISFVTEGH 961 (4289)
T ss_pred eeEEEEecCccccccccccccEEEecccccccccceEEEEEEecCCCcceeeeEEEEEEEeccCCCCCCcchheeeeeee
Confidence 2223 457999999999999
Q ss_pred EecCCCCCcEEEEEEEEeCCCCCCeEEEEEEEeCCCCCCeEEeCCccEEEEcccCCCccCcEEEEEEEEEECCCCCceEE
Q psy6838 302 VLEDLPRGSRVGQVFTTDADEGVNGVVTYSVVSDWANDVFSINPQSGIFTLTAKLDYEDVHHYIFVVQAQDTGKPMMSST 381 (442)
Q Consensus 302 v~E~~~~g~~v~~v~a~D~D~~~~~~i~y~l~~~~~~~~f~id~~tG~l~~~~~ld~e~~~~~~l~V~a~D~~~~~~~~~ 381 (442)
|.||+|.|+.+++|.|.|.|.|..+.+.|+|..|+.-+.|+|+.++|.|++.+.||+|....|.|+|.|.|.|.+++++.
T Consensus 962 V~EnapiGT~vi~i~A~dedsgldg~l~Y~I~~gdg~g~FsId~~tG~irTl~~lDrE~ks~YwltveA~D~gt~~~ssv 1041 (4289)
T KOG1219|consen 962 VLENAPIGTIVIRIQARDEDSGLDGELSYKIRTGDGDGIFSIDSTTGSIRTLKALDREKKSSYWLTVEAKDLGTVPLSSV 1041 (4289)
T ss_pred EeecCCcceEEEEEEEecCCCCccceEEEEEEcCCcceeEEecCCcceEeechhhchhhcceEEEEEEEEecCCCccccc
Confidence 99999999999999999999999999999999998889999999999999999999999999999999999999999999
Q ss_pred EEEEEEEeeCCCCCCccCCCCCeeEEecCCCCCceEEEEEEEeCCCC---eeEEEEeeCCC-CCC
Q psy6838 382 VTVYFNVVDLNDNPPLFDPMPPNIPVYENVTLGTGLASVHAVDLDSG---KLPFLLPIGDF-DLL 442 (442)
Q Consensus 382 ~~v~I~V~dvNd~~P~f~~~~~~~~v~E~~~~g~~v~~v~a~D~D~~---~~~y~i~~~~~-~~f 442 (442)
+.+.|.|.|+|||+|+|.+.-|.++|.||++.+..|.++.|.|+|.. .|.|.|.+||. |+|
T Consensus 1042 ~~vyI~ieDvNDn~Pq~s~pvy~asI~enSp~~vsivq~ea~D~Dsssn~kLmykI~sGnyq~FF 1106 (4289)
T KOG1219|consen 1042 CEVYIEIEDVNDNVPQFSSPVYYASISENSPETVSIVQAEANDPDSSSNQKLMYKITSGNYQGFF 1106 (4289)
T ss_pred eeEEEEEEecCCCCcccCCceEeeeeccCCCCceEEEEeccCCCCcccCcceEEEEccCCccceE
Confidence 99999999999999999999999999999999999999999999955 99999999998 555
No 4
>KOG1219|consensus
Probab=100.00 E-value=5.5e-62 Score=505.50 Aligned_cols=429 Identities=35% Similarity=0.535 Sum_probs=398.1
Q ss_pred CCcCCCCCCccCCCeEEEEEecCCCCCcEEEEEEeeeCCCCCCcEEEEEEEeCCC-CCcEEEcCCccEEEEeeecccccc
Q psy6838 1 VSDTNDNSPVFSEPVYAFDVLEDLPRGSRVGQVFTTDADEGVNGVVTYSVVSDWA-NDVFSINPQSGIFTLTAKLDYEDV 79 (442)
Q Consensus 1 V~DvNDn~P~f~~~~~~~~v~E~~~~g~~v~~~~a~D~D~~~~~~~~y~l~~~~~-~~~f~id~~tg~~~~~~~ld~~~~ 79 (442)
|.|||||+|+|.+..|.++++|++++|+.|.++.|+|.|. |..+.|+|.+++. ...|+|+..||.+++.++||||..
T Consensus 2168 V~dIndn~PvFeqlsYt~sisE~s~igt~viqilATdsDs--n~~isYsl~g~s~~sk~f~In~sTG~it~~g~ldyE~~ 2245 (4289)
T KOG1219|consen 2168 VGDINDNPPVFEQLSYTISISENSKIGTKVIQILATDSDS--NREISYSLEGNSEISKPFRINVSTGWITVAGKLDYEEN 2245 (4289)
T ss_pred ecccCCCCchhheeeEEEEccCCCccCceEEEEEeccCCC--CCceEEEeecCCccccceEEecccceEEEeeecChhhc
Confidence 6799999999999999999999999999999999999997 8999999998654 578999999999999999999987
Q ss_pred c-------------------------------------------------------------------------------
Q psy6838 80 S------------------------------------------------------------------------------- 80 (442)
Q Consensus 80 ~------------------------------------------------------------------------------- 80 (442)
.
T Consensus 2246 q~f~~fvratdggk~lSseviv~V~VeD~Ndn~Pef~q~~~ea~vsd~a~~g~fit~v~a~D~Dssd~lk~ey~~~~~l~ 2325 (4289)
T KOG1219|consen 2246 QEFRFFVRATDGGKPLSSEVIVEVHVEDFNDNPPEFNQRNYEAFVSDPARSGHFITVVNAHDLDSSDHLKLEYNSNHFLI 2325 (4289)
T ss_pred ceEEEEEEEccCCCcccccEEEEEEehhcCCCCchhccccceeecCCCccceeEEEEEEeccCCccchhhhhhcccceee
Confidence 6
Q ss_pred ------------------------------------------------------------------------CCCcEEEE
Q psy6838 81 ------------------------------------------------------------------------DNGRLEYS 88 (442)
Q Consensus 81 ------------------------------------------------------------------------~~~~i~y~ 88 (442)
.++.+.|.
T Consensus 2326 ~s~~G~iTlfNl~k~~l~~s~~lrv~vsD~v~~at~~vl~~~~~~n~~~~lveka~l~Tv~~~~~~~~~~f~~~gt~~~~ 2405 (4289)
T KOG1219|consen 2326 LSENGIITLFNLLKSPLQTSYPLRVTVSDGVFRATMEVLFHPHSRNHFSELVEKADLVTVVEHDEQEDADFGAYGTSIYY 2405 (4289)
T ss_pred eccCceEEehhhcccccccccceeeeeccCcceeeeEEEEEecCcccchhhhhccceeEEEEecCccccccccCCceeee
Confidence 55667777
Q ss_pred EEeCCCCCcEEEeCCccEEEEcccCCcccCcEEE--EEEEEEeCCCCCCCCceeEEEEEEEEEeccCCCCeeccC-CeeE
Q psy6838 89 IVSGDDEEIFTINPSNGTLSNLKHLDRETKSSYN--LVVMATDMAKPPQKRLSSTTQVSIVIRDVNDNAPEFITA-NETT 165 (442)
Q Consensus 89 i~~~~~~~~F~i~~~~g~l~~~~~lD~E~~~~~~--l~v~a~D~~~~~~~~~s~~~~v~I~V~dvNd~~P~f~~~-~~~~ 165 (442)
+....++..|.|++. |.|.+.+.||||....+. +.+.|.|++| +.+-++++|.+.|+|||+|+|... |.+.
T Consensus 2406 si~s~~sd~~~in~~-GqI~t~~kld~e~s~~~vi~i~v~a~Da~g-----r~af~tvti~ltDiNDnpPqF~a~~Y~~n 2479 (4289)
T KOG1219|consen 2406 SINSRASDHFEINKS-GQIKTLSKLDREYSEELVIIIAVMAFDAGG-----RVAFCTVTIILTDINDNPPQFDAQLYRVN 2479 (4289)
T ss_pred eechhccCceeECCC-ccEEeeehhhhccCceEEEEEEEEEecCCC-----eEEEEEEEEEEEecCCCCccccceeEEEE
Confidence 777778899999977 999999999999866554 5566679887 588999999999999999999987 9999
Q ss_pred eeCCCCCCcEEEEEEEEeCCCCCCceEEEEEecCC--CCCeEecCcceEEEEcccCCcccccEEEEEEEEEeCCCCCCce
Q psy6838 166 IPENSPLNTVVAALKAIDRDEGQNSYVEYSIINSE--ATPFSLGPVDGLLRVAQKLDREIKSNYTLEIKAKDRGEPSKSS 243 (442)
Q Consensus 166 v~E~~~~g~~v~~v~a~D~D~~~~~~~~y~i~~~~--~~~F~id~~tG~i~~~~~lD~e~~~~~~l~V~a~D~~~~~~~~ 243 (442)
|.|++..|..|+.+.|+|.|.+.|+.++|.+.++. ...|.|++ +|.|++++.|++++...|+|.|+|.|.|.|.+.+
T Consensus 2480 I~enaskg~~V~~v~A~D~De~snadvty~i~~e~~~~~v~~in~-sG~Itv~~sL~~~en~tl~l~vkA~D~g~P~~~s 2558 (4289)
T KOG1219|consen 2480 ITENASKGKLVGHVIARDADEGSNADVTYEIVGESDVKHVFEINE-SGVITVKRSLDGLENSTLHLFVKAIDDGKPRRRS 2558 (4289)
T ss_pred eecccCCCceEEEEEEecCCCCCcccEEEEecCchhhhheeeecC-CceEEeehhhhcccCcEEEEEEEeccCCCCCccc
Confidence 99999999999999999999999999999999885 45688988 9999999999999999999999999999999999
Q ss_pred eEEEEEEEEeCCCCCCeeeccceeeeccccccccceEEE-----------------------------------------
Q psy6838 244 RTKFFIRLLDENDNNPIFDSKSYSASVPENISVGATVLQ----------------------------------------- 282 (442)
Q Consensus 244 ~~~v~I~V~d~nd~~P~f~~~~~~~~v~e~~~~g~~v~~----------------------------------------- 282 (442)
.++|.|+|.+...+.|.|..+.|.++++|+.+.|..|.+
T Consensus 2559 ~ttV~v~vl~e~v~lPrFSep~y~fsvpEDv~vG~~Ig~v~a~~a~~~~i~~~v~~gt~Esn~d~~Fsvdr~TG~i~v~k 2638 (4289)
T KOG1219|consen 2559 NTTVIVTVLPEDVNLPRFSEPIYTFSVPEDVPVGEEIGQVSASDADEHVIYSLVLGGTPESNPDLPFSVDRNTGMIKVNK 2638 (4289)
T ss_pred ceEEEEEecCcccCcccccCceEEEeccccCCCCCeeeEEeecccCCceEEEEEeCCCCCCCCCCceEEcCCCceEEecc
Confidence 999999999999999999999999999999999998866
Q ss_pred -------------------------------EeecCCCCCccCCcceeEEEecCCCCCcEEEEEEEEeCCCCCCeEEEEE
Q psy6838 283 -------------------------------VSDTNDNSPVFSEPVYAFDVLEDLPRGSRVGQVFTTDADEGVNGVVTYS 331 (442)
Q Consensus 283 -------------------------------v~~~n~~~p~~~~~~~~~~v~E~~~~g~~v~~v~a~D~D~~~~~~i~y~ 331 (442)
|.++||++|+|....|.+.+.||+|.|+.|++++|.|.|.+.+++++|+
T Consensus 2639 sLD~E~kk~yqi~v~a~~~~~vva~tsv~vqVkDvNDNaPvFe~d~y~f~i~En~pvGtsV~qf~AsD~Ds~~nGqirys 2718 (4289)
T KOG1219|consen 2639 SLDHEKKKSYQIKVKATCGQWVVAETSVFVQVKDVNDNAPVFEKDPYLFIIEENSPVGTSVIQFHASDMDSGNNGQIRYS 2718 (4289)
T ss_pred ccchhhhceEEEEEEeecCCceEEEEEEEEEeecccCCCccccCCceeEEEeccCCCCceEEEEEeeccCCCCCceEEEE
Confidence 6688999999999999999999999999999999999999999999999
Q ss_pred EEeCCCCCCeEEeCCccEEEEcccCCCccCcEEEEEEEEEECCCCCceEEEEEEEEEeeCCCCCCccCCCCCeeEEecCC
Q psy6838 332 VVSDWANDVFSINPQSGIFTLTAKLDYEDVHHYIFVVQAQDTGKPMMSSTVTVYFNVVDLNDNPPLFDPMPPNIPVYENV 411 (442)
Q Consensus 332 l~~~~~~~~f~id~~tG~l~~~~~ld~e~~~~~~l~V~a~D~~~~~~~~~~~v~I~V~dvNd~~P~f~~~~~~~~v~E~~ 411 (442)
|... ..+|.|+++||.|++...||.|++..|.|.|+|+|.|.+ ++.++|.|+|.|+||+||.|....|..++.|+.
T Consensus 2719 l~~~--v~yF~In~etGwlTt~~eld~ek~d~y~lkv~AtDhG~~--ssq~~v~v~vtDvndspprf~~eiy~gtvv~d~ 2794 (4289)
T KOG1219|consen 2719 LTSP--VPYFAINPETGWLTTLFELDLEKQDLYSLKVVATDHGVP--SSQATVLVHVTDVNDSPPRFQREIYEGTVVEDV 2794 (4289)
T ss_pred EcCC--cceEEEcCCCCeeeehhhhccccCCceEEEEEEecCCcc--cccceEEEEEEecCCCcchhhhHhhccceeccC
Confidence 9854 339999999999999999999999999999999999995 788999999999999999999999999999999
Q ss_pred CCCceEEEEEEEeCCCC---eeEEEEeeCCC-CCC
Q psy6838 412 TLGTGLASVHAVDLDSG---KLPFLLPIGDF-DLL 442 (442)
Q Consensus 412 ~~g~~v~~v~a~D~D~~---~~~y~i~~~~~-~~f 442 (442)
+.|..++.++..|.|.. ..+|-|..|++ |-|
T Consensus 2795 p~~~~ia~~si~d~D~s~~nq~t~fI~~gd~~gqF 2829 (4289)
T KOG1219|consen 2795 PGGKIIAGLSIFDADVSEVNQVTGFITLGDPLGQF 2829 (4289)
T ss_pred CCCceeeeeEeccccccccceeEEEEeCCCccceE
Confidence 99999999999999999 58899999876 543
No 5
>cd00031 CA Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here
Probab=100.00 E-value=9.1e-31 Score=233.73 Aligned_cols=197 Identities=43% Similarity=0.665 Sum_probs=178.5
Q ss_pred CeeEeeCCCCCCcEEEEEEEEeCCCCCCceEEEEEecCCC-CCeEecCcceEEEEcccCCcccccEEEEEEEEEeCCCCC
Q psy6838 162 NETTIPENSPLNTVVAALKAIDRDEGQNSYVEYSIINSEA-TPFSLGPVDGLLRVAQKLDREIKSNYTLEIKAKDRGEPS 240 (442)
Q Consensus 162 ~~~~v~E~~~~g~~v~~v~a~D~D~~~~~~~~y~i~~~~~-~~F~id~~tG~i~~~~~lD~e~~~~~~l~V~a~D~~~~~ 240 (442)
+.+.+.|+++.|+.++++.|.|+|.+.++.++|+|.++.. .+|.|++.+|.|++.+.||||....|.|.|+|+|.|.+.
T Consensus 2 ~~~~i~En~~~g~~v~~~~a~D~D~~~~~~~~y~i~~~~~~~~F~i~~~tG~l~~~~~lD~e~~~~~~l~v~a~D~g~~~ 81 (199)
T cd00031 2 YSVSVPENAPPGTVVGTVSATDPDSGENGRVTYSILGGNEDGLFSIDPNTGVITTTKPLDREEQSEYTLTVVASDGGGPP 81 (199)
T ss_pred eEEEEeCCCCCCCEEEEEEEECCCCCCCceEEEEEeCCCCcccEEEeCCCCEEEECCCCCCcCCceEEEEEEEEECCcCc
Confidence 5688999999999999999999999888889999999874 799999999999999999999999999999999988877
Q ss_pred CceeEEEEEEEEeCCCCCCeeeccceeeeccccccccceEEEEeecCCCCCccCCcceeEEEecCCCCCcEEEEEEEEeC
Q psy6838 241 KSSRTKFFIRLLDENDNNPIFDSKSYSASVPENISVGATVLQVSDTNDNSPVFSEPVYAFDVLEDLPRGSRVGQVFTTDA 320 (442)
Q Consensus 241 ~~~~~~v~I~V~d~nd~~P~f~~~~~~~~v~e~~~~g~~v~~v~~~n~~~p~~~~~~~~~~v~E~~~~g~~v~~v~a~D~ 320 (442)
+++...++|.|.|+||++|.|....| .+.|.|+.+.|+.++++.|.|+
T Consensus 82 ~~~~~~v~I~V~d~Nd~~P~~~~~~~--------------------------------~~~v~e~~~~~~~i~~~~a~D~ 129 (199)
T cd00031 82 LSSTATVTVTVLDVNDNPPVFEQSSY--------------------------------EASVPENAPPGTVVGTVTATDA 129 (199)
T ss_pred ceeEEEEEEEEccCCCCCCcccccce--------------------------------EEEEeCCCCCCCEEEEEEEEcC
Confidence 77999999999999999999876544 4556677777888888888999
Q ss_pred CCCCCeEEEEEEEeCCCCCCeEEeCCccEEEEcccCCCccCcEEEEEEEEEECCCCCceEEEEEEEEEee
Q psy6838 321 DEGVNGVVTYSVVSDWANDVFSINPQSGIFTLTAKLDYEDVHHYIFVVQAQDTGKPMMSSTVTVYFNVVD 390 (442)
Q Consensus 321 D~~~~~~i~y~l~~~~~~~~f~id~~tG~l~~~~~ld~e~~~~~~l~V~a~D~~~~~~~~~~~v~I~V~d 390 (442)
|.+.++.++|+|.++....+|.|++.+|.|++.+.||||....|.|.|.|+|.+.+.+++++.++|.|.|
T Consensus 130 D~~~~~~~~y~l~~~~~~~~f~i~~~~G~i~~~~~ld~e~~~~~~l~v~a~D~~~~~~~~~~~i~i~v~d 199 (199)
T cd00031 130 DSGENAKLTYSILSGNDKELFSIDPNTGIITLAKPLDREEKSSYELTVVATDGGGPPLSSTATVTVTVLD 199 (199)
T ss_pred CCCCCccEEEEEeCCCCCCEEEEeCCceEEEeCCccCCccCceEEEEEEEEECCCCCceeEEEEEEEEEC
Confidence 9988899999999765558999999999999999999999999999999999987778999999999875
No 6
>cd00031 CA Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here
Probab=100.00 E-value=1.5e-30 Score=232.33 Aligned_cols=196 Identities=43% Similarity=0.726 Sum_probs=177.4
Q ss_pred EEEEEecCCCCCcEEEEEEeeeCCCCCCcEEEEEEEeCCCCCcEEEcCCccEEEEeeecccccccCCCcEEEEEEeCCCC
Q psy6838 16 YAFDVLEDLPRGSRVGQVFTTDADEGVNGVVTYSVVSDWANDVFSINPQSGIFTLTAKLDYEDVSDNGRLEYSIVSGDDE 95 (442)
Q Consensus 16 ~~~~v~E~~~~g~~v~~~~a~D~D~~~~~~~~y~l~~~~~~~~f~id~~tg~~~~~~~ld~~~~~~~~~i~y~i~~~~~~ 95 (442)
|.+.|+|++++|+.|+++.|.|+|.+.++.++|+|.++ ...
T Consensus 2 ~~~~i~En~~~g~~v~~~~a~D~D~~~~~~~~y~i~~~---------------------------------------~~~ 42 (199)
T cd00031 2 YSVSVPENAPPGTVVGTVSATDPDSGENGRVTYSILGG---------------------------------------NED 42 (199)
T ss_pred eEEEEeCCCCCCCEEEEEEEECCCCCCCceEEEEEeCC---------------------------------------CCc
Confidence 68899999999999999999999987777788877663 223
Q ss_pred CcEEEeCCccEEEEcccCCcccCcEEEEEEEEEeCCCCCCCCceeEEEEEEEEEeccCCCCeec-cCCeeEeeCCCCCCc
Q psy6838 96 EIFTINPSNGTLSNLKHLDRETKSSYNLVVMATDMAKPPQKRLSSTTQVSIVIRDVNDNAPEFI-TANETTIPENSPLNT 174 (442)
Q Consensus 96 ~~F~i~~~~g~l~~~~~lD~E~~~~~~l~v~a~D~~~~~~~~~s~~~~v~I~V~dvNd~~P~f~-~~~~~~v~E~~~~g~ 174 (442)
.+|.|++.+|.|++.+.||||....|.|.|+|+|.+.+. ++++..+.|.|.|+||++|.|. ..+.+.+.|+.++|+
T Consensus 43 ~~F~i~~~tG~l~~~~~lD~e~~~~~~l~v~a~D~g~~~---~~~~~~v~I~V~d~Nd~~P~~~~~~~~~~v~e~~~~~~ 119 (199)
T cd00031 43 GLFSIDPNTGVITTTKPLDREEQSEYTLTVVASDGGGPP---LSSTATVTVTVLDVNDNPPVFEQSSYEASVPENAPPGT 119 (199)
T ss_pred ccEEEeCCCCEEEECCCCCCcCCceEEEEEEEEECCcCc---ceeEEEEEEEEccCCCCCCcccccceEEEEeCCCCCCC
Confidence 578888788999999999999999999999999987653 4689999999999999999999 559999999999999
Q ss_pred EEEEEEEEeCCCCCCceEEEEEecCCC-CCeEecCcceEEEEcccCCcccccEEEEEEEEEeCCCCCCceeEEEEEEEEe
Q psy6838 175 VVAALKAIDRDEGQNSYVEYSIINSEA-TPFSLGPVDGLLRVAQKLDREIKSNYTLEIKAKDRGEPSKSSRTKFFIRLLD 253 (442)
Q Consensus 175 ~v~~v~a~D~D~~~~~~~~y~i~~~~~-~~F~id~~tG~i~~~~~lD~e~~~~~~l~V~a~D~~~~~~~~~~~v~I~V~d 253 (442)
.++++.|+|+|.+.++.++|+|.++.. .+|.|++.+|.|++.+.||||....|.|.|.|+|.+.+.++....++|.|.|
T Consensus 120 ~i~~~~a~D~D~~~~~~~~y~l~~~~~~~~f~i~~~~G~i~~~~~ld~e~~~~~~l~v~a~D~~~~~~~~~~~i~i~v~d 199 (199)
T cd00031 120 VVGTVTATDADSGENAKLTYSILSGNDKELFSIDPNTGIITLAKPLDREEKSSYELTVVATDGGGPPLSSTATVTVTVLD 199 (199)
T ss_pred EEEEEEEEcCCCCCCccEEEEEeCCCCCCEEEEeCCceEEEeCCccCCccCceEEEEEEEEECCCCCceeEEEEEEEEEC
Confidence 999999999999888889999999874 7999999999999999999999999999999999988888899999998875
No 7
>KOG1834|consensus
Probab=99.78 E-value=1.5e-17 Score=160.29 Aligned_cols=210 Identities=26% Similarity=0.375 Sum_probs=168.9
Q ss_pred EEEEEeccCCCCeeccCCeeEeeCCCCCCcEEEEEEEEeCCCCC---CceEEEEEecCCCCCeE---ecCcce--EEEEc
Q psy6838 145 SIVIRDVNDNAPEFITANETTIPENSPLNTVVAALKAIDRDEGQ---NSYVEYSIINSEATPFS---LGPVDG--LLRVA 216 (442)
Q Consensus 145 ~I~V~dvNd~~P~f~~~~~~~v~E~~~~g~~v~~v~a~D~D~~~---~~~~~y~i~~~~~~~F~---id~~tG--~i~~~ 216 (442)
.....-+|-|.|+....|...|.||...-.....+.|.|.|.+- ...+-|.|.+.+ -+|. +|..|| .|+.+
T Consensus 20 ~~~aarankhkpwie~ey~gvV~Endntvll~Ppl~aLdkdaplr~ageiC~fklhgq~-vPFdavVvdK~TGegvlRaK 98 (952)
T KOG1834|consen 20 HHHAARANKHKPWIEEEYHGVVTENDNTVLLDPPLAALDKDAPLRYAGEICGFKLHGQP-VPFDAVVVDKYTGEGVLRAK 98 (952)
T ss_pred ccccccccccCcccccceeEEEEeCCceEEeCCCeeeecCCCCcccccccceeEecCCC-CCceEEEEeccCCceEEeec
Confidence 34566788899999999999999998655556678899999841 233578888765 4554 466665 68888
Q ss_pred ccCCcccccEEEEEEEEEeCCCCC------CceeEEEEEEEEeCCCCCCeeeccceeeeccccccccceEEEEeecCCCC
Q psy6838 217 QKLDREIKSNYTLEIKAKDRGEPS------KSSRTKFFIRLLDENDNNPIFDSKSYSASVPENISVGATVLQVSDTNDNS 290 (442)
Q Consensus 217 ~~lD~e~~~~~~l~V~a~D~~~~~------~~~~~~v~I~V~d~nd~~P~f~~~~~~~~v~e~~~~g~~v~~v~~~n~~~ 290 (442)
.+||.|.++.|+|+|+|.|.|..+ .+-.++|.|+|.|+|+.+|.|..+.|.+.|.|...
T Consensus 99 ~~lDCelqkeytf~iQAydCg~gpdgtn~kKShkatvhIrVkDvNe~AP~f~ep~Yka~V~EGK~--------------- 163 (952)
T KOG1834|consen 99 EPLDCELQKEYTFTIQAYDCGNGPDGTNTKKSHKATVHIRVKDVNEFAPVFKEPWYKAHVTEGKV--------------- 163 (952)
T ss_pred CcccccccccceEEEEEEecCCCCCccccccccceEEEEEeccccccCchhcccceeeEEeccee---------------
Confidence 999999999999999999987543 56678999999999999999999988888776522
Q ss_pred CccCCcceeEEEecCCCCCcEEEEEEEEeCCCC--CCeEEEEEEEeCCCCCCeEEeCCccEEEEcccCCCccCcEEEEEE
Q psy6838 291 PVFSEPVYAFDVLEDLPRGSRVGQVFTTDADEG--VNGVVTYSVVSDWANDVFSINPQSGIFTLTAKLDYEDVHHYIFVV 368 (442)
Q Consensus 291 p~~~~~~~~~~v~E~~~~g~~v~~v~a~D~D~~--~~~~i~y~l~~~~~~~~f~id~~tG~l~~~~~ld~e~~~~~~l~V 368 (442)
...|+++.|.|.|.+ ....+.|.|.. ..-+|.||.. |.|+.+.+|+|...+.|.|+|
T Consensus 164 ------------------yd~il~veAiD~DCspq~sqIC~YEI~t--~d~PFaIdn~-G~irnTekLny~ke~~Y~ltV 222 (952)
T KOG1834|consen 164 ------------------YDSILRVEAIDKDCSPQYSQICEYEITT--PDVPFAIDND-GNIRNTEKLNYTKEHQYKLTV 222 (952)
T ss_pred ------------------eeeeEEEEeecCCCCCcccceeEEEecC--CCCceEEcCC-CccccccccccccceeEEEEE
Confidence 235677888999974 34567899984 6778999975 999999999999999999999
Q ss_pred EEEECCCCCceEEEEEEEEEeeC
Q psy6838 369 QAQDTGKPMMSSTVTVYFNVVDL 391 (442)
Q Consensus 369 ~a~D~~~~~~~~~~~v~I~V~dv 391 (442)
+|.|+|......-+.|+|+|...
T Consensus 223 tAyDCg~kraa~d~lV~v~Vkp~ 245 (952)
T KOG1834|consen 223 TAYDCGKKRAASDSLVTVHVKPT 245 (952)
T ss_pred EEEecccccccCcceEEEEecCc
Confidence 99999985555557888888765
No 8
>PF00028 Cadherin: Cadherin domain; InterPro: IPR002126 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion.; GO: 0005509 calcium ion binding, 0007156 homophilic cell adhesion, 0016020 membrane; PDB: 2A4E_A 2A4C_B 2O72_A 2QVI_A 1NCJ_A 3Q2W_A 3Q2N_A 3LNH_B 3LNI_A 3Q2L_A ....
Probab=99.77 E-value=1.4e-17 Score=129.11 Aligned_cols=92 Identities=43% Similarity=0.694 Sum_probs=88.1
Q ss_pred eeEEEecCCCCCcEEEEEEEEeCCCCCCeEEEEEEEeCCCCCCeEEeCCccEEEEcccCCCccCcEEEEEEEEEEC-CCC
Q psy6838 298 YAFDVLEDLPRGSRVGQVFTTDADEGVNGVVTYSVVSDWANDVFSINPQSGIFTLTAKLDYEDVHHYIFVVQAQDT-GKP 376 (442)
Q Consensus 298 ~~~~v~E~~~~g~~v~~v~a~D~D~~~~~~i~y~l~~~~~~~~f~id~~tG~l~~~~~ld~e~~~~~~l~V~a~D~-~~~ 376 (442)
|.+.|.|++++|+.++++.|.|+|.+.++.+.|+|..++..++|.|++.+|.|++.+.||||....|.|.|.|+|. +.|
T Consensus 1 Y~~~v~E~~~~g~~v~~v~a~D~D~~~n~~i~y~i~~~~~~~~F~I~~~tg~i~~~~~LD~E~~~~y~l~v~a~D~~~~~ 80 (93)
T PF00028_consen 1 YSFSVPENAPPGTVVGQVTATDPDSGPNSQITYSILGGNPDGLFSIDPNTGEISLKKPLDRETQSSYQLTVRATDSGGSP 80 (93)
T ss_dssp EEEEEETTGSTSSEEEEEEEEESSTSTTSSEEEEEEETTSTTSEEEETTTTEEEESSSSCTTTTSEEEEEEEEEETTTSS
T ss_pred CEEEEECCCCCCCEEEEEEEEeCCCCCCceEEEEEecCcccCceEEeeeeeccccceecCcccCCEEEEEEEEEECCCCC
Confidence 6789999999999999999999999999999999998888899999999999999999999999999999999999 889
Q ss_pred CceEEEEEEEEEe
Q psy6838 377 MMSSTVTVYFNVV 389 (442)
Q Consensus 377 ~~~~~~~v~I~V~ 389 (442)
+++++++|+|+|+
T Consensus 81 ~~~~~~~V~I~V~ 93 (93)
T PF00028_consen 81 PLSSTATVTINVL 93 (93)
T ss_dssp EEEEEEEEEEEEE
T ss_pred CCEEEEEEEEEEC
Confidence 9999999999985
No 9
>PF00028 Cadherin: Cadherin domain; InterPro: IPR002126 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion.; GO: 0005509 calcium ion binding, 0007156 homophilic cell adhesion, 0016020 membrane; PDB: 2A4E_A 2A4C_B 2O72_A 2QVI_A 1NCJ_A 3Q2W_A 3Q2N_A 3LNH_B 3LNI_A 3Q2L_A ....
Probab=99.72 E-value=1.2e-16 Score=123.82 Aligned_cols=91 Identities=38% Similarity=0.626 Sum_probs=86.8
Q ss_pred CeeEeeCCCCCCcEEEEEEEEeCCCCCCceEEEEEecCC-CCCeEecCcceEEEEcccCCcccccEEEEEEEEEeC-CCC
Q psy6838 162 NETTIPENSPLNTVVAALKAIDRDEGQNSYVEYSIINSE-ATPFSLGPVDGLLRVAQKLDREIKSNYTLEIKAKDR-GEP 239 (442)
Q Consensus 162 ~~~~v~E~~~~g~~v~~v~a~D~D~~~~~~~~y~i~~~~-~~~F~id~~tG~i~~~~~lD~e~~~~~~l~V~a~D~-~~~ 239 (442)
|.+.|+|++++|+.++++.|.|+|.+.++.+.|+|..+. ..+|.|++.+|.|++.++||||....|.|.|.|+|. +.|
T Consensus 1 Y~~~v~E~~~~g~~v~~v~a~D~D~~~n~~i~y~i~~~~~~~~F~I~~~tg~i~~~~~LD~E~~~~y~l~v~a~D~~~~~ 80 (93)
T PF00028_consen 1 YSFSVPENAPPGTVVGQVTATDPDSGPNSQITYSILGGNPDGLFSIDPNTGEISLKKPLDRETQSSYQLTVRATDSGGSP 80 (93)
T ss_dssp EEEEEETTGSTSSEEEEEEEEESSTSTTSSEEEEEEETTSTTSEEEETTTTEEEESSSSCTTTTSEEEEEEEEEETTTSS
T ss_pred CEEEEECCCCCCCEEEEEEEEeCCCCCCceEEEEEecCcccCceEEeeeeeccccceecCcccCCEEEEEEEEEECCCCC
Confidence 578999999999999999999999999999999999998 789999999999999999999999999999999999 888
Q ss_pred CCceeEEEEEEEE
Q psy6838 240 SKSSRTKFFIRLL 252 (442)
Q Consensus 240 ~~~~~~~v~I~V~ 252 (442)
++++++.|+|.|.
T Consensus 81 ~~~~~~~V~I~V~ 93 (93)
T PF00028_consen 81 PLSSTATVTINVL 93 (93)
T ss_dssp EEEEEEEEEEEEE
T ss_pred CCEEEEEEEEEEC
Confidence 9999999999884
No 10
>smart00112 CA Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
Probab=99.68 E-value=4.4e-16 Score=116.70 Aligned_cols=79 Identities=49% Similarity=0.733 Sum_probs=73.9
Q ss_pred EeCCCCCCeEEEEEEEeCCCCCCeEEeCCccEEEEcccCCCccCcEEEEEEEEEECCCCCceEEEEEEEEEeeCCCCCC
Q psy6838 318 TDADEGVNGVVTYSVVSDWANDVFSINPQSGIFTLTAKLDYEDVHHYIFVVQAQDTGKPMMSSTVTVYFNVVDLNDNPP 396 (442)
Q Consensus 318 ~D~D~~~~~~i~y~l~~~~~~~~f~id~~tG~l~~~~~ld~e~~~~~~l~V~a~D~~~~~~~~~~~v~I~V~dvNd~~P 396 (442)
+|+|.+.++.++|+|.++....+|.|++.+|.|++.++||||....|.|.|+|+|.+.|++++.++|+|+|.|+|||+|
T Consensus 1 ~D~D~g~n~~i~Y~i~~~~~~~~F~i~~~tg~i~~~~~LD~e~~~~y~l~v~a~D~~~~~~~~~~~v~I~V~D~Nd~~P 79 (79)
T smart00112 1 TDADSGENGKVTYSILSGNEDGLFSIDPETGEITTTKPLDREEQPEYTLTVEATDGGGPPLSSTATVTVTVLDVNDNAP 79 (79)
T ss_pred CCCCCCcCcEEEEEEecCCCCCEEEEeCCccEEEeCCccCeeCCCeEEEEEEEEECCCCCcccEEEEEEEEEECCCCCC
Confidence 3788888999999999776668999999999999999999999999999999999999999999999999999999998
No 11
>KOG1834|consensus
Probab=99.64 E-value=2.1e-14 Score=138.91 Aligned_cols=152 Identities=26% Similarity=0.430 Sum_probs=126.5
Q ss_pred EeCCc--cEEEEcccCCcccCcEEEEEEEEEeCCCCCCC---CceeEEEEEEEEEeccCCCCeeccC-CeeEeeCCCCCC
Q psy6838 100 INPSN--GTLSNLKHLDRETKSSYNLVVMATDMAKPPQK---RLSSTTQVSIVIRDVNDNAPEFITA-NETTIPENSPLN 173 (442)
Q Consensus 100 i~~~~--g~l~~~~~lD~E~~~~~~l~v~a~D~~~~~~~---~~s~~~~v~I~V~dvNd~~P~f~~~-~~~~v~E~~~~g 173 (442)
+|..+ |.|+.+.+||-|.+..|+|+|+|.|++..|.. ..|..++|.|.|.|+|+++|+|..+ |.+.|.|+.. -
T Consensus 86 vdK~TGegvlRaK~~lDCelqkeytf~iQAydCg~gpdgtn~kKShkatvhIrVkDvNe~AP~f~ep~Yka~V~EGK~-y 164 (952)
T KOG1834|consen 86 VDKYTGEGVLRAKEPLDCELQKEYTFTIQAYDCGNGPDGTNTKKSHKATVHIRVKDVNEFAPVFKEPWYKAHVTEGKV-Y 164 (952)
T ss_pred EeccCCceEEeecCcccccccccceEEEEEEecCCCCCccccccccceEEEEEeccccccCchhcccceeeEEeccee-e
Confidence 45555 57889999999999999999999999875521 2467889999999999999999988 8888888864 5
Q ss_pred cEEEEEEEEeCCCC-CC-ceEEEEEecCCCCCeEecCcceEEEEcccCCcccccEEEEEEEEEeCCCCCCceeEEEEEEE
Q psy6838 174 TVVAALKAIDRDEG-QN-SYVEYSIINSEATPFSLGPVDGLLRVAQKLDREIKSNYTLEIKAKDRGEPSKSSRTKFFIRL 251 (442)
Q Consensus 174 ~~v~~v~a~D~D~~-~~-~~~~y~i~~~~~~~F~id~~tG~i~~~~~lD~e~~~~~~l~V~a~D~~~~~~~~~~~v~I~V 251 (442)
..|+++.|.|.|-+ ++ ..+.|.|...+ -+|.||. .|.|+.+.+|+|.....|.|+|.|.|.|.........|+|.|
T Consensus 165 d~il~veAiD~DCspq~sqIC~YEI~t~d-~PFaIdn-~G~irnTekLny~ke~~Y~ltVtAyDCg~kraa~d~lV~v~V 242 (952)
T KOG1834|consen 165 DSILRVEAIDKDCSPQYSQICEYEITTPD-VPFAIDN-DGNIRNTEKLNYTKEHQYKLTVTAYDCGKKRAASDSLVTVHV 242 (952)
T ss_pred eeeEEEEeecCCCCCcccceeEEEecCCC-CceEEcC-CCccccccccccccceeEEEEEEEEecccccccCcceEEEEe
Confidence 57899999999985 33 34689999865 7899997 999999999999999999999999999875555556777777
Q ss_pred EeC
Q psy6838 252 LDE 254 (442)
Q Consensus 252 ~d~ 254 (442)
...
T Consensus 243 kp~ 245 (952)
T KOG1834|consen 243 KPT 245 (952)
T ss_pred cCc
Confidence 543
No 12
>smart00112 CA Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
Probab=99.64 E-value=2.2e-15 Score=112.86 Aligned_cols=78 Identities=40% Similarity=0.645 Sum_probs=73.6
Q ss_pred EeCCCCCCceEEEEEecCCC-CCeEecCcceEEEEcccCCcccccEEEEEEEEEeCCCCCCceeEEEEEEEEeCCCCCC
Q psy6838 182 IDRDEGQNSYVEYSIINSEA-TPFSLGPVDGLLRVAQKLDREIKSNYTLEIKAKDRGEPSKSSRTKFFIRLLDENDNNP 259 (442)
Q Consensus 182 ~D~D~~~~~~~~y~i~~~~~-~~F~id~~tG~i~~~~~lD~e~~~~~~l~V~a~D~~~~~~~~~~~v~I~V~d~nd~~P 259 (442)
+|+|.+.++.++|+|.++.. .+|.|++.+|.|++.++||||....|.|.|+|+|.+.|++++.+.|+|.|.|+|||+|
T Consensus 1 ~D~D~g~n~~i~Y~i~~~~~~~~F~i~~~tg~i~~~~~LD~e~~~~y~l~v~a~D~~~~~~~~~~~v~I~V~D~Nd~~P 79 (79)
T smart00112 1 TDADSGENGKVTYSILSGNEDGLFSIDPETGEITTTKPLDREEQPEYTLTVEATDGGGPPLSSTATVTVTVLDVNDNAP 79 (79)
T ss_pred CCCCCCcCcEEEEEEecCCCCCEEEEeCCccEEEeCCccCeeCCCeEEEEEEEEECCCCCcccEEEEEEEEEECCCCCC
Confidence 47899889999999998875 8999999999999999999999999999999999999889999999999999999998
No 13
>PF08758 Cadherin_pro: Cadherin prodomain like; InterPro: IPR014868 Cadherins are a group of proteins that mediate calcium dependent cell-cell adhesion. They are activated through cleavage of a prosequence in the late Golgi. This protein corresponds to the folded region of the prosequence, and is termed the prodomain. The prodomain shows structural resemblance to the cadherin domain, but lacks all the features known to be important for cadherin-cadherin interactions []. ; GO: 0007155 cell adhesion, 0016021 integral to membrane; PDB: 1OP4_A.
Probab=97.54 E-value=0.0012 Score=49.94 Aligned_cols=78 Identities=23% Similarity=0.358 Sum_probs=42.4
Q ss_pred CCccCCcceeEEEecCCCCCcEEEEEEEEeCCCCCCeEEEEEEEeCCCCCCeEEeCCccEEEEcccCCCccCcEEEEEEE
Q psy6838 290 SPVFSEPVYAFDVLEDLPRGSRVGQVFTTDADEGVNGVVTYSVVSDWANDVFSINPQSGIFTLTAKLDYEDVHHYIFVVQ 369 (442)
Q Consensus 290 ~p~~~~~~~~~~v~E~~~~g~~v~~v~a~D~D~~~~~~i~y~l~~~~~~~~f~id~~tG~l~~~~~ld~e~~~~~~l~V~ 369 (442)
.|-|.+..|.+.|..+...|..||+|.-.|... +..+.|.-. ...|.|.++ |.|++++++.... ..-.|+|.
T Consensus 3 ~pGF~~~~~~~~Vp~~l~~g~~lg~V~f~dC~~--~~~~~~~ss----DpdF~V~~D-GsVy~~r~v~l~~-~~~~F~V~ 74 (90)
T PF08758_consen 3 RPGFSQKKYTFEVPSNLEAGQPLGKVNFEDCTG--RRRVIFESS----DPDFRVLED-GSVYAKRPVQLSS-EQRSFTVH 74 (90)
T ss_dssp --B--S-EEEE----SS-SS--EEE---B--SS-----EEEE-------SEEEEETT-TEEEEES--S-SS-S-EEEEEE
T ss_pred cCCcccceEEEEcCchhhCCcEEEEEEeccCCC--CCceEEecC----CCCEEEcCC-CeEEEeeeEecCC-CceEEEEE
Confidence 367888888899999999999999999999853 455777654 238999987 9999999987654 34689999
Q ss_pred EEECCC
Q psy6838 370 AQDTGK 375 (442)
Q Consensus 370 a~D~~~ 375 (442)
|.|..+
T Consensus 75 a~D~~~ 80 (90)
T PF08758_consen 75 AWDSQT 80 (90)
T ss_dssp EEETTT
T ss_pred EECCCC
Confidence 999876
No 14
>PF08266 Cadherin_2: Cadherin-like; InterPro: IPR013164 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion. This entry represents a cadherin domain that is usually found at the N terminus of cadherin proteins.; PDB: 1WUZ_A 1WYJ_A.
Probab=97.45 E-value=8.7e-05 Score=55.25 Aligned_cols=63 Identities=21% Similarity=0.462 Sum_probs=29.9
Q ss_pred eEEEEEecCCCCCcEEEEEEeeeCCCCCC--cEEEEEEEeCCCCCcEEEcCCccEEEEeeeccccc
Q psy6838 15 VYAFDVLEDLPRGSRVGQVFTTDADEGVN--GVVTYSVVSDWANDVFSINPQSGIFTLTAKLDYED 78 (442)
Q Consensus 15 ~~~~~v~E~~~~g~~v~~~~a~D~D~~~~--~~~~y~l~~~~~~~~f~id~~tg~~~~~~~ld~~~ 78 (442)
....+|+|..+.|+.||.+ |.|.-.... ....+++.+.....||.+++++|.|++..++|+|.
T Consensus 2 qi~YsV~EE~~~Gt~IGni-a~dL~l~~~~l~~~~~ri~s~~~~~~~~v~~~tG~L~v~~rIDRE~ 66 (84)
T PF08266_consen 2 QIRYSVPEEMPPGTVIGNI-AKDLGLDPQSLSSRNFRIVSEGNSQYFRVNEKTGDLFVSERIDREE 66 (84)
T ss_dssp EEEEEEESS--TT-EEEEC-CCCCT--HHHHCCTTBEEE-SSSS-SEEE-TTTSEEEESS--SCCC
T ss_pred CeEEEeecCCCCCCEEEEh-HHhhCCCcccccccceEEeecCCcceeEecCCceeEEeCCccCHHH
Confidence 3578999999999999999 555432110 01234444433444555555555555544455444
No 15
>PF08266 Cadherin_2: Cadherin-like; InterPro: IPR013164 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion. This entry represents a cadherin domain that is usually found at the N terminus of cadherin proteins.; PDB: 1WUZ_A 1WYJ_A.
Probab=97.41 E-value=9.9e-05 Score=54.96 Aligned_cols=62 Identities=21% Similarity=0.478 Sum_probs=40.3
Q ss_pred eeEEEecCCCCCcEEEEEEEEeCCCCCC--eEEEEEEEeCCCCCCeEEeCCccEEEEcccCCCcc
Q psy6838 298 YAFDVLEDLPRGSRVGQVFTTDADEGVN--GVVTYSVVSDWANDVFSINPQSGIFTLTAKLDYED 360 (442)
Q Consensus 298 ~~~~v~E~~~~g~~v~~v~a~D~D~~~~--~~i~y~l~~~~~~~~f~id~~tG~l~~~~~ld~e~ 360 (442)
..++|.|..++|+.||.+ |.|--.... ....|+|.......+|.++..+|.|+++..+|||.
T Consensus 3 i~YsV~EE~~~Gt~IGni-a~dL~l~~~~l~~~~~ri~s~~~~~~~~v~~~tG~L~v~~rIDRE~ 66 (84)
T PF08266_consen 3 IRYSVPEEMPPGTVIGNI-AKDLGLDPQSLSSRNFRIVSEGNSQYFRVNEKTGDLFVSERIDREE 66 (84)
T ss_dssp EEEEEESS--TT-EEEEC-CCCCT--HHHHCCTTBEEE-SSSS-SEEE-TTTSEEEESS--SCCC
T ss_pred eEEEeecCCCCCCEEEEh-HHhhCCCcccccccceEEeecCCcceeEecCCceeEEeCCccCHHH
Confidence 457899999999999998 344432111 12367877777788999999999999999999996
No 16
>PF08758 Cadherin_pro: Cadherin prodomain like; InterPro: IPR014868 Cadherins are a group of proteins that mediate calcium dependent cell-cell adhesion. They are activated through cleavage of a prosequence in the late Golgi. This protein corresponds to the folded region of the prosequence, and is termed the prodomain. The prodomain shows structural resemblance to the cadherin domain, but lacks all the features known to be important for cadherin-cadherin interactions []. ; GO: 0007155 cell adhesion, 0016021 integral to membrane; PDB: 1OP4_A.
Probab=97.11 E-value=0.0084 Score=45.30 Aligned_cols=79 Identities=22% Similarity=0.340 Sum_probs=41.2
Q ss_pred CCccCCCeEEEEEecCCCCCcEEEEEEeeeCCCCCCcEEEEEEEeCCCCCcEEEcCCccEEEEeeecccccccCCCcEEE
Q psy6838 8 SPVFSEPVYAFDVLEDLPRGSRVGQVFTTDADEGVNGVVTYSVVSDWANDVFSINPQSGIFTLTAKLDYEDVSDNGRLEY 87 (442)
Q Consensus 8 ~P~f~~~~~~~~v~E~~~~g~~v~~~~a~D~D~~~~~~~~y~l~~~~~~~~f~id~~tg~~~~~~~ld~~~~~~~~~i~y 87 (442)
-|-|....|.+.|+.+...|..|++|.-.|... +..+.|+..+
T Consensus 3 ~pGF~~~~~~~~Vp~~l~~g~~lg~V~f~dC~~--~~~~~~~ssD----------------------------------- 45 (90)
T PF08758_consen 3 RPGFSQKKYTFEVPSNLEAGQPLGKVNFEDCTG--RRRVIFESSD----------------------------------- 45 (90)
T ss_dssp --B--S-EEEE----SS-SS--EEE---B--SS-----EEEE--------------------------------------
T ss_pred cCCcccceEEEEcCchhhCCcEEEEEEeccCCC--CCceEEecCC-----------------------------------
Confidence 588999999999999999999999999999853 4456665422
Q ss_pred EEEeCCCCCcEEEeCCccEEEEcccCCcccCcEEEEEEEEEeCCCC
Q psy6838 88 SIVSGDDEEIFTINPSNGTLSNLKHLDRETKSSYNLVVMATDMAKP 133 (442)
Q Consensus 88 ~i~~~~~~~~F~i~~~~g~l~~~~~lD~E~~~~~~l~v~a~D~~~~ 133 (442)
..|.|. ..|.|.+++++..... .-.|.|.|.|..+.
T Consensus 46 --------pdF~V~-~DGsVy~~r~v~l~~~-~~~F~V~a~D~~~~ 81 (90)
T PF08758_consen 46 --------PDFRVL-EDGSVYAKRPVQLSSE-QRSFTVHAWDSQTQ 81 (90)
T ss_dssp --------SEEEEE-TTTEEEEES--S-SSS--EEEEEEEEETTTT
T ss_pred --------CCEEEc-CCCeEEEeeeEecCCC-ceEEEEEEECCCCC
Confidence 367776 6889999988876433 35799999998863
No 17
>smart00736 CADG Dystroglycan-type cadherin-like domains. Cadherin-homologous domains present in metazoan dystroglycans and alpha/epsilon sarcoglycans, yeast Axl2p and in a very large protein from magnetotactic bacteria. Likely to bind calcium ions.
Probab=97.02 E-value=0.014 Score=44.99 Aligned_cols=69 Identities=30% Similarity=0.342 Sum_probs=53.8
Q ss_pred EeCCCCCCeEEEEEEEeC---CCCCCeEEeCCccEEEEcccCCCccCcEEEEEEEEEECCCCCceEEEEEEEEEeeCCC
Q psy6838 318 TDADEGVNGVVTYSVVSD---WANDVFSINPQSGIFTLTAKLDYEDVHHYIFVVQAQDTGKPMMSSTVTVYFNVVDLND 393 (442)
Q Consensus 318 ~D~D~~~~~~i~y~l~~~---~~~~~f~id~~tG~l~~~~~ld~e~~~~~~l~V~a~D~~~~~~~~~~~v~I~V~dvNd 393 (442)
.|+| +..++|++... ....|..+++.++.++-. +. .+..+.|.+.|+|+|+.+ .++...++|.|.+.|+
T Consensus 25 ~d~d---~~~lty~~~~~~~~~lP~Wl~fd~~~~~~~Gt-P~-~~~~g~~~i~v~a~D~~g--~~~~~~f~i~V~~~~~ 96 (97)
T smart00736 25 TDAD---GDTLTYSATLSDGSALPSWLSFDSDTGTLSGT-PT-NSDVGSLSLKVTATDSSG--ASASDTFTITVVNTND 96 (97)
T ss_pred ECCC---CCeEEEEEEeCCCCCCCCeEEEeCCCCEEEEE-CC-CCCCcEEEEEEEEEECCC--CEEEEEEEEEEeCCCC
Confidence 6776 46799988743 235799999998888763 33 334577999999999876 6788999999999987
No 18
>TIGR01965 VCBS_repeat VCBS repeat. This domain of about 100 residues is found multiple (up to 35) copies in long proteins from several species of Vibrio, Colwellia, Bradyrhizobium, and Shewanella (hence the name VCBS) and in smaller copy numbers in proteins from several other bacteria. The large protein size and repeat copy numbers, species distribution, and suggested activities of several member proteins suggests a role for this domain in adhesion.
Probab=96.98 E-value=0.0084 Score=45.88 Aligned_cols=87 Identities=24% Similarity=0.303 Sum_probs=58.2
Q ss_pred EEEEEEeCCCCCCeEEEEEEEe-CCCCCCeEEeCCccEEEEc--------ccCCCccCcEEEEEEEEEECCCCCceEEEE
Q psy6838 313 GQVFTTDADEGVNGVVTYSVVS-DWANDVFSINPQSGIFTLT--------AKLDYEDVHHYIFVVQAQDTGKPMMSSTVT 383 (442)
Q Consensus 313 ~~v~a~D~D~~~~~~i~y~l~~-~~~~~~f~id~~tG~l~~~--------~~ld~e~~~~~~l~V~a~D~~~~~~~~~~~ 383 (442)
|++.++|+|.+. ..++++.. ...-+.|.|++ .|..+.. +.|...+...-+|+|++.|+ .+++
T Consensus 2 G~Lt~sD~D~gd--~~~~s~~~~~g~yGtlti~~-~G~wtYtl~n~~~avq~L~~Ge~~tdsFtvtv~DG------tt~~ 72 (99)
T TIGR01965 2 GQLTISDADAGQ--AHFIAQTDAAGQYGTFSIDA-DGQWTYQADNSQTAVQALKAGETLTDTFTVTSADG------TSQT 72 (99)
T ss_pred CceEEeCCCCCC--ceEEecccccCCcEEEEECC-CCcEEEEeCCCcHHHHhhcCCCEEEEEEEEEEeCC------CeEE
Confidence 567789999754 45666642 22346788887 4755443 12332234467899999995 2899
Q ss_pred EEEEEeeCCCCCCccCCCCCeeEEecC
Q psy6838 384 VYFNVVDLNDNPPLFDPMPPNIPVYEN 410 (442)
Q Consensus 384 v~I~V~dvNd~~P~f~~~~~~~~v~E~ 410 (442)
|+|+|...|| +|.... .-...+.|+
T Consensus 73 vtItI~GtND-apvi~~-~~~g~v~ED 97 (99)
T TIGR01965 73 VTITITGAND-AAVIGG-ADTGSVTED 97 (99)
T ss_pred EEEEEEccCC-CCEEec-ccceeEecC
Confidence 9999999999 887653 345677776
No 19
>TIGR01965 VCBS_repeat VCBS repeat. This domain of about 100 residues is found multiple (up to 35) copies in long proteins from several species of Vibrio, Colwellia, Bradyrhizobium, and Shewanella (hence the name VCBS) and in smaller copy numbers in proteins from several other bacteria. The large protein size and repeat copy numbers, species distribution, and suggested activities of several member proteins suggests a role for this domain in adhesion.
Probab=96.82 E-value=0.015 Score=44.59 Aligned_cols=50 Identities=24% Similarity=0.298 Sum_probs=36.7
Q ss_pred ccCCcccCcEEEEEEEEEeCCCCCCCCceeEEEEEEEEEeccCCCCeeccCCeeEeeCCC
Q psy6838 111 KHLDRETKSSYNLVVMATDMAKPPQKRLSSTTQVSIVIRDVNDNAPEFITANETTIPENS 170 (442)
Q Consensus 111 ~~lD~E~~~~~~l~v~a~D~~~~~~~~~s~~~~v~I~V~dvNd~~P~f~~~~~~~v~E~~ 170 (442)
+.|...+.-.-+|++.+.|+. +.+|.|+|...|| +|++.......+.|+.
T Consensus 49 q~L~~Ge~~tdsFtvtv~DGt---------t~~vtItI~GtND-apvi~~~~~g~v~ED~ 98 (99)
T TIGR01965 49 QALKAGETLTDTFTVTSADGT---------SQTVTITITGAND-AAVIGGADTGSVTEDS 98 (99)
T ss_pred HhhcCCCEEEEEEEEEEeCCC---------eEEEEEEEEccCC-CCEEecccceeEecCC
Confidence 344444445567888888842 5789999999999 8888777777777764
No 20
>smart00736 CADG Dystroglycan-type cadherin-like domains. Cadherin-homologous domains present in metazoan dystroglycans and alpha/epsilon sarcoglycans, yeast Axl2p and in a very large protein from magnetotactic bacteria. Likely to bind calcium ions.
Probab=96.73 E-value=0.035 Score=42.79 Aligned_cols=31 Identities=32% Similarity=0.389 Sum_probs=26.5
Q ss_pred CcEEEEEEEEEeCCCCCCCCceeEEEEEEEEEeccC
Q psy6838 118 KSSYNLVVMATDMAKPPQKRLSSTTQVSIVIRDVND 153 (442)
Q Consensus 118 ~~~~~l~v~a~D~~~~~~~~~s~~~~v~I~V~dvNd 153 (442)
...|.++|.|+|..+ .+....++|.|.+.|+
T Consensus 66 ~g~~~i~v~a~D~~g-----~~~~~~f~i~V~~~~~ 96 (97)
T smart00736 66 VGSLSLKVTATDSSG-----ASASDTFTITVVNTND 96 (97)
T ss_pred CcEEEEEEEEEECCC-----CEEEEEEEEEEeCCCC
Confidence 356999999999886 4788899999999886
No 21
>PF13750 Big_3_3: Bacterial Ig-like domain (group 3)
Probab=95.95 E-value=0.86 Score=38.54 Aligned_cols=126 Identities=21% Similarity=0.345 Sum_probs=72.5
Q ss_pred CcEEEEEE-EEEeCCCCCCCCceeEEEEEEEEEeccCCCCeeccCCeeEeeCCCCC-CcEEEEEEEEeCCCCCCceEEEE
Q psy6838 118 KSSYNLVV-MATDMAKPPQKRLSSTTQVSIVIRDVNDNAPEFITANETTIPENSPL-NTVVAALKAIDRDEGQNSYVEYS 195 (442)
Q Consensus 118 ~~~~~l~v-~a~D~~~~~~~~~s~~~~v~I~V~dvNd~~P~f~~~~~~~v~E~~~~-g~~v~~v~a~D~D~~~~~~~~y~ 195 (442)
...|++++ .|.|..+. ..+..+..++. +...+|.+.-.....+..+... +..-..+.++|.-.+. ..-+.+
T Consensus 14 dG~Y~l~~~~a~D~agN-----~~~~~~~~~~~-iD~T~Ptisi~~~~~~~~g~~v~~~~~i~i~~tD~~~~~-~i~sv~ 86 (158)
T PF13750_consen 14 DGSYTLTVVTATDAAGN-----TSTSTVSETFT-IDNTPPTISISDGASVANGSTVYGLVNISINVTDNSDDS-KITSVS 86 (158)
T ss_pred CccEEEEEEEEEecCCC-----EEEEEEeeEEE-EcCCCCEEEEecCCccCCCccccceeeeEEEEEeCCCCc-eEEEEE
Confidence 46899999 79999874 44444443333 3456898866322233333332 3344557777766544 335677
Q ss_pred EecCC-CCCe--EecC-cceEEEEc--ccC-CcccccEEEEEEEEEeCCCCCCceeEEEEEEEE
Q psy6838 196 IINSE-ATPF--SLGP-VDGLLRVA--QKL-DREIKSNYTLEIKAKDRGEPSKSSRTKFFIRLL 252 (442)
Q Consensus 196 i~~~~-~~~F--~id~-~tG~i~~~--~~l-D~e~~~~~~l~V~a~D~~~~~~~~~~~v~I~V~ 252 (442)
|.++. ...- .... ..|...+. +.+ ..|....|+|+|.|+|.. +...+..+.....
T Consensus 87 l~Gg~~~d~v~ls~~~~~~~~~~~~yp~~fpsle~~~~YtLtV~a~D~a--GN~~~~si~F~y~ 148 (158)
T PF13750_consen 87 LTGGPASDSVSLSWTNKGNGVYTLEYPRIFPSLEADDSYTLTVSATDKA--GNQSTKSISFSYM 148 (158)
T ss_pred EECCcccceEEEeeEeccCceEEeecccccCCcCCCCeEEEEEEEEecC--CCEEEEEEEEEEe
Confidence 77665 2221 1111 13333332 111 236788999999999987 4566666666655
No 22
>PF13750 Big_3_3: Bacterial Ig-like domain (group 3)
Probab=95.64 E-value=1.2 Score=37.75 Aligned_cols=76 Identities=24% Similarity=0.330 Sum_probs=42.5
Q ss_pred EEEEEeCCCCCCeEEEEEEEeCCCCCCeEE--eCC-ccEEEEc--cc-CCCccCcEEEEEEEEEECCCCCceEEEEEEEE
Q psy6838 314 QVFTTDADEGVNGVVTYSVVSDWANDVFSI--NPQ-SGIFTLT--AK-LDYEDVHHYIFVVQAQDTGKPMMSSTVTVYFN 387 (442)
Q Consensus 314 ~v~a~D~D~~~~~~i~y~l~~~~~~~~f~i--d~~-tG~l~~~--~~-ld~e~~~~~~l~V~a~D~~~~~~~~~~~v~I~ 387 (442)
.+.++|.-... .....+|.+|.......+ ... .|...+. +. ...|....|+|+|.|.|..+ ...+..+.+.
T Consensus 70 ~i~~tD~~~~~-~i~sv~l~Gg~~~d~v~ls~~~~~~~~~~~~yp~~fpsle~~~~YtLtV~a~D~aG--N~~~~si~F~ 146 (158)
T PF13750_consen 70 SINVTDNSDDS-KITSVSLTGGPASDSVSLSWTNKGNGVYTLEYPRIFPSLEADDSYTLTVSATDKAG--NQSTKSISFS 146 (158)
T ss_pred EEEEEeCCCCc-eEEEEEEECCcccceEEEeeEeccCceEEeecccccCCcCCCCeEEEEEEEEecCC--CEEEEEEEEE
Confidence 34445554332 344566665443333222 111 2333332 11 12366789999999999877 6778888887
Q ss_pred EeeCC
Q psy6838 388 VVDLN 392 (442)
Q Consensus 388 V~dvN 392 (442)
....|
T Consensus 147 y~P~~ 151 (158)
T PF13750_consen 147 YMPPN 151 (158)
T ss_pred EeCCe
Confidence 76543
No 23
>TIGR00845 caca sodium/calcium exchanger 1. This model is specific for the eukaryotic sodium ion/calcium ion exchangers of the Caca family
Probab=92.98 E-value=16 Score=39.73 Aligned_cols=48 Identities=23% Similarity=0.255 Sum_probs=30.5
Q ss_pred ccCCCCeeccC-CeeEeeCCCCCCcEEEEEEEEeCCCCCCceEEEEEecCC
Q psy6838 151 VNDNAPEFITA-NETTIPENSPLNTVVAALKAIDRDEGQNSYVEYSIINSE 200 (442)
Q Consensus 151 vNd~~P~f~~~-~~~~v~E~~~~g~~v~~v~a~D~D~~~~~~~~y~i~~~~ 200 (442)
.||..+.+... ....+.|+. |+.-..|.-...|.+....+.|+..++.
T Consensus 395 ~dd~~s~i~Fe~~~Y~V~En~--GtV~VtV~R~GGdl~~tVsVdY~T~DGT 443 (928)
T TIGR00845 395 ENDPVSKIFFEPGHYTCLENC--GTVALTVVRRGGDLTNTVYVDYRTEDGT 443 (928)
T ss_pred ccCCcceEEecCCeEEEeecC--cEEEEEEEEccCCCCceEEEEEEccCCc
Confidence 34444455433 677899986 6666666554445544566889888776
No 24
>PF05345 He_PIG: Putative Ig domain; InterPro: IPR008009 This alignment represents the conserved core region of a ~90 residue repeat found in several haemagglutinins and other cell surface proteins. Sequence similarities to Hyalin (IPR003410 from INTERPRO) and the PKD domain (IPR000601 from INTERPRO) suggest an Ig-like fold so this family may be similar in function to the (IPR003791 from INTERPRO) and (IPR003790 from INTERPRO) protein families.
Probab=92.43 E-value=0.52 Score=31.08 Aligned_cols=37 Identities=24% Similarity=0.304 Sum_probs=28.6
Q ss_pred CCCCCeEEeCCccEEEEcccCCCc-cCcEEEEEEEEEECC
Q psy6838 336 WANDVFSINPQSGIFTLTAKLDYE-DVHHYIFVVQAQDTG 374 (442)
Q Consensus 336 ~~~~~f~id~~tG~l~~~~~ld~e-~~~~~~l~V~a~D~~ 374 (442)
....+..||+.+|.|+-. .+.. ....|.|+|+|+|..
T Consensus 11 ~LP~gLs~d~~tG~isGt--p~~~~~~G~y~~~vtatd~~ 48 (49)
T PF05345_consen 11 GLPSGLSLDPSTGTISGT--PTSSVQPGTYTFTVTATDGS 48 (49)
T ss_pred CCCCcEEEeCCCCEEEee--cCCCccccEEEEEEEEEcCC
Confidence 456789999999999765 3333 336999999999975
No 25
>TIGR00845 caca sodium/calcium exchanger 1. This model is specific for the eukaryotic sodium ion/calcium ion exchangers of the Caca family
Probab=90.75 E-value=8.1 Score=41.92 Aligned_cols=53 Identities=17% Similarity=0.226 Sum_probs=34.2
Q ss_pred EEEEEEEeeCCCCCCccCCCCCeeEEecCCCCCceEEE-EEEEeCCCC-eeEEEEeeC
Q psy6838 382 VTVYFNVVDLNDNPPLFDPMPPNIPVYENVTLGTGLAS-VHAVDLDSG-KLPFLLPIG 437 (442)
Q Consensus 382 ~~v~I~V~dvNd~~P~f~~~~~~~~v~E~~~~g~~v~~-v~a~D~D~~-~~~y~i~~~ 437 (442)
.+.+|+|.| ||++|.|........|.|+ .|..-.+ ++..+.++. .+.|.-.+|
T Consensus 515 s~ATVTIlD-DD~aGIfsFe~~~~sV~Es--~G~vtvtV~RtsGa~G~VtV~Y~T~dG 569 (928)
T TIGR00845 515 NTATVTILD-DDHAGIFTFEEDVFHVSES--IGIMEVKVLRTSGARGTVIVPYRTVEG 569 (928)
T ss_pred ceEEEEEec-CcccCcccccCceEEEEcC--CCEEEEEEEEcCCCCeeEEEEEEeecC
Confidence 456677777 6789988766666788898 4554333 334444444 777877665
No 26
>TIGR03660 T1SS_rpt_143 T1SS-143 repeat domain. This model represents a domain of about 143 amino acids that may occur singly or in up to 23 tandem repeats in very large proteins in the genus Vibrio, and in related species such as Legionella pneumophila, Photobacterium profundum, Rhodopseudomonas palustris, Shewanella pealeana, and Aeromonas hydrophila. Proteins with these domains represent a subset of a broader set of proteins with a particular signal for type 1 secretion, consisting of several glycine-rich repeats modeled by pfam00353, followed by a C-terminal domain modeled by TIGR03661. Proteins with this domain tend to share several properties with the RtxA (Repeats in Toxin) protein of Vibrio cholerae, including a large size often containing tandemly repeated domains and a C-terminal signal for type 1 secretion.
Probab=88.56 E-value=12 Score=30.65 Aligned_cols=57 Identities=23% Similarity=0.388 Sum_probs=40.2
Q ss_pred EEEEcccCCccc---CcEEEEEEEEEeCCCCCCCCceeEEEEEEEEEeccCCCCeeccCCeeEeeCCC
Q psy6838 106 TLSNLKHLDRET---KSSYNLVVMATDMAKPPQKRLSSTTQVSIVIRDVNDNAPEFITANETTIPENS 170 (442)
Q Consensus 106 ~l~~~~~lD~E~---~~~~~l~v~a~D~~~~~~~~~s~~~~v~I~V~dvNd~~P~f~~~~~~~v~E~~ 170 (442)
.+.+..+||+.. .-...|.|.|+|..+. .++..+.|+|.| | .|.........|.|+.
T Consensus 69 tftL~~~lDH~~g~d~l~l~~~v~a~D~DGD-----~s~~~l~VtI~D--D-~P~~~~~~~~~V~E~~ 128 (137)
T TIGR03660 69 EFTLEGPLDHAAGSDELTLNFPIIATDFDGD-----TSSITLPVTIVD--D-VPTITDVDALTVDEDD 128 (137)
T ss_pred EEEEcccccCCCCCceEEEeeeEEEEeCCCC-----ccccEEEEEEEC--C-CCeeccccceEEeccc
Confidence 344567888743 4467888999998874 234578888877 5 6887666667888853
No 27
>TIGR00864 PCC polycystin cation channel protein. Note: this model has been restricted to the amino half because for technical reasons.
Probab=88.08 E-value=75 Score=39.05 Aligned_cols=112 Identities=13% Similarity=0.142 Sum_probs=61.5
Q ss_pred CCCcEEEEEEeeeCCCCCCcEEEEEEEeCCCCCcEEEcCCccEEEEeeeccccc-ccCCCcEEEEEEeCCCCCcEEEeCC
Q psy6838 25 PRGSRVGQVFTTDADEGVNGVVTYSVVSDWANDVFSINPQSGIFTLTAKLDYED-VSDNGRLEYSIVSGDDEEIFTINPS 103 (442)
Q Consensus 25 ~~g~~v~~~~a~D~D~~~~~~~~y~l~~~~~~~~f~id~~tg~~~~~~~ld~~~-~~~~~~i~y~i~~~~~~~~F~i~~~ 103 (442)
.+|+...+++|++.. |.+ ..+-.+.=..+-..|.+++..+.+.+...+.+.. ...+..+.|+..-++....+.-++.
T Consensus 1482 ~~GtYtVtLTvtN~~-Gss-t~T~~VtV~~pV~~~tinas~~~vpl~~sV~Fta~~s~Gs~v~ysWdFGDg~ts~~~npt 1559 (2740)
T TIGR00864 1482 SPGDFNIRLAAANEV-GKN-EATLNVAVKARVRGLTINASLTNVPLNGSVHFEAHLDAGDDVRFSWILCDHCTPIFGGNT 1559 (2740)
T ss_pred CCceEEEEEEEECCC-Cce-EEEEEEEEeccccceEEcCCCccccccceEEEEEEccCCCceeEEEEeCCCCccccCCCc
Confidence 357777777777664 222 1111221112334577777777665554454443 2245567777776654322221110
Q ss_pred ccEEEEcccCCcccCcEEEEEEEEEeCCCCCCCCceeEEEEEEEEEec
Q psy6838 104 NGTLSNLKHLDRETKSSYNLVVMATDMAKPPQKRLSSTTQVSIVIRDV 151 (442)
Q Consensus 104 ~g~l~~~~~lD~E~~~~~~l~v~a~D~~~~~~~~~s~~~~v~I~V~dv 151 (442)
..--|.+...|++++.|++..|. ...++.|.|.+.
T Consensus 1560 -------~~yTY~spGtYtVtLTvtN~~Gs------~~~T~~i~V~~~ 1594 (2740)
T TIGR00864 1560 -------IFYTFRSVGTFNIIVTAENDVGA------AQASIFLFVLQE 1594 (2740)
T ss_pred -------eEEeecCCceEEEEEEEecCCCc------cceeEEEEEeec
Confidence 01135677899999999998873 345556666643
No 28
>PF07495 Y_Y_Y: Y_Y_Y domain; InterPro: IPR011123 This region is mostly found at the end of the beta propellers (IPR011110 from INTERPRO) in a family of two component regulators. However they are also found tandemly repeated in Q891H4 from SWISSPROT without other signal conduction domains being present. It is named after the conserved tyrosines found in the alignment. The exact function is not known.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=87.93 E-value=6.1 Score=27.48 Aligned_cols=60 Identities=23% Similarity=0.333 Sum_probs=36.4
Q ss_pred CCceEEEEEecCCCCCeEecCcceEEEEcccCCcccccEEEEEEEEEeCCCCCCceeEEEEEEE
Q psy6838 188 QNSYVEYSIINSEATPFSLGPVDGLLRVAQKLDREIKSNYTLEIKAKDRGEPSKSSRTKFFIRL 251 (442)
Q Consensus 188 ~~~~~~y~i~~~~~~~F~id~~tG~i~~~~~lD~e~~~~~~l~V~a~D~~~~~~~~~~~v~I~V 251 (442)
.+-...|+|.+-+..|..+...+-.+... .| ..+.|+|.|+|.|..+.......++.|+|
T Consensus 6 ~~~~Y~Y~l~g~d~~W~~~~~~~~~~~~~-~L---~~G~Y~l~V~a~~~~~~~~~~~~~l~i~I 65 (66)
T PF07495_consen 6 ENIRYRYRLEGFDDEWITLGSYSNSISYT-NL---PPGKYTLEVRAKDNNGKWSSDEKSLTITI 65 (66)
T ss_dssp TTEEEEEEEETTESSEEEESSTS-EEEEE-S-----SEEEEEEEEEEETTS-B-SS-EEEEEEE
T ss_pred CceEEEEEEECCCCeEEECCCCcEEEEEE-eC---CCEEEEEEEEEECCCCCcCcccEEEEEEE
Confidence 34457788888777777766533244332 22 46899999999997754333336666665
No 29
>PF07495 Y_Y_Y: Y_Y_Y domain; InterPro: IPR011123 This region is mostly found at the end of the beta propellers (IPR011110 from INTERPRO) in a family of two component regulators. However they are also found tandemly repeated in Q891H4 from SWISSPROT without other signal conduction domains being present. It is named after the conserved tyrosines found in the alignment. The exact function is not known.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=87.14 E-value=8 Score=26.85 Aligned_cols=60 Identities=17% Similarity=0.264 Sum_probs=35.0
Q ss_pred CeEEEEEEEeCCCCCCeEEeCCccEEEEcccCCCccCcEEEEEEEEEECCCCCceEEEEEEEEEe
Q psy6838 325 NGVVTYSVVSDWANDVFSINPQSGIFTLTAKLDYEDVHHYIFVVQAQDTGKPMMSSTVTVYFNVV 389 (442)
Q Consensus 325 ~~~i~y~l~~~~~~~~f~id~~tG~l~~~~~ld~e~~~~~~l~V~a~D~~~~~~~~~~~v~I~V~ 389 (442)
+-...|+|. |-...|..+...+-.+..+ .| ..+.|+|.|+|.|..+.......++.|+|.
T Consensus 7 ~~~Y~Y~l~-g~d~~W~~~~~~~~~~~~~-~L---~~G~Y~l~V~a~~~~~~~~~~~~~l~i~I~ 66 (66)
T PF07495_consen 7 NIRYRYRLE-GFDDEWITLGSYSNSISYT-NL---PPGKYTLEVRAKDNNGKWSSDEKSLTITIL 66 (66)
T ss_dssp TEEEEEEEE-TTESSEEEESSTS-EEEEE-S-----SEEEEEEEEEEETTS-B-SS-EEEEEEEE
T ss_pred ceEEEEEEE-CCCCeEEECCCCcEEEEEE-eC---CCEEEEEEEEEECCCCCcCcccEEEEEEEC
Confidence 445677777 4445666665543244433 22 568999999999977644333367776663
No 30
>TIGR03660 T1SS_rpt_143 T1SS-143 repeat domain. This model represents a domain of about 143 amino acids that may occur singly or in up to 23 tandem repeats in very large proteins in the genus Vibrio, and in related species such as Legionella pneumophila, Photobacterium profundum, Rhodopseudomonas palustris, Shewanella pealeana, and Aeromonas hydrophila. Proteins with these domains represent a subset of a broader set of proteins with a particular signal for type 1 secretion, consisting of several glycine-rich repeats modeled by pfam00353, followed by a C-terminal domain modeled by TIGR03661. Proteins with this domain tend to share several properties with the RtxA (Repeats in Toxin) protein of Vibrio cholerae, including a large size often containing tandemly repeated domains and a C-terminal signal for type 1 secretion.
Probab=79.53 E-value=23 Score=29.08 Aligned_cols=56 Identities=29% Similarity=0.360 Sum_probs=36.6
Q ss_pred EEEEcccCCCcc---CcEEEEEEEEEECCCCCceEEEEEEEEEeeCCCCCCccCCCCCeeEEecC
Q psy6838 349 IFTLTAKLDYED---VHHYIFVVQAQDTGKPMMSSTVTVYFNVVDLNDNPPLFDPMPPNIPVYEN 410 (442)
Q Consensus 349 ~l~~~~~ld~e~---~~~~~l~V~a~D~~~~~~~~~~~v~I~V~dvNd~~P~f~~~~~~~~v~E~ 410 (442)
.+++.++||... .-...|.|.|+|..+. .+..++.|+|.| | .|...... ...|.|+
T Consensus 69 tftL~~~lDH~~g~d~l~l~~~v~a~D~DGD--~s~~~l~VtI~D--D-~P~~~~~~-~~~V~E~ 127 (137)
T TIGR03660 69 EFTLEGPLDHAAGSDELTLNFPIIATDFDGD--TSSITLPVTIVD--D-VPTITDVD-ALTVDED 127 (137)
T ss_pred EEEEcccccCCCCCceEEEeeeEEEEeCCCC--ccccEEEEEEEC--C-CCeecccc-ceEEecc
Confidence 444556666532 3367889999997763 334688888888 5 57765433 3678775
No 31
>PF02010 REJ: REJ domain; InterPro: IPR002859 The REJ (Receptor for Egg Jelly) domain is found in PKD1 P98161 from SWISSPROT and the sperm receptor for egg jelly Q26627 from SWISSPROT. The exact function of this domain is unknown. The domain is 600 amino acids long so is probably composed of multiple structural domains. There are six completely conserved cysteine residues that may form disulphide bridges. This region contains tandem PKD-like domains. Sequence similarity between a region of the autosomal dominant polycystic kidney disease (ADPKD) protein, polycystin-1 and a sea urchin sperm glycoprotein involved in fertilization, the receptor for egg jelly (suREJ) has been known for some time. The suREJ protein binds the glycoprotein coat of the egg (egg jelly), triggering the acrosome reaction, which transforms the sperm into a fusogenic cell. The sequence similarity and expression pattern suggests that the predicted human PKDREJ protein is a mammalian equivalent of the suREJ protein and therefore may have a central role in human fertilization [].; PDB: 2E7M_A 2YRL_A.
Probab=79.51 E-value=3.2 Score=41.59 Aligned_cols=69 Identities=17% Similarity=0.236 Sum_probs=0.0
Q ss_pred ccCcEEEEEEEEEECCCCCceEEEEEEEEEeeCCCCCCccCCCCCeeEEecCCCCCceEEEEEEEeC-CCC-eeEEEEee
Q psy6838 359 EDVHHYIFVVQAQDTGKPMMSSTVTVYFNVVDLNDNPPLFDPMPPNIPVYENVTLGTGLASVHAVDL-DSG-KLPFLLPI 436 (442)
Q Consensus 359 e~~~~~~l~V~a~D~~~~~~~~~~~v~I~V~dvNd~~P~f~~~~~~~~v~E~~~~g~~v~~v~a~D~-D~~-~~~y~i~~ 436 (442)
+....|.|++.|+|.+.. ...+.+.+.+ |- ||.-. ...+.=.+. ......+++.+.+- |.. +|+|++.-
T Consensus 270 ~~g~~Y~~~l~v~~~~~~--~~~a~~~~~~---n~-pP~~g--~c~v~P~~G-~al~T~Ft~~c~~~~d~~~pL~Y~f~~ 340 (440)
T PF02010_consen 270 EPGSTYTFRLTVTDSSGS--SGSASISFTV---NA-PPSGG--SCSVSPSSG-YALETTFTITCSGWTDDDQPLTYQFYY 340 (440)
T ss_dssp --------------------------------------------------------------------------------
T ss_pred cccccccccccccccccc--cccccccccc---cc-ccccc--ccccccccc-ccccccccccccccccccccccccccc
Confidence 456789999999998662 2256666666 43 66421 111111111 12223556666665 444 66666544
No 32
>PF05345 He_PIG: Putative Ig domain; InterPro: IPR008009 This alignment represents the conserved core region of a ~90 residue repeat found in several haemagglutinins and other cell surface proteins. Sequence similarities to Hyalin (IPR003410 from INTERPRO) and the PKD domain (IPR000601 from INTERPRO) suggest an Ig-like fold so this family may be similar in function to the (IPR003791 from INTERPRO) and (IPR003790 from INTERPRO) protein families.
Probab=78.79 E-value=11 Score=24.65 Aligned_cols=35 Identities=29% Similarity=0.454 Sum_probs=20.6
Q ss_pred CcEEEeCCccEEEEcccCCcc-cCcEEEEEEEEEeCCC
Q psy6838 96 EIFTINPSNGTLSNLKHLDRE-TKSSYNLVVMATDMAK 132 (442)
Q Consensus 96 ~~F~i~~~~g~l~~~~~lD~E-~~~~~~l~v~a~D~~~ 132 (442)
....+|+.+|.|.=.- +.. ....|.++|+|+|..+
T Consensus 14 ~gLs~d~~tG~isGtp--~~~~~~G~y~~~vtatd~~G 49 (49)
T PF05345_consen 14 SGLSLDPSTGTISGTP--TSSVQPGTYTFTVTATDGSG 49 (49)
T ss_pred CcEEEeCCCCEEEeec--CCCccccEEEEEEEEEcCCC
Confidence 3455555555554331 122 2258999999999754
No 33
>KOG4221|consensus
Probab=77.02 E-value=1.3e+02 Score=33.79 Aligned_cols=59 Identities=17% Similarity=0.219 Sum_probs=46.5
Q ss_pred CCCCCeEEeCCccEEEEcccCCCccCcEEEEEEEEEECCCCCceEEEEEEEEEeeCCCCCCc
Q psy6838 336 WANDVFSINPQSGIFTLTAKLDYEDVHHYIFVVQAQDTGKPMMSSTVTVYFNVVDLNDNPPL 397 (442)
Q Consensus 336 ~~~~~f~id~~tG~l~~~~~ld~e~~~~~~l~V~a~D~~~~~~~~~~~v~I~V~dvNd~~P~ 397 (442)
+.+.++.++.++-.+++. +.+....|.+.|.|....+++.++...-.++..|+-+.||.
T Consensus 562 ~~~~~~~~~~n~~e~ti~---gL~k~TeY~~~vvA~N~~G~g~sS~~i~V~Tlsd~PsaPP~ 620 (1381)
T KOG4221|consen 562 DTGKELRVENNATEYTIN---GLEKYTEYSIRVVAYNSAGSGVSSADITVRTLSDVPSAPPQ 620 (1381)
T ss_pred CCCceEEEecCccEEEee---cCCCccceEEEEEEecCCCCCCCCCceEEEeccCCCCCCCc
Confidence 456788888877777765 34667789999999998887777666666788889888887
No 34
>PF12245 Big_3_2: Bacterial Ig-like domain (group 3); InterPro: IPR022038 This family of proteins is found in bacteria. They have two conserved sequence motifs: AGN and GMT.
Probab=58.47 E-value=45 Score=22.83 Aligned_cols=30 Identities=27% Similarity=0.286 Sum_probs=21.9
Q ss_pred CcEEEEEEEEEECCCCCceEEEEEEEEEeeCC
Q psy6838 361 VHHYIFVVQAQDTGKPMMSSTVTVYFNVVDLN 392 (442)
Q Consensus 361 ~~~~~l~V~a~D~~~~~~~~~~~v~I~V~dvN 392 (442)
.+.|++.++++|..+ ..+.....+.+.|..
T Consensus 22 dg~yt~~v~a~D~AG--N~~~~~~~~~i~d~~ 51 (60)
T PF12245_consen 22 DGEYTLTVTATDKAG--NTSSSTTQIVIVDNT 51 (60)
T ss_pred CccEEEEEEEEECCC--CEEEeeeEEEEEcCC
Confidence 568999999999877 455556666666654
No 35
>PF13753 SWM_repeat: Putative flagellar system-associated repeat
Probab=55.78 E-value=1.9e+02 Score=27.46 Aligned_cols=111 Identities=23% Similarity=0.304 Sum_probs=52.1
Q ss_pred cCcEEEEEEEEEeCCCCCCCCceeEEEEEEEEEeccCCCCeeccC---CeeEeeCCCCCCcEEEEEEEEeCCCCCCceEE
Q psy6838 117 TKSSYNLVVMATDMAKPPQKRLSSTTQVSIVIRDVNDNAPEFITA---NETTIPENSPLNTVVAALKAIDRDEGQNSYVE 193 (442)
Q Consensus 117 ~~~~~~l~v~a~D~~~~~~~~~s~~~~v~I~V~dvNd~~P~f~~~---~~~~v~E~~~~g~~v~~v~a~D~D~~~~~~~~ 193 (442)
....|.+.+.++|..|.. +.....+.|-- .+|..... ....+.-............+.+.+. ...+.
T Consensus 10 ~d~~~~v~vt~tD~aGN~-----~~~t~~~~vDt---~~P~v~i~~~~~~~~~~~~~~~~~~t~s~tvs~~~~--g~~v~ 79 (317)
T PF13753_consen 10 ADGTYTVSVTVTDAAGNT-----STATQSITVDT---TAPTVTITSIADDDIINGDEATNTVTFSGTVSGAEP--GSTVT 79 (317)
T ss_pred CCCcEEEEEEEEeCCCCe-----eeeeEEEEEec---CCCceeeecccCCCccccceeeeeeEEEEEecCCCC--CCEEE
Confidence 346789999999999853 33445554322 26644332 0001100001122233444444343 33355
Q ss_pred EEEecCCCCCeEecCcceEEEEcccC-CcccccEEEEEEE-EEeCCCC
Q psy6838 194 YSIINSEATPFSLGPVDGLLRVAQKL-DREIKSNYTLEIK-AKDRGEP 239 (442)
Q Consensus 194 y~i~~~~~~~F~id~~tG~i~~~~~l-D~e~~~~~~l~V~-a~D~~~~ 239 (442)
+.+ .+....+..+ ..|.....-.. +.-....|.+.+. ++|..+.
T Consensus 80 v~~-~g~~~t~~~~-~~G~ws~t~~~~~~l~~g~~ti~v~~~tD~aGN 125 (317)
T PF13753_consen 80 VTI-NGTTGTLTAD-ADGNWSVTVTPSDDLPDGDYTITVTTVTDAAGN 125 (317)
T ss_pred EEE-CCEEEEEEEe-cCCcEEEeeccccccccCcceeEEEEEEccCCc
Confidence 555 2222233333 24432222111 1123458889998 9997753
No 36
>cd02848 Chitinase_N_term Chitinase N-terminus domain. Chitinases hydrolyze the abundant natural biopolymer chitin, producing smaller chito-oligosaccharides. Chitin consists of multiple N-acetyl-D-glucosamine (NAG) residues connected via beta-1,4-glycosidic linkages and is an important structural element of fungal cell wall and arthropod exoskeletons. On the basis of the mode of chitin hydrolysis, chitinases are classified as random, endo-, and exo-chitinases and based on sequence criteria, chitinases belong to families 18 and 19 of glycosyl hydrolases. The N-terminus of chitinase may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitob
Probab=55.03 E-value=25 Score=27.36 Aligned_cols=34 Identities=15% Similarity=0.104 Sum_probs=23.1
Q ss_pred cCCCccCcEEEEEEEEEECCCCCceEEEEEEEEEee
Q psy6838 355 KLDYEDVHHYIFVVQAQDTGKPMMSSTVTVYFNVVD 390 (442)
Q Consensus 355 ~ld~e~~~~~~l~V~a~D~~~~~~~~~~~v~I~V~d 390 (442)
.+++.+.+.|.++|++.|..+ -+.++.+.|.|.|
T Consensus 73 t~~v~kgG~y~m~V~lCn~dG--CS~S~~~~I~VAD 106 (106)
T cd02848 73 TFKVGKGGRYQMQVALCNGDG--CSTSAAKEIVVAD 106 (106)
T ss_pred EEEeCCCCeEEEEEEEECCCC--ccCcCCEEEEecC
Confidence 455667789999999999877 3444455555543
No 37
>cd00146 PKD polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.
Probab=54.73 E-value=30 Score=24.98 Aligned_cols=31 Identities=23% Similarity=0.175 Sum_probs=22.0
Q ss_pred CCCccCcEEEEEEEEEECCCCCceEEEEEEEEE
Q psy6838 356 LDYEDVHHYIFVVQAQDTGKPMMSSTVTVYFNV 388 (442)
Q Consensus 356 ld~e~~~~~~l~V~a~D~~~~~~~~~~~v~I~V 388 (442)
..|...+.|.++++++|..+ .+...++.|.|
T Consensus 51 ~~y~~~G~y~v~l~v~d~~g--~~~~~~~~V~V 81 (81)
T cd00146 51 HTYTKPGTYTVTLTVTNAVG--SSSTKTTTVVV 81 (81)
T ss_pred EEcCCCcEEEEEEEEEeCCC--CEEEEEEEEEC
Confidence 44678899999999999864 34444555543
No 38
>TIGR00864 PCC polycystin cation channel protein. Note: this model has been restricted to the amino half because for technical reasons.
Probab=54.72 E-value=5.3e+02 Score=32.38 Aligned_cols=194 Identities=16% Similarity=0.215 Sum_probs=88.1
Q ss_pred CCCcEEEEEEeeeCCCCCCcEEEEEEEeCCCCCcEEEcCCccE-EEEeeecccccccCCCcEEEEEEeCCCCCcEEEeCC
Q psy6838 25 PRGSRVGQVFTTDADEGVNGVVTYSVVSDWANDVFSINPQSGI-FTLTAKLDYEDVSDNGRLEYSIVSGDDEEIFTINPS 103 (442)
Q Consensus 25 ~~g~~v~~~~a~D~D~~~~~~~~y~l~~~~~~~~f~id~~tg~-~~~~~~ld~~~~~~~~~i~y~i~~~~~~~~F~i~~~ 103 (442)
.+|+....++|++..........-......+...|..+...|. +.+.....+.+...+..+.|...-|++.. .
T Consensus 1398 ~pGtYTVTLTVtN~~gs~T~s~~I~V~~Pv~~~~ft~ngT~g~~ltlg~pv~Ftd~StGt~~Sy~WdFGDGst------s 1471 (2740)
T TIGR00864 1398 DPGCYLVTVAASNNISAANDSALIEVLEPVGATSFKHNGSHGNNLELGQPYLFSAFGRARNASYLWDFGDGGL------L 1471 (2740)
T ss_pred CCceEEEEEEEECCCcceeeeeEEEEecCCCcceeeecCCcccccccCcceEEeecCCCCceEEEEECCCCCc------c
Confidence 3566666777765432111111111221111223444443332 22222222333445566777777665421 0
Q ss_pred ccEEEEcccCCcccCcEEEEEEEEEeCCCCCCCCceeEEEEEEEEEeccCCCCeeccC-CeeEeeCCCCCCcEEEEEEEE
Q psy6838 104 NGTLSNLKHLDRETKSSYNLVVMATDMAKPPQKRLSSTTQVSIVIRDVNDNAPEFITA-NETTIPENSPLNTVVAALKAI 182 (442)
Q Consensus 104 ~g~l~~~~~lD~E~~~~~~l~v~a~D~~~~~~~~~s~~~~v~I~V~dvNd~~P~f~~~-~~~~v~E~~~~g~~v~~v~a~ 182 (442)
++. .-..-|.....|++++.|++..|. .+.++.|+|..- ..-|... .... .+.+.. ..+.+
T Consensus 1472 t~q---nptHtY~~~GtYtVtLTvtN~~Gs------st~T~~VtV~~p---V~~~tinas~~~----vpl~~s-V~Fta- 1533 (2740)
T TIGR00864 1472 EGP---EILHAFNSPGDFNIRLAAANEVGK------NEATLNVAVKAR---VRGLTINASLTN----VPLNGS-VHFEA- 1533 (2740)
T ss_pred Ccc---ceEEEcCCCceEEEEEEEECCCCc------eEEEEEEEEecc---ccceEEcCCCcc----ccccce-EEEEE-
Confidence 111 111236778899999999998873 344556666431 1111111 0111 111111 12222
Q ss_pred eCCCCCCceEEEEEecCCC-CCeEecCcceEEEEcccCCcccccEEEEEEEEEeCCCCCCceeEEEEEEEEeC
Q psy6838 183 DRDEGQNSYVEYSIINSEA-TPFSLGPVDGLLRVAQKLDREIKSNYTLEIKAKDRGEPSKSSRTKFFIRLLDE 254 (442)
Q Consensus 183 D~D~~~~~~~~y~i~~~~~-~~F~id~~tG~i~~~~~lD~e~~~~~~l~V~a~D~~~~~~~~~~~v~I~V~d~ 254 (442)
+.+.|.. ++|+-.=++. ..+.=++ + . .-.|.....|++++.|++..+ +...++.|.|.+.
T Consensus 1534 ~~s~Gs~--v~ysWdFGDg~ts~~~np-t-~-----~yTY~spGtYtVtLTvtN~~G---s~~~T~~i~V~~~ 1594 (2740)
T TIGR00864 1534 HLDAGDD--VRFSWILCDHCTPIFGGN-T-I-----FYTFRSVGTFNIIVTAENDVG---AAQASIFLFVLQE 1594 (2740)
T ss_pred EccCCCc--eeEEEEeCCCCccccCCC-c-e-----EEeecCCceEEEEEEEecCCC---ccceeEEEEEeec
Confidence 2333333 3444333321 1111011 0 0 123567899999999999864 3355566666653
No 39
>PF12245 Big_3_2: Bacterial Ig-like domain (group 3); InterPro: IPR022038 This family of proteins is found in bacteria. They have two conserved sequence motifs: AGN and GMT.
Probab=52.79 E-value=59 Score=22.24 Aligned_cols=30 Identities=40% Similarity=0.491 Sum_probs=21.4
Q ss_pred CcEEEEEEEEEeCCCCCCCCceeEEEEEEEEEecc
Q psy6838 118 KSSYNLVVMATDMAKPPQKRLSSTTQVSIVIRDVN 152 (442)
Q Consensus 118 ~~~~~l~v~a~D~~~~~~~~~s~~~~v~I~V~dvN 152 (442)
...|+|.++|+|..|. .........+.|..
T Consensus 22 dg~yt~~v~a~D~AGN-----~~~~~~~~~i~d~~ 51 (60)
T PF12245_consen 22 DGEYTLTVTATDKAGN-----TSSSTTQIVIVDNT 51 (60)
T ss_pred CccEEEEEEEEECCCC-----EEEeeeEEEEEcCC
Confidence 5689999999999984 44555555555543
No 40
>smart00089 PKD Repeats in polycystic kidney disease 1 (PKD1) and other proteins. Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.
Probab=51.66 E-value=36 Score=24.38 Aligned_cols=31 Identities=19% Similarity=0.224 Sum_probs=23.5
Q ss_pred cCCCccCcEEEEEEEEEECCCCCceEEEEEEEEE
Q psy6838 355 KLDYEDVHHYIFVVQAQDTGKPMMSSTVTVYFNV 388 (442)
Q Consensus 355 ~ld~e~~~~~~l~V~a~D~~~~~~~~~~~v~I~V 388 (442)
...|...+.|.+++.++|..+ +.++++.|.|
T Consensus 48 ~~~y~~~G~y~v~l~v~n~~g---~~~~~~~i~v 78 (79)
T smart00089 48 THTYTKPGTYTVTLTVTNAVG---SASATVTVVV 78 (79)
T ss_pred EEEeCCCcEEEEEEEEEcCCC---cEEEEEEEEE
Confidence 345677889999999999766 5566666665
No 41
>PF13754 Big_3_4: Bacterial Ig-like domain (group 3)
Probab=49.04 E-value=63 Score=21.53 Aligned_cols=16 Identities=25% Similarity=0.289 Sum_probs=13.5
Q ss_pred cCcEEEEEEEEEECCC
Q psy6838 360 DVHHYIFVVQAQDTGK 375 (442)
Q Consensus 360 ~~~~~~l~V~a~D~~~ 375 (442)
..+.|.++|+|+|..+
T Consensus 22 ~dG~y~itv~a~D~AG 37 (54)
T PF13754_consen 22 ADGTYTITVTATDAAG 37 (54)
T ss_pred CCccEEEEEEEEeCCC
Confidence 4678999999999765
No 42
>COG2706 3-carboxymuconate cyclase [Carbohydrate transport and metabolism]
Probab=47.65 E-value=1.5e+02 Score=28.43 Aligned_cols=34 Identities=24% Similarity=0.381 Sum_probs=22.4
Q ss_pred CcEEEEEEEeCCC-CCcEEEcCCccEEEEeeeccc
Q psy6838 43 NGVVTYSVVSDWA-NDVFSINPQSGIFTLTAKLDY 76 (442)
Q Consensus 43 ~~~~~y~l~~~~~-~~~f~id~~tg~~~~~~~ld~ 76 (442)
.|.+.|-...+.. -.-|+|++.+|.|.+...-.-
T Consensus 254 dGrFLYasNRg~dsI~~f~V~~~~g~L~~~~~~~t 288 (346)
T COG2706 254 DGRFLYASNRGHDSIAVFSVDPDGGKLELVGITPT 288 (346)
T ss_pred CCCEEEEecCCCCeEEEEEEcCCCCEEEEEEEecc
Confidence 4556665554432 256999999999887765433
No 43
>PF02494 HYR: HYR domain; InterPro: IPR003410 This domain is known as the HYR (Hyalin Repeat) domain, after the protein hyalin that is composed exclusively of this repeat. This domain probably corresponds to a new superfamily in the immunoglobulin fold. The function of this domain is uncertain it may be involved in cell adhesion. In the Sushi repeat-containing protein (SrpX), this domain is found between two sushi repeats.
Probab=46.82 E-value=42 Score=24.43 Aligned_cols=25 Identities=20% Similarity=0.125 Sum_probs=21.0
Q ss_pred cEEEEEEEEEECCCCCceEEEEEEEEE
Q psy6838 362 HHYIFVVQAQDTGKPMMSSTVTVYFNV 388 (442)
Q Consensus 362 ~~~~l~V~a~D~~~~~~~~~~~v~I~V 388 (442)
+.+.++.+|+|..+ ..+++++.|+|
T Consensus 57 G~t~V~ytA~D~~G--N~a~C~f~V~V 81 (81)
T PF02494_consen 57 GTTTVTYTATDAAG--NSATCSFTVTV 81 (81)
T ss_pred ceEEEEEEEEECCC--CEEEEEEEEEC
Confidence 47899999999866 67899998875
No 44
>PF08329 ChitinaseA_N: Chitinase A, N-terminal domain; InterPro: IPR013540 This domain is found in a number of bacterial chitinases and similar viral proteins. It is organised into a fibronectin III module domain-like fold, comprising only beta strands. Its function is not known, but it may be involved in interaction with the enzyme substrate, chitin [, ]. It is separated by a hinge region from the catalytic domain (IPR001223 from INTERPRO); this hinge region is probably mobile, allowing the N-terminal domain to have different relative positions in solution []. ; GO: 0004568 chitinase activity; PDB: 2WLY_A 1EDQ_A 2WM0_A 1X6N_A 1NH6_A 2WK2_A 1EHN_A 2WLZ_A 1EIB_A 1FFR_A ....
Probab=45.60 E-value=72 Score=26.02 Aligned_cols=51 Identities=18% Similarity=0.144 Sum_probs=27.0
Q ss_pred cCCCccCcEEEEEEEEEECCCCCceEEEEEEEEEeeCCCCCCccCCCCCeeEEecCC
Q psy6838 355 KLDYEDVHHYIFVVQAQDTGKPMMSSTVTVYFNVVDLNDNPPLFDPMPPNIPVYENV 411 (442)
Q Consensus 355 ~ld~e~~~~~~l~V~a~D~~~~~~~~~~~v~I~V~dvNd~~P~f~~~~~~~~v~E~~ 411 (442)
.+.+...+.|+++|+++|..+ -+....+.|.|.|..- -.+. +....+.||-
T Consensus 76 ~~~~~~gG~y~~~VeLCN~~G--CS~S~~~~V~VaDTDG--sHl~--pL~~~~~~nN 126 (133)
T PF08329_consen 76 TFTVTKGGRYQMQVELCNADG--CSTSAPVEVVVADTDG--SHLA--PLPYNWDENN 126 (133)
T ss_dssp EEEE-S-EEEEEEEEEEETTE--EEE---EEEEEE-TTS--TTS----------TTS
T ss_pred EEEecCCCEEEEEEEEECCCC--cccCCCEEEEEeCCCc--cccc--cccCcccccC
Confidence 455556789999999999877 6677789999999843 2222 2334566653
No 45
>KOG4221|consensus
Probab=41.47 E-value=5.9e+02 Score=29.10 Aligned_cols=137 Identities=14% Similarity=0.151 Sum_probs=76.5
Q ss_pred ccEEEEcccCCcccCcEEEEEEEEEeCCCCCCCCceeEEEEEEEEEeccCCCCeeccCCeeEeeCCCCCCcEEEEEEEEe
Q psy6838 104 NGTLSNLKHLDRETKSSYNLVVMATDMAKPPQKRLSSTTQVSIVIRDVNDNAPEFITANETTIPENSPLNTVVAALKAID 183 (442)
Q Consensus 104 ~g~l~~~~~lD~E~~~~~~l~v~a~D~~~~~~~~~s~~~~v~I~V~dvNd~~P~f~~~~~~~v~E~~~~g~~v~~v~a~D 183 (442)
.|.+.....|-. ...|.+.++|....|.. .+++.+.|.-. -| +|...+.+ ..+..-..+...-
T Consensus 481 ~g~~~tv~nl~p--~t~Y~~rv~A~n~~g~g----~sS~pLkV~t~--pE-gp~~~~a~--------ats~~ti~v~Wep 543 (1381)
T KOG4221|consen 481 PGIQVTVQNLSP--LTMYFFRVRAKNEAGSG----ESSAPLKVTTQ--PE-GPVQLQAY--------ATSPTTILVTWEP 543 (1381)
T ss_pred CceEEEeeeccc--ceeEEEEEeccCcccCC----ccCCceEEecC--CC-CCcccccc--------ccCcceEEEEecC
Confidence 453333444433 35799999998877642 34455555322 22 22221111 1111122233222
Q ss_pred CCCCCCceEEEEEec---CCCCCeEecCcceEEEEcccCCcccccEEEEEEEEEeCCCCCCceeEEEEEEEEeCCCCCCe
Q psy6838 184 RDEGQNSYVEYSIIN---SEATPFSLGPVDGLLRVAQKLDREIKSNYTLEIKAKDRGEPSKSSRTKFFIRLLDENDNNPI 260 (442)
Q Consensus 184 ~D~~~~~~~~y~i~~---~~~~~F~id~~tG~i~~~~~lD~e~~~~~~l~V~a~D~~~~~~~~~~~v~I~V~d~nd~~P~ 260 (442)
+-.+......|++.- +...++.++..+-++++. +.|....|.+.|.|.+..+++.++..--.++..|+-+.||.
T Consensus 544 P~~~n~~I~~yk~~ys~~~~~~~~~~~~n~~e~ti~---gL~k~TeY~~~vvA~N~~G~g~sS~~i~V~Tlsd~PsaPP~ 620 (1381)
T KOG4221|consen 544 PPFGNGPITGYKLFYSEDDTGKELRVENNATEYTIN---GLEKYTEYSIRVVAYNSAGSGVSSADITVRTLSDVPSAPPQ 620 (1381)
T ss_pred CCCCCCCceEEEEEEEcCCCCceEEEecCccEEEee---cCCCccceEEEEEEecCCCCCCCCCceEEEeccCCCCCCCc
Confidence 222222223343322 234678888777777665 44667889999999999888877666556777788887775
No 46
>cd00146 PKD polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.
Probab=40.66 E-value=67 Score=23.07 Aligned_cols=29 Identities=21% Similarity=0.310 Sum_probs=20.5
Q ss_pred CcccCcEEEEEEEEEeCCCCCCCCceeEEEEEEE
Q psy6838 114 DRETKSSYNLVVMATDMAKPPQKRLSSTTQVSIV 147 (442)
Q Consensus 114 D~E~~~~~~l~v~a~D~~~~~~~~~s~~~~v~I~ 147 (442)
-|.....|.+++.++|..+ .+.+..+.|.
T Consensus 52 ~y~~~G~y~v~l~v~d~~g-----~~~~~~~~V~ 80 (81)
T cd00146 52 TYTKPGTYTVTLTVTNAVG-----SSSTKTTTVV 80 (81)
T ss_pred EcCCCcEEEEEEEEEeCCC-----CEEEEEEEEE
Confidence 3667789999999999865 2444455554
No 47
>smart00089 PKD Repeats in polycystic kidney disease 1 (PKD1) and other proteins. Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.
Probab=38.03 E-value=82 Score=22.46 Aligned_cols=30 Identities=10% Similarity=0.131 Sum_probs=22.3
Q ss_pred CCcccccEEEEEEEEEeCCCCCCceeEEEEEEE
Q psy6838 219 LDREIKSNYTLEIKAKDRGEPSKSSRTKFFIRL 251 (442)
Q Consensus 219 lD~e~~~~~~l~V~a~D~~~~~~~~~~~v~I~V 251 (442)
.-|.....|.+++.+.|..+ +...+++|.|
T Consensus 49 ~~y~~~G~y~v~l~v~n~~g---~~~~~~~i~v 78 (79)
T smart00089 49 HTYTKPGTYTVTLTVTNAVG---SASATVTVVV 78 (79)
T ss_pred EEeCCCcEEEEEEEEEcCCC---cEEEEEEEEE
Confidence 34667889999999999865 5566666655
No 48
>PF03160 Calx-beta: Calx-beta domain; InterPro: IPR003644 The calx-beta motif is present as a tandem repeat in the cytoplasmic domains of Calx Na-Ca exchangers, which are used to expel calcium from cells. This motif overlaps domains used for calcium binding and regulation. The calx-beta motif is also present in the cytoplasmic tail of mammalian integrin-beta4, which mediates the bi-directional transfer of signals across the plasma membrane, as well as in some cyanobacterial proteins. This motif contains a series of beta-strands and turns that form a self-contained beta-sheet [, ].; GO: 0007154 cell communication, 0016021 integral to membrane; PDB: 3H6A_B 3FSO_A 3FQ4_B 2DPK_A 2QVM_A 3GIN_B 2QVK_A 2FWU_A 2FWS_A 3E9U_A ....
Probab=36.59 E-value=1.8e+02 Score=21.82 Aligned_cols=24 Identities=29% Similarity=0.364 Sum_probs=12.1
Q ss_pred EEEEEeCCCCCCeeeccceeeecccc
Q psy6838 248 FIRLLDENDNNPIFDSKSYSASVPEN 273 (442)
Q Consensus 248 ~I~V~d~nd~~P~f~~~~~~~~v~e~ 273 (442)
+|+|.| ||.+ .+.-......+.|+
T Consensus 2 tvtI~d-~d~~-~v~f~~~~~~v~E~ 25 (100)
T PF03160_consen 2 TVTILD-DDDP-TVSFSSPSYTVSEG 25 (100)
T ss_dssp EEEEE--TTSE-EEEESSSEEEEETT
T ss_pred EEEEEC-CCCC-EEEEeCCEEEEEeC
Confidence 466677 5544 65544444455554
No 49
>PF02494 HYR: HYR domain; InterPro: IPR003410 This domain is known as the HYR (Hyalin Repeat) domain, after the protein hyalin that is composed exclusively of this repeat. This domain probably corresponds to a new superfamily in the immunoglobulin fold. The function of this domain is uncertain it may be involved in cell adhesion. In the Sushi repeat-containing protein (SrpX), this domain is found between two sushi repeats.
Probab=32.32 E-value=95 Score=22.50 Aligned_cols=24 Identities=25% Similarity=0.289 Sum_probs=20.2
Q ss_pred EEEEEEEEEeCCCCCCCCceeEEEEEEEE
Q psy6838 120 SYNLVVMATDMAKPPQKRLSSTTQVSIVI 148 (442)
Q Consensus 120 ~~~l~v~a~D~~~~~~~~~s~~~~v~I~V 148 (442)
.|.++..|+|..| .++++.+.|+|
T Consensus 58 ~t~V~ytA~D~~G-----N~a~C~f~V~V 81 (81)
T PF02494_consen 58 TTTVTYTATDAAG-----NSATCSFTVTV 81 (81)
T ss_pred eEEEEEEEEECCC-----CEEEEEEEEEC
Confidence 6899999999877 47788888875
No 50
>cd02848 Chitinase_N_term Chitinase N-terminus domain. Chitinases hydrolyze the abundant natural biopolymer chitin, producing smaller chito-oligosaccharides. Chitin consists of multiple N-acetyl-D-glucosamine (NAG) residues connected via beta-1,4-glycosidic linkages and is an important structural element of fungal cell wall and arthropod exoskeletons. On the basis of the mode of chitin hydrolysis, chitinases are classified as random, endo-, and exo-chitinases and based on sequence criteria, chitinases belong to families 18 and 19 of glycosyl hydrolases. The N-terminus of chitinase may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitob
Probab=31.73 E-value=1.1e+02 Score=23.90 Aligned_cols=34 Identities=18% Similarity=0.220 Sum_probs=24.0
Q ss_pred cCCcccCcEEEEEEEEEeCCCCCCCCceeEEEEEEEEEe
Q psy6838 112 HLDRETKSSYNLVVMATDMAKPPQKRLSSTTQVSIVIRD 150 (442)
Q Consensus 112 ~lD~E~~~~~~l~v~a~D~~~~~~~~~s~~~~v~I~V~d 150 (442)
.+++.+...|.++|++.|..|. +.+..+.|.|.|
T Consensus 73 t~~v~kgG~y~m~V~lCn~dGC-----S~S~~~~I~VAD 106 (106)
T cd02848 73 TFKVGKGGRYQMQVALCNGDGC-----STSAAKEIVVAD 106 (106)
T ss_pred EEEeCCCCeEEEEEEEECCCCc-----cCcCCEEEEecC
Confidence 3556677899999999999874 444555565543
No 51
>PF13753 SWM_repeat: Putative flagellar system-associated repeat
Probab=27.07 E-value=5.4e+02 Score=24.28 Aligned_cols=26 Identities=15% Similarity=0.254 Sum_probs=17.4
Q ss_pred cccEEEEEEEEEeCCCCCCceeEEEEEE
Q psy6838 223 IKSNYTLEIKAKDRGEPSKSSRTKFFIR 250 (442)
Q Consensus 223 ~~~~~~l~V~a~D~~~~~~~~~~~v~I~ 250 (442)
....|.+.+.++|..+... .....+.
T Consensus 10 ~d~~~~v~vt~tD~aGN~~--~~t~~~~ 35 (317)
T PF13753_consen 10 ADGTYTVSVTVTDAAGNTS--TATQSIT 35 (317)
T ss_pred CCCcEEEEEEEEeCCCCee--eeeEEEE
Confidence 3567899999999875333 3444554
No 52
>PF03174 CHB_HEX_C: Chitobiase/beta-hexosaminidase C-terminal domain; InterPro: IPR004867 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. This short domain is found in members of the glycoside hydrolase family 20 (GH20 from CAZY) and represents the C-terminal domain in chitobiases and beta-hexosaminidases 3.2.1.52 from EC. It is composed of a beta sandwich structure []. The function of this domain is unknown. ; GO: 0004563 beta-N-acetylhexosaminidase activity, 0005975 carbohydrate metabolic process; PDB: 1C7T_A 1QBA_A 1QBB_A 1C7S_A.
Probab=24.29 E-value=1.6e+02 Score=21.03 Aligned_cols=58 Identities=7% Similarity=0.089 Sum_probs=24.5
Q ss_pred cEEEEeeecccccccCCCcEEEEEEeCCCCCcEEEeCCccEEEEcccCCcccCcEEEEEEEEEeCCCC
Q psy6838 66 GIFTLTAKLDYEDVSDNGRLEYSIVSGDDEEIFTINPSNGTLSNLKHLDRETKSSYNLVVMATDMAKP 133 (442)
Q Consensus 66 g~~~~~~~ld~~~~~~~~~i~y~i~~~~~~~~F~i~~~~g~l~~~~~lD~E~~~~~~l~v~a~D~~~~ 133 (442)
|.......+......++..|.|.+.+..+. ........+|-..+... |+++|.+.++.
T Consensus 10 G~~~~~~~v~l~~~~~~~~I~YT~DGs~Pt--------~~s~~Y~~Pi~i~~~~~--vka~a~~~~g~ 67 (75)
T PF03174_consen 10 GTYDEGQTVTLSSDTPGATIYYTLDGSEPT--------KSSPLYTGPITIPESGT--VKARAFDNGGN 67 (75)
T ss_dssp EEEEETTEEEEE-SSTTSEEEEESSSSSS------------EE--CCC--B--S----EEEEE-TTS-
T ss_pred CcEecCeEEEEEeCCCCCEEEEEcCCCccc--------ccCcccCcCEEeCCCcE--EEEEEEcCCCC
Confidence 333333333333334677899999765541 11112233444433333 88888887753
No 53
>PF08329 ChitinaseA_N: Chitinase A, N-terminal domain; InterPro: IPR013540 This domain is found in a number of bacterial chitinases and similar viral proteins. It is organised into a fibronectin III module domain-like fold, comprising only beta strands. Its function is not known, but it may be involved in interaction with the enzyme substrate, chitin [, ]. It is separated by a hinge region from the catalytic domain (IPR001223 from INTERPRO); this hinge region is probably mobile, allowing the N-terminal domain to have different relative positions in solution []. ; GO: 0004568 chitinase activity; PDB: 2WLY_A 1EDQ_A 2WM0_A 1X6N_A 1NH6_A 2WK2_A 1EHN_A 2WLZ_A 1EIB_A 1FFR_A ....
Probab=23.89 E-value=2.8e+02 Score=22.64 Aligned_cols=36 Identities=17% Similarity=0.298 Sum_probs=23.7
Q ss_pred cCCcccCcEEEEEEEEEeCCCCCCCCceeEEEEEEEEEecc
Q psy6838 112 HLDRETKSSYNLVVMATDMAKPPQKRLSSTTQVSIVIRDVN 152 (442)
Q Consensus 112 ~lD~E~~~~~~l~v~a~D~~~~~~~~~s~~~~v~I~V~dvN 152 (442)
.+.......|.++|++.+..| -+.+..+.|.|.|-+
T Consensus 76 ~~~~~~gG~y~~~VeLCN~~G-----CS~S~~~~V~VaDTD 111 (133)
T PF08329_consen 76 TFTVTKGGRYQMQVELCNADG-----CSTSAPVEVVVADTD 111 (133)
T ss_dssp EEEE-S-EEEEEEEEEEETTE-----EEE---EEEEEE-TT
T ss_pred EEEecCCCEEEEEEEEECCCC-----cccCCCEEEEEeCCC
Confidence 455566788999999999887 366678888888864
No 54
>PF13290 CHB_HEX_C_1: Chitobiase/beta-hexosaminidase C-terminal domain
Probab=22.55 E-value=2.8e+02 Score=19.44 Aligned_cols=44 Identities=7% Similarity=0.016 Sum_probs=23.0
Q ss_pred cCCCcEEEEEEeCCCCCcEEEeCCccEEEEcccCCcccCcEEEEEEEEEeCCCC
Q psy6838 80 SDNGRLEYSIVSGDDEEIFTINPSNGTLSNLKHLDRETKSSYNLVVMATDMAKP 133 (442)
Q Consensus 80 ~~~~~i~y~i~~~~~~~~F~i~~~~g~l~~~~~lD~E~~~~~~l~v~a~D~~~~ 133 (442)
..+..|.|.+.+..+... .. ...++|-. ....+++.+|.|.++.
T Consensus 18 ~~~~~IyYT~DGs~Pt~~-------S~-~Y~~Pi~i--~~~ttVka~a~~~~g~ 61 (67)
T PF13290_consen 18 DPDATIYYTTDGSEPTPS-------SP-LYTGPITI--TGTTTVKARAFDPDGN 61 (67)
T ss_pred CCCCEEEEEcCCCccccC-------CC-eeccCEEe--cCCEEEEEEEEcCCCc
Confidence 356788888875543210 00 01122211 1346788888887763
No 55
>TIGR03769 P_ac_wall_RPT actinobacterial surface-anchored protein domain. This model describes a repeat domain that one to three times in Actinobacterial proteins, some of which have LPXTG-type sortase recognition motifs for covalent attachment to the Gram-positive cell wall. Where it occurs with duplication in an LPXTG-anchored protein, it tends to be adjacent to the substrate-binding protein of the gene trio of an ABC transporter system, where that substrate-binding protein has a single copy of this same domain. This arrangement suggests a substrate-binding relay system, with the LPXTG protein acting as a substrate receptor.
Probab=20.13 E-value=2e+02 Score=18.01 Aligned_cols=14 Identities=29% Similarity=0.470 Sum_probs=10.7
Q ss_pred CccCcEEEEEEEEE
Q psy6838 358 YEDVHHYIFVVQAQ 371 (442)
Q Consensus 358 ~e~~~~~~l~V~a~ 371 (442)
|.+.+.|.|+++++
T Consensus 9 FT~PG~Y~l~~~a~ 22 (41)
T TIGR03769 9 FTKPGTYTLTVQAT 22 (41)
T ss_pred eCCCeEEEEEEEEE
Confidence 45677898888885
Done!