Query psy11780
Match_columns 383
No_of_seqs 297 out of 1814
Neff 8.5
Searched_HMMs 46136
Date Fri Aug 16 18:46:48 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy11780.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/11780hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG4289|consensus 100.0 1.6E-51 3.6E-56 420.5 26.3 308 3-379 221-537 (2531)
2 KOG4289|consensus 100.0 8.3E-50 1.8E-54 408.1 30.3 320 2-372 531-944 (2531)
3 KOG1219|consensus 100.0 2E-45 4.3E-50 386.3 34.2 304 4-376 904-1218(4289)
4 KOG1219|consensus 100.0 8E-45 1.7E-49 381.8 34.6 304 3-375 697-1116(4289)
5 cd00031 CA Cadherin repeat dom 100.0 3.9E-31 8.5E-36 235.8 27.9 197 51-306 2-199 (199)
6 cd00031 CA Cadherin repeat dom 99.9 3.9E-25 8.4E-30 197.0 22.0 156 183-374 1-159 (199)
7 KOG1834|consensus 99.8 6E-18 1.3E-22 165.2 19.7 201 33-264 23-229 (952)
8 PF00028 Cadherin: Cadherin do 99.7 9.8E-17 2.1E-21 125.3 12.9 91 184-305 1-93 (93)
9 PF00028 Cadherin: Cadherin do 99.7 8E-16 1.7E-20 120.1 11.3 92 51-170 1-93 (93)
10 smart00112 CA Cadherin repeats 99.6 1.5E-15 3.3E-20 114.8 10.3 78 204-312 1-79 (79)
11 smart00112 CA Cadherin repeats 99.6 2.2E-14 4.7E-19 108.5 10.4 74 98-177 6-79 (79)
12 KOG1834|consensus 99.3 4.8E-11 1E-15 117.6 15.3 174 167-373 21-204 (952)
13 PF08266 Cadherin_2: Cadherin- 97.7 4.8E-05 1E-09 57.7 3.7 74 185-259 4-80 (84)
14 PF08758 Cadherin_pro: Cadheri 97.1 0.004 8.6E-08 47.9 8.7 79 39-154 2-80 (90)
15 PF08758 Cadherin_pro: Cadheri 97.1 0.0065 1.4E-07 46.7 9.5 78 176-264 3-80 (90)
16 PF08266 Cadherin_2: Cadherin- 97.0 0.0004 8.6E-09 52.7 2.0 65 51-140 3-67 (84)
17 smart00736 CADG Dystroglycan-t 95.7 0.16 3.5E-06 39.5 10.2 69 203-309 24-96 (97)
18 TIGR01965 VCBS_repeat VCBS rep 95.0 0.17 3.8E-06 39.4 7.9 89 199-326 2-97 (99)
19 smart00736 CADG Dystroglycan-t 94.7 0.79 1.7E-05 35.6 11.1 65 102-174 31-96 (97)
20 TIGR01965 VCBS_repeat VCBS rep 94.6 0.79 1.7E-05 35.8 10.6 48 135-192 51-98 (99)
21 PF13750 Big_3_3: Bacterial Ig 92.4 6.3 0.00014 33.6 17.0 113 141-264 15-136 (158)
22 TIGR00845 caca sodium/calcium 86.9 53 0.0011 36.0 20.7 137 172-327 395-543 (928)
23 TIGR03660 T1SS_rpt_143 T1SS-14 86.1 9.5 0.0002 31.7 9.6 59 125-191 66-127 (137)
24 PF07495 Y_Y_Y: Y_Y_Y domain; 84.9 5.8 0.00012 27.9 7.0 46 211-264 7-52 (66)
25 PF05345 He_PIG: Putative Ig d 76.2 12 0.00026 25.1 5.6 37 223-264 13-49 (49)
26 KOG3597|consensus 71.1 1.2E+02 0.0026 30.5 14.9 60 162-222 25-84 (442)
27 KOG4221|consensus 68.8 2.1E+02 0.0046 32.4 17.4 66 105-178 554-620 (1381)
28 PF05895 DUF859: Siphovirus pr 68.4 1.3E+02 0.0027 31.8 13.5 109 141-264 298-425 (624)
29 TIGR03660 T1SS_rpt_143 T1SS-14 62.9 88 0.0019 26.0 9.9 58 236-327 71-128 (137)
30 PF13750 Big_3_3: Bacterial Ig 51.8 1.5E+02 0.0032 25.2 17.3 30 137-170 119-148 (158)
31 PF12245 Big_3_2: Bacterial Ig 49.2 57 0.0012 22.7 5.2 13 252-264 24-36 (60)
32 TIGR00845 caca sodium/calcium 48.3 4.2E+02 0.0092 29.4 16.0 56 162-221 514-569 (928)
33 PF07495 Y_Y_Y: Y_Y_Y domain; 40.1 61 0.0013 22.4 4.3 30 139-170 37-66 (66)
34 PF02494 HYR: HYR domain; Int 35.1 73 0.0016 23.3 4.2 13 252-264 58-70 (81)
35 cd00146 PKD polycystic kidney 33.1 89 0.0019 22.5 4.4 14 251-264 57-70 (81)
36 smart00089 PKD Repeats in poly 29.7 1.3E+02 0.0028 21.6 4.7 14 251-264 55-68 (79)
37 PF13754 Big_3_4: Bacterial Ig 28.8 1.7E+02 0.0038 19.7 4.9 13 252-264 25-37 (54)
38 cd02848 Chitinase_N_term Chiti 27.8 77 0.0017 25.0 3.3 15 250-264 79-93 (106)
39 PF12245 Big_3_2: Bacterial Ig 26.5 2.2E+02 0.0048 19.6 5.7 33 140-178 22-54 (60)
40 PF08329 ChitinaseA_N: Chitina 25.9 91 0.002 25.8 3.5 31 249-308 81-111 (133)
41 cd00146 PKD polycystic kidney 21.6 2.3E+02 0.0049 20.3 4.8 20 135-154 51-70 (81)
42 cd02848 Chitinase_N_term Chiti 21.1 1E+02 0.0022 24.4 2.8 33 134-170 73-105 (106)
43 PF08329 ChitinaseA_N: Chitina 21.1 3.4E+02 0.0074 22.4 6.0 36 134-173 76-111 (133)
No 1
>KOG4289|consensus
Probab=100.00 E-value=1.6e-51 Score=420.52 Aligned_cols=308 Identities=28% Similarity=0.399 Sum_probs=287.8
Q ss_pred eeccccccceeeeeEEEEeeeee-------eeeEEEEEeecC-CCCCeEEeCCCeeEEEEEecCCCCCCCeEEEEEEEEc
Q psy11780 3 NLLNAGVNRVKEKHVLLIGTYLL-------LSAILVDSSVSG-NRAPRFIIDEQTEIVIRLKEGPDTPAGTFIYRLRGAD 74 (383)
Q Consensus 3 ~~~~~~lD~E~~~~~~l~~~~~~-------~~~~~~i~v~~~-Dn~P~F~~~~~~~~~~~v~E~~~~~~g~~i~~v~A~D 74 (383)
+.+++.||||++..+.|.|+|.| .+++++|.|.|. ||.|+|. |..|.-+++| |.++|+.|++++|+|
T Consensus 221 irta~~lDREt~e~HvlrVtA~d~~~P~~SAtttv~V~V~D~nDhsPvFE---q~~Y~e~lRE--n~evGy~vLtvrAtD 295 (2531)
T KOG4289|consen 221 IRTAKSLDRETKETHVLRVTAQDHGDPRRSATTTVTVLVLDTNDHSPVFE---QDEYREELRE--NLEVGYEVLTVRATD 295 (2531)
T ss_pred chhhhhhhhhhhheeEEEEEeeecCCCcccceeEEEEEEeecCCCCcccc---hhHHHHHHhh--ccccCceEEEEEecc
Confidence 45789999999999999999988 568888888777 9999999 9999999999 999999999999999
Q ss_pred CCCCceEEEEEccccceEEEeecceeEEEEEEEeeeCCCCCCcEEEEeecceEEEEEcccCCCCCCcEEEEEEEEEeCCC
Q psy11780 75 PDGDVLTFGVQGHGHDVIRIENLNVMLKVVYHLENVDGDEKDLFSISTVNGKGVIKLIGKLDYENKFLYQLKILAVDRSN 154 (383)
Q Consensus 75 ~D~~~~~~~~~~~~~~~~~i~~~~~~~~v~y~i~~~~~~~~~~F~I~~~t~~G~I~~~~~LD~E~~~~~~l~V~a~D~~~ 154 (383)
.|.+ +|+++.|++ ..|+..+.|.|++.+ |.|.++.+||||+...|+|.|.|+|+|.
T Consensus 296 ~Dsp--------------------~Nani~Yrl--~eg~~~~~f~in~rS--GvI~T~a~lDRE~~~~y~L~VeAsDqG~ 351 (2531)
T KOG4289|consen 296 GDSP--------------------PNANIRYRL--LEGNAKNVFEINPRS--GVISTRAPLDREELESYQLDVEASDQGR 351 (2531)
T ss_pred CCCC--------------------CCCceEEEe--cCCCccceeEEcCcc--ceeeccCccCHHhhhheEEEEEeccCCC
Confidence 9995 488999999 667888999999998 9999999999999999999999999998
Q ss_pred CCCCCeeEEEEEEEEEEcCCCCCeEecCccEEEEcCCCCCCcEEEEEEEEeCCCCCCceEEEEEecCC-cccEEEeCCCc
Q psy11780 155 NDRRNTGTAAILVKVQDVEDQPPEFVAVSSVTRVSEDVPVGTSVLQVKAVDGDRGINNKILYSISSGS-QGIFDIDPNTG 233 (383)
Q Consensus 155 ~~~~~~~~~~v~I~V~DvNDn~P~f~~~~~~~~v~E~~~~gt~v~~v~A~D~D~~~n~~i~y~i~~~~-~~~F~Id~~tG 233 (383)
++.+ .++.|.|+|+|+|||+|+|....|.++|.|+..+++.|++|+|+|.|.|.|+.+.|+|.+++ .+.|.||..||
T Consensus 352 ~pgp--~Ta~V~itV~D~NDNaPqFse~~Yvvqv~Edvt~~avvlrV~AtDrD~g~Ng~VHYsi~Sgn~~G~f~id~~tG 429 (2531)
T KOG4289|consen 352 PPGP--RTAMVEITVEDENDNAPQFSEKRYVVQVREDVTPPAVVLRVTATDRDKGTNGKVHYSIASGNGRGQFYIDSLTG 429 (2531)
T ss_pred CCCC--ceEEEEEEEEecCCCCccccccceEEEecccCCCCceEEEEEecccCCCcCceEEEEeeccCccccEEEecccc
Confidence 7544 38999999999999999999999999999999999999999999999999999999999877 78999999999
Q ss_pred eEEEcccCCcccccCCCceEEEEEEEEeCCCCCCCCCCCceeEEEEEeeecccccccCCCCceEEEEEEEEeecCCCCCc
Q psy11780 234 NIFTLKELDRESATSNNGAYILEIKVTEESKTVQPAPTAKTEVTIIITDVTEESKTVQPAPTAKTEVTIIITDVNDETPT 313 (383)
Q Consensus 234 ~i~~~~~lD~E~~~~~~~~y~l~V~a~D~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~v~I~V~DvNDn~P~ 313 (383)
+|.+..+||+|.. . |++.|+|.|++. |+++.+.-+.|+|+|+|||+|.
T Consensus 430 el~vv~plD~e~~-~----ytl~IrAqDggr---------------------------PpLsn~sgl~iqVlDINDhaPi 477 (2531)
T KOG4289|consen 430 ELDVVEPLDFENS-E----YTLRIRAQDGGR---------------------------PPLSNTSGLVIQVLDINDHAPI 477 (2531)
T ss_pred eEEEeccccccCC-e----eEEEEEcccCCC---------------------------CCccCCCceEEEEEecCCCCce
Confidence 9999999999998 4 999999999998 8899999888999999999999
Q ss_pred ccCCceEEEEeCCCCCCcceEEeecccceEEeCCCCCCeEEEEEEeCCCccEEEecccEEEEEEEE
Q psy11780 314 FRSPRYVAEIHENSVINTPVTFLGGVVPQVFDYDQGKNGTFQMFIEGDMGTFEITPQKIINEASFL 379 (383)
Q Consensus 314 f~~~~y~~~V~E~~~~g~~v~~v~~~~~~a~D~D~g~n~~i~y~l~~~~~~F~I~~~tG~~~~~~~ 379 (383)
|...++.++|.|+.+.|..+..+. |.|.|+|+|+.+.|++.|-. .|.|+..+|..+..-.
T Consensus 478 fvstpfq~tvlEnv~lg~~v~~vq-----aidadsg~na~l~y~laG~~-pf~I~~~SG~Itvtk~ 537 (2531)
T KOG4289|consen 478 FVSTPFQATVLENVPLGYLVCHVQ-----AIDADSGENARLHYSLAGVG-PFQINNGSGWITVTKE 537 (2531)
T ss_pred eEechhhhhhhhcccccceEEEEe-----cccCCCCcccceeeeeccCC-CeeEecCCceEEEeec
Confidence 999999999999999999999998 99999999999999997654 7999999999877654
No 2
>KOG4289|consensus
Probab=100.00 E-value=8.3e-50 Score=408.11 Aligned_cols=320 Identities=31% Similarity=0.461 Sum_probs=291.3
Q ss_pred eeeccccccceeeeeEEEEeeeee-------eeeEEEEEeecC-CCCCeEEeCCCeeEEEEEecCCCCCCCeEEEEEEEE
Q psy11780 2 INLLNAGVNRVKEKHVLLIGTYLL-------LSAILVDSSVSG-NRAPRFIIDEQTEIVIRLKEGPDTPAGTFIYRLRGA 73 (383)
Q Consensus 2 ~~~~~~~lD~E~~~~~~l~~~~~~-------~~~~~~i~v~~~-Dn~P~F~~~~~~~~~~~v~E~~~~~~g~~i~~v~A~ 73 (383)
.+++.+.||||+.+.|.|.|.++| ++++|.|.+.|+ ||.|.|. +..|.++|.| +.|.|+.|++++|+
T Consensus 531 ~Itvtk~ldrEt~~~ysl~V~ard~gtp~l~tstsI~Vtv~dvndndP~Ft---~~eytl~inE--D~pvgsSI~tvtAv 605 (2531)
T KOG4289|consen 531 WITVTKELDRETVEHYSLGVEARDHGTPPLSTSTSISVTVLDVNDNDPTFT---QKEYTLRINE--DAPVGSSIVTVTAV 605 (2531)
T ss_pred eEEEeecccccccceEEEEEEEcCCCCCcccccceEEEEecccCCCCCccc---cCceEEEecC--CccccceEEEEEEe
Confidence 367899999999999999999998 678888998888 9999999 9999999999 99999999999999
Q ss_pred cCCCC-ceEEEEEcc-ccceEEEeecceeE--------------------------------------------------
Q psy11780 74 DPDGD-VLTFGVQGH-GHDVIRIENLNVML-------------------------------------------------- 101 (383)
Q Consensus 74 D~D~~-~~~~~~~~~-~~~~~~i~~~~~~~-------------------------------------------------- 101 (383)
|.|.. ..+|.|.|- ....|.|+..+-.|
T Consensus 606 D~d~~s~ityqi~g~ntrn~Fsi~si~g~Glitlalp~dkKqe~~~vl~vtAtDg~l~d~~~V~v~I~danThrpvFqs~ 685 (2531)
T KOG4289|consen 606 DRDANSVITYQITGGNTRNRFSISSIGGGGLITLALPLDKKQERQYVLAVTATDGTLQDTCSVNVNITDANTHRPVFQSS 685 (2531)
T ss_pred ccccccceEEEecCCcccccceeeccCCcceEEeecchhhcccceEEEEEEecCCccccceEEEEEeeecccCCcccccC
Confidence 99975 488999883 36777777766111
Q ss_pred ------------------------------EEEEEEeeeCCCCCCcEEEEeecceEEEEEcccCCCCCCcEEEEEEEEEe
Q psy11780 102 ------------------------------KVVYHLENVDGDEKDLFSISTVNGKGVIKLIGKLDYENKFLYQLKILAVD 151 (383)
Q Consensus 102 ------------------------------~v~y~i~~~~~~~~~~F~I~~~t~~G~I~~~~~LD~E~~~~~~l~V~a~D 151 (383)
+|+|-+ ....|+||+++ |.+++...||||.+-.|.+.+.|+|
T Consensus 686 pfTvsI~e~rP~G~tvvtlsasd~D~geNARI~y~l------ed~~Frid~ds--g~i~t~~~ld~edqvtytl~itA~D 757 (2531)
T KOG4289|consen 686 PFTVSINEDRPLGTTVVTLSASDEDTGENARITYIL------EDEAFRIDPDS--GAIYTQAELDYEDQVTYTLAITARD 757 (2531)
T ss_pred CeeEeeccCCcCCceeEEEecccCCCCccceEEEEe------cccceeecCCC--CceEEeeeeecccceeeEeeeeecC
Confidence 566633 13459999998 9999999999999999999999999
Q ss_pred CCCCCCCCeeEEEEEEEEEEcCCCCCeEecCccEEEEcCCCCCCcEEEEEEEEeCCCCCCceEEEEEecCC--cccEEEe
Q psy11780 152 RSNNDRRNTGTAAILVKVQDVEDQPPEFVAVSSVTRVSEDVPVGTSVLQVKAVDGDRGINNKILYSISSGS--QGIFDID 229 (383)
Q Consensus 152 ~~~~~~~~~~~~~v~I~V~DvNDn~P~f~~~~~~~~v~E~~~~gt~v~~v~A~D~D~~~n~~i~y~i~~~~--~~~F~Id 229 (383)
.+.| +..++.+|.|.|.|+|||+|+|..+.|.++|.|++|++|.+++|.|+|+|.++|+.+.|.+.++. .+.|.|+
T Consensus 758 ~~~p--q~adtttveV~v~diNDnaPqf~assyt~sV~Ed~Pv~TsvlQVSatDaD~g~Ng~v~y~~qg~~d~p~~F~IE 835 (2531)
T KOG4289|consen 758 NGIP--QKADTTTVEVLVNDINDNAPQFLASSYTGSVFEDAPVFTSVLQVSATDADSGPNGRVYYTFQGGDDGPGDFYIE 835 (2531)
T ss_pred CCCC--CcCccEEEEEEeecccccCcccchhhceeEeecCCCCcceEEEEEEeccCCCCCceEEEEecCCCCCCCceEEc
Confidence 9987 66889999999999999999999999999999999999999999999999999999999998754 7899999
Q ss_pred CCCceEEEcccCCcccccCCCceEEEEEEEEeCCCCCCCCCCCceeEEEEEeeecccccccCCCCceEEEEEEEEeecCC
Q psy11780 230 PNTGNIFTLKELDRESATSNNGAYILEIKVTEESKTVQPAPTAKTEVTIIITDVTEESKTVQPAPTAKTEVTIIITDVND 309 (383)
Q Consensus 230 ~~tG~i~~~~~lD~E~~~~~~~~y~l~V~a~D~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~v~I~V~DvND 309 (383)
+.+|.|++.+.||||..+. |.|.+.|.|.|. |+++++++|+|+|+|+||
T Consensus 836 ptSGviRtl~rLdRE~~av----y~L~a~avDrg~---------------------------p~ls~~~eItvtvldvND 884 (2531)
T KOG4289|consen 836 PTSGVIRTLRRLDRENVAV----YVLAAYAVDRGN---------------------------PPLSAPVEITVTVLDVND 884 (2531)
T ss_pred cCcceeehhhhhcchheeE----EEEEEEEeeCCC---------------------------CCcCCceEEEEEEEecCC
Confidence 9999999999999999987 999999999999 889999999999999999
Q ss_pred CCCcccCCceEEEEeCCCCCCcceEEeecccceEEeCCCCCCeEEEEEEeCC--CccEEEecccE
Q psy11780 310 ETPTFRSPRYVAEIHENSVINTPVTFLGGVVPQVFDYDQGKNGTFQMFIEGD--MGTFEITPQKI 372 (383)
Q Consensus 310 n~P~f~~~~y~~~V~E~~~~g~~v~~v~~~~~~a~D~D~g~n~~i~y~l~~~--~~~F~I~~~tG 372 (383)
|||+|.+..|...|.||.+.|..++++. |.|+|+|+|+++.|+|.++ ...|.++...|
T Consensus 885 naPvfe~~e~e~~I~enspvgs~va~i~-----a~dpdEG~NA~IsYqIvgg~d~~~fq~de~~~ 944 (2531)
T KOG4289|consen 885 NAPVFEQDELELFIEENSPVGSVVALIT-----ADDPDEGPNAHISYQIVGGNDPELFQLDEFSG 944 (2531)
T ss_pred CCCCCCCcceeeEEeecCccceeeEEEE-----ccCCCcCCcceEEEeeccCccHHHHHHHHhhh
Confidence 9999999999999999999999999999 9999999999999999654 46788887777
No 3
>KOG1219|consensus
Probab=100.00 E-value=2e-45 Score=386.33 Aligned_cols=304 Identities=29% Similarity=0.424 Sum_probs=279.2
Q ss_pred eccccccceeeeeEEEEeeeee-------eeeEEEEEeecC---CCCCeEEeCCCeeEEEEEecCCCCCCCeEEEEEEEE
Q psy11780 4 LLNAGVNRVKEKHVLLIGTYLL-------LSAILVDSSVSG---NRAPRFIIDEQTEIVIRLKEGPDTPAGTFIYRLRGA 73 (383)
Q Consensus 4 ~~~~~lD~E~~~~~~l~~~~~~-------~~~~~~i~v~~~---Dn~P~F~~~~~~~~~~~v~E~~~~~~g~~i~~v~A~ 73 (383)
-+-++||+|+.+.|+|.|++.| +.+.+.+.++|. =|||.|. .-.-..+|.| |+|+|+.++++.|.
T Consensus 904 fi~~~LDf~k~~fynLsv~a~d~g~p~lss~chl~Vevldv~enlhpp~F~---~~v~e~~V~E--napiGT~vi~i~A~ 978 (4289)
T KOG1219|consen 904 FIGKPLDFEKSDFYNLSVTAVDRGTPILSSICHLEVEVLDVNENLHPPEFI---SFVTEGHVLE--NAPIGTIVIRIQAR 978 (4289)
T ss_pred EEecccccccccceEEEEEEecCCCcceeeeEEEEEEEeccCCCCCCcchh---eeeeeeeEee--cCCcceEEEEEEEe
Confidence 3567999999999999999988 567777777777 4679997 5556789999 99999999999999
Q ss_pred cCCCCceEEEEEccccceEEEeecceeEEEEEEEeeeCCCCCCcEEEEeecceEEEEEcccCCCCCCcEEEEEEEEEeCC
Q psy11780 74 DPDGDVLTFGVQGHGHDVIRIENLNVMLKVVYHLENVDGDEKDLFSISTVNGKGVIKLIGKLDYENKFLYQLKILAVDRS 153 (383)
Q Consensus 74 D~D~~~~~~~~~~~~~~~~~i~~~~~~~~v~y~i~~~~~~~~~~F~I~~~t~~G~I~~~~~LD~E~~~~~~l~V~a~D~~ 153 (383)
|.|. |.-|.+.|+|. .|+..+.|.|++.+ |.|++.++||||..+.|.|+|.|+|.|
T Consensus 979 deds--------------------gldg~l~Y~I~--~gdg~g~FsId~~t--G~irTl~~lDrE~ks~YwltveA~D~g 1034 (4289)
T KOG1219|consen 979 DEDS--------------------GLDGELSYKIR--TGDGDGIFSIDSTT--GSIRTLKALDREKKSSYWLTVEAKDLG 1034 (4289)
T ss_pred cCCC--------------------CccceEEEEEE--cCCcceeEEecCCc--ceEeechhhchhhcceEEEEEEEEecC
Confidence 9997 34579999995 56778999999988 999999999999999999999999999
Q ss_pred CCCCCCeeEEEEEEEEEEcCCCCCeEecCccEEEEcCCCCCCcEEEEEEEEeCCCCCCceEEEEEecCC-cccEEEeCCC
Q psy11780 154 NNDRRNTGTAAILVKVQDVEDQPPEFVAVSSVTRVSEDVPVGTSVLQVKAVDGDRGINNKILYSISSGS-QGIFDIDPNT 232 (383)
Q Consensus 154 ~~~~~~~~~~~v~I~V~DvNDn~P~f~~~~~~~~v~E~~~~gt~v~~v~A~D~D~~~n~~i~y~i~~~~-~~~F~Id~~t 232 (383)
.+ ++++-+.+.|.|+|+|||+|.|.++.|..+|.|++|.+..|.++.|+|+|...|+++.|.|.+++ .++|.||+.|
T Consensus 1035 t~--~~ssv~~vyI~ieDvNDn~Pq~s~pvy~asI~enSp~~vsivq~ea~D~Dsssn~kLmykI~sGnyq~FF~Id~~T 1112 (4289)
T KOG1219|consen 1035 TV--PLSSVCEVYIEIEDVNDNVPQFSSPVYYASISENSPETVSIVQAEANDPDSSSNQKLMYKITSGNYQGFFQIDPET 1112 (4289)
T ss_pred CC--ccccceeEEEEEEecCCCCcccCCceEeeeeccCCCCceEEEEeccCCCCcccCcceEEEEccCCccceEEEcccc
Confidence 87 56889999999999999999999999999999999999999999999999888999999999876 8999999999
Q ss_pred ceEEEcccCCcccccCCCceEEEEEEEEeCCCCCCCCCCCceeEEEEEeeecccccccCCCCceEEEEEEEEeecCCCCC
Q psy11780 233 GNIFTLKELDRESATSNNGAYILEIKVTEESKTVQPAPTAKTEVTIIITDVTEESKTVQPAPTAKTEVTIIITDVNDETP 312 (383)
Q Consensus 233 G~i~~~~~lD~E~~~~~~~~y~l~V~a~D~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~v~I~V~DvNDn~P 312 (383)
|.|++.+.||||++.. +.|.|+++|.|. |.+++...|.|.|+|+|||+|
T Consensus 1113 G~iTt~r~LDRE~qdE----HiLeVTi~D~ge---------------------------p~l~s~~rviV~IldvNdnsp 1161 (4289)
T KOG1219|consen 1113 GLITTIRRLDREKQDE----HILEVTIQDNGE---------------------------PWLCSNQRVIVSILDVNDNSP 1161 (4289)
T ss_pred ceeeeehhhccccccc----ceEEEEEecCCC---------------------------CccccceEEEEEEeeccCCch
Confidence 9999999999999998 999999999998 889999999999999999999
Q ss_pred cccCCceEEEEeCCCCCCcceEEeecccceEEeCCCCCCeEEEEEEeCCCccEEEecccEEEEE
Q psy11780 313 TFRSPRYVAEIHENSVINTPVTFLGGVVPQVFDYDQGKNGTFQMFIEGDMGTFEITPQKIINEA 376 (383)
Q Consensus 313 ~f~~~~y~~~V~E~~~~g~~v~~v~~~~~~a~D~D~g~n~~i~y~l~~~~~~F~I~~~tG~~~~ 376 (383)
.|.+..|...|+|...+ .+.++. |+|.|+|.|++++|.+..++..|+||..||+...
T Consensus 1162 ~Flqk~~~~~v~~r~s~--plyRl~-----a~d~DeG~narityniedgde~FsID~~t~vVss 1218 (4289)
T KOG1219|consen 1162 RFLQKKTFLRVPERSSP--PLYRLA-----AQDNDEGNNARITYNIEDGDEVFSIDIATGVVSS 1218 (4289)
T ss_pred hhhhheeEEEeeeccCC--ceeEEE-----EEecCCCcceEEEEecccCceEEEEeccCceEEe
Confidence 99999999999998765 888988 9999999999999999877777999999997544
No 4
>KOG1219|consensus
Probab=100.00 E-value=8e-45 Score=381.79 Aligned_cols=304 Identities=31% Similarity=0.446 Sum_probs=278.2
Q ss_pred eeccccccceeeeeEEEEeeeee-------eeeEEEEEeecC-CCCCeEEeCCCeeEEEEEecCCCCCCCeEEEEEEEEc
Q psy11780 3 NLLNAGVNRVKEKHVLLIGTYLL-------LSAILVDSSVSG-NRAPRFIIDEQTEIVIRLKEGPDTPAGTFIYRLRGAD 74 (383)
Q Consensus 3 ~~~~~~lD~E~~~~~~l~~~~~~-------~~~~~~i~v~~~-Dn~P~F~~~~~~~~~~~v~E~~~~~~g~~i~~v~A~D 74 (383)
+++.++||+|.+.+|.|.++|.| +-..+.+.+.|. ||+|+|. +..|.++|.| |+.+|+.|++|.|.|
T Consensus 697 itvas~ld~~~t~~yiLnvta~D~gtPqkss~r~l~v~vkd~ndn~p~f~---e~sy~vtvse--dtepgs~Ia~vetnd 771 (4289)
T KOG1219|consen 697 ITVASPLDNENTEQYILNVTAYDLGTPQKSSWRLLLVFVKDYNDNTPIFV---ERSYHVTVSE--DTEPGSFIAHVETND 771 (4289)
T ss_pred EEEecchhhhhhheeEEEEEEecCCCchhhceeeEEEEEEecccCCcccc---ccceEEEEec--CCCCCceEEEEEecc
Confidence 57889999999999999999998 345666777665 9999999 8899999999 999999999999999
Q ss_pred CCCCceEEEEEccccceEEEeecceeEEEEEEEeeeCCCCCCcEEEEeecceEEEEEcccCCCCCCcEEEEEEEEEeCCC
Q psy11780 75 PDGDVLTFGVQGHGHDVIRIENLNVMLKVVYHLENVDGDEKDLFSISTVNGKGVIKLIGKLDYENKFLYQLKILAVDRSN 154 (383)
Q Consensus 75 ~D~~~~~~~~~~~~~~~~~i~~~~~~~~v~y~i~~~~~~~~~~F~I~~~t~~G~I~~~~~LD~E~~~~~~l~V~a~D~~~ 154 (383)
.|+++ ||.++|+|. ++...|.|++.| |.|.+.++||||.+..|.|.|+|+|++.
T Consensus 772 ~D~g~--------------------NG~v~fsL~----n~sdvfsIdp~t--Givv~~~sLdrE~q~~y~l~I~a~dqp~ 825 (4289)
T KOG1219|consen 772 TDGGN--------------------NGMVSFSLL----NKSDVFSIDPFT--GIVVTSKSLDREGQTSYHLKIEARDQPP 825 (4289)
T ss_pred cCCCC--------------------CceEEEEec----CCcceEEecCcc--cEEEeccccCcccCceeEEEEEEcCCCC
Confidence 99954 899999993 468899999999 9999999999999999999999999987
Q ss_pred CCCCCeeEEEEEEEEEEcCCCCCeEecCc---------------------------------------------------
Q psy11780 155 NDRRNTGTAAILVKVQDVEDQPPEFVAVS--------------------------------------------------- 183 (383)
Q Consensus 155 ~~~~~~~~~~v~I~V~DvNDn~P~f~~~~--------------------------------------------------- 183 (383)
| ++.+.+.+.|.|+|+|||+|.|....
T Consensus 826 p--q~~svv~l~vsvedVndnpPkci~~hsr~kipedlp~gt~~~~l~A~d~diGq~~kvry~l~~~~v~~rvd~~sGav 903 (4289)
T KOG1219|consen 826 P--QLFSVVELDVSVEDVNDNPPKCIIRHSRSKIPEDLPYGTVTWQLVALDPDIGQLGKVRYYLTDDTVGERVDFPSGAV 903 (4289)
T ss_pred C--ceEEEEEEEEEEeeccCCCCccccccccccCcccCCCceEEEEhhhcCcccCcCceeEEEEecCccccccccccccE
Confidence 5 67889999999999999999442111
Q ss_pred ------------------------------------------------------cEEEEcCCCCCCcEEEEEEEEeCCCC
Q psy11780 184 ------------------------------------------------------SVTRVSEDVPVGTSVLQVKAVDGDRG 209 (383)
Q Consensus 184 ------------------------------------------------------~~~~v~E~~~~gt~v~~v~A~D~D~~ 209 (383)
-+.+|.||+|.|+.++++.|.|.|.|
T Consensus 904 fi~~~LDf~k~~fynLsv~a~d~g~p~lss~chl~Vevldv~enlhpp~F~~~v~e~~V~EnapiGT~vi~i~A~dedsg 983 (4289)
T KOG1219|consen 904 FIGKPLDFEKSDFYNLSVTAVDRGTPILSSICHLEVEVLDVNENLHPPEFISFVTEGHVLENAPIGTIVIRIQARDEDSG 983 (4289)
T ss_pred EEecccccccccceEEEEEEecCCCcceeeeEEEEEEEeccCCCCCCcchheeeeeeeEeecCCcceEEEEEEEecCCCC
Confidence 12679999999999999999999999
Q ss_pred CCceEEEEEecCC-cccEEEeCCCceEEEcccCCcccccCCCceEEEEEEEEeCCCCCCCCCCCceeEEEEEeeeccccc
Q psy11780 210 INNKILYSISSGS-QGIFDIDPNTGNIFTLKELDRESATSNNGAYILEIKVTEESKTVQPAPTAKTEVTIIITDVTEESK 288 (383)
Q Consensus 210 ~n~~i~y~i~~~~-~~~F~Id~~tG~i~~~~~lD~E~~~~~~~~y~l~V~a~D~~~~~~~~~~~~~~~~~~~~~~~~~~~ 288 (383)
..+.+.|+|..+. .+.|.||+.+|.|++.++||||..+. |-|+|.|.|.|.
T Consensus 984 ldg~l~Y~I~~gdg~g~FsId~~tG~irTl~~lDrE~ks~----YwltveA~D~gt------------------------ 1035 (4289)
T KOG1219|consen 984 LDGELSYKIRTGDGDGIFSIDSTTGSIRTLKALDREKKSS----YWLTVEAKDLGT------------------------ 1035 (4289)
T ss_pred ccceEEEEEEcCCcceeEEecCCcceEeechhhchhhcce----EEEEEEEEecCC------------------------
Confidence 9999999998765 78999999999999999999999988 999999999998
Q ss_pred ccCCCCceEEEEEEEEeecCCCCCcccCCceEEEEeCCCCCCcceEEeecccceEEeCCCCCCeEEEEEEe--CCCccEE
Q psy11780 289 TVQPAPTAKTEVTIIITDVNDETPTFRSPRYVAEIHENSVINTPVTFLGGVVPQVFDYDQGKNGTFQMFIE--GDMGTFE 366 (383)
Q Consensus 289 ~~~~~~s~~~~v~I~V~DvNDn~P~f~~~~y~~~V~E~~~~g~~v~~v~~~~~~a~D~D~g~n~~i~y~l~--~~~~~F~ 366 (383)
+++++.+.+.|.|+|+|||+|+|.++-|..+|.|+++.+..|.++. |.|+|...|+++.|.|. +..+.|.
T Consensus 1036 ---~~~ssv~~vyI~ieDvNDn~Pq~s~pvy~asI~enSp~~vsivq~e-----a~D~Dsssn~kLmykI~sGnyq~FF~ 1107 (4289)
T KOG1219|consen 1036 ---VPLSSVCEVYIEIEDVNDNVPQFSSPVYYASISENSPETVSIVQAE-----ANDPDSSSNQKLMYKITSGNYQGFFQ 1107 (4289)
T ss_pred ---CccccceeEEEEEEecCCCCcccCCceEeeeeccCCCCceEEEEec-----cCCCCcccCcceEEEEccCCccceEE
Confidence 7899999999999999999999999999999999999999999998 99999999999999995 4589999
Q ss_pred EecccEEEE
Q psy11780 367 ITPQKIINE 375 (383)
Q Consensus 367 I~~~tG~~~ 375 (383)
|++.||..+
T Consensus 1108 Id~~TG~iT 1116 (4289)
T KOG1219|consen 1108 IDPETGLIT 1116 (4289)
T ss_pred Eccccceee
Confidence 999999988
No 5
>cd00031 CA Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here
Probab=100.00 E-value=3.9e-31 Score=235.78 Aligned_cols=197 Identities=38% Similarity=0.558 Sum_probs=178.2
Q ss_pred EEEEEecCCCCCCCeEEEEEEEEcCCCCceEEEEEccccceEEEeecceeEEEEEEEeeeCCCCCCcEEEEeecceEEEE
Q psy11780 51 IVIRLKEGPDTPAGTFIYRLRGADPDGDVLTFGVQGHGHDVIRIENLNVMLKVVYHLENVDGDEKDLFSISTVNGKGVIK 130 (383)
Q Consensus 51 ~~~~v~E~~~~~~g~~i~~v~A~D~D~~~~~~~~~~~~~~~~~i~~~~~~~~v~y~i~~~~~~~~~~F~I~~~t~~G~I~ 130 (383)
|.+++.| ++++|+.++++.|+|+|.+. ++.+.|+|.. +...++|.|++.+ |.|+
T Consensus 2 ~~~~i~E--n~~~g~~v~~~~a~D~D~~~--------------------~~~~~y~i~~--~~~~~~F~i~~~t--G~l~ 55 (199)
T cd00031 2 YSVSVPE--NAPPGTVVGTVSATDPDSGE--------------------NGRVTYSILG--GNEDGLFSIDPNT--GVIT 55 (199)
T ss_pred eEEEEeC--CCCCCCEEEEEEEECCCCCC--------------------CceEEEEEeC--CCCcccEEEeCCC--CEEE
Confidence 6788999 99999999999999999842 5689999954 4434799999987 9999
Q ss_pred EcccCCCCCCcEEEEEEEEEeCCCCCCCCeeEEEEEEEEEEcCCCCCeEecCccEEEEcCCCCCCcEEEEEEEEeCCCCC
Q psy11780 131 LIGKLDYENKFLYQLKILAVDRSNNDRRNTGTAAILVKVQDVEDQPPEFVAVSSVTRVSEDVPVGTSVLQVKAVDGDRGI 210 (383)
Q Consensus 131 ~~~~LD~E~~~~~~l~V~a~D~~~~~~~~~~~~~v~I~V~DvNDn~P~f~~~~~~~~v~E~~~~gt~v~~v~A~D~D~~~ 210 (383)
+.++||||....|.|.|+|+|.|.+ ..+++..++|.|.|+||++|.|....|.+.|.|+.++|+.++++.|+|+|.+.
T Consensus 56 ~~~~lD~e~~~~~~l~v~a~D~g~~--~~~~~~~v~I~V~d~Nd~~P~~~~~~~~~~v~e~~~~~~~i~~~~a~D~D~~~ 133 (199)
T cd00031 56 TTKPLDREEQSEYTLTVVASDGGGP--PLSSTATVTVTVLDVNDNPPVFEQSSYEASVPENAPPGTVVGTVTATDADSGE 133 (199)
T ss_pred ECCCCCCcCCceEEEEEEEEECCcC--cceeEEEEEEEEccCCCCCCcccccceEEEEeCCCCCCCEEEEEEEEcCCCCC
Confidence 9999999999999999999998765 34689999999999999999999889999999999999999999999999988
Q ss_pred CceEEEEEecCCc-ccEEEeCCCceEEEcccCCcccccCCCceEEEEEEEEeCCCCCCCCCCCceeEEEEEeeecccccc
Q psy11780 211 NNKILYSISSGSQ-GIFDIDPNTGNIFTLKELDRESATSNNGAYILEIKVTEESKTVQPAPTAKTEVTIIITDVTEESKT 289 (383)
Q Consensus 211 n~~i~y~i~~~~~-~~F~Id~~tG~i~~~~~lD~E~~~~~~~~y~l~V~a~D~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 289 (383)
++.++|+|.++.. .+|.|++.+|.|++.+.||+|.... |.|.|.|+|.+.
T Consensus 134 ~~~~~y~l~~~~~~~~f~i~~~~G~i~~~~~ld~e~~~~----~~l~v~a~D~~~------------------------- 184 (199)
T cd00031 134 NAKLTYSILSGNDKELFSIDPNTGIITLAKPLDREEKSS----YELTVVATDGGG------------------------- 184 (199)
T ss_pred CccEEEEEeCCCCCCEEEEeCCceEEEeCCccCCccCce----EEEEEEEEECCC-------------------------
Confidence 9999999998764 8999999999999999999999876 999999999986
Q ss_pred cCCCCceEEEEEEEEee
Q psy11780 290 VQPAPTAKTEVTIIITD 306 (383)
Q Consensus 290 ~~~~~s~~~~v~I~V~D 306 (383)
+.+++++.++|.|.|
T Consensus 185 --~~~~~~~~i~i~v~d 199 (199)
T cd00031 185 --PPLSSTATVTVTVLD 199 (199)
T ss_pred --CCceeEEEEEEEEEC
Confidence 568888899998875
No 6
>cd00031 CA Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here
Probab=99.94 E-value=3.9e-25 Score=197.05 Aligned_cols=156 Identities=37% Similarity=0.538 Sum_probs=144.3
Q ss_pred ccEEEEcCCCCCCcEEEEEEEEeCCCCCCceEEEEEecCCc-ccEEEeCCCceEEEcccCCcccccCCCceEEEEEEEEe
Q psy11780 183 SSVTRVSEDVPVGTSVLQVKAVDGDRGINNKILYSISSGSQ-GIFDIDPNTGNIFTLKELDRESATSNNGAYILEIKVTE 261 (383)
Q Consensus 183 ~~~~~v~E~~~~gt~v~~v~A~D~D~~~n~~i~y~i~~~~~-~~F~Id~~tG~i~~~~~lD~E~~~~~~~~y~l~V~a~D 261 (383)
.|.+.|+|+++.|+.++++.|.|+|.+.++.++|+|.++.. .+|.|++.+|.|++.+.||||.... |.|.|.|+|
T Consensus 1 ~~~~~i~En~~~g~~v~~~~a~D~D~~~~~~~~y~i~~~~~~~~F~i~~~tG~l~~~~~lD~e~~~~----~~l~v~a~D 76 (199)
T cd00031 1 SYSVSVPENAPPGTVVGTVSATDPDSGENGRVTYSILGGNEDGLFSIDPNTGVITTTKPLDREEQSE----YTLTVVASD 76 (199)
T ss_pred CeEEEEeCCCCCCCEEEEEEEECCCCCCCceEEEEEeCCCCcccEEEeCCCCEEEECCCCCCcCCce----EEEEEEEEE
Confidence 36789999999999999999999999888999999998764 7999999999999999999999977 999999999
Q ss_pred CCCCCCCCCCCceeEEEEEeeecccccccCCCCceEEEEEEEEeecCCCCCcccCCceEEEEeCCCCCCcceEEeecccc
Q psy11780 262 ESKTVQPAPTAKTEVTIIITDVTEESKTVQPAPTAKTEVTIIITDVNDETPTFRSPRYVAEIHENSVINTPVTFLGGVVP 341 (383)
Q Consensus 262 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~v~I~V~DvNDn~P~f~~~~y~~~V~E~~~~g~~v~~v~~~~~ 341 (383)
.+. +.+++...++|.|.|+|||+|.|....|.+.|.|+.+.|+.++++.
T Consensus 77 ~g~---------------------------~~~~~~~~v~I~V~d~Nd~~P~~~~~~~~~~v~e~~~~~~~i~~~~---- 125 (199)
T cd00031 77 GGG---------------------------PPLSSTATVTVTVLDVNDNPPVFEQSSYEASVPENAPPGTVVGTVT---- 125 (199)
T ss_pred CCc---------------------------CcceeEEEEEEEEccCCCCCCcccccceEEEEeCCCCCCCEEEEEE----
Confidence 876 5567899999999999999999998889999999999999999998
Q ss_pred eEEeCCCCCCeEEEEEEeCCC--ccEEEecccEEE
Q psy11780 342 QVFDYDQGKNGTFQMFIEGDM--GTFEITPQKIIN 374 (383)
Q Consensus 342 ~a~D~D~g~n~~i~y~l~~~~--~~F~I~~~tG~~ 374 (383)
|+|+|.+.|+.++|+|.+.. ++|.|++.+|.-
T Consensus 126 -a~D~D~~~~~~~~y~l~~~~~~~~f~i~~~~G~i 159 (199)
T cd00031 126 -ATDADSGENAKLTYSILSGNDKELFSIDPNTGII 159 (199)
T ss_pred -EEcCCCCCCccEEEEEeCCCCCCEEEEeCCceEE
Confidence 99999999999999998775 799999988854
No 7
>KOG1834|consensus
Probab=99.79 E-value=6e-18 Score=165.24 Aligned_cols=201 Identities=24% Similarity=0.320 Sum_probs=157.3
Q ss_pred EeecCCCCCeEEeCCCeeEEEEEecCCCCCCCeEEEEEEEEcCCCCceEEEEEccccceEEEeecceeEEEEEEEeeeCC
Q psy11780 33 SSVSGNRAPRFIIDEQTEIVIRLKEGPDTPAGTFIYRLRGADPDGDVLTFGVQGHGHDVIRIENLNVMLKVVYHLENVDG 112 (383)
Q Consensus 33 ~v~~~Dn~P~F~~~~~~~~~~~v~E~~~~~~g~~i~~v~A~D~D~~~~~~~~~~~~~~~~~i~~~~~~~~v~y~i~~~~~ 112 (383)
.-..|-+.|.. ...|..-|.|++|+ -....-+.|-|.|. ||+|.-. -.-|.|.+ .+
T Consensus 23 aarankhkpwi----e~ey~gvV~Endnt--vll~Ppl~aLdkda-plr~age----------------iC~fklhg-q~ 78 (952)
T KOG1834|consen 23 AARANKHKPWI----EEEYHGVVTENDNT--VLLDPPLAALDKDA-PLRYAGE----------------ICGFKLHG-QP 78 (952)
T ss_pred cccccccCccc----ccceeEEEEeCCce--EEeCCCeeeecCCC-Ccccccc----------------cceeEecC-CC
Confidence 33445677766 67899999994332 23344578888887 3443211 34567743 12
Q ss_pred CCCCcEEEEeecceEEEEEcccCCCCCCcEEEEEEEEEeCCCCC----CCCeeEEEEEEEEEEcCCCCCeEecCccEEEE
Q psy11780 113 DEKDLFSISTVNGKGVIKLIGKLDYENKFLYQLKILAVDRSNND----RRNTGTAAILVKVQDVEDQPPEFVAVSSVTRV 188 (383)
Q Consensus 113 ~~~~~F~I~~~t~~G~I~~~~~LD~E~~~~~~l~V~a~D~~~~~----~~~~~~~~v~I~V~DvNDn~P~f~~~~~~~~v 188 (383)
-+-..-.+|..||+|.|+.+.+||+|.++.|+|+|+|.|.|..+ ...+..++|+|.|.|+|+.+|+|..+.|.+.|
T Consensus 79 vPFdavVvdK~TGegvlRaK~~lDCelqkeytf~iQAydCg~gpdgtn~kKShkatvhIrVkDvNe~AP~f~ep~Yka~V 158 (952)
T KOG1834|consen 79 VPFDAVVVDKYTGEGVLRAKEPLDCELQKEYTFTIQAYDCGNGPDGTNTKKSHKATVHIRVKDVNEFAPVFKEPWYKAHV 158 (952)
T ss_pred CCceEEEEeccCCceEEeecCcccccccccceEEEEEEecCCCCCccccccccceEEEEEeccccccCchhcccceeeEE
Confidence 22334456788999999999999999999999999999988632 13577899999999999999999999999999
Q ss_pred cCCCCCCcEEEEEEEEeCCCC-CCceE-EEEEecCCcccEEEeCCCceEEEcccCCcccccCCCceEEEEEEEEeCCC
Q psy11780 189 SEDVPVGTSVLQVKAVDGDRG-INNKI-LYSISSGSQGIFDIDPNTGNIFTLKELDRESATSNNGAYILEIKVTEESK 264 (383)
Q Consensus 189 ~E~~~~gt~v~~v~A~D~D~~-~n~~i-~y~i~~~~~~~F~Id~~tG~i~~~~~lD~E~~~~~~~~y~l~V~a~D~~~ 264 (383)
.|. +.-..|++|.|.|.|.+ ++++| .|.|.. .+-.|.||. .|.|+.+.+|.|.+... |.|+|+|.|+|.
T Consensus 159 ~EG-K~yd~il~veAiD~DCspq~sqIC~YEI~t-~d~PFaIdn-~G~irnTekLny~ke~~----Y~ltVtAyDCg~ 229 (952)
T KOG1834|consen 159 TEG-KVYDSILRVEAIDKDCSPQYSQICEYEITT-PDVPFAIDN-DGNIRNTEKLNYTKEHQ----YKLTVTAYDCGK 229 (952)
T ss_pred ecc-eeeeeeEEEEeecCCCCCcccceeEEEecC-CCCceEEcC-CCcccccccccccccee----EEEEEEEEeccc
Confidence 998 56677999999999984 45555 688885 688999986 59999999999998876 999999999997
No 8
>PF00028 Cadherin: Cadherin domain; InterPro: IPR002126 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion.; GO: 0005509 calcium ion binding, 0007156 homophilic cell adhesion, 0016020 membrane; PDB: 2A4E_A 2A4C_B 2O72_A 2QVI_A 1NCJ_A 3Q2W_A 3Q2N_A 3LNH_B 3LNI_A 3Q2L_A ....
Probab=99.72 E-value=9.8e-17 Score=125.34 Aligned_cols=91 Identities=44% Similarity=0.645 Sum_probs=85.6
Q ss_pred cEEEEcCCCCCCcEEEEEEEEeCCCCCCceEEEEEecCC-cccEEEeCCCceEEEcccCCcccccCCCceEEEEEEEEeC
Q psy11780 184 SVTRVSEDVPVGTSVLQVKAVDGDRGINNKILYSISSGS-QGIFDIDPNTGNIFTLKELDRESATSNNGAYILEIKVTEE 262 (383)
Q Consensus 184 ~~~~v~E~~~~gt~v~~v~A~D~D~~~n~~i~y~i~~~~-~~~F~Id~~tG~i~~~~~lD~E~~~~~~~~y~l~V~a~D~ 262 (383)
|.++|+|++++|+.++++.|.|+|.+.|+.+.|+|.++. .++|.|++.+|.|++.++||||.... |.|.|.|+|.
T Consensus 1 Y~~~v~E~~~~g~~v~~v~a~D~D~~~n~~i~y~i~~~~~~~~F~I~~~tg~i~~~~~LD~E~~~~----y~l~v~a~D~ 76 (93)
T PF00028_consen 1 YSFSVPENAPPGTVVGQVTATDPDSGPNSQITYSILGGNPDGLFSIDPNTGEISLKKPLDRETQSS----YQLTVRATDS 76 (93)
T ss_dssp EEEEEETTGSTSSEEEEEEEEESSTSTTSSEEEEEEETTSTTSEEEETTTTEEEESSSSCTTTTSE----EEEEEEEEET
T ss_pred CEEEEECCCCCCCEEEEEEEEeCCCCCCceEEEEEecCcccCceEEeeeeeccccceecCcccCCE----EEEEEEEEEC
Confidence 789999999999999999999999999999999999887 89999999999999999999999977 9999999999
Q ss_pred -CCCCCCCCCCceeEEEEEeeecccccccCCCCceEEEEEEEEe
Q psy11780 263 -SKTVQPAPTAKTEVTIIITDVTEESKTVQPAPTAKTEVTIIIT 305 (383)
Q Consensus 263 -~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~v~I~V~ 305 (383)
+. |++++++.|+|+|+
T Consensus 77 ~~~---------------------------~~~~~~~~V~I~V~ 93 (93)
T PF00028_consen 77 GGS---------------------------PPLSSTATVTINVL 93 (93)
T ss_dssp TTS---------------------------SEEEEEEEEEEEEE
T ss_pred CCC---------------------------CCCEEEEEEEEEEC
Confidence 65 77899999999885
No 9
>PF00028 Cadherin: Cadherin domain; InterPro: IPR002126 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion.; GO: 0005509 calcium ion binding, 0007156 homophilic cell adhesion, 0016020 membrane; PDB: 2A4E_A 2A4C_B 2O72_A 2QVI_A 1NCJ_A 3Q2W_A 3Q2N_A 3LNH_B 3LNI_A 3Q2L_A ....
Probab=99.66 E-value=8e-16 Score=120.14 Aligned_cols=92 Identities=29% Similarity=0.458 Sum_probs=82.0
Q ss_pred EEEEEecCCCCCCCeEEEEEEEEcCCCCceEEEEEccccceEEEeecceeEEEEEEEeeeCCCCCCcEEEEeecceEEEE
Q psy11780 51 IVIRLKEGPDTPAGTFIYRLRGADPDGDVLTFGVQGHGHDVIRIENLNVMLKVVYHLENVDGDEKDLFSISTVNGKGVIK 130 (383)
Q Consensus 51 ~~~~v~E~~~~~~g~~i~~v~A~D~D~~~~~~~~~~~~~~~~~i~~~~~~~~v~y~i~~~~~~~~~~F~I~~~t~~G~I~ 130 (383)
|.++|+| ++++|+.++++.|.|+|.+ .|+.+.|+|. .+++.++|.|++.+ |.|.
T Consensus 1 Y~~~v~E--~~~~g~~v~~v~a~D~D~~--------------------~n~~i~y~i~--~~~~~~~F~I~~~t--g~i~ 54 (93)
T PF00028_consen 1 YSFSVPE--NAPPGTVVGQVTATDPDSG--------------------PNSQITYSIL--GGNPDGLFSIDPNT--GEIS 54 (93)
T ss_dssp EEEEEET--TGSTSSEEEEEEEEESSTS--------------------TTSSEEEEEE--ETTSTTSEEEETTT--TEEE
T ss_pred CEEEEEC--CCCCCCEEEEEEEEeCCCC--------------------CCceEEEEEe--cCcccCceEEeeee--eccc
Confidence 7899999 9999999999999999963 4779999995 45668999999998 8999
Q ss_pred EcccCCCCCCcEEEEEEEEEeC-CCCCCCCeeEEEEEEEEE
Q psy11780 131 LIGKLDYENKFLYQLKILAVDR-SNNDRRNTGTAAILVKVQ 170 (383)
Q Consensus 131 ~~~~LD~E~~~~~~l~V~a~D~-~~~~~~~~~~~~v~I~V~ 170 (383)
+.++||||....|.|.|.|+|. +.| .+++++.|+|+|+
T Consensus 55 ~~~~LD~E~~~~y~l~v~a~D~~~~~--~~~~~~~V~I~V~ 93 (93)
T PF00028_consen 55 LKKPLDRETQSSYQLTVRATDSGGSP--PLSSTATVTINVL 93 (93)
T ss_dssp ESSSSCTTTTSEEEEEEEEEETTTSS--EEEEEEEEEEEEE
T ss_pred cceecCcccCCEEEEEEEEEECCCCC--CCEEEEEEEEEEC
Confidence 9999999999999999999999 555 5788899999885
No 10
>smart00112 CA Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
Probab=99.64 E-value=1.5e-15 Score=114.84 Aligned_cols=78 Identities=45% Similarity=0.757 Sum_probs=72.2
Q ss_pred EeCCCCCCceEEEEEecCCc-ccEEEeCCCceEEEcccCCcccccCCCceEEEEEEEEeCCCCCCCCCCCceeEEEEEee
Q psy11780 204 VDGDRGINNKILYSISSGSQ-GIFDIDPNTGNIFTLKELDRESATSNNGAYILEIKVTEESKTVQPAPTAKTEVTIIITD 282 (383)
Q Consensus 204 ~D~D~~~n~~i~y~i~~~~~-~~F~Id~~tG~i~~~~~lD~E~~~~~~~~y~l~V~a~D~~~~~~~~~~~~~~~~~~~~~ 282 (383)
+|+|.|.|+.++|+|.++.. .+|.|++.+|.|++.++||||.... |.|.|.|.|.+.
T Consensus 1 ~D~D~g~n~~i~Y~i~~~~~~~~F~i~~~tg~i~~~~~LD~e~~~~----y~l~v~a~D~~~------------------ 58 (79)
T smart00112 1 TDADSGENGKVTYSILSGNEDGLFSIDPETGEITTTKPLDREEQPE----YTLTVEATDGGG------------------ 58 (79)
T ss_pred CCCCCCcCcEEEEEEecCCCCCEEEEeCCccEEEeCCccCeeCCCe----EEEEEEEEECCC------------------
Confidence 48899889999999997764 8999999999999999999999876 999999999998
Q ss_pred ecccccccCCCCceEEEEEEEEeecCCCCC
Q psy11780 283 VTEESKTVQPAPTAKTEVTIIITDVNDETP 312 (383)
Q Consensus 283 ~~~~~~~~~~~~s~~~~v~I~V~DvNDn~P 312 (383)
+.+++.+.|+|+|.|+|||+|
T Consensus 59 ---------~~~~~~~~v~I~V~D~Nd~~P 79 (79)
T smart00112 59 ---------PPLSSTATVTVTVLDVNDNAP 79 (79)
T ss_pred ---------CCcccEEEEEEEEEECCCCCC
Confidence 778999999999999999998
No 11
>smart00112 CA Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
Probab=99.57 E-value=2.2e-14 Score=108.45 Aligned_cols=74 Identities=35% Similarity=0.447 Sum_probs=65.5
Q ss_pred ceeEEEEEEEeeeCCCCCCcEEEEeecceEEEEEcccCCCCCCcEEEEEEEEEeCCCCCCCCeeEEEEEEEEEEcCCCCC
Q psy11780 98 NVMLKVVYHLENVDGDEKDLFSISTVNGKGVIKLIGKLDYENKFLYQLKILAVDRSNNDRRNTGTAAILVKVQDVEDQPP 177 (383)
Q Consensus 98 ~~~~~v~y~i~~~~~~~~~~F~I~~~t~~G~I~~~~~LD~E~~~~~~l~V~a~D~~~~~~~~~~~~~v~I~V~DvNDn~P 177 (383)
+.|+.+.|+|.. +....+|.|++.+ |.|++.++||||..+.|.|.|+|.|.+.+ .+++.+.|+|+|.|+|||+|
T Consensus 6 g~n~~i~Y~i~~--~~~~~~F~i~~~t--g~i~~~~~LD~e~~~~y~l~v~a~D~~~~--~~~~~~~v~I~V~D~Nd~~P 79 (79)
T smart00112 6 GENGKVTYSILS--GNEDGLFSIDPET--GEITTTKPLDREEQPEYTLTVEATDGGGP--PLSSTATVTVTVLDVNDNAP 79 (79)
T ss_pred CcCcEEEEEEec--CCCCCEEEEeCCc--cEEEeCCccCeeCCCeEEEEEEEEECCCC--CcccEEEEEEEEEECCCCCC
Confidence 347799999953 4434899999998 89999999999999999999999999876 56889999999999999998
No 12
>KOG1834|consensus
Probab=99.31 E-value=4.8e-11 Score=117.55 Aligned_cols=174 Identities=21% Similarity=0.239 Sum_probs=130.3
Q ss_pred EEEEEcCCCCCeEecCccEEEEcCCCCCCcEEEEEEEEeCCCCC--Cc-eEEEEEecCCcccEE---EeCCCc--eEEEc
Q psy11780 167 VKVQDVEDQPPEFVAVSSVTRVSEDVPVGTSVLQVKAVDGDRGI--NN-KILYSISSGSQGIFD---IDPNTG--NIFTL 238 (383)
Q Consensus 167 I~V~DvNDn~P~f~~~~~~~~v~E~~~~gt~v~~v~A~D~D~~~--n~-~i~y~i~~~~~~~F~---Id~~tG--~i~~~ 238 (383)
...--+|-+.|... ..|..-|.||-..=...--+.|-|.|.+. .+ .+-|.|.+. .-.|. +|..|| .|+.+
T Consensus 21 ~~aarankhkpwie-~ey~gvV~Endntvll~Ppl~aLdkdaplr~ageiC~fklhgq-~vPFdavVvdK~TGegvlRaK 98 (952)
T KOG1834|consen 21 HHAARANKHKPWIE-EEYHGVVTENDNTVLLDPPLAALDKDAPLRYAGEICGFKLHGQ-PVPFDAVVVDKYTGEGVLRAK 98 (952)
T ss_pred cccccccccCcccc-cceeEEEEeCCceEEeCCCeeeecCCCCcccccccceeEecCC-CCCceEEEEeccCCceEEeec
Confidence 34455677788765 56899999985332223357888888732 23 346888764 44454 466664 78888
Q ss_pred ccCCcccccCCCceEEEEEEEEeCCCCCCCCCCCceeEEEEEeeecccccccCCCCceEEEEEEEEeecCCCCCcccCCc
Q psy11780 239 KELDRESATSNNGAYILEIKVTEESKTVQPAPTAKTEVTIIITDVTEESKTVQPAPTAKTEVTIIITDVNDETPTFRSPR 318 (383)
Q Consensus 239 ~~lD~E~~~~~~~~y~l~V~a~D~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~v~I~V~DvNDn~P~f~~~~ 318 (383)
.+||.|.++. |+|+|+|.|+|.. | .+....+|..++|.|+|.|+|+++|+|..+.
T Consensus 99 ~~lDCelqke----ytf~iQAydCg~g--p-------------------dgtn~kKShkatvhIrVkDvNe~AP~f~ep~ 153 (952)
T KOG1834|consen 99 EPLDCELQKE----YTFTIQAYDCGNG--P-------------------DGTNTKKSHKATVHIRVKDVNEFAPVFKEPW 153 (952)
T ss_pred Cccccccccc----ceEEEEEEecCCC--C-------------------CccccccccceEEEEEeccccccCchhcccc
Confidence 9999998887 9999999999961 0 0011257888999999999999999999999
Q ss_pred eEEEEeCCCCCCcceEEeecccceEEeCCCC-CCeE-EEEEEeCCCccEEEecccEE
Q psy11780 319 YVAEIHENSVINTPVTFLGGVVPQVFDYDQG-KNGT-FQMFIEGDMGTFEITPQKII 373 (383)
Q Consensus 319 y~~~V~E~~~~g~~v~~v~~~~~~a~D~D~g-~n~~-i~y~l~~~~~~F~I~~~tG~ 373 (383)
|.+.|.|+... ..|+++. |.|.|.+ .+++ +.|.|...+-.|.||.+.-+
T Consensus 154 Yka~V~EGK~y-d~il~ve-----AiD~DCspq~sqIC~YEI~t~d~PFaIdn~G~i 204 (952)
T KOG1834|consen 154 YKAHVTEGKVY-DSILRVE-----AIDKDCSPQYSQICEYEITTPDVPFAIDNDGNI 204 (952)
T ss_pred eeeEEecceee-eeeEEEE-----eecCCCCCcccceeEEEecCCCCceEEcCCCcc
Confidence 99999998654 5677787 9999997 5666 69999988889999977544
No 13
>PF08266 Cadherin_2: Cadherin-like; InterPro: IPR013164 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion. This entry represents a cadherin domain that is usually found at the N terminus of cadherin proteins.; PDB: 1WUZ_A 1WYJ_A.
Probab=97.66 E-value=4.8e-05 Score=57.68 Aligned_cols=74 Identities=24% Similarity=0.387 Sum_probs=45.6
Q ss_pred EEEEcCCCCCCcEEEEEEEEeCCCCCC--ceEEEEEecC-CcccEEEeCCCceEEEcccCCcccccCCCceEEEEEEE
Q psy11780 185 VTRVSEDVPVGTSVLQVKAVDGDRGIN--NKILYSISSG-SQGIFDIDPNTGNIFTLKELDRESATSNNGAYILEIKV 259 (383)
Q Consensus 185 ~~~v~E~~~~gt~v~~v~A~D~D~~~n--~~i~y~i~~~-~~~~F~Id~~tG~i~~~~~lD~E~~~~~~~~y~l~V~a 259 (383)
..+|+|..++|+.|+.| |.|.-.... ....|++.+. ...+|.+++.+|.|++...+|||........+.|.+.+
T Consensus 4 ~YsV~EE~~~Gt~IGni-a~dL~l~~~~l~~~~~ri~s~~~~~~~~v~~~tG~L~v~~rIDRE~LC~~~~~C~l~~ev 80 (84)
T PF08266_consen 4 RYSVPEEMPPGTVIGNI-AKDLGLDPQSLSSRNFRIVSEGNSQYFRVNEKTGDLFVSERIDREELCGQSSPCFLSFEV 80 (84)
T ss_dssp EEEEESS--TT-EEEEC-CCCCT--HHHHCCTTBEEE-SSSS-SEEE-TTTSEEEESS--SCCCC-TTSSS-EEEEEE
T ss_pred EEEeecCCCCCCEEEEh-HHhhCCCcccccccceEEeecCCcceeEecCCceeEEeCCccCHHHHCCCCCCEEEEEEE
Confidence 46899999999999998 445433111 1235665543 48899999999999999999999986655556666654
No 14
>PF08758 Cadherin_pro: Cadherin prodomain like; InterPro: IPR014868 Cadherins are a group of proteins that mediate calcium dependent cell-cell adhesion. They are activated through cleavage of a prosequence in the late Golgi. This protein corresponds to the folded region of the prosequence, and is termed the prodomain. The prodomain shows structural resemblance to the cadherin domain, but lacks all the features known to be important for cadherin-cadherin interactions []. ; GO: 0007155 cell adhesion, 0016021 integral to membrane; PDB: 1OP4_A.
Probab=97.13 E-value=0.004 Score=47.87 Aligned_cols=79 Identities=16% Similarity=0.231 Sum_probs=43.6
Q ss_pred CCCeEEeCCCeeEEEEEecCCCCCCCeEEEEEEEEcCCCCceEEEEEccccceEEEeecceeEEEEEEEeeeCCCCCCcE
Q psy11780 39 RAPRFIIDEQTEIVIRLKEGPDTPAGTFIYRLRGADPDGDVLTFGVQGHGHDVIRIENLNVMLKVVYHLENVDGDEKDLF 118 (383)
Q Consensus 39 n~P~F~~~~~~~~~~~v~E~~~~~~g~~i~~v~A~D~D~~~~~~~~~~~~~~~~~i~~~~~~~~v~y~i~~~~~~~~~~F 118 (383)
..|=|. +..|.+.++. +...|..|++|+=.|-.+.. ++.|.-. + ..|
T Consensus 2 C~pGF~---~~~~~~~Vp~--~l~~g~~lg~V~f~dC~~~~----------------------~~~~~ss----D--pdF 48 (90)
T PF08758_consen 2 CRPGFS---QKKYTFEVPS--NLEAGQPLGKVNFEDCTGRR----------------------RVIFESS----D--PDF 48 (90)
T ss_dssp ---B-----S-EEEE------SS-SS--EEE---B--SS-------------------------EEEE---------SEE
T ss_pred CcCCcc---cceEEEEcCc--hhhCCcEEEEEEeccCCCCC----------------------ceEEecC----C--CCE
Confidence 357788 8899999999 89999999999999887633 5666552 2 399
Q ss_pred EEEeecceEEEEEcccCCCCCCcEEEEEEEEEeCCC
Q psy11780 119 SISTVNGKGVIKLIGKLDYENKFLYQLKILAVDRSN 154 (383)
Q Consensus 119 ~I~~~t~~G~I~~~~~LD~E~~~~~~l~V~a~D~~~ 154 (383)
.|.++ |.|++++++..... .-+|.|.|.|..+
T Consensus 49 ~V~~D---GsVy~~r~v~l~~~-~~~F~V~a~D~~~ 80 (90)
T PF08758_consen 49 RVLED---GSVYAKRPVQLSSE-QRSFTVHAWDSQT 80 (90)
T ss_dssp EEETT---TEEEEES--S-SSS--EEEEEEEEETTT
T ss_pred EEcCC---CeEEEeeeEecCCC-ceEEEEEEECCCC
Confidence 99998 99999999987544 3479999999976
No 15
>PF08758 Cadherin_pro: Cadherin prodomain like; InterPro: IPR014868 Cadherins are a group of proteins that mediate calcium dependent cell-cell adhesion. They are activated through cleavage of a prosequence in the late Golgi. This protein corresponds to the folded region of the prosequence, and is termed the prodomain. The prodomain shows structural resemblance to the cadherin domain, but lacks all the features known to be important for cadherin-cadherin interactions []. ; GO: 0007155 cell adhesion, 0016021 integral to membrane; PDB: 1OP4_A.
Probab=97.09 E-value=0.0065 Score=46.68 Aligned_cols=78 Identities=13% Similarity=0.251 Sum_probs=44.1
Q ss_pred CCeEecCccEEEEcCCCCCCcEEEEEEEEeCCCCCCceEEEEEecCCcccEEEeCCCceEEEcccCCcccccCCCceEEE
Q psy11780 176 PPEFVAVSSVTRVSEDVPVGTSVLQVKAVDGDRGINNKILYSISSGSQGIFDIDPNTGNIFTLKELDRESATSNNGAYIL 255 (383)
Q Consensus 176 ~P~f~~~~~~~~v~E~~~~gt~v~~v~A~D~D~~~n~~i~y~i~~~~~~~F~Id~~tG~i~~~~~lD~E~~~~~~~~y~l 255 (383)
.|-|.+..|.+.|+.+...|+.|++|.-.|-.. +..+.|... +..|.|.+ .|.|++++++.....+ -+|
T Consensus 3 ~pGF~~~~~~~~Vp~~l~~g~~lg~V~f~dC~~--~~~~~~~ss---DpdF~V~~-DGsVy~~r~v~l~~~~-----~~F 71 (90)
T PF08758_consen 3 RPGFSQKKYTFEVPSNLEAGQPLGKVNFEDCTG--RRRVIFESS---DPDFRVLE-DGSVYAKRPVQLSSEQ-----RSF 71 (90)
T ss_dssp --B--S-EEEE----SS-SS--EEE---B--SS-----EEEE------SEEEEET-TTEEEEES--S-SSS------EEE
T ss_pred cCCcccceEEEEcCchhhCCcEEEEEEeccCCC--CCceEEecC---CCCEEEcC-CCeEEEeeeEecCCCc-----eEE
Confidence 588999999999999999999999999987743 467888764 34899976 5999999999886543 789
Q ss_pred EEEEEeCCC
Q psy11780 256 EIKVTEESK 264 (383)
Q Consensus 256 ~V~a~D~~~ 264 (383)
.|.|.|..+
T Consensus 72 ~V~a~D~~~ 80 (90)
T PF08758_consen 72 TVHAWDSQT 80 (90)
T ss_dssp EEEEEETTT
T ss_pred EEEEECCCC
Confidence 999999887
No 16
>PF08266 Cadherin_2: Cadherin-like; InterPro: IPR013164 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion. This entry represents a cadherin domain that is usually found at the N terminus of cadherin proteins.; PDB: 1WUZ_A 1WYJ_A.
Probab=96.99 E-value=0.0004 Score=52.69 Aligned_cols=65 Identities=20% Similarity=0.304 Sum_probs=38.1
Q ss_pred EEEEEecCCCCCCCeEEEEEEEEcCCCCceEEEEEccccceEEEeecceeEEEEEEEeeeCCCCCCcEEEEeecceEEEE
Q psy11780 51 IVIRLKEGPDTPAGTFIYRLRGADPDGDVLTFGVQGHGHDVIRIENLNVMLKVVYHLENVDGDEKDLFSISTVNGKGVIK 130 (383)
Q Consensus 51 ~~~~v~E~~~~~~g~~i~~v~A~D~D~~~~~~~~~~~~~~~~~i~~~~~~~~v~y~i~~~~~~~~~~F~I~~~t~~G~I~ 130 (383)
..++|+| +.++|+.|+.+. .|.-.+. .. .....|+|. ......+|.++..+ |.|.
T Consensus 3 i~YsV~E--E~~~Gt~IGnia-~dL~l~~-----~~-------------l~~~~~ri~--s~~~~~~~~v~~~t--G~L~ 57 (84)
T PF08266_consen 3 IRYSVPE--EMPPGTVIGNIA-KDLGLDP-----QS-------------LSSRNFRIV--SEGNSQYFRVNEKT--GDLF 57 (84)
T ss_dssp EEEEEES--S--TT-EEEECC-CCCT--H-----HH-------------HCCTTBEEE---SSSS-SEEE-TTT--SEEE
T ss_pred eEEEeec--CCCCCCEEEEhH-HhhCCCc-----cc-------------ccccceEEe--ecCCcceeEecCCc--eeEE
Confidence 4567899 999999999994 4433211 00 012245553 33447899999998 9999
Q ss_pred EcccCCCCCC
Q psy11780 131 LIGKLDYENK 140 (383)
Q Consensus 131 ~~~~LD~E~~ 140 (383)
++.++|||+-
T Consensus 58 v~~rIDRE~L 67 (84)
T PF08266_consen 58 VSERIDREEL 67 (84)
T ss_dssp ESS--SCCCC
T ss_pred eCCccCHHHH
Confidence 9999999974
No 17
>smart00736 CADG Dystroglycan-type cadherin-like domains. Cadherin-homologous domains present in metazoan dystroglycans and alpha/epsilon sarcoglycans, yeast Axl2p and in a very large protein from magnetotactic bacteria. Likely to bind calcium ions.
Probab=95.74 E-value=0.16 Score=39.46 Aligned_cols=69 Identities=22% Similarity=0.387 Sum_probs=51.3
Q ss_pred EEeCCCCCCceEEEEEecCC----cccEEEeCCCceEEEcccCCcccccCCCceEEEEEEEEeCCCCCCCCCCCceeEEE
Q psy11780 203 AVDGDRGINNKILYSISSGS----QGIFDIDPNTGNIFTLKELDRESATSNNGAYILEIKVTEESKTVQPAPTAKTEVTI 278 (383)
Q Consensus 203 A~D~D~~~n~~i~y~i~~~~----~~~F~Id~~tG~i~~~~~lD~E~~~~~~~~y~l~V~a~D~~~~~~~~~~~~~~~~~ 278 (383)
..|.| ...++|++.... ..|...|+.++.+.- .+...... .|.+.|.|+|..+
T Consensus 24 F~d~d---~~~lty~~~~~~~~~lP~Wl~fd~~~~~~~G-tP~~~~~g-----~~~i~v~a~D~~g-------------- 80 (97)
T smart00736 24 FTDAD---GDTLTYSATLSDGSALPSWLSFDSDTGTLSG-TPTNSDVG-----SLSLKVTATDSSG-------------- 80 (97)
T ss_pred eECCC---CCeEEEEEEeCCCCCCCCeEEEeCCCCEEEE-ECCCCCCc-----EEEEEEEEEECCC--------------
Confidence 46676 367888886322 679999999888765 34443332 3999999999986
Q ss_pred EEeeecccccccCCCCceEEEEEEEEeecCC
Q psy11780 279 IITDVTEESKTVQPAPTAKTEVTIIITDVND 309 (383)
Q Consensus 279 ~~~~~~~~~~~~~~~~s~~~~v~I~V~DvND 309 (383)
.++...++|.|.+.||
T Consensus 81 ---------------~~~~~~f~i~V~~~~~ 96 (97)
T smart00736 81 ---------------ASASDTFTITVVNTND 96 (97)
T ss_pred ---------------CEEEEEEEEEEeCCCC
Confidence 5778889999999887
No 18
>TIGR01965 VCBS_repeat VCBS repeat. This domain of about 100 residues is found multiple (up to 35) copies in long proteins from several species of Vibrio, Colwellia, Bradyrhizobium, and Shewanella (hence the name VCBS) and in smaller copy numbers in proteins from several other bacteria. The large protein size and repeat copy numbers, species distribution, and suggested activities of several member proteins suggests a role for this domain in adhesion.
Probab=94.98 E-value=0.17 Score=39.44 Aligned_cols=89 Identities=21% Similarity=0.292 Sum_probs=52.5
Q ss_pred EEEEEEeCCCCCCceEEEEEe--cCCcccEEEeCCCceEEEcc-----cCCcccccCCCceEEEEEEEEeCCCCCCCCCC
Q psy11780 199 LQVKAVDGDRGINNKILYSIS--SGSQGIFDIDPNTGNIFTLK-----ELDRESATSNNGAYILEIKVTEESKTVQPAPT 271 (383)
Q Consensus 199 ~~v~A~D~D~~~n~~i~y~i~--~~~~~~F~Id~~tG~i~~~~-----~lD~E~~~~~~~~y~l~V~a~D~~~~~~~~~~ 271 (383)
+++.++|+|.+ ....++.. .+..+.|.|++ .|...-.- ..++-.... ...-.|++.+.|+.
T Consensus 2 G~Lt~sD~D~g--d~~~~s~~~~~g~yGtlti~~-~G~wtYtl~n~~~avq~L~~Ge-~~tdsFtvtv~DGt-------- 69 (99)
T TIGR01965 2 GQLTISDADAG--QAHFIAQTDAAGQYGTFSIDA-DGQWTYQADNSQTAVQALKAGE-TLTDTFTVTSADGT-------- 69 (99)
T ss_pred CceEEeCCCCC--CceEEecccccCCcEEEEECC-CCcEEEEeCCCcHHHHhhcCCC-EEEEEEEEEEeCCC--------
Confidence 46889999976 34555553 33477889987 57654221 111111110 01256666666632
Q ss_pred CceeEEEEEeeecccccccCCCCceEEEEEEEEeecCCCCCcccCCceEEEEeCC
Q psy11780 272 AKTEVTIIITDVTEESKTVQPAPTAKTEVTIIITDVNDETPTFRSPRYVAEIHEN 326 (383)
Q Consensus 272 ~~~~~~~~~~~~~~~~~~~~~~~s~~~~v~I~V~DvNDn~P~f~~~~y~~~V~E~ 326 (383)
+..|+|+|.-.|| +|+.... -.+.+.|+
T Consensus 70 -------------------------t~~vtItI~GtND-apvi~~~-~~g~v~ED 97 (99)
T TIGR01965 70 -------------------------SQTVTITITGAND-AAVIGGA-DTGSVTED 97 (99)
T ss_pred -------------------------eEEEEEEEEccCC-CCEEecc-cceeEecC
Confidence 5789999999999 6655443 24666665
No 19
>smart00736 CADG Dystroglycan-type cadherin-like domains. Cadherin-homologous domains present in metazoan dystroglycans and alpha/epsilon sarcoglycans, yeast Axl2p and in a very large protein from magnetotactic bacteria. Likely to bind calcium ions.
Probab=94.67 E-value=0.79 Score=35.55 Aligned_cols=65 Identities=18% Similarity=0.242 Sum_probs=44.9
Q ss_pred EEEEEEeeeCC-CCCCcEEEEeecceEEEEEcccCCCCCCcEEEEEEEEEeCCCCCCCCeeEEEEEEEEEEcCC
Q psy11780 102 KVVYHLENVDG-DEKDLFSISTVNGKGVIKLIGKLDYENKFLYQLKILAVDRSNNDRRNTGTAAILVKVQDVED 174 (383)
Q Consensus 102 ~v~y~i~~~~~-~~~~~F~I~~~t~~G~I~~~~~LD~E~~~~~~l~V~a~D~~~~~~~~~~~~~v~I~V~DvND 174 (383)
.++|++....+ .-..+...++.+ +.+.= .|...+. ..|.+.|.|+|..+ .+....++|.|.+.||
T Consensus 31 ~lty~~~~~~~~~lP~Wl~fd~~~--~~~~G-tP~~~~~-g~~~i~v~a~D~~g----~~~~~~f~i~V~~~~~ 96 (97)
T smart00736 31 TLTYSATLSDGSALPSWLSFDSDT--GTLSG-TPTNSDV-GSLSLKVTATDSSG----ASASDTFTITVVNTND 96 (97)
T ss_pred eEEEEEEeCCCCCCCCeEEEeCCC--CEEEE-ECCCCCC-cEEEEEEEEEECCC----CEEEEEEEEEEeCCCC
Confidence 55566543322 224588888876 56655 4555443 56999999999875 3567889999999885
No 20
>TIGR01965 VCBS_repeat VCBS repeat. This domain of about 100 residues is found multiple (up to 35) copies in long proteins from several species of Vibrio, Colwellia, Bradyrhizobium, and Shewanella (hence the name VCBS) and in smaller copy numbers in proteins from several other bacteria. The large protein size and repeat copy numbers, species distribution, and suggested activities of several member proteins suggests a role for this domain in adhesion.
Probab=94.57 E-value=0.79 Score=35.77 Aligned_cols=48 Identities=13% Similarity=0.226 Sum_probs=32.7
Q ss_pred CCCCCCcEEEEEEEEEeCCCCCCCCeeEEEEEEEEEEcCCCCCeEecCccEEEEcCCC
Q psy11780 135 LDYENKFLYQLKILAVDRSNNDRRNTGTAAILVKVQDVEDQPPEFVAVSSVTRVSEDV 192 (383)
Q Consensus 135 LD~E~~~~~~l~V~a~D~~~~~~~~~~~~~v~I~V~DvNDn~P~f~~~~~~~~v~E~~ 192 (383)
|..-..-.-.|++.+.|+ .+..|+|.|.-.|| +|+..... ...+.|+.
T Consensus 51 L~~Ge~~tdsFtvtv~DG--------tt~~vtItI~GtND-apvi~~~~-~g~v~ED~ 98 (99)
T TIGR01965 51 LKAGETLTDTFTVTSADG--------TSQTVTITITGAND-AAVIGGAD-TGSVTEDS 98 (99)
T ss_pred hcCCCEEEEEEEEEEeCC--------CeEEEEEEEEccCC-CCEEeccc-ceeEecCC
Confidence 443344456778888885 16889999999997 78765543 45676663
No 21
>PF13750 Big_3_3: Bacterial Ig-like domain (group 3)
Probab=92.39 E-value=6.3 Score=33.61 Aligned_cols=113 Identities=19% Similarity=0.282 Sum_probs=60.2
Q ss_pred cEEEEEE-EEEeCCCCCCCCeeEEEEEEEEEEcCCCCCeEecCccEEEEcCCCCC-CcEEEEEEEEeCCCCCCceEEEEE
Q psy11780 141 FLYQLKI-LAVDRSNNDRRNTGTAAILVKVQDVEDQPPEFVAVSSVTRVSEDVPV-GTSVLQVKAVDGDRGINNKILYSI 218 (383)
Q Consensus 141 ~~~~l~V-~a~D~~~~~~~~~~~~~v~I~V~DvNDn~P~f~~~~~~~~v~E~~~~-gt~v~~v~A~D~D~~~n~~i~y~i 218 (383)
-.|.+.+ .|.|..+. ..+..+...+.- ...+|.+.- .....+..+... |..=..+.++|.-.+. ..-..+|
T Consensus 15 G~Y~l~~~~a~D~agN----~~~~~~~~~~~i-D~T~Ptisi-~~~~~~~~g~~v~~~~~i~i~~tD~~~~~-~i~sv~l 87 (158)
T PF13750_consen 15 GSYTLTVVTATDAAGN----TSTSTVSETFTI-DNTPPTISI-SDGASVANGSTVYGLVNISINVTDNSDDS-KITSVSL 87 (158)
T ss_pred ccEEEEEEEEEecCCC----EEEEEEeeEEEE-cCCCCEEEE-ecCCccCCCccccceeeeEEEEEeCCCCc-eEEEEEE
Confidence 5789999 79999874 333334323332 236888765 111223333222 2233467777776543 4456888
Q ss_pred ecCC-cccEEE--eC-CCceEEEc--ccC-CcccccCCCceEEEEEEEEeCCC
Q psy11780 219 SSGS-QGIFDI--DP-NTGNIFTL--KEL-DRESATSNNGAYILEIKVTEESK 264 (383)
Q Consensus 219 ~~~~-~~~F~I--d~-~tG~i~~~--~~l-D~E~~~~~~~~y~l~V~a~D~~~ 264 (383)
.++. ...-.+ .. ..|...+. +.+ ..|... .|+|+|.|+|..+
T Consensus 88 ~Gg~~~d~v~ls~~~~~~~~~~~~yp~~fpsle~~~----~YtLtV~a~D~aG 136 (158)
T PF13750_consen 88 TGGPASDSVSLSWTNKGNGVYTLEYPRIFPSLEADD----SYTLTVSATDKAG 136 (158)
T ss_pred ECCcccceEEEeeEeccCceEEeecccccCCcCCCC----eEEEEEEEEecCC
Confidence 7654 222222 11 23333332 111 123333 4999999999988
No 22
>TIGR00845 caca sodium/calcium exchanger 1. This model is specific for the eukaryotic sodium ion/calcium ion exchangers of the Caca family
Probab=86.88 E-value=53 Score=36.05 Aligned_cols=137 Identities=15% Similarity=0.173 Sum_probs=73.7
Q ss_pred cCCCCCeEecCccEEEEcCCCCCCcEEEEEEEEeCCCCCCceEEEEEecCC---cccEEEeCCCceEEEcccCCcccccC
Q psy11780 172 VEDQPPEFVAVSSVTRVSEDVPVGTSVLQVKAVDGDRGINNKILYSISSGS---QGIFDIDPNTGNIFTLKELDRESATS 248 (383)
Q Consensus 172 vNDn~P~f~~~~~~~~v~E~~~~gt~v~~v~A~D~D~~~n~~i~y~i~~~~---~~~F~Id~~tG~i~~~~~lD~E~~~~ 248 (383)
.||..+.|.-..-..++.|+ .|+.-..|.-...|.+..-.+.|+...+. ..-|. +.+|.|.-.. -|.
T Consensus 395 ~dd~~s~i~Fe~~~Y~V~En--~GtV~VtV~R~GGdl~~tVsVdY~T~DGTA~AG~DY~--~~sGTLtF~P---GEt--- 464 (928)
T TIGR00845 395 ENDPVSKIFFEPGHYTCLEN--CGTVALTVVRRGGDLTNTVYVDYRTEDGTANAGSDYE--FTEGTLVFKP---GET--- 464 (928)
T ss_pred ccCCcceEEecCCeEEEeec--CcEEEEEEEEccCCCCceEEEEEEccCCccCCCCCcc--ccCceEEECC---Cce---
Confidence 45555555555555678998 47766666544435544467888887553 11222 3456553221 111
Q ss_pred CCceEEEEEEEEeCCCCCCCCCCCceeEEEEEeeecccc--cc-------cCCCCceEEEEEEEEeecCCCCCcccCCce
Q psy11780 249 NNGAYILEIKVTEESKTVQPAPTAKTEVTIIITDVTEES--KT-------VQPAPTAKTEVTIIITDVNDETPTFRSPRY 319 (383)
Q Consensus 249 ~~~~y~l~V~a~D~~~~~~~~~~~~~~~~~~~~~~~~~~--~~-------~~~~~s~~~~v~I~V~DvNDn~P~f~~~~y 319 (383)
.-.++|...|... ......+.+++....... +. ....+......+|+|.| ||++|.|....-
T Consensus 465 ---~KtItV~IIDDdi-----~E~DE~F~V~LSNp~~g~~~G~~~~~~~~~~A~Lg~ps~ATVTIlD-DD~aGIfsFe~~ 535 (928)
T TIGR00845 465 ---QKEFRIGIIDDDI-----FEEDEHFYVRLSNLRVGSEDGILEANHVSAVAQLASPNTATVTILD-DDHAGIFTFEED 535 (928)
T ss_pred ---EEEEEEEEccCCC-----CCCCceEEEEEeCCCCCCcccccccccccccceecCCceEEEEEec-CcccCcccccCc
Confidence 1355555555543 223445666665543210 00 01112234466777787 789998776555
Q ss_pred EEEEeCCC
Q psy11780 320 VAEIHENS 327 (383)
Q Consensus 320 ~~~V~E~~ 327 (383)
...|.|+.
T Consensus 536 ~~sV~Es~ 543 (928)
T TIGR00845 536 VFHVSESI 543 (928)
T ss_pred eEEEEcCC
Confidence 67788874
No 23
>TIGR03660 T1SS_rpt_143 T1SS-143 repeat domain. This model represents a domain of about 143 amino acids that may occur singly or in up to 23 tandem repeats in very large proteins in the genus Vibrio, and in related species such as Legionella pneumophila, Photobacterium profundum, Rhodopseudomonas palustris, Shewanella pealeana, and Aeromonas hydrophila. Proteins with these domains represent a subset of a broader set of proteins with a particular signal for type 1 secretion, consisting of several glycine-rich repeats modeled by pfam00353, followed by a C-terminal domain modeled by TIGR03661. Proteins with this domain tend to share several properties with the RtxA (Repeats in Toxin) protein of Vibrio cholerae, including a large size often containing tandemly repeated domains and a C-terminal signal for type 1 secretion.
Probab=86.13 E-value=9.5 Score=31.72 Aligned_cols=59 Identities=29% Similarity=0.416 Sum_probs=42.3
Q ss_pred ceEEEEEcccCCCCC---CcEEEEEEEEEeCCCCCCCCeeEEEEEEEEEEcCCCCCeEecCccEEEEcCC
Q psy11780 125 GKGVIKLIGKLDYEN---KFLYQLKILAVDRSNNDRRNTGTAAILVKVQDVEDQPPEFVAVSSVTRVSED 191 (383)
Q Consensus 125 ~~G~I~~~~~LD~E~---~~~~~l~V~a~D~~~~~~~~~~~~~v~I~V~DvNDn~P~f~~~~~~~~v~E~ 191 (383)
|+....+.++||+.. .-...|.|.|+|..+. .++..+.|.|.| | .|...... ...|.|+
T Consensus 66 GsYtftL~~~lDH~~g~d~l~l~~~v~a~D~DGD----~s~~~l~VtI~D--D-~P~~~~~~-~~~V~E~ 127 (137)
T TIGR03660 66 GSYEFTLEGPLDHAAGSDELTLNFPIIATDFDGD----TSSITLPVTIVD--D-VPTITDVD-ALTVDED 127 (137)
T ss_pred ccEEEEEcccccCCCCCceEEEeeeEEEEeCCCC----ccccEEEEEEEC--C-CCeecccc-ceEEecc
Confidence 457788899999843 3567889999998874 224588888888 4 57765544 3678775
No 24
>PF07495 Y_Y_Y: Y_Y_Y domain; InterPro: IPR011123 This region is mostly found at the end of the beta propellers (IPR011110 from INTERPRO) in a family of two component regulators. However they are also found tandemly repeated in Q891H4 from SWISSPROT without other signal conduction domains being present. It is named after the conserved tyrosines found in the alignment. The exact function is not known.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=84.87 E-value=5.8 Score=27.88 Aligned_cols=46 Identities=13% Similarity=0.304 Sum_probs=27.8
Q ss_pred CceEEEEEecCCcccEEEeCCCceEEEcccCCcccccCCCceEEEEEEEEeCCC
Q psy11780 211 NNKILYSISSGSQGIFDIDPNTGNIFTLKELDRESATSNNGAYILEIKVTEESK 264 (383)
Q Consensus 211 n~~i~y~i~~~~~~~F~Id~~tG~i~~~~~lD~E~~~~~~~~y~l~V~a~D~~~ 264 (383)
+-..+|+|.+-+..+..+...+-. +.+...+ +|.|.|.|+|.|..+
T Consensus 7 ~~~Y~Y~l~g~d~~W~~~~~~~~~------~~~~~L~--~G~Y~l~V~a~~~~~ 52 (66)
T PF07495_consen 7 NIRYRYRLEGFDDEWITLGSYSNS------ISYTNLP--PGKYTLEVRAKDNNG 52 (66)
T ss_dssp TEEEEEEEETTESSEEEESSTS-E------EEEES----SEEEEEEEEEEETTS
T ss_pred ceEEEEEEECCCCeEEECCCCcEE------EEEEeCC--CEEEEEEEEEECCCC
Confidence 445677787655667666443222 2222222 256999999999887
No 25
>PF05345 He_PIG: Putative Ig domain; InterPro: IPR008009 This alignment represents the conserved core region of a ~90 residue repeat found in several haemagglutinins and other cell surface proteins. Sequence similarities to Hyalin (IPR003410 from INTERPRO) and the PKD domain (IPR000601 from INTERPRO) suggest an Ig-like fold so this family may be similar in function to the (IPR003791 from INTERPRO) and (IPR003790 from INTERPRO) protein families.
Probab=76.16 E-value=12 Score=25.05 Aligned_cols=37 Identities=24% Similarity=0.391 Sum_probs=26.0
Q ss_pred cccEEEeCCCceEEEcccCCcccccCCCceEEEEEEEEeCCC
Q psy11780 223 QGIFDIDPNTGNIFTLKELDRESATSNNGAYILEIKVTEESK 264 (383)
Q Consensus 223 ~~~F~Id~~tG~i~~~~~lD~E~~~~~~~~y~l~V~a~D~~~ 264 (383)
..+..+|+.+|.|.-. .+.... .+.|.+.|.|+|..+
T Consensus 13 P~gLs~d~~tG~isGt--p~~~~~---~G~y~~~vtatd~~G 49 (49)
T PF05345_consen 13 PSGLSLDPSTGTISGT--PTSSVQ---PGTYTFTVTATDGSG 49 (49)
T ss_pred CCcEEEeCCCCEEEee--cCCCcc---ccEEEEEEEEEcCCC
Confidence 6788999999998754 222211 123999999999753
No 26
>KOG3597|consensus
Probab=71.08 E-value=1.2e+02 Score=30.47 Aligned_cols=60 Identities=18% Similarity=0.171 Sum_probs=44.5
Q ss_pred EEEEEEEEEEcCCCCCeEecCccEEEEcCCCCCCcEEEEEEEEeCCCCCCceEEEEEecCC
Q psy11780 162 TAAILVKVQDVEDQPPEFVAVSSVTRVSEDVPVGTSVLQVKAVDGDRGINNKILYSISSGS 222 (383)
Q Consensus 162 ~~~v~I~V~DvNDn~P~f~~~~~~~~v~E~~~~gt~v~~v~A~D~D~~~n~~i~y~i~~~~ 222 (383)
+....|.|.-+||.+..+....+.+-+.|+...-.....+.+.|+|.. ...+.|.+.+..
T Consensus 25 ~~~~~i~v~pvndpp~~~~~~~~~l~~~~~~~k~l~~~~l~~~d~d~~-~~~l~f~v~~t~ 84 (442)
T KOG3597|consen 25 TDVLRIHVNPVNDPPSLIFPSGSLLVILEGGQKVLDPELLTAADPDSA-PLPLEFQVLGTS 84 (442)
T ss_pred EeeecccccccCCCcceeecccceEEeecCCceeccceEeeccCCCCC-ccceEEEEccCC
Confidence 566788999999988777777777888888655444567888999875 356788887643
No 27
>KOG4221|consensus
Probab=68.79 E-value=2.1e+02 Score=32.45 Aligned_cols=66 Identities=18% Similarity=0.314 Sum_probs=42.9
Q ss_pred EEEeeeCCCCCCcEEEEeecceEEEEEcccCCCCCCcEEEEEEEEEeCCCCCCCCeeEEEEE-EEEEEcCCCCCe
Q psy11780 105 YHLENVDGDEKDLFSISTVNGKGVIKLIGKLDYENKFLYQLKILAVDRSNNDRRNTGTAAIL-VKVQDVEDQPPE 178 (383)
Q Consensus 105 y~i~~~~~~~~~~F~I~~~t~~G~I~~~~~LD~E~~~~~~l~V~a~D~~~~~~~~~~~~~v~-I~V~DvNDn~P~ 178 (383)
|++--..++....+.++.++ -+.++. +.|....|.+.|.|....+.+. + ++.|+ ++..|+-+-||.
T Consensus 554 yk~~ys~~~~~~~~~~~~n~--~e~ti~---gL~k~TeY~~~vvA~N~~G~g~--s-S~~i~V~Tlsd~PsaPP~ 620 (1381)
T KOG4221|consen 554 YKLFYSEDDTGKELRVENNA--TEYTIN---GLEKYTEYSIRVVAYNSAGSGV--S-SADITVRTLSDVPSAPPQ 620 (1381)
T ss_pred EEEEEEcCCCCceEEEecCc--cEEEee---cCCCccceEEEEEEecCCCCCC--C-CCceEEEeccCCCCCCCc
Confidence 44433334567789998876 334444 5677789999999999988642 2 33343 445677776665
No 28
>PF05895 DUF859: Siphovirus protein of unknown function (DUF859); InterPro: IPR008577 This entry is represented by Streptococcus phage 7201, Orf39. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This family consists of several uncharacterised proteins from a number of the Siphoviruses as well as some bacterial proteins from Streptococcus species. Some of the members of this family are described as putative minor structural proteins.
Probab=68.35 E-value=1.3e+02 Score=31.76 Aligned_cols=109 Identities=13% Similarity=0.130 Sum_probs=63.6
Q ss_pred cEEEEEEEEEeCCCCCCCCeeEEEEEEEEEEcCCCCCeEecCccEEEEcCCCCCCcEEEEEEEEeCCC------CCCceE
Q psy11780 141 FLYQLKILAVDRSNNDRRNTGTAAILVKVQDVEDQPPEFVAVSSVTRVSEDVPVGTSVLQVKAVDGDR------GINNKI 214 (383)
Q Consensus 141 ~~~~l~V~a~D~~~~~~~~~~~~~v~I~V~DvNDn~P~f~~~~~~~~v~E~~~~gt~v~~v~A~D~D~------~~n~~i 214 (383)
-...+++.++|..+ ..+.+...+|+|++-. +|.+.-..++. .++ ..+......|.=... .....+
T Consensus 298 G~~Ti~atVtDSRG---r~S~~~~~tItVl~Y~--~P~lsfsv~R~--~~~--~~~~~v~~~a~Iapl~v~g~qKN~~~l 368 (624)
T PF05895_consen 298 GSATIRATVTDSRG---RTSDPKTKTITVLEYS--PPTLSFSVYRC--GSS--GNTLTVTRNAKIAPLTVNGVQKNTMTL 368 (624)
T ss_pred ceEEEEEEEEECCC---ccCCceEEEEEEEEcC--CCcEEEEEEEe--CCC--CcEEEEEEEEEEeEEEEcccccceEEE
Confidence 56789999999987 3456788999999985 89885332222 211 112122222221111 112356
Q ss_pred EEEEecCCcccEEEeCC--Cce-----------EEEcccCCcccccCCCceEEEEEEEEeCCC
Q psy11780 215 LYSISSGSQGIFDIDPN--TGN-----------IFTLKELDRESATSNNGAYILEIKVTEESK 264 (383)
Q Consensus 215 ~y~i~~~~~~~F~Id~~--tG~-----------i~~~~~lD~E~~~~~~~~y~l~V~a~D~~~ 264 (383)
.|+...-....|.+|.. .|. ..+...+|-++. |.+++.++|.=.
T Consensus 369 t~~~a~~gt~~~t~d~~~a~~~~s~~s~~~~~~~~L~g~y~~~kS------y~V~~~l~D~F~ 425 (624)
T PF05895_consen 369 TFKVAPLGTGTFTTDNGSASGTWSSISELTNSSANLGGTYDAEKS------YDVRGTLSDKFT 425 (624)
T ss_pred EEEEEEcCcceEEEEccccccceeeeeeecccceeeccccCCCce------EEEEEEEEEEee
Confidence 67766555667776632 111 223445566654 999999999865
No 29
>TIGR03660 T1SS_rpt_143 T1SS-143 repeat domain. This model represents a domain of about 143 amino acids that may occur singly or in up to 23 tandem repeats in very large proteins in the genus Vibrio, and in related species such as Legionella pneumophila, Photobacterium profundum, Rhodopseudomonas palustris, Shewanella pealeana, and Aeromonas hydrophila. Proteins with these domains represent a subset of a broader set of proteins with a particular signal for type 1 secretion, consisting of several glycine-rich repeats modeled by pfam00353, followed by a C-terminal domain modeled by TIGR03661. Proteins with this domain tend to share several properties with the RtxA (Repeats in Toxin) protein of Vibrio cholerae, including a large size often containing tandemly repeated domains and a C-terminal signal for type 1 secretion.
Probab=62.90 E-value=88 Score=25.97 Aligned_cols=58 Identities=21% Similarity=0.254 Sum_probs=36.3
Q ss_pred EEcccCCcccccCCCceEEEEEEEEeCCCCCCCCCCCceeEEEEEeeecccccccCCCCceEEEEEEEEeecCCCCCccc
Q psy11780 236 FTLKELDRESATSNNGAYILEIKVTEESKTVQPAPTAKTEVTIIITDVTEESKTVQPAPTAKTEVTIIITDVNDETPTFR 315 (383)
Q Consensus 236 ~~~~~lD~E~~~~~~~~y~l~V~a~D~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~v~I~V~DvNDn~P~f~ 315 (383)
.+.++||.... ...-...|.|.|+|..+ ......+.|+|.| | .|...
T Consensus 71 tL~~~lDH~~g-~d~l~l~~~v~a~D~DG-----------------------------D~s~~~l~VtI~D--D-~P~~~ 117 (137)
T TIGR03660 71 TLEGPLDHAAG-SDELTLNFPIIATDFDG-----------------------------DTSSITLPVTIVD--D-VPTIT 117 (137)
T ss_pred EEcccccCCCC-CceEEEeeeEEEEeCCC-----------------------------CccccEEEEEEEC--C-CCeec
Confidence 45566666432 11224788899999887 2334588888887 5 47765
Q ss_pred CCceEEEEeCCC
Q psy11780 316 SPRYVAEIHENS 327 (383)
Q Consensus 316 ~~~y~~~V~E~~ 327 (383)
... ...|.|..
T Consensus 118 ~~~-~~~V~E~~ 128 (137)
T TIGR03660 118 DVD-ALTVDEDD 128 (137)
T ss_pred ccc-ceEEeccc
Confidence 543 36777753
No 30
>PF13750 Big_3_3: Bacterial Ig-like domain (group 3)
Probab=51.83 E-value=1.5e+02 Score=25.17 Aligned_cols=30 Identities=27% Similarity=0.321 Sum_probs=22.1
Q ss_pred CCCCcEEEEEEEEEeCCCCCCCCeeEEEEEEEEE
Q psy11780 137 YENKFLYQLKILAVDRSNNDRRNTGTAAILVKVQ 170 (383)
Q Consensus 137 ~E~~~~~~l~V~a~D~~~~~~~~~~~~~v~I~V~ 170 (383)
.|....|.|+|.|.|..+. .++..+.....
T Consensus 119 le~~~~YtLtV~a~D~aGN----~~~~si~F~y~ 148 (158)
T PF13750_consen 119 LEADDSYTLTVSATDKAGN----QSTKSISFSYM 148 (158)
T ss_pred cCCCCeEEEEEEEEecCCC----EEEEEEEEEEe
Confidence 3778999999999999874 44555555443
No 31
>PF12245 Big_3_2: Bacterial Ig-like domain (group 3); InterPro: IPR022038 This family of proteins is found in bacteria. They have two conserved sequence motifs: AGN and GMT.
Probab=49.19 E-value=57 Score=22.70 Aligned_cols=13 Identities=23% Similarity=0.534 Sum_probs=12.0
Q ss_pred eEEEEEEEEeCCC
Q psy11780 252 AYILEIKVTEESK 264 (383)
Q Consensus 252 ~y~l~V~a~D~~~ 264 (383)
.|.+.+.|+|..+
T Consensus 24 ~yt~~v~a~D~AG 36 (60)
T PF12245_consen 24 EYTLTVTATDKAG 36 (60)
T ss_pred cEEEEEEEEECCC
Confidence 4999999999998
No 32
>TIGR00845 caca sodium/calcium exchanger 1. This model is specific for the eukaryotic sodium ion/calcium ion exchangers of the Caca family
Probab=48.29 E-value=4.2e+02 Score=29.37 Aligned_cols=56 Identities=21% Similarity=0.249 Sum_probs=31.7
Q ss_pred EEEEEEEEEEcCCCCCeEecCccEEEEcCCCCCCcEEEEEEEEeCCCCCCceEEEEEecC
Q psy11780 162 TAAILVKVQDVEDQPPEFVAVSSVTRVSEDVPVGTSVLQVKAVDGDRGINNKILYSISSG 221 (383)
Q Consensus 162 ~~~v~I~V~DvNDn~P~f~~~~~~~~v~E~~~~gt~v~~v~A~D~D~~~n~~i~y~i~~~ 221 (383)
....+|+|.| ||++|.|.-..-...|.|+. |+.-.+|.-+ .+....-.+.|....+
T Consensus 514 ps~ATVTIlD-DD~aGIfsFe~~~~sV~Es~--G~vtvtV~Rt-sGa~G~VtV~Y~T~dG 569 (928)
T TIGR00845 514 PNTATVTILD-DDHAGIFTFEEDVFHVSESI--GIMEVKVLRT-SGARGTVIVPYRTVEG 569 (928)
T ss_pred CceEEEEEec-CcccCcccccCceEEEEcCC--CEEEEEEEEc-CCCCeeEEEEEEeecC
Confidence 3456677777 67899877665567888984 5543333222 1111123456766544
No 33
>PF07495 Y_Y_Y: Y_Y_Y domain; InterPro: IPR011123 This region is mostly found at the end of the beta propellers (IPR011110 from INTERPRO) in a family of two component regulators. However they are also found tandemly repeated in Q891H4 from SWISSPROT without other signal conduction domains being present. It is named after the conserved tyrosines found in the alignment. The exact function is not known.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=40.15 E-value=61 Score=22.43 Aligned_cols=30 Identities=13% Similarity=0.179 Sum_probs=18.7
Q ss_pred CCcEEEEEEEEEeCCCCCCCCeeEEEEEEEEE
Q psy11780 139 NKFLYQLKILAVDRSNNDRRNTGTAAILVKVQ 170 (383)
Q Consensus 139 ~~~~~~l~V~a~D~~~~~~~~~~~~~v~I~V~ 170 (383)
..-.|.|.|+|.|..+... .....+.|.|+
T Consensus 37 ~~G~Y~l~V~a~~~~~~~~--~~~~~l~i~I~ 66 (66)
T PF07495_consen 37 PPGKYTLEVRAKDNNGKWS--SDEKSLTITIL 66 (66)
T ss_dssp -SEEEEEEEEEEETTS-B---SS-EEEEEEEE
T ss_pred CCEEEEEEEEEECCCCCcC--cccEEEEEEEC
Confidence 4578999999999877421 22255666553
No 34
>PF02494 HYR: HYR domain; InterPro: IPR003410 This domain is known as the HYR (Hyalin Repeat) domain, after the protein hyalin that is composed exclusively of this repeat. This domain probably corresponds to a new superfamily in the immunoglobulin fold. The function of this domain is uncertain it may be involved in cell adhesion. In the Sushi repeat-containing protein (SrpX), this domain is found between two sushi repeats.
Probab=35.08 E-value=73 Score=23.35 Aligned_cols=13 Identities=8% Similarity=0.118 Sum_probs=11.5
Q ss_pred eEEEEEEEEeCCC
Q psy11780 252 AYILEIKVTEESK 264 (383)
Q Consensus 252 ~y~l~V~a~D~~~ 264 (383)
.|.++..|+|..+
T Consensus 58 ~t~V~ytA~D~~G 70 (81)
T PF02494_consen 58 TTTVTYTATDAAG 70 (81)
T ss_pred eEEEEEEEEECCC
Confidence 3999999999877
No 35
>cd00146 PKD polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.
Probab=33.09 E-value=89 Score=22.55 Aligned_cols=14 Identities=29% Similarity=0.565 Sum_probs=11.8
Q ss_pred ceEEEEEEEEeCCC
Q psy11780 251 GAYILEIKVTEESK 264 (383)
Q Consensus 251 ~~y~l~V~a~D~~~ 264 (383)
+.|.+++.|+|..+
T Consensus 57 G~y~v~l~v~d~~g 70 (81)
T cd00146 57 GTYTVTLTVTNAVG 70 (81)
T ss_pred cEEEEEEEEEeCCC
Confidence 45999999999865
No 36
>smart00089 PKD Repeats in polycystic kidney disease 1 (PKD1) and other proteins. Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.
Probab=29.67 E-value=1.3e+02 Score=21.58 Aligned_cols=14 Identities=29% Similarity=0.565 Sum_probs=11.6
Q ss_pred ceEEEEEEEEeCCC
Q psy11780 251 GAYILEIKVTEESK 264 (383)
Q Consensus 251 ~~y~l~V~a~D~~~ 264 (383)
+.|.+++.+.|..+
T Consensus 55 G~y~v~l~v~n~~g 68 (79)
T smart00089 55 GTYTVTLTVTNAVG 68 (79)
T ss_pred cEEEEEEEEEcCCC
Confidence 45999999998765
No 37
>PF13754 Big_3_4: Bacterial Ig-like domain (group 3)
Probab=28.84 E-value=1.7e+02 Score=19.67 Aligned_cols=13 Identities=15% Similarity=0.488 Sum_probs=11.9
Q ss_pred eEEEEEEEEeCCC
Q psy11780 252 AYILEIKVTEESK 264 (383)
Q Consensus 252 ~y~l~V~a~D~~~ 264 (383)
.|.+++.|+|..+
T Consensus 25 ~y~itv~a~D~AG 37 (54)
T PF13754_consen 25 TYTITVTATDAAG 37 (54)
T ss_pred cEEEEEEEEeCCC
Confidence 3999999999988
No 38
>cd02848 Chitinase_N_term Chitinase N-terminus domain. Chitinases hydrolyze the abundant natural biopolymer chitin, producing smaller chito-oligosaccharides. Chitin consists of multiple N-acetyl-D-glucosamine (NAG) residues connected via beta-1,4-glycosidic linkages and is an important structural element of fungal cell wall and arthropod exoskeletons. On the basis of the mode of chitin hydrolysis, chitinases are classified as random, endo-, and exo-chitinases and based on sequence criteria, chitinases belong to families 18 and 19 of glycosyl hydrolases. The N-terminus of chitinase may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitob
Probab=27.78 E-value=77 Score=25.05 Aligned_cols=15 Identities=13% Similarity=0.452 Sum_probs=12.5
Q ss_pred CceEEEEEEEEeCCC
Q psy11780 250 NGAYILEIKVTEESK 264 (383)
Q Consensus 250 ~~~y~l~V~a~D~~~ 264 (383)
++.|.++|+++|..+
T Consensus 79 gG~y~m~V~lCn~dG 93 (106)
T cd02848 79 GGRYQMQVALCNGDG 93 (106)
T ss_pred CCeEEEEEEEECCCC
Confidence 345999999999876
No 39
>PF12245 Big_3_2: Bacterial Ig-like domain (group 3); InterPro: IPR022038 This family of proteins is found in bacteria. They have two conserved sequence motifs: AGN and GMT.
Probab=26.52 E-value=2.2e+02 Score=19.64 Aligned_cols=33 Identities=24% Similarity=0.391 Sum_probs=22.2
Q ss_pred CcEEEEEEEEEeCCCCCCCCeeEEEEEEEEEEcCCCCCe
Q psy11780 140 KFLYQLKILAVDRSNNDRRNTGTAAILVKVQDVEDQPPE 178 (383)
Q Consensus 140 ~~~~~l~V~a~D~~~~~~~~~~~~~v~I~V~DvNDn~P~ 178 (383)
...|.|.++|.|..|. .+.......+.|. ..|.
T Consensus 22 dg~yt~~v~a~D~AGN----~~~~~~~~~i~d~--~~p~ 54 (60)
T PF12245_consen 22 DGEYTLTVTATDKAGN----TSSSTTQIVIVDN--TAPA 54 (60)
T ss_pred CccEEEEEEEEECCCC----EEEeeeEEEEEcC--CCCC
Confidence 4579999999999884 3444555555554 3454
No 40
>PF08329 ChitinaseA_N: Chitinase A, N-terminal domain; InterPro: IPR013540 This domain is found in a number of bacterial chitinases and similar viral proteins. It is organised into a fibronectin III module domain-like fold, comprising only beta strands. Its function is not known, but it may be involved in interaction with the enzyme substrate, chitin [, ]. It is separated by a hinge region from the catalytic domain (IPR001223 from INTERPRO); this hinge region is probably mobile, allowing the N-terminal domain to have different relative positions in solution []. ; GO: 0004568 chitinase activity; PDB: 2WLY_A 1EDQ_A 2WM0_A 1X6N_A 1NH6_A 2WK2_A 1EHN_A 2WLZ_A 1EIB_A 1FFR_A ....
Probab=25.89 E-value=91 Score=25.75 Aligned_cols=31 Identities=13% Similarity=0.461 Sum_probs=20.5
Q ss_pred CCceEEEEEEEEeCCCCCCCCCCCceeEEEEEeeecccccccCCCCceEEEEEEEEeecC
Q psy11780 249 NNGAYILEIKVTEESKTVQPAPTAKTEVTIIITDVTEESKTVQPAPTAKTEVTIIITDVN 308 (383)
Q Consensus 249 ~~~~y~l~V~a~D~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~v~I~V~DvN 308 (383)
++++|+++|++++..+ -+.+..+.|.|-|-.
T Consensus 81 ~gG~y~~~VeLCN~~G-----------------------------CS~S~~~~V~VaDTD 111 (133)
T PF08329_consen 81 KGGRYQMQVELCNADG-----------------------------CSTSAPVEVVVADTD 111 (133)
T ss_dssp S-EEEEEEEEEEETTE-----------------------------EEE---EEEEEE-TT
T ss_pred CCCEEEEEEEEECCCC-----------------------------cccCCCEEEEEeCCC
Confidence 3466999999999887 355567888888874
No 41
>cd00146 PKD polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.
Probab=21.56 E-value=2.3e+02 Score=20.31 Aligned_cols=20 Identities=10% Similarity=0.198 Sum_probs=16.5
Q ss_pred CCCCCCcEEEEEEEEEeCCC
Q psy11780 135 LDYENKFLYQLKILAVDRSN 154 (383)
Q Consensus 135 LD~E~~~~~~l~V~a~D~~~ 154 (383)
..|...-.|.+++.++|..+
T Consensus 51 ~~y~~~G~y~v~l~v~d~~g 70 (81)
T cd00146 51 HTYTKPGTYTVTLTVTNAVG 70 (81)
T ss_pred EEcCCCcEEEEEEEEEeCCC
Confidence 34777889999999999864
No 42
>cd02848 Chitinase_N_term Chitinase N-terminus domain. Chitinases hydrolyze the abundant natural biopolymer chitin, producing smaller chito-oligosaccharides. Chitin consists of multiple N-acetyl-D-glucosamine (NAG) residues connected via beta-1,4-glycosidic linkages and is an important structural element of fungal cell wall and arthropod exoskeletons. On the basis of the mode of chitin hydrolysis, chitinases are classified as random, endo-, and exo-chitinases and based on sequence criteria, chitinases belong to families 18 and 19 of glycosyl hydrolases. The N-terminus of chitinase may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitob
Probab=21.14 E-value=1e+02 Score=24.40 Aligned_cols=33 Identities=15% Similarity=0.156 Sum_probs=22.7
Q ss_pred cCCCCCCcEEEEEEEEEeCCCCCCCCeeEEEEEEEEE
Q psy11780 134 KLDYENKFLYQLKILAVDRSNNDRRNTGTAAILVKVQ 170 (383)
Q Consensus 134 ~LD~E~~~~~~l~V~a~D~~~~~~~~~~~~~v~I~V~ 170 (383)
.+++.+.-.|.++|+++|..+- +.++.+.|.|-
T Consensus 73 t~~v~kgG~y~m~V~lCn~dGC----S~S~~~~I~VA 105 (106)
T cd02848 73 TFKVGKGGRYQMQVALCNGDGC----STSAAKEIVVA 105 (106)
T ss_pred EEEeCCCCeEEEEEEEECCCCc----cCcCCEEEEec
Confidence 3556777899999999999763 33344455443
No 43
>PF08329 ChitinaseA_N: Chitinase A, N-terminal domain; InterPro: IPR013540 This domain is found in a number of bacterial chitinases and similar viral proteins. It is organised into a fibronectin III module domain-like fold, comprising only beta strands. Its function is not known, but it may be involved in interaction with the enzyme substrate, chitin [, ]. It is separated by a hinge region from the catalytic domain (IPR001223 from INTERPRO); this hinge region is probably mobile, allowing the N-terminal domain to have different relative positions in solution []. ; GO: 0004568 chitinase activity; PDB: 2WLY_A 1EDQ_A 2WM0_A 1X6N_A 1NH6_A 2WK2_A 1EHN_A 2WLZ_A 1EIB_A 1FFR_A ....
Probab=21.06 E-value=3.4e+02 Score=22.40 Aligned_cols=36 Identities=17% Similarity=0.233 Sum_probs=23.7
Q ss_pred cCCCCCCcEEEEEEEEEeCCCCCCCCeeEEEEEEEEEEcC
Q psy11780 134 KLDYENKFLYQLKILAVDRSNNDRRNTGTAAILVKVQDVE 173 (383)
Q Consensus 134 ~LD~E~~~~~~l~V~a~D~~~~~~~~~~~~~v~I~V~DvN 173 (383)
.+.+...-.|+++|+++|..+ .+.+..+.|.|.|-.
T Consensus 76 ~~~~~~gG~y~~~VeLCN~~G----CS~S~~~~V~VaDTD 111 (133)
T PF08329_consen 76 TFTVTKGGRYQMQVELCNADG----CSTSAPVEVVVADTD 111 (133)
T ss_dssp EEEE-S-EEEEEEEEEEETTE----EEE---EEEEEE-TT
T ss_pred EEEecCCCEEEEEEEEECCCC----cccCCCEEEEEeCCC
Confidence 455666789999999999986 355668888888864
Done!