Query psy669
Match_columns 248
No_of_seqs 268 out of 1705
Neff 9.3
Searched_HMMs 46136
Date Sat Aug 17 00:33:29 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy669.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/669hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG4289|consensus 100.0 8.3E-45 1.8E-49 334.9 21.6 229 2-235 221-480 (2531)
2 KOG4289|consensus 100.0 8.8E-42 1.9E-46 315.1 22.5 233 2-239 326-586 (2531)
3 KOG1219|consensus 100.0 3.2E-38 6.9E-43 299.6 25.2 231 2-236 903-1166(4289)
4 KOG1219|consensus 100.0 5.3E-37 1.1E-41 291.4 22.6 225 2-239 2634-2889(4289)
5 cd00031 CA Cadherin repeat dom 100.0 1.4E-26 3E-31 184.9 22.8 131 54-186 1-131 (199)
6 cd00031 CA Cadherin repeat dom 99.9 7.9E-26 1.7E-30 180.5 21.3 147 1-149 53-199 (199)
7 PF00028 Cadherin: Cadherin do 99.8 3.2E-19 7E-24 125.2 13.5 92 55-148 1-93 (93)
8 smart00112 CA Cadherin repeats 99.7 3E-17 6.5E-22 111.6 11.1 79 75-155 1-79 (79)
9 KOG1834|consensus 99.7 6.1E-17 1.3E-21 142.7 15.5 143 2-150 95-245 (952)
10 KOG1834|consensus 99.6 9.6E-14 2.1E-18 122.8 16.9 180 39-223 22-242 (952)
11 smart00112 CA Cadherin repeats 99.2 9.8E-11 2.1E-15 79.3 6.6 47 2-48 33-79 (79)
12 PF00028 Cadherin: Cadherin do 98.6 1.4E-07 3.1E-12 65.7 6.8 61 162-223 1-92 (93)
13 PF08266 Cadherin_2: Cadherin- 97.9 2.7E-05 5.8E-10 53.0 4.5 63 55-119 3-67 (84)
14 PF08758 Cadherin_pro: Cadheri 97.4 0.0034 7.5E-08 43.2 9.4 78 47-133 3-80 (90)
15 smart00736 CADG Dystroglycan-t 96.4 0.08 1.7E-06 36.9 10.2 70 74-152 24-96 (97)
16 TIGR01965 VCBS_repeat VCBS rep 94.0 1.4 3.1E-05 30.8 10.6 88 70-169 2-97 (99)
17 TIGR00845 caca sodium/calcium 92.9 10 0.00022 37.2 17.8 122 43-169 395-542 (928)
18 PF13750 Big_3_3: Bacterial Ig 92.5 3.8 8.3E-05 31.3 15.9 126 14-148 15-148 (158)
19 smart00736 CADG Dystroglycan-t 83.5 6.2 0.00014 27.2 6.4 31 13-45 66-96 (97)
20 PF05345 He_PIG: Putative Ig d 82.2 7.3 0.00016 23.3 5.5 35 95-132 13-48 (49)
21 TIGR00845 caca sodium/calcium 79.6 68 0.0015 31.8 14.0 57 33-93 514-570 (928)
22 TIGR03660 T1SS_rpt_143 T1SS-14 77.8 14 0.0003 27.6 6.8 56 2-63 70-128 (137)
23 PF07495 Y_Y_Y: Y_Y_Y domain; 75.0 18 0.00038 22.6 7.9 60 82-148 7-66 (66)
24 TIGR03660 T1SS_rpt_143 T1SS-14 72.7 23 0.00051 26.4 6.9 54 109-169 71-127 (137)
25 PF07495 Y_Y_Y: Y_Y_Y domain; 72.4 9.4 0.0002 23.9 4.3 29 13-41 38-66 (66)
26 KOG3597|consensus 66.6 88 0.0019 28.3 10.3 59 32-91 24-82 (442)
27 PF13750 Big_3_3: Bacterial Ig 64.6 61 0.0013 24.7 11.0 14 119-132 14-28 (158)
28 PF03160 Calx-beta: Calx-beta 63.5 45 0.00097 22.8 9.6 52 37-92 2-53 (100)
29 PF10365 DUF2436: Domain of un 55.4 80 0.0017 23.6 6.7 88 44-132 66-156 (161)
30 cd02848 Chitinase_N_term Chiti 52.7 42 0.0009 23.8 4.8 33 113-148 73-105 (106)
31 PF12245 Big_3_2: Bacterial Ig 52.3 55 0.0012 20.3 5.7 30 13-44 22-51 (60)
32 PF08758 Cadherin_pro: Cadheri 49.5 13 0.00028 25.5 1.8 33 154-186 3-35 (90)
33 smart00089 PKD Repeats in poly 47.2 58 0.0012 21.0 4.8 28 10-40 51-78 (79)
34 cd00146 PKD polycystic kidney 44.6 54 0.0012 21.2 4.3 30 114-146 51-80 (81)
35 cd00146 PKD polycystic kidney 44.2 66 0.0014 20.8 4.7 28 10-39 53-80 (81)
36 KOG4221|consensus 44.1 3.6E+02 0.0079 27.9 11.1 40 10-49 581-620 (1381)
37 PF12245 Big_3_2: Bacterial Ig 42.6 82 0.0018 19.5 5.4 30 119-151 22-51 (60)
38 TIGR01965 VCBS_repeat VCBS rep 35.9 1E+02 0.0022 21.6 4.6 41 14-62 57-97 (99)
39 cd02848 Chitinase_N_term Chiti 35.2 56 0.0012 23.2 3.2 30 7-36 73-102 (106)
40 PF05895 DUF859: Siphovirus pr 34.5 4.2E+02 0.009 25.3 12.5 110 13-132 297-424 (624)
41 KOG3597|consensus 32.9 66 0.0014 29.0 4.1 52 139-190 24-75 (442)
42 PF02494 HYR: HYR domain; Int 32.9 97 0.0021 20.3 4.1 25 120-147 57-81 (81)
43 PF08329 ChitinaseA_N: Chitina 30.3 2.3E+02 0.005 21.0 6.5 37 113-152 76-112 (133)
44 COG4288 Uncharacterized protei 27.5 1.9E+02 0.0041 20.6 4.7 41 32-75 66-106 (124)
45 PF09100 Qn_am_d_aIV: Quinohem 27.0 1.5E+02 0.0033 21.7 4.3 35 120-155 98-132 (133)
46 KOG4221|consensus 25.5 7.9E+02 0.017 25.7 12.4 67 85-156 552-620 (1381)
47 PF13754 Big_3_4: Bacterial Ig 20.6 1.9E+02 0.0041 17.4 3.4 18 12-29 22-39 (54)
48 PF03160 Calx-beta: Calx-beta 20.5 2.8E+02 0.0061 18.7 8.4 35 144-182 2-36 (100)
No 1
>KOG4289|consensus
Probab=100.00 E-value=8.3e-45 Score=334.92 Aligned_cols=229 Identities=34% Similarity=0.422 Sum_probs=215.9
Q ss_pred cEEccCCCCccCcEEEEEEEEEECCCCCCeeeEEEEEEEEecCCCCCccCCCceEEEEECCCCCCeeEEEEEEEeCCCCC
Q psy669 2 IRTSTILDVETKTHYWLTVFAQDHGVVPLHARTDVYIQVQNVNDNVPMTLDPVYYAQIVENQSGILPIVQLAASDGDLDP 81 (248)
Q Consensus 2 i~~~~~LD~E~~~~y~l~v~a~D~~~~~~~~~~~v~i~V~DvNDn~P~f~~~~~~~~v~E~~~~g~~v~~v~a~D~D~~~ 81 (248)
|++++.||||..+.+.|+|.|.|.+.|+++++++|+|.|.|+|||.|+|.+..|..++.||.+.|+.|++++|+|.|.++
T Consensus 221 irta~~lDREt~e~HvlrVtA~d~~~P~~SAtttv~V~V~D~nDhsPvFEq~~Y~e~lREn~evGy~vLtvrAtD~Dsp~ 300 (2531)
T KOG4289|consen 221 IRTAKSLDRETKETHVLRVTAQDHGDPRRSATTTVTVLVLDTNDHSPVFEQDEYREELRENLEVGYEVLTVRATDGDSPP 300 (2531)
T ss_pred chhhhhhhhhhhheeEEEEEeeecCCCcccceeEEEEEEeecCCCCcccchhHHHHHHhhccccCceEEEEEeccCCCCC
Confidence 67899999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred CceEEEEEecCCCCCcEEEeCCCeEEEEecCCCCccccceeEEEEEEEeCCCCCCCceEEEEEEEEEecCCCCCeeccCe
Q psy669 82 DQRISYKISAGNPESYFNIDIGSGLISTTGRKLDRENQAEHVLEVTVTDDGAPAPLSSTTRVIISVEDVNDNAPEFEQNF 161 (248)
Q Consensus 82 ~~~i~y~i~~~~~~~~F~id~~tg~i~~~~~~ld~e~~~~~~l~v~a~d~~~p~~~~~~~~v~I~V~d~Nd~~P~f~~~~ 161 (248)
|+.++|++..|+....|.|++.+|.| .+..+||||+...|.|.|.|.|.|.| +...++.|.|+|.|+|||+|+|....
T Consensus 301 Nani~Yrl~eg~~~~~f~in~rSGvI-~T~a~lDRE~~~~y~L~VeAsDqG~~-pgp~Ta~V~itV~D~NDNaPqFse~~ 378 (2531)
T KOG4289|consen 301 NANIRYRLLEGNAKNVFEINPRSGVI-STRAPLDREELESYQLDVEASDQGRP-PGPRTAMVEITVEDENDNAPQFSEKR 378 (2531)
T ss_pred CCceEEEecCCCccceeEEcCcccee-eccCccCHHhhhheEEEEEeccCCCC-CCCceEEEEEEEEecCCCCccccccc
Confidence 99999999999888999999999999 89999999999999999999999988 66679999999999999999999999
Q ss_pred eEEEecCCCCCCeeEEEEEEEeCCC-------------------------------CCCCccCceEEEEEEccCCCCcee
Q psy669 162 YHVNIPETRHRDEPLSQVSLISTTG-------------------------------RKLDRENQAEHVLETGSGESEATL 210 (248)
Q Consensus 162 y~~~v~E~~~~g~~v~~v~A~D~D~-------------------------------~~ld~e~~~~y~~~v~~~~~~~~~ 210 (248)
|.++|.|+..++++|++|+|+|.|- .+||+|.. +|++.+++.|+..
T Consensus 379 Yvvqv~Edvt~~avvlrV~AtDrD~g~Ng~VHYsi~Sgn~~G~f~id~~tGel~vv~plD~e~~-~ytl~IrAqDggr-- 455 (2531)
T KOG4289|consen 379 YVVQVREDVTPPAVVLRVTATDRDKGTNGKVHYSIASGNGRGQFYIDSLTGELDVVEPLDFENS-EYTLRIRAQDGGR-- 455 (2531)
T ss_pred eEEEecccCCCCceEEEEEecccCCCcCceEEEEeeccCccccEEEecccceEEEeccccccCC-eeEEEEEcccCCC--
Confidence 9999999999999999999999993 68999999 9999999888755
Q ss_pred ecCCceEEEEEEEEeeeCCcccccC
Q psy669 211 ETGSGESEATLEVLFENSTWENFHE 235 (248)
Q Consensus 211 ~~~~~t~~~~i~v~d~nd~~P~f~~ 235 (248)
+..+.+.-++|.|+|+|||.|.|..
T Consensus 456 PpLsn~sgl~iqVlDINDhaPifvs 480 (2531)
T KOG4289|consen 456 PPLSNTSGLVIQVLDINDHAPIFVS 480 (2531)
T ss_pred CCccCCCceEEEEEecCCCCceeEe
Confidence 4455566677999999999999943
No 2
>KOG4289|consensus
Probab=100.00 E-value=8.8e-42 Score=315.06 Aligned_cols=233 Identities=29% Similarity=0.383 Sum_probs=218.4
Q ss_pred cEEccCCCCccCcEEEEEEEEEECCCCCCeeeEEEEEEEEecCCCCCccCCCceEEEEECCCCCCeeEEEEEEEeCCCCC
Q psy669 2 IRTSTILDVETKTHYWLTVFAQDHGVVPLHARTDVYIQVQNVNDNVPMTLDPVYYAQIVENQSGILPIVQLAASDGDLDP 81 (248)
Q Consensus 2 i~~~~~LD~E~~~~y~l~v~a~D~~~~~~~~~~~v~i~V~DvNDn~P~f~~~~~~~~v~E~~~~g~~v~~v~a~D~D~~~ 81 (248)
|.+..+||||+.++|.|.|.|.|.|.++...+++|.|+|.|+|||+|+|..+.|.++|.|+..+++.|++|+|+|.|.|.
T Consensus 326 I~T~a~lDRE~~~~y~L~VeAsDqG~~pgp~Ta~V~itV~D~NDNaPqFse~~Yvvqv~Edvt~~avvlrV~AtDrD~g~ 405 (2531)
T KOG4289|consen 326 ISTRAPLDREELESYQLDVEASDQGRPPGPRTAMVEITVEDENDNAPQFSEKRYVVQVREDVTPPAVVLRVTATDRDKGT 405 (2531)
T ss_pred eeccCccCHHhhhheEEEEEeccCCCCCCCceEEEEEEEEecCCCCccccccceEEEecccCCCCceEEEEEecccCCCc
Confidence 56788999999999999999999998888889999999999999999999999999999999999999999999999999
Q ss_pred CceEEEEEecCCCCCcEEEeCCCeEEEEecCCCCccccceeEEEEEEEeCCCCCCCceEEEEEEEEEecCCCCCeeccCe
Q psy669 82 DQRISYKISAGNPESYFNIDIGSGLISTTGRKLDRENQAEHVLEVTVTDDGAPAPLSSTTRVIISVEDVNDNAPEFEQNF 161 (248)
Q Consensus 82 ~~~i~y~i~~~~~~~~F~id~~tg~i~~~~~~ld~e~~~~~~l~v~a~d~~~p~~~~~~~~v~I~V~d~Nd~~P~f~~~~ 161 (248)
|+.+.|+|.+|+..+.|.||..||.+ .+..+||+|.. .|.+.|.|.|+|.| +++.+..+.|+|+|+|||+|.|....
T Consensus 406 Ng~VHYsi~Sgn~~G~f~id~~tGel-~vv~plD~e~~-~ytl~IrAqDggrP-pLsn~sgl~iqVlDINDhaPifvstp 482 (2531)
T KOG4289|consen 406 NGKVHYSIASGNGRGQFYIDSLTGEL-DVVEPLDFENS-EYTLRIRAQDGGRP-PLSNTSGLVIQVLDINDHAPIFVSTP 482 (2531)
T ss_pred CceEEEEeeccCccccEEEecccceE-EEeccccccCC-eeEEEEEcccCCCC-CccCCCceEEEEEecCCCCceeEech
Confidence 99999999999999999999999999 88999999988 89999999999999 99999999999999999999999999
Q ss_pred eEEEecCCCCCCeeEEEEEEEeCCC----------------------------CCCCccCceEEEEEEccCCCCceeecC
Q psy669 162 YHVNIPETRHRDEPLSQVSLISTTG----------------------------RKLDRENQAEHVLETGSGESEATLETG 213 (248)
Q Consensus 162 y~~~v~E~~~~g~~v~~v~A~D~D~----------------------------~~ld~e~~~~y~~~v~~~~~~~~~~~~ 213 (248)
+.++|.|+.+.|..++.++|.|+|. +.||||....|+|.+.+.+... ..-
T Consensus 483 fq~tvlEnv~lg~~v~~vqaidadsg~na~l~y~laG~~pf~I~~~SG~Itvtk~ldrEt~~~ysl~V~ard~gt--p~l 560 (2531)
T KOG4289|consen 483 FQATVLENVPLGYLVCHVQAIDADSGENARLHYSLAGVGPFQINNGSGWITVTKELDRETVEHYSLGVEARDHGT--PPL 560 (2531)
T ss_pred hhhhhhhcccccceEEEEecccCCCCcccceeeeeccCCCeeEecCCceEEEeecccccccceEEEEEEEcCCCC--Ccc
Confidence 9999999999999999999999994 7799999999999998877644 444
Q ss_pred CceEEEEEEEEeeeCCcccccCCCCc
Q psy669 214 SGESEATLEVLFENSTWENFHETNIT 239 (248)
Q Consensus 214 ~~t~~~~i~v~d~nd~~P~f~~~~~~ 239 (248)
+.++.+.|.++|+|||.|.|....|.
T Consensus 561 ~tstsI~Vtv~dvndndP~Ft~~eyt 586 (2531)
T KOG4289|consen 561 STSTSISVTVLDVNDNDPTFTQKEYT 586 (2531)
T ss_pred cccceEEEEecccCCCCCccccCceE
Confidence 55679999999999999999666554
No 3
>KOG1219|consensus
Probab=100.00 E-value=3.2e-38 Score=299.60 Aligned_cols=231 Identities=31% Similarity=0.419 Sum_probs=216.9
Q ss_pred cEEccCCCCccCcEEEEEEEEEECCCCCCeeeEEEEEEEEecCCC--CCccCCCceEEEEECCCCCCeeEEEEEEEeCCC
Q psy669 2 IRTSTILDVETKTHYWLTVFAQDHGVVPLHARTDVYIQVQNVNDN--VPMTLDPVYYAQIVENQSGILPIVQLAASDGDL 79 (248)
Q Consensus 2 i~~~~~LD~E~~~~y~l~v~a~D~~~~~~~~~~~v~i~V~DvNDn--~P~f~~~~~~~~v~E~~~~g~~v~~v~a~D~D~ 79 (248)
|++.++||+|+...|.|.|.|.|.|.|.+++.+.+.|.++|+|.| ||.|....-.++|.||+|+|+.+++|.|.|.|.
T Consensus 903 vfi~~~LDf~k~~fynLsv~a~d~g~p~lss~chl~Vevldv~enlhpp~F~~~v~e~~V~EnapiGT~vi~i~A~deds 982 (4289)
T KOG1219|consen 903 VFIGKPLDFEKSDFYNLSVTAVDRGTPILSSICHLEVEVLDVNENLHPPEFISFVTEGHVLENAPIGTIVIRIQARDEDS 982 (4289)
T ss_pred EEEecccccccccceEEEEEEecCCCcceeeeEEEEEEEeccCCCCCCcchheeeeeeeEeecCCcceEEEEEEEecCCC
Confidence 567889999999999999999999999899999999999999987 999999999999999999999999999999999
Q ss_pred CCCceEEEEEecCCCCCcEEEeCCCeEEEEecCCCCccccceeEEEEEEEeCCCCCCCceEEEEEEEEEecCCCCCeecc
Q psy669 80 DPDQRISYKISAGNPESYFNIDIGSGLISTTGRKLDRENQAEHVLEVTVTDDGAPAPLSSTTRVIISVEDVNDNAPEFEQ 159 (248)
Q Consensus 80 ~~~~~i~y~i~~~~~~~~F~id~~tg~i~~~~~~ld~e~~~~~~l~v~a~d~~~p~~~~~~~~v~I~V~d~Nd~~P~f~~ 159 (248)
|..+.++|+|..|+..+.|+|+..+|.| .+...||||....|.|+|.|+|.|.+ ++++.+.+.|.|.|+|||+|+|.+
T Consensus 983 gldg~l~Y~I~~gdg~g~FsId~~tG~i-rTl~~lDrE~ks~YwltveA~D~gt~-~~ssv~~vyI~ieDvNDn~Pq~s~ 1060 (4289)
T KOG1219|consen 983 GLDGELSYKIRTGDGDGIFSIDSTTGSI-RTLKALDREKKSSYWLTVEAKDLGTV-PLSSVCEVYIEIEDVNDNVPQFSS 1060 (4289)
T ss_pred CccceEEEEEEcCCcceeEEecCCcceE-eechhhchhhcceEEEEEEEEecCCC-ccccceeEEEEEEecCCCCcccCC
Confidence 9999999999999888999999999999 78889999999999999999999999 999999999999999999999999
Q ss_pred CeeEEEecCCCCCCeeEEEEEEEeCCC-------------------------------CCCCccCceEEEEEEccCCCCc
Q psy669 160 NFYHVNIPETRHRDEPLSQVSLISTTG-------------------------------RKLDRENQAEHVLETGSGESEA 208 (248)
Q Consensus 160 ~~y~~~v~E~~~~g~~v~~v~A~D~D~-------------------------------~~ld~e~~~~y~~~v~~~~~~~ 208 (248)
+.|..+|.|+++.+..|.++.|.|+|. ..||||.+.++.|.+.-.|++-
T Consensus 1061 pvy~asI~enSp~~vsivq~ea~D~Dsssn~kLmykI~sGnyq~FF~Id~~TG~iTt~r~LDRE~qdEHiLeVTi~D~ge 1140 (4289)
T KOG1219|consen 1061 PVYYASISENSPETVSIVQAEANDPDSSSNQKLMYKITSGNYQGFFQIDPETGLITTIRRLDREKQDEHILEVTIQDNGE 1140 (4289)
T ss_pred ceEeeeeccCCCCceEEEEeccCCCCcccCcceEEEEccCCccceEEEccccceeeeehhhcccccccceEEEEEecCCC
Confidence 999999999999999999999999993 6799999999999987777633
Q ss_pred eeecCCceEEEEEEEEeeeCCcccccCC
Q psy669 209 TLETGSGESEATLEVLFENSTWENFHET 236 (248)
Q Consensus 209 ~~~~~~~t~~~~i~v~d~nd~~P~f~~~ 236 (248)
........+.|.|+|+|||+|.|.+.
T Consensus 1141 --p~l~s~~rviV~IldvNdnsp~Flqk 1166 (4289)
T KOG1219|consen 1141 --PWLCSNQRVIVSILDVNDNSPRFLQK 1166 (4289)
T ss_pred --CccccceEEEEEEeeccCCchhhhhh
Confidence 55566789999999999999999554
No 4
>KOG1219|consensus
Probab=100.00 E-value=5.3e-37 Score=291.41 Aligned_cols=225 Identities=30% Similarity=0.396 Sum_probs=209.8
Q ss_pred cEEccCCCCccCcEEEEEEEEEECCCCCCeeeEEEEEEEEecCCCCCccCCCceEEEEECCCCCCeeEEEEEEEeCCCCC
Q psy669 2 IRTSTILDVETKTHYWLTVFAQDHGVVPLHARTDVYIQVQNVNDNVPMTLDPVYYAQIVENQSGILPIVQLAASDGDLDP 81 (248)
Q Consensus 2 i~~~~~LD~E~~~~y~l~v~a~D~~~~~~~~~~~v~i~V~DvNDn~P~f~~~~~~~~v~E~~~~g~~v~~v~a~D~D~~~ 81 (248)
|.+.++||+|+.++|.++|+|.+.++ .-+.+.|.|.|.|+|||+|.|..+.|.+.+.||+|.|+.|++++|.|.|.+.
T Consensus 2634 i~v~ksLD~E~kk~yqi~v~a~~~~~--vva~tsv~vqVkDvNDNaPvFe~d~y~f~i~En~pvGtsV~qf~AsD~Ds~~ 2711 (4289)
T KOG1219|consen 2634 IKVNKSLDHEKKKSYQIKVKATCGQW--VVAETSVFVQVKDVNDNAPVFEKDPYLFIIEENSPVGTSVIQFHASDMDSGN 2711 (4289)
T ss_pred EEeccccchhhhceEEEEEEeecCCc--eEEEEEEEEEeecccCCCccccCCceeEEEeccCCCCceEEEEEeeccCCCC
Confidence 78999999999999999999998874 6789999999999999999999999999999999999999999999999999
Q ss_pred CceEEEEEecCCCCCcEEEeCCCeEEEEecCCCCccccceeEEEEEEEeCCCCCCCceEEEEEEEEEecCCCCCeeccCe
Q psy669 82 DQRISYKISAGNPESYFNIDIGSGLISTTGRKLDRENQAEHVLEVTVTDDGAPAPLSSTTRVIISVEDVNDNAPEFEQNF 161 (248)
Q Consensus 82 ~~~i~y~i~~~~~~~~F~id~~tg~i~~~~~~ld~e~~~~~~l~v~a~d~~~p~~~~~~~~v~I~V~d~Nd~~P~f~~~~ 161 (248)
||+++|++.+..+ +|.|+++||+| ++...||+|+...|.|.|.|+|+|.| ++.++|.|+|.|+||++|.|+...
T Consensus 2712 nGqirysl~~~v~--yF~In~etGwl-Tt~~eld~ek~d~y~lkv~AtDhG~~---ssq~~v~v~vtDvndspprf~~ei 2785 (4289)
T KOG1219|consen 2712 NGQIRYSLTSPVP--YFAINPETGWL-TTLFELDLEKQDLYSLKVVATDHGVP---SSQATVLVHVTDVNDSPPRFQREI 2785 (4289)
T ss_pred CceEEEEEcCCcc--eEEEcCCCCee-eehhhhccccCCceEEEEEEecCCcc---cccceEEEEEEecCCCcchhhhHh
Confidence 9999999987655 99999999999 88899999999999999999999966 678999999999999999999999
Q ss_pred eEEEecCCCCCCeeEEEEEEEeCCC-------------------------------CCCCccCceEEEEEEccCCCCcee
Q psy669 162 YHVNIPETRHRDEPLSQVSLISTTG-------------------------------RKLDRENQAEHVLETGSGESEATL 210 (248)
Q Consensus 162 y~~~v~E~~~~g~~v~~v~A~D~D~-------------------------------~~ld~e~~~~y~~~v~~~~~~~~~ 210 (248)
|..++.|+.+.|.+++.++-.|+|. +++|+|....|.|++.+.++.|.
T Consensus 2786 y~gtvv~d~p~~~~ia~~si~d~D~s~~nq~t~fI~~gd~~gqF~~i~ne~~~~~kKt~~~E~t~ny~Ltvtatdg~f~- 2864 (4289)
T KOG1219|consen 2786 YEGTVVEDVPGGKIIAGLSIFDADVSEVNQVTGFITLGDPLGQFWIIENEWIYEFKKTLDRESTKNYLLTVTATDGIFM- 2864 (4289)
T ss_pred hccceeccCCCCceeeeeEeccccccccceeEEEEeCCCccceEEEEcCcceEEEecchhhhcccceEEEEEEecccee-
Confidence 9999999999999999999999994 78999999999999999988663
Q ss_pred ecCCceEEEEEEEEeeeCCcccccCCCCc
Q psy669 211 ETGSGESEATLEVLFENSTWENFHETNIT 239 (248)
Q Consensus 211 ~~~~~t~~~~i~v~d~nd~~P~f~~~~~~ 239 (248)
.+..+.|.++|.|||.|.+.+.+|.
T Consensus 2865 ----~s~~vkv~v~~s~dn~~~c~~~~~t 2889 (4289)
T KOG1219|consen 2865 ----NSDNVKVLVLDSNDNSPFCGNQLYT 2889 (4289)
T ss_pred ----ccceEEEEeeccccCCccCcchhcc
Confidence 3568999999999999999666554
No 5
>cd00031 CA Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here
Probab=99.96 E-value=1.4e-26 Score=184.92 Aligned_cols=131 Identities=37% Similarity=0.579 Sum_probs=123.1
Q ss_pred ceEEEEECCCCCCeeEEEEEEEeCCCCCCceEEEEEecCCCCCcEEEeCCCeEEEEecCCCCccccceeEEEEEEEeCCC
Q psy669 54 VYYAQIVENQSGILPIVQLAASDGDLDPDQRISYKISAGNPESYFNIDIGSGLISTTGRKLDRENQAEHVLEVTVTDDGA 133 (248)
Q Consensus 54 ~~~~~v~E~~~~g~~v~~v~a~D~D~~~~~~i~y~i~~~~~~~~F~id~~tg~i~~~~~~ld~e~~~~~~l~v~a~d~~~ 133 (248)
.|.+.|.|+.+.|+.++++.|.|+|.+.++.++|+|.++....+|.|++.+|.| .+.+.||||....|.|.|.|+|.|.
T Consensus 1 ~~~~~i~En~~~g~~v~~~~a~D~D~~~~~~~~y~i~~~~~~~~F~i~~~tG~l-~~~~~lD~e~~~~~~l~v~a~D~g~ 79 (199)
T cd00031 1 SYSVSVPENAPPGTVVGTVSATDPDSGENGRVTYSILGGNEDGLFSIDPNTGVI-TTTKPLDREEQSEYTLTVVASDGGG 79 (199)
T ss_pred CeEEEEeCCCCCCCEEEEEEEECCCCCCCceEEEEEeCCCCcccEEEeCCCCEE-EECCCCCCcCCceEEEEEEEEECCc
Confidence 378999999999999999999999998889999999987766899999999999 7888999999999999999999888
Q ss_pred CCCCceEEEEEEEEEecCCCCCeeccCeeEEEecCCCCCCeeEEEEEEEeCCC
Q psy669 134 PAPLSSTTRVIISVEDVNDNAPEFEQNFYHVNIPETRHRDEPLSQVSLISTTG 186 (248)
Q Consensus 134 p~~~~~~~~v~I~V~d~Nd~~P~f~~~~y~~~v~E~~~~g~~v~~v~A~D~D~ 186 (248)
| .++++..++|.|.|+||++|.|....|.+.+.|+.+.|+.++++.|+|+|.
T Consensus 80 ~-~~~~~~~v~I~V~d~Nd~~P~~~~~~~~~~v~e~~~~~~~i~~~~a~D~D~ 131 (199)
T cd00031 80 P-PLSSTATVTVTVLDVNDNPPVFEQSSYEASVPENAPPGTVVGTVTATDADS 131 (199)
T ss_pred C-cceeEEEEEEEEccCCCCCCcccccceEEEEeCCCCCCCEEEEEEEEcCCC
Confidence 7 667899999999999999999998999999999999999999999999995
No 6
>cd00031 CA Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here
Probab=99.95 E-value=7.9e-26 Score=180.51 Aligned_cols=147 Identities=38% Similarity=0.547 Sum_probs=136.6
Q ss_pred CcEEccCCCCccCcEEEEEEEEEECCCCCCeeeEEEEEEEEecCCCCCccCCCceEEEEECCCCCCeeEEEEEEEeCCCC
Q psy669 1 NIRTSTILDVETKTHYWLTVFAQDHGVVPLHARTDVYIQVQNVNDNVPMTLDPVYYAQIVENQSGILPIVQLAASDGDLD 80 (248)
Q Consensus 1 ~i~~~~~LD~E~~~~y~l~v~a~D~~~~~~~~~~~v~i~V~DvNDn~P~f~~~~~~~~v~E~~~~g~~v~~v~a~D~D~~ 80 (248)
+|++++.||||....|.|.|+|.|.|.+.+++...+.|.|.|+||++|.|....|.+.|.|+.++|+.++++.|+|+|.+
T Consensus 53 ~l~~~~~lD~e~~~~~~l~v~a~D~g~~~~~~~~~v~I~V~d~Nd~~P~~~~~~~~~~v~e~~~~~~~i~~~~a~D~D~~ 132 (199)
T cd00031 53 VITTTKPLDREEQSEYTLTVVASDGGGPPLSSTATVTVTVLDVNDNPPVFEQSSYEASVPENAPPGTVVGTVTATDADSG 132 (199)
T ss_pred EEEECCCCCCcCCceEEEEEEEEECCcCcceeEEEEEEEEccCCCCCCcccccceEEEEeCCCCCCCEEEEEEEEcCCCC
Confidence 36788899999999999999999988777779999999999999999999988999999999999999999999999998
Q ss_pred CCceEEEEEecCCCCCcEEEeCCCeEEEEecCCCCccccceeEEEEEEEeCCCCCCCceEEEEEEEEEe
Q psy669 81 PDQRISYKISAGNPESYFNIDIGSGLISTTGRKLDRENQAEHVLEVTVTDDGAPAPLSSTTRVIISVED 149 (248)
Q Consensus 81 ~~~~i~y~i~~~~~~~~F~id~~tg~i~~~~~~ld~e~~~~~~l~v~a~d~~~p~~~~~~~~v~I~V~d 149 (248)
.++.++|+|.++....+|.|++.+|.| .+.+.||||....|.|.|.|+|.+.| .+++++.++|.|.|
T Consensus 133 ~~~~~~y~l~~~~~~~~f~i~~~~G~i-~~~~~ld~e~~~~~~l~v~a~D~~~~-~~~~~~~i~i~v~d 199 (199)
T cd00031 133 ENAKLTYSILSGNDKELFSIDPNTGII-TLAKPLDREEKSSYELTVVATDGGGP-PLSSTATVTVTVLD 199 (199)
T ss_pred CCccEEEEEeCCCCCCEEEEeCCceEE-EeCCccCCccCceEEEEEEEEECCCC-CceeEEEEEEEEEC
Confidence 889999999977655799999999999 78889999999999999999999887 88899999999875
No 7
>PF00028 Cadherin: Cadherin domain; InterPro: IPR002126 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion.; GO: 0005509 calcium ion binding, 0007156 homophilic cell adhesion, 0016020 membrane; PDB: 2A4E_A 2A4C_B 2O72_A 2QVI_A 1NCJ_A 3Q2W_A 3Q2N_A 3LNH_B 3LNI_A 3Q2L_A ....
Probab=99.82 E-value=3.2e-19 Score=125.24 Aligned_cols=92 Identities=41% Similarity=0.604 Sum_probs=87.7
Q ss_pred eEEEEECCCCCCeeEEEEEEEeCCCCCCceEEEEEecCCCCCcEEEeCCCeEEEEecCCCCccccceeEEEEEEEeC-CC
Q psy669 55 YYAQIVENQSGILPIVQLAASDGDLDPDQRISYKISAGNPESYFNIDIGSGLISTTGRKLDRENQAEHVLEVTVTDD-GA 133 (248)
Q Consensus 55 ~~~~v~E~~~~g~~v~~v~a~D~D~~~~~~i~y~i~~~~~~~~F~id~~tg~i~~~~~~ld~e~~~~~~l~v~a~d~-~~ 133 (248)
|.++|+|+.++|+.++++.|.|+|.+.|+.+.|+|..+++.++|.|++.+|.| .+.+.||||....|.|.|.|+|. +.
T Consensus 1 Y~~~v~E~~~~g~~v~~v~a~D~D~~~n~~i~y~i~~~~~~~~F~I~~~tg~i-~~~~~LD~E~~~~y~l~v~a~D~~~~ 79 (93)
T PF00028_consen 1 YSFSVPENAPPGTVVGQVTATDPDSGPNSQITYSILGGNPDGLFSIDPNTGEI-SLKKPLDRETQSSYQLTVRATDSGGS 79 (93)
T ss_dssp EEEEEETTGSTSSEEEEEEEEESSTSTTSSEEEEEEETTSTTSEEEETTTTEE-EESSSSCTTTTSEEEEEEEEEETTTS
T ss_pred CEEEEECCCCCCCEEEEEEEEeCCCCCCceEEEEEecCcccCceEEeeeeecc-ccceecCcccCCEEEEEEEEEECCCC
Confidence 78999999999999999999999998899999999998888999999999999 89999999999999999999999 88
Q ss_pred CCCCceEEEEEEEEE
Q psy669 134 PAPLSSTTRVIISVE 148 (248)
Q Consensus 134 p~~~~~~~~v~I~V~ 148 (248)
| ++++++.|+|+|+
T Consensus 80 ~-~~~~~~~V~I~V~ 93 (93)
T PF00028_consen 80 P-PLSSTATVTINVL 93 (93)
T ss_dssp S-EEEEEEEEEEEEE
T ss_pred C-CCEEEEEEEEEEC
Confidence 8 8999999999885
No 8
>smart00112 CA Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
Probab=99.74 E-value=3e-17 Score=111.64 Aligned_cols=79 Identities=49% Similarity=0.754 Sum_probs=72.9
Q ss_pred EeCCCCCCceEEEEEecCCCCCcEEEeCCCeEEEEecCCCCccccceeEEEEEEEeCCCCCCCceEEEEEEEEEecCCCC
Q psy669 75 SDGDLDPDQRISYKISAGNPESYFNIDIGSGLISTTGRKLDRENQAEHVLEVTVTDDGAPAPLSSTTRVIISVEDVNDNA 154 (248)
Q Consensus 75 ~D~D~~~~~~i~y~i~~~~~~~~F~id~~tg~i~~~~~~ld~e~~~~~~l~v~a~d~~~p~~~~~~~~v~I~V~d~Nd~~ 154 (248)
+|+|.|.|+.++|+|.++....+|.|++.+|.| .+.++||||....|.|.|.|+|.|.| ++++.+.|+|+|.|+|||+
T Consensus 1 ~D~D~g~n~~i~Y~i~~~~~~~~F~i~~~tg~i-~~~~~LD~e~~~~y~l~v~a~D~~~~-~~~~~~~v~I~V~D~Nd~~ 78 (79)
T smart00112 1 TDADSGENGKVTYSILSGNEDGLFSIDPETGEI-TTTKPLDREEQPEYTLTVEATDGGGP-PLSSTATVTVTVLDVNDNA 78 (79)
T ss_pred CCCCCCcCcEEEEEEecCCCCCEEEEeCCccEE-EeCCccCeeCCCeEEEEEEEEECCCC-CcccEEEEEEEEEECCCCC
Confidence 478888899999999987755899999999988 77779999999999999999999998 8999999999999999999
Q ss_pred C
Q psy669 155 P 155 (248)
Q Consensus 155 P 155 (248)
|
T Consensus 79 P 79 (79)
T smart00112 79 P 79 (79)
T ss_pred C
Confidence 8
No 9
>KOG1834|consensus
Probab=99.74 E-value=6.1e-17 Score=142.68 Aligned_cols=143 Identities=29% Similarity=0.394 Sum_probs=123.3
Q ss_pred cEEccCCCCccCcEEEEEEEEEECCCC------CCeeeEEEEEEEEecCCCCCccCCCceEEEEECCCCCCeeEEEEEEE
Q psy669 2 IRTSTILDVETKTHYWLTVFAQDHGVV------PLHARTDVYIQVQNVNDNVPMTLDPVYYAQIVENQSGILPIVQLAAS 75 (248)
Q Consensus 2 i~~~~~LD~E~~~~y~l~v~a~D~~~~------~~~~~~~v~i~V~DvNDn~P~f~~~~~~~~v~E~~~~g~~v~~v~a~ 75 (248)
|+++.+||.|.++.|+|+|+|.|+|.. ..+..++|+|+|.|+|..+|.|..+.|.+.|.|.. .-..|+++.|.
T Consensus 95 lRaK~~lDCelqkeytf~iQAydCg~gpdgtn~kKShkatvhIrVkDvNe~AP~f~ep~Yka~V~EGK-~yd~il~veAi 173 (952)
T KOG1834|consen 95 LRAKEPLDCELQKEYTFTIQAYDCGNGPDGTNTKKSHKATVHIRVKDVNEFAPVFKEPWYKAHVTEGK-VYDSILRVEAI 173 (952)
T ss_pred EeecCcccccccccceEEEEEEecCCCCCccccccccceEEEEEeccccccCchhcccceeeEEecce-eeeeeEEEEee
Confidence 678899999999999999999997743 36778999999999999999999999999999984 45779999999
Q ss_pred eCCCCC-Cc-eEEEEEecCCCCCcEEEeCCCeEEEEecCCCCccccceeEEEEEEEeCCCCCCCceEEEEEEEEEec
Q psy669 76 DGDLDP-DQ-RISYKISAGNPESYFNIDIGSGLISTTGRKLDRENQAEHVLEVTVTDDGAPAPLSSTTRVIISVEDV 150 (248)
Q Consensus 76 D~D~~~-~~-~i~y~i~~~~~~~~F~id~~tg~i~~~~~~ld~e~~~~~~l~v~a~d~~~p~~~~~~~~v~I~V~d~ 150 (248)
|.|.++ ++ ...|.|.. +.-.|.||.. |.| ....+|.|....+|.|+|.|.|.|.. +..+.+.|+|.|...
T Consensus 174 D~DCspq~sqIC~YEI~t--~d~PFaIdn~-G~i-rnTekLny~ke~~Y~ltVtAyDCg~k-raa~d~lV~v~Vkp~ 245 (952)
T KOG1834|consen 174 DKDCSPQYSQICEYEITT--PDVPFAIDND-GNI-RNTEKLNYTKEHQYKLTVTAYDCGKK-RAASDSLVTVHVKPT 245 (952)
T ss_pred cCCCCCcccceeEEEecC--CCCceEEcCC-Ccc-ccccccccccceeEEEEEEEEecccc-cccCcceEEEEecCc
Confidence 999976 55 46899974 5568999987 999 77889999999999999999999976 445557788888654
No 10
>KOG1834|consensus
Probab=99.58 E-value=9.6e-14 Score=122.77 Aligned_cols=180 Identities=23% Similarity=0.283 Sum_probs=135.8
Q ss_pred EEEecCCCCCccCCCceEEEEECCCCCCeeEEEEEEEeCCCCC---CceEEEEEecCCC-CCcEEEeCCC--eEEEEecC
Q psy669 39 QVQNVNDNVPMTLDPVYYAQIVENQSGILPIVQLAASDGDLDP---DQRISYKISAGNP-ESYFNIDIGS--GLISTTGR 112 (248)
Q Consensus 39 ~V~DvNDn~P~f~~~~~~~~v~E~~~~g~~v~~v~a~D~D~~~---~~~i~y~i~~~~~-~~~F~id~~t--g~i~~~~~ 112 (248)
...-+|-+.|.... .|.+-|.||...-.....+.|.|.|.+. ....-|.|.+..- ...--+|..| |.| ..+.
T Consensus 22 ~aarankhkpwie~-ey~gvV~Endntvll~Ppl~aLdkdaplr~ageiC~fklhgq~vPFdavVvdK~TGegvl-RaK~ 99 (952)
T KOG1834|consen 22 HAARANKHKPWIEE-EYHGVVTENDNTVLLDPPLAALDKDAPLRYAGEICGFKLHGQPVPFDAVVVDKYTGEGVL-RAKE 99 (952)
T ss_pred ccccccccCccccc-ceeEEEEeCCceEEeCCCeeeecCCCCcccccccceeEecCCCCCceEEEEeccCCceEE-eecC
Confidence 34456667776654 4999999997543334468899999754 2346788875332 1233447776 467 7889
Q ss_pred CCCccccceeEEEEEEEeCCCCC-----CCceEEEEEEEEEecCCCCCeeccCeeEEEecCCCCCCeeEEEEEEEeCCC-
Q psy669 113 KLDRENQAEHVLEVTVTDDGAPA-----PLSSTTRVIISVEDVNDNAPEFEQNFYHVNIPETRHRDEPLSQVSLISTTG- 186 (248)
Q Consensus 113 ~ld~e~~~~~~l~v~a~d~~~p~-----~~~~~~~v~I~V~d~Nd~~P~f~~~~y~~~v~E~~~~g~~v~~v~A~D~D~- 186 (248)
+||+|.++.|+|+|+|.|.|.-. ..+..++|.|+|.|+|+++|+|..+.|.+.|.|+ +.-..|++|.|.|.|-
T Consensus 100 ~lDCelqkeytf~iQAydCg~gpdgtn~kKShkatvhIrVkDvNe~AP~f~ep~Yka~V~EG-K~yd~il~veAiD~DCs 178 (952)
T KOG1834|consen 100 PLDCELQKEYTFTIQAYDCGNGPDGTNTKKSHKATVHIRVKDVNEFAPVFKEPWYKAHVTEG-KVYDSILRVEAIDKDCS 178 (952)
T ss_pred cccccccccceEEEEEEecCCCCCccccccccceEEEEEeccccccCchhcccceeeEEecc-eeeeeeEEEEeecCCCC
Confidence 99999999999999999977541 4577899999999999999999999999999999 7788899999999983
Q ss_pred -----------------------------CCCCccCceEEEEEEccCCCCceeecCCceEEEEEEE
Q psy669 187 -----------------------------RKLDRENQAEHVLETGSGESEATLETGSGESEATLEV 223 (248)
Q Consensus 187 -----------------------------~~ld~e~~~~y~~~v~~~~~~~~~~~~~~t~~~~i~v 223 (248)
.+|.|..+..|.|++.+-+-.-. ....-..|+|+|
T Consensus 179 pq~sqIC~YEI~t~d~PFaIdn~G~irnTekLny~ke~~Y~ltVtAyDCg~k--raa~d~lV~v~V 242 (952)
T KOG1834|consen 179 PQYSQICEYEITTPDVPFAIDNDGNIRNTEKLNYTKEHQYKLTVTAYDCGKK--RAASDSLVTVHV 242 (952)
T ss_pred CcccceeEEEecCCCCceEEcCCCccccccccccccceeEEEEEEEEecccc--cccCcceEEEEe
Confidence 56888889999999977765332 222234666666
No 11
>smart00112 CA Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
Probab=99.16 E-value=9.8e-11 Score=79.30 Aligned_cols=47 Identities=45% Similarity=0.567 Sum_probs=44.9
Q ss_pred cEEccCCCCccCcEEEEEEEEEECCCCCCeeeEEEEEEEEecCCCCC
Q psy669 2 IRTSTILDVETKTHYWLTVFAQDHGVVPLHARTDVYIQVQNVNDNVP 48 (248)
Q Consensus 2 i~~~~~LD~E~~~~y~l~v~a~D~~~~~~~~~~~v~i~V~DvNDn~P 48 (248)
|++.++||||..+.|.|.|+|.|.+.+++++.+.|+|.|.|+|||+|
T Consensus 33 i~~~~~LD~e~~~~y~l~v~a~D~~~~~~~~~~~v~I~V~D~Nd~~P 79 (79)
T smart00112 33 ITTTKPLDREEQPEYTLTVEATDGGGPPLSSTATVTVTVLDVNDNAP 79 (79)
T ss_pred EEeCCccCeeCCCeEEEEEEEEECCCCCcccEEEEEEEEEECCCCCC
Confidence 67788999999999999999999999899999999999999999998
No 12
>PF00028 Cadherin: Cadherin domain; InterPro: IPR002126 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion.; GO: 0005509 calcium ion binding, 0007156 homophilic cell adhesion, 0016020 membrane; PDB: 2A4E_A 2A4C_B 2O72_A 2QVI_A 1NCJ_A 3Q2W_A 3Q2N_A 3LNH_B 3LNI_A 3Q2L_A ....
Probab=98.62 E-value=1.4e-07 Score=65.71 Aligned_cols=61 Identities=25% Similarity=0.337 Sum_probs=44.6
Q ss_pred eEEEecCCCCCCeeEEEEEEEeCCC-------------------------------CCCCccCceEEEEEEccCCCCcee
Q psy669 162 YHVNIPETRHRDEPLSQVSLISTTG-------------------------------RKLDRENQAEHVLETGSGESEATL 210 (248)
Q Consensus 162 y~~~v~E~~~~g~~v~~v~A~D~D~-------------------------------~~ld~e~~~~y~~~v~~~~~~~~~ 210 (248)
|++.|+|++++|+.|+++.|.|+|. ++||||....|.|.+.+.+. ...
T Consensus 1 Y~~~v~E~~~~g~~v~~v~a~D~D~~~n~~i~y~i~~~~~~~~F~I~~~tg~i~~~~~LD~E~~~~y~l~v~a~D~-~~~ 79 (93)
T PF00028_consen 1 YSFSVPENAPPGTVVGQVTATDPDSGPNSQITYSILGGNPDGLFSIDPNTGEISLKKPLDRETQSSYQLTVRATDS-GGS 79 (93)
T ss_dssp EEEEEETTGSTSSEEEEEEEEESSTSTTSSEEEEEEETTSTTSEEEETTTTEEEESSSSCTTTTSEEEEEEEEEET-TTS
T ss_pred CEEEEECCCCCCCEEEEEEEEeCCCCCCceEEEEEecCcccCceEEeeeeeccccceecCcccCCEEEEEEEEEEC-CCC
Confidence 7899999999999999999999993 56777777777777766665 111
Q ss_pred ecCCceEEEEEEE
Q psy669 211 ETGSGESEATLEV 223 (248)
Q Consensus 211 ~~~~~t~~~~i~v 223 (248)
.....+++++|+|
T Consensus 80 ~~~~~~~~V~I~V 92 (93)
T PF00028_consen 80 PPLSSTATVTINV 92 (93)
T ss_dssp SEEEEEEEEEEEE
T ss_pred CCCEEEEEEEEEE
Confidence 2333455555555
No 13
>PF08266 Cadherin_2: Cadherin-like; InterPro: IPR013164 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion. This entry represents a cadherin domain that is usually found at the N terminus of cadherin proteins.; PDB: 1WUZ_A 1WYJ_A.
Probab=97.86 E-value=2.7e-05 Score=53.01 Aligned_cols=63 Identities=22% Similarity=0.346 Sum_probs=40.7
Q ss_pred eEEEEECCCCCCeeEEEEEEEeCCCCCC--ceEEEEEecCCCCCcEEEeCCCeEEEEecCCCCcccc
Q psy669 55 YYAQIVENQSGILPIVQLAASDGDLDPD--QRISYKISAGNPESYFNIDIGSGLISTTGRKLDRENQ 119 (248)
Q Consensus 55 ~~~~v~E~~~~g~~v~~v~a~D~D~~~~--~~i~y~i~~~~~~~~F~id~~tg~i~~~~~~ld~e~~ 119 (248)
...+|+|..++|+.|+.| |.|...... ....|++.+.....+|.++..+|.+ .+...+|||..
T Consensus 3 i~YsV~EE~~~Gt~IGni-a~dL~l~~~~l~~~~~ri~s~~~~~~~~v~~~tG~L-~v~~rIDRE~L 67 (84)
T PF08266_consen 3 IRYSVPEEMPPGTVIGNI-AKDLGLDPQSLSSRNFRIVSEGNSQYFRVNEKTGDL-FVSERIDREEL 67 (84)
T ss_dssp EEEEEESS--TT-EEEEC-CCCCT--HHHHCCTTBEEE-SSSS-SEEE-TTTSEE-EESS--SCCCC
T ss_pred eEEEeecCCCCCCEEEEh-HHhhCCCcccccccceEEeecCCcceeEecCCceeE-EeCCccCHHHH
Confidence 467899999999999998 445433211 1236777776667899999999999 88899999963
No 14
>PF08758 Cadherin_pro: Cadherin prodomain like; InterPro: IPR014868 Cadherins are a group of proteins that mediate calcium dependent cell-cell adhesion. They are activated through cleavage of a prosequence in the late Golgi. This protein corresponds to the folded region of the prosequence, and is termed the prodomain. The prodomain shows structural resemblance to the cadherin domain, but lacks all the features known to be important for cadherin-cadherin interactions []. ; GO: 0007155 cell adhesion, 0016021 integral to membrane; PDB: 1OP4_A.
Probab=97.35 E-value=0.0034 Score=43.24 Aligned_cols=78 Identities=17% Similarity=0.214 Sum_probs=41.4
Q ss_pred CCccCCCceEEEEECCCCCCeeEEEEEEEeCCCCCCceEEEEEecCCCCCcEEEeCCCeEEEEecCCCCccccceeEEEE
Q psy669 47 VPMTLDPVYYAQIVENQSGILPIVQLAASDGDLDPDQRISYKISAGNPESYFNIDIGSGLISTTGRKLDRENQAEHVLEV 126 (248)
Q Consensus 47 ~P~f~~~~~~~~v~E~~~~g~~v~~v~a~D~D~~~~~~i~y~i~~~~~~~~F~id~~tg~i~~~~~~ld~e~~~~~~l~v 126 (248)
.|-|....|.+.|+.+...|..|++|.-.|... +..+.|... ++ .|.|..+ |.| .+.+.+.... ..-.|.|
T Consensus 3 ~pGF~~~~~~~~Vp~~l~~g~~lg~V~f~dC~~--~~~~~~~ss--Dp--dF~V~~D-GsV-y~~r~v~l~~-~~~~F~V 73 (90)
T PF08758_consen 3 RPGFSQKKYTFEVPSNLEAGQPLGKVNFEDCTG--RRRVIFESS--DP--DFRVLED-GSV-YAKRPVQLSS-EQRSFTV 73 (90)
T ss_dssp --B--S-EEEE----SS-SS--EEE---B--SS-----EEEE-----S--EEEEETT-TEE-EEES--S-SS-S-EEEEE
T ss_pred cCCcccceEEEEcCchhhCCcEEEEEEeccCCC--CCceEEecC--CC--CEEEcCC-CeE-EEeeeEecCC-CceEEEE
Confidence 588999999999999999999999999988753 556777663 44 7999988 999 7777776543 3358999
Q ss_pred EEEeCCC
Q psy669 127 TVTDDGA 133 (248)
Q Consensus 127 ~a~d~~~ 133 (248)
.|.|...
T Consensus 74 ~a~D~~~ 80 (90)
T PF08758_consen 74 HAWDSQT 80 (90)
T ss_dssp EEEETTT
T ss_pred EEECCCC
Confidence 9999764
No 15
>smart00736 CADG Dystroglycan-type cadherin-like domains. Cadherin-homologous domains present in metazoan dystroglycans and alpha/epsilon sarcoglycans, yeast Axl2p and in a very large protein from magnetotactic bacteria. Likely to bind calcium ions.
Probab=96.41 E-value=0.08 Score=36.93 Aligned_cols=70 Identities=24% Similarity=0.394 Sum_probs=50.7
Q ss_pred EEeCCCCCCceEEEEEecCC---CCCcEEEeCCCeEEEEecCCCCccccceeEEEEEEEeCCCCCCCceEEEEEEEEEec
Q psy669 74 ASDGDLDPDQRISYKISAGN---PESYFNIDIGSGLISTTGRKLDRENQAEHVLEVTVTDDGAPAPLSSTTRVIISVEDV 150 (248)
Q Consensus 74 a~D~D~~~~~~i~y~i~~~~---~~~~F~id~~tg~i~~~~~~ld~e~~~~~~l~v~a~d~~~p~~~~~~~~v~I~V~d~ 150 (248)
..|+| +..++|++...+ -..|...++.++.+.- .+.... ...|.+.|.|+|.. +.+....++|.|.+.
T Consensus 24 F~d~d---~~~lty~~~~~~~~~lP~Wl~fd~~~~~~~G--tP~~~~-~g~~~i~v~a~D~~---g~~~~~~f~i~V~~~ 94 (97)
T smart00736 24 FTDAD---GDTLTYSATLSDGSALPSWLSFDSDTGTLSG--TPTNSD-VGSLSLKVTATDSS---GASASDTFTITVVNT 94 (97)
T ss_pred eECCC---CCeEEEEEEeCCCCCCCCeEEEeCCCCEEEE--ECCCCC-CcEEEEEEEEEECC---CCEEEEEEEEEEeCC
Confidence 45666 457888886432 2468999999888732 334333 45699999999986 456788899999998
Q ss_pred CC
Q psy669 151 ND 152 (248)
Q Consensus 151 Nd 152 (248)
|+
T Consensus 95 ~~ 96 (97)
T smart00736 95 ND 96 (97)
T ss_pred CC
Confidence 86
No 16
>TIGR01965 VCBS_repeat VCBS repeat. This domain of about 100 residues is found multiple (up to 35) copies in long proteins from several species of Vibrio, Colwellia, Bradyrhizobium, and Shewanella (hence the name VCBS) and in smaller copy numbers in proteins from several other bacteria. The large protein size and repeat copy numbers, species distribution, and suggested activities of several member proteins suggests a role for this domain in adhesion.
Probab=94.03 E-value=1.4 Score=30.82 Aligned_cols=88 Identities=22% Similarity=0.237 Sum_probs=52.7
Q ss_pred EEEEEEeCCCCCCceEEEEEec-CCCCCcEEEeCCCeEEEEe-c------CCCCccccceeEEEEEEEeCCCCCCCceEE
Q psy669 70 VQLAASDGDLDPDQRISYKISA-GNPESYFNIDIGSGLISTT-G------RKLDRENQAEHVLEVTVTDDGAPAPLSSTT 141 (248)
Q Consensus 70 ~~v~a~D~D~~~~~~i~y~i~~-~~~~~~F~id~~tg~i~~~-~------~~ld~e~~~~~~l~v~a~d~~~p~~~~~~~ 141 (248)
+++.++|+|.+. ...+++.. ...-+.|.|+.. |..... . +.|.....-.-.|++.+.|+ .+.
T Consensus 2 G~Lt~sD~D~gd--~~~~s~~~~~g~yGtlti~~~-G~wtYtl~n~~~avq~L~~Ge~~tdsFtvtv~DG-------tt~ 71 (99)
T TIGR01965 2 GQLTISDADAGQ--AHFIAQTDAAGQYGTFSIDAD-GQWTYQADNSQTAVQALKAGETLTDTFTVTSADG-------TSQ 71 (99)
T ss_pred CceEEeCCCCCC--ceEEecccccCCcEEEEECCC-CcEEEEeCCCcHHHHhhcCCCEEEEEEEEEEeCC-------CeE
Confidence 467888888753 34555532 122356888774 544221 1 12332233345777888875 278
Q ss_pred EEEEEEEecCCCCCeeccCeeEEEecCC
Q psy669 142 RVIISVEDVNDNAPEFEQNFYHVNIPET 169 (248)
Q Consensus 142 ~v~I~V~d~Nd~~P~f~~~~y~~~v~E~ 169 (248)
.|+|.|.-.|| +|+.... -...+.|+
T Consensus 72 ~vtItI~GtND-apvi~~~-~~g~v~ED 97 (99)
T TIGR01965 72 TVTITITGAND-AAVIGGA-DTGSVTED 97 (99)
T ss_pred EEEEEEEccCC-CCEEecc-cceeEecC
Confidence 89999999999 7766533 25666665
No 17
>TIGR00845 caca sodium/calcium exchanger 1. This model is specific for the eukaryotic sodium ion/calcium ion exchangers of the Caca family
Probab=92.90 E-value=10 Score=37.24 Aligned_cols=122 Identities=19% Similarity=0.208 Sum_probs=63.1
Q ss_pred cCCCCCccCCCceEEEEECCCCCCeeEEEEEEEeCCCCCCceEEEEEecCCCC--CcEEEeCCCeEEEEe---------c
Q psy669 43 VNDNVPMTLDPVYYAQIVENQSGILPIVQLAASDGDLDPDQRISYKISAGNPE--SYFNIDIGSGLISTT---------G 111 (248)
Q Consensus 43 vNDn~P~f~~~~~~~~v~E~~~~g~~v~~v~a~D~D~~~~~~i~y~i~~~~~~--~~F~id~~tg~i~~~---------~ 111 (248)
.||..+.|....-...|.|+. |+.-..|.-...|.+..-.+.|+..+|... .-|. +.+|.+... -
T Consensus 395 ~dd~~s~i~Fe~~~Y~V~En~--GtV~VtV~R~GGdl~~tVsVdY~T~DGTA~AG~DY~--~~sGTLtF~PGEt~KtItV 470 (928)
T TIGR00845 395 ENDPVSKIFFEPGHYTCLENC--GTVALTVVRRGGDLTNTVYVDYRTEDGTANAGSDYE--FTEGTLVFKPGETQKEFRI 470 (928)
T ss_pred ccCCcceEEecCCeEEEeecC--cEEEEEEEEccCCCCceEEEEEEccCCccCCCCCcc--ccCceEEECCCceEEEEEE
Confidence 455566665555677889985 666666655544544345688988765432 1122 223433110 0
Q ss_pred CCCC---ccccceeEEEEEEEeCCC------------CCCCceEEEEEEEEEecCCCCCeeccCeeEEEecCC
Q psy669 112 RKLD---RENQAEHVLEVTVTDDGA------------PAPLSSTTRVIISVEDVNDNAPEFEQNFYHVNIPET 169 (248)
Q Consensus 112 ~~ld---~e~~~~~~l~v~a~d~~~------------p~~~~~~~~v~I~V~d~Nd~~P~f~~~~y~~~v~E~ 169 (248)
..+| +|..+.|.+.+.--..+. +..+......+|+|.| ||++|.|.-..-...|.|+
T Consensus 471 ~IIDDdi~E~DE~F~V~LSNp~~g~~~G~~~~~~~~~~A~Lg~ps~ATVTIlD-DD~aGIfsFe~~~~sV~Es 542 (928)
T TIGR00845 471 GIIDDDIFEEDEHFYVRLSNLRVGSEDGILEANHVSAVAQLASPNTATVTILD-DDHAGIFTFEEDVFHVSES 542 (928)
T ss_pred EEccCCCCCCCceEEEEEeCCCCCCcccccccccccccceecCCceEEEEEec-CcccCcccccCceEEEEcC
Confidence 1122 344455555543321110 0012233456677776 6789986654446777786
No 18
>PF13750 Big_3_3: Bacterial Ig-like domain (group 3)
Probab=92.48 E-value=3.8 Score=31.32 Aligned_cols=126 Identities=21% Similarity=0.202 Sum_probs=66.6
Q ss_pred cEEEEEE-EEEECCCCCCeeeEEEEEEEEecCCCCCccCCCceEEEEECCCCC-CeeEEEEEEEeCCCCCCceEEEEEec
Q psy669 14 THYWLTV-FAQDHGVVPLHARTDVYIQVQNVNDNVPMTLDPVYYAQIVENQSG-ILPIVQLAASDGDLDPDQRISYKISA 91 (248)
Q Consensus 14 ~~y~l~v-~a~D~~~~~~~~~~~v~i~V~DvNDn~P~f~~~~~~~~v~E~~~~-g~~v~~v~a~D~D~~~~~~i~y~i~~ 91 (248)
..|.+++ .|.|..+. ..+..+..++. +...+|...- .....+..+... |..=..+.++|.-.+. ..-+.+|.+
T Consensus 15 G~Y~l~~~~a~D~agN--~~~~~~~~~~~-iD~T~Ptisi-~~~~~~~~g~~v~~~~~i~i~~tD~~~~~-~i~sv~l~G 89 (158)
T PF13750_consen 15 GSYTLTVVTATDAAGN--TSTSTVSETFT-IDNTPPTISI-SDGASVANGSTVYGLVNISINVTDNSDDS-KITSVSLTG 89 (158)
T ss_pred ccEEEEEEEEEecCCC--EEEEEEeeEEE-EcCCCCEEEE-ecCCccCCCccccceeeeEEEEEeCCCCc-eEEEEEEEC
Confidence 4599999 79998744 34444432322 2235776644 111223233222 2233567777766533 344677765
Q ss_pred CCCCCcEEE--eC-CCeEEEEec--CCC-CccccceeEEEEEEEeCCCCCCCceEEEEEEEEE
Q psy669 92 GNPESYFNI--DI-GSGLISTTG--RKL-DRENQAEHVLEVTVTDDGAPAPLSSTTRVIISVE 148 (248)
Q Consensus 92 ~~~~~~F~i--d~-~tg~i~~~~--~~l-d~e~~~~~~l~v~a~d~~~p~~~~~~~~v~I~V~ 148 (248)
|.....-.+ .. ..|.. ... +.+ ..|....|.|+|.|+|.. +..++..+.....
T Consensus 90 g~~~d~v~ls~~~~~~~~~-~~~yp~~fpsle~~~~YtLtV~a~D~a---GN~~~~si~F~y~ 148 (158)
T PF13750_consen 90 GPASDSVSLSWTNKGNGVY-TLEYPRIFPSLEADDSYTLTVSATDKA---GNQSTKSISFSYM 148 (158)
T ss_pred CcccceEEEeeEeccCceE-EeecccccCCcCCCCeEEEEEEEEecC---CCEEEEEEEEEEe
Confidence 443322222 11 12333 221 111 236788899999999976 4556666666554
No 19
>smart00736 CADG Dystroglycan-type cadherin-like domains. Cadherin-homologous domains present in metazoan dystroglycans and alpha/epsilon sarcoglycans, yeast Axl2p and in a very large protein from magnetotactic bacteria. Likely to bind calcium ions.
Probab=83.46 E-value=6.2 Score=27.25 Aligned_cols=31 Identities=32% Similarity=0.219 Sum_probs=26.2
Q ss_pred CcEEEEEEEEEECCCCCCeeeEEEEEEEEecCC
Q psy669 13 KTHYWLTVFAQDHGVVPLHARTDVYIQVQNVND 45 (248)
Q Consensus 13 ~~~y~l~v~a~D~~~~~~~~~~~v~i~V~DvND 45 (248)
...|.++|.|+|..+ .+....+.|.|.+.|+
T Consensus 66 ~g~~~i~v~a~D~~g--~~~~~~f~i~V~~~~~ 96 (97)
T smart00736 66 VGSLSLKVTATDSSG--ASASDTFTITVVNTND 96 (97)
T ss_pred CcEEEEEEEEEECCC--CEEEEEEEEEEeCCCC
Confidence 456999999999874 6788889999999886
No 20
>PF05345 He_PIG: Putative Ig domain; InterPro: IPR008009 This alignment represents the conserved core region of a ~90 residue repeat found in several haemagglutinins and other cell surface proteins. Sequence similarities to Hyalin (IPR003410 from INTERPRO) and the PKD domain (IPR000601 from INTERPRO) suggest an Ig-like fold so this family may be similar in function to the (IPR003791 from INTERPRO) and (IPR003790 from INTERPRO) protein families.
Probab=82.23 E-value=7.3 Score=23.34 Aligned_cols=35 Identities=23% Similarity=0.280 Sum_probs=24.9
Q ss_pred CCcEEEeCCCeEEEEecCCCCcc-ccceeEEEEEEEeCC
Q psy669 95 ESYFNIDIGSGLISTTGRKLDRE-NQAEHVLEVTVTDDG 132 (248)
Q Consensus 95 ~~~F~id~~tg~i~~~~~~ld~e-~~~~~~l~v~a~d~~ 132 (248)
.....+|+.+|.| .= ..+.. ....|.+.|.|+|..
T Consensus 13 P~gLs~d~~tG~i-sG--tp~~~~~~G~y~~~vtatd~~ 48 (49)
T PF05345_consen 13 PSGLSLDPSTGTI-SG--TPTSSVQPGTYTFTVTATDGS 48 (49)
T ss_pred CCcEEEeCCCCEE-Ee--ecCCCccccEEEEEEEEEcCC
Confidence 4678999999999 32 23333 335899999999863
No 21
>TIGR00845 caca sodium/calcium exchanger 1. This model is specific for the eukaryotic sodium ion/calcium ion exchangers of the Caca family
Probab=79.57 E-value=68 Score=31.80 Aligned_cols=57 Identities=9% Similarity=0.094 Sum_probs=33.1
Q ss_pred eEEEEEEEEecCCCCCccCCCceEEEEECCCCCCeeEEEEEEEeCCCCCCceEEEEEecCC
Q psy669 33 RTDVYIQVQNVNDNVPMTLDPVYYAQIVENQSGILPIVQLAASDGDLDPDQRISYKISAGN 93 (248)
Q Consensus 33 ~~~v~i~V~DvNDn~P~f~~~~~~~~v~E~~~~g~~v~~v~a~D~D~~~~~~i~y~i~~~~ 93 (248)
....+|+|.| ||++|.|....-..+|.|+. |..-.+|.-+- +....-.+.|+...|.
T Consensus 514 ps~ATVTIlD-DD~aGIfsFe~~~~sV~Es~--G~vtvtV~Rts-Ga~G~VtV~Y~T~dGT 570 (928)
T TIGR00845 514 PNTATVTILD-DDHAGIFTFEEDVFHVSESI--GIMEVKVLRTS-GARGTVIVPYRTVEGT 570 (928)
T ss_pred CceEEEEEec-CcccCcccccCceEEEEcCC--CEEEEEEEEcC-CCCeeEEEEEEeecCc
Confidence 4466777777 68899887766677888884 44433332221 1111224567666543
No 22
>TIGR03660 T1SS_rpt_143 T1SS-143 repeat domain. This model represents a domain of about 143 amino acids that may occur singly or in up to 23 tandem repeats in very large proteins in the genus Vibrio, and in related species such as Legionella pneumophila, Photobacterium profundum, Rhodopseudomonas palustris, Shewanella pealeana, and Aeromonas hydrophila. Proteins with these domains represent a subset of a broader set of proteins with a particular signal for type 1 secretion, consisting of several glycine-rich repeats modeled by pfam00353, followed by a C-terminal domain modeled by TIGR03661. Proteins with this domain tend to share several properties with the RtxA (Repeats in Toxin) protein of Vibrio cholerae, including a large size often containing tandemly repeated domains and a C-terminal signal for type 1 secretion.
Probab=77.85 E-value=14 Score=27.59 Aligned_cols=56 Identities=16% Similarity=0.156 Sum_probs=36.9
Q ss_pred cEEccCCCCcc---CcEEEEEEEEEECCCCCCeeeEEEEEEEEecCCCCCccCCCceEEEEECCC
Q psy669 2 IRTSTILDVET---KTHYWLTVFAQDHGVVPLHARTDVYIQVQNVNDNVPMTLDPVYYAQIVENQ 63 (248)
Q Consensus 2 i~~~~~LD~E~---~~~y~l~v~a~D~~~~~~~~~~~v~i~V~DvNDn~P~f~~~~~~~~v~E~~ 63 (248)
..+.++||+.. .-...|.|.|+|..+... ...+.|+|.| | .|...... ..+|.|..
T Consensus 70 ftL~~~lDH~~g~d~l~l~~~v~a~D~DGD~s--~~~l~VtI~D--D-~P~~~~~~-~~~V~E~~ 128 (137)
T TIGR03660 70 FTLEGPLDHAAGSDELTLNFPIIATDFDGDTS--SITLPVTIVD--D-VPTITDVD-ALTVDEDD 128 (137)
T ss_pred EEEcccccCCCCCceEEEeeeEEEEeCCCCcc--ccEEEEEEEC--C-CCeecccc-ceEEeccc
Confidence 35678888854 334788899999775433 4577787777 4 46665543 36787754
No 23
>PF07495 Y_Y_Y: Y_Y_Y domain; InterPro: IPR011123 This region is mostly found at the end of the beta propellers (IPR011110 from INTERPRO) in a family of two component regulators. However they are also found tandemly repeated in Q891H4 from SWISSPROT without other signal conduction domains being present. It is named after the conserved tyrosines found in the alignment. The exact function is not known.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=74.98 E-value=18 Score=22.61 Aligned_cols=60 Identities=18% Similarity=0.271 Sum_probs=33.4
Q ss_pred CceEEEEEecCCCCCcEEEeCCCeEEEEecCCCCccccceeEEEEEEEeCCCCCCCceEEEEEEEEE
Q psy669 82 DQRISYKISAGNPESYFNIDIGSGLISTTGRKLDRENQAEHVLEVTVTDDGAPAPLSSTTRVIISVE 148 (248)
Q Consensus 82 ~~~i~y~i~~~~~~~~F~id~~tg~i~~~~~~ld~e~~~~~~l~v~a~d~~~p~~~~~~~~v~I~V~ 148 (248)
+-..+|+|.+.+ ..+..+...+-.+ .. ..| ....|.|.|.|.|.... .......+.|.|.
T Consensus 7 ~~~Y~Y~l~g~d-~~W~~~~~~~~~~-~~-~~L---~~G~Y~l~V~a~~~~~~-~~~~~~~l~i~I~ 66 (66)
T PF07495_consen 7 NIRYRYRLEGFD-DEWITLGSYSNSI-SY-TNL---PPGKYTLEVRAKDNNGK-WSSDEKSLTITIL 66 (66)
T ss_dssp TEEEEEEEETTE-SSEEEESSTS-EE-EE-ES-----SEEEEEEEEEEETTS--B-SS-EEEEEEEE
T ss_pred ceEEEEEEECCC-CeEEECCCCcEEE-EE-EeC---CCEEEEEEEEEECCCCC-cCcccEEEEEEEC
Confidence 456778886433 5566665553244 11 223 57789999999996543 2222266666653
No 24
>TIGR03660 T1SS_rpt_143 T1SS-143 repeat domain. This model represents a domain of about 143 amino acids that may occur singly or in up to 23 tandem repeats in very large proteins in the genus Vibrio, and in related species such as Legionella pneumophila, Photobacterium profundum, Rhodopseudomonas palustris, Shewanella pealeana, and Aeromonas hydrophila. Proteins with these domains represent a subset of a broader set of proteins with a particular signal for type 1 secretion, consisting of several glycine-rich repeats modeled by pfam00353, followed by a C-terminal domain modeled by TIGR03661. Proteins with this domain tend to share several properties with the RtxA (Repeats in Toxin) protein of Vibrio cholerae, including a large size often containing tandemly repeated domains and a C-terminal signal for type 1 secretion.
Probab=72.74 E-value=23 Score=26.36 Aligned_cols=54 Identities=19% Similarity=0.227 Sum_probs=34.8
Q ss_pred EecCCCCccc---cceeEEEEEEEeCCCCCCCceEEEEEEEEEecCCCCCeeccCeeEEEecCC
Q psy669 109 TTGRKLDREN---QAEHVLEVTVTDDGAPAPLSSTTRVIISVEDVNDNAPEFEQNFYHVNIPET 169 (248)
Q Consensus 109 ~~~~~ld~e~---~~~~~l~v~a~d~~~p~~~~~~~~v~I~V~d~Nd~~P~f~~~~y~~~v~E~ 169 (248)
++..+||+.. .-...|.|.|+|... -.+...+.|+|.| | .|+..... ...|.|+
T Consensus 71 tL~~~lDH~~g~d~l~l~~~v~a~D~DG---D~s~~~l~VtI~D--D-~P~~~~~~-~~~V~E~ 127 (137)
T TIGR03660 71 TLEGPLDHAAGSDELTLNFPIIATDFDG---DTSSITLPVTIVD--D-VPTITDVD-ALTVDED 127 (137)
T ss_pred EEcccccCCCCCceEEEeeeEEEEeCCC---CccccEEEEEEEC--C-CCeecccc-ceEEecc
Confidence 5556677643 334678899999652 2334588888887 4 57765443 4678775
No 25
>PF07495 Y_Y_Y: Y_Y_Y domain; InterPro: IPR011123 This region is mostly found at the end of the beta propellers (IPR011110 from INTERPRO) in a family of two component regulators. However they are also found tandemly repeated in Q891H4 from SWISSPROT without other signal conduction domains being present. It is named after the conserved tyrosines found in the alignment. The exact function is not known.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=72.36 E-value=9.4 Score=23.92 Aligned_cols=29 Identities=21% Similarity=0.132 Sum_probs=18.7
Q ss_pred CcEEEEEEEEEECCCCCCeeeEEEEEEEE
Q psy669 13 KTHYWLTVFAQDHGVVPLHARTDVYIQVQ 41 (248)
Q Consensus 13 ~~~y~l~v~a~D~~~~~~~~~~~v~i~V~ 41 (248)
...|.|.|+|.|..+.......++.|.|.
T Consensus 38 ~G~Y~l~V~a~~~~~~~~~~~~~l~i~I~ 66 (66)
T PF07495_consen 38 PGKYTLEVRAKDNNGKWSSDEKSLTITIL 66 (66)
T ss_dssp SEEEEEEEEEEETTS-B-SS-EEEEEEEE
T ss_pred CEEEEEEEEEECCCCCcCcccEEEEEEEC
Confidence 35699999999977554444467777663
No 26
>KOG3597|consensus
Probab=66.61 E-value=88 Score=28.27 Aligned_cols=59 Identities=20% Similarity=0.229 Sum_probs=44.5
Q ss_pred eeEEEEEEEEecCCCCCccCCCceEEEEECCCCCCeeEEEEEEEeCCCCCCceEEEEEec
Q psy669 32 ARTDVYIQVQNVNDNVPMTLDPVYYAQIVENQSGILPIVQLAASDGDLDPDQRISYKISA 91 (248)
Q Consensus 32 ~~~~v~i~V~DvNDn~P~f~~~~~~~~v~E~~~~g~~v~~v~a~D~D~~~~~~i~y~i~~ 91 (248)
.+..+.|.|..+||.+..+....+..-+.|+...-.....+.+.|+|.. -..+.|++..
T Consensus 24 ~~~~~~i~v~pvndpp~~~~~~~~~l~~~~~~~k~l~~~~l~~~d~d~~-~~~l~f~v~~ 82 (442)
T KOG3597|consen 24 QTDVLRIHVNPVNDPPSLIFPSGSLLVILEGGQKVLDPELLTAADPDSA-PLPLEFQVLG 82 (442)
T ss_pred EEeeecccccccCCCcceeecccceEEeecCCceeccceEeeccCCCCC-ccceEEEEcc
Confidence 4666789999999999888777777888888655444567888888865 3457788864
No 27
>PF13750 Big_3_3: Bacterial Ig-like domain (group 3)
Probab=64.63 E-value=61 Score=24.71 Aligned_cols=14 Identities=36% Similarity=0.399 Sum_probs=9.5
Q ss_pred cceeEEEE-EEEeCC
Q psy669 119 QAEHVLEV-TVTDDG 132 (248)
Q Consensus 119 ~~~~~l~v-~a~d~~ 132 (248)
...|.+.+ .|.|..
T Consensus 14 dG~Y~l~~~~a~D~a 28 (158)
T PF13750_consen 14 DGSYTLTVVTATDAA 28 (158)
T ss_pred CccEEEEEEEEEecC
Confidence 45577777 677754
No 28
>PF03160 Calx-beta: Calx-beta domain; InterPro: IPR003644 The calx-beta motif is present as a tandem repeat in the cytoplasmic domains of Calx Na-Ca exchangers, which are used to expel calcium from cells. This motif overlaps domains used for calcium binding and regulation. The calx-beta motif is also present in the cytoplasmic tail of mammalian integrin-beta4, which mediates the bi-directional transfer of signals across the plasma membrane, as well as in some cyanobacterial proteins. This motif contains a series of beta-strands and turns that form a self-contained beta-sheet [, ].; GO: 0007154 cell communication, 0016021 integral to membrane; PDB: 3H6A_B 3FSO_A 3FQ4_B 2DPK_A 2QVM_A 3GIN_B 2QVK_A 2FWU_A 2FWS_A 3E9U_A ....
Probab=63.54 E-value=45 Score=22.79 Aligned_cols=52 Identities=17% Similarity=0.206 Sum_probs=28.6
Q ss_pred EEEEEecCCCCCccCCCceEEEEECCCCCCeeEEEEEEEeCCCCCCceEEEEEecC
Q psy669 37 YIQVQNVNDNVPMTLDPVYYAQIVENQSGILPIVQLAASDGDLDPDQRISYKISAG 92 (248)
Q Consensus 37 ~i~V~DvNDn~P~f~~~~~~~~v~E~~~~g~~v~~v~a~D~D~~~~~~i~y~i~~~ 92 (248)
+|+|.| ||.+ .+....-...+.|+. |..-..|.-...+....-.+.|....+
T Consensus 2 tvtI~d-~d~~-~v~f~~~~~~v~E~~--~~~~v~V~~~~~~~~~~v~v~~~~~~g 53 (100)
T PF03160_consen 2 TVTILD-DDDP-TVSFSSPSYTVSEGD--GTVTVTVTRSGGSLDGPVTVNYSTVDG 53 (100)
T ss_dssp EEEEE--TTSE-EEEESSSEEEEETTS--SEEEEEEEEESS-TSSEEEEEEEEEES
T ss_pred EEEEEC-CCCC-EEEEeCCEEEEEeCC--CEEEEEEEEcccCCCcceEEEEEEeCC
Confidence 467777 5555 766666667888885 334444444433322234556666544
No 29
>PF10365 DUF2436: Domain of unknown function (DUF2436); InterPro: IPR018832 Gingipains R and K are endopeptidases with specificity for arginyl and lysyl bonds, respectively. Like other cysteine peptidases, they require reducing conditions for activity. They are maximally active at approximately neutral pH. Gingipains R and K are secreted by the bacterium Porphyromonas gingivalis (Bacteroides gingivalis). The bacterium is a major pathogen in periodontal disease, and the many ways in which the activities of the gingipains may contribute to the disease processes have been reviewed []. These enzymes are also involved in the hemagglutinating activity of the organisms. This entry represents a central region found in gingipain K peptidases, active on lysyl bonds; they belong to the MEROPS peptidase family C25 (gingipain family, clan CD).
Probab=55.38 E-value=80 Score=23.57 Aligned_cols=88 Identities=23% Similarity=0.200 Sum_probs=50.1
Q ss_pred CCCCCccCCCceEEEEECCCCCCeeEEEEEEEeCCC---CCCceEEEEEecCCCCCcEEEeCCCeEEEEecCCCCccccc
Q psy669 44 NDNVPMTLDPVYYAQIVENQSGILPIVQLAASDGDL---DPDQRISYKISAGNPESYFNIDIGSGLISTTGRKLDRENQA 120 (248)
Q Consensus 44 NDn~P~f~~~~~~~~v~E~~~~g~~v~~v~a~D~D~---~~~~~i~y~i~~~~~~~~F~id~~tg~i~~~~~~ld~e~~~ 120 (248)
|+++|.=.-..|+..|++|+-+-..-..+ ..|... -+.+..-|.|....+.....|-...|.=..-....-+|..+
T Consensus 66 n~~~pa~ly~~FEYkiP~NADps~tpq~m-v~dG~~~i~IPaG~YDy~I~~P~~~~kiwIaGd~g~~~tr~dDy~fEAGK 144 (161)
T PF10365_consen 66 NCNVPANLYDPFEYKIPANADPSTTPQNM-VVDGEASIDIPAGTYDYCIAAPQPGGKIWIAGDGGDGPTRGDDYVFEAGK 144 (161)
T ss_pred CCCCChhhcccceEeccCCCCCccCcceE-EecCceEEEecCceeEEEEecCCCCCeEEEecCCCCCCccccceEEecCC
Confidence 56677555567788889887653221111 111111 12456678887666656666655543220111123458888
Q ss_pred eeEEEEEEEeCC
Q psy669 121 EHVLEVTVTDDG 132 (248)
Q Consensus 121 ~~~l~v~a~d~~ 132 (248)
.|.+++.....|
T Consensus 145 tY~ftm~~~g~g 156 (161)
T PF10365_consen 145 TYRFTMKRVGSG 156 (161)
T ss_pred EEEEEEEeccCC
Confidence 999999887655
No 30
>cd02848 Chitinase_N_term Chitinase N-terminus domain. Chitinases hydrolyze the abundant natural biopolymer chitin, producing smaller chito-oligosaccharides. Chitin consists of multiple N-acetyl-D-glucosamine (NAG) residues connected via beta-1,4-glycosidic linkages and is an important structural element of fungal cell wall and arthropod exoskeletons. On the basis of the mode of chitin hydrolysis, chitinases are classified as random, endo-, and exo-chitinases and based on sequence criteria, chitinases belong to families 18 and 19 of glycosyl hydrolases. The N-terminus of chitinase may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitob
Probab=52.71 E-value=42 Score=23.81 Aligned_cols=33 Identities=12% Similarity=0.156 Sum_probs=22.0
Q ss_pred CCCccccceeEEEEEEEeCCCCCCCceEEEEEEEEE
Q psy669 113 KLDRENQAEHVLEVTVTDDGAPAPLSSTTRVIISVE 148 (248)
Q Consensus 113 ~ld~e~~~~~~l~v~a~d~~~p~~~~~~~~v~I~V~ 148 (248)
.+++.+...|.+.|.++|... -+..+.+.|.|-
T Consensus 73 t~~v~kgG~y~m~V~lCn~dG---CS~S~~~~I~VA 105 (106)
T cd02848 73 TFKVGKGGRYQMQVALCNGDG---CSTSAAKEIVVA 105 (106)
T ss_pred EEEeCCCCeEEEEEEEECCCC---ccCcCCEEEEec
Confidence 355777889999999998653 233445555554
No 31
>PF12245 Big_3_2: Bacterial Ig-like domain (group 3); InterPro: IPR022038 This family of proteins is found in bacteria. They have two conserved sequence motifs: AGN and GMT.
Probab=52.34 E-value=55 Score=20.34 Aligned_cols=30 Identities=23% Similarity=0.210 Sum_probs=20.6
Q ss_pred CcEEEEEEEEEECCCCCCeeeEEEEEEEEecC
Q psy669 13 KTHYWLTVFAQDHGVVPLHARTDVYIQVQNVN 44 (248)
Q Consensus 13 ~~~y~l~v~a~D~~~~~~~~~~~v~i~V~DvN 44 (248)
...|.|.+.|.|..+ ..+.......+.|..
T Consensus 22 dg~yt~~v~a~D~AG--N~~~~~~~~~i~d~~ 51 (60)
T PF12245_consen 22 DGEYTLTVTATDKAG--NTSSSTTQIVIVDNT 51 (60)
T ss_pred CccEEEEEEEEECCC--CEEEeeeEEEEEcCC
Confidence 567999999999874 445555555555553
No 32
>PF08758 Cadherin_pro: Cadherin prodomain like; InterPro: IPR014868 Cadherins are a group of proteins that mediate calcium dependent cell-cell adhesion. They are activated through cleavage of a prosequence in the late Golgi. This protein corresponds to the folded region of the prosequence, and is termed the prodomain. The prodomain shows structural resemblance to the cadherin domain, but lacks all the features known to be important for cadherin-cadherin interactions []. ; GO: 0007155 cell adhesion, 0016021 integral to membrane; PDB: 1OP4_A.
Probab=49.52 E-value=13 Score=25.51 Aligned_cols=33 Identities=30% Similarity=0.597 Sum_probs=14.2
Q ss_pred CCeeccCeeEEEecCCCCCCeeEEEEEEEeCCC
Q psy669 154 APEFEQNFYHVNIPETRHRDEPLSQVSLISTTG 186 (248)
Q Consensus 154 ~P~f~~~~y~~~v~E~~~~g~~v~~v~A~D~D~ 186 (248)
.|-|.+..|.+.|+.+...|..|++|.-.|=.+
T Consensus 3 ~pGF~~~~~~~~Vp~~l~~g~~lg~V~f~dC~~ 35 (90)
T PF08758_consen 3 RPGFSQKKYTFEVPSNLEAGQPLGKVNFEDCTG 35 (90)
T ss_dssp --B--S-EEEE----SS-SS--EEE---B--SS
T ss_pred cCCcccceEEEEcCchhhCCcEEEEEEeccCCC
Confidence 478999999999999999999999998777654
No 33
>smart00089 PKD Repeats in polycystic kidney disease 1 (PKD1) and other proteins. Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.
Probab=47.24 E-value=58 Score=20.96 Aligned_cols=28 Identities=18% Similarity=0.111 Sum_probs=19.3
Q ss_pred CccCcEEEEEEEEEECCCCCCeeeEEEEEEE
Q psy669 10 VETKTHYWLTVFAQDHGVVPLHARTDVYIQV 40 (248)
Q Consensus 10 ~E~~~~y~l~v~a~D~~~~~~~~~~~v~i~V 40 (248)
|+....|.+++.+.|..+ +.+..+.|.|
T Consensus 51 y~~~G~y~v~l~v~n~~g---~~~~~~~i~v 78 (79)
T smart00089 51 YTKPGTYTVTLTVTNAVG---SASATVTVVV 78 (79)
T ss_pred eCCCcEEEEEEEEEcCCC---cEEEEEEEEE
Confidence 455667899999988764 4555665554
No 34
>cd00146 PKD polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.
Probab=44.64 E-value=54 Score=21.22 Aligned_cols=30 Identities=20% Similarity=0.114 Sum_probs=20.0
Q ss_pred CCccccceeEEEEEEEeCCCCCCCceEEEEEEE
Q psy669 114 LDRENQAEHVLEVTVTDDGAPAPLSSTTRVIIS 146 (248)
Q Consensus 114 ld~e~~~~~~l~v~a~d~~~p~~~~~~~~v~I~ 146 (248)
..|.....|.+++.++|.. ..+....+.|.
T Consensus 51 ~~y~~~G~y~v~l~v~d~~---g~~~~~~~~V~ 80 (81)
T cd00146 51 HTYTKPGTYTVTLTVTNAV---GSSSTKTTTVV 80 (81)
T ss_pred EEcCCCcEEEEEEEEEeCC---CCEEEEEEEEE
Confidence 3466788899999999974 23344444444
No 35
>cd00146 PKD polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.
Probab=44.19 E-value=66 Score=20.78 Aligned_cols=28 Identities=7% Similarity=0.004 Sum_probs=18.0
Q ss_pred CccCcEEEEEEEEEECCCCCCeeeEEEEEE
Q psy669 10 VETKTHYWLTVFAQDHGVVPLHARTDVYIQ 39 (248)
Q Consensus 10 ~E~~~~y~l~v~a~D~~~~~~~~~~~v~i~ 39 (248)
|.+...|.+++.+.|.. ..+.+..+.|.
T Consensus 53 y~~~G~y~v~l~v~d~~--g~~~~~~~~V~ 80 (81)
T cd00146 53 YTKPGTYTVTLTVTNAV--GSSSTKTTTVV 80 (81)
T ss_pred cCCCcEEEEEEEEEeCC--CCEEEEEEEEE
Confidence 55667799999999875 23444444443
No 36
>KOG4221|consensus
Probab=44.11 E-value=3.6e+02 Score=27.89 Aligned_cols=40 Identities=18% Similarity=0.174 Sum_probs=30.3
Q ss_pred CccCcEEEEEEEEEECCCCCCeeeEEEEEEEEecCCCCCc
Q psy669 10 VETKTHYWLTVFAQDHGVVPLHARTDVYIQVQNVNDNVPM 49 (248)
Q Consensus 10 ~E~~~~y~l~v~a~D~~~~~~~~~~~v~i~V~DvNDn~P~ 49 (248)
.|....|.+.|.|....+.+.++...-.++..|+=+-+|.
T Consensus 581 L~k~TeY~~~vvA~N~~G~g~sS~~i~V~Tlsd~PsaPP~ 620 (1381)
T KOG4221|consen 581 LEKYTEYSIRVVAYNSAGSGVSSADITVRTLSDVPSAPPQ 620 (1381)
T ss_pred CCCccceEEEEEEecCCCCCCCCCceEEEeccCCCCCCCc
Confidence 4566779999999998888776666556677777766665
No 37
>PF12245 Big_3_2: Bacterial Ig-like domain (group 3); InterPro: IPR022038 This family of proteins is found in bacteria. They have two conserved sequence motifs: AGN and GMT.
Probab=42.65 E-value=82 Score=19.54 Aligned_cols=30 Identities=30% Similarity=0.314 Sum_probs=20.8
Q ss_pred cceeEEEEEEEeCCCCCCCceEEEEEEEEEecC
Q psy669 119 QAEHVLEVTVTDDGAPAPLSSTTRVIISVEDVN 151 (248)
Q Consensus 119 ~~~~~l~v~a~d~~~p~~~~~~~~v~I~V~d~N 151 (248)
...|.+.+.|+|.. +..+.......+.|..
T Consensus 22 dg~yt~~v~a~D~A---GN~~~~~~~~~i~d~~ 51 (60)
T PF12245_consen 22 DGEYTLTVTATDKA---GNTSSSTTQIVIVDNT 51 (60)
T ss_pred CccEEEEEEEEECC---CCEEEeeeEEEEEcCC
Confidence 56799999999976 4455555555555543
No 38
>TIGR01965 VCBS_repeat VCBS repeat. This domain of about 100 residues is found multiple (up to 35) copies in long proteins from several species of Vibrio, Colwellia, Bradyrhizobium, and Shewanella (hence the name VCBS) and in smaller copy numbers in proteins from several other bacteria. The large protein size and repeat copy numbers, species distribution, and suggested activities of several member proteins suggests a role for this domain in adhesion.
Probab=35.89 E-value=1e+02 Score=21.60 Aligned_cols=41 Identities=20% Similarity=0.183 Sum_probs=27.5
Q ss_pred cEEEEEEEEEECCCCCCeeeEEEEEEEEecCCCCCccCCCceEEEEECC
Q psy669 14 THYWLTVFAQDHGVVPLHARTDVYIQVQNVNDNVPMTLDPVYYAQIVEN 62 (248)
Q Consensus 14 ~~y~l~v~a~D~~~~~~~~~~~v~i~V~DvNDn~P~f~~~~~~~~v~E~ 62 (248)
..-+|++.+.|+ .+.+|.|+|.-.|| +|+..-. -...+.|+
T Consensus 57 ~tdsFtvtv~DG------tt~~vtItI~GtND-apvi~~~-~~g~v~ED 97 (99)
T TIGR01965 57 LTDTFTVTSADG------TSQTVTITITGAND-AAVIGGA-DTGSVTED 97 (99)
T ss_pred EEEEEEEEEeCC------CeEEEEEEEEccCC-CCEEecc-cceeEecC
Confidence 345677778875 28889999999997 5544332 24566665
No 39
>cd02848 Chitinase_N_term Chitinase N-terminus domain. Chitinases hydrolyze the abundant natural biopolymer chitin, producing smaller chito-oligosaccharides. Chitin consists of multiple N-acetyl-D-glucosamine (NAG) residues connected via beta-1,4-glycosidic linkages and is an important structural element of fungal cell wall and arthropod exoskeletons. On the basis of the mode of chitin hydrolysis, chitinases are classified as random, endo-, and exo-chitinases and based on sequence criteria, chitinases belong to families 18 and 19 of glycosyl hydrolases. The N-terminus of chitinase may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitob
Probab=35.20 E-value=56 Score=23.19 Aligned_cols=30 Identities=13% Similarity=0.142 Sum_probs=20.3
Q ss_pred CCCCccCcEEEEEEEEEECCCCCCeeeEEE
Q psy669 7 ILDVETKTHYWLTVFAQDHGVVPLHARTDV 36 (248)
Q Consensus 7 ~LD~E~~~~y~l~v~a~D~~~~~~~~~~~v 36 (248)
.+++.+...|.++|+++|..+...++...|
T Consensus 73 t~~v~kgG~y~m~V~lCn~dGCS~S~~~~I 102 (106)
T cd02848 73 TFKVGKGGRYQMQVALCNGDGCSTSAAKEI 102 (106)
T ss_pred EEEeCCCCeEEEEEEEECCCCccCcCCEEE
Confidence 355666677999999999876544444443
No 40
>PF05895 DUF859: Siphovirus protein of unknown function (DUF859); InterPro: IPR008577 This entry is represented by Streptococcus phage 7201, Orf39. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This family consists of several uncharacterised proteins from a number of the Siphoviruses as well as some bacterial proteins from Streptococcus species. Some of the members of this family are described as putative minor structural proteins.
Probab=34.49 E-value=4.2e+02 Score=25.30 Aligned_cols=110 Identities=17% Similarity=0.130 Sum_probs=57.8
Q ss_pred CcEEEEEEEEEECCCCCCeeeEEEEEEEEecCCCCCccCCCceEEEEECCCCCCeeEEEEEEE------eCCCCCCceEE
Q psy669 13 KTHYWLTVFAQDHGVVPLHARTDVYIQVQNVNDNVPMTLDPVYYAQIVENQSGILPIVQLAAS------DGDLDPDQRIS 86 (248)
Q Consensus 13 ~~~y~l~v~a~D~~~~~~~~~~~v~i~V~DvNDn~P~f~~~~~~~~v~E~~~~g~~v~~v~a~------D~D~~~~~~i~ 86 (248)
.....+++.++|..+ ..+.+.+..|+|++= ++|.+....++..-.++ .......|. +.-......+.
T Consensus 297 ~G~~Ti~atVtDSRG-r~S~~~~~tItVl~Y--~~P~lsfsv~R~~~~~~----~~~v~~~a~Iapl~v~g~qKN~~~lt 369 (624)
T PF05895_consen 297 SGSATIRATVTDSRG-RTSDPKTKTITVLEY--SPPTLSFSVYRCGSSGN----TLTVTRNAKIAPLTVNGVQKNTMTLT 369 (624)
T ss_pred CceEEEEEEEEECCC-ccCCceEEEEEEEEc--CCCcEEEEEEEeCCCCc----EEEEEEEEEEeEEEEcccccceEEEE
Confidence 345889999999763 224567889999988 57877433333322222 111111221 11111123566
Q ss_pred EEEecCCCCCcEEEeCCC--eEE----------EEecCCCCccccceeEEEEEEEeCC
Q psy669 87 YKISAGNPESYFNIDIGS--GLI----------STTGRKLDRENQAEHVLEVTVTDDG 132 (248)
Q Consensus 87 y~i~~~~~~~~F~id~~t--g~i----------~~~~~~ld~e~~~~~~l~v~a~d~~ 132 (248)
|+...-. ...|.+|... |.. ..+... +.....|.+.+.++|.-
T Consensus 370 ~~~a~~g-t~~~t~d~~~a~~~~s~~s~~~~~~~~L~g~--y~~~kSy~V~~~l~D~F 424 (624)
T PF05895_consen 370 FKVAPLG-TGTFTTDNGSASGTWSSISELTNSSANLGGT--YDAEKSYDVRGTLSDKF 424 (624)
T ss_pred EEEEEcC-cceEEEEccccccceeeeeeecccceeeccc--cCCCceEEEEEEEEEEe
Confidence 7665422 3456665431 111 022233 34556799999999964
No 41
>KOG3597|consensus
Probab=32.93 E-value=66 Score=29.01 Aligned_cols=52 Identities=17% Similarity=0.042 Sum_probs=38.6
Q ss_pred eEEEEEEEEEecCCCCCeeccCeeEEEecCCCCCCeeEEEEEEEeCCCCCCC
Q psy669 139 STTRVIISVEDVNDNAPEFEQNFYHVNIPETRHRDEPLSQVSLISTTGRKLD 190 (248)
Q Consensus 139 ~~~~v~I~V~d~Nd~~P~f~~~~y~~~v~E~~~~g~~v~~v~A~D~D~~~ld 190 (248)
.+....|.|.-+||.+..+....+.+-+.|+...-.....+.+.|+|.+++.
T Consensus 24 ~~~~~~i~v~pvndpp~~~~~~~~~l~~~~~~~k~l~~~~l~~~d~d~~~~~ 75 (442)
T KOG3597|consen 24 QTDVLRIHVNPVNDPPSLIFPSGSLLVILEGGQKVLDPELLTAADPDSAPLP 75 (442)
T ss_pred EEeeecccccccCCCcceeecccceEEeecCCceeccceEeeccCCCCCccc
Confidence 3566788999999988777777777888887555555567888999975444
No 42
>PF02494 HYR: HYR domain; InterPro: IPR003410 This domain is known as the HYR (Hyalin Repeat) domain, after the protein hyalin that is composed exclusively of this repeat. This domain probably corresponds to a new superfamily in the immunoglobulin fold. The function of this domain is uncertain it may be involved in cell adhesion. In the Sushi repeat-containing protein (SrpX), this domain is found between two sushi repeats.
Probab=32.90 E-value=97 Score=20.25 Aligned_cols=25 Identities=24% Similarity=0.251 Sum_probs=19.5
Q ss_pred ceeEEEEEEEeCCCCCCCceEEEEEEEE
Q psy669 120 AEHVLEVTVTDDGAPAPLSSTTRVIISV 147 (248)
Q Consensus 120 ~~~~l~v~a~d~~~p~~~~~~~~v~I~V 147 (248)
..|.++..|+|.. +..+++.+.|+|
T Consensus 57 G~t~V~ytA~D~~---GN~a~C~f~V~V 81 (81)
T PF02494_consen 57 GTTTVTYTATDAA---GNSATCSFTVTV 81 (81)
T ss_pred ceEEEEEEEEECC---CCEEEEEEEEEC
Confidence 4588999999975 566788888765
No 43
>PF08329 ChitinaseA_N: Chitinase A, N-terminal domain; InterPro: IPR013540 This domain is found in a number of bacterial chitinases and similar viral proteins. It is organised into a fibronectin III module domain-like fold, comprising only beta strands. Its function is not known, but it may be involved in interaction with the enzyme substrate, chitin [, ]. It is separated by a hinge region from the catalytic domain (IPR001223 from INTERPRO); this hinge region is probably mobile, allowing the N-terminal domain to have different relative positions in solution []. ; GO: 0004568 chitinase activity; PDB: 2WLY_A 1EDQ_A 2WM0_A 1X6N_A 1NH6_A 2WK2_A 1EHN_A 2WLZ_A 1EIB_A 1FFR_A ....
Probab=30.35 E-value=2.3e+02 Score=21.03 Aligned_cols=37 Identities=14% Similarity=0.210 Sum_probs=23.6
Q ss_pred CCCccccceeEEEEEEEeCCCCCCCceEEEEEEEEEecCC
Q psy669 113 KLDRENQAEHVLEVTVTDDGAPAPLSSTTRVIISVEDVND 152 (248)
Q Consensus 113 ~ld~e~~~~~~l~v~a~d~~~p~~~~~~~~v~I~V~d~Nd 152 (248)
.+.......|.+.|.+++.. +-+....+.|.|.|..-
T Consensus 76 ~~~~~~gG~y~~~VeLCN~~---GCS~S~~~~V~VaDTDG 112 (133)
T PF08329_consen 76 TFTVTKGGRYQMQVELCNAD---GCSTSAPVEVVVADTDG 112 (133)
T ss_dssp EEEE-S-EEEEEEEEEEETT---EEEE---EEEEEE-TTS
T ss_pred EEEecCCCEEEEEEEEECCC---CcccCCCEEEEEeCCCc
Confidence 45566788899999999965 33556688889998754
No 44
>COG4288 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=27.49 E-value=1.9e+02 Score=20.56 Aligned_cols=41 Identities=24% Similarity=0.324 Sum_probs=27.7
Q ss_pred eeEEEEEEEEecCCCCCccCCCceEEEEECCCCCCeeEEEEEEE
Q psy669 32 ARTDVYIQVQNVNDNVPMTLDPVYYAQIVENQSGILPIVQLAAS 75 (248)
Q Consensus 32 ~~~~v~i~V~DvNDn~P~f~~~~~~~~v~E~~~~g~~v~~v~a~ 75 (248)
...+++++..|.+|-.+.-....|..-+.-|.|+ |.|..-.
T Consensus 66 arrt~hlRfndL~dpe~iP~d~~yasviesnvPv---VvQ~tRL 106 (124)
T COG4288 66 ARRTLHLRFNDLGDPEAIPKDTPYASVIESNVPV---VVQLTRL 106 (124)
T ss_pred cceeEEEEecccCCcccCCCCCchhhheecCCce---EEEEEEe
Confidence 4567888888998877777777777766666653 4444433
No 45
>PF09100 Qn_am_d_aIV: Quinohemoprotein amine dehydrogenase, alpha subunit domain IV; InterPro: IPR015184 This domain is predominantly found in the prokaryotic protein quinohemoprotein amine dehydrogenase, adopting an immunoglobulin-like beta-sandwich fold, with seven strands arranged into two beta sheets; the fold is possibly related to the immunoglobulin and/or fibronectin type III superfamilies. The precise function of this domain has not, as yet, been defined []. ; PDB: 1JMZ_A 1JMX_A 1PBY_A 1JJU_A.
Probab=26.95 E-value=1.5e+02 Score=21.73 Aligned_cols=35 Identities=26% Similarity=0.426 Sum_probs=17.9
Q ss_pred ceeEEEEEEEeCCCCCCCceEEEEEEEEEecCCCCC
Q psy669 120 AEHVLEVTVTDDGAPAPLSSTTRVIISVEDVNDNAP 155 (248)
Q Consensus 120 ~~~~l~v~a~d~~~p~~~~~~~~v~I~V~d~Nd~~P 155 (248)
..=+|.|.|+-...-..++..+.+.|+|.+-|+ +|
T Consensus 98 N~Gnl~VvAtv~d~~~~l~~e~~liVtVqr~~~-pp 132 (133)
T PF09100_consen 98 NAGNLKVVATVKDGGKPLTGEAHLIVTVQRWNN-PP 132 (133)
T ss_dssp S-EEEEEEEEETTTT---EEEEEEEEE---S----S
T ss_pred CcccEEEEEEEccCCcccceeEeEEEEeecccC-CC
Confidence 344677777654332267899999999988876 55
No 46
>KOG4221|consensus
Probab=25.54 E-value=7.9e+02 Score=25.67 Aligned_cols=67 Identities=16% Similarity=0.234 Sum_probs=42.6
Q ss_pred EEEEEec--CCCCCcEEEeCCCeEEEEecCCCCccccceeEEEEEEEeCCCCCCCceEEEEEEEEEecCCCCCe
Q psy669 85 ISYKISA--GNPESYFNIDIGSGLISTTGRKLDRENQAEHVLEVTVTDDGAPAPLSSTTRVIISVEDVNDNAPE 156 (248)
Q Consensus 85 i~y~i~~--~~~~~~F~id~~tg~i~~~~~~ld~e~~~~~~l~v~a~d~~~p~~~~~~~~v~I~V~d~Nd~~P~ 156 (248)
..|++.. ++.+..+.++.++-.. ++. +.+....|.+.|.|.....+ +.++...-.++..|+-+-||.
T Consensus 552 ~~yk~~ys~~~~~~~~~~~~n~~e~-ti~---gL~k~TeY~~~vvA~N~~G~-g~sS~~i~V~Tlsd~PsaPP~ 620 (1381)
T KOG4221|consen 552 TGYKLFYSEDDTGKELRVENNATEY-TIN---GLEKYTEYSIRVVAYNSAGS-GVSSADITVRTLSDVPSAPPQ 620 (1381)
T ss_pred eEEEEEEEcCCCCceEEEecCccEE-Eee---cCCCccceEEEEEEecCCCC-CCCCCceEEEeccCCCCCCCc
Confidence 3455542 2445688888776665 443 34667789999999997666 444433333455577776776
No 47
>PF13754 Big_3_4: Bacterial Ig-like domain (group 3)
Probab=20.60 E-value=1.9e+02 Score=17.38 Aligned_cols=18 Identities=28% Similarity=0.222 Sum_probs=13.6
Q ss_pred cCcEEEEEEEEEECCCCC
Q psy669 12 TKTHYWLTVFAQDHGVVP 29 (248)
Q Consensus 12 ~~~~y~l~v~a~D~~~~~ 29 (248)
....|.+++.|+|..+..
T Consensus 22 ~dG~y~itv~a~D~AGN~ 39 (54)
T PF13754_consen 22 ADGTYTITVTATDAAGNT 39 (54)
T ss_pred CCccEEEEEEEEeCCCCC
Confidence 356799999999976543
No 48
>PF03160 Calx-beta: Calx-beta domain; InterPro: IPR003644 The calx-beta motif is present as a tandem repeat in the cytoplasmic domains of Calx Na-Ca exchangers, which are used to expel calcium from cells. This motif overlaps domains used for calcium binding and regulation. The calx-beta motif is also present in the cytoplasmic tail of mammalian integrin-beta4, which mediates the bi-directional transfer of signals across the plasma membrane, as well as in some cyanobacterial proteins. This motif contains a series of beta-strands and turns that form a self-contained beta-sheet [, ].; GO: 0007154 cell communication, 0016021 integral to membrane; PDB: 3H6A_B 3FSO_A 3FQ4_B 2DPK_A 2QVM_A 3GIN_B 2QVK_A 2FWU_A 2FWS_A 3E9U_A ....
Probab=20.47 E-value=2.8e+02 Score=18.65 Aligned_cols=35 Identities=11% Similarity=0.123 Sum_probs=18.2
Q ss_pred EEEEEecCCCCCeeccCeeEEEecCCCCCCeeEEEEEEE
Q psy669 144 IISVEDVNDNAPEFEQNFYHVNIPETRHRDEPLSQVSLI 182 (248)
Q Consensus 144 ~I~V~d~Nd~~P~f~~~~y~~~v~E~~~~g~~v~~v~A~ 182 (248)
+|.|.| ||.+ .+.-..-...+.|+. |..-..|.-.
T Consensus 2 tvtI~d-~d~~-~v~f~~~~~~v~E~~--~~~~v~V~~~ 36 (100)
T PF03160_consen 2 TVTILD-DDDP-TVSFSSPSYTVSEGD--GTVTVTVTRS 36 (100)
T ss_dssp EEEEE--TTSE-EEEESSSEEEEETTS--SEEEEEEEEE
T ss_pred EEEEEC-CCCC-EEEEeCCEEEEEeCC--CEEEEEEEEc
Confidence 466777 5644 655444456677763 4344444433
Done!