Query psy5886
Match_columns 184
No_of_seqs 159 out of 1128
Neff 8.9
Searched_HMMs 46136
Date Fri Aug 16 20:38:35 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy5886.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/5886hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 cd00031 CA Cadherin repeat dom 100.0 1.3E-29 2.8E-34 192.3 23.7 160 19-180 2-184 (199)
2 KOG4289|consensus 100.0 7E-31 1.5E-35 233.8 17.0 169 10-181 265-456 (2531)
3 KOG4289|consensus 100.0 1.8E-27 3.8E-32 212.4 18.0 168 12-182 373-559 (2531)
4 KOG1219|consensus 99.9 1.3E-26 2.8E-31 212.6 19.7 163 17-181 956-1141(4289)
5 KOG1219|consensus 99.9 8.7E-26 1.9E-30 207.2 21.3 164 15-182 2577-2762(4289)
6 PF00028 Cadherin: Cadherin do 99.7 1.9E-16 4.1E-21 106.4 10.7 75 107-181 1-79 (93)
7 KOG1834|consensus 99.6 9.2E-15 2E-19 124.3 16.1 161 17-179 35-228 (952)
8 PF00028 Cadherin: Cadherin do 99.5 4.3E-13 9.4E-18 89.9 11.4 73 19-93 1-93 (93)
9 cd00031 CA Cadherin repeat dom 99.4 4.6E-12 1E-16 95.8 11.9 76 106-181 1-80 (199)
10 smart00112 CA Cadherin repeats 99.3 3.2E-11 7E-16 78.3 8.6 60 39-100 1-79 (79)
11 smart00112 CA Cadherin repeats 99.1 1.9E-10 4.1E-15 74.6 7.1 55 126-180 1-58 (79)
12 PF08758 Cadherin_pro: Cadheri 97.6 0.00039 8.5E-09 46.2 7.3 78 99-181 3-81 (90)
13 PF08266 Cadherin_2: Cadherin- 97.4 0.001 2.3E-08 43.6 6.5 58 107-165 3-66 (84)
14 KOG1834|consensus 95.7 0.096 2.1E-06 46.3 9.3 81 9-95 145-245 (952)
15 smart00736 CADG Dystroglycan-t 95.7 0.2 4.4E-06 33.4 9.2 57 38-97 24-96 (97)
16 PF08266 Cadherin_2: Cadherin- 94.3 0.066 1.4E-06 35.1 3.4 72 19-93 3-81 (84)
17 TIGR01965 VCBS_repeat VCBS rep 94.2 0.9 1.9E-05 30.7 8.8 37 72-114 61-97 (99)
18 PF08758 Cadherin_pro: Cadheri 93.7 0.88 1.9E-05 30.2 8.0 48 15-70 7-54 (90)
19 smart00736 CADG Dystroglycan-t 90.9 1.9 4.2E-05 28.6 7.1 54 124-182 23-82 (97)
20 PF05345 He_PIG: Putative Ig d 89.9 1.6 3.4E-05 25.4 5.3 35 144-180 14-49 (49)
21 TIGR00845 caca sodium/calcium 85.2 34 0.00074 32.3 14.7 44 12-59 402-445 (928)
22 PF13750 Big_3_3: Bacterial Ig 84.6 13 0.00029 27.1 13.7 107 72-181 19-137 (158)
23 TIGR03660 T1SS_rpt_143 T1SS-14 77.7 23 0.0005 25.3 8.9 75 28-115 52-128 (137)
24 KOG3597|consensus 71.9 29 0.00063 29.9 8.1 95 84-181 24-137 (442)
25 PF07495 Y_Y_Y: Y_Y_Y domain; 54.0 45 0.00097 19.8 6.5 45 132-181 6-53 (66)
26 PF05895 DUF859: Siphovirus pr 46.1 2.3E+02 0.005 25.7 11.5 101 72-181 302-426 (624)
27 cd00146 PKD polycystic kidney 42.4 33 0.00071 21.3 2.9 21 162-182 52-72 (81)
28 COG2706 3-carboxymuconate cycl 36.0 2.6E+02 0.0056 23.4 10.9 67 89-159 214-284 (346)
29 PF13754 Big_3_4: Bacterial Ig 33.4 1E+02 0.0022 17.9 4.3 17 165-181 22-38 (54)
30 PF00801 PKD: PKD domain; Int 33.3 57 0.0012 19.6 2.8 19 163-181 48-66 (69)
31 PF03160 Calx-beta: Calx-beta 32.0 1.4E+02 0.0031 19.2 6.4 51 89-142 2-52 (100)
32 PRK08577 hypothetical protein; 29.4 1.4E+02 0.0031 20.9 4.7 38 142-180 29-68 (136)
33 PF02494 HYR: HYR domain; Int 28.9 1.5E+02 0.0033 18.5 6.1 30 62-92 52-81 (81)
34 smart00089 PKD Repeats in poly 23.9 1E+02 0.0022 18.8 2.8 21 160-180 48-68 (79)
35 PF13287 Fn3_assoc: Fn3 associ 22.1 1.9E+02 0.0041 17.2 5.1 36 46-81 9-49 (59)
36 cd00917 PG-PI_TP The phosphati 20.4 2.2E+02 0.0048 19.5 4.2 36 3-42 76-111 (122)
No 1
>cd00031 CA Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here
Probab=99.97 E-value=1.3e-29 Score=192.26 Aligned_cols=160 Identities=41% Similarity=0.554 Sum_probs=147.8
Q ss_pred EEEEEeCCCCCCcEEEEEEEEeCCCCCCcEEEEEEecCCCCCcccEEEcCCCC-----------------eEEEEEeCCC
Q psy5886 19 YGAMARDKHELGVTLMQVKATDKDSGKFGLVEYRLLDNNTDAASYFNLDPATE-----------------HQGPIQDEGG 81 (184)
Q Consensus 19 y~~~v~E~~~~gt~v~~v~a~D~D~~~n~~~~y~i~~~~~~~~~~F~id~~tg-----------------~~v~a~d~~~ 81 (184)
|.+.|.||++.|+.|+++.|.|+|.+.|+.++|+|.++... .+|.|++.+| +.|+|.|.|.
T Consensus 2 ~~~~i~En~~~g~~v~~~~a~D~D~~~~~~~~y~i~~~~~~--~~F~i~~~tG~l~~~~~lD~e~~~~~~l~v~a~D~g~ 79 (199)
T cd00031 2 YSVSVPENAPPGTVVGTVSATDPDSGENGRVTYSILGGNED--GLFSIDPNTGVITTTKPLDREEQSEYTLTVVASDGGG 79 (199)
T ss_pred eEEEEeCCCCCCCEEEEEEEECCCCCCCceEEEEEeCCCCc--ccEEEeCCCCEEEECCCCCCcCCceEEEEEEEEECCc
Confidence 78999999999999999999999999899999999998543 7899999988 4789999775
Q ss_pred C--eeEEEEEEEEEeCCCCCCEEecCceEEEEeCCCC-CcEEEEEEEEeCCCCCCcEEEEEeCCCC---CEEEECCCcEE
Q psy5886 82 L--SSTATVNIRVTDINDKNPEFQSLPYEFSVPEGQD-GLAVGRVHATDADEGQNAVVYYSVPEDI---PFVVDANTGDI 155 (184)
Q Consensus 82 ~--~~~~~v~I~V~d~ND~~P~f~~~~~~~~v~e~~~-g~~v~~v~a~D~D~~~n~~i~y~l~~~~---~F~Id~~tG~i 155 (184)
. ++...+.|.|.|+||++|.|....|.+.+.|+.+ |+.++++.|+|+|.+.++.++|+|..+. .|.|++.+|.|
T Consensus 80 ~~~~~~~~v~I~V~d~Nd~~P~~~~~~~~~~v~e~~~~~~~i~~~~a~D~D~~~~~~~~y~l~~~~~~~~f~i~~~~G~i 159 (199)
T cd00031 80 PPLSSTATVTVTVLDVNDNPPVFEQSSYEASVPENAPPGTVVGTVTATDADSGENAKLTYSILSGNDKELFSIDPNTGII 159 (199)
T ss_pred CcceeEEEEEEEEccCCCCCCcccccceEEEEeCCCCCCCEEEEEEEEcCCCCCCccEEEEEeCCCCCCEEEEeCCceEE
Confidence 4 3899999999999999999999999999999995 9999999999999988999999998754 79999999999
Q ss_pred EeCccCCccCCCeEEEEEEEEeCCC
Q psy5886 156 RTNKALDYEKQQLYLERVFLISKFA 180 (184)
Q Consensus 156 ~~~~~LD~E~~~~y~l~V~a~D~~~ 180 (184)
.+.+.||+|....|+|.|.|+|..+
T Consensus 160 ~~~~~ld~e~~~~~~l~v~a~D~~~ 184 (199)
T cd00031 160 TLAKPLDREEKSSYELTVVATDGGG 184 (199)
T ss_pred EeCCccCCccCceEEEEEEEEECCC
Confidence 9999999999999999999999873
No 2
>KOG4289|consensus
Probab=99.97 E-value=7e-31 Score=233.85 Aligned_cols=169 Identities=32% Similarity=0.419 Sum_probs=156.2
Q ss_pred CCCCCCCceEEEEEeCCCCCCcEEEEEEEEeCCCCCCcEEEEEEecCCCCCcccEEEcCCCC-----------------e
Q psy5886 10 LEMTSDSLFYGAMARDKHELGVTLMQVKATDKDSGKFGLVEYRLLDNNTDAASYFNLDPATE-----------------H 72 (184)
Q Consensus 10 ~~~~~~~~~y~~~v~E~~~~gt~v~~v~a~D~D~~~n~~~~y~i~~~~~~~~~~F~id~~tg-----------------~ 72 (184)
-.+.++|..|.-++.||.++|+.|.++.|+|.|.++|++++|++.+++ +...|.||+++| +
T Consensus 265 hsPvFEq~~Y~e~lREn~evGy~vLtvrAtD~Dsp~Nani~Yrl~eg~--~~~~f~in~rSGvI~T~a~lDRE~~~~y~L 342 (2531)
T KOG4289|consen 265 HSPVFEQDEYREELRENLEVGYEVLTVRATDGDSPPNANIRYRLLEGN--AKNVFEINPRSGVISTRAPLDREELESYQL 342 (2531)
T ss_pred CCcccchhHHHHHHhhccccCceEEEEEeccCCCCCCCceEEEecCCC--ccceeEEcCccceeeccCccCHHhhhheEE
Confidence 345678889999999999999999999999999999999999999984 468899999999 4
Q ss_pred EEEEEeCCC--CeeEEEEEEEEEeCCCCCCEEecCceEEEEeCCC-CCcEEEEEEEEeCCCCCCcEEEEEeCCC---CCE
Q psy5886 73 QGPIQDEGG--LSSTATVNIRVTDINDKNPEFQSLPYEFSVPEGQ-DGLAVGRVHATDADEGQNAVVYYSVPED---IPF 146 (184)
Q Consensus 73 ~v~a~d~~~--~~~~~~v~I~V~d~ND~~P~f~~~~~~~~v~e~~-~g~~v~~v~a~D~D~~~n~~i~y~l~~~---~~F 146 (184)
.|.|+|+|. ...++.|.|.|.|+|||+|+|..+.|.+.|.|+. +++.|++|+|+|+|.|.|+.+-|+|.++ +.|
T Consensus 343 ~VeAsDqG~~pgp~Ta~V~itV~D~NDNaPqFse~~Yvvqv~Edvt~~avvlrV~AtDrD~g~Ng~VHYsi~Sgn~~G~f 422 (2531)
T KOG4289|consen 343 DVEASDQGRPPGPRTAMVEITVEDENDNAPQFSEKRYVVQVREDVTPPAVVLRVTATDRDKGTNGKVHYSIASGNGRGQF 422 (2531)
T ss_pred EEEeccCCCCCCCceEEEEEEEEecCCCCccccccceEEEecccCCCCceEEEEEecccCCCcCceEEEEeeccCccccE
Confidence 799999983 3458999999999999999999999999999998 6999999999999999999999999886 569
Q ss_pred EEECCCcEEEeCccCCccCCCeEEEEEEEEeCCCC
Q psy5886 147 VVDANTGDIRTNKALDYEKQQLYLERVFLISKFAT 181 (184)
Q Consensus 147 ~Id~~tG~i~~~~~LD~E~~~~y~l~V~a~D~~~~ 181 (184)
.||..||+|.+..+||+|.. .|++.|+|.|+.-.
T Consensus 423 ~id~~tGel~vv~plD~e~~-~ytl~IrAqDggrP 456 (2531)
T KOG4289|consen 423 YIDSLTGELDVVEPLDFENS-EYTLRIRAQDGGRP 456 (2531)
T ss_pred EEecccceEEEeccccccCC-eeEEEEEcccCCCC
Confidence 99999999999999999998 99999999998754
No 3
>KOG4289|consensus
Probab=99.95 E-value=1.8e-27 Score=212.41 Aligned_cols=168 Identities=33% Similarity=0.415 Sum_probs=155.8
Q ss_pred CCCCCceEEEEEeCCCCCCcEEEEEEEEeCCCCCCcEEEEEEecCCCCCcccEEEcCCCC----------------eEEE
Q psy5886 12 MTSDSLFYGAMARDKHELGVTLMQVKATDKDSGKFGLVEYRLLDNNTDAASYFNLDPATE----------------HQGP 75 (184)
Q Consensus 12 ~~~~~~~y~~~v~E~~~~gt~v~~v~a~D~D~~~n~~~~y~i~~~~~~~~~~F~id~~tg----------------~~v~ 75 (184)
.|++ +.|.+.|.|+..++++|++|.|+|.|.|.|+.+-|+|.+++ +.+.|.||..|| +.|+
T Consensus 373 qFse-~~Yvvqv~Edvt~~avvlrV~AtDrD~g~Ng~VHYsi~Sgn--~~G~f~id~~tGel~vv~plD~e~~~ytl~Ir 449 (2531)
T KOG4289|consen 373 QFSE-KRYVVQVREDVTPPAVVLRVTATDRDKGTNGKVHYSIASGN--GRGQFYIDSLTGELDVVEPLDFENSEYTLRIR 449 (2531)
T ss_pred cccc-cceEEEecccCCCCceEEEEEecccCCCcCceEEEEeeccC--ccccEEEecccceEEEeccccccCCeeEEEEE
Confidence 3444 48999999999999999999999999999999999999995 458899999999 4789
Q ss_pred EEeCC--CCeeEEEEEEEEEeCCCCCCEEecCceEEEEeCCCC-CcEEEEEEEEeCCCCCCcEEEEEeCCCCCEEEECCC
Q psy5886 76 IQDEG--GLSSTATVNIRVTDINDKNPEFQSLPYEFSVPEGQD-GLAVGRVHATDADEGQNAVVYYSVPEDIPFVVDANT 152 (184)
Q Consensus 76 a~d~~--~~~~~~~v~I~V~d~ND~~P~f~~~~~~~~v~e~~~-g~~v~~v~a~D~D~~~n~~i~y~l~~~~~F~Id~~t 152 (184)
|.|.| .++.+.-+.|.|+|+|||+|.|...++.+.|.|+.+ |..++.++|.|.|.|+|+++.|++.+-.+|.|+..+
T Consensus 450 AqDggrPpLsn~sgl~iqVlDINDhaPifvstpfq~tvlEnv~lg~~v~~vqaidadsg~na~l~y~laG~~pf~I~~~S 529 (2531)
T KOG4289|consen 450 AQDGGRPPLSNTSGLVIQVLDINDHAPIFVSTPFQATVLENVPLGYLVCHVQAIDADSGENARLHYSLAGVGPFQINNGS 529 (2531)
T ss_pred cccCCCCCccCCCceEEEEEecCCCCceeEechhhhhhhhcccccceEEEEecccCCCCcccceeeeeccCCCeeEecCC
Confidence 99988 378888888999999999999999999999999998 999999999999999999999999998999999999
Q ss_pred cEEEeCccCCccCCCeEEEEEEEEeCCCCC
Q psy5886 153 GDIRTNKALDYEKQQLYLERVFLISKFATS 182 (184)
Q Consensus 153 G~i~~~~~LD~E~~~~y~l~V~a~D~~~~s 182 (184)
|-|++.+.||||+...|.|.|.|+|..-.+
T Consensus 530 G~Itvtk~ldrEt~~~ysl~V~ard~gtp~ 559 (2531)
T KOG4289|consen 530 GWITVTKELDRETVEHYSLGVEARDHGTPP 559 (2531)
T ss_pred ceEEEeecccccccceEEEEEEEcCCCCCc
Confidence 999999999999999999999999976543
No 4
>KOG1219|consensus
Probab=99.95 E-value=1.3e-26 Score=212.58 Aligned_cols=163 Identities=28% Similarity=0.402 Sum_probs=152.3
Q ss_pred ceEEEEEeCCCCCCcEEEEEEEEeCCCCCCcEEEEEEecCCCCCcccEEEcCCCC-----------------eEEEEEeC
Q psy5886 17 LFYGAMARDKHELGVTLMQVKATDKDSGKFGLVEYRLLDNNTDAASYFNLDPATE-----------------HQGPIQDE 79 (184)
Q Consensus 17 ~~y~~~v~E~~~~gt~v~~v~a~D~D~~~n~~~~y~i~~~~~~~~~~F~id~~tg-----------------~~v~a~d~ 79 (184)
-.-++.|.||+|.||.|+++.|.|.|.|..+.++|+|..+ ++-+.|+||..+| ++|.|.|.
T Consensus 956 ~v~e~~V~EnapiGT~vi~i~A~dedsgldg~l~Y~I~~g--dg~g~FsId~~tG~irTl~~lDrE~ks~YwltveA~D~ 1033 (4289)
T KOG1219|consen 956 FVTEGHVLENAPIGTIVIRIQARDEDSGLDGELSYKIRTG--DGDGIFSIDSTTGSIRTLKALDREKKSSYWLTVEAKDL 1033 (4289)
T ss_pred eeeeeeEeecCCcceEEEEEEEecCCCCccceEEEEEEcC--CcceeEEecCCcceEeechhhchhhcceEEEEEEEEec
Confidence 3566899999999999999999999999999999999999 4458899999999 58999999
Q ss_pred C--CCeeEEEEEEEEEeCCCCCCEEecCceEEEEeCCCC-CcEEEEEEEEeCCCCCCcEEEEEeCCCC---CEEEECCCc
Q psy5886 80 G--GLSSTATVNIRVTDINDKNPEFQSLPYEFSVPEGQD-GLAVGRVHATDADEGQNAVVYYSVPEDI---PFVVDANTG 153 (184)
Q Consensus 80 ~--~~~~~~~v~I~V~d~ND~~P~f~~~~~~~~v~e~~~-g~~v~~v~a~D~D~~~n~~i~y~l~~~~---~F~Id~~tG 153 (184)
| .+++.+.+.|.|.|+|||+|+|.+..|..+|.|++| +..|.++.|.|+|...|+++.|+|..|+ .|.|++.||
T Consensus 1034 gt~~~ssv~~vyI~ieDvNDn~Pq~s~pvy~asI~enSp~~vsivq~ea~D~Dsssn~kLmykI~sGnyq~FF~Id~~TG 1113 (4289)
T KOG1219|consen 1034 GTVPLSSVCEVYIEIEDVNDNVPQFSSPVYYASISENSPETVSIVQAEANDPDSSSNQKLMYKITSGNYQGFFQIDPETG 1113 (4289)
T ss_pred CCCccccceeEEEEEEecCCCCcccCCceEeeeeccCCCCceEEEEeccCCCCcccCcceEEEEccCCccceEEEccccc
Confidence 9 488999999999999999999999999999999998 9999999999999888999999999875 599999999
Q ss_pred EEEeCccCCccCCCeEEEEEEEEeCCCC
Q psy5886 154 DIRTNKALDYEKQQLYLERVFLISKFAT 181 (184)
Q Consensus 154 ~i~~~~~LD~E~~~~y~l~V~a~D~~~~ 181 (184)
.|++.+.||||.+..|.|.|.++|..-.
T Consensus 1114 ~iTt~r~LDRE~qdEHiLeVTi~D~gep 1141 (4289)
T KOG1219|consen 1114 LITTIRRLDREKQDEHILEVTIQDNGEP 1141 (4289)
T ss_pred eeeeehhhcccccccceEEEEEecCCCC
Confidence 9999999999999999999999998754
No 5
>KOG1219|consensus
Probab=99.94 E-value=8.7e-26 Score=207.21 Aligned_cols=164 Identities=29% Similarity=0.367 Sum_probs=145.3
Q ss_pred CCceEEEEEeCCCCCCcEEEEEEEEeCCCCCCcEEEEEEecCCCCC---cccEEEcCCCC-----------------eEE
Q psy5886 15 DSLFYGAMARDKHELGVTLMQVKATDKDSGKFGLVEYRLLDNNTDA---ASYFNLDPATE-----------------HQG 74 (184)
Q Consensus 15 ~~~~y~~~v~E~~~~gt~v~~v~a~D~D~~~n~~~~y~i~~~~~~~---~~~F~id~~tg-----------------~~v 74 (184)
..+.|.++|+|+.++|+.|++++|.|.|... .|+++-++... ...|++|+.|| +.|
T Consensus 2577 Sep~y~fsvpEDv~vG~~Ig~v~a~~a~~~~----i~~~v~~gt~Esn~d~~Fsvdr~TG~i~v~ksLD~E~kk~yqi~v 2652 (4289)
T KOG1219|consen 2577 SEPIYTFSVPEDVPVGEEIGQVSASDADEHV----IYSLVLGGTPESNPDLPFSVDRNTGMIKVNKSLDHEKKKSYQIKV 2652 (4289)
T ss_pred cCceEEEeccccCCCCCeeeEEeecccCCce----EEEEEeCCCCCCCCCCceEEcCCCceEEeccccchhhhceEEEEE
Confidence 3459999999999999999999999998653 56665443322 24599999999 357
Q ss_pred EEEeCCCCeeEEEEEEEEEeCCCCCCEEecCceEEEEeCCCC-CcEEEEEEEEeCCCCCCcEEEEEeCCC-CCEEEECCC
Q psy5886 75 PIQDEGGLSSTATVNIRVTDINDKNPEFQSLPYEFSVPEGQD-GLAVGRVHATDADEGQNAVVYYSVPED-IPFVVDANT 152 (184)
Q Consensus 75 ~a~d~~~~~~~~~v~I~V~d~ND~~P~f~~~~~~~~v~e~~~-g~~v~~v~a~D~D~~~n~~i~y~l~~~-~~F~Id~~t 152 (184)
.|.+.+..-+.+.|.|.|.|+|||+|.|..++|.+.+.||.| |+.|++++|.|.|.+.|++|+|+|... .+|.|+++|
T Consensus 2653 ~a~~~~~vva~tsv~vqVkDvNDNaPvFe~d~y~f~i~En~pvGtsV~qf~AsD~Ds~~nGqirysl~~~v~yF~In~et 2732 (4289)
T KOG1219|consen 2653 KATCGQWVVAETSVFVQVKDVNDNAPVFEKDPYLFIIEENSPVGTSVIQFHASDMDSGNNGQIRYSLTSPVPYFAINPET 2732 (4289)
T ss_pred EeecCCceEEEEEEEEEeecccCCCccccCCceeEEEeccCCCCceEEEEEeeccCCCCCceEEEEEcCCcceEEEcCCC
Confidence 888777778899999999999999999999999999999998 999999999999999999999999875 469999999
Q ss_pred cEEEeCccCCccCCCeEEEEEEEEeCCCCC
Q psy5886 153 GDIRTNKALDYEKQQLYLERVFLISKFATS 182 (184)
Q Consensus 153 G~i~~~~~LD~E~~~~y~l~V~a~D~~~~s 182 (184)
|-|++...||+|++..|.|.|.|+|-...+
T Consensus 2733 GwlTt~~eld~ek~d~y~lkv~AtDhG~~s 2762 (4289)
T KOG1219|consen 2733 GWLTTLFELDLEKQDLYSLKVVATDHGVPS 2762 (4289)
T ss_pred CeeeehhhhccccCCceEEEEEEecCCccc
Confidence 999999999999999999999999988764
No 6
>PF00028 Cadherin: Cadherin domain; InterPro: IPR002126 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion.; GO: 0005509 calcium ion binding, 0007156 homophilic cell adhesion, 0016020 membrane; PDB: 2A4E_A 2A4C_B 2O72_A 2QVI_A 1NCJ_A 3Q2W_A 3Q2N_A 3LNH_B 3LNI_A 3Q2L_A ....
Probab=99.71 E-value=1.9e-16 Score=106.39 Aligned_cols=75 Identities=41% Similarity=0.566 Sum_probs=70.0
Q ss_pred eEEEEeCCCC-CcEEEEEEEEeCCCCCCcEEEEEeCCC---CCEEEECCCcEEEeCccCCccCCCeEEEEEEEEeCCCC
Q psy5886 107 YEFSVPEGQD-GLAVGRVHATDADEGQNAVVYYSVPED---IPFVVDANTGDIRTNKALDYEKQQLYLERVFLISKFAT 181 (184)
Q Consensus 107 ~~~~v~e~~~-g~~v~~v~a~D~D~~~n~~i~y~l~~~---~~F~Id~~tG~i~~~~~LD~E~~~~y~l~V~a~D~~~~ 181 (184)
|.+.|+|+++ |+.++++.|.|+|.+.|+.+.|+|..+ ..|.|++.+|.|++.+.||||....|.|.|.|+|+.+.
T Consensus 1 Y~~~v~E~~~~g~~v~~v~a~D~D~~~n~~i~y~i~~~~~~~~F~I~~~tg~i~~~~~LD~E~~~~y~l~v~a~D~~~~ 79 (93)
T PF00028_consen 1 YSFSVPENAPPGTVVGQVTATDPDSGPNSQITYSILGGNPDGLFSIDPNTGEISLKKPLDRETQSSYQLTVRATDSGGS 79 (93)
T ss_dssp EEEEEETTGSTSSEEEEEEEEESSTSTTSSEEEEEEETTSTTSEEEETTTTEEEESSSSCTTTTSEEEEEEEEEETTTS
T ss_pred CEEEEECCCCCCCEEEEEEEEeCCCCCCceEEEEEecCcccCceEEeeeeeccccceecCcccCCEEEEEEEEEECCCC
Confidence 7899999996 999999999999999999999999875 47999999999999999999999999999999999443
No 7
>KOG1834|consensus
Probab=99.64 E-value=9.2e-15 Score=124.33 Aligned_cols=161 Identities=28% Similarity=0.332 Sum_probs=129.6
Q ss_pred ceEEEEEeCCCCCCcEEEEEEEEeCCCCC---CcEEEEEEecCCCCCcccEEEcCCCC-------------------eEE
Q psy5886 17 LFYGAMARDKHELGVTLMQVKATDKDSGK---FGLVEYRLLDNNTDAASYFNLDPATE-------------------HQG 74 (184)
Q Consensus 17 ~~y~~~v~E~~~~gt~v~~v~a~D~D~~~---n~~~~y~i~~~~~~~~~~F~id~~tg-------------------~~v 74 (184)
..|...|.||-..=....-+.|-|+|.+. ..-.-|.|.+.+-. -..--+|..|| ++|
T Consensus 35 ~ey~gvV~Endntvll~Ppl~aLdkdaplr~ageiC~fklhgq~vP-FdavVvdK~TGegvlRaK~~lDCelqkeytf~i 113 (952)
T KOG1834|consen 35 EEYHGVVTENDNTVLLDPPLAALDKDAPLRYAGEICGFKLHGQPVP-FDAVVVDKYTGEGVLRAKEPLDCELQKEYTFTI 113 (952)
T ss_pred cceeEEEEeCCceEEeCCCeeeecCCCCcccccccceeEecCCCCC-ceEEEEeccCCceEEeecCcccccccccceEEE
Confidence 48999999996433333347889999753 23467777775321 12234666776 589
Q ss_pred EEEeCCC--------CeeEEEEEEEEEeCCCCCCEEecCceEEEEeCCCCCcEEEEEEEEeCCCC-CCcEE-EEEeCC-C
Q psy5886 75 PIQDEGG--------LSSTATVNIRVTDINDKNPEFQSLPYEFSVPEGQDGLAVGRVHATDADEG-QNAVV-YYSVPE-D 143 (184)
Q Consensus 75 ~a~d~~~--------~~~~~~v~I~V~d~ND~~P~f~~~~~~~~v~e~~~g~~v~~v~a~D~D~~-~n~~i-~y~l~~-~ 143 (184)
+|.|+|. .+-.++|.|+|.|+|..+|.|..+-|.+.|.|+.....|++|.|.|.|-+ ++++| .|.|.. +
T Consensus 114 QAydCg~gpdgtn~kKShkatvhIrVkDvNe~AP~f~ep~Yka~V~EGK~yd~il~veAiD~DCspq~sqIC~YEI~t~d 193 (952)
T KOG1834|consen 114 QAYDCGNGPDGTNTKKSHKATVHIRVKDVNEFAPVFKEPWYKAHVTEGKVYDSILRVEAIDKDCSPQYSQICEYEITTPD 193 (952)
T ss_pred EEEecCCCCCccccccccceEEEEEeccccccCchhcccceeeEEecceeeeeeEEEEeecCCCCCcccceeEEEecCCC
Confidence 9999872 46678999999999999999999999999999999999999999999998 56655 888875 6
Q ss_pred CCEEEECCCcEEEeCccCCccCCCeEEEEEEEEeCC
Q psy5886 144 IPFVVDANTGDIRTNKALDYEKQQLYLERVFLISKF 179 (184)
Q Consensus 144 ~~F~Id~~tG~i~~~~~LD~E~~~~y~l~V~a~D~~ 179 (184)
-+|.|| +.|.|+...+|+|.....|.|+|.|-|-.
T Consensus 194 ~PFaId-n~G~irnTekLny~ke~~Y~ltVtAyDCg 228 (952)
T KOG1834|consen 194 VPFAID-NDGNIRNTEKLNYTKEHQYKLTVTAYDCG 228 (952)
T ss_pred CceEEc-CCCccccccccccccceeEEEEEEEEecc
Confidence 789998 77999999999999999999999999854
No 8
>PF00028 Cadherin: Cadherin domain; InterPro: IPR002126 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion.; GO: 0005509 calcium ion binding, 0007156 homophilic cell adhesion, 0016020 membrane; PDB: 2A4E_A 2A4C_B 2O72_A 2QVI_A 1NCJ_A 3Q2W_A 3Q2N_A 3LNH_B 3LNI_A 3Q2L_A ....
Probab=99.51 E-value=4.3e-13 Score=89.92 Aligned_cols=73 Identities=38% Similarity=0.502 Sum_probs=62.7
Q ss_pred EEEEEeCCCCCCcEEEEEEEEeCCCCCCcEEEEEEecCCCCCcccEEEcCCCC-----------------eEEEEEeC-C
Q psy5886 19 YGAMARDKHELGVTLMQVKATDKDSGKFGLVEYRLLDNNTDAASYFNLDPATE-----------------HQGPIQDE-G 80 (184)
Q Consensus 19 y~~~v~E~~~~gt~v~~v~a~D~D~~~n~~~~y~i~~~~~~~~~~F~id~~tg-----------------~~v~a~d~-~ 80 (184)
|+++|+|++++|+.|+++.|.|+|.+.|+.+.|+|.+++. ..+|.|++.+| +.|.|+|. +
T Consensus 1 Y~~~v~E~~~~g~~v~~v~a~D~D~~~n~~i~y~i~~~~~--~~~F~I~~~tg~i~~~~~LD~E~~~~y~l~v~a~D~~~ 78 (93)
T PF00028_consen 1 YSFSVPENAPPGTVVGQVTATDPDSGPNSQITYSILGGNP--DGLFSIDPNTGEISLKKPLDRETQSSYQLTVRATDSGG 78 (93)
T ss_dssp EEEEEETTGSTSSEEEEEEEEESSTSTTSSEEEEEEETTS--TTSEEEETTTTEEEESSSSCTTTTSEEEEEEEEEETTT
T ss_pred CEEEEECCCCCCCEEEEEEEEeCCCCCCceEEEEEecCcc--cCceEEeeeeeccccceecCcccCCEEEEEEEEEECCC
Confidence 8999999999999999999999999999999999999865 58999999999 46788887 4
Q ss_pred C--CeeEEEEEEEEE
Q psy5886 81 G--LSSTATVNIRVT 93 (184)
Q Consensus 81 ~--~~~~~~v~I~V~ 93 (184)
. ++++++|.|+|.
T Consensus 79 ~~~~~~~~~V~I~V~ 93 (93)
T PF00028_consen 79 SPPLSSTATVTINVL 93 (93)
T ss_dssp SSEEEEEEEEEEEEE
T ss_pred CCCCEEEEEEEEEEC
Confidence 3 566677777663
No 9
>cd00031 CA Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here
Probab=99.41 E-value=4.6e-12 Score=95.76 Aligned_cols=76 Identities=42% Similarity=0.520 Sum_probs=69.6
Q ss_pred ceEEEEeCCCC-CcEEEEEEEEeCCCCCCcEEEEEeCCC---CCEEEECCCcEEEeCccCCccCCCeEEEEEEEEeCCCC
Q psy5886 106 PYEFSVPEGQD-GLAVGRVHATDADEGQNAVVYYSVPED---IPFVVDANTGDIRTNKALDYEKQQLYLERVFLISKFAT 181 (184)
Q Consensus 106 ~~~~~v~e~~~-g~~v~~v~a~D~D~~~n~~i~y~l~~~---~~F~Id~~tG~i~~~~~LD~E~~~~y~l~V~a~D~~~~ 181 (184)
.|.+.++|+++ |+.|+++.|.|+|.+.|+.++|+|..+ .+|.|++.+|.|++.+.||||....|.|.|+|+|+...
T Consensus 1 ~~~~~i~En~~~g~~v~~~~a~D~D~~~~~~~~y~i~~~~~~~~F~i~~~tG~l~~~~~lD~e~~~~~~l~v~a~D~g~~ 80 (199)
T cd00031 1 SYSVSVPENAPPGTVVGTVSATDPDSGENGRVTYSILGGNEDGLFSIDPNTGVITTTKPLDREEQSEYTLTVVASDGGGP 80 (199)
T ss_pred CeEEEEeCCCCCCCEEEEEEEECCCCCCCceEEEEEeCCCCcccEEEeCCCCEEEECCCCCCcCCceEEEEEEEEECCcC
Confidence 37899999996 999999999999999889999999876 37999999999999999999999999999999996443
No 10
>smart00112 CA Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
Probab=99.28 E-value=3.2e-11 Score=78.29 Aligned_cols=60 Identities=50% Similarity=0.748 Sum_probs=53.3
Q ss_pred EeCCCCCCcEEEEEEecCCCCCcccEEEcCCCC-----------------eEEEEEeCCC--CeeEEEEEEEEEeCCCCC
Q psy5886 39 TDKDSGKFGLVEYRLLDNNTDAASYFNLDPATE-----------------HQGPIQDEGG--LSSTATVNIRVTDINDKN 99 (184)
Q Consensus 39 ~D~D~~~n~~~~y~i~~~~~~~~~~F~id~~tg-----------------~~v~a~d~~~--~~~~~~v~I~V~d~ND~~ 99 (184)
+|+|.|.|+.++|+|.+++.. .+|.|++.+| +.|+|.|.+. +++.++|.|.|.|+|||+
T Consensus 1 ~D~D~g~n~~i~Y~i~~~~~~--~~F~i~~~tg~i~~~~~LD~e~~~~y~l~v~a~D~~~~~~~~~~~v~I~V~D~Nd~~ 78 (79)
T smart00112 1 TDADSGENGKVTYSILSGNED--GLFSIDPETGEITTTKPLDREEQPEYTLTVEATDGGGPPLSSTATVTVTVLDVNDNA 78 (79)
T ss_pred CCCCCCcCcEEEEEEecCCCC--CEEEEeCCccEEEeCCccCeeCCCeEEEEEEEEECCCCCcccEEEEEEEEEECCCCC
Confidence 488999999999999988543 7899999998 4689999885 889999999999999999
Q ss_pred C
Q psy5886 100 P 100 (184)
Q Consensus 100 P 100 (184)
|
T Consensus 79 P 79 (79)
T smart00112 79 P 79 (79)
T ss_pred C
Confidence 8
No 11
>smart00112 CA Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
Probab=99.15 E-value=1.9e-10 Score=74.64 Aligned_cols=55 Identities=40% Similarity=0.459 Sum_probs=50.6
Q ss_pred EeCCCCCCcEEEEEeCCC---CCEEEECCCcEEEeCccCCccCCCeEEEEEEEEeCCC
Q psy5886 126 TDADEGQNAVVYYSVPED---IPFVVDANTGDIRTNKALDYEKQQLYLERVFLISKFA 180 (184)
Q Consensus 126 ~D~D~~~n~~i~y~l~~~---~~F~Id~~tG~i~~~~~LD~E~~~~y~l~V~a~D~~~ 180 (184)
+|+|.|.|+.++|+|..+ .+|.|++.+|.|++.++||||....|.|.|+|+|..+
T Consensus 1 ~D~D~g~n~~i~Y~i~~~~~~~~F~i~~~tg~i~~~~~LD~e~~~~y~l~v~a~D~~~ 58 (79)
T smart00112 1 TDADSGENGKVTYSILSGNEDGLFSIDPETGEITTTKPLDREEQPEYTLTVEATDGGG 58 (79)
T ss_pred CCCCCCcCcEEEEEEecCCCCCEEEEeCCccEEEeCCccCeeCCCeEEEEEEEEECCC
Confidence 488998899999999875 5799999999999999999999999999999999876
No 12
>PF08758 Cadherin_pro: Cadherin prodomain like; InterPro: IPR014868 Cadherins are a group of proteins that mediate calcium dependent cell-cell adhesion. They are activated through cleavage of a prosequence in the late Golgi. This protein corresponds to the folded region of the prosequence, and is termed the prodomain. The prodomain shows structural resemblance to the cadherin domain, but lacks all the features known to be important for cadherin-cadherin interactions []. ; GO: 0007155 cell adhesion, 0016021 integral to membrane; PDB: 1OP4_A.
Probab=97.63 E-value=0.00039 Score=46.23 Aligned_cols=78 Identities=19% Similarity=0.347 Sum_probs=42.5
Q ss_pred CCEEecCceEEEEeCCCC-CcEEEEEEEEeCCCCCCcEEEEEeCCCCCEEEECCCcEEEeCccCCccCCCeEEEEEEEEe
Q psy5886 99 NPEFQSLPYEFSVPEGQD-GLAVGRVHATDADEGQNAVVYYSVPEDIPFVVDANTGDIRTNKALDYEKQQLYLERVFLIS 177 (184)
Q Consensus 99 ~P~f~~~~~~~~v~e~~~-g~~v~~v~a~D~D~~~n~~i~y~l~~~~~F~Id~~tG~i~~~~~LD~E~~~~y~l~V~a~D 177 (184)
.|-|.+..|.+.|+.+.. |..|++|.-.|=.. +..+.|.-.+. .|.|. ..|.|.+++.+..... .-.+.|.|.|
T Consensus 3 ~pGF~~~~~~~~Vp~~l~~g~~lg~V~f~dC~~--~~~~~~~ssDp-dF~V~-~DGsVy~~r~v~l~~~-~~~F~V~a~D 77 (90)
T PF08758_consen 3 RPGFSQKKYTFEVPSNLEAGQPLGKVNFEDCTG--RRRVIFESSDP-DFRVL-EDGSVYAKRPVQLSSE-QRSFTVHAWD 77 (90)
T ss_dssp --B--S-EEEE----SS-SS--EEE---B--SS-----EEEE---S-EEEEE-TTTEEEEES--S-SSS--EEEEEEEEE
T ss_pred cCCcccceEEEEcCchhhCCcEEEEEEeccCCC--CCceEEecCCC-CEEEc-CCCeEEEeeeEecCCC-ceEEEEEEEC
Confidence 588999999999999974 99999999887633 46688877554 89997 5699999999877543 3579999999
Q ss_pred CCCC
Q psy5886 178 KFAT 181 (184)
Q Consensus 178 ~~~~ 181 (184)
..+.
T Consensus 78 ~~~~ 81 (90)
T PF08758_consen 78 SQTQ 81 (90)
T ss_dssp TTTT
T ss_pred CCCC
Confidence 8864
No 13
>PF08266 Cadherin_2: Cadherin-like; InterPro: IPR013164 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion. This entry represents a cadherin domain that is usually found at the N terminus of cadherin proteins.; PDB: 1WUZ_A 1WYJ_A.
Probab=97.35 E-value=0.001 Score=43.62 Aligned_cols=58 Identities=26% Similarity=0.443 Sum_probs=34.9
Q ss_pred eEEEEeCCCC-CcEEEEEEEEeCCCCCC--cEEEEEeCC---CCCEEEECCCcEEEeCccCCccC
Q psy5886 107 YEFSVPEGQD-GLAVGRVHATDADEGQN--AVVYYSVPE---DIPFVVDANTGDIRTNKALDYEK 165 (184)
Q Consensus 107 ~~~~v~e~~~-g~~v~~v~a~D~D~~~n--~~i~y~l~~---~~~F~Id~~tG~i~~~~~LD~E~ 165 (184)
..++|+|..+ |+.|+.+ |.|...... ..-.+++.. ..+|.++..+|.|++...||||.
T Consensus 3 i~YsV~EE~~~Gt~IGni-a~dL~l~~~~l~~~~~ri~s~~~~~~~~v~~~tG~L~v~~rIDRE~ 66 (84)
T PF08266_consen 3 IRYSVPEEMPPGTVIGNI-AKDLGLDPQSLSSRNFRIVSEGNSQYFRVNEKTGDLFVSERIDREE 66 (84)
T ss_dssp EEEEEESS--TT-EEEEC-CCCCT--HHHHCCTTBEEE-SSSS-SEEE-TTTSEEEESS--SCCC
T ss_pred eEEEeecCCCCCCEEEEh-HHhhCCCcccccccceEEeecCCcceeEecCCceeEEeCCccCHHH
Confidence 4678999985 9999998 344432210 112455443 35799999999999999999997
No 14
>KOG1834|consensus
Probab=95.73 E-value=0.096 Score=46.33 Aligned_cols=81 Identities=19% Similarity=0.262 Sum_probs=58.2
Q ss_pred CCCCCCCCceEEEEEeCCCCCCcEEEEEEEEeCCCCC-CcE-EEEEEecCCCCCcccEEEcCCCC---------------
Q psy5886 9 ALEMTSDSLFYGAMARDKHELGVTLMQVKATDKDSGK-FGL-VEYRLLDNNTDAASYFNLDPATE--------------- 71 (184)
Q Consensus 9 ~~~~~~~~~~y~~~v~E~~~~gt~v~~v~a~D~D~~~-n~~-~~y~i~~~~~~~~~~F~id~~tg--------------- 71 (184)
+++.+.+ +.|.+.|.|.-. =..|+++.|.|+|-+. +++ ..|.|...+ -.|.||....
T Consensus 145 ~AP~f~e-p~Yka~V~EGK~-yd~il~veAiD~DCspq~sqIC~YEI~t~d----~PFaIdn~G~irnTekLny~ke~~Y 218 (952)
T KOG1834|consen 145 FAPVFKE-PWYKAHVTEGKV-YDSILRVEAIDKDCSPQYSQICEYEITTPD----VPFAIDNDGNIRNTEKLNYTKEHQY 218 (952)
T ss_pred cCchhcc-cceeeEEeccee-eeeeEEEEeecCCCCCcccceeEEEecCCC----CceEEcCCCccccccccccccceeE
Confidence 3444444 599999999844 5569999999999654 554 588888753 4599987654
Q ss_pred -eEEEEEeCCCCe-e-EEEEEEEEEeC
Q psy5886 72 -HQGPIQDEGGLS-S-TATVNIRVTDI 95 (184)
Q Consensus 72 -~~v~a~d~~~~~-~-~~~v~I~V~d~ 95 (184)
++|.|.|+|... + -+.|+|+|...
T Consensus 219 ~ltVtAyDCg~kraa~d~lV~v~Vkp~ 245 (952)
T KOG1834|consen 219 KLTVTAYDCGKKRAASDSLVTVHVKPT 245 (952)
T ss_pred EEEEEEEecccccccCcceEEEEecCc
Confidence 589999999633 2 36788887744
No 15
>smart00736 CADG Dystroglycan-type cadherin-like domains. Cadherin-homologous domains present in metazoan dystroglycans and alpha/epsilon sarcoglycans, yeast Axl2p and in a very large protein from magnetotactic bacteria. Likely to bind calcium ions.
Probab=95.71 E-value=0.2 Score=33.45 Aligned_cols=57 Identities=26% Similarity=0.325 Sum_probs=42.8
Q ss_pred EEeCCCCCCcEEEEEEecCCC-CCcccEEEcCCCC---------------eEEEEEeCCCCeeEEEEEEEEEeCCC
Q psy5886 38 ATDKDSGKFGLVEYRLLDNNT-DAASYFNLDPATE---------------HQGPIQDEGGLSSTATVNIRVTDIND 97 (184)
Q Consensus 38 a~D~D~~~n~~~~y~i~~~~~-~~~~~F~id~~tg---------------~~v~a~d~~~~~~~~~v~I~V~d~ND 97 (184)
..|.| +..++|++...+. .-+.|...|+.++ +.|.|+|..+.+....+.|.|.+.||
T Consensus 24 F~d~d---~~~lty~~~~~~~~~lP~Wl~fd~~~~~~~GtP~~~~~g~~~i~v~a~D~~g~~~~~~f~i~V~~~~~ 96 (97)
T smart00736 24 FTDAD---GDTLTYSATLSDGSALPSWLSFDSDTGTLSGTPTNSDVGSLSLKVTATDSSGASASDTFTITVVNTND 96 (97)
T ss_pred eECCC---CCeEEEEEEeCCCCCCCCeEEEeCCCCEEEEECCCCCCcEEEEEEEEEECCCCEEEEEEEEEEeCCCC
Confidence 46776 3468888874431 2356888888776 47899998888888999999999885
No 16
>PF08266 Cadherin_2: Cadherin-like; InterPro: IPR013164 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion. This entry represents a cadherin domain that is usually found at the N terminus of cadherin proteins.; PDB: 1WUZ_A 1WYJ_A.
Probab=94.27 E-value=0.066 Score=35.07 Aligned_cols=72 Identities=17% Similarity=0.250 Sum_probs=38.7
Q ss_pred EEEEEeCCCCCCcEEEEEEEEeCCCCC--CcEEEEEEecCCCCCcccEEEcCCCCeEEEEE--eCC---CCeeEEEEEEE
Q psy5886 19 YGAMARDKHELGVTLMQVKATDKDSGK--FGLVEYRLLDNNTDAASYFNLDPATEHQGPIQ--DEG---GLSSTATVNIR 91 (184)
Q Consensus 19 y~~~v~E~~~~gt~v~~v~a~D~D~~~--n~~~~y~i~~~~~~~~~~F~id~~tg~~v~a~--d~~---~~~~~~~v~I~ 91 (184)
...+|+|..+.|+.|+.+ |.|--... -....|+|.... ...+|.++..+|...... |.. +....+.+.+.
T Consensus 3 i~YsV~EE~~~Gt~IGni-a~dL~l~~~~l~~~~~ri~s~~--~~~~~~v~~~tG~L~v~~rIDRE~LC~~~~~C~l~~e 79 (84)
T PF08266_consen 3 IRYSVPEEMPPGTVIGNI-AKDLGLDPQSLSSRNFRIVSEG--NSQYFRVNEKTGDLFVSERIDREELCGQSSPCFLSFE 79 (84)
T ss_dssp EEEEEESS--TT-EEEEC-CCCCT--HHHHCCTTBEEE-SS--SS-SEEE-TTTSEEEESS--SCCCC-TTSSS-EEEEE
T ss_pred eEEEeecCCCCCCEEEEh-HHhhCCCcccccccceEEeecC--CcceeEecCCceeEEeCCccCHHHHCCCCCCEEEEEE
Confidence 355788999999999999 44432211 112456766653 358999999999655443 322 23445666655
Q ss_pred EE
Q psy5886 92 VT 93 (184)
Q Consensus 92 V~ 93 (184)
|.
T Consensus 80 vv 81 (84)
T PF08266_consen 80 VV 81 (84)
T ss_dssp EE
T ss_pred EE
Confidence 54
No 17
>TIGR01965 VCBS_repeat VCBS repeat. This domain of about 100 residues is found multiple (up to 35) copies in long proteins from several species of Vibrio, Colwellia, Bradyrhizobium, and Shewanella (hence the name VCBS) and in smaller copy numbers in proteins from several other bacteria. The large protein size and repeat copy numbers, species distribution, and suggested activities of several member proteins suggests a role for this domain in adhesion.
Probab=94.17 E-value=0.9 Score=30.68 Aligned_cols=37 Identities=27% Similarity=0.214 Sum_probs=24.7
Q ss_pred eEEEEEeCCCCeeEEEEEEEEEeCCCCCCEEecCceEEEEeCC
Q psy5886 72 HQGPIQDEGGLSSTATVNIRVTDINDKNPEFQSLPYEFSVPEG 114 (184)
Q Consensus 72 ~~v~a~d~~~~~~~~~v~I~V~d~ND~~P~f~~~~~~~~v~e~ 114 (184)
+.+.+.|. .+.+|.|.|.-.|| +|..... -...+.|+
T Consensus 61 Ftvtv~DG----tt~~vtItI~GtND-apvi~~~-~~g~v~ED 97 (99)
T TIGR01965 61 FTVTSADG----TSQTVTITITGAND-AAVIGGA-DTGSVTED 97 (99)
T ss_pred EEEEEeCC----CeEEEEEEEEccCC-CCEEecc-cceeEecC
Confidence 46777774 37889999999997 7755432 23455554
No 18
>PF08758 Cadherin_pro: Cadherin prodomain like; InterPro: IPR014868 Cadherins are a group of proteins that mediate calcium dependent cell-cell adhesion. They are activated through cleavage of a prosequence in the late Golgi. This protein corresponds to the folded region of the prosequence, and is termed the prodomain. The prodomain shows structural resemblance to the cadherin domain, but lacks all the features known to be important for cadherin-cadherin interactions []. ; GO: 0007155 cell adhesion, 0016021 integral to membrane; PDB: 1OP4_A.
Probab=93.66 E-value=0.88 Score=30.16 Aligned_cols=48 Identities=19% Similarity=0.079 Sum_probs=20.5
Q ss_pred CCceEEEEEeCCCCCCcEEEEEEEEeCCCCCCcEEEEEEecCCCCCcccEEEcCCC
Q psy5886 15 DSLFYGAMARDKHELGVTLMQVKATDKDSGKFGLVEYRLLDNNTDAASYFNLDPAT 70 (184)
Q Consensus 15 ~~~~y~~~v~E~~~~gt~v~~v~a~D~D~~~n~~~~y~i~~~~~~~~~~F~id~~t 70 (184)
.+..|.+.|+.+...|..|++|.-.|=.. +..+.|.... ..|.|..+-
T Consensus 7 ~~~~~~~~Vp~~l~~g~~lg~V~f~dC~~--~~~~~~~ssD------pdF~V~~DG 54 (90)
T PF08758_consen 7 SQKKYTFEVPSNLEAGQPLGKVNFEDCTG--RRRVIFESSD------PDFRVLEDG 54 (90)
T ss_dssp -S-EEEE----SS-SS--EEE---B--SS-----EEEE---------SEEEEETTT
T ss_pred ccceEEEEcCchhhCCcEEEEEEeccCCC--CCceEEecCC------CCEEEcCCC
Confidence 46699999999999999999999877642 3356664433 247776654
No 19
>smart00736 CADG Dystroglycan-type cadherin-like domains. Cadherin-homologous domains present in metazoan dystroglycans and alpha/epsilon sarcoglycans, yeast Axl2p and in a very large protein from magnetotactic bacteria. Likely to bind calcium ions.
Probab=90.90 E-value=1.9 Score=28.61 Aligned_cols=54 Identities=20% Similarity=0.174 Sum_probs=38.4
Q ss_pred EEEeCCCCCCcEEEEEeCC--C----CCEEEECCCcEEEeCccCCccCCCeEEEEEEEEeCCCCC
Q psy5886 124 HATDADEGQNAVVYYSVPE--D----IPFVVDANTGDIRTNKALDYEKQQLYLERVFLISKFATS 182 (184)
Q Consensus 124 ~a~D~D~~~n~~i~y~l~~--~----~~F~Id~~tG~i~~~~~LD~E~~~~y~l~V~a~D~~~~s 182 (184)
...|+| ...++|++.. + .+...|+.++.+.- .+.... ...|.+.|.|+|+.+.+
T Consensus 23 tF~d~d---~~~lty~~~~~~~~~lP~Wl~fd~~~~~~~G-tP~~~~-~g~~~i~v~a~D~~g~~ 82 (97)
T smart00736 23 TFTDAD---GDTLTYSATLSDGSALPSWLSFDSDTGTLSG-TPTNSD-VGSLSLKVTATDSSGAS 82 (97)
T ss_pred ceECCC---CCeEEEEEEeCCCCCCCCeEEEeCCCCEEEE-ECCCCC-CcEEEEEEEEEECCCCE
Confidence 345666 2678888852 1 46888999988877 344433 46799999999998753
No 20
>PF05345 He_PIG: Putative Ig domain; InterPro: IPR008009 This alignment represents the conserved core region of a ~90 residue repeat found in several haemagglutinins and other cell surface proteins. Sequence similarities to Hyalin (IPR003410 from INTERPRO) and the PKD domain (IPR000601 from INTERPRO) suggest an Ig-like fold so this family may be similar in function to the (IPR003791 from INTERPRO) and (IPR003790 from INTERPRO) protein families.
Probab=89.92 E-value=1.6 Score=25.39 Aligned_cols=35 Identities=17% Similarity=0.133 Sum_probs=26.8
Q ss_pred CCEEEECCCcEEEeCccCCcc-CCCeEEEEEEEEeCCC
Q psy5886 144 IPFVVDANTGDIRTNKALDYE-KQQLYLERVFLISKFA 180 (184)
Q Consensus 144 ~~F~Id~~tG~i~~~~~LD~E-~~~~y~l~V~a~D~~~ 180 (184)
....+|+.+|.|.-.-. .. ....|.+.|.|+|+.+
T Consensus 14 ~gLs~d~~tG~isGtp~--~~~~~G~y~~~vtatd~~G 49 (49)
T PF05345_consen 14 SGLSLDPSTGTISGTPT--SSVQPGTYTFTVTATDGSG 49 (49)
T ss_pred CcEEEeCCCCEEEeecC--CCccccEEEEEEEEEcCCC
Confidence 35899999999987632 22 2368999999999875
No 21
>TIGR00845 caca sodium/calcium exchanger 1. This model is specific for the eukaryotic sodium ion/calcium ion exchangers of the Caca family
Probab=85.25 E-value=34 Score=32.31 Aligned_cols=44 Identities=18% Similarity=0.237 Sum_probs=29.0
Q ss_pred CCCCCceEEEEEeCCCCCCcEEEEEEEEeCCCCCCcEEEEEEecCCCC
Q psy5886 12 MTSDSLFYGAMARDKHELGVTLMQVKATDKDSGKFGLVEYRLLDNNTD 59 (184)
Q Consensus 12 ~~~~~~~y~~~v~E~~~~gt~v~~v~a~D~D~~~n~~~~y~i~~~~~~ 59 (184)
+.+++..|+ |.||. |++-.+|.-...|.+..-.+.|+..++...
T Consensus 402 i~Fe~~~Y~--V~En~--GtV~VtV~R~GGdl~~tVsVdY~T~DGTA~ 445 (928)
T TIGR00845 402 IFFEPGHYT--CLENC--GTVALTVVRRGGDLTNTVYVDYRTEDGTAN 445 (928)
T ss_pred EEecCCeEE--EeecC--cEEEEEEEEccCCCCceEEEEEEccCCccC
Confidence 445555555 45885 888777776655655556788988877544
No 22
>PF13750 Big_3_3: Bacterial Ig-like domain (group 3)
Probab=84.64 E-value=13 Score=27.14 Aligned_cols=107 Identities=21% Similarity=0.224 Sum_probs=55.0
Q ss_pred eEE-EEEeCCCCeeEEEEEEEEEeCCCCCCEEecCceEEEEeCCC-C-CcEEEEEEEEeCCCCCCcEEEEEeCCCC---C
Q psy5886 72 HQG-PIQDEGGLSSTATVNIRVTDINDKNPEFQSLPYEFSVPEGQ-D-GLAVGRVHATDADEGQNAVVYYSVPEDI---P 145 (184)
Q Consensus 72 ~~v-~a~d~~~~~~~~~v~I~V~d~ND~~P~f~~~~~~~~v~e~~-~-g~~v~~v~a~D~D~~~n~~i~y~l~~~~---~ 145 (184)
+.+ .|.|..+...+..+...+. ++..+|.+.- .....+..+. . +..=..+.++|..... .--+.+|.+|. .
T Consensus 19 l~~~~a~D~agN~~~~~~~~~~~-iD~T~Ptisi-~~~~~~~~g~~v~~~~~i~i~~tD~~~~~-~i~sv~l~Gg~~~d~ 95 (158)
T PF13750_consen 19 LTVVTATDAAGNTSTSTVSETFT-IDNTPPTISI-SDGASVANGSTVYGLVNISINVTDNSDDS-KITSVSLTGGPASDS 95 (158)
T ss_pred EEEEEEEecCCCEEEEEEeeEEE-EcCCCCEEEE-ecCCccCCCccccceeeeEEEEEeCCCCc-eEEEEEEECCcccce
Confidence 356 6888776665555553333 2235776532 0001122222 1 3333467777765542 33466665542 1
Q ss_pred --EEE-ECCCcEEEeC--ccC-CccCCCeEEEEEEEEeCCCC
Q psy5886 146 --FVV-DANTGDIRTN--KAL-DYEKQQLYLERVFLISKFAT 181 (184)
Q Consensus 146 --F~I-d~~tG~i~~~--~~L-D~E~~~~y~l~V~a~D~~~~ 181 (184)
... ....|...+. +.+ ..|....|+|+|.|+|..|.
T Consensus 96 v~ls~~~~~~~~~~~~yp~~fpsle~~~~YtLtV~a~D~aGN 137 (158)
T PF13750_consen 96 VSLSWTNKGNGVYTLEYPRIFPSLEADDSYTLTVSATDKAGN 137 (158)
T ss_pred EEEeeEeccCceEEeecccccCCcCCCCeEEEEEEEEecCCC
Confidence 222 2233433332 211 23678899999999999874
No 23
>TIGR03660 T1SS_rpt_143 T1SS-143 repeat domain. This model represents a domain of about 143 amino acids that may occur singly or in up to 23 tandem repeats in very large proteins in the genus Vibrio, and in related species such as Legionella pneumophila, Photobacterium profundum, Rhodopseudomonas palustris, Shewanella pealeana, and Aeromonas hydrophila. Proteins with these domains represent a subset of a broader set of proteins with a particular signal for type 1 secretion, consisting of several glycine-rich repeats modeled by pfam00353, followed by a C-terminal domain modeled by TIGR03661. Proteins with this domain tend to share several properties with the RtxA (Repeats in Toxin) protein of Vibrio cholerae, including a large size often containing tandemly repeated domains and a C-terminal signal for type 1 secretion.
Probab=77.66 E-value=23 Score=25.33 Aligned_cols=75 Identities=19% Similarity=0.239 Sum_probs=44.4
Q ss_pred CCCcEEEEEEEEeCCCCCCcEEEEEEecCCCC--CcccEEEcCCCCeEEEEEeCCCCeeEEEEEEEEEeCCCCCCEEecC
Q psy5886 28 ELGVTLMQVKATDKDSGKFGLVEYRLLDNNTD--AASYFNLDPATEHQGPIQDEGGLSSTATVNIRVTDINDKNPEFQSL 105 (184)
Q Consensus 28 ~~gt~v~~v~a~D~D~~~n~~~~y~i~~~~~~--~~~~F~id~~tg~~v~a~d~~~~~~~~~v~I~V~d~ND~~P~f~~~ 105 (184)
.-|..|+++... . ++..+|+|...-.. +.....+ .+.|.|+|..+-.++..+.|.|.| | .|.....
T Consensus 52 a~g~~Vftvtl~-~----~GsYtftL~~~lDH~~g~d~l~l----~~~v~a~D~DGD~s~~~l~VtI~D--D-~P~~~~~ 119 (137)
T TIGR03660 52 AGGNPVFTLTLN-A----DGSYEFTLEGPLDHAAGSDELTL----NFPIIATDFDGDTSSITLPVTIVD--D-VPTITDV 119 (137)
T ss_pred cCCcEEEEEEEC-C----CccEEEEEcccccCCCCCceEEE----eeeEEEEeCCCCccccEEEEEEEC--C-CCeeccc
Confidence 335667776653 2 34567776654211 0111111 256899998876666788888887 4 5776544
Q ss_pred ceEEEEeCCC
Q psy5886 106 PYEFSVPEGQ 115 (184)
Q Consensus 106 ~~~~~v~e~~ 115 (184)
. .+.|.|+.
T Consensus 120 ~-~~~V~E~~ 128 (137)
T TIGR03660 120 D-ALTVDEDD 128 (137)
T ss_pred c-ceEEeccc
Confidence 4 37888865
No 24
>KOG3597|consensus
Probab=71.86 E-value=29 Score=29.91 Aligned_cols=95 Identities=15% Similarity=0.141 Sum_probs=58.9
Q ss_pred eEEEEEEEEEeCCCCCCEEecCceEEEEeCCCC-CcEEEEEEEEeCCCCCCcEEEEEeCCC--C---------------C
Q psy5886 84 STATVNIRVTDINDKNPEFQSLPYEFSVPEGQD-GLAVGRVHATDADEGQNAVVYYSVPED--I---------------P 145 (184)
Q Consensus 84 ~~~~v~I~V~d~ND~~P~f~~~~~~~~v~e~~~-g~~v~~v~a~D~D~~~n~~i~y~l~~~--~---------------~ 145 (184)
.+....|.|..+||.+..+...-+.+.+.|+.. -..-..+.+.|+|..+-. +.|++... . .
T Consensus 24 ~~~~~~i~v~pvndpp~~~~~~~~~l~~~~~~~k~l~~~~l~~~d~d~~~~~-l~f~v~~t~~~~~~~~~~~~~g~~~~~ 102 (442)
T KOG3597|consen 24 QTDVLRIHVNPVNDPPSLIFPSGSLLVILEGGQKVLDPELLTAADPDSAPLP-LEFQVLGTSSVPLPVLKFDVPGAPATE 102 (442)
T ss_pred EEeeecccccccCCCcceeecccceEEeecCCceeccceEeeccCCCCCccc-eEEEEccCCCCCCccceeeccCCcccc
Confidence 567788999999998877766666788888763 344456777888886444 77877541 1 1
Q ss_pred EEE-ECCCcEEEeCccCCccCCCeEEEEEEEEeCCCC
Q psy5886 146 FVV-DANTGDIRTNKALDYEKQQLYLERVFLISKFAT 181 (184)
Q Consensus 146 F~I-d~~tG~i~~~~~LD~E~~~~y~l~V~a~D~~~~ 181 (184)
|+- +-..|.+.....= .+...+.+...++|+...
T Consensus 103 Fs~~~v~~g~~~yvh~g--~el~~~~~~~~~SDg~~~ 137 (442)
T KOG3597|consen 103 FSYEEVEDGSLSYVHSG--TELRESELQLRVSDGLLV 137 (442)
T ss_pred eEehHhhcCceeEEecC--cccccceEEEEeecceEe
Confidence 333 2223433332110 024566788888887644
No 25
>PF07495 Y_Y_Y: Y_Y_Y domain; InterPro: IPR011123 This region is mostly found at the end of the beta propellers (IPR011110 from INTERPRO) in a family of two component regulators. However they are also found tandemly repeated in Q891H4 from SWISSPROT without other signal conduction domains being present. It is named after the conserved tyrosines found in the alignment. The exact function is not known.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=53.96 E-value=45 Score=19.78 Aligned_cols=45 Identities=16% Similarity=0.071 Sum_probs=26.0
Q ss_pred CCcEEEEEeCC--CCCEEEECCCc-EEEeCccCCccCCCeEEEEEEEEeCCCC
Q psy5886 132 QNAVVYYSVPE--DIPFVVDANTG-DIRTNKALDYEKQQLYLERVFLISKFAT 181 (184)
Q Consensus 132 ~n~~i~y~l~~--~~~F~Id~~tG-~i~~~~~LD~E~~~~y~l~V~a~D~~~~ 181 (184)
.+-..+|+|.+ +.+..+. ... .+.. ..| ..+.|+|.|+|.|..+.
T Consensus 6 ~~~~Y~Y~l~g~d~~W~~~~-~~~~~~~~-~~L---~~G~Y~l~V~a~~~~~~ 53 (66)
T PF07495_consen 6 ENIRYRYRLEGFDDEWITLG-SYSNSISY-TNL---PPGKYTLEVRAKDNNGK 53 (66)
T ss_dssp TTEEEEEEEETTESSEEEES-STS-EEEE-ES-----SEEEEEEEEEEETTS-
T ss_pred CceEEEEEEECCCCeEEECC-CCcEEEEE-EeC---CCEEEEEEEEEECCCCC
Confidence 34567787865 2334443 323 3332 222 56889999999998765
No 26
>PF05895 DUF859: Siphovirus protein of unknown function (DUF859); InterPro: IPR008577 This entry is represented by Streptococcus phage 7201, Orf39. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This family consists of several uncharacterised proteins from a number of the Siphoviruses as well as some bacterial proteins from Streptococcus species. Some of the members of this family are described as putative minor structural proteins.
Probab=46.14 E-value=2.3e+02 Score=25.74 Aligned_cols=101 Identities=21% Similarity=0.261 Sum_probs=54.0
Q ss_pred eEEEEEeCCCCe-eEEEEEEEEEeCCCCCCEEecCceEEEEeCCCC-CcE-EEEEEEEeC----CCC-CC-cEEEEEeCC
Q psy5886 72 HQGPIQDEGGLS-STATVNIRVTDINDKNPEFQSLPYEFSVPEGQD-GLA-VGRVHATDA----DEG-QN-AVVYYSVPE 142 (184)
Q Consensus 72 ~~v~a~d~~~~~-~~~~v~I~V~d~ND~~P~f~~~~~~~~v~e~~~-g~~-v~~v~a~D~----D~~-~n-~~i~y~l~~ 142 (184)
+.+.++|+.++. ......|.|++-. +|.+. +.+.-... +.. .....|.=. +.- .| -.++|+...
T Consensus 302 i~atVtDSRGr~S~~~~~tItVl~Y~--~P~ls-----fsv~R~~~~~~~~~v~~~a~Iapl~v~g~qKN~~~lt~~~a~ 374 (624)
T PF05895_consen 302 IRATVTDSRGRTSDPKTKTITVLEYS--PPTLS-----FSVYRCGSSGNTLTVTRNAKIAPLTVNGVQKNTMTLTFKVAP 374 (624)
T ss_pred EEEEEEECCCccCCceEEEEEEEEcC--CCcEE-----EEEEEeCCCCcEEEEEEEEEEeEEEEcccccceEEEEEEEEE
Confidence 467788998875 4788999999995 88773 23322211 111 111111111 111 12 245555543
Q ss_pred --CCCEEEECCC-------------cEEEeCccCCccCCCeEEEEEEEEeCCCC
Q psy5886 143 --DIPFVVDANT-------------GDIRTNKALDYEKQQLYLERVFLISKFAT 181 (184)
Q Consensus 143 --~~~F~Id~~t-------------G~i~~~~~LD~E~~~~y~l~V~a~D~~~~ 181 (184)
...|.+|... +...+...+ .....|.+++.++|.+..
T Consensus 375 ~gt~~~t~d~~~a~~~~s~~s~~~~~~~~L~g~y--~~~kSy~V~~~l~D~F~s 426 (624)
T PF05895_consen 375 LGTGTFTTDNGSASGTWSSISELTNSSANLGGTY--DAEKSYDVRGTLSDKFTS 426 (624)
T ss_pred cCcceEEEEccccccceeeeeeecccceeecccc--CCCceEEEEEEEEEEeee
Confidence 3446554321 122333343 455599999999998754
No 27
>cd00146 PKD polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.
Probab=42.39 E-value=33 Score=21.32 Aligned_cols=21 Identities=19% Similarity=0.294 Sum_probs=17.4
Q ss_pred CccCCCeEEEEEEEEeCCCCC
Q psy5886 162 DYEKQQLYLERVFLISKFATS 182 (184)
Q Consensus 162 D~E~~~~y~l~V~a~D~~~~s 182 (184)
.|.....|++++.|+|..+.+
T Consensus 52 ~y~~~G~y~v~l~v~d~~g~~ 72 (81)
T cd00146 52 TYTKPGTYTVTLTVTNAVGSS 72 (81)
T ss_pred EcCCCcEEEEEEEEEeCCCCE
Confidence 467889999999999997653
No 28
>COG2706 3-carboxymuconate cyclase [Carbohydrate transport and metabolism]
Probab=36.02 E-value=2.6e+02 Score=23.37 Aligned_cols=67 Identities=18% Similarity=0.249 Sum_probs=36.1
Q ss_pred EEEEEeCCCCCCEEecCceEEEEeCCCCCcEEEEEEEEeCCCCCCcEEEEEeCCC----CCEEEECCCcEEEeCc
Q psy5886 89 NIRVTDINDKNPEFQSLPYEFSVPEGQDGLAVGRVHATDADEGQNAVVYYSVPED----IPFVVDANTGDIRTNK 159 (184)
Q Consensus 89 ~I~V~d~ND~~P~f~~~~~~~~v~e~~~g~~v~~v~a~D~D~~~n~~i~y~l~~~----~~F~Id~~tG~i~~~~ 159 (184)
+|.|...|.+.=.|..-.....++|+=.|..-+.-....+ ++++-|--..+ ..|+|++.+|.|.+.+
T Consensus 214 tV~v~~y~~~~g~~~~lQ~i~tlP~dF~g~~~~aaIhis~----dGrFLYasNRg~dsI~~f~V~~~~g~L~~~~ 284 (346)
T COG2706 214 TVDVLEYNPAVGKFEELQTIDTLPEDFTGTNWAAAIHISP----DGRFLYASNRGHDSIAVFSVDPDGGKLELVG 284 (346)
T ss_pred EEEEEEEcCCCceEEEeeeeccCccccCCCCceeEEEECC----CCCEEEEecCCCCeEEEEEEcCCCCEEEEEE
Confidence 4556666655566666666667777654433222112222 24455555443 2377777777666653
No 29
>PF13754 Big_3_4: Bacterial Ig-like domain (group 3)
Probab=33.42 E-value=1e+02 Score=17.90 Aligned_cols=17 Identities=12% Similarity=0.002 Sum_probs=14.4
Q ss_pred CCCeEEEEEEEEeCCCC
Q psy5886 165 KQQLYLERVFLISKFAT 181 (184)
Q Consensus 165 ~~~~y~l~V~a~D~~~~ 181 (184)
....|.++|.|+|..|.
T Consensus 22 ~dG~y~itv~a~D~AGN 38 (54)
T PF13754_consen 22 ADGTYTITVTATDAAGN 38 (54)
T ss_pred CCccEEEEEEEEeCCCC
Confidence 46789999999998875
No 30
>PF00801 PKD: PKD domain; InterPro: IPR000601 The PKD (Polycystic Kidney Disease) domain was first identified in the Polycystic Kidney Disease protein, polycystin-1 (PDK1 gene), and contains an Ig-like fold consisting of a beta-sandwich of seven strands in two sheets with a Greek key topology, although some members have additional strands []. Polycystin-1 is a large cell-surface glycoprotein involved in adhesive protein-protein and protein-carbohydrate interactions; however it is not clear if the PKD domain mediates any of these interactions. PKD domains are also found in other proteins, usually in the extracellular parts of proteins involved in interactions with other proteins. For example, domains with a PKD-type fold are found in archaeal surface layer proteins that protect the cell from extreme environments [], and in the human VPS10 domain-containing receptor SorCS2 [].; PDB: 1B4R_A 2KZW_A 2C4X_A 2C26_A 2Y72_B 3JQU_A 3JS7_B 1WGO_A 1L0Q_A.
Probab=33.29 E-value=57 Score=19.65 Aligned_cols=19 Identities=11% Similarity=0.225 Sum_probs=15.7
Q ss_pred ccCCCeEEEEEEEEeCCCC
Q psy5886 163 YEKQQLYLERVFLISKFAT 181 (184)
Q Consensus 163 ~E~~~~y~l~V~a~D~~~~ 181 (184)
|.....|+++|.|+|..+.
T Consensus 48 y~~~G~y~V~ltv~n~~g~ 66 (69)
T PF00801_consen 48 YSSPGTYTVTLTVTNGVGS 66 (69)
T ss_dssp ESSSEEEEEEEEEEETTSE
T ss_pred cCCCeEEEEEEEEEECCCC
Confidence 4457899999999998764
No 31
>PF03160 Calx-beta: Calx-beta domain; InterPro: IPR003644 The calx-beta motif is present as a tandem repeat in the cytoplasmic domains of Calx Na-Ca exchangers, which are used to expel calcium from cells. This motif overlaps domains used for calcium binding and regulation. The calx-beta motif is also present in the cytoplasmic tail of mammalian integrin-beta4, which mediates the bi-directional transfer of signals across the plasma membrane, as well as in some cyanobacterial proteins. This motif contains a series of beta-strands and turns that form a self-contained beta-sheet [, ].; GO: 0007154 cell communication, 0016021 integral to membrane; PDB: 3H6A_B 3FSO_A 3FQ4_B 2DPK_A 2QVM_A 3GIN_B 2QVK_A 2FWU_A 2FWS_A 3E9U_A ....
Probab=31.98 E-value=1.4e+02 Score=19.24 Aligned_cols=51 Identities=20% Similarity=0.303 Sum_probs=28.2
Q ss_pred EEEEEeCCCCCCEEecCceEEEEeCCCCCcEEEEEEEEeCCCCCCcEEEEEeCC
Q psy5886 89 NIRVTDINDKNPEFQSLPYEFSVPEGQDGLAVGRVHATDADEGQNAVVYYSVPE 142 (184)
Q Consensus 89 ~I~V~d~ND~~P~f~~~~~~~~v~e~~~g~~v~~v~a~D~D~~~n~~i~y~l~~ 142 (184)
+|.|.| ||.+ .+.-..-...+.|+. |..-..|.....+....-.+.|...+
T Consensus 2 tvtI~d-~d~~-~v~f~~~~~~v~E~~-~~~~v~V~~~~~~~~~~v~v~~~~~~ 52 (100)
T PF03160_consen 2 TVTILD-DDDP-TVSFSSPSYTVSEGD-GTVTVTVTRSGGSLDGPVTVNYSTVD 52 (100)
T ss_dssp EEEEE--TTSE-EEEESSSEEEEETTS-SEEEEEEEEESS-TSSEEEEEEEEEE
T ss_pred EEEEEC-CCCC-EEEEeCCEEEEEeCC-CEEEEEEEEcccCCCcceEEEEEEeC
Confidence 466778 6655 776666667788875 33344444444433333466777654
No 32
>PRK08577 hypothetical protein; Provisional
Probab=29.36 E-value=1.4e+02 Score=20.87 Aligned_cols=38 Identities=11% Similarity=0.220 Sum_probs=27.8
Q ss_pred CCCC--EEEECCCcEEEeCccCCccCCCeEEEEEEEEeCCC
Q psy5886 142 EDIP--FVVDANTGDIRTNKALDYEKQQLYLERVFLISKFA 180 (184)
Q Consensus 142 ~~~~--F~Id~~tG~i~~~~~LD~E~~~~y~l~V~a~D~~~ 180 (184)
.|+. |.++...|.|.+ .++-.+....|.+.|.+.|..|
T Consensus 29 ~g~~~~~~~~~~~~~~~~-~~~~~~~k~~~~I~V~~~Dr~G 68 (136)
T PRK08577 29 EGMYVLLIADTDKKEIHL-EPIALPGKKLVEIELVVEDRPG 68 (136)
T ss_pred CCCEEEEEEECCCCEEEE-EEcCCCCccEEEEEEEEcCCCC
Confidence 4554 555777788888 4555566678999999999875
No 33
>PF02494 HYR: HYR domain; InterPro: IPR003410 This domain is known as the HYR (Hyalin Repeat) domain, after the protein hyalin that is composed exclusively of this repeat. This domain probably corresponds to a new superfamily in the immunoglobulin fold. The function of this domain is uncertain it may be involved in cell adhesion. In the Sushi repeat-containing protein (SrpX), this domain is found between two sushi repeats.
Probab=28.85 E-value=1.5e+02 Score=18.46 Aligned_cols=30 Identities=23% Similarity=0.316 Sum_probs=23.9
Q ss_pred ccEEEcCCCCeEEEEEeCCCCeeEEEEEEEE
Q psy5886 62 SYFNLDPATEHQGPIQDEGGLSSTATVNIRV 92 (184)
Q Consensus 62 ~~F~id~~tg~~v~a~d~~~~~~~~~v~I~V 92 (184)
..|.+- .+-+...|+|..+..+++.+.|.|
T Consensus 52 ~~f~~G-~t~V~ytA~D~~GN~a~C~f~V~V 81 (81)
T PF02494_consen 52 DLFPVG-TTTVTYTATDAAGNSATCSFTVTV 81 (81)
T ss_pred ceEeec-eEEEEEEEEECCCCEEEEEEEEEC
Confidence 567664 566789999988888899998875
No 34
>smart00089 PKD Repeats in polycystic kidney disease 1 (PKD1) and other proteins. Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.
Probab=23.87 E-value=1e+02 Score=18.84 Aligned_cols=21 Identities=14% Similarity=0.214 Sum_probs=17.1
Q ss_pred cCCccCCCeEEEEEEEEeCCC
Q psy5886 160 ALDYEKQQLYLERVFLISKFA 180 (184)
Q Consensus 160 ~LD~E~~~~y~l~V~a~D~~~ 180 (184)
..-|.....|.+++.|+|..+
T Consensus 48 ~~~y~~~G~y~v~l~v~n~~g 68 (79)
T smart00089 48 THTYTKPGTYTVTLTVTNAVG 68 (79)
T ss_pred EEEeCCCcEEEEEEEEEcCCC
Confidence 334677889999999999877
No 35
>PF13287 Fn3_assoc: Fn3 associated
Probab=22.08 E-value=1.9e+02 Score=17.21 Aligned_cols=36 Identities=14% Similarity=0.178 Sum_probs=23.7
Q ss_pred CcEEEEEEecCCCCC-----cccEEEcCCCCeEEEEEeCCC
Q psy5886 46 FGLVEYRLLDNNTDA-----ASYFNLDPATEHQGPIQDEGG 81 (184)
Q Consensus 46 n~~~~y~i~~~~~~~-----~~~F~id~~tg~~v~a~d~~~ 81 (184)
++++.|++.+..+.. .+.|.|...+.+...|.+.++
T Consensus 9 ~~~I~YT~DGs~Pt~~s~~Y~~pi~i~~~~tikA~a~~~~g 49 (59)
T PF13287_consen 9 GAKIYYTTDGSEPTTNSILYTGPITIKKGTTIKAIAVDDGG 49 (59)
T ss_pred CCEEEEeCCCCCCCCCCCccccCEEeCCCCEEEEEEEeCCC
Confidence 457888888776542 245888766656666666664
No 36
>cd00917 PG-PI_TP The phosphatidylinositol/phosphatidylglycerol transfer protein (PG/PI-TP) has been shown to bind phosphatidylglycerol and phosphatidylinositol, but the biological significance of this is still obscure. These proteins belong to the ML domain family.
Probab=20.43 E-value=2.2e+02 Score=19.49 Aligned_cols=36 Identities=22% Similarity=0.183 Sum_probs=24.3
Q ss_pred CCCCCCCCCCCCCCceEEEEEeCCCCCCcEEEEEEEEeCC
Q psy5886 3 PCPIPSALEMTSDSLFYGAMARDKHELGVTLMQVKATDKD 42 (184)
Q Consensus 3 ~~~~p~~~~~~~~~~~y~~~v~E~~~~gt~v~~v~a~D~D 42 (184)
.||...... .......|+...|.|....++...|.|
T Consensus 76 ~CPi~~G~~----~~~~~~~ip~~~P~g~y~v~~~l~d~~ 111 (122)
T cd00917 76 SCPIEPGDK----FLTKLVDLPGEIPPGKYTVSARAYTKD 111 (122)
T ss_pred cCCcCCCcE----EEEEEeeCCCCCCCceEEEEEEEECCC
Confidence 677744421 123336788888999998888888865
Done!