Query psy4200
Match_columns 388
No_of_seqs 251 out of 1536
Neff 9.5
Searched_HMMs 46136
Date Fri Aug 16 19:25:48 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy4200.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/4200hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG4289|consensus 100.0 8.1E-60 1.8E-64 468.1 35.5 364 1-387 364-937 (2531)
2 KOG4289|consensus 100.0 3.2E-57 7E-62 449.7 33.5 360 1-382 259-826 (2531)
3 KOG1219|consensus 100.0 1.6E-53 3.5E-58 435.2 40.7 360 1-381 2168-2722(4289)
4 KOG1219|consensus 100.0 5.8E-52 1.3E-56 424.0 39.9 366 1-386 735-1208(4289)
5 cd00031 CA Cadherin repeat dom 100.0 1.2E-28 2.5E-33 215.5 28.1 190 16-266 2-199 (199)
6 cd00031 CA Cadherin repeat dom 99.9 3.5E-21 7.5E-26 168.1 23.0 129 164-304 1-130 (199)
7 PF00028 Cadherin: Cadherin do 99.7 7.9E-16 1.7E-20 117.1 13.9 92 165-265 1-93 (93)
8 smart00112 CA Cadherin repeats 99.6 2.1E-14 4.6E-19 105.6 10.5 78 185-272 1-79 (79)
9 KOG1834|consensus 99.5 4.1E-13 8.8E-18 128.0 18.9 201 5-267 27-245 (952)
10 PF00028 Cadherin: Cadherin do 99.5 1.7E-13 3.6E-18 104.2 11.8 70 16-88 1-76 (93)
11 KOG1834|consensus 99.3 2.5E-11 5.4E-16 116.0 15.5 149 147-304 20-176 (952)
12 smart00112 CA Cadherin repeats 99.3 1E-11 2.2E-16 91.2 9.9 69 41-158 8-79 (79)
13 PF08266 Cadherin_2: Cadherin- 97.7 4.5E-05 9.7E-10 55.9 3.8 66 165-231 3-71 (84)
14 PF08266 Cadherin_2: Cadherin- 97.6 4.8E-05 1E-09 55.8 3.0 59 19-78 5-71 (84)
15 PF08758 Cadherin_pro: Cadheri 97.4 0.00037 8E-09 51.8 5.7 76 7-88 2-78 (90)
16 PF08758 Cadherin_pro: Cadheri 96.0 0.091 2E-06 39.0 9.0 77 156-243 2-78 (90)
17 smart00736 CADG Dystroglycan-t 94.2 1.1 2.4E-05 33.8 10.6 69 184-269 24-96 (97)
18 smart00736 CADG Dystroglycan-t 93.6 1.4 3E-05 33.2 10.2 49 35-87 23-77 (97)
19 TIGR01965 VCBS_repeat VCBS rep 93.0 1.3 2.9E-05 33.4 9.0 39 130-173 60-98 (99)
20 TIGR01965 VCBS_repeat VCBS rep 92.2 1.1 2.3E-05 33.9 7.5 87 180-287 2-98 (99)
21 PF07495 Y_Y_Y: Y_Y_Y domain; 82.2 7.7 0.00017 26.4 6.5 45 191-243 6-50 (66)
22 TIGR00845 caca sodium/calcium 78.2 1E+02 0.0023 33.2 18.2 47 153-201 395-441 (928)
23 TIGR03660 T1SS_rpt_143 T1SS-14 74.9 43 0.00093 27.0 9.9 60 215-287 69-128 (137)
24 PF05345 He_PIG: Putative Ig d 68.0 25 0.00055 22.6 5.5 34 204-242 13-46 (49)
25 TIGR00845 caca sodium/calcium 56.6 2.9E+02 0.0063 30.0 22.2 59 3-66 394-460 (928)
26 KOG3597|consensus 49.1 38 0.00083 33.0 5.6 62 319-383 24-85 (442)
27 PF03160 Calx-beta: Calx-beta 47.4 92 0.002 23.1 6.6 53 324-382 2-54 (100)
28 KOG3597|consensus 33.7 1E+02 0.0022 30.3 5.8 59 142-201 24-82 (442)
29 PF05895 DUF859: Siphovirus pr 26.6 7.4E+02 0.016 25.7 13.3 102 130-243 301-423 (624)
30 PF09100 Qn_am_d_aIV: Quinohem 26.1 1.2E+02 0.0026 24.0 3.9 30 305-335 103-132 (133)
31 cd02848 Chitinase_N_term Chiti 20.4 1.2E+02 0.0027 23.1 3.0 28 231-266 79-106 (106)
No 1
>KOG4289|consensus
Probab=100.00 E-value=8.1e-60 Score=468.14 Aligned_cols=364 Identities=29% Similarity=0.453 Sum_probs=314.6
Q ss_pred CCccCCCCCeeccCCCcceEEecCCCCCcEEEEEEEECCC---CCeEEEEEecC---CcEEEeCCeeEEEEcccCCCccc
Q psy4200 1 MEAYGNSPPSFTTDVNSPLVVEENTPPGTIVSTLEGVDPE---GSKVKYGIYGT---DRFSLDRDSGELRVAQPLDREYN 74 (388)
Q Consensus 1 ~~d~nd~~P~F~~~~~~~~~v~E~~~~gt~v~~v~a~D~D---~~~i~y~i~~~---~~F~Id~~tG~i~~~~~lD~e~~ 74 (388)
|+|.|||+|.|.++.|. +.|.|+..++++|++|+|+|.| |+.++|+|.++ +.|.||..||+|.+..+||+|..
T Consensus 364 V~D~NDNaPqFse~~Yv-vqv~Edvt~~avvlrV~AtDrD~g~Ng~VHYsi~Sgn~~G~f~id~~tGel~vv~plD~e~~ 442 (2531)
T KOG4289|consen 364 VEDENDNAPQFSEKRYV-VQVREDVTPPAVVLRVTATDRDKGTNGKVHYSIASGNGRGQFYIDSLTGELDVVEPLDFENS 442 (2531)
T ss_pred EEecCCCCccccccceE-EEecccCCCCceEEEEEecccCCCcCceEEEEeeccCccccEEEecccceEEEeccccccCC
Confidence 68999999999999999 9999999999999999999998 79999999974 48999999999999999999988
Q ss_pred ceeeEEEEEEEEec--C---------------------------------------------------------------
Q psy4200 75 STNTSTIVLTLEGV--D--------------------------------------------------------------- 89 (388)
Q Consensus 75 ~~~~~~l~v~a~D~--~--------------------------------------------------------------- 89 (388)
.|+ +.|+|+|. |
T Consensus 443 -~yt--l~IrAqDggrPpLsn~sgl~iqVlDINDhaPifvstpfq~tvlEnv~lg~~v~~vqaidadsg~na~l~y~laG 519 (2531)
T KOG4289|consen 443 -EYT--LRIRAQDGGRPPLSNTSGLVIQVLDINDHAPIFVSTPFQATVLENVPLGYLVCHVQAIDADSGENARLHYSLAG 519 (2531)
T ss_pred -eeE--EEEEcccCCCCCccCCCceEEEEEecCCCCceeEechhhhhhhhcccccceEEEEecccCCCCcccceeeeecc
Confidence 899 88888882 2
Q ss_pred ----------------------------------------------------------CCCccceeee-------eccee
Q psy4200 90 ----------------------------------------------------------PEGSKVKYGI-------YGTDW 104 (388)
Q Consensus 90 ----------------------------------------------------------P~f~~~~~~~-------~g~~v 104 (388)
|.|++..|.. .|+.|
T Consensus 520 ~~pf~I~~~SG~Itvtk~ldrEt~~~ysl~V~ard~gtp~l~tstsI~Vtv~dvndndP~Ft~~eytl~inED~pvgsSI 599 (2531)
T KOG4289|consen 520 VGPFQINNGSGWITVTKELDRETVEHYSLGVEARDHGTPPLSTSTSISVTVLDVNDNDPTFTQKEYTLRINEDAPVGSSI 599 (2531)
T ss_pred CCCeeEecCCceEEEeecccccccceEEEEEEEcCCCCCcccccceEEEEecccCCCCCccccCceEEEecCCccccceE
Confidence 5565555554 45555
Q ss_pred EE---ecCCccE-E--EE----------------------cCCCCccccceEEEEEEEeeCCCceeEEEEEEEEeecCCC
Q psy4200 105 FS---LDRDSGE-L--RV----------------------AQPLYREDGHHTSKTNVDIRDGHHTSKTNVDIRVGDVQNT 156 (388)
Q Consensus 105 ~~---~D~d~g~-~--~~----------------------~~~~~~~~~~~~~~~~v~a~d~~~~~~~~v~V~V~dvNd~ 156 (388)
++ +|.|... + .+ ..++++.. ..+..+.|+|+||++.+.+.|.|.|.|.|.+
T Consensus 600 ~tvtAvD~d~~s~ityqi~g~ntrn~Fsi~si~g~Glitlalp~dkKq-e~~~vl~vtAtDg~l~d~~~V~v~I~danTh 678 (2531)
T KOG4289|consen 600 VTVTAVDRDANSVITYQITGGNTRNRFSISSIGGGGLITLALPLDKKQ-ERQYVLAVTATDGTLQDTCSVNVNITDANTH 678 (2531)
T ss_pred EEEEEeccccccceEEEecCCcccccceeeccCCcceEEeecchhhcc-cceEEEEEEecCCccccceEEEEEeeecccC
Confidence 54 4444311 0 00 00011000 1112677899999999999999999999999
Q ss_pred CCeeeCCcceEEEecCCCCCcEEEEEEEeeCCCCCCceEEEEEecCCCCceEEeCCeeEEEecccCCccccCCcccEEEE
Q psy4200 157 PPIFINSSFSGEIMESAPIGSVVLRVEAKDGDLAQPRSIYYDLLTNPDEFFLIDSNTGELKTAKPLDREILGGTNGVISL 236 (388)
Q Consensus 157 ~P~f~~~~~~~~v~E~~~~g~~v~~v~A~D~D~~~~~~v~y~l~~~~~~~F~id~~tG~i~~~~~LD~E~~~~~~~~~~l 236 (388)
.|.|...+|.++|.|..|.|+.|..+.|+|.|.|+|++|+|-+. +..|+||+.+|.+++...||||. +-.|++
T Consensus 679 rpvFqs~pfTvsI~e~rP~G~tvvtlsasd~D~geNARI~y~le---d~~Frid~dsg~i~t~~~ld~ed----qvtytl 751 (2531)
T KOG4289|consen 679 RPVFQSSPFTVSINEDRPLGTTVVTLSASDEDTGENARITYILE---DEAFRIDPDSGAIYTQAELDYED----QVTYTL 751 (2531)
T ss_pred CcccccCCeeEeeccCCcCCceeEEEecccCCCCccceEEEEec---ccceeecCCCCceEEeeeeeccc----ceeeEe
Confidence 99999999999999999999999999999999999999999443 24599999999999999999999 668999
Q ss_pred EEEEEEccCCCCcCCCCceEEEEEEEEEcccCCCCCccCCCcEEEEEeCCCCCCCeeeeeEEEEeeCC------------
Q psy4200 237 TVRAREMVDGKPLQEDQATAFAQVTVTILDVNDSPPVFNRKEYVVHIPEDIPDGSLLPDLDMIVTDSD------------ 304 (388)
Q Consensus 237 ~v~a~D~~~g~~~~~~~~~~~~~v~I~V~dvNd~~P~f~~~~~~~~v~E~~~~g~~v~~l~l~a~D~D------------ 304 (388)
.++|+| +|.| +..++.+|.|.|.|+|||+|+|..+.|.++|.|++|++|.+. +++|+|+|
T Consensus 752 ~itA~D--~~~p----q~adtttveV~v~diNDnaPqf~assyt~sV~Ed~Pv~Tsvl--QVSatDaD~g~Ng~v~y~~q 823 (2531)
T KOG4289|consen 752 AITARD--NGIP----QKADTTTVEVLVNDINDNAPQFLASSYTGSVFEDAPVFTSVL--QVSATDADSGPNGRVYYTFQ 823 (2531)
T ss_pred eeeecC--CCCC----CcCccEEEEEEeecccccCcccchhhceeEeecCCCCcceEE--EEEEeccCCCCCceEEEEec
Confidence 999998 7765 678999999999999999999999999999999999999995 99999999
Q ss_pred ----------------------------------eEEEEecCCCceeEEEEEEEEEEEeCCCCCCeeeccceEEEEecCC
Q psy4200 305 ----------------------------------LVIAEETHTAEKLSSSATLIVQVTDVNDNVPSFELNAYTGNVLETA 350 (388)
Q Consensus 305 ----------------------------------~v~~~~~~~~~~~s~~~~v~I~V~dvND~~P~f~~~~y~~~v~e~~ 350 (388)
.+.|+| .|.|++++.+.|+|+|+|+|||||+|.+.+|...|.||.
T Consensus 824 g~~d~p~~F~IEptSGviRtl~rLdRE~~avy~L~a~avD-rg~p~ls~~~eItvtvldvNDnaPvfe~~e~e~~I~ens 902 (2531)
T KOG4289|consen 824 GGDDGPGDFYIEPTSGVIRTLRRLDRENVAVYVLAAYAVD-RGNPPLSAPVEITVTVLDVNDNAPVFEQDELELFIEENS 902 (2531)
T ss_pred CCCCCCCceEEccCcceeehhhhhcchheeEEEEEEEEee-CCCCCcCCceEEEEEEEecCCCCCCCCCcceeeEEeecC
Confidence 234455 688999999999999999999999999999999999999
Q ss_pred CCCcEEEEEEEEcCCCCCCCceeEEEEEEcCceeEEe
Q psy4200 351 QAGTSITTITALDSDGGDYGTGGIVYELLGEYGIMYV 387 (388)
Q Consensus 351 ~~g~~v~~v~a~D~D~~~~~~~~i~ysi~~~~~~~~~ 387 (388)
+.|+.+++|.|.|+|+|+|+. |.|+|+++...-+|
T Consensus 903 pvgs~va~i~a~dpdEG~NA~--IsYqIvgg~d~~~f 937 (2531)
T KOG4289|consen 903 PVGSVVALITADDPDEGPNAH--ISYQIVGGNDPELF 937 (2531)
T ss_pred ccceeeEEEEccCCCcCCcce--EEEeeccCccHHHH
Confidence 999999999999999999988 99999977655444
No 2
>KOG4289|consensus
Probab=100.00 E-value=3.2e-57 Score=449.73 Aligned_cols=360 Identities=30% Similarity=0.483 Sum_probs=319.4
Q ss_pred CCccCCCCCeeccCCCcceEEecCCCCCcEEEEEEEECCC---CCeEEEEEecC---CcEEEeCCeeEEEEcccCCCccc
Q psy4200 1 MEAYGNSPPSFTTDVNSPLVVEENTPPGTIVSTLEGVDPE---GSKVKYGIYGT---DRFSLDRDSGELRVAQPLDREYN 74 (388)
Q Consensus 1 ~~d~nd~~P~F~~~~~~~~~v~E~~~~gt~v~~v~a~D~D---~~~i~y~i~~~---~~F~Id~~tG~i~~~~~lD~e~~ 74 (388)
|.|.|||.|+|.++.|. -++.||.++|+.|.+|+|+|.| |+.|+|++.++ +.|.||+.+|.|+++.+||||+.
T Consensus 259 V~D~nDhsPvFEq~~Y~-e~lREn~evGy~vLtvrAtD~Dsp~Nani~Yrl~eg~~~~~f~in~rSGvI~T~a~lDRE~~ 337 (2531)
T KOG4289|consen 259 VLDTNDHSPVFEQDEYR-EELRENLEVGYEVLTVRATDGDSPPNANIRYRLLEGNAKNVFEINPRSGVISTRAPLDREEL 337 (2531)
T ss_pred EeecCCCCcccchhHHH-HHHhhccccCceEEEEEeccCCCCCCCceEEEecCCCccceeEEcCccceeeccCccCHHhh
Confidence 56999999999999999 9999999999999999999998 79999999975 57999999999999999999999
Q ss_pred ceeeEEEEEEEEecC---------------------CCCccceeee-------ecceeEE--------------------
Q psy4200 75 STNTSTIVLTLEGVD---------------------PEGSKVKYGI-------YGTDWFS-------------------- 106 (388)
Q Consensus 75 ~~~~~~l~v~a~D~~---------------------P~f~~~~~~~-------~g~~v~~-------------------- 106 (388)
..|. |.|.|.|.+ |.|++..|.. +++.+.+
T Consensus 338 ~~y~--L~VeAsDqG~~pgp~Ta~V~itV~D~NDNaPqFse~~Yvvqv~Edvt~~avvlrV~AtDrD~g~Ng~VHYsi~S 415 (2531)
T KOG4289|consen 338 ESYQ--LDVEASDQGRPPGPRTAMVEITVEDENDNAPQFSEKRYVVQVREDVTPPAVVLRVTATDRDKGTNGKVHYSIAS 415 (2531)
T ss_pred hheE--EEEEeccCCCCCCCceEEEEEEEEecCCCCccccccceEEEecccCCCCceEEEEEecccCCCcCceEEEEeec
Confidence 9999 777777721 9999888876 4444444
Q ss_pred --------ecCCccEEEEcCCCCccccceEEEEEEEeeCCC---ceeEEEEEEEEeecCCCCCeeeCCcceEEEecCCCC
Q psy4200 107 --------LDRDSGELRVAQPLYREDGHHTSKTNVDIRDGH---HTSKTNVDIRVGDVQNTPPIFINSSFSGEIMESAPI 175 (388)
Q Consensus 107 --------~D~d~g~~~~~~~~~~~~~~~~~~~~v~a~d~~---~~~~~~v~V~V~dvNd~~P~f~~~~~~~~v~E~~~~ 175 (388)
+|..+|+|.+..++|.|.. ...+.++|.||+ ++...-+.|.|+|+|||+|.|....+..+|.|+.+.
T Consensus 416 gn~~G~f~id~~tGel~vv~plD~e~~--~ytl~IrAqDggrPpLsn~sgl~iqVlDINDhaPifvstpfq~tvlEnv~l 493 (2531)
T KOG4289|consen 416 GNGRGQFYIDSLTGELDVVEPLDFENS--EYTLRIRAQDGGRPPLSNTSGLVIQVLDINDHAPIFVSTPFQATVLENVPL 493 (2531)
T ss_pred cCccccEEEecccceEEEeccccccCC--eeEEEEEcccCCCCCccCCCceEEEEEecCCCCceeEechhhhhhhhcccc
Confidence 4444455566777777776 446889999986 456667779999999999999999999999999999
Q ss_pred CcEEEEEEEeeCCCCCCceEEEEEecCCCCceEEeCCeeEEEecccCCccccCCcccEEEEEEEEEEccCCCCcCCCCce
Q psy4200 176 GSVVLRVEAKDGDLAQPRSIYYDLLTNPDEFFLIDSNTGELKTAKPLDREILGGTNGVISLTVRAREMVDGKPLQEDQAT 255 (388)
Q Consensus 176 g~~v~~v~A~D~D~~~~~~v~y~l~~~~~~~F~id~~tG~i~~~~~LD~E~~~~~~~~~~l~v~a~D~~~g~~~~~~~~~ 255 (388)
|..++.+.|.|+|.|.|+.+.|++.+ -+.|.|+..+|.|++.+.||||+ ...|.|.|.|+| .|.| +++
T Consensus 494 g~~v~~vqaidadsg~na~l~y~laG--~~pf~I~~~SG~Itvtk~ldrEt----~~~ysl~V~ard--~gtp----~l~ 561 (2531)
T KOG4289|consen 494 GYLVCHVQAIDADSGENARLHYSLAG--VGPFQINNGSGWITVTKELDRET----VEHYSLGVEARD--HGTP----PLS 561 (2531)
T ss_pred cceEEEEecccCCCCcccceeeeecc--CCCeeEecCCceEEEeecccccc----cceEEEEEEEcC--CCCC----ccc
Confidence 99999999999999999999999975 35899999999999999999999 458999999998 5665 688
Q ss_pred EEEEEEEEEcccCCCCCccCCCcEEEEEeCCCCCCCeeeeeEEEEeeCC-------------------------------
Q psy4200 256 AFAQVTVTILDVNDSPPVFNRKEYVVHIPEDIPDGSLLPDLDMIVTDSD------------------------------- 304 (388)
Q Consensus 256 ~~~~v~I~V~dvNd~~P~f~~~~~~~~v~E~~~~g~~v~~l~l~a~D~D------------------------------- 304 (388)
+.+.|.|.+.|+|||.|+|++.+|+..+.|+.+.|+.|. +++|+|.|
T Consensus 562 tstsI~Vtv~dvndndP~Ft~~eytl~inED~pvgsSI~--tvtAvD~d~~s~ityqi~g~ntrn~Fsi~si~g~Glitl 639 (2531)
T KOG4289|consen 562 TSTSISVTVLDVNDNDPTFTQKEYTLRINEDAPVGSSIV--TVTAVDRDANSVITYQITGGNTRNRFSISSIGGGGLITL 639 (2531)
T ss_pred ccceEEEEecccCCCCCccccCceEEEecCCccccceEE--EEEEeccccccceEEEecCCcccccceeeccCCcceEEe
Confidence 899999999999999999999999999999999999995 99999999
Q ss_pred --------------------------------------------------------------------------------
Q psy4200 305 -------------------------------------------------------------------------------- 304 (388)
Q Consensus 305 -------------------------------------------------------------------------------- 304 (388)
T Consensus 640 alp~dkKqe~~~vl~vtAtDg~l~d~~~V~v~I~danThrpvFqs~pfTvsI~e~rP~G~tvvtlsasd~D~geNARI~y 719 (2531)
T KOG4289|consen 640 ALPLDKKQERQYVLAVTATDGTLQDTCSVNVNITDANTHRPVFQSSPFTVSINEDRPLGTTVVTLSASDEDTGENARITY 719 (2531)
T ss_pred ecchhhcccceEEEEEEecCCccccceEEEEEeeecccCCcccccCCeeEeeccCCcCCceeEEEecccCCCCccceEEE
Confidence
Q ss_pred --------------------------------eEEEEecCCCceeEEEEEEEEEEEeCCCCCCeeeccceEEEEecCCCC
Q psy4200 305 --------------------------------LVIAEETHTAEKLSSSATLIVQVTDVNDNVPSFELNAYTGNVLETAQA 352 (388)
Q Consensus 305 --------------------------------~v~~~~~~~~~~~s~~~~v~I~V~dvND~~P~f~~~~y~~~v~e~~~~ 352 (388)
.++|.| .+.|+...+++|.|.|.|+|||+|+|..+.|.++|.|++++
T Consensus 720 ~led~~Frid~dsg~i~t~~~ld~edqvtytl~itA~D-~~~pq~adtttveV~v~diNDnaPqf~assyt~sV~Ed~Pv 798 (2531)
T KOG4289|consen 720 ILEDEAFRIDPDSGAIYTQAELDYEDQVTYTLAITARD-NGIPQKADTTTVEVLVNDINDNAPQFLASSYTGSVFEDAPV 798 (2531)
T ss_pred EecccceeecCCCCceEEeeeeecccceeeEeeeeecC-CCCCCcCccEEEEEEeecccccCcccchhhceeEeecCCCC
Confidence 123333 57788999999999999999999999999999999999999
Q ss_pred CcEEEEEEEEcCCCCCCCceeEEEEEEcCc
Q psy4200 353 GTSITTITALDSDGGDYGTGGIVYELLGEY 382 (388)
Q Consensus 353 g~~v~~v~a~D~D~~~~~~~~i~ysi~~~~ 382 (388)
+|.|++|.|+|+|.|.||+ +-|.+.|+.
T Consensus 799 ~TsvlQVSatDaD~g~Ng~--v~y~~qg~~ 826 (2531)
T KOG4289|consen 799 FTSVLQVSATDADSGPNGR--VYYTFQGGD 826 (2531)
T ss_pred cceEEEEEEeccCCCCCce--EEEEecCCC
Confidence 9999999999999999997 888887443
No 3
>KOG1219|consensus
Probab=100.00 E-value=1.6e-53 Score=435.24 Aligned_cols=360 Identities=28% Similarity=0.406 Sum_probs=303.5
Q ss_pred CCccCCCCCeeccCCCcceEEecCCCCCcEEEEEEEECCC-CCeEEEEEec----CCcEEEeCCeeEEEEcccCCCcccc
Q psy4200 1 MEAYGNSPPSFTTDVNSPLVVEENTPPGTIVSTLEGVDPE-GSKVKYGIYG----TDRFSLDRDSGELRVAQPLDREYNS 75 (388)
Q Consensus 1 ~~d~nd~~P~F~~~~~~~~~v~E~~~~gt~v~~v~a~D~D-~~~i~y~i~~----~~~F~Id~~tG~i~~~~~lD~e~~~ 75 (388)
|+|+|||||+|.+..|. ++++|++++|+.|.++.|+|.| |..|.|+|.+ ...|.|+..||.|++.+.||||+..
T Consensus 2168 V~dIndn~PvFeqlsYt-~sisE~s~igt~viqilATdsDsn~~isYsl~g~s~~sk~f~In~sTG~it~~g~ldyE~~q 2246 (4289)
T KOG1219|consen 2168 VGDINDNPPVFEQLSYT-ISISENSKIGTKVIQILATDSDSNREISYSLEGNSEISKPFRINVSTGWITVAGKLDYEENQ 2246 (4289)
T ss_pred ecccCCCCchhheeeEE-EEccCCCccCceEEEEEeccCCCCCceEEEeecCCccccceEEecccceEEEeeecChhhcc
Confidence 68999999999999999 9999999999999999999999 8999999998 4589999999999999999999999
Q ss_pred eeeEEEEEEEEec--------------------CCCCccceeee-------ecceeEE---ecCCc--------------
Q psy4200 76 TNTSTIVLTLEGV--------------------DPEGSKVKYGI-------YGTDWFS---LDRDS-------------- 111 (388)
Q Consensus 76 ~~~~~l~v~a~D~--------------------~P~f~~~~~~~-------~g~~v~~---~D~d~-------------- 111 (388)
.|. +.|+|.|. +|.|.+..|+. -|..+.. .|+|+
T Consensus 2247 ~f~--~fvratdggk~lSseviv~V~VeD~Ndn~Pef~q~~~ea~vsd~a~~g~fit~v~a~D~Dssd~lk~ey~~~~~l 2324 (4289)
T KOG1219|consen 2247 EFR--FFVRATDGGKPLSSEVIVEVHVEDFNDNPPEFNQRNYEAFVSDPARSGHFITVVNAHDLDSSDHLKLEYNSNHFL 2324 (4289)
T ss_pred eEE--EEEEEccCCCcccccEEEEEEehhcCCCCchhccccceeecCCCccceeEEEEEEeccCCccchhhhhhccccee
Confidence 998 77777762 28888877775 1222222 33332
Q ss_pred -----cEE------------------------------------------------------------------------
Q psy4200 112 -----GEL------------------------------------------------------------------------ 114 (388)
Q Consensus 112 -----g~~------------------------------------------------------------------------ 114 (388)
|.+
T Consensus 2325 ~~s~~G~iTlfNl~k~~l~~s~~lrv~vsD~v~~at~~vl~~~~~~n~~~~lveka~l~Tv~~~~~~~~~~f~~~gt~~~ 2404 (4289)
T KOG1219|consen 2325 ILSENGIITLFNLLKSPLQTSYPLRVTVSDGVFRATMEVLFHPHSRNHFSELVEKADLVTVVEHDEQEDADFGAYGTSIY 2404 (4289)
T ss_pred eeccCceEEehhhcccccccccceeeeeccCcceeeeEEEEEecCcccchhhhhccceeEEEEecCccccccccCCceee
Confidence 111
Q ss_pred -------------------EEcCCCCcccc-ceEEEEEEEeeCC-CceeEEEEEEEEeecCCCCCeeeCCcceEEEecCC
Q psy4200 115 -------------------RVAQPLYREDG-HHTSKTNVDIRDG-HHTSKTNVDIRVGDVQNTPPIFINSSFSGEIMESA 173 (388)
Q Consensus 115 -------------------~~~~~~~~~~~-~~~~~~~v~a~d~-~~~~~~~v~V~V~dvNd~~P~f~~~~~~~~v~E~~ 173 (388)
...+.+++|.- .+...+.+.|.|+ ++.+.++++|.++|+|||+|.|....|+.+|.|++
T Consensus 2405 ~si~s~~sd~~~in~~GqI~t~~kld~e~s~~~vi~i~v~a~Da~gr~af~tvti~ltDiNDnpPqF~a~~Y~~nI~ena 2484 (4289)
T KOG1219|consen 2405 YSINSRASDHFEINKSGQIKTLSKLDREYSEELVIIIAVMAFDAGGRVAFCTVTIILTDINDNPPQFDAQLYRVNITENA 2484 (4289)
T ss_pred eeechhccCceeECCCccEEeeehhhhccCceEEEEEEEEEecCCCeEEEEEEEEEEEecCCCCccccceeEEEEeeccc
Confidence 01111111110 0111455667785 68899999999999999999999999999999999
Q ss_pred CCCcEEEEEEEeeCCCCCCceEEEEEecC--CCCceEEeCCeeEEEecccCCccccCCcccEEEEEEEEEEccCCCCcCC
Q psy4200 174 PIGSVVLRVEAKDGDLAQPRSIYYDLLTN--PDEFFLIDSNTGELKTAKPLDREILGGTNGVISLTVRAREMVDGKPLQE 251 (388)
Q Consensus 174 ~~g~~v~~v~A~D~D~~~~~~v~y~l~~~--~~~~F~id~~tG~i~~~~~LD~E~~~~~~~~~~l~v~a~D~~~g~~~~~ 251 (388)
.-|..|+++.|+|.|.+.|+.++|.+.+. -..-|.|++ +|.|.+++.|+.+. +..|.|.|+|.| +|.|
T Consensus 2485 skg~~V~~v~A~D~De~snadvty~i~~e~~~~~v~~in~-sG~Itv~~sL~~~e----n~tl~l~vkA~D--~g~P--- 2554 (4289)
T KOG1219|consen 2485 SKGKLVGHVIARDADEGSNADVTYEIVGESDVKHVFEINE-SGVITVKRSLDGLE----NSTLHLFVKAID--DGKP--- 2554 (4289)
T ss_pred CCCceEEEEEEecCCCCCcccEEEEecCchhhhheeeecC-CceEEeehhhhccc----CcEEEEEEEecc--CCCC---
Confidence 99999999999999999999999999864 345678887 99999999999998 778999999998 7776
Q ss_pred CCceEEEEEEEEEcccCCCCCccCCCcEEEEEeCCCCCCCeeeeeEEEEeeCC---------------------------
Q psy4200 252 DQATAFAQVTVTILDVNDSPPVFNRKEYVVHIPEDIPDGSLLPDLDMIVTDSD--------------------------- 304 (388)
Q Consensus 252 ~~~~~~~~v~I~V~dvNd~~P~f~~~~~~~~v~E~~~~g~~v~~l~l~a~D~D--------------------------- 304 (388)
++.+..+|.|+|.+..++.|.|..+.|.++|+|+.+.|..|+ ++.|.|.|
T Consensus 2555 -~~~s~ttV~v~vl~e~v~lPrFSep~y~fsvpEDv~vG~~Ig--~v~a~~a~~~~i~~~v~~gt~Esn~d~~Fsvdr~T 2631 (4289)
T KOG1219|consen 2555 -RRRSNTTVIVTVLPEDVNLPRFSEPIYTFSVPEDVPVGEEIG--QVSASDADEHVIYSLVLGGTPESNPDLPFSVDRNT 2631 (4289)
T ss_pred -CcccceEEEEEecCcccCcccccCceEEEeccccCCCCCeee--EEeecccCCceEEEEEeCCCCCCCCCCceEEcCCC
Confidence 778899999999999999999999999999999999999998 88998887
Q ss_pred -------------------eEEEEecCCCceeEEEEEEEEEEEeCCCCCCeeeccceEEEEecCCCCCcEEEEEEEEcCC
Q psy4200 305 -------------------LVIAEETHTAEKLSSSATLIVQVTDVNDNVPSFELNAYTGNVLETAQAGTSITTITALDSD 365 (388)
Q Consensus 305 -------------------~v~~~~~~~~~~~s~~~~v~I~V~dvND~~P~f~~~~y~~~v~e~~~~g~~v~~v~a~D~D 365 (388)
.+.|.. . ...-+.+.|.|.|.|+|||+|.|..+.|.+.+.||++.|+.|++++|.|.|
T Consensus 2632 G~i~v~ksLD~E~kk~yqi~v~a~~-~--~~vva~tsv~vqVkDvNDNaPvFe~d~y~f~i~En~pvGtsV~qf~AsD~D 2708 (4289)
T KOG1219|consen 2632 GMIKVNKSLDHEKKKSYQIKVKATC-G--QWVVAETSVFVQVKDVNDNAPVFEKDPYLFIIEENSPVGTSVIQFHASDMD 2708 (4289)
T ss_pred ceEEeccccchhhhceEEEEEEeec-C--CceEEEEEEEEEeecccCCCccccCCceeEEEeccCCCCceEEEEEeeccC
Confidence 122222 1 114678899999999999999999999999999999999999999999999
Q ss_pred CCCCCceeEEEEEEcC
Q psy4200 366 GGDYGTGGIVYELLGE 381 (388)
Q Consensus 366 ~~~~~~~~i~ysi~~~ 381 (388)
.+.+|+ |+|||...
T Consensus 2709 s~~nGq--irysl~~~ 2722 (4289)
T KOG1219|consen 2709 SGNNGQ--IRYSLTSP 2722 (4289)
T ss_pred CCCCce--EEEEEcCC
Confidence 999999 99999976
No 4
>KOG1219|consensus
Probab=100.00 E-value=5.8e-52 Score=423.98 Aligned_cols=366 Identities=29% Similarity=0.458 Sum_probs=318.7
Q ss_pred CCccCCCCCeeccCCCcceEEecCCCCCcEEEEEEEECCC---CCeEEEEEec-CCcEEEeCCeeEEEEcccCCCcccce
Q psy4200 1 MEAYGNSPPSFTTDVNSPLVVEENTPPGTIVSTLEGVDPE---GSKVKYGIYG-TDRFSLDRDSGELRVAQPLDREYNST 76 (388)
Q Consensus 1 ~~d~nd~~P~F~~~~~~~~~v~E~~~~gt~v~~v~a~D~D---~~~i~y~i~~-~~~F~Id~~tG~i~~~~~lD~e~~~~ 76 (388)
|+|.|||+|+|.+..|. ++|.|+..+|+.|++|.|+|.| ||.++|+|.. .+.|+||+.||.|.+.++||||....
T Consensus 735 vkd~ndn~p~f~e~sy~-vtvsedtepgs~Ia~vetnd~D~g~NG~v~fsL~n~sdvfsIdp~tGivv~~~sLdrE~q~~ 813 (4289)
T KOG1219|consen 735 VKDYNDNTPIFVERSYH-VTVSEDTEPGSFIAHVETNDTDGGNNGMVSFSLLNKSDVFSIDPFTGIVVTSKSLDREGQTS 813 (4289)
T ss_pred EEecccCCccccccceE-EEEecCCCCCceEEEEEecccCCCCCceEEEEecCCcceEEecCcccEEEeccccCcccCce
Confidence 57999999999999999 9999999999999999999998 6999999997 78999999999999999999999999
Q ss_pred eeEEEEEEEEecC---------------------CCCccceeee-------ecceeEE---ecCCccE------------
Q psy4200 77 NTSTIVLTLEGVD---------------------PEGSKVKYGI-------YGTDWFS---LDRDSGE------------ 113 (388)
Q Consensus 77 ~~~~l~v~a~D~~---------------------P~f~~~~~~~-------~g~~v~~---~D~d~g~------------ 113 (388)
|. |.|.|.|.| |.|-...+.. .|+.+.. .|+|-|.
T Consensus 814 y~--l~I~a~dqp~pq~~svv~l~vsvedVndnpPkci~~hsr~kipedlp~gt~~~~l~A~d~diGq~~kvry~l~~~~ 891 (4289)
T KOG1219|consen 814 YH--LKIEARDQPPPQLFSVVELDVSVEDVNDNPPKCIIRHSRSKIPEDLPYGTVTWQLVALDPDIGQLGKVRYYLTDDT 891 (4289)
T ss_pred eE--EEEEEcCCCCCceEEEEEEEEEEeeccCCCCccccccccccCcccCCCceEEEEhhhcCcccCcCceeEEEEecCc
Confidence 99 888888744 3333222222 4555554 5666542
Q ss_pred -----------EEEcCCCCccccceEEEEEEEeeCCC---ceeEEEEEEEEeecCCC--CCeeeCCcceEEEecCCCCCc
Q psy4200 114 -----------LRVAQPLYREDGHHTSKTNVDIRDGH---HTSKTNVDIRVGDVQNT--PPIFINSSFSGEIMESAPIGS 177 (388)
Q Consensus 114 -----------~~~~~~~~~~~~~~~~~~~v~a~d~~---~~~~~~v~V~V~dvNd~--~P~f~~~~~~~~v~E~~~~g~ 177 (388)
+.+.+++|++...+. .|.|+|.|++ +++.+.+.|.++|+|.| ||.|..-.-.++|.||+|.|+
T Consensus 892 v~~rvd~~sGavfi~~~LDf~k~~fy-nLsv~a~d~g~p~lss~chl~Vevldv~enlhpp~F~~~v~e~~V~EnapiGT 970 (4289)
T KOG1219|consen 892 VGERVDFPSGAVFIGKPLDFEKSDFY-NLSVTAVDRGTPILSSICHLEVEVLDVNENLHPPEFISFVTEGHVLENAPIGT 970 (4289)
T ss_pred cccccccccccEEEecccccccccce-EEEEEEecCCCcceeeeEEEEEEEeccCCCCCCcchheeeeeeeEeecCCcce
Confidence 345566666654443 7899999976 56788999999999876 999998888899999999999
Q ss_pred EEEEEEEeeCCCCCCceEEEEEec-CCCCceEEeCCeeEEEecccCCccccCCcccEEEEEEEEEEccCCCCcCCCCceE
Q psy4200 178 VVLRVEAKDGDLAQPRSIYYDLLT-NPDEFFLIDSNTGELKTAKPLDREILGGTNGVISLTVRAREMVDGKPLQEDQATA 256 (388)
Q Consensus 178 ~v~~v~A~D~D~~~~~~v~y~l~~-~~~~~F~id~~tG~i~~~~~LD~E~~~~~~~~~~l~v~a~D~~~g~~~~~~~~~~ 256 (388)
.++++.|.|-|.|..+.++|+|.. +..+.|+||..+|.|++.+.||||. +..|.|+|.|.| .|.+ ++++
T Consensus 971 ~vi~i~A~dedsgldg~l~Y~I~~gdg~g~FsId~~tG~irTl~~lDrE~----ks~YwltveA~D--~gt~----~~ss 1040 (4289)
T KOG1219|consen 971 IVIRIQARDEDSGLDGELSYKIRTGDGDGIFSIDSTTGSIRTLKALDREK----KSSYWLTVEAKD--LGTV----PLSS 1040 (4289)
T ss_pred EEEEEEEecCCCCccceEEEEEEcCCcceeEEecCCcceEeechhhchhh----cceEEEEEEEEe--cCCC----cccc
Confidence 999999999999999999999996 5678999999999999999999999 779999999998 5654 6888
Q ss_pred EEEEEEEEcccCCCCCccCCCcEEEEEeCCCCCCCeeeeeEEEEeeCC--------------------------------
Q psy4200 257 FAQVTVTILDVNDSPPVFNRKEYVVHIPEDIPDGSLLPDLDMIVTDSD-------------------------------- 304 (388)
Q Consensus 257 ~~~v~I~V~dvNd~~P~f~~~~~~~~v~E~~~~g~~v~~l~l~a~D~D-------------------------------- 304 (388)
.+.+.|.|+|+|||+|+|.+..|..+|.|+++.+..| +++.|+|+|
T Consensus 1041 v~~vyI~ieDvNDn~Pq~s~pvy~asI~enSp~~vsi--vq~ea~D~Dsssn~kLmykI~sGnyq~FF~Id~~TG~iTt~ 1118 (4289)
T KOG1219|consen 1041 VCEVYIEIEDVNDNVPQFSSPVYYASISENSPETVSI--VQAEANDPDSSSNQKLMYKITSGNYQGFFQIDPETGLITTI 1118 (4289)
T ss_pred ceeEEEEEEecCCCCcccCCceEeeeeccCCCCceEE--EEeccCCCCcccCcceEEEEccCCccceEEEccccceeeee
Confidence 9999999999999999999999999999999999999 599999999
Q ss_pred ------------eEEEEecCCCceeEEEEEEEEEEEeCCCCCCeeeccceEEEEecCCCCCcEEEEEEEEcCCCCCCCce
Q psy4200 305 ------------LVIAEETHTAEKLSSSATLIVQVTDVNDNVPSFELNAYTGNVLETAQAGTSITTITALDSDGGDYGTG 372 (388)
Q Consensus 305 ------------~v~~~~~~~~~~~s~~~~v~I~V~dvND~~P~f~~~~y~~~v~e~~~~g~~v~~v~a~D~D~~~~~~~ 372 (388)
|-+...+.|.|.+.+.+.|.|.|+|+|||+|+|.+..|...++|...+ .+.++.|.|.|.|.|++
T Consensus 1119 r~LDRE~qdEHiLeVTi~D~gep~l~s~~rviV~IldvNdnsp~Flqk~~~~~v~~r~s~--plyRl~a~d~DeG~nar- 1195 (4289)
T KOG1219|consen 1119 RRLDREKQDEHILEVTIQDNGEPWLCSNQRVIVSILDVNDNSPRFLQKKTFLRVPERSSP--PLYRLAAQDNDEGNNAR- 1195 (4289)
T ss_pred hhhcccccccceEEEEEecCCCCccccceEEEEEEeeccCCchhhhhheeEEEeeeccCC--ceeEEEEEecCCCcceE-
Confidence 223334468899999999999999999999999999999999998875 88999999999999988
Q ss_pred eEEEEEEcCceeEE
Q psy4200 373 GIVYELLGEYGIMY 386 (388)
Q Consensus 373 ~i~ysi~~~~~~~~ 386 (388)
|+|+|..+++.|+
T Consensus 1196 -ityniedgde~Fs 1208 (4289)
T KOG1219|consen 1196 -ITYNIEDGDEVFS 1208 (4289)
T ss_pred -EEEecccCceEEE
Confidence 9999998888743
No 5
>cd00031 CA Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here
Probab=99.97 E-value=1.2e-28 Score=215.48 Aligned_cols=190 Identities=41% Similarity=0.650 Sum_probs=168.4
Q ss_pred CcceEEecCCCCCcEEEEEEEECCCC---CeEEEEEecC---CcEEEeCCeeEEEEcccCCCcccceeeEEEEEEEEec-
Q psy4200 16 NSPLVVEENTPPGTIVSTLEGVDPEG---SKVKYGIYGT---DRFSLDRDSGELRVAQPLDREYNSTNTSTIVLTLEGV- 88 (388)
Q Consensus 16 ~~~~~v~E~~~~gt~v~~v~a~D~D~---~~i~y~i~~~---~~F~Id~~tG~i~~~~~lD~e~~~~~~~~l~v~a~D~- 88 (388)
|. +.|+|+++.|+.|+++.|.|+|. +.++|+|.++ .+|.|++.+|.|++++.||||....|. |.|.|.|.
T Consensus 2 ~~-~~i~En~~~g~~v~~~~a~D~D~~~~~~~~y~i~~~~~~~~F~i~~~tG~l~~~~~lD~e~~~~~~--l~v~a~D~g 78 (199)
T cd00031 2 YS-VSVPENAPPGTVVGTVSATDPDSGENGRVTYSILGGNEDGLFSIDPNTGVITTTKPLDREEQSEYT--LTVVASDGG 78 (199)
T ss_pred eE-EEEeCCCCCCCEEEEEEEECCCCCCCceEEEEEeCCCCcccEEEeCCCCEEEECCCCCCcCCceEE--EEEEEEECC
Confidence 56 89999999999999999999994 6899999974 389999999999999999999999999 88888882
Q ss_pred CCCCccceeeeecceeEEecCCccEEEEcCCCCccccceEEEEEEEeeCCCceeEEEEEEEEeecCCCCCeeeCCcceEE
Q psy4200 89 DPEGSKVKYGIYGTDWFSLDRDSGELRVAQPLYREDGHHTSKTNVDIRDGHHTSKTNVDIRVGDVQNTPPIFINSSFSGE 168 (388)
Q Consensus 89 ~P~f~~~~~~~~g~~v~~~D~d~g~~~~~~~~~~~~~~~~~~~~v~a~d~~~~~~~~v~V~V~dvNd~~P~f~~~~~~~~ 168 (388)
.| ..+....++|.|.|+||++|.|....|.+.
T Consensus 79 ~~------------------------------------------------~~~~~~~v~I~V~d~Nd~~P~~~~~~~~~~ 110 (199)
T cd00031 79 GP------------------------------------------------PLSSTATVTVTVLDVNDNPPVFEQSSYEAS 110 (199)
T ss_pred cC------------------------------------------------cceeEEEEEEEEccCCCCCCcccccceEEE
Confidence 11 123678999999999999999999999999
Q ss_pred EecCCCCCcEEEEEEEeeCCCCCCceEEEEEecCCC-CceEEeCCeeEEEecccCCccccCCcccEEEEEEEEEEccCCC
Q psy4200 169 IMESAPIGSVVLRVEAKDGDLAQPRSIYYDLLTNPD-EFFLIDSNTGELKTAKPLDREILGGTNGVISLTVRAREMVDGK 247 (388)
Q Consensus 169 v~E~~~~g~~v~~v~A~D~D~~~~~~v~y~l~~~~~-~~F~id~~tG~i~~~~~LD~E~~~~~~~~~~l~v~a~D~~~g~ 247 (388)
|.|+.++|+.++++.|+|+|.+.++.++|+|.+... .+|.|++.+|.|++.+.||+|. ...|.+.|.|+| .|.
T Consensus 111 v~e~~~~~~~i~~~~a~D~D~~~~~~~~y~l~~~~~~~~f~i~~~~G~i~~~~~ld~e~----~~~~~l~v~a~D--~~~ 184 (199)
T cd00031 111 VPENAPPGTVVGTVTATDADSGENAKLTYSILSGNDKELFSIDPNTGIITLAKPLDREE----KSSYELTVVATD--GGG 184 (199)
T ss_pred EeCCCCCCCEEEEEEEEcCCCCCCccEEEEEeCCCCCCEEEEeCCceEEEeCCccCCcc----CceEEEEEEEEE--CCC
Confidence 999999999999999999999888899999997544 8999999999999999999999 558999999998 343
Q ss_pred CcCCCCceEEEEEEEEEcc
Q psy4200 248 PLQEDQATAFAQVTVTILD 266 (388)
Q Consensus 248 ~~~~~~~~~~~~v~I~V~d 266 (388)
+ .++.++.+.|.|.|
T Consensus 185 ~----~~~~~~~i~i~v~d 199 (199)
T cd00031 185 P----PLSSTATVTVTVLD 199 (199)
T ss_pred C----CceeEEEEEEEEEC
Confidence 2 46888888888865
No 6
>cd00031 CA Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here
Probab=99.89 E-value=3.5e-21 Score=168.14 Aligned_cols=129 Identities=43% Similarity=0.674 Sum_probs=117.8
Q ss_pred cceEEEecCCCCCcEEEEEEEeeCCCCCCceEEEEEecCCC-CceEEeCCeeEEEecccCCccccCCcccEEEEEEEEEE
Q psy4200 164 SFSGEIMESAPIGSVVLRVEAKDGDLAQPRSIYYDLLTNPD-EFFLIDSNTGELKTAKPLDREILGGTNGVISLTVRARE 242 (388)
Q Consensus 164 ~~~~~v~E~~~~g~~v~~v~A~D~D~~~~~~v~y~l~~~~~-~~F~id~~tG~i~~~~~LD~E~~~~~~~~~~l~v~a~D 242 (388)
.|.++|.|+.+.|+.++++.|.|+|.+.++.++|+|.++.. .+|.|++.+|.|++.+.||||. ...|.|.|+|+|
T Consensus 1 ~~~~~i~En~~~g~~v~~~~a~D~D~~~~~~~~y~i~~~~~~~~F~i~~~tG~l~~~~~lD~e~----~~~~~l~v~a~D 76 (199)
T cd00031 1 SYSVSVPENAPPGTVVGTVSATDPDSGENGRVTYSILGGNEDGLFSIDPNTGVITTTKPLDREE----QSEYTLTVVASD 76 (199)
T ss_pred CeEEEEeCCCCCCCEEEEEEEECCCCCCCceEEEEEeCCCCcccEEEeCCCCEEEECCCCCCcC----CceEEEEEEEEE
Confidence 36789999999999999999999999988899999997543 7999999999999999999999 568999999997
Q ss_pred ccCCCCcCCCCceEEEEEEEEEcccCCCCCccCCCcEEEEEeCCCCCCCeeeeeEEEEeeCC
Q psy4200 243 MVDGKPLQEDQATAFAQVTVTILDVNDSPPVFNRKEYVVHIPEDIPDGSLLPDLDMIVTDSD 304 (388)
Q Consensus 243 ~~~g~~~~~~~~~~~~~v~I~V~dvNd~~P~f~~~~~~~~v~E~~~~g~~v~~l~l~a~D~D 304 (388)
.|.+ .++....+.|.|.|+||++|.|....|.+.|.|+.+.|+.++ ++.|+|+|
T Consensus 77 --~g~~----~~~~~~~v~I~V~d~Nd~~P~~~~~~~~~~v~e~~~~~~~i~--~~~a~D~D 130 (199)
T cd00031 77 --GGGP----PLSSTATVTVTVLDVNDNPPVFEQSSYEASVPENAPPGTVVG--TVTATDAD 130 (199)
T ss_pred --CCcC----cceeEEEEEEEEccCCCCCCcccccceEEEEeCCCCCCCEEE--EEEEEcCC
Confidence 4554 456899999999999999999998899999999999999997 89999999
No 7
>PF00028 Cadherin: Cadherin domain; InterPro: IPR002126 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion.; GO: 0005509 calcium ion binding, 0007156 homophilic cell adhesion, 0016020 membrane; PDB: 2A4E_A 2A4C_B 2O72_A 2QVI_A 1NCJ_A 3Q2W_A 3Q2N_A 3LNH_B 3LNI_A 3Q2L_A ....
Probab=99.69 E-value=7.9e-16 Score=117.05 Aligned_cols=92 Identities=42% Similarity=0.610 Sum_probs=82.9
Q ss_pred ceEEEecCCCCCcEEEEEEEeeCCCCCCceEEEEEecCC-CCceEEeCCeeEEEecccCCccccCCcccEEEEEEEEEEc
Q psy4200 165 FSGEIMESAPIGSVVLRVEAKDGDLAQPRSIYYDLLTNP-DEFFLIDSNTGELKTAKPLDREILGGTNGVISLTVRAREM 243 (388)
Q Consensus 165 ~~~~v~E~~~~g~~v~~v~A~D~D~~~~~~v~y~l~~~~-~~~F~id~~tG~i~~~~~LD~E~~~~~~~~~~l~v~a~D~ 243 (388)
|.++|+|+.++|+.++++.|.|+|.+.|+.+.|+|.++. ..+|.|++.+|.|++.++||||. ...|.|.|.|+|.
T Consensus 1 Y~~~v~E~~~~g~~v~~v~a~D~D~~~n~~i~y~i~~~~~~~~F~I~~~tg~i~~~~~LD~E~----~~~y~l~v~a~D~ 76 (93)
T PF00028_consen 1 YSFSVPENAPPGTVVGQVTATDPDSGPNSQITYSILGGNPDGLFSIDPNTGEISLKKPLDRET----QSSYQLTVRATDS 76 (93)
T ss_dssp EEEEEETTGSTSSEEEEEEEEESSTSTTSSEEEEEEETTSTTSEEEETTTTEEEESSSSCTTT----TSEEEEEEEEEET
T ss_pred CEEEEECCCCCCCEEEEEEEEeCCCCCCceEEEEEecCcccCceEEeeeeeccccceecCccc----CCEEEEEEEEEEC
Confidence 788999999999999999999999999999999999744 79999999999999999999999 6689999999983
Q ss_pred cCCCCcCCCCceEEEEEEEEEc
Q psy4200 244 VDGKPLQEDQATAFAQVTVTIL 265 (388)
Q Consensus 244 ~~g~~~~~~~~~~~~~v~I~V~ 265 (388)
.|.+ .++++++|.|+|+
T Consensus 77 -~~~~----~~~~~~~V~I~V~ 93 (93)
T PF00028_consen 77 -GGSP----PLSSTATVTINVL 93 (93)
T ss_dssp -TTSS----EEEEEEEEEEEEE
T ss_pred -CCCC----CCEEEEEEEEEEC
Confidence 1444 6889999999874
No 8
>smart00112 CA Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
Probab=99.58 E-value=2.1e-14 Score=105.61 Aligned_cols=78 Identities=44% Similarity=0.685 Sum_probs=69.2
Q ss_pred eeCCCCCCceEEEEEecCCC-CceEEeCCeeEEEecccCCccccCCcccEEEEEEEEEEccCCCCcCCCCceEEEEEEEE
Q psy4200 185 KDGDLAQPRSIYYDLLTNPD-EFFLIDSNTGELKTAKPLDREILGGTNGVISLTVRAREMVDGKPLQEDQATAFAQVTVT 263 (388)
Q Consensus 185 ~D~D~~~~~~v~y~l~~~~~-~~F~id~~tG~i~~~~~LD~E~~~~~~~~~~l~v~a~D~~~g~~~~~~~~~~~~~v~I~ 263 (388)
+|+|.|.|+.++|+|.++.. .+|.|++.+|.|.+.++||||. ...|.|.|+|+| .|.+ .+++.+.|.|.
T Consensus 1 ~D~D~g~n~~i~Y~i~~~~~~~~F~i~~~tg~i~~~~~LD~e~----~~~y~l~v~a~D--~~~~----~~~~~~~v~I~ 70 (79)
T smart00112 1 TDADSGENGKVTYSILSGNEDGLFSIDPETGEITTTKPLDREE----QPEYTLTVEATD--GGGP----PLSSTATVTVT 70 (79)
T ss_pred CCCCCCcCcEEEEEEecCCCCCEEEEeCCccEEEeCCccCeeC----CCeEEEEEEEEE--CCCC----CcccEEEEEEE
Confidence 48899889999999997544 8999999999999999999998 568999999998 4543 57899999999
Q ss_pred EcccCCCCC
Q psy4200 264 ILDVNDSPP 272 (388)
Q Consensus 264 V~dvNd~~P 272 (388)
|.|+|||+|
T Consensus 71 V~D~Nd~~P 79 (79)
T smart00112 71 VLDVNDNAP 79 (79)
T ss_pred EEECCCCCC
Confidence 999999998
No 9
>KOG1834|consensus
Probab=99.55 E-value=4.1e-13 Score=128.04 Aligned_cols=201 Identities=28% Similarity=0.430 Sum_probs=149.7
Q ss_pred CCCCCeeccCCCcceEEecCCCCCcEEE--EEEEECCCC------CeEEEEEecCC-cEE---EeCCee--EEEEcccCC
Q psy4200 5 GNSPPSFTTDVNSPLVVEENTPPGTIVS--TLEGVDPEG------SKVKYGIYGTD-RFS---LDRDSG--ELRVAQPLD 70 (388)
Q Consensus 5 nd~~P~F~~~~~~~~~v~E~~~~gt~v~--~v~a~D~D~------~~i~y~i~~~~-~F~---Id~~tG--~i~~~~~lD 70 (388)
|-+-|. ....|. ..|.||-. +++. -+.|.|.|. .-.-|.|.+.+ +|. +|..|| .|+.+.+||
T Consensus 27 nkhkpw-ie~ey~-gvV~Endn--tvll~Ppl~aLdkdaplr~ageiC~fklhgq~vPFdavVvdK~TGegvlRaK~~lD 102 (952)
T KOG1834|consen 27 NKHKPW-IEEEYH-GVVTENDN--TVLLDPPLAALDKDAPLRYAGEICGFKLHGQPVPFDAVVVDKYTGEGVLRAKEPLD 102 (952)
T ss_pred cccCcc-ccccee-EEEEeCCc--eEEeCCCeeeecCCCCcccccccceeEecCCCCCceEEEEeccCCceEEeecCccc
Confidence 333443 344688 88999863 3332 356777763 44567777743 454 577765 788899999
Q ss_pred CcccceeeEEEEEEEEe--cCCCCccceeeeecceeEEecCCccEEEEcCCCCccccceEEEEEEEeeCCCceeEEEEEE
Q psy4200 71 REYNSTNTSTIVLTLEG--VDPEGSKVKYGIYGTDWFSLDRDSGELRVAQPLYREDGHHTSKTNVDIRDGHHTSKTNVDI 148 (388)
Q Consensus 71 ~e~~~~~~~~l~v~a~D--~~P~f~~~~~~~~g~~v~~~D~d~g~~~~~~~~~~~~~~~~~~~~v~a~d~~~~~~~~v~V 148 (388)
.|.+..|+ |+|+|.| .+|.-++ -..+..++++|
T Consensus 103 Celqkeyt--f~iQAydCg~gpdgtn-------------------------------------------~kKShkatvhI 137 (952)
T KOG1834|consen 103 CELQKEYT--FTIQAYDCGNGPDGTN-------------------------------------------TKKSHKATVHI 137 (952)
T ss_pred ccccccce--EEEEEEecCCCCCccc-------------------------------------------cccccceEEEE
Confidence 99999999 9999998 2232110 12356679999
Q ss_pred EEeecCCCCCeeeCCcceEEEecCCCCCcEEEEEEEeeCCCCC-Cce-EEEEEecCCCCceEEeCCeeEEEecccCCccc
Q psy4200 149 RVGDVQNTPPIFINSSFSGEIMESAPIGSVVLRVEAKDGDLAQ-PRS-IYYDLLTNPDEFFLIDSNTGELKTAKPLDREI 226 (388)
Q Consensus 149 ~V~dvNd~~P~f~~~~~~~~v~E~~~~g~~v~~v~A~D~D~~~-~~~-v~y~l~~~~~~~F~id~~tG~i~~~~~LD~E~ 226 (388)
.|.|+|+.+|+|....|.+.|.|. ++-..|++|.|.|.|-+. +++ ..|.|.. ++-+|.||. .|.|+...+|.|..
T Consensus 138 rVkDvNe~AP~f~ep~Yka~V~EG-K~yd~il~veAiD~DCspq~sqIC~YEI~t-~d~PFaIdn-~G~irnTekLny~k 214 (952)
T KOG1834|consen 138 RVKDVNEFAPVFKEPWYKAHVTEG-KVYDSILRVEAIDKDCSPQYSQICEYEITT-PDVPFAIDN-DGNIRNTEKLNYTK 214 (952)
T ss_pred EeccccccCchhcccceeeEEecc-eeeeeeEEEEeecCCCCCcccceeEEEecC-CCCceEEcC-CCcccccccccccc
Confidence 999999999999999999999998 566689999999999864 555 4688885 678999974 79999999999988
Q ss_pred cCCcccEEEEEEEEEEccCCCCcCCCCceEEEEEEEEEccc
Q psy4200 227 LGGTNGVISLTVRAREMVDGKPLQEDQATAFAQVTVTILDV 267 (388)
Q Consensus 227 ~~~~~~~~~l~v~a~D~~~g~~~~~~~~~~~~~v~I~V~dv 267 (388)
...|.|+|.|.|+ |.. +....+.|+|+|...
T Consensus 215 ----e~~Y~ltVtAyDC--g~k----raa~d~lV~v~Vkp~ 245 (952)
T KOG1834|consen 215 ----EHQYKLTVTAYDC--GKK----RAASDSLVTVHVKPT 245 (952)
T ss_pred ----ceeEEEEEEEEec--ccc----cccCcceEEEEecCc
Confidence 6789999999984 332 223346777777654
No 10
>PF00028 Cadherin: Cadherin domain; InterPro: IPR002126 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion.; GO: 0005509 calcium ion binding, 0007156 homophilic cell adhesion, 0016020 membrane; PDB: 2A4E_A 2A4C_B 2O72_A 2QVI_A 1NCJ_A 3Q2W_A 3Q2N_A 3LNH_B 3LNI_A 3Q2L_A ....
Probab=99.52 E-value=1.7e-13 Score=104.19 Aligned_cols=70 Identities=37% Similarity=0.701 Sum_probs=65.3
Q ss_pred CcceEEecCCCCCcEEEEEEEECCC---CCeEEEEEecC---CcEEEeCCeeEEEEcccCCCcccceeeEEEEEEEEec
Q psy4200 16 NSPLVVEENTPPGTIVSTLEGVDPE---GSKVKYGIYGT---DRFSLDRDSGELRVAQPLDREYNSTNTSTIVLTLEGV 88 (388)
Q Consensus 16 ~~~~~v~E~~~~gt~v~~v~a~D~D---~~~i~y~i~~~---~~F~Id~~tG~i~~~~~lD~e~~~~~~~~l~v~a~D~ 88 (388)
|. ++|+|++++|+.|+++.|.|+| ++.+.|+|.++ .+|.|++.+|.|+++++||||....|. |.|.|.|.
T Consensus 1 Y~-~~v~E~~~~g~~v~~v~a~D~D~~~n~~i~y~i~~~~~~~~F~I~~~tg~i~~~~~LD~E~~~~y~--l~v~a~D~ 76 (93)
T PF00028_consen 1 YS-FSVPENAPPGTVVGQVTATDPDSGPNSQITYSILGGNPDGLFSIDPNTGEISLKKPLDRETQSSYQ--LTVRATDS 76 (93)
T ss_dssp EE-EEEETTGSTSSEEEEEEEEESSTSTTSSEEEEEEETTSTTSEEEETTTTEEEESSSSCTTTTSEEE--EEEEEEET
T ss_pred CE-EEEECCCCCCCEEEEEEEEeCCCCCCceEEEEEecCcccCceEEeeeeeccccceecCcccCCEEE--EEEEEEEC
Confidence 56 8999999999999999999998 69999999964 589999999999999999999999999 99999983
No 11
>KOG1834|consensus
Probab=99.35 E-value=2.5e-11 Score=116.04 Aligned_cols=149 Identities=23% Similarity=0.292 Sum_probs=116.1
Q ss_pred EEEEeecCCCCCeeeCCcceEEEecCCCCCcEEEEEEEeeCCCCC--Cc-eEEEEEecCCCCceE---EeCCe--eEEEe
Q psy4200 147 DIRVGDVQNTPPIFINSSFSGEIMESAPIGSVVLRVEAKDGDLAQ--PR-SIYYDLLTNPDEFFL---IDSNT--GELKT 218 (388)
Q Consensus 147 ~V~V~dvNd~~P~f~~~~~~~~v~E~~~~g~~v~~v~A~D~D~~~--~~-~v~y~l~~~~~~~F~---id~~t--G~i~~ 218 (388)
....--+|-+.|... ..|.+-|.||...-...--+.|-|.|.+. .+ ..-|.|.+. .-+|. +|..| |.|+.
T Consensus 20 ~~~aarankhkpwie-~ey~gvV~Endntvll~Ppl~aLdkdaplr~ageiC~fklhgq-~vPFdavVvdK~TGegvlRa 97 (952)
T KOG1834|consen 20 HHHAARANKHKPWIE-EEYHGVVTENDNTVLLDPPLAALDKDAPLRYAGEICGFKLHGQ-PVPFDAVVVDKYTGEGVLRA 97 (952)
T ss_pred ccccccccccCcccc-cceeEEEEeCCceEEeCCCeeeecCCCCcccccccceeEecCC-CCCceEEEEeccCCceEEee
Confidence 445566787888776 88999999996544444458888988753 12 456788753 34555 47766 57888
Q ss_pred cccCCccccCCcccEEEEEEEEEEccCCCCcCCCCceEEEEEEEEEcccCCCCCccCCCcEEEEEeCCCCCCCeeeeeEE
Q psy4200 219 AKPLDREILGGTNGVISLTVRAREMVDGKPLQEDQATAFAQVTVTILDVNDSPPVFNRKEYVVHIPEDIPDGSLLPDLDM 298 (388)
Q Consensus 219 ~~~LD~E~~~~~~~~~~l~v~a~D~~~g~~~~~~~~~~~~~v~I~V~dvNd~~P~f~~~~~~~~v~E~~~~g~~v~~l~l 298 (388)
+.+||.|. +..|+|+|+|.|+..|..+.....+..++|.|.|.|+|+.+|.|....|.+.|.|+.-....+ ++
T Consensus 98 K~~lDCel----qkeytf~iQAydCg~gpdgtn~kKShkatvhIrVkDvNe~AP~f~ep~Yka~V~EGK~yd~il---~v 170 (952)
T KOG1834|consen 98 KEPLDCEL----QKEYTFTIQAYDCGNGPDGTNTKKSHKATVHIRVKDVNEFAPVFKEPWYKAHVTEGKVYDSIL---RV 170 (952)
T ss_pred cCcccccc----cccceEEEEEEecCCCCCccccccccceEEEEEeccccccCchhcccceeeEEecceeeeeeE---EE
Confidence 99999998 568999999999765544444467788999999999999999999999999999998777665 89
Q ss_pred EEeeCC
Q psy4200 299 IVTDSD 304 (388)
Q Consensus 299 ~a~D~D 304 (388)
.|.|.|
T Consensus 171 eAiD~D 176 (952)
T KOG1834|consen 171 EAIDKD 176 (952)
T ss_pred EeecCC
Confidence 999999
No 12
>smart00112 CA Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
Probab=99.34 E-value=1e-11 Score=91.25 Aligned_cols=69 Identities=30% Similarity=0.489 Sum_probs=59.3
Q ss_pred CCeEEEEEecC---CcEEEeCCeeEEEEcccCCCcccceeeEEEEEEEEecCCCCccceeeeecceeEEecCCccEEEEc
Q psy4200 41 GSKVKYGIYGT---DRFSLDRDSGELRVAQPLDREYNSTNTSTIVLTLEGVDPEGSKVKYGIYGTDWFSLDRDSGELRVA 117 (388)
Q Consensus 41 ~~~i~y~i~~~---~~F~Id~~tG~i~~~~~lD~e~~~~~~~~l~v~a~D~~P~f~~~~~~~~g~~v~~~D~d~g~~~~~ 117 (388)
++.++|+|.++ .+|.|++.+|.|++.++||||....|. |.|.|.|.+
T Consensus 8 n~~i~Y~i~~~~~~~~F~i~~~tg~i~~~~~LD~e~~~~y~--l~v~a~D~~---------------------------- 57 (79)
T smart00112 8 NGKVTYSILSGNEDGLFSIDPETGEITTTKPLDREEQPEYT--LTVEATDGG---------------------------- 57 (79)
T ss_pred CcEEEEEEecCCCCCEEEEeCCccEEEeCCccCeeCCCeEE--EEEEEEECC----------------------------
Confidence 58899999864 689999999999999999999999999 999988821
Q ss_pred CCCCccccceEEEEEEEeeCCCceeEEEEEEEEeecCCCCC
Q psy4200 118 QPLYREDGHHTSKTNVDIRDGHHTSKTNVDIRVGDVQNTPP 158 (388)
Q Consensus 118 ~~~~~~~~~~~~~~~v~a~d~~~~~~~~v~V~V~dvNd~~P 158 (388)
...+++.+.|+|.|.|+|||+|
T Consensus 58 -------------------~~~~~~~~~v~I~V~D~Nd~~P 79 (79)
T smart00112 58 -------------------GPPLSSTATVTVTVLDVNDNAP 79 (79)
T ss_pred -------------------CCCcccEEEEEEEEEECCCCCC
Confidence 0125677899999999999998
No 13
>PF08266 Cadherin_2: Cadherin-like; InterPro: IPR013164 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion. This entry represents a cadherin domain that is usually found at the N terminus of cadherin proteins.; PDB: 1WUZ_A 1WYJ_A.
Probab=97.69 E-value=4.5e-05 Score=55.91 Aligned_cols=66 Identities=27% Similarity=0.467 Sum_probs=40.8
Q ss_pred ceEEEecCCCCCcEEEEEEEeeCCCCCC--ceEEEEEec-CCCCceEEeCCeeEEEecccCCccccCCcc
Q psy4200 165 FSGEIMESAPIGSVVLRVEAKDGDLAQP--RSIYYDLLT-NPDEFFLIDSNTGELKTAKPLDREILGGTN 231 (388)
Q Consensus 165 ~~~~v~E~~~~g~~v~~v~A~D~D~~~~--~~v~y~l~~-~~~~~F~id~~tG~i~~~~~LD~E~~~~~~ 231 (388)
...+|+|..+.|+.|+.+ |.|.-.... ....|++.+ ....+|.++..+|.|.++..+|||.+...+
T Consensus 3 i~YsV~EE~~~Gt~IGni-a~dL~l~~~~l~~~~~ri~s~~~~~~~~v~~~tG~L~v~~rIDRE~LC~~~ 71 (84)
T PF08266_consen 3 IRYSVPEEMPPGTVIGNI-AKDLGLDPQSLSSRNFRIVSEGNSQYFRVNEKTGDLFVSERIDREELCGQS 71 (84)
T ss_dssp EEEEEESS--TT-EEEEC-CCCCT--HHHHCCTTBEEE-SSSS-SEEE-TTTSEEEESS--SCCCC-TTS
T ss_pred eEEEeecCCCCCCEEEEh-HHhhCCCcccccccceEEeecCCcceeEecCCceeEEeCCccCHHHHCCCC
Confidence 457899999999999998 445432211 123566654 456899999999999999999999976433
No 14
>PF08266 Cadherin_2: Cadherin-like; InterPro: IPR013164 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion. This entry represents a cadherin domain that is usually found at the N terminus of cadherin proteins.; PDB: 1WUZ_A 1WYJ_A.
Probab=97.61 E-value=4.8e-05 Score=55.76 Aligned_cols=59 Identities=27% Similarity=0.481 Sum_probs=38.8
Q ss_pred eEEecCCCCCcEEEEEEEECCC-----CCeEEEEEec---CCcEEEeCCeeEEEEcccCCCcccceee
Q psy4200 19 LVVEENTPPGTIVSTLEGVDPE-----GSKVKYGIYG---TDRFSLDRDSGELRVAQPLDREYNSTNT 78 (388)
Q Consensus 19 ~~v~E~~~~gt~v~~v~a~D~D-----~~~i~y~i~~---~~~F~Id~~tG~i~~~~~lD~e~~~~~~ 78 (388)
++|+|..+.|+.|+.| |.|.. ...-.|++.+ ..+|.+++.+|.|+++..+|||+.+...
T Consensus 5 YsV~EE~~~Gt~IGni-a~dL~l~~~~l~~~~~ri~s~~~~~~~~v~~~tG~L~v~~rIDRE~LC~~~ 71 (84)
T PF08266_consen 5 YSVPEEMPPGTVIGNI-AKDLGLDPQSLSSRNFRIVSEGNSQYFRVNEKTGDLFVSERIDREELCGQS 71 (84)
T ss_dssp EEEESS--TT-EEEEC-CCCCT--HHHHCCTTBEEE-SSSS-SEEE-TTTSEEEESS--SCCCC-TTS
T ss_pred EEeecCCCCCCEEEEh-HHhhCCCcccccccceEEeecCCcceeEecCCceeEEeCCccCHHHHCCCC
Confidence 7899999999999999 55653 1223455543 4599999999999999999999976543
No 15
>PF08758 Cadherin_pro: Cadherin prodomain like; InterPro: IPR014868 Cadherins are a group of proteins that mediate calcium dependent cell-cell adhesion. They are activated through cleavage of a prosequence in the late Golgi. This protein corresponds to the folded region of the prosequence, and is termed the prodomain. The prodomain shows structural resemblance to the cadherin domain, but lacks all the features known to be important for cadherin-cadherin interactions []. ; GO: 0007155 cell adhesion, 0016021 integral to membrane; PDB: 1OP4_A.
Probab=97.43 E-value=0.00037 Score=51.79 Aligned_cols=76 Identities=14% Similarity=0.235 Sum_probs=42.1
Q ss_pred CCCeeccCCCcceEEecCCCCCcEEEEEEEECCCC-CeEEEEEecCCcEEEeCCeeEEEEcccCCCcccceeeEEEEEEE
Q psy4200 7 SPPSFTTDVNSPLVVEENTPPGTIVSTLEGVDPEG-SKVKYGIYGTDRFSLDRDSGELRVAQPLDREYNSTNTSTIVLTL 85 (388)
Q Consensus 7 ~~P~F~~~~~~~~~v~E~~~~gt~v~~v~a~D~D~-~~i~y~i~~~~~F~Id~~tG~i~~~~~lD~e~~~~~~~~l~v~a 85 (388)
+.|=|.+..|. +.|+.+...|..|++|.-.|..+ ..+.|....+ .|.|.+ .|.|++++++..... ... +.|.|
T Consensus 2 C~pGF~~~~~~-~~Vp~~l~~g~~lg~V~f~dC~~~~~~~~~ssDp-dF~V~~-DGsVy~~r~v~l~~~-~~~--F~V~a 75 (90)
T PF08758_consen 2 CRPGFSQKKYT-FEVPSNLEAGQPLGKVNFEDCTGRRRVIFESSDP-DFRVLE-DGSVYAKRPVQLSSE-QRS--FTVHA 75 (90)
T ss_dssp ---B--S-EEE-E----SS-SS--EEE---B--SS---EEEE---S-EEEEET-TTEEEEES--S-SSS--EE--EEEEE
T ss_pred CcCCcccceEE-EEcCchhhCCcEEEEEEeccCCCCCceEEecCCC-CEEEcC-CCeEEEeeeEecCCC-ceE--EEEEE
Confidence 45889999999 99999999999999999999875 5688876655 999998 799999999877533 235 99999
Q ss_pred Eec
Q psy4200 86 EGV 88 (388)
Q Consensus 86 ~D~ 88 (388)
.|.
T Consensus 76 ~D~ 78 (90)
T PF08758_consen 76 WDS 78 (90)
T ss_dssp EET
T ss_pred ECC
Confidence 993
No 16
>PF08758 Cadherin_pro: Cadherin prodomain like; InterPro: IPR014868 Cadherins are a group of proteins that mediate calcium dependent cell-cell adhesion. They are activated through cleavage of a prosequence in the late Golgi. This protein corresponds to the folded region of the prosequence, and is termed the prodomain. The prodomain shows structural resemblance to the cadherin domain, but lacks all the features known to be important for cadherin-cadherin interactions []. ; GO: 0007155 cell adhesion, 0016021 integral to membrane; PDB: 1OP4_A.
Probab=95.96 E-value=0.091 Score=39.04 Aligned_cols=77 Identities=18% Similarity=0.226 Sum_probs=40.8
Q ss_pred CCCeeeCCcceEEEecCCCCCcEEEEEEEeeCCCCCCceEEEEEecCCCCceEEeCCeeEEEecccCCccccCCcccEEE
Q psy4200 156 TPPIFINSSFSGEIMESAPIGSVVLRVEAKDGDLAQPRSIYYDLLTNPDEFFLIDSNTGELKTAKPLDREILGGTNGVIS 235 (388)
Q Consensus 156 ~~P~f~~~~~~~~v~E~~~~g~~v~~v~A~D~D~~~~~~v~y~l~~~~~~~F~id~~tG~i~~~~~LD~E~~~~~~~~~~ 235 (388)
+.|-|.+..|.+.|+.+...|..|++|.-.|-... ..+.|... + ..|.|.+ .|.|.+++++..... .-.
T Consensus 2 C~pGF~~~~~~~~Vp~~l~~g~~lg~V~f~dC~~~--~~~~~~ss-D--pdF~V~~-DGsVy~~r~v~l~~~-----~~~ 70 (90)
T PF08758_consen 2 CRPGFSQKKYTFEVPSNLEAGQPLGKVNFEDCTGR--RRVIFESS-D--PDFRVLE-DGSVYAKRPVQLSSE-----QRS 70 (90)
T ss_dssp ---B--S-EEEE----SS-SS--EEE---B--SS-----EEEE------SEEEEET-TTEEEEES--S-SSS------EE
T ss_pred CcCCcccceEEEEcCchhhCCcEEEEEEeccCCCC--CceEEecC-C--CCEEEcC-CCeEEEeeeEecCCC-----ceE
Confidence 46889999999999999999999999999887433 46777764 2 3799964 799999999887542 358
Q ss_pred EEEEEEEc
Q psy4200 236 LTVRAREM 243 (388)
Q Consensus 236 l~v~a~D~ 243 (388)
|.|.|.|.
T Consensus 71 F~V~a~D~ 78 (90)
T PF08758_consen 71 FTVHAWDS 78 (90)
T ss_dssp EEEEEEET
T ss_pred EEEEEECC
Confidence 99999984
No 17
>smart00736 CADG Dystroglycan-type cadherin-like domains. Cadherin-homologous domains present in metazoan dystroglycans and alpha/epsilon sarcoglycans, yeast Axl2p and in a very large protein from magnetotactic bacteria. Likely to bind calcium ions.
Probab=94.17 E-value=1.1 Score=33.77 Aligned_cols=69 Identities=30% Similarity=0.526 Sum_probs=49.0
Q ss_pred EeeCCCCCCceEEEEEecC----CCCceEEeCCeeEEEecccCCccccCCcccEEEEEEEEEEccCCCCcCCCCceEEEE
Q psy4200 184 AKDGDLAQPRSIYYDLLTN----PDEFFLIDSNTGELKTAKPLDREILGGTNGVISLTVRAREMVDGKPLQEDQATAFAQ 259 (388)
Q Consensus 184 A~D~D~~~~~~v~y~l~~~----~~~~F~id~~tG~i~~~~~LD~E~~~~~~~~~~l~v~a~D~~~g~~~~~~~~~~~~~ 259 (388)
..|+| + ..++|++... -..|..+++.++.+.-. |..... +.|.+.|.|+|. .| .+....
T Consensus 24 F~d~d-~--~~lty~~~~~~~~~lP~Wl~fd~~~~~~~Gt-P~~~~~-----g~~~i~v~a~D~-~g-------~~~~~~ 86 (97)
T smart00736 24 FTDAD-G--DTLTYSATLSDGSALPSWLSFDSDTGTLSGT-PTNSDV-----GSLSLKVTATDS-SG-------ASASDT 86 (97)
T ss_pred eECCC-C--CeEEEEEEeCCCCCCCCeEEEeCCCCEEEEE-CCCCCC-----cEEEEEEEEEEC-CC-------CEEEEE
Confidence 45666 2 3788888631 24699999998887773 444332 369999999973 12 467888
Q ss_pred EEEEEcccCC
Q psy4200 260 VTVTILDVND 269 (388)
Q Consensus 260 v~I~V~dvNd 269 (388)
+.|.|.+.|+
T Consensus 87 f~i~V~~~~~ 96 (97)
T smart00736 87 FTITVVNTND 96 (97)
T ss_pred EEEEEeCCCC
Confidence 9999999886
No 18
>smart00736 CADG Dystroglycan-type cadherin-like domains. Cadherin-homologous domains present in metazoan dystroglycans and alpha/epsilon sarcoglycans, yeast Axl2p and in a very large protein from magnetotactic bacteria. Likely to bind calcium ions.
Probab=93.63 E-value=1.4 Score=33.23 Aligned_cols=49 Identities=20% Similarity=0.338 Sum_probs=33.9
Q ss_pred EEECCCCCeEEEEEec------CCcEEEeCCeeEEEEcccCCCcccceeeEEEEEEEEe
Q psy4200 35 EGVDPEGSKVKYGIYG------TDRFSLDRDSGELRVAQPLDREYNSTNTSTIVLTLEG 87 (388)
Q Consensus 35 ~a~D~D~~~i~y~i~~------~~~F~Id~~tG~i~~~~~lD~e~~~~~~~~l~v~a~D 87 (388)
...|+|+..++|++.. +.+.+.|+.++.++=. +...+ ...+. ++|.|+|
T Consensus 23 tF~d~d~~~lty~~~~~~~~~lP~Wl~fd~~~~~~~Gt-P~~~~-~g~~~--i~v~a~D 77 (97)
T smart00736 23 TFTDADGDTLTYSATLSDGSALPSWLSFDSDTGTLSGT-PTNSD-VGSLS--LKVTATD 77 (97)
T ss_pred ceECCCCCeEEEEEEeCCCCCCCCeEEEeCCCCEEEEE-CCCCC-CcEEE--EEEEEEE
Confidence 3578888899999973 3478999988887764 33333 34477 6666666
No 19
>TIGR01965 VCBS_repeat VCBS repeat. This domain of about 100 residues is found multiple (up to 35) copies in long proteins from several species of Vibrio, Colwellia, Bradyrhizobium, and Shewanella (hence the name VCBS) and in smaller copy numbers in proteins from several other bacteria. The large protein size and repeat copy numbers, species distribution, and suggested activities of several member proteins suggests a role for this domain in adhesion.
Probab=92.98 E-value=1.3 Score=33.36 Aligned_cols=39 Identities=18% Similarity=0.313 Sum_probs=28.0
Q ss_pred EEEEEeeCCCceeEEEEEEEEeecCCCCCeeeCCcceEEEecCC
Q psy4200 130 KTNVDIRDGHHTSKTNVDIRVGDVQNTPPIFINSSFSGEIMESA 173 (388)
Q Consensus 130 ~~~v~a~d~~~~~~~~v~V~V~dvNd~~P~f~~~~~~~~v~E~~ 173 (388)
.|++.+.||. ...|+|.|.-.|| +|+.. ..-...+.|+.
T Consensus 60 sFtvtv~DGt---t~~vtItI~GtND-apvi~-~~~~g~v~ED~ 98 (99)
T TIGR01965 60 TFTVTSADGT---SQTVTITITGAND-AAVIG-GADTGSVTEDS 98 (99)
T ss_pred EEEEEEeCCC---eEEEEEEEEccCC-CCEEe-cccceeEecCC
Confidence 5667777773 7789999999997 57665 34456777763
No 20
>TIGR01965 VCBS_repeat VCBS repeat. This domain of about 100 residues is found multiple (up to 35) copies in long proteins from several species of Vibrio, Colwellia, Bradyrhizobium, and Shewanella (hence the name VCBS) and in smaller copy numbers in proteins from several other bacteria. The large protein size and repeat copy numbers, species distribution, and suggested activities of several member proteins suggests a role for this domain in adhesion.
Probab=92.20 E-value=1.1 Score=33.93 Aligned_cols=87 Identities=24% Similarity=0.258 Sum_probs=53.3
Q ss_pred EEEEEeeCCCCCCceEEEEEec--CCCCceEEeCCeeEEEecc--------cCCccccCCcccEEEEEEEEEEccCCCCc
Q psy4200 180 LRVEAKDGDLAQPRSIYYDLLT--NPDEFFLIDSNTGELKTAK--------PLDREILGGTNGVISLTVRAREMVDGKPL 249 (388)
Q Consensus 180 ~~v~A~D~D~~~~~~v~y~l~~--~~~~~F~id~~tG~i~~~~--------~LD~E~~~~~~~~~~l~v~a~D~~~g~~~ 249 (388)
+++.++|+|.+. ...+++.. +..+.|.|++ +|.....- .|.... ...-.|.+.+. ||.
T Consensus 2 G~Lt~sD~D~gd--~~~~s~~~~~g~yGtlti~~-~G~wtYtl~n~~~avq~L~~Ge----~~tdsFtvtv~---DGt-- 69 (99)
T TIGR01965 2 GQLTISDADAGQ--AHFIAQTDAAGQYGTFSIDA-DGQWTYQADNSQTAVQALKAGE----TLTDTFTVTSA---DGT-- 69 (99)
T ss_pred CceEEeCCCCCC--ceEEecccccCCcEEEEECC-CCcEEEEeCCCcHHHHhhcCCC----EEEEEEEEEEe---CCC--
Confidence 468889999875 45555542 4467788876 66554421 122111 33567888888 452
Q ss_pred CCCCceEEEEEEEEEcccCCCCCccCCCcEEEEEeCCC
Q psy4200 250 QEDQATAFAQVTVTILDVNDSPPVFNRKEYVVHIPEDI 287 (388)
Q Consensus 250 ~~~~~~~~~~v~I~V~dvNd~~P~f~~~~~~~~v~E~~ 287 (388)
+..|.|+|...||.| .... .-...+.|+.
T Consensus 70 -------t~~vtItI~GtNDap-vi~~-~~~g~v~ED~ 98 (99)
T TIGR01965 70 -------SQTVTITITGANDAA-VIGG-ADTGSVTEDS 98 (99)
T ss_pred -------eEEEEEEEEccCCCC-EEec-ccceeEecCC
Confidence 678999999999755 4332 2245666653
No 21
>PF07495 Y_Y_Y: Y_Y_Y domain; InterPro: IPR011123 This region is mostly found at the end of the beta propellers (IPR011110 from INTERPRO) in a family of two component regulators. However they are also found tandemly repeated in Q891H4 from SWISSPROT without other signal conduction domains being present. It is named after the conserved tyrosines found in the alignment. The exact function is not known.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=82.19 E-value=7.7 Score=26.44 Aligned_cols=45 Identities=24% Similarity=0.300 Sum_probs=26.8
Q ss_pred CCceEEEEEecCCCCceEEeCCeeEEEecccCCccccCCcccEEEEEEEEEEc
Q psy4200 191 QPRSIYYDLLTNPDEFFLIDSNTGELKTAKPLDREILGGTNGVISLTVRAREM 243 (388)
Q Consensus 191 ~~~~v~y~l~~~~~~~F~id~~tG~i~~~~~LD~E~~~~~~~~~~l~v~a~D~ 243 (388)
.+-..+|+|.+-...+..+...+-.+ .+-.++ .+.|+|.|+|.|.
T Consensus 6 ~~~~Y~Y~l~g~d~~W~~~~~~~~~~------~~~~L~--~G~Y~l~V~a~~~ 50 (66)
T PF07495_consen 6 ENIRYRYRLEGFDDEWITLGSYSNSI------SYTNLP--PGKYTLEVRAKDN 50 (66)
T ss_dssp TTEEEEEEEETTESSEEEESSTS-EE------EEES----SEEEEEEEEEEET
T ss_pred CceEEEEEEECCCCeEEECCCCcEEE------EEEeCC--CEEEEEEEEEECC
Confidence 34467788876555666664332222 222333 5789999999985
No 22
>TIGR00845 caca sodium/calcium exchanger 1. This model is specific for the eukaryotic sodium ion/calcium ion exchangers of the Caca family
Probab=78.24 E-value=1e+02 Score=33.17 Aligned_cols=47 Identities=19% Similarity=0.214 Sum_probs=28.3
Q ss_pred cCCCCCeeeCCcceEEEecCCCCCcEEEEEEEeeCCCCCCceEEEEEec
Q psy4200 153 VQNTPPIFINSSFSGEIMESAPIGSVVLRVEAKDGDLAQPRSIYYDLLT 201 (388)
Q Consensus 153 vNd~~P~f~~~~~~~~v~E~~~~g~~v~~v~A~D~D~~~~~~v~y~l~~ 201 (388)
.||.++.|.-..-..+|.|++ |+.-..|.-...|.+....+.|+..+
T Consensus 395 ~dd~~s~i~Fe~~~Y~V~En~--GtV~VtV~R~GGdl~~tVsVdY~T~D 441 (928)
T TIGR00845 395 ENDPVSKIFFEPGHYTCLENC--GTVALTVVRRGGDLTNTVYVDYRTED 441 (928)
T ss_pred ccCCcceEEecCCeEEEeecC--cEEEEEEEEccCCCCceEEEEEEccC
Confidence 455566655555556899984 66656665544444444567777654
No 23
>TIGR03660 T1SS_rpt_143 T1SS-143 repeat domain. This model represents a domain of about 143 amino acids that may occur singly or in up to 23 tandem repeats in very large proteins in the genus Vibrio, and in related species such as Legionella pneumophila, Photobacterium profundum, Rhodopseudomonas palustris, Shewanella pealeana, and Aeromonas hydrophila. Proteins with these domains represent a subset of a broader set of proteins with a particular signal for type 1 secretion, consisting of several glycine-rich repeats modeled by pfam00353, followed by a C-terminal domain modeled by TIGR03661. Proteins with this domain tend to share several properties with the RtxA (Repeats in Toxin) protein of Vibrio cholerae, including a large size often containing tandemly repeated domains and a C-terminal signal for type 1 secretion.
Probab=74.94 E-value=43 Score=27.04 Aligned_cols=60 Identities=25% Similarity=0.416 Sum_probs=37.3
Q ss_pred EEEecccCCccccCCcccEEEEEEEEEEccCCCCcCCCCceEEEEEEEEEcccCCCCCccCCCcEEEEEeCCC
Q psy4200 215 ELKTAKPLDREILGGTNGVISLTVRAREMVDGKPLQEDQATAFAQVTVTILDVNDSPPVFNRKEYVVHIPEDI 287 (388)
Q Consensus 215 ~i~~~~~LD~E~~~~~~~~~~l~v~a~D~~~g~~~~~~~~~~~~~v~I~V~dvNd~~P~f~~~~~~~~v~E~~ 287 (388)
.+.+.++||...- ...-...|.|.|+|. +|.. +...+.|+|.| |. |...... ..+|.|+.
T Consensus 69 tftL~~~lDH~~g-~d~l~l~~~v~a~D~-DGD~-------s~~~l~VtI~D--D~-P~~~~~~-~~~V~E~~ 128 (137)
T TIGR03660 69 EFTLEGPLDHAAG-SDELTLNFPIIATDF-DGDT-------SSITLPVTIVD--DV-PTITDVD-ALTVDEDD 128 (137)
T ss_pred EEEEcccccCCCC-CceEEEeeeEEEEeC-CCCc-------cccEEEEEEEC--CC-Ceecccc-ceEEeccc
Confidence 4455666666431 113457889999987 6653 23588888887 44 6665433 37888853
No 24
>PF05345 He_PIG: Putative Ig domain; InterPro: IPR008009 This alignment represents the conserved core region of a ~90 residue repeat found in several haemagglutinins and other cell surface proteins. Sequence similarities to Hyalin (IPR003410 from INTERPRO) and the PKD domain (IPR000601 from INTERPRO) suggest an Ig-like fold so this family may be similar in function to the (IPR003791 from INTERPRO) and (IPR003790 from INTERPRO) protein families.
Probab=67.97 E-value=25 Score=22.61 Aligned_cols=34 Identities=21% Similarity=0.306 Sum_probs=25.1
Q ss_pred CCceEEeCCeeEEEecccCCccccCCcccEEEEEEEEEE
Q psy4200 204 DEFFLIDSNTGELKTAKPLDREILGGTNGVISLTVRARE 242 (388)
Q Consensus 204 ~~~F~id~~tG~i~~~~~LD~E~~~~~~~~~~l~v~a~D 242 (388)
..+..||+.+|.|.-.-.-.-+ .+.|.+.|.|+|
T Consensus 13 P~gLs~d~~tG~isGtp~~~~~-----~G~y~~~vtatd 46 (49)
T PF05345_consen 13 PSGLSLDPSTGTISGTPTSSVQ-----PGTYTFTVTATD 46 (49)
T ss_pred CCcEEEeCCCCEEEeecCCCcc-----ccEEEEEEEEEc
Confidence 4678999999999886332211 247999999996
No 25
>TIGR00845 caca sodium/calcium exchanger 1. This model is specific for the eukaryotic sodium ion/calcium ion exchangers of the Caca family
Probab=56.63 E-value=2.9e+02 Score=29.95 Aligned_cols=59 Identities=15% Similarity=0.141 Sum_probs=36.3
Q ss_pred ccCCCCCeeccCCCcceEEecCCCCCcEEEEEEEECCC-C--CeEEEEEecC-----CcEEEeCCeeEEEEc
Q psy4200 3 AYGNSPPSFTTDVNSPLVVEENTPPGTIVSTLEGVDPE-G--SKVKYGIYGT-----DRFSLDRDSGELRVA 66 (388)
Q Consensus 3 d~nd~~P~F~~~~~~~~~v~E~~~~gt~v~~v~a~D~D-~--~~i~y~i~~~-----~~F~Id~~tG~i~~~ 66 (388)
+.||..++|....-. ..|.|+. |+.-.+|.-...+ + -.+.|+..++ .-|. +.+|.|...
T Consensus 394 ~~dd~~s~i~Fe~~~-Y~V~En~--GtV~VtV~R~GGdl~~tVsVdY~T~DGTA~AG~DY~--~~sGTLtF~ 460 (928)
T TIGR00845 394 EENDPVSKIFFEPGH-YTCLENC--GTVALTVVRRGGDLTNTVYVDYRTEDGTANAGSDYE--FTEGTLVFK 460 (928)
T ss_pred cccCCcceEEecCCe-EEEeecC--cEEEEEEEEccCCCCceEEEEEEccCCccCCCCCcc--ccCceEEEC
Confidence 356667777666666 7899996 7776666655433 2 5588887642 2333 235766544
No 26
>KOG3597|consensus
Probab=49.15 E-value=38 Score=33.05 Aligned_cols=62 Identities=23% Similarity=0.228 Sum_probs=46.7
Q ss_pred EEEEEEEEEEeCCCCCCeeeccceEEEEecCCCCCcEEEEEEEEcCCCCCCCceeEEEEEEcCce
Q psy4200 319 SSATLIVQVTDVNDNVPSFELNAYTGNVLETAQAGTSITTITALDSDGGDYGTGGIVYELLGEYG 383 (388)
Q Consensus 319 ~~~~v~I~V~dvND~~P~f~~~~y~~~v~e~~~~g~~v~~v~a~D~D~~~~~~~~i~ysi~~~~~ 383 (388)
-+....|.|..+||.+..+....+.+-+.|+...-.-.-.+.+.|+|...- ++.|+|.+...
T Consensus 24 ~~~~~~i~v~pvndpp~~~~~~~~~l~~~~~~~k~l~~~~l~~~d~d~~~~---~l~f~v~~t~~ 85 (442)
T KOG3597|consen 24 QTDVLRIHVNPVNDPPSLIFPSGSLLVILEGGQKVLDPELLTAADPDSAPL---PLEFQVLGTSS 85 (442)
T ss_pred EEeeecccccccCCCcceeecccceEEeecCCceeccceEeeccCCCCCcc---ceEEEEccCCC
Confidence 566788999999997777777777777777765433345699999998874 38999986543
No 27
>PF03160 Calx-beta: Calx-beta domain; InterPro: IPR003644 The calx-beta motif is present as a tandem repeat in the cytoplasmic domains of Calx Na-Ca exchangers, which are used to expel calcium from cells. This motif overlaps domains used for calcium binding and regulation. The calx-beta motif is also present in the cytoplasmic tail of mammalian integrin-beta4, which mediates the bi-directional transfer of signals across the plasma membrane, as well as in some cyanobacterial proteins. This motif contains a series of beta-strands and turns that form a self-contained beta-sheet [, ].; GO: 0007154 cell communication, 0016021 integral to membrane; PDB: 3H6A_B 3FSO_A 3FQ4_B 2DPK_A 2QVM_A 3GIN_B 2QVK_A 2FWU_A 2FWS_A 3E9U_A ....
Probab=47.37 E-value=92 Score=23.05 Aligned_cols=53 Identities=21% Similarity=0.248 Sum_probs=28.1
Q ss_pred EEEEEeCCCCCCeeeccceEEEEecCCCCCcEEEEEEEEcCCCCCCCceeEEEEEEcCc
Q psy4200 324 IVQVTDVNDNVPSFELNAYTGNVLETAQAGTSITTITALDSDGGDYGTGGIVYELLGEY 382 (388)
Q Consensus 324 ~I~V~dvND~~P~f~~~~y~~~v~e~~~~g~~v~~v~a~D~D~~~~~~~~i~ysi~~~~ 382 (388)
+|.|+| ||. |.+.-..-..++.|+. |..-..|.-...+.... -.+.|+..++.
T Consensus 2 tvtI~d-~d~-~~v~f~~~~~~v~E~~--~~~~v~V~~~~~~~~~~--v~v~~~~~~gt 54 (100)
T PF03160_consen 2 TVTILD-DDD-PTVSFSSPSYTVSEGD--GTVTVTVTRSGGSLDGP--VTVNYSTVDGT 54 (100)
T ss_dssp EEEEE--TTS-EEEEESSSEEEEETTS--SEEEEEEEEESS-TSSE--EEEEEEEEESS
T ss_pred EEEEEC-CCC-CEEEEeCCEEEEEeCC--CEEEEEEEEcccCCCcc--eEEEEEEeCCc
Confidence 567778 664 4766555556777875 33444455444432222 23777766543
No 28
>KOG3597|consensus
Probab=33.71 E-value=1e+02 Score=30.27 Aligned_cols=59 Identities=20% Similarity=0.180 Sum_probs=45.9
Q ss_pred eEEEEEEEEeecCCCCCeeeCCcceEEEecCCCCCcEEEEEEEeeCCCCCCceEEEEEec
Q psy4200 142 SKTNVDIRVGDVQNTPPIFINSSFSGEIMESAPIGSVVLRVEAKDGDLAQPRSIYYDLLT 201 (388)
Q Consensus 142 ~~~~v~V~V~dvNd~~P~f~~~~~~~~v~E~~~~g~~v~~v~A~D~D~~~~~~v~y~l~~ 201 (388)
.+....|.|..+||.+..+....+.+-+.|+...-.....+.+.|+|... ..+.|++.+
T Consensus 24 ~~~~~~i~v~pvndpp~~~~~~~~~l~~~~~~~k~l~~~~l~~~d~d~~~-~~l~f~v~~ 82 (442)
T KOG3597|consen 24 QTDVLRIHVNPVNDPPSLIFPSGSLLVILEGGQKVLDPELLTAADPDSAP-LPLEFQVLG 82 (442)
T ss_pred EEeeecccccccCCCcceeecccceEEeecCCceeccceEeeccCCCCCc-cceEEEEcc
Confidence 45678899999999888777777778888886555556778899999774 468888885
No 29
>PF05895 DUF859: Siphovirus protein of unknown function (DUF859); InterPro: IPR008577 This entry is represented by Streptococcus phage 7201, Orf39. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This family consists of several uncharacterised proteins from a number of the Siphoviruses as well as some bacterial proteins from Streptococcus species. Some of the members of this family are described as putative minor structural proteins.
Probab=26.65 E-value=7.4e+02 Score=25.70 Aligned_cols=102 Identities=13% Similarity=0.154 Sum_probs=52.8
Q ss_pred EEEEEeeCC--CceeEEEEEEEEeecCCCCCeeeCCcceEEEecCCCCCcEEEEEEEeeCCC------CCCceEEEEEec
Q psy4200 130 KTNVDIRDG--HHTSKTNVDIRVGDVQNTPPIFINSSFSGEIMESAPIGSVVLRVEAKDGDL------AQPRSIYYDLLT 201 (388)
Q Consensus 130 ~~~v~a~d~--~~~~~~~v~V~V~dvNd~~P~f~~~~~~~~v~E~~~~g~~v~~v~A~D~D~------~~~~~v~y~l~~ 201 (388)
.+++.++|. ..++.....|.|++=. +|.+.-..++..-.++ .......|.=... -...++.|+...
T Consensus 301 Ti~atVtDSRGr~S~~~~~tItVl~Y~--~P~lsfsv~R~~~~~~----~~~v~~~a~Iapl~v~g~qKN~~~lt~~~a~ 374 (624)
T PF05895_consen 301 TIRATVTDSRGRTSDPKTKTITVLEYS--PPTLSFSVYRCGSSGN----TLTVTRNAKIAPLTVNGVQKNTMTLTFKVAP 374 (624)
T ss_pred EEEEEEEECCCccCCceEEEEEEEEcC--CCcEEEEEEEeCCCCc----EEEEEEEEEEeEEEEcccccceEEEEEEEEE
Confidence 445555664 3456778999999874 7877533332222222 1112222211111 112356777665
Q ss_pred CCCCceEEeCC-------------eeEEEecccCCccccCCcccEEEEEEEEEEc
Q psy4200 202 NPDEFFLIDSN-------------TGELKTAKPLDREILGGTNGVISLTVRAREM 243 (388)
Q Consensus 202 ~~~~~F~id~~-------------tG~i~~~~~LD~E~~~~~~~~~~l~v~a~D~ 243 (388)
-....|.+|.. .+...+...+|-+. .|.+.+.++|.
T Consensus 375 ~gt~~~t~d~~~a~~~~s~~s~~~~~~~~L~g~y~~~k------Sy~V~~~l~D~ 423 (624)
T PF05895_consen 375 LGTGTFTTDNGSASGTWSSISELTNSSANLGGTYDAEK------SYDVRGTLSDK 423 (624)
T ss_pred cCcceEEEEccccccceeeeeeecccceeeccccCCCc------eEEEEEEEEEE
Confidence 34455655432 11234445566655 79999999984
No 30
>PF09100 Qn_am_d_aIV: Quinohemoprotein amine dehydrogenase, alpha subunit domain IV; InterPro: IPR015184 This domain is predominantly found in the prokaryotic protein quinohemoprotein amine dehydrogenase, adopting an immunoglobulin-like beta-sandwich fold, with seven strands arranged into two beta sheets; the fold is possibly related to the immunoglobulin and/or fibronectin type III superfamilies. The precise function of this domain has not, as yet, been defined []. ; PDB: 1JMZ_A 1JMX_A 1PBY_A 1JJU_A.
Probab=26.14 E-value=1.2e+02 Score=23.98 Aligned_cols=30 Identities=33% Similarity=0.415 Sum_probs=15.9
Q ss_pred eEEEEecCCCceeEEEEEEEEEEEeCCCCCC
Q psy4200 305 LVIAEETHTAEKLSSSATLIVQVTDVNDNVP 335 (388)
Q Consensus 305 ~v~~~~~~~~~~~s~~~~v~I~V~dvND~~P 335 (388)
.|+|+-..+...++..+.+.|+|.+.|+ +|
T Consensus 103 ~VvAtv~d~~~~l~~e~~liVtVqr~~~-pp 132 (133)
T PF09100_consen 103 KVVATVKDGGKPLTGEAHLIVTVQRWNN-PP 132 (133)
T ss_dssp EEEEEETTTT---EEEEEEEEE---S----S
T ss_pred EEEEEEccCCcccceeEeEEEEeecccC-CC
Confidence 3455555667789999999999999987 66
No 31
>cd02848 Chitinase_N_term Chitinase N-terminus domain. Chitinases hydrolyze the abundant natural biopolymer chitin, producing smaller chito-oligosaccharides. Chitin consists of multiple N-acetyl-D-glucosamine (NAG) residues connected via beta-1,4-glycosidic linkages and is an important structural element of fungal cell wall and arthropod exoskeletons. On the basis of the mode of chitin hydrolysis, chitinases are classified as random, endo-, and exo-chitinases and based on sequence criteria, chitinases belong to families 18 and 19 of glycosyl hydrolases. The N-terminus of chitinase may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitob
Probab=20.45 E-value=1.2e+02 Score=23.14 Aligned_cols=28 Identities=21% Similarity=0.264 Sum_probs=18.7
Q ss_pred ccEEEEEEEEEEccCCCCcCCCCceEEEEEEEEEcc
Q psy4200 231 NGVISLTVRAREMVDGKPLQEDQATAFAQVTVTILD 266 (388)
Q Consensus 231 ~~~~~l~v~a~D~~~g~~~~~~~~~~~~~v~I~V~d 266 (388)
.+.|.+.|+++|. +|+ +..+.+.|.|-|
T Consensus 79 gG~y~m~V~lCn~-dGC-------S~S~~~~I~VAD 106 (106)
T cd02848 79 GGRYQMQVALCNG-DGC-------STSAAKEIVVAD 106 (106)
T ss_pred CCeEEEEEEEECC-CCc-------cCcCCEEEEecC
Confidence 6789999999975 443 344555665543
Done!