Query psy7562
Match_columns 582
No_of_seqs 338 out of 2177
Neff 8.9
Searched_HMMs 46136
Date Fri Aug 16 21:36:14 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy7562.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/7562hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG4289|consensus 100.0 2.8E-85 6.1E-90 700.8 56.3 566 3-581 225-945 (2531)
2 KOG4289|consensus 100.0 1.1E-73 2.4E-78 612.0 49.2 523 45-580 159-839 (2531)
3 KOG1219|consensus 100.0 3E-72 6.5E-77 617.4 57.8 562 3-580 477-1214(4289)
4 KOG1219|consensus 100.0 9.7E-70 2.1E-74 597.7 60.6 559 1-577 2035-2834(4289)
5 cd00031 CA Cadherin repeat dom 100.0 2E-30 4.3E-35 247.2 29.4 196 324-522 2-199 (199)
6 cd00031 CA Cadherin repeat dom 100.0 9.1E-28 2E-32 228.8 28.9 193 109-417 2-199 (199)
7 PF00028 Cadherin: Cadherin do 99.7 9.5E-17 2.1E-21 133.0 14.5 92 430-521 1-93 (93)
8 KOG1834|consensus 99.7 2.8E-15 6.1E-20 154.0 20.1 207 307-522 21-244 (952)
9 PF00028 Cadherin: Cadherin do 99.7 4E-15 8.8E-20 123.1 14.9 90 324-416 1-93 (93)
10 smart00112 CA Cadherin repeats 99.6 5.2E-15 1.1E-19 118.5 11.0 79 450-528 1-79 (79)
11 smart00112 CA Cadherin repeats 99.5 1.7E-13 3.6E-18 109.7 10.7 76 345-423 2-79 (79)
12 KOG1834|consensus 99.5 3.7E-12 8E-17 131.5 19.0 198 104-417 32-244 (952)
13 PF08266 Cadherin_2: Cadherin- 98.2 4.2E-06 9.1E-11 66.7 6.1 61 109-170 3-68 (84)
14 PF08758 Cadherin_pro: Cadheri 97.4 0.0016 3.4E-08 52.7 9.8 79 315-402 2-80 (90)
15 PF08266 Cadherin_2: Cadherin- 97.1 0.00079 1.7E-08 53.7 4.8 60 324-387 3-66 (84)
16 PF08758 Cadherin_pro: Cadheri 97.0 0.0082 1.8E-07 48.6 9.5 79 422-508 3-81 (90)
17 smart00736 CADG Dystroglycan-t 96.8 0.02 4.4E-07 47.3 11.0 73 343-420 24-96 (97)
18 smart00736 CADG Dystroglycan-t 95.9 0.13 2.7E-06 42.5 10.9 71 448-525 23-96 (97)
19 TIGR01965 VCBS_repeat VCBS rep 95.8 0.11 2.4E-06 42.6 9.8 89 339-438 2-98 (99)
20 TIGR01965 VCBS_repeat VCBS rep 94.9 0.2 4.3E-06 41.2 8.3 34 124-158 2-37 (99)
21 PF13750 Big_3_3: Bacterial Ig 94.7 3.2 7E-05 37.5 17.3 127 388-521 14-148 (158)
22 TIGR00845 caca sodium/calcium 88.9 14 0.00031 42.7 16.1 143 418-567 395-569 (928)
23 TIGR00845 caca sodium/calcium 88.5 54 0.0012 38.2 20.3 149 312-469 395-571 (928)
24 PF05345 He_PIG: Putative Ig d 85.4 3.4 7.3E-05 29.2 5.7 39 467-506 10-48 (49)
25 PF05345 He_PIG: Putative Ig d 84.9 5.5 0.00012 28.1 6.6 46 354-401 2-48 (49)
26 PF07495 Y_Y_Y: Y_Y_Y domain; 82.4 14 0.00031 27.4 8.7 62 455-521 5-66 (66)
27 KOG3597|consensus 76.7 47 0.001 35.3 12.9 153 302-467 25-195 (442)
28 TIGR03660 T1SS_rpt_143 T1SS-14 70.2 23 0.00049 31.2 7.5 56 377-438 70-128 (137)
29 TIGR03660 T1SS_rpt_143 T1SS-14 67.3 16 0.00035 32.1 6.0 54 1-63 72-128 (137)
30 PF13750 Big_3_3: Bacterial Ig 60.5 1.3E+02 0.0028 27.1 18.1 30 385-416 119-148 (158)
31 PF03160 Calx-beta: Calx-beta 59.5 94 0.002 25.2 10.9 53 412-468 2-54 (100)
32 PF07495 Y_Y_Y: Y_Y_Y domain; 42.0 57 0.0012 24.0 4.6 28 10-37 38-65 (66)
33 PF03160 Calx-beta: Calx-beta 41.1 1.9E+02 0.0041 23.3 10.0 50 306-359 2-53 (100)
34 smart00089 PKD Repeats in poly 39.6 66 0.0014 24.6 4.9 30 5-37 49-78 (79)
35 cd00146 PKD polycystic kidney 37.8 67 0.0015 24.8 4.6 30 5-36 51-80 (81)
36 cd02848 Chitinase_N_term Chiti 36.8 69 0.0015 26.7 4.5 34 487-522 73-106 (106)
37 PF12245 Big_3_2: Bacterial Ig 35.4 1.5E+02 0.0032 21.8 5.8 30 492-523 21-50 (60)
38 PF12245 Big_3_2: Bacterial Ig 35.4 1E+02 0.0023 22.6 5.0 28 10-39 22-49 (60)
39 cd00146 PKD polycystic kidney 34.2 85 0.0019 24.1 4.7 30 488-519 51-80 (81)
40 smart00089 PKD Repeats in poly 30.5 1.1E+02 0.0025 23.3 4.8 31 487-520 48-78 (79)
41 KOG3597|consensus 30.4 3.8E+02 0.0083 28.6 9.9 59 407-466 24-82 (442)
42 PF02494 HYR: HYR domain; Int 27.4 1.1E+02 0.0024 23.7 4.3 24 12-37 58-81 (81)
43 PF09100 Qn_am_d_aIV: Quinohem 26.8 93 0.002 26.7 3.7 33 10-42 97-130 (133)
44 cd02848 Chitinase_N_term Chiti 25.7 94 0.002 25.9 3.5 32 5-38 74-105 (106)
45 KOG4221|consensus 24.7 1.3E+03 0.029 28.1 17.6 177 334-529 439-620 (1381)
46 PF08329 ChitinaseA_N: Chitina 22.8 2.2E+02 0.0048 24.9 5.5 53 487-545 76-128 (133)
47 KOG4221|consensus 20.4 1.6E+03 0.035 27.4 17.8 69 352-424 552-620 (1381)
48 PF14302 DUF4377: Domain of un 20.3 3.1E+02 0.0067 21.5 5.5 35 4-38 41-77 (80)
No 1
>KOG4289|consensus
Probab=100.00 E-value=2.8e-85 Score=700.82 Aligned_cols=566 Identities=30% Similarity=0.476 Sum_probs=503.8
Q ss_pred cceecccCCEEEEEEEEEECCCCCceeEEEEEEEEEecCCCCCCCCcccCCCceEEEeCCCCCCcEEEEEEEEcCCCC--
Q psy7562 3 NIIVFVAGQEFDLLVRATDHGSPNQSSTARVSIQVVRLPDESKNPPTIKSPDQRVEVTENDAVGFLVALIQASDPDGD-- 80 (582)
Q Consensus 3 ~~ldyE~~~~~~l~v~a~D~g~p~~~~~~~v~I~V~d~ND~~~~~P~f~~~~~~~~v~E~~~~g~~v~~v~A~D~D~~-- 80 (582)
+.||.|+...|.|.|+|.|.|.|++|++++|.|.|.|.|| |.|+|++..|.-++.||.++|+.|++|+|+|.|.+
T Consensus 225 ~~lDREt~e~HvlrVtA~d~~~P~~SAtttv~V~V~D~nD---hsPvFEq~~Y~e~lREn~evGy~vLtvrAtD~Dsp~N 301 (2531)
T KOG4289|consen 225 KSLDRETKETHVLRVTAQDHGDPRRSATTTVTVLVLDTND---HSPVFEQDEYREELRENLEVGYEVLTVRATDGDSPPN 301 (2531)
T ss_pred hhhhhhhhheeEEEEEeeecCCCcccceeEEEEEEeecCC---CCcccchhHHHHHHhhccccCceEEEEEeccCCCCCC
Confidence 6799999999999999999999999999999999999988 78999999999999999999999999999999976
Q ss_pred -eEEEEE-------------------------------EEEEe-----------------EEecCCCCCCCeeeCCCeeE
Q psy7562 81 -TLWYRI-------------------------------YVLHW-----------------ECLPDESKNPPTIKSPDQRV 111 (582)
Q Consensus 81 -~~~y~i-------------------------------~~~~~-----------------~~v~d~nd~~P~f~~~~~~~ 111 (582)
.+.|++ +.|+. ..|.|+|||+|.|.+..|.+
T Consensus 302 ani~Yrl~eg~~~~~f~in~rSGvI~T~a~lDRE~~~~y~L~VeAsDqG~~pgp~Ta~V~itV~D~NDNaPqFse~~Yvv 381 (2531)
T KOG4289|consen 302 ANIRYRLLEGNAKNVFEINPRSGVISTRAPLDREELESYQLDVEASDQGRPPGPRTAMVEITVEDENDNAPQFSEKRYVV 381 (2531)
T ss_pred CceEEEecCCCccceeEEcCccceeeccCccCHHhhhheEEEEEeccCCCCCCCceEEEEEEEEecCCCCccccccceEE
Confidence 467765 12222 24889999999999999999
Q ss_pred EEecCCCCCcEEEEEEEEcCCC---CeEEEEEEeCCCCCCeEEeCCcceEEEcccCCcccccEEEEEEEEeeCCccccc-
Q psy7562 112 EVTENDAVGFLVALIQASDPDG---DTLWYRIVDGDPNSTFSIGNDKGNVLLARRVDWETQAQYTLNISVSDGIHEWTG- 187 (582)
Q Consensus 112 ~v~E~~~~g~~v~~v~A~D~D~---~~~~y~i~~~~~~~~F~I~~~tG~i~~~~~LD~E~~~~~~l~V~a~D~~~~~~~- 187 (582)
.|.|+..++++|.+|+|+|.|. +.++|+|.+|+..+.|.||..||+|.+..+||+|.. .|++.|+|.|++.++..
T Consensus 382 qv~Edvt~~avvlrV~AtDrD~g~Ng~VHYsi~Sgn~~G~f~id~~tGel~vv~plD~e~~-~ytl~IrAqDggrPpLsn 460 (2531)
T KOG4289|consen 382 QVREDVTPPAVVLRVTATDRDKGTNGKVHYSIASGNGRGQFYIDSLTGELDVVEPLDFENS-EYTLRIRAQDGGRPPLSN 460 (2531)
T ss_pred EecccCCCCceEEEEEecccCCCcCceEEEEeeccCccccEEEecccceEEEeccccccCC-eeEEEEEcccCCCCCccC
Confidence 9999999999999999999996 789999999999999999999999999999999998 99999999999865431
Q ss_pred ------------ccCC-----------------------------------------------------ceeEEeeeccc
Q psy7562 188 ------------RHRP-----------------------------------------------------SAIRTVVNLDH 202 (582)
Q Consensus 188 ------------~~~p-----------------------------------------------------~~~~~~~~ld~ 202 (582)
+|.| |+|...+.|||
T Consensus 461 ~sgl~iqVlDINDhaPifvstpfq~tvlEnv~lg~~v~~vqaidadsg~na~l~y~laG~~pf~I~~~SG~Itvtk~ldr 540 (2531)
T KOG4289|consen 461 TSGLVIQVLDINDHAPIFVSTPFQATVLENVPLGYLVCHVQAIDADSGENARLHYSLAGVGPFQINNGSGWITVTKELDR 540 (2531)
T ss_pred CCceEEEEEecCCCCceeEechhhhhhhhcccccceEEEEecccCCCCcccceeeeeccCCCeeEecCCceEEEeecccc
Confidence 2222 46778899999
Q ss_pred cccceEEEEEEEEeCCCCCcccccEEEEEEEEeeecCCCCccCCCceEEEEEeecccceEEEEEEe--------eeeeEE
Q psy7562 203 ESLPRIEFHVQVTDLGRPRLTSDTLAKVIVEVTDTNDCPPQFSQDVYNVSILVPTFTNVAVLQFSQ--------DVYNVS 274 (582)
Q Consensus 203 e~~~~~~~~v~~~D~g~p~~~~~~~~~v~I~V~DvNDn~P~f~~~~~~~~v~~~~~~~~~~~~~~~--------~~~~~~ 274 (582)
|..+.|.|-|.++|+|.|++..+ ..|.|.+.|+|||.|.|.+..|++.+.|....+..++++.. .+|++.
T Consensus 541 Et~~~ysl~V~ard~gtp~l~ts--tsI~Vtv~dvndndP~Ft~~eytl~inED~pvgsSI~tvtAvD~d~~s~ityqi~ 618 (2531)
T KOG4289|consen 541 ETVEHYSLGVEARDHGTPPLSTS--TSISVTVLDVNDNDPTFTQKEYTLRINEDAPVGSSIVTVTAVDRDANSVITYQIT 618 (2531)
T ss_pred cccceEEEEEEEcCCCCCccccc--ceEEEEecccCCCCCccccCceEEEecCCccccceEEEEEEeccccccceEEEec
Confidence 99999999999999999999875 67899999999999999999999999999988888876642 244442
Q ss_pred E---------EeeeeeeEEEEEc--cc--------CcccCCCceeEEEEEEEEEEeCCCCCCeeccCeeEEEEECCCCCC
Q psy7562 275 I---------LVPTFTNVAVLQL--NS--------GETNCSHQWSSRIWLHVTVVDVNEHRPILTQSLYETNISESVPVG 335 (582)
Q Consensus 275 ~---------~~~~~~~~~~~~~--~~--------~~~~~~~~~s~~~~v~I~V~DvNd~~P~f~~~~~~~~v~E~~~~g 335 (582)
- ......++..+.+ +. .-....+.+..++.|.|.|.|.|-+.|.|...+|+++|+|..|.|
T Consensus 619 g~ntrn~Fsi~si~g~Glitlalp~dkKqe~~~vl~vtAtDg~l~d~~~V~v~I~danThrpvFqs~pfTvsI~e~rP~G 698 (2531)
T KOG4289|consen 619 GGNTRNRFSISSIGGGGLITLALPLDKKQERQYVLAVTATDGTLQDTCSVNVNITDANTHRPVFQSSPFTVSINEDRPLG 698 (2531)
T ss_pred CCcccccceeeccCCcceEEeecchhhcccceEEEEEEecCCccccceEEEEEeeecccCCcccccCCeeEeeccCCcCC
Confidence 1 1111111111110 00 001123567788899999999999999999999999999999999
Q ss_pred cEEEEEEEEeCC--CCCeEEEEEEeCCCCCCccCEEEECcccEEEECccCCCCCccEEEEEEEEEeCCCCCceeEEEEEE
Q psy7562 336 SEILKVAASDSD--QESKVLYSIHHAQSPLSAALFRIDFHSGALSVAQPLDRESLSHHLLTIMARDQGTPAKRNFARVSI 413 (582)
Q Consensus 336 t~v~~v~a~D~D--~~~~v~y~i~~~~~~~~~~~F~i~~~tG~i~~~~~lD~E~~~~~~l~v~a~D~~~~~~~~~~~v~I 413 (582)
+.|..+.|+|.| +|++|+| |.+. ..|+||+++|.+++...||||.+-.|.+.|.|.|.+.|++.++++|.|
T Consensus 699 ~tvvtlsasd~D~geNARI~y-~led------~~Frid~dsg~i~t~~~ld~edqvtytl~itA~D~~~pq~adtttveV 771 (2531)
T KOG4289|consen 699 TTVVTLSASDEDTGENARITY-ILED------EAFRIDPDSGAIYTQAELDYEDQVTYTLAITARDNGIPQKADTTTVEV 771 (2531)
T ss_pred ceeEEEecccCCCCccceEEE-Eecc------cceeecCCCCceEEeeeeecccceeeEeeeeecCCCCCCcCccEEEEE
Confidence 999999999999 6799999 5443 359999999999999999999999999999999999999999999999
Q ss_pred EEEecCCCCCeEecceeeEEEecCCCCCcEEEEEEEEECCCCCCcEEEEEEEeC-CCCCceEEeCCCeEEEEcccCCcCC
Q psy7562 414 NVLDSNDHPPEFATSLVKGRVFETSAVGTNIVQVVATDRDHGENGVVTYSIASG-NIGNVFTMDPILGTIQTARPLDLSL 492 (582)
Q Consensus 414 ~V~DvNDn~P~f~~~~~~~~v~E~~~~gt~v~~v~a~D~D~~~n~~v~y~l~~~-~~~~~F~Id~~tG~i~~~~~ld~e~ 492 (582)
.|.|+|||+|+|..+.|.++|.|++|++|.+++|.|+|+|.|.|+++.|.+..+ +..+.|.|++.+|.|++.+.||||.
T Consensus 772 ~v~diNDnaPqf~assyt~sV~Ed~Pv~TsvlQVSatDaD~g~Ng~v~y~~qg~~d~p~~F~IEptSGviRtl~rLdRE~ 851 (2531)
T KOG4289|consen 772 LVNDINDNAPQFLASSYTGSVFEDAPVFTSVLQVSATDADSGPNGRVYYTFQGGDDGPGDFYIEPTSGVIRTLRRLDREN 851 (2531)
T ss_pred EeecccccCcccchhhceeEeecCCCCcceEEEEEEeccCCCCCceEEEEecCCCCCCCceEEccCcceeehhhhhcchh
Confidence 999999999999999999999999999999999999999999999999999864 3457999999999999999999999
Q ss_pred CceEEEEEEEEECCCCCceeeEEEEEEEEecCCCCCeeecCceEEEEecCCCCCeEEEEEEeeeCC-----ceEEEEEeC
Q psy7562 493 MSEYMLLVKATDQGAPPLAATVPVQIIVVMADNDPPRFAKAELAAELFENEPPGTVVRHIEARSTS-----SLVFEIIRG 567 (582)
Q Consensus 493 ~~~~~l~V~a~D~~~~~~~~~~~v~I~V~d~nd~~P~f~~~~~~~~v~E~~~~g~~v~~v~a~D~~-----~i~y~i~~g 567 (582)
...|.|.+.|+|.|.|++++.+.|+|+|.|+|||||+|.+.+|...|.||.++|..++++.|.|++ .|+|+|..|
T Consensus 852 ~avy~L~a~avDrg~p~ls~~~eItvtvldvNDnaPvfe~~e~e~~I~enspvgs~va~i~a~dpdEG~NA~IsYqIvgg 931 (2531)
T KOG4289|consen 852 VAVYVLAAYAVDRGNPPLSAPVEITVTVLDVNDNAPVFEQDELELFIEENSPVGSVVALITADDPDEGPNAHISYQIVGG 931 (2531)
T ss_pred eeEEEEEEEEeeCCCCCcCCceEEEEEEEecCCCCCCCCCcceeeEEeecCccceeeEEEEccCCCcCCcceEEEeeccC
Confidence 999999999999999999999999999999999999999999999999999999999999999964 599999999
Q ss_pred CCCCcEEEecCCCC
Q psy7562 568 DSDDMFSINKGLKK 581 (582)
Q Consensus 568 ~~~~~F~Id~~tGe 581 (582)
+....|.++..+||
T Consensus 932 ~d~~~fq~de~~~~ 945 (2531)
T KOG4289|consen 932 NDPELFQLDEFSGE 945 (2531)
T ss_pred ccHHHHHHHHhhhh
Confidence 99999999998885
No 2
>KOG4289|consensus
Probab=100.00 E-value=1.1e-73 Score=611.96 Aligned_cols=523 Identities=28% Similarity=0.454 Sum_probs=457.0
Q ss_pred CCCCcccCCCceEEEeCCCCCCcEEEEEEEEcCCCCeEEEEEEEEE------------------------eE--------
Q psy7562 45 KNPPTIKSPDQRVEVTENDAVGFLVALIQASDPDGDTLWYRIYVLH------------------------WE-------- 92 (582)
Q Consensus 45 ~~~P~f~~~~~~~~v~E~~~~g~~v~~v~A~D~D~~~~~y~i~~~~------------------------~~-------- 92 (582)
.++|.|.+..|...++||.|+|+.|++|.|.|+|.+.+.|++..+. .+
T Consensus 159 ~~~~~Fqq~~Yq~~lpEn~pagT~iasv~A~~~~a~rl~Ysm~al~dsRS~~lFslD~~sG~irta~~lDREt~e~Hvlr 238 (2531)
T KOG4289|consen 159 ANAVQFQQPNYQKELPENEPAGTIIASVKASDPDAGRLYYSMVALFDSRSQNLFSLDPMSGAIRTAKSLDRETKETHVLR 238 (2531)
T ss_pred CCCccCCCcchhccCcCCCCCCceeEEEEecCCCcCceEEEeeeccchhccccEeeccccccchhhhhhhhhhhheeEEE
Confidence 3689999999999999999999999999999999999999873211 00
Q ss_pred -------------------EecCCCCCCCeeeCCCeeEEEecCCCCCcEEEEEEEEcCCC---CeEEEEEEeCCCCCCeE
Q psy7562 93 -------------------CLPDESKNPPTIKSPDQRVEVTENDAVGFLVALIQASDPDG---DTLWYRIVDGDPNSTFS 150 (582)
Q Consensus 93 -------------------~v~d~nd~~P~f~~~~~~~~v~E~~~~g~~v~~v~A~D~D~---~~~~y~i~~~~~~~~F~ 150 (582)
.|.|.|||.|+|.+..|.-++.||.++|+.|.+++|+|.|. +.+.|++.+|+.+..|.
T Consensus 239 VtA~d~~~P~~SAtttv~V~V~D~nDhsPvFEq~~Y~e~lREn~evGy~vLtvrAtD~Dsp~Nani~Yrl~eg~~~~~f~ 318 (2531)
T KOG4289|consen 239 VTAQDHGDPRRSATTTVTVLVLDTNDHSPVFEQDEYREELRENLEVGYEVLTVRATDGDSPPNANIRYRLLEGNAKNVFE 318 (2531)
T ss_pred EEeeecCCCcccceeEEEEEEeecCCCCcccchhHHHHHHhhccccCceEEEEEeccCCCCCCCceEEEecCCCccceeE
Confidence 27899999999999999999999999999999999999997 88999999999999999
Q ss_pred EeCCcceEEEcccCCcccccEEEEEEEEeeCCcccc-------------cccCC--------------------------
Q psy7562 151 IGNDKGNVLLARRVDWETQAQYTLNISVSDGIHEWT-------------GRHRP-------------------------- 191 (582)
Q Consensus 151 I~~~tG~i~~~~~LD~E~~~~~~l~V~a~D~~~~~~-------------~~~~p-------------------------- 191 (582)
||+.+|.|++..+||||+...|+|.|.|+|.+.... +++.|
T Consensus 319 in~rSGvI~T~a~lDRE~~~~y~L~VeAsDqG~~pgp~Ta~V~itV~D~NDNaPqFse~~Yvvqv~Edvt~~avvlrV~A 398 (2531)
T KOG4289|consen 319 INPRSGVISTRAPLDREELESYQLDVEASDQGRPPGPRTAMVEITVEDENDNAPQFSEKRYVVQVREDVTPPAVVLRVTA 398 (2531)
T ss_pred EcCccceeeccCccCHHhhhheEEEEEeccCCCCCCCceEEEEEEEEecCCCCccccccceEEEecccCCCCceEEEEEe
Confidence 999999999999999999999999999999875432 11111
Q ss_pred ------------------------------ceeEEeeeccccccceEEEEEEEEeCCCCCcccccEEEEEEEEeeecCCC
Q psy7562 192 ------------------------------SAIRTVVNLDHESLPRIEFHVQVTDLGRPRLTSDTLAKVIVEVTDTNDCP 241 (582)
Q Consensus 192 ------------------------------~~~~~~~~ld~e~~~~~~~~v~~~D~g~p~~~~~~~~~v~I~V~DvNDn~ 241 (582)
|++..+.+||+|.. .|.+.|+|.|+|.|+|++. ..++|+|.|+|||+
T Consensus 399 tDrD~g~Ng~VHYsi~Sgn~~G~f~id~~tGel~vv~plD~e~~-~ytl~IrAqDggrPpLsn~--sgl~iqVlDINDha 475 (2531)
T KOG4289|consen 399 TDRDKGTNGKVHYSIASGNGRGQFYIDSLTGELDVVEPLDFENS-EYTLRIRAQDGGRPPLSNT--SGLVIQVLDINDHA 475 (2531)
T ss_pred cccCCCcCceEEEEeeccCccccEEEecccceEEEeccccccCC-eeEEEEEcccCCCCCccCC--CceEEEEEecCCCC
Confidence 24566789999999 9999999999999999985 45669999999999
Q ss_pred CccCCCceEEEEEeecccceEEEEEE----------eeeeeEEEEee-----eeeeEEEEEccc------------Cccc
Q psy7562 242 PQFSQDVYNVSILVPTFTNVAVLQFS----------QDVYNVSILVP-----TFTNVAVLQLNS------------GETN 294 (582)
Q Consensus 242 P~f~~~~~~~~v~~~~~~~~~~~~~~----------~~~~~~~~~~~-----~~~~~~~~~~~~------------~~~~ 294 (582)
|.|....+.+++.|+...+-.+.... ...|++.-..+ ..+-..+....+ ..+.
T Consensus 476 Pifvstpfq~tvlEnv~lg~~v~~vqaidadsg~na~l~y~laG~~pf~I~~~SG~Itvtk~ldrEt~~~ysl~V~ard~ 555 (2531)
T KOG4289|consen 476 PIFVSTPFQATVLENVPLGYLVCHVQAIDADSGENARLHYSLAGVGPFQINNGSGWITVTKELDRETVEHYSLGVEARDH 555 (2531)
T ss_pred ceeEechhhhhhhhcccccceEEEEecccCCCCcccceeeeeccCCCeeEecCCceEEEeecccccccceEEEEEEEcCC
Confidence 99999999999999888777665442 12344432111 111112222222 2356
Q ss_pred CCCceeEEEEEEEEEEeCCCCCCeeccCeeEEEEECCCCCCcEEEEEEEEeCCCCCeEEEEEEeCCCCCCccCEEEECcc
Q psy7562 295 CSHQWSSRIWLHVTVVDVNEHRPILTQSLYETNISESVPVGSEILKVAASDSDQESKVLYSIHHAQSPLSAALFRIDFHS 374 (582)
Q Consensus 295 ~~~~~s~~~~v~I~V~DvNd~~P~f~~~~~~~~v~E~~~~gt~v~~v~a~D~D~~~~v~y~i~~~~~~~~~~~F~i~~~t 374 (582)
+.|++++++.|.|++.|+|||.|.|++..|+..+.|+.|.|+.|.+|+|+|.|.+..++|.|.+++ ....|.|....
T Consensus 556 gtp~l~tstsI~Vtv~dvndndP~Ft~~eytl~inED~pvgsSI~tvtAvD~d~~s~ityqi~g~n---trn~Fsi~si~ 632 (2531)
T KOG4289|consen 556 GTPPLSTSTSISVTVLDVNDNDPTFTQKEYTLRINEDAPVGSSIVTVTAVDRDANSVITYQITGGN---TRNRFSISSIG 632 (2531)
T ss_pred CCCcccccceEEEEecccCCCCCccccCceEEEecCCccccceEEEEEEeccccccceEEEecCCc---ccccceeeccC
Confidence 789999999999999999999999999999999999999999999999999999999999999886 46899998764
Q ss_pred --cEEEECccCCCCCccEEEEEEEEEeCCCCCceeEEEEEEEEEecCCCCCeEecceeeEEEecCCCCCcEEEEEEEEEC
Q psy7562 375 --GALSVAQPLDRESLSHHLLTIMARDQGTPAKRNFARVSINVLDSNDHPPEFATSLVKGRVFETSAVGTNIVQVVATDR 452 (582)
Q Consensus 375 --G~i~~~~~lD~E~~~~~~l~v~a~D~~~~~~~~~~~v~I~V~DvNDn~P~f~~~~~~~~v~E~~~~gt~v~~v~a~D~ 452 (582)
|.|.+.-++|+.+...|.+.|.|+|++ +..++.|.|.|.|.|-+.|+|....|+++|.|..|.|+.|.++.|+|.
T Consensus 633 g~Glitlalp~dkKqe~~~vl~vtAtDg~---l~d~~~V~v~I~danThrpvFqs~pfTvsI~e~rP~G~tvvtlsasd~ 709 (2531)
T KOG4289|consen 633 GGGLITLALPLDKKQERQYVLAVTATDGT---LQDTCSVNVNITDANTHRPVFQSSPFTVSINEDRPLGTTVVTLSASDE 709 (2531)
T ss_pred CcceEEeecchhhcccceEEEEEEecCCc---cccceEEEEEeeecccCCcccccCCeeEeeccCCcCCceeEEEecccC
Confidence 688889999999999999999999965 788999999999999999999999999999999999999999999999
Q ss_pred CCCCCcEEEEEEEeCCCCCceEEeCCCeEEEEcccCCcCCCceEEEEEEEEECCCCCceeeEEEEEEEEecCCCCCeeec
Q psy7562 453 DHGENGVVTYSIASGNIGNVFTMDPILGTIQTARPLDLSLMSEYMLLVKATDQGAPPLAATVPVQIIVVMADNDPPRFAK 532 (582)
Q Consensus 453 D~~~n~~v~y~l~~~~~~~~F~Id~~tG~i~~~~~ld~e~~~~~~l~V~a~D~~~~~~~~~~~v~I~V~d~nd~~P~f~~ 532 (582)
|.|+|++|+| |.+. ..|+||+.+|.+++...||||.+-.|.+.|.|.|.+.|++..+.+|.|.|.|+||++|+|..
T Consensus 710 D~geNARI~y-~led---~~Frid~dsg~i~t~~~ld~edqvtytl~itA~D~~~pq~adtttveV~v~diNDnaPqf~a 785 (2531)
T KOG4289|consen 710 DTGENARITY-ILED---EAFRIDPDSGAIYTQAELDYEDQVTYTLAITARDNGIPQKADTTTVEVLVNDINDNAPQFLA 785 (2531)
T ss_pred CCCccceEEE-Eecc---cceeecCCCCceEEeeeeecccceeeEeeeeecCCCCCCcCccEEEEEEeecccccCcccch
Confidence 9999999999 4432 45999999999999999999999999999999999999999999999999999999999999
Q ss_pred CceEEEEecCCCCCeEEEEEEeeeC-----CceEEEEEeC-CCCCcEEEecCCC
Q psy7562 533 AELAAELFENEPPGTVVRHIEARST-----SSLVFEIIRG-DSDDMFSINKGLK 580 (582)
Q Consensus 533 ~~~~~~v~E~~~~g~~v~~v~a~D~-----~~i~y~i~~g-~~~~~F~Id~~tG 580 (582)
+.|.++|.|++|+++.+++|+|+|. +.+.|.+-.| +..+.|.|++.||
T Consensus 786 ssyt~sV~Ed~Pv~TsvlQVSatDaD~g~Ng~v~y~~qg~~d~p~~F~IEptSG 839 (2531)
T KOG4289|consen 786 SSYTGSVFEDAPVFTSVLQVSATDADSGPNGRVYYTFQGGDDGPGDFYIEPTSG 839 (2531)
T ss_pred hhceeEeecCCCCcceEEEEEEeccCCCCCceEEEEecCCCCCCCceEEccCcc
Confidence 9999999999999999999999993 4555655443 3468899999998
No 3
>KOG1219|consensus
Probab=100.00 E-value=3e-72 Score=617.44 Aligned_cols=562 Identities=28% Similarity=0.392 Sum_probs=495.8
Q ss_pred cceecccCCEEEEEEEEEECCCCCceeEEEEEEEEEecCCCCCCCCcccCCCceEEEeCCCCCCcEEEEEEEEcCCCCe-
Q psy7562 3 NIIVFVAGQEFDLLVRATDHGSPNQSSTARVSIQVVRLPDESKNPPTIKSPDQRVEVTENDAVGFLVALIQASDPDGDT- 81 (582)
Q Consensus 3 ~~ldyE~~~~~~l~v~a~D~g~p~~~~~~~v~I~V~d~ND~~~~~P~f~~~~~~~~v~E~~~~g~~v~~v~A~D~D~~~- 81 (582)
.+||||...-|+|+|+|.|.|.|.+.+++.+.|.++|.|| |+|.|.......+++-.-++|+.+.++.|+|.|.+.
T Consensus 477 ~kldrel~rvYtfRv~Asd~G~per~~e~~~~I~ildlND---n~P~F~~~n~t~t~~~~~~vg~~l~tvsAtD~De~el 553 (4289)
T KOG1219|consen 477 GKLDRELRRVYTFRVRASDWGVPERESEVHLNILILDLND---NPPNFEIRNCTGTINGDPKVGTKLFTVSATDLDELEL 553 (4289)
T ss_pred cccCccccceEEEEEEEeccCCcchhceeeEEEEEeccCC---CCCcceeeecccccccCCCCCcEEEEeeccccCcccc
Confidence 6899999899999999999999999999999999999998 899999999999999999999999999999999874
Q ss_pred EEEEEEE----------------EE---------------eE----------------E----------------ecCCC
Q psy7562 82 LWYRIYV----------------LH---------------WE----------------C----------------LPDES 98 (582)
Q Consensus 82 ~~y~i~~----------------~~---------------~~----------------~----------------v~d~n 98 (582)
+.|+|.. |. +. . ..-.|
T Consensus 554 lky~i~~~nel~feln~nSgeisLvr~n~t~~~~s~~slv~a~d~G~p~as~t~lni~~~k~~Tgv~~~~~p~Ilq~~e~ 633 (4289)
T KOG1219|consen 554 LKYRILPGNELSFELNSNSGEISLVRQNNTECLQSCESLVIAADDGVPPASPTLLNITVMKYGTGVGNEHEPNILQRFEN 633 (4289)
T ss_pred eeEEEEeCCcCceeeccCCCeEEEEEccccccccccceEEEehhcCCCcCCceeeEEEEEecccccccccChhHhhhhcc
Confidence 5666521 00 00 0 11247
Q ss_pred CCCCeeeC-CCeeEEEecCCCCCcEEEEEEEEcCCC---CeEEEEEEeCCCCCCeEEeCCcceEEEcccCCcccccEEEE
Q psy7562 99 KNPPTIKS-PDQRVEVTENDAVGFLVALIQASDPDG---DTLWYRIVDGDPNSTFSIGNDKGNVLLARRVDWETQAQYTL 174 (582)
Q Consensus 99 d~~P~f~~-~~~~~~v~E~~~~g~~v~~v~A~D~D~---~~~~y~i~~~~~~~~F~I~~~tG~i~~~~~LD~E~~~~~~l 174 (582)
++-|.|.. -+..+.|+|+.|+|+.++.++|+|.|. +.+.|.|.+|+....|.|+..+|.|.++.+||+|....|.|
T Consensus 634 ~~fPqf~s~fP~iI~v~Edvpigt~la~L~atD~Dtgfng~l~yvI~dgne~~~~~Id~qsg~itvas~ld~~~t~~yiL 713 (4289)
T KOG1219|consen 634 KHFPQFPSDFPFIIVVPEDVPIGTTLAILSATDSDTGFNGKLVYVIEDGNESICFLIDRQSGNITVASPLDNENTEQYIL 713 (4289)
T ss_pred ccCccccccCCceEEccccCCCCceEEEEeccCCCCCcCceEEEEEeCCccceEEEEecccceEEEecchhhhhhheeEE
Confidence 88899988 678899999999999999999999997 88999999999889999999999999999999999999999
Q ss_pred EEEEeeCCccccc-------------ccCC--------------------------------------------------
Q psy7562 175 NISVSDGIHEWTG-------------RHRP-------------------------------------------------- 191 (582)
Q Consensus 175 ~V~a~D~~~~~~~-------------~~~p-------------------------------------------------- 191 (582)
+|+|.|.+.+... ++.|
T Consensus 714 nvta~D~gtPqkss~r~l~v~vkd~ndn~p~f~e~sy~vtvsedtepgs~Ia~vetnd~D~g~NG~v~fsL~n~sdvfsI 793 (4289)
T KOG1219|consen 714 NVTAYDLGTPQKSSWRLLLVFVKDYNDNTPIFVERSYHVTVSEDTEPGSFIAHVETNDTDGGNNGMVSFSLLNKSDVFSI 793 (4289)
T ss_pred EEEEecCCCchhhceeeEEEEEEecccCCccccccceEEEEecCCCCCceEEEEEecccCCCCCceEEEEecCCcceEEe
Confidence 9999999865431 1111
Q ss_pred ----ceeEEeeeccccccceEEEEEEEEeCCCCCcccccEEEEEEEEeeecCCCCccCCCceEEEEEeecccceEEEEEE
Q psy7562 192 ----SAIRTVVNLDHESLPRIEFHVQVTDLGRPRLTSDTLAKVIVEVTDTNDCPPQFSQDVYNVSILVPTFTNVAVLQFS 267 (582)
Q Consensus 192 ----~~~~~~~~ld~e~~~~~~~~v~~~D~g~p~~~~~~~~~v~I~V~DvNDn~P~f~~~~~~~~v~~~~~~~~~~~~~~ 267 (582)
|.+.+.++||+|.+..|.|.|.|+|.+.|.+.+ ...+.|.|.|||||+|++...++...+.|..+.|..+.+..
T Consensus 794 dp~tGivv~~~sLdrE~q~~y~l~I~a~dqp~pq~~s--vv~l~vsvedVndnpPkci~~hsr~kipedlp~gt~~~~l~ 871 (4289)
T KOG1219|consen 794 DPFTGIVVTSKSLDREGQTSYHLKIEARDQPPPQLFS--VVELDVSVEDVNDNPPKCIIRHSRSKIPEDLPYGTVTWQLV 871 (4289)
T ss_pred cCcccEEEeccccCcccCceeEEEEEEcCCCCCceEE--EEEEEEEEeeccCCCCccccccccccCcccCCCceEEEEhh
Confidence 123456789999999999999999999988876 57899999999999999999999999999988888776542
Q ss_pred ----------eeeeeEEEEe----------------------eeeeeEEEEEcccCcccCCCceeEEEEEEEEEEeCCCC
Q psy7562 268 ----------QDVYNVSILV----------------------PTFTNVAVLQLNSGETNCSHQWSSRIWLHVTVVDVNEH 315 (582)
Q Consensus 268 ----------~~~~~~~~~~----------------------~~~~~~~~~~~~~~~~~~~~~~s~~~~v~I~V~DvNd~ 315 (582)
+..|.+..-. ..+.++.+.. .+.+.|.+++.+.+.|.+.|+|.|
T Consensus 872 A~d~diGq~~kvry~l~~~~v~~rvd~~sGavfi~~~LDf~k~~fynLsv~a----~d~g~p~lss~chl~Vevldv~en 947 (4289)
T KOG1219|consen 872 ALDPDIGQLGKVRYYLTDDTVGERVDFPSGAVFIGKPLDFEKSDFYNLSVTA----VDRGTPILSSICHLEVEVLDVNEN 947 (4289)
T ss_pred hcCcccCcCceeEEEEecCccccccccccccEEEecccccccccceEEEEEE----ecCCCcceeeeEEEEEEEeccCCC
Confidence 2233322110 0111222221 234566899999999999999998
Q ss_pred --CCeeccCeeEEEEECCCCCCcEEEEEEEEeCCC--CCeEEEEEEeCCCCCCccCEEEECcccEEEECccCCCCCccEE
Q psy7562 316 --RPILTQSLYETNISESVPVGSEILKVAASDSDQ--ESKVLYSIHHAQSPLSAALFRIDFHSGALSVAQPLDRESLSHH 391 (582)
Q Consensus 316 --~P~f~~~~~~~~v~E~~~~gt~v~~v~a~D~D~--~~~v~y~i~~~~~~~~~~~F~i~~~tG~i~~~~~lD~E~~~~~ 391 (582)
||.|..-.-..+|.||+|.|+.|+++.|.|.|. .+.+.|+|..+. ..+.|+|++.+|.|++.+.||||....|
T Consensus 948 lhpp~F~~~v~e~~V~EnapiGT~vi~i~A~dedsgldg~l~Y~I~~gd---g~g~FsId~~tG~irTl~~lDrE~ks~Y 1024 (4289)
T KOG1219|consen 948 LHPPEFISFVTEGHVLENAPIGTIVIRIQARDEDSGLDGELSYKIRTGD---GDGIFSIDSTTGSIRTLKALDREKKSSY 1024 (4289)
T ss_pred CCCcchheeeeeeeEeecCCcceEEEEEEEecCCCCccceEEEEEEcCC---cceeEEecCCcceEeechhhchhhcceE
Confidence 899998888999999999999999999999995 489999999884 4689999999999999999999999999
Q ss_pred EEEEEEEeCCCCCceeEEEEEEEEEecCCCCCeEecceeeEEEecCCCCCcEEEEEEEEECCCCCCcEEEEEEEeCCCCC
Q psy7562 392 LLTIMARDQGTPAKRNFARVSINVLDSNDHPPEFATSLVKGRVFETSAVGTNIVQVVATDRDHGENGVVTYSIASGNIGN 471 (582)
Q Consensus 392 ~l~v~a~D~~~~~~~~~~~v~I~V~DvNDn~P~f~~~~~~~~v~E~~~~gt~v~~v~a~D~D~~~n~~v~y~l~~~~~~~ 471 (582)
.|+|.|+|.|.+++++.+.+.|.|+|+|||+|+|.++.|..+|.|+++.+..|.++.|.|+|...|+++.|.|.+|+..+
T Consensus 1025 wltveA~D~gt~~~ssv~~vyI~ieDvNDn~Pq~s~pvy~asI~enSp~~vsivq~ea~D~Dsssn~kLmykI~sGnyq~ 1104 (4289)
T KOG1219|consen 1025 WLTVEAKDLGTVPLSSVCEVYIEIEDVNDNVPQFSSPVYYASISENSPETVSIVQAEANDPDSSSNQKLMYKITSGNYQG 1104 (4289)
T ss_pred EEEEEEEecCCCccccceeEEEEEEecCCCCcccCCceEeeeeccCCCCceEEEEeccCCCCcccCcceEEEEccCCccc
Confidence 99999999999999999999999999999999999999999999999999999999999999888999999999999999
Q ss_pred ceEEeCCCeEEEEcccCCcCCCceEEEEEEEEECCCCCceeeEEEEEEEEecCCCCCeeecCceEEEEecCCCCCeEEEE
Q psy7562 472 VFTMDPILGTIQTARPLDLSLMSEYMLLVKATDQGAPPLAATVPVQIIVVMADNDPPRFAKAELAAELFENEPPGTVVRH 551 (582)
Q Consensus 472 ~F~Id~~tG~i~~~~~ld~e~~~~~~l~V~a~D~~~~~~~~~~~v~I~V~d~nd~~P~f~~~~~~~~v~E~~~~g~~v~~ 551 (582)
.|.||+.||.|++.+.||||.+.++.|.|.+.|.|.|++++.+.|.|.|+|+|||+|.|.+..+...|+|...+ .+++
T Consensus 1105 FF~Id~~TG~iTt~r~LDRE~qdEHiLeVTi~D~gep~l~s~~rviV~IldvNdnsp~Flqk~~~~~v~~r~s~--plyR 1182 (4289)
T KOG1219|consen 1105 FFQIDPETGLITTIRRLDREKQDEHILEVTIQDNGEPWLCSNQRVIVSILDVNDNSPRFLQKKTFLRVPERSSP--PLYR 1182 (4289)
T ss_pred eEEEccccceeeeehhhcccccccceEEEEEecCCCCccccceEEEEEEeeccCCchhhhhheeEEEeeeccCC--ceeE
Confidence 99999999999999999999999999999999999999999999999999999999999999999999998775 7899
Q ss_pred EEeeeC-----CceEEEEEeCCCCCcEEEecCCC
Q psy7562 552 IEARST-----SSLVFEIIRGDSDDMFSINKGLK 580 (582)
Q Consensus 552 v~a~D~-----~~i~y~i~~g~~~~~F~Id~~tG 580 (582)
+-|.|+ .+++|+|..|+.. |+||..||
T Consensus 1183 l~a~d~DeG~narityniedgde~--FsID~~t~ 1214 (4289)
T KOG1219|consen 1183 LAAQDNDEGNNARITYNIEDGDEV--FSIDIATG 1214 (4289)
T ss_pred EEEEecCCCcceEEEEecccCceE--EEEeccCc
Confidence 999984 5799999988765 99999998
No 4
>KOG1219|consensus
Probab=100.00 E-value=9.7e-70 Score=597.73 Aligned_cols=559 Identities=27% Similarity=0.374 Sum_probs=474.0
Q ss_pred CCcceecccCCEEEEEEEEEECCCCCceeEEEEEEEEEecCCCCCCCCcccCCCceEEEeCCCCCCcEEEEEEEEcCCCC
Q psy7562 1 MENIIVFVAGQEFDLLVRATDHGSPNQSSTARVSIQVVRLPDESKNPPTIKSPDQRVEVTENDAVGFLVALIQASDPDGD 80 (582)
Q Consensus 1 ~~~~ldyE~~~~~~l~v~a~D~g~p~~~~~~~v~I~V~d~ND~~~~~P~f~~~~~~~~v~E~~~~g~~v~~v~A~D~D~~ 80 (582)
|+++||||.-..|.|+|+|.|+|+ .+.++..|.|+|. |+ .+|.|.++.|.+.|.|+..+-+.|..+.|..+-+-
T Consensus 2035 l~kpld~e~ln~y~fkviAkd~gs-~~~~eviV~vtv~--nk---a~PvF~~~~y~avi~e~~tv~spvv~vqa~s~l~~ 2108 (4289)
T KOG1219|consen 2035 LKKPLDFENLNKYRFKVIAKDGGS-VHRVEVIVHVTVE--NK---AAPVFITPDYVAVIEELITVSSPVVHVQAASPLGL 2108 (4289)
T ss_pred ecCccchhhcceeEEEEEEcCCCc-eeeeEEEEEEEEe--cC---CCcceecCcEEEEeecccccccceeEEeecCCcCC
Confidence 689999999888999999999998 6777777777764 45 57999999999999999999999999999888877
Q ss_pred eEEEEEEE--------------------EEeE--------------------------EecCCCCCCCeeeCCCeeEEEe
Q psy7562 81 TLWYRIYV--------------------LHWE--------------------------CLPDESKNPPTIKSPDQRVEVT 114 (582)
Q Consensus 81 ~~~y~i~~--------------------~~~~--------------------------~v~d~nd~~P~f~~~~~~~~v~ 114 (582)
.+.|+|.. |+.+ .|.|+|||||+|.+..|..+++
T Consensus 2109 kv~YsIldg~~slFtvnf~TG~i~v~~pLd~ea~t~h~l~ieAtd~~~p~~Aea~VeIiV~dIndn~PvFeqlsYt~sis 2188 (4289)
T KOG1219|consen 2109 KVTYSILDGNTSLFTVNFTTGVILVLIPLDREASTLHELLIEATDAGIPLSAEAKVEIIVGDINDNPPVFEQLSYTISIS 2188 (4289)
T ss_pred ceEEEEecCCcceEEEecccceEEeccccccccccceEEEEEEeccCCCcceeeEEEEEecccCCCCchhheeeEEEEcc
Confidence 78898831 1111 1679999999999999999999
Q ss_pred cCCCCCcEEEEEEEEcCCC-CeEEEEEEeCC-CCCCeEEeCCcceEEEcccCCcccccEEEEEEEEeeCCccccc-----
Q psy7562 115 ENDAVGFLVALIQASDPDG-DTLWYRIVDGD-PNSTFSIGNDKGNVLLARRVDWETQAQYTLNISVSDGIHEWTG----- 187 (582)
Q Consensus 115 E~~~~g~~v~~v~A~D~D~-~~~~y~i~~~~-~~~~F~I~~~tG~i~~~~~LD~E~~~~~~l~V~a~D~~~~~~~----- 187 (582)
|+++.|+.|.++.|+|.|. +.+.|+|.+++ ....|+|+..||.|++++.||||+..+|.+.|+|+|++.+...
T Consensus 2189 E~s~igt~viqilATdsDsn~~isYsl~g~s~~sk~f~In~sTG~it~~g~ldyE~~q~f~~fvratdggk~lSseviv~ 2268 (4289)
T KOG1219|consen 2189 ENSKIGTKVIQILATDSDSNREISYSLEGNSEISKPFRINVSTGWITVAGKLDYEENQEFRFFVRATDGGKPLSSEVIVE 2268 (4289)
T ss_pred CCCccCceEEEEEeccCCCCCceEEEeecCCccccceEEecccceEEEeeecChhhcceEEEEEEEccCCCcccccEEEE
Confidence 9999999999999999997 88999998643 4568999999999999999999999999999999999876221
Q ss_pred -------ccCC---------------------------------------------------------------------
Q psy7562 188 -------RHRP--------------------------------------------------------------------- 191 (582)
Q Consensus 188 -------~~~p--------------------------------------------------------------------- 191 (582)
++.|
T Consensus 2269 V~VeD~Ndn~Pef~q~~~ea~vsd~a~~g~fit~v~a~D~Dssd~lk~ey~~~~~l~~s~~G~iTlfNl~k~~l~~s~~l 2348 (4289)
T KOG1219|consen 2269 VHVEDFNDNPPEFNQRNYEAFVSDPARSGHFITVVNAHDLDSSDHLKLEYNSNHFLILSENGIITLFNLLKSPLQTSYPL 2348 (4289)
T ss_pred EEehhcCCCCchhccccceeecCCCccceeEEEEEEeccCCccchhhhhhcccceeeeccCceEEehhhcccccccccce
Confidence 0000
Q ss_pred ------------------------------------------------------------------------ceeEEeee
Q psy7562 192 ------------------------------------------------------------------------SAIRTVVN 199 (582)
Q Consensus 192 ------------------------------------------------------------------------~~~~~~~~ 199 (582)
|.|.+...
T Consensus 2349 rv~vsD~v~~at~~vl~~~~~~n~~~~lveka~l~Tv~~~~~~~~~~f~~~gt~~~~si~s~~sd~~~in~~GqI~t~~k 2428 (4289)
T KOG1219|consen 2349 RVTVSDGVFRATMEVLFHPHSRNHFSELVEKADLVTVVEHDEQEDADFGAYGTSIYYSINSRASDHFEINKSGQIKTLSK 2428 (4289)
T ss_pred eeeeccCcceeeeEEEEEecCcccchhhhhccceeEEEEecCccccccccCCceeeeeechhccCceeECCCccEEeeeh
Confidence 12445566
Q ss_pred ccccccceEEEEEEEE--eCCCCCcccccEEEEEEEEeeecCCCCccCCCceEEEEEeecccceEEEEEE---ee-----
Q psy7562 200 LDHESLPRIEFHVQVT--DLGRPRLTSDTLAKVIVEVTDTNDCPPQFSQDVYNVSILVPTFTNVAVLQFS---QD----- 269 (582)
Q Consensus 200 ld~e~~~~~~~~v~~~--D~g~p~~~~~~~~~v~I~V~DvNDn~P~f~~~~~~~~v~~~~~~~~~~~~~~---~~----- 269 (582)
||+|....+.+-|.++ |.|.. .+.++++|.++|.|||||.|..+.|++++.++...+..++.+. .|
T Consensus 2429 ld~e~s~~~vi~i~v~a~Da~gr----~af~tvti~ltDiNDnpPqF~a~~Y~~nI~enaskg~~V~~v~A~D~De~sna 2504 (4289)
T KOG1219|consen 2429 LDREYSEELVIIIAVMAFDAGGR----VAFCTVTIILTDINDNPPQFDAQLYRVNITENASKGKLVGHVIARDADEGSNA 2504 (4289)
T ss_pred hhhccCceEEEEEEEEEecCCCe----EEEEEEEEEEEecCCCCccccceeEEEEeecccCCCceEEEEEEecCCCCCcc
Confidence 8899888887766665 88754 2468899999999999999999999999999988777776652 11
Q ss_pred eeeEEEEee----------eeeeEEEEEc------------ccCcccCCCceeEEEEEEEEEEeCCCCCCeeccCeeEEE
Q psy7562 270 VYNVSILVP----------TFTNVAVLQL------------NSGETNCSHQWSSRIWLHVTVVDVNEHRPILTQSLYETN 327 (582)
Q Consensus 270 ~~~~~~~~~----------~~~~~~~~~~------------~~~~~~~~~~~s~~~~v~I~V~DvNd~~P~f~~~~~~~~ 327 (582)
.+.+.+... ..+...+... ....+.+.|+..+..+|.|+|.+..++.|+|+.+.|.++
T Consensus 2505 dvty~i~~e~~~~~v~~in~sG~Itv~~sL~~~en~tl~l~vkA~D~g~P~~~s~ttV~v~vl~e~v~lPrFSep~y~fs 2584 (4289)
T KOG1219|consen 2505 DVTYEIVGESDVKHVFEINESGVITVKRSLDGLENSTLHLFVKAIDDGKPRRRSNTTVIVTVLPEDVNLPRFSEPIYTFS 2584 (4289)
T ss_pred cEEEEecCchhhhheeeecCCceEEeehhhhcccCcEEEEEEEeccCCCCCcccceEEEEEecCcccCcccccCceEEEe
Confidence 122221111 0111111111 112356788999999999999999999999999999999
Q ss_pred EECCCCCCcEEEEEEEEeCCCCCeEEEEEEeCCCCC--CccCEEEECcccEEEECccCCCCCccEEEEEEEEEeCCCCCc
Q psy7562 328 ISESVPVGSEILKVAASDSDQESKVLYSIHHAQSPL--SAALFRIDFHSGALSVAQPLDRESLSHHLLTIMARDQGTPAK 405 (582)
Q Consensus 328 v~E~~~~gt~v~~v~a~D~D~~~~v~y~i~~~~~~~--~~~~F~i~~~tG~i~~~~~lD~E~~~~~~l~v~a~D~~~~~~ 405 (582)
|+|+.++|+.|++|+|.|.|. .+.|++.-++.+. ....|++++.||.|.+.++||+|...+|++.|+|+++++ .
T Consensus 2585 vpEDv~vG~~Ig~v~a~~a~~--~~i~~~v~~gt~Esn~d~~Fsvdr~TG~i~v~ksLD~E~kk~yqi~v~a~~~~~--v 2660 (4289)
T KOG1219|consen 2585 VPEDVPVGEEIGQVSASDADE--HVIYSLVLGGTPESNPDLPFSVDRNTGMIKVNKSLDHEKKKSYQIKVKATCGQW--V 2660 (4289)
T ss_pred ccccCCCCCeeeEEeecccCC--ceEEEEEeCCCCCCCCCCceEEcCCCceEEeccccchhhhceEEEEEEeecCCc--e
Confidence 999999999999999999863 4567776654443 345799999999999999999999999999999999886 6
Q ss_pred eeEEEEEEEEEecCCCCCeEecceeeEEEecCCCCCcEEEEEEEEECCCCCCcEEEEEEEeCCCCCceEEeCCCeEEEEc
Q psy7562 406 RNFARVSINVLDSNDHPPEFATSLVKGRVFETSAVGTNIVQVVATDRDHGENGVVTYSIASGNIGNVFTMDPILGTIQTA 485 (582)
Q Consensus 406 ~~~~~v~I~V~DvNDn~P~f~~~~~~~~v~E~~~~gt~v~~v~a~D~D~~~n~~v~y~l~~~~~~~~F~Id~~tG~i~~~ 485 (582)
-+.++|.|.|.|+|||+|+|..+.|.+.+.||++.|+.|++++|.|.|.|.||+|+|+|.+.. .+|.|++.||.|++.
T Consensus 2661 va~tsv~vqVkDvNDNaPvFe~d~y~f~i~En~pvGtsV~qf~AsD~Ds~~nGqirysl~~~v--~yF~In~etGwlTt~ 2738 (4289)
T KOG1219|consen 2661 VAETSVFVQVKDVNDNAPVFEKDPYLFIIEENSPVGTSVIQFHASDMDSGNNGQIRYSLTSPV--PYFAINPETGWLTTL 2738 (4289)
T ss_pred EEEEEEEEEeecccCCCccccCCceeEEEeccCCCCceEEEEEeeccCCCCCceEEEEEcCCc--ceEEEcCCCCeeeeh
Confidence 788999999999999999999999999999999999999999999999999999999998753 399999999999999
Q ss_pred ccCCcCCCceEEEEEEEEECCCCCceeeEEEEEEEEecCCCCCeeecCceEEEEecCCCCCeEEEEEEeeeC-----Cce
Q psy7562 486 RPLDLSLMSEYMLLVKATDQGAPPLAATVPVQIIVVMADNDPPRFAKAELAAELFENEPPGTVVRHIEARST-----SSL 560 (582)
Q Consensus 486 ~~ld~e~~~~~~l~V~a~D~~~~~~~~~~~v~I~V~d~nd~~P~f~~~~~~~~v~E~~~~g~~v~~v~a~D~-----~~i 560 (582)
..||+|.+..|.|.|.|+|.|.+ ++.++|.|+|.|+||+||+|....|.+++.|+.+.|.+++.++..|. ...
T Consensus 2739 ~eld~ek~d~y~lkv~AtDhG~~--ssq~~v~v~vtDvndspprf~~eiy~gtvv~d~p~~~~ia~~si~d~D~s~~nq~ 2816 (4289)
T KOG1219|consen 2739 FELDLEKQDLYSLKVVATDHGVP--SSQATVLVHVTDVNDSPPRFQREIYEGTVVEDVPGGKIIAGLSIFDADVSEVNQV 2816 (4289)
T ss_pred hhhccccCCceEEEEEEecCCcc--cccceEEEEEEecCCCcchhhhHhhccceeccCCCCceeeeeEecccccccccee
Confidence 99999999999999999999996 78899999999999999999999999999999999999999998873 358
Q ss_pred EEEEEeCCCCCcE-EEec
Q psy7562 561 VFEIIRGDSDDMF-SINK 577 (582)
Q Consensus 561 ~y~i~~g~~~~~F-~Id~ 577 (582)
.|.|..|++.|.| .|+.
T Consensus 2817 t~fI~~gd~~gqF~~i~n 2834 (4289)
T KOG1219|consen 2817 TGFITLGDPLGQFWIIEN 2834 (4289)
T ss_pred EEEEeCCCccceEEEEcC
Confidence 8999999999888 5543
No 5
>cd00031 CA Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here
Probab=100.00 E-value=2e-30 Score=247.24 Aligned_cols=196 Identities=42% Similarity=0.624 Sum_probs=183.4
Q ss_pred eEEEEECCCCCCcEEEEEEEEeCCCC--CeEEEEEEeCCCCCCccCEEEECcccEEEECccCCCCCccEEEEEEEEEeCC
Q psy7562 324 YETNISESVPVGSEILKVAASDSDQE--SKVLYSIHHAQSPLSAALFRIDFHSGALSVAQPLDRESLSHHLLTIMARDQG 401 (582)
Q Consensus 324 ~~~~v~E~~~~gt~v~~v~a~D~D~~--~~v~y~i~~~~~~~~~~~F~i~~~tG~i~~~~~lD~E~~~~~~l~v~a~D~~ 401 (582)
|.+.|.|+++.|+.++++.|.|+|.+ +.+.|+|.++.. ..+|.|++.+|.|++.+.||||....|.|.|+|+|.|
T Consensus 2 ~~~~i~En~~~g~~v~~~~a~D~D~~~~~~~~y~i~~~~~---~~~F~i~~~tG~l~~~~~lD~e~~~~~~l~v~a~D~g 78 (199)
T cd00031 2 YSVSVPENAPPGTVVGTVSATDPDSGENGRVTYSILGGNE---DGLFSIDPNTGVITTTKPLDREEQSEYTLTVVASDGG 78 (199)
T ss_pred eEEEEeCCCCCCCEEEEEEEECCCCCCCceEEEEEeCCCC---cccEEEeCCCCEEEECCCCCCcCCceEEEEEEEEECC
Confidence 67899999999999999999999964 899999998753 2799999999999999999999999999999999988
Q ss_pred CCCceeEEEEEEEEEecCCCCCeEecceeeEEEecCCCCCcEEEEEEEEECCCCCCcEEEEEEEeCCCCCceEEeCCCeE
Q psy7562 402 TPAKRNFARVSINVLDSNDHPPEFATSLVKGRVFETSAVGTNIVQVVATDRDHGENGVVTYSIASGNIGNVFTMDPILGT 481 (582)
Q Consensus 402 ~~~~~~~~~v~I~V~DvNDn~P~f~~~~~~~~v~E~~~~gt~v~~v~a~D~D~~~n~~v~y~l~~~~~~~~F~Id~~tG~ 481 (582)
.|.+++...++|.|.|+|||+|.|....|.+.|.|+.++|+.++++.|+|+|.+.++.++|+|.++...++|.|++.+|.
T Consensus 79 ~~~~~~~~~v~I~V~d~Nd~~P~~~~~~~~~~v~e~~~~~~~i~~~~a~D~D~~~~~~~~y~l~~~~~~~~f~i~~~~G~ 158 (199)
T cd00031 79 GPPLSSTATVTVTVLDVNDNPPVFEQSSYEASVPENAPPGTVVGTVTATDADSGENAKLTYSILSGNDKELFSIDPNTGI 158 (199)
T ss_pred cCcceeEEEEEEEEccCCCCCCcccccceEEEEeCCCCCCCEEEEEEEEcCCCCCCccEEEEEeCCCCCCEEEEeCCceE
Confidence 88778999999999999999999998899999999999999999999999999989999999998665589999999999
Q ss_pred EEEcccCCcCCCceEEEEEEEEECCCCCceeeEEEEEEEEe
Q psy7562 482 IQTARPLDLSLMSEYMLLVKATDQGAPPLAATVPVQIIVVM 522 (582)
Q Consensus 482 i~~~~~ld~e~~~~~~l~V~a~D~~~~~~~~~~~v~I~V~d 522 (582)
|.+.+.||||....|.|.|.|+|.+.+.+++++.++|.|.|
T Consensus 159 i~~~~~ld~e~~~~~~l~v~a~D~~~~~~~~~~~i~i~v~d 199 (199)
T cd00031 159 ITLAKPLDREEKSSYELTVVATDGGGPPLSSTATVTVTVLD 199 (199)
T ss_pred EEeCCccCCccCceEEEEEEEEECCCCCceeEEEEEEEEEC
Confidence 99999999999999999999999998878999999999875
No 6
>cd00031 CA Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here
Probab=99.96 E-value=9.1e-28 Score=228.84 Aligned_cols=193 Identities=38% Similarity=0.580 Sum_probs=173.7
Q ss_pred eeEEEecCCCCCcEEEEEEEEcCCC---CeEEEEEEeCCCCCCeEEeCCcceEEEcccCCcccccEEEEEEEEeeCCccc
Q psy7562 109 QRVEVTENDAVGFLVALIQASDPDG---DTLWYRIVDGDPNSTFSIGNDKGNVLLARRVDWETQAQYTLNISVSDGIHEW 185 (582)
Q Consensus 109 ~~~~v~E~~~~g~~v~~v~A~D~D~---~~~~y~i~~~~~~~~F~I~~~tG~i~~~~~LD~E~~~~~~l~V~a~D~~~~~ 185 (582)
|.++|.||+++|+.+++++|+|+|. +.+.|+|.++....+|+|++.+|.|++++.||||....|.|.|+|.|.+...
T Consensus 2 ~~~~i~En~~~g~~v~~~~a~D~D~~~~~~~~y~i~~~~~~~~F~i~~~tG~l~~~~~lD~e~~~~~~l~v~a~D~g~~~ 81 (199)
T cd00031 2 YSVSVPENAPPGTVVGTVSATDPDSGENGRVTYSILGGNEDGLFSIDPNTGVITTTKPLDREEQSEYTLTVVASDGGGPP 81 (199)
T ss_pred eEEEEeCCCCCCCEEEEEEEECCCCCCCceEEEEEeCCCCcccEEEeCCCCEEEECCCCCCcCCceEEEEEEEEECCcCc
Confidence 6789999999999999999999997 5799999977665799999999999999999999999999999999864321
Q ss_pred ccccCCceeEEeeeccccccceEEEEEEEEeCCCCCcccccEEEEEEEEeeecCCCCccCCCceEEEEEeecccceEEEE
Q psy7562 186 TGRHRPSAIRTVVNLDHESLPRIEFHVQVTDLGRPRLTSDTLAKVIVEVTDTNDCPPQFSQDVYNVSILVPTFTNVAVLQ 265 (582)
Q Consensus 186 ~~~~~p~~~~~~~~ld~e~~~~~~~~v~~~D~g~p~~~~~~~~~v~I~V~DvNDn~P~f~~~~~~~~v~~~~~~~~~~~~ 265 (582)
T Consensus 82 -------------------------------------------------------------------------------- 81 (199)
T cd00031 82 -------------------------------------------------------------------------------- 81 (199)
T ss_pred --------------------------------------------------------------------------------
Confidence
Q ss_pred EEeeeeeEEEEeeeeeeEEEEEcccCcccCCCceeEEEEEEEEEEeCCCCCCeeccCeeEEEEECCCCCCcEEEEEEEEe
Q psy7562 266 FSQDVYNVSILVPTFTNVAVLQLNSGETNCSHQWSSRIWLHVTVVDVNEHRPILTQSLYETNISESVPVGSEILKVAASD 345 (582)
Q Consensus 266 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~v~I~V~DvNd~~P~f~~~~~~~~v~E~~~~gt~v~~v~a~D 345 (582)
++++..+.|.|.|+||++|.|....|.+.|.|+.+.|+.++++.|+|
T Consensus 82 ---------------------------------~~~~~~v~I~V~d~Nd~~P~~~~~~~~~~v~e~~~~~~~i~~~~a~D 128 (199)
T cd00031 82 ---------------------------------LSSTATVTVTVLDVNDNPPVFEQSSYEASVPENAPPGTVVGTVTATD 128 (199)
T ss_pred ---------------------------------ceeEEEEEEEEccCCCCCCcccccceEEEEeCCCCCCCEEEEEEEEc
Confidence 23456688999999999999998889999999999999999999999
Q ss_pred CCC--CCeEEEEEEeCCCCCCccCEEEECcccEEEECccCCCCCccEEEEEEEEEeCCCCCceeEEEEEEEEEe
Q psy7562 346 SDQ--ESKVLYSIHHAQSPLSAALFRIDFHSGALSVAQPLDRESLSHHLLTIMARDQGTPAKRNFARVSINVLD 417 (582)
Q Consensus 346 ~D~--~~~v~y~i~~~~~~~~~~~F~i~~~tG~i~~~~~lD~E~~~~~~l~v~a~D~~~~~~~~~~~v~I~V~D 417 (582)
+|. ++.++|+|.++.. .++|.|++.+|.|++.+.||||....|.|.|.|+|.+.+.++..+.++|.|.|
T Consensus 129 ~D~~~~~~~~y~l~~~~~---~~~f~i~~~~G~i~~~~~ld~e~~~~~~l~v~a~D~~~~~~~~~~~i~i~v~d 199 (199)
T cd00031 129 ADSGENAKLTYSILSGND---KELFSIDPNTGIITLAKPLDREEKSSYELTVVATDGGGPPLSSTATVTVTVLD 199 (199)
T ss_pred CCCCCCccEEEEEeCCCC---CCEEEEeCCceEEEeCCccCCccCceEEEEEEEEECCCCCceeEEEEEEEEEC
Confidence 996 6999999998752 47999999999999999999999999999999999988778999999998875
No 7
>PF00028 Cadherin: Cadherin domain; InterPro: IPR002126 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion.; GO: 0005509 calcium ion binding, 0007156 homophilic cell adhesion, 0016020 membrane; PDB: 2A4E_A 2A4C_B 2O72_A 2QVI_A 1NCJ_A 3Q2W_A 3Q2N_A 3LNH_B 3LNI_A 3Q2L_A ....
Probab=99.73 E-value=9.5e-17 Score=132.95 Aligned_cols=92 Identities=45% Similarity=0.627 Sum_probs=88.6
Q ss_pred eeEEEecCCCCCcEEEEEEEEECCCCCCcEEEEEEEeCCCCCceEEeCCCeEEEEcccCCcCCCceEEEEEEEEEC-CCC
Q psy7562 430 VKGRVFETSAVGTNIVQVVATDRDHGENGVVTYSIASGNIGNVFTMDPILGTIQTARPLDLSLMSEYMLLVKATDQ-GAP 508 (582)
Q Consensus 430 ~~~~v~E~~~~gt~v~~v~a~D~D~~~n~~v~y~l~~~~~~~~F~Id~~tG~i~~~~~ld~e~~~~~~l~V~a~D~-~~~ 508 (582)
|.+.|+|+.++|+.++++.|.|+|.+.|+.+.|+|..++..++|.|++.+|.|++.++||||....|.|.|.|+|. |.|
T Consensus 1 Y~~~v~E~~~~g~~v~~v~a~D~D~~~n~~i~y~i~~~~~~~~F~I~~~tg~i~~~~~LD~E~~~~y~l~v~a~D~~~~~ 80 (93)
T PF00028_consen 1 YSFSVPENAPPGTVVGQVTATDPDSGPNSQITYSILGGNPDGLFSIDPNTGEISLKKPLDRETQSSYQLTVRATDSGGSP 80 (93)
T ss_dssp EEEEEETTGSTSSEEEEEEEEESSTSTTSSEEEEEEETTSTTSEEEETTTTEEEESSSSCTTTTSEEEEEEEEEETTTSS
T ss_pred CEEEEECCCCCCCEEEEEEEEeCCCCCCceEEEEEecCcccCceEEeeeeeccccceecCcccCCEEEEEEEEEECCCCC
Confidence 7899999999999999999999999999999999999888899999999999999999999999999999999999 889
Q ss_pred CceeeEEEEEEEE
Q psy7562 509 PLAATVPVQIIVV 521 (582)
Q Consensus 509 ~~~~~~~v~I~V~ 521 (582)
+++++++|.|+|+
T Consensus 81 ~~~~~~~V~I~V~ 93 (93)
T PF00028_consen 81 PLSSTATVTINVL 93 (93)
T ss_dssp EEEEEEEEEEEEE
T ss_pred CCEEEEEEEEEEC
Confidence 9999999999985
No 8
>KOG1834|consensus
Probab=99.68 E-value=2.8e-15 Score=154.02 Aligned_cols=207 Identities=28% Similarity=0.361 Sum_probs=165.2
Q ss_pred EEEEeCCCCCCeeccCeeEEEEECCCCCCcEEE--EEEEEeCCC-----CCeEEEEEEeCCCCCCccCEEEECccc--EE
Q psy7562 307 VTVVDVNEHRPILTQSLYETNISESVPVGSEIL--KVAASDSDQ-----ESKVLYSIHHAQSPLSAALFRIDFHSG--AL 377 (582)
Q Consensus 307 I~V~DvNd~~P~f~~~~~~~~v~E~~~~gt~v~--~v~a~D~D~-----~~~v~y~i~~~~~~~~~~~F~i~~~tG--~i 377 (582)
...-.+|-+.|.... .|..-|.||. .+++. -+.|.|.|. +.-.-|.|.+..-| .+.--+|..|| .|
T Consensus 21 ~~aarankhkpwie~-ey~gvV~End--ntvll~Ppl~aLdkdaplr~ageiC~fklhgq~vP--FdavVvdK~TGegvl 95 (952)
T KOG1834|consen 21 HHAARANKHKPWIEE-EYHGVVTEND--NTVLLDPPLAALDKDAPLRYAGEICGFKLHGQPVP--FDAVVVDKYTGEGVL 95 (952)
T ss_pred cccccccccCccccc-ceeEEEEeCC--ceEEeCCCeeeecCCCCcccccccceeEecCCCCC--ceEEEEeccCCceEE
Confidence 445567777887654 6899999997 34443 477888883 34556788765432 34455677665 79
Q ss_pred EECccCCCCCccEEEEEEEEEeCCCCC------ceeEEEEEEEEEecCCCCCeEecceeeEEEecCCCCCcEEEEEEEEE
Q psy7562 378 SVAQPLDRESLSHHLLTIMARDQGTPA------KRNFARVSINVLDSNDHPPEFATSLVKGRVFETSAVGTNIVQVVATD 451 (582)
Q Consensus 378 ~~~~~lD~E~~~~~~l~v~a~D~~~~~------~~~~~~v~I~V~DvNDn~P~f~~~~~~~~v~E~~~~gt~v~~v~a~D 451 (582)
+.+.+||-|.+..|+|+|+|.|.|..+ ++-.++|.|+|.|+|+++|+|..+.|.+.|.|.. .-..|++|.|.|
T Consensus 96 RaK~~lDCelqkeytf~iQAydCg~gpdgtn~kKShkatvhIrVkDvNe~AP~f~ep~Yka~V~EGK-~yd~il~veAiD 174 (952)
T KOG1834|consen 96 RAKEPLDCELQKEYTFTIQAYDCGNGPDGTNTKKSHKATVHIRVKDVNEFAPVFKEPWYKAHVTEGK-VYDSILRVEAID 174 (952)
T ss_pred eecCcccccccccceEEEEEEecCCCCCccccccccceEEEEEeccccccCchhcccceeeEEecce-eeeeeEEEEeec
Confidence 999999999999999999999976533 4667899999999999999999999999999984 556899999999
Q ss_pred CCCC-CCcEE-EEEEEeCCCCCceEEeCCCeEEEEcccCCcCCCceEEEEEEEEECCCCCceeeEEEEEEEEe
Q psy7562 452 RDHG-ENGVV-TYSIASGNIGNVFTMDPILGTIQTARPLDLSLMSEYMLLVKATDQGAPPLAATVPVQIIVVM 522 (582)
Q Consensus 452 ~D~~-~n~~v-~y~l~~~~~~~~F~Id~~tG~i~~~~~ld~e~~~~~~l~V~a~D~~~~~~~~~~~v~I~V~d 522 (582)
.|-+ .+++| .|.|.. ..-.|.||. .|.|..+.+|.|.....|.|+|.|.|+|..+..+-+.|+|.|..
T Consensus 175 ~DCspq~sqIC~YEI~t--~d~PFaIdn-~G~irnTekLny~ke~~Y~ltVtAyDCg~kraa~d~lV~v~Vkp 244 (952)
T KOG1834|consen 175 KDCSPQYSQICEYEITT--PDVPFAIDN-DGNIRNTEKLNYTKEHQYKLTVTAYDCGKKRAASDSLVTVHVKP 244 (952)
T ss_pred CCCCCcccceeEEEecC--CCCceEEcC-CCccccccccccccceeEEEEEEEEecccccccCcceEEEEecC
Confidence 9975 47766 788875 457899997 89999999999999999999999999998655555777777753
No 9
>PF00028 Cadherin: Cadherin domain; InterPro: IPR002126 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion.; GO: 0005509 calcium ion binding, 0007156 homophilic cell adhesion, 0016020 membrane; PDB: 2A4E_A 2A4C_B 2O72_A 2QVI_A 1NCJ_A 3Q2W_A 3Q2N_A 3LNH_B 3LNI_A 3Q2L_A ....
Probab=99.65 E-value=4e-15 Score=123.15 Aligned_cols=90 Identities=40% Similarity=0.625 Sum_probs=84.8
Q ss_pred eEEEEECCCCCCcEEEEEEEEeCC--CCCeEEEEEEeCCCCCCccCEEEECcccEEEECccCCCCCccEEEEEEEEEeC-
Q psy7562 324 YETNISESVPVGSEILKVAASDSD--QESKVLYSIHHAQSPLSAALFRIDFHSGALSVAQPLDRESLSHHLLTIMARDQ- 400 (582)
Q Consensus 324 ~~~~v~E~~~~gt~v~~v~a~D~D--~~~~v~y~i~~~~~~~~~~~F~i~~~tG~i~~~~~lD~E~~~~~~l~v~a~D~- 400 (582)
|.++|+|++++|+.++++.|.|+| .++.+.|+|..+.. .++|.|++.+|.|++.++||||....|.|.|.|+|.
T Consensus 1 Y~~~v~E~~~~g~~v~~v~a~D~D~~~n~~i~y~i~~~~~---~~~F~I~~~tg~i~~~~~LD~E~~~~y~l~v~a~D~~ 77 (93)
T PF00028_consen 1 YSFSVPENAPPGTVVGQVTATDPDSGPNSQITYSILGGNP---DGLFSIDPNTGEISLKKPLDRETQSSYQLTVRATDSG 77 (93)
T ss_dssp EEEEEETTGSTSSEEEEEEEEESSTSTTSSEEEEEEETTS---TTSEEEETTTTEEEESSSSCTTTTSEEEEEEEEEETT
T ss_pred CEEEEECCCCCCCEEEEEEEEeCCCCCCceEEEEEecCcc---cCceEEeeeeeccccceecCcccCCEEEEEEEEEECC
Confidence 789999999999999999999999 57999999999863 589999999999999999999999999999999999
Q ss_pred CCCCceeEEEEEEEEE
Q psy7562 401 GTPAKRNFARVSINVL 416 (582)
Q Consensus 401 ~~~~~~~~~~v~I~V~ 416 (582)
+.|++++++.|.|+|+
T Consensus 78 ~~~~~~~~~~V~I~V~ 93 (93)
T PF00028_consen 78 GSPPLSSTATVTINVL 93 (93)
T ss_dssp TSSEEEEEEEEEEEEE
T ss_pred CCCCCEEEEEEEEEEC
Confidence 8899999999999885
No 10
>smart00112 CA Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
Probab=99.61 E-value=5.2e-15 Score=118.46 Aligned_cols=79 Identities=49% Similarity=0.793 Sum_probs=74.6
Q ss_pred EECCCCCCcEEEEEEEeCCCCCceEEeCCCeEEEEcccCCcCCCceEEEEEEEEECCCCCceeeEEEEEEEEecCCCCC
Q psy7562 450 TDRDHGENGVVTYSIASGNIGNVFTMDPILGTIQTARPLDLSLMSEYMLLVKATDQGAPPLAATVPVQIIVVMADNDPP 528 (582)
Q Consensus 450 ~D~D~~~n~~v~y~l~~~~~~~~F~Id~~tG~i~~~~~ld~e~~~~~~l~V~a~D~~~~~~~~~~~v~I~V~d~nd~~P 528 (582)
+|+|.|.|+.++|+|.+++...+|.|++.+|.|.+.++||||....|.|.|+|+|.|.|++++.++|+|.|.|+||++|
T Consensus 1 ~D~D~g~n~~i~Y~i~~~~~~~~F~i~~~tg~i~~~~~LD~e~~~~y~l~v~a~D~~~~~~~~~~~v~I~V~D~Nd~~P 79 (79)
T smart00112 1 TDADSGENGKVTYSILSGNEDGLFSIDPETGEITTTKPLDREEQPEYTLTVEATDGGGPPLSSTATVTVTVLDVNDNAP 79 (79)
T ss_pred CCCCCCcCcEEEEEEecCCCCCEEEEeCCccEEEeCCccCeeCCCeEEEEEEEEECCCCCcccEEEEEEEEEECCCCCC
Confidence 4889999999999999877668999999999999999999999999999999999999999999999999999999998
No 11
>smart00112 CA Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
Probab=99.50 E-value=1.7e-13 Score=109.68 Aligned_cols=76 Identities=41% Similarity=0.635 Sum_probs=69.6
Q ss_pred eCCC--CCeEEEEEEeCCCCCCccCEEEECcccEEEECccCCCCCccEEEEEEEEEeCCCCCceeEEEEEEEEEecCCCC
Q psy7562 345 DSDQ--ESKVLYSIHHAQSPLSAALFRIDFHSGALSVAQPLDRESLSHHLLTIMARDQGTPAKRNFARVSINVLDSNDHP 422 (582)
Q Consensus 345 D~D~--~~~v~y~i~~~~~~~~~~~F~i~~~tG~i~~~~~lD~E~~~~~~l~v~a~D~~~~~~~~~~~v~I~V~DvNDn~ 422 (582)
|+|. ++.+.|+|.++.. ..+|.|++.+|.|++.++||||....|.|.|+|.|.+.|++++.+.|.|.|.|+|||+
T Consensus 2 D~D~g~n~~i~Y~i~~~~~---~~~F~i~~~tg~i~~~~~LD~e~~~~y~l~v~a~D~~~~~~~~~~~v~I~V~D~Nd~~ 78 (79)
T smart00112 2 DADSGENGKVTYSILSGNE---DGLFSIDPETGEITTTKPLDREEQPEYTLTVEATDGGGPPLSSTATVTVTVLDVNDNA 78 (79)
T ss_pred CCCCCcCcEEEEEEecCCC---CCEEEEeCCccEEEeCCccCeeCCCeEEEEEEEEECCCCCcccEEEEEEEEEECCCCC
Confidence 6664 5889999998753 2799999999999999999999999999999999999999999999999999999999
Q ss_pred C
Q psy7562 423 P 423 (582)
Q Consensus 423 P 423 (582)
|
T Consensus 79 P 79 (79)
T smart00112 79 P 79 (79)
T ss_pred C
Confidence 8
No 12
>KOG1834|consensus
Probab=99.46 E-value=3.7e-12 Score=131.46 Aligned_cols=198 Identities=26% Similarity=0.319 Sum_probs=148.1
Q ss_pred eeCCCeeEEEecCCCCCcEEEEEEEEcCCC-----Ce-EEEEEEeCCCCCCeE---EeCCcc--eEEEcccCCcccccEE
Q psy7562 104 IKSPDQRVEVTENDAVGFLVALIQASDPDG-----DT-LWYRIVDGDPNSTFS---IGNDKG--NVLLARRVDWETQAQY 172 (582)
Q Consensus 104 f~~~~~~~~v~E~~~~g~~v~~v~A~D~D~-----~~-~~y~i~~~~~~~~F~---I~~~tG--~i~~~~~LD~E~~~~~ 172 (582)
+....|..-|.||...=....-+-|-|.|+ +. --|.|.++.- .|. +|..|| .|+.+.+||-|.++.|
T Consensus 32 wie~ey~gvV~Endntvll~Ppl~aLdkdaplr~ageiC~fklhgq~v--PFdavVvdK~TGegvlRaK~~lDCelqkey 109 (952)
T KOG1834|consen 32 WIEEEYHGVVTENDNTVLLDPPLAALDKDAPLRYAGEICGFKLHGQPV--PFDAVVVDKYTGEGVLRAKEPLDCELQKEY 109 (952)
T ss_pred ccccceeEEEEeCCceEEeCCCeeeecCCCCcccccccceeEecCCCC--CceEEEEeccCCceEEeecCcccccccccc
Confidence 345678888888854322222355777775 33 3577765332 454 477776 7888999999999999
Q ss_pred EEEEEEeeCCcccccccCCceeEEeeeccccccceEEEEEEEEeCCCCCcccccEEEEEEEEeeecCCCCccCCCceEEE
Q psy7562 173 TLNISVSDGIHEWTGRHRPSAIRTVVNLDHESLPRIEFHVQVTDLGRPRLTSDTLAKVIVEVTDTNDCPPQFSQDVYNVS 252 (582)
Q Consensus 173 ~l~V~a~D~~~~~~~~~~p~~~~~~~~ld~e~~~~~~~~v~~~D~g~p~~~~~~~~~v~I~V~DvNDn~P~f~~~~~~~~ 252 (582)
+|+|+|.|-+....+ +
T Consensus 110 tf~iQAydCg~gpdg-----------------------------------t----------------------------- 125 (952)
T KOG1834|consen 110 TFTIQAYDCGNGPDG-----------------------------------T----------------------------- 125 (952)
T ss_pred eEEEEEEecCCCCCc-----------------------------------c-----------------------------
Confidence 999999986543220 0
Q ss_pred EEeecccceEEEEEEeeeeeEEEEeeeeeeEEEEEcccCcccCCCceeEEEEEEEEEEeCCCCCCeeccCeeEEEEECCC
Q psy7562 253 ILVPTFTNVAVLQFSQDVYNVSILVPTFTNVAVLQLNSGETNCSHQWSSRIWLHVTVVDVNEHRPILTQSLYETNISESV 332 (582)
Q Consensus 253 v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~v~I~V~DvNd~~P~f~~~~~~~~v~E~~ 332 (582)
.-..|-.++|+|.|.|+|+.+|+|..+.|.+.|.|..
T Consensus 126 -------------------------------------------n~kKShkatvhIrVkDvNe~AP~f~ep~Yka~V~EGK 162 (952)
T KOG1834|consen 126 -------------------------------------------NTKKSHKATVHIRVKDVNEFAPVFKEPWYKAHVTEGK 162 (952)
T ss_pred -------------------------------------------ccccccceEEEEEeccccccCchhcccceeeEEecce
Confidence 0112345779999999999999999999999999986
Q ss_pred CCCcEEEEEEEEeCC---CC-CeEEEEEEeCCCCCCccCEEEECcccEEEECccCCCCCccEEEEEEEEEeCCCCCceeE
Q psy7562 333 PVGSEILKVAASDSD---QE-SKVLYSIHHAQSPLSAALFRIDFHSGALSVAQPLDRESLSHHLLTIMARDQGTPAKRNF 408 (582)
Q Consensus 333 ~~gt~v~~v~a~D~D---~~-~~v~y~i~~~~~~~~~~~F~i~~~tG~i~~~~~lD~E~~~~~~l~v~a~D~~~~~~~~~ 408 (582)
.-..|++|.|.|.| .. .-..|.|...+ -.|.||.. |.|+.+.+|.|.+...|.|+|.|.|.|.-....-
T Consensus 163 -~yd~il~veAiD~DCspq~sqIC~YEI~t~d-----~PFaIdn~-G~irnTekLny~ke~~Y~ltVtAyDCg~kraa~d 235 (952)
T KOG1834|consen 163 -VYDSILRVEAIDKDCSPQYSQICEYEITTPD-----VPFAIDND-GNIRNTEKLNYTKEHQYKLTVTAYDCGKKRAASD 235 (952)
T ss_pred -eeeeeEEEEeecCCCCCcccceeEEEecCCC-----CceEEcCC-CccccccccccccceeEEEEEEEEecccccccCc
Confidence 46779999999999 22 45678888754 57999954 9999999999999999999999999987544455
Q ss_pred EEEEEEEEe
Q psy7562 409 ARVSINVLD 417 (582)
Q Consensus 409 ~~v~I~V~D 417 (582)
+.|+|.|..
T Consensus 236 ~lV~v~Vkp 244 (952)
T KOG1834|consen 236 SLVTVHVKP 244 (952)
T ss_pred ceEEEEecC
Confidence 677787753
No 13
>PF08266 Cadherin_2: Cadherin-like; InterPro: IPR013164 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion. This entry represents a cadherin domain that is usually found at the N terminus of cadherin proteins.; PDB: 1WUZ_A 1WYJ_A.
Probab=98.17 E-value=4.2e-06 Score=66.72 Aligned_cols=61 Identities=25% Similarity=0.380 Sum_probs=43.5
Q ss_pred eeEEEecCCCCCcEEEEEEEEcCCC-----CeEEEEEEeCCCCCCeEEeCCcceEEEcccCCccccc
Q psy7562 109 QRVEVTENDAVGFLVALIQASDPDG-----DTLWYRIVDGDPNSTFSIGNDKGNVLLARRVDWETQA 170 (582)
Q Consensus 109 ~~~~v~E~~~~g~~v~~v~A~D~D~-----~~~~y~i~~~~~~~~F~I~~~tG~i~~~~~LD~E~~~ 170 (582)
..++|+|..++|+.||.+ |.|... ....|+|.......+|.++..+|.|+++.++|||+.+
T Consensus 3 i~YsV~EE~~~Gt~IGni-a~dL~l~~~~l~~~~~ri~s~~~~~~~~v~~~tG~L~v~~rIDRE~LC 68 (84)
T PF08266_consen 3 IRYSVPEEMPPGTVIGNI-AKDLGLDPQSLSSRNFRIVSEGNSQYFRVNEKTGDLFVSERIDREELC 68 (84)
T ss_dssp EEEEEESS--TT-EEEEC-CCCCT--HHHHCCTTBEEE-SSSS-SEEE-TTTSEEEESS--SCCCC-
T ss_pred eEEEeecCCCCCCEEEEh-HHhhCCCcccccccceEEeecCCcceeEecCCceeEEeCCccCHHHHC
Confidence 467999999999999999 777643 3457888776677899999999999999999999864
No 14
>PF08758 Cadherin_pro: Cadherin prodomain like; InterPro: IPR014868 Cadherins are a group of proteins that mediate calcium dependent cell-cell adhesion. They are activated through cleavage of a prosequence in the late Golgi. This protein corresponds to the folded region of the prosequence, and is termed the prodomain. The prodomain shows structural resemblance to the cadherin domain, but lacks all the features known to be important for cadherin-cadherin interactions []. ; GO: 0007155 cell adhesion, 0016021 integral to membrane; PDB: 1OP4_A.
Probab=97.43 E-value=0.0016 Score=52.72 Aligned_cols=79 Identities=23% Similarity=0.347 Sum_probs=43.8
Q ss_pred CCCeeccCeeEEEEECCCCCCcEEEEEEEEeCCCCCeEEEEEEeCCCCCCccCEEEECcccEEEECccCCCCCccEEEEE
Q psy7562 315 HRPILTQSLYETNISESVPVGSEILKVAASDSDQESKVLYSIHHAQSPLSAALFRIDFHSGALSVAQPLDRESLSHHLLT 394 (582)
Q Consensus 315 ~~P~f~~~~~~~~v~E~~~~gt~v~~v~a~D~D~~~~v~y~i~~~~~~~~~~~F~i~~~tG~i~~~~~lD~E~~~~~~l~ 394 (582)
+.|-|....|.+.|+.+...|..|++|.-.|-.....+.|.-.+ ..|.|..+ |.|++++++..... .-.|.
T Consensus 2 C~pGF~~~~~~~~Vp~~l~~g~~lg~V~f~dC~~~~~~~~~ssD-------pdF~V~~D-GsVy~~r~v~l~~~-~~~F~ 72 (90)
T PF08758_consen 2 CRPGFSQKKYTFEVPSNLEAGQPLGKVNFEDCTGRRRVIFESSD-------PDFRVLED-GSVYAKRPVQLSSE-QRSFT 72 (90)
T ss_dssp ---B--S-EEEE----SS-SS--EEE---B--SS---EEEE----------SEEEEETT-TEEEEES--S-SSS--EEEE
T ss_pred CcCCcccceEEEEcCchhhCCcEEEEEEeccCCCCCceEEecCC-------CCEEEcCC-CeEEEeeeEecCCC-ceEEE
Confidence 46889999999999999999999999999999767778887654 36999976 99999999876543 35799
Q ss_pred EEEEeCCC
Q psy7562 395 IMARDQGT 402 (582)
Q Consensus 395 v~a~D~~~ 402 (582)
|.|.|..+
T Consensus 73 V~a~D~~~ 80 (90)
T PF08758_consen 73 VHAWDSQT 80 (90)
T ss_dssp EEEEETTT
T ss_pred EEEECCCC
Confidence 99999875
No 15
>PF08266 Cadherin_2: Cadherin-like; InterPro: IPR013164 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion. This entry represents a cadherin domain that is usually found at the N terminus of cadherin proteins.; PDB: 1WUZ_A 1WYJ_A.
Probab=97.12 E-value=0.00079 Score=53.74 Aligned_cols=60 Identities=27% Similarity=0.436 Sum_probs=39.4
Q ss_pred eEEEEECCCCCCcEEEEEEEEeCCCC----CeEEEEEEeCCCCCCccCEEEECcccEEEECccCCCCC
Q psy7562 324 YETNISESVPVGSEILKVAASDSDQE----SKVLYSIHHAQSPLSAALFRIDFHSGALSVAQPLDRES 387 (582)
Q Consensus 324 ~~~~v~E~~~~gt~v~~v~a~D~D~~----~~v~y~i~~~~~~~~~~~F~i~~~tG~i~~~~~lD~E~ 387 (582)
...+|+|..+.|+.|+.+ |.|.--+ ..-.|++.... ...+|.++..+|.|+++..+|||+
T Consensus 3 i~YsV~EE~~~Gt~IGni-a~dL~l~~~~l~~~~~ri~s~~---~~~~~~v~~~tG~L~v~~rIDRE~ 66 (84)
T PF08266_consen 3 IRYSVPEEMPPGTVIGNI-AKDLGLDPQSLSSRNFRIVSEG---NSQYFRVNEKTGDLFVSERIDREE 66 (84)
T ss_dssp EEEEEESS--TT-EEEEC-CCCCT--HHHHCCTTBEEE-SS---SS-SEEE-TTTSEEEESS--SCCC
T ss_pred eEEEeecCCCCCCEEEEh-HHhhCCCcccccccceEEeecC---CcceeEecCCceeEEeCCccCHHH
Confidence 457899999999999999 4455311 23357776643 358999999999999999999997
No 16
>PF08758 Cadherin_pro: Cadherin prodomain like; InterPro: IPR014868 Cadherins are a group of proteins that mediate calcium dependent cell-cell adhesion. They are activated through cleavage of a prosequence in the late Golgi. This protein corresponds to the folded region of the prosequence, and is termed the prodomain. The prodomain shows structural resemblance to the cadherin domain, but lacks all the features known to be important for cadherin-cadherin interactions []. ; GO: 0007155 cell adhesion, 0016021 integral to membrane; PDB: 1OP4_A.
Probab=96.96 E-value=0.0082 Score=48.60 Aligned_cols=79 Identities=20% Similarity=0.214 Sum_probs=43.9
Q ss_pred CCeEecceeeEEEecCCCCCcEEEEEEEEECCCCCCcEEEEEEEeCCCCCceEEeCCCeEEEEcccCCcCCCceEEEEEE
Q psy7562 422 PPEFATSLVKGRVFETSAVGTNIVQVVATDRDHGENGVVTYSIASGNIGNVFTMDPILGTIQTARPLDLSLMSEYMLLVK 501 (582)
Q Consensus 422 ~P~f~~~~~~~~v~E~~~~gt~v~~v~a~D~D~~~n~~v~y~l~~~~~~~~F~Id~~tG~i~~~~~ld~e~~~~~~l~V~ 501 (582)
.|-|....|.+.|+.+...|..|++|.-.|-.. +..+.|.-. + ..|.|.+ .|.|++++++... ...-.+.|.
T Consensus 3 ~pGF~~~~~~~~Vp~~l~~g~~lg~V~f~dC~~--~~~~~~~ss--D--pdF~V~~-DGsVy~~r~v~l~-~~~~~F~V~ 74 (90)
T PF08758_consen 3 RPGFSQKKYTFEVPSNLEAGQPLGKVNFEDCTG--RRRVIFESS--D--PDFRVLE-DGSVYAKRPVQLS-SEQRSFTVH 74 (90)
T ss_dssp --B--S-EEEE----SS-SS--EEE---B--SS-----EEEE-------SEEEEET-TTEEEEES--S-S-SS-EEEEEE
T ss_pred cCCcccceEEEEcCchhhCCcEEEEEEeccCCC--CCceEEecC--C--CCEEEcC-CCeEEEeeeEecC-CCceEEEEE
Confidence 478999999999999999999999999999843 445777654 2 4899998 8999999999754 344689999
Q ss_pred EEECCCC
Q psy7562 502 ATDQGAP 508 (582)
Q Consensus 502 a~D~~~~ 508 (582)
|.|..+.
T Consensus 75 a~D~~~~ 81 (90)
T PF08758_consen 75 AWDSQTQ 81 (90)
T ss_dssp EEETTTT
T ss_pred EECCCCC
Confidence 9998763
No 17
>smart00736 CADG Dystroglycan-type cadherin-like domains. Cadherin-homologous domains present in metazoan dystroglycans and alpha/epsilon sarcoglycans, yeast Axl2p and in a very large protein from magnetotactic bacteria. Likely to bind calcium ions.
Probab=96.82 E-value=0.02 Score=47.30 Aligned_cols=73 Identities=22% Similarity=0.348 Sum_probs=56.9
Q ss_pred EEeCCCCCeEEEEEEeCCCCCCccCEEEECcccEEEECccCCCCCccEEEEEEEEEeCCCCCceeEEEEEEEEEecCC
Q psy7562 343 ASDSDQESKVLYSIHHAQSPLSAALFRIDFHSGALSVAQPLDRESLSHHLLTIMARDQGTPAKRNFARVSINVLDSND 420 (582)
Q Consensus 343 a~D~D~~~~v~y~i~~~~~~~~~~~F~i~~~tG~i~~~~~lD~E~~~~~~l~v~a~D~~~~~~~~~~~v~I~V~DvND 420 (582)
..|.| +..++|++...+....+.|...++.++.+.-. +.... ...|.++|.|+|+.+ .+....++|.|.+.|+
T Consensus 24 F~d~d-~~~lty~~~~~~~~~lP~Wl~fd~~~~~~~Gt-P~~~~-~g~~~i~v~a~D~~g--~~~~~~f~i~V~~~~~ 96 (97)
T smart00736 24 FTDAD-GDTLTYSATLSDGSALPSWLSFDSDTGTLSGT-PTNSD-VGSLSLKVTATDSSG--ASASDTFTITVVNTND 96 (97)
T ss_pred eECCC-CCeEEEEEEeCCCCCCCCeEEEeCCCCEEEEE-CCCCC-CcEEEEEEEEEECCC--CEEEEEEEEEEeCCCC
Confidence 46777 78999999865433346799999999988874 44333 456999999999876 6778889999999887
No 18
>smart00736 CADG Dystroglycan-type cadherin-like domains. Cadherin-homologous domains present in metazoan dystroglycans and alpha/epsilon sarcoglycans, yeast Axl2p and in a very large protein from magnetotactic bacteria. Likely to bind calcium ions.
Probab=95.94 E-value=0.13 Score=42.50 Aligned_cols=71 Identities=28% Similarity=0.356 Sum_probs=53.8
Q ss_pred EEEECCCCCCcEEEEEEEeC---CCCCceEEeCCCeEEEEcccCCcCCCceEEEEEEEEECCCCCceeeEEEEEEEEecC
Q psy7562 448 VATDRDHGENGVVTYSIASG---NIGNVFTMDPILGTIQTARPLDLSLMSEYMLLVKATDQGAPPLAATVPVQIIVVMAD 524 (582)
Q Consensus 448 ~a~D~D~~~n~~v~y~l~~~---~~~~~F~Id~~tG~i~~~~~ld~e~~~~~~l~V~a~D~~~~~~~~~~~v~I~V~d~n 524 (582)
...|.| ...++|++... ....|...++.++.+.-. +.. +..+.|.+.|.|+|+.+ .+....+.|.|.+.|
T Consensus 23 tF~d~d---~~~lty~~~~~~~~~lP~Wl~fd~~~~~~~Gt-P~~-~~~g~~~i~v~a~D~~g--~~~~~~f~i~V~~~~ 95 (97)
T smart00736 23 TFTDAD---GDTLTYSATLSDGSALPSWLSFDSDTGTLSGT-PTN-SDVGSLSLKVTATDSSG--ASASDTFTITVVNTN 95 (97)
T ss_pred ceECCC---CCeEEEEEEeCCCCCCCCeEEEeCCCCEEEEE-CCC-CCCcEEEEEEEEEECCC--CEEEEEEEEEEeCCC
Confidence 356776 34688888743 245799999999988774 443 33567999999999886 578889999999887
Q ss_pred C
Q psy7562 525 N 525 (582)
Q Consensus 525 d 525 (582)
+
T Consensus 96 ~ 96 (97)
T smart00736 96 D 96 (97)
T ss_pred C
Confidence 6
No 19
>TIGR01965 VCBS_repeat VCBS repeat. This domain of about 100 residues is found multiple (up to 35) copies in long proteins from several species of Vibrio, Colwellia, Bradyrhizobium, and Shewanella (hence the name VCBS) and in smaller copy numbers in proteins from several other bacteria. The large protein size and repeat copy numbers, species distribution, and suggested activities of several member proteins suggests a role for this domain in adhesion.
Probab=95.84 E-value=0.11 Score=42.63 Aligned_cols=89 Identities=22% Similarity=0.240 Sum_probs=61.8
Q ss_pred EEEEEEeCCCCCeEEEEEEeCCCCCCccCEEEECcccEEEEC--------ccCCCCCccEEEEEEEEEeCCCCCceeEEE
Q psy7562 339 LKVAASDSDQESKVLYSIHHAQSPLSAALFRIDFHSGALSVA--------QPLDRESLSHHLLTIMARDQGTPAKRNFAR 410 (582)
Q Consensus 339 ~~v~a~D~D~~~~v~y~i~~~~~~~~~~~F~i~~~tG~i~~~--------~~lD~E~~~~~~l~v~a~D~~~~~~~~~~~ 410 (582)
+++.++|+|.+..+.+++.... ...+.|.|++ .|.-... +.|..-+.-.-.|++.+.|+. +.+
T Consensus 2 G~Lt~sD~D~gd~~~~s~~~~~--g~yGtlti~~-~G~wtYtl~n~~~avq~L~~Ge~~tdsFtvtv~DGt------t~~ 72 (99)
T TIGR01965 2 GQLTISDADAGQAHFIAQTDAA--GQYGTFSIDA-DGQWTYQADNSQTAVQALKAGETLTDTFTVTSADGT------SQT 72 (99)
T ss_pred CceEEeCCCCCCceEEeccccc--CCcEEEEECC-CCcEEEEeCCCcHHHHhhcCCCEEEEEEEEEEeCCC------eEE
Confidence 4788999998888888886432 2457899987 4643221 234433444567888889862 788
Q ss_pred EEEEEEecCCCCCeEecceeeEEEecCC
Q psy7562 411 VSINVLDSNDHPPEFATSLVKGRVFETS 438 (582)
Q Consensus 411 v~I~V~DvNDn~P~f~~~~~~~~v~E~~ 438 (582)
|.|.|.-.|| +|+.... -...+.|+.
T Consensus 73 vtItI~GtND-apvi~~~-~~g~v~ED~ 98 (99)
T TIGR01965 73 VTITITGAND-AAVIGGA-DTGSVTEDS 98 (99)
T ss_pred EEEEEEccCC-CCEEecc-cceeEecCC
Confidence 9999999999 8887753 346676653
No 20
>TIGR01965 VCBS_repeat VCBS repeat. This domain of about 100 residues is found multiple (up to 35) copies in long proteins from several species of Vibrio, Colwellia, Bradyrhizobium, and Shewanella (hence the name VCBS) and in smaller copy numbers in proteins from several other bacteria. The large protein size and repeat copy numbers, species distribution, and suggested activities of several member proteins suggests a role for this domain in adhesion.
Probab=94.87 E-value=0.2 Score=41.16 Aligned_cols=34 Identities=26% Similarity=0.375 Sum_probs=23.2
Q ss_pred EEEEEEcCCC-CeEEEEEEe-CCCCCCeEEeCCcceE
Q psy7562 124 ALIQASDPDG-DTLWYRIVD-GDPNSTFSIGNDKGNV 158 (582)
Q Consensus 124 ~~v~A~D~D~-~~~~y~i~~-~~~~~~F~I~~~tG~i 158 (582)
|++.++|+|. ..+++++.. ....+.|.|++ +|..
T Consensus 2 G~Lt~sD~D~gd~~~~s~~~~~g~yGtlti~~-~G~w 37 (99)
T TIGR01965 2 GQLTISDADAGQAHFIAQTDAAGQYGTFSIDA-DGQW 37 (99)
T ss_pred CceEEeCCCCCCceEEecccccCCcEEEEECC-CCcE
Confidence 4688999997 567777742 22456788887 6743
No 21
>PF13750 Big_3_3: Bacterial Ig-like domain (group 3)
Probab=94.67 E-value=3.2 Score=37.52 Aligned_cols=127 Identities=19% Similarity=0.260 Sum_probs=70.9
Q ss_pred ccEEEEEE-EEEeCCCCCceeEEEEEEEEEecCCCCCeEecceeeEEEecCCCC-CcEEEEEEEEECCCCCCcEEEEEEE
Q psy7562 388 LSHHLLTI-MARDQGTPAKRNFARVSINVLDSNDHPPEFATSLVKGRVFETSAV-GTNIVQVVATDRDHGENGVVTYSIA 465 (582)
Q Consensus 388 ~~~~~l~v-~a~D~~~~~~~~~~~v~I~V~DvNDn~P~f~~~~~~~~v~E~~~~-gt~v~~v~a~D~D~~~n~~v~y~l~ 465 (582)
...|.+.+ .|.|..+...+.....++.| ...+|.+.- .....+..+... |..=..+.++|.-.+. .--..+|.
T Consensus 14 dG~Y~l~~~~a~D~agN~~~~~~~~~~~i---D~T~Ptisi-~~~~~~~~g~~v~~~~~i~i~~tD~~~~~-~i~sv~l~ 88 (158)
T PF13750_consen 14 DGSYTLTVVTATDAAGNTSTSTVSETFTI---DNTPPTISI-SDGASVANGSTVYGLVNISINVTDNSDDS-KITSVSLT 88 (158)
T ss_pred CccEEEEEEEEEecCCCEEEEEEeeEEEE---cCCCCEEEE-ecCCccCCCccccceeeeEEEEEeCCCCc-eEEEEEEE
Confidence 46899999 79998764333332223333 344787765 112223233222 2233557788776543 33467776
Q ss_pred eCCCCCceEEe---CCCeEEEEccc--C-CcCCCceEEEEEEEEECCCCCceeeEEEEEEEE
Q psy7562 466 SGNIGNVFTMD---PILGTIQTARP--L-DLSLMSEYMLLVKATDQGAPPLAATVPVQIIVV 521 (582)
Q Consensus 466 ~~~~~~~F~Id---~~tG~i~~~~~--l-d~e~~~~~~l~V~a~D~~~~~~~~~~~v~I~V~ 521 (582)
.|.....-.+. ...|...+.-+ + ..|....|+|+|.|.|..+ -.++..+.+...
T Consensus 89 Gg~~~d~v~ls~~~~~~~~~~~~yp~~fpsle~~~~YtLtV~a~D~aG--N~~~~si~F~y~ 148 (158)
T PF13750_consen 89 GGPASDSVSLSWTNKGNGVYTLEYPRIFPSLEADDSYTLTVSATDKAG--NQSTKSISFSYM 148 (158)
T ss_pred CCcccceEEEeeEeccCceEEeecccccCCcCCCCeEEEEEEEEecCC--CEEEEEEEEEEe
Confidence 55433333331 12344433211 1 2377899999999999877 466667766665
No 22
>TIGR00845 caca sodium/calcium exchanger 1. This model is specific for the eukaryotic sodium ion/calcium ion exchangers of the Caca family
Probab=88.88 E-value=14 Score=42.70 Aligned_cols=143 Identities=16% Similarity=0.152 Sum_probs=78.6
Q ss_pred cCCCCCeEecceeeEEEecCCCCCcEEEEEEEEECCCCCCcEEEEEEEeCCCCC--ceEEeCCCeEEEEc----------
Q psy7562 418 SNDHPPEFATSLVKGRVFETSAVGTNIVQVVATDRDHGENGVVTYSIASGNIGN--VFTMDPILGTIQTA---------- 485 (582)
Q Consensus 418 vNDn~P~f~~~~~~~~v~E~~~~gt~v~~v~a~D~D~~~n~~v~y~l~~~~~~~--~F~Id~~tG~i~~~---------- 485 (582)
.||..+.|.-......|.|+. |+.-.+|.-...|.+....+.|+..+|.... -|. +.+|.|.-.
T Consensus 395 ~dd~~s~i~Fe~~~Y~V~En~--GtV~VtV~R~GGdl~~tVsVdY~T~DGTA~AG~DY~--~~sGTLtF~PGEt~KtItV 470 (928)
T TIGR00845 395 ENDPVSKIFFEPGHYTCLENC--GTVALTVVRRGGDLTNTVYVDYRTEDGTANAGSDYE--FTEGTLVFKPGETQKEFRI 470 (928)
T ss_pred ccCCcceEEecCCeEEEeecC--cEEEEEEEEccCCCCceEEEEEEccCCccCCCCCcc--ccCceEEECCCceEEEEEE
Confidence 456566665555567899974 6766666655445555567899988764322 232 234544332
Q ss_pred ---ccCCcCCCceEEEEEEEEECC-------------CCCceeeEEEEEEEEecCCCCCeeecCceEEEEecCCCCCeEE
Q psy7562 486 ---RPLDLSLMSEYMLLVKATDQG-------------APPLAATVPVQIIVVMADNDPPRFAKAELAAELFENEPPGTVV 549 (582)
Q Consensus 486 ---~~ld~e~~~~~~l~V~a~D~~-------------~~~~~~~~~v~I~V~d~nd~~P~f~~~~~~~~v~E~~~~g~~v 549 (582)
..=-+|....|.+.+.--..+ ...+....+.+|+|.| ||++|.|....-...|.|+. |+.-
T Consensus 471 ~IIDDdi~E~DE~F~V~LSNp~~g~~~G~~~~~~~~~~A~Lg~ps~ATVTIlD-DD~aGIfsFe~~~~sV~Es~--G~vt 547 (928)
T TIGR00845 471 GIIDDDIFEEDEHFYVRLSNLRVGSEDGILEANHVSAVAQLASPNTATVTILD-DDHAGIFTFEEDVFHVSESI--GIME 547 (928)
T ss_pred EEccCCCCCCCceEEEEEeCCCCCCcccccccccccccceecCCceEEEEEec-CcccCcccccCceEEEEcCC--CEEE
Confidence 111134445555554332111 0112233456677776 67889887666667888873 5554
Q ss_pred EEEEeeeC----CceEEEEEeC
Q psy7562 550 RHIEARST----SSLVFEIIRG 567 (582)
Q Consensus 550 ~~v~a~D~----~~i~y~i~~g 567 (582)
.+|.-+.+ -.+.|...+|
T Consensus 548 vtV~RtsGa~G~VtV~Y~T~dG 569 (928)
T TIGR00845 548 VKVLRTSGARGTVIVPYRTVEG 569 (928)
T ss_pred EEEEEcCCCCeeEEEEEEeecC
Confidence 44433321 2356766654
No 23
>TIGR00845 caca sodium/calcium exchanger 1. This model is specific for the eukaryotic sodium ion/calcium ion exchangers of the Caca family
Probab=88.52 E-value=54 Score=38.15 Aligned_cols=149 Identities=19% Similarity=0.174 Sum_probs=76.4
Q ss_pred CCCCCCeeccCeeEEEEECCCCCCcEEEEEEEEeCCCC--CeEEEEEEeCCCCCCccCEEEECcccEEEEC---------
Q psy7562 312 VNEHRPILTQSLYETNISESVPVGSEILKVAASDSDQE--SKVLYSIHHAQSPLSAALFRIDFHSGALSVA--------- 380 (582)
Q Consensus 312 vNd~~P~f~~~~~~~~v~E~~~~gt~v~~v~a~D~D~~--~~v~y~i~~~~~~~~~~~F~i~~~tG~i~~~--------- 380 (582)
.||..+.|.-.+-...|.|+. |+.-..|.-...|.+ -.+.|+..++.... ...|. ..+|.|.-.
T Consensus 395 ~dd~~s~i~Fe~~~Y~V~En~--GtV~VtV~R~GGdl~~tVsVdY~T~DGTA~A-G~DY~--~~sGTLtF~PGEt~KtIt 469 (928)
T TIGR00845 395 ENDPVSKIFFEPGHYTCLENC--GTVALTVVRRGGDLTNTVYVDYRTEDGTANA-GSDYE--FTEGTLVFKPGETQKEFR 469 (928)
T ss_pred ccCCcceEEecCCeEEEeecC--cEEEEEEEEccCCCCceEEEEEEccCCccCC-CCCcc--ccCceEEECCCceEEEEE
Confidence 445444443333345589996 777777766554533 56778877664321 12222 233444321
Q ss_pred ----ccCCCCCccEEEEEEEEEeCC-------------CCCceeEEEEEEEEEecCCCCCeEecceeeEEEecCCCCCcE
Q psy7562 381 ----QPLDRESLSHHLLTIMARDQG-------------TPAKRNFARVSINVLDSNDHPPEFATSLVKGRVFETSAVGTN 443 (582)
Q Consensus 381 ----~~lD~E~~~~~~l~v~a~D~~-------------~~~~~~~~~v~I~V~DvNDn~P~f~~~~~~~~v~E~~~~gt~ 443 (582)
..=-+|..+.|.+.+.--..+ ..........+|+|.| ||++|.|.-..-...|.|+. |+.
T Consensus 470 V~IIDDdi~E~DE~F~V~LSNp~~g~~~G~~~~~~~~~~A~Lg~ps~ATVTIlD-DD~aGIfsFe~~~~sV~Es~--G~v 546 (928)
T TIGR00845 470 IGIIDDDIFEEDEHFYVRLSNLRVGSEDGILEANHVSAVAQLASPNTATVTILD-DDHAGIFTFEEDVFHVSESI--GIM 546 (928)
T ss_pred EEEccCCCCCCCceEEEEEeCCCCCCcccccccccccccceecCCceEEEEEec-CcccCcccccCceEEEEcCC--CEE
Confidence 111244455666555432211 0112223456677777 78899877655567888974 554
Q ss_pred EEEEEEEECCCCCCcEEEEEEEeCCC
Q psy7562 444 IVQVVATDRDHGENGVVTYSIASGNI 469 (582)
Q Consensus 444 v~~v~a~D~D~~~n~~v~y~l~~~~~ 469 (582)
-.+|.-+ .+....-.+.|+..+|..
T Consensus 547 tvtV~Rt-sGa~G~VtV~Y~T~dGTA 571 (928)
T TIGR00845 547 EVKVLRT-SGARGTVIVPYRTVEGTA 571 (928)
T ss_pred EEEEEEc-CCCCeeEEEEEEeecCcc
Confidence 4443222 111112346787776643
No 24
>PF05345 He_PIG: Putative Ig domain; InterPro: IPR008009 This alignment represents the conserved core region of a ~90 residue repeat found in several haemagglutinins and other cell surface proteins. Sequence similarities to Hyalin (IPR003410 from INTERPRO) and the PKD domain (IPR000601 from INTERPRO) suggest an Ig-like fold so this family may be similar in function to the (IPR003791 from INTERPRO) and (IPR003790 from INTERPRO) protein families.
Probab=85.40 E-value=3.4 Score=29.20 Aligned_cols=39 Identities=28% Similarity=0.406 Sum_probs=29.4
Q ss_pred CCCCCceEEeCCCeEEEEcccCCcCCCceEEEEEEEEECC
Q psy7562 467 GNIGNVFTMDPILGTIQTARPLDLSLMSEYMLLVKATDQG 506 (582)
Q Consensus 467 ~~~~~~F~Id~~tG~i~~~~~ld~e~~~~~~l~V~a~D~~ 506 (582)
+....+..||+.+|.|+-...-. .....|.++|.|+|..
T Consensus 10 ~~LP~gLs~d~~tG~isGtp~~~-~~~G~y~~~vtatd~~ 48 (49)
T PF05345_consen 10 GGLPSGLSLDPSTGTISGTPTSS-VQPGTYTFTVTATDGS 48 (49)
T ss_pred CCCCCcEEEeCCCCEEEeecCCC-ccccEEEEEEEEEcCC
Confidence 34567999999999998763321 1336899999999975
No 25
>PF05345 He_PIG: Putative Ig domain; InterPro: IPR008009 This alignment represents the conserved core region of a ~90 residue repeat found in several haemagglutinins and other cell surface proteins. Sequence similarities to Hyalin (IPR003410 from INTERPRO) and the PKD domain (IPR000601 from INTERPRO) suggest an Ig-like fold so this family may be similar in function to the (IPR003791 from INTERPRO) and (IPR003790 from INTERPRO) protein families.
Probab=84.89 E-value=5.5 Score=28.14 Aligned_cols=46 Identities=17% Similarity=0.188 Sum_probs=32.2
Q ss_pred EEEEeCCCCCCccCEEEECcccEEEECccCCCC-CccEEEEEEEEEeCC
Q psy7562 354 YSIHHAQSPLSAALFRIDFHSGALSVAQPLDRE-SLSHHLLTIMARDQG 401 (582)
Q Consensus 354 y~i~~~~~~~~~~~F~i~~~tG~i~~~~~lD~E-~~~~~~l~v~a~D~~ 401 (582)
|++........+.+..||+.+|.|.-.- +.. +...|.+.|.|+|+.
T Consensus 2 ys~~~~~~~~LP~gLs~d~~tG~isGtp--~~~~~~G~y~~~vtatd~~ 48 (49)
T PF05345_consen 2 YSLTTPTGGGLPSGLSLDPSTGTISGTP--TSSVQPGTYTFTVTATDGS 48 (49)
T ss_pred EEEEcCCCCCCCCcEEEeCCCCEEEeec--CCCccccEEEEEEEEEcCC
Confidence 5553222233568999999999998663 233 235899999999975
No 26
>PF07495 Y_Y_Y: Y_Y_Y domain; InterPro: IPR011123 This region is mostly found at the end of the beta propellers (IPR011110 from INTERPRO) in a family of two component regulators. However they are also found tandemly repeated in Q891H4 from SWISSPROT without other signal conduction domains being present. It is named after the conserved tyrosines found in the alignment. The exact function is not known.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=82.40 E-value=14 Score=27.39 Aligned_cols=62 Identities=18% Similarity=0.160 Sum_probs=35.8
Q ss_pred CCCcEEEEEEEeCCCCCceEEeCCCeEEEEcccCCcCCCceEEEEEEEEECCCCCceeeEEEEEEEE
Q psy7562 455 GENGVVTYSIASGNIGNVFTMDPILGTIQTARPLDLSLMSEYMLLVKATDQGAPPLAATVPVQIIVV 521 (582)
Q Consensus 455 ~~n~~v~y~l~~~~~~~~F~Id~~tG~i~~~~~ld~e~~~~~~l~V~a~D~~~~~~~~~~~v~I~V~ 521 (582)
+.+-..+|+|.+ -...+..+...+-.+.... ...+.|+|.|+|.|..+.......++.|+|.
T Consensus 5 ~~~~~Y~Y~l~g-~d~~W~~~~~~~~~~~~~~----L~~G~Y~l~V~a~~~~~~~~~~~~~l~i~I~ 66 (66)
T PF07495_consen 5 PENIRYRYRLEG-FDDEWITLGSYSNSISYTN----LPPGKYTLEVRAKDNNGKWSSDEKSLTITIL 66 (66)
T ss_dssp CTTEEEEEEEET-TESSEEEESSTS-EEEEES------SEEEEEEEEEEETTS-B-SS-EEEEEEEE
T ss_pred CCceEEEEEEEC-CCCeEEECCCCcEEEEEEe----CCCEEEEEEEEEECCCCCcCcccEEEEEEEC
Confidence 345567777764 4455666655332443322 3579999999999987653333366776663
No 27
>KOG3597|consensus
Probab=76.73 E-value=47 Score=35.29 Aligned_cols=153 Identities=18% Similarity=0.120 Sum_probs=85.2
Q ss_pred EEEEEEEEEeCCCCCCeeccCeeEEEEECCCCCCcEEEEEEEEeCC-CCCeEEEEEEeCCCCC-CccCEEEECc------
Q psy7562 302 RIWLHVTVVDVNEHRPILTQSLYETNISESVPVGSEILKVAASDSD-QESKVLYSIHHAQSPL-SAALFRIDFH------ 373 (582)
Q Consensus 302 ~~~v~I~V~DvNd~~P~f~~~~~~~~v~E~~~~gt~v~~v~a~D~D-~~~~v~y~i~~~~~~~-~~~~F~i~~~------ 373 (582)
+....|.|..+||.+..+-...+.+-+.|+...-...-.+.+.|+| ....+.|++.+..... ....|.....
T Consensus 25 ~~~~~i~v~pvndpp~~~~~~~~~l~~~~~~~k~l~~~~l~~~d~d~~~~~l~f~v~~t~~~~~~~~~~~~~g~~~~~Fs 104 (442)
T KOG3597|consen 25 TDVLRIHVNPVNDPPSLIFPSGSLLVILEGGQKVLDPELLTAADPDSAPLPLEFQVLGTSSVPLPVLKFDVPGAPATEFS 104 (442)
T ss_pred EeeecccccccCCCcceeecccceEEeecCCceeccceEeeccCCCCCccceEEEEccCCCCCCccceeeccCCcccceE
Confidence 5568899999999666665555667888887655555678888888 4577888888653321 1112333222
Q ss_pred -----ccEEEECccCCCC--CccEEEEEEEEEeCCCCCceeEEEEEEEEEecCCCCCeEecc-eeeEEEecCCCCCcEE-
Q psy7562 374 -----SGALSVAQPLDRE--SLSHHLLTIMARDQGTPAKRNFARVSINVLDSNDHPPEFATS-LVKGRVFETSAVGTNI- 444 (582)
Q Consensus 374 -----tG~i~~~~~lD~E--~~~~~~l~v~a~D~~~~~~~~~~~v~I~V~DvNDn~P~f~~~-~~~~~v~E~~~~gt~v- 444 (582)
.|.+.. ++. +.....++..++|+-. .+. .+..... ...|.+... .....+.-. ....|
T Consensus 105 ~~~v~~g~~~y----vh~g~el~~~~~~~~~SDg~~---~S~-~~i~~~~---~~~~~~~~~~~~gL~v~~g--S~~~IT 171 (442)
T KOG3597|consen 105 YEEVEDGSLSY----VHSGTELRESELQLRVSDGLL---VSE-RAILKVE---ATGPAPHLARNTGLKVLQG--STAPIT 171 (442)
T ss_pred ehHhhcCceeE----EecCcccccceEEEEeecceE---eee-eEEeccc---CCCcceeeecccceEEccC--cccccc
Confidence 233322 222 2456778888888752 222 1111111 123333221 112222222 22223
Q ss_pred -EEEEEEECCCCCCcEEEEEEEeC
Q psy7562 445 -VQVVATDRDHGENGVVTYSIASG 467 (582)
Q Consensus 445 -~~v~a~D~D~~~n~~v~y~l~~~ 467 (582)
..+.+.|.|++..-.+.|.|..+
T Consensus 172 ~~~L~ved~d~~~d~~v~~~i~~~ 195 (442)
T KOG3597|consen 172 PSNLSVEDNDSSPDDEVRYDITPP 195 (442)
T ss_pred HhHceeecCCCCCCcEEEEEecCC
Confidence 23788888876677899999864
No 28
>TIGR03660 T1SS_rpt_143 T1SS-143 repeat domain. This model represents a domain of about 143 amino acids that may occur singly or in up to 23 tandem repeats in very large proteins in the genus Vibrio, and in related species such as Legionella pneumophila, Photobacterium profundum, Rhodopseudomonas palustris, Shewanella pealeana, and Aeromonas hydrophila. Proteins with these domains represent a subset of a broader set of proteins with a particular signal for type 1 secretion, consisting of several glycine-rich repeats modeled by pfam00353, followed by a C-terminal domain modeled by TIGR03661. Proteins with this domain tend to share several properties with the RtxA (Repeats in Toxin) protein of Vibrio cholerae, including a large size often containing tandemly repeated domains and a C-terminal signal for type 1 secretion.
Probab=70.23 E-value=23 Score=31.18 Aligned_cols=56 Identities=20% Similarity=0.240 Sum_probs=35.6
Q ss_pred EEECccCCCC---CccEEEEEEEEEeCCCCCceeEEEEEEEEEecCCCCCeEecceeeEEEecCC
Q psy7562 377 LSVAQPLDRE---SLSHHLLTIMARDQGTPAKRNFARVSINVLDSNDHPPEFATSLVKGRVFETS 438 (582)
Q Consensus 377 i~~~~~lD~E---~~~~~~l~v~a~D~~~~~~~~~~~v~I~V~DvNDn~P~f~~~~~~~~v~E~~ 438 (582)
+.+.++||.. ..-...|.|.|+|..+- .+...+.|.|.| | .|+..... ...|.|+.
T Consensus 70 ftL~~~lDH~~g~d~l~l~~~v~a~D~DGD--~s~~~l~VtI~D--D-~P~~~~~~-~~~V~E~~ 128 (137)
T TIGR03660 70 FTLEGPLDHAAGSDELTLNFPIIATDFDGD--TSSITLPVTIVD--D-VPTITDVD-ALTVDEDD 128 (137)
T ss_pred EEEcccccCCCCCceEEEeeeEEEEeCCCC--ccccEEEEEEEC--C-CCeecccc-ceEEeccc
Confidence 3444555543 23357788999998763 334578888877 6 68876644 36787754
No 29
>TIGR03660 T1SS_rpt_143 T1SS-143 repeat domain. This model represents a domain of about 143 amino acids that may occur singly or in up to 23 tandem repeats in very large proteins in the genus Vibrio, and in related species such as Legionella pneumophila, Photobacterium profundum, Rhodopseudomonas palustris, Shewanella pealeana, and Aeromonas hydrophila. Proteins with these domains represent a subset of a broader set of proteins with a particular signal for type 1 secretion, consisting of several glycine-rich repeats modeled by pfam00353, followed by a C-terminal domain modeled by TIGR03661. Proteins with this domain tend to share several properties with the RtxA (Repeats in Toxin) protein of Vibrio cholerae, including a large size often containing tandemly repeated domains and a C-terminal signal for type 1 secretion.
Probab=67.33 E-value=16 Score=32.11 Aligned_cols=54 Identities=28% Similarity=0.428 Sum_probs=37.1
Q ss_pred CCcceeccc---CCEEEEEEEEEECCCCCceeEEEEEEEEEecCCCCCCCCcccCCCceEEEeCCC
Q psy7562 1 MENIIVFVA---GQEFDLLVRATDHGSPNQSSTARVSIQVVRLPDESKNPPTIKSPDQRVEVTEND 63 (582)
Q Consensus 1 ~~~~ldyE~---~~~~~l~v~a~D~g~p~~~~~~~v~I~V~d~ND~~~~~P~f~~~~~~~~v~E~~ 63 (582)
|.++||+.. .-...|.|.|+|..+- .++..+.|+|.| | .|...... ...|.|+.
T Consensus 72 L~~~lDH~~g~d~l~l~~~v~a~D~DGD--~s~~~l~VtI~D--D----~P~~~~~~-~~~V~E~~ 128 (137)
T TIGR03660 72 LEGPLDHAAGSDELTLNFPIIATDFDGD--TSSITLPVTIVD--D----VPTITDVD-ALTVDEDD 128 (137)
T ss_pred EcccccCCCCCceEEEeeeEEEEeCCCC--ccccEEEEEEEC--C----CCeecccc-ceEEeccc
Confidence 457788854 3468899999998763 234577788876 5 48765444 47888853
No 30
>PF13750 Big_3_3: Bacterial Ig-like domain (group 3)
Probab=60.47 E-value=1.3e+02 Score=27.13 Aligned_cols=30 Identities=20% Similarity=0.231 Sum_probs=22.5
Q ss_pred CCCccEEEEEEEEEeCCCCCceeEEEEEEEEE
Q psy7562 385 RESLSHHLLTIMARDQGTPAKRNFARVSINVL 416 (582)
Q Consensus 385 ~E~~~~~~l~v~a~D~~~~~~~~~~~v~I~V~ 416 (582)
.|....|.|+|.|.|..+ ..++..+.....
T Consensus 119 le~~~~YtLtV~a~D~aG--N~~~~si~F~y~ 148 (158)
T PF13750_consen 119 LEADDSYTLTVSATDKAG--NQSTKSISFSYM 148 (158)
T ss_pred cCCCCeEEEEEEEEecCC--CEEEEEEEEEEe
Confidence 377889999999999876 455556655554
No 31
>PF03160 Calx-beta: Calx-beta domain; InterPro: IPR003644 The calx-beta motif is present as a tandem repeat in the cytoplasmic domains of Calx Na-Ca exchangers, which are used to expel calcium from cells. This motif overlaps domains used for calcium binding and regulation. The calx-beta motif is also present in the cytoplasmic tail of mammalian integrin-beta4, which mediates the bi-directional transfer of signals across the plasma membrane, as well as in some cyanobacterial proteins. This motif contains a series of beta-strands and turns that form a self-contained beta-sheet [, ].; GO: 0007154 cell communication, 0016021 integral to membrane; PDB: 3H6A_B 3FSO_A 3FQ4_B 2DPK_A 2QVM_A 3GIN_B 2QVK_A 2FWU_A 2FWS_A 3E9U_A ....
Probab=59.53 E-value=94 Score=25.19 Aligned_cols=53 Identities=28% Similarity=0.355 Sum_probs=31.0
Q ss_pred EEEEEecCCCCCeEecceeeEEEecCCCCCcEEEEEEEEECCCCCCcEEEEEEEeCC
Q psy7562 412 SINVLDSNDHPPEFATSLVKGRVFETSAVGTNIVQVVATDRDHGENGVVTYSIASGN 468 (582)
Q Consensus 412 ~I~V~DvNDn~P~f~~~~~~~~v~E~~~~gt~v~~v~a~D~D~~~n~~v~y~l~~~~ 468 (582)
+|.|.| ||.+ .+.-......+.|+. |..-..|.....+....-.+.|...+|.
T Consensus 2 tvtI~d-~d~~-~v~f~~~~~~v~E~~--~~~~v~V~~~~~~~~~~v~v~~~~~~gt 54 (100)
T PF03160_consen 2 TVTILD-DDDP-TVSFSSPSYTVSEGD--GTVTVTVTRSGGSLDGPVTVNYSTVDGT 54 (100)
T ss_dssp EEEEE--TTSE-EEEESSSEEEEETTS--SEEEEEEEEESS-TSSEEEEEEEEEESS
T ss_pred EEEEEC-CCCC-EEEEeCCEEEEEeCC--CEEEEEEEEcccCCCcceEEEEEEeCCc
Confidence 467788 7744 777666677888875 4444455545443333445677766653
No 32
>PF07495 Y_Y_Y: Y_Y_Y domain; InterPro: IPR011123 This region is mostly found at the end of the beta propellers (IPR011110 from INTERPRO) in a family of two component regulators. However they are also found tandemly repeated in Q891H4 from SWISSPROT without other signal conduction domains being present. It is named after the conserved tyrosines found in the alignment. The exact function is not known.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=42.03 E-value=57 Score=24.04 Aligned_cols=28 Identities=25% Similarity=0.366 Sum_probs=18.1
Q ss_pred CCEEEEEEEEEECCCCCceeEEEEEEEE
Q psy7562 10 GQEFDLLVRATDHGSPNQSSTARVSIQV 37 (582)
Q Consensus 10 ~~~~~l~v~a~D~g~p~~~~~~~v~I~V 37 (582)
...|.|.|+|.|..+..-.....+.|.|
T Consensus 38 ~G~Y~l~V~a~~~~~~~~~~~~~l~i~I 65 (66)
T PF07495_consen 38 PGKYTLEVRAKDNNGKWSSDEKSLTITI 65 (66)
T ss_dssp SEEEEEEEEEEETTS-B-SS-EEEEEEE
T ss_pred CEEEEEEEEEECCCCCcCcccEEEEEEE
Confidence 4689999999998764333335666655
No 33
>PF03160 Calx-beta: Calx-beta domain; InterPro: IPR003644 The calx-beta motif is present as a tandem repeat in the cytoplasmic domains of Calx Na-Ca exchangers, which are used to expel calcium from cells. This motif overlaps domains used for calcium binding and regulation. The calx-beta motif is also present in the cytoplasmic tail of mammalian integrin-beta4, which mediates the bi-directional transfer of signals across the plasma membrane, as well as in some cyanobacterial proteins. This motif contains a series of beta-strands and turns that form a self-contained beta-sheet [, ].; GO: 0007154 cell communication, 0016021 integral to membrane; PDB: 3H6A_B 3FSO_A 3FQ4_B 2DPK_A 2QVM_A 3GIN_B 2QVK_A 2FWU_A 2FWS_A 3E9U_A ....
Probab=41.08 E-value=1.9e+02 Score=23.31 Aligned_cols=50 Identities=26% Similarity=0.335 Sum_probs=27.8
Q ss_pred EEEEEeCCCCCCeeccCeeEEEEECCCCCCcEEEEEEEEeCC--CCCeEEEEEEeC
Q psy7562 306 HVTVVDVNEHRPILTQSLYETNISESVPVGSEILKVAASDSD--QESKVLYSIHHA 359 (582)
Q Consensus 306 ~I~V~DvNd~~P~f~~~~~~~~v~E~~~~gt~v~~v~a~D~D--~~~~v~y~i~~~ 359 (582)
+|+|.| || .|.+.-..-...+.|+. |..-..|....++ ..-.+.|...++
T Consensus 2 tvtI~d-~d-~~~v~f~~~~~~v~E~~--~~~~v~V~~~~~~~~~~v~v~~~~~~g 53 (100)
T PF03160_consen 2 TVTILD-DD-DPTVSFSSPSYTVSEGD--GTVTVTVTRSGGSLDGPVTVNYSTVDG 53 (100)
T ss_dssp EEEEE--TT-SEEEEESSSEEEEETTS--SEEEEEEEEESS-TSSEEEEEEEEEES
T ss_pred EEEEEC-CC-CCEEEEeCCEEEEEeCC--CEEEEEEEEcccCCCcceEEEEEEeCC
Confidence 577788 66 44765554456788986 4455555555443 234455655554
No 34
>smart00089 PKD Repeats in polycystic kidney disease 1 (PKD1) and other proteins. Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.
Probab=39.57 E-value=66 Score=24.64 Aligned_cols=30 Identities=17% Similarity=0.279 Sum_probs=22.3
Q ss_pred eecccCCEEEEEEEEEECCCCCceeEEEEEEEE
Q psy7562 5 IVFVAGQEFDLLVRATDHGSPNQSSTARVSIQV 37 (582)
Q Consensus 5 ldyE~~~~~~l~v~a~D~g~p~~~~~~~v~I~V 37 (582)
.-|+....|.++++|+|..+ +.+..+.|.|
T Consensus 49 ~~y~~~G~y~v~l~v~n~~g---~~~~~~~i~v 78 (79)
T smart00089 49 HTYTKPGTYTVTLTVTNAVG---SASATVTVVV 78 (79)
T ss_pred EEeCCCcEEEEEEEEEcCCC---cEEEEEEEEE
Confidence 45777889999999999876 5555565554
No 35
>cd00146 PKD polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.
Probab=37.80 E-value=67 Score=24.77 Aligned_cols=30 Identities=13% Similarity=0.199 Sum_probs=20.5
Q ss_pred eecccCCEEEEEEEEEECCCCCceeEEEEEEE
Q psy7562 5 IVFVAGQEFDLLVRATDHGSPNQSSTARVSIQ 36 (582)
Q Consensus 5 ldyE~~~~~~l~v~a~D~g~p~~~~~~~v~I~ 36 (582)
..|.....|.++++|+|..+ .+.+..+.|.
T Consensus 51 ~~y~~~G~y~v~l~v~d~~g--~~~~~~~~V~ 80 (81)
T cd00146 51 HTYTKPGTYTVTLTVTNAVG--SSSTKTTTVV 80 (81)
T ss_pred EEcCCCcEEEEEEEEEeCCC--CEEEEEEEEE
Confidence 45777888999999999864 3444344443
No 36
>cd02848 Chitinase_N_term Chitinase N-terminus domain. Chitinases hydrolyze the abundant natural biopolymer chitin, producing smaller chito-oligosaccharides. Chitin consists of multiple N-acetyl-D-glucosamine (NAG) residues connected via beta-1,4-glycosidic linkages and is an important structural element of fungal cell wall and arthropod exoskeletons. On the basis of the mode of chitin hydrolysis, chitinases are classified as random, endo-, and exo-chitinases and based on sequence criteria, chitinases belong to families 18 and 19 of glycosyl hydrolases. The N-terminus of chitinase may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitob
Probab=36.78 E-value=69 Score=26.69 Aligned_cols=34 Identities=12% Similarity=0.190 Sum_probs=23.9
Q ss_pred cCCcCCCceEEEEEEEEECCCCCceeeEEEEEEEEe
Q psy7562 487 PLDLSLMSEYMLLVKATDQGAPPLAATVPVQIIVVM 522 (582)
Q Consensus 487 ~ld~e~~~~~~l~V~a~D~~~~~~~~~~~v~I~V~d 522 (582)
.+++.+.+.|.++|+++|..+- +..+.+.|.|.|
T Consensus 73 t~~v~kgG~y~m~V~lCn~dGC--S~S~~~~I~VAD 106 (106)
T cd02848 73 TFKVGKGGRYQMQVALCNGDGC--STSAAKEIVVAD 106 (106)
T ss_pred EEEeCCCCeEEEEEEEECCCCc--cCcCCEEEEecC
Confidence 3556778999999999998773 444455555543
No 37
>PF12245 Big_3_2: Bacterial Ig-like domain (group 3); InterPro: IPR022038 This family of proteins is found in bacteria. They have two conserved sequence motifs: AGN and GMT.
Probab=35.42 E-value=1.5e+02 Score=21.80 Aligned_cols=30 Identities=33% Similarity=0.419 Sum_probs=20.8
Q ss_pred CCceEEEEEEEEECCCCCceeeEEEEEEEEec
Q psy7562 492 LMSEYMLLVKATDQGAPPLAATVPVQIIVVMA 523 (582)
Q Consensus 492 ~~~~~~l~V~a~D~~~~~~~~~~~v~I~V~d~ 523 (582)
..+.|.|.++|+|..+ ..+.....+.+.|.
T Consensus 21 ~dg~yt~~v~a~D~AG--N~~~~~~~~~i~d~ 50 (60)
T PF12245_consen 21 ADGEYTLTVTATDKAG--NTSSSTTQIVIVDN 50 (60)
T ss_pred CCccEEEEEEEEECCC--CEEEeeeEEEEEcC
Confidence 3678999999999877 34445555555543
No 38
>PF12245 Big_3_2: Bacterial Ig-like domain (group 3); InterPro: IPR022038 This family of proteins is found in bacteria. They have two conserved sequence motifs: AGN and GMT.
Probab=35.37 E-value=1e+02 Score=22.59 Aligned_cols=28 Identities=32% Similarity=0.464 Sum_probs=19.4
Q ss_pred CCEEEEEEEEEECCCCCceeEEEEEEEEEe
Q psy7562 10 GQEFDLLVRATDHGSPNQSSTARVSIQVVR 39 (582)
Q Consensus 10 ~~~~~l~v~a~D~g~p~~~~~~~v~I~V~d 39 (582)
...|.|+++|+|..+. .++....+.+.|
T Consensus 22 dg~yt~~v~a~D~AGN--~~~~~~~~~i~d 49 (60)
T PF12245_consen 22 DGEYTLTVTATDKAGN--TSSSTTQIVIVD 49 (60)
T ss_pred CccEEEEEEEEECCCC--EEEeeeEEEEEc
Confidence 4689999999999884 344444554544
No 39
>cd00146 PKD polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.
Probab=34.16 E-value=85 Score=24.15 Aligned_cols=30 Identities=10% Similarity=0.158 Sum_probs=21.4
Q ss_pred CCcCCCceEEEEEEEEECCCCCceeeEEEEEE
Q psy7562 488 LDLSLMSEYMLLVKATDQGAPPLAATVPVQII 519 (582)
Q Consensus 488 ld~e~~~~~~l~V~a~D~~~~~~~~~~~v~I~ 519 (582)
..|...+.|.+++.|+|..+ .+...++.|.
T Consensus 51 ~~y~~~G~y~v~l~v~d~~g--~~~~~~~~V~ 80 (81)
T cd00146 51 HTYTKPGTYTVTLTVTNAVG--SSSTKTTTVV 80 (81)
T ss_pred EEcCCCcEEEEEEEEEeCCC--CEEEEEEEEE
Confidence 34678899999999999864 3444455554
No 40
>smart00089 PKD Repeats in polycystic kidney disease 1 (PKD1) and other proteins. Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.
Probab=30.48 E-value=1.1e+02 Score=23.28 Aligned_cols=31 Identities=16% Similarity=0.271 Sum_probs=23.3
Q ss_pred cCCcCCCceEEEEEEEEECCCCCceeeEEEEEEE
Q psy7562 487 PLDLSLMSEYMLLVKATDQGAPPLAATVPVQIIV 520 (582)
Q Consensus 487 ~ld~e~~~~~~l~V~a~D~~~~~~~~~~~v~I~V 520 (582)
..-|...+.|.+++.+.|..+ +.+.++.|.|
T Consensus 48 ~~~y~~~G~y~v~l~v~n~~g---~~~~~~~i~v 78 (79)
T smart00089 48 THTYTKPGTYTVTLTVTNAVG---SASATVTVVV 78 (79)
T ss_pred EEEeCCCcEEEEEEEEEcCCC---cEEEEEEEEE
Confidence 345678899999999999876 5566666655
No 41
>KOG3597|consensus
Probab=30.41 E-value=3.8e+02 Score=28.61 Aligned_cols=59 Identities=17% Similarity=0.088 Sum_probs=41.2
Q ss_pred eEEEEEEEEEecCCCCCeEecceeeEEEecCCCCCcEEEEEEEEECCCCCCcEEEEEEEe
Q psy7562 407 NFARVSINVLDSNDHPPEFATSLVKGRVFETSAVGTNIVQVVATDRDHGENGVVTYSIAS 466 (582)
Q Consensus 407 ~~~~v~I~V~DvNDn~P~f~~~~~~~~v~E~~~~gt~v~~v~a~D~D~~~n~~v~y~l~~ 466 (582)
.+....|.|.-+||.+..+....+.+.+.|+...-..--.+.+.|+|... ..+.|++..
T Consensus 24 ~~~~~~i~v~pvndpp~~~~~~~~~l~~~~~~~k~l~~~~l~~~d~d~~~-~~l~f~v~~ 82 (442)
T KOG3597|consen 24 QTDVLRIHVNPVNDPPSLIFPSGSLLVILEGGQKVLDPELLTAADPDSAP-LPLEFQVLG 82 (442)
T ss_pred EEeeecccccccCCCcceeecccceEEeecCCceeccceEeeccCCCCCc-cceEEEEcc
Confidence 34567899999999666666655667777876543344557888888753 347888875
No 42
>PF02494 HYR: HYR domain; InterPro: IPR003410 This domain is known as the HYR (Hyalin Repeat) domain, after the protein hyalin that is composed exclusively of this repeat. This domain probably corresponds to a new superfamily in the immunoglobulin fold. The function of this domain is uncertain it may be involved in cell adhesion. In the Sushi repeat-containing protein (SrpX), this domain is found between two sushi repeats.
Probab=27.38 E-value=1.1e+02 Score=23.72 Aligned_cols=24 Identities=25% Similarity=0.300 Sum_probs=18.7
Q ss_pred EEEEEEEEEECCCCCceeEEEEEEEE
Q psy7562 12 EFDLLVRATDHGSPNQSSTARVSIQV 37 (582)
Q Consensus 12 ~~~l~v~a~D~g~p~~~~~~~v~I~V 37 (582)
.|.++.+|+|..+ ..+++.+.|+|
T Consensus 58 ~t~V~ytA~D~~G--N~a~C~f~V~V 81 (81)
T PF02494_consen 58 TTTVTYTATDAAG--NSATCSFTVTV 81 (81)
T ss_pred eEEEEEEEEECCC--CEEEEEEEEEC
Confidence 6888999999766 56788887765
No 43
>PF09100 Qn_am_d_aIV: Quinohemoprotein amine dehydrogenase, alpha subunit domain IV; InterPro: IPR015184 This domain is predominantly found in the prokaryotic protein quinohemoprotein amine dehydrogenase, adopting an immunoglobulin-like beta-sandwich fold, with seven strands arranged into two beta sheets; the fold is possibly related to the immunoglobulin and/or fibronectin type III superfamilies. The precise function of this domain has not, as yet, been defined []. ; PDB: 1JMZ_A 1JMX_A 1PBY_A 1JJU_A.
Probab=26.83 E-value=93 Score=26.69 Aligned_cols=33 Identities=24% Similarity=0.232 Sum_probs=18.9
Q ss_pred CCEEEEEEEEEE-CCCCCceeEEEEEEEEEecCC
Q psy7562 10 GQEFDLLVRATD-HGSPNQSSTARVSIQVVRLPD 42 (582)
Q Consensus 10 ~~~~~l~v~a~D-~g~p~~~~~~~v~I~V~d~ND 42 (582)
+..=.|.|.|+= .++-+++++..+.|+|.++|+
T Consensus 97 NN~Gnl~VvAtv~d~~~~l~~e~~liVtVqr~~~ 130 (133)
T PF09100_consen 97 NNAGNLKVVATVKDGGKPLTGEAHLIVTVQRWNN 130 (133)
T ss_dssp BS-EEEEEEEEETTTT---EEEEEEEEE---S--
T ss_pred CCcccEEEEEEEccCCcccceeEeEEEEeecccC
Confidence 455678888884 334468999999999999986
No 44
>cd02848 Chitinase_N_term Chitinase N-terminus domain. Chitinases hydrolyze the abundant natural biopolymer chitin, producing smaller chito-oligosaccharides. Chitin consists of multiple N-acetyl-D-glucosamine (NAG) residues connected via beta-1,4-glycosidic linkages and is an important structural element of fungal cell wall and arthropod exoskeletons. On the basis of the mode of chitin hydrolysis, chitinases are classified as random, endo-, and exo-chitinases and based on sequence criteria, chitinases belong to families 18 and 19 of glycosyl hydrolases. The N-terminus of chitinase may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitob
Probab=25.66 E-value=94 Score=25.92 Aligned_cols=32 Identities=19% Similarity=0.185 Sum_probs=22.7
Q ss_pred eecccCCEEEEEEEEEECCCCCceeEEEEEEEEE
Q psy7562 5 IVFVAGQEFDLLVRATDHGSPNQSSTARVSIQVV 38 (582)
Q Consensus 5 ldyE~~~~~~l~v~a~D~g~p~~~~~~~v~I~V~ 38 (582)
++|.....|.++|++.|..+-+.| ..+.|.|.
T Consensus 74 ~~v~kgG~y~m~V~lCn~dGCS~S--~~~~I~VA 105 (106)
T cd02848 74 FKVGKGGRYQMQVALCNGDGCSTS--AAKEIVVA 105 (106)
T ss_pred EEeCCCCeEEEEEEEECCCCccCc--CCEEEEec
Confidence 567778889999999998874444 44555443
No 45
>KOG4221|consensus
Probab=24.74 E-value=1.3e+03 Score=28.07 Aligned_cols=177 Identities=12% Similarity=0.139 Sum_probs=92.3
Q ss_pred CCcEEEEEEEEeCC--CCCeEEEEEEeCCCCCCccCEEEECcccEEEECccCCCCCccEEEEEEEEEeCCCCCceeEEEE
Q psy7562 334 VGSEILKVAASDSD--QESKVLYSIHHAQSPLSAALFRIDFHSGALSVAQPLDRESLSHHLLTIMARDQGTPAKRNFARV 411 (582)
Q Consensus 334 ~gt~v~~v~a~D~D--~~~~v~y~i~~~~~~~~~~~F~i~~~tG~i~~~~~lD~E~~~~~~l~v~a~D~~~~~~~~~~~v 411 (582)
.++...++++.-+- .+....|++.............--...|.+.....|- -...|.|.+.|....+.. .+++.+
T Consensus 439 ~~srfi~~tw~~p~~~~g~i~~~~v~~~~~~~~rer~~~tss~g~~~tv~nl~--p~t~Y~~rv~A~n~~g~g-~sS~pL 515 (1381)
T KOG4221|consen 439 VSSRFIQLTWRPPAQISGNISTYTVFYKVEGDVRERLQNTSSPGIQVTVQNLS--PLTMYFFRVRAKNEAGSG-ESSAPL 515 (1381)
T ss_pred ccceeEEEeecCccccCCCcceEEEEEecCCchhhhheeccCCceEEEeeecc--cceeEEEEEeccCcccCC-ccCCce
Confidence 45566666665221 2444555554432211112222223456444445553 345899999998865532 333344
Q ss_pred EEEEEecCCCCCeEecceeeEEEecCCCCCcEEEEEEEEECCCCCCcE-EEEEEEe--CCCCCceEEeCCCeEEEEcccC
Q psy7562 412 SINVLDSNDHPPEFATSLVKGRVFETSAVGTNIVQVVATDRDHGENGV-VTYSIAS--GNIGNVFTMDPILGTIQTARPL 488 (582)
Q Consensus 412 ~I~V~DvNDn~P~f~~~~~~~~v~E~~~~gt~v~~v~a~D~D~~~n~~-v~y~l~~--~~~~~~F~Id~~tG~i~~~~~l 488 (582)
.| .---| +|.- ...|. .+..-..|.-. +-.-.|+. ..|++.. ++.+..+.++..+-+.++.
T Consensus 516 kV--~t~pE-gp~~-~~a~a--------ts~~ti~v~We-pP~~~n~~I~~yk~~ys~~~~~~~~~~~~n~~e~ti~--- 579 (1381)
T KOG4221|consen 516 KV--TTQPE-GPVQ-LQAYA--------TSPTTILVTWE-PPPFGNGPITGYKLFYSEDDTGKELRVENNATEYTIN--- 579 (1381)
T ss_pred EE--ecCCC-CCcc-ccccc--------cCcceEEEEec-CCCCCCCCceEEEEEEEcCCCCceEEEecCccEEEee---
Confidence 33 32222 2222 11111 01111111111 11112443 3566544 3456788888766666665
Q ss_pred CcCCCceEEEEEEEEECCCCCceeeEEEEEEEEecCCCCCe
Q psy7562 489 DLSLMSEYMLLVKATDQGAPPLAATVPVQIIVVMADNDPPR 529 (582)
Q Consensus 489 d~e~~~~~~l~V~a~D~~~~~~~~~~~v~I~V~d~nd~~P~ 529 (582)
+.+....|.++|.|....++..++...-.++..|+.+.||.
T Consensus 580 gL~k~TeY~~~vvA~N~~G~g~sS~~i~V~Tlsd~PsaPP~ 620 (1381)
T KOG4221|consen 580 GLEKYTEYSIRVVAYNSAGSGVSSADITVRTLSDVPSAPPQ 620 (1381)
T ss_pred cCCCccceEEEEEEecCCCCCCCCCceEEEeccCCCCCCCc
Confidence 33677889999999999887766655555677788888887
No 46
>PF08329 ChitinaseA_N: Chitinase A, N-terminal domain; InterPro: IPR013540 This domain is found in a number of bacterial chitinases and similar viral proteins. It is organised into a fibronectin III module domain-like fold, comprising only beta strands. Its function is not known, but it may be involved in interaction with the enzyme substrate, chitin [, ]. It is separated by a hinge region from the catalytic domain (IPR001223 from INTERPRO); this hinge region is probably mobile, allowing the N-terminal domain to have different relative positions in solution []. ; GO: 0004568 chitinase activity; PDB: 2WLY_A 1EDQ_A 2WM0_A 1X6N_A 1NH6_A 2WK2_A 1EHN_A 2WLZ_A 1EIB_A 1FFR_A ....
Probab=22.82 E-value=2.2e+02 Score=24.89 Aligned_cols=53 Identities=21% Similarity=0.335 Sum_probs=27.8
Q ss_pred cCCcCCCceEEEEEEEEECCCCCceeeEEEEEEEEecCCCCCeeecCceEEEEecCCCC
Q psy7562 487 PLDLSLMSEYMLLVKATDQGAPPLAATVPVQIIVVMADNDPPRFAKAELAAELFENEPP 545 (582)
Q Consensus 487 ~ld~e~~~~~~l~V~a~D~~~~~~~~~~~v~I~V~d~nd~~P~f~~~~~~~~v~E~~~~ 545 (582)
.+.....+.|+++|+++|..+ -+....+.|.|.|.+.... ......+.||-.+
T Consensus 76 ~~~~~~gG~y~~~VeLCN~~G--CS~S~~~~V~VaDTDGsHl----~pL~~~~~~nN~~ 128 (133)
T PF08329_consen 76 TFTVTKGGRYQMQVELCNADG--CSTSAPVEVVVADTDGSHL----APLPYNWDENNKP 128 (133)
T ss_dssp EEEE-S-EEEEEEEEEEETTE--EEE---EEEEEE-TTSTTS------------TTS--
T ss_pred EEEecCCCEEEEEEEEECCCC--cccCCCEEEEEeCCCcccc----ccccCcccccCCC
Confidence 455667899999999999877 4666688899998764321 2234566666543
No 47
>KOG4221|consensus
Probab=20.41 E-value=1.6e+03 Score=27.42 Aligned_cols=69 Identities=10% Similarity=0.181 Sum_probs=48.1
Q ss_pred EEEEEEeCCCCCCccCEEEECcccEEEECccCCCCCccEEEEEEEEEeCCCCCceeEEEEEEEEEecCCCCCe
Q psy7562 352 VLYSIHHAQSPLSAALFRIDFHSGALSVAQPLDRESLSHHLLTIMARDQGTPAKRNFARVSINVLDSNDHPPE 424 (582)
Q Consensus 352 v~y~i~~~~~~~~~~~F~i~~~tG~i~~~~~lD~E~~~~~~l~v~a~D~~~~~~~~~~~v~I~V~DvNDn~P~ 424 (582)
..|++.-..+ ....++.++.++-++++. +.|....|.+.|.|....++..++...-.++..|+-+-||.
T Consensus 552 ~~yk~~ys~~-~~~~~~~~~~n~~e~ti~---gL~k~TeY~~~vvA~N~~G~g~sS~~i~V~Tlsd~PsaPP~ 620 (1381)
T KOG4221|consen 552 TGYKLFYSED-DTGKELRVENNATEYTIN---GLEKYTEYSIRVVAYNSAGSGVSSADITVRTLSDVPSAPPQ 620 (1381)
T ss_pred eEEEEEEEcC-CCCceEEEecCccEEEee---cCCCccceEEEEEEecCCCCCCCCCceEEEeccCCCCCCCc
Confidence 3455554332 235788888776666666 35677899999999999887776655455677788887775
No 48
>PF14302 DUF4377: Domain of unknown function (DUF4377)
Probab=20.34 E-value=3.1e+02 Score=21.50 Aligned_cols=35 Identities=31% Similarity=0.355 Sum_probs=27.0
Q ss_pred ceecccCCEEEEEEEEEECCCCCc--eeEEEEEEEEE
Q psy7562 4 IIVFVAGQEFDLLVRATDHGSPNQ--SSTARVSIQVV 38 (582)
Q Consensus 4 ~ldyE~~~~~~l~v~a~D~g~p~~--~~~~~v~I~V~ 38 (582)
=++||.+-.|.|.|..+--..|+. ++..-+.+.|+
T Consensus 41 GF~yE~Gy~Y~L~Vk~~~~~nppaD~ss~~Y~L~~vi 77 (80)
T PF14302_consen 41 GFEYEPGYEYVLRVKRTPVANPPADASSYRYVLIKVI 77 (80)
T ss_pred CcCcCCCcEEEEEEEEEECCCCCCCCCceeEEEEEEE
Confidence 368999999999999998777754 55566666665
Done!