Query psy8639
Match_columns 570
No_of_seqs 245 out of 2101
Neff 9.1
Searched_HMMs 46136
Date Fri Aug 16 23:59:22 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy8639.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/8639hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG4289|consensus 100.0 9.3E-74 2E-78 607.5 54.5 513 19-569 245-820 (2531)
2 KOG4289|consensus 100.0 3.5E-70 7.6E-75 580.3 43.5 450 52-563 160-611 (2531)
3 KOG1219|consensus 100.0 1.8E-64 4E-69 551.9 54.1 487 6-569 687-1198(4289)
4 KOG1219|consensus 100.0 2.4E-61 5.2E-66 527.6 51.9 469 35-567 620-1093(4289)
5 cd00031 CA Cadherin repeat dom 100.0 8.9E-28 1.9E-32 227.2 29.2 198 61-268 2-199 (199)
6 cd00031 CA Cadherin repeat dom 100.0 2.6E-27 5.6E-32 224.0 28.7 197 283-524 2-199 (199)
7 PF00028 Cadherin: Cadherin do 99.6 2.3E-14 5E-19 117.5 13.2 92 283-377 1-93 (93)
8 PF00028 Cadherin: Cadherin do 99.6 2.9E-14 6.3E-19 116.9 13.4 91 391-523 1-93 (93)
9 KOG1834|consensus 99.6 1.1E-13 2.5E-18 141.1 19.9 204 56-269 32-245 (952)
10 KOG1834|consensus 99.5 6.7E-13 1.4E-17 135.6 20.7 221 155-387 21-251 (952)
11 smart00112 CA Cadherin repeats 99.5 4.3E-13 9.3E-18 106.3 10.5 79 303-384 1-79 (79)
12 smart00112 CA Cadherin repeats 99.5 5.5E-13 1.2E-17 105.7 11.0 73 458-530 6-79 (79)
13 PF08758 Cadherin_pro: Cadheri 97.3 0.0016 3.4E-08 52.2 8.8 78 53-142 3-80 (90)
14 TIGR01965 VCBS_repeat VCBS rep 97.2 0.004 8.7E-08 50.5 10.1 89 187-291 2-98 (99)
15 smart00736 CADG Dystroglycan-t 96.9 0.017 3.6E-07 47.4 11.3 88 65-162 9-96 (97)
16 PF08758 Cadherin_pro: Cadheri 96.6 0.025 5.4E-07 45.3 9.6 79 382-510 2-80 (90)
17 TIGR01965 VCBS_repeat VCBS rep 96.3 0.039 8.4E-07 44.8 9.2 89 76-180 2-98 (99)
18 PF08266 Cadherin_2: Cadherin- 96.2 0.0042 9E-08 49.1 2.9 61 63-127 5-66 (84)
19 smart00736 CADG Dystroglycan-t 96.1 0.091 2E-06 42.9 10.7 72 190-271 23-96 (97)
20 PF08266 Cadherin_2: Cadherin- 96.1 0.0058 1.3E-07 48.3 3.2 64 391-495 3-66 (84)
21 PF13750 Big_3_3: Bacterial Ig 95.8 1.3 2.9E-05 39.6 17.6 32 494-526 120-151 (158)
22 PF13750 Big_3_3: Bacterial Ig 95.5 1.2 2.6E-05 40.0 15.9 129 127-268 13-149 (158)
23 TIGR00845 caca sodium/calcium 92.2 21 0.00046 41.0 21.0 49 270-322 396-444 (928)
24 TIGR00864 PCC polycystin catio 92.1 46 0.00099 42.9 32.5 32 232-269 1563-1594(2740)
25 PF05345 He_PIG: Putative Ig d 87.2 3.1 6.8E-05 29.1 6.2 36 472-509 12-48 (49)
26 PF05345 He_PIG: Putative Ig d 87.0 3.7 8E-05 28.7 6.5 47 94-142 2-49 (49)
27 TIGR03660 T1SS_rpt_143 T1SS-14 85.6 13 0.00029 32.3 10.7 57 116-180 69-128 (137)
28 KOG3597|consensus 84.7 56 0.0012 34.4 29.4 159 150-322 25-197 (442)
29 PF07495 Y_Y_Y: Y_Y_Y domain; 82.2 16 0.00035 26.9 8.8 61 309-377 6-66 (66)
30 PF13753 SWM_repeat: Putative 81.9 60 0.0013 32.7 18.8 109 127-252 10-126 (317)
31 TIGR00845 caca sodium/calcium 79.7 1.3E+02 0.0028 35.0 20.5 152 160-321 395-570 (928)
32 TIGR03660 T1SS_rpt_143 T1SS-14 79.7 8.6 0.00019 33.5 7.2 45 237-292 85-129 (137)
33 KOG3597|consensus 77.6 41 0.00089 35.4 12.6 158 368-562 24-183 (442)
34 TIGR00864 PCC polycystin catio 76.5 2.5E+02 0.0055 36.7 33.7 37 491-527 1223-1259(2740)
35 PF07495 Y_Y_Y: Y_Y_Y domain; 74.2 22 0.00047 26.1 7.4 57 462-523 9-66 (66)
36 PF02010 REJ: REJ domain; Int 73.6 4.9 0.00011 42.7 5.0 29 498-527 375-403 (440)
37 PF05895 DUF859: Siphovirus pr 65.8 1E+02 0.0022 34.2 12.8 123 129-264 298-433 (624)
38 PF03160 Calx-beta: Calx-beta 60.7 87 0.0019 25.2 10.8 52 263-320 2-53 (100)
39 PF02010 REJ: REJ domain; Int 59.3 14 0.00029 39.3 4.9 131 237-387 50-194 (440)
40 PF02494 HYR: HYR domain; Int 53.0 66 0.0014 24.8 6.7 24 238-266 58-81 (81)
41 PF13753 SWM_repeat: Putative 52.5 2.6E+02 0.0056 28.1 19.2 114 236-364 11-125 (317)
42 cd00146 PKD polycystic kidney 49.0 36 0.00077 26.1 4.6 32 490-522 50-81 (81)
43 cd02848 Chitinase_N_term Chiti 47.6 41 0.00089 27.7 4.7 34 490-524 73-106 (106)
44 PF12245 Big_3_2: Bacterial Ig 45.5 79 0.0017 23.0 5.6 31 495-526 21-51 (60)
45 PF13754 Big_3_4: Bacterial Ig 44.3 65 0.0014 22.8 4.9 28 495-522 22-49 (54)
46 PF12245 Big_3_2: Bacterial Ig 41.8 1.1E+02 0.0023 22.3 5.8 30 128-161 22-51 (60)
47 PF03160 Calx-beta: Calx-beta 41.4 1.8E+02 0.004 23.2 8.4 35 373-411 2-36 (100)
48 PF13754 Big_3_4: Bacterial Ig 40.2 83 0.0018 22.3 4.9 17 236-252 23-39 (54)
49 smart00089 PKD Repeats in poly 37.2 77 0.0017 24.0 4.8 31 490-522 48-78 (79)
50 cd00146 PKD polycystic kidney 36.9 76 0.0017 24.2 4.8 30 123-156 51-80 (81)
51 PF08329 ChitinaseA_N: Chitina 35.7 1E+02 0.0023 26.6 5.6 64 462-527 41-112 (133)
52 PF02494 HYR: HYR domain; Int 26.5 1.2E+02 0.0026 23.3 4.3 26 496-522 56-81 (81)
53 smart00089 PKD Repeats in poly 25.4 1.5E+02 0.0032 22.4 4.6 28 346-376 51-78 (79)
54 PF09100 Qn_am_d_aIV: Quinohem 23.3 2.2E+02 0.0047 24.3 5.2 34 497-531 100-133 (133)
55 cd02848 Chitinase_N_term Chiti 21.2 3.3E+02 0.0072 22.5 5.8 30 346-377 76-105 (106)
56 PF08329 ChitinaseA_N: Chitina 20.5 5.6E+02 0.012 22.2 7.4 33 346-380 79-111 (133)
No 1
>KOG4289|consensus
Probab=100.00 E-value=9.3e-74 Score=607.53 Aligned_cols=513 Identities=29% Similarity=0.456 Sum_probs=458.6
Q ss_pred eCCCceeEEEEE-EEEEccCCCCcEEEEecCCCcccccccccccceeecCCCCCCeEEEEEEEEcCCCCCCCceEEEEEE
Q psy8639 19 STPNMVSRCKFK-KYLSNNEVCPKQIFTSTPNMVSRCKFKKYLSNNEIKEDVPIGFTVGNVVSTDSTGRAGGGHVMYSLT 97 (570)
Q Consensus 19 ~~p~~~~~~~~~-~v~d~nd~~p~~~~~~~~~~~~~p~f~~~~~~~~v~E~~~~gt~v~~l~a~D~D~~~~~~~~~ysi~ 97 (570)
|.|..++.+.+. .|+|+|||.| .|++..|..++.||.++|+.|.++.|+|.|.+ +|+.++|++
T Consensus 245 ~~P~~SAtttv~V~V~D~nDhsP--------------vFEq~~Y~e~lREn~evGy~vLtvrAtD~Dsp-~Nani~Yrl- 308 (2531)
T KOG4289|consen 245 GDPRRSATTTVTVLVLDTNDHSP--------------VFEQDEYREELRENLEVGYEVLTVRATDGDSP-PNANIRYRL- 308 (2531)
T ss_pred CCCcccceeEEEEEEeecCCCCc--------------ccchhHHHHHHhhccccCceEEEEEeccCCCC-CCCceEEEe-
Confidence 568888888886 9999999888 59999999999999999999999999999999 899999999
Q ss_pred ecCCCCCCCceEEECcccEEEECCcCCCCCCCeEEEEEEEEeCCCCCCCceeEEEEEEEEeeCCCCCCcCCCCceEEEEe
Q psy8639 98 SLSPTDHSGAFDIDKSSGSLVVARQLDRETCPEYKLEVRALDTSTSSNPQSSAVAIKIEIVDVNDNAPRWGENPLTIPVN 177 (570)
Q Consensus 98 ~~~~~~~~~~F~id~~~G~l~~~~~LD~E~~~~~~l~v~a~D~~g~~~~~s~~~~v~I~V~dvNd~~P~f~~~~~~~~v~ 177 (570)
..+ +....|+|++++|.|.++.+||||+...|+|.|.|+|.+..+++. ++.|.|+|.|+|||+|+|..+.|.+.|.
T Consensus 309 -~eg-~~~~~f~in~rSGvI~T~a~lDRE~~~~y~L~VeAsDqG~~pgp~--Ta~V~itV~D~NDNaPqFse~~Yvvqv~ 384 (2531)
T KOG4289|consen 309 -LEG-NAKNVFEINPRSGVISTRAPLDREELESYQLDVEASDQGRPPGPR--TAMVEITVEDENDNAPQFSEKRYVVQVR 384 (2531)
T ss_pred -cCC-CccceeEEcCccceeeccCccCHHhhhheEEEEEeccCCCCCCCc--eEEEEEEEEecCCCCccccccceEEEec
Confidence 444 778999999999999999999999999999999999987744433 8999999999999999999999999999
Q ss_pred CCCCCCeEEEEeEEecCCCCCCceEEEEEEecCCCCCcEEEeCCCcEEEEcccCCccCcceEEEEEEEEECCCCCCCCce
Q psy8639 178 EDTLVGSTIWNFTATDADDGSNGELRYGLVRQYPEGNHFSVDHLTGTLTLLQPLDYETLTSFILVVKVTDQAANISERLS 257 (570)
Q Consensus 178 E~~~~gt~v~~v~a~D~D~g~~~~~~y~l~~~~~~~~~F~Id~~~G~i~~~~~lD~e~~~~~~l~v~a~D~~~~~~~~~~ 257 (570)
|+.-+++.|.++.|+|.|.|.|+.+.|+|.++.. .+.|.||..+|+|.+..+||+|.. .|++.|+|.|++.|+ ++
T Consensus 385 Edvt~~avvlrV~AtDrD~g~Ng~VHYsi~Sgn~-~G~f~id~~tGel~vv~plD~e~~-~ytl~IrAqDggrPp---Ls 459 (2531)
T KOG4289|consen 385 EDVTPPAVVLRVTATDRDKGTNGKVHYSIASGNG-RGQFYIDSLTGELDVVEPLDFENS-EYTLRIRAQDGGRPP---LS 459 (2531)
T ss_pred ccCCCCceEEEEEecccCCCcCceEEEEeeccCc-cccEEEecccceEEEeccccccCC-eeEEEEEcccCCCCC---cc
Confidence 9999999999999999999999999999999864 689999999999999999999998 999999999999886 88
Q ss_pred eEEEEEEEEeeCCCCCCeeecCCCCeeEEEeccCCCCeEEEEEEEEeCCCCCCeEEEEEEEeCCCCCcEEEecCCCEEEE
Q psy8639 258 TSLTARVVVKDLNDNAPVFVSPTRGSLIYVSEQTRVGSHVTKMTAVDQDSGDNGRVTYVISGGNEDARFHLDHDKGILSL 337 (570)
Q Consensus 258 ~~~~v~I~V~dvNd~~P~~~~~~~~~~~~v~E~~~~g~~v~~v~a~D~D~~~~~~i~ysi~~~~~~~~F~id~~tG~l~~ 337 (570)
.+.-+.|+|.|+|||+|.|.... ..++|.|+.+.|..+..+.|.|.|.|+|+.+.|++.+ .+.|.|+..+|+|++
T Consensus 460 n~sgl~iqVlDINDhaPifvstp--fq~tvlEnv~lg~~v~~vqaidadsg~na~l~y~laG---~~pf~I~~~SG~Itv 534 (2531)
T KOG4289|consen 460 NTSGLVIQVLDINDHAPIFVSTP--FQATVLENVPLGYLVCHVQAIDADSGENARLHYSLAG---VGPFQINNGSGWITV 534 (2531)
T ss_pred CCCceEEEEEecCCCCceeEech--hhhhhhhcccccceEEEEecccCCCCcccceeeeecc---CCCeeEecCCceEEE
Confidence 88888899999999999999886 6789999999999999999999999999999999985 468999999999999
Q ss_pred EeecCcccccceeEEEEEEEEEECCcCCceeEEEEEEEEeeCCCCCCeecCCeeEEEEeCCCCCCcEEEEEEEecCCCCC
Q psy8639 338 ARVISEYDHQSKFRAVLNITASDHGIPPKSSSLVINILSETVSTSLPTFLSPSYHANVSEDAIPGTYIIKVVARSPSTYG 417 (570)
Q Consensus 338 ~~~ld~~d~e~~~~~~l~V~a~D~g~~~~s~~~~v~I~V~dvNd~~P~f~~~~~~~~v~E~~~~gt~v~~v~a~D~D~~~ 417 (570)
.+.|| ||+...|.|.|+|+|+|.|+.++.++|.|++.|+|||.|.|.++.|++.+.|+.+.|+.|.+++|.|.|.+.
T Consensus 535 tk~ld---rEt~~~ysl~V~ard~gtp~l~tstsI~Vtv~dvndndP~Ft~~eytl~inED~pvgsSI~tvtAvD~d~~s 611 (2531)
T KOG4289|consen 535 TKELD---RETVEHYSLGVEARDHGTPPLSTSTSISVTVLDVNDNDPTFTQKEYTLRINEDAPVGSSIVTVTAVDRDANS 611 (2531)
T ss_pred eeccc---ccccceEEEEEEEcCCCCCcccccceEEEEecccCCCCCccccCceEEEecCCccccceEEEEEEecccccc
Confidence 99999 999999999999999999999999999999999999999999999999999999999999999999999988
Q ss_pred CceEE-------EEEEeeccCC--------------CceEEEEEEeeC---CCcee-----------------EEEE---
Q psy8639 418 NYVYT-------EVFLVSEDAI--------------PGTYIIKVVARS---PSTYV-----------------YHMY--- 453 (570)
Q Consensus 418 ~~~~~-------~~~~v~~~~~--------------~~~~~~~~~~~~---~~~~~-----------------~~~~--- 453 (570)
.+.|. ..|.++.... .+.|.+.|-+++ .+... .||+
T Consensus 612 ~ityqi~g~ntrn~Fsi~si~g~Glitlalp~dkKqe~~~vl~vtAtDg~l~d~~~V~v~I~danThrpvFqs~pfTvsI 691 (2531)
T KOG4289|consen 612 VITYQITGGNTRNRFSISSIGGGGLITLALPLDKKQERQYVLAVTATDGTLQDTCSVNVNITDANTHRPVFQSSPFTVSI 691 (2531)
T ss_pred ceEEEecCCcccccceeeccCCcceEEeecchhhcccceEEEEEEecCCccccceEEEEEeeecccCCcccccCCeeEee
Confidence 87773 3344443321 122333333333 11111 1111
Q ss_pred ----------------EeecCCCceEEEEecCCCCCCcEEEeCCCcEEEEcccCCccccceEEEEEEEEeCCC-CCeeEE
Q psy8639 454 ----------------LISDFSGYNLTYMIPAGVADDMFKIDAETGVVTLARSLDRETVDQYNLVVHVTDSKS-GDYDTT 516 (570)
Q Consensus 454 ----------------~~~~~~~~~~~y~i~~~~~~~~F~Id~~tG~i~~~~~lD~E~~~~~~l~V~atD~~~-~~~~~~ 516 (570)
-.+-++|..++|-+. ...|+||+++|.+++...||||.+-.|.+.|+|.|.+. +..++.
T Consensus 692 ~e~rP~G~tvvtlsasd~D~geNARI~y~le----d~~Frid~dsg~i~t~~~ld~edqvtytl~itA~D~~~pq~adtt 767 (2531)
T KOG4289|consen 692 NEDRPLGTTVVTLSASDEDTGENARITYILE----DEAFRIDPDSGAIYTQAELDYEDQVTYTLAITARDNGIPQKADTT 767 (2531)
T ss_pred ccCCcCCceeEEEecccCCCCccceEEEEec----ccceeecCCCCceEEeeeeecccceeeEeeeeecCCCCCCcCccE
Confidence 112477888999553 34599999999999999999999999999999999998 678899
Q ss_pred EEEEEEEecCCCCcccccCCeeeEEecCCCCCC-eEEEEEEEeCCCCCcceecC
Q psy8639 517 NVFITILDVNDNAPEFKSGSCYPITIPENSEAA-NVHTFIATDFGNFHKSFIDS 569 (570)
Q Consensus 517 ~v~I~V~dvNd~~P~f~~~~~~~~~v~E~~~~~-~v~~i~a~D~D~~~~~~~~~ 569 (570)
+|.|.|.|+|||+|+|. .+.|.++|.|+.+.+ .|++++|||.|.|.|+..|+
T Consensus 768 tveV~v~diNDnaPqf~-assyt~sV~Ed~Pv~TsvlQVSatDaD~g~Ng~v~y 820 (2531)
T KOG4289|consen 768 TVEVLVNDINDNAPQFL-ASSYTGSVFEDAPVFTSVLQVSATDADSGPNGRVYY 820 (2531)
T ss_pred EEEEEeecccccCcccc-hhhceeEeecCCCCcceEEEEEEeccCCCCCceEEE
Confidence 99999999999999999 899999999999999 89999999999999999875
No 2
>KOG4289|consensus
Probab=100.00 E-value=3.5e-70 Score=580.34 Aligned_cols=450 Identities=29% Similarity=0.457 Sum_probs=427.9
Q ss_pred ccccccccccceeecCCCCCCeEEEEEEEEcCCCCCCCceEEEEEEecCCCCCCCceEEECcccEEEECCcCCCCCCCeE
Q psy8639 52 SRCKFKKYLSNNEIKEDVPIGFTVGNVVSTDSTGRAGGGHVMYSLTSLSPTDHSGAFDIDKSSGSLVVARQLDRETCPEY 131 (570)
Q Consensus 52 ~~p~f~~~~~~~~v~E~~~~gt~v~~l~a~D~D~~~~~~~~~ysi~~~~~~~~~~~F~id~~~G~l~~~~~LD~E~~~~~ 131 (570)
++++|.++.|...++||.|+||.|+.+.|.|+| .+.+.|++..+...-..++|+||+.+|.|++++.||||.+..+
T Consensus 160 ~~~~Fqq~~Yq~~lpEn~pagT~iasv~A~~~~----a~rl~Ysm~al~dsRS~~lFslD~~sG~irta~~lDREt~e~H 235 (2531)
T KOG4289|consen 160 NAVQFQQPNYQKELPENEPAGTIIASVKASDPD----AGRLYYSMVALFDSRSQNLFSLDPMSGAIRTAKSLDRETKETH 235 (2531)
T ss_pred CCccCCCcchhccCcCCCCCCceeEEEEecCCC----cCceEEEeeeccchhccccEeeccccccchhhhhhhhhhhhee
Confidence 567999999999999999999999999999998 3689999966555566789999999999999999999999999
Q ss_pred EEEEEEEeCCCCCCCceeEEEEEEEEeeCCCCCCcCCCCceEEEEeCCCCCCeEEEEeEEecCCCCCCceEEEEEEecCC
Q psy8639 132 KLEVRALDTSTSSNPQSSAVAIKIEIVDVNDNAPRWGENPLTIPVNEDTLVGSTIWNFTATDADDGSNGELRYGLVRQYP 211 (570)
Q Consensus 132 ~l~v~a~D~~g~~~~~s~~~~v~I~V~dvNd~~P~f~~~~~~~~v~E~~~~gt~v~~v~a~D~D~g~~~~~~y~l~~~~~ 211 (570)
-|+|+|.|.+. |..|++.+++|.|.|+|||.|+|.+..|...+.||.+.|..|.+++|+|.|.++|+.|.|++.++..
T Consensus 236 vlrVtA~d~~~--P~~SAtttv~V~V~D~nDhsPvFEq~~Y~e~lREn~evGy~vLtvrAtD~Dsp~Nani~Yrl~eg~~ 313 (2531)
T KOG4289|consen 236 VLRVTAQDHGD--PRRSATTTVTVLVLDTNDHSPVFEQDEYREELRENLEVGYEVLTVRATDGDSPPNANIRYRLLEGNA 313 (2531)
T ss_pred EEEEEeeecCC--CcccceeEEEEEEeecCCCCcccchhHHHHHHhhccccCceEEEEEeccCCCCCCCceEEEecCCCc
Confidence 99999999765 8899999999999999999999999999999999999999999999999999999999999999854
Q ss_pred CCCcEEEeCCCcEEEEcccCCccCcceEEEEEEEEECCCCCCCCceeEEEEEEEEeeCCCCCCeeecCCCCeeEEEeccC
Q psy8639 212 EGNHFSVDHLTGTLTLLQPLDYETLTSFILVVKVTDQAANISERLSTSLTARVVVKDLNDNAPVFVSPTRGSLIYVSEQT 291 (570)
Q Consensus 212 ~~~~F~Id~~~G~i~~~~~lD~e~~~~~~l~v~a~D~~~~~~~~~~~~~~v~I~V~dvNd~~P~~~~~~~~~~~~v~E~~ 291 (570)
...|.||+.+|.|....+||+|+..+|+|.|.|+|+|.+++++ ++.|.|+|.|.|||+|+|.... |.+.|.|+.
T Consensus 314 -~~~f~in~rSGvI~T~a~lDRE~~~~y~L~VeAsDqG~~pgp~---Ta~V~itV~D~NDNaPqFse~~--Yvvqv~Edv 387 (2531)
T KOG4289|consen 314 -KNVFEINPRSGVISTRAPLDREELESYQLDVEASDQGRPPGPR---TAMVEITVEDENDNAPQFSEKR--YVVQVREDV 387 (2531)
T ss_pred -cceeEEcCccceeeccCccCHHhhhheEEEEEeccCCCCCCCc---eEEEEEEEEecCCCCccccccc--eEEEecccC
Confidence 6899999999999999999999999999999999999987553 8999999999999999999887 999999999
Q ss_pred CCCeEEEEEEEEeCCCCCCeEEEEEEEeCCCCCcEEEecCCCEEEEEeecCcccccceeEEEEEEEEEECCcCCceeEEE
Q psy8639 292 RVGSHVTKMTAVDQDSGDNGRVTYVISGGNEDARFHLDHDKGILSLARVISEYDHQSKFRAVLNITASDHGIPPKSSSLV 371 (570)
Q Consensus 292 ~~g~~v~~v~a~D~D~~~~~~i~ysi~~~~~~~~F~id~~tG~l~~~~~ld~~d~e~~~~~~l~V~a~D~g~~~~s~~~~ 371 (570)
.+++++++++|+|.|.|.|+.+.|+|.+++..+.|.|+..||+|.+..+|| +|.. .|++.|.|.|+|.|+++.+.-
T Consensus 388 t~~avvlrV~AtDrD~g~Ng~VHYsi~Sgn~~G~f~id~~tGel~vv~plD---~e~~-~ytl~IrAqDggrPpLsn~sg 463 (2531)
T KOG4289|consen 388 TPPAVVLRVTATDRDKGTNGKVHYSIASGNGRGQFYIDSLTGELDVVEPLD---FENS-EYTLRIRAQDGGRPPLSNTSG 463 (2531)
T ss_pred CCCceEEEEEecccCCCcCceEEEEeeccCccccEEEecccceEEEecccc---ccCC-eeEEEEEcccCCCCCccCCCc
Confidence 999999999999999999999999999999999999999999999999999 9988 999999999999999999999
Q ss_pred EEEEEeeCCCCCCeecCCeeEEEEeCCCCCCcEEEEEEEecCCCCCCceEEEEEEeeccCCCceEEEEEEeeCCCceeEE
Q psy8639 372 INILSETVSTSLPTFLSPSYHANVSEDAIPGTYIIKVVARSPSTYGNYVYTEVFLVSEDAIPGTYIIKVVARSPSTYVYH 451 (570)
Q Consensus 372 v~I~V~dvNd~~P~f~~~~~~~~v~E~~~~gt~v~~v~a~D~D~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~ 451 (570)
+.|+|.|+||++|.|....+..++.|+.+.|..+..+.|.|.|.
T Consensus 464 l~iqVlDINDhaPifvstpfq~tvlEnv~lg~~v~~vqaidads------------------------------------ 507 (2531)
T KOG4289|consen 464 LVIQVLDINDHAPIFVSTPFQATVLENVPLGYLVCHVQAIDADS------------------------------------ 507 (2531)
T ss_pred eEEEEEecCCCCceeEechhhhhhhhcccccceEEEEecccCCC------------------------------------
Confidence 99999999999999999999999999999999999999999999
Q ss_pred EEEeecCCCceEEEEecCCCCCCcEEEeCCCcEEEEcccCCccccceEEEEEEEEeCCC-CCeeEEEEEEEEEecCCCCc
Q psy8639 452 MYLISDFSGYNLTYMIPAGVADDMFKIDAETGVVTLARSLDRETVDQYNLVVHVTDSKS-GDYDTTNVFITILDVNDNAP 530 (570)
Q Consensus 452 ~~~~~~~~~~~~~y~i~~~~~~~~F~Id~~tG~i~~~~~lD~E~~~~~~l~V~atD~~~-~~~~~~~v~I~V~dvNd~~P 530 (570)
+.|+.+.|++.+ .+.|.|+..+|+|++.+.||||+...|.|-|.|+|.+. +.+++.+|.|.+.|+|||.|
T Consensus 508 ------g~na~l~y~laG---~~pf~I~~~SG~Itvtk~ldrEt~~~ysl~V~ard~gtp~l~tstsI~Vtv~dvndndP 578 (2531)
T KOG4289|consen 508 ------GENARLHYSLAG---VGPFQINNGSGWITVTKELDRETVEHYSLGVEARDHGTPPLSTSTSISVTVLDVNDNDP 578 (2531)
T ss_pred ------Ccccceeeeecc---CCCeeEecCCceEEEeecccccccceEEEEEEEcCCCCCcccccceEEEEecccCCCCC
Confidence 999999999854 45899999999999999999999999999999999998 68899999999999999999
Q ss_pred ccccCCeeeEEecCCCCCC-eEEEEEEEeCCCCC
Q psy8639 531 EFKSGSCYPITIPENSEAA-NVHTFIATDFGNFH 563 (570)
Q Consensus 531 ~f~~~~~~~~~v~E~~~~~-~v~~i~a~D~D~~~ 563 (570)
.|. +..|+..+.|..+.| .|.+++|+|.|...
T Consensus 579 ~Ft-~~eytl~inED~pvgsSI~tvtAvD~d~~s 611 (2531)
T KOG4289|consen 579 TFT-QKEYTLRINEDAPVGSSIVTVTAVDRDANS 611 (2531)
T ss_pred ccc-cCceEEEecCCccccceEEEEEEecccccc
Confidence 999 889999999999999 89999999999764
No 3
>KOG1219|consensus
Probab=100.00 E-value=1.8e-64 Score=551.88 Aligned_cols=487 Identities=30% Similarity=0.419 Sum_probs=449.0
Q ss_pred eeeeeeccCceEEeCCC---------------------ceeEEEEE-EEEEccCCCCcEEEEecCCCcccccccccccce
Q psy8639 6 KFKKYLSNNEIFTSTPN---------------------MVSRCKFK-KYLSNNEVCPKQIFTSTPNMVSRCKFKKYLSNN 63 (570)
Q Consensus 6 ~~~~~~~~~~~~~~~p~---------------------~~~~~~~~-~v~d~nd~~p~~~~~~~~~~~~~p~f~~~~~~~ 63 (570)
-|.|+++.|+|-+-.|- .++-...+ .|.|.|||+| .|....|.+
T Consensus 687 ~~~Id~qsg~itvas~ld~~~t~~yiLnvta~D~gtPqkss~r~l~v~vkd~ndn~p--------------~f~e~sy~v 752 (4289)
T KOG1219|consen 687 CFLIDRQSGNITVASPLDNENTEQYILNVTAYDLGTPQKSSWRLLLVFVKDYNDNTP--------------IFVERSYHV 752 (4289)
T ss_pred EEEEecccceEEEecchhhhhhheeEEEEEEecCCCchhhceeeEEEEEEecccCCc--------------cccccceEE
Confidence 48899999998776654 22222222 6666666665 699999999
Q ss_pred eecCCCCCCeEEEEEEEEcCCCCCCCceEEEEEEecCCCCCCCceEEECcccEEEECCcCCCCCCCeEEEEEEEEeCCCC
Q psy8639 64 EIKEDVPIGFTVGNVVSTDSTGRAGGGHVMYSLTSLSPTDHSGAFDIDKSSGSLVVARQLDRETCPEYKLEVRALDTSTS 143 (570)
Q Consensus 64 ~v~E~~~~gt~v~~l~a~D~D~~~~~~~~~ysi~~~~~~~~~~~F~id~~~G~l~~~~~LD~E~~~~~~l~v~a~D~~g~ 143 (570)
+|.|+..+|+.|+.+++.|.|.+ +|+.++|+| .+....|+||+.+|.+.+.+.||||.+..|+|.|+|.|...
T Consensus 753 tvsedtepgs~Ia~vetnd~D~g-~NG~v~fsL-----~n~sdvfsIdp~tGivv~~~sLdrE~q~~y~l~I~a~dqp~- 825 (4289)
T KOG1219|consen 753 TVSEDTEPGSFIAHVETNDTDGG-NNGMVSFSL-----LNKSDVFSIDPFTGIVVTSKSLDREGQTSYHLKIEARDQPP- 825 (4289)
T ss_pred EEecCCCCCceEEEEEecccCCC-CCceEEEEe-----cCCcceEEecCcccEEEeccccCcccCceeEEEEEEcCCCC-
Confidence 99999999999999999999999 899999999 24578999999999999999999999999999999999654
Q ss_pred CCCceeEEEEEEEEeeCCCCCCcCCCCceEEEEeCCCCCCeEEEEeEEecCCCCCCceEEEEEEecCCCCCcEEEeCCCc
Q psy8639 144 SNPQSSAVAIKIEIVDVNDNAPRWGENPLTIPVNEDTLVGSTIWNFTATDADDGSNGELRYGLVRQYPEGNHFSVDHLTG 223 (570)
Q Consensus 144 ~~~~s~~~~v~I~V~dvNd~~P~f~~~~~~~~v~E~~~~gt~v~~v~a~D~D~g~~~~~~y~l~~~~~~~~~F~Id~~~G 223 (570)
+.+...+.+.|.|.|||||+|.+....+...|+|+.|.|+.+..+.|.|+|.|.+++++|.+..+.- .+.++..+|
T Consensus 826 -pq~~svv~l~vsvedVndnpPkci~~hsr~kipedlp~gt~~~~l~A~d~diGq~~kvry~l~~~~v---~~rvd~~sG 901 (4289)
T KOG1219|consen 826 -PQLFSVVELDVSVEDVNDNPPKCIIRHSRSKIPEDLPYGTVTWQLVALDPDIGQLGKVRYYLTDDTV---GERVDFPSG 901 (4289)
T ss_pred -CceEEEEEEEEEEeeccCCCCccccccccccCcccCCCceEEEEhhhcCcccCcCceeEEEEecCcc---ccccccccc
Confidence 6677888999999999999999998889999999999999999999999999999999999998653 459999999
Q ss_pred EEEEcccCCccCcceEEEEEEEEECCCCCCCCceeEEEEEEEEeeCCCC--CCeeecCCCCeeEEEeccCCCCeEEEEEE
Q psy8639 224 TLTLLQPLDYETLTSFILVVKVTDQAANISERLSTSLTARVVVKDLNDN--APVFVSPTRGSLIYVSEQTRVGSHVTKMT 301 (570)
Q Consensus 224 ~i~~~~~lD~e~~~~~~l~v~a~D~~~~~~~~~~~~~~v~I~V~dvNd~--~P~~~~~~~~~~~~v~E~~~~g~~v~~v~ 301 (570)
.+.+.++||||..+-|.|.|+|.|++.|. +++.+.+.|.+.|+|+| ||.|.... ....|.||++.|+.++++.
T Consensus 902 avfi~~~LDf~k~~fynLsv~a~d~g~p~---lss~chl~Vevldv~enlhpp~F~~~v--~e~~V~EnapiGT~vi~i~ 976 (4289)
T KOG1219|consen 902 AVFIGKPLDFEKSDFYNLSVTAVDRGTPI---LSSICHLEVEVLDVNENLHPPEFISFV--TEGHVLENAPIGTIVIRIQ 976 (4289)
T ss_pred cEEEecccccccccceEEEEEEecCCCcc---eeeeEEEEEEEeccCCCCCCcchheee--eeeeEeecCCcceEEEEEE
Confidence 99999999999999999999999999885 88999999999999987 99998765 5789999999999999999
Q ss_pred EEeCCCCCCeEEEEEEEeCCCCCcEEEecCCCEEEEEeecCcccccceeEEEEEEEEEECCcCCceeEEEEEEEEeeCCC
Q psy8639 302 AVDQDSGDNGRVTYVISGGNEDARFHLDHDKGILSLARVISEYDHQSKFRAVLNITASDHGIPPKSSSLVINILSETVST 381 (570)
Q Consensus 302 a~D~D~~~~~~i~ysi~~~~~~~~F~id~~tG~l~~~~~ld~~d~e~~~~~~l~V~a~D~g~~~~s~~~~v~I~V~dvNd 381 (570)
|.|.|.|..+.+.|+|..++..+.|+|+..+|.|++.+.|| +|....|.|+|.|+|.|.+++++...+.|.|+|+||
T Consensus 977 A~dedsgldg~l~Y~I~~gdg~g~FsId~~tG~irTl~~lD---rE~ks~YwltveA~D~gt~~~ssv~~vyI~ieDvND 1053 (4289)
T KOG1219|consen 977 ARDEDSGLDGELSYKIRTGDGDGIFSIDSTTGSIRTLKALD---REKKSSYWLTVEAKDLGTVPLSSVCEVYIEIEDVND 1053 (4289)
T ss_pred EecCCCCccceEEEEEEcCCcceeEEecCCcceEeechhhc---hhhcceEEEEEEEEecCCCccccceeEEEEEEecCC
Confidence 99999999999999999998899999999999999999999 999999999999999999999999999999999999
Q ss_pred CCCeecCCeeEEEEeCCCCCCcEEEEEEEecCCCCCCceEEEEEEeeccCCCceEEEEEEeeCCCceeEEEEEeecCCCc
Q psy8639 382 SLPTFLSPSYHANVSEDAIPGTYIIKVVARSPSTYGNYVYTEVFLVSEDAIPGTYIIKVVARSPSTYVYHMYLISDFSGY 461 (570)
Q Consensus 382 ~~P~f~~~~~~~~v~E~~~~gt~v~~v~a~D~D~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 461 (570)
|+|.|..+-|..+|.|+++.+..|.++.|.|+|. ..++
T Consensus 1054 n~Pq~s~pvy~asI~enSp~~vsivq~ea~D~Ds------------------------------------------ssn~ 1091 (4289)
T KOG1219|consen 1054 NVPQFSSPVYYASISENSPETVSIVQAEANDPDS------------------------------------------SSNQ 1091 (4289)
T ss_pred CCcccCCceEeeeeccCCCCceEEEEeccCCCCc------------------------------------------ccCc
Confidence 9999999999999999999999999999999998 7899
Q ss_pred eEEEEecCCCCCCcEEEeCCCcEEEEcccCCccccceEEEEEEEEeCCC-CCeeEEEEEEEEEecCCCCcccccCCeeeE
Q psy8639 462 NLTYMIPAGVADDMFKIDAETGVVTLARSLDRETVDQYNLVVHVTDSKS-GDYDTTNVFITILDVNDNAPEFKSGSCYPI 540 (570)
Q Consensus 462 ~~~y~i~~~~~~~~F~Id~~tG~i~~~~~lD~E~~~~~~l~V~atD~~~-~~~~~~~v~I~V~dvNd~~P~f~~~~~~~~ 540 (570)
++.|.|.+|+..+.|+|++.+|.|+..+.||||++.+|.|+|+++|++. .+++.+-|.|.|+|+|||+|+|. +..+..
T Consensus 1092 kLmykI~sGnyq~FF~Id~~TG~iTt~r~LDRE~qdEHiLeVTi~D~gep~l~s~~rviV~IldvNdnsp~Fl-qk~~~~ 1170 (4289)
T KOG1219|consen 1092 KLMYKITSGNYQGFFQIDPETGLITTIRRLDREKQDEHILEVTIQDNGEPWLCSNQRVIVSILDVNDNSPRFL-QKKTFL 1170 (4289)
T ss_pred ceEEEEccCCccceEEEccccceeeeehhhcccccccceEEEEEecCCCCccccceEEEEEEeeccCCchhhh-hheeEE
Confidence 9999999999999999999999999999999999999999999999998 57888999999999999999999 889999
Q ss_pred EecCCCCCCeEEEEEEEeCCCCCcceecC
Q psy8639 541 TIPENSEAANVHTFIATDFGNFHKSFIDS 569 (570)
Q Consensus 541 ~v~E~~~~~~v~~i~a~D~D~~~~~~~~~ 569 (570)
.|+|.... .+.++.|+|.|+|.|+-|-+
T Consensus 1171 ~v~~r~s~-plyRl~a~d~DeG~narity 1198 (4289)
T KOG1219|consen 1171 RVPERSSP-PLYRLAAQDNDEGNNARITY 1198 (4289)
T ss_pred EeeeccCC-ceeEEEEEecCCCcceEEEE
Confidence 99998774 78999999999999987754
No 4
>KOG1219|consensus
Probab=100.00 E-value=2.4e-61 Score=527.59 Aligned_cols=469 Identities=30% Similarity=0.404 Sum_probs=431.9
Q ss_pred ccCCCCcEEEEecCCCcccccccc-cccceeecCCCCCCeEEEEEEEEcCCCCCCCceEEEEEEecCCCCCCCceEEECc
Q psy8639 35 NNEVCPKQIFTSTPNMVSRCKFKK-YLSNNEIKEDVPIGFTVGNVVSTDSTGRAGGGHVMYSLTSLSPTDHSGAFDIDKS 113 (570)
Q Consensus 35 ~nd~~p~~~~~~~~~~~~~p~f~~-~~~~~~v~E~~~~gt~v~~l~a~D~D~~~~~~~~~ysi~~~~~~~~~~~F~id~~ 113 (570)
-|+..|.+.-... ...-|.|.. -++-..|+|+.|+|+.++.+.|+|+|+| -|+.++|.| .+++....|.|+.+
T Consensus 620 ~~~~~p~Ilq~~e--~~~fPqf~s~fP~iI~v~Edvpigt~la~L~atD~Dtg-fng~l~yvI---~dgne~~~~~Id~q 693 (4289)
T KOG1219|consen 620 GNEHEPNILQRFE--NKHFPQFPSDFPFIIVVPEDVPIGTTLAILSATDSDTG-FNGKLVYVI---EDGNESICFLIDRQ 693 (4289)
T ss_pred ccccChhHhhhhc--cccCccccccCCceEEccccCCCCceEEEEeccCCCCC-cCceEEEEE---eCCccceEEEEecc
Confidence 4555665443321 123367777 5677889999999999999999999999 899999999 33467899999999
Q ss_pred ccEEEECCcCCCCCCCeEEEEEEEEeCCCCCCCceeEEEEEEEEeeCCCCCCcCCCCceEEEEeCCCCCCeEEEEeEEec
Q psy8639 114 SGSLVVARQLDRETCPEYKLEVRALDTSTSSNPQSSAVAIKIEIVDVNDNAPRWGENPLTIPVNEDTLVGSTIWNFTATD 193 (570)
Q Consensus 114 ~G~l~~~~~LD~E~~~~~~l~v~a~D~~g~~~~~s~~~~v~I~V~dvNd~~P~f~~~~~~~~v~E~~~~gt~v~~v~a~D 193 (570)
+|.|.+.++||+|....|.|.|+|.|.+. |..+++..+.|.|.|.|||+|.|.+..|.+.|.|+..+|+.++.+.|.|
T Consensus 694 sg~itvas~ld~~~t~~yiLnvta~D~gt--Pqkss~r~l~v~vkd~ndn~p~f~e~sy~vtvsedtepgs~Ia~vetnd 771 (4289)
T KOG1219|consen 694 SGNITVASPLDNENTEQYILNVTAYDLGT--PQKSSWRLLLVFVKDYNDNTPIFVERSYHVTVSEDTEPGSFIAHVETND 771 (4289)
T ss_pred cceEEEecchhhhhhheeEEEEEEecCCC--chhhceeeEEEEEEecccCCccccccceEEEEecCCCCCceEEEEEecc
Confidence 99999999999999999999999999766 7888999999999999999999999999999999999999999999999
Q ss_pred CCCCCCceEEEEEEecCCCCCcEEEeCCCcEEEEcccCCccCcceEEEEEEEEECCCCCCCCceeEEEEEEEEeeCCCCC
Q psy8639 194 ADDGSNGELRYGLVRQYPEGNHFSVDHLTGTLTLLQPLDYETLTSFILVVKVTDQAANISERLSTSLTARVVVKDLNDNA 273 (570)
Q Consensus 194 ~D~g~~~~~~y~l~~~~~~~~~F~Id~~~G~i~~~~~lD~e~~~~~~l~v~a~D~~~~~~~~~~~~~~v~I~V~dvNd~~ 273 (570)
.|.|+||.++|+|... ++.|+||+.+|.+.+.++||+|.+.+|.|.|.|+|++.|. +.+.+.+.|.|+|||||+
T Consensus 772 ~D~g~NG~v~fsL~n~---sdvfsIdp~tGivv~~~sLdrE~q~~y~l~I~a~dqp~pq---~~svv~l~vsvedVndnp 845 (4289)
T KOG1219|consen 772 TDGGNNGMVSFSLLNK---SDVFSIDPFTGIVVTSKSLDREGQTSYHLKIEARDQPPPQ---LFSVVELDVSVEDVNDNP 845 (4289)
T ss_pred cCCCCCceEEEEecCC---cceEEecCcccEEEeccccCcccCceeEEEEEEcCCCCCc---eEEEEEEEEEEeeccCCC
Confidence 9999999999999873 5799999999999999999999999999999999998754 788899999999999999
Q ss_pred CeeecCCCCeeEEEeccCCCCeEEEEEEEEeCCCCCCeEEEEEEEeCCCCCcEEEecCCCEEEEEeecCcccccceeEEE
Q psy8639 274 PVFVSPTRGSLIYVSEQTRVGSHVTKMTAVDQDSGDNGRVTYVISGGNEDARFHLDHDKGILSLARVISEYDHQSKFRAV 353 (570)
Q Consensus 274 P~~~~~~~~~~~~v~E~~~~g~~v~~v~a~D~D~~~~~~i~ysi~~~~~~~~F~id~~tG~l~~~~~ld~~d~e~~~~~~ 353 (570)
|.+.... +...|.|+.|.|+++..+.|.|+|.|.+++++|.+..+. -.+.++.-+|.+.+-++|| ++....|+
T Consensus 846 Pkci~~h--sr~kipedlp~gt~~~~l~A~d~diGq~~kvry~l~~~~--v~~rvd~~sGavfi~~~LD---f~k~~fyn 918 (4289)
T KOG1219|consen 846 PKCIIRH--SRSKIPEDLPYGTVTWQLVALDPDIGQLGKVRYYLTDDT--VGERVDFPSGAVFIGKPLD---FEKSDFYN 918 (4289)
T ss_pred Ccccccc--ccccCcccCCCceEEEEhhhcCcccCcCceeEEEEecCc--cccccccccccEEEecccc---cccccceE
Confidence 9998877 789999999999999999999999999999999999863 3358998899999999999 99999999
Q ss_pred EEEEEEECCcCCceeEEEEEEEEeeCCCC--CCeecCCeeEEEEeCCCCCCcEEEEEEEecCCCCCCceEEEEEEeeccC
Q psy8639 354 LNITASDHGIPPKSSSLVINILSETVSTS--LPTFLSPSYHANVSEDAIPGTYIIKVVARSPSTYGNYVYTEVFLVSEDA 431 (570)
Q Consensus 354 l~V~a~D~g~~~~s~~~~v~I~V~dvNd~--~P~f~~~~~~~~v~E~~~~gt~v~~v~a~D~D~~~~~~~~~~~~v~~~~ 431 (570)
|.|+|.|+|.|.+++.+++.|.+.|+|.| ||.|....-.+.|.||++.|+.++.|.|.|.|.
T Consensus 919 Lsv~a~d~g~p~lss~chl~Vevldv~enlhpp~F~~~v~e~~V~EnapiGT~vi~i~A~deds---------------- 982 (4289)
T KOG1219|consen 919 LSVTAVDRGTPILSSICHLEVEVLDVNENLHPPEFISFVTEGHVLENAPIGTIVIRIQARDEDS---------------- 982 (4289)
T ss_pred EEEEEecCCCcceeeeEEEEEEEeccCCCCCCcchheeeeeeeEeecCCcceEEEEEEEecCCC----------------
Confidence 99999999999999999999999999876 999988778899999999999999999999999
Q ss_pred CCceEEEEEEeeCCCceeEEEEEeecCCCceEEEEecCCCCCCcEEEeCCCcEEEEcccCCccccceEEEEEEEEeCCC-
Q psy8639 432 IPGTYIIKVVARSPSTYVYHMYLISDFSGYNLTYMIPAGVADDMFKIDAETGVVTLARSLDRETVDQYNLVVHVTDSKS- 510 (570)
Q Consensus 432 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~i~~~~~~~~F~Id~~tG~i~~~~~lD~E~~~~~~l~V~atD~~~- 510 (570)
|..+.+.|+|..|...+.|+||..+|.|++...||||....|-|+|.|+|.+.
T Consensus 983 --------------------------gldg~l~Y~I~~gdg~g~FsId~~tG~irTl~~lDrE~ks~YwltveA~D~gt~ 1036 (4289)
T KOG1219|consen 983 --------------------------GLDGELSYKIRTGDGDGIFSIDSTTGSIRTLKALDREKKSSYWLTVEAKDLGTV 1036 (4289)
T ss_pred --------------------------CccceEEEEEEcCCcceeEEecCCcceEeechhhchhhcceEEEEEEEEecCCC
Confidence 99999999999998899999999999999999999999999999999999999
Q ss_pred CCeeEEEEEEEEEecCCCCcccccCCeeeEEecCCCCCC-eEEEEEEEeCCCCCccee
Q psy8639 511 GDYDTTNVFITILDVNDNAPEFKSGSCYPITIPENSEAA-NVHTFIATDFGNFHKSFI 567 (570)
Q Consensus 511 ~~~~~~~v~I~V~dvNd~~P~f~~~~~~~~~v~E~~~~~-~v~~i~a~D~D~~~~~~~ 567 (570)
++++-..+.|.|.|+|||+|+|. +..|..+|.|+++.+ .|.++.|.|+|+..|+.+
T Consensus 1037 ~~ssv~~vyI~ieDvNDn~Pq~s-~pvy~asI~enSp~~vsivq~ea~D~Dsssn~kL 1093 (4289)
T KOG1219|consen 1037 PLSSVCEVYIEIEDVNDNVPQFS-SPVYYASISENSPETVSIVQAEANDPDSSSNQKL 1093 (4289)
T ss_pred ccccceeEEEEEEecCCCCcccC-CceEeeeeccCCCCceEEEEeccCCCCcccCcce
Confidence 88899999999999999999999 888999999999999 789999999997776654
No 5
>cd00031 CA Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here
Probab=99.97 E-value=8.9e-28 Score=227.20 Aligned_cols=198 Identities=36% Similarity=0.540 Sum_probs=178.3
Q ss_pred cceeecCCCCCCeEEEEEEEEcCCCCCCCceEEEEEEecCCCCCCCceEEECcccEEEECCcCCCCCCCeEEEEEEEEeC
Q psy8639 61 SNNEIKEDVPIGFTVGNVVSTDSTGRAGGGHVMYSLTSLSPTDHSGAFDIDKSSGSLVVARQLDRETCPEYKLEVRALDT 140 (570)
Q Consensus 61 ~~~~v~E~~~~gt~v~~l~a~D~D~~~~~~~~~ysi~~~~~~~~~~~F~id~~~G~l~~~~~LD~E~~~~~~l~v~a~D~ 140 (570)
|...|.|+++.|+.++++.|.|+|.+ .++.++|+|.+ + ...++|.|++.+|.|++++.||||....|.|.|+|+|.
T Consensus 2 ~~~~i~En~~~g~~v~~~~a~D~D~~-~~~~~~y~i~~--~-~~~~~F~i~~~tG~l~~~~~lD~e~~~~~~l~v~a~D~ 77 (199)
T cd00031 2 YSVSVPENAPPGTVVGTVSATDPDSG-ENGRVTYSILG--G-NEDGLFSIDPNTGVITTTKPLDREEQSEYTLTVVASDG 77 (199)
T ss_pred eEEEEeCCCCCCCEEEEEEEECCCCC-CCceEEEEEeC--C-CCcccEEEeCCCCEEEECCCCCCcCCceEEEEEEEEEC
Confidence 66789999999999999999999987 47999999933 2 33379999999999999999999999999999999996
Q ss_pred CCCCCCceeEEEEEEEEeeCCCCCCcCCCCceEEEEeCCCCCCeEEEEeEEecCCCCCCceEEEEEEecCCCCCcEEEeC
Q psy8639 141 STSSNPQSSAVAIKIEIVDVNDNAPRWGENPLTIPVNEDTLVGSTIWNFTATDADDGSNGELRYGLVRQYPEGNHFSVDH 220 (570)
Q Consensus 141 ~g~~~~~s~~~~v~I~V~dvNd~~P~f~~~~~~~~v~E~~~~gt~v~~v~a~D~D~g~~~~~~y~l~~~~~~~~~F~Id~ 220 (570)
++ +..+....++|.|.|+||++|.|....+.+.|.|+.+.|+.++++.|+|.|.+.++.++|+|..+.. .+.|.|++
T Consensus 78 g~--~~~~~~~~v~I~V~d~Nd~~P~~~~~~~~~~v~e~~~~~~~i~~~~a~D~D~~~~~~~~y~l~~~~~-~~~f~i~~ 154 (199)
T cd00031 78 GG--PPLSSTATVTVTVLDVNDNPPVFEQSSYEASVPENAPPGTVVGTVTATDADSGENAKLTYSILSGND-KELFSIDP 154 (199)
T ss_pred Cc--CcceeEEEEEEEEccCCCCCCcccccceEEEEeCCCCCCCEEEEEEEEcCCCCCCccEEEEEeCCCC-CCEEEEeC
Confidence 44 4456899999999999999999998899999999999999999999999998888999999998754 47999999
Q ss_pred CCcEEEEcccCCccCcceEEEEEEEEECCCCCCCCceeEEEEEEEEee
Q psy8639 221 LTGTLTLLQPLDYETLTSFILVVKVTDQAANISERLSTSLTARVVVKD 268 (570)
Q Consensus 221 ~~G~i~~~~~lD~e~~~~~~l~v~a~D~~~~~~~~~~~~~~v~I~V~d 268 (570)
.+|.|.+.+.||+|....|.|.|.|+|.+.+. ++.+..++|.|.|
T Consensus 155 ~~G~i~~~~~ld~e~~~~~~l~v~a~D~~~~~---~~~~~~i~i~v~d 199 (199)
T cd00031 155 NTGIITLAKPLDREEKSSYELTVVATDGGGPP---LSSTATVTVTVLD 199 (199)
T ss_pred CceEEEeCCccCCccCceEEEEEEEEECCCCC---ceeEEEEEEEEEC
Confidence 99999999999999999999999999998632 7888888888865
No 6
>cd00031 CA Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here
Probab=99.96 E-value=2.6e-27 Score=224.01 Aligned_cols=197 Identities=38% Similarity=0.573 Sum_probs=183.9
Q ss_pred eeEEEeccCCCCeEEEEEEEEeCCCCCCeEEEEEEEeCCCCCcEEEecCCCEEEEEeecCcccccceeEEEEEEEEEECC
Q psy8639 283 SLIYVSEQTRVGSHVTKMTAVDQDSGDNGRVTYVISGGNEDARFHLDHDKGILSLARVISEYDHQSKFRAVLNITASDHG 362 (570)
Q Consensus 283 ~~~~v~E~~~~g~~v~~v~a~D~D~~~~~~i~ysi~~~~~~~~F~id~~tG~l~~~~~ld~~d~e~~~~~~l~V~a~D~g 362 (570)
+.+.+.|+++.|+.++++.|.|+|.+.+..++|+|..+....+|.|++.+|.|++.+.|| ||....|.|.|.|+|.|
T Consensus 2 ~~~~i~En~~~g~~v~~~~a~D~D~~~~~~~~y~i~~~~~~~~F~i~~~tG~l~~~~~lD---~e~~~~~~l~v~a~D~g 78 (199)
T cd00031 2 YSVSVPENAPPGTVVGTVSATDPDSGENGRVTYSILGGNEDGLFSIDPNTGVITTTKPLD---REEQSEYTLTVVASDGG 78 (199)
T ss_pred eEEEEeCCCCCCCEEEEEEEECCCCCCCceEEEEEeCCCCcccEEEeCCCCEEEECCCCC---CcCCceEEEEEEEEECC
Confidence 678999999999999999999999988889999999886668999999999999999999 99999999999999988
Q ss_pred cCCceeEEEEEEEEeeCCCCCCeecCCeeEEEEeCCCCCCcEEEEEEEecCCCCCCceEEEEEEeeccCCCceEEEEEEe
Q psy8639 363 IPPKSSSLVINILSETVSTSLPTFLSPSYHANVSEDAIPGTYIIKVVARSPSTYGNYVYTEVFLVSEDAIPGTYIIKVVA 442 (570)
Q Consensus 363 ~~~~s~~~~v~I~V~dvNd~~P~f~~~~~~~~v~E~~~~gt~v~~v~a~D~D~~~~~~~~~~~~v~~~~~~~~~~~~~~~ 442 (570)
.|.+++...++|.|.|+||++|.|....+...+.|+.++|+.++++.|+|+|.
T Consensus 79 ~~~~~~~~~v~I~V~d~Nd~~P~~~~~~~~~~v~e~~~~~~~i~~~~a~D~D~--------------------------- 131 (199)
T cd00031 79 GPPLSSTATVTVTVLDVNDNPPVFEQSSYEASVPENAPPGTVVGTVTATDADS--------------------------- 131 (199)
T ss_pred cCcceeEEEEEEEEccCCCCCCcccccceEEEEeCCCCCCCEEEEEEEEcCCC---------------------------
Confidence 88777999999999999999999998899999999999999999999999998
Q ss_pred eCCCceeEEEEEeecCCCceEEEEecCCCCCCcEEEeCCCcEEEEcccCCccccceEEEEEEEEeCCC-CCeeEEEEEEE
Q psy8639 443 RSPSTYVYHMYLISDFSGYNLTYMIPAGVADDMFKIDAETGVVTLARSLDRETVDQYNLVVHVTDSKS-GDYDTTNVFIT 521 (570)
Q Consensus 443 ~~~~~~~~~~~~~~~~~~~~~~y~i~~~~~~~~F~Id~~tG~i~~~~~lD~E~~~~~~l~V~atD~~~-~~~~~~~v~I~ 521 (570)
+.++.++|+|..+...+.|.|++.+|.|++.+.||||....|.|.|.|+|.+. +++.+..++|+
T Consensus 132 ---------------~~~~~~~y~l~~~~~~~~f~i~~~~G~i~~~~~ld~e~~~~~~l~v~a~D~~~~~~~~~~~i~i~ 196 (199)
T cd00031 132 ---------------GENAKLTYSILSGNDKELFSIDPNTGIITLAKPLDREEKSSYELTVVATDGGGPPLSSTATVTVT 196 (199)
T ss_pred ---------------CCCccEEEEEeCCCCCCEEEEeCCceEEEeCCccCCccCceEEEEEEEEECCCCCceeEEEEEEE
Confidence 77889999998876568999999999999999999999999999999999985 57888889998
Q ss_pred EEe
Q psy8639 522 ILD 524 (570)
Q Consensus 522 V~d 524 (570)
|.|
T Consensus 197 v~d 199 (199)
T cd00031 197 VLD 199 (199)
T ss_pred EEC
Confidence 875
No 7
>PF00028 Cadherin: Cadherin domain; InterPro: IPR002126 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion.; GO: 0005509 calcium ion binding, 0007156 homophilic cell adhesion, 0016020 membrane; PDB: 2A4E_A 2A4C_B 2O72_A 2QVI_A 1NCJ_A 3Q2W_A 3Q2N_A 3LNH_B 3LNI_A 3Q2L_A ....
Probab=99.59 E-value=2.3e-14 Score=117.53 Aligned_cols=92 Identities=35% Similarity=0.524 Sum_probs=87.9
Q ss_pred eeEEEeccCCCCeEEEEEEEEeCCCCCCeEEEEEEEeCCCCCcEEEecCCCEEEEEeecCcccccceeEEEEEEEEEEC-
Q psy8639 283 SLIYVSEQTRVGSHVTKMTAVDQDSGDNGRVTYVISGGNEDARFHLDHDKGILSLARVISEYDHQSKFRAVLNITASDH- 361 (570)
Q Consensus 283 ~~~~v~E~~~~g~~v~~v~a~D~D~~~~~~i~ysi~~~~~~~~F~id~~tG~l~~~~~ld~~d~e~~~~~~l~V~a~D~- 361 (570)
|.+.|.|+++.|+.++++.|.|+|.+.++.+.|+|..++...+|.|++.+|.|++.+.|| ||....|.|.|.|+|.
T Consensus 1 Y~~~v~E~~~~g~~v~~v~a~D~D~~~n~~i~y~i~~~~~~~~F~I~~~tg~i~~~~~LD---~E~~~~y~l~v~a~D~~ 77 (93)
T PF00028_consen 1 YSFSVPENAPPGTVVGQVTATDPDSGPNSQITYSILGGNPDGLFSIDPNTGEISLKKPLD---RETQSSYQLTVRATDSG 77 (93)
T ss_dssp EEEEEETTGSTSSEEEEEEEEESSTSTTSSEEEEEEETTSTTSEEEETTTTEEEESSSSC---TTTTSEEEEEEEEEETT
T ss_pred CEEEEECCCCCCCEEEEEEEEeCCCCCCceEEEEEecCcccCceEEeeeeeccccceecC---cccCCEEEEEEEEEECC
Confidence 578999999999999999999999999999999999997799999999999999999999 9999999999999999
Q ss_pred CcCCceeEEEEEEEEe
Q psy8639 362 GIPPKSSSLVINILSE 377 (570)
Q Consensus 362 g~~~~s~~~~v~I~V~ 377 (570)
+.|++++.++|.|.|+
T Consensus 78 ~~~~~~~~~~V~I~V~ 93 (93)
T PF00028_consen 78 GSPPLSSTATVTINVL 93 (93)
T ss_dssp TSSEEEEEEEEEEEEE
T ss_pred CCCCCEEEEEEEEEEC
Confidence 8999999999999884
No 8
>PF00028 Cadherin: Cadherin domain; InterPro: IPR002126 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion.; GO: 0005509 calcium ion binding, 0007156 homophilic cell adhesion, 0016020 membrane; PDB: 2A4E_A 2A4C_B 2O72_A 2QVI_A 1NCJ_A 3Q2W_A 3Q2N_A 3LNH_B 3LNI_A 3Q2L_A ....
Probab=99.59 E-value=2.9e-14 Score=116.95 Aligned_cols=91 Identities=41% Similarity=0.580 Sum_probs=84.9
Q ss_pred eEEEEeCCCCCCcEEEEEEEecCCCCCCceEEEEEEeeccCCCceEEEEEEeeCCCceeEEEEEeecCCCceEEEEecCC
Q psy8639 391 YHANVSEDAIPGTYIIKVVARSPSTYGNYVYTEVFLVSEDAIPGTYIIKVVARSPSTYVYHMYLISDFSGYNLTYMIPAG 470 (570)
Q Consensus 391 ~~~~v~E~~~~gt~v~~v~a~D~D~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~i~~~ 470 (570)
|++.|+|++++|+.++++.|.|+|. +.++.+.|+|..+
T Consensus 1 Y~~~v~E~~~~g~~v~~v~a~D~D~------------------------------------------~~n~~i~y~i~~~ 38 (93)
T PF00028_consen 1 YSFSVPENAPPGTVVGQVTATDPDS------------------------------------------GPNSQITYSILGG 38 (93)
T ss_dssp EEEEEETTGSTSSEEEEEEEEESST------------------------------------------STTSSEEEEEEET
T ss_pred CEEEEECCCCCCCEEEEEEEEeCCC------------------------------------------CCCceEEEEEecC
Confidence 6889999999999999999999998 8899999999998
Q ss_pred CCCCcEEEeCCCcEEEEcccCCccccceEEEEEEEEeC-CC-CCeeEEEEEEEEE
Q psy8639 471 VADDMFKIDAETGVVTLARSLDRETVDQYNLVVHVTDS-KS-GDYDTTNVFITIL 523 (570)
Q Consensus 471 ~~~~~F~Id~~tG~i~~~~~lD~E~~~~~~l~V~atD~-~~-~~~~~~~v~I~V~ 523 (570)
...+.|+|++.+|.|++.+.||||....|.|.|.|+|. +. ++++.++|+|+|+
T Consensus 39 ~~~~~F~I~~~tg~i~~~~~LD~E~~~~y~l~v~a~D~~~~~~~~~~~~V~I~V~ 93 (93)
T PF00028_consen 39 NPDGLFSIDPNTGEISLKKPLDRETQSSYQLTVRATDSGGSPPLSSTATVTINVL 93 (93)
T ss_dssp TSTTSEEEETTTTEEEESSSSCTTTTSEEEEEEEEEETTTSSEEEEEEEEEEEEE
T ss_pred cccCceEEeeeeeccccceecCcccCCEEEEEEEEEECCCCCCCEEEEEEEEEEC
Confidence 87899999999999999999999999999999999999 44 5778888888885
No 9
>KOG1834|consensus
Probab=99.58 E-value=1.1e-13 Score=141.13 Aligned_cols=204 Identities=21% Similarity=0.256 Sum_probs=154.8
Q ss_pred ccccccceeecCCCCCCeEEEEEEEEcCCCCC--CCceEEEEEEecCCCCCCCceEEECccc--EEEECCcCCCCCCCeE
Q psy8639 56 FKKYLSNNEIKEDVPIGFTVGNVVSTDSTGRA--GGGHVMYSLTSLSPTDHSGAFDIDKSSG--SLVVARQLDRETCPEY 131 (570)
Q Consensus 56 f~~~~~~~~v~E~~~~gt~v~~l~a~D~D~~~--~~~~~~ysi~~~~~~~~~~~F~id~~~G--~l~~~~~LD~E~~~~~ 131 (570)
|....|.+-|.||-..-.....+.|.|.|.+- .+..--|.|.+ +.-+-..--+|..+| .|+.+.+||-|.+..|
T Consensus 32 wie~ey~gvV~Endntvll~Ppl~aLdkdaplr~ageiC~fklhg--q~vPFdavVvdK~TGegvlRaK~~lDCelqkey 109 (952)
T KOG1834|consen 32 WIEEEYHGVVTENDNTVLLDPPLAALDKDAPLRYAGEICGFKLHG--QPVPFDAVVVDKYTGEGVLRAKEPLDCELQKEY 109 (952)
T ss_pred ccccceeEEEEeCCceEEeCCCeeeecCCCCcccccccceeEecC--CCCCceEEEEeccCCceEEeecCcccccccccc
Confidence 56677899999997654555558888988641 22334566622 211222233465554 7888899999999999
Q ss_pred EEEEEEEeCCCCC----CCceeEEEEEEEEeeCCCCCCcCCCCceEEEEeCCCCCCeEEEEeEEecCCCC-CCce-EEEE
Q psy8639 132 KLEVRALDTSTSS----NPQSSAVAIKIEIVDVNDNAPRWGENPLTIPVNEDTLVGSTIWNFTATDADDG-SNGE-LRYG 205 (570)
Q Consensus 132 ~l~v~a~D~~g~~----~~~s~~~~v~I~V~dvNd~~P~f~~~~~~~~v~E~~~~gt~v~~v~a~D~D~g-~~~~-~~y~ 205 (570)
+|+|+|+|.+.++ -..|-.++|.|.|.|+|+.+|+|..+.|.+.|.|+. .-..|+++.|.|.|-+ .+++ ..|.
T Consensus 110 tf~iQAydCg~gpdgtn~kKShkatvhIrVkDvNe~AP~f~ep~Yka~V~EGK-~yd~il~veAiD~DCspq~sqIC~YE 188 (952)
T KOG1834|consen 110 TFTIQAYDCGNGPDGTNTKKSHKATVHIRVKDVNEFAPVFKEPWYKAHVTEGK-VYDSILRVEAIDKDCSPQYSQICEYE 188 (952)
T ss_pred eEEEEEEecCCCCCccccccccceEEEEEeccccccCchhcccceeeEEecce-eeeeeEEEEeecCCCCCcccceeEEE
Confidence 9999999976532 145667889999999999999999999999999996 5678999999999944 4444 5788
Q ss_pred EEecCCCCCcEEEeCCCcEEEEcccCCccCcceEEEEEEEEECCCCCCCCceeEEEEEEEEeeC
Q psy8639 206 LVRQYPEGNHFSVDHLTGTLTLLQPLDYETLTSFILVVKVTDQAANISERLSTSLTARVVVKDL 269 (570)
Q Consensus 206 l~~~~~~~~~F~Id~~~G~i~~~~~lD~e~~~~~~l~v~a~D~~~~~~~~~~~~~~v~I~V~dv 269 (570)
|... .-.|.||. .|.|+.+.+|.|....+|.|+|.|.|.+-.. ......|+|.|.+.
T Consensus 189 I~t~---d~PFaIdn-~G~irnTekLny~ke~~Y~ltVtAyDCg~kr---aa~d~lV~v~Vkp~ 245 (952)
T KOG1834|consen 189 ITTP---DVPFAIDN-DGNIRNTEKLNYTKEHQYKLTVTAYDCGKKR---AASDSLVTVHVKPT 245 (952)
T ss_pred ecCC---CCceEEcC-CCccccccccccccceeEEEEEEEEeccccc---ccCcceEEEEecCc
Confidence 8873 46899986 8999999999999999999999999998632 33346777777653
No 10
>KOG1834|consensus
Probab=99.54 E-value=6.7e-13 Score=135.63 Aligned_cols=221 Identities=23% Similarity=0.255 Sum_probs=170.4
Q ss_pred EEEeeCCCCCCcCCCCceEEEEeCCCCCCeEEEEeEEecCCCCC---CceEEEEEEecCCCCCcEEEeCCC--cEEEEcc
Q psy8639 155 IEIVDVNDNAPRWGENPLTIPVNEDTLVGSTIWNFTATDADDGS---NGELRYGLVRQYPEGNHFSVDHLT--GTLTLLQ 229 (570)
Q Consensus 155 I~V~dvNd~~P~f~~~~~~~~v~E~~~~gt~v~~v~a~D~D~g~---~~~~~y~l~~~~~~~~~F~Id~~~--G~i~~~~ 229 (570)
...-.+|-+.|.. ...|...|.||-..-.....+.|.|.|... ..-..|.|.++.-+-....+|..+ |.|+.+.
T Consensus 21 ~~aarankhkpwi-e~ey~gvV~Endntvll~Ppl~aLdkdaplr~ageiC~fklhgq~vPFdavVvdK~TGegvlRaK~ 99 (952)
T KOG1834|consen 21 HHAARANKHKPWI-EEEYHGVVTENDNTVLLDPPLAALDKDAPLRYAGEICGFKLHGQPVPFDAVVVDKYTGEGVLRAKE 99 (952)
T ss_pred cccccccccCccc-ccceeEEEEeCCceEEeCCCeeeecCCCCcccccccceeEecCCCCCceEEEEeccCCceEEeecC
Confidence 3445667777765 456889999996544444568889988521 124567787754333444557766 5688889
Q ss_pred cCCccCcceEEEEEEEEECCCCCCC---CceeEEEEEEEEeeCCCCCCeeecCCCCeeEEEeccCCCCeEEEEEEEEeCC
Q psy8639 230 PLDYETLTSFILVVKVTDQAANISE---RLSTSLTARVVVKDLNDNAPVFVSPTRGSLIYVSEQTRVGSHVTKMTAVDQD 306 (570)
Q Consensus 230 ~lD~e~~~~~~l~v~a~D~~~~~~~---~~~~~~~v~I~V~dvNd~~P~~~~~~~~~~~~v~E~~~~g~~v~~v~a~D~D 306 (570)
+||.|.+..|+|+|+|.|.+..+.+ +.+..++|.|.|.|+|+.+|+|..+. |.+.|.|+.. -..|+++.|.|.|
T Consensus 100 ~lDCelqkeytf~iQAydCg~gpdgtn~kKShkatvhIrVkDvNe~AP~f~ep~--Yka~V~EGK~-yd~il~veAiD~D 176 (952)
T KOG1834|consen 100 PLDCELQKEYTFTIQAYDCGNGPDGTNTKKSHKATVHIRVKDVNEFAPVFKEPW--YKAHVTEGKV-YDSILRVEAIDKD 176 (952)
T ss_pred cccccccccceEEEEEEecCCCCCccccccccceEEEEEeccccccCchhcccc--eeeEEeccee-eeeeEEEEeecCC
Confidence 9999999999999999998865433 35778999999999999999999998 8899999875 5678899999999
Q ss_pred CCC-Ce-EEEEEEEeCCCCCcEEEecCCCEEEEEeecCcccccceeEEEEEEEEEECCcCCceeEEEEEEEEeeCCCCCC
Q psy8639 307 SGD-NG-RVTYVISGGNEDARFHLDHDKGILSLARVISEYDHQSKFRAVLNITASDHGIPPKSSSLVINILSETVSTSLP 384 (570)
Q Consensus 307 ~~~-~~-~i~ysi~~~~~~~~F~id~~tG~l~~~~~ld~~d~e~~~~~~l~V~a~D~g~~~~s~~~~v~I~V~dvNd~~P 384 (570)
-++ ++ .+.|.|.. +.-.|.||.+ |.|+.+.+|. |.....|.|+|+|.|.|.-.....+-|+|.|... ..|
T Consensus 177 Cspq~sqIC~YEI~t--~d~PFaIdn~-G~irnTekLn---y~ke~~Y~ltVtAyDCg~kraa~d~lV~v~Vkp~--C~p 248 (952)
T KOG1834|consen 177 CSPQYSQICEYEITT--PDVPFAIDND-GNIRNTEKLN---YTKEHQYKLTVTAYDCGKKRAASDSLVTVHVKPT--CKP 248 (952)
T ss_pred CCCcccceeEEEecC--CCCceEEcCC-Cccccccccc---cccceeEEEEEEEEecccccccCcceEEEEecCc--ccc
Confidence 653 44 55777875 6778999976 9999999999 9999999999999998876555557788888775 455
Q ss_pred eec
Q psy8639 385 TFL 387 (570)
Q Consensus 385 ~f~ 387 (570)
.+.
T Consensus 249 gwq 251 (952)
T KOG1834|consen 249 GWQ 251 (952)
T ss_pred chh
Confidence 443
No 11
>smart00112 CA Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
Probab=99.47 E-value=4.3e-13 Score=106.32 Aligned_cols=79 Identities=34% Similarity=0.590 Sum_probs=73.9
Q ss_pred EeCCCCCCeEEEEEEEeCCCCCcEEEecCCCEEEEEeecCcccccceeEEEEEEEEEECCcCCceeEEEEEEEEeeCCCC
Q psy8639 303 VDQDSGDNGRVTYVISGGNEDARFHLDHDKGILSLARVISEYDHQSKFRAVLNITASDHGIPPKSSSLVINILSETVSTS 382 (570)
Q Consensus 303 ~D~D~~~~~~i~ysi~~~~~~~~F~id~~tG~l~~~~~ld~~d~e~~~~~~l~V~a~D~g~~~~s~~~~v~I~V~dvNd~ 382 (570)
+|+|.|.++.++|+|..+....+|.|++.+|.|.+.+.|| ||....|.|.|.|+|.|.|.+++.++|.|.|.|+||+
T Consensus 1 ~D~D~g~n~~i~Y~i~~~~~~~~F~i~~~tg~i~~~~~LD---~e~~~~y~l~v~a~D~~~~~~~~~~~v~I~V~D~Nd~ 77 (79)
T smart00112 1 TDADSGENGKVTYSILSGNEDGLFSIDPETGEITTTKPLD---REEQPEYTLTVEATDGGGPPLSSTATVTVTVLDVNDN 77 (79)
T ss_pred CCCCCCcCcEEEEEEecCCCCCEEEEeCCccEEEeCCccC---eeCCCeEEEEEEEEECCCCCcccEEEEEEEEEECCCC
Confidence 4889888999999999876669999999999999999999 9999999999999999998899999999999999999
Q ss_pred CC
Q psy8639 383 LP 384 (570)
Q Consensus 383 ~P 384 (570)
+|
T Consensus 78 ~P 79 (79)
T smart00112 78 AP 79 (79)
T ss_pred CC
Confidence 98
No 12
>smart00112 CA Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
Probab=99.46 E-value=5.5e-13 Score=105.72 Aligned_cols=73 Identities=44% Similarity=0.608 Sum_probs=67.8
Q ss_pred CCCceEEEEecCCCCCCcEEEeCCCcEEEEcccCCccccceEEEEEEEEeCCC-CCeeEEEEEEEEEecCCCCc
Q psy8639 458 FSGYNLTYMIPAGVADDMFKIDAETGVVTLARSLDRETVDQYNLVVHVTDSKS-GDYDTTNVFITILDVNDNAP 530 (570)
Q Consensus 458 ~~~~~~~y~i~~~~~~~~F~Id~~tG~i~~~~~lD~E~~~~~~l~V~atD~~~-~~~~~~~v~I~V~dvNd~~P 530 (570)
+.++.+.|+|..+.....|.|++.+|.|++.++||||....|.|.|+|+|.+. ++++.++|+|+|.|+|||+|
T Consensus 6 g~n~~i~Y~i~~~~~~~~F~i~~~tg~i~~~~~LD~e~~~~y~l~v~a~D~~~~~~~~~~~v~I~V~D~Nd~~P 79 (79)
T smart00112 6 GENGKVTYSILSGNEDGLFSIDPETGEITTTKPLDREEQPEYTLTVEATDGGGPPLSSTATVTVTVLDVNDNAP 79 (79)
T ss_pred CcCcEEEEEEecCCCCCEEEEeCCccEEEeCCccCeeCCCeEEEEEEEEECCCCCcccEEEEEEEEEECCCCCC
Confidence 67889999998876668999999999999999999999999999999999986 58899999999999999998
No 13
>PF08758 Cadherin_pro: Cadherin prodomain like; InterPro: IPR014868 Cadherins are a group of proteins that mediate calcium dependent cell-cell adhesion. They are activated through cleavage of a prosequence in the late Golgi. This protein corresponds to the folded region of the prosequence, and is termed the prodomain. The prodomain shows structural resemblance to the cadherin domain, but lacks all the features known to be important for cadherin-cadherin interactions []. ; GO: 0007155 cell adhesion, 0016021 integral to membrane; PDB: 1OP4_A.
Probab=97.35 E-value=0.0016 Score=52.18 Aligned_cols=78 Identities=23% Similarity=0.333 Sum_probs=43.4
Q ss_pred cccccccccceeecCCCCCCeEEEEEEEEcCCCCCCCceEEEEEEecCCCCCCCceEEECcccEEEECCcCCCCCCCeEE
Q psy8639 53 RCKFKKYLSNNEIKEDVPIGFTVGNVVSTDSTGRAGGGHVMYSLTSLSPTDHSGAFDIDKSSGSLVVARQLDRETCPEYK 132 (570)
Q Consensus 53 ~p~f~~~~~~~~v~E~~~~gt~v~~l~a~D~D~~~~~~~~~ysi~~~~~~~~~~~F~id~~~G~l~~~~~LD~E~~~~~~ 132 (570)
+|-|....|.+.|+.+...|..++++.-.|-. ....+.|.. + ...|.|. .+|.|++++++..... .-+
T Consensus 3 ~pGF~~~~~~~~Vp~~l~~g~~lg~V~f~dC~---~~~~~~~~s------s-DpdF~V~-~DGsVy~~r~v~l~~~-~~~ 70 (90)
T PF08758_consen 3 RPGFSQKKYTFEVPSNLEAGQPLGKVNFEDCT---GRRRVIFES------S-DPDFRVL-EDGSVYAKRPVQLSSE-QRS 70 (90)
T ss_dssp --B--S-EEEE----SS-SS--EEE---B--S---S---EEEE----------SEEEEE-TTTEEEEES--S-SSS--EE
T ss_pred cCCcccceEEEEcCchhhCCcEEEEEEeccCC---CCCceEEec------C-CCCEEEc-CCCeEEEeeeEecCCC-ceE
Confidence 57899999999999999999999999999874 356788876 2 3489999 6999999998876543 357
Q ss_pred EEEEEEeCCC
Q psy8639 133 LEVRALDTST 142 (570)
Q Consensus 133 l~v~a~D~~g 142 (570)
|.|.|.|..+
T Consensus 71 F~V~a~D~~~ 80 (90)
T PF08758_consen 71 FTVHAWDSQT 80 (90)
T ss_dssp EEEEEEETTT
T ss_pred EEEEEECCCC
Confidence 9999999765
No 14
>TIGR01965 VCBS_repeat VCBS repeat. This domain of about 100 residues is found multiple (up to 35) copies in long proteins from several species of Vibrio, Colwellia, Bradyrhizobium, and Shewanella (hence the name VCBS) and in smaller copy numbers in proteins from several other bacteria. The large protein size and repeat copy numbers, species distribution, and suggested activities of several member proteins suggests a role for this domain in adhesion.
Probab=97.24 E-value=0.004 Score=50.46 Aligned_cols=89 Identities=25% Similarity=0.235 Sum_probs=62.5
Q ss_pred EEeEEecCCCCCCceEEEEEEecCCCCCcEEEeCCCcEEEEc--------ccCCccCcceEEEEEEEEECCCCCCCCcee
Q psy8639 187 WNFTATDADDGSNGELRYGLVRQYPEGNHFSVDHLTGTLTLL--------QPLDYETLTSFILVVKVTDQAANISERLST 258 (570)
Q Consensus 187 ~~v~a~D~D~g~~~~~~y~l~~~~~~~~~F~Id~~~G~i~~~--------~~lD~e~~~~~~l~v~a~D~~~~~~~~~~~ 258 (570)
++|.++|+|.++ ...+++.......+.|.|++ .|..... +.|...+.....|++.+.|+.
T Consensus 2 G~Lt~sD~D~gd--~~~~s~~~~~g~yGtlti~~-~G~wtYtl~n~~~avq~L~~Ge~~tdsFtvtv~DGt--------- 69 (99)
T TIGR01965 2 GQLTISDADAGQ--AHFIAQTDAAGQYGTFSIDA-DGQWTYQADNSQTAVQALKAGETLTDTFTVTSADGT--------- 69 (99)
T ss_pred CceEEeCCCCCC--ceEEecccccCCcEEEEECC-CCcEEEEeCCCcHHHHhhcCCCEEEEEEEEEEeCCC---------
Confidence 468899999765 46666654444467899987 6765543 234434445678888899852
Q ss_pred EEEEEEEEeeCCCCCCeeecCCCCeeEEEeccC
Q psy8639 259 SLTARVVVKDLNDNAPVFVSPTRGSLIYVSEQT 291 (570)
Q Consensus 259 ~~~v~I~V~dvNd~~P~~~~~~~~~~~~v~E~~ 291 (570)
+.+|.|+|...|| +|++... ....+.|+.
T Consensus 70 t~~vtItI~GtND-apvi~~~---~~g~v~ED~ 98 (99)
T TIGR01965 70 SQTVTITITGAND-AAVIGGA---DTGSVTEDS 98 (99)
T ss_pred eEEEEEEEEccCC-CCEEecc---cceeEecCC
Confidence 6889999999999 8988765 356777754
No 15
>smart00736 CADG Dystroglycan-type cadherin-like domains. Cadherin-homologous domains present in metazoan dystroglycans and alpha/epsilon sarcoglycans, yeast Axl2p and in a very large protein from magnetotactic bacteria. Likely to bind calcium ions.
Probab=96.93 E-value=0.017 Score=47.36 Aligned_cols=88 Identities=24% Similarity=0.328 Sum_probs=61.7
Q ss_pred ecCCCCCCeEEEEEEEEcCCCCCCCceEEEEEEecCCCCCCCceEEECcccEEEECCcCCCCCCCeEEEEEEEEeCCCCC
Q psy8639 65 IKEDVPIGFTVGNVVSTDSTGRAGGGHVMYSLTSLSPTDHSGAFDIDKSSGSLVVARQLDRETCPEYKLEVRALDTSTSS 144 (570)
Q Consensus 65 v~E~~~~gt~v~~l~a~D~D~~~~~~~~~ysi~~~~~~~~~~~F~id~~~G~l~~~~~LD~E~~~~~~l~v~a~D~~g~~ 144 (570)
+.++...-..+..-...|+| +..++|++....+...+.+...++.++.+.-. +... ....|.|+|+|+|..|
T Consensus 9 a~~g~~f~~~i~~~tF~d~d----~~~lty~~~~~~~~~lP~Wl~fd~~~~~~~Gt-P~~~-~~g~~~i~v~a~D~~g-- 80 (97)
T smart00736 9 ATEGESFSYTIPSSTFTDAD----GDTLTYSATLSDGSALPSWLSFDSDTGTLSGT-PTNS-DVGSLSLKVTATDSSG-- 80 (97)
T ss_pred EeCCCeEEEEcChhceECCC----CCeEEEEEEeCCCCCCCCeEEEeCCCCEEEEE-CCCC-CCcEEEEEEEEEECCC--
Confidence 34444333334334456776 46899999654443456799999988888774 3332 2467999999999755
Q ss_pred CCceeEEEEEEEEeeCCC
Q psy8639 145 NPQSSAVAIKIEIVDVND 162 (570)
Q Consensus 145 ~~~s~~~~v~I~V~dvNd 162 (570)
.++...++|.|.+.|+
T Consensus 81 --~~~~~~f~i~V~~~~~ 96 (97)
T smart00736 81 --ASASDTFTITVVNTND 96 (97)
T ss_pred --CEEEEEEEEEEeCCCC
Confidence 6788899999999887
No 16
>PF08758 Cadherin_pro: Cadherin prodomain like; InterPro: IPR014868 Cadherins are a group of proteins that mediate calcium dependent cell-cell adhesion. They are activated through cleavage of a prosequence in the late Golgi. This protein corresponds to the folded region of the prosequence, and is termed the prodomain. The prodomain shows structural resemblance to the cadherin domain, but lacks all the features known to be important for cadherin-cadherin interactions []. ; GO: 0007155 cell adhesion, 0016021 integral to membrane; PDB: 1OP4_A.
Probab=96.61 E-value=0.025 Score=45.34 Aligned_cols=79 Identities=22% Similarity=0.269 Sum_probs=43.6
Q ss_pred CCCeecCCeeEEEEeCCCCCCcEEEEEEEecCCCCCCceEEEEEEeeccCCCceEEEEEEeeCCCceeEEEEEeecCCCc
Q psy8639 382 SLPTFLSPSYHANVSEDAIPGTYIIKVVARSPSTYGNYVYTEVFLVSEDAIPGTYIIKVVARSPSTYVYHMYLISDFSGY 461 (570)
Q Consensus 382 ~~P~f~~~~~~~~v~E~~~~gt~v~~v~a~D~D~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 461 (570)
..|-|....|.+.|+.+...|..+++|.-.|... ..
T Consensus 2 C~pGF~~~~~~~~Vp~~l~~g~~lg~V~f~dC~~--------------------------------------------~~ 37 (90)
T PF08758_consen 2 CRPGFSQKKYTFEVPSNLEAGQPLGKVNFEDCTG--------------------------------------------RR 37 (90)
T ss_dssp ---B--S-EEEE----SS-SS--EEE---B--SS----------------------------------------------
T ss_pred CcCCcccceEEEEcCchhhCCcEEEEEEeccCCC--------------------------------------------CC
Confidence 3588999999999999999999999999998865 34
Q ss_pred eEEEEecCCCCCCcEEEeCCCcEEEEcccCCccccceEEEEEEEEeCCC
Q psy8639 462 NLTYMIPAGVADDMFKIDAETGVVTLARSLDRETVDQYNLVVHVTDSKS 510 (570)
Q Consensus 462 ~~~y~i~~~~~~~~F~Id~~tG~i~~~~~lD~E~~~~~~l~V~atD~~~ 510 (570)
.+.|.. .+..|+|.+ .|.|++.+++... .+...|.|.|.|..+
T Consensus 38 ~~~~~s----sDpdF~V~~-DGsVy~~r~v~l~-~~~~~F~V~a~D~~~ 80 (90)
T PF08758_consen 38 RVIFES----SDPDFRVLE-DGSVYAKRPVQLS-SEQRSFTVHAWDSQT 80 (90)
T ss_dssp -EEEE-------SEEEEET-TTEEEEES--S-S-SS-EEEEEEEEETTT
T ss_pred ceEEec----CCCCEEEcC-CCeEEEeeeEecC-CCceEEEEEEECCCC
Confidence 677764 345999988 7999999988754 444689999999985
No 17
>TIGR01965 VCBS_repeat VCBS repeat. This domain of about 100 residues is found multiple (up to 35) copies in long proteins from several species of Vibrio, Colwellia, Bradyrhizobium, and Shewanella (hence the name VCBS) and in smaller copy numbers in proteins from several other bacteria. The large protein size and repeat copy numbers, species distribution, and suggested activities of several member proteins suggests a role for this domain in adhesion.
Probab=96.32 E-value=0.039 Score=44.82 Aligned_cols=89 Identities=24% Similarity=0.309 Sum_probs=60.5
Q ss_pred EEEEEEcCCCCCCCceEEEEEEecCCCCCCCceEEECcccEEEEC--------CcCCCCCCCeEEEEEEEEeCCCCCCCc
Q psy8639 76 GNVVSTDSTGRAGGGHVMYSLTSLSPTDHSGAFDIDKSSGSLVVA--------RQLDRETCPEYKLEVRALDTSTSSNPQ 147 (570)
Q Consensus 76 ~~l~a~D~D~~~~~~~~~ysi~~~~~~~~~~~F~id~~~G~l~~~--------~~LD~E~~~~~~l~v~a~D~~g~~~~~ 147 (570)
++|.++|+|.+ ....+++... ...-+.|.|++ +|..... +.|..-+.-.-+|++++.|+
T Consensus 2 G~Lt~sD~D~g---d~~~~s~~~~--~g~yGtlti~~-~G~wtYtl~n~~~avq~L~~Ge~~tdsFtvtv~DG------- 68 (99)
T TIGR01965 2 GQLTISDADAG---QAHFIAQTDA--AGQYGTFSIDA-DGQWTYQADNSQTAVQALKAGETLTDTFTVTSADG------- 68 (99)
T ss_pred CceEEeCCCCC---CceEEecccc--cCCcEEEEECC-CCcEEEEeCCCcHHHHhhcCCCEEEEEEEEEEeCC-------
Confidence 57899999954 6677777332 23467899985 7866432 23433344467788888883
Q ss_pred eeEEEEEEEEeeCCCCCCcCCCCceEEEEeCCC
Q psy8639 148 SSAVAIKIEIVDVNDNAPRWGENPLTIPVNEDT 180 (570)
Q Consensus 148 s~~~~v~I~V~dvNd~~P~f~~~~~~~~v~E~~ 180 (570)
.+..|+|+|.-.|| +|+.... ....+.|+.
T Consensus 69 -tt~~vtItI~GtND-apvi~~~-~~g~v~ED~ 98 (99)
T TIGR01965 69 -TSQTVTITITGAND-AAVIGGA-DTGSVTEDS 98 (99)
T ss_pred -CeEEEEEEEEccCC-CCEEecc-cceeEecCC
Confidence 27889999999999 8876543 235676653
No 18
>PF08266 Cadherin_2: Cadherin-like; InterPro: IPR013164 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion. This entry represents a cadherin domain that is usually found at the N terminus of cadherin proteins.; PDB: 1WUZ_A 1WYJ_A.
Probab=96.17 E-value=0.0042 Score=49.09 Aligned_cols=61 Identities=25% Similarity=0.451 Sum_probs=36.8
Q ss_pred eeecCCCCCCeEEEEEEEEcCCCCC-CCceEEEEEEecCCCCCCCceEEECcccEEEECCcCCCCC
Q psy8639 63 NEIKEDVPIGFTVGNVVSTDSTGRA-GGGHVMYSLTSLSPTDHSGAFDIDKSSGSLVVARQLDRET 127 (570)
Q Consensus 63 ~~v~E~~~~gt~v~~l~a~D~D~~~-~~~~~~ysi~~~~~~~~~~~F~id~~~G~l~~~~~LD~E~ 127 (570)
.+|+|..+.|+.||.+. .|.-... .-..-.|.+. .. ....+|.++..+|.|+++..+|||.
T Consensus 5 YsV~EE~~~Gt~IGnia-~dL~l~~~~l~~~~~ri~--s~-~~~~~~~v~~~tG~L~v~~rIDRE~ 66 (84)
T PF08266_consen 5 YSVPEEMPPGTVIGNIA-KDLGLDPQSLSSRNFRIV--SE-GNSQYFRVNEKTGDLFVSERIDREE 66 (84)
T ss_dssp EEEESS--TT-EEEECC-CCCT--HHHHCCTTBEEE---S-SSS-SEEE-TTTSEEEESS--SCCC
T ss_pred EEeecCCCCCCEEEEhH-HhhCCCcccccccceEEe--ec-CCcceeEecCCceeEEeCCccCHHH
Confidence 47999999999999984 3432110 0012246662 22 4578999999999999999999998
No 19
>smart00736 CADG Dystroglycan-type cadherin-like domains. Cadherin-homologous domains present in metazoan dystroglycans and alpha/epsilon sarcoglycans, yeast Axl2p and in a very large protein from magnetotactic bacteria. Likely to bind calcium ions.
Probab=96.10 E-value=0.091 Score=42.94 Aligned_cols=72 Identities=35% Similarity=0.333 Sum_probs=53.9
Q ss_pred EEecCCCCCCceEEEEEEecC--CCCCcEEEeCCCcEEEEcccCCccCcceEEEEEEEEECCCCCCCCceeEEEEEEEEe
Q psy8639 190 TATDADDGSNGELRYGLVRQY--PEGNHFSVDHLTGTLTLLQPLDYETLTSFILVVKVTDQAANISERLSTSLTARVVVK 267 (570)
Q Consensus 190 ~a~D~D~g~~~~~~y~l~~~~--~~~~~F~Id~~~G~i~~~~~lD~e~~~~~~l~v~a~D~~~~~~~~~~~~~~v~I~V~ 267 (570)
...|.| ...++|++.... +..+|..+++.++.+.-. +... ....|.|+|.|+|+.+ .+....++|.|.
T Consensus 23 tF~d~d---~~~lty~~~~~~~~~lP~Wl~fd~~~~~~~Gt-P~~~-~~g~~~i~v~a~D~~g-----~~~~~~f~i~V~ 92 (97)
T smart00736 23 TFTDAD---GDTLTYSATLSDGSALPSWLSFDSDTGTLSGT-PTNS-DVGSLSLKVTATDSSG-----ASASDTFTITVV 92 (97)
T ss_pred ceECCC---CCeEEEEEEeCCCCCCCCeEEEeCCCCEEEEE-CCCC-CCcEEEEEEEEEECCC-----CEEEEEEEEEEe
Confidence 346666 337888886532 335799999988887763 4333 3467999999999987 678889999999
Q ss_pred eCCC
Q psy8639 268 DLND 271 (570)
Q Consensus 268 dvNd 271 (570)
+.|+
T Consensus 93 ~~~~ 96 (97)
T smart00736 93 NTND 96 (97)
T ss_pred CCCC
Confidence 9886
No 20
>PF08266 Cadherin_2: Cadherin-like; InterPro: IPR013164 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion. This entry represents a cadherin domain that is usually found at the N terminus of cadherin proteins.; PDB: 1WUZ_A 1WYJ_A.
Probab=96.07 E-value=0.0058 Score=48.28 Aligned_cols=64 Identities=25% Similarity=0.375 Sum_probs=38.3
Q ss_pred eEEEEeCCCCCCcEEEEEEEecCCCCCCceEEEEEEeeccCCCceEEEEEEeeCCCceeEEEEEeecCCCceEEEEecCC
Q psy8639 391 YHANVSEDAIPGTYIIKVVARSPSTYGNYVYTEVFLVSEDAIPGTYIIKVVARSPSTYVYHMYLISDFSGYNLTYMIPAG 470 (570)
Q Consensus 391 ~~~~v~E~~~~gt~v~~v~a~D~D~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~i~~~ 470 (570)
...+|.|..++|+.|+.| |.|.-. ++ ..-....|.+.+.
T Consensus 3 i~YsV~EE~~~Gt~IGni-a~dL~l----------------------------~~------------~~l~~~~~ri~s~ 41 (84)
T PF08266_consen 3 IRYSVPEEMPPGTVIGNI-AKDLGL----------------------------DP------------QSLSSRNFRIVSE 41 (84)
T ss_dssp EEEEEESS--TT-EEEEC-CCCCT------------------------------H------------HHHCCTTBEEE-S
T ss_pred eEEEeecCCCCCCEEEEh-HHhhCC----------------------------Cc------------ccccccceEEeec
Confidence 467899999999999987 444322 00 0001224555555
Q ss_pred CCCCcEEEeCCCcEEEEcccCCccc
Q psy8639 471 VADDMFKIDAETGVVTLARSLDRET 495 (570)
Q Consensus 471 ~~~~~F~Id~~tG~i~~~~~lD~E~ 495 (570)
.....|.++..+|.|.++..+|||.
T Consensus 42 ~~~~~~~v~~~tG~L~v~~rIDRE~ 66 (84)
T PF08266_consen 42 GNSQYFRVNEKTGDLFVSERIDREE 66 (84)
T ss_dssp SSS-SEEE-TTTSEEEESS--SCCC
T ss_pred CCcceeEecCCceeEEeCCccCHHH
Confidence 5678999999999999999999996
No 21
>PF13750 Big_3_3: Bacterial Ig-like domain (group 3)
Probab=95.81 E-value=1.3 Score=39.65 Aligned_cols=32 Identities=31% Similarity=0.480 Sum_probs=26.7
Q ss_pred cccceEEEEEEEEeCCCCCeeEEEEEEEEEecC
Q psy8639 494 ETVDQYNLVVHVTDSKSGDYDTTNVFITILDVN 526 (570)
Q Consensus 494 E~~~~~~l~V~atD~~~~~~~~~~v~I~V~dvN 526 (570)
|....|+|+|.|+|.. |...++++.+...+.|
T Consensus 120 e~~~~YtLtV~a~D~a-GN~~~~si~F~y~P~~ 151 (158)
T PF13750_consen 120 EADDSYTLTVSATDKA-GNQSTKSISFSYMPPN 151 (158)
T ss_pred CCCCeEEEEEEEEecC-CCEEEEEEEEEEeCCe
Confidence 6788999999999998 5788888888876543
No 22
>PF13750 Big_3_3: Bacterial Ig-like domain (group 3)
Probab=95.46 E-value=1.2 Score=39.99 Aligned_cols=129 Identities=19% Similarity=0.199 Sum_probs=70.1
Q ss_pred CCCeEEEEE-EEEeCCCCCCCceeEEEEEEEEeeCCCCCCcCCCCceEEEEeCCCC-CCeEEEEeEEecCCCCCCceEEE
Q psy8639 127 TCPEYKLEV-RALDTSTSSNPQSSAVAIKIEIVDVNDNAPRWGENPLTIPVNEDTL-VGSTIWNFTATDADDGSNGELRY 204 (570)
Q Consensus 127 ~~~~~~l~v-~a~D~~g~~~~~s~~~~v~I~V~dvNd~~P~f~~~~~~~~v~E~~~-~gt~v~~v~a~D~D~g~~~~~~y 204 (570)
....|.+++ +|.|..| ...+..+..++. ++..+|.+.- .....+..+.. .+..-..+.++|...+. .--..
T Consensus 13 ~dG~Y~l~~~~a~D~ag----N~~~~~~~~~~~-iD~T~Ptisi-~~~~~~~~g~~v~~~~~i~i~~tD~~~~~-~i~sv 85 (158)
T PF13750_consen 13 PDGSYTLTVVTATDAAG----NTSTSTVSETFT-IDNTPPTISI-SDGASVANGSTVYGLVNISINVTDNSDDS-KITSV 85 (158)
T ss_pred CCccEEEEEEEEEecCC----CEEEEEEeeEEE-EcCCCCEEEE-ecCCccCCCccccceeeeEEEEEeCCCCc-eEEEE
Confidence 356899999 8999776 344444443333 2445776533 11112222222 12333567777765433 34455
Q ss_pred EEEecCCCCCcEEE--eCC-CcEEEEc--cc-CCccCcceEEEEEEEEECCCCCCCCceeEEEEEEEEee
Q psy8639 205 GLVRQYPEGNHFSV--DHL-TGTLTLL--QP-LDYETLTSFILVVKVTDQAANISERLSTSLTARVVVKD 268 (570)
Q Consensus 205 ~l~~~~~~~~~F~I--d~~-~G~i~~~--~~-lD~e~~~~~~l~v~a~D~~~~~~~~~~~~~~v~I~V~d 268 (570)
+|..+. ......+ ... .|...+. +. -..|....|+|+|.|+|..| ...+..+.+...+
T Consensus 86 ~l~Gg~-~~d~v~ls~~~~~~~~~~~~yp~~fpsle~~~~YtLtV~a~D~aG-----N~~~~si~F~y~P 149 (158)
T PF13750_consen 86 SLTGGP-ASDSVSLSWTNKGNGVYTLEYPRIFPSLEADDSYTLTVSATDKAG-----NQSTKSISFSYMP 149 (158)
T ss_pred EEECCc-ccceEEEeeEeccCceEEeecccccCCcCCCCeEEEEEEEEecCC-----CEEEEEEEEEEeC
Confidence 665442 2233222 221 2333332 11 12366789999999999998 6777777776654
No 23
>TIGR00845 caca sodium/calcium exchanger 1. This model is specific for the eukaryotic sodium ion/calcium ion exchangers of the Caca family
Probab=92.20 E-value=21 Score=40.99 Aligned_cols=49 Identities=16% Similarity=0.139 Sum_probs=33.3
Q ss_pred CCCCCeeecCCCCeeEEEeccCCCCeEEEEEEEEeCCCCCCeEEEEEEEeCCC
Q psy8639 270 NDNAPVFVSPTRGSLIYVSEQTRVGSHVTKMTAVDQDSGDNGRVTYVISGGNE 322 (570)
Q Consensus 270 Nd~~P~~~~~~~~~~~~v~E~~~~g~~v~~v~a~D~D~~~~~~i~ysi~~~~~ 322 (570)
||..+.+.... ....|.|++ |+.-.++.-...|.+....+.|+..++..
T Consensus 396 dd~~s~i~Fe~--~~Y~V~En~--GtV~VtV~R~GGdl~~tVsVdY~T~DGTA 444 (928)
T TIGR00845 396 NDPVSKIFFEP--GHYTCLENC--GTVALTVVRRGGDLTNTVYVDYRTEDGTA 444 (928)
T ss_pred cCCcceEEecC--CeEEEeecC--cEEEEEEEEccCCCCceEEEEEEccCCcc
Confidence 44455555554 578899987 67666676665566556778898887754
No 24
>TIGR00864 PCC polycystin cation channel protein. Note: this model has been restricted to the amino half because for technical reasons.
Probab=92.10 E-value=46 Score=42.86 Aligned_cols=32 Identities=9% Similarity=0.272 Sum_probs=23.5
Q ss_pred CccCcceEEEEEEEEECCCCCCCCceeEEEEEEEEeeC
Q psy8639 232 DYETLTSFILVVKVTDQAANISERLSTSLTARVVVKDL 269 (570)
Q Consensus 232 D~e~~~~~~l~v~a~D~~~~~~~~~~~~~~v~I~V~dv 269 (570)
-|.....|++++.|++..+ +...++.|.|.+.
T Consensus 1563 TY~spGtYtVtLTvtN~~G------s~~~T~~i~V~~~ 1594 (2740)
T TIGR00864 1563 TFRSVGTFNIIVTAENDVG------AAQASIFLFVLQE 1594 (2740)
T ss_pred eecCCceEEEEEEEecCCC------ccceeEEEEEeec
Confidence 3566789999999999886 3455666666654
No 25
>PF05345 He_PIG: Putative Ig domain; InterPro: IPR008009 This alignment represents the conserved core region of a ~90 residue repeat found in several haemagglutinins and other cell surface proteins. Sequence similarities to Hyalin (IPR003410 from INTERPRO) and the PKD domain (IPR000601 from INTERPRO) suggest an Ig-like fold so this family may be similar in function to the (IPR003791 from INTERPRO) and (IPR003790 from INTERPRO) protein families.
Probab=87.22 E-value=3.1 Score=29.05 Aligned_cols=36 Identities=19% Similarity=0.327 Sum_probs=28.0
Q ss_pred CCCcEEEeCCCcEEEEcccCCcc-ccceEEEEEEEEeCC
Q psy8639 472 ADDMFKIDAETGVVTLARSLDRE-TVDQYNLVVHVTDSK 509 (570)
Q Consensus 472 ~~~~F~Id~~tG~i~~~~~lD~E-~~~~~~l~V~atD~~ 509 (570)
...+++||+.+|.|+-.-. .. ....|.|+|+|+|+.
T Consensus 12 LP~gLs~d~~tG~isGtp~--~~~~~G~y~~~vtatd~~ 48 (49)
T PF05345_consen 12 LPSGLSLDPSTGTISGTPT--SSVQPGTYTFTVTATDGS 48 (49)
T ss_pred CCCcEEEeCCCCEEEeecC--CCccccEEEEEEEEEcCC
Confidence 4578999999999976532 22 347999999999986
No 26
>PF05345 He_PIG: Putative Ig domain; InterPro: IPR008009 This alignment represents the conserved core region of a ~90 residue repeat found in several haemagglutinins and other cell surface proteins. Sequence similarities to Hyalin (IPR003410 from INTERPRO) and the PKD domain (IPR000601 from INTERPRO) suggest an Ig-like fold so this family may be similar in function to the (IPR003791 from INTERPRO) and (IPR003790 from INTERPRO) protein families.
Probab=86.99 E-value=3.7 Score=28.68 Aligned_cols=47 Identities=26% Similarity=0.338 Sum_probs=32.4
Q ss_pred EEEEecCCCCCCCceEEECcccEEEECCcCCCC-CCCeEEEEEEEEeCCC
Q psy8639 94 YSLTSLSPTDHSGAFDIDKSSGSLVVARQLDRE-TCPEYKLEVRALDTST 142 (570)
Q Consensus 94 ysi~~~~~~~~~~~F~id~~~G~l~~~~~LD~E-~~~~~~l~v~a~D~~g 142 (570)
|++........+....+|+.+|.|.-... .+ +...|.|.|+|+|..|
T Consensus 2 ys~~~~~~~~LP~gLs~d~~tG~isGtp~--~~~~~G~y~~~vtatd~~G 49 (49)
T PF05345_consen 2 YSLTTPTGGGLPSGLSLDPSTGTISGTPT--SSVQPGTYTFTVTATDGSG 49 (49)
T ss_pred EEEEcCCCCCCCCcEEEeCCCCEEEeecC--CCccccEEEEEEEEEcCCC
Confidence 55422233456778899999999987643 22 2359999999999643
No 27
>TIGR03660 T1SS_rpt_143 T1SS-143 repeat domain. This model represents a domain of about 143 amino acids that may occur singly or in up to 23 tandem repeats in very large proteins in the genus Vibrio, and in related species such as Legionella pneumophila, Photobacterium profundum, Rhodopseudomonas palustris, Shewanella pealeana, and Aeromonas hydrophila. Proteins with these domains represent a subset of a broader set of proteins with a particular signal for type 1 secretion, consisting of several glycine-rich repeats modeled by pfam00353, followed by a C-terminal domain modeled by TIGR03661. Proteins with this domain tend to share several properties with the RtxA (Repeats in Toxin) protein of Vibrio cholerae, including a large size often containing tandemly repeated domains and a C-terminal signal for type 1 secretion.
Probab=85.62 E-value=13 Score=32.31 Aligned_cols=57 Identities=23% Similarity=0.390 Sum_probs=38.0
Q ss_pred EEEECCcCCCCC---CCeEEEEEEEEeCCCCCCCceeEEEEEEEEeeCCCCCCcCCCCceEEEEeCCC
Q psy8639 116 SLVVARQLDRET---CPEYKLEVRALDTSTSSNPQSSAVAIKIEIVDVNDNAPRWGENPLTIPVNEDT 180 (570)
Q Consensus 116 ~l~~~~~LD~E~---~~~~~l~v~a~D~~g~~~~~s~~~~v~I~V~dvNd~~P~f~~~~~~~~v~E~~ 180 (570)
.+.+.++||+.. .-...|.|.|+|..| ...+..+.|+|.| | .|+..... .+.|.|+.
T Consensus 69 tftL~~~lDH~~g~d~l~l~~~v~a~D~DG----D~s~~~l~VtI~D--D-~P~~~~~~-~~~V~E~~ 128 (137)
T TIGR03660 69 EFTLEGPLDHAAGSDELTLNFPIIATDFDG----DTSSITLPVTIVD--D-VPTITDVD-ALTVDEDD 128 (137)
T ss_pred EEEEcccccCCCCCceEEEeeeEEEEeCCC----CccccEEEEEEEC--C-CCeecccc-ceEEeccc
Confidence 334556776633 446788999999877 3334578888887 6 57765543 36888864
No 28
>KOG3597|consensus
Probab=84.72 E-value=56 Score=34.39 Aligned_cols=159 Identities=18% Similarity=0.167 Sum_probs=90.7
Q ss_pred EEEEEEEEeeCCCCCCcCCCCceEEEEeCCCCCCeEEEEeEEecCCCCCCceEEEEEEecCCCC---CcEEEeCC-----
Q psy8639 150 AVAIKIEIVDVNDNAPRWGENPLTIPVNEDTLVGSTIWNFTATDADDGSNGELRYGLVRQYPEG---NHFSVDHL----- 221 (570)
Q Consensus 150 ~~~v~I~V~dvNd~~P~f~~~~~~~~v~E~~~~gt~v~~v~a~D~D~g~~~~~~y~l~~~~~~~---~~F~Id~~----- 221 (570)
+....|.|..+||.+..+...-+.+.+.|+...-.....+.+.|+|.+. ..+.|++....... ..|.....
T Consensus 25 ~~~~~i~v~pvndpp~~~~~~~~~l~~~~~~~k~l~~~~l~~~d~d~~~-~~l~f~v~~t~~~~~~~~~~~~~g~~~~~F 103 (442)
T KOG3597|consen 25 TDVLRIHVNPVNDPPSLIFPSGSLLVILEGGQKVLDPELLTAADPDSAP-LPLEFQVLGTSSVPLPVLKFDVPGAPATEF 103 (442)
T ss_pred EeeecccccccCCCcceeecccceEEeecCCceeccceEeeccCCCCCc-cceEEEEccCCCCCCccceeeccCCcccce
Confidence 3457799999999666555544458899988766667789999998654 46888887743211 11333221
Q ss_pred ------CcEEEEcccCCccCcceEEEEEEEEECCCCCCCCceeEEEEEEEEeeCCCCCCeeecCCCCeeEEEeccCCCCe
Q psy8639 222 ------TGTLTLLQPLDYETLTSFILVVKVTDQAANISERLSTSLTARVVVKDLNDNAPVFVSPTRGSLIYVSEQTRVGS 295 (570)
Q Consensus 222 ------~G~i~~~~~lD~e~~~~~~l~v~a~D~~~~~~~~~~~~~~v~I~V~dvNd~~P~~~~~~~~~~~~v~E~~~~g~ 295 (570)
.|.+.+...- .+.....++..++|+-. .+. .+.......-. .|.+... ....|.-.+..-.
T Consensus 104 s~~~v~~g~~~yvh~g--~el~~~~~~~~~SDg~~------~S~-~~i~~~~~~~~-~~~~~~~---~gL~v~~gS~~~I 170 (442)
T KOG3597|consen 104 SYEEVEDGSLSYVHSG--TELRESELQLRVSDGLL------VSE-RAILKVEATGP-APHLARN---TGLKVLQGSTAPI 170 (442)
T ss_pred EehHhhcCceeEEecC--cccccceEEEEeecceE------eee-eEEecccCCCc-ceeeecc---cceEEccCccccc
Confidence 1223222110 12456788888888752 222 22222222211 3333332 2344444443222
Q ss_pred EEEEEEEEeCCCCCCeEEEEEEEeCCC
Q psy8639 296 HVTKMTAVDQDSGDNGRVTYVISGGNE 322 (570)
Q Consensus 296 ~v~~v~a~D~D~~~~~~i~ysi~~~~~ 322 (570)
.-..+++.|.|.+....+.|.|.....
T Consensus 171 T~~~L~ved~d~~~d~~v~~~i~~~P~ 197 (442)
T KOG3597|consen 171 TPSNLSVEDNDSSPDDEVRYDITPPPQ 197 (442)
T ss_pred cHhHceeecCCCCCCcEEEEEecCCCC
Confidence 223589999998777889999987644
No 29
>PF07495 Y_Y_Y: Y_Y_Y domain; InterPro: IPR011123 This region is mostly found at the end of the beta propellers (IPR011110 from INTERPRO) in a family of two component regulators. However they are also found tandemly repeated in Q891H4 from SWISSPROT without other signal conduction domains being present. It is named after the conserved tyrosines found in the alignment. The exact function is not known.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=82.22 E-value=16 Score=26.86 Aligned_cols=61 Identities=20% Similarity=0.166 Sum_probs=36.5
Q ss_pred CCeEEEEEEEeCCCCCcEEEecCCCEEEEEeecCcccccceeEEEEEEEEEECCcCCceeEEEEEEEEe
Q psy8639 309 DNGRVTYVISGGNEDARFHLDHDKGILSLARVISEYDHQSKFRAVLNITASDHGIPPKSSSLVINILSE 377 (570)
Q Consensus 309 ~~~~i~ysi~~~~~~~~F~id~~tG~l~~~~~ld~~d~e~~~~~~l~V~a~D~g~~~~s~~~~v~I~V~ 377 (570)
.+-...|.|.+. ...|..+...+-.+... ....+.|+|.|.|.|..........++.|.|+
T Consensus 6 ~~~~Y~Y~l~g~-d~~W~~~~~~~~~~~~~-------~L~~G~Y~l~V~a~~~~~~~~~~~~~l~i~I~ 66 (66)
T PF07495_consen 6 ENIRYRYRLEGF-DDEWITLGSYSNSISYT-------NLPPGKYTLEVRAKDNNGKWSSDEKSLTITIL 66 (66)
T ss_dssp TTEEEEEEEETT-ESSEEEESSTS-EEEEE-------S--SEEEEEEEEEEETTS-B-SS-EEEEEEEE
T ss_pred CceEEEEEEECC-CCeEEECCCCcEEEEEE-------eCCCEEEEEEEEEECCCCCcCcccEEEEEEEC
Confidence 455677777765 57777776543244443 33478999999999965543333366666663
No 30
>PF13753 SWM_repeat: Putative flagellar system-associated repeat
Probab=81.93 E-value=60 Score=32.67 Aligned_cols=109 Identities=17% Similarity=0.230 Sum_probs=53.9
Q ss_pred CCCeEEEEEEEEeCCCCCCCceeEEEEEEEEeeCCCCCCcCCCCceEEEEeCCC------CCCeEEEEeEEecCCCCCCc
Q psy8639 127 TCPEYKLEVRALDTSTSSNPQSSAVAIKIEIVDVNDNAPRWGENPLTIPVNEDT------LVGSTIWNFTATDADDGSNG 200 (570)
Q Consensus 127 ~~~~~~l~v~a~D~~g~~~~~s~~~~v~I~V~dvNd~~P~f~~~~~~~~v~E~~------~~gt~v~~v~a~D~D~g~~~ 200 (570)
....|.+.++++|..|. .......+.|-- .+|...-. .+.++. ..........+++.+. +.
T Consensus 10 ~d~~~~v~vt~tD~aGN----~~~~t~~~~vDt---~~P~v~i~----~~~~~~~~~~~~~~~~~t~s~tvs~~~~--g~ 76 (317)
T PF13753_consen 10 ADGTYTVSVTVTDAAGN----TSTATQSITVDT---TAPTVTIT----SIADDDIINGDEATNTVTFSGTVSGAEP--GS 76 (317)
T ss_pred CCCcEEEEEEEEeCCCC----eeeeeEEEEEec---CCCceeee----cccCCCccccceeeeeeEEEEEecCCCC--CC
Confidence 35689999999998773 333444444332 26643221 111111 1122334445444443 33
Q ss_pred eEEEEEEecCCCCCcEEEeCCCcEEEEcccC-CccCcceEEEEEE-EEECCCCC
Q psy8639 201 ELRYGLVRQYPEGNHFSVDHLTGTLTLLQPL-DYETLTSFILVVK-VTDQAANI 252 (570)
Q Consensus 201 ~~~y~l~~~~~~~~~F~Id~~~G~i~~~~~l-D~e~~~~~~l~v~-a~D~~~~~ 252 (570)
.+.+.+. +. ...+..+ ..|.+...-+. +.-....|.+.+. ++|..|..
T Consensus 77 ~v~v~~~-g~--~~t~~~~-~~G~ws~t~~~~~~l~~g~~ti~v~~~tD~aGN~ 126 (317)
T PF13753_consen 77 TVTVTIN-GT--TGTLTAD-ADGNWSVTVTPSDDLPDGDYTITVTTVTDAAGNT 126 (317)
T ss_pred EEEEEEC-CE--EEEEEEe-cCCcEEEeeccccccccCcceeEEEEEEccCCcc
Confidence 5666552 11 1233333 34543332221 1123347889998 99999853
No 31
>TIGR00845 caca sodium/calcium exchanger 1. This model is specific for the eukaryotic sodium ion/calcium ion exchangers of the Caca family
Probab=79.73 E-value=1.3e+02 Score=35.00 Aligned_cols=152 Identities=18% Similarity=0.189 Sum_probs=76.9
Q ss_pred CCCCCCcCCCCceEEEEeCCCCCCeEEEEeEEecCCCCCCceEEEEEEecCCCCC-cEEEeCCCcEEEEc----------
Q psy8639 160 VNDNAPRWGENPLTIPVNEDTLVGSTIWNFTATDADDGSNGELRYGLVRQYPEGN-HFSVDHLTGTLTLL---------- 228 (570)
Q Consensus 160 vNd~~P~f~~~~~~~~v~E~~~~gt~v~~v~a~D~D~g~~~~~~y~l~~~~~~~~-~F~Id~~~G~i~~~---------- 228 (570)
.||..+.+.-.+....|.|+ .|+.-..|.=...|.+..-.+.|+..+++-..+ .|. +..|.|.-.
T Consensus 395 ~dd~~s~i~Fe~~~Y~V~En--~GtV~VtV~R~GGdl~~tVsVdY~T~DGTA~AG~DY~--~~sGTLtF~PGEt~KtItV 470 (928)
T TIGR00845 395 ENDPVSKIFFEPGHYTCLEN--CGTVALTVVRRGGDLTNTVYVDYRTEDGTANAGSDYE--FTEGTLVFKPGETQKEFRI 470 (928)
T ss_pred ccCCcceEEecCCeEEEeec--CcEEEEEEEEccCCCCceEEEEEEccCCccCCCCCcc--ccCceEEECCCceEEEEEE
Confidence 44545555444445679998 466666665444454444578888877643221 222 234444322
Q ss_pred ---ccCCccCcceEEEEEEEEECCCC----------CCCCceeEEEEEEEEeeCCCCCCeeecCCCCeeEEEeccCCCCe
Q psy8639 229 ---QPLDYETLTSFILVVKVTDQAAN----------ISERLSTSLTARVVVKDLNDNAPVFVSPTRGSLIYVSEQTRVGS 295 (570)
Q Consensus 229 ---~~lD~e~~~~~~l~v~a~D~~~~----------~~~~~~~~~~v~I~V~dvNd~~P~~~~~~~~~~~~v~E~~~~g~ 295 (570)
..--+|....|.+.+.--..+.. ...........+|+|.| ||++|.|.... ....|.|+. |.
T Consensus 471 ~IIDDdi~E~DE~F~V~LSNp~~g~~~G~~~~~~~~~~A~Lg~ps~ATVTIlD-DD~aGIfsFe~--~~~sV~Es~--G~ 545 (928)
T TIGR00845 471 GIIDDDIFEEDEHFYVRLSNLRVGSEDGILEANHVSAVAQLASPNTATVTILD-DDHAGIFTFEE--DVFHVSESI--GI 545 (928)
T ss_pred EEccCCCCCCCceEEEEEeCCCCCCcccccccccccccceecCCceEEEEEec-CcccCcccccC--ceEEEEcCC--CE
Confidence 11123444444444432111100 00012234466777787 56799887765 568899985 45
Q ss_pred EEEEEEEEeCCCCCCeEEEEEEEeCC
Q psy8639 296 HVTKMTAVDQDSGDNGRVTYVISGGN 321 (570)
Q Consensus 296 ~v~~v~a~D~D~~~~~~i~ysi~~~~ 321 (570)
.-.++.-+-. ....-.+.|...++.
T Consensus 546 vtvtV~RtsG-a~G~VtV~Y~T~dGT 570 (928)
T TIGR00845 546 MEVKVLRTSG-ARGTVIVPYRTVEGT 570 (928)
T ss_pred EEEEEEEcCC-CCeeEEEEEEeecCc
Confidence 4444433321 112335667766654
No 32
>TIGR03660 T1SS_rpt_143 T1SS-143 repeat domain. This model represents a domain of about 143 amino acids that may occur singly or in up to 23 tandem repeats in very large proteins in the genus Vibrio, and in related species such as Legionella pneumophila, Photobacterium profundum, Rhodopseudomonas palustris, Shewanella pealeana, and Aeromonas hydrophila. Proteins with these domains represent a subset of a broader set of proteins with a particular signal for type 1 secretion, consisting of several glycine-rich repeats modeled by pfam00353, followed by a C-terminal domain modeled by TIGR03661. Proteins with this domain tend to share several properties with the RtxA (Repeats in Toxin) protein of Vibrio cholerae, including a large size often containing tandemly repeated domains and a C-terminal signal for type 1 secretion.
Probab=79.67 E-value=8.6 Score=33.48 Aligned_cols=45 Identities=22% Similarity=0.324 Sum_probs=32.2
Q ss_pred ceEEEEEEEEECCCCCCCCceeEEEEEEEEeeCCCCCCeeecCCCCeeEEEeccCC
Q psy8639 237 TSFILVVKVTDQAANISERLSTSLTARVVVKDLNDNAPVFVSPTRGSLIYVSEQTR 292 (570)
Q Consensus 237 ~~~~l~v~a~D~~~~~~~~~~~~~~v~I~V~dvNd~~P~~~~~~~~~~~~v~E~~~ 292 (570)
-.+.|.|.|+|..+. ..+..+.|+|.| | .|...... ...|.|+..
T Consensus 85 l~l~~~v~a~D~DGD-----~s~~~l~VtI~D--D-~P~~~~~~---~~~V~E~~L 129 (137)
T TIGR03660 85 LTLNFPIIATDFDGD-----TSSITLPVTIVD--D-VPTITDVD---ALTVDEDDL 129 (137)
T ss_pred EEEeeeEEEEeCCCC-----ccccEEEEEEEC--C-CCeecccc---ceEEecccc
Confidence 357888999998874 445688888887 6 68876553 478888553
No 33
>KOG3597|consensus
Probab=77.58 E-value=41 Score=35.38 Aligned_cols=158 Identities=13% Similarity=0.123 Sum_probs=84.4
Q ss_pred eEEEEEEEEeeCCCCCCeecCCeeEEEEeCCCCCCcEEEEEEEecCCCCCCceEEEEEEeeccCCCceEEEEEEeeCCCc
Q psy8639 368 SSLVINILSETVSTSLPTFLSPSYHANVSEDAIPGTYIIKVVARSPSTYGNYVYTEVFLVSEDAIPGTYIIKVVARSPST 447 (570)
Q Consensus 368 ~~~~v~I~V~dvNd~~P~f~~~~~~~~v~E~~~~gt~v~~v~a~D~D~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~ 447 (570)
.+..+.|.|..+||.+..+....+.+.+.|+...-...-.+.+.|+|. .++...+.++.+.-.
T Consensus 24 ~~~~~~i~v~pvndpp~~~~~~~~~l~~~~~~~k~l~~~~l~~~d~d~----------------~~~~l~f~v~~t~~~- 86 (442)
T KOG3597|consen 24 QTDVLRIHVNPVNDPPSLIFPSGSLLVILEGGQKVLDPELLTAADPDS----------------APLPLEFQVLGTSSV- 86 (442)
T ss_pred EEeeecccccccCCCcceeecccceEEeecCCceeccceEeeccCCCC----------------CccceEEEEccCCCC-
Confidence 456688899999997766655555566666654433445677888887 555566666655321
Q ss_pred eeEEEEEeecCCCceEEEEecCCCCCCcEEEe-CCCcEEEEcccCCccccceEEEEEEEEeCCCCCeeEEEEEEEEEecC
Q psy8639 448 YVYHMYLISDFSGYNLTYMIPAGVADDMFKID-AETGVVTLARSLDRETVDQYNLVVHVTDSKSGDYDTTNVFITILDVN 526 (570)
Q Consensus 448 ~~~~~~~~~~~~~~~~~y~i~~~~~~~~F~Id-~~tG~i~~~~~lD~E~~~~~~l~V~atD~~~~~~~~~~v~I~V~dvN 526 (570)
+..+.-....+...+.|+-. -..|.+.....= .+.....++..++|+-. .+. .+......
T Consensus 87 ------------~~~~~~~~~~g~~~~~Fs~~~v~~g~~~yvh~g--~el~~~~~~~~~SDg~~--~S~-~~i~~~~~-- 147 (442)
T KOG3597|consen 87 ------------PLPVLKFDVPGAPATEFSYEEVEDGSLSYVHSG--TELRESELQLRVSDGLL--VSE-RAILKVEA-- 147 (442)
T ss_pred ------------CCccceeeccCCcccceEehHhhcCceeEEecC--cccccceEEEEeecceE--eee-eEEecccC--
Confidence 00111122223333445542 233444332211 14567888889999762 222 22222222
Q ss_pred CCCcccccCCeeeEEecCCCCCC-eEEEEEEEeCCCC
Q psy8639 527 DNAPEFKSGSCYPITIPENSEAA-NVHTFIATDFGNF 562 (570)
Q Consensus 527 d~~P~f~~~~~~~~~v~E~~~~~-~v~~i~a~D~D~~ 562 (570)
..|.+.......+.|.-.+.+- .=.-+++.|.|+.
T Consensus 148 -~~~~~~~~~~~gL~v~~gS~~~IT~~~L~ved~d~~ 183 (442)
T KOG3597|consen 148 -TGPAPHLARNTGLKVLQGSTAPITPSNLSVEDNDSS 183 (442)
T ss_pred -CCcceeeecccceEEccCccccccHhHceeecCCCC
Confidence 3444443344555565555443 2256889999853
No 34
>TIGR00864 PCC polycystin cation channel protein. Note: this model has been restricted to the amino half because for technical reasons.
Probab=76.53 E-value=2.5e+02 Score=36.71 Aligned_cols=37 Identities=5% Similarity=0.099 Sum_probs=25.0
Q ss_pred CCccccceEEEEEEEEeCCCCCeeEEEEEEEEEecCC
Q psy8639 491 LDRETVDQYNLVVHVTDSKSGDYDTTNVFITILDVND 527 (570)
Q Consensus 491 lD~E~~~~~~l~V~atD~~~~~~~~~~v~I~V~dvNd 527 (570)
+-|.+.+.|+++++|++..+.......+.|.+.++..
T Consensus 1223 HtY~~~GtYTVtLTvtN~~G~~sts~~V~V~v~~V~g 1259 (2740)
T TIGR00864 1223 HKYAKAGNCTVNIGAANAAGHGARIIHVEVFVFEVAG 1259 (2740)
T ss_pred eEcCCCceEEEEEEEEcCCCCceeEEEEEEEeeeccc
Confidence 5577789999999999988633444445555545544
No 35
>PF07495 Y_Y_Y: Y_Y_Y domain; InterPro: IPR011123 This region is mostly found at the end of the beta propellers (IPR011110 from INTERPRO) in a family of two component regulators. However they are also found tandemly repeated in Q891H4 from SWISSPROT without other signal conduction domains being present. It is named after the conserved tyrosines found in the alignment. The exact function is not known.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=74.21 E-value=22 Score=26.13 Aligned_cols=57 Identities=19% Similarity=0.321 Sum_probs=31.9
Q ss_pred eEEEEecCCCCCCcEEEeCCCcEEEEcccCCccccceEEEEEEEEeCCCCCe-eEEEEEEEEE
Q psy8639 462 NLTYMIPAGVADDMFKIDAETGVVTLARSLDRETVDQYNLVVHVTDSKSGDY-DTTNVFITIL 523 (570)
Q Consensus 462 ~~~y~i~~~~~~~~F~Id~~tG~i~~~~~lD~E~~~~~~l~V~atD~~~~~~-~~~~v~I~V~ 523 (570)
...|.|.+ ....+..+...+-.+..... ..+.|.|.|+|.|..+... ...++.|+|.
T Consensus 9 ~Y~Y~l~g-~d~~W~~~~~~~~~~~~~~L----~~G~Y~l~V~a~~~~~~~~~~~~~l~i~I~ 66 (66)
T PF07495_consen 9 RYRYRLEG-FDDEWITLGSYSNSISYTNL----PPGKYTLEVRAKDNNGKWSSDEKSLTITIL 66 (66)
T ss_dssp EEEEEEET-TESSEEEESSTS-EEEEES------SEEEEEEEEEEETTS-B-SS-EEEEEEEE
T ss_pred EEEEEEEC-CCCeEEECCCCcEEEEEEeC----CCEEEEEEEEEECCCCCcCcccEEEEEEEC
Confidence 34555543 34556666443224443332 6799999999999886222 2266777663
No 36
>PF02010 REJ: REJ domain; InterPro: IPR002859 The REJ (Receptor for Egg Jelly) domain is found in PKD1 P98161 from SWISSPROT and the sperm receptor for egg jelly Q26627 from SWISSPROT. The exact function of this domain is unknown. The domain is 600 amino acids long so is probably composed of multiple structural domains. There are six completely conserved cysteine residues that may form disulphide bridges. This region contains tandem PKD-like domains. Sequence similarity between a region of the autosomal dominant polycystic kidney disease (ADPKD) protein, polycystin-1 and a sea urchin sperm glycoprotein involved in fertilization, the receptor for egg jelly (suREJ) has been known for some time. The suREJ protein binds the glycoprotein coat of the egg (egg jelly), triggering the acrosome reaction, which transforms the sperm into a fusogenic cell. The sequence similarity and expression pattern suggests that the predicted human PKDREJ protein is a mammalian equivalent of the suREJ protein and therefore may have a central role in human fertilization [].; PDB: 2E7M_A 2YRL_A.
Probab=73.57 E-value=4.9 Score=42.69 Aligned_cols=29 Identities=21% Similarity=0.224 Sum_probs=0.0
Q ss_pred eEEEEEEEEeCCCCCeeEEEEEEEEEecCC
Q psy8639 498 QYNLVVHVTDSKSGDYDTTNVFITILDVND 527 (570)
Q Consensus 498 ~~~l~V~atD~~~~~~~~~~v~I~V~dvNd 527 (570)
...+.|++.|..+ ......+.|+|.....
T Consensus 375 ~~~i~v~V~D~~g-a~~~~~~~V~V~~~~~ 403 (440)
T PF02010_consen 375 NVTIYVQVCDSLG-ACSNVTVTVTVTPPSV 403 (440)
T ss_dssp ------------------------------
T ss_pred ccccccccccccc-cccccccccccccccc
Confidence 5778889999884 5557777888865444
No 37
>PF05895 DUF859: Siphovirus protein of unknown function (DUF859); InterPro: IPR008577 This entry is represented by Streptococcus phage 7201, Orf39. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This family consists of several uncharacterised proteins from a number of the Siphoviruses as well as some bacterial proteins from Streptococcus species. Some of the members of this family are described as putative minor structural proteins.
Probab=65.80 E-value=1e+02 Score=34.23 Aligned_cols=123 Identities=14% Similarity=0.123 Sum_probs=0.0
Q ss_pred CeEEEEEEEEeCCCCCCCceeEEEEEEEEeeCCCCCCcCCCCceEEEEeCCCCCCeEEEEeEEecCCCCCCceEEEEEEe
Q psy8639 129 PEYKLEVRALDTSTSSNPQSSAVAIKIEIVDVNDNAPRWGENPLTIPVNEDTLVGSTIWNFTATDADDGSNGELRYGLVR 208 (570)
Q Consensus 129 ~~~~l~v~a~D~~g~~~~~s~~~~v~I~V~dvNd~~P~f~~~~~~~~v~E~~~~gt~v~~v~a~D~D~g~~~~~~y~l~~ 208 (570)
..++|+++++|.+| ..+.+...+|+|++=. +|.+.-..++..-.++...-..-+++...--..-....+++.+.-
T Consensus 298 G~~Ti~atVtDSRG---r~S~~~~~tItVl~Y~--~P~lsfsv~R~~~~~~~~~v~~~a~Iapl~v~g~qKN~~~lt~~~ 372 (624)
T PF05895_consen 298 GSATIRATVTDSRG---RTSDPKTKTITVLEYS--PPTLSFSVYRCGSSGNTLTVTRNAKIAPLTVNGVQKNTMTLTFKV 372 (624)
T ss_pred ceEEEEEEEEECCC---ccCCceEEEEEEEEcC--CCcEEEEEEEeCCCCcEEEEEEEEEEeEEEEcccccceEEEEEEE
Q ss_pred cCCCCCcEEEeCCCcE-------------EEEcccCCccCcceEEEEEEEEECCCCCCCCceeEEEEEE
Q psy8639 209 QYPEGNHFSVDHLTGT-------------LTLLQPLDYETLTSFILVVKVTDQAANISERLSTSLTARV 264 (570)
Q Consensus 209 ~~~~~~~F~Id~~~G~-------------i~~~~~lD~e~~~~~~l~v~a~D~~~~~~~~~~~~~~v~I 264 (570)
..-....|.+|..... ..+...+|-+. +|.+.+.++|.- .+.+..+.|
T Consensus 373 a~~gt~~~t~d~~~a~~~~s~~s~~~~~~~~L~g~y~~~k--Sy~V~~~l~D~F------~s~t~~~~V 433 (624)
T PF05895_consen 373 APLGTGTFTTDNGSASGTWSSISELTNSSANLGGTYDAEK--SYDVRGTLSDKF------TSTTFTVTV 433 (624)
T ss_pred EEcCcceEEEEccccccceeeeeeecccceeeccccCCCc--eEEEEEEEEEEe------eeEEEEEEc
No 38
>PF03160 Calx-beta: Calx-beta domain; InterPro: IPR003644 The calx-beta motif is present as a tandem repeat in the cytoplasmic domains of Calx Na-Ca exchangers, which are used to expel calcium from cells. This motif overlaps domains used for calcium binding and regulation. The calx-beta motif is also present in the cytoplasmic tail of mammalian integrin-beta4, which mediates the bi-directional transfer of signals across the plasma membrane, as well as in some cyanobacterial proteins. This motif contains a series of beta-strands and turns that form a self-contained beta-sheet [, ].; GO: 0007154 cell communication, 0016021 integral to membrane; PDB: 3H6A_B 3FSO_A 3FQ4_B 2DPK_A 2QVM_A 3GIN_B 2QVK_A 2FWU_A 2FWS_A 3E9U_A ....
Probab=60.74 E-value=87 Score=25.17 Aligned_cols=52 Identities=19% Similarity=0.198 Sum_probs=31.2
Q ss_pred EEEEeeCCCCCCeeecCCCCeeEEEeccCCCCeEEEEEEEEeCCCCCCeEEEEEEEeC
Q psy8639 263 RVVVKDLNDNAPVFVSPTRGSLIYVSEQTRVGSHVTKMTAVDQDSGDNGRVTYVISGG 320 (570)
Q Consensus 263 ~I~V~dvNd~~P~~~~~~~~~~~~v~E~~~~g~~v~~v~a~D~D~~~~~~i~ysi~~~ 320 (570)
+|+|.| ||.+ .+.... ....+.|+.. ..-..+....++......+.|....+
T Consensus 2 tvtI~d-~d~~-~v~f~~--~~~~v~E~~~--~~~v~V~~~~~~~~~~v~v~~~~~~g 53 (100)
T PF03160_consen 2 TVTILD-DDDP-TVSFSS--PSYTVSEGDG--TVTVTVTRSGGSLDGPVTVNYSTVDG 53 (100)
T ss_dssp EEEEE--TTSE-EEEESS--SEEEEETTSS--EEEEEEEEESS-TSSEEEEEEEEEES
T ss_pred EEEEEC-CCCC-EEEEeC--CEEEEEeCCC--EEEEEEEEcccCCCcceEEEEEEeCC
Confidence 577888 6644 777765 5678888874 44445555544433455667776665
No 39
>PF02010 REJ: REJ domain; InterPro: IPR002859 The REJ (Receptor for Egg Jelly) domain is found in PKD1 P98161 from SWISSPROT and the sperm receptor for egg jelly Q26627 from SWISSPROT. The exact function of this domain is unknown. The domain is 600 amino acids long so is probably composed of multiple structural domains. There are six completely conserved cysteine residues that may form disulphide bridges. This region contains tandem PKD-like domains. Sequence similarity between a region of the autosomal dominant polycystic kidney disease (ADPKD) protein, polycystin-1 and a sea urchin sperm glycoprotein involved in fertilization, the receptor for egg jelly (suREJ) has been known for some time. The suREJ protein binds the glycoprotein coat of the egg (egg jelly), triggering the acrosome reaction, which transforms the sperm into a fusogenic cell. The sequence similarity and expression pattern suggests that the predicted human PKDREJ protein is a mammalian equivalent of the suREJ protein and therefore may have a central role in human fertilization [].; PDB: 2E7M_A 2YRL_A.
Probab=59.30 E-value=14 Score=39.27 Aligned_cols=131 Identities=18% Similarity=0.127 Sum_probs=16.9
Q ss_pred ceEEEEEEEE--ECCCCCCCCceeEEEEEEEEeeCCCCCCeeecCCCCeeEEEeccCCCCeEEE-EEEEEeCCCCC-CeE
Q psy8639 237 TSFILVVKVT--DQAANISERLSTSLTARVVVKDLNDNAPVFVSPTRGSLIYVSEQTRVGSHVT-KMTAVDQDSGD-NGR 312 (570)
Q Consensus 237 ~~~~l~v~a~--D~~~~~~~~~~~~~~v~I~V~dvNd~~P~~~~~~~~~~~~v~E~~~~g~~v~-~v~a~D~D~~~-~~~ 312 (570)
..|.+.++++ +... ......+.|.|..-+- .|.+... ....+.-. ....+ --.-.|+|... +..
T Consensus 50 G~y~~~~~Vt~~~~~~-----~~~~~~~~v~V~~s~l-~~~I~gG---~~~~~~~~---~~i~ldgs~S~Dpd~~~~~~~ 117 (440)
T PF02010_consen 50 GDYTFTLTVTASSNPG-----LSSTDSVTVTVEPSPL-VAVIKGG---SSRTVGYN---SDITLDGSQSYDPDGPPGDSG 117 (440)
T ss_dssp CEEEEEEEEE--BCTT-----EEEEEEEEEEEE-----------------------------------------------
T ss_pred CCEEEEEEEEEECCCC-----ceEEEEEEEEEeeccc-eeEEcCC---ccceeecC---ceEEEeeEEEecccccccCCc
Confidence 3455565555 3332 6778888888887432 3333221 11111111 11111 12335777532 244
Q ss_pred EEEEEEeCCCCCc----------EEEecCCCEEEEEeecCcccccceeEEEEEEEEEECCcCCceeEEEEEEEEeeCCCC
Q psy8639 313 VTYVISGGNEDAR----------FHLDHDKGILSLARVISEYDHQSKFRAVLNITASDHGIPPKSSSLVINILSETVSTS 382 (570)
Q Consensus 313 i~ysi~~~~~~~~----------F~id~~tG~l~~~~~ld~~d~e~~~~~~l~V~a~D~g~~~~s~~~~v~I~V~dvNd~ 382 (570)
++|.+.-...... .......+.+...... +.....|.|++++++++. .+..+...|.+..- .
T Consensus 118 l~y~W~C~~~~~~~~C~~~~~~~~~~~~~~~~l~i~~~~----l~~~~~y~f~ltv~k~~r--~s~s~~~~v~v~~~--~ 189 (440)
T PF02010_consen 118 LTYSWSCTDLSSNSACSTPSTNITLLNSSSSSLTIPAST----LSPGSTYTFTLTVSKGSR--SSSSASQTVTVVSG--D 189 (440)
T ss_dssp --------------------------------------------------------------------------------
T ss_pred eEEEEEEeCCceeccccccccccccCCCCCEEEEEEhHH----cCCCceEEEEEEEEeCCC--CceeeEEEEEeccC--C
Confidence 6665542211111 1112233444442221 334445999999999763 24555555555543 4
Q ss_pred CCeec
Q psy8639 383 LPTFL 387 (570)
Q Consensus 383 ~P~f~ 387 (570)
+|.+.
T Consensus 190 ~p~v~ 194 (440)
T PF02010_consen 190 PPTVS 194 (440)
T ss_dssp -----
T ss_pred CCcee
Confidence 56543
No 40
>PF02494 HYR: HYR domain; InterPro: IPR003410 This domain is known as the HYR (Hyalin Repeat) domain, after the protein hyalin that is composed exclusively of this repeat. This domain probably corresponds to a new superfamily in the immunoglobulin fold. The function of this domain is uncertain it may be involved in cell adhesion. In the Sushi repeat-containing protein (SrpX), this domain is found between two sushi repeats.
Probab=53.00 E-value=66 Score=24.81 Aligned_cols=24 Identities=25% Similarity=0.211 Sum_probs=20.1
Q ss_pred eEEEEEEEEECCCCCCCCceeEEEEEEEE
Q psy8639 238 SFILVVKVTDQAANISERLSTSLTARVVV 266 (570)
Q Consensus 238 ~~~l~v~a~D~~~~~~~~~~~~~~v~I~V 266 (570)
.+.++..|+|..| +.++..+.|+|
T Consensus 58 ~t~V~ytA~D~~G-----N~a~C~f~V~V 81 (81)
T PF02494_consen 58 TTTVTYTATDAAG-----NSATCSFTVTV 81 (81)
T ss_pred eEEEEEEEEECCC-----CEEEEEEEEEC
Confidence 5889999999987 67788888875
No 41
>PF13753 SWM_repeat: Putative flagellar system-associated repeat
Probab=52.54 E-value=2.6e+02 Score=28.09 Aligned_cols=114 Identities=19% Similarity=0.211 Sum_probs=53.9
Q ss_pred cceEEEEEEEEECCCCCCCCceeEEEEEEEEeeCCCCCCeeecCCCCeeEEEeccCCCCeEEEEEEEEeCCCCCCeEEEE
Q psy8639 236 LTSFILVVKVTDQAANISERLSTSLTARVVVKDLNDNAPVFVSPTRGSLIYVSEQTRVGSHVTKMTAVDQDSGDNGRVTY 315 (570)
Q Consensus 236 ~~~~~l~v~a~D~~~~~~~~~~~~~~v~I~V~dvNd~~P~~~~~~~~~~~~v~E~~~~g~~v~~v~a~D~D~~~~~~i~y 315 (570)
-..|.+.+.++|..|. ....+..|.|-- .+|.+..........+.-............+.+.+. +..+.+
T Consensus 11 d~~~~v~vt~tD~aGN-----~~~~t~~~~vDt---~~P~v~i~~~~~~~~~~~~~~~~~~t~s~tvs~~~~--g~~v~v 80 (317)
T PF13753_consen 11 DGTYTVSVTVTDAAGN-----TSTATQSITVDT---TAPTVTITSIADDDIINGDEATNTVTFSGTVSGAEP--GSTVTV 80 (317)
T ss_pred CCcEEEEEEEEeCCCC-----eeeeeEEEEEec---CCCceeeecccCCCccccceeeeeeEEEEEecCCCC--CCEEEE
Confidence 3578999999999985 334444544332 267543321000011110101122334444444443 444555
Q ss_pred EEEeCCCCCcEEEecCCCEEEEEeecCcccccceeEEEEEEE-EEECCcC
Q psy8639 316 VISGGNEDARFHLDHDKGILSLARVISEYDHQSKFRAVLNIT-ASDHGIP 364 (570)
Q Consensus 316 si~~~~~~~~F~id~~tG~l~~~~~ld~~d~e~~~~~~l~V~-a~D~g~~ 364 (570)
.+. + ....+..+ ..|.+.+.-... .......|.+.+. ++|....
T Consensus 81 ~~~-g-~~~t~~~~-~~G~ws~t~~~~--~~l~~g~~ti~v~~~tD~aGN 125 (317)
T PF13753_consen 81 TIN-G-TTGTLTAD-ADGNWSVTVTPS--DDLPDGDYTITVTTVTDAAGN 125 (317)
T ss_pred EEC-C-EEEEEEEe-cCCcEEEeeccc--cccccCcceeEEEEEEccCCc
Confidence 552 2 22334444 345533322110 0223457888998 9996543
No 42
>cd00146 PKD polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.
Probab=49.01 E-value=36 Score=26.12 Aligned_cols=32 Identities=16% Similarity=0.293 Sum_probs=23.7
Q ss_pred cCCccccceEEEEEEEEeCCCCCeeEEEEEEEE
Q psy8639 490 SLDRETVDQYNLVVHVTDSKSGDYDTTNVFITI 522 (570)
Q Consensus 490 ~lD~E~~~~~~l~V~atD~~~~~~~~~~v~I~V 522 (570)
.+.|...+.|.++++++|.. +.+...++.|.|
T Consensus 50 ~~~y~~~G~y~v~l~v~d~~-g~~~~~~~~V~V 81 (81)
T cd00146 50 THTYTKPGTYTVTLTVTNAV-GSSSTKTTTVVV 81 (81)
T ss_pred EEEcCCCcEEEEEEEEEeCC-CCEEEEEEEEEC
Confidence 34577899999999999987 455555665553
No 43
>cd02848 Chitinase_N_term Chitinase N-terminus domain. Chitinases hydrolyze the abundant natural biopolymer chitin, producing smaller chito-oligosaccharides. Chitin consists of multiple N-acetyl-D-glucosamine (NAG) residues connected via beta-1,4-glycosidic linkages and is an important structural element of fungal cell wall and arthropod exoskeletons. On the basis of the mode of chitin hydrolysis, chitinases are classified as random, endo-, and exo-chitinases and based on sequence criteria, chitinases belong to families 18 and 19 of glycosyl hydrolases. The N-terminus of chitinase may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitob
Probab=47.63 E-value=41 Score=27.72 Aligned_cols=34 Identities=12% Similarity=0.183 Sum_probs=24.5
Q ss_pred cCCccccceEEEEEEEEeCCCCCeeEEEEEEEEEe
Q psy8639 490 SLDRETVDQYNLVVHVTDSKSGDYDTTNVFITILD 524 (570)
Q Consensus 490 ~lD~E~~~~~~l~V~atD~~~~~~~~~~v~I~V~d 524 (570)
.+++.+.+.|.++|++.|..+ -+.+..+.|.|.|
T Consensus 73 t~~v~kgG~y~m~V~lCn~dG-CS~S~~~~I~VAD 106 (106)
T cd02848 73 TFKVGKGGRYQMQVALCNGDG-CSTSAAKEIVVAD 106 (106)
T ss_pred EEEeCCCCeEEEEEEEECCCC-ccCcCCEEEEecC
Confidence 455667899999999999985 4444455666654
No 44
>PF12245 Big_3_2: Bacterial Ig-like domain (group 3); InterPro: IPR022038 This family of proteins is found in bacteria. They have two conserved sequence motifs: AGN and GMT.
Probab=45.49 E-value=79 Score=23.00 Aligned_cols=31 Identities=26% Similarity=0.397 Sum_probs=22.2
Q ss_pred ccceEEEEEEEEeCCCCCeeEEEEEEEEEecC
Q psy8639 495 TVDQYNLVVHVTDSKSGDYDTTNVFITILDVN 526 (570)
Q Consensus 495 ~~~~~~l~V~atD~~~~~~~~~~v~I~V~dvN 526 (570)
..+.|.+.++++|..+ ...+....+.+.|..
T Consensus 21 ~dg~yt~~v~a~D~AG-N~~~~~~~~~i~d~~ 51 (60)
T PF12245_consen 21 ADGEYTLTVTATDKAG-NTSSSTTQIVIVDNT 51 (60)
T ss_pred CCccEEEEEEEEECCC-CEEEeeeEEEEEcCC
Confidence 3678999999999984 555555555565544
No 45
>PF13754 Big_3_4: Bacterial Ig-like domain (group 3)
Probab=44.32 E-value=65 Score=22.85 Aligned_cols=28 Identities=25% Similarity=0.376 Sum_probs=18.6
Q ss_pred ccceEEEEEEEEeCCCCCeeEEEEEEEE
Q psy8639 495 TVDQYNLVVHVTDSKSGDYDTTNVFITI 522 (570)
Q Consensus 495 ~~~~~~l~V~atD~~~~~~~~~~v~I~V 522 (570)
..+.|.|+++|+|..+..+....+.|.|
T Consensus 22 ~dG~y~itv~a~D~AGN~s~~~~~~~ti 49 (54)
T PF13754_consen 22 ADGTYTITVTATDAAGNTSTSSSVTFTI 49 (54)
T ss_pred CCccEEEEEEEEeCCCCCCCccceeEEE
Confidence 4688999999999986333333334444
No 46
>PF12245 Big_3_2: Bacterial Ig-like domain (group 3); InterPro: IPR022038 This family of proteins is found in bacteria. They have two conserved sequence motifs: AGN and GMT.
Probab=41.84 E-value=1.1e+02 Score=22.31 Aligned_cols=30 Identities=37% Similarity=0.426 Sum_probs=21.3
Q ss_pred CCeEEEEEEEEeCCCCCCCceeEEEEEEEEeeCC
Q psy8639 128 CPEYKLEVRALDTSTSSNPQSSAVAIKIEIVDVN 161 (570)
Q Consensus 128 ~~~~~l~v~a~D~~g~~~~~s~~~~v~I~V~dvN 161 (570)
...|.+.++++|..| ..........+.|..
T Consensus 22 dg~yt~~v~a~D~AG----N~~~~~~~~~i~d~~ 51 (60)
T PF12245_consen 22 DGEYTLTVTATDKAG----NTSSSTTQIVIVDNT 51 (60)
T ss_pred CccEEEEEEEEECCC----CEEEeeeEEEEEcCC
Confidence 568999999999877 455555555555543
No 47
>PF03160 Calx-beta: Calx-beta domain; InterPro: IPR003644 The calx-beta motif is present as a tandem repeat in the cytoplasmic domains of Calx Na-Ca exchangers, which are used to expel calcium from cells. This motif overlaps domains used for calcium binding and regulation. The calx-beta motif is also present in the cytoplasmic tail of mammalian integrin-beta4, which mediates the bi-directional transfer of signals across the plasma membrane, as well as in some cyanobacterial proteins. This motif contains a series of beta-strands and turns that form a self-contained beta-sheet [, ].; GO: 0007154 cell communication, 0016021 integral to membrane; PDB: 3H6A_B 3FSO_A 3FQ4_B 2DPK_A 2QVM_A 3GIN_B 2QVK_A 2FWU_A 2FWS_A 3E9U_A ....
Probab=41.44 E-value=1.8e+02 Score=23.20 Aligned_cols=35 Identities=23% Similarity=0.134 Sum_probs=19.3
Q ss_pred EEEEeeCCCCCCeecCCeeEEEEeCCCCCCcEEEEEEEe
Q psy8639 373 NILSETVSTSLPTFLSPSYHANVSEDAIPGTYIIKVVAR 411 (570)
Q Consensus 373 ~I~V~dvNd~~P~f~~~~~~~~v~E~~~~gt~v~~v~a~ 411 (570)
+|+|.| ||.+ .+........+.|+. |..-..|.-.
T Consensus 2 tvtI~d-~d~~-~v~f~~~~~~v~E~~--~~~~v~V~~~ 36 (100)
T PF03160_consen 2 TVTILD-DDDP-TVSFSSPSYTVSEGD--GTVTVTVTRS 36 (100)
T ss_dssp EEEEE--TTSE-EEEESSSEEEEETTS--SEEEEEEEEE
T ss_pred EEEEEC-CCCC-EEEEeCCEEEEEeCC--CEEEEEEEEc
Confidence 466677 4544 666555567788875 3333444433
No 48
>PF13754 Big_3_4: Bacterial Ig-like domain (group 3)
Probab=40.21 E-value=83 Score=22.27 Aligned_cols=17 Identities=29% Similarity=0.401 Sum_probs=14.7
Q ss_pred cceEEEEEEEEECCCCC
Q psy8639 236 LTSFILVVKVTDQAANI 252 (570)
Q Consensus 236 ~~~~~l~v~a~D~~~~~ 252 (570)
...|.++++|+|..|+.
T Consensus 23 dG~y~itv~a~D~AGN~ 39 (54)
T PF13754_consen 23 DGTYTITVTATDAAGNT 39 (54)
T ss_pred CccEEEEEEEEeCCCCC
Confidence 56899999999999864
No 49
>smart00089 PKD Repeats in polycystic kidney disease 1 (PKD1) and other proteins. Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.
Probab=37.23 E-value=77 Score=24.05 Aligned_cols=31 Identities=13% Similarity=0.253 Sum_probs=24.4
Q ss_pred cCCccccceEEEEEEEEeCCCCCeeEEEEEEEE
Q psy8639 490 SLDRETVDQYNLVVHVTDSKSGDYDTTNVFITI 522 (570)
Q Consensus 490 ~lD~E~~~~~~l~V~atD~~~~~~~~~~v~I~V 522 (570)
.+-|+..+.|.+++.++|..+ +.+.+++|.|
T Consensus 48 ~~~y~~~G~y~v~l~v~n~~g--~~~~~~~i~v 78 (79)
T smart00089 48 THTYTKPGTYTVTLTVTNAVG--SASATVTVVV 78 (79)
T ss_pred EEEeCCCcEEEEEEEEEcCCC--cEEEEEEEEE
Confidence 455778899999999999884 6666677765
No 50
>cd00146 PKD polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.
Probab=36.86 E-value=76 Score=24.21 Aligned_cols=30 Identities=10% Similarity=0.080 Sum_probs=21.2
Q ss_pred CCCCCCCeEEEEEEEEeCCCCCCCceeEEEEEEE
Q psy8639 123 LDRETCPEYKLEVRALDTSTSSNPQSSAVAIKIE 156 (570)
Q Consensus 123 LD~E~~~~~~l~v~a~D~~g~~~~~s~~~~v~I~ 156 (570)
.-|.....|.++++++|..| .+.+....|.
T Consensus 51 ~~y~~~G~y~v~l~v~d~~g----~~~~~~~~V~ 80 (81)
T cd00146 51 HTYTKPGTYTVTLTVTNAVG----SSSTKTTTVV 80 (81)
T ss_pred EEcCCCcEEEEEEEEEeCCC----CEEEEEEEEE
Confidence 44778899999999999655 3444455444
No 51
>PF08329 ChitinaseA_N: Chitinase A, N-terminal domain; InterPro: IPR013540 This domain is found in a number of bacterial chitinases and similar viral proteins. It is organised into a fibronectin III module domain-like fold, comprising only beta strands. Its function is not known, but it may be involved in interaction with the enzyme substrate, chitin [, ]. It is separated by a hinge region from the catalytic domain (IPR001223 from INTERPRO); this hinge region is probably mobile, allowing the N-terminal domain to have different relative positions in solution []. ; GO: 0004568 chitinase activity; PDB: 2WLY_A 1EDQ_A 2WM0_A 1X6N_A 1NH6_A 2WK2_A 1EHN_A 2WLZ_A 1EIB_A 1FFR_A ....
Probab=35.66 E-value=1e+02 Score=26.60 Aligned_cols=64 Identities=17% Similarity=0.305 Sum_probs=34.7
Q ss_pred eEEEEecCCCCCCcEEEeCCCcEEE--------EcccCCccccceEEEEEEEEeCCCCCeeEEEEEEEEEecCC
Q psy8639 462 NLTYMIPAGVADDMFKIDAETGVVT--------LARSLDRETVDQYNLVVHVTDSKSGDYDTTNVFITILDVND 527 (570)
Q Consensus 462 ~~~y~i~~~~~~~~F~Id~~tG~i~--------~~~~lD~E~~~~~~l~V~atD~~~~~~~~~~v~I~V~dvNd 527 (570)
.+++.+-.|.....+++=. .|.+. ....+.+.+.+.|.++|++.|.. |-+.+..+.|.|.|-.-
T Consensus 41 ~VtwN~WsG~~Gd~~kly~-dG~~V~tG~~~~~~~a~~~~~~gG~y~~~VeLCN~~-GCS~S~~~~V~VaDTDG 112 (133)
T PF08329_consen 41 SVTWNVWSGTNGDTAKLYF-DGVLVWTGPSPQQKSATFTVTKGGRYQMQVELCNAD-GCSTSAPVEVVVADTDG 112 (133)
T ss_dssp EEEEE-SSS---SEEEEEE-TTEEEEEEE--SEEEEEEEE-S-EEEEEEEEEEETT-EEEE---EEEEEE-TTS
T ss_pred EEEEEEecCCCCCEEEEEE-CCEEEEeCCCccCceEEEEecCCCEEEEEEEEECCC-CcccCCCEEEEEeCCCc
Confidence 4555555555444555422 23321 12345556789999999999998 46667789999999654
No 52
>PF02494 HYR: HYR domain; InterPro: IPR003410 This domain is known as the HYR (Hyalin Repeat) domain, after the protein hyalin that is composed exclusively of this repeat. This domain probably corresponds to a new superfamily in the immunoglobulin fold. The function of this domain is uncertain it may be involved in cell adhesion. In the Sushi repeat-containing protein (SrpX), this domain is found between two sushi repeats.
Probab=26.46 E-value=1.2e+02 Score=23.31 Aligned_cols=26 Identities=23% Similarity=0.342 Sum_probs=21.7
Q ss_pred cceEEEEEEEEeCCCCCeeEEEEEEEE
Q psy8639 496 VDQYNLVVHVTDSKSGDYDTTNVFITI 522 (570)
Q Consensus 496 ~~~~~l~V~atD~~~~~~~~~~v~I~V 522 (570)
.+.+.++.+|+|.. |..+++.+.|+|
T Consensus 56 ~G~t~V~ytA~D~~-GN~a~C~f~V~V 81 (81)
T PF02494_consen 56 VGTTTVTYTATDAA-GNSATCSFTVTV 81 (81)
T ss_pred eceEEEEEEEEECC-CCEEEEEEEEEC
Confidence 46799999999987 578899988875
No 53
>smart00089 PKD Repeats in polycystic kidney disease 1 (PKD1) and other proteins. Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.
Probab=25.36 E-value=1.5e+02 Score=22.37 Aligned_cols=28 Identities=11% Similarity=0.054 Sum_probs=21.5
Q ss_pred ccceeEEEEEEEEEECCcCCceeEEEEEEEE
Q psy8639 346 HQSKFRAVLNITASDHGIPPKSSSLVINILS 376 (570)
Q Consensus 346 ~e~~~~~~l~V~a~D~g~~~~s~~~~v~I~V 376 (570)
|.....|.+++.++|... +..++++|.|
T Consensus 51 y~~~G~y~v~l~v~n~~g---~~~~~~~i~v 78 (79)
T smart00089 51 YTKPGTYTVTLTVTNAVG---SASATVTVVV 78 (79)
T ss_pred eCCCcEEEEEEEEEcCCC---cEEEEEEEEE
Confidence 778899999999999654 5566666655
No 54
>PF09100 Qn_am_d_aIV: Quinohemoprotein amine dehydrogenase, alpha subunit domain IV; InterPro: IPR015184 This domain is predominantly found in the prokaryotic protein quinohemoprotein amine dehydrogenase, adopting an immunoglobulin-like beta-sandwich fold, with seven strands arranged into two beta sheets; the fold is possibly related to the immunoglobulin and/or fibronectin type III superfamilies. The precise function of this domain has not, as yet, been defined []. ; PDB: 1JMZ_A 1JMX_A 1PBY_A 1JJU_A.
Probab=23.27 E-value=2.2e+02 Score=24.32 Aligned_cols=34 Identities=18% Similarity=0.263 Sum_probs=16.3
Q ss_pred ceEEEEEEEEeCCCCCeeEEEEEEEEEecCCCCcc
Q psy8639 497 DQYNLVVHVTDSKSGDYDTTNVFITILDVNDNAPE 531 (570)
Q Consensus 497 ~~~~l~V~atD~~~~~~~~~~v~I~V~dvNd~~P~ 531 (570)
+.+.+.-++.|++..++.+..+.|+|.+-|+ +|+
T Consensus 100 Gnl~VvAtv~d~~~~l~~e~~liVtVqr~~~-ppi 133 (133)
T PF09100_consen 100 GNLKVVATVKDGGKPLTGEAHLIVTVQRWNN-PPI 133 (133)
T ss_dssp EEEEEEEEETTTT---EEEEEEEEE---S----S-
T ss_pred ccEEEEEEEccCCcccceeEeEEEEeecccC-CCC
Confidence 4444444455554478899999999999887 663
No 55
>cd02848 Chitinase_N_term Chitinase N-terminus domain. Chitinases hydrolyze the abundant natural biopolymer chitin, producing smaller chito-oligosaccharides. Chitin consists of multiple N-acetyl-D-glucosamine (NAG) residues connected via beta-1,4-glycosidic linkages and is an important structural element of fungal cell wall and arthropod exoskeletons. On the basis of the mode of chitin hydrolysis, chitinases are classified as random, endo-, and exo-chitinases and based on sequence criteria, chitinases belong to families 18 and 19 of glycosyl hydrolases. The N-terminus of chitinase may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitob
Probab=21.20 E-value=3.3e+02 Score=22.52 Aligned_cols=30 Identities=13% Similarity=-0.002 Sum_probs=20.7
Q ss_pred ccceeEEEEEEEEEECCcCCceeEEEEEEEEe
Q psy8639 346 HQSKFRAVLNITASDHGIPPKSSSLVINILSE 377 (570)
Q Consensus 346 ~e~~~~~~l~V~a~D~g~~~~s~~~~v~I~V~ 377 (570)
+...+.|.++|.++|...- +.++.+.|.|.
T Consensus 76 v~kgG~y~m~V~lCn~dGC--S~S~~~~I~VA 105 (106)
T cd02848 76 VGKGGRYQMQVALCNGDGC--STSAAKEIVVA 105 (106)
T ss_pred eCCCCeEEEEEEEECCCCc--cCcCCEEEEec
Confidence 6678899999999997654 33444555444
No 56
>PF08329 ChitinaseA_N: Chitinase A, N-terminal domain; InterPro: IPR013540 This domain is found in a number of bacterial chitinases and similar viral proteins. It is organised into a fibronectin III module domain-like fold, comprising only beta strands. Its function is not known, but it may be involved in interaction with the enzyme substrate, chitin [, ]. It is separated by a hinge region from the catalytic domain (IPR001223 from INTERPRO); this hinge region is probably mobile, allowing the N-terminal domain to have different relative positions in solution []. ; GO: 0004568 chitinase activity; PDB: 2WLY_A 1EDQ_A 2WM0_A 1X6N_A 1NH6_A 2WK2_A 1EHN_A 2WLZ_A 1EIB_A 1FFR_A ....
Probab=20.47 E-value=5.6e+02 Score=22.19 Aligned_cols=33 Identities=9% Similarity=0.040 Sum_probs=21.7
Q ss_pred ccceeEEEEEEEEEECCcCCceeEEEEEEEEeeCC
Q psy8639 346 HQSKFRAVLNITASDHGIPPKSSSLVINILSETVS 380 (570)
Q Consensus 346 ~e~~~~~~l~V~a~D~g~~~~s~~~~v~I~V~dvN 380 (570)
....+.|+++|++++... -+.+..+.|.|.|..
T Consensus 79 ~~~gG~y~~~VeLCN~~G--CS~S~~~~V~VaDTD 111 (133)
T PF08329_consen 79 VTKGGRYQMQVELCNADG--CSTSAPVEVVVADTD 111 (133)
T ss_dssp E-S-EEEEEEEEEEETTE--EEE---EEEEEE-TT
T ss_pred ecCCCEEEEEEEEECCCC--cccCCCEEEEEeCCC
Confidence 667889999999999664 566667888888874
Done!