Query psy13120
Match_columns 384
No_of_seqs 262 out of 1615
Neff 9.8
Searched_HMMs 46136
Date Fri Aug 16 20:08:31 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy13120.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/13120hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG4289|consensus 100.0 1.9E-61 4.1E-66 473.1 33.9 357 2-383 193-554 (2531)
2 KOG4289|consensus 100.0 3.7E-58 8E-63 450.1 37.5 357 1-384 300-660 (2531)
3 KOG1219|consensus 100.0 1.3E-54 2.8E-59 437.3 39.9 360 1-384 776-1138(4289)
4 KOG1219|consensus 100.0 3E-50 6.5E-55 406.0 39.7 359 2-384 672-1033(4289)
5 cd00031 CA Cadherin repeat dom 100.0 2.2E-31 4.8E-36 229.8 28.8 197 82-282 2-199 (199)
6 cd00031 CA Cadherin repeat dom 100.0 1.5E-28 3.2E-33 212.1 26.1 180 186-384 2-182 (199)
7 KOG1834|consensus 99.7 2.1E-16 4.6E-21 147.9 20.2 197 169-383 21-226 (952)
8 PF00028 Cadherin: Cadherin do 99.7 1.6E-16 3.4E-21 119.2 14.6 92 186-281 1-93 (93)
9 PF00028 Cadherin: Cadherin do 99.7 1E-16 2.2E-21 120.3 12.5 91 82-172 1-93 (93)
10 KOG1834|consensus 99.7 8.1E-16 1.8E-20 144.0 19.4 209 66-283 21-245 (952)
11 smart00112 CA Cadherin repeats 99.6 2.7E-15 5.9E-20 108.8 10.1 78 102-179 1-79 (79)
12 smart00112 CA Cadherin repeats 99.6 4.9E-15 1.1E-19 107.5 11.0 79 206-288 1-79 (79)
13 TIGR01965 VCBS_repeat VCBS rep 97.2 0.0031 6.6E-08 46.7 8.5 87 97-194 2-98 (99)
14 PF08266 Cadherin_2: Cadherin- 97.1 0.00046 1E-08 49.8 3.1 62 186-248 3-66 (84)
15 smart00736 CADG Dystroglycan-t 97.1 0.0092 2E-07 44.6 10.2 65 2-73 29-96 (97)
16 PF08758 Cadherin_pro: Cadheri 97.0 0.011 2.3E-07 43.3 9.6 80 178-265 3-82 (90)
17 smart00736 CADG Dystroglycan-t 97.0 0.018 3.9E-07 43.0 10.9 70 205-285 24-96 (97)
18 PF08758 Cadherin_pro: Cadheri 96.9 0.0064 1.4E-07 44.5 7.4 76 287-384 3-78 (90)
19 PF08266 Cadherin_2: Cadherin- 96.8 0.00055 1.2E-08 49.3 1.5 60 83-143 4-66 (84)
20 TIGR01965 VCBS_repeat VCBS rep 96.5 0.031 6.8E-07 41.4 9.0 87 201-302 2-97 (99)
21 PF13750 Big_3_3: Bacterial Ig 96.5 0.38 8.1E-06 39.4 16.0 128 144-282 14-149 (158)
22 PF13750 Big_3_3: Bacterial Ig 96.4 0.42 9.1E-06 39.1 16.4 127 37-172 13-148 (158)
23 PF05345 He_PIG: Putative Ig d 94.7 0.14 2.9E-06 32.8 5.5 36 347-384 11-47 (49)
24 KOG3597|consensus 93.6 4.4 9.6E-05 38.8 15.6 153 61-225 25-197 (442)
25 PF05345 He_PIG: Putative Ig d 93.4 0.43 9.3E-06 30.5 6.0 45 6-52 1-49 (49)
26 KOG3597|consensus 92.8 7.2 0.00016 37.4 15.7 153 163-334 24-194 (442)
27 TIGR00845 caca sodium/calcium 92.4 17 0.00036 38.4 21.0 146 71-225 395-571 (928)
28 TIGR00845 caca sodium/calcium 90.9 24 0.00052 37.3 18.8 148 174-334 395-568 (928)
29 PF07495 Y_Y_Y: Y_Y_Y domain; 89.0 4.9 0.00011 27.1 8.1 60 108-171 6-65 (66)
30 TIGR00864 PCC polycystin catio 84.6 98 0.0021 37.0 27.0 31 140-173 1563-1593(2740)
31 TIGR03660 T1SS_rpt_143 T1SS-14 84.3 19 0.00041 28.6 10.2 65 17-90 57-128 (137)
32 TIGR00864 PCC polycystin catio 83.8 1.1E+02 0.0023 36.8 29.0 204 35-283 1564-1780(2740)
33 TIGR03660 T1SS_rpt_143 T1SS-14 82.1 7.5 0.00016 30.9 7.0 47 249-305 84-130 (137)
34 PF07495 Y_Y_Y: Y_Y_Y domain; 79.3 16 0.00035 24.4 8.4 59 213-280 7-65 (66)
35 PF13753 SWM_repeat: Putative 76.0 65 0.0014 29.6 18.8 109 37-158 10-124 (317)
36 PF13754 Big_3_4: Bacterial Ig 73.9 14 0.00031 23.9 5.4 18 248-265 22-39 (54)
37 PF02010 REJ: REJ domain; Int 70.0 11 0.00023 36.5 6.1 33 247-289 270-302 (440)
38 PF13754 Big_3_4: Bacterial Ig 69.5 16 0.00035 23.7 4.9 29 24-54 11-39 (54)
39 cd02848 Chitinase_N_term Chiti 65.8 14 0.00031 27.7 4.5 62 4-70 37-106 (106)
40 PF12245 Big_3_2: Bacterial Ig 61.7 37 0.00081 22.5 5.6 30 37-71 21-50 (60)
41 cd02848 Chitinase_N_term Chiti 58.9 22 0.00049 26.7 4.5 33 244-282 74-106 (106)
42 PF13753 SWM_repeat: Putative 57.0 1.6E+02 0.0034 27.0 16.8 109 143-265 10-126 (317)
43 PRK14081 triple tyrosine motif 52.7 2.8E+02 0.006 28.5 30.8 223 38-289 349-577 (667)
44 PF12245 Big_3_2: Bacterial Ig 52.2 59 0.0013 21.5 5.4 29 144-174 22-50 (60)
45 PF03160 Calx-beta: Calx-beta 51.7 92 0.002 22.7 10.4 53 168-224 2-54 (100)
46 PF08329 ChitinaseA_N: Chitina 50.9 49 0.0011 26.1 5.4 63 4-72 41-111 (133)
47 smart00089 PKD Repeats in poly 48.8 49 0.0011 22.9 4.9 31 138-171 48-78 (79)
48 PRK14081 triple tyrosine motif 47.6 3.3E+02 0.0072 28.0 34.4 261 38-334 255-520 (667)
49 cd00146 PKD polycystic kidney 47.3 47 0.001 23.1 4.7 21 244-264 51-71 (81)
50 cd00146 PKD polycystic kidney 45.6 53 0.0012 22.8 4.7 29 34-67 52-80 (81)
51 smart00089 PKD Repeats in poly 44.6 62 0.0013 22.3 4.9 30 33-68 49-78 (79)
52 KOG4221|consensus 44.4 4.8E+02 0.01 28.9 24.6 57 121-180 564-620 (1381)
53 PF03160 Calx-beta: Calx-beta 44.1 1.2E+02 0.0027 22.0 7.4 51 277-334 2-52 (100)
54 PF02494 HYR: HYR domain; Int 42.6 55 0.0012 23.0 4.3 25 145-171 57-81 (81)
55 PF02494 HYR: HYR domain; Int 42.0 58 0.0013 22.9 4.4 26 38-68 56-81 (81)
56 PF08329 ChitinaseA_N: Chitina 33.5 1.2E+02 0.0027 23.9 5.2 78 215-303 41-126 (133)
57 PF05895 DUF859: Siphovirus pr 32.2 5.7E+02 0.012 26.2 20.3 191 146-384 212-423 (624)
58 PF09100 Qn_am_d_aIV: Quinohem 26.6 1.1E+02 0.0024 23.7 3.8 34 145-179 98-132 (133)
59 COG4288 Uncharacterized protei 24.5 3.1E+02 0.0067 20.6 5.6 54 34-103 53-107 (124)
60 COG1470 Predicted membrane pro 23.5 7.1E+02 0.015 24.4 15.0 60 144-206 396-468 (513)
61 KOG4221|consensus 20.3 1.2E+03 0.027 26.0 25.3 138 129-289 480-620 (1381)
No 1
>KOG4289|consensus
Probab=100.00 E-value=1.9e-61 Score=473.06 Aligned_cols=357 Identities=36% Similarity=0.554 Sum_probs=338.5
Q ss_pred CccEEEEEEeC---CCCCcEEEeCCCcEEEECccCCcCCCcEEEEEEEEEeCCCCCCCCceeEEEEEEEEEeecCCCCEE
Q psy13120 2 NGRLEYSIVSG---DDEEIFTINPSNGTLSNLKHLDRETKSSYNLVVMATDMAKPPQKRLSSTTQVSIVIRDVNDNAPEF 78 (384)
Q Consensus 2 ~~~i~Y~i~~~---~~~~~F~id~~tG~i~~~~~lD~E~~~~~~l~v~a~d~~~~~~~~~s~~~~v~I~V~dvNd~~P~f 78 (384)
-+++.|++..- -..++|+||+.+|.|++++.||||....+.|+|+|.|.+.|. .|++++|+|.|.|+|||.|+|
T Consensus 193 a~rl~Ysm~al~dsRS~~lFslD~~sG~irta~~lDREt~e~HvlrVtA~d~~~P~---~SAtttv~V~V~D~nDhsPvF 269 (2531)
T KOG4289|consen 193 AGRLYYSMVALFDSRSQNLFSLDPMSGAIRTAKSLDRETKETHVLRVTAQDHGDPR---RSATTTVTVLVLDTNDHSPVF 269 (2531)
T ss_pred cCceEEEeeeccchhccccEeeccccccchhhhhhhhhhhheeEEEEEeeecCCCc---ccceeEEEEEEeecCCCCccc
Confidence 36789999642 234689999999999999999999999999999999999875 699999999999999999999
Q ss_pred ccC-ceeEeeCCCCCCcEEEEEEEEeCCCCCCceEEEEEEcC-CCCCeeeeCcceEEEEceeCCccCCCeEEEEEEEEeC
Q psy13120 79 ITA-NETTIPENSPLNTVVAALKAIDRDEGQNSYVEYSIINS-EATPFSLGPVDGLLRVAQKLDREIKSNYTLEIKAKDR 156 (384)
Q Consensus 79 ~~~-~~~~v~E~~~~gt~v~~~~a~D~D~~~~~~~~y~l~~~-~~~~F~i~~~~G~i~~~~~ld~e~~~~~~l~V~a~d~ 156 (384)
.+. |...+.||.+.|..|.++.|+|.|.++|+.+.|++.++ ....|.|++.+|.|++..+||||+...|.|.|.|+|.
T Consensus 270 Eq~~Y~e~lREn~evGy~vLtvrAtD~Dsp~Nani~Yrl~eg~~~~~f~in~rSGvI~T~a~lDRE~~~~y~L~VeAsDq 349 (2531)
T KOG4289|consen 270 EQDEYREELRENLEVGYEVLTVRATDGDSPPNANIRYRLLEGNAKNVFEINPRSGVISTRAPLDREELESYQLDVEASDQ 349 (2531)
T ss_pred chhHHHHHHhhccccCceEEEEEeccCCCCCCCceEEEecCCCccceeEEcCccceeeccCccCHHhhhheEEEEEeccC
Confidence 998 99999999999999999999999999999999999999 5678999999999999999999999999999999999
Q ss_pred CCCCCceeEEEEEEeccCCCCCCceeccceEEEEecCCCCCcEEEEEEEEeCCCCCCceEEEEEEeCCCCCCeEEECCCc
Q psy13120 157 GEPSKSSRTKFFIRLLDENDNNPIFDSKSYSASVPENISVGATVLQVSATDLDDGINGRVRYTIISGDDNHDLTISEDNG 236 (384)
Q Consensus 157 ~~~~~~~~~~v~I~V~dvnd~~P~f~~~~~~~~v~E~~~~g~~v~~v~a~D~D~~~~~~i~y~i~~~~~~~~f~i~~~~G 236 (384)
|.++...++.|.|+|.|+|||+|+|....|.++|.|+..+++.|.+|.|+|.|.+.|+.+.|+|.+|+..+.|.||..+|
T Consensus 350 G~~pgp~Ta~V~itV~D~NDNaPqFse~~Yvvqv~Edvt~~avvlrV~AtDrD~g~Ng~VHYsi~Sgn~~G~f~id~~tG 429 (2531)
T KOG4289|consen 350 GRPPGPRTAMVEITVEDENDNAPQFSEKRYVVQVREDVTPPAVVLRVTATDRDKGTNGKVHYSIASGNGRGQFYIDSLTG 429 (2531)
T ss_pred CCCCCCceEEEEEEEEecCCCCccccccceEEEecccCCCCceEEEEEecccCCCcCceEEEEeeccCccccEEEecccc
Confidence 99888889999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred EEEEcccCCccccCeEEEEEEEEECCCCCcceeeeeEEEEEEEEeecCCCCCcccCCCeeEEEcCCCCCCCceEEEEEEE
Q psy13120 237 LLRIAKNLNYERKAQYSLTIQAEDCGGDTEERIRFDTATVSILVSDINDNYPVFLHSPYLALVNENTLPGPSFRIMTVQA 316 (384)
Q Consensus 237 ~l~~~~~lD~e~~~~~~l~V~a~D~~~~~~~~~~~~~~~v~i~V~dvNd~~P~f~~~~~~~~v~e~~~~~~g~~i~~v~a 316 (384)
+|.+..+||+|.. .|++.|+|.|++.|++ +.+.-+.|+|.|+|||+|.|...++.++|.|+. +.|..+..+.|
T Consensus 430 el~vv~plD~e~~-~ytl~IrAqDggrPpL----sn~sgl~iqVlDINDhaPifvstpfq~tvlEnv--~lg~~v~~vqa 502 (2531)
T KOG4289|consen 430 ELDVVEPLDFENS-EYTLRIRAQDGGRPPL----SNTSGLVIQVLDINDHAPIFVSTPFQATVLENV--PLGYLVCHVQA 502 (2531)
T ss_pred eEEEeccccccCC-eeEEEEEcccCCCCCc----cCCCceEEEEEecCCCCceeEechhhhhhhhcc--cccceEEEEec
Confidence 9999999999998 9999999999999987 677778899999999999999999999999999 55899999999
Q ss_pred ecCCCCCCCceEEEEEecCCCCCCCCcCCCCCCCceEEeCCcceEEEcccCCcccccEEEEEEEEEc
Q psy13120 317 HDADSLPFNGLVRYFLKDSGASSGNSQTQGGTGDLFRVNSTTGDIYLLKSLDREKQAEYLLTLQAMD 383 (384)
Q Consensus 317 ~D~D~~~~~~~i~y~l~~~~~~~~~~~~~~~~~~~f~Id~~tG~i~~~~~ld~e~~~~~~l~V~a~D 383 (384)
.|+|.+. |+.+.|++.+ .+.|.|+..+|+|++++.||||+...|.|.|+|+|
T Consensus 503 idadsg~-na~l~y~laG--------------~~pf~I~~~SG~Itvtk~ldrEt~~~ysl~V~ard 554 (2531)
T KOG4289|consen 503 IDADSGE-NARLHYSLAG--------------VGPFQINNGSGWITVTKELDRETVEHYSLGVEARD 554 (2531)
T ss_pred ccCCCCc-ccceeeeecc--------------CCCeeEecCCceEEEeecccccccceEEEEEEEcC
Confidence 9999999 9999999976 34899999999999999999999999999999998
No 2
>KOG4289|consensus
Probab=100.00 E-value=3.7e-58 Score=450.09 Aligned_cols=357 Identities=36% Similarity=0.575 Sum_probs=336.3
Q ss_pred CCccEEEEEEeCCCCCcEEEeCCCcEEEECccCCcCCCcEEEEEEEEEeCCCCCCCCceeEEEEEEEEEeecCCCCEEcc
Q psy13120 1 DNGRLEYSIVSGDDEEIFTINPSNGTLSNLKHLDRETKSSYNLVVMATDMAKPPQKRLSSTTQVSIVIRDVNDNAPEFIT 80 (384)
Q Consensus 1 ~~~~i~Y~i~~~~~~~~F~id~~tG~i~~~~~lD~E~~~~~~l~v~a~d~~~~~~~~~s~~~~v~I~V~dvNd~~P~f~~ 80 (384)
.|++|+|++.+|++...|.||+.+|.|+++.+||||+.+.|.|.|+|.|.|.++.+ .++.|.|+|.|+|||+|+|..
T Consensus 300 ~Nani~Yrl~eg~~~~~f~in~rSGvI~T~a~lDRE~~~~y~L~VeAsDqG~~pgp---~Ta~V~itV~D~NDNaPqFse 376 (2531)
T KOG4289|consen 300 PNANIRYRLLEGNAKNVFEINPRSGVISTRAPLDREELESYQLDVEASDQGRPPGP---RTAMVEITVEDENDNAPQFSE 376 (2531)
T ss_pred CCCceEEEecCCCccceeEEcCccceeeccCccCHHhhhheEEEEEeccCCCCCCC---ceEEEEEEEEecCCCCccccc
Confidence 48999999999989899999999999999999999999999999999999987643 389999999999999999999
Q ss_pred C-ceeEeeCCCCCCcEEEEEEEEeCCCCCCceEEEEEEcC-CCCCeeeeCcceEEEEceeCCccCCCeEEEEEEEEeCCC
Q psy13120 81 A-NETTIPENSPLNTVVAALKAIDRDEGQNSYVEYSIINS-EATPFSLGPVDGLLRVAQKLDREIKSNYTLEIKAKDRGE 158 (384)
Q Consensus 81 ~-~~~~v~E~~~~gt~v~~~~a~D~D~~~~~~~~y~l~~~-~~~~F~i~~~~G~i~~~~~ld~e~~~~~~l~V~a~d~~~ 158 (384)
. |.+.|.|+..+++.|.+++|+|.|.|.|+.+.|+|.++ ..|.|.|+..+|+|.+..+||+|.. .|++.|+|.|+|.
T Consensus 377 ~~Yvvqv~Edvt~~avvlrV~AtDrD~g~Ng~VHYsi~Sgn~~G~f~id~~tGel~vv~plD~e~~-~ytl~IrAqDggr 455 (2531)
T KOG4289|consen 377 KRYVVQVREDVTPPAVVLRVTATDRDKGTNGKVHYSIASGNGRGQFYIDSLTGELDVVEPLDFENS-EYTLRIRAQDGGR 455 (2531)
T ss_pred cceEEEecccCCCCceEEEEEecccCCCcCceEEEEeeccCccccEEEecccceEEEeccccccCC-eeEEEEEcccCCC
Confidence 8 99999999999999999999999999999999999998 4788999999999999999999998 9999999999999
Q ss_pred CCCceeEEEEEEeccCCCCCCceeccceEEEEecCCCCCcEEEEEEEEeCCCCCCceEEEEEEeCCCCCCeEEECCCcEE
Q psy13120 159 PSKSSRTKFFIRLLDENDNNPIFDSKSYSASVPENISVGATVLQVSATDLDDGINGRVRYTIISGDDNHDLTISEDNGLL 238 (384)
Q Consensus 159 ~~~~~~~~v~I~V~dvnd~~P~f~~~~~~~~v~E~~~~g~~v~~v~a~D~D~~~~~~i~y~i~~~~~~~~f~i~~~~G~l 238 (384)
|+++...-+.|.|+|+|||+|.|....+.++|.|+.+.|..+..+.|.|.|.+.++.+.|++. | .+.|.|+..+|.|
T Consensus 456 PpLsn~sgl~iqVlDINDhaPifvstpfq~tvlEnv~lg~~v~~vqaidadsg~na~l~y~la-G--~~pf~I~~~SG~I 532 (2531)
T KOG4289|consen 456 PPLSNTSGLVIQVLDINDHAPIFVSTPFQATVLENVPLGYLVCHVQAIDADSGENARLHYSLA-G--VGPFQINNGSGWI 532 (2531)
T ss_pred CCccCCCceEEEEEecCCCCceeEechhhhhhhhcccccceEEEEecccCCCCcccceeeeec-c--CCCeeEecCCceE
Confidence 999999999999999999999999999999999999999999999999999999999999997 3 2479999999999
Q ss_pred EEcccCCccccCeEEEEEEEEECCCCCcceeeeeEEEEEEEEeecCCCCCcccCCCeeEEEcCCCCCCCceEEEEEEEec
Q psy13120 239 RIAKNLNYERKAQYSLTIQAEDCGGDTEERIRFDTATVSILVSDINDNYPVFLHSPYLALVNENTLPGPSFRIMTVQAHD 318 (384)
Q Consensus 239 ~~~~~lD~e~~~~~~l~V~a~D~~~~~~~~~~~~~~~v~i~V~dvNd~~P~f~~~~~~~~v~e~~~~~~g~~i~~v~a~D 318 (384)
++++.||||+...|.|-|.|+|.+.+.. +++++|.|.+.|+||+.|.|.+..|...+.|++ +.|+.|.+++|.|
T Consensus 533 tvtk~ldrEt~~~ysl~V~ard~gtp~l----~tstsI~Vtv~dvndndP~Ft~~eytl~inED~--pvgsSI~tvtAvD 606 (2531)
T KOG4289|consen 533 TVTKELDRETVEHYSLGVEARDHGTPPL----STSTSISVTVLDVNDNDPTFTQKEYTLRINEDA--PVGSSIVTVTAVD 606 (2531)
T ss_pred EEeecccccccceEEEEEEEcCCCCCcc----cccceEEEEecccCCCCCccccCceEEEecCCc--cccceEEEEEEec
Confidence 9999999999999999999999999986 889999999999999999999999999999999 6699999999999
Q ss_pred CCCCCCCceEEEEEecCCCCCCCCcCCCCCCCceEEeCCc--ceEEEcccCCcccccEEEEEEEEEcC
Q psy13120 319 ADSLPFNGLVRYFLKDSGASSGNSQTQGGTGDLFRVNSTT--GDIYLLKSLDREKQAEYLLTLQAMDS 384 (384)
Q Consensus 319 ~D~~~~~~~i~y~l~~~~~~~~~~~~~~~~~~~f~Id~~t--G~i~~~~~ld~e~~~~~~l~V~a~D~ 384 (384)
.|. +..++|.|.+ ++....|.|+... |.|++.-++++.+.+.|.+.|+|+|+
T Consensus 607 ~d~---~s~ityqi~g-----------~ntrn~Fsi~si~g~Glitlalp~dkKqe~~~vl~vtAtDg 660 (2531)
T KOG4289|consen 607 RDA---NSVITYQITG-----------GNTRNRFSISSIGGGGLITLALPLDKKQERQYVLAVTATDG 660 (2531)
T ss_pred ccc---ccceEEEecC-----------CcccccceeeccCCcceEEeecchhhcccceEEEEEEecCC
Confidence 997 6689999986 2347999998765 68888888999999999999999994
No 3
>KOG1219|consensus
Probab=100.00 E-value=1.3e-54 Score=437.29 Aligned_cols=360 Identities=34% Similarity=0.526 Sum_probs=339.5
Q ss_pred CCccEEEEEEeCCCCCcEEEeCCCcEEEECccCCcCCCcEEEEEEEEEeCCCCCCCCceeEEEEEEEEEeecCCCCEEcc
Q psy13120 1 DNGRLEYSIVSGDDEEIFTINPSNGTLSNLKHLDRETKSSYNLVVMATDMAKPPQKRLSSTTQVSIVIRDVNDNAPEFIT 80 (384)
Q Consensus 1 ~~~~i~Y~i~~~~~~~~F~id~~tG~i~~~~~lD~E~~~~~~l~v~a~d~~~~~~~~~s~~~~v~I~V~dvNd~~P~f~~ 80 (384)
+|++++|+|.. +.+.|+||+.||.+.+.+.||||.+..|+|.|+|.|.+.|. +.+...+.|.|.|||||+|.|..
T Consensus 776 ~NG~v~fsL~n--~sdvfsIdp~tGivv~~~sLdrE~q~~y~l~I~a~dqp~pq---~~svv~l~vsvedVndnpPkci~ 850 (4289)
T KOG1219|consen 776 NNGMVSFSLLN--KSDVFSIDPFTGIVVTSKSLDREGQTSYHLKIEARDQPPPQ---LFSVVELDVSVEDVNDNPPKCII 850 (4289)
T ss_pred CCceEEEEecC--CcceEEecCcccEEEeccccCcccCceeEEEEEEcCCCCCc---eEEEEEEEEEEeeccCCCCcccc
Confidence 58999999975 56799999999999999999999999999999999998643 68888999999999999999998
Q ss_pred C-ceeEeeCCCCCCcEEEEEEEEeCCCCCCceEEEEEEcCCCCCeeeeCcceEEEEceeCCccCCCeEEEEEEEEeCCCC
Q psy13120 81 A-NETTIPENSPLNTVVAALKAIDRDEGQNSYVEYSIINSEATPFSLGPVDGLLRVAQKLDREIKSNYTLEIKAKDRGEP 159 (384)
Q Consensus 81 ~-~~~~v~E~~~~gt~v~~~~a~D~D~~~~~~~~y~l~~~~~~~F~i~~~~G~i~~~~~ld~e~~~~~~l~V~a~d~~~~ 159 (384)
. +...|+|+.|.|+.+..+.|.|+|.|++++++|.+.....+ ++++..+|.+++.++||+|+...|.|.|.|.|+|.|
T Consensus 851 ~hsr~kipedlp~gt~~~~l~A~d~diGq~~kvry~l~~~~v~-~rvd~~sGavfi~~~LDf~k~~fynLsv~a~d~g~p 929 (4289)
T KOG1219|consen 851 RHSRSKIPEDLPYGTVTWQLVALDPDIGQLGKVRYYLTDDTVG-ERVDFPSGAVFIGKPLDFEKSDFYNLSVTAVDRGTP 929 (4289)
T ss_pred ccccccCcccCCCceEEEEhhhcCcccCcCceeEEEEecCccc-cccccccccEEEecccccccccceEEEEEEecCCCc
Confidence 8 89999999999999999999999999999999999998765 599999999999999999999999999999999999
Q ss_pred CCceeEEEEEEeccCCCC--CCceeccceEEEEecCCCCCcEEEEEEEEeCCCCCCceEEEEEEeCCCCCCeEEECCCcE
Q psy13120 160 SKSSRTKFFIRLLDENDN--NPIFDSKSYSASVPENISVGATVLQVSATDLDDGINGRVRYTIISGDDNHDLTISEDNGL 237 (384)
Q Consensus 160 ~~~~~~~v~I~V~dvnd~--~P~f~~~~~~~~v~E~~~~g~~v~~v~a~D~D~~~~~~i~y~i~~~~~~~~f~i~~~~G~ 237 (384)
.+++.|.+.|.+.|+|.| ||.|..-.-.+.|.||++.|+.++++.|.|.|.+..+.+.|+|..|+..+.|+|+..+|.
T Consensus 930 ~lss~chl~Vevldv~enlhpp~F~~~v~e~~V~EnapiGT~vi~i~A~dedsgldg~l~Y~I~~gdg~g~FsId~~tG~ 1009 (4289)
T KOG1219|consen 930 ILSSICHLEVEVLDVNENLHPPEFISFVTEGHVLENAPIGTIVIRIQARDEDSGLDGELSYKIRTGDGDGIFSIDSTTGS 1009 (4289)
T ss_pred ceeeeEEEEEEEeccCCCCCCcchheeeeeeeEeecCCcceEEEEEEEecCCCCccceEEEEEEcCCcceeEEecCCcce
Confidence 899999999999999887 899987777899999999999999999999999999999999999999999999999999
Q ss_pred EEEcccCCccccCeEEEEEEEEECCCCCcceeeeeEEEEEEEEeecCCCCCcccCCCeeEEEcCCCCCCCceEEEEEEEe
Q psy13120 238 LRIAKNLNYERKAQYSLTIQAEDCGGDTEERIRFDTATVSILVSDINDNYPVFLHSPYLALVNENTLPGPSFRIMTVQAH 317 (384)
Q Consensus 238 l~~~~~lD~e~~~~~~l~V~a~D~~~~~~~~~~~~~~~v~i~V~dvNd~~P~f~~~~~~~~v~e~~~~~~g~~i~~v~a~ 317 (384)
|++.+.||+|....|.|+|.|+|.+..+. ++.+.+.|.|+|+|||+|+|.++-|..+|.|++ +.+..+.++.|.
T Consensus 1010 irTl~~lDrE~ks~YwltveA~D~gt~~~----ssv~~vyI~ieDvNDn~Pq~s~pvy~asI~enS--p~~vsivq~ea~ 1083 (4289)
T KOG1219|consen 1010 IRTLKALDREKKSSYWLTVEAKDLGTVPL----SSVCEVYIEIEDVNDNVPQFSSPVYYASISENS--PETVSIVQAEAN 1083 (4289)
T ss_pred EeechhhchhhcceEEEEEEEEecCCCcc----ccceeEEEEEEecCCCCcccCCceEeeeeccCC--CCceEEEEeccC
Confidence 99999999999999999999999999886 788999999999999999999999999999999 568899999999
Q ss_pred cCCCCCCCceEEEEEecCCCCCCCCcCCCCCCCceEEeCCcceEEEcccCCcccccEEEEEEEEEcC
Q psy13120 318 DADSLPFNGLVRYFLKDSGASSGNSQTQGGTGDLFRVNSTTGDIYLLKSLDREKQAEYLLTLQAMDS 384 (384)
Q Consensus 318 D~D~~~~~~~i~y~l~~~~~~~~~~~~~~~~~~~f~Id~~tG~i~~~~~ld~e~~~~~~l~V~a~D~ 384 (384)
|+|... ++++.|.|.+ |+..++|+|++.||.|+..+.||||.+..|.|.|++.|+
T Consensus 1084 D~Dsss-n~kLmykI~s-----------Gnyq~FF~Id~~TG~iTt~r~LDRE~qdEHiLeVTi~D~ 1138 (4289)
T KOG1219|consen 1084 DPDSSS-NQKLMYKITS-----------GNYQGFFQIDPETGLITTIRRLDREKQDEHILEVTIQDN 1138 (4289)
T ss_pred CCCccc-CcceEEEEcc-----------CCccceEEEccccceeeeehhhcccccccceEEEEEecC
Confidence 999766 8999999987 466799999999999999999999999999999999985
No 4
>KOG1219|consensus
Probab=100.00 E-value=3e-50 Score=405.99 Aligned_cols=359 Identities=31% Similarity=0.489 Sum_probs=337.2
Q ss_pred CccEEEEEEeCCCCCcEEEeCCCcEEEECccCCcCCCcEEEEEEEEEeCCCCCCCCceeEEEEEEEEEeecCCCCEEccC
Q psy13120 2 NGRLEYSIVSGDDEEIFTINPSNGTLSNLKHLDRETKSSYNLVVMATDMAKPPQKRLSSTTQVSIVIRDVNDNAPEFITA 81 (384)
Q Consensus 2 ~~~i~Y~i~~~~~~~~F~id~~tG~i~~~~~lD~E~~~~~~l~v~a~d~~~~~~~~~s~~~~v~I~V~dvNd~~P~f~~~ 81 (384)
|++++|-|..|+...-|.|+..+|.|.++++||+|....|.|.|+|.|.|.|. .+++....|.|.|.|||+|.|.+.
T Consensus 672 ng~l~yvI~dgne~~~~~Id~qsg~itvas~ld~~~t~~yiLnvta~D~gtPq---kss~r~l~v~vkd~ndn~p~f~e~ 748 (4289)
T KOG1219|consen 672 NGKLVYVIEDGNESICFLIDRQSGNITVASPLDNENTEQYILNVTAYDLGTPQ---KSSWRLLLVFVKDYNDNTPIFVER 748 (4289)
T ss_pred CceEEEEEeCCccceEEEEecccceEEEecchhhhhhheeEEEEEEecCCCch---hhceeeEEEEEEecccCCcccccc
Confidence 78999999999988899999999999999999999999999999999999864 478888899999999999999988
Q ss_pred -ceeEeeCCCCCCcEEEEEEEEeCCCCCCceEEEEEEcCCCCCeeeeCcceEEEEceeCCccCCCeEEEEEEEEeCCCCC
Q psy13120 82 -NETTIPENSPLNTVVAALKAIDRDEGQNSYVEYSIINSEATPFSLGPVDGLLRVAQKLDREIKSNYTLEIKAKDRGEPS 160 (384)
Q Consensus 82 -~~~~v~E~~~~gt~v~~~~a~D~D~~~~~~~~y~l~~~~~~~F~i~~~~G~i~~~~~ld~e~~~~~~l~V~a~d~~~~~ 160 (384)
|.+.|.|+..+|+.|.++.+.|.|.|.|+.++|+|... ...|+|++.+|.+.+.++||||.+..|.|.|.|.|.+.|.
T Consensus 749 sy~vtvsedtepgs~Ia~vetnd~D~g~NG~v~fsL~n~-sdvfsIdp~tGivv~~~sLdrE~q~~y~l~I~a~dqp~pq 827 (4289)
T KOG1219|consen 749 SYHVTVSEDTEPGSFIAHVETNDTDGGNNGMVSFSLLNK-SDVFSIDPFTGIVVTSKSLDREGQTSYHLKIEARDQPPPQ 827 (4289)
T ss_pred ceEEEEecCCCCCceEEEEEecccCCCCCceEEEEecCC-cceEEecCcccEEEeccccCcccCceeEEEEEEcCCCCCc
Confidence 99999999999999999999999999999999999985 3579999999999999999999999999999999999888
Q ss_pred CceeEEEEEEeccCCCCCCceeccceEEEEecCCCCCcEEEEEEEEeCCCCCCceEEEEEEeCCCCCCeEEECCCcEEEE
Q psy13120 161 KSSRTKFFIRLLDENDNNPIFDSKSYSASVPENISVGATVLQVSATDLDDGINGRVRYTIISGDDNHDLTISEDNGLLRI 240 (384)
Q Consensus 161 ~~~~~~v~I~V~dvnd~~P~f~~~~~~~~v~E~~~~g~~v~~v~a~D~D~~~~~~i~y~i~~~~~~~~f~i~~~~G~l~~ 240 (384)
+.+...+.|.|.|+|||+|.+....+...|+|+.+.|+++..+.|.|+|.|+.++++|.+..+. ..+.++--+|.+.+
T Consensus 828 ~~svv~l~vsvedVndnpPkci~~hsr~kipedlp~gt~~~~l~A~d~diGq~~kvry~l~~~~--v~~rvd~~sGavfi 905 (4289)
T KOG1219|consen 828 LFSVVELDVSVEDVNDNPPKCIIRHSRSKIPEDLPYGTVTWQLVALDPDIGQLGKVRYYLTDDT--VGERVDFPSGAVFI 905 (4289)
T ss_pred eEEEEEEEEEEeeccCCCCccccccccccCcccCCCceEEEEhhhcCcccCcCceeEEEEecCc--cccccccccccEEE
Confidence 9999999999999999999999999999999999999999999999999999999999998443 34588888999999
Q ss_pred cccCCccccCeEEEEEEEEECCCCCcceeeeeEEEEEEEEeecCCC--CCcccCCCeeEEEcCCCCCCCceEEEEEEEec
Q psy13120 241 AKNLNYERKAQYSLTIQAEDCGGDTEERIRFDTATVSILVSDINDN--YPVFLHSPYLALVNENTLPGPSFRIMTVQAHD 318 (384)
Q Consensus 241 ~~~lD~e~~~~~~l~V~a~D~~~~~~~~~~~~~~~v~i~V~dvNd~--~P~f~~~~~~~~v~e~~~~~~g~~i~~v~a~D 318 (384)
.++||||...-|+|.|+|.|++.+.. ++.+.+.|.+.|+|++ ||+|....-.+.|.||+ +.|+.++++.|.|
T Consensus 906 ~~~LDf~k~~fynLsv~a~d~g~p~l----ss~chl~Vevldv~enlhpp~F~~~v~e~~V~Ena--piGT~vi~i~A~d 979 (4289)
T KOG1219|consen 906 GKPLDFEKSDFYNLSVTAVDRGTPIL----SSICHLEVEVLDVNENLHPPEFISFVTEGHVLENA--PIGTIVIRIQARD 979 (4289)
T ss_pred ecccccccccceEEEEEEecCCCcce----eeeEEEEEEEeccCCCCCCcchheeeeeeeEeecC--CcceEEEEEEEec
Confidence 99999999999999999999999854 8899999999999887 89999888889999999 6699999999999
Q ss_pred CCCCCCCceEEEEEecCCCCCCCCcCCCCCCCceEEeCCcceEEEcccCCcccccEEEEEEEEEcC
Q psy13120 319 ADSLPFNGLVRYFLKDSGASSGNSQTQGGTGDLFRVNSTTGDIYLLKSLDREKQAEYLLTLQAMDS 384 (384)
Q Consensus 319 ~D~~~~~~~i~y~l~~~~~~~~~~~~~~~~~~~f~Id~~tG~i~~~~~ld~e~~~~~~l~V~a~D~ 384 (384)
.|.+. .+.++|+|.. |+..+.|+||..+|.|...+.||||....|-|+|.|+|.
T Consensus 980 edsgl-dg~l~Y~I~~-----------gdg~g~FsId~~tG~irTl~~lDrE~ks~YwltveA~D~ 1033 (4289)
T KOG1219|consen 980 EDSGL-DGELSYKIRT-----------GDGDGIFSIDSTTGSIRTLKALDREKKSSYWLTVEAKDL 1033 (4289)
T ss_pred CCCCc-cceEEEEEEc-----------CCcceeEEecCCcceEeechhhchhhcceEEEEEEEEec
Confidence 99988 8999999997 456789999999999999999999999999999999984
No 5
>cd00031 CA Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here
Probab=100.00 E-value=2.2e-31 Score=229.77 Aligned_cols=197 Identities=42% Similarity=0.668 Sum_probs=181.9
Q ss_pred ceeEeeCCCCCCcEEEEEEEEeCCCCCCceEEEEEEcCCC-CCeeeeCcceEEEEceeCCccCCCeEEEEEEEEeCCCCC
Q psy13120 82 NETTIPENSPLNTVVAALKAIDRDEGQNSYVEYSIINSEA-TPFSLGPVDGLLRVAQKLDREIKSNYTLEIKAKDRGEPS 160 (384)
Q Consensus 82 ~~~~v~E~~~~gt~v~~~~a~D~D~~~~~~~~y~l~~~~~-~~F~i~~~~G~i~~~~~ld~e~~~~~~l~V~a~d~~~~~ 160 (384)
+.+.|.|+++.|+.++++.|.|+|.+.++.++|+|.++.. ++|.|++.+|.|++.+.||||....|.|.|.|+|.|.+.
T Consensus 2 ~~~~i~En~~~g~~v~~~~a~D~D~~~~~~~~y~i~~~~~~~~F~i~~~tG~l~~~~~lD~e~~~~~~l~v~a~D~g~~~ 81 (199)
T cd00031 2 YSVSVPENAPPGTVVGTVSATDPDSGENGRVTYSILGGNEDGLFSIDPNTGVITTTKPLDREEQSEYTLTVVASDGGGPP 81 (199)
T ss_pred eEEEEeCCCCCCCEEEEEEEECCCCCCCceEEEEEeCCCCcccEEEeCCCCEEEECCCCCCcCCceEEEEEEEEECCcCc
Confidence 5688999999999999999999999888889999999864 799999999999999999999999999999999988877
Q ss_pred CceeEEEEEEeccCCCCCCceeccceEEEEecCCCCCcEEEEEEEEeCCCCCCceEEEEEEeCCCCCCeEEECCCcEEEE
Q psy13120 161 KSSRTKFFIRLLDENDNNPIFDSKSYSASVPENISVGATVLQVSATDLDDGINGRVRYTIISGDDNHDLTISEDNGLLRI 240 (384)
Q Consensus 161 ~~~~~~v~I~V~dvnd~~P~f~~~~~~~~v~E~~~~g~~v~~v~a~D~D~~~~~~i~y~i~~~~~~~~f~i~~~~G~l~~ 240 (384)
.++...++|.|.|+||++|.|....|.+.+.|+.++|+.++++.|+|+|.+.++.+.|+|.++.....|.|++.+|.|.+
T Consensus 82 ~~~~~~v~I~V~d~Nd~~P~~~~~~~~~~v~e~~~~~~~i~~~~a~D~D~~~~~~~~y~l~~~~~~~~f~i~~~~G~i~~ 161 (199)
T cd00031 82 LSSTATVTVTVLDVNDNPPVFEQSSYEASVPENAPPGTVVGTVTATDADSGENAKLTYSILSGNDKELFSIDPNTGIITL 161 (199)
T ss_pred ceeEEEEEEEEccCCCCCCcccccceEEEEeCCCCCCCEEEEEEEEcCCCCCCccEEEEEeCCCCCCEEEEeCCceEEEe
Confidence 77999999999999999999998899999999999999999999999998878899999997665689999999999999
Q ss_pred cccCCccccCeEEEEEEEEECCCCCcceeeeeEEEEEEEEee
Q psy13120 241 AKNLNYERKAQYSLTIQAEDCGGDTEERIRFDTATVSILVSD 282 (384)
Q Consensus 241 ~~~lD~e~~~~~~l~V~a~D~~~~~~~~~~~~~~~v~i~V~d 282 (384)
.+.||||....|.|.|.|+|.+.+.. +.++.++|.|.|
T Consensus 162 ~~~ld~e~~~~~~l~v~a~D~~~~~~----~~~~~i~i~v~d 199 (199)
T cd00031 162 AKPLDREEKSSYELTVVATDGGGPPL----SSTATVTVTVLD 199 (199)
T ss_pred CCccCCccCceEEEEEEEEECCCCCc----eeEEEEEEEEEC
Confidence 99999999999999999999986443 678888888865
No 6
>cd00031 CA Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here
Probab=99.97 E-value=1.5e-28 Score=212.14 Aligned_cols=180 Identities=44% Similarity=0.632 Sum_probs=165.4
Q ss_pred eEEEEecCCCCCcEEEEEEEEeCCCCCCceEEEEEEeCCCCCCeEEECCCcEEEEcccCCccccCeEEEEEEEEECCCCC
Q psy13120 186 YSASVPENISVGATVLQVSATDLDDGINGRVRYTIISGDDNHDLTISEDNGLLRIAKNLNYERKAQYSLTIQAEDCGGDT 265 (384)
Q Consensus 186 ~~~~v~E~~~~g~~v~~v~a~D~D~~~~~~i~y~i~~~~~~~~f~i~~~~G~l~~~~~lD~e~~~~~~l~V~a~D~~~~~ 265 (384)
|.+.+.|+.+.|+.++++.|.|+|.+.+..+.|+|.++....+|.|++.+|.|.+.+.||||....|.|.|+|+|.+.+.
T Consensus 2 ~~~~i~En~~~g~~v~~~~a~D~D~~~~~~~~y~i~~~~~~~~F~i~~~tG~l~~~~~lD~e~~~~~~l~v~a~D~g~~~ 81 (199)
T cd00031 2 YSVSVPENAPPGTVVGTVSATDPDSGENGRVTYSILGGNEDGLFSIDPNTGVITTTKPLDREEQSEYTLTVVASDGGGPP 81 (199)
T ss_pred eEEEEeCCCCCCCEEEEEEEECCCCCCCceEEEEEeCCCCcccEEEeCCCCEEEECCCCCCcCCceEEEEEEEEECCcCc
Confidence 67899999999999999999999987778999999977666899999999999999999999999999999999976665
Q ss_pred cceeeeeEEEEEEEEeecCCCCCcccCCCeeEEEcCCCCCCCceEEEEEEEecCCCCCCCceEEEEEecCCCCCCCCcCC
Q psy13120 266 EERIRFDTATVSILVSDINDNYPVFLHSPYLALVNENTLPGPSFRIMTVQAHDADSLPFNGLVRYFLKDSGASSGNSQTQ 345 (384)
Q Consensus 266 ~~~~~~~~~~v~i~V~dvNd~~P~f~~~~~~~~v~e~~~~~~g~~i~~v~a~D~D~~~~~~~i~y~l~~~~~~~~~~~~~ 345 (384)
. +..+.|+|.|.|+||++|.|....|.+.+.|+.. .|..++++.|+|+|.+. ++.++|+|..
T Consensus 82 ~----~~~~~v~I~V~d~Nd~~P~~~~~~~~~~v~e~~~--~~~~i~~~~a~D~D~~~-~~~~~y~l~~----------- 143 (199)
T cd00031 82 L----SSTATVTVTVLDVNDNPPVFEQSSYEASVPENAP--PGTVVGTVTATDADSGE-NAKLTYSILS----------- 143 (199)
T ss_pred c----eeEEEEEEEEccCCCCCCcccccceEEEEeCCCC--CCCEEEEEEEEcCCCCC-CccEEEEEeC-----------
Confidence 3 5789999999999999999998999999999984 48899999999999875 7899999987
Q ss_pred CCC-CCceEEeCCcceEEEcccCCcccccEEEEEEEEEcC
Q psy13120 346 GGT-GDLFRVNSTTGDIYLLKSLDREKQAEYLLTLQAMDS 384 (384)
Q Consensus 346 ~~~-~~~f~Id~~tG~i~~~~~ld~e~~~~~~l~V~a~D~ 384 (384)
+. .++|.|++.+|.|.+.+.||||....|.|.|.|+|+
T Consensus 144 -~~~~~~f~i~~~~G~i~~~~~ld~e~~~~~~l~v~a~D~ 182 (199)
T cd00031 144 -GNDKELFSIDPNTGIITLAKPLDREEKSSYELTVVATDG 182 (199)
T ss_pred -CCCCCEEEEeCCceEEEeCCccCCccCceEEEEEEEEEC
Confidence 33 389999999999999999999999999999999995
No 7
>KOG1834|consensus
Probab=99.74 E-value=2.1e-16 Score=147.87 Aligned_cols=197 Identities=27% Similarity=0.365 Sum_probs=155.9
Q ss_pred EEeccCCCCCCceeccceEEEEecCCCCCcEEEEEEEEeCCCC---CCceEEEEEEeCCCC-CCeEEECCC--cEEEEcc
Q psy13120 169 IRLLDENDNNPIFDSKSYSASVPENISVGATVLQVSATDLDDG---INGRVRYTIISGDDN-HDLTISEDN--GLLRIAK 242 (384)
Q Consensus 169 I~V~dvnd~~P~f~~~~~~~~v~E~~~~g~~v~~v~a~D~D~~---~~~~i~y~i~~~~~~-~~f~i~~~~--G~l~~~~ 242 (384)
.....+|-+.|.. ...|...|.||--.=.....+.|.|.|.+ .+.-.-|.|.+.+-+ ...-+|..| |.|+...
T Consensus 21 ~~aarankhkpwi-e~ey~gvV~Endntvll~Ppl~aLdkdaplr~ageiC~fklhgq~vPFdavVvdK~TGegvlRaK~ 99 (952)
T KOG1834|consen 21 HHAARANKHKPWI-EEEYHGVVTENDNTVLLDPPLAALDKDAPLRYAGEICGFKLHGQPVPFDAVVVDKYTGEGVLRAKE 99 (952)
T ss_pred cccccccccCccc-ccceeEEEEeCCceEEeCCCeeeecCCCCcccccccceeEecCCCCCceEEEEeccCCceEEeecC
Confidence 4455677777876 56789999999754344456888999874 223467888643221 122346555 5788889
Q ss_pred cCCccccCeEEEEEEEEECCCCCcc--eeeeeEEEEEEEEeecCCCCCcccCCCeeEEEcCCCCCCCceEEEEEEEecCC
Q psy13120 243 NLNYERKAQYSLTIQAEDCGGDTEE--RIRFDTATVSILVSDINDNYPVFLHSPYLALVNENTLPGPSFRIMTVQAHDAD 320 (384)
Q Consensus 243 ~lD~e~~~~~~l~V~a~D~~~~~~~--~~~~~~~~v~i~V~dvNd~~P~f~~~~~~~~v~e~~~~~~g~~i~~v~a~D~D 320 (384)
+||.|.++.|+|+|+|.|++..+.. ..++..++|.|.|.|||+++|+|....|.+.|.|+.. ...|++|.|.|.|
T Consensus 100 ~lDCelqkeytf~iQAydCg~gpdgtn~kKShkatvhIrVkDvNe~AP~f~ep~Yka~V~EGK~---yd~il~veAiD~D 176 (952)
T KOG1834|consen 100 PLDCELQKEYTFTIQAYDCGNGPDGTNTKKSHKATVHIRVKDVNEFAPVFKEPWYKAHVTEGKV---YDSILRVEAIDKD 176 (952)
T ss_pred cccccccccceEEEEEEecCCCCCccccccccceEEEEEeccccccCchhcccceeeEEeccee---eeeeEEEEeecCC
Confidence 9999999999999999998876532 2357789999999999999999999999999999974 4579999999999
Q ss_pred CCCCCceE-EEEEecCCCCCCCCcCCCCCCCceEEeCCcceEEEcccCCcccccEEEEEEEEEc
Q psy13120 321 SLPFNGLV-RYFLKDSGASSGNSQTQGGTGDLFRVNSTTGDIYLLKSLDREKQAEYLLTLQAMD 383 (384)
Q Consensus 321 ~~~~~~~i-~y~l~~~~~~~~~~~~~~~~~~~f~Id~~tG~i~~~~~ld~e~~~~~~l~V~a~D 383 (384)
.++.+++| .|.|.. ..-.|.|| +.|.|..+.+|+|.....|.|+|+|.|
T Consensus 177 Cspq~sqIC~YEI~t-------------~d~PFaId-n~G~irnTekLny~ke~~Y~ltVtAyD 226 (952)
T KOG1834|consen 177 CSPQYSQICEYEITT-------------PDVPFAID-NDGNIRNTEKLNYTKEHQYKLTVTAYD 226 (952)
T ss_pred CCCcccceeEEEecC-------------CCCceEEc-CCCccccccccccccceeEEEEEEEEe
Confidence 98766676 566664 66789998 589999999999999999999999987
No 8
>PF00028 Cadherin: Cadherin domain; InterPro: IPR002126 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion.; GO: 0005509 calcium ion binding, 0007156 homophilic cell adhesion, 0016020 membrane; PDB: 2A4E_A 2A4C_B 2O72_A 2QVI_A 1NCJ_A 3Q2W_A 3Q2N_A 3LNH_B 3LNI_A 3Q2L_A ....
Probab=99.73 E-value=1.6e-16 Score=119.20 Aligned_cols=92 Identities=40% Similarity=0.584 Sum_probs=85.5
Q ss_pred eEEEEecCCCCCcEEEEEEEEeCCCCCCceEEEEEEeCCCCCCeEEECCCcEEEEcccCCccccCeEEEEEEEEEC-CCC
Q psy13120 186 YSASVPENISVGATVLQVSATDLDDGINGRVRYTIISGDDNHDLTISEDNGLLRIAKNLNYERKAQYSLTIQAEDC-GGD 264 (384)
Q Consensus 186 ~~~~v~E~~~~g~~v~~v~a~D~D~~~~~~i~y~i~~~~~~~~f~i~~~~G~l~~~~~lD~e~~~~~~l~V~a~D~-~~~ 264 (384)
|.+.|+|+.++|+.++++.|.|+|.+.++.+.|+|.+++..++|.|++.+|.|.+.+.||||....|+|.|.|+|. +.+
T Consensus 1 Y~~~v~E~~~~g~~v~~v~a~D~D~~~n~~i~y~i~~~~~~~~F~I~~~tg~i~~~~~LD~E~~~~y~l~v~a~D~~~~~ 80 (93)
T PF00028_consen 1 YSFSVPENAPPGTVVGQVTATDPDSGPNSQITYSILGGNPDGLFSIDPNTGEISLKKPLDRETQSSYQLTVRATDSGGSP 80 (93)
T ss_dssp EEEEEETTGSTSSEEEEEEEEESSTSTTSSEEEEEEETTSTTSEEEETTTTEEEESSSSCTTTTSEEEEEEEEEETTTSS
T ss_pred CEEEEECCCCCCCEEEEEEEEeCCCCCCceEEEEEecCcccCceEEeeeeeccccceecCcccCCEEEEEEEEEECCCCC
Confidence 7899999999999999999999999899999999999988999999999999999999999999999999999999 555
Q ss_pred CcceeeeeEEEEEEEEe
Q psy13120 265 TEERIRFDTATVSILVS 281 (384)
Q Consensus 265 ~~~~~~~~~~~v~i~V~ 281 (384)
+ +++.++|+|+|+
T Consensus 81 ~----~~~~~~V~I~V~ 93 (93)
T PF00028_consen 81 P----LSSTATVTINVL 93 (93)
T ss_dssp E----EEEEEEEEEEEE
T ss_pred C----CEEEEEEEEEEC
Confidence 5 478888888874
No 9
>PF00028 Cadherin: Cadherin domain; InterPro: IPR002126 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion.; GO: 0005509 calcium ion binding, 0007156 homophilic cell adhesion, 0016020 membrane; PDB: 2A4E_A 2A4C_B 2O72_A 2QVI_A 1NCJ_A 3Q2W_A 3Q2N_A 3LNH_B 3LNI_A 3Q2L_A ....
Probab=99.72 E-value=1e-16 Score=120.31 Aligned_cols=91 Identities=38% Similarity=0.626 Sum_probs=86.3
Q ss_pred ceeEeeCCCCCCcEEEEEEEEeCCCCCCceEEEEEEcCC-CCCeeeeCcceEEEEceeCCccCCCeEEEEEEEEeC-CCC
Q psy13120 82 NETTIPENSPLNTVVAALKAIDRDEGQNSYVEYSIINSE-ATPFSLGPVDGLLRVAQKLDREIKSNYTLEIKAKDR-GEP 159 (384)
Q Consensus 82 ~~~~v~E~~~~gt~v~~~~a~D~D~~~~~~~~y~l~~~~-~~~F~i~~~~G~i~~~~~ld~e~~~~~~l~V~a~d~-~~~ 159 (384)
|.++|+|++++|+.++++.|.|+|.+.++.+.|+|.++. .++|.|++.+|.|.+.++||||....|.|.|.|+|. +.|
T Consensus 1 Y~~~v~E~~~~g~~v~~v~a~D~D~~~n~~i~y~i~~~~~~~~F~I~~~tg~i~~~~~LD~E~~~~y~l~v~a~D~~~~~ 80 (93)
T PF00028_consen 1 YSFSVPENAPPGTVVGQVTATDPDSGPNSQITYSILGGNPDGLFSIDPNTGEISLKKPLDRETQSSYQLTVRATDSGGSP 80 (93)
T ss_dssp EEEEEETTGSTSSEEEEEEEEESSTSTTSSEEEEEEETTSTTSEEEETTTTEEEESSSSCTTTTSEEEEEEEEEETTTSS
T ss_pred CEEEEECCCCCCCEEEEEEEEeCCCCCCceEEEEEecCcccCceEEeeeeeccccceecCcccCCEEEEEEEEEECCCCC
Confidence 578999999999999999999999999999999999996 789999999999999999999999999999999999 888
Q ss_pred CCceeEEEEEEec
Q psy13120 160 SKSSRTKFFIRLL 172 (384)
Q Consensus 160 ~~~~~~~v~I~V~ 172 (384)
++++.++|+|+|.
T Consensus 81 ~~~~~~~V~I~V~ 93 (93)
T PF00028_consen 81 PLSSTATVTINVL 93 (93)
T ss_dssp EEEEEEEEEEEEE
T ss_pred CCEEEEEEEEEEC
Confidence 9999999999874
No 10
>KOG1834|consensus
Probab=99.71 E-value=8.1e-16 Score=144.02 Aligned_cols=209 Identities=28% Similarity=0.423 Sum_probs=167.1
Q ss_pred EEEEeecCCCCEEccCceeEeeCCCCCCcEEEEEEEEeCCCCC---CceEEEEEEcCCCCCeee---eCcc--eEEEEce
Q psy13120 66 IVIRDVNDNAPEFITANETTIPENSPLNTVVAALKAIDRDEGQ---NSYVEYSIINSEATPFSL---GPVD--GLLRVAQ 137 (384)
Q Consensus 66 I~V~dvNd~~P~f~~~~~~~v~E~~~~gt~v~~~~a~D~D~~~---~~~~~y~l~~~~~~~F~i---~~~~--G~i~~~~ 137 (384)
....-+|-+.|.....|...|.||...=.....+.|-|.|.+- ....-|.|.+.+ -+|.+ |..+ |.|+.+.
T Consensus 21 ~~aarankhkpwie~ey~gvV~Endntvll~Ppl~aLdkdaplr~ageiC~fklhgq~-vPFdavVvdK~TGegvlRaK~ 99 (952)
T KOG1834|consen 21 HHAARANKHKPWIEEEYHGVVTENDNTVLLDPPLAALDKDAPLRYAGEICGFKLHGQP-VPFDAVVVDKYTGEGVLRAKE 99 (952)
T ss_pred cccccccccCcccccceeEEEEeCCceEEeCCCeeeecCCCCcccccccceeEecCCC-CCceEEEEeccCCceEEeecC
Confidence 4456678889999888999999996433333458888888731 223577777753 45654 5555 5899999
Q ss_pred eCCccCCCeEEEEEEEEeCCCC------CCceeEEEEEEeccCCCCCCceeccceEEEEecCCCCCcEEEEEEEEeCCCC
Q psy13120 138 KLDREIKSNYTLEIKAKDRGEP------SKSSRTKFFIRLLDENDNNPIFDSKSYSASVPENISVGATVLQVSATDLDDG 211 (384)
Q Consensus 138 ~ld~e~~~~~~l~V~a~d~~~~------~~~~~~~v~I~V~dvnd~~P~f~~~~~~~~v~E~~~~g~~v~~v~a~D~D~~ 211 (384)
+||.|.++.|+|+|+|.|.|.. ..+..+++.|.|.|+|+.+|+|....|.+.|.|.. .-..|++|.|.|.|-+
T Consensus 100 ~lDCelqkeytf~iQAydCg~gpdgtn~kKShkatvhIrVkDvNe~AP~f~ep~Yka~V~EGK-~yd~il~veAiD~DCs 178 (952)
T KOG1834|consen 100 PLDCELQKEYTFTIQAYDCGNGPDGTNTKKSHKATVHIRVKDVNEFAPVFKEPWYKAHVTEGK-VYDSILRVEAIDKDCS 178 (952)
T ss_pred cccccccccceEEEEEEecCCCCCccccccccceEEEEEeccccccCchhcccceeeEEecce-eeeeeEEEEeecCCCC
Confidence 9999999999999999997643 35778999999999999999999999999999985 4577899999999975
Q ss_pred -CCc-eEEEEEEeCCCCCCeEEECCCcEEEEcccCCccccCeEEEEEEEEECCCCCcceeeeeEEEEEEEEeec
Q psy13120 212 -ING-RVRYTIISGDDNHDLTISEDNGLLRIAKNLNYERKAQYSLTIQAEDCGGDTEERIRFDTATVSILVSDI 283 (384)
Q Consensus 212 -~~~-~i~y~i~~~~~~~~f~i~~~~G~l~~~~~lD~e~~~~~~l~V~a~D~~~~~~~~~~~~~~~v~i~V~dv 283 (384)
+++ -..|.|+. ..-.|.||. .|.|+.+.+|.|.....|.|+|+|.|++.... .+.+.|+|.|..+
T Consensus 179 pq~sqIC~YEI~t--~d~PFaIdn-~G~irnTekLny~ke~~Y~ltVtAyDCg~kra----a~d~lV~v~Vkp~ 245 (952)
T KOG1834|consen 179 PQYSQICEYEITT--PDVPFAIDN-DGNIRNTEKLNYTKEHQYKLTVTAYDCGKKRA----ASDSLVTVHVKPT 245 (952)
T ss_pred CcccceeEEEecC--CCCceEEcC-CCccccccccccccceeEEEEEEEEecccccc----cCcceEEEEecCc
Confidence 333 46899984 556799986 69999999999999999999999999998763 3447888888766
No 11
>smart00112 CA Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
Probab=99.63 E-value=2.7e-15 Score=108.81 Aligned_cols=78 Identities=40% Similarity=0.645 Sum_probs=73.1
Q ss_pred EeCCCCCCceEEEEEEcCCC-CCeeeeCcceEEEEceeCCccCCCeEEEEEEEEeCCCCCCceeEEEEEEeccCCCCCC
Q psy13120 102 IDRDEGQNSYVEYSIINSEA-TPFSLGPVDGLLRVAQKLDREIKSNYTLEIKAKDRGEPSKSSRTKFFIRLLDENDNNP 179 (384)
Q Consensus 102 ~D~D~~~~~~~~y~l~~~~~-~~F~i~~~~G~i~~~~~ld~e~~~~~~l~V~a~d~~~~~~~~~~~v~I~V~dvnd~~P 179 (384)
+|+|.+.++.+.|+|.++.. ++|.|++.+|.|.+.++||||....|.|.|.|+|.+.|++++.+.|+|.|.|+|||+|
T Consensus 1 ~D~D~g~n~~i~Y~i~~~~~~~~F~i~~~tg~i~~~~~LD~e~~~~y~l~v~a~D~~~~~~~~~~~v~I~V~D~Nd~~P 79 (79)
T smart00112 1 TDADSGENGKVTYSILSGNEDGLFSIDPETGEITTTKPLDREEQPEYTLTVEATDGGGPPLSSTATVTVTVLDVNDNAP 79 (79)
T ss_pred CCCCCCcCcEEEEEEecCCCCCEEEEeCCccEEEeCCccCeeCCCeEEEEEEEEECCCCCcccEEEEEEEEEECCCCCC
Confidence 47888888999999998865 8999999999999999999999999999999999999889999999999999999998
No 12
>smart00112 CA Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
Probab=99.62 E-value=4.9e-15 Score=107.45 Aligned_cols=79 Identities=42% Similarity=0.689 Sum_probs=72.1
Q ss_pred EeCCCCCCceEEEEEEeCCCCCCeEEECCCcEEEEcccCCccccCeEEEEEEEEECCCCCcceeeeeEEEEEEEEeecCC
Q psy13120 206 TDLDDGINGRVRYTIISGDDNHDLTISEDNGLLRIAKNLNYERKAQYSLTIQAEDCGGDTEERIRFDTATVSILVSDIND 285 (384)
Q Consensus 206 ~D~D~~~~~~i~y~i~~~~~~~~f~i~~~~G~l~~~~~lD~e~~~~~~l~V~a~D~~~~~~~~~~~~~~~v~i~V~dvNd 285 (384)
+|+|.+.++.+.|+|.+++...+|.|++.+|.|.+.++||||....|.|.|.|+|.+.+.. ++.++|+|.|.|+||
T Consensus 1 ~D~D~g~n~~i~Y~i~~~~~~~~F~i~~~tg~i~~~~~LD~e~~~~y~l~v~a~D~~~~~~----~~~~~v~I~V~D~Nd 76 (79)
T smart00112 1 TDADSGENGKVTYSILSGNEDGLFSIDPETGEITTTKPLDREEQPEYTLTVEATDGGGPPL----SSTATVTVTVLDVND 76 (79)
T ss_pred CCCCCCcCcEEEEEEecCCCCCEEEEeCCccEEEeCCccCeeCCCeEEEEEEEEECCCCCc----ccEEEEEEEEEECCC
Confidence 4788888889999999777668999999999999999999999999999999999988754 789999999999999
Q ss_pred CCC
Q psy13120 286 NYP 288 (384)
Q Consensus 286 ~~P 288 (384)
++|
T Consensus 77 ~~P 79 (79)
T smart00112 77 NAP 79 (79)
T ss_pred CCC
Confidence 998
No 13
>TIGR01965 VCBS_repeat VCBS repeat. This domain of about 100 residues is found multiple (up to 35) copies in long proteins from several species of Vibrio, Colwellia, Bradyrhizobium, and Shewanella (hence the name VCBS) and in smaller copy numbers in proteins from several other bacteria. The large protein size and repeat copy numbers, species distribution, and suggested activities of several member proteins suggests a role for this domain in adhesion.
Probab=97.22 E-value=0.0031 Score=46.73 Aligned_cols=87 Identities=22% Similarity=0.270 Sum_probs=58.6
Q ss_pred EEEEEEeCCCCCCceEEEEEE--cCCCCCeeeeCcceEEEEc--------eeCCccCCCeEEEEEEEEeCCCCCCceeEE
Q psy13120 97 AALKAIDRDEGQNSYVEYSII--NSEATPFSLGPVDGLLRVA--------QKLDREIKSNYTLEIKAKDRGEPSKSSRTK 166 (384)
Q Consensus 97 ~~~~a~D~D~~~~~~~~y~l~--~~~~~~F~i~~~~G~i~~~--------~~ld~e~~~~~~l~V~a~d~~~~~~~~~~~ 166 (384)
++|.++|+|.++. .++++. .+..|.|.|++ .|..... +.|.....-.-.|+|.+.|+ ...+
T Consensus 2 G~Lt~sD~D~gd~--~~~s~~~~~g~yGtlti~~-~G~wtYtl~n~~~avq~L~~Ge~~tdsFtvtv~DG------tt~~ 72 (99)
T TIGR01965 2 GQLTISDADAGQA--HFIAQTDAAGQYGTFSIDA-DGQWTYQADNSQTAVQALKAGETLTDTFTVTSADG------TSQT 72 (99)
T ss_pred CceEEeCCCCCCc--eEEecccccCCcEEEEECC-CCcEEEEeCCCcHHHHhhcCCCEEEEEEEEEEeCC------CeEE
Confidence 5788999998665 556553 33466789987 6765543 12332233456788888886 2788
Q ss_pred EEEEeccCCCCCCceeccceEEEEecCC
Q psy13120 167 FFIRLLDENDNNPIFDSKSYSASVPENI 194 (384)
Q Consensus 167 v~I~V~dvnd~~P~f~~~~~~~~v~E~~ 194 (384)
|+|+|...|| +|+..... ...+.|+.
T Consensus 73 vtItI~GtND-apvi~~~~-~g~v~ED~ 98 (99)
T TIGR01965 73 VTITITGAND-AAVIGGAD-TGSVTEDS 98 (99)
T ss_pred EEEEEEccCC-CCEEeccc-ceeEecCC
Confidence 9999999999 88765433 46676653
No 14
>PF08266 Cadherin_2: Cadherin-like; InterPro: IPR013164 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion. This entry represents a cadherin domain that is usually found at the N terminus of cadherin proteins.; PDB: 1WUZ_A 1WYJ_A.
Probab=97.11 E-value=0.00046 Score=49.76 Aligned_cols=62 Identities=23% Similarity=0.383 Sum_probs=40.6
Q ss_pred eEEEEecCCCCCcEEEEEEEEeCCCCCC--ceEEEEEEeCCCCCCeEEECCCcEEEEcccCCccc
Q psy13120 186 YSASVPENISVGATVLQVSATDLDDGIN--GRVRYTIISGDDNHDLTISEDNGLLRIAKNLNYER 248 (384)
Q Consensus 186 ~~~~v~E~~~~g~~v~~v~a~D~D~~~~--~~i~y~i~~~~~~~~f~i~~~~G~l~~~~~lD~e~ 248 (384)
...+|+|..+.|+.|+.+ |.|...... ....|.+.+.....+|.++..+|.|.+...+|||.
T Consensus 3 i~YsV~EE~~~Gt~IGni-a~dL~l~~~~l~~~~~ri~s~~~~~~~~v~~~tG~L~v~~rIDRE~ 66 (84)
T PF08266_consen 3 IRYSVPEEMPPGTVIGNI-AKDLGLDPQSLSSRNFRIVSEGNSQYFRVNEKTGDLFVSERIDREE 66 (84)
T ss_dssp EEEEEESS--TT-EEEEC-CCCCT--HHHHCCTTBEEE-SSSS-SEEE-TTTSEEEESS--SCCC
T ss_pred eEEEeecCCCCCCEEEEh-HHhhCCCcccccccceEEeecCCcceeEecCCceeEEeCCccCHHH
Confidence 357899999999999998 445432210 12457777777788999999999999999999996
No 15
>smart00736 CADG Dystroglycan-type cadherin-like domains. Cadherin-homologous domains present in metazoan dystroglycans and alpha/epsilon sarcoglycans, yeast Axl2p and in a very large protein from magnetotactic bacteria. Likely to bind calcium ions.
Probab=97.10 E-value=0.0092 Score=44.59 Aligned_cols=65 Identities=28% Similarity=0.366 Sum_probs=50.9
Q ss_pred CccEEEEEEeCC---CCCcEEEeCCCcEEEECccCCcCCCcEEEEEEEEEeCCCCCCCCceeEEEEEEEEEeecC
Q psy13120 2 NGRLEYSIVSGD---DEEIFTINPSNGTLSNLKHLDRETKSSYNLVVMATDMAKPPQKRLSSTTQVSIVIRDVND 73 (384)
Q Consensus 2 ~~~i~Y~i~~~~---~~~~F~id~~tG~i~~~~~lD~E~~~~~~l~v~a~d~~~~~~~~~s~~~~v~I~V~dvNd 73 (384)
+..++|++...+ -+.|.+.|+.++.|+-. +.... ...|.|+|.|+|..| .++...++|.|.+.|+
T Consensus 29 ~~~lty~~~~~~~~~lP~Wl~fd~~~~~~~Gt-P~~~~-~g~~~i~v~a~D~~g-----~~~~~~f~i~V~~~~~ 96 (97)
T smart00736 29 GDTLTYSATLSDGSALPSWLSFDSDTGTLSGT-PTNSD-VGSLSLKVTATDSSG-----ASASDTFTITVVNTND 96 (97)
T ss_pred CCeEEEEEEeCCCCCCCCeEEEeCCCCEEEEE-CCCCC-CcEEEEEEEEEECCC-----CEEEEEEEEEEeCCCC
Confidence 567899997543 24799999999998874 33333 466999999999987 3788899999999886
No 16
>PF08758 Cadherin_pro: Cadherin prodomain like; InterPro: IPR014868 Cadherins are a group of proteins that mediate calcium dependent cell-cell adhesion. They are activated through cleavage of a prosequence in the late Golgi. This protein corresponds to the folded region of the prosequence, and is termed the prodomain. The prodomain shows structural resemblance to the cadherin domain, but lacks all the features known to be important for cadherin-cadherin interactions []. ; GO: 0007155 cell adhesion, 0016021 integral to membrane; PDB: 1OP4_A.
Probab=97.02 E-value=0.011 Score=43.27 Aligned_cols=80 Identities=26% Similarity=0.396 Sum_probs=43.9
Q ss_pred CCceeccceEEEEecCCCCCcEEEEEEEEeCCCCCCceEEEEEEeCCCCCCeEEECCCcEEEEcccCCccccCeEEEEEE
Q psy13120 178 NPIFDSKSYSASVPENISVGATVLQVSATDLDDGINGRVRYTIISGDDNHDLTISEDNGLLRIAKNLNYERKAQYSLTIQ 257 (384)
Q Consensus 178 ~P~f~~~~~~~~v~E~~~~g~~v~~v~a~D~D~~~~~~i~y~i~~~~~~~~f~i~~~~G~l~~~~~lD~e~~~~~~l~V~ 257 (384)
.|-|....|.+.|+.+...|..+++|.-.|... +..+.|... + ..|.|.. .|.|.+++++.... ..-.+.|.
T Consensus 3 ~pGF~~~~~~~~Vp~~l~~g~~lg~V~f~dC~~--~~~~~~~ss--D--pdF~V~~-DGsVy~~r~v~l~~-~~~~F~V~ 74 (90)
T PF08758_consen 3 RPGFSQKKYTFEVPSNLEAGQPLGKVNFEDCTG--RRRVIFESS--D--PDFRVLE-DGSVYAKRPVQLSS-EQRSFTVH 74 (90)
T ss_dssp --B--S-EEEE----SS-SS--EEE---B--SS-----EEEE-------SEEEEET-TTEEEEES--S-SS-S-EEEEEE
T ss_pred cCCcccceEEEEcCchhhCCcEEEEEEeccCCC--CCceEEecC--C--CCEEEcC-CCeEEEeeeEecCC-CceEEEEE
Confidence 478999999999999999999999999999963 455777653 2 2799987 59999999986543 34589999
Q ss_pred EEECCCCC
Q psy13120 258 AEDCGGDT 265 (384)
Q Consensus 258 a~D~~~~~ 265 (384)
|.|..+..
T Consensus 75 a~D~~~~~ 82 (90)
T PF08758_consen 75 AWDSQTQE 82 (90)
T ss_dssp EEETTTTE
T ss_pred EECCCCCe
Confidence 99988764
No 17
>smart00736 CADG Dystroglycan-type cadherin-like domains. Cadherin-homologous domains present in metazoan dystroglycans and alpha/epsilon sarcoglycans, yeast Axl2p and in a very large protein from magnetotactic bacteria. Likely to bind calcium ions.
Probab=96.98 E-value=0.018 Score=42.98 Aligned_cols=70 Identities=27% Similarity=0.309 Sum_probs=53.0
Q ss_pred EEeCCCCCCceEEEEEEeC---CCCCCeEEECCCcEEEEcccCCccccCeEEEEEEEEECCCCCcceeeeeEEEEEEEEe
Q psy13120 205 ATDLDDGINGRVRYTIISG---DDNHDLTISEDNGLLRIAKNLNYERKAQYSLTIQAEDCGGDTEERIRFDTATVSILVS 281 (384)
Q Consensus 205 a~D~D~~~~~~i~y~i~~~---~~~~~f~i~~~~G~l~~~~~lD~e~~~~~~l~V~a~D~~~~~~~~~~~~~~~v~i~V~ 281 (384)
..|+| +..++|++... ..+.|..+++.++.+.-. +.. +..+.|.+.|+|+|+.+. +....++|.|.
T Consensus 24 F~d~d---~~~lty~~~~~~~~~lP~Wl~fd~~~~~~~Gt-P~~-~~~g~~~i~v~a~D~~g~------~~~~~f~i~V~ 92 (97)
T smart00736 24 FTDAD---GDTLTYSATLSDGSALPSWLSFDSDTGTLSGT-PTN-SDVGSLSLKVTATDSSGA------SASDTFTITVV 92 (97)
T ss_pred eECCC---CCeEEEEEEeCCCCCCCCeEEEeCCCCEEEEE-CCC-CCCcEEEEEEEEEECCCC------EEEEEEEEEEe
Confidence 45666 45788988643 336799999999988764 433 235679999999999874 56889999999
Q ss_pred ecCC
Q psy13120 282 DIND 285 (384)
Q Consensus 282 dvNd 285 (384)
+.|+
T Consensus 93 ~~~~ 96 (97)
T smart00736 93 NTND 96 (97)
T ss_pred CCCC
Confidence 9886
No 18
>PF08758 Cadherin_pro: Cadherin prodomain like; InterPro: IPR014868 Cadherins are a group of proteins that mediate calcium dependent cell-cell adhesion. They are activated through cleavage of a prosequence in the late Golgi. This protein corresponds to the folded region of the prosequence, and is termed the prodomain. The prodomain shows structural resemblance to the cadherin domain, but lacks all the features known to be important for cadherin-cadherin interactions []. ; GO: 0007155 cell adhesion, 0016021 integral to membrane; PDB: 1OP4_A.
Probab=96.88 E-value=0.0064 Score=44.48 Aligned_cols=76 Identities=24% Similarity=0.264 Sum_probs=40.5
Q ss_pred CCcccCCCeeEEEcCCCCCCCceEEEEEEEecCCCCCCCceEEEEEecCCCCCCCCcCCCCCCCceEEeCCcceEEEccc
Q psy13120 287 YPVFLHSPYLALVNENTLPGPSFRIMTVQAHDADSLPFNGLVRYFLKDSGASSGNSQTQGGTGDLFRVNSTTGDIYLLKS 366 (384)
Q Consensus 287 ~P~f~~~~~~~~v~e~~~~~~g~~i~~v~a~D~D~~~~~~~i~y~l~~~~~~~~~~~~~~~~~~~f~Id~~tG~i~~~~~ 366 (384)
.|-|.+..|.+.|+.+. ..|..+++|...|... +..+.|...+ ..|.|.+ .|.|+++++
T Consensus 3 ~pGF~~~~~~~~Vp~~l--~~g~~lg~V~f~dC~~---~~~~~~~ssD---------------pdF~V~~-DGsVy~~r~ 61 (90)
T PF08758_consen 3 RPGFSQKKYTFEVPSNL--EAGQPLGKVNFEDCTG---RRRVIFESSD---------------PDFRVLE-DGSVYAKRP 61 (90)
T ss_dssp --B--S-EEEE----SS---SS--EEE---B--SS------EEEE------------------SEEEEET-TTEEEEES-
T ss_pred cCCcccceEEEEcCchh--hCCcEEEEEEeccCCC---CCceEEecCC---------------CCEEEcC-CCeEEEeee
Confidence 58899999999999997 4589999999999864 5578887654 4899985 799999999
Q ss_pred CCcccccEEEEEEEEEcC
Q psy13120 367 LDREKQAEYLLTLQAMDS 384 (384)
Q Consensus 367 ld~e~~~~~~l~V~a~D~ 384 (384)
+...... -.|.|.|.|+
T Consensus 62 v~l~~~~-~~F~V~a~D~ 78 (90)
T PF08758_consen 62 VQLSSEQ-RSFTVHAWDS 78 (90)
T ss_dssp -S-SSS--EEEEEEEEET
T ss_pred EecCCCc-eEEEEEEECC
Confidence 8755443 4799999885
No 19
>PF08266 Cadherin_2: Cadherin-like; InterPro: IPR013164 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion. This entry represents a cadherin domain that is usually found at the N terminus of cadherin proteins.; PDB: 1WUZ_A 1WYJ_A.
Probab=96.82 E-value=0.00055 Score=49.35 Aligned_cols=60 Identities=25% Similarity=0.406 Sum_probs=37.3
Q ss_pred eeEeeCCCCCCcEEEEEEEEeCCCCCC--ceEEEEEEcC-CCCCeeeeCcceEEEEceeCCccC
Q psy13120 83 ETTIPENSPLNTVVAALKAIDRDEGQN--SYVEYSIINS-EATPFSLGPVDGLLRVAQKLDREI 143 (384)
Q Consensus 83 ~~~v~E~~~~gt~v~~~~a~D~D~~~~--~~~~y~l~~~-~~~~F~i~~~~G~i~~~~~ld~e~ 143 (384)
..+|+|..+.|+.||.+. .|...... ..-.|++.+. ...+|.++..+|.|.++..+|||.
T Consensus 4 ~YsV~EE~~~Gt~IGnia-~dL~l~~~~l~~~~~ri~s~~~~~~~~v~~~tG~L~v~~rIDRE~ 66 (84)
T PF08266_consen 4 RYSVPEEMPPGTVIGNIA-KDLGLDPQSLSSRNFRIVSEGNSQYFRVNEKTGDLFVSERIDREE 66 (84)
T ss_dssp EEEEESS--TT-EEEECC-CCCT--HHHHCCTTBEEE-SSSS-SEEE-TTTSEEEESS--SCCC
T ss_pred EEEeecCCCCCCEEEEhH-HhhCCCcccccccceEEeecCCcceeEecCCceeEEeCCccCHHH
Confidence 468999999999999994 45432211 1124665554 466899999999999999999996
No 20
>TIGR01965 VCBS_repeat VCBS repeat. This domain of about 100 residues is found multiple (up to 35) copies in long proteins from several species of Vibrio, Colwellia, Bradyrhizobium, and Shewanella (hence the name VCBS) and in smaller copy numbers in proteins from several other bacteria. The large protein size and repeat copy numbers, species distribution, and suggested activities of several member proteins suggests a role for this domain in adhesion.
Probab=96.53 E-value=0.031 Score=41.43 Aligned_cols=87 Identities=20% Similarity=0.213 Sum_probs=55.0
Q ss_pred EEEEEEeCCCCCCceEEEEEEe-CCCCCCeEEECCCcEEEEc--------ccCCccccCeEEEEEEEEECCCCCcceeee
Q psy13120 201 LQVSATDLDDGINGRVRYTIIS-GDDNHDLTISEDNGLLRIA--------KNLNYERKAQYSLTIQAEDCGGDTEERIRF 271 (384)
Q Consensus 201 ~~v~a~D~D~~~~~~i~y~i~~-~~~~~~f~i~~~~G~l~~~--------~~lD~e~~~~~~l~V~a~D~~~~~~~~~~~ 271 (384)
+++.++|+|.+. ...+++.. ....+.|.|++ .|..... +.|.--+...-.|+|++.|+.
T Consensus 2 G~Lt~sD~D~gd--~~~~s~~~~~g~yGtlti~~-~G~wtYtl~n~~~avq~L~~Ge~~tdsFtvtv~DGt--------- 69 (99)
T TIGR01965 2 GQLTISDADAGQ--AHFIAQTDAAGQYGTFSIDA-DGQWTYQADNSQTAVQALKAGETLTDTFTVTSADGT--------- 69 (99)
T ss_pred CceEEeCCCCCC--ceEEecccccCCcEEEEECC-CCcEEEEeCCCcHHHHhhcCCCEEEEEEEEEEeCCC---------
Confidence 457888888643 34555531 12345578876 5655432 223322344678889999962
Q ss_pred eEEEEEEEEeecCCCCCcccCCCeeEEEcCC
Q psy13120 272 DTATVSILVSDINDNYPVFLHSPYLALVNEN 302 (384)
Q Consensus 272 ~~~~v~i~V~dvNd~~P~f~~~~~~~~v~e~ 302 (384)
.++|+|+|.-.|| +|+... .-...+.|+
T Consensus 70 -t~~vtItI~GtND-apvi~~-~~~g~v~ED 97 (99)
T TIGR01965 70 -SQTVTITITGAND-AAVIGG-ADTGSVTED 97 (99)
T ss_pred -eEEEEEEEEccCC-CCEEec-ccceeEecC
Confidence 5889999999999 887653 334666665
No 21
>PF13750 Big_3_3: Bacterial Ig-like domain (group 3)
Probab=96.47 E-value=0.38 Score=39.41 Aligned_cols=128 Identities=23% Similarity=0.365 Sum_probs=70.1
Q ss_pred CCeEEEEE-EEEeCCCCCCceeEEEEEEeccCCCCCCceeccceEEEEecCCCC-CcEEEEEEEEeCCCCCCceEEEEEE
Q psy13120 144 KSNYTLEI-KAKDRGEPSKSSRTKFFIRLLDENDNNPIFDSKSYSASVPENISV-GATVLQVSATDLDDGINGRVRYTII 221 (384)
Q Consensus 144 ~~~~~l~V-~a~d~~~~~~~~~~~v~I~V~dvnd~~P~f~~~~~~~~v~E~~~~-g~~v~~v~a~D~D~~~~~~i~y~i~ 221 (384)
...|.+++ .|.|..+ ......+...+. ++..+|.+.. .....+..+... +..=..+.++|..... .--..+|.
T Consensus 14 dG~Y~l~~~~a~D~ag--N~~~~~~~~~~~-iD~T~Ptisi-~~~~~~~~g~~v~~~~~i~i~~tD~~~~~-~i~sv~l~ 88 (158)
T PF13750_consen 14 DGSYTLTVVTATDAAG--NTSTSTVSETFT-IDNTPPTISI-SDGASVANGSTVYGLVNISINVTDNSDDS-KITSVSLT 88 (158)
T ss_pred CccEEEEEEEEEecCC--CEEEEEEeeEEE-EcCCCCEEEE-ecCCccCCCccccceeeeEEEEEeCCCCc-eEEEEEEE
Confidence 46899999 7999865 333344332222 2345887754 111233333322 2233457777766433 33556776
Q ss_pred eCCCCCCeEEE--C-CCcEEEEc--ccC-CccccCeEEEEEEEEECCCCCcceeeeeEEEEEEEEee
Q psy13120 222 SGDDNHDLTIS--E-DNGLLRIA--KNL-NYERKAQYSLTIQAEDCGGDTEERIRFDTATVSILVSD 282 (384)
Q Consensus 222 ~~~~~~~f~i~--~-~~G~l~~~--~~l-D~e~~~~~~l~V~a~D~~~~~~~~~~~~~~~v~i~V~d 282 (384)
+|.......+. . ..|...+. +.+ ..|....|+|+|.|+|..+.. .+..+......
T Consensus 89 Gg~~~d~v~ls~~~~~~~~~~~~yp~~fpsle~~~~YtLtV~a~D~aGN~------~~~si~F~y~P 149 (158)
T PF13750_consen 89 GGPASDSVSLSWTNKGNGVYTLEYPRIFPSLEADDSYTLTVSATDKAGNQ------STKSISFSYMP 149 (158)
T ss_pred CCcccceEEEeeEeccCceEEeecccccCCcCCCCeEEEEEEEEecCCCE------EEEEEEEEEeC
Confidence 55544433321 1 13333332 111 236678999999999998874 46666666553
No 22
>PF13750 Big_3_3: Bacterial Ig-like domain (group 3)
Probab=96.40 E-value=0.42 Score=39.13 Aligned_cols=127 Identities=21% Similarity=0.331 Sum_probs=71.7
Q ss_pred CCcEEEEEE-EEEeCCCCCCCCceeEEEEEEEEEeecCCCCEEccCceeEeeCCCCC-CcEEEEEEEEeCCCCCCceEEE
Q psy13120 37 TKSSYNLVV-MATDMAKPPQKRLSSTTQVSIVIRDVNDNAPEFITANETTIPENSPL-NTVVAALKAIDRDEGQNSYVEY 114 (384)
Q Consensus 37 ~~~~~~l~v-~a~d~~~~~~~~~s~~~~v~I~V~dvNd~~P~f~~~~~~~v~E~~~~-gt~v~~~~a~D~D~~~~~~~~y 114 (384)
....|.+++ +|.|..|. ..+..+..++. +...||.+.-.....+..+... +..-..+.++|...+. ..-..
T Consensus 13 ~dG~Y~l~~~~a~D~agN-----~~~~~~~~~~~-iD~T~Ptisi~~~~~~~~g~~v~~~~~i~i~~tD~~~~~-~i~sv 85 (158)
T PF13750_consen 13 PDGSYTLTVVTATDAAGN-----TSTSTVSETFT-IDNTPPTISISDGASVANGSTVYGLVNISINVTDNSDDS-KITSV 85 (158)
T ss_pred CCccEEEEEEEEEecCCC-----EEEEEEeeEEE-EcCCCCEEEEecCCccCCCccccceeeeEEEEEeCCCCc-eEEEE
Confidence 456899999 89999874 34444443333 3456888755222223333222 3333457787776543 34567
Q ss_pred EEEcCCC-CCe--eeeC-cceEEEEc--ee-CCccCCCeEEEEEEEEeCCCCCCceeEEEEEEec
Q psy13120 115 SIINSEA-TPF--SLGP-VDGLLRVA--QK-LDREIKSNYTLEIKAKDRGEPSKSSRTKFFIRLL 172 (384)
Q Consensus 115 ~l~~~~~-~~F--~i~~-~~G~i~~~--~~-ld~e~~~~~~l~V~a~d~~~~~~~~~~~v~I~V~ 172 (384)
+|.++.. ..- .... ..|...+. +. ...|....|+|+|.|.|..+ ...+..+.....
T Consensus 86 ~l~Gg~~~d~v~ls~~~~~~~~~~~~yp~~fpsle~~~~YtLtV~a~D~aG--N~~~~si~F~y~ 148 (158)
T PF13750_consen 86 SLTGGPASDSVSLSWTNKGNGVYTLEYPRIFPSLEADDSYTLTVSATDKAG--NQSTKSISFSYM 148 (158)
T ss_pred EEECCcccceEEEeeEeccCceEEeecccccCCcCCCCeEEEEEEEEecCC--CEEEEEEEEEEe
Confidence 7776642 211 1111 13333332 11 13477889999999999865 556666666554
No 23
>PF05345 He_PIG: Putative Ig domain; InterPro: IPR008009 This alignment represents the conserved core region of a ~90 residue repeat found in several haemagglutinins and other cell surface proteins. Sequence similarities to Hyalin (IPR003410 from INTERPRO) and the PKD domain (IPR000601 from INTERPRO) suggest an Ig-like fold so this family may be similar in function to the (IPR003791 from INTERPRO) and (IPR003790 from INTERPRO) protein families.
Probab=94.71 E-value=0.14 Score=32.77 Aligned_cols=36 Identities=22% Similarity=0.299 Sum_probs=29.3
Q ss_pred CCCCceEEeCCcceEEEcccCCcc-cccEEEEEEEEEcC
Q psy13120 347 GTGDLFRVNSTTGDIYLLKSLDRE-KQAEYLLTLQAMDS 384 (384)
Q Consensus 347 ~~~~~f~Id~~tG~i~~~~~ld~e-~~~~~~l~V~a~D~ 384 (384)
.-+.++.||+.+|.|+-. .+.. ..+.|.+.|.|+|+
T Consensus 11 ~LP~gLs~d~~tG~isGt--p~~~~~~G~y~~~vtatd~ 47 (49)
T PF05345_consen 11 GLPSGLSLDPSTGTISGT--PTSSVQPGTYTFTVTATDG 47 (49)
T ss_pred CCCCcEEEeCCCCEEEee--cCCCccccEEEEEEEEEcC
Confidence 457899999999999987 3333 34799999999985
No 24
>KOG3597|consensus
Probab=93.64 E-value=4.4 Score=38.81 Aligned_cols=153 Identities=20% Similarity=0.218 Sum_probs=86.2
Q ss_pred EEEEEEEEEeecCCCCEEccC-ceeEeeCCCCCCcEEEEEEEEeCCCCCCceEEEEEEcCCCCC-----eeeeC------
Q psy13120 61 TTQVSIVIRDVNDNAPEFITA-NETTIPENSPLNTVVAALKAIDRDEGQNSYVEYSIINSEATP-----FSLGP------ 128 (384)
Q Consensus 61 ~~~v~I~V~dvNd~~P~f~~~-~~~~v~E~~~~gt~v~~~~a~D~D~~~~~~~~y~l~~~~~~~-----F~i~~------ 128 (384)
+....|.|..+||.+..+... +.+.+.|+...-.....+.+.|+|.+.- .+.|++.+..... |..-.
T Consensus 25 ~~~~~i~v~pvndpp~~~~~~~~~l~~~~~~~k~l~~~~l~~~d~d~~~~-~l~f~v~~t~~~~~~~~~~~~~g~~~~~F 103 (442)
T KOG3597|consen 25 TDVLRIHVNPVNDPPSLIFPSGSLLVILEGGQKVLDPELLTAADPDSAPL-PLEFQVLGTSSVPLPVLKFDVPGAPATEF 103 (442)
T ss_pred EeeecccccccCCCcceeecccceEEeecCCceeccceEeeccCCCCCcc-ceEEEEccCCCCCCccceeeccCCcccce
Confidence 456779999999955555554 6688888876555566788999997543 3788888764322 33322
Q ss_pred -----cceEEEEceeCCcc--CCCeEEEEEEEEeCCCCCCceeEEEEEEeccCCCCCCceecc-ceEEEEecCCCCCcEE
Q psy13120 129 -----VDGLLRVAQKLDRE--IKSNYTLEIKAKDRGEPSKSSRTKFFIRLLDENDNNPIFDSK-SYSASVPENISVGATV 200 (384)
Q Consensus 129 -----~~G~i~~~~~ld~e--~~~~~~l~V~a~d~~~~~~~~~~~v~I~V~dvnd~~P~f~~~-~~~~~v~E~~~~g~~v 200 (384)
..|.+.+ ++. +.....++..++|+-. .+. .+..... ...|.+-.. .....+.-....--.-
T Consensus 104 s~~~v~~g~~~y----vh~g~el~~~~~~~~~SDg~~---~S~-~~i~~~~---~~~~~~~~~~~~gL~v~~gS~~~IT~ 172 (442)
T KOG3597|consen 104 SYEEVEDGSLSY----VHSGTELRESELQLRVSDGLL---VSE-RAILKVE---ATGPAPHLARNTGLKVLQGSTAPITP 172 (442)
T ss_pred EehHhhcCceeE----EecCcccccceEEEEeecceE---eee-eEEeccc---CCCcceeeecccceEEccCccccccH
Confidence 1222222 222 2456788899999832 222 2212222 123333111 1123333332222222
Q ss_pred EEEEEEeCCCCCCceEEEEEEeCCC
Q psy13120 201 LQVSATDLDDGINGRVRYTIISGDD 225 (384)
Q Consensus 201 ~~v~a~D~D~~~~~~i~y~i~~~~~ 225 (384)
..+.+.|.|.+....+.|.|.....
T Consensus 173 ~~L~ved~d~~~d~~v~~~i~~~P~ 197 (442)
T KOG3597|consen 173 SNLSVEDNDSSPDDEVRYDITPPPQ 197 (442)
T ss_pred hHceeecCCCCCCcEEEEEecCCCC
Confidence 3488889887667789999985443
No 25
>PF05345 He_PIG: Putative Ig domain; InterPro: IPR008009 This alignment represents the conserved core region of a ~90 residue repeat found in several haemagglutinins and other cell surface proteins. Sequence similarities to Hyalin (IPR003410 from INTERPRO) and the PKD domain (IPR000601 from INTERPRO) suggest an Ig-like fold so this family may be similar in function to the (IPR003791 from INTERPRO) and (IPR003790 from INTERPRO) protein families.
Probab=93.44 E-value=0.43 Score=30.46 Aligned_cols=45 Identities=29% Similarity=0.465 Sum_probs=31.7
Q ss_pred EEEEE---eCCCCCcEEEeCCCcEEEECccCCcC-CCcEEEEEEEEEeCCC
Q psy13120 6 EYSIV---SGDDEEIFTINPSNGTLSNLKHLDRE-TKSSYNLVVMATDMAK 52 (384)
Q Consensus 6 ~Y~i~---~~~~~~~F~id~~tG~i~~~~~lD~E-~~~~~~l~v~a~d~~~ 52 (384)
+|++. .+.-+.+..+|+.+|.|+-.-. .. ....|.++|+|+|..|
T Consensus 1 Tys~~~~~~~~LP~gLs~d~~tG~isGtp~--~~~~~G~y~~~vtatd~~G 49 (49)
T PF05345_consen 1 TYSLTTPTGGGLPSGLSLDPSTGTISGTPT--SSVQPGTYTFTVTATDGSG 49 (49)
T ss_pred CEEEEcCCCCCCCCcEEEeCCCCEEEeecC--CCccccEEEEEEEEEcCCC
Confidence 36763 2233468999999999986633 22 2358999999999764
No 26
>KOG3597|consensus
Probab=92.85 E-value=7.2 Score=37.42 Aligned_cols=153 Identities=14% Similarity=0.070 Sum_probs=84.5
Q ss_pred eeEEEEEEeccCCCCCCceeccceEEEEecCCCCCcEEEEEEEEeCCCCCCceEEEEEEeCCCC----CCeEEECC----
Q psy13120 163 SRTKFFIRLLDENDNNPIFDSKSYSASVPENISVGATVLQVSATDLDDGINGRVRYTIISGDDN----HDLTISED---- 234 (384)
Q Consensus 163 ~~~~v~I~V~dvnd~~P~f~~~~~~~~v~E~~~~g~~v~~v~a~D~D~~~~~~i~y~i~~~~~~----~~f~i~~~---- 234 (384)
.+....|.|..+||.+..+....+.+.+.|+...-.....+.+.|+|... ..+.|++...... ..|.....
T Consensus 24 ~~~~~~i~v~pvndpp~~~~~~~~~l~~~~~~~k~l~~~~l~~~d~d~~~-~~l~f~v~~t~~~~~~~~~~~~~g~~~~~ 102 (442)
T KOG3597|consen 24 QTDVLRIHVNPVNDPPSLIFPSGSLLVILEGGQKVLDPELLTAADPDSAP-LPLEFQVLGTSSVPLPVLKFDVPGAPATE 102 (442)
T ss_pred EEeeecccccccCCCcceeecccceEEeecCCceeccceEeeccCCCCCc-cceEEEEccCCCCCCccceeeccCCcccc
Confidence 45667899999999666565555568888887655555668888988753 3478888732111 11332221
Q ss_pred -------CcEEEEcccCCcc--ccCeEEEEEEEEECCCCCcceeeeeEEEEEEEEeecCCCCCcccC-CCeeEEEcCCCC
Q psy13120 235 -------NGLLRIAKNLNYE--RKAQYSLTIQAEDCGGDTEERIRFDTATVSILVSDINDNYPVFLH-SPYLALVNENTL 304 (384)
Q Consensus 235 -------~G~l~~~~~lD~e--~~~~~~l~V~a~D~~~~~~~~~~~~~~~v~i~V~dvNd~~P~f~~-~~~~~~v~e~~~ 304 (384)
.|.+.. ++. ......++..++|+-.- +. .+....... .|.+.. ......|.-+..
T Consensus 103 Fs~~~v~~g~~~y----vh~g~el~~~~~~~~~SDg~~~-------S~-~~i~~~~~~---~~~~~~~~~~gL~v~~gS~ 167 (442)
T KOG3597|consen 103 FSYEEVEDGSLSY----VHSGTELRESELQLRVSDGLLV-------SE-RAILKVEAT---GPAPHLARNTGLKVLQGST 167 (442)
T ss_pred eEehHhhcCceeE----EecCcccccceEEEEeecceEe-------ee-eEEecccCC---CcceeeecccceEEccCcc
Confidence 222222 222 24567788888887542 11 222222222 333221 112233433332
Q ss_pred CCCceEEEEEEEecCCCCCCCceEEEEEec
Q psy13120 305 PGPSFRIMTVQAHDADSLPFNGLVRYFLKD 334 (384)
Q Consensus 305 ~~~g~~i~~v~a~D~D~~~~~~~i~y~l~~ 334 (384)
. -..-..+++.|.|..+ ...+.|.|..
T Consensus 168 ~--~IT~~~L~ved~d~~~-d~~v~~~i~~ 194 (442)
T KOG3597|consen 168 A--PITPSNLSVEDNDSSP-DDEVRYDITP 194 (442)
T ss_pred c--cccHhHceeecCCCCC-CcEEEEEecC
Confidence 1 1112358889998655 6789999886
No 27
>TIGR00845 caca sodium/calcium exchanger 1. This model is specific for the eukaryotic sodium ion/calcium ion exchangers of the Caca family
Probab=92.40 E-value=17 Score=38.44 Aligned_cols=146 Identities=21% Similarity=0.280 Sum_probs=78.0
Q ss_pred ecCCCCEEccC-ceeEeeCCCCCCcEEEEEEEEeCCCCCCceEEEEEEcCCC---CCeeeeCcceEEEEc----------
Q psy13120 71 VNDNAPEFITA-NETTIPENSPLNTVVAALKAIDRDEGQNSYVEYSIINSEA---TPFSLGPVDGLLRVA---------- 136 (384)
Q Consensus 71 vNd~~P~f~~~-~~~~v~E~~~~gt~v~~~~a~D~D~~~~~~~~y~l~~~~~---~~F~i~~~~G~i~~~---------- 136 (384)
.||.++.+... ....|.|+. |+.-..+.-...|.+..-.+.|+..++.+ ..|.- .+|.|...
T Consensus 395 ~dd~~s~i~Fe~~~Y~V~En~--GtV~VtV~R~GGdl~~tVsVdY~T~DGTA~AG~DY~~--~sGTLtF~PGEt~KtItV 470 (928)
T TIGR00845 395 ENDPVSKIFFEPGHYTCLENC--GTVALTVVRRGGDLTNTVYVDYRTEDGTANAGSDYEF--TEGTLVFKPGETQKEFRI 470 (928)
T ss_pred ccCCcceEEecCCeEEEeecC--cEEEEEEEEccCCCCceEEEEEEccCCccCCCCCccc--cCceEEECCCceEEEEEE
Confidence 44545566555 678899995 77666665544454444568898877732 12321 23444321
Q ss_pred eeCC---ccCCCeEEEEEEEEeCCC-------------CCCceeEEEEEEeccCCCCCCceeccceEEEEecCCCCCcEE
Q psy13120 137 QKLD---REIKSNYTLEIKAKDRGE-------------PSKSSRTKFFIRLLDENDNNPIFDSKSYSASVPENISVGATV 200 (384)
Q Consensus 137 ~~ld---~e~~~~~~l~V~a~d~~~-------------~~~~~~~~v~I~V~dvnd~~P~f~~~~~~~~v~E~~~~g~~v 200 (384)
.-+| +|..+.|.+.+.--..+. ..+......+|+|.| ||++|.|....-...|.|+. |..-
T Consensus 471 ~IIDDdi~E~DE~F~V~LSNp~~g~~~G~~~~~~~~~~A~Lg~ps~ATVTIlD-DD~aGIfsFe~~~~sV~Es~--G~vt 547 (928)
T TIGR00845 471 GIIDDDIFEEDEHFYVRLSNLRVGSEDGILEANHVSAVAQLASPNTATVTILD-DDHAGIFTFEEDVFHVSESI--GIME 547 (928)
T ss_pred EEccCCCCCCCceEEEEEeCCCCCCcccccccccccccceecCCceEEEEEec-CcccCcccccCceEEEEcCC--CEEE
Confidence 1122 455556665554322110 112233456677777 66788877666668899985 4443
Q ss_pred EEEE-EEeCCCCCCceEEEEEEeCCC
Q psy13120 201 LQVS-ATDLDDGINGRVRYTIISGDD 225 (384)
Q Consensus 201 ~~v~-a~D~D~~~~~~i~y~i~~~~~ 225 (384)
.+|. ..+.+ ..-.+.|....|..
T Consensus 548 vtV~RtsGa~--G~VtV~Y~T~dGTA 571 (928)
T TIGR00845 548 VKVLRTSGAR--GTVIVPYRTVEGTA 571 (928)
T ss_pred EEEEEcCCCC--eeEEEEEEeecCcc
Confidence 3332 22221 12346677765543
No 28
>TIGR00845 caca sodium/calcium exchanger 1. This model is specific for the eukaryotic sodium ion/calcium ion exchangers of the Caca family
Probab=90.89 E-value=24 Score=37.30 Aligned_cols=148 Identities=18% Similarity=0.154 Sum_probs=75.7
Q ss_pred CCCCCCceeccceEEEEecCCCCCcEEEEEEEEeCCCCCCceEEEEEEeCCCCC--CeEEECCCcEEEEccc--------
Q psy13120 174 ENDNNPIFDSKSYSASVPENISVGATVLQVSATDLDDGINGRVRYTIISGDDNH--DLTISEDNGLLRIAKN-------- 243 (384)
Q Consensus 174 vnd~~P~f~~~~~~~~v~E~~~~g~~v~~v~a~D~D~~~~~~i~y~i~~~~~~~--~f~i~~~~G~l~~~~~-------- 243 (384)
.||..+.+....-...|.|+. |+.-.+|.-...|....-.+.|+...|.... -|. +..|.|.....
T Consensus 395 ~dd~~s~i~Fe~~~Y~V~En~--GtV~VtV~R~GGdl~~tVsVdY~T~DGTA~AG~DY~--~~sGTLtF~PGEt~KtItV 470 (928)
T TIGR00845 395 ENDPVSKIFFEPGHYTCLENC--GTVALTVVRRGGDLTNTVYVDYRTEDGTANAGSDYE--FTEGTLVFKPGETQKEFRI 470 (928)
T ss_pred ccCCcceEEecCCeEEEeecC--cEEEEEEEEccCCCCceEEEEEEccCCccCCCCCcc--ccCceEEECCCceEEEEEE
Confidence 445455555555567899985 5665555544435444456889887664432 232 23455543211
Q ss_pred --CC---ccccCeEEEEEEEEECCCCC-----------cceeeeeEEEEEEEEeecCCCCCcccCCCeeEEEcCCCCCCC
Q psy13120 244 --LN---YERKAQYSLTIQAEDCGGDT-----------EERIRFDTATVSILVSDINDNYPVFLHSPYLALVNENTLPGP 307 (384)
Q Consensus 244 --lD---~e~~~~~~l~V~a~D~~~~~-----------~~~~~~~~~~v~i~V~dvNd~~P~f~~~~~~~~v~e~~~~~~ 307 (384)
+| +|.... |.|..++..... ...........+|+|.| ||++|.|.-..-...|.|+.
T Consensus 471 ~IIDDdi~E~DE~--F~V~LSNp~~g~~~G~~~~~~~~~~A~Lg~ps~ATVTIlD-DD~aGIfsFe~~~~sV~Es~---- 543 (928)
T TIGR00845 471 GIIDDDIFEEDEH--FYVRLSNLRVGSEDGILEANHVSAVAQLASPNTATVTILD-DDHAGIFTFEEDVFHVSESI---- 543 (928)
T ss_pred EEccCCCCCCCce--EEEEEeCCCCCCcccccccccccccceecCCceEEEEEec-CcccCcccccCceEEEEcCC----
Confidence 11 233333 444444322110 00001122455666677 67899887666677889986
Q ss_pred ceEEEEEEEecCCCCCCCceEEEEEec
Q psy13120 308 SFRIMTVQAHDADSLPFNGLVRYFLKD 334 (384)
Q Consensus 308 g~~i~~v~a~D~D~~~~~~~i~y~l~~ 334 (384)
|....+|.-+-.-.+ .-.+.|...+
T Consensus 544 G~vtvtV~RtsGa~G--~VtV~Y~T~d 568 (928)
T TIGR00845 544 GIMEVKVLRTSGARG--TVIVPYRTVE 568 (928)
T ss_pred CEEEEEEEEcCCCCe--eEEEEEEeec
Confidence 444444433321111 2346677665
No 29
>PF07495 Y_Y_Y: Y_Y_Y domain; InterPro: IPR011123 This region is mostly found at the end of the beta propellers (IPR011110 from INTERPRO) in a family of two component regulators. However they are also found tandemly repeated in Q891H4 from SWISSPROT without other signal conduction domains being present. It is named after the conserved tyrosines found in the alignment. The exact function is not known.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=88.96 E-value=4.9 Score=27.06 Aligned_cols=60 Identities=22% Similarity=0.293 Sum_probs=35.5
Q ss_pred CCceEEEEEEcCCCCCeeeeCcceEEEEceeCCccCCCeEEEEEEEEeCCCCCCceeEEEEEEe
Q psy13120 108 QNSYVEYSIINSEATPFSLGPVDGLLRVAQKLDREIKSNYTLEIKAKDRGEPSKSSRTKFFIRL 171 (384)
Q Consensus 108 ~~~~~~y~l~~~~~~~F~i~~~~G~i~~~~~ld~e~~~~~~l~V~a~d~~~~~~~~~~~v~I~V 171 (384)
.+-...|.|.+.+..+..+...+-.+... .-..+.|+|.|+|.|..+.......++.|.|
T Consensus 6 ~~~~Y~Y~l~g~d~~W~~~~~~~~~~~~~----~L~~G~Y~l~V~a~~~~~~~~~~~~~l~i~I 65 (66)
T PF07495_consen 6 ENIRYRYRLEGFDDEWITLGSYSNSISYT----NLPPGKYTLEVRAKDNNGKWSSDEKSLTITI 65 (66)
T ss_dssp TTEEEEEEEETTESSEEEESSTS-EEEEE----S--SEEEEEEEEEEETTS-B-SS-EEEEEEE
T ss_pred CceEEEEEEECCCCeEEECCCCcEEEEEE----eCCCEEEEEEEEEECCCCCcCcccEEEEEEE
Confidence 34457788887766666665432244433 2246899999999997664444446666665
No 30
>TIGR00864 PCC polycystin cation channel protein. Note: this model has been restricted to the amino half because for technical reasons.
Probab=84.55 E-value=98 Score=37.02 Aligned_cols=31 Identities=10% Similarity=0.247 Sum_probs=22.3
Q ss_pred CccCCCeEEEEEEEEeCCCCCCceeEEEEEEecc
Q psy13120 140 DREIKSNYTLEIKAKDRGEPSKSSRTKFFIRLLD 173 (384)
Q Consensus 140 d~e~~~~~~l~V~a~d~~~~~~~~~~~v~I~V~d 173 (384)
.|.....|++++.|++..+. ...++.|.|.+
T Consensus 1563 TY~spGtYtVtLTvtN~~Gs---~~~T~~i~V~~ 1593 (2740)
T TIGR00864 1563 TFRSVGTFNIIVTAENDVGA---AQASIFLFVLQ 1593 (2740)
T ss_pred eecCCceEEEEEEEecCCCc---cceeEEEEEee
Confidence 36778999999999998652 34555666654
No 31
>TIGR03660 T1SS_rpt_143 T1SS-143 repeat domain. This model represents a domain of about 143 amino acids that may occur singly or in up to 23 tandem repeats in very large proteins in the genus Vibrio, and in related species such as Legionella pneumophila, Photobacterium profundum, Rhodopseudomonas palustris, Shewanella pealeana, and Aeromonas hydrophila. Proteins with these domains represent a subset of a broader set of proteins with a particular signal for type 1 secretion, consisting of several glycine-rich repeats modeled by pfam00353, followed by a C-terminal domain modeled by TIGR03661. Proteins with this domain tend to share several properties with the RtxA (Repeats in Toxin) protein of Vibrio cholerae, including a large size often containing tandemly repeated domains and a C-terminal signal for type 1 secretion.
Probab=84.31 E-value=19 Score=28.62 Aligned_cols=65 Identities=26% Similarity=0.490 Sum_probs=44.4
Q ss_pred cEEE--eCCCcE--EEECccCCcCC---CcEEEEEEEEEeCCCCCCCCceeEEEEEEEEEeecCCCCEEccCceeEeeCC
Q psy13120 17 IFTI--NPSNGT--LSNLKHLDRET---KSSYNLVVMATDMAKPPQKRLSSTTQVSIVIRDVNDNAPEFITANETTIPEN 89 (384)
Q Consensus 17 ~F~i--d~~tG~--i~~~~~lD~E~---~~~~~l~v~a~d~~~~~~~~~s~~~~v~I~V~dvNd~~P~f~~~~~~~v~E~ 89 (384)
.|+| + ..|. +.+..+||+.. .-...|.|.|+|..|.. ....+.|+|.| | .|........+|.|+
T Consensus 57 Vftvtl~-~~GsYtftL~~~lDH~~g~d~l~l~~~v~a~D~DGD~-----s~~~l~VtI~D--D-~P~~~~~~~~~V~E~ 127 (137)
T TIGR03660 57 VFTLTLN-ADGSYEFTLEGPLDHAAGSDELTLNFPIIATDFDGDT-----SSITLPVTIVD--D-VPTITDVDALTVDED 127 (137)
T ss_pred EEEEEEC-CCccEEEEEcccccCCCCCceEEEeeeEEEEeCCCCc-----cccEEEEEEEC--C-CCeeccccceEEecc
Confidence 4555 3 2564 44568898854 33578889999988752 34578888877 6 688766656788886
Q ss_pred C
Q psy13120 90 S 90 (384)
Q Consensus 90 ~ 90 (384)
.
T Consensus 128 ~ 128 (137)
T TIGR03660 128 D 128 (137)
T ss_pred c
Confidence 4
No 32
>TIGR00864 PCC polycystin cation channel protein. Note: this model has been restricted to the amino half because for technical reasons.
Probab=83.80 E-value=1.1e+02 Score=36.78 Aligned_cols=204 Identities=14% Similarity=0.145 Sum_probs=94.8
Q ss_pred cCCCcEEEEEEEEEeCCCCCCCCceeEEEEEEEEEeecCCCCEEccC-c---eeEeeC----CCCCCcEEEEEEEEeCCC
Q psy13120 35 RETKSSYNLVVMATDMAKPPQKRLSSTTQVSIVIRDVNDNAPEFITA-N---ETTIPE----NSPLNTVVAALKAIDRDE 106 (384)
Q Consensus 35 ~E~~~~~~l~v~a~d~~~~~~~~~s~~~~v~I~V~dvNd~~P~f~~~-~---~~~v~E----~~~~gt~v~~~~a~D~D~ 106 (384)
|.....|++++.|++..| +...++.|.|.+....- .+... . ...+.| ..+.+ ....+.+.-.|
T Consensus 1564 Y~spGtYtVtLTvtN~~G------s~~~T~~i~V~~~I~Gl-qI~~~~~~gg~~~~~~~~~~~~~t~-~~v~l~~~~~~- 1634 (2740)
T TIGR00864 1564 FRSVGTFNIIVTAENDVG------AAQASIFLFVLQEIEGL-QILGETAEGGGGGVQELDGCYFETN-HTVQFHAGFKD- 1634 (2740)
T ss_pred ecCCceEEEEEEEecCCC------ccceeEEEEEeecccce-EEEeecccccCCcccccccceeccC-CEEEeeeccCC-
Confidence 455678999999999887 44566677776532210 00000 0 000000 01111 12345554333
Q ss_pred CCCceEEEEEEc--CC--CCCeeeeCcceEEEEceeCCccCCCeEEEEEEEEeCCCCCCceeEEEEEEeccCCCCCCcee
Q psy13120 107 GQNSYVEYSIIN--SE--ATPFSLGPVDGLLRVAQKLDREIKSNYTLEIKAKDRGEPSKSSRTKFFIRLLDENDNNPIFD 182 (384)
Q Consensus 107 ~~~~~~~y~l~~--~~--~~~F~i~~~~G~i~~~~~ld~e~~~~~~l~V~a~d~~~~~~~~~~~v~I~V~dvnd~~P~f~ 182 (384)
|.+ ++|+..- .. ++... ..+|.-. .+-+.+...|.+++.|++.-+ ...+.-++.|.+- .+.+
T Consensus 1635 GTn--vsysW~~~~~n~~dg~~~--~~tGk~~---~vt~~~pG~Y~VtL~aSN~vG---~~t~s~~i~v~ep---i~~L- 1700 (2740)
T TIGR00864 1635 GTN--LSFSWNAILDNEPDGPAF--AGSGKGA---KLNPLEAGPCDIFLQAANLLG---QATADCTIDFLEP---AGNL- 1700 (2740)
T ss_pred CCe--eEEEEEEecCCCCCCccc--cccCcee---EEecCCCceEEEEEEEeeccc---ceeeEEEEEEEEE---cCce-
Confidence 344 4554422 11 11111 1234321 234567799999999999743 2233333444321 1111
Q ss_pred ccceEEEEecCCC-CCcEEEEEEEEeCCCCCCceEEEEEEeCCCCCCeEEECCCcEEEEcccCCccccCeEEEEEEEEEC
Q psy13120 183 SKSYSASVPENIS-VGATVLQVSATDLDDGINGRVRYTIISGDDNHDLTISEDNGLLRIAKNLNYERKAQYSLTIQAEDC 261 (384)
Q Consensus 183 ~~~~~~~v~E~~~-~g~~v~~v~a~D~D~~~~~~i~y~i~~~~~~~~f~i~~~~G~l~~~~~lD~e~~~~~~l~V~a~D~ 261 (384)
.+.+..+.. +|..| .+.+.-.. +..++|++.-++...+..=++ +. ...|.....|.++|+|.+.
T Consensus 1701 ----~L~aS~~~aavn~~V-~fs~~lt~---GS~vty~W~~gdG~s~~~s~p-----~~--tHtf~~~G~y~VtVta~N~ 1765 (2740)
T TIGR00864 1701 ----MLQASDNPAAVNALI-NLSAELAE---GSGLQYRWFLEEGDDLETSEP-----FM--SHSFPSAGLHLVTMKAFNE 1765 (2740)
T ss_pred ----EEeccCCcccCCCeE-EEEEEecC---CCCcEEEEEECCCCCcccCCC-----ce--EEEccCCceEEEEEEEEcc
Confidence 112222211 23333 34442222 344666655343322110000 01 1236678899999999998
Q ss_pred CCCCcceeeeeEEEEEEEEeec
Q psy13120 262 GGDTEERIRFDTATVSILVSDI 283 (384)
Q Consensus 262 ~~~~~~~~~~~~~~v~i~V~dv 283 (384)
.+.. ..++.|.|.+.
T Consensus 1766 vss~-------~~s~~V~Vqep 1780 (2740)
T TIGR00864 1766 LGSA-------NASEEVDVQEP 1780 (2740)
T ss_pred cccc-------ceeEEEEEEec
Confidence 7643 45677777544
No 33
>TIGR03660 T1SS_rpt_143 T1SS-143 repeat domain. This model represents a domain of about 143 amino acids that may occur singly or in up to 23 tandem repeats in very large proteins in the genus Vibrio, and in related species such as Legionella pneumophila, Photobacterium profundum, Rhodopseudomonas palustris, Shewanella pealeana, and Aeromonas hydrophila. Proteins with these domains represent a subset of a broader set of proteins with a particular signal for type 1 secretion, consisting of several glycine-rich repeats modeled by pfam00353, followed by a C-terminal domain modeled by TIGR03661. Proteins with this domain tend to share several properties with the RtxA (Repeats in Toxin) protein of Vibrio cholerae, including a large size often containing tandemly repeated domains and a C-terminal signal for type 1 secretion.
Probab=82.07 E-value=7.5 Score=30.91 Aligned_cols=47 Identities=30% Similarity=0.487 Sum_probs=33.1
Q ss_pred cCeEEEEEEEEECCCCCcceeeeeEEEEEEEEeecCCCCCcccCCCeeEEEcCCCCC
Q psy13120 249 KAQYSLTIQAEDCGGDTEERIRFDTATVSILVSDINDNYPVFLHSPYLALVNENTLP 305 (384)
Q Consensus 249 ~~~~~l~V~a~D~~~~~~~~~~~~~~~v~i~V~dvNd~~P~f~~~~~~~~v~e~~~~ 305 (384)
.-.+.|.|.|+|..+.. ....+.|+|.| | .|+..... ...|.|+..+
T Consensus 84 ~l~l~~~v~a~D~DGD~------s~~~l~VtI~D--D-~P~~~~~~-~~~V~E~~L~ 130 (137)
T TIGR03660 84 ELTLNFPIIATDFDGDT------SSITLPVTIVD--D-VPTITDVD-ALTVDEDDLP 130 (137)
T ss_pred eEEEeeeEEEEeCCCCc------cccEEEEEEEC--C-CCeecccc-ceEEeccccc
Confidence 34678889999988764 34588888877 5 68766443 4688886654
No 34
>PF07495 Y_Y_Y: Y_Y_Y domain; InterPro: IPR011123 This region is mostly found at the end of the beta propellers (IPR011110 from INTERPRO) in a family of two component regulators. However they are also found tandemly repeated in Q891H4 from SWISSPROT without other signal conduction domains being present. It is named after the conserved tyrosines found in the alignment. The exact function is not known.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=79.35 E-value=16 Score=24.42 Aligned_cols=59 Identities=25% Similarity=0.413 Sum_probs=34.0
Q ss_pred CceEEEEEEeCCCCCCeEEECCCcEEEEcccCCccccCeEEEEEEEEECCCCCcceeeeeEEEEEEEE
Q psy13120 213 NGRVRYTIISGDDNHDLTISEDNGLLRIAKNLNYERKAQYSLTIQAEDCGGDTEERIRFDTATVSILV 280 (384)
Q Consensus 213 ~~~i~y~i~~~~~~~~f~i~~~~G~l~~~~~lD~e~~~~~~l~V~a~D~~~~~~~~~~~~~~~v~i~V 280 (384)
+-...|.|. |-...+..+...+-.+.... | ..+.|+|.|.|.|..+... ....++.|.|
T Consensus 7 ~~~Y~Y~l~-g~d~~W~~~~~~~~~~~~~~-L---~~G~Y~l~V~a~~~~~~~~----~~~~~l~i~I 65 (66)
T PF07495_consen 7 NIRYRYRLE-GFDDEWITLGSYSNSISYTN-L---PPGKYTLEVRAKDNNGKWS----SDEKSLTITI 65 (66)
T ss_dssp TEEEEEEEE-TTESSEEEESSTS-EEEEES------SEEEEEEEEEEETTS-B-----SS-EEEEEEE
T ss_pred ceEEEEEEE-CCCCeEEECCCCcEEEEEEe-C---CCEEEEEEEEEECCCCCcC----cccEEEEEEE
Confidence 445677776 54455666544322444332 2 4689999999999877653 2336676666
No 35
>PF13753 SWM_repeat: Putative flagellar system-associated repeat
Probab=75.96 E-value=65 Score=29.62 Aligned_cols=109 Identities=23% Similarity=0.296 Sum_probs=52.8
Q ss_pred CCcEEEEEEEEEeCCCCCCCCceeEEEEEEEEEeecCCCCEEccC---ce-eEeeCCCCCCcEEEEEEEEeCCCCCCceE
Q psy13120 37 TKSSYNLVVMATDMAKPPQKRLSSTTQVSIVIRDVNDNAPEFITA---NE-TTIPENSPLNTVVAALKAIDRDEGQNSYV 112 (384)
Q Consensus 37 ~~~~~~l~v~a~d~~~~~~~~~s~~~~v~I~V~dvNd~~P~f~~~---~~-~~v~E~~~~gt~v~~~~a~D~D~~~~~~~ 112 (384)
....|.+.+.++|..|.. +.....|.|- -.+|..... .. ....+.. .........+++.+. +..+
T Consensus 10 ~d~~~~v~vt~tD~aGN~-----~~~t~~~~vD---t~~P~v~i~~~~~~~~~~~~~~-~~~~t~s~tvs~~~~--g~~v 78 (317)
T PF13753_consen 10 ADGTYTVSVTVTDAAGNT-----STATQSITVD---TTAPTVTITSIADDDIINGDEA-TNTVTFSGTVSGAEP--GSTV 78 (317)
T ss_pred CCCcEEEEEEEEeCCCCe-----eeeeEEEEEe---cCCCceeeecccCCCcccccee-eeeeEEEEEecCCCC--CCEE
Confidence 356799999999999853 3344444432 236644322 11 1111111 122233444444443 3345
Q ss_pred EEEEEcCCCCCeeeeCcceEEEEceeC-CccCCCeEEEEEE-EEeCCC
Q psy13120 113 EYSIINSEATPFSLGPVDGLLRVAQKL-DREIKSNYTLEIK-AKDRGE 158 (384)
Q Consensus 113 ~y~l~~~~~~~F~i~~~~G~i~~~~~l-d~e~~~~~~l~V~-a~d~~~ 158 (384)
.+.+ .+..+.+..+ ..|...+.-.. +.-....|.+.+. ++|..+
T Consensus 79 ~v~~-~g~~~t~~~~-~~G~ws~t~~~~~~l~~g~~ti~v~~~tD~aG 124 (317)
T PF13753_consen 79 TVTI-NGTTGTLTAD-ADGNWSVTVTPSDDLPDGDYTITVTTVTDAAG 124 (317)
T ss_pred EEEE-CCEEEEEEEe-cCCcEEEeeccccccccCcceeEEEEEEccCC
Confidence 5555 2222233333 34543332211 1123458899999 999755
No 36
>PF13754 Big_3_4: Bacterial Ig-like domain (group 3)
Probab=73.87 E-value=14 Score=23.91 Aligned_cols=18 Identities=33% Similarity=0.693 Sum_probs=15.7
Q ss_pred ccCeEEEEEEEEECCCCC
Q psy13120 248 RKAQYSLTIQAEDCGGDT 265 (384)
Q Consensus 248 ~~~~~~l~V~a~D~~~~~ 265 (384)
....|.++|+|+|..+..
T Consensus 22 ~dG~y~itv~a~D~AGN~ 39 (54)
T PF13754_consen 22 ADGTYTITVTATDAAGNT 39 (54)
T ss_pred CCccEEEEEEEEeCCCCC
Confidence 468899999999988876
No 37
>PF02010 REJ: REJ domain; InterPro: IPR002859 The REJ (Receptor for Egg Jelly) domain is found in PKD1 P98161 from SWISSPROT and the sperm receptor for egg jelly Q26627 from SWISSPROT. The exact function of this domain is unknown. The domain is 600 amino acids long so is probably composed of multiple structural domains. There are six completely conserved cysteine residues that may form disulphide bridges. This region contains tandem PKD-like domains. Sequence similarity between a region of the autosomal dominant polycystic kidney disease (ADPKD) protein, polycystin-1 and a sea urchin sperm glycoprotein involved in fertilization, the receptor for egg jelly (suREJ) has been known for some time. The suREJ protein binds the glycoprotein coat of the egg (egg jelly), triggering the acrosome reaction, which transforms the sperm into a fusogenic cell. The sequence similarity and expression pattern suggests that the predicted human PKDREJ protein is a mammalian equivalent of the suREJ protein and therefore may have a central role in human fertilization [].; PDB: 2E7M_A 2YRL_A.
Probab=69.96 E-value=11 Score=36.51 Aligned_cols=33 Identities=27% Similarity=0.519 Sum_probs=0.0
Q ss_pred cccCeEEEEEEEEECCCCCcceeeeeEEEEEEEEeecCCCCCc
Q psy13120 247 ERKAQYSLTIQAEDCGGDTEERIRFDTATVSILVSDINDNYPV 289 (384)
Q Consensus 247 e~~~~~~l~V~a~D~~~~~~~~~~~~~~~v~i~V~dvNd~~P~ 289 (384)
+....|.|++.|+|.++.. ..+.+.+.+ | .+|.
T Consensus 270 ~~g~~Y~~~l~v~~~~~~~------~~a~~~~~~---n-~pP~ 302 (440)
T PF02010_consen 270 EPGSTYTFRLTVTDSSGSS------GSASISFTV---N-APPS 302 (440)
T ss_dssp -------------------------------------------
T ss_pred ccccccccccccccccccc------ccccccccc---c-cccc
Confidence 5567899999999987653 145555555 3 3664
No 38
>PF13754 Big_3_4: Bacterial Ig-like domain (group 3)
Probab=69.46 E-value=16 Score=23.67 Aligned_cols=29 Identities=28% Similarity=0.286 Sum_probs=20.1
Q ss_pred CcEEEECccCCcCCCcEEEEEEEEEeCCCCC
Q psy13120 24 NGTLSNLKHLDRETKSSYNLVVMATDMAKPP 54 (384)
Q Consensus 24 tG~i~~~~~lD~E~~~~~~l~v~a~d~~~~~ 54 (384)
.|.+++.-+.. ....|.|+++|+|..|..
T Consensus 11 ~G~Ws~t~~~~--~dG~y~itv~a~D~AGN~ 39 (54)
T PF13754_consen 11 DGNWSFTVPAL--ADGTYTITVTATDAAGNT 39 (54)
T ss_pred CCcEEEeCCCC--CCccEEEEEEEEeCCCCC
Confidence 34444443333 577899999999999853
No 39
>cd02848 Chitinase_N_term Chitinase N-terminus domain. Chitinases hydrolyze the abundant natural biopolymer chitin, producing smaller chito-oligosaccharides. Chitin consists of multiple N-acetyl-D-glucosamine (NAG) residues connected via beta-1,4-glycosidic linkages and is an important structural element of fungal cell wall and arthropod exoskeletons. On the basis of the mode of chitin hydrolysis, chitinases are classified as random, endo-, and exo-chitinases and based on sequence criteria, chitinases belong to families 18 and 19 of glycosyl hydrolases. The N-terminus of chitinase may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitob
Probab=65.83 E-value=14 Score=27.73 Aligned_cols=62 Identities=18% Similarity=0.316 Sum_probs=35.7
Q ss_pred cEEEEEEeCCCCCcEEE--e---CCCcEEEE---CccCCcCCCcEEEEEEEEEeCCCCCCCCceeEEEEEEEEEe
Q psy13120 4 RLEYSIVSGDDEEIFTI--N---PSNGTLSN---LKHLDRETKSSYNLVVMATDMAKPPQKRLSSTTQVSIVIRD 70 (384)
Q Consensus 4 ~i~Y~i~~~~~~~~F~i--d---~~tG~i~~---~~~lD~E~~~~~~l~v~a~d~~~~~~~~~s~~~~v~I~V~d 70 (384)
.|.|.+.+|+......| | -.+|.+.- +..+++.....|.++|+++|..|= +.+..+.|.|.|
T Consensus 37 ~V~wnvWsG~~Gd~a~vl~dg~~V~~G~~~~~~~~at~~v~kgG~y~m~V~lCn~dGC-----S~S~~~~I~VAD 106 (106)
T cd02848 37 SVKWNAWSGDPGDTYKVLLDGKEVWSGALTGSSGTATFKVGKGGRYQMQVALCNGDGC-----STSAAKEIVVAD 106 (106)
T ss_pred EEEEeeecCCCCcEEEEEECCeEEEcccCCCCccEEEEEeCCCCeEEEEEEEECCCCc-----cCcCCEEEEecC
Confidence 35666777766543333 1 02333322 124556677889999999999872 334455555543
No 40
>PF12245 Big_3_2: Bacterial Ig-like domain (group 3); InterPro: IPR022038 This family of proteins is found in bacteria. They have two conserved sequence motifs: AGN and GMT.
Probab=61.74 E-value=37 Score=22.50 Aligned_cols=30 Identities=40% Similarity=0.492 Sum_probs=20.5
Q ss_pred CCcEEEEEEEEEeCCCCCCCCceeEEEEEEEEEee
Q psy13120 37 TKSSYNLVVMATDMAKPPQKRLSSTTQVSIVIRDV 71 (384)
Q Consensus 37 ~~~~~~l~v~a~d~~~~~~~~~s~~~~v~I~V~dv 71 (384)
....|+|.++++|..|. .......+.+.|.
T Consensus 21 ~dg~yt~~v~a~D~AGN-----~~~~~~~~~i~d~ 50 (60)
T PF12245_consen 21 ADGEYTLTVTATDKAGN-----TSSSTTQIVIVDN 50 (60)
T ss_pred CCccEEEEEEEEECCCC-----EEEeeeEEEEEcC
Confidence 46689999999999985 3444444444443
No 41
>cd02848 Chitinase_N_term Chitinase N-terminus domain. Chitinases hydrolyze the abundant natural biopolymer chitin, producing smaller chito-oligosaccharides. Chitin consists of multiple N-acetyl-D-glucosamine (NAG) residues connected via beta-1,4-glycosidic linkages and is an important structural element of fungal cell wall and arthropod exoskeletons. On the basis of the mode of chitin hydrolysis, chitinases are classified as random, endo-, and exo-chitinases and based on sequence criteria, chitinases belong to families 18 and 19 of glycosyl hydrolases. The N-terminus of chitinase may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitob
Probab=58.86 E-value=22 Score=26.70 Aligned_cols=33 Identities=18% Similarity=0.377 Sum_probs=23.6
Q ss_pred CCccccCeEEEEEEEEECCCCCcceeeeeEEEEEEEEee
Q psy13120 244 LNYERKAQYSLTIQAEDCGGDTEERIRFDTATVSILVSD 282 (384)
Q Consensus 244 lD~e~~~~~~l~V~a~D~~~~~~~~~~~~~~~v~i~V~d 282 (384)
+++.....|.++|.+.|..+.. .++.+.|.|-|
T Consensus 74 ~~v~kgG~y~m~V~lCn~dGCS------~S~~~~I~VAD 106 (106)
T cd02848 74 FKVGKGGRYQMQVALCNGDGCS------TSAAKEIVVAD 106 (106)
T ss_pred EEeCCCCeEEEEEEEECCCCcc------CcCCEEEEecC
Confidence 4566778999999999988754 44555565543
No 42
>PF13753 SWM_repeat: Putative flagellar system-associated repeat
Probab=56.97 E-value=1.6e+02 Score=27.02 Aligned_cols=109 Identities=17% Similarity=0.272 Sum_probs=53.1
Q ss_pred CCCeEEEEEEEEeCCCCCCceeEEEEEEeccCCCCCCceeccceEEEEecCC------CCCcEEEEEEEEeCCCCCCceE
Q psy13120 143 IKSNYTLEIKAKDRGEPSKSSRTKFFIRLLDENDNNPIFDSKSYSASVPENI------SVGATVLQVSATDLDDGINGRV 216 (384)
Q Consensus 143 ~~~~~~l~V~a~d~~~~~~~~~~~v~I~V~dvnd~~P~f~~~~~~~~v~E~~------~~g~~v~~v~a~D~D~~~~~~i 216 (384)
....|.+.+.++|..+-.. .....+.|-- .+|...... +.++. ..........+.+.+ .+..+
T Consensus 10 ~d~~~~v~vt~tD~aGN~~--~~t~~~~vDt---~~P~v~i~~----~~~~~~~~~~~~~~~~t~s~tvs~~~--~g~~v 78 (317)
T PF13753_consen 10 ADGTYTVSVTVTDAAGNTS--TATQSITVDT---TAPTVTITS----IADDDIINGDEATNTVTFSGTVSGAE--PGSTV 78 (317)
T ss_pred CCCcEEEEEEEEeCCCCee--eeeEEEEEec---CCCceeeec----ccCCCccccceeeeeeEEEEEecCCC--CCCEE
Confidence 3467999999999865332 3444444322 266443221 11111 112223344444433 34445
Q ss_pred EEEEEeCCCCCCeEEECCCcEEEEcccC-CccccCeEEEEEE-EEECCCCC
Q psy13120 217 RYTIISGDDNHDLTISEDNGLLRIAKNL-NYERKAQYSLTIQ-AEDCGGDT 265 (384)
Q Consensus 217 ~y~i~~~~~~~~f~i~~~~G~l~~~~~l-D~e~~~~~~l~V~-a~D~~~~~ 265 (384)
.+.+. +. ...+..+ ..|.+...-.. +.-....|.+.+. ++|..+..
T Consensus 79 ~v~~~-g~-~~t~~~~-~~G~ws~t~~~~~~l~~g~~ti~v~~~tD~aGN~ 126 (317)
T PF13753_consen 79 TVTIN-GT-TGTLTAD-ADGNWSVTVTPSDDLPDGDYTITVTTVTDAAGNT 126 (317)
T ss_pred EEEEC-CE-EEEEEEe-cCCcEEEeeccccccccCcceeEEEEEEccCCcc
Confidence 55551 21 1223333 34543332211 1223458999999 99988875
No 43
>PRK14081 triple tyrosine motif-containing protein; Provisional
Probab=52.68 E-value=2.8e+02 Score=28.53 Aligned_cols=223 Identities=15% Similarity=0.208 Sum_probs=109.3
Q ss_pred CcEEEEEEEEEeCCCCCCCCceeEEEEEEEEEeecCCCCEEccCceeEeeCCCCCCcEEE-EEEEEeCCCCCCceEEEEE
Q psy13120 38 KSSYNLVVMATDMAKPPQKRLSSTTQVSIVIRDVNDNAPEFITANETTIPENSPLNTVVA-ALKAIDRDEGQNSYVEYSI 116 (384)
Q Consensus 38 ~~~~~l~v~a~d~~~~~~~~~s~~~~v~I~V~dvNd~~P~f~~~~~~~v~E~~~~gt~v~-~~~a~D~D~~~~~~~~y~l 116 (384)
.+.|.+.|.+.|..... ...+...+...|.+... .|.......+...+.-..|..+. .+.|.+. .+-...|.+
T Consensus 349 ~G~Y~I~v~VKDk~S~~--eyD~~~~v~f~i~~~~~-~p~~I~~vl~d~~~~~lvG~~i~i~v~a~gg---~~~lY~f~i 422 (667)
T PRK14081 349 AGKYSITLWVKDISSKG--EYEDKSSIDYTIEEKSK-EPIKIEDVILDKGKHILKGEEIKIRVIAEGG---TNLRYSFII 422 (667)
T ss_pred CceEEEEEEEEcCcCcc--cccceEEEEEEEcccCC-CCeEEEEEEECCCCceEeCCeEEEEEEecCC---CeEEEEEEE
Confidence 56799999999987532 24566777777776554 56433322222224444554443 3444332 232334444
Q ss_pred EcCCCCCeeeeC-cceEEEEceeCCccCCCeEEEEEEEEeCCCC-CCceeEEEEEEeccCCCCCCceeccceEEEEecCC
Q psy13120 117 INSEATPFSLGP-VDGLLRVAQKLDREIKSNYTLEIKAKDRGEP-SKSSRTKFFIRLLDENDNNPIFDSKSYSASVPENI 194 (384)
Q Consensus 117 ~~~~~~~F~i~~-~~G~i~~~~~ld~e~~~~~~l~V~a~d~~~~-~~~~~~~v~I~V~dvnd~~P~f~~~~~~~~v~E~~ 194 (384)
.-.+...-.++- .+..+. +-.+..+.|++.|.|.|.-.. ...+...+.+.| .+..|.-.. .......|..
T Consensus 423 k~ng~~ve~~~Y~~~~~~~----f~P~~~G~Y~IeV~vKdk~S~~~yD~~k~v~l~V---~e~~P~~i~-~il~~~~~~~ 494 (667)
T PRK14081 423 KKDGKEEEKIDYGKNNWVN----FIPEEKGNYELEVRVKDKYSDKEYDAHTIVYIKV---HEYIPAEID-YILLPVKEYY 494 (667)
T ss_pred EECCEEEEEeecccccEEE----EEECCCeeEEEEEEEecccCchhcccceEEEEEE---eccCceeee-eEEecccccE
Confidence 332210000000 011111 123567899999999997432 345556666665 444675211 1123334555
Q ss_pred CCCcEE-EEEEEEeCCCCCCceEEEEEEeCCCCCCeEEE--CCCcEEEEcccCCccccCeEEEEEEEEECCCCCcceeee
Q psy13120 195 SVGATV-LQVSATDLDDGINGRVRYTIISGDDNHDLTIS--EDNGLLRIAKNLNYERKAQYSLTIQAEDCGGDTEERIRF 271 (384)
Q Consensus 195 ~~g~~v-~~v~a~D~D~~~~~~i~y~i~~~~~~~~f~i~--~~~G~l~~~~~lD~e~~~~~~l~V~a~D~~~~~~~~~~~ 271 (384)
..|..+ ..+.+.+. .+-.+.|.+.-. +++.... ......+.. | -..+.|++-|.|.|...... ..
T Consensus 495 ~vg~~i~~~~~~~~~---k~v~y~y~~~~N--G~~v~~t~Ys~~~~ysf~-P---~~~GkY~I~V~aKn~~s~~~---~D 562 (667)
T PRK14081 495 LVGDDIEIEVIIQNT---KDVLIKYILKIN--GHKVEETDYIKNKKYKFI-P---KCSGKYTIEVLAKNIKSTEE---YD 562 (667)
T ss_pred EeCCEEEEEEEEeCC---CeEEEEEEEEEC--CEEEEEeeccccceEEEe-e---cCCceEEEEEEEcccccccc---cc
Confidence 555544 33333332 244455655311 1111110 001111111 1 13578999999999775442 34
Q ss_pred eEEEEEEEEeecCCCCCc
Q psy13120 272 DTATVSILVSDINDNYPV 289 (384)
Q Consensus 272 ~~~~v~i~V~dvNd~~P~ 289 (384)
+...+.+.|. +..|.
T Consensus 563 ~~k~v~~~V~---e~~pi 577 (667)
T PRK14081 563 SKKEVKFYVR---EALPI 577 (667)
T ss_pred cceEEEEEEc---CCCCc
Confidence 5567777765 43454
No 44
>PF12245 Big_3_2: Bacterial Ig-like domain (group 3); InterPro: IPR022038 This family of proteins is found in bacteria. They have two conserved sequence motifs: AGN and GMT.
Probab=52.18 E-value=59 Score=21.51 Aligned_cols=29 Identities=28% Similarity=0.485 Sum_probs=20.1
Q ss_pred CCeEEEEEEEEeCCCCCCceeEEEEEEeccC
Q psy13120 144 KSNYTLEIKAKDRGEPSKSSRTKFFIRLLDE 174 (384)
Q Consensus 144 ~~~~~l~V~a~d~~~~~~~~~~~v~I~V~dv 174 (384)
...|.|.+.|+|..| ........+.+.|.
T Consensus 22 dg~yt~~v~a~D~AG--N~~~~~~~~~i~d~ 50 (60)
T PF12245_consen 22 DGEYTLTVTATDKAG--NTSSSTTQIVIVDN 50 (60)
T ss_pred CccEEEEEEEEECCC--CEEEeeeEEEEEcC
Confidence 578999999999865 44445555555544
No 45
>PF03160 Calx-beta: Calx-beta domain; InterPro: IPR003644 The calx-beta motif is present as a tandem repeat in the cytoplasmic domains of Calx Na-Ca exchangers, which are used to expel calcium from cells. This motif overlaps domains used for calcium binding and regulation. The calx-beta motif is also present in the cytoplasmic tail of mammalian integrin-beta4, which mediates the bi-directional transfer of signals across the plasma membrane, as well as in some cyanobacterial proteins. This motif contains a series of beta-strands and turns that form a self-contained beta-sheet [, ].; GO: 0007154 cell communication, 0016021 integral to membrane; PDB: 3H6A_B 3FSO_A 3FQ4_B 2DPK_A 2QVM_A 3GIN_B 2QVK_A 2FWU_A 2FWS_A 3E9U_A ....
Probab=51.73 E-value=92 Score=22.74 Aligned_cols=53 Identities=23% Similarity=0.287 Sum_probs=29.3
Q ss_pred EEEeccCCCCCCceeccceEEEEecCCCCCcEEEEEEEEeCCCCCCceEEEEEEeCC
Q psy13120 168 FIRLLDENDNNPIFDSKSYSASVPENISVGATVLQVSATDLDDGINGRVRYTIISGD 224 (384)
Q Consensus 168 ~I~V~dvnd~~P~f~~~~~~~~v~E~~~~g~~v~~v~a~D~D~~~~~~i~y~i~~~~ 224 (384)
+|.|.| || .|.+........+.|+.. .....|.-..++....-.+.|....|.
T Consensus 2 tvtI~d-~d-~~~v~f~~~~~~v~E~~~--~~~v~V~~~~~~~~~~v~v~~~~~~gt 54 (100)
T PF03160_consen 2 TVTILD-DD-DPTVSFSSPSYTVSEGDG--TVTVTVTRSGGSLDGPVTVNYSTVDGT 54 (100)
T ss_dssp EEEEE--TT-SEEEEESSSEEEEETTSS--EEEEEEEEESS-TSSEEEEEEEEEESS
T ss_pred EEEEEC-CC-CCEEEEeCCEEEEEeCCC--EEEEEEEEcccCCCcceEEEEEEeCCc
Confidence 466777 56 447766666778888863 344444444343223445666665543
No 46
>PF08329 ChitinaseA_N: Chitinase A, N-terminal domain; InterPro: IPR013540 This domain is found in a number of bacterial chitinases and similar viral proteins. It is organised into a fibronectin III module domain-like fold, comprising only beta strands. Its function is not known, but it may be involved in interaction with the enzyme substrate, chitin [, ]. It is separated by a hinge region from the catalytic domain (IPR001223 from INTERPRO); this hinge region is probably mobile, allowing the N-terminal domain to have different relative positions in solution []. ; GO: 0004568 chitinase activity; PDB: 2WLY_A 1EDQ_A 2WM0_A 1X6N_A 1NH6_A 2WK2_A 1EHN_A 2WLZ_A 1EIB_A 1FFR_A ....
Probab=50.95 E-value=49 Score=26.11 Aligned_cols=63 Identities=16% Similarity=0.288 Sum_probs=32.9
Q ss_pred cEEEEEEeCCCCCcEEEeCCCcEEEEC--------ccCCcCCCcEEEEEEEEEeCCCCCCCCceeEEEEEEEEEeec
Q psy13120 4 RLEYSIVSGDDEEIFTINPSNGTLSNL--------KHLDRETKSSYNLVVMATDMAKPPQKRLSSTTQVSIVIRDVN 72 (384)
Q Consensus 4 ~i~Y~i~~~~~~~~F~id~~tG~i~~~--------~~lD~E~~~~~~l~v~a~d~~~~~~~~~s~~~~v~I~V~dvN 72 (384)
.|++.+..|+....+.+= ..|.+.-. ..+.......|+++|+++|..| -+.+..+.|.|.|-.
T Consensus 41 ~VtwN~WsG~~Gd~~kly-~dG~~V~tG~~~~~~~a~~~~~~gG~y~~~VeLCN~~G-----CS~S~~~~V~VaDTD 111 (133)
T PF08329_consen 41 SVTWNVWSGTNGDTAKLY-FDGVLVWTGPSPQQKSATFTVTKGGRYQMQVELCNADG-----CSTSAPVEVVVADTD 111 (133)
T ss_dssp EEEEE-SSS---SEEEEE-ETTEEEEEEE--SEEEEEEEE-S-EEEEEEEEEEETTE-----EEE---EEEEEE-TT
T ss_pred EEEEEEecCCCCCEEEEE-ECCEEEEeCCCccCceEEEEecCCCEEEEEEEEECCCC-----cccCCCEEEEEeCCC
Confidence 355666667665555543 23433211 2344556678999999999887 255667788888754
No 47
>smart00089 PKD Repeats in polycystic kidney disease 1 (PKD1) and other proteins. Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.
Probab=48.79 E-value=49 Score=22.90 Aligned_cols=31 Identities=10% Similarity=0.128 Sum_probs=22.9
Q ss_pred eCCccCCCeEEEEEEEEeCCCCCCceeEEEEEEe
Q psy13120 138 KLDREIKSNYTLEIKAKDRGEPSKSSRTKFFIRL 171 (384)
Q Consensus 138 ~ld~e~~~~~~l~V~a~d~~~~~~~~~~~v~I~V 171 (384)
..-|.....|.+++.++|..+ +...++.|.|
T Consensus 48 ~~~y~~~G~y~v~l~v~n~~g---~~~~~~~i~v 78 (79)
T smart00089 48 THTYTKPGTYTVTLTVTNAVG---SASATVTVVV 78 (79)
T ss_pred EEEeCCCcEEEEEEEEEcCCC---cEEEEEEEEE
Confidence 344677899999999999765 5566666655
No 48
>PRK14081 triple tyrosine motif-containing protein; Provisional
Probab=47.63 E-value=3.3e+02 Score=27.98 Aligned_cols=261 Identities=20% Similarity=0.243 Sum_probs=120.9
Q ss_pred CcEEEEEEEEEeCCCCCCCCceeEEEEEEEEEeecCCCCEEccCceeEeeCCCCCCcEEEEEEEEeCCCCCCceEEEEEE
Q psy13120 38 KSSYNLVVMATDMAKPPQKRLSSTTQVSIVIRDVNDNAPEFITANETTIPENSPLNTVVAALKAIDRDEGQNSYVEYSII 117 (384)
Q Consensus 38 ~~~~~l~v~a~d~~~~~~~~~s~~~~v~I~V~dvNd~~P~f~~~~~~~v~E~~~~gt~v~~~~a~D~D~~~~~~~~y~l~ 117 (384)
.+.|.|.+.+.|..... .....+.+.-+|. -.+|+-..............|+.+ .+.|. ++.|.+-..+|.|.
T Consensus 255 ~G~Y~i~~~VKD~~S~~--eyDD~~~l~y~Vk---~~~~vkI~~~~~d~~s~ql~g~~I-~ika~-a~GG~~llYrf~I~ 327 (667)
T PRK14081 255 SGDYKLLCLVKDMYSNN--EFDDRAVLVYTVK---PYKDIKIRNFTTDLSSPQLTDTDI-ELKAV-AEGGKELLYRFIIK 327 (667)
T ss_pred CccEEEEEEEeccCccc--ccccceEEEEEEe---cCCCcEEEEEEEcCCCCeecCceE-EEEEE-ecCCCceEEEEEEC
Confidence 45799999999987632 2445555555552 123322222222222333355544 44443 22333322344442
Q ss_pred cCCCCCeeeeCcceEEEEce-eCCccCCCeEEEEEEEEeCCCC-CCceeEEEEEEeccCCCCCCceeccceEEEEecCCC
Q psy13120 118 NSEATPFSLGPVDGLLRVAQ-KLDREIKSNYTLEIKAKDRGEP-SKSSRTKFFIRLLDENDNNPIFDSKSYSASVPENIS 195 (384)
Q Consensus 118 ~~~~~~F~i~~~~G~i~~~~-~ld~e~~~~~~l~V~a~d~~~~-~~~~~~~v~I~V~dvnd~~P~f~~~~~~~~v~E~~~ 195 (384)
+.. .. .+|...... ..-.+..+.|.+.|.|.|.... ...+...+.+.|.+... .|..... ......+...
T Consensus 328 G~~--~e----~~~Y~~~n~~~w~P~~~G~Y~I~v~VKDk~S~~eyD~~~~v~f~i~~~~~-~p~~I~~-vl~d~~~~~l 399 (667)
T PRK14081 328 GKE--SE----DSGYIRNNIYTWKPKIAGKYSITLWVKDISSKGEYEDKSSIDYTIEEKSK-EPIKIED-VILDKGKHIL 399 (667)
T ss_pred CcE--Ee----eccccccceEEEeeCCCceEEEEEEEEcCcCcccccceEEEEEEEcccCC-CCeEEEE-EEECCCCceE
Confidence 111 00 112211111 1122356899999999997432 35566777777776544 4542111 1122234444
Q ss_pred CCcEE-EEEEEEeCCCCCCceEEEEEEeCCCCCCe-EEEC-CCcEEEEcccCCccccCeEEEEEEEEECCCCCcceeeee
Q psy13120 196 VGATV-LQVSATDLDDGINGRVRYTIISGDDNHDL-TISE-DNGLLRIAKNLNYERKAQYSLTIQAEDCGGDTEERIRFD 272 (384)
Q Consensus 196 ~g~~v-~~v~a~D~D~~~~~~i~y~i~~~~~~~~f-~i~~-~~G~l~~~~~lD~e~~~~~~l~V~a~D~~~~~~~~~~~~ 272 (384)
.|..+ ..+.|.+.. +-...|.+.-. +... .++- .+-.+... .+..+.|++.|.|.|...... ..+
T Consensus 400 vG~~i~i~v~a~gg~---~~lY~f~ik~n--g~~ve~~~Y~~~~~~~f~----P~~~G~Y~IeV~vKdk~S~~~---yD~ 467 (667)
T PRK14081 400 KGEEIKIRVIAEGGT---NLRYSFIIKKD--GKEEEKIDYGKNNWVNFI----PEEKGNYELEVRVKDKYSDKE---YDA 467 (667)
T ss_pred eCCeEEEEEEecCCC---eEEEEEEEEEC--CEEEEEeecccccEEEEE----ECCCeeEEEEEEEecccCchh---ccc
Confidence 55544 334444332 33344444311 0000 0000 01111111 145789999999999776543 244
Q ss_pred EEEEEEEEeecCCCCCcccCCCeeEEEcCCCCCCCceEEEEEEEecCCCCCCCceEEEEEec
Q psy13120 273 TATVSILVSDINDNYPVFLHSPYLALVNENTLPGPSFRIMTVQAHDADSLPFNGLVRYFLKD 334 (384)
Q Consensus 273 ~~~v~i~V~dvNd~~P~f~~~~~~~~v~e~~~~~~g~~i~~v~a~D~D~~~~~~~i~y~l~~ 334 (384)
...+.+.|. +..|.-. ........+....+ -.....+.+++.. +-.+.|.+.-
T Consensus 468 ~k~v~l~V~---e~~P~~i-~~il~~~~~~~~vg-~~i~~~~~~~~~k----~v~y~y~~~~ 520 (667)
T PRK14081 468 HTIVYIKVH---EYIPAEI-DYILLPVKEYYLVG-DDIEIEVIIQNTK----DVLIKYILKI 520 (667)
T ss_pred ceEEEEEEe---ccCceee-eeEEecccccEEeC-CEEEEEEEEeCCC----eEEEEEEEEE
Confidence 566676664 4477522 11111222332211 1223445555532 5567787765
No 49
>cd00146 PKD polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.
Probab=47.31 E-value=47 Score=23.10 Aligned_cols=21 Identities=19% Similarity=0.506 Sum_probs=17.2
Q ss_pred CCccccCeEEEEEEEEECCCC
Q psy13120 244 LNYERKAQYSLTIQAEDCGGD 264 (384)
Q Consensus 244 lD~e~~~~~~l~V~a~D~~~~ 264 (384)
..|.....|.++++|+|..+.
T Consensus 51 ~~y~~~G~y~v~l~v~d~~g~ 71 (81)
T cd00146 51 HTYTKPGTYTVTLTVTNAVGS 71 (81)
T ss_pred EEcCCCcEEEEEEEEEeCCCC
Confidence 346788999999999998654
No 50
>cd00146 PKD polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.
Probab=45.59 E-value=53 Score=22.82 Aligned_cols=29 Identities=21% Similarity=0.338 Sum_probs=20.6
Q ss_pred CcCCCcEEEEEEEEEeCCCCCCCCceeEEEEEEE
Q psy13120 34 DRETKSSYNLVVMATDMAKPPQKRLSSTTQVSIV 67 (384)
Q Consensus 34 D~E~~~~~~l~v~a~d~~~~~~~~~s~~~~v~I~ 67 (384)
-|.....|.++++++|..+. +....+.|.
T Consensus 52 ~y~~~G~y~v~l~v~d~~g~-----~~~~~~~V~ 80 (81)
T cd00146 52 TYTKPGTYTVTLTVTNAVGS-----SSTKTTTVV 80 (81)
T ss_pred EcCCCcEEEEEEEEEeCCCC-----EEEEEEEEE
Confidence 46677899999999998652 444455554
No 51
>smart00089 PKD Repeats in polycystic kidney disease 1 (PKD1) and other proteins. Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.
Probab=44.59 E-value=62 Score=22.33 Aligned_cols=30 Identities=17% Similarity=0.356 Sum_probs=21.8
Q ss_pred CCcCCCcEEEEEEEEEeCCCCCCCCceeEEEEEEEE
Q psy13120 33 LDRETKSSYNLVVMATDMAKPPQKRLSSTTQVSIVI 68 (384)
Q Consensus 33 lD~E~~~~~~l~v~a~d~~~~~~~~~s~~~~v~I~V 68 (384)
.-|+....|.+++.+.|..+ +.+..++|.|
T Consensus 49 ~~y~~~G~y~v~l~v~n~~g------~~~~~~~i~v 78 (79)
T smart00089 49 HTYTKPGTYTVTLTVTNAVG------SASATVTVVV 78 (79)
T ss_pred EEeCCCcEEEEEEEEEcCCC------cEEEEEEEEE
Confidence 34667788999999999876 4555666654
No 52
>KOG4221|consensus
Probab=44.40 E-value=4.8e+02 Score=28.91 Aligned_cols=57 Identities=14% Similarity=0.111 Sum_probs=41.9
Q ss_pred CCCeeeeCcceEEEEceeCCccCCCeEEEEEEEEeCCCCCCceeEEEEEEeccCCCCCCc
Q psy13120 121 ATPFSLGPVDGLLRVAQKLDREIKSNYTLEIKAKDRGEPSKSSRTKFFIRLLDENDNNPI 180 (384)
Q Consensus 121 ~~~F~i~~~~G~i~~~~~ld~e~~~~~~l~V~a~d~~~~~~~~~~~v~I~V~dvnd~~P~ 180 (384)
...+.++.++-.+++. +.|....|.+.|.|....++..++...-.++..|+-+.||.
T Consensus 564 ~~~~~~~~n~~e~ti~---gL~k~TeY~~~vvA~N~~G~g~sS~~i~V~Tlsd~PsaPP~ 620 (1381)
T KOG4221|consen 564 GKELRVENNATEYTIN---GLEKYTEYSIRVVAYNSAGSGVSSADITVRTLSDVPSAPPQ 620 (1381)
T ss_pred CceEEEecCccEEEee---cCCCccceEEEEEEecCCCCCCCCCceEEEeccCCCCCCCc
Confidence 4568887766666665 45677899999999998877665555455667788887775
No 53
>PF03160 Calx-beta: Calx-beta domain; InterPro: IPR003644 The calx-beta motif is present as a tandem repeat in the cytoplasmic domains of Calx Na-Ca exchangers, which are used to expel calcium from cells. This motif overlaps domains used for calcium binding and regulation. The calx-beta motif is also present in the cytoplasmic tail of mammalian integrin-beta4, which mediates the bi-directional transfer of signals across the plasma membrane, as well as in some cyanobacterial proteins. This motif contains a series of beta-strands and turns that form a self-contained beta-sheet [, ].; GO: 0007154 cell communication, 0016021 integral to membrane; PDB: 3H6A_B 3FSO_A 3FQ4_B 2DPK_A 2QVM_A 3GIN_B 2QVK_A 2FWU_A 2FWS_A 3E9U_A ....
Probab=44.14 E-value=1.2e+02 Score=22.03 Aligned_cols=51 Identities=22% Similarity=0.142 Sum_probs=30.5
Q ss_pred EEEEeecCCCCCcccCCCeeEEEcCCCCCCCceEEEEEEEecCCCCCCCceEEEEEec
Q psy13120 277 SILVSDINDNYPVFLHSPYLALVNENTLPGPSFRIMTVQAHDADSLPFNGLVRYFLKD 334 (384)
Q Consensus 277 ~i~V~dvNd~~P~f~~~~~~~~v~e~~~~~~g~~i~~v~a~D~D~~~~~~~i~y~l~~ 334 (384)
+|.|.| || .|.+.-......+.|+. |.....+.-..++... .-.+.|...+
T Consensus 2 tvtI~d-~d-~~~v~f~~~~~~v~E~~----~~~~v~V~~~~~~~~~-~v~v~~~~~~ 52 (100)
T PF03160_consen 2 TVTILD-DD-DPTVSFSSPSYTVSEGD----GTVTVTVTRSGGSLDG-PVTVNYSTVD 52 (100)
T ss_dssp EEEEE--TT-SEEEEESSSEEEEETTS----SEEEEEEEEESS-TSS-EEEEEEEEEE
T ss_pred EEEEEC-CC-CCEEEEeCCEEEEEeCC----CEEEEEEEEcccCCCc-ceEEEEEEeC
Confidence 466777 56 44766666677888986 4455666666554322 3456777665
No 54
>PF02494 HYR: HYR domain; InterPro: IPR003410 This domain is known as the HYR (Hyalin Repeat) domain, after the protein hyalin that is composed exclusively of this repeat. This domain probably corresponds to a new superfamily in the immunoglobulin fold. The function of this domain is uncertain it may be involved in cell adhesion. In the Sushi repeat-containing protein (SrpX), this domain is found between two sushi repeats.
Probab=42.64 E-value=55 Score=23.01 Aligned_cols=25 Identities=20% Similarity=0.294 Sum_probs=20.7
Q ss_pred CeEEEEEEEEeCCCCCCceeEEEEEEe
Q psy13120 145 SNYTLEIKAKDRGEPSKSSRTKFFIRL 171 (384)
Q Consensus 145 ~~~~l~V~a~d~~~~~~~~~~~v~I~V 171 (384)
+.|.++..|+|..+ ..+.|.+.|+|
T Consensus 57 G~t~V~ytA~D~~G--N~a~C~f~V~V 81 (81)
T PF02494_consen 57 GTTTVTYTATDAAG--NSATCSFTVTV 81 (81)
T ss_pred ceEEEEEEEEECCC--CEEEEEEEEEC
Confidence 46899999999865 77899998876
No 55
>PF02494 HYR: HYR domain; InterPro: IPR003410 This domain is known as the HYR (Hyalin Repeat) domain, after the protein hyalin that is composed exclusively of this repeat. This domain probably corresponds to a new superfamily in the immunoglobulin fold. The function of this domain is uncertain it may be involved in cell adhesion. In the Sushi repeat-containing protein (SrpX), this domain is found between two sushi repeats.
Probab=42.04 E-value=58 Score=22.86 Aligned_cols=26 Identities=23% Similarity=0.277 Sum_probs=21.6
Q ss_pred CcEEEEEEEEEeCCCCCCCCceeEEEEEEEE
Q psy13120 38 KSSYNLVVMATDMAKPPQKRLSSTTQVSIVI 68 (384)
Q Consensus 38 ~~~~~l~v~a~d~~~~~~~~~s~~~~v~I~V 68 (384)
...|.++.+|+|..|. .+++.+.|+|
T Consensus 56 ~G~t~V~ytA~D~~GN-----~a~C~f~V~V 81 (81)
T PF02494_consen 56 VGTTTVTYTATDAAGN-----SATCSFTVTV 81 (81)
T ss_pred eceEEEEEEEEECCCC-----EEEEEEEEEC
Confidence 4579999999999873 7889998875
No 56
>PF08329 ChitinaseA_N: Chitinase A, N-terminal domain; InterPro: IPR013540 This domain is found in a number of bacterial chitinases and similar viral proteins. It is organised into a fibronectin III module domain-like fold, comprising only beta strands. Its function is not known, but it may be involved in interaction with the enzyme substrate, chitin [, ]. It is separated by a hinge region from the catalytic domain (IPR001223 from INTERPRO); this hinge region is probably mobile, allowing the N-terminal domain to have different relative positions in solution []. ; GO: 0004568 chitinase activity; PDB: 2WLY_A 1EDQ_A 2WM0_A 1X6N_A 1NH6_A 2WK2_A 1EHN_A 2WLZ_A 1EIB_A 1FFR_A ....
Probab=33.47 E-value=1.2e+02 Score=23.92 Aligned_cols=78 Identities=18% Similarity=0.337 Sum_probs=36.2
Q ss_pred eEEEEEEeCCCCCCeEEECCCcEEEE--------cccCCccccCeEEEEEEEEECCCCCcceeeeeEEEEEEEEeecCCC
Q psy13120 215 RVRYTIISGDDNHDLTISEDNGLLRI--------AKNLNYERKAQYSLTIQAEDCGGDTEERIRFDTATVSILVSDINDN 286 (384)
Q Consensus 215 ~i~y~i~~~~~~~~f~i~~~~G~l~~--------~~~lD~e~~~~~~l~V~a~D~~~~~~~~~~~~~~~v~i~V~dvNd~ 286 (384)
.|++.+..|+.+..+.+=- .|.+.. ...+.+...+.|+++|++.|..+. +.+..+.|.|.|..-
T Consensus 41 ~VtwN~WsG~~Gd~~kly~-dG~~V~tG~~~~~~~a~~~~~~gG~y~~~VeLCN~~GC------S~S~~~~V~VaDTDG- 112 (133)
T PF08329_consen 41 SVTWNVWSGTNGDTAKLYF-DGVLVWTGPSPQQKSATFTVTKGGRYQMQVELCNADGC------STSAPVEVVVADTDG- 112 (133)
T ss_dssp EEEEE-SSS---SEEEEEE-TTEEEEEEE--SEEEEEEEE-S-EEEEEEEEEEETTEE------EE---EEEEEE-TTS-
T ss_pred EEEEEEecCCCCCEEEEEE-CCEEEEeCCCccCceEEEEecCCCEEEEEEEEECCCCc------ccCCCEEEEEeCCCc-
Confidence 3555555665555554422 233322 223445567899999999998874 567789999998842
Q ss_pred CCcccCCCeeEEEcCCC
Q psy13120 287 YPVFLHSPYLALVNENT 303 (384)
Q Consensus 287 ~P~f~~~~~~~~v~e~~ 303 (384)
-.+. +....+.||-
T Consensus 113 -sHl~--pL~~~~~~nN 126 (133)
T PF08329_consen 113 -SHLA--PLPYNWDENN 126 (133)
T ss_dssp -TTS----------TTS
T ss_pred -cccc--cccCcccccC
Confidence 2222 2334556654
No 57
>PF05895 DUF859: Siphovirus protein of unknown function (DUF859); InterPro: IPR008577 This entry is represented by Streptococcus phage 7201, Orf39. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This family consists of several uncharacterised proteins from a number of the Siphoviruses as well as some bacterial proteins from Streptococcus species. Some of the members of this family are described as putative minor structural proteins.
Probab=32.25 E-value=5.7e+02 Score=26.19 Aligned_cols=191 Identities=17% Similarity=0.153 Sum_probs=0.0
Q ss_pred eEEEEEEEEeCCCCCCceeEEEEEEeccCCCCCCc------eeccceEEEEecCCCCCcEEEEEEEEeCCCCCCceEEEE
Q psy13120 146 NYTLEIKAKDRGEPSKSSRTKFFIRLLDENDNNPI------FDSKSYSASVPENISVGATVLQVSATDLDDGINGRVRYT 219 (384)
Q Consensus 146 ~~~l~V~a~d~~~~~~~~~~~v~I~V~dvnd~~P~------f~~~~~~~~v~E~~~~g~~v~~v~a~D~D~~~~~~i~y~ 219 (384)
...|++.+.+.-.|..++ |.+.|.|-..+. |.+..-.+.|.=+.+.|..=.++.. ..-+
T Consensus 212 s~~~t~~VP~s~kPtiss-----itlsd~n~~~~~ii~~~~fVq~~S~ikv~~n~A~g~ygSTI~~----------y~ae 276 (624)
T PF05895_consen 212 SKTFTLTVPNSVKPTISS-----ITLSDTNTKVSNIIGNNNFVQIKSNIKVTFNGASGSYGSTIKS----------YNAE 276 (624)
T ss_pred cEEEEEEECCCCCceeee-----EEEEecCccccccccCCceEEEeeeEEEEEeccccccCceEEE----------EEEE
Q ss_pred EEeCCCCCCeEEECCCcEEEEcccCCccccCeEEEEEEEEECCCCCcceeeeeEEEEEEEEeecCCCCCcccCCCeeEEE
Q psy13120 220 IISGDDNHDLTISEDNGLLRIAKNLNYERKAQYSLTIQAEDCGGDTEERIRFDTATVSILVSDINDNYPVFLHSPYLALV 299 (384)
Q Consensus 220 i~~~~~~~~f~i~~~~G~l~~~~~lD~e~~~~~~l~V~a~D~~~~~~~~~~~~~~~v~i~V~dvNd~~P~f~~~~~~~~v 299 (384)
|. +....+....|.+-....- ...++++.|+|..+.. ++....+|+|.+= ++|.+....++..-
T Consensus 277 Iv----g~~~s~~~n~~~~g~~n~~-----G~~Ti~atVtDSRGr~-----S~~~~~tItVl~Y--~~P~lsfsv~R~~~ 340 (624)
T PF05895_consen 277 IV----GKNQSITSNGGTFGSVNFS-----GSATIRATVTDSRGRT-----SDPKTKTITVLEY--SPPTLSFSVYRCGS 340 (624)
T ss_pred Ee----CCceEEecCCcceeeeccC-----ceEEEEEEEEECCCcc-----CCceEEEEEEEEc--CCCcEEEEEEEeCC
Q ss_pred cCCCCCCCceEEEEEE--EecCCCCCCCceEEEEEecCCCCCCCCcCCCCCCCceEEeCCcce-------------EEEc
Q psy13120 300 NENTLPGPSFRIMTVQ--AHDADSLPFNGLVRYFLKDSGASSGNSQTQGGTGDLFRVNSTTGD-------------IYLL 364 (384)
Q Consensus 300 ~e~~~~~~g~~i~~v~--a~D~D~~~~~~~i~y~l~~~~~~~~~~~~~~~~~~~f~Id~~tG~-------------i~~~ 364 (384)
.++. -.-..-.++. ..+.-... ...+.|.... -....|.+|...-- ..+.
T Consensus 341 ~~~~--~~v~~~a~Iapl~v~g~qKN-~~~lt~~~a~------------~gt~~~t~d~~~a~~~~s~~s~~~~~~~~L~ 405 (624)
T PF05895_consen 341 SGNT--LTVTRNAKIAPLTVNGVQKN-TMTLTFKVAP------------LGTGTFTTDNGSASGTWSSISELTNSSANLG 405 (624)
T ss_pred CCcE--EEEEEEEEEeEEEEcccccc-eEEEEEEEEE------------cCcceEEEEccccccceeeeeeecccceeec
Q ss_pred ccCCcccccEEEEEEEEEcC
Q psy13120 365 KSLDREKQAEYLLTLQAMDS 384 (384)
Q Consensus 365 ~~ld~e~~~~~~l~V~a~D~ 384 (384)
..+| ....|.+.+.++|.
T Consensus 406 g~y~--~~kSy~V~~~l~D~ 423 (624)
T PF05895_consen 406 GTYD--AEKSYDVRGTLSDK 423 (624)
T ss_pred cccC--CCceEEEEEEEEEE
No 58
>PF09100 Qn_am_d_aIV: Quinohemoprotein amine dehydrogenase, alpha subunit domain IV; InterPro: IPR015184 This domain is predominantly found in the prokaryotic protein quinohemoprotein amine dehydrogenase, adopting an immunoglobulin-like beta-sandwich fold, with seven strands arranged into two beta sheets; the fold is possibly related to the immunoglobulin and/or fibronectin type III superfamilies. The precise function of this domain has not, as yet, been defined []. ; PDB: 1JMZ_A 1JMX_A 1PBY_A 1JJU_A.
Probab=26.61 E-value=1.1e+02 Score=23.74 Aligned_cols=34 Identities=15% Similarity=0.268 Sum_probs=17.6
Q ss_pred CeEEEEEEEEeC-CCCCCceeEEEEEEeccCCCCCC
Q psy13120 145 SNYTLEIKAKDR-GEPSKSSRTKFFIRLLDENDNNP 179 (384)
Q Consensus 145 ~~~~l~V~a~d~-~~~~~~~~~~v~I~V~dvnd~~P 179 (384)
..=.|.|.|+-. ++.+++....+.|+|...|+ +|
T Consensus 98 N~Gnl~VvAtv~d~~~~l~~e~~liVtVqr~~~-pp 132 (133)
T PF09100_consen 98 NAGNLKVVATVKDGGKPLTGEAHLIVTVQRWNN-PP 132 (133)
T ss_dssp S-EEEEEEEEETTTT---EEEEEEEEE---S----S
T ss_pred CcccEEEEEEEccCCcccceeEeEEEEeecccC-CC
Confidence 345677777743 34577899999999999887 65
No 59
>COG4288 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=24.52 E-value=3.1e+02 Score=20.65 Aligned_cols=54 Identities=26% Similarity=0.400 Sum_probs=31.7
Q ss_pred CcCCCcEEEEEEEEEeCCCCCCCCceeEEEEEEEEEeecCCCCEEccC-ceeEeeCCCCCCcEEEEEEEEe
Q psy13120 34 DRETKSSYNLVVMATDMAKPPQKRLSSTTQVSIVIRDVNDNAPEFITA-NETTIPENSPLNTVVAALKAID 103 (384)
Q Consensus 34 D~E~~~~~~l~v~a~d~~~~~~~~~s~~~~v~I~V~dvNd~~P~f~~~-~~~~v~E~~~~gt~v~~~~a~D 103 (384)
|+|....|.++| ++..++.+...|.+|..+.-... |...+.-|.| +|.+++--|
T Consensus 53 Dr~pvgpyevev-------------aarrt~hlRfndL~dpe~iP~d~~yasviesnvP---vVvQ~tRLD 107 (124)
T COG4288 53 DREPVGPYEVEV-------------AARRTLHLRFNDLGDPEAIPKDTPYASVIESNVP---VVVQLTRLD 107 (124)
T ss_pred cCCCCCceEEEe-------------ecceeEEEEecccCCcccCCCCCchhhheecCCc---eEEEEEEec
Confidence 666666676666 34457888999999844433333 4444555544 455554433
No 60
>COG1470 Predicted membrane protein [Function unknown]
Probab=23.52 E-value=7.1e+02 Score=24.41 Aligned_cols=60 Identities=18% Similarity=0.264 Sum_probs=29.6
Q ss_pred CCeEEEEEEEEeCCCCCCceeEEEEEEeccCCCCCCceec-------------cceEEEEecCCCCCcEEEEEEEE
Q psy13120 144 KSNYTLEIKAKDRGEPSKSSRTKFFIRLLDENDNNPIFDS-------------KSYSASVPENISVGATVLQVSAT 206 (384)
Q Consensus 144 ~~~~~l~V~a~d~~~~~~~~~~~v~I~V~dvnd~~P~f~~-------------~~~~~~v~E~~~~g~~v~~v~a~ 206 (384)
...-.+.+...+.|+-++ ..+.+.|.+.-+..-.+.. -..++.+++++..|....+++|.
T Consensus 396 Gee~~i~i~I~NsGna~L---tdIkl~v~~PqgWei~Vd~~~I~sL~pge~~tV~ltI~vP~~a~aGdY~i~i~~k 468 (513)
T COG1470 396 GEEKTIRISIENSGNAPL---TDIKLTVNGPQGWEIEVDESTIPSLEPGESKTVSLTITVPEDAGAGDYRITITAK 468 (513)
T ss_pred CccceEEEEEEecCCCcc---ceeeEEecCCccceEEECcccccccCCCCcceEEEEEEcCCCCCCCcEEEEEEEe
Confidence 344566677777775444 3444555433222111111 11235556666666666665554
No 61
>KOG4221|consensus
Probab=20.34 E-value=1.2e+03 Score=26.02 Aligned_cols=138 Identities=18% Similarity=0.300 Sum_probs=73.6
Q ss_pred cceEEEEceeCCccCCCeEEEEEEEEeCCCCCCceeEEEEEEeccCCCCCCceeccceEEEEecCCCCCcEEEEEEEEeC
Q psy13120 129 VDGLLRVAQKLDREIKSNYTLEIKAKDRGEPSKSSRTKFFIRLLDENDNNPIFDSKSYSASVPENISVGATVLQVSATDL 208 (384)
Q Consensus 129 ~~G~i~~~~~ld~e~~~~~~l~V~a~d~~~~~~~~~~~v~I~V~dvnd~~P~f~~~~~~~~v~E~~~~g~~v~~v~a~D~ 208 (384)
..|.+...+.|- ....|.+.|.|....+.. .+...+.|.-. -+ +|.- ...+. .+..-..+...-+
T Consensus 480 s~g~~~tv~nl~--p~t~Y~~rv~A~n~~g~g-~sS~pLkV~t~--pE-gp~~-~~a~a--------ts~~ti~v~WepP 544 (1381)
T KOG4221|consen 480 SPGIQVTVQNLS--PLTMYFFRVRAKNEAGSG-ESSAPLKVTTQ--PE-GPVQ-LQAYA--------TSPTTILVTWEPP 544 (1381)
T ss_pred CCceEEEeeecc--cceeEEEEEeccCcccCC-ccCCceEEecC--CC-CCcc-ccccc--------cCcceEEEEecCC
Confidence 346333334442 337899999999876533 33444444322 22 3322 11111 1111122333333
Q ss_pred CCCCCceEEEEEE--eCCCCCCeEEECCCcEEEEcccCCccccCeEEEEEEEEECCCCCcceeeeeEEEEE-EEEeecCC
Q psy13120 209 DDGINGRVRYTII--SGDDNHDLTISEDNGLLRIAKNLNYERKAQYSLTIQAEDCGGDTEERIRFDTATVS-ILVSDIND 285 (384)
Q Consensus 209 D~~~~~~i~y~i~--~~~~~~~f~i~~~~G~l~~~~~lD~e~~~~~~l~V~a~D~~~~~~~~~~~~~~~v~-i~V~dvNd 285 (384)
--+...-..|.+. .++...++.++.++-++++. +.+....|.+.|.|.+..+... +++.++ ++..|+-+
T Consensus 545 ~~~n~~I~~yk~~ys~~~~~~~~~~~~n~~e~ti~---gL~k~TeY~~~vvA~N~~G~g~-----sS~~i~V~Tlsd~Ps 616 (1381)
T KOG4221|consen 545 PFGNGPITGYKLFYSEDDTGKELRVENNATEYTIN---GLEKYTEYSIRVVAYNSAGSGV-----SSADITVRTLSDVPS 616 (1381)
T ss_pred CCCCCCceEEEEEEEcCCCCceEEEecCccEEEee---cCCCccceEEEEEEecCCCCCC-----CCCceEEEeccCCCC
Confidence 3222222344442 13566778888777776655 3356678999999999988874 233333 35677777
Q ss_pred CCCc
Q psy13120 286 NYPV 289 (384)
Q Consensus 286 ~~P~ 289 (384)
.||.
T Consensus 617 aPP~ 620 (1381)
T KOG4221|consen 617 APPQ 620 (1381)
T ss_pred CCCc
Confidence 6665
Done!