Query psy8629
Match_columns 483
No_of_seqs 287 out of 2003
Neff 8.9
Searched_HMMs 46136
Date Fri Aug 16 23:39:25 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy8629.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/8629hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG4289|consensus 100.0 1.3E-70 2.8E-75 573.8 39.4 426 12-483 161-616 (2531)
2 KOG4289|consensus 100.0 6.5E-70 1.4E-74 568.6 40.4 419 12-478 266-920 (2531)
3 KOG1219|consensus 100.0 8.2E-61 1.8E-65 514.2 48.2 447 12-478 2175-2812(4289)
4 KOG1219|consensus 100.0 4.6E-60 1E-64 508.5 45.0 416 18-482 643-1095(4289)
5 cd00031 CA Cadherin repeat dom 100.0 3.7E-29 7.9E-34 231.0 29.7 198 156-439 2-199 (199)
6 cd00031 CA Cadherin repeat dom 100.0 5.4E-28 1.2E-32 223.2 26.4 196 19-255 1-199 (199)
7 PF00028 Cadherin: Cadherin do 99.7 3.5E-16 7.7E-21 125.8 13.8 89 348-438 4-93 (93)
8 PF00028 Cadherin: Cadherin do 99.7 5.9E-16 1.3E-20 124.5 15.0 92 156-254 1-93 (93)
9 KOG1834|consensus 99.7 4.4E-15 9.6E-20 148.9 20.6 212 141-439 23-244 (952)
10 smart00112 CA Cadherin repeats 99.7 1.1E-15 2.5E-20 118.9 11.2 79 365-445 1-79 (79)
11 smart00112 CA Cadherin repeats 99.6 2.1E-14 4.6E-19 111.7 10.9 79 176-261 1-79 (79)
12 KOG1834|consensus 99.5 8.3E-13 1.8E-17 132.8 19.3 204 15-256 32-245 (952)
13 PF08266 Cadherin_2: Cadherin- 98.3 1.4E-06 3.1E-11 67.6 6.8 62 19-84 2-68 (84)
14 PF08758 Cadherin_pro: Cadheri 97.4 0.0016 3.5E-08 51.2 9.0 64 12-83 3-67 (90)
15 smart00736 CADG Dystroglycan-t 96.6 0.054 1.2E-06 43.4 11.6 72 364-442 24-96 (97)
16 PF08266 Cadherin_2: Cadherin- 96.5 0.0018 3.8E-08 50.4 2.4 60 348-410 6-67 (84)
17 TIGR01965 VCBS_repeat VCBS rep 96.3 0.038 8.3E-07 44.1 9.0 92 171-276 2-98 (99)
18 TIGR01965 VCBS_repeat VCBS rep 96.1 0.079 1.7E-06 42.3 9.7 89 360-460 2-98 (99)
19 PF08758 Cadherin_pro: Cadheri 96.0 0.1 2.2E-06 41.1 9.8 79 148-241 3-81 (90)
20 smart00736 CADG Dystroglycan-t 95.3 0.29 6.4E-06 39.1 10.4 70 175-258 24-96 (97)
21 PF13750 Big_3_3: Bacterial Ig 94.3 3.5 7.5E-05 36.2 16.1 82 171-257 69-151 (158)
22 KOG3597|consensus 93.8 6.1 0.00013 40.5 18.0 154 133-304 24-194 (442)
23 TIGR00845 caca sodium/calcium 91.9 15 0.00033 41.3 19.1 154 144-304 395-568 (928)
24 PF05345 He_PIG: Putative Ig d 89.1 2.1 4.6E-05 29.4 6.2 39 383-423 9-48 (49)
25 TIGR00845 caca sodium/calcium 83.5 20 0.00044 40.3 13.4 47 256-304 395-441 (928)
26 TIGR03660 T1SS_rpt_143 T1SS-14 81.9 20 0.00044 30.6 10.1 69 197-275 56-127 (137)
27 PF07495 Y_Y_Y: Y_Y_Y domain; 79.3 10 0.00022 27.4 6.7 61 371-438 6-66 (66)
28 COG2706 3-carboxymuconate cycl 75.3 90 0.0019 30.9 26.9 75 250-330 214-288 (346)
29 KOG3597|consensus 70.5 38 0.00083 34.9 10.2 63 245-308 24-86 (442)
30 TIGR03660 T1SS_rpt_143 T1SS-14 69.7 23 0.00049 30.3 7.2 44 411-460 85-128 (137)
31 PF06586 TraK: TraK protein; 52.3 2E+02 0.0043 26.8 11.7 95 18-140 19-119 (234)
32 PF05345 He_PIG: Putative Ig d 48.6 85 0.0018 21.4 6.5 45 49-95 2-47 (49)
33 PF03160 Calx-beta: Calx-beta 48.1 1.3E+02 0.0029 23.5 10.8 51 250-304 2-52 (100)
34 PF07495 Y_Y_Y: Y_Y_Y domain; 46.5 39 0.00085 24.2 4.2 30 225-254 37-66 (66)
35 TIGR00864 PCC polycystin catio 43.8 8.5E+02 0.019 31.6 34.5 189 222-439 1563-1779(2740)
36 cd00146 PKD polycystic kidney 42.7 56 0.0012 24.5 4.7 30 405-436 51-80 (81)
37 smart00089 PKD Repeats in poly 42.4 59 0.0013 24.2 4.8 31 404-437 48-78 (79)
38 PF12245 Big_3_2: Bacterial Ig 40.8 1.1E+02 0.0024 21.8 5.7 31 409-441 21-51 (60)
39 PF03160 Calx-beta: Calx-beta 39.7 1.8E+02 0.004 22.7 10.1 53 138-194 2-54 (100)
40 PF12245 Big_3_2: Bacterial Ig 39.1 1.2E+02 0.0027 21.6 5.7 30 226-257 22-51 (60)
41 PF13956 Ibs_toxin: Toxin Ibs, 34.7 25 0.00054 18.5 1.0 16 1-16 2-17 (19)
42 cd00146 PKD polycystic kidney 34.1 92 0.002 23.2 4.7 29 222-252 52-80 (81)
43 PF12988 DUF3872: Domain of un 33.9 2.8E+02 0.0061 23.6 7.7 42 24-75 40-83 (137)
44 smart00089 PKD Repeats in poly 33.8 99 0.0021 22.9 4.9 29 222-253 50-78 (79)
45 PF10365 DUF2436: Domain of un 32.3 3.2E+02 0.0069 23.3 9.4 90 141-240 63-157 (161)
46 PF13754 Big_3_4: Bacterial Ig 30.8 1.7E+02 0.0036 20.3 5.2 16 409-424 22-37 (54)
47 PF02494 HYR: HYR domain; Int 24.3 1.5E+02 0.0032 22.3 4.4 26 410-437 56-81 (81)
48 PF09100 Qn_am_d_aIV: Quinohem 22.0 2E+02 0.0043 24.1 4.7 31 414-445 101-132 (133)
49 PF02494 HYR: HYR domain; Int 21.6 3.5E+02 0.0076 20.2 9.7 24 228-253 58-81 (81)
No 1
>KOG4289|consensus
Probab=100.00 E-value=1.3e-70 Score=573.77 Aligned_cols=426 Identities=34% Similarity=0.464 Sum_probs=398.2
Q ss_pred HhhhcCceEEEEEeCCCCCCeEEEEEEccCCCCCCCCEEEEecCCCCCcccEEEeCCccEEEeCccCCccCCceeeEEEE
Q psy8629 12 AVAADEHLRDLEISEGVPVGTRVGFIGDGFVGDSGPPYLIVPVPGSAVDTDLSIEQNTGEIRTKVPLDRETRSSYSLVAI 91 (483)
Q Consensus 12 ~~~~~~~~~~~~v~En~~~gt~V~~v~a~d~d~~~~~y~i~~~~~~~~~~~F~Id~~tG~i~~~~~LD~E~~~~y~l~v~ 91 (483)
++.|.+..|...++||.|+||.|.+++|.|++..++.|++.....+...++|+||+.+|.|++++.||||....+-|+|.
T Consensus 161 ~~~Fqq~~Yq~~lpEn~pagT~iasv~A~~~~a~rl~Ysm~al~dsRS~~lFslD~~sG~irta~~lDREt~e~HvlrVt 240 (2531)
T KOG4289|consen 161 AVQFQQPNYQKELPENEPAGTIIASVKASDPDAGRLYYSMVALFDSRSQNLFSLDPMSGAIRTAKSLDRETKETHVLRVT 240 (2531)
T ss_pred CccCCCcchhccCcCCCCCCceeEEEEecCCCcCceEEEeeeccchhccccEeeccccccchhhhhhhhhhhheeEEEEE
Confidence 45899999999999999999999999999999888999998766666778999999999999999999999998989888
Q ss_pred ecCCcceeEEEeecCcEEEEcccCCccccceEEEEEEeCCCCeEEEEEEEecCCCCCCccCCCceEEEEeCCCCCCeEEE
Q psy8629 92 PLSGENVRVVSFYEPGEIRTKVPLDRETRSSYSLVAIPLSGENVRVVVRVIDENDNAPTFPSPFMSIEFPENTPRDVKRT 171 (483)
Q Consensus 92 a~d~~~~~~~~~~~~g~l~~~~~ld~e~~~~~~~~~~~~~~~~~~v~I~V~DvND~~P~f~~~~~~~~V~E~~~~gt~v~ 171 (483)
|.|.+. +.-+.+++|+|.|.|+|||.|+|.+.+|...+.||.++|..|.
T Consensus 241 A~d~~~-------------------------------P~~SAtttv~V~V~D~nDhsPvFEq~~Y~e~lREn~evGy~vL 289 (2531)
T KOG4289|consen 241 AQDHGD-------------------------------PRRSATTTVTVLVLDTNDHSPVFEQDEYREELRENLEVGYEVL 289 (2531)
T ss_pred eeecCC-------------------------------CcccceeEEEEEEeecCCCCcccchhHHHHHHhhccccCceEE
Confidence 877543 3334588999999999999999999999999999999999999
Q ss_pred EEEEEeCCCCCCceEEEEEEeCCCCCceEEecccCCCceEEEEEEeccCCCCCCCCEEEEEEEEEECCCCCeeeEEEEEE
Q psy8629 172 LNAAKDRDLGIYNTQRYNIVSGNTNNAFRLSSHREKDGVLYLDLQINGVLDRETIPEYSLVIEALDGGSPPLRGSMIVNV 251 (483)
Q Consensus 172 ~v~a~D~D~~~~~~~~y~l~~~~~~~~F~i~~~~~~~g~~~~~l~~~~~LD~E~~~~~~l~v~a~D~g~p~~~~~~~v~I 251 (483)
+++|+|.|.++|+.+.|++..|++.+.|.|++. +|. |++.++||||+...|+|.|.|+|.|.|+...++.|.|
T Consensus 290 tvrAtD~Dsp~Nani~Yrl~eg~~~~~f~in~r---SGv----I~T~a~lDRE~~~~y~L~VeAsDqG~~pgp~Ta~V~i 362 (2531)
T KOG4289|consen 290 TVRATDGDSPPNANIRYRLLEGNAKNVFEINPR---SGV----ISTRAPLDREELESYQLDVEASDQGRPPGPRTAMVEI 362 (2531)
T ss_pred EEEeccCCCCCCCceEEEecCCCccceeEEcCc---cce----eeccCccCHHhhhheEEEEEeccCCCCCCCceEEEEE
Confidence 999999999999999999999988999999994 567 8899999999999999999999999998888999999
Q ss_pred EEEecCCCCCeecCCceEEEecCCCCCCcEEEEEEEEeCCCCCccEEEEEEEecCCCCCcceEEeCCccEEEEcccCCCc
Q psy8629 252 TIQDVNDNQPIFNQSRYFATVPENATVGTSIFQVFATDADAGSNGEVEYSINRRQSDKDGLFNIDSKSGLIFVNKALDFE 331 (483)
Q Consensus 252 ~V~dvNd~~P~f~~~~~~~~v~E~~~~gt~v~~v~a~D~D~~~~~~i~y~i~~~~~~~~~~F~Id~~tG~l~~~~~LD~E 331 (483)
+|.|+|||+|+|....|...|.|+..++++|++|+|+|.|.|.|+.+.|+|. +++..+.|.||..||+|.+..+||||
T Consensus 363 tV~D~NDNaPqFse~~Yvvqv~Edvt~~avvlrV~AtDrD~g~Ng~VHYsi~--Sgn~~G~f~id~~tGel~vv~plD~e 440 (2531)
T KOG4289|consen 363 TVEDENDNAPQFSEKRYVVQVREDVTPPAVVLRVTATDRDKGTNGKVHYSIA--SGNGRGQFYIDSLTGELDVVEPLDFE 440 (2531)
T ss_pred EEEecCCCCccccccceEEEecccCCCCceEEEEEecccCCCcCceEEEEee--ccCccccEEEecccceEEEecccccc
Confidence 9999999999999999999999999999999999999999999999999995 57889999999999999999999999
Q ss_pred ccceeeeEeE--------------EEE---Eee-------------eeeccCCCCeEEEEEEEEeCCCCCCceEEEEEee
Q psy8629 332 TKEVHDLVPR--------------FAV---EIS-------------SFDEIAEPGTSVFQISAFDQDEGINSVITYSLRE 381 (483)
Q Consensus 332 ~~~~~~l~v~--------------~~~---~v~-------------~v~E~~~~gt~v~~v~a~D~D~~~~~~v~y~l~~ 381 (483)
.. .|.+.|+ +++ .|+ ++.||.+.|..++.+.|.|.|.|.|+.++|++.
T Consensus 441 ~~-~ytl~IrAqDggrPpLsn~sgl~iqVlDINDhaPifvstpfq~tvlEnv~lg~~v~~vqaidadsg~na~l~y~la- 518 (2531)
T KOG4289|consen 441 NS-EYTLRIRAQDGGRPPLSNTSGLVIQVLDINDHAPIFVSTPFQATVLENVPLGYLVCHVQAIDADSGENARLHYSLA- 518 (2531)
T ss_pred CC-eeEEEEEcccCCCCCccCCCceEEEEEecCCCCceeEechhhhhhhhcccccceEEEEecccCCCCcccceeeeec-
Confidence 98 9999988 222 222 688999999999999999999999999999997
Q ss_pred CCCCCCCcEEEeCCCcEEEEcccCCCCCCCeeEEEEEEEECCcCCceEEEEEEEEEEecCCCCCeecCCceEEEEcCCCC
Q psy8629 382 NTDDHPSWFQIDSKSGLITMKSNVDCETDPVPKLTVIATDNGVPALSSSATVLVTIHDVNDNEPIFDQSFYNVTIPENKP 461 (483)
Q Consensus 382 ~~~~~~~~F~id~~tG~i~~~~~lD~E~~~~y~l~V~a~D~g~p~~s~~~~v~I~V~dvNd~~P~f~~~~~~~~V~E~~~ 461 (483)
..+.|.|+..+|.|++.+.||||+.+.|.|-|.|+|.|.|++++.+.|.|.+.|+|||.|.|.+..|+..+.|++|
T Consensus 519 ----G~~pf~I~~~SG~Itvtk~ldrEt~~~ysl~V~ard~gtp~l~tstsI~Vtv~dvndndP~Ft~~eytl~inED~p 594 (2531)
T KOG4289|consen 519 ----GVGPFQINNGSGWITVTKELDRETVEHYSLGVEARDHGTPPLSTSTSISVTVLDVNDNDPTFTQKEYTLRINEDAP 594 (2531)
T ss_pred ----cCCCeeEecCCceEEEeecccccccceEEEEEEEcCCCCCcccccceEEEEecccCCCCCccccCceEEEecCCcc
Confidence 3458999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred CCcEEEEEEEEeCCCCCCCcCC
Q psy8629 462 KGSCILKSIFYDIFNISEHSFK 483 (483)
Q Consensus 462 ~g~~v~~v~a~D~D~~~~~~~~ 483 (483)
.|+.|.+|+|.|.|.++.++|.
T Consensus 595 vgsSI~tvtAvD~d~~s~ityq 616 (2531)
T KOG4289|consen 595 VGSSIVTVTAVDRDANSVITYQ 616 (2531)
T ss_pred ccceEEEEEEeccccccceEEE
Confidence 9999999999999999988873
No 2
>KOG4289|consensus
Probab=100.00 E-value=6.5e-70 Score=568.57 Aligned_cols=419 Identities=34% Similarity=0.439 Sum_probs=385.5
Q ss_pred HhhhcCceEEEEEeCCCCCCeEEEEEEccCCC---CCCCCEEEEecCCCCCcccEEEeCCccEEEeCccCCccCCceeeE
Q psy8629 12 AVAADEHLRDLEISEGVPVGTRVGFIGDGFVG---DSGPPYLIVPVPGSAVDTDLSIEQNTGEIRTKVPLDRETRSSYSL 88 (483)
Q Consensus 12 ~~~~~~~~~~~~v~En~~~gt~V~~v~a~d~d---~~~~~y~i~~~~~~~~~~~F~Id~~tG~i~~~~~LD~E~~~~y~l 88 (483)
++-|+|++|.-++.||.++|..|.+++|+|.| +..+.|++..+ .+.+.|+||+.+|.|+++.+||||+..+|.|
T Consensus 266 sPvFEq~~Y~e~lREn~evGy~vLtvrAtD~Dsp~Nani~Yrl~eg---~~~~~f~in~rSGvI~T~a~lDRE~~~~y~L 342 (2531)
T KOG4289|consen 266 SPVFEQDEYREELRENLEVGYEVLTVRATDGDSPPNANIRYRLLEG---NAKNVFEINPRSGVISTRAPLDREELESYQL 342 (2531)
T ss_pred CcccchhHHHHHHhhccccCceEEEEEeccCCCCCCCceEEEecCC---CccceeEEcCccceeeccCccCHHhhhheEE
Confidence 46799999999999999999999999999965 66789999854 3456899999999999999999999999999
Q ss_pred EEEecCCcceeEEEeecCcEEEEcccCCccccceEEEEEEeCCCCeEEEEEEEecCCCCCCccCCCceEEEEeCCCCCCe
Q psy8629 89 VAIPLSGENVRVVSFYEPGEIRTKVPLDRETRSSYSLVAIPLSGENVRVVVRVIDENDNAPTFPSPFMSIEFPENTPRDV 168 (483)
Q Consensus 89 ~v~a~d~~~~~~~~~~~~g~l~~~~~ld~e~~~~~~~~~~~~~~~~~~v~I~V~DvND~~P~f~~~~~~~~V~E~~~~gt 168 (483)
.|.|+|.+. +....++.|.|+|.|+|||+|+|....|..+|.|+..+++
T Consensus 343 ~VeAsDqG~-------------------------------~pgp~Ta~V~itV~D~NDNaPqFse~~Yvvqv~Edvt~~a 391 (2531)
T KOG4289|consen 343 DVEASDQGR-------------------------------PPGPRTAMVEITVEDENDNAPQFSEKRYVVQVREDVTPPA 391 (2531)
T ss_pred EEEeccCCC-------------------------------CCCCceEEEEEEEEecCCCCccccccceEEEecccCCCCc
Confidence 999988542 1112278999999999999999999999999999999999
Q ss_pred EEEEEEEEeCCCCCCceEEEEEEeCCCCCceEEecccCCCceEEEEEEeccCCCCCCCCEEEEEEEEEECCCCCeeeEEE
Q psy8629 169 KRTLNAAKDRDLGIYNTQRYNIVSGNTNNAFRLSSHREKDGVLYLDLQINGVLDRETIPEYSLVIEALDGGSPPLRGSMI 248 (483)
Q Consensus 169 ~v~~v~a~D~D~~~~~~~~y~l~~~~~~~~F~i~~~~~~~g~~~~~l~~~~~LD~E~~~~~~l~v~a~D~g~p~~~~~~~ 248 (483)
.|.+|+|+|.|.|.|+.+.|+|.+|+..+.|.|+..+ |. +.+..+||+|.. .|++.|+|.|+|.|+++.+.-
T Consensus 392 vvlrV~AtDrD~g~Ng~VHYsi~Sgn~~G~f~id~~t---Ge----l~vv~plD~e~~-~ytl~IrAqDggrPpLsn~sg 463 (2531)
T KOG4289|consen 392 VVLRVTATDRDKGTNGKVHYSIASGNGRGQFYIDSLT---GE----LDVVEPLDFENS-EYTLRIRAQDGGRPPLSNTSG 463 (2531)
T ss_pred eEEEEEecccCCCcCceEEEEeeccCccccEEEeccc---ce----EEEeccccccCC-eeEEEEEcccCCCCCccCCCc
Confidence 9999999999999999999999999999999999954 55 788899999998 999999999999999999999
Q ss_pred EEEEEEecCCCCCeecCCceEEEecCCCCCCcEEEEEEEEeCCCCCccEEEEEEEecCCCCCcceEEeCCccEEEEcccC
Q psy8629 249 VNVTIQDVNDNQPIFNQSRYFATVPENATVGTSIFQVFATDADAGSNGEVEYSINRRQSDKDGLFNIDSKSGLIFVNKAL 328 (483)
Q Consensus 249 v~I~V~dvNd~~P~f~~~~~~~~v~E~~~~gt~v~~v~a~D~D~~~~~~i~y~i~~~~~~~~~~F~Id~~tG~l~~~~~L 328 (483)
+.|+|+|+|||+|.|....+..+|.|+.+.|..++.+.|.|.|.|+|+.+.|++.+ .+.|.|+..+|+|.+.+.|
T Consensus 464 l~iqVlDINDhaPifvstpfq~tvlEnv~lg~~v~~vqaidadsg~na~l~y~laG-----~~pf~I~~~SG~Itvtk~l 538 (2531)
T KOG4289|consen 464 LVIQVLDINDHAPIFVSTPFQATVLENVPLGYLVCHVQAIDADSGENARLHYSLAG-----VGPFQINNGSGWITVTKEL 538 (2531)
T ss_pred eEEEEEecCCCCceeEechhhhhhhhcccccceEEEEecccCCCCcccceeeeecc-----CCCeeEecCCceEEEeecc
Confidence 99999999999999999999999999999999999999999999999999999954 4589999999999999999
Q ss_pred CCcccceeeeEeE-------------------------------------------------------------------
Q psy8629 329 DFETKEVHDLVPR------------------------------------------------------------------- 341 (483)
Q Consensus 329 D~E~~~~~~l~v~------------------------------------------------------------------- 341 (483)
|||+.+.|.|.|+
T Consensus 539 drEt~~~ysl~V~ard~gtp~l~tstsI~Vtv~dvndndP~Ft~~eytl~inED~pvgsSI~tvtAvD~d~~s~ityqi~ 618 (2531)
T KOG4289|consen 539 DRETVEHYSLGVEARDHGTPPLSTSTSISVTVLDVNDNDPTFTQKEYTLRINEDAPVGSSIVTVTAVDRDANSVITYQIT 618 (2531)
T ss_pred cccccceEEEEEEEcCCCCCcccccceEEEEecccCCCCCccccCceEEEecCCccccceEEEEEEeccccccceEEEec
Confidence 9999999998887
Q ss_pred --------------------------------------------------------------------EEEEee------
Q psy8629 342 --------------------------------------------------------------------FAVEIS------ 347 (483)
Q Consensus 342 --------------------------------------------------------------------~~~~v~------ 347 (483)
|+.+|+
T Consensus 619 g~ntrn~Fsi~si~g~Glitlalp~dkKqe~~~vl~vtAtDg~l~d~~~V~v~I~danThrpvFqs~pfTvsI~e~rP~G 698 (2531)
T KOG4289|consen 619 GGNTRNRFSISSIGGGGLITLALPLDKKQERQYVLAVTATDGTLQDTCSVNVNITDANTHRPVFQSSPFTVSINEDRPLG 698 (2531)
T ss_pred CCcccccceeeccCCcceEEeecchhhcccceEEEEEEecCCccccceEEEEEeeecccCCcccccCCeeEeeccCCcCC
Confidence 000000
Q ss_pred --------------------------------------------------------------------------------
Q psy8629 348 -------------------------------------------------------------------------------- 347 (483)
Q Consensus 348 -------------------------------------------------------------------------------- 347 (483)
T Consensus 699 ~tvvtlsasd~D~geNARI~y~led~~Frid~dsg~i~t~~~ld~edqvtytl~itA~D~~~pq~adtttveV~v~diND 778 (2531)
T KOG4289|consen 699 TTVVTLSASDEDTGENARITYILEDEAFRIDPDSGAIYTQAELDYEDQVTYTLAITARDNGIPQKADTTTVEVLVNDIND 778 (2531)
T ss_pred ceeEEEecccCCCCccceEEEEecccceeecCCCCceEEeeeeecccceeeEeeeeecCCCCCCcCccEEEEEEeecccc
Confidence
Q ss_pred ------------eeeccCCCCeEEEEEEEEeCCCCCCceEEEEEeeCCCCCCCcEEEeCCCcEEEEcccCCCCCCCeeEE
Q psy8629 348 ------------SFDEIAEPGTSVFQISAFDQDEGINSVITYSLRENTDDHPSWFQIDSKSGLITMKSNVDCETDPVPKL 415 (483)
Q Consensus 348 ------------~v~E~~~~gt~v~~v~a~D~D~~~~~~v~y~l~~~~~~~~~~F~id~~tG~i~~~~~lD~E~~~~y~l 415 (483)
+|.|++|++|.|++|+|+|.|.|.|+.+.|.+..+. +..+.|.|++++|.|++.+.||||....|.|
T Consensus 779 naPqf~assyt~sV~Ed~Pv~TsvlQVSatDaD~g~Ng~v~y~~qg~~-d~p~~F~IEptSGviRtl~rLdRE~~avy~L 857 (2531)
T KOG4289|consen 779 NAPQFLASSYTGSVFEDAPVFTSVLQVSATDADSGPNGRVYYTFQGGD-DGPGDFYIEPTSGVIRTLRRLDRENVAVYVL 857 (2531)
T ss_pred cCcccchhhceeEeecCCCCcceEEEEEEeccCCCCCceEEEEecCCC-CCCCceEEccCcceeehhhhhcchheeEEEE
Confidence 788999999999999999999999999999887543 3458899999999999999999999999999
Q ss_pred EEEEEECCcCCceEEEEEEEEEEecCCCCCeecCCceEEEEcCCCCCCcEEEEEEEEeCCCCC
Q psy8629 416 TVIATDNGVPALSSSATVLVTIHDVNDNEPIFDQSFYNVTIPENKPKGSCILKSIFYDIFNIS 478 (483)
Q Consensus 416 ~V~a~D~g~p~~s~~~~v~I~V~dvNd~~P~f~~~~~~~~V~E~~~~g~~v~~v~a~D~D~~~ 478 (483)
.+.|+|.|.|++++++.|+|+|+|+|||||+|.+..|+..|.||.++|+.+++++|.|||.+.
T Consensus 858 ~a~avDrg~p~ls~~~eItvtvldvNDnaPvfe~~e~e~~I~enspvgs~va~i~a~dpdEG~ 920 (2531)
T KOG4289|consen 858 AAYAVDRGNPPLSAPVEITVTVLDVNDNAPVFEQDELELFIEENSPVGSVVALITADDPDEGP 920 (2531)
T ss_pred EEEEeeCCCCCcCCceEEEEEEEecCCCCCCCCCcceeeEEeecCccceeeEEEEccCCCcCC
Confidence 999999999999999999999999999999999999999999999999999999999999643
No 3
>KOG1219|consensus
Probab=100.00 E-value=8.2e-61 Score=514.24 Aligned_cols=447 Identities=31% Similarity=0.421 Sum_probs=395.3
Q ss_pred HhhhcCceEEEEEeCCCCCCeEEEEEEccCCC-CCCCCEEEEecCCCCCcccEEEeCCccEEEeCccCCccCCceeeEEE
Q psy8629 12 AVAADEHLRDLEISEGVPVGTRVGFIGDGFVG-DSGPPYLIVPVPGSAVDTDLSIEQNTGEIRTKVPLDRETRSSYSLVA 90 (483)
Q Consensus 12 ~~~~~~~~~~~~v~En~~~gt~V~~v~a~d~d-~~~~~y~i~~~~~~~~~~~F~Id~~tG~i~~~~~LD~E~~~~y~l~v 90 (483)
++.|.+..|.++++|+++.|+.|.++.|+|.| +..+.|+|.. ++.....|+|++.||+|++.+.||||+...|.+.+
T Consensus 2175 ~PvFeqlsYt~sisE~s~igt~viqilATdsDsn~~isYsl~g--~s~~sk~f~In~sTG~it~~g~ldyE~~q~f~~fv 2252 (4289)
T KOG1219|consen 2175 PPVFEQLSYTISISENSKIGTKVIQILATDSDSNREISYSLEG--NSEISKPFRINVSTGWITVAGKLDYEENQEFRFFV 2252 (4289)
T ss_pred CchhheeeEEEEccCCCccCceEEEEEeccCCCCCceEEEeec--CCccccceEEecccceEEEeeecChhhcceEEEEE
Confidence 46799999999999999999999999999988 6678999986 44556789999999999999999999999999999
Q ss_pred EecCCccee----------------------------------------EE-----------------------------
Q psy8629 91 IPLSGENVR----------------------------------------VV----------------------------- 101 (483)
Q Consensus 91 ~a~d~~~~~----------------------------------------~~----------------------------- 101 (483)
.|.||+... ++
T Consensus 2253 ratdggk~lSseviv~V~VeD~Ndn~Pef~q~~~ea~vsd~a~~g~fit~v~a~D~Dssd~lk~ey~~~~~l~~s~~G~i 2332 (4289)
T KOG1219|consen 2253 RATDGGKPLSSEVIVEVHVEDFNDNPPEFNQRNYEAFVSDPARSGHFITVVNAHDLDSSDHLKLEYNSNHFLILSENGII 2332 (4289)
T ss_pred EEccCCCcccccEEEEEEehhcCCCCchhccccceeecCCCccceeEEEEEEeccCCccchhhhhhcccceeeeccCceE
Confidence 999884210 00
Q ss_pred --------------------------------------------------------------------------------
Q psy8629 102 -------------------------------------------------------------------------------- 101 (483)
Q Consensus 102 -------------------------------------------------------------------------------- 101 (483)
T Consensus 2333 TlfNl~k~~l~~s~~lrv~vsD~v~~at~~vl~~~~~~n~~~~lveka~l~Tv~~~~~~~~~~f~~~gt~~~~si~s~~s 2412 (4289)
T KOG1219|consen 2333 TLFNLLKSPLQTSYPLRVTVSDGVFRATMEVLFHPHSRNHFSELVEKADLVTVVEHDEQEDADFGAYGTSIYYSINSRAS 2412 (4289)
T ss_pred EehhhcccccccccceeeeeccCcceeeeEEEEEecCcccchhhhhccceeEEEEecCccccccccCCceeeeeechhcc
Confidence
Q ss_pred ---EeecCcEEEEcccCCccccceEEEEEEeC--CCC----eEEEEEEEecCCCCCCccCCCceEEEEeCCCCCCeEEEE
Q psy8629 102 ---SFYEPGEIRTKVPLDRETRSSYSLVAIPL--SGE----NVRVVVRVIDENDNAPTFPSPFMSIEFPENTPRDVKRTL 172 (483)
Q Consensus 102 ---~~~~~g~l~~~~~ld~e~~~~~~~~~~~~--~~~----~~~v~I~V~DvND~~P~f~~~~~~~~V~E~~~~gt~v~~ 172 (483)
.+.+.|.|.+...||||....+.+.+..+ |++ -++++|.+.|+|||||.|....|++.|.|++..|..|++
T Consensus 2413 d~~~in~~GqI~t~~kld~e~s~~~vi~i~v~a~Da~gr~af~tvti~ltDiNDnpPqF~a~~Y~~nI~enaskg~~V~~ 2492 (4289)
T KOG1219|consen 2413 DHFEINKSGQIKTLSKLDREYSEELVIIIAVMAFDAGGRVAFCTVTIILTDINDNPPQFDAQLYRVNITENASKGKLVGH 2492 (4289)
T ss_pred CceeECCCccEEeeehhhhccCceEEEEEEEEEecCCCeEEEEEEEEEEEecCCCCccccceeEEEEeecccCCCceEEE
Confidence 01136888889999999998888876654 654 678999999999999999999999999999999999999
Q ss_pred EEEEeCCCCCCceEEEEEEeC-CCCCceEEecccCCCceEEEEEEeccCCCCCCCCEEEEEEEEEECCCCCeeeEEEEEE
Q psy8629 173 NAAKDRDLGIYNTQRYNIVSG-NTNNAFRLSSHREKDGVLYLDLQINGVLDRETIPEYSLVIEALDGGSPPLRGSMIVNV 251 (483)
Q Consensus 173 v~a~D~D~~~~~~~~y~l~~~-~~~~~F~i~~~~~~~g~~~~~l~~~~~LD~E~~~~~~l~v~a~D~g~p~~~~~~~v~I 251 (483)
+.|+|.|.+.|..++|.+.++ .....|.|++ +|. |.+.+.|++++...|.|.|+|.|+|.|++.+..+|.+
T Consensus 2493 v~A~D~De~snadvty~i~~e~~~~~v~~in~----sG~----Itv~~sL~~~en~tl~l~vkA~D~g~P~~~s~ttV~v 2564 (4289)
T KOG1219|consen 2493 VIARDADEGSNADVTYEIVGESDVKHVFEINE----SGV----ITVKRSLDGLENSTLHLFVKAIDDGKPRRRSNTTVIV 2564 (4289)
T ss_pred EEEecCCCCCcccEEEEecCchhhhheeeecC----Cce----EEeehhhhcccCcEEEEEEEeccCCCCCcccceEEEE
Confidence 999999999999999999876 2234566653 577 7889999999999999999999999999999999999
Q ss_pred EEEecCCCCCeecCCceEEEecCCCCCCcEEEEEEEEeCCCCCccEEEEEEEecCCCC---CcceEEeCCccEEEEcccC
Q psy8629 252 TIQDVNDNQPIFNQSRYFATVPENATVGTSIFQVFATDADAGSNGEVEYSINRRQSDK---DGLFNIDSKSGLIFVNKAL 328 (483)
Q Consensus 252 ~V~dvNd~~P~f~~~~~~~~v~E~~~~gt~v~~v~a~D~D~~~~~~i~y~i~~~~~~~---~~~F~Id~~tG~l~~~~~L 328 (483)
+|++..++.|.|+.+.|.++|+|+.++|+.|++++|.|.|. ..-|++...++++ ...|.+|++||.|.++++|
T Consensus 2565 ~vl~e~v~lPrFSep~y~fsvpEDv~vG~~Ig~v~a~~a~~----~~i~~~v~~gt~Esn~d~~Fsvdr~TG~i~v~ksL 2640 (4289)
T KOG1219|consen 2565 TVLPEDVNLPRFSEPIYTFSVPEDVPVGEEIGQVSASDADE----HVIYSLVLGGTPESNPDLPFSVDRNTGMIKVNKSL 2640 (4289)
T ss_pred EecCcccCcccccCceEEEeccccCCCCCeeeEEeecccCC----ceEEEEEeCCCCCCCCCCceEEcCCCceEEecccc
Confidence 99999999999999999999999999999999999999985 3566666544444 4459999999999999999
Q ss_pred CCcccceeeeEeEE----------EEEee------------------eeeccCCCCeEEEEEEEEeCCCCCCceEEEEEe
Q psy8629 329 DFETKEVHDLVPRF----------AVEIS------------------SFDEIAEPGTSVFQISAFDQDEGINSVITYSLR 380 (483)
Q Consensus 329 D~E~~~~~~l~v~~----------~~~v~------------------~v~E~~~~gt~v~~v~a~D~D~~~~~~v~y~l~ 380 (483)
|+|...+|.+.|.. .+.|. .+.||.+.|+.|++++|+|.|.|.|++|+|+|.
T Consensus 2641 D~E~kk~yqi~v~a~~~~~vva~tsv~vqVkDvNDNaPvFe~d~y~f~i~En~pvGtsV~qf~AsD~Ds~~nGqirysl~ 2720 (4289)
T KOG1219|consen 2641 DHEKKKSYQIKVKATCGQWVVAETSVFVQVKDVNDNAPVFEKDPYLFIIEENSPVGTSVIQFHASDMDSGNNGQIRYSLT 2720 (4289)
T ss_pred chhhhceEEEEEEeecCCceEEEEEEEEEeecccCCCccccCCceeEEEeccCCCCceEEEEEeeccCCCCCceEEEEEc
Confidence 99999999999881 11111 567999999999999999999999999999998
Q ss_pred eCCCCCCCcEEEeCCCcEEEEcccCCCCCCCeeEEEEEEEECCcCCceEEEEEEEEEEecCCCCCeecCCceEEEEcCCC
Q psy8629 381 ENTDDHPSWFQIDSKSGLITMKSNVDCETDPVPKLTVIATDNGVPALSSSATVLVTIHDVNDNEPIFDQSFYNVTIPENK 460 (483)
Q Consensus 381 ~~~~~~~~~F~id~~tG~i~~~~~lD~E~~~~y~l~V~a~D~g~p~~s~~~~v~I~V~dvNd~~P~f~~~~~~~~V~E~~ 460 (483)
.. ..+|.|++.||+|++...||+|++..|.|.|+|+|.|.| ++.++|.|.|.|+||+||+|+...|.++|.|+.
T Consensus 2721 ~~----v~yF~In~etGwlTt~~eld~ek~d~y~lkv~AtDhG~~--ssq~~v~v~vtDvndspprf~~eiy~gtvv~d~ 2794 (4289)
T KOG1219|consen 2721 SP----VPYFAINPETGWLTTLFELDLEKQDLYSLKVVATDHGVP--SSQATVLVHVTDVNDSPPRFQREIYEGTVVEDV 2794 (4289)
T ss_pred CC----cceEEEcCCCCeeeehhhhccccCCceEEEEEEecCCcc--cccceEEEEEEecCCCcchhhhHhhccceeccC
Confidence 42 339999999999999999999999999999999999986 788999999999999999999999999999999
Q ss_pred CCCcEEEEEEEEeCCCCC
Q psy8629 461 PKGSCILKSIFYDIFNIS 478 (483)
Q Consensus 461 ~~g~~v~~v~a~D~D~~~ 478 (483)
+.|.+++.++..|.|-..
T Consensus 2795 p~~~~ia~~si~d~D~s~ 2812 (4289)
T KOG1219|consen 2795 PGGKIIAGLSIFDADVSE 2812 (4289)
T ss_pred CCCceeeeeEeccccccc
Confidence 999999999999999543
No 4
>KOG1219|consensus
Probab=100.00 E-value=4.6e-60 Score=508.46 Aligned_cols=416 Identities=29% Similarity=0.400 Sum_probs=380.9
Q ss_pred ceEEEEEeCCCCCCeEEEEEEccCCC---CCCCCEEEEecCCCCCcccEEEeCCccEEEeCccCCccCCceeeEEEEecC
Q psy8629 18 HLRDLEISEGVPVGTRVGFIGDGFVG---DSGPPYLIVPVPGSAVDTDLSIEQNTGEIRTKVPLDRETRSSYSLVAIPLS 94 (483)
Q Consensus 18 ~~~~~~v~En~~~gt~V~~v~a~d~d---~~~~~y~i~~~~~~~~~~~F~Id~~tG~i~~~~~LD~E~~~~y~l~v~a~d 94 (483)
-..-+.|+|+.|.|+.++.++|+|.| +..+.|-|..+. .+..|.|+..+|.|++.++||+|....|.|.+.|.|
T Consensus 643 fP~iI~v~Edvpigt~la~L~atD~Dtgfng~l~yvI~dgn---e~~~~~Id~qsg~itvas~ld~~~t~~yiLnvta~D 719 (4289)
T KOG1219|consen 643 FPFIIVVPEDVPIGTTLAILSATDSDTGFNGKLVYVIEDGN---ESICFLIDRQSGNITVASPLDNENTEQYILNVTAYD 719 (4289)
T ss_pred CCceEEccccCCCCceEEEEeccCCCCCcCceEEEEEeCCc---cceEEEEecccceEEEecchhhhhhheeEEEEEEec
Confidence 45678999999999999999999977 556789998643 456899999999999999999999999999999998
Q ss_pred CcceeEEEeecCcEEEEcccCCccccceEEEEEEeCCCCeEEEEEEEecCCCCCCccCCCceEEEEeCCCCCCeEEEEEE
Q psy8629 95 GENVRVVSFYEPGEIRTKVPLDRETRSSYSLVAIPLSGENVRVVVRVIDENDNAPTFPSPFMSIEFPENTPRDVKRTLNA 174 (483)
Q Consensus 95 ~~~~~~~~~~~~g~l~~~~~ld~e~~~~~~~~~~~~~~~~~~v~I~V~DvND~~P~f~~~~~~~~V~E~~~~gt~v~~v~ 174 (483)
++. +.++....+.|.|.|.|||+|+|.+..|.++|+|+..+|+.|++|.
T Consensus 720 ~gt-------------------------------Pqkss~r~l~v~vkd~ndn~p~f~e~sy~vtvsedtepgs~Ia~ve 768 (4289)
T KOG1219|consen 720 LGT-------------------------------PQKSSWRLLLVFVKDYNDNTPIFVERSYHVTVSEDTEPGSFIAHVE 768 (4289)
T ss_pred CCC-------------------------------chhhceeeEEEEEEecccCCccccccceEEEEecCCCCCceEEEEE
Confidence 763 4455567788999999999999999999999999999999999999
Q ss_pred EEeCCCCCCceEEEEEEeCCCCCceEEecccCCCceEEEEEEeccCCCCCCCCEEEEEEEEEECCCCCeeeEEEEEEEEE
Q psy8629 175 AKDRDLGIYNTQRYNIVSGNTNNAFRLSSHREKDGVLYLDLQINGVLDRETIPEYSLVIEALDGGSPPLRGSMIVNVTIQ 254 (483)
Q Consensus 175 a~D~D~~~~~~~~y~l~~~~~~~~F~i~~~~~~~g~~~~~l~~~~~LD~E~~~~~~l~v~a~D~g~p~~~~~~~v~I~V~ 254 (483)
|.|.|.|.||.++|+|.. +...|+|++. +|. +.+.++||||.+..|.|.|+|+|.+.|.+.+...+.|.|.
T Consensus 769 tnd~D~g~NG~v~fsL~n--~sdvfsIdp~---tGi----vv~~~sLdrE~q~~y~l~I~a~dqp~pq~~svv~l~vsve 839 (4289)
T KOG1219|consen 769 TNDTDGGNNGMVSFSLLN--KSDVFSIDPF---TGI----VVTSKSLDREGQTSYHLKIEARDQPPPQLFSVVELDVSVE 839 (4289)
T ss_pred ecccCCCCCceEEEEecC--CcceEEecCc---ccE----EEeccccCcccCceeEEEEEEcCCCCCceEEEEEEEEEEe
Confidence 999999999999999976 5679999995 466 7889999999999999999999999999999999999999
Q ss_pred ecCCCCCeecCCceEEEecCCCCCCcEEEEEEEEeCCCCCccEEEEEEEecCCCCCcceEEeCCccEEEEcccCCCcccc
Q psy8629 255 DVNDNQPIFNQSRYFATVPENATVGTSIFQVFATDADAGSNGEVEYSINRRQSDKDGLFNIDSKSGLIFVNKALDFETKE 334 (483)
Q Consensus 255 dvNd~~P~f~~~~~~~~v~E~~~~gt~v~~v~a~D~D~~~~~~i~y~i~~~~~~~~~~F~Id~~tG~l~~~~~LD~E~~~ 334 (483)
|+|||+|.|....+...|+|+.|.|+++..+.|.|+|.|+.++++|.+.. ....|.+|--+|.+++.++||||...
T Consensus 840 dVndnpPkci~~hsr~kipedlp~gt~~~~l~A~d~diGq~~kvry~l~~----~~v~~rvd~~sGavfi~~~LDf~k~~ 915 (4289)
T KOG1219|consen 840 DVNDNPPKCIIRHSRSKIPEDLPYGTVTWQLVALDPDIGQLGKVRYYLTD----DTVGERVDFPSGAVFIGKPLDFEKSD 915 (4289)
T ss_pred eccCCCCccccccccccCcccCCCceEEEEhhhcCcccCcCceeEEEEec----CccccccccccccEEEeccccccccc
Confidence 99999999999999999999999999999999999999999999999964 23445999999999999999999999
Q ss_pred eeeeEeEEE------------EEee--------------------eeeccCCCCeEEEEEEEEeCCCCCCceEEEEEeeC
Q psy8629 335 VHDLVPRFA------------VEIS--------------------SFDEIAEPGTSVFQISAFDQDEGINSVITYSLREN 382 (483)
Q Consensus 335 ~~~l~v~~~------------~~v~--------------------~v~E~~~~gt~v~~v~a~D~D~~~~~~v~y~l~~~ 382 (483)
.|+|.|++. ..|. +|.||++.|+.++.+.|.|.|.|..+.+.|+|.
T Consensus 916 fynLsv~a~d~g~p~lss~chl~Vevldv~enlhpp~F~~~v~e~~V~EnapiGT~vi~i~A~dedsgldg~l~Y~I~-- 993 (4289)
T KOG1219|consen 916 FYNLSVTAVDRGTPILSSICHLEVEVLDVNENLHPPEFISFVTEGHVLENAPIGTIVIRIQARDEDSGLDGELSYKIR-- 993 (4289)
T ss_pred ceEEEEEEecCCCcceeeeEEEEEEEeccCCCCCCcchheeeeeeeEeecCCcceEEEEEEEecCCCCccceEEEEEE--
Confidence 999998811 1111 789999999999999999999999999999999
Q ss_pred CCCCCCcEEEeCCCcEEEEcccCCCCCCCeeEEEEEEEECCcCCceEEEEEEEEEEecCCCCCeecCCceEEEEcCCCCC
Q psy8629 383 TDDHPSWFQIDSKSGLITMKSNVDCETDPVPKLTVIATDNGVPALSSSATVLVTIHDVNDNEPIFDQSFYNVTIPENKPK 462 (483)
Q Consensus 383 ~~~~~~~F~id~~tG~i~~~~~lD~E~~~~y~l~V~a~D~g~p~~s~~~~v~I~V~dvNd~~P~f~~~~~~~~V~E~~~~ 462 (483)
.|...+.|.||.++|.|++.+.||||....|-|+|.|+|.|.+++++.+.+.|.|+|+|||+|+|.++.|..+|.||+|.
T Consensus 994 ~gdg~g~FsId~~tG~irTl~~lDrE~ks~YwltveA~D~gt~~~ssv~~vyI~ieDvNDn~Pq~s~pvy~asI~enSp~ 1073 (4289)
T KOG1219|consen 994 TGDGDGIFSIDSTTGSIRTLKALDREKKSSYWLTVEAKDLGTVPLSSVCEVYIEIEDVNDNVPQFSSPVYYASISENSPE 1073 (4289)
T ss_pred cCCcceeEEecCCcceEeechhhchhhcceEEEEEEEEecCCCccccceeEEEEEEecCCCCcccCCceEeeeeccCCCC
Confidence 56778899999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred CcEEEEEEEEeCCCCC--CCcC
Q psy8629 463 GSCILKSIFYDIFNIS--EHSF 482 (483)
Q Consensus 463 g~~v~~v~a~D~D~~~--~~~~ 482 (483)
+..|.++.|.|||+.+ .+.|
T Consensus 1074 ~vsivq~ea~D~Dsssn~kLmy 1095 (4289)
T KOG1219|consen 1074 TVSIVQAEANDPDSSSNQKLMY 1095 (4289)
T ss_pred ceEEEEeccCCCCcccCcceEE
Confidence 9999999999999544 5554
No 5
>cd00031 CA Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here
Probab=99.97 E-value=3.7e-29 Score=231.03 Aligned_cols=198 Identities=44% Similarity=0.654 Sum_probs=178.0
Q ss_pred eEEEEeCCCCCCeEEEEEEEEeCCCCCCceEEEEEEeCCCCCceEEecccCCCceEEEEEEeccCCCCCCCCEEEEEEEE
Q psy8629 156 MSIEFPENTPRDVKRTLNAAKDRDLGIYNTQRYNIVSGNTNNAFRLSSHREKDGVLYLDLQINGVLDRETIPEYSLVIEA 235 (483)
Q Consensus 156 ~~~~V~E~~~~gt~v~~v~a~D~D~~~~~~~~y~l~~~~~~~~F~i~~~~~~~g~~~~~l~~~~~LD~E~~~~~~l~v~a 235 (483)
|.+.|+|+++.|+.++++.|.|+|.+.++.++|+|.++....+|.|++. +|. |.+.+.||||....|.|.|+|
T Consensus 2 ~~~~i~En~~~g~~v~~~~a~D~D~~~~~~~~y~i~~~~~~~~F~i~~~---tG~----l~~~~~lD~e~~~~~~l~v~a 74 (199)
T cd00031 2 YSVSVPENAPPGTVVGTVSATDPDSGENGRVTYSILGGNEDGLFSIDPN---TGV----ITTTKPLDREEQSEYTLTVVA 74 (199)
T ss_pred eEEEEeCCCCCCCEEEEEEEECCCCCCCceEEEEEeCCCCcccEEEeCC---CCE----EEECCCCCCcCCceEEEEEEE
Confidence 7899999999999999999999999888899999998765579999985 466 778899999999999999999
Q ss_pred EECCCCCeeeEEEEEEEEEecCCCCCeecCCceEEEecCCCCCCcEEEEEEEEeCCCCCccEEEEEEEecCCCCCcceEE
Q psy8629 236 LDGGSPPLRGSMIVNVTIQDVNDNQPIFNQSRYFATVPENATVGTSIFQVFATDADAGSNGEVEYSINRRQSDKDGLFNI 315 (483)
Q Consensus 236 ~D~g~p~~~~~~~v~I~V~dvNd~~P~f~~~~~~~~v~E~~~~gt~v~~v~a~D~D~~~~~~i~y~i~~~~~~~~~~F~I 315 (483)
+|.|.|.++++..++|.|.|+||++|.|....|.+.+.|+.+.|+.++++.|+|+|.+.++.++|+|..
T Consensus 75 ~D~g~~~~~~~~~v~I~V~d~Nd~~P~~~~~~~~~~v~e~~~~~~~i~~~~a~D~D~~~~~~~~y~l~~----------- 143 (199)
T cd00031 75 SDGGGPPLSSTATVTVTVLDVNDNPPVFEQSSYEASVPENAPPGTVVGTVTATDADSGENAKLTYSILS----------- 143 (199)
T ss_pred EECCcCcceeEEEEEEEEccCCCCCCcccccceEEEEeCCCCCCCEEEEEEEEcCCCCCCccEEEEEeC-----------
Confidence 999888888999999999999999999998899999999999999999999999998766777777742
Q ss_pred eCCccEEEEcccCCCcccceeeeEeEEEEEeeeeeccCCCCeEEEEEEEEeCCCCCCceEEEEEeeCCCCCCCcEEEeCC
Q psy8629 316 DSKSGLIFVNKALDFETKEVHDLVPRFAVEISSFDEIAEPGTSVFQISAFDQDEGINSVITYSLRENTDDHPSWFQIDSK 395 (483)
Q Consensus 316 d~~tG~l~~~~~LD~E~~~~~~l~v~~~~~v~~v~E~~~~gt~v~~v~a~D~D~~~~~~v~y~l~~~~~~~~~~F~id~~ 395 (483)
+...++|.|++.
T Consensus 144 --------------------------------------------------------------------~~~~~~f~i~~~ 155 (199)
T cd00031 144 --------------------------------------------------------------------GNDKELFSIDPN 155 (199)
T ss_pred --------------------------------------------------------------------CCCCCEEEEeCC
Confidence 111357888888
Q ss_pred CcEEEEcccCCCCCCCeeEEEEEEEECCcCCceEEEEEEEEEEe
Q psy8629 396 SGLITMKSNVDCETDPVPKLTVIATDNGVPALSSSATVLVTIHD 439 (483)
Q Consensus 396 tG~i~~~~~lD~E~~~~y~l~V~a~D~g~p~~s~~~~v~I~V~d 439 (483)
+|.|.+.+.||||....|.|.|.|+|.+.|.+++++.++|.|.|
T Consensus 156 ~G~i~~~~~ld~e~~~~~~l~v~a~D~~~~~~~~~~~i~i~v~d 199 (199)
T cd00031 156 TGIITLAKPLDREEKSSYELTVVATDGGGPPLSSTATVTVTVLD 199 (199)
T ss_pred ceEEEeCCccCCccCceEEEEEEEEECCCCCceeEEEEEEEEEC
Confidence 99999999999999999999999999988889999999999875
No 6
>cd00031 CA Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here
Probab=99.96 E-value=5.4e-28 Score=223.19 Aligned_cols=196 Identities=34% Similarity=0.503 Sum_probs=172.2
Q ss_pred eEEEEEeCCCCCCeEEEEEEccCCCC---CCCCEEEEecCCCCCcccEEEeCCccEEEeCccCCccCCceeeEEEEecCC
Q psy8629 19 LRDLEISEGVPVGTRVGFIGDGFVGD---SGPPYLIVPVPGSAVDTDLSIEQNTGEIRTKVPLDRETRSSYSLVAIPLSG 95 (483)
Q Consensus 19 ~~~~~v~En~~~gt~V~~v~a~d~d~---~~~~y~i~~~~~~~~~~~F~Id~~tG~i~~~~~LD~E~~~~y~l~v~a~d~ 95 (483)
.|.++|+||+++|+.|+++.|.|.|. ..+.|+|.+... ..+|.|++.+|+|++.++||||....|.|.+.|.|+
T Consensus 1 ~~~~~i~En~~~g~~v~~~~a~D~D~~~~~~~~y~i~~~~~---~~~F~i~~~tG~l~~~~~lD~e~~~~~~l~v~a~D~ 77 (199)
T cd00031 1 SYSVSVPENAPPGTVVGTVSATDPDSGENGRVTYSILGGNE---DGLFSIDPNTGVITTTKPLDREEQSEYTLTVVASDG 77 (199)
T ss_pred CeEEEEeCCCCCCCEEEEEEEECCCCCCCceEEEEEeCCCC---cccEEEeCCCCEEEECCCCCCcCCceEEEEEEEEEC
Confidence 47899999999999999999999874 457899986532 258999999999999999999999999999999884
Q ss_pred cceeEEEeecCcEEEEcccCCccccceEEEEEEeCCCCeEEEEEEEecCCCCCCccCCCceEEEEeCCCCCCeEEEEEEE
Q psy8629 96 ENVRVVSFYEPGEIRTKVPLDRETRSSYSLVAIPLSGENVRVVVRVIDENDNAPTFPSPFMSIEFPENTPRDVKRTLNAA 175 (483)
Q Consensus 96 ~~~~~~~~~~~g~l~~~~~ld~e~~~~~~~~~~~~~~~~~~v~I~V~DvND~~P~f~~~~~~~~V~E~~~~gt~v~~v~a 175 (483)
+.. ....+..+.|.|.|+|||+|.|....|.+.|.|+.+.|+.++++.|
T Consensus 78 g~~-------------------------------~~~~~~~v~I~V~d~Nd~~P~~~~~~~~~~v~e~~~~~~~i~~~~a 126 (199)
T cd00031 78 GGP-------------------------------PLSSTATVTVTVLDVNDNPPVFEQSSYEASVPENAPPGTVVGTVTA 126 (199)
T ss_pred CcC-------------------------------cceeEEEEEEEEccCCCCCCcccccceEEEEeCCCCCCCEEEEEEE
Confidence 321 0113678999999999999999999999999999999999999999
Q ss_pred EeCCCCCCceEEEEEEeCCCCCceEEecccCCCceEEEEEEeccCCCCCCCCEEEEEEEEEECCCCCeeeEEEEEEEEEe
Q psy8629 176 KDRDLGIYNTQRYNIVSGNTNNAFRLSSHREKDGVLYLDLQINGVLDRETIPEYSLVIEALDGGSPPLRGSMIVNVTIQD 255 (483)
Q Consensus 176 ~D~D~~~~~~~~y~l~~~~~~~~F~i~~~~~~~g~~~~~l~~~~~LD~E~~~~~~l~v~a~D~g~p~~~~~~~v~I~V~d 255 (483)
+|+|.+.++.++|+|.++.....|.|++ .+|. |.+.+.||||....|.|.|.|+|.+.|.+++++.++|.|.|
T Consensus 127 ~D~D~~~~~~~~y~l~~~~~~~~f~i~~---~~G~----i~~~~~ld~e~~~~~~l~v~a~D~~~~~~~~~~~i~i~v~d 199 (199)
T cd00031 127 TDADSGENAKLTYSILSGNDKELFSIDP---NTGI----ITLAKPLDREEKSSYELTVVATDGGGPPLSSTATVTVTVLD 199 (199)
T ss_pred EcCCCCCCccEEEEEeCCCCCCEEEEeC---CceE----EEeCCccCCccCceEEEEEEEEECCCCCceeEEEEEEEEEC
Confidence 9999988899999999876557999998 4566 77788999999999999999999988889999999999875
No 7
>PF00028 Cadherin: Cadherin domain; InterPro: IPR002126 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion.; GO: 0005509 calcium ion binding, 0007156 homophilic cell adhesion, 0016020 membrane; PDB: 2A4E_A 2A4C_B 2O72_A 2QVI_A 1NCJ_A 3Q2W_A 3Q2N_A 3LNH_B 3LNI_A 3Q2L_A ....
Probab=99.70 E-value=3.5e-16 Score=125.80 Aligned_cols=89 Identities=46% Similarity=0.583 Sum_probs=84.0
Q ss_pred eeeccCCCCeEEEEEEEEeCCCCCCceEEEEEeeCCCCCCCcEEEeCCCcEEEEcccCCCCCCCeeEEEEEEEEC-CcCC
Q psy8629 348 SFDEIAEPGTSVFQISAFDQDEGINSVITYSLRENTDDHPSWFQIDSKSGLITMKSNVDCETDPVPKLTVIATDN-GVPA 426 (483)
Q Consensus 348 ~v~E~~~~gt~v~~v~a~D~D~~~~~~v~y~l~~~~~~~~~~F~id~~tG~i~~~~~lD~E~~~~y~l~V~a~D~-g~p~ 426 (483)
.+.|++++|+.++++.|.|+|.+.|+.+.|+|.. ++...+|.|++.+|.|++.+.||||..+.|.|.|.|+|. |.|+
T Consensus 4 ~v~E~~~~g~~v~~v~a~D~D~~~n~~i~y~i~~--~~~~~~F~I~~~tg~i~~~~~LD~E~~~~y~l~v~a~D~~~~~~ 81 (93)
T PF00028_consen 4 SVPENAPPGTVVGQVTATDPDSGPNSQITYSILG--GNPDGLFSIDPNTGEISLKKPLDRETQSSYQLTVRATDSGGSPP 81 (93)
T ss_dssp EEETTGSTSSEEEEEEEEESSTSTTSSEEEEEEE--TTSTTSEEEETTTTEEEESSSSCTTTTSEEEEEEEEEETTTSSE
T ss_pred EEECCCCCCCEEEEEEEEeCCCCCCceEEEEEec--CcccCceEEeeeeeccccceecCcccCCEEEEEEEEEECCCCCC
Confidence 5789999999999999999999999999999994 445899999999999999999999999999999999999 8999
Q ss_pred ceEEEEEEEEEE
Q psy8629 427 LSSSATVLVTIH 438 (483)
Q Consensus 427 ~s~~~~v~I~V~ 438 (483)
++++++|.|+|+
T Consensus 82 ~~~~~~V~I~V~ 93 (93)
T PF00028_consen 82 LSSTATVTINVL 93 (93)
T ss_dssp EEEEEEEEEEEE
T ss_pred CEEEEEEEEEEC
Confidence 999999999985
No 8
>PF00028 Cadherin: Cadherin domain; InterPro: IPR002126 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion.; GO: 0005509 calcium ion binding, 0007156 homophilic cell adhesion, 0016020 membrane; PDB: 2A4E_A 2A4C_B 2O72_A 2QVI_A 1NCJ_A 3Q2W_A 3Q2N_A 3LNH_B 3LNI_A 3Q2L_A ....
Probab=99.70 E-value=5.9e-16 Score=124.51 Aligned_cols=92 Identities=33% Similarity=0.552 Sum_probs=86.5
Q ss_pred eEEEEeCCCCCCeEEEEEEEEeCCCCCCceEEEEEEeCCCCCceEEecccCCCceEEEEEEeccCCCCCCCCEEEEEEEE
Q psy8629 156 MSIEFPENTPRDVKRTLNAAKDRDLGIYNTQRYNIVSGNTNNAFRLSSHREKDGVLYLDLQINGVLDRETIPEYSLVIEA 235 (483)
Q Consensus 156 ~~~~V~E~~~~gt~v~~v~a~D~D~~~~~~~~y~l~~~~~~~~F~i~~~~~~~g~~~~~l~~~~~LD~E~~~~~~l~v~a 235 (483)
|.+.|+|++++|+.++++.|.|+|.+.|+.+.|+|.+++....|.|++. +|. |.+.++||||..+.|.|.|.|
T Consensus 1 Y~~~v~E~~~~g~~v~~v~a~D~D~~~n~~i~y~i~~~~~~~~F~I~~~---tg~----i~~~~~LD~E~~~~y~l~v~a 73 (93)
T PF00028_consen 1 YSFSVPENAPPGTVVGQVTATDPDSGPNSQITYSILGGNPDGLFSIDPN---TGE----ISLKKPLDRETQSSYQLTVRA 73 (93)
T ss_dssp EEEEEETTGSTSSEEEEEEEEESSTSTTSSEEEEEEETTSTTSEEEETT---TTE----EEESSSSCTTTTSEEEEEEEE
T ss_pred CEEEEECCCCCCCEEEEEEEEeCCCCCCceEEEEEecCcccCceEEeee---eec----cccceecCcccCCEEEEEEEE
Confidence 7899999999999999999999999999999999999988899999995 455 888999999999999999999
Q ss_pred EEC-CCCCeeeEEEEEEEEE
Q psy8629 236 LDG-GSPPLRGSMIVNVTIQ 254 (483)
Q Consensus 236 ~D~-g~p~~~~~~~v~I~V~ 254 (483)
+|. |.|++++++.|+|+|+
T Consensus 74 ~D~~~~~~~~~~~~V~I~V~ 93 (93)
T PF00028_consen 74 TDSGGSPPLSSTATVTINVL 93 (93)
T ss_dssp EETTTSSEEEEEEEEEEEEE
T ss_pred EECCCCCCCEEEEEEEEEEC
Confidence 999 8999999999999985
No 9
>KOG1834|consensus
Probab=99.67 E-value=4.4e-15 Score=148.85 Aligned_cols=212 Identities=26% Similarity=0.300 Sum_probs=161.9
Q ss_pred EecCCCCCCccCCCceEEEEeCCCCCCeEEEEEEEEeCCCCC---CceEEEEEEeCCCCCceEEecccCCCceEEEEEEe
Q psy8629 141 VIDENDNAPTFPSPFMSIEFPENTPRDVKRTLNAAKDRDLGI---YNTQRYNIVSGNTNNAFRLSSHREKDGVLYLDLQI 217 (483)
Q Consensus 141 V~DvND~~P~f~~~~~~~~V~E~~~~gt~v~~v~a~D~D~~~---~~~~~y~l~~~~~~~~F~i~~~~~~~g~~~~~l~~ 217 (483)
.--+|-+.| +....|...|.||-..-....-+.|-|.|.+. .....|.|.+. .-+|.+......+|.-. |+.
T Consensus 23 aarankhkp-wie~ey~gvV~Endntvll~Ppl~aLdkdaplr~ageiC~fklhgq--~vPFdavVvdK~TGegv--lRa 97 (952)
T KOG1834|consen 23 AARANKHKP-WIEEEYHGVVTENDNTVLLDPPLAALDKDAPLRYAGEICGFKLHGQ--PVPFDAVVVDKYTGEGV--LRA 97 (952)
T ss_pred cccccccCc-ccccceeEEEEeCCceEEeCCCeeeecCCCCcccccccceeEecCC--CCCceEEEEeccCCceE--Eee
Confidence 345677777 55677999999997644444457888999742 12346777653 44777655544555433 778
Q ss_pred ccCCCCCCCCEEEEEEEEEECCC-C-----CeeeEEEEEEEEEecCCCCCeecCCceEEEecCCCCCCcEEEEEEEEeCC
Q psy8629 218 NGVLDRETIPEYSLVIEALDGGS-P-----PLRGSMIVNVTIQDVNDNQPIFNQSRYFATVPENATVGTSIFQVFATDAD 291 (483)
Q Consensus 218 ~~~LD~E~~~~~~l~v~a~D~g~-p-----~~~~~~~v~I~V~dvNd~~P~f~~~~~~~~v~E~~~~gt~v~~v~a~D~D 291 (483)
..+||.|.++.|+|+|+|.|.|. | ..+..++|.|+|.|+|+.+|+|..+.|.+.|.|. ..-..|++|.|.|.|
T Consensus 98 K~~lDCelqkeytf~iQAydCg~gpdgtn~kKShkatvhIrVkDvNe~AP~f~ep~Yka~V~EG-K~yd~il~veAiD~D 176 (952)
T KOG1834|consen 98 KEPLDCELQKEYTFTIQAYDCGNGPDGTNTKKSHKATVHIRVKDVNEFAPVFKEPWYKAHVTEG-KVYDSILRVEAIDKD 176 (952)
T ss_pred cCcccccccccceEEEEEEecCCCCCccccccccceEEEEEeccccccCchhcccceeeEEecc-eeeeeeEEEEeecCC
Confidence 89999999999999999999664 3 3567789999999999999999999999999998 567788999999999
Q ss_pred CCCccEEEEEEEecCCCCCcceEEeCCccEEEEcccCCCcccceeeeEeEEEEEeeeeeccCCCCeEEEEEEEEeCCCCC
Q psy8629 292 AGSNGEVEYSINRRQSDKDGLFNIDSKSGLIFVNKALDFETKEVHDLVPRFAVEISSFDEIAEPGTSVFQISAFDQDEGI 371 (483)
Q Consensus 292 ~~~~~~i~y~i~~~~~~~~~~F~Id~~tG~l~~~~~LD~E~~~~~~l~v~~~~~v~~v~E~~~~gt~v~~v~a~D~D~~~ 371 (483)
.++ +
T Consensus 177 Csp----------------------------------------------------------------------------q 180 (952)
T KOG1834|consen 177 CSP----------------------------------------------------------------------------Q 180 (952)
T ss_pred CCC----------------------------------------------------------------------------c
Confidence 754 1
Q ss_pred CceE-EEEEeeCCCCCCCcEEEeCCCcEEEEcccCCCCCCCeeEEEEEEEECCcCCceEEEEEEEEEEe
Q psy8629 372 NSVI-TYSLRENTDDHPSWFQIDSKSGLITMKSNVDCETDPVPKLTVIATDNGVPALSSSATVLVTIHD 439 (483)
Q Consensus 372 ~~~v-~y~l~~~~~~~~~~F~id~~tG~i~~~~~lD~E~~~~y~l~V~a~D~g~p~~s~~~~v~I~V~d 439 (483)
+++| .|.|. +..-.|.||. .|.|+...+|+|.+...|.|+|+|.|.|..+..+.+.|+|.|..
T Consensus 181 ~sqIC~YEI~----t~d~PFaIdn-~G~irnTekLny~ke~~Y~ltVtAyDCg~kraa~d~lV~v~Vkp 244 (952)
T KOG1834|consen 181 YSQICEYEIT----TPDVPFAIDN-DGNIRNTEKLNYTKEHQYKLTVTAYDCGKKRAASDSLVTVHVKP 244 (952)
T ss_pred ccceeEEEec----CCCCceEEcC-CCccccccccccccceeEEEEEEEEecccccccCcceEEEEecC
Confidence 2222 35555 2456799985 59999999999999999999999999998655555777887764
No 10
>smart00112 CA Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
Probab=99.65 E-value=1.1e-15 Score=118.86 Aligned_cols=79 Identities=48% Similarity=0.641 Sum_probs=73.0
Q ss_pred EeCCCCCCceEEEEEeeCCCCCCCcEEEeCCCcEEEEcccCCCCCCCeeEEEEEEEECCcCCceEEEEEEEEEEecCCCC
Q psy8629 365 FDQDEGINSVITYSLRENTDDHPSWFQIDSKSGLITMKSNVDCETDPVPKLTVIATDNGVPALSSSATVLVTIHDVNDNE 444 (483)
Q Consensus 365 ~D~D~~~~~~v~y~l~~~~~~~~~~F~id~~tG~i~~~~~lD~E~~~~y~l~V~a~D~g~p~~s~~~~v~I~V~dvNd~~ 444 (483)
+|+|.|.|+.++|+|... +...+|.|++.+|.|.+.++||||....|.|.|+|+|.|.|+++++++|.|.|.|+|||+
T Consensus 1 ~D~D~g~n~~i~Y~i~~~--~~~~~F~i~~~tg~i~~~~~LD~e~~~~y~l~v~a~D~~~~~~~~~~~v~I~V~D~Nd~~ 78 (79)
T smart00112 1 TDADSGENGKVTYSILSG--NEDGLFSIDPETGEITTTKPLDREEQPEYTLTVEATDGGGPPLSSTATVTVTVLDVNDNA 78 (79)
T ss_pred CCCCCCcCcEEEEEEecC--CCCCEEEEeCCccEEEeCCccCeeCCCeEEEEEEEEECCCCCcccEEEEEEEEEECCCCC
Confidence 488999999999999943 333899999999999999999999999999999999999999999999999999999999
Q ss_pred C
Q psy8629 445 P 445 (483)
Q Consensus 445 P 445 (483)
|
T Consensus 79 P 79 (79)
T smart00112 79 P 79 (79)
T ss_pred C
Confidence 8
No 11
>smart00112 CA Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
Probab=99.57 E-value=2.1e-14 Score=111.66 Aligned_cols=79 Identities=44% Similarity=0.743 Sum_probs=71.7
Q ss_pred EeCCCCCCceEEEEEEeCCCCCceEEecccCCCceEEEEEEeccCCCCCCCCEEEEEEEEEECCCCCeeeEEEEEEEEEe
Q psy8629 176 KDRDLGIYNTQRYNIVSGNTNNAFRLSSHREKDGVLYLDLQINGVLDRETIPEYSLVIEALDGGSPPLRGSMIVNVTIQD 255 (483)
Q Consensus 176 ~D~D~~~~~~~~y~l~~~~~~~~F~i~~~~~~~g~~~~~l~~~~~LD~E~~~~~~l~v~a~D~g~p~~~~~~~v~I~V~d 255 (483)
+|+|.|.|+.+.|+|.+++...+|.|++.+ |. +.+.++||||..+.|.|.|+|+|.|.|++++.+.|+|+|.|
T Consensus 1 ~D~D~g~n~~i~Y~i~~~~~~~~F~i~~~t---g~----i~~~~~LD~e~~~~y~l~v~a~D~~~~~~~~~~~v~I~V~D 73 (79)
T smart00112 1 TDADSGENGKVTYSILSGNEDGLFSIDPET---GE----ITTTKPLDREEQPEYTLTVEATDGGGPPLSSTATVTVTVLD 73 (79)
T ss_pred CCCCCCcCcEEEEEEecCCCCCEEEEeCCc---cE----EEeCCccCeeCCCeEEEEEEEEECCCCCcccEEEEEEEEEE
Confidence 488999999999999987755899999854 54 67788999999999999999999999999999999999999
Q ss_pred cCCCCC
Q psy8629 256 VNDNQP 261 (483)
Q Consensus 256 vNd~~P 261 (483)
+|||+|
T Consensus 74 ~Nd~~P 79 (79)
T smart00112 74 VNDNAP 79 (79)
T ss_pred CCCCCC
Confidence 999998
No 12
>KOG1834|consensus
Probab=99.52 E-value=8.3e-13 Score=132.80 Aligned_cols=204 Identities=20% Similarity=0.209 Sum_probs=150.6
Q ss_pred hcCceEEEEEeCCCCCCeEEEEEEccCCCC------CCCCEEEEecCCCCCcccEEEeCCcc--EEEeCccCCccCCcee
Q psy8629 15 ADEHLRDLEISEGVPVGTRVGFIGDGFVGD------SGPPYLIVPVPGSAVDTDLSIEQNTG--EIRTKVPLDRETRSSY 86 (483)
Q Consensus 15 ~~~~~~~~~v~En~~~gt~V~~v~a~d~d~------~~~~y~i~~~~~~~~~~~F~Id~~tG--~i~~~~~LD~E~~~~y 86 (483)
+...+|..-|.||--.=...--+.|.|.|. .--.|.|-.. +-+-+.--+|+.|| .|+.+.+||-|.++.|
T Consensus 32 wie~ey~gvV~Endntvll~Ppl~aLdkdaplr~ageiC~fklhgq--~vPFdavVvdK~TGegvlRaK~~lDCelqkey 109 (952)
T KOG1834|consen 32 WIEEEYHGVVTENDNTVLLDPPLAALDKDAPLRYAGEICGFKLHGQ--PVPFDAVVVDKYTGEGVLRAKEPLDCELQKEY 109 (952)
T ss_pred ccccceeEEEEeCCceEEeCCCeeeecCCCCcccccccceeEecCC--CCCceEEEEeccCCceEEeecCcccccccccc
Confidence 556789999999975322223466666542 1124666532 11111233466664 7888889999999999
Q ss_pred eEEEEecCCcceeEEEeecCcEEEEcccCCccccceEEEEEEeCCCCeEEEEEEEecCCCCCCccCCCceEEEEeCCCCC
Q psy8629 87 SLVAIPLSGENVRVVSFYEPGEIRTKVPLDRETRSSYSLVAIPLSGENVRVVVRVIDENDNAPTFPSPFMSIEFPENTPR 166 (483)
Q Consensus 87 ~l~v~a~d~~~~~~~~~~~~g~l~~~~~ld~e~~~~~~~~~~~~~~~~~~v~I~V~DvND~~P~f~~~~~~~~V~E~~~~ 166 (483)
+++|+|.|.++.- +| .....+..+++.|+|.|+|+.+|+|..+.|.+.|.|....
T Consensus 110 tf~iQAydCg~gp------dg-------------------tn~kKShkatvhIrVkDvNe~AP~f~ep~Yka~V~EGK~y 164 (952)
T KOG1834|consen 110 TFTIQAYDCGNGP------DG-------------------TNTKKSHKATVHIRVKDVNEFAPVFKEPWYKAHVTEGKVY 164 (952)
T ss_pred eEEEEEEecCCCC------Cc-------------------cccccccceEEEEEeccccccCchhcccceeeEEecceee
Confidence 9999998865310 00 0011233689999999999999999999999999998655
Q ss_pred CeEEEEEEEEeCCCCC-Cce-EEEEEEeCCCCCceEEecccCCCceEEEEEEeccCCCCCCCCEEEEEEEEEECCCCCee
Q psy8629 167 DVKRTLNAAKDRDLGI-YNT-QRYNIVSGNTNNAFRLSSHREKDGVLYLDLQINGVLDRETIPEYSLVIEALDGGSPPLR 244 (483)
Q Consensus 167 gt~v~~v~a~D~D~~~-~~~-~~y~l~~~~~~~~F~i~~~~~~~g~~~~~l~~~~~LD~E~~~~~~l~v~a~D~g~p~~~ 244 (483)
..|.++.|.|.|.++ ++. ..|.|.. +.-+|.|+. .|. |+...+|+|.+...|.|+|.|.|.|..+..
T Consensus 165 -d~il~veAiD~DCspq~sqIC~YEI~t--~d~PFaIdn----~G~----irnTekLny~ke~~Y~ltVtAyDCg~kraa 233 (952)
T KOG1834|consen 165 -DSILRVEAIDKDCSPQYSQICEYEITT--PDVPFAIDN----DGN----IRNTEKLNYTKEHQYKLTVTAYDCGKKRAA 233 (952)
T ss_pred -eeeEEEEeecCCCCCcccceeEEEecC--CCCceEEcC----CCc----cccccccccccceeEEEEEEEEeccccccc
Confidence 678899999999874 444 4899987 566999994 566 888999999999999999999999986666
Q ss_pred eEEEEEEEEEec
Q psy8629 245 GSMIVNVTIQDV 256 (483)
Q Consensus 245 ~~~~v~I~V~dv 256 (483)
+...|+|.|...
T Consensus 234 ~d~lV~v~Vkp~ 245 (952)
T KOG1834|consen 234 SDSLVTVHVKPT 245 (952)
T ss_pred CcceEEEEecCc
Confidence 668899998765
No 13
>PF08266 Cadherin_2: Cadherin-like; InterPro: IPR013164 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion. This entry represents a cadherin domain that is usually found at the N terminus of cadherin proteins.; PDB: 1WUZ_A 1WYJ_A.
Probab=98.35 E-value=1.4e-06 Score=67.57 Aligned_cols=62 Identities=24% Similarity=0.324 Sum_probs=43.3
Q ss_pred eEEEEEeCCCCCCeEEEEEEccCCC-----CCCCCEEEEecCCCCCcccEEEeCCccEEEeCccCCccCCc
Q psy8629 19 LRDLEISEGVPVGTRVGFIGDGFVG-----DSGPPYLIVPVPGSAVDTDLSIEQNTGEIRTKVPLDRETRS 84 (483)
Q Consensus 19 ~~~~~v~En~~~gt~V~~v~a~d~d-----~~~~~y~i~~~~~~~~~~~F~Id~~tG~i~~~~~LD~E~~~ 84 (483)
+.+++|+|+.++||+||.+ |.|.+ -....|.+.+.. ...+|.+++.||.|+++.++|||+.+
T Consensus 2 qi~YsV~EE~~~Gt~IGni-a~dL~l~~~~l~~~~~ri~s~~---~~~~~~v~~~tG~L~v~~rIDRE~LC 68 (84)
T PF08266_consen 2 QIRYSVPEEMPPGTVIGNI-AKDLGLDPQSLSSRNFRIVSEG---NSQYFRVNEKTGDLFVSERIDREELC 68 (84)
T ss_dssp EEEEEEESS--TT-EEEEC-CCCCT--HHHHCCTTBEEE-SS---SS-SEEE-TTTSEEEESS--SCCCC-
T ss_pred CeEEEeecCCCCCCEEEEh-HHhhCCCcccccccceEEeecC---CcceeEecCCceeEEeCCccCHHHHC
Confidence 6789999999999999999 55643 224578887642 35699999999999999999999976
No 14
>PF08758 Cadherin_pro: Cadherin prodomain like; InterPro: IPR014868 Cadherins are a group of proteins that mediate calcium dependent cell-cell adhesion. They are activated through cleavage of a prosequence in the late Golgi. This protein corresponds to the folded region of the prosequence, and is termed the prodomain. The prodomain shows structural resemblance to the cadherin domain, but lacks all the features known to be important for cadherin-cadherin interactions []. ; GO: 0007155 cell adhesion, 0016021 integral to membrane; PDB: 1OP4_A.
Probab=97.36 E-value=0.0016 Score=51.23 Aligned_cols=64 Identities=11% Similarity=0.164 Sum_probs=31.9
Q ss_pred HhhhcCceEEEEEeCCCCCCeEEEEEEccCCCCC-CCCEEEEecCCCCCcccEEEeCCccEEEeCccCCccCC
Q psy8629 12 AVAADEHLRDLEISEGVPVGTRVGFIGDGFVGDS-GPPYLIVPVPGSAVDTDLSIEQNTGEIRTKVPLDRETR 83 (483)
Q Consensus 12 ~~~~~~~~~~~~v~En~~~gt~V~~v~a~d~d~~-~~~y~i~~~~~~~~~~~F~Id~~tG~i~~~~~LD~E~~ 83 (483)
-++|.++.|.+.|+.+...|..||+|+-.|..++ .+.|...+ ..|+|.+ .|.|++++++.....
T Consensus 3 ~pGF~~~~~~~~Vp~~l~~g~~lg~V~f~dC~~~~~~~~~ssD-------pdF~V~~-DGsVy~~r~v~l~~~ 67 (90)
T PF08758_consen 3 RPGFSQKKYTFEVPSNLEAGQPLGKVNFEDCTGRRRVIFESSD-------PDFRVLE-DGSVYAKRPVQLSSE 67 (90)
T ss_dssp --B--S-EEEE----SS-SS--EEE---B--SS---EEEE----------SEEEEET-TTEEEEES--S-SSS
T ss_pred cCCcccceEEEEcCchhhCCcEEEEEEeccCCCCCceEEecCC-------CCEEEcC-CCeEEEeeeEecCCC
Confidence 4789999999999999999999999999996543 45666553 2799995 599999998876443
No 15
>smart00736 CADG Dystroglycan-type cadherin-like domains. Cadherin-homologous domains present in metazoan dystroglycans and alpha/epsilon sarcoglycans, yeast Axl2p and in a very large protein from magnetotactic bacteria. Likely to bind calcium ions.
Probab=96.58 E-value=0.054 Score=43.44 Aligned_cols=72 Identities=31% Similarity=0.412 Sum_probs=54.6
Q ss_pred EEeCCCCCCceEEEEEeeCCC-CCCCcEEEeCCCcEEEEcccCCCCCCCeeEEEEEEEECCcCCceEEEEEEEEEEecCC
Q psy8629 364 AFDQDEGINSVITYSLRENTD-DHPSWFQIDSKSGLITMKSNVDCETDPVPKLTVIATDNGVPALSSSATVLVTIHDVND 442 (483)
Q Consensus 364 a~D~D~~~~~~v~y~l~~~~~-~~~~~F~id~~tG~i~~~~~lD~E~~~~y~l~V~a~D~g~p~~s~~~~v~I~V~dvNd 442 (483)
..|+| +..++|++....+ ....|...|+.++.+.-. +.. +..+.|.++|+|+|+.. .++...++|.|.+.|+
T Consensus 24 F~d~d---~~~lty~~~~~~~~~lP~Wl~fd~~~~~~~Gt-P~~-~~~g~~~i~v~a~D~~g--~~~~~~f~i~V~~~~~ 96 (97)
T smart00736 24 FTDAD---GDTLTYSATLSDGSALPSWLSFDSDTGTLSGT-PTN-SDVGSLSLKVTATDSSG--ASASDTFTITVVNTND 96 (97)
T ss_pred eECCC---CCeEEEEEEeCCCCCCCCeEEEeCCCCEEEEE-CCC-CCCcEEEEEEEEEECCC--CEEEEEEEEEEeCCCC
Confidence 46776 4688999874322 456799999998887763 333 34577999999999864 5788899999999887
No 16
>PF08266 Cadherin_2: Cadherin-like; InterPro: IPR013164 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion. This entry represents a cadherin domain that is usually found at the N terminus of cadherin proteins.; PDB: 1WUZ_A 1WYJ_A.
Probab=96.51 E-value=0.0018 Score=50.35 Aligned_cols=60 Identities=23% Similarity=0.337 Sum_probs=37.7
Q ss_pred eeeccCCCCeEEEEEEEEeCCCCCC--ceEEEEEeeCCCCCCCcEEEeCCCcEEEEcccCCCCCC
Q psy8629 348 SFDEIAEPGTSVFQISAFDQDEGIN--SVITYSLRENTDDHPSWFQIDSKSGLITMKSNVDCETD 410 (483)
Q Consensus 348 ~v~E~~~~gt~v~~v~a~D~D~~~~--~~v~y~l~~~~~~~~~~F~id~~tG~i~~~~~lD~E~~ 410 (483)
+|+|..++|+.||.| |.|--.... ....|++.. .....+|.+++.+|.|.+...+|||.-
T Consensus 6 sV~EE~~~Gt~IGni-a~dL~l~~~~l~~~~~ri~s--~~~~~~~~v~~~tG~L~v~~rIDRE~L 67 (84)
T PF08266_consen 6 SVPEEMPPGTVIGNI-AKDLGLDPQSLSSRNFRIVS--EGNSQYFRVNEKTGDLFVSERIDREEL 67 (84)
T ss_dssp EEESS--TT-EEEEC-CCCCT--HHHHCCTTBEEE---SSSS-SEEE-TTTSEEEESS--SCCCC
T ss_pred EeecCCCCCCEEEEh-HHhhCCCcccccccceEEee--cCCcceeEecCCceeEEeCCccCHHHH
Confidence 678999999999999 444432111 123577663 346789999999999999999999973
No 17
>TIGR01965 VCBS_repeat VCBS repeat. This domain of about 100 residues is found multiple (up to 35) copies in long proteins from several species of Vibrio, Colwellia, Bradyrhizobium, and Shewanella (hence the name VCBS) and in smaller copy numbers in proteins from several other bacteria. The large protein size and repeat copy numbers, species distribution, and suggested activities of several member proteins suggests a role for this domain in adhesion.
Probab=96.31 E-value=0.038 Score=44.12 Aligned_cols=92 Identities=18% Similarity=0.234 Sum_probs=58.1
Q ss_pred EEEEEEeCCCCCCceEEEEEEe-CCCCCceEEecccCCCceEEEEEE----eccCCCCCCCCEEEEEEEEEECCCCCeee
Q psy8629 171 TLNAAKDRDLGIYNTQRYNIVS-GNTNNAFRLSSHREKDGVLYLDLQ----INGVLDRETIPEYSLVIEALDGGSPPLRG 245 (483)
Q Consensus 171 ~~v~a~D~D~~~~~~~~y~l~~-~~~~~~F~i~~~~~~~g~~~~~l~----~~~~LD~E~~~~~~l~v~a~D~g~p~~~~ 245 (483)
+++.++|+|.+.. ..+++.. ...-+.|.|+. +|...+.+- ..+.|.--+.-.-.|++++.|+.
T Consensus 2 G~Lt~sD~D~gd~--~~~s~~~~~g~yGtlti~~----~G~wtYtl~n~~~avq~L~~Ge~~tdsFtvtv~DGt------ 69 (99)
T TIGR01965 2 GQLTISDADAGQA--HFIAQTDAAGQYGTFSIDA----DGQWTYQADNSQTAVQALKAGETLTDTFTVTSADGT------ 69 (99)
T ss_pred CceEEeCCCCCCc--eEEecccccCCcEEEEECC----CCcEEEEeCCCcHHHHhhcCCCEEEEEEEEEEeCCC------
Confidence 4688999998753 4555532 12345688875 465554332 12334433444677888889962
Q ss_pred EEEEEEEEEecCCCCCeecCCceEEEecCCC
Q psy8629 246 SMIVNVTIQDVNDNQPIFNQSRYFATVPENA 276 (483)
Q Consensus 246 ~~~v~I~V~dvNd~~P~f~~~~~~~~v~E~~ 276 (483)
+++|+|+|...|| +|+.... -...+.|+.
T Consensus 70 t~~vtItI~GtND-apvi~~~-~~g~v~ED~ 98 (99)
T TIGR01965 70 SQTVTITITGAND-AAVIGGA-DTGSVTEDS 98 (99)
T ss_pred eEEEEEEEEccCC-CCEEecc-cceeEecCC
Confidence 7889999999999 7766433 236777763
No 18
>TIGR01965 VCBS_repeat VCBS repeat. This domain of about 100 residues is found multiple (up to 35) copies in long proteins from several species of Vibrio, Colwellia, Bradyrhizobium, and Shewanella (hence the name VCBS) and in smaller copy numbers in proteins from several other bacteria. The large protein size and repeat copy numbers, species distribution, and suggested activities of several member proteins suggests a role for this domain in adhesion.
Probab=96.08 E-value=0.079 Score=42.32 Aligned_cols=89 Identities=22% Similarity=0.281 Sum_probs=57.5
Q ss_pred EEEEEEeCCCCCCceEEEEEeeCCCCCCCcEEEeCCCcEEE-Ec-------ccCCCCCCCeeEEEEEEEECCcCCceEEE
Q psy8629 360 FQISAFDQDEGINSVITYSLRENTDDHPSWFQIDSKSGLIT-MK-------SNVDCETDPVPKLTVIATDNGVPALSSSA 431 (483)
Q Consensus 360 ~~v~a~D~D~~~~~~v~y~l~~~~~~~~~~F~id~~tG~i~-~~-------~~lD~E~~~~y~l~V~a~D~g~p~~s~~~ 431 (483)
++|.++|+|.+ ....+++... ...-+.|.|++ .|.-. .. +.|.-.+...-.++|.++|+ .+.
T Consensus 2 G~Lt~sD~D~g--d~~~~s~~~~-~g~yGtlti~~-~G~wtYtl~n~~~avq~L~~Ge~~tdsFtvtv~DG------tt~ 71 (99)
T TIGR01965 2 GQLTISDADAG--QAHFIAQTDA-AGQYGTFSIDA-DGQWTYQADNSQTAVQALKAGETLTDTFTVTSADG------TSQ 71 (99)
T ss_pred CceEEeCCCCC--CceEEecccc-cCCcEEEEECC-CCcEEEEeCCCcHHHHhhcCCCEEEEEEEEEEeCC------CeE
Confidence 56899999965 4456666422 23456799987 45322 11 22333334467888999995 278
Q ss_pred EEEEEEEecCCCCCeecCCceEEEEcCCC
Q psy8629 432 TVLVTIHDVNDNEPIFDQSFYNVTIPENK 460 (483)
Q Consensus 432 ~v~I~V~dvNd~~P~f~~~~~~~~V~E~~ 460 (483)
+|+|+|...|| +|+..... ...|.|+.
T Consensus 72 ~vtItI~GtND-apvi~~~~-~g~v~ED~ 98 (99)
T TIGR01965 72 TVTITITGAND-AAVIGGAD-TGSVTEDS 98 (99)
T ss_pred EEEEEEEccCC-CCEEeccc-ceeEecCC
Confidence 89999999999 66655432 56777764
No 19
>PF08758 Cadherin_pro: Cadherin prodomain like; InterPro: IPR014868 Cadherins are a group of proteins that mediate calcium dependent cell-cell adhesion. They are activated through cleavage of a prosequence in the late Golgi. This protein corresponds to the folded region of the prosequence, and is termed the prodomain. The prodomain shows structural resemblance to the cadherin domain, but lacks all the features known to be important for cadherin-cadherin interactions []. ; GO: 0007155 cell adhesion, 0016021 integral to membrane; PDB: 1OP4_A.
Probab=95.95 E-value=0.1 Score=41.09 Aligned_cols=79 Identities=16% Similarity=0.195 Sum_probs=40.8
Q ss_pred CCccCCCceEEEEeCCCCCCeEEEEEEEEeCCCCCCceEEEEEEeCCCCCceEEecccCCCceEEEEEEeccCCCCCCCC
Q psy8629 148 APTFPSPFMSIEFPENTPRDVKRTLNAAKDRDLGIYNTQRYNIVSGNTNNAFRLSSHREKDGVLYLDLQINGVLDRETIP 227 (483)
Q Consensus 148 ~P~f~~~~~~~~V~E~~~~gt~v~~v~a~D~D~~~~~~~~y~l~~~~~~~~F~i~~~~~~~g~~~~~l~~~~~LD~E~~~ 227 (483)
.|-|....|.+.|+.+...|..+++|.-.|-... ..+.|... + ..|.|. .+|. |.+.+++.....
T Consensus 3 ~pGF~~~~~~~~Vp~~l~~g~~lg~V~f~dC~~~--~~~~~~ss--D--pdF~V~----~DGs----Vy~~r~v~l~~~- 67 (90)
T PF08758_consen 3 RPGFSQKKYTFEVPSNLEAGQPLGKVNFEDCTGR--RRVIFESS--D--PDFRVL----EDGS----VYAKRPVQLSSE- 67 (90)
T ss_dssp --B--S-EEEE----SS-SS--EEE---B--SS-----EEEE-------SEEEEE----TTTE----EEEES--S-SSS-
T ss_pred cCCcccceEEEEcCchhhCCcEEEEEEeccCCCC--CceEEecC--C--CCEEEc----CCCe----EEEeeeEecCCC-
Confidence 5889999999999999999999999999888533 35667653 2 289999 5687 566777766443
Q ss_pred EEEEEEEEEECCCC
Q psy8629 228 EYSLVIEALDGGSP 241 (483)
Q Consensus 228 ~~~l~v~a~D~g~p 241 (483)
.-.|+|.|.|..+.
T Consensus 68 ~~~F~V~a~D~~~~ 81 (90)
T PF08758_consen 68 QRSFTVHAWDSQTQ 81 (90)
T ss_dssp -EEEEEEEEETTTT
T ss_pred ceEEEEEEECCCCC
Confidence 35799999997763
No 20
>smart00736 CADG Dystroglycan-type cadherin-like domains. Cadherin-homologous domains present in metazoan dystroglycans and alpha/epsilon sarcoglycans, yeast Axl2p and in a very large protein from magnetotactic bacteria. Likely to bind calcium ions.
Probab=95.27 E-value=0.29 Score=39.12 Aligned_cols=70 Identities=23% Similarity=0.328 Sum_probs=48.4
Q ss_pred EEeCCCCCCceEEEEEEeCC---CCCceEEecccCCCceEEEEEEeccCCCCCCCCEEEEEEEEEECCCCCeeeEEEEEE
Q psy8629 175 AKDRDLGIYNTQRYNIVSGN---TNNAFRLSSHREKDGVLYLDLQINGVLDRETIPEYSLVIEALDGGSPPLRGSMIVNV 251 (483)
Q Consensus 175 a~D~D~~~~~~~~y~l~~~~---~~~~F~i~~~~~~~g~~~~~l~~~~~LD~E~~~~~~l~v~a~D~g~p~~~~~~~v~I 251 (483)
..|+| + ..++|++...+ -..+.++++. ++.++ ..+...+ ...|.+.|.|+|+.+ .++...++|
T Consensus 24 F~d~d-~--~~lty~~~~~~~~~lP~Wl~fd~~---~~~~~-----GtP~~~~-~g~~~i~v~a~D~~g--~~~~~~f~i 89 (97)
T smart00736 24 FTDAD-G--DTLTYSATLSDGSALPSWLSFDSD---TGTLS-----GTPTNSD-VGSLSLKVTATDSSG--ASASDTFTI 89 (97)
T ss_pred eECCC-C--CeEEEEEEeCCCCCCCCeEEEeCC---CCEEE-----EECCCCC-CcEEEEEEEEEECCC--CEEEEEEEE
Confidence 46777 2 37889886432 2457788874 34432 2344333 467999999999876 678889999
Q ss_pred EEEecCC
Q psy8629 252 TIQDVND 258 (483)
Q Consensus 252 ~V~dvNd 258 (483)
.|.+.|+
T Consensus 90 ~V~~~~~ 96 (97)
T smart00736 90 TVVNTND 96 (97)
T ss_pred EEeCCCC
Confidence 9999886
No 21
>PF13750 Big_3_3: Bacterial Ig-like domain (group 3)
Probab=94.29 E-value=3.5 Score=36.25 Aligned_cols=82 Identities=16% Similarity=0.213 Sum_probs=47.5
Q ss_pred EEEEEEeCCCCCCceEEEEEEeCCCCCceEEecccCCCceEEEEEEeccCC-CCCCCCEEEEEEEEEECCCCCeeeEEEE
Q psy8629 171 TLNAAKDRDLGIYNTQRYNIVSGNTNNAFRLSSHREKDGVLYLDLQINGVL-DRETIPEYSLVIEALDGGSPPLRGSMIV 249 (483)
Q Consensus 171 ~~v~a~D~D~~~~~~~~y~l~~~~~~~~F~i~~~~~~~g~~~~~l~~~~~L-D~E~~~~~~l~v~a~D~g~p~~~~~~~v 249 (483)
..+.++|.-.+. ...+.+|.+|.....-.+.......|... +...+.+ ..|....|+|+|.|.|..+ ..++..+
T Consensus 69 i~i~~tD~~~~~-~i~sv~l~Gg~~~d~v~ls~~~~~~~~~~--~~yp~~fpsle~~~~YtLtV~a~D~aG--N~~~~si 143 (158)
T PF13750_consen 69 ISINVTDNSDDS-KITSVSLTGGPASDSVSLSWTNKGNGVYT--LEYPRIFPSLEADDSYTLTVSATDKAG--NQSTKSI 143 (158)
T ss_pred eEEEEEeCCCCc-eEEEEEEECCcccceEEEeeEeccCceEE--eecccccCCcCCCCeEEEEEEEEecCC--CEEEEEE
Confidence 456777766543 34567776654343333332222334422 3222222 3377889999999999766 5777777
Q ss_pred EEEEEecC
Q psy8629 250 NVTIQDVN 257 (483)
Q Consensus 250 ~I~V~dvN 257 (483)
.....+.|
T Consensus 144 ~F~y~P~~ 151 (158)
T PF13750_consen 144 SFSYMPPN 151 (158)
T ss_pred EEEEeCCe
Confidence 77766443
No 22
>KOG3597|consensus
Probab=93.79 E-value=6.1 Score=40.54 Aligned_cols=154 Identities=18% Similarity=0.181 Sum_probs=88.2
Q ss_pred CeEEEEEEEecCCCCCCccCCCceEEEEeCCCCCCeEEEEEEEEeCCCCCCceEEEEEEeCC--------------CCCc
Q psy8629 133 ENVRVVVRVIDENDNAPTFPSPFMSIEFPENTPRDVKRTLNAAKDRDLGIYNTQRYNIVSGN--------------TNNA 198 (483)
Q Consensus 133 ~~~~v~I~V~DvND~~P~f~~~~~~~~V~E~~~~gt~v~~v~a~D~D~~~~~~~~y~l~~~~--------------~~~~ 198 (483)
.+....|.|.-+||.+..+....+.+.+.|+...-.....+.+.|+|.+. ..+.|++.+.. +.+.
T Consensus 24 ~~~~~~i~v~pvndpp~~~~~~~~~l~~~~~~~k~l~~~~l~~~d~d~~~-~~l~f~v~~t~~~~~~~~~~~~~g~~~~~ 102 (442)
T KOG3597|consen 24 QTDVLRIHVNPVNDPPSLIFPSGSLLVILEGGQKVLDPELLTAADPDSAP-LPLEFQVLGTSSVPLPVLKFDVPGAPATE 102 (442)
T ss_pred EEeeecccccccCCCcceeecccceEEeecCCceeccceEeeccCCCCCc-cceEEEEccCCCCCCccceeeccCCcccc
Confidence 36678899999999888777777778999997665566678899999765 35678876521 1122
Q ss_pred eEEecccCCCceEEEEEEeccCCCCC--CCCEEEEEEEEEECCCCCeeeEEEEEEEEEecCCCCCeecCCc-eEEEecCC
Q psy8629 199 FRLSSHREKDGVLYLDLQINGVLDRE--TIPEYSLVIEALDGGSPPLRGSMIVNVTIQDVNDNQPIFNQSR-YFATVPEN 275 (483)
Q Consensus 199 F~i~~~~~~~g~~~~~l~~~~~LD~E--~~~~~~l~v~a~D~g~p~~~~~~~v~I~V~dvNd~~P~f~~~~-~~~~v~E~ 275 (483)
|+-... ..|.+. +++. +.....+...++|+-. .+. . .| .. .-..+|.+.... ....|.-+
T Consensus 103 Fs~~~v--~~g~~~--------yvh~g~el~~~~~~~~~SDg~~---~S~-~-~i-~~-~~~~~~~~~~~~~~gL~v~~g 165 (442)
T KOG3597|consen 103 FSYEEV--EDGSLS--------YVHSGTELRESELQLRVSDGLL---VSE-R-AI-LK-VEATGPAPHLARNTGLKVLQG 165 (442)
T ss_pred eEehHh--hcCcee--------EEecCcccccceEEEEeecceE---eee-e-EE-ec-ccCCCcceeeecccceEEccC
Confidence 332221 122221 1111 2457788899999763 222 1 11 11 112234332221 11222222
Q ss_pred CCCCcEEEEEEEEeCCCCCccEEEEEEEe
Q psy8629 276 ATVGTSIFQVFATDADAGSNGEVEYSINR 304 (483)
Q Consensus 276 ~~~gt~v~~v~a~D~D~~~~~~i~y~i~~ 304 (483)
+..--.-.-+.+.|.|.++...+.|.|..
T Consensus 166 S~~~IT~~~L~ved~d~~~d~~v~~~i~~ 194 (442)
T KOG3597|consen 166 STAPITPSNLSVEDNDSSPDDEVRYDITP 194 (442)
T ss_pred ccccccHhHceeecCCCCCCcEEEEEecC
Confidence 21111113488889987667789999964
No 23
>TIGR00845 caca sodium/calcium exchanger 1. This model is specific for the eukaryotic sodium ion/calcium ion exchangers of the Caca family
Probab=91.94 E-value=15 Score=41.29 Aligned_cols=154 Identities=19% Similarity=0.138 Sum_probs=80.8
Q ss_pred CCCCCCccCCCceEEEEeCCCCCCeEEEEEEEEeCCCCCCceEEEEEEeCCCCC--ceEE-----ecccCCCceEEEEEE
Q psy8629 144 ENDNAPTFPSPFMSIEFPENTPRDVKRTLNAAKDRDLGIYNTQRYNIVSGNTNN--AFRL-----SSHREKDGVLYLDLQ 216 (483)
Q Consensus 144 vND~~P~f~~~~~~~~V~E~~~~gt~v~~v~a~D~D~~~~~~~~y~l~~~~~~~--~F~i-----~~~~~~~g~~~~~l~ 216 (483)
.||.++.|....-..+|.|+. |+.-..|.-...|.+..-.+.|+..+|.... -|.- .-..|++- -++.+.
T Consensus 395 ~dd~~s~i~Fe~~~Y~V~En~--GtV~VtV~R~GGdl~~tVsVdY~T~DGTA~AG~DY~~~sGTLtF~PGEt~-KtItV~ 471 (928)
T TIGR00845 395 ENDPVSKIFFEPGHYTCLENC--GTVALTVVRRGGDLTNTVYVDYRTEDGTANAGSDYEFTEGTLVFKPGETQ-KEFRIG 471 (928)
T ss_pred ccCCcceEEecCCeEEEeecC--cEEEEEEEEccCCCCceEEEEEEccCCccCCCCCccccCceEEECCCceE-EEEEEE
Confidence 566666666666678999986 4555555444435544456889887764321 1221 11111111 122243
Q ss_pred eccCCCCCCCCEEEEEEEEEECC-------------CCCeeeEEEEEEEEEecCCCCCeecCCceEEEecCCCCCCcEEE
Q psy8629 217 INGVLDRETIPEYSLVIEALDGG-------------SPPLRGSMIVNVTIQDVNDNQPIFNQSRYFATVPENATVGTSIF 283 (483)
Q Consensus 217 ~~~~LD~E~~~~~~l~v~a~D~g-------------~p~~~~~~~v~I~V~dvNd~~P~f~~~~~~~~v~E~~~~gt~v~ 283 (483)
+...=-+|....|.+.+.--..+ ...+......+|+|.| ||++|.|.-..-...|.|+. |+.-.
T Consensus 472 IIDDdi~E~DE~F~V~LSNp~~g~~~G~~~~~~~~~~A~Lg~ps~ATVTIlD-DD~aGIfsFe~~~~sV~Es~--G~vtv 548 (928)
T TIGR00845 472 IIDDDIFEEDEHFYVRLSNLRVGSEDGILEANHVSAVAQLASPNTATVTILD-DDHAGIFTFEEDVFHVSESI--GIMEV 548 (928)
T ss_pred EccCCCCCCCceEEEEEeCCCCCCcccccccccccccceecCCceEEEEEec-CcccCcccccCceEEEEcCC--CEEEE
Confidence 33433466667777766543222 0112233456677777 78899877655567889974 55544
Q ss_pred EEEEEeCCCCCccEEEEEEEe
Q psy8629 284 QVFATDADAGSNGEVEYSINR 304 (483)
Q Consensus 284 ~v~a~D~D~~~~~~i~y~i~~ 304 (483)
+|.-+-.-.| .-.+.|...+
T Consensus 549 tV~RtsGa~G-~VtV~Y~T~d 568 (928)
T TIGR00845 549 KVLRTSGARG-TVIVPYRTVE 568 (928)
T ss_pred EEEEcCCCCe-eEEEEEEeec
Confidence 4432211111 2345676543
No 24
>PF05345 He_PIG: Putative Ig domain; InterPro: IPR008009 This alignment represents the conserved core region of a ~90 residue repeat found in several haemagglutinins and other cell surface proteins. Sequence similarities to Hyalin (IPR003410 from INTERPRO) and the PKD domain (IPR000601 from INTERPRO) suggest an Ig-like fold so this family may be similar in function to the (IPR003791 from INTERPRO) and (IPR003790 from INTERPRO) protein families.
Probab=89.08 E-value=2.1 Score=29.42 Aligned_cols=39 Identities=26% Similarity=0.284 Sum_probs=30.5
Q ss_pred CCCCCCcEEEeCCCcEEEEcccCCCC-CCCeeEEEEEEEECC
Q psy8629 383 TDDHPSWFQIDSKSGLITMKSNVDCE-TDPVPKLTVIATDNG 423 (483)
Q Consensus 383 ~~~~~~~F~id~~tG~i~~~~~lD~E-~~~~y~l~V~a~D~g 423 (483)
.+....+..+|+.+|.|.-. .+.. ....|.++|.|+|..
T Consensus 9 ~~~LP~gLs~d~~tG~isGt--p~~~~~~G~y~~~vtatd~~ 48 (49)
T PF05345_consen 9 GGGLPSGLSLDPSTGTISGT--PTSSVQPGTYTFTVTATDGS 48 (49)
T ss_pred CCCCCCcEEEeCCCCEEEee--cCCCccccEEEEEEEEEcCC
Confidence 34677889999999999766 4444 346999999999974
No 25
>TIGR00845 caca sodium/calcium exchanger 1. This model is specific for the eukaryotic sodium ion/calcium ion exchangers of the Caca family
Probab=83.46 E-value=20 Score=40.34 Aligned_cols=47 Identities=23% Similarity=0.165 Sum_probs=28.8
Q ss_pred cCCCCCeecCCceEEEecCCCCCCcEEEEEEEEeCCCCCccEEEEEEEe
Q psy8629 256 VNDNQPIFNQSRYFATVPENATVGTSIFQVFATDADAGSNGEVEYSINR 304 (483)
Q Consensus 256 vNd~~P~f~~~~~~~~v~E~~~~gt~v~~v~a~D~D~~~~~~i~y~i~~ 304 (483)
.||.++.|.-..-...|.|++ |++-.+|.-...|.+....+.|...+
T Consensus 395 ~dd~~s~i~Fe~~~Y~V~En~--GtV~VtV~R~GGdl~~tVsVdY~T~D 441 (928)
T TIGR00845 395 ENDPVSKIFFEPGHYTCLENC--GTVALTVVRRGGDLTNTVYVDYRTED 441 (928)
T ss_pred ccCCcceEEecCCeEEEeecC--cEEEEEEEEccCCCCceEEEEEEccC
Confidence 455555554444456789984 77777776555455445567787643
No 26
>TIGR03660 T1SS_rpt_143 T1SS-143 repeat domain. This model represents a domain of about 143 amino acids that may occur singly or in up to 23 tandem repeats in very large proteins in the genus Vibrio, and in related species such as Legionella pneumophila, Photobacterium profundum, Rhodopseudomonas palustris, Shewanella pealeana, and Aeromonas hydrophila. Proteins with these domains represent a subset of a broader set of proteins with a particular signal for type 1 secretion, consisting of several glycine-rich repeats modeled by pfam00353, followed by a C-terminal domain modeled by TIGR03661. Proteins with this domain tend to share several properties with the RtxA (Repeats in Toxin) protein of Vibrio cholerae, including a large size often containing tandemly repeated domains and a C-terminal signal for type 1 secretion.
Probab=81.87 E-value=20 Score=30.59 Aligned_cols=69 Identities=29% Similarity=0.353 Sum_probs=42.7
Q ss_pred CceEEecccCCCceEEEEEEeccCCCCCC---CCEEEEEEEEEECCCCCeeeEEEEEEEEEecCCCCCeecCCceEEEec
Q psy8629 197 NAFRLSSHREKDGVLYLDLQINGVLDRET---IPEYSLVIEALDGGSPPLRGSMIVNVTIQDVNDNQPIFNQSRYFATVP 273 (483)
Q Consensus 197 ~~F~i~~~~~~~g~~~~~l~~~~~LD~E~---~~~~~l~v~a~D~g~p~~~~~~~v~I~V~dvNd~~P~f~~~~~~~~v~ 273 (483)
..|.|.-.. +|... +.+.++||... .-...|.|.|+|..+- .+...+.|+|.| | .|...... ..+|.
T Consensus 56 ~Vftvtl~~--~GsYt--ftL~~~lDH~~g~d~l~l~~~v~a~D~DGD--~s~~~l~VtI~D--D-~P~~~~~~-~~~V~ 125 (137)
T TIGR03660 56 PVFTLTLNA--DGSYE--FTLEGPLDHAAGSDELTLNFPIIATDFDGD--TSSITLPVTIVD--D-VPTITDVD-ALTVD 125 (137)
T ss_pred EEEEEEECC--CccEE--EEEcccccCCCCCceEEEeeeEEEEeCCCC--ccccEEEEEEEC--C-CCeecccc-ceEEe
Confidence 345554332 34332 66678887743 3367888999997663 334588888887 5 57664443 36777
Q ss_pred CC
Q psy8629 274 EN 275 (483)
Q Consensus 274 E~ 275 (483)
|+
T Consensus 126 E~ 127 (137)
T TIGR03660 126 ED 127 (137)
T ss_pred cc
Confidence 74
No 27
>PF07495 Y_Y_Y: Y_Y_Y domain; InterPro: IPR011123 This region is mostly found at the end of the beta propellers (IPR011110 from INTERPRO) in a family of two component regulators. However they are also found tandemly repeated in Q891H4 from SWISSPROT without other signal conduction domains being present. It is named after the conserved tyrosines found in the alignment. The exact function is not known.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=79.34 E-value=10 Score=27.40 Aligned_cols=61 Identities=26% Similarity=0.251 Sum_probs=34.6
Q ss_pred CCceEEEEEeeCCCCCCCcEEEeCCCcEEEEcccCCCCCCCeeEEEEEEEECCcCCceEEEEEEEEEE
Q psy8629 371 INSVITYSLRENTDDHPSWFQIDSKSGLITMKSNVDCETDPVPKLTVIATDNGVPALSSSATVLVTIH 438 (483)
Q Consensus 371 ~~~~v~y~l~~~~~~~~~~F~id~~tG~i~~~~~lD~E~~~~y~l~V~a~D~g~p~~s~~~~v~I~V~ 438 (483)
.+-...|+|. |-...|..+...+-.+... .| ..+.|.|.|+|.|...........+.|+|.
T Consensus 6 ~~~~Y~Y~l~---g~d~~W~~~~~~~~~~~~~-~L---~~G~Y~l~V~a~~~~~~~~~~~~~l~i~I~ 66 (66)
T PF07495_consen 6 ENIRYRYRLE---GFDDEWITLGSYSNSISYT-NL---PPGKYTLEVRAKDNNGKWSSDEKSLTITIL 66 (66)
T ss_dssp TTEEEEEEEE---TTESSEEEESSTS-EEEEE-S-----SEEEEEEEEEEETTS-B-SS-EEEEEEEE
T ss_pred CceEEEEEEE---CCCCeEEECCCCcEEEEEE-eC---CCEEEEEEEEEECCCCCcCcccEEEEEEEC
Confidence 3445677787 4456677665543233222 22 478999999999976543333366666653
No 28
>COG2706 3-carboxymuconate cyclase [Carbohydrate transport and metabolism]
Probab=75.30 E-value=90 Score=30.91 Aligned_cols=75 Identities=21% Similarity=0.322 Sum_probs=44.1
Q ss_pred EEEEEecCCCCCeecCCceEEEecCCCCCCcEEEEEEEEeCCCCCccEEEEEEEecCCCCCcceEEeCCccEEEEcccCC
Q psy8629 250 NVTIQDVNDNQPIFNQSRYFATVPENATVGTSIFQVFATDADAGSNGEVEYSINRRQSDKDGLFNIDSKSGLIFVNKALD 329 (483)
Q Consensus 250 ~I~V~dvNd~~P~f~~~~~~~~v~E~~~~gt~v~~v~a~D~D~~~~~~i~y~i~~~~~~~~~~F~Id~~tG~l~~~~~LD 329 (483)
+|.|...|.+.=.|..-....+++|+- .|..=+.--...+ +++.-|.-.+ .-+.-..|.|++.+|.|.+.+.-.
T Consensus 214 tV~v~~y~~~~g~~~~lQ~i~tlP~dF-~g~~~~aaIhis~----dGrFLYasNR-g~dsI~~f~V~~~~g~L~~~~~~~ 287 (346)
T COG2706 214 TVDVLEYNPAVGKFEELQTIDTLPEDF-TGTNWAAAIHISP----DGRFLYASNR-GHDSIAVFSVDPDGGKLELVGITP 287 (346)
T ss_pred EEEEEEEcCCCceEEEeeeeccCcccc-CCCCceeEEEECC----CCCEEEEecC-CCCeEEEEEEcCCCCEEEEEEEec
Confidence 566777766555666666666777763 3332222222223 4567776543 233456899999999987765543
Q ss_pred C
Q psy8629 330 F 330 (483)
Q Consensus 330 ~ 330 (483)
-
T Consensus 288 t 288 (346)
T COG2706 288 T 288 (346)
T ss_pred c
Confidence 3
No 29
>KOG3597|consensus
Probab=70.48 E-value=38 Score=34.87 Aligned_cols=63 Identities=16% Similarity=0.162 Sum_probs=46.8
Q ss_pred eEEEEEEEEEecCCCCCeecCCceEEEecCCCCCCcEEEEEEEEeCCCCCccEEEEEEEecCCC
Q psy8629 245 GSMIVNVTIQDVNDNQPIFNQSRYFATVPENATVGTSIFQVFATDADAGSNGEVEYSINRRQSD 308 (483)
Q Consensus 245 ~~~~v~I~V~dvNd~~P~f~~~~~~~~v~E~~~~gt~v~~v~a~D~D~~~~~~i~y~i~~~~~~ 308 (483)
.+....|.|..+||.+..+....+..-+.|+...-.....+.+.|+|..+ ..+.|++....+.
T Consensus 24 ~~~~~~i~v~pvndpp~~~~~~~~~l~~~~~~~k~l~~~~l~~~d~d~~~-~~l~f~v~~t~~~ 86 (442)
T KOG3597|consen 24 QTDVLRIHVNPVNDPPSLIFPSGSLLVILEGGQKVLDPELLTAADPDSAP-LPLEFQVLGTSSV 86 (442)
T ss_pred EEeeecccccccCCCcceeecccceEEeecCCceeccceEeeccCCCCCc-cceEEEEccCCCC
Confidence 45667899999999777776666668888887655556678899999764 4588888764333
No 30
>TIGR03660 T1SS_rpt_143 T1SS-143 repeat domain. This model represents a domain of about 143 amino acids that may occur singly or in up to 23 tandem repeats in very large proteins in the genus Vibrio, and in related species such as Legionella pneumophila, Photobacterium profundum, Rhodopseudomonas palustris, Shewanella pealeana, and Aeromonas hydrophila. Proteins with these domains represent a subset of a broader set of proteins with a particular signal for type 1 secretion, consisting of several glycine-rich repeats modeled by pfam00353, followed by a C-terminal domain modeled by TIGR03661. Proteins with this domain tend to share several properties with the RtxA (Repeats in Toxin) protein of Vibrio cholerae, including a large size often containing tandemly repeated domains and a C-terminal signal for type 1 secretion.
Probab=69.73 E-value=23 Score=30.29 Aligned_cols=44 Identities=34% Similarity=0.354 Sum_probs=29.6
Q ss_pred CeeEEEEEEEECCcCCceEEEEEEEEEEecCCCCCeecCCceEEEEcCCC
Q psy8629 411 PVPKLTVIATDNGVPALSSSATVLVTIHDVNDNEPIFDQSFYNVTIPENK 460 (483)
Q Consensus 411 ~~y~l~V~a~D~g~p~~s~~~~v~I~V~dvNd~~P~f~~~~~~~~V~E~~ 460 (483)
-...|.|.|+|...- .++.++.|+|.| | .|...... ...|.|+.
T Consensus 85 l~l~~~v~a~D~DGD--~s~~~l~VtI~D--D-~P~~~~~~-~~~V~E~~ 128 (137)
T TIGR03660 85 LTLNFPIIATDFDGD--TSSITLPVTIVD--D-VPTITDVD-ALTVDEDD 128 (137)
T ss_pred EEEeeeEEEEeCCCC--ccccEEEEEEEC--C-CCeecccc-ceEEeccc
Confidence 356788999997542 334588888887 6 47765443 36787843
No 31
>PF06586 TraK: TraK protein; InterPro: IPR010563 This family consists of several TraK proteins from Escherichia coli, Salmonella typhi and Salmonella typhimurium. TraK is known to be essential for pilus assembly but its exact role in this process is unknown [].
Probab=52.29 E-value=2e+02 Score=26.78 Aligned_cols=95 Identities=14% Similarity=0.126 Sum_probs=48.8
Q ss_pred ceEEEEEeCCCCCCeEEEEEEccCCC----CCCCCEEEEecCCCCCcccEEE--eCCccEEEeCccCCccCCceeeEEEE
Q psy8629 18 HLRDLEISEGVPVGTRVGFIGDGFVG----DSGPPYLIVPVPGSAVDTDLSI--EQNTGEIRTKVPLDRETRSSYSLVAI 91 (483)
Q Consensus 18 ~~~~~~v~En~~~gt~V~~v~a~d~d----~~~~~y~i~~~~~~~~~~~F~I--d~~tG~i~~~~~LD~E~~~~y~l~v~ 91 (483)
.......-...+-++....++..|.. .....-.+....+......|.. +..+|.+.+. .. + ....+++-+.
T Consensus 19 ~~~a~~~v~~~~~~~~~i~lS~~~~NRIv~~~~~i~~V~~~~~~~~~~~~~~~~~~~~G~v~~~-~~-~-~~~p~tl~vt 95 (234)
T PF06586_consen 19 AAFAAQTVTVSPGDTVPIPLSNRDPNRIVVPGDRITSVSFPEGGCITVDIQGENDDVTGSVYVS-PT-Y-SSKPFTLFVT 95 (234)
T ss_pred hccCccEEEcCCCCeEEEEEcCCCCeEEEccCCceeEEecCCCcccccccccccccccCcEEEE-ec-C-CCCCEEEEEE
Confidence 33444444556677778888888743 1111223332211111112332 3467888877 22 2 2223566665
Q ss_pred ecCCcceeEEEeecCcEEEEcccCCccccceEEEEEEeCCCCeEEEEEE
Q psy8629 92 PLSGENVRVVSFYEPGEIRTKVPLDRETRSSYSLVAIPLSGENVRVVVR 140 (483)
Q Consensus 92 a~d~~~~~~~~~~~~g~l~~~~~ld~e~~~~~~~~~~~~~~~~~~v~I~ 140 (483)
..+|. .|+|.+.+.+....++.+.
T Consensus 96 t~~g~-------------------------~~sl~l~P~~~p~~ti~l~ 119 (234)
T PF06586_consen 96 TEKGR-------------------------TYSLTLVPKAIPAQTIRLV 119 (234)
T ss_pred eCCCC-------------------------EEEEEEEeccCCCeEEEEe
Confidence 54443 6888888877665444444
No 32
>PF05345 He_PIG: Putative Ig domain; InterPro: IPR008009 This alignment represents the conserved core region of a ~90 residue repeat found in several haemagglutinins and other cell surface proteins. Sequence similarities to Hyalin (IPR003410 from INTERPRO) and the PKD domain (IPR000601 from INTERPRO) suggest an Ig-like fold so this family may be similar in function to the (IPR003791 from INTERPRO) and (IPR003790 from INTERPRO) protein families.
Probab=48.57 E-value=85 Score=21.41 Aligned_cols=45 Identities=20% Similarity=0.426 Sum_probs=27.0
Q ss_pred EEEEecCCCCCcccEEEeCCccEEEeCccCCcc-CCceeeEEEEecCC
Q psy8629 49 YLIVPVPGSAVDTDLSIEQNTGEIRTKVPLDRE-TRSSYSLVAIPLSG 95 (483)
Q Consensus 49 y~i~~~~~~~~~~~F~Id~~tG~i~~~~~LD~E-~~~~y~l~v~a~d~ 95 (483)
|++....+.+.-....+|+.+|.|+=.- +.+ +...|.+.|.+.|+
T Consensus 2 ys~~~~~~~~LP~gLs~d~~tG~isGtp--~~~~~~G~y~~~vtatd~ 47 (49)
T PF05345_consen 2 YSLTTPTGGGLPSGLSLDPSTGTISGTP--TSSVQPGTYTFTVTATDG 47 (49)
T ss_pred EEEEcCCCCCCCCcEEEeCCCCEEEeec--CCCccccEEEEEEEEEcC
Confidence 5553222333445799999999998662 222 22467777766654
No 33
>PF03160 Calx-beta: Calx-beta domain; InterPro: IPR003644 The calx-beta motif is present as a tandem repeat in the cytoplasmic domains of Calx Na-Ca exchangers, which are used to expel calcium from cells. This motif overlaps domains used for calcium binding and regulation. The calx-beta motif is also present in the cytoplasmic tail of mammalian integrin-beta4, which mediates the bi-directional transfer of signals across the plasma membrane, as well as in some cyanobacterial proteins. This motif contains a series of beta-strands and turns that form a self-contained beta-sheet [, ].; GO: 0007154 cell communication, 0016021 integral to membrane; PDB: 3H6A_B 3FSO_A 3FQ4_B 2DPK_A 2QVM_A 3GIN_B 2QVK_A 2FWU_A 2FWS_A 3E9U_A ....
Probab=48.12 E-value=1.3e+02 Score=23.51 Aligned_cols=51 Identities=29% Similarity=0.270 Sum_probs=30.1
Q ss_pred EEEEEecCCCCCeecCCceEEEecCCCCCCcEEEEEEEEeCCCCCccEEEEEEEe
Q psy8629 250 NVTIQDVNDNQPIFNQSRYFATVPENATVGTSIFQVFATDADAGSNGEVEYSINR 304 (483)
Q Consensus 250 ~I~V~dvNd~~P~f~~~~~~~~v~E~~~~gt~v~~v~a~D~D~~~~~~i~y~i~~ 304 (483)
+|+|.| ||.+ .+.-..-..++.|+. |..-..|.-..++....-.+.|....
T Consensus 2 tvtI~d-~d~~-~v~f~~~~~~v~E~~--~~~~v~V~~~~~~~~~~v~v~~~~~~ 52 (100)
T PF03160_consen 2 TVTILD-DDDP-TVSFSSPSYTVSEGD--GTVTVTVTRSGGSLDGPVTVNYSTVD 52 (100)
T ss_dssp EEEEE--TTSE-EEEESSSEEEEETTS--SEEEEEEEEESS-TSSEEEEEEEEEE
T ss_pred EEEEEC-CCCC-EEEEeCCEEEEEeCC--CEEEEEEEEcccCCCcceEEEEEEeC
Confidence 577788 6645 776666667888885 44555555554443334556776654
No 34
>PF07495 Y_Y_Y: Y_Y_Y domain; InterPro: IPR011123 This region is mostly found at the end of the beta propellers (IPR011110 from INTERPRO) in a family of two component regulators. However they are also found tandemly repeated in Q891H4 from SWISSPROT without other signal conduction domains being present. It is named after the conserved tyrosines found in the alignment. The exact function is not known.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=46.54 E-value=39 Score=24.18 Aligned_cols=30 Identities=20% Similarity=0.199 Sum_probs=18.8
Q ss_pred CCCEEEEEEEEEECCCCCeeeEEEEEEEEE
Q psy8629 225 TIPEYSLVIEALDGGSPPLRGSMIVNVTIQ 254 (483)
Q Consensus 225 ~~~~~~l~v~a~D~g~p~~~~~~~v~I~V~ 254 (483)
....|+|.|+|.|..+........+.|.|+
T Consensus 37 ~~G~Y~l~V~a~~~~~~~~~~~~~l~i~I~ 66 (66)
T PF07495_consen 37 PPGKYTLEVRAKDNNGKWSSDEKSLTITIL 66 (66)
T ss_dssp -SEEEEEEEEEEETTS-B-SS-EEEEEEEE
T ss_pred CCEEEEEEEEEECCCCCcCcccEEEEEEEC
Confidence 356899999999976543333366666653
No 35
>TIGR00864 PCC polycystin cation channel protein. Note: this model has been restricted to the amino half because for technical reasons.
Probab=43.83 E-value=8.5e+02 Score=31.63 Aligned_cols=189 Identities=14% Similarity=0.114 Sum_probs=88.9
Q ss_pred CCCCCCEEEEEEEEEECCCCCeeeEEEEEEEEEecCCCCCeecCCceEEEecC--------------CCCCCcEEEEEEE
Q psy8629 222 DRETIPEYSLVIEALDGGSPPLRGSMIVNVTIQDVNDNQPIFNQSRYFATVPE--------------NATVGTSIFQVFA 287 (483)
Q Consensus 222 D~E~~~~~~l~v~a~D~g~p~~~~~~~v~I~V~dvNd~~P~f~~~~~~~~v~E--------------~~~~gt~v~~v~a 287 (483)
.|.+...|++++.|++..+ +...++.|.|.+.... ..+...+ ..+.+ ....+.+
T Consensus 1563 TY~spGtYtVtLTvtN~~G---s~~~T~~i~V~~~I~G--------lqI~~~~~~gg~~~~~~~~~~~~~t~-~~v~l~~ 1630 (2740)
T TIGR00864 1563 TFRSVGTFNIIVTAENDVG---AAQASIFLFVLQEIEG--------LQILGETAEGGGGGVQELDGCYFETN-HTVQFHA 1630 (2740)
T ss_pred eecCCceEEEEEEEecCCC---ccceeEEEEEeecccc--------eEEEeecccccCCcccccccceeccC-CEEEeee
Confidence 3667789999999999776 4456667777654211 0111100 00111 1244555
Q ss_pred EeCCCCCccEEEEEEEecCC-C-CCcceEEeCCccEEEEcccCCCcccceeeeEeEEEEEee--------eeeccCCCCe
Q psy8629 288 TDADAGSNGEVEYSINRRQS-D-KDGLFNIDSKSGLIFVNKALDFETKEVHDLVPRFAVEIS--------SFDEIAEPGT 357 (483)
Q Consensus 288 ~D~D~~~~~~i~y~i~~~~~-~-~~~~F~Id~~tG~l~~~~~LD~E~~~~~~l~v~~~~~v~--------~v~E~~~~gt 357 (483)
.-.| | ..++|+-.-... + ....| ..+|+- -.+-|.....|.+++..+..+. .+.| +.+.
T Consensus 1631 ~~~~-G--TnvsysW~~~~~n~~dg~~~---~~tGk~---~~vt~~~pG~Y~VtL~aSN~vG~~t~s~~i~v~e--pi~~ 1699 (2740)
T TIGR00864 1631 GFKD-G--TNLSFSWNAILDNEPDGPAF---AGSGKG---AKLNPLEAGPCDIFLQAANLLGQATADCTIDFLE--PAGN 1699 (2740)
T ss_pred ccCC-C--CeeEEEEEEecCCCCCCccc---cccCce---eEEecCCCceEEEEEEEeecccceeeEEEEEEEE--EcCc
Confidence 4444 3 345555422111 1 11112 134432 1234455677887777544433 1222 2222
Q ss_pred EEEEEEEEeCCCCCCceEEEEEeeCCCCCCCcEEEeCCCcE-EEEccc---CCCCCCCeeEEEEEEEECCcCCceEEEEE
Q psy8629 358 SVFQISAFDQDEGINSVITYSLRENTDDHPSWFQIDSKSGL-ITMKSN---VDCETDPVPKLTVIATDNGVPALSSSATV 433 (483)
Q Consensus 358 ~v~~v~a~D~D~~~~~~v~y~l~~~~~~~~~~F~id~~tG~-i~~~~~---lD~E~~~~y~l~V~a~D~g~p~~s~~~~v 433 (483)
. .+.+.+.-...|..|.|+..-..|. .-.|...-..|. ..+..+ -.|...+.|.++|.|...-. +...++
T Consensus 1700 L--~L~aS~~~aavn~~V~fs~~lt~GS-~vty~W~~gdG~s~~~s~p~~tHtf~~~G~y~VtVta~N~vs---s~~~s~ 1773 (2740)
T TIGR00864 1700 L--MLQASDNPAAVNALINLSAELAEGS-GLQYRWFLEEGDDLETSEPFMSHSFPSAGLHLVTMKAFNELG---SANASE 1773 (2740)
T ss_pred e--EEeccCCcccCCCeEEEEEEecCCC-CcEEEEEECCCCCcccCCCceEEEccCCceEEEEEEEEcccc---ccceeE
Confidence 2 1333343334566666665432222 222333211221 111222 24667889999999998754 223455
Q ss_pred EEEEEe
Q psy8629 434 LVTIHD 439 (483)
Q Consensus 434 ~I~V~d 439 (483)
.|.|.+
T Consensus 1774 ~V~Vqe 1779 (2740)
T TIGR00864 1774 EVDVQE 1779 (2740)
T ss_pred EEEEEe
Confidence 665554
No 36
>cd00146 PKD polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.
Probab=42.71 E-value=56 Score=24.45 Aligned_cols=30 Identities=20% Similarity=0.042 Sum_probs=21.7
Q ss_pred CCCCCCCeeEEEEEEEECCcCCceEEEEEEEE
Q psy8629 405 VDCETDPVPKLTVIATDNGVPALSSSATVLVT 436 (483)
Q Consensus 405 lD~E~~~~y~l~V~a~D~g~p~~s~~~~v~I~ 436 (483)
..|...+.|.++++++|... .+.+.++.|.
T Consensus 51 ~~y~~~G~y~v~l~v~d~~g--~~~~~~~~V~ 80 (81)
T cd00146 51 HTYTKPGTYTVTLTVTNAVG--SSSTKTTTVV 80 (81)
T ss_pred EEcCCCcEEEEEEEEEeCCC--CEEEEEEEEE
Confidence 45778899999999999853 3444455554
No 37
>smart00089 PKD Repeats in polycystic kidney disease 1 (PKD1) and other proteins. Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.
Probab=42.37 E-value=59 Score=24.16 Aligned_cols=31 Identities=26% Similarity=0.164 Sum_probs=23.4
Q ss_pred cCCCCCCCeeEEEEEEEECCcCCceEEEEEEEEE
Q psy8629 404 NVDCETDPVPKLTVIATDNGVPALSSSATVLVTI 437 (483)
Q Consensus 404 ~lD~E~~~~y~l~V~a~D~g~p~~s~~~~v~I~V 437 (483)
..-|...+.|.+++.++|... +++.+++|.|
T Consensus 48 ~~~y~~~G~y~v~l~v~n~~g---~~~~~~~i~v 78 (79)
T smart00089 48 THTYTKPGTYTVTLTVTNAVG---SASATVTVVV 78 (79)
T ss_pred EEEeCCCcEEEEEEEEEcCCC---cEEEEEEEEE
Confidence 346778899999999999765 5566666665
No 38
>PF12245 Big_3_2: Bacterial Ig-like domain (group 3); InterPro: IPR022038 This family of proteins is found in bacteria. They have two conserved sequence motifs: AGN and GMT.
Probab=40.76 E-value=1.1e+02 Score=21.75 Aligned_cols=31 Identities=39% Similarity=0.310 Sum_probs=21.2
Q ss_pred CCCeeEEEEEEEECCcCCceEEEEEEEEEEecC
Q psy8629 409 TDPVPKLTVIATDNGVPALSSSATVLVTIHDVN 441 (483)
Q Consensus 409 ~~~~y~l~V~a~D~g~p~~s~~~~v~I~V~dvN 441 (483)
..+.|.+.++|+|..+ ..++....+.+.|..
T Consensus 21 ~dg~yt~~v~a~D~AG--N~~~~~~~~~i~d~~ 51 (60)
T PF12245_consen 21 ADGEYTLTVTATDKAG--NTSSSTTQIVIVDNT 51 (60)
T ss_pred CCccEEEEEEEEECCC--CEEEeeeEEEEEcCC
Confidence 4678999999999854 244555555665643
No 39
>PF03160 Calx-beta: Calx-beta domain; InterPro: IPR003644 The calx-beta motif is present as a tandem repeat in the cytoplasmic domains of Calx Na-Ca exchangers, which are used to expel calcium from cells. This motif overlaps domains used for calcium binding and regulation. The calx-beta motif is also present in the cytoplasmic tail of mammalian integrin-beta4, which mediates the bi-directional transfer of signals across the plasma membrane, as well as in some cyanobacterial proteins. This motif contains a series of beta-strands and turns that form a self-contained beta-sheet [, ].; GO: 0007154 cell communication, 0016021 integral to membrane; PDB: 3H6A_B 3FSO_A 3FQ4_B 2DPK_A 2QVM_A 3GIN_B 2QVK_A 2FWU_A 2FWS_A 3E9U_A ....
Probab=39.69 E-value=1.8e+02 Score=22.69 Aligned_cols=53 Identities=21% Similarity=0.202 Sum_probs=29.6
Q ss_pred EEEEecCCCCCCccCCCceEEEEeCCCCCCeEEEEEEEEeCCCCCCceEEEEEEeCC
Q psy8629 138 VVRVIDENDNAPTFPSPFMSIEFPENTPRDVKRTLNAAKDRDLGIYNTQRYNIVSGN 194 (483)
Q Consensus 138 ~I~V~DvND~~P~f~~~~~~~~V~E~~~~gt~v~~v~a~D~D~~~~~~~~y~l~~~~ 194 (483)
+|+|.| ||.+ .+....-...+.|+.. .....+.-..++....-.+.|...+|.
T Consensus 2 tvtI~d-~d~~-~v~f~~~~~~v~E~~~--~~~v~V~~~~~~~~~~v~v~~~~~~gt 54 (100)
T PF03160_consen 2 TVTILD-DDDP-TVSFSSPSYTVSEGDG--TVTVTVTRSGGSLDGPVTVNYSTVDGT 54 (100)
T ss_dssp EEEEE--TTSE-EEEESSSEEEEETTSS--EEEEEEEEESS-TSSEEEEEEEEEESS
T ss_pred EEEEEC-CCCC-EEEEeCCEEEEEeCCC--EEEEEEEEcccCCCcceEEEEEEeCCc
Confidence 577888 7765 7766666688899864 333334333343233345666666543
No 40
>PF12245 Big_3_2: Bacterial Ig-like domain (group 3); InterPro: IPR022038 This family of proteins is found in bacteria. They have two conserved sequence motifs: AGN and GMT.
Probab=39.10 E-value=1.2e+02 Score=21.55 Aligned_cols=30 Identities=27% Similarity=0.299 Sum_probs=21.0
Q ss_pred CCEEEEEEEEEECCCCCeeeEEEEEEEEEecC
Q psy8629 226 IPEYSLVIEALDGGSPPLRGSMIVNVTIQDVN 257 (483)
Q Consensus 226 ~~~~~l~v~a~D~g~p~~~~~~~v~I~V~dvN 257 (483)
...|.+.++|+|..+ ..+.....+.+.|..
T Consensus 22 dg~yt~~v~a~D~AG--N~~~~~~~~~i~d~~ 51 (60)
T PF12245_consen 22 DGEYTLTVTATDKAG--NTSSSTTQIVIVDNT 51 (60)
T ss_pred CccEEEEEEEEECCC--CEEEeeeEEEEEcCC
Confidence 567999999999766 455555556555543
No 41
>PF13956 Ibs_toxin: Toxin Ibs, type I toxin-antitoxin system
Probab=34.72 E-value=25 Score=18.54 Aligned_cols=16 Identities=25% Similarity=0.310 Sum_probs=11.7
Q ss_pred ChhhHHHHHHHHhhhc
Q psy8629 1 MRLWRLLWLLAAVAAD 16 (483)
Q Consensus 1 ~~~~~~~~~~~~~~~~ 16 (483)
||+.+.|.+|+..+|.
T Consensus 2 Mk~vIIlvvLLliSf~ 17 (19)
T PF13956_consen 2 MKLVIILVVLLLISFP 17 (19)
T ss_pred ceehHHHHHHHhcccc
Confidence 6778888877766664
No 42
>cd00146 PKD polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.
Probab=34.14 E-value=92 Score=23.22 Aligned_cols=29 Identities=7% Similarity=0.092 Sum_probs=20.4
Q ss_pred CCCCCCEEEEEEEEEECCCCCeeeEEEEEEE
Q psy8629 222 DRETIPEYSLVIEALDGGSPPLRGSMIVNVT 252 (483)
Q Consensus 222 D~E~~~~~~l~v~a~D~g~p~~~~~~~v~I~ 252 (483)
.|.....|.+++.++|..+ .+......|.
T Consensus 52 ~y~~~G~y~v~l~v~d~~g--~~~~~~~~V~ 80 (81)
T cd00146 52 TYTKPGTYTVTLTVTNAVG--SSSTKTTTVV 80 (81)
T ss_pred EcCCCcEEEEEEEEEeCCC--CEEEEEEEEE
Confidence 4677889999999999754 3444444544
No 43
>PF12988 DUF3872: Domain of unknown function, B. Theta Gene description (DUF3872); InterPro: IPR024355 This entry represents proteins of unknown function found primarily in Bacteroides species. The Bacteroides thetaiotaomicron gene coding for this protein is located in a conjugate transposon and appears to be upregulated in the presence of host or other bacterial species compared to growth in pure culture [, ].; PDB: 2L3B_A 2L7Q_A.
Probab=33.93 E-value=2.8e+02 Score=23.56 Aligned_cols=42 Identities=17% Similarity=0.168 Sum_probs=17.2
Q ss_pred EeCCCCCCeEE-EEEEccCCC-CCCCCEEEEecCCCCCcccEEEeCCccEEEeC
Q psy8629 24 ISEGVPVGTRV-GFIGDGFVG-DSGPPYLIVPVPGSAVDTDLSIEQNTGEIRTK 75 (483)
Q Consensus 24 v~En~~~gt~V-~~v~a~d~d-~~~~~y~i~~~~~~~~~~~F~Id~~tG~i~~~ 75 (483)
|+.....|..| .++.-.-.+ -..-.|+|. +|..| ..|+|+..
T Consensus 40 Vpk~I~~GeTvEIR~~l~reG~y~~t~Y~iR---------YFQ~d-GkG~L~~~ 83 (137)
T PF12988_consen 40 VPKKIKKGETVEIRCELKREGNYADTRYTIR---------YFQPD-GKGTLRMD 83 (137)
T ss_dssp --SS--TTEEEEEEEEEEESS--SS---EEE---------EE-SS-S-EEEEET
T ss_pred cccccCCCCEEEEEEEEecCceecccEEEEE---------EEeec-CCEEEEec
Confidence 56777778776 233222222 223467776 66666 34777654
No 44
>smart00089 PKD Repeats in polycystic kidney disease 1 (PKD1) and other proteins. Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.
Probab=33.84 E-value=99 Score=22.89 Aligned_cols=29 Identities=14% Similarity=0.208 Sum_probs=22.0
Q ss_pred CCCCCCEEEEEEEEEECCCCCeeeEEEEEEEE
Q psy8629 222 DRETIPEYSLVIEALDGGSPPLRGSMIVNVTI 253 (483)
Q Consensus 222 D~E~~~~~~l~v~a~D~g~p~~~~~~~v~I~V 253 (483)
-|.....|.+++.+.|..+ ++++.++|.|
T Consensus 50 ~y~~~G~y~v~l~v~n~~g---~~~~~~~i~v 78 (79)
T smart00089 50 TYTKPGTYTVTLTVTNAVG---SASATVTVVV 78 (79)
T ss_pred EeCCCcEEEEEEEEEcCCC---cEEEEEEEEE
Confidence 4667789999999999776 5566666655
No 45
>PF10365 DUF2436: Domain of unknown function (DUF2436); InterPro: IPR018832 Gingipains R and K are endopeptidases with specificity for arginyl and lysyl bonds, respectively. Like other cysteine peptidases, they require reducing conditions for activity. They are maximally active at approximately neutral pH. Gingipains R and K are secreted by the bacterium Porphyromonas gingivalis (Bacteroides gingivalis). The bacterium is a major pathogen in periodontal disease, and the many ways in which the activities of the gingipains may contribute to the disease processes have been reviewed []. These enzymes are also involved in the hemagglutinating activity of the organisms. This entry represents a central region found in gingipain K peptidases, active on lysyl bonds; they belong to the MEROPS peptidase family C25 (gingipain family, clan CD).
Probab=32.31 E-value=3.2e+02 Score=23.31 Aligned_cols=90 Identities=18% Similarity=0.185 Sum_probs=52.5
Q ss_pred EecCCCCCCccCCCceEEEEeCCCCCCeEEEEEEEEeCCC---CCCceEEEEEEeCCCCCceEEecccCCCceEEEEEEe
Q psy8629 141 VIDENDNAPTFPSPFMSIEFPENTPRDVKRTLNAAKDRDL---GIYNTQRYNIVSGNTNNAFRLSSHREKDGVLYLDLQI 217 (483)
Q Consensus 141 V~DvND~~P~f~~~~~~~~V~E~~~~gt~v~~v~a~D~D~---~~~~~~~y~l~~~~~~~~F~i~~~~~~~g~~~~~l~~ 217 (483)
+.+ |+++|.=.-..|...|++|+-+...- +-...|... -+.+..-|.|....++...-|....| ++.
T Consensus 63 l~~-n~~~pa~ly~~FEYkiP~NADps~tp-q~mv~dG~~~i~IPaG~YDy~I~~P~~~~kiwIaGd~g-~~~------- 132 (161)
T PF10365_consen 63 LWD-NCNVPANLYDPFEYKIPANADPSTTP-QNMVVDGEASIDIPAGTYDYCIAAPQPGGKIWIAGDGG-DGP------- 132 (161)
T ss_pred ccc-CCCCChhhcccceEeccCCCCCccCc-ceEEecCceEEEecCceeEEEEecCCCCCeEEEecCCC-CCC-------
Confidence 344 77788766677889999997653322 211222211 12356678887766655555554321 111
Q ss_pred ccC--CCCCCCCEEEEEEEEEECCC
Q psy8629 218 NGV--LDRETIPEYSLVIEALDGGS 240 (483)
Q Consensus 218 ~~~--LD~E~~~~~~l~v~a~D~g~ 240 (483)
.+. .-+|.-+.|+++++....|.
T Consensus 133 tr~dDy~fEAGKtY~ftm~~~g~gD 157 (161)
T PF10365_consen 133 TRGDDYVFEAGKTYRFTMKRVGSGD 157 (161)
T ss_pred ccccceEEecCCEEEEEEEeccCCC
Confidence 222 23577889999999887653
No 46
>PF13754 Big_3_4: Bacterial Ig-like domain (group 3)
Probab=30.83 E-value=1.7e+02 Score=20.32 Aligned_cols=16 Identities=38% Similarity=0.320 Sum_probs=13.2
Q ss_pred CCCeeEEEEEEEECCc
Q psy8629 409 TDPVPKLTVIATDNGV 424 (483)
Q Consensus 409 ~~~~y~l~V~a~D~g~ 424 (483)
..+.|.++|+|+|...
T Consensus 22 ~dG~y~itv~a~D~AG 37 (54)
T PF13754_consen 22 ADGTYTITVTATDAAG 37 (54)
T ss_pred CCccEEEEEEEEeCCC
Confidence 4788999999999743
No 47
>PF02494 HYR: HYR domain; InterPro: IPR003410 This domain is known as the HYR (Hyalin Repeat) domain, after the protein hyalin that is composed exclusively of this repeat. This domain probably corresponds to a new superfamily in the immunoglobulin fold. The function of this domain is uncertain it may be involved in cell adhesion. In the Sushi repeat-containing protein (SrpX), this domain is found between two sushi repeats.
Probab=24.29 E-value=1.5e+02 Score=22.30 Aligned_cols=26 Identities=27% Similarity=0.212 Sum_probs=20.3
Q ss_pred CCeeEEEEEEEECCcCCceEEEEEEEEE
Q psy8629 410 DPVPKLTVIATDNGVPALSSSATVLVTI 437 (483)
Q Consensus 410 ~~~y~l~V~a~D~g~p~~s~~~~v~I~V 437 (483)
.+.|.++..|+|..+ ..+++.+.|+|
T Consensus 56 ~G~t~V~ytA~D~~G--N~a~C~f~V~V 81 (81)
T PF02494_consen 56 VGTTTVTYTATDAAG--NSATCSFTVTV 81 (81)
T ss_pred eceEEEEEEEEECCC--CEEEEEEEEEC
Confidence 346899999999744 57788888875
No 48
>PF09100 Qn_am_d_aIV: Quinohemoprotein amine dehydrogenase, alpha subunit domain IV; InterPro: IPR015184 This domain is predominantly found in the prokaryotic protein quinohemoprotein amine dehydrogenase, adopting an immunoglobulin-like beta-sandwich fold, with seven strands arranged into two beta sheets; the fold is possibly related to the immunoglobulin and/or fibronectin type III superfamilies. The precise function of this domain has not, as yet, been defined []. ; PDB: 1JMZ_A 1JMX_A 1PBY_A 1JJU_A.
Probab=22.00 E-value=2e+02 Score=24.08 Aligned_cols=31 Identities=35% Similarity=0.543 Sum_probs=16.5
Q ss_pred EEEEEEEEC-CcCCceEEEEEEEEEEecCCCCC
Q psy8629 414 KLTVIATDN-GVPALSSSATVLVTIHDVNDNEP 445 (483)
Q Consensus 414 ~l~V~a~D~-g~p~~s~~~~v~I~V~dvNd~~P 445 (483)
+|.|.|+=. +.-+++..+.+.|+|...|+ ||
T Consensus 101 nl~VvAtv~d~~~~l~~e~~liVtVqr~~~-pp 132 (133)
T PF09100_consen 101 NLKVVATVKDGGKPLTGEAHLIVTVQRWNN-PP 132 (133)
T ss_dssp EEEEEEEETTTT---EEEEEEEEE---S----S
T ss_pred cEEEEEEEccCCcccceeEeEEEEeecccC-CC
Confidence 666666654 33468999999999999988 55
No 49
>PF02494 HYR: HYR domain; InterPro: IPR003410 This domain is known as the HYR (Hyalin Repeat) domain, after the protein hyalin that is composed exclusively of this repeat. This domain probably corresponds to a new superfamily in the immunoglobulin fold. The function of this domain is uncertain it may be involved in cell adhesion. In the Sushi repeat-containing protein (SrpX), this domain is found between two sushi repeats.
Probab=21.60 E-value=3.5e+02 Score=20.19 Aligned_cols=24 Identities=17% Similarity=0.188 Sum_probs=20.0
Q ss_pred EEEEEEEEEECCCCCeeeEEEEEEEE
Q psy8629 228 EYSLVIEALDGGSPPLRGSMIVNVTI 253 (483)
Q Consensus 228 ~~~l~v~a~D~g~p~~~~~~~v~I~V 253 (483)
.|.++..|+|..+ .++++.+.|+|
T Consensus 58 ~t~V~ytA~D~~G--N~a~C~f~V~V 81 (81)
T PF02494_consen 58 TTTVTYTATDAAG--NSATCSFTVTV 81 (81)
T ss_pred eEEEEEEEEECCC--CEEEEEEEEEC
Confidence 6889999999766 68888888875
Done!