Query psy1887
Match_columns 653
No_of_seqs 267 out of 2772
Neff 8.7
Searched_HMMs 46136
Date Fri Aug 16 18:47:41 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy1887.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/1887hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG4289|consensus 100.0 8.5E-80 1.8E-84 673.2 53.5 542 2-591 246-945 (2531)
2 KOG1219|consensus 100.0 5.6E-70 1.2E-74 612.5 64.3 566 7-619 399-1277(4289)
3 KOG4289|consensus 100.0 9.1E-71 2E-75 602.5 55.5 544 21-614 160-889 (2531)
4 KOG1219|consensus 100.0 8.6E-69 1.9E-73 603.0 63.2 564 6-615 123-1164(4289)
5 cd00031 CA Cadherin repeat dom 100.0 2.1E-28 4.6E-33 238.8 28.7 197 311-526 1-199 (199)
6 cd00031 CA Cadherin repeat dom 99.9 7.9E-24 1.7E-28 206.5 24.4 161 431-594 1-161 (199)
7 PF00028 Cadherin: Cadherin do 99.7 3.7E-16 8E-21 132.6 14.3 92 432-525 1-93 (93)
8 smart00112 CA Cadherin repeats 99.6 9.4E-15 2E-19 119.9 11.0 79 452-532 1-79 (79)
9 KOG1834|consensus 99.6 7.9E-14 1.7E-18 146.8 20.0 216 294-527 20-245 (952)
10 PF00028 Cadherin: Cadherin do 99.5 1.2E-13 2.7E-18 117.0 12.3 90 312-418 1-93 (93)
11 smart00112 CA Cadherin repeats 99.4 7.7E-13 1.7E-17 108.5 10.3 77 337-425 1-79 (79)
12 KOG1834|consensus 99.1 2E-09 4.3E-14 114.2 16.6 154 225-420 86-245 (952)
13 PF08266 Cadherin_2: Cadherin- 97.0 0.00076 1.7E-08 55.2 4.3 63 432-497 3-67 (84)
14 PF08758 Cadherin_pro: Cadheri 96.5 0.027 5.8E-07 46.8 9.7 79 424-512 3-81 (90)
15 PF08758 Cadherin_pro: Cadheri 96.2 0.025 5.5E-07 47.0 7.8 80 102-196 2-82 (90)
16 TIGR01965 VCBS_repeat VCBS rep 95.8 0.12 2.6E-06 43.6 10.1 89 447-546 2-97 (99)
17 smart00736 CADG Dystroglycan-t 95.8 0.15 3.2E-06 43.2 10.8 70 451-529 24-96 (97)
18 TIGR01965 VCBS_repeat VCBS rep 95.7 0.073 1.6E-06 44.8 8.3 90 334-440 4-98 (99)
19 PF08266 Cadherin_2: Cadherin- 95.3 0.025 5.4E-07 46.4 4.4 61 312-386 3-67 (84)
20 PF13750 Big_3_3: Bacterial Ig 94.3 4.4 9.6E-05 37.6 17.3 130 387-527 15-150 (158)
21 PF13750 Big_3_3: Bacterial Ig 93.3 5.8 0.00013 36.8 15.9 131 265-418 14-148 (158)
22 smart00736 CADG Dystroglycan-t 92.8 1.1 2.4E-05 37.8 9.5 70 336-422 24-96 (97)
23 TIGR00845 caca sodium/calcium 91.8 11 0.00024 44.6 18.7 150 420-579 395-571 (928)
24 TIGR00845 caca sodium/calcium 85.7 87 0.0019 37.4 22.5 157 300-469 395-569 (928)
25 PF05345 He_PIG: Putative Ig d 84.0 3.8 8.2E-05 29.8 5.6 36 471-510 12-48 (49)
26 TIGR03660 T1SS_rpt_143 T1SS-14 78.3 40 0.00086 30.4 11.3 57 375-441 70-129 (137)
27 PF07495 Y_Y_Y: Y_Y_Y domain; 69.9 44 0.00095 25.4 8.5 60 459-525 7-66 (66)
28 KOG3597|consensus 68.8 1.1E+02 0.0025 33.3 13.9 153 409-576 24-194 (442)
29 PF03160 Calx-beta: Calx-beta 65.2 81 0.0017 26.3 10.2 53 414-470 2-54 (100)
30 TIGR03660 T1SS_rpt_143 T1SS-14 57.2 1.5E+02 0.0032 26.8 10.5 44 498-547 85-128 (137)
31 PF07495 Y_Y_Y: Y_Y_Y domain; 43.6 1E+02 0.0022 23.3 6.4 47 341-399 4-51 (66)
32 KOG3597|consensus 42.9 4.9E+02 0.011 28.5 15.4 50 289-343 24-73 (442)
33 PF15418 DUF4625: Domain of un 37.0 70 0.0015 28.7 5.0 24 182-205 108-131 (132)
34 smart00089 PKD Repeats in poly 33.8 2.3E+02 0.005 22.1 8.0 30 492-524 49-78 (79)
35 cd00146 PKD polycystic kidney 31.9 2.5E+02 0.0055 22.0 7.3 31 381-417 51-81 (81)
36 PF03160 Calx-beta: Calx-beta 31.7 1.2E+02 0.0027 25.1 5.6 52 12-67 2-53 (100)
37 PF13753 SWM_repeat: Putative 24.8 7.8E+02 0.017 25.4 12.3 132 387-535 12-147 (317)
38 KOG4221|consensus 23.1 1.5E+03 0.033 28.2 30.5 130 387-530 891-1029(1381)
39 PF12245 Big_3_2: Bacterial Ig 23.0 2.9E+02 0.0064 20.8 5.7 30 386-421 22-51 (60)
40 cd02848 Chitinase_N_term Chiti 22.6 1.8E+02 0.004 24.9 4.8 34 491-526 73-106 (106)
41 PF12245 Big_3_2: Bacterial Ig 20.1 3.8E+02 0.0083 20.1 5.8 32 495-528 20-51 (60)
No 1
>KOG4289|consensus
Probab=100.00 E-value=8.5e-80 Score=673.20 Aligned_cols=542 Identities=26% Similarity=0.367 Sum_probs=481.9
Q ss_pred CCcceeeEEEEEEEEeccCCCCcCCCCCceEEecCCCCCCcEEEEEEEecCCCCCCceEEEEEecCCCCCcceEEEcCCC
Q psy1887 2 DSRNIAGFEVVVVVDDVQDTPPIFINIQPVIQLAPNLTMNDVLTKITAIDGDKGHPRTIKYGLVSEGHPMTVFFTINELT 81 (653)
Q Consensus 2 d~~~~~~~~v~I~V~DvNDn~P~F~~~~~~~~V~E~~~~g~~i~~v~A~D~D~g~n~~i~ysl~~~~~~~~~~F~Id~~t 81 (653)
|-|-+++..|.|.|+|+|||+|+|.+.+|.-++.||.++|+.|.+++|+|.|+++|+.|.|+|..+.. .+.|.||+.+
T Consensus 246 ~P~~SAtttv~V~V~D~nDhsPvFEq~~Y~e~lREn~evGy~vLtvrAtD~Dsp~Nani~Yrl~eg~~--~~~f~in~rS 323 (2531)
T KOG4289|consen 246 DPRRSATTTVTVLVLDTNDHSPVFEQDEYREELRENLEVGYEVLTVRATDGDSPPNANIRYRLLEGNA--KNVFEINPRS 323 (2531)
T ss_pred CCcccceeEEEEEEeecCCCCcccchhHHHHHHhhccccCceEEEEEeccCCCCCCCceEEEecCCCc--cceeEEcCcc
Confidence 55778999999999999999999999999999999999999999999999999999999999988733 3589999999
Q ss_pred CCCCC--------------------------ccEEEEEEEEeeecCCCceeccccCeEEEEeecCCCCceEEEEEEEecC
Q psy1887 82 GPKMS--------------------------PQVFLLVLMCYSVVANYPVFDISTQMRSLLIPASVKLGTIIYRLRASDS 135 (653)
Q Consensus 82 G~~~~--------------------------~~~~~vvi~v~d~Nd~~P~f~~~~~~~~~~v~E~~~~gt~i~~v~a~D~ 135 (653)
|++.+ +....|.|+|.|.|||+|.|....| .+.|.|+..++++|++|+|+|.
T Consensus 324 GvI~T~a~lDRE~~~~y~L~VeAsDqG~~pgp~Ta~V~itV~D~NDNaPqFse~~Y--vvqv~Edvt~~avvlrV~AtDr 401 (2531)
T KOG4289|consen 324 GVISTRAPLDREELESYQLDVEASDQGRPPGPRTAMVEITVEDENDNAPQFSEKRY--VVQVREDVTPPAVVLRVTATDR 401 (2531)
T ss_pred ceeeccCccCHHhhhheEEEEEeccCCCCCCCceEEEEEEEEecCCCCccccccce--EEEecccCCCCceEEEEEeccc
Confidence 99622 2345577899999999999999987 9999999999999999999999
Q ss_pred CCC--cceEEEe-cCCCceEEEEeeccCCCCCCeeEEEEEEccccCC-CceEEEEEEEEECCCCC--eeeeEEEEEEecC
Q psy1887 136 DKD--YPLTFDA-TDFGSYVVKIKSLPCSKNSSFCEANVYLDRVLVP-GQVFQFRIIVKDTRGDT--TTVPTSLTATDAA 209 (653)
Q Consensus 136 D~~--~~~~y~i-~~~~~~~f~i~~~~~~~~~~~~~g~i~l~~~le~-~~~~~~~v~v~d~~g~~--~~~~~~i~v~~~~ 209 (653)
|.+ ..++|+| ++++...|.|+.. +|.+.+..+|+. ...|.++|+|+|.+.+. ...-+.|+|.++|
T Consensus 402 D~g~Ng~VHYsi~Sgn~~G~f~id~~---------tGel~vv~plD~e~~~ytl~IrAqDggrPpLsn~sgl~iqVlDIN 472 (2531)
T KOG4289|consen 402 DKGTNGKVHYSIASGNGRGQFYIDSL---------TGELDVVEPLDFENSEYTLRIRAQDGGRPPLSNTSGLVIQVLDIN 472 (2531)
T ss_pred CCCcCceEEEEeeccCccccEEEecc---------cceEEEeccccccCCeeEEEEEcccCCCCCccCCCceEEEEEecC
Confidence 987 5899999 7788889999875 899999999953 33789999888876554 3344568888888
Q ss_pred CCCCCCCCCC--------------------------------------CeEEEEeCCCcEEEEeecCC-CCcceEEEEE-
Q psy1887 210 LNINAQFPHI--------------------------------------PGVIVVPEVLGTYLLWRHLS-GPKNMYLMRE- 249 (653)
Q Consensus 210 ~~~~p~~~~~--------------------------------------~~~~~i~e~~G~i~~~~~lD-e~~~~y~l~~- 249 (653)
|+.|+|-.. .+.|.|++.+|.|++.+.|| |....|.|.+
T Consensus 473 -DhaPifvstpfq~tvlEnv~lg~~v~~vqaidadsg~na~l~y~laG~~pf~I~~~SG~Itvtk~ldrEt~~~ysl~V~ 551 (2531)
T KOG4289|consen 473 -DHAPIFVSTPFQATVLENVPLGYLVCHVQAIDADSGENARLHYSLAGVGPFQINNGSGWITVTKELDRETVEHYSLGVE 551 (2531)
T ss_pred -CCCceeEechhhhhhhhcccccceEEEEecccCCCCcccceeeeeccCCCeeEecCCceEEEeecccccccceEEEEEE
Confidence 888887322 24577888889999999999 9888888876
Q ss_pred --------------------------------------------------------------------------------
Q psy1887 250 -------------------------------------------------------------------------------- 249 (653)
Q Consensus 250 -------------------------------------------------------------------------------- 249 (653)
T Consensus 552 ard~gtp~l~tstsI~Vtv~dvndndP~Ft~~eytl~inED~pvgsSI~tvtAvD~d~~s~ityqi~g~ntrn~Fsi~si 631 (2531)
T KOG4289|consen 552 ARDHGTPPLSTSTSISVTVLDVNDNDPTFTQKEYTLRINEDAPVGSSIVTVTAVDRDANSVITYQITGGNTRNRFSISSI 631 (2531)
T ss_pred EcCCCCCcccccceEEEEecccCCCCCccccCceEEEecCCccccceEEEEEEeccccccceEEEecCCcccccceeecc
Confidence
Q ss_pred -----EEeeCCccccceecCCCcEEEEEEEEEecccCCCCCCCeEEEEEEEEEEccCCCCCeecCCceEEEEeCCCCCCc
Q psy1887 250 -----IRLSKPYRELHTVASGQPVILMVLAEEERKDLNEPSPQSSTATIALVFDQAINTPPYFDTVQYITHLDENSPQGT 324 (653)
Q Consensus 250 -----~~~~~~ld~~~~~~~~~~~~l~V~A~d~~~d~~~~~~~s~~~~v~i~V~dvNd~~P~f~~~~~~~~v~En~~~gt 324 (653)
+.+..|+++.. -++|.+.|+|+| | .+..++.|.|.|.|.|-+.|.|....|+++|+|+.|.|+
T Consensus 632 ~g~Glitlalp~dkKq----e~~~vl~vtAtD-----g---~l~d~~~V~v~I~danThrpvFqs~pfTvsI~e~rP~G~ 699 (2531)
T KOG4289|consen 632 GGGGLITLALPLDKKQ----ERQYVLAVTATD-----G---TLQDTCSVNVNITDANTHRPVFQSSPFTVSINEDRPLGT 699 (2531)
T ss_pred CCcceEEeecchhhcc----cceEEEEEEecC-----C---ccccceEEEEEeeecccCCcccccCCeeEeeccCCcCCc
Confidence 33333444332 456789999986 2 377889999999999999999999999999999999999
Q ss_pred EEEeeeceeeEEeeCCCCcceEEEEEEecCCCcEEEcccceeeeeeEEEEEecCCCCCccCCCeEEEEEEEEECCCCCCC
Q psy1887 325 ALIFAESFHTQVTDDNMGKNGIFSLTLENNNGTFEIWPSVVERKAQFTIRVRNNKNLDYERTRTLSFVIVAKEISSDSSS 404 (653)
Q Consensus 325 ~v~~v~~~~~~a~D~D~g~n~~~~~~~~~~~~~F~I~~~tg~~~~~~~i~l~~~~~LD~E~~~~~~l~V~a~D~~~~~~~ 404 (653)
.|.. |+|+|.|.|.|++++|.+.+. .|+|++.+|. +.+...||||.+-.|.+.+.|+|.+ .
T Consensus 700 tvvt-----lsasd~D~geNARI~y~led~--~Frid~dsg~--------i~t~~~ld~edqvtytl~itA~D~~----~ 760 (2531)
T KOG4289|consen 700 TVVT-----LSASDEDTGENARITYILEDE--AFRIDPDSGA--------IYTQAELDYEDQVTYTLAITARDNG----I 760 (2531)
T ss_pred eeEE-----EecccCCCCccceEEEEeccc--ceeecCCCCc--------eEEeeeeecccceeeEeeeeecCCC----C
Confidence 9997 699999999999999965544 4999999998 7778999999999999999999998 7
Q ss_pred CCceEEEEEEEEEeecCCCCCeecccceEEEeeCCCCCCcEEEEEEEeeCCCCCCceEEEEEEcCC-CCcceEEeCCCCe
Q psy1887 405 NLLSSQAPVLVYINDVNDNPPVFTATLYTAKIPENATAGEKVVQVKATDVDTNLGGEILYTAILGY-KNSSLELDAHTGD 483 (653)
Q Consensus 405 ~~~s~~~~v~I~V~DvNDn~P~f~~~~y~~~V~E~~~~gt~v~~v~a~D~D~g~n~~i~y~i~~~~-~~~~F~Id~~tG~ 483 (653)
|++..+++|.|.|.|+|||+|+|..+.|.++|.|++|++|.+++|+|+|+|.|.|+.+.|.+-.+. ..+.|.|++.+|.
T Consensus 761 pq~adtttveV~v~diNDnaPqf~assyt~sV~Ed~Pv~TsvlQVSatDaD~g~Ng~v~y~~qg~~d~p~~F~IEptSGv 840 (2531)
T KOG4289|consen 761 PQKADTTTVEVLVNDINDNAPQFLASSYTGSVFEDAPVFTSVLQVSATDADSGPNGRVYYTFQGGDDGPGDFYIEPTSGV 840 (2531)
T ss_pred CCcCccEEEEEEeecccccCcccchhhceeEeecCCCCcceEEEEEEeccCCCCCceEEEEecCCCCCCCceEEccCcce
Confidence 889999999999999999999999999999999999999999999999999999999999865442 3467999999999
Q ss_pred EEEEcCccCCccCccEEEEEEEEEECCCCCceEEEEEEEEEEecCCCCCeEeecCcEEEEecCCCCCeEEEEEEEEeCCC
Q psy1887 484 ITIANGQQFDREEASEYKFQVEARDMQGLGLRTVVPLQLTILDVNDNAPIFVQTPFEFVLASDSRNFSERTFIKATDQDA 563 (653)
Q Consensus 484 i~~~~~~~lD~E~~~~~~l~V~a~D~g~~~~~~~~~v~I~V~DvNDn~P~f~~~~y~~~v~e~~~~g~~v~~v~A~D~D~ 563 (653)
|++. ..||||....|.|.+.|+|.|.|++++.+.|+|+|+|+|||||+|.++.|...|.||.+.|+.|++|.|.|||+
T Consensus 841 iRtl--~rLdRE~~avy~L~a~avDrg~p~ls~~~eItvtvldvNDnaPvfe~~e~e~~I~enspvgs~va~i~a~dpdE 918 (2531)
T KOG4289|consen 841 IRTL--RRLDRENVAVYVLAAYAVDRGNPPLSAPVEITVTVLDVNDNAPVFEQDELELFIEENSPVGSVVALITADDPDE 918 (2531)
T ss_pred eehh--hhhcchheeEEEEEEEEeeCCCCCcCCceEEEEEEEecCCCCCCCCCcceeeEEeecCccceeeEEEEccCCCc
Confidence 9998 88999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred CCCCCeEEEEEEeCCCCCcEEEEcCCCc
Q psy1887 564 EAPNNIVRYEIISGNYDNKFSLHPETGV 591 (653)
Q Consensus 564 ~~~n~~i~Y~i~~~~~~~~F~Id~~tG~ 591 (653)
| +|+.|.|+|..|+....|.++...|+
T Consensus 919 G-~NA~IsYqIvgg~d~~~fq~de~~~~ 945 (2531)
T KOG4289|consen 919 G-PNAHISYQIVGGNDPELFQLDEFSGE 945 (2531)
T ss_pred C-CcceEEEeeccCccHHHHHHHHhhhh
Confidence 7 89999999999999999999988874
No 2
>KOG1219|consensus
Probab=100.00 E-value=5.6e-70 Score=612.52 Aligned_cols=566 Identities=24% Similarity=0.315 Sum_probs=487.7
Q ss_pred eeEEEEEEEEeccCCCCcCCCCCceEEecCCCCCCcEEEEEEEecCCCCCCceEEEEEecCCCCCcceEEEcCCCCCCCC
Q psy1887 7 AGFEVVVVVDDVQDTPPIFINIQPVIQLAPNLTMNDVLTKITAIDGDKGHPRTIKYGLVSEGHPMTVFFTINELTGPKMS 86 (653)
Q Consensus 7 ~~~~v~I~V~DvNDn~P~F~~~~~~~~V~E~~~~g~~i~~v~A~D~D~g~n~~i~ysl~~~~~~~~~~F~Id~~tG~~~~ 86 (653)
+...|.|.|+|+|||+|.|....|.+.++||.|+|+.+.-..|+|+|.|.||.|+|++.....- .|.|+..+|.+..
T Consensus 399 as~kvlidvld~n~n~pif~r~~~~ve~penvpig~~vl~~satDpdegengyvtysia~~~~l---PFaI~~~~Gilsv 475 (4289)
T KOG1219|consen 399 ASTKVLIDVLDVNDNSPIFPRDVYRVEIPENVPIGTRVLISSATDPDEGENGYVTYSIADDTML---PFAIDQSDGILSV 475 (4289)
T ss_pred cceEEEEEEeccCCCCCcceeeeeeeecCCCCCcceEEEEEeccCCCcCcCceEEEEecCCccC---ceEeccccceEEe
Confidence 4567999999999999999999999999999999999999999999999999999999986654 7999999987200
Q ss_pred cc----------E----------------EEEEEEEee------------------------------------------
Q psy1887 87 PQ----------V----------------FLLVLMCYS------------------------------------------ 98 (653)
Q Consensus 87 ~~----------~----------------~~vvi~v~d------------------------------------------ 98 (653)
+. . ..+.|.++|
T Consensus 476 S~kldrel~rvYtfRv~Asd~G~per~~e~~~~I~ildlNDn~P~F~~~n~t~t~~~~~~vg~~l~tvsAtD~De~ellk 555 (4289)
T KOG1219|consen 476 SGKLDRELRRVYTFRVRASDWGVPERESEVHLNILILDLNDNPPNFEIRNCTGTINGDPKVGTKLFTVSATDLDELELLK 555 (4289)
T ss_pred ccccCccccceEEEEEEEeccCCcchhceeeEEEEEeccCCCCCcceeeecccccccCCCCCcEEEEeeccccCccccee
Confidence 00 0 001122222
Q ss_pred ----------------------------------------------------------------------------ecCC
Q psy1887 99 ----------------------------------------------------------------------------VVAN 102 (653)
Q Consensus 99 ----------------------------------------------------------------------------~Nd~ 102 (653)
.|++
T Consensus 556 y~i~~~nel~feln~nSgeisLvr~n~t~~~~s~~slv~a~d~G~p~as~t~lni~~~k~~Tgv~~~~~p~Ilq~~e~~~ 635 (4289)
T KOG1219|consen 556 YRILPGNELSFELNSNSGEISLVRQNNTECLQSCESLVIAADDGVPPASPTLLNITVMKYGTGVGNEHEPNILQRFENKH 635 (4289)
T ss_pred EEEEeCCcCceeeccCCCeEEEEEccccccccccceEEEehhcCCCcCCceeeEEEEEecccccccccChhHhhhhcccc
Confidence 2445
Q ss_pred CceeccccCeEEEEeecCCCCceEEEEEEEecCCCC--cceEEEe-cCCCceEEEEeeccCCCCCCeeEEEEEEcccc--
Q psy1887 103 YPVFDISTQMRSLLIPASVKLGTIIYRLRASDSDKD--YPLTFDA-TDFGSYVVKIKSLPCSKNSSFCEANVYLDRVL-- 177 (653)
Q Consensus 103 ~P~f~~~~~~~~~~v~E~~~~gt~i~~v~a~D~D~~--~~~~y~i-~~~~~~~f~i~~~~~~~~~~~~~g~i~l~~~l-- 177 (653)
-|.|... +...+.++|+.|+|++++.++|+|+|.| ..+.|-| +.+...-|.|+.. +|.|.+..+|
T Consensus 636 fPqf~s~-fP~iI~v~Edvpigt~la~L~atD~Dtgfng~l~yvI~dgne~~~~~Id~q---------sg~itvas~ld~ 705 (4289)
T KOG1219|consen 636 FPQFPSD-FPFIIVVPEDVPIGTTLAILSATDSDTGFNGKLVYVIEDGNESICFLIDRQ---------SGNITVASPLDN 705 (4289)
T ss_pred Ccccccc-CCceEEccccCCCCceEEEEeccCCCCCcCceEEEEEeCCccceEEEEecc---------cceEEEecchhh
Confidence 5555442 2237899999999999999999999997 5899999 4456667778764 7899999999
Q ss_pred CCCceEEEEEEEEECCCCC--eeeeEEEEEEecCCCCCCCCCCCCeEEEEeCCC--------------------------
Q psy1887 178 VPGQVFQFRIIVKDTRGDT--TTVPTSLTATDAALNINAQFPHIPGVIVVPEVL-------------------------- 229 (653)
Q Consensus 178 e~~~~~~~~v~v~d~~g~~--~~~~~~i~v~~~~~~~~p~~~~~~~~~~i~e~~-------------------------- 229 (653)
+....|.+.++|.|.+-+- ......+.+.+.| ++.|.|.+..+.+.|.|++
T Consensus 706 ~~t~~yiLnvta~D~gtPqkss~r~l~v~vkd~n-dn~p~f~e~sy~vtvsedtepgs~Ia~vetnd~D~g~NG~v~fsL 784 (4289)
T KOG1219|consen 706 ENTEQYILNVTAYDLGTPQKSSWRLLLVFVKDYN-DNTPIFVERSYHVTVSEDTEPGSFIAHVETNDTDGGNNGMVSFSL 784 (4289)
T ss_pred hhhheeEEEEEEecCCCchhhceeeEEEEEEecc-cCCccccccceEEEEecCCCCCceEEEEEecccCCCCCceEEEEe
Confidence 4567899999999876543 3444567777777 8899998888877777776
Q ss_pred -------------cEEEEeecCC-CCcceEEEEE----------------------------------------------
Q psy1887 230 -------------GTYLLWRHLS-GPKNMYLMRE---------------------------------------------- 249 (653)
Q Consensus 230 -------------G~i~~~~~lD-e~~~~y~l~~---------------------------------------------- 249 (653)
|.+.+.++|| |.+..|.|.+
T Consensus 785 ~n~sdvfsIdp~tGivv~~~sLdrE~q~~y~l~I~a~dqp~pq~~svv~l~vsvedVndnpPkci~~hsr~kipedlp~g 864 (4289)
T KOG1219|consen 785 LNKSDVFSIDPFTGIVVTSKSLDREGQTSYHLKIEARDQPPPQLFSVVELDVSVEDVNDNPPKCIIRHSRSKIPEDLPYG 864 (4289)
T ss_pred cCCcceEEecCcccEEEeccccCcccCceeEEEEEEcCCCCCceEEEEEEEEEEeeccCCCCccccccccccCcccCCCc
Confidence 8888889999 9999998877
Q ss_pred --------------------------------------EEeeCCccccceecCCCcEEEEEEEEEecccCCCCCCCeEEE
Q psy1887 250 --------------------------------------IRLSKPYRELHTVASGQPVILMVLAEEERKDLNEPSPQSSTA 291 (653)
Q Consensus 250 --------------------------------------~~~~~~ld~~~~~~~~~~~~l~V~A~d~~~d~~~~~~~s~~~ 291 (653)
+++-+|||++- .+-|.|.|.|.| +|.|.+++.|
T Consensus 865 t~~~~l~A~d~diGq~~kvry~l~~~~v~~rvd~~sGavfi~~~LDf~k----~~fynLsv~a~d-----~g~p~lss~c 935 (4289)
T KOG1219|consen 865 TVTWQLVALDPDIGQLGKVRYYLTDDTVGERVDFPSGAVFIGKPLDFEK----SDFYNLSVTAVD-----RGTPILSSIC 935 (4289)
T ss_pred eEEEEhhhcCcccCcCceeEEEEecCccccccccccccEEEeccccccc----ccceEEEEEEec-----CCCcceeeeE
Confidence 77778888874 667889999965 4466899999
Q ss_pred EEEEEEEccCCC--CCeecCCceEEEEeCCCCCCcEEEeeeceeeEEeeCCCCcceEEEEEE--ecCCCcEEEcccceee
Q psy1887 292 TIALVFDQAINT--PPYFDTVQYITHLDENSPQGTALIFAESFHTQVTDDNMGKNGIFSLTL--ENNNGTFEIWPSVVER 367 (653)
Q Consensus 292 ~v~i~V~dvNd~--~P~f~~~~~~~~v~En~~~gt~v~~v~~~~~~a~D~D~g~n~~~~~~~--~~~~~~F~I~~~tg~~ 367 (653)
.+.|.+.|+|.| ||.|..-.-.++|.||+|.|+.++. +.|.|.|.|..+.++|.+ +++.+.|+|+..+|.
T Consensus 936 hl~Vevldv~enlhpp~F~~~v~e~~V~EnapiGT~vi~-----i~A~dedsgldg~l~Y~I~~gdg~g~FsId~~tG~- 1009 (4289)
T KOG1219|consen 936 HLEVEVLDVNENLHPPEFISFVTEGHVLENAPIGTIVIR-----IQARDEDSGLDGELSYKIRTGDGDGIFSIDSTTGS- 1009 (4289)
T ss_pred EEEEEEeccCCCCCCcchheeeeeeeEeecCCcceEEEE-----EEEecCCCCccceEEEEEEcCCcceeEEecCCcce-
Confidence 999999999885 9999998888999999999999998 599999999888877777 566679999999997
Q ss_pred eeeEEEEEecCCCCCccCCCeEEEEEEEEECCCCCCCCCceEEEEEEEEEeecCCCCCeecccceEEEeeCCCCCCcEEE
Q psy1887 368 KAQFTIRVRNNKNLDYERTRTLSFVIVAKEISSDSSSNLLSSQAPVLVYINDVNDNPPVFTATLYTAKIPENATAGEKVV 447 (653)
Q Consensus 368 ~~~~~i~l~~~~~LD~E~~~~~~l~V~a~D~~~~~~~~~~s~~~~v~I~V~DvNDn~P~f~~~~y~~~V~E~~~~gt~v~ 447 (653)
|++.++||||.+..|.|+|.|+|.| .+++++.+.+.|.|+|+|||+|+|.++.|..+|.|+++.+..|.
T Consensus 1010 -------irTl~~lDrE~ks~YwltveA~D~g----t~~~ssv~~vyI~ieDvNDn~Pq~s~pvy~asI~enSp~~vsiv 1078 (4289)
T KOG1219|consen 1010 -------IRTLKALDREKKSSYWLTVEAKDLG----TVPLSSVCEVYIEIEDVNDNVPQFSSPVYYASISENSPETVSIV 1078 (4289)
T ss_pred -------EeechhhchhhcceEEEEEEEEecC----CCccccceeEEEEEEecCCCCcccCCceEeeeeccCCCCceEEE
Confidence 9999999999999999999999999 77899999999999999999999999999999999999999999
Q ss_pred EEEEeeCCCCCCceEEEEEEcCCCCcceEEeCCCCeEEEEcCccCCccCccEEEEEEEEEECCCCCceEEEEEEEEEEec
Q psy1887 448 QVKATDVDTNLGGEILYTAILGYKNSSLELDAHTGDITIANGQQFDREEASEYKFQVEARDMQGLGLRTVVPLQLTILDV 527 (653)
Q Consensus 448 ~v~a~D~D~g~n~~i~y~i~~~~~~~~F~Id~~tG~i~~~~~~~lD~E~~~~~~l~V~a~D~g~~~~~~~~~v~I~V~Dv 527 (653)
++.|+|+|...|+++.|.|.+|++.+.|.|++.||.|++. +.||||.+..|.|.|+++|.|.|.+++.+.|.|.|+|+
T Consensus 1079 q~ea~D~Dsssn~kLmykI~sGnyq~FF~Id~~TG~iTt~--r~LDRE~qdEHiLeVTi~D~gep~l~s~~rviV~Ildv 1156 (4289)
T KOG1219|consen 1079 QAEANDPDSSSNQKLMYKITSGNYQGFFQIDPETGLITTI--RRLDREKQDEHILEVTIQDNGEPWLCSNQRVIVSILDV 1156 (4289)
T ss_pred EeccCCCCcccCcceEEEEccCCccceEEEccccceeeee--hhhcccccccceEEEEEecCCCCccccceEEEEEEeec
Confidence 9999999988999999999999999999999999999977 88999999999999999999999999999999999999
Q ss_pred CCCCCeEeecCcEEEEecCCCCCeEEEEEEEEeCCCCCCCCeEEEEEEeCCCCCcEEEEcCC------------------
Q psy1887 528 NDNAPIFVQTPFEFVLASDSRNFSERTFIKATDQDAEAPNNIVRYEIISGNYDNKFSLHPET------------------ 589 (653)
Q Consensus 528 NDn~P~f~~~~y~~~v~e~~~~g~~v~~v~A~D~D~~~~n~~i~Y~i~~~~~~~~F~Id~~t------------------ 589 (653)
|||+|.|.+..|...++|...+ .+.++.|+|.|+| .|++|+|+|..|+.. |+||..|
T Consensus 1157 Ndnsp~Flqk~~~~~v~~r~s~--plyRl~a~d~DeG-~narityniedgde~--FsID~~t~vVsss~~~~~~eydi~~ 1231 (4289)
T KOG1219|consen 1157 NDNSPRFLQKKTFLRVPERSSP--PLYRLAAQDNDEG-NNARITYNIEDGDEV--FSIDIATGVVSSSTLDPAGEYDILG 1231 (4289)
T ss_pred cCCchhhhhheeEEEeeeccCC--ceeEEEEEecCCC-cceEEEEecccCceE--EEEeccCceEEeeeccCCcceeEee
Confidence 9999999999999999998765 7899999999997 899999999887754 9999988
Q ss_pred ------CccceeeEEEEEEe----------CCCCCccceEEEecCC
Q psy1887 590 ------GVPHKWSTTQVRIY----------PPDSAVRNIKFLVPHS 619 (653)
Q Consensus 590 ------G~p~~~~~~~v~i~----------~~~~~~~~~~~~v~~~ 619 (653)
|.|.+++...|++. +..|...-+.|.+.+.
T Consensus 1232 Ikatd~g~pq~sa~trl~lt~~s~p~~ssep~~fee~f~~~~vse~ 1277 (4289)
T KOG1219|consen 1232 IKATDRGAPQASAGTRLHLTWISGPSESSEPVNFEEEFVDFTVSED 1277 (4289)
T ss_pred EEEecCCCCcccceeEEEEEecCCCCcCCCcccccceeEEEEEecC
Confidence 77878888888884 3345566666666665
No 3
>KOG4289|consensus
Probab=100.00 E-value=9.1e-71 Score=602.49 Aligned_cols=544 Identities=22% Similarity=0.312 Sum_probs=459.1
Q ss_pred CCCcCCCCCceEEecCCCCCCcEEEEEEEecCCCCCCceEEEEEecCCCC-CcceEEEcCCCCCCCCc------------
Q psy1887 21 TPPIFINIQPVIQLAPNLTMNDVLTKITAIDGDKGHPRTIKYGLVSEGHP-MTVFFTINELTGPKMSP------------ 87 (653)
Q Consensus 21 n~P~F~~~~~~~~V~E~~~~g~~i~~v~A~D~D~g~n~~i~ysl~~~~~~-~~~~F~Id~~tG~~~~~------------ 87 (653)
|+|+|.+..|...++||.|+|+.|..++|.|+|+ +.+.|++....++ ..++|+||+.+|.++..
T Consensus 160 ~~~~Fqq~~Yq~~lpEn~pagT~iasv~A~~~~a---~rl~Ysm~al~dsRS~~lFslD~~sG~irta~~lDREt~e~Hv 236 (2531)
T KOG4289|consen 160 NAVQFQQPNYQKELPENEPAGTIIASVKASDPDA---GRLYYSMVALFDSRSQNLFSLDPMSGAIRTAKSLDRETKETHV 236 (2531)
T ss_pred CCccCCCcchhccCcCCCCCCceeEEEEecCCCc---CceEEEeeeccchhccccEeeccccccchhhhhhhhhhhheeE
Confidence 6899999999999999999999999999999995 5699999865332 23499999999996332
Q ss_pred --------------cEEEEEEEEeeecCCCceeccccCeEEEEeecCCCCceEEEEEEEecCCCC--cceEEEe-cCCCc
Q psy1887 88 --------------QVFLLVLMCYSVVANYPVFDISTQMRSLLIPASVKLGTIIYRLRASDSDKD--YPLTFDA-TDFGS 150 (653)
Q Consensus 88 --------------~~~~vvi~v~d~Nd~~P~f~~~~~~~~~~v~E~~~~gt~i~~v~a~D~D~~--~~~~y~i-~~~~~ 150 (653)
....|+++|+|+|||.|+|.+.+| ...+.||.++|+.|.+++|+|.|.+ .++.|.+ .+++.
T Consensus 237 lrVtA~d~~~P~~SAtttv~V~V~D~nDhsPvFEq~~Y--~e~lREn~evGy~vLtvrAtD~Dsp~Nani~Yrl~eg~~~ 314 (2531)
T KOG4289|consen 237 LRVTAQDHGDPRRSATTTVTVLVLDTNDHSPVFEQDEY--REELRENLEVGYEVLTVRATDGDSPPNANIRYRLLEGNAK 314 (2531)
T ss_pred EEEEeeecCCCcccceeEEEEEEeecCCCCcccchhHH--HHHHhhccccCceEEEEEeccCCCCCCCceEEEecCCCcc
Confidence 234577889999999999999998 8999999999999999999999986 5899999 66788
Q ss_pred eEEEEeeccCCCCCCeeEEEEEEccccC--CCceEEEEEEEEECCCCC--eeeeEEEEEEecCCCCCCCCCCCCeEEEEe
Q psy1887 151 YVVKIKSLPCSKNSSFCEANVYLDRVLV--PGQVFQFRIIVKDTRGDT--TTVPTSLTATDAALNINAQFPHIPGVIVVP 226 (653)
Q Consensus 151 ~~f~i~~~~~~~~~~~~~g~i~l~~~le--~~~~~~~~v~v~d~~g~~--~~~~~~i~v~~~~~~~~p~~~~~~~~~~i~ 226 (653)
..|+|+.- +|.|....+++ ....|++.|.|.|.+... .+..+.|+|.+.| |+.|+|....|...|.
T Consensus 315 ~~f~in~r---------SGvI~T~a~lDRE~~~~y~L~VeAsDqG~~pgp~Ta~V~itV~D~N-DNaPqFse~~Yvvqv~ 384 (2531)
T KOG4289|consen 315 NVFEINPR---------SGVISTRAPLDREELESYQLDVEASDQGRPPGPRTAMVEITVEDEN-DNAPQFSEKRYVVQVR 384 (2531)
T ss_pred ceeEEcCc---------cceeeccCccCHHhhhheEEEEEeccCCCCCCCceEEEEEEEEecC-CCCccccccceEEEec
Confidence 99999863 78899999994 456899999999987544 3556677777777 8899999999999999
Q ss_pred CCCc--E-EEEeecCC---CCc--ceEEEEE---------------EEeeCCccccceecCCCcEEEEEEEEEecccCCC
Q psy1887 227 EVLG--T-YLLWRHLS---GPK--NMYLMRE---------------IRLSKPYRELHTVASGQPVILMVLAEEERKDLNE 283 (653)
Q Consensus 227 e~~G--~-i~~~~~lD---e~~--~~y~l~~---------------~~~~~~ld~~~~~~~~~~~~l~V~A~d~~~d~~~ 283 (653)
|+-+ . +...++-| +.+ -.|++.. +-+..|||+| ..+|.+.|+|.| |+
T Consensus 385 Edvt~~avvlrV~AtDrD~g~Ng~VHYsi~Sgn~~G~f~id~~tGel~vv~plD~e-----~~~ytl~IrAqD-----gg 454 (2531)
T KOG4289|consen 385 EDVTPPAVVLRVTATDRDKGTNGKVHYSIASGNGRGQFYIDSLTGELDVVEPLDFE-----NSEYTLRIRAQD-----GG 454 (2531)
T ss_pred ccCCCCceEEEEEecccCCCcCceEEEEeeccCccccEEEecccceEEEecccccc-----CCeeEEEEEccc-----CC
Confidence 9873 2 22234555 222 2466554 7788999988 459999999975 77
Q ss_pred CCCCeEEEEEEEEEEccCCCCCeecCCceEEEEeCCCCCCcEEEeeeceeeEEeeCCCCcceEEEEEEecCCCcEEEccc
Q psy1887 284 PSPQSSTATIALVFDQAINTPPYFDTVQYITHLDENSPQGTALIFAESFHTQVTDDNMGKNGIFSLTLENNNGTFEIWPS 363 (653)
Q Consensus 284 ~~~~s~~~~v~i~V~dvNd~~P~f~~~~~~~~v~En~~~gt~v~~v~~~~~~a~D~D~g~n~~~~~~~~~~~~~F~I~~~ 363 (653)
.|+++.+..++|.|+|+|||+|.|....+..+|.||.+.|..+.. +.|.|+|.|+|++..|++.+. +.|.|+..
T Consensus 455 rPpLsn~sgl~iqVlDINDhaPifvstpfq~tvlEnv~lg~~v~~-----vqaidadsg~na~l~y~laG~-~pf~I~~~ 528 (2531)
T KOG4289|consen 455 RPPLSNTSGLVIQVLDINDHAPIFVSTPFQATVLENVPLGYLVCH-----VQAIDADSGENARLHYSLAGV-GPFQINNG 528 (2531)
T ss_pred CCCccCCCceEEEEEecCCCCceeEechhhhhhhhcccccceEEE-----EecccCCCCcccceeeeeccC-CCeeEecC
Confidence 899999999999999999999999999999999999999999997 599999999999999997554 48999999
Q ss_pred ceeeeeeEEEEEecCCCCCccCCCeEEEEEEEEECCCCCCCCCceEEEEEEEEEeecCCCCCeecccceEEEeeCCCCCC
Q psy1887 364 VVERKAQFTIRVRNNKNLDYERTRTLSFVIVAKEISSDSSSNLLSSQAPVLVYINDVNDNPPVFTATLYTAKIPENATAG 443 (653)
Q Consensus 364 tg~~~~~~~i~l~~~~~LD~E~~~~~~l~V~a~D~~~~~~~~~~s~~~~v~I~V~DvNDn~P~f~~~~y~~~V~E~~~~g 443 (653)
+|. |++++.||||+.+.|.|.|.|+|.+ .|++++.+.|.|.++|+|||.|.|++..|+..+.|+++.|
T Consensus 529 SG~--------Itvtk~ldrEt~~~ysl~V~ard~g----tp~l~tstsI~Vtv~dvndndP~Ft~~eytl~inED~pvg 596 (2531)
T KOG4289|consen 529 SGW--------ITVTKELDRETVEHYSLGVEARDHG----TPPLSTSTSISVTVLDVNDNDPTFTQKEYTLRINEDAPVG 596 (2531)
T ss_pred Cce--------EEEeecccccccceEEEEEEEcCCC----CCcccccceEEEEecccCCCCCccccCceEEEecCCcccc
Confidence 997 8889999999999999999999999 8899999999999999999999997776655555555555
Q ss_pred cEEEEEEEeeCC--------------------------------------------------------------------
Q psy1887 444 EKVVQVKATDVD-------------------------------------------------------------------- 455 (653)
Q Consensus 444 t~v~~v~a~D~D-------------------------------------------------------------------- 455 (653)
+.|.+++|+|.|
T Consensus 597 sSI~tvtAvD~d~~s~ityqi~g~ntrn~Fsi~si~g~Glitlalp~dkKqe~~~vl~vtAtDg~l~d~~~V~v~I~dan 676 (2531)
T KOG4289|consen 597 SSIVTVTAVDRDANSVITYQITGGNTRNRFSISSIGGGGLITLALPLDKKQERQYVLAVTATDGTLQDTCSVNVNITDAN 676 (2531)
T ss_pred ceEEEEEEeccccccceEEEecCCcccccceeeccCCcceEEeecchhhcccceEEEEEEecCCccccceEEEEEeeecc
Confidence 555555555555
Q ss_pred ----------------------------------CCCCceEEEEEEcCCCCcceEEeCCCCeEEEEcCccCCccCccEEE
Q psy1887 456 ----------------------------------TNLGGEILYTAILGYKNSSLELDAHTGDITIANGQQFDREEASEYK 501 (653)
Q Consensus 456 ----------------------------------~g~n~~i~y~i~~~~~~~~F~Id~~tG~i~~~~~~~lD~E~~~~~~ 501 (653)
.|.|++|+| |+. ...|+||+.+|.+++. ..||||.+-.|.
T Consensus 677 ThrpvFqs~pfTvsI~e~rP~G~tvvtlsasd~D~geNARI~y-~le---d~~Frid~dsg~i~t~--~~ld~edqvtyt 750 (2531)
T KOG4289|consen 677 THRPVFQSSPFTVSINEDRPLGTTVVTLSASDEDTGENARITY-ILE---DEAFRIDPDSGAIYTQ--AELDYEDQVTYT 750 (2531)
T ss_pred cCCcccccCCeeEeeccCCcCCceeEEEecccCCCCccceEEE-Eec---ccceeecCCCCceEEe--eeeecccceeeE
Confidence 444444444 222 2259999999999998 789999999999
Q ss_pred EEEEEEECCCCCceEEEEEEEEEEecCCCCCeEeecCcEEEEecCCCCCeEEEEEEEEeCCCCCCCCeEEEEEEeC-CCC
Q psy1887 502 FQVEARDMQGLGLRTVVPLQLTILDVNDNAPIFVQTPFEFVLASDSRNFSERTFIKATDQDAEAPNNIVRYEIISG-NYD 580 (653)
Q Consensus 502 l~V~a~D~g~~~~~~~~~v~I~V~DvNDn~P~f~~~~y~~~v~e~~~~g~~v~~v~A~D~D~~~~n~~i~Y~i~~~-~~~ 580 (653)
+.++|+|.+.|+...+++|.|.|.|+|||+|+|..+.|.++|.|++++++.|++|.|+|+|.+ .|+.+.|.+..| +..
T Consensus 751 l~itA~D~~~pq~adtttveV~v~diNDnaPqf~assyt~sV~Ed~Pv~TsvlQVSatDaD~g-~Ng~v~y~~qg~~d~p 829 (2531)
T KOG4289|consen 751 LAITARDNGIPQKADTTTVEVLVNDINDNAPQFLASSYTGSVFEDAPVFTSVLQVSATDADSG-PNGRVYYTFQGGDDGP 829 (2531)
T ss_pred eeeeecCCCCCCcCccEEEEEEeecccccCcccchhhceeEeecCCCCcceEEEEEEeccCCC-CCceEEEEecCCCCCC
Confidence 999999999999999999999999999999999999999999999999999999999999997 799998888654 345
Q ss_pred CcEEEEcCC--------------------------CccceeeEEEEEEeCCCCCccceEE
Q psy1887 581 NKFSLHPET--------------------------GVPHKWSTTQVRIYPPDSAVRNIKF 614 (653)
Q Consensus 581 ~~F~Id~~t--------------------------G~p~~~~~~~v~i~~~~~~~~~~~~ 614 (653)
+.|.|++.+ |.|++++.+.|+|.+.+.|++++.|
T Consensus 830 ~~F~IEptSGviRtl~rLdRE~~avy~L~a~avDrg~p~ls~~~eItvtvldvNDnaPvf 889 (2531)
T KOG4289|consen 830 GDFYIEPTSGVIRTLRRLDRENVAVYVLAAYAVDRGNPPLSAPVEITVTVLDVNDNAPVF 889 (2531)
T ss_pred CceEEccCcceeehhhhhcchheeEEEEEEEEeeCCCCCcCCceEEEEEEEecCCCCCCC
Confidence 789999998 7788999988888765555555444
No 4
>KOG1219|consensus
Probab=100.00 E-value=8.6e-69 Score=603.01 Aligned_cols=564 Identities=25% Similarity=0.336 Sum_probs=491.1
Q ss_pred eeeEEEEEEEEeccCCCCcCCCCCceEEecCCCCCCcEEEEEEEecCCCCCCceEEEEEecCCCCCcceEEEcCCCCCC-
Q psy1887 6 IAGFEVVVVVDDVQDTPPIFINIQPVIQLAPNLTMNDVLTKITAIDGDKGHPRTIKYGLVSEGHPMTVFFTINELTGPK- 84 (653)
Q Consensus 6 ~~~~~v~I~V~DvNDn~P~F~~~~~~~~V~E~~~~g~~i~~v~A~D~D~g~n~~i~ysl~~~~~~~~~~F~Id~~tG~~- 84 (653)
.++..|+++|+|.||-+|.|....|.++++|+.++-+.|.+|.|+|+|-|.|+.+-|++..... .|.|.|.+|++
T Consensus 123 Ea~trv~v~vlD~NDl~PlFsp~sY~v~i~ed~~~~s~i~rV~AtDADiG~N~efYysf~~Rs~----mFaihPtsGvv~ 198 (4289)
T KOG1219|consen 123 EATTRVHVRVLDRNDLSPLFSPQSYEVEIDEDLEPFSTILRVEATDADIGINSEFYYSFVNRSH----MFAIHPTSGVVR 198 (4289)
T ss_pred eeeeEEEEEEeccCCCcccccCCceEEecCCCCCcccceEEEEeccccccccceEEEEeccccc----cEEeccccceEE
Confidence 3678899999999999999999999999999999999999999999999999999999987542 59999999980
Q ss_pred --------------------------------------------------------------------------------
Q psy1887 85 -------------------------------------------------------------------------------- 84 (653)
Q Consensus 85 -------------------------------------------------------------------------------- 84 (653)
T Consensus 199 ~L~~~~~gkyel~vla~DR~~kly~~~ane~~P~itavvl~p~e~~~~p~ya~V~vd~~~~ga~~~s~~iv~gd~~~~f~ 278 (4289)
T KOG1219|consen 199 SLRHVKPGKYELKVLAEDRASKLYYFDANEVQPSITAVVLIPRETKPKPRYALVDVDKINPGANRQSAAIVTGDDSPNFA 278 (4289)
T ss_pred EeeeccccceEEEEeehhhhhhhcccccccCCCceEEEEEecccCCCCCeEEEEEeeccCCCccceeEEEEecCCCccee
Confidence
Q ss_pred -------------------------------------------------------------------------CCcc---
Q psy1887 85 -------------------------------------------------------------------------MSPQ--- 88 (653)
Q Consensus 85 -------------------------------------------------------------------------~~~~--- 88 (653)
..+|
T Consensus 279 ~v~s~~~skE~~~~~~~di~w~~~t~~~~~sL~akng~qf~s~kn~~vkfek~~~r~~~Sefa~~ntpVv~v~atpyv~k 358 (4289)
T KOG1219|consen 279 IVGSKGNSKEHWFEVEPDIVWNDMTIGINLSLQAKNGPQFFSLKNFTVKFEKEVYRFSVSEFAPPNTPVVMVEATPYVYK 358 (4289)
T ss_pred eecccCCCcceEEEecccccccccceeEEEEEEecCCCeeeeccccceEEEeeEEEEEecccCCCCCcEEEEecceeEee
Confidence 0000
Q ss_pred -----------------------------------------EEEEEEEEeeecCCCceeccccCeEEEEeecCCCCceEE
Q psy1887 89 -----------------------------------------VFLLVLMCYSVVANYPVFDISTQMRSLLIPASVKLGTII 127 (653)
Q Consensus 89 -----------------------------------------~~~vvi~v~d~Nd~~P~f~~~~~~~~~~v~E~~~~gt~i 127 (653)
...++|-|+|+|+|+|.|....| .+.++||.|+|+.+
T Consensus 359 ~s~gn~kfkln~~t~lis~~epldr~~~ah~~l~i~t~~~as~kvlidvld~n~n~pif~r~~~--~ve~penvpig~~v 436 (4289)
T KOG1219|consen 359 LSRGNSKFKLNEQTGLISVSEPLDRESEAHIDLLIITSPPASTKVLIDVLDVNDNSPIFPRDVY--RVEIPENVPIGTRV 436 (4289)
T ss_pred ccCcccceeeeeeeeeEEecchhhhhhhhceeeEEecCCCcceEEEEEEeccCCCCCcceeeee--eeecCCCCCcceEE
Confidence 11133557899999999999987 99999999999999
Q ss_pred EEEEEecCCCC--cceEEEecCCCceEEEEeeccCCCCCCeeEEEEEEccccC--CCceEEEEEEEEECCCCCeeee--E
Q psy1887 128 YRLRASDSDKD--YPLTFDATDFGSYVVKIKSLPCSKNSSFCEANVYLDRVLV--PGQVFQFRIIVKDTRGDTTTVP--T 201 (653)
Q Consensus 128 ~~v~a~D~D~~--~~~~y~i~~~~~~~f~i~~~~~~~~~~~~~g~i~l~~~le--~~~~~~~~v~v~d~~g~~~~~~--~ 201 (653)
+.+.|+|+|.| ..++|+|.+.....|.|+.. .|.+.+..+++ ..+.|.|+|+|.|.+-+..... +
T Consensus 437 l~~satDpdegengyvtysia~~~~lPFaI~~~---------~GilsvS~kldrel~rvYtfRv~Asd~G~per~~e~~~ 507 (4289)
T KOG1219|consen 437 LISSATDPDEGENGYVTYSIADDTMLPFAIDQS---------DGILSVSGKLDRELRRVYTFRVRASDWGVPERESEVHL 507 (4289)
T ss_pred EEEeccCCCcCcCceEEEEecCCccCceEeccc---------cceEEeccccCccccceEEEEEEEeccCCcchhceeeE
Confidence 99999999986 57899998888899999864 67788888884 3578999999999886654444 4
Q ss_pred EEEEEecCCCCCCCCCCC--------------------------------------CeEEEEeCCCcEEEEeecCC--C-
Q psy1887 202 SLTATDAALNINAQFPHI--------------------------------------PGVIVVPEVLGTYLLWRHLS--G- 240 (653)
Q Consensus 202 ~i~v~~~~~~~~p~~~~~--------------------------------------~~~~~i~e~~G~i~~~~~lD--e- 240 (653)
.|.+.+.| |++|.|... ...|.++.++|++.|.+ .+ +
T Consensus 508 ~I~ildlN-Dn~P~F~~~n~t~t~~~~~~vg~~l~tvsAtD~De~ellky~i~~~nel~feln~nSgeisLvr-~n~t~~ 585 (4289)
T KOG1219|consen 508 NILILDLN-DNPPNFEIRNCTGTINGDPKVGTKLFTVSATDLDELELLKYRILPGNELSFELNSNSGEISLVR-QNNTEC 585 (4289)
T ss_pred EEEEeccC-CCCCcceeeecccccccCCCCCcEEEEeeccccCcccceeEEEEeCCcCceeeccCCCeEEEEE-cccccc
Confidence 55555655 777877321 12356677778888775 33 2
Q ss_pred CcceEEEEE-----------------------------------------------------------------------
Q psy1887 241 PKNMYLMRE----------------------------------------------------------------------- 249 (653)
Q Consensus 241 ~~~~y~l~~----------------------------------------------------------------------- 249 (653)
.+..+.+.+
T Consensus 586 ~~s~~slv~a~d~G~p~as~t~lni~~~k~~Tgv~~~~~p~Ilq~~e~~~fPqf~s~fP~iI~v~Edvpigt~la~L~at 665 (4289)
T KOG1219|consen 586 LQSCESLVIAADDGVPPASPTLLNITVMKYGTGVGNEHEPNILQRFENKHFPQFPSDFPFIIVVPEDVPIGTTLAILSAT 665 (4289)
T ss_pred ccccceEEEehhcCCCcCCceeeEEEEEecccccccccChhHhhhhccccCccccccCCceEEccccCCCCceEEEEecc
Confidence 223334433
Q ss_pred -------------------------------EEeeCCccccceecCCCcEEEEEEEEEecccCCCCCCCeEEEEEEEEEE
Q psy1887 250 -------------------------------IRLSKPYRELHTVASGQPVILMVLAEEERKDLNEPSPQSSTATIALVFD 298 (653)
Q Consensus 250 -------------------------------~~~~~~ld~~~~~~~~~~~~l~V~A~d~~~d~~~~~~~s~~~~v~i~V~ 298 (653)
+.+..||+++. ...|.|.|+|.| +|.|..++...+.|.|.
T Consensus 666 D~Dtgfng~l~yvI~dgne~~~~~Id~qsg~itvas~ld~~~----t~~yiLnvta~D-----~gtPqkss~r~l~v~vk 736 (4289)
T KOG1219|consen 666 DSDTGFNGKLVYVIEDGNESICFLIDRQSGNITVASPLDNEN----TEQYILNVTAYD-----LGTPQKSSWRLLLVFVK 736 (4289)
T ss_pred CCCCCcCceEEEEEeCCccceEEEEecccceEEEecchhhhh----hheeEEEEEEec-----CCCchhhceeeEEEEEE
Confidence 66777888774 678899999986 56889999999999999
Q ss_pred ccCCCCCeecCCceEEEEeCCCCCCcEEEeeeceeeEEeeCCCCcceEEEEEEecCCCcEEEcccceeeeeeEEEEEecC
Q psy1887 299 QAINTPPYFDTVQYITHLDENSPQGTALIFAESFHTQVTDDNMGKNGIFSLTLENNNGTFEIWPSVVERKAQFTIRVRNN 378 (653)
Q Consensus 299 dvNd~~P~f~~~~~~~~v~En~~~gt~v~~v~~~~~~a~D~D~g~n~~~~~~~~~~~~~F~I~~~tg~~~~~~~i~l~~~ 378 (653)
|.|||+|.|.+..|.++|.|+..+|+.|+.| .+.|.|.|.||+++|++.+..+.|+|++.+|. |.+.
T Consensus 737 d~ndn~p~f~e~sy~vtvsedtepgs~Ia~v-----etnd~D~g~NG~v~fsL~n~sdvfsIdp~tGi--------vv~~ 803 (4289)
T KOG1219|consen 737 DYNDNTPIFVERSYHVTVSEDTEPGSFIAHV-----ETNDTDGGNNGMVSFSLLNKSDVFSIDPFTGI--------VVTS 803 (4289)
T ss_pred ecccCCccccccceEEEEecCCCCCceEEEE-----EecccCCCCCceEEEEecCCcceEEecCcccE--------EEec
Confidence 9999999999999999999999999999985 99999999999999999999999999999997 7889
Q ss_pred CCCCccCCCeEEEEEEEEECCCCCCCCCceEEEEEEEEEeecCCCCCeecccc---------------------------
Q psy1887 379 KNLDYERTRTLSFVIVAKEISSDSSSNLLSSQAPVLVYINDVNDNPPVFTATL--------------------------- 431 (653)
Q Consensus 379 ~~LD~E~~~~~~l~V~a~D~~~~~~~~~~s~~~~v~I~V~DvNDn~P~f~~~~--------------------------- 431 (653)
++||||.+..|+|.|.|+|.+ .|.+.+.+.+.|.|.|||||+|.|....
T Consensus 804 ~sLdrE~q~~y~l~I~a~dqp----~pq~~svv~l~vsvedVndnpPkci~~hsr~kipedlp~gt~~~~l~A~d~diGq 879 (4289)
T KOG1219|consen 804 KSLDREGQTSYHLKIEARDQP----PPQLFSVVELDVSVEDVNDNPPKCIIRHSRSKIPEDLPYGTVTWQLVALDPDIGQ 879 (4289)
T ss_pred cccCcccCceeEEEEEEcCCC----CCceEEEEEEEEEEeeccCCCCccccccccccCcccCCCceEEEEhhhcCcccCc
Confidence 999999999999999999988 5778899999999999999999982211
Q ss_pred ------------------------------------------------------------------------------eE
Q psy1887 432 ------------------------------------------------------------------------------YT 433 (653)
Q Consensus 432 ------------------------------------------------------------------------------y~ 433 (653)
-+
T Consensus 880 ~~kvry~l~~~~v~~rvd~~sGavfi~~~LDf~k~~fynLsv~a~d~g~p~lss~chl~Vevldv~enlhpp~F~~~v~e 959 (4289)
T KOG1219|consen 880 LGKVRYYLTDDTVGERVDFPSGAVFIGKPLDFEKSDFYNLSVTAVDRGTPILSSICHLEVEVLDVNENLHPPEFISFVTE 959 (4289)
T ss_pred CceeEEEEecCccccccccccccEEEecccccccccceEEEEEEecCCCcceeeeEEEEEEEeccCCCCCCcchheeeee
Confidence 16
Q ss_pred EEeeCCCCCCcEEEEEEEeeCCCCCCceEEEEEEcCCCCcceEEeCCCCeEEEEcCccCCccCccEEEEEEEEEECCCCC
Q psy1887 434 AKIPENATAGEKVVQVKATDVDTNLGGEILYTAILGYKNSSLELDAHTGDITIANGQQFDREEASEYKFQVEARDMQGLG 513 (653)
Q Consensus 434 ~~V~E~~~~gt~v~~v~a~D~D~g~n~~i~y~i~~~~~~~~F~Id~~tG~i~~~~~~~lD~E~~~~~~l~V~a~D~g~~~ 513 (653)
++|.||++.|+.+++|.|.|.|+|..+.++|+|..|+..+.|+||..+|.|++. +.||||..+.|.|+|.|+|.|.++
T Consensus 960 ~~V~EnapiGT~vi~i~A~dedsgldg~l~Y~I~~gdg~g~FsId~~tG~irTl--~~lDrE~ks~YwltveA~D~gt~~ 1037 (4289)
T KOG1219|consen 960 GHVLENAPIGTIVIRIQARDEDSGLDGELSYKIRTGDGDGIFSIDSTTGSIRTL--KALDREKKSSYWLTVEAKDLGTVP 1037 (4289)
T ss_pred eeEeecCCcceEEEEEEEecCCCCccceEEEEEEcCCcceeEEecCCcceEeec--hhhchhhcceEEEEEEEEecCCCc
Confidence 689999999999999999999999999999999999888999999999999998 899999999999999999999999
Q ss_pred ceEEEEEEEEEEecCCCCCeEeecCcEEEEecCCCCCeEEEEEEEEeCCCCCCCCeEEEEEEeCCCCCcEEEEcCC----
Q psy1887 514 LRTVVPLQLTILDVNDNAPIFVQTPFEFVLASDSRNFSERTFIKATDQDAEAPNNIVRYEIISGNYDNKFSLHPET---- 589 (653)
Q Consensus 514 ~~~~~~v~I~V~DvNDn~P~f~~~~y~~~v~e~~~~g~~v~~v~A~D~D~~~~n~~i~Y~i~~~~~~~~F~Id~~t---- 589 (653)
+++.+.+.|.|+|+|||+|+|.++.|..+|.|+++.+..|.++.|.|+|.. .|++++|.|.+||..+.|.||+.|
T Consensus 1038 ~ssv~~vyI~ieDvNDn~Pq~s~pvy~asI~enSp~~vsivq~ea~D~Dss-sn~kLmykI~sGnyq~FF~Id~~TG~iT 1116 (4289)
T KOG1219|consen 1038 LSSVCEVYIEIEDVNDNVPQFSSPVYYASISENSPETVSIVQAEANDPDSS-SNQKLMYKITSGNYQGFFQIDPETGLIT 1116 (4289)
T ss_pred cccceeEEEEEEecCCCCcccCCceEeeeeccCCCCceEEEEeccCCCCcc-cCcceEEEEccCCccceEEEccccceee
Confidence 999999999999999999999999999999999999999999999999975 699999999999999999999998
Q ss_pred ----------------------CccceeeEEEEEEeCCCCCccceEEE
Q psy1887 590 ----------------------GVPHKWSTTQVRIYPPDSAVRNIKFL 615 (653)
Q Consensus 590 ----------------------G~p~~~~~~~v~i~~~~~~~~~~~~~ 615 (653)
|+|.+++...|.|.+++.|++.+.|.
T Consensus 1117 t~r~LDRE~qdEHiLeVTi~D~gep~l~s~~rviV~IldvNdnsp~Fl 1164 (4289)
T KOG1219|consen 1117 TIRRLDREKQDEHILEVTIQDNGEPWLCSNQRVIVSILDVNDNSPRFL 1164 (4289)
T ss_pred eehhhcccccccceEEEEEecCCCCccccceEEEEEEeeccCCchhhh
Confidence 77888999999998887777776664
No 5
>cd00031 CA Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here
Probab=99.97 E-value=2.1e-28 Score=238.84 Aligned_cols=197 Identities=37% Similarity=0.541 Sum_probs=180.1
Q ss_pred ceEEEEeCCCCCCcEEEeeeceeeEEeeCCCCcceEEEEEEecCC--CcEEEcccceeeeeeEEEEEecCCCCCccCCCe
Q psy1887 311 QYITHLDENSPQGTALIFAESFHTQVTDDNMGKNGIFSLTLENNN--GTFEIWPSVVERKAQFTIRVRNNKNLDYERTRT 388 (653)
Q Consensus 311 ~~~~~v~En~~~gt~v~~v~~~~~~a~D~D~g~n~~~~~~~~~~~--~~F~I~~~tg~~~~~~~i~l~~~~~LD~E~~~~ 388 (653)
.|.++|+||++.|+.+++ +.|.|+|.+.+..+.|++.+.. ++|+|++.+|. |++.++||||..+.
T Consensus 1 ~~~~~i~En~~~g~~v~~-----~~a~D~D~~~~~~~~y~i~~~~~~~~F~i~~~tG~--------l~~~~~lD~e~~~~ 67 (199)
T cd00031 1 SYSVSVPENAPPGTVVGT-----VSATDPDSGENGRVTYSILGGNEDGLFSIDPNTGV--------ITTTKPLDREEQSE 67 (199)
T ss_pred CeEEEEeCCCCCCCEEEE-----EEEECCCCCCCceEEEEEeCCCCcccEEEeCCCCE--------EEECCCCCCcCCce
Confidence 378899999999999998 4899999998888888885544 59999999997 77889999999999
Q ss_pred EEEEEEEEECCCCCCCCCceEEEEEEEEEeecCCCCCeecccceEEEeeCCCCCCcEEEEEEEeeCCCCCCceEEEEEEc
Q psy1887 389 LSFVIVAKEISSDSSSNLLSSQAPVLVYINDVNDNPPVFTATLYTAKIPENATAGEKVVQVKATDVDTNLGGEILYTAIL 468 (653)
Q Consensus 389 ~~l~V~a~D~~~~~~~~~~s~~~~v~I~V~DvNDn~P~f~~~~y~~~V~E~~~~gt~v~~v~a~D~D~g~n~~i~y~i~~ 468 (653)
|.|.|+|+|.+ .+.+++.+.++|.|.|+|||+|.|....|.+.|.|+.++|+.++++.|+|+|.+.++.++|+|..
T Consensus 68 ~~l~v~a~D~g----~~~~~~~~~v~I~V~d~Nd~~P~~~~~~~~~~v~e~~~~~~~i~~~~a~D~D~~~~~~~~y~l~~ 143 (199)
T cd00031 68 YTLTVVASDGG----GPPLSSTATVTVTVLDVNDNPPVFEQSSYEASVPENAPPGTVVGTVTATDADSGENAKLTYSILS 143 (199)
T ss_pred EEEEEEEEECC----cCcceeEEEEEEEEccCCCCCCcccccceEEEEeCCCCCCCEEEEEEEEcCCCCCCccEEEEEeC
Confidence 99999999976 45567899999999999999999998999999999999999999999999999889999999988
Q ss_pred CCCCcceEEeCCCCeEEEEcCccCCccCccEEEEEEEEEECCCCCceEEEEEEEEEEe
Q psy1887 469 GYKNSSLELDAHTGDITIANGQQFDREEASEYKFQVEARDMQGLGLRTVVPLQLTILD 526 (653)
Q Consensus 469 ~~~~~~F~Id~~tG~i~~~~~~~lD~E~~~~~~l~V~a~D~g~~~~~~~~~v~I~V~D 526 (653)
+.....|.|++.+|.|++. +.||||....|.|.|.|+|.+.+++++++.++|.|.|
T Consensus 144 ~~~~~~f~i~~~~G~i~~~--~~ld~e~~~~~~l~v~a~D~~~~~~~~~~~i~i~v~d 199 (199)
T cd00031 144 GNDKELFSIDPNTGIITLA--KPLDREEKSSYELTVVATDGGGPPLSSTATVTVTVLD 199 (199)
T ss_pred CCCCCEEEEeCCceEEEeC--CccCCccCceEEEEEEEEECCCCCceeEEEEEEEEEC
Confidence 7665799999999999988 7899999999999999999988889999999999876
No 6
>cd00031 CA Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here
Probab=99.93 E-value=7.9e-24 Score=206.50 Aligned_cols=161 Identities=36% Similarity=0.554 Sum_probs=148.8
Q ss_pred ceEEEeeCCCCCCcEEEEEEEeeCCCCCCceEEEEEEcCCCCcceEEeCCCCeEEEEcCccCCccCccEEEEEEEEEECC
Q psy1887 431 LYTAKIPENATAGEKVVQVKATDVDTNLGGEILYTAILGYKNSSLELDAHTGDITIANGQQFDREEASEYKFQVEARDMQ 510 (653)
Q Consensus 431 ~y~~~V~E~~~~gt~v~~v~a~D~D~g~n~~i~y~i~~~~~~~~F~Id~~tG~i~~~~~~~lD~E~~~~~~l~V~a~D~g 510 (653)
.|.+.|+|+++.|+.++++.|.|+|.+.++.++|+|..+....+|.|++.+|.|++. +.||||....|.|.|+|+|.|
T Consensus 1 ~~~~~i~En~~~g~~v~~~~a~D~D~~~~~~~~y~i~~~~~~~~F~i~~~tG~l~~~--~~lD~e~~~~~~l~v~a~D~g 78 (199)
T cd00031 1 SYSVSVPENAPPGTVVGTVSATDPDSGENGRVTYSILGGNEDGLFSIDPNTGVITTT--KPLDREEQSEYTLTVVASDGG 78 (199)
T ss_pred CeEEEEeCCCCCCCEEEEEEEECCCCCCCceEEEEEeCCCCcccEEEeCCCCEEEEC--CCCCCcCCceEEEEEEEEECC
Confidence 378899999999999999999999998889999999988766799999999999998 789999999999999999988
Q ss_pred CCCceEEEEEEEEEEecCCCCCeEeecCcEEEEecCCCCCeEEEEEEEEeCCCCCCCCeEEEEEEeCCCCCcEEEEcCCC
Q psy1887 511 GLGLRTVVPLQLTILDVNDNAPIFVQTPFEFVLASDSRNFSERTFIKATDQDAEAPNNIVRYEIISGNYDNKFSLHPETG 590 (653)
Q Consensus 511 ~~~~~~~~~v~I~V~DvNDn~P~f~~~~y~~~v~e~~~~g~~v~~v~A~D~D~~~~n~~i~Y~i~~~~~~~~F~Id~~tG 590 (653)
.+.+++++.|+|.|.|+||++|.|....|.+.|.|+.++|+.++++.|+|+|.+ .++.++|+|..+...++|.|++.+|
T Consensus 79 ~~~~~~~~~v~I~V~d~Nd~~P~~~~~~~~~~v~e~~~~~~~i~~~~a~D~D~~-~~~~~~y~l~~~~~~~~f~i~~~~G 157 (199)
T cd00031 79 GPPLSSTATVTVTVLDVNDNPPVFEQSSYEASVPENAPPGTVVGTVTATDADSG-ENAKLTYSILSGNDKELFSIDPNTG 157 (199)
T ss_pred cCcceeEEEEEEEEccCCCCCCcccccceEEEEeCCCCCCCEEEEEEEEcCCCC-CCccEEEEEeCCCCCCEEEEeCCce
Confidence 888889999999999999999999988999999999999999999999999986 6899999998766568999999988
Q ss_pred ccce
Q psy1887 591 VPHK 594 (653)
Q Consensus 591 ~p~~ 594 (653)
...+
T Consensus 158 ~i~~ 161 (199)
T cd00031 158 IITL 161 (199)
T ss_pred EEEe
Confidence 6543
No 7
>PF00028 Cadherin: Cadherin domain; InterPro: IPR002126 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion.; GO: 0005509 calcium ion binding, 0007156 homophilic cell adhesion, 0016020 membrane; PDB: 2A4E_A 2A4C_B 2O72_A 2QVI_A 1NCJ_A 3Q2W_A 3Q2N_A 3LNH_B 3LNI_A 3Q2L_A ....
Probab=99.70 E-value=3.7e-16 Score=132.62 Aligned_cols=92 Identities=33% Similarity=0.550 Sum_probs=88.3
Q ss_pred eEEEeeCCCCCCcEEEEEEEeeCCCCCCceEEEEEEcCCCCcceEEeCCCCeEEEEcCccCCccCccEEEEEEEEEEC-C
Q psy1887 432 YTAKIPENATAGEKVVQVKATDVDTNLGGEILYTAILGYKNSSLELDAHTGDITIANGQQFDREEASEYKFQVEARDM-Q 510 (653)
Q Consensus 432 y~~~V~E~~~~gt~v~~v~a~D~D~g~n~~i~y~i~~~~~~~~F~Id~~tG~i~~~~~~~lD~E~~~~~~l~V~a~D~-g 510 (653)
|+++|+|++++|+.++++.|.|+|.+.|+.+.|+|..++...+|.|++.+|.|++. ++||||..+.|.|.|.|+|. +
T Consensus 1 Y~~~v~E~~~~g~~v~~v~a~D~D~~~n~~i~y~i~~~~~~~~F~I~~~tg~i~~~--~~LD~E~~~~y~l~v~a~D~~~ 78 (93)
T PF00028_consen 1 YSFSVPENAPPGTVVGQVTATDPDSGPNSQITYSILGGNPDGLFSIDPNTGEISLK--KPLDRETQSSYQLTVRATDSGG 78 (93)
T ss_dssp EEEEEETTGSTSSEEEEEEEEESSTSTTSSEEEEEEETTSTTSEEEETTTTEEEES--SSSCTTTTSEEEEEEEEEETTT
T ss_pred CEEEEECCCCCCCEEEEEEEEeCCCCCCceEEEEEecCcccCceEEeeeeeccccc--eecCcccCCEEEEEEEEEECCC
Confidence 78999999999999999999999999999999999999888899999999999998 88999999999999999999 8
Q ss_pred CCCceEEEEEEEEEE
Q psy1887 511 GLGLRTVVPLQLTIL 525 (653)
Q Consensus 511 ~~~~~~~~~v~I~V~ 525 (653)
.|++++++.|+|+|+
T Consensus 79 ~~~~~~~~~V~I~V~ 93 (93)
T PF00028_consen 79 SPPLSSTATVTINVL 93 (93)
T ss_dssp SSEEEEEEEEEEEEE
T ss_pred CCCCEEEEEEEEEEC
Confidence 899999999999985
No 8
>smart00112 CA Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
Probab=99.59 E-value=9.4e-15 Score=119.89 Aligned_cols=79 Identities=41% Similarity=0.678 Sum_probs=73.9
Q ss_pred eeCCCCCCceEEEEEEcCCCCcceEEeCCCCeEEEEcCccCCccCccEEEEEEEEEECCCCCceEEEEEEEEEEecCCCC
Q psy1887 452 TDVDTNLGGEILYTAILGYKNSSLELDAHTGDITIANGQQFDREEASEYKFQVEARDMQGLGLRTVVPLQLTILDVNDNA 531 (653)
Q Consensus 452 ~D~D~g~n~~i~y~i~~~~~~~~F~Id~~tG~i~~~~~~~lD~E~~~~~~l~V~a~D~g~~~~~~~~~v~I~V~DvNDn~ 531 (653)
+|+|.|.|+.++|+|.+++...+|.|++.+|.|++. ++||||....|.|.|+|+|.|.|++++++.|+|+|.|+|||+
T Consensus 1 ~D~D~g~n~~i~Y~i~~~~~~~~F~i~~~tg~i~~~--~~LD~e~~~~y~l~v~a~D~~~~~~~~~~~v~I~V~D~Nd~~ 78 (79)
T smart00112 1 TDADSGENGKVTYSILSGNEDGLFSIDPETGEITTT--KPLDREEQPEYTLTVEATDGGGPPLSSTATVTVTVLDVNDNA 78 (79)
T ss_pred CCCCCCcCcEEEEEEecCCCCCEEEEeCCccEEEeC--CccCeeCCCeEEEEEEEEECCCCCcccEEEEEEEEEECCCCC
Confidence 488999999999999988766899999999988887 799999999999999999999999999999999999999999
Q ss_pred C
Q psy1887 532 P 532 (653)
Q Consensus 532 P 532 (653)
|
T Consensus 79 P 79 (79)
T smart00112 79 P 79 (79)
T ss_pred C
Confidence 8
No 9
>KOG1834|consensus
Probab=99.59 E-value=7.9e-14 Score=146.82 Aligned_cols=216 Identities=21% Similarity=0.304 Sum_probs=167.8
Q ss_pred EEEEEccCCCCCeecCCceEEEEeCCCCCCcEEEeeeceeeEEeeCCCCc--ceE-EEEEEecCCCcEE---Ecccceee
Q psy1887 294 ALVFDQAINTPPYFDTVQYITHLDENSPQGTALIFAESFHTQVTDDNMGK--NGI-FSLTLENNNGTFE---IWPSVVER 367 (653)
Q Consensus 294 ~i~V~dvNd~~P~f~~~~~~~~v~En~~~gt~v~~v~~~~~~a~D~D~g~--n~~-~~~~~~~~~~~F~---I~~~tg~~ 367 (653)
.....-+|-+.|+... .|.+-|.||-. +++. .++ +.|.|.|..- .|. .-|.+-+..-.|. ++..||.
T Consensus 20 ~~~aarankhkpwie~-ey~gvV~Endn--tvll-~Pp--l~aLdkdaplr~ageiC~fklhgq~vPFdavVvdK~TGe- 92 (952)
T KOG1834|consen 20 HHHAARANKHKPWIEE-EYHGVVTENDN--TVLL-DPP--LAALDKDAPLRYAGEICGFKLHGQPVPFDAVVVDKYTGE- 92 (952)
T ss_pred ccccccccccCccccc-ceeEEEEeCCc--eEEe-CCC--eeeecCCCCcccccccceeEecCCCCCceEEEEeccCCc-
Confidence 3455667888887765 79999999873 4443 233 5677887532 222 3344544554564 6778886
Q ss_pred eeeEEEEEecCCCCCccCCCeEEEEEEEEECCC--CCCCCCceEEEEEEEEEeecCCCCCeecccceEEEeeCCCCCCcE
Q psy1887 368 KAQFTIRVRNNKNLDYERTRTLSFVIVAKEISS--DSSSNLLSSQAPVLVYINDVNDNPPVFTATLYTAKIPENATAGEK 445 (653)
Q Consensus 368 ~~~~~i~l~~~~~LD~E~~~~~~l~V~a~D~~~--~~~~~~~s~~~~v~I~V~DvNDn~P~f~~~~y~~~V~E~~~~gt~ 445 (653)
..|+.+.+||.|.++.|+|+|+|.|.|. ++..-..+-.++|.|.|.|+|+.+|+|..+-|.+.|.|+.. -..
T Consensus 93 -----gvlRaK~~lDCelqkeytf~iQAydCg~gpdgtn~kKShkatvhIrVkDvNe~AP~f~ep~Yka~V~EGK~-yd~ 166 (952)
T KOG1834|consen 93 -----GVLRAKEPLDCELQKEYTFTIQAYDCGNGPDGTNTKKSHKATVHIRVKDVNEFAPVFKEPWYKAHVTEGKV-YDS 166 (952)
T ss_pred -----eEEeecCcccccccccceEEEEEEecCCCCCccccccccceEEEEEeccccccCchhcccceeeEEeccee-eee
Confidence 4588899999999999999999999874 23333677889999999999999999999999999999764 568
Q ss_pred EEEEEEeeCCC-CCCceE-EEEEEcCCCCcceEEeCCCCeEEEEcCccCCccCccEEEEEEEEEECCCCCceEEEEEEEE
Q psy1887 446 VVQVKATDVDT-NLGGEI-LYTAILGYKNSSLELDAHTGDITIANGQQFDREEASEYKFQVEARDMQGLGLRTVVPLQLT 523 (653)
Q Consensus 446 v~~v~a~D~D~-g~n~~i-~y~i~~~~~~~~F~Id~~tG~i~~~~~~~lD~E~~~~~~l~V~a~D~g~~~~~~~~~v~I~ 523 (653)
|+++.|.|.|- .++++| .|.|+. +.-.|.||. .|.|+.. .+|.|.....|.|+|.|.|.|..+....+-|+|.
T Consensus 167 il~veAiD~DCspq~sqIC~YEI~t--~d~PFaIdn-~G~irnT--ekLny~ke~~Y~ltVtAyDCg~kraa~d~lV~v~ 241 (952)
T KOG1834|consen 167 ILRVEAIDKDCSPQYSQICEYEITT--PDVPFAIDN-DGNIRNT--EKLNYTKEHQYKLTVTAYDCGKKRAASDSLVTVH 241 (952)
T ss_pred eEEEEeecCCCCCcccceeEEEecC--CCCceEEcC-CCccccc--cccccccceeEEEEEEEEecccccccCcceEEEE
Confidence 89999999997 466777 566654 466899985 8999988 8899999999999999999987666666778888
Q ss_pred EEec
Q psy1887 524 ILDV 527 (653)
Q Consensus 524 V~Dv 527 (653)
|...
T Consensus 242 Vkp~ 245 (952)
T KOG1834|consen 242 VKPT 245 (952)
T ss_pred ecCc
Confidence 8754
No 10
>PF00028 Cadherin: Cadherin domain; InterPro: IPR002126 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion.; GO: 0005509 calcium ion binding, 0007156 homophilic cell adhesion, 0016020 membrane; PDB: 2A4E_A 2A4C_B 2O72_A 2QVI_A 1NCJ_A 3Q2W_A 3Q2N_A 3LNH_B 3LNI_A 3Q2L_A ....
Probab=99.52 E-value=1.2e-13 Score=117.03 Aligned_cols=90 Identities=29% Similarity=0.382 Sum_probs=81.7
Q ss_pred eEEEEeCCCCCCcEEEeeeceeeEEeeCCCCcceEEEEEEecC--CCcEEEcccceeeeeeEEEEEecCCCCCccCCCeE
Q psy1887 312 YITHLDENSPQGTALIFAESFHTQVTDDNMGKNGIFSLTLENN--NGTFEIWPSVVERKAQFTIRVRNNKNLDYERTRTL 389 (653)
Q Consensus 312 ~~~~v~En~~~gt~v~~v~~~~~~a~D~D~g~n~~~~~~~~~~--~~~F~I~~~tg~~~~~~~i~l~~~~~LD~E~~~~~ 389 (653)
|.++|+|++++|+.++. +.|.|+|.+.|+.+.|.+... .++|.|++.+|. |.+.++||||+.+.|
T Consensus 1 Y~~~v~E~~~~g~~v~~-----v~a~D~D~~~n~~i~y~i~~~~~~~~F~I~~~tg~--------i~~~~~LD~E~~~~y 67 (93)
T PF00028_consen 1 YSFSVPENAPPGTVVGQ-----VTATDPDSGPNSQITYSILGGNPDGLFSIDPNTGE--------ISLKKPLDRETQSSY 67 (93)
T ss_dssp EEEEEETTGSTSSEEEE-----EEEEESSTSTTSSEEEEEEETTSTTSEEEETTTTE--------EEESSSSCTTTTSEE
T ss_pred CEEEEECCCCCCCEEEE-----EEEEeCCCCCCceEEEEEecCcccCceEEeeeeec--------cccceecCcccCCEE
Confidence 78999999999999998 599999999999888887433 589999999998 888999999999999
Q ss_pred EEEEEEEEC-CCCCCCCCceEEEEEEEEEe
Q psy1887 390 SFVIVAKEI-SSDSSSNLLSSQAPVLVYIN 418 (653)
Q Consensus 390 ~l~V~a~D~-~~~~~~~~~s~~~~v~I~V~ 418 (653)
.|.|.|+|. + .+++++.+.|+|+|+
T Consensus 68 ~l~v~a~D~~~----~~~~~~~~~V~I~V~ 93 (93)
T PF00028_consen 68 QLTVRATDSGG----SPPLSSTATVTINVL 93 (93)
T ss_dssp EEEEEEEETTT----SSEEEEEEEEEEEEE
T ss_pred EEEEEEEECCC----CCCCEEEEEEEEEEC
Confidence 999999999 5 788999999999985
No 11
>smart00112 CA Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
Probab=99.44 E-value=7.7e-13 Score=108.46 Aligned_cols=77 Identities=36% Similarity=0.492 Sum_probs=68.6
Q ss_pred eeCCCCcceEEEEEEecCC--CcEEEcccceeeeeeEEEEEecCCCCCccCCCeEEEEEEEEECCCCCCCCCceEEEEEE
Q psy1887 337 TDDNMGKNGIFSLTLENNN--GTFEIWPSVVERKAQFTIRVRNNKNLDYERTRTLSFVIVAKEISSDSSSNLLSSQAPVL 414 (653)
Q Consensus 337 ~D~D~g~n~~~~~~~~~~~--~~F~I~~~tg~~~~~~~i~l~~~~~LD~E~~~~~~l~V~a~D~~~~~~~~~~s~~~~v~ 414 (653)
+|+|.|.|+.++|++.... .+|.|++.+|. |++.++||||..+.|.|.|+|+|.+ .+++++.+.|+
T Consensus 1 ~D~D~g~n~~i~Y~i~~~~~~~~F~i~~~tg~--------i~~~~~LD~e~~~~y~l~v~a~D~~----~~~~~~~~~v~ 68 (79)
T smart00112 1 TDADSGENGKVTYSILSGNEDGLFSIDPETGE--------ITTTKPLDREEQPEYTLTVEATDGG----GPPLSSTATVT 68 (79)
T ss_pred CCCCCCcCcEEEEEEecCCCCCEEEEeCCccE--------EEeCCccCeeCCCeEEEEEEEEECC----CCCcccEEEEE
Confidence 4889999999889885443 79999999996 6677899999999999999999998 56799999999
Q ss_pred EEEeecCCCCC
Q psy1887 415 VYINDVNDNPP 425 (653)
Q Consensus 415 I~V~DvNDn~P 425 (653)
|+|.|+|||+|
T Consensus 69 I~V~D~Nd~~P 79 (79)
T smart00112 69 VTVLDVNDNAP 79 (79)
T ss_pred EEEEECCCCCC
Confidence 99999999998
No 12
>KOG1834|consensus
Probab=99.11 E-value=2e-09 Score=114.18 Aligned_cols=154 Identities=16% Similarity=0.209 Sum_probs=117.9
Q ss_pred EeCCC--cEEEEeecCC-CCcceEEEEEEEeeCCccccceecCCCcEEEEEEEEEecccCCC-CCCCeEEEEEEEEEEcc
Q psy1887 225 VPEVL--GTYLLWRHLS-GPKNMYLMREIRLSKPYRELHTVASGQPVILMVLAEEERKDLNE-PSPQSSTATIALVFDQA 300 (653)
Q Consensus 225 i~e~~--G~i~~~~~lD-e~~~~y~l~~~~~~~~ld~~~~~~~~~~~~l~V~A~d~~~d~~~-~~~~s~~~~v~i~V~dv 300 (653)
++..+ |.|....+|| |-+..|++++ +|.|+..+..+ ....|-.++|.|+|.|+
T Consensus 86 vdK~TGegvlRaK~~lDCelqkeytf~i-----------------------QAydCg~gpdgtn~kKShkatvhIrVkDv 142 (952)
T KOG1834|consen 86 VDKYTGEGVLRAKEPLDCELQKEYTFTI-----------------------QAYDCGNGPDGTNTKKSHKATVHIRVKDV 142 (952)
T ss_pred EeccCCceEEeecCcccccccccceEEE-----------------------EEEecCCCCCccccccccceEEEEEeccc
Confidence 44444 4566667888 8776666666 88885433322 33678889999999999
Q ss_pred CCCCCeecCCceEEEEeCCCCCCcEEEeeeceeeEEeeCCCCc-ceE-EEEEEecCCCcEEEcccceeeeeeEEEEEecC
Q psy1887 301 INTPPYFDTVQYITHLDENSPQGTALIFAESFHTQVTDDNMGK-NGI-FSLTLENNNGTFEIWPSVVERKAQFTIRVRNN 378 (653)
Q Consensus 301 Nd~~P~f~~~~~~~~v~En~~~gt~v~~v~~~~~~a~D~D~g~-n~~-~~~~~~~~~~~F~I~~~tg~~~~~~~i~l~~~ 378 (653)
|+++|.|..+-|.+.|.|...... |++ +.|.|.|.++ +++ ..|.+...+-.|.|+.. |. |+.+
T Consensus 143 Ne~AP~f~ep~Yka~V~EGK~yd~-il~-----veAiD~DCspq~sqIC~YEI~t~d~PFaIdn~-G~--------irnT 207 (952)
T KOG1834|consen 143 NEFAPVFKEPWYKAHVTEGKVYDS-ILR-----VEAIDKDCSPQYSQICEYEITTPDVPFAIDND-GN--------IRNT 207 (952)
T ss_pred cccCchhcccceeeEEecceeeee-eEE-----EEeecCCCCCcccceeEEEecCCCCceEEcCC-Cc--------cccc
Confidence 999999999999999999765544 333 5999999875 555 66778788889999954 44 8999
Q ss_pred CCCCccCCCeEEEEEEEEECCCCCCCCCceEEEEEEEEEeec
Q psy1887 379 KNLDYERTRTLSFVIVAKEISSDSSSNLLSSQAPVLVYINDV 420 (653)
Q Consensus 379 ~~LD~E~~~~~~l~V~a~D~~~~~~~~~~s~~~~v~I~V~Dv 420 (653)
.+|.|.....|.|+|.|.|.| .....+.+.|+|.|...
T Consensus 208 ekLny~ke~~Y~ltVtAyDCg----~kraa~d~lV~v~Vkp~ 245 (952)
T KOG1834|consen 208 EKLNYTKEHQYKLTVTAYDCG----KKRAASDSLVTVHVKPT 245 (952)
T ss_pred cccccccceeEEEEEEEEecc----cccccCcceEEEEecCc
Confidence 999999999999999999998 33344447788887644
No 13
>PF08266 Cadherin_2: Cadherin-like; InterPro: IPR013164 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion. This entry represents a cadherin domain that is usually found at the N terminus of cadherin proteins.; PDB: 1WUZ_A 1WYJ_A.
Probab=97.05 E-value=0.00076 Score=55.25 Aligned_cols=63 Identities=19% Similarity=0.342 Sum_probs=40.7
Q ss_pred eEEEeeCCCCCCcEEEEEEEeeCCCCC--CceEEEEEEcCCCCcceEEeCCCCeEEEEcCccCCccCc
Q psy1887 432 YTAKIPENATAGEKVVQVKATDVDTNL--GGEILYTAILGYKNSSLELDAHTGDITIANGQQFDREEA 497 (653)
Q Consensus 432 y~~~V~E~~~~gt~v~~v~a~D~D~g~--n~~i~y~i~~~~~~~~F~Id~~tG~i~~~~~~~lD~E~~ 497 (653)
...+|+|..++|+.|+.| |.|.-... -....|++++.....+|.++..+|.+++. ..+|||+.
T Consensus 3 i~YsV~EE~~~Gt~IGni-a~dL~l~~~~l~~~~~ri~s~~~~~~~~v~~~tG~L~v~--~rIDRE~L 67 (84)
T PF08266_consen 3 IRYSVPEEMPPGTVIGNI-AKDLGLDPQSLSSRNFRIVSEGNSQYFRVNEKTGDLFVS--ERIDREEL 67 (84)
T ss_dssp EEEEEESS--TT-EEEEC-CCCCT--HHHHCCTTBEEE-SSSS-SEEE-TTTSEEEES--S--SCCCC
T ss_pred eEEEeecCCCCCCEEEEh-HHhhCCCcccccccceEEeecCCcceeEecCCceeEEeC--CccCHHHH
Confidence 467899999999999999 44543311 11246777777677899999999999998 78999983
No 14
>PF08758 Cadherin_pro: Cadherin prodomain like; InterPro: IPR014868 Cadherins are a group of proteins that mediate calcium dependent cell-cell adhesion. They are activated through cleavage of a prosequence in the late Golgi. This protein corresponds to the folded region of the prosequence, and is termed the prodomain. The prodomain shows structural resemblance to the cadherin domain, but lacks all the features known to be important for cadherin-cadherin interactions []. ; GO: 0007155 cell adhesion, 0016021 integral to membrane; PDB: 1OP4_A.
Probab=96.52 E-value=0.027 Score=46.82 Aligned_cols=79 Identities=20% Similarity=0.320 Sum_probs=42.4
Q ss_pred CCeecccceEEEeeCCCCCCcEEEEEEEeeCCCCCCceEEEEEEcCCCCcceEEeCCCCeEEEEcCccCCccCccEEEEE
Q psy1887 424 PPVFTATLYTAKIPENATAGEKVVQVKATDVDTNLGGEILYTAILGYKNSSLELDAHTGDITIANGQQFDREEASEYKFQ 503 (653)
Q Consensus 424 ~P~f~~~~y~~~V~E~~~~gt~v~~v~a~D~D~g~n~~i~y~i~~~~~~~~F~Id~~tG~i~~~~~~~lD~E~~~~~~l~ 503 (653)
-|-|.+..|.+.|+.+...|..|++|.-.|-.. +..+.|. +.++ .|.|.. .|.|+++ +++.... ..-.+.
T Consensus 3 ~pGF~~~~~~~~Vp~~l~~g~~lg~V~f~dC~~--~~~~~~~--ssDp--dF~V~~-DGsVy~~--r~v~l~~-~~~~F~ 72 (90)
T PF08758_consen 3 RPGFSQKKYTFEVPSNLEAGQPLGKVNFEDCTG--RRRVIFE--SSDP--DFRVLE-DGSVYAK--RPVQLSS-EQRSFT 72 (90)
T ss_dssp --B--S-EEEE----SS-SS--EEE---B--SS-----EEEE-----S--EEEEET-TTEEEEE--S--S-SS-S-EEEE
T ss_pred cCCcccceEEEEcCchhhCCcEEEEEEeccCCC--CCceEEe--cCCC--CEEEcC-CCeEEEe--eeEecCC-CceEEE
Confidence 488999999999999999999999999999863 4668886 2323 799975 9999998 6676543 335799
Q ss_pred EEEEECCCC
Q psy1887 504 VEARDMQGL 512 (653)
Q Consensus 504 V~a~D~g~~ 512 (653)
|.|.|..+.
T Consensus 73 V~a~D~~~~ 81 (90)
T PF08758_consen 73 VHAWDSQTQ 81 (90)
T ss_dssp EEEEETTTT
T ss_pred EEEECCCCC
Confidence 999997553
No 15
>PF08758 Cadherin_pro: Cadherin prodomain like; InterPro: IPR014868 Cadherins are a group of proteins that mediate calcium dependent cell-cell adhesion. They are activated through cleavage of a prosequence in the late Golgi. This protein corresponds to the folded region of the prosequence, and is termed the prodomain. The prodomain shows structural resemblance to the cadherin domain, but lacks all the features known to be important for cadherin-cadherin interactions []. ; GO: 0007155 cell adhesion, 0016021 integral to membrane; PDB: 1OP4_A.
Probab=96.21 E-value=0.025 Score=46.99 Aligned_cols=80 Identities=15% Similarity=0.290 Sum_probs=43.5
Q ss_pred CCceeccccCeEEEEeecCCCCceEEEEEEEecCCCCcceEEEecCCCceEEEEeeccCCCCCCeeEEEEEEccccC-CC
Q psy1887 102 NYPVFDISTQMRSLLIPASVKLGTIIYRLRASDSDKDYPLTFDATDFGSYVVKIKSLPCSKNSSFCEANVYLDRVLV-PG 180 (653)
Q Consensus 102 ~~P~f~~~~~~~~~~v~E~~~~gt~i~~v~a~D~D~~~~~~y~i~~~~~~~f~i~~~~~~~~~~~~~g~i~l~~~le-~~ 180 (653)
+.|.|....| .+.||.+...|..|++|.-.|=.....+.|.-+|. .|.|.. .|.|+..+++. ..
T Consensus 2 C~pGF~~~~~--~~~Vp~~l~~g~~lg~V~f~dC~~~~~~~~~ssDp---dF~V~~----------DGsVy~~r~v~l~~ 66 (90)
T PF08758_consen 2 CRPGFSQKKY--TFEVPSNLEAGQPLGKVNFEDCTGRRRVIFESSDP---DFRVLE----------DGSVYAKRPVQLSS 66 (90)
T ss_dssp ---B--S-EE--EE----SS-SS--EEE---B--SS---EEEE---S---EEEEET----------TTEEEEES--S-SS
T ss_pred CcCCcccceE--EEEcCchhhCCcEEEEEEeccCCCCCceEEecCCC---CEEEcC----------CCeEEEeeeEecCC
Confidence 4688988876 99999999999999999999997777788877543 788876 58999999993 34
Q ss_pred ceEEEEEEEEECCCCC
Q psy1887 181 QVFQFRIIVKDTRGDT 196 (653)
Q Consensus 181 ~~~~~~v~v~d~~g~~ 196 (653)
..-.|.|.++|..+..
T Consensus 67 ~~~~F~V~a~D~~~~~ 82 (90)
T PF08758_consen 67 EQRSFTVHAWDSQTQE 82 (90)
T ss_dssp S-EEEEEEEEETTTTE
T ss_pred CceEEEEEEECCCCCe
Confidence 4468999999998765
No 16
>TIGR01965 VCBS_repeat VCBS repeat. This domain of about 100 residues is found multiple (up to 35) copies in long proteins from several species of Vibrio, Colwellia, Bradyrhizobium, and Shewanella (hence the name VCBS) and in smaller copy numbers in proteins from several other bacteria. The large protein size and repeat copy numbers, species distribution, and suggested activities of several member proteins suggests a role for this domain in adhesion.
Probab=95.80 E-value=0.12 Score=43.56 Aligned_cols=89 Identities=21% Similarity=0.228 Sum_probs=57.6
Q ss_pred EEEEEeeCCCCCCceEEEEEEc-CCCCcceEEeCCCCeEEEEcC------ccCCccCccEEEEEEEEEECCCCCceEEEE
Q psy1887 447 VQVKATDVDTNLGGEILYTAIL-GYKNSSLELDAHTGDITIANG------QQFDREEASEYKFQVEARDMQGLGLRTVVP 519 (653)
Q Consensus 447 ~~v~a~D~D~g~n~~i~y~i~~-~~~~~~F~Id~~tG~i~~~~~------~~lD~E~~~~~~l~V~a~D~g~~~~~~~~~ 519 (653)
++|.++|+|.+. ...+++.. ....+.|.|++ .|.+..... +.|...+.-.-.|+|.+.|+ .+..
T Consensus 2 G~Lt~sD~D~gd--~~~~s~~~~~g~yGtlti~~-~G~wtYtl~n~~~avq~L~~Ge~~tdsFtvtv~DG------tt~~ 72 (99)
T TIGR01965 2 GQLTISDADAGQ--AHFIAQTDAAGQYGTFSIDA-DGQWTYQADNSQTAVQALKAGETLTDTFTVTSADG------TSQT 72 (99)
T ss_pred CceEEeCCCCCC--ceEEecccccCCcEEEEECC-CCcEEEEeCCCcHHHHhhcCCCEEEEEEEEEEeCC------CeEE
Confidence 468899999763 45555421 12346799987 787766511 12332234456788888984 2788
Q ss_pred EEEEEEecCCCCCeEeecCcEEEEecC
Q psy1887 520 LQLTILDVNDNAPIFVQTPFEFVLASD 546 (653)
Q Consensus 520 v~I~V~DvNDn~P~f~~~~y~~~v~e~ 546 (653)
|.|+|.-.|| +|+..... ...+.|+
T Consensus 73 vtItI~GtND-apvi~~~~-~g~v~ED 97 (99)
T TIGR01965 73 VTITITGAND-AAVIGGAD-TGSVTED 97 (99)
T ss_pred EEEEEEccCC-CCEEeccc-ceeEecC
Confidence 9999999998 66655432 4666665
No 17
>smart00736 CADG Dystroglycan-type cadherin-like domains. Cadherin-homologous domains present in metazoan dystroglycans and alpha/epsilon sarcoglycans, yeast Axl2p and in a very large protein from magnetotactic bacteria. Likely to bind calcium ions.
Probab=95.75 E-value=0.15 Score=43.22 Aligned_cols=70 Identities=27% Similarity=0.427 Sum_probs=52.6
Q ss_pred EeeCCCCCCceEEEEEEcCC---CCcceEEeCCCCeEEEEcCccCCccCccEEEEEEEEEECCCCCceEEEEEEEEEEec
Q psy1887 451 ATDVDTNLGGEILYTAILGY---KNSSLELDAHTGDITIANGQQFDREEASEYKFQVEARDMQGLGLRTVVPLQLTILDV 527 (653)
Q Consensus 451 a~D~D~g~n~~i~y~i~~~~---~~~~F~Id~~tG~i~~~~~~~lD~E~~~~~~l~V~a~D~g~~~~~~~~~v~I~V~Dv 527 (653)
..|.| ...++|++...+ ...|.+.|+.++.+.-. + ..+..+.|.+.|.|+|+.+ .+....++|.|.+.
T Consensus 24 F~d~d---~~~lty~~~~~~~~~lP~Wl~fd~~~~~~~Gt---P-~~~~~g~~~i~v~a~D~~g--~~~~~~f~i~V~~~ 94 (97)
T smart00736 24 FTDAD---GDTLTYSATLSDGSALPSWLSFDSDTGTLSGT---P-TNSDVGSLSLKVTATDSSG--ASASDTFTITVVNT 94 (97)
T ss_pred eECCC---CCeEEEEEEeCCCCCCCCeEEEeCCCCEEEEE---C-CCCCCcEEEEEEEEEECCC--CEEEEEEEEEEeCC
Confidence 46776 367899876432 24689999999888754 2 3333577999999999754 67888899999999
Q ss_pred CC
Q psy1887 528 ND 529 (653)
Q Consensus 528 ND 529 (653)
|+
T Consensus 95 ~~ 96 (97)
T smart00736 95 ND 96 (97)
T ss_pred CC
Confidence 86
No 18
>TIGR01965 VCBS_repeat VCBS repeat. This domain of about 100 residues is found multiple (up to 35) copies in long proteins from several species of Vibrio, Colwellia, Bradyrhizobium, and Shewanella (hence the name VCBS) and in smaller copy numbers in proteins from several other bacteria. The large protein size and repeat copy numbers, species distribution, and suggested activities of several member proteins suggests a role for this domain in adhesion.
Probab=95.66 E-value=0.073 Score=44.85 Aligned_cols=90 Identities=20% Similarity=0.262 Sum_probs=59.5
Q ss_pred eEEeeCCCCcceEEEEE-EecCCCcEEEcccceeeeeeEEEEEe----cCCCCCccCCCeEEEEEEEEECCCCCCCCCce
Q psy1887 334 TQVTDDNMGKNGIFSLT-LENNNGTFEIWPSVVERKAQFTIRVR----NNKNLDYERTRTLSFVIVAKEISSDSSSNLLS 408 (653)
Q Consensus 334 ~~a~D~D~g~n~~~~~~-~~~~~~~F~I~~~tg~~~~~~~i~l~----~~~~LD~E~~~~~~l~V~a~D~~~~~~~~~~s 408 (653)
|.++|+|.+....+... ..+..+.|.|++ .|.+. ..+- ..+.|.--+.-.-.|+|.+.|+
T Consensus 4 Lt~sD~D~gd~~~~s~~~~~g~yGtlti~~-~G~wt----Ytl~n~~~avq~L~~Ge~~tdsFtvtv~DG---------- 68 (99)
T TIGR01965 4 LTISDADAGQAHFIAQTDAAGQYGTFSIDA-DGQWT----YQADNSQTAVQALKAGETLTDTFTVTSADG---------- 68 (99)
T ss_pred eEEeCCCCCCceEEecccccCCcEEEEECC-CCcEE----EEeCCCcHHHHhhcCCCEEEEEEEEEEeCC----------
Confidence 68899998877666553 334557788887 45432 2232 1244554445556788888883
Q ss_pred EEEEEEEEEeecCCCCCeecccceEEEeeCCC
Q psy1887 409 SQAPVLVYINDVNDNPPVFTATLYTAKIPENA 440 (653)
Q Consensus 409 ~~~~v~I~V~DvNDn~P~f~~~~y~~~V~E~~ 440 (653)
..+.|+|+|.-.|| +|+..... ...+.|+.
T Consensus 69 tt~~vtItI~GtND-apvi~~~~-~g~v~ED~ 98 (99)
T TIGR01965 69 TSQTVTITITGAND-AAVIGGAD-TGSVTEDS 98 (99)
T ss_pred CeEEEEEEEEccCC-CCEEeccc-ceeEecCC
Confidence 26889999999999 88775443 46777764
No 19
>PF08266 Cadherin_2: Cadherin-like; InterPro: IPR013164 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion. This entry represents a cadherin domain that is usually found at the N terminus of cadherin proteins.; PDB: 1WUZ_A 1WYJ_A.
Probab=95.34 E-value=0.025 Score=46.38 Aligned_cols=61 Identities=18% Similarity=0.205 Sum_probs=36.9
Q ss_pred eEEEEeCCCCCCcEEEeeeceeeEEeeCCCCc----ceEEEEEEecCCCcEEEcccceeeeeeEEEEEecCCCCCccCC
Q psy1887 312 YITHLDENSPQGTALIFAESFHTQVTDDNMGK----NGIFSLTLENNNGTFEIWPSVVERKAQFTIRVRNNKNLDYERT 386 (653)
Q Consensus 312 ~~~~v~En~~~gt~v~~v~~~~~~a~D~D~g~----n~~~~~~~~~~~~~F~I~~~tg~~~~~~~i~l~~~~~LD~E~~ 386 (653)
...+|+|..+.|+.|+.+ |.|.-... ...+.+...+...+|.++..+|. |.+...+|||+-
T Consensus 3 i~YsV~EE~~~Gt~IGni------a~dL~l~~~~l~~~~~ri~s~~~~~~~~v~~~tG~--------L~v~~rIDRE~L 67 (84)
T PF08266_consen 3 IRYSVPEEMPPGTVIGNI------AKDLGLDPQSLSSRNFRIVSEGNSQYFRVNEKTGD--------LFVSERIDREEL 67 (84)
T ss_dssp EEEEEESS--TT-EEEEC------CCCCT--HHHHCCTTBEEE-SSSS-SEEE-TTTSE--------EEESS--SCCCC
T ss_pred eEEEeecCCCCCCEEEEh------HHhhCCCcccccccceEEeecCCcceeEecCCcee--------EEeCCccCHHHH
Confidence 356899999999999973 55543221 12355444556679999999998 777899999964
No 20
>PF13750 Big_3_3: Bacterial Ig-like domain (group 3)
Probab=94.32 E-value=4.4 Score=37.60 Aligned_cols=130 Identities=16% Similarity=0.218 Sum_probs=71.9
Q ss_pred CeEEEEE-EEEECCCCCCCCCceEEEEEEEEEeecCCCCCeecccceEEEeeCCCCC-CcEEEEEEEeeCCCCCCceEEE
Q psy1887 387 RTLSFVI-VAKEISSDSSSNLLSSQAPVLVYINDVNDNPPVFTATLYTAKIPENATA-GEKVVQVKATDVDTNLGGEILY 464 (653)
Q Consensus 387 ~~~~l~V-~a~D~~~~~~~~~~s~~~~v~I~V~DvNDn~P~f~~~~y~~~V~E~~~~-gt~v~~v~a~D~D~g~n~~i~y 464 (653)
..|.+++ .|.|.. +...+..+...+. +...+|.+.. .....+..+... |..=..+.++|.-.+. .-...
T Consensus 15 G~Y~l~~~~a~D~a------gN~~~~~~~~~~~-iD~T~Ptisi-~~~~~~~~g~~v~~~~~i~i~~tD~~~~~-~i~sv 85 (158)
T PF13750_consen 15 GSYTLTVVTATDAA------GNTSTSTVSETFT-IDNTPPTISI-SDGASVANGSTVYGLVNISINVTDNSDDS-KITSV 85 (158)
T ss_pred ccEEEEEEEEEecC------CCEEEEEEeeEEE-EcCCCCEEEE-ecCCccCCCccccceeeeEEEEEeCCCCc-eEEEE
Confidence 4689999 799987 4455555543333 2345888755 112233333332 3333567888876543 34566
Q ss_pred EEEcCCCCcceEE--eC-CCCeEEEEcCcc-CCccCccEEEEEEEEEECCCCCceEEEEEEEEEEec
Q psy1887 465 TAILGYKNSSLEL--DA-HTGDITIANGQQ-FDREEASEYKFQVEARDMQGLGLRTVVPLQLTILDV 527 (653)
Q Consensus 465 ~i~~~~~~~~F~I--d~-~tG~i~~~~~~~-lD~E~~~~~~l~V~a~D~g~~~~~~~~~v~I~V~Dv 527 (653)
+|..|.....-.+ .. ..|...+.-.+. ...|....|+|+|.|+|.. +..++..+.......
T Consensus 86 ~l~Gg~~~d~v~ls~~~~~~~~~~~~yp~~fpsle~~~~YtLtV~a~D~a--GN~~~~si~F~y~P~ 150 (158)
T PF13750_consen 86 SLTGGPASDSVSLSWTNKGNGVYTLEYPRIFPSLEADDSYTLTVSATDKA--GNQSTKSISFSYMPP 150 (158)
T ss_pred EEECCcccceEEEeeEeccCceEEeecccccCCcCCCCeEEEEEEEEecC--CCEEEEEEEEEEeCC
Confidence 6655544433222 22 234433321111 1347788999999999964 456666666665533
No 21
>PF13750 Big_3_3: Bacterial Ig-like domain (group 3)
Probab=93.32 E-value=5.8 Score=36.81 Aligned_cols=131 Identities=17% Similarity=0.185 Sum_probs=66.7
Q ss_pred CCcEEEEE-EEEEecccCCCCCCCeEEEEEEEEEEccCCCCCeecCCceEEEEeCCCCCCcEEEeeeceeeEEeeCCCCc
Q psy1887 265 GQPVILMV-LAEEERKDLNEPSPQSSTATIALVFDQAINTPPYFDTVQYITHLDENSPQGTALIFAESFHTQVTDDNMGK 343 (653)
Q Consensus 265 ~~~~~l~V-~A~d~~~d~~~~~~~s~~~~v~i~V~dvNd~~P~f~~~~~~~~v~En~~~gt~v~~v~~~~~~a~D~D~g~ 343 (653)
.+.|.+++ +|+|. .| ...+..+...+. ++..||.+.- .....+.. |..+.......+.++|.-.+.
T Consensus 14 dG~Y~l~~~~a~D~----ag---N~~~~~~~~~~~-iD~T~Ptisi-~~~~~~~~----g~~v~~~~~i~i~~tD~~~~~ 80 (158)
T PF13750_consen 14 DGSYTLTVVTATDA----AG---NTSTSTVSETFT-IDNTPPTISI-SDGASVAN----GSTVYGLVNISINVTDNSDDS 80 (158)
T ss_pred CccEEEEEEEEEec----CC---CEEEEEEeeEEE-EcCCCCEEEE-ecCCccCC----CccccceeeeEEEEEeCCCCc
Confidence 56788888 79872 22 333344433333 4567888754 11122222 333322223357888875443
Q ss_pred ceEEEEEE--ecCCCcEEEcccceeeeeeEEEEEecCCCC-CccCCCeEEEEEEEEECCCCCCCCCceEEEEEEEEEe
Q psy1887 344 NGIFSLTL--ENNNGTFEIWPSVVERKAQFTIRVRNNKNL-DYERTRTLSFVIVAKEISSDSSSNLLSSQAPVLVYIN 418 (653)
Q Consensus 344 n~~~~~~~--~~~~~~F~I~~~tg~~~~~~~i~l~~~~~L-D~E~~~~~~l~V~a~D~~~~~~~~~~s~~~~v~I~V~ 418 (653)
..-++.+ +...+.-.+.... ...+.+. +.-.+.+ ..|....|.|+|.|+|.. +..++..+.....
T Consensus 81 -~i~sv~l~Gg~~~d~v~ls~~~-~~~~~~~--~~yp~~fpsle~~~~YtLtV~a~D~a------GN~~~~si~F~y~ 148 (158)
T PF13750_consen 81 -KITSVSLTGGPASDSVSLSWTN-KGNGVYT--LEYPRIFPSLEADDSYTLTVSATDKA------GNQSTKSISFSYM 148 (158)
T ss_pred -eEEEEEEECCcccceEEEeeEe-ccCceEE--eecccccCCcCCCCeEEEEEEEEecC------CCEEEEEEEEEEe
Confidence 2233333 2223332222111 1112222 2111211 338889999999999987 6777777777665
No 22
>smart00736 CADG Dystroglycan-type cadherin-like domains. Cadherin-homologous domains present in metazoan dystroglycans and alpha/epsilon sarcoglycans, yeast Axl2p and in a very large protein from magnetotactic bacteria. Likely to bind calcium ions.
Probab=92.78 E-value=1.1 Score=37.83 Aligned_cols=70 Identities=19% Similarity=0.208 Sum_probs=47.5
Q ss_pred EeeCCCCcceEEEEEEec---CCCcEEEcccceeeeeeEEEEEecCCCCCccCCCeEEEEEEEEECCCCCCCCCceEEEE
Q psy1887 336 VTDDNMGKNGIFSLTLEN---NNGTFEIWPSVVERKAQFTIRVRNNKNLDYERTRTLSFVIVAKEISSDSSSNLLSSQAP 412 (653)
Q Consensus 336 a~D~D~g~n~~~~~~~~~---~~~~F~I~~~tg~~~~~~~i~l~~~~~LD~E~~~~~~l~V~a~D~~~~~~~~~~s~~~~ 412 (653)
..|.| +....|+..+.+ -..|..+++.++. +. ..|...+ ...|.|+|.|+|.. +.+....
T Consensus 24 F~d~d-~~~lty~~~~~~~~~lP~Wl~fd~~~~~--------~~-GtP~~~~-~g~~~i~v~a~D~~------g~~~~~~ 86 (97)
T smart00736 24 FTDAD-GDTLTYSATLSDGSALPSWLSFDSDTGT--------LS-GTPTNSD-VGSLSLKVTATDSS------GASASDT 86 (97)
T ss_pred eECCC-CCeEEEEEEeCCCCCCCCeEEEeCCCCE--------EE-EECCCCC-CcEEEEEEEEEECC------CCEEEEE
Confidence 35666 444455554432 1358888887764 32 2344444 45699999999987 5788889
Q ss_pred EEEEEeecCC
Q psy1887 413 VLVYINDVND 422 (653)
Q Consensus 413 v~I~V~DvND 422 (653)
++|.|.+.|+
T Consensus 87 f~i~V~~~~~ 96 (97)
T smart00736 87 FTITVVNTND 96 (97)
T ss_pred EEEEEeCCCC
Confidence 9999999987
No 23
>TIGR00845 caca sodium/calcium exchanger 1. This model is specific for the eukaryotic sodium ion/calcium ion exchangers of the Caca family
Probab=91.78 E-value=11 Score=44.63 Aligned_cols=150 Identities=19% Similarity=0.150 Sum_probs=79.0
Q ss_pred cCCCCCeecccceEEEeeCCCCCCcEEEEEEEeeCCCCCCceEEEEEEcCCCCc--ceEEeCCCCeEEEEcC--------
Q psy1887 420 VNDNPPVFTATLYTAKIPENATAGEKVVQVKATDVDTNLGGEILYTAILGYKNS--SLELDAHTGDITIANG-------- 489 (653)
Q Consensus 420 vNDn~P~f~~~~y~~~V~E~~~~gt~v~~v~a~D~D~g~n~~i~y~i~~~~~~~--~F~Id~~tG~i~~~~~-------- 489 (653)
.||..+.|....-...|.|+. |++-.+|.-...|.+..-.+.|+...|.... -|. +..|.|.....
T Consensus 395 ~dd~~s~i~Fe~~~Y~V~En~--GtV~VtV~R~GGdl~~tVsVdY~T~DGTA~AG~DY~--~~sGTLtF~PGEt~KtItV 470 (928)
T TIGR00845 395 ENDPVSKIFFEPGHYTCLENC--GTVALTVVRRGGDLTNTVYVDYRTEDGTANAGSDYE--FTEGTLVFKPGETQKEFRI 470 (928)
T ss_pred ccCCcceEEecCCeEEEeecC--cEEEEEEEEccCCCCceEEEEEEccCCccCCCCCcc--ccCceEEECCCceEEEEEE
Confidence 455455544444455689985 7777777776656655677899877664432 232 33454433211
Q ss_pred ccC---CccCccEEEEEEEEEECC-------------CCCceEEEEEEEEEEecCCCCCeEeecCcEEEEecCCCCCeEE
Q psy1887 490 QQF---DREEASEYKFQVEARDMQ-------------GLGLRTVVPLQLTILDVNDNAPIFVQTPFEFVLASDSRNFSER 553 (653)
Q Consensus 490 ~~l---D~E~~~~~~l~V~a~D~g-------------~~~~~~~~~v~I~V~DvNDn~P~f~~~~y~~~v~e~~~~g~~v 553 (653)
.-+ -+|....|.+.+.--..+ ...+......+|+|.| ||++|.|.-..-...|.|+. |+.-
T Consensus 471 ~IIDDdi~E~DE~F~V~LSNp~~g~~~G~~~~~~~~~~A~Lg~ps~ATVTIlD-DD~aGIfsFe~~~~sV~Es~--G~vt 547 (928)
T TIGR00845 471 GIIDDDIFEEDEHFYVRLSNLRVGSEDGILEANHVSAVAQLASPNTATVTILD-DDHAGIFTFEEDVFHVSESI--GIME 547 (928)
T ss_pred EEccCCCCCCCceEEEEEeCCCCCCcccccccccccccceecCCceEEEEEec-CcccCcccccCceEEEEcCC--CEEE
Confidence 011 134444555544321111 0112223356677777 68899876544457788874 4433
Q ss_pred EEE-EEEeCCCCCCCCeEEEEEEeCCC
Q psy1887 554 TFI-KATDQDAEAPNNIVRYEIISGNY 579 (653)
Q Consensus 554 ~~v-~A~D~D~~~~n~~i~Y~i~~~~~ 579 (653)
.+| +..+.+ + .-.+.|.-.+|..
T Consensus 548 vtV~RtsGa~-G--~VtV~Y~T~dGTA 571 (928)
T TIGR00845 548 VKVLRTSGAR-G--TVIVPYRTVEGTA 571 (928)
T ss_pred EEEEEcCCCC-e--eEEEEEEeecCcc
Confidence 333 333222 1 2346787776643
No 24
>TIGR00845 caca sodium/calcium exchanger 1. This model is specific for the eukaryotic sodium ion/calcium ion exchangers of the Caca family
Probab=85.70 E-value=87 Score=37.41 Aligned_cols=157 Identities=17% Similarity=0.136 Sum_probs=77.6
Q ss_pred cCCCCCeecCCceEEEEeCCCCCCcEEEeeeceeeEEeeCCCCcceEEEEEEecCC----CcE-----EEcccceeeeee
Q psy1887 300 AINTPPYFDTVQYITHLDENSPQGTALIFAESFHTQVTDDNMGKNGIFSLTLENNN----GTF-----EIWPSVVERKAQ 370 (653)
Q Consensus 300 vNd~~P~f~~~~~~~~v~En~~~gt~v~~v~~~~~~a~D~D~g~n~~~~~~~~~~~----~~F-----~I~~~tg~~~~~ 370 (653)
.||.++.|....-...|.||. |++-..| .-...|.+....+.|...+.. .-| .+.-..|+....
T Consensus 395 ~dd~~s~i~Fe~~~Y~V~En~--GtV~VtV-----~R~GGdl~~tVsVdY~T~DGTA~AG~DY~~~sGTLtF~PGEt~Kt 467 (928)
T TIGR00845 395 ENDPVSKIFFEPGHYTCLENC--GTVALTV-----VRRGGDLTNTVYVDYRTEDGTANAGSDYEFTEGTLVFKPGETQKE 467 (928)
T ss_pred ccCCcceEEecCCeEEEeecC--cEEEEEE-----EEccCCCCceEEEEEEccCCccCCCCCccccCceEEECCCceEEE
Confidence 455566665555566799986 6655443 222224444455666642211 111 122233444444
Q ss_pred EEEEEecCCCCCccCCCeEEEEEEEEECCC-CC--------CCCCceEEEEEEEEEeecCCCCCeecccceEEEeeCCCC
Q psy1887 371 FTIRVRNNKNLDYERTRTLSFVIVAKEISS-DS--------SSNLLSSQAPVLVYINDVNDNPPVFTATLYTAKIPENAT 441 (653)
Q Consensus 371 ~~i~l~~~~~LD~E~~~~~~l~V~a~D~~~-~~--------~~~~~s~~~~v~I~V~DvNDn~P~f~~~~y~~~V~E~~~ 441 (653)
+.+.|. .-=-+|..+.|.+.+.--..+. ++ ....+......+|+|.| ||++|.|....-..+|.|+.
T Consensus 468 ItV~II--DDdi~E~DE~F~V~LSNp~~g~~~G~~~~~~~~~~A~Lg~ps~ATVTIlD-DD~aGIfsFe~~~~sV~Es~- 543 (928)
T TIGR00845 468 FRIGII--DDDIFEEDEHFYVRLSNLRVGSEDGILEANHVSAVAQLASPNTATVTILD-DDHAGIFTFEEDVFHVSESI- 543 (928)
T ss_pred EEEEEc--cCCCCCCCceEEEEEeCCCCCCcccccccccccccceecCCceEEEEEec-CcccCcccccCceEEEEcCC-
Confidence 444443 3233566666666554321110 00 01123333456677777 78899876665567799974
Q ss_pred CCcEEEEEEEeeCCCCCCceEEEEEEcC
Q psy1887 442 AGEKVVQVKATDVDTNLGGEILYTAILG 469 (653)
Q Consensus 442 ~gt~v~~v~a~D~D~g~n~~i~y~i~~~ 469 (653)
|+.-.+|.-+-.-.| .-.+.|+...|
T Consensus 544 -G~vtvtV~RtsGa~G-~VtV~Y~T~dG 569 (928)
T TIGR00845 544 -GIMEVKVLRTSGARG-TVIVPYRTVEG 569 (928)
T ss_pred -CEEEEEEEEcCCCCe-eEEEEEEeecC
Confidence 555444433221111 23456765554
No 25
>PF05345 He_PIG: Putative Ig domain; InterPro: IPR008009 This alignment represents the conserved core region of a ~90 residue repeat found in several haemagglutinins and other cell surface proteins. Sequence similarities to Hyalin (IPR003410 from INTERPRO) and the PKD domain (IPR000601 from INTERPRO) suggest an Ig-like fold so this family may be similar in function to the (IPR003791 from INTERPRO) and (IPR003790 from INTERPRO) protein families.
Probab=83.99 E-value=3.8 Score=29.78 Aligned_cols=36 Identities=33% Similarity=0.405 Sum_probs=28.1
Q ss_pred CCcceEEeCCCCeEEEEcCccCCcc-CccEEEEEEEEEECC
Q psy1887 471 KNSSLELDAHTGDITIANGQQFDRE-EASEYKFQVEARDMQ 510 (653)
Q Consensus 471 ~~~~F~Id~~tG~i~~~~~~~lD~E-~~~~~~l~V~a~D~g 510 (653)
......||+.+|.|.-. .+.. ....|.+.|.|+|..
T Consensus 12 LP~gLs~d~~tG~isGt----p~~~~~~G~y~~~vtatd~~ 48 (49)
T PF05345_consen 12 LPSGLSLDPSTGTISGT----PTSSVQPGTYTFTVTATDGS 48 (49)
T ss_pred CCCcEEEeCCCCEEEee----cCCCccccEEEEEEEEEcCC
Confidence 34578999999999965 4444 347999999999964
No 26
>TIGR03660 T1SS_rpt_143 T1SS-143 repeat domain. This model represents a domain of about 143 amino acids that may occur singly or in up to 23 tandem repeats in very large proteins in the genus Vibrio, and in related species such as Legionella pneumophila, Photobacterium profundum, Rhodopseudomonas palustris, Shewanella pealeana, and Aeromonas hydrophila. Proteins with these domains represent a subset of a broader set of proteins with a particular signal for type 1 secretion, consisting of several glycine-rich repeats modeled by pfam00353, followed by a C-terminal domain modeled by TIGR03661. Proteins with this domain tend to share several properties with the RtxA (Repeats in Toxin) protein of Vibrio cholerae, including a large size often containing tandemly repeated domains and a C-terminal signal for type 1 secretion.
Probab=78.25 E-value=40 Score=30.45 Aligned_cols=57 Identities=26% Similarity=0.258 Sum_probs=38.1
Q ss_pred EecCCCCCccC---CCeEEEEEEEEECCCCCCCCCceEEEEEEEEEeecCCCCCeecccceEEEeeCCCC
Q psy1887 375 VRNNKNLDYER---TRTLSFVIVAKEISSDSSSNLLSSQAPVLVYINDVNDNPPVFTATLYTAKIPENAT 441 (653)
Q Consensus 375 l~~~~~LD~E~---~~~~~l~V~a~D~~~~~~~~~~s~~~~v~I~V~DvNDn~P~f~~~~y~~~V~E~~~ 441 (653)
+.+.++||... .-...|.|.|+|.. +-.+...+.|+|.| | .|...... ..+|.|+.-
T Consensus 70 ftL~~~lDH~~g~d~l~l~~~v~a~D~D------GD~s~~~l~VtI~D--D-~P~~~~~~-~~~V~E~~L 129 (137)
T TIGR03660 70 FTLEGPLDHAAGSDELTLNFPIIATDFD------GDTSSITLPVTIVD--D-VPTITDVD-ALTVDEDDL 129 (137)
T ss_pred EEEcccccCCCCCceEEEeeeEEEEeCC------CCccccEEEEEEEC--C-CCeecccc-ceEEecccc
Confidence 44468888743 34678889999977 23334588888887 6 58775544 367888543
No 27
>PF07495 Y_Y_Y: Y_Y_Y domain; InterPro: IPR011123 This region is mostly found at the end of the beta propellers (IPR011110 from INTERPRO) in a family of two component regulators. However they are also found tandemly repeated in Q891H4 from SWISSPROT without other signal conduction domains being present. It is named after the conserved tyrosines found in the alignment. The exact function is not known.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=69.94 E-value=44 Score=25.35 Aligned_cols=60 Identities=22% Similarity=0.346 Sum_probs=34.3
Q ss_pred CceEEEEEEcCCCCcceEEeCCCCeEEEEcCccCCccCccEEEEEEEEEECCCCCceEEEEEEEEEE
Q psy1887 459 GGEILYTAILGYKNSSLELDAHTGDITIANGQQFDREEASEYKFQVEARDMQGLGLRTVVPLQLTIL 525 (653)
Q Consensus 459 n~~i~y~i~~~~~~~~F~Id~~tG~i~~~~~~~lD~E~~~~~~l~V~a~D~g~~~~~~~~~v~I~V~ 525 (653)
+-...|+|. |....+..+...+-.+... .| ..+.|+|.|+|.|..+........++|+|+
T Consensus 7 ~~~Y~Y~l~-g~d~~W~~~~~~~~~~~~~---~L---~~G~Y~l~V~a~~~~~~~~~~~~~l~i~I~ 66 (66)
T PF07495_consen 7 NIRYRYRLE-GFDDEWITLGSYSNSISYT---NL---PPGKYTLEVRAKDNNGKWSSDEKSLTITIL 66 (66)
T ss_dssp TEEEEEEEE-TTESSEEEESSTS-EEEEE---S-----SEEEEEEEEEEETTS-B-SS-EEEEEEEE
T ss_pred ceEEEEEEE-CCCCeEEECCCCcEEEEEE---eC---CCEEEEEEEEEECCCCCcCcccEEEEEEEC
Confidence 345566644 4445566664433255544 22 579999999999976654444366777664
No 28
>KOG3597|consensus
Probab=68.81 E-value=1.1e+02 Score=33.26 Aligned_cols=153 Identities=17% Similarity=0.191 Sum_probs=83.9
Q ss_pred EEEEEEEEEeecCCCCCeecccceEEEeeCCCCCCcEEEEEEEeeCCCCCCceEEEEEEcCCCC----cceEEeC-----
Q psy1887 409 SQAPVLVYINDVNDNPPVFTATLYTAKIPENATAGEKVVQVKATDVDTNLGGEILYTAILGYKN----SSLELDA----- 479 (653)
Q Consensus 409 ~~~~v~I~V~DvNDn~P~f~~~~y~~~V~E~~~~gt~v~~v~a~D~D~g~n~~i~y~i~~~~~~----~~F~Id~----- 479 (653)
-+....|.|.-+||.+..+....+.+-+.|+...-.-...+.+.|+|.+. ..+.|++...... ..|..-.
T Consensus 24 ~~~~~~i~v~pvndpp~~~~~~~~~l~~~~~~~k~l~~~~l~~~d~d~~~-~~l~f~v~~t~~~~~~~~~~~~~g~~~~~ 102 (442)
T KOG3597|consen 24 QTDVLRIHVNPVNDPPSLIFPSGSLLVILEGGQKVLDPELLTAADPDSAP-LPLEFQVLGTSSVPLPVLKFDVPGAPATE 102 (442)
T ss_pred EEeeecccccccCCCcceeecccceEEeecCCceeccceEeeccCCCCCc-cceEEEEccCCCCCCccceeeccCCcccc
Confidence 34567899999999777676666778888876644444568889999853 5678887653221 1133221
Q ss_pred ------CCCeEEEEcCccCCccCccEEEEEEEEEECCCCCceEEEEEEEEEEecCCCCCeEeec-CcEEEEecCCCCCeE
Q psy1887 480 ------HTGDITIANGQQFDREEASEYKFQVEARDMQGLGLRTVVPLQLTILDVNDNAPIFVQT-PFEFVLASDSRNFSE 552 (653)
Q Consensus 480 ------~tG~i~~~~~~~lD~E~~~~~~l~V~a~D~g~~~~~~~~~v~I~V~DvNDn~P~f~~~-~y~~~v~e~~~~g~~ 552 (653)
..|.+.....+. ....+.++..++|+ ...+-.+..... ...|.+... .-...+.-+.. ..
T Consensus 103 Fs~~~v~~g~~~yvh~g~----el~~~~~~~~~SDg----~~~S~~~i~~~~---~~~~~~~~~~~~gL~v~~gS~--~~ 169 (442)
T KOG3597|consen 103 FSYEEVEDGSLSYVHSGT----ELRESELQLRVSDG----LLVSERAILKVE---ATGPAPHLARNTGLKVLQGST--AP 169 (442)
T ss_pred eEehHhhcCceeEEecCc----ccccceEEEEeecc----eEeeeeEEeccc---CCCcceeeecccceEEccCcc--cc
Confidence 223333321111 25677888888884 222221111111 233333221 11223322221 12
Q ss_pred E--EEEEEEeCCCCCCCCeEEEEEEe
Q psy1887 553 R--TFIKATDQDAEAPNNIVRYEIIS 576 (653)
Q Consensus 553 v--~~v~A~D~D~~~~n~~i~Y~i~~ 576 (653)
+ ..+.+.|.|.. +...+.|.|..
T Consensus 170 IT~~~L~ved~d~~-~d~~v~~~i~~ 194 (442)
T KOG3597|consen 170 ITPSNLSVEDNDSS-PDDEVRYDITP 194 (442)
T ss_pred ccHhHceeecCCCC-CCcEEEEEecC
Confidence 2 35788898864 56779999974
No 29
>PF03160 Calx-beta: Calx-beta domain; InterPro: IPR003644 The calx-beta motif is present as a tandem repeat in the cytoplasmic domains of Calx Na-Ca exchangers, which are used to expel calcium from cells. This motif overlaps domains used for calcium binding and regulation. The calx-beta motif is also present in the cytoplasmic tail of mammalian integrin-beta4, which mediates the bi-directional transfer of signals across the plasma membrane, as well as in some cyanobacterial proteins. This motif contains a series of beta-strands and turns that form a self-contained beta-sheet [, ].; GO: 0007154 cell communication, 0016021 integral to membrane; PDB: 3H6A_B 3FSO_A 3FQ4_B 2DPK_A 2QVM_A 3GIN_B 2QVK_A 2FWU_A 2FWS_A 3E9U_A ....
Probab=65.18 E-value=81 Score=26.29 Aligned_cols=53 Identities=21% Similarity=0.235 Sum_probs=30.6
Q ss_pred EEEEeecCCCCCeecccceEEEeeCCCCCCcEEEEEEEeeCCCCCCceEEEEEEcCC
Q psy1887 414 LVYINDVNDNPPVFTATLYTAKIPENATAGEKVVQVKATDVDTNLGGEILYTAILGY 470 (653)
Q Consensus 414 ~I~V~DvNDn~P~f~~~~y~~~V~E~~~~gt~v~~v~a~D~D~g~n~~i~y~i~~~~ 470 (653)
+|.|.| || .|.+....-..++.|+. |..-..+.....+....-.+.|....+.
T Consensus 2 tvtI~d-~d-~~~v~f~~~~~~v~E~~--~~~~v~V~~~~~~~~~~v~v~~~~~~gt 54 (100)
T PF03160_consen 2 TVTILD-DD-DPTVSFSSPSYTVSEGD--GTVTVTVTRSGGSLDGPVTVNYSTVDGT 54 (100)
T ss_dssp EEEEE--TT-SEEEEESSSEEEEETTS--SEEEEEEEEESS-TSSEEEEEEEEEESS
T ss_pred EEEEEC-CC-CCEEEEeCCEEEEEeCC--CEEEEEEEEcccCCCcceEEEEEEeCCc
Confidence 567788 66 44766665566788885 4455555555444333455666655553
No 30
>TIGR03660 T1SS_rpt_143 T1SS-143 repeat domain. This model represents a domain of about 143 amino acids that may occur singly or in up to 23 tandem repeats in very large proteins in the genus Vibrio, and in related species such as Legionella pneumophila, Photobacterium profundum, Rhodopseudomonas palustris, Shewanella pealeana, and Aeromonas hydrophila. Proteins with these domains represent a subset of a broader set of proteins with a particular signal for type 1 secretion, consisting of several glycine-rich repeats modeled by pfam00353, followed by a C-terminal domain modeled by TIGR03661. Proteins with this domain tend to share several properties with the RtxA (Repeats in Toxin) protein of Vibrio cholerae, including a large size often containing tandemly repeated domains and a C-terminal signal for type 1 secretion.
Probab=57.18 E-value=1.5e+02 Score=26.77 Aligned_cols=44 Identities=25% Similarity=0.435 Sum_probs=29.8
Q ss_pred cEEEEEEEEEECCCCCceEEEEEEEEEEecCCCCCeEeecCcEEEEecCC
Q psy1887 498 SEYKFQVEARDMQGLGLRTVVPLQLTILDVNDNAPIFVQTPFEFVLASDS 547 (653)
Q Consensus 498 ~~~~l~V~a~D~g~~~~~~~~~v~I~V~DvNDn~P~f~~~~y~~~v~e~~ 547 (653)
-...|.|.|+|..+... +..+.|+|.| | .|+..... ...|.|+.
T Consensus 85 l~l~~~v~a~D~DGD~s--~~~l~VtI~D--D-~P~~~~~~-~~~V~E~~ 128 (137)
T TIGR03660 85 LTLNFPIIATDFDGDTS--SITLPVTIVD--D-VPTITDVD-ALTVDEDD 128 (137)
T ss_pred EEEeeeEEEEeCCCCcc--ccEEEEEEEC--C-CCeecccc-ceEEeccc
Confidence 35678899999766443 3578888887 4 47765543 36787853
No 31
>PF07495 Y_Y_Y: Y_Y_Y domain; InterPro: IPR011123 This region is mostly found at the end of the beta propellers (IPR011110 from INTERPRO) in a family of two component regulators. However they are also found tandemly repeated in Q891H4 from SWISSPROT without other signal conduction domains being present. It is named after the conserved tyrosines found in the alignment. The exact function is not known.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=43.57 E-value=1e+02 Score=23.28 Aligned_cols=47 Identities=17% Similarity=0.109 Sum_probs=26.2
Q ss_pred CCcceEEEEEEecC-CCcEEEcccceeeeeeEEEEEecCCCCCccCCCeEEEEEEEEECC
Q psy1887 341 MGKNGIFSLTLENN-NGTFEIWPSVVERKAQFTIRVRNNKNLDYERTRTLSFVIVAKEIS 399 (653)
Q Consensus 341 ~g~n~~~~~~~~~~-~~~F~I~~~tg~~~~~~~i~l~~~~~LD~E~~~~~~l~V~a~D~~ 399 (653)
.+.+..|.|.+.+. ..+..+...+-. +.. .+| ....|.|.|+|.|..
T Consensus 4 ~~~~~~Y~Y~l~g~d~~W~~~~~~~~~------~~~---~~L---~~G~Y~l~V~a~~~~ 51 (66)
T PF07495_consen 4 NPENIRYRYRLEGFDDEWITLGSYSNS------ISY---TNL---PPGKYTLEVRAKDNN 51 (66)
T ss_dssp CCTTEEEEEEEETTESSEEEESSTS-E------EEE---ES-----SEEEEEEEEEEETT
T ss_pred CCCceEEEEEEECCCCeEEECCCCcEE------EEE---EeC---CCEEEEEEEEEECCC
Confidence 34566788877543 344444433211 111 111 346899999999976
No 32
>KOG3597|consensus
Probab=42.94 E-value=4.9e+02 Score=28.53 Aligned_cols=50 Identities=8% Similarity=0.022 Sum_probs=35.7
Q ss_pred EEEEEEEEEEccCCCCCeecCCceEEEEeCCCCCCcEEEeeeceeeEEeeCCCCc
Q psy1887 289 STATIALVFDQAINTPPYFDTVQYITHLDENSPQGTALIFAESFHTQVTDDNMGK 343 (653)
Q Consensus 289 ~~~~v~i~V~dvNd~~P~f~~~~~~~~v~En~~~gt~v~~v~~~~~~a~D~D~g~ 343 (653)
.++.+.|.|..+||.+..+....+.+-+.|....-..- ..+++.|+|.+.
T Consensus 24 ~~~~~~i~v~pvndpp~~~~~~~~~l~~~~~~~k~l~~-----~~l~~~d~d~~~ 73 (442)
T KOG3597|consen 24 QTDVLRIHVNPVNDPPSLIFPSGSLLVILEGGQKVLDP-----ELLTAADPDSAP 73 (442)
T ss_pred EEeeecccccccCCCcceeecccceEEeecCCceeccc-----eEeeccCCCCCc
Confidence 45788999999999988887777778888865321111 235888988664
No 33
>PF15418 DUF4625: Domain of unknown function (DUF4625)
Probab=37.04 E-value=70 Score=28.68 Aligned_cols=24 Identities=29% Similarity=0.462 Sum_probs=16.2
Q ss_pred eEEEEEEEEECCCCCeeeeEEEEE
Q psy1887 182 VFQFRIIVKDTRGDTTTVPTSLTA 205 (653)
Q Consensus 182 ~~~~~v~v~d~~g~~~~~~~~i~v 205 (653)
.|.|.++|+|..|........|+|
T Consensus 108 ~YH~~i~VtD~~Gn~~~~~~~i~I 131 (132)
T PF15418_consen 108 DYHFMITVTDAAGNQTEEERSIKI 131 (132)
T ss_pred ceEEEEEEEECCCCEEEEEEEEEE
Confidence 477777777777776665555544
No 34
>smart00089 PKD Repeats in polycystic kidney disease 1 (PKD1) and other proteins. Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.
Probab=33.80 E-value=2.3e+02 Score=22.11 Aligned_cols=30 Identities=7% Similarity=0.249 Sum_probs=22.6
Q ss_pred CCccCccEEEEEEEEEECCCCCceEEEEEEEEE
Q psy1887 492 FDREEASEYKFQVEARDMQGLGLRTVVPLQLTI 524 (653)
Q Consensus 492 lD~E~~~~~~l~V~a~D~g~~~~~~~~~v~I~V 524 (653)
.-|+....|.+++.++|..+ +.++.+.|.|
T Consensus 49 ~~y~~~G~y~v~l~v~n~~g---~~~~~~~i~v 78 (79)
T smart00089 49 HTYTKPGTYTVTLTVTNAVG---SASATVTVVV 78 (79)
T ss_pred EEeCCCcEEEEEEEEEcCCC---cEEEEEEEEE
Confidence 45677899999999999755 5566666665
No 35
>cd00146 PKD polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.
Probab=31.88 E-value=2.5e+02 Score=21.99 Aligned_cols=31 Identities=16% Similarity=0.169 Sum_probs=22.0
Q ss_pred CCccCCCeEEEEEEEEECCCCCCCCCceEEEEEEEEE
Q psy1887 381 LDYERTRTLSFVIVAKEISSDSSSNLLSSQAPVLVYI 417 (653)
Q Consensus 381 LD~E~~~~~~l~V~a~D~~~~~~~~~~s~~~~v~I~V 417 (653)
..|.....|.++|.|+|.. +.+....++|.|
T Consensus 51 ~~y~~~G~y~v~l~v~d~~------g~~~~~~~~V~V 81 (81)
T cd00146 51 HTYTKPGTYTVTLTVTNAV------GSSSTKTTTVVV 81 (81)
T ss_pred EEcCCCcEEEEEEEEEeCC------CCEEEEEEEEEC
Confidence 4477888999999999975 355555555543
No 36
>PF03160 Calx-beta: Calx-beta domain; InterPro: IPR003644 The calx-beta motif is present as a tandem repeat in the cytoplasmic domains of Calx Na-Ca exchangers, which are used to expel calcium from cells. This motif overlaps domains used for calcium binding and regulation. The calx-beta motif is also present in the cytoplasmic tail of mammalian integrin-beta4, which mediates the bi-directional transfer of signals across the plasma membrane, as well as in some cyanobacterial proteins. This motif contains a series of beta-strands and turns that form a self-contained beta-sheet [, ].; GO: 0007154 cell communication, 0016021 integral to membrane; PDB: 3H6A_B 3FSO_A 3FQ4_B 2DPK_A 2QVM_A 3GIN_B 2QVK_A 2FWU_A 2FWS_A 3E9U_A ....
Probab=31.68 E-value=1.2e+02 Score=25.12 Aligned_cols=52 Identities=23% Similarity=0.320 Sum_probs=31.1
Q ss_pred EEEEEeccCCCCcCCCCCceEEecCCCCCCcEEEEEEEecCCCCCCceEEEEEecC
Q psy1887 12 VVVVDDVQDTPPIFINIQPVIQLAPNLTMNDVLTKITAIDGDKGHPRTIKYGLVSE 67 (653)
Q Consensus 12 ~I~V~DvNDn~P~F~~~~~~~~V~E~~~~g~~i~~v~A~D~D~g~n~~i~ysl~~~ 67 (653)
+|+|+| ||.+ .+.-.....++.|+. |..-..|.-..++....-.+.|+..++
T Consensus 2 tvtI~d-~d~~-~v~f~~~~~~v~E~~--~~~~v~V~~~~~~~~~~v~v~~~~~~g 53 (100)
T PF03160_consen 2 TVTILD-DDDP-TVSFSSPSYTVSEGD--GTVTVTVTRSGGSLDGPVTVNYSTVDG 53 (100)
T ss_dssp EEEEE--TTSE-EEEESSSEEEEETTS--SEEEEEEEEESS-TSSEEEEEEEEEES
T ss_pred EEEEEC-CCCC-EEEEeCCEEEEEeCC--CEEEEEEEEcccCCCcceEEEEEEeCC
Confidence 577888 6665 776666677889986 445555555544433344666766554
No 37
>PF13753 SWM_repeat: Putative flagellar system-associated repeat
Probab=24.80 E-value=7.8e+02 Score=25.37 Aligned_cols=132 Identities=14% Similarity=0.176 Sum_probs=0.0
Q ss_pred CeEEEEEEEEECCCCCCCCCceEEEEEEEEEeecCCCCCeeccc--ceEEEeeCCCCCCcEEEEEEEeeCCCCCCceEEE
Q psy1887 387 RTLSFVIVAKEISSDSSSNLLSSQAPVLVYINDVNDNPPVFTAT--LYTAKIPENATAGEKVVQVKATDVDTNLGGEILY 464 (653)
Q Consensus 387 ~~~~l~V~a~D~~~~~~~~~~s~~~~v~I~V~DvNDn~P~f~~~--~y~~~V~E~~~~gt~v~~v~a~D~D~g~n~~i~y 464 (653)
..|.+.+.++|.. +..+.....|.|.-. +|..... .....+.-........+...+++.+ .+..+.+
T Consensus 12 ~~~~v~vt~tD~a------GN~~~~t~~~~vDt~---~P~v~i~~~~~~~~~~~~~~~~~~t~s~tvs~~~--~g~~v~v 80 (317)
T PF13753_consen 12 GTYTVSVTVTDAA------GNTSTATQSITVDTT---APTVTITSIADDDIINGDEATNTVTFSGTVSGAE--PGSTVTV 80 (317)
T ss_pred CcEEEEEEEEeCC------CCeeeeeEEEEEecC---CCceeeecccCCCccccceeeeeeEEEEEecCCC--CCCEEEE
Q ss_pred EEEcCCCCcceEEeCCCCeEEEEcCccCC-ccCccEEEEEEE-EEECCCCCceEEEEEEEEEEecCCCCCeEe
Q psy1887 465 TAILGYKNSSLELDAHTGDITIANGQQFD-REEASEYKFQVE-ARDMQGLGLRTVVPLQLTILDVNDNAPIFV 535 (653)
Q Consensus 465 ~i~~~~~~~~F~Id~~tG~i~~~~~~~lD-~E~~~~~~l~V~-a~D~g~~~~~~~~~v~I~V~DvNDn~P~f~ 535 (653)
.+ +.....+..+ ..|.+... .... .-....|.+.+. ++|..+..... ....+.|.-.--.+|.+.
T Consensus 81 ~~--~g~~~t~~~~-~~G~ws~t--~~~~~~l~~g~~ti~v~~~tD~aGN~~t~-~s~~~~vDt~~~~~p~vt 147 (317)
T PF13753_consen 81 TI--NGTTGTLTAD-ADGNWSVT--VTPSDDLPDGDYTITVTTVTDAAGNTSTA-ASQTFTVDTTAPTAPTVT 147 (317)
T ss_pred EE--CCEEEEEEEe-cCCcEEEe--eccccccccCcceeEEEEEEccCCccccc-cccccccccccccccccc
No 38
>KOG4221|consensus
Probab=23.06 E-value=1.5e+03 Score=28.19 Aligned_cols=130 Identities=17% Similarity=0.186 Sum_probs=69.1
Q ss_pred CeEEEEEEEEECCCCCCCCCceEEEEEEEEEeecCCCCCeecccceEEEeeCCCCCCcEEEEEEEeeCCCCCCceE----
Q psy1887 387 RTLSFVIVAKEISSDSSSNLLSSQAPVLVYINDVNDNPPVFTATLYTAKIPENATAGEKVVQVKATDVDTNLGGEI---- 462 (653)
Q Consensus 387 ~~~~l~V~a~D~~~~~~~~~~s~~~~v~I~V~DvNDn~P~f~~~~y~~~V~E~~~~gt~v~~v~a~D~D~g~n~~i---- 462 (653)
..|.|.|.++-.+. ...+....+.+... ...|.+......+.-.| .+.+.++. .-+-.-.|++|
T Consensus 891 t~yEfav~~~~~~~----r~stwsmsv~~~tl---e~~P~sPP~d~tv~p~e--~P~~v~v~---WqPp~e~nG~I~~Yi 958 (1381)
T KOG4221|consen 891 TPYEFAVMVVKRNR----RESTWSMSVENRTL---ELVPSSPPRDLTVQPDE--KPTTVIVH---WQPPTEPNGEITEYI 958 (1381)
T ss_pred ChhhhhhhhhhccC----cCCcccceeeeeec---ccCCCCCChhceecccC--CCCccccc---cCCCcCCCCceeeEE
Confidence 45677777665441 12233344444433 34788877777776666 34444432 23333445554
Q ss_pred -EEEEEcCCCCcceEEeCCCCeEEEEcCccCCccCccEEEEEEEEEECCCCC-ceE---EEEEEEEEEecCCC
Q psy1887 463 -LYTAILGYKNSSLELDAHTGDITIANGQQFDREEASEYKFQVEARDMQGLG-LRT---VVPLQLTILDVNDN 530 (653)
Q Consensus 463 -~y~i~~~~~~~~F~Id~~tG~i~~~~~~~lD~E~~~~~~l~V~a~D~g~~~-~~~---~~~v~I~V~DvNDn 530 (653)
.|+........-..+...-|..... +.-+.+....|.+.|+|+...+++ .+. ..+....+...||.
T Consensus 959 i~Ys~~~n~~~~dWt~~t~~g~~L~~--~v~~l~p~t~yffkiQAr~~kG~gp~s~~v~y~t~~~~~~~~~d~ 1029 (1381)
T KOG4221|consen 959 IYYSTDGNTPEHDWTIETTAGAELSH--QVPNLDPDTGYFFKIQARNEKGPGPFSSPVLYETSKAEIVMINDQ 1029 (1381)
T ss_pred EEEecCCCCchhhceeeecccchhhh--ccCCCCCCCceEEEEEeeccCCCCccccceeeeccccccccccch
Confidence 2332111222345565555655444 445556678999999999866665 332 23344444455554
No 39
>PF12245 Big_3_2: Bacterial Ig-like domain (group 3); InterPro: IPR022038 This family of proteins is found in bacteria. They have two conserved sequence motifs: AGN and GMT.
Probab=22.97 E-value=2.9e+02 Score=20.75 Aligned_cols=30 Identities=13% Similarity=0.142 Sum_probs=22.0
Q ss_pred CCeEEEEEEEEECCCCCCCCCceEEEEEEEEEeecC
Q psy1887 386 TRTLSFVIVAKEISSDSSSNLLSSQAPVLVYINDVN 421 (653)
Q Consensus 386 ~~~~~l~V~a~D~~~~~~~~~~s~~~~v~I~V~DvN 421 (653)
...|.|.+.|+|.. +..+.....+.+.|..
T Consensus 22 dg~yt~~v~a~D~A------GN~~~~~~~~~i~d~~ 51 (60)
T PF12245_consen 22 DGEYTLTVTATDKA------GNTSSSTTQIVIVDNT 51 (60)
T ss_pred CccEEEEEEEEECC------CCEEEeeeEEEEEcCC
Confidence 56799999999988 5666666666666554
No 40
>cd02848 Chitinase_N_term Chitinase N-terminus domain. Chitinases hydrolyze the abundant natural biopolymer chitin, producing smaller chito-oligosaccharides. Chitin consists of multiple N-acetyl-D-glucosamine (NAG) residues connected via beta-1,4-glycosidic linkages and is an important structural element of fungal cell wall and arthropod exoskeletons. On the basis of the mode of chitin hydrolysis, chitinases are classified as random, endo-, and exo-chitinases and based on sequence criteria, chitinases belong to families 18 and 19 of glycosyl hydrolases. The N-terminus of chitinase may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitob
Probab=22.61 E-value=1.8e+02 Score=24.92 Aligned_cols=34 Identities=21% Similarity=0.356 Sum_probs=24.3
Q ss_pred cCCccCccEEEEEEEEEECCCCCceEEEEEEEEEEe
Q psy1887 491 QFDREEASEYKFQVEARDMQGLGLRTVVPLQLTILD 526 (653)
Q Consensus 491 ~lD~E~~~~~~l~V~a~D~g~~~~~~~~~v~I~V~D 526 (653)
.+++.+.+.|.++|+++|..+ -+.+..+.|.|-|
T Consensus 73 t~~v~kgG~y~m~V~lCn~dG--CS~S~~~~I~VAD 106 (106)
T cd02848 73 TFKVGKGGRYQMQVALCNGDG--CSTSAAKEIVVAD 106 (106)
T ss_pred EEEeCCCCeEEEEEEEECCCC--ccCcCCEEEEecC
Confidence 356677899999999999655 4444566666654
No 41
>PF12245 Big_3_2: Bacterial Ig-like domain (group 3); InterPro: IPR022038 This family of proteins is found in bacteria. They have two conserved sequence motifs: AGN and GMT.
Probab=20.06 E-value=3.8e+02 Score=20.11 Aligned_cols=32 Identities=28% Similarity=0.427 Sum_probs=21.8
Q ss_pred cCccEEEEEEEEEECCCCCceEEEEEEEEEEecC
Q psy1887 495 EEASEYKFQVEARDMQGLGLRTVVPLQLTILDVN 528 (653)
Q Consensus 495 E~~~~~~l~V~a~D~g~~~~~~~~~v~I~V~DvN 528 (653)
+....|.|.++|+|..+ ..+.....+.+.|..
T Consensus 20 ~~dg~yt~~v~a~D~AG--N~~~~~~~~~i~d~~ 51 (60)
T PF12245_consen 20 DADGEYTLTVTATDKAG--NTSSSTTQIVIVDNT 51 (60)
T ss_pred cCCccEEEEEEEEECCC--CEEEeeeEEEEEcCC
Confidence 34678999999999654 445555555555553
Done!