Query psy1041
Match_columns 1095
No_of_seqs 441 out of 2777
Neff 8.7
Searched_HMMs 46136
Date Fri Aug 16 17:11:33 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy1041.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/1041hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG4289|consensus 100.0 3E-130 7E-135 1131.1 77.1 824 152-1073 158-994 (2531)
2 KOG1219|consensus 100.0 3E-127 7E-132 1136.4 100.1 1002 28-1095 2070-3321(4289)
3 KOG4289|consensus 100.0 2E-127 4E-132 1108.5 78.9 902 34-1060 161-1080(2531)
4 KOG1219|consensus 100.0 7E-123 1E-127 1099.4 90.5 952 30-1093 22-975 (4289)
5 cd00031 CA Cadherin repeat dom 100.0 2E-28 4.4E-33 258.2 29.2 197 860-1061 1-199 (199)
6 cd00031 CA Cadherin repeat dom 100.0 3.7E-28 7.9E-33 256.3 28.6 196 756-955 2-199 (199)
7 KOG1834|consensus 99.7 2.8E-15 6E-20 167.1 20.2 214 844-1063 21-246 (952)
8 PF00028 Cadherin: Cadherin do 99.6 2.5E-15 5.4E-20 137.1 14.4 92 968-1060 1-93 (93)
9 PF00028 Cadherin: Cadherin do 99.6 3.6E-15 7.7E-20 136.1 14.0 92 756-847 1-93 (93)
10 KOG1834|consensus 99.6 1.6E-13 3.6E-18 153.2 20.1 207 743-956 25-245 (952)
11 smart00112 CA Cadherin repeats 99.5 3.7E-14 8.1E-19 125.0 11.1 79 776-854 1-79 (79)
12 smart00112 CA Cadherin repeats 99.5 1.1E-13 2.3E-18 122.1 11.2 79 988-1069 1-79 (79)
13 PF08758 Cadherin_pro: Cadheri 96.3 0.028 6.1E-07 50.1 9.2 78 154-237 2-79 (90)
14 PF08266 Cadherin_2: Cadherin- 96.2 0.0032 7E-08 55.3 2.7 62 757-819 4-67 (84)
15 PF08758 Cadherin_pro: Cadheri 96.1 0.041 9E-07 49.0 9.1 79 852-936 2-80 (90)
16 TIGR01965 VCBS_repeat VCBS rep 95.9 0.054 1.2E-06 48.8 9.2 88 399-499 2-98 (99)
17 PF08266 Cadherin_2: Cadherin- 95.8 0.018 3.8E-07 50.6 5.4 60 861-921 3-66 (84)
18 TIGR01965 VCBS_repeat VCBS rep 95.3 0.14 3.1E-06 46.2 9.4 89 983-1084 2-98 (99)
19 smart00736 CADG Dystroglycan-t 95.0 0.23 5E-06 45.3 10.1 70 509-587 23-96 (97)
20 smart00736 CADG Dystroglycan-t 94.7 0.36 7.7E-06 44.0 10.8 70 403-481 24-96 (97)
21 PF13750 Big_3_3: Bacterial Ig 94.7 2.5 5.5E-05 42.2 17.5 129 224-369 14-147 (158)
22 TIGR00864 PCC polycystin catio 94.0 42 0.00092 47.6 67.5 222 816-1085 1750-1982(2740)
23 PF13750 Big_3_3: Bacterial Ig 93.4 3.2 6.9E-05 41.5 15.3 125 819-954 14-148 (158)
24 TIGR00845 caca sodium/calcium 89.5 21 0.00046 45.4 19.7 190 479-683 395-616 (928)
25 TIGR00845 caca sodium/calcium 84.0 94 0.002 39.9 21.0 165 849-1024 395-588 (928)
26 PF05345 He_PIG: Putative Ig d 83.8 3 6.5E-05 32.6 5.3 36 899-936 13-49 (49)
27 PF05345 He_PIG: Putative Ig d 83.5 3.7 8E-05 32.1 5.7 37 199-237 11-48 (49)
28 PF05895 DUF859: Siphovirus pr 80.0 1.2E+02 0.0026 37.4 19.4 110 922-1046 297-425 (624)
29 KOG3597|consensus 73.0 1.1E+02 0.0024 35.8 15.9 153 839-1006 25-196 (442)
30 PF13753 SWM_repeat: Putative 71.4 1.8E+02 0.0039 32.7 17.6 206 447-674 11-230 (317)
31 KOG3597|consensus 70.6 95 0.0021 36.4 14.7 157 363-528 26-195 (442)
32 TIGR00864 PCC polycystin catio 70.0 5.3E+02 0.011 37.5 77.0 123 113-256 947-1084(2740)
33 PF07495 Y_Y_Y: Y_Y_Y domain; 66.6 41 0.00089 27.7 8.2 63 515-583 4-66 (66)
34 PF13753 SWM_repeat: Putative 66.1 2.3E+02 0.005 31.9 19.2 130 116-268 11-149 (317)
35 TIGR03660 T1SS_rpt_143 T1SS-14 61.1 53 0.0011 31.9 8.8 62 325-392 65-128 (137)
36 TIGR03660 T1SS_rpt_143 T1SS-14 59.5 83 0.0018 30.6 9.8 56 1021-1084 70-128 (137)
37 PF07495 Y_Y_Y: Y_Y_Y domain; 54.5 1.1E+02 0.0023 25.1 8.7 59 995-1059 7-65 (66)
38 PF03160 Calx-beta: Calx-beta 46.8 2.2E+02 0.0047 25.6 12.5 52 950-1005 2-53 (100)
39 PF03160 Calx-beta: Calx-beta 41.8 94 0.002 28.1 7.0 51 579-633 2-54 (100)
40 PF05895 DUF859: Siphovirus pr 41.7 7.8E+02 0.017 30.5 21.1 107 447-566 297-423 (624)
41 PF02010 REJ: REJ domain; Int 39.3 33 0.00071 40.7 4.5 32 449-486 162-193 (440)
42 cd00146 PKD polycystic kidney 33.1 1.6E+02 0.0036 25.1 6.8 31 219-255 51-81 (81)
43 KOG4221|consensus 30.5 1.5E+03 0.032 30.4 51.9 164 786-959 859-1029(1381)
44 PF02010 REJ: REJ domain; Int 29.8 62 0.0013 38.4 4.7 219 819-1059 49-297 (440)
45 KOG4221|consensus 28.9 1.5E+03 0.034 30.2 39.7 70 295-377 548-619 (1381)
46 PF12245 Big_3_2: Bacterial Ig 27.0 1.8E+02 0.0038 23.8 5.4 31 921-957 21-51 (60)
47 cd00146 PKD polycystic kidney 21.0 5.2E+02 0.011 21.9 7.8 63 612-681 17-79 (81)
48 PF02494 HYR: HYR domain; Int 20.0 5.6E+02 0.012 21.9 8.2 25 1033-1059 57-81 (81)
No 1
>KOG4289|consensus
Probab=100.00 E-value=3.3e-130 Score=1131.12 Aligned_cols=824 Identities=34% Similarity=0.523 Sum_probs=770.3
Q ss_pred CCCCCCCCCCceEEEeeCCCCCCcEEEEEEeeeCCCCCCeEEEEEEec-----CCccEEEEccccEEEEccCCCcCCCcE
Q psy1041 152 NDLNPLFYPTEYEETVPEDLPLHTSILRVSAEDADLGRNGEIYYSFRD-----MNEQFSIHPTSGVVTLTRPLKYTDRSV 226 (1095)
Q Consensus 152 NDn~P~F~~~~y~~~v~E~~~~g~~v~~v~A~D~D~g~n~~v~Y~l~~-----~~~~F~id~~tG~i~~~~~ld~e~~~~ 226 (1095)
.-|+|.|.+..|...++||.|.||.|++|+|.|+|. +++.|++.. ..+.|+||+.+|.|++.+.||||....
T Consensus 158 ~~~~~~Fqq~~Yq~~lpEn~pagT~iasv~A~~~~a---~rl~Ysm~al~dsRS~~lFslD~~sG~irta~~lDREt~e~ 234 (2531)
T KOG4289|consen 158 AANAVQFQQPNYQKELPENEPAGTIIASVKASDPDA---GRLYYSMVALFDSRSQNLFSLDPMSGAIRTAKSLDRETKET 234 (2531)
T ss_pred CCCCccCCCcchhccCcCCCCCCceeEEEEecCCCc---CceEEEeeeccchhccccEeeccccccchhhhhhhhhhhhe
Confidence 457899999999999999999999999999999995 469999974 357899999999999999999999999
Q ss_pred EEEEEEEeecCccccCCCCcceEEEEEEEEeccCCcCeeEEeecCcccccCCcc--EEEEEEEEeCCCCCCCeEEEEEee
Q psy1041 227 HDLVVLGQDRGSVFKGGGKPSSAKLKIKVEQINLYGPEIYVQSLPDIVEQSYAD--IYAIVRVVDRDAGIHGEIASLDIV 304 (1095)
Q Consensus 227 ~~l~V~A~D~g~~~~~~~~s~~~~v~I~V~dvNd~~P~f~~~~~~~~~~~~~~~--~~~~v~a~D~D~g~n~~v~~~~i~ 304 (1095)
|.|+|+|.|.|.| .+|++++|+|.|.|.|||.|+|.+..|.....|..+. .+-.++|+|.|++.|+.+. |+++
T Consensus 235 HvlrVtA~d~~~P----~~SAtttv~V~V~D~nDhsPvFEq~~Y~e~lREn~evGy~vLtvrAtD~Dsp~Nani~-Yrl~ 309 (2531)
T KOG4289|consen 235 HVLRVTAQDHGDP----RRSATTTVTVLVLDTNDHSPVFEQDEYREELRENLEVGYEVLTVRATDGDSPPNANIR-YRLL 309 (2531)
T ss_pred eEEEEEeeecCCC----cccceeEEEEEEeecCCCCcccchhHHHHHHhhccccCceEEEEEeccCCCCCCCceE-EEec
Confidence 9999999999988 7899999999999999999999998887766665543 5678899999999999997 6899
Q ss_pred cCCCCCCeEEeeeccCCCCcCceEEEEECccccccCCCCceEEEEEEEECCCCCeeeEEEEEEEeccCCCCCCCccCCcE
Q psy1041 305 DGDPDGHFRIVPTKIDPGTKKKEYNIVVLKLLDREIAPLGYNLTLRAVDKGTPPRETYKATQVHLVDLNDNKPVFDREIY 384 (1095)
Q Consensus 305 ~g~~~~~F~i~~~~~~~~~~~g~~~i~~~~~lD~E~~~~~y~l~v~a~D~g~p~~~s~~~~~i~v~d~Nd~~P~F~~~~~ 384 (1095)
.|+....|+|++. +| .|.+..+||||.... |+|.|.|+|.|.|+.-.++.+.|+|.|.|||+|+|....|
T Consensus 310 eg~~~~~f~in~r-------SG--vI~T~a~lDRE~~~~-y~L~VeAsDqG~~pgp~Ta~V~itV~D~NDNaPqFse~~Y 379 (2531)
T KOG4289|consen 310 EGNAKNVFEINPR-------SG--VISTRAPLDREELES-YQLDVEASDQGRPPGPRTAMVEITVEDENDNAPQFSEKRY 379 (2531)
T ss_pred CCCccceeEEcCc-------cc--eeeccCccCHHhhhh-eEEEEEeccCCCCCCCceEEEEEEEEecCCCCccccccce
Confidence 9988899999987 67 699999999999876 9999999999999877788999999999999999999999
Q ss_pred EEEEeCCCCCCceEEEEEEeeCCCCCCceEEEEEEeCCCCCcEEEECCccEEEEceecCcccccEEEEEEEEEeCCCCCC
Q psy1041 385 EVDVPETTPVNTPIIRLKVSDADDGKNAQVFLEIVGGNEGGEFNINPETGMLYTAVTLDAEDKAFYTLTVSAIDQGNAGT 464 (1095)
Q Consensus 385 ~~~v~E~~~~g~~v~~v~a~D~D~g~n~~i~ysi~~~~~~~~F~Id~~tG~i~~~~~LD~E~~~~y~l~V~a~D~g~~~~ 464 (1095)
.+.|.|+..+++.|++|+|+|.|.|.|+.+.|+|.+|+..|.|.||..||+|.+..+||+|.. .|++.|+|.|+|.|+
T Consensus 380 vvqv~Edvt~~avvlrV~AtDrD~g~Ng~VHYsi~Sgn~~G~f~id~~tGel~vv~plD~e~~-~ytl~IrAqDggrPp- 457 (2531)
T KOG4289|consen 380 VVQVREDVTPPAVVLRVTATDRDKGTNGKVHYSIASGNGRGQFYIDSLTGELDVVEPLDFENS-EYTLRIRAQDGGRPP- 457 (2531)
T ss_pred EEEecccCCCCceEEEEEecccCCCcCceEEEEeeccCccccEEEecccceEEEeccccccCC-eeEEEEEcccCCCCC-
Confidence 999999999999999999999999999999999999999999999999999999999999998 999999999999875
Q ss_pred ceeeeEEEEEEEeecCCCCCcccCCccEEEEeecCCCCcEEEEEEEeeCCCCCCceEEEEEecCCCCCcEEecccceEEE
Q psy1041 465 RKQSAAKVKVNIVDTNDNDPLFDSPEMEVSINENEPAGTSVIKVTAKDKDSGENAYISYSIANLKPVPFEIDHFSGVIKT 544 (1095)
Q Consensus 465 ~~~~~~~v~I~V~DvNDn~P~f~~~~~~~~V~E~~~~gt~v~~v~A~D~D~g~n~~i~ysi~~~~~~~F~Id~~tG~i~~ 544 (1095)
++.+.-+.|.|+|+|||+|.|...++.++|-||.+.|..++.++|.|+|+|.|+.+.|++.+. ++|.|+..+|+|++
T Consensus 458 -Lsn~sgl~iqVlDINDhaPifvstpfq~tvlEnv~lg~~v~~vqaidadsg~na~l~y~laG~--~pf~I~~~SG~Itv 534 (2531)
T KOG4289|consen 458 -LSNTSGLVIQVLDINDHAPIFVSTPFQATVLENVPLGYLVCHVQAIDADSGENARLHYSLAGV--GPFQINNGSGWITV 534 (2531)
T ss_pred -ccCCCceEEEEEecCCCCceeEechhhhhhhhcccccceEEEEecccCCCCcccceeeeeccC--CCeeEecCCceEEE
Confidence 677777889999999999999999999999999999999999999999999999999999974 58999999999999
Q ss_pred ceeccccccccEEEEEEEEEECCcCCceeeEEEEEEEEEeCCCCCCcccccceEEEecCCCCCCcEEEEEEEEeCCCCCe
Q psy1041 545 TQVLDYESMRREYILRVRASDWGLPYRRQTEMQLKIKLLDVNDNRPQFEKVDCLGHVPRNLPIGREIITLSAIDFDAGNI 624 (1095)
Q Consensus 545 ~~~lD~E~~~~~~~l~V~a~D~g~p~~~s~~~~v~I~V~dvNDn~P~f~~~~~~~~V~E~~~~g~~v~~v~A~D~D~~~~ 624 (1095)
++.||||+. ..|.|.|+|+|+|.|+. ++.+.|.|+++|+|||.|.|++..|...+.|+++.|+.|++|+|+|.|....
T Consensus 535 tk~ldrEt~-~~ysl~V~ard~gtp~l-~tstsI~Vtv~dvndndP~Ft~~eytl~inED~pvgsSI~tvtAvD~d~~s~ 612 (2531)
T KOG4289|consen 535 TKELDRETV-EHYSLGVEARDHGTPPL-STSTSISVTVLDVNDNDPTFTQKEYTLRINEDAPVGSSIVTVTAVDRDANSV 612 (2531)
T ss_pred eeccccccc-ceEEEEEEEcCCCCCcc-cccceEEEEecccCCCCCccccCceEEEecCCccccceEEEEEEeccccccc
Confidence 999999994 78999999999999875 5779999999999999999999999999999999999999999999999999
Q ss_pred EEEEEEeCCCCCcEEEeCCC--cEEEEeeccCcccccEEEEEEEEecCCCccceEEEEEEEeecccCCCCCccccCCCCC
Q psy1041 625 ISYRIVSGNEDGCFALDITS--GVLSIACDLTDVRVNEREINVTATDSAHFSDVVRIRINLVSARRIPEPGKTLENDSGG 702 (1095)
Q Consensus 625 i~y~i~~~~~~~~F~Id~~t--G~i~~~~~ld~~~~~~~~l~V~atD~~~~s~~~~v~I~v~~~~~~~~~~~~~~~~~~~ 702 (1095)
++|.|.+++....|.|+... |.|+++.++++....+|.+.|+|+| |...+...++|.
T Consensus 613 ityqi~g~ntrn~Fsi~si~g~Glitlalp~dkKqe~~~vl~vtAtD-g~l~d~~~V~v~-------------------- 671 (2531)
T KOG4289|consen 613 ITYQITGGNTRNRFSISSIGGGGLITLALPLDKKQERQYVLAVTATD-GTLQDTCSVNVN-------------------- 671 (2531)
T ss_pred eEEEecCCcccccceeeccCCcceEEeecchhhcccceEEEEEEecC-CccccceEEEEE--------------------
Confidence 99999999988999999876 7899999999999999999999999 445566666553
Q ss_pred eEEeeccceeeeehhhhhhhhcccccccccceeeeccccccccCCCCccccCcEEEEEecCCCCCcEEEEEEEEeCCCCC
Q psy1041 703 FECKDTGVARRLTEVLAAAEKNNLRSQSYQEEFAMMPSRYGENVHVPEFFSFPIELQVNESVPLKSTLTKIIARDRDLGY 782 (1095)
Q Consensus 703 ~~~~~t~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~v~dvNd~~P~f~~~~~~~~v~E~~~~g~~v~~v~A~D~D~g~ 782 (1095)
|.|.|-++|.|...+|.++|+|..|.|+.|.+++|+|.|.|.
T Consensus 672 --------------------------------------I~danThrpvFqs~pfTvsI~e~rP~G~tvvtlsasd~D~ge 713 (2531)
T KOG4289|consen 672 --------------------------------------ITDANTHRPVFQSSPFTVSINEDRPLGTTVVTLSASDEDTGE 713 (2531)
T ss_pred --------------------------------------eeecccCCcccccCCeeEeeccCCcCCceeEEEecccCCCCc
Confidence 467888999999999999999999999999999999999999
Q ss_pred CcEEEEEEEeCCCCCcEEEeCCceEEEEcccCCccCCCEEEEEEEEEECCCCCCceeEEEEEEEEecCCCCCcccccceE
Q psy1041 783 NGKLVFGISSGDNDSVFRIDPDSGELKVVGYLDRERTSEYTLNITVYDLGKPQKSTSKMLPITILDVNDNPPKFEKSLAS 862 (1095)
Q Consensus 783 n~~v~y~i~~g~~~~~F~Id~~tG~i~~~~~LD~E~~~~y~l~V~a~D~g~p~~s~~~~v~I~V~DvNDn~P~F~~~~y~ 862 (1095)
|++|+|-+ . +..|+||+++|.+++...||||.+-.|.+.++|.|.|.|+++.+++|.|.|.|+|||+|+|..+.|.
T Consensus 714 NARI~y~l-e---d~~Frid~dsg~i~t~~~ld~edqvtytl~itA~D~~~pq~adtttveV~v~diNDnaPqf~assyt 789 (2531)
T KOG4289|consen 714 NARITYIL-E---DEAFRIDPDSGAIYTQAELDYEDQVTYTLAITARDNGIPQKADTTTVEVLVNDINDNAPQFLASSYT 789 (2531)
T ss_pred cceEEEEe-c---ccceeecCCCCceEEeeeeecccceeeEeeeeecCCCCCCcCccEEEEEEeecccccCcccchhhce
Confidence 99999944 3 3459999999999999999999999999999999999999999999999999999999999999999
Q ss_pred EEEecCCCCCeEEEEEEEEeCCCCCCceEEEEecCCC---cceEEeCCcceEEEcccccccccCeeEEEEEEEeCCCCCC
Q psy1041 863 FRVTENALNGTVIFKVNATDLDLGDNAKVVYSLMTDT---QDFAVDSATGSLYVSASLDRERQDLYELKIRASDCDGRND 939 (1095)
Q Consensus 863 ~~V~En~~~gt~v~~v~A~D~D~g~n~~v~Ysl~~~~---~~F~Id~~tG~i~~~~~LD~E~~~~y~l~V~A~D~~g~~~ 939 (1095)
++|.|++|++|.|++|+|+|+|.|.|+.+.|.+.++. +.|.|++.+|.|++...||||....|.|.+.|+|.+
T Consensus 790 ~sV~Ed~Pv~TsvlQVSatDaD~g~Ng~v~y~~qg~~d~p~~F~IEptSGviRtl~rLdRE~~avy~L~a~avDrg---- 865 (2531)
T KOG4289|consen 790 GSVFEDAPVFTSVLQVSATDADSGPNGRVYYTFQGGDDGPGDFYIEPTSGVIRTLRRLDRENVAVYVLAAYAVDRG---- 865 (2531)
T ss_pred eEeecCCCCcceEEEEEEeccCCCCCceEEEEecCCCCCCCceEEccCcceeehhhhhcchheeEEEEEEEEeeCC----
Confidence 9999999999999999999999999999999997653 899999999999999999999999999999999999
Q ss_pred CcceeEEEEEEEEEeccCCCCCeeecCceEEEEeCCCCCCcEEEEEEEEcCCCCCCceEEEEEEeCCCCCCCEEEeCCce
Q psy1041 940 MYTLHADALVRVTIDDINDNAPNFALPNYSVKVREDIPVGTVVAILSASDPDLGQGGVVRYTIVSDNEADDVFSIDRLTG 1019 (1095)
Q Consensus 940 ~~~~~~~~~v~I~V~DvNDn~P~f~~~~y~~~v~E~~~~g~~v~~v~A~D~D~g~n~~v~Y~i~~~~~~~~~F~Id~~tG 1019 (1095)
.|++++.+.|+|+|.|+|||||+|.+..|...|.||.++|..+++++|.|+|+|+|+.|+|+|.+++ ....|.++...|
T Consensus 866 ~p~ls~~~eItvtvldvNDnaPvfe~~e~e~~I~enspvgs~va~i~a~dpdEG~NA~IsYqIvgg~-d~~~fq~de~~~ 944 (2531)
T KOG4289|consen 866 NPPLSAPVEITVTVLDVNDNAPVFEQDELELFIEENSPVGSVVALITADDPDEGPNAHISYQIVGGN-DPELFQLDEFSG 944 (2531)
T ss_pred CCCcCCceEEEEEEEecCCCCCCCCCcceeeEEeecCccceeeEEEEccCCCcCCcceEEEeeccCc-cHHHHHHHHhhh
Confidence 7789999999999999999999999999999999999999999999999999999999999999965 478999999999
Q ss_pred EEEEcccCCCC-CCCEEEEEEEEEECCCCCceeEEEEEEEEEecCCCCCCCeeec
Q psy1041 1020 TIRVAKPLDFE-KRQVHSLVVRAKDNGSPPLYSEATLIVEVSDVNENMNAPVFSD 1073 (1095)
Q Consensus 1020 ~i~~~~~ld~E-~~~~~~l~V~A~D~g~p~ls~~~~v~I~V~dvNdn~~~P~F~~ 1073 (1095)
+|.....|||| ....|.+.++|+- -|+.+.+++.|+|.|.||| +|....
T Consensus 945 ~lla~~efdyef~~~eyv~~~qats---~plvS~atv~i~vsd~ndn--~pvl~~ 994 (2531)
T KOG4289|consen 945 ELLALVEFDYEFTRVEYVLVVQATS---APLVSRATVHIRVSDQNDN--PPVLED 994 (2531)
T ss_pred hhhhheeehhhhccceeeEEeeccc---cccccceeEEEEecccCCC--chhhcc
Confidence 99999999999 7889999999864 3589999999999999999 777543
No 2
>KOG1219|consensus
Probab=100.00 E-value=3e-127 Score=1136.39 Aligned_cols=1002 Identities=28% Similarity=0.391 Sum_probs=848.4
Q ss_pred cCCCCCCceeeCCceEEEEEcCCCCCceeccCcccccccCCCceeEEEEEEeCCCCCceEeeEeeecCeEEEEEEecCCC
Q psy1041 28 DIGRSSSLRFTQKDYNVSISENSNSKTYVTPEEKMGIYRGTSEVDIRFKISSGDRDKFFKAEERLVGDFWFLLIRTRTGN 107 (1095)
Q Consensus 28 ~~~~~~~~~F~~~~y~~~v~En~~~gt~v~~~~~~~~~~~~~~~~i~ysi~~~~~~~~F~i~~~~~g~~~~~~i~~~~g~ 107 (1095)
..+...| .|.++.|.+.|+|+....++|....+.. .-+..+.|+|.+|+ ...|+++-. +|. |+...
T Consensus 2070 v~nka~P-vF~~~~y~avi~e~~tv~spvv~vqa~s----~l~~kv~YsIldg~-~slFtvnf~-TG~-----i~v~~-- 2135 (4289)
T KOG1219|consen 2070 VENKAAP-VFITPDYVAVIEELITVSSPVVHVQAAS----PLGLKVTYSILDGN-TSLFTVNFT-TGV-----ILVLI-- 2135 (4289)
T ss_pred EecCCCc-ceecCcEEEEeecccccccceeEEeecC----CcCCceEEEEecCC-cceEEEecc-cce-----EEecc--
Confidence 3444455 7999999999999999887774222111 11345999999997 678998764 442 33332
Q ss_pred ccccCccccceEEEEEEEEEeccCCCCCCceeEEEEEEEEEecCCCCCCCCCCCceEEEeeCCCCCCcEEEEEEeeeCCC
Q psy1041 108 TDVLNRERKDKYILHIKATITHRDGKKASYEETTCKVHVNVLDTNDLNPLFYPTEYEETVPEDLPLHTSILRVSAEDADL 187 (1095)
Q Consensus 108 ~~~lD~E~~~~y~l~V~A~d~~~~~~~~~~~~~~~~v~I~V~D~NDn~P~F~~~~y~~~v~E~~~~g~~v~~v~A~D~D~ 187 (1095)
|||||....|.|.|+|+| .+-++.+.++|.|.|.|+|||+|+|++..|..+++|+.++|+.|+|+.|+|.|.
T Consensus 2136 --pLd~ea~t~h~l~ieAtd------~~~p~~Aea~VeIiV~dIndn~PvFeqlsYt~sisE~s~igt~viqilATdsDs 2207 (4289)
T KOG1219|consen 2136 --PLDREASTLHELLIEATD------AGIPLSAEAKVEIIVGDINDNPPVFEQLSYTISISENSKIGTKVIQILATDSDS 2207 (4289)
T ss_pred --ccccccccceEEEEEEec------cCCCcceeeEEEEEecccCCCCchhheeeEEEEccCCCccCceEEEEEeccCCC
Confidence 899999999999999999 344589999999999999999999999999999999999999999999999996
Q ss_pred CCCeEEEEEEecC---CccEEEEccccEEEEccCCCcCCCcEEEEEEEEeecCccccCCCCcceEEEEEEEEeccCCcCe
Q psy1041 188 GRNGEIYYSFRDM---NEQFSIHPTSGVVTLTRPLKYTDRSVHDLVVLGQDRGSVFKGGGKPSSAKLKIKVEQINLYGPE 264 (1095)
Q Consensus 188 g~n~~v~Y~l~~~---~~~F~id~~tG~i~~~~~ld~e~~~~~~l~V~A~D~g~~~~~~~~s~~~~v~I~V~dvNd~~P~ 264 (1095)
|+.|.|+|.+. +..|.||+.||+|++.+.||||+.+.|.|.|+|+|+|.| +++.+-|.|.|+|+|||+|.
T Consensus 2208 --n~~isYsl~g~s~~sk~f~In~sTG~it~~g~ldyE~~q~f~~fvratdggk~-----lSseviv~V~VeD~Ndn~Pe 2280 (4289)
T KOG1219|consen 2208 --NREISYSLEGNSEISKPFRINVSTGWITVAGKLDYEENQEFRFFVRATDGGKP-----LSSEVIVEVHVEDFNDNPPE 2280 (4289)
T ss_pred --CCceEEEeecCCccccceEEecccceEEEeeecChhhcceEEEEEEEccCCCc-----ccccEEEEEEehhcCCCCch
Confidence 99999999863 458999999999999999999999999999999999974 78999999999999999999
Q ss_pred eEEeecCcccccCCc--cEEEEEEEEeCCCC-------------------------------------------------
Q psy1041 265 IYVQSLPDIVEQSYA--DIYAIVRVVDRDAG------------------------------------------------- 293 (1095)
Q Consensus 265 f~~~~~~~~~~~~~~--~~~~~v~a~D~D~g------------------------------------------------- 293 (1095)
|.+..+...+.+.+. ..+..+.|.|.|..
T Consensus 2281 f~q~~~ea~vsd~a~~g~fit~v~a~D~Dssd~lk~ey~~~~~l~~s~~G~iTlfNl~k~~l~~s~~lrv~vsD~v~~at 2360 (4289)
T KOG1219|consen 2281 FNQRNYEAFVSDPARSGHFITVVNAHDLDSSDHLKLEYNSNHFLILSENGIITLFNLLKSPLQTSYPLRVTVSDGVFRAT 2360 (4289)
T ss_pred hccccceeecCCCccceeEEEEEEeccCCccchhhhhhcccceeeeccCceEEehhhcccccccccceeeeeccCcceee
Confidence 987766544444332 25555666666643
Q ss_pred -------------------------------------CCCeEEEEEeecCCCCCCeEEeeeccCCCCcCceEEEEECccc
Q psy1041 294 -------------------------------------IHGEIASLDIVDGDPDGHFRIVPTKIDPGTKKKEYNIVVLKLL 336 (1095)
Q Consensus 294 -------------------------------------~n~~v~~~~i~~g~~~~~F~i~~~~~~~~~~~g~~~i~~~~~l 336 (1095)
.+|.+.|+++.+- ...+|+|++. | .|.+.+.|
T Consensus 2361 ~~vl~~~~~~n~~~~lveka~l~Tv~~~~~~~~~~f~~~gt~~~~si~s~-~sd~~~in~~--------G--qI~t~~kl 2429 (4289)
T KOG1219|consen 2361 MEVLFHPHSRNHFSELVEKADLVTVVEHDEQEDADFGAYGTSIYYSINSR-ASDHFEINKS--------G--QIKTLSKL 2429 (4289)
T ss_pred eEEEEEecCcccchhhhhccceeEEEEecCccccccccCCceeeeeechh-ccCceeECCC--------c--cEEeeehh
Confidence 2232222222211 1234666543 3 68888889
Q ss_pred cccCCCCceE--EEEEEEECCCCCeeeEEEEEEEeccCCCCCCCccCCcEEEEEeCCCCCCceEEEEEEeeCCCCCCceE
Q psy1041 337 DREIAPLGYN--LTLRAVDKGTPPRETYKATQVHLVDLNDNKPVFDREIYEVDVPETTPVNTPIIRLKVSDADDGKNAQV 414 (1095)
Q Consensus 337 D~E~~~~~y~--l~v~a~D~g~p~~~s~~~~~i~v~d~Nd~~P~F~~~~~~~~v~E~~~~g~~v~~v~a~D~D~g~n~~i 414 (1095)
|||.... |. +.+.|.|+|+ +.+..+++|.+.|+|||||.|....|+++|.|++..|..|+.+.|+|+|.|.|+.+
T Consensus 2430 d~e~s~~-~vi~i~v~a~Da~g--r~af~tvti~ltDiNDnpPqF~a~~Y~~nI~enaskg~~V~~v~A~D~De~snadv 2506 (4289)
T KOG1219|consen 2430 DREYSEE-LVIIIAVMAFDAGG--RVAFCTVTIILTDINDNPPQFDAQLYRVNITENASKGKLVGHVIARDADEGSNADV 2506 (4289)
T ss_pred hhccCce-EEEEEEEEEecCCC--eEEEEEEEEEEEecCCCCccccceeEEEEeecccCCCceEEEEEEecCCCCCcccE
Confidence 9997654 54 4555558875 55668899999999999999999999999999999999999999999999999999
Q ss_pred EEEEEeC-CCCCcEEEECCccEEEEceecCcccccEEEEEEEEEeCCCCCCceeeeEEEEEEEeecCCCCCcccCCccEE
Q psy1041 415 FLEIVGG-NEGGEFNINPETGMLYTAVTLDAEDKAFYTLTVSAIDQGNAGTRKQSAAKVKVNIVDTNDNDPLFDSPEMEV 493 (1095)
Q Consensus 415 ~ysi~~~-~~~~~F~Id~~tG~i~~~~~LD~E~~~~y~l~V~a~D~g~~~~~~~~~~~v~I~V~DvNDn~P~f~~~~~~~ 493 (1095)
+|.+.++ .....|.|++ +|.|.+.+.|++++...|.|.|+|+|.|.|. +.+.++|.++|.+..++.|.|..+.|.+
T Consensus 2507 ty~i~~e~~~~~v~~in~-sG~Itv~~sL~~~en~tl~l~vkA~D~g~P~--~~s~ttV~v~vl~e~v~lPrFSep~y~f 2583 (4289)
T KOG1219|consen 2507 TYEIVGESDVKHVFEINE-SGVITVKRSLDGLENSTLHLFVKAIDDGKPR--RRSNTTVIVTVLPEDVNLPRFSEPIYTF 2583 (4289)
T ss_pred EEEecCchhhhheeeecC-CceEEeehhhhcccCcEEEEEEEeccCCCCC--cccceEEEEEecCcccCcccccCceEEE
Confidence 9999876 3446889998 8999999999999999999999999999764 6789999999999999999999999999
Q ss_pred EEeecCCCCcEEEEEEEeeCCCCCCceEEEEEecC-C-----CCCcEEecccceEEEceeccccccccEEEEEEEEEECC
Q psy1041 494 SINENEPAGTSVIKVTAKDKDSGENAYISYSIANL-K-----PVPFEIDHFSGVIKTTQVLDYESMRREYILRVRASDWG 567 (1095)
Q Consensus 494 ~V~E~~~~gt~v~~v~A~D~D~g~n~~i~ysi~~~-~-----~~~F~Id~~tG~i~~~~~lD~E~~~~~~~l~V~a~D~g 567 (1095)
+|+|+.+.|+.|++|+|.|+|.. +-|++.-+ . ..+|+||+.||.|.+.++||+|+ +++|.++|+|++++
T Consensus 2584 svpEDv~vG~~Ig~v~a~~a~~~----~i~~~v~~gt~Esn~d~~Fsvdr~TG~i~v~ksLD~E~-kk~yqi~v~a~~~~ 2658 (4289)
T KOG1219|consen 2584 SVPEDVPVGEEIGQVSASDADEH----VIYSLVLGGTPESNPDLPFSVDRNTGMIKVNKSLDHEK-KKSYQIKVKATCGQ 2658 (4289)
T ss_pred eccccCCCCCeeeEEeecccCCc----eEEEEEeCCCCCCCCCCceEEcCCCceEEeccccchhh-hceEEEEEEeecCC
Confidence 99999999999999999999852 45665432 2 24699999999999999999998 68999999999987
Q ss_pred cCCceeeEEEEEEEEEeCCCCCCcccccceEEEecCCCCCCcEEEEEEEEeCCCCC--eEEEEEEeCCCCCcEEEeCCCc
Q psy1041 568 LPYRRQTEMQLKIKLLDVNDNRPQFEKVDCLGHVPRNLPIGREIITLSAIDFDAGN--IISYRIVSGNEDGCFALDITSG 645 (1095)
Q Consensus 568 ~p~~~s~~~~v~I~V~dvNDn~P~f~~~~~~~~V~E~~~~g~~v~~v~A~D~D~~~--~i~y~i~~~~~~~~F~Id~~tG 645 (1095)
. .-+.++|.|.|.|+|||+|+|....|.+.+.||+|.|+.|++++|.|+|.+. +++|+|.+.. .+|.|+++||
T Consensus 2659 ~---vva~tsv~vqVkDvNDNaPvFe~d~y~f~i~En~pvGtsV~qf~AsD~Ds~~nGqirysl~~~v--~yF~In~etG 2733 (4289)
T KOG1219|consen 2659 W---VVAETSVFVQVKDVNDNAPVFEKDPYLFIIEENSPVGTSVIQFHASDMDSGNNGQIRYSLTSPV--PYFAINPETG 2733 (4289)
T ss_pred c---eEEEEEEEEEeecccCCCccccCCceeEEEeccCCCCceEEEEEeeccCCCCCceEEEEEcCCc--ceEEEcCCCC
Confidence 6 2467899999999999999999999999999999999999999999999987 8999998753 3999999999
Q ss_pred EEEEeeccCcccccEEEEEEEEecCCCccceEEEEEEEeecccCCCCC-------ccccC--------------------
Q psy1041 646 VLSIACDLTDVRVNEREINVTATDSAHFSDVVRIRINLVSARRIPEPG-------KTLEN-------------------- 698 (1095)
Q Consensus 646 ~i~~~~~ld~~~~~~~~l~V~atD~~~~s~~~~v~I~v~~~~~~~~~~-------~~~~~-------------------- 698 (1095)
+|++...||.|+...|.|.|+|+|.|.++..+.+.++|.+.|+.|++. ...++
T Consensus 2734 wlTt~~eld~ek~d~y~lkv~AtDhG~~ssq~~v~v~vtDvndspprf~~eiy~gtvv~d~p~~~~ia~~si~d~D~s~~ 2813 (4289)
T KOG1219|consen 2734 WLTTLFELDLEKQDLYSLKVVATDHGVPSSQATVLVHVTDVNDSPPRFQREIYEGTVVEDVPGGKIIAGLSIFDADVSEV 2813 (4289)
T ss_pred eeeehhhhccccCCceEEEEEEecCCcccccceEEEEEEecCCCcchhhhHhhccceeccCCCCceeeeeEecccccccc
Confidence 999999999999999999999999999999999999999999776521 01111
Q ss_pred -----------CCCCeEEeeccceeeeehhhhh-----------------------------------------------
Q psy1041 699 -----------DSGGFECKDTGVARRLTEVLAA----------------------------------------------- 720 (1095)
Q Consensus 699 -----------~~~~~~~~~t~~~~~~~~~l~~----------------------------------------------- 720 (1095)
..+.|...+......+...++.
T Consensus 2814 nq~t~fI~~gd~~gqF~~i~ne~~~~~kKt~~~E~t~ny~Ltvtatdg~f~~s~~vkv~v~~s~dn~~~c~~~~~t~i~~ 2893 (4289)
T KOG1219|consen 2814 NQVTGFITLGDPLGQFWIIENEWIYEFKKTLDRESTKNYLLTVTATDGIFMNSDNVKVLVLDSNDNSPFCGNQLYTKIQC 2893 (4289)
T ss_pred ceeEEEEeCCCccceEEEEcCcceEEEecchhhhcccceEEEEEEeccceeccceEEEEeeccccCCccCcchhccceec
Confidence 1122211000000000000000
Q ss_pred ---------------------hhhc--------------cccccc------ccce-----------------e---eecc
Q psy1041 721 ---------------------AEKN--------------NLRSQS------YQEE-----------------F---AMMP 739 (1095)
Q Consensus 721 ---------------------~~~~--------------~~~~~~------~~~~-----------------~---~~~~ 739 (1095)
..++ +.+... .+.+ + ....
T Consensus 2894 ed~f~gk~ilkisal~~dn~tna~itf~L~~~~a~kf~lnp~tgilkt~~~~~~e~~~~~nLe~~at~~~gr~cqa~Itv 2973 (4289)
T KOG1219|consen 2894 EDVFPGKQILKISALDVDNLTNARITFPLAGQGAIKFKLNPKTGILKTPTDLDRETIIVKNLENAATDGGGRVCQANITV 2973 (4289)
T ss_pred cccCCceeEEEeeeeccCccccceeeeeccCCCcceEEEcCccceEecCCCCcchhhHHHhHHhHhhhcccceeeeeEEE
Confidence 0000 000000 0000 0 0112
Q ss_pred ccccccCCCCccccCcEEEEEecCCCCCcEEEEEEEEeCCCCCCcEEEEEEEeCCCCCcEEEeCCceEEEEcccCCccCC
Q psy1041 740 SRYGENVHVPEFFSFPIELQVNESVPLKSTLTKIIARDRDLGYNGKLVFGISSGDNDSVFRIDPDSGELKVVGYLDRERT 819 (1095)
Q Consensus 740 ~v~dvNd~~P~f~~~~~~~~v~E~~~~g~~v~~v~A~D~D~g~n~~v~y~i~~g~~~~~F~Id~~tG~i~~~~~LD~E~~ 819 (1095)
++.|+|||+|.|....+.+.|-||...++.+.++.++|+|.|.+.++.|++.+. .++.|+|+..+|.+.+.++||.|.+
T Consensus 2974 ~~edvNdn~p~~~e~~~ai~ifente~~tl~~~v~r~~ad~~~f~ki~y~l~ds-a~g~fsidei~gvi~l~kpLd~e~~ 3052 (4289)
T KOG1219|consen 2974 SNEDVNDNAPTFLEETMAIDIFENTERSTLTITVNRSDADCGVFQKIFYSLEDS-ANGSFSIDEIHGVIWLEKPLDGEQQ 3052 (4289)
T ss_pred EecccccccccccchheEEEeecCCCcccceEEEeeccccccchhheEEEeeec-cCCccchhhccceEEeccccccchh
Confidence 477999999999999999999999999999999999999999999999999874 4679999999999999999999999
Q ss_pred CEEEEEEEEEECCCCCCceeEEEEEEEEecCCCCCcccccceEEEEecCCCCCeEEEEEEEEeCCCCCCceEEEEecCCC
Q psy1041 820 SEYTLNITVYDLGKPQKSTSKMLPITILDVNDNPPKFEKSLASFRVTENALNGTVIFKVNATDLDLGDNAKVVYSLMTDT 899 (1095)
Q Consensus 820 ~~y~l~V~a~D~g~p~~s~~~~v~I~V~DvNDn~P~F~~~~y~~~V~En~~~gt~v~~v~A~D~D~g~n~~v~Ysl~~~~ 899 (1095)
..|.|++.|.|.|. ++....+|.+.|.|.|||+|+|..+.|...|.|+.++|+.+.+++|.-+|. .+...-|.+.+++
T Consensus 3053 ~~f~lTv~asd~g~-~l~~l~tvlvsv~d~ndn~pvfe~seys~~v~e~~~vg~ev~~~~a~~~d~-~ae~~~y~~~~~n 3130 (4289)
T KOG1219|consen 3053 AIFDLTVLASDNGS-SLIFLGTVLVSVYDFNDNIPVFEDSEYSSAVHEDVTVGTEVLQTTANIRDF-AAENTNYPNLSGN 3130 (4289)
T ss_pred cceeeEEEEecCCc-eeEEeccEEEEEEecccCCccccccccceeeecccccCchhhhhhhccCCc-cccccccccccCC
Confidence 99999999999988 678889999999999999999999999999999999999999999998882 2334559999886
Q ss_pred --cceEEeCCcceEEEcccccccccCeeEEEEEEEeCCCCCCCcceeEEEEEEEEEeccCCCCCeeecCceEEEEeCCCC
Q psy1041 900 --QDFAVDSATGSLYVSASLDRERQDLYELKIRASDCDGRNDMYTLHADALVRVTIDDINDNAPNFALPNYSVKVREDIP 977 (1095)
Q Consensus 900 --~~F~Id~~tG~i~~~~~LD~E~~~~y~l~V~A~D~~g~~~~~~~~~~~~v~I~V~DvNDn~P~f~~~~y~~~v~E~~~ 977 (1095)
+.|++|+.||.+++.++||+|....|.|+|.|.|.+ .++++..++|.|.|.|+|||.|.|.+..|...+.||+.
T Consensus 3131 ~~g~fsLds~tgil~li~sldfet~skl~ltV~a~d~g----~p~Le~~atV~inv~DvnDn~p~f~qe~yv~~v~enal 3206 (4289)
T KOG1219|consen 3131 ESGFFSLDSDTGILSLIGSLDFETSSKLSLTVEAVDVG----GPSLEDVATVRINVTDVNDNVPSFDQEYYVTSVTENAL 3206 (4289)
T ss_pred CcceEEeccccceEEEecccCcccchhheEEEEEecCC----CCCccceeEEEEEeeecccCCCcccccceeEEEeeccc
Confidence 789999999999999999999999999999999999 57899999999999999999999999999999999999
Q ss_pred CCcEEEEEEEEcCCCCCCceEEEEEEeCCCCCCCEEEeCCceEEEEcccCCCCCCCEEEEEEEEEECCCCCceeEEEEEE
Q psy1041 978 VGTVVAILSASDPDLGQGGVVRYTIVSDNEADDVFSIDRLTGTIRVAKPLDFEKRQVHSLVVRAKDNGSPPLYSEATLIV 1057 (1095)
Q Consensus 978 ~g~~v~~v~A~D~D~g~n~~v~Y~i~~~~~~~~~F~Id~~tG~i~~~~~ld~E~~~~~~l~V~A~D~g~p~ls~~~~v~I 1057 (1095)
.|..+.+|.|.|.|...|+.|.|+|..+ +....|.||+..|++-+.+.||||+.+.|+|+|+|+|.|.||+-..+.|.|
T Consensus 3207 ~g~~vitV~a~d~d~e~ns~i~Yei~~~-n~~l~Ftidp~ngev~V~kslDrE~is~ynLkvia~d~g~ppl~e~t~v~I 3285 (4289)
T KOG1219|consen 3207 KGPTVITVEAFDRDGENNSAILYEIIKG-NQRLLFTIDPINGEVPVVKSLDREAISTYNLKVIAEDPGTPPLLENTAVSI 3285 (4289)
T ss_pred cCCceEEEEeccCCCCCccceEEEEecC-ccceeEEecccCceEEEEeccChhhhccceEEEEecCCCCCCccccceeEE
Confidence 9999999999999999999999999985 357899999999999999999999999999999999999999999999999
Q ss_pred EEEecCCCCCCCeeecceEEEEEeCCCCCCceEEEeeC
Q psy1041 1058 EVSDVNENMNAPVFSDFVYQATVKENQPIGTSLKPRLL 1095 (1095)
Q Consensus 1058 ~V~dvNdn~~~P~F~~~~y~~~v~E~~~~gt~v~~~~~ 1095 (1095)
+|.|+||| +|+|.+..|+..|.||.|+|+.|+++.+
T Consensus 3286 ~V~d~ndn--aPrf~~~n~st~vqen~piG~~vLq~~v 3321 (4289)
T KOG1219|consen 3286 EVIDVNDN--APRFITDNYSTYVQENEPIGHRVLQLLV 3321 (4289)
T ss_pred EEEeccCC--CCeeeccceeEEEecCCcccceEEEEEe
Confidence 99999998 9999999999999999999999999863
No 3
>KOG4289|consensus
Probab=100.00 E-value=1.8e-127 Score=1108.48 Aligned_cols=902 Identities=29% Similarity=0.449 Sum_probs=805.4
Q ss_pred CceeeCCceEEEEEcCCCCCceeccCcccccccCCCceeEEEEEEeC---CCCCceEeeEeeecCeEEEEEEecCCCccc
Q psy1041 34 SLRFTQKDYNVSISENSNSKTYVTPEEKMGIYRGTSEVDIRFKISSG---DRDKFFKAEERLVGDFWFLLIRTRTGNTDV 110 (1095)
Q Consensus 34 ~~~F~~~~y~~~v~En~~~gt~v~~~~~~~~~~~~~~~~i~ysi~~~---~~~~~F~i~~~~~g~~~~~~i~~~~g~~~~ 110 (1095)
.++|+++.|...++||.|.||.|++..+.+.+ .+.+.|++..- -..++|+||+. +| .|++.. .
T Consensus 161 ~~~Fqq~~Yq~~lpEn~pagT~iasv~A~~~~----a~rl~Ysm~al~dsRS~~lFslD~~-sG-----~irta~----~ 226 (2531)
T KOG4289|consen 161 AVQFQQPNYQKELPENEPAGTIIASVKASDPD----AGRLYYSMVALFDSRSQNLFSLDPM-SG-----AIRTAK----S 226 (2531)
T ss_pred CccCCCcchhccCcCCCCCCceeEEEEecCCC----cCceEEEeeeccchhccccEeeccc-cc-----cchhhh----h
Confidence 34999999999999999999999865444433 36799999642 24578999874 44 466654 7
Q ss_pred cCccccceEEEEEEEEEeccCCCCCCceeEEEEEEEEEecCCCCCCCCCCCceEEEeeCCCCCCcEEEEEEeeeCCCCCC
Q psy1041 111 LNRERKDKYILHIKATITHRDGKKASYEETTCKVHVNVLDTNDLNPLFYPTEYEETVPEDLPLHTSILRVSAEDADLGRN 190 (1095)
Q Consensus 111 lD~E~~~~y~l~V~A~d~~~~~~~~~~~~~~~~v~I~V~D~NDn~P~F~~~~y~~~v~E~~~~g~~v~~v~A~D~D~g~n 190 (1095)
||||.+..+.|.|+|.|. +.+.++++++|+|+|+|.|||+|+|++..|.-.+.||.++|..|++|+|+|.|+++|
T Consensus 227 lDREt~e~HvlrVtA~d~-----~~P~~SAtttv~V~V~D~nDhsPvFEq~~Y~e~lREn~evGy~vLtvrAtD~Dsp~N 301 (2531)
T KOG4289|consen 227 LDRETKETHVLRVTAQDH-----GDPRRSATTTVTVLVLDTNDHSPVFEQDEYREELRENLEVGYEVLTVRATDGDSPPN 301 (2531)
T ss_pred hhhhhhheeEEEEEeeec-----CCCcccceeEEEEEEeecCCCCcccchhHHHHHHhhccccCceEEEEEeccCCCCCC
Confidence 999999999999999994 447899999999999999999999999999999999999999999999999999999
Q ss_pred eEEEEEEecC--CccEEEEccccEEEEccCCCcCCCcEEEEEEEEeecCccccCCCCcceEEEEEEEEeccCCcCeeEEe
Q psy1041 191 GEIYYSFRDM--NEQFSIHPTSGVVTLTRPLKYTDRSVHDLVVLGQDRGSVFKGGGKPSSAKLKIKVEQINLYGPEIYVQ 268 (1095)
Q Consensus 191 ~~v~Y~l~~~--~~~F~id~~tG~i~~~~~ld~e~~~~~~l~V~A~D~g~~~~~~~~s~~~~v~I~V~dvNd~~P~f~~~ 268 (1095)
+.|.|++..+ ...|.||+.+|+|.+..+||||+...|.|.|+|+|+|.|++ ..++.|.|+|.|+|||+|+|...
T Consensus 302 ani~Yrl~eg~~~~~f~in~rSGvI~T~a~lDRE~~~~y~L~VeAsDqG~~pg----p~Ta~V~itV~D~NDNaPqFse~ 377 (2531)
T KOG4289|consen 302 ANIRYRLLEGNAKNVFEINPRSGVISTRAPLDREELESYQLDVEASDQGRPPG----PRTAMVEITVEDENDNAPQFSEK 377 (2531)
T ss_pred CceEEEecCCCccceeEEcCccceeeccCccCHHhhhheEEEEEeccCCCCCC----CceEEEEEEEEecCCCCcccccc
Confidence 9999999864 46799999999999999999999999999999999998743 44999999999999999999998
Q ss_pred ecCcccccCC--ccEEEEEEEEeCCCCCCCeEEEEEeecCCCCCCeEEeeeccCCCCcCceEEEEECccccccCCCCceE
Q psy1041 269 SLPDIVEQSY--ADIYAIVRVVDRDAGIHGEIASLDIVDGDPDGHFRIVPTKIDPGTKKKEYNIVVLKLLDREIAPLGYN 346 (1095)
Q Consensus 269 ~~~~~~~~~~--~~~~~~v~a~D~D~g~n~~v~~~~i~~g~~~~~F~i~~~~~~~~~~~g~~~i~~~~~lD~E~~~~~y~ 346 (1095)
.|...+.|.. +..+.+|+|+|+|.|.||.|+ |+|.+|+..|.|.|+.. +| .|.+..+||+|.. .|+
T Consensus 378 ~Yvvqv~Edvt~~avvlrV~AtDrD~g~Ng~VH-Ysi~Sgn~~G~f~id~~-------tG--el~vv~plD~e~~--~yt 445 (2531)
T KOG4289|consen 378 RYVVQVREDVTPPAVVLRVTATDRDKGTNGKVH-YSIASGNGRGQFYIDSL-------TG--ELDVVEPLDFENS--EYT 445 (2531)
T ss_pred ceEEEecccCCCCceEEEEEecccCCCcCceEE-EEeeccCccccEEEecc-------cc--eEEEeccccccCC--eeE
Confidence 8876665543 347899999999999999998 69999999999999987 56 4778899999987 399
Q ss_pred EEEEEEECCCCCeeeEEEEEEEeccCCCCCCCccCCcEEEEEeCCCCCCceEEEEEEeeCCCCCCceEEEEEEeCCCCCc
Q psy1041 347 LTLRAVDKGTPPRETYKATQVHLVDLNDNKPVFDREIYEVDVPETTPVNTPIIRLKVSDADDGKNAQVFLEIVGGNEGGE 426 (1095)
Q Consensus 347 l~v~a~D~g~p~~~s~~~~~i~v~d~Nd~~P~F~~~~~~~~v~E~~~~g~~v~~v~a~D~D~g~n~~i~ysi~~~~~~~~ 426 (1095)
+.|+|.|+|.|+++++.-++|.|.|+|||+|.|....+.++|.|+.+.|..+..+.|.|+|.|.|+.+.|++.+ .+.
T Consensus 446 l~IrAqDggrPpLsn~sgl~iqVlDINDhaPifvstpfq~tvlEnv~lg~~v~~vqaidadsg~na~l~y~laG---~~p 522 (2531)
T KOG4289|consen 446 LRIRAQDGGRPPLSNTSGLVIQVLDINDHAPIFVSTPFQATVLENVPLGYLVCHVQAIDADSGENARLHYSLAG---VGP 522 (2531)
T ss_pred EEEEcccCCCCCccCCCceEEEEEecCCCCceeEechhhhhhhhcccccceEEEEecccCCCCcccceeeeecc---CCC
Confidence 99999999999999886677999999999999999999999999999999999999999999999999999986 568
Q ss_pred EEEECCccEEEEceecCcccccEEEEEEEEEeCCCCCCceeeeEEEEEEEeecCCCCCcccCCccEEEEeecCCCCcEEE
Q psy1041 427 FNINPETGMLYTAVTLDAEDKAFYTLTVSAIDQGNAGTRKQSAAKVKVNIVDTNDNDPLFDSPEMEVSINENEPAGTSVI 506 (1095)
Q Consensus 427 F~Id~~tG~i~~~~~LD~E~~~~y~l~V~a~D~g~~~~~~~~~~~v~I~V~DvNDn~P~f~~~~~~~~V~E~~~~gt~v~ 506 (1095)
|.|+..+|+|++.+.||||+...|.|.|+|+|+|.|+ +++++.|.|.+.|+|||.|.|.+..|+..+.|+.|.|+.|.
T Consensus 523 f~I~~~SG~Itvtk~ldrEt~~~ysl~V~ard~gtp~--l~tstsI~Vtv~dvndndP~Ft~~eytl~inED~pvgsSI~ 600 (2531)
T KOG4289|consen 523 FQINNGSGWITVTKELDRETVEHYSLGVEARDHGTPP--LSTSTSISVTVLDVNDNDPTFTQKEYTLRINEDAPVGSSIV 600 (2531)
T ss_pred eeEecCCceEEEeecccccccceEEEEEEEcCCCCCc--ccccceEEEEecccCCCCCccccCceEEEecCCccccceEE
Confidence 9999999999999999999999999999999999775 78899999999999999999999999999999999999999
Q ss_pred EEEEeeCCCCCCceEEEEEecCC-CCCcEEeccc--ceEEEceeccccccccEEEEEEEEEECCcCCceeeEEEEEEEEE
Q psy1041 507 KVTAKDKDSGENAYISYSIANLK-PVPFEIDHFS--GVIKTTQVLDYESMRREYILRVRASDWGLPYRRQTEMQLKIKLL 583 (1095)
Q Consensus 507 ~v~A~D~D~g~n~~i~ysi~~~~-~~~F~Id~~t--G~i~~~~~lD~E~~~~~~~l~V~a~D~g~p~~~s~~~~v~I~V~ 583 (1095)
+|+|+|.|. +..++|.|.+++ ...|.|+... |.|+..-++|+.. .++|.+.|+|+|++ +..+..|.|.|.
T Consensus 601 tvtAvD~d~--~s~ityqi~g~ntrn~Fsi~si~g~Glitlalp~dkKq-e~~~vl~vtAtDg~----l~d~~~V~v~I~ 673 (2531)
T KOG4289|consen 601 TVTAVDRDA--NSVITYQITGGNTRNRFSISSIGGGGLITLALPLDKKQ-ERQYVLAVTATDGT----LQDTCSVNVNIT 673 (2531)
T ss_pred EEEEecccc--ccceEEEecCCcccccceeeccCCcceEEeecchhhcc-cceEEEEEEecCCc----cccceEEEEEee
Confidence 999999996 667999999977 6789999876 6788888999987 47899999999954 457789999999
Q ss_pred eCCCCCCcccccceEEEecCCCCCCcEEEEEEEEeCCCCC--eEEEEEEeCCCCCcEEEeCCCcEEEEeeccCcccccEE
Q psy1041 584 DVNDNRPQFEKVDCLGHVPRNLPIGREIITLSAIDFDAGN--IISYRIVSGNEDGCFALDITSGVLSIACDLTDVRVNER 661 (1095)
Q Consensus 584 dvNDn~P~f~~~~~~~~V~E~~~~g~~v~~v~A~D~D~~~--~i~y~i~~~~~~~~F~Id~~tG~i~~~~~ld~~~~~~~ 661 (1095)
|.|-+.|.|....|.++|+|..|.|+.|.+++|+|.|.|. .|+| |.. ...|+||+++|.+++...|++|.+-.|
T Consensus 674 danThrpvFqs~pfTvsI~e~rP~G~tvvtlsasd~D~geNARI~y-~le---d~~Frid~dsg~i~t~~~ld~edqvty 749 (2531)
T KOG4289|consen 674 DANTHRPVFQSSPFTVSINEDRPLGTTVVTLSASDEDTGENARITY-ILE---DEAFRIDPDSGAIYTQAELDYEDQVTY 749 (2531)
T ss_pred ecccCCcccccCCeeEeeccCCcCCceeEEEecccCCCCccceEEE-Eec---ccceeecCCCCceEEeeeeecccceee
Confidence 9999999999999999999999999999999999999987 6888 433 335999999999999999999999999
Q ss_pred EEEEEEecCCCccceEEEEEEEeecccCCCCCccccCCCCCeEEeeccceeeeehhhhhhhhcccccccccceeeecccc
Q psy1041 662 EINVTATDSAHFSDVVRIRINLVSARRIPEPGKTLENDSGGFECKDTGVARRLTEVLAAAEKNNLRSQSYQEEFAMMPSR 741 (1095)
Q Consensus 662 ~l~V~atD~~~~s~~~~v~I~v~~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~v 741 (1095)
.+.++|.|.+-+....+.+|.| .|
T Consensus 750 tl~itA~D~~~pq~adtttveV--------------------------------------------------------~v 773 (2531)
T KOG4289|consen 750 TLAITARDNGIPQKADTTTVEV--------------------------------------------------------LV 773 (2531)
T ss_pred EeeeeecCCCCCCcCccEEEEE--------------------------------------------------------Ee
Confidence 9999999998665444444333 46
Q ss_pred ccccCCCCccccCcEEEEEecCCCCCcEEEEEEEEeCCCCCCcEEEEEEEeCC-CCCcEEEeCCceEEEEcccCCccCCC
Q psy1041 742 YGENVHVPEFFSFPIELQVNESVPLKSTLTKIIARDRDLGYNGKLVFGISSGD-NDSVFRIDPDSGELKVVGYLDRERTS 820 (1095)
Q Consensus 742 ~dvNd~~P~f~~~~~~~~v~E~~~~g~~v~~v~A~D~D~g~n~~v~y~i~~g~-~~~~F~Id~~tG~i~~~~~LD~E~~~ 820 (1095)
.|+|||+|+|....|..+|.|+.|+++.|++|+|+|+|.|.||.+.|.+.+|. ..+.|.|++.+|.|++.+.||||...
T Consensus 774 ~diNDnaPqf~assyt~sV~Ed~Pv~TsvlQVSatDaD~g~Ng~v~y~~qg~~d~p~~F~IEptSGviRtl~rLdRE~~a 853 (2531)
T KOG4289|consen 774 NDINDNAPQFLASSYTGSVFEDAPVFTSVLQVSATDADSGPNGRVYYTFQGGDDGPGDFYIEPTSGVIRTLRRLDRENVA 853 (2531)
T ss_pred ecccccCcccchhhceeEeecCCCCcceEEEEEEeccCCCCCceEEEEecCCCCCCCceEEccCcceeehhhhhcchhee
Confidence 79999999999999999999999999999999999999999999999997653 34789999999999999999999999
Q ss_pred EEEEEEEEEECCCCCCceeEEEEEEEEecCCCCCcccccceEEEEecCCCCCeEEEEEEEEeCCCCCCceEEEEecCCC-
Q psy1041 821 EYTLNITVYDLGKPQKSTSKMLPITILDVNDNPPKFEKSLASFRVTENALNGTVIFKVNATDLDLGDNAKVVYSLMTDT- 899 (1095)
Q Consensus 821 ~y~l~V~a~D~g~p~~s~~~~v~I~V~DvNDn~P~F~~~~y~~~V~En~~~gt~v~~v~A~D~D~g~n~~v~Ysl~~~~- 899 (1095)
.|.|.+.|+|.|.|++++...|+|+|+|+|||||+|.+..|...|.||.++|..+.+++|.|+|.|+|+.|.|+|.+++
T Consensus 854 vy~L~a~avDrg~p~ls~~~eItvtvldvNDnaPvfe~~e~e~~I~enspvgs~va~i~a~dpdEG~NA~IsYqIvgg~d 933 (2531)
T KOG4289|consen 854 VYVLAAYAVDRGNPPLSAPVEITVTVLDVNDNAPVFEQDELELFIEENSPVGSVVALITADDPDEGPNAHISYQIVGGND 933 (2531)
T ss_pred EEEEEEEEeeCCCCCcCCceEEEEEEEecCCCCCCCCCcceeeEEeecCccceeeEEEEccCCCcCCcceEEEeeccCcc
Confidence 9999999999999999999999999999999999999999999999999999999999999999999999999999987
Q ss_pred -cceEEeCCcceEEEccccccc-ccCeeEEEEEEEeCCCCCCCcceeEEEEEEEEEeccCCCCCeeecCc--eEEEEeCC
Q psy1041 900 -QDFAVDSATGSLYVSASLDRE-RQDLYELKIRASDCDGRNDMYTLHADALVRVTIDDINDNAPNFALPN--YSVKVRED 975 (1095)
Q Consensus 900 -~~F~Id~~tG~i~~~~~LD~E-~~~~y~l~V~A~D~~g~~~~~~~~~~~~v~I~V~DvNDn~P~f~~~~--y~~~v~E~ 975 (1095)
..|.++...|+|.....|||| ....|.+.++|+-. ++.+.+++.|.|.|.|||+|+...-. |.-.+ |
T Consensus 934 ~~~fq~de~~~~lla~~efdyef~~~eyv~~~qats~-------plvS~atv~i~vsd~ndn~pvl~~f~iLfN~y~--n 1004 (2531)
T KOG4289|consen 934 PELFQLDEFSGELLALVEFDYEFTRVEYVLVVQATSA-------PLVSRATVHIRVSDQNDNPPVLEDFQILFNNYV--N 1004 (2531)
T ss_pred HHHHHHHHhhhhhhhheeehhhhccceeeEEeecccc-------ccccceeEEEEecccCCCchhhccHHHHhhhhh--h
Confidence 789999999999999999999 78899999998744 48899999999999999999865321 22222 5
Q ss_pred CCCCcEEEEEEEEcCCCCCCceEEEEEEeCCCCCCCEEEeCCceEEEEcccCCCCCCCEEEEEEEEEECCCCCceeEEEE
Q psy1041 976 IPVGTVVAILSASDPDLGQGGVVRYTIVSDNEADDVFSIDRLTGTIRVAKPLDFEKRQVHSLVVRAKDNGSPPLYSEATL 1055 (1095)
Q Consensus 976 ~~~g~~v~~v~A~D~D~g~n~~v~Y~i~~~~~~~~~F~Id~~tG~i~~~~~ld~E~~~~~~l~V~A~D~g~p~ls~~~~v 1055 (1095)
.-++..++.+-|.|+|. +..+.|++.. .+....++.+|+|.+.+.||+. -.-.+.|.++|+-. .-++++.+
T Consensus 1005 sf~~g~ig~iPA~Dpd~--sd~l~y~~eE----l~L~~an~~tGel~lsr~ldnN--l~asm~v~VsDG~h-svta~C~l 1075 (2531)
T KOG4289|consen 1005 SFPAGLIGRIPAHDPDV--SDSLIYSFEE----LNLLLANAKTGELLLSRELDNN--LEASMKVCVSDGAH-SVTAQCRL 1075 (2531)
T ss_pred ccccceeEecccCCcch--hhhhheeecc----ceeEEecccCCcEEehhhhhcc--cceeEEEEeecCcc-ceeeeEEE
Confidence 56777899999999996 5668899874 5678889999999999999976 34567788888764 34566666
Q ss_pred EEEEE
Q psy1041 1056 IVEVS 1060 (1095)
Q Consensus 1056 ~I~V~ 1060 (1095)
.+.+.
T Consensus 1076 rvvii 1080 (2531)
T KOG4289|consen 1076 RVVII 1080 (2531)
T ss_pred EEEEe
Confidence 65543
No 4
>KOG1219|consensus
Probab=100.00 E-value=6.8e-123 Score=1099.39 Aligned_cols=952 Identities=39% Similarity=0.588 Sum_probs=865.3
Q ss_pred CCCCCceeeCCceEEEEEcCCCCCceeccCcccccccCCCceeEEEEEEeCCCCCceEeeEeeecCeEEEEEEecCCCcc
Q psy1041 30 GRSSSLRFTQKDYNVSISENSNSKTYVTPEEKMGIYRGTSEVDIRFKISSGDRDKFFKAEERLVGDFWFLLIRTRTGNTD 109 (1095)
Q Consensus 30 ~~~~~~~F~~~~y~~~v~En~~~gt~v~~~~~~~~~~~~~~~~i~ysi~~~~~~~~F~i~~~~~g~~~~~~i~~~~g~~~ 109 (1095)
-.+++|+|..+.|+++|.||+...+++....+||+....++..++|.|++|+....|+.+....|++|+++||+++++
T Consensus 22 l~~~~~~FTh~~YN~tv~ENS~~ktYv~~~~KmGvyl~ep~w~vRy~iisGd~~nlFKaeey~vGdFcFLRIRtKg~N-- 99 (4289)
T KOG1219|consen 22 LTEQIFEFTHPLYNLTVEENSIGKTYVRNSTKMGVYLPEPNWKVRYAIISGDKSNLFKAEEYQVGDFCFLRIRTKGDN-- 99 (4289)
T ss_pred ecccchhhcccccceEEEecccccccccCceeeeeecCCCCceEEEEEEecchhhhhhhhheeeccEEEEEEEecCCC--
Confidence 467899999999999999999999999989999999999999999999999999999999999999999999999988
Q ss_pred ccCccccceEEEEEEEEEeccCCCCCCceeEEEEEEEEEecCCCCCCCCCCCceEEEeeCCCCCCcEEEEEEeeeCCCCC
Q psy1041 110 VLNRERKDKYILHIKATITHRDGKKASYEETTCKVHVNVLDTNDLNPLFYPTEYEETVPEDLPLHTSILRVSAEDADLGR 189 (1095)
Q Consensus 110 ~lD~E~~~~y~l~V~A~d~~~~~~~~~~~~~~~~v~I~V~D~NDn~P~F~~~~y~~~v~E~~~~g~~v~~v~A~D~D~g~ 189 (1095)
+|+||-++.|.|.|+|+. ..-.+.+++.|.++|+|.||-.|.|.+..|+++|+|+.++-+.|++|.|+|+|.|.
T Consensus 100 ~LNREvkD~YtlivkA~e------k~l~lEa~trv~v~vlD~NDl~PlFsp~sY~v~i~ed~~~~s~i~rV~AtDADiG~ 173 (4289)
T KOG1219|consen 100 PLNREVKDFYTLIVKAME------KDLSLEATTRVHVRVLDRNDLSPLFSPQSYEVEIDEDLEPFSTILRVEATDADIGI 173 (4289)
T ss_pred ccchhhhHHHHHHHHHHh------hcccceeeeEEEEEEeccCCCcccccCCceEEecCCCCCcccceEEEEeccccccc
Confidence 899999999999999997 33458999999999999999999999999999999999999999999999999999
Q ss_pred CeEEEEEEecCCccEEEEccccEEEEccCCCcCCCcEEEEEEEEeecCccccCCCCcceEEEEEEEEeccCCcCeeEEee
Q psy1041 190 NGEIYYSFRDMNEQFSIHPTSGVVTLTRPLKYTDRSVHDLVVLGQDRGSVFKGGGKPSSAKLKIKVEQINLYGPEIYVQS 269 (1095)
Q Consensus 190 n~~v~Y~l~~~~~~F~id~~tG~i~~~~~ld~e~~~~~~l~V~A~D~g~~~~~~~~s~~~~v~I~V~dvNd~~P~f~~~~ 269 (1095)
|+.+.|++.+-+..|.|+|.+|+++ +|++-..+.|.|.|.|.|+++.+ .--|.|+.+|.+....
T Consensus 174 N~efYysf~~Rs~mFaihPtsGvv~---~L~~~~~gkyel~vla~DR~~kl-------------y~~~ane~~P~itavv 237 (4289)
T KOG1219|consen 174 NSEFYYSFVNRSHMFAIHPTSGVVR---SLRHVKPGKYELKVLAEDRASKL-------------YYFDANEVQPSITAVV 237 (4289)
T ss_pred cceEEEEeccccccEEeccccceEE---EeeeccccceEEEEeehhhhhhh-------------cccccccCCCceEEEE
Confidence 9999999999999999999999998 88998999999999999999643 1124889999998665
Q ss_pred cCcccccCCccEEEEEEEEeCCCCCCCeEEEEEeecCCCCCCeEEeeeccCCCCcCceEEEEECccccccCCCCceEEEE
Q psy1041 270 LPDIVEQSYADIYAIVRVVDRDAGIHGEIASLDIVDGDPDGHFRIVPTKIDPGTKKKEYNIVVLKLLDREIAPLGYNLTL 349 (1095)
Q Consensus 270 ~~~~~~~~~~~~~~~v~a~D~D~g~n~~v~~~~i~~g~~~~~F~i~~~~~~~~~~~g~~~i~~~~~lD~E~~~~~y~l~v 349 (1095)
+........+ .++.+.+-+.+.|. .+.+..|+.|+...+|.+... ..+++++.+.+...++|.....+|++++
T Consensus 238 l~p~e~~~~p-~ya~V~vd~~~~ga--~~~s~~iv~gd~~~~f~~v~s----~~~skE~~~~~~~di~w~~~t~~~~~sL 310 (4289)
T KOG1219|consen 238 LIPRETKPKP-RYALVDVDKINPGA--NRQSAAIVTGDDSPNFAIVGS----KGNSKEHWFEVEPDIVWNDMTIGINLSL 310 (4289)
T ss_pred EecccCCCCC-eEEEEEeeccCCCc--cceeEEEEecCCCcceeeecc----cCCCcceEEEecccccccccceeEEEEE
Confidence 5422222223 78888888888777 455678999999999998754 2457788999999999988887899999
Q ss_pred EEEECCCCCeeeEEEEEEEeccCCCCCCCccCCcEEEEEeCCCCCCceEEEEEEeeCCCCCCceEEEEEEeCCCCCcEEE
Q psy1041 350 RAVDKGTPPRETYKATQVHLVDLNDNKPVFDREIYEVDVPETTPVNTPIIRLKVSDADDGKNAQVFLEIVGGNEGGEFNI 429 (1095)
Q Consensus 350 ~a~D~g~p~~~s~~~~~i~v~d~Nd~~P~F~~~~~~~~v~E~~~~g~~v~~v~a~D~D~g~n~~i~ysi~~~~~~~~F~I 429 (1095)
.|.++ |..++ .-+.+-.|+...|.++++|.++++++|.-+.|+- ..|.+..|+ ..|.+
T Consensus 311 ~akng--~qf~s----------~kn~~vkfek~~~r~~~Sefa~~ntpVv~v~atp--------yv~k~s~gn--~kfkl 368 (4289)
T KOG1219|consen 311 QAKNG--PQFFS----------LKNFTVKFEKEVYRFSVSEFAPPNTPVVMVEATP--------YVYKLSRGN--SKFKL 368 (4289)
T ss_pred EecCC--Ceeee----------ccccceEEEeeEEEEEecccCCCCCcEEEEecce--------eEeeccCcc--cceee
Confidence 99983 33222 1122446999999999999999999999999982 677777654 68999
Q ss_pred ECCccEEEEceecCcccccEEEEEEEEEeCCCCCCceeeeEEEEEEEeecCCCCCcccCCccEEEEeecCCCCcEEEEEE
Q psy1041 430 NPETGMLYTAVTLDAEDKAFYTLTVSAIDQGNAGTRKQSAAKVKVNIVDTNDNDPLFDSPEMEVSINENEPAGTSVIKVT 509 (1095)
Q Consensus 430 d~~tG~i~~~~~LD~E~~~~y~l~V~a~D~g~~~~~~~~~~~v~I~V~DvNDn~P~f~~~~~~~~V~E~~~~gt~v~~v~ 509 (1095)
|..||.|++..+||||....|.|.|+. +. .+...|.|.|.|+|+|+|.|....|.+.++||.|+|+.+....
T Consensus 369 n~~t~lis~~epldr~~~ah~~l~i~t-~~-------~as~kvlidvld~n~n~pif~r~~~~ve~penvpig~~vl~~s 440 (4289)
T KOG1219|consen 369 NEQTGLISVSEPLDRESEAHIDLLIIT-SP-------PASTKVLIDVLDVNDNSPIFPRDVYRVEIPENVPIGTRVLISS 440 (4289)
T ss_pred eeeeeeEEecchhhhhhhhceeeEEec-CC-------CcceEEEEEEeccCCCCCcceeeeeeeecCCCCCcceEEEEEe
Confidence 999999999999999999999999986 32 3678899999999999999999999999999999999999999
Q ss_pred EeeCCCCCCceEEEEEecCCCCCcEEecccceEEEceeccccccccEEEEEEEEEECCcCCceeeEEEEEEEEEeCCCCC
Q psy1041 510 AKDKDSGENAYISYSIANLKPVPFEIDHFSGVIKTTQVLDYESMRREYILRVRASDWGLPYRRQTEMQLKIKLLDVNDNR 589 (1095)
Q Consensus 510 A~D~D~g~n~~i~ysi~~~~~~~F~Id~~tG~i~~~~~lD~E~~~~~~~l~V~a~D~g~p~~~s~~~~v~I~V~dvNDn~ 589 (1095)
|+|+|.|.||.++|+|.+....+|.|++.+|.|.+.+.||||. ++.|+|+|+|+|+|.| ++.+++.+.|.++|.|||+
T Consensus 441 atDpdegengyvtysia~~~~lPFaI~~~~GilsvS~kldrel-~rvYtfRv~Asd~G~p-er~~e~~~~I~ildlNDn~ 518 (4289)
T KOG1219|consen 441 ATDPDEGENGYVTYSIADDTMLPFAIDQSDGILSVSGKLDREL-RRVYTFRVRASDWGVP-ERESEVHLNILILDLNDNP 518 (4289)
T ss_pred ccCCCcCcCceEEEEecCCccCceEeccccceEEeccccCccc-cceEEEEEEEeccCCc-chhceeeEEEEEeccCCCC
Confidence 9999999999999999998889999999999999999999998 7899999999999999 6788999999999999999
Q ss_pred CcccccceEEEecCCCCCCcEEEEEEEEeCCCCCeEEEEEEeCCCCCcEEEeCCCcEEEEeeccC-cccccEEEEEEEEe
Q psy1041 590 PQFEKVDCLGHVPRNLPIGREIITLSAIDFDAGNIISYRIVSGNEDGCFALDITSGVLSIACDLT-DVRVNEREINVTAT 668 (1095)
Q Consensus 590 P~f~~~~~~~~V~E~~~~g~~v~~v~A~D~D~~~~i~y~i~~~~~~~~F~Id~~tG~i~~~~~ld-~~~~~~~~l~V~at 668 (1095)
|.|....+..++...-++|+.+.++.|+|.|++..+.|+|..+++.. |.+++.+|+|++.+.-. ......+.+.+.|.
T Consensus 519 P~F~~~n~t~t~~~~~~vg~~l~tvsAtD~De~ellky~i~~~nel~-feln~nSgeisLvr~n~t~~~~s~~slv~a~d 597 (4289)
T KOG1219|consen 519 PNFEIRNCTGTINGDPKVGTKLFTVSATDLDELELLKYRILPGNELS-FELNSNSGEISLVRQNNTECLQSCESLVIAAD 597 (4289)
T ss_pred CcceeeecccccccCCCCCcEEEEeeccccCcccceeEEEEeCCcCc-eeeccCCCeEEEEEccccccccccceEEEehh
Confidence 99999999999999999999999999999999999999999999887 99999999999987322 23455677888888
Q ss_pred cCCCccceEEEEEEEeecccCCCCCccccCCCCCeEEeeccceeeeehhhhhhhhcccccccccceeeeccccccccCCC
Q psy1041 669 DSAHFSDVVRIRINLVSARRIPEPGKTLENDSGGFECKDTGVARRLTEVLAAAEKNNLRSQSYQEEFAMMPSRYGENVHV 748 (1095)
Q Consensus 669 D~~~~s~~~~v~I~v~~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~v~dvNd~~ 748 (1095)
||..++++..++|++... .||+..+....+.. .-.|++.
T Consensus 598 ~G~p~as~t~lni~~~k~--------------------~Tgv~~~~~p~Ilq---------------------~~e~~~f 636 (4289)
T KOG1219|consen 598 DGVPPASPTLLNITVMKY--------------------GTGVGNEHEPNILQ---------------------RFENKHF 636 (4289)
T ss_pred cCCCcCCceeeEEEEEec--------------------ccccccccChhHhh---------------------hhccccC
Confidence 888888888888887532 35554444333322 1248899
Q ss_pred Ccccc-CcEEEEEecCCCCCcEEEEEEEEeCCCCCCcEEEEEEEeCCCCCcEEEeCCceEEEEcccCCccCCCEEEEEEE
Q psy1041 749 PEFFS-FPIELQVNESVPLKSTLTKIIARDRDLGYNGKLVFGISSGDNDSVFRIDPDSGELKVVGYLDRERTSEYTLNIT 827 (1095)
Q Consensus 749 P~f~~-~~~~~~v~E~~~~g~~v~~v~A~D~D~g~n~~v~y~i~~g~~~~~F~Id~~tG~i~~~~~LD~E~~~~y~l~V~ 827 (1095)
|+|.+ +|..+.|+|+.|+|+.++.+.|+|.|.|.||+++|.|..|+....|.|+.++|.|++.++||+|...+|.|.|+
T Consensus 637 Pqf~s~fP~iI~v~Edvpigt~la~L~atD~Dtgfng~l~yvI~dgne~~~~~Id~qsg~itvas~ld~~~t~~yiLnvt 716 (4289)
T KOG1219|consen 637 PQFPSDFPFIIVVPEDVPIGTTLAILSATDSDTGFNGKLVYVIEDGNESICFLIDRQSGNITVASPLDNENTEQYILNVT 716 (4289)
T ss_pred ccccccCCceEEccccCCCCceEEEEeccCCCCCcCceEEEEEeCCccceEEEEecccceEEEecchhhhhhheeEEEEE
Confidence 99976 89999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred EEECCCCCCceeEEEEEEEEecCCCCCcccccceEEEEecCCCCCeEEEEEEEEeCCCCCCceEEEEecCCCcceEEeCC
Q psy1041 828 VYDLGKPQKSTSKMLPITILDVNDNPPKFEKSLASFRVTENALNGTVIFKVNATDLDLGDNAKVVYSLMTDTQDFAVDSA 907 (1095)
Q Consensus 828 a~D~g~p~~s~~~~v~I~V~DvNDn~P~F~~~~y~~~V~En~~~gt~v~~v~A~D~D~g~n~~v~Ysl~~~~~~F~Id~~ 907 (1095)
|.|.|.|+++++..+.|.|.|.|||+|.|.+..|.+.|.|+..+|+.|.+|.|.|.|.|.||.++|+|.+..+.|+||+.
T Consensus 717 a~D~gtPqkss~r~l~v~vkd~ndn~p~f~e~sy~vtvsedtepgs~Ia~vetnd~D~g~NG~v~fsL~n~sdvfsIdp~ 796 (4289)
T KOG1219|consen 717 AYDLGTPQKSSWRLLLVFVKDYNDNTPIFVERSYHVTVSEDTEPGSFIAHVETNDTDGGNNGMVSFSLLNKSDVFSIDPF 796 (4289)
T ss_pred EecCCCchhhceeeEEEEEEecccCCccccccceEEEEecCCCCCceEEEEEecccCCCCCceEEEEecCCcceEEecCc
Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred cceEEEcccccccccCeeEEEEEEEeCCCCCCCcceeEEEEEEEEEeccCCCCCeeecCceEEEEeCCCCCCcEEEEEEE
Q psy1041 908 TGSLYVSASLDRERQDLYELKIRASDCDGRNDMYTLHADALVRVTIDDINDNAPNFALPNYSVKVREDIPVGTVVAILSA 987 (1095)
Q Consensus 908 tG~i~~~~~LD~E~~~~y~l~V~A~D~~g~~~~~~~~~~~~v~I~V~DvNDn~P~f~~~~y~~~v~E~~~~g~~v~~v~A 987 (1095)
||.|.+.++||||.+..|.|.|+|.|.+ .|++.+.+.+.|.|.|||||+|.|....+...|+|+.+.|+++.++.|
T Consensus 797 tGivv~~~sLdrE~q~~y~l~I~a~dqp----~pq~~svv~l~vsvedVndnpPkci~~hsr~kipedlp~gt~~~~l~A 872 (4289)
T KOG1219|consen 797 TGIVVTSKSLDREGQTSYHLKIEARDQP----PPQLFSVVELDVSVEDVNDNPPKCIIRHSRSKIPEDLPYGTVTWQLVA 872 (4289)
T ss_pred ccEEEeccccCcccCceeEEEEEEcCCC----CCceEEEEEEEEEEeeccCCCCccccccccccCcccCCCceEEEEhhh
Confidence 9999999999999999999999999998 578999999999999999999999999999999999999999999999
Q ss_pred EcCCCCCCceEEEEEEeCCCCCCCEEEeCCceEEEEcccCCCCCCCEEEEEEEEEECCCCCceeEEEEEEEEEecCCCCC
Q psy1041 988 SDPDLGQGGVVRYTIVSDNEADDVFSIDRLTGTIRVAKPLDFEKRQVHSLVVRAKDNGSPPLYSEATLIVEVSDVNENMN 1067 (1095)
Q Consensus 988 ~D~D~g~n~~v~Y~i~~~~~~~~~F~Id~~tG~i~~~~~ld~E~~~~~~l~V~A~D~g~p~ls~~~~v~I~V~dvNdn~~ 1067 (1095)
.|+|.|++++++|.+.. ....|.+|..+|.+.+.++||+|....|+|.|+|.|+|.|.+++.|.|.|.|.|+|+|.|
T Consensus 873 ~d~diGq~~kvry~l~~---~~v~~rvd~~sGavfi~~~LDf~k~~fynLsv~a~d~g~p~lss~chl~Vevldv~enlh 949 (4289)
T KOG1219|consen 873 LDPDIGQLGKVRYYLTD---DTVGERVDFPSGAVFIGKPLDFEKSDFYNLSVTAVDRGTPILSSICHLEVEVLDVNENLH 949 (4289)
T ss_pred cCcccCcCceeEEEEec---CccccccccccccEEEecccccccccceEEEEEEecCCCcceeeeEEEEEEEeccCCCCC
Confidence 99999999999999987 345579999999999999999999999999999999999999999999999999999999
Q ss_pred CCeeecceEEEEEeCCCCCCceEEEe
Q psy1041 1068 APVFSDFVYQATVKENQPIGTSLKPR 1093 (1095)
Q Consensus 1068 ~P~F~~~~y~~~v~E~~~~gt~v~~~ 1093 (1095)
||.|....-.+.|.||+|+||.|+++
T Consensus 950 pp~F~~~v~e~~V~EnapiGT~vi~i 975 (4289)
T KOG1219|consen 950 PPEFISFVTEGHVLENAPIGTIVIRI 975 (4289)
T ss_pred CcchheeeeeeeEeecCCcceEEEEE
Confidence 99999999999999999999999986
No 5
>cd00031 CA Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here
Probab=99.97 E-value=2e-28 Score=258.23 Aligned_cols=197 Identities=44% Similarity=0.616 Sum_probs=184.0
Q ss_pred ceEEEEecCCCCCeEEEEEEEEeCCCCCCceEEEEecCCC--cceEEeCCcceEEEcccccccccCeeEEEEEEEeCCCC
Q psy1041 860 LASFRVTENALNGTVIFKVNATDLDLGDNAKVVYSLMTDT--QDFAVDSATGSLYVSASLDRERQDLYELKIRASDCDGR 937 (1095)
Q Consensus 860 ~y~~~V~En~~~gt~v~~v~A~D~D~g~n~~v~Ysl~~~~--~~F~Id~~tG~i~~~~~LD~E~~~~y~l~V~A~D~~g~ 937 (1095)
.|.+.|+||++.|+.++++.|.|+|.+.|+.+.|+|.++. .+|.|++.+|.|++.+.||||....|.|.|+|+|.+
T Consensus 1 ~~~~~i~En~~~g~~v~~~~a~D~D~~~~~~~~y~i~~~~~~~~F~i~~~tG~l~~~~~lD~e~~~~~~l~v~a~D~g-- 78 (199)
T cd00031 1 SYSVSVPENAPPGTVVGTVSATDPDSGENGRVTYSILGGNEDGLFSIDPNTGVITTTKPLDREEQSEYTLTVVASDGG-- 78 (199)
T ss_pred CeEEEEeCCCCCCCEEEEEEEECCCCCCCceEEEEEeCCCCcccEEEeCCCCEEEECCCCCCcCCceEEEEEEEEECC--
Confidence 3789999999999999999999999988899999999887 599999999999999999999999999999999976
Q ss_pred CCCcceeEEEEEEEEEeccCCCCCeeecCceEEEEeCCCCCCcEEEEEEEEcCCCCCCceEEEEEEeCCCCCCCEEEeCC
Q psy1041 938 NDMYTLHADALVRVTIDDINDNAPNFALPNYSVKVREDIPVGTVVAILSASDPDLGQGGVVRYTIVSDNEADDVFSIDRL 1017 (1095)
Q Consensus 938 ~~~~~~~~~~~v~I~V~DvNDn~P~f~~~~y~~~v~E~~~~g~~v~~v~A~D~D~g~n~~v~Y~i~~~~~~~~~F~Id~~ 1017 (1095)
.+.+++...++|.|.|+|||+|.|....|.+.|.|+.++|+.++++.|+|+|.+.|+.++|+|.++.. ...|.|++.
T Consensus 79 --~~~~~~~~~v~I~V~d~Nd~~P~~~~~~~~~~v~e~~~~~~~i~~~~a~D~D~~~~~~~~y~l~~~~~-~~~f~i~~~ 155 (199)
T cd00031 79 --GPPLSSTATVTVTVLDVNDNPPVFEQSSYEASVPENAPPGTVVGTVTATDADSGENAKLTYSILSGND-KELFSIDPN 155 (199)
T ss_pred --cCcceeEEEEEEEEccCCCCCCcccccceEEEEeCCCCCCCEEEEEEEEcCCCCCCccEEEEEeCCCC-CCEEEEeCC
Confidence 34556899999999999999999998999999999999999999999999999999999999998432 479999999
Q ss_pred ceEEEEcccCCCCCCCEEEEEEEEEECCCCCceeEEEEEEEEEe
Q psy1041 1018 TGTIRVAKPLDFEKRQVHSLVVRAKDNGSPPLYSEATLIVEVSD 1061 (1095)
Q Consensus 1018 tG~i~~~~~ld~E~~~~~~l~V~A~D~g~p~ls~~~~v~I~V~d 1061 (1095)
+|.|++.+.||||....|.|.|.|+|.+.|.+++++.++|.|.|
T Consensus 156 ~G~i~~~~~ld~e~~~~~~l~v~a~D~~~~~~~~~~~i~i~v~d 199 (199)
T cd00031 156 TGIITLAKPLDREEKSSYELTVVATDGGGPPLSSTATVTVTVLD 199 (199)
T ss_pred ceEEEeCCccCCccCceEEEEEEEEECCCCCceeEEEEEEEEEC
Confidence 99999999999999999999999999998889999999999875
No 6
>cd00031 CA Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here
Probab=99.96 E-value=3.7e-28 Score=256.30 Aligned_cols=196 Identities=41% Similarity=0.631 Sum_probs=185.0
Q ss_pred EEEEEecCCCCCcEEEEEEEEeCCCCCCcEEEEEEEeCCCCCcEEEeCCceEEEEcccCCccCCCEEEEEEEEEECCCCC
Q psy1041 756 IELQVNESVPLKSTLTKIIARDRDLGYNGKLVFGISSGDNDSVFRIDPDSGELKVVGYLDRERTSEYTLNITVYDLGKPQ 835 (1095)
Q Consensus 756 ~~~~v~E~~~~g~~v~~v~A~D~D~g~n~~v~y~i~~g~~~~~F~Id~~tG~i~~~~~LD~E~~~~y~l~V~a~D~g~p~ 835 (1095)
|.+.+.|+.+.|+.++++.|.|+|.+.|+.++|+|.+++...+|.|++.+|.|++.+.||||....|.|.|+|+|.|.|.
T Consensus 2 ~~~~i~En~~~g~~v~~~~a~D~D~~~~~~~~y~i~~~~~~~~F~i~~~tG~l~~~~~lD~e~~~~~~l~v~a~D~g~~~ 81 (199)
T cd00031 2 YSVSVPENAPPGTVVGTVSATDPDSGENGRVTYSILGGNEDGLFSIDPNTGVITTTKPLDREEQSEYTLTVVASDGGGPP 81 (199)
T ss_pred eEEEEeCCCCCCCEEEEEEEECCCCCCCceEEEEEeCCCCcccEEEeCCCCEEEECCCCCCcCCceEEEEEEEEECCcCc
Confidence 57899999999999999999999999899999999988766899999999999999999999999999999999998888
Q ss_pred CceeEEEEEEEEecCCCCCcccccceEEEEecCCCCCeEEEEEEEEeCCCCCCceEEEEecCCC--cceEEeCCcceEEE
Q psy1041 836 KSTSKMLPITILDVNDNPPKFEKSLASFRVTENALNGTVIFKVNATDLDLGDNAKVVYSLMTDT--QDFAVDSATGSLYV 913 (1095)
Q Consensus 836 ~s~~~~v~I~V~DvNDn~P~F~~~~y~~~V~En~~~gt~v~~v~A~D~D~g~n~~v~Ysl~~~~--~~F~Id~~tG~i~~ 913 (1095)
++++..+.|.|.|+|||+|.|....|.+.|.|+.+.|+.++++.|+|+|.+.|+.++|+|.++. ..|.|++.+|.|++
T Consensus 82 ~~~~~~v~I~V~d~Nd~~P~~~~~~~~~~v~e~~~~~~~i~~~~a~D~D~~~~~~~~y~l~~~~~~~~f~i~~~~G~i~~ 161 (199)
T cd00031 82 LSSTATVTVTVLDVNDNPPVFEQSSYEASVPENAPPGTVVGTVTATDADSGENAKLTYSILSGNDKELFSIDPNTGIITL 161 (199)
T ss_pred ceeEEEEEEEEccCCCCCCcccccceEEEEeCCCCCCCEEEEEEEEcCCCCCCccEEEEEeCCCCCCEEEEeCCceEEEe
Confidence 8889999999999999999999889999999999999999999999999988999999999987 79999999999999
Q ss_pred cccccccccCeeEEEEEEEeCCCCCCCcceeEEEEEEEEEec
Q psy1041 914 SASLDRERQDLYELKIRASDCDGRNDMYTLHADALVRVTIDD 955 (1095)
Q Consensus 914 ~~~LD~E~~~~y~l~V~A~D~~g~~~~~~~~~~~~v~I~V~D 955 (1095)
.+.||||....|.|.|.|+|.++ +.++.++.++|.|.|
T Consensus 162 ~~~ld~e~~~~~~l~v~a~D~~~----~~~~~~~~i~i~v~d 199 (199)
T cd00031 162 AKPLDREEKSSYELTVVATDGGG----PPLSSTATVTVTVLD 199 (199)
T ss_pred CCccCCccCceEEEEEEEEECCC----CCceeEEEEEEEEEC
Confidence 99999999999999999999983 457888999998876
No 7
>KOG1834|consensus
Probab=99.67 E-value=2.8e-15 Score=167.11 Aligned_cols=214 Identities=29% Similarity=0.313 Sum_probs=172.3
Q ss_pred EEEEecCCCCCcccccceEEEEecCCCCCeEEEEEEEEeCCCCC--Cc-eEEEEecCCCcceE---EeCCcc--eEEEcc
Q psy1041 844 ITILDVNDNPPKFEKSLASFRVTENALNGTVIFKVNATDLDLGD--NA-KVVYSLMTDTQDFA---VDSATG--SLYVSA 915 (1095)
Q Consensus 844 I~V~DvNDn~P~F~~~~y~~~V~En~~~gt~v~~v~A~D~D~g~--n~-~v~Ysl~~~~~~F~---Id~~tG--~i~~~~ 915 (1095)
...--+|-+.|.... .|.+-|.||...-...--+.|-|.|... .| ..-|.|.+.+-.|. +|..|| .|+.+.
T Consensus 21 ~~aarankhkpwie~-ey~gvV~Endntvll~Ppl~aLdkdaplr~ageiC~fklhgq~vPFdavVvdK~TGegvlRaK~ 99 (952)
T KOG1834|consen 21 HHAARANKHKPWIEE-EYHGVVTENDNTVLLDPPLAALDKDAPLRYAGEICGFKLHGQPVPFDAVVVDKYTGEGVLRAKE 99 (952)
T ss_pred cccccccccCccccc-ceeEEEEeCCceEEeCCCeeeecCCCCcccccccceeEecCCCCCceEEEEeccCCceEEeecC
Confidence 455667888998864 6999999997532222347889988642 12 35788888776674 477775 688899
Q ss_pred cccccccCeeEEEEEEEeCCCCCC--CcceeEEEEEEEEEeccCCCCCeeecCceEEEEeCCCCCCcEEEEEEEEcCCCC
Q psy1041 916 SLDRERQDLYELKIRASDCDGRND--MYTLHADALVRVTIDDINDNAPNFALPNYSVKVREDIPVGTVVAILSASDPDLG 993 (1095)
Q Consensus 916 ~LD~E~~~~y~l~V~A~D~~g~~~--~~~~~~~~~v~I~V~DvNDn~P~f~~~~y~~~v~E~~~~g~~v~~v~A~D~D~g 993 (1095)
+||-|.++.|+|+|+|.|++..+. ....+..++|.|.|.|+|+.||+|..+.|.+.|.|+- .-..|++|.|.|.|-+
T Consensus 100 ~lDCelqkeytf~iQAydCg~gpdgtn~kKShkatvhIrVkDvNe~AP~f~ep~Yka~V~EGK-~yd~il~veAiD~DCs 178 (952)
T KOG1834|consen 100 PLDCELQKEYTFTIQAYDCGNGPDGTNTKKSHKATVHIRVKDVNEFAPVFKEPWYKAHVTEGK-VYDSILRVEAIDKDCS 178 (952)
T ss_pred cccccccccceEEEEEEecCCCCCccccccccceEEEEEeccccccCchhcccceeeEEecce-eeeeeEEEEeecCCCC
Confidence 999999999999999999985321 1145677999999999999999999999999999984 5678899999999976
Q ss_pred C-CceE-EEEEEeCCCCCCCEEEeCCceEEEEcccCCCCCCCEEEEEEEEEECCCCCceeEEEEEEEEEecC
Q psy1041 994 Q-GGVV-RYTIVSDNEADDVFSIDRLTGTIRVAKPLDFEKRQVHSLVVRAKDNGSPPLYSEATLIVEVSDVN 1063 (1095)
Q Consensus 994 ~-n~~v-~Y~i~~~~~~~~~F~Id~~tG~i~~~~~ld~E~~~~~~l~V~A~D~g~p~ls~~~~v~I~V~dvN 1063 (1095)
+ ++.| .|.|.. ..-.|.||. .|.|+.+.+|.|.....|.|+|+|.|.|.-+..+.+.|+|.|...-
T Consensus 179 pq~sqIC~YEI~t---~d~PFaIdn-~G~irnTekLny~ke~~Y~ltVtAyDCg~kraa~d~lV~v~Vkp~C 246 (952)
T KOG1834|consen 179 PQYSQICEYEITT---PDVPFAIDN-DGNIRNTEKLNYTKEHQYKLTVTAYDCGKKRAASDSLVTVHVKPTC 246 (952)
T ss_pred CcccceeEEEecC---CCCceEEcC-CCccccccccccccceeEEEEEEEEecccccccCcceEEEEecCcc
Confidence 4 6665 688886 577899997 7999999999999999999999999999876566688888886543
No 8
>PF00028 Cadherin: Cadherin domain; InterPro: IPR002126 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion.; GO: 0005509 calcium ion binding, 0007156 homophilic cell adhesion, 0016020 membrane; PDB: 2A4E_A 2A4C_B 2O72_A 2QVI_A 1NCJ_A 3Q2W_A 3Q2N_A 3LNH_B 3LNI_A 3Q2L_A ....
Probab=99.65 E-value=2.5e-15 Score=137.09 Aligned_cols=92 Identities=48% Similarity=0.731 Sum_probs=87.6
Q ss_pred eEEEEeCCCCCCcEEEEEEEEcCCCCCCceEEEEEEeCCCCCCCEEEeCCceEEEEcccCCCCCCCEEEEEEEEEEC-CC
Q psy1041 968 YSVKVREDIPVGTVVAILSASDPDLGQGGVVRYTIVSDNEADDVFSIDRLTGTIRVAKPLDFEKRQVHSLVVRAKDN-GS 1046 (1095)
Q Consensus 968 y~~~v~E~~~~g~~v~~v~A~D~D~g~n~~v~Y~i~~~~~~~~~F~Id~~tG~i~~~~~ld~E~~~~~~l~V~A~D~-g~ 1046 (1095)
|.+.|+|++++|+.++++.|.|+|.+.|+.+.|+|..++ ..+.|.|++.+|.|++.++||||..+.|.|.|.|+|. |.
T Consensus 1 Y~~~v~E~~~~g~~v~~v~a~D~D~~~n~~i~y~i~~~~-~~~~F~I~~~tg~i~~~~~LD~E~~~~y~l~v~a~D~~~~ 79 (93)
T PF00028_consen 1 YSFSVPENAPPGTVVGQVTATDPDSGPNSQITYSILGGN-PDGLFSIDPNTGEISLKKPLDRETQSSYQLTVRATDSGGS 79 (93)
T ss_dssp EEEEEETTGSTSSEEEEEEEEESSTSTTSSEEEEEEETT-STTSEEEETTTTEEEESSSSCTTTTSEEEEEEEEEETTTS
T ss_pred CEEEEECCCCCCCEEEEEEEEeCCCCCCceEEEEEecCc-ccCceEEeeeeeccccceecCcccCCEEEEEEEEEECCCC
Confidence 789999999999999999999999999999999999954 3789999999999999999999999999999999999 89
Q ss_pred CCceeEEEEEEEEE
Q psy1041 1047 PPLYSEATLIVEVS 1060 (1095)
Q Consensus 1047 p~ls~~~~v~I~V~ 1060 (1095)
|+++++++|.|+|+
T Consensus 80 ~~~~~~~~V~I~V~ 93 (93)
T PF00028_consen 80 PPLSSTATVTINVL 93 (93)
T ss_dssp SEEEEEEEEEEEEE
T ss_pred CCCEEEEEEEEEEC
Confidence 99999999999985
No 9
>PF00028 Cadherin: Cadherin domain; InterPro: IPR002126 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion.; GO: 0005509 calcium ion binding, 0007156 homophilic cell adhesion, 0016020 membrane; PDB: 2A4E_A 2A4C_B 2O72_A 2QVI_A 1NCJ_A 3Q2W_A 3Q2N_A 3LNH_B 3LNI_A 3Q2L_A ....
Probab=99.63 E-value=3.6e-15 Score=136.07 Aligned_cols=92 Identities=33% Similarity=0.591 Sum_probs=88.6
Q ss_pred EEEEEecCCCCCcEEEEEEEEeCCCCCCcEEEEEEEeCCCCCcEEEeCCceEEEEcccCCccCCCEEEEEEEEEEC-CCC
Q psy1041 756 IELQVNESVPLKSTLTKIIARDRDLGYNGKLVFGISSGDNDSVFRIDPDSGELKVVGYLDRERTSEYTLNITVYDL-GKP 834 (1095)
Q Consensus 756 ~~~~v~E~~~~g~~v~~v~A~D~D~g~n~~v~y~i~~g~~~~~F~Id~~tG~i~~~~~LD~E~~~~y~l~V~a~D~-g~p 834 (1095)
|.+.|+|+.+.|+.++++.|.|+|.+.|+.+.|+|.+++...+|.|++.+|.|++.++||||....|.|.|.|+|. |.|
T Consensus 1 Y~~~v~E~~~~g~~v~~v~a~D~D~~~n~~i~y~i~~~~~~~~F~I~~~tg~i~~~~~LD~E~~~~y~l~v~a~D~~~~~ 80 (93)
T PF00028_consen 1 YSFSVPENAPPGTVVGQVTATDPDSGPNSQITYSILGGNPDGLFSIDPNTGEISLKKPLDRETQSSYQLTVRATDSGGSP 80 (93)
T ss_dssp EEEEEETTGSTSSEEEEEEEEESSTSTTSSEEEEEEETTSTTSEEEETTTTEEEESSSSCTTTTSEEEEEEEEEETTTSS
T ss_pred CEEEEECCCCCCCEEEEEEEEeCCCCCCceEEEEEecCcccCceEEeeeeeccccceecCcccCCEEEEEEEEEECCCCC
Confidence 6789999999999999999999999999999999999988999999999999999999999999999999999999 899
Q ss_pred CCceeEEEEEEEE
Q psy1041 835 QKSTSKMLPITIL 847 (1095)
Q Consensus 835 ~~s~~~~v~I~V~ 847 (1095)
++++++.|.|+|+
T Consensus 81 ~~~~~~~V~I~V~ 93 (93)
T PF00028_consen 81 PLSSTATVTINVL 93 (93)
T ss_dssp EEEEEEEEEEEEE
T ss_pred CCEEEEEEEEEEC
Confidence 9999999999885
No 10
>KOG1834|consensus
Probab=99.56 E-value=1.6e-13 Score=153.21 Aligned_cols=207 Identities=27% Similarity=0.362 Sum_probs=160.2
Q ss_pred cccCCCCccccCcEEEEEecCCCCCcEEEEEEEEeCCCC--CCcE-EEEEEEeCCC-CCcEEEeCCce--EEEEcccCCc
Q psy1041 743 GENVHVPEFFSFPIELQVNESVPLKSTLTKIIARDRDLG--YNGK-LVFGISSGDN-DSVFRIDPDSG--ELKVVGYLDR 816 (1095)
Q Consensus 743 dvNd~~P~f~~~~~~~~v~E~~~~g~~v~~v~A~D~D~g--~n~~-v~y~i~~g~~-~~~F~Id~~tG--~i~~~~~LD~ 816 (1095)
-+|-+.|-. ...|..-|.||.-.-...--+.|-|.|.. ..|. .-|.|.+.+- -+.--+|..|| .|+.+.+||.
T Consensus 25 rankhkpwi-e~ey~gvV~Endntvll~Ppl~aLdkdaplr~ageiC~fklhgq~vPFdavVvdK~TGegvlRaK~~lDC 103 (952)
T KOG1834|consen 25 RANKHKPWI-EEEYHGVVTENDNTVLLDPPLAALDKDAPLRYAGEICGFKLHGQPVPFDAVVVDKYTGEGVLRAKEPLDC 103 (952)
T ss_pred cccccCccc-ccceeEEEEeCCceEEeCCCeeeecCCCCcccccccceeEecCCCCCceEEEEeccCCceEEeecCcccc
Confidence 356666654 34566677777644333345778888852 2233 3577765321 12334576665 6889999999
Q ss_pred cCCCEEEEEEEEEECCCC------CCceeEEEEEEEEecCCCCCcccccceEEEEecCCCCCeEEEEEEEEeCCCCC-Cc
Q psy1041 817 ERTSEYTLNITVYDLGKP------QKSTSKMLPITILDVNDNPPKFEKSLASFRVTENALNGTVIFKVNATDLDLGD-NA 889 (1095)
Q Consensus 817 E~~~~y~l~V~a~D~g~p------~~s~~~~v~I~V~DvNDn~P~F~~~~y~~~V~En~~~gt~v~~v~A~D~D~g~-n~ 889 (1095)
|.+..|+|+|+|.|.|.. .+|-.++|.|+|.|+|+.||+|..+.|.+.|.|+... ..|++|.|.|.|-++ ++
T Consensus 104 elqkeytf~iQAydCg~gpdgtn~kKShkatvhIrVkDvNe~AP~f~ep~Yka~V~EGK~y-d~il~veAiD~DCspq~s 182 (952)
T KOG1834|consen 104 ELQKEYTFTIQAYDCGNGPDGTNTKKSHKATVHIRVKDVNEFAPVFKEPWYKAHVTEGKVY-DSILRVEAIDKDCSPQYS 182 (952)
T ss_pred cccccceEEEEEEecCCCCCccccccccceEEEEEeccccccCchhcccceeeEEecceee-eeeEEEEeecCCCCCccc
Confidence 999999999999998754 3566789999999999999999999999999999754 578999999999764 55
Q ss_pred e-EEEEecCCCcceEEeCCcceEEEcccccccccCeeEEEEEEEeCCCCCCCcceeEEEEEEEEEecc
Q psy1041 890 K-VVYSLMTDTQDFAVDSATGSLYVSASLDRERQDLYELKIRASDCDGRNDMYTLHADALVRVTIDDI 956 (1095)
Q Consensus 890 ~-v~Ysl~~~~~~F~Id~~tG~i~~~~~LD~E~~~~y~l~V~A~D~~g~~~~~~~~~~~~v~I~V~Dv 956 (1095)
+ ..|.|...+-.|.||. .|.|+.+.+|.|.....|.|+|.|.|++ ..+....+.|+|.|...
T Consensus 183 qIC~YEI~t~d~PFaIdn-~G~irnTekLny~ke~~Y~ltVtAyDCg----~kraa~d~lV~v~Vkp~ 245 (952)
T KOG1834|consen 183 QICEYEITTPDVPFAIDN-DGNIRNTEKLNYTKEHQYKLTVTAYDCG----KKRAASDSLVTVHVKPT 245 (952)
T ss_pred ceeEEEecCCCCceEEcC-CCccccccccccccceeEEEEEEEEecc----cccccCcceEEEEecCc
Confidence 5 4799999889999995 8999999999999999999999999998 44555668899988654
No 11
>smart00112 CA Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
Probab=99.54 E-value=3.7e-14 Score=124.96 Aligned_cols=79 Identities=46% Similarity=0.775 Sum_probs=75.0
Q ss_pred EeCCCCCCcEEEEEEEeCCCCCcEEEeCCceEEEEcccCCccCCCEEEEEEEEEECCCCCCceeEEEEEEEEecCCCCC
Q psy1041 776 RDRDLGYNGKLVFGISSGDNDSVFRIDPDSGELKVVGYLDRERTSEYTLNITVYDLGKPQKSTSKMLPITILDVNDNPP 854 (1095)
Q Consensus 776 ~D~D~g~n~~v~y~i~~g~~~~~F~Id~~tG~i~~~~~LD~E~~~~y~l~V~a~D~g~p~~s~~~~v~I~V~DvNDn~P 854 (1095)
+|+|.|.|+.+.|+|.+++...+|.|++.+|.|.+.++||||....|.|.|+|+|.|.|++++.+.|.|.|.|+|||+|
T Consensus 1 ~D~D~g~n~~i~Y~i~~~~~~~~F~i~~~tg~i~~~~~LD~e~~~~y~l~v~a~D~~~~~~~~~~~v~I~V~D~Nd~~P 79 (79)
T smart00112 1 TDADSGENGKVTYSILSGNEDGLFSIDPETGEITTTKPLDREEQPEYTLTVEATDGGGPPLSSTATVTVTVLDVNDNAP 79 (79)
T ss_pred CCCCCCcCcEEEEEEecCCCCCEEEEeCCccEEEeCCccCeeCCCeEEEEEEEEECCCCCcccEEEEEEEEEECCCCCC
Confidence 4889999999999999887668999999999999999999999999999999999999999999999999999999998
No 12
>smart00112 CA Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
Probab=99.51 E-value=1.1e-13 Score=122.05 Aligned_cols=79 Identities=53% Similarity=0.783 Sum_probs=73.3
Q ss_pred EcCCCCCCceEEEEEEeCCCCCCCEEEeCCceEEEEcccCCCCCCCEEEEEEEEEECCCCCceeEEEEEEEEEecCCCCC
Q psy1041 988 SDPDLGQGGVVRYTIVSDNEADDVFSIDRLTGTIRVAKPLDFEKRQVHSLVVRAKDNGSPPLYSEATLIVEVSDVNENMN 1067 (1095)
Q Consensus 988 ~D~D~g~n~~v~Y~i~~~~~~~~~F~Id~~tG~i~~~~~ld~E~~~~~~l~V~A~D~g~p~ls~~~~v~I~V~dvNdn~~ 1067 (1095)
+|+|.|.|+.++|+|.++.. ..+|.|++.+|.|++.++||||....|.|.|+|+|.|.|++++.+.|+|+|.|+|||
T Consensus 1 ~D~D~g~n~~i~Y~i~~~~~-~~~F~i~~~tg~i~~~~~LD~e~~~~y~l~v~a~D~~~~~~~~~~~v~I~V~D~Nd~-- 77 (79)
T smart00112 1 TDADSGENGKVTYSILSGNE-DGLFSIDPETGEITTTKPLDREEQPEYTLTVEATDGGGPPLSSTATVTVTVLDVNDN-- 77 (79)
T ss_pred CCCCCCcCcEEEEEEecCCC-CCEEEEeCCccEEEeCCccCeeCCCeEEEEEEEEECCCCCcccEEEEEEEEEECCCC--
Confidence 48999999999999998443 389999999999999999999999999999999999999999999999999999998
Q ss_pred CC
Q psy1041 1068 AP 1069 (1095)
Q Consensus 1068 ~P 1069 (1095)
+|
T Consensus 78 ~P 79 (79)
T smart00112 78 AP 79 (79)
T ss_pred CC
Confidence 66
No 13
>PF08758 Cadherin_pro: Cadherin prodomain like; InterPro: IPR014868 Cadherins are a group of proteins that mediate calcium dependent cell-cell adhesion. They are activated through cleavage of a prosequence in the late Golgi. This protein corresponds to the folded region of the prosequence, and is termed the prodomain. The prodomain shows structural resemblance to the cadherin domain, but lacks all the features known to be important for cadherin-cadherin interactions []. ; GO: 0007155 cell adhesion, 0016021 integral to membrane; PDB: 1OP4_A.
Probab=96.32 E-value=0.028 Score=50.07 Aligned_cols=78 Identities=22% Similarity=0.359 Sum_probs=42.5
Q ss_pred CCCCCCCCceEEEeeCCCCCCcEEEEEEeeeCCCCCCeEEEEEEecCCccEEEEccccEEEEccCCCcCCCcEEEEEEEE
Q psy1041 154 LNPLFYPTEYEETVPEDLPLHTSILRVSAEDADLGRNGEIYYSFRDMNEQFSIHPTSGVVTLTRPLKYTDRSVHDLVVLG 233 (1095)
Q Consensus 154 n~P~F~~~~y~~~v~E~~~~g~~v~~v~A~D~D~g~n~~v~Y~l~~~~~~F~id~~tG~i~~~~~ld~e~~~~~~l~V~A 233 (1095)
.-|-|.+..|.+.|+.+...|..|++|.-.|-. .+..+.|.-.+ ..|.|. ..|.|.+++++..... .-.|.|.|
T Consensus 2 C~pGF~~~~~~~~Vp~~l~~g~~lg~V~f~dC~--~~~~~~~~ssD--pdF~V~-~DGsVy~~r~v~l~~~-~~~F~V~a 75 (90)
T PF08758_consen 2 CRPGFSQKKYTFEVPSNLEAGQPLGKVNFEDCT--GRRRVIFESSD--PDFRVL-EDGSVYAKRPVQLSSE-QRSFTVHA 75 (90)
T ss_dssp ---B--S-EEEE----SS-SS--EEE---B--S--S---EEEE-----SEEEEE-TTTEEEEES--S-SSS--EEEEEEE
T ss_pred CcCCcccceEEEEcCchhhCCcEEEEEEeccCC--CCCceEEecCC--CCEEEc-CCCeEEEeeeEecCCC-ceEEEEEE
Confidence 358899999999999999999999999999985 34567776664 489999 5799999999886543 35799999
Q ss_pred eecC
Q psy1041 234 QDRG 237 (1095)
Q Consensus 234 ~D~g 237 (1095)
.|..
T Consensus 76 ~D~~ 79 (90)
T PF08758_consen 76 WDSQ 79 (90)
T ss_dssp EETT
T ss_pred ECCC
Confidence 9976
No 14
>PF08266 Cadherin_2: Cadherin-like; InterPro: IPR013164 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion. This entry represents a cadherin domain that is usually found at the N terminus of cadherin proteins.; PDB: 1WUZ_A 1WYJ_A.
Probab=96.23 E-value=0.0032 Score=55.27 Aligned_cols=62 Identities=29% Similarity=0.432 Sum_probs=40.3
Q ss_pred EEEEecCCCCCcEEEEEEEEeCCCCCC--cEEEEEEEeCCCCCcEEEeCCceEEEEcccCCccCC
Q psy1041 757 ELQVNESVPLKSTLTKIIARDRDLGYN--GKLVFGISSGDNDSVFRIDPDSGELKVVGYLDRERT 819 (1095)
Q Consensus 757 ~~~v~E~~~~g~~v~~v~A~D~D~g~n--~~v~y~i~~g~~~~~F~Id~~tG~i~~~~~LD~E~~ 819 (1095)
..+|+|..+.|+.|+.|. .|.-.... ....|+|.+.....+|.+++.+|.|++...+|||..
T Consensus 4 ~YsV~EE~~~Gt~IGnia-~dL~l~~~~l~~~~~ri~s~~~~~~~~v~~~tG~L~v~~rIDRE~L 67 (84)
T PF08266_consen 4 RYSVPEEMPPGTVIGNIA-KDLGLDPQSLSSRNFRIVSEGNSQYFRVNEKTGDLFVSERIDREEL 67 (84)
T ss_dssp EEEEESS--TT-EEEECC-CCCT--HHHHCCTTBEEE-SSSS-SEEE-TTTSEEEESS--SCCCC
T ss_pred EEEeecCCCCCCEEEEhH-HhhCCCcccccccceEEeecCCcceeEecCCceeEEeCCccCHHHH
Confidence 468999999999999994 34322111 123577777667889999999999999999999974
No 15
>PF08758 Cadherin_pro: Cadherin prodomain like; InterPro: IPR014868 Cadherins are a group of proteins that mediate calcium dependent cell-cell adhesion. They are activated through cleavage of a prosequence in the late Golgi. This protein corresponds to the folded region of the prosequence, and is termed the prodomain. The prodomain shows structural resemblance to the cadherin domain, but lacks all the features known to be important for cadherin-cadherin interactions []. ; GO: 0007155 cell adhesion, 0016021 integral to membrane; PDB: 1OP4_A.
Probab=96.10 E-value=0.041 Score=49.01 Aligned_cols=79 Identities=24% Similarity=0.348 Sum_probs=43.4
Q ss_pred CCCcccccceEEEEecCCCCCeEEEEEEEEeCCCCCCceEEEEecCCCcceEEeCCcceEEEcccccccccCeeEEEEEE
Q psy1041 852 NPPKFEKSLASFRVTENALNGTVIFKVNATDLDLGDNAKVVYSLMTDTQDFAVDSATGSLYVSASLDRERQDLYELKIRA 931 (1095)
Q Consensus 852 n~P~F~~~~y~~~V~En~~~gt~v~~v~A~D~D~g~n~~v~Ysl~~~~~~F~Id~~tG~i~~~~~LD~E~~~~y~l~V~A 931 (1095)
+.|-|.+..|.+.|+.+...|..+++|.-.|-.. +..+.|.- .+..|.|.. .|.|++++++...... -.|.|.|
T Consensus 2 C~pGF~~~~~~~~Vp~~l~~g~~lg~V~f~dC~~--~~~~~~~s--sDpdF~V~~-DGsVy~~r~v~l~~~~-~~F~V~a 75 (90)
T PF08758_consen 2 CRPGFSQKKYTFEVPSNLEAGQPLGKVNFEDCTG--RRRVIFES--SDPDFRVLE-DGSVYAKRPVQLSSEQ-RSFTVHA 75 (90)
T ss_dssp ---B--S-EEEE----SS-SS--EEE---B--SS-----EEEE-----SEEEEET-TTEEEEES--S-SSS--EEEEEEE
T ss_pred CcCCcccceEEEEcCchhhCCcEEEEEEeccCCC--CCceEEec--CCCCEEEcC-CCeEEEeeeEecCCCc-eEEEEEE
Confidence 4689999999999999999999999999988742 44577755 334999986 8999999999875433 4799999
Q ss_pred EeCCC
Q psy1041 932 SDCDG 936 (1095)
Q Consensus 932 ~D~~g 936 (1095)
.|..+
T Consensus 76 ~D~~~ 80 (90)
T PF08758_consen 76 WDSQT 80 (90)
T ss_dssp EETTT
T ss_pred ECCCC
Confidence 99985
No 16
>TIGR01965 VCBS_repeat VCBS repeat. This domain of about 100 residues is found multiple (up to 35) copies in long proteins from several species of Vibrio, Colwellia, Bradyrhizobium, and Shewanella (hence the name VCBS) and in smaller copy numbers in proteins from several other bacteria. The large protein size and repeat copy numbers, species distribution, and suggested activities of several member proteins suggests a role for this domain in adhesion.
Probab=95.94 E-value=0.054 Score=48.84 Aligned_cols=88 Identities=26% Similarity=0.414 Sum_probs=60.0
Q ss_pred EEEEEeeCCCCCCceEEEEEEe-CCCCCcEEEECCccEEEEce--------ecCcccccEEEEEEEEEeCCCCCCceeee
Q psy1041 399 IRLKVSDADDGKNAQVFLEIVG-GNEGGEFNINPETGMLYTAV--------TLDAEDKAFYTLTVSAIDQGNAGTRKQSA 469 (1095)
Q Consensus 399 ~~v~a~D~D~g~n~~i~ysi~~-~~~~~~F~Id~~tG~i~~~~--------~LD~E~~~~y~l~V~a~D~g~~~~~~~~~ 469 (1095)
+++.++|+|.+. ..++++.. ....|.|.|++ .|.....- .|..-+...-.|+|++.|+ .+
T Consensus 2 G~Lt~sD~D~gd--~~~~s~~~~~g~yGtlti~~-~G~wtYtl~n~~~avq~L~~Ge~~tdsFtvtv~DG--------tt 70 (99)
T TIGR01965 2 GQLTISDADAGQ--AHFIAQTDAAGQYGTFSIDA-DGQWTYQADNSQTAVQALKAGETLTDTFTVTSADG--------TS 70 (99)
T ss_pred CceEEeCCCCCC--ceEEecccccCCcEEEEECC-CCcEEEEeCCCcHHHHhhcCCCEEEEEEEEEEeCC--------Ce
Confidence 468899999865 45666642 22467899998 47544331 2333344456788888885 27
Q ss_pred EEEEEEEeecCCCCCcccCCccEEEEeecC
Q psy1041 470 AKVKVNIVDTNDNDPLFDSPEMEVSINENE 499 (1095)
Q Consensus 470 ~~v~I~V~DvNDn~P~f~~~~~~~~V~E~~ 499 (1095)
+.|+|+|.-.|| +|+..... ...|.|+.
T Consensus 71 ~~vtItI~GtND-apvi~~~~-~g~v~ED~ 98 (99)
T TIGR01965 71 QTVTITITGAND-AAVIGGAD-TGSVTEDS 98 (99)
T ss_pred EEEEEEEEccCC-CCEEeccc-ceeEecCC
Confidence 789999999999 88776543 57777764
No 17
>PF08266 Cadherin_2: Cadherin-like; InterPro: IPR013164 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion. This entry represents a cadherin domain that is usually found at the N terminus of cadherin proteins.; PDB: 1WUZ_A 1WYJ_A.
Probab=95.80 E-value=0.018 Score=50.64 Aligned_cols=60 Identities=35% Similarity=0.571 Sum_probs=37.1
Q ss_pred eEEEEecCCCCCeEEEEEEEEeCCCCCC--ceEEEEecCC--CcceEEeCCcceEEEcccccccc
Q psy1041 861 ASFRVTENALNGTVIFKVNATDLDLGDN--AKVVYSLMTD--TQDFAVDSATGSLYVSASLDRER 921 (1095)
Q Consensus 861 y~~~V~En~~~gt~v~~v~A~D~D~g~n--~~v~Ysl~~~--~~~F~Id~~tG~i~~~~~LD~E~ 921 (1095)
..++|+|..+.|++|+.| |.|.-.... ..-.|++.+. ..+|.++..+|.|+++..+|||.
T Consensus 3 i~YsV~EE~~~Gt~IGni-a~dL~l~~~~l~~~~~ri~s~~~~~~~~v~~~tG~L~v~~rIDRE~ 66 (84)
T PF08266_consen 3 IRYSVPEEMPPGTVIGNI-AKDLGLDPQSLSSRNFRIVSEGNSQYFRVNEKTGDLFVSERIDREE 66 (84)
T ss_dssp EEEEEESS--TT-EEEEC-CCCCT--HHHHCCTTBEEE-SSSS-SEEE-TTTSEEEESS--SCCC
T ss_pred eEEEeecCCCCCCEEEEh-HHhhCCCcccccccceEEeecCCcceeEecCCceeEEeCCccCHHH
Confidence 357899999999999999 444422110 1124555543 38999999999999999999996
No 18
>TIGR01965 VCBS_repeat VCBS repeat. This domain of about 100 residues is found multiple (up to 35) copies in long proteins from several species of Vibrio, Colwellia, Bradyrhizobium, and Shewanella (hence the name VCBS) and in smaller copy numbers in proteins from several other bacteria. The large protein size and repeat copy numbers, species distribution, and suggested activities of several member proteins suggests a role for this domain in adhesion.
Probab=95.33 E-value=0.14 Score=46.22 Aligned_cols=89 Identities=22% Similarity=0.310 Sum_probs=59.6
Q ss_pred EEEEEEcCCCCCCceEEEEEEeCCCCCCCEEEeCCceEEEEc--------ccCCCCCCCEEEEEEEEEECCCCCceeEEE
Q psy1041 983 AILSASDPDLGQGGVVRYTIVSDNEADDVFSIDRLTGTIRVA--------KPLDFEKRQVHSLVVRAKDNGSPPLYSEAT 1054 (1095)
Q Consensus 983 ~~v~A~D~D~g~n~~v~Y~i~~~~~~~~~F~Id~~tG~i~~~--------~~ld~E~~~~~~l~V~A~D~g~p~ls~~~~ 1054 (1095)
+++.++|+|.+.. ..+++......-+.|.|++ +|..+-. +.|..-+...-.|+|.+.|+ .+.+
T Consensus 2 G~Lt~sD~D~gd~--~~~s~~~~~g~yGtlti~~-~G~wtYtl~n~~~avq~L~~Ge~~tdsFtvtv~DG------tt~~ 72 (99)
T TIGR01965 2 GQLTISDADAGQA--HFIAQTDAAGQYGTFSIDA-DGQWTYQADNSQTAVQALKAGETLTDTFTVTSADG------TSQT 72 (99)
T ss_pred CceEEeCCCCCCc--eEEecccccCCcEEEEECC-CCcEEEEeCCCcHHHHhhcCCCEEEEEEEEEEeCC------CeEE
Confidence 4688999997654 4566643223456788887 6754332 12332234456788999995 2788
Q ss_pred EEEEEEecCCCCCCCeeecceEEEEEeCCC
Q psy1041 1055 LIVEVSDVNENMNAPVFSDFVYQATVKENQ 1084 (1095)
Q Consensus 1055 v~I~V~dvNdn~~~P~F~~~~y~~~v~E~~ 1084 (1095)
|+|+|.-.|| +|+.... -...|.|+.
T Consensus 73 vtItI~GtND---apvi~~~-~~g~v~ED~ 98 (99)
T TIGR01965 73 VTITITGAND---AAVIGGA-DTGSVTEDS 98 (99)
T ss_pred EEEEEEccCC---CCEEecc-cceeEecCC
Confidence 9999999999 8977653 457777764
No 19
>smart00736 CADG Dystroglycan-type cadherin-like domains. Cadherin-homologous domains present in metazoan dystroglycans and alpha/epsilon sarcoglycans, yeast Axl2p and in a very large protein from magnetotactic bacteria. Likely to bind calcium ions.
Probab=94.97 E-value=0.23 Score=45.30 Aligned_cols=70 Identities=23% Similarity=0.288 Sum_probs=52.1
Q ss_pred EEeeCCCCCCceEEEEEecCC----CCCcEEecccceEEEceeccccccccEEEEEEEEEECCcCCceeeEEEEEEEEEe
Q psy1041 509 TAKDKDSGENAYISYSIANLK----PVPFEIDHFSGVIKTTQVLDYESMRREYILRVRASDWGLPYRRQTEMQLKIKLLD 584 (1095)
Q Consensus 509 ~A~D~D~g~n~~i~ysi~~~~----~~~F~Id~~tG~i~~~~~lD~E~~~~~~~l~V~a~D~g~p~~~s~~~~v~I~V~d 584 (1095)
...|+| ...++|++.... +.|...|+.++.+.-.. ...+ ...|.|+|+|+|+.+ .+....++|.|.+
T Consensus 23 tF~d~d---~~~lty~~~~~~~~~lP~Wl~fd~~~~~~~GtP-~~~~--~g~~~i~v~a~D~~g---~~~~~~f~i~V~~ 93 (97)
T smart00736 23 TFTDAD---GDTLTYSATLSDGSALPSWLSFDSDTGTLSGTP-TNSD--VGSLSLKVTATDSSG---ASASDTFTITVVN 93 (97)
T ss_pred ceECCC---CCeEEEEEEeCCCCCCCCeEEEeCCCCEEEEEC-CCCC--CcEEEEEEEEEECCC---CEEEEEEEEEEeC
Confidence 346776 346899886432 67899999999888753 3323 256999999999865 3567889999999
Q ss_pred CCC
Q psy1041 585 VND 587 (1095)
Q Consensus 585 vND 587 (1095)
.||
T Consensus 94 ~~~ 96 (97)
T smart00736 94 TND 96 (97)
T ss_pred CCC
Confidence 887
No 20
>smart00736 CADG Dystroglycan-type cadherin-like domains. Cadherin-homologous domains present in metazoan dystroglycans and alpha/epsilon sarcoglycans, yeast Axl2p and in a very large protein from magnetotactic bacteria. Likely to bind calcium ions.
Probab=94.75 E-value=0.36 Score=44.03 Aligned_cols=70 Identities=20% Similarity=0.308 Sum_probs=52.8
Q ss_pred EeeCCCCCCceEEEEEEeCC---CCCcEEEECCccEEEEceecCcccccEEEEEEEEEeCCCCCCceeeeEEEEEEEeec
Q psy1041 403 VSDADDGKNAQVFLEIVGGN---EGGEFNINPETGMLYTAVTLDAEDKAFYTLTVSAIDQGNAGTRKQSAAKVKVNIVDT 479 (1095)
Q Consensus 403 a~D~D~g~n~~i~ysi~~~~---~~~~F~Id~~tG~i~~~~~LD~E~~~~y~l~V~a~D~g~~~~~~~~~~~v~I~V~Dv 479 (1095)
..|.| ...++|++...+ ...+.++++.++.++-. +.... ...|.++|+|+|+. +.+....++|.|.+.
T Consensus 24 F~d~d---~~~lty~~~~~~~~~lP~Wl~fd~~~~~~~Gt-P~~~~-~g~~~i~v~a~D~~----g~~~~~~f~i~V~~~ 94 (97)
T smart00736 24 FTDAD---GDTLTYSATLSDGSALPSWLSFDSDTGTLSGT-PTNSD-VGSLSLKVTATDSS----GASASDTFTITVVNT 94 (97)
T ss_pred eECCC---CCeEEEEEEeCCCCCCCCeEEEeCCCCEEEEE-CCCCC-CcEEEEEEEEEECC----CCEEEEEEEEEEeCC
Confidence 45666 347899986432 35699999999988774 33333 46799999999986 257788999999998
Q ss_pred CC
Q psy1041 480 ND 481 (1095)
Q Consensus 480 ND 481 (1095)
||
T Consensus 95 ~~ 96 (97)
T smart00736 95 ND 96 (97)
T ss_pred CC
Confidence 87
No 21
>PF13750 Big_3_3: Bacterial Ig-like domain (group 3)
Probab=94.70 E-value=2.5 Score=42.24 Aligned_cols=129 Identities=19% Similarity=0.238 Sum_probs=70.2
Q ss_pred CcEEEEEE-EEeecCccccCCCCcceEEEEEEEEeccCCcCeeEEeecCcccccCCc---cEEEEEEEEeCCCCCCCeEE
Q psy1041 224 RSVHDLVV-LGQDRGSVFKGGGKPSSAKLKIKVEQINLYGPEIYVQSLPDIVEQSYA---DIYAIVRVVDRDAGIHGEIA 299 (1095)
Q Consensus 224 ~~~~~l~V-~A~D~g~~~~~~~~s~~~~v~I~V~dvNd~~P~f~~~~~~~~~~~~~~---~~~~~v~a~D~D~g~n~~v~ 299 (1095)
.+.|.+++ .|+|.. +...+..+..++. +...+|.+.. .......+... ..-..+.++|.-.+. .+.
T Consensus 14 dG~Y~l~~~~a~D~a------gN~~~~~~~~~~~-iD~T~Ptisi-~~~~~~~~g~~v~~~~~i~i~~tD~~~~~--~i~ 83 (158)
T PF13750_consen 14 DGSYTLTVVTATDAA------GNTSTSTVSETFT-IDNTPPTISI-SDGASVANGSTVYGLVNISINVTDNSDDS--KIT 83 (158)
T ss_pred CccEEEEEEEEEecC------CCEEEEEEeeEEE-EcCCCCEEEE-ecCCccCCCccccceeeeEEEEEeCCCCc--eEE
Confidence 57899999 799987 3455555554444 3567899986 21111111111 122345666655444 677
Q ss_pred EEEeecCCCCCCeEEeeeccCCCCcCceEEEEECccc-cccCCCCceEEEEEEEECCCCCeeeEEEEEEEe
Q psy1041 300 SLDIVDGDPDGHFRIVPTKIDPGTKKKEYNIVVLKLL-DREIAPLGYNLTLRAVDKGTPPRETYKATQVHL 369 (1095)
Q Consensus 300 ~~~i~~g~~~~~F~i~~~~~~~~~~~g~~~i~~~~~l-D~E~~~~~y~l~v~a~D~g~p~~~s~~~~~i~v 369 (1095)
.+.+.+|.....-.+.... .+.|.+.+.-.+.+ ..|.... |+|+|.|+|..+.. +.+.+....
T Consensus 84 sv~l~Gg~~~d~v~ls~~~----~~~~~~~~~yp~~fpsle~~~~-YtLtV~a~D~aGN~--~~~si~F~y 147 (158)
T PF13750_consen 84 SVSLTGGPASDSVSLSWTN----KGNGVYTLEYPRIFPSLEADDS-YTLTVSATDKAGNQ--STKSISFSY 147 (158)
T ss_pred EEEEECCcccceEEEeeEe----ccCceEEeecccccCCcCCCCe-EEEEEEEEecCCCE--EEEEEEEEE
Confidence 7777666544443333221 22454444322222 2244444 99999999986643 334444443
No 22
>TIGR00864 PCC polycystin cation channel protein. Note: this model has been restricted to the amino half because for technical reasons.
Probab=93.96 E-value=42 Score=47.56 Aligned_cols=222 Identities=15% Similarity=0.162 Sum_probs=101.9
Q ss_pred ccCCCEEEEEEEEEECCCCCCceeEEEEEEEEecCCCCCcccccceEEEEec--CCCCCeEEEEEEEEeCCCCCCceEEE
Q psy1041 816 RERTSEYTLNITVYDLGKPQKSTSKMLPITILDVNDNPPKFEKSLASFRVTE--NALNGTVIFKVNATDLDLGDNAKVVY 893 (1095)
Q Consensus 816 ~E~~~~y~l~V~a~D~g~p~~s~~~~v~I~V~DvNDn~P~F~~~~y~~~V~E--n~~~gt~v~~v~A~D~D~g~n~~v~Y 893 (1095)
|...+.|.++|+|.+.-+ +...++.|.|.+.=. -..|.... ..+ -.+.|.. ..+.|...+ |. .+.|
T Consensus 1750 f~~~G~y~VtVta~N~vs---s~~~s~~V~VqepV~-GL~i~~~~----~~~~~~~~ag~~-v~F~a~vst-Gs--nVsw 1817 (2740)
T TIGR00864 1750 FPSAGLHLVTMKAFNELG---SANASEEVDVQEPIS-GLKIRAAD----AGEQNFFAADSS-VCFQGELAT-GT--NVSW 1817 (2740)
T ss_pred ccCCceEEEEEEEEcccc---ccceeEEEEEEeccc-cceEecCC----CCcccceecCcE-EEEEEEccC-CC--eeEE
Confidence 567789999999998654 223456677753211 11111000 000 0112333 345555432 33 4555
Q ss_pred EecCCCcceEEeCCcceEEEcccccccccCeeEEEEEEEeCCCCCCCcceeEEEEEEEEEeccCCCCCeeecCceEEEEe
Q psy1041 894 SLMTDTQDFAVDSATGSLYVSASLDRERQDLYELKIRASDCDGRNDMYTLHADALVRVTIDDINDNAPNFALPNYSVKVR 973 (1095)
Q Consensus 894 sl~~~~~~F~Id~~tG~i~~~~~LD~E~~~~y~l~V~A~D~~g~~~~~~~~~~~~v~I~V~DvNDn~P~f~~~~y~~~v~ 973 (1095)
.-.-+.+ ...+|. ....-+.+.+.|.+.+.|++.-+ ...++-.+.|.|. .+....+ + -.
T Consensus 1818 ~W~f~~g----~s~~gk---~v~~Tf~~aG~ytV~L~AsN~vs-------~~~~s~~~~VQe~-----I~~L~L~-a-s~ 1876 (2740)
T TIGR00864 1818 CWAIDGG----SSKMGK---HACMTFPDAGTFAIRLNASNAVS-------GKSASREFFAEEP-----IFGLELK-A-SK 1876 (2740)
T ss_pred EEEeCCC----Cccccc---eeEEecCCCeEEEEEEEEEcccC-------cceeeeeEEEEEe-----cceEEEe-c-cc
Confidence 4422111 011221 11244567789999999998764 1223444555542 2111110 0 00
Q ss_pred CCCCCCcEEEEEEEEcCCCCCCceEEEEEEeCCCCCCCEEEeCCceEEEEcccCCCCCCCEEEEEEEEEECCCCCceeEE
Q psy1041 974 EDIPVGTVVAILSASDPDLGQGGVVRYTIVSDNEADDVFSIDRLTGTIRVAKPLDFEKRQVHSLVVRAKDNGSPPLYSEA 1053 (1095)
Q Consensus 974 E~~~~g~~v~~v~A~D~D~g~n~~v~Y~i~~~~~~~~~F~Id~~tG~i~~~~~ld~E~~~~~~l~V~A~D~g~p~ls~~~ 1053 (1095)
....+|..+ ++.|.=. .| ..++|.+.=+++. .-.+++ |.. .. --|-..+.|.++|.|...=+. ..+
T Consensus 1877 ~~~~~n~~v-~fsa~l~-~G--S~Vtf~w~fGdgs--~~~~t~--~~t-~~--HsY~~~G~Y~VtV~A~N~Vs~---~~a 1942 (2740)
T TIGR00864 1877 KIAAIGEKV-EFQILLA-AG--SAVNFRLQIGGAA--PEVLQP--GPR-FS--HSFPRVDDHMVNLRAKNEVSC---AQA 1942 (2740)
T ss_pred ccccCCCEE-EEEEEec-CC--CceEEEEEeCCCC--ceeecC--CCe-EE--eecCCCCcEEEEEEEEeccce---eee
Confidence 112234332 3444211 12 2477777653221 111222 211 11 224467889999999987653 444
Q ss_pred EEEEEEEe------cCCCCCCCeee---cceEEEEEeCCCC
Q psy1041 1054 TLIVEVSD------VNENMNAPVFS---DFVYQATVKENQP 1085 (1095)
Q Consensus 1054 ~v~I~V~d------vNdn~~~P~F~---~~~y~~~v~E~~~ 1085 (1095)
.+.|.|+. +..+ +.+.|. ...+.+.|.++.+
T Consensus 1943 ~~~V~Vle~V~gL~I~~~-c~~~~~~g~~~tF~A~v~~g~~ 1982 (2740)
T TIGR00864 1943 NLHIEVLEAVRGLQIPDC-CAAGIATGEEKNFTANVQRGKP 1982 (2740)
T ss_pred eEEEEEEEeccceEecCC-cccceecCceEEEEEEEecCCc
Confidence 55555442 1121 122232 2456677776665
No 23
>PF13750 Big_3_3: Bacterial Ig-like domain (group 3)
Probab=93.42 E-value=3.2 Score=41.52 Aligned_cols=125 Identities=19% Similarity=0.254 Sum_probs=70.1
Q ss_pred CCEEEEEE-EEEECCCCCCceeEEEEEEEEecCCCCCcccccceEEEEecCCCC-CeEEEEEEEEeCCCCCCceEEEEec
Q psy1041 819 TSEYTLNI-TVYDLGKPQKSTSKMLPITILDVNDNPPKFEKSLASFRVTENALN-GTVIFKVNATDLDLGDNAKVVYSLM 896 (1095)
Q Consensus 819 ~~~y~l~V-~a~D~g~p~~s~~~~v~I~V~DvNDn~P~F~~~~y~~~V~En~~~-gt~v~~v~A~D~D~g~n~~v~Ysl~ 896 (1095)
.+.|.+.+ +|.|..+- ..+..+..++. +...+|.+.- .....+..+... |..=..+.++|.-.+. .....+|.
T Consensus 14 dG~Y~l~~~~a~D~agN--~~~~~~~~~~~-iD~T~Ptisi-~~~~~~~~g~~v~~~~~i~i~~tD~~~~~-~i~sv~l~ 88 (158)
T PF13750_consen 14 DGSYTLTVVTATDAAGN--TSTSTVSETFT-IDNTPPTISI-SDGASVANGSTVYGLVNISINVTDNSDDS-KITSVSLT 88 (158)
T ss_pred CccEEEEEEEEEecCCC--EEEEEEeeEEE-EcCCCCEEEE-ecCCccCCCccccceeeeEEEEEeCCCCc-eEEEEEEE
Confidence 46899999 79997542 33333433333 2344787643 111233333332 2233668888876554 34567777
Q ss_pred CCC--cc--eEEeC-CcceEEEcc---cccccccCeeEEEEEEEeCCCCCCCcceeEEEEEEEEEe
Q psy1041 897 TDT--QD--FAVDS-ATGSLYVSA---SLDRERQDLYELKIRASDCDGRNDMYTLHADALVRVTID 954 (1095)
Q Consensus 897 ~~~--~~--F~Id~-~tG~i~~~~---~LD~E~~~~y~l~V~A~D~~g~~~~~~~~~~~~v~I~V~ 954 (1095)
++. +. ..... ..|...+.- -...|....|.|+|.|+|..| ...+..+.....
T Consensus 89 Gg~~~d~v~ls~~~~~~~~~~~~yp~~fpsle~~~~YtLtV~a~D~aG------N~~~~si~F~y~ 148 (158)
T PF13750_consen 89 GGPASDSVSLSWTNKGNGVYTLEYPRIFPSLEADDSYTLTVSATDKAG------NQSTKSISFSYM 148 (158)
T ss_pred CCcccceEEEeeEeccCceEEeecccccCCcCCCCeEEEEEEEEecCC------CEEEEEEEEEEe
Confidence 654 21 12222 234333321 123477889999999999997 355666666654
No 24
>TIGR00845 caca sodium/calcium exchanger 1. This model is specific for the eukaryotic sodium ion/calcium ion exchangers of the Caca family
Probab=89.52 E-value=21 Score=45.45 Aligned_cols=190 Identities=21% Similarity=0.293 Sum_probs=97.2
Q ss_pred cCCCCCcccCCccEEEEeecCCCCcEEEEEEEeeCCCCCCceEEEEEecCCC---CCcEEecccceEEEce---------
Q psy1041 479 TNDNDPLFDSPEMEVSINENEPAGTSVIKVTAKDKDSGENAYISYSIANLKP---VPFEIDHFSGVIKTTQ--------- 546 (1095)
Q Consensus 479 vNDn~P~f~~~~~~~~V~E~~~~gt~v~~v~A~D~D~g~n~~i~ysi~~~~~---~~F~Id~~tG~i~~~~--------- 546 (1095)
.||..+.|.-..-...|.|+. |+.-.+|.-...|.+..-.+.|+..++.. .-|. +..|.|.-..
T Consensus 395 ~dd~~s~i~Fe~~~Y~V~En~--GtV~VtV~R~GGdl~~tVsVdY~T~DGTA~AG~DY~--~~sGTLtF~PGEt~KtItV 470 (928)
T TIGR00845 395 ENDPVSKIFFEPGHYTCLENC--GTVALTVVRRGGDLTNTVYVDYRTEDGTANAGSDYE--FTEGTLVFKPGETQKEFRI 470 (928)
T ss_pred ccCCcceEEecCCeEEEeecC--cEEEEEEEEccCCCCceEEEEEEccCCccCCCCCcc--ccCceEEECCCceEEEEEE
Confidence 455555544443366889985 77777777665566556778998876532 1232 2345443221
Q ss_pred -ecc---ccccccEEEEEEEEEECCc------------CCceeeEEEEEEEEEeCCCCCCcccccceEEEecCCCCCCcE
Q psy1041 547 -VLD---YESMRREYILRVRASDWGL------------PYRRQTEMQLKIKLLDVNDNRPQFEKVDCLGHVPRNLPIGRE 610 (1095)
Q Consensus 547 -~lD---~E~~~~~~~l~V~a~D~g~------------p~~~s~~~~v~I~V~dvNDn~P~f~~~~~~~~V~E~~~~g~~ 610 (1095)
-+| +|. ...|.+.+.--..+. +.........+|+|.| ||++|.|.-..-...|.|+. |..
T Consensus 471 ~IIDDdi~E~-DE~F~V~LSNp~~g~~~G~~~~~~~~~~A~Lg~ps~ATVTIlD-DD~aGIfsFe~~~~sV~Es~--G~v 546 (928)
T TIGR00845 471 GIIDDDIFEE-DEHFYVRLSNLRVGSEDGILEANHVSAVAQLASPNTATVTILD-DDHAGIFTFEEDVFHVSESI--GIM 546 (928)
T ss_pred EEccCCCCCC-CceEEEEEeCCCCCCcccccccccccccceecCCceEEEEEec-CcccCcccccCceEEEEcCC--CEE
Confidence 111 232 234444443211110 1112223456777777 88899876655567888874 554
Q ss_pred EEEEE-EEeCCCCCeEEEEEEeCCCCC---cEEEeCCCcEEEEeeccCcccccEEEEEEEEecCCCccceEEEEEEE
Q psy1041 611 IITLS-AIDFDAGNIISYRIVSGNEDG---CFALDITSGVLSIACDLTDVRVNEREINVTATDSAHFSDVVRIRINL 683 (1095)
Q Consensus 611 v~~v~-A~D~D~~~~i~y~i~~~~~~~---~F~Id~~tG~i~~~~~ld~~~~~~~~l~V~atD~~~~s~~~~v~I~v 683 (1095)
-.+|. ..+.+..-.+.|.-..|...+ -|. ..+|.|..... ...-.++|...|.........+.|++
T Consensus 547 tvtV~RtsGa~G~VtV~Y~T~dGTA~aGg~DY~--~~sGtLtF~~G-----EtsKtItV~IiDD~~~E~dEtF~V~L 616 (928)
T TIGR00845 547 EVKVLRTSGARGTVIVPYRTVEGTARGGGKDFE--DTCGELEFEND-----ETEKTIRVKIVDDEEYEKNDTFFIEL 616 (928)
T ss_pred EEEEEEcCCCCeeEEEEEEeecCccCCCCCCcc--cccceEEEcCC-----cEEEEEEEEEcCCCcccCceeEEEEE
Confidence 44443 333322226888877664432 233 35666654422 12334555555554433334444444
No 25
>TIGR00845 caca sodium/calcium exchanger 1. This model is specific for the eukaryotic sodium ion/calcium ion exchangers of the Caca family
Probab=83.96 E-value=94 Score=39.87 Aligned_cols=165 Identities=23% Similarity=0.277 Sum_probs=83.8
Q ss_pred cCCCCCcccccceEEEEecCCCCCeEEEEEEEEeCCCCCCceEEEEecCCC----cceEEeCCcceEEEcc---------
Q psy1041 849 VNDNPPKFEKSLASFRVTENALNGTVIFKVNATDLDLGDNAKVVYSLMTDT----QDFAVDSATGSLYVSA--------- 915 (1095)
Q Consensus 849 vNDn~P~F~~~~y~~~V~En~~~gt~v~~v~A~D~D~g~n~~v~Ysl~~~~----~~F~Id~~tG~i~~~~--------- 915 (1095)
.||..+.|.-..-...|.||. |++-.+|.=...|.+....+.|+..++. ..|. +.+|.|....
T Consensus 395 ~dd~~s~i~Fe~~~Y~V~En~--GtV~VtV~R~GGdl~~tVsVdY~T~DGTA~AG~DY~--~~sGTLtF~PGEt~KtItV 470 (928)
T TIGR00845 395 ENDPVSKIFFEPGHYTCLENC--GTVALTVVRRGGDLTNTVYVDYRTEDGTANAGSDYE--FTEGTLVFKPGETQKEFRI 470 (928)
T ss_pred ccCCcceEEecCCeEEEeecC--cEEEEEEEEccCCCCceEEEEEEccCCccCCCCCcc--ccCceEEECCCceEEEEEE
Confidence 455555544344456699985 6777777655445444456889988764 3343 2355543321
Q ss_pred -cc---cccccCeeEEEEEEEeCC---CCC------CCcceeEEEEEEEEEeccCCCCCeeecCceEEEEeCCCCCCcEE
Q psy1041 916 -SL---DRERQDLYELKIRASDCD---GRN------DMYTLHADALVRVTIDDINDNAPNFALPNYSVKVREDIPVGTVV 982 (1095)
Q Consensus 916 -~L---D~E~~~~y~l~V~A~D~~---g~~------~~~~~~~~~~v~I~V~DvNDn~P~f~~~~y~~~v~E~~~~g~~v 982 (1095)
-+ -.|....|.+.+.--..+ |.. ....+......+|+|.| ||++|.|....-...|.|+. |..-
T Consensus 471 ~IIDDdi~E~DE~F~V~LSNp~~g~~~G~~~~~~~~~~A~Lg~ps~ATVTIlD-DD~aGIfsFe~~~~sV~Es~--G~vt 547 (928)
T TIGR00845 471 GIIDDDIFEEDEHFYVRLSNLRVGSEDGILEANHVSAVAQLASPNTATVTILD-DDHAGIFTFEEDVFHVSESI--GIME 547 (928)
T ss_pred EEccCCCCCCCceEEEEEeCCCCCCcccccccccccccceecCCceEEEEEec-CcccCcccccCceEEEEcCC--CEEE
Confidence 11 133344454444321111 000 00112223456677777 88899877655566788974 5554
Q ss_pred EEEE-EEcCCCCCCceEEEEEEeCCCCCC--CEEEeCCceEEEEc
Q psy1041 983 AILS-ASDPDLGQGGVVRYTIVSDNEADD--VFSIDRLTGTIRVA 1024 (1095)
Q Consensus 983 ~~v~-A~D~D~g~n~~v~Y~i~~~~~~~~--~F~Id~~tG~i~~~ 1024 (1095)
.+|. ..+.+ | .-.+.|.-.++....+ -|. ..+|.|...
T Consensus 548 vtV~RtsGa~-G-~VtV~Y~T~dGTA~aGg~DY~--~~sGtLtF~ 588 (928)
T TIGR00845 548 VKVLRTSGAR-G-TVIVPYRTVEGTARGGGKDFE--DTCGELEFE 588 (928)
T ss_pred EEEEEcCCCC-e-eEEEEEEeecCccCCCCCCcc--cccceEEEc
Confidence 4443 32222 1 2246687766432221 343 346666543
No 26
>PF05345 He_PIG: Putative Ig domain; InterPro: IPR008009 This alignment represents the conserved core region of a ~90 residue repeat found in several haemagglutinins and other cell surface proteins. Sequence similarities to Hyalin (IPR003410 from INTERPRO) and the PKD domain (IPR000601 from INTERPRO) suggest an Ig-like fold so this family may be similar in function to the (IPR003791 from INTERPRO) and (IPR003790 from INTERPRO) protein families.
Probab=83.83 E-value=3 Score=32.56 Aligned_cols=36 Identities=19% Similarity=0.422 Sum_probs=27.7
Q ss_pred CcceEEeCCcceEEEccccccc-ccCeeEEEEEEEeCCC
Q psy1041 899 TQDFAVDSATGSLYVSASLDRE-RQDLYELKIRASDCDG 936 (1095)
Q Consensus 899 ~~~F~Id~~tG~i~~~~~LD~E-~~~~y~l~V~A~D~~g 936 (1095)
.....||+.+|.|+-. .+.+ ....|.++|.|+|..|
T Consensus 13 P~gLs~d~~tG~isGt--p~~~~~~G~y~~~vtatd~~G 49 (49)
T PF05345_consen 13 PSGLSLDPSTGTISGT--PTSSVQPGTYTFTVTATDGSG 49 (49)
T ss_pred CCcEEEeCCCCEEEee--cCCCccccEEEEEEEEEcCCC
Confidence 3788999999999655 4434 3369999999999763
No 27
>PF05345 He_PIG: Putative Ig domain; InterPro: IPR008009 This alignment represents the conserved core region of a ~90 residue repeat found in several haemagglutinins and other cell surface proteins. Sequence similarities to Hyalin (IPR003410 from INTERPRO) and the PKD domain (IPR000601 from INTERPRO) suggest an Ig-like fold so this family may be similar in function to the (IPR003791 from INTERPRO) and (IPR003790 from INTERPRO) protein families.
Probab=83.50 E-value=3.7 Score=32.07 Aligned_cols=37 Identities=16% Similarity=0.310 Sum_probs=28.3
Q ss_pred cCCccEEEEccccEEEEccCCCcC-CCcEEEEEEEEeecC
Q psy1041 199 DMNEQFSIHPTSGVVTLTRPLKYT-DRSVHDLVVLGQDRG 237 (1095)
Q Consensus 199 ~~~~~F~id~~tG~i~~~~~ld~e-~~~~~~l~V~A~D~g 237 (1095)
......+||+.+|.|+-.. +.. ..+.|.|.|+|+|..
T Consensus 11 ~LP~gLs~d~~tG~isGtp--~~~~~~G~y~~~vtatd~~ 48 (49)
T PF05345_consen 11 GLPSGLSLDPSTGTISGTP--TSSVQPGTYTFTVTATDGS 48 (49)
T ss_pred CCCCcEEEeCCCCEEEeec--CCCccccEEEEEEEEEcCC
Confidence 3467899999999998663 222 347899999999964
No 28
>PF05895 DUF859: Siphovirus protein of unknown function (DUF859); InterPro: IPR008577 This entry is represented by Streptococcus phage 7201, Orf39. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This family consists of several uncharacterised proteins from a number of the Siphoviruses as well as some bacterial proteins from Streptococcus species. Some of the members of this family are described as putative minor structural proteins.
Probab=80.02 E-value=1.2e+02 Score=37.40 Aligned_cols=110 Identities=14% Similarity=0.234 Sum_probs=63.5
Q ss_pred cCeeEEEEEEEeCCCCCCCcceeEEEEEEEEEeccCCCCCeeecCceEEEEeCCCCCCcEEEEEEEEcCCC---C--CC-
Q psy1041 922 QDLYELKIRASDCDGRNDMYTLHADALVRVTIDDINDNAPNFALPNYSVKVREDIPVGTVVAILSASDPDL---G--QG- 995 (1095)
Q Consensus 922 ~~~y~l~V~A~D~~g~~~~~~~~~~~~v~I~V~DvNDn~P~f~~~~y~~~v~E~~~~g~~v~~v~A~D~D~---g--~n- 995 (1095)
.....+++.++|..|. .+....++|+|++-. +|.+....+.. .++ .........|.=... | .|
T Consensus 297 ~G~~Ti~atVtDSRGr-----~S~~~~~tItVl~Y~--~P~lsfsv~R~--~~~--~~~~~v~~~a~Iapl~v~g~qKN~ 365 (624)
T PF05895_consen 297 SGSATIRATVTDSRGR-----TSDPKTKTITVLEYS--PPTLSFSVYRC--GSS--GNTLTVTRNAKIAPLTVNGVQKNT 365 (624)
T ss_pred CceEEEEEEEEECCCc-----cCCceEEEEEEEEcC--CCcEEEEEEEe--CCC--CcEEEEEEEEEEeEEEEcccccce
Confidence 5678999999999973 355678999999876 78775333222 111 112222233322221 1 12
Q ss_pred ceEEEEEEeCCCCCCCEEEeCC--ceE-----------EEEcccCCCCCCCEEEEEEEEEECCC
Q psy1041 996 GVVRYTIVSDNEADDVFSIDRL--TGT-----------IRVAKPLDFEKRQVHSLVVRAKDNGS 1046 (1095)
Q Consensus 996 ~~v~Y~i~~~~~~~~~F~Id~~--tG~-----------i~~~~~ld~E~~~~~~l~V~A~D~g~ 1046 (1095)
-.++|+... -....|.+|.. .|. ..+.. .|.....|.+.+.++|.=.
T Consensus 366 ~~lt~~~a~--~gt~~~t~d~~~a~~~~s~~s~~~~~~~~L~g--~y~~~kSy~V~~~l~D~F~ 425 (624)
T PF05895_consen 366 MTLTFKVAP--LGTGTFTTDNGSASGTWSSISELTNSSANLGG--TYDAEKSYDVRGTLSDKFT 425 (624)
T ss_pred EEEEEEEEE--cCcceEEEEccccccceeeeeeecccceeecc--ccCCCceEEEEEEEEEEee
Confidence 256777766 23566776642 111 22223 3446788999999999754
No 29
>KOG3597|consensus
Probab=73.03 E-value=1.1e+02 Score=35.85 Aligned_cols=153 Identities=24% Similarity=0.242 Sum_probs=86.7
Q ss_pred eEEEEEEEEecCCCCCcccccceEEEEecCCCCCeEEEEEEEEeCCCCCCceEEEEecCCCc------ceEEeCC-----
Q psy1041 839 SKMLPITILDVNDNPPKFEKSLASFRVTENALNGTVIFKVNATDLDLGDNAKVVYSLMTDTQ------DFAVDSA----- 907 (1095)
Q Consensus 839 ~~~v~I~V~DvNDn~P~F~~~~y~~~V~En~~~gt~v~~v~A~D~D~g~n~~v~Ysl~~~~~------~F~Id~~----- 907 (1095)
+....|.|..+||++..+....+.+.+.|+...-...-.+.+.|+|.++ ..+.|++....+ .|..-..
T Consensus 25 ~~~~~i~v~pvndpp~~~~~~~~~l~~~~~~~k~l~~~~l~~~d~d~~~-~~l~f~v~~t~~~~~~~~~~~~~g~~~~~F 103 (442)
T KOG3597|consen 25 TDVLRIHVNPVNDPPSLIFPSGSLLVILEGGQKVLDPELLTAADPDSAP-LPLEFQVLGTSSVPLPVLKFDVPGAPATEF 103 (442)
T ss_pred EeeecccccccCCCcceeecccceEEeecCCceeccceEeeccCCCCCc-cceEEEEccCCCCCCccceeeccCCcccce
Confidence 4468899999999777676666778888886544444668899998764 457888875431 1333221
Q ss_pred ------cceEEEccccccc--ccCeeEEEEEEEeCCCCCCCcceeEEEEEEEEEeccCCCCCeeecCceEEEEeCCCCCC
Q psy1041 908 ------TGSLYVSASLDRE--RQDLYELKIRASDCDGRNDMYTLHADALVRVTIDDINDNAPNFALPNYSVKVREDIPVG 979 (1095)
Q Consensus 908 ------tG~i~~~~~LD~E--~~~~y~l~V~A~D~~g~~~~~~~~~~~~v~I~V~DvNDn~P~f~~~~y~~~v~E~~~~g 979 (1095)
.|.+. +++. +...+.++.+++|+- ..+. .+....... --+|.+... ....|.-+...-
T Consensus 104 s~~~v~~g~~~----yvh~g~el~~~~~~~~~SDg~-------~~S~-~~i~~~~~~-~~~~~~~~~-~gL~v~~gS~~~ 169 (442)
T KOG3597|consen 104 SYEEVEDGSLS----YVHSGTELRESELQLRVSDGL-------LVSE-RAILKVEAT-GPAPHLARN-TGLKVLQGSTAP 169 (442)
T ss_pred EehHhhcCcee----EEecCcccccceEEEEeecce-------Eeee-eEEecccCC-Ccceeeecc-cceEEccCcccc
Confidence 22222 2222 256778889999876 2222 111111111 112333221 122332222111
Q ss_pred cEEEEEEEEcCCCCCCceEEEEEEeCC
Q psy1041 980 TVVAILSASDPDLGQGGVVRYTIVSDN 1006 (1095)
Q Consensus 980 ~~v~~v~A~D~D~g~n~~v~Y~i~~~~ 1006 (1095)
-.-..+.+.|.|+++.-.+.|.|....
T Consensus 170 IT~~~L~ved~d~~~d~~v~~~i~~~P 196 (442)
T KOG3597|consen 170 ITPSNLSVEDNDSSPDDEVRYDITPPP 196 (442)
T ss_pred ccHhHceeecCCCCCCcEEEEEecCCC
Confidence 111347899999877778999998853
No 30
>PF13753 SWM_repeat: Putative flagellar system-associated repeat
Probab=71.37 E-value=1.8e+02 Score=32.72 Aligned_cols=206 Identities=18% Similarity=0.229 Sum_probs=90.3
Q ss_pred ccEEEEEEEEEeCCCCCCceeeeEEEEEEEeecCCCCCcccCCccEEEEeecCC------CCcEEEEEEEeeCCCCCCce
Q psy1041 447 KAFYTLTVSAIDQGNAGTRKQSAAKVKVNIVDTNDNDPLFDSPEMEVSINENEP------AGTSVIKVTAKDKDSGENAY 520 (1095)
Q Consensus 447 ~~~y~l~V~a~D~g~~~~~~~~~~~v~I~V~DvNDn~P~f~~~~~~~~V~E~~~------~gt~v~~v~A~D~D~g~n~~ 520 (1095)
...|.+.++++|..+. .+.....|.|--. +|...-. .+.++.. .....+...+++.+.+ ..
T Consensus 11 d~~~~v~vt~tD~aGN----~~~~t~~~~vDt~---~P~v~i~----~~~~~~~~~~~~~~~~~t~s~tvs~~~~g--~~ 77 (317)
T PF13753_consen 11 DGTYTVSVTVTDAAGN----TSTATQSITVDTT---APTVTIT----SIADDDIINGDEATNTVTFSGTVSGAEPG--ST 77 (317)
T ss_pred CCcEEEEEEEEeCCCC----eeeeeEEEEEecC---CCceeee----cccCCCccccceeeeeeEEEEEecCCCCC--CE
Confidence 4578999999998632 3334455543322 6643221 1111111 1222345555555543 34
Q ss_pred EEEEEecCCCCCcEEecccceEEEceec--cccccccEEEEEEE-EEECCcCCceeeEEEEEEEEEeCCCCCCcccccce
Q psy1041 521 ISYSIANLKPVPFEIDHFSGVIKTTQVL--DYESMRREYILRVR-ASDWGLPYRRQTEMQLKIKLLDVNDNRPQFEKVDC 597 (1095)
Q Consensus 521 i~ysi~~~~~~~F~Id~~tG~i~~~~~l--D~E~~~~~~~l~V~-a~D~g~p~~~s~~~~v~I~V~dvNDn~P~f~~~~~ 597 (1095)
+.+.+ ++....+..+ ..|.....-.. +.+ ...|.+.+. ++|..+-... . ....+.|-..--.+|.+.-...
T Consensus 78 v~v~~-~g~~~t~~~~-~~G~ws~t~~~~~~l~--~g~~ti~v~~~tD~aGN~~t-~-~s~~~~vDt~~~~~p~vti~~~ 151 (317)
T PF13753_consen 78 VTVTI-NGTTGTLTAD-ADGNWSVTVTPSDDLP--DGDYTITVTTVTDAAGNTST-A-ASQTFTVDTTAPTAPTVTITGI 151 (317)
T ss_pred EEEEE-CCEEEEEEEe-cCCcEEEeeccccccc--cCcceeEEEEEEccCCcccc-c-cccccccccccccccccceecc
Confidence 66665 2233344444 35543322111 112 357899999 9998542211 1 2344433222112455443210
Q ss_pred EE--EecCCCCCCcEEEEEEEEeCCCCCeEEEEEEeCCCCCcEEEeCCCcEEEEe---eccCcccccEEEEEEEEecCCC
Q psy1041 598 LG--HVPRNLPIGREIITLSAIDFDAGNIISYRIVSGNEDGCFALDITSGVLSIA---CDLTDVRVNEREINVTATDSAH 672 (1095)
Q Consensus 598 ~~--~V~E~~~~g~~v~~v~A~D~D~~~~i~y~i~~~~~~~~F~Id~~tG~i~~~---~~ld~~~~~~~~l~V~atD~~~ 672 (1095)
.. .+.......+..+.-...+.+.+..+...+ .|... .+.+.. .|..++. ..+.......|.+.+.++|..+
T Consensus 152 ~~~~~~~~~~~~~t~t~sg~v~~~~~~d~v~vt~-~G~~~-~~~~~~-~g~~t~~~~~~~~~~~~d~~~~v~v~~tD~AG 228 (317)
T PF13753_consen 152 SDDNIINGAESTVTVTFSGTVTGFDAGDTVTVTI-NGTTY-TTTVGA-DGTWTVTVTPSDLAGLADGTYTVTVTVTDAAG 228 (317)
T ss_pred cCCceeeccceeecccccccceeeeeceeEEEee-ccccc-ceeecC-CCcccccccccccccccCceEEEEEEeeeccc
Confidence 00 000000011111222223445555455554 22222 344443 2322221 1233345568999999999765
Q ss_pred cc
Q psy1041 673 FS 674 (1095)
Q Consensus 673 ~s 674 (1095)
-.
T Consensus 229 N~ 230 (317)
T PF13753_consen 229 NT 230 (317)
T ss_pred Cc
Confidence 43
No 31
>KOG3597|consensus
Probab=70.58 E-value=95 Score=36.39 Aligned_cols=157 Identities=13% Similarity=0.115 Sum_probs=88.0
Q ss_pred EEEEEEeccCCCCCCCccCCcEEEEEeCCCCCCceEEEEEEeeCCCCCCceEEEEEEeCCCC----CcEEEECCcc-EEE
Q psy1041 363 KATQVHLVDLNDNKPVFDREIYEVDVPETTPVNTPIIRLKVSDADDGKNAQVFLEIVGGNEG----GEFNINPETG-MLY 437 (1095)
Q Consensus 363 ~~~~i~v~d~Nd~~P~F~~~~~~~~v~E~~~~g~~v~~v~a~D~D~g~n~~i~ysi~~~~~~----~~F~Id~~tG-~i~ 437 (1095)
....|.+..+||.+..+....+..-+.|+...-.....+.+.|+|.+. ..+.|++.+.... +.|..-..++ ..+
T Consensus 26 ~~~~i~v~pvndpp~~~~~~~~~l~~~~~~~k~l~~~~l~~~d~d~~~-~~l~f~v~~t~~~~~~~~~~~~~g~~~~~Fs 104 (442)
T KOG3597|consen 26 DVLRIHVNPVNDPPSLIFPSGSLLVILEGGQKVLDPELLTAADPDSAP-LPLEFQVLGTSSVPLPVLKFDVPGAPATEFS 104 (442)
T ss_pred eeecccccccCCCcceeecccceEEeecCCceeccceEeeccCCCCCc-cceEEEEccCCCCCCccceeeccCCcccceE
Confidence 567788999999766666666668888886544445678888988754 4578888753211 1233322211 111
Q ss_pred Ec------eecCccc--ccEEEEEEEEEeCCCCCCceeeeEEEEEEEeecCCCCCcccCCccEEEEeecCCCCcEEEEEE
Q psy1041 438 TA------VTLDAED--KAFYTLTVSAIDQGNAGTRKQSAAKVKVNIVDTNDNDPLFDSPEMEVSINENEPAGTSVIKVT 509 (1095)
Q Consensus 438 ~~------~~LD~E~--~~~y~l~V~a~D~g~~~~~~~~~~~v~I~V~DvNDn~P~f~~~~~~~~V~E~~~~gt~v~~v~ 509 (1095)
.. -.+++.. ...+.++..++|+- ..+. ..|...-..--.|.+.... ...|.-++..--.-..+.
T Consensus 105 ~~~v~~g~~~yvh~g~el~~~~~~~~~SDg~-----~~S~--~~i~~~~~~~~~~~~~~~~-gL~v~~gS~~~IT~~~L~ 176 (442)
T KOG3597|consen 105 YEEVEDGSLSYVHSGTELRESELQLRVSDGL-----LVSE--RAILKVEATGPAPHLARNT-GLKVLQGSTAPITPSNLS 176 (442)
T ss_pred ehHhhcCceeEEecCcccccceEEEEeecce-----Eeee--eEEecccCCCcceeeeccc-ceEEccCccccccHhHce
Confidence 11 1133333 56788999999974 2222 1222211111233333331 333333322211123688
Q ss_pred EeeCCCCCCceEEEEEecC
Q psy1041 510 AKDKDSGENAYISYSIANL 528 (1095)
Q Consensus 510 A~D~D~g~n~~i~ysi~~~ 528 (1095)
+.|.|++..-.+.|.|...
T Consensus 177 ved~d~~~d~~v~~~i~~~ 195 (442)
T KOG3597|consen 177 VEDNDSSPDDEVRYDITPP 195 (442)
T ss_pred eecCCCCCCcEEEEEecCC
Confidence 9999977777899999874
No 32
>TIGR00864 PCC polycystin cation channel protein. Note: this model has been restricted to the amino half because for technical reasons.
Probab=70.05 E-value=5.3e+02 Score=37.54 Aligned_cols=123 Identities=19% Similarity=0.222 Sum_probs=68.2
Q ss_pred ccccceEEEEEEEEEeccCCCCCCceeEEEEEEEEEecCCCCCCCCCCCceEEEeeCCCCCCcEEEEEEee-eCCCCCCe
Q psy1041 113 RERKDKYILHIKATITHRDGKKASYEETTCKVHVNVLDTNDLNPLFYPTEYEETVPEDLPLHTSILRVSAE-DADLGRNG 191 (1095)
Q Consensus 113 ~E~~~~y~l~V~A~d~~~~~~~~~~~~~~~~v~I~V~D~NDn~P~F~~~~y~~~v~E~~~~g~~v~~v~A~-D~D~g~n~ 191 (1095)
|-....|+|+|+|+.. -...+....|+|.-.|-. +..-...++-=.++|..+ .++|. =.|.+.+.
T Consensus 947 y~~a~~~~l~~ta~n~--------~s~~~~~~~vt~~~~~~m-----~~l~v~~~p~v~~~~~~~-~~~~~~~vd~~~~~ 1012 (2740)
T TIGR00864 947 YQHAAVFKLSLTAMNH--------VSNLTEDFNVTVDRLNPM-----QGLQVKGVPAVLPPGATL-ALTAGVLIDMAVEA 1012 (2740)
T ss_pred eecccEEEEEEEEecc--------ccceEEEEEEEehhcccc-----cccEEecCccccCCCceE-EEeeeeEeecccce
Confidence 4557789999999872 123346666777555431 111122455555667665 33333 34556655
Q ss_pred EEEEEEecCC-cc----------EEE-EccccEEEEccCC--CcCCCcEEEEEEEEeecCccccCCCCcceEEEEEEEE
Q psy1041 192 EIYYSFRDMN-EQ----------FSI-HPTSGVVTLTRPL--KYTDRSVHDLVVLGQDRGSVFKGGGKPSSAKLKIKVE 256 (1095)
Q Consensus 192 ~v~Y~l~~~~-~~----------F~i-d~~tG~i~~~~~l--d~e~~~~~~l~V~A~D~g~~~~~~~~s~~~~v~I~V~ 256 (1095)
...+++.+.. .. |-+ ||..+.+-+.+.. -|-..+.|++++.|++.-. +.+..+.|+|.
T Consensus 1013 ~~~w~fgDG~~~~~~~~~py~~~~~~~~~~~~q~l~eqnptH~Y~~~G~YTVtLtVsN~~~-------~it~~i~VsV~ 1084 (2740)
T TIGR00864 1013 AFLWNFGDGEQALFEFKPPYNESFPCPDPSPAQVLLEHNVMHIYAAPGEYLATVLASNAFE-------NISQQINMSVR 1084 (2740)
T ss_pred EEEEEeCCCCeeEEeccCCCcccccCCCCccceeecccCcceEECCCCcEEEEEEEEcCCC-------ccceEEEEEEe
Confidence 5556554321 12 222 3344555444332 3556889999999999751 34556666663
No 33
>PF07495 Y_Y_Y: Y_Y_Y domain; InterPro: IPR011123 This region is mostly found at the end of the beta propellers (IPR011110 from INTERPRO) in a family of two component regulators. However they are also found tandemly repeated in Q891H4 from SWISSPROT without other signal conduction domains being present. It is named after the conserved tyrosines found in the alignment. The exact function is not known.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=66.57 E-value=41 Score=27.66 Aligned_cols=63 Identities=27% Similarity=0.345 Sum_probs=35.8
Q ss_pred CCCCceEEEEEecCCCCCcEEecccceEEEceeccccccccEEEEEEEEEECCcCCceeeEEEEEEEEE
Q psy1041 515 SGENAYISYSIANLKPVPFEIDHFSGVIKTTQVLDYESMRREYILRVRASDWGLPYRRQTEMQLKIKLL 583 (1095)
Q Consensus 515 ~g~n~~i~ysi~~~~~~~F~Id~~tG~i~~~~~lD~E~~~~~~~l~V~a~D~g~p~~~s~~~~v~I~V~ 583 (1095)
.+.+-...|.|.+.+..+..+...+-.+..+ .|. ++.|+|.|+|.|..+..... ...+.|+|+
T Consensus 4 ~~~~~~Y~Y~l~g~d~~W~~~~~~~~~~~~~-~L~----~G~Y~l~V~a~~~~~~~~~~-~~~l~i~I~ 66 (66)
T PF07495_consen 4 NPENIRYRYRLEGFDDEWITLGSYSNSISYT-NLP----PGKYTLEVRAKDNNGKWSSD-EKSLTITIL 66 (66)
T ss_dssp CCTTEEEEEEEETTESSEEEESSTS-EEEEE-S------SEEEEEEEEEEETTS-B-SS--EEEEEEEE
T ss_pred CCCceEEEEEEECCCCeEEECCCCcEEEEEE-eCC----CEEEEEEEEEECCCCCcCcc-cEEEEEEEC
Confidence 3445677888887665555554432233332 232 67899999999975532222 256666663
No 34
>PF13753 SWM_repeat: Putative flagellar system-associated repeat
Probab=66.08 E-value=2.3e+02 Score=31.88 Aligned_cols=130 Identities=17% Similarity=0.236 Sum_probs=61.2
Q ss_pred cceEEEEEEEEEeccCCCCCCceeEEEEEEEEEecCCCCCCCCCCCceEEEeeCCC------CCCcEEEEEEeeeCCCCC
Q psy1041 116 KDKYILHIKATITHRDGKKASYEETTCKVHVNVLDTNDLNPLFYPTEYEETVPEDL------PLHTSILRVSAEDADLGR 189 (1095)
Q Consensus 116 ~~~y~l~V~A~d~~~~~~~~~~~~~~~~v~I~V~D~NDn~P~F~~~~y~~~v~E~~------~~g~~v~~v~A~D~D~g~ 189 (1095)
-..|.+.|+++|. +| -.+.....|.|--. +|...-. .+.++. ......+...+++.+.|.
T Consensus 11 d~~~~v~vt~tD~--aG-----N~~~~t~~~~vDt~---~P~v~i~----~~~~~~~~~~~~~~~~~t~s~tvs~~~~g~ 76 (317)
T PF13753_consen 11 DGTYTVSVTVTDA--AG-----NTSTATQSITVDTT---APTVTIT----SIADDDIINGDEATNTVTFSGTVSGAEPGS 76 (317)
T ss_pred CCcEEEEEEEEeC--CC-----CeeeeeEEEEEecC---CCceeee----cccCCCccccceeeeeeEEEEEecCCCCCC
Confidence 3468999999994 22 12224445553222 5643222 112211 112223455555555443
Q ss_pred CeEEEEEEecCCccEEEEccccEEEEcc-CCCcCCCcEEEEEEE-EeecCccccCCCCcceE-EEEEEEEeccCCcCeeE
Q psy1041 190 NGEIYYSFRDMNEQFSIHPTSGVVTLTR-PLKYTDRSVHDLVVL-GQDRGSVFKGGGKPSSA-KLKIKVEQINLYGPEIY 266 (1095)
Q Consensus 190 n~~v~Y~l~~~~~~F~id~~tG~i~~~~-~ld~e~~~~~~l~V~-A~D~g~~~~~~~~s~~~-~v~I~V~dvNd~~P~f~ 266 (1095)
.+.+.+......+..+ ..|.-...- +-+.-..+.|.+.+. ++|.. |..+.+ ...+.|...--.+|.+.
T Consensus 77 --~v~v~~~g~~~t~~~~-~~G~ws~t~~~~~~l~~g~~ti~v~~~tD~a------GN~~t~~s~~~~vDt~~~~~p~vt 147 (317)
T PF13753_consen 77 --TVTVTINGTTGTLTAD-ADGNWSVTVTPSDDLPDGDYTITVTTVTDAA------GNTSTAASQTFTVDTTAPTAPTVT 147 (317)
T ss_pred --EEEEEECCEEEEEEEe-cCCcEEEeeccccccccCcceeEEEEEEccC------Cccccccccccccccccccccccc
Confidence 5555553323334444 345322111 111223568999999 99987 333333 44454433212357665
Q ss_pred Ee
Q psy1041 267 VQ 268 (1095)
Q Consensus 267 ~~ 268 (1095)
..
T Consensus 148 i~ 149 (317)
T PF13753_consen 148 IT 149 (317)
T ss_pred ee
Confidence 44
No 35
>TIGR03660 T1SS_rpt_143 T1SS-143 repeat domain. This model represents a domain of about 143 amino acids that may occur singly or in up to 23 tandem repeats in very large proteins in the genus Vibrio, and in related species such as Legionella pneumophila, Photobacterium profundum, Rhodopseudomonas palustris, Shewanella pealeana, and Aeromonas hydrophila. Proteins with these domains represent a subset of a broader set of proteins with a particular signal for type 1 secretion, consisting of several glycine-rich repeats modeled by pfam00353, followed by a C-terminal domain modeled by TIGR03661. Proteins with this domain tend to share several properties with the RtxA (Repeats in Toxin) protein of Vibrio cholerae, including a large size often containing tandemly repeated domains and a C-terminal signal for type 1 secretion.
Probab=61.12 E-value=53 Score=31.95 Aligned_cols=62 Identities=23% Similarity=0.273 Sum_probs=40.8
Q ss_pred CceEEEEECccccccCCC--CceEEEEEEEECCCCCeeeEEEEEEEeccCCCCCCCccCCcEEEEEeCCC
Q psy1041 325 KKEYNIVVLKLLDREIAP--LGYNLTLRAVDKGTPPRETYKATQVHLVDLNDNKPVFDREIYEVDVPETT 392 (1095)
Q Consensus 325 ~g~~~i~~~~~lD~E~~~--~~y~l~v~a~D~g~p~~~s~~~~~i~v~d~Nd~~P~F~~~~~~~~v~E~~ 392 (1095)
.|.|.+.+.++||..... ..+.|.|.|+|..+-... ..+.|+|.| | .|+..... ..+|.|+.
T Consensus 65 ~GsYtftL~~~lDH~~g~d~l~l~~~v~a~D~DGD~s~--~~l~VtI~D--D-~P~~~~~~-~~~V~E~~ 128 (137)
T TIGR03660 65 DGSYEFTLEGPLDHAAGSDELTLNFPIIATDFDGDTSS--ITLPVTIVD--D-VPTITDVD-ALTVDEDD 128 (137)
T ss_pred CccEEEEEcccccCCCCCceEEEeeeEEEEeCCCCccc--cEEEEEEEC--C-CCeecccc-ceEEeccc
Confidence 577888889999985421 137899999997554422 466777766 6 57554433 36788853
No 36
>TIGR03660 T1SS_rpt_143 T1SS-143 repeat domain. This model represents a domain of about 143 amino acids that may occur singly or in up to 23 tandem repeats in very large proteins in the genus Vibrio, and in related species such as Legionella pneumophila, Photobacterium profundum, Rhodopseudomonas palustris, Shewanella pealeana, and Aeromonas hydrophila. Proteins with these domains represent a subset of a broader set of proteins with a particular signal for type 1 secretion, consisting of several glycine-rich repeats modeled by pfam00353, followed by a C-terminal domain modeled by TIGR03661. Proteins with this domain tend to share several properties with the RtxA (Repeats in Toxin) protein of Vibrio cholerae, including a large size often containing tandemly repeated domains and a C-terminal signal for type 1 secretion.
Probab=59.54 E-value=83 Score=30.62 Aligned_cols=56 Identities=27% Similarity=0.378 Sum_probs=36.7
Q ss_pred EEEcccCCC---CCCCEEEEEEEEEECCCCCceeEEEEEEEEEecCCCCCCCeeecceEEEEEeCCC
Q psy1041 1021 IRVAKPLDF---EKRQVHSLVVRAKDNGSPPLYSEATLIVEVSDVNENMNAPVFSDFVYQATVKENQ 1084 (1095)
Q Consensus 1021 i~~~~~ld~---E~~~~~~l~V~A~D~g~p~ls~~~~v~I~V~dvNdn~~~P~F~~~~y~~~v~E~~ 1084 (1095)
+++.++||. +..-...|.|.|+|..+.. +...+.|+|.| | .|+..... ...|.|+.
T Consensus 70 ftL~~~lDH~~g~d~l~l~~~v~a~D~DGD~--s~~~l~VtI~D--D---~P~~~~~~-~~~V~E~~ 128 (137)
T TIGR03660 70 FTLEGPLDHAAGSDELTLNFPIIATDFDGDT--SSITLPVTIVD--D---VPTITDVD-ALTVDEDD 128 (137)
T ss_pred EEEcccccCCCCCceEEEeeeEEEEeCCCCc--cccEEEEEEEC--C---CCeecccc-ceEEeccc
Confidence 344455554 2234578899999987643 34588888877 6 68876544 47888854
No 37
>PF07495 Y_Y_Y: Y_Y_Y domain; InterPro: IPR011123 This region is mostly found at the end of the beta propellers (IPR011110 from INTERPRO) in a family of two component regulators. However they are also found tandemly repeated in Q891H4 from SWISSPROT without other signal conduction domains being present. It is named after the conserved tyrosines found in the alignment. The exact function is not known.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=54.49 E-value=1.1e+02 Score=25.12 Aligned_cols=59 Identities=24% Similarity=0.275 Sum_probs=31.9
Q ss_pred CceEEEEEEeCCCCCCCEEEeCCceEEEEcccCCCCCCCEEEEEEEEEECCCCCceeEEEEEEEE
Q psy1041 995 GGVVRYTIVSDNEADDVFSIDRLTGTIRVAKPLDFEKRQVHSLVVRAKDNGSPPLYSEATLIVEV 1059 (1095)
Q Consensus 995 n~~v~Y~i~~~~~~~~~F~Id~~tG~i~~~~~ld~E~~~~~~l~V~A~D~g~p~ls~~~~v~I~V 1059 (1095)
+-...|+|.+. ...+..+...+-.+.... | ..+.|.|.|+|.|..+..-....++.|.|
T Consensus 7 ~~~Y~Y~l~g~--d~~W~~~~~~~~~~~~~~-L---~~G~Y~l~V~a~~~~~~~~~~~~~l~i~I 65 (66)
T PF07495_consen 7 NIRYRYRLEGF--DDEWITLGSYSNSISYTN-L---PPGKYTLEVRAKDNNGKWSSDEKSLTITI 65 (66)
T ss_dssp TEEEEEEEETT--ESSEEEESSTS-EEEEES------SEEEEEEEEEEETTS-B-SS-EEEEEEE
T ss_pred ceEEEEEEECC--CCeEEECCCCcEEEEEEe-C---CCEEEEEEEEEECCCCCcCcccEEEEEEE
Confidence 44566777652 234444433221444332 2 57899999999998765333336666665
No 38
>PF03160 Calx-beta: Calx-beta domain; InterPro: IPR003644 The calx-beta motif is present as a tandem repeat in the cytoplasmic domains of Calx Na-Ca exchangers, which are used to expel calcium from cells. This motif overlaps domains used for calcium binding and regulation. The calx-beta motif is also present in the cytoplasmic tail of mammalian integrin-beta4, which mediates the bi-directional transfer of signals across the plasma membrane, as well as in some cyanobacterial proteins. This motif contains a series of beta-strands and turns that form a self-contained beta-sheet [, ].; GO: 0007154 cell communication, 0016021 integral to membrane; PDB: 3H6A_B 3FSO_A 3FQ4_B 2DPK_A 2QVM_A 3GIN_B 2QVK_A 2FWU_A 2FWS_A 3E9U_A ....
Probab=46.79 E-value=2.2e+02 Score=25.61 Aligned_cols=52 Identities=31% Similarity=0.364 Sum_probs=30.9
Q ss_pred EEEEeccCCCCCeeecCceEEEEeCCCCCCcEEEEEEEEcCCCCCCceEEEEEEeC
Q psy1041 950 RVTIDDINDNAPNFALPNYSVKVREDIPVGTVVAILSASDPDLGQGGVVRYTIVSD 1005 (1095)
Q Consensus 950 ~I~V~DvNDn~P~f~~~~y~~~v~E~~~~g~~v~~v~A~D~D~g~n~~v~Y~i~~~ 1005 (1095)
+|+|.| ||.+ .+....-...+.|+. |..-..|.-...+....-.+.|...++
T Consensus 2 tvtI~d-~d~~-~v~f~~~~~~v~E~~--~~~~v~V~~~~~~~~~~v~v~~~~~~g 53 (100)
T PF03160_consen 2 TVTILD-DDDP-TVSFSSPSYTVSEGD--GTVTVTVTRSGGSLDGPVTVNYSTVDG 53 (100)
T ss_dssp EEEEE--TTSE-EEEESSSEEEEETTS--SEEEEEEEEESS-TSSEEEEEEEEEES
T ss_pred EEEEEC-CCCC-EEEEeCCEEEEEeCC--CEEEEEEEEcccCCCcceEEEEEEeCC
Confidence 577888 6744 777666566788884 445555655554433344567776653
No 39
>PF03160 Calx-beta: Calx-beta domain; InterPro: IPR003644 The calx-beta motif is present as a tandem repeat in the cytoplasmic domains of Calx Na-Ca exchangers, which are used to expel calcium from cells. This motif overlaps domains used for calcium binding and regulation. The calx-beta motif is also present in the cytoplasmic tail of mammalian integrin-beta4, which mediates the bi-directional transfer of signals across the plasma membrane, as well as in some cyanobacterial proteins. This motif contains a series of beta-strands and turns that form a self-contained beta-sheet [, ].; GO: 0007154 cell communication, 0016021 integral to membrane; PDB: 3H6A_B 3FSO_A 3FQ4_B 2DPK_A 2QVM_A 3GIN_B 2QVK_A 2FWU_A 2FWS_A 3E9U_A ....
Probab=41.84 E-value=94 Score=28.07 Aligned_cols=51 Identities=18% Similarity=0.244 Sum_probs=30.8
Q ss_pred EEEEEeCCCCCCcccccceEEEecCCCCCCcEEEEEEEEeCCCCC--eEEEEEEeCC
Q psy1041 579 KIKLLDVNDNRPQFEKVDCLGHVPRNLPIGREIITLSAIDFDAGN--IISYRIVSGN 633 (1095)
Q Consensus 579 ~I~V~dvNDn~P~f~~~~~~~~V~E~~~~g~~v~~v~A~D~D~~~--~i~y~i~~~~ 633 (1095)
+|+|.| ||.+ .+.-..-...+.|+. |..-+.|.-...+... .+.|....|.
T Consensus 2 tvtI~d-~d~~-~v~f~~~~~~v~E~~--~~~~v~V~~~~~~~~~~v~v~~~~~~gt 54 (100)
T PF03160_consen 2 TVTILD-DDDP-TVSFSSPSYTVSEGD--GTVTVTVTRSGGSLDGPVTVNYSTVDGT 54 (100)
T ss_dssp EEEEE--TTSE-EEEESSSEEEEETTS--SEEEEEEEEESS-TSSEEEEEEEEEESS
T ss_pred EEEEEC-CCCC-EEEEeCCEEEEEeCC--CEEEEEEEEcccCCCcceEEEEEEeCCc
Confidence 577788 6655 776666667888876 4445555555444323 6778777654
No 40
>PF05895 DUF859: Siphovirus protein of unknown function (DUF859); InterPro: IPR008577 This entry is represented by Streptococcus phage 7201, Orf39. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This family consists of several uncharacterised proteins from a number of the Siphoviruses as well as some bacterial proteins from Streptococcus species. Some of the members of this family are described as putative minor structural proteins.
Probab=41.66 E-value=7.8e+02 Score=30.55 Aligned_cols=107 Identities=19% Similarity=0.259 Sum_probs=60.8
Q ss_pred ccEEEEEEEEEeCCCCCCceeeeEEEEEEEeecCCCCCcccCCccEEEEeecCCCCcE-EEEEEEeeCCC------CCCc
Q psy1041 447 KAFYTLTVSAIDQGNAGTRKQSAAKVKVNIVDTNDNDPLFDSPEMEVSINENEPAGTS-VIKVTAKDKDS------GENA 519 (1095)
Q Consensus 447 ~~~y~l~V~a~D~g~~~~~~~~~~~v~I~V~DvNDn~P~f~~~~~~~~V~E~~~~gt~-v~~v~A~D~D~------g~n~ 519 (1095)
....+++++++|..+ +.+....++|+|++-. +|.+. +++.-...-+.. .....|.=+.. -...
T Consensus 297 ~G~~Ti~atVtDSRG---r~S~~~~~tItVl~Y~--~P~ls-----fsv~R~~~~~~~~~v~~~a~Iapl~v~g~qKN~~ 366 (624)
T PF05895_consen 297 SGSATIRATVTDSRG---RTSDPKTKTITVLEYS--PPTLS-----FSVYRCGSSGNTLTVTRNAKIAPLTVNGVQKNTM 366 (624)
T ss_pred CceEEEEEEEEECCC---ccCCceEEEEEEEEcC--CCcEE-----EEEEEeCCCCcEEEEEEEEEEeEEEEcccccceE
Confidence 467899999999862 2345788999999754 56653 333333222222 22223322211 1123
Q ss_pred eEEEEEecCCCCCcEEecc--cce-----------EEEceeccccccccEEEEEEEEEEC
Q psy1041 520 YISYSIANLKPVPFEIDHF--SGV-----------IKTTQVLDYESMRREYILRVRASDW 566 (1095)
Q Consensus 520 ~i~ysi~~~~~~~F~Id~~--tG~-----------i~~~~~lD~E~~~~~~~l~V~a~D~ 566 (1095)
.++|+....+...|.+|.. .|. ..+...+|-+ ..|.+.+.++|.
T Consensus 367 ~lt~~~a~~gt~~~t~d~~~a~~~~s~~s~~~~~~~~L~g~y~~~---kSy~V~~~l~D~ 423 (624)
T PF05895_consen 367 TLTFKVAPLGTGTFTTDNGSASGTWSSISELTNSSANLGGTYDAE---KSYDVRGTLSDK 423 (624)
T ss_pred EEEEEEEEcCcceEEEEccccccceeeeeeecccceeeccccCCC---ceEEEEEEEEEE
Confidence 5777777656666776642 111 2223344544 479999999996
No 41
>PF02010 REJ: REJ domain; InterPro: IPR002859 The REJ (Receptor for Egg Jelly) domain is found in PKD1 P98161 from SWISSPROT and the sperm receptor for egg jelly Q26627 from SWISSPROT. The exact function of this domain is unknown. The domain is 600 amino acids long so is probably composed of multiple structural domains. There are six completely conserved cysteine residues that may form disulphide bridges. This region contains tandem PKD-like domains. Sequence similarity between a region of the autosomal dominant polycystic kidney disease (ADPKD) protein, polycystin-1 and a sea urchin sperm glycoprotein involved in fertilization, the receptor for egg jelly (suREJ) has been known for some time. The suREJ protein binds the glycoprotein coat of the egg (egg jelly), triggering the acrosome reaction, which transforms the sperm into a fusogenic cell. The sequence similarity and expression pattern suggests that the predicted human PKDREJ protein is a mammalian equivalent of the suREJ protein and therefore may have a central role in human fertilization [].; PDB: 2E7M_A 2YRL_A.
Probab=39.28 E-value=33 Score=40.72 Aligned_cols=32 Identities=28% Similarity=0.443 Sum_probs=0.0
Q ss_pred EEEEEEEEEeCCCCCCceeeeEEEEEEEeecCCCCCcc
Q psy1041 449 FYTLTVSAIDQGNAGTRKQSAAKVKVNIVDTNDNDPLF 486 (1095)
Q Consensus 449 ~y~l~V~a~D~g~~~~~~~~~~~v~I~V~DvNDn~P~f 486 (1095)
.|.|++++++++ +.++.+..+|.|.. ..+|..
T Consensus 162 ~y~f~ltv~k~~----r~s~s~~~~v~v~~--~~~p~v 193 (440)
T PF02010_consen 162 TYTFTLTVSKGS----RSSSSASQTVTVVS--GDPPTV 193 (440)
T ss_dssp --------------------------------------
T ss_pred eEEEEEEEEeCC----CCceeeEEEEEecc--CCCCce
Confidence 399999999986 33666777777764 335554
No 42
>cd00146 PKD polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.
Probab=33.11 E-value=1.6e+02 Score=25.11 Aligned_cols=31 Identities=16% Similarity=0.192 Sum_probs=22.6
Q ss_pred CCcCCCcEEEEEEEEeecCccccCCCCcceEEEEEEE
Q psy1041 219 LKYTDRSVHDLVVLGQDRGSVFKGGGKPSSAKLKIKV 255 (1095)
Q Consensus 219 ld~e~~~~~~l~V~A~D~g~~~~~~~~s~~~~v~I~V 255 (1095)
..|...+.|.++++|+|.. +.+.+.++.|.|
T Consensus 51 ~~y~~~G~y~v~l~v~d~~------g~~~~~~~~V~V 81 (81)
T cd00146 51 HTYTKPGTYTVTLTVTNAV------GSSSTKTTTVVV 81 (81)
T ss_pred EEcCCCcEEEEEEEEEeCC------CCEEEEEEEEEC
Confidence 4567889999999999975 345555666543
No 43
>KOG4221|consensus
Probab=30.53 E-value=1.5e+03 Score=30.41 Aligned_cols=164 Identities=17% Similarity=0.211 Sum_probs=89.3
Q ss_pred EEEEEEeCCCCCcEEEeCCceEEEEcccCCccCCCEEEEEEEEEECCCCCCceeEEEEEEEEecCCCCCcccccceEEEE
Q psy1041 786 LVFGISSGDNDSVFRIDPDSGELKVVGYLDRERTSEYTLNITVYDLGKPQKSTSKMLPITILDVNDNPPKFEKSLASFRV 865 (1095)
Q Consensus 786 v~y~i~~g~~~~~F~Id~~tG~i~~~~~LD~E~~~~y~l~V~a~D~g~p~~s~~~~v~I~V~DvNDn~P~F~~~~y~~~V 865 (1095)
+.|++.+-......+-+-.++.=.+...|-+- ..|.|-|++.-.|.-..+....+.+... ...|.|......+.-
T Consensus 859 Vr~~~~gi~~~~~~~~~~~t~ls~~v~glkpn--t~yEfav~~~~~~~r~stwsmsv~~~tl---e~~P~sPP~d~tv~p 933 (1381)
T KOG4221|consen 859 VRWSLTGIRNGTLYRYDNSTDLSYLVGGLKPN--TPYEFAVMVVKRNRRESTWSMSVENRTL---ELVPSSPPRDLTVQP 933 (1381)
T ss_pred EEEeecccccceeEEEecccccceeccCcCcC--ChhhhhhhhhhccCcCCcccceeeeeec---ccCCCCCChhceecc
Confidence 44543333333445555455544455555433 4566666665433211233345555555 457888777666666
Q ss_pred ecCCCCCeEEEEEEEEeCCCCCCceEE----EEecCCC---cceEEeCCcceEEEcccccccccCeeEEEEEEEeCCCCC
Q psy1041 866 TENALNGTVIFKVNATDLDLGDNAKVV----YSLMTDT---QDFAVDSATGSLYVSASLDRERQDLYELKIRASDCDGRN 938 (1095)
Q Consensus 866 ~En~~~gt~v~~v~A~D~D~g~n~~v~----Ysl~~~~---~~F~Id~~tG~i~~~~~LD~E~~~~y~l~V~A~D~~g~~ 938 (1095)
.| .+.+.+.+ .-+-.-.||.|+ |....++ ..|.+....|........+.+-...|-|.|+|....|..
T Consensus 934 ~e--~P~~v~v~---WqPp~e~nG~I~~Yii~Ys~~~n~~~~dWt~~t~~g~~L~~~v~~l~p~t~yffkiQAr~~kG~g 1008 (1381)
T KOG4221|consen 934 DE--KPTTVIVH---WQPPTEPNGEITEYIIYYSTDGNTPEHDWTIETTAGAELSHQVPNLDPDTGYFFKIQARNEKGPG 1008 (1381)
T ss_pred cC--CCCccccc---cCCCcCCCCceeeEEEEEecCCCCchhhceeeecccchhhhccCCCCCCCceEEEEEeeccCCCC
Confidence 66 33333322 222233466543 3333333 678888877877777777777788999999999887643
Q ss_pred CCcceeEEEEEEEEEeccCCC
Q psy1041 939 DMYTLHADALVRVTIDDINDN 959 (1095)
Q Consensus 939 ~~~~~~~~~~v~I~V~DvNDn 959 (1095)
+....-...+....+.-.||.
T Consensus 1009 p~s~~v~y~t~~~~~~~~~d~ 1029 (1381)
T KOG4221|consen 1009 PFSSPVLYETSKAEIVMINDQ 1029 (1381)
T ss_pred ccccceeeeccccccccccch
Confidence 222222233344444455654
No 44
>PF02010 REJ: REJ domain; InterPro: IPR002859 The REJ (Receptor for Egg Jelly) domain is found in PKD1 P98161 from SWISSPROT and the sperm receptor for egg jelly Q26627 from SWISSPROT. The exact function of this domain is unknown. The domain is 600 amino acids long so is probably composed of multiple structural domains. There are six completely conserved cysteine residues that may form disulphide bridges. This region contains tandem PKD-like domains. Sequence similarity between a region of the autosomal dominant polycystic kidney disease (ADPKD) protein, polycystin-1 and a sea urchin sperm glycoprotein involved in fertilization, the receptor for egg jelly (suREJ) has been known for some time. The suREJ protein binds the glycoprotein coat of the egg (egg jelly), triggering the acrosome reaction, which transforms the sperm into a fusogenic cell. The sequence similarity and expression pattern suggests that the predicted human PKDREJ protein is a mammalian equivalent of the suREJ protein and therefore may have a central role in human fertilization [].; PDB: 2E7M_A 2YRL_A.
Probab=29.81 E-value=62 Score=38.37 Aligned_cols=219 Identities=18% Similarity=0.220 Sum_probs=18.4
Q ss_pred CCEEEEEEEEEECCCCCCceeEEEEEEEEecCCCCCcccccceEEEEecCCCCCeEEEE-EEEEeCCCCC-CceEEEE--
Q psy1041 819 TSEYTLNITVYDLGKPQKSTSKMLPITILDVNDNPPKFEKSLASFRVTENALNGTVIFK-VNATDLDLGD-NAKVVYS-- 894 (1095)
Q Consensus 819 ~~~y~l~V~a~D~g~p~~s~~~~v~I~V~DvNDn~P~F~~~~y~~~V~En~~~gt~v~~-v~A~D~D~g~-n~~v~Ys-- 894 (1095)
.+.|.+.++++-...+....+..+.|.|.-. +-.|....... ..+.-+ ..+.+. -.-.|+|... +..++|+
T Consensus 49 ~G~y~~~~~Vt~~~~~~~~~~~~~~v~V~~s-~l~~~I~gG~~-~~~~~~---~~i~ldgs~S~Dpd~~~~~~~l~y~W~ 123 (440)
T PF02010_consen 49 PGDYTFTLTVTASSNPGLSSTDSVTVTVEPS-PLVAVIKGGSS-RTVGYN---SDITLDGSQSYDPDGPPGDSGLTYSWS 123 (440)
T ss_dssp SCEEEEEEEEE--BCTTEEEEEEEEEEEE---------------------------------------------------
T ss_pred CCCEEEEEEEEEECCCCceEEEEEEEEEeec-cceeEEcCCcc-ceeecC---ceEEEeeEEEecccccccCCceEEEEE
Confidence 3467766666623344567778888888752 12333322111 111111 112211 1235777542 1234443
Q ss_pred ecCCCcc----------eEEeCCcceEEEc-ccccccccCeeEEEEEEEeCCCCCCCcceeEEEEEEEEEeccCCCCCee
Q psy1041 895 LMTDTQD----------FAVDSATGSLYVS-ASLDRERQDLYELKIRASDCDGRNDMYTLHADALVRVTIDDINDNAPNF 963 (1095)
Q Consensus 895 l~~~~~~----------F~Id~~tG~i~~~-~~LD~E~~~~y~l~V~A~D~~g~~~~~~~~~~~~v~I~V~DvNDn~P~f 963 (1095)
....... .......+.+.+. ..| +....|.|+++++++++ .++.+...|.|.. ..+|..
T Consensus 124 C~~~~~~~~C~~~~~~~~~~~~~~~~l~i~~~~l--~~~~~y~f~ltv~k~~r------~s~s~~~~v~v~~--~~~p~v 193 (440)
T PF02010_consen 124 CTDLSSNSACSTPSTNITLLNSSSSSLTIPASTL--SPGSTYTFTLTVSKGSR------SSSSASQTVTVVS--GDPPTV 193 (440)
T ss_dssp --------------------------------------------------------------------------------
T ss_pred EeCCceeccccccccccccCCCCCEEEEEEhHHc--CCCceEEEEEEEEeCCC------CceeeEEEEEecc--CCCCce
Confidence 2222111 1122333444432 233 23334999999998872 3566666676654 335654
Q ss_pred ecCceE---EEEeCCCCCCcEEEEEEEEcCCCCCCceEEEEEEe-----CCCCCCCEE---EeCCceE----EEEcccCC
Q psy1041 964 ALPNYS---VKVREDIPVGTVVAILSASDPDLGQGGVVRYTIVS-----DNEADDVFS---IDRLTGT----IRVAKPLD 1028 (1095)
Q Consensus 964 ~~~~y~---~~v~E~~~~g~~v~~v~A~D~D~g~n~~v~Y~i~~-----~~~~~~~F~---Id~~tG~----i~~~~~ld 1028 (1095)
....-. ..|..+. .+..+..+.+.+.. ...+.|++.- .......|. -...+|. |.+. +--
T Consensus 194 ~I~~~~n~~~~vn~~~---~l~L~~~~~~~~~~-~~~~~y~Wsl~~~~~~~~~~~~~~~~~~~~~~~~~~~~Lvi~-~~~ 268 (440)
T PF02010_consen 194 SISCVSNCKGKVNPSE---RLVLQASCSSCDSD-SSNVSYSWSLFSLDGVSDSNSELPDWSSMTTTGSSSSNLVID-PGV 268 (440)
T ss_dssp --------------------------------------------------------------------------------
T ss_pred eEccccccccccCCCC---CEEEEEEEeCCCCc-CCCEEEEEEEeecccccccccccccccccccccccccccccc-ccc
Confidence 332111 0122221 23344455543321 3345555543 001111111 1111221 2222 112
Q ss_pred CCCCCEEEEEEEEEECCCCCceeEEEEEEEE
Q psy1041 1029 FEKRQVHSLVVRAKDNGSPPLYSEATLIVEV 1059 (1095)
Q Consensus 1029 ~E~~~~~~l~V~A~D~g~p~ls~~~~v~I~V 1059 (1095)
.+....|.|++.|+|.+... ..+.+.+.+
T Consensus 269 l~~g~~Y~~~l~v~~~~~~~--~~a~~~~~~ 297 (440)
T PF02010_consen 269 LEPGSTYTFRLTVTDSSGSS--GSASISFTV 297 (440)
T ss_dssp -------------------------------
T ss_pred cccccccccccccccccccc--ccccccccc
Confidence 35678999999999986531 144455544
No 45
>KOG4221|consensus
Probab=28.87 E-value=1.5e+03 Score=30.19 Aligned_cols=70 Identities=20% Similarity=0.313 Sum_probs=46.0
Q ss_pred CCeEEEEEee--cCCCCCCeEEeeeccCCCCcCceEEEEECccccccCCCCceEEEEEEEECCCCCeeeEEEEEEEeccC
Q psy1041 295 HGEIASLDIV--DGDPDGHFRIVPTKIDPGTKKKEYNIVVLKLLDREIAPLGYNLTLRAVDKGTPPRETYKATQVHLVDL 372 (1095)
Q Consensus 295 n~~v~~~~i~--~g~~~~~F~i~~~~~~~~~~~g~~~i~~~~~lD~E~~~~~y~l~v~a~D~g~p~~~s~~~~~i~v~d~ 372 (1095)
|+.|+.|.+. .++....+.++.+ ..++.|. -|+ ... .|.+.|.|....++..++.....++..|+
T Consensus 548 n~~I~~yk~~ys~~~~~~~~~~~~n-------~~e~ti~---gL~--k~T-eY~~~vvA~N~~G~g~sS~~i~V~Tlsd~ 614 (1381)
T KOG4221|consen 548 NGPITGYKLFYSEDDTGKELRVENN-------ATEYTIN---GLE--KYT-EYSIRVVAYNSAGSGVSSADITVRTLSDV 614 (1381)
T ss_pred CCCceEEEEEEEcCCCCceEEEecC-------ccEEEee---cCC--Ccc-ceEEEEEEecCCCCCCCCCceEEEeccCC
Confidence 5556544332 2345567888766 3455554 233 222 49999999999888777776667788898
Q ss_pred CCCCC
Q psy1041 373 NDNKP 377 (1095)
Q Consensus 373 Nd~~P 377 (1095)
-+.||
T Consensus 615 PsaPP 619 (1381)
T KOG4221|consen 615 PSAPP 619 (1381)
T ss_pred CCCCC
Confidence 88666
No 46
>PF12245 Big_3_2: Bacterial Ig-like domain (group 3); InterPro: IPR022038 This family of proteins is found in bacteria. They have two conserved sequence motifs: AGN and GMT.
Probab=27.02 E-value=1.8e+02 Score=23.76 Aligned_cols=31 Identities=23% Similarity=0.366 Sum_probs=21.5
Q ss_pred ccCeeEEEEEEEeCCCCCCCcceeEEEEEEEEEeccC
Q psy1041 921 RQDLYELKIRASDCDGRNDMYTLHADALVRVTIDDIN 957 (1095)
Q Consensus 921 ~~~~y~l~V~A~D~~g~~~~~~~~~~~~v~I~V~DvN 957 (1095)
....|.|.++|+|..|+ .......+.+.|..
T Consensus 21 ~dg~yt~~v~a~D~AGN------~~~~~~~~~i~d~~ 51 (60)
T PF12245_consen 21 ADGEYTLTVTATDKAGN------TSSSTTQIVIVDNT 51 (60)
T ss_pred CCccEEEEEEEEECCCC------EEEeeeEEEEEcCC
Confidence 36789999999999973 34455555555544
No 47
>cd00146 PKD polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.
Probab=21.03 E-value=5.2e+02 Score=21.90 Aligned_cols=63 Identities=21% Similarity=0.284 Sum_probs=33.8
Q ss_pred EEEEEEeCCCCCeEEEEEEeCCCCCcEEEeCCCcEEEEeeccCcccccEEEEEEEEecCCCccceEEEEE
Q psy1041 612 ITLSAIDFDAGNIISYRIVSGNEDGCFALDITSGVLSIACDLTDVRVNEREINVTATDSAHFSDVVRIRI 681 (1095)
Q Consensus 612 ~~v~A~D~D~~~~i~y~i~~~~~~~~F~Id~~tG~i~~~~~ld~~~~~~~~l~V~atD~~~~s~~~~v~I 681 (1095)
..+.+.+.+.+..+.|...-++.. .....+ ......+.....|.++++++|..+.+......|
T Consensus 17 v~~~~~~~~~~~~~~~~W~fgdg~----~~~~~~---~~~~~~y~~~G~y~v~l~v~d~~g~~~~~~~~V 79 (81)
T cd00146 17 VTFSASDSSGGSIVSYKWDFGDGE----VSSSGE---PTVTHTYTKPGTYTVTLTVTNAVGSSSTKTTTV 79 (81)
T ss_pred EEEEEEeCCCCCEEEEEEEeCCCC----ccccCC---CceEEEcCCCcEEEEEEEEEeCCCCEEEEEEEE
Confidence 455566655445566655433321 100110 112234567889999999999865554444444
No 48
>PF02494 HYR: HYR domain; InterPro: IPR003410 This domain is known as the HYR (Hyalin Repeat) domain, after the protein hyalin that is composed exclusively of this repeat. This domain probably corresponds to a new superfamily in the immunoglobulin fold. The function of this domain is uncertain it may be involved in cell adhesion. In the Sushi repeat-containing protein (SrpX), this domain is found between two sushi repeats.
Probab=20.02 E-value=5.6e+02 Score=21.95 Aligned_cols=25 Identities=16% Similarity=0.198 Sum_probs=20.7
Q ss_pred CEEEEEEEEEECCCCCceeEEEEEEEE
Q psy1041 1033 QVHSLVVRAKDNGSPPLYSEATLIVEV 1059 (1095)
Q Consensus 1033 ~~~~l~V~A~D~g~p~ls~~~~v~I~V 1059 (1095)
+.|.++..|+|..+ .++++.+.|+|
T Consensus 57 G~t~V~ytA~D~~G--N~a~C~f~V~V 81 (81)
T PF02494_consen 57 GTTTVTYTATDAAG--NSATCSFTVTV 81 (81)
T ss_pred ceEEEEEEEEECCC--CEEEEEEEEEC
Confidence 46899999999865 47899998876
Done!