Query psy585
Match_columns 111
No_of_seqs 237 out of 1299
Neff 8.6
Searched_HMMs 46136
Date Fri Aug 16 19:39:54 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy585.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/585hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG4289|consensus 99.9 1E-21 2.2E-26 159.2 12.5 110 1-110 749-912 (2531)
2 KOG4289|consensus 99.8 6.5E-21 1.4E-25 154.6 9.0 110 1-110 235-397 (2531)
3 KOG1219|consensus 99.8 5.7E-20 1.2E-24 153.2 11.2 111 1-111 2145-2305(4289)
4 KOG1219|consensus 99.8 5.3E-19 1.1E-23 147.6 11.1 110 1-110 2648-2804(4289)
5 cd00031 CA Cadherin repeat dom 99.3 4.2E-11 9.1E-16 80.7 10.8 47 1-47 68-117 (199)
6 cd00031 CA Cadherin repeat dom 99.3 1E-10 2.2E-15 78.9 12.1 105 4-110 18-125 (199)
7 KOG1834|consensus 98.8 4.2E-08 9.1E-13 76.7 9.5 78 1-78 109-243 (952)
8 smart00112 CA Cadherin repeats 98.6 5.4E-07 1.2E-11 52.6 7.9 66 18-85 11-79 (79)
9 KOG1834|consensus 98.5 3.9E-07 8.5E-12 71.4 7.7 53 51-103 104-165 (952)
10 smart00112 CA Cadherin repeats 98.5 3.5E-07 7.6E-12 53.4 4.7 30 1-30 47-79 (79)
11 PF00028 Cadherin: Cadherin do 97.6 0.00023 4.9E-09 42.5 5.4 28 51-78 62-93 (93)
12 TIGR01965 VCBS_repeat VCBS rep 92.6 0.74 1.6E-05 28.3 5.9 42 53-99 56-97 (99)
13 TIGR03660 T1SS_rpt_143 T1SS-14 92.0 2.3 5E-05 27.6 8.2 66 35-104 66-133 (137)
14 smart00736 CADG Dystroglycan-t 89.2 1.3 2.8E-05 26.7 4.7 30 53-82 66-96 (97)
15 PF12245 Big_3_2: Bacterial Ig 84.3 3.9 8.5E-05 22.5 4.5 26 1-26 25-51 (60)
16 PF13750 Big_3_3: Bacterial Ig 82.7 11 0.00025 24.9 9.7 27 52-78 121-148 (158)
17 PF07495 Y_Y_Y: Y_Y_Y domain; 80.1 7 0.00015 21.2 4.6 26 53-78 38-66 (66)
18 TIGR03660 T1SS_rpt_143 T1SS-14 78.6 9.2 0.0002 24.9 5.4 40 2-45 88-128 (137)
19 PF08758 Cadherin_pro: Cadheri 75.0 11 0.00025 22.5 4.7 17 29-45 3-19 (90)
20 smart00089 PKD Repeats in poly 67.2 17 0.00036 20.4 4.2 26 52-77 53-78 (79)
21 TIGR01965 VCBS_repeat VCBS rep 65.7 18 0.00039 22.2 4.2 38 2-44 60-97 (99)
22 cd00146 PKD polycystic kidney 60.9 21 0.00045 20.1 3.8 26 52-77 55-81 (81)
23 PF13860 FlgD_ig: FlgD Ig-like 59.1 33 0.00071 19.7 5.7 51 15-66 24-81 (81)
24 PF10365 DUF2436: Domain of un 56.2 24 0.00052 23.2 3.7 23 26-48 66-88 (161)
25 PF13754 Big_3_4: Bacterial Ig 53.7 20 0.00044 19.1 2.7 10 1-10 26-35 (54)
26 PF03160 Calx-beta: Calx-beta 52.8 24 0.00051 20.9 3.2 25 74-100 2-26 (100)
27 PF02494 HYR: HYR domain; Int 52.1 43 0.00093 19.0 5.4 32 46-77 49-81 (81)
28 TIGR00864 PCC polycystin catio 47.1 2.9E+02 0.0062 27.1 10.1 28 52-79 1566-1593(2740)
29 TIGR00845 caca sodium/calcium 46.6 1.9E+02 0.0042 24.9 10.5 28 72-100 516-543 (928)
30 PF07145 PAM2: Ataxin-2 C-term 42.9 11 0.00024 15.8 0.4 12 80-91 6-17 (18)
31 TIGR00845 caca sodium/calcium 42.2 1.6E+02 0.0035 25.4 7.3 29 16-45 515-543 (928)
32 cd05762 Ig8_MLCK Eighth immuno 39.2 82 0.0018 18.6 4.5 29 54-85 69-97 (98)
33 cd01757 PLAT_RAB6IP1 PLAT/LH2 37.2 1.1E+02 0.0023 19.2 6.8 42 55-96 2-47 (114)
34 PF08329 ChitinaseA_N: Chitina 35.5 84 0.0018 20.4 3.8 30 53-82 82-112 (133)
35 PF15418 DUF4625: Domain of un 31.7 1.4E+02 0.0031 19.1 4.6 25 53-77 106-131 (132)
36 PF09100 Qn_am_d_aIV: Quinohem 31.2 1.2E+02 0.0025 19.7 3.9 32 54-86 100-133 (133)
37 KOG4221|consensus 31.0 1.7E+02 0.0037 26.2 5.8 59 28-86 542-620 (1381)
38 PF05688 DUF824: Salmonella re 27.7 88 0.0019 16.5 2.5 16 69-84 13-28 (47)
39 PF05345 He_PIG: Putative Ig d 26.0 91 0.002 16.2 2.4 13 54-66 36-48 (49)
40 cd02848 Chitinase_N_term Chiti 25.9 1.1E+02 0.0024 19.0 3.1 27 53-79 79-106 (106)
41 COG1470 Predicted membrane pro 24.0 3.8E+02 0.0082 21.4 7.6 10 1-10 352-361 (513)
42 PF02369 Big_1: Bacterial Ig-l 20.7 89 0.0019 18.7 1.9 17 68-84 22-38 (100)
43 PF00801 PKD: PKD domain; Int 20.1 1.6E+02 0.0036 15.8 2.9 16 53-68 51-66 (69)
No 1
>KOG4289|consensus
Probab=99.87 E-value=1e-21 Score=159.18 Aligned_cols=110 Identities=32% Similarity=0.427 Sum_probs=100.3
Q ss_pred CEEEEEEEeCC---CccEEEEEEEEeeCCCCCCeecCCcEEEEEEccCCCCCC---------------------------
Q psy585 1 YELKLAASDNL---KENYTTVVIHVKDVNDNPPVFERPTYRTQITEEDDRTLP--------------------------- 50 (111)
Q Consensus 1 y~l~v~A~D~~---~~~~~~v~I~V~D~NDn~P~F~~~~y~~~v~E~~~~~~~--------------------------- 50 (111)
|++.+.|+|.| +.+++++.|.|.|+|||+|+|..+.|.++|.|+.++++.
T Consensus 749 ytl~itA~D~~~pq~adtttveV~v~diNDnaPqf~assyt~sV~Ed~Pv~TsvlQVSatDaD~g~Ng~v~y~~qg~~d~ 828 (2531)
T KOG4289|consen 749 YTLAITARDNGIPQKADTTTVEVLVNDINDNAPQFLASSYTGSVFEDAPVFTSVLQVSATDADSGPNGRVYYTFQGGDDG 828 (2531)
T ss_pred eEeeeeecCCCCCCcCccEEEEEEeecccccCcccchhhceeEeecCCCCcceEEEEEEeccCCCCCceEEEEecCCCCC
Confidence 78999999986 678899999999999999999999999999999776542
Q ss_pred ---------------------ceEeEEEEEEEEecCCcc---ceEEEEEEEeecCCCCCEEecCcceEEEECCCCCCccE
Q psy585 51 ---------------------KRVLQYELTLVASDSLNE---NKTTVVIHINDVNDMPPVFNTSLYPAIMEEELPGPYPH 106 (111)
Q Consensus 51 ---------------------e~~~~y~l~v~a~d~~~~---~~~~v~I~V~D~Ndn~P~F~~~~y~~~v~E~~~~~~~~ 106 (111)
|....|.|.+.|.|.|.+ +.+.|+|+|+|+|||||+|.+.+|...|.||.|.|+.+
T Consensus 829 p~~F~IEptSGviRtl~rLdRE~~avy~L~a~avDrg~p~ls~~~eItvtvldvNDnaPvfe~~e~e~~I~enspvgs~v 908 (2531)
T KOG4289|consen 829 PGDFYIEPTSGVIRTLRRLDRENVAVYVLAAYAVDRGNPPLSAPVEITVTVLDVNDNAPVFEQDELELFIEENSPVGSVV 908 (2531)
T ss_pred CCceEEccCcceeehhhhhcchheeEEEEEEEEeeCCCCCcCCceEEEEEEEecCCCCCCCCCcceeeEEeecCccceee
Confidence 667899999999999873 77899999999999999999999999999999999999
Q ss_pred EEEE
Q psy585 107 SLLK 110 (111)
Q Consensus 107 ~~iq 110 (111)
+.|.
T Consensus 909 a~i~ 912 (2531)
T KOG4289|consen 909 ALIT 912 (2531)
T ss_pred EEEE
Confidence 8874
No 2
>KOG4289|consensus
Probab=99.84 E-value=6.5e-21 Score=154.62 Aligned_cols=110 Identities=29% Similarity=0.413 Sum_probs=96.8
Q ss_pred CEEEEEEEeCC---CccEEEEEEEEeeCCCCCCeecCCcEEEEEEccCCCCCC---------------------------
Q psy585 1 YELKLAASDNL---KENYTTVVIHVKDVNDNPPVFERPTYRTQITEEDDRTLP--------------------------- 50 (111)
Q Consensus 1 y~l~v~A~D~~---~~~~~~v~I~V~D~NDn~P~F~~~~y~~~v~E~~~~~~~--------------------------- 50 (111)
+.|.|+|.|.+ .++++++.|.|.|.|||.|.|.+++|..++.||.+.|++
T Consensus 235 HvlrVtA~d~~~P~~SAtttv~V~V~D~nDhsPvFEq~~Y~e~lREn~evGy~vLtvrAtD~Dsp~Nani~Yrl~eg~~~ 314 (2531)
T KOG4289|consen 235 HVLRVTAQDHGDPRRSATTTVTVLVLDTNDHSPVFEQDEYREELRENLEVGYEVLTVRATDGDSPPNANIRYRLLEGNAK 314 (2531)
T ss_pred eEEEEEeeecCCCcccceeEEEEEEeecCCCCcccchhHHHHHHhhccccCceEEEEEeccCCCCCCCceEEEecCCCcc
Confidence 46899999985 799999999999999999999999999999998654432
Q ss_pred --------------------ceEeEEEEEEEEecCCc---cceEEEEEEEeecCCCCCEEecCcceEEEECCCCCCccEE
Q psy585 51 --------------------KRVLQYELTLVASDSLN---ENKTTVVIHINDVNDMPPVFNTSLYPAIMEEELPGPYPHS 107 (111)
Q Consensus 51 --------------------e~~~~y~l~v~a~d~~~---~~~~~v~I~V~D~Ndn~P~F~~~~y~~~v~E~~~~~~~~~ 107 (111)
|....|.|.|+|+|+|. +.++.|.|+|.|+|||+|+|....|.+.|.|+..+++.+.
T Consensus 315 ~~f~in~rSGvI~T~a~lDRE~~~~y~L~VeAsDqG~~pgp~Ta~V~itV~D~NDNaPqFse~~Yvvqv~Edvt~~avvl 394 (2531)
T KOG4289|consen 315 NVFEINPRSGVISTRAPLDREELESYQLDVEASDQGRPPGPRTAMVEITVEDENDNAPQFSEKRYVVQVREDVTPPAVVL 394 (2531)
T ss_pred ceeEEcCccceeeccCccCHHhhhheEEEEEeccCCCCCCCceEEEEEEEEecCCCCccccccceEEEecccCCCCceEE
Confidence 55678999999999986 5899999999999999999999999999999988877665
Q ss_pred EEE
Q psy585 108 LLK 110 (111)
Q Consensus 108 ~iq 110 (111)
-++
T Consensus 395 rV~ 397 (2531)
T KOG4289|consen 395 RVT 397 (2531)
T ss_pred EEE
Confidence 543
No 3
>KOG1219|consensus
Probab=99.83 E-value=5.7e-20 Score=153.16 Aligned_cols=111 Identities=29% Similarity=0.427 Sum_probs=100.8
Q ss_pred CEEEEEEEeC--CCccEEEEEEEEeeCCCCCCeecCCcEEEEEEccCCCCCC----------------------------
Q psy585 1 YELKLAASDN--LKENYTTVVIHVKDVNDNPPVFERPTYRTQITEEDDRTLP---------------------------- 50 (111)
Q Consensus 1 y~l~v~A~D~--~~~~~~~v~I~V~D~NDn~P~F~~~~y~~~v~E~~~~~~~---------------------------- 50 (111)
|.|.|+|+|. ++.+.+.|.|.|.|+|||+|.|.+..|.++++|++..|++
T Consensus 2145 h~l~ieAtd~~~p~~Aea~VeIiV~dIndn~PvFeqlsYt~sisE~s~igt~viqilATdsDsn~~isYsl~g~s~~sk~ 2224 (4289)
T KOG1219|consen 2145 HELLIEATDAGIPLSAEAKVEIIVGDINDNPPVFEQLSYTISISENSKIGTKVIQILATDSDSNREISYSLEGNSEISKP 2224 (4289)
T ss_pred eEEEEEEeccCCCcceeeEEEEEecccCCCCchhheeeEEEEccCCCccCceEEEEEeccCCCCCceEEEeecCCccccc
Confidence 7899999996 4789999999999999999999999999999999765543
Q ss_pred ------------------ceEeEEEEEEEEecCCcc--ceEEEEEEEeecCCCCCEEecCcceEEEECCCCCCccEEEEE
Q psy585 51 ------------------KRVLQYELTLVASDSLNE--NKTTVVIHINDVNDMPPVFNTSLYPAIMEEELPGPYPHSLLK 110 (111)
Q Consensus 51 ------------------e~~~~y~l~v~a~d~~~~--~~~~v~I~V~D~Ndn~P~F~~~~y~~~v~E~~~~~~~~~~iq 110 (111)
|...+|.+.+.|+|+|.+ +.+.|.|+|.|+|||+|.|.++.|.+.|+|.++-|..+.+++
T Consensus 2225 f~In~sTG~it~~g~ldyE~~q~f~~fvratdggk~lSseviv~V~VeD~Ndn~Pef~q~~~ea~vsd~a~~g~fit~v~ 2304 (4289)
T KOG1219|consen 2225 FRINVSTGWITVAGKLDYEENQEFRFFVRATDGGKPLSSEVIVEVHVEDFNDNPPEFNQRNYEAFVSDPARSGHFITVVN 2304 (4289)
T ss_pred eEEecccceEEEeeecChhhcceEEEEEEEccCCCcccccEEEEEEehhcCCCCchhccccceeecCCCccceeEEEEEE
Confidence 667899999999999986 558899999999999999999999999999999999888876
Q ss_pred C
Q psy585 111 L 111 (111)
Q Consensus 111 v 111 (111)
.
T Consensus 2305 a 2305 (4289)
T KOG1219|consen 2305 A 2305 (4289)
T ss_pred e
Confidence 3
No 4
>KOG1219|consensus
Probab=99.80 E-value=5.3e-19 Score=147.58 Aligned_cols=110 Identities=35% Similarity=0.533 Sum_probs=100.0
Q ss_pred CEEEEEEEeCCC-ccEEEEEEEEeeCCCCCCeecCCcEEEEEEccCCCCCC-----------------------------
Q psy585 1 YELKLAASDNLK-ENYTTVVIHVKDVNDNPPVFERPTYRTQITEEDDRTLP----------------------------- 50 (111)
Q Consensus 1 y~l~v~A~D~~~-~~~~~v~I~V~D~NDn~P~F~~~~y~~~v~E~~~~~~~----------------------------- 50 (111)
|++.|.|++++. -+.+.|.|.|.|+|||+|.|..+.|.+.+.||.+.|+.
T Consensus 2648 yqi~v~a~~~~~vva~tsv~vqVkDvNDNaPvFe~d~y~f~i~En~pvGtsV~qf~AsD~Ds~~nGqirysl~~~v~yF~ 2727 (4289)
T KOG1219|consen 2648 YQIKVKATCGQWVVAETSVFVQVKDVNDNAPVFEKDPYLFIIEENSPVGTSVIQFHASDMDSGNNGQIRYSLTSPVPYFA 2727 (4289)
T ss_pred EEEEEEeecCCceEEEEEEEEEeecccCCCccccCCceeEEEeccCCCCceEEEEEeeccCCCCCceEEEEEcCCcceEE
Confidence 789999999875 78899999999999999999999999999999765542
Q ss_pred ----------------ceEeEEEEEEEEecCCcc-ceEEEEEEEeecCCCCCEEecCcceEEEECCCCCCccEEEEE
Q psy585 51 ----------------KRVLQYELTLVASDSLNE-NKTTVVIHINDVNDMPPVFNTSLYPAIMEEELPGPYPHSLLK 110 (111)
Q Consensus 51 ----------------e~~~~y~l~v~a~d~~~~-~~~~v~I~V~D~Ndn~P~F~~~~y~~~v~E~~~~~~~~~~iq 110 (111)
|++..|.|.+.|+|.|.+ +.++|.++|.|+||+||+|....|.+++.|+.|+|..+..+.
T Consensus 2728 In~etGwlTt~~eld~ek~d~y~lkv~AtDhG~~ssq~~v~v~vtDvndspprf~~eiy~gtvv~d~p~~~~ia~~s 2804 (4289)
T KOG1219|consen 2728 INPETGWLTTLFELDLEKQDLYSLKVVATDHGVPSSQATVLVHVTDVNDSPPRFQREIYEGTVVEDVPGGKIIAGLS 2804 (4289)
T ss_pred EcCCCCeeeehhhhccccCCceEEEEEEecCCcccccceEEEEEEecCCCcchhhhHhhccceeccCCCCceeeeeE
Confidence 678899999999999984 789999999999999999999999999999999998887664
No 5
>cd00031 CA Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here
Probab=99.31 E-value=4.2e-11 Score=80.74 Aligned_cols=47 Identities=43% Similarity=0.681 Sum_probs=41.4
Q ss_pred CEEEEEEEeCC-C--ccEEEEEEEEeeCCCCCCeecCCcEEEEEEccCCC
Q psy585 1 YELKLAASDNL-K--ENYTTVVIHVKDVNDNPPVFERPTYRTQITEEDDR 47 (111)
Q Consensus 1 y~l~v~A~D~~-~--~~~~~v~I~V~D~NDn~P~F~~~~y~~~v~E~~~~ 47 (111)
|.|.|+|+|.| . ++...+.|.|.|+|||+|.|....|.+.|.|+.+.
T Consensus 68 ~~l~v~a~D~g~~~~~~~~~v~I~V~d~Nd~~P~~~~~~~~~~v~e~~~~ 117 (199)
T cd00031 68 YTLTVVASDGGGPPLSSTATVTVTVLDVNDNPPVFEQSSYEASVPENAPP 117 (199)
T ss_pred EEEEEEEEECCcCcceeEEEEEEEEccCCCCCCcccccceEEEEeCCCCC
Confidence 78999999954 3 38899999999999999999988999999998543
No 6
>cd00031 CA Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here
Probab=99.30 E-value=1e-10 Score=78.90 Aligned_cols=105 Identities=28% Similarity=0.332 Sum_probs=72.2
Q ss_pred EEEEEeCCCccEEEEEEEEeeCCCCCCeecCCcEEEEEEccCCCCCCceEeEEEEEEEEecCCcc---ceEEEEEEEeec
Q psy585 4 KLAASDNLKENYTTVVIHVKDVNDNPPVFERPTYRTQITEEDDRTLPKRVLQYELTLVASDSLNE---NKTTVVIHINDV 80 (111)
Q Consensus 4 ~v~A~D~~~~~~~~v~I~V~D~NDn~P~F~~~~y~~~v~E~~~~~~~e~~~~y~l~v~a~d~~~~---~~~~v~I~V~D~ 80 (111)
.+.|.|.-......+...+..-++. ..|.-....+.+.-....+ .|....|.|.|.|+|.|.+ ++..+.|.|.|+
T Consensus 18 ~~~a~D~D~~~~~~~~y~i~~~~~~-~~F~i~~~tG~l~~~~~lD-~e~~~~~~l~v~a~D~g~~~~~~~~~v~I~V~d~ 95 (199)
T cd00031 18 TVSATDPDSGENGRVTYSILGGNED-GLFSIDPNTGVITTTKPLD-REEQSEYTLTVVASDGGGPPLSSTATVTVTVLDV 95 (199)
T ss_pred EEEEECCCCCCCceEEEEEeCCCCc-ccEEEeCCCCEEEECCCCC-CcCCceEEEEEEEEECCcCcceeEEEEEEEEccC
Confidence 4557765322134455555555443 4554443333333222111 2555899999999996653 678999999999
Q ss_pred CCCCCEEecCcceEEEECCCCCCccEEEEE
Q psy585 81 NDMPPVFNTSLYPAIMEEELPGPYPHSLLK 110 (111)
Q Consensus 81 Ndn~P~F~~~~y~~~v~E~~~~~~~~~~iq 110 (111)
|||+|.|....|.+.+.|+.++|+.+..++
T Consensus 96 Nd~~P~~~~~~~~~~v~e~~~~~~~i~~~~ 125 (199)
T cd00031 96 NDNPPVFEQSSYEASVPENAPPGTVVGTVT 125 (199)
T ss_pred CCCCCcccccceEEEEeCCCCCCCEEEEEE
Confidence 999999999999999999999988877664
No 7
>KOG1834|consensus
Probab=98.81 E-value=4.2e-08 Score=76.69 Aligned_cols=78 Identities=37% Similarity=0.596 Sum_probs=61.8
Q ss_pred CEEEEEEEeCC---------CccEEEEEEEEeeCCCCCCeecCCcEEEEEEccC-----------C--------------
Q psy585 1 YELKLAASDNL---------KENYTTVVIHVKDVNDNPPVFERPTYRTQITEED-----------D-------------- 46 (111)
Q Consensus 1 y~l~v~A~D~~---------~~~~~~v~I~V~D~NDn~P~F~~~~y~~~v~E~~-----------~-------------- 46 (111)
|+++|+|.|.| .+..++|.|+|.|+|..+|.|....|.+.|.|.. |
T Consensus 109 ytf~iQAydCg~gpdgtn~kKShkatvhIrVkDvNe~AP~f~ep~Yka~V~EGK~yd~il~veAiD~DCspq~sqIC~YE 188 (952)
T KOG1834|consen 109 YTFTIQAYDCGNGPDGTNTKKSHKATVHIRVKDVNEFAPVFKEPWYKAHVTEGKVYDSILRVEAIDKDCSPQYSQICEYE 188 (952)
T ss_pred ceEEEEEEecCCCCCccccccccceEEEEEeccccccCchhcccceeeEEecceeeeeeEEEEeecCCCCCcccceeEEE
Confidence 89999999953 4677899999999999999999999999999962 0
Q ss_pred ---CCCC-----------------ceEeEEEEEEEEecCCcc---ceEEEEEEEe
Q psy585 47 ---RTLP-----------------KRVLQYELTLVASDSLNE---NKTTVVIHIN 78 (111)
Q Consensus 47 ---~~~~-----------------e~~~~y~l~v~a~d~~~~---~~~~v~I~V~ 78 (111)
++.| .+...|.|+|.|.|.|.. +.+.|+|+|.
T Consensus 189 I~t~d~PFaIdn~G~irnTekLny~ke~~Y~ltVtAyDCg~kraa~d~lV~v~Vk 243 (952)
T KOG1834|consen 189 ITTPDVPFAIDNDGNIRNTEKLNYTKEHQYKLTVTAYDCGKKRAASDSLVTVHVK 243 (952)
T ss_pred ecCCCCceEEcCCCccccccccccccceeEEEEEEEEecccccccCcceEEEEec
Confidence 1111 245799999999999873 3356777763
No 8
>smart00112 CA Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
Probab=98.60 E-value=5.4e-07 Score=52.55 Aligned_cols=66 Identities=26% Similarity=0.319 Sum_probs=43.9
Q ss_pred EEEEEeeCCCCCCeecCCcEEEEEEccCCCCCCceEeEEEEEEEEecCCc---cceEEEEEEEeecCCCCC
Q psy585 18 VVIHVKDVNDNPPVFERPTYRTQITEEDDRTLPKRVLQYELTLVASDSLN---ENKTTVVIHINDVNDMPP 85 (111)
Q Consensus 18 v~I~V~D~NDn~P~F~~~~y~~~v~E~~~~~~~e~~~~y~l~v~a~d~~~---~~~~~v~I~V~D~Ndn~P 85 (111)
+.-.+..-+++ ..|.-..-.+.+.-... ---|....|.|.|.|+|.|. .+.++|.|+|.|+|||+|
T Consensus 11 i~Y~i~~~~~~-~~F~i~~~tg~i~~~~~-LD~e~~~~y~l~v~a~D~~~~~~~~~~~v~I~V~D~Nd~~P 79 (79)
T smart00112 11 VTYSILSGNED-GLFSIDPETGEITTTKP-LDREEQPEYTLTVEATDGGGPPLSSTATVTVTVLDVNDNAP 79 (79)
T ss_pred EEEEEecCCCC-CEEEEeCCccEEEeCCc-cCeeCCCeEEEEEEEEECCCCCcccEEEEEEEEEECCCCCC
Confidence 55666655543 45554433332221111 11245689999999999986 478999999999999998
No 9
>KOG1834|consensus
Probab=98.53 E-value=3.9e-07 Score=71.41 Aligned_cols=53 Identities=34% Similarity=0.447 Sum_probs=46.0
Q ss_pred ceEeEEEEEEEEecCCc---------cceEEEEEEEeecCCCCCEEecCcceEEEECCCCCC
Q psy585 51 KRVLQYELTLVASDSLN---------ENKTTVVIHINDVNDMPPVFNTSLYPAIMEEELPGP 103 (111)
Q Consensus 51 e~~~~y~l~v~a~d~~~---------~~~~~v~I~V~D~Ndn~P~F~~~~y~~~v~E~~~~~ 103 (111)
|.+..|+|+|+|.|.|. +-.++|.|.|.|+|+.+|+|..+.|.+.|.|+.--.
T Consensus 104 elqkeytf~iQAydCg~gpdgtn~kKShkatvhIrVkDvNe~AP~f~ep~Yka~V~EGK~yd 165 (952)
T KOG1834|consen 104 ELQKEYTFTIQAYDCGNGPDGTNTKKSHKATVHIRVKDVNEFAPVFKEPWYKAHVTEGKVYD 165 (952)
T ss_pred cccccceEEEEEEecCCCCCccccccccceEEEEEeccccccCchhcccceeeEEecceeee
Confidence 55789999999999764 246899999999999999999999999999985433
No 10
>smart00112 CA Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
Probab=98.47 E-value=3.5e-07 Score=53.38 Aligned_cols=30 Identities=43% Similarity=0.603 Sum_probs=28.0
Q ss_pred CEEEEEEEeCC---CccEEEEEEEEeeCCCCCC
Q psy585 1 YELKLAASDNL---KENYTTVVIHVKDVNDNPP 30 (111)
Q Consensus 1 y~l~v~A~D~~---~~~~~~v~I~V~D~NDn~P 30 (111)
|.|.|+|+|.| +++.+.|.|.|.|+|||+|
T Consensus 47 y~l~v~a~D~~~~~~~~~~~v~I~V~D~Nd~~P 79 (79)
T smart00112 47 YTLTVEATDGGGPPLSSTATVTVTVLDVNDNAP 79 (79)
T ss_pred EEEEEEEEECCCCCcccEEEEEEEEEECCCCCC
Confidence 78999999986 6899999999999999998
No 11
>PF00028 Cadherin: Cadherin domain; InterPro: IPR002126 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion.; GO: 0005509 calcium ion binding, 0007156 homophilic cell adhesion, 0016020 membrane; PDB: 2A4E_A 2A4C_B 2O72_A 2QVI_A 1NCJ_A 3Q2W_A 3Q2N_A 3LNH_B 3LNI_A 3Q2L_A ....
Probab=97.60 E-value=0.00023 Score=42.54 Aligned_cols=28 Identities=32% Similarity=0.394 Sum_probs=22.5
Q ss_pred ceEeEEEEEEEEecC-Ccc---ceEEEEEEEe
Q psy585 51 KRVLQYELTLVASDS-LNE---NKTTVVIHIN 78 (111)
Q Consensus 51 e~~~~y~l~v~a~d~-~~~---~~~~v~I~V~ 78 (111)
|....|.|.|.|+|. |.+ ++++|+|+|+
T Consensus 62 E~~~~y~l~v~a~D~~~~~~~~~~~~V~I~V~ 93 (93)
T PF00028_consen 62 ETQSSYQLTVRATDSGGSPPLSSTATVTINVL 93 (93)
T ss_dssp TTTSEEEEEEEEEETTTSSEEEEEEEEEEEEE
T ss_pred ccCCEEEEEEEEEECCCCCCCEEEEEEEEEEC
Confidence 667899999999999 553 6777888774
No 12
>TIGR01965 VCBS_repeat VCBS repeat. This domain of about 100 residues is found multiple (up to 35) copies in long proteins from several species of Vibrio, Colwellia, Bradyrhizobium, and Shewanella (hence the name VCBS) and in smaller copy numbers in proteins from several other bacteria. The large protein size and repeat copy numbers, species distribution, and suggested activities of several member proteins suggests a role for this domain in adhesion.
Probab=92.64 E-value=0.74 Score=28.34 Aligned_cols=42 Identities=26% Similarity=0.377 Sum_probs=32.0
Q ss_pred EeEEEEEEEEecCCccceEEEEEEEeecCCCCCEEecCcceEEEECC
Q psy585 53 VLQYELTLVASDSLNENKTTVVIHINDVNDMPPVFNTSLYPAIMEEE 99 (111)
Q Consensus 53 ~~~y~l~v~a~d~~~~~~~~v~I~V~D~Ndn~P~F~~~~y~~~v~E~ 99 (111)
...-.|++.+.|+ .+.+|.|+|.-.|| +|+..-. -...+.|+
T Consensus 56 ~~tdsFtvtv~DG---tt~~vtItI~GtND-apvi~~~-~~g~v~ED 97 (99)
T TIGR01965 56 TLTDTFTVTSADG---TSQTVTITITGAND-AAVIGGA-DTGSVTED 97 (99)
T ss_pred EEEEEEEEEEeCC---CeEEEEEEEEccCC-CCEEecc-cceeEecC
Confidence 3455677888886 48899999999999 8877643 35777776
No 13
>TIGR03660 T1SS_rpt_143 T1SS-143 repeat domain. This model represents a domain of about 143 amino acids that may occur singly or in up to 23 tandem repeats in very large proteins in the genus Vibrio, and in related species such as Legionella pneumophila, Photobacterium profundum, Rhodopseudomonas palustris, Shewanella pealeana, and Aeromonas hydrophila. Proteins with these domains represent a subset of a broader set of proteins with a particular signal for type 1 secretion, consisting of several glycine-rich repeats modeled by pfam00353, followed by a C-terminal domain modeled by TIGR03661. Proteins with this domain tend to share several properties with the RtxA (Repeats in Toxin) protein of Vibrio cholerae, including a large size often containing tandemly repeated domains and a C-terminal signal for type 1 secretion.
Probab=91.96 E-value=2.3 Score=27.64 Aligned_cols=66 Identities=21% Similarity=0.316 Sum_probs=43.4
Q ss_pred CcEEEEEEccCCCCCCceEeEEEEEEEEecCCc-cceEEEEEEEeecCCCCCEEecCcceEEEECC-CCCCc
Q psy585 35 PTYRTQITEEDDRTLPKRVLQYELTLVASDSLN-ENKTTVVIHINDVNDMPPVFNTSLYPAIMEEE-LPGPY 104 (111)
Q Consensus 35 ~~y~~~v~E~~~~~~~e~~~~y~l~v~a~d~~~-~~~~~v~I~V~D~Ndn~P~F~~~~y~~~v~E~-~~~~~ 104 (111)
..|.+++...-+-...+......|.|.|+|..+ .+..++.|+|.| | .|.-.... ...|.|+ .+.|.
T Consensus 66 GsYtftL~~~lDH~~g~d~l~l~~~v~a~D~DGD~s~~~l~VtI~D--D-~P~~~~~~-~~~V~E~~L~~g~ 133 (137)
T TIGR03660 66 GSYEFTLEGPLDHAAGSDELTLNFPIIATDFDGDTSSITLPVTIVD--D-VPTITDVD-ALTVDEDDLPGGS 133 (137)
T ss_pred ccEEEEEcccccCCCCCceEEEeeeEEEEeCCCCccccEEEEEEEC--C-CCeecccc-ceEEeccccccCC
Confidence 358888776532211244566778899998755 455688888887 6 68876544 4788885 55443
No 14
>smart00736 CADG Dystroglycan-type cadherin-like domains. Cadherin-homologous domains present in metazoan dystroglycans and alpha/epsilon sarcoglycans, yeast Axl2p and in a very large protein from magnetotactic bacteria. Likely to bind calcium ions.
Probab=89.24 E-value=1.3 Score=26.65 Aligned_cols=30 Identities=30% Similarity=0.352 Sum_probs=25.3
Q ss_pred EeEEEEEEEEecCC-ccceEEEEEEEeecCC
Q psy585 53 VLQYELTLVASDSL-NENKTTVVIHINDVND 82 (111)
Q Consensus 53 ~~~y~l~v~a~d~~-~~~~~~v~I~V~D~Nd 82 (111)
...|.++|.|+|+. ......+.|.|.+.||
T Consensus 66 ~g~~~i~v~a~D~~g~~~~~~f~i~V~~~~~ 96 (97)
T smart00736 66 VGSLSLKVTATDSSGASASDTFTITVVNTND 96 (97)
T ss_pred CcEEEEEEEEEECCCCEEEEEEEEEEeCCCC
Confidence 35699999999986 4677889999999887
No 15
>PF12245 Big_3_2: Bacterial Ig-like domain (group 3); InterPro: IPR022038 This family of proteins is found in bacteria. They have two conserved sequence motifs: AGN and GMT.
Probab=84.26 E-value=3.9 Score=22.48 Aligned_cols=26 Identities=27% Similarity=0.358 Sum_probs=18.4
Q ss_pred CEEEEEEEeC-CCccEEEEEEEEeeCC
Q psy585 1 YELKLAASDN-LKENYTTVVIHVKDVN 26 (111)
Q Consensus 1 y~l~v~A~D~-~~~~~~~v~I~V~D~N 26 (111)
|+|.++|+|. |..+.....+.+.|..
T Consensus 25 yt~~v~a~D~AGN~~~~~~~~~i~d~~ 51 (60)
T PF12245_consen 25 YTLTVTATDKAGNTSSSTTQIVIVDNT 51 (60)
T ss_pred EEEEEEEEECCCCEEEeeeEEEEEcCC
Confidence 7899999997 6666666666666543
No 16
>PF13750 Big_3_3: Bacterial Ig-like domain (group 3)
Probab=82.71 E-value=11 Score=24.86 Aligned_cols=27 Identities=19% Similarity=0.247 Sum_probs=16.1
Q ss_pred eEeEEEEEEEEecCCc-cceEEEEEEEe
Q psy585 52 RVLQYELTLVASDSLN-ENKTTVVIHIN 78 (111)
Q Consensus 52 ~~~~y~l~v~a~d~~~-~~~~~v~I~V~ 78 (111)
....|+|+|.|.|..+ .++.++.....
T Consensus 121 ~~~~YtLtV~a~D~aGN~~~~si~F~y~ 148 (158)
T PF13750_consen 121 ADDSYTLTVSATDKAGNQSTKSISFSYM 148 (158)
T ss_pred CCCeEEEEEEEEecCCCEEEEEEEEEEe
Confidence 3457777777777644 45555555443
No 17
>PF07495 Y_Y_Y: Y_Y_Y domain; InterPro: IPR011123 This region is mostly found at the end of the beta propellers (IPR011110 from INTERPRO) in a family of two component regulators. However they are also found tandemly repeated in Q891H4 from SWISSPROT without other signal conduction domains being present. It is named after the conserved tyrosines found in the alignment. The exact function is not known.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=80.09 E-value=7 Score=21.24 Aligned_cols=26 Identities=23% Similarity=0.321 Sum_probs=18.2
Q ss_pred EeEEEEEEEEecCCcc---ceEEEEEEEe
Q psy585 53 VLQYELTLVASDSLNE---NKTTVVIHIN 78 (111)
Q Consensus 53 ~~~y~l~v~a~d~~~~---~~~~v~I~V~ 78 (111)
...|+|.|.|.|..+. ...++.|.|+
T Consensus 38 ~G~Y~l~V~a~~~~~~~~~~~~~l~i~I~ 66 (66)
T PF07495_consen 38 PGKYTLEVRAKDNNGKWSSDEKSLTITIL 66 (66)
T ss_dssp SEEEEEEEEEEETTS-B-SS-EEEEEEEE
T ss_pred CEEEEEEEEEECCCCCcCcccEEEEEEEC
Confidence 4689999999997652 2366777663
No 18
>TIGR03660 T1SS_rpt_143 T1SS-143 repeat domain. This model represents a domain of about 143 amino acids that may occur singly or in up to 23 tandem repeats in very large proteins in the genus Vibrio, and in related species such as Legionella pneumophila, Photobacterium profundum, Rhodopseudomonas palustris, Shewanella pealeana, and Aeromonas hydrophila. Proteins with these domains represent a subset of a broader set of proteins with a particular signal for type 1 secretion, consisting of several glycine-rich repeats modeled by pfam00353, followed by a C-terminal domain modeled by TIGR03661. Proteins with this domain tend to share several properties with the RtxA (Repeats in Toxin) protein of Vibrio cholerae, including a large size often containing tandemly repeated domains and a C-terminal signal for type 1 secretion.
Probab=78.61 E-value=9.2 Score=24.85 Aligned_cols=40 Identities=20% Similarity=0.311 Sum_probs=25.4
Q ss_pred EEEEEEEeC-CCccEEEEEEEEeeCCCCCCeecCCcEEEEEEccC
Q psy585 2 ELKLAASDN-LKENYTTVVIHVKDVNDNPPVFERPTYRTQITEED 45 (111)
Q Consensus 2 ~l~v~A~D~-~~~~~~~v~I~V~D~NDn~P~F~~~~y~~~v~E~~ 45 (111)
.|.|.|+|. |-.+...+.|.|.| | .|.-.... ...|.|++
T Consensus 88 ~~~v~a~D~DGD~s~~~l~VtI~D--D-~P~~~~~~-~~~V~E~~ 128 (137)
T TIGR03660 88 NFPIIATDFDGDTSSITLPVTIVD--D-VPTITDVD-ALTVDEDD 128 (137)
T ss_pred eeeEEEEeCCCCccccEEEEEEEC--C-CCeecccc-ceEEeccc
Confidence 466788875 44444577788777 5 47665543 36777754
No 19
>PF08758 Cadherin_pro: Cadherin prodomain like; InterPro: IPR014868 Cadherins are a group of proteins that mediate calcium dependent cell-cell adhesion. They are activated through cleavage of a prosequence in the late Golgi. This protein corresponds to the folded region of the prosequence, and is termed the prodomain. The prodomain shows structural resemblance to the cadherin domain, but lacks all the features known to be important for cadherin-cadherin interactions []. ; GO: 0007155 cell adhesion, 0016021 integral to membrane; PDB: 1OP4_A.
Probab=74.98 E-value=11 Score=22.55 Aligned_cols=17 Identities=18% Similarity=0.374 Sum_probs=5.1
Q ss_pred CCeecCCcEEEEEEccC
Q psy585 29 PPVFERPTYRTQITEED 45 (111)
Q Consensus 29 ~P~F~~~~y~~~v~E~~ 45 (111)
.|-|.+..|.+.|+.+-
T Consensus 3 ~pGF~~~~~~~~Vp~~l 19 (90)
T PF08758_consen 3 RPGFSQKKYTFEVPSNL 19 (90)
T ss_dssp --B--S-EEEE----SS
T ss_pred cCCcccceEEEEcCchh
Confidence 46666667777666653
No 20
>smart00089 PKD Repeats in polycystic kidney disease 1 (PKD1) and other proteins. Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.
Probab=67.24 E-value=17 Score=20.36 Aligned_cols=26 Identities=19% Similarity=0.390 Sum_probs=20.0
Q ss_pred eEeEEEEEEEEecCCccceEEEEEEE
Q psy585 52 RVLQYELTLVASDSLNENKTTVVIHI 77 (111)
Q Consensus 52 ~~~~y~l~v~a~d~~~~~~~~v~I~V 77 (111)
....|.+++.+.|..+.+++++.|.|
T Consensus 53 ~~G~y~v~l~v~n~~g~~~~~~~i~v 78 (79)
T smart00089 53 KPGTYTVTLTVTNAVGSASATVTVVV 78 (79)
T ss_pred CCcEEEEEEEEEcCCCcEEEEEEEEE
Confidence 44689999999998776677777765
No 21
>TIGR01965 VCBS_repeat VCBS repeat. This domain of about 100 residues is found multiple (up to 35) copies in long proteins from several species of Vibrio, Colwellia, Bradyrhizobium, and Shewanella (hence the name VCBS) and in smaller copy numbers in proteins from several other bacteria. The large protein size and repeat copy numbers, species distribution, and suggested activities of several member proteins suggests a role for this domain in adhesion.
Probab=65.65 E-value=18 Score=22.21 Aligned_cols=38 Identities=24% Similarity=0.384 Sum_probs=26.6
Q ss_pred EEEEEEEeCCCccEEEEEEEEeeCCCCCCeecCCcEEEEEEcc
Q psy585 2 ELKLAASDNLKENYTTVVIHVKDVNDNPPVFERPTYRTQITEE 44 (111)
Q Consensus 2 ~l~v~A~D~~~~~~~~v~I~V~D~NDn~P~F~~~~y~~~v~E~ 44 (111)
.+++.+.|+ .+..|.|.|.-.|| +|.... .-.+.+.|+
T Consensus 60 sFtvtv~DG---tt~~vtItI~GtND-apvi~~-~~~g~v~ED 97 (99)
T TIGR01965 60 TFTVTSADG---TSQTVTITITGAND-AAVIGG-ADTGSVTED 97 (99)
T ss_pred EEEEEEeCC---CeEEEEEEEEccCC-CCEEec-ccceeEecC
Confidence 467778886 37889999999999 454433 234667665
No 22
>cd00146 PKD polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.
Probab=60.92 E-value=21 Score=20.06 Aligned_cols=26 Identities=15% Similarity=0.355 Sum_probs=17.4
Q ss_pred eEeEEEEEEEEecCCc-cceEEEEEEE
Q psy585 52 RVLQYELTLVASDSLN-ENKTTVVIHI 77 (111)
Q Consensus 52 ~~~~y~l~v~a~d~~~-~~~~~v~I~V 77 (111)
+...|.+++.+.|..+ ....++.|.|
T Consensus 55 ~~G~y~v~l~v~d~~g~~~~~~~~V~V 81 (81)
T cd00146 55 KPGTYTVTLTVTNAVGSSSTKTTTVVV 81 (81)
T ss_pred CCcEEEEEEEEEeCCCCEEEEEEEEEC
Confidence 3468999999999844 4444555543
No 23
>PF13860 FlgD_ig: FlgD Ig-like domain; PDB: 3C12_A 3OSV_A.
Probab=59.10 E-value=33 Score=19.75 Aligned_cols=51 Identities=20% Similarity=0.148 Sum_probs=25.2
Q ss_pred EEEEEEEEeeCCCCCCeec-------CCcEEEEEEccCCCCCCceEeEEEEEEEEecCC
Q psy585 15 YTTVVIHVKDVNDNPPVFE-------RPTYRTQITEEDDRTLPKRVLQYELTLVASDSL 66 (111)
Q Consensus 15 ~~~v~I~V~D~NDn~P~F~-------~~~y~~~v~E~~~~~~~e~~~~y~l~v~a~d~~ 66 (111)
...+.|.|.|.+-+. +.+ ...+.+.-.=.++.+.+-....|.+.+.|+|.|
T Consensus 24 a~~v~v~I~d~~G~~-V~t~~~~~~~~G~~~~~WdG~d~~G~~~~~G~Y~~~v~a~~~g 81 (81)
T PF13860_consen 24 ADNVTVTIYDSNGQV-VRTISLGSQSAGEHSFTWDGKDDDGNPVPDGTYTFRVTATDGG 81 (81)
T ss_dssp CEEEEEEEEETTS-E-EEEEEEEECSSEEEEEEE-SB-TTS-B--SEEEEEEEEEEET-
T ss_pred ccEEEEEEEcCCCCE-EEEEEcCCcCCceEEEEECCCCCCcCCCCCCCEEEEEEEEeCC
Confidence 446788888886643 111 011122211113444455567899999999875
No 24
>PF10365 DUF2436: Domain of unknown function (DUF2436); InterPro: IPR018832 Gingipains R and K are endopeptidases with specificity for arginyl and lysyl bonds, respectively. Like other cysteine peptidases, they require reducing conditions for activity. They are maximally active at approximately neutral pH. Gingipains R and K are secreted by the bacterium Porphyromonas gingivalis (Bacteroides gingivalis). The bacterium is a major pathogen in periodontal disease, and the many ways in which the activities of the gingipains may contribute to the disease processes have been reviewed []. These enzymes are also involved in the hemagglutinating activity of the organisms. This entry represents a central region found in gingipain K peptidases, active on lysyl bonds; they belong to the MEROPS peptidase family C25 (gingipain family, clan CD).
Probab=56.20 E-value=24 Score=23.23 Aligned_cols=23 Identities=22% Similarity=0.250 Sum_probs=16.0
Q ss_pred CCCCCeecCCcEEEEEEccCCCC
Q psy585 26 NDNPPVFERPTYRTQITEEDDRT 48 (111)
Q Consensus 26 NDn~P~F~~~~y~~~v~E~~~~~ 48 (111)
|+|+|.=.-..|+..|++|+++.
T Consensus 66 n~~~pa~ly~~FEYkiP~NADps 88 (161)
T PF10365_consen 66 NCNVPANLYDPFEYKIPANADPS 88 (161)
T ss_pred CCCCChhhcccceEeccCCCCCc
Confidence 55666544467888899988764
No 25
>PF13754 Big_3_4: Bacterial Ig-like domain (group 3)
Probab=53.74 E-value=20 Score=19.07 Aligned_cols=10 Identities=30% Similarity=0.710 Sum_probs=6.9
Q ss_pred CEEEEEEEeC
Q psy585 1 YELKLAASDN 10 (111)
Q Consensus 1 y~l~v~A~D~ 10 (111)
|.++++|+|.
T Consensus 26 y~itv~a~D~ 35 (54)
T PF13754_consen 26 YTITVTATDA 35 (54)
T ss_pred EEEEEEEEeC
Confidence 5677777775
No 26
>PF03160 Calx-beta: Calx-beta domain; InterPro: IPR003644 The calx-beta motif is present as a tandem repeat in the cytoplasmic domains of Calx Na-Ca exchangers, which are used to expel calcium from cells. This motif overlaps domains used for calcium binding and regulation. The calx-beta motif is also present in the cytoplasmic tail of mammalian integrin-beta4, which mediates the bi-directional transfer of signals across the plasma membrane, as well as in some cyanobacterial proteins. This motif contains a series of beta-strands and turns that form a self-contained beta-sheet [, ].; GO: 0007154 cell communication, 0016021 integral to membrane; PDB: 3H6A_B 3FSO_A 3FQ4_B 2DPK_A 2QVM_A 3GIN_B 2QVK_A 2FWU_A 2FWS_A 3E9U_A ....
Probab=52.76 E-value=24 Score=20.86 Aligned_cols=25 Identities=24% Similarity=0.189 Sum_probs=18.1
Q ss_pred EEEEeecCCCCCEEecCcceEEEECCC
Q psy585 74 VIHINDVNDMPPVFNTSLYPAIMEEEL 100 (111)
Q Consensus 74 ~I~V~D~Ndn~P~F~~~~y~~~v~E~~ 100 (111)
+|.|.| ||.+ .+....-...+.|+.
T Consensus 2 tvtI~d-~d~~-~v~f~~~~~~v~E~~ 26 (100)
T PF03160_consen 2 TVTILD-DDDP-TVSFSSPSYTVSEGD 26 (100)
T ss_dssp EEEEE--TTSE-EEEESSSEEEEETTS
T ss_pred EEEEEC-CCCC-EEEEeCCEEEEEeCC
Confidence 567788 7744 888777778888885
No 27
>PF02494 HYR: HYR domain; InterPro: IPR003410 This domain is known as the HYR (Hyalin Repeat) domain, after the protein hyalin that is composed exclusively of this repeat. This domain probably corresponds to a new superfamily in the immunoglobulin fold. The function of this domain is uncertain it may be involved in cell adhesion. In the Sushi repeat-containing protein (SrpX), this domain is found between two sushi repeats.
Probab=52.13 E-value=43 Score=19.03 Aligned_cols=32 Identities=16% Similarity=0.046 Sum_probs=22.1
Q ss_pred CCCCCceEeEEEEEEEEecC-CccceEEEEEEE
Q psy585 46 DRTLPKRVLQYELTLVASDS-LNENKTTVVIHI 77 (111)
Q Consensus 46 ~~~~~e~~~~y~l~v~a~d~-~~~~~~~v~I~V 77 (111)
.++..-....|.++..|+|. |..++++..|+|
T Consensus 49 ~~g~~f~~G~t~V~ytA~D~~GN~a~C~f~V~V 81 (81)
T PF02494_consen 49 PPGDLFPVGTTTVTYTATDAAGNSATCSFTVTV 81 (81)
T ss_pred CCCceEeeceEEEEEEEEECCCCEEEEEEEEEC
Confidence 33333344589999999997 446788887765
No 28
>TIGR00864 PCC polycystin cation channel protein. Note: this model has been restricted to the amino half because for technical reasons.
Probab=47.12 E-value=2.9e+02 Score=27.06 Aligned_cols=28 Identities=7% Similarity=0.317 Sum_probs=21.4
Q ss_pred eEeEEEEEEEEecCCccceEEEEEEEee
Q psy585 52 RVLQYELTLVASDSLNENKTTVVIHIND 79 (111)
Q Consensus 52 ~~~~y~l~v~a~d~~~~~~~~v~I~V~D 79 (111)
....|++++.|.|..+....++.|.|.+
T Consensus 1566 spGtYtVtLTvtN~~Gs~~~T~~i~V~~ 1593 (2740)
T TIGR00864 1566 SVGTFNIIVTAENDVGAAQASIFLFVLQ 1593 (2740)
T ss_pred CCceEEEEEEEecCCCccceeEEEEEee
Confidence 3468999999999887766666666664
No 29
>TIGR00845 caca sodium/calcium exchanger 1. This model is specific for the eukaryotic sodium ion/calcium ion exchangers of the Caca family
Probab=46.61 E-value=1.9e+02 Score=24.94 Aligned_cols=28 Identities=21% Similarity=0.246 Sum_probs=20.7
Q ss_pred EEEEEEeecCCCCCEEecCcceEEEECCC
Q psy585 72 TVVIHINDVNDMPPVFNTSLYPAIMEEEL 100 (111)
Q Consensus 72 ~v~I~V~D~Ndn~P~F~~~~y~~~v~E~~ 100 (111)
+.+|+|.| ||++|.|....-..+|.|+.
T Consensus 516 ~ATVTIlD-DD~aGIfsFe~~~~sV~Es~ 543 (928)
T TIGR00845 516 TATVTILD-DDHAGIFTFEEDVFHVSESI 543 (928)
T ss_pred eEEEEEec-CcccCcccccCceEEEEcCC
Confidence 45666677 78899887666667888874
No 30
>PF07145 PAM2: Ataxin-2 C-terminal region; InterPro: IPR009818 This entry represents a conserved region approximately 250 residues long located towards the C terminus of eukaryotic ataxin-2. Ataxin-2 is a protein of unknown function, within which expansion of a polyglutamine tract (due to expansion of unstable CAG repeats in the coding region of the SCA2 gene) causes spinocerebellar ataxia type 2 (SCA2), a late-onset neurodegenerative disorder []. The expanded polyglutamine repeat in ataxin-2 causes disruption of the normal morphology of the Golgi complex and increased incidence of cell death []. Ataxin-2 is predicted to consist of mostly non-globular domains [].; PDB: 3NTW_B 1JH4_B 3KTR_B 3KUJ_B 3KUT_D 3KUS_D 1JGN_B 2RQG_A 2RQH_A.
Probab=42.92 E-value=11 Score=15.84 Aligned_cols=12 Identities=33% Similarity=0.382 Sum_probs=8.1
Q ss_pred cCCCCCEEecCc
Q psy585 80 VNDMPPVFNTSL 91 (111)
Q Consensus 80 ~Ndn~P~F~~~~ 91 (111)
.|-|+|.|.+..
T Consensus 6 LNp~A~eFvP~~ 17 (18)
T PF07145_consen 6 LNPNAPEFVPSS 17 (18)
T ss_dssp SSTTSSSS-TTT
T ss_pred cCCCCccccCCC
Confidence 577889887654
No 31
>TIGR00845 caca sodium/calcium exchanger 1. This model is specific for the eukaryotic sodium ion/calcium ion exchangers of the Caca family
Probab=42.15 E-value=1.6e+02 Score=25.36 Aligned_cols=29 Identities=17% Similarity=0.271 Sum_probs=20.1
Q ss_pred EEEEEEEeeCCCCCCeecCCcEEEEEEccC
Q psy585 16 TTVVIHVKDVNDNPPVFERPTYRTQITEED 45 (111)
Q Consensus 16 ~~v~I~V~D~NDn~P~F~~~~y~~~v~E~~ 45 (111)
...+|+|.| ||++|.|.-..-...|.|+.
T Consensus 515 s~ATVTIlD-DD~aGIfsFe~~~~sV~Es~ 543 (928)
T TIGR00845 515 NTATVTILD-DDHAGIFTFEEDVFHVSESI 543 (928)
T ss_pred ceEEEEEec-CcccCcccccCceEEEEcCC
Confidence 356677777 88999876544456788864
No 32
>cd05762 Ig8_MLCK Eighth immunoglobulin (Ig)-like domain of human myosin light-chain kinase (MLCK). Ig8_MLCK: the eighth immunoglobulin (Ig)-like domain of human myosin light-chain kinase (MLCK). MLCK is a key regulator of different forms of cell motility involving actin and myosin II. Agonist stimulation of smooth muscle cells increases cytosolic Ca2+, which binds calmodulin. This Ca2+-calmodulin complex in turn binds to and activates MLCK. Activated MLCK leads to the phosphorylation of the 20 kDa myosin regulatory light chain (RLC) of myosin II and the stimulation of actin-activated myosin MgATPase activity. MLCK is widely present in vertebrate tissues; it phosphorylates the 20 kDa RLC of both smooth and nonmuscle myosin II. Phosphorylation leads to the activation of the myosin motor domain and altered structural properties of myosin II. In smooth muscle MLCK it is involved in initiating contraction. In nonmuscle cells, MLCK may participate in cell division and cell motility; it has
Probab=39.17 E-value=82 Score=18.55 Aligned_cols=29 Identities=28% Similarity=0.472 Sum_probs=19.0
Q ss_pred eEEEEEEEEecCCccceEEEEEEEeecCCCCC
Q psy585 54 LQYELTLVASDSLNENKTTVVIHINDVNDMPP 85 (111)
Q Consensus 54 ~~y~l~v~a~d~~~~~~~~v~I~V~D~Ndn~P 85 (111)
..|++. |.+..+...+++.+.|.|.-+ ||
T Consensus 69 G~Ytc~--a~N~~G~~~~~~~l~V~~~P~-pP 97 (98)
T cd05762 69 GCYTLE--VENKLGSRQAQVNLTVVDKPD-PP 97 (98)
T ss_pred EEEEEE--EEcCCCceeEEEEEEEecCCC-CC
Confidence 445443 444445677889999998876 55
No 33
>cd01757 PLAT_RAB6IP1 PLAT/LH2 domain present in RAB6 interacting protein 1 (Rab6IP1)_like family. PLAT/LH2 domains consists of an eight stranded beta-barrel. In RabIP1 this domain may participate in lipid-mediated modulation of Rab6IP1's function via it's generally proposed function of mediating interaction with lipids or membrane bound proteins.
Probab=37.18 E-value=1.1e+02 Score=19.23 Aligned_cols=42 Identities=12% Similarity=0.109 Sum_probs=28.3
Q ss_pred EEEEEEEEecC--CccceEEEEEEEeecCCCCCEEecC--cceEEE
Q psy585 55 QYELTLVASDS--LNENKTTVVIHINDVNDMPPVFNTS--LYPAIM 96 (111)
Q Consensus 55 ~y~l~v~a~d~--~~~~~~~v~I~V~D~Ndn~P~F~~~--~y~~~v 96 (111)
.|.++|.--+. |.-.++.|.|.+...+...+.+... .+.+.+
T Consensus 2 ~Y~VtV~TG~~~~gAGT~anV~i~L~G~~g~s~~~~L~~~~f~~~v 47 (114)
T cd01757 2 PYHVVIVPSKKLGGSMFTANPWICVSGELGETPPLQIPKNSLEMTF 47 (114)
T ss_pred eEEEEEEeCCCCCCCCCcceEEEEEEEcCCCcCCEEecCCceEEEE
Confidence 48888877765 4457788999998888766655433 344443
No 34
>PF08329 ChitinaseA_N: Chitinase A, N-terminal domain; InterPro: IPR013540 This domain is found in a number of bacterial chitinases and similar viral proteins. It is organised into a fibronectin III module domain-like fold, comprising only beta strands. Its function is not known, but it may be involved in interaction with the enzyme substrate, chitin [, ]. It is separated by a hinge region from the catalytic domain (IPR001223 from INTERPRO); this hinge region is probably mobile, allowing the N-terminal domain to have different relative positions in solution []. ; GO: 0004568 chitinase activity; PDB: 2WLY_A 1EDQ_A 2WM0_A 1X6N_A 1NH6_A 2WK2_A 1EHN_A 2WLZ_A 1EIB_A 1FFR_A ....
Probab=35.54 E-value=84 Score=20.37 Aligned_cols=30 Identities=10% Similarity=0.207 Sum_probs=20.1
Q ss_pred EeEEEEEEEEecCCc-cceEEEEEEEeecCC
Q psy585 53 VLQYELTLVASDSLN-ENKTTVVIHINDVND 82 (111)
Q Consensus 53 ~~~y~l~v~a~d~~~-~~~~~v~I~V~D~Nd 82 (111)
..+|+++|++.|..+ +.++.+.|.|.|-.-
T Consensus 82 gG~y~~~VeLCN~~GCS~S~~~~V~VaDTDG 112 (133)
T PF08329_consen 82 GGRYQMQVELCNADGCSTSAPVEVVVADTDG 112 (133)
T ss_dssp -EEEEEEEEEEETTEEEE---EEEEEE-TTS
T ss_pred CCEEEEEEEEECCCCcccCCCEEEEEeCCCc
Confidence 468999999998776 455678888888764
No 35
>PF15418 DUF4625: Domain of unknown function (DUF4625)
Probab=31.70 E-value=1.4e+02 Score=19.11 Aligned_cols=25 Identities=16% Similarity=0.327 Sum_probs=16.4
Q ss_pred EeEEEEEEEEecCCc-cceEEEEEEE
Q psy585 53 VLQYELTLVASDSLN-ENKTTVVIHI 77 (111)
Q Consensus 53 ~~~y~l~v~a~d~~~-~~~~~v~I~V 77 (111)
...|.|.+.++|..+ .......|.|
T Consensus 106 ~G~YH~~i~VtD~~Gn~~~~~~~i~I 131 (132)
T PF15418_consen 106 AGDYHFMITVTDAAGNQTEEERSIKI 131 (132)
T ss_pred CcceEEEEEEEECCCCEEEEEEEEEE
Confidence 357999999999765 3444444443
No 36
>PF09100 Qn_am_d_aIV: Quinohemoprotein amine dehydrogenase, alpha subunit domain IV; InterPro: IPR015184 This domain is predominantly found in the prokaryotic protein quinohemoprotein amine dehydrogenase, adopting an immunoglobulin-like beta-sandwich fold, with seven strands arranged into two beta sheets; the fold is possibly related to the immunoglobulin and/or fibronectin type III superfamilies. The precise function of this domain has not, as yet, been defined []. ; PDB: 1JMZ_A 1JMX_A 1PBY_A 1JJU_A.
Probab=31.15 E-value=1.2e+02 Score=19.67 Aligned_cols=32 Identities=13% Similarity=0.361 Sum_probs=16.4
Q ss_pred eEEEEEEEEecCCc--cceEEEEEEEeecCCCCCE
Q psy585 54 LQYELTLVASDSLN--ENKTTVVIHINDVNDMPPV 86 (111)
Q Consensus 54 ~~y~l~v~a~d~~~--~~~~~v~I~V~D~Ndn~P~ 86 (111)
....+.-.+.|++. ++.+.+.|+|.+-|+ +|.
T Consensus 100 Gnl~VvAtv~d~~~~l~~e~~liVtVqr~~~-ppi 133 (133)
T PF09100_consen 100 GNLKVVATVKDGGKPLTGEAHLIVTVQRWNN-PPI 133 (133)
T ss_dssp EEEEEEEEETTTT---EEEEEEEEE---S----S-
T ss_pred ccEEEEEEEccCCcccceeEeEEEEeecccC-CCC
Confidence 34455555566665 578889999999888 763
No 37
>KOG4221|consensus
Probab=30.98 E-value=1.7e+02 Score=26.20 Aligned_cols=59 Identities=24% Similarity=0.485 Sum_probs=35.7
Q ss_pred CCCeecCC---cEEEEEEccCCCCC--------------CceEeEEEEEEEEecCCcc--ceEEEEE-EEeecCCCCCE
Q psy585 28 NPPVFERP---TYRTQITEEDDRTL--------------PKRVLQYELTLVASDSLNE--NKTTVVI-HINDVNDMPPV 86 (111)
Q Consensus 28 n~P~F~~~---~y~~~v~E~~~~~~--------------~e~~~~y~l~v~a~d~~~~--~~~~v~I-~V~D~Ndn~P~ 86 (111)
.+|.|... .|.....|++.... .++...|.+.+.|.+..++ +++.++| +..|+=+-||.
T Consensus 542 epP~~~n~~I~~yk~~ys~~~~~~~~~~~~n~~e~ti~gL~k~TeY~~~vvA~N~~G~g~sS~~i~V~Tlsd~PsaPP~ 620 (1381)
T KOG4221|consen 542 EPPPFGNGPITGYKLFYSEDDTGKELRVENNATEYTINGLEKYTEYSIRVVAYNSAGSGVSSADITVRTLSDVPSAPPQ 620 (1381)
T ss_pred cCCCCCCCCceEEEEEEEcCCCCceEEEecCccEEEeecCCCccceEEEEEEecCCCCCCCCCceEEEeccCCCCCCCc
Confidence 36776544 47777777732211 1566789999999997665 3444433 44555555554
No 38
>PF05688 DUF824: Salmonella repeat of unknown function (DUF824); InterPro: IPR008542 This family consists of a series of repeated sequences (of around 180 residues) which are found in Salmonella typhimurium, Salmonella typhi and Escherichia coli. These repeats are almost always found with this entry. The repeats are associated with RatA and RatB, the coding sequences of which are found in the pathogeneicity island of Salmonella. The sequences may be determinants of pathogenicity [, ].
Probab=27.68 E-value=88 Score=16.46 Aligned_cols=16 Identities=19% Similarity=0.482 Sum_probs=12.5
Q ss_pred ceEEEEEEEeecCCCC
Q psy585 69 NKTTVVIHINDVNDMP 84 (111)
Q Consensus 69 ~~~~v~I~V~D~Ndn~ 84 (111)
.+..++|++.|.|.||
T Consensus 13 e~I~ltVt~kda~G~p 28 (47)
T PF05688_consen 13 ETIPLTVTVKDANGNP 28 (47)
T ss_pred CeEEEEEEEECCCCCC
Confidence 4678899999997754
No 39
>PF05345 He_PIG: Putative Ig domain; InterPro: IPR008009 This alignment represents the conserved core region of a ~90 residue repeat found in several haemagglutinins and other cell surface proteins. Sequence similarities to Hyalin (IPR003410 from INTERPRO) and the PKD domain (IPR000601 from INTERPRO) suggest an Ig-like fold so this family may be similar in function to the (IPR003791 from INTERPRO) and (IPR003790 from INTERPRO) protein families.
Probab=25.95 E-value=91 Score=16.23 Aligned_cols=13 Identities=31% Similarity=0.424 Sum_probs=10.9
Q ss_pred eEEEEEEEEecCC
Q psy585 54 LQYELTLVASDSL 66 (111)
Q Consensus 54 ~~y~l~v~a~d~~ 66 (111)
..|.+.|.|+|+.
T Consensus 36 G~y~~~vtatd~~ 48 (49)
T PF05345_consen 36 GTYTFTVTATDGS 48 (49)
T ss_pred cEEEEEEEEEcCC
Confidence 5799999999874
No 40
>cd02848 Chitinase_N_term Chitinase N-terminus domain. Chitinases hydrolyze the abundant natural biopolymer chitin, producing smaller chito-oligosaccharides. Chitin consists of multiple N-acetyl-D-glucosamine (NAG) residues connected via beta-1,4-glycosidic linkages and is an important structural element of fungal cell wall and arthropod exoskeletons. On the basis of the mode of chitin hydrolysis, chitinases are classified as random, endo-, and exo-chitinases and based on sequence criteria, chitinases belong to families 18 and 19 of glycosyl hydrolases. The N-terminus of chitinase may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitob
Probab=25.86 E-value=1.1e+02 Score=19.04 Aligned_cols=27 Identities=11% Similarity=0.172 Sum_probs=17.6
Q ss_pred EeEEEEEEEEecCCc-cceEEEEEEEee
Q psy585 53 VLQYELTLVASDSLN-ENKTTVVIHIND 79 (111)
Q Consensus 53 ~~~y~l~v~a~d~~~-~~~~~v~I~V~D 79 (111)
..+|.++|++-|..+ +.++.+.|.|-|
T Consensus 79 gG~y~m~V~lCn~dGCS~S~~~~I~VAD 106 (106)
T cd02848 79 GGRYQMQVALCNGDGCSTSAAKEIVVAD 106 (106)
T ss_pred CCeEEEEEEEECCCCccCcCCEEEEecC
Confidence 357999999888765 334455565544
No 41
>COG1470 Predicted membrane protein [Function unknown]
Probab=24.00 E-value=3.8e+02 Score=21.44 Aligned_cols=10 Identities=30% Similarity=0.667 Sum_probs=5.6
Q ss_pred CEEEEEEEeC
Q psy585 1 YELKLAASDN 10 (111)
Q Consensus 1 y~l~v~A~D~ 10 (111)
|.+.|.|.-+
T Consensus 352 Ynv~I~A~s~ 361 (513)
T COG1470 352 YNVTITASSS 361 (513)
T ss_pred eeEEEEEecc
Confidence 5566666543
No 42
>PF02369 Big_1: Bacterial Ig-like domain (group 1); InterPro: IPR003344 Proteins that contain this domain are found in a variety of bacterial and phage surface proteins such as intimins. Intimin is a bacterial cell-adhesion molecule that mediates the intimate bacterial host-cell interaction. It contains three domains; two immunoglobulin-like domains and a C-type lectin-like module implying that carbohydrate recognition may be important in intimin-mediated cell adhesion [].; PDB: 1CWV_A 4E9L_A 1F02_I 1F00_I.
Probab=20.73 E-value=89 Score=18.70 Aligned_cols=17 Identities=24% Similarity=0.470 Sum_probs=12.6
Q ss_pred cceEEEEEEEeecCCCC
Q psy585 68 ENKTTVVIHINDVNDMP 84 (111)
Q Consensus 68 ~~~~~v~I~V~D~Ndn~ 84 (111)
....++++.|.|.|.||
T Consensus 22 ~~~~tltatV~D~~gnp 38 (100)
T PF02369_consen 22 SDTNTLTATVTDANGNP 38 (100)
T ss_dssp SS-EEEEEEEEETTSEB
T ss_pred cCcEEEEEEEEcCCCCC
Confidence 46678888999988855
No 43
>PF00801 PKD: PKD domain; InterPro: IPR000601 The PKD (Polycystic Kidney Disease) domain was first identified in the Polycystic Kidney Disease protein, polycystin-1 (PDK1 gene), and contains an Ig-like fold consisting of a beta-sandwich of seven strands in two sheets with a Greek key topology, although some members have additional strands []. Polycystin-1 is a large cell-surface glycoprotein involved in adhesive protein-protein and protein-carbohydrate interactions; however it is not clear if the PKD domain mediates any of these interactions. PKD domains are also found in other proteins, usually in the extracellular parts of proteins involved in interactions with other proteins. For example, domains with a PKD-type fold are found in archaeal surface layer proteins that protect the cell from extreme environments [], and in the human VPS10 domain-containing receptor SorCS2 [].; PDB: 1B4R_A 2KZW_A 2C4X_A 2C26_A 2Y72_B 3JQU_A 3JS7_B 1WGO_A 1L0Q_A.
Probab=20.06 E-value=1.6e+02 Score=15.80 Aligned_cols=16 Identities=19% Similarity=0.343 Sum_probs=11.7
Q ss_pred EeEEEEEEEEecCCcc
Q psy585 53 VLQYELTLVASDSLNE 68 (111)
Q Consensus 53 ~~~y~l~v~a~d~~~~ 68 (111)
...|.+++.|+|..+.
T Consensus 51 ~G~y~V~ltv~n~~g~ 66 (69)
T PF00801_consen 51 PGTYTVTLTVTNGVGS 66 (69)
T ss_dssp SEEEEEEEEEEETTSE
T ss_pred CeEEEEEEEEEECCCC
Confidence 4678888888887543
Done!