Query psy5884
Match_columns 195
No_of_seqs 193 out of 1323
Neff 9.5
Searched_HMMs 46136
Date Fri Aug 16 20:36:06 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy5884.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/5884hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG4289|consensus 100.0 1.7E-35 3.7E-40 258.8 16.6 186 3-193 221-418 (2531)
2 KOG4289|consensus 100.0 1.4E-33 3E-38 247.0 19.4 186 3-192 735-932 (2531)
3 KOG1219|consensus 100.0 8.7E-30 1.9E-34 229.6 19.0 179 2-187 2130-2313(4289)
4 KOG1219|consensus 100.0 7.1E-29 1.5E-33 223.8 21.7 180 3-187 903-1090(4289)
5 cd00031 CA Cadherin repeat dom 99.9 6.4E-24 1.4E-28 159.5 20.7 139 3-145 54-198 (199)
6 cd00031 CA Cadherin repeat dom 99.9 6.4E-22 1.4E-26 148.6 18.7 126 57-185 2-132 (199)
7 PF00028 Cadherin: Cadherin do 99.7 1.7E-16 3.8E-21 105.2 12.7 87 57-145 1-93 (93)
8 KOG1834|consensus 99.7 4.4E-16 9.6E-21 130.0 14.6 137 2-145 94-243 (952)
9 smart00112 CA Cadherin repeats 99.6 5.7E-15 1.2E-19 94.8 10.2 75 76-153 1-79 (79)
10 smart00112 CA Cadherin repeats 99.2 3.5E-11 7.5E-16 77.1 7.1 47 3-51 33-79 (79)
11 KOG1834|consensus 99.2 2.1E-10 4.6E-15 96.6 13.0 142 42-186 22-179 (952)
12 PF00028 Cadherin: Cadherin do 98.3 3.1E-06 6.7E-11 55.6 6.9 40 3-44 53-93 (93)
13 PF08758 Cadherin_pro: Cadheri 97.2 0.0064 1.4E-07 39.7 9.1 77 50-135 3-82 (90)
14 smart00736 CADG Dystroglycan-t 97.1 0.023 4.9E-07 37.6 11.0 72 75-150 24-96 (97)
15 PF08266 Cadherin_2: Cadherin- 97.0 0.00081 1.8E-08 43.3 3.4 59 59-120 5-66 (84)
16 PF13750 Big_3_3: Bacterial Ig 97.0 0.07 1.5E-06 38.7 16.1 132 10-145 8-148 (158)
17 TIGR01965 VCBS_repeat VCBS rep 94.2 1 2.2E-05 29.9 8.9 86 71-167 2-97 (99)
18 PF05345 He_PIG: Putative Ig d 90.2 2.2 4.7E-05 24.3 5.9 36 97-132 12-48 (49)
19 TIGR03660 T1SS_rpt_143 T1SS-14 88.7 2.7 5.9E-05 29.6 6.7 60 1-65 68-128 (137)
20 smart00736 CADG Dystroglycan-t 88.3 1.8 4E-05 28.3 5.4 31 16-48 66-96 (97)
21 PF07495 Y_Y_Y: Y_Y_Y domain; 80.2 4.6 9.9E-05 24.0 4.2 30 15-44 37-66 (66)
22 PF03160 Calx-beta: Calx-beta 78.9 15 0.00033 23.8 8.8 50 40-92 2-52 (100)
23 TIGR01965 VCBS_repeat VCBS rep 77.3 12 0.00026 24.8 5.7 41 16-63 56-96 (99)
24 PF08758 Cadherin_pro: Cadheri 77.2 2 4.4E-05 27.9 2.0 35 151-185 2-36 (90)
25 PF07495 Y_Y_Y: Y_Y_Y domain; 71.6 18 0.00039 21.3 8.8 58 82-144 6-65 (66)
26 KOG3597|consensus 70.9 66 0.0014 27.5 13.4 58 35-93 24-83 (442)
27 TIGR00845 caca sodium/calcium 67.0 1.1E+02 0.0025 28.7 12.6 118 46-168 395-543 (928)
28 PF12245 Big_3_2: Bacterial Ig 64.9 27 0.00057 20.6 5.6 27 119-145 22-48 (60)
29 PF02010 REJ: REJ domain; Int 64.0 16 0.00035 30.7 5.1 131 17-155 50-193 (440)
30 TIGR00845 caca sodium/calcium 63.9 1.3E+02 0.0029 28.3 13.1 56 35-93 513-569 (928)
31 PF13754 Big_3_4: Bacterial Ig 59.9 31 0.00068 19.8 5.0 27 118-144 22-49 (54)
32 TIGR03660 T1SS_rpt_143 T1SS-14 59.4 60 0.0013 22.9 9.0 54 110-168 70-128 (137)
33 smart00089 PKD Repeats in poly 55.5 41 0.00088 20.5 4.8 25 16-43 54-78 (79)
34 PF13750 Big_3_3: Bacterial Ig 53.2 83 0.0018 22.7 7.7 37 119-157 14-51 (158)
35 cd00146 PKD polycystic kidney 51.7 41 0.0009 20.6 4.4 15 16-30 56-70 (81)
36 cd00146 PKD polycystic kidney 51.4 55 0.0012 20.0 8.5 31 114-144 51-81 (81)
37 KOG4221|consensus 49.5 2.5E+02 0.0054 27.5 10.3 105 16-132 584-697 (1381)
38 PF02494 HYR: HYR domain; Int 43.9 57 0.0012 20.2 4.1 24 121-144 58-81 (81)
39 PF12245 Big_3_2: Bacterial Ig 36.9 90 0.0019 18.3 5.7 30 16-47 22-51 (60)
40 PF15418 DUF4625: Domain of un 34.6 1.1E+02 0.0023 21.5 4.5 26 119-144 106-131 (132)
41 KOG3597|consensus 33.7 1.3E+02 0.0029 25.7 5.7 50 136-186 24-73 (442)
42 PF05895 DUF859: Siphovirus pr 31.6 3.9E+02 0.0085 24.1 13.3 107 16-132 297-424 (624)
43 PF09100 Qn_am_d_aIV: Quinohem 28.7 1.8E+02 0.0039 20.3 4.7 31 18-48 99-130 (133)
44 cd02848 Chitinase_N_term Chiti 28.2 88 0.0019 21.1 3.1 28 9-38 74-101 (106)
45 PF08329 ChitinaseA_N: Chitina 26.1 1.2E+02 0.0026 21.3 3.6 28 118-145 81-108 (133)
46 KOG4221|consensus 26.1 6.5E+02 0.014 24.9 13.0 68 86-154 552-620 (1381)
47 cd01758 PLAT_LPL PLAT/ LH2 dom 24.8 2.5E+02 0.0055 19.7 7.6 25 121-145 2-27 (137)
No 1
>KOG4289|consensus
Probab=100.00 E-value=1.7e-35 Score=258.84 Aligned_cols=186 Identities=27% Similarity=0.419 Sum_probs=172.5
Q ss_pred EecCCCCCCCCCCCCeEEEEEEEEECCCCCeEeeEEEEEEEEecCCCCCccCCCC-cEEEEcCCC-CceEEEEEEeeCCC
Q psy5884 3 VVSKENLDRDPPNPGKFRFQVIAREKQGTAASVPVSLTVNLNDINDNAPRLPMIP-PIQIQAGEA-RRQIVKIEATDNDL 80 (195)
Q Consensus 3 i~~~~~LD~E~~~~~~~~l~v~a~D~~~~~~~~~~~v~I~V~DvNDn~P~f~~~~-~~~v~e~~~-g~~v~~v~a~D~D~ 80 (195)
|.+++.||||. ...+.|.|.|.|.+.|.++++++|+|.|+|.|||.|+|++.. ...+.|+.. |..|.+|.|+|.|.
T Consensus 221 irta~~lDREt--~e~HvlrVtA~d~~~P~~SAtttv~V~V~D~nDhsPvFEq~~Y~e~lREn~evGy~vLtvrAtD~Ds 298 (2531)
T KOG4289|consen 221 IRTAKSLDRET--KETHVLRVTAQDHGDPRRSATTTVTVLVLDTNDHSPVFEQDEYREELRENLEVGYEVLTVRATDGDS 298 (2531)
T ss_pred chhhhhhhhhh--hheeEEEEEeeecCCCcccceeEEEEEEeecCCCCcccchhHHHHHHhhccccCceEEEEEeccCCC
Confidence 56789999999 789999999999999999999999999999999999999974 578899988 99999999999999
Q ss_pred CCCceEEEEEEeeCCCCeecEEEeCCCCEEEEceeCCCC--CEEEEEEEEEECCC--CceeEEEEEEEEcCCCCCCCeEe
Q psy5884 81 GVNAEITYSIYHVSNTGLHKFRIDPRSGVIETTGKLIAG--EQYSITVQATDTGG--KSSQTIVEVSVVPGPNTRSPVFQ 156 (195)
Q Consensus 81 ~~n~~~~y~i~~~~~~~~~~F~id~~tG~i~~~~~ld~~--~~~~l~v~a~D~~~--~~~~~~v~i~V~~d~nd~~P~f~ 156 (195)
++|+.+.|++.. |.++..|.||+.+|.|++..++|+| +.|.|.|.|+|.|. -..+++|.|+|. |+|||+|+|.
T Consensus 299 p~Nani~Yrl~e--g~~~~~f~in~rSGvI~T~a~lDRE~~~~y~L~VeAsDqG~~pgp~Ta~V~itV~-D~NDNaPqFs 375 (2531)
T KOG4289|consen 299 PPNANIRYRLLE--GNAKNVFEINPRSGVISTRAPLDREELESYQLDVEASDQGRPPGPRTAMVEITVE-DENDNAPQFS 375 (2531)
T ss_pred CCCCceEEEecC--CCccceeEEcCccceeeccCccCHHhhhheEEEEEeccCCCCCCCceEEEEEEEE-ecCCCCcccc
Confidence 999999999976 5678899999999999999999988 47999999999986 234899999999 9999999999
Q ss_pred cCeeEEEEecCCCCCeEEEEEEEEeCCCCCC------cCcccc
Q psy5884 157 QNTYDVNVSEGASINSTVATITVSRLASGCA------IMNKEE 193 (195)
Q Consensus 157 ~~~~~~~v~E~~~~g~~v~~v~a~D~D~g~~------i~~~~~ 193 (195)
...|.+.|+|+..++++|++++|+|.|.|.| |++||.
T Consensus 376 e~~Yvvqv~Edvt~~avvlrV~AtDrD~g~Ng~VHYsi~Sgn~ 418 (2531)
T KOG4289|consen 376 EKRYVVQVREDVTPPAVVLRVTATDRDKGTNGKVHYSIASGNG 418 (2531)
T ss_pred ccceEEEecccCCCCceEEEEEecccCCCcCceEEEEeeccCc
Confidence 9999999999999999999999999999965 777775
No 2
>KOG4289|consensus
Probab=100.00 E-value=1.4e-33 Score=247.02 Aligned_cols=186 Identities=27% Similarity=0.407 Sum_probs=170.1
Q ss_pred EecCCCCCCCCCCCCeEEEEEEEEECCCCCeEeeEEEEEEEEecCCCCCccCCC-CcEEEEcCCC-CceEEEEEEeeCCC
Q psy5884 3 VVSKENLDRDPPNPGKFRFQVIAREKQGTAASVPVSLTVNLNDINDNAPRLPMI-PPIQIQAGEA-RRQIVKIEATDNDL 80 (195)
Q Consensus 3 i~~~~~LD~E~~~~~~~~l~v~a~D~~~~~~~~~~~v~I~V~DvNDn~P~f~~~-~~~~v~e~~~-g~~v~~v~a~D~D~ 80 (195)
+.+...||||. +..|++.+.|+|.+.|+...+.+|.|.|.|+|||+|+|..+ |..+|.|+.| ++.+++|+|+|+|.
T Consensus 735 i~t~~~ld~ed--qvtytl~itA~D~~~pq~adtttveV~v~diNDnaPqf~assyt~sV~Ed~Pv~TsvlQVSatDaD~ 812 (2531)
T KOG4289|consen 735 IYTQAELDYED--QVTYTLAITARDNGIPQKADTTTVEVLVNDINDNAPQFLASSYTGSVFEDAPVFTSVLQVSATDADS 812 (2531)
T ss_pred eEEeeeeeccc--ceeeEeeeeecCCCCCCcCccEEEEEEeecccccCcccchhhceeEeecCCCCcceEEEEEEeccCC
Confidence 45678899999 99999999999999999999999999999999999999987 5789999998 99999999999999
Q ss_pred CCCceEEEEEEeeCCCCeecEEEeCCCCEEEEceeCCCC--CEEEEEEEEEECCC--CceeEEEEEEEEcCCCCCCCeEe
Q psy5884 81 GVNAEITYSIYHVSNTGLHKFRIDPRSGVIETTGKLIAG--EQYSITVQATDTGG--KSSQTIVEVSVVPGPNTRSPVFQ 156 (195)
Q Consensus 81 ~~n~~~~y~i~~~~~~~~~~F~id~~tG~i~~~~~ld~~--~~~~l~v~a~D~~~--~~~~~~v~i~V~~d~nd~~P~f~ 156 (195)
|.|+++.|.+..+ ..+.+.|.|++.+|.|++...||+| .+|.|.+.|.|.|. ++..+.|+|+|. |+||++|.|.
T Consensus 813 g~Ng~v~y~~qg~-~d~p~~F~IEptSGviRtl~rLdRE~~avy~L~a~avDrg~p~ls~~~eItvtvl-dvNDnaPvfe 890 (2531)
T KOG4289|consen 813 GPNGRVYYTFQGG-DDGPGDFYIEPTSGVIRTLRRLDRENVAVYVLAAYAVDRGNPPLSAPVEITVTVL-DVNDNAPVFE 890 (2531)
T ss_pred CCCceEEEEecCC-CCCCCceEEccCcceeehhhhhcchheeEEEEEEEEeeCCCCCcCCceEEEEEEE-ecCCCCCCCC
Confidence 9999999988643 3355899999999999999999998 48999999999987 567788888888 9999999999
Q ss_pred cCeeEEEEecCCCCCeEEEEEEEEeCCCCCC------cCccc
Q psy5884 157 QNTYDVNVSEGASINSTVATITVSRLASGCA------IMNKE 192 (195)
Q Consensus 157 ~~~~~~~v~E~~~~g~~v~~v~a~D~D~g~~------i~~~~ 192 (195)
+.+|...|.||+++|..++++.|.|||+|.| |..||
T Consensus 891 ~~e~e~~I~enspvgs~va~i~a~dpdEG~NA~IsYqIvgg~ 932 (2531)
T KOG4289|consen 891 QDELELFIEENSPVGSVVALITADDPDEGPNAHISYQIVGGN 932 (2531)
T ss_pred CcceeeEEeecCccceeeEEEEccCCCcCCcceEEEeeccCc
Confidence 9999999999999999999999999999977 56555
No 3
>KOG1219|consensus
Probab=99.97 E-value=8.7e-30 Score=229.55 Aligned_cols=179 Identities=30% Similarity=0.397 Sum_probs=165.1
Q ss_pred EEecCCCCCCCCCCCCeEEEEEEEEECCCCCeEeeEEEEEEEEecCCCCCccCCC-CcEEEEcCCC-CceEEEEEEeeCC
Q psy5884 2 LVVSKENLDRDPPNPGKFRFQVIAREKQGTAASVPVSLTVNLNDINDNAPRLPMI-PPIQIQAGEA-RRQIVKIEATDND 79 (195)
Q Consensus 2 ~i~~~~~LD~E~~~~~~~~l~v~a~D~~~~~~~~~~~v~I~V~DvNDn~P~f~~~-~~~~v~e~~~-g~~v~~v~a~D~D 79 (195)
.|.+..+||||. ...|.|.|.|+|.+.| +.+++.|.|.|.|+|||||+|.+. |.++++|+.+ |+.+.+|.|+|.|
T Consensus 2130 ~i~v~~pLd~ea--~t~h~l~ieAtd~~~p-~~Aea~VeIiV~dIndn~PvFeqlsYt~sisE~s~igt~viqilATdsD 2206 (4289)
T KOG1219|consen 2130 VILVLIPLDREA--STLHELLIEATDAGIP-LSAEAKVEIIVGDINDNPPVFEQLSYTISISENSKIGTKVIQILATDSD 2206 (4289)
T ss_pred eEEecccccccc--ccceEEEEEEeccCCC-cceeeEEEEEecccCCCCchhheeeEEEEccCCCccCceEEEEEeccCC
Confidence 367788999999 8899999999999876 999999999999999999999986 6689999998 9999999999999
Q ss_pred CCCCceEEEEEEeeCCCCeecEEEeCCCCEEEEceeCCCC--CEEEEEEEEEECCCC-ceeEEEEEEEEcCCCCCCCeEe
Q psy5884 80 LGVNAEITYSIYHVSNTGLHKFRIDPRSGVIETTGKLIAG--EQYSITVQATDTGGK-SSQTIVEVSVVPGPNTRSPVFQ 156 (195)
Q Consensus 80 ~~~n~~~~y~i~~~~~~~~~~F~id~~tG~i~~~~~ld~~--~~~~l~v~a~D~~~~-~~~~~v~i~V~~d~nd~~P~f~ 156 (195)
. |+.+.|+|.. .......|+|+..||.|++.+.|||| ++|.|.|+|+|+|.+ ++.+.+.|+|. |.||++|.|.
T Consensus 2207 s--n~~isYsl~g-~s~~sk~f~In~sTG~it~~g~ldyE~~q~f~~fvratdggk~lSseviv~V~Ve-D~Ndn~Pef~ 2282 (4289)
T KOG1219|consen 2207 S--NREISYSLEG-NSEISKPFRINVSTGWITVAGKLDYEENQEFRFFVRATDGGKPLSSEVIVEVHVE-DFNDNPPEFN 2282 (4289)
T ss_pred C--CCceEEEeec-CCccccceEEecccceEEEeeecChhhcceEEEEEEEccCCCcccccEEEEEEeh-hcCCCCchhc
Confidence 9 8899999985 33457899999999999999999998 479999999998875 78899999999 9999999999
Q ss_pred cCeeEEEEecCCCCCeEEEEEEEEeCCCCCC
Q psy5884 157 QNTYDVNVSEGASINSTVATITVSRLASGCA 187 (195)
Q Consensus 157 ~~~~~~~v~E~~~~g~~v~~v~a~D~D~g~~ 187 (195)
+..|.+.|+|.+..|..|..+.|.|+|+...
T Consensus 2283 q~~~ea~vsd~a~~g~fit~v~a~D~Dssd~ 2313 (4289)
T KOG1219|consen 2283 QRNYEAFVSDPARSGHFITVVNAHDLDSSDH 2313 (4289)
T ss_pred cccceeecCCCccceeEEEEEEeccCCccch
Confidence 9999999999999999999999999999854
No 4
>KOG1219|consensus
Probab=99.97 E-value=7.1e-29 Score=223.78 Aligned_cols=180 Identities=24% Similarity=0.341 Sum_probs=167.1
Q ss_pred EecCCCCCCCCCCCCeEEEEEEEEECCCCCeEeeEEEEEEEEecCCC--CCccCCCC-cEEEEcCCC-CceEEEEEEeeC
Q psy5884 3 VVSKENLDRDPPNPGKFRFQVIAREKQGTAASVPVSLTVNLNDINDN--APRLPMIP-PIQIQAGEA-RRQIVKIEATDN 78 (195)
Q Consensus 3 i~~~~~LD~E~~~~~~~~l~v~a~D~~~~~~~~~~~v~I~V~DvNDn--~P~f~~~~-~~~v~e~~~-g~~v~~v~a~D~ 78 (195)
+++.++||+|+ ...|.|.|.|.|+|.|.+++.+.+.|.++|+|.| ||.|.... ...|.|+++ |+.+++|.|.|.
T Consensus 903 vfi~~~LDf~k--~~fynLsv~a~d~g~p~lss~chl~Vevldv~enlhpp~F~~~v~e~~V~EnapiGT~vi~i~A~de 980 (4289)
T KOG1219|consen 903 VFIGKPLDFEK--SDFYNLSVTAVDRGTPILSSICHLEVEVLDVNENLHPPEFISFVTEGHVLENAPIGTIVIRIQARDE 980 (4289)
T ss_pred EEEeccccccc--ccceEEEEEEecCCCcceeeeEEEEEEEeccCCCCCCcchheeeeeeeEeecCCcceEEEEEEEecC
Confidence 45678999999 8999999999999998899999999999999988 99997763 689999999 999999999999
Q ss_pred CCCCCceEEEEEEeeCCCCeecEEEeCCCCEEEEceeCCCCC--EEEEEEEEEECCC--CceeEEEEEEEEcCCCCCCCe
Q psy5884 79 DLGVNAEITYSIYHVSNTGLHKFRIDPRSGVIETTGKLIAGE--QYSITVQATDTGG--KSSQTIVEVSVVPGPNTRSPV 154 (195)
Q Consensus 79 D~~~n~~~~y~i~~~~~~~~~~F~id~~tG~i~~~~~ld~~~--~~~l~v~a~D~~~--~~~~~~v~i~V~~d~nd~~P~ 154 (195)
|.|..+.+.|+|.. |++.+.|.||.++|.|++.+.||+|. .|-|++.|+|.|. +++.+.+.|.|. |+|||+|+
T Consensus 981 dsgldg~l~Y~I~~--gdg~g~FsId~~tG~irTl~~lDrE~ks~YwltveA~D~gt~~~ssv~~vyI~ie-DvNDn~Pq 1057 (4289)
T KOG1219|consen 981 DSGLDGELSYKIRT--GDGDGIFSIDSTTGSIRTLKALDREKKSSYWLTVEAKDLGTVPLSSVCEVYIEIE-DVNDNVPQ 1057 (4289)
T ss_pred CCCccceEEEEEEc--CCcceeEEecCCcceEeechhhchhhcceEEEEEEEEecCCCccccceeEEEEEE-ecCCCCcc
Confidence 99999999999986 67889999999999999999999984 7999999999997 677889999999 99999999
Q ss_pred EecCeeEEEEecCCCCCeEEEEEEEEeCCCCCC
Q psy5884 155 FQQNTYDVNVSEGASINSTVATITVSRLASGCA 187 (195)
Q Consensus 155 f~~~~~~~~v~E~~~~g~~v~~v~a~D~D~g~~ 187 (195)
|.+..|..+|.|+++.+..|.++.|.|+|+..+
T Consensus 1058 ~s~pvy~asI~enSp~~vsivq~ea~D~Dsssn 1090 (4289)
T KOG1219|consen 1058 FSSPVYYASISENSPETVSIVQAEANDPDSSSN 1090 (4289)
T ss_pred cCCceEeeeeccCCCCceEEEEeccCCCCcccC
Confidence 999999999999999999999999999996644
No 5
>cd00031 CA Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here
Probab=99.93 E-value=6.4e-24 Score=159.46 Aligned_cols=139 Identities=32% Similarity=0.467 Sum_probs=123.3
Q ss_pred EecCCCCCCCCCCCCeEEEEEEEEECCCCCeEeeEEEEEEEEecCCCCCccCC-CCcEEEEcCCC-CceEEEEEEeeCCC
Q psy5884 3 VVSKENLDRDPPNPGKFRFQVIAREKQGTAASVPVSLTVNLNDINDNAPRLPM-IPPIQIQAGEA-RRQIVKIEATDNDL 80 (195)
Q Consensus 3 i~~~~~LD~E~~~~~~~~l~v~a~D~~~~~~~~~~~v~I~V~DvNDn~P~f~~-~~~~~v~e~~~-g~~v~~v~a~D~D~ 80 (195)
|.+.+.||||. ...|.|.|.|+|.|.+.++++..+.|.|.|+||++|.|.. .+.+.+.|+.+ |+.++++.|+|+|.
T Consensus 54 l~~~~~lD~e~--~~~~~l~v~a~D~g~~~~~~~~~v~I~V~d~Nd~~P~~~~~~~~~~v~e~~~~~~~i~~~~a~D~D~ 131 (199)
T cd00031 54 ITTTKPLDREE--QSEYTLTVVASDGGGPPLSSTATVTVTVLDVNDNPPVFEQSSYEASVPENAPPGTVVGTVTATDADS 131 (199)
T ss_pred EEECCCCCCcC--CceEEEEEEEEECCcCcceeEEEEEEEEccCCCCCCcccccceEEEEeCCCCCCCEEEEEEEEcCCC
Confidence 56778999999 8999999999998887777999999999999999999984 46799999987 99999999999999
Q ss_pred CCCceEEEEEEeeCCCCeecEEEeCCCCEEEEceeCCCC--CEEEEEEEEEECC--CCceeEEEEEEEE
Q psy5884 81 GVNAEITYSIYHVSNTGLHKFRIDPRSGVIETTGKLIAG--EQYSITVQATDTG--GKSSQTIVEVSVV 145 (195)
Q Consensus 81 ~~n~~~~y~i~~~~~~~~~~F~id~~tG~i~~~~~ld~~--~~~~l~v~a~D~~--~~~~~~~v~i~V~ 145 (195)
+.++.++|+|... .....|.|++.+|.|.+.+.+|++ ..|.|.|.|+|.+ .++.++++.|.|.
T Consensus 132 ~~~~~~~y~l~~~--~~~~~f~i~~~~G~i~~~~~ld~e~~~~~~l~v~a~D~~~~~~~~~~~i~i~v~ 198 (199)
T cd00031 132 GENAKLTYSILSG--NDKELFSIDPNTGIITLAKPLDREEKSSYELTVVATDGGGPPLSSTATVTVTVL 198 (199)
T ss_pred CCCccEEEEEeCC--CCCCEEEEeCCceEEEeCCccCCccCceEEEEEEEEECCCCCceeEEEEEEEEE
Confidence 9899999999762 224799999999999999999987 3799999999998 5777788888776
No 6
>cd00031 CA Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here
Probab=99.90 E-value=6.4e-22 Score=148.59 Aligned_cols=126 Identities=36% Similarity=0.533 Sum_probs=114.2
Q ss_pred CcEEEEcCCC-CceEEEEEEeeCCCCCCceEEEEEEeeCCCCeecEEEeCCCCEEEEceeCCCC--CEEEEEEEEEECCC
Q psy5884 57 PPIQIQAGEA-RRQIVKIEATDNDLGVNAEITYSIYHVSNTGLHKFRIDPRSGVIETTGKLIAG--EQYSITVQATDTGG 133 (195)
Q Consensus 57 ~~~~v~e~~~-g~~v~~v~a~D~D~~~n~~~~y~i~~~~~~~~~~F~id~~tG~i~~~~~ld~~--~~~~l~v~a~D~~~ 133 (195)
+.+.+.|+.+ |+.++++.|+|+|.+.++.+.|+|.. +....+|.|++.+|.|++.+.||+| ..|.|.|.|+|.|.
T Consensus 2 ~~~~i~En~~~g~~v~~~~a~D~D~~~~~~~~y~i~~--~~~~~~F~i~~~tG~l~~~~~lD~e~~~~~~l~v~a~D~g~ 79 (199)
T cd00031 2 YSVSVPENAPPGTVVGTVSATDPDSGENGRVTYSILG--GNEDGLFSIDPNTGVITTTKPLDREEQSEYTLTVVASDGGG 79 (199)
T ss_pred eEEEEeCCCCCCCEEEEEEEECCCCCCCceEEEEEeC--CCCcccEEEeCCCCEEEECCCCCCcCCceEEEEEEEEECCc
Confidence 4578999998 99999999999999888999999986 3333799999999999999999987 48999999999876
Q ss_pred Cc--eeEEEEEEEEcCCCCCCCeEecCeeEEEEecCCCCCeEEEEEEEEeCCCC
Q psy5884 134 KS--SQTIVEVSVVPGPNTRSPVFQQNTYDVNVSEGASINSTVATITVSRLASG 185 (195)
Q Consensus 134 ~~--~~~~v~i~V~~d~nd~~P~f~~~~~~~~v~E~~~~g~~v~~v~a~D~D~g 185 (195)
+. ....+.|.|. |+|+++|.|....|.+.+.|+.++|+.++++.|.|+|.+
T Consensus 80 ~~~~~~~~v~I~V~-d~Nd~~P~~~~~~~~~~v~e~~~~~~~i~~~~a~D~D~~ 132 (199)
T cd00031 80 PPLSSTATVTVTVL-DVNDNPPVFEQSSYEASVPENAPPGTVVGTVTATDADSG 132 (199)
T ss_pred CcceeEEEEEEEEc-cCCCCCCcccccceEEEEeCCCCCCCEEEEEEEEcCCCC
Confidence 55 7889999999 999999999989999999999999999999999999986
No 7
>PF00028 Cadherin: Cadherin domain; InterPro: IPR002126 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion.; GO: 0005509 calcium ion binding, 0007156 homophilic cell adhesion, 0016020 membrane; PDB: 2A4E_A 2A4C_B 2O72_A 2QVI_A 1NCJ_A 3Q2W_A 3Q2N_A 3LNH_B 3LNI_A 3Q2L_A ....
Probab=99.73 E-value=1.7e-16 Score=105.16 Aligned_cols=87 Identities=33% Similarity=0.455 Sum_probs=75.2
Q ss_pred CcEEEEcCCC-CceEEEEEEeeCCCCCCceEEEEEEeeCCCCeecEEEeCCCCEEEEceeCCCC--CEEEEEEEEEEC-C
Q psy5884 57 PPIQIQAGEA-RRQIVKIEATDNDLGVNAEITYSIYHVSNTGLHKFRIDPRSGVIETTGKLIAG--EQYSITVQATDT-G 132 (195)
Q Consensus 57 ~~~~v~e~~~-g~~v~~v~a~D~D~~~n~~~~y~i~~~~~~~~~~F~id~~tG~i~~~~~ld~~--~~~~l~v~a~D~-~ 132 (195)
|.+.++|+.+ |+.++++.|.|+|.+.|+.+.|+|.. +....+|.|++.+|.|++.++||+| ..|.|.|.|+|. +
T Consensus 1 Y~~~v~E~~~~g~~v~~v~a~D~D~~~n~~i~y~i~~--~~~~~~F~I~~~tg~i~~~~~LD~E~~~~y~l~v~a~D~~~ 78 (93)
T PF00028_consen 1 YSFSVPENAPPGTVVGQVTATDPDSGPNSQITYSILG--GNPDGLFSIDPNTGEISLKKPLDRETQSSYQLTVRATDSGG 78 (93)
T ss_dssp EEEEEETTGSTSSEEEEEEEEESSTSTTSSEEEEEEE--TTSTTSEEEETTTTEEEESSSSCTTTTSEEEEEEEEEETTT
T ss_pred CEEEEECCCCCCCEEEEEEEEeCCCCCCceEEEEEec--CcccCceEEeeeeeccccceecCcccCCEEEEEEEEEECCC
Confidence 3578999987 99999999999999999999999987 4446899999999999999999987 479999999999 5
Q ss_pred C--CceeEEEEEEEE
Q psy5884 133 G--KSSQTIVEVSVV 145 (195)
Q Consensus 133 ~--~~~~~~v~i~V~ 145 (195)
. ++++++|.|+|+
T Consensus 79 ~~~~~~~~~V~I~V~ 93 (93)
T PF00028_consen 79 SPPLSSTATVTINVL 93 (93)
T ss_dssp SSEEEEEEEEEEEEE
T ss_pred CCCCEEEEEEEEEEC
Confidence 5 456667777664
No 8
>KOG1834|consensus
Probab=99.70 E-value=4.4e-16 Score=130.01 Aligned_cols=137 Identities=28% Similarity=0.401 Sum_probs=114.5
Q ss_pred EEecCCCCCCCCCCCCeEEEEEEEEECCC-C-----CeEeeEEEEEEEEecCCCCCccCCC-CcEEEEcCCCCceEEEEE
Q psy5884 2 LVVSKENLDRDPPNPGKFRFQVIAREKQG-T-----AASVPVSLTVNLNDINDNAPRLPMI-PPIQIQAGEARRQIVKIE 74 (195)
Q Consensus 2 ~i~~~~~LD~E~~~~~~~~l~v~a~D~~~-~-----~~~~~~~v~I~V~DvNDn~P~f~~~-~~~~v~e~~~g~~v~~v~ 74 (195)
+|.+..+||-|. ++.|+|+|+|.|.|. | ..+-.++|.|+|.|+|+.+|.|..+ |.+.|.|......|+++.
T Consensus 94 vlRaK~~lDCel--qkeytf~iQAydCg~gpdgtn~kKShkatvhIrVkDvNe~AP~f~ep~Yka~V~EGK~yd~il~ve 171 (952)
T KOG1834|consen 94 VLRAKEPLDCEL--QKEYTFTIQAYDCGNGPDGTNTKKSHKATVHIRVKDVNEFAPVFKEPWYKAHVTEGKVYDSILRVE 171 (952)
T ss_pred EEeecCcccccc--cccceEEEEEEecCCCCCccccccccceEEEEEeccccccCchhcccceeeEEecceeeeeeEEEE
Confidence 467889999999 899999999999763 3 5677889999999999999999887 668888887788999999
Q ss_pred EeeCCCC-CCceE-EEEEEeeCCCCeecEEEeCCCCEEEEceeCCC--CCEEEEEEEEEECCCCce--eEEEEEEEE
Q psy5884 75 ATDNDLG-VNAEI-TYSIYHVSNTGLHKFRIDPRSGVIETTGKLIA--GEQYSITVQATDTGGKSS--QTIVEVSVV 145 (195)
Q Consensus 75 a~D~D~~-~n~~~-~y~i~~~~~~~~~~F~id~~tG~i~~~~~ld~--~~~~~l~v~a~D~~~~~~--~~~v~i~V~ 145 (195)
|.|.|-+ ++++| .|.|.. ..-+|.||. .|.|+.+.+|.| +.+|.|+|.|.|+|.+.+ .+.|+|+|.
T Consensus 172 AiD~DCspq~sqIC~YEI~t----~d~PFaIdn-~G~irnTekLny~ke~~Y~ltVtAyDCg~kraa~d~lV~v~Vk 243 (952)
T KOG1834|consen 172 AIDKDCSPQYSQICEYEITT----PDVPFAIDN-DGNIRNTEKLNYTKEHQYKLTVTAYDCGKKRAASDSLVTVHVK 243 (952)
T ss_pred eecCCCCCcccceeEEEecC----CCCceEEcC-CCccccccccccccceeEEEEEEEEecccccccCcceEEEEec
Confidence 9999965 57776 578864 456899977 799999999986 468999999999997433 367777776
No 9
>smart00112 CA Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
Probab=99.63 E-value=5.7e-15 Score=94.84 Aligned_cols=75 Identities=44% Similarity=0.609 Sum_probs=66.2
Q ss_pred eeCCCCCCceEEEEEEeeCCCCeecEEEeCCCCEEEEceeCCCC--CEEEEEEEEEECCC--CceeEEEEEEEEcCCCCC
Q psy5884 76 TDNDLGVNAEITYSIYHVSNTGLHKFRIDPRSGVIETTGKLIAG--EQYSITVQATDTGG--KSSQTIVEVSVVPGPNTR 151 (195)
Q Consensus 76 ~D~D~~~n~~~~y~i~~~~~~~~~~F~id~~tG~i~~~~~ld~~--~~~~l~v~a~D~~~--~~~~~~v~i~V~~d~nd~ 151 (195)
+|+|.|.|+.+.|+|.. +....+|.|++.+|.|++.++||+| ..|.|.|.|.|.+. +++.++|.|+|. |+|++
T Consensus 1 ~D~D~g~n~~i~Y~i~~--~~~~~~F~i~~~tg~i~~~~~LD~e~~~~y~l~v~a~D~~~~~~~~~~~v~I~V~-D~Nd~ 77 (79)
T smart00112 1 TDADSGENGKVTYSILS--GNEDGLFSIDPETGEITTTKPLDREEQPEYTLTVEATDGGGPPLSSTATVTVTVL-DVNDN 77 (79)
T ss_pred CCCCCCcCcEEEEEEec--CCCCCEEEEeCCccEEEeCCccCeeCCCeEEEEEEEEECCCCCcccEEEEEEEEE-ECCCC
Confidence 48899999999999986 3333799999999999999999987 47999999999987 688899999999 99999
Q ss_pred CC
Q psy5884 152 SP 153 (195)
Q Consensus 152 ~P 153 (195)
+|
T Consensus 78 ~P 79 (79)
T smart00112 78 AP 79 (79)
T ss_pred CC
Confidence 98
No 10
>smart00112 CA Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
Probab=99.24 E-value=3.5e-11 Score=77.10 Aligned_cols=47 Identities=32% Similarity=0.541 Sum_probs=44.0
Q ss_pred EecCCCCCCCCCCCCeEEEEEEEEECCCCCeEeeEEEEEEEEecCCCCC
Q psy5884 3 VVSKENLDRDPPNPGKFRFQVIAREKQGTAASVPVSLTVNLNDINDNAP 51 (195)
Q Consensus 3 i~~~~~LD~E~~~~~~~~l~v~a~D~~~~~~~~~~~v~I~V~DvNDn~P 51 (195)
|++.++||||. ...|.|.|.|+|.++++++++++|.|.|.|+|||+|
T Consensus 33 i~~~~~LD~e~--~~~y~l~v~a~D~~~~~~~~~~~v~I~V~D~Nd~~P 79 (79)
T smart00112 33 ITTTKPLDREE--QPEYTLTVEATDGGGPPLSSTATVTVTVLDVNDNAP 79 (79)
T ss_pred EEeCCccCeeC--CCeEEEEEEEEECCCCCcccEEEEEEEEEECCCCCC
Confidence 56677999999 889999999999999899999999999999999998
No 11
>KOG1834|consensus
Probab=99.23 E-value=2.1e-10 Score=96.58 Aligned_cols=142 Identities=20% Similarity=0.280 Sum_probs=106.9
Q ss_pred EEEecCCCCCccCCCCcEEEEcCCC-CceEEEEEEeeCCCC--CCceEE-EEEEeeCCCCeecEEEeCCC--CEEEEcee
Q psy5884 42 NLNDINDNAPRLPMIPPIQIQAGEA-RRQIVKIEATDNDLG--VNAEIT-YSIYHVSNTGLHKFRIDPRS--GVIETTGK 115 (195)
Q Consensus 42 ~V~DvNDn~P~f~~~~~~~v~e~~~-g~~v~~v~a~D~D~~--~n~~~~-y~i~~~~~~~~~~F~id~~t--G~i~~~~~ 115 (195)
..--+|-+.|.....|...|.|+.. -...--+.|.|.|.+ -.+.+. |.|- +.+-+...--+|..| |.|+.+.+
T Consensus 22 ~aarankhkpwie~ey~gvV~Endntvll~Ppl~aLdkdaplr~ageiC~fklh-gq~vPFdavVvdK~TGegvlRaK~~ 100 (952)
T KOG1834|consen 22 HAARANKHKPWIEEEYHGVVTENDNTVLLDPPLAALDKDAPLRYAGEICGFKLH-GQPVPFDAVVVDKYTGEGVLRAKEP 100 (952)
T ss_pred ccccccccCcccccceeEEEEeCCceEEeCCCeeeecCCCCcccccccceeEec-CCCCCceEEEEeccCCceEEeecCc
Confidence 4455788899988888888888753 222234778888864 233454 4443 222223333447767 45888899
Q ss_pred CCCC--CEEEEEEEEEECCC--------CceeEEEEEEEEcCCCCCCCeEecCeeEEEEecCCCCCeEEEEEEEEeCCCC
Q psy5884 116 LIAG--EQYSITVQATDTGG--------KSSQTIVEVSVVPGPNTRSPVFQQNTYDVNVSEGASINSTVATITVSRLASG 185 (195)
Q Consensus 116 ld~~--~~~~l~v~a~D~~~--------~~~~~~v~i~V~~d~nd~~P~f~~~~~~~~v~E~~~~g~~v~~v~a~D~D~g 185 (195)
+|=+ +.|+|+|+|.|+|. ++..++|.|+|. |+|..+|.|....|.+.|.|+. .-..|+++.|.|.|.+
T Consensus 101 lDCelqkeytf~iQAydCg~gpdgtn~kKShkatvhIrVk-DvNe~AP~f~ep~Yka~V~EGK-~yd~il~veAiD~DCs 178 (952)
T KOG1834|consen 101 LDCELQKEYTFTIQAYDCGNGPDGTNTKKSHKATVHIRVK-DVNEFAPVFKEPWYKAHVTEGK-VYDSILRVEAIDKDCS 178 (952)
T ss_pred ccccccccceEEEEEEecCCCCCccccccccceEEEEEec-cccccCchhcccceeeEEecce-eeeeeEEEEeecCCCC
Confidence 9954 67999999999874 577899999999 9999999999999999999984 6778999999999998
Q ss_pred C
Q psy5884 186 C 186 (195)
Q Consensus 186 ~ 186 (195)
.
T Consensus 179 p 179 (952)
T KOG1834|consen 179 P 179 (952)
T ss_pred C
Confidence 3
No 12
>PF00028 Cadherin: Cadherin domain; InterPro: IPR002126 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion.; GO: 0005509 calcium ion binding, 0007156 homophilic cell adhesion, 0016020 membrane; PDB: 2A4E_A 2A4C_B 2O72_A 2QVI_A 1NCJ_A 3Q2W_A 3Q2N_A 3LNH_B 3LNI_A 3Q2L_A ....
Probab=98.31 E-value=3.1e-06 Score=55.61 Aligned_cols=40 Identities=23% Similarity=0.385 Sum_probs=37.1
Q ss_pred EecCCCCCCCCCCCCeEEEEEEEEEC-CCCCeEeeEEEEEEEE
Q psy5884 3 VVSKENLDRDPPNPGKFRFQVIAREK-QGTAASVPVSLTVNLN 44 (195)
Q Consensus 3 i~~~~~LD~E~~~~~~~~l~v~a~D~-~~~~~~~~~~v~I~V~ 44 (195)
|.+.++||||+ ...|.|.|.|+|. +.|+++++++|+|.|+
T Consensus 53 i~~~~~LD~E~--~~~y~l~v~a~D~~~~~~~~~~~~V~I~V~ 93 (93)
T PF00028_consen 53 ISLKKPLDRET--QSSYQLTVRATDSGGSPPLSSTATVTINVL 93 (93)
T ss_dssp EEESSSSCTTT--TSEEEEEEEEEETTTSSEEEEEEEEEEEEE
T ss_pred cccceecCccc--CCEEEEEEEEEECCCCCCCEEEEEEEEEEC
Confidence 67889999999 8999999999999 7889999999999885
No 13
>PF08758 Cadherin_pro: Cadherin prodomain like; InterPro: IPR014868 Cadherins are a group of proteins that mediate calcium dependent cell-cell adhesion. They are activated through cleavage of a prosequence in the late Golgi. This protein corresponds to the folded region of the prosequence, and is termed the prodomain. The prodomain shows structural resemblance to the cadherin domain, but lacks all the features known to be important for cadherin-cadherin interactions []. ; GO: 0007155 cell adhesion, 0016021 integral to membrane; PDB: 1OP4_A.
Probab=97.22 E-value=0.0064 Score=39.70 Aligned_cols=77 Identities=17% Similarity=0.211 Sum_probs=40.5
Q ss_pred CCccCCC-CcEEEEcCCC-CceEEEEEEeeCCCCCCceEEEEEEeeCCCCeecEEEeCCCCEEEEceeCC-CCCEEEEEE
Q psy5884 50 APRLPMI-PPIQIQAGEA-RRQIVKIEATDNDLGVNAEITYSIYHVSNTGLHKFRIDPRSGVIETTGKLI-AGEQYSITV 126 (195)
Q Consensus 50 ~P~f~~~-~~~~v~e~~~-g~~v~~v~a~D~D~~~n~~~~y~i~~~~~~~~~~F~id~~tG~i~~~~~ld-~~~~~~l~v 126 (195)
.|-|... +.+.++.+.. |..+++|.-.|-.. +..+.|.- ....|+|.+ .|.|.+++++. ..++-.|.|
T Consensus 3 ~pGF~~~~~~~~Vp~~l~~g~~lg~V~f~dC~~--~~~~~~~s------sDpdF~V~~-DGsVy~~r~v~l~~~~~~F~V 73 (90)
T PF08758_consen 3 RPGFSQKKYTFEVPSNLEAGQPLGKVNFEDCTG--RRRVIFES------SDPDFRVLE-DGSVYAKRPVQLSSEQRSFTV 73 (90)
T ss_dssp --B--S-EEEE----SS-SS--EEE---B--SS-----EEEE---------SEEEEET-TTEEEEES--S-SSS-EEEEE
T ss_pred cCCcccceEEEEcCchhhCCcEEEEEEeccCCC--CCceEEec------CCCCEEEcC-CCeEEEeeeEecCCCceEEEE
Confidence 4667665 4677887765 99999999988854 45588865 234899977 79999999987 445678999
Q ss_pred EEEECCCCc
Q psy5884 127 QATDTGGKS 135 (195)
Q Consensus 127 ~a~D~~~~~ 135 (195)
.|.|..+..
T Consensus 74 ~a~D~~~~~ 82 (90)
T PF08758_consen 74 HAWDSQTQE 82 (90)
T ss_dssp EEEETTTTE
T ss_pred EEECCCCCe
Confidence 999987654
No 14
>smart00736 CADG Dystroglycan-type cadherin-like domains. Cadherin-homologous domains present in metazoan dystroglycans and alpha/epsilon sarcoglycans, yeast Axl2p and in a very large protein from magnetotactic bacteria. Likely to bind calcium ions.
Probab=97.10 E-value=0.023 Score=37.56 Aligned_cols=72 Identities=26% Similarity=0.300 Sum_probs=51.0
Q ss_pred EeeCCCCCCceEEEEEEeeCC-CCeecEEEeCCCCEEEEceeCCCCCEEEEEEEEEECCCCceeEEEEEEEEcCCCC
Q psy5884 75 ATDNDLGVNAEITYSIYHVSN-TGLHKFRIDPRSGVIETTGKLIAGEQYSITVQATDTGGKSSQTIVEVSVVPGPNT 150 (195)
Q Consensus 75 a~D~D~~~n~~~~y~i~~~~~-~~~~~F~id~~tG~i~~~~~ld~~~~~~l~v~a~D~~~~~~~~~v~i~V~~d~nd 150 (195)
..|.| ...++|++....+ ....+.+.|+.++.+.=.........|.+.|.|+|..+.+....+.|.|. +.|+
T Consensus 24 F~d~d---~~~lty~~~~~~~~~lP~Wl~fd~~~~~~~GtP~~~~~g~~~i~v~a~D~~g~~~~~~f~i~V~-~~~~ 96 (97)
T smart00736 24 FTDAD---GDTLTYSATLSDGSALPSWLSFDSDTGTLSGTPTNSDVGSLSLKVTATDSSGASASDTFTITVV-NTND 96 (97)
T ss_pred eECCC---CCeEEEEEEeCCCCCCCCeEEEeCCCCEEEEECCCCCCcEEEEEEEEEECCCCEEEEEEEEEEe-CCCC
Confidence 46666 3468998864222 23457788888888766433333567999999999888888889999998 6664
No 15
>PF08266 Cadherin_2: Cadherin-like; InterPro: IPR013164 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion. This entry represents a cadherin domain that is usually found at the N terminus of cadherin proteins.; PDB: 1WUZ_A 1WYJ_A.
Probab=97.03 E-value=0.00081 Score=43.34 Aligned_cols=59 Identities=17% Similarity=0.240 Sum_probs=36.0
Q ss_pred EEEEcCCC-CceEEEEEEeeCCCCCC--ceEEEEEEeeCCCCeecEEEeCCCCEEEEceeCCCCC
Q psy5884 59 IQIQAGEA-RRQIVKIEATDNDLGVN--AEITYSIYHVSNTGLHKFRIDPRSGVIETTGKLIAGE 120 (195)
Q Consensus 59 ~~v~e~~~-g~~v~~v~a~D~D~~~n--~~~~y~i~~~~~~~~~~F~id~~tG~i~~~~~ld~~~ 120 (195)
++|+|..+ |+.|+.| |.|.-.... ..-.|.+.+ .....+|.++..+|.|.+...+|||.
T Consensus 5 YsV~EE~~~Gt~IGni-a~dL~l~~~~l~~~~~ri~s--~~~~~~~~v~~~tG~L~v~~rIDRE~ 66 (84)
T PF08266_consen 5 YSVPEEMPPGTVIGNI-AKDLGLDPQSLSSRNFRIVS--EGNSQYFRVNEKTGDLFVSERIDREE 66 (84)
T ss_dssp EEEESS--TT-EEEEC-CCCCT--HHHHCCTTBEEE---SSSS-SEEE-TTTSEEEESS--SCCC
T ss_pred EEeecCCCCCCEEEEh-HHhhCCCcccccccceEEee--cCCcceeEecCCceeEEeCCccCHHH
Confidence 67888876 9999999 445433211 113567665 34568999999999999999999985
No 16
>PF13750 Big_3_3: Bacterial Ig-like domain (group 3)
Probab=97.03 E-value=0.07 Score=38.66 Aligned_cols=132 Identities=21% Similarity=0.295 Sum_probs=71.4
Q ss_pred CCCCCCCCeEEEEE-EEEECCCCCeEeeEEEEEEEEecCCCCCccCCCCcEEEEcCCC--CceEEEEEEeeCCCCCCceE
Q psy5884 10 DRDPPNPGKFRFQV-IAREKQGTAASVPVSLTVNLNDINDNAPRLPMIPPIQIQAGEA--RRQIVKIEATDNDLGVNAEI 86 (195)
Q Consensus 10 D~E~~~~~~~~l~v-~a~D~~~~~~~~~~~v~I~V~DvNDn~P~f~~~~~~~v~e~~~--g~~v~~v~a~D~D~~~n~~~ 86 (195)
|....+...|.+.+ .|.|..+...+......+.| ...+|.+.-.....+..+.. +..-..+.++|.-.+. .--
T Consensus 8 d~~~l~dG~Y~l~~~~a~D~agN~~~~~~~~~~~i---D~T~Ptisi~~~~~~~~g~~v~~~~~i~i~~tD~~~~~-~i~ 83 (158)
T PF13750_consen 8 DLSTLPDGSYTLTVVTATDAAGNTSTSTVSETFTI---DNTPPTISISDGASVANGSTVYGLVNISINVTDNSDDS-KIT 83 (158)
T ss_pred EeCcCCCccEEEEEEEEEecCCCEEEEEEeeEEEE---cCCCCEEEEecCCccCCCccccceeeeEEEEEeCCCCc-eEE
Confidence 44444567899999 79999876443333333433 34477754311111111211 3344567787776543 223
Q ss_pred EEEEEeeCCCCeecEEEeC-CCCEEEEc-----eeCCCCCEEEEEEEEEECCCCceeEEEEEEEE
Q psy5884 87 TYSIYHVSNTGLHKFRIDP-RSGVIETT-----GKLIAGEQYSITVQATDTGGKSSQTIVEVSVV 145 (195)
Q Consensus 87 ~y~i~~~~~~~~~~F~id~-~tG~i~~~-----~~ld~~~~~~l~v~a~D~~~~~~~~~v~i~V~ 145 (195)
...+..+.....-...... ..|...+. ..+...+.|.|+|.|.|..+...+.++.+...
T Consensus 84 sv~l~Gg~~~d~v~ls~~~~~~~~~~~~yp~~fpsle~~~~YtLtV~a~D~aGN~~~~si~F~y~ 148 (158)
T PF13750_consen 84 SVSLTGGPASDSVSLSWTNKGNGVYTLEYPRIFPSLEADDSYTLTVSATDKAGNQSTKSISFSYM 148 (158)
T ss_pred EEEEECCcccceEEEeeEeccCceEEeecccccCCcCCCCeEEEEEEEEecCCCEEEEEEEEEEe
Confidence 4555432111111122222 23333222 23334568999999999998888888888776
No 17
>TIGR01965 VCBS_repeat VCBS repeat. This domain of about 100 residues is found multiple (up to 35) copies in long proteins from several species of Vibrio, Colwellia, Bradyrhizobium, and Shewanella (hence the name VCBS) and in smaller copy numbers in proteins from several other bacteria. The large protein size and repeat copy numbers, species distribution, and suggested activities of several member proteins suggests a role for this domain in adhesion.
Probab=94.17 E-value=1 Score=29.92 Aligned_cols=86 Identities=23% Similarity=0.298 Sum_probs=51.8
Q ss_pred EEEEEeeCCCCCCceEEEEEEeeCCCCeecEEEeCCCCEEEEc--------eeCCCCC--EEEEEEEEEECCCCceeEEE
Q psy5884 71 VKIEATDNDLGVNAEITYSIYHVSNTGLHKFRIDPRSGVIETT--------GKLIAGE--QYSITVQATDTGGKSSQTIV 140 (195)
Q Consensus 71 ~~v~a~D~D~~~n~~~~y~i~~~~~~~~~~F~id~~tG~i~~~--------~~ld~~~--~~~l~v~a~D~~~~~~~~~v 140 (195)
++|.++|+|.++. .+++.....+ ..+.|.|++ .|.-... +.|...+ .-.|++.+.|. .+.+|
T Consensus 2 G~Lt~sD~D~gd~--~~~s~~~~~g-~yGtlti~~-~G~wtYtl~n~~~avq~L~~Ge~~tdsFtvtv~DG----tt~~v 73 (99)
T TIGR01965 2 GQLTISDADAGQA--HFIAQTDAAG-QYGTFSIDA-DGQWTYQADNSQTAVQALKAGETLTDTFTVTSADG----TSQTV 73 (99)
T ss_pred CceEEeCCCCCCc--eEEecccccC-CcEEEEECC-CCcEEEEeCCCcHHHHhhcCCCEEEEEEEEEEeCC----CeEEE
Confidence 4688999998654 4555532222 456788877 5643222 1222222 36788888994 38899
Q ss_pred EEEEEcCCCCCCCeEecCeeEEEEecC
Q psy5884 141 EVSVVPGPNTRSPVFQQNTYDVNVSEG 167 (195)
Q Consensus 141 ~i~V~~d~nd~~P~f~~~~~~~~v~E~ 167 (195)
.|+|. ..||.| ..... -...+.|+
T Consensus 74 tItI~-GtNDap-vi~~~-~~g~v~ED 97 (99)
T TIGR01965 74 TITIT-GANDAA-VIGGA-DTGSVTED 97 (99)
T ss_pred EEEEE-ccCCCC-EEecc-cceeEecC
Confidence 99999 778654 44322 23566665
No 18
>PF05345 He_PIG: Putative Ig domain; InterPro: IPR008009 This alignment represents the conserved core region of a ~90 residue repeat found in several haemagglutinins and other cell surface proteins. Sequence similarities to Hyalin (IPR003410 from INTERPRO) and the PKD domain (IPR000601 from INTERPRO) suggest an Ig-like fold so this family may be similar in function to the (IPR003791 from INTERPRO) and (IPR003790 from INTERPRO) protein families.
Probab=90.19 E-value=2.2 Score=24.34 Aligned_cols=36 Identities=31% Similarity=0.367 Sum_probs=26.7
Q ss_pred CeecEEEeCCCCEEEEceeCC-CCCEEEEEEEEEECC
Q psy5884 97 GLHKFRIDPRSGVIETTGKLI-AGEQYSITVQATDTG 132 (195)
Q Consensus 97 ~~~~F~id~~tG~i~~~~~ld-~~~~~~l~v~a~D~~ 132 (195)
......||+.+|.|.-...-. ....|.|.|.|+|..
T Consensus 12 LP~gLs~d~~tG~isGtp~~~~~~G~y~~~vtatd~~ 48 (49)
T PF05345_consen 12 LPSGLSLDPSTGTISGTPTSSVQPGTYTFTVTATDGS 48 (49)
T ss_pred CCCcEEEeCCCCEEEeecCCCccccEEEEEEEEEcCC
Confidence 445688899999986664433 235899999999964
No 19
>TIGR03660 T1SS_rpt_143 T1SS-143 repeat domain. This model represents a domain of about 143 amino acids that may occur singly or in up to 23 tandem repeats in very large proteins in the genus Vibrio, and in related species such as Legionella pneumophila, Photobacterium profundum, Rhodopseudomonas palustris, Shewanella pealeana, and Aeromonas hydrophila. Proteins with these domains represent a subset of a broader set of proteins with a particular signal for type 1 secretion, consisting of several glycine-rich repeats modeled by pfam00353, followed by a C-terminal domain modeled by TIGR03661. Proteins with this domain tend to share several properties with the RtxA (Repeats in Toxin) protein of Vibrio cholerae, including a large size often containing tandemly repeated domains and a C-terminal signal for type 1 secretion.
Probab=88.72 E-value=2.7 Score=29.64 Aligned_cols=60 Identities=20% Similarity=0.411 Sum_probs=40.2
Q ss_pred CEEecCCCCCCCCCC-CCeEEEEEEEEECCCCCeEeeEEEEEEEEecCCCCCccCCCCcEEEEcCC
Q psy5884 1 YLVVSKENLDRDPPN-PGKFRFQVIAREKQGTAASVPVSLTVNLNDINDNAPRLPMIPPIQIQAGE 65 (195)
Q Consensus 1 ~~i~~~~~LD~E~~~-~~~~~l~v~a~D~~~~~~~~~~~v~I~V~DvNDn~P~f~~~~~~~v~e~~ 65 (195)
|-..+..+||...-+ .-.+.|.|.|+|..+-.. +..+.|.|.| |. |.......+.+.|..
T Consensus 68 YtftL~~~lDH~~g~d~l~l~~~v~a~D~DGD~s--~~~l~VtI~D--D~-P~~~~~~~~~V~E~~ 128 (137)
T TIGR03660 68 YEFTLEGPLDHAAGSDELTLNFPIIATDFDGDTS--SITLPVTIVD--DV-PTITDVDALTVDEDD 128 (137)
T ss_pred EEEEEcccccCCCCCceEEEeeeEEEEeCCCCcc--ccEEEEEEEC--CC-CeeccccceEEeccc
Confidence 567788888875311 346788899999877533 3478888887 54 665544457777743
No 20
>smart00736 CADG Dystroglycan-type cadherin-like domains. Cadherin-homologous domains present in metazoan dystroglycans and alpha/epsilon sarcoglycans, yeast Axl2p and in a very large protein from magnetotactic bacteria. Likely to bind calcium ions.
Probab=88.34 E-value=1.8 Score=28.33 Aligned_cols=31 Identities=29% Similarity=0.499 Sum_probs=26.3
Q ss_pred CCeEEEEEEEEECCCCCeEeeEEEEEEEEecCC
Q psy5884 16 PGKFRFQVIAREKQGTAASVPVSLTVNLNDIND 48 (195)
Q Consensus 16 ~~~~~l~v~a~D~~~~~~~~~~~v~I~V~DvND 48 (195)
...|.+.|.|+|+.+ .+....+.|.|.+.||
T Consensus 66 ~g~~~i~v~a~D~~g--~~~~~~f~i~V~~~~~ 96 (97)
T smart00736 66 VGSLSLKVTATDSSG--ASASDTFTITVVNTND 96 (97)
T ss_pred CcEEEEEEEEEECCC--CEEEEEEEEEEeCCCC
Confidence 346999999999976 5677889999999887
No 21
>PF07495 Y_Y_Y: Y_Y_Y domain; InterPro: IPR011123 This region is mostly found at the end of the beta propellers (IPR011110 from INTERPRO) in a family of two component regulators. However they are also found tandemly repeated in Q891H4 from SWISSPROT without other signal conduction domains being present. It is named after the conserved tyrosines found in the alignment. The exact function is not known.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=80.23 E-value=4.6 Score=24.02 Aligned_cols=30 Identities=33% Similarity=0.543 Sum_probs=19.6
Q ss_pred CCCeEEEEEEEEECCCCCeEeeEEEEEEEE
Q psy5884 15 NPGKFRFQVIAREKQGTAASVPVSLTVNLN 44 (195)
Q Consensus 15 ~~~~~~l~v~a~D~~~~~~~~~~~v~I~V~ 44 (195)
+...|.|.|.|.|..+.......++.|.|+
T Consensus 37 ~~G~Y~l~V~a~~~~~~~~~~~~~l~i~I~ 66 (66)
T PF07495_consen 37 PPGKYTLEVRAKDNNGKWSSDEKSLTITIL 66 (66)
T ss_dssp -SEEEEEEEEEEETTS-B-SS-EEEEEEEE
T ss_pred CCEEEEEEEEEECCCCCcCcccEEEEEEEC
Confidence 367899999999987753333366666663
No 22
>PF03160 Calx-beta: Calx-beta domain; InterPro: IPR003644 The calx-beta motif is present as a tandem repeat in the cytoplasmic domains of Calx Na-Ca exchangers, which are used to expel calcium from cells. This motif overlaps domains used for calcium binding and regulation. The calx-beta motif is also present in the cytoplasmic tail of mammalian integrin-beta4, which mediates the bi-directional transfer of signals across the plasma membrane, as well as in some cyanobacterial proteins. This motif contains a series of beta-strands and turns that form a self-contained beta-sheet [, ].; GO: 0007154 cell communication, 0016021 integral to membrane; PDB: 3H6A_B 3FSO_A 3FQ4_B 2DPK_A 2QVM_A 3GIN_B 2QVK_A 2FWU_A 2FWS_A 3E9U_A ....
Probab=78.86 E-value=15 Score=23.83 Aligned_cols=50 Identities=20% Similarity=0.397 Sum_probs=25.6
Q ss_pred EEEEEecCCCCCccCCC-CcEEEEcCCCCceEEEEEEeeCCCCCCceEEEEEEe
Q psy5884 40 TVNLNDINDNAPRLPMI-PPIQIQAGEARRQIVKIEATDNDLGVNAEITYSIYH 92 (195)
Q Consensus 40 ~I~V~DvNDn~P~f~~~-~~~~v~e~~~g~~v~~v~a~D~D~~~n~~~~y~i~~ 92 (195)
+|.|.| ||.+ .+.-. ....+.|+. |.....|....++....-.+.|....
T Consensus 2 tvtI~d-~d~~-~v~f~~~~~~v~E~~-~~~~v~V~~~~~~~~~~v~v~~~~~~ 52 (100)
T PF03160_consen 2 TVTILD-DDDP-TVSFSSPSYTVSEGD-GTVTVTVTRSGGSLDGPVTVNYSTVD 52 (100)
T ss_dssp EEEEE--TTSE-EEEESSSEEEEETTS-SEEEEEEEEESS-TSSEEEEEEEEEE
T ss_pred EEEEEC-CCCC-EEEEeCCEEEEEeCC-CEEEEEEEEcccCCCcceEEEEEEeC
Confidence 467788 6666 65443 346777764 43444454444433233446666654
No 23
>TIGR01965 VCBS_repeat VCBS repeat. This domain of about 100 residues is found multiple (up to 35) copies in long proteins from several species of Vibrio, Colwellia, Bradyrhizobium, and Shewanella (hence the name VCBS) and in smaller copy numbers in proteins from several other bacteria. The large protein size and repeat copy numbers, species distribution, and suggested activities of several member proteins suggests a role for this domain in adhesion.
Probab=77.27 E-value=12 Score=24.83 Aligned_cols=41 Identities=15% Similarity=0.145 Sum_probs=27.4
Q ss_pred CCeEEEEEEEEECCCCCeEeeEEEEEEEEecCCCCCccCCCCcEEEEc
Q psy5884 16 PGKFRFQVIAREKQGTAASVPVSLTVNLNDINDNAPRLPMIPPIQIQA 63 (195)
Q Consensus 16 ~~~~~l~v~a~D~~~~~~~~~~~v~I~V~DvNDn~P~f~~~~~~~v~e 63 (195)
.-.-+|.+.+.|+ .+.+|.|.|.-.||.| +....-...+.|
T Consensus 56 ~~tdsFtvtv~DG------tt~~vtItI~GtNDap-vi~~~~~g~v~E 96 (99)
T TIGR01965 56 TLTDTFTVTSADG------TSQTVTITITGANDAA-VIGGADTGSVTE 96 (99)
T ss_pred EEEEEEEEEEeCC------CeEEEEEEEEccCCCC-EEecccceeEec
Confidence 4456778888886 2778999999999954 443333344444
No 24
>PF08758 Cadherin_pro: Cadherin prodomain like; InterPro: IPR014868 Cadherins are a group of proteins that mediate calcium dependent cell-cell adhesion. They are activated through cleavage of a prosequence in the late Golgi. This protein corresponds to the folded region of the prosequence, and is termed the prodomain. The prodomain shows structural resemblance to the cadherin domain, but lacks all the features known to be important for cadherin-cadherin interactions []. ; GO: 0007155 cell adhesion, 0016021 integral to membrane; PDB: 1OP4_A.
Probab=77.21 E-value=2 Score=27.93 Aligned_cols=35 Identities=14% Similarity=0.281 Sum_probs=14.1
Q ss_pred CCCeEecCeeEEEEecCCCCCeEEEEEEEEeCCCC
Q psy5884 151 RSPVFQQNTYDVNVSEGASINSTVATITVSRLASG 185 (195)
Q Consensus 151 ~~P~f~~~~~~~~v~E~~~~g~~v~~v~a~D~D~g 185 (195)
+.|-|.+..|.+.|+.+...|..|++|.-.|=...
T Consensus 2 C~pGF~~~~~~~~Vp~~l~~g~~lg~V~f~dC~~~ 36 (90)
T PF08758_consen 2 CRPGFSQKKYTFEVPSNLEAGQPLGKVNFEDCTGR 36 (90)
T ss_dssp ---B--S-EEEE----SS-SS--EEE---B--SS-
T ss_pred CcCCcccceEEEEcCchhhCCcEEEEEEeccCCCC
Confidence 46899999999999999999999999877765443
No 25
>PF07495 Y_Y_Y: Y_Y_Y domain; InterPro: IPR011123 This region is mostly found at the end of the beta propellers (IPR011110 from INTERPRO) in a family of two component regulators. However they are also found tandemly repeated in Q891H4 from SWISSPROT without other signal conduction domains being present. It is named after the conserved tyrosines found in the alignment. The exact function is not known.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=71.63 E-value=18 Score=21.29 Aligned_cols=58 Identities=19% Similarity=0.300 Sum_probs=30.2
Q ss_pred CCceEEEEEEeeCCCCeecEEEeCCCCEEEEceeCCCCCEEEEEEEEEECCCC--ceeEEEEEEE
Q psy5884 82 VNAEITYSIYHVSNTGLHKFRIDPRSGVIETTGKLIAGEQYSITVQATDTGGK--SSQTIVEVSV 144 (195)
Q Consensus 82 ~n~~~~y~i~~~~~~~~~~F~id~~tG~i~~~~~ld~~~~~~l~v~a~D~~~~--~~~~~v~i~V 144 (195)
.+-...|.|. |.......+...+-.+.. .....+.|.|.|+|.|..+. ....++.|.|
T Consensus 6 ~~~~Y~Y~l~---g~d~~W~~~~~~~~~~~~--~~L~~G~Y~l~V~a~~~~~~~~~~~~~l~i~I 65 (66)
T PF07495_consen 6 ENIRYRYRLE---GFDDEWITLGSYSNSISY--TNLPPGKYTLEVRAKDNNGKWSSDEKSLTITI 65 (66)
T ss_dssp TTEEEEEEEE---TTESSEEEESSTS-EEEE--ES--SEEEEEEEEEEETTS-B-SS-EEEEEEE
T ss_pred CceEEEEEEE---CCCCeEEECCCCcEEEEE--EeCCCEEEEEEEEEECCCCCcCcccEEEEEEE
Confidence 3445667665 333344444332213222 22345689999999997662 2226666665
No 26
>KOG3597|consensus
Probab=70.86 E-value=66 Score=27.47 Aligned_cols=58 Identities=16% Similarity=0.153 Sum_probs=38.7
Q ss_pred eeEEEEEEEEecCCCCCccCCC-CcEEEEcCCC-CceEEEEEEeeCCCCCCceEEEEEEee
Q psy5884 35 VPVSLTVNLNDINDNAPRLPMI-PPIQIQAGEA-RRQIVKIEATDNDLGVNAEITYSIYHV 93 (195)
Q Consensus 35 ~~~~v~I~V~DvNDn~P~f~~~-~~~~v~e~~~-g~~v~~v~a~D~D~~~n~~~~y~i~~~ 93 (195)
.+....|.|.-+||.+..+-.. +.+.+.|+.. -.....+.+.|+|.++- .+.|+++..
T Consensus 24 ~~~~~~i~v~pvndpp~~~~~~~~~l~~~~~~~k~l~~~~l~~~d~d~~~~-~l~f~v~~t 83 (442)
T KOG3597|consen 24 QTDVLRIHVNPVNDPPSLIFPSGSLLVILEGGQKVLDPELLTAADPDSAPL-PLEFQVLGT 83 (442)
T ss_pred EEeeecccccccCCCcceeecccceEEeecCCceeccceEeeccCCCCCcc-ceEEEEccC
Confidence 4667889999999988765444 3455566543 22335688888887543 378888753
No 27
>TIGR00845 caca sodium/calcium exchanger 1. This model is specific for the eukaryotic sodium ion/calcium ion exchangers of the Caca family
Probab=66.97 E-value=1.1e+02 Score=28.72 Aligned_cols=118 Identities=13% Similarity=0.150 Sum_probs=58.5
Q ss_pred cCCCCCccCCC-CcEEEEcCCCCceEEEEEEeeCCCCCCceEEEEEEeeCCCCeecEEEeCCCCEEEEc----------e
Q psy5884 46 INDNAPRLPMI-PPIQIQAGEARRQIVKIEATDNDLGVNAEITYSIYHVSNTGLHKFRIDPRSGVIETT----------G 114 (195)
Q Consensus 46 vNDn~P~f~~~-~~~~v~e~~~g~~v~~v~a~D~D~~~n~~~~y~i~~~~~~~~~~F~id~~tG~i~~~----------~ 114 (195)
.||.++.|.-. ....+.|+. |+.-..|.-...|.+....+.|+..+++.....-|. +.+|.|... .
T Consensus 395 ~dd~~s~i~Fe~~~Y~V~En~-GtV~VtV~R~GGdl~~tVsVdY~T~DGTA~AG~DY~--~~sGTLtF~PGEt~KtItV~ 471 (928)
T TIGR00845 395 ENDPVSKIFFEPGHYTCLENC-GTVALTVVRRGGDLTNTVYVDYRTEDGTANAGSDYE--FTEGTLVFKPGETQKEFRIG 471 (928)
T ss_pred ccCCcceEEecCCeEEEeecC-cEEEEEEEEccCCCCceEEEEEEccCCccCCCCCcc--ccCceEEECCCceEEEEEEE
Confidence 44555554333 346777764 666666655544554455688887654333222232 223443222 1
Q ss_pred eCC---CC--CEEEEEEEEEECC---------------CCceeEEEEEEEEcCCCCCCCeEecCeeEEEEecCC
Q psy5884 115 KLI---AG--EQYSITVQATDTG---------------GKSSQTIVEVSVVPGPNTRSPVFQQNTYDVNVSEGA 168 (195)
Q Consensus 115 ~ld---~~--~~~~l~v~a~D~~---------------~~~~~~~v~i~V~~d~nd~~P~f~~~~~~~~v~E~~ 168 (195)
-+| .| +.|.+.+.--..+ ......+.+|+|. | ||++|.|.-..-...|.|+.
T Consensus 472 IIDDdi~E~DE~F~V~LSNp~~g~~~G~~~~~~~~~~A~Lg~ps~ATVTIl-D-DD~aGIfsFe~~~~sV~Es~ 543 (928)
T TIGR00845 472 IIDDDIFEEDEHFYVRLSNLRVGSEDGILEANHVSAVAQLASPNTATVTIL-D-DDHAGIFTFEEDVFHVSESI 543 (928)
T ss_pred EccCCCCCCCceEEEEEeCCCCCCcccccccccccccceecCCceEEEEEe-c-CcccCcccccCceEEEEcCC
Confidence 122 23 3454444321111 0112245567777 4 67888766544456677763
No 28
>PF12245 Big_3_2: Bacterial Ig-like domain (group 3); InterPro: IPR022038 This family of proteins is found in bacteria. They have two conserved sequence motifs: AGN and GMT.
Probab=64.92 E-value=27 Score=20.64 Aligned_cols=27 Identities=33% Similarity=0.639 Sum_probs=21.1
Q ss_pred CCEEEEEEEEEECCCCceeEEEEEEEE
Q psy5884 119 GEQYSITVQATDTGGKSSQTIVEVSVV 145 (195)
Q Consensus 119 ~~~~~l~v~a~D~~~~~~~~~v~i~V~ 145 (195)
...|.+.+.|.|..+..........+.
T Consensus 22 dg~yt~~v~a~D~AGN~~~~~~~~~i~ 48 (60)
T PF12245_consen 22 DGEYTLTVTATDKAGNTSSSTTQIVIV 48 (60)
T ss_pred CccEEEEEEEEECCCCEEEeeeEEEEE
Confidence 457999999999988766666666665
No 29
>PF02010 REJ: REJ domain; InterPro: IPR002859 The REJ (Receptor for Egg Jelly) domain is found in PKD1 P98161 from SWISSPROT and the sperm receptor for egg jelly Q26627 from SWISSPROT. The exact function of this domain is unknown. The domain is 600 amino acids long so is probably composed of multiple structural domains. There are six completely conserved cysteine residues that may form disulphide bridges. This region contains tandem PKD-like domains. Sequence similarity between a region of the autosomal dominant polycystic kidney disease (ADPKD) protein, polycystin-1 and a sea urchin sperm glycoprotein involved in fertilization, the receptor for egg jelly (suREJ) has been known for some time. The suREJ protein binds the glycoprotein coat of the egg (egg jelly), triggering the acrosome reaction, which transforms the sperm into a fusogenic cell. The sequence similarity and expression pattern suggests that the predicted human PKDREJ protein is a mammalian equivalent of the suREJ protein and therefore may have a central role in human fertilization [].; PDB: 2E7M_A 2YRL_A.
Probab=64.04 E-value=16 Score=30.67 Aligned_cols=131 Identities=17% Similarity=0.199 Sum_probs=17.4
Q ss_pred CeEEEEEEEEECCCCCeEeeEEEEEEEEecCCCCCccCCCCcEEEEcCCCCceEEEE---EEeeCCCCC-CceEEEEEEe
Q psy5884 17 GKFRFQVIAREKQGTAASVPVSLTVNLNDINDNAPRLPMIPPIQIQAGEARRQIVKI---EATDNDLGV-NAEITYSIYH 92 (195)
Q Consensus 17 ~~~~l~v~a~D~~~~~~~~~~~v~I~V~DvNDn~P~f~~~~~~~v~e~~~g~~v~~v---~a~D~D~~~-n~~~~y~i~~ 92 (195)
..|.+.+.++-..++.......+.|.|.-.. -.|...-.....+.. + ....+ .-.|+|... +..++|....
T Consensus 50 G~y~~~~~Vt~~~~~~~~~~~~~~v~V~~s~-l~~~I~gG~~~~~~~---~-~~i~ldgs~S~Dpd~~~~~~~l~y~W~C 124 (440)
T PF02010_consen 50 GDYTFTLTVTASSNPGLSSTDSVTVTVEPSP-LVAVIKGGSSRTVGY---N-SDITLDGSQSYDPDGPPGDSGLTYSWSC 124 (440)
T ss_dssp CEEEEEEEEE--BCTTEEEEEEEEEEEE----------------------------------------------------
T ss_pred CCEEEEEEEEEECCCCceEEEEEEEEEeecc-ceeEEcCCccceeec---C-ceEEEeeEEEecccccccCCceEEEEEE
Confidence 3455655555233346778888888887522 122221111111111 1 11122 235777642 2446776553
Q ss_pred eCCCCe-e-------cEEEeCCCCEEEEce-eCCCCCEEEEEEEEEECCCCceeEEEEEEEEcCCCCCCCeE
Q psy5884 93 VSNTGL-H-------KFRIDPRSGVIETTG-KLIAGEQYSITVQATDTGGKSSQTIVEVSVVPGPNTRSPVF 155 (195)
Q Consensus 93 ~~~~~~-~-------~F~id~~tG~i~~~~-~ld~~~~~~l~v~a~D~~~~~~~~~v~i~V~~d~nd~~P~f 155 (195)
...... . ........+.+.+.. .|.....|.|++.+++.+..+..+...|.|. ...+|..
T Consensus 125 ~~~~~~~~C~~~~~~~~~~~~~~~~l~i~~~~l~~~~~y~f~ltv~k~~r~s~s~~~~v~v~---~~~~p~v 193 (440)
T PF02010_consen 125 TDLSSNSACSTPSTNITLLNSSSSSLTIPASTLSPGSTYTFTLTVSKGSRSSSSASQTVTVV---SGDPPTV 193 (440)
T ss_dssp ------------------------------------------------------------------------
T ss_pred eCCceeccccccccccccCCCCCEEEEEEhHHcCCCceEEEEEEEEeCCCCceeeEEEEEec---cCCCCce
Confidence 222211 0 111123334454433 3444445999999999776577777777776 2345544
No 30
>TIGR00845 caca sodium/calcium exchanger 1. This model is specific for the eukaryotic sodium ion/calcium ion exchangers of the Caca family
Probab=63.90 E-value=1.3e+02 Score=28.34 Aligned_cols=56 Identities=20% Similarity=0.209 Sum_probs=29.8
Q ss_pred eeEEEEEEEEecCCCCCccCCCC-cEEEEcCCCCceEEEEEEeeCCCCCCceEEEEEEee
Q psy5884 35 VPVSLTVNLNDINDNAPRLPMIP-PIQIQAGEARRQIVKIEATDNDLGVNAEITYSIYHV 93 (195)
Q Consensus 35 ~~~~v~I~V~DvNDn~P~f~~~~-~~~v~e~~~g~~v~~v~a~D~D~~~n~~~~y~i~~~ 93 (195)
...+.+|+|.| ||++|.|.-.. ...+.|+. |..-.+|.-+-.-.| .-.+.|...++
T Consensus 513 ~ps~ATVTIlD-DD~aGIfsFe~~~~sV~Es~-G~vtvtV~RtsGa~G-~VtV~Y~T~dG 569 (928)
T TIGR00845 513 SPNTATVTILD-DDHAGIFTFEEDVFHVSESI-GIMEVKVLRTSGARG-TVIVPYRTVEG 569 (928)
T ss_pred CCceEEEEEec-CcccCcccccCceEEEEcCC-CEEEEEEEEcCCCCe-eEEEEEEeecC
Confidence 33466777787 78899876543 46777764 443333322211111 22366766543
No 31
>PF13754 Big_3_4: Bacterial Ig-like domain (group 3)
Probab=59.87 E-value=31 Score=19.80 Aligned_cols=27 Identities=41% Similarity=0.623 Sum_probs=18.7
Q ss_pred CCCEEEEEEEEEECCCCceeE-EEEEEE
Q psy5884 118 AGEQYSITVQATDTGGKSSQT-IVEVSV 144 (195)
Q Consensus 118 ~~~~~~l~v~a~D~~~~~~~~-~v~i~V 144 (195)
.+..|.+.+.|+|..+..+.. ...+.|
T Consensus 22 ~dG~y~itv~a~D~AGN~s~~~~~~~ti 49 (54)
T PF13754_consen 22 ADGTYTITVTATDAAGNTSTSSSVTFTI 49 (54)
T ss_pred CCccEEEEEEEEeCCCCCCCccceeEEE
Confidence 477899999999987744333 444544
No 32
>TIGR03660 T1SS_rpt_143 T1SS-143 repeat domain. This model represents a domain of about 143 amino acids that may occur singly or in up to 23 tandem repeats in very large proteins in the genus Vibrio, and in related species such as Legionella pneumophila, Photobacterium profundum, Rhodopseudomonas palustris, Shewanella pealeana, and Aeromonas hydrophila. Proteins with these domains represent a subset of a broader set of proteins with a particular signal for type 1 secretion, consisting of several glycine-rich repeats modeled by pfam00353, followed by a C-terminal domain modeled by TIGR03661. Proteins with this domain tend to share several properties with the RtxA (Repeats in Toxin) protein of Vibrio cholerae, including a large size often containing tandemly repeated domains and a C-terminal signal for type 1 secretion.
Probab=59.44 E-value=60 Score=22.89 Aligned_cols=54 Identities=22% Similarity=0.319 Sum_probs=35.0
Q ss_pred EEEceeCCCC-----CEEEEEEEEEECCCCceeEEEEEEEEcCCCCCCCeEecCeeEEEEecCC
Q psy5884 110 IETTGKLIAG-----EQYSITVQATDTGGKSSQTIVEVSVVPGPNTRSPVFQQNTYDVNVSEGA 168 (195)
Q Consensus 110 i~~~~~ld~~-----~~~~l~v~a~D~~~~~~~~~v~i~V~~d~nd~~P~f~~~~~~~~v~E~~ 168 (195)
..+.++||+. -...|.|.|+|..+-.+...+.|+|. | | .|...... ...|.|+.
T Consensus 70 ftL~~~lDH~~g~d~l~l~~~v~a~D~DGD~s~~~l~VtI~-D--D-~P~~~~~~-~~~V~E~~ 128 (137)
T TIGR03660 70 FTLEGPLDHAAGSDELTLNFPIIATDFDGDTSSITLPVTIV-D--D-VPTITDVD-ALTVDEDD 128 (137)
T ss_pred EEEcccccCCCCCceEEEeeeEEEEeCCCCccccEEEEEEE-C--C-CCeecccc-ceEEeccc
Confidence 4455566542 25788999999877555678889888 4 3 36654433 36777753
No 33
>smart00089 PKD Repeats in polycystic kidney disease 1 (PKD1) and other proteins. Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.
Probab=55.50 E-value=41 Score=20.50 Aligned_cols=25 Identities=24% Similarity=0.502 Sum_probs=17.3
Q ss_pred CCeEEEEEEEEECCCCCeEeeEEEEEEE
Q psy5884 16 PGKFRFQVIAREKQGTAASVPVSLTVNL 43 (195)
Q Consensus 16 ~~~~~l~v~a~D~~~~~~~~~~~v~I~V 43 (195)
...|.+.+.+.|..+ +.+.++.|.|
T Consensus 54 ~G~y~v~l~v~n~~g---~~~~~~~i~v 78 (79)
T smart00089 54 PGTYTVTLTVTNAVG---SASATVTVVV 78 (79)
T ss_pred CcEEEEEEEEEcCCC---cEEEEEEEEE
Confidence 567889999988866 4455555544
No 34
>PF13750 Big_3_3: Bacterial Ig-like domain (group 3)
Probab=53.21 E-value=83 Score=22.66 Aligned_cols=37 Identities=27% Similarity=0.458 Sum_probs=24.3
Q ss_pred CCEEEEEE-EEEECCCCceeEEEEEEEEcCCCCCCCeEec
Q psy5884 119 GEQYSITV-QATDTGGKSSQTIVEVSVVPGPNTRSPVFQQ 157 (195)
Q Consensus 119 ~~~~~l~v-~a~D~~~~~~~~~v~i~V~~d~nd~~P~f~~ 157 (195)
+..|.+.+ .|.|..+...+..+..++. +..-+|....
T Consensus 14 dG~Y~l~~~~a~D~agN~~~~~~~~~~~--iD~T~Ptisi 51 (158)
T PF13750_consen 14 DGSYTLTVVTATDAAGNTSTSTVSETFT--IDNTPPTISI 51 (158)
T ss_pred CccEEEEEEEEEecCCCEEEEEEeeEEE--EcCCCCEEEE
Confidence 45789998 7999877666665554444 2346777754
No 35
>cd00146 PKD polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.
Probab=51.75 E-value=41 Score=20.58 Aligned_cols=15 Identities=20% Similarity=0.578 Sum_probs=12.0
Q ss_pred CCeEEEEEEEEECCC
Q psy5884 16 PGKFRFQVIAREKQG 30 (195)
Q Consensus 16 ~~~~~l~v~a~D~~~ 30 (195)
...|.+.+.++|..+
T Consensus 56 ~G~y~v~l~v~d~~g 70 (81)
T cd00146 56 PGTYTVTLTVTNAVG 70 (81)
T ss_pred CcEEEEEEEEEeCCC
Confidence 567889999998854
No 36
>cd00146 PKD polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.
Probab=51.36 E-value=55 Score=20.01 Aligned_cols=31 Identities=26% Similarity=0.328 Sum_probs=22.1
Q ss_pred eeCCCCCEEEEEEEEEECCCCceeEEEEEEE
Q psy5884 114 GKLIAGEQYSITVQATDTGGKSSQTIVEVSV 144 (195)
Q Consensus 114 ~~ld~~~~~~l~v~a~D~~~~~~~~~v~i~V 144 (195)
..+.....|.+++.++|..+.+...++.|.|
T Consensus 51 ~~y~~~G~y~v~l~v~d~~g~~~~~~~~V~V 81 (81)
T cd00146 51 HTYTKPGTYTVTLTVTNAVGSSSTKTTTVVV 81 (81)
T ss_pred EEcCCCcEEEEEEEEEeCCCCEEEEEEEEEC
Confidence 3445567899999999986666655666543
No 37
>KOG4221|consensus
Probab=49.55 E-value=2.5e+02 Score=27.53 Aligned_cols=105 Identities=15% Similarity=0.133 Sum_probs=55.2
Q ss_pred CCeEEEEEEEEECCCCCeEeeEEEEEEEEecCCCCCc-cC----CCCcEEEEcCCC--CceEEEEEEeeCCCCCCceEEE
Q psy5884 16 PGKFRFQVIAREKQGTAASVPVSLTVNLNDINDNAPR-LP----MIPPIQIQAGEA--RRQIVKIEATDNDLGVNAEITY 88 (195)
Q Consensus 16 ~~~~~l~v~a~D~~~~~~~~~~~v~I~V~DvNDn~P~-f~----~~~~~~v~e~~~--g~~v~~v~a~D~D~~~n~~~~y 88 (195)
...|.+.+.|....|.+.++..--.++..|+=+.||. .. .+..+.|..+.+ .+..+++. -|
T Consensus 584 ~TeY~~~vvA~N~~G~g~sS~~i~V~Tlsd~PsaPP~Nl~lev~sStsVrVsW~pP~~~t~ng~it------------gY 651 (1381)
T KOG4221|consen 584 YTEYSIRVVAYNSAGSGVSSADITVRTLSDVPSAPPQNLSLEVVSSTSVRVSWLPPPSETQNGQIT------------GY 651 (1381)
T ss_pred ccceEEEEEEecCCCCCCCCCceEEEeccCCCCCCCcceEEEecCCCeEEEEccCCCcccccceEE------------EE
Confidence 3479999999999887666555445556666666665 21 112355554443 34333332 23
Q ss_pred EEEee-CCCCeecEEEe-CCCCEEEEceeCCCCCEEEEEEEEEECC
Q psy5884 89 SIYHV-SNTGLHKFRID-PRSGVIETTGKLIAGEQYSITVQATDTG 132 (195)
Q Consensus 89 ~i~~~-~~~~~~~F~id-~~tG~i~~~~~ld~~~~~~l~v~a~D~~ 132 (195)
.|... .+.....++.- ..+-.-.+...|++..+|.|.|.|....
T Consensus 652 kIRy~~~~~~~~~~~t~v~~n~~~~l~~~Lep~T~Y~vrIsa~t~n 697 (1381)
T KOG4221|consen 652 KIRYRKLSREDEVNETVVKGNTTQYLFNGLEPNTQYRVRISAMTVN 697 (1381)
T ss_pred EEEecccCcccccceeecccchhhhHhhcCCCCceEEEEEEEeccC
Confidence 33321 11122222221 1111112233467778899999998753
No 38
>PF02494 HYR: HYR domain; InterPro: IPR003410 This domain is known as the HYR (Hyalin Repeat) domain, after the protein hyalin that is composed exclusively of this repeat. This domain probably corresponds to a new superfamily in the immunoglobulin fold. The function of this domain is uncertain it may be involved in cell adhesion. In the Sushi repeat-containing protein (SrpX), this domain is found between two sushi repeats.
Probab=43.92 E-value=57 Score=20.20 Aligned_cols=24 Identities=33% Similarity=0.439 Sum_probs=20.1
Q ss_pred EEEEEEEEEECCCCceeEEEEEEE
Q psy5884 121 QYSITVQATDTGGKSSQTIVEVSV 144 (195)
Q Consensus 121 ~~~l~v~a~D~~~~~~~~~v~i~V 144 (195)
.+.+...|+|..+..+++++.|+|
T Consensus 58 ~t~V~ytA~D~~GN~a~C~f~V~V 81 (81)
T PF02494_consen 58 TTTVTYTATDAAGNSATCSFTVTV 81 (81)
T ss_pred eEEEEEEEEECCCCEEEEEEEEEC
Confidence 588999999988888888887765
No 39
>PF12245 Big_3_2: Bacterial Ig-like domain (group 3); InterPro: IPR022038 This family of proteins is found in bacteria. They have two conserved sequence motifs: AGN and GMT.
Probab=36.88 E-value=90 Score=18.32 Aligned_cols=30 Identities=20% Similarity=0.412 Sum_probs=19.7
Q ss_pred CCeEEEEEEEEECCCCCeEeeEEEEEEEEecC
Q psy5884 16 PGKFRFQVIAREKQGTAASVPVSLTVNLNDIN 47 (195)
Q Consensus 16 ~~~~~l~v~a~D~~~~~~~~~~~v~I~V~DvN 47 (195)
...|.+.+.|+|..|... .....+.+.|..
T Consensus 22 dg~yt~~v~a~D~AGN~~--~~~~~~~i~d~~ 51 (60)
T PF12245_consen 22 DGEYTLTVTATDKAGNTS--SSTTQIVIVDNT 51 (60)
T ss_pred CccEEEEEEEEECCCCEE--EeeeEEEEEcCC
Confidence 347999999999988633 334444444443
No 40
>PF15418 DUF4625: Domain of unknown function (DUF4625)
Probab=34.56 E-value=1.1e+02 Score=21.48 Aligned_cols=26 Identities=15% Similarity=0.419 Sum_probs=20.4
Q ss_pred CCEEEEEEEEEECCCCceeEEEEEEE
Q psy5884 119 GEQYSITVQATDTGGKSSQTIVEVSV 144 (195)
Q Consensus 119 ~~~~~l~v~a~D~~~~~~~~~v~i~V 144 (195)
...|.|.+.++|..+........|.|
T Consensus 106 ~G~YH~~i~VtD~~Gn~~~~~~~i~I 131 (132)
T PF15418_consen 106 AGDYHFMITVTDAAGNQTEEERSIKI 131 (132)
T ss_pred CcceEEEEEEEECCCCEEEEEEEEEE
Confidence 45799999999998877766666655
No 41
>KOG3597|consensus
Probab=33.74 E-value=1.3e+02 Score=25.71 Aligned_cols=50 Identities=12% Similarity=0.062 Sum_probs=36.0
Q ss_pred eeEEEEEEEEcCCCCCCCeEecCeeEEEEecCCCCCeEEEEEEEEeCCCCC
Q psy5884 136 SQTIVEVSVVPGPNTRSPVFQQNTYDVNVSEGASINSTVATITVSRLASGC 186 (195)
Q Consensus 136 ~~~~v~i~V~~d~nd~~P~f~~~~~~~~v~E~~~~g~~v~~v~a~D~D~g~ 186 (195)
.+....|.|. .+|+.+..+....+.+-+.|+..--.--..+.+.|+|.++
T Consensus 24 ~~~~~~i~v~-pvndpp~~~~~~~~~l~~~~~~~k~l~~~~l~~~d~d~~~ 73 (442)
T KOG3597|consen 24 QTDVLRIHVN-PVNDPPSLIFPSGSLLVILEGGQKVLDPELLTAADPDSAP 73 (442)
T ss_pred EEeeeccccc-ccCCCcceeecccceEEeecCCceeccceEeeccCCCCCc
Confidence 4556678887 7898888777777778888876543344577888888874
No 42
>PF05895 DUF859: Siphovirus protein of unknown function (DUF859); InterPro: IPR008577 This entry is represented by Streptococcus phage 7201, Orf39. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This family consists of several uncharacterised proteins from a number of the Siphoviruses as well as some bacterial proteins from Streptococcus species. Some of the members of this family are described as putative minor structural proteins.
Probab=31.56 E-value=3.9e+02 Score=24.12 Aligned_cols=107 Identities=17% Similarity=0.216 Sum_probs=58.2
Q ss_pred CCeEEEEEEEEECCCCCeEeeEEEEEEEEecCCCCCccCCCCcEEEEcCCC-Cce-EEEEEEeeC------CCCCCceEE
Q psy5884 16 PGKFRFQVIAREKQGTAASVPVSLTVNLNDINDNAPRLPMIPPIQIQAGEA-RRQ-IVKIEATDN------DLGVNAEIT 87 (195)
Q Consensus 16 ~~~~~l~v~a~D~~~~~~~~~~~v~I~V~DvNDn~P~f~~~~~~~v~e~~~-g~~-v~~v~a~D~------D~~~n~~~~ 87 (195)
...+.+.+.++|..|. .+.+.+..|.|++=+ +|.+. +.+.-... +.. .....|.=+ -.-....++
T Consensus 297 ~G~~Ti~atVtDSRGr-~S~~~~~tItVl~Y~--~P~ls----fsv~R~~~~~~~~~v~~~a~Iapl~v~g~qKN~~~lt 369 (624)
T PF05895_consen 297 SGSATIRATVTDSRGR-TSDPKTKTITVLEYS--PPTLS----FSVYRCGSSGNTLTVTRNAKIAPLTVNGVQKNTMTLT 369 (624)
T ss_pred CceEEEEEEEEECCCc-cCCceEEEEEEEEcC--CCcEE----EEEEEeCCCCcEEEEEEEEEEeEEEEcccccceEEEE
Confidence 4578899999998773 345677888888764 66652 22221111 111 111122111 111122466
Q ss_pred EEEEeeCCCCeecEEEeCCC-------------CEEEEceeCCCCCEEEEEEEEEECC
Q psy5884 88 YSIYHVSNTGLHKFRIDPRS-------------GVIETTGKLIAGEQYSITVQATDTG 132 (195)
Q Consensus 88 y~i~~~~~~~~~~F~id~~t-------------G~i~~~~~ld~~~~~~l~v~a~D~~ 132 (195)
|+... .+...|.+|... +...+...++.+..|.+.+.++|.-
T Consensus 370 ~~~a~---~gt~~~t~d~~~a~~~~s~~s~~~~~~~~L~g~y~~~kSy~V~~~l~D~F 424 (624)
T PF05895_consen 370 FKVAP---LGTGTFTTDNGSASGTWSSISELTNSSANLGGTYDAEKSYDVRGTLSDKF 424 (624)
T ss_pred EEEEE---cCcceEEEEccccccceeeeeeecccceeeccccCCCceEEEEEEEEEEe
Confidence 66543 234556654321 1233445567788999999999953
No 43
>PF09100 Qn_am_d_aIV: Quinohemoprotein amine dehydrogenase, alpha subunit domain IV; InterPro: IPR015184 This domain is predominantly found in the prokaryotic protein quinohemoprotein amine dehydrogenase, adopting an immunoglobulin-like beta-sandwich fold, with seven strands arranged into two beta sheets; the fold is possibly related to the immunoglobulin and/or fibronectin type III superfamilies. The precise function of this domain has not, as yet, been defined []. ; PDB: 1JMZ_A 1JMX_A 1PBY_A 1JJU_A.
Probab=28.68 E-value=1.8e+02 Score=20.30 Aligned_cols=31 Identities=23% Similarity=0.283 Sum_probs=15.7
Q ss_pred eEEEEEEEEECC-CCCeEeeEEEEEEEEecCC
Q psy5884 18 KFRFQVIAREKQ-GTAASVPVSLTVNLNDIND 48 (195)
Q Consensus 18 ~~~l~v~a~D~~-~~~~~~~~~v~I~V~DvND 48 (195)
.=.|.|.|+=.. +.++++...+.|+|..-|+
T Consensus 99 ~Gnl~VvAtv~d~~~~l~~e~~liVtVqr~~~ 130 (133)
T PF09100_consen 99 AGNLKVVATVKDGGKPLTGEAHLIVTVQRWNN 130 (133)
T ss_dssp -EEEEEEEEETTTT---EEEEEEEEE---S--
T ss_pred cccEEEEEEEccCCcccceeEeEEEEeecccC
Confidence 345566665433 4589999999999998886
No 44
>cd02848 Chitinase_N_term Chitinase N-terminus domain. Chitinases hydrolyze the abundant natural biopolymer chitin, producing smaller chito-oligosaccharides. Chitin consists of multiple N-acetyl-D-glucosamine (NAG) residues connected via beta-1,4-glycosidic linkages and is an important structural element of fungal cell wall and arthropod exoskeletons. On the basis of the mode of chitin hydrolysis, chitinases are classified as random, endo-, and exo-chitinases and based on sequence criteria, chitinases belong to families 18 and 19 of glycosyl hydrolases. The N-terminus of chitinase may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitob
Probab=28.21 E-value=88 Score=21.05 Aligned_cols=28 Identities=18% Similarity=0.328 Sum_probs=17.7
Q ss_pred CCCCCCCCCeEEEEEEEEECCCCCeEeeEE
Q psy5884 9 LDRDPPNPGKFRFQVIAREKQGTAASVPVS 38 (195)
Q Consensus 9 LD~E~~~~~~~~l~v~a~D~~~~~~~~~~~ 38 (195)
+++.+ ...|.++|.++|..+=..|+...
T Consensus 74 ~~v~k--gG~y~m~V~lCn~dGCS~S~~~~ 101 (106)
T cd02848 74 FKVGK--GGRYQMQVALCNGDGCSTSAAKE 101 (106)
T ss_pred EEeCC--CCeEEEEEEEECCCCccCcCCEE
Confidence 34444 56788899988887644444333
No 45
>PF08329 ChitinaseA_N: Chitinase A, N-terminal domain; InterPro: IPR013540 This domain is found in a number of bacterial chitinases and similar viral proteins. It is organised into a fibronectin III module domain-like fold, comprising only beta strands. Its function is not known, but it may be involved in interaction with the enzyme substrate, chitin [, ]. It is separated by a hinge region from the catalytic domain (IPR001223 from INTERPRO); this hinge region is probably mobile, allowing the N-terminal domain to have different relative positions in solution []. ; GO: 0004568 chitinase activity; PDB: 2WLY_A 1EDQ_A 2WM0_A 1X6N_A 1NH6_A 2WK2_A 1EHN_A 2WLZ_A 1EIB_A 1FFR_A ....
Probab=26.14 E-value=1.2e+02 Score=21.32 Aligned_cols=28 Identities=32% Similarity=0.504 Sum_probs=20.7
Q ss_pred CCCEEEEEEEEEECCCCceeEEEEEEEE
Q psy5884 118 AGEQYSITVQATDTGGKSSQTIVEVSVV 145 (195)
Q Consensus 118 ~~~~~~l~v~a~D~~~~~~~~~v~i~V~ 145 (195)
....|.+.|++.|..+-+.++.+.|.|.
T Consensus 81 ~gG~y~~~VeLCN~~GCS~S~~~~V~Va 108 (133)
T PF08329_consen 81 KGGRYQMQVELCNADGCSTSAPVEVVVA 108 (133)
T ss_dssp S-EEEEEEEEEEETTEEEE---EEEEEE
T ss_pred CCCEEEEEEEEECCCCcccCCCEEEEEe
Confidence 3458999999999888777778889888
No 46
>KOG4221|consensus
Probab=26.12 E-value=6.5e+02 Score=24.92 Aligned_cols=68 Identities=18% Similarity=0.269 Sum_probs=44.0
Q ss_pred EEEEEEeeCCCCeecEEEeCCCCEEEEceeCCCCCEEEEEEEEEECCCC-ceeEEEEEEEEcCCCCCCCe
Q psy5884 86 ITYSIYHVSNTGLHKFRIDPRSGVIETTGKLIAGEQYSITVQATDTGGK-SSQTIVEVSVVPGPNTRSPV 154 (195)
Q Consensus 86 ~~y~i~~~~~~~~~~F~id~~tG~i~~~~~ld~~~~~~l~v~a~D~~~~-~~~~~v~i~V~~d~nd~~P~ 154 (195)
..|.+....++....+.++.++-+.++. .|+.+..|.+.|.|.+..+. .+++.+.|.-..|+-+.||.
T Consensus 552 ~~yk~~ys~~~~~~~~~~~~n~~e~ti~-gL~k~TeY~~~vvA~N~~G~g~sS~~i~V~Tlsd~PsaPP~ 620 (1381)
T KOG4221|consen 552 TGYKLFYSEDDTGKELRVENNATEYTIN-GLEKYTEYSIRVVAYNSAGSGVSSADITVRTLSDVPSAPPQ 620 (1381)
T ss_pred eEEEEEEEcCCCCceEEEecCccEEEee-cCCCccceEEEEEEecCCCCCCCCCceEEEeccCCCCCCCc
Confidence 4566554334455678887755555443 66777789999999998763 33455555444477788887
No 47
>cd01758 PLAT_LPL PLAT/ LH2 domain present in lipoprotein lipase (LPL). LPL is a key enzyme in catabolism of plasma lipoprotein triglycerides (TGs) and has therefeore has a profound influence on triglyceride and high-density lipoprotein (HDL) cholesterol levels in the blood. In general, PLAT/LH2 domain's proposed function is to mediate interaction with lipids or membrane bound proteins.
Probab=24.84 E-value=2.5e+02 Score=19.70 Aligned_cols=25 Identities=8% Similarity=0.297 Sum_probs=14.9
Q ss_pred EEEEEEEEEECCCCc-eeEEEEEEEE
Q psy5884 121 QYSITVQATDTGGKS-SQTIVEVSVV 145 (195)
Q Consensus 121 ~~~l~v~a~D~~~~~-~~~~v~i~V~ 145 (195)
+|.++|......... ..+++.|.+.
T Consensus 2 hYqVtV~~~~~~~~~~t~~~v~i~L~ 27 (137)
T cd01758 2 HYQLKIHFFNQTNRIETDPTFTISLY 27 (137)
T ss_pred eEEEEEEEecccCCCcccceEEEEEE
Confidence 477777775543322 4566777776
Done!