Query psy9046
Match_columns 215
No_of_seqs 290 out of 1336
Neff 9.3
Searched_HMMs 46136
Date Fri Aug 16 17:34:21 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy9046.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/9046hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG4289|consensus 100.0 4.7E-38 1E-42 282.4 18.4 191 1-215 252-452 (2531)
2 KOG4289|consensus 100.0 4E-34 8.8E-39 257.3 16.1 190 1-214 766-968 (2531)
3 KOG1219|consensus 100.0 6E-32 1.3E-36 250.1 21.6 191 1-215 934-1137(4289)
4 cd00031 CA Cadherin repeat dom 100.0 4.1E-30 8.8E-35 198.9 24.0 170 28-215 1-181 (199)
5 KOG1219|consensus 100.0 8.5E-31 1.8E-35 242.6 22.8 189 1-215 2559-2757(4289)
6 KOG1834|consensus 99.8 5.1E-19 1.1E-23 151.4 18.0 180 10-215 25-226 (952)
7 cd00031 CA Cadherin repeat dom 99.7 2.1E-16 4.6E-21 122.0 14.4 108 1-130 85-199 (199)
8 PF00028 Cadherin: Cadherin do 99.7 2.6E-15 5.6E-20 102.3 11.3 85 29-129 1-93 (93)
9 PF00028 Cadherin: Cadherin do 99.6 5.2E-15 1.1E-19 100.7 11.1 71 143-215 1-75 (93)
10 smart00112 CA Cadherin repeats 99.5 2.3E-13 4.9E-18 89.8 8.5 72 49-136 1-79 (79)
11 KOG1834|consensus 99.1 1.2E-09 2.6E-14 94.7 11.1 106 2-131 133-245 (952)
12 smart00112 CA Cadherin repeats 98.8 2E-08 4.4E-13 65.9 6.4 51 165-215 1-55 (79)
13 PF08758 Cadherin_pro: Cadheri 97.9 7.4E-05 1.6E-09 50.1 6.7 75 135-215 3-77 (90)
14 PF08266 Cadherin_2: Cadherin- 97.1 0.00057 1.2E-08 45.2 3.3 58 143-203 3-65 (84)
15 PF08266 Cadherin_2: Cadherin- 97.0 0.0008 1.7E-08 44.5 3.4 56 29-88 3-65 (84)
16 PF13750 Big_3_3: Bacterial Ig 96.9 0.057 1.2E-06 40.2 12.6 114 99-215 10-133 (158)
17 PF08758 Cadherin_pro: Cadheri 96.6 0.034 7.4E-07 37.3 9.0 78 15-118 3-81 (90)
18 smart00736 CADG Dystroglycan-t 96.2 0.11 2.3E-06 35.3 9.5 68 48-133 24-96 (97)
19 TIGR01965 VCBS_repeat VCBS rep 95.8 0.15 3.2E-06 34.8 8.6 86 44-150 2-97 (99)
20 PF05345 He_PIG: Putative Ig d 93.6 0.37 8E-06 28.3 5.4 34 182-215 12-46 (49)
21 smart00736 CADG Dystroglycan-t 93.1 0.78 1.7E-05 31.0 7.3 51 163-215 23-77 (97)
22 PF07495 Y_Y_Y: Y_Y_Y domain; 84.2 2.6 5.5E-05 25.9 4.2 30 100-129 35-66 (66)
23 KOG3597|consensus 82.6 23 0.00049 31.0 10.5 55 1-61 25-81 (442)
24 PF12245 Big_3_2: Bacterial Ig 80.4 10 0.00022 23.1 5.8 35 98-132 17-51 (60)
25 TIGR01965 VCBS_repeat VCBS rep 79.4 9.6 0.00021 26.0 5.9 37 160-196 2-39 (99)
26 PF15418 DUF4625: Domain of un 74.2 11 0.00023 27.2 5.3 36 93-128 96-131 (132)
27 PF02494 HYR: HYR domain; Int 70.0 10 0.00022 24.4 4.1 25 104-128 57-81 (81)
28 KOG3597|consensus 69.5 33 0.00072 30.0 8.1 59 120-180 24-83 (442)
29 PF13754 Big_3_4: Bacterial Ig 59.7 28 0.00061 20.6 4.3 17 102-118 22-38 (54)
30 cd00146 PKD polycystic kidney 59.2 27 0.00058 22.1 4.5 27 102-128 55-81 (81)
31 PF05895 DUF859: Siphovirus pr 58.9 1.5E+02 0.0033 27.3 11.1 34 104-139 298-332 (624)
32 PF13753 SWM_repeat: Putative 56.4 1.1E+02 0.0025 25.1 10.5 104 103-215 11-121 (317)
33 PF03160 Calx-beta: Calx-beta 55.8 57 0.0012 21.6 6.1 50 125-180 2-53 (100)
34 TIGR00845 caca sodium/calcium 54.7 2.1E+02 0.0046 27.8 17.3 54 122-180 515-569 (928)
35 smart00089 PKD Repeats in poly 54.3 42 0.00092 21.0 4.9 26 102-128 53-78 (79)
36 TIGR03660 T1SS_rpt_143 T1SS-14 52.4 88 0.0019 22.7 11.6 43 106-152 87-129 (137)
37 PF13750 Big_3_3: Bacterial Ig 47.6 58 0.0013 24.1 5.2 32 100-131 119-150 (158)
38 PRK14081 triple tyrosine motif 47.4 2.4E+02 0.0052 26.3 16.2 32 101-132 155-189 (667)
39 TIGR00845 caca sodium/calcium 43.5 3.2E+02 0.007 26.6 11.1 28 3-37 516-543 (928)
40 PRK13211 N-acetylglucosamine-b 37.7 3E+02 0.0064 24.6 14.1 33 99-131 374-407 (478)
41 cd02848 Chitinase_N_term Chiti 36.1 61 0.0013 22.4 3.5 29 102-130 78-106 (106)
42 PF08329 ChitinaseA_N: Chitina 35.0 62 0.0013 23.4 3.5 31 102-132 81-111 (133)
43 PF07550 DUF1533: Protein of u 25.7 1.4E+02 0.003 18.3 3.6 33 182-214 24-56 (65)
No 1
>KOG4289|consensus
Probab=100.00 E-value=4.7e-38 Score=282.41 Aligned_cols=191 Identities=29% Similarity=0.430 Sum_probs=178.1
Q ss_pred CeEEEEEEEecCCCCCccCCCCCCCCCceEEEEecCCCCCcEEEEEEEEeCCCCce---eEEEecCC--CcEEEeCCccc
Q psy9046 1 MGTLIISIIDVNDIPPEFEPPWTRDEPNYLLEIEEEQPIGSLLGKFTALDKDSSIA---LYVLEQDG--DMFSVNNITVC 75 (215)
Q Consensus 1 t~~v~I~V~DvND~~P~f~~~~~~~~~~~~~~V~E~~~~g~~v~~v~a~D~D~~~~---~y~i~~~~--~~F~id~~tg~ 75 (215)
|++|+|.|+|+|||.|+| ++..|.-++.||.++|..|.++.|+|.|.+.+ .|++..|+ ..|.|++.+|.
T Consensus 252 tttv~V~V~D~nDhsPvF------Eq~~Y~e~lREn~evGy~vLtvrAtD~Dsp~Nani~Yrl~eg~~~~~f~in~rSGv 325 (2531)
T KOG4289|consen 252 TTTVTVLVLDTNDHSPVF------EQDEYREELRENLEVGYEVLTVRATDGDSPPNANIRYRLLEGNAKNVFEINPRSGV 325 (2531)
T ss_pred eeEEEEEEeecCCCCccc------chhHHHHHHhhccccCceEEEEEeccCCCCCCCceEEEecCCCccceeEEcCccce
Confidence 689999999999999999 79999999999999999999999999999876 59997764 79999999998
Q ss_pred eEEEEEEEecCCCcceeeEEEEecCCCCCeEEEEEEEEeCCCC--ceEEEEEEEEeeCCCCCCccCCCceEEEEecCCCC
Q psy9046 76 EFQLSVDLAAADNFQSEYNATIKENSPGDTKVVLVSAEDASTP--CSHFRVNIKVLDVNDNRPEFKSKTYRTSIQENIII 153 (215)
Q Consensus 76 ~~~~~~~~~ld~e~~~~~~~~~~e~~~~~~~~l~v~a~D~~~~--~~~~~v~I~V~DvNdn~P~f~~~~~~~~v~E~~~~ 153 (215)
|.+-. ++|||....|+ |.|.|+|.|.+ ..++.|.|+|.|+|||+|+|..+.|.++|.|+..+
T Consensus 326 I~T~a---~lDRE~~~~y~-------------L~VeAsDqG~~pgp~Ta~V~itV~D~NDNaPqFse~~Yvvqv~Edvt~ 389 (2531)
T KOG4289|consen 326 ISTRA---PLDREELESYQ-------------LDVEASDQGRPPGPRTAMVEITVEDENDNAPQFSEKRYVVQVREDVTP 389 (2531)
T ss_pred eeccC---ccCHHhhhheE-------------EEEEeccCCCCCCCceEEEEEEEEecCCCCccccccceEEEecccCCC
Confidence 55443 79999999999 79999999876 56999999999999999999999999999999986
Q ss_pred CCCCEEEEEEEEeCCC--CCeEEEEEeeCCCCCcEEEcC-CeeEEEcccCCCCccEEEEEEEEEC
Q psy9046 154 TPSPTLLTLEATDSDE--NPHLTYKILSGNDDDVFAVDD-TGILTIQGPLAKGKTVYNLLVQVSD 215 (215)
Q Consensus 154 g~~~~i~~v~a~D~D~--~~~i~y~i~~~~~~~~F~Id~-~G~i~~~~~ld~~~~~y~l~V~a~D 215 (215)
+ +.|++|+|+|.|. |+.++|+|.+|+..+.|.||. +|+|.+..+||.+...|++.|+|+|
T Consensus 390 ~--avvlrV~AtDrD~g~Ng~VHYsi~Sgn~~G~f~id~~tGel~vv~plD~e~~~ytl~IrAqD 452 (2531)
T KOG4289|consen 390 P--AVVLRVTATDRDKGTNGKVHYSIASGNGRGQFYIDSLTGELDVVEPLDFENSEYTLRIRAQD 452 (2531)
T ss_pred C--ceEEEEEecccCCCcCceEEEEeeccCccccEEEecccceEEEeccccccCCeeEEEEEccc
Confidence 6 9999999999999 999999999999999999999 9999999999988779999999987
No 2
>KOG4289|consensus
Probab=100.00 E-value=4e-34 Score=257.32 Aligned_cols=190 Identities=27% Similarity=0.403 Sum_probs=174.4
Q ss_pred CeEEEEEEEecCCCCCccCCCCCCCCCceEEEEecCCCCCcEEEEEEEEeCCCCcee---EEEe---cCCCcEEEeCCcc
Q psy9046 1 MGTLIISIIDVNDIPPEFEPPWTRDEPNYLLEIEEEQPIGSLLGKFTALDKDSSIAL---YVLE---QDGDMFSVNNITV 74 (215)
Q Consensus 1 t~~v~I~V~DvND~~P~f~~~~~~~~~~~~~~V~E~~~~g~~v~~v~a~D~D~~~~~---y~i~---~~~~~F~id~~tg 74 (215)
|++|.|.|.|+|||+|+| ..+.|.++|.|+.|+++.+++|+|+|+|.|.++ |.+. ++.+.|.|++.+|
T Consensus 766 tttveV~v~diNDnaPqf------~assyt~sV~Ed~Pv~TsvlQVSatDaD~g~Ng~v~y~~qg~~d~p~~F~IEptSG 839 (2531)
T KOG4289|consen 766 TTTVEVLVNDINDNAPQF------LASSYTGSVFEDAPVFTSVLQVSATDADSGPNGRVYYTFQGGDDGPGDFYIEPTSG 839 (2531)
T ss_pred cEEEEEEeecccccCccc------chhhceeEeecCCCCcceEEEEEEeccCCCCCceEEEEecCCCCCCCceEEccCcc
Confidence 678999999999999999 589999999999999999999999999998874 5553 3458999999999
Q ss_pred ceEEEEEEEecCCCcceeeEEEEecCCCCCeEEEEEEEEeCCCC--ceEEEEEEEEeeCCCCCCccCCCceEEEEecCCC
Q psy9046 75 CEFQLSVDLAAADNFQSEYNATIKENSPGDTKVVLVSAEDASTP--CSHFRVNIKVLDVNDNRPEFKSKTYRTSIQENII 152 (215)
Q Consensus 75 ~~~~~~~~~~ld~e~~~~~~~~~~e~~~~~~~~l~v~a~D~~~~--~~~~~v~I~V~DvNdn~P~f~~~~~~~~v~E~~~ 152 (215)
.+.+++ +||||....|. |.+.|.|.|.| ++.+.|+|+|+|+|||||+|.+..|...|.||.+
T Consensus 840 viRtl~---rLdRE~~avy~-------------L~a~avDrg~p~ls~~~eItvtvldvNDnaPvfe~~e~e~~I~ensp 903 (2531)
T KOG4289|consen 840 VIRTLR---RLDRENVAVYV-------------LAAYAVDRGNPPLSAPVEITVTVLDVNDNAPVFEQDELELFIEENSP 903 (2531)
T ss_pred eeehhh---hhcchheeEEE-------------EEEEEeeCCCCCcCCceEEEEEEEecCCCCCCCCCcceeeEEeecCc
Confidence 987777 69999999998 78999999987 8889999999999999999999999999999999
Q ss_pred CCCCCEEEEEEEEeCCC--CCeEEEEEeeCCCCCcEEEcC-CeeEEEcccCCCC--ccEEEEEEEEE
Q psy9046 153 ITPSPTLLTLEATDSDE--NPHLTYKILSGNDDDVFAVDD-TGILTIQGPLAKG--KTVYNLLVQVS 214 (215)
Q Consensus 153 ~g~~~~i~~v~a~D~D~--~~~i~y~i~~~~~~~~F~Id~-~G~i~~~~~ld~~--~~~y~l~V~a~ 214 (215)
+| ..+++++|.|+|+ |++|.|+|++|+....|.++. .|+|...-.||++ ...|.+.++|+
T Consensus 904 vg--s~va~i~a~dpdEG~NA~IsYqIvgg~d~~~fq~de~~~~lla~~efdyef~~~eyv~~~qat 968 (2531)
T KOG4289|consen 904 VG--SVVALITADDPDEGPNAHISYQIVGGNDPELFQLDEFSGELLALVEFDYEFTRVEYVLVVQAT 968 (2531)
T ss_pred cc--eeeEEEEccCCCcCCcceEEEeeccCccHHHHHHHHhhhhhhhheeehhhhccceeeEEeecc
Confidence 98 9999999999999 999999999999899999999 9999999999965 67888888774
No 3
>KOG1219|consensus
Probab=100.00 E-value=6e-32 Score=250.11 Aligned_cols=191 Identities=35% Similarity=0.463 Sum_probs=175.5
Q ss_pred CeEEEEEEEecCCC--CCccCCCCCCCCCceEEEEecCCCCCcEEEEEEEEeCCCCc---eeEEEecC--CCcEEEeCCc
Q psy9046 1 MGTLIISIIDVNDI--PPEFEPPWTRDEPNYLLEIEEEQPIGSLLGKFTALDKDSSI---ALYVLEQD--GDMFSVNNIT 73 (215)
Q Consensus 1 t~~v~I~V~DvND~--~P~f~~~~~~~~~~~~~~V~E~~~~g~~v~~v~a~D~D~~~---~~y~i~~~--~~~F~id~~t 73 (215)
+|.+.|.|+|+|.| ||.| ..-.-.++|.||+|+|+.++++.|.|.|.|. ..|+|..+ .+.|+|+..+
T Consensus 934 ~chl~Vevldv~enlhpp~F------~~~v~e~~V~EnapiGT~vi~i~A~dedsgldg~l~Y~I~~gdg~g~FsId~~t 1007 (4289)
T KOG1219|consen 934 ICHLEVEVLDVNENLHPPEF------ISFVTEGHVLENAPIGTIVIRIQARDEDSGLDGELSYKIRTGDGDGIFSIDSTT 1007 (4289)
T ss_pred eEEEEEEEeccCCCCCCcch------heeeeeeeEeecCCcceEEEEEEEecCCCCccceEEEEEEcCCcceeEEecCCc
Confidence 47899999999876 9999 4667789999999999999999999999864 46999654 5799999999
Q ss_pred cceEEEEEEEecCCCcceeeEEEEecCCCCCeEEEEEEEEeCCCC--ceEEEEEEEEeeCCCCCCccCCCceEEEEecCC
Q psy9046 74 VCEFQLSVDLAAADNFQSEYNATIKENSPGDTKVVLVSAEDASTP--CSHFRVNIKVLDVNDNRPEFKSKTYRTSIQENI 151 (215)
Q Consensus 74 g~~~~~~~~~~ld~e~~~~~~~~~~e~~~~~~~~l~v~a~D~~~~--~~~~~v~I~V~DvNdn~P~f~~~~~~~~v~E~~ 151 (215)
|.+.++. .||||.++.|. |+|.|+|.|.. ++.+.+.|.|.|+|||+|+|.++.|..+|.|++
T Consensus 1008 G~irTl~---~lDrE~ks~Yw-------------ltveA~D~gt~~~ssv~~vyI~ieDvNDn~Pq~s~pvy~asI~enS 1071 (4289)
T KOG1219|consen 1008 GSIRTLK---ALDREKKSSYW-------------LTVEAKDLGTVPLSSVCEVYIEIEDVNDNVPQFSSPVYYASISENS 1071 (4289)
T ss_pred ceEeech---hhchhhcceEE-------------EEEEEEecCCCccccceeEEEEEEecCCCCcccCCceEeeeeccCC
Confidence 9988887 59999999999 79999999976 889999999999999999999999999999999
Q ss_pred CCCCCCEEEEEEEEeCCC--CCeEEEEEeeCCCCCcEEEcC-CeeEEEcccCCCC-ccEEEEEEEEEC
Q psy9046 152 IITPSPTLLTLEATDSDE--NPHLTYKILSGNDDDVFAVDD-TGILTIQGPLAKG-KTVYNLLVQVSD 215 (215)
Q Consensus 152 ~~g~~~~i~~v~a~D~D~--~~~i~y~i~~~~~~~~F~Id~-~G~i~~~~~ld~~-~~~y~l~V~a~D 215 (215)
+.+ ..|.++.|.|+|. |++++|.|.+|+..++|.|++ +|.|+..+.||+| +..|.|.|.++|
T Consensus 1072 p~~--vsivq~ea~D~Dsssn~kLmykI~sGnyq~FF~Id~~TG~iTt~r~LDRE~qdEHiLeVTi~D 1137 (4289)
T KOG1219|consen 1072 PET--VSIVQAEANDPDSSSNQKLMYKITSGNYQGFFQIDPETGLITTIRRLDREKQDEHILEVTIQD 1137 (4289)
T ss_pred CCc--eEEEEeccCCCCcccCcceEEEEccCCccceEEEccccceeeeehhhcccccccceEEEEEec
Confidence 988 9999999999996 999999999999999999999 9999999999965 889999999887
No 4
>cd00031 CA Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here
Probab=99.98 E-value=4.1e-30 Score=198.86 Aligned_cols=170 Identities=38% Similarity=0.570 Sum_probs=153.2
Q ss_pred ceEEEEecCCCCCcEEEEEEEEeCCCCc---eeEEEecCC--CcEEEeCCccceEEEEEEEecCCCcceeeEEEEecCCC
Q psy9046 28 NYLLEIEEEQPIGSLLGKFTALDKDSSI---ALYVLEQDG--DMFSVNNITVCEFQLSVDLAAADNFQSEYNATIKENSP 102 (215)
Q Consensus 28 ~~~~~V~E~~~~g~~v~~v~a~D~D~~~---~~y~i~~~~--~~F~id~~tg~~~~~~~~~~ld~e~~~~~~~~~~e~~~ 102 (215)
.|.+.|+|+++.|+.++++.|.|+|.+. ..|+|.++. .+|.|++.+|.+...+ .||||....|.
T Consensus 1 ~~~~~i~En~~~g~~v~~~~a~D~D~~~~~~~~y~i~~~~~~~~F~i~~~tG~l~~~~---~lD~e~~~~~~-------- 69 (199)
T cd00031 1 SYSVSVPENAPPGTVVGTVSATDPDSGENGRVTYSILGGNEDGLFSIDPNTGVITTTK---PLDREEQSEYT-------- 69 (199)
T ss_pred CeEEEEeCCCCCCCEEEEEEEECCCCCCCceEEEEEeCCCCcccEEEeCCCCEEEECC---CCCCcCCceEE--------
Confidence 3789999999999999999999999974 479998766 5999999999865544 69999999988
Q ss_pred CCeEEEEEEEEeCCCCc--eEEEEEEEEeeCCCCCCccCCCceEEEEecCCCCCCCCEEEEEEEEeCCC--CCeEEEEEe
Q psy9046 103 GDTKVVLVSAEDASTPC--SHFRVNIKVLDVNDNRPEFKSKTYRTSIQENIIITPSPTLLTLEATDSDE--NPHLTYKIL 178 (215)
Q Consensus 103 ~~~~~l~v~a~D~~~~~--~~~~v~I~V~DvNdn~P~f~~~~~~~~v~E~~~~g~~~~i~~v~a~D~D~--~~~i~y~i~ 178 (215)
|.|+|+|.+.+. .+..++|.|.|+|||+|.|....|.+.+.|+.+.| +.++++.|+|+|. ++.++|+|.
T Consensus 70 -----l~v~a~D~g~~~~~~~~~v~I~V~d~Nd~~P~~~~~~~~~~v~e~~~~~--~~i~~~~a~D~D~~~~~~~~y~l~ 142 (199)
T cd00031 70 -----LTVVASDGGGPPLSSTATVTVTVLDVNDNPPVFEQSSYEASVPENAPPG--TVVGTVTATDADSGENAKLTYSIL 142 (199)
T ss_pred -----EEEEEEECCcCcceeEEEEEEEEccCCCCCCcccccceEEEEeCCCCCC--CEEEEEEEEcCCCCCCccEEEEEe
Confidence 789999987775 89999999999999999999889999999999977 9999999999998 799999999
Q ss_pred eCCCCCcEEEcC-CeeEEEcccCCCC-ccEEEEEEEEEC
Q psy9046 179 SGNDDDVFAVDD-TGILTIQGPLAKG-KTVYNLLVQVSD 215 (215)
Q Consensus 179 ~~~~~~~F~Id~-~G~i~~~~~ld~~-~~~y~l~V~a~D 215 (215)
++...+.|.|++ +|.|++.+.||++ ...|.|.|.|+|
T Consensus 143 ~~~~~~~f~i~~~~G~i~~~~~ld~e~~~~~~l~v~a~D 181 (199)
T cd00031 143 SGNDKELFSIDPNTGIITLAKPLDREEKSSYELTVVATD 181 (199)
T ss_pred CCCCCCEEEEeCCceEEEeCCccCCccCceEEEEEEEEE
Confidence 876568999999 9999999999965 569999999987
No 5
>KOG1219|consensus
Probab=99.98 E-value=8.5e-31 Score=242.59 Aligned_cols=189 Identities=27% Similarity=0.374 Sum_probs=168.6
Q ss_pred CeEEEEEEEecCCCCCccCCCCCCCCCceEEEEecCCCCCcEEEEEEEEeCCCCceeEEEecC------CCcEEEeCCcc
Q psy9046 1 MGTLIISIIDVNDIPPEFEPPWTRDEPNYLLEIEEEQPIGSLLGKFTALDKDSSIALYVLEQD------GDMFSVNNITV 74 (215)
Q Consensus 1 t~~v~I~V~DvND~~P~f~~~~~~~~~~~~~~V~E~~~~g~~v~~v~a~D~D~~~~~y~i~~~------~~~F~id~~tg 74 (215)
+++|.|+|++..++.|+| +.+.|.++|+|+.++|+.|++++|.|.|...+...+.++ ...|.+|+.||
T Consensus 2559 ~ttV~v~vl~e~v~lPrF------Sep~y~fsvpEDv~vG~~Ig~v~a~~a~~~~i~~~v~~gt~Esn~d~~Fsvdr~TG 2632 (4289)
T KOG1219|consen 2559 NTTVIVTVLPEDVNLPRF------SEPIYTFSVPEDVPVGEEIGQVSASDADEHVIYSLVLGGTPESNPDLPFSVDRNTG 2632 (4289)
T ss_pred ceEEEEEecCcccCcccc------cCceEEEeccccCCCCCeeeEEeecccCCceEEEEEeCCCCCCCCCCceEEcCCCc
Confidence 478999999999999999 688999999999999999999999999976653333222 25799999999
Q ss_pred ceEEEEEEEecCCCcceeeEEEEecCCCCCeEEEEEEEEeCCCCceEEEEEEEEeeCCCCCCccCCCceEEEEecCCCCC
Q psy9046 75 CEFQLSVDLAAADNFQSEYNATIKENSPGDTKVVLVSAEDASTPCSHFRVNIKVLDVNDNRPEFKSKTYRTSIQENIIIT 154 (215)
Q Consensus 75 ~~~~~~~~~~ld~e~~~~~~~~~~e~~~~~~~~l~v~a~D~~~~~~~~~v~I~V~DvNdn~P~f~~~~~~~~v~E~~~~g 154 (215)
.+..-+ +||+|.+.+|+ +.|+|++.++--+..+|.|.|.|+|||+|+|+.+.|.+.+.||+++|
T Consensus 2633 ~i~v~k---sLD~E~kk~yq-------------i~v~a~~~~~vva~tsv~vqVkDvNDNaPvFe~d~y~f~i~En~pvG 2696 (4289)
T KOG1219|consen 2633 MIKVNK---SLDHEKKKSYQ-------------IKVKATCGQWVVAETSVFVQVKDVNDNAPVFEKDPYLFIIEENSPVG 2696 (4289)
T ss_pred eEEecc---ccchhhhceEE-------------EEEEeecCCceEEEEEEEEEeecccCCCccccCCceeEEEeccCCCC
Confidence 865444 79999999999 68999998876788899999999999999999999999999999998
Q ss_pred CCCEEEEEEEEeCCC--CCeEEEEEeeCCCCCcEEEcC-CeeEEEcccCCCC-ccEEEEEEEEEC
Q psy9046 155 PSPTLLTLEATDSDE--NPHLTYKILSGNDDDVFAVDD-TGILTIQGPLAKG-KTVYNLLVQVSD 215 (215)
Q Consensus 155 ~~~~i~~v~a~D~D~--~~~i~y~i~~~~~~~~F~Id~-~G~i~~~~~ld~~-~~~y~l~V~a~D 215 (215)
+.|++++|.|.|. |++|+|++... ...|.|++ +|.|++...||.+ ...|.|.|.|+|
T Consensus 2697 --tsV~qf~AsD~Ds~~nGqirysl~~~--v~yF~In~etGwlTt~~eld~ek~d~y~lkv~AtD 2757 (4289)
T KOG1219|consen 2697 --TSVIQFHASDMDSGNNGQIRYSLTSP--VPYFAINPETGWLTTLFELDLEKQDLYSLKVVATD 2757 (4289)
T ss_pred --ceEEEEEeeccCCCCCceEEEEEcCC--cceEEEcCCCCeeeehhhhccccCCceEEEEEEec
Confidence 9999999999999 89999999875 33899999 9999999999955 899999999998
No 6
>KOG1834|consensus
Probab=99.83 E-value=5.1e-19 Score=151.38 Aligned_cols=180 Identities=27% Similarity=0.328 Sum_probs=150.8
Q ss_pred ecCCCCCccCCCCCCCCCceEEEEecCCCCCcEEEEEEEEeCCCC------ceeEEEecCCCcEEE---eCCccceEEEE
Q psy9046 10 DVNDIPPEFEPPWTRDEPNYLLEIEEEQPIGSLLGKFTALDKDSS------IALYVLEQDGDMFSV---NNITVCEFQLS 80 (215)
Q Consensus 10 DvND~~P~f~~~~~~~~~~~~~~V~E~~~~g~~v~~v~a~D~D~~------~~~y~i~~~~~~F~i---d~~tg~~~~~~ 80 (215)
-+|-+.|.. ...|...|.||...=.....+-|.|.|.+ .-+|.|.+..-.|.+ |..||+ -.++
T Consensus 25 rankhkpwi-------e~ey~gvV~Endntvll~Ppl~aLdkdaplr~ageiC~fklhgq~vPFdavVvdK~TGe-gvlR 96 (952)
T KOG1834|consen 25 RANKHKPWI-------EEEYHGVVTENDNTVLLDPPLAALDKDAPLRYAGEICGFKLHGQPVPFDAVVVDKYTGE-GVLR 96 (952)
T ss_pred cccccCccc-------ccceeEEEEeCCceEEeCCCeeeecCCCCcccccccceeEecCCCCCceEEEEeccCCc-eEEe
Confidence 356667776 78899999999764444456889999984 336998776666654 888997 5666
Q ss_pred EEEecCCCcceeeEEEEecCCCCCeEEEEEEEEeCCCC--------ceEEEEEEEEeeCCCCCCccCCCceEEEEecCCC
Q psy9046 81 VDLAAADNFQSEYNATIKENSPGDTKVVLVSAEDASTP--------CSHFRVNIKVLDVNDNRPEFKSKTYRTSIQENII 152 (215)
Q Consensus 81 ~~~~ld~e~~~~~~~~~~e~~~~~~~~l~v~a~D~~~~--------~~~~~v~I~V~DvNdn~P~f~~~~~~~~v~E~~~ 152 (215)
++.+||-|.++.|+ ++|+|.|.|.. +..++|+|+|.|+|+++|+|..+.|.+.|.|+..
T Consensus 97 aK~~lDCelqkeyt-------------f~iQAydCg~gpdgtn~kKShkatvhIrVkDvNe~AP~f~ep~Yka~V~EGK~ 163 (952)
T KOG1834|consen 97 AKEPLDCELQKEYT-------------FTIQAYDCGNGPDGTNTKKSHKATVHIRVKDVNEFAPVFKEPWYKAHVTEGKV 163 (952)
T ss_pred ecCcccccccccce-------------EEEEEEecCCCCCccccccccceEEEEEeccccccCchhcccceeeEEeccee
Confidence 66699999999999 68999998754 6789999999999999999999999999999987
Q ss_pred CCCCCEEEEEEEEeCCC---CCe-EEEEEeeCCCCCcEEEcCCeeEEEcccCCCC-ccEEEEEEEEEC
Q psy9046 153 ITPSPTLLTLEATDSDE---NPH-LTYKILSGNDDDVFAVDDTGILTIQGPLAKG-KTVYNLLVQVSD 215 (215)
Q Consensus 153 ~g~~~~i~~v~a~D~D~---~~~-i~y~i~~~~~~~~F~Id~~G~i~~~~~ld~~-~~~y~l~V~a~D 215 (215)
. ..|++|.|.|.|. +++ ..|.|.. .+-.|.||..|.|+.+.+|.+. ..+|.|+|+|-|
T Consensus 164 y---d~il~veAiD~DCspq~sqIC~YEI~t--~d~PFaIdn~G~irnTekLny~ke~~Y~ltVtAyD 226 (952)
T KOG1834|consen 164 Y---DSILRVEAIDKDCSPQYSQICEYEITT--PDVPFAIDNDGNIRNTEKLNYTKEHQYKLTVTAYD 226 (952)
T ss_pred e---eeeEEEEeecCCCCCcccceeEEEecC--CCCceEEcCCCccccccccccccceeEEEEEEEEe
Confidence 6 5899999999998 555 5889985 5668999999999999999974 789999999987
No 7
>cd00031 CA Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here
Probab=99.72 E-value=2.1e-16 Score=121.98 Aligned_cols=108 Identities=30% Similarity=0.462 Sum_probs=93.4
Q ss_pred CeEEEEEEEecCCCCCccCCCCCCCCCceEEEEecCCCCCcEEEEEEEEeCCC---CceeEEEecCC--CcEEEeCCccc
Q psy9046 1 MGTLIISIIDVNDIPPEFEPPWTRDEPNYLLEIEEEQPIGSLLGKFTALDKDS---SIALYVLEQDG--DMFSVNNITVC 75 (215)
Q Consensus 1 t~~v~I~V~DvND~~P~f~~~~~~~~~~~~~~V~E~~~~g~~v~~v~a~D~D~---~~~~y~i~~~~--~~F~id~~tg~ 75 (215)
+++|+|.|.|+||++|.| ....|.+.|.|+.++|+.++++.|+|+|. +...|+|..+. .+|.|++.+|.
T Consensus 85 ~~~v~I~V~d~Nd~~P~~------~~~~~~~~v~e~~~~~~~i~~~~a~D~D~~~~~~~~y~l~~~~~~~~f~i~~~~G~ 158 (199)
T cd00031 85 TATVTVTVLDVNDNPPVF------EQSSYEASVPENAPPGTVVGTVTATDADSGENAKLTYSILSGNDKELFSIDPNTGI 158 (199)
T ss_pred EEEEEEEEccCCCCCCcc------cccceEEEEeCCCCCCCEEEEEEEEcCCCCCCccEEEEEeCCCCCCEEEEeCCceE
Confidence 367999999999999999 46899999999999999999999999998 45579998776 79999999998
Q ss_pred eEEEEEEEecCCCcceeeEEEEecCCCCCeEEEEEEEEeCC--CCceEEEEEEEEee
Q psy9046 76 EFQLSVDLAAADNFQSEYNATIKENSPGDTKVVLVSAEDAS--TPCSHFRVNIKVLD 130 (215)
Q Consensus 76 ~~~~~~~~~ld~e~~~~~~~~~~e~~~~~~~~l~v~a~D~~--~~~~~~~v~I~V~D 130 (215)
+.... .||+|....|. +.|.|+|.+ ..+.++.++|.|.|
T Consensus 159 i~~~~---~ld~e~~~~~~-------------l~v~a~D~~~~~~~~~~~i~i~v~d 199 (199)
T cd00031 159 ITLAK---PLDREEKSSYE-------------LTVVATDGGGPPLSSTATVTVTVLD 199 (199)
T ss_pred EEeCC---ccCCccCceEE-------------EEEEEEECCCCCceeEEEEEEEEEC
Confidence 65544 69999988888 789999998 44778888888765
No 8
>PF00028 Cadherin: Cadherin domain; InterPro: IPR002126 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion.; GO: 0005509 calcium ion binding, 0007156 homophilic cell adhesion, 0016020 membrane; PDB: 2A4E_A 2A4C_B 2O72_A 2QVI_A 1NCJ_A 3Q2W_A 3Q2N_A 3LNH_B 3LNI_A 3Q2L_A ....
Probab=99.65 E-value=2.6e-15 Score=102.26 Aligned_cols=85 Identities=31% Similarity=0.414 Sum_probs=72.9
Q ss_pred eEEEEecCCCCCcEEEEEEEEeCCCCce---eEEEecCC--CcEEEeCCccceEEEEEEEecCCCcceeeEEEEecCCCC
Q psy9046 29 YLLEIEEEQPIGSLLGKFTALDKDSSIA---LYVLEQDG--DMFSVNNITVCEFQLSVDLAAADNFQSEYNATIKENSPG 103 (215)
Q Consensus 29 ~~~~V~E~~~~g~~v~~v~a~D~D~~~~---~y~i~~~~--~~F~id~~tg~~~~~~~~~~ld~e~~~~~~~~~~e~~~~ 103 (215)
|.++|+|++++|+.++++.|.|+|.+.+ .|+|..++ .+|.|++.+|.+.+.+ +||||....|.
T Consensus 1 Y~~~v~E~~~~g~~v~~v~a~D~D~~~n~~i~y~i~~~~~~~~F~I~~~tg~i~~~~---~LD~E~~~~y~--------- 68 (93)
T PF00028_consen 1 YSFSVPENAPPGTVVGQVTATDPDSGPNSQITYSILGGNPDGLFSIDPNTGEISLKK---PLDRETQSSYQ--------- 68 (93)
T ss_dssp EEEEEETTGSTSSEEEEEEEEESSTSTTSSEEEEEEETTSTTSEEEETTTTEEEESS---SSCTTTTSEEE---------
T ss_pred CEEEEECCCCCCCEEEEEEEEeCCCCCCceEEEEEecCcccCceEEeeeeeccccce---ecCcccCCEEE---------
Confidence 7899999999999999999999997554 69998766 8999999999855544 79999999999
Q ss_pred CeEEEEEEEEeC-CCC--ceEEEEEEEEe
Q psy9046 104 DTKVVLVSAEDA-STP--CSHFRVNIKVL 129 (215)
Q Consensus 104 ~~~~l~v~a~D~-~~~--~~~~~v~I~V~ 129 (215)
|.|.|+|. +.| +++++|.|+|+
T Consensus 69 ----l~v~a~D~~~~~~~~~~~~V~I~V~ 93 (93)
T PF00028_consen 69 ----LTVRATDSGGSPPLSSTATVTINVL 93 (93)
T ss_dssp ----EEEEEEETTTSSEEEEEEEEEEEEE
T ss_pred ----EEEEEEECCCCCCCEEEEEEEEEEC
Confidence 79999999 666 67777777764
No 9
>PF00028 Cadherin: Cadherin domain; InterPro: IPR002126 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion.; GO: 0005509 calcium ion binding, 0007156 homophilic cell adhesion, 0016020 membrane; PDB: 2A4E_A 2A4C_B 2O72_A 2QVI_A 1NCJ_A 3Q2W_A 3Q2N_A 3LNH_B 3LNI_A 3Q2L_A ....
Probab=99.63 E-value=5.2e-15 Score=100.75 Aligned_cols=71 Identities=37% Similarity=0.634 Sum_probs=66.9
Q ss_pred eEEEEecCCCCCCCCEEEEEEEEeCCC--CCeEEEEEeeCCCCCcEEEcC-CeeEEEcccCCCC-ccEEEEEEEEEC
Q psy9046 143 YRTSIQENIIITPSPTLLTLEATDSDE--NPHLTYKILSGNDDDVFAVDD-TGILTIQGPLAKG-KTVYNLLVQVSD 215 (215)
Q Consensus 143 ~~~~v~E~~~~g~~~~i~~v~a~D~D~--~~~i~y~i~~~~~~~~F~Id~-~G~i~~~~~ld~~-~~~y~l~V~a~D 215 (215)
|++.|+|++++| +.++++.|.|+|. |+.+.|+|..++..++|.|++ +|.|++.++||++ .+.|.|.|.|+|
T Consensus 1 Y~~~v~E~~~~g--~~v~~v~a~D~D~~~n~~i~y~i~~~~~~~~F~I~~~tg~i~~~~~LD~E~~~~y~l~v~a~D 75 (93)
T PF00028_consen 1 YSFSVPENAPPG--TVVGQVTATDPDSGPNSQITYSILGGNPDGLFSIDPNTGEISLKKPLDRETQSSYQLTVRATD 75 (93)
T ss_dssp EEEEEETTGSTS--SEEEEEEEEESSTSTTSSEEEEEEETTSTTSEEEETTTTEEEESSSSCTTTTSEEEEEEEEEE
T ss_pred CEEEEECCCCCC--CEEEEEEEEeCCCCCCceEEEEEecCcccCceEEeeeeeccccceecCcccCCEEEEEEEEEE
Confidence 789999999987 9999999999996 999999999988889999999 9999999999964 899999999987
No 10
>smart00112 CA Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
Probab=99.48 E-value=2.3e-13 Score=89.77 Aligned_cols=72 Identities=33% Similarity=0.408 Sum_probs=60.1
Q ss_pred EeCCCCce---eEEEecCC--CcEEEeCCccceEEEEEEEecCCCcceeeEEEEecCCCCCeEEEEEEEEeCCCC--ceE
Q psy9046 49 LDKDSSIA---LYVLEQDG--DMFSVNNITVCEFQLSVDLAAADNFQSEYNATIKENSPGDTKVVLVSAEDASTP--CSH 121 (215)
Q Consensus 49 ~D~D~~~~---~y~i~~~~--~~F~id~~tg~~~~~~~~~~ld~e~~~~~~~~~~e~~~~~~~~l~v~a~D~~~~--~~~ 121 (215)
+|+|.|.+ .|+|..+. .+|.|++.+|.+...+ .||||....|. |.|.|.|.+.+ ++.
T Consensus 1 ~D~D~g~n~~i~Y~i~~~~~~~~F~i~~~tg~i~~~~---~LD~e~~~~y~-------------l~v~a~D~~~~~~~~~ 64 (79)
T smart00112 1 TDADSGENGKVTYSILSGNEDGLFSIDPETGEITTTK---PLDREEQPEYT-------------LTVEATDGGGPPLSST 64 (79)
T ss_pred CCCCCCcCcEEEEEEecCCCCCEEEEeCCccEEEeCC---ccCeeCCCeEE-------------EEEEEEECCCCCcccE
Confidence 37787643 69997765 8999999999644333 79999999998 78999999874 889
Q ss_pred EEEEEEEeeCCCCCC
Q psy9046 122 FRVNIKVLDVNDNRP 136 (215)
Q Consensus 122 ~~v~I~V~DvNdn~P 136 (215)
++|+|+|.|+|||+|
T Consensus 65 ~~v~I~V~D~Nd~~P 79 (79)
T smart00112 65 ATVTVTVLDVNDNAP 79 (79)
T ss_pred EEEEEEEEECCCCCC
Confidence 999999999999998
No 11
>KOG1834|consensus
Probab=99.08 E-value=1.2e-09 Score=94.66 Aligned_cols=106 Identities=25% Similarity=0.287 Sum_probs=86.6
Q ss_pred eEEEEEEEecCCCCCccCCCCCCCCCceEEEEecCCCCCcEEEEEEEEeCCCCc-----eeEEEecCCCcEEEeCCccce
Q psy9046 2 GTLIISIIDVNDIPPEFEPPWTRDEPNYLLEIEEEQPIGSLLGKFTALDKDSSI-----ALYVLEQDGDMFSVNNITVCE 76 (215)
Q Consensus 2 ~~v~I~V~DvND~~P~f~~~~~~~~~~~~~~V~E~~~~g~~v~~v~a~D~D~~~-----~~y~i~~~~~~F~id~~tg~~ 76 (215)
++|.|+|.|+|.++|+| ..+.|.+.|.|.-- -..|+++.|.|.|.+. -.|.|..++-.|.||.. |.+
T Consensus 133 atvhIrVkDvNe~AP~f------~ep~Yka~V~EGK~-yd~il~veAiD~DCspq~sqIC~YEI~t~d~PFaIdn~-G~i 204 (952)
T KOG1834|consen 133 ATVHIRVKDVNEFAPVF------KEPWYKAHVTEGKV-YDSILRVEAIDKDCSPQYSQICEYEITTPDVPFAIDND-GNI 204 (952)
T ss_pred eEEEEEeccccccCchh------cccceeeEEeccee-eeeeEEEEeecCCCCCcccceeEEEecCCCCceEEcCC-Ccc
Confidence 68999999999999999 68999999999854 5568999999999853 25999999999999984 665
Q ss_pred EEEEEEEecCCCcceeeEEEEecCCCCCeEEEEEEEEeCCCC--ceEEEEEEEEeeC
Q psy9046 77 FQLSVDLAAADNFQSEYNATIKENSPGDTKVVLVSAEDASTP--CSHFRVNIKVLDV 131 (215)
Q Consensus 77 ~~~~~~~~ld~e~~~~~~~~~~e~~~~~~~~l~v~a~D~~~~--~~~~~v~I~V~Dv 131 (215)
.... +|+|.+...|. |+|.|.|.|.. ...+.|+|+|...
T Consensus 205 rnTe---kLny~ke~~Y~-------------ltVtAyDCg~kraa~d~lV~v~Vkp~ 245 (952)
T KOG1834|consen 205 RNTE---KLNYTKEHQYK-------------LTVTAYDCGKKRAASDSLVTVHVKPT 245 (952)
T ss_pred cccc---ccccccceeEE-------------EEEEEEecccccccCcceEEEEecCc
Confidence 5444 69998888888 79999999865 3335677777543
No 12
>smart00112 CA Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
Probab=98.79 E-value=2e-08 Score=65.88 Aligned_cols=51 Identities=45% Similarity=0.775 Sum_probs=44.6
Q ss_pred EeCCC--CCeEEEEEeeCCCCCcEEEcC-CeeEEEcccCCCC-ccEEEEEEEEEC
Q psy9046 165 TDSDE--NPHLTYKILSGNDDDVFAVDD-TGILTIQGPLAKG-KTVYNLLVQVSD 215 (215)
Q Consensus 165 ~D~D~--~~~i~y~i~~~~~~~~F~Id~-~G~i~~~~~ld~~-~~~y~l~V~a~D 215 (215)
+|+|. |+.++|+|.+++....|.|++ +|.|++.++||++ ...|.|.|+|+|
T Consensus 1 ~D~D~g~n~~i~Y~i~~~~~~~~F~i~~~tg~i~~~~~LD~e~~~~y~l~v~a~D 55 (79)
T smart00112 1 TDADSGENGKVTYSILSGNEDGLFSIDPETGEITTTKPLDREEQPEYTLTVEATD 55 (79)
T ss_pred CCCCCCcCcEEEEEEecCCCCCEEEEeCCccEEEeCCccCeeCCCeEEEEEEEEE
Confidence 36776 789999999876668999999 9999999999965 789999999987
No 13
>PF08758 Cadherin_pro: Cadherin prodomain like; InterPro: IPR014868 Cadherins are a group of proteins that mediate calcium dependent cell-cell adhesion. They are activated through cleavage of a prosequence in the late Golgi. This protein corresponds to the folded region of the prosequence, and is termed the prodomain. The prodomain shows structural resemblance to the cadherin domain, but lacks all the features known to be important for cadherin-cadherin interactions []. ; GO: 0007155 cell adhesion, 0016021 integral to membrane; PDB: 1OP4_A.
Probab=97.87 E-value=7.4e-05 Score=50.14 Aligned_cols=75 Identities=20% Similarity=0.296 Sum_probs=40.1
Q ss_pred CCccCCCceEEEEecCCCCCCCCEEEEEEEEeCCCCCeEEEEEeeCCCCCcEEEcCCeeEEEcccCCCCccEEEEEEEEE
Q psy9046 135 RPEFKSKTYRTSIQENIIITPSPTLLTLEATDSDENPHLTYKILSGNDDDVFAVDDTGILTIQGPLAKGKTVYNLLVQVS 214 (215)
Q Consensus 135 ~P~f~~~~~~~~v~E~~~~g~~~~i~~v~a~D~D~~~~i~y~i~~~~~~~~F~Id~~G~i~~~~~ld~~~~~y~l~V~a~ 214 (215)
.|-|.+..|.+.|+.+...| ..|++|.-.|-..+..+.|.-. +..|.|..+|.|++++++.-....-.+.|.|.
T Consensus 3 ~pGF~~~~~~~~Vp~~l~~g--~~lg~V~f~dC~~~~~~~~~ss----DpdF~V~~DGsVy~~r~v~l~~~~~~F~V~a~ 76 (90)
T PF08758_consen 3 RPGFSQKKYTFEVPSNLEAG--QPLGKVNFEDCTGRRRVIFESS----DPDFRVLEDGSVYAKRPVQLSSEQRSFTVHAW 76 (90)
T ss_dssp --B--S-EEEE----SS-SS----EEE---B--SS---EEEE-------SEEEEETTTEEEEES--S-SSS-EEEEEEEE
T ss_pred cCCcccceEEEEcCchhhCC--cEEEEEEeccCCCCCceEEecC----CCCEEEcCCCeEEEeeeEecCCCceEEEEEEE
Confidence 58899999999999998766 9999999999876778998764 23799999999999999985444456777776
Q ss_pred C
Q psy9046 215 D 215 (215)
Q Consensus 215 D 215 (215)
|
T Consensus 77 D 77 (90)
T PF08758_consen 77 D 77 (90)
T ss_dssp E
T ss_pred C
Confidence 5
No 14
>PF08266 Cadherin_2: Cadherin-like; InterPro: IPR013164 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion. This entry represents a cadherin domain that is usually found at the N terminus of cadherin proteins.; PDB: 1WUZ_A 1WYJ_A.
Probab=97.11 E-value=0.00057 Score=45.24 Aligned_cols=58 Identities=21% Similarity=0.308 Sum_probs=38.3
Q ss_pred eEEEEecCCCCCCCCEEEEEEEEeCCC----CCeEEEEEeeCCCCCcEEEcC-CeeEEEcccCCCC
Q psy9046 143 YRTSIQENIIITPSPTLLTLEATDSDE----NPHLTYKILSGNDDDVFAVDD-TGILTIQGPLAKG 203 (215)
Q Consensus 143 ~~~~v~E~~~~g~~~~i~~v~a~D~D~----~~~i~y~i~~~~~~~~F~Id~-~G~i~~~~~ld~~ 203 (215)
...+|.|..+.| +.|+.| |.|.-. -..-.|+|.+.....+|.++. +|.|++...+|+|
T Consensus 3 i~YsV~EE~~~G--t~IGni-a~dL~l~~~~l~~~~~ri~s~~~~~~~~v~~~tG~L~v~~rIDRE 65 (84)
T PF08266_consen 3 IRYSVPEEMPPG--TVIGNI-AKDLGLDPQSLSSRNFRIVSEGNSQYFRVNEKTGDLFVSERIDRE 65 (84)
T ss_dssp EEEEEESS--TT---EEEEC-CCCCT--HHHHCCTTBEEE-SSSS-SEEE-TTTSEEEESS--SCC
T ss_pred eEEEeecCCCCC--CEEEEh-HHhhCCCcccccccceEEeecCCcceeEecCCceeEEeCCccCHH
Confidence 367899999977 999998 555544 123478888777788999999 9999999999964
No 15
>PF08266 Cadherin_2: Cadherin-like; InterPro: IPR013164 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion. This entry represents a cadherin domain that is usually found at the N terminus of cadherin proteins.; PDB: 1WUZ_A 1WYJ_A.
Probab=97.03 E-value=0.0008 Score=44.54 Aligned_cols=56 Identities=21% Similarity=0.260 Sum_probs=35.0
Q ss_pred eEEEEecCCCCCcEEEEEEEEeCCCCc-----eeEEEec--CCCcEEEeCCccceEEEEEEEecCCC
Q psy9046 29 YLLEIEEEQPIGSLLGKFTALDKDSSI-----ALYVLEQ--DGDMFSVNNITVCEFQLSVDLAAADN 88 (215)
Q Consensus 29 ~~~~V~E~~~~g~~v~~v~a~D~D~~~-----~~y~i~~--~~~~F~id~~tg~~~~~~~~~~ld~e 88 (215)
...+|+|..+.|+.||.+ |.|.-... ..|++.. ...+|.++..+|.+++.. ++|||
T Consensus 3 i~YsV~EE~~~Gt~IGni-a~dL~l~~~~l~~~~~ri~s~~~~~~~~v~~~tG~L~v~~---rIDRE 65 (84)
T PF08266_consen 3 IRYSVPEEMPPGTVIGNI-AKDLGLDPQSLSSRNFRIVSEGNSQYFRVNEKTGDLFVSE---RIDRE 65 (84)
T ss_dssp EEEEEESS--TT-EEEEC-CCCCT--HHHHCCTTBEEE-SSSS-SEEE-TTTSEEEESS-----SCC
T ss_pred eEEEeecCCCCCCEEEEh-HHhhCCCcccccccceEEeecCCcceeEecCCceeEEeCC---ccCHH
Confidence 357899999999999999 55554321 2477754 358999999999866543 79999
No 16
>PF13750 Big_3_3: Bacterial Ig-like domain (group 3)
Probab=96.88 E-value=0.057 Score=40.20 Aligned_cols=114 Identities=17% Similarity=0.301 Sum_probs=65.2
Q ss_pred cCCCCCeEEEEE-EEEeCCCCceEEEEEEEEeeCCCCCCccCCCceEEEEecCCCCCCCCEEEEEEEEeCCC-CCeEEEE
Q psy9046 99 ENSPGDTKVVLV-SAEDASTPCSHFRVNIKVLDVNDNRPEFKSKTYRTSIQENIIITPSPTLLTLEATDSDE-NPHLTYK 176 (215)
Q Consensus 99 e~~~~~~~~l~v-~a~D~~~~~~~~~v~I~V~DvNdn~P~f~~~~~~~~v~E~~~~g~~~~i~~v~a~D~D~-~~~i~y~ 176 (215)
...+.|.|.+.+ .|.|..+...+..+...+. +...+|.+.- .....+..+..+. +..=..+.++|... ..--..+
T Consensus 10 ~~l~dG~Y~l~~~~a~D~agN~~~~~~~~~~~-iD~T~Ptisi-~~~~~~~~g~~v~-~~~~i~i~~tD~~~~~~i~sv~ 86 (158)
T PF13750_consen 10 STLPDGSYTLTVVTATDAAGNTSTSTVSETFT-IDNTPPTISI-SDGASVANGSTVY-GLVNISINVTDNSDDSKITSVS 86 (158)
T ss_pred CcCCCccEEEEEEEEEecCCCEEEEEEeeEEE-EcCCCCEEEE-ecCCccCCCcccc-ceeeeEEEEEeCCCCceEEEEE
Confidence 456778888999 7999887766666554443 3444776533 1111222332221 12335678887765 3345777
Q ss_pred EeeCCCCCcEEE--cC--CeeEEEc--ccCC--CCccEEEEEEEEEC
Q psy9046 177 ILSGNDDDVFAV--DD--TGILTIQ--GPLA--KGKTVYNLLVQVSD 215 (215)
Q Consensus 177 i~~~~~~~~F~I--d~--~G~i~~~--~~ld--~~~~~y~l~V~a~D 215 (215)
|.+|.....-.+ .. .|...+. +.+. +....|+|+|.|+|
T Consensus 87 l~Gg~~~d~v~ls~~~~~~~~~~~~yp~~fpsle~~~~YtLtV~a~D 133 (158)
T PF13750_consen 87 LTGGPASDSVSLSWTNKGNGVYTLEYPRIFPSLEADDSYTLTVSATD 133 (158)
T ss_pred EECCcccceEEEeeEeccCceEEeecccccCCcCCCCeEEEEEEEEe
Confidence 776654443333 22 4544443 2222 45789999999998
No 17
>PF08758 Cadherin_pro: Cadherin prodomain like; InterPro: IPR014868 Cadherins are a group of proteins that mediate calcium dependent cell-cell adhesion. They are activated through cleavage of a prosequence in the late Golgi. This protein corresponds to the folded region of the prosequence, and is termed the prodomain. The prodomain shows structural resemblance to the cadherin domain, but lacks all the features known to be important for cadherin-cadherin interactions []. ; GO: 0007155 cell adhesion, 0016021 integral to membrane; PDB: 1OP4_A.
Probab=96.64 E-value=0.034 Score=37.27 Aligned_cols=78 Identities=22% Similarity=0.208 Sum_probs=36.8
Q ss_pred CCccCCCCCCCCCceEEEEecCCCCCcEEEEEEEEeCCCCceeEEEecCCCcEEEeCCccceEEEEEEEecCCC-cceee
Q psy9046 15 PPEFEPPWTRDEPNYLLEIEEEQPIGSLLGKFTALDKDSSIALYVLEQDGDMFSVNNITVCEFQLSVDLAAADN-FQSEY 93 (215)
Q Consensus 15 ~P~f~~~~~~~~~~~~~~V~E~~~~g~~v~~v~a~D~D~~~~~y~i~~~~~~F~id~~tg~~~~~~~~~~ld~e-~~~~~ 93 (215)
-|=| .+..|.+.|+.+...|..|++|.-.|-.... ...+.+.+..|.|..+ |.+++.+. +... .+..
T Consensus 3 ~pGF------~~~~~~~~Vp~~l~~g~~lg~V~f~dC~~~~-~~~~~ssDpdF~V~~D-GsVy~~r~---v~l~~~~~~- 70 (90)
T PF08758_consen 3 RPGF------SQKKYTFEVPSNLEAGQPLGKVNFEDCTGRR-RVIFESSDPDFRVLED-GSVYAKRP---VQLSSEQRS- 70 (90)
T ss_dssp --B--------S-EEEE----SS-SS--EEE---B--SS----EEEE---SEEEEETT-TEEEEES-----S-SSS-EE-
T ss_pred cCCc------ccceEEEEcCchhhCCcEEEEEEeccCCCCC-ceEEecCCCCEEEcCC-CeEEEeee---EecCCCceE-
Confidence 4677 5899999999999999999999998885433 2333344558999984 87766653 3221 2222
Q ss_pred EEEEecCCCCCeEEEEEEEEeCCCC
Q psy9046 94 NATIKENSPGDTKVVLVSAEDASTP 118 (215)
Q Consensus 94 ~~~~~e~~~~~~~~l~v~a~D~~~~ 118 (215)
+.|.|.|....
T Consensus 71 --------------F~V~a~D~~~~ 81 (90)
T PF08758_consen 71 --------------FTVHAWDSQTQ 81 (90)
T ss_dssp --------------EEEEEEETTTT
T ss_pred --------------EEEEEECCCCC
Confidence 57899998764
No 18
>smart00736 CADG Dystroglycan-type cadherin-like domains. Cadherin-homologous domains present in metazoan dystroglycans and alpha/epsilon sarcoglycans, yeast Axl2p and in a very large protein from magnetotactic bacteria. Likely to bind calcium ions.
Probab=96.17 E-value=0.11 Score=35.25 Aligned_cols=68 Identities=19% Similarity=0.161 Sum_probs=47.1
Q ss_pred EEeCCCCceeEEEec-----CCCcEEEeCCccceEEEEEEEecCCCcceeeEEEEecCCCCCeEEEEEEEEeCCCCceEE
Q psy9046 48 ALDKDSSIALYVLEQ-----DGDMFSVNNITVCEFQLSVDLAAADNFQSEYNATIKENSPGDTKVVLVSAEDASTPCSHF 122 (215)
Q Consensus 48 a~D~D~~~~~y~i~~-----~~~~F~id~~tg~~~~~~~~~~ld~e~~~~~~~~~~e~~~~~~~~l~v~a~D~~~~~~~~ 122 (215)
..|+|....+|++.. -..|...++.++.+.- . +.... ...|. +.|.|+|+.+.+...
T Consensus 24 F~d~d~~~lty~~~~~~~~~lP~Wl~fd~~~~~~~G-t---P~~~~-~g~~~-------------i~v~a~D~~g~~~~~ 85 (97)
T smart00736 24 FTDADGDTLTYSATLSDGSALPSWLSFDSDTGTLSG-T---PTNSD-VGSLS-------------LKVTATDSSGASASD 85 (97)
T ss_pred eECCCCCeEEEEEEeCCCCCCCCeEEEeCCCCEEEE-E---CCCCC-CcEEE-------------EEEEEEECCCCEEEE
Confidence 467886666788742 1368899888765322 2 22222 23355 799999998778899
Q ss_pred EEEEEEeeCCC
Q psy9046 123 RVNIKVLDVND 133 (215)
Q Consensus 123 ~v~I~V~DvNd 133 (215)
.+.|.|.+.|+
T Consensus 86 ~f~i~V~~~~~ 96 (97)
T smart00736 86 TFTITVVNTND 96 (97)
T ss_pred EEEEEEeCCCC
Confidence 99999999987
No 19
>TIGR01965 VCBS_repeat VCBS repeat. This domain of about 100 residues is found multiple (up to 35) copies in long proteins from several species of Vibrio, Colwellia, Bradyrhizobium, and Shewanella (hence the name VCBS) and in smaller copy numbers in proteins from several other bacteria. The large protein size and repeat copy numbers, species distribution, and suggested activities of several member proteins suggests a role for this domain in adhesion.
Probab=95.79 E-value=0.15 Score=34.78 Aligned_cols=86 Identities=20% Similarity=0.221 Sum_probs=51.4
Q ss_pred EEEEEEeCCCCcee-EEEecC---CCcEEEeCCccceEEEEEEE------ecCCCcceeeEEEEecCCCCCeEEEEEEEE
Q psy9046 44 GKFTALDKDSSIAL-YVLEQD---GDMFSVNNITVCEFQLSVDL------AAADNFQSEYNATIKENSPGDTKVVLVSAE 113 (215)
Q Consensus 44 ~~v~a~D~D~~~~~-y~i~~~---~~~F~id~~tg~~~~~~~~~------~ld~e~~~~~~~~~~e~~~~~~~~l~v~a~ 113 (215)
+++.++|+|.+... +++... -+.|.|++ .|. +.-.... .|...+... -.+++.+.
T Consensus 2 G~Lt~sD~D~gd~~~~s~~~~~g~yGtlti~~-~G~-wtYtl~n~~~avq~L~~Ge~~t-------------dsFtvtv~ 66 (99)
T TIGR01965 2 GQLTISDADAGQAHFIAQTDAAGQYGTFSIDA-DGQ-WTYQADNSQTAVQALKAGETLT-------------DTFTVTSA 66 (99)
T ss_pred CceEEeCCCCCCceEEecccccCCcEEEEECC-CCc-EEEEeCCCcHHHHhhcCCCEEE-------------EEEEEEEe
Confidence 46889999987653 444222 25688877 564 2222100 111111111 22678888
Q ss_pred eCCCCceEEEEEEEEeeCCCCCCccCCCceEEEEecC
Q psy9046 114 DASTPCSHFRVNIKVLDVNDNRPEFKSKTYRTSIQEN 150 (215)
Q Consensus 114 D~~~~~~~~~v~I~V~DvNdn~P~f~~~~~~~~v~E~ 150 (215)
|+ .+.+|.|.|.-.|| +|+..... ...+.|+
T Consensus 67 DG----tt~~vtItI~GtND-apvi~~~~-~g~v~ED 97 (99)
T TIGR01965 67 DG----TSQTVTITITGAND-AAVIGGAD-TGSVTED 97 (99)
T ss_pred CC----CeEEEEEEEEccCC-CCEEeccc-ceeEecC
Confidence 86 38899999999999 78665432 5666665
No 20
>PF05345 He_PIG: Putative Ig domain; InterPro: IPR008009 This alignment represents the conserved core region of a ~90 residue repeat found in several haemagglutinins and other cell surface proteins. Sequence similarities to Hyalin (IPR003410 from INTERPRO) and the PKD domain (IPR000601 from INTERPRO) suggest an Ig-like fold so this family may be similar in function to the (IPR003791 from INTERPRO) and (IPR003790 from INTERPRO) protein families.
Probab=93.55 E-value=0.37 Score=28.30 Aligned_cols=34 Identities=18% Similarity=0.213 Sum_probs=26.2
Q ss_pred CCCcEEEcC-CeeEEEcccCCCCccEEEEEEEEEC
Q psy9046 182 DDDVFAVDD-TGILTIQGPLAKGKTVYNLLVQVSD 215 (215)
Q Consensus 182 ~~~~F~Id~-~G~i~~~~~ld~~~~~y~l~V~a~D 215 (215)
-.....||+ +|.|.-...-..+.+.|.+.|.|+|
T Consensus 12 LP~gLs~d~~tG~isGtp~~~~~~G~y~~~vtatd 46 (49)
T PF05345_consen 12 LPSGLSLDPSTGTISGTPTSSVQPGTYTFTVTATD 46 (49)
T ss_pred CCCcEEEeCCCCEEEeecCCCccccEEEEEEEEEc
Confidence 356688999 9999877444434579999999987
No 21
>smart00736 CADG Dystroglycan-type cadherin-like domains. Cadherin-homologous domains present in metazoan dystroglycans and alpha/epsilon sarcoglycans, yeast Axl2p and in a very large protein from magnetotactic bacteria. Likely to bind calcium ions.
Probab=93.05 E-value=0.78 Score=30.95 Aligned_cols=51 Identities=27% Similarity=0.313 Sum_probs=37.9
Q ss_pred EEEeCCCCCeEEEEEeeCC---CCCcEEEcC-CeeEEEcccCCCCccEEEEEEEEEC
Q psy9046 163 EATDSDENPHLTYKILSGN---DDDVFAVDD-TGILTIQGPLAKGKTVYNLLVQVSD 215 (215)
Q Consensus 163 ~a~D~D~~~~i~y~i~~~~---~~~~F~Id~-~G~i~~~~~ld~~~~~y~l~V~a~D 215 (215)
...|+| +..++|++...+ ...|...|+ ++.+.= .+.....+.|.+.|.|+|
T Consensus 23 tF~d~d-~~~lty~~~~~~~~~lP~Wl~fd~~~~~~~G-tP~~~~~g~~~i~v~a~D 77 (97)
T smart00736 23 TFTDAD-GDTLTYSATLSDGSALPSWLSFDSDTGTLSG-TPTNSDVGSLSLKVTATD 77 (97)
T ss_pred ceECCC-CCeEEEEEEeCCCCCCCCeEEEeCCCCEEEE-ECCCCCCcEEEEEEEEEE
Confidence 457777 779999987532 246888898 887766 455544677999999987
No 22
>PF07495 Y_Y_Y: Y_Y_Y domain; InterPro: IPR011123 This region is mostly found at the end of the beta propellers (IPR011110 from INTERPRO) in a family of two component regulators. However they are also found tandemly repeated in Q891H4 from SWISSPROT without other signal conduction domains being present. It is named after the conserved tyrosines found in the alignment. The exact function is not known.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=84.18 E-value=2.6 Score=25.86 Aligned_cols=30 Identities=23% Similarity=0.293 Sum_probs=18.5
Q ss_pred CCCCCeEEEEEEEEeCCCC--ceEEEEEEEEe
Q psy9046 100 NSPGDTKVVLVSAEDASTP--CSHFRVNIKVL 129 (215)
Q Consensus 100 ~~~~~~~~l~v~a~D~~~~--~~~~~v~I~V~ 129 (215)
+.++|.|.|.|+|.|..+. ....++.|+|+
T Consensus 35 ~L~~G~Y~l~V~a~~~~~~~~~~~~~l~i~I~ 66 (66)
T PF07495_consen 35 NLPPGKYTLEVRAKDNNGKWSSDEKSLTITIL 66 (66)
T ss_dssp S--SEEEEEEEEEEETTS-B-SS-EEEEEEEE
T ss_pred eCCCEEEEEEEEEECCCCCcCcccEEEEEEEC
Confidence 3466777799999997655 22266776663
No 23
>KOG3597|consensus
Probab=82.58 E-value=23 Score=30.98 Aligned_cols=55 Identities=27% Similarity=0.126 Sum_probs=38.1
Q ss_pred CeEEEEEEEecCCCCCccCCCCCCCCCceEEEEecCCCCCcEEEEEEEEeCCCCce--eEEEe
Q psy9046 1 MGTLIISIIDVNDIPPEFEPPWTRDEPNYLLEIEEEQPIGSLLGKFTALDKDSSIA--LYVLE 61 (215)
Q Consensus 1 t~~v~I~V~DvND~~P~f~~~~~~~~~~~~~~V~E~~~~g~~v~~v~a~D~D~~~~--~y~i~ 61 (215)
|....|.|.-+||.+..+. ...+.+.+.|+...-.....+.+.|+|.+.. .|++.
T Consensus 25 ~~~~~i~v~pvndpp~~~~------~~~~~l~~~~~~~k~l~~~~l~~~d~d~~~~~l~f~v~ 81 (442)
T KOG3597|consen 25 TDVLRIHVNPVNDPPSLIF------PSGSLLVILEGGQKVLDPELLTAADPDSAPLPLEFQVL 81 (442)
T ss_pred EeeecccccccCCCcceee------cccceEEeecCCceeccceEeeccCCCCCccceEEEEc
Confidence 3567889999999777663 4445588888866444456688888888543 46664
No 24
>PF12245 Big_3_2: Bacterial Ig-like domain (group 3); InterPro: IPR022038 This family of proteins is found in bacteria. They have two conserved sequence motifs: AGN and GMT.
Probab=80.37 E-value=10 Score=23.09 Aligned_cols=35 Identities=23% Similarity=0.250 Sum_probs=24.6
Q ss_pred ecCCCCCeEEEEEEEEeCCCCceEEEEEEEEeeCC
Q psy9046 98 KENSPGDTKVVLVSAEDASTPCSHFRVNIKVLDVN 132 (215)
Q Consensus 98 ~e~~~~~~~~l~v~a~D~~~~~~~~~v~I~V~DvN 132 (215)
++....|.|.+.+.+.|..+........+.+.|..
T Consensus 17 P~~~~dg~yt~~v~a~D~AGN~~~~~~~~~i~d~~ 51 (60)
T PF12245_consen 17 PENDADGEYTLTVTATDKAGNTSSSTTQIVIVDNT 51 (60)
T ss_pred cCccCCccEEEEEEEEECCCCEEEeeeEEEEEcCC
Confidence 34444556779999999988767666666666654
No 25
>TIGR01965 VCBS_repeat VCBS repeat. This domain of about 100 residues is found multiple (up to 35) copies in long proteins from several species of Vibrio, Colwellia, Bradyrhizobium, and Shewanella (hence the name VCBS) and in smaller copy numbers in proteins from several other bacteria. The large protein size and repeat copy numbers, species distribution, and suggested activities of several member proteins suggests a role for this domain in adhesion.
Probab=79.40 E-value=9.6 Score=25.97 Aligned_cols=37 Identities=19% Similarity=0.206 Sum_probs=24.1
Q ss_pred EEEEEEeCCCCCeEEEEEeeC-CCCCcEEEcCCeeEEE
Q psy9046 160 LTLEATDSDENPHLTYKILSG-NDDDVFAVDDTGILTI 196 (215)
Q Consensus 160 ~~v~a~D~D~~~~i~y~i~~~-~~~~~F~Id~~G~i~~ 196 (215)
+++.++|+|.+....+++... ...+.|.|+++|....
T Consensus 2 G~Lt~sD~D~gd~~~~s~~~~~g~yGtlti~~~G~wtY 39 (99)
T TIGR01965 2 GQLTISDADAGQAHFIAQTDAAGQYGTFSIDADGQWTY 39 (99)
T ss_pred CceEEeCCCCCCceEEecccccCCcEEEEECCCCcEEE
Confidence 467888999766666666432 2346688887775544
No 26
>PF15418 DUF4625: Domain of unknown function (DUF4625)
Probab=74.24 E-value=11 Score=27.19 Aligned_cols=36 Identities=17% Similarity=0.255 Sum_probs=29.6
Q ss_pred eEEEEecCCCCCeEEEEEEEEeCCCCceEEEEEEEE
Q psy9046 93 YNATIKENSPGDTKVVLVSAEDASTPCSHFRVNIKV 128 (215)
Q Consensus 93 ~~~~~~e~~~~~~~~l~v~a~D~~~~~~~~~v~I~V 128 (215)
..+.+++..+.|.|.+.+.++|..+........|+|
T Consensus 96 ~~i~IPa~a~~G~YH~~i~VtD~~Gn~~~~~~~i~I 131 (132)
T PF15418_consen 96 EHIDIPADAPAGDYHFMITVTDAAGNQTEEERSIKI 131 (132)
T ss_pred EeeeCCCCCCCcceEEEEEEEECCCCEEEEEEEEEE
Confidence 356788999999999999999998876666666665
No 27
>PF02494 HYR: HYR domain; InterPro: IPR003410 This domain is known as the HYR (Hyalin Repeat) domain, after the protein hyalin that is composed exclusively of this repeat. This domain probably corresponds to a new superfamily in the immunoglobulin fold. The function of this domain is uncertain it may be involved in cell adhesion. In the Sushi repeat-containing protein (SrpX), this domain is found between two sushi repeats.
Probab=70.00 E-value=10 Score=24.37 Aligned_cols=25 Identities=24% Similarity=0.279 Sum_probs=19.9
Q ss_pred CeEEEEEEEEeCCCCceEEEEEEEE
Q psy9046 104 DTKVVLVSAEDASTPCSHFRVNIKV 128 (215)
Q Consensus 104 ~~~~l~v~a~D~~~~~~~~~v~I~V 128 (215)
|.+.++..|+|..+..+++++.|+|
T Consensus 57 G~t~V~ytA~D~~GN~a~C~f~V~V 81 (81)
T PF02494_consen 57 GTTTVTYTATDAAGNSATCSFTVTV 81 (81)
T ss_pred ceEEEEEEEEECCCCEEEEEEEEEC
Confidence 4455889999988778888888875
No 28
>KOG3597|consensus
Probab=69.55 E-value=33 Score=29.99 Aligned_cols=59 Identities=22% Similarity=0.098 Sum_probs=41.8
Q ss_pred eEEEEEEEEeeCCCCCCccCCCceEEEEecCCCCCCCCEEEEEEEEeCCC-CCeEEEEEeeC
Q psy9046 120 SHFRVNIKVLDVNDNRPEFKSKTYRTSIQENIIITPSPTLLTLEATDSDE-NPHLTYKILSG 180 (215)
Q Consensus 120 ~~~~v~I~V~DvNdn~P~f~~~~~~~~v~E~~~~g~~~~i~~v~a~D~D~-~~~i~y~i~~~ 180 (215)
.+....|+|.-+||.+..+...-+.+-+.|+..-- ..--.+++.|+|. -..+.|++.+.
T Consensus 24 ~~~~~~i~v~pvndpp~~~~~~~~~l~~~~~~~k~--l~~~~l~~~d~d~~~~~l~f~v~~t 83 (442)
T KOG3597|consen 24 QTDVLRIHVNPVNDPPSLIFPSGSLLVILEGGQKV--LDPELLTAADPDSAPLPLEFQVLGT 83 (442)
T ss_pred EEeeecccccccCCCcceeecccceEEeecCCcee--ccceEeeccCCCCCccceEEEEccC
Confidence 56678999999999666666655666666654311 2234688899998 77899999863
No 29
>PF13754 Big_3_4: Bacterial Ig-like domain (group 3)
Probab=59.74 E-value=28 Score=20.56 Aligned_cols=17 Identities=29% Similarity=0.317 Sum_probs=13.3
Q ss_pred CCCeEEEEEEEEeCCCC
Q psy9046 102 PGDTKVVLVSAEDASTP 118 (215)
Q Consensus 102 ~~~~~~l~v~a~D~~~~ 118 (215)
..|.|.+++.|+|..+.
T Consensus 22 ~dG~y~itv~a~D~AGN 38 (54)
T PF13754_consen 22 ADGTYTITVTATDAAGN 38 (54)
T ss_pred CCccEEEEEEEEeCCCC
Confidence 45667789999998765
No 30
>cd00146 PKD polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.
Probab=59.17 E-value=27 Score=22.09 Aligned_cols=27 Identities=19% Similarity=0.130 Sum_probs=17.3
Q ss_pred CCCeEEEEEEEEeCCCCceEEEEEEEE
Q psy9046 102 PGDTKVVLVSAEDASTPCSHFRVNIKV 128 (215)
Q Consensus 102 ~~~~~~l~v~a~D~~~~~~~~~v~I~V 128 (215)
..|.|.+++.++|..+.+...++.|.|
T Consensus 55 ~~G~y~v~l~v~d~~g~~~~~~~~V~V 81 (81)
T cd00146 55 KPGTYTVTLTVTNAVGSSSTKTTTVVV 81 (81)
T ss_pred CCcEEEEEEEEEeCCCCEEEEEEEEEC
Confidence 345555899999986545554665543
No 31
>PF05895 DUF859: Siphovirus protein of unknown function (DUF859); InterPro: IPR008577 This entry is represented by Streptococcus phage 7201, Orf39. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This family consists of several uncharacterised proteins from a number of the Siphoviruses as well as some bacterial proteins from Streptococcus species. Some of the members of this family are described as putative minor structural proteins.
Probab=58.91 E-value=1.5e+02 Score=27.34 Aligned_cols=34 Identities=15% Similarity=0.245 Sum_probs=26.9
Q ss_pred CeEEEEEEEEeCCCC-ceEEEEEEEEeeCCCCCCccC
Q psy9046 104 DTKVVLVSAEDASTP-CSHFRVNIKVLDVNDNRPEFK 139 (215)
Q Consensus 104 ~~~~l~v~a~D~~~~-~~~~~v~I~V~DvNdn~P~f~ 139 (215)
|...+++.++|..+. +......|+|++-. +|.+.
T Consensus 298 G~~Ti~atVtDSRGr~S~~~~~tItVl~Y~--~P~ls 332 (624)
T PF05895_consen 298 GSATIRATVTDSRGRTSDPKTKTITVLEYS--PPTLS 332 (624)
T ss_pred ceEEEEEEEEECCCccCCceEEEEEEEEcC--CCcEE
Confidence 566789999998876 45678999999876 77773
No 32
>PF13753 SWM_repeat: Putative flagellar system-associated repeat
Probab=56.40 E-value=1.1e+02 Score=25.14 Aligned_cols=104 Identities=21% Similarity=0.252 Sum_probs=51.3
Q ss_pred CCeEEEEEEEEeCCCCceEEEEEEEEeeCCCCCCccCCCceEEEEecCCCC-C---CCCEEEEEEEEeCCCCCeEEEEEe
Q psy9046 103 GDTKVVLVSAEDASTPCSHFRVNIKVLDVNDNRPEFKSKTYRTSIQENIII-T---PSPTLLTLEATDSDENPHLTYKIL 178 (215)
Q Consensus 103 ~~~~~l~v~a~D~~~~~~~~~v~I~V~DvNdn~P~f~~~~~~~~v~E~~~~-g---~~~~i~~v~a~D~D~~~~i~y~i~ 178 (215)
.+.|.+.+.++|..+.....+..|.|--. +|...-. .+.++... + .....+...+++.+.+..+...+
T Consensus 11 d~~~~v~vt~tD~aGN~~~~t~~~~vDt~---~P~v~i~----~~~~~~~~~~~~~~~~~t~s~tvs~~~~g~~v~v~~- 82 (317)
T PF13753_consen 11 DGTYTVSVTVTDAAGNTSTATQSITVDTT---APTVTIT----SIADDDIINGDEATNTVTFSGTVSGAEPGSTVTVTI- 82 (317)
T ss_pred CCcEEEEEEEEeCCCCeeeeeEEEEEecC---CCceeee----cccCCCccccceeeeeeEEEEEecCCCCCCEEEEEE-
Confidence 45566899999987765555655553322 6643321 11111110 0 01234555555555455666666
Q ss_pred eCCCCCcEEEcCCeeEEEcccC--CCCccEEEEEEE-EEC
Q psy9046 179 SGNDDDVFAVDDTGILTIQGPL--AKGKTVYNLLVQ-VSD 215 (215)
Q Consensus 179 ~~~~~~~F~Id~~G~i~~~~~l--d~~~~~y~l~V~-a~D 215 (215)
.+ ....+..+.+|...+.-.. .-..+.|.+.+. ++|
T Consensus 83 ~g-~~~t~~~~~~G~ws~t~~~~~~l~~g~~ti~v~~~tD 121 (317)
T PF13753_consen 83 NG-TTGTLTADADGNWSVTVTPSDDLPDGDYTITVTTVTD 121 (317)
T ss_pred CC-EEEEEEEecCCcEEEeeccccccccCcceeEEEEEEc
Confidence 21 1122333335543322111 123567888887 776
No 33
>PF03160 Calx-beta: Calx-beta domain; InterPro: IPR003644 The calx-beta motif is present as a tandem repeat in the cytoplasmic domains of Calx Na-Ca exchangers, which are used to expel calcium from cells. This motif overlaps domains used for calcium binding and regulation. The calx-beta motif is also present in the cytoplasmic tail of mammalian integrin-beta4, which mediates the bi-directional transfer of signals across the plasma membrane, as well as in some cyanobacterial proteins. This motif contains a series of beta-strands and turns that form a self-contained beta-sheet [, ].; GO: 0007154 cell communication, 0016021 integral to membrane; PDB: 3H6A_B 3FSO_A 3FQ4_B 2DPK_A 2QVM_A 3GIN_B 2QVK_A 2FWU_A 2FWS_A 3E9U_A ....
Probab=55.84 E-value=57 Score=21.57 Aligned_cols=50 Identities=14% Similarity=0.250 Sum_probs=29.2
Q ss_pred EEEEeeCCCCCCccCCCceEEEEecCCCCCCCCEEEEEEEEeCCC--CCeEEEEEeeC
Q psy9046 125 NIKVLDVNDNRPEFKSKTYRTSIQENIIITPSPTLLTLEATDSDE--NPHLTYKILSG 180 (215)
Q Consensus 125 ~I~V~DvNdn~P~f~~~~~~~~v~E~~~~g~~~~i~~v~a~D~D~--~~~i~y~i~~~ 180 (215)
+|.|.| ||.+ .+.-..-...+.|+.. ..-..|.-..++. .-.+.|+..++
T Consensus 2 tvtI~d-~d~~-~v~f~~~~~~v~E~~~----~~~v~V~~~~~~~~~~v~v~~~~~~g 53 (100)
T PF03160_consen 2 TVTILD-DDDP-TVSFSSPSYTVSEGDG----TVTVTVTRSGGSLDGPVTVNYSTVDG 53 (100)
T ss_dssp EEEEE--TTSE-EEEESSSEEEEETTSS----EEEEEEEEESS-TSSEEEEEEEEEES
T ss_pred EEEEEC-CCCC-EEEEeCCEEEEEeCCC----EEEEEEEEcccCCCcceEEEEEEeCC
Confidence 567778 6644 6665544677888752 4445555554543 44677777765
No 34
>TIGR00845 caca sodium/calcium exchanger 1. This model is specific for the eukaryotic sodium ion/calcium ion exchangers of the Caca family
Probab=54.75 E-value=2.1e+02 Score=27.77 Aligned_cols=54 Identities=15% Similarity=0.258 Sum_probs=31.9
Q ss_pred EEEEEEEeeCCCCCCccCCCceEEEEecCCCCCCCCEEEEE-EEEeCCCCCeEEEEEeeC
Q psy9046 122 FRVNIKVLDVNDNRPEFKSKTYRTSIQENIIITPSPTLLTL-EATDSDENPHLTYKILSG 180 (215)
Q Consensus 122 ~~v~I~V~DvNdn~P~f~~~~~~~~v~E~~~~g~~~~i~~v-~a~D~D~~~~i~y~i~~~ 180 (215)
...+|+|.| ||++|.|.-..-...|.|+. | ..-.+| +..+.+..-.+.|+-.+|
T Consensus 515 s~ATVTIlD-DD~aGIfsFe~~~~sV~Es~--G--~vtvtV~RtsGa~G~VtV~Y~T~dG 569 (928)
T TIGR00845 515 NTATVTILD-DDHAGIFTFEEDVFHVSESI--G--IMEVKVLRTSGARGTVIVPYRTVEG 569 (928)
T ss_pred ceEEEEEec-CcccCcccccCceEEEEcCC--C--EEEEEEEEcCCCCeeEEEEEEeecC
Confidence 345666677 78899877655577888874 3 433333 222222133567877765
No 35
>smart00089 PKD Repeats in polycystic kidney disease 1 (PKD1) and other proteins. Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.
Probab=54.29 E-value=42 Score=20.99 Aligned_cols=26 Identities=19% Similarity=0.174 Sum_probs=18.6
Q ss_pred CCCeEEEEEEEEeCCCCceEEEEEEEE
Q psy9046 102 PGDTKVVLVSAEDASTPCSHFRVNIKV 128 (215)
Q Consensus 102 ~~~~~~l~v~a~D~~~~~~~~~v~I~V 128 (215)
..|.|.+++.+.|..+ +.++.+.|.|
T Consensus 53 ~~G~y~v~l~v~n~~g-~~~~~~~i~v 78 (79)
T smart00089 53 KPGTYTVTLTVTNAVG-SASATVTVVV 78 (79)
T ss_pred CCcEEEEEEEEEcCCC-cEEEEEEEEE
Confidence 3455558999999776 6777777766
No 36
>TIGR03660 T1SS_rpt_143 T1SS-143 repeat domain. This model represents a domain of about 143 amino acids that may occur singly or in up to 23 tandem repeats in very large proteins in the genus Vibrio, and in related species such as Legionella pneumophila, Photobacterium profundum, Rhodopseudomonas palustris, Shewanella pealeana, and Aeromonas hydrophila. Proteins with these domains represent a subset of a broader set of proteins with a particular signal for type 1 secretion, consisting of several glycine-rich repeats modeled by pfam00353, followed by a C-terminal domain modeled by TIGR03661. Proteins with this domain tend to share several properties with the RtxA (Repeats in Toxin) protein of Vibrio cholerae, including a large size often containing tandemly repeated domains and a C-terminal signal for type 1 secretion.
Probab=52.35 E-value=88 Score=22.65 Aligned_cols=43 Identities=16% Similarity=0.255 Sum_probs=31.1
Q ss_pred EEEEEEEEeCCCCceEEEEEEEEeeCCCCCCccCCCceEEEEecCCC
Q psy9046 106 KVVLVSAEDASTPCSHFRVNIKVLDVNDNRPEFKSKTYRTSIQENII 152 (215)
Q Consensus 106 ~~l~v~a~D~~~~~~~~~v~I~V~DvNdn~P~f~~~~~~~~v~E~~~ 152 (215)
+.|.|.|+|..+-.....+.|+|.| | .|...... ...|.|...
T Consensus 87 l~~~v~a~D~DGD~s~~~l~VtI~D--D-~P~~~~~~-~~~V~E~~L 129 (137)
T TIGR03660 87 LNFPIIATDFDGDTSSITLPVTIVD--D-VPTITDVD-ALTVDEDDL 129 (137)
T ss_pred EeeeEEEEeCCCCccccEEEEEEEC--C-CCeecccc-ceEEecccc
Confidence 4478999998776666788888887 6 57776543 478888654
No 37
>PF13750 Big_3_3: Bacterial Ig-like domain (group 3)
Probab=47.56 E-value=58 Score=24.13 Aligned_cols=32 Identities=19% Similarity=0.146 Sum_probs=24.9
Q ss_pred CCCCCeEEEEEEEEeCCCCceEEEEEEEEeeC
Q psy9046 100 NSPGDTKVVLVSAEDASTPCSHFRVNIKVLDV 131 (215)
Q Consensus 100 ~~~~~~~~l~v~a~D~~~~~~~~~v~I~V~Dv 131 (215)
-+..+.|.|+|.|.|..+...+.++.+.....
T Consensus 119 le~~~~YtLtV~a~D~aGN~~~~si~F~y~P~ 150 (158)
T PF13750_consen 119 LEADDSYTLTVSATDKAGNQSTKSISFSYMPP 150 (158)
T ss_pred cCCCCeEEEEEEEEecCCCEEEEEEEEEEeCC
Confidence 34556677999999998888888888877643
No 38
>PRK14081 triple tyrosine motif-containing protein; Provisional
Probab=47.39 E-value=2.4e+02 Score=26.27 Aligned_cols=32 Identities=16% Similarity=0.224 Sum_probs=22.1
Q ss_pred CCCCeEEEEEEEEeCCCC---ceEEEEEEEEeeCC
Q psy9046 101 SPGDTKVVLVSAEDASTP---CSHFRVNIKVLDVN 132 (215)
Q Consensus 101 ~~~~~~~l~v~a~D~~~~---~~~~~v~I~V~DvN 132 (215)
.+.|.|.|.+.|.|.... -..+.+...|....
T Consensus 155 ~~~G~Y~ll~~~Kd~~S~~~fDD~~~v~y~Vk~~~ 189 (667)
T PRK14081 155 NKPGKYELLVECKRIDSTKDFDDFKKVKFKVKEID 189 (667)
T ss_pred cCCCcEEEEEEEecCCCccccCcceEEEEEcccCc
Confidence 366888888999987665 45566666665554
No 39
>TIGR00845 caca sodium/calcium exchanger 1. This model is specific for the eukaryotic sodium ion/calcium ion exchangers of the Caca family
Probab=43.49 E-value=3.2e+02 Score=26.60 Aligned_cols=28 Identities=21% Similarity=0.329 Sum_probs=18.5
Q ss_pred EEEEEEEecCCCCCccCCCCCCCCCceEEEEecCC
Q psy9046 3 TLIISIIDVNDIPPEFEPPWTRDEPNYLLEIEEEQ 37 (215)
Q Consensus 3 ~v~I~V~DvND~~P~f~~~~~~~~~~~~~~V~E~~ 37 (215)
+.+|+|.| ||++|.|. -..-..+|.|+.
T Consensus 516 ~ATVTIlD-DD~aGIfs------Fe~~~~sV~Es~ 543 (928)
T TIGR00845 516 TATVTILD-DDHAGIFT------FEEDVFHVSESI 543 (928)
T ss_pred eEEEEEec-CcccCccc------ccCceEEEEcCC
Confidence 45677777 67888873 333456788874
No 40
>PRK13211 N-acetylglucosamine-binding protein A; Reviewed
Probab=37.73 E-value=3e+02 Score=24.56 Aligned_cols=33 Identities=6% Similarity=0.076 Sum_probs=24.5
Q ss_pred cCCCCCeEEEEEEEEeCCCC-ceEEEEEEEEeeC
Q psy9046 99 ENSPGDTKVVLVSAEDASTP-CSHFRVNIKVLDV 131 (215)
Q Consensus 99 e~~~~~~~~l~v~a~D~~~~-~~~~~v~I~V~Dv 131 (215)
.+...|.|.|.++++|..+. ....++.+.|...
T Consensus 374 s~~~AG~y~Lvv~~t~~dG~~~~q~~~~~~v~~~ 407 (478)
T PRK13211 374 SKLKAGHHMLVVKAKPKDGELIKQQTLDFMLEAK 407 (478)
T ss_pred ccCCCceEEEEEEEEeCCCceeeeeeEEEEEEec
Confidence 55667999999999987655 3467788888643
No 41
>cd02848 Chitinase_N_term Chitinase N-terminus domain. Chitinases hydrolyze the abundant natural biopolymer chitin, producing smaller chito-oligosaccharides. Chitin consists of multiple N-acetyl-D-glucosamine (NAG) residues connected via beta-1,4-glycosidic linkages and is an important structural element of fungal cell wall and arthropod exoskeletons. On the basis of the mode of chitin hydrolysis, chitinases are classified as random, endo-, and exo-chitinases and based on sequence criteria, chitinases belong to families 18 and 19 of glycosyl hydrolases. The N-terminus of chitinase may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitob
Probab=36.14 E-value=61 Score=22.36 Aligned_cols=29 Identities=17% Similarity=0.100 Sum_probs=20.7
Q ss_pred CCCeEEEEEEEEeCCCCceEEEEEEEEee
Q psy9046 102 PGDTKVVLVSAEDASTPCSHFRVNIKVLD 130 (215)
Q Consensus 102 ~~~~~~l~v~a~D~~~~~~~~~v~I~V~D 130 (215)
+.|.|..+|++.|..+-+....+.|.|-|
T Consensus 78 kgG~y~m~V~lCn~dGCS~S~~~~I~VAD 106 (106)
T cd02848 78 KGGRYQMQVALCNGDGCSTSAAKEIVVAD 106 (106)
T ss_pred CCCeEEEEEEEECCCCccCcCCEEEEecC
Confidence 44556689999998776666677777654
No 42
>PF08329 ChitinaseA_N: Chitinase A, N-terminal domain; InterPro: IPR013540 This domain is found in a number of bacterial chitinases and similar viral proteins. It is organised into a fibronectin III module domain-like fold, comprising only beta strands. Its function is not known, but it may be involved in interaction with the enzyme substrate, chitin [, ]. It is separated by a hinge region from the catalytic domain (IPR001223 from INTERPRO); this hinge region is probably mobile, allowing the N-terminal domain to have different relative positions in solution []. ; GO: 0004568 chitinase activity; PDB: 2WLY_A 1EDQ_A 2WM0_A 1X6N_A 1NH6_A 2WK2_A 1EHN_A 2WLZ_A 1EIB_A 1FFR_A ....
Probab=35.01 E-value=62 Score=23.35 Aligned_cols=31 Identities=19% Similarity=0.170 Sum_probs=20.6
Q ss_pred CCCeEEEEEEEEeCCCCceEEEEEEEEeeCC
Q psy9046 102 PGDTKVVLVSAEDASTPCSHFRVNIKVLDVN 132 (215)
Q Consensus 102 ~~~~~~l~v~a~D~~~~~~~~~v~I~V~DvN 132 (215)
+.|.|..+|++.+..+-+....+.|.|.|..
T Consensus 81 ~gG~y~~~VeLCN~~GCS~S~~~~V~VaDTD 111 (133)
T PF08329_consen 81 KGGRYQMQVELCNADGCSTSAPVEVVVADTD 111 (133)
T ss_dssp S-EEEEEEEEEEETTEEEE---EEEEEE-TT
T ss_pred CCCEEEEEEEEECCCCcccCCCEEEEEeCCC
Confidence 4566668999999877667778888888875
No 43
>PF07550 DUF1533: Protein of unknown function (DUF1533); InterPro: IPR011432 This domain is found duplicated in proteins of unknown function. The proteins typically also contain leucine-rich repeats.
Probab=25.70 E-value=1.4e+02 Score=18.29 Aligned_cols=33 Identities=12% Similarity=0.351 Sum_probs=22.5
Q ss_pred CCCcEEEcCCeeEEEcccCCCCccEEEEEEEEE
Q psy9046 182 DDDVFAVDDTGILTIQGPLAKGKTVYNLLVQVS 214 (215)
Q Consensus 182 ~~~~F~Id~~G~i~~~~~ld~~~~~y~l~V~a~ 214 (215)
....|.++..|.|.+...+-.+.+.|.|.|.|.
T Consensus 24 ~~~~y~~~~~~~l~i~~~~f~~~G~~~I~I~A~ 56 (65)
T PF07550_consen 24 GNDKYSISSKGSLKIKASAFNKDGENTIVIKAT 56 (65)
T ss_pred ccccEEeccCCcEEEcHHHcCcCCceEEEEEeC
Confidence 345677766444777766554567899999874
Done!