Query psy10256
Match_columns 96
No_of_seqs 183 out of 1176
Neff 9.3
Searched_HMMs 46136
Date Fri Aug 16 21:49:54 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy10256.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/10256hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 cd00031 CA Cadherin repeat dom 99.8 2.1E-19 4.5E-24 113.1 13.0 95 1-95 53-153 (199)
2 KOG4289|consensus 99.8 6.7E-20 1.5E-24 138.0 9.9 93 2-94 326-424 (2531)
3 KOG4289|consensus 99.8 1.2E-19 2.7E-24 136.6 7.9 87 2-88 841-931 (2531)
4 KOG1219|consensus 99.8 2.3E-18 5E-23 133.4 11.6 92 1-94 2633-2728(4289)
5 KOG1219|consensus 99.7 1.9E-16 4E-21 123.2 10.6 94 1-94 696-793 (4289)
6 smart00112 CA Cadherin repeats 99.5 3.3E-13 7.2E-18 73.9 7.1 47 2-48 33-79 (79)
7 KOG1834|consensus 99.5 7.6E-13 1.6E-17 95.0 9.7 94 2-95 95-199 (952)
8 PF00028 Cadherin: Cadherin do 98.7 9.4E-08 2E-12 53.6 6.9 40 2-41 53-93 (93)
9 PF00028 Cadherin: Cadherin do 98.5 4.1E-07 8.8E-12 51.0 4.8 40 55-94 1-46 (93)
10 cd00031 CA Cadherin repeat dom 97.7 0.00027 5.8E-09 44.2 6.5 41 2-42 159-199 (199)
11 smart00736 CADG Dystroglycan-t 91.2 1.6 3.5E-05 24.4 5.9 31 13-45 66-96 (97)
12 KOG1834|consensus 90.8 0.5 1.1E-05 35.7 4.3 41 2-42 204-244 (952)
13 PF08758 Cadherin_pro: Cadheri 87.0 1.3 2.8E-05 24.9 3.4 24 47-70 3-26 (90)
14 TIGR03660 T1SS_rpt_143 T1SS-14 86.0 5.8 0.00012 24.1 6.7 55 2-62 70-127 (137)
15 PF07495 Y_Y_Y: Y_Y_Y domain; 80.5 5.9 0.00013 20.1 4.1 28 13-40 38-65 (66)
16 TIGR00845 caca sodium/calcium 71.9 36 0.00079 27.2 7.6 29 34-63 515-543 (928)
17 PF08266 Cadherin_2: Cadherin- 67.4 9.3 0.0002 21.1 2.9 23 55-77 3-26 (84)
18 PF13750 Big_3_3: Bacterial Ig 66.5 28 0.00061 21.5 5.5 29 11-41 120-148 (158)
19 COG4288 Uncharacterized protei 54.8 30 0.00064 20.3 3.5 62 15-78 44-111 (124)
20 cd00146 PKD polycystic kidney 50.8 36 0.00078 17.8 4.2 17 10-26 53-69 (81)
21 smart00089 PKD Repeats in poly 50.6 36 0.00078 17.7 4.8 28 10-40 51-78 (79)
22 PF02494 HYR: HYR domain; Int 50.1 39 0.00084 17.9 4.3 13 34-46 59-71 (81)
23 PF03160 Calx-beta: Calx-beta 48.3 46 0.001 18.3 4.9 25 37-63 2-26 (100)
24 TIGR01965 VCBS_repeat VCBS rep 47.0 55 0.0012 18.7 4.4 40 15-62 58-97 (99)
25 cd02848 Chitinase_N_term Chiti 44.8 42 0.00091 19.5 3.1 24 8-31 74-97 (106)
26 PF08329 ChitinaseA_N: Chitina 40.0 86 0.0019 19.0 4.2 33 10-44 79-111 (133)
27 PF05688 DUF824: Salmonella re 38.2 53 0.0012 16.1 2.6 15 33-47 14-28 (47)
28 PF12245 Big_3_2: Bacterial Ig 33.4 71 0.0015 16.1 5.7 29 13-43 22-50 (60)
29 PF15418 DUF4625: Domain of un 32.9 1.1E+02 0.0025 18.3 4.6 12 15-26 108-119 (132)
30 KOG3597|consensus 31.2 2.1E+02 0.0047 21.0 7.8 54 32-86 24-81 (442)
No 1
>cd00031 CA Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here
Probab=99.84 E-value=2.1e-19 Score=113.05 Aligned_cols=95 Identities=28% Similarity=0.345 Sum_probs=86.3
Q ss_pred CEEeCCCCCccCCCEEEEEEEEEeCCCCCceEEEEEEEEEecCCCCCCeEeccceEEEEecCCCCCeeEE----EcCCCC
Q psy10256 1 ELKTARGLDREKIARYTLNAHVQDRDQPAWECVSLITLTLTDINDNAPEFASVINSASILENAEIGSLST----VNATIG 76 (96)
Q Consensus 1 ~i~~~~~LDrE~~~~~~l~v~a~d~~~~~~~~~~~v~i~v~d~Ndn~P~f~~~~~~~~v~E~~~~g~~v~----~D~D~~ 76 (96)
+|++.++||||....|.|.+.|.|.+.+...+...+.|.|.|+|||+|.|....|.+.+.|+.++|+.++ +|+|.+
T Consensus 53 ~l~~~~~lD~e~~~~~~l~v~a~D~g~~~~~~~~~v~I~V~d~Nd~~P~~~~~~~~~~v~e~~~~~~~i~~~~a~D~D~~ 132 (199)
T cd00031 53 VITTTKPLDREEQSEYTLTVVASDGGGPPLSSTATVTVTVLDVNDNPPVFEQSSYEASVPENAPPGTVVGTVTATDADSG 132 (199)
T ss_pred EEEECCCCCCcCCceEEEEEEEEECCcCcceeEEEEEEEEccCCCCCCcccccceEEEEeCCCCCCCEEEEEEEEcCCCC
Confidence 3678889999999999999999998777777899999999999999999999999999999999999998 999999
Q ss_pred CCceEEEEEcCCC--ceEecC
Q psy10256 77 VNRCGHTFRGKDL--RCYNLF 95 (96)
Q Consensus 77 ~n~~i~y~i~~~~--~~f~if 95 (96)
.|+.+.|+|.+.. ..|.|-
T Consensus 133 ~~~~~~y~l~~~~~~~~f~i~ 153 (199)
T cd00031 133 ENAKLTYSILSGNDKELFSID 153 (199)
T ss_pred CCccEEEEEeCCCCCCEEEEe
Confidence 9999999997666 677763
No 2
>KOG4289|consensus
Probab=99.82 E-value=6.7e-20 Score=137.95 Aligned_cols=93 Identities=23% Similarity=0.341 Sum_probs=86.0
Q ss_pred EEeCCCCCccCCCEEEEEEEEEeCCCCCceEEEEEEEEEecCCCCCCeEeccceEEEEecCCCCCeeEE----EcCCCCC
Q psy10256 2 LKTARGLDREKIARYTLNAHVQDRDQPAWECVSLITLTLTDINDNAPEFASVINSASILENAEIGSLST----VNATIGV 77 (96)
Q Consensus 2 i~~~~~LDrE~~~~~~l~v~a~d~~~~~~~~~~~v~i~v~d~Ndn~P~f~~~~~~~~v~E~~~~g~~v~----~D~D~~~ 77 (96)
|.+..+||||+...|.|.|.|.|.|.++...++.|.|+|.|.|||+|+|....|.++|.|+..++++|. +|+|.|.
T Consensus 326 I~T~a~lDRE~~~~y~L~VeAsDqG~~pgp~Ta~V~itV~D~NDNaPqFse~~Yvvqv~Edvt~~avvlrV~AtDrD~g~ 405 (2531)
T KOG4289|consen 326 ISTRAPLDREELESYQLDVEASDQGRPPGPRTAMVEITVEDENDNAPQFSEKRYVVQVREDVTPPAVVLRVTATDRDKGT 405 (2531)
T ss_pred eeccCccCHHhhhheEEEEEeccCCCCCCCceEEEEEEEEecCCCCccccccceEEEecccCCCCceEEEEEecccCCCc
Confidence 678899999999999999999999987766699999999999999999999999999999999999998 9999999
Q ss_pred CceEEEEEcCCC--ceEec
Q psy10256 78 NRCGHTFRGKDL--RCYNL 94 (96)
Q Consensus 78 n~~i~y~i~~~~--~~f~i 94 (96)
|++|+|+|.+.+ +.|.|
T Consensus 406 Ng~VHYsi~Sgn~~G~f~i 424 (2531)
T KOG4289|consen 406 NGKVHYSIASGNGRGQFYI 424 (2531)
T ss_pred CceEEEEeeccCccccEEE
Confidence 999999996554 77765
No 3
>KOG4289|consensus
Probab=99.80 E-value=1.2e-19 Score=136.55 Aligned_cols=87 Identities=34% Similarity=0.381 Sum_probs=83.3
Q ss_pred EEeCCCCCccCCCEEEEEEEEEeCCCCCceEEEEEEEEEecCCCCCCeEeccceEEEEecCCCCCeeEE----EcCCCCC
Q psy10256 2 LKTARGLDREKIARYTLNAHVQDRDQPAWECVSLITLTLTDINDNAPEFASVINSASILENAEIGSLST----VNATIGV 77 (96)
Q Consensus 2 i~~~~~LDrE~~~~~~l~v~a~d~~~~~~~~~~~v~i~v~d~Ndn~P~f~~~~~~~~v~E~~~~g~~v~----~D~D~~~ 77 (96)
|++.+.||||..+.|.|.+.|+|.|.|++++...|+|+|.|+|||+|.|.+..|...|.||.++|+.|. .|+|+|.
T Consensus 841 iRtl~rLdRE~~avy~L~a~avDrg~p~ls~~~eItvtvldvNDnaPvfe~~e~e~~I~enspvgs~va~i~a~dpdEG~ 920 (2531)
T KOG4289|consen 841 IRTLRRLDRENVAVYVLAAYAVDRGNPPLSAPVEITVTVLDVNDNAPVFEQDELELFIEENSPVGSVVALITADDPDEGP 920 (2531)
T ss_pred eehhhhhcchheeEEEEEEEEeeCCCCCcCCceEEEEEEEecCCCCCCCCCcceeeEEeecCccceeeEEEEccCCCcCC
Confidence 678889999999999999999999999999999999999999999999999999999999999999998 8999999
Q ss_pred CceEEEEEcCC
Q psy10256 78 NRCGHTFRGKD 88 (96)
Q Consensus 78 n~~i~y~i~~~ 88 (96)
|++|.|+|.+.
T Consensus 921 NA~IsYqIvgg 931 (2531)
T KOG4289|consen 921 NAHISYQIVGG 931 (2531)
T ss_pred cceEEEeeccC
Confidence 99999999644
No 4
>KOG1219|consensus
Probab=99.78 E-value=2.3e-18 Score=133.36 Aligned_cols=92 Identities=21% Similarity=0.228 Sum_probs=84.6
Q ss_pred CEEeCCCCCccCCCEEEEEEEEEeCCCCCceEEEEEEEEEecCCCCCCeEeccceEEEEecCCCCCeeEE----EcCCCC
Q psy10256 1 ELKTARGLDREKIARYTLNAHVQDRDQPAWECVSLITLTLTDINDNAPEFASVINSASILENAEIGSLST----VNATIG 76 (96)
Q Consensus 1 ~i~~~~~LDrE~~~~~~l~v~a~d~~~~~~~~~~~v~i~v~d~Ndn~P~f~~~~~~~~v~E~~~~g~~v~----~D~D~~ 76 (96)
.|++.++||+|++.+|.+.|.|++.+. .-+...|.|.|.|+|||+|.|....|.+.+.||.|+|+.|+ .|+|.|
T Consensus 2633 ~i~v~ksLD~E~kk~yqi~v~a~~~~~--vva~tsv~vqVkDvNDNaPvFe~d~y~f~i~En~pvGtsV~qf~AsD~Ds~ 2710 (4289)
T KOG1219|consen 2633 MIKVNKSLDHEKKKSYQIKVKATCGQW--VVAETSVFVQVKDVNDNAPVFEKDPYLFIIEENSPVGTSVIQFHASDMDSG 2710 (4289)
T ss_pred eEEeccccchhhhceEEEEEEeecCCc--eEEEEEEEEEeecccCCCccccCCceeEEEeccCCCCceEEEEEeeccCCC
Confidence 378899999999999999999998763 46678999999999999999999999999999999999998 999999
Q ss_pred CCceEEEEEcCCCceEec
Q psy10256 77 VNRCGHTFRGKDLRCYNL 94 (96)
Q Consensus 77 ~n~~i~y~i~~~~~~f~i 94 (96)
.||+|+|++..+...|.|
T Consensus 2711 ~nGqirysl~~~v~yF~I 2728 (4289)
T KOG1219|consen 2711 NNGQIRYSLTSPVPYFAI 2728 (4289)
T ss_pred CCceEEEEEcCCcceEEE
Confidence 999999999988777765
No 5
>KOG1219|consensus
Probab=99.70 E-value=1.9e-16 Score=123.17 Aligned_cols=94 Identities=23% Similarity=0.290 Sum_probs=89.5
Q ss_pred CEEeCCCCCccCCCEEEEEEEEEeCCCCCceEEEEEEEEEecCCCCCCeEeccceEEEEecCCCCCeeEE----EcCCCC
Q psy10256 1 ELKTARGLDREKIARYTLNAHVQDRDQPAWECVSLITLTLTDINDNAPEFASVINSASILENAEIGSLST----VNATIG 76 (96)
Q Consensus 1 ~i~~~~~LDrE~~~~~~l~v~a~d~~~~~~~~~~~v~i~v~d~Ndn~P~f~~~~~~~~v~E~~~~g~~v~----~D~D~~ 76 (96)
+|++.++||+|....|.|.+.|.|.|.|..++.....|.|.|.|||+|.|.+..|.+.|+|+..+|+.|. .|.|.|
T Consensus 696 ~itvas~ld~~~t~~yiLnvta~D~gtPqkss~r~l~v~vkd~ndn~p~f~e~sy~vtvsedtepgs~Ia~vetnd~D~g 775 (4289)
T KOG1219|consen 696 NITVASPLDNENTEQYILNVTAYDLGTPQKSSWRLLLVFVKDYNDNTPIFVERSYHVTVSEDTEPGSFIAHVETNDTDGG 775 (4289)
T ss_pred eEEEecchhhhhhheeEEEEEEecCCCchhhceeeEEEEEEecccCCccccccceEEEEecCCCCCceEEEEEecccCCC
Confidence 4788999999999999999999999999998999999999999999999999999999999999999998 899999
Q ss_pred CCceEEEEEcCCCceEec
Q psy10256 77 VNRCGHTFRGKDLRCYNL 94 (96)
Q Consensus 77 ~n~~i~y~i~~~~~~f~i 94 (96)
.||++.|+|++....|+|
T Consensus 776 ~NG~v~fsL~n~sdvfsI 793 (4289)
T KOG1219|consen 776 NNGMVSFSLLNKSDVFSI 793 (4289)
T ss_pred CCceEEEEecCCcceEEe
Confidence 999999999998888876
No 6
>smart00112 CA Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
Probab=99.47 E-value=3.3e-13 Score=73.94 Aligned_cols=47 Identities=38% Similarity=0.639 Sum_probs=44.5
Q ss_pred EEeCCCCCccCCCEEEEEEEEEeCCCCCceEEEEEEEEEecCCCCCC
Q psy10256 2 LKTARGLDREKIARYTLNAHVQDRDQPAWECVSLITLTLTDINDNAP 48 (96)
Q Consensus 2 i~~~~~LDrE~~~~~~l~v~a~d~~~~~~~~~~~v~i~v~d~Ndn~P 48 (96)
|++.++||||..+.|.|.|.|.|.+.+++++.+.|+|+|.|+|||+|
T Consensus 33 i~~~~~LD~e~~~~y~l~v~a~D~~~~~~~~~~~v~I~V~D~Nd~~P 79 (79)
T smart00112 33 ITTTKPLDREEQPEYTLTVEATDGGGPPLSSTATVTVTVLDVNDNAP 79 (79)
T ss_pred EEeCCccCeeCCCeEEEEEEEEECCCCCcccEEEEEEEEEECCCCCC
Confidence 67778999999999999999999998889999999999999999998
No 7
>KOG1834|consensus
Probab=99.46 E-value=7.6e-13 Score=95.03 Aligned_cols=94 Identities=16% Similarity=0.121 Sum_probs=80.1
Q ss_pred EEeCCCCCccCCCEEEEEEEEEeCCCCC------ceEEEEEEEEEecCCCCCCeEeccceEEEEecCCCCCeeEE---Ec
Q psy10256 2 LKTARGLDREKIARYTLNAHVQDRDQPA------WECVSLITLTLTDINDNAPEFASVINSASILENAEIGSLST---VN 72 (96)
Q Consensus 2 i~~~~~LDrE~~~~~~l~v~a~d~~~~~------~~~~~~v~i~v~d~Ndn~P~f~~~~~~~~v~E~~~~g~~v~---~D 72 (96)
|++..+||.|.++.|+++|+|.|+|..+ .+..++|.|+|.|+|+.+|.|..+.|.+.|.|.....++.. .|
T Consensus 95 lRaK~~lDCelqkeytf~iQAydCg~gpdgtn~kKShkatvhIrVkDvNe~AP~f~ep~Yka~V~EGK~yd~il~veAiD 174 (952)
T KOG1834|consen 95 LRAKEPLDCELQKEYTFTIQAYDCGNGPDGTNTKKSHKATVHIRVKDVNEFAPVFKEPWYKAHVTEGKVYDSILRVEAID 174 (952)
T ss_pred EeecCcccccccccceEEEEEEecCCCCCccccccccceEEEEEeccccccCchhcccceeeEEecceeeeeeEEEEeec
Confidence 6788999999999999999999998743 47788999999999999999999999999999965555544 89
Q ss_pred CCCCC-CceE-EEEEcCCCceEecC
Q psy10256 73 ATIGV-NRCG-HTFRGKDLRCYNLF 95 (96)
Q Consensus 73 ~D~~~-n~~i-~y~i~~~~~~f~if 95 (96)
.|-++ +++| .|.|....-.|.|-
T Consensus 175 ~DCspq~sqIC~YEI~t~d~PFaId 199 (952)
T KOG1834|consen 175 KDCSPQYSQICEYEITTPDVPFAID 199 (952)
T ss_pred CCCCCcccceeEEEecCCCCceEEc
Confidence 99776 8887 69998777777664
No 8
>PF00028 Cadherin: Cadherin domain; InterPro: IPR002126 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion.; GO: 0005509 calcium ion binding, 0007156 homophilic cell adhesion, 0016020 membrane; PDB: 2A4E_A 2A4C_B 2O72_A 2QVI_A 1NCJ_A 3Q2W_A 3Q2N_A 3LNH_B 3LNI_A 3Q2L_A ....
Probab=98.73 E-value=9.4e-08 Score=53.59 Aligned_cols=40 Identities=23% Similarity=0.443 Sum_probs=36.8
Q ss_pred EEeCCCCCccCCCEEEEEEEEEeC-CCCCceEEEEEEEEEe
Q psy10256 2 LKTARGLDREKIARYTLNAHVQDR-DQPAWECVSLITLTLT 41 (96)
Q Consensus 2 i~~~~~LDrE~~~~~~l~v~a~d~-~~~~~~~~~~v~i~v~ 41 (96)
|++.++||||+...|.|.|.|+|. +.+++++.+.|.|+|+
T Consensus 53 i~~~~~LD~E~~~~y~l~v~a~D~~~~~~~~~~~~V~I~V~ 93 (93)
T PF00028_consen 53 ISLKKPLDRETQSSYQLTVRATDSGGSPPLSSTATVTINVL 93 (93)
T ss_dssp EEESSSSCTTTTSEEEEEEEEEETTTSSEEEEEEEEEEEEE
T ss_pred cccceecCcccCCEEEEEEEEEECCCCCCCEEEEEEEEEEC
Confidence 788999999999999999999999 7788999999999874
No 9
>PF00028 Cadherin: Cadherin domain; InterPro: IPR002126 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion.; GO: 0005509 calcium ion binding, 0007156 homophilic cell adhesion, 0016020 membrane; PDB: 2A4E_A 2A4C_B 2O72_A 2QVI_A 1NCJ_A 3Q2W_A 3Q2N_A 3LNH_B 3LNI_A 3Q2L_A ....
Probab=98.47 E-value=4.1e-07 Score=50.96 Aligned_cols=40 Identities=20% Similarity=0.075 Sum_probs=35.6
Q ss_pred eEEEEecCCCCCeeEE----EcCCCCCCceEEEEEcCCC--ceEec
Q psy10256 55 NSASILENAEIGSLST----VNATIGVNRCGHTFRGKDL--RCYNL 94 (96)
Q Consensus 55 ~~~~v~E~~~~g~~v~----~D~D~~~n~~i~y~i~~~~--~~f~i 94 (96)
|.+.|+|+.++|+.|+ .|+|.+.|+.+.|+|.++. ..|.|
T Consensus 1 Y~~~v~E~~~~g~~v~~v~a~D~D~~~n~~i~y~i~~~~~~~~F~I 46 (93)
T PF00028_consen 1 YSFSVPENAPPGTVVGQVTATDPDSGPNSQITYSILGGNPDGLFSI 46 (93)
T ss_dssp EEEEEETTGSTSSEEEEEEEEESSTSTTSSEEEEEEETTSTTSEEE
T ss_pred CEEEEECCCCCCCEEEEEEEEeCCCCCCceEEEEEecCcccCceEE
Confidence 7899999999999998 9999999999999996554 77776
No 10
>cd00031 CA Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here
Probab=97.66 E-value=0.00027 Score=44.18 Aligned_cols=41 Identities=29% Similarity=0.470 Sum_probs=36.4
Q ss_pred EEeCCCCCccCCCEEEEEEEEEeCCCCCceEEEEEEEEEec
Q psy10256 2 LKTARGLDREKIARYTLNAHVQDRDQPAWECVSLITLTLTD 42 (96)
Q Consensus 2 i~~~~~LDrE~~~~~~l~v~a~d~~~~~~~~~~~v~i~v~d 42 (96)
|.+.+.||||....|.+.+.|.|.+.+.+++...++|.|.|
T Consensus 159 i~~~~~ld~e~~~~~~l~v~a~D~~~~~~~~~~~i~i~v~d 199 (199)
T cd00031 159 ITLAKPLDREEKSSYELTVVATDGGGPPLSSTATVTVTVLD 199 (199)
T ss_pred EEeCCccCCccCceEEEEEEEEECCCCCceeEEEEEEEEEC
Confidence 66778999999999999999999988778888889888875
No 11
>smart00736 CADG Dystroglycan-type cadherin-like domains. Cadherin-homologous domains present in metazoan dystroglycans and alpha/epsilon sarcoglycans, yeast Axl2p and in a very large protein from magnetotactic bacteria. Likely to bind calcium ions.
Probab=91.20 E-value=1.6 Score=24.40 Aligned_cols=31 Identities=19% Similarity=0.320 Sum_probs=25.0
Q ss_pred CCEEEEEEEEEeCCCCCceEEEEEEEEEecCCC
Q psy10256 13 IARYTLNAHVQDRDQPAWECVSLITLTLTDIND 45 (96)
Q Consensus 13 ~~~~~l~v~a~d~~~~~~~~~~~v~i~v~d~Nd 45 (96)
...|.+.+.|+|..+ .+....++|.|.+.|+
T Consensus 66 ~g~~~i~v~a~D~~g--~~~~~~f~i~V~~~~~ 96 (97)
T smart00736 66 VGSLSLKVTATDSSG--ASASDTFTITVVNTND 96 (97)
T ss_pred CcEEEEEEEEEECCC--CEEEEEEEEEEeCCCC
Confidence 456999999999873 5667788999988886
No 12
>KOG1834|consensus
Probab=90.83 E-value=0.5 Score=35.71 Aligned_cols=41 Identities=20% Similarity=0.270 Sum_probs=33.3
Q ss_pred EEeCCCCCccCCCEEEEEEEEEeCCCCCceEEEEEEEEEec
Q psy10256 2 LKTARGLDREKIARYTLNAHVQDRDQPAWECVSLITLTLTD 42 (96)
Q Consensus 2 i~~~~~LDrE~~~~~~l~v~a~d~~~~~~~~~~~v~i~v~d 42 (96)
|+.+.+|.|-+...|.|+|.|.|+|.........|+|+|..
T Consensus 204 irnTekLny~ke~~Y~ltVtAyDCg~kraa~d~lV~v~Vkp 244 (952)
T KOG1834|consen 204 IRNTEKLNYTKEHQYKLTVTAYDCGKKRAASDSLVTVHVKP 244 (952)
T ss_pred cccccccccccceeEEEEEEEEecccccccCcceEEEEecC
Confidence 56678899999999999999999997655555677788744
No 13
>PF08758 Cadherin_pro: Cadherin prodomain like; InterPro: IPR014868 Cadherins are a group of proteins that mediate calcium dependent cell-cell adhesion. They are activated through cleavage of a prosequence in the late Golgi. This protein corresponds to the folded region of the prosequence, and is termed the prodomain. The prodomain shows structural resemblance to the cadherin domain, but lacks all the features known to be important for cadherin-cadherin interactions []. ; GO: 0007155 cell adhesion, 0016021 integral to membrane; PDB: 1OP4_A.
Probab=86.95 E-value=1.3 Score=24.87 Aligned_cols=24 Identities=21% Similarity=0.229 Sum_probs=9.8
Q ss_pred CCeEeccceEEEEecCCCCCeeEE
Q psy10256 47 APEFASVINSASILENAEIGSLST 70 (96)
Q Consensus 47 ~P~f~~~~~~~~v~E~~~~g~~v~ 70 (96)
.|-|.+..|.+.|+.+...|..|+
T Consensus 3 ~pGF~~~~~~~~Vp~~l~~g~~lg 26 (90)
T PF08758_consen 3 RPGFSQKKYTFEVPSNLEAGQPLG 26 (90)
T ss_dssp --B--S-EEEE----SS-SS--EE
T ss_pred cCCcccceEEEEcCchhhCCcEEE
Confidence 477888888888888888888887
No 14
>TIGR03660 T1SS_rpt_143 T1SS-143 repeat domain. This model represents a domain of about 143 amino acids that may occur singly or in up to 23 tandem repeats in very large proteins in the genus Vibrio, and in related species such as Legionella pneumophila, Photobacterium profundum, Rhodopseudomonas palustris, Shewanella pealeana, and Aeromonas hydrophila. Proteins with these domains represent a subset of a broader set of proteins with a particular signal for type 1 secretion, consisting of several glycine-rich repeats modeled by pfam00353, followed by a C-terminal domain modeled by TIGR03661. Proteins with this domain tend to share several properties with the RtxA (Repeats in Toxin) protein of Vibrio cholerae, including a large size often containing tandemly repeated domains and a C-terminal signal for type 1 secretion.
Probab=85.95 E-value=5.8 Score=24.05 Aligned_cols=55 Identities=18% Similarity=0.196 Sum_probs=34.3
Q ss_pred EEeCCCCCccC---CCEEEEEEEEEeCCCCCceEEEEEEEEEecCCCCCCeEeccceEEEEecC
Q psy10256 2 LKTARGLDREK---IARYTLNAHVQDRDQPAWECVSLITLTLTDINDNAPEFASVINSASILEN 62 (96)
Q Consensus 2 i~~~~~LDrE~---~~~~~l~v~a~d~~~~~~~~~~~v~i~v~d~Ndn~P~f~~~~~~~~v~E~ 62 (96)
+.+..+||... .-...|.+.|+|..+-.. ...+.|+|.| |. |...... ...|.|+
T Consensus 70 ftL~~~lDH~~g~d~l~l~~~v~a~D~DGD~s--~~~l~VtI~D--D~-P~~~~~~-~~~V~E~ 127 (137)
T TIGR03660 70 FTLEGPLDHAAGSDELTLNFPIIATDFDGDTS--SITLPVTIVD--DV-PTITDVD-ALTVDED 127 (137)
T ss_pred EEEcccccCCCCCceEEEeeeEEEEeCCCCcc--ccEEEEEEEC--CC-Ceecccc-ceEEecc
Confidence 45677888744 234567788888774333 3477778776 54 6665433 3677774
No 15
>PF07495 Y_Y_Y: Y_Y_Y domain; InterPro: IPR011123 This region is mostly found at the end of the beta propellers (IPR011110 from INTERPRO) in a family of two component regulators. However they are also found tandemly repeated in Q891H4 from SWISSPROT without other signal conduction domains being present. It is named after the conserved tyrosines found in the alignment. The exact function is not known.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=80.49 E-value=5.9 Score=20.09 Aligned_cols=28 Identities=21% Similarity=0.279 Sum_probs=16.0
Q ss_pred CCEEEEEEEEEeCCCCCceEEEEEEEEE
Q psy10256 13 IARYTLNAHVQDRDQPAWECVSLITLTL 40 (96)
Q Consensus 13 ~~~~~l~v~a~d~~~~~~~~~~~v~i~v 40 (96)
...|+|.|.|.+..+........+.|+|
T Consensus 38 ~G~Y~l~V~a~~~~~~~~~~~~~l~i~I 65 (66)
T PF07495_consen 38 PGKYTLEVRAKDNNGKWSSDEKSLTITI 65 (66)
T ss_dssp SEEEEEEEEEEETTS-B-SS-EEEEEEE
T ss_pred CEEEEEEEEEECCCCCcCcccEEEEEEE
Confidence 3468899999987654322224555554
No 16
>TIGR00845 caca sodium/calcium exchanger 1. This model is specific for the eukaryotic sodium ion/calcium ion exchangers of the Caca family
Probab=71.87 E-value=36 Score=27.19 Aligned_cols=29 Identities=24% Similarity=0.250 Sum_probs=20.3
Q ss_pred EEEEEEEecCCCCCCeEeccceEEEEecCC
Q psy10256 34 SLITLTLTDINDNAPEFASVINSASILENA 63 (96)
Q Consensus 34 ~~v~i~v~d~Ndn~P~f~~~~~~~~v~E~~ 63 (96)
...+|+|.| ||++|.|.-..-...|.|+.
T Consensus 515 s~ATVTIlD-DD~aGIfsFe~~~~sV~Es~ 543 (928)
T TIGR00845 515 NTATVTILD-DDHAGIFTFEEDVFHVSESI 543 (928)
T ss_pred ceEEEEEec-CcccCcccccCceEEEEcCC
Confidence 355677777 88899876555567788873
No 17
>PF08266 Cadherin_2: Cadherin-like; InterPro: IPR013164 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion. This entry represents a cadherin domain that is usually found at the N terminus of cadherin proteins.; PDB: 1WUZ_A 1WYJ_A.
Probab=67.35 E-value=9.3 Score=21.09 Aligned_cols=23 Identities=17% Similarity=0.278 Sum_probs=14.5
Q ss_pred eEEEEecCCCCCeeEE-EcCCCCC
Q psy10256 55 NSASILENAEIGSLST-VNATIGV 77 (96)
Q Consensus 55 ~~~~v~E~~~~g~~v~-~D~D~~~ 77 (96)
...+|+|..++|+.|+ ...|.+.
T Consensus 3 i~YsV~EE~~~Gt~IGnia~dL~l 26 (84)
T PF08266_consen 3 IRYSVPEEMPPGTVIGNIAKDLGL 26 (84)
T ss_dssp EEEEEESS--TT-EEEECCCCCT-
T ss_pred eEEEeecCCCCCCEEEEhHHhhCC
Confidence 4678999999999999 4445544
No 18
>PF13750 Big_3_3: Bacterial Ig-like domain (group 3)
Probab=66.53 E-value=28 Score=21.48 Aligned_cols=29 Identities=21% Similarity=0.299 Sum_probs=20.4
Q ss_pred cCCCEEEEEEEEEeCCCCCceEEEEEEEEEe
Q psy10256 11 EKIARYTLNAHVQDRDQPAWECVSLITLTLT 41 (96)
Q Consensus 11 E~~~~~~l~v~a~d~~~~~~~~~~~v~i~v~ 41 (96)
|....|+|++.|.|.. +..+...+.+...
T Consensus 120 e~~~~YtLtV~a~D~a--GN~~~~si~F~y~ 148 (158)
T PF13750_consen 120 EADDSYTLTVSATDKA--GNQSTKSISFSYM 148 (158)
T ss_pred CCCCeEEEEEEEEecC--CCEEEEEEEEEEe
Confidence 6778999999999987 3444555555543
No 19
>COG4288 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=54.80 E-value=30 Score=20.30 Aligned_cols=62 Identities=13% Similarity=0.139 Sum_probs=38.1
Q ss_pred EEEEEEEEEeCCCC-C----ceEEEEEEEEEecCCCCCCeEeccceEEEEecCCCCCeeEE-EcCCCCCC
Q psy10256 15 RYTLNAHVQDRDQP-A----WECVSLITLTLTDINDNAPEFASVINSASILENAEIGSLST-VNATIGVN 78 (96)
Q Consensus 15 ~~~l~v~a~d~~~~-~----~~~~~~v~i~v~d~Ndn~P~f~~~~~~~~v~E~~~~g~~v~-~D~D~~~n 78 (96)
...+++...|...- + ..+..++.+...|.+|-.+.-....|..-+..|.| .+|. +-.|.|.+
T Consensus 44 kvrI~i~fsDr~pvgpyevevaarrt~hlRfndL~dpe~iP~d~~yasviesnvP--vVvQ~tRLDsgq~ 111 (124)
T COG4288 44 KVRITILFSDREPVGPYEVEVAARRTLHLRFNDLGDPEAIPKDTPYASVIESNVP--VVVQLTRLDSGQA 111 (124)
T ss_pred eEEEEEEeccCCCCCceEEEeecceeEEEEecccCCcccCCCCCchhhheecCCc--eEEEEEEeccCCc
Confidence 44555555555432 1 13445778899999987777776777776666654 3333 66666654
No 20
>cd00146 PKD polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.
Probab=50.80 E-value=36 Score=17.76 Aligned_cols=17 Identities=24% Similarity=0.241 Sum_probs=11.0
Q ss_pred ccCCCEEEEEEEEEeCC
Q psy10256 10 REKIARYTLNAHVQDRD 26 (96)
Q Consensus 10 rE~~~~~~l~v~a~d~~ 26 (96)
+.+...|.+.+.+.|..
T Consensus 53 y~~~G~y~v~l~v~d~~ 69 (81)
T cd00146 53 YTKPGTYTVTLTVTNAV 69 (81)
T ss_pred cCCCcEEEEEEEEEeCC
Confidence 44556677777777754
No 21
>smart00089 PKD Repeats in polycystic kidney disease 1 (PKD1) and other proteins. Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.
Probab=50.62 E-value=36 Score=17.67 Aligned_cols=28 Identities=18% Similarity=0.239 Sum_probs=15.3
Q ss_pred ccCCCEEEEEEEEEeCCCCCceEEEEEEEEE
Q psy10256 10 REKIARYTLNAHVQDRDQPAWECVSLITLTL 40 (96)
Q Consensus 10 rE~~~~~~l~v~a~d~~~~~~~~~~~v~i~v 40 (96)
+++...|.+.+.+.|..+ +....+.|.|
T Consensus 51 y~~~G~y~v~l~v~n~~g---~~~~~~~i~v 78 (79)
T smart00089 51 YTKPGTYTVTLTVTNAVG---SASATVTVVV 78 (79)
T ss_pred eCCCcEEEEEEEEEcCCC---cEEEEEEEEE
Confidence 344556777777776543 3344444443
No 22
>PF02494 HYR: HYR domain; InterPro: IPR003410 This domain is known as the HYR (Hyalin Repeat) domain, after the protein hyalin that is composed exclusively of this repeat. This domain probably corresponds to a new superfamily in the immunoglobulin fold. The function of this domain is uncertain it may be involved in cell adhesion. In the Sushi repeat-containing protein (SrpX), this domain is found between two sushi repeats.
Probab=50.14 E-value=39 Score=17.94 Aligned_cols=13 Identities=38% Similarity=0.440 Sum_probs=5.9
Q ss_pred EEEEEEEecCCCC
Q psy10256 34 SLITLTLTDINDN 46 (96)
Q Consensus 34 ~~v~i~v~d~Ndn 46 (96)
..|+....|.+-|
T Consensus 59 t~V~ytA~D~~GN 71 (81)
T PF02494_consen 59 TTVTYTATDAAGN 71 (81)
T ss_pred EEEEEEEEECCCC
Confidence 3444444444443
No 23
>PF03160 Calx-beta: Calx-beta domain; InterPro: IPR003644 The calx-beta motif is present as a tandem repeat in the cytoplasmic domains of Calx Na-Ca exchangers, which are used to expel calcium from cells. This motif overlaps domains used for calcium binding and regulation. The calx-beta motif is also present in the cytoplasmic tail of mammalian integrin-beta4, which mediates the bi-directional transfer of signals across the plasma membrane, as well as in some cyanobacterial proteins. This motif contains a series of beta-strands and turns that form a self-contained beta-sheet [, ].; GO: 0007154 cell communication, 0016021 integral to membrane; PDB: 3H6A_B 3FSO_A 3FQ4_B 2DPK_A 2QVM_A 3GIN_B 2QVK_A 2FWU_A 2FWS_A 3E9U_A ....
Probab=48.31 E-value=46 Score=18.26 Aligned_cols=25 Identities=24% Similarity=0.311 Sum_probs=15.3
Q ss_pred EEEEecCCCCCCeEeccceEEEEecCC
Q psy10256 37 TLTLTDINDNAPEFASVINSASILENA 63 (96)
Q Consensus 37 ~i~v~d~Ndn~P~f~~~~~~~~v~E~~ 63 (96)
+|+|.| ||.+ .+.-..-...+.|+.
T Consensus 2 tvtI~d-~d~~-~v~f~~~~~~v~E~~ 26 (100)
T PF03160_consen 2 TVTILD-DDDP-TVSFSSPSYTVSEGD 26 (100)
T ss_dssp EEEEE--TTSE-EEEESSSEEEEETTS
T ss_pred EEEEEC-CCCC-EEEEeCCEEEEEeCC
Confidence 456666 5655 666666666777775
No 24
>TIGR01965 VCBS_repeat VCBS repeat. This domain of about 100 residues is found multiple (up to 35) copies in long proteins from several species of Vibrio, Colwellia, Bradyrhizobium, and Shewanella (hence the name VCBS) and in smaller copy numbers in proteins from several other bacteria. The large protein size and repeat copy numbers, species distribution, and suggested activities of several member proteins suggests a role for this domain in adhesion.
Probab=46.96 E-value=55 Score=18.73 Aligned_cols=40 Identities=23% Similarity=0.326 Sum_probs=25.2
Q ss_pred EEEEEEEEEeCCCCCceEEEEEEEEEecCCCCCCeEeccceEEEEecC
Q psy10256 15 RYTLNAHVQDRDQPAWECVSLITLTLTDINDNAPEFASVINSASILEN 62 (96)
Q Consensus 15 ~~~l~v~a~d~~~~~~~~~~~v~i~v~d~Ndn~P~f~~~~~~~~v~E~ 62 (96)
.-.|++.+.|+ ....|.|+|.-.||. |.... .-...+.|+
T Consensus 58 tdsFtvtv~DG------tt~~vtItI~GtNDa-pvi~~-~~~g~v~ED 97 (99)
T TIGR01965 58 TDTFTVTSADG------TSQTVTITITGANDA-AVIGG-ADTGSVTED 97 (99)
T ss_pred EEEEEEEEeCC------CeEEEEEEEEccCCC-CEEec-ccceeEecC
Confidence 34566667774 267899999999995 44332 223555554
No 25
>cd02848 Chitinase_N_term Chitinase N-terminus domain. Chitinases hydrolyze the abundant natural biopolymer chitin, producing smaller chito-oligosaccharides. Chitin consists of multiple N-acetyl-D-glucosamine (NAG) residues connected via beta-1,4-glycosidic linkages and is an important structural element of fungal cell wall and arthropod exoskeletons. On the basis of the mode of chitin hydrolysis, chitinases are classified as random, endo-, and exo-chitinases and based on sequence criteria, chitinases belong to families 18 and 19 of glycosyl hydrolases. The N-terminus of chitinase may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitob
Probab=44.85 E-value=42 Score=19.53 Aligned_cols=24 Identities=17% Similarity=0.185 Sum_probs=15.1
Q ss_pred CCccCCCEEEEEEEEEeCCCCCce
Q psy10256 8 LDREKIARYTLNAHVQDRDQPAWE 31 (96)
Q Consensus 8 LDrE~~~~~~l~v~a~d~~~~~~~ 31 (96)
+++.+-..|.+.|.+.|..++..+
T Consensus 74 ~~v~kgG~y~m~V~lCn~dGCS~S 97 (106)
T cd02848 74 FKVGKGGRYQMQVALCNGDGCSTS 97 (106)
T ss_pred EEeCCCCeEEEEEEEECCCCccCc
Confidence 345555678888888777654333
No 26
>PF08329 ChitinaseA_N: Chitinase A, N-terminal domain; InterPro: IPR013540 This domain is found in a number of bacterial chitinases and similar viral proteins. It is organised into a fibronectin III module domain-like fold, comprising only beta strands. Its function is not known, but it may be involved in interaction with the enzyme substrate, chitin [, ]. It is separated by a hinge region from the catalytic domain (IPR001223 from INTERPRO); this hinge region is probably mobile, allowing the N-terminal domain to have different relative positions in solution []. ; GO: 0004568 chitinase activity; PDB: 2WLY_A 1EDQ_A 2WM0_A 1X6N_A 1NH6_A 2WK2_A 1EHN_A 2WLZ_A 1EIB_A 1FFR_A ....
Probab=39.98 E-value=86 Score=18.99 Aligned_cols=33 Identities=15% Similarity=0.287 Sum_probs=17.5
Q ss_pred ccCCCEEEEEEEEEeCCCCCceEEEEEEEEEecCC
Q psy10256 10 REKIARYTLNAHVQDRDQPAWECVSLITLTLTDIN 44 (96)
Q Consensus 10 rE~~~~~~l~v~a~d~~~~~~~~~~~v~i~v~d~N 44 (96)
..+...|...+++.+..+ .+.+..+.|.|.|..
T Consensus 79 ~~~gG~y~~~VeLCN~~G--CS~S~~~~V~VaDTD 111 (133)
T PF08329_consen 79 VTKGGRYQMQVELCNADG--CSTSAPVEVVVADTD 111 (133)
T ss_dssp E-S-EEEEEEEEEEETTE--EEE---EEEEEE-TT
T ss_pred ecCCCEEEEEEEEECCCC--cccCCCEEEEEeCCC
Confidence 344567888888888764 344446666666653
No 27
>PF05688 DUF824: Salmonella repeat of unknown function (DUF824); InterPro: IPR008542 This family consists of a series of repeated sequences (of around 180 residues) which are found in Salmonella typhimurium, Salmonella typhi and Escherichia coli. These repeats are almost always found with this entry. The repeats are associated with RatA and RatB, the coding sequences of which are found in the pathogeneicity island of Salmonella. The sequences may be determinants of pathogenicity [, ].
Probab=38.22 E-value=53 Score=16.11 Aligned_cols=15 Identities=33% Similarity=0.430 Sum_probs=11.8
Q ss_pred EEEEEEEEecCCCCC
Q psy10256 33 VSLITLTLTDINDNA 47 (96)
Q Consensus 33 ~~~v~i~v~d~Ndn~ 47 (96)
+.+++|++.|.|-+|
T Consensus 14 ~I~ltVt~kda~G~p 28 (47)
T PF05688_consen 14 TIPLTVTVKDANGNP 28 (47)
T ss_pred eEEEEEEEECCCCCC
Confidence 567889999997655
No 28
>PF12245 Big_3_2: Bacterial Ig-like domain (group 3); InterPro: IPR022038 This family of proteins is found in bacteria. They have two conserved sequence motifs: AGN and GMT.
Probab=33.43 E-value=71 Score=16.14 Aligned_cols=29 Identities=21% Similarity=0.275 Sum_probs=17.9
Q ss_pred CCEEEEEEEEEeCCCCCceEEEEEEEEEecC
Q psy10256 13 IARYTLNAHVQDRDQPAWECVSLITLTLTDI 43 (96)
Q Consensus 13 ~~~~~l~v~a~d~~~~~~~~~~~v~i~v~d~ 43 (96)
...|.+.+.+.|.-+. .+.......+.|.
T Consensus 22 dg~yt~~v~a~D~AGN--~~~~~~~~~i~d~ 50 (60)
T PF12245_consen 22 DGEYTLTVTATDKAGN--TSSSTTQIVIVDN 50 (60)
T ss_pred CccEEEEEEEEECCCC--EEEeeeEEEEEcC
Confidence 4579999999998643 3333444444433
No 29
>PF15418 DUF4625: Domain of unknown function (DUF4625)
Probab=32.94 E-value=1.1e+02 Score=18.30 Aligned_cols=12 Identities=25% Similarity=0.365 Sum_probs=7.8
Q ss_pred EEEEEEEEEeCC
Q psy10256 15 RYTLNAHVQDRD 26 (96)
Q Consensus 15 ~~~l~v~a~d~~ 26 (96)
.|++.+.++|..
T Consensus 108 ~YH~~i~VtD~~ 119 (132)
T PF15418_consen 108 DYHFMITVTDAA 119 (132)
T ss_pred ceEEEEEEEECC
Confidence 466666666665
No 30
>KOG3597|consensus
Probab=31.23 E-value=2.1e+02 Score=20.99 Aligned_cols=54 Identities=9% Similarity=-0.024 Sum_probs=36.0
Q ss_pred EEEEEEEEEecCCCCCCeEeccceEEEEecCCC--CCeeEE--EcCCCCCCceEEEEEc
Q psy10256 32 CVSLITLTLTDINDNAPEFASVINSASILENAE--IGSLST--VNATIGVNRCGHTFRG 86 (96)
Q Consensus 32 ~~~~v~i~v~d~Ndn~P~f~~~~~~~~v~E~~~--~g~~v~--~D~D~~~n~~i~y~i~ 86 (96)
.+....|.|..+||.+..+...-+.+-+.|+.. .+..+. .|+|... ..+.|++.
T Consensus 24 ~~~~~~i~v~pvndpp~~~~~~~~~l~~~~~~~k~l~~~~l~~~d~d~~~-~~l~f~v~ 81 (442)
T KOG3597|consen 24 QTDVLRIHVNPVNDPPSLIFPSGSLLVILEGGQKVLDPELLTAADPDSAP-LPLEFQVL 81 (442)
T ss_pred EEeeecccccccCCCcceeecccceEEeecCCceeccceEeeccCCCCCc-cceEEEEc
Confidence 355678999999999998887777777777642 333333 5655544 33566663
Done!