Query psy11095
Match_columns 102
No_of_seqs 154 out of 1265
Neff 9.1
Searched_HMMs 46136
Date Fri Aug 16 17:50:22 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy11095.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/11095hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 cd00031 CA Cadherin repeat dom 99.8 2.1E-18 4.6E-23 111.4 13.4 87 7-93 44-132 (199)
2 KOG4289|consensus 99.8 1.5E-18 3.3E-23 134.5 9.8 96 5-101 315-411 (2531)
3 KOG4289|consensus 99.8 7.5E-18 1.6E-22 130.7 10.0 95 6-100 831-926 (2531)
4 KOG1219|consensus 99.7 5.7E-17 1.2E-21 129.4 11.6 92 7-101 2625-2717(4289)
5 KOG1219|consensus 99.7 4.1E-16 8.9E-21 124.7 10.3 90 4-93 997-1088(4289)
6 smart00112 CA Cadherin repeats 99.6 2.5E-14 5.5E-19 80.4 8.1 57 6-62 22-79 (79)
7 KOG1834|consensus 99.4 7.8E-13 1.7E-17 97.5 8.8 85 14-99 93-184 (952)
8 PF00028 Cadherin: Cadherin do 99.1 9.7E-10 2.1E-14 63.4 8.4 50 6-55 43-93 (93)
9 cd00031 CA Cadherin repeat dom 98.3 8.5E-06 1.8E-10 52.4 8.1 51 6-56 148-199 (199)
10 PF00028 Cadherin: Cadherin do 97.6 0.00019 4.1E-09 41.0 4.8 31 70-100 3-33 (93)
11 KOG1834|consensus 97.4 0.00044 9.4E-09 52.4 5.7 54 4-57 192-245 (952)
12 PF08758 Cadherin_pro: Cadheri 94.3 0.28 6.2E-06 28.2 5.7 39 4-43 44-82 (90)
13 PF08266 Cadherin_2: Cadherin- 90.4 0.42 9.1E-06 27.2 2.9 24 4-27 43-67 (84)
14 PF07495 Y_Y_Y: Y_Y_Y domain; 86.2 3 6.5E-05 21.8 4.5 29 26-54 37-65 (66)
15 smart00736 CADG Dystroglycan-t 74.8 13 0.00029 21.2 7.1 43 13-59 54-96 (97)
16 PF08758 Cadherin_pro: Cadheri 70.5 3 6.5E-05 24.0 1.4 32 60-91 2-34 (90)
17 TIGR03660 T1SS_rpt_143 T1SS-14 60.1 39 0.00085 21.0 7.7 60 11-75 63-127 (137)
18 KOG3597|consensus 57.1 70 0.0015 23.9 6.6 48 46-93 24-72 (442)
19 cd00146 PKD polycystic kidney 54.7 33 0.00071 18.4 4.1 20 22-41 51-70 (81)
20 PF13750 Big_3_3: Bacterial Ig 49.8 64 0.0014 20.4 5.4 30 24-55 119-148 (158)
21 smart00089 PKD Repeats in poly 47.2 44 0.00096 17.7 4.8 29 22-53 49-77 (79)
22 PF02494 HYR: HYR domain; Int 40.5 62 0.0014 17.5 4.2 26 27-54 56-81 (81)
23 PF12245 Big_3_2: Bacterial Ig 36.0 67 0.0015 16.6 5.6 29 27-57 22-50 (60)
24 PF15418 DUF4625: Domain of un 34.9 1E+02 0.0022 18.9 4.0 19 27-45 106-124 (132)
25 cd02848 Chitinase_N_term Chiti 32.0 88 0.0019 18.6 3.2 29 22-50 74-102 (106)
26 COG1881 Phospholipid-binding p 31.8 83 0.0018 20.4 3.3 21 74-94 54-74 (174)
27 PF13754 Big_3_4: Bacterial Ig 30.3 82 0.0018 15.9 5.5 31 11-43 9-39 (54)
28 PRK08577 hypothetical protein; 29.9 1.3E+02 0.0028 18.2 3.9 30 11-41 39-68 (136)
29 PF09100 Qn_am_d_aIV: Quinohem 29.7 1.4E+02 0.003 18.5 3.8 33 29-62 99-132 (133)
30 TIGR01965 VCBS_repeat VCBS rep 26.1 1.5E+02 0.0032 17.4 6.1 40 29-75 58-97 (99)
31 PF07145 PAM2: Ataxin-2 C-term 26.0 36 0.00078 13.6 0.6 11 57-67 6-16 (18)
32 TIGR00845 caca sodium/calcium 24.5 3.8E+02 0.0082 22.4 6.4 27 48-75 515-542 (928)
No 1
>cd00031 CA Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here
Probab=99.81 E-value=2.1e-18 Score=111.36 Aligned_cols=87 Identities=36% Similarity=0.502 Sum_probs=79.4
Q ss_pred cEEEC-CceeEEEcCcCCcccCCEEEEEEEEEECCCCCceeeEEEEEEEecCCCCCcEeecCcce-EEeCCCCCCcEEEE
Q psy11095 7 CYLLF-TWADIKVNHPLDYETIKEYNLTIRVENNGAQQLASEATVHIILEDVNDEIPLFTEREQE-TVLEGEPIGSKVTQ 84 (102)
Q Consensus 7 ~f~~~-~~G~i~~~~~LD~E~~~~~~l~v~a~d~~~~~~~~~~~v~i~v~d~Nd~~P~f~~~~~~-~v~E~~~~g~~v~~ 84 (102)
.|.++ .+|.|++.++||||....|.|.|.|.|.+.+.++....++|.|.|+||++|.|....|. .+.|+.++|+.+++
T Consensus 44 ~F~i~~~tG~l~~~~~lD~e~~~~~~l~v~a~D~g~~~~~~~~~v~I~V~d~Nd~~P~~~~~~~~~~v~e~~~~~~~i~~ 123 (199)
T cd00031 44 LFSIDPNTGVITTTKPLDREEQSEYTLTVVASDGGGPPLSSTATVTVTVLDVNDNPPVFEQSSYEASVPENAPPGTVVGT 123 (199)
T ss_pred cEEEeCCCCEEEECCCCCCcCCceEEEEEEEEECCcCcceeEEEEEEEEccCCCCCCcccccceEEEEeCCCCCCCEEEE
Confidence 45555 46999999999999999999999999988777778999999999999999999977665 99999999999999
Q ss_pred EEEEeCCCC
Q psy11095 85 VNAIDKDGT 93 (102)
Q Consensus 85 v~a~D~D~~ 93 (102)
+.|+|+|.+
T Consensus 124 ~~a~D~D~~ 132 (199)
T cd00031 124 VTATDADSG 132 (199)
T ss_pred EEEEcCCCC
Confidence 999999987
No 2
>KOG4289|consensus
Probab=99.78 E-value=1.5e-18 Score=134.45 Aligned_cols=96 Identities=27% Similarity=0.332 Sum_probs=86.9
Q ss_pred CccEEECCceeEEEcCcCCcccCCEEEEEEEEEECCCCCceeeEEEEEEEecCCCCCcEeecCcce-EEeCCCCCCcEEE
Q psy11095 5 FLCYLLFTWADIKVNHPLDYETIKEYNLTIRVENNGAQQLASEATVHIILEDVNDEIPLFTEREQE-TVLEGEPIGSKVT 83 (102)
Q Consensus 5 ~~~f~~~~~G~i~~~~~LD~E~~~~~~l~v~a~d~~~~~~~~~~~v~i~v~d~Nd~~P~f~~~~~~-~v~E~~~~g~~v~ 83 (102)
+.|-+..++|.|.+..+||||+...|.|.|.|+|.|.++...++.|.|.|.|.|||+|+|....|. .|.|+..++++|+
T Consensus 315 ~~f~in~rSGvI~T~a~lDRE~~~~y~L~VeAsDqG~~pgp~Ta~V~itV~D~NDNaPqFse~~Yvvqv~Edvt~~avvl 394 (2531)
T KOG4289|consen 315 NVFEINPRSGVISTRAPLDREELESYQLDVEASDQGRPPGPRTAMVEITVEDENDNAPQFSEKRYVVQVREDVTPPAVVL 394 (2531)
T ss_pred ceeEEcCccceeeccCccCHHhhhheEEEEEeccCCCCCCCceEEEEEEEEecCCCCccccccceEEEecccCCCCceEE
Confidence 456666778999999999999999999999999999888777999999999999999999999997 9999999999999
Q ss_pred EEEEEeCCCCCCCceEEe
Q psy11095 84 QVNAIDKDGTFPNNQLLD 101 (102)
Q Consensus 84 ~v~a~D~D~~~~~~~~~~ 101 (102)
+|+|+|+|.| .|+.|-|
T Consensus 395 rV~AtDrD~g-~Ng~VHY 411 (2531)
T KOG4289|consen 395 RVTATDRDKG-TNGKVHY 411 (2531)
T ss_pred EEEecccCCC-cCceEEE
Confidence 9999999999 4554433
No 3
>KOG4289|consensus
Probab=99.75 E-value=7.5e-18 Score=130.72 Aligned_cols=95 Identities=27% Similarity=0.349 Sum_probs=85.9
Q ss_pred ccEEECCceeEEEcCcCCcccCCEEEEEEEEEECCCCCceeeEEEEEEEecCCCCCcEeecCcce-EEeCCCCCCcEEEE
Q psy11095 6 LCYLLFTWADIKVNHPLDYETIKEYNLTIRVENNGAQQLASEATVHIILEDVNDEIPLFTEREQE-TVLEGEPIGSKVTQ 84 (102)
Q Consensus 6 ~~f~~~~~G~i~~~~~LD~E~~~~~~l~v~a~d~~~~~~~~~~~v~i~v~d~Nd~~P~f~~~~~~-~v~E~~~~g~~v~~ 84 (102)
.|++.+.+|.|++...||||....|.|.+.|.|.+.|+++...+|+|.|+|+|||+|.|.+..+. .|.|++++|..+.+
T Consensus 831 ~F~IEptSGviRtl~rLdRE~~avy~L~a~avDrg~p~ls~~~eItvtvldvNDnaPvfe~~e~e~~I~enspvgs~va~ 910 (2531)
T KOG4289|consen 831 DFYIEPTSGVIRTLRRLDRENVAVYVLAAYAVDRGNPPLSAPVEITVTVLDVNDNAPVFEQDELELFIEENSPVGSVVAL 910 (2531)
T ss_pred ceEEccCcceeehhhhhcchheeEEEEEEEEeeCCCCCcCCceEEEEEEEecCCCCCCCCCcceeeEEeecCccceeeEE
Confidence 44555556999999999999999999999999999999999999999999999999999999986 99999999999999
Q ss_pred EEEEeCCCCCCCceEE
Q psy11095 85 VNAIDKDGTFPNNQLL 100 (102)
Q Consensus 85 v~a~D~D~~~~~~~~~ 100 (102)
+.|.|||.|+..-..|
T Consensus 911 i~a~dpdEG~NA~IsY 926 (2531)
T KOG4289|consen 911 ITADDPDEGPNAHISY 926 (2531)
T ss_pred EEccCCCcCCcceEEE
Confidence 9999999994444444
No 4
>KOG1219|consensus
Probab=99.73 E-value=5.7e-17 Score=129.40 Aligned_cols=92 Identities=29% Similarity=0.403 Sum_probs=81.2
Q ss_pred cEEECCceeEEEcCcCCcccCCEEEEEEEEEECCCCCceeeEEEEEEEecCCCCCcEeecCcce-EEeCCCCCCcEEEEE
Q psy11095 7 CYLLFTWADIKVNHPLDYETIKEYNLTIRVENNGAQQLASEATVHIILEDVNDEIPLFTEREQE-TVLEGEPIGSKVTQV 85 (102)
Q Consensus 7 ~f~~~~~G~i~~~~~LD~E~~~~~~l~v~a~d~~~~~~~~~~~v~i~v~d~Nd~~P~f~~~~~~-~v~E~~~~g~~v~~v 85 (102)
|.+.+++|.|.+.++||+|+.++|.+.|.|.+.+. .-....|.|.|.|+|||+|.|....|. .+.|+++.|+.|.++
T Consensus 2625 Fsvdr~TG~i~v~ksLD~E~kk~yqi~v~a~~~~~--vva~tsv~vqVkDvNDNaPvFe~d~y~f~i~En~pvGtsV~qf 2702 (4289)
T KOG1219|consen 2625 FSVDRNTGMIKVNKSLDHEKKKSYQIKVKATCGQW--VVAETSVFVQVKDVNDNAPVFEKDPYLFIIEENSPVGTSVIQF 2702 (4289)
T ss_pred eEEcCCCceEEeccccchhhhceEEEEEEeecCCc--eEEEEEEEEEeecccCCCccccCCceeEEEeccCCCCceEEEE
Confidence 44445679999999999999999999999998765 567888999999999999999999986 899999999999999
Q ss_pred EEEeCCCCCCCceEEe
Q psy11095 86 NAIDKDGTFPNNQLLD 101 (102)
Q Consensus 86 ~a~D~D~~~~~~~~~~ 101 (102)
+|.|+|.+ ++|++-+
T Consensus 2703 ~AsD~Ds~-~nGqiry 2717 (4289)
T KOG1219|consen 2703 HASDMDSG-NNGQIRY 2717 (4289)
T ss_pred EeeccCCC-CCceEEE
Confidence 99999999 5555543
No 5
>KOG1219|consensus
Probab=99.67 E-value=4.1e-16 Score=124.68 Aligned_cols=90 Identities=29% Similarity=0.352 Sum_probs=83.9
Q ss_pred CCccEEEC-CceeEEEcCcCCcccCCEEEEEEEEEECCCCCceeeEEEEEEEecCCCCCcEeecCcce-EEeCCCCCCcE
Q psy11095 4 NFLCYLLF-TWADIKVNHPLDYETIKEYNLTIRVENNGAQQLASEATVHIILEDVNDEIPLFTEREQE-TVLEGEPIGSK 81 (102)
Q Consensus 4 ~~~~f~~~-~~G~i~~~~~LD~E~~~~~~l~v~a~d~~~~~~~~~~~v~i~v~d~Nd~~P~f~~~~~~-~v~E~~~~g~~ 81 (102)
++..|.++ .+|.|++.++||||..+.|.|++.|+|.|.+++++-+.+.|.|.|+|||+|+|.++.|. +|.|+++.+..
T Consensus 997 g~g~FsId~~tG~irTl~~lDrE~ks~YwltveA~D~gt~~~ssv~~vyI~ieDvNDn~Pq~s~pvy~asI~enSp~~vs 1076 (4289)
T KOG1219|consen 997 GDGIFSIDSTTGSIRTLKALDREKKSSYWLTVEAKDLGTVPLSSVCEVYIEIEDVNDNVPQFSSPVYYASISENSPETVS 1076 (4289)
T ss_pred cceeEEecCCcceEeechhhchhhcceEEEEEEEEecCCCccccceeEEEEEEecCCCCcccCCceEeeeeccCCCCceE
Confidence 45566666 57999999999999999999999999999999999999999999999999999999876 99999999999
Q ss_pred EEEEEEEeCCCC
Q psy11095 82 VTQVNAIDKDGT 93 (102)
Q Consensus 82 v~~v~a~D~D~~ 93 (102)
|.++.|.|+|..
T Consensus 1077 ivq~ea~D~Dss 1088 (4289)
T KOG1219|consen 1077 IVQAEANDPDSS 1088 (4289)
T ss_pred EEEeccCCCCcc
Confidence 999999999965
No 6
>smart00112 CA Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
Probab=99.57 E-value=2.5e-14 Score=80.45 Aligned_cols=57 Identities=35% Similarity=0.515 Sum_probs=51.9
Q ss_pred ccEEEC-CceeEEEcCcCCcccCCEEEEEEEEEECCCCCceeeEEEEEEEecCCCCCc
Q psy11095 6 LCYLLF-TWADIKVNHPLDYETIKEYNLTIRVENNGAQQLASEATVHIILEDVNDEIP 62 (102)
Q Consensus 6 ~~f~~~-~~G~i~~~~~LD~E~~~~~~l~v~a~d~~~~~~~~~~~v~i~v~d~Nd~~P 62 (102)
.+|.++ .+|.|++.++||||....|.|.|.|.|.+.+++++.+.|+|.|.|+||++|
T Consensus 22 ~~F~i~~~tg~i~~~~~LD~e~~~~y~l~v~a~D~~~~~~~~~~~v~I~V~D~Nd~~P 79 (79)
T smart00112 22 GLFSIDPETGEITTTKPLDREEQPEYTLTVEATDGGGPPLSSTATVTVTVLDVNDNAP 79 (79)
T ss_pred CEEEEeCCccEEEeCCccCeeCCCeEEEEEEEEECCCCCcccEEEEEEEEEECCCCCC
Confidence 455555 569999999999999999999999999999889999999999999999998
No 7
>KOG1834|consensus
Probab=99.43 E-value=7.8e-13 Score=97.55 Aligned_cols=85 Identities=36% Similarity=0.520 Sum_probs=76.0
Q ss_pred eeEEEcCcCCcccCCEEEEEEEEEECCC-C-----CceeeEEEEEEEecCCCCCcEeecCcce-EEeCCCCCCcEEEEEE
Q psy11095 14 ADIKVNHPLDYETIKEYNLTIRVENNGA-Q-----QLASEATVHIILEDVNDEIPLFTEREQE-TVLEGEPIGSKVTQVN 86 (102)
Q Consensus 14 G~i~~~~~LD~E~~~~~~l~v~a~d~~~-~-----~~~~~~~v~i~v~d~Nd~~P~f~~~~~~-~v~E~~~~g~~v~~v~ 86 (102)
|.|+.+.+||.|.++.|+|+|+|.|++. | ..+..++|+|+|.|+|..+|.|..+.|. .|.|+ ++...|+++.
T Consensus 93 gvlRaK~~lDCelqkeytf~iQAydCg~gpdgtn~kKShkatvhIrVkDvNe~AP~f~ep~Yka~V~EG-K~yd~il~ve 171 (952)
T KOG1834|consen 93 GVLRAKEPLDCELQKEYTFTIQAYDCGNGPDGTNTKKSHKATVHIRVKDVNEFAPVFKEPWYKAHVTEG-KVYDSILRVE 171 (952)
T ss_pred eEEeecCcccccccccceEEEEEEecCCCCCccccccccceEEEEEeccccccCchhcccceeeEEecc-eeeeeeEEEE
Confidence 7899999999999999999999999874 3 4678899999999999999999999997 77776 6677899999
Q ss_pred EEeCCCCCCCceE
Q psy11095 87 AIDKDGTFPNNQL 99 (102)
Q Consensus 87 a~D~D~~~~~~~~ 99 (102)
|.|.|.++..+++
T Consensus 172 AiD~DCspq~sqI 184 (952)
T KOG1834|consen 172 AIDKDCSPQYSQI 184 (952)
T ss_pred eecCCCCCcccce
Confidence 9999999887776
No 8
>PF00028 Cadherin: Cadherin domain; InterPro: IPR002126 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion.; GO: 0005509 calcium ion binding, 0007156 homophilic cell adhesion, 0016020 membrane; PDB: 2A4E_A 2A4C_B 2O72_A 2QVI_A 1NCJ_A 3Q2W_A 3Q2N_A 3LNH_B 3LNI_A 3Q2L_A ....
Probab=99.12 E-value=9.7e-10 Score=63.38 Aligned_cols=50 Identities=36% Similarity=0.477 Sum_probs=43.9
Q ss_pred ccEEECCceeEEEcCcCCcccCCEEEEEEEEEEC-CCCCceeeEEEEEEEe
Q psy11095 6 LCYLLFTWADIKVNHPLDYETIKEYNLTIRVENN-GAQQLASEATVHIILE 55 (102)
Q Consensus 6 ~~f~~~~~G~i~~~~~LD~E~~~~~~l~v~a~d~-~~~~~~~~~~v~i~v~ 55 (102)
.|.+.+.+|.|+++++||||....|.|.|.|+|. +.+++++.+.|+|.|.
T Consensus 43 ~F~I~~~tg~i~~~~~LD~E~~~~y~l~v~a~D~~~~~~~~~~~~V~I~V~ 93 (93)
T PF00028_consen 43 LFSIDPNTGEISLKKPLDRETQSSYQLTVRATDSGGSPPLSSTATVTINVL 93 (93)
T ss_dssp SEEEETTTTEEEESSSSCTTTTSEEEEEEEEEETTTSSEEEEEEEEEEEEE
T ss_pred ceEEeeeeeccccceecCcccCCEEEEEEEEEECCCCCCCEEEEEEEEEEC
Confidence 4444455799999999999999999999999999 7888999999999874
No 9
>cd00031 CA Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here
Probab=98.27 E-value=8.5e-06 Score=52.36 Aligned_cols=51 Identities=29% Similarity=0.391 Sum_probs=44.2
Q ss_pred ccEEECC-ceeEEEcCcCCcccCCEEEEEEEEEECCCCCceeeEEEEEEEec
Q psy11095 6 LCYLLFT-WADIKVNHPLDYETIKEYNLTIRVENNGAQQLASEATVHIILED 56 (102)
Q Consensus 6 ~~f~~~~-~G~i~~~~~LD~E~~~~~~l~v~a~d~~~~~~~~~~~v~i~v~d 56 (102)
..|.++. +|.|++.++||||....|.|.|.|.|.+.+.++....++|.+.|
T Consensus 148 ~~f~i~~~~G~i~~~~~ld~e~~~~~~l~v~a~D~~~~~~~~~~~i~i~v~d 199 (199)
T cd00031 148 ELFSIDPNTGIITLAKPLDREEKSSYELTVVATDGGGPPLSSTATVTVTVLD 199 (199)
T ss_pred CEEEEeCCceEEEeCCccCCccCceEEEEEEEEECCCCCceeEEEEEEEEEC
Confidence 3556655 79999999999999999999999999987778888889988875
No 10
>PF00028 Cadherin: Cadherin domain; InterPro: IPR002126 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion.; GO: 0005509 calcium ion binding, 0007156 homophilic cell adhesion, 0016020 membrane; PDB: 2A4E_A 2A4C_B 2O72_A 2QVI_A 1NCJ_A 3Q2W_A 3Q2N_A 3LNH_B 3LNI_A 3Q2L_A ....
Probab=97.59 E-value=0.00019 Score=41.01 Aligned_cols=31 Identities=32% Similarity=0.288 Sum_probs=25.3
Q ss_pred eEEeCCCCCCcEEEEEEEEeCCCCCCCceEE
Q psy11095 70 ETVLEGEPIGSKVTQVNAIDKDGTFPNNQLL 100 (102)
Q Consensus 70 ~~v~E~~~~g~~v~~v~a~D~D~~~~~~~~~ 100 (102)
..|+|+.++|+.++++.|.|+|.+..+...|
T Consensus 3 ~~v~E~~~~g~~v~~v~a~D~D~~~n~~i~y 33 (93)
T PF00028_consen 3 FSVPENAPPGTVVGQVTATDPDSGPNSQITY 33 (93)
T ss_dssp EEEETTGSTSSEEEEEEEEESSTSTTSSEEE
T ss_pred EEEECCCCCCCEEEEEEEEeCCCCCCceEEE
Confidence 3899999999999999999999764444433
No 11
>KOG1834|consensus
Probab=97.41 E-value=0.00044 Score=52.40 Aligned_cols=54 Identities=19% Similarity=0.305 Sum_probs=48.6
Q ss_pred CCccEEECCceeEEEcCcCCcccCCEEEEEEEEEECCCCCceeeEEEEEEEecC
Q psy11095 4 NFLCYLLFTWADIKVNHPLDYETIKEYNLTIRVENNGAQQLASEATVHIILEDV 57 (102)
Q Consensus 4 ~~~~f~~~~~G~i~~~~~LD~E~~~~~~l~v~a~d~~~~~~~~~~~v~i~v~d~ 57 (102)
-+..|.|++.|.|+.+.+|.|.....|.|+|+|.|+|.....+...|+|+|...
T Consensus 192 ~d~PFaIdn~G~irnTekLny~ke~~Y~ltVtAyDCg~kraa~d~lV~v~Vkp~ 245 (952)
T KOG1834|consen 192 PDVPFAIDNDGNIRNTEKLNYTKEHQYKLTVTAYDCGKKRAASDSLVTVHVKPT 245 (952)
T ss_pred CCCceEEcCCCccccccccccccceeEEEEEEEEecccccccCcceEEEEecCc
Confidence 467899999999999999999999999999999999987766778899988776
No 12
>PF08758 Cadherin_pro: Cadherin prodomain like; InterPro: IPR014868 Cadherins are a group of proteins that mediate calcium dependent cell-cell adhesion. They are activated through cleavage of a prosequence in the late Golgi. This protein corresponds to the folded region of the prosequence, and is termed the prodomain. The prodomain shows structural resemblance to the cadherin domain, but lacks all the features known to be important for cadherin-cadherin interactions []. ; GO: 0007155 cell adhesion, 0016021 integral to membrane; PDB: 1OP4_A.
Probab=94.30 E-value=0.28 Score=28.24 Aligned_cols=39 Identities=8% Similarity=0.146 Sum_probs=26.8
Q ss_pred CCccEEECCceeEEEcCcCCcccCCEEEEEEEEEECCCCC
Q psy11095 4 NFLCYLLFTWADIKVNHPLDYETIKEYNLTIRVENNGAQQ 43 (102)
Q Consensus 4 ~~~~f~~~~~G~i~~~~~LD~E~~~~~~l~v~a~d~~~~~ 43 (102)
++..|.+..+|.|++++++...... -.+.|.|+|..+..
T Consensus 44 sDpdF~V~~DGsVy~~r~v~l~~~~-~~F~V~a~D~~~~~ 82 (90)
T PF08758_consen 44 SDPDFRVLEDGSVYAKRPVQLSSEQ-RSFTVHAWDSQTQE 82 (90)
T ss_dssp --SEEEEETTTEEEEES--S-SSS--EEEEEEEEETTTTE
T ss_pred CCCCEEEcCCCeEEEeeeEecCCCc-eEEEEEEECCCCCe
Confidence 4568999999999999998765433 47889999977653
No 13
>PF08266 Cadherin_2: Cadherin-like; InterPro: IPR013164 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion. This entry represents a cadherin domain that is usually found at the N terminus of cadherin proteins.; PDB: 1WUZ_A 1WYJ_A.
Probab=90.37 E-value=0.42 Score=27.17 Aligned_cols=24 Identities=21% Similarity=0.243 Sum_probs=15.9
Q ss_pred CCccEEECC-ceeEEEcCcCCcccC
Q psy11095 4 NFLCYLLFT-WADIKVNHPLDYETI 27 (102)
Q Consensus 4 ~~~~f~~~~-~G~i~~~~~LD~E~~ 27 (102)
+..+|.++. +|.|++...+|||.-
T Consensus 43 ~~~~~~v~~~tG~L~v~~rIDRE~L 67 (84)
T PF08266_consen 43 NSQYFRVNEKTGDLFVSERIDREEL 67 (84)
T ss_dssp SS-SEEE-TTTSEEEESS--SCCCC
T ss_pred CcceeEecCCceeEEeCCccCHHHH
Confidence 345666665 799999999999984
No 14
>PF07495 Y_Y_Y: Y_Y_Y domain; InterPro: IPR011123 This region is mostly found at the end of the beta propellers (IPR011110 from INTERPRO) in a family of two component regulators. However they are also found tandemly repeated in Q891H4 from SWISSPROT without other signal conduction domains being present. It is named after the conserved tyrosines found in the alignment. The exact function is not known.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=86.20 E-value=3 Score=21.77 Aligned_cols=29 Identities=21% Similarity=0.421 Sum_probs=18.7
Q ss_pred cCCEEEEEEEEEECCCCCceeeEEEEEEE
Q psy11095 26 TIKEYNLTIRVENNGAQQLASEATVHIIL 54 (102)
Q Consensus 26 ~~~~~~l~v~a~d~~~~~~~~~~~v~i~v 54 (102)
....|.|.|.|.+..+........+.|.|
T Consensus 37 ~~G~Y~l~V~a~~~~~~~~~~~~~l~i~I 65 (66)
T PF07495_consen 37 PPGKYTLEVRAKDNNGKWSSDEKSLTITI 65 (66)
T ss_dssp -SEEEEEEEEEEETTS-B-SS-EEEEEEE
T ss_pred CCEEEEEEEEEECCCCCcCcccEEEEEEE
Confidence 35689999999998776544435666655
No 15
>smart00736 CADG Dystroglycan-type cadherin-like domains. Cadherin-homologous domains present in metazoan dystroglycans and alpha/epsilon sarcoglycans, yeast Axl2p and in a very large protein from magnetotactic bacteria. Likely to bind calcium ions.
Probab=74.82 E-value=13 Score=21.15 Aligned_cols=43 Identities=14% Similarity=0.288 Sum_probs=28.6
Q ss_pred ceeEEEcCcCCcccCCEEEEEEEEEECCCCCceeeEEEEEEEecCCC
Q psy11095 13 WADIKVNHPLDYETIKEYNLTIRVENNGAQQLASEATVHIILEDVND 59 (102)
Q Consensus 13 ~G~i~~~~~LD~E~~~~~~l~v~a~d~~~~~~~~~~~v~i~v~d~Nd 59 (102)
++.++- .|..-+ ...|.+.+.|+|..+ .+....++|.|.+.|+
T Consensus 54 ~~~~~G-tP~~~~-~g~~~i~v~a~D~~g--~~~~~~f~i~V~~~~~ 96 (97)
T smart00736 54 TGTLSG-TPTNSD-VGSLSLKVTATDSSG--ASASDTFTITVVNTND 96 (97)
T ss_pred CCEEEE-ECCCCC-CcEEEEEEEEEECCC--CEEEEEEEEEEeCCCC
Confidence 344444 244333 456999999999775 4456678888888764
No 16
>PF08758 Cadherin_pro: Cadherin prodomain like; InterPro: IPR014868 Cadherins are a group of proteins that mediate calcium dependent cell-cell adhesion. They are activated through cleavage of a prosequence in the late Golgi. This protein corresponds to the folded region of the prosequence, and is termed the prodomain. The prodomain shows structural resemblance to the cadherin domain, but lacks all the features known to be important for cadherin-cadherin interactions []. ; GO: 0007155 cell adhesion, 0016021 integral to membrane; PDB: 1OP4_A.
Probab=70.50 E-value=3 Score=23.97 Aligned_cols=32 Identities=22% Similarity=0.211 Sum_probs=11.3
Q ss_pred CCcEeecCcce-EEeCCCCCCcEEEEEEEEeCC
Q psy11095 60 EIPLFTEREQE-TVLEGEPIGSKVTQVNAIDKD 91 (102)
Q Consensus 60 ~~P~f~~~~~~-~v~E~~~~g~~v~~v~a~D~D 91 (102)
+.|-|.+..|. .|+.+...|+.|++|.-.|..
T Consensus 2 C~pGF~~~~~~~~Vp~~l~~g~~lg~V~f~dC~ 34 (90)
T PF08758_consen 2 CRPGFSQKKYTFEVPSNLEAGQPLGKVNFEDCT 34 (90)
T ss_dssp ---B--S-EEEE----SS-SS--EEE---B--S
T ss_pred CcCCcccceEEEEcCchhhCCcEEEEEEeccCC
Confidence 56888888775 899889999999988877763
No 17
>TIGR03660 T1SS_rpt_143 T1SS-143 repeat domain. This model represents a domain of about 143 amino acids that may occur singly or in up to 23 tandem repeats in very large proteins in the genus Vibrio, and in related species such as Legionella pneumophila, Photobacterium profundum, Rhodopseudomonas palustris, Shewanella pealeana, and Aeromonas hydrophila. Proteins with these domains represent a subset of a broader set of proteins with a particular signal for type 1 secretion, consisting of several glycine-rich repeats modeled by pfam00353, followed by a C-terminal domain modeled by TIGR03661. Proteins with this domain tend to share several properties with the RtxA (Repeats in Toxin) protein of Vibrio cholerae, including a large size often containing tandemly repeated domains and a C-terminal signal for type 1 secretion.
Probab=60.12 E-value=39 Score=20.97 Aligned_cols=60 Identities=20% Similarity=0.262 Sum_probs=37.1
Q ss_pred CCce--eEEEcCcCCccc---CCEEEEEEEEEECCCCCceeeEEEEEEEecCCCCCcEeecCcceEEeCC
Q psy11095 11 FTWA--DIKVNHPLDYET---IKEYNLTIRVENNGAQQLASEATVHIILEDVNDEIPLFTEREQETVLEG 75 (102)
Q Consensus 11 ~~~G--~i~~~~~LD~E~---~~~~~l~v~a~d~~~~~~~~~~~v~i~v~d~Nd~~P~f~~~~~~~v~E~ 75 (102)
+.+| ..++.++||... .-...|.|.|+|..+.... ..+.|.|.| -.|......-..|.|.
T Consensus 63 ~~~GsYtftL~~~lDH~~g~d~l~l~~~v~a~D~DGD~s~--~~l~VtI~D---D~P~~~~~~~~~V~E~ 127 (137)
T TIGR03660 63 NADGSYEFTLEGPLDHAAGSDELTLNFPIIATDFDGDTSS--ITLPVTIVD---DVPTITDVDALTVDED 127 (137)
T ss_pred CCCccEEEEEcccccCCCCCceEEEeeeEEEEeCCCCccc--cEEEEEEEC---CCCeeccccceEEecc
Confidence 3455 467778898744 3356778888887665433 366777766 3476655433466663
No 18
>KOG3597|consensus
Probab=57.11 E-value=70 Score=23.95 Aligned_cols=48 Identities=23% Similarity=0.234 Sum_probs=32.7
Q ss_pred eeEEEEEEEecCCCCCcEeecCcce-EEeCCCCCCcEEEEEEEEeCCCC
Q psy11095 46 SEATVHIILEDVNDEIPLFTEREQE-TVLEGEPIGSKVTQVNAIDKDGT 93 (102)
Q Consensus 46 ~~~~v~i~v~d~Nd~~P~f~~~~~~-~v~E~~~~g~~v~~v~a~D~D~~ 93 (102)
......|.+..+||.+..+....+. -+.|+..--.....+.+.|+|..
T Consensus 24 ~~~~~~i~v~pvndpp~~~~~~~~~l~~~~~~~k~l~~~~l~~~d~d~~ 72 (442)
T KOG3597|consen 24 QTDVLRIHVNPVNDPPSLIFPSGSLLVILEGGQKVLDPELLTAADPDSA 72 (442)
T ss_pred EEeeecccccccCCCcceeecccceEEeecCCceeccceEeeccCCCCC
Confidence 5667889999999887766665554 66665433333456778888865
No 19
>cd00146 PKD polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.
Probab=54.65 E-value=33 Score=18.39 Aligned_cols=20 Identities=25% Similarity=0.356 Sum_probs=14.8
Q ss_pred CCcccCCEEEEEEEEEECCC
Q psy11095 22 LDYETIKEYNLTIRVENNGA 41 (102)
Q Consensus 22 LD~E~~~~~~l~v~a~d~~~ 41 (102)
..|.+...|.+++.+.|..+
T Consensus 51 ~~y~~~G~y~v~l~v~d~~g 70 (81)
T cd00146 51 HTYTKPGTYTVTLTVTNAVG 70 (81)
T ss_pred EEcCCCcEEEEEEEEEeCCC
Confidence 34566778999999988654
No 20
>PF13750 Big_3_3: Bacterial Ig-like domain (group 3)
Probab=49.80 E-value=64 Score=20.36 Aligned_cols=30 Identities=17% Similarity=0.297 Sum_probs=20.4
Q ss_pred cccCCEEEEEEEEEECCCCCceeeEEEEEEEe
Q psy11095 24 YETIKEYNLTIRVENNGAQQLASEATVHIILE 55 (102)
Q Consensus 24 ~E~~~~~~l~v~a~d~~~~~~~~~~~v~i~v~ 55 (102)
.|....|.|+|.|.|..+. .....+.+...
T Consensus 119 le~~~~YtLtV~a~D~aGN--~~~~si~F~y~ 148 (158)
T PF13750_consen 119 LEADDSYTLTVSATDKAGN--QSTKSISFSYM 148 (158)
T ss_pred cCCCCeEEEEEEEEecCCC--EEEEEEEEEEe
Confidence 3677899999999997764 33444444433
No 21
>smart00089 PKD Repeats in polycystic kidney disease 1 (PKD1) and other proteins. Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.
Probab=47.15 E-value=44 Score=17.74 Aligned_cols=29 Identities=28% Similarity=0.437 Sum_probs=18.5
Q ss_pred CCcccCCEEEEEEEEEECCCCCceeeEEEEEE
Q psy11095 22 LDYETIKEYNLTIRVENNGAQQLASEATVHII 53 (102)
Q Consensus 22 LD~E~~~~~~l~v~a~d~~~~~~~~~~~v~i~ 53 (102)
.-|+....|.+++.+.|..+ +....+.|.
T Consensus 49 ~~y~~~G~y~v~l~v~n~~g---~~~~~~~i~ 77 (79)
T smart00089 49 HTYTKPGTYTVTLTVTNAVG---SASATVTVV 77 (79)
T ss_pred EEeCCCcEEEEEEEEEcCCC---cEEEEEEEE
Confidence 34566778888888888665 334444443
No 22
>PF02494 HYR: HYR domain; InterPro: IPR003410 This domain is known as the HYR (Hyalin Repeat) domain, after the protein hyalin that is composed exclusively of this repeat. This domain probably corresponds to a new superfamily in the immunoglobulin fold. The function of this domain is uncertain it may be involved in cell adhesion. In the Sushi repeat-containing protein (SrpX), this domain is found between two sushi repeats.
Probab=40.46 E-value=62 Score=17.54 Aligned_cols=26 Identities=4% Similarity=0.156 Sum_probs=16.6
Q ss_pred CCEEEEEEEEEECCCCCceeeEEEEEEE
Q psy11095 27 IKEYNLTIRVENNGAQQLASEATVHIIL 54 (102)
Q Consensus 27 ~~~~~l~v~a~d~~~~~~~~~~~v~i~v 54 (102)
...|.++..|.|..+ ....|.+.|.|
T Consensus 56 ~G~t~V~ytA~D~~G--N~a~C~f~V~V 81 (81)
T PF02494_consen 56 VGTTTVTYTATDAAG--NSATCSFTVTV 81 (81)
T ss_pred eceEEEEEEEEECCC--CEEEEEEEEEC
Confidence 346788888888655 34466665543
No 23
>PF12245 Big_3_2: Bacterial Ig-like domain (group 3); InterPro: IPR022038 This family of proteins is found in bacteria. They have two conserved sequence motifs: AGN and GMT.
Probab=36.05 E-value=67 Score=16.65 Aligned_cols=29 Identities=28% Similarity=0.447 Sum_probs=19.3
Q ss_pred CCEEEEEEEEEECCCCCceeeEEEEEEEecC
Q psy11095 27 IKEYNLTIRVENNGAQQLASEATVHIILEDV 57 (102)
Q Consensus 27 ~~~~~l~v~a~d~~~~~~~~~~~v~i~v~d~ 57 (102)
...|.+.+.+.|..+.. ........+.|.
T Consensus 22 dg~yt~~v~a~D~AGN~--~~~~~~~~i~d~ 50 (60)
T PF12245_consen 22 DGEYTLTVTATDKAGNT--SSSTTQIVIVDN 50 (60)
T ss_pred CccEEEEEEEEECCCCE--EEeeeEEEEEcC
Confidence 66899999999987643 333444444444
No 24
>PF15418 DUF4625: Domain of unknown function (DUF4625)
Probab=34.90 E-value=1e+02 Score=18.95 Aligned_cols=19 Identities=21% Similarity=0.410 Sum_probs=13.0
Q ss_pred CCEEEEEEEEEECCCCCce
Q psy11095 27 IKEYNLTIRVENNGAQQLA 45 (102)
Q Consensus 27 ~~~~~l~v~a~d~~~~~~~ 45 (102)
...|.|.+.+.|..+....
T Consensus 106 ~G~YH~~i~VtD~~Gn~~~ 124 (132)
T PF15418_consen 106 AGDYHFMITVTDAAGNQTE 124 (132)
T ss_pred CcceEEEEEEEECCCCEEE
Confidence 3568888888887664433
No 25
>cd02848 Chitinase_N_term Chitinase N-terminus domain. Chitinases hydrolyze the abundant natural biopolymer chitin, producing smaller chito-oligosaccharides. Chitin consists of multiple N-acetyl-D-glucosamine (NAG) residues connected via beta-1,4-glycosidic linkages and is an important structural element of fungal cell wall and arthropod exoskeletons. On the basis of the mode of chitin hydrolysis, chitinases are classified as random, endo-, and exo-chitinases and based on sequence criteria, chitinases belong to families 18 and 19 of glycosyl hydrolases. The N-terminus of chitinase may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitob
Probab=32.01 E-value=88 Score=18.64 Aligned_cols=29 Identities=7% Similarity=0.165 Sum_probs=16.9
Q ss_pred CCcccCCEEEEEEEEEECCCCCceeeEEE
Q psy11095 22 LDYETIKEYNLTIRVENNGAQQLASEATV 50 (102)
Q Consensus 22 LD~E~~~~~~l~v~a~d~~~~~~~~~~~v 50 (102)
+++.+-..|.++|.+.|..+...+....|
T Consensus 74 ~~v~kgG~y~m~V~lCn~dGCS~S~~~~I 102 (106)
T cd02848 74 FKVGKGGRYQMQVALCNGDGCSTSAAKEI 102 (106)
T ss_pred EEeCCCCeEEEEEEEECCCCccCcCCEEE
Confidence 34555667777777777655444444333
No 26
>COG1881 Phospholipid-binding protein [General function prediction only]
Probab=31.76 E-value=83 Score=20.39 Aligned_cols=21 Identities=29% Similarity=0.374 Sum_probs=17.4
Q ss_pred CCCCCCcEEEEEEEEeCCCCC
Q psy11095 74 EGEPIGSKVTQVNAIDKDGTF 94 (102)
Q Consensus 74 E~~~~g~~v~~v~a~D~D~~~ 94 (102)
++.+.|+.-+.+...|+|+++
T Consensus 54 s~~P~~tkS~AL~v~DpDAP~ 74 (174)
T COG1881 54 SGVPEGTKSFALTVDDPDAPT 74 (174)
T ss_pred cCCCCCCeeEEEEEECCCCCC
Confidence 356778888899999999984
No 27
>PF13754 Big_3_4: Bacterial Ig-like domain (group 3)
Probab=30.32 E-value=82 Score=15.89 Aligned_cols=31 Identities=10% Similarity=0.104 Sum_probs=21.8
Q ss_pred CCceeEEEcCcCCcccCCEEEEEEEEEECCCCC
Q psy11095 11 FTWADIKVNHPLDYETIKEYNLTIRVENNGAQQ 43 (102)
Q Consensus 11 ~~~G~i~~~~~LD~E~~~~~~l~v~a~d~~~~~ 43 (102)
+..|.-.+.-+.. ....|.+++.|.|..+..
T Consensus 9 ~~~G~Ws~t~~~~--~dG~y~itv~a~D~AGN~ 39 (54)
T PF13754_consen 9 DSDGNWSFTVPAL--ADGTYTITVTATDAAGNT 39 (54)
T ss_pred CCCCcEEEeCCCC--CCccEEEEEEEEeCCCCC
Confidence 4456666655544 577899999999977653
No 28
>PRK08577 hypothetical protein; Provisional
Probab=29.87 E-value=1.3e+02 Score=18.21 Aligned_cols=30 Identities=17% Similarity=0.245 Sum_probs=20.5
Q ss_pred CCceeEEEcCcCCcccCCEEEEEEEEEECCC
Q psy11095 11 FTWADIKVNHPLDYETIKEYNLTIRVENNGA 41 (102)
Q Consensus 11 ~~~G~i~~~~~LD~E~~~~~~l~v~a~d~~~ 41 (102)
...|+|.+ .|+-.+....|.+.+.+.|..+
T Consensus 39 ~~~~~~~~-~~~~~~~k~~~~I~V~~~Dr~G 68 (136)
T PRK08577 39 TDKKEIHL-EPIALPGKKLVEIELVVEDRPG 68 (136)
T ss_pred CCCCEEEE-EEcCCCCccEEEEEEEEcCCCC
Confidence 33477777 3665566677888888887654
No 29
>PF09100 Qn_am_d_aIV: Quinohemoprotein amine dehydrogenase, alpha subunit domain IV; InterPro: IPR015184 This domain is predominantly found in the prokaryotic protein quinohemoprotein amine dehydrogenase, adopting an immunoglobulin-like beta-sandwich fold, with seven strands arranged into two beta sheets; the fold is possibly related to the immunoglobulin and/or fibronectin type III superfamilies. The precise function of this domain has not, as yet, been defined []. ; PDB: 1JMZ_A 1JMX_A 1PBY_A 1JJU_A.
Probab=29.69 E-value=1.4e+02 Score=18.55 Aligned_cols=33 Identities=24% Similarity=0.370 Sum_probs=15.9
Q ss_pred EEEEEEEEEE-CCCCCceeeEEEEEEEecCCCCCc
Q psy11095 29 EYNLTIRVEN-NGAQQLASEATVHIILEDVNDEIP 62 (102)
Q Consensus 29 ~~~l~v~a~d-~~~~~~~~~~~v~i~v~d~Nd~~P 62 (102)
.=.|.|.|+- .++.+++.+..+.+.|..-| ++|
T Consensus 99 ~Gnl~VvAtv~d~~~~l~~e~~liVtVqr~~-~pp 132 (133)
T PF09100_consen 99 AGNLKVVATVKDGGKPLTGEAHLIVTVQRWN-NPP 132 (133)
T ss_dssp -EEEEEEEEETTTT---EEEEEEEEE---S----S
T ss_pred cccEEEEEEEccCCcccceeEeEEEEeeccc-CCC
Confidence 3456666653 34556888888999888775 555
No 30
>TIGR01965 VCBS_repeat VCBS repeat. This domain of about 100 residues is found multiple (up to 35) copies in long proteins from several species of Vibrio, Colwellia, Bradyrhizobium, and Shewanella (hence the name VCBS) and in smaller copy numbers in proteins from several other bacteria. The large protein size and repeat copy numbers, species distribution, and suggested activities of several member proteins suggests a role for this domain in adhesion.
Probab=26.11 E-value=1.5e+02 Score=17.40 Aligned_cols=40 Identities=18% Similarity=0.297 Sum_probs=25.6
Q ss_pred EEEEEEEEEECCCCCceeeEEEEEEEecCCCCCcEeecCcceEEeCC
Q psy11095 29 EYNLTIRVENNGAQQLASEATVHIILEDVNDEIPLFTEREQETVLEG 75 (102)
Q Consensus 29 ~~~l~v~a~d~~~~~~~~~~~v~i~v~d~Nd~~P~f~~~~~~~v~E~ 75 (102)
.-.|++.+.|. ....|.|.|.-.| .+|.........+.|+
T Consensus 58 tdsFtvtv~DG------tt~~vtItI~GtN-Dapvi~~~~~g~v~ED 97 (99)
T TIGR01965 58 TDTFTVTSADG------TSQTVTITITGAN-DAAVIGGADTGSVTED 97 (99)
T ss_pred EEEEEEEEeCC------CeEEEEEEEEccC-CCCEEecccceeEecC
Confidence 34566777774 2777999999998 5555544333455554
No 31
>PF07145 PAM2: Ataxin-2 C-terminal region; InterPro: IPR009818 This entry represents a conserved region approximately 250 residues long located towards the C terminus of eukaryotic ataxin-2. Ataxin-2 is a protein of unknown function, within which expansion of a polyglutamine tract (due to expansion of unstable CAG repeats in the coding region of the SCA2 gene) causes spinocerebellar ataxia type 2 (SCA2), a late-onset neurodegenerative disorder []. The expanded polyglutamine repeat in ataxin-2 causes disruption of the normal morphology of the Golgi complex and increased incidence of cell death []. Ataxin-2 is predicted to consist of mostly non-globular domains [].; PDB: 3NTW_B 1JH4_B 3KTR_B 3KUJ_B 3KUT_D 3KUS_D 1JGN_B 2RQG_A 2RQH_A.
Probab=25.97 E-value=36 Score=13.57 Aligned_cols=11 Identities=27% Similarity=0.489 Sum_probs=7.0
Q ss_pred CCCCCcEeecC
Q psy11095 57 VNDEIPLFTER 67 (102)
Q Consensus 57 ~Nd~~P~f~~~ 67 (102)
.|.++|.|.+.
T Consensus 6 LNp~A~eFvP~ 16 (18)
T PF07145_consen 6 LNPNAPEFVPS 16 (18)
T ss_dssp SSTTSSSS-TT
T ss_pred cCCCCccccCC
Confidence 56778887654
No 32
>TIGR00845 caca sodium/calcium exchanger 1. This model is specific for the eukaryotic sodium ion/calcium ion exchangers of the Caca family
Probab=24.51 E-value=3.8e+02 Score=22.37 Aligned_cols=27 Identities=30% Similarity=0.353 Sum_probs=17.2
Q ss_pred EEEEEEEecCCCCCcEeecCcc-eEEeCC
Q psy11095 48 ATVHIILEDVNDEIPLFTEREQ-ETVLEG 75 (102)
Q Consensus 48 ~~v~i~v~d~Nd~~P~f~~~~~-~~v~E~ 75 (102)
...+|.|.| ||++|.|.-..- ..|.|+
T Consensus 515 s~ATVTIlD-DD~aGIfsFe~~~~sV~Es 542 (928)
T TIGR00845 515 NTATVTILD-DDHAGIFTFEEDVFHVSES 542 (928)
T ss_pred ceEEEEEec-CcccCcccccCceEEEEcC
Confidence 355677777 578887765432 366665
Done!