Query psy7019
Match_columns 128
No_of_seqs 126 out of 692
Neff 6.7
Searched_HMMs 46136
Date Sat Aug 17 00:43:29 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy7019.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/7019hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG0196|consensus 99.2 9.8E-12 2.1E-16 109.7 6.1 72 3-74 465-537 (996)
2 PF00041 fn3: Fibronectin type 98.8 1E-08 2.2E-13 65.3 5.2 59 6-64 25-85 (85)
3 KOG3513|consensus 98.6 4.7E-07 1E-11 82.5 10.2 75 4-78 843-920 (1051)
4 cd00063 FN3 Fibronectin type 3 98.5 1.2E-06 2.7E-11 54.4 7.9 62 9-70 29-92 (93)
5 KOG4221|consensus 98.5 2E-06 4.3E-11 79.2 11.8 84 3-87 542-627 (1381)
6 KOG4367|consensus 98.4 1.4E-07 3.1E-12 79.3 2.5 62 12-73 470-531 (699)
7 KOG4221|consensus 98.3 3.3E-06 7.2E-11 77.7 8.2 70 7-76 644-715 (1381)
8 KOG3513|consensus 97.9 5.5E-05 1.2E-09 69.3 9.3 73 6-78 639-718 (1051)
9 PF09294 Interfer-bind: Interf 97.8 7.4E-05 1.6E-09 50.3 5.8 61 11-71 43-105 (106)
10 smart00060 FN3 Fibronectin typ 97.6 0.0008 1.7E-08 40.2 8.1 36 25-60 46-82 (83)
11 PF07495 Y_Y_Y: Y_Y_Y domain; 96.3 0.02 4.4E-07 35.1 5.9 46 13-64 11-57 (66)
12 KOG4258|consensus 96.2 0.011 2.4E-07 53.7 6.1 70 2-72 850-923 (1025)
13 KOG4152|consensus 96.2 0.0064 1.4E-07 52.9 4.1 37 36-72 648-684 (830)
14 KOG4802|consensus 96.0 0.021 4.5E-07 48.4 6.4 63 24-86 197-263 (516)
15 PF01108 Tissue_fac: Tissue fa 95.9 0.032 7E-07 37.9 5.9 52 8-59 47-103 (107)
16 KOG4222|consensus 95.7 0.076 1.6E-06 49.7 9.1 63 6-69 776-841 (1281)
17 PLN02533 probable purple acid 91.7 0.3 6.5E-06 41.2 4.7 35 34-74 102-136 (427)
18 KOG0196|consensus 90.9 0.55 1.2E-05 43.0 5.6 46 33-78 396-445 (996)
19 PF10179 DUF2369: Uncharacteri 90.5 0.53 1.2E-05 38.3 4.7 29 29-57 9-37 (300)
20 PF09423 PhoD: PhoD-like phosp 90.3 0.52 1.1E-05 39.6 4.8 34 36-72 65-98 (453)
21 KOG4222|consensus 90.3 0.17 3.7E-06 47.5 2.0 64 11-74 558-624 (1281)
22 PF10179 DUF2369: Uncharacteri 85.9 4.9 0.00011 32.7 7.6 76 35-114 128-211 (300)
23 KOG1378|consensus 78.9 3.9 8.4E-05 35.1 4.7 38 31-73 105-142 (452)
24 KOG4806|consensus 61.9 12 0.00027 31.5 3.9 29 29-57 164-192 (454)
25 cd05762 Ig8_MLCK Eighth immuno 61.5 33 0.00071 22.5 5.4 44 33-76 54-97 (98)
26 KOG4258|consensus 59.7 15 0.00033 34.2 4.4 43 36-78 567-613 (1025)
27 PF11344 DUF3146: Protein of u 57.7 7.4 0.00016 25.6 1.6 15 41-55 66-80 (80)
28 PF07353 Uroplakin_II: Uroplak 57.4 19 0.00042 27.0 3.9 36 35-71 102-137 (184)
29 COG4733 Phage-related protein, 55.4 29 0.00063 32.3 5.4 38 24-62 649-686 (952)
30 KOG4802|consensus 51.6 28 0.00062 30.0 4.5 36 37-72 316-353 (516)
31 TIGR03000 plancto_dom_1 Planct 49.9 23 0.0005 23.1 2.9 25 31-55 25-49 (75)
32 PF04775 Bile_Hydr_Trans: Acyl 48.5 30 0.00065 24.2 3.7 25 35-59 5-29 (126)
33 COG3540 PhoD Phosphodiesterase 47.5 26 0.00057 30.6 3.7 36 35-75 101-136 (522)
34 PF02922 CBM_48: Carbohydrate- 47.5 55 0.0012 20.4 4.5 20 37-56 53-74 (85)
35 PF02010 REJ: REJ domain; Int 46.2 6.7 0.00014 32.3 0.0 32 40-72 267-298 (440)
36 smart00112 CA Cadherin repeats 44.7 71 0.0015 19.4 5.7 47 11-60 10-59 (79)
37 PF13754 Big_3_4: Bacterial Ig 44.4 66 0.0014 19.0 5.2 33 22-64 10-42 (54)
38 cd05735 Ig8_DSCAM Eight immuno 42.3 56 0.0012 20.8 4.0 41 35-75 47-87 (88)
39 cd05748 Ig_Titin_like Immunogl 42.3 75 0.0016 19.0 5.1 28 33-60 38-65 (74)
40 cd02856 Glycogen_debranching_e 40.7 37 0.0008 22.4 3.0 19 36-54 48-66 (103)
41 PF15417 DUF4624: Domain of un 39.8 61 0.0013 23.0 4.0 32 23-54 73-108 (132)
42 PF01299 Lamp: Lysosome-associ 37.9 14 0.00031 29.6 0.7 34 35-69 231-264 (306)
43 COG4733 Phage-related protein, 37.4 31 0.00067 32.2 2.8 25 40-64 762-786 (952)
44 KOG0095|consensus 36.6 23 0.00049 26.7 1.6 26 1-27 45-70 (213)
45 PHA02579 7 baseplate wedge sub 35.8 53 0.0012 30.6 3.9 51 9-60 29-89 (1030)
46 cd02852 Isoamylase_N_term Isoa 35.7 48 0.001 22.4 3.0 19 36-54 52-70 (119)
47 PF13750 Big_3_3: Bacterial Ig 33.7 1.4E+02 0.0031 21.7 5.4 21 40-60 117-137 (158)
48 COG3979 Uncharacterized protei 33.7 89 0.0019 23.4 4.4 41 33-73 51-92 (181)
49 KOG4152|consensus 33.3 74 0.0016 28.5 4.3 59 8-66 716-791 (830)
50 cd02860 Pullulanase_N_term Pul 32.6 69 0.0015 20.9 3.3 20 36-55 50-69 (100)
51 PF00801 PKD: PKD domain; Int 31.6 1.2E+02 0.0026 18.2 5.2 45 8-60 22-66 (69)
52 cd05894 Ig_C5_MyBP-C C5 immuno 30.6 1.4E+02 0.0031 18.7 4.8 27 34-60 51-77 (86)
53 PF05345 He_PIG: Putative Ig d 30.4 61 0.0013 18.9 2.5 14 46-59 36-49 (49)
54 cd05851 Ig3_Contactin-1 Third 30.0 80 0.0017 20.0 3.2 31 34-64 52-82 (88)
55 COG3422 Uncharacterized conser 30.0 42 0.00092 20.9 1.7 14 44-57 11-24 (59)
56 cd02853 MTHase_N_term Maltooli 29.7 65 0.0014 20.4 2.7 17 36-53 43-59 (85)
57 PF13860 FlgD_ig: FlgD Ig-like 29.6 1.5E+02 0.0033 18.7 4.8 16 42-57 65-80 (81)
58 PF00907 T-box: T-box; InterP 28.7 70 0.0015 23.5 3.1 27 33-59 31-57 (184)
59 PF00028 Cadherin: Cadherin do 28.5 50 0.0011 20.8 2.0 48 9-59 28-78 (93)
60 PF12514 DUF3718: Protein of u 27.1 19 0.00042 22.8 -0.2 15 109-123 25-39 (68)
61 PRK12634 flgD flagellar basal 25.9 2.9E+02 0.0063 21.3 6.2 17 43-59 163-179 (221)
62 PF02018 CBM_4_9: Carbohydrate 25.6 73 0.0016 20.8 2.5 20 40-59 56-75 (131)
63 COG5584 Predicted small secret 25.1 2.1E+02 0.0045 19.7 4.6 44 9-56 51-94 (103)
64 cd04972 Ig_TrkABC_d4 Fourth do 24.8 1.2E+02 0.0027 18.9 3.4 27 35-61 56-82 (90)
65 COG3900 Predicted periplasmic 24.1 2.7E+02 0.0058 22.2 5.6 62 7-71 171-238 (262)
66 PF05738 Cna_B: Cna protein B- 23.7 87 0.0019 18.7 2.4 25 29-54 21-45 (70)
67 cd05893 Ig_Palladin_C C-termin 23.7 52 0.0011 20.4 1.4 27 36-62 42-68 (75)
68 KOG4806|consensus 22.8 1.1E+02 0.0024 26.0 3.5 25 36-60 411-435 (454)
69 KOG1948|consensus 22.3 2.9E+02 0.0063 26.4 6.2 22 34-55 945-966 (1165)
70 cd05765 Ig_3 Subgroup of the i 21.9 1.5E+02 0.0032 17.7 3.3 28 34-61 46-73 (81)
71 PLN02960 alpha-amylase 21.8 1.3E+02 0.0029 28.3 4.0 41 5-54 306-346 (897)
72 PLN03144 Carbon catabolite rep 21.5 2.3E+02 0.005 25.4 5.3 65 6-72 151-218 (606)
73 PRK06655 flgD flagellar basal 21.3 3.9E+02 0.0084 20.7 6.1 13 43-55 167-179 (225)
No 1
>KOG0196|consensus
Probab=99.24 E-value=9.8e-12 Score=109.73 Aligned_cols=72 Identities=22% Similarity=0.327 Sum_probs=63.8
Q ss_pred ccceeCCcEEEEEEEeeCCC-CCCeEEEEecCCceEEEccCCCCCEEEEEEEEEcCCCCCCCCCceeeeeCCC
Q psy7019 3 KTTEFNQASFKASHSLSDRP-DDRKHVVYQGTNFSCKVNKLQELTTYRFYITATNDAGQGPYSEAYPFTTCTA 74 (128)
Q Consensus 3 ~~~~~~~~~i~Y~le~~~~~-~~~~~~v~~g~~t~~~V~~L~p~t~Y~FRVrA~N~~G~G~~S~~~~~~T~~~ 74 (128)
++-+-||.++.|+|++++++ ++........+.++.++++|+|++.|.|||||++.+|+|+||...+|+|.+.
T Consensus 465 ~p~~png~ildYEvky~ek~~~e~~~~~~~t~~~~~ti~gL~p~t~YvfqVRarT~aG~G~~S~~~~fqT~~~ 537 (996)
T KOG0196|consen 465 EPDQPNGVILDYEVKYYEKDEDERSYSTLKTKTTTATITGLKPGTVYVFQVRARTAAGYGPYSGKHEFQTLPS 537 (996)
T ss_pred CCCCCCCcceeEEEEEeeccccccceeEEecccceEEeeccCCCcEEEEEEEEecccCCCCCCCceeeeecCc
Confidence 56678999999999999875 4555566666889999999999999999999999999999999999999875
No 2
>PF00041 fn3: Fibronectin type III domain; InterPro: IPR003961 Fibronectins are multi-domain glycoproteins found in a soluble form in plasma, and in an insoluble form in loose connective tissue and basement membranes []. They contain multiple copies of 3 repeat regions (types I, II and III), which bind to a variety of substances including heparin, collagen, DNA, actin, fibrin and fibronectin receptors on cell surfaces. The wide variety of these substances means that fibronectins are involved in a number of important functions: e.g., wound healing; cell adhesion; blood coagulation; cell differentiation and migration; maintenance of the cellular cytoskeleton; and tumour metastasis []. The role of fibronectin in cell differentiation is demonstrated by the marked reduction in the expression of its gene when neoplastic transformation occurs. Cell attachment has been found to be mediated by the binding of the tetrapeptide RGDS to integrins on the cell surface [], although related sequences can also display cell adhesion activity. Plasma fibronectin occurs as a dimer of 2 different subunits, linked together by 2 disulphide bonds near the C terminus. The difference in the 2 chains occurs in the type III repeat region and is caused by alternative splicing of the mRNA from one gene []. The observation that, in a given protein, an individual repeat of one of the 3 types (e.g., the first FnIII repeat) shows much less similarity to its subsequent tandem repeats within that protein than to its equivalent repeat between fibronectins from other species, has suggested that the repeating structure of fibronectin arose at an early stage of evolution. It also seems to suggest that the structure is subject to high selective pressure []. The fibronectin type III repeat region is an approximately 100 amino acid domain, different tandem repeats of which contain binding sites for DNA, heparin and the cell surface []. The superfamily of sequences believed to contain FnIII repeats represents 45 different families, the majority of which are involved in cell surface binding in some manner, or are receptor protein tyrosine kinases, or cytokine receptors.; GO: 0005515 protein binding; PDB: 1UEM_A 1TDQ_A 1X5I_A 2IC2_B 2IBG_C 2IBB_A 3R8Q_A 2FNB_A 1FNH_A 2EDB_A ....
Probab=98.80 E-value=1e-08 Score=65.28 Aligned_cols=59 Identities=27% Similarity=0.270 Sum_probs=47.2
Q ss_pred eeCCcEEEEEEEeeCCCCCC-eE-EEEecCCceEEEccCCCCCEEEEEEEEEcCCCCCCCC
Q psy7019 6 EFNQASFKASHSLSDRPDDR-KH-VVYQGTNFSCKVNKLQELTTYRFYITATNDAGQGPYS 64 (128)
Q Consensus 6 ~~~~~~i~Y~le~~~~~~~~-~~-~v~~g~~t~~~V~~L~p~t~Y~FRVrA~N~~G~G~~S 64 (128)
..++...+|.|++....+.. +. ....+..+.+.+.+|.|++.|.|+|+|.|..|.|++|
T Consensus 25 ~~~~~~~~y~v~~~~~~~~~~~~~~~~~~~~~~~~i~~L~p~t~Y~~~v~a~~~~g~g~~S 85 (85)
T PF00041_consen 25 SGNGPITGYRVEYRSVNSTSDWQEVTVPGNETSYTITGLQPGTTYEFRVRAVNSDGEGPPS 85 (85)
T ss_dssp STSSSESEEEEEEEETTSSSEEEEEEEETTSSEEEEESCCTTSEEEEEEEEEETTEEEEEE
T ss_pred CCCCCeeEEEEEEEecccceeeeeeeeeeeeeeeeeccCCCCCEEEEEEEEEeCCcCcCCC
Confidence 34666779999998655554 43 3455566799999999999999999999999998875
No 3
>KOG3513|consensus
Probab=98.56 E-value=4.7e-07 Score=82.48 Aligned_cols=75 Identities=28% Similarity=0.296 Sum_probs=62.3
Q ss_pred cceeCCcEEEEEEEeeCCCCC---CeEEEEecCCceEEEccCCCCCEEEEEEEEEcCCCCCCCCCceeeeeCCCCCCC
Q psy7019 4 TTEFNQASFKASHSLSDRPDD---RKHVVYQGTNFSCKVNKLQELTTYRFYITATNDAGQGPYSEAYPFTTCTAPPAP 78 (128)
Q Consensus 4 ~~~~~~~~i~Y~le~~~~~~~---~~~~v~~g~~t~~~V~~L~p~t~Y~FRVrA~N~~G~G~~S~~~~~~T~~~pp~~ 78 (128)
+.+-||-..+|+|++...++. .......+..++.++++|+|++.|.|.|||.|.+|.||.|......|..+||.+
T Consensus 843 ~~~~nG~l~gY~v~Y~~~~~~~~~~~~~~i~~~~~~~~ltgL~~~T~Y~~~vrA~nsaG~Gp~s~~~~~tt~k~pPs~ 920 (1051)
T KOG3513|consen 843 PLWDNGKLTGYEVKYWKINEKEGSLSRVQIAGNRTSWRLTGLEPNTKYRFYVRAYTSAGGGPASSEENVTTKKAPPSQ 920 (1051)
T ss_pred cCccCCccceeEEEEEEcCCCcccccceeecCCcceEeeeCCCCCceEEEEEEEecCCCCCCCccceeccccCCCCcc
Confidence 456788889999999753332 233344578899999999999999999999999999999999999998888875
No 4
>cd00063 FN3 Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
Probab=98.48 E-value=1.2e-06 Score=54.44 Aligned_cols=62 Identities=21% Similarity=0.203 Sum_probs=49.7
Q ss_pred CcEEEEEEEeeCCCCCCeEEEEec--CCceEEEccCCCCCEEEEEEEEEcCCCCCCCCCceeee
Q psy7019 9 QASFKASHSLSDRPDDRKHVVYQG--TNFSCKVNKLQELTTYRFYITATNDAGQGPYSEAYPFT 70 (128)
Q Consensus 9 ~~~i~Y~le~~~~~~~~~~~v~~g--~~t~~~V~~L~p~t~Y~FRVrA~N~~G~G~~S~~~~~~ 70 (128)
+....|.+++.......|..+... ..+.+.+.+|.|++.|.|+|+|.+..|.|.+|....|.
T Consensus 29 ~~~~~y~v~~~~~~~~~~~~~~~~~~~~~~~~i~~l~p~~~Y~~~v~a~~~~~~~~~s~~~~~~ 92 (93)
T cd00063 29 GPITGYVVEYREKGSGDWKEVEVTPGSETSYTLTGLKPGTEYEFRVRAVNGGGESPPSESVTVT 92 (93)
T ss_pred CcceeEEEEEeeCCCCCCEEeeccCCcccEEEEccccCCCEEEEEEEEECCCccCCCccccccc
Confidence 556789999876445667766654 67889999999999999999999999988888755543
No 5
>KOG4221|consensus
Probab=98.47 E-value=2e-06 Score=79.18 Aligned_cols=84 Identities=17% Similarity=0.145 Sum_probs=67.1
Q ss_pred ccce-eCCcEEEEEEEeeCCCCCCeEEEEecCCceEEEccCCCCCEEEEEEEEEcCCCCCCCCCceeeeeCCCCCCCCC-
Q psy7019 3 KTTE-FNQASFKASHSLSDRPDDRKHVVYQGTNFSCKVNKLQELTTYRFYITATNDAGQGPYSEAYPFTTCTAPPAPLK- 80 (128)
Q Consensus 3 ~~~~-~~~~~i~Y~le~~~~~~~~~~~v~~g~~t~~~V~~L~p~t~Y~FRVrA~N~~G~G~~S~~~~~~T~~~pp~~~~- 80 (128)
|+++ -||.+++|.+-+.+.+.+.|. .+....++++|.+|+|.+.|.|||.|.|.+|.|..|..+.|.|..+-|+.|.
T Consensus 542 epP~~~n~~I~~yk~~ys~~~~~~~~-~~~~n~~e~ti~gL~k~TeY~~~vvA~N~~G~g~sS~~i~V~Tlsd~PsaPP~ 620 (1381)
T KOG4221|consen 542 EPPPFGNGPITGYKLFYSEDDTGKEL-RVENNATEYTINGLEKYTEYSIRVVAYNSAGSGVSSADITVRTLSDVPSAPPQ 620 (1381)
T ss_pred cCCCCCCCCceEEEEEEEcCCCCceE-EEecCccEEEeecCCCccceEEEEEEecCCCCCCCCCceEEEeccCCCCCCCc
Confidence 4445 788899999999865455554 3444788999999999999999999999999999999999999988776533
Q ss_pred CceEEEe
Q psy7019 81 GKTYFSL 87 (128)
Q Consensus 81 ~p~v~~~ 87 (128)
+..+...
T Consensus 621 Nl~lev~ 627 (1381)
T KOG4221|consen 621 NLSLEVV 627 (1381)
T ss_pred ceEEEec
Confidence 3555543
No 6
>KOG4367|consensus
Probab=98.40 E-value=1.4e-07 Score=79.31 Aligned_cols=62 Identities=27% Similarity=0.351 Sum_probs=58.7
Q ss_pred EEEEEEeeCCCCCCeEEEEecCCceEEEccCCCCCEEEEEEEEEcCCCCCCCCCceeeeeCC
Q psy7019 12 FKASHSLSDRPDDRKHVVYQGTNFSCKVNKLQELTTYRFYITATNDAGQGPYSEAYPFTTCT 73 (128)
Q Consensus 12 i~Y~le~~~~~~~~~~~v~~g~~t~~~V~~L~p~t~Y~FRVrA~N~~G~G~~S~~~~~~T~~ 73 (128)
=+|.||..+.++++|..+|.|+++-|+|++|+-++.|.-||+|+|++|.++||+.+.++|+.
T Consensus 470 dg~~leld~g~~g~frevy~g~etmctvdglhfns~y~arvka~n~tg~s~ys~tl~lqts~ 531 (699)
T KOG4367|consen 470 DGYILELDDGNGGQFREVYVGKETMCTVDGLHFNSTYNARVKAFNKTGVSPYSKTLVLQTSE 531 (699)
T ss_pred cceEEEeecCCCCceeEEEecCceeEEecceecchhHHHHHHHhhccCCCcccceeEeeecc
Confidence 37999999888899999999999999999999999999999999999999999999999974
No 7
>KOG4221|consensus
Probab=98.27 E-value=3.3e-06 Score=77.74 Aligned_cols=70 Identities=24% Similarity=0.205 Sum_probs=57.9
Q ss_pred eCCcEEEEEEEeeCCC-CCC-eEEEEecCCceEEEccCCCCCEEEEEEEEEcCCCCCCCCCceeeeeCCCCC
Q psy7019 7 FNQASFKASHSLSDRP-DDR-KHVVYQGTNFSCKVNKLQELTTYRFYITATNDAGQGPYSEAYPFTTCTAPP 76 (128)
Q Consensus 7 ~~~~~i~Y~le~~~~~-~~~-~~~v~~g~~t~~~V~~L~p~t~Y~FRVrA~N~~G~G~~S~~~~~~T~~~pp 76 (128)
-||-..+|.++++... +.+ ...+..|..+.+.+.+|+|++.|.||+.|.|..|.|++|+++.+.|+...+
T Consensus 644 ~ng~itgYkIRy~~~~~~~~~~~t~v~~n~~~~l~~~Lep~T~Y~vrIsa~t~nGtGpaS~w~~aeT~~~d~ 715 (1381)
T KOG4221|consen 644 QNGQITGYKIRYRKLSREDEVNETVVKGNTTQYLFNGLEPNTQYRVRISAMTVNGTGPASEWVSAETPESDL 715 (1381)
T ss_pred ccceEEEEEEEecccCcccccceeecccchhhhHhhcCCCCceEEEEEEEeccCCCCCcccceeccCccccc
Confidence 3777889999997432 223 356677788899999999999999999999999999999999999975433
No 8
>KOG3513|consensus
Probab=97.93 E-value=5.5e-05 Score=69.29 Aligned_cols=73 Identities=22% Similarity=0.142 Sum_probs=58.4
Q ss_pred eeCCc-EEEEEEEeeCCCCCCeEEEEec--CC---ceEEEccCCCCCEEEEEEEEEcCCCCCCCCCce-eeeeCCCCCCC
Q psy7019 6 EFNQA-SFKASHSLSDRPDDRKHVVYQG--TN---FSCKVNKLQELTTYRFYITATNDAGQGPYSEAY-PFTTCTAPPAP 78 (128)
Q Consensus 6 ~~~~~-~i~Y~le~~~~~~~~~~~v~~g--~~---t~~~V~~L~p~t~Y~FRVrA~N~~G~G~~S~~~-~~~T~~~pp~~ 78 (128)
..|+. +..|.+|.++...+.|..+..- .. .+..+.+|.|-..|.|||.|.|..|.|++|.++ ...|.++.|..
T Consensus 639 ~dn~SpI~~Y~iq~rt~~~~~W~~v~~vp~~~~~~~sa~vv~L~Pwv~YeFRV~AvN~iG~gePS~pS~~~rT~ea~P~~ 718 (1051)
T KOG3513|consen 639 SDNNSPIEKYTIQFRTPFPGKWKAVTTVPGNITGDESATVVNLSPWVEYEFRVVAVNSIGIGEPSPPSEKVRTPEAAPSV 718 (1051)
T ss_pred CCCCCCceEEeEEecCCCCCcceEeeECCCcccCccceeEEccCCCcceEEEEEEEcccccCCCCCCccceecCCCCCcc
Confidence 34544 5689999998888899876642 22 367899999999999999999999999999876 56777776653
No 9
>PF09294 Interfer-bind: Interferon-alpha/beta receptor, fibronectin type III; InterPro: IPR015373 Members of this family adopt a secondary structure consisting of seven beta-strands arranged in an immunoglobulin-like beta-sandwich, in a Greek-key topology. They are required for binding to interferon-alpha []. ; PDB: 1A21_A 3LQM_B 3ELA_T 1AHW_C 2A2Q_T 1TFH_B 1FAK_T 1WSS_T 1W2K_T 2FIR_T ....
Probab=97.78 E-value=7.4e-05 Score=50.28 Aligned_cols=61 Identities=21% Similarity=0.130 Sum_probs=43.0
Q ss_pred EEEEEEEeeCCCCCCeEEEEecCCceEEEccCCCCCEEEEEEEEEcC--CCCCCCCCceeeee
Q psy7019 11 SFKASHSLSDRPDDRKHVVYQGTNFSCKVNKLQELTTYRFYITATND--AGQGPYSEAYPFTT 71 (128)
Q Consensus 11 ~i~Y~le~~~~~~~~~~~v~~g~~t~~~V~~L~p~t~Y~FRVrA~N~--~G~G~~S~~~~~~T 71 (128)
.+.|.|.+.+.....-........+.+.+.+|.|++.|.++|+|... .-.|.+|++...+|
T Consensus 43 ~~~Y~v~~~~~~~~~~~~~~~~~~~~~~l~~L~p~t~YCv~V~~~~~~~~~~s~~S~~~C~~t 105 (106)
T PF09294_consen 43 SLSYNVSYWKNGSNEKKKEIETKNSSVTLSDLKPGTNYCVSVQAFSPSQNKNSQPSEPQCITT 105 (106)
T ss_dssp G-EEEEEEEETTTSCEEEEEESSSEEEEEES--TTSEEEEEEEEEECSSTEEEEEBSEEEEE-
T ss_pred CeEEEEEEEeCCCccceEEEeecCCEEEEeCCCCCCCEEEEEEEEeccCCCcCCCCCCEeEeC
Confidence 47899998865555344556667778899999999999999999433 33577888777666
No 10
>smart00060 FN3 Fibronectin type 3 domain. One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
Probab=97.62 E-value=0.0008 Score=40.18 Aligned_cols=36 Identities=28% Similarity=0.302 Sum_probs=28.1
Q ss_pred CeEEEEecC-CceEEEccCCCCCEEEEEEEEEcCCCC
Q psy7019 25 RKHVVYQGT-NFSCKVNKLQELTTYRFYITATNDAGQ 60 (128)
Q Consensus 25 ~~~~v~~g~-~t~~~V~~L~p~t~Y~FRVrA~N~~G~ 60 (128)
.|....... ...+.+.+|.|++.|.|+|+|.+..|.
T Consensus 46 ~~~~~~~~~~~~~~~i~~L~~~~~Y~v~v~a~~~~g~ 82 (83)
T smart00060 46 SWKEVNVTPSSTSYTLTGLKPGTEYEFRVRAVNGAGE 82 (83)
T ss_pred ccEEEEecCCccEEEEeCcCCCCEEEEEEEEEcccCC
Confidence 455444333 478999999999999999999998654
No 11
>PF07495 Y_Y_Y: Y_Y_Y domain; InterPro: IPR011123 This region is mostly found at the end of the beta propellers (IPR011110 from INTERPRO) in a family of two component regulators. However they are also found tandemly repeated in Q891H4 from SWISSPROT without other signal conduction domains being present. It is named after the conserved tyrosines found in the alignment. The exact function is not known.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=96.29 E-value=0.02 Score=35.10 Aligned_cols=46 Identities=15% Similarity=0.094 Sum_probs=31.3
Q ss_pred EEEEEeeCCCCCCeEEEEecCCc-eEEEccCCCCCEEEEEEEEEcCCCCCCCC
Q psy7019 13 KASHSLSDRPDDRKHVVYQGTNF-SCKVNKLQELTTYRFYITATNDAGQGPYS 64 (128)
Q Consensus 13 ~Y~le~~~~~~~~~~~v~~g~~t-~~~V~~L~p~t~Y~FRVrA~N~~G~G~~S 64 (128)
.|.|+ . .+++|..... .. ++..+.|.|| .|.|+|+|.|..|.....
T Consensus 11 ~Y~l~--g-~d~~W~~~~~--~~~~~~~~~L~~G-~Y~l~V~a~~~~~~~~~~ 57 (66)
T PF07495_consen 11 RYRLE--G-FDDEWITLGS--YSNSISYTNLPPG-KYTLEVRAKDNNGKWSSD 57 (66)
T ss_dssp EEEEE--T-TESSEEEESS--TS-EEEEES--SE-EEEEEEEEEETTS-B-SS
T ss_pred EEEEE--C-CCCeEEECCC--CcEEEEEEeCCCE-EEEEEEEEECCCCCcCcc
Confidence 45554 2 4577876554 33 8999999988 899999999998877654
No 12
>KOG4258|consensus
Probab=96.23 E-value=0.011 Score=53.67 Aligned_cols=70 Identities=17% Similarity=0.185 Sum_probs=54.6
Q ss_pred cccceeCCcEEEEEEEeeCCCCCCeEEE----EecCCceEEEccCCCCCEEEEEEEEEcCCCCCCCCCceeeeeC
Q psy7019 2 IKTTEFNQASFKASHSLSDRPDDRKHVV----YQGTNFSCKVNKLQELTTYRFYITATNDAGQGPYSEAYPFTTC 72 (128)
Q Consensus 2 ~~~~~~~~~~i~Y~le~~~~~~~~~~~v----~~g~~t~~~V~~L~p~t~Y~FRVrA~N~~G~G~~S~~~~~~T~ 72 (128)
.|+.+-||-+++|.++++...+++-..+ +..+.-.+.++.|.|| .|..||||.+-+|.|++-+.+.|--.
T Consensus 850 ~EP~~pNGli~~Y~Vk~r~~~~et~v~cvsR~~~~k~~gv~l~~l~~G-~y~~~vratSlaGng~~t~~~~~~v~ 923 (1025)
T KOG4258|consen 850 LEPKEPNGLILNYEVKYRRNGDETHVECVSRMDYAKAGGVYLKRLNPG-NYSVRVRATSLAGNGSWTESVIFYVS 923 (1025)
T ss_pred CCCCCCCccEEEEEEEEeeccCcchhhhhhhhhhhhcCceEEEecCCC-cEEEEEEEEeeccCCCCcCCcceEEe
Confidence 4788999999999999986666554332 2224566889999988 89999999999999999887665443
No 13
>KOG4152|consensus
Probab=96.17 E-value=0.0064 Score=52.95 Aligned_cols=37 Identities=35% Similarity=0.511 Sum_probs=33.3
Q ss_pred eEEEccCCCCCEEEEEEEEEcCCCCCCCCCceeeeeC
Q psy7019 36 SCKVNKLQELTTYRFYITATNDAGQGPYSEAYPFTTC 72 (128)
Q Consensus 36 ~~~V~~L~p~t~Y~FRVrA~N~~G~G~~S~~~~~~T~ 72 (128)
.-+-..|.+|+.|.|||.|.|..|.|++|+..+|.|.
T Consensus 648 ~~~k~~lv~Gq~yrfrV~aIng~G~gp~s~i~~~kTc 684 (830)
T KOG4152|consen 648 KPVKTSLVTGQAYRFRVTAINGKGPGPASTILKLKTC 684 (830)
T ss_pred cccccccccccceeeeeeeeeccCCCchhhheeeeec
Confidence 3445678999999999999999999999999999996
No 14
>KOG4802|consensus
Probab=96.03 E-value=0.021 Score=48.36 Aligned_cols=63 Identities=16% Similarity=0.204 Sum_probs=44.3
Q ss_pred CCeEEEEecC-CceEEEccCCCCCEEEEEEEEEcCCCCCCCCCcee-eeeC--CCCCCCCCCceEEE
Q psy7019 24 DRKHVVYQGT-NFSCKVNKLQELTTYRFYITATNDAGQGPYSEAYP-FTTC--TAPPAPLKGKTYFS 86 (128)
Q Consensus 24 ~~~~~v~~g~-~t~~~V~~L~p~t~Y~FRVrA~N~~G~G~~S~~~~-~~T~--~~pp~~~~~p~v~~ 86 (128)
-+|..+.... ...+...++.||-=|.|||.|.|..|.=.||++.. |..+ +.+|++|+...+..
T Consensus 197 thwQtv~~t~~e~~~~~t~~rPgRwyefrvaavn~~G~rGFs~PSkpf~ssk~pkaPp~P~dl~l~~ 263 (516)
T KOG4802|consen 197 THWQTVEKTMEENTYIFTDMRPGRWYEFRVAAVNAYGFRGFSEPSKPFPSSKNPKAPPSPNDLKLIG 263 (516)
T ss_pred ccceeeeecCCCceeeeeecCcceeEEEEEeeeecccccccCCCCCCCCCCCCCCCCcCcccceeee
Confidence 3477766543 34788899999999999999999999888887764 3333 33344455556654
No 15
>PF01108 Tissue_fac: Tissue factor; PDB: 3OG4_B 3OG6_B 1FYH_E 1FG9_D 1JRH_I 3DGC_R 3DLQ_R 1LQS_R 1Y6M_R 1J7V_R ....
Probab=95.89 E-value=0.032 Score=37.87 Aligned_cols=52 Identities=12% Similarity=0.051 Sum_probs=40.6
Q ss_pred CCcEEEEEEEeeCCCCCCeEEE---EecCCceEEEccCC--CCCEEEEEEEEEcCCC
Q psy7019 8 NQASFKASHSLSDRPDDRKHVV---YQGTNFSCKVNKLQ--ELTTYRFYITATNDAG 59 (128)
Q Consensus 8 ~~~~i~Y~le~~~~~~~~~~~v---~~g~~t~~~V~~L~--p~t~Y~FRVrA~N~~G 59 (128)
+.....|.+|+....++.|..+ .....++|.+++.. +...|.+||||.-...
T Consensus 47 ~~~~~~ytVq~~~~~~~~W~~v~~C~~i~~~~Cdlt~~~~~~~~~Y~~rV~A~~~~~ 103 (107)
T PF01108_consen 47 SPPNVTYTVQYKKYGSSSWKDVPGCQNITETSCDLTDETSDPSESYYARVRAEVGNQ 103 (107)
T ss_dssp SSSTEEEEEEEEESSTSCEEEECCEEEESSSEEECTTCCTTTTSEEEEEEEEEETTE
T ss_pred CCCCeEEEEEEEecCCcceeeccceecccccceeCcchhhcCcCCEEEEEEEEeCCc
Confidence 4456899999985577889876 44467899998865 7899999999986543
No 16
>KOG4222|consensus
Probab=95.72 E-value=0.076 Score=49.73 Aligned_cols=63 Identities=19% Similarity=0.128 Sum_probs=48.2
Q ss_pred eeCCcEEEEEEEeeCCCCCC---eEEEEecCCceEEEccCCCCCEEEEEEEEEcCCCCCCCCCceee
Q psy7019 6 EFNQASFKASHSLSDRPDDR---KHVVYQGTNFSCKVNKLQELTTYRFYITATNDAGQGPYSEAYPF 69 (128)
Q Consensus 6 ~~~~~~i~Y~le~~~~~~~~---~~~v~~g~~t~~~V~~L~p~t~Y~FRVrA~N~~G~G~~S~~~~~ 69 (128)
+.||...+|.++....+.-. ...+. ....+.+|.+|.+|..|.|+|-|+|++|.|..|+..-.
T Consensus 776 ~~ng~~qg~ki~~~~~e~tr~h~n~t~~-a~~~sv~i~~l~~g~ay~vtv~a~T~aGvG~~s~p~~~ 841 (1281)
T KOG4222|consen 776 VQNGILQGYKIECSGGEKTRIHINKTTN-ARTGSVTIGNLVTGIAYSVTVAARTGAGVGVKSPPQPI 841 (1281)
T ss_pred ccCCcccceeEEeecCcccccccccccc-CCCCceEeccccccceEEEEEeeecCCccCCCCCCeee
Confidence 46888889999986433222 22223 45678999999999999999999999999999876433
No 17
>PLN02533 probable purple acid phosphatase
Probab=91.74 E-value=0.3 Score=41.16 Aligned_cols=35 Identities=14% Similarity=0.184 Sum_probs=27.3
Q ss_pred CceEEEccCCCCCEEEEEEEEEcCCCCCCCCCceeeeeCCC
Q psy7019 34 NFSCKVNKLQELTTYRFYITATNDAGQGPYSEAYPFTTCTA 74 (128)
Q Consensus 34 ~t~~~V~~L~p~t~Y~FRVrA~N~~G~G~~S~~~~~~T~~~ 74 (128)
...++|++|+|++.|.+||. ....|+...|.|++.
T Consensus 102 iH~v~l~~L~p~T~Y~Yrvg------~~~~s~~~~F~T~p~ 136 (427)
T PLN02533 102 INDVVIGPLKPNTVYYYKCG------GPSSTQEFSFRTPPS 136 (427)
T ss_pred EEEEEeCCCCCCCEEEEEEC------CCCCccceEEECCCC
Confidence 35689999999999999994 223477788998754
No 18
>KOG0196|consensus
Probab=90.90 E-value=0.55 Score=43.04 Aligned_cols=46 Identities=33% Similarity=0.486 Sum_probs=34.5
Q ss_pred CCceEEEccCCCCCEEEEEEEEEcCCCC-CCC---CCceeeeeCCCCCCC
Q psy7019 33 TNFSCKVNKLQELTTYRFYITATNDAGQ-GPY---SEAYPFTTCTAPPAP 78 (128)
Q Consensus 33 ~~t~~~V~~L~p~t~Y~FRVrA~N~~G~-G~~---S~~~~~~T~~~pp~~ 78 (128)
.++++.|.+|.|.+.|.|.|.|.|+... +++ ...+.+.|..+.|++
T Consensus 396 t~~~V~v~~L~ah~~YTFeV~AvNgVS~lsp~~~~~a~vnItt~qa~ps~ 445 (996)
T KOG0196|consen 396 TETSVTVSDLLAHTNYTFEVEAVNGVSDLSPFPRQFASVNITTNQAAPSP 445 (996)
T ss_pred ccceEEEeccccccccEEEEEEeecccccCCCCCcceeEEeeccccCCCc
Confidence 3577999999999999999999998543 332 234677777666655
No 19
>PF10179 DUF2369: Uncharacterised conserved protein (DUF2369); InterPro: IPR019326 This is a proline-rich region of a group of proteins found from plants to fungi. The function is largely unknown, although the entry contains Fibronectin type-III domain-containing protein C4orf31, which promotes matrix assembly and cell adhesiveness.
Probab=90.45 E-value=0.53 Score=38.27 Aligned_cols=29 Identities=28% Similarity=0.331 Sum_probs=26.2
Q ss_pred EEecCCceEEEccCCCCCEEEEEEEEEcC
Q psy7019 29 VYQGTNFSCKVNKLQELTTYRFYITATND 57 (128)
Q Consensus 29 v~~g~~t~~~V~~L~p~t~Y~FRVrA~N~ 57 (128)
+-.|..+.+.|.+|+|++.|.|.|.|+|.
T Consensus 9 vCvg~~t~~t~~~L~p~t~YyfdVF~vn~ 37 (300)
T PF10179_consen 9 VCVGQKTNQTLSGLKPDTTYYFDVFVVNQ 37 (300)
T ss_pred EEcCCCceEEeccCCCCCeEEEEEEEEEC
Confidence 44567899999999999999999999998
No 20
>PF09423 PhoD: PhoD-like phosphatase; InterPro: IPR018946 This entry contains a number of putative proteins as well as Alkaline phosphatase D which catalyses the reaction: A phosphate monoester + H(2)O = an alcohol + phosphate ; PDB: 2YEQ_B.
Probab=90.34 E-value=0.52 Score=39.59 Aligned_cols=34 Identities=29% Similarity=0.339 Sum_probs=19.7
Q ss_pred eEEEccCCCCCEEEEEEEEEcCCCCCCCCCceeeeeC
Q psy7019 36 SCKVNKLQELTTYRFYITATNDAGQGPYSEAYPFTTC 72 (128)
Q Consensus 36 ~~~V~~L~p~t~Y~FRVrA~N~~G~G~~S~~~~~~T~ 72 (128)
.+.|++|+|++.|.+|+.. .+.+..|....|+|.
T Consensus 65 ~v~v~gL~p~t~Y~Y~~~~---~~~~~~s~~g~~rT~ 98 (453)
T PF09423_consen 65 KVDVTGLQPGTRYYYRFVV---DGGGQTSPVGRFRTA 98 (453)
T ss_dssp EEEE-S--TT-EEEEEEEE-----TTEE---EEEE--
T ss_pred ecccCCCCCCceEEEEEEE---ecCCCCCCceEEEcC
Confidence 5779999999999999998 333566777899998
No 21
>KOG4222|consensus
Probab=90.27 E-value=0.17 Score=47.47 Aligned_cols=64 Identities=14% Similarity=0.092 Sum_probs=47.3
Q ss_pred EEEEEEEeeCCC-CCCeEEEE-ecCCceEEEccCCCCCEEEEEEEEEcCCCCCCCCCcee-eeeCCC
Q psy7019 11 SFKASHSLSDRP-DDRKHVVY-QGTNFSCKVNKLQELTTYRFYITATNDAGQGPYSEAYP-FTTCTA 74 (128)
Q Consensus 11 ~i~Y~le~~~~~-~~~~~~v~-~g~~t~~~V~~L~p~t~Y~FRVrA~N~~G~G~~S~~~~-~~T~~~ 74 (128)
...|.+|++-.. .+.|..+- .-+.+.+.|.+|+|++.|.|-|||.|..|-+.+|.... +.|.++
T Consensus 558 ~s~yiieafs~~~~etw~~ta~~v~~t~~~I~gL~P~~sylf~vRa~n~~Gis~Ps~~S~~vrta~a 624 (1281)
T KOG4222|consen 558 ASGYIIEAFSPDLGETWQTTAGRVKTTTYAIRGLKPNLSYLFLVRAENEQGISDPSTSSDPVRTAPA 624 (1281)
T ss_pred cchhHHHHhhhhhcccccccccccccceeeecCcCccceeeeeeeccccccccCCcccCCccccCCC
Confidence 457888876433 35576544 23578899999999999999999999999988876553 344433
No 22
>PF10179 DUF2369: Uncharacterised conserved protein (DUF2369); InterPro: IPR019326 This is a proline-rich region of a group of proteins found from plants to fungi. The function is largely unknown, although the entry contains Fibronectin type-III domain-containing protein C4orf31, which promotes matrix assembly and cell adhesiveness.
Probab=85.90 E-value=4.9 Score=32.73 Aligned_cols=76 Identities=18% Similarity=0.343 Sum_probs=51.8
Q ss_pred ceEEEccCCCCCEEEEEEEEEcCCCCCCCCCceeee-eCC-----CCCCCCCCceEEEeeccccEEEEehhhhhhhhcce
Q psy7019 35 FSCKVNKLQELTTYRFYITATNDAGQGPYSEAYPFT-TCT-----APPAPLKGKTYFSLVLSRGVCLRAGFLWVFASGRA 108 (128)
Q Consensus 35 t~~~V~~L~p~t~Y~FRVrA~N~~G~G~~S~~~~~~-T~~-----~pp~~~~~p~v~~~~~~~~v~~~~g~~~~~~~~~~ 108 (128)
..+.++++.||..|..|+.+.|... +.. .+.+. ++. +-|.-|..+.+...+.-+ -|..+-++|+-+-++.
T Consensus 128 ~~f~l~~~~~g~~Yliri~~~~~~e-~~~--~~kV~aast~~~~~~~P~LP~d~~Ik~f~~lr-tC~SvTIAW~~s~d~~ 203 (300)
T PF10179_consen 128 RHFRLSGVKPGERYLIRIQISNSDE-GPS--TFKVQAASTNPSKQPYPQLPDDTSIKEFNKLR-TCNSVTIAWLGSPDRS 203 (300)
T ss_pred EEEEECCCCCCCeEEEEEEccCCCC-Cce--EEEEEEecCCcccCCCCCCCCCCceeEEcCCc-ccceEEEEEecCCCCC
Confidence 5688999999999999998766433 222 23333 221 223345667776554334 6888999999888774
Q ss_pred --eeeeee
Q psy7019 109 --FCVSAR 114 (128)
Q Consensus 109 --~~~~~~ 114 (128)
.|+|++
T Consensus 204 ~kYCvy~~ 211 (300)
T PF10179_consen 204 IKYCVYRR 211 (300)
T ss_pred ceEEEEEE
Confidence 899987
No 23
>KOG1378|consensus
Probab=78.88 E-value=3.9 Score=35.14 Aligned_cols=38 Identities=21% Similarity=0.332 Sum_probs=30.8
Q ss_pred ecCCceEEEccCCCCCEEEEEEEEEcCCCCCCCCCceeeeeCC
Q psy7019 31 QGTNFSCKVNKLQELTTYRFYITATNDAGQGPYSEAYPFTTCT 73 (128)
Q Consensus 31 ~g~~t~~~V~~L~p~t~Y~FRVrA~N~~G~G~~S~~~~~~T~~ 73 (128)
.|....+.+++|+|++.|.+||-.. ..+|++..|.|++
T Consensus 105 sg~ih~~~~~~L~~~t~YyY~~Gs~-----~~wS~~f~F~t~p 142 (452)
T KOG1378|consen 105 SGYIHDAVMKNLEPNTRYYYQVGSD-----LKWSEIFSFKTPP 142 (452)
T ss_pred eeeEeeeeecCCCCCceEEEEeCCC-----CCcccceEeECCC
Confidence 3445678899999999999999642 2289999999987
No 24
>KOG4806|consensus
Probab=61.95 E-value=12 Score=31.47 Aligned_cols=29 Identities=24% Similarity=0.260 Sum_probs=25.3
Q ss_pred EEecCCceEEEccCCCCCEEEEEEEEEcC
Q psy7019 29 VYQGTNFSCKVNKLQELTTYRFYITATND 57 (128)
Q Consensus 29 v~~g~~t~~~V~~L~p~t~Y~FRVrA~N~ 57 (128)
+-.|+.+.+.|.+|+|+++|.|.|..+|.
T Consensus 164 iCig~~n~~~v~~L~pdt~Y~~dvFvv~~ 192 (454)
T KOG4806|consen 164 ICIGNKNIFTVSDLKPDTQYYFDVFVVNI 192 (454)
T ss_pred EEcCCccEEEhhhcCCCceEEEEEEEEec
Confidence 44567779999999999999999999985
No 25
>cd05762 Ig8_MLCK Eighth immunoglobulin (Ig)-like domain of human myosin light-chain kinase (MLCK). Ig8_MLCK: the eighth immunoglobulin (Ig)-like domain of human myosin light-chain kinase (MLCK). MLCK is a key regulator of different forms of cell motility involving actin and myosin II. Agonist stimulation of smooth muscle cells increases cytosolic Ca2+, which binds calmodulin. This Ca2+-calmodulin complex in turn binds to and activates MLCK. Activated MLCK leads to the phosphorylation of the 20 kDa myosin regulatory light chain (RLC) of myosin II and the stimulation of actin-activated myosin MgATPase activity. MLCK is widely present in vertebrate tissues; it phosphorylates the 20 kDa RLC of both smooth and nonmuscle myosin II. Phosphorylation leads to the activation of the myosin motor domain and altered structural properties of myosin II. In smooth muscle MLCK it is involved in initiating contraction. In nonmuscle cells, MLCK may participate in cell division and cell motility; it has
Probab=61.51 E-value=33 Score=22.45 Aligned_cols=44 Identities=11% Similarity=0.117 Sum_probs=31.4
Q ss_pred CCceEEEccCCCCCEEEEEEEEEcCCCCCCCCCceeeeeCCCCC
Q psy7019 33 TNFSCKVNKLQELTTYRFYITATNDAGQGPYSEAYPFTTCTAPP 76 (128)
Q Consensus 33 ~~t~~~V~~L~p~t~Y~FRVrA~N~~G~G~~S~~~~~~T~~~pp 76 (128)
......|.+....-.=.|.+.|.|..|.-..+-.+.+.-.+.||
T Consensus 54 ~~s~L~I~~~~~~D~G~Ytc~a~N~~G~~~~~~~l~V~~~P~pP 97 (98)
T cd05762 54 NSSKLTITEGQQEHCGCYTLEVENKLGSRQAQVNLTVVDKPDPP 97 (98)
T ss_pred CeeEEEECCCChhhCEEEEEEEEcCCCceeEEEEEEEecCCCCC
Confidence 34567888888777778899999999987665555555444444
No 26
>KOG4258|consensus
Probab=59.73 E-value=15 Score=34.24 Aligned_cols=43 Identities=16% Similarity=0.210 Sum_probs=34.3
Q ss_pred eEEEccCCCCCEEEEEEEEEcCCCC----CCCCCceeeeeCCCCCCC
Q psy7019 36 SCKVNKLQELTTYRFYITATNDAGQ----GPYSEAYPFTTCTAPPAP 78 (128)
Q Consensus 36 ~~~V~~L~p~t~Y~FRVrA~N~~G~----G~~S~~~~~~T~~~pp~~ 78 (128)
...+.+|+|.|+|.+-|++....-+ -..|+...++|.+..|.+
T Consensus 567 ~~~l~~LkP~TqYAvfVkT~t~t~~~~~~~A~S~I~YvqT~~~~Psp 613 (1025)
T KOG4258|consen 567 GFLLDGLKPWTQYAVFVKTLTVTEAHEAYEAKSKIGYVQTLPDIPSP 613 (1025)
T ss_pred ceehhcCCccceeEEEEeeeehhhhccccccccceEEEEecCCCCCC
Confidence 5789999999999999999844222 246889999999877765
No 27
>PF11344 DUF3146: Protein of unknown function (DUF3146); InterPro: IPR021492 This family of proteins with unknown function appear to be restricted to Cyanobacteria.
Probab=57.70 E-value=7.4 Score=25.60 Aligned_cols=15 Identities=33% Similarity=0.450 Sum_probs=13.5
Q ss_pred cCCCCCEEEEEEEEE
Q psy7019 41 KLQELTTYRFYITAT 55 (128)
Q Consensus 41 ~L~p~t~Y~FRVrA~ 55 (128)
.|+||..|.|-|||.
T Consensus 66 ~LEpGgdY~Ftirak 80 (80)
T PF11344_consen 66 QLEPGGDYSFTIRAK 80 (80)
T ss_pred eccCCCceEEEEecC
Confidence 589999999999984
No 28
>PF07353 Uroplakin_II: Uroplakin II; InterPro: IPR009952 This family contains uroplakin II, which is approximately 180 residues long and seems to be restricted to mammals. Uroplakin II is an integral membrane protein, and is one of the components of the apical plaques of mammalian urothelium formed by the asymmetric unit membrane - this is believed to play a role in strengthening the urothelial apical surface to prevent the cells from rupturing during bladder distension [].; GO: 0016044 cellular membrane organization, 0030176 integral to endoplasmic reticulum membrane
Probab=57.44 E-value=19 Score=27.02 Aligned_cols=36 Identities=25% Similarity=0.262 Sum_probs=24.7
Q ss_pred ceEEEccCCCCCEEEEEEEEEcCCCCCCCCCceeeee
Q psy7019 35 FSCKVNKLQELTTYRFYITATNDAGQGPYSEAYPFTT 71 (128)
Q Consensus 35 t~~~V~~L~p~t~Y~FRVrA~N~~G~G~~S~~~~~~T 71 (128)
..+.+.+|.||++|.|+-...++.+ .+.|.++...|
T Consensus 102 saYqVtNL~pGTkY~isY~Vtkgts-tESS~~i~msT 137 (184)
T PF07353_consen 102 SAYQVTNLQPGTKYYISYLVTKGTS-TESSNEIPMST 137 (184)
T ss_pred eeEEeeccCCCcEEEEEEEEecCcc-ceecceecccc
Confidence 3488999999999999988866522 23333444444
No 29
>COG4733 Phage-related protein, tail component [Function unknown]
Probab=55.43 E-value=29 Score=32.30 Aligned_cols=38 Identities=11% Similarity=0.049 Sum_probs=32.2
Q ss_pred CCeEEEEecCCceEEEccCCCCCEEEEEEEEEcCCCCCC
Q psy7019 24 DRKHVVYQGTNFSCKVNKLQELTTYRFYITATNDAGQGP 62 (128)
Q Consensus 24 ~~~~~v~~g~~t~~~V~~L~p~t~Y~FRVrA~N~~G~G~ 62 (128)
+.|...-++..+.+.+.+|-+ ..|..||||.|..|...
T Consensus 649 ~~~vta~rt~~~~~~~~gi~~-GqY~i~VrAiN~~g~~~ 686 (952)
T COG4733 649 GNWITAPRTSAAGFDVEGIPA-GQYAIRVRAINVFEPNS 686 (952)
T ss_pred CceEecccccccceeecCcCc-cceEEEEEEeeccCCCC
Confidence 467777777889999999998 59999999999988743
No 30
>KOG4802|consensus
Probab=51.63 E-value=28 Score=29.96 Aligned_cols=36 Identities=19% Similarity=0.219 Sum_probs=29.1
Q ss_pred EEEccCCCCCEEEEEEEEEcCCCCCCCC-C-ceeeeeC
Q psy7019 37 CKVNKLQELTTYRFYITATNDAGQGPYS-E-AYPFTTC 72 (128)
Q Consensus 37 ~~V~~L~p~t~Y~FRVrA~N~~G~G~~S-~-~~~~~T~ 72 (128)
..|++|.|+..|..+|+|...-|.+..- + .+.|.|.
T Consensus 316 ~si~~L~Pns~Y~VevqAi~y~g~~rLksek~~~F~~T 353 (516)
T KOG4802|consen 316 FSIKELLPNSSYYVEVQAISYLGSRRLKSEKWMLFNTT 353 (516)
T ss_pred hhhhhcCCCCeEEEEEEEEEeccCcccccceEEEEeec
Confidence 4499999999999999999999999874 3 2455553
No 31
>TIGR03000 plancto_dom_1 Planctomycetes uncharacterized domain TIGR03000. Domains described by this model are found, so far, only in the Planctomycetes (Pirellula sp. strain 1 and Gemmata obscuriglobus), in up to six proteins per genome, and may be duplicated within a protein. The function is unknown.
Probab=49.87 E-value=23 Score=23.12 Aligned_cols=25 Identities=24% Similarity=0.316 Sum_probs=21.2
Q ss_pred ecCCceEEEccCCCCCEEEEEEEEE
Q psy7019 31 QGTNFSCKVNKLQELTTYRFYITAT 55 (128)
Q Consensus 31 ~g~~t~~~V~~L~p~t~Y~FRVrA~ 55 (128)
.|...++.-..|.+|..|.|+|+|.
T Consensus 25 ~G~~R~F~T~~L~~G~~y~Y~v~a~ 49 (75)
T TIGR03000 25 TGTVRTFTTPPLEAGKEYEYTVTAE 49 (75)
T ss_pred CccEEEEECCCCCCCCEEEEEEEEE
Confidence 3456778889999999999999994
No 32
>PF04775 Bile_Hydr_Trans: Acyl-CoA thioester hydrolase/BAAT N-terminal region; InterPro: IPR006862 This entry presents the N-termini of acyl-CoA thioester hydrolase and bile acid-CoA:amino acid N-acetyltransferase (BAAT) []. This region is not thought to contain the active site of either enzyme. Thioesterase isoforms have been identified in peroxisomes, cytoplasm and mitochondria, where they are thought to have distinct functions in lipid metabolism []. For example, in peroxisomes, the hydrolase acts on bile-CoA esters [].; GO: 0016290 palmitoyl-CoA hydrolase activity, 0006629 lipid metabolic process; PDB: 3HLK_B 3K2I_B.
Probab=48.49 E-value=30 Score=24.20 Aligned_cols=25 Identities=20% Similarity=0.223 Sum_probs=17.6
Q ss_pred ceEEEccCCCCCEEEEEEEEEcCCC
Q psy7019 35 FSCKVNKLQELTTYRFYITATNDAG 59 (128)
Q Consensus 35 t~~~V~~L~p~t~Y~FRVrA~N~~G 59 (128)
-..+|++|.|+..|.++.+..+..|
T Consensus 5 ~~I~v~GL~p~~~vtl~a~~~~~~g 29 (126)
T PF04775_consen 5 VDIRVSGLPPGQEVTLRARLTDDNG 29 (126)
T ss_dssp -EEEEES--TT-EEEEEEEEE-TTS
T ss_pred eEEEEeCCCCCCEEEEEEEEEeCCC
Confidence 4678999999999999999997655
No 33
>COG3540 PhoD Phosphodiesterase/alkaline phosphatase D [Inorganic ion transport and metabolism]
Probab=47.55 E-value=26 Score=30.59 Aligned_cols=36 Identities=19% Similarity=0.217 Sum_probs=27.1
Q ss_pred ceEEEccCCCCCEEEEEEEEEcCCCCCCCCCceeeeeCCCC
Q psy7019 35 FSCKVNKLQELTTYRFYITATNDAGQGPYSEAYPFTTCTAP 75 (128)
Q Consensus 35 t~~~V~~L~p~t~Y~FRVrA~N~~G~G~~S~~~~~~T~~~p 75 (128)
-++.+.+|.|++.|.+|+.+-+.. |....++|.+++
T Consensus 101 v~v~~~gL~P~~~yfYRf~~~~~~-----spvGrtrTapa~ 136 (522)
T COG3540 101 VHVDLRGLSPDQDYFYRFKAGDER-----SPVGRTRTAPAP 136 (522)
T ss_pred EEEeccCCCCCceEEEEEeeCCcc-----ccccccccCCCC
Confidence 357789999999999999986653 345566776654
No 34
>PF02922 CBM_48: Carbohydrate-binding module 48 (Isoamylase N-terminal domain); InterPro: IPR004193 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Enzymes containing this domain belong to family 13 (GH13 from CAZY) of the glycosyl hydrolases. This domain is found in a range of enzymes that act on branched substrates ie. isoamylase, pullulanase and branching enzyme. Isoamylase hydrolyses 1,6-alpha-D-glucosidic branch linkages in glycogen, amylopectin and dextrin; 1,4-alpha-glucan branching enzyme functions in the formation of 1,6-glucosidic linkages of glycogen; and pullulanase is a starch-debranching enzyme.; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds, 0005975 carbohydrate metabolic process; PDB: 2BHZ_A 2BY2_A 2BY3_A 2BXY_A 2BY1_A 2BHY_A 2BHU_A 2BXZ_A 2BY0_A 2FHB_A ....
Probab=47.51 E-value=55 Score=20.38 Aligned_cols=20 Identities=20% Similarity=0.326 Sum_probs=14.6
Q ss_pred EEEc-cCCCCC-EEEEEEEEEc
Q psy7019 37 CKVN-KLQELT-TYRFYITATN 56 (128)
Q Consensus 37 ~~V~-~L~p~t-~Y~FRVrA~N 56 (128)
+.+. ++.+|. .|.|+|.-.+
T Consensus 53 ~~~~~~~~~g~~~Y~y~i~~~~ 74 (85)
T PF02922_consen 53 VTVPGDLPPGGYYYKYRIDGDD 74 (85)
T ss_dssp EEEEGCGTTTT-EEEEEEEETT
T ss_pred EEEcCCcCCCCEEEEEEEEeCC
Confidence 3444 788875 9999998654
No 35
>PF02010 REJ: REJ domain; InterPro: IPR002859 The REJ (Receptor for Egg Jelly) domain is found in PKD1 P98161 from SWISSPROT and the sperm receptor for egg jelly Q26627 from SWISSPROT. The exact function of this domain is unknown. The domain is 600 amino acids long so is probably composed of multiple structural domains. There are six completely conserved cysteine residues that may form disulphide bridges. This region contains tandem PKD-like domains. Sequence similarity between a region of the autosomal dominant polycystic kidney disease (ADPKD) protein, polycystin-1 and a sea urchin sperm glycoprotein involved in fertilization, the receptor for egg jelly (suREJ) has been known for some time. The suREJ protein binds the glycoprotein coat of the egg (egg jelly), triggering the acrosome reaction, which transforms the sperm into a fusogenic cell. The sequence similarity and expression pattern suggests that the predicted human PKDREJ protein is a mammalian equivalent of the suREJ protein and therefore may have a central role in human fertilization [].; PDB: 2E7M_A 2YRL_A.
Probab=46.25 E-value=6.7 Score=32.32 Aligned_cols=32 Identities=28% Similarity=0.420 Sum_probs=0.0
Q ss_pred ccCCCCCEEEEEEEEEcCCCCCCCCCceeeeeC
Q psy7019 40 NKLQELTTYRFYITATNDAGQGPYSEAYPFTTC 72 (128)
Q Consensus 40 ~~L~p~t~Y~FRVrA~N~~G~G~~S~~~~~~T~ 72 (128)
..|++|..|.|||.+.+..|...+ ..+.|.+.
T Consensus 267 ~~l~~g~~Y~~~l~v~~~~~~~~~-a~~~~~~n 298 (440)
T PF02010_consen 267 GVLEPGSTYTFRLTVTDSSGSSGS-ASISFTVN 298 (440)
T ss_dssp ---------------------------------
T ss_pred cccccccccccccccccccccccc-cccccccc
Confidence 369999999999999999887553 34555553
No 36
>smart00112 CA Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
Probab=44.70 E-value=71 Score=19.39 Aligned_cols=47 Identities=15% Similarity=0.117 Sum_probs=26.4
Q ss_pred EEEEEEEeeCCCCCCeEEEEecCCceEEE---ccCCCCCEEEEEEEEEcCCCC
Q psy7019 11 SFKASHSLSDRPDDRKHVVYQGTNFSCKV---NKLQELTTYRFYITATNDAGQ 60 (128)
Q Consensus 11 ~i~Y~le~~~~~~~~~~~v~~g~~t~~~V---~~L~p~t~Y~FRVrA~N~~G~ 60 (128)
.+.|.|.-.. ..+.|..-... . ...+ -+-+....|.|.|+|.+..+.
T Consensus 10 ~i~Y~i~~~~-~~~~F~i~~~t-g-~i~~~~~LD~e~~~~y~l~v~a~D~~~~ 59 (79)
T smart00112 10 KVTYSILSGN-EDGLFSIDPET-G-EITTTKPLDREEQPEYTLTVEATDGGGP 59 (79)
T ss_pred EEEEEEecCC-CCCEEEEeCCc-c-EEEeCCccCeeCCCeEEEEEEEEECCCC
Confidence 4889888543 22334322111 1 1222 223356899999999987654
No 37
>PF13754 Big_3_4: Bacterial Ig-like domain (group 3)
Probab=44.38 E-value=66 Score=18.96 Aligned_cols=33 Identities=33% Similarity=0.343 Sum_probs=23.4
Q ss_pred CCCCeEEEEecCCceEEEccCCCCCEEEEEEEEEcCCCCCCCC
Q psy7019 22 PDDRKHVVYQGTNFSCKVNKLQELTTYRFYITATNDAGQGPYS 64 (128)
Q Consensus 22 ~~~~~~~v~~g~~t~~~V~~L~p~t~Y~FRVrA~N~~G~G~~S 64 (128)
.++.|..- +..+ ..-.|.+.|+|.+.+|--..+
T Consensus 10 ~~G~Ws~t---------~~~~-~dG~y~itv~a~D~AGN~s~~ 42 (54)
T PF13754_consen 10 SDGNWSFT---------VPAL-ADGTYTITVTATDAAGNTSTS 42 (54)
T ss_pred CCCcEEEe---------CCCC-CCccEEEEEEEEeCCCCCCCc
Confidence 45677643 3344 477999999999999976544
No 38
>cd05735 Ig8_DSCAM Eight immunoglobulin (Ig) domain of Down Syndrome Cell Adhesion molecule (DSCAM). Ig8_DSCAM: the eight immunoglobulin (Ig) domain of Down Syndrome Cell Adhesion molecule (DSCAM). DSCAM is a cell adhesion molecule expressed largely in the developing nervous system. The gene encoding DSCAM is located at human chromosome 21q22, the locus associated with the mental retardation phenotype of Down Syndrome. DSCAM is predicted to be the largest member of the IG superfamily. It has been demonstrated that DSCAM can mediate cation-independent homophilic intercellular adhesion.
Probab=42.30 E-value=56 Score=20.76 Aligned_cols=41 Identities=12% Similarity=-0.013 Sum_probs=26.6
Q ss_pred ceEEEccCCCCCEEEEEEEEEcCCCCCCCCCceeeeeCCCC
Q psy7019 35 FSCKVNKLQELTTYRFYITATNDAGQGPYSEAYPFTTCTAP 75 (128)
Q Consensus 35 t~~~V~~L~p~t~Y~FRVrA~N~~G~G~~S~~~~~~T~~~p 75 (128)
....|.++.+.-.=.|...|.|..|....+-.+.++..+.|
T Consensus 47 s~L~I~~~~~~D~G~YtC~A~N~~G~~~~~~~L~V~~~P~~ 87 (88)
T cd05735 47 STLQILPTVREDSGFFSCHAINSYGEDRGIIQLTVQEPPDP 87 (88)
T ss_pred EEEEECCCCcccCEEEEEEEEcCCCcceEEEEEEEeCCCCC
Confidence 34667776655555588999999998765545555444333
No 39
>cd05748 Ig_Titin_like Immunoglobulin (Ig)-like domain of titin and similar proteins. Ig_Titin_like: immunoglobulin (Ig)-like domain found in titin-like proteins. Titin (also called connectin) is a fibrous sarcomeric protein specifically found in vertebrate striated muscle. Titin is gigantic, depending on isoform composition it ranges from 2970 to 3700 kDa, and is of a length that spans half a sarcomere. Titin largely consists of multiple repeats of Ig-like and fibronectin type 3 (FN-III)-like domains. Titin connects the ends of myosin thick filaments to Z disks and extends along the thick filament to the H zone. It appears to function similarly to an elastic band, keeping the myosin filaments centered in the sarcomere during muscle contraction or stretching. Within the sarcomere, titin is also attached to or is associated with myosin binding protein C (MyBP-C). MyBP-C appears to contribute to the generation of passive tension by titin, and similar to titin has repeated Ig-like and FN-
Probab=42.30 E-value=75 Score=18.99 Aligned_cols=28 Identities=18% Similarity=0.288 Sum_probs=23.9
Q ss_pred CCceEEEccCCCCCEEEEEEEEEcCCCC
Q psy7019 33 TNFSCKVNKLQELTTYRFYITATNDAGQ 60 (128)
Q Consensus 33 ~~t~~~V~~L~p~t~Y~FRVrA~N~~G~ 60 (128)
......|.++.+.-.-.|.+.|.|..|.
T Consensus 38 ~~~~L~I~~~~~~D~G~Y~C~a~N~~G~ 65 (74)
T cd05748 38 SSTSLVIKNAERSDSGKYTLTLKNPAGE 65 (74)
T ss_pred CeEEEEECCCCcCcCEEEEEEEECCCcc
Confidence 3567889999988888999999999875
No 40
>cd02856 Glycogen_debranching_enzyme_N_term Glycogen_debranching_enzyme N-terminal domain. Glycogen debranching enzymes have both 4-alpha-glucanotransferase and amylo-1,6-glucosidase activities. As a transferase it transfers a segment of a 1,4-alpha-D-glucan to a new 4-position in an acceptor, which may be glucose or another 1,4-alpha-D-glucan. As a glucosidase it catalyzes the endohydrolysis of 1,6-alpha-D-glucoside linkages at points of branching in chains of 1,4-linked alpha-D-glucose residues. The N-terminus of the glycogen debranching enzyme may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitobiase, and chitinase.
Probab=40.72 E-value=37 Score=22.45 Aligned_cols=19 Identities=11% Similarity=0.165 Sum_probs=16.1
Q ss_pred eEEEccCCCCCEEEEEEEE
Q psy7019 36 SCKVNKLQELTTYRFYITA 54 (128)
Q Consensus 36 ~~~V~~L~p~t~Y~FRVrA 54 (128)
+..|.++.+|..|.|||..
T Consensus 48 ~~~v~~~~~g~~Y~y~i~g 66 (103)
T cd02856 48 HGFLPGIKAGQRYGFRVHG 66 (103)
T ss_pred EEEECCCCCCCEEEEEECC
Confidence 4677888899999999975
No 41
>PF15417 DUF4624: Domain of unknown function (DUF4624)
Probab=39.80 E-value=61 Score=23.01 Aligned_cols=32 Identities=22% Similarity=0.335 Sum_probs=24.3
Q ss_pred CCCeEEEEec----CCceEEEccCCCCCEEEEEEEE
Q psy7019 23 DDRKHVVYQG----TNFSCKVNKLQELTTYRFYITA 54 (128)
Q Consensus 23 ~~~~~~v~~g----~~t~~~V~~L~p~t~Y~FRVrA 54 (128)
+-.|...+.| ...+.-+++|+.+..|..|.++
T Consensus 73 evLWsn~~~~~V~~dt~tisL~nlqk~kEY~V~ftG 108 (132)
T PF15417_consen 73 EVLWSNTWNGKVSGDTFTISLNNLQKEKEYVVCFTG 108 (132)
T ss_pred ceeeccccccccccceEEEEhhhcccCceEEEEEec
Confidence 3447666654 4556889999999999999885
No 42
>PF01299 Lamp: Lysosome-associated membrane glycoprotein (Lamp); InterPro: IPR002000 Lysosome-associated membrane glycoproteins (lamp) [] are integral membrane proteins, specific to lysosomes, and whose exact biological function is not yet clear. Structurally, the lamp proteins consist of two internally homologous lysosome-luminal domains separated by a proline-rich hinge region; at the C-terminal extremity there is a transmembrane region (TM) followed by a very short cytoplasmic tail (C). In each of the duplicated domains, there are two conserved disulphide bonds. This structure is schematically represented in the figure below. +-----+ +-----+ +-----+ +-----+ | | | | | | | | xCxxxxxCxxxxxxxxxxxxCxxxxxCxxxxxxxxxCxxxxxCxxxxxxxxxxxxCxxxxxCxxxxxxxx +--------------------------++Hinge++--------------------------++TM++C+ In mammals, there are two closely related types of lamp: lamp-1 and lamp-2, which form major components of the lysosome membrane. In chicken lamp-1 is known as LEP100. Also included in this entry is the macrophage protein CD68 (or macrosialin) [] is a heavily glycosylated integral membrane protein whose structure consists of a mucin-like domain followed by a proline-rich hinge; a single lamp-like domain; a transmembrane region and a short cytoplasmic tail. Similar to CD68, mammalian lamp-3, which is expressed in lymphoid organs, dendritic cells and in lung, contains all the C-terminal regions but lacks the N-terminal lamp-like region []. In a lamp-family protein from nematodes [] only the part C-terminal to the hinge is conserved. ; GO: 0016020 membrane
Probab=37.89 E-value=14 Score=29.56 Aligned_cols=34 Identities=12% Similarity=0.034 Sum_probs=23.1
Q ss_pred ceEEEccCCCCCEEEEEEEEEcCCCCCCCCCceee
Q psy7019 35 FSCKVNKLQELTTYRFYITATNDAGQGPYSEAYPF 69 (128)
Q Consensus 35 t~~~V~~L~p~t~Y~FRVrA~N~~G~G~~S~~~~~ 69 (128)
..+.+.+-..=.....||+|++..+.+ |+...+.
T Consensus 231 q~i~lt~~~~l~~~~lqvQaF~~~~~~-Fg~a~~C 264 (306)
T PF01299_consen 231 QSINLTDNVSLNLSDLQVQAFRVKNNT-FGTAEEC 264 (306)
T ss_pred CEEEecCCceEEEeEEEEEEEEecCCC-CCChhcC
Confidence 345555544455677899999988877 8876554
No 43
>COG4733 Phage-related protein, tail component [Function unknown]
Probab=37.41 E-value=31 Score=32.16 Aligned_cols=25 Identities=16% Similarity=0.060 Sum_probs=22.1
Q ss_pred ccCCCCCEEEEEEEEEcCCCCCCCC
Q psy7019 40 NKLQELTTYRFYITATNDAGQGPYS 64 (128)
Q Consensus 40 ~~L~p~t~Y~FRVrA~N~~G~G~~S 64 (128)
.+|.||..|.|++|++|..|-..+=
T Consensus 762 ~~i~~g~~~~F~~R~Vn~vG~~~~~ 786 (952)
T COG4733 762 VGIQAGLTFWFRNRNVDLVGNNDKW 786 (952)
T ss_pred cCcCCCceEEEEeeecccccccccc
Confidence 5899999999999999999976553
No 44
>KOG0095|consensus
Probab=36.61 E-value=23 Score=26.67 Aligned_cols=26 Identities=31% Similarity=0.286 Sum_probs=17.6
Q ss_pred CcccceeCCcEEEEEEEeeCCCCCCeE
Q psy7019 1 MIKTTEFNQASFKASHSLSDRPDDRKH 27 (128)
Q Consensus 1 ~~~~~~~~~~~i~Y~le~~~~~~~~~~ 27 (128)
||||+|.||.-+...+= .-.++++|+
T Consensus 45 miktvev~gekiklqiw-dtagqerfr 70 (213)
T KOG0095|consen 45 MIKTVEVNGEKIKLQIW-DTAGQERFR 70 (213)
T ss_pred EEEEEEECCeEEEEEEe-eccchHHHH
Confidence 89999999998866543 223444443
No 45
>PHA02579 7 baseplate wedge subunit; Provisional
Probab=35.79 E-value=53 Score=30.56 Aligned_cols=51 Identities=24% Similarity=0.182 Sum_probs=31.5
Q ss_pred CcEEEEEEEeeCCCC--CC--------eEEEEecCCceEEEccCCCCCEEEEEEEEEcCCCC
Q psy7019 9 QASFKASHSLSDRPD--DR--------KHVVYQGTNFSCKVNKLQELTTYRFYITATNDAGQ 60 (128)
Q Consensus 9 ~~~i~Y~le~~~~~~--~~--------~~~v~~g~~t~~~V~~L~p~t~Y~FRVrA~N~~G~ 60 (128)
|+.+.|.+|+.+... +. |...--..++..=-+.+.|.+.|+|||+.. ..|+
T Consensus 29 G~NFyY~Ve~a~t~~~~g~~ip~~~~~w~nlg~T~~~~wFed~~~p~t~Yk~Rv~~~-~qGF 89 (1030)
T PHA02579 29 GANFYYFVELAETRDADGELIPDDELRWINLGYTANNEWFEDKLQPNTYYKFRVAVA-AQGF 89 (1030)
T ss_pred CCceEEEEEEEeeccCCCccCCCccccceecCcccchhhhhhccCCcceEEEEEEee-ccCC
Confidence 677889999874322 22 543222234444445599999999999863 3444
No 46
>cd02852 Isoamylase_N_term Isoamylase N-terminus domain. Isoamylase (aka glycogen 6-glucanohydrolase) is one of the starch-debranching enzymes that catalyzes the hydrolysis of alpha-1,6-glucosidic linkages specific in alpha-glucans such as amylopectin or glycogen. Isoamylase contains a bound calcium ion, but this is not in the same position as the conserved calcium ion that has been reported in other alpha-amylase family enzymes. The N-terminus of isoamylase may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitobiase, and chitinase.
Probab=35.73 E-value=48 Score=22.43 Aligned_cols=19 Identities=16% Similarity=0.224 Sum_probs=16.4
Q ss_pred eEEEccCCCCCEEEEEEEE
Q psy7019 36 SCKVNKLQELTTYRFYITA 54 (128)
Q Consensus 36 ~~~V~~L~p~t~Y~FRVrA 54 (128)
++.|.++.+|..|.|||..
T Consensus 52 ~~~v~~~~~g~~Y~y~v~g 70 (119)
T cd02852 52 HVFVEGLKPGQLYGYRVDG 70 (119)
T ss_pred EEEECCCCCCCEEEEEECC
Confidence 4678889999999999984
No 47
>PF13750 Big_3_3: Bacterial Ig-like domain (group 3)
Probab=33.73 E-value=1.4e+02 Score=21.66 Aligned_cols=21 Identities=29% Similarity=0.509 Sum_probs=18.9
Q ss_pred ccCCCCCEEEEEEEEEcCCCC
Q psy7019 40 NKLQELTTYRFYITATNDAGQ 60 (128)
Q Consensus 40 ~~L~p~t~Y~FRVrA~N~~G~ 60 (128)
-.|+++..|.+.|.|.+.+|=
T Consensus 117 psle~~~~YtLtV~a~D~aGN 137 (158)
T PF13750_consen 117 PSLEADDSYTLTVSATDKAGN 137 (158)
T ss_pred CCcCCCCeEEEEEEEEecCCC
Confidence 478899999999999999985
No 48
>COG3979 Uncharacterized protein contain chitin-binding domain type 3 [General function prediction only]
Probab=33.66 E-value=89 Score=23.39 Aligned_cols=41 Identities=22% Similarity=0.289 Sum_probs=29.0
Q ss_pred CCceEEEccCCCCCEEEEEEEEEcCCCC-CCCCCceeeeeCC
Q psy7019 33 TNFSCKVNKLQELTTYRFYITATNDAGQ-GPYSEAYPFTTCT 73 (128)
Q Consensus 33 ~~t~~~V~~L~p~t~Y~FRVrA~N~~G~-G~~S~~~~~~T~~ 73 (128)
......+++|.+.+.|.|.+-+....|- +.++......+..
T Consensus 51 ~~~~~~~~g~~~et~y~~~~~a~~t~g~~s~~~~~~~~~~~~ 92 (181)
T COG3979 51 TGTVVTVEGLAPETEYWTLVEAPDTSGNWSAWSRLLTVSTSG 92 (181)
T ss_pred ccceEEecCcceeeEEEEeeeccCCCCccccceeeeeeccCc
Confidence 4677899999999999999999886332 3344444444443
No 49
>KOG4152|consensus
Probab=33.30 E-value=74 Score=28.54 Aligned_cols=59 Identities=22% Similarity=0.239 Sum_probs=39.6
Q ss_pred CCcEEEEEEEeeCC--------CCCCeEEEEecCCceEEEc---------cCCCCCEEEEEEEEEcCCCCCCCCCc
Q psy7019 8 NQASFKASHSLSDR--------PDDRKHVVYQGTNFSCKVN---------KLQELTTYRFYITATNDAGQGPYSEA 66 (128)
Q Consensus 8 ~~~~i~Y~le~~~~--------~~~~~~~v~~g~~t~~~V~---------~L~p~t~Y~FRVrA~N~~G~G~~S~~ 66 (128)
.|.+++|+.-..-. .+-.|..+|.|-.+.|.|+ +...+..-.||+.|.|..|+|+.-..
T Consensus 716 sg~Iieys~ylAi~tq~~~~~~sql~fmr~ycg~~t~c~vt~g~l~tA~~d~t~~paivfri~aknekGyGpatqv 791 (830)
T KOG4152|consen 716 SGTIIEYSPYLAIITQAGATGVSQLPFMRSYCGIITTCVVTSGKLITAVPDITTGPAIVFRIVAKNEKGYGPATQV 791 (830)
T ss_pred CcceEEeehhhHhhhhhhccCccccceeeeecccceeeEEecccceeeccccccCcceEEEEEecCCCCCCcceEE
Confidence 45677886543211 1123667777776666543 44567788999999999999987654
No 50
>cd02860 Pullulanase_N_term Pullulanase domain N-terminus. Pullulanase (AKA dextrinase; alpha-dextrin endo-1,6-alpha glucosidase) is an enzyme with action similar to that of isoamylase; it cleaves 1,6-alpha-glucosidic linkages in pullulan, amylopectin, and glycogen, and in alpha-and beta-amylase limit-dextrins of amylopectin and glycogen. The N-terminus of pullulanase may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitobiase, and chitinase.
Probab=32.62 E-value=69 Score=20.91 Aligned_cols=20 Identities=15% Similarity=0.076 Sum_probs=16.4
Q ss_pred eEEEccCCCCCEEEEEEEEE
Q psy7019 36 SCKVNKLQELTTYRFYITAT 55 (128)
Q Consensus 36 ~~~V~~L~p~t~Y~FRVrA~ 55 (128)
+..|.++.+|..|.|||...
T Consensus 50 ~~~v~~~~~g~~Y~y~i~~~ 69 (100)
T cd02860 50 SVTLDGDLEGYYYLYEVKVY 69 (100)
T ss_pred EEEeCCccCCcEEEEEEEEe
Confidence 45677888999999999864
No 51
>PF00801 PKD: PKD domain; InterPro: IPR000601 The PKD (Polycystic Kidney Disease) domain was first identified in the Polycystic Kidney Disease protein, polycystin-1 (PDK1 gene), and contains an Ig-like fold consisting of a beta-sandwich of seven strands in two sheets with a Greek key topology, although some members have additional strands []. Polycystin-1 is a large cell-surface glycoprotein involved in adhesive protein-protein and protein-carbohydrate interactions; however it is not clear if the PKD domain mediates any of these interactions. PKD domains are also found in other proteins, usually in the extracellular parts of proteins involved in interactions with other proteins. For example, domains with a PKD-type fold are found in archaeal surface layer proteins that protect the cell from extreme environments [], and in the human VPS10 domain-containing receptor SorCS2 [].; PDB: 1B4R_A 2KZW_A 2C4X_A 2C26_A 2Y72_B 3JQU_A 3JS7_B 1WGO_A 1L0Q_A.
Probab=31.63 E-value=1.2e+02 Score=18.16 Aligned_cols=45 Identities=24% Similarity=0.221 Sum_probs=26.4
Q ss_pred CCcEEEEEEEeeCCCCCCeEEEEecCCceEEEccCCCCCEEEEEEEEEcCCCC
Q psy7019 8 NQASFKASHSLSDRPDDRKHVVYQGTNFSCKVNKLQELTTYRFYITATNDAGQ 60 (128)
Q Consensus 8 ~~~~i~Y~le~~~~~~~~~~~v~~g~~t~~~V~~L~p~t~Y~FRVrA~N~~G~ 60 (128)
+|..+.|...+.+ ..... ...+..-.-..+| .|..+|.+.|..|.
T Consensus 22 ~g~~~~y~W~fgd-----~~~~~--~~~~~t~ty~~~G-~y~V~ltv~n~~g~ 66 (69)
T PF00801_consen 22 DGSPVTYSWDFGD-----NGTVS--TGSSVTHTYSSPG-TYTVTLTVTNGVGS 66 (69)
T ss_dssp TSSECEEEEE-SS-----ESEEE--CSSEEEEEESSSE-EEEEEEEEEETTSE
T ss_pred CCCCeEEEEEECC-----CCccc--cCCCEEEEcCCCe-EEEEEEEEEECCCC
Confidence 4667778777664 11111 2333444444455 89999999998773
No 52
>cd05894 Ig_C5_MyBP-C C5 immunoglobulin (Ig) domain of cardiac myosin binding protein C (MyBP-C). Ig_C5_MyBP_C : the C5 immunoglobulin (Ig) domain of cardiac myosin binding protein C (MyBP-C). MyBP_C consists of repeated domains, Ig and fibronectin type 3, and various linkers. Three isoforms of MYBP_C exist and are included in this group: cardiac(c), and fast and slow skeletal muscle (s) MyBP_C. cMYBP_C has insertions between and inside domains and an additional cardiac-specific Ig domain at the N-terminus. For cMYBP_C an interaction has been demonstrated between this C5 domain and the Ig C8 domain.
Probab=30.63 E-value=1.4e+02 Score=18.75 Aligned_cols=27 Identities=22% Similarity=0.158 Sum_probs=23.7
Q ss_pred CceEEEccCCCCCEEEEEEEEEcCCCC
Q psy7019 34 NFSCKVNKLQELTTYRFYITATNDAGQ 60 (128)
Q Consensus 34 ~t~~~V~~L~p~t~Y~FRVrA~N~~G~ 60 (128)
.....|.++.+.-.-.|.+.|.|..|.
T Consensus 51 ~~~L~I~~~~~~D~G~Y~c~a~N~~G~ 77 (86)
T cd05894 51 LSSFVIEGAEREDEGVYTITVTNPVGE 77 (86)
T ss_pred eEEEEECCCccCcCEEEEEEEEeCCCc
Confidence 456899999999999999999999885
No 53
>PF05345 He_PIG: Putative Ig domain; InterPro: IPR008009 This alignment represents the conserved core region of a ~90 residue repeat found in several haemagglutinins and other cell surface proteins. Sequence similarities to Hyalin (IPR003410 from INTERPRO) and the PKD domain (IPR000601 from INTERPRO) suggest an Ig-like fold so this family may be similar in function to the (IPR003791 from INTERPRO) and (IPR003790 from INTERPRO) protein families.
Probab=30.38 E-value=61 Score=18.91 Aligned_cols=14 Identities=50% Similarity=0.824 Sum_probs=12.1
Q ss_pred CEEEEEEEEEcCCC
Q psy7019 46 TTYRFYITATNDAG 59 (128)
Q Consensus 46 t~Y~FRVrA~N~~G 59 (128)
-.|.|.|.|.+..|
T Consensus 36 G~y~~~vtatd~~G 49 (49)
T PF05345_consen 36 GTYTFTVTATDGSG 49 (49)
T ss_pred cEEEEEEEEEcCCC
Confidence 38999999999876
No 54
>cd05851 Ig3_Contactin-1 Third Ig domain of contactin-1. Ig3_Contactin-1: Third Ig domain of the neural cell adhesion molecule contactin-1. Contactins are comprised of six Ig domains followed by four fibronectin type III (FnIII) domains anchored to the membrane by glycosylphosphatidylinositol. Contactin-1 is differentially expressed in tumor tissues and may through a RhoA mechanism, facilitate invasion and metastasis of human lung adenocarcinoma.
Probab=30.00 E-value=80 Score=19.97 Aligned_cols=31 Identities=16% Similarity=0.069 Sum_probs=25.0
Q ss_pred CceEEEccCCCCCEEEEEEEEEcCCCCCCCC
Q psy7019 34 NFSCKVNKLQELTTYRFYITATNDAGQGPYS 64 (128)
Q Consensus 34 ~t~~~V~~L~p~t~Y~FRVrA~N~~G~G~~S 64 (128)
.....|.++.+.-.-.|.+.|.|..|....+
T Consensus 52 ~~~L~I~~v~~~D~G~Y~C~A~N~~G~~~~~ 82 (88)
T cd05851 52 GAVLKIFNIQPEDEGTYECEAENIKGKDKHQ 82 (88)
T ss_pred CCEEEECcCChhhCEEEEEEEEcCCCceEEE
Confidence 3468899999888888999999999965433
No 55
>COG3422 Uncharacterized conserved protein [Function unknown]
Probab=29.97 E-value=42 Score=20.88 Aligned_cols=14 Identities=36% Similarity=0.627 Sum_probs=11.6
Q ss_pred CCCEEEEEEEEEcC
Q psy7019 44 ELTTYRFYITATND 57 (128)
Q Consensus 44 p~t~Y~FRVrA~N~ 57 (128)
.+.+|.||++|.|.
T Consensus 11 k~Ge~rfrlkA~N~ 24 (59)
T COG3422 11 KAGEYRFRLKAANG 24 (59)
T ss_pred CCCcEEEEEEccCc
Confidence 36689999999885
No 56
>cd02853 MTHase_N_term Maltooligosyl trehalose synthase (MTSase) N-terminus domain. MTSase and maltooligosyl trehalose trehalohydrolase (MTHase) work together to produce trehalose. MTSase is responsible for converting the alpha-1,4-glucosidic linkage to an alpha,alpha-1,1-glucosidic linkage at the reducing end of the maltooligosaccharide through an intramolecular transglucosylation reaction, while MTHase hydrolyzes the penultimate alpha-1,4 linkage of the reducing end, resulting in the release of trehalose. The N-terminus of MTSase may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitobiase, and chitinase.
Probab=29.72 E-value=65 Score=20.42 Aligned_cols=17 Identities=24% Similarity=0.366 Sum_probs=14.1
Q ss_pred eEEEccCCCCCEEEEEEE
Q psy7019 36 SCKVNKLQELTTYRFYIT 53 (128)
Q Consensus 36 ~~~V~~L~p~t~Y~FRVr 53 (128)
+..+.++ +|..|.|+|.
T Consensus 43 ~~~v~~~-~g~~Y~y~v~ 59 (85)
T cd02853 43 EAEVPGA-AGTRYRYRLD 59 (85)
T ss_pred EEEeCCC-CCCeEEEEEC
Confidence 3556778 9999999997
No 57
>PF13860 FlgD_ig: FlgD Ig-like domain; PDB: 3C12_A 3OSV_A.
Probab=29.56 E-value=1.5e+02 Score=18.69 Aligned_cols=16 Identities=38% Similarity=0.391 Sum_probs=10.0
Q ss_pred CCCCCEEEEEEEEEcC
Q psy7019 42 LQELTTYRFYITATND 57 (128)
Q Consensus 42 L~p~t~Y~FRVrA~N~ 57 (128)
..|.-.|.|+|.|.+.
T Consensus 65 ~~~~G~Y~~~v~a~~~ 80 (81)
T PF13860_consen 65 PVPDGTYTFRVTATDG 80 (81)
T ss_dssp B--SEEEEEEEEEEET
T ss_pred CCCCCCEEEEEEEEeC
Confidence 3455678888888764
No 58
>PF00907 T-box: T-box; InterPro: IPR001699 Transcription factors of the T-box family are required both for early cell-fate decisions, such as those necessary for formation of the basic vertebrate body plan, and for differentiation and organogenesis []. The T-box is defined as the minimal region within the T-box protein that is both necessary and sufficient for sequence-specific DNA binding, all members of the family so far examined bind to the DNA consensus sequence TCACACCT. The T-box is a relatively large DNA-binding domain, generally comprising about a third of the entire protein (17-26 kDa). These genes were uncovered on the basis of similarity to the DNA binding domain [] of Mus musculus (Mouse) Brachyury (T) gene product, which similarity is the defining feature of the family. The Brachyury gene is named for its phenotype, which was identified 70 years ago as a mutant mouse strain with a short blunted tail. The gene, and its paralogues, have become a well-studied model for the family, and hence much of what is known about the T-box family is derived from the murine Brachyury gene. Consistent with its nuclear location, Brachyury protein has a sequence-specific DNA-binding activity and can act as a transcriptional regulator []. Homozygous mutants for the gene undergo extensive developmental anomalies, thus rendering the mutation lethal []. The postulated role of Brachyury is as a transcription factor, regulating the specification and differentiation of posterior mesoderm during gastrulation in a dose-dependent manner []. T-box proteins tend to be expressed in specific organs or cell types, especially during development, and they are generally required for the development of those tissues, for example, Brachyury is expressed in posterior mesoderm and in the developing notochord, and it is required for the formation of these cells in mice []. ; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus; PDB: 1H6F_B 4A04_A 1XBR_B 2X6V_B 2X6U_A.
Probab=28.65 E-value=70 Score=23.50 Aligned_cols=27 Identities=11% Similarity=0.132 Sum_probs=20.8
Q ss_pred CCceEEEccCCCCCEEEEEEEEEcCCC
Q psy7019 33 TNFSCKVNKLQELTTYRFYITATNDAG 59 (128)
Q Consensus 33 ~~t~~~V~~L~p~t~Y~FRVrA~N~~G 59 (128)
+...+.|++|.|...|.+-++-.-...
T Consensus 31 P~l~y~vsGL~p~~~Y~i~l~~~~~d~ 57 (184)
T PF00907_consen 31 PTLEYSVSGLDPDSLYSISLHFERVDN 57 (184)
T ss_dssp S-EEEEEESS-TTSEEEEEEEEEESCS
T ss_pred cccEEEecCCCCCcceEEEEEEEEecC
Confidence 467899999999999999988776533
No 59
>PF00028 Cadherin: Cadherin domain; InterPro: IPR002126 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion.; GO: 0005509 calcium ion binding, 0007156 homophilic cell adhesion, 0016020 membrane; PDB: 2A4E_A 2A4C_B 2O72_A 2QVI_A 1NCJ_A 3Q2W_A 3Q2N_A 3LNH_B 3LNI_A 3Q2L_A ....
Probab=28.49 E-value=50 Score=20.81 Aligned_cols=48 Identities=15% Similarity=0.189 Sum_probs=26.7
Q ss_pred CcEEEEEEEeeCCCCCCeEEEEecCCceEEEc---cCCCCCEEEEEEEEEcCCC
Q psy7019 9 QASFKASHSLSDRPDDRKHVVYQGTNFSCKVN---KLQELTTYRFYITATNDAG 59 (128)
Q Consensus 9 ~~~i~Y~le~~~~~~~~~~~v~~g~~t~~~V~---~L~p~t~Y~FRVrA~N~~G 59 (128)
+..+.|.+.-.+ ..+.|..-.. .-...+. +-+....|.|.|+|.+..|
T Consensus 28 n~~i~y~i~~~~-~~~~F~I~~~--tg~i~~~~~LD~E~~~~y~l~v~a~D~~~ 78 (93)
T PF00028_consen 28 NSQITYSILGGN-PDGLFSIDPN--TGEISLKKPLDRETQSSYQLTVRATDSGG 78 (93)
T ss_dssp TSSEEEEEEETT-STTSEEEETT--TTEEEESSSSCTTTTSEEEEEEEEEETTT
T ss_pred CceEEEEEecCc-ccCceEEeee--eeccccceecCcccCCEEEEEEEEEECCC
Confidence 345667776543 2344443221 1123333 2345789999999999833
No 60
>PF12514 DUF3718: Protein of unknown function (DUF3718); InterPro: IPR022193 This entry is represented by Bacteriophage Aaphi23, Orf19. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This domain family is found in bacteria and viruses, and is approximately 70 amino acids in length. There is a single completely conserved residue C that may be functionally important.
Probab=27.13 E-value=19 Score=22.76 Aligned_cols=15 Identities=20% Similarity=0.297 Sum_probs=12.7
Q ss_pred eeeeeeecCCCcccc
Q psy7019 109 FCVSARSPCPGQTLS 123 (128)
Q Consensus 109 ~~~~~~~~~~~~~~~ 123 (128)
+-++..+.||||.|.
T Consensus 25 r~i~~~l~CNG~sl~ 39 (68)
T PF12514_consen 25 RNIYDGLVCNGESLL 39 (68)
T ss_pred hhcccCCeECCcCHH
Confidence 567889999999885
No 61
>PRK12634 flgD flagellar basal body rod modification protein; Reviewed
Probab=25.90 E-value=2.9e+02 Score=21.29 Aligned_cols=17 Identities=29% Similarity=0.335 Sum_probs=12.0
Q ss_pred CCCCEEEEEEEEEcCCC
Q psy7019 43 QELTTYRFYITATNDAG 59 (128)
Q Consensus 43 ~p~t~Y~FRVrA~N~~G 59 (128)
.|.-.|.|+|.|.+..|
T Consensus 163 ~~~G~Yt~~v~a~~~~G 179 (221)
T PRK12634 163 MAAGKYGVTATQTDTAG 179 (221)
T ss_pred CCCeeeEEEEEEEeCCC
Confidence 45567888888876655
No 62
>PF02018 CBM_4_9: Carbohydrate binding domain; InterPro: IPR003305 The 1,4-beta-glucanase CenC from Cellulomonas fimi contains two cellulose-binding domains, CBD(N1) and CBD(N2), arranged in tandem at its N terminus. These homologous CBDs are distinct in their selectivity for binding amorphous and not crystalline cellulose []. Multidimensional heteronuclear nuclear magnetic resonance (NMR) spectroscopy was used to determine the tertiary structure of the 152 amino acid N-terminal cellulose-binding domain from C. fimi 1,4-beta-glucanase CenC (CBDN1) []. The tertiary structure of CBDN1 is strikingly similar to that of the bacterial 1,3-1,4-beta-glucanases, as well as other sugar-binding proteins with jelly-roll folds.; GO: 0016798 hydrolase activity, acting on glycosyl bonds; PDB: 3OEA_B 2ZEX_B 3OEB_A 2ZEY_A 2ZEW_A 1GUI_A 2W5F_A 2WZE_A 2WYS_A 2ZEZ_B ....
Probab=25.63 E-value=73 Score=20.83 Aligned_cols=20 Identities=25% Similarity=0.270 Sum_probs=17.2
Q ss_pred ccCCCCCEEEEEEEEEcCCC
Q psy7019 40 NKLQELTTYRFYITATNDAG 59 (128)
Q Consensus 40 ~~L~p~t~Y~FRVrA~N~~G 59 (128)
..|++|..|.|.+.+....+
T Consensus 56 ~~l~~G~~Y~~s~~vk~~~~ 75 (131)
T PF02018_consen 56 ISLKPGKTYTVSFWVKADSG 75 (131)
T ss_dssp EEE-TTSEEEEEEEEEESSS
T ss_pred eEecCCCEEEEEEEEEeCCC
Confidence 68999999999999988777
No 63
>COG5584 Predicted small secreted protein [Function unknown]
Probab=25.09 E-value=2.1e+02 Score=19.72 Aligned_cols=44 Identities=25% Similarity=0.317 Sum_probs=29.1
Q ss_pred CcEEEEEEEeeCCCCCCeEEEEecCCceEEEccCCCCCEEEEEEEEEc
Q psy7019 9 QASFKASHSLSDRPDDRKHVVYQGTNFSCKVNKLQELTTYRFYITATN 56 (128)
Q Consensus 9 ~~~i~Y~le~~~~~~~~~~~v~~g~~t~~~V~~L~p~t~Y~FRVrA~N 56 (128)
|.|+.|+-|..+ ..+....+|+|.-++.+-. +..+|.|-+.|.+
T Consensus 51 GSwI~~~pe~~~-k~~~~~~VY~gGIT~i~~g---e~~qyef~aDA~T 94 (103)
T COG5584 51 GSWIVYEPEVYP-KTDKYRLVYQGGITTIKEG---ENIQYEFYADANT 94 (103)
T ss_pred eeEEEEeccccc-CCCcEEEEEecceeEecCC---CceEEEEEEecCC
Confidence 556666666553 6677888998865554333 3568888888744
No 64
>cd04972 Ig_TrkABC_d4 Fourth domain (immunoglobulin-like) of Trk receptors TrkA, TrkB and TrkC. TrkABC_d4: the fourth domain of Trk receptors TrkA, TrkB and TrkC, this is an immunoglobulin (Ig)-like domain which binds to neurotrophin. The Trk family of receptors are tyrosine kinase receptors. They are activated by dimerization, leading to autophosphorylation of intracellular tyrosine residues, and triggering the signal transduction pathway. TrkA, TrkB, and TrkC share significant sequence homology and domain organization. The first three domains are leucine-rich domains. The fourth and fifth domains are Ig-like domains playing a part in ligand binding. TrkA, Band C mediate the trophic effects of the neurotrophin Nerve growth factor (NGF) family. TrkA is recognized by NGF. TrKB is recognized by brain-derived neurotrophic factor (BDNF) and neurotrophin (NT)-4. TrkC is recognized by NT-3. NT-3 is promiscuous as in some cell systems it activates TrkA and TrkB receptors. TrkA is a receptor fo
Probab=24.82 E-value=1.2e+02 Score=18.89 Aligned_cols=27 Identities=22% Similarity=0.310 Sum_probs=23.0
Q ss_pred ceEEEccCCCCCEEEEEEEEEcCCCCC
Q psy7019 35 FSCKVNKLQELTTYRFYITATNDAGQG 61 (128)
Q Consensus 35 t~~~V~~L~p~t~Y~FRVrA~N~~G~G 61 (128)
....|.++.+.-...|++.|.|..|.-
T Consensus 56 ~~L~I~~v~~~D~g~Y~C~A~N~~G~~ 82 (90)
T cd04972 56 YNLQLSNITSETQTTVTCTAENPVGQA 82 (90)
T ss_pred EEEEEecCCcccCEEEEEEEECCCCce
Confidence 468899999888889999999999864
No 65
>COG3900 Predicted periplasmic protein [Function unknown]
Probab=24.06 E-value=2.7e+02 Score=22.20 Aligned_cols=62 Identities=11% Similarity=0.193 Sum_probs=38.7
Q ss_pred eCCcEEEEEEEeeCCCCCCeEE-EEecCCc---eEEE--ccCCCCCEEEEEEEEEcCCCCCCCCCceeeee
Q psy7019 7 FNQASFKASHSLSDRPDDRKHV-VYQGTNF---SCKV--NKLQELTTYRFYITATNDAGQGPYSEAYPFTT 71 (128)
Q Consensus 7 ~~~~~i~Y~le~~~~~~~~~~~-v~~g~~t---~~~V--~~L~p~t~Y~FRVrA~N~~G~G~~S~~~~~~T 71 (128)
.||+.+ |+|-.+ +++-.|.+ +..|... .+.| +.+.-+.+|.++.+-.+ .|.+-.|..+.|..
T Consensus 171 Vgg~ec-d~lafr-~ddvDfqIWIsqGeqpvp~k~VITsk~v~g~PqYtv~fsnwk-sgdev~s~~F~Fkp 238 (262)
T COG3900 171 VGGEEC-DQLAFR-KDDVDFQIWISQGEQPVPLKYVITSKDVPGEPQYTVVFSNWK-SGDEVPSSDFTFKP 238 (262)
T ss_pred cCceee-hhhhcc-CCCCceEEEeecCCCCcceeEEEEecccCCCCcEEEEEcccc-cCCCccccceeecC
Confidence 455555 555555 36666654 4455432 2333 45667789999999876 56677777777753
No 66
>PF05738 Cna_B: Cna protein B-type domain; InterPro: IPR008454 This entry represents a repeated B region domain found in the collagen-binding surface protein Cna in Staphylococcus aureus, as well as other related domains. The B region domain of Cna has a prealbumin-like beta-sandwich fold of seven strands in two sheets with a Greek key topology []. However, this domain does not mediate collagen binding, the IPR008456 from INTERPRO region carries out that function; instead it appears to form a stalk that presents the ligand binding domain away from the bacterial cell surface. Cna is a collagen-binding MSCRAMM (Microbial Surface Component Recognizing Adhesive Matrix Molecules), and is necessary and sufficient for S. aureus cells to adhere to cartilage.; PDB: 2X5P_A 3RKP_A 3KPT_A 1VLF_T 1TI2_F 1TI6_D 1TI4_J 1VLE_V 1VLD_X 3PF2_A ....
Probab=23.74 E-value=87 Score=18.72 Aligned_cols=25 Identities=24% Similarity=0.133 Sum_probs=18.0
Q ss_pred EEecCCceEEEccCCCCCEEEEEEEE
Q psy7019 29 VYQGTNFSCKVNKLQELTTYRFYITA 54 (128)
Q Consensus 29 v~~g~~t~~~V~~L~p~t~Y~FRVrA 54 (128)
.....+-.+.+.+|.+|. |.++-..
T Consensus 21 ~~Td~~G~~~f~~L~~G~-Y~l~E~~ 45 (70)
T PF05738_consen 21 VTTDENGKYTFKNLPPGT-YTLKETK 45 (70)
T ss_dssp EEGGTTSEEEEEEEESEE-EEEEEEE
T ss_pred EEECCCCEEEEeecCCeE-EEEEEEE
Confidence 444556678999999998 7666554
No 67
>cd05893 Ig_Palladin_C C-terminal immunoglobulin (Ig)-like domain of palladin. Ig_Palladin_C: C-terminal immunoglobulin (Ig)-like domain of palladin. Palladin belongs to the palladin-myotilin-myopalladin family. Proteins belonging to this family contain multiple Ig-like domains and function as scaffolds, modulating actin cytoskeleton. Palladin binds to alpha-actinin ezrin, vasodilator-stimulated phosphoprotein VASP, SPIN90 (DIP, mDia interacting protein), and Src. Palladin also binds F-actin directly, via its Ig3 domain. Palladin is expressed as several alternatively spliced isoforms, having various combinations of Ig-like domains, in a cell-type-specific manner. It has been suggested that palladin's different Ig-like domains may be specialized for distinct functions.
Probab=23.69 E-value=52 Score=20.40 Aligned_cols=27 Identities=15% Similarity=0.155 Sum_probs=23.6
Q ss_pred eEEEccCCCCCEEEEEEEEEcCCCCCC
Q psy7019 36 SCKVNKLQELTTYRFYITATNDAGQGP 62 (128)
Q Consensus 36 ~~~V~~L~p~t~Y~FRVrA~N~~G~G~ 62 (128)
...|.++++.-.-.+.+.|.|..|.-.
T Consensus 42 ~L~I~~v~~~D~G~Y~C~A~N~~G~~~ 68 (75)
T cd05893 42 CLLIQGATKEDAGWYTVSAKNEAGIVS 68 (75)
T ss_pred EEEECCCCHHHCEEEEEEEEcCCCEEE
Confidence 578999999988999999999998653
No 68
>KOG4806|consensus
Probab=22.80 E-value=1.1e+02 Score=25.97 Aligned_cols=25 Identities=20% Similarity=0.227 Sum_probs=20.2
Q ss_pred eEEEccCCCCCEEEEEEEEEcCCCC
Q psy7019 36 SCKVNKLQELTTYRFYITATNDAGQ 60 (128)
Q Consensus 36 ~~~V~~L~p~t~Y~FRVrA~N~~G~ 60 (128)
.-+|.+|.||+.|...|-|.-..|.
T Consensus 411 teTI~gL~PgssYlldv~a~~~~g~ 435 (454)
T KOG4806|consen 411 TETILGLMPGSSYLLDVTANLSMGK 435 (454)
T ss_pred hhhhcccccCceEEEEEEEcccCCc
Confidence 3569999999999999998655443
No 69
>KOG1948|consensus
Probab=22.30 E-value=2.9e+02 Score=26.39 Aligned_cols=22 Identities=14% Similarity=0.176 Sum_probs=19.8
Q ss_pred CceEEEccCCCCCEEEEEEEEE
Q psy7019 34 NFSCKVNKLQELTTYRFYITAT 55 (128)
Q Consensus 34 ~t~~~V~~L~p~t~Y~FRVrA~ 55 (128)
+-.+++.+|.||..|..+|.+.
T Consensus 945 nG~yRiRGL~Pdc~Y~V~vk~~ 966 (1165)
T KOG1948|consen 945 NGTYRIRGLLPDCEYQVHVKSY 966 (1165)
T ss_pred CCcEEEeccCCCceEEEEEeec
Confidence 4569999999999999999986
No 70
>cd05765 Ig_3 Subgroup of the immunoglobulin (Ig) superfamily. Ig_3: subgroup of the immunoglobulin (Ig) domain found in the Ig superfamily. The Ig superfamily is a heterogenous group of proteins, built on a common fold comprised of a sandwich of two beta sheets. Members of the Ig superfamily are components of immunoglobulin, neuroglia, cell surface glycoproteins, such as T-cell receptors, CD2, CD4, CD8, and membrane glycoproteins, such as butyrophilin and chondroitin sulfate proteoglycan core protein. A predominant feature of most Ig domains is a disulfide bridge connecting the two beta-sheets with a tryptophan residue packed against the disulfide bond.
Probab=21.94 E-value=1.5e+02 Score=17.70 Aligned_cols=28 Identities=18% Similarity=-0.027 Sum_probs=23.7
Q ss_pred CceEEEccCCCCCEEEEEEEEEcCCCCC
Q psy7019 34 NFSCKVNKLQELTTYRFYITATNDAGQG 61 (128)
Q Consensus 34 ~t~~~V~~L~p~t~Y~FRVrA~N~~G~G 61 (128)
..+..|.++++.-.-.+...|.|..|..
T Consensus 46 ~~~L~I~~~~~~D~G~Y~C~a~N~~G~~ 73 (81)
T cd05765 46 IGQLVIYNAQPQDAGLYTCTARNSGGLL 73 (81)
T ss_pred ccEEEEccCCcccCEEEEEEEecCCceE
Confidence 3568899999888889999999998853
No 71
>PLN02960 alpha-amylase
Probab=21.80 E-value=1.3e+02 Score=28.27 Aligned_cols=41 Identities=7% Similarity=0.012 Sum_probs=27.9
Q ss_pred ceeCCcEEEEEEEeeCCCCCCeEEEEecCCceEEEccCCCCCEEEEEEEE
Q psy7019 5 TEFNQASFKASHSLSDRPDDRKHVVYQGTNFSCKVNKLQELTTYRFYITA 54 (128)
Q Consensus 5 ~~~~~~~i~Y~le~~~~~~~~~~~v~~g~~t~~~V~~L~p~t~Y~FRVrA 54 (128)
++.--..+.|++++.+ ..+.|... .|..+.+|+.|+|+|.-
T Consensus 306 ~~~~~~~~~~~~~~~k-~~~gw~~~--------~ip~~~hG~~Yky~v~~ 346 (897)
T PLN02960 306 FLEKKPPLPYWEETRK-GRKAWLKK--------YIPAIPHGSKYRVYFNT 346 (897)
T ss_pred HhccCCCCcceeeeee-cCCcEEEE--------EccCCCCCCEEEEEEEe
Confidence 4444556789999885 44444322 35578889999999974
No 72
>PLN03144 Carbon catabolite repressor protein 4 homolog; Provisional
Probab=21.45 E-value=2.3e+02 Score=25.43 Aligned_cols=65 Identities=8% Similarity=-0.056 Sum_probs=42.8
Q ss_pred eeCCcEEEEEEEeeCCC---CCCeEEEEecCCceEEEccCCCCCEEEEEEEEEcCCCCCCCCCceeeeeC
Q psy7019 6 EFNQASFKASHSLSDRP---DDRKHVVYQGTNFSCKVNKLQELTTYRFYITATNDAGQGPYSEAYPFTTC 72 (128)
Q Consensus 6 ~~~~~~i~Y~le~~~~~---~~~~~~v~~g~~t~~~V~~L~p~t~Y~FRVrA~N~~G~G~~S~~~~~~T~ 72 (128)
+.||....|...+..+. ++.|..+- ....++-+.-.=|..-+|++.+.+....++..+..++.|.
T Consensus 151 ~~~~~~~~~~~~~~~~~~~~~~~w~~vg--~~~~y~p~~~d~g~~lk~~~~~~~~~~~~~~~~~~~~~t~ 218 (606)
T PLN03144 151 LTNGSVPLYPSGIEQKTQVGGETWIEVG--RSKTYTPTADDVGHVLKFECVVVDAETGLPVGHPQSILTS 218 (606)
T ss_pred cccCcccccccccccccCCCCCceEEeC--CCccccCChhhCCceEEEEEEEcccccCCCCccceeeccc
Confidence 45665555655544333 56686654 4556666666668889999999988887777666666554
No 73
>PRK06655 flgD flagellar basal body rod modification protein; Reviewed
Probab=21.26 E-value=3.9e+02 Score=20.65 Aligned_cols=13 Identities=15% Similarity=0.115 Sum_probs=9.5
Q ss_pred CCCCEEEEEEEEE
Q psy7019 43 QELTTYRFYITAT 55 (128)
Q Consensus 43 ~p~t~Y~FRVrA~ 55 (128)
.|.-.|.|+|.|.
T Consensus 167 lp~G~Yt~~V~A~ 179 (225)
T PRK06655 167 LPDGNYTIKASAS 179 (225)
T ss_pred CCCeeEEEEEEEE
Confidence 3556788888886
Done!