Query psy10995
Match_columns 265
No_of_seqs 178 out of 1311
Neff 7.4
Searched_HMMs 46136
Date Fri Aug 16 15:30:03 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy10995.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/10995hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF00093 VWC: von Willebrand f 99.4 5.9E-13 1.3E-17 91.7 4.4 56 79-135 1-57 (57)
2 PF00093 VWC: von Willebrand f 99.4 6E-13 1.3E-17 91.7 4.2 54 12-66 1-57 (57)
3 smart00214 VWC von Willebrand 99.3 6.5E-12 1.4E-16 87.1 5.3 55 12-66 1-59 (59)
4 smart00214 VWC von Willebrand 99.3 7.6E-12 1.6E-16 86.8 5.5 56 79-135 1-59 (59)
5 smart00215 VWC_out von Willebr 98.2 1.1E-06 2.4E-11 62.3 3.1 42 12-53 1-42 (67)
6 smart00215 VWC_out von Willebr 97.7 2.6E-05 5.6E-10 55.3 3.0 42 79-122 1-42 (67)
7 PF12714 TILa: TILa domain 97.3 0.00045 9.7E-09 47.2 4.4 40 79-118 2-43 (56)
8 PF12714 TILa: TILa domain 97.2 0.00055 1.2E-08 46.8 4.4 38 11-48 1-42 (56)
9 PF05375 Pacifastin_I: Pacifas 95.6 0.016 3.4E-07 36.6 3.0 29 20-48 3-32 (39)
10 CHL00038 psbL photosystem II p 95.3 0.04 8.7E-07 33.8 4.1 32 231-262 5-36 (38)
11 PRK00753 psbL photosystem II r 94.9 0.05 1.1E-06 33.6 3.7 32 231-262 6-37 (39)
12 PF02419 PsbL: PsbL protein; 94.8 0.034 7.3E-07 34.1 2.7 25 238-262 11-35 (37)
13 PF05825 PSP94: Beta-microsemi 94.6 0.043 9.4E-07 41.5 3.6 39 75-113 14-52 (94)
14 PF05375 Pacifastin_I: Pacifas 93.8 0.063 1.4E-06 33.8 2.6 28 89-117 4-32 (39)
15 PF05825 PSP94: Beta-microsemi 89.6 0.35 7.7E-06 36.5 2.9 35 10-44 17-52 (94)
16 KOG1544|consensus 89.3 0.33 7.2E-06 44.5 3.0 57 58-116 87-147 (470)
17 cd00061 FN1 Fibronectin type 1 84.0 1.4 3E-05 28.3 3.0 30 83-112 8-42 (43)
18 cd00061 FN1 Fibronectin type 1 83.5 1.6 3.4E-05 28.1 3.0 28 16-43 9-42 (43)
19 KOG1544|consensus 81.2 1.1 2.3E-05 41.3 2.2 37 11-47 111-147 (470)
20 PF14828 Amnionless: Amnionles 80.9 0.85 1.8E-05 43.9 1.6 34 195-228 192-234 (437)
21 PF00039 fn1: Fibronectin type 78.2 2.2 4.8E-05 26.8 2.3 28 83-110 6-39 (39)
22 PF15430 SVWC: Single domain v 76.7 3.5 7.5E-05 28.2 3.3 35 175-210 1-40 (65)
23 TIGR03063 srtB_target sortase 76.4 2.9 6.2E-05 24.6 2.3 13 252-264 17-29 (29)
24 PF14828 Amnionless: Amnionles 75.8 1.3 2.8E-05 42.6 1.2 39 30-69 192-235 (437)
25 COG4537 ComGC Competence prote 74.8 2.7 5.8E-05 32.1 2.4 18 244-261 16-33 (107)
26 smart00058 FN1 Fibronectin typ 70.4 6.6 0.00014 25.5 3.2 31 83-113 6-43 (45)
27 PF15430 SVWC: Single domain v 70.3 7.2 0.00016 26.5 3.6 35 12-48 1-39 (65)
28 smart00058 FN1 Fibronectin typ 69.7 7.4 0.00016 25.2 3.3 29 16-44 7-43 (45)
29 PF12191 stn_TNFRSF12A: Tumour 69.7 2.4 5.3E-05 33.6 1.2 69 183-265 36-106 (129)
30 PRK11875 psbT photosystem II r 62.8 9.4 0.0002 22.6 2.4 20 244-263 5-24 (31)
31 PF06459 RR_TM4-6: Ryanodine R 61.2 7.2 0.00016 35.2 2.7 19 245-263 175-193 (274)
32 CHL00031 psbT photosystem II p 60.9 8.9 0.00019 23.1 2.1 21 243-263 4-24 (33)
33 PF15106 TMEM156: TMEM156 prot 60.2 8.7 0.00019 33.2 2.8 25 237-263 173-197 (226)
34 PF08693 SKG6: Transmembrane a 57.9 6.2 0.00013 25.0 1.2 18 248-265 21-40 (40)
35 PF09443 CFC: Cripto_Frl-1_Cry 57.8 3.4 7.5E-05 25.5 0.0 25 88-113 5-29 (36)
36 TIGR03024 arch_pef_cterm PEF-C 55.6 15 0.00033 20.9 2.5 16 249-264 9-24 (26)
37 PF15102 TMEM154: TMEM154 prot 55.4 6.8 0.00015 32.0 1.4 19 246-264 66-84 (146)
38 PF04530 Viral_Beta_CD: Viral 51.9 36 0.00077 26.9 4.8 10 219-228 5-14 (122)
39 PF06692 MNSV_P7B: Melon necro 51.7 17 0.00036 24.6 2.5 18 245-262 14-31 (61)
40 TIGR03068 srtB_sig_NPQTN sorta 48.8 17 0.00037 21.8 2.0 15 245-262 11-25 (33)
41 PF01405 PsbT: Photosystem II 48.0 26 0.00056 20.5 2.6 20 244-263 5-24 (29)
42 PF15168 TRIQK: Triple QxxK/R 45.5 23 0.00049 25.6 2.6 11 252-262 57-67 (79)
43 COG4839 FtsL Protein required 44.6 23 0.0005 27.8 2.8 22 241-262 37-58 (120)
44 PF09889 DUF2116: Uncharacteri 43.7 35 0.00076 23.4 3.3 15 240-254 35-49 (59)
45 PHA02831 EEV host range protei 43.5 89 0.0019 28.2 6.7 13 251-263 243-255 (268)
46 KOG3208|consensus 42.2 25 0.00054 30.7 2.9 22 240-262 208-229 (231)
47 PHA02819 hypothetical protein; 41.6 44 0.00096 23.8 3.5 21 240-260 44-64 (71)
48 PF01034 Syndecan: Syndecan do 39.2 10 0.00022 26.5 0.1 15 250-264 23-37 (64)
49 PF09889 DUF2116: Uncharacteri 38.9 30 0.00064 23.8 2.3 21 237-257 38-58 (59)
50 PHA03240 envelope glycoprotein 38.4 27 0.00058 30.5 2.4 26 237-262 207-232 (258)
51 PHA02650 hypothetical protein; 36.2 66 0.0014 23.4 3.8 19 242-260 49-67 (81)
52 PHA03054 IMV membrane protein; 36.0 71 0.0015 22.8 3.8 20 241-260 47-66 (72)
53 PF06084 Cytomega_TRL10: Cytom 35.9 20 0.00044 28.0 1.2 14 251-264 70-83 (150)
54 PF03908 Sec20: Sec20; InterP 34.8 41 0.00088 24.9 2.7 17 247-263 75-91 (92)
55 PF15220 HILPDA: Hypoxia-induc 34.5 50 0.0011 22.3 2.8 18 243-260 6-23 (63)
56 PF11027 DUF2615: Protein of u 34.2 40 0.00088 25.9 2.6 16 247-262 57-72 (103)
57 COG3726 AhpA Uncharacterized m 33.6 44 0.00095 28.7 3.0 22 240-261 161-183 (214)
58 PHA02844 putative transmembran 33.6 68 0.0015 23.1 3.5 16 244-259 50-65 (75)
59 PF10717 ODV-E18: Occlusion-de 32.3 52 0.0011 24.2 2.8 15 248-262 31-45 (85)
60 PF12911 OppC_N: N-terminal TM 32.0 54 0.0012 21.5 2.7 15 246-260 19-33 (56)
61 PTZ00370 STEVOR; Provisional 31.9 44 0.00095 30.4 2.9 20 246-265 264-283 (296)
62 TIGR01478 STEVOR variant surfa 31.7 44 0.00096 30.3 2.9 20 246-265 268-287 (295)
63 PF11346 DUF3149: Protein of u 30.9 65 0.0014 20.6 2.7 16 246-261 14-29 (42)
64 PF06679 DUF1180: Protein of u 30.5 67 0.0015 26.7 3.6 25 240-264 94-118 (163)
65 PF04272 Phospholamban: Phosph 30.1 98 0.0021 20.1 3.4 26 236-261 20-46 (52)
66 PF00746 Gram_pos_anchor: Gram 29.9 17 0.00038 22.2 0.0 21 244-264 19-39 (39)
67 PF04021 Class_IIIsignal: Clas 29.6 73 0.0016 18.4 2.6 6 252-257 16-21 (28)
68 TIGR01294 P_lamban phospholamb 29.5 99 0.0021 20.1 3.4 27 235-261 19-46 (52)
69 PHA03093 EEV glycoprotein; Pro 29.4 60 0.0013 27.5 3.1 22 239-260 38-59 (185)
70 PF06024 DUF912: Nucleopolyhed 28.5 49 0.0011 25.1 2.3 6 256-261 77-82 (101)
71 PF15188 CCDC-167: Coiled-coil 28.5 61 0.0013 24.0 2.7 17 245-261 68-84 (85)
72 PHA02673 ORF109 EEV glycoprote 28.4 65 0.0014 26.7 3.1 21 240-260 36-56 (161)
73 PF15201 Rod_cone_degen: Progr 28.2 46 0.00099 21.8 1.7 15 250-264 4-18 (54)
74 PF06682 DUF1183: Protein of u 28.0 45 0.00098 30.8 2.3 16 247-262 159-174 (318)
75 PF13150 DUF3989: Protein of u 27.7 91 0.002 23.0 3.5 24 237-260 23-46 (85)
76 KOG3462|consensus 27.5 1E+02 0.0022 23.3 3.6 30 233-262 24-53 (105)
77 PHA01815 hypothetical protein 27.4 1E+02 0.0023 20.0 3.2 14 249-262 38-51 (55)
78 PF09604 Potass_KdpF: F subuni 27.1 78 0.0017 17.9 2.3 15 248-262 4-18 (25)
79 PF05781 MRVI1: MRVI1 protein; 27.0 55 0.0012 32.3 2.8 21 242-262 478-498 (538)
80 PHA02955 hypothetical protein; 26.8 57 0.0012 28.3 2.6 21 244-264 181-201 (213)
81 PF02495 7kD_coat: 7kD viral c 25.3 61 0.0013 21.9 2.1 16 246-261 2-17 (59)
82 PF05545 FixQ: Cbb3-type cytoc 24.8 1.2E+02 0.0026 19.5 3.4 19 243-261 12-30 (49)
83 COG0690 SecE Preprotein transl 24.8 1.1E+02 0.0024 21.8 3.4 21 237-257 39-59 (73)
84 KOG3385|consensus 24.1 72 0.0016 25.0 2.5 15 247-261 100-114 (118)
85 PF02553 CbiN: Cobalt transpor 23.9 71 0.0015 23.0 2.2 16 246-261 6-21 (74)
86 PF03742 PetN: PetN ; InterPr 23.4 1.3E+02 0.0027 17.6 2.8 18 246-263 9-26 (29)
87 PHA03231 glycoprotein BALF4; P 23.2 60 0.0013 33.9 2.4 17 246-262 709-725 (829)
88 PHA03255 BDLF3; Provisional 23.1 63 0.0014 27.2 2.1 15 245-259 194-208 (234)
89 PF11712 Vma12: Endoplasmic re 23.1 97 0.0021 24.8 3.2 25 235-259 70-94 (142)
90 PF05084 GRA6: Granule antigen 22.6 1E+02 0.0023 25.7 3.3 27 237-264 146-173 (215)
91 TIGR02115 potass_kdpF K+-trans 22.6 1E+02 0.0022 17.6 2.3 7 256-262 11-17 (26)
92 PF07204 Orthoreo_P10: Orthore 22.4 60 0.0013 24.5 1.7 12 254-265 55-66 (98)
93 PF12575 DUF3753: Protein of u 22.0 1.5E+02 0.0032 21.3 3.5 12 247-258 53-64 (72)
94 PHA02692 hypothetical protein; 21.8 1.4E+02 0.003 21.3 3.3 10 250-259 54-63 (70)
95 PF03935 SKN1: Beta-glucan syn 21.7 78 0.0017 31.1 2.8 27 234-261 36-62 (504)
96 PTZ00227 variable surface prot 21.3 65 0.0014 30.9 2.1 22 241-262 324-346 (418)
97 PF11857 DUF3377: Domain of un 21.3 73 0.0016 22.9 1.8 21 240-260 34-54 (74)
98 PF09788 Tmemb_55A: Transmembr 20.7 1.2E+02 0.0027 27.0 3.6 12 222-233 179-190 (256)
99 PRK03625 tatE twin arginine tr 20.1 75 0.0016 22.4 1.7 10 251-260 11-20 (67)
100 PF05795 Plasmodium_Vir: Plasm 20.1 77 0.0017 28.6 2.3 19 244-262 285-303 (354)
101 PHA02975 hypothetical protein; 20.0 1.3E+02 0.0027 21.4 2.8 16 244-259 46-61 (69)
No 1
>PF00093 VWC: von Willebrand factor type C domain; InterPro: IPR001007 The vWF domain is found in various plasma proteins: complement factors B, C2, CR3 and CR4; the integrins (I-domains); collagen types VI, VII, XII and XIV; and other extracellular proteins [, , ]. Although the majority of VWA-containing proteins are extracellular, the most ancient ones present in all eukaryotes are all intracellular proteins involved in functions such as transcription, DNA repair, ribosomal and membrane transport and the proteasome. A common feature appears to be involvement in multiprotein complexes. Proteins that incorporate vWF domains participate in numerous biological events (e.g. cell adhesion, migration, homing, pattern formation, and signal transduction), involving interaction with a large array of ligands []. A number of human diseases arise from mutations in VWA domains. Secondary structure prediction from 75 aligned vWF sequences has revealed a largely alternating sequence of alpha-helices and beta-strands []. The domain is named after the von Willebrand factor (VWF) type C repeat which is found in multidomain protein/multifunctional proteins involved in maintaining homeostasis [, ]. For the von Willebrand factor the duplicated VWFC domain is thought to participate in oligomerization, but not in the initial dimerization step []. The presence of this region in a number of other complex-forming proteins points to the possible involvment of the VWFC domain in complex formation.; GO: 0005515 protein binding; PDB: 1U5M_A.
Probab=99.37 E-value=5.9e-13 Score=91.72 Aligned_cols=56 Identities=46% Similarity=1.234 Sum_probs=34.3
Q ss_pred ccccCCeEEcCCcEEccCCCccceecCCeEEEeecCCCCCCCCCC-CCCCCCCCcccc
Q psy10995 79 CMDFHGELKLEGQHWTLNPCTDCTCREGKVLCYSQQCPAAACSNP-RPPEPGTCCPRC 135 (265)
Q Consensus 79 C~~~~g~~y~~Ge~w~~~~C~~C~C~~g~v~C~~~~Cp~~~C~~~-~~~~~g~CCp~C 135 (265)
|. ++|++|.+|++|+++.|+.|+|.+|+|.|.++.||.+.|+++ ....+|+|||+|
T Consensus 1 C~-~~g~~y~~g~~w~~~~C~~C~C~~G~v~C~~~~Cp~~~C~~~~~~~~~g~CCp~C 57 (57)
T PF00093_consen 1 CS-FNGRVYQNGESWHPDPCTTCTCQDGEVQCSRIQCPPLDCPNPELVKPPGECCPVC 57 (57)
T ss_dssp EE-SSS-EE-SS-EE-S-SSEEEEEETTEEEEEE---S-SS-S-------SS-S--S-
T ss_pred Ce-eCCEEeCCCCEECCCCCcEeEecCCEEEEcCCCCcCCCCCCCceecCCCCcCCCC
Confidence 55 899999999999999999999999999999999999999998 223899999986
No 2
>PF00093 VWC: von Willebrand factor type C domain; InterPro: IPR001007 The vWF domain is found in various plasma proteins: complement factors B, C2, CR3 and CR4; the integrins (I-domains); collagen types VI, VII, XII and XIV; and other extracellular proteins [, , ]. Although the majority of VWA-containing proteins are extracellular, the most ancient ones present in all eukaryotes are all intracellular proteins involved in functions such as transcription, DNA repair, ribosomal and membrane transport and the proteasome. A common feature appears to be involvement in multiprotein complexes. Proteins that incorporate vWF domains participate in numerous biological events (e.g. cell adhesion, migration, homing, pattern formation, and signal transduction), involving interaction with a large array of ligands []. A number of human diseases arise from mutations in VWA domains. Secondary structure prediction from 75 aligned vWF sequences has revealed a largely alternating sequence of alpha-helices and beta-strands []. The domain is named after the von Willebrand factor (VWF) type C repeat which is found in multidomain protein/multifunctional proteins involved in maintaining homeostasis [, ]. For the von Willebrand factor the duplicated VWFC domain is thought to participate in oligomerization, but not in the initial dimerization step []. The presence of this region in a number of other complex-forming proteins points to the possible involvment of the VWFC domain in complex formation.; GO: 0005515 protein binding; PDB: 1U5M_A.
Probab=99.36 E-value=6e-13 Score=91.72 Aligned_cols=54 Identities=37% Similarity=1.003 Sum_probs=33.3
Q ss_pred ecCCCeEeCCCCeee-cCCcceEECCCceEEeeccCCCCCCCCC--CCCCCCCCcCCC
Q psy10995 12 QDEDGVWRDDSEIWR-DGCRECHCIDGREMCTLIACPTLSCQHP--IPANSTQCCPYC 66 (265)
Q Consensus 12 c~~~G~~y~~Ge~w~-~~C~~C~C~~G~v~C~~~~Cp~~~C~~~--~~~~~g~CCp~C 66 (265)
|.++|+.|.+|++|+ ++|+.|+|++|.|.|.++.||+++|+++ +. .+|+|||+|
T Consensus 1 C~~~g~~y~~g~~w~~~~C~~C~C~~G~v~C~~~~Cp~~~C~~~~~~~-~~g~CCp~C 57 (57)
T PF00093_consen 1 CSFNGRVYQNGESWHPDPCTTCTCQDGEVQCSRIQCPPLDCPNPELVK-PPGECCPVC 57 (57)
T ss_dssp EESSS-EE-SS-EE-S-SSEEEEEETTEEEEEE---S-SS-S--------SS-S--S-
T ss_pred CeeCCEEeCCCCEECCCCCcEeEecCCEEEEcCCCCcCCCCCCCceec-CCCCcCCCC
Confidence 568999999999999 6699999999999999999999999998 55 899999986
No 3
>smart00214 VWC von Willebrand factor (vWF) type C domain.
Probab=99.27 E-value=6.5e-12 Score=87.13 Aligned_cols=55 Identities=35% Similarity=0.984 Sum_probs=49.3
Q ss_pred ecCCCeEeCCCCeee-cCCcceEECCCc-eEEeeccCCCC-CCCCCCC-CCCCCCcCCC
Q psy10995 12 QDEDGVWRDDSEIWR-DGCRECHCIDGR-EMCTLIACPTL-SCQHPIP-ANSTQCCPYC 66 (265)
Q Consensus 12 c~~~G~~y~~Ge~w~-~~C~~C~C~~G~-v~C~~~~Cp~~-~C~~~~~-~~~g~CCp~C 66 (265)
|.++|+.|++|++|+ ++|+.|+|.+|. |.|.++.||++ +|+++.. ..+|+|||+|
T Consensus 1 C~~~g~~y~~G~~W~~~~C~~C~C~~g~~v~C~~~~Cp~~~~C~~~~~~~~~g~CCp~C 59 (59)
T smart00214 1 CVHNGEVYNDGETWKPDPCQICTCLDGETVLCDPVECPPPPDCPNPERVKPPGECCPRC 59 (59)
T ss_pred CccCCEEeCCCCEECCCCCeECCcCCCCEEEeeeecCCCCCCCCCCcccCCCCCcCCCC
Confidence 568999999999999 569999999999 99999999986 8998872 3899999976
No 4
>smart00214 VWC von Willebrand factor (vWF) type C domain.
Probab=99.27 E-value=7.6e-12 Score=86.78 Aligned_cols=56 Identities=46% Similarity=1.271 Sum_probs=50.8
Q ss_pred ccccCCeEEcCCcEEccCCCccceecCCe-EEEeecCCCCC-CCCCCCC-CCCCCCcccc
Q psy10995 79 CMDFHGELKLEGQHWTLNPCTDCTCREGK-VLCYSQQCPAA-ACSNPRP-PEPGTCCPRC 135 (265)
Q Consensus 79 C~~~~g~~y~~Ge~w~~~~C~~C~C~~g~-v~C~~~~Cp~~-~C~~~~~-~~~g~CCp~C 135 (265)
|. ++|++|.+|++|++++|++|+|.+|. |.|.++.||.+ .|+++.. ..+|+|||+|
T Consensus 1 C~-~~g~~y~~G~~W~~~~C~~C~C~~g~~v~C~~~~Cp~~~~C~~~~~~~~~g~CCp~C 59 (59)
T smart00214 1 CV-HNGEVYNDGETWKPDPCQICTCLDGETVLCDPVECPPPPDCPNPERVKPPGECCPRC 59 (59)
T ss_pred Cc-cCCEEeCCCCEECCCCCeECCcCCCCEEEeeeecCCCCCCCCCCcccCCCCCcCCCC
Confidence 55 89999999999999999999999999 99999999986 8998872 3899999976
No 5
>smart00215 VWC_out von Willebrand factor (vWF) type C domain.
Probab=98.21 E-value=1.1e-06 Score=62.28 Aligned_cols=42 Identities=29% Similarity=0.770 Sum_probs=38.4
Q ss_pred ecCCCeEeCCCCeeecCCcceEECCCceEEeeccCCCCCCCC
Q psy10995 12 QDEDGVWRDDSEIWRDGCRECHCIDGREMCTLIACPTLSCQH 53 (265)
Q Consensus 12 c~~~G~~y~~Ge~w~~~C~~C~C~~G~v~C~~~~Cp~~~C~~ 53 (265)
|.++|+.|++|+.|.++|.+|+|.+|.+.|+++.|++..|..
T Consensus 1 C~~~G~~y~~g~~w~d~Cn~CtC~~G~v~Ct~~~C~~~~c~l 42 (67)
T smart00215 1 CYNNGSYYPPGAKWDDDCNRCTCLNGRVSCTKVWCGPKPCLL 42 (67)
T ss_pred CeECCEEcCCCCccccCCCCCEecCCCEEecCCcCCCchhhc
Confidence 568999999999999999999999999999999999877653
No 6
>smart00215 VWC_out von Willebrand factor (vWF) type C domain.
Probab=97.73 E-value=2.6e-05 Score=55.33 Aligned_cols=42 Identities=31% Similarity=0.813 Sum_probs=37.1
Q ss_pred ccccCCeEEcCCcEEccCCCccceecCCeEEEeecCCCCCCCCC
Q psy10995 79 CMDFHGELKLEGQHWTLNPCTDCTCREGKVLCYSQQCPAAACSN 122 (265)
Q Consensus 79 C~~~~g~~y~~Ge~w~~~~C~~C~C~~g~v~C~~~~Cp~~~C~~ 122 (265)
|+ ++|+.|.+|++|. +.|..|+|.+|.|.|.+..|+..+|..
T Consensus 1 C~-~~G~~y~~g~~w~-d~Cn~CtC~~G~v~Ct~~~C~~~~c~l 42 (67)
T smart00215 1 CY-NNGSYYPPGAKWD-DDCNRCTCLNGRVSCTKVWCGPKPCLL 42 (67)
T ss_pred Ce-ECCEEcCCCCccc-cCCCCCEecCCCEEecCCcCCCchhhc
Confidence 44 7899999999997 899999999999999999998766653
No 7
>PF12714 TILa: TILa domain
Probab=97.29 E-value=0.00045 Score=47.22 Aligned_cols=40 Identities=38% Similarity=1.021 Sum_probs=34.2
Q ss_pred ccccCCeEEcCCcEEccCCCc-ccee-cCCeEEEeecCCCCC
Q psy10995 79 CMDFHGELKLEGQHWTLNPCT-DCTC-REGKVLCYSQQCPAA 118 (265)
Q Consensus 79 C~~~~g~~y~~Ge~w~~~~C~-~C~C-~~g~v~C~~~~Cp~~ 118 (265)
|.+++|+.|+.||+|..+.|+ .|+| .+|.|.|+...|++.
T Consensus 2 C~d~~G~yy~~Ge~~~~~~C~~~C~C~~~g~v~C~~~~C~~~ 43 (56)
T PF12714_consen 2 CTDYNGRYYPPGESWWTDDCTQRCTCQPNGQVQCQPSSCPPG 43 (56)
T ss_pred CcCcCCEEECCCCEEeCCCCCEeEEEcCCCeEEEeCCCCCCC
Confidence 556799999999999987776 7999 689999999888753
No 8
>PF12714 TILa: TILa domain
Probab=97.24 E-value=0.00055 Score=46.79 Aligned_cols=38 Identities=29% Similarity=0.764 Sum_probs=32.6
Q ss_pred eec-CCCeEeCCCCeee-cCCc-ceEE-CCCceEEeeccCCC
Q psy10995 11 CQD-EDGVWRDDSEIWR-DGCR-ECHC-IDGREMCTLIACPT 48 (265)
Q Consensus 11 cc~-~~G~~y~~Ge~w~-~~C~-~C~C-~~G~v~C~~~~Cp~ 48 (265)
+|. ++|+.|..||+|. +.|+ .|+| .+|.|+|++..|++
T Consensus 1 GC~d~~G~yy~~Ge~~~~~~C~~~C~C~~~g~v~C~~~~C~~ 42 (56)
T PF12714_consen 1 GCTDYNGRYYPPGESWWTDDCTQRCTCQPNGQVQCQPSSCPP 42 (56)
T ss_pred CCcCcCCEEECCCCEEeCCCCCEeEEEcCCCeEEEeCCCCCC
Confidence 355 9999999999998 6786 7999 67999999998875
No 9
>PF05375 Pacifastin_I: Pacifastin inhibitor (LCMII); InterPro: IPR008037 Peptide proteinase inhibitors can be found as single domain proteins or as single or multiple domains within proteins; these are referred to as either simple or compound inhibitors, respectively. In many cases they are synthesised as part of a larger precursor protein, either as a prepropeptide or as an N-terminal domain associated with an inactive peptidase or zymogen. This domain prevents access of the substrate to the active site. Removal of the N-terminal inhibitor domain either by interaction with a second peptidase or by autocatalytic cleavage activates the zymogen. Other inhibitors interact direct with proteinases using a simple noncovalent lock and key mechanism; while yet others use a conformational change-based trapping mechanism that depends on their structural and thermodynamic properties. This family of serine protease inhibitors belong to MEROPS inhibitor family I19, clan IW. They inhibit chymotrpsin, a peptidase belong to the S1 family (IPR001254 from INTERPRO) []. They were first isolated from Locusta migratoria migratoria(migratory locust). These were HI, LMCI-1 (PMP-D2) and LMCI-2 (PMP-C) [, , ]; five additional members SGPI-1 to 5 were identified in Schistocerca gregaria (desert locust) [, ], and a heterodimeric serine protease inhibitor (pacifastin) was isolated from the hemolymph of Pacifastacus leniusculus (Signal crayfish) []. Pacifastin is a 155kDa composed of two covalently linked subunits, which are separately encoded. The heavy chain of pacifastin (105 kDa) is related to transferrins, containing three transferrin lobes, two of which seem to be active for iron binding []. A number of the members of the transferrin family are also serine peptidases belong to MEROPS peptidase family S60 (IPR001156 from INTERPRO). The light chain of pacifastin (44 kDa) is the proteinase inhibitory subunit, and has nine cysteine-rich inhibitory domains that are homologous to each other. The locust inhibitors share a conserved array of six cysteine residues with the pacifastin light chain. The structure of members of this family reveal that they are comprised of a triple-stranded antiparallel beta-sheet connected by three disulphide bridges []. The biological function(s) of the locust inhibitors is (are) not fully understood. LMCI-1 and LMCI-2 were shown to inhibit the endogenous proteolytic activating cascade of prophenoloxidase []. Expression analysis shows that the genes encoding the SGPI precursors are differentially expressed in a time-, stage- and hormone-dependent manner.; GO: 0030414 peptidase inhibitor activity; PDB: 2VU8_I 1WO9_A 1GL1_K 1PMC_A 2XTT_A 2F91_B 1KJ0_A 1GL0_I 1KGM_A 3TVJ_I.
Probab=95.56 E-value=0.016 Score=36.63 Aligned_cols=29 Identities=34% Similarity=0.909 Sum_probs=24.4
Q ss_pred CCCCeeecCCcceEEC-CCceEEeeccCCC
Q psy10995 20 DDSEIWRDGCRECHCI-DGREMCTLIACPT 48 (265)
Q Consensus 20 ~~Ge~w~~~C~~C~C~-~G~v~C~~~~Cp~ 48 (265)
..|+.|...|..|+|. +|...|+++.|++
T Consensus 3 ~pG~~~k~dCN~C~C~~~G~~~CT~~~C~~ 32 (39)
T PF05375_consen 3 TPGSTFKKDCNTCTCSSDGKWACTRKACPP 32 (39)
T ss_dssp -TTTEEEETTEEEEEBTTSSEEEE-SSSS-
T ss_pred CCCCeeeCCCCCCCCCCCCcCccCcccCcC
Confidence 3689999999999999 8999999999975
No 10
>CHL00038 psbL photosystem II protein L
Probab=95.31 E-value=0.04 Score=33.83 Aligned_cols=32 Identities=31% Similarity=0.541 Sum_probs=24.7
Q ss_pred CCCccchhhhhhhhHHHHHHHHHHHHHHHHhh
Q psy10995 231 SPESATNRQIRESLYILLIVLFAVSTLFAFYA 262 (265)
Q Consensus 231 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 262 (265)
+|-.+.-+-=|-|||++|+.||.+.+||--|.
T Consensus 5 NPN~q~VELNRTSLy~GLLlifvl~vlfssyf 36 (38)
T CHL00038 5 NPNKQNVELNRTSLYWGLLLIFVLAVLFSNYF 36 (38)
T ss_pred CCCCCccchhhhhHHHHHHHHHHHHHHHHHHh
Confidence 34444444457899999999999999998774
No 11
>PRK00753 psbL photosystem II reaction center L; Provisional
Probab=94.93 E-value=0.05 Score=33.56 Aligned_cols=32 Identities=31% Similarity=0.564 Sum_probs=24.2
Q ss_pred CCCccchhhhhhhhHHHHHHHHHHHHHHHHhh
Q psy10995 231 SPESATNRQIRESLYILLIVLFAVSTLFAFYA 262 (265)
Q Consensus 231 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 262 (265)
+|-.+.-+--|-|||++|+.||.+.+||--|.
T Consensus 6 NpN~q~VELNRTSLy~GlLlifvl~vLFssYf 37 (39)
T PRK00753 6 NPNKQPVELNRTSLYLGLLLVFVLGILFSSYF 37 (39)
T ss_pred CCCCCCceechhhHHHHHHHHHHHHHHHHhhc
Confidence 33334434457899999999999999998773
No 12
>PF02419 PsbL: PsbL protein; InterPro: IPR003372 Oxygenic photosynthesis uses two multi-subunit photosystems (I and II) located in the cell membranes of cyanobacteria and in the thylakoid membranes of chloroplasts in plants and algae. Photosystem II (PSII) has a P680 reaction centre containing chlorophyll 'a' that uses light energy to carry out the oxidation (splitting) of water molecules, and to produce ATP via a proton pump. Photosystem I (PSI) has a P700 reaction centre containing chlorophyll that takes the electron and associated hydrogen donated from PSII to reduce NADP+ to NADPH. Both ATP and NADPH are subsequently used in the light-independent reactions to convert carbon dioxide to glucose using the hydrogen atom extracted from water by PSII, releasing oxygen as a by-product. PSII is a multisubunit protein-pigment complex containing polypeptides both intrinsic and extrinsic to the photosynthetic membrane [, ]. Within the core of the complex, the chlorophyll and beta-carotene pigments are mainly bound to the antenna proteins CP43 (PsbC) and CP47 (PsbB), which pass the excitation energy on to the reaction centre proteins D1 (Qb, PsbA) and D2 (Qa, PsbD) that bind all the redox-active cofactors involved in the energy conversion process. The PSII oxygen-evolving complex (OEC) oxidises water to provide protons for use by PSI, and consists of OEE1 (PsbO), OEE2 (PsbP) and OEE3 (PsbQ). The remaining subunits in PSII are of low molecular weight (less than 10 kDa), and are involved in PSII assembly, stabilisation, dimerisation, and photo-protection []. This family represents the low molecular weight transmembrane protein PsbL found in PSII. PsbL is located in a gene cluster with PsbE, PsbF and PsbJ (PsbEFJL). Both PsbL and PsbJ (IPR002682 from INTERPRO) are essential for proper assembly of the OEC. Mutations in PsbL prevent the formation of both PSII core dimers and PSII-light harvesting complex []. In addition, both PsbL and PsbJ are involved in the unidirectional flow of electrons, where PsbJ regulates the forward electron flow from D2 (Qa) to the plastoquinone pool, and PsbL prevents the reduction of PSII by back electron flow from plastoquinol protecting PSII from photo-inactivation [].; GO: 0015979 photosynthesis, 0009523 photosystem II, 0009539 photosystem II reaction center, 0016020 membrane; PDB: 3A0H_L 3A0B_l 3ARC_l 1S5L_l 2AXT_l 3BZ2_L 4FBY_L 3PRQ_L 3PRR_L 3KZI_L ....
Probab=94.77 E-value=0.034 Score=34.08 Aligned_cols=25 Identities=36% Similarity=0.595 Sum_probs=20.4
Q ss_pred hhhhhhhHHHHHHHHHHHHHHHHhh
Q psy10995 238 RQIRESLYILLIVLFAVSTLFAFYA 262 (265)
Q Consensus 238 ~~~~~~~~~~~~~~~~~~~~~~~~~ 262 (265)
+-=|-|||++|+.||.+.+||--|.
T Consensus 11 ELNRTSLY~GLllifvl~vLFssyf 35 (37)
T PF02419_consen 11 ELNRTSLYWGLLLIFVLAVLFSSYF 35 (37)
T ss_dssp E--CCHHHHHHHHHHHHHHHHHHHH
T ss_pred chhHHhHHHHHHHHHHHHHHhhhhh
Confidence 3346899999999999999998774
No 13
>PF05825 PSP94: Beta-microseminoprotein (PSP-94); InterPro: IPR008735 This family consists of the mammalian specific protein beta-microseminoprotein. Prostatic secretory protein of 94 amino acids (PSP94), also called beta-microseminoprotein, is a small, nonglycosylated protein, rich in cysteine residues. It was first isolated as a major protein from Homo sapiens seminal plasma []. The exact function of this protein is unknown.; PDB: 2IZ4_A 1XHH_A 3IX0_C 2IZ3_A.
Probab=94.61 E-value=0.043 Score=41.46 Aligned_cols=39 Identities=26% Similarity=0.755 Sum_probs=33.7
Q ss_pred cCccccccCCeEEcCCcEEccCCCccceecCCeEEEeec
Q psy10995 75 EAAVCMDFHGELKLEGQHWTLNPCTDCTCREGKVLCYSQ 113 (265)
Q Consensus 75 ~~~~C~~~~g~~y~~Ge~w~~~~C~~C~C~~g~v~C~~~ 113 (265)
....|.+.+|+.|.-|++|..+.|..|+|.+..+.|-..
T Consensus 14 ~~~~C~D~dG~~h~l~S~W~T~~C~~C~C~~~Gi~CC~~ 52 (94)
T PF05825_consen 14 SPNECKDWDGKKHPLNSTWKTDDCEECSCSEDGISCCST 52 (94)
T ss_dssp STTCEEETTTEEEETTEEEEETTSEEEEEESSEEEEEES
T ss_pred CCCccEeCCCcEEeCCCeECCCCCcEEEecCCceEeccC
Confidence 345798889999999999999999999999888888543
No 14
>PF05375 Pacifastin_I: Pacifastin inhibitor (LCMII); InterPro: IPR008037 Peptide proteinase inhibitors can be found as single domain proteins or as single or multiple domains within proteins; these are referred to as either simple or compound inhibitors, respectively. In many cases they are synthesised as part of a larger precursor protein, either as a prepropeptide or as an N-terminal domain associated with an inactive peptidase or zymogen. This domain prevents access of the substrate to the active site. Removal of the N-terminal inhibitor domain either by interaction with a second peptidase or by autocatalytic cleavage activates the zymogen. Other inhibitors interact direct with proteinases using a simple noncovalent lock and key mechanism; while yet others use a conformational change-based trapping mechanism that depends on their structural and thermodynamic properties. This family of serine protease inhibitors belong to MEROPS inhibitor family I19, clan IW. They inhibit chymotrpsin, a peptidase belong to the S1 family (IPR001254 from INTERPRO) []. They were first isolated from Locusta migratoria migratoria(migratory locust). These were HI, LMCI-1 (PMP-D2) and LMCI-2 (PMP-C) [, , ]; five additional members SGPI-1 to 5 were identified in Schistocerca gregaria (desert locust) [, ], and a heterodimeric serine protease inhibitor (pacifastin) was isolated from the hemolymph of Pacifastacus leniusculus (Signal crayfish) []. Pacifastin is a 155kDa composed of two covalently linked subunits, which are separately encoded. The heavy chain of pacifastin (105 kDa) is related to transferrins, containing three transferrin lobes, two of which seem to be active for iron binding []. A number of the members of the transferrin family are also serine peptidases belong to MEROPS peptidase family S60 (IPR001156 from INTERPRO). The light chain of pacifastin (44 kDa) is the proteinase inhibitory subunit, and has nine cysteine-rich inhibitory domains that are homologous to each other. The locust inhibitors share a conserved array of six cysteine residues with the pacifastin light chain. The structure of members of this family reveal that they are comprised of a triple-stranded antiparallel beta-sheet connected by three disulphide bridges []. The biological function(s) of the locust inhibitors is (are) not fully understood. LMCI-1 and LMCI-2 were shown to inhibit the endogenous proteolytic activating cascade of prophenoloxidase []. Expression analysis shows that the genes encoding the SGPI precursors are differentially expressed in a time-, stage- and hormone-dependent manner.; GO: 0030414 peptidase inhibitor activity; PDB: 2VU8_I 1WO9_A 1GL1_K 1PMC_A 2XTT_A 2F91_B 1KJ0_A 1GL0_I 1KGM_A 3TVJ_I.
Probab=93.81 E-value=0.063 Score=33.85 Aligned_cols=28 Identities=36% Similarity=1.022 Sum_probs=23.4
Q ss_pred CCcEEccCCCccceec-CCeEEEeecCCCC
Q psy10995 89 EGQHWTLNPCTDCTCR-EGKVLCYSQQCPA 117 (265)
Q Consensus 89 ~Ge~w~~~~C~~C~C~-~g~v~C~~~~Cp~ 117 (265)
+|++|. +.|..|+|. +|.+.|.++.|++
T Consensus 4 pG~~~k-~dCN~C~C~~~G~~~CT~~~C~~ 32 (39)
T PF05375_consen 4 PGSTFK-KDCNTCTCSSDGKWACTRKACPP 32 (39)
T ss_dssp TTTEEE-ETTEEEEEBTTSSEEEE-SSSS-
T ss_pred CCCeee-CCCCCCCCCCCCcCccCcccCcC
Confidence 588888 469999999 8999999999975
No 15
>PF05825 PSP94: Beta-microseminoprotein (PSP-94); InterPro: IPR008735 This family consists of the mammalian specific protein beta-microseminoprotein. Prostatic secretory protein of 94 amino acids (PSP94), also called beta-microseminoprotein, is a small, nonglycosylated protein, rich in cysteine residues. It was first isolated as a major protein from Homo sapiens seminal plasma []. The exact function of this protein is unknown.; PDB: 2IZ4_A 1XHH_A 3IX0_C 2IZ3_A.
Probab=89.57 E-value=0.35 Score=36.53 Aligned_cols=35 Identities=34% Similarity=0.797 Sum_probs=29.7
Q ss_pred ceecCCCeEeCCCCeee-cCCcceEECCCceEEeec
Q psy10995 10 ECQDEDGVWRDDSEIWR-DGCRECHCIDGREMCTLI 44 (265)
Q Consensus 10 ~cc~~~G~~y~~Ge~w~-~~C~~C~C~~G~v~C~~~ 44 (265)
.|...+|+.|.-|+.|. +.|..|+|.+..+.|-..
T Consensus 17 ~C~D~dG~~h~l~S~W~T~~C~~C~C~~~Gi~CC~~ 52 (94)
T PF05825_consen 17 ECKDWDGKKHPLNSTWKTDDCEECSCSEDGISCCST 52 (94)
T ss_dssp CEEETTTEEEETTEEEEETTSEEEEEESSEEEEEES
T ss_pred ccEeCCCcEEeCCCeECCCCCcEEEecCCceEeccC
Confidence 45668999999999999 899999999877877543
No 16
>KOG1544|consensus
Probab=89.30 E-value=0.33 Score=44.55 Aligned_cols=57 Identities=25% Similarity=0.659 Sum_probs=44.2
Q ss_pred CCCCCcCC----CCCCCCCCccCccccccCCeEEcCCcEEccCCCccceecCCeEEEeecCCC
Q psy10995 58 NSTQCCPY----CPDATSSGHEAAVCMDFHGELKLEGQHWTLNPCTDCTCREGKVLCYSQQCP 116 (265)
Q Consensus 58 ~~g~CCp~----C~~~~~~~~~~~~C~~~~g~~y~~Ge~w~~~~C~~C~C~~g~v~C~~~~Cp 116 (265)
...+|||. |.+......+..+|+ .+|+.|.+|.... +.|..|+|.++.+.|.-..|-
T Consensus 87 ~~sDCCPDf~~fCRg~pp~~Qp~~gc~-~gg~~y~~G~t~~-~NCn~CTC~n~qWKCdq~~CL 147 (470)
T KOG1544|consen 87 TVSDCCPDFWDFCRGVPPPFQPIQGCM-HGGRIYPVGGTYW-DNCNRCTCQNRQWKCDQEPCL 147 (470)
T ss_pred CCcccCcCHHHHhcCCCCCCCChhhcc-cCceecccCCeee-ccccceeecCCceecCCceee
Confidence 35678883 665433333446898 7999999999888 689999999999999987774
No 17
>cd00061 FN1 Fibronectin type 1 domain, approximately 40 residue long with two conserved disulfide bridges. FN1 is one of three types of internal repeats which combine to form larger domains within fibronectin. Fibronectin, a plasma protein that binds cell surfaces and various compounds including collagen, fibrin, heparin, DNA, and actin, usually exists as a dimer in plasma and as an insoluble multimer in extracellular matrices. Dimers of nearly identical subunits are linked by a disulfide bond close to their C-terminus. FN1 domains also found in coagulation factor XII, HGF activator, and tissue-type plasminogen activator. In tissue plasminogen activator, FN1 domains may form functional fibrin-binding units with EGF-like domains C-terminal to FN1.
Probab=84.03 E-value=1.4 Score=28.34 Aligned_cols=30 Identities=27% Similarity=0.626 Sum_probs=24.7
Q ss_pred CCeEEcCCcEEcc---CCCccceecC--CeEEEee
Q psy10995 83 HGELKLEGQHWTL---NPCTDCTCRE--GKVLCYS 112 (265)
Q Consensus 83 ~g~~y~~Ge~w~~---~~C~~C~C~~--g~v~C~~ 112 (265)
.++.|..|++|.. ..=..|+|.. |.+.|..
T Consensus 8 t~~~Y~ige~W~r~~~~~~~~C~C~G~~G~~~C~~ 42 (43)
T cd00061 8 TGVFYRVGETWERPSEGHVLQCTCLGNRGEARCDP 42 (43)
T ss_pred cccEEeCCCEEEeecCCeEEEEEEeCCCCceEEee
Confidence 4899999999984 3456899998 9999875
No 18
>cd00061 FN1 Fibronectin type 1 domain, approximately 40 residue long with two conserved disulfide bridges. FN1 is one of three types of internal repeats which combine to form larger domains within fibronectin. Fibronectin, a plasma protein that binds cell surfaces and various compounds including collagen, fibrin, heparin, DNA, and actin, usually exists as a dimer in plasma and as an insoluble multimer in extracellular matrices. Dimers of nearly identical subunits are linked by a disulfide bond close to their C-terminus. FN1 domains also found in coagulation factor XII, HGF activator, and tissue-type plasminogen activator. In tissue plasminogen activator, FN1 domains may form functional fibrin-binding units with EGF-like domains C-terminal to FN1.
Probab=83.50 E-value=1.6 Score=28.10 Aligned_cols=28 Identities=29% Similarity=0.803 Sum_probs=23.5
Q ss_pred CeEeCCCCeee---c-CCcceEECC--CceEEee
Q psy10995 16 GVWRDDSEIWR---D-GCRECHCID--GREMCTL 43 (265)
Q Consensus 16 G~~y~~Ge~w~---~-~C~~C~C~~--G~v~C~~ 43 (265)
|+.|+.|++|. + .=..|+|.. |.+.|.+
T Consensus 9 ~~~Y~ige~W~r~~~~~~~~C~C~G~~G~~~C~~ 42 (43)
T cd00061 9 GVFYRVGETWERPSEGHVLQCTCLGNRGEARCDP 42 (43)
T ss_pred ccEEeCCCEEEeecCCeEEEEEEeCCCCceEEee
Confidence 89999999997 3 345799998 9999875
No 19
>KOG1544|consensus
Probab=81.22 E-value=1.1 Score=41.33 Aligned_cols=37 Identities=19% Similarity=0.396 Sum_probs=33.9
Q ss_pred eecCCCeEeCCCCeeecCCcceEECCCceEEeeccCC
Q psy10995 11 CQDEDGVWRDDSEIWRDGCRECHCIDGREMCTLIACP 47 (265)
Q Consensus 11 cc~~~G~~y~~Ge~w~~~C~~C~C~~G~v~C~~~~Cp 47 (265)
+|..+|+.|.+|.+..++|..|||+++...|....|-
T Consensus 111 gc~~gg~~y~~G~t~~~NCn~CTC~n~qWKCdq~~CL 147 (470)
T KOG1544|consen 111 GCMHGGRIYPVGGTYWDNCNRCTCQNRQWKCDQEPCL 147 (470)
T ss_pred hcccCceecccCCeeeccccceeecCCceecCCceee
Confidence 5678999999999999999999999999999988884
No 20
>PF14828 Amnionless: Amnionless
Probab=80.93 E-value=0.85 Score=43.91 Aligned_cols=34 Identities=15% Similarity=0.374 Sum_probs=28.7
Q ss_pred eEEEECCCCe-------e--cCCCCcCCCcccCCCCCCCCCCC
Q psy10995 195 TNCTCLHGGQ-------K--KCTASMCERKALEKGDTPTPPSG 228 (265)
Q Consensus 195 ~~C~C~~g~v-------~--~Cp~~~C~~p~~~~g~CC~~C~~ 228 (265)
.-|.|.+..+ . +|+.+.|.+|+...|+||+.|=.
T Consensus 192 ~gC~C~n~~~l~~ICs~v~~~C~~~~C~~pl~P~GhCC~iCGa 234 (437)
T PF14828_consen 192 SGCPCGNDEVLEWICSNVLQRCPKPHCRSPLRPEGHCCPICGA 234 (437)
T ss_pred ccCccCcccchhhhhHHhhCcCCCCccCCCCCCCCCchhhcce
Confidence 5889976655 4 89999999999999999999853
No 21
>PF00039 fn1: Fibronectin type I domain; InterPro: IPR000083 Fibronectin type I repeats are one of the three repeats found in the fibronectin protein. Fibronectin is a plasma protein that binds cell surfaces and various compounds including collagen, fibrin, heparin, DNA, and actin. Type I domain (FN1) is approximately 40 residues in length. Four conserved cysteines are involved in disulphide bonds. The 3D structure of the FN1 domain has been determined [, , ]. It consists of two antiparallel beta-sheets, first a double-stranded one, that is linked by a disulphide bond to a triple-stranded beta-sheet. The second conserved disulphide bridge links the C-terminal adjacent strands of the domain. In human tissue plasminogen activator chain A the FN1 domain together with the following epidermal growth factor (EGF)-like domain are involved in fibrin-binding []. It has been suggested that these two modules form a single structural and functional unit []. The two domains keep their specific tertiary structure, but interact intimately to bury a hydrophobic core; the inter-module linker makes up the third strand of the EGF-module's major beta-sheet.; GO: 0005576 extracellular region; PDB: 2RKY_A 1FBR_A 2RL0_D 3M7P_A 3MQL_A 3ZRZ_A 2RKZ_F 2CKU_A 2CG6_A 3CAL_C ....
Probab=78.22 E-value=2.2 Score=26.77 Aligned_cols=28 Identities=29% Similarity=0.716 Sum_probs=21.0
Q ss_pred CCeEEcCCcEEcc---CCCccceecC---CeEEE
Q psy10995 83 HGELKLEGQHWTL---NPCTDCTCRE---GKVLC 110 (265)
Q Consensus 83 ~g~~y~~Ge~w~~---~~C~~C~C~~---g~v~C 110 (265)
.++.|..|++|.. ..=..|+|.. |.+.|
T Consensus 6 t~~~Y~vge~W~R~~~g~~~~CtC~G~g~G~~~C 39 (39)
T PF00039_consen 6 TGQFYQVGETWERPYQGHMMQCTCLGNGRGEWKC 39 (39)
T ss_dssp TTEEEETTEEEEEEETSSEEEEEEEETTTTEEEE
T ss_pred CccEEeCCCEEEeecCCeEEEeEECCCCccceEC
Confidence 4999999999983 3345699953 66765
No 22
>PF15430 SVWC: Single domain von Willebrand factor type C
Probab=76.71 E-value=3.5 Score=28.16 Aligned_cols=35 Identities=31% Similarity=0.779 Sum_probs=28.7
Q ss_pred ccccCCeEeeCCCEecC-CCceEEEEC--CCCe--ecCCCC
Q psy10995 175 CLLENGTLLHVGDTYFA-DPCTNCTCL--HGGQ--KKCTAS 210 (265)
Q Consensus 175 C~~~~g~~y~~Ge~w~~-~~C~~C~C~--~g~v--~~Cp~~ 210 (265)
|.| +|+.+.+|+.|.. ++|..=+|. +|.+ ..|+..
T Consensus 1 C~y-~~~~~~~g~~~~~~~pC~~~~C~~~~~~v~v~~C~~~ 40 (65)
T PF15430_consen 1 CYY-NGRTIPSGESYNPEEPCERWTCDASDGYVTVEGCPPP 40 (65)
T ss_pred CEE-CCEEcCCCcEecCCCCCceEEEECCCCEEEEEeCCCC
Confidence 556 7999999999874 799999998 5666 888874
No 23
>TIGR03063 srtB_target sortase B cell surface sorting signal. Two different classes of sorting signal, both analogous to the sortase A signal LPXTG, may be recognized by the sortase SrtB. These are given as NXZTN and NPKXZ. Proteins sorted by this class of sortase are less common than the sortase A and LPXTG system. This model describes a number of cell surface protein C-terminal regions from Gram-positive bacteria that appear to be sortase B (SrtB) sorting signals.
Probab=76.44 E-value=2.9 Score=24.57 Aligned_cols=13 Identities=31% Similarity=0.327 Sum_probs=8.1
Q ss_pred HHHHHHHHHhhhc
Q psy10995 252 FAVSTLFAFYACR 264 (265)
Q Consensus 252 ~~~~~~~~~~~~~ 264 (265)
|..|.+||+++.|
T Consensus 17 ~~~s~~~Li~k~~ 29 (29)
T TIGR03063 17 FLGSGLFLIRKRK 29 (29)
T ss_pred HHHHHHHHhhccC
Confidence 3446678887653
No 24
>PF14828 Amnionless: Amnionless
Probab=75.76 E-value=1.3 Score=42.62 Aligned_cols=39 Identities=28% Similarity=0.913 Sum_probs=32.7
Q ss_pred cceEECCCc---eEEeec--cCCCCCCCCCCCCCCCCCcCCCCCC
Q psy10995 30 RECHCIDGR---EMCTLI--ACPTLSCQHPIPANSTQCCPYCPDA 69 (265)
Q Consensus 30 ~~C~C~~G~---v~C~~~--~Cp~~~C~~~~~~~~g~CCp~C~~~ 69 (265)
+-|.|.+.. .+|..+ .||.+.|..|.. +.|+||++|-..
T Consensus 192 ~gC~C~n~~~l~~ICs~v~~~C~~~~C~~pl~-P~GhCC~iCGa~ 235 (437)
T PF14828_consen 192 SGCPCGNDEVLEWICSNVLQRCPKPHCRSPLR-PEGHCCPICGAI 235 (437)
T ss_pred ccCccCcccchhhhhHHhhCcCCCCccCCCCC-CCCCchhhcceE
Confidence 579998754 568887 999999999988 899999999753
No 25
>COG4537 ComGC Competence protein ComGC [Intracellular trafficking and secretion]
Probab=74.79 E-value=2.7 Score=32.06 Aligned_cols=18 Identities=39% Similarity=0.670 Sum_probs=15.3
Q ss_pred hHHHHHHHHHHHHHHHHh
Q psy10995 244 LYILLIVLFAVSTLFAFY 261 (265)
Q Consensus 244 ~~~~~~~~~~~~~~~~~~ 261 (265)
|-=||+|++.||+|||++
T Consensus 16 LvEMLiVLlIISiLlLl~ 33 (107)
T COG4537 16 LVEMLIVLLIISILLLLF 33 (107)
T ss_pred HHHHHHHHHHHHHHHHHH
Confidence 445899999999999986
No 26
>smart00058 FN1 Fibronectin type 1 domain. One of three types of internal repeat within the plasma protein, fibronectin. Found also in coagulation factor XII, HGF activator and tissue-type plasminogen activator. In t-PA and fibronectin, this domain type contributes to fibrin-binding.
Probab=70.40 E-value=6.6 Score=25.46 Aligned_cols=31 Identities=26% Similarity=0.529 Sum_probs=23.9
Q ss_pred CCeEEcCCcEEcc---C-CCccceecC---CeEEEeec
Q psy10995 83 HGELKLEGQHWTL---N-PCTDCTCRE---GKVLCYSQ 113 (265)
Q Consensus 83 ~g~~y~~Ge~w~~---~-~C~~C~C~~---g~v~C~~~ 113 (265)
.++.|..|++|.. . .-..|+|.. |++.|...
T Consensus 6 t~~~Y~ige~W~r~~~~g~~~~C~C~G~g~G~~~C~~~ 43 (45)
T smart00058 6 TGTTYRIGDTWERPYEGGHVLQCTCLGNGRGEWKCDPV 43 (45)
T ss_pred cccEEecCCEEEeecCCCcEEEeEEcCCCceEEEEEec
Confidence 4899999999984 2 456899985 45999864
No 27
>PF15430 SVWC: Single domain von Willebrand factor type C
Probab=70.28 E-value=7.2 Score=26.51 Aligned_cols=35 Identities=23% Similarity=0.430 Sum_probs=26.5
Q ss_pred ecCCCeEeCCCCeee--cCCcceEEC--CCceEEeeccCCC
Q psy10995 12 QDEDGVWRDDSEIWR--DGCRECHCI--DGREMCTLIACPT 48 (265)
Q Consensus 12 c~~~G~~y~~Ge~w~--~~C~~C~C~--~G~v~C~~~~Cp~ 48 (265)
|.++|+.+.+|+.|. ++|..-+|. +|.++ ...|+.
T Consensus 1 C~y~~~~~~~g~~~~~~~pC~~~~C~~~~~~v~--v~~C~~ 39 (65)
T PF15430_consen 1 CYYNGRTIPSGESYNPEEPCERWTCDASDGYVT--VEGCPP 39 (65)
T ss_pred CEECCEEcCCCcEecCCCCCceEEEECCCCEEE--EEeCCC
Confidence 567899999999998 799999999 45444 334543
No 28
>smart00058 FN1 Fibronectin type 1 domain. One of three types of internal repeat within the plasma protein, fibronectin. Found also in coagulation factor XII, HGF activator and tissue-type plasminogen activator. In t-PA and fibronectin, this domain type contributes to fibrin-binding.
Probab=69.71 E-value=7.4 Score=25.23 Aligned_cols=29 Identities=17% Similarity=0.538 Sum_probs=22.8
Q ss_pred CeEeCCCCeee---c--CCcceEECCCc---eEEeec
Q psy10995 16 GVWRDDSEIWR---D--GCRECHCIDGR---EMCTLI 44 (265)
Q Consensus 16 G~~y~~Ge~w~---~--~C~~C~C~~G~---v~C~~~ 44 (265)
|+.|+.|++|. + .-..|+|..+. +.|.+.
T Consensus 7 ~~~Y~ige~W~r~~~~g~~~~C~C~G~g~G~~~C~~~ 43 (45)
T smart00058 7 GTTYRIGDTWERPYEGGHVLQCTCLGNGRGEWKCDPV 43 (45)
T ss_pred ccEEecCCEEEeecCCCcEEEeEEcCCCceEEEEEec
Confidence 79999999997 2 34579999654 888864
No 29
>PF12191 stn_TNFRSF12A: Tumour necrosis factor receptor stn_TNFRSF12A_TNFR domain; InterPro: IPR022316 The tumour necrosis factor (TNF) receptor (TNFR) superfamily comprises more than 20 type-I transmembrane proteins. Family members are defined based on similarity in their extracellular domain - a region that contains many cysteine residues arranged in a specific repetitive pattern []. The cysteines allow formation of an extended rod-like structure, responsible for ligand binding []. Upon receptor activation, different intracellular signalling complexes are assembled for different members of the TNFR superfamily, depending on their intracellular domains and sequences []. Activation of TNFRs can therefore induce a range of disparate effects, including cell proliferation, differentiation, survival, or apoptotic cell death, depending upon the receptor involved []. TNFRs are widely distributed and play important roles in many crucial biological processes, such as lymphoid and neuronal development, innate and adaptive immunity, and maintenance of cellular homeostasis []. Drugs that manipulate their signalling have potential roles in the prevention and treatment of many diseases, such as viral infections, coronary heart disease, transplant rejection, and immune disease []. TNF receptor 12 (also known as TWEAK receptor, and fibroblast growth factor-inducible-14 (Fn14)) has been implicated in endothelial cell growth and migration []. The receptor may also play a role in cell-matrix interactions [].; PDB: 2KN0_A 2RPJ_A 2KMZ_A 2EQP_A.
Probab=69.67 E-value=2.4 Score=33.59 Aligned_cols=69 Identities=22% Similarity=0.335 Sum_probs=16.4
Q ss_pred eeCCCEecC--CCceEEEECCCCeecCCCCcCCCcccCCCCCCCCCCCCCCCCccchhhhhhhhHHHHHHHHHHHHHHHH
Q psy10995 183 LHVGDTYFA--DPCTNCTCLHGGQKKCTASMCERKALEKGDTPTPPSGNDSPESATNRQIRESLYILLIVLFAVSTLFAF 260 (265)
Q Consensus 183 y~~Ge~w~~--~~C~~C~C~~g~v~~Cp~~~C~~p~~~~g~CC~~C~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 260 (265)
-+.|+.|+. ++|..|+ .|+.. ..-.=|..|...+...-..--.|.-|...+++|+.++|.|+..
T Consensus 36 Cp~G~sWs~dLdkCm~Cs-------sC~~~-------P~S~~C~~C~a~p~~p~~l~~pi~~sal~v~lVl~llsg~lv~ 101 (129)
T PF12191_consen 36 CPRGSSWSADLDKCMSCS-------SCPAY-------PKSPFCQGCPAAPPAPFPLLWPILGSALSVVLVLALLSGFLVW 101 (129)
T ss_dssp S-SSEEEETTTTEEEECC-------CHCC--------TT-CCCCCHSS-SSS-SSSS-----------------------
T ss_pred CCCCCcCcccccccccCc-------cCCCC-------CCCccccCCCCCCCCCcceehhhhhhHHHHHHHHHHHHHHHHH
Confidence 356889986 5888772 23320 1111123343322111222233555554556666666655555
Q ss_pred hhhcC
Q psy10995 261 YACRQ 265 (265)
Q Consensus 261 ~~~~~ 265 (265)
.+||+
T Consensus 102 rrcrr 106 (129)
T PF12191_consen 102 RRCRR 106 (129)
T ss_dssp -----
T ss_pred hhhhc
Confidence 55553
No 30
>PRK11875 psbT photosystem II reaction center protein T; Reviewed
Probab=62.80 E-value=9.4 Score=22.62 Aligned_cols=20 Identities=35% Similarity=0.501 Sum_probs=16.0
Q ss_pred hHHHHHHHHHHHHHHHHhhh
Q psy10995 244 LYILLIVLFAVSTLFAFYAC 263 (265)
Q Consensus 244 ~~~~~~~~~~~~~~~~~~~~ 263 (265)
.|++|++..+.+.+||++-+
T Consensus 5 ~Ytfll~~tlgiiFFAIfFR 24 (31)
T PRK11875 5 AYILILTLALVTLFFAIAFR 24 (31)
T ss_pred HHHHHHHHHHHHHHHhhhcc
Confidence 57888888888888888754
No 31
>PF06459 RR_TM4-6: Ryanodine Receptor TM 4-6; InterPro: IPR009460 The release of Ca2+ ions from intracellular stores is a key step in a wide variety of cellular functions. In striated muscle, the release of Ca2+ from the sarcoplasmic reticulum (SR) leads to muscle contraction. Ca2+ release occurs through large, high-conductance Ca2+ release channels, also known as ryanodine receptors (RyRs) because they bind the plant alkaloid ryanodine with high affinity and specificity []. This region covers TM regions 4-6 of the ryanodine receptor 1 family.; GO: 0005219 ryanodine-sensitive calcium-release channel activity, 0006874 cellular calcium ion homeostasis, 0016021 integral to membrane
Probab=61.22 E-value=7.2 Score=35.21 Aligned_cols=19 Identities=26% Similarity=0.728 Sum_probs=16.4
Q ss_pred HHHHHHHHHHHHHHHHhhh
Q psy10995 245 YILLIVLFAVSTLFAFYAC 263 (265)
Q Consensus 245 ~~~~~~~~~~~~~~~~~~~ 263 (265)
|+-|+|.|||=+++|||+-
T Consensus 175 ~lALflAFaINFILLFYKV 193 (274)
T PF06459_consen 175 FLALFLAFAINFILLFYKV 193 (274)
T ss_pred HHHHHHHHHHHHHHHHHHh
Confidence 4668999999999999983
No 32
>CHL00031 psbT photosystem II protein T
Probab=60.86 E-value=8.9 Score=23.07 Aligned_cols=21 Identities=24% Similarity=0.333 Sum_probs=16.6
Q ss_pred hhHHHHHHHHHHHHHHHHhhh
Q psy10995 243 SLYILLIVLFAVSTLFAFYAC 263 (265)
Q Consensus 243 ~~~~~~~~~~~~~~~~~~~~~ 263 (265)
-.|++|++..+.+.+||++-+
T Consensus 4 lvYtfll~~tlgilFFAI~FR 24 (33)
T CHL00031 4 LVYTFLLVSTLGIIFFAIFFR 24 (33)
T ss_pred hHHHHHHHHHHHHHHHhheec
Confidence 368899998888888888654
No 33
>PF15106 TMEM156: TMEM156 protein family
Probab=60.21 E-value=8.7 Score=33.22 Aligned_cols=25 Identities=20% Similarity=0.558 Sum_probs=21.1
Q ss_pred hhhhhhhhHHHHHHHHHHHHHHHHhhh
Q psy10995 237 NRQIRESLYILLIVLFAVSTLFAFYAC 263 (265)
Q Consensus 237 ~~~~~~~~~~~~~~~~~~~~~~~~~~~ 263 (265)
+..|. +|+|++++|.|.+.|.||+-
T Consensus 173 smKIT--WYvLVllVfiflii~iI~KI 197 (226)
T PF15106_consen 173 SMKIT--WYVLVLLVFIFLIILIIYKI 197 (226)
T ss_pred ehhhH--HHHHHHHHHHHHHHHHHHHH
Confidence 34455 99999999999999999984
No 34
>PF08693 SKG6: Transmembrane alpha-helix domain; InterPro: IPR014805 SKG6 and AXL2 are membrane proteins that show polarised intracellular localisation [, ]. This entry represents the highly conserved transmembrane alpha-helical domain found in these proteins [, ]. The full-length AXL2 protein has a negative regulatory function in cytokinesis [].
Probab=57.95 E-value=6.2 Score=24.97 Aligned_cols=18 Identities=28% Similarity=0.458 Sum_probs=9.0
Q ss_pred HHHHHHHHHHHHH--hhhcC
Q psy10995 248 LIVLFAVSTLFAF--YACRQ 265 (265)
Q Consensus 248 ~~~~~~~~~~~~~--~~~~~ 265 (265)
+.||.++..+||| |++++
T Consensus 21 V~vI~~vl~~~l~~~~rR~k 40 (40)
T PF08693_consen 21 VGVIIIVLGAFLFFWYRRKK 40 (40)
T ss_pred hHHHHHHHHHHhheEEeccC
Confidence 3444444444444 76653
No 35
>PF09443 CFC: Cripto_Frl-1_Cryptic (CFC); InterPro: IPR019011 This entry represents the CFC domain found in the membrane protein Cripto (or teratocarcinoma-derived growth factor), a protein over expressed in many tumours [, ] and structurally similar to the C-terminal extracellular portions of Jagged 1 and Jagged 2 []. CFC is approx 40-residues long, compacted by three internal disulphide bridges, and binds Alk4 via a hydrophobic patch. CFC is structurally homologous to the VWFC-like domain []. The protein Cripto is the founding member of the extra-cellular EGF-CFC growth factors, which are composed of two adjacent cysteine-rich domains: the EGF-like (IPR006210 from INTERPRO) and the CFC domains. Members of the EGF-CFC family play key roles in embryonic development and are also implicated in tumourigenesis []. The Cripto protein could play a role in the determination of the epiblastic cells that subsequently give rise to the mesoderm. Although both the EGF and CFC domains are involved in the tumourigenic activity of Crispto proteins, the CFC domain appears to play a crucial role, as it is through the CFC domain that Crispto interferes with the onco-suppressive activity of Activins, either by blocking the Activin receptor ALK4 or by antagonising proteins of the TGF-beta family []. The Cryptic protein is involved in the correct establishment of the left-right axis. May play a role in mesoderm and/or neural patterning during gastrulation.; PDB: 2J5H_A.
Probab=57.85 E-value=3.4 Score=25.49 Aligned_cols=25 Identities=32% Similarity=0.950 Sum_probs=6.9
Q ss_pred cCCcEEccCCCccceecCCeEEEeec
Q psy10995 88 LEGQHWTLNPCTDCTCREGKVLCYSQ 113 (265)
Q Consensus 88 ~~Ge~w~~~~C~~C~C~~g~v~C~~~ 113 (265)
.+|+ |....|..|+|..|...|.+.
T Consensus 5 ~Hg~-W~~~~C~lCrC~~G~LhC~p~ 29 (36)
T PF09443_consen 5 PHGE-WVPKGCSLCRCWYGTLHCFPQ 29 (36)
T ss_dssp --TT--SS---------SSS--S--S
T ss_pred cCCC-eeecCceeeEeecCcEEEcCc
Confidence 3454 888899999999999999864
No 36
>TIGR03024 arch_pef_cterm PEF-C-terminal archaeal protein sorting domain. This domain, distantly related to the PEP-Cterm domain described in model TIGR02595, is found in Methanosarcina mazei in four different proteins, as well as in other archaea such as Methanococcoides burtonii. Several proteins with this domain have their genes only a short distance from a distant homology of EpsH, a proposed integral membrane transpeptidase.
Probab=55.59 E-value=15 Score=20.95 Aligned_cols=16 Identities=13% Similarity=0.299 Sum_probs=10.1
Q ss_pred HHHHHHHHHHHHhhhc
Q psy10995 249 IVLFAVSTLFAFYACR 264 (265)
Q Consensus 249 ~~~~~~~~~~~~~~~~ 264 (265)
+.|.|+.++++++.+|
T Consensus 9 l~I~all~i~~i~~rr 24 (26)
T TIGR03024 9 LPIIALLAIIVILRRR 24 (26)
T ss_pred HHHHHHHHHHHHHhhc
Confidence 4555666667777665
No 37
>PF15102 TMEM154: TMEM154 protein family
Probab=55.44 E-value=6.8 Score=31.97 Aligned_cols=19 Identities=26% Similarity=0.490 Sum_probs=11.5
Q ss_pred HHHHHHHHHHHHHHHhhhc
Q psy10995 246 ILLIVLFAVSTLFAFYACR 264 (265)
Q Consensus 246 ~~~~~~~~~~~~~~~~~~~ 264 (265)
++|.++++++|++++|.+|
T Consensus 66 VLLvlLLl~vV~lv~~~kR 84 (146)
T PF15102_consen 66 VLLVLLLLSVVCLVIYYKR 84 (146)
T ss_pred HHHHHHHHHHHHheeEEee
Confidence 4555555666666666655
No 38
>PF04530 Viral_Beta_CD: Viral Beta C/D like family; InterPro: IPR007617 This is a family of ssRNA positive-strand viral proteins. Conserved region is found in the Beta C and Beta D transcripts.
Probab=51.86 E-value=36 Score=26.92 Aligned_cols=10 Identities=10% Similarity=-0.204 Sum_probs=5.3
Q ss_pred CCCCCCCCCC
Q psy10995 219 KGDTPTPPSG 228 (265)
Q Consensus 219 ~g~CC~~C~~ 228 (265)
.|-+|+.|.-
T Consensus 5 ~~C~CsdC~w 14 (122)
T PF04530_consen 5 SGCCCSDCQW 14 (122)
T ss_pred CCCcCCcccC
Confidence 4455666553
No 39
>PF06692 MNSV_P7B: Melon necrotic spot virus P7B protein; InterPro: IPR009575 This family consists of several Melon necrotic spot virus (MNSV) P7B proteins. The function of this family is unknown.
Probab=51.67 E-value=17 Score=24.65 Aligned_cols=18 Identities=17% Similarity=0.180 Sum_probs=12.5
Q ss_pred HHHHHHHHHHHHHHHHhh
Q psy10995 245 YILLIVLFAVSTLFAFYA 262 (265)
Q Consensus 245 ~~~~~~~~~~~~~~~~~~ 262 (265)
..+|+|||.-.+||+||.
T Consensus 14 ~~~lLiliis~~f~lI~~ 31 (61)
T PF06692_consen 14 SGPLLILIISFVFFLITS 31 (61)
T ss_pred hhHHHHHHHHHHHHHHhh
Confidence 346667777777888874
No 40
>TIGR03068 srtB_sig_NPQTN sortase B signal domain, NPQTN class. This model represents one of the boutique (rare) sortase signals, recognized by sortase B (SrtB) rather than by the housekeeping-type SrtA class sortase. This sequence, beginning NPQTN, shows little similarity to several other SrtB substrates.
Probab=48.82 E-value=17 Score=21.85 Aligned_cols=15 Identities=33% Similarity=0.392 Sum_probs=8.6
Q ss_pred HHHHHHHHHHHHHHHHhh
Q psy10995 245 YILLIVLFAVSTLFAFYA 262 (265)
Q Consensus 245 ~~~~~~~~~~~~~~~~~~ 262 (265)
|+.++.++| |||+|.
T Consensus 11 y~y~Ip~v~---lflL~~ 25 (33)
T TIGR03068 11 YIYAIPVAS---LALLIA 25 (33)
T ss_pred hhhHHHHHH---HHHHHH
Confidence 556665555 565554
No 41
>PF01405 PsbT: Photosystem II reaction centre T protein; InterPro: IPR001743 Oxygenic photosynthesis uses two multi-subunit photosystems (I and II) located in the cell membranes of cyanobacteria and in the thylakoid membranes of chloroplasts in plants and algae. Photosystem II (PSII) has a P680 reaction centre containing chlorophyll 'a' that uses light energy to carry out the oxidation (splitting) of water molecules, and to produce ATP via a proton pump. Photosystem I (PSI) has a P700 reaction centre containing chlorophyll that takes the electron and associated hydrogen donated from PSII to reduce NADP+ to NADPH. Both ATP and NADPH are subsequently used in the light-independent reactions to convert carbon dioxide to glucose using the hydrogen atom extracted from water by PSII, releasing oxygen as a by-product. PSII is a multisubunit protein-pigment complex containing polypeptides both intrinsic and extrinsic to the photosynthetic membrane [, ]. Within the core of the complex, the chlorophyll and beta-carotene pigments are mainly bound to the antenna proteins CP43 (PsbC) and CP47 (PsbB), which pass the excitation energy on to the reaction centre proteins D1 (Qb, PsbA) and D2 (Qa, PsbD) that bind all the redox-active cofactors involved in the energy conversion process. The PSII oxygen-evolving complex (OEC) oxidises water to provide protons for use by PSI, and consists of OEE1 (PsbO), OEE2 (PsbP) and OEE3 (PsbQ). The remaining subunits in PSII are of low molecular weight (less than 10 kDa), and are involved in PSII assembly, stabilisation, dimerisation, and photo-protection []. This family represents the low molecular weight transmembrane protein PsbT found in PSII, which is thought to be associated with the D1 (PsbA) - D2 (PsbD) heterodimer. PsbT may be involved in the formation and/or stabilisation of dimeric PSII complexes, because in the absence of this protein dimeric PSII complexes were found to be less abundant. Furthermore, although PsbT does not confer photo-protection, it is required for the efficient recovery of photo-damaged PSII [].; GO: 0015979 photosynthesis, 0009523 photosystem II, 0009539 photosystem II reaction center, 0016020 membrane; PDB: 3BZ1_T 1S5L_t 2AXT_t 3KZI_T 3PRQ_T 3BZ2_T 3PRR_T 4FBY_g 3A0H_t 3A0B_T ....
Probab=48.04 E-value=26 Score=20.54 Aligned_cols=20 Identities=20% Similarity=0.330 Sum_probs=14.8
Q ss_pred hHHHHHHHHHHHHHHHHhhh
Q psy10995 244 LYILLIVLFAVSTLFAFYAC 263 (265)
Q Consensus 244 ~~~~~~~~~~~~~~~~~~~~ 263 (265)
.|++|++..+.+.+||+.-+
T Consensus 5 vY~~ll~~tlgilffAI~FR 24 (29)
T PF01405_consen 5 VYTFLLIGTLGILFFAIFFR 24 (29)
T ss_dssp HHHHHHHHHHHHHHHHHHSS
T ss_pred HHHHHHHHHHHHHHhhhhcc
Confidence 57888888777777877643
No 42
>PF15168 TRIQK: Triple QxxK/R motif-containing protein family
Probab=45.53 E-value=23 Score=25.61 Aligned_cols=11 Identities=45% Similarity=0.667 Sum_probs=5.0
Q ss_pred HHHHHHHHHhh
Q psy10995 252 FAVSTLFAFYA 262 (265)
Q Consensus 252 ~~~~~~~~~~~ 262 (265)
+.+..||+||+
T Consensus 57 ail~lL~a~Ya 67 (79)
T PF15168_consen 57 AILVLLLAFYA 67 (79)
T ss_pred HHHHHHHHHHH
Confidence 33444555553
No 43
>COG4839 FtsL Protein required for the initiation of cell division [Cell division and chromosome partitioning]
Probab=44.63 E-value=23 Score=27.85 Aligned_cols=22 Identities=23% Similarity=0.211 Sum_probs=17.2
Q ss_pred hhhhHHHHHHHHHHHHHHHHhh
Q psy10995 241 RESLYILLIVLFAVSTLFAFYA 262 (265)
Q Consensus 241 ~~~~~~~~~~~~~~~~~~~~~~ 262 (265)
.--+|+++.|++++.+++++|.
T Consensus 37 EKvly~~~~va~L~vai~ii~~ 58 (120)
T COG4839 37 EKVLYTTLAVAALVVAISIISV 58 (120)
T ss_pred HHHHHHHHHHHHHHHHHHHHHH
Confidence 3447889999988888888774
No 44
>PF09889 DUF2116: Uncharacterized protein containing a Zn-ribbon (DUF2116); InterPro: IPR019216 This entry contains various hypothetical prokaryotic proteins whose functions are unknown. They contain a conserved zinc ribbon motif in the N-terminal part and a predicted transmembrane segment in the C-terminal part.
Probab=43.67 E-value=35 Score=23.45 Aligned_cols=15 Identities=20% Similarity=0.547 Sum_probs=7.0
Q ss_pred hhhhhHHHHHHHHHH
Q psy10995 240 IRESLYILLIVLFAV 254 (265)
Q Consensus 240 ~~~~~~~~~~~~~~~ 254 (265)
++-..||++.++++|
T Consensus 35 ~~~~~~i~~~~~i~~ 49 (59)
T PF09889_consen 35 MRKTQYIFFGIFILF 49 (59)
T ss_pred HHHHHHHHHHHHHHH
Confidence 333444555554443
No 45
>PHA02831 EEV host range protein; Provisional
Probab=43.53 E-value=89 Score=28.17 Aligned_cols=13 Identities=23% Similarity=0.761 Sum_probs=6.8
Q ss_pred HHHHHHHHHHhhh
Q psy10995 251 LFAVSTLFAFYAC 263 (265)
Q Consensus 251 ~~~~~~~~~~~~~ 263 (265)
||.+.+++||-.|
T Consensus 243 ~~~~~~~~~~~~~ 255 (268)
T PHA02831 243 IFVLGLIALFLSC 255 (268)
T ss_pred HHHHHHHHHhhcc
Confidence 4455555555444
No 46
>KOG3208|consensus
Probab=42.24 E-value=25 Score=30.68 Aligned_cols=22 Identities=41% Similarity=0.560 Sum_probs=17.7
Q ss_pred hhhhhHHHHHHHHHHHHHHHHhh
Q psy10995 240 IRESLYILLIVLFAVSTLFAFYA 262 (265)
Q Consensus 240 ~~~~~~~~~~~~~~~~~~~~~~~ 262 (265)
-|.+| ||..||.+-..||+||+
T Consensus 208 rrdsl-ILa~Vis~C~llllfy~ 229 (231)
T KOG3208|consen 208 RRDSL-ILAAVISVCTLLLLFYW 229 (231)
T ss_pred hhhhH-HHHHHHHHHHHHHHHHH
Confidence 34555 88899998889999996
No 47
>PHA02819 hypothetical protein; Provisional
Probab=41.61 E-value=44 Score=23.77 Aligned_cols=21 Identities=14% Similarity=0.227 Sum_probs=10.2
Q ss_pred hhhhhHHHHHHHHHHHHHHHH
Q psy10995 240 IRESLYILLIVLFAVSTLFAF 260 (265)
Q Consensus 240 ~~~~~~~~~~~~~~~~~~~~~ 260 (265)
..-.+.|++++++++.+||.|
T Consensus 44 ~~~~~~ii~l~~~~~~~~~~f 64 (71)
T PHA02819 44 FLRYYLIIGLVTIVFVIIFII 64 (71)
T ss_pred hhHHHHHHHHHHHHHHHHHHH
Confidence 333444555555555555544
No 48
>PF01034 Syndecan: Syndecan domain; InterPro: IPR001050 The syndecans are transmembrane proteoglycans which are involved in the organisation of cytoskeleton and/or actin microfilaments, and have important roles as cell surface receptors during cell-cell and/or cell-matrix interactions [, ]. Structurally, these proteins consist of four separate domains: A signal sequence; An extracellular domain (ectodomain) of variable length whose sequence is not evolutionary conserved in the various forms of syndecans. The ectodomain contains the sites of attachment of the heparan sulphate glycosaminoglycan side chains; A transmembrane region; A highly conserved cytoplasmic domain of about 30 to 35 residues, which could interact with cytoskeletal proteins. The proteins known to belong to this family are: Syndecan 1. Syndecan 2 or fibroglycan. Syndecan 3 or neuroglycan or N-syndecan. Syndecan 4 or amphiglycan or ryudocan. Drosophila syndecan. Caenorhabditis elegans probable syndecan (F57C7.3). Syndecan-4, a transmembrane heparan sulphate proteoglycan, is a coreceptor with integrins in cell adhesion. It has been suggested to form a ternary signalling complex with protein kinase Calpha and phosphatidylinositol 4,5-bisphosphate (PIP2). Structural studies have demonstrated that the cytoplasmic domain undergoes a conformational transition and forms a symmetric dimer in the presence of phospholipid activator PIP2, and whose overall structure in solution exhibits a twisted clamp shape having a cavity in the centre of dimeric interface. In addition, it has been observed that the syndecan-4 variable domain interacts, strongly, not only with fatty acyl groups but also the anionic head group of PIP2. These findings indicate that PIP2 promotes oligomerisation of the syndecan-4 cytoplasmic domain for transmembrane signalling and cell-matrix adhesion [, ].; GO: 0008092 cytoskeletal protein binding, 0016020 membrane; PDB: 1EJQ_B 1EJP_B 1YBO_C 1OBY_Q.
Probab=39.17 E-value=10 Score=26.51 Aligned_cols=15 Identities=27% Similarity=0.481 Sum_probs=0.6
Q ss_pred HHHHHHHHHHHhhhc
Q psy10995 250 VLFAVSTLFAFYACR 264 (265)
Q Consensus 250 ~~~~~~~~~~~~~~~ 264 (265)
+.+.+.+||++|+.|
T Consensus 23 l~ailLIlf~iyR~r 37 (64)
T PF01034_consen 23 LFAILLILFLIYRMR 37 (64)
T ss_dssp --------------S
T ss_pred HHHHHHHHHHHHHHH
Confidence 334445678888765
No 49
>PF09889 DUF2116: Uncharacterized protein containing a Zn-ribbon (DUF2116); InterPro: IPR019216 This entry contains various hypothetical prokaryotic proteins whose functions are unknown. They contain a conserved zinc ribbon motif in the N-terminal part and a predicted transmembrane segment in the C-terminal part.
Probab=38.87 E-value=30 Score=23.81 Aligned_cols=21 Identities=19% Similarity=0.417 Sum_probs=10.1
Q ss_pred hhhhhhhhHHHHHHHHHHHHH
Q psy10995 237 NRQIRESLYILLIVLFAVSTL 257 (265)
Q Consensus 237 ~~~~~~~~~~~~~~~~~~~~~ 257 (265)
.++|...+.++++++|++.++
T Consensus 38 ~~~i~~~~~i~~l~v~~~~~~ 58 (59)
T PF09889_consen 38 TQYIFFGIFILFLAVWIFMTF 58 (59)
T ss_pred HHHHHHHHHHHHHHHHHHHHh
Confidence 455555554444444444443
No 50
>PHA03240 envelope glycoprotein M; Provisional
Probab=38.41 E-value=27 Score=30.46 Aligned_cols=26 Identities=8% Similarity=0.442 Sum_probs=16.7
Q ss_pred hhhhhhhhHHHHHHHHHHHHHHHHhh
Q psy10995 237 NRQIRESLYILLIVLFAVSTLFAFYA 262 (265)
Q Consensus 237 ~~~~~~~~~~~~~~~~~~~~~~~~~~ 262 (265)
+....|.-.|+++||+.++++++|++
T Consensus 207 ~~~aaH~~WIiilIIiIiIIIL~cfK 232 (258)
T PHA03240 207 SKDAAHIAWIFIAIIIIIVIILFFFK 232 (258)
T ss_pred ccccchHhHHHHHHHHHHHHHHHHHh
Confidence 33467777777777776665555554
No 51
>PHA02650 hypothetical protein; Provisional
Probab=36.24 E-value=66 Score=23.45 Aligned_cols=19 Identities=32% Similarity=0.640 Sum_probs=8.9
Q ss_pred hhhHHHHHHHHHHHHHHHH
Q psy10995 242 ESLYILLIVLFAVSTLFAF 260 (265)
Q Consensus 242 ~~~~~~~~~~~~~~~~~~~ 260 (265)
-.+.|++++++++.+||.|
T Consensus 49 ~~~~ii~i~~v~i~~l~~f 67 (81)
T PHA02650 49 GQNFIFLIFSLIIVALFSF 67 (81)
T ss_pred HHHHHHHHHHHHHHHHHHH
Confidence 3344555444445555443
No 52
>PHA03054 IMV membrane protein; Provisional
Probab=36.03 E-value=71 Score=22.76 Aligned_cols=20 Identities=10% Similarity=0.243 Sum_probs=9.6
Q ss_pred hhhhHHHHHHHHHHHHHHHH
Q psy10995 241 RESLYILLIVLFAVSTLFAF 260 (265)
Q Consensus 241 ~~~~~~~~~~~~~~~~~~~~ 260 (265)
.-.+.|++++++++.+||.|
T Consensus 47 ~~~~~ii~l~~v~~~~l~~f 66 (72)
T PHA03054 47 GWYWLIIIFFIVLILLLLIY 66 (72)
T ss_pred hHHHHHHHHHHHHHHHHHHH
Confidence 33444555554455555444
No 53
>PF06084 Cytomega_TRL10: Cytomegalovirus TRL10 protein; InterPro: IPR009284 This family consists of several Cytomegalovirus TRL10 proteins. TRL10 represents a structural component of the virus particle and like the other HCMV envelope glycoproteins, is present in a disulphide-linked complex [].
Probab=35.88 E-value=20 Score=28.02 Aligned_cols=14 Identities=14% Similarity=0.477 Sum_probs=9.8
Q ss_pred HHHHHHHHHHhhhc
Q psy10995 251 LFAVSTLFAFYACR 264 (265)
Q Consensus 251 ~~~~~~~~~~~~~~ 264 (265)
|+++++||.||++.
T Consensus 70 iillviffviy~re 83 (150)
T PF06084_consen 70 IILLVIFFVIYSRE 83 (150)
T ss_pred HHHHHHhheeEecc
Confidence 44566788899863
No 54
>PF03908 Sec20: Sec20; InterPro: IPR005606 Sec20 is a membrane glycoprotein associated with secretory pathway.
Probab=34.76 E-value=41 Score=24.87 Aligned_cols=17 Identities=12% Similarity=0.467 Sum_probs=10.5
Q ss_pred HHHHHHHHHHHHHHhhh
Q psy10995 247 LLIVLFAVSTLFAFYAC 263 (265)
Q Consensus 247 ~~~~~~~~~~~~~~~~~ 263 (265)
+.+.+|.+++++.+|++
T Consensus 75 ~~~~~f~~~v~yI~~rR 91 (92)
T PF03908_consen 75 FAFLFFLLVVLYILWRR 91 (92)
T ss_pred HHHHHHHHHHHHHhhhc
Confidence 45556666666666654
No 55
>PF15220 HILPDA: Hypoxia-inducible lipid droplet-associated
Probab=34.46 E-value=50 Score=22.30 Aligned_cols=18 Identities=33% Similarity=0.539 Sum_probs=15.1
Q ss_pred hhHHHHHHHHHHHHHHHH
Q psy10995 243 SLYILLIVLFAVSTLFAF 260 (265)
Q Consensus 243 ~~~~~~~~~~~~~~~~~~ 260 (265)
+||++..|+.++|+|+-+
T Consensus 6 nlyllgvvltllsifvrl 23 (63)
T PF15220_consen 6 NLYLLGVVLTLLSIFVRL 23 (63)
T ss_pred HHHHHHHHHHHHHHHHHH
Confidence 689999999999988754
No 56
>PF11027 DUF2615: Protein of unknown function (DUF2615); InterPro: IPR020309 This entry represents a group of uncharacterised protein from the Metazoa, including CD034 (or C4orf34) and YQF4 (or C34C12.4).
Probab=34.24 E-value=40 Score=25.86 Aligned_cols=16 Identities=13% Similarity=0.453 Sum_probs=8.8
Q ss_pred HHHHHHHHHHHHHHhh
Q psy10995 247 LLIVLFAVSTLFAFYA 262 (265)
Q Consensus 247 ~~~~~~~~~~~~~~~~ 262 (265)
++.++|.+++++||+-
T Consensus 57 ~~~~~w~~~A~~ly~~ 72 (103)
T PF11027_consen 57 MMMMLWMVLAMALYLL 72 (103)
T ss_pred HHHHHHHHHHHHHHHc
Confidence 4445566666555553
No 57
>COG3726 AhpA Uncharacterized membrane protein affecting hemolysin expression [General function prediction only]
Probab=33.63 E-value=44 Score=28.68 Aligned_cols=22 Identities=18% Similarity=0.415 Sum_probs=17.7
Q ss_pred hhhhhHHHHHHHHHHHHHHH-Hh
Q psy10995 240 IRESLYILLIVLFAVSTLFA-FY 261 (265)
Q Consensus 240 ~~~~~~~~~~~~~~~~~~~~-~~ 261 (265)
--|-+|.||+++|++.+|++ +|
T Consensus 161 ~~~~l~~mlll~~l~gvlLa~~l 183 (214)
T COG3726 161 TTNILRLMLLLSLLIGVLLARLL 183 (214)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Confidence 34668899999999999887 54
No 58
>PHA02844 putative transmembrane protein; Provisional
Probab=33.57 E-value=68 Score=23.09 Aligned_cols=16 Identities=25% Similarity=0.650 Sum_probs=6.7
Q ss_pred hHHHHHHHHHHHHHHH
Q psy10995 244 LYILLIVLFAVSTLFA 259 (265)
Q Consensus 244 ~~~~~~~~~~~~~~~~ 259 (265)
+.|++++++++++||.
T Consensus 50 ~~ii~i~~v~~~~~~~ 65 (75)
T PHA02844 50 IWILTIIFVVFATFLT 65 (75)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 3344444444444443
No 59
>PF10717 ODV-E18: Occlusion-derived virus envelope protein ODV-E18; InterPro: IPR019655 Baculovirus occlusion-derived virus (ODV) derives its envelope from an intranuclear membrane source. Occlusion-derived viral envelope proteins that are detected in viral-induced intranuclear microvesicles, but not detected in the plasma membrane, cytoplasmic membranes, or the nuclear envelope. This entry represents ODV-E18 protein which is encoded by baculovirus late genes with transcription initiating from a TAAG motif. ODV-E18 exists as a dimer in the ODV envelope, which contains a hydrophobic domain that putatively acts as a target or retention signal for intranuclear microvesicles []. ; GO: 0019031 viral envelope
Probab=32.33 E-value=52 Score=24.23 Aligned_cols=15 Identities=33% Similarity=0.576 Sum_probs=6.7
Q ss_pred HHHHHHHHHHHHHhh
Q psy10995 248 LIVLFAVSTLFAFYA 262 (265)
Q Consensus 248 ~~~~~~~~~~~~~~~ 262 (265)
|+|++.|+.|.+|++
T Consensus 31 LivLVIIiLlImlfq 45 (85)
T PF10717_consen 31 LIVLVIIILLIMLFQ 45 (85)
T ss_pred HHHHHHHHHHHHHHh
Confidence 344444444444443
No 60
>PF12911 OppC_N: N-terminal TM domain of oligopeptide transport permease C
Probab=32.04 E-value=54 Score=21.47 Aligned_cols=15 Identities=13% Similarity=0.476 Sum_probs=6.9
Q ss_pred HHHHHHHHHHHHHHH
Q psy10995 246 ILLIVLFAVSTLFAF 260 (265)
Q Consensus 246 ~~~~~~~~~~~~~~~ 260 (265)
++.++|+++.+++++
T Consensus 19 ~~gl~il~~~vl~ai 33 (56)
T PF12911_consen 19 VIGLIILLILVLLAI 33 (56)
T ss_pred HHHHHHHHHHHHHHH
Confidence 344444444444444
No 61
>PTZ00370 STEVOR; Provisional
Probab=31.87 E-value=44 Score=30.39 Aligned_cols=20 Identities=15% Similarity=0.398 Sum_probs=12.2
Q ss_pred HHHHHHHHHHHHHHHhhhcC
Q psy10995 246 ILLIVLFAVSTLFAFYACRQ 265 (265)
Q Consensus 246 ~~~~~~~~~~~~~~~~~~~~ 265 (265)
++++++++|+...++|++|.
T Consensus 264 llil~vvliilYiwlyrrRK 283 (296)
T PTZ00370 264 LLILAVVLIILYIWLYRRRK 283 (296)
T ss_pred HHHHHHHHHHHHHHHHHhhc
Confidence 44444555555667888873
No 62
>TIGR01478 STEVOR variant surface antigen, stevor family. This model represents the stevor branch of the rifin/stevor family (pfam02009) of predicted variant surface antigens as found in Plasmodium falciparum. This model is based on a set of stevor sequences kindly provided by Matt Berriman from the Sanger Center. This is a global model and assesses a penalty for incomplete sequence. Additional fragmentary sequences may be found with the fragment model and a cutoff of 8 bits.
Probab=31.67 E-value=44 Score=30.32 Aligned_cols=20 Identities=10% Similarity=0.376 Sum_probs=12.1
Q ss_pred HHHHHHHHHHHHHHHhhhcC
Q psy10995 246 ILLIVLFAVSTLFAFYACRQ 265 (265)
Q Consensus 246 ~~~~~~~~~~~~~~~~~~~~ 265 (265)
++++++++|+...++|++|.
T Consensus 268 llil~vvliiLYiWlyrrRK 287 (295)
T TIGR01478 268 LIILTVVLIILYIWLYRRRK 287 (295)
T ss_pred HHHHHHHHHHHHHHHHHhhc
Confidence 44444455555567888873
No 63
>PF11346 DUF3149: Protein of unknown function (DUF3149); InterPro: IPR021494 This bacterial family of proteins has no known function.
Probab=30.94 E-value=65 Score=20.57 Aligned_cols=16 Identities=25% Similarity=0.403 Sum_probs=9.3
Q ss_pred HHHHHHHHHHHHHHHh
Q psy10995 246 ILLIVLFAVSTLFAFY 261 (265)
Q Consensus 246 ~~~~~~~~~~~~~~~~ 261 (265)
+.|+||+..+.+.++|
T Consensus 14 ~Sl~vI~~~igm~~~~ 29 (42)
T PF11346_consen 14 MSLIVIVFTIGMGVFF 29 (42)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 4566666666555554
No 64
>PF06679 DUF1180: Protein of unknown function (DUF1180); InterPro: IPR009565 This entry consists of several hypothetical eukaryotic proteins thought to be membrane proteins. Their function is unknown.
Probab=30.48 E-value=67 Score=26.74 Aligned_cols=25 Identities=20% Similarity=0.364 Sum_probs=17.0
Q ss_pred hhhhhHHHHHHHHHHHHHHHHhhhc
Q psy10995 240 IRESLYILLIVLFAVSTLFAFYACR 264 (265)
Q Consensus 240 ~~~~~~~~~~~~~~~~~~~~~~~~~ 264 (265)
+.-++|+++.+..++++.|+|...|
T Consensus 94 l~R~~~Vl~g~s~l~i~yfvir~~R 118 (163)
T PF06679_consen 94 LKRALYVLVGLSALAILYFVIRTFR 118 (163)
T ss_pred hhhhHHHHHHHHHHHHHHHHHHHHh
Confidence 4456677777777777777776654
No 65
>PF04272 Phospholamban: Phospholamban; InterPro: IPR005984 Phospholamban (PLB) is a small protein (52 amino acids) that regulates the affinity of the cardiac sarcoplasmic reticulum Ca2+-ATPase (SERCA2a) for calcium. PLB is present in cardiac myocytes, in slow-twitch and smooth muscle and is expressed also in aorta endothelial cells in which it could play a role in tissue relaxation. The phosphorylation/dephosphorylation of phospholamban removes and restores, respectively, its inhibitory activity on SERCA2a. It has in fact been shown that phospholamban, in its non-phosphorylated form, binds to SERCA2a and inhibits this pump by lowering its affinity for Ca2+, whereas the phosphorylated form does not exert the inhibition. PLB is phosphorylated at two sites, namely at Ser-16 for a cAMP-dependent phosphokinase and at Thr-17 for a Ca2+/calmodulin-dependent phosphokinase, phosphorylation at Ser-16 being a prerequisite for the phosphorylation at Thr-17. The structure of a 36-amino-acid-long N-terminal fragment of human phospholamban phosphorylated at Ser-16 and Thr-17 and Cys36Ser mutated was determined from nuclear magnetic resonance data. The peptide assumes a conformation characterised by two alpha-helices connected by an irregular strand, which comprises the amino acids from Arg-13 to Pro-21. The proline is in a trans conformation. The two phosphate groups on Ser-16 and Thr-17 are shown to interact preferably with the side chains of Arg-14 and Arg-13, respectively [].; GO: 0005246 calcium channel regulator activity, 0042030 ATPase inhibitor activity, 0006816 calcium ion transport, 0016020 membrane; PDB: 1N7L_A 1FJP_A 1FJK_A 2HYN_C 1ZLL_D 1PLP_A 3O7L_I.
Probab=30.10 E-value=98 Score=20.07 Aligned_cols=26 Identities=19% Similarity=0.220 Sum_probs=13.2
Q ss_pred chhhhhhhhHHH-HHHHHHHHHHHHHh
Q psy10995 236 TNRQIRESLYIL-LIVLFAVSTLFAFY 261 (265)
Q Consensus 236 ~~~~~~~~~~~~-~~~~~~~~~~~~~~ 261 (265)
.|+|+|..+.-+ ++....++-|+++|
T Consensus 20 v~~qa~qnlqelfvnfclilicllli~ 46 (52)
T PF04272_consen 20 VPQQARQNLQELFVNFCLILICLLLIC 46 (52)
T ss_dssp SCHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 466777766432 22333344555555
No 66
>PF00746 Gram_pos_anchor: Gram positive anchor; InterPro: IPR019948 Viruses, parasites and bacteria are covered in protein and sugar molecules that help them gain entry into a host by counteracting the host's defences. One such molecule is the M protein produced by certain streptococcal bacteria. M proteins embody a motif that is now known to be shared by many Gram-positive bacterial surface proteins. The motif includes a conserved hexapeptide, which precedes a hydrophobic C-terminal membrane anchor, which itself precedes a cluster of basic residues [, ]. This structure is represented in the following schematic representation: +--------------------------------------------+-+--------+-+ | Variable length extracellular domain |H| Anchor |B| +--------------------------------------------+-+--------+-+ 'H': conserved hexapeptide. 'B': cluster of basic residues. It has been proposed that this hexapeptide sequence is responsible for a post- translational modification necessary for the proper anchoring of the proteins which bear it, to the cell wall.; PDB: 2XTL_B 3QDH_A 2Y1V_C 2X9X_A 3RPK_A 2X9W_A 2X9Y_A.
Probab=29.88 E-value=17 Score=22.22 Aligned_cols=21 Identities=14% Similarity=0.159 Sum_probs=0.0
Q ss_pred hHHHHHHHHHHHHHHHHhhhc
Q psy10995 244 LYILLIVLFAVSTLFAFYACR 264 (265)
Q Consensus 244 ~~~~~~~~~~~~~~~~~~~~~ 264 (265)
.++++-++++++++++++++|
T Consensus 19 ~~~~~G~l~~~~~~~~~~krr 39 (39)
T PF00746_consen 19 ILTILGALLLLGGGLLLVKRR 39 (39)
T ss_dssp ---------------------
T ss_pred HHHHHHHHHHHHHHHHheecC
Confidence 334444455555555555544
No 67
>PF04021 Class_IIIsignal: Class III signal peptide; InterPro: IPR007166 This entry represents an amino terminal motif QXSXEXXXL thought to be part of a class III signal sequence for a family of archaeal proteins. The Q residue is the +1 residue of the signal peptidase cleavage site []. Two proteins containing this motif are cleaved by a type IV pilin-like signal peptidase.
Probab=29.63 E-value=73 Score=18.39 Aligned_cols=6 Identities=17% Similarity=0.218 Sum_probs=2.1
Q ss_pred HHHHHH
Q psy10995 252 FAVSTL 257 (265)
Q Consensus 252 ~~~~~~ 257 (265)
+.+++.
T Consensus 16 vlv~~~ 21 (28)
T PF04021_consen 16 VLVIAI 21 (28)
T ss_pred HHHHHH
Confidence 333333
No 68
>TIGR01294 P_lamban phospholamban. This model represents the short (52 residue) transmembrane phosphoprotein phospholamban. Phospholamban, in its unphosphorylated form, inhibits SERCA2, the cardiac sarcoplasmic reticulum Ca-ATPase.
Probab=29.45 E-value=99 Score=20.06 Aligned_cols=27 Identities=19% Similarity=0.215 Sum_probs=13.6
Q ss_pred cchhhhhhhhHHH-HHHHHHHHHHHHHh
Q psy10995 235 ATNRQIRESLYIL-LIVLFAVSTLFAFY 261 (265)
Q Consensus 235 ~~~~~~~~~~~~~-~~~~~~~~~~~~~~ 261 (265)
..|+|+|..|.-+ ++....++-|+++|
T Consensus 19 e~~~qar~~lq~lfvnf~lilicllli~ 46 (52)
T TIGR01294 19 EMPQQARQNLQNLFINFCLILICLLLIC 46 (52)
T ss_pred cCCHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3467788777532 22223334455554
No 69
>PHA03093 EEV glycoprotein; Provisional
Probab=29.37 E-value=60 Score=27.52 Aligned_cols=22 Identities=32% Similarity=0.487 Sum_probs=17.2
Q ss_pred hhhhhhHHHHHHHHHHHHHHHH
Q psy10995 239 QIRESLYILLIVLFAVSTLFAF 260 (265)
Q Consensus 239 ~~~~~~~~~~~~~~~~~~~~~~ 260 (265)
-+|.+.-|.++++.++++++||
T Consensus 38 ~~RisiiiSIlsL~~i~~~LAl 59 (185)
T PHA03093 38 CIRISIIISILSLIAITATLAL 59 (185)
T ss_pred HHHHHHHHHHHHHHHHHHHHHH
Confidence 4777787888888888887776
No 70
>PF06024 DUF912: Nucleopolyhedrovirus protein of unknown function (DUF912); InterPro: IPR009261 This entry is represented by Autographa californica nuclear polyhedrosis virus (AcMNPV), Orf78; it is a family of uncharacterised viral proteins.
Probab=28.47 E-value=49 Score=25.08 Aligned_cols=6 Identities=50% Similarity=1.093 Sum_probs=2.5
Q ss_pred HHHHHh
Q psy10995 256 TLFAFY 261 (265)
Q Consensus 256 ~~~~~~ 261 (265)
+|+++|
T Consensus 77 ily~Iy 82 (101)
T PF06024_consen 77 ILYAIY 82 (101)
T ss_pred HHhhhe
Confidence 334444
No 71
>PF15188 CCDC-167: Coiled-coil domain-containing protein 167
Probab=28.45 E-value=61 Score=23.99 Aligned_cols=17 Identities=18% Similarity=0.247 Sum_probs=12.8
Q ss_pred HHHHHHHHHHHHHHHHh
Q psy10995 245 YILLIVLFAVSTLFAFY 261 (265)
Q Consensus 245 ~~~~~~~~~~~~~~~~~ 261 (265)
=.|++.+..|.++|+||
T Consensus 68 K~~~ls~~l~~v~~LvY 84 (85)
T PF15188_consen 68 KSMLLSVALFFVCFLVY 84 (85)
T ss_pred hhHHHHHHHHHHHHHHc
Confidence 34677777788888888
No 72
>PHA02673 ORF109 EEV glycoprotein; Provisional
Probab=28.37 E-value=65 Score=26.66 Aligned_cols=21 Identities=29% Similarity=0.350 Sum_probs=14.2
Q ss_pred hhhhhHHHHHHHHHHHHHHHH
Q psy10995 240 IRESLYILLIVLFAVSTLFAF 260 (265)
Q Consensus 240 ~~~~~~~~~~~~~~~~~~~~~ 260 (265)
+|.+.-|.++|+.+++|++|+
T Consensus 36 ~Ri~~~iSIisL~~l~v~LaL 56 (161)
T PHA02673 36 FRLMAAIAIIVLAILVVILAL 56 (161)
T ss_pred HHHHHHHHHHHHHHHHHHHHH
Confidence 566666667777777776665
No 73
>PF15201 Rod_cone_degen: Progressive rod-cone degeneration
Probab=28.22 E-value=46 Score=21.83 Aligned_cols=15 Identities=40% Similarity=0.563 Sum_probs=11.6
Q ss_pred HHHHHHHHHHHhhhc
Q psy10995 250 VLFAVSTLFAFYACR 264 (265)
Q Consensus 250 ~~~~~~~~~~~~~~~ 264 (265)
-+|++|+|..++.+|
T Consensus 4 tlfllstlamlwrrr 18 (54)
T PF15201_consen 4 TLFLLSTLAMLWRRR 18 (54)
T ss_pred hHHHHHHHHHHHHHH
Confidence 367888888888776
No 74
>PF06682 DUF1183: Protein of unknown function (DUF1183); InterPro: IPR009567 This family consists of several eukaryotic proteins of around 360 residues in length. The function of this family is unknown.
Probab=27.98 E-value=45 Score=30.81 Aligned_cols=16 Identities=25% Similarity=0.613 Sum_probs=9.2
Q ss_pred HHHHHHHHHHHHHHhh
Q psy10995 247 LLIVLFAVSTLFAFYA 262 (265)
Q Consensus 247 ~~~~~~~~~~~~~~~~ 262 (265)
|+.|||+++++|+||+
T Consensus 159 lf~ii~l~vla~ivY~ 174 (318)
T PF06682_consen 159 LFWIIFLLVLAFIVYS 174 (318)
T ss_pred hhhHHHHHHHHHHHHH
Confidence 4455555566666664
No 75
>PF13150 DUF3989: Protein of unknown function (DUF3989)
Probab=27.70 E-value=91 Score=22.99 Aligned_cols=24 Identities=25% Similarity=0.455 Sum_probs=14.0
Q ss_pred hhhhhhhhHHHHHHHHHHHHHHHH
Q psy10995 237 NRQIRESLYILLIVLFAVSTLFAF 260 (265)
Q Consensus 237 ~~~~~~~~~~~~~~~~~~~~~~~~ 260 (265)
++..|.-+-+.++++||+..|+.+
T Consensus 23 sp~~R~~vvl~ml~~fa~l~ly~~ 46 (85)
T PF13150_consen 23 SPKQRLRVVLVMLVLFAALCLYMT 46 (85)
T ss_pred CHHHHHHHHHHHHHHHHHHHHHHH
Confidence 444666565666666666655544
No 76
>KOG3462|consensus
Probab=27.53 E-value=1e+02 Score=23.31 Aligned_cols=30 Identities=17% Similarity=0.202 Sum_probs=22.9
Q ss_pred CccchhhhhhhhHHHHHHHHHHHHHHHHhh
Q psy10995 233 ESATNRQIRESLYILLIVLFAVSTLFAFYA 262 (265)
Q Consensus 233 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 262 (265)
+.+.++....++..+|-+||++.-||+=|+
T Consensus 24 ~~~~~eD~~pdYmn~lgmIfsmcGlM~r~K 53 (105)
T KOG3462|consen 24 QGAANEDPPPDYMNFLGMIFSMCGLMFRLK 53 (105)
T ss_pred ccccccCCChhHHHHHHHHHHHHHHHHHHH
Confidence 344556667778889999999999987664
No 77
>PHA01815 hypothetical protein
Probab=27.35 E-value=1e+02 Score=20.00 Aligned_cols=14 Identities=29% Similarity=0.702 Sum_probs=9.4
Q ss_pred HHHHHHHHHHHHhh
Q psy10995 249 IVLFAVSTLFAFYA 262 (265)
Q Consensus 249 ~~~~~~~~~~~~~~ 262 (265)
++||-++.|+.+|+
T Consensus 38 ~iifyiifl~viya 51 (55)
T PHA01815 38 LIIFYIIFLMVIYA 51 (55)
T ss_pred HHHHHHHHHHHHHH
Confidence 56666677777775
No 78
>PF09604 Potass_KdpF: F subunit of K+-transporting ATPase (Potass_KdpF); InterPro: IPR011726 ATPases (or ATP synthases) are membrane-bound enzyme complexes/ion transporters that combine ATP synthesis and/or hydrolysis with the transport of protons across a membrane. ATPases can harness the energy from a proton gradient, using the flux of ions across the membrane via the ATPase proton channel to drive the synthesis of ATP. Some ATPases work in reverse, using the energy from the hydrolysis of ATP to create a proton gradient. There are different types of ATPases, which can differ in function (ATP synthesis and/or hydrolysis), structure (e.g., F-, V- and A-ATPases, which contain rotary motors) and in the type of ions they transport [, ]. The different types include: F-ATPases (F1F0-ATPases), which are found in mitochondria, chloroplasts and bacterial plasma membranes where they are the prime producers of ATP, using the proton gradient generated by oxidative phosphorylation (mitochondria) or photosynthesis (chloroplasts). V-ATPases (V1V0-ATPases), which are primarily found in eukaryotic vacuoles and catalyse ATP hydrolysis to transport solutes and lower pH in organelles. A-ATPases (A1A0-ATPases), which are found in Archaea and function like F-ATPases (though with respect to their structure and some inhibitor responses, A-ATPases are more closely related to the V-ATPases). P-ATPases (E1E2-ATPases), which are found in bacteria and in eukaryotic plasma membranes and organelles, and function to transport a variety of different ions across membranes. E-ATPases, which are cell-surface enzymes that hydrolyse a range of NTPs, including extracellular ATP. P-ATPases (sometime known as E1-E2 ATPases) (3.6.3.- from EC) are found in bacteria and in a number of eukaryotic plasma membranes and organelles []. P-ATPases function to transport a variety of different compounds, including ions and phospholipids, across a membrane using ATP hydrolysis for energy. There are many different classes of P-ATPases, each of which transports a specific type of ion: H+, Na+, K+, Mg2+, Ca2+, Ag+ and Ag2+, Zn2+, Co2+, Pb2+, Ni2+, Cd2+, Cu+ and Cu2+. P-ATPases can be composed of one or two polypeptides, and can usually assume two main conformations called E1 and E2. This entry represents the F subunit (KdpF) of a P-type K+-translocating ATPase (Kdp). KdpF is a very small integral membrane peptide. The kdpABC operon of Escherichia coli codes for the high affinity K+-translocating Kdp complex []. KdpF is found upstream of the KdpA subunit (IPR004623 from INTERPRO). Because of its very small size and highly hydrophobic character, it is sometimes missed in genome annotation. More information about this protein can be found at Protein of the Month: ATP Synthases [].
Probab=27.09 E-value=78 Score=17.86 Aligned_cols=15 Identities=20% Similarity=0.279 Sum_probs=6.2
Q ss_pred HHHHHHHHHHHHHhh
Q psy10995 248 LIVLFAVSTLFAFYA 262 (265)
Q Consensus 248 ~~~~~~~~~~~~~~~ 262 (265)
+.++.+.....++|+
T Consensus 4 ~~~v~~~L~~YL~~a 18 (25)
T PF09604_consen 4 GGIVAVALFVYLFYA 18 (25)
T ss_pred HHHHHHHHHHHHHHH
Confidence 333333333444543
No 79
>PF05781 MRVI1: MRVI1 protein; InterPro: IPR008677 This family consists of mammalian MRVI1 proteins which are related to the lymphoid-restricted membrane protein (JAW1) and the IP3 receptor associated cGMP kinase substrates A and B (IRAGA and IRAGB). The function of MRVI1 is unknown although mutations in the Mrvi1 gene induces myeloid leukaemia by altering the expression of a gene important for myeloid cell growth and/or differentiation so it has been speculated that Mrvi1 is a tumour suppressor gene []. IRAG is very similar in sequence to MRVI1 and is an essential NO/cGKI-dependent regulator of IP3-induced calcium release. Activation of cGKI decreases IP3-stimulated elevations in intracellular calcium, induces smooth muscle relaxation and contributes to the antiproliferative and pro-apoptotic effects of NO/cGMP []. Jaw1 is a member of a class of proteins with COOH-terminal hydrophobic membrane anchors and is structurally similar to proteins involved in vesicle targeting and fusion. This suggests that the function and/or the structure of the ER in lymphocytes may be modified by lymphoid-restricted resident ER proteins [].
Probab=27.00 E-value=55 Score=32.34 Aligned_cols=21 Identities=19% Similarity=0.523 Sum_probs=14.2
Q ss_pred hhhHHHHHHHHHHHHHHHHhh
Q psy10995 242 ESLYILLIVLFAVSTLFAFYA 262 (265)
Q Consensus 242 ~~~~~~~~~~~~~~~~~~~~~ 262 (265)
--|+|+|.+|+.|.+|+.||.
T Consensus 478 K~LWIsvAliVLLAaLlSfLt 498 (538)
T PF05781_consen 478 KVLWISVALIVLLAALLSFLT 498 (538)
T ss_pred HHHHHHHHHHHHHHHHHHHHh
Confidence 345677777777777777775
No 80
>PHA02955 hypothetical protein; Provisional
Probab=26.77 E-value=57 Score=28.34 Aligned_cols=21 Identities=29% Similarity=0.444 Sum_probs=16.0
Q ss_pred hHHHHHHHHHHHHHHHHhhhc
Q psy10995 244 LYILLIVLFAVSTLFAFYACR 264 (265)
Q Consensus 244 ~~~~~~~~~~~~~~~~~~~~~ 264 (265)
.||++.|+.+++++|+-|..|
T Consensus 181 w~ii~~v~ii~~~v~l~yikR 201 (213)
T PHA02955 181 WFIIYIVLCLLILIILGYIYR 201 (213)
T ss_pred chhHHHHHHHHHHHHHHHHHH
Confidence 577888888778888777766
No 81
>PF02495 7kD_coat: 7kD viral coat protein; InterPro: IPR003411 This family consists of a 7 kDa coat protein from Carlavirus and Potexvirus [].
Probab=25.29 E-value=61 Score=21.95 Aligned_cols=16 Identities=31% Similarity=0.627 Sum_probs=7.8
Q ss_pred HHHHHHHHHHHHHHHh
Q psy10995 246 ILLIVLFAVSTLFAFY 261 (265)
Q Consensus 246 ~~~~~~~~~~~~~~~~ 261 (265)
++++.+.++++++++|
T Consensus 2 ~~l~~i~~l~~~~~l~ 17 (59)
T PF02495_consen 2 ILLLLIGLLAFLLTLY 17 (59)
T ss_pred cHHHHHHHHHHHHHHH
Confidence 3444455555544444
No 82
>PF05545 FixQ: Cbb3-type cytochrome oxidase component FixQ; InterPro: IPR008621 This family consists of several Cbb3-type cytochrome oxidase components (FixQ/CcoQ). FixQ is found in nitrogen fixing bacteria. Since nitrogen fixation is an energy-consuming process, effective symbioses depend on operation of a respiratory chain with a high affinity for O2, closely coupled to ATP production. This requirement is fulfilled by a special three-subunit terminal oxidase (cytochrome terminal oxidase cbb3), which was first identified in Bradyrhizobium japonicum as the product of the fixNOQP operon [].
Probab=24.85 E-value=1.2e+02 Score=19.51 Aligned_cols=19 Identities=21% Similarity=0.376 Sum_probs=7.4
Q ss_pred hhHHHHHHHHHHHHHHHHh
Q psy10995 243 SLYILLIVLFAVSTLFAFY 261 (265)
Q Consensus 243 ~~~~~~~~~~~~~~~~~~~ 261 (265)
++.+++++++.+.+++..|
T Consensus 12 ~~~~v~~~~~F~gi~~w~~ 30 (49)
T PF05545_consen 12 SIGTVLFFVFFIGIVIWAY 30 (49)
T ss_pred HHHHHHHHHHHHHHHHHHH
Confidence 3433333333333333333
No 83
>COG0690 SecE Preprotein translocase subunit SecE [Intracellular trafficking and secretion]
Probab=24.81 E-value=1.1e+02 Score=21.79 Aligned_cols=21 Identities=24% Similarity=0.365 Sum_probs=9.1
Q ss_pred hhhhhhhhHHHHHHHHHHHHH
Q psy10995 237 NRQIRESLYILLIVLFAVSTL 257 (265)
Q Consensus 237 ~~~~~~~~~~~~~~~~~~~~~ 257 (265)
.+|+..+..+-|.+++.++++
T Consensus 39 rke~~~~t~~Vl~~v~~~s~~ 59 (73)
T COG0690 39 RKELIRSTLIVLVVVAFFSLF 59 (73)
T ss_pred HHHHHHHHHHHHHHHHHHHHH
Confidence 455555553333333333333
No 84
>KOG3385|consensus
Probab=24.11 E-value=72 Score=25.03 Aligned_cols=15 Identities=13% Similarity=0.578 Sum_probs=9.1
Q ss_pred HHHHHHHHHHHHHHh
Q psy10995 247 LLIVLFAVSTLFAFY 261 (265)
Q Consensus 247 ~~~~~~~~~~~~~~~ 261 (265)
..+.+|++++||+||
T Consensus 100 ~~m~~f~lV~~fi~~ 114 (118)
T KOG3385|consen 100 CWMAVFSLVAFFILW 114 (118)
T ss_pred HHHHHHHHHHHHHhh
Confidence 334566666777665
No 85
>PF02553 CbiN: Cobalt transport protein component CbiN; InterPro: IPR003705 The cobalt transport protein CbiN is part of the active cobalt transport system involved in uptake of cobalt in to the cell involved with cobalamin biosynthesis (vitamin B12). It has been suggested that CbiN may function as the periplasmic binding protein component of the active cobalt transport system [].; GO: 0015087 cobalt ion transmembrane transporter activity, 0006824 cobalt ion transport, 0009236 cobalamin biosynthetic process, 0016020 membrane
Probab=23.90 E-value=71 Score=23.02 Aligned_cols=16 Identities=25% Similarity=0.451 Sum_probs=7.6
Q ss_pred HHHHHHHHHHHHHHHh
Q psy10995 246 ILLIVLFAVSTLFAFY 261 (265)
Q Consensus 246 ~~~~~~~~~~~~~~~~ 261 (265)
|+++|++.+++.|++|
T Consensus 6 l~~~vv~l~~~pl~~~ 21 (74)
T PF02553_consen 6 LLLLVVALAVVPLFLY 21 (74)
T ss_pred HHHHHHHHHHHHHHHh
Confidence 4445444444444444
No 86
>PF03742 PetN: PetN ; InterPro: IPR005497 PetN is a small hydrophobic protein, crucial for cytochrome b6-f complex assembly and/or stability. It is found in bacteria and plants. Cytochrome b6-f complex is composed of 4 large subunits: cytochrome b6, subunit IV (17 kDa polypeptide, petD), cytochrome f and the Rieske protein, as well as 4 small subunits: petG, petL, petM and petN. The complex functions as a dimer. The cytochrome b6-f complex mediates electron transfer between photosystem II (PSII) and photosystem I (PSI) [].; GO: 0045158 electron transporter, transferring electrons within cytochrome b6/f complex of photosystem II activity, 0017004 cytochrome complex assembly, 0009512 cytochrome b6f complex; PDB: 2ZT9_H 2D2C_H 2E76_H 1VF5_U 2E75_H 2E74_H.
Probab=23.43 E-value=1.3e+02 Score=17.65 Aligned_cols=18 Identities=22% Similarity=0.403 Sum_probs=13.4
Q ss_pred HHHHHHHHHHHHHHHhhh
Q psy10995 246 ILLIVLFAVSTLFAFYAC 263 (265)
Q Consensus 246 ~~~~~~~~~~~~~~~~~~ 263 (265)
..|.++|.||.-+..+.+
T Consensus 9 aal~~~ftfSlalVVWGR 26 (29)
T PF03742_consen 9 AALMVVFTFSLALVVWGR 26 (29)
T ss_dssp HHHHHHHHHHHHHHHHHC
T ss_pred HHHHHHHhccceeEEEec
Confidence 567888888887776654
No 87
>PHA03231 glycoprotein BALF4; Provisional
Probab=23.20 E-value=60 Score=33.90 Aligned_cols=17 Identities=29% Similarity=0.477 Sum_probs=7.7
Q ss_pred HHHHHHHHHHHHHHHhh
Q psy10995 246 ILLIVLFAVSTLFAFYA 262 (265)
Q Consensus 246 ~~~~~~~~~~~~~~~~~ 262 (265)
|+|+||+.|++.|++|.
T Consensus 709 iillvia~vv~v~l~~r 725 (829)
T PHA03231 709 IGLLVIAVLVAVFLAYR 725 (829)
T ss_pred HHHHHHHHhhhhhHHHH
Confidence 44444444444444443
No 88
>PHA03255 BDLF3; Provisional
Probab=23.13 E-value=63 Score=27.16 Aligned_cols=15 Identities=27% Similarity=0.831 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHHHHH
Q psy10995 245 YILLIVLFAVSTLFA 259 (265)
Q Consensus 245 ~~~~~~~~~~~~~~~ 259 (265)
|+||+.|||...+|+
T Consensus 194 flmlilifaagimma 208 (234)
T PHA03255 194 FLMLILIFAAGLMMS 208 (234)
T ss_pred HHHHHHHHHhhHhhh
No 89
>PF11712 Vma12: Endoplasmic reticulum-based factor for assembly of V-ATPase; InterPro: IPR021013 ATPases (or ATP synthases) are membrane-bound enzyme complexes/ion transporters that combine ATP synthesis and/or hydrolysis with the transport of protons across a membrane. ATPases can harness the energy from a proton gradient, using the flux of ions across the membrane via the ATPase proton channel to drive the synthesis of ATP. Some ATPases work in reverse, using the energy from the hydrolysis of ATP to create a proton gradient. There are different types of ATPases, which can differ in function (ATP synthesis and/or hydrolysis), structure (e.g., F-, V- and A-ATPases, which contain rotary motors) and in the type of ions they transport [, ]. The different types include: F-ATPases (F1F0-ATPases), which are found in mitochondria, chloroplasts and bacterial plasma membranes where they are the prime producers of ATP, using the proton gradient generated by oxidative phosphorylation (mitochondria) or photosynthesis (chloroplasts). V-ATPases (V1V0-ATPases), which are primarily found in eukaryotic vacuoles and catalyse ATP hydrolysis to transport solutes and lower pH in organelles. A-ATPases (A1A0-ATPases), which are found in Archaea and function like F-ATPases (though with respect to their structure and some inhibitor responses, A-ATPases are more closely related to the V-ATPases). P-ATPases (E1E2-ATPases), which are found in bacteria and in eukaryotic plasma membranes and organelles, and function to transport a variety of different ions across membranes. E-ATPases, which are cell-surface enzymes that hydrolyse a range of NTPs, including extracellular ATP. V-ATPases (also known as V1V0-ATPase or vacuolar ATPase) (3.6.3.14 from EC) are found in the eukaryotic endomembrane system, and in the plasma membrane of prokaryotes and certain specialised eukaryotic cells. V-ATPases hydrolyse ATP to drive a proton pump, and are involved in a variety of vital intra- and inter-cellular processes such as receptor mediated endocytosis, protein trafficking, active transport of metabolites, homeostasis and neurotransmitter release []. V-ATPases are composed of two linked complexes: the V1 complex (subunits A-H) contains the catalytic core that hydrolyses ATP, while the V0 complex (subunits a, c, c', c'', d) forms the membrane-spanning pore. V-ATPases may have an additional role in membrane fusion through binding to t-SNARE proteins []. The yeast vacuolar proton-translocating ATPase (V-ATPase) is the best characterised member of the V-ATPase family. A total of thirteen genes are required for encoding the subunits of the enzyme complex itself and an additional three for providing factors necessary for the assembly of the whole. Vma12 is one of these latter, all three of which are localised to the endoplasmic reticulum [].
Probab=23.11 E-value=97 Score=24.80 Aligned_cols=25 Identities=16% Similarity=0.328 Sum_probs=18.5
Q ss_pred cchhhhhhhhHHHHHHHHHHHHHHH
Q psy10995 235 ATNRQIRESLYILLIVLFAVSTLFA 259 (265)
Q Consensus 235 ~~~~~~~~~~~~~~~~~~~~~~~~~ 259 (265)
+...+++..+.+.+++|+++++.|+
T Consensus 70 ~~~k~~~~qls~v~Nilvsv~~~~~ 94 (142)
T PF11712_consen 70 QELKSVKRQLSTVFNILVSVFAVFF 94 (142)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4457788888888888888765544
No 90
>PF05084 GRA6: Granule antigen protein (GRA6); InterPro: IPR008119 Toxoplasma gondii is an obligate intracellular apicomplexan protozoan parasite, with a complex lifestyle involving varied hosts []. It has two phases of growth: an intestinal phase in feline hosts, and an extra-intestinal phase in other mammals. Oocysts from infected cats develop into tachyzoites, and eventually, bradyzoites and zoitocysts in the extraintestinal host []. Transmission of the parasite occurs through contact with infected cats or raw/undercooked meat; in immunocompromised individuals, it can cause severe and often lethal toxoplasmosis. Acute infection in healthy humans can sometimes also cause tissue damage []. The protozoan utilises a variety of secretory and antigenic proteins to invade a host and gain access to the intracellular environment []. These originate from distinct organelles in the T. gondii cell termed micronemes, rhoptries, and dense granules. They are released at specific times during invasion to ensure the proteins are allocated to their correct target destinations []. Dense granule antigens (GRAs) are released from the T. gondii tachyzoite while still encapsulated in a host vacuole. Gra6, one of these moieties, is associated with the parasitophorous vacuole []. It possesses a hydrophobic central region flanked by two hydrophilic domains, and is present as a single copy gene in the Toxoplasma gondii genome []. Gra6 shares a similar function with Gra2, in that it is rapidly targeted to a network of membranous tubules that connect with the vacuolar membrane []. Indeed, these two proteins, together with Gra4, form a multimeric complex that stabilises the parasite within the vacuole.
Probab=22.64 E-value=1e+02 Score=25.68 Aligned_cols=27 Identities=33% Similarity=0.314 Sum_probs=16.8
Q ss_pred hhhhhhhhHHHHHHHHHHH-HHHHHhhhc
Q psy10995 237 NRQIRESLYILLIVLFAVS-TLFAFYACR 264 (265)
Q Consensus 237 ~~~~~~~~~~~~~~~~~~~-~~~~~~~~~ 264 (265)
..|.||-| |.-.||++++ +|++||.+|
T Consensus 146 R~Q~RHR~-IG~~VlA~~VA~L~~~F~RR 173 (215)
T PF05084_consen 146 RTQKRHRL-IGAVVLAVSVAMLTWFFLRR 173 (215)
T ss_pred HHHHHHHH-HHHHHHHHHHHHHHHHHHHh
Confidence 46889988 6666655443 455556554
No 91
>TIGR02115 potass_kdpF K+-transporting ATPase, KdpF subunit. This model describes a very small integral membrane peptide KdpF, a subunit of the K(+)-translocating Kdp complex. It is found upstream of the KdpA subunit (TIGR00680). Because of its very small size and highly hydrophobic character, it is sometimes missed in genome annotation.
Probab=22.61 E-value=1e+02 Score=17.59 Aligned_cols=7 Identities=43% Similarity=1.032 Sum_probs=2.8
Q ss_pred HHHHHhh
Q psy10995 256 TLFAFYA 262 (265)
Q Consensus 256 ~~~~~~~ 262 (265)
...|+|+
T Consensus 11 ~~YL~~a 17 (26)
T TIGR02115 11 FIYLFYA 17 (26)
T ss_pred HHHHHHH
Confidence 3344443
No 92
>PF07204 Orthoreo_P10: Orthoreovirus membrane fusion protein p10; InterPro: IPR009854 This family consists of several Orthoreovirus membrane fusion protein p10 sequences. p10 is thought to be a multifunctional protein that plays a key role in virus-host interaction [].
Probab=22.43 E-value=60 Score=24.50 Aligned_cols=12 Identities=17% Similarity=0.412 Sum_probs=7.3
Q ss_pred HHHHHHHhhhcC
Q psy10995 254 VSTLFAFYACRQ 265 (265)
Q Consensus 254 ~~~~~~~~~~~~ 265 (265)
+++|+|||-||+
T Consensus 55 lIii~Lv~CC~~ 66 (98)
T PF07204_consen 55 LIIIALVCCCRA 66 (98)
T ss_pred HHHHHHHHHhhh
Confidence 334777777763
No 93
>PF12575 DUF3753: Protein of unknown function (DUF3753); InterPro: IPR009175 This group represents an uncharacterised conserved protein belonging to poxvirus family I2.
Probab=21.95 E-value=1.5e+02 Score=21.30 Aligned_cols=12 Identities=17% Similarity=0.445 Sum_probs=4.7
Q ss_pred HHHHHHHHHHHH
Q psy10995 247 LLIVLFAVSTLF 258 (265)
Q Consensus 247 ~~~~~~~~~~~~ 258 (265)
++++++++++||
T Consensus 53 i~ii~v~ii~~l 64 (72)
T PF12575_consen 53 ISIIFVLIIVLL 64 (72)
T ss_pred HHHHHHHHHHHH
Confidence 333333334443
No 94
>PHA02692 hypothetical protein; Provisional
Probab=21.78 E-value=1.4e+02 Score=21.26 Aligned_cols=10 Identities=20% Similarity=0.401 Sum_probs=3.8
Q ss_pred HHHHHHHHHH
Q psy10995 250 VLFAVSTLFA 259 (265)
Q Consensus 250 ~~~~~~~~~~ 259 (265)
+++++++||+
T Consensus 54 ~~~~~~vll~ 63 (70)
T PHA02692 54 IAAAIGVLLC 63 (70)
T ss_pred HHHHHHHHHH
Confidence 3333344443
No 95
>PF03935 SKN1: Beta-glucan synthesis-associated protein (SKN1); InterPro: IPR005629 This family consists of the beta-glucan synthesis-associated proteins KRE6 and SKN1. Beta1,6-Glucan is a key component of the yeast cell wall, interconnecting cell wall proteins, beta1,3-glucan, and chitin. It has been postulated that the synthesis of beta1,6-glucan begins in the endoplasmic reticulum with the formation of protein-bound primer structures and that these primer structures are extended in the Golgi complex by two putative glucosyltransferases that are functionally redundant, Kre6 and Skn1. This is followed by maturation steps at the cell surface and by coupling to other cell wall macromolecules [].
Probab=21.74 E-value=78 Score=31.15 Aligned_cols=27 Identities=22% Similarity=0.335 Sum_probs=17.5
Q ss_pred ccchhhhhhhhHHHHHHHHHHHHHHHHh
Q psy10995 234 SATNRQIRESLYILLIVLFAVSTLFAFY 261 (265)
Q Consensus 234 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 261 (265)
..+.|.+. ++..|+++++++++||+.|
T Consensus 36 ~~s~RGl~-Nlg~LilL~lGLL~LFigy 62 (504)
T PF03935_consen 36 IFSRRGLL-NLGGLILLILGLLMLFIGY 62 (504)
T ss_pred ccchhhhh-HHHHHHHHHHHHHHHHHHh
Confidence 34445444 4556677777788888877
No 96
>PTZ00227 variable surface protein Vir14; Provisional
Probab=21.30 E-value=65 Score=30.91 Aligned_cols=22 Identities=23% Similarity=0.395 Sum_probs=12.1
Q ss_pred hhhhHHHHHHHHHHH-HHHHHhh
Q psy10995 241 RESLYILLIVLFAVS-TLFAFYA 262 (265)
Q Consensus 241 ~~~~~~~~~~~~~~~-~~~~~~~ 262 (265)
++.+...++.||+++ +||++|+
T Consensus 324 ~~~~~~~~~~i~~~~~~~~~~yk 346 (418)
T PTZ00227 324 TDDITPPLLGIVGVLLILWALYK 346 (418)
T ss_pred hhhccchHHHHHHHHHHHHHHHh
Confidence 344444455555544 4778884
No 97
>PF11857 DUF3377: Domain of unknown function (DUF3377); InterPro: IPR021805 This domain is functionally uncharacterised and found at the C terminus of peptidases belonging to MEROPS peptidase family M10A, membrane-type matrix metallopeptidases (clan MA). ; GO: 0004222 metalloendopeptidase activity
Probab=21.25 E-value=73 Score=22.94 Aligned_cols=21 Identities=29% Similarity=0.569 Sum_probs=14.2
Q ss_pred hhhhhHHHHHHHHHHHHHHHH
Q psy10995 240 IRESLYILLIVLFAVSTLFAF 260 (265)
Q Consensus 240 ~~~~~~~~~~~~~~~~~~~~~ 260 (265)
|-.-+.++|+|++.+-++|.|
T Consensus 34 VviPl~L~LCiLvl~yai~~f 54 (74)
T PF11857_consen 34 VVIPLVLLLCILVLIYAIFQF 54 (74)
T ss_pred EeHHHHHHHHHHHHHHHhhee
Confidence 334466788888887777753
No 98
>PF09788 Tmemb_55A: Transmembrane protein 55A; InterPro: IPR019178 Members of this family catalyse the hydrolysis of the 4-position phosphate of phosphatidylinositol 4,5-bisphosphate, in the reaction: 1-phosphatidyl-myo-inositol 4,5-bisphosphate + H(2)O = 1-phosphatidyl-1D-myo-inositol 5-phosphate + phosphate.
Probab=20.75 E-value=1.2e+02 Score=27.02 Aligned_cols=12 Identities=8% Similarity=-0.297 Sum_probs=5.9
Q ss_pred CCCCCCCCCCCC
Q psy10995 222 TPTPPSGNDSPE 233 (265)
Q Consensus 222 CC~~C~~~~~~~ 233 (265)
=||.|..-.+-+
T Consensus 179 RCPHCrKvSSVG 190 (256)
T PF09788_consen 179 RCPHCRKVSSVG 190 (256)
T ss_pred cCCCCceecccc
Confidence 355666544333
No 99
>PRK03625 tatE twin arginine translocase protein E; Validated
Probab=20.11 E-value=75 Score=22.40 Aligned_cols=10 Identities=20% Similarity=0.398 Sum_probs=4.2
Q ss_pred HHHHHHHHHH
Q psy10995 251 LFAVSTLFAF 260 (265)
Q Consensus 251 ~~~~~~~~~~ 260 (265)
|+++++|++|
T Consensus 11 iIlvI~lllF 20 (67)
T PRK03625 11 VVAALVVLLF 20 (67)
T ss_pred HHHHHHHHHc
Confidence 3344444443
No 100
>PF05795 Plasmodium_Vir: Plasmodium vivax Vir protein; InterPro: IPR008780 This family consists of several Vir proteins specific to the genus Plasmodium and Plasmodium vivax in particular. The vir genes are present at about 600-1,000 copies per haploid genome and encode proteins that are immunovariant in natural infections, indicating that they may have a functional role in establishing chronic infection through antigenic variation [].
Probab=20.08 E-value=77 Score=28.62 Aligned_cols=19 Identities=26% Similarity=0.272 Sum_probs=14.9
Q ss_pred hHHHHHHHHHHHHHHHHhh
Q psy10995 244 LYILLIVLFAVSTLFAFYA 262 (265)
Q Consensus 244 ~~~~~~~~~~~~~~~~~~~ 262 (265)
....|.++.++++||+||+
T Consensus 285 v~~~~~~~G~~~~~f~LYK 303 (354)
T PF05795_consen 285 VSPVLSVLGIPLIFFLLYK 303 (354)
T ss_pred hhhhhhhHHHHHHHHHHhc
Confidence 3456777888888999997
No 101
>PHA02975 hypothetical protein; Provisional
Probab=20.01 E-value=1.3e+02 Score=21.38 Aligned_cols=16 Identities=19% Similarity=0.636 Sum_probs=6.3
Q ss_pred hHHHHHHHHHHHHHHH
Q psy10995 244 LYILLIVLFAVSTLFA 259 (265)
Q Consensus 244 ~~~~~~~~~~~~~~~~ 259 (265)
+.|++++.+++++||.
T Consensus 46 ~~ii~i~~v~~~~~~~ 61 (69)
T PHA02975 46 ILIIFIIFITCIAVFT 61 (69)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 3344433333444443
Done!