Query psy14123
Match_columns 102
No_of_seqs 91 out of 93
Neff 3.7
Searched_HMMs 46136
Date Fri Aug 16 20:09:08 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy14123.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/14123hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF14770 TMEM18: Transmembrane 100.0 5.7E-40 1.2E-44 235.9 5.8 88 4-91 25-121 (123)
2 PF12357 PLD_C: Phospholipase 92.4 0.11 2.4E-06 35.1 2.3 27 23-49 12-38 (74)
3 PF05827 ATP-synt_S1: Vacuolar 52.9 9.7 0.00021 29.6 1.8 29 38-67 248-276 (282)
4 PF08031 BBE: Berberine and be 45.1 9.5 0.00021 22.7 0.6 26 26-52 14-41 (47)
5 PLN02270 phospholipase D alpha 38.5 20 0.00043 33.5 1.8 22 23-44 736-757 (808)
6 PLN02352 phospholipase D epsil 38.0 23 0.0005 32.8 2.1 26 24-49 686-711 (758)
7 PRK14399 membrane protein; Pro 37.7 1E+02 0.0023 25.1 5.6 53 36-90 192-249 (258)
8 TIGR03834 EAGR_box EAGR box. T 36.2 14 0.0003 21.0 0.3 11 42-52 18-28 (28)
9 KOG2574|consensus 34.6 20 0.00043 31.7 1.1 31 24-54 457-487 (492)
10 PF13783 DUF4177: Domain of un 34.4 18 0.00038 22.3 0.6 16 24-39 22-37 (61)
11 PF02428 Prot_inhib_II: Potato 34.2 15 0.00033 23.4 0.2 13 40-52 37-49 (52)
12 PF14997 CECR6_TMEM121: CECR6/ 34.2 27 0.00057 26.9 1.6 48 5-64 8-55 (195)
13 PLN02292 ferric-chelate reduct 32.4 41 0.00088 30.7 2.7 43 26-69 36-79 (702)
14 PF01473 CW_binding_1: Putativ 32.1 19 0.0004 17.6 0.3 8 43-50 11-18 (19)
15 PLN03008 Phospholipase D delta 30.8 37 0.00081 32.0 2.3 21 24-44 798-818 (868)
16 PF06703 SPC25: Microsomal sig 30.1 23 0.00049 25.6 0.6 15 41-55 136-150 (162)
17 PF05399 EVI2A: Ectropic viral 29.3 39 0.00085 27.3 1.9 46 49-95 129-176 (227)
18 PF04336 DUF479: Protein of un 28.1 33 0.00071 23.4 1.1 13 31-43 4-16 (106)
19 PF15156 CLN6: Ceroid-lipofusc 27.9 80 0.0017 26.1 3.4 32 40-71 224-255 (284)
20 PF04030 ALO: D-arabinono-1,4- 27.2 15 0.00033 27.8 -0.7 20 34-54 233-254 (259)
21 PF06365 CD34_antigen: CD34/Po 25.5 1.1E+02 0.0023 24.1 3.6 29 41-69 92-120 (202)
22 PF12400 DUF3661: Vaculolar me 25.4 2E+02 0.0043 20.8 4.8 64 17-82 18-94 (129)
23 PF09779 Ima1_N: Ima1 N-termin 24.8 32 0.00069 24.6 0.6 14 40-54 27-40 (131)
24 PLN02844 oxidoreductase/ferric 24.5 89 0.0019 28.7 3.5 47 21-69 23-72 (722)
25 PF06413 Neugrin: Neugrin; In 23.8 40 0.00087 26.6 1.0 13 40-52 210-222 (225)
26 PF01219 DAGK_prokar: Prokaryo 23.2 81 0.0017 21.9 2.4 36 5-40 29-70 (104)
27 PF13209 DUF4017: Protein of u 21.2 84 0.0018 20.6 2.0 31 24-69 23-53 (60)
28 PF03620 IBV_3C: IBV 3C protei 20.8 83 0.0018 22.2 2.0 28 44-72 8-35 (93)
29 PF03140 DUF247: Plant protein 20.3 55 0.0012 26.7 1.2 40 24-63 348-390 (391)
30 COG3558 Uncharacterized protei 20.1 25 0.00055 26.6 -0.7 18 27-51 99-116 (154)
No 1
>PF14770 TMEM18: Transmembrane protein 18
Probab=100.00 E-value=5.7e-40 Score=235.89 Aligned_cols=88 Identities=39% Similarity=0.745 Sum_probs=82.1
Q ss_pred hhhhHhhhccchh---------hhHHHhhhhhhhhHHHHhhHhhhccccccCCCCceehhhhHHHHHHHHHHHHHHHHHH
Q psy14123 4 LLVFSIVLKPRPY---------LFILLVYFSENLNEFAHNNWRSFARHEYFDEKGMFISIVFSIPLLLNCMILVLKRKYT 74 (102)
Q Consensus 4 l~~~~i~l~~~~~---------~ll~lVy~aE~iNe~aa~nW~~Fs~qnYFDs~G~FISvv~S~PLLl~~~iili~~l~~ 74 (102)
+++++.++.+|+. +++++||+||+|||+||+|||+||+||||||+|+|||+|||+|+|++|++++++|+++
T Consensus 25 ~~~~~~~~~tr~~~~~q~~lf~~ll~~v~~aE~iN~~~a~nW~~Fs~qnYFDs~G~Fisvv~s~PlLl~~~ii~~~~l~~ 104 (123)
T PF14770_consen 25 VLLLLLAILTRRRYNFQMILFLILLLLVYCAEYINEYAARNWRSFSKQNYFDSSGVFISVVFSAPLLLNCLIILVNWLYQ 104 (123)
T ss_pred HHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcCCCCeeehHHHHHhHHHHHHHHHHHHHHH
Confidence 5566677777766 7788999999999999999999999999999999999999999999999999999999
Q ss_pred hcccccccccccccccc
Q psy14123 75 CKEPYPNLKTPLCLTPL 91 (102)
Q Consensus 75 ~~~lmv~~Kraql~~~~ 91 (102)
++++||++||+|+|+|+
T Consensus 105 ~~~lmv~~Kr~qlr~~~ 121 (123)
T PF14770_consen 105 LCSLMVQVKRAQLRRKA 121 (123)
T ss_pred HHHHHHHHHHHHHHHHh
Confidence 99999999999999876
No 2
>PF12357 PLD_C: Phospholipase D C terminal ; InterPro: IPR024632 Phospholipase D (PLD) catalyses the hydrolysis of the phosphodiester bond of glycerophospholipids to generate phosphatidic acid and a free head group. Phospholipase D activities have been detected in simple to complex organisms from viruses and bacteria to yeast, plants, and mammals []. In higher organisms, PLD specifically catalyzes the hydrolysis of phosphatidylcholine (PC) to phosphatidic acid (PA) and choline and is activated in response to stimulators of vesicle transport, endocytosis, exocytosis, cell migration, and mitosis. This entry represents the C-terminal domain of eukaryotic phospholipase D. The domain is approximately 70 amino acids in length and contains a conserved FPD sequence motif.
Probab=92.43 E-value=0.11 Score=35.12 Aligned_cols=27 Identities=33% Similarity=0.580 Sum_probs=22.4
Q ss_pred hhhhhhhHHHHhhHhhhccccccCCCC
Q psy14123 23 YFSENLNEFAHNNWRSFARHEYFDEKG 49 (102)
Q Consensus 23 y~aE~iNe~aa~nW~~Fs~qnYFDs~G 49 (102)
=|..++|+.|.+||+.|++...=|=.|
T Consensus 12 eCVr~Vn~iae~nW~~y~~ee~~dl~G 38 (74)
T PF12357_consen 12 ECVRRVNEIAEENWKQYASEEVTDLPG 38 (74)
T ss_pred HHHHHHHHHHHHHHHHhhccccccCCC
Confidence 367899999999999999888755555
No 3
>PF05827 ATP-synt_S1: Vacuolar ATP synthase subunit S1 (ATP6S1); InterPro: IPR024722 This family consists of metazoan vacuolar ATP synthase subunit S1 proteins [] and fungal proteins belonging to the BIG family. In Candida albicans BIG is required for normal beta-1,6-glucan synthesis, hyphal morphogenesis, adhesion and virulence [].
Probab=52.93 E-value=9.7 Score=29.55 Aligned_cols=29 Identities=21% Similarity=0.451 Sum_probs=22.5
Q ss_pred hhccccccCCCCceehhhhHHHHHHHHHHH
Q psy14123 38 SFARHEYFDEKGMFISIVFSIPLLLNCMIL 67 (102)
Q Consensus 38 ~Fs~qnYFDs~G~FISvv~S~PLLl~~~ii 67 (102)
.|.+.|+|.+ |.|++++-++.++.+.++-
T Consensus 248 lf~~yQFftp-gi~mglii~~~ll~IL~~g 276 (282)
T PF05827_consen 248 LFFDYQFFTP-GIWMGLIISLVLLSILYVG 276 (282)
T ss_pred eehhheeeec-cHHHHHHHHHHHHHHHHHH
Confidence 4889998876 7999999888877665543
No 4
>PF08031 BBE: Berberine and berberine like ; InterPro: IPR012951 This domain is found in the berberine bridge and berberine bridge-like enzymes, which are involved in the biosynthesis of numerous isoquinoline alkaloids. They catalyse the transformation of the N-methyl group of (S)-reticuline into the C-8 berberine bridge carbon of (S)-scoulerine [].; GO: 0016491 oxidoreductase activity, 0050660 flavin adenine dinucleotide binding, 0055114 oxidation-reduction process; PDB: 2IPI_A 2Y3S_A 2Y3R_D 2Y08_B 2Y4G_A 3D2H_A 3FW9_A 3FW8_A 3FWA_A 3D2J_A ....
Probab=45.15 E-value=9.5 Score=22.72 Aligned_cols=26 Identities=15% Similarity=0.427 Sum_probs=16.7
Q ss_pred hhhhHHHHhhHhhhc--cccccCCCCcee
Q psy14123 26 ENLNEFAHNNWRSFA--RHEYFDEKGMFI 52 (102)
Q Consensus 26 E~iNe~aa~nW~~Fs--~qnYFDs~G~FI 52 (102)
+...++-++||+... ++.| ||+|+|-
T Consensus 14 ~~~~~yyg~n~~rL~~iK~~y-DP~n~F~ 41 (47)
T PF08031_consen 14 DWQEAYYGENYDRLRAIKRKY-DPDNVFR 41 (47)
T ss_dssp HHHHHHHGGGHHHHHHHHHHH--TT-TS-
T ss_pred HHHHHHhchhHHHHHHHHHHh-CccceeC
Confidence 446667778888763 6666 9999995
No 5
>PLN02270 phospholipase D alpha
Probab=38.52 E-value=20 Score=33.46 Aligned_cols=22 Identities=14% Similarity=0.392 Sum_probs=19.4
Q ss_pred hhhhhhhHHHHhhHhhhccccc
Q psy14123 23 YFSENLNEFAHNNWRSFARHEY 44 (102)
Q Consensus 23 y~aE~iNe~aa~nW~~Fs~qnY 44 (102)
=|-.++|+.|.+||+.|++.+.
T Consensus 736 ~cv~~v~~~a~~~w~~y~~~~~ 757 (808)
T PLN02270 736 ECIQKVNQIADKYWDLYSSETL 757 (808)
T ss_pred HHHHHHHHHHHHHHHHhccccc
Confidence 3678999999999999998876
No 6
>PLN02352 phospholipase D epsilon
Probab=38.01 E-value=23 Score=32.78 Aligned_cols=26 Identities=19% Similarity=0.450 Sum_probs=21.6
Q ss_pred hhhhhhHHHHhhHhhhccccccCCCC
Q psy14123 24 FSENLNEFAHNNWRSFARHEYFDEKG 49 (102)
Q Consensus 24 ~aE~iNe~aa~nW~~Fs~qnYFDs~G 49 (102)
|-.++|+.|.+||+.|++.+.=|=.|
T Consensus 686 c~~~v~~~~~~~w~~y~~~~~~~~~g 711 (758)
T PLN02352 686 CVRRLRTIGEQMWEIYSGEEVVDMEG 711 (758)
T ss_pred HHHHHHHHHHHHHHhhccchhccCCC
Confidence 67889999999999999888755444
No 7
>PRK14399 membrane protein; Provisional
Probab=37.74 E-value=1e+02 Score=25.06 Aligned_cols=53 Identities=9% Similarity=0.155 Sum_probs=27.2
Q ss_pred HhhhccccccCCCCceehh-----hhHHHHHHHHHHHHHHHHHHhccccccccccccccc
Q psy14123 36 WRSFARHEYFDEKGMFISI-----VFSIPLLLNCMILVLKRKYTCKEPYPNLKTPLCLTP 90 (102)
Q Consensus 36 W~~Fs~qnYFDs~G~FISv-----v~S~PLLl~~~iili~~l~~~~~lmv~~Kraql~~~ 90 (102)
|+.-...+|||+.+.|-+= +|-+++++.---.++-|-++.- .+++|+.|+++-
T Consensus 192 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~i~~H~~N--I~rl~~~~E~k~ 249 (258)
T PRK14399 192 WKNYILFSLLNSFHYWFSNIWASGMLEGNIIILIGGLILVVRHSQN--IKRIKNKTEPDT 249 (258)
T ss_pred hhcchhhhhccccccccchhhhchhHHHHHHHHHHHHHHHHHHHHh--HHHHhcCCCcch
Confidence 5555566778876643332 3333333322233333444433 677788777543
No 8
>TIGR03834 EAGR_box EAGR box. The EAGR box (Enriched in Aromatic and Glycine Residues) is found in three different proteins of the Mycoplasma genitalium terminal organelle, which acts in both cytadherence and gliding motility. The presence of this domain in a genome predicts the Mycoplasma-type terminal organelle structure, gliding motility, and cytadherence. The EAGR box may occur from one to nine times in a protein.
Probab=36.21 E-value=14 Score=20.97 Aligned_cols=11 Identities=45% Similarity=1.045 Sum_probs=7.7
Q ss_pred ccccCCCCcee
Q psy14123 42 HEYFDEKGMFI 52 (102)
Q Consensus 42 qnYFDs~G~FI 52 (102)
.-|||++|-|+
T Consensus 18 ~GyFDe~~~w~ 28 (28)
T TIGR03834 18 KGYFDEDGNWV 28 (28)
T ss_pred eeEeCccCCCC
Confidence 45888888664
No 9
>KOG2574|consensus
Probab=34.61 E-value=20 Score=31.72 Aligned_cols=31 Identities=26% Similarity=0.280 Sum_probs=26.7
Q ss_pred hhhhhhHHHHhhHhhhccccccCCCCceehh
Q psy14123 24 FSENLNEFAHNNWRSFARHEYFDEKGMFISI 54 (102)
Q Consensus 24 ~aE~iNe~aa~nW~~Fs~qnYFDs~G~FISv 54 (102)
.-|-+|.-++++=..=+..|||-++|.|..+
T Consensus 457 glEivnp~~~~~~~~e~~~~yFS~~g~F~~i 487 (492)
T KOG2574|consen 457 GLEIVNPQAAEQQQPEENSKYFSSSGSFSKI 487 (492)
T ss_pred chhhcCHHHHhhcCcccccceecCCCceeec
Confidence 3578899999988778899999999999865
No 10
>PF13783 DUF4177: Domain of unknown function (DUF4177)
Probab=34.44 E-value=18 Score=22.27 Aligned_cols=16 Identities=31% Similarity=0.719 Sum_probs=13.8
Q ss_pred hhhhhhHHHHhhHhhh
Q psy14123 24 FSENLNEFAHNNWRSF 39 (102)
Q Consensus 24 ~aE~iNe~aa~nW~~F 39 (102)
+-+.||++|++-|+.=
T Consensus 22 ~~~~Ln~~g~eGWeLV 37 (61)
T PF13783_consen 22 LEEILNEYGKEGWELV 37 (61)
T ss_pred HHHHHHHHHhCCcEEE
Confidence 4688999999999973
No 11
>PF02428 Prot_inhib_II: Potato type II proteinase inhibitor family; InterPro: IPR003465 Peptide proteinase inhibitors can be found as single domain proteins or as single or multiple domains within proteins; these are referred to as either simple or compound inhibitors, respectively. In many cases they are synthesised as part of a larger precursor protein, either as a prepropeptide or as an N-terminal domain associated with an inactive peptidase or zymogen. This domain prevents access of the substrate to the active site. Removal of the N-terminal inhibitor domain either by interaction with a second peptidase or by autocatalytic cleavage activates the zymogen. Other inhibitors interact direct with proteinases using a simple noncovalent lock and key mechanism; while yet others use a conformational change-based trapping mechanism that depends on their structural and thermodynamic properties. Members of the Pin2 family are proteinase inhibitors that belong to MEROPS inhibitor family I20, clan IA and are restricted to plants. They inhibit serine peptidases belonging to MEROPS peptidase family S1 [] (IPR001254 from INTERPRO). They have a multidomain structure [], which permits circular permutation of the sequences. It was been shown that some naturally occurring Pin2 proteins, have an `ancestral' circularly permuted structure []. Circular permutation/ rearrangements of sequences has also been observed between species, such as favin from Vicia faba and the lectin concanavalin A from Canavalia ensiformis [] or amongst members of the plant aspartyl proteinases and human lung surfactant proteins []. The Pin2 family of proteinase inhibitors are present in seeds, leaves and other organs. Perhaps the best known representatives are the wound-induced proteinase inhibitors [, ], which contain up to eight sequence-repeats (the `IP repeats'). The sequence of the IP repeats is quite variable, only the cysteines constituting the four disulphide bridges and a single proline residue are conserved throughout all the known repeat sequences. The structure of the proteinase inhibitor complex is known [].; GO: 0004867 serine-type endopeptidase inhibitor activity; PDB: 1PJU_C 1OYV_I 1QH2_B 1FYB_A 1CE3_A 2JZM_A 1YTP_A 2JYY_A 1TIH_A 4SGB_I ....
Probab=34.17 E-value=15 Score=23.41 Aligned_cols=13 Identities=31% Similarity=0.746 Sum_probs=11.3
Q ss_pred ccccccCCCCcee
Q psy14123 40 ARHEYFDEKGMFI 52 (102)
Q Consensus 40 s~qnYFDs~G~FI 52 (102)
.+=+||+.+|.||
T Consensus 37 KGC~yy~~dGtfi 49 (52)
T PF02428_consen 37 KGCNYYSADGTFI 49 (52)
T ss_dssp TTSEEEETTTEEE
T ss_pred cccEEECCCCcEE
Confidence 4678999999997
No 12
>PF14997 CECR6_TMEM121: CECR6/TMEM121 family
Probab=34.16 E-value=27 Score=26.89 Aligned_cols=48 Identities=23% Similarity=0.431 Sum_probs=33.4
Q ss_pred hhhHhhhccchhhhHHHhhhhhhhhHHHHhhHhhhccccccCCCCceehhhhHHHHHHHH
Q psy14123 5 LVFSIVLKPRPYLFILLVYFSENLNEFAHNNWRSFARHEYFDEKGMFISIVFSIPLLLNC 64 (102)
Q Consensus 5 ~~~~i~l~~~~~~ll~lVy~aE~iNe~aa~nW~~Fs~qnYFDs~G~FISvv~S~PLLl~~ 64 (102)
++++++|.||...+...+. +|+ =....+||++++.+....|+|++...
T Consensus 8 ~lY~~~L~~Kv~~i~~~~~-~d~-----------l~~~~~~~~~~l~l~L~ls~p~lf~L 55 (195)
T PF14997_consen 8 FLYSIVLTPKVVFIFQNVI-ADY-----------LIELDIFGPTVLKLLLSLSVPLLFLL 55 (195)
T ss_pred HHHHHHHHHHHhhheeehh-hhh-----------ccccccccchhHHHHHHHHHHHHHHH
Confidence 4567888888766554333 332 33456799999999999999976543
No 13
>PLN02292 ferric-chelate reductase
Probab=32.36 E-value=41 Score=30.68 Aligned_cols=43 Identities=16% Similarity=0.371 Sum_probs=30.7
Q ss_pred hhhhHHHHhhHhhhc-cccccCCCCceehhhhHHHHHHHHHHHHH
Q psy14123 26 ENLNEFAHNNWRSFA-RHEYFDEKGMFISIVFSIPLLLNCMILVL 69 (102)
Q Consensus 26 E~iNe~aa~nW~~Fs-~qnYFDs~G~FISvv~S~PLLl~~~iili 69 (102)
+.-++.-..+|+.-. +..||-.+|... ++||.|+++++++-.+
T Consensus 36 ~~~~~~~~~~~~~~~~~~t~fg~~g~~~-~~~~~p~~~~a~~~~~ 79 (702)
T PLN02292 36 STYKTIWLPSMRAKLGKSTYFGAPGVNL-LVYMFPMILLACLGCI 79 (702)
T ss_pred HHHHhhccHHHHHhccCCceecccchhh-HHHhhHHHHHHHHHHH
Confidence 333444445566655 577999999865 7899999999876543
No 14
>PF01473 CW_binding_1: Putative cell wall binding repeat; InterPro: IPR018337 The cell wall-binding repeat (CW) is an about 20 amino acid residue module, essentially found in two bacterial Gram-positive protein families; the choline binding proteins and glucosyltransferases (2.4.1.5 from EC). In choline-binding proteins cell wall binding repeats bind to choline moieties of both teichoic and lipoteichoic acids, two components peculiar to the cell surface of Gram-positive bacteria [, ]. In glucosyltransferases the region spanning the CW repeats is a glucan binding domain []. Several crystal structures of CW have been solved [, ]. In the choline binding protein LytA, the repeats adopt a solenoid fold consisting exclusively of beta-hairpins that stack to form a left-handed superhelix with a boomerang-like shape. The choline groups bind between beta-hairpin 'steps' of the superhelix []. In Cpl-1 CW repeats assemble in two sub-domains: an N-terminal superhelical moiety similar to the LytA one and a C-terminal beta-sheet involved in interactions with the lysozyme domain. Choline is bound between repeats 1 and 2, and, 2 and 3 of the superhelical sub-domain []. Some proteins known to contain cell-wall binding repeats include: Pneumococcal N-acetylmuramoyl-L-alanine amidase (autolysin, lytA) (3.5.1.28 from EC). It is a surface-exposed enzyme that rules the self-destruction of pneumococcal cells through degradation of their peptidoglycan backbone. It mediates the release of toxic substances that damage the host tissues. Pneumococcal endo-beta-N-acetylglucosaminidase (lytB) (3.2.1.96 from EC). It plays an important role in cell wall degradation and cell separation. Pneumococcal teichoic acid phosphorylcholine esterase (pce or cbpE), a cell wall hydrolase important for cellular adhesion and colonisation. Lactobacillales glucosyltransferase. It catalyses the transfer of glucosyl units from the cleavage of sucrose to a growing chain of glucan. Clostridium difficile toxin A (tcdA) and toxin B (tcdb). They are the causative agents of the antibiotic-associated pseudomembranous colitis. They are intracellular acting toxins that reach their targets after receptor-mediated endocytosis. Clostridium acetobutylicum cspA protein. Siphoviridae bacteriophages N-acetylmuramoyl-L-alanine amidase. It lyses the bacterial host cell wall. Podoviridae lysozyme protein (cpl-1). It is capable of digesting the pneumococcal cell wall. The cell wall binding repeats are also known as the choline-binding repeats (ChBr) or the choline-binding domain (ChBD). ; PDB: 1GVM_C 2BML_B 1HCX_A 1OBA_A 1H09_A 2J8F_A 2IXU_A 2J8G_A 2IXV_A 2X8O_A ....
Probab=32.13 E-value=19 Score=17.63 Aligned_cols=8 Identities=50% Similarity=0.995 Sum_probs=6.4
Q ss_pred cccCCCCc
Q psy14123 43 EYFDEKGM 50 (102)
Q Consensus 43 nYFDs~G~ 50 (102)
=|||++|.
T Consensus 11 Yy~~~~G~ 18 (19)
T PF01473_consen 11 YYFDSDGY 18 (19)
T ss_dssp EEETTTSB
T ss_pred EEeCCCcc
Confidence 48999985
No 15
>PLN03008 Phospholipase D delta
Probab=30.81 E-value=37 Score=32.04 Aligned_cols=21 Identities=19% Similarity=0.629 Sum_probs=19.0
Q ss_pred hhhhhhHHHHhhHhhhccccc
Q psy14123 24 FSENLNEFAHNNWRSFARHEY 44 (102)
Q Consensus 24 ~aE~iNe~aa~nW~~Fs~qnY 44 (102)
|-.++|+.|.+||+.|++...
T Consensus 798 cv~~vn~~a~~~w~~y~~~~~ 818 (868)
T PLN03008 798 CLKKVNTISEENWKRFIDPKF 818 (868)
T ss_pred HHHHHHHHHHHHHHHhhcccc
Confidence 667899999999999998877
No 16
>PF06703 SPC25: Microsomal signal peptidase 25 kDa subunit (SPC25); InterPro: IPR009582 This family consists of several microsomal signal peptidase 25 kDa subunit proteins. Translocation of polypeptide chains across the endoplasmic reticulum (ER) membrane is triggered by signal sequences. Subsequently, signal recognition particle interacts with its membrane receptor and the ribosome-bound nascent chain is targeted to the ER where it is transferred into a protein-conducting channel. At some point, a second signal sequence recognition event takes place in the membrane and translocation of the nascent chain through the membrane occurs. The signal sequence of most secretory and membrane proteins is cleaved off at this stage. Cleavage occurs by the signal peptidase complex (SPC) as soon as the lumenal domain of the translocating polypeptide is large enough to expose its cleavage site to the enzyme. The signal peptidase complex is possibly also involved in proteolytic events in the ER membrane other than the processing of the signal sequence, for example the further digestion of the cleaved signal peptide or the degradation of membrane proteins. Mammalian signal peptidase is as a complex of five different polypeptide chains [].; GO: 0008233 peptidase activity, 0006465 signal peptide processing, 0005787 signal peptidase complex, 0016021 integral to membrane
Probab=30.10 E-value=23 Score=25.56 Aligned_cols=15 Identities=27% Similarity=0.631 Sum_probs=11.4
Q ss_pred cccccCCCCceehhh
Q psy14123 41 RHEYFDEKGMFISIV 55 (102)
Q Consensus 41 ~qnYFDs~G~FISvv 55 (102)
=-+|||++|.|..=.
T Consensus 136 v~~~Fde~G~l~~~~ 150 (162)
T PF06703_consen 136 VGKFFDEDGYLVEDL 150 (162)
T ss_pred hhhEECCCCEEeHHH
Confidence 457999999987543
No 17
>PF05399 EVI2A: Ectropic viral integration site 2A protein (EVI2A); InterPro: IPR008608 This family contains several mammalian ectropic viral integration site 2A (EVI2A) proteins. The function of this protein is unknown although it is thought to be a membrane protein and may function as an oncogene in retrovirus induced myeloid tumours [, ].; GO: 0016021 integral to membrane
Probab=29.32 E-value=39 Score=27.28 Aligned_cols=46 Identities=24% Similarity=0.332 Sum_probs=33.3
Q ss_pred CceehhhhHHHHHHHHHHHHHHHHHHhcccccccccccc--cccchhcc
Q psy14123 49 GMFISIVFSIPLLLNCMILVLKRKYTCKEPYPNLKTPLC--LTPLLKNG 95 (102)
Q Consensus 49 G~FISvv~S~PLLl~~~iili~~l~~~~~lmv~~Kraql--~~~~l~~~ 95 (102)
+|-|-++.-+-|+++|.++.++.+.-. .-+-.+||+++ ||+--+||
T Consensus 129 amLIClIIIAVLfLICT~LfLSTVVLA-NKVS~LKrskQ~gKRqpRSNG 176 (227)
T PF05399_consen 129 AMLICLIIIAVLFLICTLLFLSTVVLA-NKVSSLKRSKQVGKRQPRSNG 176 (227)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHhhccCCCccc
Confidence 455566666777888888877776644 44567899987 78888887
No 18
>PF04336 DUF479: Protein of unknown function, DUF479; InterPro: IPR007431 This entry contains the Escherichia coli gene yajB, now renamed acpH, which encodes an ACP hydrolase. AcpH converts holo-ACP to apo-ACP by hydrolytic cleavage of the phosphopantetheine prosthetic group from ACP []. A mutant E. coli strain having a total deletion of the acpH grows normally, showing that phosphodiesterase activity is not essential for growth, although it is required for turnover of the ACP prosthetic group in vivo. AcpH is found only in Gram-negative organisms suggesting that it plays a role in some aspect of lipid metabolism that is unique to these organisms. The most obvious of which is biosynthesis of lipid A. Because AcpH is a hydrolase, it could possibly be an editing enzyme that intercepts acyl-ACPs that would give an inappropriate lipid A structure if used as acyl donors []. ; GO: 0008770 [acyl-carrier-protein] phosphodiesterase activity, 0006633 fatty acid biosynthetic process
Probab=28.14 E-value=33 Score=23.37 Aligned_cols=13 Identities=23% Similarity=0.761 Sum_probs=10.2
Q ss_pred HHHhhHhhhcccc
Q psy14123 31 FAHNNWRSFARHE 43 (102)
Q Consensus 31 ~aa~nW~~Fs~qn 43 (102)
+.|+||+.||++.
T Consensus 4 fLA~~W~~~s~~~ 16 (106)
T PF04336_consen 4 FLAKHWSQFSDQP 16 (106)
T ss_pred HHHHhHHHHCcCC
Confidence 5789999988763
No 19
>PF15156 CLN6: Ceroid-lipofuscinosis neuronal protein 6
Probab=27.86 E-value=80 Score=26.10 Aligned_cols=32 Identities=22% Similarity=0.472 Sum_probs=24.4
Q ss_pred ccccccCCCCceehhhhHHHHHHHHHHHHHHH
Q psy14123 40 ARHEYFDEKGMFISIVFSIPLLLNCMILVLKR 71 (102)
Q Consensus 40 s~qnYFDs~G~FISvv~S~PLLl~~~iili~~ 71 (102)
-+--.=||+|.|.---|++-|++.++...-.|
T Consensus 224 rkgl~~dsNGlFl~~SF~~tLllVa~WV~wLW 255 (284)
T PF15156_consen 224 RKGLFLDSNGLFLFYSFALTLLLVALWVAWLW 255 (284)
T ss_pred ccCcccCCCceEeHHHHHHHHHHHHHHHHHHh
Confidence 34456799999999999888888877655443
No 20
>PF04030 ALO: D-arabinono-1,4-lactone oxidase ; InterPro: IPR007173 This domain is specific to D-arabinono-1,4-lactone oxidase 1.1.3.37 from EC, which is involved in the final step of the D-erythroascorbic acid biosynthesis pathway [].; GO: 0003885 D-arabinono-1,4-lactone oxidase activity, 0055114 oxidation-reduction process, 0016020 membrane; PDB: 2VFU_A 2VFV_A 2VFT_A 2VFS_A 2VFR_A.
Probab=27.18 E-value=15 Score=27.80 Aligned_cols=20 Identities=25% Similarity=0.589 Sum_probs=11.3
Q ss_pred hhHhhh--ccccccCCCCceehh
Q psy14123 34 NNWRSF--ARHEYFDEKGMFISI 54 (102)
Q Consensus 34 ~nW~~F--s~qnYFDs~G~FISv 54 (102)
.+|..| .+++ +||+|+|.+-
T Consensus 233 p~~~~F~~~r~~-~DP~g~F~n~ 254 (259)
T PF04030_consen 233 PRLDDFLAVRKK-LDPQGVFLND 254 (259)
T ss_dssp TTHHHHHHHHHH-H-TT-TT--H
T ss_pred cCHHHHHHHHHH-hCCCCCCCCH
Confidence 567787 2555 5999999874
No 21
>PF06365 CD34_antigen: CD34/Podocalyxin family; InterPro: IPR013836 This family consists of several mammalian CD34 antigen proteins. The CD34 antigen is a human leukocyte membrane protein expressed specifically by lymphohematopoietic progenitor cells. CD34 is a phosphoprotein. Activation of protein kinase C (PKC) has been found to enhance CD34 phosphorylation [, ]. This family contains several eukaryotic podocalyxin proteins. Podocalyxin is a major membrane protein of the glomerular epithelium and is thought to be involved in maintenance of the architecture of the foot processes and filtration slits characteristic of this unique epithelium by virtue of its high negative charge. Podocalyxin functions as an anti-adhesin that maintains an open filtration pathway between neighbouring foot processes in the glomerular epithelium by charge repulsion [].
Probab=25.47 E-value=1.1e+02 Score=24.15 Aligned_cols=29 Identities=24% Similarity=0.227 Sum_probs=22.3
Q ss_pred cccccCCCCceehhhhHHHHHHHHHHHHH
Q psy14123 41 RHEYFDEKGMFISIVFSIPLLLNCMILVL 69 (102)
Q Consensus 41 ~qnYFDs~G~FISvv~S~PLLl~~~iili 69 (102)
.+.|=|+.++||++|-++-.+++++++..
T Consensus 92 ~~~~~~r~~~lI~lv~~g~~lLla~~~~~ 120 (202)
T PF06365_consen 92 HQSSSDRYPTLIALVTSGSFLLLAILLGA 120 (202)
T ss_pred CCCcCccceEEEehHHhhHHHHHHHHHHH
Confidence 45566667899999999977777666665
No 22
>PF12400 DUF3661: Vaculolar membrane protein ; InterPro: IPR022127 This domain family is found in eukaryotes, and is typically between 123 and 138 amino acids in length.
Probab=25.41 E-value=2e+02 Score=20.76 Aligned_cols=64 Identities=13% Similarity=0.225 Sum_probs=35.6
Q ss_pred hhHHHhhhhhhhhHHHHhhHhhhccccc---cCCC---------CceehhhhHHHHHHHHHHHHHH-HHHHhccccccc
Q psy14123 17 LFILLVYFSENLNEFAHNNWRSFARHEY---FDEK---------GMFISIVFSIPLLLNCMILVLK-RKYTCKEPYPNL 82 (102)
Q Consensus 17 ~ll~lVy~aE~iNe~aa~nW~~Fs~qnY---FDs~---------G~FISvv~S~PLLl~~~iili~-~l~~~~~lmv~~ 82 (102)
++...+-..|++-++ .||+++.+-|| =|+- +.|++++...-+.+..+..... ++....+.+.++
T Consensus 18 il~~~l~~~~~~~~~--~~~~~i~sG~Y~~~g~pp~~~~w~~Ql~iy~~~l~~~K~~v~~i~~~~~~~l~~~~~~lL~~ 94 (129)
T PF12400_consen 18 ILYLFLRLVTKLASR--FGWENIESGNYGPPGDPPSFRAWLKQLIIYILILVIMKIIVFLILLIFPPFLNWFGDWLLSW 94 (129)
T ss_pred HHHHHHHHHHHHHHh--CCcccccccccCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHhhc
Confidence 333444444544444 67999999999 3332 3556666655555555555443 555555544443
No 23
>PF09779 Ima1_N: Ima1 N-terminal domain; InterPro: IPR018617 Members of this family of uncharacterised novel proteins have no known function.
Probab=24.75 E-value=32 Score=24.62 Aligned_cols=14 Identities=36% Similarity=0.413 Sum_probs=11.0
Q ss_pred ccccccCCCCceehh
Q psy14123 40 ARHEYFDEKGMFISI 54 (102)
Q Consensus 40 s~qnYFDs~G~FISv 54 (102)
.-.||||++|= |.-
T Consensus 27 ~q~N~f~e~Gd-~~d 40 (131)
T PF09779_consen 27 EQYNGFDEDGD-ITD 40 (131)
T ss_pred CCccCccccCC-CCC
Confidence 47899999997 543
No 24
>PLN02844 oxidoreductase/ferric-chelate reductase
Probab=24.52 E-value=89 Score=28.67 Aligned_cols=47 Identities=21% Similarity=0.441 Sum_probs=32.6
Q ss_pred HhhhhhhhhHHHHhhH---hhhccccccCCCCceehhhhHHHHHHHHHHHHH
Q psy14123 21 LVYFSENLNEFAHNNW---RSFARHEYFDEKGMFISIVFSIPLLLNCMILVL 69 (102)
Q Consensus 21 lVy~aE~iNe~aa~nW---~~Fs~qnYFDs~G~FISvv~S~PLLl~~~iili 69 (102)
++|.--+-|.| .+.| +..++..||-.+|... ++||.|+++++++=.+
T Consensus 23 ~~~~~~pt~~~-~~~~~~~~~~~~~t~fg~~g~~~-~~~~~p~~~~a~~~~~ 72 (722)
T PLN02844 23 ALWILKPTNLW-TRKWKQAEDSARHTVFGYYGLNF-AVYTFPPIALAIIGLV 72 (722)
T ss_pred HHheeeCCHHH-HhhhhhHHhcccCceecccchhh-HhHhhHHHHHHHHHHH
Confidence 34444445665 3334 4456788999999865 7899999999876544
No 25
>PF06413 Neugrin: Neugrin; InterPro: IPR010487 This family contains mouse and human neugrin proteins. Neugrin and m-neugrin are mainly expressed in neurons in the nervous system, and are thought to play an important role in the process of neuronal differentiation []. Homologues of neugrin are found in fungi suggesting they have a alternative role or more than one function.
Probab=23.83 E-value=40 Score=26.58 Aligned_cols=13 Identities=38% Similarity=0.884 Sum_probs=11.0
Q ss_pred ccccccCCCCcee
Q psy14123 40 ARHEYFDEKGMFI 52 (102)
Q Consensus 40 s~qnYFDs~G~FI 52 (102)
-+..|||++|=|.
T Consensus 210 ~G~eFfDsdGnFL 222 (225)
T PF06413_consen 210 KGREFFDSDGNFL 222 (225)
T ss_pred cCcccCCCCCCee
Confidence 3789999999884
No 26
>PF01219 DAGK_prokar: Prokaryotic diacylglycerol kinase; InterPro: IPR000829 Diacylglycerol kinase (2.7.1.107 from EC) (DAGK) is an enzyme that catalyses the formation of phosphatidic acid from diacylglycerol and ATP, an important step in phospholipid biosynthesis. In bacteria DAGK is very small (13 to 15 kD) membrane protein which seems to contain three transmembrane domains []. The best conserved region, is a stretch of 12 residues which are located in a cytoplasmic loop between the second and third transmembrane domains.; GO: 0004143 diacylglycerol kinase activity, 0008654 phospholipid biosynthetic process, 0016020 membrane; PDB: 2KDC_B.
Probab=23.18 E-value=81 Score=21.90 Aligned_cols=36 Identities=28% Similarity=0.329 Sum_probs=21.6
Q ss_pred hhhHhhhccchh------hhHHHhhhhhhhhHHHHhhHhhhc
Q psy14123 5 LVFSIVLKPRPY------LFILLVYFSENLNEFAHNNWRSFA 40 (102)
Q Consensus 5 ~~~~i~l~~~~~------~ll~lVy~aE~iNe~aa~nW~~Fs 40 (102)
+.+.+.+...+. +.+++|..+|-+|+--.+=...-+
T Consensus 29 ~~~~~~l~~s~~ew~~li~~~~~Vl~~EllNTAIE~~vD~v~ 70 (104)
T PF01219_consen 29 LIAAFFLGLSPWEWALLILAIFLVLIAELLNTAIERLVDLVS 70 (104)
T ss_dssp HHHHHH-----SHHHHHHHHHHHHHHHHTHHHHHHHHHTT--
T ss_pred HHHHHHHHccHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHcc
Confidence 344455555554 668899999999998776666544
No 27
>PF13209 DUF4017: Protein of unknown function (DUF4017)
Probab=21.20 E-value=84 Score=20.59 Aligned_cols=31 Identities=26% Similarity=0.651 Sum_probs=22.2
Q ss_pred hhhhhhHHHHhhHhhhccccccCCCCceehhhhHHHHHHHHHHHHH
Q psy14123 24 FSENLNEFAHNNWRSFARHEYFDEKGMFISIVFSIPLLLNCMILVL 69 (102)
Q Consensus 24 ~aE~iNe~aa~nW~~Fs~qnYFDs~G~FISvv~S~PLLl~~~iili 69 (102)
.||--|.. -|+.|-+| +++.|+++++.+|..
T Consensus 23 aSegYN~v---gWKlfvGQ------------~YAiPif~i~aiitF 53 (60)
T PF13209_consen 23 ASEGYNTV---GWKLFVGQ------------AYAIPIFIITAIITF 53 (60)
T ss_pred cccCcccc---chhheecc------------hhHhHHHHHHHHHhh
Confidence 46666654 48877666 568999999887753
No 28
>PF03620 IBV_3C: IBV 3C protein; InterPro: IPR005296 These proteins are the product of ORF 3C from Infectious bronchitis virus. Currently, the function of this protein remains unknown.
Probab=20.81 E-value=83 Score=22.23 Aligned_cols=28 Identities=21% Similarity=0.405 Sum_probs=19.1
Q ss_pred ccCCCCceehhhhHHHHHHHHHHHHHHHH
Q psy14123 44 YFDEKGMFISIVFSIPLLLNCMILVLKRK 72 (102)
Q Consensus 44 YFDs~G~FISvv~S~PLLl~~~iili~~l 72 (102)
.||++|.|.+.++ .-+=++++.++..-+
T Consensus 8 sleeNG~Flt~lY-v~~gfialYllgk~L 35 (93)
T PF03620_consen 8 SLEENGSFLTALY-VLLGFIALYLLGKAL 35 (93)
T ss_pred HHHhcCcHHHHHH-HHHHHHHHHHHHHHH
Confidence 4789999999998 334455566655444
No 29
>PF03140 DUF247: Plant protein of unknown function; InterPro: IPR004158 The function of the plant proteins constituting this family is unknown.
Probab=20.28 E-value=55 Score=26.73 Aligned_cols=40 Identities=38% Similarity=0.693 Sum_probs=31.4
Q ss_pred hhhhhhHHHHhhHhhh---ccccccCCCCceehhhhHHHHHHH
Q psy14123 24 FSENLNEFAHNNWRSF---ARHEYFDEKGMFISIVFSIPLLLN 63 (102)
Q Consensus 24 ~aE~iNe~aa~nW~~F---s~qnYFDs~G~FISvv~S~PLLl~ 63 (102)
.-|.+|++-...|+.. -.++||.+--.++|++.++=+++.
T Consensus 348 v~~~l~~~~~~~~~~~~a~l~~~yf~~pw~~is~~aa~~ll~l 390 (391)
T PF03140_consen 348 VCEDLNKYYQSRWNRWWAWLRRKYFSNPWTFISLVAAIILLLL 390 (391)
T ss_pred HHHHHHHHhcChHHHHHHHHhhhhcCCcHHHHHHHHHHHHHHh
Confidence 4578999999876664 489999999999999887765543
No 30
>COG3558 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=20.15 E-value=25 Score=26.59 Aligned_cols=18 Identities=33% Similarity=0.938 Sum_probs=12.4
Q ss_pred hhhHHHHhhHhhhccccccCCCCce
Q psy14123 27 NLNEFAHNNWRSFARHEYFDEKGMF 51 (102)
Q Consensus 27 ~iNe~aa~nW~~Fs~qnYFDs~G~F 51 (102)
..-+|+.+||+ ||++|+.
T Consensus 99 wfraygnenwe-------fde~glm 116 (154)
T COG3558 99 WFRAYGNENWE-------FDEAGLM 116 (154)
T ss_pred HHHHhCCcccc-------cchhhHH
Confidence 34566667774 8999875
Done!