Query         psy14123
Match_columns 102
No_of_seqs    91 out of 93
Neff          3.7 
Searched_HMMs 46136
Date          Fri Aug 16 20:09:08 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy14123.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/14123hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 PF14770 TMEM18:  Transmembrane 100.0 5.7E-40 1.2E-44  235.9   5.8   88    4-91     25-121 (123)
  2 PF12357 PLD_C:  Phospholipase   92.4    0.11 2.4E-06   35.1   2.3   27   23-49     12-38  (74)
  3 PF05827 ATP-synt_S1:  Vacuolar  52.9     9.7 0.00021   29.6   1.8   29   38-67    248-276 (282)
  4 PF08031 BBE:  Berberine and be  45.1     9.5 0.00021   22.7   0.6   26   26-52     14-41  (47)
  5 PLN02270 phospholipase D alpha  38.5      20 0.00043   33.5   1.8   22   23-44    736-757 (808)
  6 PLN02352 phospholipase D epsil  38.0      23  0.0005   32.8   2.1   26   24-49    686-711 (758)
  7 PRK14399 membrane protein; Pro  37.7   1E+02  0.0023   25.1   5.6   53   36-90    192-249 (258)
  8 TIGR03834 EAGR_box EAGR box. T  36.2      14  0.0003   21.0   0.3   11   42-52     18-28  (28)
  9 KOG2574|consensus               34.6      20 0.00043   31.7   1.1   31   24-54    457-487 (492)
 10 PF13783 DUF4177:  Domain of un  34.4      18 0.00038   22.3   0.6   16   24-39     22-37  (61)
 11 PF02428 Prot_inhib_II:  Potato  34.2      15 0.00033   23.4   0.2   13   40-52     37-49  (52)
 12 PF14997 CECR6_TMEM121:  CECR6/  34.2      27 0.00057   26.9   1.6   48    5-64      8-55  (195)
 13 PLN02292 ferric-chelate reduct  32.4      41 0.00088   30.7   2.7   43   26-69     36-79  (702)
 14 PF01473 CW_binding_1:  Putativ  32.1      19  0.0004   17.6   0.3    8   43-50     11-18  (19)
 15 PLN03008 Phospholipase D delta  30.8      37 0.00081   32.0   2.3   21   24-44    798-818 (868)
 16 PF06703 SPC25:  Microsomal sig  30.1      23 0.00049   25.6   0.6   15   41-55    136-150 (162)
 17 PF05399 EVI2A:  Ectropic viral  29.3      39 0.00085   27.3   1.9   46   49-95    129-176 (227)
 18 PF04336 DUF479:  Protein of un  28.1      33 0.00071   23.4   1.1   13   31-43      4-16  (106)
 19 PF15156 CLN6:  Ceroid-lipofusc  27.9      80  0.0017   26.1   3.4   32   40-71    224-255 (284)
 20 PF04030 ALO:  D-arabinono-1,4-  27.2      15 0.00033   27.8  -0.7   20   34-54    233-254 (259)
 21 PF06365 CD34_antigen:  CD34/Po  25.5 1.1E+02  0.0023   24.1   3.6   29   41-69     92-120 (202)
 22 PF12400 DUF3661:  Vaculolar me  25.4   2E+02  0.0043   20.8   4.8   64   17-82     18-94  (129)
 23 PF09779 Ima1_N:  Ima1 N-termin  24.8      32 0.00069   24.6   0.6   14   40-54     27-40  (131)
 24 PLN02844 oxidoreductase/ferric  24.5      89  0.0019   28.7   3.5   47   21-69     23-72  (722)
 25 PF06413 Neugrin:  Neugrin;  In  23.8      40 0.00087   26.6   1.0   13   40-52    210-222 (225)
 26 PF01219 DAGK_prokar:  Prokaryo  23.2      81  0.0017   21.9   2.4   36    5-40     29-70  (104)
 27 PF13209 DUF4017:  Protein of u  21.2      84  0.0018   20.6   2.0   31   24-69     23-53  (60)
 28 PF03620 IBV_3C:  IBV 3C protei  20.8      83  0.0018   22.2   2.0   28   44-72      8-35  (93)
 29 PF03140 DUF247:  Plant protein  20.3      55  0.0012   26.7   1.2   40   24-63    348-390 (391)
 30 COG3558 Uncharacterized protei  20.1      25 0.00055   26.6  -0.7   18   27-51     99-116 (154)

No 1  
>PF14770 TMEM18:  Transmembrane protein 18
Probab=100.00  E-value=5.7e-40  Score=235.89  Aligned_cols=88  Identities=39%  Similarity=0.745  Sum_probs=82.1

Q ss_pred             hhhhHhhhccchh---------hhHHHhhhhhhhhHHHHhhHhhhccccccCCCCceehhhhHHHHHHHHHHHHHHHHHH
Q psy14123          4 LLVFSIVLKPRPY---------LFILLVYFSENLNEFAHNNWRSFARHEYFDEKGMFISIVFSIPLLLNCMILVLKRKYT   74 (102)
Q Consensus         4 l~~~~i~l~~~~~---------~ll~lVy~aE~iNe~aa~nW~~Fs~qnYFDs~G~FISvv~S~PLLl~~~iili~~l~~   74 (102)
                      +++++.++.+|+.         +++++||+||+|||+||+|||+||+||||||+|+|||+|||+|+|++|++++++|+++
T Consensus        25 ~~~~~~~~~tr~~~~~q~~lf~~ll~~v~~aE~iN~~~a~nW~~Fs~qnYFDs~G~Fisvv~s~PlLl~~~ii~~~~l~~  104 (123)
T PF14770_consen   25 VLLLLLAILTRRRYNFQMILFLILLLLVYCAEYINEYAARNWRSFSKQNYFDSSGVFISVVFSAPLLLNCLIILVNWLYQ  104 (123)
T ss_pred             HHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcCCCCeeehHHHHHhHHHHHHHHHHHHHHH
Confidence            5566677777766         7788999999999999999999999999999999999999999999999999999999


Q ss_pred             hcccccccccccccccc
Q psy14123         75 CKEPYPNLKTPLCLTPL   91 (102)
Q Consensus        75 ~~~lmv~~Kraql~~~~   91 (102)
                      ++++||++||+|+|+|+
T Consensus       105 ~~~lmv~~Kr~qlr~~~  121 (123)
T PF14770_consen  105 LCSLMVQVKRAQLRRKA  121 (123)
T ss_pred             HHHHHHHHHHHHHHHHh
Confidence            99999999999999876


No 2  
>PF12357 PLD_C:  Phospholipase D C terminal ;  InterPro: IPR024632 Phospholipase D (PLD) catalyses the hydrolysis of the phosphodiester bond of glycerophospholipids to generate phosphatidic acid and a free head group. Phospholipase D activities have been detected in simple to complex organisms from viruses and bacteria to yeast, plants, and mammals []. In higher organisms, PLD specifically catalyzes the hydrolysis of phosphatidylcholine (PC) to phosphatidic acid (PA) and choline and is activated in response to stimulators of vesicle transport, endocytosis, exocytosis, cell migration, and mitosis. This entry represents the C-terminal domain of eukaryotic phospholipase D. The domain is approximately 70 amino acids in length and contains a conserved FPD sequence motif.
Probab=92.43  E-value=0.11  Score=35.12  Aligned_cols=27  Identities=33%  Similarity=0.580  Sum_probs=22.4

Q ss_pred             hhhhhhhHHHHhhHhhhccccccCCCC
Q psy14123         23 YFSENLNEFAHNNWRSFARHEYFDEKG   49 (102)
Q Consensus        23 y~aE~iNe~aa~nW~~Fs~qnYFDs~G   49 (102)
                      =|..++|+.|.+||+.|++...=|=.|
T Consensus        12 eCVr~Vn~iae~nW~~y~~ee~~dl~G   38 (74)
T PF12357_consen   12 ECVRRVNEIAEENWKQYASEEVTDLPG   38 (74)
T ss_pred             HHHHHHHHHHHHHHHHhhccccccCCC
Confidence            367899999999999999888755555


No 3  
>PF05827 ATP-synt_S1:  Vacuolar ATP synthase subunit S1 (ATP6S1);  InterPro: IPR024722 This family consists of metazoan vacuolar ATP synthase subunit S1 proteins [] and fungal proteins belonging to the BIG family. In Candida albicans BIG is required for normal beta-1,6-glucan synthesis, hyphal morphogenesis, adhesion and virulence [].
Probab=52.93  E-value=9.7  Score=29.55  Aligned_cols=29  Identities=21%  Similarity=0.451  Sum_probs=22.5

Q ss_pred             hhccccccCCCCceehhhhHHHHHHHHHHH
Q psy14123         38 SFARHEYFDEKGMFISIVFSIPLLLNCMIL   67 (102)
Q Consensus        38 ~Fs~qnYFDs~G~FISvv~S~PLLl~~~ii   67 (102)
                      .|.+.|+|.+ |.|++++-++.++.+.++-
T Consensus       248 lf~~yQFftp-gi~mglii~~~ll~IL~~g  276 (282)
T PF05827_consen  248 LFFDYQFFTP-GIWMGLIISLVLLSILYVG  276 (282)
T ss_pred             eehhheeeec-cHHHHHHHHHHHHHHHHHH
Confidence            4889998876 7999999888877665543


No 4  
>PF08031 BBE:  Berberine and berberine like ;  InterPro: IPR012951 This domain is found in the berberine bridge and berberine bridge-like enzymes, which are involved in the biosynthesis of numerous isoquinoline alkaloids. They catalyse the transformation of the N-methyl group of (S)-reticuline into the C-8 berberine bridge carbon of (S)-scoulerine [].; GO: 0016491 oxidoreductase activity, 0050660 flavin adenine dinucleotide binding, 0055114 oxidation-reduction process; PDB: 2IPI_A 2Y3S_A 2Y3R_D 2Y08_B 2Y4G_A 3D2H_A 3FW9_A 3FW8_A 3FWA_A 3D2J_A ....
Probab=45.15  E-value=9.5  Score=22.72  Aligned_cols=26  Identities=15%  Similarity=0.427  Sum_probs=16.7

Q ss_pred             hhhhHHHHhhHhhhc--cccccCCCCcee
Q psy14123         26 ENLNEFAHNNWRSFA--RHEYFDEKGMFI   52 (102)
Q Consensus        26 E~iNe~aa~nW~~Fs--~qnYFDs~G~FI   52 (102)
                      +...++-++||+...  ++.| ||+|+|-
T Consensus        14 ~~~~~yyg~n~~rL~~iK~~y-DP~n~F~   41 (47)
T PF08031_consen   14 DWQEAYYGENYDRLRAIKRKY-DPDNVFR   41 (47)
T ss_dssp             HHHHHHHGGGHHHHHHHHHHH--TT-TS-
T ss_pred             HHHHHHhchhHHHHHHHHHHh-CccceeC
Confidence            446667778888763  6666 9999995


No 5  
>PLN02270 phospholipase D alpha
Probab=38.52  E-value=20  Score=33.46  Aligned_cols=22  Identities=14%  Similarity=0.392  Sum_probs=19.4

Q ss_pred             hhhhhhhHHHHhhHhhhccccc
Q psy14123         23 YFSENLNEFAHNNWRSFARHEY   44 (102)
Q Consensus        23 y~aE~iNe~aa~nW~~Fs~qnY   44 (102)
                      =|-.++|+.|.+||+.|++.+.
T Consensus       736 ~cv~~v~~~a~~~w~~y~~~~~  757 (808)
T PLN02270        736 ECIQKVNQIADKYWDLYSSETL  757 (808)
T ss_pred             HHHHHHHHHHHHHHHHhccccc
Confidence            3678999999999999998876


No 6  
>PLN02352 phospholipase D epsilon
Probab=38.01  E-value=23  Score=32.78  Aligned_cols=26  Identities=19%  Similarity=0.450  Sum_probs=21.6

Q ss_pred             hhhhhhHHHHhhHhhhccccccCCCC
Q psy14123         24 FSENLNEFAHNNWRSFARHEYFDEKG   49 (102)
Q Consensus        24 ~aE~iNe~aa~nW~~Fs~qnYFDs~G   49 (102)
                      |-.++|+.|.+||+.|++.+.=|=.|
T Consensus       686 c~~~v~~~~~~~w~~y~~~~~~~~~g  711 (758)
T PLN02352        686 CVRRLRTIGEQMWEIYSGEEVVDMEG  711 (758)
T ss_pred             HHHHHHHHHHHHHHhhccchhccCCC
Confidence            67889999999999999888755444


No 7  
>PRK14399 membrane protein; Provisional
Probab=37.74  E-value=1e+02  Score=25.06  Aligned_cols=53  Identities=9%  Similarity=0.155  Sum_probs=27.2

Q ss_pred             HhhhccccccCCCCceehh-----hhHHHHHHHHHHHHHHHHHHhccccccccccccccc
Q psy14123         36 WRSFARHEYFDEKGMFISI-----VFSIPLLLNCMILVLKRKYTCKEPYPNLKTPLCLTP   90 (102)
Q Consensus        36 W~~Fs~qnYFDs~G~FISv-----v~S~PLLl~~~iili~~l~~~~~lmv~~Kraql~~~   90 (102)
                      |+.-...+|||+.+.|-+=     +|-+++++.---.++-|-++.-  .+++|+.|+++-
T Consensus       192 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~i~~H~~N--I~rl~~~~E~k~  249 (258)
T PRK14399        192 WKNYILFSLLNSFHYWFSNIWASGMLEGNIIILIGGLILVVRHSQN--IKRIKNKTEPDT  249 (258)
T ss_pred             hhcchhhhhccccccccchhhhchhHHHHHHHHHHHHHHHHHHHHh--HHHHhcCCCcch
Confidence            5555566778876643332     3333333322233333444433  677788777543


No 8  
>TIGR03834 EAGR_box EAGR box. The EAGR box (Enriched in Aromatic and Glycine Residues) is found in three different proteins of the Mycoplasma genitalium terminal organelle, which acts in both cytadherence and gliding motility. The presence of this domain in a genome predicts the Mycoplasma-type terminal organelle structure, gliding motility, and cytadherence. The EAGR box may occur from one to nine times in a protein.
Probab=36.21  E-value=14  Score=20.97  Aligned_cols=11  Identities=45%  Similarity=1.045  Sum_probs=7.7

Q ss_pred             ccccCCCCcee
Q psy14123         42 HEYFDEKGMFI   52 (102)
Q Consensus        42 qnYFDs~G~FI   52 (102)
                      .-|||++|-|+
T Consensus        18 ~GyFDe~~~w~   28 (28)
T TIGR03834        18 KGYFDEDGNWV   28 (28)
T ss_pred             eeEeCccCCCC
Confidence            45888888664


No 9  
>KOG2574|consensus
Probab=34.61  E-value=20  Score=31.72  Aligned_cols=31  Identities=26%  Similarity=0.280  Sum_probs=26.7

Q ss_pred             hhhhhhHHHHhhHhhhccccccCCCCceehh
Q psy14123         24 FSENLNEFAHNNWRSFARHEYFDEKGMFISI   54 (102)
Q Consensus        24 ~aE~iNe~aa~nW~~Fs~qnYFDs~G~FISv   54 (102)
                      .-|-+|.-++++=..=+..|||-++|.|..+
T Consensus       457 glEivnp~~~~~~~~e~~~~yFS~~g~F~~i  487 (492)
T KOG2574|consen  457 GLEIVNPQAAEQQQPEENSKYFSSSGSFSKI  487 (492)
T ss_pred             chhhcCHHHHhhcCcccccceecCCCceeec
Confidence            3578899999988778899999999999865


No 10 
>PF13783 DUF4177:  Domain of unknown function (DUF4177)
Probab=34.44  E-value=18  Score=22.27  Aligned_cols=16  Identities=31%  Similarity=0.719  Sum_probs=13.8

Q ss_pred             hhhhhhHHHHhhHhhh
Q psy14123         24 FSENLNEFAHNNWRSF   39 (102)
Q Consensus        24 ~aE~iNe~aa~nW~~F   39 (102)
                      +-+.||++|++-|+.=
T Consensus        22 ~~~~Ln~~g~eGWeLV   37 (61)
T PF13783_consen   22 LEEILNEYGKEGWELV   37 (61)
T ss_pred             HHHHHHHHHhCCcEEE
Confidence            4688999999999973


No 11 
>PF02428 Prot_inhib_II:  Potato type II proteinase inhibitor family;  InterPro: IPR003465 Peptide proteinase inhibitors can be found as single domain proteins or as single or multiple domains within proteins; these are referred to as either simple or compound inhibitors, respectively. In many cases they are synthesised as part of a larger precursor protein, either as a prepropeptide or as an N-terminal domain associated with an inactive peptidase or zymogen. This domain prevents access of the substrate to the active site. Removal of the N-terminal inhibitor domain either by interaction with a second peptidase or by autocatalytic cleavage activates the zymogen. Other inhibitors interact direct with proteinases using a simple noncovalent lock and key mechanism; while yet others use a conformational change-based trapping mechanism that depends on their structural and thermodynamic properties.  Members of the Pin2 family are proteinase inhibitors that belong to MEROPS inhibitor family I20, clan IA and are restricted to plants. They inhibit serine peptidases belonging to MEROPS peptidase family S1 [] (IPR001254 from INTERPRO). They have a multidomain structure [], which permits circular permutation of the sequences. It was been shown that some naturally occurring Pin2 proteins, have an `ancestral' circularly permuted structure []. Circular permutation/ rearrangements of sequences has also been observed between species, such as favin from Vicia faba and the lectin concanavalin A from Canavalia ensiformis [] or amongst members of the plant aspartyl proteinases and human lung surfactant proteins [].  The Pin2 family of proteinase inhibitors are present in seeds, leaves and other organs. Perhaps the best known representatives are the wound-induced proteinase inhibitors [, ], which contain up to eight sequence-repeats (the `IP repeats'). The sequence of the IP repeats is quite variable, only the cysteines constituting the four disulphide bridges and a single proline residue are conserved throughout all the known repeat sequences. The structure of the proteinase inhibitor complex is known [].; GO: 0004867 serine-type endopeptidase inhibitor activity; PDB: 1PJU_C 1OYV_I 1QH2_B 1FYB_A 1CE3_A 2JZM_A 1YTP_A 2JYY_A 1TIH_A 4SGB_I ....
Probab=34.17  E-value=15  Score=23.41  Aligned_cols=13  Identities=31%  Similarity=0.746  Sum_probs=11.3

Q ss_pred             ccccccCCCCcee
Q psy14123         40 ARHEYFDEKGMFI   52 (102)
Q Consensus        40 s~qnYFDs~G~FI   52 (102)
                      .+=+||+.+|.||
T Consensus        37 KGC~yy~~dGtfi   49 (52)
T PF02428_consen   37 KGCNYYSADGTFI   49 (52)
T ss_dssp             TTSEEEETTTEEE
T ss_pred             cccEEECCCCcEE
Confidence            4678999999997


No 12 
>PF14997 CECR6_TMEM121:  CECR6/TMEM121 family
Probab=34.16  E-value=27  Score=26.89  Aligned_cols=48  Identities=23%  Similarity=0.431  Sum_probs=33.4

Q ss_pred             hhhHhhhccchhhhHHHhhhhhhhhHHHHhhHhhhccccccCCCCceehhhhHHHHHHHH
Q psy14123          5 LVFSIVLKPRPYLFILLVYFSENLNEFAHNNWRSFARHEYFDEKGMFISIVFSIPLLLNC   64 (102)
Q Consensus         5 ~~~~i~l~~~~~~ll~lVy~aE~iNe~aa~nW~~Fs~qnYFDs~G~FISvv~S~PLLl~~   64 (102)
                      ++++++|.||...+...+. +|+           =....+||++++.+....|+|++...
T Consensus         8 ~lY~~~L~~Kv~~i~~~~~-~d~-----------l~~~~~~~~~~l~l~L~ls~p~lf~L   55 (195)
T PF14997_consen    8 FLYSIVLTPKVVFIFQNVI-ADY-----------LIELDIFGPTVLKLLLSLSVPLLFLL   55 (195)
T ss_pred             HHHHHHHHHHHhhheeehh-hhh-----------ccccccccchhHHHHHHHHHHHHHHH
Confidence            4567888888766554333 332           33456799999999999999976543


No 13 
>PLN02292 ferric-chelate reductase
Probab=32.36  E-value=41  Score=30.68  Aligned_cols=43  Identities=16%  Similarity=0.371  Sum_probs=30.7

Q ss_pred             hhhhHHHHhhHhhhc-cccccCCCCceehhhhHHHHHHHHHHHHH
Q psy14123         26 ENLNEFAHNNWRSFA-RHEYFDEKGMFISIVFSIPLLLNCMILVL   69 (102)
Q Consensus        26 E~iNe~aa~nW~~Fs-~qnYFDs~G~FISvv~S~PLLl~~~iili   69 (102)
                      +.-++.-..+|+.-. +..||-.+|... ++||.|+++++++-.+
T Consensus        36 ~~~~~~~~~~~~~~~~~~t~fg~~g~~~-~~~~~p~~~~a~~~~~   79 (702)
T PLN02292         36 STYKTIWLPSMRAKLGKSTYFGAPGVNL-LVYMFPMILLACLGCI   79 (702)
T ss_pred             HHHHhhccHHHHHhccCCceecccchhh-HHHhhHHHHHHHHHHH
Confidence            333444445566655 577999999865 7899999999876543


No 14 
>PF01473 CW_binding_1:  Putative cell wall binding repeat;  InterPro: IPR018337 The cell wall-binding repeat (CW) is an about 20 amino acid residue module, essentially found in two bacterial Gram-positive protein families; the choline binding proteins and glucosyltransferases (2.4.1.5 from EC). In choline-binding proteins cell wall binding repeats bind to choline moieties of both teichoic and lipoteichoic acids, two components peculiar to the cell surface of Gram-positive bacteria [, ]. In glucosyltransferases the region spanning the CW repeats is a glucan binding domain []. Several crystal structures of CW have been solved [, ]. In the choline binding protein LytA, the repeats adopt a solenoid fold consisting exclusively of beta-hairpins that stack to form a left-handed superhelix with a boomerang-like shape. The choline groups bind between beta-hairpin 'steps' of the superhelix []. In Cpl-1 CW repeats assemble in two sub-domains: an N-terminal superhelical moiety similar to the LytA one and a C-terminal beta-sheet involved in interactions with the lysozyme domain. Choline is bound between repeats 1 and 2, and, 2 and 3 of the superhelical sub-domain []. Some proteins known to contain cell-wall binding repeats include:  Pneumococcal N-acetylmuramoyl-L-alanine amidase (autolysin, lytA) (3.5.1.28 from EC). It is a surface-exposed enzyme that rules the self-destruction of pneumococcal cells through degradation of their peptidoglycan backbone. It mediates the release of toxic substances that damage the host tissues. Pneumococcal endo-beta-N-acetylglucosaminidase (lytB) (3.2.1.96 from EC). It plays an important role in cell wall degradation and cell separation. Pneumococcal teichoic acid phosphorylcholine esterase (pce or cbpE), a cell wall hydrolase important for cellular adhesion and colonisation. Lactobacillales glucosyltransferase. It catalyses the transfer of glucosyl units from the cleavage of sucrose to a growing chain of glucan.  Clostridium difficile toxin A (tcdA) and toxin B (tcdb). They are the causative agents of the antibiotic-associated pseudomembranous colitis. They are intracellular acting toxins that reach their targets after receptor-mediated endocytosis.  Clostridium acetobutylicum cspA protein. Siphoviridae bacteriophages N-acetylmuramoyl-L-alanine amidase. It lyses the bacterial host cell wall. Podoviridae lysozyme protein (cpl-1). It is capable of digesting the pneumococcal cell wall.  The cell wall binding repeats are also known as the choline-binding repeats (ChBr) or the choline-binding domain (ChBD). ; PDB: 1GVM_C 2BML_B 1HCX_A 1OBA_A 1H09_A 2J8F_A 2IXU_A 2J8G_A 2IXV_A 2X8O_A ....
Probab=32.13  E-value=19  Score=17.63  Aligned_cols=8  Identities=50%  Similarity=0.995  Sum_probs=6.4

Q ss_pred             cccCCCCc
Q psy14123         43 EYFDEKGM   50 (102)
Q Consensus        43 nYFDs~G~   50 (102)
                      =|||++|.
T Consensus        11 Yy~~~~G~   18 (19)
T PF01473_consen   11 YYFDSDGY   18 (19)
T ss_dssp             EEETTTSB
T ss_pred             EEeCCCcc
Confidence            48999985


No 15 
>PLN03008 Phospholipase D delta
Probab=30.81  E-value=37  Score=32.04  Aligned_cols=21  Identities=19%  Similarity=0.629  Sum_probs=19.0

Q ss_pred             hhhhhhHHHHhhHhhhccccc
Q psy14123         24 FSENLNEFAHNNWRSFARHEY   44 (102)
Q Consensus        24 ~aE~iNe~aa~nW~~Fs~qnY   44 (102)
                      |-.++|+.|.+||+.|++...
T Consensus       798 cv~~vn~~a~~~w~~y~~~~~  818 (868)
T PLN03008        798 CLKKVNTISEENWKRFIDPKF  818 (868)
T ss_pred             HHHHHHHHHHHHHHHhhcccc
Confidence            667899999999999998877


No 16 
>PF06703 SPC25:  Microsomal signal peptidase 25 kDa subunit (SPC25);  InterPro: IPR009582 This family consists of several microsomal signal peptidase 25 kDa subunit proteins. Translocation of polypeptide chains across the endoplasmic reticulum (ER) membrane is triggered by signal sequences. Subsequently, signal recognition particle interacts with its membrane receptor and the ribosome-bound nascent chain is targeted to the ER where it is transferred into a protein-conducting channel. At some point, a second signal sequence recognition event takes place in the membrane and translocation of the nascent chain through the membrane occurs. The signal sequence of most secretory and membrane proteins is cleaved off at this stage. Cleavage occurs by the signal peptidase complex (SPC) as soon as the lumenal domain of the translocating polypeptide is large enough to expose its cleavage site to the enzyme. The signal peptidase complex is possibly also involved in proteolytic events in the ER membrane other than the processing of the signal sequence, for example the further digestion of the cleaved signal peptide or the degradation of membrane proteins. Mammalian signal peptidase is as a complex of five different polypeptide chains [].; GO: 0008233 peptidase activity, 0006465 signal peptide processing, 0005787 signal peptidase complex, 0016021 integral to membrane
Probab=30.10  E-value=23  Score=25.56  Aligned_cols=15  Identities=27%  Similarity=0.631  Sum_probs=11.4

Q ss_pred             cccccCCCCceehhh
Q psy14123         41 RHEYFDEKGMFISIV   55 (102)
Q Consensus        41 ~qnYFDs~G~FISvv   55 (102)
                      =-+|||++|.|..=.
T Consensus       136 v~~~Fde~G~l~~~~  150 (162)
T PF06703_consen  136 VGKFFDEDGYLVEDL  150 (162)
T ss_pred             hhhEECCCCEEeHHH
Confidence            457999999987543


No 17 
>PF05399 EVI2A:  Ectropic viral integration site 2A protein (EVI2A);  InterPro: IPR008608 This family contains several mammalian ectropic viral integration site 2A (EVI2A) proteins. The function of this protein is unknown although it is thought to be a membrane protein and may function as an oncogene in retrovirus induced myeloid tumours [, ].; GO: 0016021 integral to membrane
Probab=29.32  E-value=39  Score=27.28  Aligned_cols=46  Identities=24%  Similarity=0.332  Sum_probs=33.3

Q ss_pred             CceehhhhHHHHHHHHHHHHHHHHHHhcccccccccccc--cccchhcc
Q psy14123         49 GMFISIVFSIPLLLNCMILVLKRKYTCKEPYPNLKTPLC--LTPLLKNG   95 (102)
Q Consensus        49 G~FISvv~S~PLLl~~~iili~~l~~~~~lmv~~Kraql--~~~~l~~~   95 (102)
                      +|-|-++.-+-|+++|.++.++.+.-. .-+-.+||+++  ||+--+||
T Consensus       129 amLIClIIIAVLfLICT~LfLSTVVLA-NKVS~LKrskQ~gKRqpRSNG  176 (227)
T PF05399_consen  129 AMLICLIIIAVLFLICTLLFLSTVVLA-NKVSSLKRSKQVGKRQPRSNG  176 (227)
T ss_pred             hHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHhhccCCCccc
Confidence            455566666777888888877776644 44567899987  78888887


No 18 
>PF04336 DUF479:  Protein of unknown function, DUF479;  InterPro: IPR007431 This entry contains the Escherichia coli gene yajB, now renamed acpH, which encodes an ACP hydrolase. AcpH converts holo-ACP to apo-ACP by hydrolytic cleavage of the phosphopantetheine prosthetic group from ACP []. A mutant E. coli strain having a total deletion of the acpH grows normally, showing that phosphodiesterase activity is not essential for growth, although it is required for turnover of the ACP prosthetic group in vivo. AcpH is found only in Gram-negative organisms suggesting that it plays a role in some aspect of lipid metabolism that is unique to these organisms. The most obvious of which is biosynthesis of lipid A. Because AcpH is a hydrolase, it could possibly be an editing enzyme that intercepts acyl-ACPs that would give an inappropriate lipid A structure if used as acyl donors []. ; GO: 0008770 [acyl-carrier-protein] phosphodiesterase activity, 0006633 fatty acid biosynthetic process
Probab=28.14  E-value=33  Score=23.37  Aligned_cols=13  Identities=23%  Similarity=0.761  Sum_probs=10.2

Q ss_pred             HHHhhHhhhcccc
Q psy14123         31 FAHNNWRSFARHE   43 (102)
Q Consensus        31 ~aa~nW~~Fs~qn   43 (102)
                      +.|+||+.||++.
T Consensus         4 fLA~~W~~~s~~~   16 (106)
T PF04336_consen    4 FLAKHWSQFSDQP   16 (106)
T ss_pred             HHHHhHHHHCcCC
Confidence            5789999988763


No 19 
>PF15156 CLN6:  Ceroid-lipofuscinosis neuronal protein 6
Probab=27.86  E-value=80  Score=26.10  Aligned_cols=32  Identities=22%  Similarity=0.472  Sum_probs=24.4

Q ss_pred             ccccccCCCCceehhhhHHHHHHHHHHHHHHH
Q psy14123         40 ARHEYFDEKGMFISIVFSIPLLLNCMILVLKR   71 (102)
Q Consensus        40 s~qnYFDs~G~FISvv~S~PLLl~~~iili~~   71 (102)
                      -+--.=||+|.|.---|++-|++.++...-.|
T Consensus       224 rkgl~~dsNGlFl~~SF~~tLllVa~WV~wLW  255 (284)
T PF15156_consen  224 RKGLFLDSNGLFLFYSFALTLLLVALWVAWLW  255 (284)
T ss_pred             ccCcccCCCceEeHHHHHHHHHHHHHHHHHHh
Confidence            34456799999999999888888877655443


No 20 
>PF04030 ALO:  D-arabinono-1,4-lactone oxidase ;  InterPro: IPR007173 This domain is specific to D-arabinono-1,4-lactone oxidase 1.1.3.37 from EC, which is involved in the final step of the D-erythroascorbic acid biosynthesis pathway [].; GO: 0003885 D-arabinono-1,4-lactone oxidase activity, 0055114 oxidation-reduction process, 0016020 membrane; PDB: 2VFU_A 2VFV_A 2VFT_A 2VFS_A 2VFR_A.
Probab=27.18  E-value=15  Score=27.80  Aligned_cols=20  Identities=25%  Similarity=0.589  Sum_probs=11.3

Q ss_pred             hhHhhh--ccccccCCCCceehh
Q psy14123         34 NNWRSF--ARHEYFDEKGMFISI   54 (102)
Q Consensus        34 ~nW~~F--s~qnYFDs~G~FISv   54 (102)
                      .+|..|  .+++ +||+|+|.+-
T Consensus       233 p~~~~F~~~r~~-~DP~g~F~n~  254 (259)
T PF04030_consen  233 PRLDDFLAVRKK-LDPQGVFLND  254 (259)
T ss_dssp             TTHHHHHHHHHH-H-TT-TT--H
T ss_pred             cCHHHHHHHHHH-hCCCCCCCCH
Confidence            567787  2555 5999999874


No 21 
>PF06365 CD34_antigen:  CD34/Podocalyxin family;  InterPro: IPR013836 This family consists of several mammalian CD34 antigen proteins. The CD34 antigen is a human leukocyte membrane protein expressed specifically by lymphohematopoietic progenitor cells. CD34 is a phosphoprotein. Activation of protein kinase C (PKC) has been found to enhance CD34 phosphorylation [, ]. This family contains several eukaryotic podocalyxin proteins. Podocalyxin is a major membrane protein of the glomerular epithelium and is thought to be involved in maintenance of the architecture of the foot processes and filtration slits characteristic of this unique epithelium by virtue of its high negative charge. Podocalyxin functions as an anti-adhesin that maintains an open filtration pathway between neighbouring foot processes in the glomerular epithelium by charge repulsion [].
Probab=25.47  E-value=1.1e+02  Score=24.15  Aligned_cols=29  Identities=24%  Similarity=0.227  Sum_probs=22.3

Q ss_pred             cccccCCCCceehhhhHHHHHHHHHHHHH
Q psy14123         41 RHEYFDEKGMFISIVFSIPLLLNCMILVL   69 (102)
Q Consensus        41 ~qnYFDs~G~FISvv~S~PLLl~~~iili   69 (102)
                      .+.|=|+.++||++|-++-.+++++++..
T Consensus        92 ~~~~~~r~~~lI~lv~~g~~lLla~~~~~  120 (202)
T PF06365_consen   92 HQSSSDRYPTLIALVTSGSFLLLAILLGA  120 (202)
T ss_pred             CCCcCccceEEEehHHhhHHHHHHHHHHH
Confidence            45566667899999999977777666665


No 22 
>PF12400 DUF3661:  Vaculolar membrane protein ;  InterPro: IPR022127  This domain family is found in eukaryotes, and is typically between 123 and 138 amino acids in length. 
Probab=25.41  E-value=2e+02  Score=20.76  Aligned_cols=64  Identities=13%  Similarity=0.225  Sum_probs=35.6

Q ss_pred             hhHHHhhhhhhhhHHHHhhHhhhccccc---cCCC---------CceehhhhHHHHHHHHHHHHHH-HHHHhccccccc
Q psy14123         17 LFILLVYFSENLNEFAHNNWRSFARHEY---FDEK---------GMFISIVFSIPLLLNCMILVLK-RKYTCKEPYPNL   82 (102)
Q Consensus        17 ~ll~lVy~aE~iNe~aa~nW~~Fs~qnY---FDs~---------G~FISvv~S~PLLl~~~iili~-~l~~~~~lmv~~   82 (102)
                      ++...+-..|++-++  .||+++.+-||   =|+-         +.|++++...-+.+..+..... ++....+.+.++
T Consensus        18 il~~~l~~~~~~~~~--~~~~~i~sG~Y~~~g~pp~~~~w~~Ql~iy~~~l~~~K~~v~~i~~~~~~~l~~~~~~lL~~   94 (129)
T PF12400_consen   18 ILYLFLRLVTKLASR--FGWENIESGNYGPPGDPPSFRAWLKQLIIYILILVIMKIIVFLILLIFPPFLNWFGDWLLSW   94 (129)
T ss_pred             HHHHHHHHHHHHHHh--CCcccccccccCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHhhc
Confidence            333444444544444  67999999999   3332         3556666655555555555443 555555544443


No 23 
>PF09779 Ima1_N:  Ima1 N-terminal domain;  InterPro: IPR018617  Members of this family of uncharacterised novel proteins have no known function. 
Probab=24.75  E-value=32  Score=24.62  Aligned_cols=14  Identities=36%  Similarity=0.413  Sum_probs=11.0

Q ss_pred             ccccccCCCCceehh
Q psy14123         40 ARHEYFDEKGMFISI   54 (102)
Q Consensus        40 s~qnYFDs~G~FISv   54 (102)
                      .-.||||++|= |.-
T Consensus        27 ~q~N~f~e~Gd-~~d   40 (131)
T PF09779_consen   27 EQYNGFDEDGD-ITD   40 (131)
T ss_pred             CCccCccccCC-CCC
Confidence            47899999997 543


No 24 
>PLN02844 oxidoreductase/ferric-chelate reductase
Probab=24.52  E-value=89  Score=28.67  Aligned_cols=47  Identities=21%  Similarity=0.441  Sum_probs=32.6

Q ss_pred             HhhhhhhhhHHHHhhH---hhhccccccCCCCceehhhhHHHHHHHHHHHHH
Q psy14123         21 LVYFSENLNEFAHNNW---RSFARHEYFDEKGMFISIVFSIPLLLNCMILVL   69 (102)
Q Consensus        21 lVy~aE~iNe~aa~nW---~~Fs~qnYFDs~G~FISvv~S~PLLl~~~iili   69 (102)
                      ++|.--+-|.| .+.|   +..++..||-.+|... ++||.|+++++++=.+
T Consensus        23 ~~~~~~pt~~~-~~~~~~~~~~~~~t~fg~~g~~~-~~~~~p~~~~a~~~~~   72 (722)
T PLN02844         23 ALWILKPTNLW-TRKWKQAEDSARHTVFGYYGLNF-AVYTFPPIALAIIGLV   72 (722)
T ss_pred             HHheeeCCHHH-HhhhhhHHhcccCceecccchhh-HhHhhHHHHHHHHHHH
Confidence            34444445665 3334   4456788999999865 7899999999876544


No 25 
>PF06413 Neugrin:  Neugrin;  InterPro: IPR010487 This family contains mouse and human neugrin proteins. Neugrin and m-neugrin are mainly expressed in neurons in the nervous system, and are thought to play an important role in the process of neuronal differentiation []. Homologues of neugrin are found in fungi suggesting they have a alternative role or more than one function.
Probab=23.83  E-value=40  Score=26.58  Aligned_cols=13  Identities=38%  Similarity=0.884  Sum_probs=11.0

Q ss_pred             ccccccCCCCcee
Q psy14123         40 ARHEYFDEKGMFI   52 (102)
Q Consensus        40 s~qnYFDs~G~FI   52 (102)
                      -+..|||++|=|.
T Consensus       210 ~G~eFfDsdGnFL  222 (225)
T PF06413_consen  210 KGREFFDSDGNFL  222 (225)
T ss_pred             cCcccCCCCCCee
Confidence            3789999999884


No 26 
>PF01219 DAGK_prokar:  Prokaryotic diacylglycerol kinase;  InterPro: IPR000829 Diacylglycerol kinase (2.7.1.107 from EC) (DAGK) is an enzyme that catalyses the formation of phosphatidic acid from diacylglycerol and ATP, an important step in phospholipid biosynthesis. In bacteria DAGK is very small (13 to 15 kD) membrane protein which seems to contain three transmembrane domains []. The best conserved region, is a stretch of 12 residues which are located in a cytoplasmic loop between the second and third transmembrane domains.; GO: 0004143 diacylglycerol kinase activity, 0008654 phospholipid biosynthetic process, 0016020 membrane; PDB: 2KDC_B.
Probab=23.18  E-value=81  Score=21.90  Aligned_cols=36  Identities=28%  Similarity=0.329  Sum_probs=21.6

Q ss_pred             hhhHhhhccchh------hhHHHhhhhhhhhHHHHhhHhhhc
Q psy14123          5 LVFSIVLKPRPY------LFILLVYFSENLNEFAHNNWRSFA   40 (102)
Q Consensus         5 ~~~~i~l~~~~~------~ll~lVy~aE~iNe~aa~nW~~Fs   40 (102)
                      +.+.+.+...+.      +.+++|..+|-+|+--.+=...-+
T Consensus        29 ~~~~~~l~~s~~ew~~li~~~~~Vl~~EllNTAIE~~vD~v~   70 (104)
T PF01219_consen   29 LIAAFFLGLSPWEWALLILAIFLVLIAELLNTAIERLVDLVS   70 (104)
T ss_dssp             HHHHHH-----SHHHHHHHHHHHHHHHHTHHHHHHHHHTT--
T ss_pred             HHHHHHHHccHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHcc
Confidence            344455555554      668899999999998776666544


No 27 
>PF13209 DUF4017:  Protein of unknown function (DUF4017)
Probab=21.20  E-value=84  Score=20.59  Aligned_cols=31  Identities=26%  Similarity=0.651  Sum_probs=22.2

Q ss_pred             hhhhhhHHHHhhHhhhccccccCCCCceehhhhHHHHHHHHHHHHH
Q psy14123         24 FSENLNEFAHNNWRSFARHEYFDEKGMFISIVFSIPLLLNCMILVL   69 (102)
Q Consensus        24 ~aE~iNe~aa~nW~~Fs~qnYFDs~G~FISvv~S~PLLl~~~iili   69 (102)
                      .||--|..   -|+.|-+|            +++.|+++++.+|..
T Consensus        23 aSegYN~v---gWKlfvGQ------------~YAiPif~i~aiitF   53 (60)
T PF13209_consen   23 ASEGYNTV---GWKLFVGQ------------AYAIPIFIITAIITF   53 (60)
T ss_pred             cccCcccc---chhheecc------------hhHhHHHHHHHHHhh
Confidence            46666654   48877666            568999999887753


No 28 
>PF03620 IBV_3C:  IBV 3C protein;  InterPro: IPR005296 These proteins are the product of ORF 3C from Infectious bronchitis virus. Currently, the function of this protein remains unknown.
Probab=20.81  E-value=83  Score=22.23  Aligned_cols=28  Identities=21%  Similarity=0.405  Sum_probs=19.1

Q ss_pred             ccCCCCceehhhhHHHHHHHHHHHHHHHH
Q psy14123         44 YFDEKGMFISIVFSIPLLLNCMILVLKRK   72 (102)
Q Consensus        44 YFDs~G~FISvv~S~PLLl~~~iili~~l   72 (102)
                      .||++|.|.+.++ .-+=++++.++..-+
T Consensus         8 sleeNG~Flt~lY-v~~gfialYllgk~L   35 (93)
T PF03620_consen    8 SLEENGSFLTALY-VLLGFIALYLLGKAL   35 (93)
T ss_pred             HHHhcCcHHHHHH-HHHHHHHHHHHHHHH
Confidence            4789999999998 334455566655444


No 29 
>PF03140 DUF247:  Plant protein of unknown function;  InterPro: IPR004158 The function of the plant proteins constituting this family is unknown.
Probab=20.28  E-value=55  Score=26.73  Aligned_cols=40  Identities=38%  Similarity=0.693  Sum_probs=31.4

Q ss_pred             hhhhhhHHHHhhHhhh---ccccccCCCCceehhhhHHHHHHH
Q psy14123         24 FSENLNEFAHNNWRSF---ARHEYFDEKGMFISIVFSIPLLLN   63 (102)
Q Consensus        24 ~aE~iNe~aa~nW~~F---s~qnYFDs~G~FISvv~S~PLLl~   63 (102)
                      .-|.+|++-...|+..   -.++||.+--.++|++.++=+++.
T Consensus       348 v~~~l~~~~~~~~~~~~a~l~~~yf~~pw~~is~~aa~~ll~l  390 (391)
T PF03140_consen  348 VCEDLNKYYQSRWNRWWAWLRRKYFSNPWTFISLVAAIILLLL  390 (391)
T ss_pred             HHHHHHHHhcChHHHHHHHHhhhhcCCcHHHHHHHHHHHHHHh
Confidence            4578999999876664   489999999999999887765543


No 30 
>COG3558 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=20.15  E-value=25  Score=26.59  Aligned_cols=18  Identities=33%  Similarity=0.938  Sum_probs=12.4

Q ss_pred             hhhHHHHhhHhhhccccccCCCCce
Q psy14123         27 NLNEFAHNNWRSFARHEYFDEKGMF   51 (102)
Q Consensus        27 ~iNe~aa~nW~~Fs~qnYFDs~G~F   51 (102)
                      ..-+|+.+||+       ||++|+.
T Consensus        99 wfraygnenwe-------fde~glm  116 (154)
T COG3558          99 WFRAYGNENWE-------FDEAGLM  116 (154)
T ss_pred             HHHHhCCcccc-------cchhhHH
Confidence            34566667774       8999875


Done!