Query         psy9638
Match_columns 85
No_of_seqs    100 out of 138
Neff          4.0 
Searched_HMMs 46136
Date          Sat Aug 17 00:49:03 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy9638.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/9638hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 PF07189 SF3b10:  Splicing fact 100.0 1.1E-56 2.4E-61  294.1   9.0   79    3-81      1-79  (79)
  2 KOG3485|consensus              100.0 9.1E-52   2E-56  273.7   7.2   84    1-84      1-85  (86)
  3 PF10022 DUF2264:  Uncharacteri  82.5     3.9 8.4E-05   32.9   5.6   74    3-81    173-255 (361)
  4 PF02909 TetR_C:  Tetracyclin r  53.6      36 0.00078   21.8   4.4   44   27-72      7-50  (139)
  5 COG4519 Uncharacterized protei  53.6      11 0.00024   25.7   1.9   16   28-43     76-91  (95)
  6 PF03181 BURP:  BURP domain;  I  43.2      31 0.00066   26.1   3.1   35   46-84     46-80  (216)
  7 PF13215 DUF4023:  Protein of u  41.8      30 0.00065   20.1   2.3   24    3-26      9-32  (38)
  8 PF04614 Pex19:  Pex19 protein   37.9      12 0.00025   28.5   0.2   16   65-81    201-216 (248)
  9 PF07627 PSCyt3:  Protein of un  37.2      46 0.00099   22.4   3.0   41   42-82      4-45  (101)
 10 PF12616 DUF3775:  Protein of u  36.8      93   0.002   20.0   4.3   52    3-56     11-69  (75)
 11 cd07594 BAR_Endophilin_B The B  31.7 1.1E+02  0.0025   23.2   4.7   32   28-72     29-60  (229)
 12 PF05750 Rubella_Capsid:  Rubel  30.5      23  0.0005   27.8   0.8   12   71-82    192-203 (300)
 13 PF15178 TOM_sub5:  Mitochondri  30.1 1.4E+02  0.0029   18.4   4.1   36   18-57      4-39  (51)
 14 PF01152 Bac_globin:  Bacterial  27.5 1.1E+02  0.0023   19.7   3.4   48   30-80     10-57  (120)
 15 KOG2468|consensus               27.0      26 0.00057   30.1   0.6   10   38-47    423-432 (510)
 16 PF07466 DUF1517:  Protein of u  25.4      67  0.0014   25.2   2.5   46    6-57    119-164 (289)
 17 PF13978 DUF4223:  Protein of u  25.1      51  0.0011   20.6   1.5   13   69-81     43-55  (56)
 18 PF08960 DUF1874:  Domain of un  25.1      22 0.00048   24.4  -0.2   42   25-66     24-67  (105)
 19 PF14198 TnpV:  Transposon-enco  25.0 1.9E+02  0.0042   19.5   4.5   51   28-79     30-80  (111)
 20 KOG3541|consensus               25.0      57  0.0012   27.9   2.2   36   39-74    262-299 (477)
 21 PF07757 AdoMet_MTase:  Predict  24.8      52  0.0011   23.0   1.6   17    2-18      2-18  (112)
 22 cd00252 SPARC_EC SPARC_EC; ext  23.4 1.1E+02  0.0025   20.6   3.1   19   29-47     15-35  (116)
 23 PF13774 Longin:  Regulated-SNA  23.1      19  0.0004   22.1  -0.8   27   47-73     31-57  (83)
 24 PRK11188 rrmJ 23S rRNA methylt  23.0      65  0.0014   23.2   1.9   25   22-46      3-28  (209)
 25 KOG3821|consensus               21.7      64  0.0014   28.1   1.9   45   36-83    306-350 (563)
 26 TIGR02791 VirB5 P-type DNA tra  21.4 1.4E+02   0.003   22.0   3.4   19   34-52     58-76  (220)
 27 PF00284 Cytochrom_B559a:  Lume  20.0      66  0.0014   18.9   1.2   13    2-14     26-38  (40)

No 1  
>PF07189 SF3b10:  Splicing factor 3B subunit 10 (SF3b10);  InterPro: IPR009846 This family consists of several eukaryotic splicing factor 3B subunit 5 (SF3b5) proteins. SF3b5 is a 10 kDa subunit of the splicing factor SF3b. SF3b associates with the splicing factor SF3a and a 12S RNA unit to form the U2 small nuclear ribonucleoproteins complex. SF3b5 and SF3b14b are also thought to facilitate the interaction of U2 with the branch site []. Also included in this entry is RDS3 complex subunit 10, another protein involved in mRNA splicing [].
Probab=100.00  E-value=1.1e-56  Score=294.07  Aligned_cols=79  Identities=72%  Similarity=1.230  Sum_probs=78.4

Q ss_pred             hhhhhHHHHHHHHhhccCCCCCCCchHHHHHHhhhHHHHhhhCChhHHHHHHHhcCchHHHHHHHHHHHhhCCCCCCCC
Q psy9638           3 DRYNIYSQLEHLQSKYVGTGHADTTKFEWLVNIHRDSYASYMGHYNMLNLFSINENEAKARVRFNFMERMLQPCGPPPE   81 (85)
Q Consensus         3 dr~~~~~Qle~Lq~KY~GtG~~dtTk~EW~tn~hRDT~aS~~gH~~lL~Y~aia~nes~~r~r~~llekM~~pcgppp~   81 (85)
                      ||+++++||||||+||+||||+||||+||++||||||||||+||++||+|||||+|+|++|+|++||+||+|||||||+
T Consensus         1 Dk~~~~~qle~Lq~KY~GtG~~dTTk~EW~tnihRDT~aS~~gH~~lL~Y~aia~ne~~~r~r~~ll~kM~~p~g~pp~   79 (79)
T PF07189_consen    1 DKYRIQQQLEHLQSKYVGTGHADTTKEEWLTNIHRDTYASIIGHPDLLEYFAIAENESKARVRFNLLEKMVQPCGPPPP   79 (79)
T ss_pred             ChhhHHHHHHHHHHHhCCCCCCCcCHHHHHHHHHHHHHHHHhcCHHHHHHHHHhcCCCHHHHHHHHHHHHhccCCCCCC
Confidence            8999999999999999999999999999999999999999999999999999999999999999999999999999996


No 2  
>KOG3485|consensus
Probab=100.00  E-value=9.1e-52  Score=273.74  Aligned_cols=84  Identities=81%  Similarity=1.336  Sum_probs=80.7

Q ss_pred             CchhhhhHHHHHHHHhhccCCCCCCCchHHHHHHhhhHHHHhhhCChhHHHHHHHhcCchHHHHHHHHHHHhhCCCCC-C
Q psy9638           1 MGDRYNIYSQLEHLQSKYVGTGHADTTKFEWLVNIHRDSYASYMGHYNMLNLFSINENEAKARVRFNFMERMLQPCGP-P   79 (85)
Q Consensus         1 M~dr~~~~~Qle~Lq~KY~GtG~~dtTk~EW~tn~hRDT~aS~~gH~~lL~Y~aia~nes~~r~r~~llekM~~pcgp-p   79 (85)
                      |+|||++++||||||+||+||||+||||+||++||||||++|+|||+++|.||+||||||++|+|+|+|+||++|||| |
T Consensus         1 ~~dRf~i~aqLEhLQskYvGtg~a~~tk~ew~vnq~RdS~~S~vgh~~~l~Y~a~ae~Ep~~rvr~N~lekml~pcg~wp   80 (86)
T KOG3485|consen    1 MGDRFNIHAQLEHLQSKYVGTGHADTTKFEWLVNQHRDSLASYVGHYPLLNYFAIAENEPKARVRFNLLEKMLQPCGPWP   80 (86)
T ss_pred             CcchhhHHHHHHHHHHHhhcccccccchHHHHHhcchhhhhhhcCCchHHHHHHHhccCchhhhhhcHHHHhhcccCCCC
Confidence            899999999999999999999999999999999999999999999999999999999999999999999999999999 5


Q ss_pred             CCCCC
Q psy9638          80 PEKPE   84 (85)
Q Consensus        80 p~~~~   84 (85)
                      |.+++
T Consensus        81 p~~e~   85 (86)
T KOG3485|consen   81 PEKEE   85 (86)
T ss_pred             Ccccc
Confidence            55544


No 3  
>PF10022 DUF2264:  Uncharacterized protein conserved in bacteria (DUF2264);  InterPro: IPR016624 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=82.55  E-value=3.9  Score=32.92  Aligned_cols=74  Identities=28%  Similarity=0.385  Sum_probs=56.8

Q ss_pred             hhhhhHHHHHHHHhhccCCCCCCCchHHHHHHhhhHHHHhhhCChhHHHHHHHhcCchHH---HHHH------HHHHHhh
Q psy9638           3 DRYNIYSQLEHLQSKYVGTGHADTTKFEWLVNIHRDSYASYMGHYNMLNLFSINENEAKA---RVRF------NFMERML   73 (85)
Q Consensus         3 dr~~~~~Qle~Lq~KY~GtG~~dtTk~EW~tn~hRDT~aS~~gH~~lL~Y~aia~nes~~---r~r~------~llekM~   73 (85)
                      |..++..-++.+.+=|+|=|=-.-..     +-|.|=|+|.+-|+-+|-|..++....+.   +.|.      +-.++|+
T Consensus       173 d~~~i~~~l~~~e~~Y~GdGWY~DG~-----~~~~DYYns~aih~y~l~~~~~~~~~~~~~~~~~~~Ra~~fa~~~~~~f  247 (361)
T PF10022_consen  173 DEERIDYDLERIEEWYLGDGWYSDGP-----EFQFDYYNSWAIHPYLLLYARLMGDEDPERAARYRQRAQRFAEDYERMF  247 (361)
T ss_pred             cHHHHHHHHHHHHHHhccCCccccCC-----ccCCcchHHHHHHHHHHHHHHHhcccCHHHHHHHHHHHHHHHHHHHHHc
Confidence            55677888999999999998653332     66999999999999999999999877532   3332      2357899


Q ss_pred             CCCCCCCC
Q psy9638          74 QPCGPPPE   81 (85)
Q Consensus        74 ~pcgppp~   81 (85)
                      .|.|-+|+
T Consensus       248 ~~dG~~~~  255 (361)
T PF10022_consen  248 SPDGAAPP  255 (361)
T ss_pred             CCCCCcCC
Confidence            99996654


No 4  
>PF02909 TetR_C:  Tetracyclin repressor, C-terminal all-alpha domain;  InterPro: IPR004111 The antibiotic tetracycline has a broad spectrum of activity, acting to inhibit bacterial protein synthesis by binding to the 30S ribosomal subunit, which prevents the association of the aminoacyl-tRNA to the ribosomal acceptor A site. Tetracycline binding is reversible, therefore diluting out the antibiotic can reverse its effects. Tetracycline resistance genes are often located on mobile elements, such as plasmids, transposons and/or conjugative transposons, which can sometimes be transferred between bacterial species. In certain cases, tetracycline can enhance the transfer of these elements, thereby promoting resistance amongst a bacterial colony. There are three types of tetracycline resistance: tetracycline efflux, ribosomal protection, and tetracycline modification [, ]:    Tetracycline efflux proteins belong to the major facilitator superfamily. Efflux proteins are membrane-associated proteins that recognise and export tetracycline from the cell. They are found in both Gram-positive and Gram-negative bacteria []. There are at least 22 different tetracycline efflux proteins, grouped according to sequence similarity: Group 1 are Tet(A), Tet(B), Tet(C), Tet(D), Tet(E), Tet(G), Tet(H), Tet(J), Tet(Z) and Tet(30); Group 2 are Tet(K) and Tet(L); Group 3 are Otr(B) and Tcr(3); Group 4 is TetA(P); Group 5 is Tet(V). In addition, there are the efflux proteins Tet(31), Tet(33), Tet(V), Tet(Y), Tet(34), and Tet(35).     Ribosomal protection proteins are cytoplasmic proteins that display homology with the elongation factors EF-Tu and EF-G. Protection proteins bind the ribosome, causing an alteration in ribosomal conformation that prevents tetracycline from binding. There are at least ten ribosomal protection proteins: Tet(M), Tet(O), Tet(S), Tet(W), Tet(32), Tet(36), Tet(Q), Tet(T), Otr(A), and TetB(P). Both Tet(M) and Tet(O) have ribosome-dependent GTPase activity, the hydrolysis of GTP providing the energy for the ribosomal conformational changes.      Tetracycline modification proteins include the enzymes Tet(37) and Tet(X), both of which inactivate tetracycline. In addition, there are the tetracycline resistance proteins Tet(U) and Otr(C).   The expression of several of these tet genes is controlled by a family of tetracycline transcriptional regulators known as TetR. TetR family regulators are involved in the transcriptional control of multidrug efflux pumps, pathways for the biosynthesis of antibiotics, response to osmotic stress and toxic chemicals, control of catabolic pathways, differentiation processes, and pathogenicity []. The TetR proteins identified in over 115 genera of bacteria and archaea share a common helix-turn-helix (HTH) structure in their DNA-binding domain. However, TetR proteins can work in different ways: they can bind a target operator directly to exert their effect (e.g. TetR binds Tet(A) gene to repress it in the absence of tetracycline), or they can be involved in complex regulatory cascades in which the TetR protein can either be modulated by another regulator or TetR can trigger the cellular response.  This entry represents the C-terminal domain found in the tetracycline transcriptional repressor TetR, which binds to the Tet(A) gene to repress its expression in the absence of tetracycline []. Tet(A) is a membrane-associated efflux protein that exports tetracycline from the cell before it can attach to ribosomes and inhibit polypeptide chain growth. TetR occurs as a homodimer and uses two helix-turn-helix (HTH) motifs to bind tandem DNA operators, thereby blocking the expression of the associated genes, TetA and TetR. The structure of the class D TetR repressor protein [] involves 10 alpha-helices, with connecting turns and loops. The three N-terminal helices constitute the DNA-binding HTH domain, which has an inverse orientation compared with HTH motifs in other DNA-binding proteins. The core of the protein, formed by helices 5-10, is responsible for dimerisation and contains, for each monomer, a binding pocket that accommodates tetracycline in the presence of a divalent cation.; GO: 0045892 negative regulation of transcription, DNA-dependent; PDB: 2Y30_B 3ZQL_C 2Y31_B 2Y2Z_A 2VPR_A 3B6A_A 3B6C_A 2OPT_A 2NS7_B 2NS8_C ....
Probab=53.62  E-value=36  Score=21.84  Aligned_cols=44  Identities=23%  Similarity=0.209  Sum_probs=32.4

Q ss_pred             chHHHHHHhhhHHHHhhhCChhHHHHHHHhcCchHHHHHHHHHHHh
Q psy9638          27 TKFEWLVNIHRDSYASYMGHYNMLNLFSINENEAKARVRFNFMERM   72 (85)
Q Consensus        27 Tk~EW~tn~hRDT~aS~~gH~~lL~Y~aia~nes~~r~r~~llekM   72 (85)
                      .=.||+...-+...+.+.-||.+..+++-....+...++  ++|.+
T Consensus         7 ~W~~~l~~~a~~~r~~~~~hP~~~~~~~~~~~~~p~~l~--~~e~~   50 (139)
T PF02909_consen    7 DWRERLRALARAYRAALLRHPWLAELLLARPPPGPNALR--LMEAM   50 (139)
T ss_dssp             EHHHHHHHHHHHHHHHHHTSTTHHHHHHTSSCTSHHHHH--HHHHH
T ss_pred             CHHHHHHHHHHHHHHHHHHCcCHHHHHHhcCCCChhHHH--HHHHH
Confidence            457899999999999999999999998655444444333  44443


No 5  
>COG4519 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=53.58  E-value=11  Score=25.67  Aligned_cols=16  Identities=38%  Similarity=0.655  Sum_probs=14.6

Q ss_pred             hHHHHHHhhhHHHHhh
Q psy9638          28 KFEWLVNIHRDSYASY   43 (85)
Q Consensus        28 k~EW~tn~hRDT~aS~   43 (85)
                      +.||++.+|||..|++
T Consensus        76 ~~~Wv~E~h~~IaAal   91 (95)
T COG4519          76 RREWVVEQHRRIAAAL   91 (95)
T ss_pred             hHHHHHHHHHHHHHHh
Confidence            6799999999999986


No 6  
>PF03181 BURP:  BURP domain;  InterPro: IPR004873 The BURP domain is a ~230-residue module, which has been named for the four members of the group initially identified, BNM2, USP, RD22, and PG1beta. It is found in the C-terminal part of a number of plant cell wall proteins, which are defined not only by the BURP domain, but also by the overall similarity in their modular construction. The BURP domain proteins consists of either three or four modules: (i) an N-terminal hydrophobic domain - a presumptive transit peptide, joined to (ii) a short conserved segment or other short segment, (iii) an optional segment consisting of repeated units which is unique to each member, and (iv) the C-terminal BURP domain. Although the BURP domain proteins share primary structural features, their expression patterns and the conditions under which they are expressed differ. The presence of the conserved BURP domain in diverse plant proteins suggests an important and fundamental functional role for this domain []. It is possible that the BURP domain represents a general motif for localization of proteins within the cell wall matrix. The other structural domains associated with the BURP domain may specify other target sites for intermolecular interactions []. Some proteins known to contain a BURP domain are listed below [, , ]:  Brassica protein BNM2, which is expressed during the induction of microspore embryogenesis. Field bean USPs, abundant non-storage seed proteins with unknown function. Soybean USP-like proteins ADR6 (or SALI5-4A), an auxin-repressible, aluminium-inducible protein and SALI3-2, a protein that is up-regulated by aluminium. Soybean seed coat BURP-domain protein 1 (SCB1). It might play a role in the differentiation of the seed coat parenchyma cells. Arabidopsis RD22 drought induced protein. Maize ZRP2, a protein of unknown function in cortex parenchyma. Tomato PG1beta, the beta-subunit of polygalacturonase isozyme 1 (PG1), which is expressed in ripening fruits. Cereal RAFTIN. It is essential specifically for the maturation phase of pollen development.  
Probab=43.15  E-value=31  Score=26.12  Aligned_cols=35  Identities=34%  Similarity=0.512  Sum_probs=26.2

Q ss_pred             ChhHHHHHHHhcCchHHHHHHHHHHHhhCCCCCCCCCCC
Q psy9638          46 HYNMLNLFSINENEAKARVRFNFMERMLQPCGPPPEKPE   84 (85)
Q Consensus        46 H~~lL~Y~aia~nes~~r~r~~llekM~~pcgppp~~~~   84 (85)
                      -+.+|.+|+|..|-..+..-.+-|+    -|..||.+-|
T Consensus        46 l~~iL~~Fsi~~~S~~A~~m~~Tl~----~Ce~~~~~GE   80 (216)
T PF03181_consen   46 LPEILQMFSIPPGSPMAKAMKNTLE----ECESPPIKGE   80 (216)
T ss_pred             HHHHHHHhcCCCCCHHHHHHHHHHH----HhhcCCCCCc
Confidence            4789999999999877755544444    4998887655


No 7  
>PF13215 DUF4023:  Protein of unknown function (DUF4023)
Probab=41.83  E-value=30  Score=20.11  Aligned_cols=24  Identities=17%  Similarity=0.281  Sum_probs=18.6

Q ss_pred             hhhhhHHHHHHHHhhccCCCCCCC
Q psy9638           3 DRYNIYSQLEHLQSKYVGTGHADT   26 (85)
Q Consensus         3 dr~~~~~Qle~Lq~KY~GtG~~dt   26 (85)
                      ||++..+.-...-.+..|.||++-
T Consensus         9 ~Kl~e~Q~K~e~Nk~~qG~G~P~~   32 (38)
T PF13215_consen    9 EKLNETQEKQEKNKKHQGKGNPSK   32 (38)
T ss_pred             HHHHHHHHHHHHHHhccCCCCchh
Confidence            677777777677777899999863


No 8  
>PF04614 Pex19:  Pex19 protein family;  InterPro: IPR006708  Peroxisome(s) form an intracellular compartment, bounded by a typical lipid bilayer membrane. Peroxisome functions are often specialised by organism and cell type; two widely distributed and well-conserved functions are H2O2-based respiration and fatty acid beta-oxidation. Other functions include ether lipid (plasmalogen) synthesis and cholesterol synthesis in animals, the glyoxylate cycle in germinating seeds ("glyoxysomes"), photorespiration in leaves, glycolysis in trypanosomes ("glycosomes"), and methanol and/or amine oxidation and assimilation in some yeasts.  PEX genes encode the machinery ("peroxins") required to assemble the peroxisome. Membrane assembly and maintenance requires three of these (peroxins 3, 16, and 19) and may occur without the import of the matrix (lumen) enzymes. Matrix protein import follows a branched pathway of soluble recycling receptors, with one branch for each class of peroxisome targeting sequence (two are well characterised), and a common trunk for all. At least one of these receptors, Pex5p, enters and exits peroxisomes as it functions. Proliferation of the organelle is regulated by Pex11p. Peroxisome biogenesis is remarkably conserved among eukaryotes. A group of fatal, inherited neuropathologies are recognised as peroxisome biogenesis diseases. ; GO: 0005777 peroxisome; PDB: 2WL8_B 2W85_B.
Probab=37.94  E-value=12  Score=28.54  Aligned_cols=16  Identities=44%  Similarity=0.806  Sum_probs=8.1

Q ss_pred             HHHHHHHhhCCCCCCCC
Q psy9638          65 RFNFMERMLQPCGPPPE   81 (85)
Q Consensus        65 r~~llekM~~pcgppp~   81 (85)
                      =..|+.+| |-||.||.
T Consensus       201 i~~lmqem-Q~~G~PP~  216 (248)
T PF04614_consen  201 IMELMQEM-QELGQPPE  216 (248)
T ss_dssp             HHHHHHHH-HHT----G
T ss_pred             HHHHHHHH-HHcCCCcH
Confidence            34577777 66999996


No 9  
>PF07627 PSCyt3:  Protein of unknown function (DUF1588);  InterPro: IPR013039  A region of similarity shared by several Rhodopirellula baltica cytochrome-like proteins that are predicted to be secreted. These proteins also contain IPR011478 from INTERPRO, IPR013036 from INTERPRO, IPR013042 from INTERPRO and IPR013043 from INTERPRO.
Probab=37.21  E-value=46  Score=22.42  Aligned_cols=41  Identities=20%  Similarity=0.199  Sum_probs=29.3

Q ss_pred             hhhCChhHHHHHHHhcC-chHHHHHHHHHHHhhCCCCCCCCC
Q psy9638          42 SYMGHYNMLNLFSINEN-EAKARVRFNFMERMLQPCGPPPEK   82 (85)
Q Consensus        42 S~~gH~~lL~Y~aia~n-es~~r~r~~llekM~~pcgppp~~   82 (85)
                      -+.+|.++|+-.|-... .++.|=++-+=.=|.+|.+|||+.
T Consensus         4 GlLt~~~~Lt~~s~~~~tsPv~RG~~v~~~lLc~~~ppPP~~   45 (101)
T PF07627_consen    4 GLLTQGAFLTRTSDGDRTSPVHRGVWVRERLLCQPPPPPPPN   45 (101)
T ss_pred             hhhhhHHHHhccCCCCCCCchHHHHHHHHHHcCCCCCCCCCC
Confidence            46788888888887664 466666666666677888888863


No 10 
>PF12616 DUF3775:  Protein of unknown function (DUF3775);  InterPro: IPR022254  This domain family is found in bacteria, and is approximately 80 amino acids in length. There is a single completely conserved residue G that may be functionally important. 
Probab=36.85  E-value=93  Score=19.98  Aligned_cols=52  Identities=17%  Similarity=0.217  Sum_probs=38.2

Q ss_pred             hhhhhHHHHHHHHhhccCCCCCCCchHHHHH--Hhh-----hHHHHhhhCChhHHHHHHHh
Q psy9638           3 DRYNIYSQLEHLQSKYVGTGHADTTKFEWLV--NIH-----RDSYASYMGHYNMLNLFSIN   56 (85)
Q Consensus         3 dr~~~~~Qle~Lq~KY~GtG~~dtTk~EW~t--n~h-----RDT~aS~~gH~~lL~Y~aia   56 (85)
                      +.++..+|.+.+---|+|=|  |-+-.||-.  ...     ..|..=++|+|.+-.|+.=+
T Consensus        11 ~~l~~deqaeLvALmwiGRG--d~~~eew~~a~~~A~~~~~~~ta~YLl~~p~ladyLe~G   69 (75)
T PF12616_consen   11 EDLNEDEQAELVALMWIGRG--DFEAEEWEEAVAEARERASARTADYLLGTPMLADYLEEG   69 (75)
T ss_pred             HhCCHHHHHHHHHHHHhcCC--CCCHHHHHHHHHHHHHhccchHHHHHHcCCcHHHHHHHH
Confidence            56788889999999999999  666666643  222     23455589999999997543


No 11 
>cd07594 BAR_Endophilin_B The Bin/Amphiphysin/Rvs (BAR) domain of Endophilin-B. BAR domains are dimerization, lipid binding and curvature sensing modules found in many different proteins with diverse functions. Endophilins play roles in synaptic vesicle formation, virus budding, mitochondrial morphology maintenance, receptor-mediated endocytosis inhibition, and endosomal sorting. Endophilins contain an N-terminal N-BAR domain (BAR domain with an additional N-terminal amphipathic helix), followed by a variable region containing proline clusters, and a C-terminal SH3 domain. They are classified into two types, A and B. Vertebrates contain two endophilin-B isoforms. Endophilin-B proteins are cytoplasmic proteins expressed mainly in the heart, placenta, and skeletal muscle.
Probab=31.74  E-value=1.1e+02  Score=23.17  Aligned_cols=32  Identities=22%  Similarity=0.255  Sum_probs=26.8

Q ss_pred             hHHHHHHhhhHHHHhhhCChhHHHHHHHhcCchHHHHHHHHHHHh
Q psy9638          28 KFEWLVNIHRDSYASYMGHYNMLNLFSINENEAKARVRFNFMERM   72 (85)
Q Consensus        28 k~EW~tn~hRDT~aS~~gH~~lL~Y~aia~nes~~r~r~~llekM   72 (85)
                      ..+|..+|++.|-+-+.+.|+.             |++-.+.+||
T Consensus        29 ~~~~~e~i~~~~~~~lqpNp~~-------------r~~~~~~~k~   60 (229)
T cd07594          29 TKVWTEKILKQTEAVLQPNPNV-------------RVEDFIYEKL   60 (229)
T ss_pred             HHHHHHHHHHHHHHHhCCChhh-------------hHHHHHHHHh
Confidence            3589999999999999988864             7777777887


No 12 
>PF05750 Rubella_Capsid:  Rubella capsid protein;  InterPro: IPR008819 Rubella virus is an enveloped positive-strand RNA virus of the family Togaviridae. Virions are composed of three structural proteins: a capsid and two membrane-spanning glycoproteins, E2 and E1. During virus assembly, the capsid interacts with genomic RNA to form nucleocapsids. It has been discovered that capsid phosphorylation serves to negatively regulate binding of viral genomic RNA. This may delay the initiation of nucleocapsid assembly until sufficient amounts of virus glycoproteins accumulate at the budding site and/or prevent non-specific binding to cellular RNA when levels of genomic RNA are low. It follows that at a late stage in replication, the capsid may undergo dephosphorylation before nucleocapsid assembly occurs []. This family is found together with IPR008820 from INTERPRO and IPR008821 from INTERPRO.; GO: 0016021 integral to membrane, 0019013 viral nucleocapsid
Probab=30.50  E-value=23  Score=27.76  Aligned_cols=12  Identities=50%  Similarity=1.173  Sum_probs=9.8

Q ss_pred             HhhCCCCCCCCC
Q psy9638          71 RMLQPCGPPPEK   82 (85)
Q Consensus        71 kM~~pcgppp~~   82 (85)
                      -|..||||-|+.
T Consensus       192 lmynpcgpeppa  203 (300)
T PF05750_consen  192 LMYNPCGPEPPA  203 (300)
T ss_pred             hhcCCCCCCChh
Confidence            478999998874


No 13 
>PF15178 TOM_sub5:  Mitochondrial import receptor subunit TOM5 homolog
Probab=30.11  E-value=1.4e+02  Score=18.35  Aligned_cols=36  Identities=19%  Similarity=0.180  Sum_probs=27.0

Q ss_pred             ccCCCCCCCchHHHHHHhhhHHHHhhhCChhHHHHHHHhc
Q psy9638          18 YVGTGHADTTKFEWLVNIHRDSYASYMGHYNMLNLFSINE   57 (85)
Q Consensus        18 Y~GtG~~dtTk~EW~tn~hRDT~aS~~gH~~lL~Y~aia~   57 (85)
                      -.|+| +-..-+|--....+|+++|+   .+.|-|+|+-.
T Consensus         4 ~egl~-pk~DPeE~k~kmR~dvissv---rnFliyVALlR   39 (51)
T PF15178_consen    4 IEGLG-PKMDPEEMKRKMREDVISSV---RNFLIYVALLR   39 (51)
T ss_pred             cccCC-CCCCHHHHHHHHHHHHHHHH---HHHHHHHHHHH
Confidence            34666 34456788888999999998   57888888753


No 14 
>PF01152 Bac_globin:  Bacterial-like globin;  InterPro: IPR001486 Globins are haem-containing proteins involved in binding and/or transporting oxygen. They belong to a very large and well studied family that is widely distributed in many organisms []. Globins have evolved from a common ancestor and can be divided into three groups: single-domain globins, and two types of chimeric globins, flavohaemoglobins and globin-coupled sensors. Bacteria have all three types of globins, while archaea lack flavohaemoglobins, and eukaryotes lack globin-coupled sensors []. Several functionally different haemoglobins can coexist in the same species. The major types of globins include:   Haemoglobin (Hb): trimer of two alpha and two beta chains, although embryonic and foetal forms can substitute the alpha or beta chain for ones with higher oxygen affinity, such as gamma, delta, epsilon or zeta chains. Hb transports oxygen from lungs to other tissues in vertebrates []. Hb proteins are also present in unicellular organisms where they act as enzymes or sensors []. Myoglobin (Mb): monomeric protein responsible for oxygen storage in vertebrate muscle [].  Neuroglobin: a myoglobin-like haemprotein expressed in vertebrate brain and retina, where it is involved in neuroprotection from damage due to hypoxia or ischemia []. Neuroglobin belongs to a branch of the globin family that diverged early in evolution.  Cytoglobin: an oxygen sensor expressed in multiple tissues. Related to neuroglobin []. Erythrocruorin: highly cooperative extracellular respiratory proteins found in annelids and arthropods that are assembled from as many as 180 subunit into hexagonal bilayers []. Leghaemoglobin (legHb or symbiotic Hb): occurs in the root nodules of leguminous plants, where it facilitates the diffusion of oxygen to symbiotic bacteriods in order to promote nitrogen fixation. Non-symbiotic haemoglobin (NsHb): occurs in non-leguminous plants, and can be over-expressed in stressed plants []. Flavohaemoglobins (FHb): chimeric, with an N-terminal globin domain and a C-terminal ferredoxin reductase-like NAD/FAD-binding domain. FHb provides protection against nitric oxide via its C-terminal domain, which transfers electrons to haem in the globin []. Globin-coupled sensors: chimeric, with an N-terminal myoglobin-like domain and a C-terminal domain that resembles the cytoplasmic signalling domain of bacterial chemoreceptors. They bind oxygen, and act to initiate an aerotactic response or regulate gene expression [, ].  Protoglobin: a single domain globin found in archaea that is related to the N-terminal domain of globin-coupled sensors []. Truncated 2/2 globin: lack the first helix, giving them a 2-over-2 instead of the canonical 3-over-3 alpha-helical sandwich fold. Can be divided into three main groups (I, II and II) based on structural features [].   This entry represents a group of haemoglobin-like proteins found in eubacteria, cyanobacteria, protozoa, algae and plants, but not in animals or yeast. These proteins have a truncated 2-over-2 rather than the canonical 3-over-3 alpha-helical sandwich fold []. This entry includes:   HbN (or GlbN): a truncated haemoglobin-like protein that binds oxygen cooperatively with a very high affinity and a slow dissociation rate, which may exclude it from oxygen transport. It appears to be involved in bacterial nitric oxide detoxification and in nitrosative stress []. Cyanoglobin (or GlbN): a truncated haemoprotein found in cyanobacteria that has high oxygen affinity, and which appears to serve as part of a terminal oxidase, rather than as a respiratory pigment []. HbO (or GlbO): a truncated haemoglobin-like protein with a lower oxygen affinity than HbN. HbO associates with the bacterial cell membrane, where it significantly increases oxygen uptake over membranes lacking this protein. HbO appears to interact with a terminal oxidase, and could participate in an oxygen/electron-transfer process that facilitates oxygen transfer during aerobic metabolism []. Glb3: a nuclear-encoded truncated haemoglobin from plants that appears more closely related to HbO than HbN. Glb3 from Arabidopsis thaliana (Mouse-ear cress) exhibits an unusual concentration-independent binding of oxygen and carbon dioxide [].  ; GO: 0019825 oxygen binding, 0015671 oxygen transport; PDB: 2BKM_B 1UVY_A 1DLW_A 2XYK_B 2IG3_A 2GKM_B 1S61_A 1S56_B 1RTE_B 2GLN_A ....
Probab=27.55  E-value=1.1e+02  Score=19.72  Aligned_cols=48  Identities=17%  Similarity=0.183  Sum_probs=34.6

Q ss_pred             HHHHHhhhHHHHhhhCChhHHHHHHHhcCchHHHHHHHHHHHhhCCCCCCC
Q psy9638          30 EWLVNIHRDSYASYMGHYNMLNLFSINENEAKARVRFNFMERMLQPCGPPP   80 (85)
Q Consensus        30 EW~tn~hRDT~aS~~gH~~lL~Y~aia~nes~~r~r~~llekM~~pcgppp   80 (85)
                      |=+..+...-|.-+.-++.+..||.   +......|..+.+=+++=+|.|+
T Consensus        10 ~~I~~lv~~fY~rv~~d~~l~~~F~---~~d~~~~~~~~~~fl~~~~GGp~   57 (120)
T PF01152_consen   10 EGIRALVDAFYDRVLADPRLKPFFE---GIDLEKHKEKQAEFLSQLLGGPP   57 (120)
T ss_dssp             HHHHHHHHHHHHHHHT-TTTGGGGT---TSCHHHHHHHHHHHHHHHTTSSS
T ss_pred             HHHHHHHHHHHHHHHcCHHHHhhcC---CCCHHHHHHHHHHHHHHHhCCCC
Confidence            3356667778888888888888887   55666777777777777788775


No 15 
>KOG2468|consensus
Probab=27.01  E-value=26  Score=30.11  Aligned_cols=10  Identities=40%  Similarity=0.594  Sum_probs=9.1

Q ss_pred             HHHHhhhCCh
Q psy9638          38 DSYASYMGHY   47 (85)
Q Consensus        38 DT~aS~~gH~   47 (85)
                      ||.||++||.
T Consensus       423 DTmASiiG~r  432 (510)
T KOG2468|consen  423 DTMASIIGKR  432 (510)
T ss_pred             hHHHHHHhhh
Confidence            8999999985


No 16 
>PF07466 DUF1517:  Protein of unknown function (DUF1517);  InterPro: IPR010903 This family consists of several hypothetical glycine rich plant and bacterial proteins of around 300 residues in length. The function of this family is unknown.
Probab=25.41  E-value=67  Score=25.18  Aligned_cols=46  Identities=17%  Similarity=0.192  Sum_probs=39.9

Q ss_pred             hhHHHHHHHHhhccCCCCCCCchHHHHHHhhhHHHHhhhCChhHHHHHHHhc
Q psy9638           6 NIYSQLEHLQSKYVGTGHADTTKFEWLVNIHRDSYASYMGHYNMLNLFSINE   57 (85)
Q Consensus         6 ~~~~Qle~Lq~KY~GtG~~dtTk~EW~tn~hRDT~aS~~gH~~lL~Y~aia~   57 (85)
                      ..|..|+.|-.+      +||+.-||+..+-.+|.-.+.=|+...-|-...-
T Consensus       119 ~lQ~~L~~iA~~------adt~s~~Gl~~~L~E~~l~LLR~~~~wv~~~~~~  164 (289)
T PF07466_consen  119 SLQRDLNRIAET------ADTSSPEGLARVLQETTLALLRHPEYWVYGSSES  164 (289)
T ss_pred             HHHHHHHHHHHh------cCCCChhHHHHHHHHHHHHHHhCcCceEEEeecc
Confidence            467788888766      8999999999999999999999999988876554


No 17 
>PF13978 DUF4223:  Protein of unknown function (DUF4223)
Probab=25.11  E-value=51  Score=20.60  Aligned_cols=13  Identities=23%  Similarity=0.925  Sum_probs=10.5

Q ss_pred             HHHhhCCCCCCCC
Q psy9638          69 MERMLQPCGPPPE   81 (85)
Q Consensus        69 lekM~~pcgppp~   81 (85)
                      |.||+-.|||+.+
T Consensus        43 ISKiIGGCGp~~~   55 (56)
T PF13978_consen   43 ISKIIGGCGPAAQ   55 (56)
T ss_pred             HHHHhcCCCCccc
Confidence            5699999999754


No 18 
>PF08960 DUF1874:  Domain of unknown function (DUF1874);  InterPro: IPR015055 This entry is represented by Acidianus two-tailed virus, Orf117. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches.; PDB: 2J85_A 2X4I_C.
Probab=25.10  E-value=22  Score=24.44  Aligned_cols=42  Identities=19%  Similarity=0.173  Sum_probs=22.6

Q ss_pred             CCchHHHHHHhhhHHHHhhhCChhHHHHHHHhcCc--hHHHHHH
Q psy9638          25 DTTKFEWLVNIHRDSYASYMGHYNMLNLFSINENE--AKARVRF   66 (85)
Q Consensus        25 dtTk~EW~tn~hRDT~aS~~gH~~lL~Y~aia~ne--s~~r~r~   66 (85)
                      +.|..|.-..++-+.+-|.|||.+-.+.++---|.  +..|+.+
T Consensus        24 ~it~~Ea~~~l~~~~~vSaIGH~sTa~lls~llg~~ip~NR~~i   67 (105)
T PF08960_consen   24 KITVEEAKELLQNEGFVSAIGHDSTAQLLSELLGVNIPMNRIQI   67 (105)
T ss_dssp             EE-HHHHHHHHCTTTEEE----HHHHHHHHHHHT----------
T ss_pred             eCCHHHHHHHHhcCceEEeeCcHHHHHHHHHHhCCccccccEEE
Confidence            45777777777889999999999988877755554  5555443


No 19 
>PF14198 TnpV:  Transposon-encoded protein TnpV
Probab=25.04  E-value=1.9e+02  Score=19.52  Aligned_cols=51  Identities=14%  Similarity=0.157  Sum_probs=41.5

Q ss_pred             hHHHHHHhhhHHHHhhhCChhHHHHHHHhcCchHHHHHHHHHHHhhCCCCCC
Q psy9638          28 KFEWLVNIHRDSYASYMGHYNMLNLFSINENEAKARVRFNFMERMLQPCGPP   79 (85)
Q Consensus        28 k~EW~tn~hRDT~aS~~gH~~lL~Y~aia~nes~~r~r~~llekM~~pcgpp   79 (85)
                      +.+||...++-.|..++--..|.+|++=.+.+...++ -+++++|....|.|
T Consensus        30 ~~~yLke~~p~~Y~~ll~~g~L~~~l~eid~~A~e~~-e~l~~q~~~~~gvt   80 (111)
T PF14198_consen   30 RKRYLKEHKPILYNNLLLSGKLNEHLAEIDEQAQERF-ERLVEQMAEKEGVT   80 (111)
T ss_pred             HHHHHHHhHHHHHHHHHHcchHHHHHHHHHHHHHHHH-HHHHHHHHHhcCCc
Confidence            3589999999999999999999999987776655443 46788888888876


No 20 
>KOG3541|consensus
Probab=25.03  E-value=57  Score=27.86  Aligned_cols=36  Identities=28%  Similarity=0.465  Sum_probs=29.8

Q ss_pred             HHHhhhCChhHHHHHHHhc--CchHHHHHHHHHHHhhC
Q psy9638          39 SYASYMGHYNMLNLFSINE--NEAKARVRFNFMERMLQ   74 (85)
Q Consensus        39 T~aS~~gH~~lL~Y~aia~--nes~~r~r~~llekM~~   74 (85)
                      |+.+|+.|.+.|+|++--|  -..+.+.|..|||.|++
T Consensus       262 sie~y~~wfn~Lsa~~Atevlk~~kk~~rsamlef~iD  299 (477)
T KOG3541|consen  262 SIERYMSWFNHLSALCATEVLKAAKKQTRSAMLEFLID  299 (477)
T ss_pred             cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            4567899999999998777  34788899999999875


No 21 
>PF07757 AdoMet_MTase:  Predicted AdoMet-dependent methyltransferase;  InterPro: IPR011671 tRNA (uracil-O(2)-)-methyltransferase catalyses the formation of O(2)-methyl-uracil at position 44 (m2U44) in tRNA(Ser) [].; GO: 0008168 methyltransferase activity
Probab=24.83  E-value=52  Score=23.04  Aligned_cols=17  Identities=18%  Similarity=0.264  Sum_probs=15.3

Q ss_pred             chhhhhHHHHHHHHhhc
Q psy9638           2 GDRYNIYSQLEHLQSKY   18 (85)
Q Consensus         2 ~dr~~~~~Qle~Lq~KY   18 (85)
                      -+|-+.|..|..||+||
T Consensus         2 V~~~~yq~~Y~~LK~kY   18 (112)
T PF07757_consen    2 VPKERYQDTYQRLKEKY   18 (112)
T ss_pred             CCHHHHHHHHHHHHHHH
Confidence            36788999999999999


No 22 
>cd00252 SPARC_EC SPARC_EC; extracellular Ca2+ binding domain (containing 2 EF-hand motifs) of SPARC and related proteins (QR1, SC1/hevin, testican and tsc-36/FRP). SPARC (BM-40) is a multifunctional glycoprotein, a matricellular protein, that functions to regulate cell-matrix interactions; binds to such proteins as collagen and vitronectin and binds to endothelial cells thus inhibiting cellular proliferation. The EC domain interacts with a follistatin-like (FS) domain which appears to stabilize Ca2+ binding. The two EF-hands interact canonically but their conserved disulfide bonds confer a tight association between the EF-hand pair and an acid/amphiphilic N-terminal helix. Proposed active form involves a Ca2+ dependent symmetric homodimerization of EC-FS modules.
Probab=23.35  E-value=1.1e+02  Score=20.65  Aligned_cols=19  Identities=26%  Similarity=0.450  Sum_probs=14.1

Q ss_pred             HHHHHHhhhHHHHhh--hCCh
Q psy9638          29 FEWLVNIHRDSYASY--MGHY   47 (85)
Q Consensus        29 ~EW~tn~hRDT~aS~--~gH~   47 (85)
                      -||+.|+++|++--.  -||.
T Consensus        15 ~dW~~~~~~~~~~~~~~~~~~   35 (116)
T cd00252          15 RDWFKNVHEDLKERDELEKHK   35 (116)
T ss_pred             HHHHHHHHHHHhhcccchhhh
Confidence            489999999998633  4454


No 23 
>PF13774 Longin:  Regulated-SNARE-like domain; PDB: 1IOU_A 3BW6_A 1H8M_A 3EGX_C 2NUP_C 3EGD_C 2NUT_C 3KYQ_A 1IFQ_B 2VX8_D ....
Probab=23.10  E-value=19  Score=22.13  Aligned_cols=27  Identities=22%  Similarity=0.421  Sum_probs=22.7

Q ss_pred             hhHHHHHHHhcCchHHHHHHHHHHHhh
Q psy9638          47 YNMLNLFSINENEAKARVRFNFMERML   73 (85)
Q Consensus        47 ~~lL~Y~aia~nes~~r~r~~llekM~   73 (85)
                      .+=+.|++|+..+-+.|+-|.||+.+.
T Consensus        31 ~~~i~~~citd~~~~~r~aF~fL~~i~   57 (83)
T PF13774_consen   31 EDGIAYLCITDKSYPKRVAFAFLEEIK   57 (83)
T ss_dssp             ETTEEEEEEEETTS-HHHHHHHHHHHH
T ss_pred             cCCeEEEEEEcCCCCcchHHHHHHHHH
Confidence            566889999999999999999999875


No 24 
>PRK11188 rrmJ 23S rRNA methyltransferase J; Provisional
Probab=22.99  E-value=65  Score=23.24  Aligned_cols=25  Identities=20%  Similarity=0.432  Sum_probs=18.3

Q ss_pred             CCC-CCchHHHHHHhhhHHHHhhhCC
Q psy9638          22 GHA-DTTKFEWLVNIHRDSYASYMGH   46 (85)
Q Consensus        22 G~~-dtTk~EW~tn~hRDT~aS~~gH   46 (85)
                      |+. ..+...|+.++.||-|.+..-.
T Consensus         3 ~~~~~~~~~~~~~~~~~d~~~~~~~~   28 (209)
T PRK11188          3 GKKRSASSSRWLQEHFSDKYVQQAQK   28 (209)
T ss_pred             ccccccchHHHHHHhhcCHHHHHHhh
Confidence            443 4467889999999988776433


No 25 
>KOG3821|consensus
Probab=21.70  E-value=64  Score=28.14  Aligned_cols=45  Identities=20%  Similarity=0.229  Sum_probs=26.0

Q ss_pred             hhHHHHhhhCChhHHHHHHHhcCchHHHHHHHHHHHhhCCCCCCCCCC
Q psy9638          36 HRDSYASYMGHYNMLNLFSINENEAKARVRFNFMERMLQPCGPPPEKP   83 (85)
Q Consensus        36 hRDT~aS~~gH~~lL~Y~aia~nes~~r~r~~llekM~~pcgppp~~~   83 (85)
                      +++++-|.+.|-+.+--=||-   ....=+-++.+||.|-||+|.+.+
T Consensus       306 g~~~iesvl~~i~v~iseAIm---~~q~N~~~lt~kV~q~Cg~p~~~p  350 (563)
T KOG3821|consen  306 GPFNIESVLLPIHVKISEAIM---AAQENSDKLTAKVFQGCGPPKPTP  350 (563)
T ss_pred             CcchHHHHHhhhhhHHHHHHH---HHHHhhHHHHHHHHhhcCCCCCCc
Confidence            456666666655443211111   111234568899999999997653


No 26 
>TIGR02791 VirB5 P-type DNA transfer protein VirB5. The VirB5 protein is involved in the type IV DNA secretion systems typified by the Agrobacterium Ti plasmid vir system where it interacts with several other proteins essential for proper pilus formation. VirB5 is homologous to the IncN (N-type) conjugation system protein TraC as well as the P-type protein TrbJ and the F-type protein TraE.
Probab=21.43  E-value=1.4e+02  Score=22.01  Aligned_cols=19  Identities=26%  Similarity=0.405  Sum_probs=13.0

Q ss_pred             HhhhHHHHhhhCChhHHHH
Q psy9638          34 NIHRDSYASYMGHYNMLNL   52 (85)
Q Consensus        34 n~hRDT~aS~~gH~~lL~Y   52 (85)
                      ++.++.|+|+-|++++-..
T Consensus        58 ~q~k~~y~sltG~r~~g~l   76 (220)
T TIGR02791        58 EQYKQQYGSLTGNRGMGDL   76 (220)
T ss_pred             HHHHHHHHHHhCCCcHHHH
Confidence            6677777777777765443


No 27 
>PF00284 Cytochrom_B559a:  Lumenal portion of Cytochrome b559, alpha (gene psbE) subunit family.;  InterPro: IPR013082 Oxygenic photosynthesis uses two multi-subunit photosystems (I and II) located in the cell membranes of cyanobacteria and in the thylakoid membranes of chloroplasts in plants and algae. Photosystem II (PSII) has a P680 reaction centre containing chlorophyll 'a' that uses light energy to carry out the oxidation (splitting) of water molecules, and to produce ATP via a proton pump. Photosystem I (PSI) has a P700 reaction centre containing chlorophyll that takes the electron and associated hydrogen donated from PSII to reduce NADP+ to NADPH. Both ATP and NADPH are subsequently used in the light-independent reactions to convert carbon dioxide to glucose using the hydrogen atom extracted from water by PSII, releasing oxygen as a by-product. PSII is a multisubunit protein-pigment complex containing polypeptides both intrinsic and extrinsic to the photosynthetic membrane [, ]. Within the core of the complex, the chlorophyll and beta-carotene pigments are mainly bound to the antenna proteins CP43 (PsbC) and CP47 (PsbB), which pass the excitation energy on to the reaction centre proteins D1 (Qb, PsbA) and D2 (Qa, PsbD) that bind all the redox-active cofactors involved in the energy conversion process. The PSII oxygen-evolving complex (OEC) oxidises water to provide protons for use by PSI, and consists of OEE1 (PsbO), OEE2 (PsbP) and OEE3 (PsbQ). The remaining subunits in PSII are of low molecular weight (less than 10 kDa), and are involved in PSII assembly, stabilisation, dimerisation, and photo-protection [].  The alpha subunit (PsbE) of cytochrome b559, forms a haem-binding heterodimer with the beta subunit (PsbF) (IPR006241 from INTERPRO) within the reaction centre core of PSII. Both PsbE and PsbF are essential components for PSII assembly, and are probably involved in secondary electron transport mechanisms that help to protect PSII from photo-damage []. This domain occurs in the lumenal region of the alpha subunit. It is usually found in conjuction with an N-terminal domain (IPR013081 from INTERPRO).; GO: 0046872 metal ion binding, 0015979 photosynthesis, 0009523 photosystem II, 0016021 integral to membrane; PDB: 1W5C_K 1S5L_e 3BZ2_E 3PRQ_E 2AXT_E 4FBY_R 3PRR_E 3BZ1_E 3KZI_E 3A0H_E ....
Probab=20.01  E-value=66  Score=18.85  Aligned_cols=13  Identities=38%  Similarity=0.675  Sum_probs=10.0

Q ss_pred             chhhhhHHHHHHH
Q psy9638           2 GDRYNIYSQLEHL   14 (85)
Q Consensus         2 ~dr~~~~~Qle~L   14 (85)
                      .|||+..+|++.+
T Consensus        26 ~dRf~a~~qld~~   38 (40)
T PF00284_consen   26 TDRFNAKQQLDEF   38 (40)
T ss_dssp             --STTHHHHHHHH
T ss_pred             ccccCHHHHHHHh
Confidence            6899999999876


Done!