Query         psy8172
Match_columns 98
No_of_seqs    26 out of 28
Neff          2.4 
Searched_HMMs 46136
Date          Fri Aug 16 20:28:08 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy8172.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/8172hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 smart00391 MBD Methyl-CpG bind  99.5 2.5E-14 5.5E-19   94.4   4.7   56   20-82      8-73  (77)
  2 cd01397 HAT_MBD Methyl-CpG bin  98.7 1.5E-08 3.2E-13   67.9   4.2   54   20-80      6-68  (73)
  3 PF01429 MBD:  Methyl-CpG bindi  98.6   6E-08 1.3E-12   63.0   4.3   54   20-79     11-74  (77)
  4 cd00122 MBD MeCP2, MBD1, MBD2,  98.6 7.3E-08 1.6E-12   60.4   4.1   36   21-56      7-51  (62)
  5 cd01396 MeCP2_MBD MeCP2, MBD1,  98.2 1.8E-06 3.8E-11   57.2   4.0   51   20-77      7-66  (77)
  6 cd01395 HMT_MBD Methyl-CpG bin  97.9 1.1E-05 2.5E-10   52.2   3.0   34   22-55      8-49  (60)
  7 KOG4161|consensus               93.0   0.061 1.3E-06   42.7   1.9   39   19-57     18-66  (272)
  8 PF12043 DUF3527:  Domain of un  78.5     1.8 3.9E-05   36.3   2.5   47   12-68    227-276 (346)
  9 PF12104 Tcell_CD4_Cterm:  T ce  68.5     2.8 6.1E-05   24.3   1.0   19   43-61      5-23  (28)
 10 cd03142 GATase1_ThuA Type 1 gl  62.4     7.9 0.00017   29.8   2.7   34   20-53    173-209 (215)
 11 PF07624 PSD2:  Protein of unkn  60.0     8.4 0.00018   24.7   2.1   19   37-55      1-19  (76)
 12 PF06085 Rz1:  Lipoprotein Rz1   59.1       4 8.8E-05   25.3   0.5   29    9-40     10-39  (40)
 13 cd03078 GST_N_Metaxin1_like GS  57.4      14 0.00031   23.2   2.8   28   29-56     45-73  (73)
 14 PF06283 ThuA:  Trehalose utili  53.1     7.5 0.00016   28.1   1.2   26   14-39    169-195 (217)
 15 cd00525 AE_Prim_S_like AE_Prim  48.6      31 0.00067   22.8   3.5   38   41-78     22-65  (136)
 16 KOG3859|consensus               44.5      13 0.00028   32.1   1.4   17   33-49    153-169 (406)
 17 PF01473 CW_binding_1:  Putativ  42.7      20 0.00042   17.6   1.4   18   22-40      1-18  (19)
 18 cd01774 Faf1_like2_UBX Faf1 ik  42.5      58  0.0013   21.5   4.1   44   35-84     10-61  (85)
 19 PF12677 DUF3797:  Domain of un  37.2      14  0.0003   23.7   0.4    7   56-62     40-46  (49)
 20 PF08671 SinI:  Anti-repressor   33.9      31 0.00068   19.7   1.5   11   44-54     18-28  (30)
 21 PF10568 Tom37:  Outer mitochon  33.5      40 0.00087   21.6   2.2   38   16-53     33-72  (72)
 22 PF10379 nec1:  Virulence prote  33.0      24 0.00051   27.6   1.2   20   69-88    156-175 (184)
 23 smart00504 Ubox Modified RING   32.9      51  0.0011   19.1   2.4   25   33-58     14-38  (63)
 24 COG3828 Uncharacterized protei  32.3      31 0.00068   28.1   1.8   30   14-43    167-196 (239)
 25 PF00397 WW:  WW domain;  Inter  31.6      40 0.00087   18.2   1.7   18   21-38      2-22  (31)
 26 smart00034 CLECT C-type lectin  31.2      66  0.0014   19.4   2.7   31   21-56      2-33  (126)
 27 PF10281 Ish1:  Putative stress  31.0      44 0.00096   18.9   1.8   14   43-56      4-17  (38)
 28 PF11307 DUF3109:  Protein of u  30.8      12 0.00025   29.3  -0.8   12   65-76    126-137 (183)
 29 cd00173 SH2 Src homology 2 dom  30.7   1E+02  0.0022   18.7   3.6   33   33-66     58-90  (94)
 30 PHA01635 hypothetical protein   30.7 1.3E+02  0.0029   24.5   5.0   61   13-76    125-192 (231)
 31 cd00246 RabGEF Nucleotide exch  30.2      30 0.00064   24.9   1.2   37    1-37     54-97  (103)
 32 PF13409 GST_N_2:  Glutathione   29.8      40 0.00086   20.3   1.6   31   25-55     38-70  (70)
 33 smart00456 WW Domain with 2 co  29.6      37 0.00081   17.7   1.3   18   20-37      1-20  (32)
 34 cd03593 CLECT_NK_receptors_lik  26.6 1.1E+02  0.0025   19.0   3.3   31   21-56      2-33  (116)
 35 COG3139 Uncharacterized protei  25.5      13 0.00029   26.4  -1.2   22   18-44      9-35  (90)
 36 PF02943 FeThRed_B:  Ferredoxin  24.3      30 0.00064   24.6   0.3   11   54-64     77-87  (108)
 37 TIGR01204 bioW 6-carboxyhexano  23.8 1.2E+02  0.0025   24.5   3.6   34   18-55    184-227 (232)
 38 PF00799 Gemini_AL1:  Geminivir  22.6      48   0.001   23.5   1.1   23   35-57     81-103 (114)
 39 PF13417 GST_N_3:  Glutathione   22.1      47   0.001   20.0   0.9   31   24-54     38-69  (75)
 40 PF03128 CXCXC:  CXCXC repeat;   21.8      38 0.00083   16.3   0.4    8   54-61      5-12  (14)

No 1  
>smart00391 MBD Methyl-CpG binding domain. Methyl-CpG binding domain, also known as the TAM (TTF-IIP5, ARBP, MeCP1) domain
Probab=99.50  E-value=2.5e-14  Score=94.38  Aligned_cols=56  Identities=23%  Similarity=0.398  Sum_probs=50.8

Q ss_pred             ecccceeeec---------cce-eEeCCCccccccHHHHHHHHhhCceeccCCCccccccceeecCCCCCccc
Q psy8172          20 TTTDYTSEVG---------FWK-QTTRPSNTALSSMEQVKAYLQTQGTCKCGLECPLQCDSVFSFDAKPRERA   82 (98)
Q Consensus        20 vp~gw~r~v~---------k~~-~YiSPsgt~L~S~eqv~~YL~t~GTCKCGLeCPl~~~~vFnFdp~v~~~P   82 (98)
                      +|.||+|++-         ++- +|+||+|..|+|..||..||...|.|+++++|       |+|++.++.-+
T Consensus         8 lp~GW~R~~~~r~~g~~~~~~dV~Y~sP~GkklRs~~ev~~YL~~~~~~~~~~~~-------F~F~~~~~~~~   73 (77)
T smart00391        8 LPCGWRRETKQRKSGRSAGKFDVYYISPCGKKLRSKSELARYLHKNGDLSLDLEC-------FDFNATVPVGP   73 (77)
T ss_pred             CCCCcEEEEEEecCCCCCCcccEEEECCCCCeeeCHHHHHHHHHhCCCccccccc-------ccCcCCccccc
Confidence            6899999986         333 89999999999999999999999999999998       99999998754


No 2  
>cd01397 HAT_MBD Methyl-CpG binding domains (MBD) present in putative chromatin remodelling factor such as BAZ2A; BAZ2A contains a MBD, DDT, PHD-type zinc finger and Bromo domain suggesting that BAZ2A might be associated with histone acetyltransferase (HAT) activity. The Drosophila melanogaster toutatis protein, a putative subunit of the chromatin-remodeling complex, and other such proteins in this group share a similar domain architecture with BAZ2A, as does the Caenorhabditis elegans flectin homolog.
Probab=98.73  E-value=1.5e-08  Score=67.86  Aligned_cols=54  Identities=24%  Similarity=0.330  Sum_probs=42.8

Q ss_pred             ecccceeee------ccce---eEeCCCccccccHHHHHHHHhhCceeccCCCccccccceeecCCCCCc
Q psy8172          20 TTTDYTSEV------GFWK---QTTRPSNTALSSMEQVKAYLQTQGTCKCGLECPLQCDSVFSFDAKPRE   80 (98)
Q Consensus        20 vp~gw~r~v------~k~~---~YiSPsgt~L~S~eqv~~YL~t~GTCKCGLeCPl~~~~vFnFdp~v~~   80 (98)
                      .++||+|++      .+.+   +|.||+|..|+|..||..||...+      .|-|..| -|.|+++++.
T Consensus         6 l~~GW~Re~vir~~~~~~~~dV~Y~aPcGKklRs~~ev~~yL~~~~------~~~Lt~d-nFsF~~~~~v   68 (73)
T cd01397           6 LELGWRRETRIRGLGGRIQGEVAYYAPCGKKLRQYPEVIKYLSKNG------ISLLSRE-NFSFSARAPV   68 (73)
T ss_pred             CCCCceeEEEeccCCCCccceEEEECCCCcccccHHHHHHHHHhCC------ccCccHh-HccccCCccc
Confidence            469999998      2333   899999999999999999999876      2344444 5899988765


No 3  
>PF01429 MBD:  Methyl-CpG binding domain;  InterPro: IPR001739 Methylation at CpG dinucleotide, the most common DNA modification in eukaryotes, has been correlated with gene silencing associated with various phenomena such as genomic imprinting, transposon and chromosome X inactivation, differentiation, and cancer. Effects of DNA methylation are mediated through proteins which bind to symmetrically methylated CpGs. Such proteins contain a specific domain of ~70 residues, the methyl-CpG-binding domain (MBD), which is linked to additional domains associated with chromatin, such as the bromodomain, the AT hook motif,the SET domain, or the PHD finger. MBD-containing proteins appear to act as structural proteins, which recruit a variety of histone deacetylase (HDAC) complexes and chromatin remodelling factors, leading to chromatin compaction and, consequently, to transcriptional repression. The MBD of MeCP2, MBD1, MBD2, MBD4 and BAZ2 mediates binding to DNA, in case of MeCP2, MBD1 and MBD2 preferentially to methylated CpG. In case of human MBD3 and SETDB1 the MBD has been shown to mediate protein-protein interactions [, ]. The MBD folds into an alpha/beta sandwich structure comprising a layer of twisted beta sheet, backed by another layer formed by the alpha1 helix and a hairpin loop at the C terminus. These layers are both amphipathic, with the alpha1 helix and the beta sheet lying parallel and the hydrophobic faces tightly packed against each other. The beta sheet is composed of two long inner strands (beta2 and beta3) sandwiched by two shorter outer strands (beta1 and beta4) [].; GO: 0003677 DNA binding, 0005634 nucleus; PDB: 2KY8_A 1UB1_A 1D9N_A 1IG4_A 1QK9_A 3C2I_A.
Probab=98.61  E-value=6e-08  Score=63.00  Aligned_cols=54  Identities=20%  Similarity=0.149  Sum_probs=38.7

Q ss_pred             ecccceeeec--c-----ce---eEeCCCccccccHHHHHHHHhhCceeccCCCccccccceeecCCCCC
Q psy8172          20 TTTDYTSEVG--F-----WK---QTTRPSNTALSSMEQVKAYLQTQGTCKCGLECPLQCDSVFSFDAKPR   79 (98)
Q Consensus        20 vp~gw~r~v~--k-----~~---~YiSPsgt~L~S~eqv~~YL~t~GTCKCGLeCPl~~~~vFnFdp~v~   79 (98)
                      +|.||+|++-  +     ++   +|+||+|..++|+.||..||...+-      +...--.-|.|++...
T Consensus        11 Lp~GW~re~~~R~~g~~~~~~dv~Y~sP~Gk~~RS~~eV~~yL~~~~~------~~~l~~~~F~F~~~~~   74 (77)
T PF01429_consen   11 LPDGWKREVVVRKSGSSAGKKDVYYYSPCGKRFRSKKEVVRYLKENPS------EHDLKPENFSFSKRLI   74 (77)
T ss_dssp             STTT-EEEEEESSSSTTTTSEEEEEEETTSEEESSHHHHHHHHTTSS---------SS-CTTBBTTTTB-
T ss_pred             CCCCCEEEEEEecCCCcCCceEEEEECCCCCEEeCHHHHHHHHHhCCC------cccCCHhHCCCCCCcc
Confidence            6899999876  1     23   9999999999999999999999876      2222334577777654


No 4  
>cd00122 MBD MeCP2, MBD1, MBD2, MBD3, MBD4, CLLD8-like, and BAZ2A-like proteins constitute a family of proteins that share the methyl-CpG-binding domain (MBD). The MBD consists of about 70 residues and is defined as the minimal region required for binding to methylated DNA by a methyl-CpG-binding protein which binds specifically to methylated DNA. The MBD can recognize a single symmetrically methylated CpG either as naked DNA or within chromatin.  MeCP2, MBD1 and MBD2 (and likely MBD3) form complexes with histone deacetylase and are involved in histone deacetylase-dependent repression of transcription. MBD4 is an endonuclease that forms a complex with the DNA mismatch-repair protein MLH1. The MBDs present in putative chromatin remodelling subunit, BAZ2A, and putative histone methyltransferase, CLLD8, represent two phylogenetically distinct groups within the MBD protein family.
Probab=98.59  E-value=7.3e-08  Score=60.42  Aligned_cols=36  Identities=25%  Similarity=0.345  Sum_probs=32.1

Q ss_pred             cccceeeecc------ce---eEeCCCccccccHHHHHHHHhhCc
Q psy8172          21 TTDYTSEVGF------WK---QTTRPSNTALSSMEQVKAYLQTQG   56 (98)
Q Consensus        21 p~gw~r~v~k------~~---~YiSPsgt~L~S~eqv~~YL~t~G   56 (98)
                      |.||+|++-.      ++   +|+||.|..|+|+.||..||...|
T Consensus         7 p~GW~R~~~~r~~g~~~k~dv~Y~sP~Gk~~Rs~~ev~~yL~~~~   51 (62)
T cd00122           7 PPGWKRELVIRKSGSAGKGDVYYYSPCGKKLRSKPEVARYLEKTG   51 (62)
T ss_pred             CCCeEEEEEEcCCCCCCcceEEEECCCCceecCHHHHHHHHHhCC
Confidence            8999998742      33   899999999999999999999988


No 5  
>cd01396 MeCP2_MBD MeCP2, MBD1, MBD2, MBD3, and MBD4 are members of a protein family that share the methyl-CpG-binding domain (MBD). The MBD, consists of about 70 residues and is defined as the minimal region required for binding to methylated DNA by a methyl-CpG-binding protein which binds specifically to methylated DNA. The MBD can recognize a single symmetrically methylated CpG either as naked DNA or within chromatin.  MeCP2, MBD1 and MBD2 (and likely MBD3) form complexes with histone deacetylase and are involved in histone deacetylase-dependent repression of transcription. MBD4 is an endonuclease that forms a complex with the DNA mismatch-repair protein MLH1.
Probab=98.22  E-value=1.8e-06  Score=57.19  Aligned_cols=51  Identities=22%  Similarity=0.299  Sum_probs=39.1

Q ss_pred             ecccceeeecc------ce---eEeCCCccccccHHHHHHHHhhCceeccCCCccccccceeecCCC
Q psy8172          20 TTTDYTSEVGF------WK---QTTRPSNTALSSMEQVKAYLQTQGTCKCGLECPLQCDSVFSFDAK   77 (98)
Q Consensus        20 vp~gw~r~v~k------~~---~YiSPsgt~L~S~eqv~~YL~t~GTCKCGLeCPl~~~~vFnFdp~   77 (98)
                      .|+||+|++-.      ++   +|+||.|..++|.-||..||...+      ++-+.+ .-|+|.+.
T Consensus         7 lp~GW~r~~~~R~~gs~~k~DvyY~sP~Gkk~RS~~ev~~yL~~~~------~~~~~~-~~FdF~~~   66 (77)
T cd01396           7 LPPGWKRELVPRKSGSAGKFDVYYISPTGKKFRSKVELARYLEKNG------PTSLDL-SDFDFTVP   66 (77)
T ss_pred             CCCCCEEEEEEecCCCCCcceEEEECCCCCEEECHHHHHHHHHhCC------CCCCcH-hHcccCCC
Confidence            79999998721      22   899999999999999999999987      223333 35777764


No 6  
>cd01395 HMT_MBD Methyl-CpG binding domains (MBD) present in putative histone methyltransferases (HMT) such as CLLD8 and SETDB1 proteins; CLLD8 contains a MBD, a PreSET and a bifurcated SET domain, suggesting that CLLD8 might be associated with methylation-mediated transcriptional repression. SETDB1 and other proteins in this group have a similar domain architecture. SETDB1 is a novel KAP-1-associated histone H3, lysine 9-specific methyltransferase that contributes to HP1-mediated silencing of euchromatic genes by KRAB zinc-finger proteins.
Probab=97.89  E-value=1.1e-05  Score=52.20  Aligned_cols=34  Identities=18%  Similarity=0.246  Sum_probs=30.6

Q ss_pred             ccceeeeccce--------eEeCCCccccccHHHHHHHHhhC
Q psy8172          22 TDYTSEVGFWK--------QTTRPSNTALSSMEQVKAYLQTQ   55 (98)
Q Consensus        22 ~gw~r~v~k~~--------~YiSPsgt~L~S~eqv~~YL~t~   55 (98)
                      .||+|+..+.+        +|.+|.|..|+++.||..||...
T Consensus         8 ~gw~R~~~~~~~~~~k~~V~Y~aPCGr~Lr~~~EV~~YL~~t   49 (60)
T cd01395           8 CGFQRMKYRARVGKVKKHVIYKAPCGRSLRNMSEVHRYLRET   49 (60)
T ss_pred             cCeEEEEEeccCCCcccceEEECCcchhhhcHHHHHHHHHhc
Confidence            59999986655        89999999999999999999876


No 7  
>KOG4161|consensus
Probab=92.98  E-value=0.061  Score=42.69  Aligned_cols=39  Identities=26%  Similarity=0.288  Sum_probs=33.0

Q ss_pred             Eecccceeee---------ccce-eEeCCCccccccHHHHHHHHhhCce
Q psy8172          19 LTTTDYTSEV---------GFWK-QTTRPSNTALSSMEQVKAYLQTQGT   57 (98)
Q Consensus        19 ~vp~gw~r~v---------~k~~-~YiSPsgt~L~S~eqv~~YL~t~GT   57 (98)
                      .+|-||+|.+         +++- +|+||+|..++|--+.+.||-..|+
T Consensus        18 ~lp~GW~~~~~~r~~~~~~g~~dv~~~sp~g~~frsk~~l~~~~~~~~~   66 (272)
T KOG4161|consen   18 ALPPGWTREEVQRSSGLSAGKSDVYYISPSGKKFRSKPQLARYLGKVGD   66 (272)
T ss_pred             CCCCCcchhhhcccCCCcccccceEEeCCcccccccccHHHHHhccccc
Confidence            5799999874         3444 9999999999999999999987765


No 8  
>PF12043 DUF3527:  Domain of unknown function (DUF3527);  InterPro: IPR021916  This presumed domain is functionally uncharacterised. This domain is found in eukaryotes. This domain is about 120 amino acids in length. This domain has a conserved CDCGGWD sequence motif. 
Probab=78.55  E-value=1.8  Score=36.29  Aligned_cols=47  Identities=23%  Similarity=0.487  Sum_probs=35.4

Q ss_pred             CCCCCeeEecccceeeeccceeEeCCCccccccHHHHHHHHhhCceeccC---CCccccc
Q psy8172          12 PDPKSTVLTTTDYTSEVGFWKQTTRPSNTALSSMEQVKAYLQTQGTCKCG---LECPLQC   68 (98)
Q Consensus        12 p~p~s~v~vp~gw~r~v~k~~~YiSPsgt~L~S~eqv~~YL~t~GTCKCG---LeCPl~~   68 (98)
                      .+-+.+|++|.|+-.+-..+-  -.||        -+.+=..+.|-|-||   |-|||.+
T Consensus       227 ~~~s~tVvlP~G~Hg~P~~g~--~~PS--------~LI~RWrSGG~CDCGGWDmgC~L~V  276 (346)
T PF12043_consen  227 SSCSTTVVLPSGVHGLPNTGN--GGPS--------SLIDRWRSGGSCDCGGWDMGCPLRV  276 (346)
T ss_pred             CCccEEEEecCCccCCCCCCC--CCCh--------HHHHHhhhCCCcCCCCcccCCCeEE
Confidence            345678999999998876651  1233        356778899999998   8999975


No 9  
>PF12104 Tcell_CD4_Cterm:  T cell CD4 receptor C terminal region;  InterPro: IPR021963  This domain is the C-terminal domain of the CD4 T cell receptor. The C-terminal domain is the cytoplasmic domain which relays the signal for T cell activation. This process involves co-receptor internalisation. This domain is involved in binding to the N-terminal of Lck co-receptor in a Zn2+ clasp structure. ; PDB: 2JKR_Q 3B71_D 1Q68_A 2JKT_Q 2KLU_A.
Probab=68.48  E-value=2.8  Score=24.34  Aligned_cols=19  Identities=37%  Similarity=0.751  Sum_probs=13.3

Q ss_pred             ccHHHHHHHHhhCceeccC
Q psy8172          43 SSMEQVKAYLQTQGTCKCG   61 (98)
Q Consensus        43 ~S~eqv~~YL~t~GTCKCG   61 (98)
                      .-+.|++.||.---||.|.
T Consensus         5 ~Rmsqikr~LsekkTCqC~   23 (28)
T PF12104_consen    5 ERMSQIKRLLSEKKTCQCP   23 (28)
T ss_dssp             TCHHCHHHHHHTTSS----
T ss_pred             HHHHHHHHHHhhcccCCCc
Confidence            3478999999999999985


No 10 
>cd03142 GATase1_ThuA Type 1 glutamine amidotransferase (GATase1)-like domain found in Sinorhizobium meliloti Rm1021 ThuA (SmThuA). Type 1 glutamine amidotransferase (GATase1)-like domain found in Sinorhizobium meliloti Rm1021 ThuA (SmThuA).  This group includes proteins similar to SmThuA which plays a role in a major pathway for trehalose catabolism. SmThuA is induced by trehalose but not by related structurally similar disaccharides like sucrose or maltose. Proteins in this group lack the catalytic triad of typical GATase1 domains:  a His replaces the reactive Cys found in the sharp turn between a beta strand and an alpha helix termed the nucleophile elbow. S. meliloti Rm1021 thuA mutants are impaired in competitive colonization of Medicago sativa roots but are more competitive than the wild-type Rml021 in infecting alfalfa roots and forming nitrogen-fixing nodules.
Probab=62.38  E-value=7.9  Score=29.76  Aligned_cols=34  Identities=15%  Similarity=0.100  Sum_probs=24.6

Q ss_pred             ecccceeeeccce-eEeCCCccccc--cHHHHHHHHh
Q psy8172          20 TTTDYTSEVGFWK-QTTRPSNTALS--SMEQVKAYLQ   53 (98)
Q Consensus        20 vp~gw~r~v~k~~-~YiSPsgt~L~--S~eqv~~YL~   53 (98)
                      -|.+|++..|+|| .|.+|-.+.-.  .-.++++-|.
T Consensus       173 ~p~~W~~~~G~GRVFyta~GH~~~~~~~~p~~~~ll~  209 (215)
T cd03142         173 FRSGCTYRRGAGRIFYFSPGHETYPVYHHPDVQKVLA  209 (215)
T ss_pred             ceEEEEEEECCcCEEEECCCCCCcccccCHhHHHHHH
Confidence            4899999999999 88888775422  3355555443


No 11 
>PF07624 PSD2:  Protein of unknown function (DUF1585);  InterPro: IPR011478 This entry represents a conserved region at the C terminus of a family of cytochrome-like proteins found in bacteria such as Rhodopirellula baltica and Solibacter usitatus. These proteins also contain IPR013036 from INTERPRO, IPR013039 from INTERPRO, IPR013042 from INTERPRO and IPR013043 from INTERPRO.
Probab=59.96  E-value=8.4  Score=24.74  Aligned_cols=19  Identities=26%  Similarity=0.588  Sum_probs=17.9

Q ss_pred             CCccccccHHHHHHHHhhC
Q psy8172          37 PSNTALSSMEQVKAYLQTQ   55 (98)
Q Consensus        37 Psgt~L~S~eqv~~YL~t~   55 (98)
                      |+|+...+..++++||..+
T Consensus         1 pdG~~f~~~~eLk~~L~~~   19 (76)
T PF07624_consen    1 PDGTSFEGAAELKQYLAER   19 (76)
T ss_pred             CCCCccCCHHHHHHHHHHC
Confidence            8999999999999999876


No 12 
>PF06085 Rz1:  Lipoprotein Rz1 precursor;  InterPro: IPR010346 This family consists of several bacteria and phage lipoprotein Rz1 precursors. Rz1 is a proline-rich lipoprotein from bacteriophage lambda, which is known to have fusogenic properties. Rz1-induced liposome fusion is thought to be mediated primarily by the generation of local perturbation in the bilayer lipid membrane and to a lesser extent by electrostatic forces [].; GO: 0019064 viral envelope fusion with host membrane, 0019867 outer membrane
Probab=59.05  E-value=4  Score=25.26  Aligned_cols=29  Identities=17%  Similarity=0.313  Sum_probs=22.3

Q ss_pred             CCCCCCCCeeEec-ccceeeeccceeEeCCCcc
Q psy8172           9 LQPPDPKSTVLTT-TDYTSEVGFWKQTTRPSNT   40 (98)
Q Consensus         9 ~~~p~p~s~v~vp-~gw~r~v~k~~~YiSPsgt   40 (98)
                      ++||+|..=+|-| -+|+.+.+.=   ||+|++
T Consensus        10 ~kpppPpAW~M~p~~n~~~~Ld~i---is~S~~   39 (40)
T PF06085_consen   10 VKPPPPPAWAMQPPPNWQQLLDGI---ISVSET   39 (40)
T ss_pred             cCCCCCchhhccCChhhHHHhhce---ecccCC
Confidence            5667776666666 7999999887   888875


No 13 
>cd03078 GST_N_Metaxin1_like GST_N family, Metaxin subfamily, Metaxin 1-like proteins; composed of metaxins 1 and 3, and similar proteins including Tom37 from fungi. Mammalian metaxin (or metaxin 1) and the fungal protein Tom37 are components of preprotein import complexes of the mitochondrial outer membrane. Metaxin extends to the cytosol and is anchored to the mitochondrial membrane through its C-terminal domain. In mice, metaxin is required for embryonic development. Like the murine gene, the human metaxin gene is located downstream to the glucocerebrosidase (GBA) pseudogene and is convergently transcribed. Inherited deficiency of GBA results in Gaucher disease, which presents many diverse clinical phenotypes. Alterations in the metaxin gene, in addition to GBA mutations, may be associated with Gaucher disease. Genome sequencing shows that a third metaxin gene also exists in zebrafish, Xenopus, chicken and mammals.
Probab=57.44  E-value=14  Score=23.16  Aligned_cols=28  Identities=32%  Similarity=0.345  Sum_probs=24.0

Q ss_pred             ccce-eEeCCCccccccHHHHHHHHhhCc
Q psy8172          29 GFWK-QTTRPSNTALSSMEQVKAYLQTQG   56 (98)
Q Consensus        29 ~k~~-~YiSPsgt~L~S~eqv~~YL~t~G   56 (98)
                      .+|+ +||-.+|+.+...+.|.+||...|
T Consensus        45 p~gkLP~l~~~~~~i~d~~~Ii~~L~~~~   73 (73)
T cd03078          45 PTGKLPALLTSGTKISGPEKIIEYLRKQG   73 (73)
T ss_pred             CCCccCEEEECCEEecChHHHHHHHHHcC
Confidence            4677 788889999999999999998654


No 14 
>PF06283 ThuA:  Trehalose utilisation; PDB: 4E5V_A 1T0B_A.
Probab=53.05  E-value=7.5  Score=28.10  Aligned_cols=26  Identities=15%  Similarity=-0.041  Sum_probs=18.0

Q ss_pred             CCCeeEecccceeeeccce-eEeCCCc
Q psy8172          14 PKSTVLTTTDYTSEVGFWK-QTTRPSN   39 (98)
Q Consensus        14 p~s~v~vp~gw~r~v~k~~-~YiSPsg   39 (98)
                      +..-...|..|.+.+||+| .|.++-.
T Consensus       169 ~~~~~~~Pv~W~~~~GkGRvf~~~lGH  195 (217)
T PF06283_consen  169 GGEGGDHPVAWTREYGKGRVFYTTLGH  195 (217)
T ss_dssp             TTTSSEEEEEEEEECTTEEEEEE----
T ss_pred             ccCCCeEEEEEEEEeCCeeEEEECCCC
Confidence            3555667889999999999 7777744


No 15 
>cd00525 AE_Prim_S_like AE_Prim_S_like: primase domain similar to that found in the small subunit of archaeal and eukaryotic (A/E) DNA primases. The replication machineries of A/Es are distinct from that of bacteria. Primases are DNA-dependent RNA polymerases which synthesis the short RNA primers required for DNA replication. In eukaryotes, this small catalytically active primase subunit (p50) and a larger primase subunit (p60), referred to jointly as the core primase, associate with the B subunit and the DNA polymerase alpha subunit in a complex, called Pol alpha-pri. In addition to its catalytic role in replication, eukaryotic DNA primase may play a role in coupling replication to DNA damage repair and in checkpoint control during S phase. Pfu41 and Pfu46 comprise the primase complex of the archaea Pyrococcus furiosus; these proteins have sequence identity to the eukaryotic p50 and p60 primase proteins respectively. Pfu41 preferentially uses dNTPs as substrate. Pfu46 regulates the pri
Probab=48.57  E-value=31  Score=22.80  Aligned_cols=38  Identities=21%  Similarity=0.194  Sum_probs=32.5

Q ss_pred             ccccHHHHHHHHhhCceeccCCCccc------cccceeecCCCC
Q psy8172          41 ALSSMEQVKAYLQTQGTCKCGLECPL------QCDSVFSFDAKP   78 (98)
Q Consensus        41 ~L~S~eqv~~YL~t~GTCKCGLeCPl------~~~~vFnFdp~v   78 (98)
                      +..+.+|+++|+...+.+--|+.|..      .-.-||++|+..
T Consensus        22 ~~~d~~~~~~~~~~~~~~~i~i~~~~~~~~~~~~~iv~DiD~~~   65 (136)
T cd00525          22 ATTDDAEILAWLANLPPGNIGLSLGRYDKLWKPDLLVFDLDPDD   65 (136)
T ss_pred             cccCcHHHHHHHHhcCCeEEEcCccccCccCCCCEEEEECCCCC
Confidence            67899999999999999888988887      467788888764


No 16 
>KOG3859|consensus
Probab=44.50  E-value=13  Score=32.12  Aligned_cols=17  Identities=24%  Similarity=0.380  Sum_probs=15.5

Q ss_pred             eEeCCCccccccHHHHH
Q psy8172          33 QTTRPSNTALSSMEQVK   49 (98)
Q Consensus        33 ~YiSPsgt~L~S~eqv~   49 (98)
                      |||||+|+-|.|++.|-
T Consensus       153 YFI~PTGH~LKslDLvt  169 (406)
T KOG3859|consen  153 YFISPTGHSLKSLDLVT  169 (406)
T ss_pred             EEecCCCcchhHHHHHH
Confidence            89999999999999874


No 17 
>PF01473 CW_binding_1:  Putative cell wall binding repeat;  InterPro: IPR018337 The cell wall-binding repeat (CW) is an about 20 amino acid residue module, essentially found in two bacterial Gram-positive protein families; the choline binding proteins and glucosyltransferases (2.4.1.5 from EC). In choline-binding proteins cell wall binding repeats bind to choline moieties of both teichoic and lipoteichoic acids, two components peculiar to the cell surface of Gram-positive bacteria [, ]. In glucosyltransferases the region spanning the CW repeats is a glucan binding domain []. Several crystal structures of CW have been solved [, ]. In the choline binding protein LytA, the repeats adopt a solenoid fold consisting exclusively of beta-hairpins that stack to form a left-handed superhelix with a boomerang-like shape. The choline groups bind between beta-hairpin 'steps' of the superhelix []. In Cpl-1 CW repeats assemble in two sub-domains: an N-terminal superhelical moiety similar to the LytA one and a C-terminal beta-sheet involved in interactions with the lysozyme domain. Choline is bound between repeats 1 and 2, and, 2 and 3 of the superhelical sub-domain []. Some proteins known to contain cell-wall binding repeats include:  Pneumococcal N-acetylmuramoyl-L-alanine amidase (autolysin, lytA) (3.5.1.28 from EC). It is a surface-exposed enzyme that rules the self-destruction of pneumococcal cells through degradation of their peptidoglycan backbone. It mediates the release of toxic substances that damage the host tissues. Pneumococcal endo-beta-N-acetylglucosaminidase (lytB) (3.2.1.96 from EC). It plays an important role in cell wall degradation and cell separation. Pneumococcal teichoic acid phosphorylcholine esterase (pce or cbpE), a cell wall hydrolase important for cellular adhesion and colonisation. Lactobacillales glucosyltransferase. It catalyses the transfer of glucosyl units from the cleavage of sucrose to a growing chain of glucan.  Clostridium difficile toxin A (tcdA) and toxin B (tcdb). They are the causative agents of the antibiotic-associated pseudomembranous colitis. They are intracellular acting toxins that reach their targets after receptor-mediated endocytosis.  Clostridium acetobutylicum cspA protein. Siphoviridae bacteriophages N-acetylmuramoyl-L-alanine amidase. It lyses the bacterial host cell wall. Podoviridae lysozyme protein (cpl-1). It is capable of digesting the pneumococcal cell wall.  The cell wall binding repeats are also known as the choline-binding repeats (ChBr) or the choline-binding domain (ChBD). ; PDB: 1GVM_C 2BML_B 1HCX_A 1OBA_A 1H09_A 2J8F_A 2IXU_A 2J8G_A 2IXV_A 2X8O_A ....
Probab=42.68  E-value=20  Score=17.65  Aligned_cols=18  Identities=11%  Similarity=0.305  Sum_probs=12.6

Q ss_pred             ccceeeeccceeEeCCCcc
Q psy8172          22 TDYTSEVGFWKQTTRPSNT   40 (98)
Q Consensus        22 ~gw~r~v~k~~~YiSPsgt   40 (98)
                      +||..+-++| +|+..+|.
T Consensus         1 n~W~~~~~~w-Yy~~~~G~   18 (19)
T PF01473_consen    1 NGWVQDNGNW-YYFDSDGY   18 (19)
T ss_dssp             SEEEEETTEE-EEETTTSB
T ss_pred             CcCEEECCEE-EEeCCCcc
Confidence            4677774333 99999885


No 18 
>cd01774 Faf1_like2_UBX Faf1 ike-2 UBX domain. Faf1_like2 is a protein of unknown function with a domain architecture that includes the UAS (ubiquitin-associated) and UBX (ubiquitin-like) domains.  This protein is related to other UBA/UBX-containing proteins like Faf1, p47, and SAKS1 and may serve as an adaptor molecule that shuttles proteins to the proteasome for degradation.  The UBX domain has a beta-grasp fold similar to that of ubiquitin however, UBX lacks the c-terminal double glycine motif and is thus unlikely to be conjugated to other proteins.
Probab=42.53  E-value=58  Score=21.50  Aligned_cols=44  Identities=20%  Similarity=0.306  Sum_probs=29.7

Q ss_pred             eCCCcccc-------ccHHHHHHHHhhCceeccCCCccccccceeecCCCCCc-cccc
Q psy8172          35 TRPSNTAL-------SSMEQVKAYLQTQGTCKCGLECPLQCDSVFSFDAKPRE-RATE   84 (98)
Q Consensus        35 iSPsgt~L-------~S~eqv~~YL~t~GTCKCGLeCPl~~~~vFnFdp~v~~-~P~~   84 (98)
                      .=|+|+.|       .+++.|.+|+.+.|      ..|-...-+.||-.++.. .|.+
T Consensus        10 Rlp~G~Rl~rrF~~~~tl~~l~~fv~~~~------~~~~~f~L~t~FPrr~~~~~~~~   61 (85)
T cd01774          10 KLPNGTRVERRFLFTQSLRVIHDFLFSLK------ETPEKFQIVTNFPRRVLPCLPSE   61 (85)
T ss_pred             ECCCCCEEEEEeCCCCcHHHHHHHHHhCC------CCCCcEEEecCCCCccccccccc
Confidence            33555554       78999999996433      466667777888777665 3444


No 19 
>PF12677 DUF3797:  Domain of unknown function (DUF3797);  InterPro: IPR024256 This presumed domain is functionally uncharacterised. This domain family is found in bacteria and viruses, and is approximately 50 amino acids in length. There is a conserved CGN sequence motif.
Probab=37.19  E-value=14  Score=23.73  Aligned_cols=7  Identities=71%  Similarity=1.725  Sum_probs=5.6

Q ss_pred             ceeccCC
Q psy8172          56 GTCKCGL   62 (98)
Q Consensus        56 GTCKCGL   62 (98)
                      -|||||.
T Consensus        40 RtCkCGf   46 (49)
T PF12677_consen   40 RTCKCGF   46 (49)
T ss_pred             eeecccc
Confidence            4899995


No 20 
>PF08671 SinI:  Anti-repressor SinI;  InterPro: IPR010981 The SinR repressor is part of a group of Sin (sporulation inhibition) proteins in Bacillus subtilis that regulate the commitment to sporulation in response to extreme adversity []. SinR is a tetrameric repressor protein that binds to the promoters of genes essential for entry into sporulation and prevents their transcription. This repression is overcome through the activity of SinI, which disrupts the SinR tetramer through the formation of a SinI-SinR heterodimer, thereby allowing sporulation to proceed. The SinR structure consists of two domains: a dimerisation domain stabilised by a hydrophobic core, and a DNA-binding domain that is identical to domains of the bacteriophage 434 CI and Cro proteins that regulate prophage induction. The dimerisation domain is a four-helical bundle formed from two helices from the C-terminal residues of SinR and two helices from the central residues of SinI. These regions in SinR and SinI are similar in both structure and sequence. The interaction of SinR monomers to form tetramers is weaker than between SinR and SinI, since SinI can effectively disrupt SinR tetramers. This entry represents the dimerisation domain in both SinI and SinR proteins.; GO: 0005488 binding, 0006355 regulation of transcription, DNA-dependent; PDB: 1B0N_A 2YAL_A.
Probab=33.94  E-value=31  Score=19.72  Aligned_cols=11  Identities=27%  Similarity=0.622  Sum_probs=8.9

Q ss_pred             cHHHHHHHHhh
Q psy8172          44 SMEQVKAYLQT   54 (98)
Q Consensus        44 S~eqv~~YL~t   54 (98)
                      |.||||+||.+
T Consensus        18 s~eeir~FL~~   28 (30)
T PF08671_consen   18 SKEEIREFLEF   28 (30)
T ss_dssp             -HHHHHHHHHH
T ss_pred             CHHHHHHHHHh
Confidence            78999999975


No 21 
>PF10568 Tom37:  Outer mitochondrial membrane transport complex protein;  InterPro: IPR019564 Tom37 is one of the outer membrane proteins that make up the TOM complex for guiding cytosolic mitochondrial beta-barrel proteins from the cytosol across the outer mitochondrial membrane into the intramembrane space. In conjunction with Tom70, it guides peptides without an mitochondrial targeting sequence (MTS) into Tom40, the protein that forms the passage through the outer membrane []. It has homology with metaxin, also part of the outer mitochondrial membrane beta-barrel protein transport complex []. This entry represents outer mitochondrial membrane transport complex proteins Tom37 and metaxin.; GO: 0006626 protein targeting to mitochondrion, 0005741 mitochondrial outer membrane
Probab=33.55  E-value=40  Score=21.60  Aligned_cols=38  Identities=11%  Similarity=0.041  Sum_probs=29.3

Q ss_pred             CeeEecccceeeeccce-eEeCC-CccccccHHHHHHHHh
Q psy8172          16 STVLTTTDYTSEVGFWK-QTTRP-SNTALSSMEQVKAYLQ   53 (98)
Q Consensus        16 s~v~vp~gw~r~v~k~~-~YiSP-sgt~L~S~eqv~~YL~   53 (98)
                      .--||+...+.+.-.|+ +++-- .++.++..++|.+||.
T Consensus        33 ~~~vv~s~n~~~Sptg~LP~L~~~~~~~vsg~~~Iv~yL~   72 (72)
T PF10568_consen   33 QFKVVPSNNPWLSPTGELPALIDSGGTWVSGFRNIVEYLR   72 (72)
T ss_pred             eEEEEEcCCCCcCCCCCCCEEEECCCcEEECHHHHHHhhC
Confidence            36677777777777777 55555 7888999999999984


No 22 
>PF10379 nec1:  Virulence protein nec1;  InterPro: IPR018836  The Nec1 protein has necrogenic activity on excised potato tuber tissue, and the encoding gene is highly conserved in plant-pathogenic Streptomyces spp. The G+C content of nec1 indicates lateral transfer from an unrelated taxon, but its origins are unclear. Deletion analysis of nec1 demonstrated that the 151-amino-acid C-terminal region of the Nec1 protein is sufficient to confer necrogenic activity. Streptomyces turgidiscabies containing a nec1 deletion was greatly compromised in virulence on Arabidopsis thaliana (Mouse-ear cress), Nicotiana tabacum (Common tobacco), and Raphanus sativus (Radish) seedlings. The wild-type strain, S. turgidiscabies Car8, aggressively colonized and infected the root meristem of radish, whereas the delta-nec1 mutant Car811 did not. Taken together, the data suggest that Nec1 is a secreted virulence protein with a conserved plant cell target that acts early in plant infection []. 
Probab=33.02  E-value=24  Score=27.56  Aligned_cols=20  Identities=15%  Similarity=0.101  Sum_probs=16.2

Q ss_pred             cceeecCCCCCcccccCCCC
Q psy8172          69 DSVFSFDAKPRERATEVHEP   88 (98)
Q Consensus        69 ~~vFnFdp~v~~~P~~~~~~   88 (98)
                      ..-||||-++++.||+-+-+
T Consensus       156 aqrfnfdqgipnlpwndqvs  175 (184)
T PF10379_consen  156 AQRFNFDQGIPNLPWNDQVS  175 (184)
T ss_pred             hhhcccccCCCCCCcccccc
Confidence            45699999999999985443


No 23 
>smart00504 Ubox Modified RING finger domain. Modified RING finger domain, without the full complement of Zn2+-binding ligands. Probable involvement in E2-dependent ubiquitination.
Probab=32.88  E-value=51  Score=19.05  Aligned_cols=25  Identities=20%  Similarity=0.291  Sum_probs=18.3

Q ss_pred             eEeCCCccccccHHHHHHHHhhCcee
Q psy8172          33 QTTRPSNTALSSMEQVKAYLQTQGTC   58 (98)
Q Consensus        33 ~YiSPsgt~L~S~eqv~~YL~t~GTC   58 (98)
                      +.++|+|+ .-..+-+.+|+...++|
T Consensus        14 Pv~~~~G~-v~~~~~i~~~~~~~~~c   38 (63)
T smart00504       14 PVILPSGQ-TYERRAIEKWLLSHGTD   38 (63)
T ss_pred             CEECCCCC-EEeHHHHHHHHHHCCCC
Confidence            46778884 34788899999886554


No 24 
>COG3828 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=32.32  E-value=31  Score=28.12  Aligned_cols=30  Identities=13%  Similarity=-0.012  Sum_probs=23.7

Q ss_pred             CCCeeEecccceeeeccceeEeCCCccccc
Q psy8172          14 PKSTVLTTTDYTSEVGFWKQTTRPSNTALS   43 (98)
Q Consensus        14 p~s~v~vp~gw~r~v~k~~~YiSPsgt~L~   43 (98)
                      =+--|++|-=|+|-.||||+|-|-=|+.-+
T Consensus       167 ~~~~v~mPVvWtR~~GkGRvFYssLGHta~  196 (239)
T COG3828         167 LIDGVVMPVVWTRRYGKGRVFYSSLGHTAS  196 (239)
T ss_pred             cccCeeeeeEEEeeecCceEEEeccCCCcc
Confidence            344599999999999999977777776544


No 25 
>PF00397 WW:  WW domain;  InterPro: IPR001202 Synonym(s): Rsp5 or WWP domain The WW domain is a short conserved region in a number of unrelated proteins, which folds as a stable, triple stranded beta-sheet. This short domain of approximately 40 amino acids, may be repeated up to four times in some proteins [, , , ]. The name WW or WWP derives from the presence of two signature tryptophan residues that are spaced 20-23 amino acids apart and are present in most WW domains known to date, as well as that of a conserved Pro. The WW domain binds to proteins with particular proline-motifs, [AP]-P-P-[AP]-Y, and/or phosphoserine- phosphothreonine-containing motifs [, ]. It is frequently associated with other domains typical for proteins in signal transduction processes. A large variety of proteins containing the WW domain are known. These include; dystrophin, a multidomain cytoskeletal protein; utrophin, a dystrophin-like protein of unknown function; vertebrate YAP protein, substrate of an unknown serine kinase; Mus musculus (Mouse) NEDD-4, involved in the embryonic development and differentiation of the central nervous system; Saccharomyces cerevisiae (Baker's yeast) RSP5, similar to NEDD-4 in its molecular organisation; Rattus norvegicus (Rat) FE65, a transcription-factor activator expressed preferentially in liver; Nicotiana tabacum (Common tobacco) DB10 protein, amongst others.; GO: 0005515 protein binding; PDB: 2JXW_A 2DK1_A 2JOC_A 2JO9_A 1YIU_A 1O6W_A 2JMF_A 1TK7_A 2KYK_A 2L5F_A ....
Probab=31.60  E-value=40  Score=18.25  Aligned_cols=18  Identities=0%  Similarity=-0.180  Sum_probs=13.1

Q ss_pred             cccceeeec-c-ce-eEeCCC
Q psy8172          21 TTDYTSEVG-F-WK-QTTRPS   38 (98)
Q Consensus        21 p~gw~r~v~-k-~~-~YiSPs   38 (98)
                      |-||+..++ . || +|+...
T Consensus         2 P~gW~~~~~~~~g~~YY~N~~   22 (31)
T PF00397_consen    2 PPGWEEYFDPDSGRPYYYNHE   22 (31)
T ss_dssp             STTEEEEEETTTSEEEEEETT
T ss_pred             CcCCEEEEcCCCCCEEEEeCC
Confidence            779998885 3 77 776553


No 26 
>smart00034 CLECT C-type lectin (CTL) or carbohydrate-recognition domain (CRD). Many of these domains function as calcium-dependent carbohydrate binding modules.
Probab=31.20  E-value=66  Score=19.40  Aligned_cols=31  Identities=16%  Similarity=0.154  Sum_probs=17.8

Q ss_pred             cccceeeeccce-eEeCCCccccccHHHHHHHHhhCc
Q psy8172          21 TTDYTSEVGFWK-QTTRPSNTALSSMEQVKAYLQTQG   56 (98)
Q Consensus        21 p~gw~r~v~k~~-~YiSPsgt~L~S~eqv~~YL~t~G   56 (98)
                      |.||...  .++ ++++...   .+.++.++|+...|
T Consensus         2 p~~w~~~--~~~Cy~~~~~~---~~~~~A~~~C~~~~   33 (126)
T smart00034        2 PSGWISY--GGKCYKFSTEK---KTWADAQAFCQSLG   33 (126)
T ss_pred             Ccchhhh--CCEEEEEECCc---cCHHHHHHHHHhcC
Confidence            6788887  444 5555542   45555555555443


No 27 
>PF10281 Ish1:  Putative stress-responsive nuclear envelope protein;  InterPro: IPR018803  This group of proteins, found primarily in fungi, consists of putative stress-responsive nuclear envelope protein Ish1 and homologues []. 
Probab=31.00  E-value=44  Score=18.87  Aligned_cols=14  Identities=29%  Similarity=0.524  Sum_probs=11.9

Q ss_pred             ccHHHHHHHHhhCc
Q psy8172          43 SSMEQVKAYLQTQG   56 (98)
Q Consensus        43 ~S~eqv~~YL~t~G   56 (98)
                      +|.+|+++||.+-|
T Consensus         4 Ws~~~L~~wL~~~g   17 (38)
T PF10281_consen    4 WSDSDLKSWLKSHG   17 (38)
T ss_pred             CCHHHHHHHHHHcC
Confidence            57899999998865


No 28 
>PF11307 DUF3109:  Protein of unknown function (DUF3109);  InterPro: IPR021458  This bacterial family of proteins has no known function. 
Probab=30.78  E-value=12  Score=29.27  Aligned_cols=12  Identities=33%  Similarity=0.642  Sum_probs=7.1

Q ss_pred             cccccceeecCC
Q psy8172          65 PLQCDSVFSFDA   76 (98)
Q Consensus        65 Pl~~~~vFnFdp   76 (98)
                      |+++.+.|.||.
T Consensus       126 PIRv~~~~~~~~  137 (183)
T PF11307_consen  126 PIRVKKYSGFDA  137 (183)
T ss_pred             ceEEEeecCcCE
Confidence            555566666664


No 29 
>cd00173 SH2 Src homology 2 domains; Signal transduction, involved in recognition of phosphorylated tyrosine (pTyr). SH2 domains typically bind pTyr-containing ligands via two surface pockets, a pTyr and hydrophobic binding pocket, allowing proteins with SH2 domains to localize to tyrosine phosphorylated sites.
Probab=30.66  E-value=1e+02  Score=18.73  Aligned_cols=33  Identities=15%  Similarity=0.099  Sum_probs=24.2

Q ss_pred             eEeCCCccccccHHHHHHHHhhCceeccCCCccc
Q psy8172          33 QTTRPSNTALSSMEQVKAYLQTQGTCKCGLECPL   66 (98)
Q Consensus        33 ~YiSPsgt~L~S~eqv~~YL~t~GTCKCGLeCPl   66 (98)
                      .++...+..-.|++|+.+|...... .-++.|.|
T Consensus        58 ~~~~~~~~~f~sl~eLv~~y~~~~~-~~~~~~~L   90 (94)
T cd00173          58 YYLLGEGRSFPSLPELIEHYQKNPL-SDGLGVKL   90 (94)
T ss_pred             eEEecCCCccCCHHHHHHHHhhCcc-CCCcccEe
Confidence            4444478889999999999998876 44455544


No 30 
>PHA01635 hypothetical protein
Probab=30.65  E-value=1.3e+02  Score=24.51  Aligned_cols=61  Identities=20%  Similarity=0.336  Sum_probs=47.4

Q ss_pred             CCCCeeEecc-cceeeeccce--eEeCCCcc----ccccHHHHHHHHhhCceeccCCCccccccceeecCC
Q psy8172          13 DPKSTVLTTT-DYTSEVGFWK--QTTRPSNT----ALSSMEQVKAYLQTQGTCKCGLECPLQCDSVFSFDA   76 (98)
Q Consensus        13 ~p~s~v~vp~-gw~r~v~k~~--~YiSPsgt----~L~S~eqv~~YL~t~GTCKCGLeCPl~~~~vFnFdp   76 (98)
                      ||+|+|-||. ||.-+|.-+-  .=+.|-|.    ..-+.+||.+|=+..|.   .+.||=.+-.||+.--
T Consensus       125 pP~svi~vPEYGF~nLvs~~~rLleVkpg~in~Y~iiYdy~EvveYe~e~g~---nv~~pPdPy~v~Sy~~  192 (231)
T PHA01635        125 PPNSVIHVPEYGFVNLVSLSGRLIEVKPGSLNLYAIIYDYAEVVEYEQEAGI---NVASPPDPYAVFSYQF  192 (231)
T ss_pred             CCCCEEEccccceeeeeeecceEEEeecCCceEEEEEecHHHHhhHHHHhCc---ccCCCCCCeEEEEEEe
Confidence            7999999996 7888776443  66777764    34689999999999997   6788888888886443


No 31 
>cd00246 RabGEF Nucleotide exchange factor for Rab-like small GTPases (RabGEF), Mss4 type; RabGEF positely regulates the function of  Rab GTPase by promoting exchange of GDP for GTP; members of the Rab subfamily of Ras GTPases are important in vesicular transport;
Probab=30.16  E-value=30  Score=24.86  Aligned_cols=37  Identities=14%  Similarity=0.047  Sum_probs=27.8

Q ss_pred             CcccccCCCCCCCCCCeeEe-------cccceeeeccceeEeCC
Q psy8172           1 MFALMNSNLQPPDPKSTVLT-------TTDYTSEVGFWKQTTRP   37 (98)
Q Consensus         1 ~~~~~~~~~~~p~p~s~v~v-------p~gw~r~v~k~~~YiSP   37 (98)
                      ||+.+|-...-+.+.--.++       |+||..++++..-|++.
T Consensus        54 m~~FeNigfs~~~~~~k~L~CadCe~GPiG~~~~~~~~~~yva~   97 (103)
T cd00246          54 MFTFENVGFSKDVGNLKYLVCADCEKGPIGYHCLDDKNSEYVAL   97 (103)
T ss_pred             cceeeeeeeccCCCCeeEEEeccCCCCceEeEeecccchhhhhh
Confidence            67788877766666666777       89999988777667764


No 32 
>PF13409 GST_N_2:  Glutathione S-transferase, N-terminal domain; PDB: 3C8E_B 3M1G_A 3R3E_A 3O3T_A 1RK4_A 1K0O_B 1K0N_A 3QR6_A 3SWL_A 3TGZ_B ....
Probab=29.77  E-value=40  Score=20.32  Aligned_cols=31  Identities=19%  Similarity=0.150  Sum_probs=23.6

Q ss_pred             eeeeccce--eEeCCCccccccHHHHHHHHhhC
Q psy8172          25 TSEVGFWK--QTTRPSNTALSSMEQVKAYLQTQ   55 (98)
Q Consensus        25 ~r~v~k~~--~YiSPsgt~L~S~eqv~~YL~t~   55 (98)
                      .++-.+++  +-+-++|+++....+|-+||-++
T Consensus        38 ~~~~p~~~VP~L~~~~g~vi~eS~~I~~yL~~~   70 (70)
T PF13409_consen   38 LALNPRGKVPVLVDPDGTVINESLAILEYLEEQ   70 (70)
T ss_dssp             HHHSTT-SSSEEEETTTEEEESHHHHHHHHHHT
T ss_pred             hccCcCeEEEEEEECCCCEeeCHHHHHHHHhcC
Confidence            45566777  45558999999999999999753


No 33 
>smart00456 WW Domain with 2 conserved Trp (W) residues. Also known as the WWP or rsp5 domain. Binds proline-rich polypeptides.
Probab=29.64  E-value=37  Score=17.71  Aligned_cols=18  Identities=0%  Similarity=-0.193  Sum_probs=12.4

Q ss_pred             ecccceeeeccc-e-eEeCC
Q psy8172          20 TTTDYTSEVGFW-K-QTTRP   37 (98)
Q Consensus        20 vp~gw~r~v~k~-~-~YiSP   37 (98)
                      .|.||+...+.. + +|+..
T Consensus         1 lp~gW~~~~~~~g~~yy~n~   20 (32)
T smart00456        1 LPPGWEERKDPDGRPYYYNH   20 (32)
T ss_pred             CCCCCEEEECCCCCEEEEEC
Confidence            378998887754 4 66653


No 34 
>cd03593 CLECT_NK_receptors_like C-type lectin-like domain (CTLD) of the type found in natural killer cell receptors (NKRs). CLECT_NK_receptors_like: C-type lectin-like domain (CTLD) of the type found in natural killer cell receptors (NKRs), including proteins similar to oxidized low density lipoprotein (OxLDL) receptor (LOX-1), CD94, CD69, NKG2-A and -D, osteoclast inhibitory lectin (OCIL), dendritic cell-associated C-type lectin-1 (dectin-1),  human myeloid inhibitory C-type lectin-like receptor (MICL), mast cell-associated functional antigen (MAFA), killer cell lectin-like receptors: subfamily F, member 1 (KLRF1) and subfamily B, member 1 (KLRB1), and lys49 receptors.  CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins.  NKRs are variously associated with activation or inhibition of natural killer (NK) cells. Activating NKRs stimulate cytolysis by NK cells of virally infected or transformed cells; inhibitory NKRs block cytolysis up
Probab=26.58  E-value=1.1e+02  Score=18.99  Aligned_cols=31  Identities=10%  Similarity=0.045  Sum_probs=19.5

Q ss_pred             cccceeeeccce-eEeCCCccccccHHHHHHHHhhCc
Q psy8172          21 TTDYTSEVGFWK-QTTRPSNTALSSMEQVKAYLQTQG   56 (98)
Q Consensus        21 p~gw~r~v~k~~-~YiSPsgt~L~S~eqv~~YL~t~G   56 (98)
                      |.||.+.-+  + +|++...   .+.++.+.|....|
T Consensus         2 p~gw~~~~~--~Cy~~~~~~---~~w~~A~~~C~~~g   33 (116)
T cd03593           2 PKDWICYGN--KCYYFSMEK---KTWNESKEACSSKN   33 (116)
T ss_pred             CCcCEEeCC--EEEEEEcCC---CCHHHHHHHHHhCC
Confidence            678877543  4 7777764   45666666655554


No 35 
>COG3139 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=25.48  E-value=13  Score=26.37  Aligned_cols=22  Identities=32%  Similarity=0.625  Sum_probs=18.3

Q ss_pred             eEecccceee-----eccceeEeCCCcccccc
Q psy8172          18 VLTTTDYTSE-----VGFWKQTTRPSNTALSS   44 (98)
Q Consensus        18 v~vp~gw~r~-----v~k~~~YiSPsgt~L~S   44 (98)
                      .|+|.-|||+     .|||     |+|++|..
T Consensus         9 ~mtPEiYQrL~~AvElGKW-----PdG~~Ltq   35 (90)
T COG3139           9 SMTPEIYQRLSTAVELGKW-----PDGVALTQ   35 (90)
T ss_pred             hcCHHHHHHHHHHHHhcCC-----CCCCcCCH
Confidence            4789999996     4788     99999964


No 36 
>PF02943 FeThRed_B:  Ferredoxin thioredoxin reductase catalytic beta chain;  InterPro: IPR004209 Ferredoxin thioredoxin reductase is a [4FE-4S] protein present in organisms performing oxygenic photosynthesis, and plays an important role in the ferredoxin/thioredoxin regulatory chain. It converts an electron signal (photoreduced ferredoxin) to a thiol signal (reduced thioredoxin), regulating enzymes by reduction of specific disulphide groups. It catalyses the light-dependent activation of several photosynthetis enzymes. Ferredoxin thioredoxin reductase is a heterodimer of subunit alpha and subunit beta. Subunit alpha is the variable subunit, and beta is the catalytic chain [].  The structure of the beta subunit has been determined and found to fold around the FeS cluster [].; GO: 0008937 ferredoxin-NAD(P) reductase activity, 0055114 oxidation-reduction process; PDB: 2PUK_E 2PVD_A 2PVG_A 2PUO_A 2PU9_A 1DJ7_A 2PVO_A.
Probab=24.28  E-value=30  Score=24.58  Aligned_cols=11  Identities=45%  Similarity=1.053  Sum_probs=5.0

Q ss_pred             hCceeccCCCc
Q psy8172          54 TQGTCKCGLEC   64 (98)
Q Consensus        54 t~GTCKCGLeC   64 (98)
                      ..|.|.|||=+
T Consensus        77 ~~G~ChC~Lf~   87 (108)
T PF02943_consen   77 EYGHCHCGLFV   87 (108)
T ss_dssp             -----TTSSSB
T ss_pred             ccccceEeeeE
Confidence            78999999965


No 37 
>TIGR01204 bioW 6-carboxyhexanoate--CoA ligase. Alternate name: pimeloyl-CoA synthase.
Probab=23.75  E-value=1.2e+02  Score=24.49  Aligned_cols=34  Identities=9%  Similarity=-0.039  Sum_probs=24.8

Q ss_pred             eEecc-cceeee--------ccce-eEeCCCccccccHHHHHHHHhhC
Q psy8172          18 VLTTT-DYTSEV--------GFWK-QTTRPSNTALSSMEQVKAYLQTQ   55 (98)
Q Consensus        18 v~vp~-gw~r~v--------~k~~-~YiSPsgt~L~S~eqv~~YL~t~   55 (98)
                      |-+.. ||.||-        .-|| +|++++  +  +++++.+||...
T Consensus       184 VA~~~~gY~RI~~lK~~G~~~GGRvffv~~~--~--~l~~~i~yLE~~  227 (232)
T TIGR01204       184 VSGKEIGYVRITPLKEKGDELGGRVFFVSRK--N--ELSEYIHCLEQK  227 (232)
T ss_pred             EEeCCCCeEeCccccccCCCCCCEEEEEeCC--C--CHHHHHHHHhcC
Confidence            55565 898874        2267 777776  3  999999999864


No 38 
>PF00799 Gemini_AL1:  Geminivirus Rep catalytic domain;  InterPro: IPR022690 Geminiviruses are characterised by a genome of circular single-stranded DNA encapsidated in twinned (geminate) quasi-isometric particles, from which the group derives its name []. Most geminiviruses can be divided into two subgroups on the basis of host range and/or insect vector: i.e. those that infect dicotyledenous plants and are transmitted by the same whitefly species, and those that infect monocotyledenous plants and are transmitted by different leafhopper vectors. The genomes of the whitefly-transmitted African cassava mosaic virus, Tomato golden mosaic virus (TGMV) and Bean golden mosaic virus (BGMV) possess a bipartite genome. By contrast, only a single DNA component has been identified for the leafhopper-transmitted Maize streak virus (MSV) and Wheat dwarf virus (WDV) [, ]. Beet curly top virus (BCTV), and Tobacco yellow dwarf virus belong to a third possible subgroup. Like MSV and WDV, BCTV is transmitted by a specific leafhopper species, yet like the whitefly-transmitted geminiviruses it has a host range confined to dicotyledenous plants. Sequence comparison of the whitefly-transmitted Squash leaf curl virus (SqLCV) and Tomato yellow leaf curl virus (TYLCV) with the genomic components of TGMV and BGMV reveals a close evolutionary relationship [, , ]. Amino acid sequence alignments of Potato yellow mosaic virus (PYMV) proteins with those encoded by other geminiviruses show that PYMV is closely related to geminiviruses isolated from the New World, especially in the putative coat protein gene regions []. Comparison of MSV DNA-encoded proteins with those of other geminiviruses infecting monocotyledonous plants, including Panicum streak virus [] and Miscanthus streak virus (MiSV) [], reveal high levels of similarity.  The AL1 proteins encodes the replication initiator protein (Rep) of geminiviruses, which is a replicon-specific initiator enzyme and is an essential component of the replisome []. For geminivirus Rep protein, this N-terminal region is crucial for origin recognition and DNA cleavage and nucleotidyl transfer []. It is found in association with PF08283 from PFAM. ; GO: 0006260 DNA replication; PDB: 1L5I_A 1L2M_A.
Probab=22.61  E-value=48  Score=23.54  Aligned_cols=23  Identities=26%  Similarity=0.479  Sum_probs=15.2

Q ss_pred             eCCCccccccHHHHHHHHhhCce
Q psy8172          35 TRPSNTALSSMEQVKAYLQTQGT   57 (98)
Q Consensus        35 iSPsgt~L~S~eqv~~YL~t~GT   57 (98)
                      ..|--...+|.++|++|+..||.
T Consensus        81 fHPNIq~aKs~s~vk~YI~KDgd  103 (114)
T PF00799_consen   81 FHPNIQGAKSSSDVKSYIEKDGD  103 (114)
T ss_dssp             E--EEEEESSSTHHHHHHH--SS
T ss_pred             cccccccccCHHHHHHHhhcCCC
Confidence            34555566889999999999993


No 39 
>PF13417 GST_N_3:  Glutathione S-transferase, N-terminal domain; PDB: 3ERG_B 3IBH_A 3ERF_A 3UBL_A 3UBK_A 3IR4_A 3M8N_B 2R4V_A 2PER_A 2R5G_A ....
Probab=22.07  E-value=47  Score=19.98  Aligned_cols=31  Identities=13%  Similarity=0.096  Sum_probs=25.4

Q ss_pred             ceeeeccce-eEeCCCccccccHHHHHHHHhh
Q psy8172          24 YTSEVGFWK-QTTRPSNTALSSMEQVKAYLQT   54 (98)
Q Consensus        24 w~r~v~k~~-~YiSPsgt~L~S~eqv~~YL~t   54 (98)
                      +.++..+++ +.+-++|..+.....|.+||-.
T Consensus        38 ~~~~~p~~~vPvL~~~g~~l~dS~~I~~yL~~   69 (75)
T PF13417_consen   38 FLKLNPKGKVPVLVDDGEVLTDSAAIIEYLEE   69 (75)
T ss_dssp             HHHHSTTSBSSEEEETTEEEESHHHHHHHHHH
T ss_pred             HHhhcccccceEEEECCEEEeCHHHHHHHHHH
Confidence            455667788 6666889999999999999965


No 40 
>PF03128 CXCXC:  CXCXC repeat;  InterPro: IPR004153 This repeat contains the conserved pattern CXCXC where X can be any amino acid. The repeat is found in up to five copies in Vascular endothelial growth factor C []. In the salivary glands of the dipteran Chironomus tentans, a specific messenger ribonucleoprotein (mRNP) particle, the Balbiani ring (BR) granule, can be visualized during its assembly on the gene and during its nucleocytoplasmic transport. This repeat is found over 70 copies in the balbiani ring protein 3 (Q03376 from SWISSPROT). It is also found in some silk proteins [].
Probab=21.85  E-value=38  Score=16.27  Aligned_cols=8  Identities=38%  Similarity=1.153  Sum_probs=5.9

Q ss_pred             hCceeccC
Q psy8172          54 TQGTCKCG   61 (98)
Q Consensus        54 t~GTCKCG   61 (98)
                      .+.||+|+
T Consensus         5 n~~tC~C~   12 (14)
T PF03128_consen    5 NDDTCQCE   12 (14)
T ss_pred             cCCCcCcc
Confidence            46789886


Done!