Query         psy1631
Match_columns 77
No_of_seqs    27 out of 29
Neff          3.0 
Searched_HMMs 46136
Date          Fri Aug 16 18:02:33 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy1631.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/1631hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 PF13998 MgrB:  MgrB protein     80.0    0.67 1.4E-05   25.8   0.3   15   12-27      9-23  (29)
  2 PRK10299 PhoPQ regulatory prot  66.3     2.3   5E-05   25.9   0.3   16   11-27     26-41  (47)
  3 PF08209 Sgf11:  Sgf11 (transcr  65.1     3.4 7.3E-05   23.1   0.8   12   35-46     16-27  (33)
  4 PF11201 DUF2982:  Protein of u  63.0     4.2 9.1E-05   29.4   1.2   20   27-46    130-150 (152)
  5 PF00042 Globin:  Globin plant   61.3      17 0.00038   21.9   3.6   52   19-70     36-88  (110)
  6 PF05962 HutD:  HutD;  InterPro  56.6     2.9 6.4E-05   29.8  -0.5   17   24-40     34-50  (184)
  7 smart00071 Galanin Galanin. Ga  54.1      11 0.00023   26.2   2.0   34   22-55     41-76  (103)
  8 cd04885 ACT_ThrD-I Tandem C-te  47.5       9  0.0002   22.4   0.7    8   33-40      7-14  (68)
  9 PF02087 Nitrophorin:  Nitropho  40.7     7.3 0.00016   29.2  -0.5   12   16-27     12-23  (178)
 10 cd01228 PH_BCR-related BCR (br  40.1     9.1  0.0002   26.1  -0.1   17   16-32     40-56  (96)
 11 cd01040 globin Globins are hem  40.0      55  0.0012   20.1   3.5   51   19-70     39-89  (140)
 12 PRK11396 hypothetical protein;  39.8     9.7 0.00021   28.3   0.0   18   24-41     35-52  (191)
 13 PF01267 F-actin_cap_A:  F-acti  38.3      11 0.00025   28.6   0.2   11   16-26    154-164 (271)
 14 PHA03004 putative membrane pro  35.7      28  0.0006   27.6   1.9   53    7-72    175-227 (270)
 15 PF08014 DUF1704:  Domain of un  35.4      66  0.0014   25.5   4.0   55   20-75     27-83  (349)
 16 KOG2859|consensus               34.8      12 0.00025   29.9  -0.2   33   33-70    215-248 (293)
 17 KOG0759|consensus               34.2      11 0.00024   30.0  -0.5   50    6-56    127-179 (286)
 18 PF14144 DOG1:  Seed dormancy c  33.1      11 0.00024   24.1  -0.5   16   15-30     44-59  (80)
 19 KOG3623|consensus               32.3      15 0.00032   33.5  -0.1   20   32-51    319-340 (1007)
 20 PF08137 DVL:  DVL family;  Int  31.9      29 0.00063   17.6   1.0    9   42-50     10-18  (19)
 21 PF03722 Hemocyanin_N:  Hemocya  31.8   1E+02  0.0023   20.8   4.1   39   33-71     61-102 (126)
 22 COG2999 GrxB Glutaredoxin 2 [P  30.8      35 0.00076   26.3   1.7   35   34-68    129-163 (215)
 23 TIGR02421 QEGLA conserved hypo  28.6 1.1E+02  0.0023   24.9   4.2   52   23-75     56-109 (366)
 24 PF11798 IMS_HHH:  IMS family H  27.8      24 0.00052   18.8   0.3   13   24-36      8-20  (32)
 25 KOG0409|consensus               27.1      83  0.0018   25.6   3.4   39   33-71    263-301 (327)
 26 KOG2655|consensus               25.8      21 0.00045   29.0  -0.2   27    2-33     81-107 (366)
 27 KOG0836|consensus               25.2      28 0.00061   27.8   0.4   16   16-31    161-180 (282)
 28 cd02682 MIT_AAA_Arch MIT: doma  24.9 1.6E+02  0.0036   18.7   3.8   34   31-64     15-51  (75)
 29 PF02208 Sorb:  Sorbin homologo  24.8      67  0.0015   19.6   1.9   24   26-49     16-39  (47)
 30 KOG2859|consensus               24.8      77  0.0017   25.4   2.7   29   32-60    169-200 (293)
 31 KOG3197|consensus               24.2 1.3E+02  0.0027   23.3   3.7   22   30-51     48-69  (210)
 32 PF13925 Katanin_con80:  con80   23.7      99  0.0022   21.5   2.9   33   42-74     26-60  (164)
 33 PF01832 Glucosaminidase:  Mann  23.5      34 0.00074   22.0   0.5   11   15-26     36-46  (136)
 34 PF05501 DUF755:  Domain of unk  23.4      82  0.0018   21.7   2.4   28   46-73     38-65  (123)
 35 PF15059 Speriolin_C:  Sperioli  23.2      73  0.0016   23.3   2.2   31    7-39     75-105 (146)
 36 PF07693 KAP_NTPase:  KAP famil  23.0      45 0.00097   23.9   1.1   26   27-52    176-202 (325)
 37 PF13929 mRNA_stabil:  mRNA sta  22.7 1.5E+02  0.0033   23.5   4.0   39   35-76    197-237 (292)
 38 PRK13291 metal-dependent hydro  22.1   2E+02  0.0044   19.9   4.2   20   26-45      5-24  (173)
 39 PF06812 ImpA-rel_N:  ImpA-rela  20.6 1.5E+02  0.0033   17.3   2.9   25   47-71      7-32  (62)
 40 KOG3904|consensus               20.5      27 0.00058   26.8  -0.5   30   10-41    138-167 (209)
 41 COG3326 Predicted membrane pro  20.3      39 0.00084   23.0   0.3   10   20-29     30-39  (94)
 42 cd06336 PBP1_ABC_ligand_bindin  20.2 2.3E+02   0.005   20.5   4.3   26   35-60    278-303 (347)
 43 PF11817 Foie-gras_1:  Foie gra  20.1 2.3E+02   0.005   20.5   4.3   33   37-69    212-244 (247)
 44 PF09571 RE_XcyI:  XcyI restric  20.0      58  0.0013   26.5   1.2   40   32-71    115-162 (318)

No 1  
>PF13998 MgrB:  MgrB protein
Probab=80.03  E-value=0.67  Score=25.81  Aligned_cols=15  Identities=20%  Similarity=0.281  Sum_probs=13.1

Q ss_pred             cccccccccccccccc
Q psy1631          12 SCLTSVVHFGLGLWRT   27 (77)
Q Consensus        12 fc~rnktNFFnGiWri   27 (77)
                      +|+.- .+||+||++|
T Consensus         9 ~CDQg-~~F~~GiC~i   23 (29)
T PF13998_consen    9 YCDQG-EQFFSGICSI   23 (29)
T ss_pred             HhcCC-CCceeeeeEE
Confidence            58866 8999999987


No 2  
>PRK10299 PhoPQ regulatory protein; Provisional
Probab=66.29  E-value=2.3  Score=25.92  Aligned_cols=16  Identities=25%  Similarity=0.357  Sum_probs=13.7

Q ss_pred             ccccccccccccccccc
Q psy1631          11 QSCLTSVVHFGLGLWRT   27 (77)
Q Consensus        11 ~fc~rnktNFFnGiWri   27 (77)
                      .+|++- ..||.||++|
T Consensus        26 ~~CDQg-~~F~~GIC~i   41 (47)
T PRK10299         26 MMCDQD-VQFFSGICAI   41 (47)
T ss_pred             HHhcCC-ccceeeeeee
Confidence            369977 8999999987


No 3  
>PF08209 Sgf11:  Sgf11 (transcriptional regulation protein);  InterPro: IPR013246 The Sgf11 family is a SAGA complex subunit in Saccharomyces cerevisiae (Baker's yeast). The SAGA complex is a multisubunit protein complex involved in transcriptional regulation. SAGA combines proteins involved in interactions with DNA-bound activators and TATA-binding protein (TBP), as well as enzymes for histone acetylation and deubiquitylation [].; PDB: 3M99_B 2LO2_A 3MHH_C 3MHS_C.
Probab=65.08  E-value=3.4  Score=23.08  Aligned_cols=12  Identities=25%  Similarity=0.587  Sum_probs=10.4

Q ss_pred             CCchhHHHHHHH
Q psy1631          35 PGSFAYHMSRSI   46 (77)
Q Consensus        35 PGsFa~Hm~Rsv   46 (77)
                      ++-||.|+.||+
T Consensus        16 a~RfA~HLekCm   27 (33)
T PF08209_consen   16 ASRFAPHLEKCM   27 (33)
T ss_dssp             GGGHHHHHHHHT
T ss_pred             hhhhHHHHHHHH
Confidence            468999999996


No 4  
>PF11201 DUF2982:  Protein of unknown function (DUF2982);  InterPro: IPR021367  This family of proteins with unknown function appears to be restricted to Gammaproteobacteria. 
Probab=62.95  E-value=4.2  Score=29.39  Aligned_cols=20  Identities=35%  Similarity=0.468  Sum_probs=18.4

Q ss_pred             ccccccCCCC-chhHHHHHHH
Q psy1631          27 TPIADLDRPG-SFAYHMSRSI   46 (77)
Q Consensus        27 iPv~dIDRPG-sFa~Hm~Rsv   46 (77)
                      ||.+++||+. .|....+|+.
T Consensus       130 i~~~~LDR~~~eFv~Llr~~~  150 (152)
T PF11201_consen  130 IPASDLDRDPEEFVGLLRRYQ  150 (152)
T ss_pred             eEhhhccCCHHHHHHHHHHHH
Confidence            8999999999 9999999875


No 5  
>PF00042 Globin:  Globin plant globin signature erythrocruorin family signature alpha hemoglobin signature myoglobin signature thalassemia.;  InterPro: IPR000971 Globins are haem-containing proteins involved in binding and/or transporting oxygen. They belong to a very large and well studied family that is widely distributed in many organisms []. Globins have evolved from a common ancestor and can be divided into three groups: single-domain globins, and two types of chimeric globins, flavohaemoglobins and globin-coupled sensors. Bacteria have all three types of globins, while archaea lack flavohaemoglobins, and eukaryotes lack globin-coupled sensors []. Several functionally different haemoglobins can coexist in the same species. The major types of globins include:   Haemoglobin (Hb): trimer of two alpha and two beta chains, although embryonic and foetal forms can substitute the alpha or beta chain for ones with higher oxygen affinity, such as gamma, delta, epsilon or zeta chains. Hb transports oxygen from lungs to other tissues in vertebrates []. Hb proteins are also present in unicellular organisms where they act as enzymes or sensors []. Myoglobin (Mb): monomeric protein responsible for oxygen storage in vertebrate muscle [].  Neuroglobin: a myoglobin-like haemprotein expressed in vertebrate brain and retina, where it is involved in neuroprotection from damage due to hypoxia or ischemia []. Neuroglobin belongs to a branch of the globin family that diverged early in evolution.  Cytoglobin: an oxygen sensor expressed in multiple tissues. Related to neuroglobin []. Erythrocruorin: highly cooperative extracellular respiratory proteins found in annelids and arthropods that are assembled from as many as 180 subunit into hexagonal bilayers []. Leghaemoglobin (legHb or symbiotic Hb): occurs in the root nodules of leguminous plants, where it facilitates the diffusion of oxygen to symbiotic bacteriods in order to promote nitrogen fixation. Non-symbiotic haemoglobin (NsHb): occurs in non-leguminous plants, and can be over-expressed in stressed plants []. Flavohaemoglobins (FHb): chimeric, with an N-terminal globin domain and a C-terminal ferredoxin reductase-like NAD/FAD-binding domain. FHb provides protection against nitric oxide via its C-terminal domain, which transfers electrons to haem in the globin []. Globin-coupled sensors: chimeric, with an N-terminal myoglobin-like domain and a C-terminal domain that resembles the cytoplasmic signalling domain of bacterial chemoreceptors. They bind oxygen, and act to initiate an aerotactic response or regulate gene expression [, ].  Protoglobin: a single domain globin found in archaea that is related to the N-terminal domain of globin-coupled sensors []. Truncated 2/2 globin: lack the first helix, giving them a 2-over-2 instead of the canonical 3-over-3 alpha-helical sandwich fold. Can be divided into three main groups (I, II and II) based on structural features [].   This entry covers most of the globin family of proteins, but it omits some bacterial globins and the protoglobins. More information about these proteins can be found at Protein of the Month: Haemoglobin [].; GO: 0005506 iron ion binding, 0020037 heme binding; PDB: 2WTH_A 2WTG_A 3A59_G 3FS4_C 3CY5_C 3D1A_B 2RI4_J 3EU1_B 1JEB_A 2Z6N_B ....
Probab=61.32  E-value=17  Score=21.94  Aligned_cols=52  Identities=15%  Similarity=0.252  Sum_probs=42.0

Q ss_pred             ccccccc-cccccccCCCCchhHHHHHHHHHHHHHHHhcCcHhHHHHHHHHhc
Q psy1631          19 HFGLGLW-RTPIADLDRPGSFAYHMSRSIHLVVDLLSEIKDAKTLMDIAVALK   70 (77)
Q Consensus        19 NFFnGiW-riPv~dIDRPGsFa~Hm~Rsv~LLl~vL~e~~D~~tLl~l~~~L~   70 (77)
                      +.|...= ..|.+++.....|..|..|-+..+-++...++|-+.+......|.
T Consensus        36 ~~F~~~~~~~~~~~l~~~~~~~~h~~~v~~~l~~~v~~l~~~~~l~~~l~~lg   88 (110)
T PF00042_consen   36 KLFPKFKDIVPLEELKNNPEFKAHAQRVMEALDEAVDNLDDPESLEAMLRKLG   88 (110)
T ss_dssp             GGGTTGTTTSSHHHHTTSHHHHHHHHHHHHHHHHHHHTTTSTHGHHHHHHHHH
T ss_pred             hhcccccccchHHHHhccchHHHHHHHHHHHHHHHHHccCChHhHHHHHHHHH
Confidence            4555442 258888888899999999999999999999999988877766654


No 6  
>PF05962 HutD:  HutD;  InterPro: IPR010282 This entry contains proteins of unknown function, which include HutD from Pseudomonas fluorescens and Ves from Escherichia coli K12. HutD from P. fluorescens is a component of the histidine uptake and utilisation operon. HutD is operonic with the well characterised repressor protein HutC. Genetic analysis using transcriptional fusions (lacZ) and deletion mutants shows that hutD is necessary to maintain fitness in environments replete with histidine. HutD probably sets an upper bound on the level of hut operon transcription []. The mechanistic basis is unknown, but in silico molecular docking studies based on the crystal structure of HutD from Pseudomonas aeruginosa show that urocanate (the first breakdown product of histidine) docks with the active site of HutD.; PDB: 3ESG_A 1YLL_D.
Probab=56.60  E-value=2.9  Score=29.78  Aligned_cols=17  Identities=35%  Similarity=0.718  Sum_probs=13.2

Q ss_pred             cccccccccCCCCchhH
Q psy1631          24 LWRTPIADLDRPGSFAY   40 (77)
Q Consensus        24 iWriPv~dIDRPGsFa~   40 (77)
                      -|||-+++|+.+|+|+.
T Consensus        34 ~wRiS~A~V~~~g~FS~   50 (184)
T PF05962_consen   34 DWRISIATVEADGPFSD   50 (184)
T ss_dssp             SEEEEEEEE-SSEEE--
T ss_pred             eEEEEEEEEcCCCCCCC
Confidence            39999999999999975


No 7  
>smart00071 Galanin Galanin. Galanin [1,2,3] is a neuropeptide that controls various biological activities: it regulates the release growth hormone, inhibits the release of insulin and somatostatin, contracts smooth muscle of the gastrointestinal and genitourinary tract and may be involved in the control of adrenal secretion
Probab=54.14  E-value=11  Score=26.22  Aligned_cols=34  Identities=26%  Similarity=0.418  Sum_probs=25.6

Q ss_pred             cccccccccccCCCCchhHHHH--HHHHHHHHHHHh
Q psy1631          22 LGLWRTPIADLDRPGSFAYHMS--RSIHLVVDLLSE   55 (77)
Q Consensus        22 nGiWriPv~dIDRPGsFa~Hm~--Rsv~LLl~vL~e   55 (77)
                      .|=-.+|.+|.-|||+|..|.+  .-|.-++|.|.=
T Consensus        41 ~gKREl~pEde~r~g~~~r~l~e~~~vrTiiEFLsf   76 (103)
T smart00071       41 TGKRELEPEDEARPGSFDRPLSESNIVRTIIEFLTF   76 (103)
T ss_pred             cccccCChhhhcccCccCCcCchhhHHHHHHHHHHH
Confidence            4777899999999999977764  346666666643


No 8  
>cd04885 ACT_ThrD-I Tandem C-terminal ACT domains of threonine dehydratase I (ThrD-I; L-threonine hydrolyase). This CD includes each of two tandem C-terminal ACT domains of threonine dehydratase I (ThrD-I; L-threonine hydrolyase) which catalyzes the committed step in branched chain amino acid biosynthesis in plants and microorganisms, the pyridoxal 5'-phosphate (PLP)-dependent dehydration/deamination of L-threonine (or L-serine) to 2-ketobutyrate (or pyruvate). ThrD-I is a cooperative, feedback-regulated (isoleucine and valine) allosteric enzyme that forms a tetramer and contains four pyridoxal phosphate moieties. Members of this CD belong to the superfamily of ACT regulatory domains.
Probab=47.47  E-value=9  Score=22.40  Aligned_cols=8  Identities=38%  Similarity=0.924  Sum_probs=6.6

Q ss_pred             CCCCchhH
Q psy1631          33 DRPGSFAY   40 (77)
Q Consensus        33 DRPGsFa~   40 (77)
                      ||||+|+.
T Consensus         7 dkPG~l~~   14 (68)
T cd04885           7 ERPGALKK   14 (68)
T ss_pred             CCCCHHHH
Confidence            89999864


No 9  
>PF02087 Nitrophorin:  Nitrophorin;  InterPro: IPR002351 Nitrophorins are haemoproteins found in saliva of blood-feeding insects [, ]. Saliva of the blood-sucking bug Rhodnius prolixus (Triatomid bug) contains four homologous nitrophorins, designated NP1 to NP4 in order of their relative abundance in the glands []. As isolated, nitrophorins contain nitric oxide (NO) ligated to the ferric (FeIII) haem iron. Histamine, which is released by the host in response to tissue damage, is another nitrophorin ligand. Nitrophorins transport NO to the feeding site. Dilution, binding of histamine and increase in pH (from pH ~5 in salivary gland to pH ~7.4 in the host tissue) facilitate the release of NO into the tissue where it induces vasodilatation. The salivary nitrophorin from the hemipteran Cimex lectularius (Bed bug) has no sequence similarity to R. prolixus nitrophorins. It is suggested that the two classes of insect nitrophorins have arisen as a product of the convergent evolution []. 3-D structures of several nitrophorin complexes are known []. The nitrophorin structures reveal lipocalin-like eight-stranded beta-barrel, three alpha-helices and two disulphide bonds, with haem inserted into one end of the barrel. Members of the lipocalin family are known to bind a variety of small hydrophobic ligands, including biliverdin, in a similar fashion (see [] for review). The haem iron is ligated to His59. The position of His59 is restrained through water-mediated hydrogen bond to the carboxylate of Asp70. The His59-Fe bond is bent ~15 degrees out of the imidazole plane. Asp70 forms an unusual hydrogen bond with one of the haem propionates, suggesting the residue has an altered pKa. In NP1-histamine structure, the planes of His59 and histamine imidazole rings lie in an arrangement almost identical to that found in oxidised cytochrome b5. This entry represents the nitrophorin structural domain.; GO: 0051381 histamine binding, 0070026 nitric oxide binding; PDB: 1SXX_A 2OFM_X 1X8Q_A 3TGA_A 1SXU_A 1IKJ_A 1YWD_A 1X8N_A 3FLL_A 1X8O_A ....
Probab=40.66  E-value=7.3  Score=29.21  Aligned_cols=12  Identities=17%  Similarity=0.335  Sum_probs=9.7

Q ss_pred             cccccccccccc
Q psy1631          16 SVVHFGLGLWRT   27 (77)
Q Consensus        16 nktNFFnGiWri   27 (77)
                      +|.+||+|+|=+
T Consensus        12 dk~k~F~g~WYv   23 (178)
T PF02087_consen   12 DKDKYFSGTWYV   23 (178)
T ss_dssp             -HHHHTSSEEEE
T ss_pred             CHHHccCceEEE
Confidence            688999999964


No 10 
>cd01228 PH_BCR-related BCR (breakpoint cluster region)-related pleckstrin homology (PH) domain. BCR (breakpoint cluster region)-related pleckstrin homology (PH) domain.  The BCR-related protein has a RhoGEF(DH) domain followed by a PH domain, a C2 domain and a RhoGAP domain.  PH domains share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding.  Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinases, tyrosine kinases, regulators of G-proteins, endocytotic GTPAses, adaptors, a well as cytoskeletal associated molecules and in lipid associated enzymes.
Probab=40.08  E-value=9.1  Score=26.06  Aligned_cols=17  Identities=29%  Similarity=0.325  Sum_probs=14.0

Q ss_pred             ccccccccccccccccc
Q psy1631          16 SVVHFGLGLWRTPIADL   32 (77)
Q Consensus        16 nktNFFnGiWriPv~dI   32 (77)
                      +|++=++--|-||++|+
T Consensus        40 ~k~~kY~~~w~IPL~dl   56 (96)
T cd01228          40 GKHQQYDCKWYIPLADL   56 (96)
T ss_pred             ccccccceeEEEEhHHh
Confidence            45567788899999998


No 11 
>cd01040 globin Globins are heme proteins, which bind and transport oxygen. This family summarizes a diverse set of homologous protein domains, including: (1) tetrameric vertebrate hemoglobins, which are the major protein component of erythrocytes and transport oxygen in the bloodstream, (2) microorganismal flavohemoglobins, which are linked to C-terminal FAD-dependend reductase domains, (3) homodimeric bacterial hemoglobins, such as from Vitreoscilla, (4) plant leghemoglobins (symbiotic hemoglobins, involved in nitrogen metabolism in plant rhizomes), (5) plant non-symbiotic hexacoordinate globins and hexacoordinate globins from bacteria and animals, such as neuroglobin, (6) invertebrate hemoglobins, which may occur in tandem-repeat arrangements, and (7) monomeric myoglobins found in animal muscle tissue.
Probab=40.00  E-value=55  Score=20.14  Aligned_cols=51  Identities=14%  Similarity=0.153  Sum_probs=39.3

Q ss_pred             ccccccccccccccCCCCchhHHHHHHHHHHHHHHHhcCcHhHHHHHHHHhc
Q psy1631          19 HFGLGLWRTPIADLDRPGSFAYHMSRSIHLVVDLLSEIKDAKTLMDIAVALK   70 (77)
Q Consensus        19 NFFnGiWriPv~dIDRPGsFa~Hm~Rsv~LLl~vL~e~~D~~tLl~l~~~L~   70 (77)
                      ++|...=..+. ++-....|..|..+=+..|-.+...++|.+.+.....+|.
T Consensus        39 ~~F~~~~~~~~-~~~~~~~~~~~~~~~~~~l~~~v~~l~~~~~l~~~l~~lg   89 (140)
T cd01040          39 ALFSRFGGLSA-ALKGSPKFKAHGKRVLNALDEAIKNLDDLEALKALLAKLG   89 (140)
T ss_pred             HHhHHhCCchH-hHccCHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHH
Confidence            34544444455 7777789999999999999999999999988776665553


No 12 
>PRK11396 hypothetical protein; Provisional
Probab=39.85  E-value=9.7  Score=28.28  Aligned_cols=18  Identities=33%  Similarity=0.789  Sum_probs=16.5

Q ss_pred             cccccccccCCCCchhHH
Q psy1631          24 LWRTPIADLDRPGSFAYH   41 (77)
Q Consensus        24 iWriPv~dIDRPGsFa~H   41 (77)
                      -|||-+++|..+|-|+..
T Consensus        35 ~WRiSiA~I~~~GpFS~F   52 (191)
T PRK11396         35 YWRASIASIAANGEFSLF   52 (191)
T ss_pred             eEEEEEEEecCCCCCCCC
Confidence            699999999999999864


No 13 
>PF01267 F-actin_cap_A:  F-actin capping protein alpha subunit;  InterPro: IPR002189 This entry represents a component of the WASH complex. The WASH complex is present at the surface of endosomes and recruits and activates the Arp2/3 complex to induce actin polymerisation. The WASH complex plays a key role in the fission of tubules that serve as transport intermediates during endosome sorting []. The WASH complex's subunit structure: F-actin-capping protein subunit alpha (CAPZA1, CAPZA2 or CAPZA3), F-actin-capping protein subunit beta (CAPZB), WASH (WASH1, WASH2P, WASH3P, WASH4P, WASH5P or WASH6P), FAM21 (FAM21A, FAM21B or FAM21C), KIAA1033, KIAA0196 (strumpellin) and CCDC53. The actin filament system, a prominent part of the cytoskeleton in eukaryotic cells, is both a static structure and a dynamic network that can undergo rearrangements: it is thought to be involved in processes such as cell movement and phagocytosis [], as well as muscle contraction. The F-actin capping protein binds in a calcium-independent manner to the fast growing ends of actin filaments (barbed end) thereby blocking the exchange of subunits at these ends. Unlike gelsolin and severin this protein does not sever actin filaments. The F-actin capping protein is a heterodimer composed of two unrelated subunits: alpha and beta (see IPR001698 from INTERPRO). Neither of the subunits shows sequence similarity to other filament-capping proteins []. The alpha subunit is a protein of about 268 to 286 amino acid residues whose sequence is well conserved in eukaryotic species [].; GO: 0003779 actin binding, 0030036 actin cytoskeleton organization, 0008290 F-actin capping protein complex; PDB: 1MQ1_D 1MWN_X 2KBM_Y 2KZ7_A 3AAE_C 3AA6_A 3LK4_G 1IZN_A 3AA7_A 3AAA_A ....
Probab=38.30  E-value=11  Score=28.62  Aligned_cols=11  Identities=36%  Similarity=0.664  Sum_probs=9.5

Q ss_pred             ccccccccccc
Q psy1631          16 SVVHFGLGLWR   26 (77)
Q Consensus        16 nktNFFnGiWr   26 (77)
                      |-.||.||-||
T Consensus       154 n~~NfwnG~Wr  164 (271)
T PF01267_consen  154 NPKNFWNGRWR  164 (271)
T ss_dssp             EGGGTEEEEEE
T ss_pred             ccccccCceee
Confidence            56799999997


No 14 
>PHA03004 putative membrane protein; Provisional
Probab=35.73  E-value=28  Score=27.64  Aligned_cols=53  Identities=15%  Similarity=0.113  Sum_probs=42.0

Q ss_pred             ccccccccccccccccccccccccccCCCCchhHHHHHHHHHHHHHHHhcCcHhHHHHHHHHhccC
Q psy1631           7 TYHPQSCLTSVVHFGLGLWRTPIADLDRPGSFAYHMSRSIHLVVDLLSEIKDAKTLMDIAVALKDE   72 (77)
Q Consensus         7 ~~~~~fc~rnktNFFnGiWriPv~dIDRPGsFa~Hm~Rsv~LLl~vL~e~~D~~tLl~l~~~L~r~   72 (77)
                      .|-|+|.--...|+|.|=.-++...|++|+-=-             -....|.+||.-+|..|+-|
T Consensus       175 DYKPlFsrl~~~~~y~~~fp~~~~~i~~p~~~~-------------~~~~sDyETLaNlSaIly~T  227 (270)
T PHA03004        175 DYKPLFSRLKENNIYTGPFPIGQSRINHPNLPR-------------GRSPSDYETLANLSAILYLT  227 (270)
T ss_pred             CccchhcccCcCccccCCCCCCchhhcCCCCCC-------------CCCchhHHHHHhHHHHHhhc
Confidence            488999999999999999888889999887211             12456899999998888654


No 15 
>PF08014 DUF1704:  Domain of unknown function (DUF1704);  InterPro: IPR012548 This family contains many hypothetical proteins.
Probab=35.40  E-value=66  Score=25.48  Aligned_cols=55  Identities=18%  Similarity=0.264  Sum_probs=47.8

Q ss_pred             cccccccccccccCCCC--chhHHHHHHHHHHHHHHHhcCcHhHHHHHHHHhccCCCC
Q psy1631          20 FGLGLWRTPIADLDRPG--SFAYHMSRSIHLVVDLLSEIKDAKTLMDIAVALKDEPEA   75 (77)
Q Consensus        20 FFnGiWriPv~dIDRPG--sFa~Hm~Rsv~LLl~vL~e~~D~~tLl~l~~~L~r~Pd~   75 (77)
                      +-.-+..||++.|+-|.  ..-..+-++....+++|...+. ..-.+.|.+|.-.|+.
T Consensus        27 ~~~~l~~l~~~~~~d~~l~~l~~~~~~e~~~~i~mL~~~Gt-~~F~~~S~~lYG~p~~   83 (349)
T PF08014_consen   27 LKRELYSLPIEIIEDPPLGQLYRDKIDEYDLQIDMLEARGT-PDFYYNSLRLYGEPSD   83 (349)
T ss_pred             HHHHHHhhHHhhcCCchHHHHHHHHHHHHHHHHHHHHhcCc-hHHHHHHHHHhCCCCh
Confidence            34467789999999886  7888899999999999999988 8999999999999874


No 16 
>KOG2859|consensus
Probab=34.78  E-value=12  Score=29.94  Aligned_cols=33  Identities=24%  Similarity=0.252  Sum_probs=27.3

Q ss_pred             CCCC-chhHHHHHHHHHHHHHHHhcCcHhHHHHHHHHhc
Q psy1631          33 DRPG-SFAYHMSRSIHLVVDLLSEIKDAKTLMDIAVALK   70 (77)
Q Consensus        33 DRPG-sFa~Hm~Rsv~LLl~vL~e~~D~~tLl~l~~~L~   70 (77)
                      -|.| +|..|..|||.+|+     .+|+.-++++-+|+-
T Consensus       215 y~~~En~P~~~gRsvClll-----iky~~~~~k~~qq~v  248 (293)
T KOG2859|consen  215 YRFGENLPEAGGRSVCLLL-----IKYHVRMQKIQQQIV  248 (293)
T ss_pred             ccccccCccccccceeeee-----ccccHHHHHHHHHHH
Confidence            3566 79999999998776     899999999888763


No 17 
>KOG0759|consensus
Probab=34.21  E-value=11  Score=29.99  Aligned_cols=50  Identities=18%  Similarity=0.105  Sum_probs=31.7

Q ss_pred             cccccccccccccccccccccccccccCC---CCchhHHHHHHHHHHHHHHHhc
Q psy1631           6 DTYHPQSCLTSVVHFGLGLWRTPIADLDR---PGSFAYHMSRSIHLVVDLLSEI   56 (77)
Q Consensus         6 ~~~~~~fc~rnktNFFnGiWriPv~dIDR---PGsFa~Hm~Rsv~LLl~vL~e~   56 (77)
                      |.--|.-=-||=+|+|+|+-||.-+|==.   -|+ .--+.|++.+.+--|+.-
T Consensus       127 D~~LP~~~RRNYknv~dgL~rI~reEG~~~L~~G~-~~tv~Ra~lvt~~QlA~Y  179 (286)
T KOG0759|consen  127 DGRLPPEQRRNYKNVFDGLVRITREEGVTALFRGC-KPTVSRAMLVTASQLASY  179 (286)
T ss_pred             cCCCCHHHhhhhhhHhhHHhhhhhhhhHHHHHcCc-hHHHHHHHHHHHHHHHHH
Confidence            33446666799999999999998776110   122 224667777766555443


No 18 
>PF14144 DOG1:  Seed dormancy control
Probab=33.13  E-value=11  Score=24.10  Aligned_cols=16  Identities=25%  Similarity=0.646  Sum_probs=13.0

Q ss_pred             cccccccccccccccc
Q psy1631          15 TSVVHFGLGLWRTPIA   30 (77)
Q Consensus        15 rnktNFFnGiWriPv~   30 (77)
                      .+.-++|.|.|+.|.+
T Consensus        44 ~DV~~~~s~~W~sp~E   59 (80)
T PF14144_consen   44 ADVFHLLSPPWKSPLE   59 (80)
T ss_pred             cchHHHhCCCCCCHHH
Confidence            3566789999999976


No 19 
>KOG3623|consensus
Probab=32.29  E-value=15  Score=33.52  Aligned_cols=20  Identities=35%  Similarity=0.695  Sum_probs=15.7

Q ss_pred             cCCCCchhHHHH--HHHHHHHH
Q psy1631          32 LDRPGSFAYHMS--RSIHLVVD   51 (77)
Q Consensus        32 IDRPGsFa~Hm~--Rsv~LLl~   51 (77)
                      +.-+|||++||+  +||.|.+.
T Consensus       319 FSHSGSySSHmSSKKCIsli~v  340 (1007)
T KOG3623|consen  319 FSHSGSYSSHMSSKKCISLILV  340 (1007)
T ss_pred             cccCCcccccccccchhhhhcc
Confidence            457899999996  78877653


No 20 
>PF08137 DVL:  DVL family;  InterPro: IPR012552 This family consists of the DVL family of proteins. In a gain-of-function genetic screen for genes that influence fruit development in Arabidopsis, DEVIL (DVL) gene was identified. DVL is a small protein and over expression of the protein results in pleiotropic phenotypes featured by shortened stature, rounder rosette leaves, clustered inflorescences, shortened pedicles, and siliques with pronged tips. DVL family is a novel class of small polypeptides and the over expression phenotypes suggest that these polypeptides may have a role in plant development [].
Probab=31.95  E-value=29  Score=17.62  Aligned_cols=9  Identities=11%  Similarity=0.541  Sum_probs=7.3

Q ss_pred             HHHHHHHHH
Q psy1631          42 MSRSIHLVV   50 (77)
Q Consensus        42 m~Rsv~LLl   50 (77)
                      +.|||++|+
T Consensus        10 irrCv~mLl   18 (19)
T PF08137_consen   10 IRRCVVMLL   18 (19)
T ss_pred             HHHhHHHhc
Confidence            678998875


No 21 
>PF03722 Hemocyanin_N:  Hemocyanin, all-alpha domain;  InterPro: IPR005204 Haemocyanins are copper-containing oxygen transport proteins found in the haemolymph of many invertebrates. They are divided into 2 main groups, arthropodan and molluscan. These have structurally similar oxygen-binding centres, which are similar to the oxygen-binding centre of tyrosinases [], but their quaternary structures are arranged differently. The arthropodan proteins exist as hexamers comprising 3 heterogeneous subunits (a, b and c) and possess 1 oxygen-binding centre per subunit; and the molluscan proteins exist as cylindrical oligomers of 10 to 20 subunits and possess 7 or 8 oxygen-binding centres per subunit []. Although the proteins have similar amino acid compositions, the only real similarity in their primary sequences is in the region corresponding to the second copper-binding domain, which also shows similarity to the copper-binding domain of tyrosinases [].  Larval storage proteins (LSP) [] are proteins from the hemolymph of insects, which may serve as a store of amino acids for synthesis of adult proteins. There are two classes of LSP's, arylphorins, which are rich in aromatic amino acids, and methionine-rich LSP's. LSP's forms hexameric complexes. LSP's are structurally related to arthropods hemocyanins.; PDB: 1HCY_A 1HC1_C 3GWJ_F 3IXV_E 3IXW_L 3HHS_A 1NOL_A 1LLA_A 1OXY_A 1LL1_A.
Probab=31.80  E-value=1e+02  Score=20.82  Aligned_cols=39  Identities=28%  Similarity=0.267  Sum_probs=28.2

Q ss_pred             CCCCchh---HHHHHHHHHHHHHHHhcCcHhHHHHHHHHhcc
Q psy1631          33 DRPGSFA---YHMSRSIHLVVDLLSEIKDAKTLMDIAVALKD   71 (77)
Q Consensus        33 DRPGsFa---~Hm~Rsv~LLl~vL~e~~D~~tLl~l~~~L~r   71 (77)
                      .|-+-|+   ....+-...|.++|-.-+|++|+.+.|..++.
T Consensus        61 pr~~~FS~f~~~h~~ea~~L~~~l~~akd~dtf~~~a~~aR~  102 (126)
T PF03722_consen   61 PRGEIFSLFNPRHREEAIALFDLLYSAKDFDTFYKTAVWARD  102 (126)
T ss_dssp             -TTS---TTSHHHHHHHHHHHHHHHCSSSHHHHHHHHHHHHT
T ss_pred             CCCCeecCCCHHHHHHHHHHHHHHHcCCCHHHHHHHHHHHHH
Confidence            3444444   34556677899999999999999999988764


No 22 
>COG2999 GrxB Glutaredoxin 2 [Posttranslational modification, protein turnover, chaperones]
Probab=30.77  E-value=35  Score=26.32  Aligned_cols=35  Identities=14%  Similarity=0.265  Sum_probs=28.4

Q ss_pred             CCCchhHHHHHHHHHHHHHHHhcCcHhHHHHHHHH
Q psy1631          34 RPGSFAYHMSRSIHLVVDLLSEIKDAKTLMDIAVA   68 (77)
Q Consensus        34 RPGsFa~Hm~Rsv~LLl~vL~e~~D~~tLl~l~~~   68 (77)
                      -.|+|++|.+.+-.++..+=..++-.+-|+...+.
T Consensus       129 s~g~F~~~l~~t~~~~~~i~~dl~~l~~Li~~~s~  163 (215)
T COG2999         129 SEGSFESLLNHTAQYLKRIQADLRALDKLIVGPSA  163 (215)
T ss_pred             ccccHHHHHhchHHHHHHHHHHHHHHHHHhcCcch
Confidence            35999999999999999988877777777665543


No 23 
>TIGR02421 QEGLA conserved hypothetical protein. Members of this family include a possible metal-binding motif HEXXXH and, nearby, a perfectly conserved motif QEGLA. All members belong to the Proteobacteria, including Agrobacterium tumefaciens and several species of Vibrio and Pseudomonas, and are found in only one copy per chromosome (Vibrio vulnificus, with two chromosomes, has two). The function is unknown.
Probab=28.57  E-value=1.1e+02  Score=24.85  Aligned_cols=52  Identities=27%  Similarity=0.426  Sum_probs=45.4

Q ss_pred             ccccccccccCCC--CchhHHHHHHHHHHHHHHHhcCcHhHHHHHHHHhccCCCC
Q psy1631          23 GLWRTPIADLDRP--GSFAYHMSRSIHLVVDLLSEIKDAKTLMDIAVALKDEPEA   75 (77)
Q Consensus        23 GiWriPv~dIDRP--GsFa~Hm~Rsv~LLl~vL~e~~D~~tLl~l~~~L~r~Pd~   75 (77)
                      -+..||++.|+-|  |.+...+-++..+.+++|...+- +.-.+.|.+|.-.|+.
T Consensus        56 ~l~~l~~e~i~d~~l~~l~r~~~~e~~~~i~mL~~~Gt-~~F~~~S~~lYG~p~~  109 (366)
T TIGR02421        56 ELYSLPIDIIRDPPLGQLYREKQDEYDLVIDLLESIGT-ATFLYNSLRLYGAPSD  109 (366)
T ss_pred             HHHhhHHhhccChhHHHHHHHHHHHHHHHHHHHHhcCc-hHHHHHHHHHcCCCCc
Confidence            4567899988887  47888899999999999999887 8899999999999985


No 24 
>PF11798 IMS_HHH:  IMS family HHH motif;  InterPro: IPR024728 This helix-hairpin-helix motif is found in proteins belonging to the type-Y family of DNA polymerases []. This type of polymerases are thought to be involved in UV protection and mutation [, ]. ; PDB: 3PZP_B 2OH2_B 2W7O_B 3IN5_B 1T94_A 2W7P_B 2W8K_A 2AGQ_A 1RYR_A 3RAX_A ....
Probab=27.82  E-value=24  Score=18.82  Aligned_cols=13  Identities=38%  Similarity=0.700  Sum_probs=9.8

Q ss_pred             cccccccccCCCC
Q psy1631          24 LWRTPIADLDRPG   36 (77)
Q Consensus        24 iWriPv~dIDRPG   36 (77)
                      +|.+|+.|+=-=|
T Consensus         8 l~~lpi~~~~GIG   20 (32)
T PF11798_consen    8 LWPLPIRKFWGIG   20 (32)
T ss_dssp             HHCSBGGGSTTS-
T ss_pred             HhcCCHHhhCCcc
Confidence            7999999986555


No 25 
>KOG0409|consensus
Probab=27.14  E-value=83  Score=25.60  Aligned_cols=39  Identities=21%  Similarity=0.209  Sum_probs=33.8

Q ss_pred             CCCCchhHHHHHHHHHHHHHHHhcCcHhHHHHHHHHhcc
Q psy1631          33 DRPGSFAYHMSRSIHLVVDLLSEIKDAKTLMDIAVALKD   71 (77)
Q Consensus        33 DRPGsFa~Hm~Rsv~LLl~vL~e~~D~~tLl~l~~~L~r   71 (77)
                      =.||.+..||-|-+.|.+..-.+.+----+..+++|+.+
T Consensus       263 y~p~f~~~~m~KDLgla~~~a~~~~~~~P~~slA~qly~  301 (327)
T KOG0409|consen  263 YNPGFALKLMVKDLGLALNAAESVKVPMPLGSLAHQLYK  301 (327)
T ss_pred             CCCcchHHHHHHHHHHHHHhhhccCCCCchHHHHHHHHH
Confidence            579999999999999999998888877778888888765


No 26 
>KOG2655|consensus
Probab=25.83  E-value=21  Score=29.01  Aligned_cols=27  Identities=30%  Similarity=0.463  Sum_probs=20.9

Q ss_pred             cccccccccccccccccccccccccccccccC
Q psy1631           2 VTLLDTYHPQSCLTSVVHFGLGLWRTPIADLD   33 (77)
Q Consensus         2 ~~~~~~~~~~fc~rnktNFFnGiWriPv~dID   33 (77)
                      +|+.||  |.|++ +-.|  .+-|+.+++-||
T Consensus        81 Ltvidt--PGfGD-~vdn--s~~w~pi~~yi~  107 (366)
T KOG2655|consen   81 LTVIDT--PGFGD-AVDN--SNCWRPIVNYID  107 (366)
T ss_pred             eEEecc--CCCcc-cccc--cccchhhhHHHH
Confidence            356666  88998 8888  789999887664


No 27 
>KOG0836|consensus
Probab=25.23  E-value=28  Score=27.76  Aligned_cols=16  Identities=25%  Similarity=0.434  Sum_probs=12.0

Q ss_pred             ccccccccccc----ccccc
Q psy1631          16 SVVHFGLGLWR----TPIAD   31 (77)
Q Consensus        16 nktNFFnGiWr----iPv~d   31 (77)
                      |--||.||-||    +++.+
T Consensus       161 ~pkNfwNG~WRS~w~~~v~~  180 (282)
T KOG0836|consen  161 QPKNFWNGRWRSEWNYDVQP  180 (282)
T ss_pred             CcccccCCceeeeeeeccCc
Confidence            55799999887    56665


No 28 
>cd02682 MIT_AAA_Arch MIT: domain contained within Microtubule Interacting and Trafficking molecules. This sub-family of MIT domains is found in mostly archaebacterial AAA-ATPases. The molecular function of the MIT domain is unclear.
Probab=24.87  E-value=1.6e+02  Score=18.73  Aligned_cols=34  Identities=12%  Similarity=0.287  Sum_probs=28.0

Q ss_pred             ccCCCCchh---HHHHHHHHHHHHHHHhcCcHhHHHH
Q psy1631          31 DLDRPGSFA---YHMSRSIHLVVDLLSEIKDAKTLMD   64 (77)
Q Consensus        31 dIDRPGsFa---~Hm~Rsv~LLl~vL~e~~D~~tLl~   64 (77)
                      +.|.-|.|.   .+-..+|-+|+++++-..|..+..-
T Consensus        15 e~D~~gr~~eAi~~Y~~aIe~L~q~~~~~pD~~~k~~   51 (75)
T cd02682          15 KAEKEGNAEDAITNYKKAIEVLSQIVKNYPDSPTRLI   51 (75)
T ss_pred             HHHhcCCHHHHHHHHHHHHHHHHHHHHhCCChHHHHH
Confidence            356777777   7788999999999999999988544


No 29 
>PF02208 Sorb:  Sorbin homologous domain;  InterPro: IPR003127 Sorbin is an active peptide present in the digestive tract, where it has pro-absorptive and anti-secretory effects in different parts of the intestine, including the ability to decrease VIP (vasoactive intestinal peptide) and cholera toxin-induced secretion. It is expressed in some intestinal and pancreatic endocrine tumours in humans []. Sorbin-homology domains are found in adaptor proteins such as vinexin, CAP/ponsin and argBP2, which regulate various cellular functions, including cell adhesion, cytoskeletal organisation, and growth factor signalling []. In addition to the sorbin domain, these proteins contain three SH3 (src homology 3) domains. The sorbin homology domain mediates the interaction of vinexin and CAP with flotillin, which is crucial for the localisation of SH3-binding proteins to the lipid raft, a region of the plasma membrane rich in cholesterol and sphingolipids that acts to concentrate certain signalling molecules. The sorbin homology domain of adaptor proteins may mediate interactions with the lipid raft that are crucial to intracellular communication [].
Probab=24.78  E-value=67  Score=19.57  Aligned_cols=24  Identities=29%  Similarity=0.603  Sum_probs=20.5

Q ss_pred             cccccccCCCCchhHHHHHHHHHH
Q psy1631          26 RTPIADLDRPGSFAYHMSRSIHLV   49 (77)
Q Consensus        26 riPv~dIDRPGsFa~Hm~Rsv~LL   49 (77)
                      .||+...|||.-...-|.|.|.-+
T Consensus        16 giP~~~vd~~kDWYktMFkqIHk~   39 (47)
T PF02208_consen   16 GIPLSNVDRPKDWYKTMFKQIHKL   39 (47)
T ss_pred             CCccccccchhHHHHHHHHHHHhh
Confidence            478999999999999999988643


No 30 
>KOG2859|consensus
Probab=24.77  E-value=77  Score=25.44  Aligned_cols=29  Identities=34%  Similarity=0.481  Sum_probs=22.1

Q ss_pred             cCCCCchhHHH--HHHHHHHHHHHHhc-CcHh
Q psy1631          32 LDRPGSFAYHM--SRSIHLVVDLLSEI-KDAK   60 (77)
Q Consensus        32 IDRPGsFa~Hm--~Rsv~LLl~vL~e~-~D~~   60 (77)
                      +|||.+-.+||  +|+..++.+.|+++ +|+.
T Consensus       169 ~Dr~~~le~~n~~~~~lr~~~q~LeKL~~d~~  200 (293)
T KOG2859|consen  169 LDRPKILEKRNSMYRHLRLLQQRLEKLCKDAI  200 (293)
T ss_pred             ecccchHhhhhhHHHHHHHHHHHHHHHHhhhe
Confidence            69999999985  56677788888764 5554


No 31 
>KOG3197|consensus
Probab=24.21  E-value=1.3e+02  Score=23.27  Aligned_cols=22  Identities=32%  Similarity=0.368  Sum_probs=18.1

Q ss_pred             cccCCCCchhHHHHHHHHHHHH
Q psy1631          30 ADLDRPGSFAYHMSRSIHLVVD   51 (77)
Q Consensus        30 ~dIDRPGsFa~Hm~Rsv~LLl~   51 (77)
                      -+|++|-|-|-||||=-.+.|-
T Consensus        48 ~gv~~pESIadHMYRM~llaml   69 (210)
T KOG3197|consen   48 RGVNEPESIADHMYRMALLAML   69 (210)
T ss_pred             cCCCCcchHHHHHHHHHHHHHH
Confidence            4789999999999997666554


No 32 
>PF13925 Katanin_con80:  con80 domain of Katanin
Probab=23.75  E-value=99  Score=21.47  Aligned_cols=33  Identities=21%  Similarity=0.366  Sum_probs=28.2

Q ss_pred             HHHHHHHHHHHHHhcCcHhHHHHHHHHhc--cCCC
Q psy1631          42 MSRSIHLVVDLLSEIKDAKTLMDIAVALK--DEPE   74 (77)
Q Consensus        42 m~Rsv~LLl~vL~e~~D~~tLl~l~~~L~--r~Pd   74 (77)
                      .++++.-.++.+.+.+|...+.++...|.  .+|+
T Consensus        26 ~~~~~k~ai~~~~~~~D~svlvD~L~vl~~~~~~~   60 (164)
T PF13925_consen   26 RRNDIKGAIEYAVRMNDPSVLVDVLSVLNQSLKPE   60 (164)
T ss_pred             hcCcHHHHHHHHHhcCCchHHHHHHHHHHHhcCcC
Confidence            35678888999999999999999999998  6654


No 33 
>PF01832 Glucosaminidase:  Mannosyl-glycoprotein endo-beta-N-acetylglucosaminidase;  InterPro: IPR002901 This domain is found in many different proteins including mannosyl-glycoprotein endo-beta-N-acetylglucosamidase (3.2.1.96 from EC).It is also found in flagellar protein J (P75942 from SWISSPROT), which has been shown to hydrolyse peptidoglycan [].; GO: 0004040 amidase activity, 0009253 peptidoglycan catabolic process; PDB: 3FI7_A 2ZYC_A 3K3T_A.
Probab=23.47  E-value=34  Score=22.02  Aligned_cols=11  Identities=18%  Similarity=0.163  Sum_probs=5.9

Q ss_pred             cccccccccccc
Q psy1631          15 TSVVHFGLGLWR   26 (77)
Q Consensus        15 rnktNFFnGiWr   26 (77)
                      +++.||| ||+-
T Consensus        36 ~~~nN~f-Gi~~   46 (136)
T PF01832_consen   36 KEYNNFF-GIKA   46 (136)
T ss_dssp             SCTT-TT---BS
T ss_pred             ccccccc-ceec
Confidence            3677999 7777


No 34 
>PF05501 DUF755:  Domain of unknown function (DUF755) ;  InterPro: IPR008474 This family is predominated by ORFs from Anelloviridae. The function of this family remains to be determined.
Probab=23.39  E-value=82  Score=21.70  Aligned_cols=28  Identities=29%  Similarity=0.423  Sum_probs=24.2

Q ss_pred             HHHHHHHHHhcCcHhHHHHHHHHhccCC
Q psy1631          46 IHLVVDLLSEIKDAKTLMDIAVALKDEP   73 (77)
Q Consensus        46 v~LLl~vL~e~~D~~tLl~l~~~L~r~P   73 (77)
                      ..|.=++|+|..--..|+.++.||++.+
T Consensus        38 ~~L~~qLlKEC~kt~~L~~l~~qlq~~~   65 (123)
T PF05501_consen   38 DSLQKQLLKECQKTSQLMQLLQQLQKQP   65 (123)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHhhhc
Confidence            4566789999999999999999997765


No 35 
>PF15059 Speriolin_C:  Speriolin C-terminus
Probab=23.17  E-value=73  Score=23.26  Aligned_cols=31  Identities=19%  Similarity=0.104  Sum_probs=22.5

Q ss_pred             ccccccccccccccccccccccccccCCCCchh
Q psy1631           7 TYHPQSCLTSVVHFGLGLWRTPIADLDRPGSFA   39 (77)
Q Consensus         7 ~~~~~fc~rnktNFFnGiWriPv~dIDRPGsFa   39 (77)
                      ++||.|+| --.|-| ||-|---+--.++|+..
T Consensus        75 ~~HP~lsE-~lVN~y-GILr~rp~l~a~~~~~y  105 (146)
T PF15059_consen   75 RVHPGLSE-FLVNTY-GILRERPELAASEGGSY  105 (146)
T ss_pred             ccCchHHH-HHHHHc-cccccCcccccCcCcCC
Confidence            68999999 566777 88776556666666553


No 36 
>PF07693 KAP_NTPase:  KAP family P-loop domain;  InterPro: IPR011646 The KAP (after Kidins220/ARMS and PifA) family of predicted NTPases are sporadically distributed across a wide phylogenetic range in bacteria and in animals. Many of the prokaryotic KAP NTPases are encoded in plasmids and tend to undergo disruption to form pseudogenes. A unique feature of all eukaryotic and certain bacterial KAP NTPases is the presence of two or four transmembrane helices inserted into the P-loop NTPase domain. These transmembrane helices anchor KAP NTPases in the membrane such that the P-loop domain is located on the intracellular side [].
Probab=22.99  E-value=45  Score=23.91  Aligned_cols=26  Identities=31%  Similarity=0.428  Sum_probs=19.9

Q ss_pred             ccccccCC-CCchhHHHHHHHHHHHHH
Q psy1631          27 TPIADLDR-PGSFAYHMSRSIHLVVDL   52 (77)
Q Consensus        27 iPv~dIDR-PGsFa~Hm~Rsv~LLl~v   52 (77)
                      +-+||+|| +...+--+-+.|.+++++
T Consensus       176 iiIDdLDR~~~~~i~~~l~~ik~~~~~  202 (325)
T PF07693_consen  176 IIIDDLDRCSPEEIVELLEAIKLLLDF  202 (325)
T ss_pred             EEEcchhcCCcHHHHHHHHHHHHhcCC
Confidence            45799999 677777777777777664


No 37 
>PF13929 mRNA_stabil:  mRNA stabilisation
Probab=22.67  E-value=1.5e+02  Score=23.52  Aligned_cols=39  Identities=18%  Similarity=0.296  Sum_probs=27.4

Q ss_pred             CCchhHHHHHHHHHHHHHHHhcCcHhHHHHHHHHhccC--CCCC
Q psy1631          35 PGSFAYHMSRSIHLVVDLLSEIKDAKTLMDIAVALKDE--PEAE   76 (77)
Q Consensus        35 PGsFa~Hm~Rsv~LLl~vL~e~~D~~tLl~l~~~L~r~--Pd~~   76 (77)
                      +|.-..+|   |...+++|.+.+||..|++.=-+.-..  |.+|
T Consensus       197 ~~~l~~~v---i~~Il~~L~~~~dW~kl~~fW~~~~~~~~~~~D  237 (292)
T PF13929_consen  197 SKSLTRNV---IISILEILAESRDWNKLFQFWEQCIPNSVPGND  237 (292)
T ss_pred             ccCCChhH---HHHHHHHHHhcccHHHHHHHHHHhcccCCCCCC
Confidence            35555454   556689999999999999986665444  5444


No 38 
>PRK13291 metal-dependent hydrolase; Provisional
Probab=22.11  E-value=2e+02  Score=19.92  Aligned_cols=20  Identities=20%  Similarity=0.370  Sum_probs=16.1

Q ss_pred             cccccccCCCCchhHHHHHH
Q psy1631          26 RTPIADLDRPGSFAYHMSRS   45 (77)
Q Consensus        26 riPv~dIDRPGsFa~Hm~Rs   45 (77)
                      |.|+.++.=|+.|..+--..
T Consensus         5 ~~~~~~~~~~~~~~~~~~~~   24 (173)
T PRK13291          5 RYPIGQFQYPEEITEEQIQE   24 (173)
T ss_pred             CCCCCCCCCCCCCCHHHHHH
Confidence            67999999999998775543


No 39 
>PF06812 ImpA-rel_N:  ImpA-related N-terminal;  InterPro: IPR010657 This entry represents a conserved region located towards the N-terminal end of ImpA and related proteins. ImpA is an inner membrane protein, which has been suggested to be involved with proteins that are exported and associated with colony variations in Actinobacillus actinomycetemcomitans []. Note that many members are hypothetical proteins.
Probab=20.63  E-value=1.5e+02  Score=17.32  Aligned_cols=25  Identities=28%  Similarity=0.364  Sum_probs=19.3

Q ss_pred             HHHHHHH-HhcCcHhHHHHHHHHhcc
Q psy1631          47 HLVVDLL-SEIKDAKTLMDIAVALKD   71 (77)
Q Consensus        47 ~LLl~vL-~e~~D~~tLl~l~~~L~r   71 (77)
                      .+-.++| ++.||.+.+..++..+.+
T Consensus         7 ~~~~~lL~~~sKDlrv~~~l~~a~~~   32 (62)
T PF06812_consen    7 ELALELLSEQSKDLRVAVWLTEALLR   32 (62)
T ss_pred             HHHHHHHHHcCchHHHHHHHHHHHHH
Confidence            4567788 888999998888877654


No 40 
>KOG3904|consensus
Probab=20.51  E-value=27  Score=26.79  Aligned_cols=30  Identities=27%  Similarity=0.484  Sum_probs=21.1

Q ss_pred             cccccccccccccccccccccccCCCCchhHH
Q psy1631          10 PQSCLTSVVHFGLGLWRTPIADLDRPGSFAYH   41 (77)
Q Consensus        10 ~~fc~rnktNFFnGiWriPv~dIDRPGsFa~H   41 (77)
                      |+--++|--|  .-+|++|.=|+.|=|+|.+-
T Consensus       138 Pl~~ls~a~~--a~vW~~~~yelsr~~~f~~~  167 (209)
T KOG3904|consen  138 PLESLSPALL--AFVWLIPFYELSRVGVFISL  167 (209)
T ss_pred             cccccchhhh--cceeccchhhHhHhhhhhhH
Confidence            4434444333  35999999999999998763


No 41 
>COG3326 Predicted membrane protein [Function unknown]
Probab=20.27  E-value=39  Score=22.97  Aligned_cols=10  Identities=40%  Similarity=0.743  Sum_probs=7.6

Q ss_pred             cccccccccc
Q psy1631          20 FGLGLWRTPI   29 (77)
Q Consensus        20 FFnGiWriPv   29 (77)
                      --+|.||||=
T Consensus        30 A~~~~wRIpE   39 (94)
T COG3326          30 ARKGKWRIPE   39 (94)
T ss_pred             HHccCCCCcH
Confidence            3468999994


No 42 
>cd06336 PBP1_ABC_ligand_binding_like_3 Type I periplasmic ligand-binding domain of uncharacterized ABC (ATPase Binding Cassette)-type active transport systems that are predicted to be involved in transport of amino acids, peptides, or inorganic ions. This group includes the type I periplasmic ligand-binding domain of uncharacterized ABC (ATPase Binding Cassette)-type active transport systems that are predicted to be involved in transport of amino acids, peptides, or inorganic ions. Members of this group are sequence-similar to members of the family of ABC-type hydrophobic amino acid transporters (HAAT), such as leucine-isoleucine-valine-binding protein (LIVBP); however their ligand specificity has not been determined experimentally.
Probab=20.20  E-value=2.3e+02  Score=20.52  Aligned_cols=26  Identities=12%  Similarity=0.086  Sum_probs=22.9

Q ss_pred             CCchhHHHHHHHHHHHHHHHhcCcHh
Q psy1631          35 PGSFAYHMSRSIHLVVDLLSEIKDAK   60 (77)
Q Consensus        35 PGsFa~Hm~Rsv~LLl~vL~e~~D~~   60 (77)
                      |+.|+...+-++.++.+.+++-++..
T Consensus       278 p~~~~~~~y~~~~~~~~Al~~ag~~~  303 (347)
T cd06336         278 PNSEAAVSYDAVYILKAAMEAAGSVD  303 (347)
T ss_pred             CcHHHHHHHHHHHHHHHHHHhcCCCC
Confidence            88899999999999999999887444


No 43 
>PF11817 Foie-gras_1:  Foie gras liver health family 1;  InterPro: IPR021773  Mutating the gene foie gras in zebrafish has been shown to affect development; the mutants develop large, lipid-filled hepatocytes in the liver, resembling those in individuals with fatty liver disease []. Foie-gras protein is long and has several well-defined domains though none of them has a known function. We have annotated this one as the first []. THe C terminus of this region contains TPR repeats. 
Probab=20.14  E-value=2.3e+02  Score=20.51  Aligned_cols=33  Identities=6%  Similarity=0.214  Sum_probs=26.6

Q ss_pred             chhHHHHHHHHHHHHHHHhcCcHhHHHHHHHHh
Q psy1631          37 SFAYHMSRSIHLVVDLLSEIKDAKTLMDIAVAL   69 (77)
Q Consensus        37 sFa~Hm~Rsv~LLl~vL~e~~D~~tLl~l~~~L   69 (77)
                      +...=+.-.+..|++|..+++|.+..+.++..|
T Consensus       212 gW~~l~~~~l~~l~~Ca~~~~~~~~~l~~~leL  244 (247)
T PF11817_consen  212 GWWSLLTEVLWRLLECAKRLGDVEDYLTTSLEL  244 (247)
T ss_pred             CcHHHHHHHHHHHHHHHHHhCCHHHHHHHHHHH
Confidence            333445567788999999999999999999876


No 44 
>PF09571 RE_XcyI:  XcyI restriction endonuclease;  InterPro: IPR019071 There are four classes of restriction endonucleases: types I, II,III and IV. All types of enzymes recognise specific short DNA sequences and carry out the endonucleolytic cleavage of DNA to give specific double-stranded fragments with terminal 5'-phosphates. They differ in their recognition sequence, subunit composition, cleavage position, and cofactor requirements [, ], as summarised below:   Type I enzymes (3.1.21.3 from EC) cleave at sites remote from recognition site; require both ATP and S-adenosyl-L-methionine to function; multifunctional protein with both restriction and methylase (2.1.1.72 from EC) activities. Type II enzymes (3.1.21.4 from EC) cleave within or at short specific distances from recognition site; most require magnesium; single function (restriction) enzymes independent of methylase. Type III enzymes (3.1.21.5 from EC) cleave at sites a short distance from recognition site; require ATP (but doesn't hydrolyse it); S-adenosyl-L-methionine stimulates reaction but is not required; exists as part of a complex with a modification methylase methylase (2.1.1.72 from EC). Type IV enzymes target methylated DNA.   Type II restriction endonucleases (3.1.21.4 from EC) are components of prokaryotic DNA restriction-modification mechanisms that protect the organism against invading foreign DNA. These site-specific deoxyribonucleases catalyse the endonucleolytic cleavage of DNA to give specific double-stranded fragments with terminal 5'-phosphates. Of the 3000 restriction endonucleases that have been characterised, most are homodimeric or tetrameric enzymes that cleave target DNA at sequence-specific sites close to the recognition site. For homodimeric enzymes, the recognition site is usually a palindromic sequence 4-8 bp in length. Most enzymes require magnesium ions as a cofactor for catalysis. Although they can vary in their mode of recognition, many restriction endonucleases share a similar structural core comprising four beta-strands and one alpha-helix, as well as a similar mechanism of cleavage, suggesting a common ancestral origin []. However, there is still considerable diversity amongst restriction endonucleases [, ]. The target site recognition process triggers large conformational changes of the enzyme and the target DNA, leading to the activation of the catalytic centres. Like other DNA binding proteins, restriction enzymes are capable of non-specific DNA binding as well, which is the prerequisite for efficient target site location by facilitated diffusion. Non-specific binding usually does not involve interactions with the bases but only with the DNA backbone [].   This entry represents the restriction endonuclease XcyI, which recognises and cleaves the double-stranded sequence C^CCGGG. ; GO: 0000287 magnesium ion binding, 0003677 DNA binding, 0009036 Type II site-specific deoxyribonuclease activity, 0009307 DNA restriction-modification system
Probab=20.03  E-value=58  Score=26.45  Aligned_cols=40  Identities=18%  Similarity=0.316  Sum_probs=30.8

Q ss_pred             cCCC--CchhHHHHHHHHHHHHHHH------hcCcHhHHHHHHHHhcc
Q psy1631          32 LDRP--GSFAYHMSRSIHLVVDLLS------EIKDAKTLMDIAVALKD   71 (77)
Q Consensus        32 IDRP--GsFa~Hm~Rsv~LLl~vL~------e~~D~~tLl~l~~~L~r   71 (77)
                      .++|  --.|.|+.++...|++...      ++=|-=|||.+..|||-
T Consensus       115 ~~~~~i~dLc~~~~~~~s~Ll~~~~~~~Is~~llddltlLtlG~Ql~G  162 (318)
T PF09571_consen  115 AQEPDIQDLCVAFNASASKLLEGIEIQRISRELLDDLTLLTLGPQLRG  162 (318)
T ss_pred             cccccHHHHHHHHHHHHHHHHhccccccccHHHHHHHHhhccChhhcc
Confidence            4555  3678999999999998876      44455578899999873


Done!