Query psy1631
Match_columns 77
No_of_seqs 27 out of 29
Neff 3.0
Searched_HMMs 46136
Date Fri Aug 16 18:02:33 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy1631.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/1631hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF13998 MgrB: MgrB protein 80.0 0.67 1.4E-05 25.8 0.3 15 12-27 9-23 (29)
2 PRK10299 PhoPQ regulatory prot 66.3 2.3 5E-05 25.9 0.3 16 11-27 26-41 (47)
3 PF08209 Sgf11: Sgf11 (transcr 65.1 3.4 7.3E-05 23.1 0.8 12 35-46 16-27 (33)
4 PF11201 DUF2982: Protein of u 63.0 4.2 9.1E-05 29.4 1.2 20 27-46 130-150 (152)
5 PF00042 Globin: Globin plant 61.3 17 0.00038 21.9 3.6 52 19-70 36-88 (110)
6 PF05962 HutD: HutD; InterPro 56.6 2.9 6.4E-05 29.8 -0.5 17 24-40 34-50 (184)
7 smart00071 Galanin Galanin. Ga 54.1 11 0.00023 26.2 2.0 34 22-55 41-76 (103)
8 cd04885 ACT_ThrD-I Tandem C-te 47.5 9 0.0002 22.4 0.7 8 33-40 7-14 (68)
9 PF02087 Nitrophorin: Nitropho 40.7 7.3 0.00016 29.2 -0.5 12 16-27 12-23 (178)
10 cd01228 PH_BCR-related BCR (br 40.1 9.1 0.0002 26.1 -0.1 17 16-32 40-56 (96)
11 cd01040 globin Globins are hem 40.0 55 0.0012 20.1 3.5 51 19-70 39-89 (140)
12 PRK11396 hypothetical protein; 39.8 9.7 0.00021 28.3 0.0 18 24-41 35-52 (191)
13 PF01267 F-actin_cap_A: F-acti 38.3 11 0.00025 28.6 0.2 11 16-26 154-164 (271)
14 PHA03004 putative membrane pro 35.7 28 0.0006 27.6 1.9 53 7-72 175-227 (270)
15 PF08014 DUF1704: Domain of un 35.4 66 0.0014 25.5 4.0 55 20-75 27-83 (349)
16 KOG2859|consensus 34.8 12 0.00025 29.9 -0.2 33 33-70 215-248 (293)
17 KOG0759|consensus 34.2 11 0.00024 30.0 -0.5 50 6-56 127-179 (286)
18 PF14144 DOG1: Seed dormancy c 33.1 11 0.00024 24.1 -0.5 16 15-30 44-59 (80)
19 KOG3623|consensus 32.3 15 0.00032 33.5 -0.1 20 32-51 319-340 (1007)
20 PF08137 DVL: DVL family; Int 31.9 29 0.00063 17.6 1.0 9 42-50 10-18 (19)
21 PF03722 Hemocyanin_N: Hemocya 31.8 1E+02 0.0023 20.8 4.1 39 33-71 61-102 (126)
22 COG2999 GrxB Glutaredoxin 2 [P 30.8 35 0.00076 26.3 1.7 35 34-68 129-163 (215)
23 TIGR02421 QEGLA conserved hypo 28.6 1.1E+02 0.0023 24.9 4.2 52 23-75 56-109 (366)
24 PF11798 IMS_HHH: IMS family H 27.8 24 0.00052 18.8 0.3 13 24-36 8-20 (32)
25 KOG0409|consensus 27.1 83 0.0018 25.6 3.4 39 33-71 263-301 (327)
26 KOG2655|consensus 25.8 21 0.00045 29.0 -0.2 27 2-33 81-107 (366)
27 KOG0836|consensus 25.2 28 0.00061 27.8 0.4 16 16-31 161-180 (282)
28 cd02682 MIT_AAA_Arch MIT: doma 24.9 1.6E+02 0.0036 18.7 3.8 34 31-64 15-51 (75)
29 PF02208 Sorb: Sorbin homologo 24.8 67 0.0015 19.6 1.9 24 26-49 16-39 (47)
30 KOG2859|consensus 24.8 77 0.0017 25.4 2.7 29 32-60 169-200 (293)
31 KOG3197|consensus 24.2 1.3E+02 0.0027 23.3 3.7 22 30-51 48-69 (210)
32 PF13925 Katanin_con80: con80 23.7 99 0.0022 21.5 2.9 33 42-74 26-60 (164)
33 PF01832 Glucosaminidase: Mann 23.5 34 0.00074 22.0 0.5 11 15-26 36-46 (136)
34 PF05501 DUF755: Domain of unk 23.4 82 0.0018 21.7 2.4 28 46-73 38-65 (123)
35 PF15059 Speriolin_C: Sperioli 23.2 73 0.0016 23.3 2.2 31 7-39 75-105 (146)
36 PF07693 KAP_NTPase: KAP famil 23.0 45 0.00097 23.9 1.1 26 27-52 176-202 (325)
37 PF13929 mRNA_stabil: mRNA sta 22.7 1.5E+02 0.0033 23.5 4.0 39 35-76 197-237 (292)
38 PRK13291 metal-dependent hydro 22.1 2E+02 0.0044 19.9 4.2 20 26-45 5-24 (173)
39 PF06812 ImpA-rel_N: ImpA-rela 20.6 1.5E+02 0.0033 17.3 2.9 25 47-71 7-32 (62)
40 KOG3904|consensus 20.5 27 0.00058 26.8 -0.5 30 10-41 138-167 (209)
41 COG3326 Predicted membrane pro 20.3 39 0.00084 23.0 0.3 10 20-29 30-39 (94)
42 cd06336 PBP1_ABC_ligand_bindin 20.2 2.3E+02 0.005 20.5 4.3 26 35-60 278-303 (347)
43 PF11817 Foie-gras_1: Foie gra 20.1 2.3E+02 0.005 20.5 4.3 33 37-69 212-244 (247)
44 PF09571 RE_XcyI: XcyI restric 20.0 58 0.0013 26.5 1.2 40 32-71 115-162 (318)
No 1
>PF13998 MgrB: MgrB protein
Probab=80.03 E-value=0.67 Score=25.81 Aligned_cols=15 Identities=20% Similarity=0.281 Sum_probs=13.1
Q ss_pred cccccccccccccccc
Q psy1631 12 SCLTSVVHFGLGLWRT 27 (77)
Q Consensus 12 fc~rnktNFFnGiWri 27 (77)
+|+.- .+||+||++|
T Consensus 9 ~CDQg-~~F~~GiC~i 23 (29)
T PF13998_consen 9 YCDQG-EQFFSGICSI 23 (29)
T ss_pred HhcCC-CCceeeeeEE
Confidence 58866 8999999987
No 2
>PRK10299 PhoPQ regulatory protein; Provisional
Probab=66.29 E-value=2.3 Score=25.92 Aligned_cols=16 Identities=25% Similarity=0.357 Sum_probs=13.7
Q ss_pred ccccccccccccccccc
Q psy1631 11 QSCLTSVVHFGLGLWRT 27 (77)
Q Consensus 11 ~fc~rnktNFFnGiWri 27 (77)
.+|++- ..||.||++|
T Consensus 26 ~~CDQg-~~F~~GIC~i 41 (47)
T PRK10299 26 MMCDQD-VQFFSGICAI 41 (47)
T ss_pred HHhcCC-ccceeeeeee
Confidence 369977 8999999987
No 3
>PF08209 Sgf11: Sgf11 (transcriptional regulation protein); InterPro: IPR013246 The Sgf11 family is a SAGA complex subunit in Saccharomyces cerevisiae (Baker's yeast). The SAGA complex is a multisubunit protein complex involved in transcriptional regulation. SAGA combines proteins involved in interactions with DNA-bound activators and TATA-binding protein (TBP), as well as enzymes for histone acetylation and deubiquitylation [].; PDB: 3M99_B 2LO2_A 3MHH_C 3MHS_C.
Probab=65.08 E-value=3.4 Score=23.08 Aligned_cols=12 Identities=25% Similarity=0.587 Sum_probs=10.4
Q ss_pred CCchhHHHHHHH
Q psy1631 35 PGSFAYHMSRSI 46 (77)
Q Consensus 35 PGsFa~Hm~Rsv 46 (77)
++-||.|+.||+
T Consensus 16 a~RfA~HLekCm 27 (33)
T PF08209_consen 16 ASRFAPHLEKCM 27 (33)
T ss_dssp GGGHHHHHHHHT
T ss_pred hhhhHHHHHHHH
Confidence 468999999996
No 4
>PF11201 DUF2982: Protein of unknown function (DUF2982); InterPro: IPR021367 This family of proteins with unknown function appears to be restricted to Gammaproteobacteria.
Probab=62.95 E-value=4.2 Score=29.39 Aligned_cols=20 Identities=35% Similarity=0.468 Sum_probs=18.4
Q ss_pred ccccccCCCC-chhHHHHHHH
Q psy1631 27 TPIADLDRPG-SFAYHMSRSI 46 (77)
Q Consensus 27 iPv~dIDRPG-sFa~Hm~Rsv 46 (77)
||.+++||+. .|....+|+.
T Consensus 130 i~~~~LDR~~~eFv~Llr~~~ 150 (152)
T PF11201_consen 130 IPASDLDRDPEEFVGLLRRYQ 150 (152)
T ss_pred eEhhhccCCHHHHHHHHHHHH
Confidence 8999999999 9999999875
No 5
>PF00042 Globin: Globin plant globin signature erythrocruorin family signature alpha hemoglobin signature myoglobin signature thalassemia.; InterPro: IPR000971 Globins are haem-containing proteins involved in binding and/or transporting oxygen. They belong to a very large and well studied family that is widely distributed in many organisms []. Globins have evolved from a common ancestor and can be divided into three groups: single-domain globins, and two types of chimeric globins, flavohaemoglobins and globin-coupled sensors. Bacteria have all three types of globins, while archaea lack flavohaemoglobins, and eukaryotes lack globin-coupled sensors []. Several functionally different haemoglobins can coexist in the same species. The major types of globins include: Haemoglobin (Hb): trimer of two alpha and two beta chains, although embryonic and foetal forms can substitute the alpha or beta chain for ones with higher oxygen affinity, such as gamma, delta, epsilon or zeta chains. Hb transports oxygen from lungs to other tissues in vertebrates []. Hb proteins are also present in unicellular organisms where they act as enzymes or sensors []. Myoglobin (Mb): monomeric protein responsible for oxygen storage in vertebrate muscle []. Neuroglobin: a myoglobin-like haemprotein expressed in vertebrate brain and retina, where it is involved in neuroprotection from damage due to hypoxia or ischemia []. Neuroglobin belongs to a branch of the globin family that diverged early in evolution. Cytoglobin: an oxygen sensor expressed in multiple tissues. Related to neuroglobin []. Erythrocruorin: highly cooperative extracellular respiratory proteins found in annelids and arthropods that are assembled from as many as 180 subunit into hexagonal bilayers []. Leghaemoglobin (legHb or symbiotic Hb): occurs in the root nodules of leguminous plants, where it facilitates the diffusion of oxygen to symbiotic bacteriods in order to promote nitrogen fixation. Non-symbiotic haemoglobin (NsHb): occurs in non-leguminous plants, and can be over-expressed in stressed plants []. Flavohaemoglobins (FHb): chimeric, with an N-terminal globin domain and a C-terminal ferredoxin reductase-like NAD/FAD-binding domain. FHb provides protection against nitric oxide via its C-terminal domain, which transfers electrons to haem in the globin []. Globin-coupled sensors: chimeric, with an N-terminal myoglobin-like domain and a C-terminal domain that resembles the cytoplasmic signalling domain of bacterial chemoreceptors. They bind oxygen, and act to initiate an aerotactic response or regulate gene expression [, ]. Protoglobin: a single domain globin found in archaea that is related to the N-terminal domain of globin-coupled sensors []. Truncated 2/2 globin: lack the first helix, giving them a 2-over-2 instead of the canonical 3-over-3 alpha-helical sandwich fold. Can be divided into three main groups (I, II and II) based on structural features []. This entry covers most of the globin family of proteins, but it omits some bacterial globins and the protoglobins. More information about these proteins can be found at Protein of the Month: Haemoglobin [].; GO: 0005506 iron ion binding, 0020037 heme binding; PDB: 2WTH_A 2WTG_A 3A59_G 3FS4_C 3CY5_C 3D1A_B 2RI4_J 3EU1_B 1JEB_A 2Z6N_B ....
Probab=61.32 E-value=17 Score=21.94 Aligned_cols=52 Identities=15% Similarity=0.252 Sum_probs=42.0
Q ss_pred ccccccc-cccccccCCCCchhHHHHHHHHHHHHHHHhcCcHhHHHHHHHHhc
Q psy1631 19 HFGLGLW-RTPIADLDRPGSFAYHMSRSIHLVVDLLSEIKDAKTLMDIAVALK 70 (77)
Q Consensus 19 NFFnGiW-riPv~dIDRPGsFa~Hm~Rsv~LLl~vL~e~~D~~tLl~l~~~L~ 70 (77)
+.|...= ..|.+++.....|..|..|-+..+-++...++|-+.+......|.
T Consensus 36 ~~F~~~~~~~~~~~l~~~~~~~~h~~~v~~~l~~~v~~l~~~~~l~~~l~~lg 88 (110)
T PF00042_consen 36 KLFPKFKDIVPLEELKNNPEFKAHAQRVMEALDEAVDNLDDPESLEAMLRKLG 88 (110)
T ss_dssp GGGTTGTTTSSHHHHTTSHHHHHHHHHHHHHHHHHHHTTTSTHGHHHHHHHHH
T ss_pred hhcccccccchHHHHhccchHHHHHHHHHHHHHHHHHccCChHhHHHHHHHHH
Confidence 4555442 258888888899999999999999999999999988877766654
No 6
>PF05962 HutD: HutD; InterPro: IPR010282 This entry contains proteins of unknown function, which include HutD from Pseudomonas fluorescens and Ves from Escherichia coli K12. HutD from P. fluorescens is a component of the histidine uptake and utilisation operon. HutD is operonic with the well characterised repressor protein HutC. Genetic analysis using transcriptional fusions (lacZ) and deletion mutants shows that hutD is necessary to maintain fitness in environments replete with histidine. HutD probably sets an upper bound on the level of hut operon transcription []. The mechanistic basis is unknown, but in silico molecular docking studies based on the crystal structure of HutD from Pseudomonas aeruginosa show that urocanate (the first breakdown product of histidine) docks with the active site of HutD.; PDB: 3ESG_A 1YLL_D.
Probab=56.60 E-value=2.9 Score=29.78 Aligned_cols=17 Identities=35% Similarity=0.718 Sum_probs=13.2
Q ss_pred cccccccccCCCCchhH
Q psy1631 24 LWRTPIADLDRPGSFAY 40 (77)
Q Consensus 24 iWriPv~dIDRPGsFa~ 40 (77)
-|||-+++|+.+|+|+.
T Consensus 34 ~wRiS~A~V~~~g~FS~ 50 (184)
T PF05962_consen 34 DWRISIATVEADGPFSD 50 (184)
T ss_dssp SEEEEEEEE-SSEEE--
T ss_pred eEEEEEEEEcCCCCCCC
Confidence 39999999999999975
No 7
>smart00071 Galanin Galanin. Galanin [1,2,3] is a neuropeptide that controls various biological activities: it regulates the release growth hormone, inhibits the release of insulin and somatostatin, contracts smooth muscle of the gastrointestinal and genitourinary tract and may be involved in the control of adrenal secretion
Probab=54.14 E-value=11 Score=26.22 Aligned_cols=34 Identities=26% Similarity=0.418 Sum_probs=25.6
Q ss_pred cccccccccccCCCCchhHHHH--HHHHHHHHHHHh
Q psy1631 22 LGLWRTPIADLDRPGSFAYHMS--RSIHLVVDLLSE 55 (77)
Q Consensus 22 nGiWriPv~dIDRPGsFa~Hm~--Rsv~LLl~vL~e 55 (77)
.|=-.+|.+|.-|||+|..|.+ .-|.-++|.|.=
T Consensus 41 ~gKREl~pEde~r~g~~~r~l~e~~~vrTiiEFLsf 76 (103)
T smart00071 41 TGKRELEPEDEARPGSFDRPLSESNIVRTIIEFLTF 76 (103)
T ss_pred cccccCChhhhcccCccCCcCchhhHHHHHHHHHHH
Confidence 4777899999999999977764 346666666643
No 8
>cd04885 ACT_ThrD-I Tandem C-terminal ACT domains of threonine dehydratase I (ThrD-I; L-threonine hydrolyase). This CD includes each of two tandem C-terminal ACT domains of threonine dehydratase I (ThrD-I; L-threonine hydrolyase) which catalyzes the committed step in branched chain amino acid biosynthesis in plants and microorganisms, the pyridoxal 5'-phosphate (PLP)-dependent dehydration/deamination of L-threonine (or L-serine) to 2-ketobutyrate (or pyruvate). ThrD-I is a cooperative, feedback-regulated (isoleucine and valine) allosteric enzyme that forms a tetramer and contains four pyridoxal phosphate moieties. Members of this CD belong to the superfamily of ACT regulatory domains.
Probab=47.47 E-value=9 Score=22.40 Aligned_cols=8 Identities=38% Similarity=0.924 Sum_probs=6.6
Q ss_pred CCCCchhH
Q psy1631 33 DRPGSFAY 40 (77)
Q Consensus 33 DRPGsFa~ 40 (77)
||||+|+.
T Consensus 7 dkPG~l~~ 14 (68)
T cd04885 7 ERPGALKK 14 (68)
T ss_pred CCCCHHHH
Confidence 89999864
No 9
>PF02087 Nitrophorin: Nitrophorin; InterPro: IPR002351 Nitrophorins are haemoproteins found in saliva of blood-feeding insects [, ]. Saliva of the blood-sucking bug Rhodnius prolixus (Triatomid bug) contains four homologous nitrophorins, designated NP1 to NP4 in order of their relative abundance in the glands []. As isolated, nitrophorins contain nitric oxide (NO) ligated to the ferric (FeIII) haem iron. Histamine, which is released by the host in response to tissue damage, is another nitrophorin ligand. Nitrophorins transport NO to the feeding site. Dilution, binding of histamine and increase in pH (from pH ~5 in salivary gland to pH ~7.4 in the host tissue) facilitate the release of NO into the tissue where it induces vasodilatation. The salivary nitrophorin from the hemipteran Cimex lectularius (Bed bug) has no sequence similarity to R. prolixus nitrophorins. It is suggested that the two classes of insect nitrophorins have arisen as a product of the convergent evolution []. 3-D structures of several nitrophorin complexes are known []. The nitrophorin structures reveal lipocalin-like eight-stranded beta-barrel, three alpha-helices and two disulphide bonds, with haem inserted into one end of the barrel. Members of the lipocalin family are known to bind a variety of small hydrophobic ligands, including biliverdin, in a similar fashion (see [] for review). The haem iron is ligated to His59. The position of His59 is restrained through water-mediated hydrogen bond to the carboxylate of Asp70. The His59-Fe bond is bent ~15 degrees out of the imidazole plane. Asp70 forms an unusual hydrogen bond with one of the haem propionates, suggesting the residue has an altered pKa. In NP1-histamine structure, the planes of His59 and histamine imidazole rings lie in an arrangement almost identical to that found in oxidised cytochrome b5. This entry represents the nitrophorin structural domain.; GO: 0051381 histamine binding, 0070026 nitric oxide binding; PDB: 1SXX_A 2OFM_X 1X8Q_A 3TGA_A 1SXU_A 1IKJ_A 1YWD_A 1X8N_A 3FLL_A 1X8O_A ....
Probab=40.66 E-value=7.3 Score=29.21 Aligned_cols=12 Identities=17% Similarity=0.335 Sum_probs=9.7
Q ss_pred cccccccccccc
Q psy1631 16 SVVHFGLGLWRT 27 (77)
Q Consensus 16 nktNFFnGiWri 27 (77)
+|.+||+|+|=+
T Consensus 12 dk~k~F~g~WYv 23 (178)
T PF02087_consen 12 DKDKYFSGTWYV 23 (178)
T ss_dssp -HHHHTSSEEEE
T ss_pred CHHHccCceEEE
Confidence 688999999964
No 10
>cd01228 PH_BCR-related BCR (breakpoint cluster region)-related pleckstrin homology (PH) domain. BCR (breakpoint cluster region)-related pleckstrin homology (PH) domain. The BCR-related protein has a RhoGEF(DH) domain followed by a PH domain, a C2 domain and a RhoGAP domain. PH domains share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinases, tyrosine kinases, regulators of G-proteins, endocytotic GTPAses, adaptors, a well as cytoskeletal associated molecules and in lipid associated enzymes.
Probab=40.08 E-value=9.1 Score=26.06 Aligned_cols=17 Identities=29% Similarity=0.325 Sum_probs=14.0
Q ss_pred ccccccccccccccccc
Q psy1631 16 SVVHFGLGLWRTPIADL 32 (77)
Q Consensus 16 nktNFFnGiWriPv~dI 32 (77)
+|++=++--|-||++|+
T Consensus 40 ~k~~kY~~~w~IPL~dl 56 (96)
T cd01228 40 GKHQQYDCKWYIPLADL 56 (96)
T ss_pred ccccccceeEEEEhHHh
Confidence 45567788899999998
No 11
>cd01040 globin Globins are heme proteins, which bind and transport oxygen. This family summarizes a diverse set of homologous protein domains, including: (1) tetrameric vertebrate hemoglobins, which are the major protein component of erythrocytes and transport oxygen in the bloodstream, (2) microorganismal flavohemoglobins, which are linked to C-terminal FAD-dependend reductase domains, (3) homodimeric bacterial hemoglobins, such as from Vitreoscilla, (4) plant leghemoglobins (symbiotic hemoglobins, involved in nitrogen metabolism in plant rhizomes), (5) plant non-symbiotic hexacoordinate globins and hexacoordinate globins from bacteria and animals, such as neuroglobin, (6) invertebrate hemoglobins, which may occur in tandem-repeat arrangements, and (7) monomeric myoglobins found in animal muscle tissue.
Probab=40.00 E-value=55 Score=20.14 Aligned_cols=51 Identities=14% Similarity=0.153 Sum_probs=39.3
Q ss_pred ccccccccccccccCCCCchhHHHHHHHHHHHHHHHhcCcHhHHHHHHHHhc
Q psy1631 19 HFGLGLWRTPIADLDRPGSFAYHMSRSIHLVVDLLSEIKDAKTLMDIAVALK 70 (77)
Q Consensus 19 NFFnGiWriPv~dIDRPGsFa~Hm~Rsv~LLl~vL~e~~D~~tLl~l~~~L~ 70 (77)
++|...=..+. ++-....|..|..+=+..|-.+...++|.+.+.....+|.
T Consensus 39 ~~F~~~~~~~~-~~~~~~~~~~~~~~~~~~l~~~v~~l~~~~~l~~~l~~lg 89 (140)
T cd01040 39 ALFSRFGGLSA-ALKGSPKFKAHGKRVLNALDEAIKNLDDLEALKALLAKLG 89 (140)
T ss_pred HHhHHhCCchH-hHccCHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHH
Confidence 34544444455 7777789999999999999999999999988776665553
No 12
>PRK11396 hypothetical protein; Provisional
Probab=39.85 E-value=9.7 Score=28.28 Aligned_cols=18 Identities=33% Similarity=0.789 Sum_probs=16.5
Q ss_pred cccccccccCCCCchhHH
Q psy1631 24 LWRTPIADLDRPGSFAYH 41 (77)
Q Consensus 24 iWriPv~dIDRPGsFa~H 41 (77)
-|||-+++|..+|-|+..
T Consensus 35 ~WRiSiA~I~~~GpFS~F 52 (191)
T PRK11396 35 YWRASIASIAANGEFSLF 52 (191)
T ss_pred eEEEEEEEecCCCCCCCC
Confidence 699999999999999864
No 13
>PF01267 F-actin_cap_A: F-actin capping protein alpha subunit; InterPro: IPR002189 This entry represents a component of the WASH complex. The WASH complex is present at the surface of endosomes and recruits and activates the Arp2/3 complex to induce actin polymerisation. The WASH complex plays a key role in the fission of tubules that serve as transport intermediates during endosome sorting []. The WASH complex's subunit structure: F-actin-capping protein subunit alpha (CAPZA1, CAPZA2 or CAPZA3), F-actin-capping protein subunit beta (CAPZB), WASH (WASH1, WASH2P, WASH3P, WASH4P, WASH5P or WASH6P), FAM21 (FAM21A, FAM21B or FAM21C), KIAA1033, KIAA0196 (strumpellin) and CCDC53. The actin filament system, a prominent part of the cytoskeleton in eukaryotic cells, is both a static structure and a dynamic network that can undergo rearrangements: it is thought to be involved in processes such as cell movement and phagocytosis [], as well as muscle contraction. The F-actin capping protein binds in a calcium-independent manner to the fast growing ends of actin filaments (barbed end) thereby blocking the exchange of subunits at these ends. Unlike gelsolin and severin this protein does not sever actin filaments. The F-actin capping protein is a heterodimer composed of two unrelated subunits: alpha and beta (see IPR001698 from INTERPRO). Neither of the subunits shows sequence similarity to other filament-capping proteins []. The alpha subunit is a protein of about 268 to 286 amino acid residues whose sequence is well conserved in eukaryotic species [].; GO: 0003779 actin binding, 0030036 actin cytoskeleton organization, 0008290 F-actin capping protein complex; PDB: 1MQ1_D 1MWN_X 2KBM_Y 2KZ7_A 3AAE_C 3AA6_A 3LK4_G 1IZN_A 3AA7_A 3AAA_A ....
Probab=38.30 E-value=11 Score=28.62 Aligned_cols=11 Identities=36% Similarity=0.664 Sum_probs=9.5
Q ss_pred ccccccccccc
Q psy1631 16 SVVHFGLGLWR 26 (77)
Q Consensus 16 nktNFFnGiWr 26 (77)
|-.||.||-||
T Consensus 154 n~~NfwnG~Wr 164 (271)
T PF01267_consen 154 NPKNFWNGRWR 164 (271)
T ss_dssp EGGGTEEEEEE
T ss_pred ccccccCceee
Confidence 56799999997
No 14
>PHA03004 putative membrane protein; Provisional
Probab=35.73 E-value=28 Score=27.64 Aligned_cols=53 Identities=15% Similarity=0.113 Sum_probs=42.0
Q ss_pred ccccccccccccccccccccccccccCCCCchhHHHHHHHHHHHHHHHhcCcHhHHHHHHHHhccC
Q psy1631 7 TYHPQSCLTSVVHFGLGLWRTPIADLDRPGSFAYHMSRSIHLVVDLLSEIKDAKTLMDIAVALKDE 72 (77)
Q Consensus 7 ~~~~~fc~rnktNFFnGiWriPv~dIDRPGsFa~Hm~Rsv~LLl~vL~e~~D~~tLl~l~~~L~r~ 72 (77)
.|-|+|.--...|+|.|=.-++...|++|+-=- -....|.+||.-+|..|+-|
T Consensus 175 DYKPlFsrl~~~~~y~~~fp~~~~~i~~p~~~~-------------~~~~sDyETLaNlSaIly~T 227 (270)
T PHA03004 175 DYKPLFSRLKENNIYTGPFPIGQSRINHPNLPR-------------GRSPSDYETLANLSAILYLT 227 (270)
T ss_pred CccchhcccCcCccccCCCCCCchhhcCCCCCC-------------CCCchhHHHHHhHHHHHhhc
Confidence 488999999999999999888889999887211 12456899999998888654
No 15
>PF08014 DUF1704: Domain of unknown function (DUF1704); InterPro: IPR012548 This family contains many hypothetical proteins.
Probab=35.40 E-value=66 Score=25.48 Aligned_cols=55 Identities=18% Similarity=0.264 Sum_probs=47.8
Q ss_pred cccccccccccccCCCC--chhHHHHHHHHHHHHHHHhcCcHhHHHHHHHHhccCCCC
Q psy1631 20 FGLGLWRTPIADLDRPG--SFAYHMSRSIHLVVDLLSEIKDAKTLMDIAVALKDEPEA 75 (77)
Q Consensus 20 FFnGiWriPv~dIDRPG--sFa~Hm~Rsv~LLl~vL~e~~D~~tLl~l~~~L~r~Pd~ 75 (77)
+-.-+..||++.|+-|. ..-..+-++....+++|...+. ..-.+.|.+|.-.|+.
T Consensus 27 ~~~~l~~l~~~~~~d~~l~~l~~~~~~e~~~~i~mL~~~Gt-~~F~~~S~~lYG~p~~ 83 (349)
T PF08014_consen 27 LKRELYSLPIEIIEDPPLGQLYRDKIDEYDLQIDMLEARGT-PDFYYNSLRLYGEPSD 83 (349)
T ss_pred HHHHHHhhHHhhcCCchHHHHHHHHHHHHHHHHHHHHhcCc-hHHHHHHHHHhCCCCh
Confidence 34467789999999886 7888899999999999999988 8999999999999874
No 16
>KOG2859|consensus
Probab=34.78 E-value=12 Score=29.94 Aligned_cols=33 Identities=24% Similarity=0.252 Sum_probs=27.3
Q ss_pred CCCC-chhHHHHHHHHHHHHHHHhcCcHhHHHHHHHHhc
Q psy1631 33 DRPG-SFAYHMSRSIHLVVDLLSEIKDAKTLMDIAVALK 70 (77)
Q Consensus 33 DRPG-sFa~Hm~Rsv~LLl~vL~e~~D~~tLl~l~~~L~ 70 (77)
-|.| +|..|..|||.+|+ .+|+.-++++-+|+-
T Consensus 215 y~~~En~P~~~gRsvClll-----iky~~~~~k~~qq~v 248 (293)
T KOG2859|consen 215 YRFGENLPEAGGRSVCLLL-----IKYHVRMQKIQQQIV 248 (293)
T ss_pred ccccccCccccccceeeee-----ccccHHHHHHHHHHH
Confidence 3566 79999999998776 899999999888763
No 17
>KOG0759|consensus
Probab=34.21 E-value=11 Score=29.99 Aligned_cols=50 Identities=18% Similarity=0.105 Sum_probs=31.7
Q ss_pred cccccccccccccccccccccccccccCC---CCchhHHHHHHHHHHHHHHHhc
Q psy1631 6 DTYHPQSCLTSVVHFGLGLWRTPIADLDR---PGSFAYHMSRSIHLVVDLLSEI 56 (77)
Q Consensus 6 ~~~~~~fc~rnktNFFnGiWriPv~dIDR---PGsFa~Hm~Rsv~LLl~vL~e~ 56 (77)
|.--|.-=-||=+|+|+|+-||.-+|==. -|+ .--+.|++.+.+--|+.-
T Consensus 127 D~~LP~~~RRNYknv~dgL~rI~reEG~~~L~~G~-~~tv~Ra~lvt~~QlA~Y 179 (286)
T KOG0759|consen 127 DGRLPPEQRRNYKNVFDGLVRITREEGVTALFRGC-KPTVSRAMLVTASQLASY 179 (286)
T ss_pred cCCCCHHHhhhhhhHhhHHhhhhhhhhHHHHHcCc-hHHHHHHHHHHHHHHHHH
Confidence 33446666799999999999998776110 122 224667777766555443
No 18
>PF14144 DOG1: Seed dormancy control
Probab=33.13 E-value=11 Score=24.10 Aligned_cols=16 Identities=25% Similarity=0.646 Sum_probs=13.0
Q ss_pred cccccccccccccccc
Q psy1631 15 TSVVHFGLGLWRTPIA 30 (77)
Q Consensus 15 rnktNFFnGiWriPv~ 30 (77)
.+.-++|.|.|+.|.+
T Consensus 44 ~DV~~~~s~~W~sp~E 59 (80)
T PF14144_consen 44 ADVFHLLSPPWKSPLE 59 (80)
T ss_pred cchHHHhCCCCCCHHH
Confidence 3566789999999976
No 19
>KOG3623|consensus
Probab=32.29 E-value=15 Score=33.52 Aligned_cols=20 Identities=35% Similarity=0.695 Sum_probs=15.7
Q ss_pred cCCCCchhHHHH--HHHHHHHH
Q psy1631 32 LDRPGSFAYHMS--RSIHLVVD 51 (77)
Q Consensus 32 IDRPGsFa~Hm~--Rsv~LLl~ 51 (77)
+.-+|||++||+ +||.|.+.
T Consensus 319 FSHSGSySSHmSSKKCIsli~v 340 (1007)
T KOG3623|consen 319 FSHSGSYSSHMSSKKCISLILV 340 (1007)
T ss_pred cccCCcccccccccchhhhhcc
Confidence 457899999996 78877653
No 20
>PF08137 DVL: DVL family; InterPro: IPR012552 This family consists of the DVL family of proteins. In a gain-of-function genetic screen for genes that influence fruit development in Arabidopsis, DEVIL (DVL) gene was identified. DVL is a small protein and over expression of the protein results in pleiotropic phenotypes featured by shortened stature, rounder rosette leaves, clustered inflorescences, shortened pedicles, and siliques with pronged tips. DVL family is a novel class of small polypeptides and the over expression phenotypes suggest that these polypeptides may have a role in plant development [].
Probab=31.95 E-value=29 Score=17.62 Aligned_cols=9 Identities=11% Similarity=0.541 Sum_probs=7.3
Q ss_pred HHHHHHHHH
Q psy1631 42 MSRSIHLVV 50 (77)
Q Consensus 42 m~Rsv~LLl 50 (77)
+.|||++|+
T Consensus 10 irrCv~mLl 18 (19)
T PF08137_consen 10 IRRCVVMLL 18 (19)
T ss_pred HHHhHHHhc
Confidence 678998875
No 21
>PF03722 Hemocyanin_N: Hemocyanin, all-alpha domain; InterPro: IPR005204 Haemocyanins are copper-containing oxygen transport proteins found in the haemolymph of many invertebrates. They are divided into 2 main groups, arthropodan and molluscan. These have structurally similar oxygen-binding centres, which are similar to the oxygen-binding centre of tyrosinases [], but their quaternary structures are arranged differently. The arthropodan proteins exist as hexamers comprising 3 heterogeneous subunits (a, b and c) and possess 1 oxygen-binding centre per subunit; and the molluscan proteins exist as cylindrical oligomers of 10 to 20 subunits and possess 7 or 8 oxygen-binding centres per subunit []. Although the proteins have similar amino acid compositions, the only real similarity in their primary sequences is in the region corresponding to the second copper-binding domain, which also shows similarity to the copper-binding domain of tyrosinases []. Larval storage proteins (LSP) [] are proteins from the hemolymph of insects, which may serve as a store of amino acids for synthesis of adult proteins. There are two classes of LSP's, arylphorins, which are rich in aromatic amino acids, and methionine-rich LSP's. LSP's forms hexameric complexes. LSP's are structurally related to arthropods hemocyanins.; PDB: 1HCY_A 1HC1_C 3GWJ_F 3IXV_E 3IXW_L 3HHS_A 1NOL_A 1LLA_A 1OXY_A 1LL1_A.
Probab=31.80 E-value=1e+02 Score=20.82 Aligned_cols=39 Identities=28% Similarity=0.267 Sum_probs=28.2
Q ss_pred CCCCchh---HHHHHHHHHHHHHHHhcCcHhHHHHHHHHhcc
Q psy1631 33 DRPGSFA---YHMSRSIHLVVDLLSEIKDAKTLMDIAVALKD 71 (77)
Q Consensus 33 DRPGsFa---~Hm~Rsv~LLl~vL~e~~D~~tLl~l~~~L~r 71 (77)
.|-+-|+ ....+-...|.++|-.-+|++|+.+.|..++.
T Consensus 61 pr~~~FS~f~~~h~~ea~~L~~~l~~akd~dtf~~~a~~aR~ 102 (126)
T PF03722_consen 61 PRGEIFSLFNPRHREEAIALFDLLYSAKDFDTFYKTAVWARD 102 (126)
T ss_dssp -TTS---TTSHHHHHHHHHHHHHHHCSSSHHHHHHHHHHHHT
T ss_pred CCCCeecCCCHHHHHHHHHHHHHHHcCCCHHHHHHHHHHHHH
Confidence 3444444 34556677899999999999999999988764
No 22
>COG2999 GrxB Glutaredoxin 2 [Posttranslational modification, protein turnover, chaperones]
Probab=30.77 E-value=35 Score=26.32 Aligned_cols=35 Identities=14% Similarity=0.265 Sum_probs=28.4
Q ss_pred CCCchhHHHHHHHHHHHHHHHhcCcHhHHHHHHHH
Q psy1631 34 RPGSFAYHMSRSIHLVVDLLSEIKDAKTLMDIAVA 68 (77)
Q Consensus 34 RPGsFa~Hm~Rsv~LLl~vL~e~~D~~tLl~l~~~ 68 (77)
-.|+|++|.+.+-.++..+=..++-.+-|+...+.
T Consensus 129 s~g~F~~~l~~t~~~~~~i~~dl~~l~~Li~~~s~ 163 (215)
T COG2999 129 SEGSFESLLNHTAQYLKRIQADLRALDKLIVGPSA 163 (215)
T ss_pred ccccHHHHHhchHHHHHHHHHHHHHHHHHhcCcch
Confidence 35999999999999999988877777777665543
No 23
>TIGR02421 QEGLA conserved hypothetical protein. Members of this family include a possible metal-binding motif HEXXXH and, nearby, a perfectly conserved motif QEGLA. All members belong to the Proteobacteria, including Agrobacterium tumefaciens and several species of Vibrio and Pseudomonas, and are found in only one copy per chromosome (Vibrio vulnificus, with two chromosomes, has two). The function is unknown.
Probab=28.57 E-value=1.1e+02 Score=24.85 Aligned_cols=52 Identities=27% Similarity=0.426 Sum_probs=45.4
Q ss_pred ccccccccccCCC--CchhHHHHHHHHHHHHHHHhcCcHhHHHHHHHHhccCCCC
Q psy1631 23 GLWRTPIADLDRP--GSFAYHMSRSIHLVVDLLSEIKDAKTLMDIAVALKDEPEA 75 (77)
Q Consensus 23 GiWriPv~dIDRP--GsFa~Hm~Rsv~LLl~vL~e~~D~~tLl~l~~~L~r~Pd~ 75 (77)
-+..||++.|+-| |.+...+-++..+.+++|...+- +.-.+.|.+|.-.|+.
T Consensus 56 ~l~~l~~e~i~d~~l~~l~r~~~~e~~~~i~mL~~~Gt-~~F~~~S~~lYG~p~~ 109 (366)
T TIGR02421 56 ELYSLPIDIIRDPPLGQLYREKQDEYDLVIDLLESIGT-ATFLYNSLRLYGAPSD 109 (366)
T ss_pred HHHhhHHhhccChhHHHHHHHHHHHHHHHHHHHHhcCc-hHHHHHHHHHcCCCCc
Confidence 4567899988887 47888899999999999999887 8899999999999985
No 24
>PF11798 IMS_HHH: IMS family HHH motif; InterPro: IPR024728 This helix-hairpin-helix motif is found in proteins belonging to the type-Y family of DNA polymerases []. This type of polymerases are thought to be involved in UV protection and mutation [, ]. ; PDB: 3PZP_B 2OH2_B 2W7O_B 3IN5_B 1T94_A 2W7P_B 2W8K_A 2AGQ_A 1RYR_A 3RAX_A ....
Probab=27.82 E-value=24 Score=18.82 Aligned_cols=13 Identities=38% Similarity=0.700 Sum_probs=9.8
Q ss_pred cccccccccCCCC
Q psy1631 24 LWRTPIADLDRPG 36 (77)
Q Consensus 24 iWriPv~dIDRPG 36 (77)
+|.+|+.|+=-=|
T Consensus 8 l~~lpi~~~~GIG 20 (32)
T PF11798_consen 8 LWPLPIRKFWGIG 20 (32)
T ss_dssp HHCSBGGGSTTS-
T ss_pred HhcCCHHhhCCcc
Confidence 7999999986555
No 25
>KOG0409|consensus
Probab=27.14 E-value=83 Score=25.60 Aligned_cols=39 Identities=21% Similarity=0.209 Sum_probs=33.8
Q ss_pred CCCCchhHHHHHHHHHHHHHHHhcCcHhHHHHHHHHhcc
Q psy1631 33 DRPGSFAYHMSRSIHLVVDLLSEIKDAKTLMDIAVALKD 71 (77)
Q Consensus 33 DRPGsFa~Hm~Rsv~LLl~vL~e~~D~~tLl~l~~~L~r 71 (77)
=.||.+..||-|-+.|.+..-.+.+----+..+++|+.+
T Consensus 263 y~p~f~~~~m~KDLgla~~~a~~~~~~~P~~slA~qly~ 301 (327)
T KOG0409|consen 263 YNPGFALKLMVKDLGLALNAAESVKVPMPLGSLAHQLYK 301 (327)
T ss_pred CCCcchHHHHHHHHHHHHHhhhccCCCCchHHHHHHHHH
Confidence 579999999999999999998888877778888888765
No 26
>KOG2655|consensus
Probab=25.83 E-value=21 Score=29.01 Aligned_cols=27 Identities=30% Similarity=0.463 Sum_probs=20.9
Q ss_pred cccccccccccccccccccccccccccccccC
Q psy1631 2 VTLLDTYHPQSCLTSVVHFGLGLWRTPIADLD 33 (77)
Q Consensus 2 ~~~~~~~~~~fc~rnktNFFnGiWriPv~dID 33 (77)
+|+.|| |.|++ +-.| .+-|+.+++-||
T Consensus 81 Ltvidt--PGfGD-~vdn--s~~w~pi~~yi~ 107 (366)
T KOG2655|consen 81 LTVIDT--PGFGD-AVDN--SNCWRPIVNYID 107 (366)
T ss_pred eEEecc--CCCcc-cccc--cccchhhhHHHH
Confidence 356666 88998 8888 789999887664
No 27
>KOG0836|consensus
Probab=25.23 E-value=28 Score=27.76 Aligned_cols=16 Identities=25% Similarity=0.434 Sum_probs=12.0
Q ss_pred ccccccccccc----ccccc
Q psy1631 16 SVVHFGLGLWR----TPIAD 31 (77)
Q Consensus 16 nktNFFnGiWr----iPv~d 31 (77)
|--||.||-|| +++.+
T Consensus 161 ~pkNfwNG~WRS~w~~~v~~ 180 (282)
T KOG0836|consen 161 QPKNFWNGRWRSEWNYDVQP 180 (282)
T ss_pred CcccccCCceeeeeeeccCc
Confidence 55799999887 56665
No 28
>cd02682 MIT_AAA_Arch MIT: domain contained within Microtubule Interacting and Trafficking molecules. This sub-family of MIT domains is found in mostly archaebacterial AAA-ATPases. The molecular function of the MIT domain is unclear.
Probab=24.87 E-value=1.6e+02 Score=18.73 Aligned_cols=34 Identities=12% Similarity=0.287 Sum_probs=28.0
Q ss_pred ccCCCCchh---HHHHHHHHHHHHHHHhcCcHhHHHH
Q psy1631 31 DLDRPGSFA---YHMSRSIHLVVDLLSEIKDAKTLMD 64 (77)
Q Consensus 31 dIDRPGsFa---~Hm~Rsv~LLl~vL~e~~D~~tLl~ 64 (77)
+.|.-|.|. .+-..+|-+|+++++-..|..+..-
T Consensus 15 e~D~~gr~~eAi~~Y~~aIe~L~q~~~~~pD~~~k~~ 51 (75)
T cd02682 15 KAEKEGNAEDAITNYKKAIEVLSQIVKNYPDSPTRLI 51 (75)
T ss_pred HHHhcCCHHHHHHHHHHHHHHHHHHHHhCCChHHHHH
Confidence 356777777 7788999999999999999988544
No 29
>PF02208 Sorb: Sorbin homologous domain; InterPro: IPR003127 Sorbin is an active peptide present in the digestive tract, where it has pro-absorptive and anti-secretory effects in different parts of the intestine, including the ability to decrease VIP (vasoactive intestinal peptide) and cholera toxin-induced secretion. It is expressed in some intestinal and pancreatic endocrine tumours in humans []. Sorbin-homology domains are found in adaptor proteins such as vinexin, CAP/ponsin and argBP2, which regulate various cellular functions, including cell adhesion, cytoskeletal organisation, and growth factor signalling []. In addition to the sorbin domain, these proteins contain three SH3 (src homology 3) domains. The sorbin homology domain mediates the interaction of vinexin and CAP with flotillin, which is crucial for the localisation of SH3-binding proteins to the lipid raft, a region of the plasma membrane rich in cholesterol and sphingolipids that acts to concentrate certain signalling molecules. The sorbin homology domain of adaptor proteins may mediate interactions with the lipid raft that are crucial to intracellular communication [].
Probab=24.78 E-value=67 Score=19.57 Aligned_cols=24 Identities=29% Similarity=0.603 Sum_probs=20.5
Q ss_pred cccccccCCCCchhHHHHHHHHHH
Q psy1631 26 RTPIADLDRPGSFAYHMSRSIHLV 49 (77)
Q Consensus 26 riPv~dIDRPGsFa~Hm~Rsv~LL 49 (77)
.||+...|||.-...-|.|.|.-+
T Consensus 16 giP~~~vd~~kDWYktMFkqIHk~ 39 (47)
T PF02208_consen 16 GIPLSNVDRPKDWYKTMFKQIHKL 39 (47)
T ss_pred CCccccccchhHHHHHHHHHHHhh
Confidence 478999999999999999988643
No 30
>KOG2859|consensus
Probab=24.77 E-value=77 Score=25.44 Aligned_cols=29 Identities=34% Similarity=0.481 Sum_probs=22.1
Q ss_pred cCCCCchhHHH--HHHHHHHHHHHHhc-CcHh
Q psy1631 32 LDRPGSFAYHM--SRSIHLVVDLLSEI-KDAK 60 (77)
Q Consensus 32 IDRPGsFa~Hm--~Rsv~LLl~vL~e~-~D~~ 60 (77)
+|||.+-.+|| +|+..++.+.|+++ +|+.
T Consensus 169 ~Dr~~~le~~n~~~~~lr~~~q~LeKL~~d~~ 200 (293)
T KOG2859|consen 169 LDRPKILEKRNSMYRHLRLLQQRLEKLCKDAI 200 (293)
T ss_pred ecccchHhhhhhHHHHHHHHHHHHHHHHhhhe
Confidence 69999999985 56677788888764 5554
No 31
>KOG3197|consensus
Probab=24.21 E-value=1.3e+02 Score=23.27 Aligned_cols=22 Identities=32% Similarity=0.368 Sum_probs=18.1
Q ss_pred cccCCCCchhHHHHHHHHHHHH
Q psy1631 30 ADLDRPGSFAYHMSRSIHLVVD 51 (77)
Q Consensus 30 ~dIDRPGsFa~Hm~Rsv~LLl~ 51 (77)
-+|++|-|-|-||||=-.+.|-
T Consensus 48 ~gv~~pESIadHMYRM~llaml 69 (210)
T KOG3197|consen 48 RGVNEPESIADHMYRMALLAML 69 (210)
T ss_pred cCCCCcchHHHHHHHHHHHHHH
Confidence 4789999999999997666554
No 32
>PF13925 Katanin_con80: con80 domain of Katanin
Probab=23.75 E-value=99 Score=21.47 Aligned_cols=33 Identities=21% Similarity=0.366 Sum_probs=28.2
Q ss_pred HHHHHHHHHHHHHhcCcHhHHHHHHHHhc--cCCC
Q psy1631 42 MSRSIHLVVDLLSEIKDAKTLMDIAVALK--DEPE 74 (77)
Q Consensus 42 m~Rsv~LLl~vL~e~~D~~tLl~l~~~L~--r~Pd 74 (77)
.++++.-.++.+.+.+|...+.++...|. .+|+
T Consensus 26 ~~~~~k~ai~~~~~~~D~svlvD~L~vl~~~~~~~ 60 (164)
T PF13925_consen 26 RRNDIKGAIEYAVRMNDPSVLVDVLSVLNQSLKPE 60 (164)
T ss_pred hcCcHHHHHHHHHhcCCchHHHHHHHHHHHhcCcC
Confidence 35678888999999999999999999998 6654
No 33
>PF01832 Glucosaminidase: Mannosyl-glycoprotein endo-beta-N-acetylglucosaminidase; InterPro: IPR002901 This domain is found in many different proteins including mannosyl-glycoprotein endo-beta-N-acetylglucosamidase (3.2.1.96 from EC).It is also found in flagellar protein J (P75942 from SWISSPROT), which has been shown to hydrolyse peptidoglycan [].; GO: 0004040 amidase activity, 0009253 peptidoglycan catabolic process; PDB: 3FI7_A 2ZYC_A 3K3T_A.
Probab=23.47 E-value=34 Score=22.02 Aligned_cols=11 Identities=18% Similarity=0.163 Sum_probs=5.9
Q ss_pred cccccccccccc
Q psy1631 15 TSVVHFGLGLWR 26 (77)
Q Consensus 15 rnktNFFnGiWr 26 (77)
+++.||| ||+-
T Consensus 36 ~~~nN~f-Gi~~ 46 (136)
T PF01832_consen 36 KEYNNFF-GIKA 46 (136)
T ss_dssp SCTT-TT---BS
T ss_pred ccccccc-ceec
Confidence 3677999 7777
No 34
>PF05501 DUF755: Domain of unknown function (DUF755) ; InterPro: IPR008474 This family is predominated by ORFs from Anelloviridae. The function of this family remains to be determined.
Probab=23.39 E-value=82 Score=21.70 Aligned_cols=28 Identities=29% Similarity=0.423 Sum_probs=24.2
Q ss_pred HHHHHHHHHhcCcHhHHHHHHHHhccCC
Q psy1631 46 IHLVVDLLSEIKDAKTLMDIAVALKDEP 73 (77)
Q Consensus 46 v~LLl~vL~e~~D~~tLl~l~~~L~r~P 73 (77)
..|.=++|+|..--..|+.++.||++.+
T Consensus 38 ~~L~~qLlKEC~kt~~L~~l~~qlq~~~ 65 (123)
T PF05501_consen 38 DSLQKQLLKECQKTSQLMQLLQQLQKQP 65 (123)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhhc
Confidence 4566789999999999999999997765
No 35
>PF15059 Speriolin_C: Speriolin C-terminus
Probab=23.17 E-value=73 Score=23.26 Aligned_cols=31 Identities=19% Similarity=0.104 Sum_probs=22.5
Q ss_pred ccccccccccccccccccccccccccCCCCchh
Q psy1631 7 TYHPQSCLTSVVHFGLGLWRTPIADLDRPGSFA 39 (77)
Q Consensus 7 ~~~~~fc~rnktNFFnGiWriPv~dIDRPGsFa 39 (77)
++||.|+| --.|-| ||-|---+--.++|+..
T Consensus 75 ~~HP~lsE-~lVN~y-GILr~rp~l~a~~~~~y 105 (146)
T PF15059_consen 75 RVHPGLSE-FLVNTY-GILRERPELAASEGGSY 105 (146)
T ss_pred ccCchHHH-HHHHHc-cccccCcccccCcCcCC
Confidence 68999999 566777 88776556666666553
No 36
>PF07693 KAP_NTPase: KAP family P-loop domain; InterPro: IPR011646 The KAP (after Kidins220/ARMS and PifA) family of predicted NTPases are sporadically distributed across a wide phylogenetic range in bacteria and in animals. Many of the prokaryotic KAP NTPases are encoded in plasmids and tend to undergo disruption to form pseudogenes. A unique feature of all eukaryotic and certain bacterial KAP NTPases is the presence of two or four transmembrane helices inserted into the P-loop NTPase domain. These transmembrane helices anchor KAP NTPases in the membrane such that the P-loop domain is located on the intracellular side [].
Probab=22.99 E-value=45 Score=23.91 Aligned_cols=26 Identities=31% Similarity=0.428 Sum_probs=19.9
Q ss_pred ccccccCC-CCchhHHHHHHHHHHHHH
Q psy1631 27 TPIADLDR-PGSFAYHMSRSIHLVVDL 52 (77)
Q Consensus 27 iPv~dIDR-PGsFa~Hm~Rsv~LLl~v 52 (77)
+-+||+|| +...+--+-+.|.+++++
T Consensus 176 iiIDdLDR~~~~~i~~~l~~ik~~~~~ 202 (325)
T PF07693_consen 176 IIIDDLDRCSPEEIVELLEAIKLLLDF 202 (325)
T ss_pred EEEcchhcCCcHHHHHHHHHHHHhcCC
Confidence 45799999 677777777777777664
No 37
>PF13929 mRNA_stabil: mRNA stabilisation
Probab=22.67 E-value=1.5e+02 Score=23.52 Aligned_cols=39 Identities=18% Similarity=0.296 Sum_probs=27.4
Q ss_pred CCchhHHHHHHHHHHHHHHHhcCcHhHHHHHHHHhccC--CCCC
Q psy1631 35 PGSFAYHMSRSIHLVVDLLSEIKDAKTLMDIAVALKDE--PEAE 76 (77)
Q Consensus 35 PGsFa~Hm~Rsv~LLl~vL~e~~D~~tLl~l~~~L~r~--Pd~~ 76 (77)
+|.-..+| |...+++|.+.+||..|++.=-+.-.. |.+|
T Consensus 197 ~~~l~~~v---i~~Il~~L~~~~dW~kl~~fW~~~~~~~~~~~D 237 (292)
T PF13929_consen 197 SKSLTRNV---IISILEILAESRDWNKLFQFWEQCIPNSVPGND 237 (292)
T ss_pred ccCCChhH---HHHHHHHHHhcccHHHHHHHHHHhcccCCCCCC
Confidence 35555454 556689999999999999986665444 5444
No 38
>PRK13291 metal-dependent hydrolase; Provisional
Probab=22.11 E-value=2e+02 Score=19.92 Aligned_cols=20 Identities=20% Similarity=0.370 Sum_probs=16.1
Q ss_pred cccccccCCCCchhHHHHHH
Q psy1631 26 RTPIADLDRPGSFAYHMSRS 45 (77)
Q Consensus 26 riPv~dIDRPGsFa~Hm~Rs 45 (77)
|.|+.++.=|+.|..+--..
T Consensus 5 ~~~~~~~~~~~~~~~~~~~~ 24 (173)
T PRK13291 5 RYPIGQFQYPEEITEEQIQE 24 (173)
T ss_pred CCCCCCCCCCCCCCHHHHHH
Confidence 67999999999998775543
No 39
>PF06812 ImpA-rel_N: ImpA-related N-terminal; InterPro: IPR010657 This entry represents a conserved region located towards the N-terminal end of ImpA and related proteins. ImpA is an inner membrane protein, which has been suggested to be involved with proteins that are exported and associated with colony variations in Actinobacillus actinomycetemcomitans []. Note that many members are hypothetical proteins.
Probab=20.63 E-value=1.5e+02 Score=17.32 Aligned_cols=25 Identities=28% Similarity=0.364 Sum_probs=19.3
Q ss_pred HHHHHHH-HhcCcHhHHHHHHHHhcc
Q psy1631 47 HLVVDLL-SEIKDAKTLMDIAVALKD 71 (77)
Q Consensus 47 ~LLl~vL-~e~~D~~tLl~l~~~L~r 71 (77)
.+-.++| ++.||.+.+..++..+.+
T Consensus 7 ~~~~~lL~~~sKDlrv~~~l~~a~~~ 32 (62)
T PF06812_consen 7 ELALELLSEQSKDLRVAVWLTEALLR 32 (62)
T ss_pred HHHHHHHHHcCchHHHHHHHHHHHHH
Confidence 4567788 888999998888877654
No 40
>KOG3904|consensus
Probab=20.51 E-value=27 Score=26.79 Aligned_cols=30 Identities=27% Similarity=0.484 Sum_probs=21.1
Q ss_pred cccccccccccccccccccccccCCCCchhHH
Q psy1631 10 PQSCLTSVVHFGLGLWRTPIADLDRPGSFAYH 41 (77)
Q Consensus 10 ~~fc~rnktNFFnGiWriPv~dIDRPGsFa~H 41 (77)
|+--++|--| .-+|++|.=|+.|=|+|.+-
T Consensus 138 Pl~~ls~a~~--a~vW~~~~yelsr~~~f~~~ 167 (209)
T KOG3904|consen 138 PLESLSPALL--AFVWLIPFYELSRVGVFISL 167 (209)
T ss_pred cccccchhhh--cceeccchhhHhHhhhhhhH
Confidence 4434444333 35999999999999998763
No 41
>COG3326 Predicted membrane protein [Function unknown]
Probab=20.27 E-value=39 Score=22.97 Aligned_cols=10 Identities=40% Similarity=0.743 Sum_probs=7.6
Q ss_pred cccccccccc
Q psy1631 20 FGLGLWRTPI 29 (77)
Q Consensus 20 FFnGiWriPv 29 (77)
--+|.||||=
T Consensus 30 A~~~~wRIpE 39 (94)
T COG3326 30 ARKGKWRIPE 39 (94)
T ss_pred HHccCCCCcH
Confidence 3468999994
No 42
>cd06336 PBP1_ABC_ligand_binding_like_3 Type I periplasmic ligand-binding domain of uncharacterized ABC (ATPase Binding Cassette)-type active transport systems that are predicted to be involved in transport of amino acids, peptides, or inorganic ions. This group includes the type I periplasmic ligand-binding domain of uncharacterized ABC (ATPase Binding Cassette)-type active transport systems that are predicted to be involved in transport of amino acids, peptides, or inorganic ions. Members of this group are sequence-similar to members of the family of ABC-type hydrophobic amino acid transporters (HAAT), such as leucine-isoleucine-valine-binding protein (LIVBP); however their ligand specificity has not been determined experimentally.
Probab=20.20 E-value=2.3e+02 Score=20.52 Aligned_cols=26 Identities=12% Similarity=0.086 Sum_probs=22.9
Q ss_pred CCchhHHHHHHHHHHHHHHHhcCcHh
Q psy1631 35 PGSFAYHMSRSIHLVVDLLSEIKDAK 60 (77)
Q Consensus 35 PGsFa~Hm~Rsv~LLl~vL~e~~D~~ 60 (77)
|+.|+...+-++.++.+.+++-++..
T Consensus 278 p~~~~~~~y~~~~~~~~Al~~ag~~~ 303 (347)
T cd06336 278 PNSEAAVSYDAVYILKAAMEAAGSVD 303 (347)
T ss_pred CcHHHHHHHHHHHHHHHHHHhcCCCC
Confidence 88899999999999999999887444
No 43
>PF11817 Foie-gras_1: Foie gras liver health family 1; InterPro: IPR021773 Mutating the gene foie gras in zebrafish has been shown to affect development; the mutants develop large, lipid-filled hepatocytes in the liver, resembling those in individuals with fatty liver disease []. Foie-gras protein is long and has several well-defined domains though none of them has a known function. We have annotated this one as the first []. THe C terminus of this region contains TPR repeats.
Probab=20.14 E-value=2.3e+02 Score=20.51 Aligned_cols=33 Identities=6% Similarity=0.214 Sum_probs=26.6
Q ss_pred chhHHHHHHHHHHHHHHHhcCcHhHHHHHHHHh
Q psy1631 37 SFAYHMSRSIHLVVDLLSEIKDAKTLMDIAVAL 69 (77)
Q Consensus 37 sFa~Hm~Rsv~LLl~vL~e~~D~~tLl~l~~~L 69 (77)
+...=+.-.+..|++|..+++|.+..+.++..|
T Consensus 212 gW~~l~~~~l~~l~~Ca~~~~~~~~~l~~~leL 244 (247)
T PF11817_consen 212 GWWSLLTEVLWRLLECAKRLGDVEDYLTTSLEL 244 (247)
T ss_pred CcHHHHHHHHHHHHHHHHHhCCHHHHHHHHHHH
Confidence 333445567788999999999999999999876
No 44
>PF09571 RE_XcyI: XcyI restriction endonuclease; InterPro: IPR019071 There are four classes of restriction endonucleases: types I, II,III and IV. All types of enzymes recognise specific short DNA sequences and carry out the endonucleolytic cleavage of DNA to give specific double-stranded fragments with terminal 5'-phosphates. They differ in their recognition sequence, subunit composition, cleavage position, and cofactor requirements [, ], as summarised below: Type I enzymes (3.1.21.3 from EC) cleave at sites remote from recognition site; require both ATP and S-adenosyl-L-methionine to function; multifunctional protein with both restriction and methylase (2.1.1.72 from EC) activities. Type II enzymes (3.1.21.4 from EC) cleave within or at short specific distances from recognition site; most require magnesium; single function (restriction) enzymes independent of methylase. Type III enzymes (3.1.21.5 from EC) cleave at sites a short distance from recognition site; require ATP (but doesn't hydrolyse it); S-adenosyl-L-methionine stimulates reaction but is not required; exists as part of a complex with a modification methylase methylase (2.1.1.72 from EC). Type IV enzymes target methylated DNA. Type II restriction endonucleases (3.1.21.4 from EC) are components of prokaryotic DNA restriction-modification mechanisms that protect the organism against invading foreign DNA. These site-specific deoxyribonucleases catalyse the endonucleolytic cleavage of DNA to give specific double-stranded fragments with terminal 5'-phosphates. Of the 3000 restriction endonucleases that have been characterised, most are homodimeric or tetrameric enzymes that cleave target DNA at sequence-specific sites close to the recognition site. For homodimeric enzymes, the recognition site is usually a palindromic sequence 4-8 bp in length. Most enzymes require magnesium ions as a cofactor for catalysis. Although they can vary in their mode of recognition, many restriction endonucleases share a similar structural core comprising four beta-strands and one alpha-helix, as well as a similar mechanism of cleavage, suggesting a common ancestral origin []. However, there is still considerable diversity amongst restriction endonucleases [, ]. The target site recognition process triggers large conformational changes of the enzyme and the target DNA, leading to the activation of the catalytic centres. Like other DNA binding proteins, restriction enzymes are capable of non-specific DNA binding as well, which is the prerequisite for efficient target site location by facilitated diffusion. Non-specific binding usually does not involve interactions with the bases but only with the DNA backbone []. This entry represents the restriction endonuclease XcyI, which recognises and cleaves the double-stranded sequence C^CCGGG. ; GO: 0000287 magnesium ion binding, 0003677 DNA binding, 0009036 Type II site-specific deoxyribonuclease activity, 0009307 DNA restriction-modification system
Probab=20.03 E-value=58 Score=26.45 Aligned_cols=40 Identities=18% Similarity=0.316 Sum_probs=30.8
Q ss_pred cCCC--CchhHHHHHHHHHHHHHHH------hcCcHhHHHHHHHHhcc
Q psy1631 32 LDRP--GSFAYHMSRSIHLVVDLLS------EIKDAKTLMDIAVALKD 71 (77)
Q Consensus 32 IDRP--GsFa~Hm~Rsv~LLl~vL~------e~~D~~tLl~l~~~L~r 71 (77)
.++| --.|.|+.++...|++... ++=|-=|||.+..|||-
T Consensus 115 ~~~~~i~dLc~~~~~~~s~Ll~~~~~~~Is~~llddltlLtlG~Ql~G 162 (318)
T PF09571_consen 115 AQEPDIQDLCVAFNASASKLLEGIEIQRISRELLDDLTLLTLGPQLRG 162 (318)
T ss_pred cccccHHHHHHHHHHHHHHHHhccccccccHHHHHHHHhhccChhhcc
Confidence 4555 3678999999999998876 44455578899999873
Done!