Query 032255
Match_columns 144
No_of_seqs 19 out of 21
Neff 2.2
Searched_HMMs 46136
Date Fri Mar 29 11:40:36 2013
Command hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/032255.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/032255hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF11568 Med29: Mediator compl 96.2 0.01 2.2E-07 46.9 5.0 68 3-75 12-80 (148)
2 PF06419 COG6: Conserved oligo 72.9 13 0.00029 34.1 6.7 81 1-85 157-237 (618)
3 PF10882 bPH_5: Bacterial PH d 67.2 2.8 6.1E-05 28.6 1.0 19 51-69 82-100 (100)
4 COG4575 ElaB Uncharacterized c 58.2 82 0.0018 24.1 7.6 59 5-63 10-80 (104)
5 PF08397 IMD: IRSp53/MIM homol 57.1 30 0.00065 27.3 5.2 24 3-26 3-26 (219)
6 PF12731 Mating_N: Mating-type 55.2 35 0.00076 24.4 4.9 15 38-52 33-47 (95)
7 smart00150 SPEC Spectrin repea 53.1 54 0.0012 20.5 5.9 25 33-57 71-95 (101)
8 TIGR02501 type_III_yscE type I 52.1 23 0.00049 24.6 3.4 25 45-69 42-66 (67)
9 PRK06801 hypothetical protein; 50.0 21 0.00045 30.3 3.5 57 56-124 159-215 (286)
10 PF07007 DUF1311: Protein of u 49.4 67 0.0014 20.4 7.0 41 6-58 25-66 (95)
11 KOG4718 Non-SMC (structural ma 45.6 39 0.00084 29.2 4.4 70 34-115 38-111 (235)
12 COG1548 Predicted transcriptio 40.1 79 0.0017 28.4 5.6 63 3-82 250-313 (330)
13 PF13320 DUF4091: Domain of un 38.3 21 0.00045 24.4 1.5 16 95-110 43-58 (68)
14 PF12450 vWF_A: von Willebrand 37.3 5.6 0.00012 29.7 -1.6 38 74-111 32-69 (99)
15 PRK09195 gatY tagatose-bisphos 37.3 45 0.00097 28.4 3.6 61 52-124 154-214 (284)
16 TIGR00996 Mtu_fam_mce virulenc 36.5 2.2E+02 0.0048 22.7 7.2 31 37-67 197-227 (291)
17 TIGR01858 tag_bisphos_ald clas 35.0 51 0.0011 28.0 3.6 62 50-124 150-212 (282)
18 PRK12857 fructose-1,6-bisphosp 32.5 58 0.0013 27.7 3.5 62 51-124 153-214 (284)
19 PF01371 Trp_repressor: Trp re 32.5 1.2E+02 0.0026 22.0 4.6 42 11-52 2-43 (87)
20 PRK12737 gatY tagatose-bisphos 30.7 66 0.0014 27.3 3.5 60 52-124 154-214 (284)
21 PF09361 Phasin_2: Phasin prot 30.0 1.6E+02 0.0034 19.1 7.3 29 37-65 23-51 (102)
22 smart00685 DM14 Repeats in fly 29.8 45 0.00099 22.9 2.0 19 41-59 1-19 (59)
23 cd00176 SPEC Spectrin repeats, 29.7 2E+02 0.0043 20.1 7.0 53 9-65 53-105 (213)
24 KOG0994 Extracellular matrix g 29.6 2.4E+02 0.0052 30.3 7.7 75 2-119 1170-1244(1758)
25 PF06348 DUF1059: Protein of u 29.2 86 0.0019 20.7 3.2 34 37-70 22-55 (57)
26 PRK05835 fructose-bisphosphate 28.9 72 0.0016 27.7 3.5 62 50-122 152-214 (307)
27 PF11640 TAN: Telomere-length 28.5 45 0.00098 25.0 2.0 18 97-114 77-94 (155)
28 PRK13012 2-oxoacid dehydrogena 27.8 4.4E+02 0.0095 26.4 8.9 30 103-132 90-120 (896)
29 PRK07709 fructose-bisphosphate 27.6 81 0.0018 26.9 3.6 60 52-124 155-215 (285)
30 KOG0040 Ca2+-binding actin-bun 27.4 86 0.0019 34.3 4.3 57 4-60 1311-1394(2399)
31 TIGR03706 exo_poly_only exopol 27.2 69 0.0015 26.2 3.0 39 35-73 52-102 (300)
32 cd00947 TBP_aldolase_IIB Tagat 26.7 98 0.0021 26.3 3.9 59 55-124 150-208 (276)
33 PRK08610 fructose-bisphosphate 25.8 91 0.002 26.6 3.6 61 52-124 155-215 (286)
34 PF08223 PaaX_C: PaaX-like pro 25.8 45 0.00097 25.3 1.6 42 5-52 91-132 (172)
35 PRK12738 kbaY tagatose-bisphos 25.7 90 0.0019 26.7 3.5 60 52-124 154-214 (286)
36 KOG0624 dsRNA-activated protei 24.8 54 0.0012 30.8 2.2 14 114-127 468-481 (504)
37 PF01213 CAP_N: Adenylate cycl 24.7 3E+02 0.0065 23.9 6.5 47 10-56 47-94 (312)
38 PRK15366 type III secretion sy 24.5 2.2E+02 0.0048 21.1 4.9 26 45-70 45-70 (80)
39 PF06148 COG2: COG (conserved 24.2 1.7E+02 0.0037 21.3 4.3 56 3-63 45-100 (133)
40 PF00435 Spectrin: Spectrin re 23.7 1.9E+02 0.0042 17.9 7.7 48 8-59 53-100 (105)
41 PF15155 MRFAP1: MORF4 family- 23.5 3.5E+02 0.0076 21.5 6.1 17 57-73 110-126 (127)
42 PF12729 4HB_MCP_1: Four helix 22.8 2.5E+02 0.0055 19.0 7.9 28 37-64 107-134 (181)
43 cd04926 ACT_ACR_4 C-terminal 21.7 74 0.0016 20.7 1.8 16 59-74 57-72 (72)
44 PF13361 UvrD_C: UvrD-like hel 21.5 65 0.0014 24.5 1.8 21 40-60 322-342 (351)
45 PRK00810 nifW nitrogenase stab 21.1 63 0.0014 24.7 1.6 13 6-18 68-80 (113)
46 TIGR03687 pupylate_cterm ubiqu 20.9 72 0.0016 20.3 1.6 14 56-69 18-31 (33)
47 PF01591 6PF2K: 6-phosphofruct 20.7 1.6E+02 0.0035 24.1 4.0 34 20-53 140-173 (222)
48 PF00390 malic: Malic enzyme, 20.6 75 0.0016 25.9 2.0 21 58-78 147-167 (182)
49 PRK10404 hypothetical protein; 20.5 3.6E+02 0.0077 19.8 7.8 27 6-32 8-34 (101)
50 TIGR02277 PaaX_trns_reg phenyl 20.4 2.5E+02 0.0055 23.4 5.1 43 5-52 169-211 (280)
No 1
>PF11568 Med29: Mediator complex subunit 29; InterPro: IPR021018 The Mediator complex is a coactivator involved in the regulated transcription of nearly all RNA polymerase II-dependent genes. Mediator functions as a bridge to convey information from gene-specific regulatory proteins to the basal RNA polymerase II transcription machinery. The Mediator complex, having a compact conformation in its free form, is recruited to promoters by direct interactions with regulatory proteins and serves for the assembly of a functional preinitiation complex with RNA polymerase II and the general transcription factors. On recruitment the Mediator complex unfolds to an extended conformation and partially surrounds RNA polymerase II, specifically interacting with the unphosphorylated form of the C-terminal domain (CTD) of RNA polymerase II. The Mediator complex dissociates from the RNA polymerase II holoenzyme and stays at the promoter when transcriptional elongation begins. The Mediator complex is composed of at least 31 subunits: MED1, MED4, MED6, MED7, MED8, MED9, MED10, MED11, MED12, MED13, MED13L, MED14, MED15, MED16, MED17, MED18, MED19, MED20, MED21, MED22, MED23, MED24, MED25, MED26, MED27, MED29, MED30, MED31, CCNC, CDK8 and CDC2L6/CDK11. The subunits form at least three structurally distinct submodules. The head and the middle modules interact directly with RNA polymerase II, whereas the elongated tail module interacts with gene-specific regulatory proteins. Mediator containing the CDK8 module is less active than Mediator lacking this module in supporting transcriptional activation. The head module contains: MED6, MED8, MED11, SRB4/MED17, SRB5/MED18, ROX3/MED19, SRB2/MED20 and SRB6/MED22. The middle module contains: MED1, MED4, NUT1/MED5, MED7, CSE2/MED9, NUT2/MED10, SRB7/MED21 and SOH1/MED31. CSE2/MED9 interacts directly with MED4. The tail module contains: MED2, PGD1/MED3, RGR1/MED14, GAL11/MED15 and SIN4/MED16. The CDK8 module contains: MED12, MED13, CCNC and CDK8. Individual preparations of the Mediator complex lacking one or more distinct subunits have been variously termed ARC, CRSP, DRIP, PC2, SMCC and TRAP. Med29, along with Med11 and Med28, in mammals, is part of the core head-region of the complex. Med29 is the apparent orthologue of the Drosophila melanogaster Intersex protein, which interacts directly with, and functions as a transcriptional coactivator for, the DNA-binding transcription factor Doublesex, so it is likely that mammalian Med29 serves as a target for one or more DNA-binding transcriptional activators []. ; GO: 0016592 mediator complex
Probab=96.18 E-value=0.01 Score=46.93 Aligned_cols=68 Identities=25% Similarity=0.445 Sum_probs=55.5
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHhhhhhhhh-hhhhhHHHHHHHHHHHHHHHHhhhchHHHHHHHHHhhcccccc
Q 032255 3 NIVDALSSAYKEFVAAAANVLEAKELSVAQ-KTAATDAALENFKQRWELFRVACDQAEEFVESVKQRIGSECLV 75 (144)
Q Consensus 3 s~Vd~l~~AY~~fv~AaaavleAke~~gg~-ktaaTDaAle~fkqr~eLF~vACD~AeelVeSvrqRIgSeclV 75 (144)
++|-.|.+++..++..||-.|.-+...+.. +..-+|.-...|...+|-|+.-|||-| +--|.+.||+-
T Consensus 12 ~Lv~~LreSl~~~~k~AA~~l~qn~~~D~g~~~~~~d~~~~RFdK~lEeFysiCDQIE-----l~L~ta~ec~~ 80 (148)
T PF11568_consen 12 SLVGPLRESLSNLMKTAAQNLQQNSLVDNGTRGKSSDEPVPRFDKNLEEFYSICDQIE-----LHLKTAIECLS 80 (148)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccCcHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHH
Confidence 467889999999999999999987666433 335577788999999999999999998 55677778865
No 2
>PF06419 COG6: Conserved oligomeric complex COG6; InterPro: IPR010490 COG6 is a component of the conserved oligomeric golgi complex, which is composed of eight different subunits and is required for normal golgi morphology and localisation.
Probab=72.87 E-value=13 Score=34.14 Aligned_cols=81 Identities=17% Similarity=0.188 Sum_probs=62.3
Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhhHHHHHHHHHHHHHHHHhhhchHHHHHHHHHhhccccccccccc
Q 032255 1 MDNIVDALSSAYKEFVAAAANVLEAKELSVAQKTAATDAALENFKQRWELFRVACDQAEEFVESVKQRIGSECLVDEATG 80 (144)
Q Consensus 1 MDs~Vd~l~~AY~~fv~AaaavleAke~~gg~ktaaTDaAle~fkqr~eLF~vACD~AeelVeSvrqRIgSeclVdeATg 80 (144)
|+.+=..+++||+++..-...-...-.....+....--.|+..+++|-.||.-..| .+-..|+++-++.+++--|+
T Consensus 157 M~~~~~~~e~a~erl~~w~q~e~~~l~~~~~~~~~~l~~al~~L~~rp~lf~~~l~----~~~~~R~~~l~~~F~~aLt~ 232 (618)
T PF06419_consen 157 MEQMSKYLERAYERLYRWVQRECRSLNLDNPEVSPLLRRALRYLRERPVLFNYCLD----EFAEARSKALLRRFLDALTR 232 (618)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhhhcCcccchHHHHHHHHHhcChHHHHHHHH----HHHHHHHHHHHHHHHHHHcC
Confidence 67777889999999998886655544333444566677999999999999977654 46678888888999998888
Q ss_pred cCCCC
Q 032255 81 SLAGK 85 (144)
Q Consensus 81 ~~~~~ 85 (144)
...++
T Consensus 233 g~~~~ 237 (618)
T PF06419_consen 233 GGPGG 237 (618)
T ss_pred CCCCC
Confidence 76644
No 3
>PF10882 bPH_5: Bacterial PH domain; InterPro: IPR020482 This entry contains membrane proteins with no known function.
Probab=67.22 E-value=2.8 Score=28.57 Aligned_cols=19 Identities=32% Similarity=0.711 Sum_probs=16.6
Q ss_pred HHHhhhchHHHHHHHHHhh
Q 032255 51 FRVACDQAEEFVESVKQRI 69 (144)
Q Consensus 51 F~vACD~AeelVeSvrqRI 69 (144)
+.++|++.++|++.+++|.
T Consensus 82 y~isp~~~~~fi~~l~~r~ 100 (100)
T PF10882_consen 82 YVISPEDPEEFIEALKKRA 100 (100)
T ss_pred EEEcCCCHHHHHHHHHhcC
Confidence 4579999999999999983
No 4
>COG4575 ElaB Uncharacterized conserved protein [Function unknown]
Probab=58.17 E-value=82 Score=24.09 Aligned_cols=59 Identities=19% Similarity=0.366 Sum_probs=41.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHhhhhhhhhhh----hhhHHHHHHHHHHHHHH--------HHhhhchHHHHH
Q 032255 5 VDALSSAYKEFVAAAANVLEAKELSVAQKT----AATDAALENFKQRWELF--------RVACDQAEEFVE 63 (144)
Q Consensus 5 Vd~l~~AY~~fv~AaaavleAke~~gg~kt----aaTDaAle~fkqr~eLF--------~vACD~AeelVe 63 (144)
+|.|.+-.++|++-+-.+|..+....+++- ++-+.+|+.-+.|+.=- +.+-+.+|+||+
T Consensus 10 ~~~l~~el~~L~d~lEevL~ssg~~a~~e~~~lR~r~~~~Lk~~r~rl~~~~d~v~~~sk~a~~~tD~yV~ 80 (104)
T COG4575 10 IDQLLAELQELLDTLEEVLKSSGSLAGDEAEELRSKAESALKEARDRLGDTGDAVVQRSKAAADATDDYVR 80 (104)
T ss_pred HHHHHHHHHHHHHHHHHHHHhcccchhhHHHHHHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHH
Confidence 388999999999999999998877655543 23566777766665433 455667777764
No 5
>PF08397 IMD: IRSp53/MIM homology domain; InterPro: IPR013606 The IMD (IRSp53 and MIM (missing in metastases) homology) domain is a BAR-like domain of approximately 250 amino acids found at the N-terminal in the insulin receptor tyrosine kinase substrate p53 (IRSp53) and in the evolutionarily related IRSp53/MIM family. In IRSp53, a ubiquitous regulator o the actin cytoskeleton, the IMD domain acts as conserved F-actin bundling domain involved in filopodium formation. Filopodium-inducing IMD activity is regulated by Cdc42 and Rac1 (Rho-family GTPases) and is SH3-independent [, , ]. The IRSp53/MIM family is a novel F-actin bundling protein family that includes invertebrate relatives: Vertebrate MIM (missing in metastasis), an actin-binding scaffold protein that may be involved in cancer metastasis. Vertebrate ABBA-1, a MIM-related protein. Vertebrate brain-specific angiogenesis inhibitor 1-associated protein 2 (BAI1-associated protein 2) or insulin receptor tyrosine kinase substrate p53 (IRSp53), a multifunctional adaptor protein that links Rac1 with a Wiskott-Aldrich syndrome family verprolin-homologous protein 2 (WAVE2) to induce lamellipodia or Cdc42 with Mena to induce filopodia []. Vertebrate brain-specific angiogenesis inhibitor 1-associated protein 2-like proteins 1 and 2 (BAI1-associated protein 2-like proteins 1 and 2). Drosophila melanogaster (Fruit fly) CG32082-PA. Caenorhabditis elegans M04F3.5 protein. The vertebrate IRSp53/MIM family is divided into two major groups: the IRSp53 subfamily and the MIM/ABBA subfamily. The putative invertebrate homologues are positioned between them. The IRSp53 subfamily members contain an SH3 domain, and the MIM/ABBA subfamily proteins contain a WH2 (WASP-homology 2) domain. The vertebrate SH3-containing subfamily is further divided into three groups according to the presence or absence of the WWB and the half-CRIB motif. The IMD domain can bind to and bundle actin filaments, bind to membranes and interact with the small GTPase Rac [, ]. The IMD domain folds as a coiled coil of three extended alpha-helices and a shorter C-terminal helix. Helix 4 packs tightly against the other three helices, and thus represents an integral part of the domain. The fold of the IMD domain closely resembles that of the BAR (Bin-Amphiphysin-RVS) domain, a functional module serving both as a sensor and inducer of membrane curvature []. The WH2 domain performs a scaffolding function [].; GO: 0008093 cytoskeletal adaptor activity, 0017124 SH3 domain binding, 0007165 signal transduction, 0046847 filopodium assembly; PDB: 2D1L_A 3OK8_B 1WDZ_B 1Y2O_A 2YKT_A.
Probab=57.09 E-value=30 Score=27.34 Aligned_cols=24 Identities=21% Similarity=0.400 Sum_probs=20.8
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHhh
Q 032255 3 NIVDALSSAYKEFVAAAANVLEAK 26 (144)
Q Consensus 3 s~Vd~l~~AY~~fv~AaaavleAk 26 (144)
+|++.|+-+|++||.++.....|=
T Consensus 3 ~i~~~~~P~~e~lv~~~~kY~~al 26 (219)
T PF08397_consen 3 NIMEDFNPAWENLVSLGKKYQKAL 26 (219)
T ss_dssp HHHHTHHHHHHHHHHHHHHHHHHH
T ss_pred hHHhhcCHHHHHHHHHHHHHHHHH
Confidence 689999999999999998877643
No 6
>PF12731 Mating_N: Mating-type protein beta 1; InterPro: IPR024333 This entry represents a group of homeodomain-containing transcription factor proteins involved in mating [].
Probab=55.20 E-value=35 Score=24.41 Aligned_cols=15 Identities=40% Similarity=0.778 Sum_probs=12.4
Q ss_pred HHHHHHHHHHHHHHH
Q 032255 38 DAALENFKQRWELFR 52 (144)
Q Consensus 38 DaAle~fkqr~eLF~ 52 (144)
+.+|+.|..+|+-|.
T Consensus 33 ~~~L~~F~~~w~~f~ 47 (95)
T PF12731_consen 33 SDALESFLSSWSSFD 47 (95)
T ss_pred hHHHHHHHHHHHHHH
Confidence 446999999999994
No 7
>smart00150 SPEC Spectrin repeats.
Probab=53.11 E-value=54 Score=20.49 Aligned_cols=25 Identities=20% Similarity=0.558 Sum_probs=19.2
Q ss_pred hhhhhHHHHHHHHHHHHHHHHhhhc
Q 032255 33 KTAATDAALENFKQRWELFRVACDQ 57 (144)
Q Consensus 33 ktaaTDaAle~fkqr~eLF~vACD~ 57 (144)
........++.+.+||+-++..|..
T Consensus 71 ~~~~i~~~~~~l~~~w~~l~~~~~~ 95 (101)
T smart00150 71 DAEEIEERLEELNERWEELKELAEE 95 (101)
T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4455677799999999988877754
No 8
>TIGR02501 type_III_yscE type III secretion system protein, YseE family. Members of this family are found exclusively in type III secretion appparatus gene clusters in bacteria. Those bacteria with a protein from this family tend to target animal cells, as does Yersinia pestis. This protein is small (about 70 amino acids) and not well characterized.
Probab=52.12 E-value=23 Score=24.59 Aligned_cols=25 Identities=28% Similarity=0.466 Sum_probs=22.3
Q ss_pred HHHHHHHHHhhhchHHHHHHHHHhh
Q 032255 45 KQRWELFRVACDQAEEFVESVKQRI 69 (144)
Q Consensus 45 kqr~eLF~vACD~AeelVeSvrqRI 69 (144)
+|.|..-+.||+.|+..++.+.-|.
T Consensus 42 Yq~l~~~~~A~~aA~~II~~l~~ry 66 (67)
T TIGR02501 42 YQEWQLLADAIEAAIKIIEILTGRY 66 (67)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 6789999999999999999988764
No 9
>PRK06801 hypothetical protein; Provisional
Probab=49.98 E-value=21 Score=30.29 Aligned_cols=57 Identities=33% Similarity=0.541 Sum_probs=40.3
Q ss_pred hchHHHHHHHHHhhccccccccccccCCCCCCCCcCCCCCCchhHhHHHHHhhHHHHhhhhhcCCCCCC
Q 032255 56 DQAEEFVESVKQRIGSECLVDEATGSLAGKSGQAATTGLPPISAVRLEQMSKAVRWLVIELQHGGTGGS 124 (144)
Q Consensus 56 D~AeelVeSvrqRIgSeclVdeATg~~~~~~~~~aa~g~ppisavRleqmskAVrwlvielq~g~g~~~ 124 (144)
++|.+|++ +.|-+||-- +-|+++|+.. +.|++.--||+++.+++..=+ .+||+.|-+
T Consensus 159 e~a~~f~~----~tgvD~LAv-aiGt~Hg~y~-----~~~~l~~e~l~~i~~~~~~PL--VlHGGSgi~ 215 (286)
T PRK06801 159 QLARDFVD----RTGIDALAV-AIGNAHGKYK-----GEPKLDFARLAAIHQQTGLPL--VLHGGSGIS 215 (286)
T ss_pred HHHHHHHH----HHCcCEEEe-ccCCCCCCCC-----CCCCCCHHHHHHHHHhcCCCE--EEECCCCCC
Confidence 45555553 357788766 8899999863 356788889999999887522 568876533
No 10
>PF07007 DUF1311: Protein of unknown function (DUF1311); InterPro: IPR009739 This family consists of several bacterial proteins of around 120 residues in length. Members of this family contain four highly conserved cysteine residues. The function of this family is unknown.; PDB: 3GI7_A.
Probab=49.43 E-value=67 Score=20.44 Aligned_cols=41 Identities=24% Similarity=0.454 Sum_probs=25.3
Q ss_pred HHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhhHHHHHHHHHHHHHHHH-hhhch
Q 032255 6 DALSSAYKEFVAAAANVLEAKELSVAQKTAATDAALENFKQRWELFRV-ACDQA 58 (144)
Q Consensus 6 d~l~~AY~~fv~AaaavleAke~~gg~ktaaTDaAle~fkqr~eLF~v-ACD~A 58 (144)
..|+++|+.++.... .. ..+-..|+.-...|..||- .|+..
T Consensus 25 ~~l~~~y~~~~~~l~----~~--------~~~~~~l~~~Q~~W~~~r~~~C~~~ 66 (95)
T PF07007_consen 25 KELNKAYKALLKRLK----QE--------ERARKALKESQRAWLKYRDAECAFA 66 (95)
T ss_dssp HHHHHHHHHHHHHHH----S-----------HHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHhH----Hh--------hHHHHHHHHHHHHHHHHHHHHHhhh
Confidence 356677776665543 11 1134677777789999987 67643
No 11
>KOG4718 consensus Non-SMC (structural maintenance of chromosomes) element 1 protein (NSE1) [Chromatin structure and dynamics]
Probab=45.61 E-value=39 Score=29.16 Aligned_cols=70 Identities=24% Similarity=0.340 Sum_probs=44.2
Q ss_pred hhhhHHHHHHHHHHHHHHHHhhhchHHHHHHHHHhh----ccccccccccccCCCCCCCCcCCCCCCchhHhHHHHHhhH
Q 032255 34 TAATDAALENFKQRWELFRVACDQAEEFVESVKQRI----GSECLVDEATGSLAGKSGQAATTGLPPISAVRLEQMSKAV 109 (144)
Q Consensus 34 taaTDaAle~fkqr~eLF~vACD~AeelVeSvrqRI----gSeclVdeATg~~~~~~~~~aa~g~ppisavRleqmskAV 109 (144)
+..+..++|+|.+..- |.+..|=..|++-+ |-+-||-. +.++.|-...+||.| ++-||=|+||+
T Consensus 38 ~~g~k~~~edfinein------~~lhnld~~ikr~~~~~dgr~~lvyv---Nla~tp~SkmaT~f~---~nEielfrkal 105 (235)
T KOG4718|consen 38 KPGSKEAIEDFINEIN------DTLHNLDQLIKRIKYPVDGREYLVYV---NLAATPDSKMATGFT---ANEIELFRKAL 105 (235)
T ss_pred ccchHhHHHHHHHHHH------HHHhhhhhhheeeeecCCCceEEEEE---ecCCChhHHhcCCCC---HHHHHHHHHHH
Confidence 4557888999988765 44555545555422 22222222 233444445677776 47899999999
Q ss_pred HHHhhh
Q 032255 110 RWLVIE 115 (144)
Q Consensus 110 rwlvie 115 (144)
+|+.++
T Consensus 106 E~im~s 111 (235)
T KOG4718|consen 106 EKIMSS 111 (235)
T ss_pred HHHHhh
Confidence 999987
No 12
>COG1548 Predicted transcriptional regulator/sugar kinase [Transcription / Carbohydrate transport and metabolism]
Probab=40.10 E-value=79 Score=28.44 Aligned_cols=63 Identities=27% Similarity=0.417 Sum_probs=47.8
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhhHHHHHHHHHHHHHHHHhhhchHHHHHHHHHhhcccc-cccccccc
Q 032255 3 NIVDALSSAYKEFVAAAANVLEAKELSVAQKTAATDAALENFKQRWELFRVACDQAEEFVESVKQRIGSEC-LVDEATGS 81 (144)
Q Consensus 3 s~Vd~l~~AY~~fv~AaaavleAke~~gg~ktaaTDaAle~fkqr~eLF~vACD~AeelVeSvrqRIgSec-lVdeATg~ 81 (144)
.+.|--+++|..++.--+-.++.+-+--|.++ -|+|---|.|.+-+-.|+|-|| .+||--|.
T Consensus 250 ~v~d~A~~~~~~~l~~l~e~I~~~a~r~gL~~-----------------Vv~~GlGefLi~~A~~~lg~ec~~i~e~~g~ 312 (330)
T COG1548 250 EVIDLAEKAYNALLELLAENIEEKAKRYGLNT-----------------VVATGLGEFLIQEACKRLGYECISIDETYGK 312 (330)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHcChhh-----------------hhhccchHHHHHHHHHhhCCeEEEhhhhhcc
Confidence 45667788999999988888887766656555 3667777788888888899999 67777665
Q ss_pred C
Q 032255 82 L 82 (144)
Q Consensus 82 ~ 82 (144)
-
T Consensus 313 ~ 313 (330)
T COG1548 313 E 313 (330)
T ss_pred c
Confidence 4
No 13
>PF13320 DUF4091: Domain of unknown function (DUF4091)
Probab=38.31 E-value=21 Score=24.42 Aligned_cols=16 Identities=31% Similarity=0.733 Sum_probs=13.9
Q ss_pred CCchhHhHHHHHhhHH
Q 032255 95 PPISAVRLEQMSKAVR 110 (144)
Q Consensus 95 ppisavRleqmskAVr 110 (144)
-|++.+|||.|-.+++
T Consensus 43 ~p~~SiRle~lr~G~q 58 (68)
T PF13320_consen 43 GPVSSIRLEVLREGIQ 58 (68)
T ss_pred CcccCHHHHHHHHHHH
Confidence 5889999999998874
No 14
>PF12450 vWF_A: von Willebrand factor ; InterPro: IPR022156 This domain family is found in bacteria, and is approximately 100 amino acids in length. The family is found in association with PF00092 from PFAM. There are two conserved sequence motifs: STF and DVD. There are two completely conserved residues (E and N) that may be functionally important. In hemostasis, platelet adhesion to the damaged vessel wall is mediated by several proteins, including von Willebrand factor. In solution vWF becomes immobilized via its A3 domain on the fibrillar collagen of the vessel wall and acts as an intermediary between collagen and the platelet receptor glycoprotein Ibalpha (GPIbalpha), which is the only platelet receptor that does not require prior activation for bond formation.
Probab=37.32 E-value=5.6 Score=29.71 Aligned_cols=38 Identities=29% Similarity=0.368 Sum_probs=22.1
Q ss_pred ccccccccCCCCCCCCcCCCCCCchhHhHHHHHhhHHH
Q 032255 74 LVDEATGSLAGKSGQAATTGLPPISAVRLEQMSKAVRW 111 (144)
Q Consensus 74 lVdeATg~~~~~~~~~aa~g~ppisavRleqmskAVrw 111 (144)
-+|.-|+|-+--.--...--+||-.|||+|.|-...++
T Consensus 32 SiDVDtaSYs~vRr~L~~G~lPp~~aVR~EE~iNyF~Y 69 (99)
T PF12450_consen 32 SIDVDTASYSNVRRFLNQGQLPPPDAVRIEEMINYFDY 69 (99)
T ss_pred eEEcCcccHHHHHHHHHCCCCCCcCceeHHHHhhcccC
Confidence 34555554332211122233899999999999765543
No 15
>PRK09195 gatY tagatose-bisphosphate aldolase; Reviewed
Probab=37.31 E-value=45 Score=28.39 Aligned_cols=61 Identities=33% Similarity=0.429 Sum_probs=42.4
Q ss_pred HHhhhchHHHHHHHHHhhccccccccccccCCCCCCCCcCCCCCCchhHhHHHHHhhHHHHhhhhhcCCCCCC
Q 032255 52 RVACDQAEEFVESVKQRIGSECLVDEATGSLAGKSGQAATTGLPPISAVRLEQMSKAVRWLVIELQHGGTGGS 124 (144)
Q Consensus 52 ~vACD~AeelVeSvrqRIgSeclVdeATg~~~~~~~~~aa~g~ppisavRleqmskAVrwlvielq~g~g~~~ 124 (144)
+-.-++|++||+ +-|-+|| =.+-|++||.-. +-|-|.--||+.+.+++.- =+ .+||+.|-+
T Consensus 154 ~T~peea~~Fv~----~TgvD~L-AvaiGt~HG~y~-----~~p~Ld~~~L~~I~~~~~v-PL-VLHGgSG~~ 214 (284)
T PRK09195 154 YTDPAQAREFVE----ATGIDSL-AVAIGTAHGMYK-----GEPKLDFDRLENIRQWVNI-PL-VLHGASGLP 214 (284)
T ss_pred CCCHHHHHHHHH----HHCcCEE-eeccCccccccC-----CCCcCCHHHHHHHHHHhCC-Ce-EEecCCCCC
Confidence 445678899988 4455554 357788888853 2467888899999998853 11 468887643
No 16
>TIGR00996 Mtu_fam_mce virulence factor Mce family protein. Members of this paralogous family are found as six tandem homologous proteins in the same orientation per cassette, in four separate cassettes in Mycobacterium tuberculosis. The six members of each cassette represent six subfamilies. One subfamily includes the protein mce (mycobacterial cell entry), a virulence protein required for invasion of non-phagocytic cells.
Probab=36.48 E-value=2.2e+02 Score=22.75 Aligned_cols=31 Identities=10% Similarity=0.169 Sum_probs=25.2
Q ss_pred hHHHHHHHHHHHHHHHHhhhchHHHHHHHHH
Q 032255 37 TDAALENFKQRWELFRVACDQAEEFVESVKQ 67 (144)
Q Consensus 37 TDaAle~fkqr~eLF~vACD~AeelVeSvrq 67 (144)
.+..++++.++++-+....++...++..+++
T Consensus 197 l~~l~~~l~~~~~~l~~~v~~l~~~~~~l~~ 227 (291)
T TIGR00996 197 LNRVLDVLADRSDQLDRLLDNLATLTAQLAD 227 (291)
T ss_pred HHHHHHHHHhhhHHHHHHHHHHHHHHHHHHh
Confidence 4566788888888888888888888887765
No 17
>TIGR01858 tag_bisphos_ald class II aldolase, tagatose bisphosphate family. This model describes tagatose-1,6-bisphosphate aldolases, and perhaps other closely related class II aldolases. This tetrameric, Zn2+-dependent enzyme is related to the class II fructose bisphosphate aldolase; fructose 1,6-bisphosphate and tagatose 1,6 bisphosphate differ only in chirality at C4.
Probab=35.01 E-value=51 Score=28.04 Aligned_cols=62 Identities=27% Similarity=0.397 Sum_probs=44.6
Q ss_pred HHHHhhhchHHHHHHHHHhhccccccccccccCCCCCCCCcCCCCCCchhHhHHHHHhhHHH-HhhhhhcCCCCCC
Q 032255 50 LFRVACDQAEEFVESVKQRIGSECLVDEATGSLAGKSGQAATTGLPPISAVRLEQMSKAVRW-LVIELQHGGTGGS 124 (144)
Q Consensus 50 LF~vACD~AeelVeSvrqRIgSeclVdeATg~~~~~~~~~aa~g~ppisavRleqmskAVrw-lvielq~g~g~~~ 124 (144)
.+.-.-|+|.+||+ +-|-+|| =.+-|++||.-.. -|-|.--||+.+.+++.- | .+||+.|-+
T Consensus 150 ~~~T~peea~~Fv~----~TgvD~L-AvaiGt~HG~yk~-----~p~Ldf~~L~~I~~~~~iPL---VlHGgSG~~ 212 (282)
T TIGR01858 150 ALYTDPQEAKEFVE----ATGVDSL-AVAIGTAHGLYKK-----TPKLDFDRLAEIREVVDVPL---VLHGASDVP 212 (282)
T ss_pred hccCCHHHHHHHHH----HHCcCEE-ecccCccccCcCC-----CCccCHHHHHHHHHHhCCCe---EEecCCCCC
Confidence 33556688999987 5565665 4678889987542 467888899999999864 3 468887643
No 18
>PRK12857 fructose-1,6-bisphosphate aldolase; Reviewed
Probab=32.49 E-value=58 Score=27.66 Aligned_cols=62 Identities=29% Similarity=0.423 Sum_probs=42.8
Q ss_pred HHHhhhchHHHHHHHHHhhccccccccccccCCCCCCCCcCCCCCCchhHhHHHHHhhHHHHhhhhhcCCCCCC
Q 032255 51 FRVACDQAEEFVESVKQRIGSECLVDEATGSLAGKSGQAATTGLPPISAVRLEQMSKAVRWLVIELQHGGTGGS 124 (144)
Q Consensus 51 F~vACD~AeelVeSvrqRIgSeclVdeATg~~~~~~~~~aa~g~ppisavRleqmskAVrwlvielq~g~g~~~ 124 (144)
|+-.=++|.+||+ +-|-+|| =.+-|++||.-. +-|-|.--||+.+.+++.-- + .+||+.|-+
T Consensus 153 ~~T~pe~a~~Fv~----~TgvD~L-AvaiGt~HG~y~-----~~p~Ld~~~L~~i~~~~~vP-L-VlHGgSG~~ 214 (284)
T PRK12857 153 AMTDPEEARRFVE----ETGVDAL-AIAIGTAHGPYK-----GEPKLDFDRLAKIKELVNIP-I-VLHGSSGVP 214 (284)
T ss_pred hcCCHHHHHHHHH----HHCCCEE-eeccCccccccC-----CCCcCCHHHHHHHHHHhCCC-E-EEeCCCCCC
Confidence 3445578888885 4465665 457788888753 35678888999999988631 1 578887643
No 19
>PF01371 Trp_repressor: Trp repressor protein; InterPro: IPR000831 The Trp repressor (TrpR) binds to at least five operators in the Escherichia coli genome, repressing gene expression. The operators at which it binds vary considerably in DNA sequence and location within the promoter; when bound to the Trp operon it recognises the sequence 5'-ACTAGT-3' and acts to prevent the initiation of transcription. The TrpR controls the trpEDCBA (trpO) operon and the genes for trpR, aroH, mtr and aroL, which are involved in the biosynthesis and uptake of the amino acid tryptophan []. The repressor binds to the operators only in the presence of L-tryptophan, thereby controlling the intracellular level of its effector; the complex also regulates Trp repressor biosynthesis by binding to its own regulatory region. TrpR acts as a dimer that is composed of identical 6-helical subunits, where four of the helices form the core of the protein and intertwine with the corresponding helices from the other subunit.; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0006355 regulation of transcription, DNA-dependent, 0005622 intracellular; PDB: 3FRW_H 3KOR_A 3SSW_N 1P6Z_N 1CO0_B 1JHG_A 1WRT_S 1WRS_R 1WRP_R 1RCS_B ....
Probab=32.46 E-value=1.2e+02 Score=21.96 Aligned_cols=42 Identities=21% Similarity=0.348 Sum_probs=27.6
Q ss_pred HHHHHHHHHHHHHHhhhhhhhhhhhhhHHHHHHHHHHHHHHH
Q 032255 11 AYKEFVAAAANVLEAKELSVAQKTAATDAALENFKQRWELFR 52 (144)
Q Consensus 11 AY~~fv~AaaavleAke~~gg~ktaaTDaAle~fkqr~eLF~ 52 (144)
.|+.|+.+....=...+...--.-.-|-.=++++-|||++++
T Consensus 2 ~~~~l~~~i~~l~~~ee~~~f~~dL~T~~E~~~l~~R~~va~ 43 (87)
T PF01371_consen 2 DWDELFEAILSLKDEEECYDFFEDLCTPDELEALAQRWQVAK 43 (87)
T ss_dssp HHHHHHHHHHCHHCHHCHHHHHHHHSSHHHHHHHHHHHHHHH
T ss_pred hHHHHHHHHHHcCCHHHHHHHHHHhCCHHHHHHHHHHHHHHH
Confidence 456666665554444444444445667888999999999987
No 20
>PRK12737 gatY tagatose-bisphosphate aldolase; Reviewed
Probab=30.70 E-value=66 Score=27.34 Aligned_cols=60 Identities=33% Similarity=0.471 Sum_probs=42.7
Q ss_pred HHhhhchHHHHHHHHHhhccccccccccccCCCCCCCCcCCCCCCchhHhHHHHHhhHHH-HhhhhhcCCCCCC
Q 032255 52 RVACDQAEEFVESVKQRIGSECLVDEATGSLAGKSGQAATTGLPPISAVRLEQMSKAVRW-LVIELQHGGTGGS 124 (144)
Q Consensus 52 ~vACD~AeelVeSvrqRIgSeclVdeATg~~~~~~~~~aa~g~ppisavRleqmskAVrw-lvielq~g~g~~~ 124 (144)
+-.-++|.+||+- -|-+|| =.+-|++||.-. +-|-|.--||+.+.+++.- | .+||+.|-|
T Consensus 154 ~T~peeA~~Fv~~----TgvD~L-AvaiGt~HG~y~-----~~p~Ld~~~L~~I~~~~~iPL---VlHGgSG~~ 214 (284)
T PRK12737 154 YTNPDAAAEFVER----TGIDSL-AVAIGTAHGLYK-----GEPKLDFERLAEIREKVSIPL---VLHGASGVP 214 (284)
T ss_pred CCCHHHHHHHHHH----hCCCEE-eeccCccccccC-----CCCcCCHHHHHHHHHHhCCCE---EEeCCCCCC
Confidence 4455788888875 465655 467888898853 3567888899999999864 3 468887643
No 21
>PF09361 Phasin_2: Phasin protein; InterPro: IPR018968 Phasins (or granule-associate proteins) are surface proteins found covering Polyhydroxyalkanoate (PHA) storage granules in bacteria. Polyhydroxyalkanoates are linear polyesters produced by bacterial fermentation of sugar or lipids for the purpose of storing carbon and energy, and are accumulated as intracellular granules by many bacteria under unfavorable conditions, enhancing their fitness and stress resistance []. The layer of phasins stabilises the granules and prevents coalescence of separated granules in the cytoplasm and nonspecific binding of other proteins to the hydrophobic surfaces of the granules. For example, in Ralstonia eutropha (strain ATCC 17699/H16/DSM 428/Stanier 337) (Cupriavidus necator (strain ATCC 17699 / H16 / DSM 428 / Stanier 337)), the major surface protein of polyhydroxybutyrate (PHB) granules is phasin PhaP1(Reu), which occurs along with three homologues (PhaP2, PhaP3, and PhaP4) that have the capacity to bind to PHB granules but are present at minor levels [, ]. These four phasins lack a highly conserved domain but share homologous hydrophobic regions. This entry describes a group of phasins that associate with polyhydroxyalkanoate (PHA) inclusions, the most common of which consist of polyhydroxybutyrate (PHB).
Probab=29.95 E-value=1.6e+02 Score=19.08 Aligned_cols=29 Identities=14% Similarity=0.013 Sum_probs=14.3
Q ss_pred hHHHHHHHHHHHHHHHHhhhchHHHHHHH
Q 032255 37 TDAALENFKQRWELFRVACDQAEEFVESV 65 (144)
Q Consensus 37 TDaAle~fkqr~eLF~vACD~AeelVeSv 65 (144)
.+.+-+=.+...++++-.++.+-++++.+
T Consensus 23 ~~~~~~l~~~~~~~~~~~~~~~~~~~~~l 51 (102)
T PF09361_consen 23 LEGAEALAALNLEAARQNVEEAFAQAQQL 51 (102)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33333344445555555555555555444
No 22
>smart00685 DM14 Repeats in fly CG4713, worm Y37H9A.3 and human FLJ20241.
Probab=29.85 E-value=45 Score=22.87 Aligned_cols=19 Identities=37% Similarity=0.399 Sum_probs=16.1
Q ss_pred HHHHHHHHHHHHHhhhchH
Q 032255 41 LENFKQRWELFRVACDQAE 59 (144)
Q Consensus 41 le~fkqr~eLF~vACD~Ae 59 (144)
|+.+.||.+-|..|+.+|.
T Consensus 1 l~~L~~R~~~yk~Aa~~AK 19 (59)
T smart00685 1 LALLQQRQEQYKQAALQAK 19 (59)
T ss_pred ChHHHHHHHHHHHHHHHHH
Confidence 5778999999999998764
No 23
>cd00176 SPEC Spectrin repeats, found in several proteins involved in cytoskeletal structure; family members include spectrin, alpha-actinin and dystrophin; the spectrin repeat forms a three helix bundle with the second helix interrupted by proline in some sequences; the repeats are independent folding units; tandem repeats are found in differing numbers and arrange in an antiparallel manner to form dimers; the repeats are defined by a characteristic tryptophan (W) residue in helix A and a leucine (L) at the carboxyl end of helix C and separated by a linker of 5 residues; two copies of the repeat are present here
Probab=29.67 E-value=2e+02 Score=20.12 Aligned_cols=53 Identities=17% Similarity=0.341 Sum_probs=30.9
Q ss_pred HHHHHHHHHHHHHHHHhhhhhhhhhhhhhHHHHHHHHHHHHHHHHhhhchHHHHHHH
Q 032255 9 SSAYKEFVAAAANVLEAKELSVAQKTAATDAALENFKQRWELFRVACDQAEEFVESV 65 (144)
Q Consensus 9 ~~AY~~fv~AaaavleAke~~gg~ktaaTDaAle~fkqr~eLF~vACD~AeelVeSv 65 (144)
...+..+...+..+++... .....--.-++.+..||+-|...|+.=....+..
T Consensus 53 ~~~~~~l~~~~~~L~~~~~----~~~~~i~~~~~~l~~~w~~l~~~~~~r~~~L~~~ 105 (213)
T cd00176 53 EERVEALNELGEQLIEEGH----PDAEEIQERLEELNQRWEELRELAEERRQRLEEA 105 (213)
T ss_pred HHHHHHHHHHHHHHHhcCC----CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4445555555555554321 2334455667788888888887777655555544
No 24
>KOG0994 consensus Extracellular matrix glycoprotein Laminin subunit beta [Extracellular structures]
Probab=29.64 E-value=2.4e+02 Score=30.31 Aligned_cols=75 Identities=21% Similarity=0.271 Sum_probs=52.2
Q ss_pred chHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhhHHHHHHHHHHHHHHHHhhhchHHHHHHHHHhhcccccccccccc
Q 032255 2 DNIVDALSSAYKEFVAAAANVLEAKELSVAQKTAATDAALENFKQRWELFRVACDQAEEFVESVKQRIGSECLVDEATGS 81 (144)
Q Consensus 2 Ds~Vd~l~~AY~~fv~AaaavleAke~~gg~ktaaTDaAle~fkqr~eLF~vACD~AeelVeSvrqRIgSeclVdeATg~ 81 (144)
|.|+.+|.+..++|+..|..+-. .| ++..| ...=+..|+=++-||.-++
T Consensus 1170 D~il~~L~~rt~rl~~~A~~l~~----tG---------v~gay-------~s~f~~me~kl~~ir~il~----------- 1218 (1758)
T KOG0994|consen 1170 DAILQELALRTHRLINRAKELKQ----TG---------VLGAY-------ASRFLDMEEKLEEIRAILS----------- 1218 (1758)
T ss_pred HHHHHHHHHHHHHHHHHHHHhhh----cc---------Cchhh-------HhHHHHHHHHHHHHHHHhc-----------
Confidence 88999999999999998876532 11 12222 2233456777777877443
Q ss_pred CCCCCCCCcCCCCCCchhHhHHHHHhhHHHHhhhhhcC
Q 032255 82 LAGKSGQAATTGLPPISAVRLEQMSKAVRWLVIELQHG 119 (144)
Q Consensus 82 ~~~~~~~~aa~g~ppisavRleqmskAVrwlvielq~g 119 (144)
-|.+|++-|+||+++..-|==+||-+
T Consensus 1219 ------------~~svs~~~i~~l~~~~~~lr~~l~~~ 1244 (1758)
T KOG0994|consen 1219 ------------APSVSAEDIAQLASATESLRRQLQAL 1244 (1758)
T ss_pred ------------CCCccHHHHHHHHHHHHHHHHHHHHH
Confidence 46789999999999987766665544
No 25
>PF06348 DUF1059: Protein of unknown function (DUF1059); InterPro: IPR009409 This entry consists of short hypothetical archaeal and bacterial proteins of unknown function.
Probab=29.17 E-value=86 Score=20.65 Aligned_cols=34 Identities=21% Similarity=0.217 Sum_probs=27.7
Q ss_pred hHHHHHHHHHHHHHHHHhhhchHHHHHHHHHhhc
Q 032255 37 TDAALENFKQRWELFRVACDQAEEFVESVKQRIG 70 (144)
Q Consensus 37 TDaAle~fkqr~eLF~vACD~AeelVeSvrqRIg 70 (144)
.|--|+++.+...--|.--|--+++++-||++|-
T Consensus 22 edEll~~~~~Ha~~~Hg~~~~~~el~~~ir~~I~ 55 (57)
T PF06348_consen 22 EDELLEAVVEHAREVHGMTEIPEELREKIRSAIK 55 (57)
T ss_pred HHHHHHHHHHHHHHhcCCccCCHHHHHHHHHHhh
Confidence 3667888888888888777777899999999884
No 26
>PRK05835 fructose-bisphosphate aldolase; Provisional
Probab=28.92 E-value=72 Score=27.73 Aligned_cols=62 Identities=26% Similarity=0.375 Sum_probs=44.3
Q ss_pred HHHHhhhchHHHHHHHHHhhccccccccccccCCCCCCCCcCCCCCCchhHhHHHHHhhHHH-HhhhhhcCCCC
Q 032255 50 LFRVACDQAEEFVESVKQRIGSECLVDEATGSLAGKSGQAATTGLPPISAVRLEQMSKAVRW-LVIELQHGGTG 122 (144)
Q Consensus 50 LF~vACD~AeelVeSvrqRIgSeclVdeATg~~~~~~~~~aa~g~ppisavRleqmskAVrw-lvielq~g~g~ 122 (144)
.++-.-|+|.+||+ |-|-+|| =.+=|++||.-.+. +-|-|.--||+.+.+++.- | ..||+.|
T Consensus 152 ~~~TdPeeA~~Fv~----~TgvD~L-AvaiGt~HG~Yk~~---~~p~L~f~~L~~I~~~~~iPL---VLHGgSG 214 (307)
T PRK05835 152 AVLVNPKEAEQFVK----ESQVDYL-APAIGTSHGAFKFK---GEPKLDFERLQEVKRLTNIPL---VLHGASA 214 (307)
T ss_pred ccCCCHHHHHHHHH----hhCCCEE-EEccCccccccCCC---CCCccCHHHHHHHHHHhCCCE---EEeCCCC
Confidence 34566788999987 4465553 46778899885431 3577888999999999864 3 3588776
No 27
>PF11640 TAN: Telomere-length maintenance and DNA damage repair; InterPro: IPR021668 ATM is a large protein kinase, in humans, critical for responding to DNA double-strand breaks (DSBs). Tel1, the orthologue from budding yeast, also regulates responses to DSBs. Tel1 is important for maintaining viability and for phosphorylation of the DNA damage signal transducer kinase Rad53 (an orthologue of mammalian CHK2). In addition to functioning in the response to DSBs, numerous findings indicate that Tel1/ATM regulates telomeres. The overall domain structure of Tel1/ATM is shared by proteins of the phosphatidylinositol 3-kinase (PI3K)-related kinase (PIKK) family, but this family carries a unique and functionally important TAN sequence motif, near its N-terminal, LxxxKxxE/DRxxxL. which is conserved specifically in the Tel1/ATM subclass of the PIKKs. The TAN motif is essential for both telomere length maintenance and Tel1 action in response to DNA damage []. It is classified as an 2.7.11.1 from EC. ; GO: 0004674 protein serine/threonine kinase activity
Probab=28.52 E-value=45 Score=24.98 Aligned_cols=18 Identities=28% Similarity=0.368 Sum_probs=15.9
Q ss_pred chhHhHHHHHhhHHHHhh
Q 032255 97 ISAVRLEQMSKAVRWLVI 114 (144)
Q Consensus 97 isavRleqmskAVrwlvi 114 (144)
+...||+.++.++||+|-
T Consensus 77 ~~~~RL~~~a~~lR~~ve 94 (155)
T PF11640_consen 77 TAESRLSSCASALRLFVE 94 (155)
T ss_pred HHHHHHHHHHHHHHHHHH
Confidence 467899999999999984
No 28
>PRK13012 2-oxoacid dehydrogenase subunit E1; Provisional
Probab=27.76 E-value=4.4e+02 Score=26.40 Aligned_cols=30 Identities=13% Similarity=0.315 Sum_probs=20.9
Q ss_pred HHHHhhHHHHhhhhhcCCCCCC-CCCCCCCC
Q 032255 103 EQMSKAVRWLVIELQHGGTGGS-SAHAHPSV 132 (144)
Q Consensus 103 eqmskAVrwlvielq~g~g~~~-~~~~~~s~ 132 (144)
+.+.+.+||++|...+-++... +.++|++.
T Consensus 90 ~~i~~~iR~~a~~mv~~A~~~~~~~GGH~~s 120 (896)
T PRK13012 90 ERLAAIIRWNALAMVVRANRAYGELGGHIAS 120 (896)
T ss_pred HHHHHHHHHHHHHHHHhcCCCCCCCCCCCcc
Confidence 4489999999999988777422 33455544
No 29
>PRK07709 fructose-bisphosphate aldolase; Provisional
Probab=27.64 E-value=81 Score=26.87 Aligned_cols=60 Identities=32% Similarity=0.488 Sum_probs=43.0
Q ss_pred HHhhhchHHHHHHHHHhhccccccccccccCCCCCCCCcCCCCCCchhHhHHHHHhhHHH-HhhhhhcCCCCCC
Q 032255 52 RVACDQAEEFVESVKQRIGSECLVDEATGSLAGKSGQAATTGLPPISAVRLEQMSKAVRW-LVIELQHGGTGGS 124 (144)
Q Consensus 52 ~vACD~AeelVeSvrqRIgSeclVdeATg~~~~~~~~~aa~g~ppisavRleqmskAVrw-lvielq~g~g~~~ 124 (144)
.-.-++|.+||+ +-|-+|| -.+=|++||.-. +-|-|.--||+++.+++.- | ..||+.|-+
T Consensus 155 yT~peeA~~Fv~----~TgvD~L-AvaiGt~HG~Y~-----~~p~L~~~~L~~I~~~~~iPL---VLHGgSG~~ 215 (285)
T PRK07709 155 YADPAECKHLVE----ATGIDCL-APALGSVHGPYK-----GEPNLGFAEMEQVRDFTGVPL---VLHGGTGIP 215 (285)
T ss_pred CCCHHHHHHHHH----HhCCCEE-EEeecccccCcC-----CCCccCHHHHHHHHHHHCCCE---EEeCCCCCC
Confidence 456688888886 3476775 347788888753 3567888999999999864 3 468887643
No 30
>KOG0040 consensus Ca2+-binding actin-bundling protein (spectrin), alpha chain (EF-Hand protein superfamily) [Cytoskeleton]
Probab=27.38 E-value=86 Score=34.28 Aligned_cols=57 Identities=19% Similarity=0.205 Sum_probs=43.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhhHHHHHHHHHH---------------------------HHHHHHhhh
Q 032255 4 IVDALSSAYKEFVAAAANVLEAKELSVAQKTAATDAALENFKQR---------------------------WELFRVACD 56 (144)
Q Consensus 4 ~Vd~l~~AY~~fv~AaaavleAke~~gg~ktaaTDaAle~fkqr---------------------------~eLF~vACD 56 (144)
+.+.+.+.|...+++=+.+.+|=++.|.+.-+.-+-|-...+++ .+||++.|+
T Consensus 1311 Allerhqe~rte~daRa~tfqA~eqf~~~ll~~~h~As~eie~kl~~~~l~r~~le~awv~rr~~ldq~lelqLf~~dce 1390 (2399)
T KOG0040|consen 1311 ALLERHQEHRTEIDARAGTFQAFEQFGNELLDSGHYASPEIEKKLQAVKLERDDLEKAWVKRRKILDQCLELQLFQRDCE 1390 (2399)
T ss_pred HHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHhccCcCCHHHHHHHHHHHhhccchHHHHHHHHHHHHhhhhHHHHHhhhH
Confidence 45778889999999999999999998887766655554443333 358999999
Q ss_pred chHH
Q 032255 57 QAEE 60 (144)
Q Consensus 57 ~Aee 60 (144)
+++-
T Consensus 1391 q~e~ 1394 (2399)
T KOG0040|consen 1391 QAES 1394 (2399)
T ss_pred HHHH
Confidence 9863
No 31
>TIGR03706 exo_poly_only exopolyphosphatase. It appears that a single enzyme may act as both exopolyphosphatase (Ppx) and guanosine pentaphosphate phosphohydrolase (GppA) in a number of species. Members of the seed alignment use to define this exception-level model are encoded adjacent to a polyphosphate kinase 1 gene, and the trusted cutoff is set high enough (425) that no genome has a second hit. Therefore all members may be presumed to at least share exopolyphospatase activity, and may lack GppA activity. GppA acts in the stringent response.
Probab=27.17 E-value=69 Score=26.25 Aligned_cols=39 Identities=21% Similarity=0.289 Sum_probs=30.6
Q ss_pred hhhHHHHHHHHHHHHHHHHh------------hhchHHHHHHHHHhhcccc
Q 032255 35 AATDAALENFKQRWELFRVA------------CDQAEEFVESVKQRIGSEC 73 (144)
Q Consensus 35 aaTDaAle~fkqr~eLF~vA------------CD~AeelVeSvrqRIgSec 73 (144)
.++-.+|+.|++.++-|.+. +.+.++|++.|+++.|-+.
T Consensus 52 ~~~~~~l~~f~~~~~~~~v~~i~~vaTsa~R~A~N~~~~~~~i~~~tgi~i 102 (300)
T TIGR03706 52 ERALEALKRFAELLRGFPVDEVRAVATAALRDAKNGPEFLREAEAILGLPI 102 (300)
T ss_pred HHHHHHHHHHHHHHHhCCCCeEEEEEcHHHHcCCCHHHHHHHHHHHHCCCe
Confidence 34678899999999877652 4689999999999987443
No 32
>cd00947 TBP_aldolase_IIB Tagatose-1,6-bisphosphate (TBP) aldolase and related Type B Class II aldolases. TBP aldolase is a tetrameric class II aldolase that catalyzes the reversible condensation of dihydroxyacetone phosphate with glyceraldehyde 3-phsophate to produce tagatose 1,6-bisphosphate. There is an absolute requirement for a divalent metal ion, usually zinc, and in addition the enzymes are activated by monovalent cations such as Na+. The type A and type B Class II FBPA's differ in the presence and absence of distinct indels in the sequence that result in differing loop lengths in the structures.
Probab=26.71 E-value=98 Score=26.25 Aligned_cols=59 Identities=34% Similarity=0.499 Sum_probs=41.0
Q ss_pred hhchHHHHHHHHHhhccccccccccccCCCCCCCCcCCCCCCchhHhHHHHHhhHHHHhhhhhcCCCCCC
Q 032255 55 CDQAEEFVESVKQRIGSECLVDEATGSLAGKSGQAATTGLPPISAVRLEQMSKAVRWLVIELQHGGTGGS 124 (144)
Q Consensus 55 CD~AeelVeSvrqRIgSeclVdeATg~~~~~~~~~aa~g~ppisavRleqmskAVrwlvielq~g~g~~~ 124 (144)
=++|++||+- -|-+|| =.+-|++||.-.+ |-|-|.--||+++.++|+- -=.+||+.|-+
T Consensus 150 pe~a~~Fv~~----TgvD~L-AvsiGt~HG~Y~~----~~p~L~~~~L~~i~~~~~v--PLVlHGgSG~~ 208 (276)
T cd00947 150 PEEAEEFVEE----TGVDAL-AVAIGTSHGAYKG----GEPKLDFDRLKEIAERVNV--PLVLHGGSGIP 208 (276)
T ss_pred HHHHHHHHHH----HCCCEE-EeccCccccccCC----CCCccCHHHHHHHHHHhCC--CEEEeCCCCCC
Confidence 4567777753 465665 3577888888533 4677888899999999963 22578887643
No 33
>PRK08610 fructose-bisphosphate aldolase; Reviewed
Probab=25.81 E-value=91 Score=26.63 Aligned_cols=61 Identities=28% Similarity=0.455 Sum_probs=41.5
Q ss_pred HHhhhchHHHHHHHHHhhccccccccccccCCCCCCCCcCCCCCCchhHhHHHHHhhHHHHhhhhhcCCCCCC
Q 032255 52 RVACDQAEEFVESVKQRIGSECLVDEATGSLAGKSGQAATTGLPPISAVRLEQMSKAVRWLVIELQHGGTGGS 124 (144)
Q Consensus 52 ~vACD~AeelVeSvrqRIgSeclVdeATg~~~~~~~~~aa~g~ppisavRleqmskAVrwlvielq~g~g~~~ 124 (144)
+-.=++|++||+ +-|-+||= .+=|++||.-. +-|-|.--||+.+.+++.-- + .+||+.|-+
T Consensus 155 yT~peea~~Fv~----~TgvD~LA-vaiGt~HG~Y~-----~~p~Ld~~~L~~I~~~~~vP-L-VLHGgSG~~ 215 (286)
T PRK08610 155 YADPKECQELVE----KTGIDALA-PALGSVHGPYK-----GEPKLGFKEMEEIGLSTGLP-L-VLHGGTGIP 215 (286)
T ss_pred cCCHHHHHHHHH----HHCCCEEE-eeccccccccC-----CCCCCCHHHHHHHHHHHCCC-E-EEeCCCCCC
Confidence 344477888885 55766652 45588888753 34678888999999988641 1 468887644
No 34
>PF08223 PaaX_C: PaaX-like protein C-terminal domain; InterPro: IPR013225 This family contains proteins that are similar to the product of the paaX gene of Escherichia coli (P76086 from SWISSPROT). This protein is involved in the regulation of expression of a group of proteins known to participate in the metabolism of phenylacetic acid [].; PDB: 3L09_B 3KFW_X.
Probab=25.79 E-value=45 Score=25.26 Aligned_cols=42 Identities=33% Similarity=0.386 Sum_probs=22.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhhHHHHHHHHHHHHHHH
Q 032255 5 VDALSSAYKEFVAAAANVLEAKELSVAQKTAATDAALENFKQRWELFR 52 (144)
Q Consensus 5 Vd~l~~AY~~fv~AaaavleAke~~gg~ktaaTDaAle~fkqr~eLF~ 52 (144)
+|+|...|++|++.-...++.- .......-.+.|.-|..|-|
T Consensus 91 l~~l~~~Y~~f~~~~~~~~~~l------~~~~~~~~~~a~~~r~~L~h 132 (172)
T PF08223_consen 91 LDELAAAYEAFLARFRPLLAAL------ARAARLDPEEAFVARLLLVH 132 (172)
T ss_dssp HHHHHHHHHHHHHC--S----------------CCHHHCHHHHHHHHH
T ss_pred cHHHHHHHHHHHHHhhhccccc------ccccCCCHHHHHHHHHHHHH
Confidence 5889999999999988777432 11112333456777766654
No 35
>PRK12738 kbaY tagatose-bisphosphate aldolase; Reviewed
Probab=25.66 E-value=90 Score=26.68 Aligned_cols=60 Identities=28% Similarity=0.386 Sum_probs=42.1
Q ss_pred HHhhhchHHHHHHHHHhhccccccccccccCCCCCCCCcCCCCCCchhHhHHHHHhhHHH-HhhhhhcCCCCCC
Q 032255 52 RVACDQAEEFVESVKQRIGSECLVDEATGSLAGKSGQAATTGLPPISAVRLEQMSKAVRW-LVIELQHGGTGGS 124 (144)
Q Consensus 52 ~vACD~AeelVeSvrqRIgSeclVdeATg~~~~~~~~~aa~g~ppisavRleqmskAVrw-lvielq~g~g~~~ 124 (144)
+-.=++|.+||+- -|-+|| =.+-|++||.-. +-|-|.--||+.+.+++.- | .+||+.|-+
T Consensus 154 ~T~peea~~Fv~~----TgvD~L-AvaiGt~HG~Y~-----~~p~Ldfd~l~~I~~~~~vPL---VLHGgSG~~ 214 (286)
T PRK12738 154 LTDPQEAKRFVEL----TGVDSL-AVAIGTAHGLYS-----KTPKIDFQRLAEIREVVDVPL---VLHGASDVP 214 (286)
T ss_pred CCCHHHHHHHHHH----hCCCEE-EeccCcccCCCC-----CCCcCCHHHHHHHHHHhCCCE---EEeCCCCCC
Confidence 4445677788753 365555 467889998853 3478889999999999863 3 468887644
No 36
>KOG0624 consensus dsRNA-activated protein kinase inhibitor P58, contains TPR and DnaJ domains [Defense mechanisms]
Probab=24.80 E-value=54 Score=30.80 Aligned_cols=14 Identities=43% Similarity=0.560 Sum_probs=9.3
Q ss_pred hhhhcCCCCCCCCC
Q 032255 114 IELQHGGTGGSSAH 127 (144)
Q Consensus 114 ielq~g~g~~~~~~ 127 (144)
-|-|+||||+++-+
T Consensus 468 ~Es~q~GGGg~~Hg 481 (504)
T KOG0624|consen 468 PESQQGGGGGPFHG 481 (504)
T ss_pred hhhccCCCCCCCCC
Confidence 37788888855433
No 37
>PF01213 CAP_N: Adenylate cyclase associated (CAP) N terminal; InterPro: IPR013992 Cyclase-associated proteins (CAPs) are highly conserved actin-binding proteins present in a wide range of organisms including yeast, fly, plants, and mammals. CAPs are multifunctional proteins that contain several structural domains. CAP is involved in species-specific signalling pathways [, , , ]. In Drosophila, CAP functions in Hedgehog-mediated eye development and in establishing oocyte polarity. In Dictyostelium (slim mold), CAP is involved in microfilament reorganisation near the plasma membrane in a PIP2-regulated manner and is required to perpetuate the cAMP relay signal to organise fruitbody formation. In plants, CAP is involved in plant signalling pathways required for co-ordinated organ expansion. In yeast, CAP is involved in adenylate cyclase activation, as well as in vesicle trafficking and endocytosis. In both yeast and mammals, CAPs appear to be involved in recycling G-actin monomers from ADF/cofilins for subsequent rounds of filament assembly [, ]. In mammals, there are two different CAPs (CAP1 and CAP2) that share 64% amino acid identity. All CAPs appear to contain a C-terminal actin-binding domain that regulates actin remodelling in response to cellular signals and is required for normal cellular morphology, cell division, growth and locomotion in eukaryotes. CAP directly regulates actin filament dynamics and has been implicated in a number of complex developmental and morphological processes, including mRNA localisation and the establishment of cell polarity. Actin exists both as globular (G) (monomeric) actin subunits and assembled into filamentous (F) actin. In cells, actin cycles between these two forms. Proteins that bind F-actin often regulate F-actin assembly and its interaction with other proteins, while proteins that interact with G-actin often control the availability of unpolymerised actin. CAPs bind G-actin. In addition to actin-binding, CAPs can have additional roles, and may act as bifunctional proteins. In Saccharomyces cerevisiae (Baker's yeast), CAP is a component of the adenylyl cyclase complex (Cyr1p) that serves as an effector of Ras during normal cell signalling. S. cerevisiae CAP functions to expose adenylate cyclase binding sites to Ras, thereby enabling adenylate cyclase to be activated by Ras regulatory signals. In Schizosaccharomyces pombe (Fission yeast), CAP is also required for adenylate cyclase activity, but not through the Ras pathway. In both organisms, the N-terminal domain is responsible for adenylate cyclase activation, but the S cerevisiae and S. pombe N-termini cannot complement one another. Yeast CAPs are unique among the CAP family of proteins, because they are the only ones to directly interact with and activate adenylate cyclase []. S. cerevisiae CAP has four major domains. In addition to the N-terminal adenylate cyclase-interacting domain, and the C-terminal actin-binding domain, it possesses two other domains: a proline-rich domain that interacts with Src homology 3 (SH3) domains of specific proteins, and a domain that is responsible for CAP oligomerisation to form multimeric complexes (although oligomerisation appears to involve the N- and C-terminal domains as well). The proline-rich domain interacts with profilin, a protein that catalyses nucleotide exchange on G-actin monomers and promotes addition to barbed ends of filamentous F-actin []. Since CAP can bind profilin via a proline-rich domain, and G-actin via a C-terminal domain, it has been suggested that a ternary G-actin/CAP/profilin complex could be formed. This entry represents the N-terminal domain of CAP proteins. This domain has an all-alpha structure consisting of six helices in a bundle with a left-handed twist and an up-and-down topology [].; GO: 0003779 actin binding, 0007010 cytoskeleton organization; PDB: 1TJF_B 1S0P_A.
Probab=24.66 E-value=3e+02 Score=23.93 Aligned_cols=47 Identities=13% Similarity=0.218 Sum_probs=38.9
Q ss_pred HHHHHHH-HHHHHHHHhhhhhhhhhhhhhHHHHHHHHHHHHHHHHhhh
Q 032255 10 SAYKEFV-AAAANVLEAKELSVAQKTAATDAALENFKQRWELFRVACD 56 (144)
Q Consensus 10 ~AY~~fv-~AaaavleAke~~gg~ktaaTDaAle~fkqr~eLF~vACD 56 (144)
.+|++|| .....+++....=||.-...+..-.+.|..-.++..++|-
T Consensus 47 ~afD~~i~~~l~~f~~~S~~igg~V~~~a~~v~~aF~~qr~~L~~as~ 94 (312)
T PF01213_consen 47 EAFDELINGPLKPFVELSKKIGGDVAEQAQLVKKAFQAQRKFLLVASK 94 (312)
T ss_dssp HHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHc
Confidence 5899999 5678889988888988777889999999988888888874
No 38
>PRK15366 type III secretion system chaperone SsaE; Provisional
Probab=24.51 E-value=2.2e+02 Score=21.10 Aligned_cols=26 Identities=27% Similarity=0.245 Sum_probs=22.6
Q ss_pred HHHHHHHHHhhhchHHHHHHHHHhhc
Q 032255 45 KQRWELFRVACDQAEEFVESVKQRIG 70 (144)
Q Consensus 45 kqr~eLF~vACD~AeelVeSvrqRIg 70 (144)
+|+.++.+.||-+|+..+.-+--|--
T Consensus 45 Yqq~t~l~ea~lqA~~IIn~i~~ryh 70 (80)
T PRK15366 45 YQQNTLLLEAIEQAENIINIIYYRYH 70 (80)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 78899999999999999988876654
No 39
>PF06148 COG2: COG (conserved oligomeric Golgi) complex component, COG2; InterPro: IPR024602 This entry represents the uncharacterised N-terminal domain of subunit 2 of the COG complex. The COG complex comprises eight proteins COG1-8 and plays critical roles in Golgi structure and function [].; PDB: 2JQQ_A.
Probab=24.23 E-value=1.7e+02 Score=21.28 Aligned_cols=56 Identities=21% Similarity=0.438 Sum_probs=18.2
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhhHHHHHHHHHHHHHHHHhhhchHHHHH
Q 032255 3 NIVDALSSAYKEFVAAAANVLEAKELSVAQKTAATDAALENFKQRWELFRVACDQAEEFVE 63 (144)
Q Consensus 3 s~Vd~l~~AY~~fv~AaaavleAke~~gg~ktaaTDaAle~fkqr~eLF~vACD~AeelVe 63 (144)
.+|+-+|+-|++||.-+.....-. ........-|.+|+...+-.+..=+...+-|+
T Consensus 45 ~Li~lIN~dY~dFv~Ls~~L~g~~-----~~i~~l~~~L~~~~~~v~~~~~~l~~~~~~i~ 100 (133)
T PF06148_consen 45 ELIELINDDYADFVSLSTNLVGMD-----EKIEELRKPLSQFREEVESVRDELDNTQEEIE 100 (133)
T ss_dssp ----------------------------------HHHHHHHHHHHHHHHHHS-STTHHHHH
T ss_pred HHHHHHHhhHHHHHHHHHHHccHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 578889999999999776543321 23444667788888888877776666655444
No 40
>PF00435 Spectrin: Spectrin repeat; InterPro: IPR002017 Spectrin repeats [] are found in several proteins involved in cytoskeletal structure. These include spectrin alpha and beta subunits [, ], alpha-actinin [] and dystrophin. The spectrin repeat forms a three-helix bundle. The second helix is interrupted by proline in some sequences. The repeats are defined by a characteristic tryptophan (W) residue at position 17 in helix A and a leucine (L) at 2 residues from the carboxyl end of helix C.; GO: 0005515 protein binding; PDB: 1HCI_A 1QUU_A 3FB2_B 1S35_A 1U5P_A 1U4Q_A 1CUN_B 1YDI_B 3EDV_A 1AJ3_A ....
Probab=23.70 E-value=1.9e+02 Score=17.93 Aligned_cols=48 Identities=15% Similarity=0.339 Sum_probs=29.7
Q ss_pred HHHHHHHHHHHHHHHHHhhhhhhhhhhhhhHHHHHHHHHHHHHHHHhhhchH
Q 032255 8 LSSAYKEFVAAAANVLEAKELSVAQKTAATDAALENFKQRWELFRVACDQAE 59 (144)
Q Consensus 8 l~~AY~~fv~AaaavleAke~~gg~ktaaTDaAle~fkqr~eLF~vACD~Ae 59 (144)
....++.+...+..++... ...+.....-++++.+||+-|...|+.=.
T Consensus 53 ~~~~l~~l~~~~~~L~~~~----~~~~~~i~~~~~~l~~~w~~l~~~~~~r~ 100 (105)
T PF00435_consen 53 RQERLESLNEQAQQLIDSG----PEDSDEIQEKLEELNQRWEALCELVEERR 100 (105)
T ss_dssp HHHHHHHHHHHHHHHHHTT----HTTHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHcC----CCcHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3444455555555443322 33445667788999999999987776543
No 41
>PF15155 MRFAP1: MORF4 family-associated protein1
Probab=23.47 E-value=3.5e+02 Score=21.50 Aligned_cols=17 Identities=41% Similarity=0.341 Sum_probs=14.2
Q ss_pred chHHHHHHHHHhhcccc
Q 032255 57 QAEEFVESVKQRIGSEC 73 (144)
Q Consensus 57 ~AeelVeSvrqRIgSec 73 (144)
-|+-|||-+|+=-+|||
T Consensus 110 MaemlVeLvrrIe~ses 126 (127)
T PF15155_consen 110 MAEMLVELVRRIERSES 126 (127)
T ss_pred HHHHHHHHHHHHHhccC
Confidence 46779999998888888
No 42
>PF12729 4HB_MCP_1: Four helix bundle sensory module for signal transduction; InterPro: IPR024478 This entry represents a four-helix bundle that operates as a ubiquitous sensory module in prokaryotic signal-transduction, which is known as four-helix bundles methyl-accepting chemotaxis protein (4HB_MCP) domain. The 4HB_MCP is always found between two predicted transmembrane helices indicating that it detects only extracellular signals. In many cases the domain is associated with a cytoplasmic HAMP domain suggesting that most proteins carrying the bundle might share the mechanism of transmembrane signalling which is well-characterised in E coli chemoreceptors [].
Probab=22.76 E-value=2.5e+02 Score=18.95 Aligned_cols=28 Identities=36% Similarity=0.615 Sum_probs=20.8
Q ss_pred hHHHHHHHHHHHHHHHHhhhchHHHHHH
Q 032255 37 TDAALENFKQRWELFRVACDQAEEFVES 64 (144)
Q Consensus 37 TDaAle~fkqr~eLF~vACD~AeelVeS 64 (144)
....++.|++.|+-|+...++.-+++.+
T Consensus 107 ~~~~~~~~~~~~~~y~~~~~~~~~~~~~ 134 (181)
T PF12729_consen 107 EKQLLEEFKEAWKAYRKLRDQVIELAKS 134 (181)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHC
Confidence 3557888888888888888777666544
No 43
>cd04926 ACT_ACR_4 C-terminal ACT domain, of a novel type of ACT domain-containing protein which is composed almost entirely of four ACT domain repeats (the "ACR" protein). This CD includes the C-terminal ACT domain, of a novel type of ACT domain-containing protein which is composed almost entirely of four ACT domain repeats (the "ACR" protein). ACR proteins, found only in Arabidopsis and Oryza, as yet, are proposed to function as novel regulatory or sensor proteins in plants. Nine ACR gene products have been described (ACR1-8 in Arabidopsis and OsARC1-9 in Oryza) and are represented in this CD. Members of this CD belong to the superfamily of ACT regulatory domains.
Probab=21.74 E-value=74 Score=20.71 Aligned_cols=16 Identities=44% Similarity=0.875 Sum_probs=13.6
Q ss_pred HHHHHHHHHhhccccc
Q 032255 59 EEFVESVKQRIGSECL 74 (144)
Q Consensus 59 eelVeSvrqRIgSecl 74 (144)
.+..+.++++||.-||
T Consensus 57 ~~~~~~l~~~l~~~~~ 72 (72)
T cd04926 57 PKTIEAVRQEIGPACL 72 (72)
T ss_pred HHHHHHHHHHhccccC
Confidence 4688899999999886
No 44
>PF13361 UvrD_C: UvrD-like helicase C-terminal domain; PDB: 1UAA_B 3U4Q_A 3U44_A 1W36_B 3K70_B 2IS6_B 3LFU_A 2IS2_B 2IS1_B 2IS4_A ....
Probab=21.53 E-value=65 Score=24.50 Aligned_cols=21 Identities=29% Similarity=0.430 Sum_probs=17.1
Q ss_pred HHHHHHHHHHHHHHhhhchHH
Q 032255 40 ALENFKQRWELFRVACDQAEE 60 (144)
Q Consensus 40 Ale~fkqr~eLF~vACD~Aee 60 (144)
-.+.+.+.+-||+|||-||++
T Consensus 322 ~~~~~~Ee~rl~YVA~TRAk~ 342 (351)
T PF13361_consen 322 DRQELEEERRLFYVAMTRAKE 342 (351)
T ss_dssp HHHHHHHHHHHHHHHHHTEEE
T ss_pred hHhhhHHHHhHheEecchhhc
Confidence 456677889999999999864
No 45
>PRK00810 nifW nitrogenase stabilizing/protective protein; Provisional
Probab=21.09 E-value=63 Score=24.69 Aligned_cols=13 Identities=46% Similarity=0.774 Sum_probs=10.5
Q ss_pred HHHHHHHHHHHHH
Q 032255 6 DALSSAYKEFVAA 18 (144)
Q Consensus 6 d~l~~AY~~fv~A 18 (144)
..|++||++|+..
T Consensus 68 ~aL~~AY~dF~~S 80 (113)
T PRK00810 68 AVLERAYADFVAS 80 (113)
T ss_pred HHHHHHHHHHccC
Confidence 5688999999864
No 46
>TIGR03687 pupylate_cterm ubiquitin-like protein Pup. Members of this protein family are Pup, a small protein whose ligation to target proteins steers them toward degradation. This protein family occurs in a number of bacteria, especially Actinobacteria such as Mycobacterium tuberculosis, that possess an archeal-type proteasome. All members of this protein family known during model construction end with the C-terminal motif [FY][VI]QKGG[QE]. Ligation is thought to occur between the C-terminal COOH of Pup and an epsilon-amino group of a Lys on the target protein. The N-terminal half of this protein is poorly conserved and not represented in the seed alignment.
Probab=20.94 E-value=72 Score=20.26 Aligned_cols=14 Identities=43% Similarity=0.508 Sum_probs=11.3
Q ss_pred hchHHHHHHHHHhh
Q 032255 56 DQAEEFVESVKQRI 69 (144)
Q Consensus 56 D~AeelVeSvrqRI 69 (144)
.+||+||++-.|.=
T Consensus 18 ~NAe~FV~~fVQKG 31 (33)
T TIGR03687 18 SNAEEFVRGFVQKG 31 (33)
T ss_pred HhHHHHHHHHHHcc
Confidence 57999999988853
No 47
>PF01591 6PF2K: 6-phosphofructo-2-kinase; InterPro: IPR013079 6-Phosphofructo-2-kinase (2.7.1.105 from EC, 3.1.3.46 from EC) is a bifunctional enzyme that catalyses both the synthesis and the degradation of fructose-2, 6-bisphosphate. The fructose-2,6-bisphosphatase reaction involves a phosphohistidine intermediate. The catalytic pathway is: ATP + D-fructose 6-phosphate = ADP + D-fructose 2,6-bisphosphate D-fructose 2,6-bisphosphate + H2O = 6-fructose 6-phosphate + Pi The enzyme is important in the regulation of hepatic carbohydrate metabolism and is found in greatest quantities in the liver, kidney and heart. In mammals, several genes often encode different isoforms, each of which differs in its tissue distribution and enzymatic activity []. The family described here bears a resemblance to the ATP-driven phospho-fructokinases, however, they share little sequence similarity, although a few residues seem key to their interaction with fructose 6-phosphate []. This domain forms the N-terminal region of this enzyme, while IPR013078 from INTERPRO forms the C-terminal domain.; GO: 0003873 6-phosphofructo-2-kinase activity, 0005524 ATP binding, 0006000 fructose metabolic process; PDB: 2DWO_A 3QPW_A 3QPV_A 3QPU_A 2I1V_B 2DWP_A 2AXN_A 1K6M_B 3BIF_A 2BIF_A ....
Probab=20.72 E-value=1.6e+02 Score=24.10 Aligned_cols=34 Identities=32% Similarity=0.419 Sum_probs=23.1
Q ss_pred HHHHHhhhhhhhhhhhhhHHHHHHHHHHHHHHHH
Q 032255 20 ANVLEAKELSVAQKTAATDAALENFKQRWELFRV 53 (144)
Q Consensus 20 aavleAke~~gg~ktaaTDaAle~fkqr~eLF~v 53 (144)
.++.+.+..|.-=+.-..+.|+++|++|++.+.-
T Consensus 140 ~NI~~~~~~spDY~~~~~e~A~~Df~~RI~~Ye~ 173 (222)
T PF01591_consen 140 RNIREKKQNSPDYKGMDPEEAIEDFKKRIEHYEK 173 (222)
T ss_dssp HHHHHHHTTSGGGTTS-HHHHHHHHHHHHHHHHT
T ss_pred HHHHHHHcCCcccccCCHHHHHHHHHHHHHhhcc
Confidence 3556666654444555568899999999998753
No 48
>PF00390 malic: Malic enzyme, N-terminal domain; InterPro: IPR012301 Malic enzymes (malate oxidoreductases) catalyse the oxidative decarboxylation of malate to form pyruvate [], a reaction important in a number of metabolic pathways - e.g. carbon dioxide released from the reaction may be used in sugar production during the Calvin cycle of photosynthesis []. There are 3 forms of the enzyme []: an NAD-dependent form that decarboxylates oxaloacetate; an NAD-dependent form that does not decarboxylate oxalo-acetate; and an NADPH-dependent form []. Other proteins known to be similar to malic enzymes are the Escherichia coli scfA protein; an enzyme from Zea mays (Maize), formerly thought to be cinnamyl-alcohol dehydrogenase []; and the hypothetical Saccharomyces cerevisiae protein YKL029c. Studies on the duck liver malic enzyme reveals that it can be alkylated by bromopyruvate, resulting in the loss of oxidative decarboxylation and the subsequent enhancement of pyruvate reductase activity []. The alkylated form is able to bind NADPH but not L-malate, indicating impaired substrate-or divalent metal ion-binding in the active site []. Sequence analysis has highlighted a cysteine residue as the point of alkylation, suggesting that it may play an important role in the activity of the enzyme [], although it is absent in the sequences from some species. There are three well conserved regions in the enzyme sequences. Two of them seem to be involved in the binding NAD or NADP. The significance of the third one, located in the central part of the enzymes, is not yet known.; GO: 0016616 oxidoreductase activity, acting on the CH-OH group of donors, NAD or NADP as acceptor, 0055114 oxidation-reduction process; PDB: 2DVM_B 1WW8_A 2HAE_B 1VL6_A 1PJ2_A 1PJL_B 1GZ3_A 1PJ4_A 1PJ3_C 1EFL_A ....
Probab=20.56 E-value=75 Score=25.94 Aligned_cols=21 Identities=43% Similarity=0.759 Sum_probs=17.8
Q ss_pred hHHHHHHHHHhhccccccccc
Q 032255 58 AEEFVESVKQRIGSECLVDEA 78 (144)
Q Consensus 58 AeelVeSvrqRIgSeclVdeA 78 (144)
.+|||++|+++-|..|+|...
T Consensus 147 vdefv~av~~~~gp~~~IqfE 167 (182)
T PF00390_consen 147 VDEFVEAVKRRFGPNALIQFE 167 (182)
T ss_dssp HHHHHHHHHHHHGCTSEEEE-
T ss_pred HHHHHHHHHHHhCCCeEEEEe
Confidence 589999999999999988543
No 49
>PRK10404 hypothetical protein; Provisional
Probab=20.46 E-value=3.6e+02 Score=19.80 Aligned_cols=27 Identities=11% Similarity=0.108 Sum_probs=17.5
Q ss_pred HHHHHHHHHHHHHHHHHHHhhhhhhhh
Q 032255 6 DALSSAYKEFVAAAANVLEAKELSVAQ 32 (144)
Q Consensus 6 d~l~~AY~~fv~AaaavleAke~~gg~ 32 (144)
+.|.+-.+.+++-+..+|......+++
T Consensus 8 ~~l~~dl~~L~~dle~Ll~~~~~~a~e 34 (101)
T PRK10404 8 TRIDDDLTLLSETLEEVLRSSGDPADQ 34 (101)
T ss_pred hHHHHHHHHHHHHHHHHHHHhhhhhHH
Confidence 456666677777777777665544444
No 50
>TIGR02277 PaaX_trns_reg phenylacetic acid degradation operon negative regulatory protein PaaX. This transcriptional regulator is always found in association with operons believed to be involved in the degradation of phenylacetic acid. The gene product has been shown to bind to the promoter sites and repress their transcription.
Probab=20.36 E-value=2.5e+02 Score=23.44 Aligned_cols=43 Identities=16% Similarity=0.242 Sum_probs=26.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhhHHHHHHHHHHHHHHH
Q 032255 5 VDALSSAYKEFVAAAANVLEAKELSVAQKTAATDAALENFKQRWELFR 52 (144)
Q Consensus 5 Vd~l~~AY~~fv~AaaavleAke~~gg~ktaaTDaAle~fkqr~eLF~ 52 (144)
+|+|+..|++|+..-..++..-..... .. .-.+.|..|.-|.|
T Consensus 169 L~~l~~~Y~~f~~~~~~~~~~~~~~~~---~~--~~~~a~~~r~lLvh 211 (280)
T TIGR02277 169 LDEIADAYLAFIQKFGPLYVLLKNKAG---EL--SPAAAFVARLLLVH 211 (280)
T ss_pred HHHHHHHHHHHHHHHHHHHHhhhcccc---CC--CHHHHHHHHHHHHH
Confidence 478999999999998887763211000 01 12455776666654
Done!