Query psy1836
Match_columns 127
No_of_seqs 222 out of 1048
Neff 7.8
Searched_HMMs 46136
Date Fri Aug 16 17:36:10 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy1836.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/1836hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 cd06152 YjgF_YER057c_UK114_lik 100.0 1.5E-37 3.2E-42 209.5 12.9 107 16-122 1-111 (114)
2 TIGR03610 RutC pyrimidine util 100.0 1.3E-36 2.8E-41 208.4 12.5 109 13-123 13-123 (127)
3 cd02198 YjgH_like YjgH belongs 100.0 2.4E-36 5.2E-41 202.5 11.2 107 16-123 1-108 (111)
4 cd06154 YjgF_YER057c_UK114_lik 100.0 6E-36 1.3E-40 202.8 12.7 112 10-122 5-117 (119)
5 PRK11401 putative endoribonucl 100.0 4.8E-36 1E-40 206.0 12.3 112 11-123 10-124 (129)
6 COG0251 TdcF Putative translat 100.0 6.1E-36 1.3E-40 205.8 12.9 112 10-123 11-125 (130)
7 TIGR00004 endoribonuclease L-P 100.0 2.3E-35 5E-40 200.9 13.3 113 10-123 8-121 (124)
8 PF01042 Ribonuc_L-PSP: Endori 100.0 1.6E-35 3.5E-40 201.0 12.4 112 11-123 4-118 (121)
9 cd06150 YjgF_YER057c_UK114_lik 100.0 2.7E-34 5.8E-39 190.8 11.9 101 16-122 1-102 (105)
10 cd06156 eu_AANH_C_2 A group of 100.0 3.8E-33 8.3E-38 189.0 12.4 102 18-122 1-116 (118)
11 cd02199 YjgF_YER057c_UK114_lik 100.0 1.1E-32 2.3E-37 192.2 13.1 116 7-123 5-140 (142)
12 cd06153 YjgF_YER057c_UK114_lik 100.0 1.7E-32 3.6E-37 184.9 11.1 102 18-122 1-112 (114)
13 cd06151 YjgF_YER057c_UK114_lik 100.0 5.3E-32 1.1E-36 185.2 12.5 104 18-122 1-124 (126)
14 cd00448 YjgF_YER057c_UK114_fam 100.0 9.4E-31 2E-35 172.6 11.8 103 18-122 1-105 (107)
15 cd06155 eu_AANH_C_1 A group of 100.0 6.3E-30 1.4E-34 168.8 11.2 94 22-122 4-98 (101)
16 KOG2317|consensus 100.0 4.4E-28 9.6E-33 166.6 9.7 117 6-123 10-127 (138)
17 PF14588 YjgF_endoribonc: YjgF 99.6 1.5E-14 3.3E-19 100.9 9.7 111 12-123 15-145 (148)
18 TIGR01884 cas_HTH CRISPR locus 66.0 28 0.0006 25.2 6.1 48 47-97 36-84 (203)
19 PF02662 FlpD: Methyl-viologen 57.7 55 0.0012 22.0 6.8 58 26-87 54-111 (124)
20 KOG2240|consensus 56.9 20 0.00044 25.5 3.8 77 17-95 77-156 (162)
21 KOG0100|consensus 53.7 41 0.0009 28.0 5.5 52 51-103 342-393 (663)
22 PRK00394 transcription factor; 51.8 28 0.0006 25.1 3.9 38 27-70 50-87 (179)
23 cd04918 ACT_AK1-AT_2 ACT domai 50.0 47 0.001 19.2 4.2 44 54-97 16-65 (65)
24 PHA02094 hypothetical protein 46.8 48 0.001 20.2 3.8 52 70-121 23-80 (81)
25 PF07066 DUF3882: Lactococcus 45.3 1.1E+02 0.0024 21.7 5.9 49 20-73 21-69 (159)
26 PF08868 YugN: YugN-like famil 44.8 26 0.00057 24.1 2.8 30 33-63 103-132 (132)
27 cd04518 TBP_archaea archaeal T 43.3 41 0.00089 24.1 3.7 37 27-69 51-87 (174)
28 COG1908 FrhD Coenzyme F420-red 43.1 1E+02 0.0022 21.1 5.3 44 50-96 75-118 (132)
29 PRK13689 hypothetical protein; 42.7 40 0.00087 20.9 3.1 23 48-70 9-31 (75)
30 PF07736 CM_1: Chorismate muta 42.2 1.1E+02 0.0023 20.7 5.7 41 47-87 15-57 (118)
31 PF04088 Peroxin-13_N: Peroxin 42.2 22 0.00047 25.2 2.1 27 40-66 13-39 (158)
32 PRK14643 hypothetical protein; 41.7 1.3E+02 0.0027 21.4 6.8 65 46-110 3-86 (164)
33 TIGR01796 CM_mono_aroH monofun 40.8 1.1E+02 0.0024 20.6 5.2 37 52-88 20-58 (117)
34 cd04937 ACT_AKi-DapG-BS_2 ACT 39.3 75 0.0016 18.1 4.3 45 52-96 15-63 (64)
35 PF00352 TBP: Transcription fa 39.1 27 0.00058 21.7 2.0 20 47-66 67-86 (86)
36 TIGR02671 cas_csx9 CRISPR-asso 38.7 1.5E+02 0.0033 23.3 6.3 47 45-102 15-61 (377)
37 PF11230 DUF3029: Protein of u 38.5 57 0.0012 27.2 4.2 45 46-90 410-460 (487)
38 PF12262 Lipase_bact_N: Bacter 38.5 80 0.0017 24.3 4.8 52 45-96 202-255 (268)
39 cd02185 AroH Chorismate mutase 38.1 1.3E+02 0.0027 20.4 5.2 38 51-88 19-58 (117)
40 COG1242 Predicted Fe-S oxidore 37.7 2E+02 0.0044 22.6 7.6 70 25-99 44-113 (312)
41 cd04516 TBP_eukaryotes eukaryo 37.5 54 0.0012 23.5 3.6 37 27-69 51-87 (174)
42 cd04517 TLF TBP-like factors ( 37.2 60 0.0013 23.2 3.8 37 27-69 51-87 (174)
43 TIGR02499 HrpE_YscL_not type I 35.6 1.4E+02 0.0031 20.3 6.2 48 74-124 115-162 (166)
44 COG1022 FAA1 Long-chain acyl-C 35.1 29 0.00064 29.8 2.2 32 26-59 195-228 (613)
45 COG3453 Uncharacterized protei 34.6 6.7 0.00014 26.8 -1.4 76 21-97 3-88 (130)
46 PF00816 Histone_HNS: H-NS his 34.3 21 0.00046 22.5 0.9 24 49-72 20-43 (93)
47 PTZ00009 heat shock 70 kDa pro 32.5 1.7E+02 0.0037 25.2 6.4 53 45-98 302-358 (653)
48 PF02108 FliH: Flagellar assem 32.2 1.4E+02 0.0031 19.2 5.3 66 52-123 50-115 (128)
49 KOG0813|consensus 31.9 52 0.0011 25.3 2.9 32 19-57 132-163 (265)
50 KOG3887|consensus 31.8 1.3E+02 0.0027 23.6 4.8 44 52-98 158-204 (347)
51 KOG1179|consensus 31.6 1.2E+02 0.0025 26.4 5.0 44 27-77 247-290 (649)
52 KOG3010|consensus 31.4 1.1E+02 0.0025 23.4 4.6 44 52-99 116-159 (261)
53 KOG1394|consensus 31.1 1.1E+02 0.0023 25.1 4.5 48 53-101 302-351 (440)
54 cd00652 TBP_TLF TATA box bindi 30.7 78 0.0017 22.6 3.5 37 27-69 51-87 (174)
55 PF01071 GARS_A: Phosphoribosy 30.6 61 0.0013 23.7 3.0 53 28-81 124-177 (194)
56 cd04915 ACT_AK-Ectoine_2 ACT d 30.3 1.2E+02 0.0025 17.6 4.3 45 53-97 16-66 (66)
57 cd04919 ACT_AK-Hom3_2 ACT doma 30.0 1.1E+02 0.0023 17.2 4.2 44 53-96 16-65 (66)
58 COG3082 Uncharacterized protei 29.7 1E+02 0.0022 18.9 3.2 23 48-70 9-31 (74)
59 CHL00123 rps6 ribosomal protei 29.7 74 0.0016 20.4 3.0 24 48-71 20-43 (97)
60 KOG2708|consensus 29.4 2.7E+02 0.0058 21.5 6.8 68 39-107 37-112 (336)
61 PLN02633 palmitoyl protein thi 29.4 2.9E+02 0.0063 21.9 7.8 58 43-109 71-128 (314)
62 COG2101 SPT15 TATA-box binding 29.4 76 0.0017 23.1 3.2 35 27-67 150-184 (185)
63 PRK13411 molecular chaperone D 29.3 1.8E+02 0.004 25.1 6.0 52 45-98 297-353 (653)
64 PF07108 PipA: PipA protein; 29.1 56 0.0012 23.5 2.4 34 28-65 40-73 (200)
65 PF01849 NAC: NAC domain; Int 28.3 55 0.0012 18.8 2.0 26 57-82 3-29 (58)
66 PF02256 Fe_hyd_SSU: Iron hydr 27.8 52 0.0011 19.2 1.8 20 80-99 24-43 (60)
67 PF08309 LVIVD: LVIVD repeat; 27.4 92 0.002 16.8 2.6 19 17-35 2-22 (42)
68 PF13964 Kelch_6: Kelch motif 27.3 76 0.0016 17.0 2.4 15 18-32 4-18 (50)
69 PLN00062 TATA-box-binding prot 26.8 2.4E+02 0.0053 20.2 7.8 36 27-68 51-86 (179)
70 cd04922 ACT_AKi-HSDH-ThrA_2 AC 26.8 1.2E+02 0.0027 16.8 4.1 44 53-96 16-65 (66)
71 PF09658 Cas_Csx9: CRISPR-asso 26.3 3.2E+02 0.007 21.7 6.3 47 45-102 15-61 (377)
72 PF14552 Tautomerase_2: Tautom 26.2 1.6E+02 0.0034 18.3 3.9 29 45-73 39-68 (82)
73 PTZ00186 heat shock 70 kDa pre 26.2 2.6E+02 0.0057 24.3 6.4 53 45-98 323-379 (657)
74 PTZ00400 DnaK-type molecular c 25.9 3.2E+02 0.0069 23.7 6.9 53 45-99 337-394 (663)
75 COG0443 DnaK Molecular chapero 25.8 2.2E+02 0.0048 24.3 5.9 52 45-97 283-334 (579)
76 PRK11678 putative chaperone; P 25.6 3.1E+02 0.0066 22.6 6.5 47 47-98 378-425 (450)
77 PRK10328 DNA binding protein, 25.2 70 0.0015 22.0 2.3 26 49-74 52-77 (134)
78 COG3655 Predicted transcriptio 25.2 85 0.0018 19.4 2.5 21 54-74 44-64 (73)
79 PRK00290 dnaK molecular chaper 25.1 2.9E+02 0.0062 23.6 6.4 49 50-99 305-353 (627)
80 PF06526 DUF1107: Protein of u 24.5 88 0.0019 18.8 2.4 39 25-63 21-61 (64)
81 PF01344 Kelch_1: Kelch motif; 24.4 94 0.002 16.2 2.4 14 19-32 5-18 (47)
82 PF08066 PMC2NT: PMC2NT (NUC01 24.3 67 0.0014 20.2 2.0 27 45-71 61-87 (91)
83 PF12205 GIT1_C: G protein-cou 24.3 1.1E+02 0.0024 20.8 3.1 23 45-67 12-34 (123)
84 PRK13410 molecular chaperone D 23.6 2.4E+02 0.0053 24.5 5.8 53 45-98 298-354 (668)
85 TIGR02619 putative CRISPR-asso 23.4 2.2E+02 0.0049 19.8 4.6 28 72-99 34-61 (149)
86 PF07490 Tir_receptor_N: Trans 23.0 88 0.0019 23.4 2.6 35 25-59 151-185 (266)
87 PF01250 Ribosomal_S6: Ribosom 22.8 1.3E+02 0.0028 18.6 3.1 25 48-72 15-39 (92)
88 PHA02754 hypothetical protein; 22.6 1.8E+02 0.0039 17.3 3.7 26 47-72 12-37 (67)
89 COG3797 Uncharacterized protei 22.6 1.6E+02 0.0035 21.3 3.8 70 24-114 42-112 (178)
90 KOG0808|consensus 22.5 1.5E+02 0.0031 23.3 3.8 32 45-76 89-120 (387)
91 PF02120 Flg_hook: Flagellar h 22.2 82 0.0018 19.0 2.1 27 45-71 47-73 (85)
92 TIGR00329 gcp_kae1 metallohydr 22.1 1.5E+02 0.0032 22.9 3.9 29 45-73 43-71 (305)
93 PF00438 S-AdoMet_synt_N: S-ad 22.0 1.5E+02 0.0034 19.3 3.4 51 16-71 34-88 (100)
94 PRK05952 3-oxoacyl-(acyl carri 21.9 2.2E+02 0.0049 22.7 5.0 46 53-99 255-302 (381)
95 PRK14646 hypothetical protein; 21.6 2.9E+02 0.0063 19.3 6.2 58 52-109 7-81 (155)
96 PF00814 Peptidase_M22: Glycop 21.5 2.1E+02 0.0045 21.7 4.5 41 45-86 24-64 (268)
97 PF02801 Ketoacyl-synt_C: Beta 21.2 2.1E+02 0.0045 18.5 4.0 35 45-83 22-56 (119)
98 TIGR03725 bact_YeaZ universal 21.0 1.5E+02 0.0032 21.3 3.5 42 46-88 29-70 (202)
99 PF10548 P22_AR_C: P22AR C-ter 20.7 1.6E+02 0.0035 18.0 3.1 26 46-71 21-46 (74)
100 PLN02606 palmitoyl-protein thi 20.4 4.4E+02 0.0095 20.8 7.5 58 43-109 72-129 (306)
101 PF02200 STE: STE like transcr 20.4 1.2E+02 0.0026 19.9 2.6 20 14-33 10-29 (110)
102 PF14359 DUF4406: Domain of un 20.2 1.6E+02 0.0036 18.6 3.2 57 28-96 1-57 (92)
103 PTZ00112 origin recognition co 20.1 4.8E+02 0.01 24.5 6.9 65 25-99 781-845 (1164)
No 1
>cd06152 YjgF_YER057c_UK114_like_4 YjgF, YER057c, and UK114 belong to a large family of proteins present in bacteria, archaea, and eukaryotes with no definitive function. The conserved domain is similar in structure to chorismate mutase but there is no sequence similarity and no functional connection. Members of this family have been implicated in isoleucine (Yeo7, Ibm1, aldR) and purine (YjgF) biosynthesis, as well as threonine anaerobic degradation (tdcF) and mitochondrial DNA maintenance (Ibm1). This domain homotrimerizes forming a distinct intersubunit cavity that may serve as a small molecule binding site.
Probab=100.00 E-value=1.5e-37 Score=209.51 Aligned_cols=107 Identities=27% Similarity=0.335 Sum_probs=102.2
Q ss_pred CCcceEEEECCEEEEccccccCCCCCcccccHHHHHHHHHHHHHHHHHHcC-CCccceEEEEEEeCCC---CCHHHHHHH
Q psy1836 16 ASAFQAVQHDNTLYISGVLGIDASSGKMAEGVENQAKQALTNMGHILKEAG-GSYEHVVKTTILLNDI---NDFATVNNV 91 (127)
Q Consensus 16 ~~ys~av~~g~~v~iSG~~~~d~~~g~~~~d~~~Q~~~~~~ni~~~L~~~G-~~~~~vv~~~vyl~d~---~~~~~~~~v 91 (127)
.+|||++++|+++|+|||+|.|++.+.+++|+++|++++|+||+++|+++| ++++||+|+|+|++|+ ++++.++++
T Consensus 1 ~~ys~av~~g~~v~~SGq~g~d~~g~~~~~d~~~Q~~~~~~Nl~~~L~~aG~~~~~dVvk~tvyltd~~~~~~~~~~~~~ 80 (114)
T cd06152 1 LHYSQAVRIGDRIEISGQGGWDPDTGKIPEDLEEEIDQAFDNVELALKAAGGKGWEQVYKVNSYHVDIKNEEAFGLMVEN 80 (114)
T ss_pred CCCCCeEEECCEEEEeccCCcCCCCCccCcCHHHHHHHHHHHHHHHHHHhCCCCHHHEEEEEEEEecCCcHHHHHHHHHH
Confidence 379999999999999999999995557779999999999999999999999 9999999999999999 799999999
Q ss_pred HHhhcCCCCCccEEEEcCCCCCCCEEEEEec
Q psy1836 92 YGQFFKPPYPARSTFQVGKLPLIDFCLVVLG 122 (127)
Q Consensus 92 ~~~~f~~~~Pa~t~v~V~~L~~~~~liEi~~ 122 (127)
|.+||++++|+||+++|++|+.|+++||||+
T Consensus 81 ~~~~f~~~~Pa~t~v~V~~L~~p~~lVEIe~ 111 (114)
T cd06152 81 FKKWMPNHQPIWTCVGVTALGLPGMRVEIEV 111 (114)
T ss_pred HHHHcCCCCCCeEEEEeccCCCCCcEEEEEE
Confidence 9999999999999999999999999999996
No 2
>TIGR03610 RutC pyrimidine utilization protein C. This protein is observed in operons extremely similar to that characterized in E. coli K-12 responsible for the import and catabolism of pyrimidines, primarily uracil. This protein is a member of the endoribonuclease L-PSP family defined by pfam01042.
Probab=100.00 E-value=1.3e-36 Score=208.38 Aligned_cols=109 Identities=25% Similarity=0.415 Sum_probs=102.1
Q ss_pred cCCCCcceEEEECCEEEEccccccCCCCCccc--ccHHHHHHHHHHHHHHHHHHcCCCccceEEEEEEeCCCCCHHHHHH
Q psy1836 13 VGSASAFQAVQHDNTLYISGVLGIDASSGKMA--EGVENQAKQALTNMGHILKEAGGSYEHVVKTTILLNDINDFATVNN 90 (127)
Q Consensus 13 ~~~~~ys~av~~g~~v~iSG~~~~d~~~g~~~--~d~~~Q~~~~~~ni~~~L~~~G~~~~~vv~~~vyl~d~~~~~~~~~ 90 (127)
.+.++|||++++|+++|+|||+|.|+ +|... +|+++|++++|+||+++|+++|++++||+|+++|++|+++|+.+++
T Consensus 13 ~~~~~ys~av~~g~~v~vSGq~~~d~-~g~~~~~~d~~~Q~~~~l~ni~~iL~~aG~~~~dvv~~~iyl~d~~~~~~~~~ 91 (127)
T TIGR03610 13 KPLAPFVPGTLADGVVYVSGTLPFDK-DNNVVHVGDAAAQTRHVLETIKSVIETAGGTMDDVTFNHIFIRDWADYAAINE 91 (127)
T ss_pred CCCCCCCCeEEECCEEEEeccCCcCC-CCCeeCCCCHHHHHHHHHHHHHHHHHHcCCCHHHEEEEEEEEcCHHHHHHHHH
Confidence 35668999999999999999999999 77764 7999999999999999999999999999999999999999999999
Q ss_pred HHHhhcCCCCCccEEEEcCCCCCCCEEEEEecc
Q psy1836 91 VYGQFFKPPYPARSTFQVGKLPLIDFCLVVLGG 123 (127)
Q Consensus 91 v~~~~f~~~~Pa~t~v~V~~L~~~~~liEi~~~ 123 (127)
+|.+||++++|+||+++| +|..|+++||||+-
T Consensus 92 ~~~~~f~~~~Pa~t~v~v-~l~~p~~lVEIe~v 123 (127)
T TIGR03610 92 VYAEYFPGEKPARYCIQC-GLVKPDALVEIASV 123 (127)
T ss_pred HHHHHcCCCCCcEEEEEe-ccCCCCCEEEEEEE
Confidence 999999988999999998 57789999999973
No 3
>cd02198 YjgH_like YjgH belongs to a large family of YjgF/YER057c/UK114-like proteins present in bacteria, archaea, and eukaryotes with no definitive function. The conserved domain is similar in structure to chorismate mutase but there is no sequence similarity and no functional connection. Members of this family have been implicated in isoleucine (Yeo7, Ibm1, aldR) and purine (YjgF) biosynthesis, as well as threonine anaerobic degradation (tdcF) and mitochondrial DNA maintenance (Ibm1). This domain homotrimerizes forming a distinct intersubunit cavity that may serve as a small molecule binding site.
Probab=100.00 E-value=2.4e-36 Score=202.48 Aligned_cols=107 Identities=32% Similarity=0.476 Sum_probs=102.3
Q ss_pred CCcceEEEECCEEEEccccccCCCCCcccccHHHHHHHHHHHHHHHHHHcCCCccceEEEEEEeCCC-CCHHHHHHHHHh
Q psy1836 16 ASAFQAVQHDNTLYISGVLGIDASSGKMAEGVENQAKQALTNMGHILKEAGGSYEHVVKTTILLNDI-NDFATVNNVYGQ 94 (127)
Q Consensus 16 ~~ys~av~~g~~v~iSG~~~~d~~~g~~~~d~~~Q~~~~~~ni~~~L~~~G~~~~~vv~~~vyl~d~-~~~~~~~~v~~~ 94 (127)
.+||+++++|+++|+|||+|.|+ +|++++|+++|++++|+||+++|+++|++++||+|+++|++|. ++++.++++|.+
T Consensus 1 ~~ys~av~~g~~l~vSGq~~~d~-~g~~~~d~~~Q~~~~~~ni~~~L~~aG~~~~dvvk~~vyl~~~~~~~~~~~~~~~~ 79 (111)
T cd02198 1 FGYSPAVRVGDTLFVSGQVGSDA-DGSVAEDFEAQFRLAFQNLGAVLEAAGCSFDDVVELTTFHVDMAAHLPAFAAVKDE 79 (111)
T ss_pred CCCcceEEECCEEEEecccCcCC-CCCcCCCHHHHHHHHHHHHHHHHHHcCCCHHHEEEEEEEEeccHHHHHHHHHHHHH
Confidence 37999999999999999999999 8888899999999999999999999999999999999999965 899999999999
Q ss_pred hcCCCCCccEEEEcCCCCCCCEEEEEecc
Q psy1836 95 FFKPPYPARSTFQVGKLPLIDFCLVVLGG 123 (127)
Q Consensus 95 ~f~~~~Pa~t~v~V~~L~~~~~liEi~~~ 123 (127)
||++++|+||+++|++|+.|+++||||+-
T Consensus 80 ~f~~~~Pa~t~v~V~~L~~~~~~vEIe~~ 108 (111)
T cd02198 80 YFKEPYPAWTAVGVAWLARPGLLVEIKVV 108 (111)
T ss_pred HcCCCCCceehhhhhhcCCCCcEEEEEEE
Confidence 99988999999999999999999999973
No 4
>cd06154 YjgF_YER057c_UK114_like_6 This group of proteins belong to a large family of YjgF/YER057c/UK114-like proteins present in bacteria, archaea, and eukaryotes with no definitive function. The conserved domain is similar in structure to chorismate mutase but there is no sequence similarity and no functional connection. Members of this family have been implicated in isoleucine (Yeo7, Ibm1, aldR) and purine (YjgF) biosynthesis, as well as threonine anaerobic degradation (tdcF) and mitochondrial DNA maintenance (Ibm1). This domain homotrimerizes forming a distinct intersubunit cavity that may serve as a small molecule binding site.
Probab=100.00 E-value=6e-36 Score=202.82 Aligned_cols=112 Identities=29% Similarity=0.359 Sum_probs=105.7
Q ss_pred ccccCCCCcceEEEECCEEEEccccccCCCCCcc-cccHHHHHHHHHHHHHHHHHHcCCCccceEEEEEEeCCCCCHHHH
Q psy1836 10 VSLVGSASAFQAVQHDNTLYISGVLGIDASSGKM-AEGVENQAKQALTNMGHILKEAGGSYEHVVKTTILLNDINDFATV 88 (127)
Q Consensus 10 ~~~~~~~~ys~av~~g~~v~iSG~~~~d~~~g~~-~~d~~~Q~~~~~~ni~~~L~~~G~~~~~vv~~~vyl~d~~~~~~~ 88 (127)
..+.+.++|||++++|+++|+|||+|.|+ +|.. ++|+++|++++|+||+++|+++|++++||+|+++|++|+++|+.+
T Consensus 5 ~~~~~~~~ys~av~~g~~l~vSGq~~~d~-~~~~~~~d~~~Q~~~~~~ni~~~L~~aG~~~~dVvk~~vyl~d~~~~~~~ 83 (119)
T cd06154 5 SPWEEQAGYSRAVRVGNWVFVSGTTGYDY-DGMVMPGDAYEQTRQCLEIIEAALAEAGASLEDVVRTRMYVTDIADFEAV 83 (119)
T ss_pred CCcccccCcccEEEECCEEEEeCcCcCCC-CCCCCCCCHHHHHHHHHHHHHHHHHHcCCCHHHEEEEEEEECCHHHHHHH
Confidence 45667889999999999999999999999 5544 489999999999999999999999999999999999999999999
Q ss_pred HHHHHhhcCCCCCccEEEEcCCCCCCCEEEEEec
Q psy1836 89 NNVYGQFFKPPYPARSTFQVGKLPLIDFCLVVLG 122 (127)
Q Consensus 89 ~~v~~~~f~~~~Pa~t~v~V~~L~~~~~liEi~~ 122 (127)
+++|.+||++++|+||+++|..|+.|+++||||+
T Consensus 84 ~~~~~~~f~~~~Part~v~v~~L~~~~~lVEIe~ 117 (119)
T cd06154 84 GRAHGEVFGDIRPAATMVVVSLLVDPEMLVEIEV 117 (119)
T ss_pred HHHHHHHcCCCCCceEEEEecccCCCCcEEEEEE
Confidence 9999999999999999999999999999999996
No 5
>PRK11401 putative endoribonuclease L-PSP; Provisional
Probab=100.00 E-value=4.8e-36 Score=205.98 Aligned_cols=112 Identities=34% Similarity=0.538 Sum_probs=104.6
Q ss_pred cccCCCCcceEEEECCEEEEccccccCCCCCcccccHHHHHHHHHHHHHHHHHHcCCCccceEEEEEEeCCCCCHHHHHH
Q psy1836 11 SLVGSASAFQAVQHDNTLYISGVLGIDASSGKMAEGVENQAKQALTNMGHILKEAGGSYEHVVKTTILLNDINDFATVNN 90 (127)
Q Consensus 11 ~~~~~~~ys~av~~g~~v~iSG~~~~d~~~g~~~~d~~~Q~~~~~~ni~~~L~~~G~~~~~vv~~~vyl~d~~~~~~~~~ 90 (127)
.+.+.++|||++++|+++|+|||+|.|+.+|++++|+++|++++|+||+++|+++|++++||+|+++|++|+++|+.+++
T Consensus 10 ~~~~~~~ys~av~~g~~v~vSGq~~~d~~~~~~~~d~~~Q~~~~~~ni~~~L~aaG~~~~~Vvk~~vyl~d~~~~~~~~~ 89 (129)
T PRK11401 10 APGAIGPYVQGVDLGSMVFTSGQIPVCPQTGEIPADVQDQARLSLENVKAIVVAAGLSVGDIIKMTVFITDLNDFATINE 89 (129)
T ss_pred CCCCCCCccceEEECCEEEEcCcCCccCCCCccCcCHHHHHHHHHHHHHHHHHHcCCCHHHEEEEEEEEccHHHHHHHHH
Confidence 34567899999999999999999999986677779999999999999999999999999999999999999999999999
Q ss_pred HHHhhcCCC---CCccEEEEcCCCCCCCEEEEEecc
Q psy1836 91 VYGQFFKPP---YPARSTFQVGKLPLIDFCLVVLGG 123 (127)
Q Consensus 91 v~~~~f~~~---~Pa~t~v~V~~L~~~~~liEi~~~ 123 (127)
+|.+||+++ +|+||+++|+.|+ ++++||||+-
T Consensus 90 v~~~~f~~~~~~~Part~v~v~~L~-~~~~VEIe~~ 124 (129)
T PRK11401 90 VYKQFFDEHQATYPTRSCVQVARLP-KDVKLEIEAI 124 (129)
T ss_pred HHHHHhCCCCCCCCceEEEEcccCC-CCCeEEEEEE
Confidence 999999974 8999999999997 7899999974
No 6
>COG0251 TdcF Putative translation initiation inhibitor, yjgF family [Translation, ribosomal structure and biogenesis]
Probab=100.00 E-value=6.1e-36 Score=205.81 Aligned_cols=112 Identities=37% Similarity=0.586 Sum_probs=105.8
Q ss_pred ccccCCCCcceEEEECCEEEEccccccCCCCCccc--ccHHHHHHHHHHHHHHHHHHcCCCccceEEEEEEeCCCCCHHH
Q psy1836 10 VSLVGSASAFQAVQHDNTLYISGVLGIDASSGKMA--EGVENQAKQALTNMGHILKEAGGSYEHVVKTTILLNDINDFAT 87 (127)
Q Consensus 10 ~~~~~~~~ys~av~~g~~v~iSG~~~~d~~~g~~~--~d~~~Q~~~~~~ni~~~L~~~G~~~~~vv~~~vyl~d~~~~~~ 87 (127)
..+.+.++|||++++|+++|+|||+|.|+ +|+.. +|+++|++++|+||+++|+++|++++||+|+++|++|+++|+.
T Consensus 11 ~~~~~~~~yS~av~~~~~vfvSGQi~~~~-~g~~v~~~d~~~Q~~~~l~ni~a~L~~aG~~~~~Vvk~~v~l~d~~~f~~ 89 (130)
T COG0251 11 NAPAPIGPYSQAVVAGGLVFVSGQIPLDP-TGELVGGEDIEAQTRQALANIKAVLEAAGSTLDDVVKVTVFLTDMNDFAA 89 (130)
T ss_pred CCCCCCCCccceEEECCEEEEeCcCCcCC-CCcccCCCCHHHHHHHHHHHHHHHHHHcCCCHHHEEEEEEEecCchHHHH
Confidence 44556779999999999999999999999 88888 5999999999999999999999999999999999999999999
Q ss_pred HHHHHHhhcCCC-CCccEEEEcCCCCCCCEEEEEecc
Q psy1836 88 VNNVYGQFFKPP-YPARSTFQVGKLPLIDFCLVVLGG 123 (127)
Q Consensus 88 ~~~v~~~~f~~~-~Pa~t~v~V~~L~~~~~liEi~~~ 123 (127)
+|++|.+||+.+ +|+||+|+|+.|| |+++||||+-
T Consensus 90 ~n~v~~~~f~~~~~PArs~V~v~~l~-~~~~VEIeai 125 (130)
T COG0251 90 MNEVYDEFFEVGGYPARSAVGVALLP-PDALVEIEAI 125 (130)
T ss_pred HHHHHHHHhccCCCCceeEEEhhhCC-CCCeEEEEEE
Confidence 999999999976 9999999999998 9999999984
No 7
>TIGR00004 endoribonuclease L-PSP, putative. This protein was described initially as an inhibitor of protein synthesis intiation but is now viewed as an endoribonuclease active on single-stranded mRNA. The cleavage of mRNA is responsible for the inhibition of protein synthesis. A role in purine regulation has also been suggested.
Probab=100.00 E-value=2.3e-35 Score=200.92 Aligned_cols=113 Identities=43% Similarity=0.659 Sum_probs=105.7
Q ss_pred ccccCCCCcceEEEECCEEEEccccccCCCCCccc-ccHHHHHHHHHHHHHHHHHHcCCCccceEEEEEEeCCCCCHHHH
Q psy1836 10 VSLVGSASAFQAVQHDNTLYISGVLGIDASSGKMA-EGVENQAKQALTNMGHILKEAGGSYEHVVKTTILLNDINDFATV 88 (127)
Q Consensus 10 ~~~~~~~~ys~av~~g~~v~iSG~~~~d~~~g~~~-~d~~~Q~~~~~~ni~~~L~~~G~~~~~vv~~~vyl~d~~~~~~~ 88 (127)
..+.+.++|||++++|+++|+|||+|.|+.+|+++ +|+++|++++|+||+++|+++|++++||+|+++|++|+++|+.+
T Consensus 8 ~~~~~~~~ys~av~~g~~v~vSGq~~~~~~~g~~~~~d~~~Q~~~~~~ni~~~L~~aG~~~~dvv~~~vyv~~~~~~~~~ 87 (124)
T TIGR00004 8 KAPAAIGPYSQAVKVGNTLFVSGQIPLDPSTGELVGGDIAEQAEQVLENLKAILEAAGLSLDDVVKTTVFLTDLNDFAEV 87 (124)
T ss_pred CCCCCCCCCcceEEECCEEEEeeeCCCcCCCCcCCCCCHHHHHHHHHHHHHHHHHHcCCCHHHEEEEEEEEeChHHHHHH
Confidence 34457789999999999999999999999558887 89999999999999999999999999999999999999999999
Q ss_pred HHHHHhhcCCCCCccEEEEcCCCCCCCEEEEEecc
Q psy1836 89 NNVYGQFFKPPYPARSTFQVGKLPLIDFCLVVLGG 123 (127)
Q Consensus 89 ~~v~~~~f~~~~Pa~t~v~V~~L~~~~~liEi~~~ 123 (127)
+++|.+||++++|+||+++|..|+ ++++||||+-
T Consensus 88 ~~~~~~~f~~~~Pa~t~v~v~~L~-~~~~vEIe~v 121 (124)
T TIGR00004 88 NEVYGQYFDEPYPARSAVQVAALP-KGVLVEIEAI 121 (124)
T ss_pred HHHHHHHcCCCCCceEEEECccCC-CCCEEEEEEE
Confidence 999999999889999999999997 7999999973
No 8
>PF01042 Ribonuc_L-PSP: Endoribonuclease L-PSP; InterPro: IPR006175 This domain is found in endoribonuclease, that is active on single-stranded mRNA and inhibits protein synthesis by cleavage of mRNA []. Previously it was thought to inhibit protein synthesis initiation []. This endoribonuclease may also be involved in the regulation of purine biosynthesis []. ; PDB: 3GTZ_B 3V4D_E 1J7H_A 3R0P_D 2IG8_A 1QD9_B 3L7Q_E 3VCZ_A 3QUW_A 2EWC_K ....
Probab=100.00 E-value=1.6e-35 Score=201.02 Aligned_cols=112 Identities=38% Similarity=0.571 Sum_probs=103.1
Q ss_pred cccCCCCcceEEEECCEEEEccccccCCCCCcc-cccHHHHHHHHHHHHHHHHHHcCCCccceEEEEEEeCCCCCHHHHH
Q psy1836 11 SLVGSASAFQAVQHDNTLYISGVLGIDASSGKM-AEGVENQAKQALTNMGHILKEAGGSYEHVVKTTILLNDINDFATVN 89 (127)
Q Consensus 11 ~~~~~~~ys~av~~g~~v~iSG~~~~d~~~g~~-~~d~~~Q~~~~~~ni~~~L~~~G~~~~~vv~~~vyl~d~~~~~~~~ 89 (127)
.+.|.++|||+++.|+++|+|||+|.|+.+|++ ++|+++|++++|+||+++|+++|++++||+|+++|++|+++|+.++
T Consensus 4 a~~p~~~Ys~av~~g~~v~isGq~~~d~~~~~~~~~~~~~Q~~~~l~ni~~~L~~~G~~~~dvv~~~~yl~d~~~~~~~~ 83 (121)
T PF01042_consen 4 APEPIGPYSQAVRAGDTVFISGQVGIDPATGQVVPGDIEEQTRQALDNIERILAAAGASLDDVVKVTVYLTDMSDFPAVN 83 (121)
T ss_dssp SCCCSSSSBSEEEETTEEEEEEEESBCTTTSSBSSSSHHHHHHHHHHHHHHHHHHTTS-GGGEEEEEEEESSGGGHHHHH
T ss_pred CCCCCCCCCCEEEECCEEEEeeeCCcCCCCCcCCCCCHHHHHHHHHHhhhhhhhcCCCcceeEeeeeehhhhhhhhHHHH
Confidence 456899999999999999999999999878888 5999999999999999999999999999999999999999999999
Q ss_pred HHHHhhcCCC--CCccEEEEcCCCCCCCEEEEEecc
Q psy1836 90 NVYGQFFKPP--YPARSTFQVGKLPLIDFCLVVLGG 123 (127)
Q Consensus 90 ~v~~~~f~~~--~Pa~t~v~V~~L~~~~~liEi~~~ 123 (127)
++|.+||++. +|+||+++|..|+ ++++|||++-
T Consensus 84 ~v~~~~f~~~~~~Pa~t~v~v~~L~-~~~~vEIe~~ 118 (121)
T PF01042_consen 84 EVWKEFFPDHPHRPARTTVGVSALP-PGALVEIEAI 118 (121)
T ss_dssp HHHHHHSTSSTS--EEEEEEESBSG-GG-SEEEEEE
T ss_pred HHHHHHhcccCCCCcEEEEEeCcCC-CCCcEEEEEE
Confidence 9999999988 4999999999999 9999999973
No 9
>cd06150 YjgF_YER057c_UK114_like_2 This group of proteins belong to a large family of YjgF/YER057c/UK114-like proteins present in bacteria, archaea, and eukaryotes with no definitive function. The conserved domain is similar in structure to chorismate mutase but there is no sequence similarity and no functional connection. Members of this family have been implicated in isoleucine (Yeo7, Ibm1, aldR) and purine (YjgF) biosynthesis, as well as threonine anaerobic degradation (tdcF) and mitochondrial DNA maintenance (Ibm1). This domain homotrimerizes forming a distinct intersubunit cavity that may serve as a small molecule binding site.
Probab=100.00 E-value=2.7e-34 Score=190.82 Aligned_cols=101 Identities=32% Similarity=0.527 Sum_probs=95.8
Q ss_pred CCcceEEEECCEEEEccccccCCCCCcccccHHHHHHHHHHHHHHHHHHcCCCccceEEEEEEeCCCCCHHHHHHHHHhh
Q psy1836 16 ASAFQAVQHDNTLYISGVLGIDASSGKMAEGVENQAKQALTNMGHILKEAGGSYEHVVKTTILLNDINDFATVNNVYGQF 95 (127)
Q Consensus 16 ~~ys~av~~g~~v~iSG~~~~d~~~g~~~~d~~~Q~~~~~~ni~~~L~~~G~~~~~vv~~~vyl~d~~~~~~~~~v~~~~ 95 (127)
.+|||++++|+++|+|||+|.|+ + +|+++|++++|+||+++|+++|++++||+|+++|++|+++++.++++|.+|
T Consensus 1 ~~~s~av~~g~~v~iSGq~~~~~-~----~~~~~Q~~~~~~nl~~~L~~~G~~~~dvvk~~vyl~d~~~~~~~~~~~~~~ 75 (105)
T cd06150 1 ARMSQAVVHNGTVYLAGQVADDT-S----ADITGQTRQVLAKIDALLAEAGSDKSRILSATIWLADMADFAAMNAVWDAW 75 (105)
T ss_pred CCcCCEEEECCEEEEeCcCCcCC-C----CCHHHHHHHHHHHHHHHHHHcCCCHHHEEEEEEEEccHHHHHHHHHHHHHH
Confidence 37999999999999999999997 3 799999999999999999999999999999999999999999999999999
Q ss_pred cCC-CCCccEEEEcCCCCCCCEEEEEec
Q psy1836 96 FKP-PYPARSTFQVGKLPLIDFCLVVLG 122 (127)
Q Consensus 96 f~~-~~Pa~t~v~V~~L~~~~~liEi~~ 122 (127)
|++ ++|+||++++ .|..|+++||||+
T Consensus 76 f~~~~~Pa~t~v~~-~l~~~~~lvEIe~ 102 (105)
T cd06150 76 VPPGHAPARACVEA-KLADPGYLVEIVV 102 (105)
T ss_pred cCCCCCCCeEEEEe-cccCCCCEEEEEE
Confidence 997 6999999996 6888999999997
No 10
>cd06156 eu_AANH_C_2 A group of hypothetical eukaryotic proteins, characterized by the presence of an adenine nucleotide alpha hydrolase (AANH)-like domain located N-terminal to two distinctly different YjgF-YER057c-UK114-like domains. This CD contains the second of these domains. The YjgF-YER057c-UK114 protein family is a large family of proteins present in bacteria, archaea, and eukaryotes with no definitive function. The conserved domain is similar in structure to chorismate mutase but there is no sequence similarity and no functional connection. Members of this family have been implicated in isoleucine (Yeo7, Ibm1, aldR) and purine (YjgF) biosynthesis, as well as threonine anaerobic degradation (tdcF) and mitochondrial DNA maintenance (Ibm1). This domain homotrimerizes forming a distinct intersubunit cavity that may serve as a small molecule binding site.
Probab=100.00 E-value=3.8e-33 Score=189.03 Aligned_cols=102 Identities=19% Similarity=0.253 Sum_probs=96.5
Q ss_pred cceEEEECCEEEEccccccCCCCCccc-ccHHHHHHHHHHHHHHHHHHcCCCccceEEEEEEeCCCCCHHHHHHHHHhhc
Q psy1836 18 AFQAVQHDNTLYISGVLGIDASSGKMA-EGVENQAKQALTNMGHILKEAGGSYEHVVKTTILLNDINDFATVNNVYGQFF 96 (127)
Q Consensus 18 ys~av~~g~~v~iSG~~~~d~~~g~~~-~d~~~Q~~~~~~ni~~~L~~~G~~~~~vv~~~vyl~d~~~~~~~~~v~~~~f 96 (127)
|||++++|+++|+|||+|.|+.++.++ +|+++|++++|+||+++|+++|+ +||+|+++|++|+++++.++++|.+||
T Consensus 1 yS~av~~~~~i~vSGQ~g~d~~~~~~~~~~~~~Q~~qal~Ni~~vL~~aG~--~dVvk~~iyl~d~~~~~~~~~v~~~~f 78 (118)
T cd06156 1 YSQAIVVPKVAYISGQIGLIPATMTLLEGGITLQAVLSLQHLERVAKAMNV--QWVLAAVCYVTDESSVPIARSAWSKYC 78 (118)
T ss_pred CCceEEECCEEEEEeeCCccCCCCccCCCCHHHHHHHHHHHHHHHHHHcCC--CCEEEEEEEEcChHHHHHHHHHHHHHh
Confidence 899999999999999999999544665 89999999999999999999999 999999999999999999999999999
Q ss_pred CCC-------------CCccEEEEcCCCCCCCEEEEEec
Q psy1836 97 KPP-------------YPARSTFQVGKLPLIDFCLVVLG 122 (127)
Q Consensus 97 ~~~-------------~Pa~t~v~V~~L~~~~~liEi~~ 122 (127)
+.+ +|+||+++|..|| ++++|||++
T Consensus 79 ~~~~~~~~~~~~~~~~~P~~t~v~V~~L~-~~~~VEie~ 116 (118)
T cd06156 79 SELDLEDESRNESDDVNPPLVIVVVPELP-RGALVEWQG 116 (118)
T ss_pred cCccccccccccccCCCCcEEEEEcccCC-CCCeEEEEE
Confidence 974 9999999999998 789999986
No 11
>cd02199 YjgF_YER057c_UK114_like_1 This group of proteins belong to a large family of YjgF/YER057c/UK114-like proteins present in bacteria, archaea, and eukaryotes with no definitive function. The conserved domain is similar in structure to chorismate mutase but there is no sequence similarity and no functional connection. Members of this family have been implicated in isoleucine (Yeo7, Ibm1, aldR) and purine (YjgF) biosynthesis, as well as threonine anaerobic degradation (tdcF) and mitochondrial DNA maintenance (Ibm1). This domain homotrimerizes forming a distinct intersubunit cavity that may serve as a small molecule binding site.
Probab=100.00 E-value=1.1e-32 Score=192.21 Aligned_cols=116 Identities=27% Similarity=0.292 Sum_probs=101.3
Q ss_pred cccccccCCCCcceEEEECCEEEEccccccCCC----CCccc-----ccHHHHHHHHHHHHHHHHHHcCCCcc---ceEE
Q psy1836 7 ASLVSLVGSASAFQAVQHDNTLYISGVLGIDAS----SGKMA-----EGVENQAKQALTNMGHILKEAGGSYE---HVVK 74 (127)
Q Consensus 7 ~~~~~~~~~~~ys~av~~g~~v~iSG~~~~d~~----~g~~~-----~d~~~Q~~~~~~ni~~~L~~~G~~~~---~vv~ 74 (127)
.....+.+.++|||++++|+++|+|||+|.|+. .|.+. +|+++|++++|+||+++|+++|++++ ||+|
T Consensus 5 ~~~~~~~~~~~ys~av~~g~~l~vSGq~~~d~~~~~~~g~i~~~~~~~d~~~Qt~~~~~Ni~~vL~~aG~~~~~~~dVvk 84 (142)
T cd02199 5 ELPPAPAPVGNYVPAVRTGNLLYVSGQLPRVDGKLVYTGKVGADLSVEEGQEAARLCALNALAALKAALGDLDRVKRVVR 84 (142)
T ss_pred cCCCCCCCCCccceEEEECCEEEEeCcCCCCCCCccccCccccccChHHHHHHHHHHHHHHHHHHHHhcCChhhcCCEEE
Confidence 344556778899999999999999999999983 13332 58999999999999999999999988 9999
Q ss_pred EEEEeCCCCCHHH-------HHHHHHhhcCC-CCCccEEEEcCCCCCCCEEEEEecc
Q psy1836 75 TTILLNDINDFAT-------VNNVYGQFFKP-PYPARSTFQVGKLPLIDFCLVVLGG 123 (127)
Q Consensus 75 ~~vyl~d~~~~~~-------~~~v~~~~f~~-~~Pa~t~v~V~~L~~~~~liEi~~~ 123 (127)
+++|++|+++|+. ++++|.+||++ ++|+||+++|+.|+ ++++||||+-
T Consensus 85 ~~vyl~d~~~~~~~~~~~~~~~~v~~~~f~~~~~Part~v~V~~L~-~~~~VEIe~~ 140 (142)
T cd02199 85 LTGFVNSAPDFTEQPKVANGASDLLVEVFGEAGRHARSAVGVASLP-LNAAVEVEAI 140 (142)
T ss_pred EEEEEechHHhhhchhhhHHHHHHHHHHcCCCCCCceEEEEhhhCC-CCCEEEEEEE
Confidence 9999999999987 48899999986 59999999999996 4699999974
No 12
>cd06153 YjgF_YER057c_UK114_like_5 This group of proteins belong to a large family of YjgF/YER057c/UK114-like proteins present in bacteria, archaea, and eukaryotes with no definitive function. The conserved domain is similar in structure to chorismate mutase but there is no sequence similarity and no functional connection. Members of this family have been implicated in isoleucine (Yeo7, Ibm1, aldR) and purine (YjgF) biosynthesis, as well as threonine anaerobic degradation (tdcF) and mitochondrial DNA maintenance (Ibm1). This domain homotrimerizes forming a distinct intersubunit cavity that may serve as a small molecule binding site.
Probab=100.00 E-value=1.7e-32 Score=184.90 Aligned_cols=102 Identities=24% Similarity=0.367 Sum_probs=95.2
Q ss_pred cceEEEE----CCEEEEccccccCCCCCccc-ccHHHHHHHHHHHHHHHHHHcCCC-----ccceEEEEEEeCCCCCHHH
Q psy1836 18 AFQAVQH----DNTLYISGVLGIDASSGKMA-EGVENQAKQALTNMGHILKEAGGS-----YEHVVKTTILLNDINDFAT 87 (127)
Q Consensus 18 ys~av~~----g~~v~iSG~~~~d~~~g~~~-~d~~~Q~~~~~~ni~~~L~~~G~~-----~~~vv~~~vyl~d~~~~~~ 87 (127)
||+|+.+ |+++|+|||+|.|+ +|++. +|+++|++++|+||+.+|+++|++ ++||+|+++|++|+++++.
T Consensus 1 ~s~a~~~~~~~g~~v~vSGq~~~d~-~g~~~~~d~~~Q~~~~l~ni~~~L~~aG~~~~~~~~~dVvk~~vyl~d~~~~~~ 79 (114)
T cd06153 1 FSRATLLAAGGRTHLFISGTASIVG-HGTVHPGDVEAQTRETLENIEALLEAAGRGGGAQFLADLLRLKVYLRDREDLPA 79 (114)
T ss_pred CCCceeeccCCCcEEEEEeECcCCC-CCCCCCCCHHHHHHHHHHHHHHHHHHcCCCCCccchhheeEEEEEEccHHHHHH
Confidence 8999999 89999999999999 78775 899999999999999999999999 9999999999999999999
Q ss_pred HHHHHHhhcCCCCCccEEEEcCCCCCCCEEEEEec
Q psy1836 88 VNNVYGQFFKPPYPARSTFQVGKLPLIDFCLVVLG 122 (127)
Q Consensus 88 ~~~v~~~~f~~~~Pa~t~v~V~~L~~~~~liEi~~ 122 (127)
++++|.+||++++|+ |++. ..|..|+++||||+
T Consensus 80 ~~~v~~~~f~~~~P~-t~~~-~~l~~p~~lvEIe~ 112 (114)
T cd06153 80 VRAILAARLGPAVPA-VFLQ-ADVCRPDLLVEIEA 112 (114)
T ss_pred HHHHHHHHcCCCCCE-EEEE-eeecCCCcEEEEEE
Confidence 999999999987775 7764 67889999999997
No 13
>cd06151 YjgF_YER057c_UK114_like_3 This group of proteins belong to a large family of YjgF/YER057c/UK114-like proteins present in bacteria, archaea, and eukaryotes with no definitive function. The conserved domain is similar in structure to chorismate mutase but there is no sequence similarity and no functional connection. Members of this family have been implicated in isoleucine (Yeo7, Ibm1, aldR) and purine (YjgF) biosynthesis, as well as threonine anaerobic degradation (tdcF) and mitochondrial DNA maintenance (Ibm1). This domain homotrimerizes forming a distinct intersubunit cavity that may serve as a small molecule binding site.
Probab=99.98 E-value=5.3e-32 Score=185.15 Aligned_cols=104 Identities=32% Similarity=0.378 Sum_probs=95.1
Q ss_pred cceEEEEC---CEEEEccccccCCCCCc-----c--cccHHHHHHHHHHHHHHHHHHcCCCccceEEEEEEeC-CCC---
Q psy1836 18 AFQAVQHD---NTLYISGVLGIDASSGK-----M--AEGVENQAKQALTNMGHILKEAGGSYEHVVKTTILLN-DIN--- 83 (127)
Q Consensus 18 ys~av~~g---~~v~iSG~~~~d~~~g~-----~--~~d~~~Q~~~~~~ni~~~L~~~G~~~~~vv~~~vyl~-d~~--- 83 (127)
|||++++. ++||+|||+|.|+ +++ . .+|+++|++++|+||+++|+++|++++||+|+++|++ |.+
T Consensus 1 ~s~~~~v~~~~~~i~vSGq~~~~~-d~~~~~g~~~~~~d~~~Q~~~~l~ni~~~L~~aG~~~~dVvk~~vyl~~~~~~~~ 79 (126)
T cd06151 1 IAQAVEVPAGAATIYLSGTVPAVV-NASAPKGSPARYGDTETQTISVLKRIETILQSQGLTMGDVVKMRVFLVADPALDG 79 (126)
T ss_pred CCceEEeCCCceEEEEeccCCCCC-CCCCCCCcccCCCCHHHHHHHHHHHHHHHHHHcCCCHHHEEEEEEEEecCccccc
Confidence 79999994 7999999999876 433 2 2799999999999999999999999999999999998 565
Q ss_pred --CHHHHHHHHHhhcCC----CCCccEEEEcCCCCCCCEEEEEec
Q psy1836 84 --DFATVNNVYGQFFKP----PYPARSTFQVGKLPLIDFCLVVLG 122 (127)
Q Consensus 84 --~~~~~~~v~~~~f~~----~~Pa~t~v~V~~L~~~~~liEi~~ 122 (127)
+++.++++|.+||++ ++|+||+++|++|+.|+++||||+
T Consensus 80 ~~~~~~~~~~~~~~f~~~~~~~~Pa~t~v~V~~L~~p~~~VEIe~ 124 (126)
T cd06151 80 KMDFAGFMKAYRQFFGTAEQPNKPARSTLQVAGLVNPGWLVEIEV 124 (126)
T ss_pred hhhHHHHHHHHHHHhccccCCCCCceEEEEeeecCCCCcEEEEEE
Confidence 899999999999998 689999999999999999999996
No 14
>cd00448 YjgF_YER057c_UK114_family YjgF, YER057c, and UK114 belong to a large family of proteins present in bacteria, archaea, and eukaryotes with no definitive function. The conserved domain is similar in structure to chorismate mutase but there is no sequence similarity and no functional connection. Members of this family have been implicated in isoleucine (Yeo7, Ibm1, aldR) and purine (YjgF) biosynthesis, as well as threonine anaerobic degradation (tdcF) and mitochondrial DNA maintenance (Ibm1). This domain homotrimerizes forming a distinct intersubunit cavity that may serve as a small molecule binding site.
Probab=99.97 E-value=9.4e-31 Score=172.60 Aligned_cols=103 Identities=39% Similarity=0.626 Sum_probs=98.1
Q ss_pred cceEEEECCEEEEccccccCCCCCcc-cccHHHHHHHHHHHHHHHHHHcCCCccceEEEEEEeCCCCCHHHHHHHHHhhc
Q psy1836 18 AFQAVQHDNTLYISGVLGIDASSGKM-AEGVENQAKQALTNMGHILKEAGGSYEHVVKTTILLNDINDFATVNNVYGQFF 96 (127)
Q Consensus 18 ys~av~~g~~v~iSG~~~~d~~~g~~-~~d~~~Q~~~~~~ni~~~L~~~G~~~~~vv~~~vyl~d~~~~~~~~~v~~~~f 96 (127)
|||++++|+++|+|||+|.|+ ++.. ++|+++|++++|+||+++|+++|++++||+++++|++|+++++.++++|.+||
T Consensus 1 ys~~~~~~~~~~~sGq~~~~~-~~~~~~~~~~~Q~~~~~~ni~~~L~~~g~~~~~iv~~~~yv~~~~~~~~~~~~~~~~~ 79 (107)
T cd00448 1 YSQAVRVGNLVFVSGQIPLDP-DGELVPGDIEAQTRQALENLEAVLEAAGGSLDDVVKVTVYLTDMADFAAVNEVYDEFF 79 (107)
T ss_pred CCCeEEECCEEEEeccCCcCC-CCcccCCCHHHHHHHHHHHHHHHHHHcCCCHHHEEEEEEEEecHHHHHHHHHHHHHHh
Confidence 899999999999999999999 5544 49999999999999999999999999999999999999999999999999999
Q ss_pred CC-CCCccEEEEcCCCCCCCEEEEEec
Q psy1836 97 KP-PYPARSTFQVGKLPLIDFCLVVLG 122 (127)
Q Consensus 97 ~~-~~Pa~t~v~V~~L~~~~~liEi~~ 122 (127)
++ ++|+||+++|..| .|+++||||+
T Consensus 80 ~~~~~Pa~t~v~v~~l-~~~~~VEie~ 105 (107)
T cd00448 80 GEGPPPARTAVGVAAL-PPGALVEIEA 105 (107)
T ss_pred CCCCCCceEEEEeccC-CCCCEEEEEE
Confidence 98 7999999999999 8999999986
No 15
>cd06155 eu_AANH_C_1 A group of hypothetical eukaryotic proteins, characterized by the presence of an adenine nucleotide alpha hydrolase (AANH)-like domain located N-terminal to two distinctly different YjgF-YER057c-UK114-like domains. This CD contains the first of these domains. The YjgF-YER057c-UK114 protein family is a large family of proteins present in bacteria, archaea, and eukaryotes with no definitive function. The conserved domain is similar in structure to chorismate mutase but there is no sequence similarity and no functional connection. Members of this family have been implicated in isoleucine (Yeo7, Ibm1, aldR) and purine (YjgF) biosynthesis, as well as threonine anaerobic degradation (tdcF) and mitochondrial DNA maintenance (Ibm1). This domain homotrimerizes forming a distinct intersubunit cavity that may serve as a small molecule binding site.
Probab=99.97 E-value=6.3e-30 Score=168.80 Aligned_cols=94 Identities=32% Similarity=0.517 Sum_probs=87.7
Q ss_pred EEECCEEEEccccccCCCCCcccccHHHHHHHHHHHHHHHHHHcCCCccceEEEEEEeCCCCCHHHHHHHHHhhcCCC-C
Q psy1836 22 VQHDNTLYISGVLGIDASSGKMAEGVENQAKQALTNMGHILKEAGGSYEHVVKTTILLNDINDFATVNNVYGQFFKPP-Y 100 (127)
Q Consensus 22 v~~g~~v~iSG~~~~d~~~g~~~~d~~~Q~~~~~~ni~~~L~~~G~~~~~vv~~~vyl~d~~~~~~~~~v~~~~f~~~-~ 100 (127)
.++|+++|+|||+|.|+ .+|+++|++++|+||+++|+++|++++||+|+++|++|+++|+.+|++|.+||+++ +
T Consensus 4 ~~~g~~v~vSG~~~~~~-----~~d~~~Q~~~v~~ni~~~L~~aG~~~~dVv~~~iyl~d~~~~~~~n~~~~~~f~~~~~ 78 (101)
T cd06155 4 NRTGGLLWISNVTASES-----DETVEEQMESIFSKLREILQSNGLSLSDILYVTLYLRDMSDFAEVNSVYGTFFDKPNP 78 (101)
T ss_pred EEECCEEEEecCCCCCC-----CCCHHHHHHHHHHHHHHHHHHcCCCHHHEEEEEEEECCHHHHHHHHHHHHHHcCCCCC
Confidence 46799999999999987 26999999999999999999999999999999999999999999999999999964 8
Q ss_pred CccEEEEcCCCCCCCEEEEEec
Q psy1836 101 PARSTFQVGKLPLIDFCLVVLG 122 (127)
Q Consensus 101 Pa~t~v~V~~L~~~~~liEi~~ 122 (127)
|+|+++++ .|+ |+++|||++
T Consensus 79 Par~~v~~-~l~-~~~lvEIe~ 98 (101)
T cd06155 79 PSRVCVEC-GLP-EGCDVQLSC 98 (101)
T ss_pred CceEEEEe-ccC-CCCEEEEEE
Confidence 99999987 677 899999986
No 16
>KOG2317|consensus
Probab=99.95 E-value=4.4e-28 Score=166.64 Aligned_cols=117 Identities=41% Similarity=0.558 Sum_probs=110.3
Q ss_pred ccccccccCCCCcceEEEECCEEEEccccccCCCCCccc-ccHHHHHHHHHHHHHHHHHHcCCCccceEEEEEEeCCCCC
Q psy1836 6 HASLVSLVGSASAFQAVQHDNTLYISGVLGIDASSGKMA-EGVENQAKQALTNMGHILKEAGGSYEHVVKTTILLNDIND 84 (127)
Q Consensus 6 ~~~~~~~~~~~~ys~av~~g~~v~iSG~~~~d~~~g~~~-~d~~~Q~~~~~~ni~~~L~~~G~~~~~vv~~~vyl~d~~~ 84 (127)
-.++|.+.+.+||||++++++++|+|||+|.+|.++++. +++.+|++++++|++.+|+++|++.+.+|+.++|+.|+.+
T Consensus 10 v~S~~Ap~~igPYsQa~~~~~~~~~SGqigl~P~s~~~~~gg~~~q~~q~l~n~~~il~~a~a~~~~~V~~~i~l~d~~~ 89 (138)
T KOG2317|consen 10 VISYWAPANIGPYSQATKANDVVFISGQIGLDPPSMKLVEGGIVDQTEQALLNLEEILKAAGASLDLVVKVTIFLADIID 89 (138)
T ss_pred EeeccCCCCcCChhHheeeCCEEEEeccccccCCCCCEeccchHHHHHHHHHHHHHHHHHhccCccccEEEEEEEecchh
Confidence 367899999999999999999999999999999888888 8899999999999999999999999999999999999999
Q ss_pred HHHHHHHHHhhcCCCCCccEEEEcCCCCCCCEEEEEecc
Q psy1836 85 FATVNNVYGQFFKPPYPARSTFQVGKLPLIDFCLVVLGG 123 (127)
Q Consensus 85 ~~~~~~v~~~~f~~~~Pa~t~v~V~~L~~~~~liEi~~~ 123 (127)
|..+|++|.+||.++.|+|+++.|..||.+ ..||+|..
T Consensus 90 f~~vn~v~~k~~~~~~pars~~~v~alp~~-~~ie~~~i 127 (138)
T KOG2317|consen 90 FAAVNKVYAKYFPKPNPARSCVQVAALPLN-GKIEIECI 127 (138)
T ss_pred HHHHHHHHHHHcCCCCcchhhHHHhhcCCC-CceEEeee
Confidence 999999999999999999999999999999 56787764
No 17
>PF14588 YjgF_endoribonc: YjgF/chorismate_mutase-like, putative endoribonuclease; PDB: 2OTM_B 3D01_D.
Probab=99.58 E-value=1.5e-14 Score=100.94 Aligned_cols=111 Identities=29% Similarity=0.293 Sum_probs=84.1
Q ss_pred ccCCCCcceEEEECCEEEEccccccCCC----CCccc-----ccHHHHHHHHHHHHHHHHHHcCCCccc---eEEEEEEe
Q psy1836 12 LVGSASAFQAVQHDNTLYISGVLGIDAS----SGKMA-----EGVENQAKQALTNMGHILKEAGGSYEH---VVKTTILL 79 (127)
Q Consensus 12 ~~~~~~ys~av~~g~~v~iSG~~~~d~~----~g~~~-----~d~~~Q~~~~~~ni~~~L~~~G~~~~~---vv~~~vyl 79 (127)
+.|.+.|.+++++|+++|+|||++.+.. .|++. ++-++-+|.+.-|+-+.|+++=++++. |+|++.|+
T Consensus 15 ~~p~g~Y~p~~~~G~ll~vSGq~p~~~g~~~~~G~vG~~~s~e~g~~AAr~~~Ln~La~lk~~~G~LdrV~~ivkl~g~V 94 (148)
T PF14588_consen 15 PAPVGNYVPAVRVGNLLYVSGQLPRDDGKLLYTGKVGEDLSVEEGYEAARLCALNALAALKAALGDLDRVKRIVKLTGFV 94 (148)
T ss_dssp ---SSSC-SEEEETTEEEEEEE--EETTEE-SBS-BTTTB-HHHHHHHHHHHHHHHHHHHHHHCTSGGGECEEEEEEEEE
T ss_pred CCCCceeeeEEEECCEEEEeccCcccCCEEeeecCCCCCCCHHHHHHHHHHHHHHHHHHHHHHhCCHhHEeEEEEEEEEE
Confidence 3467889999999999999999998752 23333 567788899999999999998778876 59999999
Q ss_pred CCCCCH-------HHHHHHHHhhcCCC-CCccEEEEcCCCCCCCEEEEEecc
Q psy1836 80 NDINDF-------ATVNNVYGQFFKPP-YPARSTFQVGKLPLIDFCLVVLGG 123 (127)
Q Consensus 80 ~d~~~~-------~~~~~v~~~~f~~~-~Pa~t~v~V~~L~~~~~liEi~~~ 123 (127)
....+| +..++++.+.||+. +++|+.+||..||..-. ||||..
T Consensus 95 ~s~~~F~~~p~V~ngaSdll~~vfGe~G~HaRsAvGv~sLP~~a~-VEie~i 145 (148)
T PF14588_consen 95 NSTPDFTEHPAVANGASDLLVEVFGEAGRHARSAVGVASLPLNAP-VEIELI 145 (148)
T ss_dssp EB-TT---HHHHHHHHHHHHHHHHGGGG-BEEEEEEESC-GGGBS-EEEEEE
T ss_pred ecCCCcccCchhhhhHHHHHHHHhCcCCCCcccccccccCCCCCe-EEEEEE
Confidence 988887 56788899999976 99999999999998764 677754
No 18
>TIGR01884 cas_HTH CRISPR locus-related DNA-binding protein. Most but not all examples of this family are associated with CRISPR loci, a combination of DNA repeats and characteristic proteins encoded near the repeat cluster. The C-terminal region of this protein is homologous to DNA-binding helix-turn-helix domains with predicted transcriptional regulatory activity.
Probab=66.00 E-value=28 Score=25.16 Aligned_cols=48 Identities=21% Similarity=0.415 Sum_probs=33.1
Q ss_pred HHHHHHHHHHHHHHHHHHcCCCccceEEEEEEeCCCCCHHH-HHHHHHhhcC
Q psy1836 47 VENQAKQALTNMGHILKEAGGSYEHVVKTTILLNDINDFAT-VNNVYGQFFK 97 (127)
Q Consensus 47 ~~~Q~~~~~~ni~~~L~~~G~~~~~vv~~~vyl~d~~~~~~-~~~v~~~~f~ 97 (127)
.+++.+++.++++++++..+.. .+++.++-.|..++.. +.++..-..+
T Consensus 36 ~~~~~~~a~~~l~~~~~~~~~~---~~~~~~~~vd~~d~~~~~~~v~~~i~~ 84 (203)
T TIGR01884 36 IEDGARRAVESLRAIISDLGGN---LVEGTIKEIELKDVPSILRQMSDIIKE 84 (203)
T ss_pred CchHHHHHHHHHHHHHHHhccC---CCcceEEEEecCCHHHHHHHHHHHHHh
Confidence 3578999999999999998743 3556677777777654 4444444443
No 19
>PF02662 FlpD: Methyl-viologen-reducing hydrogenase, delta subunit; InterPro: IPR003813 Methyl-viologen-reducing hydrogenase (MVH) is one of the enzymes involved in methanogenesis and coded in the mth-flp-mvh-mrt cluster of methane genes in Methanothermobacter thermautotrophicus (Methanobacterium thermoformicicum) []. No specific functions have been assigned to the delta subunit.; GO: 0015948 methanogenesis, 0055114 oxidation-reduction process
Probab=57.69 E-value=55 Score=21.96 Aligned_cols=58 Identities=19% Similarity=0.177 Sum_probs=40.8
Q ss_pred CEEEEccccccCCCCCcccccHHHHHHHHHHHHHHHHHHcCCCccceEEEEEEeCCCCCHHH
Q psy1836 26 NTLYISGVLGIDASSGKMAEGVENQAKQALTNMGHILKEAGGSYEHVVKTTILLNDINDFAT 87 (127)
Q Consensus 26 ~~v~iSG~~~~d~~~g~~~~d~~~Q~~~~~~ni~~~L~~~G~~~~~vv~~~vyl~d~~~~~~ 87 (127)
+=|++.|..+-|-.. ..-...++.-++.++.+|++.|...+.|--..++..+.+.|..
T Consensus 54 DGV~V~gC~~g~Ch~----~~Gn~~a~~Rv~~~k~~L~~~Gi~~eRv~~~~~~~~~~~~fa~ 111 (124)
T PF02662_consen 54 DGVLVAGCHPGDCHY----REGNYRAEKRVERLKKLLEELGIEPERVRLYWISAPEGKRFAE 111 (124)
T ss_pred CEEEEeCCCCCCCCc----chhhHHHHHHHHHHHHHHHHcCCChhHeEEEEeCcccHHHHHH
Confidence 667777766444211 2445678888999999999999999987666666666555543
No 20
>KOG2240|consensus
Probab=56.86 E-value=20 Score=25.54 Aligned_cols=77 Identities=17% Similarity=0.160 Sum_probs=47.2
Q ss_pred CcceEEEECCEEEEccccccCCCCCccc---ccHHHHHHHHHHHHHHHHHHcCCCccceEEEEEEeCCCCCHHHHHHHHH
Q psy1836 17 SAFQAVQHDNTLYISGVLGIDASSGKMA---EGVENQAKQALTNMGHILKEAGGSYEHVVKTTILLNDINDFATVNNVYG 93 (127)
Q Consensus 17 ~ys~av~~g~~v~iSG~~~~d~~~g~~~---~d~~~Q~~~~~~ni~~~L~~~G~~~~~vv~~~vyl~d~~~~~~~~~v~~ 93 (127)
+--|+...++++-|+|+...++.+.-++ .++-.|.-..++.+...|........+-. .++|.. .++++.+.+-+.
T Consensus 77 ~~vqasl~~nTf~ItG~~~~k~l~EmlPsilsqLg~~sl~~Lr~La~~l~~~~~~~~~~~-~~~~~~-dd~~p~Lvenf~ 154 (162)
T KOG2240|consen 77 PKVQASLAANTFTITGHAETKQLTEMLPSILSQLGPDSLTALRRLAEQLPKQDGKEGADA-ATIEDD-DDDVPELVENFD 154 (162)
T ss_pred ccccccccCCeEEEecCCcccchhhhcchhhhhcChhhHHHHHHHHHhcccccccCCCCc-cchhhc-ccchHHHHHhHH
Confidence 4556666799999999999988444444 34444555666666666664444333333 566665 456666665554
Q ss_pred hh
Q psy1836 94 QF 95 (127)
Q Consensus 94 ~~ 95 (127)
++
T Consensus 155 ~~ 156 (162)
T KOG2240|consen 155 EA 156 (162)
T ss_pred HH
Confidence 44
No 21
>KOG0100|consensus
Probab=53.74 E-value=41 Score=28.01 Aligned_cols=52 Identities=13% Similarity=0.320 Sum_probs=40.9
Q ss_pred HHHHHHHHHHHHHHcCCCccceEEEEEEeCCCCCHHHHHHHHHhhcCCCCCcc
Q psy1836 51 AKQALTNMGHILKEAGGSYEHVVKTTILLNDINDFATVNNVYGQFFKPPYPAR 103 (127)
Q Consensus 51 ~~~~~~ni~~~L~~~G~~~~~vv~~~vyl~d~~~~~~~~~v~~~~f~~~~Pa~ 103 (127)
.|..++-++.+|+.+|+..+||- -.+.+-.-...+.+.++..+||.+..|.+
T Consensus 342 Fr~TlkPv~kvl~Ds~lkKsdid-eiVLVGGsTrIPKvQqllk~fF~GKepsk 393 (663)
T KOG0100|consen 342 FRKTLKPVQKVLEDSDLKKSDID-EIVLVGGSTRIPKVQQLLKDFFNGKEPSK 393 (663)
T ss_pred HHHhhHHHHHHHhhcCcccccCc-eEEEecCcccChhHHHHHHHHhCCCCccC
Confidence 56677789999999999888874 34556566678999999999997665544
No 22
>PRK00394 transcription factor; Reviewed
Probab=51.84 E-value=28 Score=25.09 Aligned_cols=38 Identities=18% Similarity=0.126 Sum_probs=28.5
Q ss_pred EEEEccccccCCCCCcccccHHHHHHHHHHHHHHHHHHcCCCcc
Q psy1836 27 TLYISGVLGIDASSGKMAEGVENQAKQALTNMGHILKEAGGSYE 70 (127)
Q Consensus 27 ~v~iSG~~~~d~~~g~~~~d~~~Q~~~~~~ni~~~L~~~G~~~~ 70 (127)
+||-||-+-... ..-+++++.+++.+..+|+..|....
T Consensus 50 lIf~sGKiv~tG------a~S~~~a~~a~~~~~~~l~~~g~~~~ 87 (179)
T PRK00394 50 LIFRSGKVVCTG------AKSVEDLHEAVKIIIKKLKELGIKVI 87 (179)
T ss_pred EEEcCCcEEEEc------cCCHHHHHHHHHHHHHHHHHcCCCcc
Confidence 677777654422 23457999999999999999998764
No 23
>cd04918 ACT_AK1-AT_2 ACT domains located C-terminal to the catalytic domain of a monofunctional, lysine-sensitive, plant aspartate kinase 1 (AK1). This CD includes the second of two ACT domains located C-terminal to the catalytic domain of a monofunctional, lysine-sensitive, plant aspartate kinase 1 (AK1), which can be synergistically inhibited by S-adenosylmethionine (SAM). This isoenzyme is found in higher plants, Arabidopsis thaliana (AT) and Zea mays, and also in Chlorophyta. In its inactive state, Arabidopsis AK1 binds the effectors lysine and SAM (two molecules each) at the interface of two ACT1 domain subunits. The second ACT domain (ACT2), this CD, does not interact with an effector. Members of this CD belong to the superfamily of ACT regulatory domains.
Probab=49.99 E-value=47 Score=19.22 Aligned_cols=44 Identities=20% Similarity=0.333 Sum_probs=29.0
Q ss_pred HHHHHHHHHHHcCCCccceE------EEEEEeCCCCCHHHHHHHHHhhcC
Q psy1836 54 ALTNMGHILKEAGGSYEHVV------KTTILLNDINDFATVNNVYGQFFK 97 (127)
Q Consensus 54 ~~~ni~~~L~~~G~~~~~vv------~~~vyl~d~~~~~~~~~v~~~~f~ 97 (127)
++.++-..|++.|....-+. ++.+.+.+-+.-..+..++++||.
T Consensus 16 ~~~~i~~aL~~~~I~v~~i~~g~s~~sis~~v~~~~~~~av~~Lh~~f~~ 65 (65)
T cd04918 16 ILERAFHVLYTKGVNVQMISQGASKVNISLIVNDSEAEGCVQALHKSFFE 65 (65)
T ss_pred HHHHHHHHHHHCCCCEEEEEecCccceEEEEEeHHHHHHHHHHHHHHHhC
Confidence 56677788888887774443 555555444444577788888763
No 24
>PHA02094 hypothetical protein
Probab=46.82 E-value=48 Score=20.24 Aligned_cols=52 Identities=10% Similarity=0.049 Sum_probs=33.5
Q ss_pred cceEEEEEEeCCCC-----CHHHHHHHHHhhcCCC-CCccEEEEcCCCCCCCEEEEEe
Q psy1836 70 EHVVKTTILLNDIN-----DFATVNNVYGQFFKPP-YPARSTFQVGKLPLIDFCLVVL 121 (127)
Q Consensus 70 ~~vv~~~vyl~d~~-----~~~~~~~v~~~~f~~~-~Pa~t~v~V~~L~~~~~liEi~ 121 (127)
.+=+.-++++.... .++.+.+-..+..|.. |.|.|+|.-..-.+.+++|||.
T Consensus 23 ~n~l~~t~~l~~~~~k~n~~ye~~~ksig~m~g~~hptaktwvakpnpfrdgvlveir 80 (81)
T PHA02094 23 NNKLSNTIMLPSTRKKLNEKYEFFAKSVGAMIGELHPTAKTWVAKPNPFRDGVLVEIR 80 (81)
T ss_pred HHhccceeeeHHHHHHHHHHHHHHHHHHHHHhcccCccccccccCCCCCccceEEeec
Confidence 34445567776542 2444555555666654 7788888766667899999984
No 25
>PF07066 DUF3882: Lactococcus phage M3 protein; InterPro: IPR009773 This family consists of several Lactococcus bacteriophage 712, middle-3 (M3) proteins of around 160 residues in length. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. The function of this family is unknown.
Probab=45.28 E-value=1.1e+02 Score=21.68 Aligned_cols=49 Identities=14% Similarity=0.171 Sum_probs=38.4
Q ss_pred eEEEECCEEEEccccccCCCCCcccccHHHHHHHHHHHHHHHHHHcCCCccceE
Q psy1836 20 QAVQHDNTLYISGVLGIDASSGKMAEGVENQAKQALTNMGHILKEAGGSYEHVV 73 (127)
Q Consensus 20 ~av~~g~~v~iSG~~~~d~~~g~~~~d~~~Q~~~~~~ni~~~L~~~G~~~~~vv 73 (127)
-|++-|+-++++|...-.+. .++-|-++.++..|+.++++...-..+|+
T Consensus 21 ~A~~~~~~~~~~si~~~~k~-----Ks~~ER~k~ias~Lk~ii~~~d~~~y~i~ 69 (159)
T PF07066_consen 21 WAFFKGSDLVVGSIKAKHKS-----KSFFERAKSIASELKTIIQKYDLKFYIIV 69 (159)
T ss_pred eEEecCCeEEEeeeeecCcc-----cCHHHHHHHHHHHHHHHHHHhCCCcceEE
Confidence 35566888888887565553 68999999999999999999987666654
No 26
>PF08868 YugN: YugN-like family; InterPro: IPR014967 This entry contains proteins related to Bacillus subtilis YugN, they are functionally uncharacterised. ; PDB: 2R5X_A 2PWW_A.
Probab=44.80 E-value=26 Score=24.13 Aligned_cols=30 Identities=17% Similarity=0.240 Sum_probs=20.5
Q ss_pred ccccCCCCCcccccHHHHHHHHHHHHHHHHH
Q psy1836 33 VLGIDASSGKMAEGVENQAKQALTNMGHILK 63 (127)
Q Consensus 33 ~~~~d~~~g~~~~d~~~Q~~~~~~ni~~~L~ 63 (127)
|-|.|+ +..++....++.+..++.|+..|+
T Consensus 103 ~eP~d~-D~~~p~~~v~~~~~~l~ele~~L~ 132 (132)
T PF08868_consen 103 QEPVDK-DAEFPEKWVDKGEELLKELEDELK 132 (132)
T ss_dssp -----T--B-EEHHHHHHHHHHHHHHHHHHH
T ss_pred ccccCC-CCcCCHHHHHHHHHHHHHHHHhhC
Confidence 356677 788888899999999999998885
No 27
>cd04518 TBP_archaea archaeal TATA box binding protein (TBP): TBPs are transcription factors present in archaea and eukaryotes, that recognize promoters and initiate transcription. TBP has been shown to be an essential component of three different transcription initiation complexes: SL1, TFIID and TFIIIB, directing transcription by RNA polymerases I, II and III, respectively. TBP binds directly to the TATA box promoter element, where it nucleates polymerase assembly, thus defining the transcription start site. TBP's binding in the minor groove induces a dramatic DNA bending while its own structure barely changes. The conserved core domain of TBP, which binds to the TATA box, has a bipartite structure, with intramolecular symmetry generating a saddle-shaped structure that sits astride the DNA.
Probab=43.30 E-value=41 Score=24.09 Aligned_cols=37 Identities=16% Similarity=0.177 Sum_probs=27.6
Q ss_pred EEEEccccccCCCCCcccccHHHHHHHHHHHHHHHHHHcCCCc
Q psy1836 27 TLYISGVLGIDASSGKMAEGVENQAKQALTNMGHILKEAGGSY 69 (127)
Q Consensus 27 ~v~iSG~~~~d~~~g~~~~d~~~Q~~~~~~ni~~~L~~~G~~~ 69 (127)
+||-||-+-... ..-+++++.+++.+..+|+..|...
T Consensus 51 lIF~SGKiv~tG------aks~~~a~~a~~~~~~~L~~~g~~~ 87 (174)
T cd04518 51 LIFRSGKMVCTG------AKSVEDLHRAVKEIIKKLKDYGIKV 87 (174)
T ss_pred EEECCCeEEEEc------cCCHHHHHHHHHHHHHHHHhcCCCc
Confidence 677777655432 2345789999999999999999764
No 28
>COG1908 FrhD Coenzyme F420-reducing hydrogenase, delta subunit [Energy production and conversion]
Probab=43.12 E-value=1e+02 Score=21.12 Aligned_cols=44 Identities=20% Similarity=0.183 Sum_probs=30.2
Q ss_pred HHHHHHHHHHHHHHHcCCCccceEEEEEEeCCCCCHHHHHHHHHhhc
Q psy1836 50 QAKQALTNMGHILKEAGGSYEHVVKTTILLNDINDFATVNNVYGQFF 96 (127)
Q Consensus 50 Q~~~~~~ni~~~L~~~G~~~~~vv~~~vyl~d~~~~~~~~~v~~~~f 96 (127)
-+..=+++++.+|++.|...+.|--+. .+.++++.+.+..++|.
T Consensus 75 ka~rR~~~lke~l~elgie~eRv~~~w---iSa~E~ekf~e~~~efv 118 (132)
T COG1908 75 KAKRRMELLKELLKELGIEPERVRVLW---ISAAEGEKFAETINEFV 118 (132)
T ss_pred HHHHHHHHHHHHHHHhCCCcceEEEEE---EehhhHHHHHHHHHHHH
Confidence 345567899999999999999664444 45555655555555553
No 29
>PRK13689 hypothetical protein; Provisional
Probab=42.71 E-value=40 Score=20.91 Aligned_cols=23 Identities=17% Similarity=0.313 Sum_probs=20.1
Q ss_pred HHHHHHHHHHHHHHHHHcCCCcc
Q psy1836 48 ENQAKQALTNMGHILKEAGGSYE 70 (127)
Q Consensus 48 ~~Q~~~~~~ni~~~L~~~G~~~~ 70 (127)
.+|+++++..+-.+|+.+..+.+
T Consensus 9 d~qvE~il~el~~VLeKH~Ap~D 31 (75)
T PRK13689 9 DEQVEQLLAELLAVLEKHKAPTD 31 (75)
T ss_pred HHHHHHHHHHHHHHHHhcCCCcc
Confidence 47999999999999999987654
No 30
>PF07736 CM_1: Chorismate mutase type I; InterPro: IPR008243 Chorismate mutase (CM; 5.4.99.5 from EC) catalyses the reaction at the branch point of the biosynthetic pathway leading to the three aromatic amino acids, phenylalanine, tryptophan and tyrosine (chorismic acid is the last common intermediate, and CM leads to the L-phenylalanine/L-tyrosine branch). It is part of the shikimate pathway, which is present only in bacteria, fungi and plants. This entry represents a family of monofunctional (non-fused) chorismate mutases from Gram-positive bacteria (Firmicutes) and cyanobacteria. Trusted members of the family are found in operons with other enzymes of the chorismate pathways, both up- and downstream of CM (Listeria, Bacillus, Oceanobacillus) or are the sole CM in the genome where the other members of the chorismate pathways are found elsewhere in the genome (Nostoc, Thermosynechococcus). They are monofunctional, homotrimeric, nonallosteric enzymes and are not regulated by the end-product aromatic amino acids. The three types of CM are AroQ class, Prokaryotic type (e.g., IPR008239 from INTERPRO amongst others); AroQ class, Eukaryotic type (IPR008238 from INTERPRO); and AroH class. They fall into two structural folds (AroQ class and AroH class) which are completely unrelated []. The two types of the AroQ structural class (the Escherichia coli CM dimer and the yeast CM monomer) can be structurally superimposed, and the topology of the four-helix bundle forming the active site is conserved []. For additional information please see [, , , , , , ].; PDB: 2CHS_K 2CHT_L 1COM_J 1FNJ_A 1FNK_A 1DBF_C 1UI9_A 1ODE_A 1UFY_A 1XHO_C ....
Probab=42.24 E-value=1.1e+02 Score=20.74 Aligned_cols=41 Identities=20% Similarity=0.247 Sum_probs=24.9
Q ss_pred HHHHHHHHHHHHHHHHHHcCCCccceEEEEEEeC-CCC-CHHH
Q psy1836 47 VENQAKQALTNMGHILKEAGGSYEHVVKTTILLN-DIN-DFAT 87 (127)
Q Consensus 47 ~~~Q~~~~~~ni~~~L~~~G~~~~~vv~~~vyl~-d~~-~~~~ 87 (127)
-++=.+.+-+=+..++++.+.+.+||+++..-+| |+. .||+
T Consensus 15 ~e~I~~at~eLl~~i~~~N~l~~~dIvSi~FT~T~DL~a~fPA 57 (118)
T PF07736_consen 15 PEEILEATRELLEEILERNELSPEDIVSIIFTVTPDLDAAFPA 57 (118)
T ss_dssp HHHHHHHHHHHHHHHHHHTT--GGGEEEEEEEE-TT--SS-TC
T ss_pred HHHHHHHHHHHHHHHHHHcCCCHHHEEEEEEEeCCCcCccChH
Confidence 3333344555667889999999999999988888 443 4543
No 31
>PF04088 Peroxin-13_N: Peroxin 13, N-terminal region; InterPro: IPR007223 Peroxin-13 is a component of the peroxisomal translocation machinery with Peroxin-14 and Peroxin-17. Both termini of Peroxin-13 are oriented to the cytosol. It is required for peroxisomal association of peroxin-14 []. The proteins also contain an SH3 domain (IPR001452 from INTERPRO).; GO: 0016560 protein import into peroxisome matrix, docking, 0005777 peroxisome, 0016021 integral to membrane
Probab=42.23 E-value=22 Score=25.25 Aligned_cols=27 Identities=11% Similarity=0.319 Sum_probs=22.9
Q ss_pred CCcccccHHHHHHHHHHHHHHHHHHcC
Q psy1836 40 SGKMAEGVENQAKQALTNMGHILKEAG 66 (127)
Q Consensus 40 ~g~~~~d~~~Q~~~~~~ni~~~L~~~G 66 (127)
.+.+...+++-++.+|+-||.++.+.|
T Consensus 13 ~~~~~q~~eestr~tFq~IESIV~Afg 39 (158)
T PF04088_consen 13 PNSFTQQAEESTRATFQSIESIVGAFG 39 (158)
T ss_pred ccHHHHHHHHhhHHHHHHHHHHHHHHH
Confidence 444557889999999999999998877
No 32
>PRK14643 hypothetical protein; Provisional
Probab=41.69 E-value=1.3e+02 Score=21.40 Aligned_cols=65 Identities=11% Similarity=0.091 Sum_probs=47.8
Q ss_pred cHHHHHHHHHHHHHHHHHHcCCCccceEE--------EEEEeCC---------CCCHHHHHHHHHhhcCC--CCCccEEE
Q psy1836 46 GVENQAKQALTNMGHILKEAGGSYEHVVK--------TTILLND---------INDFATVNNVYGQFFKP--PYPARSTF 106 (127)
Q Consensus 46 d~~~Q~~~~~~ni~~~L~~~G~~~~~vv~--------~~vyl~d---------~~~~~~~~~v~~~~f~~--~~Pa~t~v 106 (127)
|++....++.+.++.++++.|..+-+|-- +++|+.+ ++|...+++...+.+.. +.|..=.+
T Consensus 3 ~~~~~~~~l~~l~~p~~~~~G~eL~die~~~~~~~~~lrV~Id~~~~~~ggvtldDC~~vSr~is~~LD~~d~i~~~Y~L 82 (164)
T PRK14643 3 DFESIKFQINELVNKELEVLNLKVYEINNLKEFENDMIQILVEDILQANKPLDFDILIKANDLVSNKIDQFIKTSEKYLL 82 (164)
T ss_pred cHHHHHHHHHHHHHHHHHhcCCEEEEEEEEecCCCcEEEEEEecCCCcCCCcCHHHHHHHHHHHHHHhCccCCCCCCeEE
Confidence 67777778888999999999988777631 6888843 23678888888888873 35566666
Q ss_pred EcCC
Q psy1836 107 QVGK 110 (127)
Q Consensus 107 ~V~~ 110 (127)
.|++
T Consensus 83 EVSS 86 (164)
T PRK14643 83 EISS 86 (164)
T ss_pred EecC
Confidence 7763
No 33
>TIGR01796 CM_mono_aroH monofunctional chorismate mutase, gram positive type, clade 1. This model represents a family of monofunctional (non-fused) chorismate mutases from gram positive bacteria (Firmicutes) and cyanobacteria. Trusted members of the family are found in operons with other enzymes of the chorismate pathways, both up- and downstream of CM (Listeria, Bacillus, Oceanobacillus) or are the sole CM in the genome where the other members of the chorismate pathways are found elsewhere in the genome (Nostoc, Thermosynechococcus).
Probab=40.82 E-value=1.1e+02 Score=20.63 Aligned_cols=37 Identities=8% Similarity=0.261 Sum_probs=27.7
Q ss_pred HHHHHHHHHHHHHcCCCccceEEEEEEeCC-CC-CHHHH
Q psy1836 52 KQALTNMGHILKEAGGSYEHVVKTTILLND-IN-DFATV 88 (127)
Q Consensus 52 ~~~~~ni~~~L~~~G~~~~~vv~~~vyl~d-~~-~~~~~ 88 (127)
..+-+=++.++++.+.+.+||+++..-+|. +. .|++.
T Consensus 20 ~at~eLl~~ii~~N~l~~edivSv~FT~T~DL~a~FPA~ 58 (117)
T TIGR01796 20 EAVAELLTELMERNELTPEDLISVIFTVTEDLHADFPAA 58 (117)
T ss_pred HHHHHHHHHHHHHcCCCHHHEEEEEEEecCcccccChHH
Confidence 334445678899999999999999998884 43 46653
No 34
>cd04937 ACT_AKi-DapG-BS_2 ACT domains of the diaminopimelate-sensitive aspartokinase (AK) isoenzyme AKI. This CD includes the C-terminal of the two ACT domains of the diaminopimelate-sensitive aspartokinase (AK) isoenzyme AKI, a monofunctional class enzyme found in Bacilli (Bacillus subtilis (BS) strain 168), Clostridia, and Actinobacteria bacterial species. In B. subtilis, the regulation of the diaminopimelate-lysine biosynthetic pathway involves dual control by diaminopimelate and lysine, effected through separate diaminopimelate- and lysine-sensitive AK isoenzymes. AKI activity is invariant during the exponential and stationary phases of growth and is not altered by addition of amino acids to the growth medium. The role of this isoenzyme is most likely to provide a constant level of aspartyl-beta-phosphate for the biosynthesis of diaminopimelate for peptidoglycan synthesis and dipicolinate during sporulation. The BS AKI is tetrameric consisting of two alpha and two beta subunits; th
Probab=39.30 E-value=75 Score=18.15 Aligned_cols=45 Identities=13% Similarity=0.119 Sum_probs=27.6
Q ss_pred HHHHHHHHHHHHHcCCCccceE----EEEEEeCCCCCHHHHHHHHHhhc
Q psy1836 52 KQALTNMGHILKEAGGSYEHVV----KTTILLNDINDFATVNNVYGQFF 96 (127)
Q Consensus 52 ~~~~~ni~~~L~~~G~~~~~vv----~~~vyl~d~~~~~~~~~v~~~~f 96 (127)
..++.++-..|++.|.....+. ++.+.+.+-+.-..++.++++|+
T Consensus 15 ~gi~~~if~aL~~~~I~v~~~~~Se~~is~~v~~~~~~~av~~Lh~~f~ 63 (64)
T cd04937 15 PGVMAKIVGALSKEGIEILQTADSHTTISCLVSEDDVKEAVNALHEAFE 63 (64)
T ss_pred cCHHHHHHHHHHHCCCCEEEEEcCccEEEEEEcHHHHHHHHHHHHHHhc
Confidence 3456677778888888776554 44444444333456677777764
No 35
>PF00352 TBP: Transcription factor TFIID (or TATA-binding protein, TBP); InterPro: IPR000814 The TATA-box binding protein (TBP) is required for the initiation of transcription by RNA polymerases I, II and III, from promoters with or without a TATA box [, ]. TBP associates with a host of factors, including the general transcription factors TFIIA, -B, -D, -E, and -H, to form huge multi-subunit pre-initiation complexes on the core promoter. Through its association with different transcription factors, TBP can initiate transcription from different RNA polymerases. There are several related TBPs, including TBP-like (TBPL) proteins []. The C-terminal core of TBP (~180 residues) is highly conserved and contains two 77-amino acid repeats that produce a saddle-shaped structure that straddles the DNA; this region binds to the TATA box and interacts with transcription factors and regulatory proteins []. By contrast, the N-terminal region varies in both length and sequence.; GO: 0003677 DNA binding, 0006355 regulation of transcription, DNA-dependent, 0006367 transcription initiation from RNA polymerase II promoter; PDB: 1D3U_A 1PCZ_B 1AIS_A 1NGM_A 1TBP_A 1TBA_B 1YTB_A 1RM1_A 1YTF_A 1NH2_A ....
Probab=39.08 E-value=27 Score=21.74 Aligned_cols=20 Identities=35% Similarity=0.522 Sum_probs=17.0
Q ss_pred HHHHHHHHHHHHHHHHHHcC
Q psy1836 47 VENQAKQALTNMGHILKEAG 66 (127)
Q Consensus 47 ~~~Q~~~~~~ni~~~L~~~G 66 (127)
=+++++.+++.+..+|++.|
T Consensus 67 s~~~~~~a~~~i~~~L~~~~ 86 (86)
T PF00352_consen 67 SEEEAKKAIEKILPILQKLG 86 (86)
T ss_dssp SHHHHHHHHHHHHHHHHHTT
T ss_pred CHHHHHHHHHHHHHHHHHcC
Confidence 45789999999999998876
No 36
>TIGR02671 cas_csx9 CRISPR-associated protein (provisional), Csx9 family. Members of this family, so far, are archaeal proteins found in CRISPR-associated (cas) gene regions. So far, this rare cas protein is found in only three genomes: Pyrococcus horikoshii shinkaj OT3, Pyrococcus abyssi GE5, and Thermococcus kodakarensis KOD1. In each case it is found immediately upstream of cas3 in loci that resemble the Apern type but lack Csa1 and Csa4 genes.
Probab=38.67 E-value=1.5e+02 Score=23.30 Aligned_cols=47 Identities=19% Similarity=0.234 Sum_probs=33.0
Q ss_pred ccHHHHHHHHHHHHHHHHHHcCCCccceEEEEEEeCCCCCHHHHHHHHHhhcCCCCCc
Q psy1836 45 EGVENQAKQALTNMGHILKEAGGSYEHVVKTTILLNDINDFATVNNVYGQFFKPPYPA 102 (127)
Q Consensus 45 ~d~~~Q~~~~~~ni~~~L~~~G~~~~~vv~~~vyl~d~~~~~~~~~v~~~~f~~~~Pa 102 (127)
.++.+-.+.+++.++.-++ .+.+ . -...|...+++++++||+..+|-
T Consensus 15 ~~lad~~e~~~dslk~r~~-~~tS--k--------igrNDr~S~~~v~~~WF~~~~Pe 61 (377)
T TIGR02671 15 EELADALEGALDSLKSRAM-INTS--K--------IGRNDRNSFDRVLSAWFGRSAPE 61 (377)
T ss_pred HHHHHHHHHHHHHHHHHHh-cccc--c--------ccccchhhHHHHHHHHhCCCChH
Confidence 5677777777777776666 3332 1 13568899999999999977663
No 37
>PF11230 DUF3029: Protein of unknown function (DUF3029); InterPro: IPR016905 Members of this family are homologs to enzymes known to undergo activation by a radical SAM protein to create an active site glycyl radical. This family appears to be activated by the YjjW radical SAM protein, usually encoded by an adjacent gene.
Probab=38.52 E-value=57 Score=27.21 Aligned_cols=45 Identities=20% Similarity=0.099 Sum_probs=31.7
Q ss_pred cHHHHHHHHHHHHHHHHHH------cCCCccceEEEEEEeCCCCCHHHHHH
Q psy1836 46 GVENQAKQALTNMGHILKE------AGGSYEHVVKTTILLNDINDFATVNN 90 (127)
Q Consensus 46 d~~~Q~~~~~~ni~~~L~~------~G~~~~~vv~~~vyl~d~~~~~~~~~ 90 (127)
+++.-=+.+++-++..++. +..+-+|+||+|-|++..+|.+.+.+
T Consensus 410 T~k~NP~A~ldi~kGAf~~G~Ryfs~y~~~sDlVRVTGYlVK~SdieK~~~ 460 (487)
T PF11230_consen 410 TVKNNPEALLDICKGAFSLGMRYFSAYVSDSDLVRVTGYLVKRSDIEKFRQ 460 (487)
T ss_pred hhHhCHHHHHHHHHHHHhcCCeEEEEecCCCceEEEeeeeehhhhHHHHHc
Confidence 4444445566666666653 23567899999999999999877765
No 38
>PF12262 Lipase_bact_N: Bacterial virulence factor lipase N-terminal
Probab=38.51 E-value=80 Score=24.31 Aligned_cols=52 Identities=13% Similarity=0.170 Sum_probs=39.9
Q ss_pred ccHHHHHHHHHHHHHHHHHHcCCCccceEEEEEEeCCCC--CHHHHHHHHHhhc
Q psy1836 45 EGVENQAKQALTNMGHILKEAGGSYEHVVKTTILLNDIN--DFATVNNVYGQFF 96 (127)
Q Consensus 45 ~d~~~Q~~~~~~ni~~~L~~~G~~~~~vv~~~vyl~d~~--~~~~~~~v~~~~f 96 (127)
.+-..|.+++.++.+.+++++|.+.++|+--..|-|.-. -+..+..+...-|
T Consensus 202 ~~~l~~lq~li~~~e~~~~~~gv~~~~IvyS~~FTTqS~~~vl~a~k~~~a~~~ 255 (268)
T PF12262_consen 202 TGALAQLQQLINSYEALFAAAGVDKDDIVYSSWFTTQSVGDVLEAVKAAIASGL 255 (268)
T ss_pred hHHHHHHHHHHHHHHHHHHhcCCCccceEEEeeeeccccHHHHHHHHHHHHhhh
Confidence 456678899999999999999999999999999988542 2455555555444
No 39
>cd02185 AroH Chorismate mutase (AroH) is one of at least five chorismate-utilizing enzymes present in microorganisms that catalyze the rearrangement of chorismate to prephenic acid, the first committed step in the biosynthesis of aromatic amino acids. In prokaryotes, chorismate mutase may be fused to prephenate dehydratase, prephenate dehydrogenase, or 3-deoxy-D-arabino-heptulosonat-7-phosphate (DAHP) as part of a bifunctional enzyme. The AroH domain forms a homotrimer with three-fold symmetry.
Probab=38.13 E-value=1.3e+02 Score=20.38 Aligned_cols=38 Identities=8% Similarity=0.181 Sum_probs=28.4
Q ss_pred HHHHHHHHHHHHHHcCCCccceEEEEEEeCC-CC-CHHHH
Q psy1836 51 AKQALTNMGHILKEAGGSYEHVVKTTILLND-IN-DFATV 88 (127)
Q Consensus 51 ~~~~~~ni~~~L~~~G~~~~~vv~~~vyl~d-~~-~~~~~ 88 (127)
.+.+-+=++.++++.+.+.+||+++..-+|. +. .||+.
T Consensus 19 ~~at~eLl~~i~~~N~l~~edivSv~FT~T~DL~a~FPA~ 58 (117)
T cd02185 19 LEATRELLEEIIERNNIKPEDIISVIFTVTPDLDAAFPAK 58 (117)
T ss_pred HHHHHHHHHHHHHHcCCCHHHEEEEEEEeCCcccccChHH
Confidence 3344456678899999999999999998884 53 46654
No 40
>COG1242 Predicted Fe-S oxidoreductase [General function prediction only]
Probab=37.72 E-value=2e+02 Score=22.64 Aligned_cols=70 Identities=14% Similarity=0.108 Sum_probs=48.4
Q ss_pred CCEEEEccccccCCCCCcccccHHHHHHHHHHHHHHHHHHcCCCccceEEEEEEeCCCCCHHHHHHHHHhhcCCC
Q psy1836 25 DNTLYISGVLGIDASSGKMAEGVENQAKQALTNMGHILKEAGGSYEHVVKTTILLNDINDFATVNNVYGQFFKPP 99 (127)
Q Consensus 25 g~~v~iSG~~~~d~~~g~~~~d~~~Q~~~~~~ni~~~L~~~G~~~~~vv~~~vyl~d~~~~~~~~~v~~~~f~~~ 99 (127)
|+..|+|-..+-|. .++...++++|.+...+.+.+--.... -++-..-|..--+..+.+.+.|.+.+..+
T Consensus 44 GGCtFC~~~g~~d~-~~~~~~~i~~Q~~~q~~~~~kK~~~~k----yiaYFQ~~TNTyApvevLre~ye~aL~~~ 113 (312)
T COG1242 44 GGCTFCSVAGSGDF-AGQPKISIAEQFKEQAERMHKKWKRGK----YIAYFQAYTNTYAPVEVLREMYEQALSEA 113 (312)
T ss_pred CceeeecCCCCCcc-ccCcccCHHHHHHHHHHHHHHhhcCCc----EEEEEeccccccCcHHHHHHHHHHHhCcC
Confidence 78899876555555 455557899998887776665433332 56666666666677788888888887754
No 41
>cd04516 TBP_eukaryotes eukaryotic TATA box binding protein (TBP): Present in archaea and eukaryotes, TBPs are transcription factors that recognize promoters and initiate transcription. TBP has been shown to be an essential component of three different transcription initiation complexes: SL1, TFIID and TFIIIB, directing transcription by RNA polymerases I, II and III, respectively. TBP binds directly to the TATA box promoter element, where it nucleates polymerase assembly, thus defining the transcription start site. TBP's binding in the minor groove induces a dramatic DNA bending while its own structure barely changes. The conserved core domain of TBP, which binds to the TATA box, has a bipartite structure, with intramolecular symmetry generating a saddle-shaped structure that sits astride the DNA.
Probab=37.50 E-value=54 Score=23.48 Aligned_cols=37 Identities=19% Similarity=0.260 Sum_probs=27.5
Q ss_pred EEEEccccccCCCCCcccccHHHHHHHHHHHHHHHHHHcCCCc
Q psy1836 27 TLYISGVLGIDASSGKMAEGVENQAKQALTNMGHILKEAGGSY 69 (127)
Q Consensus 27 ~v~iSG~~~~d~~~g~~~~d~~~Q~~~~~~ni~~~L~~~G~~~ 69 (127)
+||-||-+-+.. .--+++++.+++++..+|+..|...
T Consensus 51 lIF~SGKiviTG------aks~e~a~~a~~~i~~~L~~~g~~~ 87 (174)
T cd04516 51 LIFSSGKMVCTG------AKSEDDSKLAARKYARIIQKLGFPA 87 (174)
T ss_pred EEECCCeEEEEe------cCCHHHHHHHHHHHHHHHHHcCCCC
Confidence 677777655422 2345789999999999999999754
No 42
>cd04517 TLF TBP-like factors (TLF; also called TLP, TRF, TRP), which are found in most metazoans. TLFs and TBPs have well-conserved core domains; however, they only share about 60% similarity. TLFs, like TBPs, interact with TFIIA and TFIIB, which are part of the basal transcription machinery. Yet, in contrast to TBPs, TLFs seem not to interact with the TATA-box and even have a negative effect on the transcription of TATA-containing promoters. Recent results indicate that TLFs are involved in the transcription via TATA-less promoters.
Probab=37.19 E-value=60 Score=23.22 Aligned_cols=37 Identities=27% Similarity=0.377 Sum_probs=26.7
Q ss_pred EEEEccccccCCCCCcccccHHHHHHHHHHHHHHHHHHcCCCc
Q psy1836 27 TLYISGVLGIDASSGKMAEGVENQAKQALTNMGHILKEAGGSY 69 (127)
Q Consensus 27 ~v~iSG~~~~d~~~g~~~~d~~~Q~~~~~~ni~~~L~~~G~~~ 69 (127)
+||-||-+-+.. ..-+++++.+++.+..+|+..|...
T Consensus 51 lIF~sGKiviTG------aks~~~~~~a~~~~~~~l~~~g~~~ 87 (174)
T cd04517 51 SVWSSGKITITG------ATSEEEAKQAARRAARLLQKLGFKV 87 (174)
T ss_pred EEECCCeEEEEc------cCCHHHHHHHHHHHHHHHHHcCCCc
Confidence 667666554422 2346788999999999999999765
No 43
>TIGR02499 HrpE_YscL_not type III secretion apparatus protein, HrpE/YscL family. This model is related to Pfam model pfam06188, but is broader. pfam06188 describes HrpE-like proteins, components of bacterial type III secretion systems primarily in bacteria that infect plants. This model includes also the homologous proteins of animal pathogens, such as YscL of Yersinia pestis. This model excludes the related protein FliH of the bacterial flagellar apparatus (see pfam02108)
Probab=35.62 E-value=1.4e+02 Score=20.30 Aligned_cols=48 Identities=10% Similarity=-0.008 Sum_probs=34.8
Q ss_pred EEEEEeCCCCCHHHHHHHHHhhcCCCCCccEEEEcCCCCCCCEEEEEeccc
Q psy1836 74 KTTILLNDINDFATVNNVYGQFFKPPYPARSTFQVGKLPLIDFCLVVLGGH 124 (127)
Q Consensus 74 ~~~vyl~d~~~~~~~~~v~~~~f~~~~Pa~t~v~V~~L~~~~~liEi~~~~ 124 (127)
.+++++ +.+|++.+.+.+.+++...+ .+.+.=..|...++.||=..|.
T Consensus 115 ~v~I~v-~P~d~~~l~~~l~~~~~~~~--~~i~~D~~l~~G~c~vet~~G~ 162 (166)
T TIGR02499 115 RLTLRV-HPEQLDEVREALAERLALEP--WELEPDASLAPGACVLETESGV 162 (166)
T ss_pred ceEEEE-CHHHHHHHHHHHHHHhccCC--eEEeeCCCCCCCCEEEEeCCce
Confidence 445554 67788888888888775443 6666656788999999887764
No 44
>COG1022 FAA1 Long-chain acyl-CoA synthetases (AMP-forming) [Lipid metabolism]
Probab=35.14 E-value=29 Score=29.83 Aligned_cols=32 Identities=22% Similarity=0.281 Sum_probs=25.5
Q ss_pred CEEEEccccccCCCCCccc--ccHHHHHHHHHHHHH
Q psy1836 26 NTLYISGVLGIDASSGKMA--EGVENQAKQALTNMG 59 (127)
Q Consensus 26 ~~v~iSG~~~~d~~~g~~~--~d~~~Q~~~~~~ni~ 59 (127)
-++||||++|. | .|-+. ..+..|+..+.+++.
T Consensus 195 tiiYTSGTTG~-P-KGVmLth~N~~~~v~~~~~~~~ 228 (613)
T COG1022 195 TIIYTSGTTGT-P-KGVMLTHRNLLAQVAGIDEVLP 228 (613)
T ss_pred EEEEcCCCCCC-C-ceEEEehHHHHHHHHHHHhhCC
Confidence 47999999987 5 67665 788888888887775
No 45
>COG3453 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=34.57 E-value=6.7 Score=26.83 Aligned_cols=76 Identities=14% Similarity=0.139 Sum_probs=46.6
Q ss_pred EEEECCEEEEccccccCCC-----CC--ccc---ccHHHHHHHHHHHHHHHHHHcCCCccceEEEEEEeCCCCCHHHHHH
Q psy1836 21 AVQHDNTLYISGVLGIDAS-----SG--KMA---EGVENQAKQALTNMGHILKEAGGSYEHVVKTTILLNDINDFATVNN 90 (127)
Q Consensus 21 av~~g~~v~iSG~~~~d~~-----~g--~~~---~d~~~Q~~~~~~ni~~~L~~~G~~~~~vv~~~vyl~d~~~~~~~~~ 90 (127)
.+++++.+.+|||+..+.- .| .++ +|-|+--.--++.++...+++|++..+| =++-+-...++...+.+
T Consensus 3 i~~I~d~lsVsgQi~~~D~~~iaa~GFksiI~nRPDgEe~~QP~~~~i~~aa~~aGl~y~~i-PV~~~~iT~~dV~~f~~ 81 (130)
T COG3453 3 IRRINDRLSVSGQISPADIASIAALGFKSIICNRPDGEEPGQPGFAAIAAAAEAAGLTYTHI-PVTGGGITEADVEAFQR 81 (130)
T ss_pred ceecccceeecCCCCHHHHHHHHHhccceecccCCCCCCCCCCChHHHHHHHHhcCCceEEe-ecCCCCCCHHHHHHHHH
Confidence 4678899999999765321 01 111 4444444446778889999999987665 23332223345667777
Q ss_pred HHHhhcC
Q psy1836 91 VYGQFFK 97 (127)
Q Consensus 91 v~~~~f~ 97 (127)
.+.+.=+
T Consensus 82 Al~eaeg 88 (130)
T COG3453 82 ALDEAEG 88 (130)
T ss_pred HHHHhCC
Confidence 7666543
No 46
>PF00816 Histone_HNS: H-NS histone family Partial NMR structure.; InterPro: IPR001801 The histone-like nucleoid-structuring (H-NS) protein belongs to a family of bacterial proteins that play a role in the formation of nucleoid structure and affect gene expression under certain conditions [].; GO: 0003677 DNA binding, 0006355 regulation of transcription, DNA-dependent, 0005622 intracellular; PDB: 2LEV_A 1HNS_A 1LR1_B 1HNR_A 1NI8_A 1OV9_A 2JR1_A 3NR7_A 2L93_A 2L92_A.
Probab=34.29 E-value=21 Score=22.51 Aligned_cols=24 Identities=21% Similarity=0.375 Sum_probs=18.8
Q ss_pred HHHHHHHHHHHHHHHHcCCCccce
Q psy1836 49 NQAKQALTNMGHILKEAGGSYEHV 72 (127)
Q Consensus 49 ~Q~~~~~~ni~~~L~~~G~~~~~v 72 (127)
.+.+.++..|..++++.|.+++|+
T Consensus 20 ~e~~~~~~~i~~~~~~~Gis~~el 43 (93)
T PF00816_consen 20 QEREEAIAEIRELMAEYGISPEEL 43 (93)
T ss_dssp HCCHHHHHHHHHHHHHTT--HHHC
T ss_pred HHHHHHHHHHHHHHHHhCCCHHHh
Confidence 345668999999999999999999
No 47
>PTZ00009 heat shock 70 kDa protein; Provisional
Probab=32.53 E-value=1.7e+02 Score=25.19 Aligned_cols=53 Identities=21% Similarity=0.340 Sum_probs=38.9
Q ss_pred ccHHHHHH----HHHHHHHHHHHHcCCCccceEEEEEEeCCCCCHHHHHHHHHhhcCC
Q psy1836 45 EGVENQAK----QALTNMGHILKEAGGSYEHVVKTTILLNDINDFATVNNVYGQFFKP 98 (127)
Q Consensus 45 ~d~~~Q~~----~~~~ni~~~L~~~G~~~~~vv~~~vyl~d~~~~~~~~~v~~~~f~~ 98 (127)
.++++-++ .+++.++.+|+.+|.+.++|- --+.+=.....+.+.+...++|+.
T Consensus 302 ~~fe~l~~~l~~~~~~~i~~~L~~a~~~~~~i~-~ViLvGGssriP~v~~~i~~~f~~ 358 (653)
T PTZ00009 302 ARFEELCGDYFRNTLQPVEKVLKDAGMDKRSVH-EVVLVGGSTRIPKVQSLIKDFFNG 358 (653)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHcCCCHHHCc-EEEEECCCCCChhHHHHHHHHhCC
Confidence 55655544 456678889999999998874 233344567889999999999974
No 48
>PF02108 FliH: Flagellar assembly protein FliH; InterPro: IPR018035 This entry represents a region found in the flagellar assembly protein FliH, as well as in type III secretion system protein HrpE. Many flagellar proteins are exported by a flagellum-specific export pathway. Attempts have been made to characterise the apparatus responsible for this process, by designing assays to screen for mutants with export defects. Experiments involving filament removal from temperature-sensitive flagellar mutants of Salmonella typhimurium have shown that, while most mutants were able to regrow filaments, flhA, fliH, fliI and fliN mutants showed no or greatly reduced regrowth. This suggests that the corresponding gene products are involved in the process of flagellum-specific export []. The sequence of fliH has been deduced and shown to encode a protein of molecular mass of 25,782 Da. Bacterial HrpE proteins are belived to function on the type III secretion system, specifically the secretion of HrpZ (harpinPss) [].
Probab=32.19 E-value=1.4e+02 Score=19.18 Aligned_cols=66 Identities=15% Similarity=0.120 Sum_probs=42.0
Q ss_pred HHHHHHHHHHHHHcCCCccceEEEEEEeCCCCCHHHHHHHHHhhcCCCCCccEEEEcCCCCCCCEEEEEecc
Q psy1836 52 KQALTNMGHILKEAGGSYEHVVKTTILLNDINDFATVNNVYGQFFKPPYPARSTFQVGKLPLIDFCLVVLGG 123 (127)
Q Consensus 52 ~~~~~ni~~~L~~~G~~~~~vv~~~vyl~d~~~~~~~~~v~~~~f~~~~Pa~t~v~V~~L~~~~~liEi~~~ 123 (127)
..+++-|+.+|.... .+--++++|+ +.+|++.+.+...+.+... ...++.=..|+.+++.||-..|
T Consensus 50 ~~i~~~i~~al~~~~---~~~~~v~I~v-~p~d~~~l~~~~~~~~~~~--~~~l~~D~~l~~G~c~iet~~g 115 (128)
T PF02108_consen 50 EAILNLIREALQELP---RDEEKVTIRV-HPDDYEALEELLEDELPEL--GWELVADPSLAPGDCRIETEDG 115 (128)
T ss_pred HHHHHHHHHHHHHhh---ccCCCeEEEE-CHHHHHHHHHHHHHHHhhc--CCEEEecCCCCCCCEEEEECCe
Confidence 344455555552222 2233566666 6778888888877665432 2277776789999999998776
No 49
>KOG0813|consensus
Probab=31.85 E-value=52 Score=25.35 Aligned_cols=32 Identities=28% Similarity=0.482 Sum_probs=24.5
Q ss_pred ceEEEECCEEEEccccccCCCCCcccccHHHHHHHHHHH
Q psy1836 19 FQAVQHDNTLYISGVLGIDASSGKMAEGVENQAKQALTN 57 (127)
Q Consensus 19 s~av~~g~~v~iSG~~~~d~~~g~~~~d~~~Q~~~~~~n 57 (127)
.+++-.|+.+|.+|. |.+-++..+|+..+++.
T Consensus 132 e~~iFtGDtlf~~Gc-------G~~FEgt~~~M~~sl~~ 163 (265)
T KOG0813|consen 132 ERAIFTGDTLFGAGC-------GRFFEGTAEQMDSSLNE 163 (265)
T ss_pred CCeEEeCCceeecCc-------cchhcCCHHHHHHhHHH
Confidence 457777999998884 44446788899888887
No 50
>KOG3887|consensus
Probab=31.81 E-value=1.3e+02 Score=23.57 Aligned_cols=44 Identities=23% Similarity=0.364 Sum_probs=33.7
Q ss_pred HHHHHHHHHHHHHcCCCccceEEEEEEeCCCCC---HHHHHHHHHhhcCC
Q psy1836 52 KQALTNMGHILKEAGGSYEHVVKTTILLNDIND---FATVNNVYGQFFKP 98 (127)
Q Consensus 52 ~~~~~ni~~~L~~~G~~~~~vv~~~vyl~d~~~---~~~~~~v~~~~f~~ 98 (127)
|.+.++...-|+.+|. ++ ++++.|+|.+-| |+++.++-++..+.
T Consensus 158 rdI~qr~~d~l~d~gl--e~-v~vsf~LTSIyDHSIfEAFSkvVQkLipq 204 (347)
T KOG3887|consen 158 RDIHQRTNDELADAGL--EK-VQVSFYLTSIYDHSIFEAFSKVVQKLIPQ 204 (347)
T ss_pred HHHHHHhhHHHHhhhh--cc-ceEEEEEeeecchHHHHHHHHHHHHHhhh
Confidence 4577777788888876 44 489999998855 88999998887653
No 51
>KOG1179|consensus
Probab=31.63 E-value=1.2e+02 Score=26.38 Aligned_cols=44 Identities=25% Similarity=0.317 Sum_probs=27.4
Q ss_pred EEEEccccccCCCCCcccccHHHHHHHHHHHHHHHHHHcCCCccceEEEEE
Q psy1836 27 TLYISGVLGIDASSGKMAEGVENQAKQALTNMGHILKEAGGSYEHVVKTTI 77 (127)
Q Consensus 27 ~v~iSG~~~~d~~~g~~~~d~~~Q~~~~~~ni~~~L~~~G~~~~~vv~~~v 77 (127)
+||+||++|.-. .-+-.+.| .+.--.-.....|++.+||+..+.
T Consensus 247 yIYTSGTTGLPK------aAvith~r-~~~~a~g~~~~~g~~~~DvvY~~l 290 (649)
T KOG1179|consen 247 YIYTSGTTGLPK------AAVITHLR-YLQGAAGFYYVFGMTADDVVYTTL 290 (649)
T ss_pred EEEcCCCCCCCH------HHHHHHHH-HHHHHHHHHHHhCCCccceEEEcc
Confidence 799999998843 12222221 222222367788999999987654
No 52
>KOG3010|consensus
Probab=31.36 E-value=1.1e+02 Score=23.45 Aligned_cols=44 Identities=14% Similarity=0.321 Sum_probs=35.3
Q ss_pred HHHHHHHHHHHHHcCCCccceEEEEEEeCCCCCHHHHHHHHHhhcCCC
Q psy1836 52 KQALTNMGHILKEAGGSYEHVVKTTILLNDINDFATVNNVYGQFFKPP 99 (127)
Q Consensus 52 ~~~~~ni~~~L~~~G~~~~~vv~~~vyl~d~~~~~~~~~v~~~~f~~~ 99 (127)
+..++++.++|+.-|+ ++-+..|-.|.-+++++..++.++....
T Consensus 116 e~fy~~~~rvLRk~Gg----~iavW~Y~dd~v~~pE~dsv~~r~~~~~ 159 (261)
T KOG3010|consen 116 ERFYKEAYRVLRKDGG----LIAVWNYNDDFVDWPEFDSVMLRLYDST 159 (261)
T ss_pred HHHHHHHHHHcCCCCC----EEEEEEccCCCcCCHHHHHHHHHHhhcc
Confidence 3566777888888775 6788999999999999999998887644
No 53
>KOG1394|consensus
Probab=31.09 E-value=1.1e+02 Score=25.14 Aligned_cols=48 Identities=23% Similarity=0.156 Sum_probs=35.1
Q ss_pred HHHHHHHHHHHHcCCCccceEEEEEEeCCCC--CHHHHHHHHHhhcCCCCC
Q psy1836 53 QALTNMGHILKEAGGSYEHVVKTTILLNDIN--DFATVNNVYGQFFKPPYP 101 (127)
Q Consensus 53 ~~~~ni~~~L~~~G~~~~~vv~~~vyl~d~~--~~~~~~~v~~~~f~~~~P 101 (127)
-+...|+..|+.+|.+++||-.++..-|... |..+... .++.|+++.+
T Consensus 302 Ga~~am~raL~~Agl~pe~i~YvNAHATST~~GD~aE~~A-i~~vf~~~~~ 351 (440)
T KOG1394|consen 302 GAVLAMERALKDAGLSPEDIDYVNAHATSTPLGDAAEAEA-IKRVFGEHNI 351 (440)
T ss_pred hHHHHHHHHHHHcCCChhhcCeeecccccCcCchHHHHHH-HHHHhccCCc
Confidence 4567789999999999999999999888754 4444443 4556666533
No 54
>cd00652 TBP_TLF TATA box binding protein (TBP): Present in archaea and eukaryotes, TBPs are transcription factors that recognize promoters and initiate transcription. TBP has been shown to be an essential component of three different transcription initiation complexes: SL1, TFIID and TFIIIB, directing transcription by RNA polymerases I, II and III, respectively. TBP binds directly to the TATA box promoter element, where it nucleates polymerase assembly, thus defining the transcription start site. TBP's binding in the minor groove induces a dramatic DNA bending while its own structure barely changes. The conserved core domain of TBP, which binds to the TATA box, has a bipartite structure, with intramolecular symmetry generating a saddle-shaped structure that sits astride the DNA. New members of the TBP family, called TBP-like proteins (TBLP, TLF, TLP) or TBP-related factors (TRF1, TRF2,TRP), are similar to the core domain of TBPs, with identical or chemically similar amino acids at many
Probab=30.72 E-value=78 Score=22.56 Aligned_cols=37 Identities=27% Similarity=0.327 Sum_probs=26.2
Q ss_pred EEEEccccccCCCCCcccccHHHHHHHHHHHHHHHHHHcCCCc
Q psy1836 27 TLYISGVLGIDASSGKMAEGVENQAKQALTNMGHILKEAGGSY 69 (127)
Q Consensus 27 ~v~iSG~~~~d~~~g~~~~d~~~Q~~~~~~ni~~~L~~~G~~~ 69 (127)
+||-||-+-... ..=+++++.+++.+...|+..|...
T Consensus 51 lIf~sGKivitG------aks~~~~~~a~~~~~~~L~~~g~~~ 87 (174)
T cd00652 51 LIFSSGKMVITG------AKSEEDAKLAARKYARILQKLGFPV 87 (174)
T ss_pred EEECCCEEEEEe------cCCHHHHHHHHHHHHHHHHHcCCCc
Confidence 566666544321 1345788999999999999999765
No 55
>PF01071 GARS_A: Phosphoribosylglycinamide synthetase, ATP-grasp (A) domain; InterPro: IPR020561 Phosphoribosylglycinamide synthetase (6.3.4.13 from EC) (GARS) (phosphoribosylamine glycine ligase) [] catalyses the second step in the de novo biosynthesis of purine. The reaction catalysed by phosphoribosylglycinamide synthetase is the ATP-dependent addition of 5-phosphoribosylamine to glycine to form 5'phosphoribosylglycinamide: ATP + 5-phosphoribosylamine + glycine = ADP + Pi + 5'-phosphoribosylglycinamide In bacteria, GARS is a monofunctional enzyme (encoded by the purD gene). In yeast, GARS is part of a bifunctional enzyme (encoded by the ADE5/7 gene) in conjunction with phosphoribosylformylglycinamidine cyclo-ligase (AIRS) (IPR000728 from INTERPRO). In higher eukaryotes, GARS is part of a trifunctional enzyme in conjunction with AIRS (IPR000728 from INTERPRO) and with phosphoribosylglycinamide formyltransferase (GART) (), forming GARS-AIRS-GART. This entry represents the A-domain of the enzyme, and is related to the ATP-grasp domain of biotin carboxylase/carbamoyl phosphate synthetase.; PDB: 1GSO_A 3LP8_A 2IP4_A 1VKZ_A 2QK4_A 2XD4_A 2XCL_A 3MJF_A 2YRW_A 2YS6_A ....
Probab=30.59 E-value=61 Score=23.70 Aligned_cols=53 Identities=15% Similarity=0.168 Sum_probs=34.8
Q ss_pred EEEccccccCCCCCcccccHHHHHHH-HHHHHHHHHHHcCCCccceEEEEEEeCC
Q psy1836 28 LYISGVLGIDASSGKMAEGVENQAKQ-ALTNMGHILKEAGGSYEHVVKTTILLND 81 (127)
Q Consensus 28 v~iSG~~~~d~~~g~~~~d~~~Q~~~-~~~ni~~~L~~~G~~~~~vv~~~vyl~d 81 (127)
--+.|+.+.-| .-.+..+..++++. +++.+...|++.|....-|+...+.+++
T Consensus 124 pnTGGMGa~sp-~p~~~~~~~~~i~~~I~~pt~~~l~~eg~~y~GvLy~glMlt~ 177 (194)
T PF01071_consen 124 PNTGGMGAYSP-VPFITDELLEEIIEEILEPTLKGLKKEGIPYRGVLYAGLMLTE 177 (194)
T ss_dssp EEESESEEEES-TTTS-HHHHHHHHHHTHHHHHHHHHHTT---EEEEEEEEEEET
T ss_pred CCCCCccceee-cccCCHHHHHHHHHHHHHHHHHHHHhcCCCcceeeeeeeEEeC
Confidence 33445544444 22223566666665 9999999999999999999999998885
No 56
>cd04915 ACT_AK-Ectoine_2 ACT domains located C-terminal to the catalytic domain of the aspartokinase of the ectoine (1,4,5,6-tetrahydro-2-methyl pyrimidine-4-carboxylate) biosynthetic pathway. This CD includes the second of two ACT domains located C-terminal to the catalytic domain of the aspartokinase of the ectoine (1,4,5,6-tetrahydro-2-methyl pyrimidine-4-carboxylate) biosynthetic pathway found in Methylomicrobium alcaliphilum, Vibrio cholerae, and various other halotolerant or halophilic bacteria. Bacteria exposed to hyperosmotic stress accumulate organic solutes called 'compatible solutes' of which ectoine, a heterocyclic amino acid, is one. Apart from its osmotic function, ectoine also exhibits a protective effect on proteins, nucleic acids and membranes against a variety of stress factors. de novo synthesis of ectoine starts with the phosphorylation of L-aspartate and shares its first two enzymatic steps with the biosynthesis of amino acids of the aspartate family: aspartokinas
Probab=30.29 E-value=1.2e+02 Score=17.64 Aligned_cols=45 Identities=11% Similarity=0.025 Sum_probs=31.1
Q ss_pred HHHHHHHHHHHHcCCCccceE------EEEEEeCCCCCHHHHHHHHHhhcC
Q psy1836 53 QALTNMGHILKEAGGSYEHVV------KTTILLNDINDFATVNNVYGQFFK 97 (127)
Q Consensus 53 ~~~~ni~~~L~~~G~~~~~vv------~~~vyl~d~~~~~~~~~v~~~~f~ 97 (127)
.+..++-..|.+.|..+.-+. ++.+.+.+-+.-+.++.++++||.
T Consensus 16 gv~~ki~~~L~~~~I~v~~i~~~~s~~~is~~V~~~~~~~av~~Lh~~f~~ 66 (66)
T cd04915 16 GVLARGLAALAEAGIEPIAAHQSMRNVDVQFVVDRDDYDNAIKALHAALVE 66 (66)
T ss_pred hHHHHHHHHHHHCCCCEEEEEecCCeeEEEEEEEHHHHHHHHHHHHHHHhC
Confidence 477888889999998874333 444555444445677888888873
No 57
>cd04919 ACT_AK-Hom3_2 ACT domains located C-terminal to the catalytic domain of the aspartokinase (AK) HOM3. This CD includes the second of two ACT domains located C-terminal to the catalytic domain of the aspartokinase (AK) HOM3, a monofunctional class enzyme found in Saccharomyces cerevisiae, and other related ACT domains. AK is the first enzyme in the aspartate metabolic pathway, catalyzes the conversion of aspartate and ATP to aspartylphosphate and ADP, and in fungi, is responsible for the production of threonine, isoleucine and methionine. S. cerevisiae has a single AK, which is regulated by feedback, allosteric inhibition by L-threonine. Recent studies shown that the allosteric transition triggered by binding of threonine to AK involves a large change in the conformation of the native hexameric enzyme that is converted to an inactive one of different shape and substantially smaller hydrodynamic size. Members of this CD belong to the superfamily of ACT regulatory domains.
Probab=29.96 E-value=1.1e+02 Score=17.17 Aligned_cols=44 Identities=9% Similarity=0.132 Sum_probs=26.6
Q ss_pred HHHHHHHHHHHHcCCCccceEE------EEEEeCCCCCHHHHHHHHHhhc
Q psy1836 53 QALTNMGHILKEAGGSYEHVVK------TTILLNDINDFATVNNVYGQFF 96 (127)
Q Consensus 53 ~~~~ni~~~L~~~G~~~~~vv~------~~vyl~d~~~~~~~~~v~~~~f 96 (127)
.....+-..|++.|....-+.+ +.+.+.+-+.-..++.+++++|
T Consensus 16 ~~~~~if~~L~~~~I~v~~i~q~~s~~~isf~v~~~~~~~a~~~lh~~~~ 65 (66)
T cd04919 16 GIAGRMFTTLADHRINIEMISQGASEINISCVIDEKDAVKALNIIHTNLL 65 (66)
T ss_pred CHHHHHHHHHHHCCCCEEEEEecCccceEEEEEeHHHHHHHHHHHHHHHh
Confidence 4456677888898887765543 4444444332344566676665
No 58
>COG3082 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=29.66 E-value=1e+02 Score=18.89 Aligned_cols=23 Identities=17% Similarity=0.316 Sum_probs=19.8
Q ss_pred HHHHHHHHHHHHHHHHHcCCCcc
Q psy1836 48 ENQAKQALTNMGHILKEAGGSYE 70 (127)
Q Consensus 48 ~~Q~~~~~~ni~~~L~~~G~~~~ 70 (127)
.+|+++++..+.++|+.+....+
T Consensus 9 De~ve~il~e~iaVLeKH~ap~d 31 (74)
T COG3082 9 DEQVEQILNELIAVLEKHKAPTD 31 (74)
T ss_pred HHHHHHHHHHHHHHHHhcCCCcc
Confidence 47999999999999999987543
No 59
>CHL00123 rps6 ribosomal protein S6; Validated
Probab=29.65 E-value=74 Score=20.36 Aligned_cols=24 Identities=17% Similarity=0.274 Sum_probs=20.7
Q ss_pred HHHHHHHHHHHHHHHHHcCCCccc
Q psy1836 48 ENQAKQALTNMGHILKEAGGSYEH 71 (127)
Q Consensus 48 ~~Q~~~~~~ni~~~L~~~G~~~~~ 71 (127)
++|...++++++.+|++.|+....
T Consensus 20 e~~~~~~~~~~~~~i~~~gg~i~~ 43 (97)
T CHL00123 20 EEELLKWIENYKKLLRKRGAKNIS 43 (97)
T ss_pred HHHHHHHHHHHHHHHHHCCCEEEE
Confidence 679999999999999999986443
No 60
>KOG2708|consensus
Probab=29.42 E-value=2.7e+02 Score=21.54 Aligned_cols=68 Identities=13% Similarity=0.175 Sum_probs=49.8
Q ss_pred CCCccc-ccHHHHHHHHHHHHHHHHHHcCCCccceEEEEEEeCCCCC------HHHHHHHHHhhcCCC-CCccEEEE
Q psy1836 39 SSGKMA-EGVENQAKQALTNMGHILKEAGGSYEHVVKTTILLNDIND------FATVNNVYGQFFKPP-YPARSTFQ 107 (127)
Q Consensus 39 ~~g~~~-~d~~~Q~~~~~~ni~~~L~~~G~~~~~vv~~~vyl~d~~~------~~~~~~v~~~~f~~~-~Pa~t~v~ 107 (127)
..|-++ +..+---..++.-++..|+++|.+.+|| .+-.|.+...- -..+.+....++..+ -+..-|++
T Consensus 37 G~GFlP~~TA~HHr~~il~Lv~~al~ea~v~~~di-D~icyTKGPGmgaPL~~vaivaRtlsllw~kPlv~VNHCig 112 (336)
T KOG2708|consen 37 GEGFLPRDTARHHRAWILGLVKQALEEAGVTSDDI-DCICYTKGPGMGAPLSVVAIVARTLSLLWNKPLVGVNHCIG 112 (336)
T ss_pred CCCCCcchhHHHHHHHHHHHHHHHHHHcCCChhhC-CEEEEcCCCCCCCchhhHHHHHHHHHHHhCCCcccchhhhh
Confidence 356666 6666677779999999999999999997 78888886532 344566667777766 55666665
No 61
>PLN02633 palmitoyl protein thioesterase family protein
Probab=29.39 E-value=2.9e+02 Score=21.90 Aligned_cols=58 Identities=14% Similarity=0.049 Sum_probs=42.9
Q ss_pred ccccHHHHHHHHHHHHHHHHHHcCCCccceEEEEEEeCCCCCHHHHHHHHHhhcCCCCCccEEEEcC
Q psy1836 43 MAEGVENQAKQALTNMGHILKEAGGSYEHVVKTTILLNDINDFATVNNVYGQFFKPPYPARSTFQVG 109 (127)
Q Consensus 43 ~~~d~~~Q~~~~~~ni~~~L~~~G~~~~~vv~~~vyl~d~~~~~~~~~v~~~~f~~~~Pa~t~v~V~ 109 (127)
...++.+|++.+.+.|.. .++..-+ +-+...+.=..+-+.+-+++++++|.++.|-.+
T Consensus 71 ~~~~~~~Qve~vce~l~~-~~~l~~G--------~naIGfSQGGlflRa~ierc~~~p~V~nlISlg 128 (314)
T PLN02633 71 WLMPLTQQAEIACEKVKQ-MKELSQG--------YNIVGRSQGNLVARGLIEFCDGGPPVYNYISLA 128 (314)
T ss_pred ceeCHHHHHHHHHHHHhh-chhhhCc--------EEEEEEccchHHHHHHHHHCCCCCCcceEEEec
Confidence 337899999999999988 4444322 223345666788999999998878999998644
No 62
>COG2101 SPT15 TATA-box binding protein (TBP), component of TFIID and TFIIIB [Transcription]
Probab=29.38 E-value=76 Score=23.08 Aligned_cols=35 Identities=34% Similarity=0.371 Sum_probs=22.8
Q ss_pred EEEEccccccCCCCCcccccHHHHHHHHHHHHHHHHHHcCC
Q psy1836 27 TLYISGVLGIDASSGKMAEGVENQAKQALTNMGHILKEAGG 67 (127)
Q Consensus 27 ~v~iSG~~~~d~~~g~~~~d~~~Q~~~~~~ni~~~L~~~G~ 67 (127)
++|-||-+=+.. .--++.++++++++..-|++.|.
T Consensus 150 LiF~SGK~ViTG------aK~~ed~~~Av~~i~~~L~elgl 184 (185)
T COG2101 150 LLFGSGKLVITG------AKSEEDAEQAVEKIQSRLEELGL 184 (185)
T ss_pred EEecCCcEEEec------CCCHHHHHHHHHHHHHHHHHhcc
Confidence 566666544422 33456677888888888888774
No 63
>PRK13411 molecular chaperone DnaK; Provisional
Probab=29.33 E-value=1.8e+02 Score=25.06 Aligned_cols=52 Identities=23% Similarity=0.331 Sum_probs=37.6
Q ss_pred ccHHHHHH----HHHHHHHHHHHHcCCCccceEEEEEEeC-CCCCHHHHHHHHHhhcCC
Q psy1836 45 EGVENQAK----QALTNMGHILKEAGGSYEHVVKTTILLN-DINDFATVNNVYGQFFKP 98 (127)
Q Consensus 45 ~d~~~Q~~----~~~~ni~~~L~~~G~~~~~vv~~~vyl~-d~~~~~~~~~v~~~~f~~ 98 (127)
.++++=++ .+.+-++.+|+.+|.+.++|- .|.++ .....+.+.+...++|+.
T Consensus 297 ~~fe~l~~~l~~~~~~~i~~~L~~a~~~~~~id--~ViLvGGssriP~v~~~l~~~f~~ 353 (653)
T PRK13411 297 AKFEELTKDLVEATIEPMQQALKDAGLKPEDID--RVILVGGSTRIPAVQEAIQKFFGG 353 (653)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHcCCCHHHCc--EEEEECCCCCcchHHHHHHHHcCC
Confidence 44544444 455567788899999988873 34444 567889999999999974
No 64
>PF07108 PipA: PipA protein; InterPro: IPR010777 This family consists of several Salmonella PipA (pathogenicity island-encoded protein A) and related phage sequences. PipA is thought to contribute to enteric but not to systemic salmonellosis [].
Probab=29.10 E-value=56 Score=23.48 Aligned_cols=34 Identities=26% Similarity=0.503 Sum_probs=21.4
Q ss_pred EEEccccccCCCCCcccccHHHHHHHHHHHHHHHHHHc
Q psy1836 28 LYISGVLGIDASSGKMAEGVENQAKQALTNMGHILKEA 65 (127)
Q Consensus 28 v~iSG~~~~d~~~g~~~~d~~~Q~~~~~~ni~~~L~~~ 65 (127)
.|+||-.|+|| +-++-+|+.. .|.+.+..+|+.+
T Consensus 40 y~iSg~ggidp-d~EiDdd~Y~---eC~~eLS~il~~A 73 (200)
T PF07108_consen 40 YLISGAGGIDP-DTEIDDDIYD---ECYDELSSILQNA 73 (200)
T ss_pred HHHhCCCCCCC-cccccchHHH---HHHHHHHHHHHHH
Confidence 57999999999 5555455444 4455555555544
No 65
>PF01849 NAC: NAC domain; InterPro: IPR002715 Nascent polypeptide-associated complex (NAC) is among the first ribosome-associated entities to bind the nascent polypeptide after peptide bond formation. The nascent polypeptide-associated complex (NAC) of yeast functions in the targeting process of ribosomes to the ER membrane []. NAC may prevent binding of ribosome nascent chains (RNCs) without a signal sequence to yeast membranes.; PDB: 3MCE_D 3MCB_A 3LKX_B 1TR8_B.
Probab=28.32 E-value=55 Score=18.80 Aligned_cols=26 Identities=23% Similarity=0.254 Sum_probs=16.5
Q ss_pred HHHHHHHHcCC-CccceEEEEEEeCCC
Q psy1836 57 NMGHILKEAGG-SYEHVVKTTILLNDI 82 (127)
Q Consensus 57 ni~~~L~~~G~-~~~~vv~~~vyl~d~ 82 (127)
.++..|+..|+ ...+|-+++++..|-
T Consensus 3 k~~~~l~klgl~~i~~i~eV~i~~~dg 29 (58)
T PF01849_consen 3 KLQKMLKKLGLKEIPGIEEVTIRKDDG 29 (58)
T ss_dssp -----GHHCT-EEETTEEEEEEEETTT
T ss_pred HHHHHHHHcCCcccCCcEEEEEEECCc
Confidence 46778888888 566899999999873
No 66
>PF02256 Fe_hyd_SSU: Iron hydrogenase small subunit; InterPro: IPR003149 Many microorganisms, such as methanogenic, acetogenic, nitrogen-fixing, photosynthetic, or sulphate-reducing bacteria, metabolise hydrogen. Hydrogen activation is mediated by a family of enzymes, termed hydrogenases, which either provide these organisms with reducing power from hydrogen oxidation, or act as electron sinks. There are two hydrogenases families that differ functionally from each other: NiFe hydrogenases tend to be more involved in hydrogen oxidation, while Iron-only FeFe (Fe only) hydrogenases in hydrogen production. Fe only hydrogenases (1.12.7.2 from EC) show a common core structure, which contains a moiety, deeply buried inside the protein, with an Fe-Fe dinuclear centre, nonproteic bridging, terminal CO and CN- ligands attached to each of the iron atoms, and a dithio moiety, which also bridges the two iron atoms and has been tentatively assigned as a di(thiomethyl)amine. This common core also harbours three [4Fe-4S] iron-sulphur clusters []. In FeFe hydrogenases, as in NiFe hydrogenases, the set of iron-sulphur clusters is dispersed regularly between the dinuclear Fe-Fe centre and the molecular surface. These clusters are distant by about 1.2 nm from each other but the [4Fe-4S] cluster closest to the dinuclear centre is covalently bound to one of the iron atoms though a thiolate bridging ligand. The moiety including the dinuclear centre, the thiolate bridging ligand, and the proximal [4Fe-4S] cluster is known as the H-cluster. A channel, lined with hydrophobic amino acid side chains, nearly connects the dinuclear centre and the molecular surface. Furthermore hydrogen-bonded water molecule sites have been identified at the interior and at the surface of the protein. The small subunit is comprised of alternating random coil and alpha helical structures that encompass the large subunit in a novel protein fold [].; PDB: 3LX4_A 1C4C_A 1FEH_A 3C8Y_A 1C4A_A 1HFE_S 1GX7_D 1E08_D.
Probab=27.81 E-value=52 Score=19.21 Aligned_cols=20 Identities=20% Similarity=0.486 Sum_probs=14.8
Q ss_pred CCCCCHHHHHHHHHhhcCCC
Q psy1836 80 NDINDFATVNNVYGQFFKPP 99 (127)
Q Consensus 80 ~d~~~~~~~~~v~~~~f~~~ 99 (127)
....+-+.+.++|.+|++++
T Consensus 24 r~s~eNp~v~~lY~~~lg~p 43 (60)
T PF02256_consen 24 RKSHENPEVQELYKEFLGGP 43 (60)
T ss_dssp -SGGG-HHHHHHHHHTTSST
T ss_pred CCCCcCHHHHHHHHHHhCCC
Confidence 34456789999999999965
No 67
>PF08309 LVIVD: LVIVD repeat; InterPro: IPR013211 This repeat is found in bacterial and archaeal cell surface proteins, many of which are hypothetical. The secondary structure corresponding to this repeat is predicted to comprise 4 beta-strands, which may associate to form a beta-propeller. The repeat copy number varies from 2-14. This repeat is sometimes found with the PKD domain IPR000601 from INTERPRO.
Probab=27.37 E-value=92 Score=16.85 Aligned_cols=19 Identities=26% Similarity=0.370 Sum_probs=12.1
Q ss_pred CcceEEEE-CCEEEEc-cccc
Q psy1836 17 SAFQAVQH-DNTLYIS-GVLG 35 (127)
Q Consensus 17 ~ys~av~~-g~~v~iS-G~~~ 35 (127)
++.+.+.+ |+++|++ +..|
T Consensus 2 G~a~~v~v~g~yaYva~~~~G 22 (42)
T PF08309_consen 2 GDARDVAVSGNYAYVADGNNG 22 (42)
T ss_pred ceEEEEEEECCEEEEEeCCCC
Confidence 34455544 9999998 4333
No 68
>PF13964 Kelch_6: Kelch motif
Probab=27.32 E-value=76 Score=17.03 Aligned_cols=15 Identities=27% Similarity=0.468 Sum_probs=10.2
Q ss_pred cceEEEECCEEEEcc
Q psy1836 18 AFQAVQHDNTLYISG 32 (127)
Q Consensus 18 ys~av~~g~~v~iSG 32 (127)
...++..++.||+-|
T Consensus 4 ~~s~v~~~~~iyv~G 18 (50)
T PF13964_consen 4 GHSAVVVGGKIYVFG 18 (50)
T ss_pred cCEEEEECCEEEEEC
Confidence 345666788888776
No 69
>PLN00062 TATA-box-binding protein; Provisional
Probab=26.83 E-value=2.4e+02 Score=20.23 Aligned_cols=36 Identities=19% Similarity=0.288 Sum_probs=26.7
Q ss_pred EEEEccccccCCCCCcccccHHHHHHHHHHHHHHHHHHcCCC
Q psy1836 27 TLYISGVLGIDASSGKMAEGVENQAKQALTNMGHILKEAGGS 68 (127)
Q Consensus 27 ~v~iSG~~~~d~~~g~~~~d~~~Q~~~~~~ni~~~L~~~G~~ 68 (127)
+||-||-+-+.. ..-+++++.+++++..+|+..|..
T Consensus 51 lIF~SGKiviTG------aks~e~a~~a~~~~~~~L~~lg~~ 86 (179)
T PLN00062 51 LIFASGKMVCTG------AKSEHDSKLAARKYARIIQKLGFP 86 (179)
T ss_pred EEECCCeEEEEe------cCCHHHHHHHHHHHHHHHHHcCCC
Confidence 677777654422 234579999999999999999974
No 70
>cd04922 ACT_AKi-HSDH-ThrA_2 ACT domains of the bifunctional enzyme aspartokinase (AK) - homoserine dehydrogenase (HSDH). This CD includes the second of two ACT domains of the bifunctional enzyme aspartokinase (AK) - homoserine dehydrogenase (HSDH). The ACT domains are positioned between the N-terminal catalytic domain of AK and the C-terminal HSDH domain found in bacteria (Escherichia coli (EC) ThrA) and higher plants (Zea mays AK-HSDH). AK and HSDH are the first and third enzymes in the biosynthetic pathway of the aspartate family of amino acids. AK catalyzes the phosphorylation of Asp to P-aspartyl phosphate. HSDH catalyzes the NADPH-dependent conversion of Asp 3-semialdehyde to homoserine. HSDH is the first committed reaction in the branch of the pathway that leads to Thr and Met. In E. coli, ThrA is subject to allosteric regulation by the end product L-threonine and the native enzyme is reported to be tetrameric. As with bacteria, plant AK and HSDH are feedback inhibited by pathwa
Probab=26.76 E-value=1.2e+02 Score=16.78 Aligned_cols=44 Identities=11% Similarity=0.198 Sum_probs=26.7
Q ss_pred HHHHHHHHHHHHcCCCccceE------EEEEEeCCCCCHHHHHHHHHhhc
Q psy1836 53 QALTNMGHILKEAGGSYEHVV------KTTILLNDINDFATVNNVYGQFF 96 (127)
Q Consensus 53 ~~~~ni~~~L~~~G~~~~~vv------~~~vyl~d~~~~~~~~~v~~~~f 96 (127)
..+..+-..|.++|....-+. ++.+.+.+-+.-.....+++++|
T Consensus 16 ~~~~~i~~~l~~~~I~v~~i~~~~s~~~is~~v~~~~~~~~~~~lh~~~~ 65 (66)
T cd04922 16 GVAATFFSALAKANVNIRAIAQGSSERNISAVIDEDDATKALRAVHERFF 65 (66)
T ss_pred cHHHHHHHHHHHCCCCEEEEEecCcccEEEEEEeHHHHHHHHHHHHHHHh
Confidence 456667778888887775553 34445544333345666677765
No 71
>PF09658 Cas_Csx9: CRISPR-associated protein (Cas_Csx9); InterPro: IPR013488 Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR) are a family of DNA direct repeats separated by regularly sized non-repetitive spacer sequences that are found in most bacterial and archaeal genomes []. CRISPRs appear to provide acquired resistance against bacteriophages, possibly acting with an RNA interference-like mechanism to inhibit gene functions of invasive DNA elements [, ]. Differences in the number and type of spacers between CRISPR repeats correlate with phage sensitivity. It is thought that following phage infection, bacteria integrate new spacers derived from phage genomic sequences, and that the removal or addition of particular spacers modifies the phage-resistance phenotype of the cell. Therefore, the specificity of CRISPRs may be determined by spacer-phage sequence similarity. In addition, there are many protein families known as CRISPR-associated sequences (Cas), which are encoded in the vicinity of CRISPR loci []. CRISPR/cas gene regions can be quite large, with up to 20 different, tandem-arranged cas genes next to a CRISPR cluster or filling the region between two repeat clusters. Cas genes and CRISPRs are found on mobile genetic elements such as plasmids, and have undergone extensive horizontal transfer. Cas proteins are thought to be involved in the propagation and functioning of CRISPRs. Some Cas proteins show similarity to helicases and repair proteins, although the functions of most are unknown. Cas families can be divided into subtypes according to operon organisation and phylogeny. This entry represents the Cxs9 family of Cas proteins found in archaea. These proteins are encoded in the midst of a cas gene operon [].
Probab=26.29 E-value=3.2e+02 Score=21.67 Aligned_cols=47 Identities=21% Similarity=0.278 Sum_probs=32.2
Q ss_pred ccHHHHHHHHHHHHHHHHHHcCCCccceEEEEEEeCCCCCHHHHHHHHHhhcCCCCCc
Q psy1836 45 EGVENQAKQALTNMGHILKEAGGSYEHVVKTTILLNDINDFATVNNVYGQFFKPPYPA 102 (127)
Q Consensus 45 ~d~~~Q~~~~~~ni~~~L~~~G~~~~~vv~~~vyl~d~~~~~~~~~v~~~~f~~~~Pa 102 (127)
.++.+-.+.+++.++.-.+ .+.+ | -..+|...++.++++||+-.+|-
T Consensus 15 ~~laeale~~~dslesR~~-i~ss-----K-----lsrndRnsfdkv~~~wF~i~~pe 61 (377)
T PF09658_consen 15 EELAEALENAMDSLESRKK-INSS-----K-----LSRNDRNSFDKVLNAWFGISAPE 61 (377)
T ss_pred HHHHHHHHHHHHHHHHhhh-cCcc-----c-----ccccchhhHHHHHHHHhCCCCch
Confidence 5677777777777766665 3433 1 13467889999999999877663
No 72
>PF14552 Tautomerase_2: Tautomerase enzyme; PDB: 2AAG_C 2AAL_A 2AAJ_A 1MWW_C.
Probab=26.23 E-value=1.6e+02 Score=18.32 Aligned_cols=29 Identities=24% Similarity=0.194 Sum_probs=19.4
Q ss_pred ccHHHHHHHHHHHHHHHHHH-cCCCccceE
Q psy1836 45 EGVENQAKQALTNMGHILKE-AGGSYEHVV 73 (127)
Q Consensus 45 ~d~~~Q~~~~~~ni~~~L~~-~G~~~~~vv 73 (127)
+--.+|=++-++.|..-|++ .|.+.+||.
T Consensus 39 gRs~e~K~~ly~~l~~~L~~~~gi~p~Dv~ 68 (82)
T PF14552_consen 39 GRSTEQKKALYRALAERLAEKLGIRPEDVM 68 (82)
T ss_dssp ---HHHHHHHHHHHHHHHHHHH---GGGEE
T ss_pred CCCHHHHHHHHHHHHHHHHHHcCCCHHHEE
Confidence 34567888888999999976 899999984
No 73
>PTZ00186 heat shock 70 kDa precursor protein; Provisional
Probab=26.17 E-value=2.6e+02 Score=24.26 Aligned_cols=53 Identities=15% Similarity=0.322 Sum_probs=39.0
Q ss_pred ccHHHH----HHHHHHHHHHHHHHcCCCccceEEEEEEeCCCCCHHHHHHHHHhhcCC
Q psy1836 45 EGVENQ----AKQALTNMGHILKEAGGSYEHVVKTTILLNDINDFATVNNVYGQFFKP 98 (127)
Q Consensus 45 ~d~~~Q----~~~~~~ni~~~L~~~G~~~~~vv~~~vyl~d~~~~~~~~~v~~~~f~~ 98 (127)
.++++- .+.+++-++.+|+.+|.+.++|-. -+.+=.....|.+.+...++|+.
T Consensus 323 ~efe~l~~~l~~r~~~~v~~~L~~a~~~~~dId~-VvLVGGssriP~V~~~l~~~fg~ 379 (657)
T PTZ00186 323 SKFEGITQRLIERSIAPCKQCMKDAGVELKEIND-VVLVGGMTRMPKVVEEVKKFFQK 379 (657)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHcCCChhhCCE-EEEECCcccChHHHHHHHHHhCC
Confidence 455555 445556677888999999998853 34444667899999999999985
No 74
>PTZ00400 DnaK-type molecular chaperone; Provisional
Probab=25.92 E-value=3.2e+02 Score=23.67 Aligned_cols=53 Identities=15% Similarity=0.300 Sum_probs=39.1
Q ss_pred ccHHHHH----HHHHHHHHHHHHHcCCCccceEEEEEEeC-CCCCHHHHHHHHHhhcCCC
Q psy1836 45 EGVENQA----KQALTNMGHILKEAGGSYEHVVKTTILLN-DINDFATVNNVYGQFFKPP 99 (127)
Q Consensus 45 ~d~~~Q~----~~~~~ni~~~L~~~G~~~~~vv~~~vyl~-d~~~~~~~~~v~~~~f~~~ 99 (127)
.++++-+ +.+++-++.+|+++|.+.++|-. ++++ .....|.+.+...++|+..
T Consensus 337 ~efe~l~~~l~~~~~~~i~~~L~~a~~~~~~i~~--ViLvGGssriP~v~~~l~~~f~~~ 394 (663)
T PTZ00400 337 AKLEELTHDLLKKTIEPCEKCIKDAGVKKDELND--VILVGGMTRMPKVSETVKKIFGKE 394 (663)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHcCCCHHHCcE--EEEECCccCChHHHHHHHHHhCCC
Confidence 5555555 44555677889999999998733 4455 5678899999999999853
No 75
>COG0443 DnaK Molecular chaperone [Posttranslational modification, protein turnover, chaperones]
Probab=25.79 E-value=2.2e+02 Score=24.29 Aligned_cols=52 Identities=19% Similarity=0.325 Sum_probs=41.8
Q ss_pred ccHHHHHHHHHHHHHHHHHHcCCCccceEEEEEEeCCCCCHHHHHHHHHhhcC
Q psy1836 45 EGVENQAKQALTNMGHILKEAGGSYEHVVKTTILLNDINDFATVNNVYGQFFK 97 (127)
Q Consensus 45 ~d~~~Q~~~~~~ni~~~L~~~G~~~~~vv~~~vyl~d~~~~~~~~~v~~~~f~ 97 (127)
.-+.+....+...++..|+.+|.+..+| ...+.+--...++.+.+...++|+
T Consensus 283 ~l~~dll~r~~~~~~~al~~a~l~~~~I-~~VilvGGstriP~V~~~v~~~f~ 334 (579)
T COG0443 283 ELILDLLERTIEPVEQALKDAGLEKSDI-DLVILVGGSTRIPAVQELVKEFFG 334 (579)
T ss_pred HHHHHHHHHHHHHHHHHHHHcCCChhhC-ceEEEccceeccHHHHHHHHHHhC
Confidence 3455666777788889999999999887 444556667789999999999999
No 76
>PRK11678 putative chaperone; Provisional
Probab=25.58 E-value=3.1e+02 Score=22.60 Aligned_cols=47 Identities=13% Similarity=0.085 Sum_probs=34.4
Q ss_pred HHHHHHHHHHHHHHHHHHcCCCccceEEEEEEeC-CCCCHHHHHHHHHhhcCC
Q psy1836 47 VENQAKQALTNMGHILKEAGGSYEHVVKTTILLN-DINDFATVNNVYGQFFKP 98 (127)
Q Consensus 47 ~~~Q~~~~~~ni~~~L~~~G~~~~~vv~~~vyl~-d~~~~~~~~~v~~~~f~~ 98 (127)
++.-.+++++-++..|+++|.+.+ .|+++ .....+.+.+...++|+.
T Consensus 378 i~~~l~ri~~~i~~~L~~a~~~~d-----~VvLvGGsSriP~V~~~l~~~fg~ 425 (450)
T PRK11678 378 ISQPLARILELVQLALDQAQVKPD-----VIYLTGGSARSPLIRAALAQQLPG 425 (450)
T ss_pred HHHHHHHHHHHHHHHHHHcCCCCC-----EEEEcCcccchHHHHHHHHHHCCC
Confidence 334455566666788888998754 35676 456789999999999976
No 77
>PRK10328 DNA binding protein, nucleoid-associated; Provisional
Probab=25.19 E-value=70 Score=22.02 Aligned_cols=26 Identities=12% Similarity=0.224 Sum_probs=21.4
Q ss_pred HHHHHHHHHHHHHHHHcCCCccceEE
Q psy1836 49 NQAKQALTNMGHILKEAGGSYEHVVK 74 (127)
Q Consensus 49 ~Q~~~~~~ni~~~L~~~G~~~~~vv~ 74 (127)
.+-..++..+...+++.|.++++++.
T Consensus 52 ~er~~~l~~i~~~~~~~Git~eeL~~ 77 (134)
T PRK10328 52 AERQEKINTWLELMKADGINPEELLG 77 (134)
T ss_pred HHHHHHHHHHHHHHHHhCCCHHHHhh
Confidence 34456889999999999999999953
No 78
>COG3655 Predicted transcriptional regulator [Transcription]
Probab=25.17 E-value=85 Score=19.39 Aligned_cols=21 Identities=14% Similarity=0.200 Sum_probs=18.5
Q ss_pred HHHHHHHHHHHcCCCccceEE
Q psy1836 54 ALTNMGHILKEAGGSYEHVVK 74 (127)
Q Consensus 54 ~~~ni~~~L~~~G~~~~~vv~ 74 (127)
.|++++++.++..+.+.||+.
T Consensus 44 ~~~tL~~iC~~LeCqpgDile 64 (73)
T COG3655 44 RLSTLEKICKALECQPGDILE 64 (73)
T ss_pred eHHHHHHHHHHcCCChhheeE
Confidence 467899999999999999975
No 79
>PRK00290 dnaK molecular chaperone DnaK; Provisional
Probab=25.08 E-value=2.9e+02 Score=23.63 Aligned_cols=49 Identities=20% Similarity=0.325 Sum_probs=35.5
Q ss_pred HHHHHHHHHHHHHHHcCCCccceEEEEEEeCCCCCHHHHHHHHHhhcCCC
Q psy1836 50 QAKQALTNMGHILKEAGGSYEHVVKTTILLNDINDFATVNNVYGQFFKPP 99 (127)
Q Consensus 50 Q~~~~~~ni~~~L~~~G~~~~~vv~~~vyl~d~~~~~~~~~v~~~~f~~~ 99 (127)
-...+++-++.+|+.+|.+.++|- --+.+=.....+.+.+...++|+..
T Consensus 305 l~~~~~~~i~~~l~~a~~~~~~id-~ViLvGGssriP~v~~~l~~~fg~~ 353 (627)
T PRK00290 305 LVERTIEPCKQALKDAGLSVSDID-EVILVGGSTRMPAVQELVKEFFGKE 353 (627)
T ss_pred HHHHHHHHHHHHHHHcCCChhhCc-EEEEECCcCCChHHHHHHHHHhCCC
Confidence 344555666788899999988873 3334445678899999999999753
No 80
>PF06526 DUF1107: Protein of unknown function (DUF1107); InterPro: IPR009491 This family consists of several short, hypothetical bacterial proteins of unknown function.; PDB: 2JRO_A.
Probab=24.47 E-value=88 Score=18.81 Aligned_cols=39 Identities=13% Similarity=0.420 Sum_probs=27.3
Q ss_pred CCEEEEccccccCCCCCccc--ccHHHHHHHHHHHHHHHHH
Q psy1836 25 DNTLYISGVLGIDASSGKMA--EGVENQAKQALTNMGHILK 63 (127)
Q Consensus 25 g~~v~iSG~~~~d~~~g~~~--~d~~~Q~~~~~~ni~~~L~ 63 (127)
.+.+||.|....+-+.|.+. ..-..|...++..|.....
T Consensus 21 ~Gr~~I~g~G~feFd~Gkillp~~~~~~~~~~~~EiN~~I~ 61 (64)
T PF06526_consen 21 RGRIYIKGIGAFEFDNGKILLPKKADKRHLSVMSEINQEIR 61 (64)
T ss_dssp SEEEEETTTEEEEEETTEE---SS--HHHHHHHHHHHHHHH
T ss_pred cceEEEEecccEEEcCCEEeCCccccHHHHHHHHHHHHHHH
Confidence 67899999988877678876 5555677778877766554
No 81
>PF01344 Kelch_1: Kelch motif; InterPro: IPR006652 Kelch is a 50-residue motif, named after the Drosophila mutant in which it was first identified []. This sequence motif represents one beta-sheet blade, and several of these repeats can associate to form a beta-propeller. For instance, the motif appears 6 times in Drosophila egg-chamber regulatory protein, creating a 6-bladed beta-propeller. The motif is also found in mouse protein MIPP [] and in a number of poxviruses. In addition, kelch repeats have been recognised in alpha- and beta-scruin [, ], and in galactose oxidase from the fungus Dactylium dendroides [, ]. The structure of galactose oxidase reveals that the repeated sequence corresponds to a 4-stranded anti-parallel beta-sheet motif that forms the repeat unit in a super-barrel structural fold []. The known functions of kelch-containing proteins are diverse: scruin is an actin cross-linking protein; galactose oxidase catalyses the oxidation of the hydroxyl group at the C6 position in D-galactose; neuraminidase hydrolyses sialic acid residues from glycoproteins; and kelch may have a cytoskeletal function, as it is localised to the actin-rich ring canals that connect the 15 nurse cells to the developing oocyte in Drosophila []. Nevertheless, based on the location of the kelch pattern in the catalytic unit in galactose oxidase, functionally important residues have been predicted in glyoxal oxidase []. This entry represents a type of kelch sequence motif that comprises one beta-sheet blade.; GO: 0005515 protein binding; PDB: 2XN4_A 2WOZ_A 3II7_A 4ASC_A 1U6D_X 1ZGK_A 2FLU_X 2VPJ_A 2DYH_A 1X2R_A ....
Probab=24.40 E-value=94 Score=16.16 Aligned_cols=14 Identities=36% Similarity=0.584 Sum_probs=10.0
Q ss_pred ceEEEECCEEEEcc
Q psy1836 19 FQAVQHDNTLYISG 32 (127)
Q Consensus 19 s~av~~g~~v~iSG 32 (127)
..++..++.||+-|
T Consensus 5 ~~~~~~~~~iyv~G 18 (47)
T PF01344_consen 5 HAAVVVGNKIYVIG 18 (47)
T ss_dssp EEEEEETTEEEEEE
T ss_pred CEEEEECCEEEEEe
Confidence 44566688888877
No 82
>PF08066 PMC2NT: PMC2NT (NUC016) domain; InterPro: IPR012588 Exosomes are nano-compartments that function in the degradation or processing of RNA (including mRNA, rRNA, snRNA and snoRNA) [, ]. Exosomes occur in both archaea and eukaryotes, and have a similar overall structure to each other and to bacterial/organelle PNPases (polynucleotide phosphorylases; 2.7.7.8 from EC) [], consisting of a barrel structure composed of a hexameric ring of PH domains that act as a degradation chamber, and an S1-domain/KH-domain containing cap that binds the RNA substrate (and sometimes accessory proteins) in order to regulate and restrict entry into the degradation chamber []. There are two types of exosomes in eukaryotes, cytoplasmic exosomes that are responsible for 3'-5' exoribonuclease degradation of mRNAs, and nuclear exosomes that degrade pre-mRNAs (such as nonsense transcripts) and degrade rRNAs, snRNAs and snoRNAs. Unstructured RNA substrates feed in through the pore made by the S1 domains, are degraded by the PH domain ring, and exit as nucleotides via the PH pore at the opposite end of the barrel [, ]. There are several accessory proteins that help degrade, unwind or polyadenylate RNA substrate before they enter the exosome. This entry represents the N-terminal domain of Rrp6 (exosome component 10 in humans), a nuclear exosome accessory factor that interacts with the bottom of the hexameric PH-ring opposite the cap. Rrp6 functions as a hydrolytic exonuclease, and is homologous to RNase-D in Escherichia coli. More information about these proteins can be found at Protein of the Month: RNA Exosomes [].; GO: 0006396 RNA processing, 0000176 nuclear exosome (RNase complex)
Probab=24.29 E-value=67 Score=20.25 Aligned_cols=27 Identities=15% Similarity=0.228 Sum_probs=20.1
Q ss_pred ccHHHHHHHHHHHHHHHHHHcCCCccc
Q psy1836 45 EGVENQAKQALTNMGHILKEAGGSYEH 71 (127)
Q Consensus 45 ~d~~~Q~~~~~~ni~~~L~~~G~~~~~ 71 (127)
.+++++.+.+.+-++.+|+.++..++.
T Consensus 61 ~d~d~~~~~vvd~~D~LlEk~D~~LDe 87 (91)
T PF08066_consen 61 DDVDERWDSVVDVNDSLLEKADISLDE 87 (91)
T ss_pred ccHHHHHHHHHHHHHHHHHHHhHHHHh
Confidence 467778888888888888877766654
No 83
>PF12205 GIT1_C: G protein-coupled receptor kinase-interacting protein 1 C term; InterPro: IPR022018 This domain family is found in eukaryotes, and is approximately 120 amino acids in length. The family is found in association with PF01412 from PFAM, PF00023 from PFAM, PF08518 from PFAM. GIT1 plays an important role in cell adhesion, motility, cytoskeletal remodeling and membrane trafficking. To perform this function, it localises p21-activated kinase (PAK) and PAK-interactive exchange factor to focal adhesions. Its activation is regulated by interaction between its paxillin-binding C-terminal and the LD motifs of paxillin. The C-terminal folds into a four helix bundle. ; PDB: 2JX0_A.
Probab=24.26 E-value=1.1e+02 Score=20.78 Aligned_cols=23 Identities=22% Similarity=0.319 Sum_probs=21.3
Q ss_pred ccHHHHHHHHHHHHHHHHHHcCC
Q psy1836 45 EGVENQAKQALTNMGHILKEAGG 67 (127)
Q Consensus 45 ~d~~~Q~~~~~~ni~~~L~~~G~ 67 (127)
+++...++++-++|+.+|+++-.
T Consensus 12 e~Vi~~TE~vTk~IqeLl~aAQ~ 34 (123)
T PF12205_consen 12 EDVIRRTEQVTKRIQELLRAAQE 34 (123)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHT
T ss_pred HHHHHHHHHHHHHHHHHHHHHHh
Confidence 89999999999999999999864
No 84
>PRK13410 molecular chaperone DnaK; Provisional
Probab=23.61 E-value=2.4e+02 Score=24.48 Aligned_cols=53 Identities=19% Similarity=0.267 Sum_probs=38.7
Q ss_pred ccHHHHHHHHHHHH----HHHHHHcCCCccceEEEEEEeCCCCCHHHHHHHHHhhcCC
Q psy1836 45 EGVENQAKQALTNM----GHILKEAGGSYEHVVKTTILLNDINDFATVNNVYGQFFKP 98 (127)
Q Consensus 45 ~d~~~Q~~~~~~ni----~~~L~~~G~~~~~vv~~~vyl~d~~~~~~~~~v~~~~f~~ 98 (127)
.++++-++..++.+ +.+|+.+|.+.++|-.+ +.+=.....+.+.+...++|+.
T Consensus 298 ~~FE~l~~~l~~r~~~~i~~~L~~ag~~~~dId~V-vLVGGssRiP~V~~~l~~~fg~ 354 (668)
T PRK13410 298 KQFESLCGDLLDRLLRPVKRALKDAGLSPEDIDEV-VLVGGSTRMPMVQQLVRTLIPR 354 (668)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHcCCChhhCcEE-EEECCccccHHHHHHHHHHcCC
Confidence 56666666655554 57788889998888533 3344567889999999999985
No 85
>TIGR02619 putative CRISPR-associated protein, APE2256 family. This model represents a conserved domain of about 150 amino acids found in at least five archaeal species and three bacterial species, exclusively in species with CRISPR (Clustered Regularly Interspaced Short Palidromic Repeats). In six of eight species, the member of this family is in the vicinity of a CRISPR/Cas locus.
Probab=23.37 E-value=2.2e+02 Score=19.82 Aligned_cols=28 Identities=11% Similarity=0.107 Sum_probs=20.9
Q ss_pred eEEEEEEeCCCCCHHHHHHHHHhhcCCC
Q psy1836 72 VVKTTILLNDINDFATVNNVYGQFFKPP 99 (127)
Q Consensus 72 vv~~~vyl~d~~~~~~~~~v~~~~f~~~ 99 (127)
-.++-.|.+|..+-....++.++|+...
T Consensus 34 ~d~~~Ll~SDT~~G~~~a~ilk~yl~~~ 61 (149)
T TIGR02619 34 DDKAILYHSDTAQGRFCASILKRFLERE 61 (149)
T ss_pred CcEEEEEEcCCHHHHHHHHHHHHHHHHh
Confidence 3477778888888777788888887655
No 86
>PF07490 Tir_receptor_N: Translocated intimin receptor (Tir) N-terminus; InterPro: IPR022633 Secretion of virulence factors in Gram-negative bacteria involves transportation of the protein across two membranes to reach the cell exterior. There have been four secretion systems described in animal enteropathogens, such as Salmonella and Yersinia, with further sequence similarities in plant pathogens like Ralstonia and Erwinia []. The type III secretion system is of great interest, as it is used to transport virulence factors from the pathogen directly into the host cell and is only triggered when the bacterium comes into close contact with the host. The protein subunits of the system are very similar to those of bacterial flagellar biosynthesis. However, while the latter forms a ring structure to allow secretion of flagellin and is an integral part of the flagellum itself [], type III subunits in the outer membrane translocate secreted proteins through a channel-like structure. Exotoxins secreted by the type III system do not possess a secretion signal, and are considered unique for this reason []. Enteropathogenic and entero- haemorrhagic Escherichia coli secrete the bacterial adhesion mediation molecule intimin [], which targets the translocated intimin receptor, Tir. Tir is secreted by the bacteria and is embedded in the target cell's plasma membrane []. This facilitates bacterial cell attachment to the host.
Probab=23.00 E-value=88 Score=23.36 Aligned_cols=35 Identities=11% Similarity=0.233 Sum_probs=19.7
Q ss_pred CCEEEEccccccCCCCCcccccHHHHHHHHHHHHH
Q psy1836 25 DNTLYISGVLGIDASSGKMAEGVENQAKQALTNMG 59 (127)
Q Consensus 25 g~~v~iSG~~~~d~~~g~~~~d~~~Q~~~~~~ni~ 59 (127)
|++||+.|-.|..-.--++..||.+.-+.+++.++
T Consensus 151 ~kFVFTGgRGGaGHaMvTvASDI~eAR~rIl~kLe 185 (266)
T PF07490_consen 151 GKFVFTGGRGGAGHAMVTVASDIAEARQRILAKLE 185 (266)
T ss_pred ceEEEecCCCCCCcceeeehhHHHHHHHHHHHhcC
Confidence 67999998766533111122677655555555443
No 87
>PF01250 Ribosomal_S6: Ribosomal protein S6; InterPro: IPR000529 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites [, ]. About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits. Many ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome [, ]. Ribosomal protein S6 is one of the proteins from the small ribosomal subunit. In Escherichia coli, S6 is known to bind together with S18 to 16S ribosomal RNA. It belongs to a family of ribosomal proteins which, on the basis of sequence similarities, groups bacterial, red algal chloroplast and cyanelle S6 ribosomal proteins.; GO: 0003735 structural constituent of ribosome, 0019843 rRNA binding, 0006412 translation, 0005840 ribosome; PDB: 3BBN_F 3R3T_B 3F1E_F 2QNH_g 2OW8_g 3PYQ_F 3PYS_F 3PYU_F 3MR8_F 3PYN_F ....
Probab=22.77 E-value=1.3e+02 Score=18.60 Aligned_cols=25 Identities=24% Similarity=0.489 Sum_probs=21.4
Q ss_pred HHHHHHHHHHHHHHHHHcCCCccce
Q psy1836 48 ENQAKQALTNMGHILKEAGGSYEHV 72 (127)
Q Consensus 48 ~~Q~~~~~~ni~~~L~~~G~~~~~v 72 (127)
+++..+.++++...+.+.|+...++
T Consensus 15 ~~~~~~~~~~~~~~i~~~gg~v~~~ 39 (92)
T PF01250_consen 15 EEEIKKLIERVKKIIEKNGGVVRSV 39 (92)
T ss_dssp HHHHHHHHHHHHHHHHHTTEEEEEE
T ss_pred HHHHHHHHHHHHHHHHHCCCEEEEE
Confidence 4699999999999999999866554
No 88
>PHA02754 hypothetical protein; Provisional
Probab=22.65 E-value=1.8e+02 Score=17.25 Aligned_cols=26 Identities=31% Similarity=0.482 Sum_probs=19.9
Q ss_pred HHHHHHHHHHHHHHHHHHcCCCccce
Q psy1836 47 VENQAKQALTNMGHILKEAGGSYEHV 72 (127)
Q Consensus 47 ~~~Q~~~~~~ni~~~L~~~G~~~~~v 72 (127)
++.....+++.++.+|.++|.=.+.|
T Consensus 12 ~eK~Fke~MRelkD~LSe~GiYi~RI 37 (67)
T PHA02754 12 MEKDFKEAMRELKDILSEAGIYIDRI 37 (67)
T ss_pred HHhHHHHHHHHHHHHHhhCceEEEEE
Confidence 45567778889999999999755544
No 89
>COG3797 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=22.59 E-value=1.6e+02 Score=21.34 Aligned_cols=70 Identities=23% Similarity=0.240 Sum_probs=39.1
Q ss_pred ECCEEEEccccccCCCCCcccccHHHHHHHHHHHHHHHHHHcCCCccceEEEEEEeCCCCCHHHHHHHHHhhcCC-CCCc
Q psy1836 24 HDNTLYISGVLGIDASSGKMAEGVENQAKQALTNMGHILKEAGGSYEHVVKTTILLNDINDFATVNNVYGQFFKP-PYPA 102 (127)
Q Consensus 24 ~g~~v~iSG~~~~d~~~g~~~~d~~~Q~~~~~~ni~~~L~~~G~~~~~vv~~~vyl~d~~~~~~~~~v~~~~f~~-~~Pa 102 (127)
.||++|.|... . ..++++++.+|+ +..|. ++.+.+.+.++|..+-+-.--+... ..|+
T Consensus 42 SGNlvf~s~~~---~------~el~~klE~afe------~~fg~------~~dilVrs~~~~~~i~a~nPf~~~~~~~~~ 100 (178)
T COG3797 42 SGNLVFESEAG---A------AELEAKLEAAFE------KRFGR------HVDILVRSAEDWETIAAANPFPEEAAGEPA 100 (178)
T ss_pred cCCEEEEcCCC---h------HHHHHHHHHHHH------HHcCC------CccEEEeeHHHHHHHHhcCCCchhhccCcc
Confidence 38888887643 1 356666666654 33454 3456666777776554432211111 1477
Q ss_pred cEEEEcCCCCCC
Q psy1836 103 RSTFQVGKLPLI 114 (127)
Q Consensus 103 ~t~v~V~~L~~~ 114 (127)
+.++-+-.-|.+
T Consensus 101 ~~~v~f~~ep~d 112 (178)
T COG3797 101 RVCVRFYREPLD 112 (178)
T ss_pred eEEEEEEcCCCc
Confidence 777776555543
No 90
>KOG0808|consensus
Probab=22.49 E-value=1.5e+02 Score=23.29 Aligned_cols=32 Identities=13% Similarity=0.141 Sum_probs=27.5
Q ss_pred ccHHHHHHHHHHHHHHHHHHcCCCccceEEEE
Q psy1836 45 EGVENQAKQALTNMGHILKEAGGSYEHVVKTT 76 (127)
Q Consensus 45 ~d~~~Q~~~~~~ni~~~L~~~G~~~~~vv~~~ 76 (127)
..+.+|++.+++.+++++++++..--+|+-+.
T Consensus 89 apv~eq~~aih~r~kaiieaaa~agvniiclq 120 (387)
T KOG0808|consen 89 APVSEQTRAIHDRLKAIIEAAAVAGVNIICLQ 120 (387)
T ss_pred CcHHHHHHHHHHHHHHHHHHHHhcCccEEEee
Confidence 57899999999999999999998777776554
No 91
>PF02120 Flg_hook: Flagellar hook-length control protein FliK; InterPro: IPR021136 This entry represents the C-terminal domain of the flagellar hook-length control protein FliK. This entry also includes YscP of the Yersinia type III secretion system, and equivalent proteins in other pathogenic bacterial type III secretion systems. During flagellar morphogenesis in Salmonella typhimurium and Escherichia coli, flagellar hook-length control protein (FliK) controls the length of the hook by directly measuring the hook length [, ]. It is considered unlikely that FliK functions as a molecular ruler for determining hook length, but that it is more likely to be employing a novel mechanism. The deduced amino acid sequences of FliK proteins from S. typhimurium and E. coli have molecular masses of 41,748 and 39,246 Da, respectively, and are fairly hydrophilic []. Sequence comparison reveals around 50% identity, with greatest conservation in the C-terminal region, with 71% identity in the last 154 amino acids - mutagenesis of this conserved region completely abolishes motility. The central and C-terminal regions are rich in proline and glutamine respectively; it is thought that they may constitute distinct domains [].; PDB: 2RRL_A.
Probab=22.20 E-value=82 Score=18.95 Aligned_cols=27 Identities=15% Similarity=0.171 Sum_probs=23.3
Q ss_pred ccHHHHHHHHHHHHHHHHHHcCCCccc
Q psy1836 45 EGVENQAKQALTNMGHILKEAGGSYEH 71 (127)
Q Consensus 45 ~d~~~Q~~~~~~ni~~~L~~~G~~~~~ 71 (127)
.+..+..++.+..|...|++.|....+
T Consensus 47 ~~~~~~L~~~~~~L~~~L~~~G~~~~~ 73 (85)
T PF02120_consen 47 PETKELLRQNLPELKERLQAQGLEVVN 73 (85)
T ss_dssp SHHHHHHHHTHHHHHHHHHTTT-EEEE
T ss_pred HHHHHHHHHHHHHHHHHHHHCCCCeEE
Confidence 579999999999999999999997665
No 92
>TIGR00329 gcp_kae1 metallohydrolase, glycoprotease/Kae1 family. This subfamily includes the well-studied secreted O-sialoglycoprotein endopeptidase (glycoprotease, EC 3.4.24.57) of Pasteurella haemolytica, a pathogen. A member from Riemerella anatipestifer, associated with cohemolysin activity, likewise is exported without benefit of a classical signal peptide and shows glycoprotease activity on the test substrate glycophorin. However, archaeal members of this subfamily show unrelated activities as demonstrated in Pyrococcus abyssi: DNA binding, iron binding, apurinic endonuclease activity, genomic association with a kinase domain, and no glycoprotease activity. This family thus pulls together a set of proteins as a homology group that appears to be near-universal in life, yet heterogeneous in assayed function between bacteria and archaea.
Probab=22.05 E-value=1.5e+02 Score=22.91 Aligned_cols=29 Identities=14% Similarity=0.105 Sum_probs=26.0
Q ss_pred ccHHHHHHHHHHHHHHHHHHcCCCccceE
Q psy1836 45 EGVENQAKQALTNMGHILKEAGGSYEHVV 73 (127)
Q Consensus 45 ~d~~~Q~~~~~~ni~~~L~~~G~~~~~vv 73 (127)
.....+.++...-++.+|+++|.+++||-
T Consensus 43 ~~~~~H~~~l~~~i~~~l~~~~~~~~did 71 (305)
T TIGR00329 43 EASRHHAENIPPLLERALIESNVDKSEID 71 (305)
T ss_pred hhHHHHHHHHHHHHHHHHHHcCCCHHHCC
Confidence 56788999999999999999999999873
No 93
>PF00438 S-AdoMet_synt_N: S-adenosylmethionine synthetase, N-terminal domain; InterPro: IPR022628 The three domains of S-adenosylmethionine synthetase have the same alpha+beta fold. This entry represents the N-terminal domain of S-adenosylmethionine synthetase and is found in association with PF02772 from PFAM and PF02773 from PFAM. S-adenosylmethionine synthetase (MAT, 2.5.1.6 from EC) is the enzyme that catalyzes the formation of S-adenosylmethionine (AdoMet) from methionine and ATP []. AdoMet is an important methyl donor for transmethylation and is also the propylamino donor in polyamine biosynthesis. In bacteria there is a single isoform of AdoMet synthetase (gene metK), there are two in budding yeast (genes SAM1 and SAM2) and in mammals while in plants there is generally a multigene family. The sequence of AdoMet synthetase is highly conserved throughout isozymes and species. The active sites of both the Escherichia coli and rat liver MAT reside between two subunits, with contributions from side chains of residues from both subunits, resulting in a dimer as the minimal catalytic entity. The side chains that contribute to the ligand binding sites are conserved between the two proteins. In the structures of complexes with the E. coli enzyme, the phosphate groups have the same positions in the (PPi plus Pi) complex and the (ADP plus Pi) complex, and are located at the bottom of a deep cavity with the adenosyl group nearer the entrance []; GO: 0004478 methionine adenosyltransferase activity; PDB: 3RV2_A 3TDE_B 3S82_B 3IML_B 2P02_A 2OBV_A 1QM4_B 1O9T_B 1O93_A 1O92_B ....
Probab=22.03 E-value=1.5e+02 Score=19.30 Aligned_cols=51 Identities=16% Similarity=0.219 Sum_probs=30.7
Q ss_pred CCcceEEE----ECCEEEEccccccCCCCCcccccHHHHHHHHHHHHHHHHHHcCCCccc
Q psy1836 16 ASAFQAVQ----HDNTLYISGVLGIDASSGKMAEGVENQAKQALTNMGHILKEAGGSYEH 71 (127)
Q Consensus 16 ~~ys~av~----~g~~v~iSG~~~~d~~~g~~~~d~~~Q~~~~~~ni~~~L~~~G~~~~~ 71 (127)
.|+++... .+++++++|-+.... ..|+++=+|.++++|.-.=.+.|.+++.
T Consensus 34 dp~arVA~E~~~~~~~v~i~GEi~~~~-----~vd~~~ivR~~i~~IGY~~~~~gfd~~t 88 (100)
T PF00438_consen 34 DPNARVACETLVSTGMVIIAGEITSRA-----YVDIEKIVREVIKDIGYDDSEYGFDYDT 88 (100)
T ss_dssp -TT-EEEEEEEEETTEEEEEEEEESSH-----HHTHHHHHHHHHHHHT-EEGGGTEETTT
T ss_pred CCCCeEEEEEEeeccEEEEEEEeccch-----hhhHHHHHHHHHHHhCCCCccCCCCCCc
Confidence 35666433 289999999886543 1678777777777665444444444443
No 94
>PRK05952 3-oxoacyl-(acyl carrier protein) synthase II; Reviewed
Probab=21.87 E-value=2.2e+02 Score=22.73 Aligned_cols=46 Identities=15% Similarity=0.137 Sum_probs=30.9
Q ss_pred HHHHHHHHHHHHcCCCccceEEEEEEeCCCC--CHHHHHHHHHhhcCCC
Q psy1836 53 QALTNMGHILKEAGGSYEHVVKTTILLNDIN--DFATVNNVYGQFFKPP 99 (127)
Q Consensus 53 ~~~~ni~~~L~~~G~~~~~vv~~~vyl~d~~--~~~~~~~v~~~~f~~~ 99 (127)
...+.++.+|+.+|.+.+||--+..+=+... |-.+.. ..++.|+..
T Consensus 255 ~~~~ai~~Al~~agi~p~dId~i~~hgtgt~~~D~~E~~-al~~~fg~~ 302 (381)
T PRK05952 255 SAIAAIQQCLARSGLTPEDIDYIHAHGTATRLNDQREAN-LIQALFPHR 302 (381)
T ss_pred HHHHHHHHHHHHhCCCHHHeeEEEccCCCCCCCcHHHHH-HHHHHcCCC
Confidence 3456778899999999999988887766543 333333 335566643
No 95
>PRK14646 hypothetical protein; Provisional
Probab=21.62 E-value=2.9e+02 Score=19.27 Aligned_cols=58 Identities=9% Similarity=0.078 Sum_probs=40.7
Q ss_pred HHHHHHHHHHHHHcCCCccceE--------EEEEEeCC-------CCCHHHHHHHHHhhcCC--CCCccEEEEcC
Q psy1836 52 KQALTNMGHILKEAGGSYEHVV--------KTTILLND-------INDFATVNNVYGQFFKP--PYPARSTFQVG 109 (127)
Q Consensus 52 ~~~~~ni~~~L~~~G~~~~~vv--------~~~vyl~d-------~~~~~~~~~v~~~~f~~--~~Pa~t~v~V~ 109 (127)
.++.+-++.++++.|..+-||- .+++|+.. ++|...+++...+++.. +.|..=.+.|+
T Consensus 7 ~~i~~li~p~~~~~G~eLvdve~~~~~~~~~LrV~IDk~~g~gVtldDC~~vSr~is~~LD~~D~i~~~Y~LEVS 81 (155)
T PRK14646 7 SKLEILLEKVANEFDLKICSLNIQTNQNPIVIKIIIKKTNGDDISLDDCALFNTPASEEIENSNLLNCSYVLEIS 81 (155)
T ss_pred HHHHHHHHHHHHHcCCEEEEEEEEeCCCCeEEEEEEECCCCCCccHHHHHHHHHHHHHHhCcCCCCCCCeEEEEc
Confidence 3555677788889998777763 26788742 45678888888898873 35556666666
No 96
>PF00814 Peptidase_M22: Glycoprotease family; InterPro: IPR000905 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M22 (clan MK). The type example being O-sialoglycoprotein endopeptidase (3.4.24.57 from EC) from Pasteurella haemolytica (Mannheimia haemolytica). O-Sialoglycoprotein endopeptidase is secreted by the bacterium P. haemolytica, and digests only proteins that are heavily sialylated, in particular those with sialylated serine and threonine residues []. Substrate proteins include glycophorin A and leukocyte surface antigens CD34, CD43, CD44 and CD45 [, ]. Removal of glycosylation, by treatment with neuraminidase, completely negates susceptibility to O-sialoglycoprotein endopeptidase digestion [, ]. Sequence similarity searches have revealed other members of the M22 family, from yeast, Mycobacterium, Haemophilus influenzae and the cyanobacterium Synechocystis []. The zinc-binding and catalytic residues of this family have not been determined, although the motif HMEGH may be a zinc-binding region [].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2A6A_A 2GEL_G 2GEM_B 1OKJ_B 3ENO_A 3EN9_B 2VWB_B 3ENH_B 2IVO_D 2IVP_A ....
Probab=21.49 E-value=2.1e+02 Score=21.67 Aligned_cols=41 Identities=17% Similarity=0.218 Sum_probs=29.3
Q ss_pred ccHHHHHHHHHHHHHHHHHHcCCCccceEEEEEEeCCCCCHH
Q psy1836 45 EGVENQAKQALTNMGHILKEAGGSYEHVVKTTILLNDINDFA 86 (127)
Q Consensus 45 ~d~~~Q~~~~~~ni~~~L~~~G~~~~~vv~~~vyl~d~~~~~ 86 (127)
.-.+...+..+..++.+|+++|.+++||-.+-+ -.....|.
T Consensus 24 ~a~r~H~~~L~~~i~~~l~~~~~~~~did~iav-t~GPGsft 64 (268)
T PF00814_consen 24 IASRQHSENLPPLIEELLKEAGISLSDIDAIAV-TRGPGSFT 64 (268)
T ss_dssp CHHHHHHHHHHHHHHHHHHHHTS-GGGESEEEE-EEESS-HH
T ss_pred HHHHHHHHHHHHHHHHHHHHcCCCHHHCCEEEE-ecCCCccc
Confidence 456778899999999999999999998855433 33444553
No 97
>PF02801 Ketoacyl-synt_C: Beta-ketoacyl synthase, C-terminal domain; InterPro: IPR014031 Beta-ketoacyl-ACP synthase 2.3.1.41 from EC (KAS) [] is the enzyme that catalyzes the condensation of malonyl-ACP with the growing fatty acid chain. It is found as a component of a number of enzymatic systems, including fatty acid synthetase (FAS), which catalyzes the formation of long-chain fatty acids from acetyl-CoA, malonyl-CoA and NADPH; the multi-functional 6-methysalicylic acid synthase (MSAS) from Penicillium patulum [], which is involved in the biosynthesis of a polyketide antibiotic; polyketide antibiotic synthase enzyme systems; Emericella nidulans multifunctional protein Wa, which is involved in the biosynthesis of conidial green pigment; Rhizobium nodulation protein nodE, which probably acts as a beta-ketoacyl synthase in the synthesis of the nodulation Nod factor fatty acyl chain; and yeast mitochondrial protein CEM1. The condensation reaction is a two step process, first the acyl component of an activated acyl primer is transferred to a cysteine residue of the enzyme and is then condensed with an activated malonyl donor with the concomitant release of carbon dioxide. This entry represents the C-terminal domain of beta-ketoacyl-ACP synthases. The active site is contained in a cleft betweeen N- and C-terminal domains, with residues from both domains contributing to substrate binding and catalysis [].; PDB: 2UV8_B 3HMJ_A 2VKZ_C 4EWG_A 1TQY_H 1E5M_A 1J3N_B 2VZ8_A 2VZ9_B 3O04_A ....
Probab=21.21 E-value=2.1e+02 Score=18.51 Aligned_cols=35 Identities=17% Similarity=0.041 Sum_probs=22.6
Q ss_pred ccHHHHHHHHHHHHHHHHHHcCCCccceEEEEEEeCCCC
Q psy1836 45 EGVENQAKQALTNMGHILKEAGGSYEHVVKTTILLNDIN 83 (127)
Q Consensus 45 ~d~~~Q~~~~~~ni~~~L~~~G~~~~~vv~~~vyl~d~~ 83 (127)
.+...|.+ .|+..|+.+|.+.+||--+..+=+...
T Consensus 22 p~~~~~~~----~i~~al~~agi~~~~I~~i~~hg~Gt~ 56 (119)
T PF02801_consen 22 PNGAALAR----AIRRALADAGISPEDIDYIEAHGTGTP 56 (119)
T ss_dssp TTHHHHHH----HHHHHHHHHTS-GGGEEEEE----SSH
T ss_pred cCHHHHHH----HHHHHHhhhccccccceeeeeeccccc
Confidence 45555543 389999999999999977777766554
No 98
>TIGR03725 bact_YeaZ universal bacterial protein YeaZ. This family describes a protein family, YeaZ, that appears to be universal in bacteria, but whose function is unknown. This family is related to the gcp (glycoprotease) protein family, also universal in bacteria and unknown in function. In Gram-positive lineages, members of these two related families often belong to the same operon, along with the ribosomal-protein-alanine acetyltransferase gene. Members of this family may occur as fusions with gcp or the ribosomal protein N-acetyltransferase rimI, and is frequently encoded next to rimI.
Probab=21.01 E-value=1.5e+02 Score=21.31 Aligned_cols=42 Identities=19% Similarity=0.225 Sum_probs=32.4
Q ss_pred cHHHHHHHHHHHHHHHHHHcCCCccceEEEEEEeCCCCCHHHH
Q psy1836 46 GVENQAKQALTNMGHILKEAGGSYEHVVKTTILLNDINDFATV 88 (127)
Q Consensus 46 d~~~Q~~~~~~ni~~~L~~~G~~~~~vv~~~vyl~d~~~~~~~ 88 (127)
.-..+.+..+..|+.+|+++|.+++|| ..-.+-.....|..+
T Consensus 29 ~~~~h~~~l~~~i~~~l~~~~~~~~~i-~~iav~~GPGSfTGl 70 (202)
T TIGR03725 29 AGRNHSEILLPMIEELLAEAGLSLQDL-DAIAVGVGPGSFTGL 70 (202)
T ss_pred hhHHHHHHHHHHHHHHHHHcCCCHHHC-CEEEEecCCChHHhH
Confidence 457799999999999999999999998 344445566666443
No 99
>PF10548 P22_AR_C: P22AR C-terminal domain; InterPro: IPR018876 This entry represents the carboxy-terminal domain of the Enterobacteria phage P22 antirepressor ((P03037 from SWISSPROT) []. It is found associated with IPR018875 from INTERPRO.
Probab=20.67 E-value=1.6e+02 Score=18.01 Aligned_cols=26 Identities=15% Similarity=0.212 Sum_probs=22.1
Q ss_pred cHHHHHHHHHHHHHHHHHHcCCCccc
Q psy1836 46 GVENQAKQALTNMGHILKEAGGSYEH 71 (127)
Q Consensus 46 d~~~Q~~~~~~ni~~~L~~~G~~~~~ 71 (127)
-.-.+++..++++...|++.|+.+.-
T Consensus 21 ~~~~~m~~~~~~l~p~L~~lgS~~a~ 46 (74)
T PF10548_consen 21 FAAERMRELCQELYPALKALGSNYAG 46 (74)
T ss_pred HHHHHHHHHHHHHHHHHHHhCCCcCc
Confidence 35678999999999999999997764
No 100
>PLN02606 palmitoyl-protein thioesterase
Probab=20.41 E-value=4.4e+02 Score=20.82 Aligned_cols=58 Identities=16% Similarity=0.105 Sum_probs=42.7
Q ss_pred ccccHHHHHHHHHHHHHHHHHHcCCCccceEEEEEEeCCCCCHHHHHHHHHhhcCCCCCccEEEEcC
Q psy1836 43 MAEGVENQAKQALTNMGHILKEAGGSYEHVVKTTILLNDINDFATVNNVYGQFFKPPYPARSTFQVG 109 (127)
Q Consensus 43 ~~~d~~~Q~~~~~~ni~~~L~~~G~~~~~vv~~~vyl~d~~~~~~~~~v~~~~f~~~~Pa~t~v~V~ 109 (127)
+..++.+|++.+.+.|.. .++..-+ +-+...+.=..+-+.+-+++++.+|.++.|-.+
T Consensus 72 ~~~~~~~Qv~~vce~l~~-~~~L~~G--------~naIGfSQGglflRa~ierc~~~p~V~nlISlg 129 (306)
T PLN02606 72 LFMPLRQQASIACEKIKQ-MKELSEG--------YNIVAESQGNLVARGLIEFCDNAPPVINYVSLG 129 (306)
T ss_pred cccCHHHHHHHHHHHHhc-chhhcCc--------eEEEEEcchhHHHHHHHHHCCCCCCcceEEEec
Confidence 447889999999999988 4444322 223345666788999999998878999998644
No 101
>PF02200 STE: STE like transcription factor; InterPro: IPR003120 This family consists of transcription factors related to STE and is found associated with the C2H2 zinc finger in some proteins.; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus
Probab=20.40 E-value=1.2e+02 Score=19.92 Aligned_cols=20 Identities=20% Similarity=0.285 Sum_probs=17.5
Q ss_pred CCCCcceEEEECCEEEEccc
Q psy1836 14 GSASAFQAVQHDNTLYISGV 33 (127)
Q Consensus 14 ~~~~ys~av~~g~~v~iSG~ 33 (127)
|.+.|-.+|-.+++.|++|+
T Consensus 10 ~~~e~isCV~Wn~l~~ItGT 29 (110)
T PF02200_consen 10 PNGEFISCVLWNNLFHITGT 29 (110)
T ss_pred CCCCEEEEEEEccEEEEecc
Confidence 55669999999999999996
No 102
>PF14359 DUF4406: Domain of unknown function (DUF4406)
Probab=20.15 E-value=1.6e+02 Score=18.61 Aligned_cols=57 Identities=11% Similarity=0.149 Sum_probs=32.7
Q ss_pred EEEccccccCCCCCcccccHHHHHHHHHHHHHHHHHHcCCCccceEEEEEEeCCCCCHHHHHHHHHhhc
Q psy1836 28 LYISGVLGIDASSGKMAEGVENQAKQALTNMGHILKEAGGSYEHVVKTTILLNDINDFATVNNVYGQFF 96 (127)
Q Consensus 28 v~iSG~~~~d~~~g~~~~d~~~Q~~~~~~ni~~~L~~~G~~~~~vv~~~vyl~d~~~~~~~~~v~~~~f 96 (127)
|||||-..- ..+.-+.+|.+..+.|++.|...-+=+.. ...+-.+++...++--+.+
T Consensus 1 iYIaGPmtG----------~~~~N~~~f~~~a~~L~~~G~~vvnPa~~--~~~~~~~~~~ym~~~l~~L 57 (92)
T PF14359_consen 1 IYIAGPMTG----------LPDYNRPAFNAAAKRLRAKGYEVVNPAEL--GIPEGLSWEEYMRICLAML 57 (92)
T ss_pred CeEeCCcCC----------CcchHHHHHHHHHHHHHHCCCEEeCchhh--CCCCCCCHHHHHHHHHHHH
Confidence 588885532 22456789999999999999432221111 2233334556555554444
No 103
>PTZ00112 origin recognition complex 1 protein; Provisional
Probab=20.10 E-value=4.8e+02 Score=24.45 Aligned_cols=65 Identities=20% Similarity=0.227 Sum_probs=34.1
Q ss_pred CCEEEEccccccCCCCCcccccHHHHHHHHHHHHHHHHHHcCCCccceEEEEEEeCCCCCHHHHHHHHHhhcCCC
Q psy1836 25 DNTLYISGVLGIDASSGKMAEGVENQAKQALTNMGHILKEAGGSYEHVVKTTILLNDINDFATVNNVYGQFFKPP 99 (127)
Q Consensus 25 g~~v~iSG~~~~d~~~g~~~~d~~~Q~~~~~~ni~~~L~~~G~~~~~vv~~~vyl~d~~~~~~~~~v~~~~f~~~ 99 (127)
++.+||+|..|.-. ..-++.+++.++..-+..+...=.++.++++-- ......+..+|.++++..
T Consensus 781 nnvLYIyG~PGTGK---------TATVK~VLrELqeeaeqk~lp~f~vVYINCm~L-stp~sIYqvI~qqL~g~~ 845 (1164)
T PTZ00112 781 NQILYISGMPGTGK---------TATVYSVIQLLQHKTKQKLLPSFNVFEINGMNV-VHPNAAYQVLYKQLFNKK 845 (1164)
T ss_pred CceEEEECCCCCCH---------HHHHHHHHHHHHHHHhhccCCCceEEEEeCCcc-CCHHHHHHHHHHHHcCCC
Confidence 46789999987532 134555666666544443332222333333210 123455666676666554
Done!