Query         psy1836
Match_columns 127
No_of_seqs    222 out of 1048
Neff          7.8 
Searched_HMMs 46136
Date          Fri Aug 16 17:36:10 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy1836.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/1836hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 cd06152 YjgF_YER057c_UK114_lik 100.0 1.5E-37 3.2E-42  209.5  12.9  107   16-122     1-111 (114)
  2 TIGR03610 RutC pyrimidine util 100.0 1.3E-36 2.8E-41  208.4  12.5  109   13-123    13-123 (127)
  3 cd02198 YjgH_like YjgH belongs 100.0 2.4E-36 5.2E-41  202.5  11.2  107   16-123     1-108 (111)
  4 cd06154 YjgF_YER057c_UK114_lik 100.0   6E-36 1.3E-40  202.8  12.7  112   10-122     5-117 (119)
  5 PRK11401 putative endoribonucl 100.0 4.8E-36   1E-40  206.0  12.3  112   11-123    10-124 (129)
  6 COG0251 TdcF Putative translat 100.0 6.1E-36 1.3E-40  205.8  12.9  112   10-123    11-125 (130)
  7 TIGR00004 endoribonuclease L-P 100.0 2.3E-35   5E-40  200.9  13.3  113   10-123     8-121 (124)
  8 PF01042 Ribonuc_L-PSP:  Endori 100.0 1.6E-35 3.5E-40  201.0  12.4  112   11-123     4-118 (121)
  9 cd06150 YjgF_YER057c_UK114_lik 100.0 2.7E-34 5.8E-39  190.8  11.9  101   16-122     1-102 (105)
 10 cd06156 eu_AANH_C_2 A group of 100.0 3.8E-33 8.3E-38  189.0  12.4  102   18-122     1-116 (118)
 11 cd02199 YjgF_YER057c_UK114_lik 100.0 1.1E-32 2.3E-37  192.2  13.1  116    7-123     5-140 (142)
 12 cd06153 YjgF_YER057c_UK114_lik 100.0 1.7E-32 3.6E-37  184.9  11.1  102   18-122     1-112 (114)
 13 cd06151 YjgF_YER057c_UK114_lik 100.0 5.3E-32 1.1E-36  185.2  12.5  104   18-122     1-124 (126)
 14 cd00448 YjgF_YER057c_UK114_fam 100.0 9.4E-31   2E-35  172.6  11.8  103   18-122     1-105 (107)
 15 cd06155 eu_AANH_C_1 A group of 100.0 6.3E-30 1.4E-34  168.8  11.2   94   22-122     4-98  (101)
 16 KOG2317|consensus              100.0 4.4E-28 9.6E-33  166.6   9.7  117    6-123    10-127 (138)
 17 PF14588 YjgF_endoribonc:  YjgF  99.6 1.5E-14 3.3E-19  100.9   9.7  111   12-123    15-145 (148)
 18 TIGR01884 cas_HTH CRISPR locus  66.0      28  0.0006   25.2   6.1   48   47-97     36-84  (203)
 19 PF02662 FlpD:  Methyl-viologen  57.7      55  0.0012   22.0   6.8   58   26-87     54-111 (124)
 20 KOG2240|consensus               56.9      20 0.00044   25.5   3.8   77   17-95     77-156 (162)
 21 KOG0100|consensus               53.7      41  0.0009   28.0   5.5   52   51-103   342-393 (663)
 22 PRK00394 transcription factor;  51.8      28  0.0006   25.1   3.9   38   27-70     50-87  (179)
 23 cd04918 ACT_AK1-AT_2 ACT domai  50.0      47   0.001   19.2   4.2   44   54-97     16-65  (65)
 24 PHA02094 hypothetical protein   46.8      48   0.001   20.2   3.8   52   70-121    23-80  (81)
 25 PF07066 DUF3882:  Lactococcus   45.3 1.1E+02  0.0024   21.7   5.9   49   20-73     21-69  (159)
 26 PF08868 YugN:  YugN-like famil  44.8      26 0.00057   24.1   2.8   30   33-63    103-132 (132)
 27 cd04518 TBP_archaea archaeal T  43.3      41 0.00089   24.1   3.7   37   27-69     51-87  (174)
 28 COG1908 FrhD Coenzyme F420-red  43.1   1E+02  0.0022   21.1   5.3   44   50-96     75-118 (132)
 29 PRK13689 hypothetical protein;  42.7      40 0.00087   20.9   3.1   23   48-70      9-31  (75)
 30 PF07736 CM_1:  Chorismate muta  42.2 1.1E+02  0.0023   20.7   5.7   41   47-87     15-57  (118)
 31 PF04088 Peroxin-13_N:  Peroxin  42.2      22 0.00047   25.2   2.1   27   40-66     13-39  (158)
 32 PRK14643 hypothetical protein;  41.7 1.3E+02  0.0027   21.4   6.8   65   46-110     3-86  (164)
 33 TIGR01796 CM_mono_aroH monofun  40.8 1.1E+02  0.0024   20.6   5.2   37   52-88     20-58  (117)
 34 cd04937 ACT_AKi-DapG-BS_2 ACT   39.3      75  0.0016   18.1   4.3   45   52-96     15-63  (64)
 35 PF00352 TBP:  Transcription fa  39.1      27 0.00058   21.7   2.0   20   47-66     67-86  (86)
 36 TIGR02671 cas_csx9 CRISPR-asso  38.7 1.5E+02  0.0033   23.3   6.3   47   45-102    15-61  (377)
 37 PF11230 DUF3029:  Protein of u  38.5      57  0.0012   27.2   4.2   45   46-90    410-460 (487)
 38 PF12262 Lipase_bact_N:  Bacter  38.5      80  0.0017   24.3   4.8   52   45-96    202-255 (268)
 39 cd02185 AroH Chorismate mutase  38.1 1.3E+02  0.0027   20.4   5.2   38   51-88     19-58  (117)
 40 COG1242 Predicted Fe-S oxidore  37.7   2E+02  0.0044   22.6   7.6   70   25-99     44-113 (312)
 41 cd04516 TBP_eukaryotes eukaryo  37.5      54  0.0012   23.5   3.6   37   27-69     51-87  (174)
 42 cd04517 TLF TBP-like factors (  37.2      60  0.0013   23.2   3.8   37   27-69     51-87  (174)
 43 TIGR02499 HrpE_YscL_not type I  35.6 1.4E+02  0.0031   20.3   6.2   48   74-124   115-162 (166)
 44 COG1022 FAA1 Long-chain acyl-C  35.1      29 0.00064   29.8   2.2   32   26-59    195-228 (613)
 45 COG3453 Uncharacterized protei  34.6     6.7 0.00014   26.8  -1.4   76   21-97      3-88  (130)
 46 PF00816 Histone_HNS:  H-NS his  34.3      21 0.00046   22.5   0.9   24   49-72     20-43  (93)
 47 PTZ00009 heat shock 70 kDa pro  32.5 1.7E+02  0.0037   25.2   6.4   53   45-98    302-358 (653)
 48 PF02108 FliH:  Flagellar assem  32.2 1.4E+02  0.0031   19.2   5.3   66   52-123    50-115 (128)
 49 KOG0813|consensus               31.9      52  0.0011   25.3   2.9   32   19-57    132-163 (265)
 50 KOG3887|consensus               31.8 1.3E+02  0.0027   23.6   4.8   44   52-98    158-204 (347)
 51 KOG1179|consensus               31.6 1.2E+02  0.0025   26.4   5.0   44   27-77    247-290 (649)
 52 KOG3010|consensus               31.4 1.1E+02  0.0025   23.4   4.6   44   52-99    116-159 (261)
 53 KOG1394|consensus               31.1 1.1E+02  0.0023   25.1   4.5   48   53-101   302-351 (440)
 54 cd00652 TBP_TLF TATA box bindi  30.7      78  0.0017   22.6   3.5   37   27-69     51-87  (174)
 55 PF01071 GARS_A:  Phosphoribosy  30.6      61  0.0013   23.7   3.0   53   28-81    124-177 (194)
 56 cd04915 ACT_AK-Ectoine_2 ACT d  30.3 1.2E+02  0.0025   17.6   4.3   45   53-97     16-66  (66)
 57 cd04919 ACT_AK-Hom3_2 ACT doma  30.0 1.1E+02  0.0023   17.2   4.2   44   53-96     16-65  (66)
 58 COG3082 Uncharacterized protei  29.7   1E+02  0.0022   18.9   3.2   23   48-70      9-31  (74)
 59 CHL00123 rps6 ribosomal protei  29.7      74  0.0016   20.4   3.0   24   48-71     20-43  (97)
 60 KOG2708|consensus               29.4 2.7E+02  0.0058   21.5   6.8   68   39-107    37-112 (336)
 61 PLN02633 palmitoyl protein thi  29.4 2.9E+02  0.0063   21.9   7.8   58   43-109    71-128 (314)
 62 COG2101 SPT15 TATA-box binding  29.4      76  0.0017   23.1   3.2   35   27-67    150-184 (185)
 63 PRK13411 molecular chaperone D  29.3 1.8E+02   0.004   25.1   6.0   52   45-98    297-353 (653)
 64 PF07108 PipA:  PipA protein;    29.1      56  0.0012   23.5   2.4   34   28-65     40-73  (200)
 65 PF01849 NAC:  NAC domain;  Int  28.3      55  0.0012   18.8   2.0   26   57-82      3-29  (58)
 66 PF02256 Fe_hyd_SSU:  Iron hydr  27.8      52  0.0011   19.2   1.8   20   80-99     24-43  (60)
 67 PF08309 LVIVD:  LVIVD repeat;   27.4      92   0.002   16.8   2.6   19   17-35      2-22  (42)
 68 PF13964 Kelch_6:  Kelch motif   27.3      76  0.0016   17.0   2.4   15   18-32      4-18  (50)
 69 PLN00062 TATA-box-binding prot  26.8 2.4E+02  0.0053   20.2   7.8   36   27-68     51-86  (179)
 70 cd04922 ACT_AKi-HSDH-ThrA_2 AC  26.8 1.2E+02  0.0027   16.8   4.1   44   53-96     16-65  (66)
 71 PF09658 Cas_Csx9:  CRISPR-asso  26.3 3.2E+02   0.007   21.7   6.3   47   45-102    15-61  (377)
 72 PF14552 Tautomerase_2:  Tautom  26.2 1.6E+02  0.0034   18.3   3.9   29   45-73     39-68  (82)
 73 PTZ00186 heat shock 70 kDa pre  26.2 2.6E+02  0.0057   24.3   6.4   53   45-98    323-379 (657)
 74 PTZ00400 DnaK-type molecular c  25.9 3.2E+02  0.0069   23.7   6.9   53   45-99    337-394 (663)
 75 COG0443 DnaK Molecular chapero  25.8 2.2E+02  0.0048   24.3   5.9   52   45-97    283-334 (579)
 76 PRK11678 putative chaperone; P  25.6 3.1E+02  0.0066   22.6   6.5   47   47-98    378-425 (450)
 77 PRK10328 DNA binding protein,   25.2      70  0.0015   22.0   2.3   26   49-74     52-77  (134)
 78 COG3655 Predicted transcriptio  25.2      85  0.0018   19.4   2.5   21   54-74     44-64  (73)
 79 PRK00290 dnaK molecular chaper  25.1 2.9E+02  0.0062   23.6   6.4   49   50-99    305-353 (627)
 80 PF06526 DUF1107:  Protein of u  24.5      88  0.0019   18.8   2.4   39   25-63     21-61  (64)
 81 PF01344 Kelch_1:  Kelch motif;  24.4      94   0.002   16.2   2.4   14   19-32      5-18  (47)
 82 PF08066 PMC2NT:  PMC2NT (NUC01  24.3      67  0.0014   20.2   2.0   27   45-71     61-87  (91)
 83 PF12205 GIT1_C:  G protein-cou  24.3 1.1E+02  0.0024   20.8   3.1   23   45-67     12-34  (123)
 84 PRK13410 molecular chaperone D  23.6 2.4E+02  0.0053   24.5   5.8   53   45-98    298-354 (668)
 85 TIGR02619 putative CRISPR-asso  23.4 2.2E+02  0.0049   19.8   4.6   28   72-99     34-61  (149)
 86 PF07490 Tir_receptor_N:  Trans  23.0      88  0.0019   23.4   2.6   35   25-59    151-185 (266)
 87 PF01250 Ribosomal_S6:  Ribosom  22.8 1.3E+02  0.0028   18.6   3.1   25   48-72     15-39  (92)
 88 PHA02754 hypothetical protein;  22.6 1.8E+02  0.0039   17.3   3.7   26   47-72     12-37  (67)
 89 COG3797 Uncharacterized protei  22.6 1.6E+02  0.0035   21.3   3.8   70   24-114    42-112 (178)
 90 KOG0808|consensus               22.5 1.5E+02  0.0031   23.3   3.8   32   45-76     89-120 (387)
 91 PF02120 Flg_hook:  Flagellar h  22.2      82  0.0018   19.0   2.1   27   45-71     47-73  (85)
 92 TIGR00329 gcp_kae1 metallohydr  22.1 1.5E+02  0.0032   22.9   3.9   29   45-73     43-71  (305)
 93 PF00438 S-AdoMet_synt_N:  S-ad  22.0 1.5E+02  0.0034   19.3   3.4   51   16-71     34-88  (100)
 94 PRK05952 3-oxoacyl-(acyl carri  21.9 2.2E+02  0.0049   22.7   5.0   46   53-99    255-302 (381)
 95 PRK14646 hypothetical protein;  21.6 2.9E+02  0.0063   19.3   6.2   58   52-109     7-81  (155)
 96 PF00814 Peptidase_M22:  Glycop  21.5 2.1E+02  0.0045   21.7   4.5   41   45-86     24-64  (268)
 97 PF02801 Ketoacyl-synt_C:  Beta  21.2 2.1E+02  0.0045   18.5   4.0   35   45-83     22-56  (119)
 98 TIGR03725 bact_YeaZ universal   21.0 1.5E+02  0.0032   21.3   3.5   42   46-88     29-70  (202)
 99 PF10548 P22_AR_C:  P22AR C-ter  20.7 1.6E+02  0.0035   18.0   3.1   26   46-71     21-46  (74)
100 PLN02606 palmitoyl-protein thi  20.4 4.4E+02  0.0095   20.8   7.5   58   43-109    72-129 (306)
101 PF02200 STE:  STE like transcr  20.4 1.2E+02  0.0026   19.9   2.6   20   14-33     10-29  (110)
102 PF14359 DUF4406:  Domain of un  20.2 1.6E+02  0.0036   18.6   3.2   57   28-96      1-57  (92)
103 PTZ00112 origin recognition co  20.1 4.8E+02    0.01   24.5   6.9   65   25-99    781-845 (1164)

No 1  
>cd06152 YjgF_YER057c_UK114_like_4 YjgF, YER057c, and UK114 belong to a large family of proteins present in bacteria, archaea, and eukaryotes with no definitive function.  The conserved domain is similar in structure to chorismate mutase but there is no sequence similarity and no functional connection. Members of this family have been implicated in isoleucine (Yeo7, Ibm1, aldR) and purine (YjgF) biosynthesis, as well as threonine anaerobic degradation (tdcF) and mitochondrial DNA maintenance (Ibm1). This domain homotrimerizes forming a distinct intersubunit cavity that may serve as a small molecule binding site.
Probab=100.00  E-value=1.5e-37  Score=209.51  Aligned_cols=107  Identities=27%  Similarity=0.335  Sum_probs=102.2

Q ss_pred             CCcceEEEECCEEEEccccccCCCCCcccccHHHHHHHHHHHHHHHHHHcC-CCccceEEEEEEeCCC---CCHHHHHHH
Q psy1836          16 ASAFQAVQHDNTLYISGVLGIDASSGKMAEGVENQAKQALTNMGHILKEAG-GSYEHVVKTTILLNDI---NDFATVNNV   91 (127)
Q Consensus        16 ~~ys~av~~g~~v~iSG~~~~d~~~g~~~~d~~~Q~~~~~~ni~~~L~~~G-~~~~~vv~~~vyl~d~---~~~~~~~~v   91 (127)
                      .+|||++++|+++|+|||+|.|++.+.+++|+++|++++|+||+++|+++| ++++||+|+|+|++|+   ++++.++++
T Consensus         1 ~~ys~av~~g~~v~~SGq~g~d~~g~~~~~d~~~Q~~~~~~Nl~~~L~~aG~~~~~dVvk~tvyltd~~~~~~~~~~~~~   80 (114)
T cd06152           1 LHYSQAVRIGDRIEISGQGGWDPDTGKIPEDLEEEIDQAFDNVELALKAAGGKGWEQVYKVNSYHVDIKNEEAFGLMVEN   80 (114)
T ss_pred             CCCCCeEEECCEEEEeccCCcCCCCCccCcCHHHHHHHHHHHHHHHHHHhCCCCHHHEEEEEEEEecCCcHHHHHHHHHH
Confidence            379999999999999999999995557779999999999999999999999 9999999999999999   799999999


Q ss_pred             HHhhcCCCCCccEEEEcCCCCCCCEEEEEec
Q psy1836          92 YGQFFKPPYPARSTFQVGKLPLIDFCLVVLG  122 (127)
Q Consensus        92 ~~~~f~~~~Pa~t~v~V~~L~~~~~liEi~~  122 (127)
                      |.+||++++|+||+++|++|+.|+++||||+
T Consensus        81 ~~~~f~~~~Pa~t~v~V~~L~~p~~lVEIe~  111 (114)
T cd06152          81 FKKWMPNHQPIWTCVGVTALGLPGMRVEIEV  111 (114)
T ss_pred             HHHHcCCCCCCeEEEEeccCCCCCcEEEEEE
Confidence            9999999999999999999999999999996


No 2  
>TIGR03610 RutC pyrimidine utilization protein C. This protein is observed in operons extremely similar to that characterized in E. coli K-12 responsible for the import and catabolism of pyrimidines, primarily uracil. This protein is a member of the endoribonuclease L-PSP family defined by pfam01042.
Probab=100.00  E-value=1.3e-36  Score=208.38  Aligned_cols=109  Identities=25%  Similarity=0.415  Sum_probs=102.1

Q ss_pred             cCCCCcceEEEECCEEEEccccccCCCCCccc--ccHHHHHHHHHHHHHHHHHHcCCCccceEEEEEEeCCCCCHHHHHH
Q psy1836          13 VGSASAFQAVQHDNTLYISGVLGIDASSGKMA--EGVENQAKQALTNMGHILKEAGGSYEHVVKTTILLNDINDFATVNN   90 (127)
Q Consensus        13 ~~~~~ys~av~~g~~v~iSG~~~~d~~~g~~~--~d~~~Q~~~~~~ni~~~L~~~G~~~~~vv~~~vyl~d~~~~~~~~~   90 (127)
                      .+.++|||++++|+++|+|||+|.|+ +|...  +|+++|++++|+||+++|+++|++++||+|+++|++|+++|+.+++
T Consensus        13 ~~~~~ys~av~~g~~v~vSGq~~~d~-~g~~~~~~d~~~Q~~~~l~ni~~iL~~aG~~~~dvv~~~iyl~d~~~~~~~~~   91 (127)
T TIGR03610        13 KPLAPFVPGTLADGVVYVSGTLPFDK-DNNVVHVGDAAAQTRHVLETIKSVIETAGGTMDDVTFNHIFIRDWADYAAINE   91 (127)
T ss_pred             CCCCCCCCeEEECCEEEEeccCCcCC-CCCeeCCCCHHHHHHHHHHHHHHHHHHcCCCHHHEEEEEEEEcCHHHHHHHHH
Confidence            35668999999999999999999999 77764  7999999999999999999999999999999999999999999999


Q ss_pred             HHHhhcCCCCCccEEEEcCCCCCCCEEEEEecc
Q psy1836          91 VYGQFFKPPYPARSTFQVGKLPLIDFCLVVLGG  123 (127)
Q Consensus        91 v~~~~f~~~~Pa~t~v~V~~L~~~~~liEi~~~  123 (127)
                      +|.+||++++|+||+++| +|..|+++||||+-
T Consensus        92 ~~~~~f~~~~Pa~t~v~v-~l~~p~~lVEIe~v  123 (127)
T TIGR03610        92 VYAEYFPGEKPARYCIQC-GLVKPDALVEIASV  123 (127)
T ss_pred             HHHHHcCCCCCcEEEEEe-ccCCCCCEEEEEEE
Confidence            999999988999999998 57789999999973


No 3  
>cd02198 YjgH_like YjgH belongs to a large family of YjgF/YER057c/UK114-like proteins present in bacteria, archaea, and eukaryotes with no definitive function. The conserved domain is similar in structure to chorismate mutase but there is no sequence similarity and no functional connection. Members of this family have been implicated in isoleucine (Yeo7, Ibm1, aldR) and purine (YjgF) biosynthesis, as well as threonine anaerobic degradation (tdcF) and mitochondrial DNA maintenance (Ibm1). This domain homotrimerizes forming a distinct intersubunit cavity that may serve as a small molecule binding site.
Probab=100.00  E-value=2.4e-36  Score=202.48  Aligned_cols=107  Identities=32%  Similarity=0.476  Sum_probs=102.3

Q ss_pred             CCcceEEEECCEEEEccccccCCCCCcccccHHHHHHHHHHHHHHHHHHcCCCccceEEEEEEeCCC-CCHHHHHHHHHh
Q psy1836          16 ASAFQAVQHDNTLYISGVLGIDASSGKMAEGVENQAKQALTNMGHILKEAGGSYEHVVKTTILLNDI-NDFATVNNVYGQ   94 (127)
Q Consensus        16 ~~ys~av~~g~~v~iSG~~~~d~~~g~~~~d~~~Q~~~~~~ni~~~L~~~G~~~~~vv~~~vyl~d~-~~~~~~~~v~~~   94 (127)
                      .+||+++++|+++|+|||+|.|+ +|++++|+++|++++|+||+++|+++|++++||+|+++|++|. ++++.++++|.+
T Consensus         1 ~~ys~av~~g~~l~vSGq~~~d~-~g~~~~d~~~Q~~~~~~ni~~~L~~aG~~~~dvvk~~vyl~~~~~~~~~~~~~~~~   79 (111)
T cd02198           1 FGYSPAVRVGDTLFVSGQVGSDA-DGSVAEDFEAQFRLAFQNLGAVLEAAGCSFDDVVELTTFHVDMAAHLPAFAAVKDE   79 (111)
T ss_pred             CCCcceEEECCEEEEecccCcCC-CCCcCCCHHHHHHHHHHHHHHHHHHcCCCHHHEEEEEEEEeccHHHHHHHHHHHHH
Confidence            37999999999999999999999 8888899999999999999999999999999999999999965 899999999999


Q ss_pred             hcCCCCCccEEEEcCCCCCCCEEEEEecc
Q psy1836          95 FFKPPYPARSTFQVGKLPLIDFCLVVLGG  123 (127)
Q Consensus        95 ~f~~~~Pa~t~v~V~~L~~~~~liEi~~~  123 (127)
                      ||++++|+||+++|++|+.|+++||||+-
T Consensus        80 ~f~~~~Pa~t~v~V~~L~~~~~~vEIe~~  108 (111)
T cd02198          80 YFKEPYPAWTAVGVAWLARPGLLVEIKVV  108 (111)
T ss_pred             HcCCCCCceehhhhhhcCCCCcEEEEEEE
Confidence            99988999999999999999999999973


No 4  
>cd06154 YjgF_YER057c_UK114_like_6 This group of proteins belong to a large family of YjgF/YER057c/UK114-like proteins present in bacteria, archaea, and eukaryotes with no definitive function.  The conserved domain is similar in structure to chorismate mutase but there is no sequence similarity and no functional connection. Members of this family have been implicated in isoleucine (Yeo7, Ibm1, aldR) and purine (YjgF) biosynthesis, as well as threonine anaerobic degradation (tdcF) and mitochondrial DNA maintenance (Ibm1). This domain homotrimerizes forming a distinct intersubunit cavity that may serve as a small molecule binding site.
Probab=100.00  E-value=6e-36  Score=202.82  Aligned_cols=112  Identities=29%  Similarity=0.359  Sum_probs=105.7

Q ss_pred             ccccCCCCcceEEEECCEEEEccccccCCCCCcc-cccHHHHHHHHHHHHHHHHHHcCCCccceEEEEEEeCCCCCHHHH
Q psy1836          10 VSLVGSASAFQAVQHDNTLYISGVLGIDASSGKM-AEGVENQAKQALTNMGHILKEAGGSYEHVVKTTILLNDINDFATV   88 (127)
Q Consensus        10 ~~~~~~~~ys~av~~g~~v~iSG~~~~d~~~g~~-~~d~~~Q~~~~~~ni~~~L~~~G~~~~~vv~~~vyl~d~~~~~~~   88 (127)
                      ..+.+.++|||++++|+++|+|||+|.|+ +|.. ++|+++|++++|+||+++|+++|++++||+|+++|++|+++|+.+
T Consensus         5 ~~~~~~~~ys~av~~g~~l~vSGq~~~d~-~~~~~~~d~~~Q~~~~~~ni~~~L~~aG~~~~dVvk~~vyl~d~~~~~~~   83 (119)
T cd06154           5 SPWEEQAGYSRAVRVGNWVFVSGTTGYDY-DGMVMPGDAYEQTRQCLEIIEAALAEAGASLEDVVRTRMYVTDIADFEAV   83 (119)
T ss_pred             CCcccccCcccEEEECCEEEEeCcCcCCC-CCCCCCCCHHHHHHHHHHHHHHHHHHcCCCHHHEEEEEEEECCHHHHHHH
Confidence            45667889999999999999999999999 5544 489999999999999999999999999999999999999999999


Q ss_pred             HHHHHhhcCCCCCccEEEEcCCCCCCCEEEEEec
Q psy1836          89 NNVYGQFFKPPYPARSTFQVGKLPLIDFCLVVLG  122 (127)
Q Consensus        89 ~~v~~~~f~~~~Pa~t~v~V~~L~~~~~liEi~~  122 (127)
                      +++|.+||++++|+||+++|..|+.|+++||||+
T Consensus        84 ~~~~~~~f~~~~Part~v~v~~L~~~~~lVEIe~  117 (119)
T cd06154          84 GRAHGEVFGDIRPAATMVVVSLLVDPEMLVEIEV  117 (119)
T ss_pred             HHHHHHHcCCCCCceEEEEecccCCCCcEEEEEE
Confidence            9999999999999999999999999999999996


No 5  
>PRK11401 putative endoribonuclease L-PSP; Provisional
Probab=100.00  E-value=4.8e-36  Score=205.98  Aligned_cols=112  Identities=34%  Similarity=0.538  Sum_probs=104.6

Q ss_pred             cccCCCCcceEEEECCEEEEccccccCCCCCcccccHHHHHHHHHHHHHHHHHHcCCCccceEEEEEEeCCCCCHHHHHH
Q psy1836          11 SLVGSASAFQAVQHDNTLYISGVLGIDASSGKMAEGVENQAKQALTNMGHILKEAGGSYEHVVKTTILLNDINDFATVNN   90 (127)
Q Consensus        11 ~~~~~~~ys~av~~g~~v~iSG~~~~d~~~g~~~~d~~~Q~~~~~~ni~~~L~~~G~~~~~vv~~~vyl~d~~~~~~~~~   90 (127)
                      .+.+.++|||++++|+++|+|||+|.|+.+|++++|+++|++++|+||+++|+++|++++||+|+++|++|+++|+.+++
T Consensus        10 ~~~~~~~ys~av~~g~~v~vSGq~~~d~~~~~~~~d~~~Q~~~~~~ni~~~L~aaG~~~~~Vvk~~vyl~d~~~~~~~~~   89 (129)
T PRK11401         10 APGAIGPYVQGVDLGSMVFTSGQIPVCPQTGEIPADVQDQARLSLENVKAIVVAAGLSVGDIIKMTVFITDLNDFATINE   89 (129)
T ss_pred             CCCCCCCccceEEECCEEEEcCcCCccCCCCccCcCHHHHHHHHHHHHHHHHHHcCCCHHHEEEEEEEEccHHHHHHHHH
Confidence            34567899999999999999999999986677779999999999999999999999999999999999999999999999


Q ss_pred             HHHhhcCCC---CCccEEEEcCCCCCCCEEEEEecc
Q psy1836          91 VYGQFFKPP---YPARSTFQVGKLPLIDFCLVVLGG  123 (127)
Q Consensus        91 v~~~~f~~~---~Pa~t~v~V~~L~~~~~liEi~~~  123 (127)
                      +|.+||+++   +|+||+++|+.|+ ++++||||+-
T Consensus        90 v~~~~f~~~~~~~Part~v~v~~L~-~~~~VEIe~~  124 (129)
T PRK11401         90 VYKQFFDEHQATYPTRSCVQVARLP-KDVKLEIEAI  124 (129)
T ss_pred             HHHHHhCCCCCCCCceEEEEcccCC-CCCeEEEEEE
Confidence            999999974   8999999999997 7899999974


No 6  
>COG0251 TdcF Putative translation initiation inhibitor, yjgF family [Translation, ribosomal structure and biogenesis]
Probab=100.00  E-value=6.1e-36  Score=205.81  Aligned_cols=112  Identities=37%  Similarity=0.586  Sum_probs=105.8

Q ss_pred             ccccCCCCcceEEEECCEEEEccccccCCCCCccc--ccHHHHHHHHHHHHHHHHHHcCCCccceEEEEEEeCCCCCHHH
Q psy1836          10 VSLVGSASAFQAVQHDNTLYISGVLGIDASSGKMA--EGVENQAKQALTNMGHILKEAGGSYEHVVKTTILLNDINDFAT   87 (127)
Q Consensus        10 ~~~~~~~~ys~av~~g~~v~iSG~~~~d~~~g~~~--~d~~~Q~~~~~~ni~~~L~~~G~~~~~vv~~~vyl~d~~~~~~   87 (127)
                      ..+.+.++|||++++|+++|+|||+|.|+ +|+..  +|+++|++++|+||+++|+++|++++||+|+++|++|+++|+.
T Consensus        11 ~~~~~~~~yS~av~~~~~vfvSGQi~~~~-~g~~v~~~d~~~Q~~~~l~ni~a~L~~aG~~~~~Vvk~~v~l~d~~~f~~   89 (130)
T COG0251          11 NAPAPIGPYSQAVVAGGLVFVSGQIPLDP-TGELVGGEDIEAQTRQALANIKAVLEAAGSTLDDVVKVTVFLTDMNDFAA   89 (130)
T ss_pred             CCCCCCCCccceEEECCEEEEeCcCCcCC-CCcccCCCCHHHHHHHHHHHHHHHHHHcCCCHHHEEEEEEEecCchHHHH
Confidence            44556779999999999999999999999 88888  5999999999999999999999999999999999999999999


Q ss_pred             HHHHHHhhcCCC-CCccEEEEcCCCCCCCEEEEEecc
Q psy1836          88 VNNVYGQFFKPP-YPARSTFQVGKLPLIDFCLVVLGG  123 (127)
Q Consensus        88 ~~~v~~~~f~~~-~Pa~t~v~V~~L~~~~~liEi~~~  123 (127)
                      +|++|.+||+.+ +|+||+|+|+.|| |+++||||+-
T Consensus        90 ~n~v~~~~f~~~~~PArs~V~v~~l~-~~~~VEIeai  125 (130)
T COG0251          90 MNEVYDEFFEVGGYPARSAVGVALLP-PDALVEIEAI  125 (130)
T ss_pred             HHHHHHHHhccCCCCceeEEEhhhCC-CCCeEEEEEE
Confidence            999999999976 9999999999998 9999999984


No 7  
>TIGR00004 endoribonuclease L-PSP, putative. This protein was described initially as an inhibitor of protein synthesis intiation but is now viewed as an endoribonuclease active on single-stranded mRNA. The cleavage of mRNA is responsible for the inhibition of protein synthesis. A role in purine regulation has also been suggested.
Probab=100.00  E-value=2.3e-35  Score=200.92  Aligned_cols=113  Identities=43%  Similarity=0.659  Sum_probs=105.7

Q ss_pred             ccccCCCCcceEEEECCEEEEccccccCCCCCccc-ccHHHHHHHHHHHHHHHHHHcCCCccceEEEEEEeCCCCCHHHH
Q psy1836          10 VSLVGSASAFQAVQHDNTLYISGVLGIDASSGKMA-EGVENQAKQALTNMGHILKEAGGSYEHVVKTTILLNDINDFATV   88 (127)
Q Consensus        10 ~~~~~~~~ys~av~~g~~v~iSG~~~~d~~~g~~~-~d~~~Q~~~~~~ni~~~L~~~G~~~~~vv~~~vyl~d~~~~~~~   88 (127)
                      ..+.+.++|||++++|+++|+|||+|.|+.+|+++ +|+++|++++|+||+++|+++|++++||+|+++|++|+++|+.+
T Consensus         8 ~~~~~~~~ys~av~~g~~v~vSGq~~~~~~~g~~~~~d~~~Q~~~~~~ni~~~L~~aG~~~~dvv~~~vyv~~~~~~~~~   87 (124)
T TIGR00004         8 KAPAAIGPYSQAVKVGNTLFVSGQIPLDPSTGELVGGDIAEQAEQVLENLKAILEAAGLSLDDVVKTTVFLTDLNDFAEV   87 (124)
T ss_pred             CCCCCCCCCcceEEECCEEEEeeeCCCcCCCCcCCCCCHHHHHHHHHHHHHHHHHHcCCCHHHEEEEEEEEeChHHHHHH
Confidence            34457789999999999999999999999558887 89999999999999999999999999999999999999999999


Q ss_pred             HHHHHhhcCCCCCccEEEEcCCCCCCCEEEEEecc
Q psy1836          89 NNVYGQFFKPPYPARSTFQVGKLPLIDFCLVVLGG  123 (127)
Q Consensus        89 ~~v~~~~f~~~~Pa~t~v~V~~L~~~~~liEi~~~  123 (127)
                      +++|.+||++++|+||+++|..|+ ++++||||+-
T Consensus        88 ~~~~~~~f~~~~Pa~t~v~v~~L~-~~~~vEIe~v  121 (124)
T TIGR00004        88 NEVYGQYFDEPYPARSAVQVAALP-KGVLVEIEAI  121 (124)
T ss_pred             HHHHHHHcCCCCCceEEEECccCC-CCCEEEEEEE
Confidence            999999999889999999999997 7999999973


No 8  
>PF01042 Ribonuc_L-PSP:  Endoribonuclease L-PSP;  InterPro: IPR006175  This domain is found in endoribonuclease, that is active on single-stranded mRNA and inhibits protein synthesis by cleavage of mRNA []. Previously it was thought to inhibit protein synthesis initiation []. This endoribonuclease may also be involved in the regulation of purine biosynthesis []. ; PDB: 3GTZ_B 3V4D_E 1J7H_A 3R0P_D 2IG8_A 1QD9_B 3L7Q_E 3VCZ_A 3QUW_A 2EWC_K ....
Probab=100.00  E-value=1.6e-35  Score=201.02  Aligned_cols=112  Identities=38%  Similarity=0.571  Sum_probs=103.1

Q ss_pred             cccCCCCcceEEEECCEEEEccccccCCCCCcc-cccHHHHHHHHHHHHHHHHHHcCCCccceEEEEEEeCCCCCHHHHH
Q psy1836          11 SLVGSASAFQAVQHDNTLYISGVLGIDASSGKM-AEGVENQAKQALTNMGHILKEAGGSYEHVVKTTILLNDINDFATVN   89 (127)
Q Consensus        11 ~~~~~~~ys~av~~g~~v~iSG~~~~d~~~g~~-~~d~~~Q~~~~~~ni~~~L~~~G~~~~~vv~~~vyl~d~~~~~~~~   89 (127)
                      .+.|.++|||+++.|+++|+|||+|.|+.+|++ ++|+++|++++|+||+++|+++|++++||+|+++|++|+++|+.++
T Consensus         4 a~~p~~~Ys~av~~g~~v~isGq~~~d~~~~~~~~~~~~~Q~~~~l~ni~~~L~~~G~~~~dvv~~~~yl~d~~~~~~~~   83 (121)
T PF01042_consen    4 APEPIGPYSQAVRAGDTVFISGQVGIDPATGQVVPGDIEEQTRQALDNIERILAAAGASLDDVVKVTVYLTDMSDFPAVN   83 (121)
T ss_dssp             SCCCSSSSBSEEEETTEEEEEEEESBCTTTSSBSSSSHHHHHHHHHHHHHHHHHHTTS-GGGEEEEEEEESSGGGHHHHH
T ss_pred             CCCCCCCCCCEEEECCEEEEeeeCCcCCCCCcCCCCCHHHHHHHHHHhhhhhhhcCCCcceeEeeeeehhhhhhhhHHHH
Confidence            456899999999999999999999999878888 5999999999999999999999999999999999999999999999


Q ss_pred             HHHHhhcCCC--CCccEEEEcCCCCCCCEEEEEecc
Q psy1836          90 NVYGQFFKPP--YPARSTFQVGKLPLIDFCLVVLGG  123 (127)
Q Consensus        90 ~v~~~~f~~~--~Pa~t~v~V~~L~~~~~liEi~~~  123 (127)
                      ++|.+||++.  +|+||+++|..|+ ++++|||++-
T Consensus        84 ~v~~~~f~~~~~~Pa~t~v~v~~L~-~~~~vEIe~~  118 (121)
T PF01042_consen   84 EVWKEFFPDHPHRPARTTVGVSALP-PGALVEIEAI  118 (121)
T ss_dssp             HHHHHHSTSSTS--EEEEEEESBSG-GG-SEEEEEE
T ss_pred             HHHHHHhcccCCCCcEEEEEeCcCC-CCCcEEEEEE
Confidence            9999999988  4999999999999 9999999973


No 9  
>cd06150 YjgF_YER057c_UK114_like_2 This group of proteins belong to a large family of YjgF/YER057c/UK114-like proteins present in bacteria, archaea, and eukaryotes with no definitive function.  The conserved domain is similar in structure to chorismate mutase but there is no sequence similarity and no functional connection. Members of this family have been implicated in isoleucine (Yeo7, Ibm1, aldR) and purine (YjgF) biosynthesis, as well as threonine anaerobic degradation (tdcF) and mitochondrial DNA maintenance (Ibm1). This domain homotrimerizes forming a distinct intersubunit cavity that may serve as a small molecule binding site.
Probab=100.00  E-value=2.7e-34  Score=190.82  Aligned_cols=101  Identities=32%  Similarity=0.527  Sum_probs=95.8

Q ss_pred             CCcceEEEECCEEEEccccccCCCCCcccccHHHHHHHHHHHHHHHHHHcCCCccceEEEEEEeCCCCCHHHHHHHHHhh
Q psy1836          16 ASAFQAVQHDNTLYISGVLGIDASSGKMAEGVENQAKQALTNMGHILKEAGGSYEHVVKTTILLNDINDFATVNNVYGQF   95 (127)
Q Consensus        16 ~~ys~av~~g~~v~iSG~~~~d~~~g~~~~d~~~Q~~~~~~ni~~~L~~~G~~~~~vv~~~vyl~d~~~~~~~~~v~~~~   95 (127)
                      .+|||++++|+++|+|||+|.|+ +    +|+++|++++|+||+++|+++|++++||+|+++|++|+++++.++++|.+|
T Consensus         1 ~~~s~av~~g~~v~iSGq~~~~~-~----~~~~~Q~~~~~~nl~~~L~~~G~~~~dvvk~~vyl~d~~~~~~~~~~~~~~   75 (105)
T cd06150           1 ARMSQAVVHNGTVYLAGQVADDT-S----ADITGQTRQVLAKIDALLAEAGSDKSRILSATIWLADMADFAAMNAVWDAW   75 (105)
T ss_pred             CCcCCEEEECCEEEEeCcCCcCC-C----CCHHHHHHHHHHHHHHHHHHcCCCHHHEEEEEEEEccHHHHHHHHHHHHHH
Confidence            37999999999999999999997 3    799999999999999999999999999999999999999999999999999


Q ss_pred             cCC-CCCccEEEEcCCCCCCCEEEEEec
Q psy1836          96 FKP-PYPARSTFQVGKLPLIDFCLVVLG  122 (127)
Q Consensus        96 f~~-~~Pa~t~v~V~~L~~~~~liEi~~  122 (127)
                      |++ ++|+||++++ .|..|+++||||+
T Consensus        76 f~~~~~Pa~t~v~~-~l~~~~~lvEIe~  102 (105)
T cd06150          76 VPPGHAPARACVEA-KLADPGYLVEIVV  102 (105)
T ss_pred             cCCCCCCCeEEEEe-cccCCCCEEEEEE
Confidence            997 6999999996 6888999999997


No 10 
>cd06156 eu_AANH_C_2 A group of hypothetical eukaryotic proteins, characterized by the presence of an adenine nucleotide alpha hydrolase (AANH)-like domain located N-terminal to two distinctly different YjgF-YER057c-UK114-like domains. This CD contains the second of these domains. The YjgF-YER057c-UK114 protein family is a large family of proteins present in bacteria, archaea, and eukaryotes with no definitive function.  The conserved domain is similar in structure to chorismate mutase but there is no sequence similarity and no functional connection. Members of this family have been implicated in isoleucine (Yeo7, Ibm1, aldR) and purine (YjgF) biosynthesis, as well as threonine anaerobic degradation (tdcF) and mitochondrial DNA maintenance (Ibm1). This domain homotrimerizes forming a distinct intersubunit cavity that may serve as a small molecule binding site.
Probab=100.00  E-value=3.8e-33  Score=189.03  Aligned_cols=102  Identities=19%  Similarity=0.253  Sum_probs=96.5

Q ss_pred             cceEEEECCEEEEccccccCCCCCccc-ccHHHHHHHHHHHHHHHHHHcCCCccceEEEEEEeCCCCCHHHHHHHHHhhc
Q psy1836          18 AFQAVQHDNTLYISGVLGIDASSGKMA-EGVENQAKQALTNMGHILKEAGGSYEHVVKTTILLNDINDFATVNNVYGQFF   96 (127)
Q Consensus        18 ys~av~~g~~v~iSG~~~~d~~~g~~~-~d~~~Q~~~~~~ni~~~L~~~G~~~~~vv~~~vyl~d~~~~~~~~~v~~~~f   96 (127)
                      |||++++|+++|+|||+|.|+.++.++ +|+++|++++|+||+++|+++|+  +||+|+++|++|+++++.++++|.+||
T Consensus         1 yS~av~~~~~i~vSGQ~g~d~~~~~~~~~~~~~Q~~qal~Ni~~vL~~aG~--~dVvk~~iyl~d~~~~~~~~~v~~~~f   78 (118)
T cd06156           1 YSQAIVVPKVAYISGQIGLIPATMTLLEGGITLQAVLSLQHLERVAKAMNV--QWVLAAVCYVTDESSVPIARSAWSKYC   78 (118)
T ss_pred             CCceEEECCEEEEEeeCCccCCCCccCCCCHHHHHHHHHHHHHHHHHHcCC--CCEEEEEEEEcChHHHHHHHHHHHHHh
Confidence            899999999999999999999544665 89999999999999999999999  999999999999999999999999999


Q ss_pred             CCC-------------CCccEEEEcCCCCCCCEEEEEec
Q psy1836          97 KPP-------------YPARSTFQVGKLPLIDFCLVVLG  122 (127)
Q Consensus        97 ~~~-------------~Pa~t~v~V~~L~~~~~liEi~~  122 (127)
                      +.+             +|+||+++|..|| ++++|||++
T Consensus        79 ~~~~~~~~~~~~~~~~~P~~t~v~V~~L~-~~~~VEie~  116 (118)
T cd06156          79 SELDLEDESRNESDDVNPPLVIVVVPELP-RGALVEWQG  116 (118)
T ss_pred             cCccccccccccccCCCCcEEEEEcccCC-CCCeEEEEE
Confidence            974             9999999999998 789999986


No 11 
>cd02199 YjgF_YER057c_UK114_like_1 This group of proteins belong to a large family of YjgF/YER057c/UK114-like proteins present in bacteria, archaea, and eukaryotes with no definitive function.  The conserved domain is similar in structure to chorismate mutase but there is no sequence similarity and no functional connection. Members of this family have been implicated in isoleucine (Yeo7, Ibm1, aldR) and purine (YjgF) biosynthesis, as well as threonine anaerobic degradation (tdcF) and mitochondrial DNA maintenance (Ibm1). This domain homotrimerizes forming a distinct intersubunit cavity that may serve as a small molecule binding site.
Probab=100.00  E-value=1.1e-32  Score=192.21  Aligned_cols=116  Identities=27%  Similarity=0.292  Sum_probs=101.3

Q ss_pred             cccccccCCCCcceEEEECCEEEEccccccCCC----CCccc-----ccHHHHHHHHHHHHHHHHHHcCCCcc---ceEE
Q psy1836           7 ASLVSLVGSASAFQAVQHDNTLYISGVLGIDAS----SGKMA-----EGVENQAKQALTNMGHILKEAGGSYE---HVVK   74 (127)
Q Consensus         7 ~~~~~~~~~~~ys~av~~g~~v~iSG~~~~d~~----~g~~~-----~d~~~Q~~~~~~ni~~~L~~~G~~~~---~vv~   74 (127)
                      .....+.+.++|||++++|+++|+|||+|.|+.    .|.+.     +|+++|++++|+||+++|+++|++++   ||+|
T Consensus         5 ~~~~~~~~~~~ys~av~~g~~l~vSGq~~~d~~~~~~~g~i~~~~~~~d~~~Qt~~~~~Ni~~vL~~aG~~~~~~~dVvk   84 (142)
T cd02199           5 ELPPAPAPVGNYVPAVRTGNLLYVSGQLPRVDGKLVYTGKVGADLSVEEGQEAARLCALNALAALKAALGDLDRVKRVVR   84 (142)
T ss_pred             cCCCCCCCCCccceEEEECCEEEEeCcCCCCCCCccccCccccccChHHHHHHHHHHHHHHHHHHHHhcCChhhcCCEEE
Confidence            344556778899999999999999999999983    13332     58999999999999999999999988   9999


Q ss_pred             EEEEeCCCCCHHH-------HHHHHHhhcCC-CCCccEEEEcCCCCCCCEEEEEecc
Q psy1836          75 TTILLNDINDFAT-------VNNVYGQFFKP-PYPARSTFQVGKLPLIDFCLVVLGG  123 (127)
Q Consensus        75 ~~vyl~d~~~~~~-------~~~v~~~~f~~-~~Pa~t~v~V~~L~~~~~liEi~~~  123 (127)
                      +++|++|+++|+.       ++++|.+||++ ++|+||+++|+.|+ ++++||||+-
T Consensus        85 ~~vyl~d~~~~~~~~~~~~~~~~v~~~~f~~~~~Part~v~V~~L~-~~~~VEIe~~  140 (142)
T cd02199          85 LTGFVNSAPDFTEQPKVANGASDLLVEVFGEAGRHARSAVGVASLP-LNAAVEVEAI  140 (142)
T ss_pred             EEEEEechHHhhhchhhhHHHHHHHHHHcCCCCCCceEEEEhhhCC-CCCEEEEEEE
Confidence            9999999999987       48899999986 59999999999996 4699999974


No 12 
>cd06153 YjgF_YER057c_UK114_like_5 This group of proteins belong to a large family of YjgF/YER057c/UK114-like proteins present in bacteria, archaea, and eukaryotes with no definitive function.   The conserved domain is similar in structure to chorismate mutase but there is no sequence similarity and no functional connection. Members of this family have been implicated in isoleucine (Yeo7, Ibm1, aldR) and purine (YjgF) biosynthesis, as well as threonine anaerobic degradation (tdcF) and mitochondrial DNA maintenance (Ibm1). This domain homotrimerizes forming a distinct intersubunit cavity that may serve as a small molecule binding site.
Probab=100.00  E-value=1.7e-32  Score=184.90  Aligned_cols=102  Identities=24%  Similarity=0.367  Sum_probs=95.2

Q ss_pred             cceEEEE----CCEEEEccccccCCCCCccc-ccHHHHHHHHHHHHHHHHHHcCCC-----ccceEEEEEEeCCCCCHHH
Q psy1836          18 AFQAVQH----DNTLYISGVLGIDASSGKMA-EGVENQAKQALTNMGHILKEAGGS-----YEHVVKTTILLNDINDFAT   87 (127)
Q Consensus        18 ys~av~~----g~~v~iSG~~~~d~~~g~~~-~d~~~Q~~~~~~ni~~~L~~~G~~-----~~~vv~~~vyl~d~~~~~~   87 (127)
                      ||+|+.+    |+++|+|||+|.|+ +|++. +|+++|++++|+||+.+|+++|++     ++||+|+++|++|+++++.
T Consensus         1 ~s~a~~~~~~~g~~v~vSGq~~~d~-~g~~~~~d~~~Q~~~~l~ni~~~L~~aG~~~~~~~~~dVvk~~vyl~d~~~~~~   79 (114)
T cd06153           1 FSRATLLAAGGRTHLFISGTASIVG-HGTVHPGDVEAQTRETLENIEALLEAAGRGGGAQFLADLLRLKVYLRDREDLPA   79 (114)
T ss_pred             CCCceeeccCCCcEEEEEeECcCCC-CCCCCCCCHHHHHHHHHHHHHHHHHHcCCCCCccchhheeEEEEEEccHHHHHH
Confidence            8999999    89999999999999 78775 899999999999999999999999     9999999999999999999


Q ss_pred             HHHHHHhhcCCCCCccEEEEcCCCCCCCEEEEEec
Q psy1836          88 VNNVYGQFFKPPYPARSTFQVGKLPLIDFCLVVLG  122 (127)
Q Consensus        88 ~~~v~~~~f~~~~Pa~t~v~V~~L~~~~~liEi~~  122 (127)
                      ++++|.+||++++|+ |++. ..|..|+++||||+
T Consensus        80 ~~~v~~~~f~~~~P~-t~~~-~~l~~p~~lvEIe~  112 (114)
T cd06153          80 VRAILAARLGPAVPA-VFLQ-ADVCRPDLLVEIEA  112 (114)
T ss_pred             HHHHHHHHcCCCCCE-EEEE-eeecCCCcEEEEEE
Confidence            999999999987775 7764 67889999999997


No 13 
>cd06151 YjgF_YER057c_UK114_like_3 This group of proteins belong to a large family of YjgF/YER057c/UK114-like proteins present in bacteria, archaea, and eukaryotes with no definitive function.  The conserved domain is similar in structure to chorismate mutase but there is no sequence similarity and no functional connection. Members of this family have been implicated in isoleucine (Yeo7, Ibm1, aldR) and purine (YjgF) biosynthesis, as well as threonine anaerobic degradation (tdcF) and mitochondrial DNA maintenance (Ibm1). This domain homotrimerizes forming a distinct intersubunit cavity that may serve as a small molecule binding site.
Probab=99.98  E-value=5.3e-32  Score=185.15  Aligned_cols=104  Identities=32%  Similarity=0.378  Sum_probs=95.1

Q ss_pred             cceEEEEC---CEEEEccccccCCCCCc-----c--cccHHHHHHHHHHHHHHHHHHcCCCccceEEEEEEeC-CCC---
Q psy1836          18 AFQAVQHD---NTLYISGVLGIDASSGK-----M--AEGVENQAKQALTNMGHILKEAGGSYEHVVKTTILLN-DIN---   83 (127)
Q Consensus        18 ys~av~~g---~~v~iSG~~~~d~~~g~-----~--~~d~~~Q~~~~~~ni~~~L~~~G~~~~~vv~~~vyl~-d~~---   83 (127)
                      |||++++.   ++||+|||+|.|+ +++     .  .+|+++|++++|+||+++|+++|++++||+|+++|++ |.+   
T Consensus         1 ~s~~~~v~~~~~~i~vSGq~~~~~-d~~~~~g~~~~~~d~~~Q~~~~l~ni~~~L~~aG~~~~dVvk~~vyl~~~~~~~~   79 (126)
T cd06151           1 IAQAVEVPAGAATIYLSGTVPAVV-NASAPKGSPARYGDTETQTISVLKRIETILQSQGLTMGDVVKMRVFLVADPALDG   79 (126)
T ss_pred             CCceEEeCCCceEEEEeccCCCCC-CCCCCCCcccCCCCHHHHHHHHHHHHHHHHHHcCCCHHHEEEEEEEEecCccccc
Confidence            79999994   7999999999876 433     2  2799999999999999999999999999999999998 565   


Q ss_pred             --CHHHHHHHHHhhcCC----CCCccEEEEcCCCCCCCEEEEEec
Q psy1836          84 --DFATVNNVYGQFFKP----PYPARSTFQVGKLPLIDFCLVVLG  122 (127)
Q Consensus        84 --~~~~~~~v~~~~f~~----~~Pa~t~v~V~~L~~~~~liEi~~  122 (127)
                        +++.++++|.+||++    ++|+||+++|++|+.|+++||||+
T Consensus        80 ~~~~~~~~~~~~~~f~~~~~~~~Pa~t~v~V~~L~~p~~~VEIe~  124 (126)
T cd06151          80 KMDFAGFMKAYRQFFGTAEQPNKPARSTLQVAGLVNPGWLVEIEV  124 (126)
T ss_pred             hhhHHHHHHHHHHHhccccCCCCCceEEEEeeecCCCCcEEEEEE
Confidence              899999999999998    689999999999999999999996


No 14 
>cd00448 YjgF_YER057c_UK114_family YjgF, YER057c, and UK114 belong to a large family of proteins present in bacteria, archaea, and eukaryotes with no definitive function. The conserved domain is similar in structure to chorismate mutase but there is no sequence similarity and no functional connection. Members of this family have been implicated in isoleucine (Yeo7, Ibm1, aldR) and purine (YjgF) biosynthesis, as well as threonine anaerobic degradation (tdcF) and mitochondrial DNA maintenance (Ibm1). This domain homotrimerizes forming a distinct intersubunit cavity that may serve as a small molecule binding site.
Probab=99.97  E-value=9.4e-31  Score=172.60  Aligned_cols=103  Identities=39%  Similarity=0.626  Sum_probs=98.1

Q ss_pred             cceEEEECCEEEEccccccCCCCCcc-cccHHHHHHHHHHHHHHHHHHcCCCccceEEEEEEeCCCCCHHHHHHHHHhhc
Q psy1836          18 AFQAVQHDNTLYISGVLGIDASSGKM-AEGVENQAKQALTNMGHILKEAGGSYEHVVKTTILLNDINDFATVNNVYGQFF   96 (127)
Q Consensus        18 ys~av~~g~~v~iSG~~~~d~~~g~~-~~d~~~Q~~~~~~ni~~~L~~~G~~~~~vv~~~vyl~d~~~~~~~~~v~~~~f   96 (127)
                      |||++++|+++|+|||+|.|+ ++.. ++|+++|++++|+||+++|+++|++++||+++++|++|+++++.++++|.+||
T Consensus         1 ys~~~~~~~~~~~sGq~~~~~-~~~~~~~~~~~Q~~~~~~ni~~~L~~~g~~~~~iv~~~~yv~~~~~~~~~~~~~~~~~   79 (107)
T cd00448           1 YSQAVRVGNLVFVSGQIPLDP-DGELVPGDIEAQTRQALENLEAVLEAAGGSLDDVVKVTVYLTDMADFAAVNEVYDEFF   79 (107)
T ss_pred             CCCeEEECCEEEEeccCCcCC-CCcccCCCHHHHHHHHHHHHHHHHHHcCCCHHHEEEEEEEEecHHHHHHHHHHHHHHh
Confidence            899999999999999999999 5544 49999999999999999999999999999999999999999999999999999


Q ss_pred             CC-CCCccEEEEcCCCCCCCEEEEEec
Q psy1836          97 KP-PYPARSTFQVGKLPLIDFCLVVLG  122 (127)
Q Consensus        97 ~~-~~Pa~t~v~V~~L~~~~~liEi~~  122 (127)
                      ++ ++|+||+++|..| .|+++||||+
T Consensus        80 ~~~~~Pa~t~v~v~~l-~~~~~VEie~  105 (107)
T cd00448          80 GEGPPPARTAVGVAAL-PPGALVEIEA  105 (107)
T ss_pred             CCCCCCceEEEEeccC-CCCCEEEEEE
Confidence            98 7999999999999 8999999986


No 15 
>cd06155 eu_AANH_C_1 A group of hypothetical eukaryotic proteins, characterized by the presence of an adenine nucleotide alpha hydrolase (AANH)-like domain located N-terminal to two distinctly different YjgF-YER057c-UK114-like domains. This CD contains the first of these domains. The YjgF-YER057c-UK114 protein family is a large family of proteins present in bacteria, archaea, and eukaryotes with no definitive function.  The conserved domain is similar in structure to chorismate mutase but there is no sequence similarity and no functional connection. Members of this family have been implicated in isoleucine (Yeo7, Ibm1, aldR) and purine (YjgF) biosynthesis, as well as threonine anaerobic degradation (tdcF) and mitochondrial DNA maintenance (Ibm1). This domain homotrimerizes forming a distinct intersubunit cavity that may serve as a small molecule binding site.
Probab=99.97  E-value=6.3e-30  Score=168.80  Aligned_cols=94  Identities=32%  Similarity=0.517  Sum_probs=87.7

Q ss_pred             EEECCEEEEccccccCCCCCcccccHHHHHHHHHHHHHHHHHHcCCCccceEEEEEEeCCCCCHHHHHHHHHhhcCCC-C
Q psy1836          22 VQHDNTLYISGVLGIDASSGKMAEGVENQAKQALTNMGHILKEAGGSYEHVVKTTILLNDINDFATVNNVYGQFFKPP-Y  100 (127)
Q Consensus        22 v~~g~~v~iSG~~~~d~~~g~~~~d~~~Q~~~~~~ni~~~L~~~G~~~~~vv~~~vyl~d~~~~~~~~~v~~~~f~~~-~  100 (127)
                      .++|+++|+|||+|.|+     .+|+++|++++|+||+++|+++|++++||+|+++|++|+++|+.+|++|.+||+++ +
T Consensus         4 ~~~g~~v~vSG~~~~~~-----~~d~~~Q~~~v~~ni~~~L~~aG~~~~dVv~~~iyl~d~~~~~~~n~~~~~~f~~~~~   78 (101)
T cd06155           4 NRTGGLLWISNVTASES-----DETVEEQMESIFSKLREILQSNGLSLSDILYVTLYLRDMSDFAEVNSVYGTFFDKPNP   78 (101)
T ss_pred             EEECCEEEEecCCCCCC-----CCCHHHHHHHHHHHHHHHHHHcCCCHHHEEEEEEEECCHHHHHHHHHHHHHHcCCCCC
Confidence            46799999999999987     26999999999999999999999999999999999999999999999999999964 8


Q ss_pred             CccEEEEcCCCCCCCEEEEEec
Q psy1836         101 PARSTFQVGKLPLIDFCLVVLG  122 (127)
Q Consensus       101 Pa~t~v~V~~L~~~~~liEi~~  122 (127)
                      |+|+++++ .|+ |+++|||++
T Consensus        79 Par~~v~~-~l~-~~~lvEIe~   98 (101)
T cd06155          79 PSRVCVEC-GLP-EGCDVQLSC   98 (101)
T ss_pred             CceEEEEe-ccC-CCCEEEEEE
Confidence            99999987 677 899999986


No 16 
>KOG2317|consensus
Probab=99.95  E-value=4.4e-28  Score=166.64  Aligned_cols=117  Identities=41%  Similarity=0.558  Sum_probs=110.3

Q ss_pred             ccccccccCCCCcceEEEECCEEEEccccccCCCCCccc-ccHHHHHHHHHHHHHHHHHHcCCCccceEEEEEEeCCCCC
Q psy1836           6 HASLVSLVGSASAFQAVQHDNTLYISGVLGIDASSGKMA-EGVENQAKQALTNMGHILKEAGGSYEHVVKTTILLNDIND   84 (127)
Q Consensus         6 ~~~~~~~~~~~~ys~av~~g~~v~iSG~~~~d~~~g~~~-~d~~~Q~~~~~~ni~~~L~~~G~~~~~vv~~~vyl~d~~~   84 (127)
                      -.++|.+.+.+||||++++++++|+|||+|.+|.++++. +++.+|++++++|++.+|+++|++.+.+|+.++|+.|+.+
T Consensus        10 v~S~~Ap~~igPYsQa~~~~~~~~~SGqigl~P~s~~~~~gg~~~q~~q~l~n~~~il~~a~a~~~~~V~~~i~l~d~~~   89 (138)
T KOG2317|consen   10 VISYWAPANIGPYSQATKANDVVFISGQIGLDPPSMKLVEGGIVDQTEQALLNLEEILKAAGASLDLVVKVTIFLADIID   89 (138)
T ss_pred             EeeccCCCCcCChhHheeeCCEEEEeccccccCCCCCEeccchHHHHHHHHHHHHHHHHHhccCccccEEEEEEEecchh
Confidence            367899999999999999999999999999999888888 8899999999999999999999999999999999999999


Q ss_pred             HHHHHHHHHhhcCCCCCccEEEEcCCCCCCCEEEEEecc
Q psy1836          85 FATVNNVYGQFFKPPYPARSTFQVGKLPLIDFCLVVLGG  123 (127)
Q Consensus        85 ~~~~~~v~~~~f~~~~Pa~t~v~V~~L~~~~~liEi~~~  123 (127)
                      |..+|++|.+||.++.|+|+++.|..||.+ ..||+|..
T Consensus        90 f~~vn~v~~k~~~~~~pars~~~v~alp~~-~~ie~~~i  127 (138)
T KOG2317|consen   90 FAAVNKVYAKYFPKPNPARSCVQVAALPLN-GKIEIECI  127 (138)
T ss_pred             HHHHHHHHHHHcCCCCcchhhHHHhhcCCC-CceEEeee
Confidence            999999999999999999999999999999 56787764


No 17 
>PF14588 YjgF_endoribonc:  YjgF/chorismate_mutase-like, putative endoribonuclease; PDB: 2OTM_B 3D01_D.
Probab=99.58  E-value=1.5e-14  Score=100.94  Aligned_cols=111  Identities=29%  Similarity=0.293  Sum_probs=84.1

Q ss_pred             ccCCCCcceEEEECCEEEEccccccCCC----CCccc-----ccHHHHHHHHHHHHHHHHHHcCCCccc---eEEEEEEe
Q psy1836          12 LVGSASAFQAVQHDNTLYISGVLGIDAS----SGKMA-----EGVENQAKQALTNMGHILKEAGGSYEH---VVKTTILL   79 (127)
Q Consensus        12 ~~~~~~ys~av~~g~~v~iSG~~~~d~~----~g~~~-----~d~~~Q~~~~~~ni~~~L~~~G~~~~~---vv~~~vyl   79 (127)
                      +.|.+.|.+++++|+++|+|||++.+..    .|++.     ++-++-+|.+.-|+-+.|+++=++++.   |+|++.|+
T Consensus        15 ~~p~g~Y~p~~~~G~ll~vSGq~p~~~g~~~~~G~vG~~~s~e~g~~AAr~~~Ln~La~lk~~~G~LdrV~~ivkl~g~V   94 (148)
T PF14588_consen   15 PAPVGNYVPAVRVGNLLYVSGQLPRDDGKLLYTGKVGEDLSVEEGYEAARLCALNALAALKAALGDLDRVKRIVKLTGFV   94 (148)
T ss_dssp             ---SSSC-SEEEETTEEEEEEE--EETTEE-SBS-BTTTB-HHHHHHHHHHHHHHHHHHHHHHCTSGGGECEEEEEEEEE
T ss_pred             CCCCceeeeEEEECCEEEEeccCcccCCEEeeecCCCCCCCHHHHHHHHHHHHHHHHHHHHHHhCCHhHEeEEEEEEEEE
Confidence            3467889999999999999999998752    23333     567788899999999999998778876   59999999


Q ss_pred             CCCCCH-------HHHHHHHHhhcCCC-CCccEEEEcCCCCCCCEEEEEecc
Q psy1836          80 NDINDF-------ATVNNVYGQFFKPP-YPARSTFQVGKLPLIDFCLVVLGG  123 (127)
Q Consensus        80 ~d~~~~-------~~~~~v~~~~f~~~-~Pa~t~v~V~~L~~~~~liEi~~~  123 (127)
                      ....+|       +..++++.+.||+. +++|+.+||..||..-. ||||..
T Consensus        95 ~s~~~F~~~p~V~ngaSdll~~vfGe~G~HaRsAvGv~sLP~~a~-VEie~i  145 (148)
T PF14588_consen   95 NSTPDFTEHPAVANGASDLLVEVFGEAGRHARSAVGVASLPLNAP-VEIELI  145 (148)
T ss_dssp             EB-TT---HHHHHHHHHHHHHHHHGGGG-BEEEEEEESC-GGGBS-EEEEEE
T ss_pred             ecCCCcccCchhhhhHHHHHHHHhCcCCCCcccccccccCCCCCe-EEEEEE
Confidence            988887       56788899999976 99999999999998764 677754


No 18 
>TIGR01884 cas_HTH CRISPR locus-related DNA-binding protein. Most but not all examples of this family are associated with CRISPR loci, a combination of DNA repeats and characteristic proteins encoded near the repeat cluster. The C-terminal region of this protein is homologous to DNA-binding helix-turn-helix domains with predicted transcriptional regulatory activity.
Probab=66.00  E-value=28  Score=25.16  Aligned_cols=48  Identities=21%  Similarity=0.415  Sum_probs=33.1

Q ss_pred             HHHHHHHHHHHHHHHHHHcCCCccceEEEEEEeCCCCCHHH-HHHHHHhhcC
Q psy1836          47 VENQAKQALTNMGHILKEAGGSYEHVVKTTILLNDINDFAT-VNNVYGQFFK   97 (127)
Q Consensus        47 ~~~Q~~~~~~ni~~~L~~~G~~~~~vv~~~vyl~d~~~~~~-~~~v~~~~f~   97 (127)
                      .+++.+++.++++++++..+..   .+++.++-.|..++.. +.++..-..+
T Consensus        36 ~~~~~~~a~~~l~~~~~~~~~~---~~~~~~~~vd~~d~~~~~~~v~~~i~~   84 (203)
T TIGR01884        36 IEDGARRAVESLRAIISDLGGN---LVEGTIKEIELKDVPSILRQMSDIIKE   84 (203)
T ss_pred             CchHHHHHHHHHHHHHHHhccC---CCcceEEEEecCCHHHHHHHHHHHHHh
Confidence            3578999999999999998743   3556677777777654 4444444443


No 19 
>PF02662 FlpD:  Methyl-viologen-reducing hydrogenase, delta subunit;  InterPro: IPR003813 Methyl-viologen-reducing hydrogenase (MVH) is one of the enzymes involved in methanogenesis and coded in the mth-flp-mvh-mrt cluster of methane genes in Methanothermobacter thermautotrophicus (Methanobacterium thermoformicicum) []. No specific functions have been assigned to the delta subunit.; GO: 0015948 methanogenesis, 0055114 oxidation-reduction process
Probab=57.69  E-value=55  Score=21.96  Aligned_cols=58  Identities=19%  Similarity=0.177  Sum_probs=40.8

Q ss_pred             CEEEEccccccCCCCCcccccHHHHHHHHHHHHHHHHHHcCCCccceEEEEEEeCCCCCHHH
Q psy1836          26 NTLYISGVLGIDASSGKMAEGVENQAKQALTNMGHILKEAGGSYEHVVKTTILLNDINDFAT   87 (127)
Q Consensus        26 ~~v~iSG~~~~d~~~g~~~~d~~~Q~~~~~~ni~~~L~~~G~~~~~vv~~~vyl~d~~~~~~   87 (127)
                      +=|++.|..+-|-..    ..-...++.-++.++.+|++.|...+.|--..++..+.+.|..
T Consensus        54 DGV~V~gC~~g~Ch~----~~Gn~~a~~Rv~~~k~~L~~~Gi~~eRv~~~~~~~~~~~~fa~  111 (124)
T PF02662_consen   54 DGVLVAGCHPGDCHY----REGNYRAEKRVERLKKLLEELGIEPERVRLYWISAPEGKRFAE  111 (124)
T ss_pred             CEEEEeCCCCCCCCc----chhhHHHHHHHHHHHHHHHHcCCChhHeEEEEeCcccHHHHHH
Confidence            667777766444211    2445678888999999999999999987666666666555543


No 20 
>KOG2240|consensus
Probab=56.86  E-value=20  Score=25.54  Aligned_cols=77  Identities=17%  Similarity=0.160  Sum_probs=47.2

Q ss_pred             CcceEEEECCEEEEccccccCCCCCccc---ccHHHHHHHHHHHHHHHHHHcCCCccceEEEEEEeCCCCCHHHHHHHHH
Q psy1836          17 SAFQAVQHDNTLYISGVLGIDASSGKMA---EGVENQAKQALTNMGHILKEAGGSYEHVVKTTILLNDINDFATVNNVYG   93 (127)
Q Consensus        17 ~ys~av~~g~~v~iSG~~~~d~~~g~~~---~d~~~Q~~~~~~ni~~~L~~~G~~~~~vv~~~vyl~d~~~~~~~~~v~~   93 (127)
                      +--|+...++++-|+|+...++.+.-++   .++-.|.-..++.+...|........+-. .++|.. .++++.+.+-+.
T Consensus        77 ~~vqasl~~nTf~ItG~~~~k~l~EmlPsilsqLg~~sl~~Lr~La~~l~~~~~~~~~~~-~~~~~~-dd~~p~Lvenf~  154 (162)
T KOG2240|consen   77 PKVQASLAANTFTITGHAETKQLTEMLPSILSQLGPDSLTALRRLAEQLPKQDGKEGADA-ATIEDD-DDDVPELVENFD  154 (162)
T ss_pred             ccccccccCCeEEEecCCcccchhhhcchhhhhcChhhHHHHHHHHHhcccccccCCCCc-cchhhc-ccchHHHHHhHH
Confidence            4556666799999999999988444444   34444555666666666664444333333 566665 456666665554


Q ss_pred             hh
Q psy1836          94 QF   95 (127)
Q Consensus        94 ~~   95 (127)
                      ++
T Consensus       155 ~~  156 (162)
T KOG2240|consen  155 EA  156 (162)
T ss_pred             HH
Confidence            44


No 21 
>KOG0100|consensus
Probab=53.74  E-value=41  Score=28.01  Aligned_cols=52  Identities=13%  Similarity=0.320  Sum_probs=40.9

Q ss_pred             HHHHHHHHHHHHHHcCCCccceEEEEEEeCCCCCHHHHHHHHHhhcCCCCCcc
Q psy1836          51 AKQALTNMGHILKEAGGSYEHVVKTTILLNDINDFATVNNVYGQFFKPPYPAR  103 (127)
Q Consensus        51 ~~~~~~ni~~~L~~~G~~~~~vv~~~vyl~d~~~~~~~~~v~~~~f~~~~Pa~  103 (127)
                      .|..++-++.+|+.+|+..+||- -.+.+-.-...+.+.++..+||.+..|.+
T Consensus       342 Fr~TlkPv~kvl~Ds~lkKsdid-eiVLVGGsTrIPKvQqllk~fF~GKepsk  393 (663)
T KOG0100|consen  342 FRKTLKPVQKVLEDSDLKKSDID-EIVLVGGSTRIPKVQQLLKDFFNGKEPSK  393 (663)
T ss_pred             HHHhhHHHHHHHhhcCcccccCc-eEEEecCcccChhHHHHHHHHhCCCCccC
Confidence            56677789999999999888874 34556566678999999999997665544


No 22 
>PRK00394 transcription factor; Reviewed
Probab=51.84  E-value=28  Score=25.09  Aligned_cols=38  Identities=18%  Similarity=0.126  Sum_probs=28.5

Q ss_pred             EEEEccccccCCCCCcccccHHHHHHHHHHHHHHHHHHcCCCcc
Q psy1836          27 TLYISGVLGIDASSGKMAEGVENQAKQALTNMGHILKEAGGSYE   70 (127)
Q Consensus        27 ~v~iSG~~~~d~~~g~~~~d~~~Q~~~~~~ni~~~L~~~G~~~~   70 (127)
                      +||-||-+-...      ..-+++++.+++.+..+|+..|....
T Consensus        50 lIf~sGKiv~tG------a~S~~~a~~a~~~~~~~l~~~g~~~~   87 (179)
T PRK00394         50 LIFRSGKVVCTG------AKSVEDLHEAVKIIIKKLKELGIKVI   87 (179)
T ss_pred             EEEcCCcEEEEc------cCCHHHHHHHHHHHHHHHHHcCCCcc
Confidence            677777654422      23457999999999999999998764


No 23 
>cd04918 ACT_AK1-AT_2 ACT domains located C-terminal to the catalytic domain of a monofunctional, lysine-sensitive, plant aspartate kinase 1 (AK1). This CD includes the second of two ACT domains located C-terminal to the catalytic domain of a monofunctional, lysine-sensitive, plant aspartate kinase 1 (AK1), which can be synergistically inhibited by S-adenosylmethionine (SAM). This isoenzyme is found in higher plants, Arabidopsis thaliana (AT) and Zea mays, and also in Chlorophyta. In its inactive state, Arabidopsis AK1 binds the effectors lysine and SAM (two molecules each) at the interface of two ACT1 domain subunits. The second ACT domain (ACT2), this CD, does not interact with an effector. Members of this CD belong to the superfamily of ACT regulatory domains.
Probab=49.99  E-value=47  Score=19.22  Aligned_cols=44  Identities=20%  Similarity=0.333  Sum_probs=29.0

Q ss_pred             HHHHHHHHHHHcCCCccceE------EEEEEeCCCCCHHHHHHHHHhhcC
Q psy1836          54 ALTNMGHILKEAGGSYEHVV------KTTILLNDINDFATVNNVYGQFFK   97 (127)
Q Consensus        54 ~~~ni~~~L~~~G~~~~~vv------~~~vyl~d~~~~~~~~~v~~~~f~   97 (127)
                      ++.++-..|++.|....-+.      ++.+.+.+-+.-..+..++++||.
T Consensus        16 ~~~~i~~aL~~~~I~v~~i~~g~s~~sis~~v~~~~~~~av~~Lh~~f~~   65 (65)
T cd04918          16 ILERAFHVLYTKGVNVQMISQGASKVNISLIVNDSEAEGCVQALHKSFFE   65 (65)
T ss_pred             HHHHHHHHHHHCCCCEEEEEecCccceEEEEEeHHHHHHHHHHHHHHHhC
Confidence            56677788888887774443      555555444444577788888763


No 24 
>PHA02094 hypothetical protein
Probab=46.82  E-value=48  Score=20.24  Aligned_cols=52  Identities=10%  Similarity=0.049  Sum_probs=33.5

Q ss_pred             cceEEEEEEeCCCC-----CHHHHHHHHHhhcCCC-CCccEEEEcCCCCCCCEEEEEe
Q psy1836          70 EHVVKTTILLNDIN-----DFATVNNVYGQFFKPP-YPARSTFQVGKLPLIDFCLVVL  121 (127)
Q Consensus        70 ~~vv~~~vyl~d~~-----~~~~~~~v~~~~f~~~-~Pa~t~v~V~~L~~~~~liEi~  121 (127)
                      .+=+.-++++....     .++.+.+-..+..|.. |.|.|+|.-..-.+.+++|||.
T Consensus        23 ~n~l~~t~~l~~~~~k~n~~ye~~~ksig~m~g~~hptaktwvakpnpfrdgvlveir   80 (81)
T PHA02094         23 NNKLSNTIMLPSTRKKLNEKYEFFAKSVGAMIGELHPTAKTWVAKPNPFRDGVLVEIR   80 (81)
T ss_pred             HHhccceeeeHHHHHHHHHHHHHHHHHHHHHhcccCccccccccCCCCCccceEEeec
Confidence            34445567776542     2444555555666654 7788888766667899999984


No 25 
>PF07066 DUF3882:  Lactococcus phage M3 protein;  InterPro: IPR009773 This family consists of several Lactococcus bacteriophage 712, middle-3 (M3) proteins of around 160 residues in length. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. The function of this family is unknown.
Probab=45.28  E-value=1.1e+02  Score=21.68  Aligned_cols=49  Identities=14%  Similarity=0.171  Sum_probs=38.4

Q ss_pred             eEEEECCEEEEccccccCCCCCcccccHHHHHHHHHHHHHHHHHHcCCCccceE
Q psy1836          20 QAVQHDNTLYISGVLGIDASSGKMAEGVENQAKQALTNMGHILKEAGGSYEHVV   73 (127)
Q Consensus        20 ~av~~g~~v~iSG~~~~d~~~g~~~~d~~~Q~~~~~~ni~~~L~~~G~~~~~vv   73 (127)
                      -|++-|+-++++|...-.+.     .++-|-++.++..|+.++++...-..+|+
T Consensus        21 ~A~~~~~~~~~~si~~~~k~-----Ks~~ER~k~ias~Lk~ii~~~d~~~y~i~   69 (159)
T PF07066_consen   21 WAFFKGSDLVVGSIKAKHKS-----KSFFERAKSIASELKTIIQKYDLKFYIIV   69 (159)
T ss_pred             eEEecCCeEEEeeeeecCcc-----cCHHHHHHHHHHHHHHHHHHhCCCcceEE
Confidence            35566888888887565553     68999999999999999999987666654


No 26 
>PF08868 YugN:  YugN-like family;  InterPro: IPR014967 This entry contains proteins related to Bacillus subtilis YugN, they are functionally uncharacterised. ; PDB: 2R5X_A 2PWW_A.
Probab=44.80  E-value=26  Score=24.13  Aligned_cols=30  Identities=17%  Similarity=0.240  Sum_probs=20.5

Q ss_pred             ccccCCCCCcccccHHHHHHHHHHHHHHHHH
Q psy1836          33 VLGIDASSGKMAEGVENQAKQALTNMGHILK   63 (127)
Q Consensus        33 ~~~~d~~~g~~~~d~~~Q~~~~~~ni~~~L~   63 (127)
                      |-|.|+ +..++....++.+..++.|+..|+
T Consensus       103 ~eP~d~-D~~~p~~~v~~~~~~l~ele~~L~  132 (132)
T PF08868_consen  103 QEPVDK-DAEFPEKWVDKGEELLKELEDELK  132 (132)
T ss_dssp             -----T--B-EEHHHHHHHHHHHHHHHHHHH
T ss_pred             ccccCC-CCcCCHHHHHHHHHHHHHHHHhhC
Confidence            356677 788888899999999999998885


No 27 
>cd04518 TBP_archaea archaeal TATA box binding protein (TBP): TBPs are transcription factors present in archaea and eukaryotes, that recognize promoters and initiate transcription. TBP has been shown to be an essential component of three different transcription initiation complexes: SL1, TFIID and TFIIIB, directing transcription by RNA polymerases I, II and III, respectively. TBP binds directly to the TATA box promoter element, where it nucleates polymerase assembly, thus defining the transcription start site. TBP's binding in the minor groove induces a dramatic DNA bending while its own structure barely changes. The conserved core domain of TBP, which binds to the TATA box, has a bipartite structure, with intramolecular symmetry generating a saddle-shaped structure that sits astride the DNA.
Probab=43.30  E-value=41  Score=24.09  Aligned_cols=37  Identities=16%  Similarity=0.177  Sum_probs=27.6

Q ss_pred             EEEEccccccCCCCCcccccHHHHHHHHHHHHHHHHHHcCCCc
Q psy1836          27 TLYISGVLGIDASSGKMAEGVENQAKQALTNMGHILKEAGGSY   69 (127)
Q Consensus        27 ~v~iSG~~~~d~~~g~~~~d~~~Q~~~~~~ni~~~L~~~G~~~   69 (127)
                      +||-||-+-...      ..-+++++.+++.+..+|+..|...
T Consensus        51 lIF~SGKiv~tG------aks~~~a~~a~~~~~~~L~~~g~~~   87 (174)
T cd04518          51 LIFRSGKMVCTG------AKSVEDLHRAVKEIIKKLKDYGIKV   87 (174)
T ss_pred             EEECCCeEEEEc------cCCHHHHHHHHHHHHHHHHhcCCCc
Confidence            677777655432      2345789999999999999999764


No 28 
>COG1908 FrhD Coenzyme F420-reducing hydrogenase, delta subunit [Energy production and conversion]
Probab=43.12  E-value=1e+02  Score=21.12  Aligned_cols=44  Identities=20%  Similarity=0.183  Sum_probs=30.2

Q ss_pred             HHHHHHHHHHHHHHHcCCCccceEEEEEEeCCCCCHHHHHHHHHhhc
Q psy1836          50 QAKQALTNMGHILKEAGGSYEHVVKTTILLNDINDFATVNNVYGQFF   96 (127)
Q Consensus        50 Q~~~~~~ni~~~L~~~G~~~~~vv~~~vyl~d~~~~~~~~~v~~~~f   96 (127)
                      -+..=+++++.+|++.|...+.|--+.   .+.++++.+.+..++|.
T Consensus        75 ka~rR~~~lke~l~elgie~eRv~~~w---iSa~E~ekf~e~~~efv  118 (132)
T COG1908          75 KAKRRMELLKELLKELGIEPERVRVLW---ISAAEGEKFAETINEFV  118 (132)
T ss_pred             HHHHHHHHHHHHHHHhCCCcceEEEEE---EehhhHHHHHHHHHHHH
Confidence            345567899999999999999664444   45555655555555553


No 29 
>PRK13689 hypothetical protein; Provisional
Probab=42.71  E-value=40  Score=20.91  Aligned_cols=23  Identities=17%  Similarity=0.313  Sum_probs=20.1

Q ss_pred             HHHHHHHHHHHHHHHHHcCCCcc
Q psy1836          48 ENQAKQALTNMGHILKEAGGSYE   70 (127)
Q Consensus        48 ~~Q~~~~~~ni~~~L~~~G~~~~   70 (127)
                      .+|+++++..+-.+|+.+..+.+
T Consensus         9 d~qvE~il~el~~VLeKH~Ap~D   31 (75)
T PRK13689          9 DEQVEQLLAELLAVLEKHKAPTD   31 (75)
T ss_pred             HHHHHHHHHHHHHHHHhcCCCcc
Confidence            47999999999999999987654


No 30 
>PF07736 CM_1:  Chorismate mutase type I;  InterPro: IPR008243 Chorismate mutase (CM; 5.4.99.5 from EC) catalyses the reaction at the branch point of the biosynthetic pathway leading to the three aromatic amino acids, phenylalanine, tryptophan and tyrosine (chorismic acid is the last common intermediate, and CM leads to the L-phenylalanine/L-tyrosine branch). It is part of the shikimate pathway, which is present only in bacteria, fungi and plants.  This entry represents a family of monofunctional (non-fused) chorismate mutases from Gram-positive bacteria (Firmicutes) and cyanobacteria. Trusted members of the family are found in operons with other enzymes of the chorismate pathways, both up- and downstream of CM (Listeria, Bacillus, Oceanobacillus) or are the sole CM in the genome where the other members of the chorismate pathways are found elsewhere in the genome (Nostoc, Thermosynechococcus). They are monofunctional, homotrimeric, nonallosteric enzymes and are not regulated by the end-product aromatic amino acids. The three types of CM are AroQ class, Prokaryotic type (e.g., IPR008239 from INTERPRO amongst others); AroQ class, Eukaryotic type (IPR008238 from INTERPRO); and AroH class. They fall into two structural folds (AroQ class and AroH class) which are completely unrelated []. The two types of the AroQ structural class (the Escherichia coli CM dimer and the yeast CM monomer) can be structurally superimposed, and the topology of the four-helix bundle forming the active site is conserved []. For additional information please see [, , , , , , ].; PDB: 2CHS_K 2CHT_L 1COM_J 1FNJ_A 1FNK_A 1DBF_C 1UI9_A 1ODE_A 1UFY_A 1XHO_C ....
Probab=42.24  E-value=1.1e+02  Score=20.74  Aligned_cols=41  Identities=20%  Similarity=0.247  Sum_probs=24.9

Q ss_pred             HHHHHHHHHHHHHHHHHHcCCCccceEEEEEEeC-CCC-CHHH
Q psy1836          47 VENQAKQALTNMGHILKEAGGSYEHVVKTTILLN-DIN-DFAT   87 (127)
Q Consensus        47 ~~~Q~~~~~~ni~~~L~~~G~~~~~vv~~~vyl~-d~~-~~~~   87 (127)
                      -++=.+.+-+=+..++++.+.+.+||+++..-+| |+. .||+
T Consensus        15 ~e~I~~at~eLl~~i~~~N~l~~~dIvSi~FT~T~DL~a~fPA   57 (118)
T PF07736_consen   15 PEEILEATRELLEEILERNELSPEDIVSIIFTVTPDLDAAFPA   57 (118)
T ss_dssp             HHHHHHHHHHHHHHHHHHTT--GGGEEEEEEEE-TT--SS-TC
T ss_pred             HHHHHHHHHHHHHHHHHHcCCCHHHEEEEEEEeCCCcCccChH
Confidence            3333344555667889999999999999988888 443 4543


No 31 
>PF04088 Peroxin-13_N:  Peroxin 13, N-terminal region;  InterPro: IPR007223 Peroxin-13 is a component of the peroxisomal translocation machinery with Peroxin-14 and Peroxin-17. Both termini of Peroxin-13 are oriented to the cytosol. It is required for peroxisomal association of peroxin-14 []. The proteins also contain an SH3 domain (IPR001452 from INTERPRO).; GO: 0016560 protein import into peroxisome matrix, docking, 0005777 peroxisome, 0016021 integral to membrane
Probab=42.23  E-value=22  Score=25.25  Aligned_cols=27  Identities=11%  Similarity=0.319  Sum_probs=22.9

Q ss_pred             CCcccccHHHHHHHHHHHHHHHHHHcC
Q psy1836          40 SGKMAEGVENQAKQALTNMGHILKEAG   66 (127)
Q Consensus        40 ~g~~~~d~~~Q~~~~~~ni~~~L~~~G   66 (127)
                      .+.+...+++-++.+|+-||.++.+.|
T Consensus        13 ~~~~~q~~eestr~tFq~IESIV~Afg   39 (158)
T PF04088_consen   13 PNSFTQQAEESTRATFQSIESIVGAFG   39 (158)
T ss_pred             ccHHHHHHHHhhHHHHHHHHHHHHHHH
Confidence            444557889999999999999998877


No 32 
>PRK14643 hypothetical protein; Provisional
Probab=41.69  E-value=1.3e+02  Score=21.40  Aligned_cols=65  Identities=11%  Similarity=0.091  Sum_probs=47.8

Q ss_pred             cHHHHHHHHHHHHHHHHHHcCCCccceEE--------EEEEeCC---------CCCHHHHHHHHHhhcCC--CCCccEEE
Q psy1836          46 GVENQAKQALTNMGHILKEAGGSYEHVVK--------TTILLND---------INDFATVNNVYGQFFKP--PYPARSTF  106 (127)
Q Consensus        46 d~~~Q~~~~~~ni~~~L~~~G~~~~~vv~--------~~vyl~d---------~~~~~~~~~v~~~~f~~--~~Pa~t~v  106 (127)
                      |++....++.+.++.++++.|..+-+|--        +++|+.+         ++|...+++...+.+..  +.|..=.+
T Consensus         3 ~~~~~~~~l~~l~~p~~~~~G~eL~die~~~~~~~~~lrV~Id~~~~~~ggvtldDC~~vSr~is~~LD~~d~i~~~Y~L   82 (164)
T PRK14643          3 DFESIKFQINELVNKELEVLNLKVYEINNLKEFENDMIQILVEDILQANKPLDFDILIKANDLVSNKIDQFIKTSEKYLL   82 (164)
T ss_pred             cHHHHHHHHHHHHHHHHHhcCCEEEEEEEEecCCCcEEEEEEecCCCcCCCcCHHHHHHHHHHHHHHhCccCCCCCCeEE
Confidence            67777778888999999999988777631        6888843         23678888888888873  35566666


Q ss_pred             EcCC
Q psy1836         107 QVGK  110 (127)
Q Consensus       107 ~V~~  110 (127)
                      .|++
T Consensus        83 EVSS   86 (164)
T PRK14643         83 EISS   86 (164)
T ss_pred             EecC
Confidence            7763


No 33 
>TIGR01796 CM_mono_aroH monofunctional chorismate mutase, gram positive type, clade 1. This model represents a family of monofunctional (non-fused) chorismate mutases from gram positive bacteria (Firmicutes) and cyanobacteria. Trusted members of the family are found in operons with other enzymes of the chorismate pathways, both up- and downstream of CM (Listeria, Bacillus, Oceanobacillus) or are the sole CM in the genome where the other members of the chorismate pathways are found elsewhere in the genome (Nostoc, Thermosynechococcus).
Probab=40.82  E-value=1.1e+02  Score=20.63  Aligned_cols=37  Identities=8%  Similarity=0.261  Sum_probs=27.7

Q ss_pred             HHHHHHHHHHHHHcCCCccceEEEEEEeCC-CC-CHHHH
Q psy1836          52 KQALTNMGHILKEAGGSYEHVVKTTILLND-IN-DFATV   88 (127)
Q Consensus        52 ~~~~~ni~~~L~~~G~~~~~vv~~~vyl~d-~~-~~~~~   88 (127)
                      ..+-+=++.++++.+.+.+||+++..-+|. +. .|++.
T Consensus        20 ~at~eLl~~ii~~N~l~~edivSv~FT~T~DL~a~FPA~   58 (117)
T TIGR01796        20 EAVAELLTELMERNELTPEDLISVIFTVTEDLHADFPAA   58 (117)
T ss_pred             HHHHHHHHHHHHHcCCCHHHEEEEEEEecCcccccChHH
Confidence            334445678899999999999999998884 43 46653


No 34 
>cd04937 ACT_AKi-DapG-BS_2 ACT domains of the diaminopimelate-sensitive aspartokinase (AK) isoenzyme AKI. This CD includes the C-terminal of the two ACT domains of the diaminopimelate-sensitive aspartokinase (AK) isoenzyme AKI, a monofunctional class enzyme found in Bacilli (Bacillus subtilis (BS) strain 168), Clostridia, and Actinobacteria bacterial species. In B. subtilis, the regulation of the diaminopimelate-lysine biosynthetic pathway involves dual control by diaminopimelate and lysine, effected through separate diaminopimelate- and lysine-sensitive AK isoenzymes. AKI activity is invariant during the exponential and stationary phases of growth and is not altered by addition of amino acids to the growth medium. The role of this isoenzyme is most likely to provide a constant level of aspartyl-beta-phosphate for the biosynthesis of diaminopimelate for peptidoglycan synthesis and dipicolinate during sporulation. The BS AKI is tetrameric consisting of two alpha and two beta subunits; th
Probab=39.30  E-value=75  Score=18.15  Aligned_cols=45  Identities=13%  Similarity=0.119  Sum_probs=27.6

Q ss_pred             HHHHHHHHHHHHHcCCCccceE----EEEEEeCCCCCHHHHHHHHHhhc
Q psy1836          52 KQALTNMGHILKEAGGSYEHVV----KTTILLNDINDFATVNNVYGQFF   96 (127)
Q Consensus        52 ~~~~~ni~~~L~~~G~~~~~vv----~~~vyl~d~~~~~~~~~v~~~~f   96 (127)
                      ..++.++-..|++.|.....+.    ++.+.+.+-+.-..++.++++|+
T Consensus        15 ~gi~~~if~aL~~~~I~v~~~~~Se~~is~~v~~~~~~~av~~Lh~~f~   63 (64)
T cd04937          15 PGVMAKIVGALSKEGIEILQTADSHTTISCLVSEDDVKEAVNALHEAFE   63 (64)
T ss_pred             cCHHHHHHHHHHHCCCCEEEEEcCccEEEEEEcHHHHHHHHHHHHHHhc
Confidence            3456677778888888776554    44444444333456677777764


No 35 
>PF00352 TBP:  Transcription factor TFIID (or TATA-binding protein, TBP);  InterPro: IPR000814 The TATA-box binding protein (TBP) is required for the initiation of transcription by RNA polymerases I, II and III, from promoters with or without a TATA box [, ]. TBP associates with a host of factors, including the general transcription factors TFIIA, -B, -D, -E, and -H, to form huge multi-subunit pre-initiation complexes on the core promoter. Through its association with different transcription factors, TBP can initiate transcription from different RNA polymerases. There are several related TBPs, including TBP-like (TBPL) proteins []. The C-terminal core of TBP (~180 residues) is highly conserved and contains two 77-amino acid repeats that produce a saddle-shaped structure that straddles the DNA; this region binds to the TATA box and interacts with transcription factors and regulatory proteins []. By contrast, the N-terminal region varies in both length and sequence.; GO: 0003677 DNA binding, 0006355 regulation of transcription, DNA-dependent, 0006367 transcription initiation from RNA polymerase II promoter; PDB: 1D3U_A 1PCZ_B 1AIS_A 1NGM_A 1TBP_A 1TBA_B 1YTB_A 1RM1_A 1YTF_A 1NH2_A ....
Probab=39.08  E-value=27  Score=21.74  Aligned_cols=20  Identities=35%  Similarity=0.522  Sum_probs=17.0

Q ss_pred             HHHHHHHHHHHHHHHHHHcC
Q psy1836          47 VENQAKQALTNMGHILKEAG   66 (127)
Q Consensus        47 ~~~Q~~~~~~ni~~~L~~~G   66 (127)
                      =+++++.+++.+..+|++.|
T Consensus        67 s~~~~~~a~~~i~~~L~~~~   86 (86)
T PF00352_consen   67 SEEEAKKAIEKILPILQKLG   86 (86)
T ss_dssp             SHHHHHHHHHHHHHHHHHTT
T ss_pred             CHHHHHHHHHHHHHHHHHcC
Confidence            45789999999999998876


No 36 
>TIGR02671 cas_csx9 CRISPR-associated protein (provisional), Csx9 family. Members of this family, so far, are archaeal proteins found in CRISPR-associated (cas) gene regions. So far, this rare cas protein is found in only three genomes: Pyrococcus horikoshii shinkaj OT3, Pyrococcus abyssi GE5, and Thermococcus kodakarensis KOD1. In each case it is found immediately upstream of cas3 in loci that resemble the Apern type but lack Csa1 and Csa4 genes.
Probab=38.67  E-value=1.5e+02  Score=23.30  Aligned_cols=47  Identities=19%  Similarity=0.234  Sum_probs=33.0

Q ss_pred             ccHHHHHHHHHHHHHHHHHHcCCCccceEEEEEEeCCCCCHHHHHHHHHhhcCCCCCc
Q psy1836          45 EGVENQAKQALTNMGHILKEAGGSYEHVVKTTILLNDINDFATVNNVYGQFFKPPYPA  102 (127)
Q Consensus        45 ~d~~~Q~~~~~~ni~~~L~~~G~~~~~vv~~~vyl~d~~~~~~~~~v~~~~f~~~~Pa  102 (127)
                      .++.+-.+.+++.++.-++ .+.+  .        -...|...+++++++||+..+|-
T Consensus        15 ~~lad~~e~~~dslk~r~~-~~tS--k--------igrNDr~S~~~v~~~WF~~~~Pe   61 (377)
T TIGR02671        15 EELADALEGALDSLKSRAM-INTS--K--------IGRNDRNSFDRVLSAWFGRSAPE   61 (377)
T ss_pred             HHHHHHHHHHHHHHHHHHh-cccc--c--------ccccchhhHHHHHHHHhCCCChH
Confidence            5677777777777776666 3332  1        13568899999999999977663


No 37 
>PF11230 DUF3029:  Protein of unknown function (DUF3029);  InterPro: IPR016905  Members of this family are homologs to enzymes known to undergo activation by a radical SAM protein to create an active site glycyl radical. This family appears to be activated by the YjjW radical SAM protein, usually encoded by an adjacent gene. 
Probab=38.52  E-value=57  Score=27.21  Aligned_cols=45  Identities=20%  Similarity=0.099  Sum_probs=31.7

Q ss_pred             cHHHHHHHHHHHHHHHHHH------cCCCccceEEEEEEeCCCCCHHHHHH
Q psy1836          46 GVENQAKQALTNMGHILKE------AGGSYEHVVKTTILLNDINDFATVNN   90 (127)
Q Consensus        46 d~~~Q~~~~~~ni~~~L~~------~G~~~~~vv~~~vyl~d~~~~~~~~~   90 (127)
                      +++.-=+.+++-++..++.      +..+-+|+||+|-|++..+|.+.+.+
T Consensus       410 T~k~NP~A~ldi~kGAf~~G~Ryfs~y~~~sDlVRVTGYlVK~SdieK~~~  460 (487)
T PF11230_consen  410 TVKNNPEALLDICKGAFSLGMRYFSAYVSDSDLVRVTGYLVKRSDIEKFRQ  460 (487)
T ss_pred             hhHhCHHHHHHHHHHHHhcCCeEEEEecCCCceEEEeeeeehhhhHHHHHc
Confidence            4444445566666666653      23567899999999999999877765


No 38 
>PF12262 Lipase_bact_N:  Bacterial virulence factor lipase N-terminal
Probab=38.51  E-value=80  Score=24.31  Aligned_cols=52  Identities=13%  Similarity=0.170  Sum_probs=39.9

Q ss_pred             ccHHHHHHHHHHHHHHHHHHcCCCccceEEEEEEeCCCC--CHHHHHHHHHhhc
Q psy1836          45 EGVENQAKQALTNMGHILKEAGGSYEHVVKTTILLNDIN--DFATVNNVYGQFF   96 (127)
Q Consensus        45 ~d~~~Q~~~~~~ni~~~L~~~G~~~~~vv~~~vyl~d~~--~~~~~~~v~~~~f   96 (127)
                      .+-..|.+++.++.+.+++++|.+.++|+--..|-|.-.  -+..+..+...-|
T Consensus       202 ~~~l~~lq~li~~~e~~~~~~gv~~~~IvyS~~FTTqS~~~vl~a~k~~~a~~~  255 (268)
T PF12262_consen  202 TGALAQLQQLINSYEALFAAAGVDKDDIVYSSWFTTQSVGDVLEAVKAAIASGL  255 (268)
T ss_pred             hHHHHHHHHHHHHHHHHHHhcCCCccceEEEeeeeccccHHHHHHHHHHHHhhh
Confidence            456678899999999999999999999999999988542  2455555555444


No 39 
>cd02185 AroH Chorismate mutase (AroH) is one of at least five chorismate-utilizing enzymes present in microorganisms that catalyze the rearrangement of chorismate to prephenic acid, the first committed step in the biosynthesis of aromatic amino acids. In prokaryotes, chorismate mutase may be fused to prephenate dehydratase, prephenate dehydrogenase, or 3-deoxy-D-arabino-heptulosonat-7-phosphate (DAHP) as part of a bifunctional enzyme.  The AroH domain forms a homotrimer with three-fold symmetry.
Probab=38.13  E-value=1.3e+02  Score=20.38  Aligned_cols=38  Identities=8%  Similarity=0.181  Sum_probs=28.4

Q ss_pred             HHHHHHHHHHHHHHcCCCccceEEEEEEeCC-CC-CHHHH
Q psy1836          51 AKQALTNMGHILKEAGGSYEHVVKTTILLND-IN-DFATV   88 (127)
Q Consensus        51 ~~~~~~ni~~~L~~~G~~~~~vv~~~vyl~d-~~-~~~~~   88 (127)
                      .+.+-+=++.++++.+.+.+||+++..-+|. +. .||+.
T Consensus        19 ~~at~eLl~~i~~~N~l~~edivSv~FT~T~DL~a~FPA~   58 (117)
T cd02185          19 LEATRELLEEIIERNNIKPEDIISVIFTVTPDLDAAFPAK   58 (117)
T ss_pred             HHHHHHHHHHHHHHcCCCHHHEEEEEEEeCCcccccChHH
Confidence            3344456678899999999999999998884 53 46654


No 40 
>COG1242 Predicted Fe-S oxidoreductase [General function prediction only]
Probab=37.72  E-value=2e+02  Score=22.64  Aligned_cols=70  Identities=14%  Similarity=0.108  Sum_probs=48.4

Q ss_pred             CCEEEEccccccCCCCCcccccHHHHHHHHHHHHHHHHHHcCCCccceEEEEEEeCCCCCHHHHHHHHHhhcCCC
Q psy1836          25 DNTLYISGVLGIDASSGKMAEGVENQAKQALTNMGHILKEAGGSYEHVVKTTILLNDINDFATVNNVYGQFFKPP   99 (127)
Q Consensus        25 g~~v~iSG~~~~d~~~g~~~~d~~~Q~~~~~~ni~~~L~~~G~~~~~vv~~~vyl~d~~~~~~~~~v~~~~f~~~   99 (127)
                      |+..|+|-..+-|. .++...++++|.+...+.+.+--....    -++-..-|..--+..+.+.+.|.+.+..+
T Consensus        44 GGCtFC~~~g~~d~-~~~~~~~i~~Q~~~q~~~~~kK~~~~k----yiaYFQ~~TNTyApvevLre~ye~aL~~~  113 (312)
T COG1242          44 GGCTFCSVAGSGDF-AGQPKISIAEQFKEQAERMHKKWKRGK----YIAYFQAYTNTYAPVEVLREMYEQALSEA  113 (312)
T ss_pred             CceeeecCCCCCcc-ccCcccCHHHHHHHHHHHHHHhhcCCc----EEEEEeccccccCcHHHHHHHHHHHhCcC
Confidence            78899876555555 455557899998887776665433332    56666666666677788888888887754


No 41 
>cd04516 TBP_eukaryotes eukaryotic TATA box binding protein (TBP): Present in archaea and eukaryotes, TBPs are transcription factors that recognize promoters and initiate transcription. TBP has been shown to be an essential component of three different transcription initiation complexes: SL1, TFIID and TFIIIB, directing transcription by RNA polymerases I, II and III, respectively. TBP binds directly to the TATA box promoter element, where it nucleates polymerase assembly, thus defining the transcription start site. TBP's binding in the minor groove induces a dramatic DNA bending while its own structure barely changes. The conserved core domain of TBP, which binds to the TATA box, has a bipartite structure, with intramolecular symmetry generating a saddle-shaped structure that sits astride the DNA.
Probab=37.50  E-value=54  Score=23.48  Aligned_cols=37  Identities=19%  Similarity=0.260  Sum_probs=27.5

Q ss_pred             EEEEccccccCCCCCcccccHHHHHHHHHHHHHHHHHHcCCCc
Q psy1836          27 TLYISGVLGIDASSGKMAEGVENQAKQALTNMGHILKEAGGSY   69 (127)
Q Consensus        27 ~v~iSG~~~~d~~~g~~~~d~~~Q~~~~~~ni~~~L~~~G~~~   69 (127)
                      +||-||-+-+..      .--+++++.+++++..+|+..|...
T Consensus        51 lIF~SGKiviTG------aks~e~a~~a~~~i~~~L~~~g~~~   87 (174)
T cd04516          51 LIFSSGKMVCTG------AKSEDDSKLAARKYARIIQKLGFPA   87 (174)
T ss_pred             EEECCCeEEEEe------cCCHHHHHHHHHHHHHHHHHcCCCC
Confidence            677777655422      2345789999999999999999754


No 42 
>cd04517 TLF TBP-like factors (TLF; also called TLP, TRF, TRP), which are found in most metazoans. TLFs and TBPs have well-conserved core domains; however, they only share about 60% similarity. TLFs, like TBPs, interact with TFIIA and TFIIB, which are part of the basal transcription machinery. Yet, in contrast to TBPs, TLFs seem not to interact with the TATA-box and even have a negative effect on the transcription of TATA-containing promoters. Recent results indicate that TLFs are involved in the transcription via TATA-less promoters.
Probab=37.19  E-value=60  Score=23.22  Aligned_cols=37  Identities=27%  Similarity=0.377  Sum_probs=26.7

Q ss_pred             EEEEccccccCCCCCcccccHHHHHHHHHHHHHHHHHHcCCCc
Q psy1836          27 TLYISGVLGIDASSGKMAEGVENQAKQALTNMGHILKEAGGSY   69 (127)
Q Consensus        27 ~v~iSG~~~~d~~~g~~~~d~~~Q~~~~~~ni~~~L~~~G~~~   69 (127)
                      +||-||-+-+..      ..-+++++.+++.+..+|+..|...
T Consensus        51 lIF~sGKiviTG------aks~~~~~~a~~~~~~~l~~~g~~~   87 (174)
T cd04517          51 SVWSSGKITITG------ATSEEEAKQAARRAARLLQKLGFKV   87 (174)
T ss_pred             EEECCCeEEEEc------cCCHHHHHHHHHHHHHHHHHcCCCc
Confidence            667666554422      2346788999999999999999765


No 43 
>TIGR02499 HrpE_YscL_not type III secretion apparatus protein, HrpE/YscL family. This model is related to Pfam model pfam06188, but is broader. pfam06188 describes HrpE-like proteins, components of bacterial type III secretion systems primarily in bacteria that infect plants. This model includes also the homologous proteins of animal pathogens, such as YscL of Yersinia pestis. This model excludes the related protein FliH of the bacterial flagellar apparatus (see pfam02108)
Probab=35.62  E-value=1.4e+02  Score=20.30  Aligned_cols=48  Identities=10%  Similarity=-0.008  Sum_probs=34.8

Q ss_pred             EEEEEeCCCCCHHHHHHHHHhhcCCCCCccEEEEcCCCCCCCEEEEEeccc
Q psy1836          74 KTTILLNDINDFATVNNVYGQFFKPPYPARSTFQVGKLPLIDFCLVVLGGH  124 (127)
Q Consensus        74 ~~~vyl~d~~~~~~~~~v~~~~f~~~~Pa~t~v~V~~L~~~~~liEi~~~~  124 (127)
                      .+++++ +.+|++.+.+.+.+++...+  .+.+.=..|...++.||=..|.
T Consensus       115 ~v~I~v-~P~d~~~l~~~l~~~~~~~~--~~i~~D~~l~~G~c~vet~~G~  162 (166)
T TIGR02499       115 RLTLRV-HPEQLDEVREALAERLALEP--WELEPDASLAPGACVLETESGV  162 (166)
T ss_pred             ceEEEE-CHHHHHHHHHHHHHHhccCC--eEEeeCCCCCCCCEEEEeCCce
Confidence            445554 67788888888888775443  6666656788999999887764


No 44 
>COG1022 FAA1 Long-chain acyl-CoA synthetases (AMP-forming) [Lipid metabolism]
Probab=35.14  E-value=29  Score=29.83  Aligned_cols=32  Identities=22%  Similarity=0.281  Sum_probs=25.5

Q ss_pred             CEEEEccccccCCCCCccc--ccHHHHHHHHHHHHH
Q psy1836          26 NTLYISGVLGIDASSGKMA--EGVENQAKQALTNMG   59 (127)
Q Consensus        26 ~~v~iSG~~~~d~~~g~~~--~d~~~Q~~~~~~ni~   59 (127)
                      -++||||++|. | .|-+.  ..+..|+..+.+++.
T Consensus       195 tiiYTSGTTG~-P-KGVmLth~N~~~~v~~~~~~~~  228 (613)
T COG1022         195 TIIYTSGTTGT-P-KGVMLTHRNLLAQVAGIDEVLP  228 (613)
T ss_pred             EEEEcCCCCCC-C-ceEEEehHHHHHHHHHHHhhCC
Confidence            47999999987 5 67665  788888888887775


No 45 
>COG3453 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=34.57  E-value=6.7  Score=26.83  Aligned_cols=76  Identities=14%  Similarity=0.139  Sum_probs=46.6

Q ss_pred             EEEECCEEEEccccccCCC-----CC--ccc---ccHHHHHHHHHHHHHHHHHHcCCCccceEEEEEEeCCCCCHHHHHH
Q psy1836          21 AVQHDNTLYISGVLGIDAS-----SG--KMA---EGVENQAKQALTNMGHILKEAGGSYEHVVKTTILLNDINDFATVNN   90 (127)
Q Consensus        21 av~~g~~v~iSG~~~~d~~-----~g--~~~---~d~~~Q~~~~~~ni~~~L~~~G~~~~~vv~~~vyl~d~~~~~~~~~   90 (127)
                      .+++++.+.+|||+..+.-     .|  .++   +|-|+--.--++.++...+++|++..+| =++-+-...++...+.+
T Consensus         3 i~~I~d~lsVsgQi~~~D~~~iaa~GFksiI~nRPDgEe~~QP~~~~i~~aa~~aGl~y~~i-PV~~~~iT~~dV~~f~~   81 (130)
T COG3453           3 IRRINDRLSVSGQISPADIASIAALGFKSIICNRPDGEEPGQPGFAAIAAAAEAAGLTYTHI-PVTGGGITEADVEAFQR   81 (130)
T ss_pred             ceecccceeecCCCCHHHHHHHHHhccceecccCCCCCCCCCCChHHHHHHHHhcCCceEEe-ecCCCCCCHHHHHHHHH
Confidence            4678899999999765321     01  111   4444444446778889999999987665 23332223345667777


Q ss_pred             HHHhhcC
Q psy1836          91 VYGQFFK   97 (127)
Q Consensus        91 v~~~~f~   97 (127)
                      .+.+.=+
T Consensus        82 Al~eaeg   88 (130)
T COG3453          82 ALDEAEG   88 (130)
T ss_pred             HHHHhCC
Confidence            7666543


No 46 
>PF00816 Histone_HNS:  H-NS histone family Partial NMR structure.;  InterPro: IPR001801 The histone-like nucleoid-structuring (H-NS) protein belongs to a family of bacterial proteins that play a role in the formation of nucleoid structure and affect gene expression under certain conditions [].; GO: 0003677 DNA binding, 0006355 regulation of transcription, DNA-dependent, 0005622 intracellular; PDB: 2LEV_A 1HNS_A 1LR1_B 1HNR_A 1NI8_A 1OV9_A 2JR1_A 3NR7_A 2L93_A 2L92_A.
Probab=34.29  E-value=21  Score=22.51  Aligned_cols=24  Identities=21%  Similarity=0.375  Sum_probs=18.8

Q ss_pred             HHHHHHHHHHHHHHHHcCCCccce
Q psy1836          49 NQAKQALTNMGHILKEAGGSYEHV   72 (127)
Q Consensus        49 ~Q~~~~~~ni~~~L~~~G~~~~~v   72 (127)
                      .+.+.++..|..++++.|.+++|+
T Consensus        20 ~e~~~~~~~i~~~~~~~Gis~~el   43 (93)
T PF00816_consen   20 QEREEAIAEIRELMAEYGISPEEL   43 (93)
T ss_dssp             HCCHHHHHHHHHHHHHTT--HHHC
T ss_pred             HHHHHHHHHHHHHHHHhCCCHHHh
Confidence            345668999999999999999999


No 47 
>PTZ00009 heat shock 70 kDa protein; Provisional
Probab=32.53  E-value=1.7e+02  Score=25.19  Aligned_cols=53  Identities=21%  Similarity=0.340  Sum_probs=38.9

Q ss_pred             ccHHHHHH----HHHHHHHHHHHHcCCCccceEEEEEEeCCCCCHHHHHHHHHhhcCC
Q psy1836          45 EGVENQAK----QALTNMGHILKEAGGSYEHVVKTTILLNDINDFATVNNVYGQFFKP   98 (127)
Q Consensus        45 ~d~~~Q~~----~~~~ni~~~L~~~G~~~~~vv~~~vyl~d~~~~~~~~~v~~~~f~~   98 (127)
                      .++++-++    .+++.++.+|+.+|.+.++|- --+.+=.....+.+.+...++|+.
T Consensus       302 ~~fe~l~~~l~~~~~~~i~~~L~~a~~~~~~i~-~ViLvGGssriP~v~~~i~~~f~~  358 (653)
T PTZ00009        302 ARFEELCGDYFRNTLQPVEKVLKDAGMDKRSVH-EVVLVGGSTRIPKVQSLIKDFFNG  358 (653)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHcCCCHHHCc-EEEEECCCCCChhHHHHHHHHhCC
Confidence            55655544    456678889999999998874 233344567889999999999974


No 48 
>PF02108 FliH:  Flagellar assembly protein FliH;  InterPro: IPR018035 This entry represents a region found in the flagellar assembly protein FliH, as well as in type III secretion system protein HrpE. Many flagellar proteins are exported by a flagellum-specific export pathway. Attempts have been made to characterise the apparatus responsible for this process, by designing assays to screen for mutants with export defects. Experiments involving filament removal from temperature-sensitive flagellar mutants of Salmonella typhimurium have shown that, while most mutants were able to regrow filaments, flhA, fliH, fliI and fliN mutants showed no or greatly reduced regrowth. This suggests that the corresponding gene products are involved in the process of flagellum-specific export []. The sequence of fliH has been deduced and shown to encode a protein of molecular mass of 25,782 Da. Bacterial HrpE proteins are belived to function on the type III secretion system, specifically the secretion of HrpZ (harpinPss) [].
Probab=32.19  E-value=1.4e+02  Score=19.18  Aligned_cols=66  Identities=15%  Similarity=0.120  Sum_probs=42.0

Q ss_pred             HHHHHHHHHHHHHcCCCccceEEEEEEeCCCCCHHHHHHHHHhhcCCCCCccEEEEcCCCCCCCEEEEEecc
Q psy1836          52 KQALTNMGHILKEAGGSYEHVVKTTILLNDINDFATVNNVYGQFFKPPYPARSTFQVGKLPLIDFCLVVLGG  123 (127)
Q Consensus        52 ~~~~~ni~~~L~~~G~~~~~vv~~~vyl~d~~~~~~~~~v~~~~f~~~~Pa~t~v~V~~L~~~~~liEi~~~  123 (127)
                      ..+++-|+.+|....   .+--++++|+ +.+|++.+.+...+.+...  ...++.=..|+.+++.||-..|
T Consensus        50 ~~i~~~i~~al~~~~---~~~~~v~I~v-~p~d~~~l~~~~~~~~~~~--~~~l~~D~~l~~G~c~iet~~g  115 (128)
T PF02108_consen   50 EAILNLIREALQELP---RDEEKVTIRV-HPDDYEALEELLEDELPEL--GWELVADPSLAPGDCRIETEDG  115 (128)
T ss_pred             HHHHHHHHHHHHHhh---ccCCCeEEEE-CHHHHHHHHHHHHHHHhhc--CCEEEecCCCCCCCEEEEECCe
Confidence            344455555552222   2233566666 6778888888877665432  2277776789999999998776


No 49 
>KOG0813|consensus
Probab=31.85  E-value=52  Score=25.35  Aligned_cols=32  Identities=28%  Similarity=0.482  Sum_probs=24.5

Q ss_pred             ceEEEECCEEEEccccccCCCCCcccccHHHHHHHHHHH
Q psy1836          19 FQAVQHDNTLYISGVLGIDASSGKMAEGVENQAKQALTN   57 (127)
Q Consensus        19 s~av~~g~~v~iSG~~~~d~~~g~~~~d~~~Q~~~~~~n   57 (127)
                      .+++-.|+.+|.+|.       |.+-++..+|+..+++.
T Consensus       132 e~~iFtGDtlf~~Gc-------G~~FEgt~~~M~~sl~~  163 (265)
T KOG0813|consen  132 ERAIFTGDTLFGAGC-------GRFFEGTAEQMDSSLNE  163 (265)
T ss_pred             CCeEEeCCceeecCc-------cchhcCCHHHHHHhHHH
Confidence            457777999998884       44446788899888887


No 50 
>KOG3887|consensus
Probab=31.81  E-value=1.3e+02  Score=23.57  Aligned_cols=44  Identities=23%  Similarity=0.364  Sum_probs=33.7

Q ss_pred             HHHHHHHHHHHHHcCCCccceEEEEEEeCCCCC---HHHHHHHHHhhcCC
Q psy1836          52 KQALTNMGHILKEAGGSYEHVVKTTILLNDIND---FATVNNVYGQFFKP   98 (127)
Q Consensus        52 ~~~~~ni~~~L~~~G~~~~~vv~~~vyl~d~~~---~~~~~~v~~~~f~~   98 (127)
                      |.+.++...-|+.+|.  ++ ++++.|+|.+-|   |+++.++-++..+.
T Consensus       158 rdI~qr~~d~l~d~gl--e~-v~vsf~LTSIyDHSIfEAFSkvVQkLipq  204 (347)
T KOG3887|consen  158 RDIHQRTNDELADAGL--EK-VQVSFYLTSIYDHSIFEAFSKVVQKLIPQ  204 (347)
T ss_pred             HHHHHHhhHHHHhhhh--cc-ceEEEEEeeecchHHHHHHHHHHHHHhhh
Confidence            4577777788888876  44 489999998855   88999998887653


No 51 
>KOG1179|consensus
Probab=31.63  E-value=1.2e+02  Score=26.38  Aligned_cols=44  Identities=25%  Similarity=0.317  Sum_probs=27.4

Q ss_pred             EEEEccccccCCCCCcccccHHHHHHHHHHHHHHHHHHcCCCccceEEEEE
Q psy1836          27 TLYISGVLGIDASSGKMAEGVENQAKQALTNMGHILKEAGGSYEHVVKTTI   77 (127)
Q Consensus        27 ~v~iSG~~~~d~~~g~~~~d~~~Q~~~~~~ni~~~L~~~G~~~~~vv~~~v   77 (127)
                      +||+||++|.-.      .-+-.+.| .+.--.-.....|++.+||+..+.
T Consensus       247 yIYTSGTTGLPK------aAvith~r-~~~~a~g~~~~~g~~~~DvvY~~l  290 (649)
T KOG1179|consen  247 YIYTSGTTGLPK------AAVITHLR-YLQGAAGFYYVFGMTADDVVYTTL  290 (649)
T ss_pred             EEEcCCCCCCCH------HHHHHHHH-HHHHHHHHHHHhCCCccceEEEcc
Confidence            799999998843      12222221 222222367788999999987654


No 52 
>KOG3010|consensus
Probab=31.36  E-value=1.1e+02  Score=23.45  Aligned_cols=44  Identities=14%  Similarity=0.321  Sum_probs=35.3

Q ss_pred             HHHHHHHHHHHHHcCCCccceEEEEEEeCCCCCHHHHHHHHHhhcCCC
Q psy1836          52 KQALTNMGHILKEAGGSYEHVVKTTILLNDINDFATVNNVYGQFFKPP   99 (127)
Q Consensus        52 ~~~~~ni~~~L~~~G~~~~~vv~~~vyl~d~~~~~~~~~v~~~~f~~~   99 (127)
                      +..++++.++|+.-|+    ++-+..|-.|.-+++++..++.++....
T Consensus       116 e~fy~~~~rvLRk~Gg----~iavW~Y~dd~v~~pE~dsv~~r~~~~~  159 (261)
T KOG3010|consen  116 ERFYKEAYRVLRKDGG----LIAVWNYNDDFVDWPEFDSVMLRLYDST  159 (261)
T ss_pred             HHHHHHHHHHcCCCCC----EEEEEEccCCCcCCHHHHHHHHHHhhcc
Confidence            3566777888888775    6788999999999999999998887644


No 53 
>KOG1394|consensus
Probab=31.09  E-value=1.1e+02  Score=25.14  Aligned_cols=48  Identities=23%  Similarity=0.156  Sum_probs=35.1

Q ss_pred             HHHHHHHHHHHHcCCCccceEEEEEEeCCCC--CHHHHHHHHHhhcCCCCC
Q psy1836          53 QALTNMGHILKEAGGSYEHVVKTTILLNDIN--DFATVNNVYGQFFKPPYP  101 (127)
Q Consensus        53 ~~~~ni~~~L~~~G~~~~~vv~~~vyl~d~~--~~~~~~~v~~~~f~~~~P  101 (127)
                      -+...|+..|+.+|.+++||-.++..-|...  |..+... .++.|+++.+
T Consensus       302 Ga~~am~raL~~Agl~pe~i~YvNAHATST~~GD~aE~~A-i~~vf~~~~~  351 (440)
T KOG1394|consen  302 GAVLAMERALKDAGLSPEDIDYVNAHATSTPLGDAAEAEA-IKRVFGEHNI  351 (440)
T ss_pred             hHHHHHHHHHHHcCCChhhcCeeecccccCcCchHHHHHH-HHHHhccCCc
Confidence            4567789999999999999999999888754  4444443 4556666533


No 54 
>cd00652 TBP_TLF TATA box binding protein (TBP): Present in archaea and eukaryotes, TBPs are transcription factors that recognize promoters and initiate transcription. TBP has been shown to be an essential component of three different transcription initiation complexes: SL1, TFIID and TFIIIB, directing transcription by RNA polymerases I, II and III, respectively. TBP binds directly to the TATA box promoter element, where it nucleates polymerase assembly, thus defining the transcription start site. TBP's binding in the minor groove induces a dramatic DNA bending while its own structure barely changes. The conserved core domain of TBP, which binds to the TATA box, has a bipartite structure, with intramolecular symmetry generating a saddle-shaped structure that sits astride the DNA. New members of the TBP family, called TBP-like proteins (TBLP, TLF, TLP) or TBP-related factors (TRF1, TRF2,TRP), are similar to the core domain of TBPs, with identical or chemically similar amino acids at many
Probab=30.72  E-value=78  Score=22.56  Aligned_cols=37  Identities=27%  Similarity=0.327  Sum_probs=26.2

Q ss_pred             EEEEccccccCCCCCcccccHHHHHHHHHHHHHHHHHHcCCCc
Q psy1836          27 TLYISGVLGIDASSGKMAEGVENQAKQALTNMGHILKEAGGSY   69 (127)
Q Consensus        27 ~v~iSG~~~~d~~~g~~~~d~~~Q~~~~~~ni~~~L~~~G~~~   69 (127)
                      +||-||-+-...      ..=+++++.+++.+...|+..|...
T Consensus        51 lIf~sGKivitG------aks~~~~~~a~~~~~~~L~~~g~~~   87 (174)
T cd00652          51 LIFSSGKMVITG------AKSEEDAKLAARKYARILQKLGFPV   87 (174)
T ss_pred             EEECCCEEEEEe------cCCHHHHHHHHHHHHHHHHHcCCCc
Confidence            566666544321      1345788999999999999999765


No 55 
>PF01071 GARS_A:  Phosphoribosylglycinamide synthetase, ATP-grasp (A) domain;  InterPro: IPR020561 Phosphoribosylglycinamide synthetase (6.3.4.13 from EC) (GARS) (phosphoribosylamine glycine ligase) [] catalyses the second step in the de novo biosynthesis of purine. The reaction catalysed by phosphoribosylglycinamide synthetase is the ATP-dependent addition of 5-phosphoribosylamine to glycine to form 5'phosphoribosylglycinamide:  ATP + 5-phosphoribosylamine + glycine = ADP + Pi + 5'-phosphoribosylglycinamide  In bacteria, GARS is a monofunctional enzyme (encoded by the purD gene). In yeast, GARS is part of a bifunctional enzyme (encoded by the ADE5/7 gene) in conjunction with phosphoribosylformylglycinamidine cyclo-ligase (AIRS) (IPR000728 from INTERPRO). In higher eukaryotes, GARS is part of a trifunctional enzyme in conjunction with AIRS (IPR000728 from INTERPRO) and with phosphoribosylglycinamide formyltransferase (GART) (), forming GARS-AIRS-GART. This entry represents the A-domain of the enzyme, and is related to the ATP-grasp domain of biotin carboxylase/carbamoyl phosphate synthetase.; PDB: 1GSO_A 3LP8_A 2IP4_A 1VKZ_A 2QK4_A 2XD4_A 2XCL_A 3MJF_A 2YRW_A 2YS6_A ....
Probab=30.59  E-value=61  Score=23.70  Aligned_cols=53  Identities=15%  Similarity=0.168  Sum_probs=34.8

Q ss_pred             EEEccccccCCCCCcccccHHHHHHH-HHHHHHHHHHHcCCCccceEEEEEEeCC
Q psy1836          28 LYISGVLGIDASSGKMAEGVENQAKQ-ALTNMGHILKEAGGSYEHVVKTTILLND   81 (127)
Q Consensus        28 v~iSG~~~~d~~~g~~~~d~~~Q~~~-~~~ni~~~L~~~G~~~~~vv~~~vyl~d   81 (127)
                      --+.|+.+.-| .-.+..+..++++. +++.+...|++.|....-|+...+.+++
T Consensus       124 pnTGGMGa~sp-~p~~~~~~~~~i~~~I~~pt~~~l~~eg~~y~GvLy~glMlt~  177 (194)
T PF01071_consen  124 PNTGGMGAYSP-VPFITDELLEEIIEEILEPTLKGLKKEGIPYRGVLYAGLMLTE  177 (194)
T ss_dssp             EEESESEEEES-TTTS-HHHHHHHHHHTHHHHHHHHHHTT---EEEEEEEEEEET
T ss_pred             CCCCCccceee-cccCCHHHHHHHHHHHHHHHHHHHHhcCCCcceeeeeeeEEeC
Confidence            33445544444 22223566666665 9999999999999999999999998885


No 56 
>cd04915 ACT_AK-Ectoine_2 ACT domains located C-terminal to the catalytic domain of the aspartokinase of the ectoine (1,4,5,6-tetrahydro-2-methyl pyrimidine-4-carboxylate) biosynthetic pathway. This CD includes the second of two ACT domains located C-terminal to the catalytic domain of the aspartokinase of the ectoine (1,4,5,6-tetrahydro-2-methyl pyrimidine-4-carboxylate) biosynthetic pathway found in Methylomicrobium alcaliphilum, Vibrio cholerae, and various other halotolerant or halophilic bacteria. Bacteria exposed to hyperosmotic stress accumulate organic solutes called 'compatible solutes'  of which ectoine, a heterocyclic amino acid, is one. Apart from its osmotic function, ectoine also exhibits a protective effect on proteins, nucleic acids and membranes against a variety of stress factors. de novo synthesis of ectoine starts with the phosphorylation of L-aspartate and shares its first two enzymatic steps with the biosynthesis of amino acids of the aspartate family: aspartokinas
Probab=30.29  E-value=1.2e+02  Score=17.64  Aligned_cols=45  Identities=11%  Similarity=0.025  Sum_probs=31.1

Q ss_pred             HHHHHHHHHHHHcCCCccceE------EEEEEeCCCCCHHHHHHHHHhhcC
Q psy1836          53 QALTNMGHILKEAGGSYEHVV------KTTILLNDINDFATVNNVYGQFFK   97 (127)
Q Consensus        53 ~~~~ni~~~L~~~G~~~~~vv------~~~vyl~d~~~~~~~~~v~~~~f~   97 (127)
                      .+..++-..|.+.|..+.-+.      ++.+.+.+-+.-+.++.++++||.
T Consensus        16 gv~~ki~~~L~~~~I~v~~i~~~~s~~~is~~V~~~~~~~av~~Lh~~f~~   66 (66)
T cd04915          16 GVLARGLAALAEAGIEPIAAHQSMRNVDVQFVVDRDDYDNAIKALHAALVE   66 (66)
T ss_pred             hHHHHHHHHHHHCCCCEEEEEecCCeeEEEEEEEHHHHHHHHHHHHHHHhC
Confidence            477888889999998874333      444555444445677888888873


No 57 
>cd04919 ACT_AK-Hom3_2 ACT domains located C-terminal to the catalytic domain of the aspartokinase (AK) HOM3. This CD includes the second of two ACT domains located C-terminal to the catalytic domain of the aspartokinase (AK) HOM3, a monofunctional class enzyme found in Saccharomyces cerevisiae, and other related ACT domains. AK is the first enzyme in the aspartate metabolic pathway, catalyzes the conversion of aspartate and ATP to aspartylphosphate and ADP, and in fungi, is responsible for the production of threonine, isoleucine and methionine. S. cerevisiae has a single AK, which is regulated by feedback, allosteric inhibition by L-threonine. Recent studies shown that the allosteric transition triggered by binding of threonine to AK involves a large change in the conformation of the native hexameric enzyme that is converted to an inactive one of different shape and substantially smaller hydrodynamic size. Members of this CD belong to the superfamily of ACT regulatory domains.
Probab=29.96  E-value=1.1e+02  Score=17.17  Aligned_cols=44  Identities=9%  Similarity=0.132  Sum_probs=26.6

Q ss_pred             HHHHHHHHHHHHcCCCccceEE------EEEEeCCCCCHHHHHHHHHhhc
Q psy1836          53 QALTNMGHILKEAGGSYEHVVK------TTILLNDINDFATVNNVYGQFF   96 (127)
Q Consensus        53 ~~~~ni~~~L~~~G~~~~~vv~------~~vyl~d~~~~~~~~~v~~~~f   96 (127)
                      .....+-..|++.|....-+.+      +.+.+.+-+.-..++.+++++|
T Consensus        16 ~~~~~if~~L~~~~I~v~~i~q~~s~~~isf~v~~~~~~~a~~~lh~~~~   65 (66)
T cd04919          16 GIAGRMFTTLADHRINIEMISQGASEINISCVIDEKDAVKALNIIHTNLL   65 (66)
T ss_pred             CHHHHHHHHHHHCCCCEEEEEecCccceEEEEEeHHHHHHHHHHHHHHHh
Confidence            4456677888898887765543      4444444332344566676665


No 58 
>COG3082 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=29.66  E-value=1e+02  Score=18.89  Aligned_cols=23  Identities=17%  Similarity=0.316  Sum_probs=19.8

Q ss_pred             HHHHHHHHHHHHHHHHHcCCCcc
Q psy1836          48 ENQAKQALTNMGHILKEAGGSYE   70 (127)
Q Consensus        48 ~~Q~~~~~~ni~~~L~~~G~~~~   70 (127)
                      .+|+++++..+.++|+.+....+
T Consensus         9 De~ve~il~e~iaVLeKH~ap~d   31 (74)
T COG3082           9 DEQVEQILNELIAVLEKHKAPTD   31 (74)
T ss_pred             HHHHHHHHHHHHHHHHhcCCCcc
Confidence            47999999999999999987543


No 59 
>CHL00123 rps6 ribosomal protein S6; Validated
Probab=29.65  E-value=74  Score=20.36  Aligned_cols=24  Identities=17%  Similarity=0.274  Sum_probs=20.7

Q ss_pred             HHHHHHHHHHHHHHHHHcCCCccc
Q psy1836          48 ENQAKQALTNMGHILKEAGGSYEH   71 (127)
Q Consensus        48 ~~Q~~~~~~ni~~~L~~~G~~~~~   71 (127)
                      ++|...++++++.+|++.|+....
T Consensus        20 e~~~~~~~~~~~~~i~~~gg~i~~   43 (97)
T CHL00123         20 EEELLKWIENYKKLLRKRGAKNIS   43 (97)
T ss_pred             HHHHHHHHHHHHHHHHHCCCEEEE
Confidence            679999999999999999986443


No 60 
>KOG2708|consensus
Probab=29.42  E-value=2.7e+02  Score=21.54  Aligned_cols=68  Identities=13%  Similarity=0.175  Sum_probs=49.8

Q ss_pred             CCCccc-ccHHHHHHHHHHHHHHHHHHcCCCccceEEEEEEeCCCCC------HHHHHHHHHhhcCCC-CCccEEEE
Q psy1836          39 SSGKMA-EGVENQAKQALTNMGHILKEAGGSYEHVVKTTILLNDIND------FATVNNVYGQFFKPP-YPARSTFQ  107 (127)
Q Consensus        39 ~~g~~~-~d~~~Q~~~~~~ni~~~L~~~G~~~~~vv~~~vyl~d~~~------~~~~~~v~~~~f~~~-~Pa~t~v~  107 (127)
                      ..|-++ +..+---..++.-++..|+++|.+.+|| .+-.|.+...-      -..+.+....++..+ -+..-|++
T Consensus        37 G~GFlP~~TA~HHr~~il~Lv~~al~ea~v~~~di-D~icyTKGPGmgaPL~~vaivaRtlsllw~kPlv~VNHCig  112 (336)
T KOG2708|consen   37 GEGFLPRDTARHHRAWILGLVKQALEEAGVTSDDI-DCICYTKGPGMGAPLSVVAIVARTLSLLWNKPLVGVNHCIG  112 (336)
T ss_pred             CCCCCcchhHHHHHHHHHHHHHHHHHHcCCChhhC-CEEEEcCCCCCCCchhhHHHHHHHHHHHhCCCcccchhhhh
Confidence            356666 6666677779999999999999999997 78888886532      344566667777766 55666665


No 61 
>PLN02633 palmitoyl protein thioesterase family protein
Probab=29.39  E-value=2.9e+02  Score=21.90  Aligned_cols=58  Identities=14%  Similarity=0.049  Sum_probs=42.9

Q ss_pred             ccccHHHHHHHHHHHHHHHHHHcCCCccceEEEEEEeCCCCCHHHHHHHHHhhcCCCCCccEEEEcC
Q psy1836          43 MAEGVENQAKQALTNMGHILKEAGGSYEHVVKTTILLNDINDFATVNNVYGQFFKPPYPARSTFQVG  109 (127)
Q Consensus        43 ~~~d~~~Q~~~~~~ni~~~L~~~G~~~~~vv~~~vyl~d~~~~~~~~~v~~~~f~~~~Pa~t~v~V~  109 (127)
                      ...++.+|++.+.+.|.. .++..-+        +-+...+.=..+-+.+-+++++++|.++.|-.+
T Consensus        71 ~~~~~~~Qve~vce~l~~-~~~l~~G--------~naIGfSQGGlflRa~ierc~~~p~V~nlISlg  128 (314)
T PLN02633         71 WLMPLTQQAEIACEKVKQ-MKELSQG--------YNIVGRSQGNLVARGLIEFCDGGPPVYNYISLA  128 (314)
T ss_pred             ceeCHHHHHHHHHHHHhh-chhhhCc--------EEEEEEccchHHHHHHHHHCCCCCCcceEEEec
Confidence            337899999999999988 4444322        223345666788999999998878999998644


No 62 
>COG2101 SPT15 TATA-box binding protein (TBP), component of TFIID and TFIIIB [Transcription]
Probab=29.38  E-value=76  Score=23.08  Aligned_cols=35  Identities=34%  Similarity=0.371  Sum_probs=22.8

Q ss_pred             EEEEccccccCCCCCcccccHHHHHHHHHHHHHHHHHHcCC
Q psy1836          27 TLYISGVLGIDASSGKMAEGVENQAKQALTNMGHILKEAGG   67 (127)
Q Consensus        27 ~v~iSG~~~~d~~~g~~~~d~~~Q~~~~~~ni~~~L~~~G~   67 (127)
                      ++|-||-+=+..      .--++.++++++++..-|++.|.
T Consensus       150 LiF~SGK~ViTG------aK~~ed~~~Av~~i~~~L~elgl  184 (185)
T COG2101         150 LLFGSGKLVITG------AKSEEDAEQAVEKIQSRLEELGL  184 (185)
T ss_pred             EEecCCcEEEec------CCCHHHHHHHHHHHHHHHHHhcc
Confidence            566666544422      33456677888888888888774


No 63 
>PRK13411 molecular chaperone DnaK; Provisional
Probab=29.33  E-value=1.8e+02  Score=25.06  Aligned_cols=52  Identities=23%  Similarity=0.331  Sum_probs=37.6

Q ss_pred             ccHHHHHH----HHHHHHHHHHHHcCCCccceEEEEEEeC-CCCCHHHHHHHHHhhcCC
Q psy1836          45 EGVENQAK----QALTNMGHILKEAGGSYEHVVKTTILLN-DINDFATVNNVYGQFFKP   98 (127)
Q Consensus        45 ~d~~~Q~~----~~~~ni~~~L~~~G~~~~~vv~~~vyl~-d~~~~~~~~~v~~~~f~~   98 (127)
                      .++++=++    .+.+-++.+|+.+|.+.++|-  .|.++ .....+.+.+...++|+.
T Consensus       297 ~~fe~l~~~l~~~~~~~i~~~L~~a~~~~~~id--~ViLvGGssriP~v~~~l~~~f~~  353 (653)
T PRK13411        297 AKFEELTKDLVEATIEPMQQALKDAGLKPEDID--RVILVGGSTRIPAVQEAIQKFFGG  353 (653)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHcCCCHHHCc--EEEEECCCCCcchHHHHHHHHcCC
Confidence            44544444    455567788899999988873  34444 567889999999999974


No 64 
>PF07108 PipA:  PipA protein;  InterPro: IPR010777 This family consists of several Salmonella PipA (pathogenicity island-encoded protein A) and related phage sequences. PipA is thought to contribute to enteric but not to systemic salmonellosis [].
Probab=29.10  E-value=56  Score=23.48  Aligned_cols=34  Identities=26%  Similarity=0.503  Sum_probs=21.4

Q ss_pred             EEEccccccCCCCCcccccHHHHHHHHHHHHHHHHHHc
Q psy1836          28 LYISGVLGIDASSGKMAEGVENQAKQALTNMGHILKEA   65 (127)
Q Consensus        28 v~iSG~~~~d~~~g~~~~d~~~Q~~~~~~ni~~~L~~~   65 (127)
                      .|+||-.|+|| +-++-+|+..   .|.+.+..+|+.+
T Consensus        40 y~iSg~ggidp-d~EiDdd~Y~---eC~~eLS~il~~A   73 (200)
T PF07108_consen   40 YLISGAGGIDP-DTEIDDDIYD---ECYDELSSILQNA   73 (200)
T ss_pred             HHHhCCCCCCC-cccccchHHH---HHHHHHHHHHHHH
Confidence            57999999999 5555455444   4455555555544


No 65 
>PF01849 NAC:  NAC domain;  InterPro: IPR002715 Nascent polypeptide-associated complex (NAC) is among the first ribosome-associated entities to bind the nascent polypeptide after peptide bond formation. The nascent polypeptide-associated complex (NAC) of yeast functions in the targeting process of ribosomes to the ER membrane []. NAC may prevent binding of ribosome nascent chains (RNCs) without a signal sequence to yeast membranes.; PDB: 3MCE_D 3MCB_A 3LKX_B 1TR8_B.
Probab=28.32  E-value=55  Score=18.80  Aligned_cols=26  Identities=23%  Similarity=0.254  Sum_probs=16.5

Q ss_pred             HHHHHHHHcCC-CccceEEEEEEeCCC
Q psy1836          57 NMGHILKEAGG-SYEHVVKTTILLNDI   82 (127)
Q Consensus        57 ni~~~L~~~G~-~~~~vv~~~vyl~d~   82 (127)
                      .++..|+..|+ ...+|-+++++..|-
T Consensus         3 k~~~~l~klgl~~i~~i~eV~i~~~dg   29 (58)
T PF01849_consen    3 KLQKMLKKLGLKEIPGIEEVTIRKDDG   29 (58)
T ss_dssp             -----GHHCT-EEETTEEEEEEEETTT
T ss_pred             HHHHHHHHcCCcccCCcEEEEEEECCc
Confidence            46778888888 566899999999873


No 66 
>PF02256 Fe_hyd_SSU:  Iron hydrogenase small subunit;  InterPro: IPR003149  Many microorganisms, such as methanogenic, acetogenic, nitrogen-fixing, photosynthetic, or sulphate-reducing bacteria, metabolise hydrogen. Hydrogen activation is mediated by a family of enzymes, termed hydrogenases, which either provide these organisms with reducing power from hydrogen oxidation, or act as electron sinks. There are two hydrogenases families that differ functionally from each other: NiFe hydrogenases tend to be more involved in hydrogen oxidation, while Iron-only FeFe (Fe only) hydrogenases in hydrogen production.  Fe only hydrogenases (1.12.7.2 from EC) show a common core structure, which contains a moiety, deeply buried inside the protein, with an Fe-Fe dinuclear centre, nonproteic bridging, terminal CO and CN- ligands attached to each of the iron atoms, and a dithio moiety, which also bridges the two iron atoms and has been tentatively assigned as a di(thiomethyl)amine. This common core also harbours three [4Fe-4S] iron-sulphur clusters []. In FeFe hydrogenases, as in NiFe hydrogenases, the set of iron-sulphur clusters is dispersed regularly between the dinuclear Fe-Fe centre and the molecular surface. These clusters are distant by about 1.2 nm from each other but the [4Fe-4S] cluster closest to the dinuclear centre is covalently bound to one of the iron atoms though a thiolate bridging ligand. The moiety including the dinuclear centre, the thiolate bridging ligand, and the proximal [4Fe-4S] cluster is known as the H-cluster. A channel, lined with hydrophobic amino acid side chains, nearly connects the dinuclear centre and the molecular surface. Furthermore hydrogen-bonded water molecule sites have been identified at the interior and at the surface of the protein. The small subunit is comprised of alternating random coil and alpha helical structures that encompass the large subunit in a novel protein fold [].; PDB: 3LX4_A 1C4C_A 1FEH_A 3C8Y_A 1C4A_A 1HFE_S 1GX7_D 1E08_D.
Probab=27.81  E-value=52  Score=19.21  Aligned_cols=20  Identities=20%  Similarity=0.486  Sum_probs=14.8

Q ss_pred             CCCCCHHHHHHHHHhhcCCC
Q psy1836          80 NDINDFATVNNVYGQFFKPP   99 (127)
Q Consensus        80 ~d~~~~~~~~~v~~~~f~~~   99 (127)
                      ....+-+.+.++|.+|++++
T Consensus        24 r~s~eNp~v~~lY~~~lg~p   43 (60)
T PF02256_consen   24 RKSHENPEVQELYKEFLGGP   43 (60)
T ss_dssp             -SGGG-HHHHHHHHHTTSST
T ss_pred             CCCCcCHHHHHHHHHHhCCC
Confidence            34456789999999999965


No 67 
>PF08309 LVIVD:  LVIVD repeat;  InterPro: IPR013211 This repeat is found in bacterial and archaeal cell surface proteins, many of which are hypothetical. The secondary structure corresponding to this repeat is predicted to comprise 4 beta-strands, which may associate to form a beta-propeller. The repeat copy number varies from 2-14. This repeat is sometimes found with the PKD domain IPR000601 from INTERPRO.
Probab=27.37  E-value=92  Score=16.85  Aligned_cols=19  Identities=26%  Similarity=0.370  Sum_probs=12.1

Q ss_pred             CcceEEEE-CCEEEEc-cccc
Q psy1836          17 SAFQAVQH-DNTLYIS-GVLG   35 (127)
Q Consensus        17 ~ys~av~~-g~~v~iS-G~~~   35 (127)
                      ++.+.+.+ |+++|++ +..|
T Consensus         2 G~a~~v~v~g~yaYva~~~~G   22 (42)
T PF08309_consen    2 GDARDVAVSGNYAYVADGNNG   22 (42)
T ss_pred             ceEEEEEEECCEEEEEeCCCC
Confidence            34455544 9999998 4333


No 68 
>PF13964 Kelch_6:  Kelch motif
Probab=27.32  E-value=76  Score=17.03  Aligned_cols=15  Identities=27%  Similarity=0.468  Sum_probs=10.2

Q ss_pred             cceEEEECCEEEEcc
Q psy1836          18 AFQAVQHDNTLYISG   32 (127)
Q Consensus        18 ys~av~~g~~v~iSG   32 (127)
                      ...++..++.||+-|
T Consensus         4 ~~s~v~~~~~iyv~G   18 (50)
T PF13964_consen    4 GHSAVVVGGKIYVFG   18 (50)
T ss_pred             cCEEEEECCEEEEEC
Confidence            345666788888776


No 69 
>PLN00062 TATA-box-binding protein; Provisional
Probab=26.83  E-value=2.4e+02  Score=20.23  Aligned_cols=36  Identities=19%  Similarity=0.288  Sum_probs=26.7

Q ss_pred             EEEEccccccCCCCCcccccHHHHHHHHHHHHHHHHHHcCCC
Q psy1836          27 TLYISGVLGIDASSGKMAEGVENQAKQALTNMGHILKEAGGS   68 (127)
Q Consensus        27 ~v~iSG~~~~d~~~g~~~~d~~~Q~~~~~~ni~~~L~~~G~~   68 (127)
                      +||-||-+-+..      ..-+++++.+++++..+|+..|..
T Consensus        51 lIF~SGKiviTG------aks~e~a~~a~~~~~~~L~~lg~~   86 (179)
T PLN00062         51 LIFASGKMVCTG------AKSEHDSKLAARKYARIIQKLGFP   86 (179)
T ss_pred             EEECCCeEEEEe------cCCHHHHHHHHHHHHHHHHHcCCC
Confidence            677777654422      234579999999999999999974


No 70 
>cd04922 ACT_AKi-HSDH-ThrA_2 ACT domains of the bifunctional enzyme aspartokinase (AK) - homoserine dehydrogenase (HSDH). This CD includes the second  of two ACT domains of the bifunctional enzyme aspartokinase (AK) - homoserine dehydrogenase (HSDH). The ACT domains are positioned between the N-terminal catalytic domain of AK and the C-terminal HSDH domain found in bacteria (Escherichia coli (EC) ThrA) and higher plants (Zea mays AK-HSDH). AK and HSDH are the first and third enzymes in the biosynthetic pathway of the aspartate family of amino acids. AK catalyzes the phosphorylation of Asp to P-aspartyl phosphate. HSDH catalyzes the NADPH-dependent conversion of Asp 3-semialdehyde to homoserine. HSDH is the first committed reaction in the branch of the pathway that leads to Thr and Met. In E. coli, ThrA is subject to allosteric regulation by the end product L-threonine and the native enzyme is reported to be tetrameric. As with bacteria, plant AK and HSDH are feedback inhibited by pathwa
Probab=26.76  E-value=1.2e+02  Score=16.78  Aligned_cols=44  Identities=11%  Similarity=0.198  Sum_probs=26.7

Q ss_pred             HHHHHHHHHHHHcCCCccceE------EEEEEeCCCCCHHHHHHHHHhhc
Q psy1836          53 QALTNMGHILKEAGGSYEHVV------KTTILLNDINDFATVNNVYGQFF   96 (127)
Q Consensus        53 ~~~~ni~~~L~~~G~~~~~vv------~~~vyl~d~~~~~~~~~v~~~~f   96 (127)
                      ..+..+-..|.++|....-+.      ++.+.+.+-+.-.....+++++|
T Consensus        16 ~~~~~i~~~l~~~~I~v~~i~~~~s~~~is~~v~~~~~~~~~~~lh~~~~   65 (66)
T cd04922          16 GVAATFFSALAKANVNIRAIAQGSSERNISAVIDEDDATKALRAVHERFF   65 (66)
T ss_pred             cHHHHHHHHHHHCCCCEEEEEecCcccEEEEEEeHHHHHHHHHHHHHHHh
Confidence            456667778888887775553      34445544333345666677765


No 71 
>PF09658 Cas_Csx9:  CRISPR-associated protein (Cas_Csx9);  InterPro: IPR013488 Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR) are a family of DNA direct repeats separated by regularly sized non-repetitive spacer sequences that are found in most bacterial and archaeal genomes []. CRISPRs appear to provide acquired resistance against bacteriophages, possibly acting with an RNA interference-like mechanism to inhibit gene functions of invasive DNA elements [, ]. Differences in the number and type of spacers between CRISPR repeats correlate with phage sensitivity. It is thought that following phage infection, bacteria integrate new spacers derived from phage genomic sequences, and that the removal or addition of particular spacers modifies the phage-resistance phenotype of the cell. Therefore, the specificity of CRISPRs may be determined by spacer-phage sequence similarity. In addition, there are many protein families known as CRISPR-associated sequences (Cas), which are encoded in the vicinity of CRISPR loci []. CRISPR/cas gene regions can be quite large, with up to 20 different, tandem-arranged cas genes next to a CRISPR cluster or filling the region between two repeat clusters. Cas genes and CRISPRs are found on mobile genetic elements such as plasmids, and have undergone extensive horizontal transfer. Cas proteins are thought to be involved in the propagation and functioning of CRISPRs. Some Cas proteins show similarity to helicases and repair proteins, although the functions of most are unknown. Cas families can be divided into subtypes according to operon organisation and phylogeny.  This entry represents the Cxs9 family of Cas proteins found in archaea. These proteins are encoded in the midst of a cas gene operon []. 
Probab=26.29  E-value=3.2e+02  Score=21.67  Aligned_cols=47  Identities=21%  Similarity=0.278  Sum_probs=32.2

Q ss_pred             ccHHHHHHHHHHHHHHHHHHcCCCccceEEEEEEeCCCCCHHHHHHHHHhhcCCCCCc
Q psy1836          45 EGVENQAKQALTNMGHILKEAGGSYEHVVKTTILLNDINDFATVNNVYGQFFKPPYPA  102 (127)
Q Consensus        45 ~d~~~Q~~~~~~ni~~~L~~~G~~~~~vv~~~vyl~d~~~~~~~~~v~~~~f~~~~Pa  102 (127)
                      .++.+-.+.+++.++.-.+ .+.+     |     -..+|...++.++++||+-.+|-
T Consensus        15 ~~laeale~~~dslesR~~-i~ss-----K-----lsrndRnsfdkv~~~wF~i~~pe   61 (377)
T PF09658_consen   15 EELAEALENAMDSLESRKK-INSS-----K-----LSRNDRNSFDKVLNAWFGISAPE   61 (377)
T ss_pred             HHHHHHHHHHHHHHHHhhh-cCcc-----c-----ccccchhhHHHHHHHHhCCCCch
Confidence            5677777777777766665 3433     1     13467889999999999877663


No 72 
>PF14552 Tautomerase_2:  Tautomerase enzyme; PDB: 2AAG_C 2AAL_A 2AAJ_A 1MWW_C.
Probab=26.23  E-value=1.6e+02  Score=18.32  Aligned_cols=29  Identities=24%  Similarity=0.194  Sum_probs=19.4

Q ss_pred             ccHHHHHHHHHHHHHHHHHH-cCCCccceE
Q psy1836          45 EGVENQAKQALTNMGHILKE-AGGSYEHVV   73 (127)
Q Consensus        45 ~d~~~Q~~~~~~ni~~~L~~-~G~~~~~vv   73 (127)
                      +--.+|=++-++.|..-|++ .|.+.+||.
T Consensus        39 gRs~e~K~~ly~~l~~~L~~~~gi~p~Dv~   68 (82)
T PF14552_consen   39 GRSTEQKKALYRALAERLAEKLGIRPEDVM   68 (82)
T ss_dssp             ---HHHHHHHHHHHHHHHHHHH---GGGEE
T ss_pred             CCCHHHHHHHHHHHHHHHHHHcCCCHHHEE
Confidence            34567888888999999976 899999984


No 73 
>PTZ00186 heat shock 70 kDa precursor protein; Provisional
Probab=26.17  E-value=2.6e+02  Score=24.26  Aligned_cols=53  Identities=15%  Similarity=0.322  Sum_probs=39.0

Q ss_pred             ccHHHH----HHHHHHHHHHHHHHcCCCccceEEEEEEeCCCCCHHHHHHHHHhhcCC
Q psy1836          45 EGVENQ----AKQALTNMGHILKEAGGSYEHVVKTTILLNDINDFATVNNVYGQFFKP   98 (127)
Q Consensus        45 ~d~~~Q----~~~~~~ni~~~L~~~G~~~~~vv~~~vyl~d~~~~~~~~~v~~~~f~~   98 (127)
                      .++++-    .+.+++-++.+|+.+|.+.++|-. -+.+=.....|.+.+...++|+.
T Consensus       323 ~efe~l~~~l~~r~~~~v~~~L~~a~~~~~dId~-VvLVGGssriP~V~~~l~~~fg~  379 (657)
T PTZ00186        323 SKFEGITQRLIERSIAPCKQCMKDAGVELKEIND-VVLVGGMTRMPKVVEEVKKFFQK  379 (657)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHcCCChhhCCE-EEEECCcccChHHHHHHHHHhCC
Confidence            455555    445556677888999999998853 34444667899999999999985


No 74 
>PTZ00400 DnaK-type molecular chaperone; Provisional
Probab=25.92  E-value=3.2e+02  Score=23.67  Aligned_cols=53  Identities=15%  Similarity=0.300  Sum_probs=39.1

Q ss_pred             ccHHHHH----HHHHHHHHHHHHHcCCCccceEEEEEEeC-CCCCHHHHHHHHHhhcCCC
Q psy1836          45 EGVENQA----KQALTNMGHILKEAGGSYEHVVKTTILLN-DINDFATVNNVYGQFFKPP   99 (127)
Q Consensus        45 ~d~~~Q~----~~~~~ni~~~L~~~G~~~~~vv~~~vyl~-d~~~~~~~~~v~~~~f~~~   99 (127)
                      .++++-+    +.+++-++.+|+++|.+.++|-.  ++++ .....|.+.+...++|+..
T Consensus       337 ~efe~l~~~l~~~~~~~i~~~L~~a~~~~~~i~~--ViLvGGssriP~v~~~l~~~f~~~  394 (663)
T PTZ00400        337 AKLEELTHDLLKKTIEPCEKCIKDAGVKKDELND--VILVGGMTRMPKVSETVKKIFGKE  394 (663)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHcCCCHHHCcE--EEEECCccCChHHHHHHHHHhCCC
Confidence            5555555    44555677889999999998733  4455 5678899999999999853


No 75 
>COG0443 DnaK Molecular chaperone [Posttranslational modification, protein turnover, chaperones]
Probab=25.79  E-value=2.2e+02  Score=24.29  Aligned_cols=52  Identities=19%  Similarity=0.325  Sum_probs=41.8

Q ss_pred             ccHHHHHHHHHHHHHHHHHHcCCCccceEEEEEEeCCCCCHHHHHHHHHhhcC
Q psy1836          45 EGVENQAKQALTNMGHILKEAGGSYEHVVKTTILLNDINDFATVNNVYGQFFK   97 (127)
Q Consensus        45 ~d~~~Q~~~~~~ni~~~L~~~G~~~~~vv~~~vyl~d~~~~~~~~~v~~~~f~   97 (127)
                      .-+.+....+...++..|+.+|.+..+| ...+.+--...++.+.+...++|+
T Consensus       283 ~l~~dll~r~~~~~~~al~~a~l~~~~I-~~VilvGGstriP~V~~~v~~~f~  334 (579)
T COG0443         283 ELILDLLERTIEPVEQALKDAGLEKSDI-DLVILVGGSTRIPAVQELVKEFFG  334 (579)
T ss_pred             HHHHHHHHHHHHHHHHHHHHcCCChhhC-ceEEEccceeccHHHHHHHHHHhC
Confidence            3455666777788889999999999887 444556667789999999999999


No 76 
>PRK11678 putative chaperone; Provisional
Probab=25.58  E-value=3.1e+02  Score=22.60  Aligned_cols=47  Identities=13%  Similarity=0.085  Sum_probs=34.4

Q ss_pred             HHHHHHHHHHHHHHHHHHcCCCccceEEEEEEeC-CCCCHHHHHHHHHhhcCC
Q psy1836          47 VENQAKQALTNMGHILKEAGGSYEHVVKTTILLN-DINDFATVNNVYGQFFKP   98 (127)
Q Consensus        47 ~~~Q~~~~~~ni~~~L~~~G~~~~~vv~~~vyl~-d~~~~~~~~~v~~~~f~~   98 (127)
                      ++.-.+++++-++..|+++|.+.+     .|+++ .....+.+.+...++|+.
T Consensus       378 i~~~l~ri~~~i~~~L~~a~~~~d-----~VvLvGGsSriP~V~~~l~~~fg~  425 (450)
T PRK11678        378 ISQPLARILELVQLALDQAQVKPD-----VIYLTGGSARSPLIRAALAQQLPG  425 (450)
T ss_pred             HHHHHHHHHHHHHHHHHHcCCCCC-----EEEEcCcccchHHHHHHHHHHCCC
Confidence            334455566666788888998754     35676 456789999999999976


No 77 
>PRK10328 DNA binding protein, nucleoid-associated; Provisional
Probab=25.19  E-value=70  Score=22.02  Aligned_cols=26  Identities=12%  Similarity=0.224  Sum_probs=21.4

Q ss_pred             HHHHHHHHHHHHHHHHcCCCccceEE
Q psy1836          49 NQAKQALTNMGHILKEAGGSYEHVVK   74 (127)
Q Consensus        49 ~Q~~~~~~ni~~~L~~~G~~~~~vv~   74 (127)
                      .+-..++..+...+++.|.++++++.
T Consensus        52 ~er~~~l~~i~~~~~~~Git~eeL~~   77 (134)
T PRK10328         52 AERQEKINTWLELMKADGINPEELLG   77 (134)
T ss_pred             HHHHHHHHHHHHHHHHhCCCHHHHhh
Confidence            34456889999999999999999953


No 78 
>COG3655 Predicted transcriptional regulator [Transcription]
Probab=25.17  E-value=85  Score=19.39  Aligned_cols=21  Identities=14%  Similarity=0.200  Sum_probs=18.5

Q ss_pred             HHHHHHHHHHHcCCCccceEE
Q psy1836          54 ALTNMGHILKEAGGSYEHVVK   74 (127)
Q Consensus        54 ~~~ni~~~L~~~G~~~~~vv~   74 (127)
                      .|++++++.++..+.+.||+.
T Consensus        44 ~~~tL~~iC~~LeCqpgDile   64 (73)
T COG3655          44 RLSTLEKICKALECQPGDILE   64 (73)
T ss_pred             eHHHHHHHHHHcCCChhheeE
Confidence            467899999999999999975


No 79 
>PRK00290 dnaK molecular chaperone DnaK; Provisional
Probab=25.08  E-value=2.9e+02  Score=23.63  Aligned_cols=49  Identities=20%  Similarity=0.325  Sum_probs=35.5

Q ss_pred             HHHHHHHHHHHHHHHcCCCccceEEEEEEeCCCCCHHHHHHHHHhhcCCC
Q psy1836          50 QAKQALTNMGHILKEAGGSYEHVVKTTILLNDINDFATVNNVYGQFFKPP   99 (127)
Q Consensus        50 Q~~~~~~ni~~~L~~~G~~~~~vv~~~vyl~d~~~~~~~~~v~~~~f~~~   99 (127)
                      -...+++-++.+|+.+|.+.++|- --+.+=.....+.+.+...++|+..
T Consensus       305 l~~~~~~~i~~~l~~a~~~~~~id-~ViLvGGssriP~v~~~l~~~fg~~  353 (627)
T PRK00290        305 LVERTIEPCKQALKDAGLSVSDID-EVILVGGSTRMPAVQELVKEFFGKE  353 (627)
T ss_pred             HHHHHHHHHHHHHHHcCCChhhCc-EEEEECCcCCChHHHHHHHHHhCCC
Confidence            344555666788899999988873 3334445678899999999999753


No 80 
>PF06526 DUF1107:  Protein of unknown function (DUF1107);  InterPro: IPR009491 This family consists of several short, hypothetical bacterial proteins of unknown function.; PDB: 2JRO_A.
Probab=24.47  E-value=88  Score=18.81  Aligned_cols=39  Identities=13%  Similarity=0.420  Sum_probs=27.3

Q ss_pred             CCEEEEccccccCCCCCccc--ccHHHHHHHHHHHHHHHHH
Q psy1836          25 DNTLYISGVLGIDASSGKMA--EGVENQAKQALTNMGHILK   63 (127)
Q Consensus        25 g~~v~iSG~~~~d~~~g~~~--~d~~~Q~~~~~~ni~~~L~   63 (127)
                      .+.+||.|....+-+.|.+.  ..-..|...++..|.....
T Consensus        21 ~Gr~~I~g~G~feFd~Gkillp~~~~~~~~~~~~EiN~~I~   61 (64)
T PF06526_consen   21 RGRIYIKGIGAFEFDNGKILLPKKADKRHLSVMSEINQEIR   61 (64)
T ss_dssp             SEEEEETTTEEEEEETTEE---SS--HHHHHHHHHHHHHHH
T ss_pred             cceEEEEecccEEEcCCEEeCCccccHHHHHHHHHHHHHHH
Confidence            67899999988877678876  5555677778877766554


No 81 
>PF01344 Kelch_1:  Kelch motif;  InterPro: IPR006652 Kelch is a 50-residue motif, named after the Drosophila mutant in which it was first identified []. This sequence motif represents one beta-sheet blade, and several of these repeats can associate to form a beta-propeller. For instance, the motif appears 6 times in Drosophila egg-chamber regulatory protein, creating a 6-bladed beta-propeller. The motif is also found in mouse protein MIPP [] and in a number of poxviruses. In addition, kelch repeats have been recognised in alpha- and beta-scruin [, ], and in galactose oxidase from the fungus Dactylium dendroides [, ]. The structure of galactose oxidase reveals that the repeated sequence corresponds to a 4-stranded anti-parallel beta-sheet motif that forms the repeat unit in a super-barrel structural fold []. The known functions of kelch-containing proteins are diverse: scruin is an actin cross-linking protein; galactose oxidase catalyses the oxidation of the hydroxyl group at the C6 position in D-galactose; neuraminidase hydrolyses sialic acid residues from glycoproteins; and kelch may have a cytoskeletal function, as it is localised to the actin-rich ring canals that connect the 15 nurse cells to the developing oocyte in Drosophila []. Nevertheless, based on the location of the kelch pattern in the catalytic unit in galactose oxidase, functionally important residues have been predicted in glyoxal oxidase []. This entry represents a type of kelch sequence motif that comprises one beta-sheet blade.; GO: 0005515 protein binding; PDB: 2XN4_A 2WOZ_A 3II7_A 4ASC_A 1U6D_X 1ZGK_A 2FLU_X 2VPJ_A 2DYH_A 1X2R_A ....
Probab=24.40  E-value=94  Score=16.16  Aligned_cols=14  Identities=36%  Similarity=0.584  Sum_probs=10.0

Q ss_pred             ceEEEECCEEEEcc
Q psy1836          19 FQAVQHDNTLYISG   32 (127)
Q Consensus        19 s~av~~g~~v~iSG   32 (127)
                      ..++..++.||+-|
T Consensus         5 ~~~~~~~~~iyv~G   18 (47)
T PF01344_consen    5 HAAVVVGNKIYVIG   18 (47)
T ss_dssp             EEEEEETTEEEEEE
T ss_pred             CEEEEECCEEEEEe
Confidence            44566688888877


No 82 
>PF08066 PMC2NT:  PMC2NT (NUC016) domain;  InterPro: IPR012588  Exosomes are nano-compartments that function in the degradation or processing of RNA (including mRNA, rRNA, snRNA and snoRNA) [, ]. Exosomes occur in both archaea and eukaryotes, and have a similar overall structure to each other and to bacterial/organelle PNPases (polynucleotide phosphorylases; 2.7.7.8 from EC) [], consisting of a barrel structure composed of a hexameric ring of PH domains that act as a degradation chamber, and an S1-domain/KH-domain containing cap that binds the RNA substrate (and sometimes accessory proteins) in order to regulate and restrict entry into the degradation chamber []. There are two types of exosomes in eukaryotes, cytoplasmic exosomes that are responsible for 3'-5' exoribonuclease degradation of mRNAs, and nuclear exosomes that degrade pre-mRNAs (such as nonsense transcripts) and degrade rRNAs, snRNAs and snoRNAs. Unstructured RNA substrates feed in through the pore made by the S1 domains, are degraded by the PH domain ring, and exit as nucleotides via the PH pore at the opposite end of the barrel [, ].  There are several accessory proteins that help degrade, unwind or polyadenylate RNA substrate before they enter the exosome. This entry represents the N-terminal domain of Rrp6 (exosome component 10 in humans), a nuclear exosome accessory factor that interacts with the bottom of the hexameric PH-ring opposite the cap. Rrp6 functions as a hydrolytic exonuclease, and is homologous to RNase-D in Escherichia coli. More information about these proteins can be found at Protein of the Month: RNA Exosomes [].; GO: 0006396 RNA processing, 0000176 nuclear exosome (RNase complex)
Probab=24.29  E-value=67  Score=20.25  Aligned_cols=27  Identities=15%  Similarity=0.228  Sum_probs=20.1

Q ss_pred             ccHHHHHHHHHHHHHHHHHHcCCCccc
Q psy1836          45 EGVENQAKQALTNMGHILKEAGGSYEH   71 (127)
Q Consensus        45 ~d~~~Q~~~~~~ni~~~L~~~G~~~~~   71 (127)
                      .+++++.+.+.+-++.+|+.++..++.
T Consensus        61 ~d~d~~~~~vvd~~D~LlEk~D~~LDe   87 (91)
T PF08066_consen   61 DDVDERWDSVVDVNDSLLEKADISLDE   87 (91)
T ss_pred             ccHHHHHHHHHHHHHHHHHHHhHHHHh
Confidence            467778888888888888877766654


No 83 
>PF12205 GIT1_C:  G protein-coupled receptor kinase-interacting protein 1 C term;  InterPro: IPR022018  This domain family is found in eukaryotes, and is approximately 120 amino acids in length. The family is found in association with PF01412 from PFAM, PF00023 from PFAM, PF08518 from PFAM. GIT1 plays an important role in cell adhesion, motility, cytoskeletal remodeling and membrane trafficking. To perform this function, it localises p21-activated kinase (PAK) and PAK-interactive exchange factor to focal adhesions. Its activation is regulated by interaction between its paxillin-binding C-terminal and the LD motifs of paxillin. The C-terminal folds into a four helix bundle. ; PDB: 2JX0_A.
Probab=24.26  E-value=1.1e+02  Score=20.78  Aligned_cols=23  Identities=22%  Similarity=0.319  Sum_probs=21.3

Q ss_pred             ccHHHHHHHHHHHHHHHHHHcCC
Q psy1836          45 EGVENQAKQALTNMGHILKEAGG   67 (127)
Q Consensus        45 ~d~~~Q~~~~~~ni~~~L~~~G~   67 (127)
                      +++...++++-++|+.+|+++-.
T Consensus        12 e~Vi~~TE~vTk~IqeLl~aAQ~   34 (123)
T PF12205_consen   12 EDVIRRTEQVTKRIQELLRAAQE   34 (123)
T ss_dssp             HHHHHHHHHHHHHHHHHHHHHHT
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHh
Confidence            89999999999999999999864


No 84 
>PRK13410 molecular chaperone DnaK; Provisional
Probab=23.61  E-value=2.4e+02  Score=24.48  Aligned_cols=53  Identities=19%  Similarity=0.267  Sum_probs=38.7

Q ss_pred             ccHHHHHHHHHHHH----HHHHHHcCCCccceEEEEEEeCCCCCHHHHHHHHHhhcCC
Q psy1836          45 EGVENQAKQALTNM----GHILKEAGGSYEHVVKTTILLNDINDFATVNNVYGQFFKP   98 (127)
Q Consensus        45 ~d~~~Q~~~~~~ni----~~~L~~~G~~~~~vv~~~vyl~d~~~~~~~~~v~~~~f~~   98 (127)
                      .++++-++..++.+    +.+|+.+|.+.++|-.+ +.+=.....+.+.+...++|+.
T Consensus       298 ~~FE~l~~~l~~r~~~~i~~~L~~ag~~~~dId~V-vLVGGssRiP~V~~~l~~~fg~  354 (668)
T PRK13410        298 KQFESLCGDLLDRLLRPVKRALKDAGLSPEDIDEV-VLVGGSTRMPMVQQLVRTLIPR  354 (668)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHcCCChhhCcEE-EEECCccccHHHHHHHHHHcCC
Confidence            56666666655554    57788889998888533 3344567889999999999985


No 85 
>TIGR02619 putative CRISPR-associated protein, APE2256 family. This model represents a conserved domain of about 150 amino acids found in at least five archaeal species and three bacterial species, exclusively in species with CRISPR (Clustered Regularly Interspaced Short Palidromic Repeats). In six of eight species, the member of this family is in the vicinity of a CRISPR/Cas locus.
Probab=23.37  E-value=2.2e+02  Score=19.82  Aligned_cols=28  Identities=11%  Similarity=0.107  Sum_probs=20.9

Q ss_pred             eEEEEEEeCCCCCHHHHHHHHHhhcCCC
Q psy1836          72 VVKTTILLNDINDFATVNNVYGQFFKPP   99 (127)
Q Consensus        72 vv~~~vyl~d~~~~~~~~~v~~~~f~~~   99 (127)
                      -.++-.|.+|..+-....++.++|+...
T Consensus        34 ~d~~~Ll~SDT~~G~~~a~ilk~yl~~~   61 (149)
T TIGR02619        34 DDKAILYHSDTAQGRFCASILKRFLERE   61 (149)
T ss_pred             CcEEEEEEcCCHHHHHHHHHHHHHHHHh
Confidence            3477778888888777788888887655


No 86 
>PF07490 Tir_receptor_N:  Translocated intimin receptor (Tir) N-terminus;  InterPro: IPR022633 Secretion of virulence factors in Gram-negative bacteria involves transportation of the protein across two membranes to reach the cell exterior. There have been four secretion systems described in animal enteropathogens, such as Salmonella and Yersinia, with further sequence similarities in plant pathogens like Ralstonia and Erwinia []. The type III secretion system is of great interest, as it is used to transport virulence factors from the pathogen directly into the host cell and is only triggered when the bacterium comes into close contact with the host. The protein subunits of the system are very similar to those of bacterial flagellar biosynthesis. However, while the latter forms a ring structure to allow secretion of flagellin and is an integral part of the flagellum itself [], type III subunits in the outer membrane translocate secreted proteins through a channel-like structure. Exotoxins secreted by the type III system do not possess a secretion signal, and are considered unique for this reason []. Enteropathogenic and entero- haemorrhagic Escherichia coli secrete the bacterial adhesion mediation molecule intimin [], which targets the translocated intimin receptor, Tir. Tir is secreted by the bacteria and is embedded in the target cell's plasma membrane []. This facilitates bacterial cell attachment to the host. 
Probab=23.00  E-value=88  Score=23.36  Aligned_cols=35  Identities=11%  Similarity=0.233  Sum_probs=19.7

Q ss_pred             CCEEEEccccccCCCCCcccccHHHHHHHHHHHHH
Q psy1836          25 DNTLYISGVLGIDASSGKMAEGVENQAKQALTNMG   59 (127)
Q Consensus        25 g~~v~iSG~~~~d~~~g~~~~d~~~Q~~~~~~ni~   59 (127)
                      |++||+.|-.|..-.--++..||.+.-+.+++.++
T Consensus       151 ~kFVFTGgRGGaGHaMvTvASDI~eAR~rIl~kLe  185 (266)
T PF07490_consen  151 GKFVFTGGRGGAGHAMVTVASDIAEARQRILAKLE  185 (266)
T ss_pred             ceEEEecCCCCCCcceeeehhHHHHHHHHHHHhcC
Confidence            67999998766533111122677655555555443


No 87 
>PF01250 Ribosomal_S6:  Ribosomal protein S6;  InterPro: IPR000529 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites [, ]. About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits.  Many ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome [, ]. Ribosomal protein S6 is one of the proteins from the small ribosomal subunit. In Escherichia coli, S6 is known to bind together with S18 to 16S ribosomal RNA. It belongs to a family of ribosomal proteins which, on the basis of sequence similarities, groups bacterial, red algal chloroplast and cyanelle S6 ribosomal proteins.; GO: 0003735 structural constituent of ribosome, 0019843 rRNA binding, 0006412 translation, 0005840 ribosome; PDB: 3BBN_F 3R3T_B 3F1E_F 2QNH_g 2OW8_g 3PYQ_F 3PYS_F 3PYU_F 3MR8_F 3PYN_F ....
Probab=22.77  E-value=1.3e+02  Score=18.60  Aligned_cols=25  Identities=24%  Similarity=0.489  Sum_probs=21.4

Q ss_pred             HHHHHHHHHHHHHHHHHcCCCccce
Q psy1836          48 ENQAKQALTNMGHILKEAGGSYEHV   72 (127)
Q Consensus        48 ~~Q~~~~~~ni~~~L~~~G~~~~~v   72 (127)
                      +++..+.++++...+.+.|+...++
T Consensus        15 ~~~~~~~~~~~~~~i~~~gg~v~~~   39 (92)
T PF01250_consen   15 EEEIKKLIERVKKIIEKNGGVVRSV   39 (92)
T ss_dssp             HHHHHHHHHHHHHHHHHTTEEEEEE
T ss_pred             HHHHHHHHHHHHHHHHHCCCEEEEE
Confidence            4699999999999999999866554


No 88 
>PHA02754 hypothetical protein; Provisional
Probab=22.65  E-value=1.8e+02  Score=17.25  Aligned_cols=26  Identities=31%  Similarity=0.482  Sum_probs=19.9

Q ss_pred             HHHHHHHHHHHHHHHHHHcCCCccce
Q psy1836          47 VENQAKQALTNMGHILKEAGGSYEHV   72 (127)
Q Consensus        47 ~~~Q~~~~~~ni~~~L~~~G~~~~~v   72 (127)
                      ++.....+++.++.+|.++|.=.+.|
T Consensus        12 ~eK~Fke~MRelkD~LSe~GiYi~RI   37 (67)
T PHA02754         12 MEKDFKEAMRELKDILSEAGIYIDRI   37 (67)
T ss_pred             HHhHHHHHHHHHHHHHhhCceEEEEE
Confidence            45567778889999999999755544


No 89 
>COG3797 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=22.59  E-value=1.6e+02  Score=21.34  Aligned_cols=70  Identities=23%  Similarity=0.240  Sum_probs=39.1

Q ss_pred             ECCEEEEccccccCCCCCcccccHHHHHHHHHHHHHHHHHHcCCCccceEEEEEEeCCCCCHHHHHHHHHhhcCC-CCCc
Q psy1836          24 HDNTLYISGVLGIDASSGKMAEGVENQAKQALTNMGHILKEAGGSYEHVVKTTILLNDINDFATVNNVYGQFFKP-PYPA  102 (127)
Q Consensus        24 ~g~~v~iSG~~~~d~~~g~~~~d~~~Q~~~~~~ni~~~L~~~G~~~~~vv~~~vyl~d~~~~~~~~~v~~~~f~~-~~Pa  102 (127)
                      .||++|.|...   .      ..++++++.+|+      +..|.      ++.+.+.+.++|..+-+-.--+... ..|+
T Consensus        42 SGNlvf~s~~~---~------~el~~klE~afe------~~fg~------~~dilVrs~~~~~~i~a~nPf~~~~~~~~~  100 (178)
T COG3797          42 SGNLVFESEAG---A------AELEAKLEAAFE------KRFGR------HVDILVRSAEDWETIAAANPFPEEAAGEPA  100 (178)
T ss_pred             cCCEEEEcCCC---h------HHHHHHHHHHHH------HHcCC------CccEEEeeHHHHHHHHhcCCCchhhccCcc
Confidence            38888887643   1      356666666654      33454      3456666777776554432211111 1477


Q ss_pred             cEEEEcCCCCCC
Q psy1836         103 RSTFQVGKLPLI  114 (127)
Q Consensus       103 ~t~v~V~~L~~~  114 (127)
                      +.++-+-.-|.+
T Consensus       101 ~~~v~f~~ep~d  112 (178)
T COG3797         101 RVCVRFYREPLD  112 (178)
T ss_pred             eEEEEEEcCCCc
Confidence            777776555543


No 90 
>KOG0808|consensus
Probab=22.49  E-value=1.5e+02  Score=23.29  Aligned_cols=32  Identities=13%  Similarity=0.141  Sum_probs=27.5

Q ss_pred             ccHHHHHHHHHHHHHHHHHHcCCCccceEEEE
Q psy1836          45 EGVENQAKQALTNMGHILKEAGGSYEHVVKTT   76 (127)
Q Consensus        45 ~d~~~Q~~~~~~ni~~~L~~~G~~~~~vv~~~   76 (127)
                      ..+.+|++.+++.+++++++++..--+|+-+.
T Consensus        89 apv~eq~~aih~r~kaiieaaa~agvniiclq  120 (387)
T KOG0808|consen   89 APVSEQTRAIHDRLKAIIEAAAVAGVNIICLQ  120 (387)
T ss_pred             CcHHHHHHHHHHHHHHHHHHHHhcCccEEEee
Confidence            57899999999999999999998777776554


No 91 
>PF02120 Flg_hook:  Flagellar hook-length control protein FliK;  InterPro: IPR021136 This entry represents the C-terminal domain of the flagellar hook-length control protein FliK. This entry also includes YscP of the Yersinia type III secretion system, and equivalent proteins in other pathogenic bacterial type III secretion systems. During flagellar morphogenesis in Salmonella typhimurium and Escherichia coli, flagellar hook-length control protein (FliK) controls the length of the hook by directly measuring the hook length [, ]. It is considered unlikely that FliK functions as a molecular ruler for determining hook length, but that it is more likely to be employing a novel mechanism. The deduced amino acid sequences of FliK proteins from S. typhimurium and E. coli have molecular masses of 41,748 and 39,246 Da, respectively, and are fairly hydrophilic []. Sequence comparison reveals around 50% identity, with greatest conservation in the C-terminal region, with 71% identity in the last 154 amino acids - mutagenesis of this conserved region completely abolishes motility. The central and C-terminal regions are rich in proline and glutamine respectively; it is thought that they may constitute distinct domains [].; PDB: 2RRL_A.
Probab=22.20  E-value=82  Score=18.95  Aligned_cols=27  Identities=15%  Similarity=0.171  Sum_probs=23.3

Q ss_pred             ccHHHHHHHHHHHHHHHHHHcCCCccc
Q psy1836          45 EGVENQAKQALTNMGHILKEAGGSYEH   71 (127)
Q Consensus        45 ~d~~~Q~~~~~~ni~~~L~~~G~~~~~   71 (127)
                      .+..+..++.+..|...|++.|....+
T Consensus        47 ~~~~~~L~~~~~~L~~~L~~~G~~~~~   73 (85)
T PF02120_consen   47 PETKELLRQNLPELKERLQAQGLEVVN   73 (85)
T ss_dssp             SHHHHHHHHTHHHHHHHHHTTT-EEEE
T ss_pred             HHHHHHHHHHHHHHHHHHHHCCCCeEE
Confidence            579999999999999999999997665


No 92 
>TIGR00329 gcp_kae1 metallohydrolase, glycoprotease/Kae1 family. This subfamily includes the well-studied secreted O-sialoglycoprotein endopeptidase (glycoprotease, EC 3.4.24.57) of Pasteurella haemolytica, a pathogen. A member from Riemerella anatipestifer, associated with cohemolysin activity, likewise is exported without benefit of a classical signal peptide and shows glycoprotease activity on the test substrate glycophorin. However, archaeal members of this subfamily show unrelated activities as demonstrated in Pyrococcus abyssi: DNA binding, iron binding, apurinic endonuclease activity, genomic association with a kinase domain, and no glycoprotease activity. This family thus pulls together a set of proteins as a homology group that appears to be near-universal in life, yet heterogeneous in assayed function between bacteria and archaea.
Probab=22.05  E-value=1.5e+02  Score=22.91  Aligned_cols=29  Identities=14%  Similarity=0.105  Sum_probs=26.0

Q ss_pred             ccHHHHHHHHHHHHHHHHHHcCCCccceE
Q psy1836          45 EGVENQAKQALTNMGHILKEAGGSYEHVV   73 (127)
Q Consensus        45 ~d~~~Q~~~~~~ni~~~L~~~G~~~~~vv   73 (127)
                      .....+.++...-++.+|+++|.+++||-
T Consensus        43 ~~~~~H~~~l~~~i~~~l~~~~~~~~did   71 (305)
T TIGR00329        43 EASRHHAENIPPLLERALIESNVDKSEID   71 (305)
T ss_pred             hhHHHHHHHHHHHHHHHHHHcCCCHHHCC
Confidence            56788999999999999999999999873


No 93 
>PF00438 S-AdoMet_synt_N:  S-adenosylmethionine synthetase, N-terminal domain;  InterPro: IPR022628  The three domains of S-adenosylmethionine synthetase have the same alpha+beta fold. This entry represents the N-terminal domain of S-adenosylmethionine synthetase and is found in association with PF02772 from PFAM and PF02773 from PFAM. S-adenosylmethionine synthetase (MAT, 2.5.1.6 from EC) is the enzyme that catalyzes the formation of S-adenosylmethionine (AdoMet) from methionine and ATP []. AdoMet is an important methyl donor for transmethylation and is also the propylamino donor in polyamine biosynthesis. In bacteria there is a single isoform of AdoMet synthetase (gene metK), there are two in budding yeast (genes SAM1 and SAM2) and in mammals while in plants there is generally a multigene family. The sequence of AdoMet synthetase is highly conserved throughout isozymes and species. The active sites of both the Escherichia coli and rat liver MAT reside between two subunits, with contributions from side chains of residues from both subunits, resulting in a dimer as the minimal catalytic entity. The side chains that contribute to the ligand binding sites are conserved between the two proteins. In the structures of complexes with the E. coli enzyme, the phosphate groups have the same positions in the (PPi plus Pi) complex and the (ADP plus Pi) complex, and are located at the bottom of a deep cavity with the adenosyl group nearer the entrance []; GO: 0004478 methionine adenosyltransferase activity; PDB: 3RV2_A 3TDE_B 3S82_B 3IML_B 2P02_A 2OBV_A 1QM4_B 1O9T_B 1O93_A 1O92_B ....
Probab=22.03  E-value=1.5e+02  Score=19.30  Aligned_cols=51  Identities=16%  Similarity=0.219  Sum_probs=30.7

Q ss_pred             CCcceEEE----ECCEEEEccccccCCCCCcccccHHHHHHHHHHHHHHHHHHcCCCccc
Q psy1836          16 ASAFQAVQ----HDNTLYISGVLGIDASSGKMAEGVENQAKQALTNMGHILKEAGGSYEH   71 (127)
Q Consensus        16 ~~ys~av~----~g~~v~iSG~~~~d~~~g~~~~d~~~Q~~~~~~ni~~~L~~~G~~~~~   71 (127)
                      .|+++...    .+++++++|-+....     ..|+++=+|.++++|.-.=.+.|.+++.
T Consensus        34 dp~arVA~E~~~~~~~v~i~GEi~~~~-----~vd~~~ivR~~i~~IGY~~~~~gfd~~t   88 (100)
T PF00438_consen   34 DPNARVACETLVSTGMVIIAGEITSRA-----YVDIEKIVREVIKDIGYDDSEYGFDYDT   88 (100)
T ss_dssp             -TT-EEEEEEEEETTEEEEEEEEESSH-----HHTHHHHHHHHHHHHT-EEGGGTEETTT
T ss_pred             CCCCeEEEEEEeeccEEEEEEEeccch-----hhhHHHHHHHHHHHhCCCCccCCCCCCc
Confidence            35666433    289999999886543     1678777777777665444444444443


No 94 
>PRK05952 3-oxoacyl-(acyl carrier protein) synthase II; Reviewed
Probab=21.87  E-value=2.2e+02  Score=22.73  Aligned_cols=46  Identities=15%  Similarity=0.137  Sum_probs=30.9

Q ss_pred             HHHHHHHHHHHHcCCCccceEEEEEEeCCCC--CHHHHHHHHHhhcCCC
Q psy1836          53 QALTNMGHILKEAGGSYEHVVKTTILLNDIN--DFATVNNVYGQFFKPP   99 (127)
Q Consensus        53 ~~~~ni~~~L~~~G~~~~~vv~~~vyl~d~~--~~~~~~~v~~~~f~~~   99 (127)
                      ...+.++.+|+.+|.+.+||--+..+=+...  |-.+.. ..++.|+..
T Consensus       255 ~~~~ai~~Al~~agi~p~dId~i~~hgtgt~~~D~~E~~-al~~~fg~~  302 (381)
T PRK05952        255 SAIAAIQQCLARSGLTPEDIDYIHAHGTATRLNDQREAN-LIQALFPHR  302 (381)
T ss_pred             HHHHHHHHHHHHhCCCHHHeeEEEccCCCCCCCcHHHHH-HHHHHcCCC
Confidence            3456778899999999999988887766543  333333 335566643


No 95 
>PRK14646 hypothetical protein; Provisional
Probab=21.62  E-value=2.9e+02  Score=19.27  Aligned_cols=58  Identities=9%  Similarity=0.078  Sum_probs=40.7

Q ss_pred             HHHHHHHHHHHHHcCCCccceE--------EEEEEeCC-------CCCHHHHHHHHHhhcCC--CCCccEEEEcC
Q psy1836          52 KQALTNMGHILKEAGGSYEHVV--------KTTILLND-------INDFATVNNVYGQFFKP--PYPARSTFQVG  109 (127)
Q Consensus        52 ~~~~~ni~~~L~~~G~~~~~vv--------~~~vyl~d-------~~~~~~~~~v~~~~f~~--~~Pa~t~v~V~  109 (127)
                      .++.+-++.++++.|..+-||-        .+++|+..       ++|...+++...+++..  +.|..=.+.|+
T Consensus         7 ~~i~~li~p~~~~~G~eLvdve~~~~~~~~~LrV~IDk~~g~gVtldDC~~vSr~is~~LD~~D~i~~~Y~LEVS   81 (155)
T PRK14646          7 SKLEILLEKVANEFDLKICSLNIQTNQNPIVIKIIIKKTNGDDISLDDCALFNTPASEEIENSNLLNCSYVLEIS   81 (155)
T ss_pred             HHHHHHHHHHHHHcCCEEEEEEEEeCCCCeEEEEEEECCCCCCccHHHHHHHHHHHHHHhCcCCCCCCCeEEEEc
Confidence            3555677788889998777763        26788742       45678888888898873  35556666666


No 96 
>PF00814 Peptidase_M22:  Glycoprotease family;  InterPro: IPR000905 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M22 (clan MK). The type example being O-sialoglycoprotein endopeptidase (3.4.24.57 from EC) from Pasteurella haemolytica (Mannheimia haemolytica). O-Sialoglycoprotein endopeptidase is secreted by the bacterium P. haemolytica, and digests only proteins that are heavily sialylated, in particular those with sialylated serine and threonine residues []. Substrate proteins include glycophorin A and leukocyte surface antigens CD34, CD43, CD44 and CD45 [, ]. Removal of glycosylation, by treatment with neuraminidase, completely negates susceptibility to O-sialoglycoprotein endopeptidase digestion [, ]. Sequence similarity searches have revealed other members of the M22 family, from yeast, Mycobacterium, Haemophilus influenzae and the cyanobacterium Synechocystis []. The zinc-binding and catalytic residues of this family have not been determined, although the motif HMEGH may be a zinc-binding region [].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2A6A_A 2GEL_G 2GEM_B 1OKJ_B 3ENO_A 3EN9_B 2VWB_B 3ENH_B 2IVO_D 2IVP_A ....
Probab=21.49  E-value=2.1e+02  Score=21.67  Aligned_cols=41  Identities=17%  Similarity=0.218  Sum_probs=29.3

Q ss_pred             ccHHHHHHHHHHHHHHHHHHcCCCccceEEEEEEeCCCCCHH
Q psy1836          45 EGVENQAKQALTNMGHILKEAGGSYEHVVKTTILLNDINDFA   86 (127)
Q Consensus        45 ~d~~~Q~~~~~~ni~~~L~~~G~~~~~vv~~~vyl~d~~~~~   86 (127)
                      .-.+...+..+..++.+|+++|.+++||-.+-+ -.....|.
T Consensus        24 ~a~r~H~~~L~~~i~~~l~~~~~~~~did~iav-t~GPGsft   64 (268)
T PF00814_consen   24 IASRQHSENLPPLIEELLKEAGISLSDIDAIAV-TRGPGSFT   64 (268)
T ss_dssp             CHHHHHHHHHHHHHHHHHHHHTS-GGGESEEEE-EEESS-HH
T ss_pred             HHHHHHHHHHHHHHHHHHHHcCCCHHHCCEEEE-ecCCCccc
Confidence            456778899999999999999999998855433 33444553


No 97 
>PF02801 Ketoacyl-synt_C:  Beta-ketoacyl synthase, C-terminal domain;  InterPro: IPR014031 Beta-ketoacyl-ACP synthase 2.3.1.41 from EC (KAS) [] is the enzyme that catalyzes the condensation of malonyl-ACP with the growing fatty acid chain. It is found as a component of a number of enzymatic systems, including fatty acid synthetase (FAS), which catalyzes the formation of long-chain fatty acids from acetyl-CoA, malonyl-CoA and NADPH; the multi-functional 6-methysalicylic acid synthase (MSAS) from Penicillium patulum [], which is involved in the biosynthesis of a polyketide antibiotic; polyketide antibiotic synthase enzyme systems; Emericella nidulans multifunctional protein Wa, which is involved in the biosynthesis of conidial green pigment; Rhizobium nodulation protein nodE, which probably acts as a beta-ketoacyl synthase in the synthesis of the nodulation Nod factor fatty acyl chain; and yeast mitochondrial protein CEM1. The condensation reaction is a two step process, first the acyl component of an activated acyl primer is transferred to a cysteine residue of the enzyme and is then condensed with an activated malonyl donor with the concomitant release of carbon dioxide. This entry represents the C-terminal domain of beta-ketoacyl-ACP synthases. The active site is contained in a cleft betweeen N- and C-terminal domains, with residues from both domains contributing to substrate binding and catalysis [].; PDB: 2UV8_B 3HMJ_A 2VKZ_C 4EWG_A 1TQY_H 1E5M_A 1J3N_B 2VZ8_A 2VZ9_B 3O04_A ....
Probab=21.21  E-value=2.1e+02  Score=18.51  Aligned_cols=35  Identities=17%  Similarity=0.041  Sum_probs=22.6

Q ss_pred             ccHHHHHHHHHHHHHHHHHHcCCCccceEEEEEEeCCCC
Q psy1836          45 EGVENQAKQALTNMGHILKEAGGSYEHVVKTTILLNDIN   83 (127)
Q Consensus        45 ~d~~~Q~~~~~~ni~~~L~~~G~~~~~vv~~~vyl~d~~   83 (127)
                      .+...|.+    .|+..|+.+|.+.+||--+..+=+...
T Consensus        22 p~~~~~~~----~i~~al~~agi~~~~I~~i~~hg~Gt~   56 (119)
T PF02801_consen   22 PNGAALAR----AIRRALADAGISPEDIDYIEAHGTGTP   56 (119)
T ss_dssp             TTHHHHHH----HHHHHHHHHTS-GGGEEEEE----SSH
T ss_pred             cCHHHHHH----HHHHHHhhhccccccceeeeeeccccc
Confidence            45555543    389999999999999977777766554


No 98 
>TIGR03725 bact_YeaZ universal bacterial protein YeaZ. This family describes a protein family, YeaZ, that appears to be universal in bacteria, but whose function is unknown. This family is related to the gcp (glycoprotease) protein family, also universal in bacteria and unknown in function. In Gram-positive lineages, members of these two related families often belong to the same operon, along with the ribosomal-protein-alanine acetyltransferase gene. Members of this family may occur as fusions with gcp or the ribosomal protein N-acetyltransferase rimI, and is frequently encoded next to rimI.
Probab=21.01  E-value=1.5e+02  Score=21.31  Aligned_cols=42  Identities=19%  Similarity=0.225  Sum_probs=32.4

Q ss_pred             cHHHHHHHHHHHHHHHHHHcCCCccceEEEEEEeCCCCCHHHH
Q psy1836          46 GVENQAKQALTNMGHILKEAGGSYEHVVKTTILLNDINDFATV   88 (127)
Q Consensus        46 d~~~Q~~~~~~ni~~~L~~~G~~~~~vv~~~vyl~d~~~~~~~   88 (127)
                      .-..+.+..+..|+.+|+++|.+++|| ..-.+-.....|..+
T Consensus        29 ~~~~h~~~l~~~i~~~l~~~~~~~~~i-~~iav~~GPGSfTGl   70 (202)
T TIGR03725        29 AGRNHSEILLPMIEELLAEAGLSLQDL-DAIAVGVGPGSFTGL   70 (202)
T ss_pred             hhHHHHHHHHHHHHHHHHHcCCCHHHC-CEEEEecCCChHHhH
Confidence            457799999999999999999999998 344445566666443


No 99 
>PF10548 P22_AR_C:  P22AR C-terminal domain;  InterPro: IPR018876  This entry represents the carboxy-terminal domain of the Enterobacteria phage P22 antirepressor ((P03037 from SWISSPROT) []. It is found associated with IPR018875 from INTERPRO. 
Probab=20.67  E-value=1.6e+02  Score=18.01  Aligned_cols=26  Identities=15%  Similarity=0.212  Sum_probs=22.1

Q ss_pred             cHHHHHHHHHHHHHHHHHHcCCCccc
Q psy1836          46 GVENQAKQALTNMGHILKEAGGSYEH   71 (127)
Q Consensus        46 d~~~Q~~~~~~ni~~~L~~~G~~~~~   71 (127)
                      -.-.+++..++++...|++.|+.+.-
T Consensus        21 ~~~~~m~~~~~~l~p~L~~lgS~~a~   46 (74)
T PF10548_consen   21 FAAERMRELCQELYPALKALGSNYAG   46 (74)
T ss_pred             HHHHHHHHHHHHHHHHHHHhCCCcCc
Confidence            35678999999999999999997764


No 100
>PLN02606 palmitoyl-protein thioesterase
Probab=20.41  E-value=4.4e+02  Score=20.82  Aligned_cols=58  Identities=16%  Similarity=0.105  Sum_probs=42.7

Q ss_pred             ccccHHHHHHHHHHHHHHHHHHcCCCccceEEEEEEeCCCCCHHHHHHHHHhhcCCCCCccEEEEcC
Q psy1836          43 MAEGVENQAKQALTNMGHILKEAGGSYEHVVKTTILLNDINDFATVNNVYGQFFKPPYPARSTFQVG  109 (127)
Q Consensus        43 ~~~d~~~Q~~~~~~ni~~~L~~~G~~~~~vv~~~vyl~d~~~~~~~~~v~~~~f~~~~Pa~t~v~V~  109 (127)
                      +..++.+|++.+.+.|.. .++..-+        +-+...+.=..+-+.+-+++++.+|.++.|-.+
T Consensus        72 ~~~~~~~Qv~~vce~l~~-~~~L~~G--------~naIGfSQGglflRa~ierc~~~p~V~nlISlg  129 (306)
T PLN02606         72 LFMPLRQQASIACEKIKQ-MKELSEG--------YNIVAESQGNLVARGLIEFCDNAPPVINYVSLG  129 (306)
T ss_pred             cccCHHHHHHHHHHHHhc-chhhcCc--------eEEEEEcchhHHHHHHHHHCCCCCCcceEEEec
Confidence            447889999999999988 4444322        223345666788999999998878999998644


No 101
>PF02200 STE:  STE like transcription factor;  InterPro: IPR003120 This family consists of transcription factors related to STE and is found associated with the C2H2 zinc finger in some proteins.; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus
Probab=20.40  E-value=1.2e+02  Score=19.92  Aligned_cols=20  Identities=20%  Similarity=0.285  Sum_probs=17.5

Q ss_pred             CCCCcceEEEECCEEEEccc
Q psy1836          14 GSASAFQAVQHDNTLYISGV   33 (127)
Q Consensus        14 ~~~~ys~av~~g~~v~iSG~   33 (127)
                      |.+.|-.+|-.+++.|++|+
T Consensus        10 ~~~e~isCV~Wn~l~~ItGT   29 (110)
T PF02200_consen   10 PNGEFISCVLWNNLFHITGT   29 (110)
T ss_pred             CCCCEEEEEEEccEEEEecc
Confidence            55669999999999999996


No 102
>PF14359 DUF4406:  Domain of unknown function (DUF4406)
Probab=20.15  E-value=1.6e+02  Score=18.61  Aligned_cols=57  Identities=11%  Similarity=0.149  Sum_probs=32.7

Q ss_pred             EEEccccccCCCCCcccccHHHHHHHHHHHHHHHHHHcCCCccceEEEEEEeCCCCCHHHHHHHHHhhc
Q psy1836          28 LYISGVLGIDASSGKMAEGVENQAKQALTNMGHILKEAGGSYEHVVKTTILLNDINDFATVNNVYGQFF   96 (127)
Q Consensus        28 v~iSG~~~~d~~~g~~~~d~~~Q~~~~~~ni~~~L~~~G~~~~~vv~~~vyl~d~~~~~~~~~v~~~~f   96 (127)
                      |||||-..-          ..+.-+.+|.+..+.|++.|...-+=+..  ...+-.+++...++--+.+
T Consensus         1 iYIaGPmtG----------~~~~N~~~f~~~a~~L~~~G~~vvnPa~~--~~~~~~~~~~ym~~~l~~L   57 (92)
T PF14359_consen    1 IYIAGPMTG----------LPDYNRPAFNAAAKRLRAKGYEVVNPAEL--GIPEGLSWEEYMRICLAML   57 (92)
T ss_pred             CeEeCCcCC----------CcchHHHHHHHHHHHHHHCCCEEeCchhh--CCCCCCCHHHHHHHHHHHH
Confidence            588885532          22456789999999999999432221111  2233334556555554444


No 103
>PTZ00112 origin recognition complex 1 protein; Provisional
Probab=20.10  E-value=4.8e+02  Score=24.45  Aligned_cols=65  Identities=20%  Similarity=0.227  Sum_probs=34.1

Q ss_pred             CCEEEEccccccCCCCCcccccHHHHHHHHHHHHHHHHHHcCCCccceEEEEEEeCCCCCHHHHHHHHHhhcCCC
Q psy1836          25 DNTLYISGVLGIDASSGKMAEGVENQAKQALTNMGHILKEAGGSYEHVVKTTILLNDINDFATVNNVYGQFFKPP   99 (127)
Q Consensus        25 g~~v~iSG~~~~d~~~g~~~~d~~~Q~~~~~~ni~~~L~~~G~~~~~vv~~~vyl~d~~~~~~~~~v~~~~f~~~   99 (127)
                      ++.+||+|..|.-.         ..-++.+++.++..-+..+...=.++.++++-- ......+..+|.++++..
T Consensus       781 nnvLYIyG~PGTGK---------TATVK~VLrELqeeaeqk~lp~f~vVYINCm~L-stp~sIYqvI~qqL~g~~  845 (1164)
T PTZ00112        781 NQILYISGMPGTGK---------TATVYSVIQLLQHKTKQKLLPSFNVFEINGMNV-VHPNAAYQVLYKQLFNKK  845 (1164)
T ss_pred             CceEEEECCCCCCH---------HHHHHHHHHHHHHHHhhccCCCceEEEEeCCcc-CCHHHHHHHHHHHHcCCC
Confidence            46789999987532         134555666666544443332222333333210 123455666676666554


Done!