Query         030744
Match_columns 172
No_of_seqs    118 out of 632
Neff          7.6 
Searched_HMMs 46136
Date          Fri Mar 29 03:55:14 2013
Command       hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/030744.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/030744hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 PF10607 CLTH:  CTLH/CRA C-term 100.0 3.1E-34 6.7E-39  214.2  14.4  141   10-152     1-144 (145)
  2 KOG2659 LisH motif-containing  100.0 6.8E-34 1.5E-38  224.4  15.4  154    2-155    56-210 (228)
  3 KOG0396 Uncharacterized conser  99.9 8.9E-21 1.9E-25  157.7  13.8  137    6-144   148-285 (389)
  4 smart00757 CRA CT11-RanBPM. pr  99.8 1.9E-19 4.2E-24  125.9  11.0   94   64-157     2-98  (99)
  5 KOG2817 Predicted E3 ubiquitin  99.7 3.9E-17 8.5E-22  137.3  11.8  132   11-144   156-291 (394)
  6 smart00668 CTLH C-terminal to   99.5 1.1E-13 2.4E-18   87.6   5.9   56   11-66      2-57  (58)
  7 KOG1477 SPRY domain-containing  99.0 7.8E-11 1.7E-15  103.2   0.9  157   14-171   308-468 (469)
  8 KOG0293 WD40 repeat-containing  98.4 3.5E-06 7.5E-11   72.2   9.8  137   13-154    52-191 (519)
  9 PF14559 TPR_19:  Tetratricopep  89.3     2.1 4.5E-05   26.5   6.1   60   20-84      1-60  (68)
 10 COG5109 Uncharacterized conser  81.7      17 0.00036   30.9   9.1  131   10-144   138-293 (396)
 11 cd08044 TAF5_NTD2 TAF5_NTD2 is  78.8    0.88 1.9E-05   33.3   0.7   97   46-144    28-125 (133)
 12 PRK02289 4-oxalocrotonate taut  65.3      10 0.00022   23.6   3.2   26  110-135    12-37  (60)
 13 PF13838 Clathrin_H_link:  Clat  65.0      15 0.00031   23.9   3.9   39   50-90      7-47  (66)
 14 PF04494 TFIID_90kDa:  WD40 ass  62.0      15 0.00033   27.0   4.2   47   46-94     39-85  (142)
 15 COG3071 HemY Uncharacterized e  61.1      85  0.0018   27.5   9.0   74   15-100   268-341 (400)
 16 PTZ00196 60S ribosomal protein  58.1      19 0.00042   25.2   3.8   32   49-80     48-79  (98)
 17 KOG0275 Conserved WD40 repeat-  57.1      53  0.0011   28.3   6.9  131   12-154    42-176 (508)
 18 PF14689 SPOB_a:  Sensor_kinase  54.9      37  0.0008   21.3   4.5   31   11-41     24-54  (62)
 19 PF12895 Apc3:  Anaphase-promot  53.9      12 0.00026   24.3   2.3   19   17-35     32-50  (84)
 20 PF13432 TPR_16:  Tetratricopep  51.3      53  0.0011   19.8   5.4   56   17-77      4-59  (65)
 21 PRK01271 4-oxalocrotonate taut  50.5      25 0.00055   23.3   3.3   26  110-135    13-38  (76)
 22 PF01361 Tautomerase:  Tautomer  48.4      25 0.00055   21.4   3.0   25  111-135    12-36  (60)
 23 PF01158 Ribosomal_L36e:  Ribos  46.7      37  0.0008   23.8   3.8   32   49-80     48-79  (98)
 24 KOG2910 Uncharacterized conser  46.5      86  0.0019   24.7   6.1   63   12-76     41-116 (209)
 25 PF14973 TINF2_N:  TERF1-intera  46.4      73  0.0016   23.8   5.6   76   16-95     46-129 (145)
 26 PF07721 TPR_4:  Tetratricopept  45.9      27 0.00058   17.7   2.3   17   18-34      9-25  (26)
 27 PF12854 PPR_1:  PPR repeat      45.1      50  0.0011   17.8   3.7   26   10-35      7-32  (34)
 28 KOG0396 Uncharacterized conser  43.7   1E+02  0.0022   26.8   6.7   86   12-100   118-206 (389)
 29 PF14691 Fer4_20:  Dihydroprymi  43.6      33 0.00072   24.3   3.3   23   16-38     44-66  (111)
 30 cd00491 4Oxalocrotonate_Tautom  43.5      41 0.00088   20.1   3.3   25  111-135    12-36  (58)
 31 TIGR00756 PPR pentatricopeptid  43.3      39 0.00085   17.1   2.9   22   15-36      5-26  (35)
 32 PRK01964 4-oxalocrotonate taut  42.9      40 0.00088   20.9   3.3   26  110-135    12-37  (64)
 33 PRK07143 hypothetical protein;  42.5      17 0.00037   30.1   1.9   24   15-38    153-176 (279)
 34 PRK02220 4-oxalocrotonate taut  42.0      42 0.00092   20.4   3.3   26  110-135    12-37  (61)
 35 TIGR00013 taut 4-oxalocrotonat  41.8      45 0.00097   20.4   3.4   26  110-135    12-37  (63)
 36 COG4105 ComL DNA uptake lipopr  41.5   2E+02  0.0043   23.6   9.0   62   14-81     38-103 (254)
 37 TIGR00083 ribF riboflavin kina  41.3      27 0.00059   29.0   2.9   24   15-38    147-170 (288)
 38 COG0268 RpsT Ribosomal protein  41.1      48   0.001   22.7   3.6   28   15-42     33-60  (88)
 39 PF04157 EAP30:  EAP30/Vps36 fa  41.0 1.8E+02  0.0039   22.9   9.0   79   65-143    39-121 (223)
 40 PF00627 UBA:  UBA/TS-N domain;  40.7      46   0.001   18.4   3.0   19   15-33     17-37  (37)
 41 KOG3452 60S ribosomal protein   40.4      53  0.0012   23.0   3.7   45   49-96     50-94  (102)
 42 PF12793 SgrR_N:  Sugar transpo  40.3      55  0.0012   23.3   4.0   24   14-38     74-97  (115)
 43 PF07035 Mic1:  Colon cancer-as  39.7      54  0.0012   25.1   4.1   62   14-75     49-115 (167)
 44 PF04121 Nup84_Nup100:  Nuclear  38.9      43 0.00093   31.3   4.1   29    8-36    131-159 (697)
 45 KOG2235 Uncharacterized conser  38.3      72  0.0016   29.7   5.2   65   58-129    66-131 (776)
 46 PF06957 COPI_C:  Coatomer (COP  38.2 1.5E+02  0.0034   26.1   7.2   26   11-36    119-144 (422)
 47 KOG3060 Uncharacterized conser  38.1 1.3E+02  0.0028   25.1   6.2   65   23-92     99-163 (289)
 48 PF07208 DUF1414:  Protein of u  37.2      60  0.0013   19.3   3.1   20  110-129    24-43  (44)
 49 COG5443 FlbT Flagellar biosynt  37.1      59  0.0013   24.1   3.7   31    8-38     92-122 (148)
 50 PF10827 DUF2552:  Protein of u  37.1      23  0.0005   23.3   1.5   16   25-40     60-75  (79)
 51 PRK00745 4-oxalocrotonate taut  36.8      60  0.0013   19.7   3.4   25  110-134    12-36  (62)
 52 COG5051 RPL36A Ribosomal prote  36.7      78  0.0017   21.7   4.0   44   50-96     51-94  (97)
 53 PF07079 DUF1347:  Protein of u  36.2 2.9E+02  0.0062   25.1   8.4   51   14-64    132-190 (549)
 54 PF13934 ELYS:  Nuclear pore co  36.1      79  0.0017   25.2   4.7   16   60-75    151-166 (226)
 55 KOG1156 N-terminal acetyltrans  35.5 3.8E+02  0.0083   25.2  13.4  127   11-144   186-332 (700)
 56 PRK04984 fatty acid metabolism  35.2      82  0.0018   24.7   4.7   24   54-77    194-217 (239)
 57 PRK00794 flbT flagellar biosyn  34.5      88  0.0019   23.0   4.4   30    9-38     92-121 (132)
 58 PF14276 DUF4363:  Domain of un  33.7 1.7E+02  0.0037   20.5   7.8   47   13-59     31-77  (121)
 59 PRK05627 bifunctional riboflav  32.9      34 0.00074   28.7   2.2   24   15-38    164-187 (305)
 60 PF12931 Sec16_C:  Sec23-bindin  32.9      46   0.001   27.4   3.0   20   16-35      1-20  (284)
 61 PF13371 TPR_9:  Tetratricopept  32.8 1.2E+02  0.0026   18.5   7.0   55   20-79      5-59  (73)
 62 PF07378 FlbT:  Flagellar prote  32.5      87  0.0019   22.8   4.0   32    7-38     88-119 (126)
 63 PF13041 PPR_2:  PPR repeat fam  32.5      63  0.0014   18.6   2.8   24   14-37      7-30  (50)
 64 PF12169 DNA_pol3_gamma3:  DNA   32.4 1.1E+02  0.0025   21.7   4.8   23   15-37     19-41  (143)
 65 PF13812 PPR_3:  Pentatricopept  32.0      75  0.0016   16.2   2.9   23   14-36      5-27  (34)
 66 PF01535 PPR:  PPR repeat;  Int  31.7      60  0.0013   16.1   2.4   22   15-36      5-26  (31)
 67 TIGR01470 cysG_Nterm siroheme   31.6 1.2E+02  0.0027   23.6   5.1   64   12-76    135-204 (205)
 68 PF10602 RPN7:  26S proteasome   30.6 2.4E+02  0.0052   21.4   7.2   53   16-68     79-132 (177)
 69 COG1942 Uncharacterized protei  30.6      90  0.0019   20.2   3.5   26  110-135    13-38  (69)
 70 PF14691 Fer4_20:  Dihydroprymi  30.0      65  0.0014   22.8   3.0   26   50-75     39-64  (111)
 71 PRK15359 type III secretion sy  29.7 2.1E+02  0.0045   20.6   5.8   57   16-77     30-86  (144)
 72 COG5516 Conserved protein cont  29.7 1.4E+02  0.0031   23.2   4.9   57  112-171    42-99  (196)
 73 KOG0989 Replication factor C,   29.2 2.5E+02  0.0055   24.1   6.7   73   21-93    219-299 (346)
 74 PRK00239 rpsT 30S ribosomal pr  29.0      98  0.0021   21.1   3.6   27   16-42     34-60  (88)
 75 PRK09239 chorismate mutase; Pr  28.8 2.1E+02  0.0045   20.0   6.2   49  108-158    50-102 (104)
 76 PF14498 Glyco_hyd_65N_2:  Glyc  28.3      38 0.00082   26.9   1.7   34    4-37     48-81  (236)
 77 PRK10421 DNA-binding transcrip  28.3 1.4E+02   0.003   23.7   5.0   29   10-38    190-218 (253)
 78 PF12569 NARP1:  NMDA receptor-  28.2 1.4E+02  0.0031   27.0   5.5   71   20-95     14-84  (517)
 79 PF10607 CLTH:  CTLH/CRA C-term  27.5      72  0.0016   22.9   3.0   30    8-37     38-67  (145)
 80 PF01649 Ribosomal_S20p:  Ribos  25.7 1.2E+02  0.0027   20.4   3.6   28   15-42     32-59  (84)
 81 PF00701 DHDPS:  Dihydrodipicol  25.4 3.7E+02   0.008   21.8   8.0   19   15-33    215-233 (289)
 82 TIGR03504 FimV_Cterm FimV C-te  25.1   1E+02  0.0023   18.0   2.8   19   18-36      7-25  (44)
 83 PF13428 TPR_14:  Tetratricopep  25.1   1E+02  0.0022   17.3   2.8   18   18-35      9-26  (44)
 84 PRK10225 DNA-binding transcrip  25.0 1.2E+02  0.0027   24.0   4.2   26   53-78    201-226 (257)
 85 PLN02150 terpene synthase/cycl  24.4 1.6E+02  0.0035   20.2   4.1   35  116-152    33-67  (96)
 86 KOG3930 Uncharacterized conser  23.9 1.5E+02  0.0032   25.4   4.4   50  106-155    21-71  (389)
 87 PF08625 Utp13:  Utp13 specific  23.7 2.3E+02   0.005   20.9   5.1   22   15-36      2-23  (141)
 88 PF10475 DUF2450:  Protein of u  23.5 1.2E+02  0.0025   25.1   3.8   29   12-40    129-157 (291)
 89 PF12862 Apc5:  Anaphase-promot  23.4 2.3E+02  0.0051   18.8   6.3   28   17-44      5-32  (94)
 90 PRK12791 flbT flagellar biosyn  22.8 1.7E+02  0.0038   21.5   4.2   27   12-38     92-118 (131)
 91 COG5117 NOC3 Protein involved   22.6 5.9E+02   0.013   23.2   8.0  103   11-127   375-480 (657)
 92 cd00194 UBA Ubiquitin Associat  22.4      69  0.0015   17.4   1.6   21   14-34     15-37  (38)
 93 KOG2659 LisH motif-containing   22.2 2.1E+02  0.0045   23.1   4.8   63   14-78     31-93  (228)
 94 PF07719 TPR_2:  Tetratricopept  21.9 1.1E+02  0.0024   15.4   2.4   16   20-35     11-26  (34)
 95 TIGR00029 S20 ribosomal protei  21.9 1.6E+02  0.0035   20.0   3.6   28   15-42     33-60  (87)
 96 PF09712 PHA_synth_III_E:  Poly  21.9 2.2E+02  0.0048   23.7   5.2   68   87-154   222-290 (293)
 97 PRK09464 pdhR transcriptional   21.0 2.1E+02  0.0047   22.5   4.8   23   16-38    204-226 (254)
 98 cd00686 Terpene_cyclase_cis_tr  20.8 1.8E+02  0.0038   25.2   4.3   79   89-172   214-294 (357)
 99 COG3492 Uncharacterized protei  20.4 2.9E+02  0.0062   19.2   4.5   46  109-154     5-50  (104)
100 PF13424 TPR_12:  Tetratricopep  20.1 2.3E+02   0.005   17.5   5.0   56   20-76     15-73  (78)
101 PRK04964 hypothetical protein;  20.1      99  0.0021   19.8   2.1   26   64-89     22-47  (66)

No 1  
>PF10607 CLTH:  CTLH/CRA C-terminal to LisH motif domain;  InterPro: IPR019589 This entry represents the CRA (or CT11-RanBPM) domain, which is a protein-protein interaction domain present in crown eukaryotes (plants, animals, fungi) and which is found in Ran-binding proteins such as Ran-binding protein 9 (RanBP9 or RanBPM) and RanBP10. RanBPM is a scaffolding protein important in regulating cellular function in both the immune system and the nervous system, and may act as an adapter protein to couple membrane receptors to intracellular signaling pathways. This domain is at the C terminus of the proteins and is the binding domain for the CRA motif, which is comprised of approximately 100 amino acids at the C-terminal of RanBPM. It was found to be important for the interaction of RanBPM with fragile X mental retardation protein (FMRP), but its functional significance has yet to be determined []. 
Probab=100.00  E-value=3.1e-34  Score=214.24  Aligned_cols=141  Identities=35%  Similarity=0.574  Sum_probs=133.5

Q ss_pred             hHHHHHHHHHHHHcCCHHHHHHHHHHhcHHHHccCCcceehhhHHHHHHHHhhcChHHHHHHHHhhccCcCCchhhHHHH
Q 030744           10 DMEMRKRILHFALEGNALKAIELTEELAQDLLEKNKDLHFDLLSLHFVELVCSRKCTEALEFAQTKLTPFGKVQKYVEKL   89 (172)
Q Consensus        10 ~~~~R~~I~~~I~~G~i~~Ai~~i~~~~p~ll~~~~~l~F~L~~q~fIELir~~~~~eAi~~ar~~l~~~~~~~~~~~~l   89 (172)
                      ++.+|+.|+++|++|++++|++|+++++|.+++.++.++|.|++|+|||||+++++.+||+|||+++.++.  ..+.+++
T Consensus         1 ~~~~r~~I~~~I~~g~i~~Ai~w~~~~~~~l~~~~~~L~f~L~~q~fiell~~~~~~~Ai~y~r~~l~~~~--~~~~~~l   78 (145)
T PF10607_consen    1 SFKERKKIRQAILNGDIDPAIEWLNENFPELLKRNSSLEFELRCQQFIELLREGDIMEAIEYARKHLSPFN--DEFLEEL   78 (145)
T ss_pred             CHHHHHHHHHHHHcCCHHHHHHHHHHcCHHHHhcCCchhHHHHHHHHHHHHHHHhHHHHHHHHHHHhhhhH--HHHHHHH
Confidence            47899999999999999999999999999999999999999999999999999999999999999997665  3689999


Q ss_pred             HHHHhHhcccCCCC---CcchhcCCHHHHHHHHHHHHHHHHHhcCCCCchHHHHHHHHHHHHHHHH
Q 030744           90 EDFMALLAYEEPEK---SPMFHLLSLEYRQHVADNLNRAILAHAERPRYAAMERLIQQTTAVRQCL  152 (172)
Q Consensus        90 ~~~~~LLay~~~~~---sp~~~ll~~~~r~~la~~vn~aIL~~~g~~~~s~Le~l~~~~~~~~~~l  152 (172)
                      +++|+||+|++|.+   +|+++++++++|+.+|+.||++|+.+.|.|+.|+|+.++++..++..++
T Consensus        79 ~~~~~lL~~~~~~~~~~s~~~~l~~~~~~~~la~~~~~~~l~~~~~~~~s~L~~~~~~g~~~l~~l  144 (145)
T PF10607_consen   79 KKLMSLLAYPDPEEPLPSPYKELLSPERREELAEEFNSAILKSYGLPKESPLEVILKAGLSALKTL  144 (145)
T ss_pred             HHHHHHHHcCCcccccchHHHHHhChHHHHHHHHHHHHHHHHHhCcCCCCHHHHHHHHHHHHhhhc
Confidence            99999999999987   7999999999999999999999999999999999999999998887654


No 2  
>KOG2659 consensus LisH motif-containing protein [Cytoskeleton]
Probab=100.00  E-value=6.8e-34  Score=224.39  Aligned_cols=154  Identities=39%  Similarity=0.539  Sum_probs=147.8

Q ss_pred             CCchhcHHhHHHHHHHHHHHHcCCHHHHHHHHHHhcHHHHccCCcceehhhHHHHHHHHhhcChHHHHHHHHhhccCcCC
Q 030744            2 KQPANCLEDMEMRKRILHFALEGNALKAIELTEELAQDLLEKNKDLHFDLLSLHFVELVCSRKCTEALEFAQTKLTPFGK   81 (172)
Q Consensus         2 ~~~~~~~~~~~~R~~I~~~I~~G~i~~Ai~~i~~~~p~ll~~~~~l~F~L~~q~fIELir~~~~~eAi~~ar~~l~~~~~   81 (172)
                      ++|..|++++..|..|+.+|..|+|+.|++.+++++|+++++|.+|.|.|++|+||||||.|..++||+|||+++++++.
T Consensus        56 ~~~~~d~~~~~eR~~Ir~~I~~G~Ie~Aie~in~l~PeiLd~n~~l~F~Lq~q~lIEliR~~~~eeal~F~q~~LA~~a~  135 (228)
T KOG2659|consen   56 KPPSIDLDSMDERLQIRRAIEEGQIEEAIEKVNQLNPEILDTNRELFFHLQQLHLIELIREGKTEEALEFAQTKLAPFAE  135 (228)
T ss_pred             CCccCchhhHhHHHHHHHHHHhccHHHHHHHHHHhChHHHccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHHHcccccc
Confidence            45689999999999999999999999999999999999999999999999999999999999999999999999999997


Q ss_pred             -chhhHHHHHHHHhHhcccCCCCCcchhcCCHHHHHHHHHHHHHHHHHhcCCCCchHHHHHHHHHHHHHHHHHHh
Q 030744           82 -VQKYVEKLEDFMALLAYEEPEKSPMFHLLSLEYRQHVADNLNRAILAHAERPRYAAMERLIQQTTAVRQCLSQE  155 (172)
Q Consensus        82 -~~~~~~~l~~~~~LLay~~~~~sp~~~ll~~~~r~~la~~vn~aIL~~~g~~~~s~Le~l~~~~~~~~~~l~~~  155 (172)
                       +++++.+++++|++|+|++|+.||++++++.++|+++|++||++||.+++.+..+.|..+++...+.+..+...
T Consensus       136 e~~~~~~elE~~l~lLvf~~~~~sp~~~l~~~s~R~kvA~~vN~aiL~~~~~~~~~~l~~llk~~~~~~~~~~~~  210 (228)
T KOG2659|consen  136 ENPKKMEELERTLALLVFELSQESPSAELLSQSLRQKVASEVNSAILASQEHESEPKLPFLLKLISWAQEELDRE  210 (228)
T ss_pred             ccHHHHHHHHHHHHHHHcCCcccCcHHHHHHHHHHHHHHHHHHHHHHHHhcccccchHHHHHHHHHHHHHHHhHh
Confidence             77899999999999999999999999999999999999999999999999999999999999888888777665


No 3  
>KOG0396 consensus Uncharacterized conserved protein [Function unknown]
Probab=99.86  E-value=8.9e-21  Score=157.71  Aligned_cols=137  Identities=22%  Similarity=0.265  Sum_probs=128.4

Q ss_pred             hcHHhHHHHHHHHHHHHcCCHHHHHHHHHHhcHHHHccCCcceehhhHHHHHHHHhhcChHHHHHHHHhhccCcCCchhh
Q 030744            6 NCLEDMEMRKRILHFALEGNALKAIELTEELAQDLLEKNKDLHFDLLSLHFVELVCSRKCTEALEFAQTKLTPFGKVQKY   85 (172)
Q Consensus         6 ~~~~~~~~R~~I~~~I~~G~i~~Ai~~i~~~~p~ll~~~~~l~F~L~~q~fIELir~~~~~eAi~~ar~~l~~~~~~~~~   85 (172)
                      .|.+-.+.-+.|+++|+.|++.+|+.||++|.-.+-+.++.++|.++.|+|||||+.+++.+||+|++++|++++  .++
T Consensus       148 vD~Dv~~~~~~I~~sll~~~l~~~Lswc~ehk~~LkK~~S~lEf~lRlQefIELi~~~~~~~Ai~~akk~f~~~~--~~~  225 (389)
T KOG0396|consen  148 VDSDVYKRAYGIRDSLLAGELEPALSWCKEHKVELKKEESSLEFQLRLQEFIELIKVDNYDKAIAFAKKHFAPWA--KSH  225 (389)
T ss_pred             HhHHHHHHHHHHHHHHHhcchHHHHHHHHHHHHHHHhccchhhhHHHHHHHHHHHHhccHHHHHHHHHHHHhhhh--hhh
Confidence            466777888999999999999999999999999999999999999999999999999999999999999999998  488


Q ss_pred             HHHHHHHHhHhcccC-CCCCcchhcCCHHHHHHHHHHHHHHHHHhcCCCCchHHHHHHHH
Q 030744           86 VEKLEDFMALLAYEE-PEKSPMFHLLSLEYRQHVADNLNRAILAHAERPRYAAMERLIQQ  144 (172)
Q Consensus        86 ~~~l~~~~~LLay~~-~~~sp~~~ll~~~~r~~la~~vn~aIL~~~g~~~~s~Le~l~~~  144 (172)
                      ...++.+||+|||+. +..++|..+++.+||+.+++.+-+-.+...|+|..|+|-..++.
T Consensus       226 ~~~Lk~a~g~laF~~~t~~sky~~l~~~~rw~~l~~lF~s~a~~l~~i~~~~~L~~~l~~  285 (389)
T KOG0396|consen  226 KSDLKLAMGLLAFPKYTSSSKYLNLLTADRWSVLADLFLSEALKLFGIPINPALTIYLQA  285 (389)
T ss_pred             HHHHHHHHHhhcCccccCcccccCcccHHHHHHHHHHhhHHHHHHhCCCCCcHHHHHHHh
Confidence            999999999999986 55567999999999999999999999999999999999888874


No 4  
>smart00757 CRA CT11-RanBPM. protein-protein interaction domain present in crown eukaryotes (plants, animals, fungi)
Probab=99.82  E-value=1.9e-19  Score=125.93  Aligned_cols=94  Identities=39%  Similarity=0.560  Sum_probs=87.1

Q ss_pred             ChHHHHHHHHhhccCcCC-chhhHHHHHHHHhHhcccCC-CCCcchhcCCHHHHHHHHHHHHHHHHHhc-CCCCchHHHH
Q 030744           64 KCTEALEFAQTKLTPFGK-VQKYVEKLEDFMALLAYEEP-EKSPMFHLLSLEYRQHVADNLNRAILAHA-ERPRYAAMER  140 (172)
Q Consensus        64 ~~~eAi~~ar~~l~~~~~-~~~~~~~l~~~~~LLay~~~-~~sp~~~ll~~~~r~~la~~vn~aIL~~~-g~~~~s~Le~  140 (172)
                      ++.+||+|||+++++|.. ++.+.++++++||||||+++ +.+|+++++++++|+.+|++||++||... |.+.+|.|+.
T Consensus         2 ~~~eAi~yar~~l~~~~~~~~~~~~el~~~m~llaf~~~~~~sp~~~ll~~~~~~~la~~~n~~~l~~~~~~~~~s~L~~   81 (99)
T smart00757        2 KIEEALAYARELLAPFAKEHEKFLKELEKTMALLAYPDPTEPSPYKELLSPSQREKLAEELNSAILELLHGKSSESPLEI   81 (99)
T ss_pred             cHHHHHHHHHHHHhhhcccCHHHHHHHHHHHHHHhcCCCCCCccHHHHCCHHHHHHHHHHHHHHHHHHccCCCCCChHHH
Confidence            578999999999999986 55568899999999999998 89999999999999999999999999998 9999999999


Q ss_pred             HHHHHHHHHHHHHHhcC
Q 030744          141 LIQQTTAVRQCLSQELG  157 (172)
Q Consensus       141 l~~~~~~~~~~l~~~~~  157 (172)
                      ++++..++...+...++
T Consensus        82 ~~~~~~~~~~~l~~~~~   98 (99)
T smart00757       82 LLSAGLAALKTLLEKGG   98 (99)
T ss_pred             HHHHHHHHHHHHHHhcC
Confidence            99999999998887664


No 5  
>KOG2817 consensus Predicted E3 ubiquitin ligase [Posttranslational modification, protein turnover, chaperones]
Probab=99.73  E-value=3.9e-17  Score=137.25  Aligned_cols=132  Identities=28%  Similarity=0.387  Sum_probs=121.2

Q ss_pred             HHHHHHHHHHHHcCCHHHHHHHHHHhcHHHHccCCcceehhhHHHHHHHHhhcChH--HHHHHHHhhccCcCCchhhHHH
Q 030744           11 MEMRKRILHFALEGNALKAIELTEELAQDLLEKNKDLHFDLLSLHFVELVCSRKCT--EALEFAQTKLTPFGKVQKYVEK   88 (172)
Q Consensus        11 ~~~R~~I~~~I~~G~i~~Ai~~i~~~~p~ll~~~~~l~F~L~~q~fIELir~~~~~--eAi~~ar~~l~~~~~~~~~~~~   88 (172)
                      .-+-++|.++|..||+++|++|+..+...|.+.++.|+|+|+.++|+++++.|.-.  +||.|||++++||+  ..+.++
T Consensus       156 F~el~~Iv~~lke~Dl~~aLeWa~~~~~~L~~~~s~LE~~Lh~l~fl~l~~~g~~~~~eAl~Yar~~~~~F~--~~~~~e  233 (394)
T KOG2817|consen  156 FVELNQIVEALKERDLEPALEWAESNRQKLKEKSSSLEFKLHSLHFLSLIRGGKSDQREALRYARTHFAPFV--ADHLRE  233 (394)
T ss_pred             HHHHHHHHHHHHhccchhHHHHHHHhhhhhccccccHHHHHHHHHHHHHHhcCCcCcHHHHHHHHHhcCccc--cchHHH
Confidence            44568899999999999999999999999999999999999999999999988766  99999999999998  456899


Q ss_pred             HHHHHhHhccc-C-CCCCcchhcCCHHHHHHHHHHHHHHHHHhcCCCCchHHHHHHHH
Q 030744           89 LEDFMALLAYE-E-PEKSPMFHLLSLEYRQHVADNLNRAILAHAERPRYAAMERLIQQ  144 (172)
Q Consensus        89 l~~~~~LLay~-~-~~~sp~~~ll~~~~r~~la~~vn~aIL~~~g~~~~s~Le~l~~~  144 (172)
                      ++.+|++|.|- + .++|||.+++++..|..+.+++-+-.+..+|+|.+|+|-.++.-
T Consensus       234 IQklm~sl~~l~~gl~~spy~~~ls~~~w~~~~~~f~r~ycallg~s~eSPL~v~v~a  291 (394)
T KOG2817|consen  234 IQKLMGSLLYLRNGLEKSPYSEILSPKLWKELTEEFTREYCALLGISVESPLSVLVNA  291 (394)
T ss_pred             HHHHHHHHHHHHcCCCCCChHHHhCHHHHHHHHHHHHHHHHHHcCCCccCcHHHHHHh
Confidence            99999999994 3 57999999999999999999999999999999999888877764


No 6  
>smart00668 CTLH C-terminal to LisH motif. Alpha-helical motif of unknown function.
Probab=99.47  E-value=1.1e-13  Score=87.55  Aligned_cols=56  Identities=38%  Similarity=0.524  Sum_probs=52.8

Q ss_pred             HHHHHHHHHHHHcCCHHHHHHHHHHhcHHHHccCCcceehhhHHHHHHHHhhcChH
Q 030744           11 MEMRKRILHFALEGNALKAIELTEELAQDLLEKNKDLHFDLLSLHFVELVCSRKCT   66 (172)
Q Consensus        11 ~~~R~~I~~~I~~G~i~~Ai~~i~~~~p~ll~~~~~l~F~L~~q~fIELir~~~~~   66 (172)
                      ..+|..|+++|+.|+|++|++|++.++|.+.+.++.+.|.|++|+|+||++.++..
T Consensus         2 ~~~~~~i~~~i~~g~~~~a~~~~~~~~~~l~~~~~~l~f~L~~q~~lell~~~~~~   57 (58)
T smart00668        2 FDERKRIRELILKGDWDEALEWLSSLKPPLLERNSKLEFELRKQKFLELVRQGKLE   57 (58)
T ss_pred             hHHHHHHHHHHHcCCHHHHHHHHHHcCHHHhccCCCchhHHHHHHHHHHHHcCCcC
Confidence            46899999999999999999999999999999999999999999999999987653


No 7  
>KOG1477 consensus SPRY domain-containing proteins [General function prediction only]
Probab=99.00  E-value=7.8e-11  Score=103.16  Aligned_cols=157  Identities=24%  Similarity=0.165  Sum_probs=130.6

Q ss_pred             HHHHHHHHHcCCHHHHHHHHHHhcHHHHc-cCCcceehhhHHHHHHHHhhcChHHHHHHHHhhccCcCC---chhhHHHH
Q 030744           14 RKRILHFALEGNALKAIELTEELAQDLLE-KNKDLHFDLLSLHFVELVCSRKCTEALEFAQTKLTPFGK---VQKYVEKL   89 (172)
Q Consensus        14 R~~I~~~I~~G~i~~Ai~~i~~~~p~ll~-~~~~l~F~L~~q~fIELir~~~~~eAi~~ar~~l~~~~~---~~~~~~~l   89 (172)
                      |+.+.....+|....|+.-....+....+ +.+...+.+.|+.+|.+.+.+.+...+++.+..+++...   +.....++
T Consensus       308 ~~g~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~g~v~~e~~~~k~~l~~~~g~~~~~~~~~~~  387 (469)
T KOG1477|consen  308 RKGRGQFTRNGAYNAALIPTYRKVGQVFEVDYPQRGAKDPCGLHVNLGRAGFVFIEANAKKWELAKDYGIKKNSAAVGML  387 (469)
T ss_pred             hcCcceeechhhhcccccccccccceeecccccchhhccchhhhhhHHHHHHHHHHHHHHHHhhhhhhCcCccccccccc
Confidence            34444445555555555444444432222 457899999999999999999999999999988887654   66788999


Q ss_pred             HHHHhHhcccCCCCCcchhcCCHHHHHHHHHHHHHHHHHhcCCCCchHHHHHHHHHHHHHHHHHHhcCCCCCCCcchhhh
Q 030744           90 EDFMALLAYEEPEKSPMFHLLSLEYRQHVADNLNRAILAHAERPRYAAMERLIQQTTAVRQCLSQELGKDVHPPFSLKDF  169 (172)
Q Consensus        90 ~~~~~LLay~~~~~sp~~~ll~~~~r~~la~~vn~aIL~~~g~~~~s~Le~l~~~~~~~~~~l~~~~~~~~~~~~~~~~~  169 (172)
                      +.+++||+|++|..||...++++.+|+-+++.+|.+||...+.+..+.||+++.|+.++...+...+++.. .++++.+.
T Consensus       388 ~~s~~Llays~p~~s~~g~~~~~~~~e~v~~~~n~~il~t~~~~~~~~l~~~l~~~~~~~~~~~~~~~~~~-~~~~~~~~  466 (469)
T KOG1477|consen  388 SDSSSLLAYSDPEESPVGYLLDPIQREPVAEALNSAILETDNNSKDPDLERVLSQTPAELSLYARDNPPRN-DFVRDRDY  466 (469)
T ss_pred             cchHHHHHhcCcccCccccccCcccchhHHhhhcccccccCCCCccchhhhhhccchhhHhhhhhcCCCcc-ceecchhh
Confidence            99999999999999999999999999999999999999999999999999999999999999999988877 78888877


Q ss_pred             hc
Q 030744          170 MK  171 (172)
Q Consensus       170 ~~  171 (172)
                      ++
T Consensus       467 ~~  468 (469)
T KOG1477|consen  467 FH  468 (469)
T ss_pred             hc
Confidence            65


No 8  
>KOG0293 consensus WD40 repeat-containing protein [Function unknown]
Probab=98.35  E-value=3.5e-06  Score=72.18  Aligned_cols=137  Identities=16%  Similarity=0.139  Sum_probs=104.5

Q ss_pred             HHHHHHHHHHcCCHHHHHHHHHHh-cHHHHccCCcceehhhHHHHHHHHhhcChHHHHHHHHhhccCcCCchhhHHHHHH
Q 030744           13 MRKRILHFALEGNALKAIELTEEL-AQDLLEKNKDLHFDLLSLHFVELVCSRKCTEALEFAQTKLTPFGKVQKYVEKLED   91 (172)
Q Consensus        13 ~R~~I~~~I~~G~i~~Ai~~i~~~-~p~ll~~~~~l~F~L~~q~fIELir~~~~~eAi~~ar~~l~~~~~~~~~~~~l~~   91 (172)
                      +-+...+++++|+|+.++..+... ++. ........|.+.+|.|+|.++.|++..|+...|..+.+..   .-.+.+.+
T Consensus        52 t~klf~q~vlqg~w~q~v~~~~~i~~~d-e~~~~ea~fLv~kQ~fLEf~k~~~is~al~~l~~~~~~lr---~~~kk~~e  127 (519)
T KOG0293|consen   52 TTKLFDQQVLQGQWDQQVMSLVRISFED-ERNRKEAMFLVNKQIFLEFLKTGSISHALPVLRNPVLYLR---KNKKKFHE  127 (519)
T ss_pred             hHHHHHHHHHcccHHHHHHHHhhccCcc-hhhhHHHHHHHHHHHHHHHHhhccHhhhhHhhhcchhhhh---hhHHHHHH
Confidence            345678999999999999888877 554 5566789999999999999999999999999997665553   23456677


Q ss_pred             HHhHhcccCCCCC-cc-hhcCCHHHHHHHHHHHHHHHHHhcCCCCchHHHHHHHHHHHHHHHHHH
Q 030744           92 FMALLAYEEPEKS-PM-FHLLSLEYRQHVADNLNRAILAHAERPRYAAMERLIQQTTAVRQCLSQ  154 (172)
Q Consensus        92 ~~~LLay~~~~~s-p~-~~ll~~~~r~~la~~vn~aIL~~~g~~~~s~Le~l~~~~~~~~~~l~~  154 (172)
                      +.+.|.+++...+ .. ..-.....|.+|.+++...|-...-+| +-+||++++|+...+..-..
T Consensus       128 l~~sll~sn~~~~ne~~~~~~~~n~R~~ll~elskyi~p~illP-~rRLehLl~qAv~~Q~d~cv  191 (519)
T KOG0293|consen  128 LASSLLVSNDQFSNEENTTAQLNNERDKLLDELSKYIPPNILLP-KRRLEHLLEQAVKYQRDSCV  191 (519)
T ss_pred             HHHHHhccccccccccchhhhhchhHHHHHHHHHhhCCHhhcCC-hHHHHHHHHHHHHHHHhHhH
Confidence            7777788764322 11 112233568899999999999988885 69999999999876544433


No 9  
>PF14559 TPR_19:  Tetratricopeptide repeat; PDB: 2R5S_A 3QDN_B 3QOU_A 3ASG_A 3ASD_A 3AS5_A 3AS4_A 3ASH_B 3FP3_A 3LCA_A ....
Probab=89.34  E-value=2.1  Score=26.53  Aligned_cols=60  Identities=22%  Similarity=0.117  Sum_probs=37.3

Q ss_pred             HHHcCCHHHHHHHHHHhcHHHHccCCcceehhhHHHHHHHHhhcChHHHHHHHHhhccCcCCchh
Q 030744           20 FALEGNALKAIELTEELAQDLLEKNKDLHFDLLSLHFVELVCSRKCTEALEFAQTKLTPFGKVQK   84 (172)
Q Consensus        20 ~I~~G~i~~Ai~~i~~~~p~ll~~~~~l~F~L~~q~fIELir~~~~~eAi~~ar~~l~~~~~~~~   84 (172)
                      ++..|+++.|++.+++... ....+.++.+.|    -.=+++.|+.++|.....+.+.....++.
T Consensus         1 ll~~~~~~~A~~~~~~~l~-~~p~~~~~~~~l----a~~~~~~g~~~~A~~~l~~~~~~~~~~~~   60 (68)
T PF14559_consen    1 LLKQGDYDEAIELLEKALQ-RNPDNPEARLLL----AQCYLKQGQYDEAEELLERLLKQDPDNPE   60 (68)
T ss_dssp             HHHTTHHHHHHHHHHHHHH-HTTTSHHHHHHH----HHHHHHTT-HHHHHHHHHCCHGGGTTHHH
T ss_pred             ChhccCHHHHHHHHHHHHH-HCCCCHHHHHHH----HHHHHHcCCHHHHHHHHHHHHHHCcCHHH
Confidence            5789999999999876531 111223333322    22356889999999999876665543333


No 10 
>COG5109 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=81.72  E-value=17  Score=30.94  Aligned_cols=131  Identities=18%  Similarity=0.261  Sum_probs=87.3

Q ss_pred             hHHHHHHHHHHHHcCCHHHHHHHHHHhcHHHHccCCcceehhh--HHHHHHHHhhcChHHHHHHHHhhccCcCCchhhHH
Q 030744           10 DMEMRKRILHFALEGNALKAIELTEELAQDLLEKNKDLHFDLL--SLHFVELVCSRKCTEALEFAQTKLTPFGKVQKYVE   87 (172)
Q Consensus        10 ~~~~R~~I~~~I~~G~i~~Ai~~i~~~~p~ll~~~~~l~F~L~--~q~fIELir~~~~~eAi~~ar~~l~~~~~~~~~~~   87 (172)
                      -...-+.|.+-|.+.+...-++|. +....+-+.++..++.|.  ...++-++-. .+++|+.+-++.++.|.  +.+..
T Consensus       138 ~f~~lK~v~~gI~~k~~~l~iE~~-Qi~gyl~kgdtesel~l~~~~~esl~l~hk-~~~~a~r~c~t~~a~f~--~kh~~  213 (396)
T COG5109         138 GFVKLKKVISGISEKSTFLLIEFL-QIEGYLSKGDTESELELYLVSHESLLLIHK-RYDEALRLCFTKLASFV--PKHIQ  213 (396)
T ss_pred             HHHHHHHHHHhhccchhHhHHHHH-HhcCccccCCchhhhHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHH--HHhcc
Confidence            344455666777778888888888 444444444444444444  4444444443 89999999999999988  66777


Q ss_pred             HHHHHHhHhcccCC------CC-C--cchhcCC--------------HHHHHHHHHHHHHHHHHhcCCCCchHHHHHHHH
Q 030744           88 KLEDFMALLAYEEP------EK-S--PMFHLLS--------------LEYRQHVADNLNRAILAHAERPRYAAMERLIQQ  144 (172)
Q Consensus        88 ~l~~~~~LLay~~~------~~-s--p~~~ll~--------------~~~r~~la~~vn~aIL~~~g~~~~s~Le~l~~~  144 (172)
                      .++..|-.+.+-+.      ++ +  ...+++.              ..-|..+-+.+-+-.+...|++-.|+|..++..
T Consensus       214 dv~~~~~~l~nap~dcfrhrekelmqnI~~~l~ksligqPiEdIDkvnk~~k~l~~lF~~eycaa~gm~~~spL~~~v~t  293 (396)
T COG5109         214 DVKPLLRFLVNAPTDCFRHREKELMQNIQEALKKSLIGQPIEDIDKVNKSRKKLIELFKSEYCAANGMPNRSPLRELVET  293 (396)
T ss_pred             chHHHHHHHHcCchHHhhhcchhHHHHHHHHHHHhhcCCcHHHHHHhhhhHHHHHHHHHHHHHHhcCCCccChHHHHHHh
Confidence            77777776666321      11 0  0111111              135788888999999999999999999988875


No 11 
>cd08044 TAF5_NTD2 TAF5_NTD2 is the second conserved N-terminal region of TATA Binding Protein (TBP) Associated Factor 5 (TAF5), involved in forming Transcription Factor IID (TFIID). The TATA Binding Protein (TBP) Associated Factor 5 (TAF5) is one of several TAFs that bind TBP and are involved in forming Transcription Factor IID (TFIID) complex. TAF5 contains three domains, two conserved sequence motifs at the N-terminal and one at the C-terminal region. TFIID is one of seven General Transcription Factors (GTF) (TFIIA, TFIIB, TFIID, TFIIE, TFIIF, and TFIID) involved in accurate initiation of transcription by RNA polymerase II in eukaryotes. TFIID plays an important role in the recognition of promoter DNA and assembly of the preinitiation complex. TFIID complex is composed of the TBP and at least 13 TAFs.  In yeast and human cells, TAFs have been found as components of other complexes besides TFIID. TAF5 may play a major role in forming TFIID and its related complexes. TAFs from various 
Probab=78.82  E-value=0.88  Score=33.31  Aligned_cols=97  Identities=14%  Similarity=0.210  Sum_probs=58.1

Q ss_pred             cceehhhHHHHHHHHhhcChHHHHHHHHhhccCcCCchhhHHHHHHHHhHhcccCCCCCcch-hcCCHHHHHHHHHHHHH
Q 030744           46 DLHFDLLSLHFVELVCSRKCTEALEFAQTKLTPFGKVQKYVEKLEDFMALLAYEEPEKSPMF-HLLSLEYRQHVADNLNR  124 (172)
Q Consensus        46 ~l~F~L~~q~fIELir~~~~~eAi~~ar~~l~~~~~~~~~~~~l~~~~~LLay~~~~~sp~~-~ll~~~~r~~la~~vn~  124 (172)
                      .+.|=+-+.=|++||.+|...+|..|-.+.-..+.  +.+.+.++.+.++..-....+.+.. .+-+..-+-.+....-.
T Consensus        28 ~lLyPiFvh~yL~lv~~~~~~~A~~F~~~f~~~~~--~~~~~~i~~L~~i~~p~~l~~n~~~~~~r~~Ky~i~ms~~s~~  105 (133)
T cd08044          28 QLLYPIFVHSYLDLVASGHLEEAKSFFERFSGDFE--DSHSEDIKKLSSITTPEHLKENELAKLFRSNKYVIRMSRDAYS  105 (133)
T ss_pred             HHHHHHHHHHHHHHHHCCCHHHHHHHHHHhhHhhH--HHHHHHHHHHHccCCHHHHhhhHHHHHHHhCCEEEEECHHHHH
Confidence            36788888889999999999999999987655553  5677777777644333332333322 22233334444444444


Q ss_pred             HHHHhcCCCCchHHHHHHHH
Q 030744          125 AILAHAERPRYAAMERLIQQ  144 (172)
Q Consensus       125 aIL~~~g~~~~s~Le~l~~~  144 (172)
                      .++.++.....+.+..++++
T Consensus       106 lL~~~L~~~~~~~~~~il~~  125 (133)
T cd08044         106 LLLRFLESWGGSLLLKILNE  125 (133)
T ss_pred             HHHHHHHHCCcHHHHHHHHh
Confidence            44444433444555555554


No 12 
>PRK02289 4-oxalocrotonate tautomerase; Provisional
Probab=65.31  E-value=10  Score=23.57  Aligned_cols=26  Identities=19%  Similarity=0.249  Sum_probs=23.5

Q ss_pred             CCHHHHHHHHHHHHHHHHHhcCCCCc
Q 030744          110 LSLEYRQHVADNLNRAILAHAERPRY  135 (172)
Q Consensus       110 l~~~~r~~la~~vn~aIL~~~g~~~~  135 (172)
                      .+.+++++|++.|..++.+.+|.|+.
T Consensus        12 rs~EqK~~L~~~it~a~~~~~~~p~~   37 (60)
T PRK02289         12 RSQEQKNALAREVTEVVSRIAKAPKE   37 (60)
T ss_pred             CCHHHHHHHHHHHHHHHHHHhCcCcc
Confidence            47899999999999999999999764


No 13 
>PF13838 Clathrin_H_link:  Clathrin-H-link; PDB: 2XZG_A 3GD1_I 1BPO_C 1C9I_B 1C9L_A.
Probab=64.99  E-value=15  Score=23.87  Aligned_cols=39  Identities=26%  Similarity=0.454  Sum_probs=27.6

Q ss_pred             hhhHHHHHHHHhhcChHHHHHHHHhhccCcC--CchhhHHHHH
Q 030744           50 DLLSLHFVELVCSRKCTEALEFAQTKLTPFG--KVQKYVEKLE   90 (172)
Q Consensus        50 ~L~~q~fIELir~~~~~eAi~~ar~~l~~~~--~~~~~~~~l~   90 (172)
                      .|..++|=+++..|++.+|-+.|-+  +|-.  .+++.+.+++
T Consensus         7 ~l~~~~F~~l~~~g~y~eAA~~AA~--sP~giLRt~~Ti~rFk   47 (66)
T PF13838_consen    7 DLYVQQFNELFSQGQYEEAAKVAAN--SPRGILRTPETINRFK   47 (66)
T ss_dssp             HHHHHHHHHHHHTT-HHHHHHHHHH--SGGGTT-SHHHHHHHH
T ss_pred             HHHHHHHHHHHHcCCHHHHHHHHHh--CccchhcCHHHHHHHH
Confidence            4678999999999999999998874  3322  2556555555


No 14 
>PF04494 TFIID_90kDa:  WD40 associated region in TFIID subunit;  InterPro: IPR007582 This region, possibly a domain is found in subunits of transcription factor TFIID. The function of this region is unknown.; GO: 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus; PDB: 2J4B_D 2J49_A 2NXP_F.
Probab=62.00  E-value=15  Score=27.04  Aligned_cols=47  Identities=19%  Similarity=0.302  Sum_probs=37.4

Q ss_pred             cceehhhHHHHHHHHhhcChHHHHHHHHhhccCcCCchhhHHHHHHHHh
Q 030744           46 DLHFDLLSLHFVELVCSRKCTEALEFAQTKLTPFGKVQKYVEKLEDFMA   94 (172)
Q Consensus        46 ~l~F~L~~q~fIELir~~~~~eAi~~ar~~l~~~~~~~~~~~~l~~~~~   94 (172)
                      .+.|=+-+.=|++||.+|...+|-.|-.++-..+.  ..+...|+.+.+
T Consensus        39 ~lLyPvFvh~YL~Lv~~~~~~~A~~F~~kf~~~~~--~~~~~~i~~L~~   85 (142)
T PF04494_consen   39 RLLYPVFVHSYLDLVSKGHPEEAKSFLEKFSPDFE--DSHQEDIEKLSS   85 (142)
T ss_dssp             GGHHHHHHHHHHHHHHTT-HHHHHHHHHHHGGGGH--GHGHHHHHHHTT
T ss_pred             HHHHHHHHHHHHHHHHCCChHHHHHHHHHHHHHHh--HHHHHHHHHHHh
Confidence            47888889999999999999999999998766665  456666777763


No 15 
>COG3071 HemY Uncharacterized enzyme of heme biosynthesis [Coenzyme metabolism]
Probab=61.15  E-value=85  Score=27.47  Aligned_cols=74  Identities=12%  Similarity=0.054  Sum_probs=56.2

Q ss_pred             HHHHHHHHcCCHHHHHHHHHHhcHHHHccCCcceehhhHHHHHHHHhhcChHHHHHHHHhhccCcCCchhhHHHHHHHHh
Q 030744           15 KRILHFALEGNALKAIELTEELAQDLLEKNKDLHFDLLSLHFVELVCSRKCTEALEFAQTKLTPFGKVQKYVEKLEDFMA   94 (172)
Q Consensus        15 ~~I~~~I~~G~i~~Ai~~i~~~~p~ll~~~~~l~F~L~~q~fIELir~~~~~eAi~~ar~~l~~~~~~~~~~~~l~~~~~   94 (172)
                      .-+.++|..|+-++|.+++.+..+.-.+.+        ...||.-++-++...-++-+++-+.....+|    .+-.+.|
T Consensus       268 ~~a~~li~l~~~~~A~~~i~~~Lk~~~D~~--------L~~~~~~l~~~d~~~l~k~~e~~l~~h~~~p----~L~~tLG  335 (400)
T COG3071         268 AYAERLIRLGDHDEAQEIIEDALKRQWDPR--------LCRLIPRLRPGDPEPLIKAAEKWLKQHPEDP----LLLSTLG  335 (400)
T ss_pred             HHHHHHHHcCChHHHHHHHHHHHHhccChh--------HHHHHhhcCCCCchHHHHHHHHHHHhCCCCh----hHHHHHH
Confidence            356789999999999999999888777664        4568888888888888888887777665444    5566667


Q ss_pred             HhcccC
Q 030744           95 LLAYEE  100 (172)
Q Consensus        95 LLay~~  100 (172)
                      -||+.+
T Consensus       336 ~L~~k~  341 (400)
T COG3071         336 RLALKN  341 (400)
T ss_pred             HHHHHh
Confidence            777764


No 16 
>PTZ00196 60S ribosomal protein L36; Provisional
Probab=58.10  E-value=19  Score=25.16  Aligned_cols=32  Identities=25%  Similarity=0.364  Sum_probs=27.4

Q ss_pred             ehhhHHHHHHHHhhcChHHHHHHHHhhccCcC
Q 030744           49 FDLLSLHFVELVCSRKCTEALEFAQTKLTPFG   80 (172)
Q Consensus        49 F~L~~q~fIELir~~~~~eAi~~ar~~l~~~~   80 (172)
                      |-=+-.+.+||++.+.--.|++|+++.+..+.
T Consensus        48 faPYErr~mELLkv~kdKrAlKfaKkRlGth~   79 (98)
T PTZ00196         48 FSPYERRMIELLKVGKDKRALKYAKKRLGTHK   79 (98)
T ss_pred             ccHHHHHHHHHHHhcchHHHHHHHHHHhhhHH
Confidence            44567789999999999999999999987765


No 17 
>KOG0275 consensus Conserved WD40 repeat-containing protein [General function prediction only]
Probab=57.13  E-value=53  Score=28.29  Aligned_cols=131  Identities=15%  Similarity=0.119  Sum_probs=79.0

Q ss_pred             HHHHHHHHHHHcCCHHHHHHHHHHhcHHHHccCCcceehhhHHHHHHHHhhcChHHHHHHHHhhccCcCC----chhhHH
Q 030744           12 EMRKRILHFALEGNALKAIELTEELAQDLLEKNKDLHFDLLSLHFVELVCSRKCTEALEFAQTKLTPFGK----VQKYVE   87 (172)
Q Consensus        12 ~~R~~I~~~I~~G~i~~Ai~~i~~~~p~ll~~~~~l~F~L~~q~fIELir~~~~~eAi~~ar~~l~~~~~----~~~~~~   87 (172)
                      ..--...+.|-+|+||.++..+++..-     -..-...|+.|-.+|||.-..+..|-..+|+. .|..-    .|+-.-
T Consensus        42 DSvd~Fv~dI~sG~WD~VL~~vqsLKL-----P~kkL~dLYEqivlEliELREL~tAR~~lRQT-dpM~~lKQ~~peRy~  115 (508)
T KOG0275|consen   42 DSVDGFVNDINSGHWDTVLKTVQSLKL-----PDKKLIDLYEQIVLELIELRELGTARSLLRQT-DPMIMLKQIQPERYI  115 (508)
T ss_pred             hhHHHHHHhcccCchHHHHHHHHhccC-----chhHHHHHHHHHHHHHHHHHhhhHHHHHHhcc-CceehhhccChHHHH
Confidence            333456678889999999998877531     11224567888899998766666666666632 22221    344444


Q ss_pred             HHHHHHhHhcccCCCCCcchhcCCHHHHHHHHHHHHHHHHHhcCCCCchHHHHHHHHHHHHHHHHHH
Q 030744           88 KLEDFMALLAYEEPEKSPMFHLLSLEYRQHVADNLNRAILAHAERPRYAAMERLIQQTTAVRQCLSQ  154 (172)
Q Consensus        88 ~l~~~~~LLay~~~~~sp~~~ll~~~~r~~la~~vn~aIL~~~g~~~~s~Le~l~~~~~~~~~~l~~  154 (172)
                      .++.++.= .|-|| ...|++--...+|..+|..+..    ..+.-+.|+|-.++-|+-.+++....
T Consensus       116 ~lE~ll~R-~YFDp-~EaY~dssKEkrRa~IAQ~ls~----EV~VVppSRLlaLlGQaLKWQqHQGL  176 (508)
T KOG0275|consen  116 RLENLLNR-SYFDP-REAYGDSSKEKRRAVIAQALSG----EVHVVPPSRLLALLGQALKWQQHQGL  176 (508)
T ss_pred             HHHHHhcc-cccCh-hhhcCcchHHHHHHHHHHHhcC----ceEEcChHHHHHHHHHHhhhHhhcCC
Confidence            55554422 23343 2335553334566777776653    33444669999999998777655443


No 18 
>PF14689 SPOB_a:  Sensor_kinase_SpoOB-type, alpha-helical domain; PDB: 1F51_C 2FTK_B 1IXM_B.
Probab=54.93  E-value=37  Score=21.31  Aligned_cols=31  Identities=26%  Similarity=0.261  Sum_probs=22.3

Q ss_pred             HHHHHHHHHHHHcCCHHHHHHHHHHhcHHHH
Q 030744           11 MEMRKRILHFALEGNALKAIELTEELAQDLL   41 (172)
Q Consensus        11 ~~~R~~I~~~I~~G~i~~Ai~~i~~~~p~ll   41 (172)
                      +..=+-|...+..|++++|.+.+++..-++-
T Consensus        24 ~NhLqvI~gllqlg~~~~a~eYi~~~~~~~~   54 (62)
T PF14689_consen   24 LNHLQVIYGLLQLGKYEEAKEYIKELSKDLQ   54 (62)
T ss_dssp             HHHHHHHHHHHHTT-HHHHHHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHHCCCHHHHHHHHHHHHHHHH
Confidence            4445667788889999999999987765443


No 19 
>PF12895 Apc3:  Anaphase-promoting complex, cyclosome, subunit 3; PDB: 3KAE_D 3Q4A_B 2C2L_D 3Q47_B 3Q49_B 2XPI_A 3ULQ_A.
Probab=53.89  E-value=12  Score=24.25  Aligned_cols=19  Identities=26%  Similarity=0.324  Sum_probs=12.4

Q ss_pred             HHHHHHcCCHHHHHHHHHH
Q 030744           17 ILHFALEGNALKAIELTEE   35 (172)
Q Consensus        17 I~~~I~~G~i~~Ai~~i~~   35 (172)
                      -.-....|+++.|+++++.
T Consensus        32 a~~~~~~~~y~~A~~~~~~   50 (84)
T PF12895_consen   32 AQCYFQQGKYEEAIELLQK   50 (84)
T ss_dssp             HHHHHHTTHHHHHHHHHHC
T ss_pred             HHHHHHCCCHHHHHHHHHH
Confidence            4445666777777777755


No 20 
>PF13432 TPR_16:  Tetratricopeptide repeat; PDB: 3CVP_A 3CVL_A 3CVQ_A 3CV0_A 2GW1_B 3CVN_A 3QKY_A 2PL2_B.
Probab=51.26  E-value=53  Score=19.78  Aligned_cols=56  Identities=21%  Similarity=0.162  Sum_probs=33.2

Q ss_pred             HHHHHHcCCHHHHHHHHHHhcHHHHccCCcceehhhHHHHHHHHhhcChHHHHHHHHhhcc
Q 030744           17 ILHFALEGNALKAIELTEELAQDLLEKNKDLHFDLLSLHFVELVCSRKCTEALEFAQTKLT   77 (172)
Q Consensus        17 I~~~I~~G~i~~Ai~~i~~~~p~ll~~~~~l~F~L~~q~fIELir~~~~~eAi~~ar~~l~   77 (172)
                      -+.++..|+++.|++.+++    +++.+++-. ..+...-.=+...|+..+|+.+-++.+.
T Consensus         4 a~~~~~~g~~~~A~~~~~~----~l~~~P~~~-~a~~~lg~~~~~~g~~~~A~~~~~~a~~   59 (65)
T PF13432_consen    4 ARALYQQGDYDEAIAAFEQ----ALKQDPDNP-EAWYLLGRILYQQGRYDEALAYYERALE   59 (65)
T ss_dssp             HHHHHHCTHHHHHHHHHHH----HHCCSTTHH-HHHHHHHHHHHHTT-HHHHHHHHHHHHH
T ss_pred             HHHHHHcCCHHHHHHHHHH----HHHHCCCCH-HHHHHHHHHHHHcCCHHHHHHHHHHHHH
Confidence            3567889999999988754    445544311 1111111123367899999988886544


No 21 
>PRK01271 4-oxalocrotonate tautomerase; Provisional
Probab=50.48  E-value=25  Score=23.26  Aligned_cols=26  Identities=19%  Similarity=0.301  Sum_probs=23.5

Q ss_pred             CCHHHHHHHHHHHHHHHHHhcCCCCc
Q 030744          110 LSLEYRQHVADNLNRAILAHAERPRY  135 (172)
Q Consensus       110 l~~~~r~~la~~vn~aIL~~~g~~~~  135 (172)
                      .+.++.+.+|++|.+++-+.+|.+++
T Consensus        13 ~s~EqK~~La~~iT~a~~~~lg~~~e   38 (76)
T PRK01271         13 LDEEQKAALAADITDVIIRHLNSKDS   38 (76)
T ss_pred             CCHHHHHHHHHHHHHHHHHHhCcCcc
Confidence            57899999999999999999999863


No 22 
>PF01361 Tautomerase:  Tautomerase enzyme;  InterPro: IPR004370 4-Oxalocrotonate tautomerase (4-OT) catalyzes the isomerisation of beta,gamma-unsaturated enones to their alpha,beta-isomers. The enzyme is part of a plasmid-encoded pathway, which enables bacteria harbouring the plasmid to use various aromatic hydrocarbons as their sole sources of carbon and energy. The enzyme is a barrel-shaped hexamer, which can be viewed as a trimer of dimers. The hexamer contains a hydrophobic core formed by three beta-sheets and surrounded by three pairs of alpha-helices. Each 4-OT monomer of 62 amino acids has a relatively simple beta-alpha-beta fold as described by the structure of the enzyme from Pseudomonas putida []. The monomer begins with a conserved proline at the start of a beta-strand, followed by an alpha-helix and a 310 helix preceding a second parallel beta-strand, and ends with a beta-hairpin near the C terminus. The dimer results from antiparallel interactions between the beta-sheets and alpha-helices of the two monomers, forming a four-stranded beta-sheet with antiparallel alpha-helices on one side, creating two active sites, one at each end of the beta-sheet. Three dimers further associate to form a hexamer by the interactions of the strands of the C-terminal beta-hairpin loops with the edges of the four-stranded beta-sheets of neighbouring dimers, creating a series of cross-links that stabilise the hexamer Pro-1 of the mature protein functions as the general base while Arg-39 and an ordered water molecule each provide a hydrogen bond to the C-2 oxygen of substrate. Arg-39 plays an additional role in the binding of the C-1 carboxylate group. Arg-11 participates both in substrate binding and in catalysis. It interacts with the C-6 carboxylate group, thereby holding the substrate in place and drawing electron density to the C-5 position. The hydrophobic nature of the active site, which lowers the pKa of Pro-1 and provides a favourable environment for catalysis, is largely maintained by Phe-50. Because several Arg residues located near the active site are not conserved among all members of this family and because of the presence of fairly distantly related paralogs in Campylobacter jejuni, the family is regarded as not necessarily uniform in function.; GO: 0016853 isomerase activity, 0006725 cellular aromatic compound metabolic process; PDB: 4OTA_H 4OTC_G 4OTB_J 2FM7_A 1BJP_B 1S0Y_K 3EJ9_E 3EJ7_K 3EJ3_I 3MB2_A ....
Probab=48.43  E-value=25  Score=21.42  Aligned_cols=25  Identities=16%  Similarity=0.286  Sum_probs=21.2

Q ss_pred             CHHHHHHHHHHHHHHHHHhcCCCCc
Q 030744          111 SLEYRQHVADNLNRAILAHAERPRY  135 (172)
Q Consensus       111 ~~~~r~~la~~vn~aIL~~~g~~~~  135 (172)
                      +.+++.+++..+..++.+.+|.|+.
T Consensus        12 ~~e~K~~l~~~it~~~~~~lg~~~~   36 (60)
T PF01361_consen   12 TAEQKRELAEAITDAVVEVLGIPPE   36 (60)
T ss_dssp             -HHHHHHHHHHHHHHHHHHHTS-GG
T ss_pred             CHHHHHHHHHHHHHHHHHHhCcCCC
Confidence            7789999999999999999999754


No 23 
>PF01158 Ribosomal_L36e:  Ribosomal protein L36e;  InterPro: IPR000509 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites [, ]. About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits.  Many ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome [, ]. A number of eukaryotic ribosomal proteins can be grouped on the basis of sequence similarities. The L36E ribosomal family consists of mammalian, Caenorhabditis elegans and Drosophila L36, Candida albicans L39, and yeast YL39 ribosomal proteins [].; GO: 0003735 structural constituent of ribosome, 0006412 translation, 0005622 intracellular, 0005840 ribosome; PDB: 4A1B_Q 4A1D_Q 4A19_Q 4A18_Q 3IZS_k 3IZR_k.
Probab=46.75  E-value=37  Score=23.76  Aligned_cols=32  Identities=28%  Similarity=0.293  Sum_probs=26.9

Q ss_pred             ehhhHHHHHHHHhhcChHHHHHHHHhhccCcC
Q 030744           49 FDLLSLHFVELVCSRKCTEALEFAQTKLTPFG   80 (172)
Q Consensus        49 F~L~~q~fIELir~~~~~eAi~~ar~~l~~~~   80 (172)
                      |-=+-.+-+||++.+.--.|+.|+++.+....
T Consensus        48 faPYEkr~mELlkv~kdKrAlKf~KKRlGth~   79 (98)
T PF01158_consen   48 FAPYEKRAMELLKVSKDKRALKFAKKRLGTHI   79 (98)
T ss_dssp             HCHHHHHHHHHHHHCCHHHHHHHHHHHHCHHH
T ss_pred             CChHHHHHHHHHhcchhHHHHHHHHHHhhhhH
Confidence            34456789999999999999999999987664


No 24 
>KOG2910 consensus Uncharacterized conserved protein predicted to be involved in protein sorting [General function prediction only]
Probab=46.46  E-value=86  Score=24.73  Aligned_cols=63  Identities=21%  Similarity=0.264  Sum_probs=47.2

Q ss_pred             HHHHHHHHHHHcCCHHHHHHHHHHh--cHHHHcc-----------CCcceehhhHHHHHHHHhhcChHHHHHHHHhhc
Q 030744           12 EMRKRILHFALEGNALKAIELTEEL--AQDLLEK-----------NKDLHFDLLSLHFVELVCSRKCTEALEFAQTKL   76 (172)
Q Consensus        12 ~~R~~I~~~I~~G~i~~Ai~~i~~~--~p~ll~~-----------~~~l~F~L~~q~fIELir~~~~~eAi~~ar~~l   76 (172)
                      .+|...|++|+.|+=+.|+-++...  +-+++..           -++++|.....++++=++.|+  +||+-.++.|
T Consensus        41 ~Er~~Ar~lird~rKdrAlllLKkKryQE~Ll~qt~~qL~nlEqmvsdiEft~vqk~V~~gLk~GN--~~lkkl~~~~  116 (209)
T KOG2910|consen   41 AERQLARDLIRDGRKDRALLLLKKKRYQEELLTQTDNQLINLEQMVSDIEFTQVQKKVMEGLKQGN--EALKKLQQEF  116 (209)
T ss_pred             HHHHHHHHHHHhChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHhc
Confidence            4688899999999999998777543  2244442           358999999999999999883  5666666543


No 25 
>PF14973 TINF2_N:  TERF1-interacting nuclear factor 2 N-terminus
Probab=46.43  E-value=73  Score=23.83  Aligned_cols=76  Identities=17%  Similarity=0.261  Sum_probs=45.0

Q ss_pred             HHHHHHHcCCHHHHHHHHHHhcHHHHcc-----CCcceehhhHHHHHHHHhh--cChHHHHHHHHhhcc-CcCCchhhHH
Q 030744           16 RILHFALEGNALKAIELTEELAQDLLEK-----NKDLHFDLLSLHFVELVCS--RKCTEALEFAQTKLT-PFGKVQKYVE   87 (172)
Q Consensus        16 ~I~~~I~~G~i~~Ai~~i~~~~p~ll~~-----~~~l~F~L~~q~fIELir~--~~~~eAi~~ar~~l~-~~~~~~~~~~   87 (172)
                      .|.+++++|.-.. +..|+.|+|.+ ..     .++..-.--...|..++++  .+-.+--.|-++.|. .|+  +.|..
T Consensus        46 lILELc~~~~~~d-l~~I~~Hl~~~-~~~~~~~~~D~~~~~~~~~F~~LV~~Ll~dp~~r~~f~qe~f~~eYG--~~f~~  121 (145)
T PF14973_consen   46 LILELCRQERPWD-LKAIQPHLPRI-PQDPNATSKDHKMEEAHENFCQLVQNLLEDPEERENFFQEVFPQEYG--EPFDA  121 (145)
T ss_pred             HHHHHHhCCCCch-HHHHHHhcccc-cccccccccHHHHHHHHHHHHHHHHHHHhChHHHHHHHHHHHHHHHC--hHHHH
Confidence            4567777665444 99999999998 32     2233334455667777764  344444455554443 444  56666


Q ss_pred             HHHHHHhH
Q 030744           88 KLEDFMAL   95 (172)
Q Consensus        88 ~l~~~~~L   95 (172)
                      .+++++.=
T Consensus       122 ~Le~L~~e  129 (145)
T PF14973_consen  122 ALEKLLWE  129 (145)
T ss_pred             HHHHHHHH
Confidence            66666543


No 26 
>PF07721 TPR_4:  Tetratricopeptide repeat;  InterPro: IPR011717 This entry includes tetratricopeptide-like repeats not detected by the IPR001440 from INTERPRO, IPR013105 from INTERPRO and IPR011716 from INTERPRO models. The tetratricopeptide repeat (TPR) motif is a protein-protein interaction module found in multiple copies in a number of functionally different proteins that facilitates specific interactions with a partner protein(s) [].; GO: 0042802 identical protein binding
Probab=45.92  E-value=27  Score=17.65  Aligned_cols=17  Identities=18%  Similarity=0.155  Sum_probs=13.7

Q ss_pred             HHHHHcCCHHHHHHHHH
Q 030744           18 LHFALEGNALKAIELTE   34 (172)
Q Consensus        18 ~~~I~~G~i~~Ai~~i~   34 (172)
                      +-+...|++++|..++.
T Consensus         9 ~~~~~~G~~~eA~~~l~   25 (26)
T PF07721_consen    9 RALLAQGDPDEAERLLR   25 (26)
T ss_pred             HHHHHcCCHHHHHHHHh
Confidence            45678899999998875


No 27 
>PF12854 PPR_1:  PPR repeat
Probab=45.09  E-value=50  Score=17.80  Aligned_cols=26  Identities=23%  Similarity=0.233  Sum_probs=20.8

Q ss_pred             hHHHHHHHHHHHHcCCHHHHHHHHHH
Q 030744           10 DMEMRKRILHFALEGNALKAIELTEE   35 (172)
Q Consensus        10 ~~~~R~~I~~~I~~G~i~~Ai~~i~~   35 (172)
                      ...--..|.-.-.+|++++|++++++
T Consensus         7 ~~ty~~lI~~~Ck~G~~~~A~~l~~~   32 (34)
T PF12854_consen    7 VVTYNTLIDGYCKAGRVDEAFELFDE   32 (34)
T ss_pred             HhHHHHHHHHHHHCCCHHHHHHHHHh
Confidence            34555678888899999999998865


No 28 
>KOG0396 consensus Uncharacterized conserved protein [Function unknown]
Probab=43.73  E-value=1e+02  Score=26.80  Aligned_cols=86  Identities=15%  Similarity=0.153  Sum_probs=62.5

Q ss_pred             HHHHHHHHHHHcCCHHHHHHHHHHhcHHHHccCCcceehhhHHHHHHHHhhcChHHHHHHHHhhccCcCC---chhhHHH
Q 030744           12 EMRKRILHFALEGNALKAIELTEELAQDLLEKNKDLHFDLLSLHFVELVCSRKCTEALEFAQTKLTPFGK---VQKYVEK   88 (172)
Q Consensus        12 ~~R~~I~~~I~~G~i~~Ai~~i~~~~p~ll~~~~~l~F~L~~q~fIELir~~~~~eAi~~ar~~l~~~~~---~~~~~~~   88 (172)
                      -+|.-+...+++|.|++|..++.+..-+-+   .|+-+...-+.-..-+..|.+.+|+.|-.++=..+.+   .=++.-.
T Consensus       118 l~r~vvdhmlr~gy~~~A~~L~K~s~ledl---vD~Dv~~~~~~I~~sll~~~l~~~Lswc~ehk~~LkK~~S~lEf~lR  194 (389)
T KOG0396|consen  118 LDRFVVDHMLRNGYFGAAVLLGKKSQLEDL---VDSDVYKRAYGIRDSLLAGELEPALSWCKEHKVELKKEESSLEFQLR  194 (389)
T ss_pred             HHHHHHHHHHHcCchhHHHHHHHhhhhhhh---HhHHHHHHHHHHHHHHHhcchHHHHHHHHHHHHHHHhccchhhhHHH
Confidence            357777889999999999999977544322   3344555566677788899999999999877665543   2356777


Q ss_pred             HHHHHhHhcccC
Q 030744           89 LEDFMALLAYEE  100 (172)
Q Consensus        89 l~~~~~LLay~~  100 (172)
                      +++...|+=-++
T Consensus       195 lQefIELi~~~~  206 (389)
T KOG0396|consen  195 LQEFIELIKVDN  206 (389)
T ss_pred             HHHHHHHHHhcc
Confidence            888887776553


No 29 
>PF14691 Fer4_20:  Dihydroprymidine dehydrogenase domain II, 4Fe-4S cluster; PDB: 2VDC_G 1H7X_C 1H7W_A 1GT8_A 1GTE_B 1GTH_B.
Probab=43.65  E-value=33  Score=24.34  Aligned_cols=23  Identities=30%  Similarity=0.307  Sum_probs=13.6

Q ss_pred             HHHHHHHcCCHHHHHHHHHHhcH
Q 030744           16 RILHFALEGNALKAIELTEELAQ   38 (172)
Q Consensus        16 ~I~~~I~~G~i~~Ai~~i~~~~p   38 (172)
                      .+..+|..|++++|++++.+..|
T Consensus        44 ~~i~~i~~g~~~~A~~~i~~~np   66 (111)
T PF14691_consen   44 EYIRLIREGNFKEAYELIREDNP   66 (111)
T ss_dssp             HHHHHHHCT-HHHHHHHHHHH-T
T ss_pred             HHHHHHHCCCHHHHHHHHHHhCC
Confidence            44566677777777777766544


No 30 
>cd00491 4Oxalocrotonate_Tautomerase 4-Oxalocrotonate Tautomerase:  Catalyzes the isomerization of unsaturated ketones. The structure is a homohexamer that is arranged as a trimer of dimers. The hexamer contains six active sites, each formed by residues from three monomers, two from one dimer and the third from a neighboring monomer.  Each monomer is a beta-alpha-beta fold with two small beta strands at the C-terminus that fold back on themselves. A pair of monomers form a dimer with two-fold symmetry, consisting of a 4-stranded beta sheet with two helices on one side and two additional small beta strands at each end. The dimers are assembled around a 3-fold axis of rotation to form a hexamer, with the short beta strands from each dimer contacting the neighboring dimers.
Probab=43.51  E-value=41  Score=20.09  Aligned_cols=25  Identities=12%  Similarity=0.203  Sum_probs=22.5

Q ss_pred             CHHHHHHHHHHHHHHHHHhcCCCCc
Q 030744          111 SLEYRQHVADNLNRAILAHAERPRY  135 (172)
Q Consensus       111 ~~~~r~~la~~vn~aIL~~~g~~~~  135 (172)
                      +.+++.++++.+..++-..+|.|+.
T Consensus        12 t~eqk~~l~~~i~~~l~~~~g~~~~   36 (58)
T cd00491          12 TDEQKRELIERVTEAVSEILGAPEA   36 (58)
T ss_pred             CHHHHHHHHHHHHHHHHHHhCcCcc
Confidence            6899999999999999999999764


No 31 
>TIGR00756 PPR pentatricopeptide repeat domain (PPR motif). This family has a similar consensus to the TPR domain (tetratricopeptide), pfam pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.
Probab=43.31  E-value=39  Score=17.11  Aligned_cols=22  Identities=27%  Similarity=0.312  Sum_probs=18.4

Q ss_pred             HHHHHHHHcCCHHHHHHHHHHh
Q 030744           15 KRILHFALEGNALKAIELTEEL   36 (172)
Q Consensus        15 ~~I~~~I~~G~i~~Ai~~i~~~   36 (172)
                      ..|......|++++|.+++++.
T Consensus         5 ~li~~~~~~~~~~~a~~~~~~M   26 (35)
T TIGR00756         5 TLIDGLCKAGRVEEALELFKEM   26 (35)
T ss_pred             HHHHHHHHCCCHHHHHHHHHHH
Confidence            4577888999999999998765


No 32 
>PRK01964 4-oxalocrotonate tautomerase; Provisional
Probab=42.90  E-value=40  Score=20.85  Aligned_cols=26  Identities=19%  Similarity=0.194  Sum_probs=23.2

Q ss_pred             CCHHHHHHHHHHHHHHHHHhcCCCCc
Q 030744          110 LSLEYRQHVADNLNRAILAHAERPRY  135 (172)
Q Consensus       110 l~~~~r~~la~~vn~aIL~~~g~~~~  135 (172)
                      -+.+++.+++..|..++-..+|.|+.
T Consensus        12 rt~eqk~~l~~~it~~l~~~lg~p~~   37 (64)
T PRK01964         12 RPEEKIKNLIREVTEAISATLDVPKE   37 (64)
T ss_pred             CCHHHHHHHHHHHHHHHHHHhCcChh
Confidence            37799999999999999999999863


No 33 
>PRK07143 hypothetical protein; Provisional
Probab=42.54  E-value=17  Score=30.10  Aligned_cols=24  Identities=13%  Similarity=-0.136  Sum_probs=21.1

Q ss_pred             HHHHHHHHcCCHHHHHHHHHHhcH
Q 030744           15 KRILHFALEGNALKAIELTEELAQ   38 (172)
Q Consensus        15 ~~I~~~I~~G~i~~Ai~~i~~~~p   38 (172)
                      ..||++|.+|+++.|-+++-+.|.
T Consensus       153 T~IR~~l~~G~i~~A~~lLGr~y~  176 (279)
T PRK07143        153 SLLKEFIEFGDIELLNSLLLYNYS  176 (279)
T ss_pred             HHHHHHHHcCCHHHHHHHcCCCcE
Confidence            579999999999999999977663


No 34 
>PRK02220 4-oxalocrotonate tautomerase; Provisional
Probab=42.01  E-value=42  Score=20.37  Aligned_cols=26  Identities=12%  Similarity=0.142  Sum_probs=23.2

Q ss_pred             CCHHHHHHHHHHHHHHHHHhcCCCCc
Q 030744          110 LSLEYRQHVADNLNRAILAHAERPRY  135 (172)
Q Consensus       110 l~~~~r~~la~~vn~aIL~~~g~~~~  135 (172)
                      .+.+++.+++..|..++...+|.|+.
T Consensus        12 rs~eqk~~l~~~it~~l~~~~~~p~~   37 (61)
T PRK02220         12 RTEEQLKALVKDVTAAVSKNTGAPAE   37 (61)
T ss_pred             CCHHHHHHHHHHHHHHHHHHhCcChh
Confidence            47899999999999999999999753


No 35 
>TIGR00013 taut 4-oxalocrotonate tautomerase family enzyme. 4-oxalocrotonate tautomerase is a homohexamer in which each monomer is very small, at about 62 amino acids. Pro-1 of the mature protein serves as a general base. The enzyme functions in meta-cleavage pathways of aromatic hydrocarbon catabolism. Because several Arg residues located near the active site in the crystal structure of Pseudomonas putida are not conserved among all members of this family, because the literature describes a general role in the isomerization of beta,gamma-unsaturated enones to their alpha,beta-isomers, and because of the presence of fairly distantly related paralogs in Campylobacter jejuni, the family is regarded as not necessarily uniform in function.
Probab=41.84  E-value=45  Score=20.38  Aligned_cols=26  Identities=8%  Similarity=0.059  Sum_probs=23.3

Q ss_pred             CCHHHHHHHHHHHHHHHHHhcCCCCc
Q 030744          110 LSLEYRQHVADNLNRAILAHAERPRY  135 (172)
Q Consensus       110 l~~~~r~~la~~vn~aIL~~~g~~~~  135 (172)
                      .+.+++.+++..+..++-..+|.|+.
T Consensus        12 rt~eqK~~l~~~it~~l~~~lg~~~~   37 (63)
T TIGR00013        12 RTDEQKRQLIEGVTEAMAETLGANLE   37 (63)
T ss_pred             CCHHHHHHHHHHHHHHHHHHhCCCcc
Confidence            47899999999999999999999864


No 36 
>COG4105 ComL DNA uptake lipoprotein [General function prediction only]
Probab=41.52  E-value=2e+02  Score=23.65  Aligned_cols=62  Identities=16%  Similarity=0.115  Sum_probs=46.1

Q ss_pred             HHHHHHHHHcCCHHHHHHHHHHhcHHHHccCCcceehhhHHHHHHHH----hhcChHHHHHHHHhhccCcCC
Q 030744           14 RKRILHFALEGNALKAIELTEELAQDLLEKNKDLHFDLLSLHFVELV----CSRKCTEALEFAQTKLTPFGK   81 (172)
Q Consensus        14 R~~I~~~I~~G~i~~Ai~~i~~~~p~ll~~~~~l~F~L~~q~fIELi----r~~~~~eAi~~ar~~l~~~~~   81 (172)
                      =......+..|++++|++..+..    ...++.-.+.  .|--|+++    ++++.++|+.++-..+.-|..
T Consensus        38 Y~~g~~~L~~gn~~~A~~~fe~l----~~~~p~s~~~--~qa~l~l~yA~Yk~~~y~~A~~~~drFi~lyP~  103 (254)
T COG4105          38 YNEGLTELQKGNYEEAIKYFEAL----DSRHPFSPYS--EQAQLDLAYAYYKNGEYDLALAYIDRFIRLYPT  103 (254)
T ss_pred             HHHHHHHHhcCCHHHHHHHHHHH----HHcCCCCccc--HHHHHHHHHHHHhcccHHHHHHHHHHHHHhCCC
Confidence            46788899999999999877553    4555554544  66666665    567999999999888776654


No 37 
>TIGR00083 ribF riboflavin kinase/FMN adenylyltransferase. multifunctional enzyme: riboflavin kinase (EC 2.7.1.26) (flavokinase) / FMN adenylyltransferase (EC 2.7.7.2) (FAD pyrophosphorylase) (FAD synthetase).
Probab=41.34  E-value=27  Score=29.04  Aligned_cols=24  Identities=17%  Similarity=-0.046  Sum_probs=21.4

Q ss_pred             HHHHHHHHcCCHHHHHHHHHHhcH
Q 030744           15 KRILHFALEGNALKAIELTEELAQ   38 (172)
Q Consensus        15 ~~I~~~I~~G~i~~Ai~~i~~~~p   38 (172)
                      ..||++|.+|+++.|-+++-..|.
T Consensus       147 T~IR~~l~~G~i~~A~~lLGr~y~  170 (288)
T TIGR00083       147 SAIRQALKNGDLELANKLLGRPYF  170 (288)
T ss_pred             HHHHHHHHcCCHHHHHHhhhhhhc
Confidence            579999999999999999987664


No 38 
>COG0268 RpsT Ribosomal protein S20 [Translation, ribosomal structure and biogenesis]
Probab=41.07  E-value=48  Score=22.72  Aligned_cols=28  Identities=14%  Similarity=0.096  Sum_probs=23.5

Q ss_pred             HHHHHHHHcCCHHHHHHHHHHhcHHHHc
Q 030744           15 KRILHFALEGNALKAIELTEELAQDLLE   42 (172)
Q Consensus        15 ~~I~~~I~~G~i~~Ai~~i~~~~p~ll~   42 (172)
                      +.+..+|..||.+.|.+.+..-+|.+-.
T Consensus        33 Kk~~~ai~~gd~~~A~~~l~~a~~~idk   60 (88)
T COG0268          33 KKVEAAIEAGDKEAAKAALKEAQKKIDK   60 (88)
T ss_pred             HHHHHHHHcCCHHHHHHHHHHHHHHHHH
Confidence            3456788999999999999999987765


No 39 
>PF04157 EAP30:  EAP30/Vps36 family;  InterPro: IPR007286 EAP30 is a subunit of the ELL complex. The ELL is an 80kDa RNA polymerase II transcription factor. ELL interacts with three other proteins to form the complex known as ELL complex. The ELL complex is capable of increasing that catalytic rate of transcription elongation, but is unable to repress initiation of transcription by RNA polymerase II as is the case of ELL. EAP30 is thought to lead to the derepression of ELL's transcriptional inhibitory activity. ; PDB: 2ZME_A 3CUQ_A 1W7P_D 1U5T_B.
Probab=41.03  E-value=1.8e+02  Score=22.94  Aligned_cols=79  Identities=13%  Similarity=0.125  Sum_probs=56.1

Q ss_pred             hHHHHHHHHhhccCcCCchhhHHHHHHHHhHhcccCCCCCc-chhcC-CHH-HHHHHHHHHHHHHHHhcCCCC-chHHHH
Q 030744           65 CTEALEFAQTKLTPFGKVQKYVEKLEDFMALLAYEEPEKSP-MFHLL-SLE-YRQHVADNLNRAILAHAERPR-YAAMER  140 (172)
Q Consensus        65 ~~eAi~~ar~~l~~~~~~~~~~~~l~~~~~LLay~~~~~sp-~~~ll-~~~-~r~~la~~vn~aIL~~~g~~~-~s~Le~  140 (172)
                      ..-+-+|++++-.....+|++..++.+.+.-|-.++|...+ ....+ +.. ...+||-+|-..++....... .-.|..
T Consensus        39 ~~~l~~fa~k~~~~i~~~~~~r~~f~~~~~~lGvdp~~s~~~~s~~l~~~~~f~~ELa~qi~e~c~~~~~~~GGii~L~d  118 (223)
T PF04157_consen   39 VELLENFARKHKSEIKSDPEFRSQFQSMCASLGVDPLASSKFWSESLKGSGDFYYELAVQIAEVCLATRSKNGGIISLSD  118 (223)
T ss_dssp             HHHHHHHHHHHCCCCCCSHHHHHHHHHHHHHHT--CHCCTTCCCCCCSCHHHHHHHHHHHHHHHHHHHCCTTTSEEEHHH
T ss_pred             HHHHHHHHHHhhccccCCchHHHHHHHHHHHcCCCcccchhhhhhccccchhHHHHHHHHHHHHHHHHHhcCCCEEEHHH
Confidence            34566789988887777899999999999999998886666 33444 554 449999999999998875543 334444


Q ss_pred             HHH
Q 030744          141 LIQ  143 (172)
Q Consensus       141 l~~  143 (172)
                      +++
T Consensus       119 l~~  121 (223)
T PF04157_consen  119 LYC  121 (223)
T ss_dssp             HHH
T ss_pred             HHH
Confidence            444


No 40 
>PF00627 UBA:  UBA/TS-N domain;  InterPro: IPR000449  UBA domains are a commonly occurring sequence motif of approximately 45 amino acid residues that are found in diverse proteins involved in the ubiquitin/proteasome pathway, DNA excision-repair, and cell signalling via protein kinases []. The human homologue of yeast Rad23A is one example of a nucleotide excision-repair protein that contains both an internal and a C-terminal UBA domain. The solution structure of human Rad23A UBA(2) showed that the domain forms a compact three-helix bundle []. Comparison of the structures of UBA(1) and UBA(2) reveals that both form very similar folds and have a conserved large hydrophobic surface patch which may be a common protein-interacting surface present in diverse UBA domains. Evidence that ubiquitin binds to UBA domains leads to the prediction that the hydrophobic surface patch of UBA domains interacts with the hydrophobic surface on the five-stranded beta-sheet of ubiquitin []. This domain is similar in sequence to the N-terminal domain of translation elongation factor EF1B (or EF-Ts) from bacteria, mitochondria and chloroplasts. More information about EF1B (EF-Ts) proteins can be found at Protein of the Month: Elongation Factors [].; GO: 0005515 protein binding; PDB: 2DAI_A 2OO9_C 2JUJ_A 1WHC_A 1YLA_A 2O25_B 3K9O_A 3K9P_A 3F92_A 3E46_A ....
Probab=40.71  E-value=46  Score=18.35  Aligned_cols=19  Identities=16%  Similarity=0.211  Sum_probs=12.1

Q ss_pred             HHHHHHHH--cCCHHHHHHHH
Q 030744           15 KRILHFAL--EGNALKAIELT   33 (172)
Q Consensus        15 ~~I~~~I~--~G~i~~Ai~~i   33 (172)
                      ...+.++.  .|+++.|++|+
T Consensus        17 ~~~~~AL~~~~~nve~A~~~L   37 (37)
T PF00627_consen   17 EQAREALRACNGNVERAVDWL   37 (37)
T ss_dssp             HHHHHHHHHTTTSHHHHHHHH
T ss_pred             HHHHHHHHHcCCCHHHHHHhC
Confidence            34444444  36888888885


No 41 
>KOG3452 consensus 60S ribosomal protein L36 [Translation, ribosomal structure and biogenesis]
Probab=40.39  E-value=53  Score=22.96  Aligned_cols=45  Identities=22%  Similarity=0.263  Sum_probs=32.1

Q ss_pred             ehhhHHHHHHHHhhcChHHHHHHHHhhccCcCCchhhHHHHHHHHhHh
Q 030744           49 FDLLSLHFVELVCSRKCTEALEFAQTKLTPFGKVQKYVEKLEDFMALL   96 (172)
Q Consensus        49 F~L~~q~fIELir~~~~~eAi~~ar~~l~~~~~~~~~~~~l~~~~~LL   96 (172)
                      |-=+-.+-+||++..+-..|++++++.+..+.   .-..+.+++...|
T Consensus        50 ~aPyErr~meLlkvskdkrA~K~lKkRlGth~---RAk~KrEELsnvl   94 (102)
T KOG3452|consen   50 FAPYERRAMELLKVSKDKRALKLLKKRLGTHK---RAKRKREELSNVL   94 (102)
T ss_pred             CChHHHHHHHHHHHcccHHHHHHHHHHhhHHH---HHHHHHHHHHHHH
Confidence            34456678999999999999999999988775   2233444444433


No 42 
>PF12793 SgrR_N:  Sugar transport-related sRNA regulator N-term
Probab=40.28  E-value=55  Score=23.34  Aligned_cols=24  Identities=17%  Similarity=0.286  Sum_probs=20.2

Q ss_pred             HHHHHHHHHcCCHHHHHHHHHHhcH
Q 030744           14 RKRILHFALEGNALKAIELTEELAQ   38 (172)
Q Consensus        14 R~~I~~~I~~G~i~~Ai~~i~~~~p   38 (172)
                      ...++++|.+|+++.|++++. .++
T Consensus        74 ~~~~~~~l~~g~~~~a~~ll~-~~~   97 (115)
T PF12793_consen   74 EQQAEELLEQGKYEQALQLLD-FDQ   97 (115)
T ss_pred             HHHHHHHHHcCCHHHHHHHHH-hCH
Confidence            467889999999999999998 444


No 43 
>PF07035 Mic1:  Colon cancer-associated protein Mic1-like;  InterPro: IPR009755 This entry represents the C terminus (approximately 160 residues) of a number of proteins that resemble colon cancer-associated protein Mic1.
Probab=39.69  E-value=54  Score=25.11  Aligned_cols=62  Identities=16%  Similarity=0.141  Sum_probs=38.7

Q ss_pred             HHHHHHHHHcCCHHHHHHHHHH--hcHHHHccCCcceehhh-HHH-HHH-HHhhcChHHHHHHHHhh
Q 030744           14 RKRILHFALEGNALKAIELTEE--LAQDLLEKNKDLHFDLL-SLH-FVE-LVCSRKCTEALEFAQTK   75 (172)
Q Consensus        14 R~~I~~~I~~G~i~~Ai~~i~~--~~p~ll~~~~~l~F~L~-~q~-fIE-Lir~~~~~eAi~~ar~~   75 (172)
                      ++-|+..+..-...-|..++.-  .+|...+..-|+...|. .-. -+| |+..|++-+|+.|+|+.
T Consensus        49 ~qllq~~Vi~DSk~lA~~LLs~~~~~~~~~Ql~lDMLkRL~~~~~~iievLL~~g~vl~ALr~ar~~  115 (167)
T PF07035_consen   49 HQLLQYHVIPDSKPLACQLLSLGNQYPPAYQLGLDMLKRLGTAYEEIIEVLLSKGQVLEALRYARQY  115 (167)
T ss_pred             HHHHhhcccCCcHHHHHHHHHhHccChHHHHHHHHHHHHhhhhHHHHHHHHHhCCCHHHHHHHHHHc
Confidence            3444444444455555555543  45666665556666665 333 345 77889999999999974


No 44 
>PF04121 Nup84_Nup100:  Nuclear pore protein 84 / 107 ;  InterPro: IPR007252 Nup84p forms a complex with five proteins, including Nup120p, Nup85p, Sec13p, and a Sec13p homolog. This Nup84p complex in conjunction with Sec13-type proteins is required for correct nuclear pore biogenesis [].; GO: 0006810 transport, 0005643 nuclear pore; PDB: 3CQC_A 3CQG_A 3I4R_A 3IKO_I 3JRO_C.
Probab=38.95  E-value=43  Score=31.25  Aligned_cols=29  Identities=24%  Similarity=0.252  Sum_probs=22.0

Q ss_pred             HHhHHHHHHHHHHHHcCCHHHHHHHHHHh
Q 030744            8 LEDMEMRKRILHFALEGNALKAIELTEEL   36 (172)
Q Consensus         8 ~~~~~~R~~I~~~I~~G~i~~Ai~~i~~~   36 (172)
                      ...-..=+.|-.+|++|++++|.+||++.
T Consensus       131 ~~e~~~~~~i~~llR~G~~~eA~~lc~~~  159 (697)
T PF04121_consen  131 EDERALLKYIFELLRAGRIEEAQELCRER  159 (697)
T ss_dssp             HHHHHHHHHHHHHHHTT-HHHHHHHHHHT
T ss_pred             HHHHHHHHHHHHHHHCCCHHHHHHHHHHC
Confidence            33344456788999999999999999875


No 45 
>KOG2235 consensus Uncharacterized conserved protein [Function unknown]
Probab=38.30  E-value=72  Score=29.69  Aligned_cols=65  Identities=15%  Similarity=0.242  Sum_probs=43.8

Q ss_pred             HHHhhcChHHHHHHHHhhccCcCCchhhHHHHHHHHhHhcccCCCC-CcchhcCCHHHHHHHHHHHHHHHHHh
Q 030744           58 ELVCSRKCTEALEFAQTKLTPFGKVQKYVEKLEDFMALLAYEEPEK-SPMFHLLSLEYRQHVADNLNRAILAH  129 (172)
Q Consensus        58 ELir~~~~~eAi~~ar~~l~~~~~~~~~~~~l~~~~~LLay~~~~~-sp~~~ll~~~~r~~la~~vn~aIL~~  129 (172)
                      |++..|.....++.+.+.--.       ...++.....+.-.++.- -..+.+++.+.|.++|++||..+-+.
T Consensus        66 El~v~GgRaslvDla~tlnVD-------l~hIEk~a~dIv~~d~~v~Lv~geiide~Y~d~iaeEinekLqE~  131 (776)
T KOG2235|consen   66 ELIVAGGRASLVDLAVTLNVD-------LDHIEKTARDIVSTDDEVTLVLGEIIDEEYVDRIAEEINEKLQEQ  131 (776)
T ss_pred             HHHHhCCcchhHHHHHHhCcC-------HHHHHHHHHHHhhcCCceEEehhhhhhHHHHHHHHHHHHHHHHHh
Confidence            444445444555555543222       346777777777777654 35689999999999999999876554


No 46 
>PF06957 COPI_C:  Coatomer (COPI) alpha subunit C-terminus;  InterPro: IPR010714 Proteins synthesised on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. This traffic is bidirectional, to ensure that proteins required to form vesicles are recycled. Vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transfer []. While clathrin mediates endocytic protein transport, and transport from ER to Golgi, coatomers primarily mediate intra-Golgi transport, as well as the reverse Golgi to ER transport of dilysine-tagged proteins []. For example, the coatomer COP1 (coat protein complex 1) is responsible for reverse transport of recycled proteins from Golgi and pre-Golgi compartments back to the ER, while COPII buds vesicles from the ER to the Golgi []. Coatomers reversibly associate with Golgi (non-clathrin-coated) vesicles to mediate protein transport and for budding from Golgi membranes []. Activated small guanine triphosphatases (GTPases) attract coat proteins to specific membrane export sites, thereby linking coatomers to export cargos. As coat proteins polymerise, vesicles are formed and budded from membrane-bound organelles. Coatomer complexes also influence Golgi structural integrity, as well as the processing, activity, and endocytic recycling of LDL receptors. In mammals, coatomer complexes can only be recruited by membranes associated to ADP-ribosylation factors (ARFs), which are small GTP-binding proteins. Coatomer complexes are hetero-oligomers composed of at least an alpha, beta, beta', gamma, delta, epsilon and zeta subunits.  This entry represents the C terminus (approximately 500 residues) of the eukaryotic coatomer alpha subunit [, ]. This domain is found along with the IPR006692 from INTERPRO domain. More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0005198 structural molecule activity, 0005515 protein binding, 0006886 intracellular protein transport, 0016192 vesicle-mediated transport, 0030126 COPI vesicle coat; PDB: 3MKR_B 3MV2_E 3MKQ_B 3MV3_A.
Probab=38.16  E-value=1.5e+02  Score=26.13  Aligned_cols=26  Identities=12%  Similarity=0.082  Sum_probs=19.7

Q ss_pred             HHHHHHHHHHHHcCCHHHHHHHHHHh
Q 030744           11 MEMRKRILHFALEGNALKAIELTEEL   36 (172)
Q Consensus        11 ~~~R~~I~~~I~~G~i~~Ai~~i~~~   36 (172)
                      .++-...-++|..|.++.|+.++++.
T Consensus       119 ~~nS~laadhvAAGsFetAm~LLnrQ  144 (422)
T PF06957_consen  119 VRNSSLAADHVAAGSFETAMQLLNRQ  144 (422)
T ss_dssp             HHH--SHHHHHHCT-HHHHHHHHHHH
T ss_pred             HHcCCcHHHHHHhCCHHHHHHHHHHH
Confidence            44555677999999999999999886


No 47 
>KOG3060 consensus Uncharacterized conserved protein [Function unknown]
Probab=38.11  E-value=1.3e+02  Score=25.07  Aligned_cols=65  Identities=26%  Similarity=0.228  Sum_probs=48.1

Q ss_pred             cCCHHHHHHHHHHhcHHHHccCCcceehhhHHHHHHHHhhcChHHHHHHHHhhccCcCCchhhHHHHHHH
Q 030744           23 EGNALKAIELTEELAQDLLEKNKDLHFDLLSLHFVELVCSRKCTEALEFAQTKLTPFGKVQKYVEKLEDF   92 (172)
Q Consensus        23 ~G~i~~Ai~~i~~~~p~ll~~~~~l~F~L~~q~fIELir~~~~~eAi~~ar~~l~~~~~~~~~~~~l~~~   92 (172)
                      .|++++|++..+    .+++++ ...+..+..+..=+--.|+-.+||+-..+++..|-.|.+-..++.++
T Consensus        99 ~~~~~~A~e~y~----~lL~dd-pt~~v~~KRKlAilka~GK~l~aIk~ln~YL~~F~~D~EAW~eLaei  163 (289)
T KOG3060|consen   99 TGNYKEAIEYYE----SLLEDD-PTDTVIRKRKLAILKAQGKNLEAIKELNEYLDKFMNDQEAWHELAEI  163 (289)
T ss_pred             hhchhhHHHHHH----HHhccC-cchhHHHHHHHHHHHHcCCcHHHHHHHHHHHHHhcCcHHHHHHHHHH
Confidence            588888887764    455664 56777788777766667888899999999988887766666665554


No 48 
>PF07208 DUF1414:  Protein of unknown function (DUF1414);  InterPro: IPR009857 This family consists of several hypothetical bacterial proteins of around 70 residues in length. Members of this family are often referred to as YejL. The function of this family is unknown.; PDB: 2JPQ_A 2JUZ_B 2JUW_B 2QTI_A 2OTA_A 2JR2_A 2JRX_A.
Probab=37.15  E-value=60  Score=19.31  Aligned_cols=20  Identities=20%  Similarity=0.351  Sum_probs=16.6

Q ss_pred             CCHHHHHHHHHHHHHHHHHh
Q 030744          110 LSLEYRQHVADNLNRAILAH  129 (172)
Q Consensus       110 l~~~~r~~la~~vn~aIL~~  129 (172)
                      +.+++|+.+|+.+..++..+
T Consensus        24 V~~~qR~~iAe~Fa~AL~~S   43 (44)
T PF07208_consen   24 VPPAQRQAIAEKFAQALKSS   43 (44)
T ss_dssp             S-HHHHHHHHHHHHHHHHHH
T ss_pred             CCHHHHHHHHHHHHHHHHhc
Confidence            46789999999999998765


No 49 
>COG5443 FlbT Flagellar biosynthesis regulator FlbT [Cell motility and secretion]
Probab=37.13  E-value=59  Score=24.06  Aligned_cols=31  Identities=26%  Similarity=0.305  Sum_probs=27.0

Q ss_pred             HHhHHHHHHHHHHHHcCCHHHHHHHHHHhcH
Q 030744            8 LEDMEMRKRILHFALEGNALKAIELTEELAQ   38 (172)
Q Consensus         8 ~~~~~~R~~I~~~I~~G~i~~Ai~~i~~~~p   38 (172)
                      -+-+..-+.|-.++.+|.+-+|++.++..||
T Consensus        92 ~eil~~lk~Id~lV~~~~~feALkaiR~lyp  122 (148)
T COG5443          92 AEILAALKRIDGLVMAGRAFEALKAIRGLYP  122 (148)
T ss_pred             HHHHHHHHHHHHHHhccHHHHHHHHHhhhch
Confidence            4556778889999999999999999999998


No 50 
>PF10827 DUF2552:  Protein of unknown function (DUF2552) ;  InterPro: IPR020157 This entry contains proteins with no known function.
Probab=37.09  E-value=23  Score=23.31  Aligned_cols=16  Identities=19%  Similarity=0.034  Sum_probs=14.0

Q ss_pred             CHHHHHHHHHHhcHHH
Q 030744           25 NALKAIELTEELAQDL   40 (172)
Q Consensus        25 ~i~~Ai~~i~~~~p~l   40 (172)
                      -++.|++||.++.|.+
T Consensus        60 tld~Ai~Wi~e~M~~i   75 (79)
T PF10827_consen   60 TLDLAIAWIGEHMPHI   75 (79)
T ss_pred             cHHHHHHHHHhcccch
Confidence            4799999999999875


No 51 
>PRK00745 4-oxalocrotonate tautomerase; Provisional
Probab=36.83  E-value=60  Score=19.75  Aligned_cols=25  Identities=12%  Similarity=0.332  Sum_probs=22.7

Q ss_pred             CCHHHHHHHHHHHHHHHHHhcCCCC
Q 030744          110 LSLEYRQHVADNLNRAILAHAERPR  134 (172)
Q Consensus       110 l~~~~r~~la~~vn~aIL~~~g~~~  134 (172)
                      .+.+++.+|+..|..++...+|.|+
T Consensus        12 rs~eqk~~l~~~it~~l~~~~~~p~   36 (62)
T PRK00745         12 RTVEQKRKLVEEITRVTVETLGCPP   36 (62)
T ss_pred             CCHHHHHHHHHHHHHHHHHHcCCCh
Confidence            3789999999999999999999976


No 52 
>COG5051 RPL36A Ribosomal protein L36E [Translation, ribosomal structure and biogenesis]
Probab=36.68  E-value=78  Score=21.71  Aligned_cols=44  Identities=16%  Similarity=0.301  Sum_probs=33.1

Q ss_pred             hhhHHHHHHHHhhcChHHHHHHHHhhccCcCCchhhHHHHHHHHhHh
Q 030744           50 DLLSLHFVELVCSRKCTEALEFAQTKLTPFGKVQKYVEKLEDFMALL   96 (172)
Q Consensus        50 ~L~~q~fIELir~~~~~eAi~~ar~~l~~~~~~~~~~~~l~~~~~LL   96 (172)
                      .=+-.+.|+||++.+-..|-+.+++.|..+.   .-..+++++-..|
T Consensus        51 sPyErr~i~Lirns~~krArKlakKRLGs~k---RAkaKvEel~~~i   94 (97)
T COG5051          51 SPYERRVIELIRNSQDKRARKLAKKRLGSLK---RAKAKVEELTSVI   94 (97)
T ss_pred             CHHHHHHHHHHHhcccHHHHHHHHHHhhhHH---HHHHHHHHHHHHH
Confidence            3356779999999999999999999998875   4455566655544


No 53 
>PF07079 DUF1347:  Protein of unknown function (DUF1347);  InterPro: IPR010764 This family consists of several hypothetical bacterial proteins of around 610 residues in length. Members of this family are highly conserved and seem to be specific to Chlamydia species. The function of this family is unknown.
Probab=36.19  E-value=2.9e+02  Score=25.06  Aligned_cols=51  Identities=16%  Similarity=0.115  Sum_probs=40.6

Q ss_pred             HHHHHHHHHcCCHHHHHHHHHHhcHHHHccCC--------cceehhhHHHHHHHHhhcC
Q 030744           14 RKRILHFALEGNALKAIELTEELAQDLLEKNK--------DLHFDLLSLHFVELVCSRK   64 (172)
Q Consensus        14 R~~I~~~I~~G~i~~Ai~~i~~~~p~ll~~~~--------~l~F~L~~q~fIELir~~~   64 (172)
                      -..+-.+|..|.+.++...+|+..|.+++...        .+...|-+-.|+|+.....
T Consensus       132 ~i~a~sLIe~g~f~EgR~iLn~i~~~llkrE~~w~~d~yd~~vlmlsrSYfLEl~e~~s  190 (549)
T PF07079_consen  132 EIEAHSLIETGRFSEGRAILNRIIERLLKRECEWNSDMYDRAVLMLSRSYFLELKESMS  190 (549)
T ss_pred             HHHHHHHHhcCCcchHHHHHHHHHHHHhhhhhcccHHHHHHHHHHHhHHHHHHHHHhcc
Confidence            45567889999999999999999999999532        3555777888899876543


No 54 
>PF13934 ELYS:  Nuclear pore complex assembly
Probab=36.14  E-value=79  Score=25.16  Aligned_cols=16  Identities=25%  Similarity=0.387  Sum_probs=7.9

Q ss_pred             HhhcChHHHHHHHHhh
Q 030744           60 VCSRKCTEALEFAQTK   75 (172)
Q Consensus        60 ir~~~~~eAi~~ar~~   75 (172)
                      +.++.+.||+.|.|++
T Consensus       151 La~~~v~EAf~~~R~~  166 (226)
T PF13934_consen  151 LANGLVTEAFSFQRSY  166 (226)
T ss_pred             HHcCCHHHHHHHHHhC
Confidence            4445555555555543


No 55 
>KOG1156 consensus N-terminal acetyltransferase [Chromatin structure and dynamics]
Probab=35.49  E-value=3.8e+02  Score=25.20  Aligned_cols=127  Identities=19%  Similarity=0.216  Sum_probs=79.3

Q ss_pred             HHHHHHHHHHHHcCCHHHHHHHHHHhcHHHHccCCcceehhhHHHHHHHHhhcChHHHHHHHHhhccCcCCchhhHHHHH
Q 030744           11 MEMRKRILHFALEGNALKAIELTEELAQDLLEKNKDLHFDLLSLHFVELVCSRKCTEALEFAQTKLTPFGKVQKYVEKLE   90 (172)
Q Consensus        11 ~~~R~~I~~~I~~G~i~~Ai~~i~~~~p~ll~~~~~l~F~L~~q~fIELir~~~~~eAi~~ar~~l~~~~~~~~~~~~l~   90 (172)
                      ...-.+++-.+.+|..+.|++.+..+-|.+++.   +.|....-.+  +.+-+++++|+...+..+.....|-.+...++
T Consensus       186 e~~Ly~n~i~~E~g~~q~ale~L~~~e~~i~Dk---la~~e~ka~l--~~kl~~lEeA~~~y~~Ll~rnPdn~~Yy~~l~  260 (700)
T KOG1156|consen  186 ELLLYQNQILIEAGSLQKALEHLLDNEKQIVDK---LAFEETKADL--LMKLGQLEEAVKVYRRLLERNPDNLDYYEGLE  260 (700)
T ss_pred             HHHHHHHHHHHHcccHHHHHHHHHhhhhHHHHH---HHHhhhHHHH--HHHHhhHHhHHHHHHHHHhhCchhHHHHHHHH
Confidence            334556778899999999999999999988875   2333333332  23457889999999998888765555666666


Q ss_pred             HHHh--------H-hccc----CC--CCCc----chhcCCHHHHHHHHHHHHHHHHHhcCCCC-chHHHHHHHH
Q 030744           91 DFMA--------L-LAYE----EP--EKSP----MFHLLSLEYRQHVADNLNRAILAHAERPR-YAAMERLIQQ  144 (172)
Q Consensus        91 ~~~~--------L-Lay~----~~--~~sp----~~~ll~~~~r~~la~~vn~aIL~~~g~~~-~s~Le~l~~~  144 (172)
                      .+++        + .+|.    .+  ..+|    ...+-..+-+..+..-++..+-.  |+|+ -+.|+.+.+.
T Consensus       261 ~~lgk~~d~~~~lk~ly~~ls~~y~r~e~p~Rlplsvl~~eel~~~vdkyL~~~l~K--g~p~vf~dl~SLyk~  332 (700)
T KOG1156|consen  261 KALGKIKDMLEALKALYAILSEKYPRHECPRRLPLSVLNGEELKEIVDKYLRPLLSK--GVPSVFKDLRSLYKD  332 (700)
T ss_pred             HHHHHHhhhHHHHHHHHHHHhhcCcccccchhccHHHhCcchhHHHHHHHHHHHhhc--CCCchhhhhHHHHhc
Confidence            6664        1 1222    11  1234    33333455666666666655443  3443 4566666655


No 56 
>PRK04984 fatty acid metabolism regulator; Provisional
Probab=35.16  E-value=82  Score=24.70  Aligned_cols=24  Identities=8%  Similarity=-0.050  Sum_probs=11.6

Q ss_pred             HHHHHHHhhcChHHHHHHHHhhcc
Q 030744           54 LHFVELVCSRKCTEALEFAQTKLT   77 (172)
Q Consensus        54 q~fIELir~~~~~eAi~~ar~~l~   77 (172)
                      .+.++.|+.|+.+.|.+..+.|+.
T Consensus       194 ~~I~~Ai~~~D~~~a~~~~~~H~~  217 (239)
T PRK04984        194 HKLSALCEEGNHDQVPECVRQYGI  217 (239)
T ss_pred             HHHHHHHHhCCHHHHHHHHHHHHH
Confidence            344445555555555444444443


No 57 
>PRK00794 flbT flagellar biosynthesis repressor FlbT; Reviewed
Probab=34.50  E-value=88  Score=23.02  Aligned_cols=30  Identities=17%  Similarity=0.106  Sum_probs=23.8

Q ss_pred             HhHHHHHHHHHHHHcCCHHHHHHHHHHhcH
Q 030744            9 EDMEMRKRILHFALEGNALKAIELTEELAQ   38 (172)
Q Consensus         9 ~~~~~R~~I~~~I~~G~i~~Ai~~i~~~~p   38 (172)
                      +....=..|.++|.+|++-+|++.+...+|
T Consensus        92 ~~~~~l~~i~~~V~~g~~y~ALk~lR~L~~  121 (132)
T PRK00794         92 DILAGLKAIDELVEAGRYYEALKALRGLYP  121 (132)
T ss_pred             HHHHHHHHHHHHHHCCcHHHHHHHHHHhhH
Confidence            334555678889999999999999988877


No 58 
>PF14276 DUF4363:  Domain of unknown function (DUF4363)
Probab=33.73  E-value=1.7e+02  Score=20.54  Aligned_cols=47  Identities=17%  Similarity=0.072  Sum_probs=38.5

Q ss_pred             HHHHHHHHHHcCCHHHHHHHHHHhcHHHHccCCcceehhhHHHHHHH
Q 030744           13 MRKRILHFALEGNALKAIELTEELAQDLLEKNKDLHFDLLSLHFVEL   59 (172)
Q Consensus        13 ~R~~I~~~I~~G~i~~Ai~~i~~~~p~ll~~~~~l~F~L~~q~fIEL   59 (172)
                      .=..+.+.|.+++|+.|.+.+.+..-.--+..+.+.|.+..+++=++
T Consensus        31 ~l~~i~~~i~~~dW~~A~~~~~~l~~~W~k~~~~~~~~~~h~eid~i   77 (121)
T PF14276_consen   31 QLEQIEEAIENEDWEKAYKETEELEKEWDKNKKRWSILIEHQEIDNI   77 (121)
T ss_pred             HHHHHHHHHHhCCHHHHHHHHHHHHHHHHhhchheeeeecHHHHHHH
Confidence            35678899999999999999988777767777788888888886443


No 59 
>PRK05627 bifunctional riboflavin kinase/FMN adenylyltransferase; Reviewed
Probab=32.94  E-value=34  Score=28.66  Aligned_cols=24  Identities=21%  Similarity=0.043  Sum_probs=21.1

Q ss_pred             HHHHHHHHcCCHHHHHHHHHHhcH
Q 030744           15 KRILHFALEGNALKAIELTEELAQ   38 (172)
Q Consensus        15 ~~I~~~I~~G~i~~Ai~~i~~~~p   38 (172)
                      ..||++|.+|+++.|-+++-..|-
T Consensus       164 T~IR~~I~~G~i~~A~~lLg~~y~  187 (305)
T PRK05627        164 TAIRQALAEGDLELANKLLGRPYS  187 (305)
T ss_pred             HHHHHHHHcCCHHHHHhhhcCCCc
Confidence            579999999999999999977664


No 60 
>PF12931 Sec16_C:  Sec23-binding domain of Sec16; PDB: 3MZK_C.
Probab=32.93  E-value=46  Score=27.37  Aligned_cols=20  Identities=35%  Similarity=0.517  Sum_probs=12.1

Q ss_pred             HHHHHHHcCCHHHHHHHHHH
Q 030744           16 RILHFALEGNALKAIELTEE   35 (172)
Q Consensus        16 ~I~~~I~~G~i~~Ai~~i~~   35 (172)
                      +|+++++.|+.++|++++-+
T Consensus         1 ~I~~~Ll~G~~~~Av~~al~   20 (284)
T PF12931_consen    1 KIQQLLLVGNREEAVELALD   20 (284)
T ss_dssp             HHHHHHHTT-HHHHHHHHHH
T ss_pred             CHHHHHhCCCHHHHHHHHHH
Confidence            46666777777777666643


No 61 
>PF13371 TPR_9:  Tetratricopeptide repeat
Probab=32.82  E-value=1.2e+02  Score=18.48  Aligned_cols=55  Identities=24%  Similarity=0.161  Sum_probs=37.4

Q ss_pred             HHHcCCHHHHHHHHHHhcHHHHccCCcceehhhHHHHHHHHhhcChHHHHHHHHhhccCc
Q 030744           20 FALEGNALKAIELTEELAQDLLEKNKDLHFDLLSLHFVELVCSRKCTEALEFAQTKLTPF   79 (172)
Q Consensus        20 ~I~~G~i~~Ai~~i~~~~p~ll~~~~~l~F~L~~q~fIELir~~~~~eAi~~ar~~l~~~   79 (172)
                      ++.+|+++.|++.++..    +..+|+ ...++...=.=+.+.|+..+|++...+.+...
T Consensus         5 ~~~~~~~~~A~~~~~~~----l~~~p~-~~~~~~~~a~~~~~~g~~~~A~~~l~~~l~~~   59 (73)
T PF13371_consen    5 YLQQEDYEEALEVLERA----LELDPD-DPELWLQRARCLFQLGRYEEALEDLERALELS   59 (73)
T ss_pred             HHhCCCHHHHHHHHHHH----HHhCcc-cchhhHHHHHHHHHhccHHHHHHHHHHHHHHC
Confidence            57889999999988653    344444 33344444444567789999999988776543


No 62 
>PF07378 FlbT:  Flagellar protein FlbT;  InterPro: IPR009967 This family consists of several FlbT proteins. FlbT is a post-transcriptional repressor function in flagellum biogenesis. FlbT is associated with the 5' untranslated region (UTR) of fljK (25 kDa flagellin) mRNA and that this association requires a predicted loop structure in the transcript. Mutations within this loop abolish FlbT association and result in increased mRNA stability. It is therefore thought that FlbT promotes the degradation of flagellin mRNA by associating with the 5' UTR [].; GO: 0048027 mRNA 5'-UTR binding, 0006402 mRNA catabolic process, 0045718 negative regulation of flagellum assembly
Probab=32.48  E-value=87  Score=22.83  Aligned_cols=32  Identities=16%  Similarity=0.068  Sum_probs=25.7

Q ss_pred             cHHhHHHHHHHHHHHHcCCHHHHHHHHHHhcH
Q 030744            7 CLEDMEMRKRILHFALEGNALKAIELTEELAQ   38 (172)
Q Consensus         7 ~~~~~~~R~~I~~~I~~G~i~~Ai~~i~~~~p   38 (172)
                      +.+....=..|.++|.+|++-+|++.+...+|
T Consensus        88 ~p~~~~~l~~~~~~v~~g~~y~ALk~~R~L~~  119 (126)
T PF07378_consen   88 DPDAREGLDEANELVEAGRYYKALKALRKLIP  119 (126)
T ss_pred             CHHHHHHHHHHHHHHHCCcHHHHHHHHHHhHH
Confidence            44555666778899999999999999988876


No 63 
>PF13041 PPR_2:  PPR repeat family 
Probab=32.47  E-value=63  Score=18.63  Aligned_cols=24  Identities=21%  Similarity=0.295  Sum_probs=18.9

Q ss_pred             HHHHHHHHHcCCHHHHHHHHHHhc
Q 030744           14 RKRILHFALEGNALKAIELTEELA   37 (172)
Q Consensus        14 R~~I~~~I~~G~i~~Ai~~i~~~~   37 (172)
                      =..|.-....|++++|.++.++.-
T Consensus         7 n~li~~~~~~~~~~~a~~l~~~M~   30 (50)
T PF13041_consen    7 NTLISGYCKAGKFEEALKLFKEMK   30 (50)
T ss_pred             HHHHHHHHHCcCHHHHHHHHHHHH
Confidence            346778888999999999887653


No 64 
>PF12169 DNA_pol3_gamma3:  DNA polymerase III subunits gamma and tau domain III;  InterPro: IPR022754  This domain is found in bacteria and eukaryotes, and is approximately 110 amino acids in length. It is found in association with PF00004 from PFAM. This domain is also present in the tau subunit before it undergoes cleavage. Domains I-III are shared between the tau and the gamma subunits, while most of the DnaB-binding Domain IV and all of the alpha-interacting Domain V are unique to tau. ; GO: 0003887 DNA-directed DNA polymerase activity; PDB: 1NJF_B 3GLG_G 1XXH_I 1NJG_A 3GLF_B 3GLI_G.
Probab=32.40  E-value=1.1e+02  Score=21.75  Aligned_cols=23  Identities=43%  Similarity=0.563  Sum_probs=14.2

Q ss_pred             HHHHHHHHcCCHHHHHHHHHHhc
Q 030744           15 KRILHFALEGNALKAIELTEELA   37 (172)
Q Consensus        15 ~~I~~~I~~G~i~~Ai~~i~~~~   37 (172)
                      ..+-++|.+||..+|+..+++.+
T Consensus        19 ~~l~~ai~~~d~~~~l~~~~~l~   41 (143)
T PF12169_consen   19 FELLDAILEGDAAEALELLNELL   41 (143)
T ss_dssp             HHHHHHHHTT-HHHHHHHHHHHH
T ss_pred             HHHHHHHHcCCHHHHHHHHHHHH
Confidence            34566777777777777776544


No 65 
>PF13812 PPR_3:  Pentatricopeptide repeat domain
Probab=31.98  E-value=75  Score=16.15  Aligned_cols=23  Identities=22%  Similarity=0.284  Sum_probs=18.1

Q ss_pred             HHHHHHHHHcCCHHHHHHHHHHh
Q 030744           14 RKRILHFALEGNALKAIELTEEL   36 (172)
Q Consensus        14 R~~I~~~I~~G~i~~Ai~~i~~~   36 (172)
                      ...|+-....|+++.|.++++..
T Consensus         5 ~~ll~a~~~~g~~~~a~~~~~~M   27 (34)
T PF13812_consen    5 NALLRACAKAGDPDAALQLFDEM   27 (34)
T ss_pred             HHHHHHHHHCCCHHHHHHHHHHH
Confidence            34577888899999999888653


No 66 
>PF01535 PPR:  PPR repeat;  InterPro: IPR002885 This entry represents the PPR repeat. Pentatricopeptide repeat (PPR) proteins are characterised by tandem repeats of a degenerate 35 amino acid motif []. Most of PPR proteins have roles in mitochondria or plastid []. PPR repeats were discovered while screening Arabidopsis proteins for those predicted to be targeted to mitochondria or chloroplast [, ]. Some of these proteins have been shown to play a role in post-transcriptional processes within organelles and they are thought to be sequence-specific RNA-binding proteins [, , ]. Plant genomes have between one hundred to five hundred PPR genes per genome whereas non-plant genomes encode two to six PPR proteins. Although no PPR structures are yet known, the motif is predicted to fold into a helix-turn-helix structure similar to those found in the tetratricopeptide repeat (TPR) family (see PDOC50005 from PROSITEDOC) [].  The plant PPR protein family has been divided in two subfamilies on the basis of their motif content and organisation [, ]. Examples of PPR repeat-containing proteins include PET309 P32522 from SWISSPROT, which may be involved in RNA stabilisation [], and crp1, which is involved in RNA processing []. The repeat is associated with a predicted plant protein O49549 from SWISSPROT that has a domain organisation similar to the human BRCA1 protein.
Probab=31.72  E-value=60  Score=16.09  Aligned_cols=22  Identities=23%  Similarity=0.317  Sum_probs=17.2

Q ss_pred             HHHHHHHHcCCHHHHHHHHHHh
Q 030744           15 KRILHFALEGNALKAIELTEEL   36 (172)
Q Consensus        15 ~~I~~~I~~G~i~~Ai~~i~~~   36 (172)
                      ..|.-....|++++|.++.++.
T Consensus         5 ~li~~~~~~~~~~~a~~~~~~M   26 (31)
T PF01535_consen    5 SLISGYCKMGQFEEALEVFDEM   26 (31)
T ss_pred             HHHHHHHccchHHHHHHHHHHH
Confidence            3466778889999999888663


No 67 
>TIGR01470 cysG_Nterm siroheme synthase, N-terminal domain. This model represents a subfamily of CysG N-terminal region-related sequences. All sequences in the seed alignment for this model are N-terminal regions of known or predicted siroheme synthases. The C-terminal region of each is uroporphyrin-III C-methyltransferase (EC 2.1.1.107), which catalyzes the first step committed to the biosynthesis of either siroheme or cobalamin (vitamin B12) rather than protoheme (heme). The region represented by this model completes the process of oxidation and iron insertion to yield siroheme. Siroheme is a cofactor for nitrite and sulfite reductases, so siroheme synthase is CysG of cysteine biosynthesis in some organisms.
Probab=31.63  E-value=1.2e+02  Score=23.59  Aligned_cols=64  Identities=16%  Similarity=0.125  Sum_probs=42.2

Q ss_pred             HHHHHHHHHHHcCCHHHHHHHHHHhcHHHHccCC------cceehhhHHHHHHHHhhcChHHHHHHHHhhc
Q 030744           12 EMRKRILHFALEGNALKAIELTEELAQDLLEKNK------DLHFDLLSLHFVELVCSRKCTEALEFAQTKL   76 (172)
Q Consensus        12 ~~R~~I~~~I~~G~i~~Ai~~i~~~~p~ll~~~~------~l~F~L~~q~fIELir~~~~~eAi~~ar~~l   76 (172)
                      .-|.+|...+-. .++.-++++......+.+..+      .+...+..-.|.+++++|+..+|.+.+.+.+
T Consensus       135 ~lr~~ie~~l~~-~~~~~~~~~~~~R~~~k~~~~~~~~r~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~  204 (205)
T TIGR01470       135 LLRERIETLLPP-SLGDLATLAATWRDAVKKRLPNGAARRRFWEKFFDGAFAERVLAGREEQAERVLATRL  204 (205)
T ss_pred             HHHHHHHHhcch-hHHHHHHHHHHHHHHHHhhCCCHHHHHHHHHHHhccHHHHHHHcCCHHHHHHHHHHhh
Confidence            346667766643 556677777777776665433      2333344446788899999999988887654


No 68 
>PF10602 RPN7:  26S proteasome subunit RPN7;  InterPro: IPR019585 This entry represents the regulatory subunit RPN7 (known as the non-ATPase regulatory subunit 6 in higher eukaryotes) of the 26S proteasome. This entry also matches the evolutionarily related subunit 1 of the COP9 signalosome complex (CSN) from Arabidopsis [].  The 26S proteasome plays a major role in ATP-dependent degradation of ubiquitinated proteins. Substrate specificity is conferred by the regulatory particle (RP), which can dissociate into stable lid and base subcomplexes. The regulatory subunit RPN7 is one of the lid subunits of the 26S proteasome and has been shown in Saccharomyces cerevisiae (Baker's yeast) to be required for structural integrity [].   The COP9 signalosome is a conserved protein complex composed of eight subunits, where Individual subunits of the complex have been linked to various signal transduction pathways leading to gene expression and cell cycle control []. The overall organisation and the amino acid sequences of the COP9 signalosome subunits resemble the lid subcomplex of the 19 S regulatory particle for the 26 S proteasome []. COP9 subunit 1 (CSN1 or GPS1) of the COP9 complex is an essential subunit of the complex with regard to both structural integrity and functionality. The N-terminal region of subunit 1 (CSN1-N) can inhibit c-fos expression from either a transfected template or a chromosomal transgene (fos-lacZ), and may contain the activity domain that confers most of the repression functions of CSN1. The C-terminal region of subunit 1 (CSN1-C) allows integration of the protein into the COP9 signalosome.
Probab=30.65  E-value=2.4e+02  Score=21.38  Aligned_cols=53  Identities=13%  Similarity=-0.006  Sum_probs=24.2

Q ss_pred             HHHHHHHcCCHHHHHHHHHHhcHHHHcc-CCcceehhhHHHHHHHHhhcChHHH
Q 030744           16 RILHFALEGNALKAIELTEELAQDLLEK-NKDLHFDLLSLHFVELVCSRKCTEA   68 (172)
Q Consensus        16 ~I~~~I~~G~i~~Ai~~i~~~~p~ll~~-~~~l~F~L~~q~fIELir~~~~~eA   68 (172)
                      .||-+|.-|||..+...+++...-+-+. +....-+|++..-+-.+..++..+|
T Consensus        79 ~irv~i~~~d~~~v~~~i~ka~~~~~~~~d~~~~nrlk~~~gL~~l~~r~f~~A  132 (177)
T PF10602_consen   79 VIRVAIFFGDWSHVEKYIEKAESLIEKGGDWERRNRLKVYEGLANLAQRDFKEA  132 (177)
T ss_pred             HHHHHHHhCCHHHHHHHHHHHHHHHhccchHHHHHHHHHHHHHHHHHhchHHHH
Confidence            3444555555555555544433322221 1233444555555555555555554


No 69 
>COG1942 Uncharacterized protein, 4-oxalocrotonate tautomerase homolog [General function prediction only]
Probab=30.57  E-value=90  Score=20.24  Aligned_cols=26  Identities=12%  Similarity=0.158  Sum_probs=22.2

Q ss_pred             CCHHHHHHHHHHHHHHHHHhcCCCCc
Q 030744          110 LSLEYRQHVADNLNRAILAHAERPRY  135 (172)
Q Consensus       110 l~~~~r~~la~~vn~aIL~~~g~~~~  135 (172)
                      .+.++..+||+.|..++...+|.++.
T Consensus        13 ~~~~~K~~la~~vT~~~~~~lg~~~~   38 (69)
T COG1942          13 LDEEQKAELAAEVTEVTVETLGKDPS   38 (69)
T ss_pred             CCHHHHHHHHHHHHHHHHHHhCCCcc
Confidence            45567899999999999999999763


No 70 
>PF14691 Fer4_20:  Dihydroprymidine dehydrogenase domain II, 4Fe-4S cluster; PDB: 2VDC_G 1H7X_C 1H7W_A 1GT8_A 1GTE_B 1GTH_B.
Probab=30.05  E-value=65  Score=22.82  Aligned_cols=26  Identities=15%  Similarity=0.271  Sum_probs=19.9

Q ss_pred             hhhHHHHHHHHhhcChHHHHHHHHhh
Q 030744           50 DLLSLHFVELVCSRKCTEALEFAQTK   75 (172)
Q Consensus        50 ~L~~q~fIELir~~~~~eAi~~ar~~   75 (172)
                      .+....||.+|+.|+..+|++..++.
T Consensus        39 ~~dip~~i~~i~~g~~~~A~~~i~~~   64 (111)
T PF14691_consen   39 HIDIPEYIRLIREGNFKEAYELIRED   64 (111)
T ss_dssp             ---HHHHHHHHHCT-HHHHHHHHHHH
T ss_pred             CCcHHHHHHHHHCCCHHHHHHHHHHh
Confidence            45568899999999999999999863


No 71 
>PRK15359 type III secretion system chaperone protein SscB; Provisional
Probab=29.74  E-value=2.1e+02  Score=20.58  Aligned_cols=57  Identities=14%  Similarity=-0.034  Sum_probs=35.2

Q ss_pred             HHHHHHHcCCHHHHHHHHHHhcHHHHccCCcceehhhHHHHHHHHhhcChHHHHHHHHhhcc
Q 030744           16 RILHFALEGNALKAIELTEELAQDLLEKNKDLHFDLLSLHFVELVCSRKCTEALEFAQTKLT   77 (172)
Q Consensus        16 ~I~~~I~~G~i~~Ai~~i~~~~p~ll~~~~~l~F~L~~q~fIELir~~~~~eAi~~ar~~l~   77 (172)
                      .-..+...|++++|++.+..    .+..++. ....+...=.-+.+.|+..+|+.+-+.-+.
T Consensus        30 ~g~~~~~~g~~~~A~~~~~~----al~~~P~-~~~a~~~lg~~~~~~g~~~~A~~~y~~Al~   86 (144)
T PRK15359         30 SGYASWQEGDYSRAVIDFSW----LVMAQPW-SWRAHIALAGTWMMLKEYTTAINFYGHALM   86 (144)
T ss_pred             HHHHHHHcCCHHHHHHHHHH----HHHcCCC-cHHHHHHHHHHHHHHhhHHHHHHHHHHHHh
Confidence            35567889999999887755    3444433 222223333334566788888887776554


No 72 
>COG5516 Conserved protein containing a Zn-ribbon-like motif, possibly RNA-binding [General function prediction only]
Probab=29.70  E-value=1.4e+02  Score=23.20  Aligned_cols=57  Identities=12%  Similarity=0.064  Sum_probs=41.6

Q ss_pred             HHHHHHHHHHHHHHHHHhcCCCCchHHHHHHHHHHHHHHHHHH-hcCCCCCCCcchhhhhc
Q 030744          112 LEYRQHVADNLNRAILAHAERPRYAAMERLIQQTTAVRQCLSQ-ELGKDVHPPFSLKDFMK  171 (172)
Q Consensus       112 ~~~r~~la~~vn~aIL~~~g~~~~s~Le~l~~~~~~~~~~l~~-~~~~~~~~~~~~~~~~~  171 (172)
                      .+++.+.|.+++....   +.++.+.|-..+.++..++..|-+ ..+++.+.+-|+.+||.
T Consensus        42 isr~~~aA~e~~~~~~---~v~ap~al~~vai~LRE~i~~l~~~r~~g~~~rt~dl~~flr   99 (196)
T COG5516          42 ISRFAMAAGEVDETLA---GVYAPGALLAVAIHLREVIDRLFRTRKAGKRVRTDDLPDFLR   99 (196)
T ss_pred             HHHHHHHhhhhhhhhc---CCCCchhHHHHHHHHHHHHHHHHHHHhccCcCChhhHHHHHH
Confidence            4688888888887543   556667777778888777766655 34555668999999985


No 73 
>KOG0989 consensus Replication factor C, subunit RFC4 [Replication, recombination and repair]
Probab=29.16  E-value=2.5e+02  Score=24.06  Aligned_cols=73  Identities=14%  Similarity=0.094  Sum_probs=50.7

Q ss_pred             HHcCCHHHHHHHHHHhcHHHHccC--------CcceehhhHHHHHHHHhhcChHHHHHHHHhhccCcCCchhhHHHHHHH
Q 030744           21 ALEGNALKAIELTEELAQDLLEKN--------KDLHFDLLSLHFVELVCSRKCTEALEFAQTKLTPFGKVQKYVEKLEDF   92 (172)
Q Consensus        21 I~~G~i~~Ai~~i~~~~p~ll~~~--------~~l~F~L~~q~fIELir~~~~~eAi~~ar~~l~~~~~~~~~~~~l~~~   92 (172)
                      +-.|+.-.|+-.+++..+.--...        ..+.-.-....++|+.++++..+.++++|+.+..-..--..+.+|.++
T Consensus       219 ~S~GdLR~Ait~Lqsls~~gk~It~~~~~e~~~GvVp~~~l~~lle~a~S~d~~~~v~~~Rei~~sg~~~~~lmsQLa~v  298 (346)
T KOG0989|consen  219 ISDGDLRRAITTLQSLSLLGKRITTSLVNEELAGVVPDEKLLDLLELALSADTPNTVKRVREIMRSGYSPLQLMSQLAEV  298 (346)
T ss_pred             HcCCcHHHHHHHHHHhhccCcccchHHHHHHHhccCCHHHHHHHHHHHHccChHHHHHHHHHHHHhccCHHHHHHHHHHH
Confidence            456999999999999988111111        124455567889999999999999999998776533212445555555


Q ss_pred             H
Q 030744           93 M   93 (172)
Q Consensus        93 ~   93 (172)
                      +
T Consensus       299 i  299 (346)
T KOG0989|consen  299 I  299 (346)
T ss_pred             H
Confidence            4


No 74 
>PRK00239 rpsT 30S ribosomal protein S20; Reviewed
Probab=29.02  E-value=98  Score=21.06  Aligned_cols=27  Identities=11%  Similarity=0.057  Sum_probs=20.0

Q ss_pred             HHHHHHHcCCHHHHHHHHHHhcHHHHc
Q 030744           16 RILHFALEGNALKAIELTEELAQDLLE   42 (172)
Q Consensus        16 ~I~~~I~~G~i~~Ai~~i~~~~p~ll~   42 (172)
                      .+..+|..|+.++|.+.+...++.|-.
T Consensus        34 k~~~ai~~~~~~~a~~~~~~a~s~iDk   60 (88)
T PRK00239         34 KVEAAIAAGDKEAAEEALKAAQSKIDK   60 (88)
T ss_pred             HHHHHHHcCCHHHHHHHHHHHHHHHHH
Confidence            455777789999988888877765543


No 75 
>PRK09239 chorismate mutase; Provisional
Probab=28.80  E-value=2.1e+02  Score=20.00  Aligned_cols=49  Identities=10%  Similarity=0.106  Sum_probs=31.5

Q ss_pred             hcCCHHHHHHHHHHHHHHHHHhcCCCCchHHHHHHH----HHHHHHHHHHHhcCC
Q 030744          108 HLLSLEYRQHVADNLNRAILAHAERPRYAAMERLIQ----QTTAVRQCLSQELGK  158 (172)
Q Consensus       108 ~ll~~~~r~~la~~vn~aIL~~~g~~~~s~Le~l~~----~~~~~~~~l~~~~~~  158 (172)
                      ..++++|+..+.+.+. ......|+++ ...+.+++    ....-+.......|+
T Consensus        50 ~i~dp~RE~~vl~~~~-~~a~~~gl~p-~~~~~i~~~ii~esir~q~~i~~~~~~  102 (104)
T PRK09239         50 PPADPAREAYQIERLR-QLAKDANLDP-DFAEKFLNFIIKEVIRHHERIAAEHGD  102 (104)
T ss_pred             CCCCHHHHHHHHHHHH-HHHHHCCCCH-HHHHHHHHHHHHHHHHHHHHHHhccCC
Confidence            3788999999999985 4667789975 34444444    444444544444444


No 76 
>PF14498 Glyco_hyd_65N_2:  Glycosyl hydrolase family 65, N-terminal domain; PDB: 2EAE_A 2EAB_B 2EAC_A 2EAD_B 2RDY_A.
Probab=28.34  E-value=38  Score=26.92  Aligned_cols=34  Identities=26%  Similarity=0.197  Sum_probs=24.6

Q ss_pred             chhcHHhHHHHHHHHHHHHcCCHHHHHHHHHHhc
Q 030744            4 PANCLEDMEMRKRILHFALEGNALKAIELTEELA   37 (172)
Q Consensus         4 ~~~~~~~~~~R~~I~~~I~~G~i~~Ai~~i~~~~   37 (172)
                      +.........=.+||++|.+|++++|-+++++++
T Consensus        48 ~~~~~~~~~~L~~iR~l~~~g~~~~A~~l~~~~~   81 (236)
T PF14498_consen   48 DRTPPDAAEYLPEIRELLFEGDYEEAEELAEENF   81 (236)
T ss_dssp             SSSSHHHHHHHHHHHHHHHTT-CCHHHHHHCCS-
T ss_pred             cccCccHHHHHHHHHHHHHcCChhHHHHHHHHhc
Confidence            3344445666778999999999999999886554


No 77 
>PRK10421 DNA-binding transcriptional repressor LldR; Provisional
Probab=28.26  E-value=1.4e+02  Score=23.72  Aligned_cols=29  Identities=17%  Similarity=0.100  Sum_probs=17.6

Q ss_pred             hHHHHHHHHHHHHcCCHHHHHHHHHHhcH
Q 030744           10 DMEMRKRILHFALEGNALKAIELTEELAQ   38 (172)
Q Consensus        10 ~~~~R~~I~~~I~~G~i~~Ai~~i~~~~p   38 (172)
                      .+..=..|.++|.+||.+.|...+..|.-
T Consensus       190 ~~~~H~~I~~AI~~~D~~~A~~~~~~H~~  218 (253)
T PRK10421        190 LTEQHQAVMDAILAGDAEGARKAMMAHLS  218 (253)
T ss_pred             HHHHHHHHHHHHHcCCHHHHHHHHHHHHH
Confidence            34455566666666666666666666543


No 78 
>PF12569 NARP1:  NMDA receptor-regulated protein 1 ;  InterPro: IPR021183 This group represents N-terminal acetyltransferase A (NatA) auxiliary subunit and represents a non-catalytic component of the NatA N-terminal acetyltransferase, which catalyzes acetylation of proteins beginning with Met-Ser, Met-Gly and Met-Ala. N-terminal acetylation plays a role in normal eukaryotic translation and processing, protect against proteolytic degradation and protein turnover. NAT1 anchors ARD1 and NAT5 to the ribosome and may present the N- terminal of nascent polypeptides for acetylation [], [].
Probab=28.23  E-value=1.4e+02  Score=26.97  Aligned_cols=71  Identities=20%  Similarity=0.162  Sum_probs=46.7

Q ss_pred             HHHcCCHHHHHHHHHHhcHHHHccCCcceehhhHHHHHHHHhhcChHHHHHHHHhhccCcCCchhhHHHHHHHHhH
Q 030744           20 FALEGNALKAIELTEELAQDLLEKNKDLHFDLLSLHFVELVCSRKCTEALEFAQTKLTPFGKVQKYVEKLEDFMAL   95 (172)
Q Consensus        20 ~I~~G~i~~Ai~~i~~~~p~ll~~~~~l~F~L~~q~fIELir~~~~~eAi~~ar~~l~~~~~~~~~~~~l~~~~~L   95 (172)
                      ....|+.+.|++.+.++-..+++.-.-++     .+--=+++-|..++|-..-+..+.....|..|..-+..++++
T Consensus        14 l~e~g~~~~AL~~L~~~~~~I~Dk~~~~E-----~rA~ll~kLg~~~eA~~~y~~Li~rNPdn~~Yy~~L~~~~g~   84 (517)
T PF12569_consen   14 LEEAGDYEEALEHLEKNEKQILDKLAVLE-----KRAELLLKLGRKEEAEKIYRELIDRNPDNYDYYRGLEEALGL   84 (517)
T ss_pred             HHHCCCHHHHHHHHHhhhhhCCCHHHHHH-----HHHHHHHHcCCHHHHHHHHHHHHHHCCCcHHHHHHHHHHHhh
Confidence            46789999999999888776666421111     111124556888888888887776654455666666666643


No 79 
>PF10607 CLTH:  CTLH/CRA C-terminal to LisH motif domain;  InterPro: IPR019589 This entry represents the CRA (or CT11-RanBPM) domain, which is a protein-protein interaction domain present in crown eukaryotes (plants, animals, fungi) and which is found in Ran-binding proteins such as Ran-binding protein 9 (RanBP9 or RanBPM) and RanBP10. RanBPM is a scaffolding protein important in regulating cellular function in both the immune system and the nervous system, and may act as an adapter protein to couple membrane receptors to intracellular signaling pathways. This domain is at the C terminus of the proteins and is the binding domain for the CRA motif, which is comprised of approximately 100 amino acids at the C-terminal of RanBPM. It was found to be important for the interaction of RanBPM with fragile X mental retardation protein (FMRP), but its functional significance has yet to be determined []. 
Probab=27.49  E-value=72  Score=22.89  Aligned_cols=30  Identities=23%  Similarity=0.278  Sum_probs=24.7

Q ss_pred             HHhHHHHHHHHHHHHcCCHHHHHHHHHHhc
Q 030744            8 LEDMEMRKRILHFALEGNALKAIELTEELA   37 (172)
Q Consensus         8 ~~~~~~R~~I~~~I~~G~i~~Ai~~i~~~~   37 (172)
                      +..--.++...+.|+.|++.+|+++++.+.
T Consensus        38 L~f~L~~q~fiell~~~~~~~Ai~y~r~~l   67 (145)
T PF10607_consen   38 LEFELRCQQFIELLREGDIMEAIEYARKHL   67 (145)
T ss_pred             hhHHHHHHHHHHHHHHHhHHHHHHHHHHHh
Confidence            344445788889999999999999999965


No 80 
>PF01649 Ribosomal_S20p:  Ribosomal protein S20;  InterPro: IPR002583 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites [, ]. About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits.  Many ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome [, ]. This family consists of bacterial (and chloroplast) examples of the ribosomal small subunit protein S20. Bacterial ribosomal protein S20 forms part of the 30S ribosomal subunit, and interacts with 16S rRNA.; GO: 0003723 RNA binding, 0003735 structural constituent of ribosome, 0006412 translation, 0005622 intracellular, 0005840 ribosome; PDB: 1I94_T 1FJG_T 4DH9_T 3KNJ_T 3TVG_W 3UYF_W 3V28_T 3KIS_t 3HUY_T 1HNX_T ....
Probab=25.70  E-value=1.2e+02  Score=20.35  Aligned_cols=28  Identities=18%  Similarity=0.129  Sum_probs=20.9

Q ss_pred             HHHHHHHHcCCHHHHHHHHHHhcHHHHc
Q 030744           15 KRILHFALEGNALKAIELTEELAQDLLE   42 (172)
Q Consensus        15 ~~I~~~I~~G~i~~Ai~~i~~~~p~ll~   42 (172)
                      +.++.+|..|+-+.|.+.+...++.|-.
T Consensus        32 Kk~~~ai~~~~~~~a~~~l~~a~s~iDk   59 (84)
T PF01649_consen   32 KKFREAIEAGDKEEAKELLRKAYSAIDK   59 (84)
T ss_dssp             HHHHHHHHTT-HHHHHHHHHHHHHHHHH
T ss_pred             HHHHHHHHccChHHHHHHHHHHHHHHHH
Confidence            3567788889999999999887776544


No 81 
>PF00701 DHDPS:  Dihydrodipicolinate synthetase family;  InterPro: IPR002220 Dihydropicolinate synthase (DHDPS) is the key enzyme in lysine biosynthesis via the diaminopimelate pathway of prokaryotes, some phycomycetes and higher plants. The enzyme catalyses the condensation of L-aspartate-beta- semialdehyde and pyruvate to dihydropicolinic acid via a ping-pong mechanism in which pyruvate binds to the enzyme by forming a Schiff-base with a lysine residue []. Three other proteins are structurally related to DHDPS and probably also act via a similar catalytic mechanism. These are Escherichia coli N-acetylneuraminate lyase (4.1.3.3 from EC) (gene nanA), which catalyzes the condensation of N-acetyl-D-mannosamine and pyruvate to form N-acetylneuraminate; Rhizobium meliloti (Sinorhizobium meliloti) protein mosA [], which is involved in the biosynthesis of the rhizopine 3-o-methyl-scyllo-inosamine; and E. coli hypothetical protein yjhH. The sequences of DHDPS from different sources are well-conserved. The structure takes the form of a homotetramer, in which 2 monomers are related by an approximate 2-fold symmetry []. Each monomer comprises 2 domains: an 8-fold alpha-/beta-barrel, and a C-terminal alpha-helical domain. The fold resembles that of N-acetylneuraminate lyase. The active site lysine is located in the barrel domain, and has access via 2 channels on the C-terminal side of the barrel.; GO: 0016829 lyase activity, 0008152 metabolic process; PDB: 3B4U_B 3S8H_A 3QZE_B 1XXX_F 3L21_F 3IRD_A 3A5F_B 3G0S_B 3DAQ_C 3UQN_A ....
Probab=25.35  E-value=3.7e+02  Score=21.77  Aligned_cols=19  Identities=26%  Similarity=0.144  Sum_probs=10.8

Q ss_pred             HHHHHHHHcCCHHHHHHHH
Q 030744           15 KRILHFALEGNALKAIELT   33 (172)
Q Consensus        15 ~~I~~~I~~G~i~~Ai~~i   33 (172)
                      ..|.+++.+|+++.|.++-
T Consensus       215 ~~i~~~~~~Gd~~~A~~l~  233 (289)
T PF00701_consen  215 VEIYDAFQAGDWEEARELQ  233 (289)
T ss_dssp             HHHHHHHHTTCHHHHHHHH
T ss_pred             HHHHHHHHcCcHHHHHHHH
Confidence            4456666666666665443


No 82 
>TIGR03504 FimV_Cterm FimV C-terminal domain. This protein is found at the extreme C-terminus of FimV from Pseudomonas aeruginosa, and of TspA of Neisseria meningitidis. Disruption of the former blocks twitching motility from type IV pili; Semmler, et al. suggest a role in peptidoglycan layer remodelling required by type IV fimbrial systems.
Probab=25.08  E-value=1e+02  Score=18.02  Aligned_cols=19  Identities=32%  Similarity=0.274  Sum_probs=13.7

Q ss_pred             HHHHHcCCHHHHHHHHHHh
Q 030744           18 LHFALEGNALKAIELTEEL   36 (172)
Q Consensus        18 ~~~I~~G~i~~Ai~~i~~~   36 (172)
                      +-.|.-|+.+.|.+++++.
T Consensus         7 ~ayie~Gd~e~Ar~lL~ev   25 (44)
T TIGR03504         7 RAYIEMGDLEGARELLEEV   25 (44)
T ss_pred             HHHHHcCChHHHHHHHHHH
Confidence            4567778888888877654


No 83 
>PF13428 TPR_14:  Tetratricopeptide repeat
Probab=25.06  E-value=1e+02  Score=17.29  Aligned_cols=18  Identities=17%  Similarity=0.093  Sum_probs=11.7

Q ss_pred             HHHHHcCCHHHHHHHHHH
Q 030744           18 LHFALEGNALKAIELTEE   35 (172)
Q Consensus        18 ~~~I~~G~i~~Ai~~i~~   35 (172)
                      +-....|+.+.|++++++
T Consensus         9 ~~~~~~G~~~~A~~~~~~   26 (44)
T PF13428_consen    9 RAYRRLGQPDEAERLLRR   26 (44)
T ss_pred             HHHHHcCCHHHHHHHHHH
Confidence            345667777777776654


No 84 
>PRK10225 DNA-binding transcriptional repressor UxuR; Provisional
Probab=25.00  E-value=1.2e+02  Score=24.01  Aligned_cols=26  Identities=8%  Similarity=0.005  Sum_probs=14.8

Q ss_pred             HHHHHHHHhhcChHHHHHHHHhhccC
Q 030744           53 SLHFVELVCSRKCTEALEFAQTKLTP   78 (172)
Q Consensus        53 ~q~fIELir~~~~~eAi~~ar~~l~~   78 (172)
                      ..+.++.|++|+.+.|.+..+.|+..
T Consensus       201 H~~I~~Ai~~~D~~~A~~~m~~Hi~~  226 (257)
T PRK10225        201 HKQILAALIKKDARAAKLAMWQHLEN  226 (257)
T ss_pred             HHHHHHHHHhCCHHHHHHHHHHHHHH
Confidence            34455566666666666666666543


No 85 
>PLN02150 terpene synthase/cyclase family protein
Probab=24.42  E-value=1.6e+02  Score=20.18  Aligned_cols=35  Identities=6%  Similarity=0.034  Sum_probs=24.4

Q ss_pred             HHHHHHHHHHHHHhcCCCCchHHHHHHHHHHHHHHHH
Q 030744          116 QHVADNLNRAILAHAERPRYAAMERLIQQTTAVRQCL  152 (172)
Q Consensus       116 ~~la~~vn~aIL~~~g~~~~s~Le~l~~~~~~~~~~l  152 (172)
                      +..|..+|...+..... +.+.++.++.-++ +.+.+
T Consensus        33 ~~~WK~iN~e~l~~~~~-p~~~~~~~~NlaR-~~~~~   67 (96)
T PLN02150         33 RDNYKIVMEEFLTIKDV-PRPVLVRCLNLAR-LIDVY   67 (96)
T ss_pred             HHHHHHHHHHHcCCCCC-CHHHHHHHHHHHH-HHHhh
Confidence            45788899999886666 4677888777554 44444


No 86 
>KOG3930 consensus Uncharacterized conserved protein [Function unknown]
Probab=23.89  E-value=1.5e+02  Score=25.35  Aligned_cols=50  Identities=12%  Similarity=0.167  Sum_probs=38.6

Q ss_pred             chhcCCHHHH-HHHHHHHHHHHHHhcCCCCchHHHHHHHHHHHHHHHHHHh
Q 030744          106 MFHLLSLEYR-QHVADNLNRAILAHAERPRYAAMERLIQQTTAVRQCLSQE  155 (172)
Q Consensus       106 ~~~ll~~~~r-~~la~~vn~aIL~~~g~~~~s~Le~l~~~~~~~~~~l~~~  155 (172)
                      |++.+-..|- ..+.-.+|..+|..+|+.-.--+-.|++|.++++.+....
T Consensus        21 YA~~Fv~NRIqk~MLldLnKe~l~ElGvT~iGDiiaILrh~K~v~~q~~re   71 (389)
T KOG3930|consen   21 YAKSFVTNRIQKEMLLDLNKETLSELGVTAIGDIIAILRHIKAVKSQIERE   71 (389)
T ss_pred             HHHHHHhhhhhHHHHhhhhHHHHHHhchhhhhhHHHHHHHHHHHHHHHHHh
Confidence            5544433332 5677789999999999999999999999999987776554


No 87 
>PF08625 Utp13:  Utp13 specific WD40 associated domain;  InterPro: IPR013934 A large ribonuclear protein complex is required for the processing of the small-ribosomal-subunit rRNA - the small-subunit (SSU) processome [, ]. This preribosomal complex contains the U3 snoRNA and at least 40 proteins, which have the following properties:   They are nucleolar. They are able to coimmunoprecipitate with the U3 snoRNA and Mpp10 (a protein specific to the SSU processome).  They are required for 18S rRNA biogenesis.  There appears to be a linkage between polymerase I transcription and the formation of the SSU processome; as some, but not all, of the SSU processome components are required for pre-rRNA transcription initiation. These SSU processome components have been termed t-Utps. They form a pre-complex with pre-18S rRNA in the absence of snoRNA U3 and other SSU processome components. It has been proposed that the t-Utp complex proteins are both rDNA and rRNA binding proteins that are involved in the initiation of pre18S rRNA transcription. Initially binding to rDNA then associating with the 5' end of the nascent pre18S rRNA. The t-Utpcomplex forms the nucleus around which the rest of the SSU processome components, including snoRNA U3, assemble []. From electron microscopy the SSU processome may correspond to the terminal knobs visualized at the 5' ends of nascent 18S rRNA.   Utp13 is a nucleolar protein and component of the small subunit (SSU) processome containing the U3 snoRNA that is involved in processing of pre-18S rRNA [].  Upt13 is also a component of the Pwp2 complex that forms part of a stable particle subunit independent of the U3 small nucleolar ribonucleoprotein that is essential for the initial assembly steps of the 90S pre-ribosome []. Components of the Pwp2 complex are: Utp1 (Pwp2), Utp6, Utp12 (Dip2), Utp13, Utp18, and Utp21. The relationship between the Pwp2 complex and the t-Utps complex [] that also associates with the 5' end of nascent pre-18S rRNA is unclear. ; GO: 0006364 rRNA processing, 0032040 small-subunit processome
Probab=23.68  E-value=2.3e+02  Score=20.88  Aligned_cols=22  Identities=14%  Similarity=0.224  Sum_probs=18.3

Q ss_pred             HHHHHHHHcCCHHHHHHHHHHh
Q 030744           15 KRILHFALEGNALKAIELTEEL   36 (172)
Q Consensus        15 ~~I~~~I~~G~i~~Ai~~i~~~   36 (172)
                      +.+.+++.+|++.+|+.++=+.
T Consensus         2 Q~L~N~l~~~~y~~Al~LAl~L   23 (141)
T PF08625_consen    2 QELSNLLRQKDYKEALRLALKL   23 (141)
T ss_pred             chHHHHHHhhhHHHHHHHHHhc
Confidence            4678899999999999987554


No 88 
>PF10475 DUF2450:  Protein of unknown function N-terminal domain (DUF2450)  ;  InterPro: IPR019515  This entry represents Vacuolar protein sorting-associated protein 54, and is thought to be involved in retrograde transport from early and late endosomes to late Golgi found in eukaryotes, but its function is not known. 
Probab=23.48  E-value=1.2e+02  Score=25.07  Aligned_cols=29  Identities=31%  Similarity=0.384  Sum_probs=24.3

Q ss_pred             HHHHHHHHHHHcCCHHHHHHHHHHhcHHH
Q 030744           12 EMRKRILHFALEGNALKAIELTEELAQDL   40 (172)
Q Consensus        12 ~~R~~I~~~I~~G~i~~Ai~~i~~~~p~l   40 (172)
                      +....|+.++.+|++..|++++.+..--+
T Consensus       129 ~~~~~l~~ll~~~dy~~Al~li~~~~~~l  157 (291)
T PF10475_consen  129 QTQSRLQELLEEGDYPGALDLIEECQQLL  157 (291)
T ss_pred             HHHHHHHHHHhcCCHHHHHHHHHHHHHHH
Confidence            45778899999999999999998876544


No 89 
>PF12862 Apc5:  Anaphase-promoting complex subunit 5
Probab=23.36  E-value=2.3e+02  Score=18.79  Aligned_cols=28  Identities=14%  Similarity=0.109  Sum_probs=21.5

Q ss_pred             HHHHHHcCCHHHHHHHHHHhcHHHHccC
Q 030744           17 ILHFALEGNALKAIELTEELAQDLLEKN   44 (172)
Q Consensus        17 I~~~I~~G~i~~Ai~~i~~~~p~ll~~~   44 (172)
                      --+++..||+..|++.+.+.+---...+
T Consensus         5 ~~~~~~~~dy~~A~d~L~~~fD~~~~~~   32 (94)
T PF12862_consen    5 YLNALRSGDYSEALDALHRYFDYAKQSN   32 (94)
T ss_pred             HHHHHHcCCHHHHHHHHHHHHHHHhhcc
Confidence            3478999999999999988875444443


No 90 
>PRK12791 flbT flagellar biosynthesis repressor FlbT; Reviewed
Probab=22.79  E-value=1.7e+02  Score=21.46  Aligned_cols=27  Identities=22%  Similarity=0.299  Sum_probs=20.1

Q ss_pred             HHHHHHHHHHHcCCHHHHHHHHHHhcH
Q 030744           12 EMRKRILHFALEGNALKAIELTEELAQ   38 (172)
Q Consensus        12 ~~R~~I~~~I~~G~i~~Ai~~i~~~~p   38 (172)
                      ..=..|.++|.+|++-+|++.+...+|
T Consensus        92 ~~l~~~~~~v~~g~~Y~ALK~~R~Li~  118 (131)
T PRK12791         92 PIIEAINNHILNGDLYKALKELRKLIA  118 (131)
T ss_pred             HHHHHHHHHHhCCCHHHHHHHHHHhHH
Confidence            334456778888888888888887776


No 91 
>COG5117 NOC3 Protein involved in the nuclear export of pre-ribosomes [Translation, ribosomal structure and biogenesis / Intracellular trafficking and secretion]
Probab=22.57  E-value=5.9e+02  Score=23.18  Aligned_cols=103  Identities=17%  Similarity=0.162  Sum_probs=59.4

Q ss_pred             HHHHHHHHHHHHcC-CHHHHHHHHHHhcHHHHccCCcceehhhHHHHHHHHhhcChHHHHHHHHhhccCcCC--chhhHH
Q 030744           11 MEMRKRILHFALEG-NALKAIELTEELAQDLLEKNKDLHFDLLSLHFVELVCSRKCTEALEFAQTKLTPFGK--VQKYVE   87 (172)
Q Consensus        11 ~~~R~~I~~~I~~G-~i~~Ai~~i~~~~p~ll~~~~~l~F~L~~q~fIELir~~~~~eAi~~ar~~l~~~~~--~~~~~~   87 (172)
                      -++|++|.+..++- ++..|++.- .+-.++++.=-.++|+.         -+|+.-..|-|+=+-+..|..  |+++..
T Consensus       375 ~KE~~~I~~Emr~ae~i~~a~e~e-knqseIlkiif~~Yf~v---------Lk~~~k~lig~vleGl~k~~~~~n~eflG  444 (657)
T COG5117         375 EKERLRIQSEMRDAEDIEEAIEEE-KNQSEILKIIFRLYFMV---------LKGDRKDLIGYVLEGLVKYRKIINPEFLG  444 (657)
T ss_pred             HHHHHHHHHHhhhHHHHHHHHHHH-hhHHHHHHHHHHHHHHH---------HhcchHHHHHHHHHHHHHHHhhcCHHHHh
Confidence            45678888777654 444554433 23345555422233332         234444455555555555554  678887


Q ss_pred             HHHHHHhHhcccCCCCCcchhcCCHHHHHHHHHHHHHHHH
Q 030744           88 KLEDFMALLAYEEPEKSPMFHLLSLEYRQHVADNLNRAIL  127 (172)
Q Consensus        88 ~l~~~~~LLay~~~~~sp~~~ll~~~~r~~la~~vn~aIL  127 (172)
                      .+-++|--|.-++    |+..+-+.++|+.+.--+.+.=+
T Consensus       445 D~Levl~eL~~d~----~~dk~ss~a~r~alLcI~tAF~i  480 (657)
T COG5117         445 DLLEVLYELLNDN----PLDKISSDARRQALLCILTAFEI  480 (657)
T ss_pred             HHHHHHHHHHcCC----chhhhhHHHHHHHHHHhhHHHHH
Confidence            7777777666665    55566677888887766644333


No 92 
>cd00194 UBA Ubiquitin Associated domain. The UBA domain is a commonly occurring sequence motif in some members of the ubiquitination pathway, UV excision repair proteins, and certain protein kinases. Although its specific role is so far unknown, it has been suggested that UBA domains are involved in conferring protein target specificity. The domain, a compact three helix bundle, has a conserved GFP-loop and the proline is thought to be critical for binding. The UBA domain is distinct from the conserved three helical domain seen in the N-terminus of EF-TS and eukaryotic NAC proteins.
Probab=22.39  E-value=69  Score=17.43  Aligned_cols=21  Identities=19%  Similarity=0.123  Sum_probs=14.6

Q ss_pred             HHHHHHHHH--cCCHHHHHHHHH
Q 030744           14 RKRILHFAL--EGNALKAIELTE   34 (172)
Q Consensus        14 R~~I~~~I~--~G~i~~Ai~~i~   34 (172)
                      +..++.++.  .|+++.|++|+-
T Consensus        15 ~~~~~~AL~~~~~d~~~A~~~L~   37 (38)
T cd00194          15 REEARKALRATNNNVERAVEWLL   37 (38)
T ss_pred             HHHHHHHHHHhCCCHHHHHHHHh
Confidence            445555543  589999999873


No 93 
>KOG2659 consensus LisH motif-containing protein [Cytoskeleton]
Probab=22.15  E-value=2.1e+02  Score=23.14  Aligned_cols=63  Identities=17%  Similarity=0.067  Sum_probs=40.4

Q ss_pred             HHHHHHHHHcCCHHHHHHHHHHhcHHHHccCCcceehhhHHHHHHHHhhcChHHHHHHHHhhccC
Q 030744           14 RKRILHFALEGNALKAIELTEELAQDLLEKNKDLHFDLLSLHFVELVCSRKCTEALEFAQTKLTP   78 (172)
Q Consensus        14 R~~I~~~I~~G~i~~Ai~~i~~~~p~ll~~~~~l~F~L~~q~fIELir~~~~~eAi~~ar~~l~~   78 (172)
                      |--|-..+-.|..+.|.....+.  ++-....++.+.=.+.+-.++|+.|++++||+...+....
T Consensus        31 ~LVmnylv~eg~~EaA~~Fa~e~--~i~~~~~d~~~~~eR~~Ir~~I~~G~Ie~Aie~in~l~Pe   93 (228)
T KOG2659|consen   31 RLVMNYLVHEGYVEAAEKFAKES--GIKPPSIDLDSMDERLQIRRAIEEGQIEEAIEKVNQLNPE   93 (228)
T ss_pred             HHHHHHHHhccHHHHHHHhcccc--CCCCccCchhhHhHHHHHHHHHHhccHHHHHHHHHHhChH
Confidence            44444555566666555544332  1111134666666778888899999999999999975543


No 94 
>PF07719 TPR_2:  Tetratricopeptide repeat;  InterPro: IPR013105 The tetratrico peptide repeat (TPR) is a structural motif present in a wide range of proteins [, , ]. It mediates protein-protein interactions and the assembly of multiprotein complexes []. The TPR motif consists of 3-16 tandem-repeats of 34 amino acids residues, although individual TPR motifs can be dispersed in the protein sequence. Sequence alignment of the TPR domains reveals a consensus sequence defined by a pattern of small and large amino acids. TPR motifs have been identified in various different organisms, ranging from bacteria to humans. Proteins containing TPRs are involved in a variety of biological processes, such as cell cycle regulation, transcriptional control, mitochondrial and peroxisomal protein transport, neurogenesis and protein folding. This repeat includes outlying Tetratricopeptide-like repeats (TPR) that are not matched by IPR001440 from INTERPRO.; PDB: 1XNF_B 3Q15_A 4ABN_A 1OUV_A 3U4T_A 3MA5_C 2KCV_A 2KCL_A 2XEV_A 3NF1_A ....
Probab=21.87  E-value=1.1e+02  Score=15.44  Aligned_cols=16  Identities=38%  Similarity=0.370  Sum_probs=11.1

Q ss_pred             HHHcCCHHHHHHHHHH
Q 030744           20 FALEGNALKAIELTEE   35 (172)
Q Consensus        20 ~I~~G~i~~Ai~~i~~   35 (172)
                      ....|++++|++.+++
T Consensus        11 ~~~~~~~~~A~~~~~~   26 (34)
T PF07719_consen   11 YYQLGNYEEAIEYFEK   26 (34)
T ss_dssp             HHHTT-HHHHHHHHHH
T ss_pred             HHHhCCHHHHHHHHHH
Confidence            4567888888887754


No 95 
>TIGR00029 S20 ribosomal protein S20. This family consists of bacterial (and chloroplast) examples of the bacteria ribosomal small subunit protein S20.
Probab=21.86  E-value=1.6e+02  Score=20.00  Aligned_cols=28  Identities=29%  Similarity=0.247  Sum_probs=20.2

Q ss_pred             HHHHHHHHcCCHHHHHHHHHHhcHHHHc
Q 030744           15 KRILHFALEGNALKAIELTEELAQDLLE   42 (172)
Q Consensus        15 ~~I~~~I~~G~i~~Ai~~i~~~~p~ll~   42 (172)
                      +.+..+|..|+.+.|.+.+..-++.|-.
T Consensus        33 Kk~~~ai~~~d~~~a~~~l~~a~s~iDk   60 (87)
T TIGR00029        33 KKVYAAIAAGDKDKAQEAFKEAAKKLDR   60 (87)
T ss_pred             HHHHHHHHCCCHHHHHHHHHHHHHHHHH
Confidence            3456777889988888888777665544


No 96 
>PF09712 PHA_synth_III_E:  Poly(R)-hydroxyalkanoic acid synthase subunit (PHA_synth_III_E)
Probab=21.86  E-value=2.2e+02  Score=23.67  Aligned_cols=68  Identities=12%  Similarity=0.121  Sum_probs=44.6

Q ss_pred             HHHHHHHhHhcccCCCCCcchhcCCHHHH-HHHHHHHHHHHHHhcCCCCchHHHHHHHHHHHHHHHHHH
Q 030744           87 EKLEDFMALLAYEEPEKSPMFHLLSLEYR-QHVADNLNRAILAHAERPRYAAMERLIQQTTAVRQCLSQ  154 (172)
Q Consensus        87 ~~l~~~~~LLay~~~~~sp~~~ll~~~~r-~~la~~vn~aIL~~~g~~~~s~Le~l~~~~~~~~~~l~~  154 (172)
                      ...++++.-+++++...--++.+++.... ..-...+...+|..+|+|..+-+..+.+.+..+...++.
T Consensus       222 ~~ae~~~~~~~~S~ef~~~~g~~~~a~m~~r~~~~~~~e~~L~~l~lPTr~evd~l~k~l~eLrre~r~  290 (293)
T PF09712_consen  222 DAAEEAYEELFRSEEFAQAYGQLVNALMDLRKQQQEVVEEYLRSLNLPTRSEVDELYKRLHELRREVRA  290 (293)
T ss_pred             HHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHH
Confidence            33444444444433222224555555433 455666777899999999999999999999888777654


No 97 
>PRK09464 pdhR transcriptional regulator PdhR; Reviewed
Probab=21.02  E-value=2.1e+02  Score=22.53  Aligned_cols=23  Identities=26%  Similarity=0.281  Sum_probs=14.2

Q ss_pred             HHHHHHHcCCHHHHHHHHHHhcH
Q 030744           16 RILHFALEGNALKAIELTEELAQ   38 (172)
Q Consensus        16 ~I~~~I~~G~i~~Ai~~i~~~~p   38 (172)
                      .|.++|.+||.+.|...+.+|.-
T Consensus       204 ~I~~Ai~~~D~~~A~~~~~~Hl~  226 (254)
T PRK09464        204 RIFEAIVAGKPEKAREASHRHLA  226 (254)
T ss_pred             HHHHHHHhCCHHHHHHHHHHHHH
Confidence            56666666666666666666543


No 98 
>cd00686 Terpene_cyclase_cis_trans_C1 Cis, Trans, Terpene Cyclases, Class 1. This CD includes the terpenoid cyclase, trichodiene synthase, which catalyzes the cyclization of farnesyl diphosphate (FPP) to trichodiene using a cis-trans pathway, and is the first committed step in the biosynthesis of trichothecene toxins and antibiotics. As with other enzymes with the 'terpenoid synthase fold', this enzyme has two conserved metal binding motifs that coordinate Mg2+ ion-bridged binding of the diphosphate moiety of FPP. Metal-triggered substrate ionization initiates catalysis, and the alpha-barrel active site serves as a template to channel and stabilize the conformations of reactive carbocation intermediates through a complex cyclization cascade. These enzymes function as homodimers and are found in several genera of fungi.
Probab=20.84  E-value=1.8e+02  Score=25.17  Aligned_cols=79  Identities=13%  Similarity=0.058  Sum_probs=42.9

Q ss_pred             HHHHHhHhcccCCCCCcchhcCCH-HHHHHHHHHHHHHHHHhcCCCCchHHHHHHHHHHHHHHHHHHhcCCCCC-CCcch
Q 030744           89 LEDFMALLAYEEPEKSPMFHLLSL-EYRQHVADNLNRAILAHAERPRYAAMERLIQQTTAVRQCLSQELGKDVH-PPFSL  166 (172)
Q Consensus        89 l~~~~~LLay~~~~~sp~~~ll~~-~~r~~la~~vn~aIL~~~g~~~~s~Le~l~~~~~~~~~~l~~~~~~~~~-~~~~~  166 (172)
                      +.+++..+.|-|.--|=|++.+.+ ++=..+.+.++     ..|.+...+|+.+.+.+......+..--|+..| -.--+
T Consensus       214 Ip~~~~~i~~~NDILSFYKEe~~~~E~~n~V~Nya~-----~~GiS~~eAL~~lt~dTv~~s~rv~~VLse~dp~l~~av  288 (357)
T cd00686         214 IAQMENWMVWVNDLMSFYKEFDDERDQISLVKNYVV-----SDEISLHEALEKLTQDTLHSSKQMVAVFSDKDPQVMDTI  288 (357)
T ss_pred             HHHHHHHHHhhhhhhheehhhcccccccchHHHhhh-----hcCCCHHHHHHHHHHHHHHHHHHHHHHhcCCChHHHHHH
Confidence            333333334443323445554432 33344555554     389999999999998887777666555433321 03345


Q ss_pred             hhhhcC
Q 030744          167 KDFMKS  172 (172)
Q Consensus       167 ~~~~~~  172 (172)
                      +.|++|
T Consensus       289 ~~Fi~G  294 (357)
T cd00686         289 ECFMHG  294 (357)
T ss_pred             HHHHHH
Confidence            555554


No 99 
>COG3492 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=20.43  E-value=2.9e+02  Score=19.18  Aligned_cols=46  Identities=24%  Similarity=0.281  Sum_probs=33.4

Q ss_pred             cCCHHHHHHHHHHHHHHHHHhcCCCCchHHHHHHHHHHHHHHHHHH
Q 030744          109 LLSLEYRQHVADNLNRAILAHAERPRYAAMERLIQQTTAVRQCLSQ  154 (172)
Q Consensus       109 ll~~~~r~~la~~vn~aIL~~~g~~~~s~Le~l~~~~~~~~~~l~~  154 (172)
                      -+++++|.++....-+.++.++...+.-.=-.+..-+--++++|+.
T Consensus         5 ~ls~~q~~~leAAaFRrLv~HL~~rsdvQNIDLMnLAgFCRNCLs~   50 (104)
T COG3492           5 ELSEEQRDRLEAAAFRRLVEHLQERSDVQNIDLMNLAGFCRNCLSN   50 (104)
T ss_pred             hcCHHHHHHHHHHHHHHHHHHHHHhcccchhHHHHHHHHHHHHHHH
Confidence            4678899999999999999997664444334455556667777764


No 100
>PF13424 TPR_12:  Tetratricopeptide repeat; PDB: 3RO2_A 3Q15_A 3ASG_A 3ASD_A 3AS5_A 3AS4_A 3ASH_B 4A1S_B 3CEQ_B 3EDT_H ....
Probab=20.10  E-value=2.3e+02  Score=17.46  Aligned_cols=56  Identities=21%  Similarity=0.096  Sum_probs=30.3

Q ss_pred             HHHcCCHHHHHHHHHHhcHHHHccCCc-ceehhhHHHHHHH--HhhcChHHHHHHHHhhc
Q 030744           20 FALEGNALKAIELTEELAQDLLEKNKD-LHFDLLSLHFVEL--VCSRKCTEALEFAQTKL   76 (172)
Q Consensus        20 ~I~~G~i~~Ai~~i~~~~p~ll~~~~~-l~F~L~~q~fIEL--ir~~~~~eAi~~ar~~l   76 (172)
                      ....|+++.|+++.++--.- .+.-.+ -.....+..=+-.  .+.|+.++|+++.++-+
T Consensus        15 ~~~~~~~~~A~~~~~~al~~-~~~~~~~~~~~a~~~~~lg~~~~~~g~~~~A~~~~~~al   73 (78)
T PF13424_consen   15 YRELGRYDEALDYYEKALDI-EEQLGDDHPDTANTLNNLGECYYRLGDYEEALEYYQKAL   73 (78)
T ss_dssp             HHHTT-HHHHHHHHHHHHHH-HHHTTTHHHHHHHHHHHHHHHHHHTTHHHHHHHHHHHHH
T ss_pred             HHHcCCHHHHHHHHHHHHHH-HHHHCCCCHHHHHHHHHHHHHHHHcCCHHHHHHHHHHHH
Confidence            44789999999999776544 332111 1111222111111  24578888888887643


No 101
>PRK04964 hypothetical protein; Provisional
Probab=20.07  E-value=99  Score=19.82  Aligned_cols=26  Identities=15%  Similarity=0.142  Sum_probs=16.2

Q ss_pred             ChHHHHHHHHhhccCcCCchhhHHHH
Q 030744           64 KCTEALEFAQTKLTPFGKVQKYVEKL   89 (172)
Q Consensus        64 ~~~eAi~~ar~~l~~~~~~~~~~~~l   89 (172)
                      -+..||..+-+-|...+.|+....++
T Consensus        22 YiP~Ai~ca~k~L~~IAad~~Lp~~v   47 (66)
T PRK04964         22 YVPDALGCVLKALNEIAADEALPESV   47 (66)
T ss_pred             cCcHHHHHHHHHHHHHhccccCCHHH
Confidence            47888888877776665444433333


Done!