Query         psy4937
Match_columns 356
No_of_seqs    189 out of 1041
Neff          5.7 
Searched_HMMs 46136
Date          Fri Aug 16 21:39:51 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy4937.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/4937hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG3959|consensus              100.0 9.5E-79 2.1E-83  560.3   9.2  293    1-356     1-302 (306)
  2 PF13532 2OG-FeII_Oxy_2:  2OG-F 100.0 3.7E-32   8E-37  244.8  13.8  168  119-333     1-194 (194)
  3 KOG3200|consensus              100.0 1.7E-29 3.7E-34  225.1  13.5  178  118-337    12-216 (224)
  4 PRK15401 alpha-ketoglutarate-d  99.9 4.1E-26   9E-31  211.7  17.4  172  115-335    15-213 (213)
  5 TIGR00568 alkb DNA alkylation   99.9 2.6E-22 5.6E-27  180.6  14.4  147  124-318     2-169 (169)
  6 KOG4176|consensus               99.8 3.2E-20 6.9E-25  181.5  13.1  181  115-340   124-309 (323)
  7 COG3145 AlkB Alkylated DNA rep  99.8 5.5E-19 1.2E-23  161.7  12.9  167  115-330    12-194 (194)
  8 smart00702 P4Hc Prolyl 4-hydro  98.1 9.5E-05 2.1E-09   66.0  14.9  168  118-332     1-175 (178)
  9 PF03171 2OG-FeII_Oxy:  2OG-Fe(  98.0 7.3E-06 1.6E-10   66.1   5.1   83  199-333     2-95  (98)
 10 PRK05467 Fe(II)-dependent oxyg  97.9 0.00017 3.7E-09   68.1  12.5  165  120-332     2-174 (226)
 11 KOG2731|consensus               96.4  0.0046   1E-07   61.7   5.0  116  157-319   171-292 (378)
 12 PLN00052 prolyl 4-hydroxylase;  95.4    0.14 3.1E-06   50.7  10.6   94  117-219    53-151 (310)
 13 COG3128 PiuC Uncharacterized i  95.0    0.25 5.5E-06   45.8  10.1  169  119-333     3-178 (229)
 14 PF13640 2OG-FeII_Oxy_3:  2OG-F  94.3    0.03 6.4E-07   45.0   2.4   32  298-332    66-97  (100)
 15 KOG1591|consensus               90.8     1.2 2.7E-05   43.8   8.6   93  117-219    96-195 (289)
 16 TIGR01762 chlorin-enz chlorina  88.2      13 0.00028   36.3  13.6   43  292-338   207-250 (288)
 17 PF05721 PhyH:  Phytanoyl-CoA d  85.0     1.7 3.7E-05   37.9   5.1   25  118-142     4-28  (211)
 18 PF13719 zinc_ribbon_5:  zinc-r  84.6    0.62 1.3E-05   31.8   1.6   35   37-97      2-36  (37)
 19 PF10083 DUF2321:  Uncharacteri  80.9    0.49 1.1E-05   42.4  -0.0   50   34-104    36-85  (158)
 20 PF13248 zf-ribbon_3:  zinc-rib  79.9    0.86 1.9E-05   28.6   0.9   12   83-94     13-24  (26)
 21 TIGR02098 MJ0042_CXXC MJ0042 f  76.6     1.7 3.6E-05   29.3   1.6   13   85-97     24-36  (38)
 22 PF13717 zinc_ribbon_4:  zinc-r  75.4       2 4.4E-05   29.1   1.7   15   82-96     21-35  (36)
 23 PF12933 FTO_NTD:  FTO catalyti  75.2     9.3  0.0002   36.8   6.7   42  291-334   209-250 (253)
 24 PF09859 Oxygenase-NA:  Oxygena  70.8      25 0.00055   32.1   8.0  114  173-331    41-167 (173)
 25 PF13240 zinc_ribbon_2:  zinc-r  70.7     2.1 4.6E-05   26.3   0.9   10   85-94     12-21  (23)
 26 PF12851 Tet_JBP:  Oxygenase do  68.3     6.7 0.00014   35.5   3.9   38  292-332   126-167 (171)
 27 PF14205 Cys_rich_KTR:  Cystein  67.1     3.3 7.1E-05   31.0   1.4   10   36-45      3-12  (55)
 28 PF13759 2OG-FeII_Oxy_5:  Putat  66.6     8.6 0.00019   31.2   3.9   38  291-332    63-100 (101)
 29 PF10571 UPF0547:  Uncharacteri  55.5     7.2 0.00016   24.7   1.2   15   83-97     11-25  (26)
 30 PF12773 DZR:  Double zinc ribb  54.9     8.3 0.00018   27.3   1.7   11   85-95     28-38  (50)
 31 PLN03001 oxidoreductase, 2OG-F  54.9      46 0.00099   32.1   7.3   42  291-332   161-209 (262)
 32 COG3809 Uncharacterized protei  54.0     6.8 0.00015   31.5   1.2   13   85-97     20-32  (88)
 33 PRK14892 putative transcriptio  51.3       4 8.7E-05   34.1  -0.6   39   84-135    40-78  (99)
 34 PLN02984 oxidoreductase, 2OG-F  51.1      60  0.0013   32.5   7.6   42  291-332   245-294 (341)
 35 COG4306 Uncharacterized protei  48.7     8.5 0.00018   33.7   1.0   77   34-141    36-112 (160)
 36 PF01927 Mut7-C:  Mut7-C RNAse   45.1      15 0.00032   32.2   2.0   15   82-96    120-134 (147)
 37 PLN02904 oxidoreductase         45.0      76  0.0017   31.9   7.3   42  291-332   253-301 (357)
 38 PRK14559 putative protein seri  43.8      11 0.00023   41.2   1.1   55   83-137    38-104 (645)
 39 PLN02299 1-aminocyclopropane-1  43.6      55  0.0012   32.4   6.0   42  291-332   204-252 (321)
 40 PF08209 Sgf11:  Sgf11 (transcr  42.5     9.5 0.00021   25.6   0.3   15   36-50      3-17  (33)
 41 PF01927 Mut7-C:  Mut7-C RNAse   42.2      13 0.00028   32.6   1.2   39   13-51     92-139 (147)
 42 PLN02485 oxidoreductase         41.9      74  0.0016   31.4   6.6   42  291-332   235-283 (329)
 43 PLN00417 oxidoreductase, 2OG-F  41.0      81  0.0017   31.6   6.8   42  291-332   249-297 (348)
 44 COG4888 Uncharacterized Zn rib  41.0     7.2 0.00016   32.7  -0.6   62   34-133    19-80  (104)
 45 PF08792 A2L_zn_ribbon:  A2L zi  37.4      22 0.00048   23.7   1.5   16   82-97     17-32  (33)
 46 PLN02912 oxidoreductase, 2OG-F  37.3 1.3E+02  0.0029   30.0   7.7   42  291-332   242-290 (348)
 47 PF14353 CpXC:  CpXC protein     36.8      14 0.00031   31.3   0.6   32   14-45      3-46  (128)
 48 PLN02750 oxidoreductase, 2OG-F  36.0 1.6E+02  0.0034   29.4   7.9   42  291-332   240-288 (345)
 49 COG1326 Uncharacterized archae  35.4      22 0.00048   33.2   1.7   16   82-97     26-41  (201)
 50 PTZ00273 oxidase reductase; Pr  34.7 1.4E+02   0.003   29.3   7.3   42  291-332   224-271 (320)
 51 cd04476 RPA1_DBD_C RPA1_DBD_C:  33.5      24 0.00051   31.2   1.5   11   35-45     32-42  (166)
 52 PLN02947 oxidoreductase         33.2 1.5E+02  0.0033   30.0   7.4   42  291-332   270-318 (374)
 53 PLN02393 leucoanthocyanidin di  32.7 1.4E+02   0.003   30.1   6.9   42  291-332   259-307 (362)
 54 PRK04023 DNA polymerase II lar  31.9      27 0.00059   40.0   1.9   22   11-45    625-646 (1121)
 55 PF09538 FYDLN_acid:  Protein o  31.1      29 0.00064   29.3   1.6   13   85-97     25-37  (108)
 56 PF10083 DUF2321:  Uncharacteri  30.3      17 0.00036   32.9  -0.1   47    6-52     33-85  (158)
 57 TIGR02466 conserved hypothetic  30.2      72  0.0016   29.7   4.2   36  293-332   161-196 (201)
 58 PLN02515 naringenin,2-oxogluta  30.1 2.2E+02  0.0048   28.7   7.9   41  292-332   243-290 (358)
 59 COG2850 Uncharacterized conser  29.6      80  0.0017   32.4   4.6   55  179-243   103-157 (383)
 60 PRK12380 hydrogenase nickel in  29.6      38 0.00083   28.6   2.0   27   11-46     69-95  (113)
 61 COG1656 Uncharacterized conser  29.4      25 0.00055   31.9   1.0   38   14-51     99-145 (165)
 62 PF01155 HypA:  Hydrogenase exp  29.2      23 0.00051   29.8   0.7   27   11-46     69-95  (113)
 63 PLN02997 flavonol synthase      29.2   2E+02  0.0043   28.5   7.3   42  291-332   228-276 (325)
 64 PLN02156 gibberellin 2-beta-di  28.3 1.8E+02  0.0039   29.1   6.9   42  291-332   226-274 (335)
 65 PLN02758 oxidoreductase, 2OG-F  28.2 1.9E+02  0.0042   29.0   7.2   42  291-332   258-306 (361)
 66 PLN02216 protein SRG1           27.5 2.6E+02  0.0057   28.1   7.9   42  291-332   256-304 (357)
 67 COG0266 Nei Formamidopyrimidin  27.4      35 0.00076   33.5   1.6   13   82-94    261-273 (273)
 68 PRK14714 DNA polymerase II lar  26.6      51  0.0011   38.7   2.9   10   86-95    709-718 (1337)
 69 PLN02254 gibberellin 3-beta-di  26.4 2.2E+02  0.0047   28.8   7.1   42  291-332   256-304 (358)
 70 PF05118 Asp_Arg_Hydrox:  Aspar  26.4 1.9E+02  0.0041   25.6   6.1   38  295-337   125-162 (163)
 71 PRK14811 formamidopyrimidine-D  26.1      42 0.00091   32.5   1.9   16   82-97    251-266 (269)
 72 smart00661 RPOL9 RNA polymeras  25.9      52  0.0011   23.1   1.9   12   86-97     20-31  (52)
 73 PF12171 zf-C2H2_jaz:  Zinc-fin  25.6      30 0.00066   21.3   0.5   11   87-97      2-12  (27)
 74 PLN02704 flavonol synthase      25.3   2E+02  0.0044   28.5   6.6   42  291-332   244-292 (335)
 75 PF07754 DUF1610:  Domain of un  25.1      60  0.0013   20.4   1.8   12   82-93     12-23  (24)
 76 PF09788 Tmemb_55A:  Transmembr  24.7      37 0.00081   32.9   1.2   28   14-46    159-186 (256)
 77 KOG3084|consensus               24.5      54  0.0012   33.0   2.3   46   86-131   173-226 (345)
 78 PRK00398 rpoP DNA-directed RNA  24.2      52  0.0011   23.0   1.6   10   86-95     21-30  (46)
 79 smart00659 RPOLCX RNA polymera  24.2      42  0.0009   23.8   1.1   24   14-45      4-27  (44)
 80 PRK12775 putative trifunctiona  23.5      44 0.00095   38.4   1.7   51   36-93    795-845 (1006)
 81 PLN02403 aminocyclopropanecarb  22.9 2.6E+02  0.0057   27.5   6.8   42  291-332   199-248 (303)
 82 PRK00762 hypA hydrogenase nick  22.6      64  0.0014   27.6   2.2   31   11-45     69-100 (124)
 83 PRK03824 hypA hydrogenase nick  22.1      60  0.0013   28.3   1.9   35   11-45     69-115 (135)
 84 PRK01103 formamidopyrimidine/5  21.9      55  0.0012   31.6   1.8   13   82-94    261-273 (274)
 85 smart00531 TFIIE Transcription  21.5      37 0.00081   29.7   0.5   12   35-46     97-108 (147)
 86 PF11672 DUF3268:  Protein of u  21.3 1.2E+02  0.0025   25.6   3.4   17   38-54      3-22  (102)
 87 PRK03681 hypA hydrogenase nick  21.3      67  0.0014   27.1   2.0   28   11-46     69-96  (114)
 88 PF06677 Auto_anti-p27:  Sjogre  21.3      57  0.0012   22.9   1.3    9   85-93     33-41  (41)
 89 COG1996 RPC10 DNA-directed RNA  21.3      45 0.00098   24.4   0.8   25   14-45      8-32  (49)
 90 PRK11827 hypothetical protein;  21.1      74  0.0016   24.2   2.0   15   85-99     25-39  (60)
 91 PRK00564 hypA hydrogenase nick  20.5      71  0.0015   27.1   2.0   29   10-46     69-97  (117)
 92 PF01096 TFIIS_C:  Transcriptio  20.3      86  0.0019   21.4   2.0   15   82-96     24-38  (39)
 93 TIGR00100 hypA hydrogenase nic  20.3      70  0.0015   27.0   1.9   27   11-46     69-95  (115)
 94 COG3751 EGL-9 Predicted prolin  20.2 4.3E+02  0.0093   25.7   7.4   39  292-331   197-235 (252)
 95 TIGR00373 conserved hypothetic  20.1      35 0.00076   30.4   0.0   11   35-45    107-117 (158)

No 1  
>KOG3959|consensus
Probab=100.00  E-value=9.5e-79  Score=560.31  Aligned_cols=293  Identities=39%  Similarity=0.655  Sum_probs=264.6

Q ss_pred             CC-CCCcccccceeeccccccccCcccccccccccccccccCCCCcccccCccccccCCCCCCCCccccccccccccccc
Q psy4937           1 MD-RSNFCGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDFLT   79 (356)
Q Consensus         1 ~~-~~~~c~ckg~r~c~~ce~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~   79 (356)
                      |+ +.|+|||||+|||+.||+..+..+                                                  ...
T Consensus         1 m~~~~r~CGCKG~RtCl~CE~~~r~s~--------------------------------------------------~~~   30 (306)
T KOG3959|consen    1 MGSAERACGCKGARTCLLCETTERVSK--------------------------------------------------LRV   30 (306)
T ss_pred             CCccccccCCccceeeehhhhhhhccc--------------------------------------------------cch
Confidence            55 789999999999999998433332                                                  233


Q ss_pred             cccccceee-ccccccccccCccccccC-CCCCC-CCCCCCCCeEEeCCCCCHHHHHHHHHHhhcCCCCccccCcceecc
Q psy4937          80 SQTKETYVY-CPHCNKAWQGSDLYAYQS-HPNHS-GKSFDIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNF  156 (356)
Q Consensus        80 ~~~~~~~~~-c~~c~~~~~g~~~~~~~~-~~~h~-~~~~~ipGl~~i~dFIS~~Ee~~Ll~~id~~pW~~sqsgRR~q~y  156 (356)
                      ++.++.++| |++|.....|||+...++ |.+|+ +.++++|||.+++||||++||++|++.||..||.+||||||||+|
T Consensus        31 ~e~~~a~~y~fIyc~i~~~gwd~~~e~~d~e~~~~d~~~p~pG~~lie~Fls~~Eea~l~~~~D~~pW~~SQSGRRKQdy  110 (306)
T KOG3959|consen   31 VEDKHANYYVFIYCHIQSSGWDIIDESTDCESVSTDGSIPIPGLTLIENFLSESEEAKLLNMIDTVPWAQSQSGRRKQDY  110 (306)
T ss_pred             hhccccceEEEEEechhhcccccccccccccccccCCccccCCeeehhhhhccchHhHHHHHhccCchhhhccccccccc
Confidence            456778888 999998888999987665 67766 578999999999999999999999999999999999999999999


Q ss_pred             CCccccCCCcccCCCCCCCcHHHHHHHHHHhhcccCCCCCCcceeeeEecCCCCCCccCCCCCcccccCceEEEecCCce
Q psy4937         157 GPKCNFKKQKIKLGDFQGFPACTKFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDS  236 (356)
Q Consensus       157 G~~~nyk~~kl~~g~f~~lP~~l~~l~~Ri~~~~~l~~f~~~~q~~leY~pg~G~~I~pHvDd~~~~Ge~IaslSLgs~~  236 (356)
                      |||+|||++|++.+.|.|||.+...|++||...+++.+|+|++||+|||.|.+|++|+||+||.|+||+|+++++++++.
T Consensus       111 GPKvNFkk~Klkt~~F~G~P~~~~~v~rrm~~yp~l~gfqp~EqCnLeYep~kgsaIdpH~DD~WiWGeRlv~~n~l~d~  190 (306)
T KOG3959|consen  111 GPKVNFKKKKLKTDTFVGMPEYADMVLRRMSEYPVLKGFQPFEQCNLEYEPVKGSAIDPHQDDMWIWGERLVRSNRLFDF  190 (306)
T ss_pred             CCccchhhhhhccCcccCCchHHHHHHHHhhccchhhccCcHHHcCcccccccCCccCccccchhhhhhheeehhhccHH
Confidence            99999999999999999999999999999999999999999999999999999999999999999999999999999999


Q ss_pred             EEEEeecCCC-CCCcCCCCCccccCCCCCCcccccccccccccccccccccccCCceEEEEecCCCeEEEecchhhcccc
Q psy4937         237 VLTLTYYNGD-SNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPARSLLVLWGEARYEWQ  315 (356)
Q Consensus       237 vm~f~~~~~~-~~~~~l~~~~~~~~~~e~d~k~~~~~~~~~~~~~~~~~~~~~~~~~~v~I~Lp~rSLLVMsG~ARy~W~  315 (356)
                      |+++.+++-. +++|||+++.++.++...+            .+.+.+++..++ +..+.|+||+||||||+|+|||+|+
T Consensus       191 vl~lc~~e~~~sg~~nL~~~~s~~~e~l~~------------~li~~s~~~l~~-~~~~~ipmP~rSLlvl~g~aRyqwk  257 (306)
T KOG3959|consen  191 VLKLCSKECLASGIINLNTNFSESNEFLSI------------NLINGSVMTLNK-SFLCYIPMPHRSLLVLAGEARYQWK  257 (306)
T ss_pred             HHHhhhhhhhccceeeeccCccccccccch------------hhcccchhhhcc-ceEEEeecCcceeEEeechhHhhHH
Confidence            9999976543 7899999999988765432            135556666666 8899999999999999999999999


Q ss_pred             cccccCCCCCCeEEEEecccCCCCCCh----hhhHHHHHHhhhcC
Q psy4937         316 HCVLRSDVIGRRVCIAYREFTAPYMNN----EATASVFEKAKCFF  356 (356)
Q Consensus       316 H~I~~~dvr~rRVSLTfR~l~~~~~~g----~~~~~~~~~a~~f~  356 (356)
                      |+|.+++|++||||+|||+++++|.+|    ++|++|++++.+||
T Consensus       258 H~vlr~hi~~RRvcvt~RE~~~~f~~Gg~~qe~G~el~r~~~if~  302 (306)
T KOG3959|consen  258 HGVLRHHIRGRRVCVTMREAAKDFAEGGELQEKGAELIRLGNIFV  302 (306)
T ss_pred             HHHHHHhhhhceeeeeHHhhhHhhccchhhhhcchHHHHhhhhhc
Confidence            999999999999999999999999998    78999999999998


No 2  
>PF13532 2OG-FeII_Oxy_2:  2OG-Fe(II) oxygenase superfamily; PDB: 2IUW_A 3BTZ_A 3RZL_A 3RZH_A 3S5A_A 3RZG_A 3RZJ_A 3BUC_A 3H8X_A 3H8R_A ....
Probab=99.98  E-value=3.7e-32  Score=244.84  Aligned_cols=168  Identities=27%  Similarity=0.510  Sum_probs=118.2

Q ss_pred             CeEEeCCCCCHHHHHHHHHHhh-cCCCCcccc--C------cc----eeccC--CccccCCC-cccCCCCCCCcHHHHHH
Q psy4937         119 GVYIKEDFLSEAETEDVMKQLD-DLPWDKSQS--G------RR----KQNFG--PKCNFKKQ-KIKLGDFQGFPACTKFI  182 (356)
Q Consensus       119 Gl~~i~dFIS~~Ee~~Ll~~id-~~pW~~sqs--g------RR----~q~yG--~~~nyk~~-kl~~g~f~~lP~~l~~l  182 (356)
                      |++|++||||++|+++|++.+. ..+|.....  +      |+    .+|+|  ..|+|+.. ......+.+||+++..+
T Consensus         1 G~~~~~~fls~~e~~~l~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~y~~~~~~~~~~~~~~p~~l~~~   80 (194)
T PF13532_consen    1 GLYYIPNFLSEEEAAELLNELRESAPFRQPTYPMGKVYSLPRKLCGGLSWVGDGPSYRYSGKRPVRSKPWPPFPEWLSRL   80 (194)
T ss_dssp             -EEEETTSS-HHHHHHHHHHHHHHS--B-GCCCCCCECCECCE-SSEEEEEECT--CCCTCC-EECCCEBSCCHHHHHHH
T ss_pred             CEEEECCCCCHHHHHHHHHHHHhhCCCcCCeEcCCCEEccceecceeeEEECCCCCeEcCCccccCCCCCCCccHHHHHH
Confidence            8999999999999999999986 467753311  1      11    23443  55666654 44556678999999999


Q ss_pred             HHHHhhcccCCCCCCcceeee-EecCCCCCCccCCCCCccc-ccCceEEEecCCceEEEEeecCCCCCCcCCCCCccccC
Q psy4937         183 QDRFHQEDVLSHFQTIEQCTL-EYNSENGSSIDNHIDDCWI-WGERIVTVNLIGDSVLTLTYYNGDSNKYNLNSPCFSYQ  260 (356)
Q Consensus       183 ~~Ri~~~~~l~~f~~~~q~~l-eY~pg~G~~I~pHvDd~~~-~Ge~IaslSLgs~~vm~f~~~~~~~~~~~l~~~~~~~~  260 (356)
                      ++++.+...+.....+|+|++ +|.+|.  +|+||+|+.+. +|+.||+||||++++|.|+....               
T Consensus        81 ~~~~~~~~~~~~~~~~n~~liN~Y~~g~--~i~~H~D~~~~~~~~~I~slSLG~~~~~~f~~~~~---------------  143 (194)
T PF13532_consen   81 LERLVEATGIPPGWRPNQCLINYYRDGS--GIGPHSDDEEYGFGPPIASLSLGSSRVFRFRNKSD---------------  143 (194)
T ss_dssp             HHHHHHHHT-SHSS--SEEEEEEESSTT---EEEE---TTC-CCSEEEEEEEES-EEEEEEECGG---------------
T ss_pred             HHHHHHHhccccCCCCCEEEEEecCCCC--CcCCCCCcccccCCCcEEEEEEccCceEEEeeccC---------------
Confidence            999987543122246788887 999955  59999999877 89999999999999999987421               


Q ss_pred             CCCCCcccccccccccccccccccccccCCceEEEEecCCCeEEEecchhhcccccccccCCC--------CCCeEEEEe
Q psy4937         261 KKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPARSLLVLWGEARYEWQHCVLRSDV--------IGRRVCIAY  332 (356)
Q Consensus       261 ~~e~d~k~~~~~~~~~~~~~~~~~~~~~~~~~~v~I~Lp~rSLLVMsG~ARy~W~H~I~~~dv--------r~rRVSLTf  332 (356)
                                                  . ...+.+.||+||||||+|++|+.| |+|++.+.        +++||||||
T Consensus       144 ----------------------------~-~~~~~~~L~~gsl~vm~g~~r~~~-H~I~~~~~~~~~~~~~~~~RislTf  193 (194)
T PF13532_consen  144 ----------------------------D-DEPIEVPLPPGSLLVMSGEARYDW-HGIPPVKKDTHPSHYVRGRRISLTF  193 (194)
T ss_dssp             ----------------------------T-S-EEEEEE-TTEEEEEETTHHHHE-EEE-S-SCEEEESTEE-S-EEEEEE
T ss_pred             ----------------------------C-CccEEEEcCCCCEEEeChHHhhhe-eEcccccCCccccccCCCCEEEEEe
Confidence                                        2 568999999999999999999999 99999843        779999999


Q ss_pred             c
Q psy4937         333 R  333 (356)
Q Consensus       333 R  333 (356)
                      |
T Consensus       194 R  194 (194)
T PF13532_consen  194 R  194 (194)
T ss_dssp             E
T ss_pred             C
Confidence            9


No 3  
>KOG3200|consensus
Probab=99.96  E-value=1.7e-29  Score=225.13  Aligned_cols=178  Identities=20%  Similarity=0.297  Sum_probs=141.9

Q ss_pred             CCeEEeCCCCCHHHHHHHHHHhhcC---CCCccccCcceeccCCccccCCCcccCCCCCCCcHHHHHHHHHHhhcccCCC
Q psy4937         118 GGVYIKEDFLSEAETEDVMKQLDDL---PWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDRFHQEDVLSH  194 (356)
Q Consensus       118 pGl~~i~dFIS~~Ee~~Ll~~id~~---pW~~sqsgRR~q~yG~~~nyk~~kl~~g~f~~lP~~l~~l~~Ri~~~~~l~~  194 (356)
                      |-+.|||+||+++||..+++.|...   .|. ....||.|+||..+.-  ..+-+   +.+|+||+.+.+++.+.+.+++
T Consensus        12 pt~~YIPnfIt~EEe~~~lshIe~ap~pkW~-~L~NRRLqNyGGvvh~--~glip---eelP~wLq~~v~kinnlglF~s   85 (224)
T KOG3200|consen   12 PTMIYIPNFITEEEENLYLSHIENAPQPKWR-VLANRRLQNYGGVVHK--TGLIP---EELPPWLQYYVDKINNLGLFKS   85 (224)
T ss_pred             ceEEEcCCccChHHHHHHHHHHhcCCCchhH-HHHhhhhhhcCCcccc--CCcCc---cccCHHHHHHHHHhhcccccCC
Confidence            4679999999999999999999874   588 4568999999987762  22221   6899999999999999888765


Q ss_pred             CCCcceeee-EecCCCCCCccCCCCCcccccCceEEEecCCceEEEEeecCCCCCCcCCCCCccccCCCCCCcccccccc
Q psy4937         195 FQTIEQCTL-EYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVND  273 (356)
Q Consensus       195 f~~~~q~~l-eY~pg~G~~I~pHvDd~~~~Ge~IaslSLgs~~vm~f~~~~~~~~~~~l~~~~~~~~~~e~d~k~~~~~~  273 (356)
                        ++|++++ ||.||+|  |+||.|+ ++|-+.|++|||||.+||.|......      +....++      +     + 
T Consensus        86 --~~NHVLVNeY~pgqG--ImPHtDG-Paf~piVstiSlGsh~vldf~~p~r~------e~~d~te------~-----~-  142 (224)
T KOG3200|consen   86 --PANHVLVNEYLPGQG--IMPHTDG-PAFHPIVSTISLGSHTVLDFYDPVRQ------EVNDGTE------S-----K-  142 (224)
T ss_pred             --CcceeEeecccCCCC--cCcCCCC-CcccceEEEEecCCceEEeccccccc------ccCCccc------c-----C-
Confidence              6787776 9999998  9999999 67789999999999999999874321      0000000      0     0 


Q ss_pred             cccccccccccccccCCceEEEEecCCCeEEEecchhhcccccccccC--C---------------------CCCCeEEE
Q psy4937         274 LSLNEKSTDSIVNETLSSVVIRIPMPARSLLVLWGEARYEWQHCVLRS--D---------------------VIGRRVCI  330 (356)
Q Consensus       274 ~~~~~~~~~~~~~~~~~~~~v~I~Lp~rSLLVMsG~ARy~W~H~I~~~--d---------------------vr~rRVSL  330 (356)
                                  .... .....+.|.+|||||+.|+|+.++.|||...  |                     +++.||||
T Consensus       143 ------------dqp~-R~~fsllleprslLilkd~aYtd~LHgIs~s~~d~l~~~~sna~ac~s~k~Gd~lvr~tRvSL  209 (224)
T KOG3200|consen  143 ------------DQPL-RYLFSLLLEPRSLLILKDDAYTDFLHGISDSPTDCLNQVVSNALACSSRKDGDKLVRQTRVSL  209 (224)
T ss_pred             ------------CCCc-cceeeeeeccceEEEEcCcHHHHHHhhcccChHHHHHHHhhhhhhccccCCcceeeecceeEE
Confidence                        0012 4578899999999999999999999999864  2                     49999999


Q ss_pred             EecccCC
Q psy4937         331 AYREFTA  337 (356)
Q Consensus       331 TfR~l~~  337 (356)
                      |+|.+..
T Consensus       210 TiR~VPk  216 (224)
T KOG3200|consen  210 TIRLVPK  216 (224)
T ss_pred             EEecchH
Confidence            9999865


No 4  
>PRK15401 alpha-ketoglutarate-dependent dioxygenase AlkB; Provisional
Probab=99.94  E-value=4.1e-26  Score=211.65  Aligned_cols=172  Identities=17%  Similarity=0.232  Sum_probs=131.7

Q ss_pred             CCCCCeEEeCCCCCHHHHHHHHHHhhc----CCCCcccc--C-----cce-----eccCC--ccccCCCc-ccCCCCCCC
Q psy4937         115 FDIGGVYIKEDFLSEAETEDVMKQLDD----LPWDKSQS--G-----RRK-----QNFGP--KCNFKKQK-IKLGDFQGF  175 (356)
Q Consensus       115 ~~ipGl~~i~dFIS~~Ee~~Ll~~id~----~pW~~sqs--g-----RR~-----q~yG~--~~nyk~~k-l~~g~f~~l  175 (356)
                      ...||..++++|. .++++.|++.+.+    .||..-..  |     |.+     .|||-  .|.|.... ....+|.+|
T Consensus        15 ~~~~g~~~~~~~~-~~~~~~l~~~~~~~~~~~p~~~~~~~gg~~msv~mt~~G~~~W~~d~~~YrYs~~~~~~~~pwp~~   93 (213)
T PRK15401         15 PLAPGAVLLRGFA-LAAAEALLAAIEAVAAQAPFRHMVTPGGYTMSVAMTNCGALGWVTDRRGYRYSPIDPLTGKPWPAM   93 (213)
T ss_pred             ecCCCcEEeCCCC-HHHHHHHHHHHHHHHhcCCccceecCCCCcceeEEeccccceEecCCCCcccCCcCCCCCCCCCCc
Confidence            4578999999995 8899999988754    79975322  2     222     45542  34454332 333467788


Q ss_pred             cHHHHHHHHHHhhcccCCCCCCcceeee-EecCCCCCCccCCCC-CcccccCceEEEecCCceEEEEeecCCCCCCcCCC
Q psy4937         176 PACTKFIQDRFHQEDVLSHFQTIEQCTL-EYNSENGSSIDNHID-DCWIWGERIVTVNLIGDSVLTLTYYNGDSNKYNLN  253 (356)
Q Consensus       176 P~~l~~l~~Ri~~~~~l~~f~~~~q~~l-eY~pg~G~~I~pHvD-d~~~~Ge~IaslSLgs~~vm~f~~~~~~~~~~~l~  253 (356)
                      |+++..|.+++....++..| .+|+|++ +|+||.  +|+||.| ++..+++.||+||||++++|.|++...        
T Consensus        94 P~~l~~L~~~~~~~~~~~~~-~p~a~LvN~Y~~G~--~mg~H~D~~E~~~~~pI~SvSLG~~~~F~~~~~~~--------  162 (213)
T PRK15401         94 PASFLALAQRAAAAAGFPGF-QPDACLINRYAPGA--KLSLHQDKDERDFRAPIVSVSLGLPAVFQFGGLKR--------  162 (213)
T ss_pred             hHHHHHHHHHHHHHcCCCCC-CCCEEEEEeccCcC--ccccccCCCcccCCCCEEEEeCCCCeEEEecccCC--------
Confidence            88999999999877655444 5688887 999954  6999999 466788999999999999999987421        


Q ss_pred             CCccccCCCCCCcccccccccccccccccccccccCCceEEEEecCCCeEEEecchhhcccccccccCC------CCCCe
Q psy4937         254 SPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPARSLLVLWGEARYEWQHCVLRSD------VIGRR  327 (356)
Q Consensus       254 ~~~~~~~~~e~d~k~~~~~~~~~~~~~~~~~~~~~~~~~~v~I~Lp~rSLLVMsG~ARy~W~H~I~~~d------vr~rR  327 (356)
                                                         . ....+|.|++||||||.|++|+.| |+|++..      +.+.|
T Consensus       163 -----------------------------------~-~~~~~l~L~~Gdllvm~G~sr~~~-HgVp~~~~~~~p~~g~~R  205 (213)
T PRK15401        163 -----------------------------------S-DPLQRILLEHGDVVVWGGPSRLRY-HGILPLKAGEHPLTGECR  205 (213)
T ss_pred             -----------------------------------C-CceEEEEeCCCCEEEECchHhhee-ccCCcCCCCcCCCCCCCe
Confidence                                               1 446799999999999999999855 9999863      13489


Q ss_pred             EEEEeccc
Q psy4937         328 VCIAYREF  335 (356)
Q Consensus       328 VSLTfR~l  335 (356)
                      ||||||.+
T Consensus       206 INLTFR~~  213 (213)
T PRK15401        206 INLTFRKA  213 (213)
T ss_pred             EEEEeEcC
Confidence            99999974


No 5  
>TIGR00568 alkb DNA alkylation damage repair protein AlkB. Proteins in this family have an as of yet undetermined function in the repair of alkylation damage to DNA. Alignment and family designation based on phylogenomic analysis of Jonathan A. Eisen (PhD Thesis, Stanford University, 1999).
Probab=99.89  E-value=2.6e-22  Score=180.59  Aligned_cols=147  Identities=14%  Similarity=0.252  Sum_probs=113.2

Q ss_pred             CCCCCHHHHHHHHHHhhc----CCCCcccc--Cc-----cee-----cc--CCccccCCCcccCC-CCCCCcHHHHHHHH
Q psy4937         124 EDFLSEAETEDVMKQLDD----LPWDKSQS--GR-----RKQ-----NF--GPKCNFKKQKIKLG-DFQGFPACTKFIQD  184 (356)
Q Consensus       124 ~dFIS~~Ee~~Ll~~id~----~pW~~sqs--gR-----R~q-----~y--G~~~nyk~~kl~~g-~f~~lP~~l~~l~~  184 (356)
                      .+|+..+++..|++++..    .||.+...  |+     |.+     ||  |..|+|+.+..... ++.+||++|..|.+
T Consensus         2 ~~~~~~~~~~~l~~~~~~~~~~~~w~~~~~~~gk~~~~pr~~~~~l~W~~~g~~Y~ys~~~~~~~~~~p~~P~~L~~L~~   81 (169)
T TIGR00568         2 KRYFAFNAQEQLIRDINDVASQDPFRQYVTPGGYTMSVAMTNLGKLGWTTHGQGYLYSPKDPQTNKPWPAMPQDLGDLCE   81 (169)
T ss_pred             CCccChHHHHHHHHHHHHHhhcCCCcCeEecCCeEeeehhhhcccceEEcCCCcccCCCcccCCCCCCCCCCHHHHHHHH
Confidence            578888899999987644    79987533  42     222     77  57788887766333 34559999999999


Q ss_pred             HHhhcccCCCCCCcceeee-EecCCCCCCccCCCCC-cccccCceEEEecCCceEEEEeecCCCCCCcCCCCCccccCCC
Q psy4937         185 RFHQEDVLSHFQTIEQCTL-EYNSENGSSIDNHIDD-CWIWGERIVTVNLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKK  262 (356)
Q Consensus       185 Ri~~~~~l~~f~~~~q~~l-eY~pg~G~~I~pHvDd-~~~~Ge~IaslSLgs~~vm~f~~~~~~~~~~~l~~~~~~~~~~  262 (356)
                      +++...++..+ ++|+|++ +|+||  ++|+||+|+ +..+++.||+||||++++|.|++...                 
T Consensus        82 ~v~~~~g~~~~-~~n~~LvN~Y~~G--d~mg~H~D~~e~~~~~pI~SvSLG~~r~F~~~~~~~-----------------  141 (169)
T TIGR00568        82 RVATAAGFPDF-QPDACLVNRYAPG--ATLSLHQDRDEPDLRAPLLSVSLGLPAIFLIGGLKR-----------------  141 (169)
T ss_pred             HHHHHhCCCCC-CCCEEEEEeecCC--CccccccccccccCCCCEEEEeCCCCEEEEecCCcC-----------------
Confidence            99877654443 6788887 99995  569999996 22245899999999999999987531                 


Q ss_pred             CCCcccccccccccccccccccccccCCceEEEEecCCCeEEEecchhhccccccc
Q psy4937         263 YPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPARSLLVLWGEARYEWQHCV  318 (356)
Q Consensus       263 e~d~k~~~~~~~~~~~~~~~~~~~~~~~~~~v~I~Lp~rSLLVMsG~ARy~W~H~I  318 (356)
                                                . ....+|.|++||||||+|++|+.| |||
T Consensus       142 --------------------------~-~~~~~l~L~sGsllvM~G~sR~~~-Hgv  169 (169)
T TIGR00568       142 --------------------------N-DPPKRLRLHSGDVVIMGGESRLAF-HGV  169 (169)
T ss_pred             --------------------------C-CceEEEEeCCCCEEEECCchhccc-cCC
Confidence                                      2 446889999999999999999966 987


No 6  
>KOG4176|consensus
Probab=99.83  E-value=3.2e-20  Score=181.45  Aligned_cols=181  Identities=20%  Similarity=0.338  Sum_probs=147.9

Q ss_pred             CCCCC-eEEeCCCCCHHHHHHHHHHhhcCCCCc--cccCcceeccCCccccCCCcccCCC-CCCCcHHHHHHHHHHhhcc
Q psy4937         115 FDIGG-VYIKEDFLSEAETEDVMKQLDDLPWDK--SQSGRRKQNFGPKCNFKKQKIKLGD-FQGFPACTKFIQDRFHQED  190 (356)
Q Consensus       115 ~~ipG-l~~i~dFIS~~Ee~~Ll~~id~~pW~~--sqsgRR~q~yG~~~nyk~~kl~~g~-f~~lP~~l~~l~~Ri~~~~  190 (356)
                      ...|| +.+++||+++.+++.|+..+....|..  +..+|++.+||..++|..+.++... ..++|..++.+++|+..+.
T Consensus       124 ~~~~~e~~~~~d~V~el~e~~l~~~~~~e~~~~~~~gk~R~~iq~G~~f~y~~~~~d~~~~~~piPs~~~~ii~rlv~~~  203 (323)
T KOG4176|consen  124 VFIPGELSLIVDFVTELEEKGLIGALVDETFTYQESGKHREVIQLGYPFDYRTNNVDESKPVDPIPSLFKSIIDRLVSWR  203 (323)
T ss_pred             ccChhhceehhhhhhhhHHhhhhcccccccceeeccccceeeeecCceeccCCCcccccCccCCCchHHHHHHHHhhhhc
Confidence            34455 999999999999999999998766664  3458999999999999888877543 5789999999999999998


Q ss_pred             cCCCCCCcceeee-EecCCCCCCccCCCCCcccccCceEEEecCCceEEEEeecCCCCCCcCCCCCccccCCCCCCcccc
Q psy4937         191 VLSHFQTIEQCTL-EYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKST  269 (356)
Q Consensus       191 ~l~~f~~~~q~~l-eY~pg~G~~I~pHvDd~~~~Ge~IaslSLgs~~vm~f~~~~~~~~~~~l~~~~~~~~~~e~d~k~~  269 (356)
                      .+..  .||||++ .|.||.|  |.||+|. ..|++.|++|||+|+++|.|++.....                      
T Consensus       204 ~ip~--~pd~~~iN~Ye~G~~--i~ph~~~-~~F~~Pi~slS~lSe~~m~Fg~~~~~~----------------------  256 (323)
T KOG4176|consen  204 VIPE--RPDQCTINFYEPGDG--IPPHIDH-SAFLDPISSLSFLSECTMEFGHGLLSD----------------------  256 (323)
T ss_pred             cCCC--CCCeeEEEeeCCCCC--CCCCCCh-HHhcCceEEEEeecceeEEeccccccc----------------------
Confidence            8876  5899998 9999666  9999955 789999999999999999999963210                      


Q ss_pred             cccccccccccccccccccCCceEEEEecCCCeEEEecchhhcccccccccCCCCCCeEEEEecccCCCCC
Q psy4937         270 IVNDLSLNEKSTDSIVNETLSSVVIRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYM  340 (356)
Q Consensus       270 ~~~~~~~~~~~~~~~~~~~~~~~~v~I~Lp~rSLLVMsG~ARy~W~H~I~~~dvr~rRVSLTfR~l~~~~~  340 (356)
                                    +...+. + ..+++++.||++||.|.+---=.|.++.  ++..|||||||.+.+.-.
T Consensus       257 --------------~~~~~~-g-~~s~p~~~g~~lvi~~~~ad~~~~~~~~--~~~kRisitfrki~~~~~  309 (323)
T KOG4176|consen  257 --------------NIGNFR-G-SLSLPLRYGSVLVIRGRSADVAPHCIRP--SRNKRISITFRKIRPDPC  309 (323)
T ss_pred             --------------Cccccc-c-ccccccccCeEEEeCCCcccccccccCC--CCCceEEEEEEEeccCCC
Confidence                          001111 2 5678888888888888887777888887  799999999999987544


No 7  
>COG3145 AlkB Alkylated DNA repair protein [DNA replication, recombination, and repair]
Probab=99.80  E-value=5.5e-19  Score=161.72  Aligned_cols=167  Identities=18%  Similarity=0.282  Sum_probs=118.2

Q ss_pred             CCCCCeEEeCCCCCHHHHHHHHHH----hhcCCCCccc---cCcc--ee-ccCCccc---cCCCcccCCCCCCCcHHHHH
Q psy4937         115 FDIGGVYIKEDFLSEAETEDVMKQ----LDDLPWDKSQ---SGRR--KQ-NFGPKCN---FKKQKIKLGDFQGFPACTKF  181 (356)
Q Consensus       115 ~~ipGl~~i~dFIS~~Ee~~Ll~~----id~~pW~~sq---sgRR--~q-~yG~~~n---yk~~kl~~g~f~~lP~~l~~  181 (356)
                      ...+|+.+.++|+ -.++.+|++.    +...||....   .|+.  ++ .+|+..+   |.+......+..+||..+..
T Consensus        12 ~~~~G~~~~~~~~-~~~~~~l~~~l~~~~~~~P~~~~~~~~~g~~~sV~r~~~W~~d~~gy~y~~~~p~~~~p~p~l~~~   90 (194)
T COG3145          12 QLAPGAVILPGFL-LLTQGALVAALLFLLSQAPWFRPRRTPYGKPMSVPRLLGWVTDRRGYRYSLRSPLTGKPWPPLLAL   90 (194)
T ss_pred             cCCCCeEEEeccc-ccchHHHHHHHHHhcccCcccceeecCCCcEeeeeeccceecccccccccccccCCCCCCCccHHH
Confidence            4567999999999 4444455444    4558987421   1211  11 4666555   66666655555566765555


Q ss_pred             HHHHHhhcccCCCCCCcceeee-EecCCCCCCccCCCCCccccc-CceEEEecCCceEEEEeecCCCCCCcCCCCCcccc
Q psy4937         182 IQDRFHQEDVLSHFQTIEQCTL-EYNSENGSSIDNHIDDCWIWG-ERIVTVNLIGDSVLTLTYYNGDSNKYNLNSPCFSY  259 (356)
Q Consensus       182 l~~Ri~~~~~l~~f~~~~q~~l-eY~pg~G~~I~pHvDd~~~~G-e~IaslSLgs~~vm~f~~~~~~~~~~~l~~~~~~~  259 (356)
                      ..+...+.+.  .+..+++|++ +|+|  |++|+||.|.....+ +.||+||||++++|.|++...              
T Consensus        91 ~~~~~~~~g~--~~~~~ea~Lvn~Y~p--Gd~ig~HqD~~e~~~~~~v~slSLg~~~~F~~~~~~r--------------  152 (194)
T COG3145          91 FHDLFGAAGY--PFEGPEAVLVNRYRP--GASIGWHQDKDEEDDRPPVASLSLGAPCIFRLRGRRR--------------  152 (194)
T ss_pred             HHHHHHHhcC--CCCChhheeEEeccC--CCccccccccccccCCCceEEEecCCCeEEEeccccC--------------
Confidence            5555554432  3457888987 9999  477999999865443 369999999999999998632              


Q ss_pred             CCCCCCcccccccccccccccccccccccCCceEEEEecCCCeEEEecchhhcccccccccCC-CCCCeEEE
Q psy4937         260 QKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPARSLLVLWGEARYEWQHCVLRSD-VIGRRVCI  330 (356)
Q Consensus       260 ~~~e~d~k~~~~~~~~~~~~~~~~~~~~~~~~~~v~I~Lp~rSLLVMsG~ARy~W~H~I~~~d-vr~rRVSL  330 (356)
                                                   . +...++.|.+|||+||.|++|+.|.|.|+++. ....||||
T Consensus       153 -----------------------------~-~~~~~~~L~~Gdvvvm~G~~r~~~~h~~p~~~~~~~~Rinl  194 (194)
T COG3145         153 -----------------------------R-GPGLRLRLEHGDVVVMGGPSRLAWHHIIPKTSRLTGQRINL  194 (194)
T ss_pred             -----------------------------C-CCceeEEecCCCEEEecCCccccccccccccccCCcccccC
Confidence                                         2 56789999999999999999999999999873 34477764


No 8  
>smart00702 P4Hc Prolyl 4-hydroxylase alpha subunit homologues. Mammalian enzymes catalyse hydroxylation of collagen, for example. Prokaryotic enzymes might catalyse hydroxylation of antibiotic peptides. These are 2-oxoglutarate-dependent dioxygenases, requiring 2-oxoglutarate and dioxygen as cosubstrates and ferrous iron as a cofactor.
Probab=98.14  E-value=9.5e-05  Score=65.99  Aligned_cols=168  Identities=21%  Similarity=0.199  Sum_probs=91.8

Q ss_pred             CCeEEeCCCCCHHHHHHHHHHhhcCCCCccccCcceeccCCccccCCCcccCCCCCC--CcHHHHHHHHHHhhcccCCC-
Q psy4937         118 GGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQG--FPACTKFIQDRFHQEDVLSH-  194 (356)
Q Consensus       118 pGl~~i~dFIS~~Ee~~Ll~~id~~pW~~sqsgRR~q~yG~~~nyk~~kl~~g~f~~--lP~~l~~l~~Ri~~~~~l~~-  194 (356)
                      |++++++||||++|.+.|++.....-|. +...+....=+..-+|+.....  .+..  -++....|.+|+.+...+.. 
T Consensus         1 P~i~~~~~~ls~~ec~~li~~~~~~~~~-~~~~~~~~~~~~~~~~R~~~~~--~l~~~~~~~~~~~l~~~i~~~~~~~~~   77 (178)
T smart00702        1 PGVVVFHDFLSPAECQKLLEEAEPLGWR-GEVTRGDTNPNHDSKYRQSNGT--WLELLKGDLVIERIRQRLADFLGLLRG   77 (178)
T ss_pred             CcEEEECCCCCHHHHHHHHHHhhhhccc-ceeecCCCCccccCCCEeecce--ecCCCCCCHHHHHHHHHHHHHHCCCch
Confidence            7899999999999999999998876664 3221111100011122222211  1122  15667788888887655431 


Q ss_pred             -CCCcceeee-EecCCCCCCccCCCCCcccc--cCceEEEecCCceEEEEeecCCCCCCcCCCCCccccCCCCCCccccc
Q psy4937         195 -FQTIEQCTL-EYNSENGSSIDNHIDDCWIW--GERIVTVNLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTI  270 (356)
Q Consensus       195 -f~~~~q~~l-eY~pg~G~~I~pHvDd~~~~--Ge~IaslSLgs~~vm~f~~~~~~~~~~~l~~~~~~~~~~e~d~k~~~  270 (356)
                       ....+.+.+ .|.+|  ....+|.|....-  |.+++++.+          +=..        .      .       .
T Consensus        78 ~~~~~~~~~~~~Y~~g--~~~~~H~D~~~~~~~~~r~~T~~~----------yLn~--------~------~-------~  124 (178)
T smart00702       78 LPLSAEDAQVARYGPG--GHYGPHVDNFEDDENGDRIATFLL----------YLND--------V------E-------E  124 (178)
T ss_pred             hhccCcceEEEEECCC--CcccCcCCCCCCCCCCCeEEEEEE----------Eecc--------C------C-------c
Confidence             122344555 99994  5699999975321  233433332          2000        0      0       0


Q ss_pred             ccccccccccccccccccCCceEEEEecCCCeEEEecchhhcccccccccCCCCCCeEEEEe
Q psy4937         271 VNDLSLNEKSTDSIVNETLSSVVIRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAY  332 (356)
Q Consensus       271 ~~~~~~~~~~~~~~~~~~~~~~~v~I~Lp~rSLLVMsG~ARy~W~H~I~~~dvr~rRVSLTf  332 (356)
                      .|.+..   ...      .......|.-..|+++++...-. ...|++.+.. .+.|++++.
T Consensus       125 GG~~~f---~~~------~~~~~~~v~P~~G~~v~f~~~~~-~~~H~v~pv~-~G~r~~~~~  175 (178)
T smart00702      125 GGELVF---PGL------GLMVCATVKPKKGDLLFFPSGRG-RSLHGVCPVT-RGSRWAITG  175 (178)
T ss_pred             CceEEe---cCC------CCccceEEeCCCCcEEEEeCCCC-CccccCCcce-eCCEEEEEE
Confidence            010000   000      00123567778899999874322 4559988842 488999885


No 9  
>PF03171 2OG-FeII_Oxy:  2OG-Fe(II) oxygenase superfamily Entry for Lysyl hydrolases This Prosite entry is a sub-family of the Pfam entry;  InterPro: IPR005123 This domain is found in members of the 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily [], as well as the C-terminal of prolyl 4-hydroxylase alpha subunit. The holoenzyme has the activity (1.14.11.2 from EC) catalysing the reaction:   Procollagen L-proline + 2-oxoglutarate + O2 = procollagen trans-4-hydroxy-L-proline + succinate + CO2.   The full enzyme consists of a alpha2 beta2 complex with the alpha subunit contributing most of the parts of the active site []. The family also includes lysyl hydrolases, isopenicillin synthases and AlkB. ; GO: 0016491 oxidoreductase activity, 0016706 oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen, 2-oxoglutarate as one donor, and incorporation of one atom each of oxygen into both donors, 0055114 oxidation-reduction process; PDB: 3ON7_D 1BK0_A 1IPS_B 1QIQ_A 1OC1_A 2Y86_A 2Y60_A 1W03_A 2VE1_A 1QJF_A ....
Probab=98.04  E-value=7.3e-06  Score=66.08  Aligned_cols=83  Identities=24%  Similarity=0.352  Sum_probs=47.0

Q ss_pred             ceeee-Eec-CCCCCCccCCCCCcccccCceEEEecC-CceEEEEeecCCCCCCcCCCCCccccCCCCCCcccccccccc
Q psy4937         199 EQCTL-EYN-SENGSSIDNHIDDCWIWGERIVTVNLI-GDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLS  275 (356)
Q Consensus       199 ~q~~l-eY~-pg~G~~I~pHvDd~~~~Ge~IaslSLg-s~~vm~f~~~~~~~~~~~l~~~~~~~~~~e~d~k~~~~~~~~  275 (356)
                      +++.+ +|. +.+|.++++|.|.+    ..+++|.+. ...-|.|...                                
T Consensus         2 ~~~~~~~Y~~~~~~~~~~~H~D~~----~~~~Til~~~~~~gL~~~~~--------------------------------   45 (98)
T PF03171_consen    2 SQLRLNRYPPPENGVGIGPHTDDE----DGLLTILFQDEVGGLQVRDD--------------------------------   45 (98)
T ss_dssp             -EEEEEEE-SCCGCEEEEEEEES------SSEEEEEETSTS-EEEEET--------------------------------
T ss_pred             CEEEEEECCCcccCCceeCCCcCC----CCeEEEEecccchheecccc--------------------------------
Confidence            45666 999 55566799999985    567777776 6666666653                                


Q ss_pred             cccccccccccccCCceEEEEecCCCeEEEecch--------hhcccccccccCCCCCCeEEEEec
Q psy4937         276 LNEKSTDSIVNETLSSVVIRIPMPARSLLVLWGE--------ARYEWQHCVLRSDVIGRRVCIAYR  333 (356)
Q Consensus       276 ~~~~~~~~~~~~~~~~~~v~I~Lp~rSLLVMsG~--------ARy~W~H~I~~~dvr~rRVSLTfR  333 (356)
                                     +..+.+...+++++|+-|+        .+..+.|+|.+.. .+.|+|+||+
T Consensus        46 ---------------~~~~~v~~~~~~~~v~~G~~l~~~t~g~~~~~~HrV~~~~-~~~R~s~~~f   95 (98)
T PF03171_consen   46 ---------------GEWVDVPPPPGGFIVNFGDALEILTNGRYPATLHRVVPPT-EGERYSLTFF   95 (98)
T ss_dssp             ---------------TEEEE----TTCEEEEEBHHHHHHTTTSS----EEEE--S-TS-EEEEEEE
T ss_pred             ---------------ccccCccCccceeeeeceeeeecccCCccCCceeeeEcCC-CCCEEEEEEE
Confidence                           1234444444455555555        8899999999965 7999999996


No 10 
>PRK05467 Fe(II)-dependent oxygenase superfamily protein; Provisional
Probab=97.90  E-value=0.00017  Score=68.15  Aligned_cols=165  Identities=19%  Similarity=0.185  Sum_probs=91.1

Q ss_pred             eEEeCCCCCHHHHHHHHHHhhcCCCCccccCcceeccCCccccCCCcccCCCCCCCcHHHH----HHHHHHhhcccCCCC
Q psy4937         120 VYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTK----FIQDRFHQEDVLSHF  195 (356)
Q Consensus       120 l~~i~dFIS~~Ee~~Ll~~id~~pW~~sqsgRR~q~yG~~~nyk~~kl~~g~f~~lP~~l~----~l~~Ri~~~~~l~~f  195 (356)
                      ++.|+++||++|-++|++.++..+|..   |+.+..++.+ .+|.+ .+..  ..-| ..+    .|.+++.+.+.+..+
T Consensus         2 i~~I~~vLs~eec~~~~~~le~~~~~d---g~~taG~~~~-~vKnN-~ql~--~d~~-~a~~l~~~i~~~L~~~~l~~sa   73 (226)
T PRK05467          2 LLHIPDVLSPEEVAQIRELLDAAEWVD---GRVTAGAQAA-QVKNN-QQLP--EDSP-LARELGNLILDALTRNPLFFSA   73 (226)
T ss_pred             eeeecccCCHHHHHHHHHHHHhcCCcc---CCcCcCccch-hcccc-cccC--CCCH-HHHHHHHHHHHHHhcCchhhhh
Confidence            578999999999999999999999983   6666655421 22221 1111  1222 333    344455444443222


Q ss_pred             CCcc----eeeeEecCCCCCCccCCCCCcccccCceEEEecCCceEEEEeecCCCCCCcCCCCCccccCCCCCCcccccc
Q psy4937         196 QTIE----QCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIV  271 (356)
Q Consensus       196 ~~~~----q~~leY~pg~G~~I~pHvDd~~~~Ge~IaslSLgs~~vm~f~~~~~~~~~~~l~~~~~~~~~~e~d~k~~~~  271 (356)
                      .-+.    -..+.|.+|+  ..+||+|....-.+. -  .-.--+.++|.-+=.+        ...|+           .
T Consensus        74 ~lp~~i~~~~f~rY~~G~--~y~~H~D~~~~~~~~-~--~~~~rs~lS~~lyLnd--------~~~ye-----------G  129 (226)
T PRK05467         74 ALPRKIHPPLFNRYEGGM--SYGFHVDNAVRSLPG-T--GGRVRTDLSATLFLSD--------PDDYD-----------G  129 (226)
T ss_pred             ccccccccceEEEECCCC--ccCccccCCcccCCC-C--CcceeEEEEEEEEeCC--------CCCCc-----------C
Confidence            2111    2334999965  599999984321000 0  0000133444433110        01110           0


Q ss_pred             cccccccccccccccccCCceEEEEecCCCeEEEecchhhcccccccccCCCCCCeEEEEe
Q psy4937         272 NDLSLNEKSTDSIVNETLSSVVIRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAY  332 (356)
Q Consensus       272 ~~~~~~~~~~~~~~~~~~~~~~v~I~Lp~rSLLVMsG~ARy~W~H~I~~~dvr~rRVSLTf  332 (356)
                      |-...         .. . .....|.+|.|++++....    -.|++.+. .++.|+++++
T Consensus       130 GEl~~---------~~-~-~g~~~Vkp~aG~~vlfps~----~lH~v~pV-t~G~R~~~~~  174 (226)
T PRK05467        130 GELVI---------ED-T-YGEHRVKLPAGDLVLYPST----SLHRVTPV-TRGVRVASFF  174 (226)
T ss_pred             CceEE---------ec-C-CCcEEEecCCCeEEEECCC----Cceeeeec-cCccEEEEEe
Confidence            10110         00 1 2246789999999999864    55999885 3899999998


No 11 
>KOG2731|consensus
Probab=96.38  E-value=0.0046  Score=61.70  Aligned_cols=116  Identities=10%  Similarity=0.108  Sum_probs=77.0

Q ss_pred             CCccccCCCc----ccCCCCCCCcHHHHHHHHHHhhcccCCCCCCcceee-eEecCCCCCCccCCCCC-cccccCceEEE
Q psy4937         157 GPKCNFKKQK----IKLGDFQGFPACTKFIQDRFHQEDVLSHFQTIEQCT-LEYNSENGSSIDNHIDD-CWIWGERIVTV  230 (356)
Q Consensus       157 G~~~nyk~~k----l~~g~f~~lP~~l~~l~~Ri~~~~~l~~f~~~~q~~-leY~pg~G~~I~pHvDd-~~~~Ge~Iasl  230 (356)
                      |..+|...+.    +........|..+.....++....++.++...+-++ +.|.+  |+.++-|.|. +..+++.+.+.
T Consensus       171 G~~~dw~s~~~~~~~s~k~~~~~~~ll~~~~~~~~~a~~~~~~~~~~Gli~nYlsi--~~tl~ih~d~reld~~~pf~s~  248 (378)
T KOG2731|consen  171 GNQYDWSSKDIFIFLSKKHYNIKPSLLGLLREKVKAAKGFSHIVIRPGLIKNYLSI--DDTLGIHLDCRELDLSKPFYSP  248 (378)
T ss_pred             ccccCCccccccccccccCCCCChHHhhhhhhhhhhhcCccceeccCcceeeeccc--CcEEEEEeehhhcccCCccccc
Confidence            5555555555    211111234455556666666655555554444444 48999  4569999996 55667789999


Q ss_pred             ecCCceEEEEeecCCCCCCcCCCCCccccCCCCCCcccccccccccccccccccccccCCceEEEEecCCCeEEEecchh
Q psy4937         231 NLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPARSLLVLWGEA  310 (356)
Q Consensus       231 SLgs~~vm~f~~~~~~~~~~~l~~~~~~~~~~e~d~k~~~~~~~~~~~~~~~~~~~~~~~~~~v~I~Lp~rSLLVMsG~A  310 (356)
                      |||-++++-+.-...                                           . .....+.|..|..++|.|.+
T Consensus       249 s~g~~ai~lLg~m~l-------------------------------------------~-e~p~p~~lrsGdv~im~Gfs  284 (378)
T KOG2731|consen  249 SLGQGAILLLGMMCL-------------------------------------------G-ENPDPMTLRSGDVVIMDGFS  284 (378)
T ss_pred             cccccceeeeccccc-------------------------------------------C-CCCCccccccCceEeecchH
Confidence            999999988764321                                           0 12344788999999999999


Q ss_pred             hcccccccc
Q psy4937         311 RYEWQHCVL  319 (356)
Q Consensus       311 Ry~W~H~I~  319 (356)
                      |.-+ |+|+
T Consensus       285 rlv~-haIp  292 (378)
T KOG2731|consen  285 RLVE-HAIP  292 (378)
T ss_pred             HHHh-hccc
Confidence            9776 9999


No 12 
>PLN00052 prolyl 4-hydroxylase; Provisional
Probab=95.35  E-value=0.14  Score=50.68  Aligned_cols=94  Identities=18%  Similarity=0.296  Sum_probs=53.2

Q ss_pred             CCCeEEeCCCCCHHHHHHHHHHhhcCCCCccccCcceecc--CCccccCCCcccCCCC-C-CCcHHHHHHHHHHhhcccC
Q psy4937         117 IGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNF--GPKCNFKKQKIKLGDF-Q-GFPACTKFIQDRFHQEDVL  192 (356)
Q Consensus       117 ipGl~~i~dFIS~~Ee~~Ll~~id~~pW~~sqsgRR~q~y--G~~~nyk~~kl~~g~f-~-~lP~~l~~l~~Ri~~~~~l  192 (356)
                      -|-+++++||||++|.+.|++.... .+..+.    +..-  |.... +..+...+.| . .--+.+..|.+||....++
T Consensus        53 ~P~i~~~~nfLs~~Ecd~Li~la~~-~l~~S~----v~~~~~g~~~~-s~~RTS~~~~l~~~~dpvv~~I~~Ria~~t~l  126 (310)
T PLN00052         53 QPRIFVYKGFLSDAECDHLVKLAKK-KIQRSM----VADNKSGKSVM-SEVRTSSGMFLDKRQDPVVSRIEERIAAWTFL  126 (310)
T ss_pred             CCCEEEECCcCCHHHHHHHHHhccc-ccccce----eecCCCCcccc-CCCEEecceeecCCCCHHHHHHHHHHHHHhCC
Confidence            4789999999999999999987643 222111    0000  10000 0001111111 1 1123677888888886554


Q ss_pred             CCCCCcceeee-EecCCCCCCccCCCCC
Q psy4937         193 SHFQTIEQCTL-EYNSENGSSIDNHIDD  219 (356)
Q Consensus       193 ~~f~~~~q~~l-eY~pg~G~~I~pHvDd  219 (356)
                      .- ...+.+.+ .|.+|+  ...||.|-
T Consensus       127 p~-~~~E~lQVlrY~~Gq--~Y~~H~D~  151 (310)
T PLN00052        127 PE-ENAENIQILRYEHGQ--KYEPHFDY  151 (310)
T ss_pred             Cc-ccCcceEEEecCCCC--CCCCCCCc
Confidence            31 12345555 999965  49999994


No 13 
>COG3128 PiuC Uncharacterized iron-regulated protein [Function unknown]
Probab=94.96  E-value=0.25  Score=45.78  Aligned_cols=169  Identities=17%  Similarity=0.142  Sum_probs=88.7

Q ss_pred             CeEEeCCCCCHHHHHHHHHHhhcCCCCccccCcceeccC-CccccCCCcccCCCCCCCcHHHH-HHHHHHhhcccCCCCC
Q psy4937         119 GVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFG-PKCNFKKQKIKLGDFQGFPACTK-FIQDRFHQEDVLSHFQ  196 (356)
Q Consensus       119 Gl~~i~dFIS~~Ee~~Ll~~id~~pW~~sqsgRR~q~yG-~~~nyk~~kl~~g~f~~lP~~l~-~l~~Ri~~~~~l~~f~  196 (356)
                      -+.-||..||+++.+.+-+.+...+|..   ||-.|-+- +++.   +..+...-.++-..+. .|++++.+.+.+-.-.
T Consensus         3 m~lhIp~VLs~a~va~iRa~l~~A~w~d---Grat~g~q~a~vk---~n~qlp~~s~l~~~vg~~il~al~~~plff~aA   76 (229)
T COG3128           3 MMLHIPEVLSEAQVARIRAALEQAEWVD---GRATQGPQGAQVK---NNLQLPQDSALARELGNEILQALTAHPLFFAAA   76 (229)
T ss_pred             eEEechhhCCHHHHHHHHHHHhhccccc---cccccCcchhhhh---ccccCCcccHHHHHHHHHHHHHHHhchhHHHhh
Confidence            3567899999999999999999999984   78776653 3332   2222211122222222 3556665544321100


Q ss_pred             Ccce----eeeEecCCCCCCccCCCCCcccc-cCceEEEecCCceEEEEeecCCCCCCcCCCCCccccCCCCCCcccccc
Q psy4937         197 TIEQ----CTLEYNSENGSSIDNHIDDCWIW-GERIVTVNLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIV  271 (356)
Q Consensus       197 ~~~q----~~leY~pg~G~~I~pHvDd~~~~-Ge~IaslSLgs~~vm~f~~~~~~~~~~~l~~~~~~~~~~e~d~k~~~~  271 (356)
                      -+.-    .-+.|.-|  +..++|+|+--.. .+ .-+..|-  +++..+.--.++        .+|+           .
T Consensus        77 Lp~t~~~P~Fn~Y~eg--~~f~fHvDgavr~~hp-~~~~~lr--tdls~tlfl~DP--------edYd-----------G  132 (229)
T COG3128          77 LPRTCLPPLFNRYQEG--DFFGFHVDGAVRSIHP-GSGFRLR--TDLSCTLFLSDP--------EDYD-----------G  132 (229)
T ss_pred             cccccCCchhhhccCC--CcccccccCcccccCC-CCCceeE--eeeeeeeecCCc--------cccC-----------C
Confidence            0111    22478884  5599999973211 00 0000111  111111110111        1111           1


Q ss_pred             cccccccccccccccccCCceEEEEecCCCeEEEecchhhcccccccccCCCCCCeEEEEec
Q psy4937         272 NDLSLNEKSTDSIVNETLSSVVIRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYR  333 (356)
Q Consensus       272 ~~~~~~~~~~~~~~~~~~~~~~v~I~Lp~rSLLVMsG~ARy~W~H~I~~~dvr~rRVSLTfR  333 (356)
                      |  ++       .++. . --..+|.||.|+|++.-+.+    .|.|.+.. |+.|+.+-|-
T Consensus       133 G--eL-------Vv~d-t-Yg~h~VklPAGdLVlypStS----lH~VtPVT-Rg~R~asffW  178 (229)
T COG3128         133 G--EL-------VVND-T-YGNHRVKLPAGDLVLYPSTS----LHEVTPVT-RGERFASFFW  178 (229)
T ss_pred             c--eE-------EEec-c-ccceEEeccCCCEEEccccc----ceeccccc-cCceEEEeee
Confidence            1  00       0111 1 22578999999999988876    58888842 8999888773


No 14 
>PF13640 2OG-FeII_Oxy_3:  2OG-Fe(II) oxygenase superfamily; PDB: 3DKQ_B 3GZE_D 3HQR_A 2Y34_A 2G1M_A 2G19_A 3OUI_A 3OUJ_A 2HBU_A 2Y33_A ....
Probab=94.35  E-value=0.03  Score=44.98  Aligned_cols=32  Identities=19%  Similarity=0.179  Sum_probs=25.9

Q ss_pred             cCCCeEEEecchhhcccccccccCCCCCCeEEEEe
Q psy4937         298 MPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAY  332 (356)
Q Consensus       298 Lp~rSLLVMsG~ARy~W~H~I~~~dvr~rRVSLTf  332 (356)
                      ..+|+++++.+   ....|++.+....++|++|++
T Consensus        66 p~~g~~v~F~~---~~~~H~v~~v~~~~~R~~l~~   97 (100)
T PF13640_consen   66 PKPGRLVIFPS---DNSLHGVTPVGEGGRRYSLTF   97 (100)
T ss_dssp             -BTTEEEEEES---CTCEEEEEEE-EESEEEEEEE
T ss_pred             CCCCEEEEEeC---CCCeecCcccCCCCCEEEEEE
Confidence            78889999999   566799999755999999986


No 15 
>KOG1591|consensus
Probab=90.85  E-value=1.2  Score=43.75  Aligned_cols=93  Identities=18%  Similarity=0.217  Sum_probs=54.4

Q ss_pred             CCCeEEeCCCCCHHHHHHHHHHhhc--CCCCccccCccee-ccC-CccccCCCcccCCCC--CCCcHHHHHHHHHHhhcc
Q psy4937         117 IGGVYIKEDFLSEAETEDVMKQLDD--LPWDKSQSGRRKQ-NFG-PKCNFKKQKIKLGDF--QGFPACTKFIQDRFHQED  190 (356)
Q Consensus       117 ipGl~~i~dFIS~~Ee~~Ll~~id~--~pW~~sqsgRR~q-~yG-~~~nyk~~kl~~g~f--~~lP~~l~~l~~Ri~~~~  190 (356)
                      -|.+.+++|||+++|.+.|+..-..  .++.       +. +.| .+.+-..-+...+.|  .+=.+.+..|-+||....
T Consensus        96 ~P~~~~yhd~ls~~e~d~l~~lak~~l~~st-------v~~~~~~~~~~~~~~R~S~~t~l~~~~~~~~~~i~~ri~~~T  168 (289)
T KOG1591|consen   96 DPRVVLYHDFLSDEECDHLISLAKPKLERST-------VVADKGTGHSTTSAVRTSSGTFLPDGASPVVSRIEQRIADLT  168 (289)
T ss_pred             CCceEeehhcCCHHHHHHHHHhhhhhhhcee-------eeccCCcccccceeeEecceeEecCCCCHHHHHHHHHHHhcc
Confidence            4679999999999999999876543  1222       21 222 011100001111212  123456777888888877


Q ss_pred             cCCCCCCcceeee-EecCCCCCCccCCCCC
Q psy4937         191 VLSHFQTIEQCTL-EYNSENGSSIDNHIDD  219 (356)
Q Consensus       191 ~l~~f~~~~q~~l-eY~pg~G~~I~pHvDd  219 (356)
                      ++. +..-+.+.+ .|.+|+  .-.||.|-
T Consensus       169 ~l~-~e~~E~lqVlnYg~Gg--~Y~~H~D~  195 (289)
T KOG1591|consen  169 GLP-VENGESLQVLNYGLGG--HYEPHYDY  195 (289)
T ss_pred             CCC-cccCccceEEEecCCc--cccccccc
Confidence            653 233456665 899954  69999884


No 16 
>TIGR01762 chlorin-enz chlorinating enzymes. This model represents a a group of highly homologous enzymes related to dioxygenases which chlorinate amino acid methyl groups. BarB1 and BarB2 are proposed to trichlorinate one of the methyl groups of a leucine residue in the biosynthesis of barbamide in the cyanobacterium Lyngbya majuscula. SyrB2 is proposed to chlorinate the methyl group of threonine in the biosynthesis of syringomycin in Pseudomonas syringae. CmaB is proposed to chlorinate the beta-methyl group of alloisoleucine in the process of ring closure in the biosynthesis of coronamic acid, a component of coronatine also in Pseudomonas syringae.
Probab=88.24  E-value=13  Score=36.27  Aligned_cols=43  Identities=16%  Similarity=0.264  Sum_probs=34.5

Q ss_pred             eEEEEecCCCeEEEecchhhcccccccccCCC-CCCeEEEEecccCCC
Q psy4937         292 VVIRIPMPARSLLVLWGEARYEWQHCVLRSDV-IGRRVCIAYREFTAP  338 (356)
Q Consensus       292 ~~v~I~Lp~rSLLVMsG~ARy~W~H~I~~~dv-r~rRVSLTfR~l~~~  338 (356)
                      ..+.+.|++|+++++.+-+=    |+-.+..- ..+|+++++|.+.+.
T Consensus       207 ~~v~~~lkaGd~~~f~~~t~----HgS~~N~S~~~~R~~~~~ry~~~~  250 (288)
T TIGR01762       207 SAVPMQMKAGQFIIFWSTLM----HASYPNSGESQMRMGFASRYVPSF  250 (288)
T ss_pred             ceeeeeeCCceEEEECCCce----ecCCCCCCCCceEEEEEEEEcCCC
Confidence            46899999999999998763    77766643 467999999998663


No 17 
>PF05721 PhyH:  Phytanoyl-CoA dioxygenase (PhyH);  InterPro: IPR008775 This family is made up of several eukaryotic phytanoyl-CoA dioxygenase (PhyH) proteins as well as a number of bacterial deoxygenases. PhyH is a peroxisomal enzyme catalysing the first step of phytanic acid alpha-oxidation. PhyH deficiency causes Refsum's disease (RD) which is an inherited neurological syndrome biochemically characterised by the accumulation of phytanic acid in plasma and tissues [].; PDB: 3GJA_A 3EMR_A 3OBZ_A 2OPW_A 3NNL_B 3NNF_A 3NNM_B 3NNJ_A 2FCV_B 2FCU_A ....
Probab=85.01  E-value=1.7  Score=37.95  Aligned_cols=25  Identities=32%  Similarity=0.484  Sum_probs=21.3

Q ss_pred             CCeEEeCCCCCHHHHHHHHHHhhcC
Q psy4937         118 GGVYIKEDFLSEAETEDVMKQLDDL  142 (356)
Q Consensus       118 pGl~~i~dFIS~~Ee~~Ll~~id~~  142 (356)
                      .|..+++++|+++|-+.|.+.+++.
T Consensus         4 ~Gyvvi~~~l~~~~~~~l~~~~~~~   28 (211)
T PF05721_consen    4 DGYVVIRNVLSPEEVERLREELDRL   28 (211)
T ss_dssp             HSEEEETTSS-HHHHHHHHHHHHHH
T ss_pred             CcEEEECCcCCHHHHHHHHHHHHHH
Confidence            4899999999999999999998763


No 18 
>PF13719 zinc_ribbon_5:  zinc-ribbon domain
Probab=84.61  E-value=0.62  Score=31.75  Aligned_cols=35  Identities=31%  Similarity=0.812  Sum_probs=23.9

Q ss_pred             ccccCCCCcccccCccccccCCCCCCCCccccccccccccccccccccceeeccccccccc
Q psy4937          37 YIYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDFLTSQTKETYVYCPHCNKAWQ   97 (356)
Q Consensus        37 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~c~~c~~~~~   97 (356)
                      .|.||+|+.                   .|.|-     +++.  ......+-|+.|+..|.
T Consensus         2 ~i~CP~C~~-------------------~f~v~-----~~~l--~~~~~~vrC~~C~~~f~   36 (37)
T PF13719_consen    2 IITCPNCQT-------------------RFRVP-----DDKL--PAGGRKVRCPKCGHVFR   36 (37)
T ss_pred             EEECCCCCc-------------------eEEcC-----HHHc--ccCCcEEECCCCCcEee
Confidence            378999998                   44444     2232  24567899999998763


No 19 
>PF10083 DUF2321:  Uncharacterized protein conserved in bacteria (DUF2321);  InterPro: IPR016891 This entry is represented by Bacteriophage 'Lactobacillus prophage Lj928', Orf-Ljo1454. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=80.88  E-value=0.49  Score=42.43  Aligned_cols=50  Identities=28%  Similarity=0.581  Sum_probs=33.2

Q ss_pred             cccccccCCCCcccccCccccccCCCCCCCCccccccccccccccccccccceeeccccccccccCccccc
Q psy4937          34 EETYIYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAY  104 (356)
Q Consensus        34 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~c~~c~~~~~g~~~~~~  104 (356)
                      ++|-..||+|+-+-.|..      |...   -|.++           ...+.-+||..|++.|| |-..++
T Consensus        36 ~~tI~~Cp~C~~~IrG~y------~v~g---v~~~g-----------~~~~~PsYC~~CGkpyP-Wt~~~L   85 (158)
T PF10083_consen   36 AKTITSCPNCSTPIRGDY------HVEG---VFGLG-----------GHYEAPSYCHNCGKPYP-WTENAL   85 (158)
T ss_pred             HHHHHHCcCCCCCCCCce------ecCC---eeeeC-----------CCCCCChhHHhCCCCCc-hHHHHH
Confidence            467788999998766652      2211   11111           24557799999999999 976654


No 20 
>PF13248 zf-ribbon_3:  zinc-ribbon domain
Probab=79.93  E-value=0.86  Score=28.62  Aligned_cols=12  Identities=25%  Similarity=0.916  Sum_probs=9.1

Q ss_pred             ccceeecccccc
Q psy4937          83 KETYVYCPHCNK   94 (356)
Q Consensus        83 ~~~~~~c~~c~~   94 (356)
                      .....|||.|+.
T Consensus        13 ~~~~~fC~~CG~   24 (26)
T PF13248_consen   13 DPDAKFCPNCGA   24 (26)
T ss_pred             CcccccChhhCC
Confidence            456789999984


No 21 
>TIGR02098 MJ0042_CXXC MJ0042 family finger-like domain. This domain contains a CXXCX(19)CXXC motif suggestive of both zinc fingers and thioredoxin, usually found at the N-terminus of prokaryotic proteins. One partially characterized gene, agmX, is among a large set in Myxococcus whose interruption affects adventurous gliding motility.
Probab=76.58  E-value=1.7  Score=29.27  Aligned_cols=13  Identities=31%  Similarity=0.871  Sum_probs=10.3

Q ss_pred             ceeeccccccccc
Q psy4937          85 TYVYCPHCNKAWQ   97 (356)
Q Consensus        85 ~~~~c~~c~~~~~   97 (356)
                      ..+.||.|+..|.
T Consensus        24 ~~v~C~~C~~~~~   36 (38)
T TIGR02098        24 GKVRCGKCGHVWY   36 (38)
T ss_pred             CEEECCCCCCEEE
Confidence            4789999997753


No 22 
>PF13717 zinc_ribbon_4:  zinc-ribbon domain
Probab=75.35  E-value=2  Score=29.13  Aligned_cols=15  Identities=27%  Similarity=0.678  Sum_probs=11.6

Q ss_pred             cccceeecccccccc
Q psy4937          82 TKETYVYCPHCNKAW   96 (356)
Q Consensus        82 ~~~~~~~c~~c~~~~   96 (356)
                      ..-.-+-|+.|+..|
T Consensus        21 ~~g~~v~C~~C~~~f   35 (36)
T PF13717_consen   21 PKGRKVRCSKCGHVF   35 (36)
T ss_pred             CCCcEEECCCCCCEe
Confidence            444678999999876


No 23 
>PF12933 FTO_NTD:  FTO catalytic domain;  InterPro: IPR024367 Alpha-ketoglutarate-dependent dioxygenase FTO, also known as Fat mass and obesity-associated protein, is a nucleus protein which belongs to the FTO family. This enzyme is a dioxygenase that repairs alkylated DNA and RNA by oxidative demethylation []. FTO activity is highest towards single-stranded RNA containing 3-methyluracil, followed by single-stranded DNA containing 3-methylthymine. FTO has low demethylase activity towards single-stranded DNA containing 1-methyladenine or 3-methylcytosine []. FTO has no activity towards 1-methylguanine. It has no detectable activity towards double-stranded DNA. FTO requires molecular oxygen, alpha-ketoglutarate and iron. FTO contributes to the regulation of the global metabolic rate, energy expenditure and energy homeostasis. It contributes to the regulation of body size and body fat accumulation as well []. This domain is the catalytic AlkB-like domain from the FTO protein []. This domain catalyses a demethylase activity with a preference for 3-methylthymidine.; PDB: 3LFM_A.
Probab=75.15  E-value=9.3  Score=36.85  Aligned_cols=42  Identities=19%  Similarity=0.309  Sum_probs=31.3

Q ss_pred             ceEEEEecCCCeEEEecchhhcccccccccCCCCCCeEEEEecc
Q psy4937         291 SVVIRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYRE  334 (356)
Q Consensus       291 ~~~v~I~Lp~rSLLVMsG~ARy~W~H~I~~~dvr~rRVSLTfR~  334 (356)
                      .+...|+|+.|+.|.|-++--..-+|+|..-  ..+|+|-|-|-
T Consensus       209 tP~L~vPL~sgd~Y~Mldd~N~tHqH~VlaG--~~~RfSSTHRV  250 (253)
T PF12933_consen  209 TPGLAVPLRSGDCYYMLDDFNATHQHCVLAG--SSARFSSTHRV  250 (253)
T ss_dssp             S-EEEEEE-TT-EEEE-TTHHHHEEEEEE----SS-EEEEEEE-
T ss_pred             CCeeEEeccCCCeEEEccccchhhHHHHhcC--CCcccccccee
Confidence            5679999999999999999999999999996  45599999873


No 24 
>PF09859 Oxygenase-NA:  Oxygenase, catalysing oxidative methylation of damaged DNA;  InterPro: IPR018655  This family of various hypothetical prokaryotic proteins, has no known function. 
Probab=70.75  E-value=25  Score=32.12  Aligned_cols=114  Identities=22%  Similarity=0.252  Sum_probs=65.8

Q ss_pred             CCCcHHHHHHHHHHhhcccCCCCCCcceeeeEecCCCCCCccCCCCCcccccCceE---EEecCCceEEEEeecCCCCCC
Q psy4937         173 QGFPACTKFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIV---TVNLIGDSVLTLTYYNGDSNK  249 (356)
Q Consensus       173 ~~lP~~l~~l~~Ri~~~~~l~~f~~~~q~~leY~pg~G~~I~pHvDd~~~~Ge~Ia---slSLgs~~vm~f~~~~~~~~~  249 (356)
                      ..+|+.+..++++..+.+-    ..+-.+++.|.+  |+...-|.|.   +|+.+.   .+=|+|.---+|+-.+     
T Consensus        41 ~~yP~~~~~fl~~ch~aGQ----~rptplllrY~~--gdyn~LHqdl---yGe~vFPlQvv~lLs~Pg~DftGGE-----  106 (173)
T PF09859_consen   41 ARYPATLAEFLARCHAAGQ----TRPTPLLLRYGP--GDYNCLHQDL---YGEHVFPLQVVILLSEPGEDFTGGE-----  106 (173)
T ss_pred             CCCCccHHHHHHHHHhccC----CCCchhhheeCC--CCccccccCC---CCCcccCeEEEEEcCCCCCcccCce-----
Confidence            4577888888888777643    234457779999  6789999994   675422   1122222111111100     


Q ss_pred             cCCCCCccccCCCCCCcccccccccccccccccccccccCCceEEEEecCCCeEEEec----------chhhcccccccc
Q psy4937         250 YNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPARSLLVLW----------GEARYEWQHCVL  319 (356)
Q Consensus       250 ~~l~~~~~~~~~~e~d~k~~~~~~~~~~~~~~~~~~~~~~~~~~v~I~Lp~rSLLVMs----------G~ARy~W~H~I~  319 (356)
                      |                            +..+++... . ....-++|..|..+|+.          |--|-.-.|||.
T Consensus       107 F----------------------------VltEQrPR~-Q-SR~~V~~L~qGda~if~t~~RPv~G~rG~yRv~~RHgVS  156 (173)
T PF09859_consen  107 F----------------------------VLTEQRPRM-Q-SRAMVLPLRQGDALIFATNHRPVRGARGYYRVNMRHGVS  156 (173)
T ss_pred             E----------------------------EEEEecCCc-c-CccccCCcCCCCEEEEecCCCCcCCCccceecccccccc
Confidence            0                            000011010 1 34456888899999885          566677889988


Q ss_pred             cCCCCCCeEEEE
Q psy4937         320 RSDVIGRRVCIA  331 (356)
Q Consensus       320 ~~dvr~rRVSLT  331 (356)
                      +-. .++|..|.
T Consensus       157 ~vr-sG~R~tLg  167 (173)
T PF09859_consen  157 RVR-SGERHTLG  167 (173)
T ss_pred             ccc-ccceEEEE
Confidence            842 67777664


No 25 
>PF13240 zinc_ribbon_2:  zinc-ribbon domain
Probab=70.72  E-value=2.1  Score=26.27  Aligned_cols=10  Identities=30%  Similarity=1.115  Sum_probs=7.0

Q ss_pred             ceeecccccc
Q psy4937          85 TYVYCPHCNK   94 (356)
Q Consensus        85 ~~~~c~~c~~   94 (356)
                      .-.||++|+.
T Consensus        12 ~~~fC~~CG~   21 (23)
T PF13240_consen   12 DAKFCPNCGT   21 (23)
T ss_pred             cCcchhhhCC
Confidence            4557888874


No 26 
>PF12851 Tet_JBP:  Oxygenase domain of the 2OGFeDO superfamily ;  InterPro: IPR024779 TETs are 2OG- and Fe(II)-dependent oxygenases that catalyse the conversion of 5 methyl-Cytosine (5-MC) to 5-hydroxymethyl-cytosine (hmC) in cultured cells and in vitro []. Interestingly TET2 is considered as an oncogene, as it is found mutated in some types of cancer []. This entry represents the double-stranded beta helix (DSBH) fold of the 2-oxoglutarate (2OG) - Fe(II) oxygenases. DSBH comprises a part of the catalytic domain in TETS. It is found in many organisms including fruit fly, African malaria mosquito, zebrafish, mouse and human.
Probab=68.30  E-value=6.7  Score=35.50  Aligned_cols=38  Identities=13%  Similarity=0.307  Sum_probs=31.6

Q ss_pred             eEEEEecCCCeEEEecchhhcccccccccCCC----CCCeEEEEe
Q psy4937         292 VVIRIPMPARSLLVLWGEARYEWQHCVLRSDV----IGRRVCIAY  332 (356)
Q Consensus       292 ~~v~I~Lp~rSLLVMsG~ARy~W~H~I~~~dv----r~rRVSLTf  332 (356)
                      .-+++.+.+||+|++-|.   ...|++.+-+-    .+.|+||.|
T Consensus       126 ~g~~~~~~~GtVl~~~~~---~~~Hgvtpv~~~~~~~~~R~slvf  167 (171)
T PF12851_consen  126 LGVAFAYQPGTVLIFCAK---RELHGVTPVESPNRNHGTRISLVF  167 (171)
T ss_pred             CCEEEecCCCcEEEEccc---ceeeecCcccCCCCCCCeEEEEEE
Confidence            458899999999999986   57899998431    399999987


No 27 
>PF14205 Cys_rich_KTR:  Cysteine-rich KTR
Probab=67.11  E-value=3.3  Score=30.97  Aligned_cols=10  Identities=40%  Similarity=1.252  Sum_probs=8.2

Q ss_pred             cccccCCCCc
Q psy4937          36 TYIYCPHCNK   45 (356)
Q Consensus        36 ~~~~~~~~~~   45 (356)
                      ..+.||.|+.
T Consensus         3 ~Wi~CP~Cgn   12 (55)
T PF14205_consen    3 EWILCPICGN   12 (55)
T ss_pred             eEEECCCCCC
Confidence            4689999985


No 28 
>PF13759 2OG-FeII_Oxy_5:  Putative 2OG-Fe(II) oxygenase; PDB: 3BVC_B 2RG4_A.
Probab=66.55  E-value=8.6  Score=31.15  Aligned_cols=38  Identities=16%  Similarity=0.121  Sum_probs=24.6

Q ss_pred             ceEEEEecCCCeEEEecchhhcccccccccCCCCCCeEEEEe
Q psy4937         291 SVVIRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAY  332 (356)
Q Consensus       291 ~~~v~I~Lp~rSLLVMsG~ARy~W~H~I~~~dvr~rRVSLTf  332 (356)
                      .....+....|.|+|+-+-..    |++.+..-.+.||||.|
T Consensus        63 ~~~~~~~p~~G~lvlFPs~l~----H~v~p~~~~~~Risisf  100 (101)
T PF13759_consen   63 SPYYIVEPEEGDLVLFPSWLW----HGVPPNNSDEERISISF  100 (101)
T ss_dssp             -SEEEE---TTEEEEEETTSE----EEE----SSS-EEEEEE
T ss_pred             CceEEeCCCCCEEEEeCCCCE----EeccCcCCCCCEEEEEc
Confidence            456778889999999997654    99999877899999987


No 29 
>PF10571 UPF0547:  Uncharacterised protein family UPF0547;  InterPro: IPR018886  This domain may well be a type of zinc-finger as it carries two pairs of highly conserved cysteine residues though with no accompanying histidines. Several members are annotated as putative helicases. 
Probab=55.45  E-value=7.2  Score=24.71  Aligned_cols=15  Identities=33%  Similarity=1.012  Sum_probs=10.4

Q ss_pred             ccceeeccccccccc
Q psy4937          83 KETYVYCPHCNKAWQ   97 (356)
Q Consensus        83 ~~~~~~c~~c~~~~~   97 (356)
                      ..+-.+||+|+..|.
T Consensus        11 ~~~~~~Cp~CG~~F~   25 (26)
T PF10571_consen   11 PESAKFCPHCGYDFE   25 (26)
T ss_pred             hhhcCcCCCCCCCCc
Confidence            446667888887653


No 30 
>PF12773 DZR:  Double zinc ribbon
Probab=54.91  E-value=8.3  Score=27.26  Aligned_cols=11  Identities=27%  Similarity=0.872  Sum_probs=5.3

Q ss_pred             ceeeccccccc
Q psy4937          85 TYVYCPHCNKA   95 (356)
Q Consensus        85 ~~~~c~~c~~~   95 (356)
                      ...+|+.|+..
T Consensus        28 ~~~~C~~Cg~~   38 (50)
T PF12773_consen   28 SKKICPNCGAE   38 (50)
T ss_pred             CCCCCcCCcCC
Confidence            34455555554


No 31 
>PLN03001 oxidoreductase, 2OG-Fe(II) oxygenase family protein
Probab=54.90  E-value=46  Score=32.06  Aligned_cols=42  Identities=12%  Similarity=0.154  Sum_probs=33.7

Q ss_pred             ceEEEEecCCCeEEEecchhhcccccccccCC-------CCCCeEEEEe
Q psy4937         291 SVVIRIPMPARSLLVLWGEARYEWQHCVLRSD-------VIGRRVCIAY  332 (356)
Q Consensus       291 ~~~v~I~Lp~rSLLVMsG~ARy~W~H~I~~~d-------vr~rRVSLTf  332 (356)
                      +..+.|.-.+++++|.-|+.=-.|..+.-+.-       -...|+||.|
T Consensus       161 g~Wi~V~p~p~a~vVNiGD~l~~~tng~~~S~~HRVv~~~~~~R~Sia~  209 (262)
T PLN03001        161 AEWLMVPPISDAILIIIADQTEIITNGNYKSAQHRAIANANKARLSVAT  209 (262)
T ss_pred             CeEEECCCCCCcEEEEccHHHHHHhCCccccccceEEcCCCCCEEEEEE
Confidence            56788888899999999999999996655431       1456999998


No 32 
>COG3809 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=54.02  E-value=6.8  Score=31.53  Aligned_cols=13  Identities=38%  Similarity=1.209  Sum_probs=10.2

Q ss_pred             ceeeccccccccc
Q psy4937          85 TYVYCPHCNKAWQ   97 (356)
Q Consensus        85 ~~~~c~~c~~~~~   97 (356)
                      .+-|||.|.-.|-
T Consensus        20 EiD~CPrCrGVWL   32 (88)
T COG3809          20 EIDYCPRCRGVWL   32 (88)
T ss_pred             eeeeCCccccEee
Confidence            5779999996654


No 33 
>PRK14892 putative transcription elongation factor Elf1; Provisional
Probab=51.29  E-value=4  Score=34.06  Aligned_cols=39  Identities=10%  Similarity=0.109  Sum_probs=23.4

Q ss_pred             cceeeccccccccccCccccccCCCCCCCCCCCCCCeEEeCCCCCHHHHHHH
Q psy4937          84 ETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDFLSEAETEDV  135 (356)
Q Consensus        84 ~~~~~c~~c~~~~~g~~~~~~~~~~~h~~~~~~ipGl~~i~dFIS~~Ee~~L  135 (356)
                      .....|+.|+.-+ |....            ....-|-++.+||++=++-+|
T Consensus        40 ~~h~~C~~CG~y~-~~~V~------------~l~epIDVY~~wiD~~~eg~i   78 (99)
T PRK14892         40 IAIITCGNCGLYT-EFEVP------------SVYDEVDVYNKFIDLYLEGKI   78 (99)
T ss_pred             cceEECCCCCCcc-CEECC------------ccccchhhHHHHHHHHHhcCC
Confidence            5788999999554 34322            122335677777776655443


No 34 
>PLN02984 oxidoreductase, 2OG-Fe(II) oxygenase family protein
Probab=51.10  E-value=60  Score=32.55  Aligned_cols=42  Identities=21%  Similarity=0.201  Sum_probs=33.2

Q ss_pred             ceEEEEecCCCeEEEecchhhcccc--------cccccCCCCCCeEEEEe
Q psy4937         291 SVVIRIPMPARSLLVLWGEARYEWQ--------HCVLRSDVIGRRVCIAY  332 (356)
Q Consensus       291 ~~~v~I~Lp~rSLLVMsG~ARy~W~--------H~I~~~dvr~rRVSLTf  332 (356)
                      +..+.|+-.+++++|.-|+.=-.|+        |.|........|+||.|
T Consensus       245 g~Wv~V~p~pgalVVNiGD~Le~wTNg~~kSt~HRVv~~~~~~~R~Sia~  294 (341)
T PLN02984        245 GEWFNVKPIANTLVVNLGDMMQVISDDEYKSVLHRVGKRNKKKERYSICY  294 (341)
T ss_pred             CceEECCCCCCeEEEECChhhhhhcCCeeeCCCCccccCCCCCCeEEEEE
Confidence            6688999999999999999999998        55521112456999998


No 35 
>COG4306 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=48.74  E-value=8.5  Score=33.70  Aligned_cols=77  Identities=18%  Similarity=0.267  Sum_probs=44.9

Q ss_pred             cccccccCCCCcccccCccccccCCCCCCCCccccccccccccccccccccceeeccccccccccCccccccCCCCCCCC
Q psy4937          34 EETYIYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGK  113 (356)
Q Consensus        34 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~c~~c~~~~~g~~~~~~~~~~~h~~~  113 (356)
                      +.|-..||.||-+-.+++-++--..-.                    -..+.-+||..|++-|| |...++      .|.
T Consensus        36 eati~qcp~csasirgd~~vegvlglg--------------------~dye~psfchncgs~fp-wterki------aga   88 (160)
T COG4306          36 EATITQCPICSASIRGDYYVEGVLGLG--------------------GDYEPPSFCHNCGSRFP-WTERKI------AGA   88 (160)
T ss_pred             hHHHhcCCccCCcccccceeeeeeccC--------------------CCCCCcchhhcCCCCCC-cHHHHH------hHH
Confidence            667788999988666654443211111                    13557799999999999 865543      111


Q ss_pred             CCCCCCeEEeCCCCCHHHHHHHHHHhhc
Q psy4937         114 SFDIGGVYIKEDFLSEAETEDVMKQLDD  141 (356)
Q Consensus       114 ~~~ipGl~~i~dFIS~~Ee~~Ll~~id~  141 (356)
                      ..    |.=-..=|+++|-+++-..+-+
T Consensus        89 ~e----lvea~~~l~pdevqqf~tdlt~  112 (160)
T COG4306          89 VE----LVEAGENLNPDEVQQFRTDLTD  112 (160)
T ss_pred             HH----HHHccccCCHHHHHHHHhhHHH
Confidence            10    0001123578888777666543


No 36 
>PF01927 Mut7-C:  Mut7-C RNAse domain;  InterPro: IPR002782 This prokaryotic family of proteins have no known function. The proteins contain four conserved cysteines that may be involved in metal binding or disulphide bridges.
Probab=45.06  E-value=15  Score=32.24  Aligned_cols=15  Identities=33%  Similarity=0.884  Sum_probs=12.7

Q ss_pred             cccceeecccccccc
Q psy4937          82 TKETYVYCPHCNKAW   96 (356)
Q Consensus        82 ~~~~~~~c~~c~~~~   96 (356)
                      ....|..|+.|+++|
T Consensus       120 ~~~~f~~C~~C~kiy  134 (147)
T PF01927_consen  120 TYDEFWRCPGCGKIY  134 (147)
T ss_pred             cCCeEEECCCCCCEe
Confidence            455699999999997


No 37 
>PLN02904 oxidoreductase
Probab=45.02  E-value=76  Score=31.92  Aligned_cols=42  Identities=17%  Similarity=0.177  Sum_probs=33.4

Q ss_pred             ceEEEEecCCCeEEEecchhhcccccccccC---C----CCCCeEEEEe
Q psy4937         291 SVVIRIPMPARSLLVLWGEARYEWQHCVLRS---D----VIGRRVCIAY  332 (356)
Q Consensus       291 ~~~v~I~Lp~rSLLVMsG~ARy~W~H~I~~~---d----vr~rRVSLTf  332 (356)
                      +..+.|+-.+++|+|.-|+.=-.|.-+.-+.   .    ....|+||+|
T Consensus       253 g~Wi~V~p~pgalVVNiGD~Le~~TNG~~kSt~HRVv~~~~~~R~Si~~  301 (357)
T PLN02904        253 KNWVCVPYIEGALIVQLGDQVEVMSNGIYKSVVHRVTVNKDYKRLSFAS  301 (357)
T ss_pred             CCEEECCCCCCeEEEEccHHHHHHhCCeeeccCCcccCCCCCCEEEEEE
Confidence            5688899999999999999988887665543   1    1456999998


No 38 
>PRK14559 putative protein serine/threonine phosphatase; Provisional
Probab=43.82  E-value=11  Score=41.19  Aligned_cols=55  Identities=15%  Similarity=0.175  Sum_probs=29.3

Q ss_pred             ccceeeccccccccc-cCccccccCCCCCC-----------CCCCCCCCeEEeCCCCCHHHHHHHHH
Q psy4937          83 KETYVYCPHCNKAWQ-GSDLYAYQSHPNHS-----------GKSFDIGGVYIKEDFLSEAETEDVMK  137 (356)
Q Consensus        83 ~~~~~~c~~c~~~~~-g~~~~~~~~~~~h~-----------~~~~~ipGl~~i~dFIS~~Ee~~Ll~  137 (356)
                      ....-||++|+.... -|-.--.+..++|.           ......+....+..||++.+-=+|++
T Consensus        38 ~~~~~fC~~CG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ld~~~ry~l~~  104 (645)
T PRK14559         38 PVDEAHCPNCGAETGTIWWAIIAQASPNSEVLESGEATQQSESSLTPSSSPLYGSYLDPGQRYQLLA  104 (645)
T ss_pred             CcccccccccCCcccchhhhhcccccccccccccccccccccccCCCcccchhhhhcCcccceEecc
Confidence            445669999997643 12222222233332           12344455677777777766555554


No 39 
>PLN02299 1-aminocyclopropane-1-carboxylate oxidase
Probab=43.56  E-value=55  Score=32.40  Aligned_cols=42  Identities=21%  Similarity=0.201  Sum_probs=34.6

Q ss_pred             ceEEEEecCCCeEEEecchhhcccccccccCC-------CCCCeEEEEe
Q psy4937         291 SVVIRIPMPARSLLVLWGEARYEWQHCVLRSD-------VIGRRVCIAY  332 (356)
Q Consensus       291 ~~~v~I~Lp~rSLLVMsG~ARy~W~H~I~~~d-------vr~rRVSLTf  332 (356)
                      +..+.|+-.+++|+|.-|++=-.|+.+..+.-       ..+.|+||+|
T Consensus       204 g~Wi~V~p~pg~lvVNiGD~l~~~Tng~~kS~~HRVv~~~~~~R~Si~~  252 (321)
T PLN02299        204 GEWVDVPPMRHSIVVNLGDQLEVITNGKYKSVMHRVVAQTDGNRMSIAS  252 (321)
T ss_pred             CeEEECCCCCCeEEEEeCHHHHHHhCCceecccceeecCCCCCEEEEEE
Confidence            56788998999999999999999998776641       1456999998


No 40 
>PF08209 Sgf11:  Sgf11 (transcriptional regulation protein);  InterPro: IPR013246 The Sgf11 family is a SAGA complex subunit in Saccharomyces cerevisiae (Baker's yeast). The SAGA complex is a multisubunit protein complex involved in transcriptional regulation. SAGA combines proteins involved in interactions with DNA-bound activators and TATA-binding protein (TBP), as well as enzymes for histone acetylation and deubiquitylation [].; PDB: 3M99_B 2LO2_A 3MHH_C 3MHS_C.
Probab=42.48  E-value=9.5  Score=25.63  Aligned_cols=15  Identities=33%  Similarity=0.964  Sum_probs=10.3

Q ss_pred             cccccCCCCcccccC
Q psy4937          36 TYIYCPHCNKAWQGS   50 (356)
Q Consensus        36 ~~~~~~~~~~~~~~~   50 (356)
                      .|+.||+|++...++
T Consensus         3 ~~~~C~nC~R~v~a~   17 (33)
T PF08209_consen    3 PYVECPNCGRPVAAS   17 (33)
T ss_dssp             -EEE-TTTSSEEEGG
T ss_pred             CeEECCCCcCCcchh
Confidence            478999999975554


No 41 
>PF01927 Mut7-C:  Mut7-C RNAse domain;  InterPro: IPR002782 This prokaryotic family of proteins have no known function. The proteins contain four conserved cysteines that may be involved in metal binding or disulphide bridges.
Probab=42.20  E-value=13  Score=32.60  Aligned_cols=39  Identities=33%  Similarity=0.706  Sum_probs=25.6

Q ss_pred             eeccccccccCcccccccc--------cccccccccCCCCcc-cccCc
Q psy4937          13 RTCIKCEQEFGYENKNIVE--------FTEETYIYCPHCNKA-WQGSD   51 (356)
Q Consensus        13 r~c~~ce~~~~~~~~~~~~--------~~~~~~~~~~~~~~~-~~~~~   51 (356)
                      ..|..|+.......++.++        .+...+-.||.|+|- |.||-
T Consensus        92 sRC~~CN~~L~~v~~~~v~~~vp~~v~~~~~~f~~C~~C~kiyW~GsH  139 (147)
T PF01927_consen   92 SRCPKCNGPLRPVSKEEVKDRVPPYVYETYDEFWRCPGCGKIYWEGSH  139 (147)
T ss_pred             CccCCCCcEeeechhhccccccCccccccCCeEEECCCCCCEeccccc
Confidence            5699999866544332221        123347789999998 98874


No 42 
>PLN02485 oxidoreductase
Probab=41.88  E-value=74  Score=31.39  Aligned_cols=42  Identities=21%  Similarity=0.285  Sum_probs=33.1

Q ss_pred             ceEEEEecCCCeEEEecchhhcccccccccCC-------CCCCeEEEEe
Q psy4937         291 SVVIRIPMPARSLLVLWGEARYEWQHCVLRSD-------VIGRRVCIAY  332 (356)
Q Consensus       291 ~~~v~I~Lp~rSLLVMsG~ARy~W~H~I~~~d-------vr~rRVSLTf  332 (356)
                      +..+.|+-.+++++|.-|++=-.|+.+.-+.-       -...|+||.|
T Consensus       235 g~Wi~V~p~pg~~vVNiGD~L~~~TnG~~~St~HRVv~~~~~~R~Si~~  283 (329)
T PLN02485        235 GEWIWAIPIPGTFVCNIGDMLKIWSNGVYQSTLHRVINNSPKYRVCVAF  283 (329)
T ss_pred             CcEEECCCCCCcEEEEhHHHHHHHHCCEeeCCCceecCCCCCCeEEEEE
Confidence            56888999999999999999999985544430       1346999998


No 43 
>PLN00417 oxidoreductase, 2OG-Fe(II) oxygenase family protein
Probab=41.04  E-value=81  Score=31.61  Aligned_cols=42  Identities=14%  Similarity=0.202  Sum_probs=34.2

Q ss_pred             ceEEEEecCCCeEEEecchhhcccccccccCC-------CCCCeEEEEe
Q psy4937         291 SVVIRIPMPARSLLVLWGEARYEWQHCVLRSD-------VIGRRVCIAY  332 (356)
Q Consensus       291 ~~~v~I~Lp~rSLLVMsG~ARy~W~H~I~~~d-------vr~rRVSLTf  332 (356)
                      +..+.|+-.+++|+|.-|+.=-.|+.+.-+.-       ....|+||.|
T Consensus       249 g~Wi~V~p~pg~lVVNiGD~Le~~Tng~~kSt~HRVv~~~~~~R~Si~f  297 (348)
T PLN00417        249 GKWYKAPIVPDTILINVGDQMEIMSNGIYKSPVHRVVTNREKERISVAT  297 (348)
T ss_pred             CeEEECCCCCCcEEEEcChHHHHHhCCeecccceEEecCCCCCEEEEEE
Confidence            56788999999999999999999987666541       1456999998


No 44 
>COG4888 Uncharacterized Zn ribbon-containing protein [General function prediction only]
Probab=41.03  E-value=7.2  Score=32.73  Aligned_cols=62  Identities=11%  Similarity=0.178  Sum_probs=35.7

Q ss_pred             cccccccCCCCcccccCccccccCCCCCCCCccccccccccccccccccccceeeccccccccccCccccccCCCCCCCC
Q psy4937          34 EETYIYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGK  113 (356)
Q Consensus        34 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~c~~c~~~~~g~~~~~~~~~~~h~~~  113 (356)
                      -.++.+||+||.-.             .|+|+-..            +..-.+.-|-.|+.-+. .+            .
T Consensus        19 L~k~FtCp~Cghe~-------------vs~ctvkk------------~~~~g~~~Cg~CGls~e-~e------------v   60 (104)
T COG4888          19 LPKTFTCPRCGHEK-------------VSSCTVKK------------TVNIGTAVCGNCGLSFE-CE------------V   60 (104)
T ss_pred             CCceEecCccCCee-------------eeEEEEEe------------cCceeEEEcccCcceEE-Ee------------c
Confidence            46889999999832             33333111            13456778888886654 21            1


Q ss_pred             CCCCCCeEEeCCCCCHHHHH
Q psy4937         114 SFDIGGVYIKEDFLSEAETE  133 (356)
Q Consensus       114 ~~~ipGl~~i~dFIS~~Ee~  133 (356)
                      +....++-++.+|++.-.|-
T Consensus        61 ~~l~~~vDvYs~wvDay~eg   80 (104)
T COG4888          61 PELSEPVDVYSAWVDAYLEG   80 (104)
T ss_pred             cccccchhHHHHHHHHHHhc
Confidence            22344566666776654443


No 45 
>PF08792 A2L_zn_ribbon:  A2L zinc ribbon domain;  InterPro: IPR014900 This zinc ribbon protein is found associated with some viral A2L transcription factors []. 
Probab=37.45  E-value=22  Score=23.72  Aligned_cols=16  Identities=19%  Similarity=0.711  Sum_probs=12.8

Q ss_pred             cccceeeccccccccc
Q psy4937          82 TKETYVYCPHCNKAWQ   97 (356)
Q Consensus        82 ~~~~~~~c~~c~~~~~   97 (356)
                      +...+.+|+.|+..++
T Consensus        17 ~~~~~~~C~~Cg~~~~   32 (33)
T PF08792_consen   17 KEDDYEVCIFCGSSFP   32 (33)
T ss_pred             ecCCeEEcccCCcEee
Confidence            5668899999997764


No 46 
>PLN02912 oxidoreductase, 2OG-Fe(II) oxygenase family protein
Probab=37.27  E-value=1.3e+02  Score=30.03  Aligned_cols=42  Identities=14%  Similarity=0.119  Sum_probs=34.1

Q ss_pred             ceEEEEecCCCeEEEecchhhcccccccccC-------CCCCCeEEEEe
Q psy4937         291 SVVIRIPMPARSLLVLWGEARYEWQHCVLRS-------DVIGRRVCIAY  332 (356)
Q Consensus       291 ~~~v~I~Lp~rSLLVMsG~ARy~W~H~I~~~-------dvr~rRVSLTf  332 (356)
                      +..+.|+-.+++|+|--|++=-.|..+.-+.       .....|+||+|
T Consensus       242 g~Wi~V~p~pgalvVNiGD~L~~~TNG~~kSt~HRVv~~~~~~R~Sia~  290 (348)
T PLN02912        242 GKWIAVNPIPNTFIVNLGDQMQVISNDKYKSVLHRAVVNTDKERISIPT  290 (348)
T ss_pred             CcEEECCCcCCeEEEEcCHHHHHHhCCEEEcccccccCCCCCCEEEEEE
Confidence            5688899999999999999988888766653       12566999998


No 47 
>PF14353 CpXC:  CpXC protein
Probab=36.76  E-value=14  Score=31.27  Aligned_cols=32  Identities=28%  Similarity=0.635  Sum_probs=21.9

Q ss_pred             eccccccccCcccccccccc------------cccccccCCCCc
Q psy4937          14 TCIKCEQEFGYENKNIVEFT------------EETYIYCPHCNK   45 (356)
Q Consensus        14 ~c~~ce~~~~~~~~~~~~~~------------~~~~~~~~~~~~   45 (356)
                      ||..|...+..+-...+...            ..--++||+|+.
T Consensus         3 tCP~C~~~~~~~v~~~I~~~~~p~l~e~il~g~l~~~~CP~Cg~   46 (128)
T PF14353_consen    3 TCPHCGHEFEFEVWTSINADEDPELKEKILDGSLFSFTCPSCGH   46 (128)
T ss_pred             CCCCCCCeeEEEEEeEEcCcCCHHHHHHHHcCCcCEEECCCCCC
Confidence            78889888877754333332            234678999998


No 48 
>PLN02750 oxidoreductase, 2OG-Fe(II) oxygenase family protein
Probab=36.03  E-value=1.6e+02  Score=29.41  Aligned_cols=42  Identities=14%  Similarity=0.247  Sum_probs=34.6

Q ss_pred             ceEEEEecCCCeEEEecchhhcccccccccC-------CCCCCeEEEEe
Q psy4937         291 SVVIRIPMPARSLLVLWGEARYEWQHCVLRS-------DVIGRRVCIAY  332 (356)
Q Consensus       291 ~~~v~I~Lp~rSLLVMsG~ARy~W~H~I~~~-------dvr~rRVSLTf  332 (356)
                      +..+.|+-.+++++|--|++=-.|+-+.-+.       .....|+||.|
T Consensus       240 g~Wi~V~p~pg~~vVNiGD~L~~~Tng~~~St~HRVv~~~~~~R~Si~~  288 (345)
T PLN02750        240 GEWIPVKPIPDAFIINIGNCMQVWTNDLYWSAEHRVVVNSQKERFSIPF  288 (345)
T ss_pred             CeEEEccCCCCeEEEEhHHHHHHHhCCeeecccceeccCCCCCEEEEEE
Confidence            6688999999999999999998998776653       12567999998


No 49 
>COG1326 Uncharacterized archaeal Zn-finger protein [General function prediction only]
Probab=35.43  E-value=22  Score=33.17  Aligned_cols=16  Identities=25%  Similarity=0.613  Sum_probs=13.2

Q ss_pred             cccceeeccccccccc
Q psy4937          82 TKETYVYCPHCNKAWQ   97 (356)
Q Consensus        82 ~~~~~~~c~~c~~~~~   97 (356)
                      +.+..+-|..|+..++
T Consensus        26 g~~~lvrC~eCG~V~~   41 (201)
T COG1326          26 GREPLVRCEECGTVHP   41 (201)
T ss_pred             CCceEEEccCCCcEee
Confidence            4557889999999986


No 50 
>PTZ00273 oxidase reductase; Provisional
Probab=34.65  E-value=1.4e+02  Score=29.27  Aligned_cols=42  Identities=19%  Similarity=0.260  Sum_probs=33.7

Q ss_pred             ceEEEEecCCCeEEEecchhhcccccccccCC------CCCCeEEEEe
Q psy4937         291 SVVIRIPMPARSLLVLWGEARYEWQHCVLRSD------VIGRRVCIAY  332 (356)
Q Consensus       291 ~~~v~I~Lp~rSLLVMsG~ARy~W~H~I~~~d------vr~rRVSLTf  332 (356)
                      +..+.|+-.+++|+|.-|++=-.|.-+.-+.-      ....|+||.|
T Consensus       224 g~Wi~V~p~pg~lvVNvGD~l~~~TnG~~kSt~HRVv~~~~~R~Si~~  271 (320)
T PTZ00273        224 GEWMDVPPLEGSFVVNIGDMMEMWSNGRYRSTPHRVVNTGVERYSMPF  271 (320)
T ss_pred             CCEEeCCCCCCeEEEEHHHHHHHHHCCeeeCCCccccCCCCCeEEEEE
Confidence            56788999999999999999888877665541      1456999998


No 51 
>cd04476 RPA1_DBD_C RPA1_DBD_C: A subfamily of OB folds corresponding to the C-terminal OB fold, the ssDNA-binding domain (DBD)-C, of human RPA1 (also called RPA70). RPA1 is the large subunit of Replication protein A (RPA). RPA is a nuclear ssDNA-binding protein (SSB) which appears to be involved in all aspects of DNA metabolism including replication, recombination, and repair. RPA also mediates specific interactions of various nuclear proteins. In animals, plants, and fungi, RPA is a heterotrimer with subunits of 70KDa (RPA1), 32kDa (RPA2), and 14 KDa (RPA3). In addition to DBD-C, RPA1 contains three other OB folds: DBD-A, DBD-B, and RPA1N. The major DNA binding activity of RPA is associated with RPA1 DBD-A and DBD-B. RPA1 DBD-C is involved in DNA binding and trimerization. It contains two structural insertions not found to date in other OB-folds: a zinc ribbon and a three-helix bundle. RPA1 DBD-C also contains a Cys4-type zinc-binding motif, which plays a role in the ssDNA binding fun
Probab=33.53  E-value=24  Score=31.19  Aligned_cols=11  Identities=55%  Similarity=0.951  Sum_probs=9.9

Q ss_pred             ccccccCCCCc
Q psy4937          35 ETYIYCPHCNK   45 (356)
Q Consensus        35 ~~~~~~~~~~~   45 (356)
                      ..|.-||+|+|
T Consensus        32 ~~Y~aC~~C~k   42 (166)
T cd04476          32 WWYPACPGCNK   42 (166)
T ss_pred             eEEccccccCc
Confidence            57999999999


No 52 
>PLN02947 oxidoreductase
Probab=33.21  E-value=1.5e+02  Score=30.02  Aligned_cols=42  Identities=14%  Similarity=0.131  Sum_probs=33.9

Q ss_pred             ceEEEEecCCCeEEEecchhhcccccccccC-------CCCCCeEEEEe
Q psy4937         291 SVVIRIPMPARSLLVLWGEARYEWQHCVLRS-------DVIGRRVCIAY  332 (356)
Q Consensus       291 ~~~v~I~Lp~rSLLVMsG~ARy~W~H~I~~~-------dvr~rRVSLTf  332 (356)
                      +..+.|+-.+++++|--|+.=-.|.-+.-+.       .-...|+||.|
T Consensus       270 g~Wi~V~p~pga~VVNvGD~Lq~~SNG~~kS~~HRVv~~~~~~R~Sia~  318 (374)
T PLN02947        270 GRWVTVEPIPGSFVVNVGDHLEIFSNGRYKSVLHRVRVNSTKPRISVAS  318 (374)
T ss_pred             CEEEeCCCCCCeEEEEeCceeeeeeCCEEeccccccccCCCCCEEEEEE
Confidence            5688899999999999999988888666554       11457999998


No 53 
>PLN02393 leucoanthocyanidin dioxygenase like protein
Probab=32.74  E-value=1.4e+02  Score=30.12  Aligned_cols=42  Identities=17%  Similarity=0.280  Sum_probs=33.3

Q ss_pred             ceEEEEecCCCeEEEecchhhcccccccccC-------CCCCCeEEEEe
Q psy4937         291 SVVIRIPMPARSLLVLWGEARYEWQHCVLRS-------DVIGRRVCIAY  332 (356)
Q Consensus       291 ~~~v~I~Lp~rSLLVMsG~ARy~W~H~I~~~-------dvr~rRVSLTf  332 (356)
                      +..+.|+-.+++|+|.-|++=-.|.-+.-+.       .....|+|+.|
T Consensus       259 g~W~~V~p~pgalVVNiGD~l~~~Tng~~kSt~HRVv~~~~~~R~Siaf  307 (362)
T PLN02393        259 DAWITVKPVPDAFIVNIGDQIQVLSNAIYKSVEHRVIVNSAKERVSLAF  307 (362)
T ss_pred             CEEEECCCCCCeEEEEcchhhHhhcCCeeeccceecccCCCCCEEEEEE
Confidence            5688899999999999999998886555543       11456999998


No 54 
>PRK04023 DNA polymerase II large subunit; Validated
Probab=31.95  E-value=27  Score=39.95  Aligned_cols=22  Identities=41%  Similarity=0.975  Sum_probs=15.9

Q ss_pred             ceeeccccccccCcccccccccccccccccCCCCc
Q psy4937          11 GVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNK   45 (356)
Q Consensus        11 g~r~c~~ce~~~~~~~~~~~~~~~~~~~~~~~~~~   45 (356)
                      |.|+|-.|-...             .+..||+|+.
T Consensus       625 g~RfCpsCG~~t-------------~~frCP~CG~  646 (1121)
T PRK04023        625 GRRKCPSCGKET-------------FYRRCPFCGT  646 (1121)
T ss_pred             cCccCCCCCCcC-------------CcccCCCCCC
Confidence            678888888652             3457888887


No 55 
>PF09538 FYDLN_acid:  Protein of unknown function (FYDLN_acid);  InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=31.12  E-value=29  Score=29.30  Aligned_cols=13  Identities=23%  Similarity=0.871  Sum_probs=10.3

Q ss_pred             ceeeccccccccc
Q psy4937          85 TYVYCPHCNKAWQ   97 (356)
Q Consensus        85 ~~~~c~~c~~~~~   97 (356)
                      .-+-||+|+..|+
T Consensus        25 ~PivCP~CG~~~~   37 (108)
T PF09538_consen   25 DPIVCPKCGTEFP   37 (108)
T ss_pred             CCccCCCCCCccC
Confidence            3466999998886


No 56 
>PF10083 DUF2321:  Uncharacterized protein conserved in bacteria (DUF2321);  InterPro: IPR016891 This entry is represented by Bacteriophage 'Lactobacillus prophage Lj928', Orf-Ljo1454. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=30.30  E-value=17  Score=32.85  Aligned_cols=47  Identities=26%  Similarity=0.675  Sum_probs=34.0

Q ss_pred             cccccceeeccccccccCccc--cccccc--ccccccccCCCCcc--cccCcc
Q psy4937           6 FCGCKGVRTCIKCEQEFGYEN--KNIVEF--TEETYIYCPHCNKA--WQGSDL   52 (356)
Q Consensus         6 ~c~ckg~r~c~~ce~~~~~~~--~~~~~~--~~~~~~~~~~~~~~--~~~~~~   52 (356)
                      .||=|.|+.|..|...-+..-  ...+.|  .-+.--||-+|+++  |-.+.|
T Consensus        33 kCG~~tI~~Cp~C~~~IrG~y~v~gv~~~g~~~~~PsYC~~CGkpyPWt~~~L   85 (158)
T PF10083_consen   33 KCGAKTITSCPNCSTPIRGDYHVEGVFGLGGHYEAPSYCHNCGKPYPWTENAL   85 (158)
T ss_pred             HhhHHHHHHCcCCCCCCCCceecCCeeeeCCCCCCChhHHhCCCCCchHHHHH
Confidence            599999999999999887772  334444  23455699999997  654433


No 57 
>TIGR02466 conserved hypothetical protein. This family consists of uncharacterized proteins in Caulobacter crescentus CB15, Bdellovibrio bacteriovorus HD100, Synechococcus sp. WH 8102 (2), Silicibacter pomeroyi DSS-3 (2), and Hyphomonas neptunium ATCC 15444. The context of nearby genes differs substantially between members and does point to any specific biological role.
Probab=30.23  E-value=72  Score=29.69  Aligned_cols=36  Identities=17%  Similarity=0.308  Sum_probs=29.2

Q ss_pred             EEEEecCCCeEEEecchhhcccccccccCCCCCCeEEEEe
Q psy4937         293 VIRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAY  332 (356)
Q Consensus       293 ~v~I~Lp~rSLLVMsG~ARy~W~H~I~~~dvr~rRVSLTf  332 (356)
                      .+.+.-.+|.|+|+-.-.+    |++++-.-.+.||||.|
T Consensus       161 ~~~v~P~~G~lvlFPS~L~----H~v~p~~~~~~RISiSF  196 (201)
T TIGR02466       161 FVYVPPQEGRVLLFESWLR----HEVPPNESEEERISVSF  196 (201)
T ss_pred             cEEECCCCCeEEEECCCCc----eecCCCCCCCCEEEEEE
Confidence            3456668899988887654    99999877899999998


No 58 
>PLN02515 naringenin,2-oxoglutarate 3-dioxygenase
Probab=30.06  E-value=2.2e+02  Score=28.67  Aligned_cols=41  Identities=15%  Similarity=0.119  Sum_probs=32.8

Q ss_pred             eEEEEecCCCeEEEecchhhcccccccccC---C----CCCCeEEEEe
Q psy4937         292 VVIRIPMPARSLLVLWGEARYEWQHCVLRS---D----VIGRRVCIAY  332 (356)
Q Consensus       292 ~~v~I~Lp~rSLLVMsG~ARy~W~H~I~~~---d----vr~rRVSLTf  332 (356)
                      ..+.|+-.+++|+|.-|+.=-.|+-+.-+.   .    ....|+||.|
T Consensus       243 ~Wi~Vpp~pgalVVNiGD~L~~~TNG~~kSt~HRVv~~~~~~R~Si~~  290 (358)
T PLN02515        243 TWITVQPVEGAFVVNLGDHGHYLSNGRFKNADHQAVVNSNCSRLSIAT  290 (358)
T ss_pred             eEEECCCCCCeEEEEccHHHHHHhCCeeeeecceEECCCCCCEEEEEE
Confidence            578899999999999999988897665543   1    1456999998


No 59 
>COG2850 Uncharacterized conserved protein [Function unknown]
Probab=29.61  E-value=80  Score=32.36  Aligned_cols=55  Identities=13%  Similarity=0.315  Sum_probs=34.6

Q ss_pred             HHHHHHHHhhcccCCCCCCcceeeeEecCCCCCCccCCCCCcccccCceEEEecCCceEEEEeec
Q psy4937         179 TKFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYY  243 (356)
Q Consensus       179 l~~l~~Ri~~~~~l~~f~~~~q~~leY~pg~G~~I~pHvDd~~~~Ge~IaslSLgs~~vm~f~~~  243 (356)
                      +..+.++++   .++.+ +.+.+.+.|.+.+| +.+||.|.-     .|..|=.-+.+.......
T Consensus       103 v~~l~~~Fr---flP~w-r~ddiMIS~a~~GG-gvg~H~D~Y-----DVfliQg~G~RRW~v~~~  157 (383)
T COG2850         103 VAALMEPFR---FLPDW-RIDDIMISFAAPGG-GVGPHFDQY-----DVFLIQGQGRRRWRVGKK  157 (383)
T ss_pred             HHHHHHHhc---cCccc-cccceEEEEecCCC-ccCccccch-----heeEEeecccceeecCCc
Confidence            444444444   34444 45778888887654 899999962     455666666666665544


No 60 
>PRK12380 hydrogenase nickel incorporation protein HybF; Provisional
Probab=29.61  E-value=38  Score=28.56  Aligned_cols=27  Identities=26%  Similarity=0.609  Sum_probs=18.2

Q ss_pred             ceeeccccccccCcccccccccccccccccCCCCcc
Q psy4937          11 GVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKA   46 (356)
Q Consensus        11 g~r~c~~ce~~~~~~~~~~~~~~~~~~~~~~~~~~~   46 (356)
                      +.-.|..|...+.....         +..||+|+-+
T Consensus        69 ~~~~C~~Cg~~~~~~~~---------~~~CP~Cgs~   95 (113)
T PRK12380         69 AQAWCWDCSQVVEIHQH---------DAQCPHCHGE   95 (113)
T ss_pred             cEEEcccCCCEEecCCc---------CccCcCCCCC
Confidence            56679999866554432         3459999963


No 61 
>COG1656 Uncharacterized conserved protein [Function unknown]
Probab=29.44  E-value=25  Score=31.95  Aligned_cols=38  Identities=29%  Similarity=0.734  Sum_probs=25.7

Q ss_pred             eccccccccCcccccccccccc--------cccccCCCCcc-cccCc
Q psy4937          14 TCIKCEQEFGYENKNIVEFTEE--------TYIYCPHCNKA-WQGSD   51 (356)
Q Consensus        14 ~c~~ce~~~~~~~~~~~~~~~~--------~~~~~~~~~~~-~~~~~   51 (356)
                      .|..|+..-..-+...|+..+.        ..-.||.|++- |.||-
T Consensus        99 RCp~CN~~L~~vs~eev~~~Vp~~~~~~~~~f~~C~~CgkiYW~GsH  145 (165)
T COG1656          99 RCPECNGELEKVSREEVKEKVPEKVYRNYEEFYRCPKCGKIYWKGSH  145 (165)
T ss_pred             cCcccCCEeccCcHHHHhhccchhhhhcccceeECCCCcccccCchH
Confidence            5889988766665545444433        23459999998 88873


No 62 
>PF01155 HypA:  Hydrogenase expression/synthesis hypA family;  InterPro: IPR000688 Bacterial membrane-bound nickel-dependent hydrogenases requires a number of accessory proteins which are involved in their maturation. The exact role of these proteins is not yet clear, but some seem to be required for the incorporation of the nickel ions []. One of these proteins is generally known as hypA. It is a protein of about 12 to 14 kDa that contains, in its C-terminal region, four conserved cysteines that form a zinc-finger like motif. Escherichia coli has two proteins that belong to this family, hypA and hybF. A homologue, MJ0214, has also been found in a number of archaeal species, including the genome of Methanocaldococcus jannaschii (Methanococcus jannaschii).; GO: 0016151 nickel ion binding, 0006464 protein modification process; PDB: 2KDX_A 3A44_D 3A43_B.
Probab=29.22  E-value=23  Score=29.76  Aligned_cols=27  Identities=26%  Similarity=0.735  Sum_probs=17.9

Q ss_pred             ceeeccccccccCcccccccccccccccccCCCCcc
Q psy4937          11 GVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKA   46 (356)
Q Consensus        11 g~r~c~~ce~~~~~~~~~~~~~~~~~~~~~~~~~~~   46 (356)
                      ..-.|..|...+..+...         ..||+|+..
T Consensus        69 ~~~~C~~Cg~~~~~~~~~---------~~CP~Cgs~   95 (113)
T PF01155_consen   69 ARARCRDCGHEFEPDEFD---------FSCPRCGSP   95 (113)
T ss_dssp             -EEEETTTS-EEECHHCC---------HH-SSSSSS
T ss_pred             CcEECCCCCCEEecCCCC---------CCCcCCcCC
Confidence            456799999887776553         359999874


No 63 
>PLN02997 flavonol synthase
Probab=29.20  E-value=2e+02  Score=28.55  Aligned_cols=42  Identities=10%  Similarity=0.142  Sum_probs=32.2

Q ss_pred             ceEEEEecCCCeEEEecchhhcccccccccC-------CCCCCeEEEEe
Q psy4937         291 SVVIRIPMPARSLLVLWGEARYEWQHCVLRS-------DVIGRRVCIAY  332 (356)
Q Consensus       291 ~~~v~I~Lp~rSLLVMsG~ARy~W~H~I~~~-------dvr~rRVSLTf  332 (356)
                      +..+.|+-.+++|+|--|+.=-.|.-|.-+.       .-...|+||+|
T Consensus       228 g~Wi~V~p~pgalvVNiGD~Le~~TNG~~kSt~HRVv~~~~~~R~Si~f  276 (325)
T PLN02997        228 EQWLDLNYINSAVVVIIGDQLMRMTNGRFKNVLHRAKTDKERLRISWPV  276 (325)
T ss_pred             CcEEECCCCCCeEEEEechHHHHHhCCccccccceeeCCCCCCEEEEEE
Confidence            5578888899999999999998898554432       11345999988


No 64 
>PLN02156 gibberellin 2-beta-dioxygenase
Probab=28.31  E-value=1.8e+02  Score=29.05  Aligned_cols=42  Identities=21%  Similarity=0.319  Sum_probs=34.8

Q ss_pred             ceEEEEecCCCeEEEecchhhcccccccccCC---C----CCCeEEEEe
Q psy4937         291 SVVIRIPMPARSLLVLWGEARYEWQHCVLRSD---V----IGRRVCIAY  332 (356)
Q Consensus       291 ~~~v~I~Lp~rSLLVMsG~ARy~W~H~I~~~d---v----r~rRVSLTf  332 (356)
                      +..+.|+-.+++++|--|+.=-.|..+.-+.-   |    ...|+||.|
T Consensus       226 g~Wi~Vpp~pga~VVNiGD~l~~wTNg~~kSt~HRVv~~~~~~R~Siaf  274 (335)
T PLN02156        226 GTWVDVPPDHSSFFVLVGDTLQVMTNGRFKSVKHRVVTNTKRSRISMIY  274 (335)
T ss_pred             CCEEEccCCCCcEEEEhHHHHHHHhCCeeeccceeeecCCCCCEEEEEE
Confidence            56888999999999999999999998877641   1    445999998


No 65 
>PLN02758 oxidoreductase, 2OG-Fe(II) oxygenase family protein
Probab=28.22  E-value=1.9e+02  Score=29.05  Aligned_cols=42  Identities=12%  Similarity=0.088  Sum_probs=33.6

Q ss_pred             ceEEEEecCCCeEEEecchhhcccccccccCC---C----CCCeEEEEe
Q psy4937         291 SVVIRIPMPARSLLVLWGEARYEWQHCVLRSD---V----IGRRVCIAY  332 (356)
Q Consensus       291 ~~~v~I~Lp~rSLLVMsG~ARy~W~H~I~~~d---v----r~rRVSLTf  332 (356)
                      +..+.|+-.+|+++|.-|+.=-.|.-+.-+.-   |    ...|+||.|
T Consensus       258 g~Wi~V~p~pgalVVNiGD~L~~~SNG~~kS~~HRVv~~~~~~R~Sia~  306 (361)
T PLN02758        258 NTWVPVHPVPNALVINIGDTLEVLTNGKYKSVEHRAVTNKEKDRLSIVT  306 (361)
T ss_pred             CEEEeCCCCCCeEEEEccchhhhhcCCeeecccceeecCCCCCEEEEEE
Confidence            56788999999999999999999976666541   1    456999997


No 66 
>PLN02216 protein SRG1
Probab=27.52  E-value=2.6e+02  Score=28.08  Aligned_cols=42  Identities=17%  Similarity=0.124  Sum_probs=33.1

Q ss_pred             ceEEEEecCCCeEEEecchhhcccccccccC-------CCCCCeEEEEe
Q psy4937         291 SVVIRIPMPARSLLVLWGEARYEWQHCVLRS-------DVIGRRVCIAY  332 (356)
Q Consensus       291 ~~~v~I~Lp~rSLLVMsG~ARy~W~H~I~~~-------dvr~rRVSLTf  332 (356)
                      +..+.|+-.+++|+|.-|+.=-.|.-+.-+.       .....|+||.|
T Consensus       256 g~Wi~V~p~pgalvVNiGD~L~~~TNG~~kS~~HRVv~~~~~~R~Si~~  304 (357)
T PLN02216        256 GKWVSVKPLPNALVVNVGDILEIITNGTYRSIEHRGVVNSEKERLSVAT  304 (357)
T ss_pred             CEEEECCCCCCeEEEEcchhhHhhcCCeeeccCceeecCCCCCEEEEEE
Confidence            6688899999999999999999998544432       11456999998


No 67 
>COG0266 Nei Formamidopyrimidine-DNA glycosylase [DNA replication, recombination, and repair]
Probab=27.36  E-value=35  Score=33.48  Aligned_cols=13  Identities=38%  Similarity=0.979  Sum_probs=9.8

Q ss_pred             cccceeecccccc
Q psy4937          82 TKETYVYCPHCNK   94 (356)
Q Consensus        82 ~~~~~~~c~~c~~   94 (356)
                      +--+..|||.|.+
T Consensus       261 ~gR~t~~CP~CQ~  273 (273)
T COG0266         261 GGRSTFYCPVCQK  273 (273)
T ss_pred             cCCcCEeCCCCCC
Confidence            4457789999974


No 68 
>PRK14714 DNA polymerase II large subunit; Provisional
Probab=26.55  E-value=51  Score=38.72  Aligned_cols=10  Identities=40%  Similarity=1.019  Sum_probs=7.4

Q ss_pred             eeeccccccc
Q psy4937          86 YVYCPHCNKA   95 (356)
Q Consensus        86 ~~~c~~c~~~   95 (356)
                      ..+||.|+.-
T Consensus       709 a~~CP~CGtp  718 (1337)
T PRK14714        709 RVECPRCDVE  718 (1337)
T ss_pred             cccCCCCCCc
Confidence            5579999843


No 69 
>PLN02254 gibberellin 3-beta-dioxygenase
Probab=26.40  E-value=2.2e+02  Score=28.75  Aligned_cols=42  Identities=21%  Similarity=0.194  Sum_probs=33.9

Q ss_pred             ceEEEEecCCCeEEEecchhhcccccccccCC-------CCCCeEEEEe
Q psy4937         291 SVVIRIPMPARSLLVLWGEARYEWQHCVLRSD-------VIGRRVCIAY  332 (356)
Q Consensus       291 ~~~v~I~Lp~rSLLVMsG~ARy~W~H~I~~~d-------vr~rRVSLTf  332 (356)
                      +..+.|+-.+++|+|--|+.=-.|.-+.-+.-       -...|+||+|
T Consensus       256 ~~Wi~V~p~pgalVVNiGD~lq~~SNg~~kS~~HRVv~~~~~~R~Sia~  304 (358)
T PLN02254        256 VGWVTVPPVPGSLVVNVGDLLHILSNGRFPSVLHRAVVNKTRHRISVAY  304 (358)
T ss_pred             CEEEEcccCCCCEEEEhHHHHHHHhCCeeccccceeecCCCCCEEEEEE
Confidence            36888999999999999999888987766541       1467999998


No 70 
>PF05118 Asp_Arg_Hydrox:  Aspartyl/Asparaginyl beta-hydroxylase;  InterPro: IPR007803 The alpha-ketoglutarate-dependent dioxygenase aspartyl (asparaginyl) beta-hydroxylase (1.14.11.16 from EC) specifically hydroxylates one aspartic or asparagine residue in certain epidermal growth factor-like domains of a number of proteins. Its action may be due to histidine-675, which, when mutated to an alanine residue, causes the loss of enzymatic activity in the protein [].  An invertebrate alpha-ketoglutarate-dependent aspartyl/asparaginyl beta-hydroxylase, which posttranslationally hydroxylates specific aspartyl or asparaginyl residues within epidermal growth factor-like modules [], activity was found to be similar to that of the purified mammalian aspartyl/asparaginyl beta-hydroxylase with respect to cofactor requirements, stereochemistry and substrate sequence specificity []. This enzyme requires Fe2+ as a cofactor. Some vitamin K-dependent coagulation factors, as well as synthetic peptides based on the structure of the first epidermal growth factor domain of human coagulation factor IX or X, can act as acceptors.; GO: 0018193 peptidyl-amino acid modification, 0030176 integral to endoplasmic reticulum membrane; PDB: 3RCQ_A 1E5S_A 1E5R_B.
Probab=26.35  E-value=1.9e+02  Score=25.61  Aligned_cols=38  Identities=13%  Similarity=0.040  Sum_probs=23.6

Q ss_pred             EEecCCCeEEEecchhhcccccccccCCCCCCeEEEEecccCC
Q psy4937         295 RIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTA  337 (356)
Q Consensus       295 ~I~Lp~rSLLVMsG~ARy~W~H~I~~~dvr~rRVSLTfR~l~~  337 (356)
                      ...+..|-++++...    +.|.+-.. -.+.||.|.+--..|
T Consensus       125 ~~~w~~G~~~~fD~s----~~H~~~N~-~~~~Rv~L~vD~~hP  162 (163)
T PF05118_consen  125 TRHWREGECWVFDDS----FEHEVWNN-GDEDRVVLIVDFWHP  162 (163)
T ss_dssp             EEB--CTEEEEE-TT----S-EEEEES-SSS-EEEEEEEEE-T
T ss_pred             EEEeccCcEEEEeCC----EEEEEEeC-CCCCEEEEEEEeecC
Confidence            477789999998876    56777664 268899998765544


No 71 
>PRK14811 formamidopyrimidine-DNA glycosylase; Provisional
Probab=26.07  E-value=42  Score=32.47  Aligned_cols=16  Identities=19%  Similarity=0.567  Sum_probs=12.1

Q ss_pred             cccceeeccccccccc
Q psy4937          82 TKETYVYCPHCNKAWQ   97 (356)
Q Consensus        82 ~~~~~~~c~~c~~~~~   97 (356)
                      +--+-.|||.|.+..|
T Consensus       251 ~gR~ty~Cp~CQ~~~~  266 (269)
T PRK14811        251 GGRGTHFCPQCQPLRP  266 (269)
T ss_pred             CCCCcEECCCCcCCCC
Confidence            3457789999997754


No 72 
>smart00661 RPOL9 RNA polymerase subunit 9.
Probab=25.95  E-value=52  Score=23.12  Aligned_cols=12  Identities=17%  Similarity=0.382  Sum_probs=9.3

Q ss_pred             eeeccccccccc
Q psy4937          86 YVYCPHCNKAWQ   97 (356)
Q Consensus        86 ~~~c~~c~~~~~   97 (356)
                      ...|+.|+..+.
T Consensus        20 ~~vC~~Cg~~~~   31 (52)
T smart00661       20 RFVCRKCGYEEP   31 (52)
T ss_pred             EEECCcCCCeEE
Confidence            667999997654


No 73 
>PF12171 zf-C2H2_jaz:  Zinc-finger double-stranded RNA-binding;  InterPro: IPR022755  This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation.   This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=25.59  E-value=30  Score=21.30  Aligned_cols=11  Identities=36%  Similarity=1.102  Sum_probs=9.6

Q ss_pred             eeccccccccc
Q psy4937          87 VYCPHCNKAWQ   97 (356)
Q Consensus        87 ~~c~~c~~~~~   97 (356)
                      .||..|++.+.
T Consensus         2 ~~C~~C~k~f~   12 (27)
T PF12171_consen    2 FYCDACDKYFS   12 (27)
T ss_dssp             CBBTTTTBBBS
T ss_pred             CCcccCCCCcC
Confidence            48999999986


No 74 
>PLN02704 flavonol synthase
Probab=25.30  E-value=2e+02  Score=28.48  Aligned_cols=42  Identities=10%  Similarity=0.015  Sum_probs=34.4

Q ss_pred             ceEEEEecCCCeEEEecchhhcccccccccCC-------CCCCeEEEEe
Q psy4937         291 SVVIRIPMPARSLLVLWGEARYEWQHCVLRSD-------VIGRRVCIAY  332 (356)
Q Consensus       291 ~~~v~I~Lp~rSLLVMsG~ARy~W~H~I~~~d-------vr~rRVSLTf  332 (356)
                      +..+.|+-.+++|+|.-|+.=-.|.-+.-+.-       -...|+||+|
T Consensus       244 g~Wi~V~p~pg~lvVNvGD~L~~~TNg~~kSt~HRVv~~~~~~R~Si~~  292 (335)
T PLN02704        244 DHWFDVKYIPNALVIHIGDQIEILSNGKYKSVLHRTTVNKEKTRMSWPV  292 (335)
T ss_pred             CEEEeCCCCCCeEEEEechHHHHHhCCeeecccceeecCCCCCeEEEEE
Confidence            56888999999999999999999987766541       1466999998


No 75 
>PF07754 DUF1610:  Domain of unknown function (DUF1610);  InterPro: IPR011668 This domain is found in archaeal species. It is likely to bind zinc via its four well-conserved cysteine residues.
Probab=25.12  E-value=60  Score=20.35  Aligned_cols=12  Identities=25%  Similarity=0.764  Sum_probs=8.6

Q ss_pred             cccceeeccccc
Q psy4937          82 TKETYVYCPHCN   93 (356)
Q Consensus        82 ~~~~~~~c~~c~   93 (356)
                      +...-..||.|+
T Consensus        12 ~~~v~f~CPnCG   23 (24)
T PF07754_consen   12 EQAVPFPCPNCG   23 (24)
T ss_pred             ccCceEeCCCCC
Confidence            445667788887


No 76 
>PF09788 Tmemb_55A:  Transmembrane protein 55A;  InterPro: IPR019178  Members of this family catalyse the hydrolysis of the 4-position phosphate of phosphatidylinositol 4,5-bisphosphate, in the reaction:  1-phosphatidyl-myo-inositol 4,5-bisphosphate + H(2)O = 1-phosphatidyl-1D-myo-inositol 5-phosphate + phosphate.  
Probab=24.73  E-value=37  Score=32.94  Aligned_cols=28  Identities=36%  Similarity=0.833  Sum_probs=21.1

Q ss_pred             eccccccccCcccccccccccccccccCCCCcc
Q psy4937          14 TCIKCEQEFGYENKNIVEFTEETYIYCPHCNKA   46 (356)
Q Consensus        14 ~c~~ce~~~~~~~~~~~~~~~~~~~~~~~~~~~   46 (356)
                      +|.+|...|-....     +..|...||||.|-
T Consensus       159 ~CghC~~~Fl~~~~-----~~~tlARCPHCrKv  186 (256)
T PF09788_consen  159 ICGHCSNTFLFNTL-----TSNTLARCPHCRKV  186 (256)
T ss_pred             ECCCCCCcEeccCC-----CCCccccCCCCcee
Confidence            79999988765533     23578899999983


No 77 
>KOG3084|consensus
Probab=24.53  E-value=54  Score=32.99  Aligned_cols=46  Identities=26%  Similarity=0.358  Sum_probs=33.3

Q ss_pred             eeeccccccccccCccccccCCCCCCCC-------CCCCCCe-EEeCCCCCHHH
Q psy4937          86 YVYCPHCNKAWQGSDLYAYQSHPNHSGK-------SFDIGGV-YIKEDFLSEAE  131 (356)
Q Consensus        86 ~~~c~~c~~~~~g~~~~~~~~~~~h~~~-------~~~ipGl-~~i~dFIS~~E  131 (356)
                      -.+|+.|++.||-.|+-.|-.--||.+.       --..+|| ..+.+||-+-|
T Consensus       173 ~~~C~~~n~~yPr~dPvVIm~li~~d~~~~LL~R~~r~~~gl~t~lAGFlEpGE  226 (345)
T KOG3084|consen  173 DETCPSCNVIYPRTDPVVIMLLIDHDGKHALLGRQKRYPPGLWTCLAGFLEPGE  226 (345)
T ss_pred             cccCCcCCeeccCCCCeEEEEEEcCCCCEeeeecccCCCCchhhhhhccCCccc
Confidence            4699999999998888777666677642       2456776 46778996544


No 78 
>PRK00398 rpoP DNA-directed RNA polymerase subunit P; Provisional
Probab=24.23  E-value=52  Score=23.03  Aligned_cols=10  Identities=40%  Similarity=0.790  Sum_probs=8.0

Q ss_pred             eeeccccccc
Q psy4937          86 YVYCPHCNKA   95 (356)
Q Consensus        86 ~~~c~~c~~~   95 (356)
                      .+-||.|+.-
T Consensus        21 ~~~Cp~CG~~   30 (46)
T PRK00398         21 GVRCPYCGYR   30 (46)
T ss_pred             ceECCCCCCe
Confidence            6889999853


No 79 
>smart00659 RPOLCX RNA polymerase subunit CX. present in RNA polymerase I, II and III
Probab=24.23  E-value=42  Score=23.82  Aligned_cols=24  Identities=25%  Similarity=0.627  Sum_probs=17.3

Q ss_pred             eccccccccCcccccccccccccccccCCCCc
Q psy4937          14 TCIKCEQEFGYENKNIVEFTEETYIYCPHCNK   45 (356)
Q Consensus        14 ~c~~ce~~~~~~~~~~~~~~~~~~~~~~~~~~   45 (356)
                      .|..|..++.....        .-|.||+|+-
T Consensus         4 ~C~~Cg~~~~~~~~--------~~irC~~CG~   27 (44)
T smart00659        4 ICGECGRENEIKSK--------DVVRCRECGY   27 (44)
T ss_pred             ECCCCCCEeecCCC--------CceECCCCCc
Confidence            58888877666533        3489999975


No 80 
>PRK12775 putative trifunctional 2-polyprenylphenol hydroxylase/glutamate synthase subunit beta/ferritin domain-containing protein; Provisional
Probab=23.49  E-value=44  Score=38.38  Aligned_cols=51  Identities=24%  Similarity=0.479  Sum_probs=31.1

Q ss_pred             cccccCCCCcccccCccccccCCCCCCCCccccccccccccccccccccceeeccccc
Q psy4937          36 TYIYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDFLTSQTKETYVYCPHCN   93 (356)
Q Consensus        36 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~c~~c~   93 (356)
                      .-.+||.||+|..+++-|.-=-+...+---=+|+++|+-       -...|-.||+|+
T Consensus       795 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~  845 (1006)
T PRK12775        795 GVATCPKCHRPLEGDEEYVCCATSELQWRCDDCGKVSEG-------FAFPYGMCPACG  845 (1006)
T ss_pred             CCccCcccCCCCCCCceeEEecCcceeeehhhhcccccc-------ccCCcCcCcccc
Confidence            346899999998888776543333322222246666552       233567788887


No 81 
>PLN02403 aminocyclopropanecarboxylate oxidase
Probab=22.86  E-value=2.6e+02  Score=27.46  Aligned_cols=42  Identities=24%  Similarity=0.260  Sum_probs=31.0

Q ss_pred             ceEEEEecCC-CeEEEecchhhcccccccccC-------CCCCCeEEEEe
Q psy4937         291 SVVIRIPMPA-RSLLVLWGEARYEWQHCVLRS-------DVIGRRVCIAY  332 (356)
Q Consensus       291 ~~~v~I~Lp~-rSLLVMsG~ARy~W~H~I~~~-------dvr~rRVSLTf  332 (356)
                      +..+.|+-.+ ++|+|--|+.=-.|.=+.-+.       .-...|+||+|
T Consensus       199 g~Wi~V~p~p~~~lvVNvGD~L~~~Tng~~~S~~HRVv~~~~~~R~Si~~  248 (303)
T PLN02403        199 GKWVPIPPSKNNTIFVNTGDQLEVLSNGRYKSTLHRVMADKNGSRLSIAT  248 (303)
T ss_pred             CeEEECCCCCCCEEEEEehHHHHHHhCCeeecccceeecCCCCCEEEEEE
Confidence            5567777666 699999999988887665553       11556999998


No 82 
>PRK00762 hypA hydrogenase nickel incorporation protein; Provisional
Probab=22.57  E-value=64  Score=27.64  Aligned_cols=31  Identities=26%  Similarity=0.488  Sum_probs=18.4

Q ss_pred             ceeeccccccccCccccccccccccc-ccccCCCCc
Q psy4937          11 GVRTCIKCEQEFGYENKNIVEFTEET-YIYCPHCNK   45 (356)
Q Consensus        11 g~r~c~~ce~~~~~~~~~~~~~~~~~-~~~~~~~~~   45 (356)
                      +.-.| .|...+......+   .... +..||+|+-
T Consensus        69 ~~~~C-~Cg~~~~~~~~~~---~~~~~~~~CP~Cgs  100 (124)
T PRK00762         69 VEIEC-ECGYEGVVDEDEI---DHYAAVIECPVCGN  100 (124)
T ss_pred             eeEEe-eCcCcccccccch---hccccCCcCcCCCC
Confidence            56689 9997766542111   0111 357999985


No 83 
>PRK03824 hypA hydrogenase nickel incorporation protein; Provisional
Probab=22.13  E-value=60  Score=28.26  Aligned_cols=35  Identities=20%  Similarity=0.650  Sum_probs=21.6

Q ss_pred             ceeeccccccccCccc------------ccccccccccccccCCCCc
Q psy4937          11 GVRTCIKCEQEFGYEN------------KNIVEFTEETYIYCPHCNK   45 (356)
Q Consensus        11 g~r~c~~ce~~~~~~~------------~~~~~~~~~~~~~~~~~~~   45 (356)
                      ..-.|..|...+....            +.++..+...+..||.|+-
T Consensus        69 ~~~~C~~CG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~CP~Cgs  115 (135)
T PRK03824         69 AVLKCRNCGNEWSLKEVKESLDEEIREAIHFIPEVVHAFLKCPKCGS  115 (135)
T ss_pred             eEEECCCCCCEEecccccccccccccccccccccccccCcCCcCCCC
Confidence            4567999987765541            1222222356678999986


No 84 
>PRK01103 formamidopyrimidine/5-formyluracil/ 5-hydroxymethyluracil DNA glycosylase; Validated
Probab=21.85  E-value=55  Score=31.61  Aligned_cols=13  Identities=31%  Similarity=0.976  Sum_probs=9.8

Q ss_pred             cccceeecccccc
Q psy4937          82 TKETYVYCPHCNK   94 (356)
Q Consensus        82 ~~~~~~~c~~c~~   94 (356)
                      +--+-.|||.|.+
T Consensus       261 ~gR~t~~CP~CQ~  273 (274)
T PRK01103        261 GGRSTFFCPRCQK  273 (274)
T ss_pred             CCCCcEECcCCCC
Confidence            3457789999975


No 85 
>smart00531 TFIIE Transcription initiation factor IIE.
Probab=21.49  E-value=37  Score=29.71  Aligned_cols=12  Identities=33%  Similarity=1.074  Sum_probs=8.7

Q ss_pred             ccccccCCCCcc
Q psy4937          35 ETYIYCPHCNKA   46 (356)
Q Consensus        35 ~~~~~~~~~~~~   46 (356)
                      ..+..||+|+.-
T Consensus        97 ~~~Y~Cp~C~~~  108 (147)
T smart00531       97 NAYYKCPNCQSK  108 (147)
T ss_pred             CcEEECcCCCCE
Confidence            346679999873


No 86 
>PF11672 DUF3268:  Protein of unknown function (DUF3268);  InterPro: IPR021686 This entry is represented by Listeria phage P100, Gp150. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches.
Probab=21.32  E-value=1.2e+02  Score=25.58  Aligned_cols=17  Identities=41%  Similarity=1.023  Sum_probs=12.0

Q ss_pred             cccCCCCcc---cccCcccc
Q psy4937          38 IYCPHCNKA---WQGSDLYA   54 (356)
Q Consensus        38 ~~~~~~~~~---~~~~~~~~   54 (356)
                      +.||-|+..   ..+++||-
T Consensus         3 ~~CpYCg~~~~l~~~~~iYg   22 (102)
T PF11672_consen    3 IICPYCGGPAELVDGSEIYG   22 (102)
T ss_pred             cccCCCCCeeEEcccchhcC
Confidence            579999876   55666654


No 87 
>PRK03681 hypA hydrogenase nickel incorporation protein; Validated
Probab=21.27  E-value=67  Score=27.13  Aligned_cols=28  Identities=21%  Similarity=0.471  Sum_probs=17.6

Q ss_pred             ceeeccccccccCcccccccccccccccccCCCCcc
Q psy4937          11 GVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKA   46 (356)
Q Consensus        11 g~r~c~~ce~~~~~~~~~~~~~~~~~~~~~~~~~~~   46 (356)
                      +.-.|..|...+....        ..+..||+|+-.
T Consensus        69 ~~~~C~~Cg~~~~~~~--------~~~~~CP~Cgs~   96 (114)
T PRK03681         69 AECWCETCQQYVTLLT--------QRVRRCPQCHGD   96 (114)
T ss_pred             cEEEcccCCCeeecCC--------ccCCcCcCcCCC
Confidence            5567999985433322        234679999863


No 88 
>PF06677 Auto_anti-p27:  Sjogren's syndrome/scleroderma autoantigen 1 (Autoantigen p27);  InterPro: IPR009563 The proteins in this entry are functionally uncharacterised and include several proteins that characterise Sjogren's syndrome/scleroderma autoantigen 1 (Autoantigen p27). It is thought that the potential association of anti-p27 with anti-centromere antibodies suggests that autoantigen p27 might play a role in mitosis [].
Probab=21.27  E-value=57  Score=22.90  Aligned_cols=9  Identities=33%  Similarity=1.077  Sum_probs=6.6

Q ss_pred             ceeeccccc
Q psy4937          85 TYVYCPHCN   93 (356)
Q Consensus        85 ~~~~c~~c~   93 (356)
                      .-+||+.|+
T Consensus        33 g~~~Cv~C~   41 (41)
T PF06677_consen   33 GKIYCVSCG   41 (41)
T ss_pred             CCEECCCCC
Confidence            457899885


No 89 
>COG1996 RPC10 DNA-directed RNA polymerase, subunit RPC10 (contains C4-type Zn-finger) [Transcription]
Probab=21.26  E-value=45  Score=24.42  Aligned_cols=25  Identities=32%  Similarity=0.689  Sum_probs=18.1

Q ss_pred             eccccccccCcccccccccccccccccCCCCc
Q psy4937          14 TCIKCEQEFGYENKNIVEFTEETYIYCPHCNK   45 (356)
Q Consensus        14 ~c~~ce~~~~~~~~~~~~~~~~~~~~~~~~~~   45 (356)
                      -|..|-+.+.       .-++.+.|-||+|+-
T Consensus         8 ~C~~Cg~~~~-------~~~~~~~irCp~Cg~   32 (49)
T COG1996           8 KCARCGREVE-------LDQETRGIRCPYCGS   32 (49)
T ss_pred             EhhhcCCeee-------hhhccCceeCCCCCc
Confidence            5888886651       135678999999985


No 90 
>PRK11827 hypothetical protein; Provisional
Probab=21.13  E-value=74  Score=24.24  Aligned_cols=15  Identities=20%  Similarity=0.160  Sum_probs=11.8

Q ss_pred             ceeeccccccccccC
Q psy4937          85 TYVYCPHCNKAWQGS   99 (356)
Q Consensus        85 ~~~~c~~c~~~~~g~   99 (356)
                      ....|..|+.+||=+
T Consensus        25 ~~Lic~~~~laYPI~   39 (60)
T PRK11827         25 QELICKLDNLAFPLR   39 (60)
T ss_pred             CeEECCccCeecccc
Confidence            456799999999844


No 91 
>PRK00564 hypA hydrogenase nickel incorporation protein; Provisional
Probab=20.54  E-value=71  Score=27.11  Aligned_cols=29  Identities=24%  Similarity=0.508  Sum_probs=18.2

Q ss_pred             cceeeccccccccCcccccccccccccccccCCCCcc
Q psy4937          10 KGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKA   46 (356)
Q Consensus        10 kg~r~c~~ce~~~~~~~~~~~~~~~~~~~~~~~~~~~   46 (356)
                      ++.-.|..|...+.....        .|..||.|+-+
T Consensus        69 p~~~~C~~Cg~~~~~~~~--------~~~~CP~Cgs~   97 (117)
T PRK00564         69 KVELECKDCSHVFKPNAL--------DYGVCEKCHSK   97 (117)
T ss_pred             CCEEEhhhCCCccccCCc--------cCCcCcCCCCC
Confidence            356678889855433322        34569999864


No 92 
>PF01096 TFIIS_C:  Transcription factor S-II (TFIIS);  InterPro: IPR001222 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  This entry represents a zinc finger motif found in transcription factor IIs (TFIIS). In eukaryotes the initiation of transcription of protein encoding genes by polymerase II (Pol II) is modulated by general and specific transcription factors. The general transcription factors operate through common promoters elements (such as the TATA box). At least eight different proteins associate to form the general transcription factors: TFIIA, -IIB, -IID, -IIE, -IIF, -IIG, -IIH and -IIS []. During mRNA elongation, Pol II can encounter DNA sequences that cause reverse movement of the enzyme. Such backtracking involves extrusion of the RNA 3'-end into the pore, and can lead to transcriptional arrest. Escape from arrest requires cleavage of the extruded RNA with the help of TFIIS, which induces mRNA cleavage by enhancing the intrinsic nuclease activity of RNA polymerase (Pol) II, past template-encoded pause sites []. TFIIS extends from the polymerase surface via a pore to the internal active site. Two essential and invariant acidic residues in a TFIIS loop complement the Pol II active site and could position a metal ion and a water molecule for hydrolytic RNA cleavage. TFIIS also induces extensive structural changes in Pol II that would realign nucleic acids in the active centre.  TFIIS is a protein of about 300 amino acids. It contains three regions: a variable N-terminal domain not required for TFIIS activity; a conserved central domain required for Pol II binding; and a conserved C-terminal C4-type zinc finger essential for RNA cleavage. The zinc finger folds in a conformation termed a zinc ribbon [] characterised by a three-stranded antiparallel beta-sheet and two beta-hairpins. A backbone model for Pol II-TFIIS complex was obtained from X-ray analysis. It shows that a beta hairpin protrudes from the zinc finger and complements the pol II active site [].  Some viral proteins also contain the TFIIS zinc ribbon C-terminal domain. The Vaccinia virus protein, unlike its eukaryotic homologue, is an integral RNA polymerase subunit rather than a readily separable transcription factor []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003676 nucleic acid binding, 0008270 zinc ion binding, 0006351 transcription, DNA-dependent; PDB: 3M4O_I 3S14_I 2E2J_I 4A3J_I 3HOZ_I 1TWA_I 3S1Q_I 3S1N_I 1TWG_I 3I4M_I ....
Probab=20.35  E-value=86  Score=21.43  Aligned_cols=15  Identities=20%  Similarity=0.797  Sum_probs=11.7

Q ss_pred             cccceeecccccccc
Q psy4937          82 TKETYVYCPHCNKAW   96 (356)
Q Consensus        82 ~~~~~~~c~~c~~~~   96 (356)
                      ....|..|..|++.|
T Consensus        24 ~~T~fy~C~~C~~~w   38 (39)
T PF01096_consen   24 PMTLFYVCCNCGHRW   38 (39)
T ss_dssp             SSEEEEEESSSTEEE
T ss_pred             CCeEEEEeCCCCCee
Confidence            456788899999776


No 93 
>TIGR00100 hypA hydrogenase nickel insertion protein HypA. In Hpylori, hypA mutant abolished hydrogenase activity and decrease in urease activity. Nickel supplementation in media restored urease activity and partial hydrogenase activity. HypA probably involved in inserting Ni in enzymes.
Probab=20.32  E-value=70  Score=27.02  Aligned_cols=27  Identities=26%  Similarity=0.562  Sum_probs=17.9

Q ss_pred             ceeeccccccccCcccccccccccccccccCCCCcc
Q psy4937          11 GVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKA   46 (356)
Q Consensus        11 g~r~c~~ce~~~~~~~~~~~~~~~~~~~~~~~~~~~   46 (356)
                      +.-.|..|...+.....         +..||+|+-+
T Consensus        69 ~~~~C~~Cg~~~~~~~~---------~~~CP~Cgs~   95 (115)
T TIGR00100        69 VECECEDCSEEVSPEID---------LYRCPKCHGI   95 (115)
T ss_pred             cEEEcccCCCEEecCCc---------CccCcCCcCC
Confidence            45678888865544332         4579999863


No 94 
>COG3751 EGL-9 Predicted proline hydroxylase [Posttranslational modification, protein turnover, chaperones]
Probab=20.21  E-value=4.3e+02  Score=25.72  Aligned_cols=39  Identities=15%  Similarity=0.107  Sum_probs=30.1

Q ss_pred             eEEEEecCCCeEEEecchhhcccccccccCCCCCCeEEEE
Q psy4937         292 VVIRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIA  331 (356)
Q Consensus       292 ~~v~I~Lp~rSLLVMsG~ARy~W~H~I~~~dvr~rRVSLT  331 (356)
                      ....|.=+-+||.+|.-.-.+.| |.|..-.....|+|||
T Consensus       197 ~~~ti~P~fn~lv~F~s~~~Hs~-h~V~~~~~~~~RlsV~  235 (252)
T COG3751         197 SFKTIAPVFNSLVFFKSRPSHSV-HSVEEPYAAADRLSVT  235 (252)
T ss_pred             cccccCCCCceEEEEEecCCccc-eeccccccccceEEEe
Confidence            34556667788888887776666 7777767899999999


No 95 
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=20.05  E-value=35  Score=30.44  Aligned_cols=11  Identities=18%  Similarity=0.655  Sum_probs=7.3

Q ss_pred             ccccccCCCCc
Q psy4937          35 ETYIYCPHCNK   45 (356)
Q Consensus        35 ~~~~~~~~~~~   45 (356)
                      ..|-.||+|++
T Consensus       107 ~~~Y~Cp~c~~  117 (158)
T TIGR00373       107 NMFFICPNMCV  117 (158)
T ss_pred             CCeEECCCCCc
Confidence            44566777776


Done!