Query psy4937
Match_columns 356
No_of_seqs 189 out of 1041
Neff 5.7
Searched_HMMs 46136
Date Fri Aug 16 21:39:51 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy4937.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/4937hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG3959|consensus 100.0 9.5E-79 2.1E-83 560.3 9.2 293 1-356 1-302 (306)
2 PF13532 2OG-FeII_Oxy_2: 2OG-F 100.0 3.7E-32 8E-37 244.8 13.8 168 119-333 1-194 (194)
3 KOG3200|consensus 100.0 1.7E-29 3.7E-34 225.1 13.5 178 118-337 12-216 (224)
4 PRK15401 alpha-ketoglutarate-d 99.9 4.1E-26 9E-31 211.7 17.4 172 115-335 15-213 (213)
5 TIGR00568 alkb DNA alkylation 99.9 2.6E-22 5.6E-27 180.6 14.4 147 124-318 2-169 (169)
6 KOG4176|consensus 99.8 3.2E-20 6.9E-25 181.5 13.1 181 115-340 124-309 (323)
7 COG3145 AlkB Alkylated DNA rep 99.8 5.5E-19 1.2E-23 161.7 12.9 167 115-330 12-194 (194)
8 smart00702 P4Hc Prolyl 4-hydro 98.1 9.5E-05 2.1E-09 66.0 14.9 168 118-332 1-175 (178)
9 PF03171 2OG-FeII_Oxy: 2OG-Fe( 98.0 7.3E-06 1.6E-10 66.1 5.1 83 199-333 2-95 (98)
10 PRK05467 Fe(II)-dependent oxyg 97.9 0.00017 3.7E-09 68.1 12.5 165 120-332 2-174 (226)
11 KOG2731|consensus 96.4 0.0046 1E-07 61.7 5.0 116 157-319 171-292 (378)
12 PLN00052 prolyl 4-hydroxylase; 95.4 0.14 3.1E-06 50.7 10.6 94 117-219 53-151 (310)
13 COG3128 PiuC Uncharacterized i 95.0 0.25 5.5E-06 45.8 10.1 169 119-333 3-178 (229)
14 PF13640 2OG-FeII_Oxy_3: 2OG-F 94.3 0.03 6.4E-07 45.0 2.4 32 298-332 66-97 (100)
15 KOG1591|consensus 90.8 1.2 2.7E-05 43.8 8.6 93 117-219 96-195 (289)
16 TIGR01762 chlorin-enz chlorina 88.2 13 0.00028 36.3 13.6 43 292-338 207-250 (288)
17 PF05721 PhyH: Phytanoyl-CoA d 85.0 1.7 3.7E-05 37.9 5.1 25 118-142 4-28 (211)
18 PF13719 zinc_ribbon_5: zinc-r 84.6 0.62 1.3E-05 31.8 1.6 35 37-97 2-36 (37)
19 PF10083 DUF2321: Uncharacteri 80.9 0.49 1.1E-05 42.4 -0.0 50 34-104 36-85 (158)
20 PF13248 zf-ribbon_3: zinc-rib 79.9 0.86 1.9E-05 28.6 0.9 12 83-94 13-24 (26)
21 TIGR02098 MJ0042_CXXC MJ0042 f 76.6 1.7 3.6E-05 29.3 1.6 13 85-97 24-36 (38)
22 PF13717 zinc_ribbon_4: zinc-r 75.4 2 4.4E-05 29.1 1.7 15 82-96 21-35 (36)
23 PF12933 FTO_NTD: FTO catalyti 75.2 9.3 0.0002 36.8 6.7 42 291-334 209-250 (253)
24 PF09859 Oxygenase-NA: Oxygena 70.8 25 0.00055 32.1 8.0 114 173-331 41-167 (173)
25 PF13240 zinc_ribbon_2: zinc-r 70.7 2.1 4.6E-05 26.3 0.9 10 85-94 12-21 (23)
26 PF12851 Tet_JBP: Oxygenase do 68.3 6.7 0.00014 35.5 3.9 38 292-332 126-167 (171)
27 PF14205 Cys_rich_KTR: Cystein 67.1 3.3 7.1E-05 31.0 1.4 10 36-45 3-12 (55)
28 PF13759 2OG-FeII_Oxy_5: Putat 66.6 8.6 0.00019 31.2 3.9 38 291-332 63-100 (101)
29 PF10571 UPF0547: Uncharacteri 55.5 7.2 0.00016 24.7 1.2 15 83-97 11-25 (26)
30 PF12773 DZR: Double zinc ribb 54.9 8.3 0.00018 27.3 1.7 11 85-95 28-38 (50)
31 PLN03001 oxidoreductase, 2OG-F 54.9 46 0.00099 32.1 7.3 42 291-332 161-209 (262)
32 COG3809 Uncharacterized protei 54.0 6.8 0.00015 31.5 1.2 13 85-97 20-32 (88)
33 PRK14892 putative transcriptio 51.3 4 8.7E-05 34.1 -0.6 39 84-135 40-78 (99)
34 PLN02984 oxidoreductase, 2OG-F 51.1 60 0.0013 32.5 7.6 42 291-332 245-294 (341)
35 COG4306 Uncharacterized protei 48.7 8.5 0.00018 33.7 1.0 77 34-141 36-112 (160)
36 PF01927 Mut7-C: Mut7-C RNAse 45.1 15 0.00032 32.2 2.0 15 82-96 120-134 (147)
37 PLN02904 oxidoreductase 45.0 76 0.0017 31.9 7.3 42 291-332 253-301 (357)
38 PRK14559 putative protein seri 43.8 11 0.00023 41.2 1.1 55 83-137 38-104 (645)
39 PLN02299 1-aminocyclopropane-1 43.6 55 0.0012 32.4 6.0 42 291-332 204-252 (321)
40 PF08209 Sgf11: Sgf11 (transcr 42.5 9.5 0.00021 25.6 0.3 15 36-50 3-17 (33)
41 PF01927 Mut7-C: Mut7-C RNAse 42.2 13 0.00028 32.6 1.2 39 13-51 92-139 (147)
42 PLN02485 oxidoreductase 41.9 74 0.0016 31.4 6.6 42 291-332 235-283 (329)
43 PLN00417 oxidoreductase, 2OG-F 41.0 81 0.0017 31.6 6.8 42 291-332 249-297 (348)
44 COG4888 Uncharacterized Zn rib 41.0 7.2 0.00016 32.7 -0.6 62 34-133 19-80 (104)
45 PF08792 A2L_zn_ribbon: A2L zi 37.4 22 0.00048 23.7 1.5 16 82-97 17-32 (33)
46 PLN02912 oxidoreductase, 2OG-F 37.3 1.3E+02 0.0029 30.0 7.7 42 291-332 242-290 (348)
47 PF14353 CpXC: CpXC protein 36.8 14 0.00031 31.3 0.6 32 14-45 3-46 (128)
48 PLN02750 oxidoreductase, 2OG-F 36.0 1.6E+02 0.0034 29.4 7.9 42 291-332 240-288 (345)
49 COG1326 Uncharacterized archae 35.4 22 0.00048 33.2 1.7 16 82-97 26-41 (201)
50 PTZ00273 oxidase reductase; Pr 34.7 1.4E+02 0.003 29.3 7.3 42 291-332 224-271 (320)
51 cd04476 RPA1_DBD_C RPA1_DBD_C: 33.5 24 0.00051 31.2 1.5 11 35-45 32-42 (166)
52 PLN02947 oxidoreductase 33.2 1.5E+02 0.0033 30.0 7.4 42 291-332 270-318 (374)
53 PLN02393 leucoanthocyanidin di 32.7 1.4E+02 0.003 30.1 6.9 42 291-332 259-307 (362)
54 PRK04023 DNA polymerase II lar 31.9 27 0.00059 40.0 1.9 22 11-45 625-646 (1121)
55 PF09538 FYDLN_acid: Protein o 31.1 29 0.00064 29.3 1.6 13 85-97 25-37 (108)
56 PF10083 DUF2321: Uncharacteri 30.3 17 0.00036 32.9 -0.1 47 6-52 33-85 (158)
57 TIGR02466 conserved hypothetic 30.2 72 0.0016 29.7 4.2 36 293-332 161-196 (201)
58 PLN02515 naringenin,2-oxogluta 30.1 2.2E+02 0.0048 28.7 7.9 41 292-332 243-290 (358)
59 COG2850 Uncharacterized conser 29.6 80 0.0017 32.4 4.6 55 179-243 103-157 (383)
60 PRK12380 hydrogenase nickel in 29.6 38 0.00083 28.6 2.0 27 11-46 69-95 (113)
61 COG1656 Uncharacterized conser 29.4 25 0.00055 31.9 1.0 38 14-51 99-145 (165)
62 PF01155 HypA: Hydrogenase exp 29.2 23 0.00051 29.8 0.7 27 11-46 69-95 (113)
63 PLN02997 flavonol synthase 29.2 2E+02 0.0043 28.5 7.3 42 291-332 228-276 (325)
64 PLN02156 gibberellin 2-beta-di 28.3 1.8E+02 0.0039 29.1 6.9 42 291-332 226-274 (335)
65 PLN02758 oxidoreductase, 2OG-F 28.2 1.9E+02 0.0042 29.0 7.2 42 291-332 258-306 (361)
66 PLN02216 protein SRG1 27.5 2.6E+02 0.0057 28.1 7.9 42 291-332 256-304 (357)
67 COG0266 Nei Formamidopyrimidin 27.4 35 0.00076 33.5 1.6 13 82-94 261-273 (273)
68 PRK14714 DNA polymerase II lar 26.6 51 0.0011 38.7 2.9 10 86-95 709-718 (1337)
69 PLN02254 gibberellin 3-beta-di 26.4 2.2E+02 0.0047 28.8 7.1 42 291-332 256-304 (358)
70 PF05118 Asp_Arg_Hydrox: Aspar 26.4 1.9E+02 0.0041 25.6 6.1 38 295-337 125-162 (163)
71 PRK14811 formamidopyrimidine-D 26.1 42 0.00091 32.5 1.9 16 82-97 251-266 (269)
72 smart00661 RPOL9 RNA polymeras 25.9 52 0.0011 23.1 1.9 12 86-97 20-31 (52)
73 PF12171 zf-C2H2_jaz: Zinc-fin 25.6 30 0.00066 21.3 0.5 11 87-97 2-12 (27)
74 PLN02704 flavonol synthase 25.3 2E+02 0.0044 28.5 6.6 42 291-332 244-292 (335)
75 PF07754 DUF1610: Domain of un 25.1 60 0.0013 20.4 1.8 12 82-93 12-23 (24)
76 PF09788 Tmemb_55A: Transmembr 24.7 37 0.00081 32.9 1.2 28 14-46 159-186 (256)
77 KOG3084|consensus 24.5 54 0.0012 33.0 2.3 46 86-131 173-226 (345)
78 PRK00398 rpoP DNA-directed RNA 24.2 52 0.0011 23.0 1.6 10 86-95 21-30 (46)
79 smart00659 RPOLCX RNA polymera 24.2 42 0.0009 23.8 1.1 24 14-45 4-27 (44)
80 PRK12775 putative trifunctiona 23.5 44 0.00095 38.4 1.7 51 36-93 795-845 (1006)
81 PLN02403 aminocyclopropanecarb 22.9 2.6E+02 0.0057 27.5 6.8 42 291-332 199-248 (303)
82 PRK00762 hypA hydrogenase nick 22.6 64 0.0014 27.6 2.2 31 11-45 69-100 (124)
83 PRK03824 hypA hydrogenase nick 22.1 60 0.0013 28.3 1.9 35 11-45 69-115 (135)
84 PRK01103 formamidopyrimidine/5 21.9 55 0.0012 31.6 1.8 13 82-94 261-273 (274)
85 smart00531 TFIIE Transcription 21.5 37 0.00081 29.7 0.5 12 35-46 97-108 (147)
86 PF11672 DUF3268: Protein of u 21.3 1.2E+02 0.0025 25.6 3.4 17 38-54 3-22 (102)
87 PRK03681 hypA hydrogenase nick 21.3 67 0.0014 27.1 2.0 28 11-46 69-96 (114)
88 PF06677 Auto_anti-p27: Sjogre 21.3 57 0.0012 22.9 1.3 9 85-93 33-41 (41)
89 COG1996 RPC10 DNA-directed RNA 21.3 45 0.00098 24.4 0.8 25 14-45 8-32 (49)
90 PRK11827 hypothetical protein; 21.1 74 0.0016 24.2 2.0 15 85-99 25-39 (60)
91 PRK00564 hypA hydrogenase nick 20.5 71 0.0015 27.1 2.0 29 10-46 69-97 (117)
92 PF01096 TFIIS_C: Transcriptio 20.3 86 0.0019 21.4 2.0 15 82-96 24-38 (39)
93 TIGR00100 hypA hydrogenase nic 20.3 70 0.0015 27.0 1.9 27 11-46 69-95 (115)
94 COG3751 EGL-9 Predicted prolin 20.2 4.3E+02 0.0093 25.7 7.4 39 292-331 197-235 (252)
95 TIGR00373 conserved hypothetic 20.1 35 0.00076 30.4 0.0 11 35-45 107-117 (158)
No 1
>KOG3959|consensus
Probab=100.00 E-value=9.5e-79 Score=560.31 Aligned_cols=293 Identities=39% Similarity=0.655 Sum_probs=264.6
Q ss_pred CC-CCCcccccceeeccccccccCcccccccccccccccccCCCCcccccCccccccCCCCCCCCccccccccccccccc
Q psy4937 1 MD-RSNFCGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDFLT 79 (356)
Q Consensus 1 ~~-~~~~c~ckg~r~c~~ce~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (356)
|+ +.|+|||||+|||+.||+..+..+ ...
T Consensus 1 m~~~~r~CGCKG~RtCl~CE~~~r~s~--------------------------------------------------~~~ 30 (306)
T KOG3959|consen 1 MGSAERACGCKGARTCLLCETTERVSK--------------------------------------------------LRV 30 (306)
T ss_pred CCccccccCCccceeeehhhhhhhccc--------------------------------------------------cch
Confidence 55 789999999999999998433332 233
Q ss_pred cccccceee-ccccccccccCccccccC-CCCCC-CCCCCCCCeEEeCCCCCHHHHHHHHHHhhcCCCCccccCcceecc
Q psy4937 80 SQTKETYVY-CPHCNKAWQGSDLYAYQS-HPNHS-GKSFDIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNF 156 (356)
Q Consensus 80 ~~~~~~~~~-c~~c~~~~~g~~~~~~~~-~~~h~-~~~~~ipGl~~i~dFIS~~Ee~~Ll~~id~~pW~~sqsgRR~q~y 156 (356)
++.++.++| |++|.....|||+...++ |.+|+ +.++++|||.+++||||++||++|++.||..||.+||||||||+|
T Consensus 31 ~e~~~a~~y~fIyc~i~~~gwd~~~e~~d~e~~~~d~~~p~pG~~lie~Fls~~Eea~l~~~~D~~pW~~SQSGRRKQdy 110 (306)
T KOG3959|consen 31 VEDKHANYYVFIYCHIQSSGWDIIDESTDCESVSTDGSIPIPGLTLIENFLSESEEAKLLNMIDTVPWAQSQSGRRKQDY 110 (306)
T ss_pred hhccccceEEEEEechhhcccccccccccccccccCCccccCCeeehhhhhccchHhHHHHHhccCchhhhccccccccc
Confidence 456778888 999998888999987665 67766 578999999999999999999999999999999999999999999
Q ss_pred CCccccCCCcccCCCCCCCcHHHHHHHHHHhhcccCCCCCCcceeeeEecCCCCCCccCCCCCcccccCceEEEecCCce
Q psy4937 157 GPKCNFKKQKIKLGDFQGFPACTKFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDS 236 (356)
Q Consensus 157 G~~~nyk~~kl~~g~f~~lP~~l~~l~~Ri~~~~~l~~f~~~~q~~leY~pg~G~~I~pHvDd~~~~Ge~IaslSLgs~~ 236 (356)
|||+|||++|++.+.|.|||.+...|++||...+++.+|+|++||+|||.|.+|++|+||+||.|+||+|+++++++++.
T Consensus 111 GPKvNFkk~Klkt~~F~G~P~~~~~v~rrm~~yp~l~gfqp~EqCnLeYep~kgsaIdpH~DD~WiWGeRlv~~n~l~d~ 190 (306)
T KOG3959|consen 111 GPKVNFKKKKLKTDTFVGMPEYADMVLRRMSEYPVLKGFQPFEQCNLEYEPVKGSAIDPHQDDMWIWGERLVRSNRLFDF 190 (306)
T ss_pred CCccchhhhhhccCcccCCchHHHHHHHHhhccchhhccCcHHHcCcccccccCCccCccccchhhhhhheeehhhccHH
Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred EEEEeecCCC-CCCcCCCCCccccCCCCCCcccccccccccccccccccccccCCceEEEEecCCCeEEEecchhhcccc
Q psy4937 237 VLTLTYYNGD-SNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPARSLLVLWGEARYEWQ 315 (356)
Q Consensus 237 vm~f~~~~~~-~~~~~l~~~~~~~~~~e~d~k~~~~~~~~~~~~~~~~~~~~~~~~~~v~I~Lp~rSLLVMsG~ARy~W~ 315 (356)
|+++.+++-. +++|||+++.++.++...+ .+.+.+++..++ +..+.|+||+||||||+|+|||+|+
T Consensus 191 vl~lc~~e~~~sg~~nL~~~~s~~~e~l~~------------~li~~s~~~l~~-~~~~~ipmP~rSLlvl~g~aRyqwk 257 (306)
T KOG3959|consen 191 VLKLCSKECLASGIINLNTNFSESNEFLSI------------NLINGSVMTLNK-SFLCYIPMPHRSLLVLAGEARYQWK 257 (306)
T ss_pred HHHhhhhhhhccceeeeccCccccccccch------------hhcccchhhhcc-ceEEEeecCcceeEEeechhHhhHH
Confidence 9999976543 7899999999988765432 135556666666 8899999999999999999999999
Q ss_pred cccccCCCCCCeEEEEecccCCCCCCh----hhhHHHHHHhhhcC
Q psy4937 316 HCVLRSDVIGRRVCIAYREFTAPYMNN----EATASVFEKAKCFF 356 (356)
Q Consensus 316 H~I~~~dvr~rRVSLTfR~l~~~~~~g----~~~~~~~~~a~~f~ 356 (356)
|+|.+++|++||||+|||+++++|.+| ++|++|++++.+||
T Consensus 258 H~vlr~hi~~RRvcvt~RE~~~~f~~Gg~~qe~G~el~r~~~if~ 302 (306)
T KOG3959|consen 258 HGVLRHHIRGRRVCVTMREAAKDFAEGGELQEKGAELIRLGNIFV 302 (306)
T ss_pred HHHHHHhhhhceeeeeHHhhhHhhccchhhhhcchHHHHhhhhhc
Confidence 999999999999999999999999998 78999999999998
No 2
>PF13532 2OG-FeII_Oxy_2: 2OG-Fe(II) oxygenase superfamily; PDB: 2IUW_A 3BTZ_A 3RZL_A 3RZH_A 3S5A_A 3RZG_A 3RZJ_A 3BUC_A 3H8X_A 3H8R_A ....
Probab=99.98 E-value=3.7e-32 Score=244.84 Aligned_cols=168 Identities=27% Similarity=0.510 Sum_probs=118.2
Q ss_pred CeEEeCCCCCHHHHHHHHHHhh-cCCCCcccc--C------cc----eeccC--CccccCCC-cccCCCCCCCcHHHHHH
Q psy4937 119 GVYIKEDFLSEAETEDVMKQLD-DLPWDKSQS--G------RR----KQNFG--PKCNFKKQ-KIKLGDFQGFPACTKFI 182 (356)
Q Consensus 119 Gl~~i~dFIS~~Ee~~Ll~~id-~~pW~~sqs--g------RR----~q~yG--~~~nyk~~-kl~~g~f~~lP~~l~~l 182 (356)
|++|++||||++|+++|++.+. ..+|..... + |+ .+|+| ..|+|+.. ......+.+||+++..+
T Consensus 1 G~~~~~~fls~~e~~~l~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~y~~~~~~~~~~~~~~p~~l~~~ 80 (194)
T PF13532_consen 1 GLYYIPNFLSEEEAAELLNELRESAPFRQPTYPMGKVYSLPRKLCGGLSWVGDGPSYRYSGKRPVRSKPWPPFPEWLSRL 80 (194)
T ss_dssp -EEEETTSS-HHHHHHHHHHHHHHS--B-GCCCCCCECCECCE-SSEEEEEECT--CCCTCC-EECCCEBSCCHHHHHHH
T ss_pred CEEEECCCCCHHHHHHHHHHHHhhCCCcCCeEcCCCEEccceecceeeEEECCCCCeEcCCccccCCCCCCCccHHHHHH
Confidence 8999999999999999999986 467753311 1 11 23443 55666654 44556678999999999
Q ss_pred HHHHhhcccCCCCCCcceeee-EecCCCCCCccCCCCCccc-ccCceEEEecCCceEEEEeecCCCCCCcCCCCCccccC
Q psy4937 183 QDRFHQEDVLSHFQTIEQCTL-EYNSENGSSIDNHIDDCWI-WGERIVTVNLIGDSVLTLTYYNGDSNKYNLNSPCFSYQ 260 (356)
Q Consensus 183 ~~Ri~~~~~l~~f~~~~q~~l-eY~pg~G~~I~pHvDd~~~-~Ge~IaslSLgs~~vm~f~~~~~~~~~~~l~~~~~~~~ 260 (356)
++++.+...+.....+|+|++ +|.+|. +|+||+|+.+. +|+.||+||||++++|.|+....
T Consensus 81 ~~~~~~~~~~~~~~~~n~~liN~Y~~g~--~i~~H~D~~~~~~~~~I~slSLG~~~~~~f~~~~~--------------- 143 (194)
T PF13532_consen 81 LERLVEATGIPPGWRPNQCLINYYRDGS--GIGPHSDDEEYGFGPPIASLSLGSSRVFRFRNKSD--------------- 143 (194)
T ss_dssp HHHHHHHHT-SHSS--SEEEEEEESSTT---EEEE---TTC-CCSEEEEEEEES-EEEEEEECGG---------------
T ss_pred HHHHHHHhccccCCCCCEEEEEecCCCC--CcCCCCCcccccCCCcEEEEEEccCceEEEeeccC---------------
Confidence 999987543122246788887 999955 59999999877 89999999999999999987421
Q ss_pred CCCCCcccccccccccccccccccccccCCceEEEEecCCCeEEEecchhhcccccccccCCC--------CCCeEEEEe
Q psy4937 261 KKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPARSLLVLWGEARYEWQHCVLRSDV--------IGRRVCIAY 332 (356)
Q Consensus 261 ~~e~d~k~~~~~~~~~~~~~~~~~~~~~~~~~~v~I~Lp~rSLLVMsG~ARy~W~H~I~~~dv--------r~rRVSLTf 332 (356)
. ...+.+.||+||||||+|++|+.| |+|++.+. +++||||||
T Consensus 144 ----------------------------~-~~~~~~~L~~gsl~vm~g~~r~~~-H~I~~~~~~~~~~~~~~~~RislTf 193 (194)
T PF13532_consen 144 ----------------------------D-DEPIEVPLPPGSLLVMSGEARYDW-HGIPPVKKDTHPSHYVRGRRISLTF 193 (194)
T ss_dssp ----------------------------T-S-EEEEEE-TTEEEEEETTHHHHE-EEE-S-SCEEEESTEE-S-EEEEEE
T ss_pred ----------------------------C-CccEEEEcCCCCEEEeChHHhhhe-eEcccccCCccccccCCCCEEEEEe
Confidence 2 568999999999999999999999 99999843 779999999
Q ss_pred c
Q psy4937 333 R 333 (356)
Q Consensus 333 R 333 (356)
|
T Consensus 194 R 194 (194)
T PF13532_consen 194 R 194 (194)
T ss_dssp E
T ss_pred C
Confidence 9
No 3
>KOG3200|consensus
Probab=99.96 E-value=1.7e-29 Score=225.13 Aligned_cols=178 Identities=20% Similarity=0.297 Sum_probs=141.9
Q ss_pred CCeEEeCCCCCHHHHHHHHHHhhcC---CCCccccCcceeccCCccccCCCcccCCCCCCCcHHHHHHHHHHhhcccCCC
Q psy4937 118 GGVYIKEDFLSEAETEDVMKQLDDL---PWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDRFHQEDVLSH 194 (356)
Q Consensus 118 pGl~~i~dFIS~~Ee~~Ll~~id~~---pW~~sqsgRR~q~yG~~~nyk~~kl~~g~f~~lP~~l~~l~~Ri~~~~~l~~ 194 (356)
|-+.|||+||+++||..+++.|... .|. ....||.|+||..+.- ..+-+ +.+|+||+.+.+++.+.+.+++
T Consensus 12 pt~~YIPnfIt~EEe~~~lshIe~ap~pkW~-~L~NRRLqNyGGvvh~--~glip---eelP~wLq~~v~kinnlglF~s 85 (224)
T KOG3200|consen 12 PTMIYIPNFITEEEENLYLSHIENAPQPKWR-VLANRRLQNYGGVVHK--TGLIP---EELPPWLQYYVDKINNLGLFKS 85 (224)
T ss_pred ceEEEcCCccChHHHHHHHHHHhcCCCchhH-HHHhhhhhhcCCcccc--CCcCc---cccCHHHHHHHHHhhcccccCC
Confidence 4679999999999999999999874 588 4568999999987762 22221 6899999999999999888765
Q ss_pred CCCcceeee-EecCCCCCCccCCCCCcccccCceEEEecCCceEEEEeecCCCCCCcCCCCCccccCCCCCCcccccccc
Q psy4937 195 FQTIEQCTL-EYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVND 273 (356)
Q Consensus 195 f~~~~q~~l-eY~pg~G~~I~pHvDd~~~~Ge~IaslSLgs~~vm~f~~~~~~~~~~~l~~~~~~~~~~e~d~k~~~~~~ 273 (356)
++|++++ ||.||+| |+||.|+ ++|-+.|++|||||.+||.|...... +....++ + +
T Consensus 86 --~~NHVLVNeY~pgqG--ImPHtDG-Paf~piVstiSlGsh~vldf~~p~r~------e~~d~te------~-----~- 142 (224)
T KOG3200|consen 86 --PANHVLVNEYLPGQG--IMPHTDG-PAFHPIVSTISLGSHTVLDFYDPVRQ------EVNDGTE------S-----K- 142 (224)
T ss_pred --CcceeEeecccCCCC--cCcCCCC-CcccceEEEEecCCceEEeccccccc------ccCCccc------c-----C-
Confidence 6787776 9999998 9999999 67789999999999999999874321 0000000 0 0
Q ss_pred cccccccccccccccCCceEEEEecCCCeEEEecchhhcccccccccC--C---------------------CCCCeEEE
Q psy4937 274 LSLNEKSTDSIVNETLSSVVIRIPMPARSLLVLWGEARYEWQHCVLRS--D---------------------VIGRRVCI 330 (356)
Q Consensus 274 ~~~~~~~~~~~~~~~~~~~~v~I~Lp~rSLLVMsG~ARy~W~H~I~~~--d---------------------vr~rRVSL 330 (356)
.... .....+.|.+|||||+.|+|+.++.|||... | +++.||||
T Consensus 143 ------------dqp~-R~~fsllleprslLilkd~aYtd~LHgIs~s~~d~l~~~~sna~ac~s~k~Gd~lvr~tRvSL 209 (224)
T KOG3200|consen 143 ------------DQPL-RYLFSLLLEPRSLLILKDDAYTDFLHGISDSPTDCLNQVVSNALACSSRKDGDKLVRQTRVSL 209 (224)
T ss_pred ------------CCCc-cceeeeeeccceEEEEcCcHHHHHHhhcccChHHHHHHHhhhhhhccccCCcceeeecceeEE
Confidence 0012 4578899999999999999999999999864 2 49999999
Q ss_pred EecccCC
Q psy4937 331 AYREFTA 337 (356)
Q Consensus 331 TfR~l~~ 337 (356)
|+|.+..
T Consensus 210 TiR~VPk 216 (224)
T KOG3200|consen 210 TIRLVPK 216 (224)
T ss_pred EEecchH
Confidence 9999865
No 4
>PRK15401 alpha-ketoglutarate-dependent dioxygenase AlkB; Provisional
Probab=99.94 E-value=4.1e-26 Score=211.65 Aligned_cols=172 Identities=17% Similarity=0.232 Sum_probs=131.7
Q ss_pred CCCCCeEEeCCCCCHHHHHHHHHHhhc----CCCCcccc--C-----cce-----eccCC--ccccCCCc-ccCCCCCCC
Q psy4937 115 FDIGGVYIKEDFLSEAETEDVMKQLDD----LPWDKSQS--G-----RRK-----QNFGP--KCNFKKQK-IKLGDFQGF 175 (356)
Q Consensus 115 ~~ipGl~~i~dFIS~~Ee~~Ll~~id~----~pW~~sqs--g-----RR~-----q~yG~--~~nyk~~k-l~~g~f~~l 175 (356)
...||..++++|. .++++.|++.+.+ .||..-.. | |.+ .|||- .|.|.... ....+|.+|
T Consensus 15 ~~~~g~~~~~~~~-~~~~~~l~~~~~~~~~~~p~~~~~~~gg~~msv~mt~~G~~~W~~d~~~YrYs~~~~~~~~pwp~~ 93 (213)
T PRK15401 15 PLAPGAVLLRGFA-LAAAEALLAAIEAVAAQAPFRHMVTPGGYTMSVAMTNCGALGWVTDRRGYRYSPIDPLTGKPWPAM 93 (213)
T ss_pred ecCCCcEEeCCCC-HHHHHHHHHHHHHHHhcCCccceecCCCCcceeEEeccccceEecCCCCcccCCcCCCCCCCCCCc
Confidence 4578999999995 8899999988754 79975322 2 222 45542 34454332 333467788
Q ss_pred cHHHHHHHHHHhhcccCCCCCCcceeee-EecCCCCCCccCCCC-CcccccCceEEEecCCceEEEEeecCCCCCCcCCC
Q psy4937 176 PACTKFIQDRFHQEDVLSHFQTIEQCTL-EYNSENGSSIDNHID-DCWIWGERIVTVNLIGDSVLTLTYYNGDSNKYNLN 253 (356)
Q Consensus 176 P~~l~~l~~Ri~~~~~l~~f~~~~q~~l-eY~pg~G~~I~pHvD-d~~~~Ge~IaslSLgs~~vm~f~~~~~~~~~~~l~ 253 (356)
|+++..|.+++....++..| .+|+|++ +|+||. +|+||.| ++..+++.||+||||++++|.|++...
T Consensus 94 P~~l~~L~~~~~~~~~~~~~-~p~a~LvN~Y~~G~--~mg~H~D~~E~~~~~pI~SvSLG~~~~F~~~~~~~-------- 162 (213)
T PRK15401 94 PASFLALAQRAAAAAGFPGF-QPDACLINRYAPGA--KLSLHQDKDERDFRAPIVSVSLGLPAVFQFGGLKR-------- 162 (213)
T ss_pred hHHHHHHHHHHHHHcCCCCC-CCCEEEEEeccCcC--ccccccCCCcccCCCCEEEEeCCCCeEEEecccCC--------
Confidence 88999999999877655444 5688887 999954 6999999 466788999999999999999987421
Q ss_pred CCccccCCCCCCcccccccccccccccccccccccCCceEEEEecCCCeEEEecchhhcccccccccCC------CCCCe
Q psy4937 254 SPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPARSLLVLWGEARYEWQHCVLRSD------VIGRR 327 (356)
Q Consensus 254 ~~~~~~~~~e~d~k~~~~~~~~~~~~~~~~~~~~~~~~~~v~I~Lp~rSLLVMsG~ARy~W~H~I~~~d------vr~rR 327 (356)
. ....+|.|++||||||.|++|+.| |+|++.. +.+.|
T Consensus 163 -----------------------------------~-~~~~~l~L~~Gdllvm~G~sr~~~-HgVp~~~~~~~p~~g~~R 205 (213)
T PRK15401 163 -----------------------------------S-DPLQRILLEHGDVVVWGGPSRLRY-HGILPLKAGEHPLTGECR 205 (213)
T ss_pred -----------------------------------C-CceEEEEeCCCCEEEECchHhhee-ccCCcCCCCcCCCCCCCe
Confidence 1 446799999999999999999855 9999863 13489
Q ss_pred EEEEeccc
Q psy4937 328 VCIAYREF 335 (356)
Q Consensus 328 VSLTfR~l 335 (356)
||||||.+
T Consensus 206 INLTFR~~ 213 (213)
T PRK15401 206 INLTFRKA 213 (213)
T ss_pred EEEEeEcC
Confidence 99999974
No 5
>TIGR00568 alkb DNA alkylation damage repair protein AlkB. Proteins in this family have an as of yet undetermined function in the repair of alkylation damage to DNA. Alignment and family designation based on phylogenomic analysis of Jonathan A. Eisen (PhD Thesis, Stanford University, 1999).
Probab=99.89 E-value=2.6e-22 Score=180.59 Aligned_cols=147 Identities=14% Similarity=0.252 Sum_probs=113.2
Q ss_pred CCCCCHHHHHHHHHHhhc----CCCCcccc--Cc-----cee-----cc--CCccccCCCcccCC-CCCCCcHHHHHHHH
Q psy4937 124 EDFLSEAETEDVMKQLDD----LPWDKSQS--GR-----RKQ-----NF--GPKCNFKKQKIKLG-DFQGFPACTKFIQD 184 (356)
Q Consensus 124 ~dFIS~~Ee~~Ll~~id~----~pW~~sqs--gR-----R~q-----~y--G~~~nyk~~kl~~g-~f~~lP~~l~~l~~ 184 (356)
.+|+..+++..|++++.. .||.+... |+ |.+ || |..|+|+.+..... ++.+||++|..|.+
T Consensus 2 ~~~~~~~~~~~l~~~~~~~~~~~~w~~~~~~~gk~~~~pr~~~~~l~W~~~g~~Y~ys~~~~~~~~~~p~~P~~L~~L~~ 81 (169)
T TIGR00568 2 KRYFAFNAQEQLIRDINDVASQDPFRQYVTPGGYTMSVAMTNLGKLGWTTHGQGYLYSPKDPQTNKPWPAMPQDLGDLCE 81 (169)
T ss_pred CCccChHHHHHHHHHHHHHhhcCCCcCeEecCCeEeeehhhhcccceEEcCCCcccCCCcccCCCCCCCCCCHHHHHHHH
Confidence 578888899999987644 79987533 42 222 77 57788887766333 34559999999999
Q ss_pred HHhhcccCCCCCCcceeee-EecCCCCCCccCCCCC-cccccCceEEEecCCceEEEEeecCCCCCCcCCCCCccccCCC
Q psy4937 185 RFHQEDVLSHFQTIEQCTL-EYNSENGSSIDNHIDD-CWIWGERIVTVNLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKK 262 (356)
Q Consensus 185 Ri~~~~~l~~f~~~~q~~l-eY~pg~G~~I~pHvDd-~~~~Ge~IaslSLgs~~vm~f~~~~~~~~~~~l~~~~~~~~~~ 262 (356)
+++...++..+ ++|+|++ +|+|| ++|+||+|+ +..+++.||+||||++++|.|++...
T Consensus 82 ~v~~~~g~~~~-~~n~~LvN~Y~~G--d~mg~H~D~~e~~~~~pI~SvSLG~~r~F~~~~~~~----------------- 141 (169)
T TIGR00568 82 RVATAAGFPDF-QPDACLVNRYAPG--ATLSLHQDRDEPDLRAPLLSVSLGLPAIFLIGGLKR----------------- 141 (169)
T ss_pred HHHHHhCCCCC-CCCEEEEEeecCC--CccccccccccccCCCCEEEEeCCCCEEEEecCCcC-----------------
Confidence 99877654443 6788887 99995 569999996 22245899999999999999987531
Q ss_pred CCCcccccccccccccccccccccccCCceEEEEecCCCeEEEecchhhccccccc
Q psy4937 263 YPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPARSLLVLWGEARYEWQHCV 318 (356)
Q Consensus 263 e~d~k~~~~~~~~~~~~~~~~~~~~~~~~~~v~I~Lp~rSLLVMsG~ARy~W~H~I 318 (356)
. ....+|.|++||||||+|++|+.| |||
T Consensus 142 --------------------------~-~~~~~l~L~sGsllvM~G~sR~~~-Hgv 169 (169)
T TIGR00568 142 --------------------------N-DPPKRLRLHSGDVVIMGGESRLAF-HGV 169 (169)
T ss_pred --------------------------C-CceEEEEeCCCCEEEECCchhccc-cCC
Confidence 2 446889999999999999999966 987
No 6
>KOG4176|consensus
Probab=99.83 E-value=3.2e-20 Score=181.45 Aligned_cols=181 Identities=20% Similarity=0.338 Sum_probs=147.9
Q ss_pred CCCCC-eEEeCCCCCHHHHHHHHHHhhcCCCCc--cccCcceeccCCccccCCCcccCCC-CCCCcHHHHHHHHHHhhcc
Q psy4937 115 FDIGG-VYIKEDFLSEAETEDVMKQLDDLPWDK--SQSGRRKQNFGPKCNFKKQKIKLGD-FQGFPACTKFIQDRFHQED 190 (356)
Q Consensus 115 ~~ipG-l~~i~dFIS~~Ee~~Ll~~id~~pW~~--sqsgRR~q~yG~~~nyk~~kl~~g~-f~~lP~~l~~l~~Ri~~~~ 190 (356)
...|| +.+++||+++.+++.|+..+....|.. +..+|++.+||..++|..+.++... ..++|..++.+++|+..+.
T Consensus 124 ~~~~~e~~~~~d~V~el~e~~l~~~~~~e~~~~~~~gk~R~~iq~G~~f~y~~~~~d~~~~~~piPs~~~~ii~rlv~~~ 203 (323)
T KOG4176|consen 124 VFIPGELSLIVDFVTELEEKGLIGALVDETFTYQESGKHREVIQLGYPFDYRTNNVDESKPVDPIPSLFKSIIDRLVSWR 203 (323)
T ss_pred ccChhhceehhhhhhhhHHhhhhcccccccceeeccccceeeeecCceeccCCCcccccCccCCCchHHHHHHHHhhhhc
Confidence 34455 999999999999999999998766664 3458999999999999888877543 5789999999999999998
Q ss_pred cCCCCCCcceeee-EecCCCCCCccCCCCCcccccCceEEEecCCceEEEEeecCCCCCCcCCCCCccccCCCCCCcccc
Q psy4937 191 VLSHFQTIEQCTL-EYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKST 269 (356)
Q Consensus 191 ~l~~f~~~~q~~l-eY~pg~G~~I~pHvDd~~~~Ge~IaslSLgs~~vm~f~~~~~~~~~~~l~~~~~~~~~~e~d~k~~ 269 (356)
.+.. .||||++ .|.||.| |.||+|. ..|++.|++|||+|+++|.|++.....
T Consensus 204 ~ip~--~pd~~~iN~Ye~G~~--i~ph~~~-~~F~~Pi~slS~lSe~~m~Fg~~~~~~---------------------- 256 (323)
T KOG4176|consen 204 VIPE--RPDQCTINFYEPGDG--IPPHIDH-SAFLDPISSLSFLSECTMEFGHGLLSD---------------------- 256 (323)
T ss_pred cCCC--CCCeeEEEeeCCCCC--CCCCCCh-HHhcCceEEEEeecceeEEeccccccc----------------------
Confidence 8876 5899998 9999666 9999955 789999999999999999999963210
Q ss_pred cccccccccccccccccccCCceEEEEecCCCeEEEecchhhcccccccccCCCCCCeEEEEecccCCCCC
Q psy4937 270 IVNDLSLNEKSTDSIVNETLSSVVIRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYM 340 (356)
Q Consensus 270 ~~~~~~~~~~~~~~~~~~~~~~~~v~I~Lp~rSLLVMsG~ARy~W~H~I~~~dvr~rRVSLTfR~l~~~~~ 340 (356)
+...+. + ..+++++.||++||.|.+---=.|.++. ++..|||||||.+.+.-.
T Consensus 257 --------------~~~~~~-g-~~s~p~~~g~~lvi~~~~ad~~~~~~~~--~~~kRisitfrki~~~~~ 309 (323)
T KOG4176|consen 257 --------------NIGNFR-G-SLSLPLRYGSVLVIRGRSADVAPHCIRP--SRNKRISITFRKIRPDPC 309 (323)
T ss_pred --------------Cccccc-c-ccccccccCeEEEeCCCcccccccccCC--CCCceEEEEEEEeccCCC
Confidence 001111 2 5678888888888888887777888887 799999999999987544
No 7
>COG3145 AlkB Alkylated DNA repair protein [DNA replication, recombination, and repair]
Probab=99.80 E-value=5.5e-19 Score=161.72 Aligned_cols=167 Identities=18% Similarity=0.282 Sum_probs=118.2
Q ss_pred CCCCCeEEeCCCCCHHHHHHHHHH----hhcCCCCccc---cCcc--ee-ccCCccc---cCCCcccCCCCCCCcHHHHH
Q psy4937 115 FDIGGVYIKEDFLSEAETEDVMKQ----LDDLPWDKSQ---SGRR--KQ-NFGPKCN---FKKQKIKLGDFQGFPACTKF 181 (356)
Q Consensus 115 ~~ipGl~~i~dFIS~~Ee~~Ll~~----id~~pW~~sq---sgRR--~q-~yG~~~n---yk~~kl~~g~f~~lP~~l~~ 181 (356)
...+|+.+.++|+ -.++.+|++. +...||.... .|+. ++ .+|+..+ |.+......+..+||..+..
T Consensus 12 ~~~~G~~~~~~~~-~~~~~~l~~~l~~~~~~~P~~~~~~~~~g~~~sV~r~~~W~~d~~gy~y~~~~p~~~~p~p~l~~~ 90 (194)
T COG3145 12 QLAPGAVILPGFL-LLTQGALVAALLFLLSQAPWFRPRRTPYGKPMSVPRLLGWVTDRRGYRYSLRSPLTGKPWPPLLAL 90 (194)
T ss_pred cCCCCeEEEeccc-ccchHHHHHHHHHhcccCcccceeecCCCcEeeeeeccceecccccccccccccCCCCCCCccHHH
Confidence 4567999999999 4444455444 4558987421 1211 11 4666555 66666655555566765555
Q ss_pred HHHHHhhcccCCCCCCcceeee-EecCCCCCCccCCCCCccccc-CceEEEecCCceEEEEeecCCCCCCcCCCCCcccc
Q psy4937 182 IQDRFHQEDVLSHFQTIEQCTL-EYNSENGSSIDNHIDDCWIWG-ERIVTVNLIGDSVLTLTYYNGDSNKYNLNSPCFSY 259 (356)
Q Consensus 182 l~~Ri~~~~~l~~f~~~~q~~l-eY~pg~G~~I~pHvDd~~~~G-e~IaslSLgs~~vm~f~~~~~~~~~~~l~~~~~~~ 259 (356)
..+...+.+. .+..+++|++ +|+| |++|+||.|.....+ +.||+||||++++|.|++...
T Consensus 91 ~~~~~~~~g~--~~~~~ea~Lvn~Y~p--Gd~ig~HqD~~e~~~~~~v~slSLg~~~~F~~~~~~r-------------- 152 (194)
T COG3145 91 FHDLFGAAGY--PFEGPEAVLVNRYRP--GASIGWHQDKDEEDDRPPVASLSLGAPCIFRLRGRRR-------------- 152 (194)
T ss_pred HHHHHHHhcC--CCCChhheeEEeccC--CCccccccccccccCCCceEEEecCCCeEEEeccccC--------------
Confidence 5555554432 3457888987 9999 477999999865443 369999999999999998632
Q ss_pred CCCCCCcccccccccccccccccccccccCCceEEEEecCCCeEEEecchhhcccccccccCC-CCCCeEEE
Q psy4937 260 QKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPARSLLVLWGEARYEWQHCVLRSD-VIGRRVCI 330 (356)
Q Consensus 260 ~~~e~d~k~~~~~~~~~~~~~~~~~~~~~~~~~~v~I~Lp~rSLLVMsG~ARy~W~H~I~~~d-vr~rRVSL 330 (356)
. +...++.|.+|||+||.|++|+.|.|.|+++. ....||||
T Consensus 153 -----------------------------~-~~~~~~~L~~Gdvvvm~G~~r~~~~h~~p~~~~~~~~Rinl 194 (194)
T COG3145 153 -----------------------------R-GPGLRLRLEHGDVVVMGGPSRLAWHHIIPKTSRLTGQRINL 194 (194)
T ss_pred -----------------------------C-CCceeEEecCCCEEEecCCccccccccccccccCCcccccC
Confidence 2 56789999999999999999999999999873 34477764
No 8
>smart00702 P4Hc Prolyl 4-hydroxylase alpha subunit homologues. Mammalian enzymes catalyse hydroxylation of collagen, for example. Prokaryotic enzymes might catalyse hydroxylation of antibiotic peptides. These are 2-oxoglutarate-dependent dioxygenases, requiring 2-oxoglutarate and dioxygen as cosubstrates and ferrous iron as a cofactor.
Probab=98.14 E-value=9.5e-05 Score=65.99 Aligned_cols=168 Identities=21% Similarity=0.199 Sum_probs=91.8
Q ss_pred CCeEEeCCCCCHHHHHHHHHHhhcCCCCccccCcceeccCCccccCCCcccCCCCCC--CcHHHHHHHHHHhhcccCCC-
Q psy4937 118 GGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQG--FPACTKFIQDRFHQEDVLSH- 194 (356)
Q Consensus 118 pGl~~i~dFIS~~Ee~~Ll~~id~~pW~~sqsgRR~q~yG~~~nyk~~kl~~g~f~~--lP~~l~~l~~Ri~~~~~l~~- 194 (356)
|++++++||||++|.+.|++.....-|. +...+....=+..-+|+..... .+.. -++....|.+|+.+...+..
T Consensus 1 P~i~~~~~~ls~~ec~~li~~~~~~~~~-~~~~~~~~~~~~~~~~R~~~~~--~l~~~~~~~~~~~l~~~i~~~~~~~~~ 77 (178)
T smart00702 1 PGVVVFHDFLSPAECQKLLEEAEPLGWR-GEVTRGDTNPNHDSKYRQSNGT--WLELLKGDLVIERIRQRLADFLGLLRG 77 (178)
T ss_pred CcEEEECCCCCHHHHHHHHHHhhhhccc-ceeecCCCCccccCCCEeecce--ecCCCCCCHHHHHHHHHHHHHHCCCch
Confidence 7899999999999999999998876664 3221111100011122222211 1122 15667788888887655431
Q ss_pred -CCCcceeee-EecCCCCCCccCCCCCcccc--cCceEEEecCCceEEEEeecCCCCCCcCCCCCccccCCCCCCccccc
Q psy4937 195 -FQTIEQCTL-EYNSENGSSIDNHIDDCWIW--GERIVTVNLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTI 270 (356)
Q Consensus 195 -f~~~~q~~l-eY~pg~G~~I~pHvDd~~~~--Ge~IaslSLgs~~vm~f~~~~~~~~~~~l~~~~~~~~~~e~d~k~~~ 270 (356)
....+.+.+ .|.+| ....+|.|....- |.+++++.+ +=.. . . .
T Consensus 78 ~~~~~~~~~~~~Y~~g--~~~~~H~D~~~~~~~~~r~~T~~~----------yLn~--------~------~-------~ 124 (178)
T smart00702 78 LPLSAEDAQVARYGPG--GHYGPHVDNFEDDENGDRIATFLL----------YLND--------V------E-------E 124 (178)
T ss_pred hhccCcceEEEEECCC--CcccCcCCCCCCCCCCCeEEEEEE----------Eecc--------C------C-------c
Confidence 122344555 99994 5699999975321 233433332 2000 0 0 0
Q ss_pred ccccccccccccccccccCCceEEEEecCCCeEEEecchhhcccccccccCCCCCCeEEEEe
Q psy4937 271 VNDLSLNEKSTDSIVNETLSSVVIRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAY 332 (356)
Q Consensus 271 ~~~~~~~~~~~~~~~~~~~~~~~v~I~Lp~rSLLVMsG~ARy~W~H~I~~~dvr~rRVSLTf 332 (356)
.|.+.. ... .......|.-..|+++++...-. ...|++.+.. .+.|++++.
T Consensus 125 GG~~~f---~~~------~~~~~~~v~P~~G~~v~f~~~~~-~~~H~v~pv~-~G~r~~~~~ 175 (178)
T smart00702 125 GGELVF---PGL------GLMVCATVKPKKGDLLFFPSGRG-RSLHGVCPVT-RGSRWAITG 175 (178)
T ss_pred CceEEe---cCC------CCccceEEeCCCCcEEEEeCCCC-CccccCCcce-eCCEEEEEE
Confidence 010000 000 00123567778899999874322 4559988842 488999885
No 9
>PF03171 2OG-FeII_Oxy: 2OG-Fe(II) oxygenase superfamily Entry for Lysyl hydrolases This Prosite entry is a sub-family of the Pfam entry; InterPro: IPR005123 This domain is found in members of the 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily [], as well as the C-terminal of prolyl 4-hydroxylase alpha subunit. The holoenzyme has the activity (1.14.11.2 from EC) catalysing the reaction: Procollagen L-proline + 2-oxoglutarate + O2 = procollagen trans-4-hydroxy-L-proline + succinate + CO2. The full enzyme consists of a alpha2 beta2 complex with the alpha subunit contributing most of the parts of the active site []. The family also includes lysyl hydrolases, isopenicillin synthases and AlkB. ; GO: 0016491 oxidoreductase activity, 0016706 oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen, 2-oxoglutarate as one donor, and incorporation of one atom each of oxygen into both donors, 0055114 oxidation-reduction process; PDB: 3ON7_D 1BK0_A 1IPS_B 1QIQ_A 1OC1_A 2Y86_A 2Y60_A 1W03_A 2VE1_A 1QJF_A ....
Probab=98.04 E-value=7.3e-06 Score=66.08 Aligned_cols=83 Identities=24% Similarity=0.352 Sum_probs=47.0
Q ss_pred ceeee-Eec-CCCCCCccCCCCCcccccCceEEEecC-CceEEEEeecCCCCCCcCCCCCccccCCCCCCcccccccccc
Q psy4937 199 EQCTL-EYN-SENGSSIDNHIDDCWIWGERIVTVNLI-GDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLS 275 (356)
Q Consensus 199 ~q~~l-eY~-pg~G~~I~pHvDd~~~~Ge~IaslSLg-s~~vm~f~~~~~~~~~~~l~~~~~~~~~~e~d~k~~~~~~~~ 275 (356)
+++.+ +|. +.+|.++++|.|.+ ..+++|.+. ...-|.|...
T Consensus 2 ~~~~~~~Y~~~~~~~~~~~H~D~~----~~~~Til~~~~~~gL~~~~~-------------------------------- 45 (98)
T PF03171_consen 2 SQLRLNRYPPPENGVGIGPHTDDE----DGLLTILFQDEVGGLQVRDD-------------------------------- 45 (98)
T ss_dssp -EEEEEEE-SCCGCEEEEEEEES------SSEEEEEETSTS-EEEEET--------------------------------
T ss_pred CEEEEEECCCcccCCceeCCCcCC----CCeEEEEecccchheecccc--------------------------------
Confidence 45666 999 55566799999985 567777776 6666666653
Q ss_pred cccccccccccccCCceEEEEecCCCeEEEecch--------hhcccccccccCCCCCCeEEEEec
Q psy4937 276 LNEKSTDSIVNETLSSVVIRIPMPARSLLVLWGE--------ARYEWQHCVLRSDVIGRRVCIAYR 333 (356)
Q Consensus 276 ~~~~~~~~~~~~~~~~~~v~I~Lp~rSLLVMsG~--------ARy~W~H~I~~~dvr~rRVSLTfR 333 (356)
+..+.+...+++++|+-|+ .+..+.|+|.+.. .+.|+|+||+
T Consensus 46 ---------------~~~~~v~~~~~~~~v~~G~~l~~~t~g~~~~~~HrV~~~~-~~~R~s~~~f 95 (98)
T PF03171_consen 46 ---------------GEWVDVPPPPGGFIVNFGDALEILTNGRYPATLHRVVPPT-EGERYSLTFF 95 (98)
T ss_dssp ---------------TEEEE----TTCEEEEEBHHHHHHTTTSS----EEEE--S-TS-EEEEEEE
T ss_pred ---------------ccccCccCccceeeeeceeeeecccCCccCCceeeeEcCC-CCCEEEEEEE
Confidence 1234444444455555555 8899999999965 7999999996
No 10
>PRK05467 Fe(II)-dependent oxygenase superfamily protein; Provisional
Probab=97.90 E-value=0.00017 Score=68.15 Aligned_cols=165 Identities=19% Similarity=0.185 Sum_probs=91.1
Q ss_pred eEEeCCCCCHHHHHHHHHHhhcCCCCccccCcceeccCCccccCCCcccCCCCCCCcHHHH----HHHHHHhhcccCCCC
Q psy4937 120 VYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTK----FIQDRFHQEDVLSHF 195 (356)
Q Consensus 120 l~~i~dFIS~~Ee~~Ll~~id~~pW~~sqsgRR~q~yG~~~nyk~~kl~~g~f~~lP~~l~----~l~~Ri~~~~~l~~f 195 (356)
++.|+++||++|-++|++.++..+|.. |+.+..++.+ .+|.+ .+.. ..-| ..+ .|.+++.+.+.+..+
T Consensus 2 i~~I~~vLs~eec~~~~~~le~~~~~d---g~~taG~~~~-~vKnN-~ql~--~d~~-~a~~l~~~i~~~L~~~~l~~sa 73 (226)
T PRK05467 2 LLHIPDVLSPEEVAQIRELLDAAEWVD---GRVTAGAQAA-QVKNN-QQLP--EDSP-LARELGNLILDALTRNPLFFSA 73 (226)
T ss_pred eeeecccCCHHHHHHHHHHHHhcCCcc---CCcCcCccch-hcccc-cccC--CCCH-HHHHHHHHHHHHHhcCchhhhh
Confidence 578999999999999999999999983 6666655421 22221 1111 1222 333 344455444443222
Q ss_pred CCcc----eeeeEecCCCCCCccCCCCCcccccCceEEEecCCceEEEEeecCCCCCCcCCCCCccccCCCCCCcccccc
Q psy4937 196 QTIE----QCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIV 271 (356)
Q Consensus 196 ~~~~----q~~leY~pg~G~~I~pHvDd~~~~Ge~IaslSLgs~~vm~f~~~~~~~~~~~l~~~~~~~~~~e~d~k~~~~ 271 (356)
.-+. -..+.|.+|+ ..+||+|....-.+. - .-.--+.++|.-+=.+ ...|+ .
T Consensus 74 ~lp~~i~~~~f~rY~~G~--~y~~H~D~~~~~~~~-~--~~~~rs~lS~~lyLnd--------~~~ye-----------G 129 (226)
T PRK05467 74 ALPRKIHPPLFNRYEGGM--SYGFHVDNAVRSLPG-T--GGRVRTDLSATLFLSD--------PDDYD-----------G 129 (226)
T ss_pred ccccccccceEEEECCCC--ccCccccCCcccCCC-C--CcceeEEEEEEEEeCC--------CCCCc-----------C
Confidence 2111 2334999965 599999984321000 0 0000133444433110 01110 0
Q ss_pred cccccccccccccccccCCceEEEEecCCCeEEEecchhhcccccccccCCCCCCeEEEEe
Q psy4937 272 NDLSLNEKSTDSIVNETLSSVVIRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAY 332 (356)
Q Consensus 272 ~~~~~~~~~~~~~~~~~~~~~~v~I~Lp~rSLLVMsG~ARy~W~H~I~~~dvr~rRVSLTf 332 (356)
|-... .. . .....|.+|.|++++.... -.|++.+. .++.|+++++
T Consensus 130 GEl~~---------~~-~-~g~~~Vkp~aG~~vlfps~----~lH~v~pV-t~G~R~~~~~ 174 (226)
T PRK05467 130 GELVI---------ED-T-YGEHRVKLPAGDLVLYPST----SLHRVTPV-TRGVRVASFF 174 (226)
T ss_pred CceEE---------ec-C-CCcEEEecCCCeEEEECCC----Cceeeeec-cCccEEEEEe
Confidence 10110 00 1 2246789999999999864 55999885 3899999998
No 11
>KOG2731|consensus
Probab=96.38 E-value=0.0046 Score=61.70 Aligned_cols=116 Identities=10% Similarity=0.108 Sum_probs=77.0
Q ss_pred CCccccCCCc----ccCCCCCCCcHHHHHHHHHHhhcccCCCCCCcceee-eEecCCCCCCccCCCCC-cccccCceEEE
Q psy4937 157 GPKCNFKKQK----IKLGDFQGFPACTKFIQDRFHQEDVLSHFQTIEQCT-LEYNSENGSSIDNHIDD-CWIWGERIVTV 230 (356)
Q Consensus 157 G~~~nyk~~k----l~~g~f~~lP~~l~~l~~Ri~~~~~l~~f~~~~q~~-leY~pg~G~~I~pHvDd-~~~~Ge~Iasl 230 (356)
|..+|...+. +........|..+.....++....++.++...+-++ +.|.+ |+.++-|.|. +..+++.+.+.
T Consensus 171 G~~~dw~s~~~~~~~s~k~~~~~~~ll~~~~~~~~~a~~~~~~~~~~Gli~nYlsi--~~tl~ih~d~reld~~~pf~s~ 248 (378)
T KOG2731|consen 171 GNQYDWSSKDIFIFLSKKHYNIKPSLLGLLREKVKAAKGFSHIVIRPGLIKNYLSI--DDTLGIHLDCRELDLSKPFYSP 248 (378)
T ss_pred ccccCCccccccccccccCCCCChHHhhhhhhhhhhhcCccceeccCcceeeeccc--CcEEEEEeehhhcccCCccccc
Confidence 5555555555 211111234455556666666655555554444444 48999 4569999996 55667789999
Q ss_pred ecCCceEEEEeecCCCCCCcCCCCCccccCCCCCCcccccccccccccccccccccccCCceEEEEecCCCeEEEecchh
Q psy4937 231 NLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPARSLLVLWGEA 310 (356)
Q Consensus 231 SLgs~~vm~f~~~~~~~~~~~l~~~~~~~~~~e~d~k~~~~~~~~~~~~~~~~~~~~~~~~~~v~I~Lp~rSLLVMsG~A 310 (356)
|||-++++-+.-... . .....+.|..|..++|.|.+
T Consensus 249 s~g~~ai~lLg~m~l-------------------------------------------~-e~p~p~~lrsGdv~im~Gfs 284 (378)
T KOG2731|consen 249 SLGQGAILLLGMMCL-------------------------------------------G-ENPDPMTLRSGDVVIMDGFS 284 (378)
T ss_pred cccccceeeeccccc-------------------------------------------C-CCCCccccccCceEeecchH
Confidence 999999988764321 0 12344788999999999999
Q ss_pred hcccccccc
Q psy4937 311 RYEWQHCVL 319 (356)
Q Consensus 311 Ry~W~H~I~ 319 (356)
|.-+ |+|+
T Consensus 285 rlv~-haIp 292 (378)
T KOG2731|consen 285 RLVE-HAIP 292 (378)
T ss_pred HHHh-hccc
Confidence 9776 9999
No 12
>PLN00052 prolyl 4-hydroxylase; Provisional
Probab=95.35 E-value=0.14 Score=50.68 Aligned_cols=94 Identities=18% Similarity=0.296 Sum_probs=53.2
Q ss_pred CCCeEEeCCCCCHHHHHHHHHHhhcCCCCccccCcceecc--CCccccCCCcccCCCC-C-CCcHHHHHHHHHHhhcccC
Q psy4937 117 IGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNF--GPKCNFKKQKIKLGDF-Q-GFPACTKFIQDRFHQEDVL 192 (356)
Q Consensus 117 ipGl~~i~dFIS~~Ee~~Ll~~id~~pW~~sqsgRR~q~y--G~~~nyk~~kl~~g~f-~-~lP~~l~~l~~Ri~~~~~l 192 (356)
-|-+++++||||++|.+.|++.... .+..+. +..- |.... +..+...+.| . .--+.+..|.+||....++
T Consensus 53 ~P~i~~~~nfLs~~Ecd~Li~la~~-~l~~S~----v~~~~~g~~~~-s~~RTS~~~~l~~~~dpvv~~I~~Ria~~t~l 126 (310)
T PLN00052 53 QPRIFVYKGFLSDAECDHLVKLAKK-KIQRSM----VADNKSGKSVM-SEVRTSSGMFLDKRQDPVVSRIEERIAAWTFL 126 (310)
T ss_pred CCCEEEECCcCCHHHHHHHHHhccc-ccccce----eecCCCCcccc-CCCEEecceeecCCCCHHHHHHHHHHHHHhCC
Confidence 4789999999999999999987643 222111 0000 10000 0001111111 1 1123677888888886554
Q ss_pred CCCCCcceeee-EecCCCCCCccCCCCC
Q psy4937 193 SHFQTIEQCTL-EYNSENGSSIDNHIDD 219 (356)
Q Consensus 193 ~~f~~~~q~~l-eY~pg~G~~I~pHvDd 219 (356)
.- ...+.+.+ .|.+|+ ...||.|-
T Consensus 127 p~-~~~E~lQVlrY~~Gq--~Y~~H~D~ 151 (310)
T PLN00052 127 PE-ENAENIQILRYEHGQ--KYEPHFDY 151 (310)
T ss_pred Cc-ccCcceEEEecCCCC--CCCCCCCc
Confidence 31 12345555 999965 49999994
No 13
>COG3128 PiuC Uncharacterized iron-regulated protein [Function unknown]
Probab=94.96 E-value=0.25 Score=45.78 Aligned_cols=169 Identities=17% Similarity=0.142 Sum_probs=88.7
Q ss_pred CeEEeCCCCCHHHHHHHHHHhhcCCCCccccCcceeccC-CccccCCCcccCCCCCCCcHHHH-HHHHHHhhcccCCCCC
Q psy4937 119 GVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFG-PKCNFKKQKIKLGDFQGFPACTK-FIQDRFHQEDVLSHFQ 196 (356)
Q Consensus 119 Gl~~i~dFIS~~Ee~~Ll~~id~~pW~~sqsgRR~q~yG-~~~nyk~~kl~~g~f~~lP~~l~-~l~~Ri~~~~~l~~f~ 196 (356)
-+.-||..||+++.+.+-+.+...+|.. ||-.|-+- +++. +..+...-.++-..+. .|++++.+.+.+-.-.
T Consensus 3 m~lhIp~VLs~a~va~iRa~l~~A~w~d---Grat~g~q~a~vk---~n~qlp~~s~l~~~vg~~il~al~~~plff~aA 76 (229)
T COG3128 3 MMLHIPEVLSEAQVARIRAALEQAEWVD---GRATQGPQGAQVK---NNLQLPQDSALARELGNEILQALTAHPLFFAAA 76 (229)
T ss_pred eEEechhhCCHHHHHHHHHHHhhccccc---cccccCcchhhhh---ccccCCcccHHHHHHHHHHHHHHHhchhHHHhh
Confidence 3567899999999999999999999984 78776653 3332 2222211122222222 3556665544321100
Q ss_pred Ccce----eeeEecCCCCCCccCCCCCcccc-cCceEEEecCCceEEEEeecCCCCCCcCCCCCccccCCCCCCcccccc
Q psy4937 197 TIEQ----CTLEYNSENGSSIDNHIDDCWIW-GERIVTVNLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIV 271 (356)
Q Consensus 197 ~~~q----~~leY~pg~G~~I~pHvDd~~~~-Ge~IaslSLgs~~vm~f~~~~~~~~~~~l~~~~~~~~~~e~d~k~~~~ 271 (356)
-+.- .-+.|.-| +..++|+|+--.. .+ .-+..|- +++..+.--.++ .+|+ .
T Consensus 77 Lp~t~~~P~Fn~Y~eg--~~f~fHvDgavr~~hp-~~~~~lr--tdls~tlfl~DP--------edYd-----------G 132 (229)
T COG3128 77 LPRTCLPPLFNRYQEG--DFFGFHVDGAVRSIHP-GSGFRLR--TDLSCTLFLSDP--------EDYD-----------G 132 (229)
T ss_pred cccccCCchhhhccCC--CcccccccCcccccCC-CCCceeE--eeeeeeeecCCc--------cccC-----------C
Confidence 0111 22478884 5599999973211 00 0000111 111111110111 1111 1
Q ss_pred cccccccccccccccccCCceEEEEecCCCeEEEecchhhcccccccccCCCCCCeEEEEec
Q psy4937 272 NDLSLNEKSTDSIVNETLSSVVIRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYR 333 (356)
Q Consensus 272 ~~~~~~~~~~~~~~~~~~~~~~v~I~Lp~rSLLVMsG~ARy~W~H~I~~~dvr~rRVSLTfR 333 (356)
| ++ .++. . --..+|.||.|+|++.-+.+ .|.|.+.. |+.|+.+-|-
T Consensus 133 G--eL-------Vv~d-t-Yg~h~VklPAGdLVlypStS----lH~VtPVT-Rg~R~asffW 178 (229)
T COG3128 133 G--EL-------VVND-T-YGNHRVKLPAGDLVLYPSTS----LHEVTPVT-RGERFASFFW 178 (229)
T ss_pred c--eE-------EEec-c-ccceEEeccCCCEEEccccc----ceeccccc-cCceEEEeee
Confidence 1 00 0111 1 22578999999999988876 58888842 8999888773
No 14
>PF13640 2OG-FeII_Oxy_3: 2OG-Fe(II) oxygenase superfamily; PDB: 3DKQ_B 3GZE_D 3HQR_A 2Y34_A 2G1M_A 2G19_A 3OUI_A 3OUJ_A 2HBU_A 2Y33_A ....
Probab=94.35 E-value=0.03 Score=44.98 Aligned_cols=32 Identities=19% Similarity=0.179 Sum_probs=25.9
Q ss_pred cCCCeEEEecchhhcccccccccCCCCCCeEEEEe
Q psy4937 298 MPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAY 332 (356)
Q Consensus 298 Lp~rSLLVMsG~ARy~W~H~I~~~dvr~rRVSLTf 332 (356)
..+|+++++.+ ....|++.+....++|++|++
T Consensus 66 p~~g~~v~F~~---~~~~H~v~~v~~~~~R~~l~~ 97 (100)
T PF13640_consen 66 PKPGRLVIFPS---DNSLHGVTPVGEGGRRYSLTF 97 (100)
T ss_dssp -BTTEEEEEES---CTCEEEEEEE-EESEEEEEEE
T ss_pred CCCCEEEEEeC---CCCeecCcccCCCCCEEEEEE
Confidence 78889999999 566799999755999999986
No 15
>KOG1591|consensus
Probab=90.85 E-value=1.2 Score=43.75 Aligned_cols=93 Identities=18% Similarity=0.217 Sum_probs=54.4
Q ss_pred CCCeEEeCCCCCHHHHHHHHHHhhc--CCCCccccCccee-ccC-CccccCCCcccCCCC--CCCcHHHHHHHHHHhhcc
Q psy4937 117 IGGVYIKEDFLSEAETEDVMKQLDD--LPWDKSQSGRRKQ-NFG-PKCNFKKQKIKLGDF--QGFPACTKFIQDRFHQED 190 (356)
Q Consensus 117 ipGl~~i~dFIS~~Ee~~Ll~~id~--~pW~~sqsgRR~q-~yG-~~~nyk~~kl~~g~f--~~lP~~l~~l~~Ri~~~~ 190 (356)
-|.+.+++|||+++|.+.|+..-.. .++. +. +.| .+.+-..-+...+.| .+=.+.+..|-+||....
T Consensus 96 ~P~~~~yhd~ls~~e~d~l~~lak~~l~~st-------v~~~~~~~~~~~~~~R~S~~t~l~~~~~~~~~~i~~ri~~~T 168 (289)
T KOG1591|consen 96 DPRVVLYHDFLSDEECDHLISLAKPKLERST-------VVADKGTGHSTTSAVRTSSGTFLPDGASPVVSRIEQRIADLT 168 (289)
T ss_pred CCceEeehhcCCHHHHHHHHHhhhhhhhcee-------eeccCCcccccceeeEecceeEecCCCCHHHHHHHHHHHhcc
Confidence 4679999999999999999876543 1222 21 222 011100001111212 123456777888888877
Q ss_pred cCCCCCCcceeee-EecCCCCCCccCCCCC
Q psy4937 191 VLSHFQTIEQCTL-EYNSENGSSIDNHIDD 219 (356)
Q Consensus 191 ~l~~f~~~~q~~l-eY~pg~G~~I~pHvDd 219 (356)
++. +..-+.+.+ .|.+|+ .-.||.|-
T Consensus 169 ~l~-~e~~E~lqVlnYg~Gg--~Y~~H~D~ 195 (289)
T KOG1591|consen 169 GLP-VENGESLQVLNYGLGG--HYEPHYDY 195 (289)
T ss_pred CCC-cccCccceEEEecCCc--cccccccc
Confidence 653 233456665 899954 69999884
No 16
>TIGR01762 chlorin-enz chlorinating enzymes. This model represents a a group of highly homologous enzymes related to dioxygenases which chlorinate amino acid methyl groups. BarB1 and BarB2 are proposed to trichlorinate one of the methyl groups of a leucine residue in the biosynthesis of barbamide in the cyanobacterium Lyngbya majuscula. SyrB2 is proposed to chlorinate the methyl group of threonine in the biosynthesis of syringomycin in Pseudomonas syringae. CmaB is proposed to chlorinate the beta-methyl group of alloisoleucine in the process of ring closure in the biosynthesis of coronamic acid, a component of coronatine also in Pseudomonas syringae.
Probab=88.24 E-value=13 Score=36.27 Aligned_cols=43 Identities=16% Similarity=0.264 Sum_probs=34.5
Q ss_pred eEEEEecCCCeEEEecchhhcccccccccCCC-CCCeEEEEecccCCC
Q psy4937 292 VVIRIPMPARSLLVLWGEARYEWQHCVLRSDV-IGRRVCIAYREFTAP 338 (356)
Q Consensus 292 ~~v~I~Lp~rSLLVMsG~ARy~W~H~I~~~dv-r~rRVSLTfR~l~~~ 338 (356)
..+.+.|++|+++++.+-+= |+-.+..- ..+|+++++|.+.+.
T Consensus 207 ~~v~~~lkaGd~~~f~~~t~----HgS~~N~S~~~~R~~~~~ry~~~~ 250 (288)
T TIGR01762 207 SAVPMQMKAGQFIIFWSTLM----HASYPNSGESQMRMGFASRYVPSF 250 (288)
T ss_pred ceeeeeeCCceEEEECCCce----ecCCCCCCCCceEEEEEEEEcCCC
Confidence 46899999999999998763 77766643 467999999998663
No 17
>PF05721 PhyH: Phytanoyl-CoA dioxygenase (PhyH); InterPro: IPR008775 This family is made up of several eukaryotic phytanoyl-CoA dioxygenase (PhyH) proteins as well as a number of bacterial deoxygenases. PhyH is a peroxisomal enzyme catalysing the first step of phytanic acid alpha-oxidation. PhyH deficiency causes Refsum's disease (RD) which is an inherited neurological syndrome biochemically characterised by the accumulation of phytanic acid in plasma and tissues [].; PDB: 3GJA_A 3EMR_A 3OBZ_A 2OPW_A 3NNL_B 3NNF_A 3NNM_B 3NNJ_A 2FCV_B 2FCU_A ....
Probab=85.01 E-value=1.7 Score=37.95 Aligned_cols=25 Identities=32% Similarity=0.484 Sum_probs=21.3
Q ss_pred CCeEEeCCCCCHHHHHHHHHHhhcC
Q psy4937 118 GGVYIKEDFLSEAETEDVMKQLDDL 142 (356)
Q Consensus 118 pGl~~i~dFIS~~Ee~~Ll~~id~~ 142 (356)
.|..+++++|+++|-+.|.+.+++.
T Consensus 4 ~Gyvvi~~~l~~~~~~~l~~~~~~~ 28 (211)
T PF05721_consen 4 DGYVVIRNVLSPEEVERLREELDRL 28 (211)
T ss_dssp HSEEEETTSS-HHHHHHHHHHHHHH
T ss_pred CcEEEECCcCCHHHHHHHHHHHHHH
Confidence 4899999999999999999998763
No 18
>PF13719 zinc_ribbon_5: zinc-ribbon domain
Probab=84.61 E-value=0.62 Score=31.75 Aligned_cols=35 Identities=31% Similarity=0.812 Sum_probs=23.9
Q ss_pred ccccCCCCcccccCccccccCCCCCCCCccccccccccccccccccccceeeccccccccc
Q psy4937 37 YIYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDFLTSQTKETYVYCPHCNKAWQ 97 (356)
Q Consensus 37 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~c~~c~~~~~ 97 (356)
.|.||+|+. .|.|- +++. ......+-|+.|+..|.
T Consensus 2 ~i~CP~C~~-------------------~f~v~-----~~~l--~~~~~~vrC~~C~~~f~ 36 (37)
T PF13719_consen 2 IITCPNCQT-------------------RFRVP-----DDKL--PAGGRKVRCPKCGHVFR 36 (37)
T ss_pred EEECCCCCc-------------------eEEcC-----HHHc--ccCCcEEECCCCCcEee
Confidence 378999998 44444 2232 24567899999998763
No 19
>PF10083 DUF2321: Uncharacterized protein conserved in bacteria (DUF2321); InterPro: IPR016891 This entry is represented by Bacteriophage 'Lactobacillus prophage Lj928', Orf-Ljo1454. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=80.88 E-value=0.49 Score=42.43 Aligned_cols=50 Identities=28% Similarity=0.581 Sum_probs=33.2
Q ss_pred cccccccCCCCcccccCccccccCCCCCCCCccccccccccccccccccccceeeccccccccccCccccc
Q psy4937 34 EETYIYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAY 104 (356)
Q Consensus 34 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~c~~c~~~~~g~~~~~~ 104 (356)
++|-..||+|+-+-.|.. |... -|.++ ...+.-+||..|++.|| |-..++
T Consensus 36 ~~tI~~Cp~C~~~IrG~y------~v~g---v~~~g-----------~~~~~PsYC~~CGkpyP-Wt~~~L 85 (158)
T PF10083_consen 36 AKTITSCPNCSTPIRGDY------HVEG---VFGLG-----------GHYEAPSYCHNCGKPYP-WTENAL 85 (158)
T ss_pred HHHHHHCcCCCCCCCCce------ecCC---eeeeC-----------CCCCCChhHHhCCCCCc-hHHHHH
Confidence 467788999998766652 2211 11111 24557799999999999 976654
No 20
>PF13248 zf-ribbon_3: zinc-ribbon domain
Probab=79.93 E-value=0.86 Score=28.62 Aligned_cols=12 Identities=25% Similarity=0.916 Sum_probs=9.1
Q ss_pred ccceeecccccc
Q psy4937 83 KETYVYCPHCNK 94 (356)
Q Consensus 83 ~~~~~~c~~c~~ 94 (356)
.....|||.|+.
T Consensus 13 ~~~~~fC~~CG~ 24 (26)
T PF13248_consen 13 DPDAKFCPNCGA 24 (26)
T ss_pred CcccccChhhCC
Confidence 456789999984
No 21
>TIGR02098 MJ0042_CXXC MJ0042 family finger-like domain. This domain contains a CXXCX(19)CXXC motif suggestive of both zinc fingers and thioredoxin, usually found at the N-terminus of prokaryotic proteins. One partially characterized gene, agmX, is among a large set in Myxococcus whose interruption affects adventurous gliding motility.
Probab=76.58 E-value=1.7 Score=29.27 Aligned_cols=13 Identities=31% Similarity=0.871 Sum_probs=10.3
Q ss_pred ceeeccccccccc
Q psy4937 85 TYVYCPHCNKAWQ 97 (356)
Q Consensus 85 ~~~~c~~c~~~~~ 97 (356)
..+.||.|+..|.
T Consensus 24 ~~v~C~~C~~~~~ 36 (38)
T TIGR02098 24 GKVRCGKCGHVWY 36 (38)
T ss_pred CEEECCCCCCEEE
Confidence 4789999997753
No 22
>PF13717 zinc_ribbon_4: zinc-ribbon domain
Probab=75.35 E-value=2 Score=29.13 Aligned_cols=15 Identities=27% Similarity=0.678 Sum_probs=11.6
Q ss_pred cccceeecccccccc
Q psy4937 82 TKETYVYCPHCNKAW 96 (356)
Q Consensus 82 ~~~~~~~c~~c~~~~ 96 (356)
..-.-+-|+.|+..|
T Consensus 21 ~~g~~v~C~~C~~~f 35 (36)
T PF13717_consen 21 PKGRKVRCSKCGHVF 35 (36)
T ss_pred CCCcEEECCCCCCEe
Confidence 444678999999876
No 23
>PF12933 FTO_NTD: FTO catalytic domain; InterPro: IPR024367 Alpha-ketoglutarate-dependent dioxygenase FTO, also known as Fat mass and obesity-associated protein, is a nucleus protein which belongs to the FTO family. This enzyme is a dioxygenase that repairs alkylated DNA and RNA by oxidative demethylation []. FTO activity is highest towards single-stranded RNA containing 3-methyluracil, followed by single-stranded DNA containing 3-methylthymine. FTO has low demethylase activity towards single-stranded DNA containing 1-methyladenine or 3-methylcytosine []. FTO has no activity towards 1-methylguanine. It has no detectable activity towards double-stranded DNA. FTO requires molecular oxygen, alpha-ketoglutarate and iron. FTO contributes to the regulation of the global metabolic rate, energy expenditure and energy homeostasis. It contributes to the regulation of body size and body fat accumulation as well []. This domain is the catalytic AlkB-like domain from the FTO protein []. This domain catalyses a demethylase activity with a preference for 3-methylthymidine.; PDB: 3LFM_A.
Probab=75.15 E-value=9.3 Score=36.85 Aligned_cols=42 Identities=19% Similarity=0.309 Sum_probs=31.3
Q ss_pred ceEEEEecCCCeEEEecchhhcccccccccCCCCCCeEEEEecc
Q psy4937 291 SVVIRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYRE 334 (356)
Q Consensus 291 ~~~v~I~Lp~rSLLVMsG~ARy~W~H~I~~~dvr~rRVSLTfR~ 334 (356)
.+...|+|+.|+.|.|-++--..-+|+|..- ..+|+|-|-|-
T Consensus 209 tP~L~vPL~sgd~Y~Mldd~N~tHqH~VlaG--~~~RfSSTHRV 250 (253)
T PF12933_consen 209 TPGLAVPLRSGDCYYMLDDFNATHQHCVLAG--SSARFSSTHRV 250 (253)
T ss_dssp S-EEEEEE-TT-EEEE-TTHHHHEEEEEE----SS-EEEEEEE-
T ss_pred CCeeEEeccCCCeEEEccccchhhHHHHhcC--CCcccccccee
Confidence 5679999999999999999999999999996 45599999873
No 24
>PF09859 Oxygenase-NA: Oxygenase, catalysing oxidative methylation of damaged DNA; InterPro: IPR018655 This family of various hypothetical prokaryotic proteins, has no known function.
Probab=70.75 E-value=25 Score=32.12 Aligned_cols=114 Identities=22% Similarity=0.252 Sum_probs=65.8
Q ss_pred CCCcHHHHHHHHHHhhcccCCCCCCcceeeeEecCCCCCCccCCCCCcccccCceE---EEecCCceEEEEeecCCCCCC
Q psy4937 173 QGFPACTKFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIV---TVNLIGDSVLTLTYYNGDSNK 249 (356)
Q Consensus 173 ~~lP~~l~~l~~Ri~~~~~l~~f~~~~q~~leY~pg~G~~I~pHvDd~~~~Ge~Ia---slSLgs~~vm~f~~~~~~~~~ 249 (356)
..+|+.+..++++..+.+- ..+-.+++.|.+ |+...-|.|. +|+.+. .+=|+|.---+|+-.+
T Consensus 41 ~~yP~~~~~fl~~ch~aGQ----~rptplllrY~~--gdyn~LHqdl---yGe~vFPlQvv~lLs~Pg~DftGGE----- 106 (173)
T PF09859_consen 41 ARYPATLAEFLARCHAAGQ----TRPTPLLLRYGP--GDYNCLHQDL---YGEHVFPLQVVILLSEPGEDFTGGE----- 106 (173)
T ss_pred CCCCccHHHHHHHHHhccC----CCCchhhheeCC--CCccccccCC---CCCcccCeEEEEEcCCCCCcccCce-----
Confidence 4577888888888777643 234457779999 6789999994 675422 1122222111111100
Q ss_pred cCCCCCccccCCCCCCcccccccccccccccccccccccCCceEEEEecCCCeEEEec----------chhhcccccccc
Q psy4937 250 YNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPARSLLVLW----------GEARYEWQHCVL 319 (356)
Q Consensus 250 ~~l~~~~~~~~~~e~d~k~~~~~~~~~~~~~~~~~~~~~~~~~~v~I~Lp~rSLLVMs----------G~ARy~W~H~I~ 319 (356)
| +..+++... . ....-++|..|..+|+. |--|-.-.|||.
T Consensus 107 F----------------------------VltEQrPR~-Q-SR~~V~~L~qGda~if~t~~RPv~G~rG~yRv~~RHgVS 156 (173)
T PF09859_consen 107 F----------------------------VLTEQRPRM-Q-SRAMVLPLRQGDALIFATNHRPVRGARGYYRVNMRHGVS 156 (173)
T ss_pred E----------------------------EEEEecCCc-c-CccccCCcCCCCEEEEecCCCCcCCCccceecccccccc
Confidence 0 000011010 1 34456888899999885 566677889988
Q ss_pred cCCCCCCeEEEE
Q psy4937 320 RSDVIGRRVCIA 331 (356)
Q Consensus 320 ~~dvr~rRVSLT 331 (356)
+-. .++|..|.
T Consensus 157 ~vr-sG~R~tLg 167 (173)
T PF09859_consen 157 RVR-SGERHTLG 167 (173)
T ss_pred ccc-ccceEEEE
Confidence 842 67777664
No 25
>PF13240 zinc_ribbon_2: zinc-ribbon domain
Probab=70.72 E-value=2.1 Score=26.27 Aligned_cols=10 Identities=30% Similarity=1.115 Sum_probs=7.0
Q ss_pred ceeecccccc
Q psy4937 85 TYVYCPHCNK 94 (356)
Q Consensus 85 ~~~~c~~c~~ 94 (356)
.-.||++|+.
T Consensus 12 ~~~fC~~CG~ 21 (23)
T PF13240_consen 12 DAKFCPNCGT 21 (23)
T ss_pred cCcchhhhCC
Confidence 4557888874
No 26
>PF12851 Tet_JBP: Oxygenase domain of the 2OGFeDO superfamily ; InterPro: IPR024779 TETs are 2OG- and Fe(II)-dependent oxygenases that catalyse the conversion of 5 methyl-Cytosine (5-MC) to 5-hydroxymethyl-cytosine (hmC) in cultured cells and in vitro []. Interestingly TET2 is considered as an oncogene, as it is found mutated in some types of cancer []. This entry represents the double-stranded beta helix (DSBH) fold of the 2-oxoglutarate (2OG) - Fe(II) oxygenases. DSBH comprises a part of the catalytic domain in TETS. It is found in many organisms including fruit fly, African malaria mosquito, zebrafish, mouse and human.
Probab=68.30 E-value=6.7 Score=35.50 Aligned_cols=38 Identities=13% Similarity=0.307 Sum_probs=31.6
Q ss_pred eEEEEecCCCeEEEecchhhcccccccccCCC----CCCeEEEEe
Q psy4937 292 VVIRIPMPARSLLVLWGEARYEWQHCVLRSDV----IGRRVCIAY 332 (356)
Q Consensus 292 ~~v~I~Lp~rSLLVMsG~ARy~W~H~I~~~dv----r~rRVSLTf 332 (356)
.-+++.+.+||+|++-|. ...|++.+-+- .+.|+||.|
T Consensus 126 ~g~~~~~~~GtVl~~~~~---~~~Hgvtpv~~~~~~~~~R~slvf 167 (171)
T PF12851_consen 126 LGVAFAYQPGTVLIFCAK---RELHGVTPVESPNRNHGTRISLVF 167 (171)
T ss_pred CCEEEecCCCcEEEEccc---ceeeecCcccCCCCCCCeEEEEEE
Confidence 458899999999999986 57899998431 399999987
No 27
>PF14205 Cys_rich_KTR: Cysteine-rich KTR
Probab=67.11 E-value=3.3 Score=30.97 Aligned_cols=10 Identities=40% Similarity=1.252 Sum_probs=8.2
Q ss_pred cccccCCCCc
Q psy4937 36 TYIYCPHCNK 45 (356)
Q Consensus 36 ~~~~~~~~~~ 45 (356)
..+.||.|+.
T Consensus 3 ~Wi~CP~Cgn 12 (55)
T PF14205_consen 3 EWILCPICGN 12 (55)
T ss_pred eEEECCCCCC
Confidence 4689999985
No 28
>PF13759 2OG-FeII_Oxy_5: Putative 2OG-Fe(II) oxygenase; PDB: 3BVC_B 2RG4_A.
Probab=66.55 E-value=8.6 Score=31.15 Aligned_cols=38 Identities=16% Similarity=0.121 Sum_probs=24.6
Q ss_pred ceEEEEecCCCeEEEecchhhcccccccccCCCCCCeEEEEe
Q psy4937 291 SVVIRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAY 332 (356)
Q Consensus 291 ~~~v~I~Lp~rSLLVMsG~ARy~W~H~I~~~dvr~rRVSLTf 332 (356)
.....+....|.|+|+-+-.. |++.+..-.+.||||.|
T Consensus 63 ~~~~~~~p~~G~lvlFPs~l~----H~v~p~~~~~~Risisf 100 (101)
T PF13759_consen 63 SPYYIVEPEEGDLVLFPSWLW----HGVPPNNSDEERISISF 100 (101)
T ss_dssp -SEEEE---TTEEEEEETTSE----EEE----SSS-EEEEEE
T ss_pred CceEEeCCCCCEEEEeCCCCE----EeccCcCCCCCEEEEEc
Confidence 456778889999999997654 99999877899999987
No 29
>PF10571 UPF0547: Uncharacterised protein family UPF0547; InterPro: IPR018886 This domain may well be a type of zinc-finger as it carries two pairs of highly conserved cysteine residues though with no accompanying histidines. Several members are annotated as putative helicases.
Probab=55.45 E-value=7.2 Score=24.71 Aligned_cols=15 Identities=33% Similarity=1.012 Sum_probs=10.4
Q ss_pred ccceeeccccccccc
Q psy4937 83 KETYVYCPHCNKAWQ 97 (356)
Q Consensus 83 ~~~~~~c~~c~~~~~ 97 (356)
..+-.+||+|+..|.
T Consensus 11 ~~~~~~Cp~CG~~F~ 25 (26)
T PF10571_consen 11 PESAKFCPHCGYDFE 25 (26)
T ss_pred hhhcCcCCCCCCCCc
Confidence 446667888887653
No 30
>PF12773 DZR: Double zinc ribbon
Probab=54.91 E-value=8.3 Score=27.26 Aligned_cols=11 Identities=27% Similarity=0.872 Sum_probs=5.3
Q ss_pred ceeeccccccc
Q psy4937 85 TYVYCPHCNKA 95 (356)
Q Consensus 85 ~~~~c~~c~~~ 95 (356)
...+|+.|+..
T Consensus 28 ~~~~C~~Cg~~ 38 (50)
T PF12773_consen 28 SKKICPNCGAE 38 (50)
T ss_pred CCCCCcCCcCC
Confidence 34455555554
No 31
>PLN03001 oxidoreductase, 2OG-Fe(II) oxygenase family protein
Probab=54.90 E-value=46 Score=32.06 Aligned_cols=42 Identities=12% Similarity=0.154 Sum_probs=33.7
Q ss_pred ceEEEEecCCCeEEEecchhhcccccccccCC-------CCCCeEEEEe
Q psy4937 291 SVVIRIPMPARSLLVLWGEARYEWQHCVLRSD-------VIGRRVCIAY 332 (356)
Q Consensus 291 ~~~v~I~Lp~rSLLVMsG~ARy~W~H~I~~~d-------vr~rRVSLTf 332 (356)
+..+.|.-.+++++|.-|+.=-.|..+.-+.- -...|+||.|
T Consensus 161 g~Wi~V~p~p~a~vVNiGD~l~~~tng~~~S~~HRVv~~~~~~R~Sia~ 209 (262)
T PLN03001 161 AEWLMVPPISDAILIIIADQTEIITNGNYKSAQHRAIANANKARLSVAT 209 (262)
T ss_pred CeEEECCCCCCcEEEEccHHHHHHhCCccccccceEEcCCCCCEEEEEE
Confidence 56788888899999999999999996655431 1456999998
No 32
>COG3809 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=54.02 E-value=6.8 Score=31.53 Aligned_cols=13 Identities=38% Similarity=1.209 Sum_probs=10.2
Q ss_pred ceeeccccccccc
Q psy4937 85 TYVYCPHCNKAWQ 97 (356)
Q Consensus 85 ~~~~c~~c~~~~~ 97 (356)
.+-|||.|.-.|-
T Consensus 20 EiD~CPrCrGVWL 32 (88)
T COG3809 20 EIDYCPRCRGVWL 32 (88)
T ss_pred eeeeCCccccEee
Confidence 5779999996654
No 33
>PRK14892 putative transcription elongation factor Elf1; Provisional
Probab=51.29 E-value=4 Score=34.06 Aligned_cols=39 Identities=10% Similarity=0.109 Sum_probs=23.4
Q ss_pred cceeeccccccccccCccccccCCCCCCCCCCCCCCeEEeCCCCCHHHHHHH
Q psy4937 84 ETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDFLSEAETEDV 135 (356)
Q Consensus 84 ~~~~~c~~c~~~~~g~~~~~~~~~~~h~~~~~~ipGl~~i~dFIS~~Ee~~L 135 (356)
.....|+.|+.-+ |.... ....-|-++.+||++=++-+|
T Consensus 40 ~~h~~C~~CG~y~-~~~V~------------~l~epIDVY~~wiD~~~eg~i 78 (99)
T PRK14892 40 IAIITCGNCGLYT-EFEVP------------SVYDEVDVYNKFIDLYLEGKI 78 (99)
T ss_pred cceEECCCCCCcc-CEECC------------ccccchhhHHHHHHHHHhcCC
Confidence 5788999999554 34322 122335677777776655443
No 34
>PLN02984 oxidoreductase, 2OG-Fe(II) oxygenase family protein
Probab=51.10 E-value=60 Score=32.55 Aligned_cols=42 Identities=21% Similarity=0.201 Sum_probs=33.2
Q ss_pred ceEEEEecCCCeEEEecchhhcccc--------cccccCCCCCCeEEEEe
Q psy4937 291 SVVIRIPMPARSLLVLWGEARYEWQ--------HCVLRSDVIGRRVCIAY 332 (356)
Q Consensus 291 ~~~v~I~Lp~rSLLVMsG~ARy~W~--------H~I~~~dvr~rRVSLTf 332 (356)
+..+.|+-.+++++|.-|+.=-.|+ |.|........|+||.|
T Consensus 245 g~Wv~V~p~pgalVVNiGD~Le~wTNg~~kSt~HRVv~~~~~~~R~Sia~ 294 (341)
T PLN02984 245 GEWFNVKPIANTLVVNLGDMMQVISDDEYKSVLHRVGKRNKKKERYSICY 294 (341)
T ss_pred CceEECCCCCCeEEEECChhhhhhcCCeeeCCCCccccCCCCCCeEEEEE
Confidence 6688999999999999999999998 55521112456999998
No 35
>COG4306 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=48.74 E-value=8.5 Score=33.70 Aligned_cols=77 Identities=18% Similarity=0.267 Sum_probs=44.9
Q ss_pred cccccccCCCCcccccCccccccCCCCCCCCccccccccccccccccccccceeeccccccccccCccccccCCCCCCCC
Q psy4937 34 EETYIYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGK 113 (356)
Q Consensus 34 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~c~~c~~~~~g~~~~~~~~~~~h~~~ 113 (356)
+.|-..||.||-+-.+++-++--..-. -..+.-+||..|++-|| |...++ .|.
T Consensus 36 eati~qcp~csasirgd~~vegvlglg--------------------~dye~psfchncgs~fp-wterki------aga 88 (160)
T COG4306 36 EATITQCPICSASIRGDYYVEGVLGLG--------------------GDYEPPSFCHNCGSRFP-WTERKI------AGA 88 (160)
T ss_pred hHHHhcCCccCCcccccceeeeeeccC--------------------CCCCCcchhhcCCCCCC-cHHHHH------hHH
Confidence 667788999988666654443211111 13557799999999999 865543 111
Q ss_pred CCCCCCeEEeCCCCCHHHHHHHHHHhhc
Q psy4937 114 SFDIGGVYIKEDFLSEAETEDVMKQLDD 141 (356)
Q Consensus 114 ~~~ipGl~~i~dFIS~~Ee~~Ll~~id~ 141 (356)
.. |.=-..=|+++|-+++-..+-+
T Consensus 89 ~e----lvea~~~l~pdevqqf~tdlt~ 112 (160)
T COG4306 89 VE----LVEAGENLNPDEVQQFRTDLTD 112 (160)
T ss_pred HH----HHHccccCCHHHHHHHHhhHHH
Confidence 10 0001123578888777666543
No 36
>PF01927 Mut7-C: Mut7-C RNAse domain; InterPro: IPR002782 This prokaryotic family of proteins have no known function. The proteins contain four conserved cysteines that may be involved in metal binding or disulphide bridges.
Probab=45.06 E-value=15 Score=32.24 Aligned_cols=15 Identities=33% Similarity=0.884 Sum_probs=12.7
Q ss_pred cccceeecccccccc
Q psy4937 82 TKETYVYCPHCNKAW 96 (356)
Q Consensus 82 ~~~~~~~c~~c~~~~ 96 (356)
....|..|+.|+++|
T Consensus 120 ~~~~f~~C~~C~kiy 134 (147)
T PF01927_consen 120 TYDEFWRCPGCGKIY 134 (147)
T ss_pred cCCeEEECCCCCCEe
Confidence 455699999999997
No 37
>PLN02904 oxidoreductase
Probab=45.02 E-value=76 Score=31.92 Aligned_cols=42 Identities=17% Similarity=0.177 Sum_probs=33.4
Q ss_pred ceEEEEecCCCeEEEecchhhcccccccccC---C----CCCCeEEEEe
Q psy4937 291 SVVIRIPMPARSLLVLWGEARYEWQHCVLRS---D----VIGRRVCIAY 332 (356)
Q Consensus 291 ~~~v~I~Lp~rSLLVMsG~ARy~W~H~I~~~---d----vr~rRVSLTf 332 (356)
+..+.|+-.+++|+|.-|+.=-.|.-+.-+. . ....|+||+|
T Consensus 253 g~Wi~V~p~pgalVVNiGD~Le~~TNG~~kSt~HRVv~~~~~~R~Si~~ 301 (357)
T PLN02904 253 KNWVCVPYIEGALIVQLGDQVEVMSNGIYKSVVHRVTVNKDYKRLSFAS 301 (357)
T ss_pred CCEEECCCCCCeEEEEccHHHHHHhCCeeeccCCcccCCCCCCEEEEEE
Confidence 5688899999999999999988887665543 1 1456999998
No 38
>PRK14559 putative protein serine/threonine phosphatase; Provisional
Probab=43.82 E-value=11 Score=41.19 Aligned_cols=55 Identities=15% Similarity=0.175 Sum_probs=29.3
Q ss_pred ccceeeccccccccc-cCccccccCCCCCC-----------CCCCCCCCeEEeCCCCCHHHHHHHHH
Q psy4937 83 KETYVYCPHCNKAWQ-GSDLYAYQSHPNHS-----------GKSFDIGGVYIKEDFLSEAETEDVMK 137 (356)
Q Consensus 83 ~~~~~~c~~c~~~~~-g~~~~~~~~~~~h~-----------~~~~~ipGl~~i~dFIS~~Ee~~Ll~ 137 (356)
....-||++|+.... -|-.--.+..++|. ......+....+..||++.+-=+|++
T Consensus 38 ~~~~~fC~~CG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ld~~~ry~l~~ 104 (645)
T PRK14559 38 PVDEAHCPNCGAETGTIWWAIIAQASPNSEVLESGEATQQSESSLTPSSSPLYGSYLDPGQRYQLLA 104 (645)
T ss_pred CcccccccccCCcccchhhhhcccccccccccccccccccccccCCCcccchhhhhcCcccceEecc
Confidence 445669999997643 12222222233332 12344455677777777766555554
No 39
>PLN02299 1-aminocyclopropane-1-carboxylate oxidase
Probab=43.56 E-value=55 Score=32.40 Aligned_cols=42 Identities=21% Similarity=0.201 Sum_probs=34.6
Q ss_pred ceEEEEecCCCeEEEecchhhcccccccccCC-------CCCCeEEEEe
Q psy4937 291 SVVIRIPMPARSLLVLWGEARYEWQHCVLRSD-------VIGRRVCIAY 332 (356)
Q Consensus 291 ~~~v~I~Lp~rSLLVMsG~ARy~W~H~I~~~d-------vr~rRVSLTf 332 (356)
+..+.|+-.+++|+|.-|++=-.|+.+..+.- ..+.|+||+|
T Consensus 204 g~Wi~V~p~pg~lvVNiGD~l~~~Tng~~kS~~HRVv~~~~~~R~Si~~ 252 (321)
T PLN02299 204 GEWVDVPPMRHSIVVNLGDQLEVITNGKYKSVMHRVVAQTDGNRMSIAS 252 (321)
T ss_pred CeEEECCCCCCeEEEEeCHHHHHHhCCceecccceeecCCCCCEEEEEE
Confidence 56788998999999999999999998776641 1456999998
No 40
>PF08209 Sgf11: Sgf11 (transcriptional regulation protein); InterPro: IPR013246 The Sgf11 family is a SAGA complex subunit in Saccharomyces cerevisiae (Baker's yeast). The SAGA complex is a multisubunit protein complex involved in transcriptional regulation. SAGA combines proteins involved in interactions with DNA-bound activators and TATA-binding protein (TBP), as well as enzymes for histone acetylation and deubiquitylation [].; PDB: 3M99_B 2LO2_A 3MHH_C 3MHS_C.
Probab=42.48 E-value=9.5 Score=25.63 Aligned_cols=15 Identities=33% Similarity=0.964 Sum_probs=10.3
Q ss_pred cccccCCCCcccccC
Q psy4937 36 TYIYCPHCNKAWQGS 50 (356)
Q Consensus 36 ~~~~~~~~~~~~~~~ 50 (356)
.|+.||+|++...++
T Consensus 3 ~~~~C~nC~R~v~a~ 17 (33)
T PF08209_consen 3 PYVECPNCGRPVAAS 17 (33)
T ss_dssp -EEE-TTTSSEEEGG
T ss_pred CeEECCCCcCCcchh
Confidence 478999999975554
No 41
>PF01927 Mut7-C: Mut7-C RNAse domain; InterPro: IPR002782 This prokaryotic family of proteins have no known function. The proteins contain four conserved cysteines that may be involved in metal binding or disulphide bridges.
Probab=42.20 E-value=13 Score=32.60 Aligned_cols=39 Identities=33% Similarity=0.706 Sum_probs=25.6
Q ss_pred eeccccccccCcccccccc--------cccccccccCCCCcc-cccCc
Q psy4937 13 RTCIKCEQEFGYENKNIVE--------FTEETYIYCPHCNKA-WQGSD 51 (356)
Q Consensus 13 r~c~~ce~~~~~~~~~~~~--------~~~~~~~~~~~~~~~-~~~~~ 51 (356)
..|..|+.......++.++ .+...+-.||.|+|- |.||-
T Consensus 92 sRC~~CN~~L~~v~~~~v~~~vp~~v~~~~~~f~~C~~C~kiyW~GsH 139 (147)
T PF01927_consen 92 SRCPKCNGPLRPVSKEEVKDRVPPYVYETYDEFWRCPGCGKIYWEGSH 139 (147)
T ss_pred CccCCCCcEeeechhhccccccCccccccCCeEEECCCCCCEeccccc
Confidence 5699999866544332221 123347789999998 98874
No 42
>PLN02485 oxidoreductase
Probab=41.88 E-value=74 Score=31.39 Aligned_cols=42 Identities=21% Similarity=0.285 Sum_probs=33.1
Q ss_pred ceEEEEecCCCeEEEecchhhcccccccccCC-------CCCCeEEEEe
Q psy4937 291 SVVIRIPMPARSLLVLWGEARYEWQHCVLRSD-------VIGRRVCIAY 332 (356)
Q Consensus 291 ~~~v~I~Lp~rSLLVMsG~ARy~W~H~I~~~d-------vr~rRVSLTf 332 (356)
+..+.|+-.+++++|.-|++=-.|+.+.-+.- -...|+||.|
T Consensus 235 g~Wi~V~p~pg~~vVNiGD~L~~~TnG~~~St~HRVv~~~~~~R~Si~~ 283 (329)
T PLN02485 235 GEWIWAIPIPGTFVCNIGDMLKIWSNGVYQSTLHRVINNSPKYRVCVAF 283 (329)
T ss_pred CcEEECCCCCCcEEEEhHHHHHHHHCCEeeCCCceecCCCCCCeEEEEE
Confidence 56888999999999999999999985544430 1346999998
No 43
>PLN00417 oxidoreductase, 2OG-Fe(II) oxygenase family protein
Probab=41.04 E-value=81 Score=31.61 Aligned_cols=42 Identities=14% Similarity=0.202 Sum_probs=34.2
Q ss_pred ceEEEEecCCCeEEEecchhhcccccccccCC-------CCCCeEEEEe
Q psy4937 291 SVVIRIPMPARSLLVLWGEARYEWQHCVLRSD-------VIGRRVCIAY 332 (356)
Q Consensus 291 ~~~v~I~Lp~rSLLVMsG~ARy~W~H~I~~~d-------vr~rRVSLTf 332 (356)
+..+.|+-.+++|+|.-|+.=-.|+.+.-+.- ....|+||.|
T Consensus 249 g~Wi~V~p~pg~lVVNiGD~Le~~Tng~~kSt~HRVv~~~~~~R~Si~f 297 (348)
T PLN00417 249 GKWYKAPIVPDTILINVGDQMEIMSNGIYKSPVHRVVTNREKERISVAT 297 (348)
T ss_pred CeEEECCCCCCcEEEEcChHHHHHhCCeecccceEEecCCCCCEEEEEE
Confidence 56788999999999999999999987666541 1456999998
No 44
>COG4888 Uncharacterized Zn ribbon-containing protein [General function prediction only]
Probab=41.03 E-value=7.2 Score=32.73 Aligned_cols=62 Identities=11% Similarity=0.178 Sum_probs=35.7
Q ss_pred cccccccCCCCcccccCccccccCCCCCCCCccccccccccccccccccccceeeccccccccccCccccccCCCCCCCC
Q psy4937 34 EETYIYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGK 113 (356)
Q Consensus 34 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~c~~c~~~~~g~~~~~~~~~~~h~~~ 113 (356)
-.++.+||+||.-. .|+|+-.. +..-.+.-|-.|+.-+. .+ .
T Consensus 19 L~k~FtCp~Cghe~-------------vs~ctvkk------------~~~~g~~~Cg~CGls~e-~e------------v 60 (104)
T COG4888 19 LPKTFTCPRCGHEK-------------VSSCTVKK------------TVNIGTAVCGNCGLSFE-CE------------V 60 (104)
T ss_pred CCceEecCccCCee-------------eeEEEEEe------------cCceeEEEcccCcceEE-Ee------------c
Confidence 46889999999832 33333111 13456778888886654 21 1
Q ss_pred CCCCCCeEEeCCCCCHHHHH
Q psy4937 114 SFDIGGVYIKEDFLSEAETE 133 (356)
Q Consensus 114 ~~~ipGl~~i~dFIS~~Ee~ 133 (356)
+....++-++.+|++.-.|-
T Consensus 61 ~~l~~~vDvYs~wvDay~eg 80 (104)
T COG4888 61 PELSEPVDVYSAWVDAYLEG 80 (104)
T ss_pred cccccchhHHHHHHHHHHhc
Confidence 22344566666776654443
No 45
>PF08792 A2L_zn_ribbon: A2L zinc ribbon domain; InterPro: IPR014900 This zinc ribbon protein is found associated with some viral A2L transcription factors [].
Probab=37.45 E-value=22 Score=23.72 Aligned_cols=16 Identities=19% Similarity=0.711 Sum_probs=12.8
Q ss_pred cccceeeccccccccc
Q psy4937 82 TKETYVYCPHCNKAWQ 97 (356)
Q Consensus 82 ~~~~~~~c~~c~~~~~ 97 (356)
+...+.+|+.|+..++
T Consensus 17 ~~~~~~~C~~Cg~~~~ 32 (33)
T PF08792_consen 17 KEDDYEVCIFCGSSFP 32 (33)
T ss_pred ecCCeEEcccCCcEee
Confidence 5668899999997764
No 46
>PLN02912 oxidoreductase, 2OG-Fe(II) oxygenase family protein
Probab=37.27 E-value=1.3e+02 Score=30.03 Aligned_cols=42 Identities=14% Similarity=0.119 Sum_probs=34.1
Q ss_pred ceEEEEecCCCeEEEecchhhcccccccccC-------CCCCCeEEEEe
Q psy4937 291 SVVIRIPMPARSLLVLWGEARYEWQHCVLRS-------DVIGRRVCIAY 332 (356)
Q Consensus 291 ~~~v~I~Lp~rSLLVMsG~ARy~W~H~I~~~-------dvr~rRVSLTf 332 (356)
+..+.|+-.+++|+|--|++=-.|..+.-+. .....|+||+|
T Consensus 242 g~Wi~V~p~pgalvVNiGD~L~~~TNG~~kSt~HRVv~~~~~~R~Sia~ 290 (348)
T PLN02912 242 GKWIAVNPIPNTFIVNLGDQMQVISNDKYKSVLHRAVVNTDKERISIPT 290 (348)
T ss_pred CcEEECCCcCCeEEEEcCHHHHHHhCCEEEcccccccCCCCCCEEEEEE
Confidence 5688899999999999999988888766653 12566999998
No 47
>PF14353 CpXC: CpXC protein
Probab=36.76 E-value=14 Score=31.27 Aligned_cols=32 Identities=28% Similarity=0.635 Sum_probs=21.9
Q ss_pred eccccccccCcccccccccc------------cccccccCCCCc
Q psy4937 14 TCIKCEQEFGYENKNIVEFT------------EETYIYCPHCNK 45 (356)
Q Consensus 14 ~c~~ce~~~~~~~~~~~~~~------------~~~~~~~~~~~~ 45 (356)
||..|...+..+-...+... ..--++||+|+.
T Consensus 3 tCP~C~~~~~~~v~~~I~~~~~p~l~e~il~g~l~~~~CP~Cg~ 46 (128)
T PF14353_consen 3 TCPHCGHEFEFEVWTSINADEDPELKEKILDGSLFSFTCPSCGH 46 (128)
T ss_pred CCCCCCCeeEEEEEeEEcCcCCHHHHHHHHcCCcCEEECCCCCC
Confidence 78889888877754333332 234678999998
No 48
>PLN02750 oxidoreductase, 2OG-Fe(II) oxygenase family protein
Probab=36.03 E-value=1.6e+02 Score=29.41 Aligned_cols=42 Identities=14% Similarity=0.247 Sum_probs=34.6
Q ss_pred ceEEEEecCCCeEEEecchhhcccccccccC-------CCCCCeEEEEe
Q psy4937 291 SVVIRIPMPARSLLVLWGEARYEWQHCVLRS-------DVIGRRVCIAY 332 (356)
Q Consensus 291 ~~~v~I~Lp~rSLLVMsG~ARy~W~H~I~~~-------dvr~rRVSLTf 332 (356)
+..+.|+-.+++++|--|++=-.|+-+.-+. .....|+||.|
T Consensus 240 g~Wi~V~p~pg~~vVNiGD~L~~~Tng~~~St~HRVv~~~~~~R~Si~~ 288 (345)
T PLN02750 240 GEWIPVKPIPDAFIINIGNCMQVWTNDLYWSAEHRVVVNSQKERFSIPF 288 (345)
T ss_pred CeEEEccCCCCeEEEEhHHHHHHHhCCeeecccceeccCCCCCEEEEEE
Confidence 6688999999999999999998998776653 12567999998
No 49
>COG1326 Uncharacterized archaeal Zn-finger protein [General function prediction only]
Probab=35.43 E-value=22 Score=33.17 Aligned_cols=16 Identities=25% Similarity=0.613 Sum_probs=13.2
Q ss_pred cccceeeccccccccc
Q psy4937 82 TKETYVYCPHCNKAWQ 97 (356)
Q Consensus 82 ~~~~~~~c~~c~~~~~ 97 (356)
+.+..+-|..|+..++
T Consensus 26 g~~~lvrC~eCG~V~~ 41 (201)
T COG1326 26 GREPLVRCEECGTVHP 41 (201)
T ss_pred CCceEEEccCCCcEee
Confidence 4557889999999986
No 50
>PTZ00273 oxidase reductase; Provisional
Probab=34.65 E-value=1.4e+02 Score=29.27 Aligned_cols=42 Identities=19% Similarity=0.260 Sum_probs=33.7
Q ss_pred ceEEEEecCCCeEEEecchhhcccccccccCC------CCCCeEEEEe
Q psy4937 291 SVVIRIPMPARSLLVLWGEARYEWQHCVLRSD------VIGRRVCIAY 332 (356)
Q Consensus 291 ~~~v~I~Lp~rSLLVMsG~ARy~W~H~I~~~d------vr~rRVSLTf 332 (356)
+..+.|+-.+++|+|.-|++=-.|.-+.-+.- ....|+||.|
T Consensus 224 g~Wi~V~p~pg~lvVNvGD~l~~~TnG~~kSt~HRVv~~~~~R~Si~~ 271 (320)
T PTZ00273 224 GEWMDVPPLEGSFVVNIGDMMEMWSNGRYRSTPHRVVNTGVERYSMPF 271 (320)
T ss_pred CCEEeCCCCCCeEEEEHHHHHHHHHCCeeeCCCccccCCCCCeEEEEE
Confidence 56788999999999999999888877665541 1456999998
No 51
>cd04476 RPA1_DBD_C RPA1_DBD_C: A subfamily of OB folds corresponding to the C-terminal OB fold, the ssDNA-binding domain (DBD)-C, of human RPA1 (also called RPA70). RPA1 is the large subunit of Replication protein A (RPA). RPA is a nuclear ssDNA-binding protein (SSB) which appears to be involved in all aspects of DNA metabolism including replication, recombination, and repair. RPA also mediates specific interactions of various nuclear proteins. In animals, plants, and fungi, RPA is a heterotrimer with subunits of 70KDa (RPA1), 32kDa (RPA2), and 14 KDa (RPA3). In addition to DBD-C, RPA1 contains three other OB folds: DBD-A, DBD-B, and RPA1N. The major DNA binding activity of RPA is associated with RPA1 DBD-A and DBD-B. RPA1 DBD-C is involved in DNA binding and trimerization. It contains two structural insertions not found to date in other OB-folds: a zinc ribbon and a three-helix bundle. RPA1 DBD-C also contains a Cys4-type zinc-binding motif, which plays a role in the ssDNA binding fun
Probab=33.53 E-value=24 Score=31.19 Aligned_cols=11 Identities=55% Similarity=0.951 Sum_probs=9.9
Q ss_pred ccccccCCCCc
Q psy4937 35 ETYIYCPHCNK 45 (356)
Q Consensus 35 ~~~~~~~~~~~ 45 (356)
..|.-||+|+|
T Consensus 32 ~~Y~aC~~C~k 42 (166)
T cd04476 32 WWYPACPGCNK 42 (166)
T ss_pred eEEccccccCc
Confidence 57999999999
No 52
>PLN02947 oxidoreductase
Probab=33.21 E-value=1.5e+02 Score=30.02 Aligned_cols=42 Identities=14% Similarity=0.131 Sum_probs=33.9
Q ss_pred ceEEEEecCCCeEEEecchhhcccccccccC-------CCCCCeEEEEe
Q psy4937 291 SVVIRIPMPARSLLVLWGEARYEWQHCVLRS-------DVIGRRVCIAY 332 (356)
Q Consensus 291 ~~~v~I~Lp~rSLLVMsG~ARy~W~H~I~~~-------dvr~rRVSLTf 332 (356)
+..+.|+-.+++++|--|+.=-.|.-+.-+. .-...|+||.|
T Consensus 270 g~Wi~V~p~pga~VVNvGD~Lq~~SNG~~kS~~HRVv~~~~~~R~Sia~ 318 (374)
T PLN02947 270 GRWVTVEPIPGSFVVNVGDHLEIFSNGRYKSVLHRVRVNSTKPRISVAS 318 (374)
T ss_pred CEEEeCCCCCCeEEEEeCceeeeeeCCEEeccccccccCCCCCEEEEEE
Confidence 5688899999999999999988888666554 11457999998
No 53
>PLN02393 leucoanthocyanidin dioxygenase like protein
Probab=32.74 E-value=1.4e+02 Score=30.12 Aligned_cols=42 Identities=17% Similarity=0.280 Sum_probs=33.3
Q ss_pred ceEEEEecCCCeEEEecchhhcccccccccC-------CCCCCeEEEEe
Q psy4937 291 SVVIRIPMPARSLLVLWGEARYEWQHCVLRS-------DVIGRRVCIAY 332 (356)
Q Consensus 291 ~~~v~I~Lp~rSLLVMsG~ARy~W~H~I~~~-------dvr~rRVSLTf 332 (356)
+..+.|+-.+++|+|.-|++=-.|.-+.-+. .....|+|+.|
T Consensus 259 g~W~~V~p~pgalVVNiGD~l~~~Tng~~kSt~HRVv~~~~~~R~Siaf 307 (362)
T PLN02393 259 DAWITVKPVPDAFIVNIGDQIQVLSNAIYKSVEHRVIVNSAKERVSLAF 307 (362)
T ss_pred CEEEECCCCCCeEEEEcchhhHhhcCCeeeccceecccCCCCCEEEEEE
Confidence 5688899999999999999998886555543 11456999998
No 54
>PRK04023 DNA polymerase II large subunit; Validated
Probab=31.95 E-value=27 Score=39.95 Aligned_cols=22 Identities=41% Similarity=0.975 Sum_probs=15.9
Q ss_pred ceeeccccccccCcccccccccccccccccCCCCc
Q psy4937 11 GVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNK 45 (356)
Q Consensus 11 g~r~c~~ce~~~~~~~~~~~~~~~~~~~~~~~~~~ 45 (356)
|.|+|-.|-... .+..||+|+.
T Consensus 625 g~RfCpsCG~~t-------------~~frCP~CG~ 646 (1121)
T PRK04023 625 GRRKCPSCGKET-------------FYRRCPFCGT 646 (1121)
T ss_pred cCccCCCCCCcC-------------CcccCCCCCC
Confidence 678888888652 3457888887
No 55
>PF09538 FYDLN_acid: Protein of unknown function (FYDLN_acid); InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=31.12 E-value=29 Score=29.30 Aligned_cols=13 Identities=23% Similarity=0.871 Sum_probs=10.3
Q ss_pred ceeeccccccccc
Q psy4937 85 TYVYCPHCNKAWQ 97 (356)
Q Consensus 85 ~~~~c~~c~~~~~ 97 (356)
.-+-||+|+..|+
T Consensus 25 ~PivCP~CG~~~~ 37 (108)
T PF09538_consen 25 DPIVCPKCGTEFP 37 (108)
T ss_pred CCccCCCCCCccC
Confidence 3466999998886
No 56
>PF10083 DUF2321: Uncharacterized protein conserved in bacteria (DUF2321); InterPro: IPR016891 This entry is represented by Bacteriophage 'Lactobacillus prophage Lj928', Orf-Ljo1454. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=30.30 E-value=17 Score=32.85 Aligned_cols=47 Identities=26% Similarity=0.675 Sum_probs=34.0
Q ss_pred cccccceeeccccccccCccc--cccccc--ccccccccCCCCcc--cccCcc
Q psy4937 6 FCGCKGVRTCIKCEQEFGYEN--KNIVEF--TEETYIYCPHCNKA--WQGSDL 52 (356)
Q Consensus 6 ~c~ckg~r~c~~ce~~~~~~~--~~~~~~--~~~~~~~~~~~~~~--~~~~~~ 52 (356)
.||=|.|+.|..|...-+..- ...+.| .-+.--||-+|+++ |-.+.|
T Consensus 33 kCG~~tI~~Cp~C~~~IrG~y~v~gv~~~g~~~~~PsYC~~CGkpyPWt~~~L 85 (158)
T PF10083_consen 33 KCGAKTITSCPNCSTPIRGDYHVEGVFGLGGHYEAPSYCHNCGKPYPWTENAL 85 (158)
T ss_pred HhhHHHHHHCcCCCCCCCCceecCCeeeeCCCCCCChhHHhCCCCCchHHHHH
Confidence 599999999999999887772 334444 23455699999997 654433
No 57
>TIGR02466 conserved hypothetical protein. This family consists of uncharacterized proteins in Caulobacter crescentus CB15, Bdellovibrio bacteriovorus HD100, Synechococcus sp. WH 8102 (2), Silicibacter pomeroyi DSS-3 (2), and Hyphomonas neptunium ATCC 15444. The context of nearby genes differs substantially between members and does point to any specific biological role.
Probab=30.23 E-value=72 Score=29.69 Aligned_cols=36 Identities=17% Similarity=0.308 Sum_probs=29.2
Q ss_pred EEEEecCCCeEEEecchhhcccccccccCCCCCCeEEEEe
Q psy4937 293 VIRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAY 332 (356)
Q Consensus 293 ~v~I~Lp~rSLLVMsG~ARy~W~H~I~~~dvr~rRVSLTf 332 (356)
.+.+.-.+|.|+|+-.-.+ |++++-.-.+.||||.|
T Consensus 161 ~~~v~P~~G~lvlFPS~L~----H~v~p~~~~~~RISiSF 196 (201)
T TIGR02466 161 FVYVPPQEGRVLLFESWLR----HEVPPNESEEERISVSF 196 (201)
T ss_pred cEEECCCCCeEEEECCCCc----eecCCCCCCCCEEEEEE
Confidence 3456668899988887654 99999877899999998
No 58
>PLN02515 naringenin,2-oxoglutarate 3-dioxygenase
Probab=30.06 E-value=2.2e+02 Score=28.67 Aligned_cols=41 Identities=15% Similarity=0.119 Sum_probs=32.8
Q ss_pred eEEEEecCCCeEEEecchhhcccccccccC---C----CCCCeEEEEe
Q psy4937 292 VVIRIPMPARSLLVLWGEARYEWQHCVLRS---D----VIGRRVCIAY 332 (356)
Q Consensus 292 ~~v~I~Lp~rSLLVMsG~ARy~W~H~I~~~---d----vr~rRVSLTf 332 (356)
..+.|+-.+++|+|.-|+.=-.|+-+.-+. . ....|+||.|
T Consensus 243 ~Wi~Vpp~pgalVVNiGD~L~~~TNG~~kSt~HRVv~~~~~~R~Si~~ 290 (358)
T PLN02515 243 TWITVQPVEGAFVVNLGDHGHYLSNGRFKNADHQAVVNSNCSRLSIAT 290 (358)
T ss_pred eEEECCCCCCeEEEEccHHHHHHhCCeeeeecceEECCCCCCEEEEEE
Confidence 578899999999999999988897665543 1 1456999998
No 59
>COG2850 Uncharacterized conserved protein [Function unknown]
Probab=29.61 E-value=80 Score=32.36 Aligned_cols=55 Identities=13% Similarity=0.315 Sum_probs=34.6
Q ss_pred HHHHHHHHhhcccCCCCCCcceeeeEecCCCCCCccCCCCCcccccCceEEEecCCceEEEEeec
Q psy4937 179 TKFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYY 243 (356)
Q Consensus 179 l~~l~~Ri~~~~~l~~f~~~~q~~leY~pg~G~~I~pHvDd~~~~Ge~IaslSLgs~~vm~f~~~ 243 (356)
+..+.++++ .++.+ +.+.+.+.|.+.+| +.+||.|.- .|..|=.-+.+.......
T Consensus 103 v~~l~~~Fr---flP~w-r~ddiMIS~a~~GG-gvg~H~D~Y-----DVfliQg~G~RRW~v~~~ 157 (383)
T COG2850 103 VAALMEPFR---FLPDW-RIDDIMISFAAPGG-GVGPHFDQY-----DVFLIQGQGRRRWRVGKK 157 (383)
T ss_pred HHHHHHHhc---cCccc-cccceEEEEecCCC-ccCccccch-----heeEEeecccceeecCCc
Confidence 444444444 34444 45778888887654 899999962 455666666666665544
No 60
>PRK12380 hydrogenase nickel incorporation protein HybF; Provisional
Probab=29.61 E-value=38 Score=28.56 Aligned_cols=27 Identities=26% Similarity=0.609 Sum_probs=18.2
Q ss_pred ceeeccccccccCcccccccccccccccccCCCCcc
Q psy4937 11 GVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKA 46 (356)
Q Consensus 11 g~r~c~~ce~~~~~~~~~~~~~~~~~~~~~~~~~~~ 46 (356)
+.-.|..|...+..... +..||+|+-+
T Consensus 69 ~~~~C~~Cg~~~~~~~~---------~~~CP~Cgs~ 95 (113)
T PRK12380 69 AQAWCWDCSQVVEIHQH---------DAQCPHCHGE 95 (113)
T ss_pred cEEEcccCCCEEecCCc---------CccCcCCCCC
Confidence 56679999866554432 3459999963
No 61
>COG1656 Uncharacterized conserved protein [Function unknown]
Probab=29.44 E-value=25 Score=31.95 Aligned_cols=38 Identities=29% Similarity=0.734 Sum_probs=25.7
Q ss_pred eccccccccCcccccccccccc--------cccccCCCCcc-cccCc
Q psy4937 14 TCIKCEQEFGYENKNIVEFTEE--------TYIYCPHCNKA-WQGSD 51 (356)
Q Consensus 14 ~c~~ce~~~~~~~~~~~~~~~~--------~~~~~~~~~~~-~~~~~ 51 (356)
.|..|+..-..-+...|+..+. ..-.||.|++- |.||-
T Consensus 99 RCp~CN~~L~~vs~eev~~~Vp~~~~~~~~~f~~C~~CgkiYW~GsH 145 (165)
T COG1656 99 RCPECNGELEKVSREEVKEKVPEKVYRNYEEFYRCPKCGKIYWKGSH 145 (165)
T ss_pred cCcccCCEeccCcHHHHhhccchhhhhcccceeECCCCcccccCchH
Confidence 5889988766665545444433 23459999998 88873
No 62
>PF01155 HypA: Hydrogenase expression/synthesis hypA family; InterPro: IPR000688 Bacterial membrane-bound nickel-dependent hydrogenases requires a number of accessory proteins which are involved in their maturation. The exact role of these proteins is not yet clear, but some seem to be required for the incorporation of the nickel ions []. One of these proteins is generally known as hypA. It is a protein of about 12 to 14 kDa that contains, in its C-terminal region, four conserved cysteines that form a zinc-finger like motif. Escherichia coli has two proteins that belong to this family, hypA and hybF. A homologue, MJ0214, has also been found in a number of archaeal species, including the genome of Methanocaldococcus jannaschii (Methanococcus jannaschii).; GO: 0016151 nickel ion binding, 0006464 protein modification process; PDB: 2KDX_A 3A44_D 3A43_B.
Probab=29.22 E-value=23 Score=29.76 Aligned_cols=27 Identities=26% Similarity=0.735 Sum_probs=17.9
Q ss_pred ceeeccccccccCcccccccccccccccccCCCCcc
Q psy4937 11 GVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKA 46 (356)
Q Consensus 11 g~r~c~~ce~~~~~~~~~~~~~~~~~~~~~~~~~~~ 46 (356)
..-.|..|...+..+... ..||+|+..
T Consensus 69 ~~~~C~~Cg~~~~~~~~~---------~~CP~Cgs~ 95 (113)
T PF01155_consen 69 ARARCRDCGHEFEPDEFD---------FSCPRCGSP 95 (113)
T ss_dssp -EEEETTTS-EEECHHCC---------HH-SSSSSS
T ss_pred CcEECCCCCCEEecCCCC---------CCCcCCcCC
Confidence 456799999887776553 359999874
No 63
>PLN02997 flavonol synthase
Probab=29.20 E-value=2e+02 Score=28.55 Aligned_cols=42 Identities=10% Similarity=0.142 Sum_probs=32.2
Q ss_pred ceEEEEecCCCeEEEecchhhcccccccccC-------CCCCCeEEEEe
Q psy4937 291 SVVIRIPMPARSLLVLWGEARYEWQHCVLRS-------DVIGRRVCIAY 332 (356)
Q Consensus 291 ~~~v~I~Lp~rSLLVMsG~ARy~W~H~I~~~-------dvr~rRVSLTf 332 (356)
+..+.|+-.+++|+|--|+.=-.|.-|.-+. .-...|+||+|
T Consensus 228 g~Wi~V~p~pgalvVNiGD~Le~~TNG~~kSt~HRVv~~~~~~R~Si~f 276 (325)
T PLN02997 228 EQWLDLNYINSAVVVIIGDQLMRMTNGRFKNVLHRAKTDKERLRISWPV 276 (325)
T ss_pred CcEEECCCCCCeEEEEechHHHHHhCCccccccceeeCCCCCCEEEEEE
Confidence 5578888899999999999998898554432 11345999988
No 64
>PLN02156 gibberellin 2-beta-dioxygenase
Probab=28.31 E-value=1.8e+02 Score=29.05 Aligned_cols=42 Identities=21% Similarity=0.319 Sum_probs=34.8
Q ss_pred ceEEEEecCCCeEEEecchhhcccccccccCC---C----CCCeEEEEe
Q psy4937 291 SVVIRIPMPARSLLVLWGEARYEWQHCVLRSD---V----IGRRVCIAY 332 (356)
Q Consensus 291 ~~~v~I~Lp~rSLLVMsG~ARy~W~H~I~~~d---v----r~rRVSLTf 332 (356)
+..+.|+-.+++++|--|+.=-.|..+.-+.- | ...|+||.|
T Consensus 226 g~Wi~Vpp~pga~VVNiGD~l~~wTNg~~kSt~HRVv~~~~~~R~Siaf 274 (335)
T PLN02156 226 GTWVDVPPDHSSFFVLVGDTLQVMTNGRFKSVKHRVVTNTKRSRISMIY 274 (335)
T ss_pred CCEEEccCCCCcEEEEhHHHHHHHhCCeeeccceeeecCCCCCEEEEEE
Confidence 56888999999999999999999998877641 1 445999998
No 65
>PLN02758 oxidoreductase, 2OG-Fe(II) oxygenase family protein
Probab=28.22 E-value=1.9e+02 Score=29.05 Aligned_cols=42 Identities=12% Similarity=0.088 Sum_probs=33.6
Q ss_pred ceEEEEecCCCeEEEecchhhcccccccccCC---C----CCCeEEEEe
Q psy4937 291 SVVIRIPMPARSLLVLWGEARYEWQHCVLRSD---V----IGRRVCIAY 332 (356)
Q Consensus 291 ~~~v~I~Lp~rSLLVMsG~ARy~W~H~I~~~d---v----r~rRVSLTf 332 (356)
+..+.|+-.+|+++|.-|+.=-.|.-+.-+.- | ...|+||.|
T Consensus 258 g~Wi~V~p~pgalVVNiGD~L~~~SNG~~kS~~HRVv~~~~~~R~Sia~ 306 (361)
T PLN02758 258 NTWVPVHPVPNALVINIGDTLEVLTNGKYKSVEHRAVTNKEKDRLSIVT 306 (361)
T ss_pred CEEEeCCCCCCeEEEEccchhhhhcCCeeecccceeecCCCCCEEEEEE
Confidence 56788999999999999999999976666541 1 456999997
No 66
>PLN02216 protein SRG1
Probab=27.52 E-value=2.6e+02 Score=28.08 Aligned_cols=42 Identities=17% Similarity=0.124 Sum_probs=33.1
Q ss_pred ceEEEEecCCCeEEEecchhhcccccccccC-------CCCCCeEEEEe
Q psy4937 291 SVVIRIPMPARSLLVLWGEARYEWQHCVLRS-------DVIGRRVCIAY 332 (356)
Q Consensus 291 ~~~v~I~Lp~rSLLVMsG~ARy~W~H~I~~~-------dvr~rRVSLTf 332 (356)
+..+.|+-.+++|+|.-|+.=-.|.-+.-+. .....|+||.|
T Consensus 256 g~Wi~V~p~pgalvVNiGD~L~~~TNG~~kS~~HRVv~~~~~~R~Si~~ 304 (357)
T PLN02216 256 GKWVSVKPLPNALVVNVGDILEIITNGTYRSIEHRGVVNSEKERLSVAT 304 (357)
T ss_pred CEEEECCCCCCeEEEEcchhhHhhcCCeeeccCceeecCCCCCEEEEEE
Confidence 6688899999999999999999998544432 11456999998
No 67
>COG0266 Nei Formamidopyrimidine-DNA glycosylase [DNA replication, recombination, and repair]
Probab=27.36 E-value=35 Score=33.48 Aligned_cols=13 Identities=38% Similarity=0.979 Sum_probs=9.8
Q ss_pred cccceeecccccc
Q psy4937 82 TKETYVYCPHCNK 94 (356)
Q Consensus 82 ~~~~~~~c~~c~~ 94 (356)
+--+..|||.|.+
T Consensus 261 ~gR~t~~CP~CQ~ 273 (273)
T COG0266 261 GGRSTFYCPVCQK 273 (273)
T ss_pred cCCcCEeCCCCCC
Confidence 4457789999974
No 68
>PRK14714 DNA polymerase II large subunit; Provisional
Probab=26.55 E-value=51 Score=38.72 Aligned_cols=10 Identities=40% Similarity=1.019 Sum_probs=7.4
Q ss_pred eeeccccccc
Q psy4937 86 YVYCPHCNKA 95 (356)
Q Consensus 86 ~~~c~~c~~~ 95 (356)
..+||.|+.-
T Consensus 709 a~~CP~CGtp 718 (1337)
T PRK14714 709 RVECPRCDVE 718 (1337)
T ss_pred cccCCCCCCc
Confidence 5579999843
No 69
>PLN02254 gibberellin 3-beta-dioxygenase
Probab=26.40 E-value=2.2e+02 Score=28.75 Aligned_cols=42 Identities=21% Similarity=0.194 Sum_probs=33.9
Q ss_pred ceEEEEecCCCeEEEecchhhcccccccccCC-------CCCCeEEEEe
Q psy4937 291 SVVIRIPMPARSLLVLWGEARYEWQHCVLRSD-------VIGRRVCIAY 332 (356)
Q Consensus 291 ~~~v~I~Lp~rSLLVMsG~ARy~W~H~I~~~d-------vr~rRVSLTf 332 (356)
+..+.|+-.+++|+|--|+.=-.|.-+.-+.- -...|+||+|
T Consensus 256 ~~Wi~V~p~pgalVVNiGD~lq~~SNg~~kS~~HRVv~~~~~~R~Sia~ 304 (358)
T PLN02254 256 VGWVTVPPVPGSLVVNVGDLLHILSNGRFPSVLHRAVVNKTRHRISVAY 304 (358)
T ss_pred CEEEEcccCCCCEEEEhHHHHHHHhCCeeccccceeecCCCCCEEEEEE
Confidence 36888999999999999999888987766541 1467999998
No 70
>PF05118 Asp_Arg_Hydrox: Aspartyl/Asparaginyl beta-hydroxylase; InterPro: IPR007803 The alpha-ketoglutarate-dependent dioxygenase aspartyl (asparaginyl) beta-hydroxylase (1.14.11.16 from EC) specifically hydroxylates one aspartic or asparagine residue in certain epidermal growth factor-like domains of a number of proteins. Its action may be due to histidine-675, which, when mutated to an alanine residue, causes the loss of enzymatic activity in the protein []. An invertebrate alpha-ketoglutarate-dependent aspartyl/asparaginyl beta-hydroxylase, which posttranslationally hydroxylates specific aspartyl or asparaginyl residues within epidermal growth factor-like modules [], activity was found to be similar to that of the purified mammalian aspartyl/asparaginyl beta-hydroxylase with respect to cofactor requirements, stereochemistry and substrate sequence specificity []. This enzyme requires Fe2+ as a cofactor. Some vitamin K-dependent coagulation factors, as well as synthetic peptides based on the structure of the first epidermal growth factor domain of human coagulation factor IX or X, can act as acceptors.; GO: 0018193 peptidyl-amino acid modification, 0030176 integral to endoplasmic reticulum membrane; PDB: 3RCQ_A 1E5S_A 1E5R_B.
Probab=26.35 E-value=1.9e+02 Score=25.61 Aligned_cols=38 Identities=13% Similarity=0.040 Sum_probs=23.6
Q ss_pred EEecCCCeEEEecchhhcccccccccCCCCCCeEEEEecccCC
Q psy4937 295 RIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTA 337 (356)
Q Consensus 295 ~I~Lp~rSLLVMsG~ARy~W~H~I~~~dvr~rRVSLTfR~l~~ 337 (356)
...+..|-++++... +.|.+-.. -.+.||.|.+--..|
T Consensus 125 ~~~w~~G~~~~fD~s----~~H~~~N~-~~~~Rv~L~vD~~hP 162 (163)
T PF05118_consen 125 TRHWREGECWVFDDS----FEHEVWNN-GDEDRVVLIVDFWHP 162 (163)
T ss_dssp EEB--CTEEEEE-TT----S-EEEEES-SSS-EEEEEEEEE-T
T ss_pred EEEeccCcEEEEeCC----EEEEEEeC-CCCCEEEEEEEeecC
Confidence 477789999998876 56777664 268899998765544
No 71
>PRK14811 formamidopyrimidine-DNA glycosylase; Provisional
Probab=26.07 E-value=42 Score=32.47 Aligned_cols=16 Identities=19% Similarity=0.567 Sum_probs=12.1
Q ss_pred cccceeeccccccccc
Q psy4937 82 TKETYVYCPHCNKAWQ 97 (356)
Q Consensus 82 ~~~~~~~c~~c~~~~~ 97 (356)
+--+-.|||.|.+..|
T Consensus 251 ~gR~ty~Cp~CQ~~~~ 266 (269)
T PRK14811 251 GGRGTHFCPQCQPLRP 266 (269)
T ss_pred CCCCcEECCCCcCCCC
Confidence 3457789999997754
No 72
>smart00661 RPOL9 RNA polymerase subunit 9.
Probab=25.95 E-value=52 Score=23.12 Aligned_cols=12 Identities=17% Similarity=0.382 Sum_probs=9.3
Q ss_pred eeeccccccccc
Q psy4937 86 YVYCPHCNKAWQ 97 (356)
Q Consensus 86 ~~~c~~c~~~~~ 97 (356)
...|+.|+..+.
T Consensus 20 ~~vC~~Cg~~~~ 31 (52)
T smart00661 20 RFVCRKCGYEEP 31 (52)
T ss_pred EEECCcCCCeEE
Confidence 667999997654
No 73
>PF12171 zf-C2H2_jaz: Zinc-finger double-stranded RNA-binding; InterPro: IPR022755 This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation. This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=25.59 E-value=30 Score=21.30 Aligned_cols=11 Identities=36% Similarity=1.102 Sum_probs=9.6
Q ss_pred eeccccccccc
Q psy4937 87 VYCPHCNKAWQ 97 (356)
Q Consensus 87 ~~c~~c~~~~~ 97 (356)
.||..|++.+.
T Consensus 2 ~~C~~C~k~f~ 12 (27)
T PF12171_consen 2 FYCDACDKYFS 12 (27)
T ss_dssp CBBTTTTBBBS
T ss_pred CCcccCCCCcC
Confidence 48999999986
No 74
>PLN02704 flavonol synthase
Probab=25.30 E-value=2e+02 Score=28.48 Aligned_cols=42 Identities=10% Similarity=0.015 Sum_probs=34.4
Q ss_pred ceEEEEecCCCeEEEecchhhcccccccccCC-------CCCCeEEEEe
Q psy4937 291 SVVIRIPMPARSLLVLWGEARYEWQHCVLRSD-------VIGRRVCIAY 332 (356)
Q Consensus 291 ~~~v~I~Lp~rSLLVMsG~ARy~W~H~I~~~d-------vr~rRVSLTf 332 (356)
+..+.|+-.+++|+|.-|+.=-.|.-+.-+.- -...|+||+|
T Consensus 244 g~Wi~V~p~pg~lvVNvGD~L~~~TNg~~kSt~HRVv~~~~~~R~Si~~ 292 (335)
T PLN02704 244 DHWFDVKYIPNALVIHIGDQIEILSNGKYKSVLHRTTVNKEKTRMSWPV 292 (335)
T ss_pred CEEEeCCCCCCeEEEEechHHHHHhCCeeecccceeecCCCCCeEEEEE
Confidence 56888999999999999999999987766541 1466999998
No 75
>PF07754 DUF1610: Domain of unknown function (DUF1610); InterPro: IPR011668 This domain is found in archaeal species. It is likely to bind zinc via its four well-conserved cysteine residues.
Probab=25.12 E-value=60 Score=20.35 Aligned_cols=12 Identities=25% Similarity=0.764 Sum_probs=8.6
Q ss_pred cccceeeccccc
Q psy4937 82 TKETYVYCPHCN 93 (356)
Q Consensus 82 ~~~~~~~c~~c~ 93 (356)
+...-..||.|+
T Consensus 12 ~~~v~f~CPnCG 23 (24)
T PF07754_consen 12 EQAVPFPCPNCG 23 (24)
T ss_pred ccCceEeCCCCC
Confidence 445667788887
No 76
>PF09788 Tmemb_55A: Transmembrane protein 55A; InterPro: IPR019178 Members of this family catalyse the hydrolysis of the 4-position phosphate of phosphatidylinositol 4,5-bisphosphate, in the reaction: 1-phosphatidyl-myo-inositol 4,5-bisphosphate + H(2)O = 1-phosphatidyl-1D-myo-inositol 5-phosphate + phosphate.
Probab=24.73 E-value=37 Score=32.94 Aligned_cols=28 Identities=36% Similarity=0.833 Sum_probs=21.1
Q ss_pred eccccccccCcccccccccccccccccCCCCcc
Q psy4937 14 TCIKCEQEFGYENKNIVEFTEETYIYCPHCNKA 46 (356)
Q Consensus 14 ~c~~ce~~~~~~~~~~~~~~~~~~~~~~~~~~~ 46 (356)
+|.+|...|-.... +..|...||||.|-
T Consensus 159 ~CghC~~~Fl~~~~-----~~~tlARCPHCrKv 186 (256)
T PF09788_consen 159 ICGHCSNTFLFNTL-----TSNTLARCPHCRKV 186 (256)
T ss_pred ECCCCCCcEeccCC-----CCCccccCCCCcee
Confidence 79999988765533 23578899999983
No 77
>KOG3084|consensus
Probab=24.53 E-value=54 Score=32.99 Aligned_cols=46 Identities=26% Similarity=0.358 Sum_probs=33.3
Q ss_pred eeeccccccccccCccccccCCCCCCCC-------CCCCCCe-EEeCCCCCHHH
Q psy4937 86 YVYCPHCNKAWQGSDLYAYQSHPNHSGK-------SFDIGGV-YIKEDFLSEAE 131 (356)
Q Consensus 86 ~~~c~~c~~~~~g~~~~~~~~~~~h~~~-------~~~ipGl-~~i~dFIS~~E 131 (356)
-.+|+.|++.||-.|+-.|-.--||.+. --..+|| ..+.+||-+-|
T Consensus 173 ~~~C~~~n~~yPr~dPvVIm~li~~d~~~~LL~R~~r~~~gl~t~lAGFlEpGE 226 (345)
T KOG3084|consen 173 DETCPSCNVIYPRTDPVVIMLLIDHDGKHALLGRQKRYPPGLWTCLAGFLEPGE 226 (345)
T ss_pred cccCCcCCeeccCCCCeEEEEEEcCCCCEeeeecccCCCCchhhhhhccCCccc
Confidence 4699999999998888777666677642 2456776 46778996544
No 78
>PRK00398 rpoP DNA-directed RNA polymerase subunit P; Provisional
Probab=24.23 E-value=52 Score=23.03 Aligned_cols=10 Identities=40% Similarity=0.790 Sum_probs=8.0
Q ss_pred eeeccccccc
Q psy4937 86 YVYCPHCNKA 95 (356)
Q Consensus 86 ~~~c~~c~~~ 95 (356)
.+-||.|+.-
T Consensus 21 ~~~Cp~CG~~ 30 (46)
T PRK00398 21 GVRCPYCGYR 30 (46)
T ss_pred ceECCCCCCe
Confidence 6889999853
No 79
>smart00659 RPOLCX RNA polymerase subunit CX. present in RNA polymerase I, II and III
Probab=24.23 E-value=42 Score=23.82 Aligned_cols=24 Identities=25% Similarity=0.627 Sum_probs=17.3
Q ss_pred eccccccccCcccccccccccccccccCCCCc
Q psy4937 14 TCIKCEQEFGYENKNIVEFTEETYIYCPHCNK 45 (356)
Q Consensus 14 ~c~~ce~~~~~~~~~~~~~~~~~~~~~~~~~~ 45 (356)
.|..|..++..... .-|.||+|+-
T Consensus 4 ~C~~Cg~~~~~~~~--------~~irC~~CG~ 27 (44)
T smart00659 4 ICGECGRENEIKSK--------DVVRCRECGY 27 (44)
T ss_pred ECCCCCCEeecCCC--------CceECCCCCc
Confidence 58888877666533 3489999975
No 80
>PRK12775 putative trifunctional 2-polyprenylphenol hydroxylase/glutamate synthase subunit beta/ferritin domain-containing protein; Provisional
Probab=23.49 E-value=44 Score=38.38 Aligned_cols=51 Identities=24% Similarity=0.479 Sum_probs=31.1
Q ss_pred cccccCCCCcccccCccccccCCCCCCCCccccccccccccccccccccceeeccccc
Q psy4937 36 TYIYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDFLTSQTKETYVYCPHCN 93 (356)
Q Consensus 36 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~c~~c~ 93 (356)
.-.+||.||+|..+++-|.-=-+...+---=+|+++|+- -...|-.||+|+
T Consensus 795 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~ 845 (1006)
T PRK12775 795 GVATCPKCHRPLEGDEEYVCCATSELQWRCDDCGKVSEG-------FAFPYGMCPACG 845 (1006)
T ss_pred CCccCcccCCCCCCCceeEEecCcceeeehhhhcccccc-------ccCCcCcCcccc
Confidence 346899999998888776543333322222246666552 233567788887
No 81
>PLN02403 aminocyclopropanecarboxylate oxidase
Probab=22.86 E-value=2.6e+02 Score=27.46 Aligned_cols=42 Identities=24% Similarity=0.260 Sum_probs=31.0
Q ss_pred ceEEEEecCC-CeEEEecchhhcccccccccC-------CCCCCeEEEEe
Q psy4937 291 SVVIRIPMPA-RSLLVLWGEARYEWQHCVLRS-------DVIGRRVCIAY 332 (356)
Q Consensus 291 ~~~v~I~Lp~-rSLLVMsG~ARy~W~H~I~~~-------dvr~rRVSLTf 332 (356)
+..+.|+-.+ ++|+|--|+.=-.|.=+.-+. .-...|+||+|
T Consensus 199 g~Wi~V~p~p~~~lvVNvGD~L~~~Tng~~~S~~HRVv~~~~~~R~Si~~ 248 (303)
T PLN02403 199 GKWVPIPPSKNNTIFVNTGDQLEVLSNGRYKSTLHRVMADKNGSRLSIAT 248 (303)
T ss_pred CeEEECCCCCCCEEEEEehHHHHHHhCCeeecccceeecCCCCCEEEEEE
Confidence 5567777666 699999999988887665553 11556999998
No 82
>PRK00762 hypA hydrogenase nickel incorporation protein; Provisional
Probab=22.57 E-value=64 Score=27.64 Aligned_cols=31 Identities=26% Similarity=0.488 Sum_probs=18.4
Q ss_pred ceeeccccccccCccccccccccccc-ccccCCCCc
Q psy4937 11 GVRTCIKCEQEFGYENKNIVEFTEET-YIYCPHCNK 45 (356)
Q Consensus 11 g~r~c~~ce~~~~~~~~~~~~~~~~~-~~~~~~~~~ 45 (356)
+.-.| .|...+......+ .... +..||+|+-
T Consensus 69 ~~~~C-~Cg~~~~~~~~~~---~~~~~~~~CP~Cgs 100 (124)
T PRK00762 69 VEIEC-ECGYEGVVDEDEI---DHYAAVIECPVCGN 100 (124)
T ss_pred eeEEe-eCcCcccccccch---hccccCCcCcCCCC
Confidence 56689 9997766542111 0111 357999985
No 83
>PRK03824 hypA hydrogenase nickel incorporation protein; Provisional
Probab=22.13 E-value=60 Score=28.26 Aligned_cols=35 Identities=20% Similarity=0.650 Sum_probs=21.6
Q ss_pred ceeeccccccccCccc------------ccccccccccccccCCCCc
Q psy4937 11 GVRTCIKCEQEFGYEN------------KNIVEFTEETYIYCPHCNK 45 (356)
Q Consensus 11 g~r~c~~ce~~~~~~~------------~~~~~~~~~~~~~~~~~~~ 45 (356)
..-.|..|...+.... +.++..+...+..||.|+-
T Consensus 69 ~~~~C~~CG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~CP~Cgs 115 (135)
T PRK03824 69 AVLKCRNCGNEWSLKEVKESLDEEIREAIHFIPEVVHAFLKCPKCGS 115 (135)
T ss_pred eEEECCCCCCEEecccccccccccccccccccccccccCcCCcCCCC
Confidence 4567999987765541 1222222356678999986
No 84
>PRK01103 formamidopyrimidine/5-formyluracil/ 5-hydroxymethyluracil DNA glycosylase; Validated
Probab=21.85 E-value=55 Score=31.61 Aligned_cols=13 Identities=31% Similarity=0.976 Sum_probs=9.8
Q ss_pred cccceeecccccc
Q psy4937 82 TKETYVYCPHCNK 94 (356)
Q Consensus 82 ~~~~~~~c~~c~~ 94 (356)
+--+-.|||.|.+
T Consensus 261 ~gR~t~~CP~CQ~ 273 (274)
T PRK01103 261 GGRSTFFCPRCQK 273 (274)
T ss_pred CCCCcEECcCCCC
Confidence 3457789999975
No 85
>smart00531 TFIIE Transcription initiation factor IIE.
Probab=21.49 E-value=37 Score=29.71 Aligned_cols=12 Identities=33% Similarity=1.074 Sum_probs=8.7
Q ss_pred ccccccCCCCcc
Q psy4937 35 ETYIYCPHCNKA 46 (356)
Q Consensus 35 ~~~~~~~~~~~~ 46 (356)
..+..||+|+.-
T Consensus 97 ~~~Y~Cp~C~~~ 108 (147)
T smart00531 97 NAYYKCPNCQSK 108 (147)
T ss_pred CcEEECcCCCCE
Confidence 346679999873
No 86
>PF11672 DUF3268: Protein of unknown function (DUF3268); InterPro: IPR021686 This entry is represented by Listeria phage P100, Gp150. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches.
Probab=21.32 E-value=1.2e+02 Score=25.58 Aligned_cols=17 Identities=41% Similarity=1.023 Sum_probs=12.0
Q ss_pred cccCCCCcc---cccCcccc
Q psy4937 38 IYCPHCNKA---WQGSDLYA 54 (356)
Q Consensus 38 ~~~~~~~~~---~~~~~~~~ 54 (356)
+.||-|+.. ..+++||-
T Consensus 3 ~~CpYCg~~~~l~~~~~iYg 22 (102)
T PF11672_consen 3 IICPYCGGPAELVDGSEIYG 22 (102)
T ss_pred cccCCCCCeeEEcccchhcC
Confidence 579999876 55666654
No 87
>PRK03681 hypA hydrogenase nickel incorporation protein; Validated
Probab=21.27 E-value=67 Score=27.13 Aligned_cols=28 Identities=21% Similarity=0.471 Sum_probs=17.6
Q ss_pred ceeeccccccccCcccccccccccccccccCCCCcc
Q psy4937 11 GVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKA 46 (356)
Q Consensus 11 g~r~c~~ce~~~~~~~~~~~~~~~~~~~~~~~~~~~ 46 (356)
+.-.|..|...+.... ..+..||+|+-.
T Consensus 69 ~~~~C~~Cg~~~~~~~--------~~~~~CP~Cgs~ 96 (114)
T PRK03681 69 AECWCETCQQYVTLLT--------QRVRRCPQCHGD 96 (114)
T ss_pred cEEEcccCCCeeecCC--------ccCCcCcCcCCC
Confidence 5567999985433322 234679999863
No 88
>PF06677 Auto_anti-p27: Sjogren's syndrome/scleroderma autoantigen 1 (Autoantigen p27); InterPro: IPR009563 The proteins in this entry are functionally uncharacterised and include several proteins that characterise Sjogren's syndrome/scleroderma autoantigen 1 (Autoantigen p27). It is thought that the potential association of anti-p27 with anti-centromere antibodies suggests that autoantigen p27 might play a role in mitosis [].
Probab=21.27 E-value=57 Score=22.90 Aligned_cols=9 Identities=33% Similarity=1.077 Sum_probs=6.6
Q ss_pred ceeeccccc
Q psy4937 85 TYVYCPHCN 93 (356)
Q Consensus 85 ~~~~c~~c~ 93 (356)
.-+||+.|+
T Consensus 33 g~~~Cv~C~ 41 (41)
T PF06677_consen 33 GKIYCVSCG 41 (41)
T ss_pred CCEECCCCC
Confidence 457899885
No 89
>COG1996 RPC10 DNA-directed RNA polymerase, subunit RPC10 (contains C4-type Zn-finger) [Transcription]
Probab=21.26 E-value=45 Score=24.42 Aligned_cols=25 Identities=32% Similarity=0.689 Sum_probs=18.1
Q ss_pred eccccccccCcccccccccccccccccCCCCc
Q psy4937 14 TCIKCEQEFGYENKNIVEFTEETYIYCPHCNK 45 (356)
Q Consensus 14 ~c~~ce~~~~~~~~~~~~~~~~~~~~~~~~~~ 45 (356)
-|..|-+.+. .-++.+.|-||+|+-
T Consensus 8 ~C~~Cg~~~~-------~~~~~~~irCp~Cg~ 32 (49)
T COG1996 8 KCARCGREVE-------LDQETRGIRCPYCGS 32 (49)
T ss_pred EhhhcCCeee-------hhhccCceeCCCCCc
Confidence 5888886651 135678999999985
No 90
>PRK11827 hypothetical protein; Provisional
Probab=21.13 E-value=74 Score=24.24 Aligned_cols=15 Identities=20% Similarity=0.160 Sum_probs=11.8
Q ss_pred ceeeccccccccccC
Q psy4937 85 TYVYCPHCNKAWQGS 99 (356)
Q Consensus 85 ~~~~c~~c~~~~~g~ 99 (356)
....|..|+.+||=+
T Consensus 25 ~~Lic~~~~laYPI~ 39 (60)
T PRK11827 25 QELICKLDNLAFPLR 39 (60)
T ss_pred CeEECCccCeecccc
Confidence 456799999999844
No 91
>PRK00564 hypA hydrogenase nickel incorporation protein; Provisional
Probab=20.54 E-value=71 Score=27.11 Aligned_cols=29 Identities=24% Similarity=0.508 Sum_probs=18.2
Q ss_pred cceeeccccccccCcccccccccccccccccCCCCcc
Q psy4937 10 KGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKA 46 (356)
Q Consensus 10 kg~r~c~~ce~~~~~~~~~~~~~~~~~~~~~~~~~~~ 46 (356)
++.-.|..|...+..... .|..||.|+-+
T Consensus 69 p~~~~C~~Cg~~~~~~~~--------~~~~CP~Cgs~ 97 (117)
T PRK00564 69 KVELECKDCSHVFKPNAL--------DYGVCEKCHSK 97 (117)
T ss_pred CCEEEhhhCCCccccCCc--------cCCcCcCCCCC
Confidence 356678889855433322 34569999864
No 92
>PF01096 TFIIS_C: Transcription factor S-II (TFIIS); InterPro: IPR001222 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents a zinc finger motif found in transcription factor IIs (TFIIS). In eukaryotes the initiation of transcription of protein encoding genes by polymerase II (Pol II) is modulated by general and specific transcription factors. The general transcription factors operate through common promoters elements (such as the TATA box). At least eight different proteins associate to form the general transcription factors: TFIIA, -IIB, -IID, -IIE, -IIF, -IIG, -IIH and -IIS []. During mRNA elongation, Pol II can encounter DNA sequences that cause reverse movement of the enzyme. Such backtracking involves extrusion of the RNA 3'-end into the pore, and can lead to transcriptional arrest. Escape from arrest requires cleavage of the extruded RNA with the help of TFIIS, which induces mRNA cleavage by enhancing the intrinsic nuclease activity of RNA polymerase (Pol) II, past template-encoded pause sites []. TFIIS extends from the polymerase surface via a pore to the internal active site. Two essential and invariant acidic residues in a TFIIS loop complement the Pol II active site and could position a metal ion and a water molecule for hydrolytic RNA cleavage. TFIIS also induces extensive structural changes in Pol II that would realign nucleic acids in the active centre. TFIIS is a protein of about 300 amino acids. It contains three regions: a variable N-terminal domain not required for TFIIS activity; a conserved central domain required for Pol II binding; and a conserved C-terminal C4-type zinc finger essential for RNA cleavage. The zinc finger folds in a conformation termed a zinc ribbon [] characterised by a three-stranded antiparallel beta-sheet and two beta-hairpins. A backbone model for Pol II-TFIIS complex was obtained from X-ray analysis. It shows that a beta hairpin protrudes from the zinc finger and complements the pol II active site []. Some viral proteins also contain the TFIIS zinc ribbon C-terminal domain. The Vaccinia virus protein, unlike its eukaryotic homologue, is an integral RNA polymerase subunit rather than a readily separable transcription factor []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003676 nucleic acid binding, 0008270 zinc ion binding, 0006351 transcription, DNA-dependent; PDB: 3M4O_I 3S14_I 2E2J_I 4A3J_I 3HOZ_I 1TWA_I 3S1Q_I 3S1N_I 1TWG_I 3I4M_I ....
Probab=20.35 E-value=86 Score=21.43 Aligned_cols=15 Identities=20% Similarity=0.797 Sum_probs=11.7
Q ss_pred cccceeecccccccc
Q psy4937 82 TKETYVYCPHCNKAW 96 (356)
Q Consensus 82 ~~~~~~~c~~c~~~~ 96 (356)
....|..|..|++.|
T Consensus 24 ~~T~fy~C~~C~~~w 38 (39)
T PF01096_consen 24 PMTLFYVCCNCGHRW 38 (39)
T ss_dssp SSEEEEEESSSTEEE
T ss_pred CCeEEEEeCCCCCee
Confidence 456788899999776
No 93
>TIGR00100 hypA hydrogenase nickel insertion protein HypA. In Hpylori, hypA mutant abolished hydrogenase activity and decrease in urease activity. Nickel supplementation in media restored urease activity and partial hydrogenase activity. HypA probably involved in inserting Ni in enzymes.
Probab=20.32 E-value=70 Score=27.02 Aligned_cols=27 Identities=26% Similarity=0.562 Sum_probs=17.9
Q ss_pred ceeeccccccccCcccccccccccccccccCCCCcc
Q psy4937 11 GVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKA 46 (356)
Q Consensus 11 g~r~c~~ce~~~~~~~~~~~~~~~~~~~~~~~~~~~ 46 (356)
+.-.|..|...+..... +..||+|+-+
T Consensus 69 ~~~~C~~Cg~~~~~~~~---------~~~CP~Cgs~ 95 (115)
T TIGR00100 69 VECECEDCSEEVSPEID---------LYRCPKCHGI 95 (115)
T ss_pred cEEEcccCCCEEecCCc---------CccCcCCcCC
Confidence 45678888865544332 4579999863
No 94
>COG3751 EGL-9 Predicted proline hydroxylase [Posttranslational modification, protein turnover, chaperones]
Probab=20.21 E-value=4.3e+02 Score=25.72 Aligned_cols=39 Identities=15% Similarity=0.107 Sum_probs=30.1
Q ss_pred eEEEEecCCCeEEEecchhhcccccccccCCCCCCeEEEE
Q psy4937 292 VVIRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIA 331 (356)
Q Consensus 292 ~~v~I~Lp~rSLLVMsG~ARy~W~H~I~~~dvr~rRVSLT 331 (356)
....|.=+-+||.+|.-.-.+.| |.|..-.....|+|||
T Consensus 197 ~~~ti~P~fn~lv~F~s~~~Hs~-h~V~~~~~~~~RlsV~ 235 (252)
T COG3751 197 SFKTIAPVFNSLVFFKSRPSHSV-HSVEEPYAAADRLSVT 235 (252)
T ss_pred cccccCCCCceEEEEEecCCccc-eeccccccccceEEEe
Confidence 34556667788888887776666 7777767899999999
No 95
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=20.05 E-value=35 Score=30.44 Aligned_cols=11 Identities=18% Similarity=0.655 Sum_probs=7.3
Q ss_pred ccccccCCCCc
Q psy4937 35 ETYIYCPHCNK 45 (356)
Q Consensus 35 ~~~~~~~~~~~ 45 (356)
..|-.||+|++
T Consensus 107 ~~~Y~Cp~c~~ 117 (158)
T TIGR00373 107 NMFFICPNMCV 117 (158)
T ss_pred CCeEECCCCCc
Confidence 44566777776
Done!