Query psy945
Match_columns 344
No_of_seqs 337 out of 823
Neff 5.5
Searched_HMMs 46136
Date Fri Aug 16 19:38:19 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy945.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/945hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF02936 COX4: Cytochrome c ox 100.0 2.8E-51 6.1E-56 356.9 6.9 142 201-343 1-142 (142)
2 KOG4075|consensus 100.0 2E-49 4.4E-54 347.7 8.4 148 195-343 20-167 (167)
3 cd00922 Cyt_c_Oxidase_IV Cytoc 100.0 1.1E-46 2.4E-51 325.9 9.7 135 202-337 2-136 (136)
4 COG1834 N-Dimethylarginine dim 100.0 1E-29 2.2E-34 240.6 3.5 211 85-314 14-256 (267)
5 KOG4075|consensus 99.9 6.8E-25 1.5E-29 193.1 -1.3 82 29-110 55-140 (167)
6 PF02274 Amidinotransf: Amidin 99.9 1.1E-23 2.4E-28 199.6 -2.8 200 95-313 23-267 (281)
7 cd00922 Cyt_c_Oxidase_IV Cytoc 99.8 1.6E-22 3.4E-27 175.4 -3.6 79 30-108 31-113 (136)
8 PF02936 COX4: Cytochrome c ox 99.8 1.7E-22 3.6E-27 176.4 -3.5 82 28-109 29-114 (142)
9 TIGR01078 arcA arginine deimin 99.7 1.3E-17 2.9E-22 167.5 8.6 152 94-263 34-290 (405)
10 PRK01388 arginine deiminase; P 99.7 3.8E-17 8.2E-22 164.3 7.1 140 94-239 41-280 (406)
11 COG4874 Uncharacterized protei 98.6 1.1E-07 2.3E-12 89.6 6.5 64 99-162 52-118 (318)
12 COG2235 ArcA Arginine deiminas 98.5 4.4E-07 9.5E-12 91.2 8.7 138 96-237 42-280 (409)
13 PF06592 DUF1138: Protein of u 77.6 3.9 8.4E-05 32.1 4.0 46 276-322 3-51 (73)
14 COG3911 Predicted ATPase [Gene 64.9 9.5 0.00021 34.6 4.1 32 96-127 140-171 (183)
15 PF06945 DUF1289: Protein of u 52.5 6.6 0.00014 28.5 0.8 19 38-56 25-43 (51)
16 PRK12906 secA preprotein trans 47.2 21 0.00045 39.8 4.0 61 63-126 97-158 (796)
17 PRK13103 secA preprotein trans 44.6 14 0.0003 41.7 2.1 61 63-126 99-160 (913)
18 COG3048 DsdA D-serine dehydrat 41.5 39 0.00085 34.0 4.5 86 39-138 142-239 (443)
19 PF11216 DUF3012: Protein of u 39.9 9.4 0.0002 25.5 -0.0 13 33-45 8-20 (32)
20 cd03143 A4_beta-galactosidase_ 35.7 98 0.0021 26.4 5.7 28 102-129 24-51 (154)
21 cd03130 GATase1_CobB Type 1 gl 35.7 70 0.0015 29.1 5.0 35 106-145 13-49 (198)
22 PF09153 DUF1938: Domain of un 35.3 32 0.0007 27.9 2.3 38 97-134 31-69 (86)
23 TIGR02144 LysX_arch Lysine bio 34.3 34 0.00073 32.0 2.8 27 99-125 5-31 (280)
24 PF03480 SBP_bac_7: Bacterial 31.9 1.1E+02 0.0024 29.0 5.8 52 41-125 214-265 (286)
25 CHL00122 secA preprotein trans 31.6 50 0.0011 37.3 3.8 60 63-125 93-153 (870)
26 PRK09200 preprotein translocas 31.0 50 0.0011 36.9 3.7 77 68-145 98-178 (790)
27 PF14156 AbbA_antirepres: Anti 30.8 46 0.00099 25.4 2.3 21 44-64 7-27 (63)
28 PRK02724 hypothetical protein; 30.1 24 0.00053 29.6 0.9 45 92-136 58-102 (104)
29 PRK01372 ddl D-alanine--D-alan 28.6 63 0.0014 30.7 3.6 32 97-128 16-47 (304)
30 cd09013 BphC-JF8_N_like N-term 27.8 81 0.0018 25.2 3.6 35 104-138 74-108 (121)
31 PF00505 HMG_box: HMG (high mo 27.5 25 0.00054 25.7 0.5 23 41-64 37-59 (69)
32 cd01388 SOX-TCF_HMG-box SOX-TC 26.0 69 0.0015 24.1 2.7 33 41-74 38-70 (72)
33 cd04501 SGNH_hydrolase_like_4 25.4 1.6E+02 0.0034 25.4 5.3 38 91-128 70-107 (183)
34 PF07517 SecA_DEAD: SecA DEAD- 25.2 55 0.0012 31.7 2.5 75 63-141 94-172 (266)
35 TIGR03714 secA2 accessory Sec 24.2 99 0.0021 34.5 4.5 57 67-124 89-146 (762)
36 cd01390 HMGB-UBF_HMG-box HMGB- 23.9 79 0.0017 22.7 2.6 22 40-62 36-57 (66)
37 PF14258 DUF4350: Domain of un 22.8 2E+02 0.0042 21.3 4.7 53 106-170 7-61 (70)
38 cd01828 sialate_O-acetylestera 22.3 2.3E+02 0.0051 23.9 5.7 38 90-127 58-97 (169)
39 PRK14569 D-alanyl-alanine synt 21.8 2E+02 0.0043 27.6 5.6 31 96-126 14-44 (296)
40 cd08344 MhqB_like_N N-terminal 21.8 1.3E+02 0.0029 23.6 3.8 34 104-138 66-99 (112)
41 PRK12326 preprotein translocas 21.7 1.3E+02 0.0028 33.6 4.7 61 63-126 95-156 (764)
42 TIGR00768 rimK_fam alpha-L-glu 21.6 72 0.0016 29.4 2.5 22 105-126 12-33 (277)
43 PRK10307 putative glycosyl tra 21.4 1.6E+02 0.0034 29.0 5.0 36 96-131 11-46 (412)
44 PF13239 2TM: 2TM domain 21.4 2.3E+02 0.0051 22.0 5.0 37 281-322 46-82 (83)
45 PRK01077 cobyrinic acid a,c-di 21.2 1.6E+02 0.0034 30.4 5.1 36 106-146 260-297 (451)
46 PF06757 Ins_allergen_rp: Inse 21.1 96 0.0021 27.8 3.1 64 39-122 99-174 (179)
47 PF09011 HMG_box_2: HMG-box do 20.8 32 0.00069 26.1 -0.1 16 243-258 42-57 (73)
48 cd07265 2_3_CTD_N N-terminal d 20.7 1.6E+02 0.0034 23.4 4.1 35 104-138 73-109 (122)
49 COG3313 Predicted Fe-S protein 20.7 55 0.0012 26.0 1.2 24 38-61 30-53 (74)
50 PF15176 LRR19-TM: Leucine-ric 20.3 1.7E+02 0.0038 24.5 4.1 30 265-294 8-37 (102)
51 PRK02868 hypothetical protein; 20.2 85 0.0018 30.2 2.6 23 44-66 65-87 (245)
No 1
>PF02936 COX4: Cytochrome c oxidase subunit IV; InterPro: IPR004203 Cytochrome c oxidase, a 13 sub-unit complex (1.9.3.1 from EC) is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit IV. The Dictyostelium discoideum (Slime mould) member of this family is called COX VI. The Saccharomyces cerevisiae protein YGX6_YEAST appears to be the yeast COX IV subunit.; GO: 0004129 cytochrome-c oxidase activity; PDB: 3ABK_Q 3AG1_Q 3ASN_Q 1OCZ_D 2EIN_Q 2OCC_D 2YBB_O 3AG3_D 1OCO_Q 1V55_Q ....
Probab=100.00 E-value=2.8e-51 Score=356.88 Aligned_cols=142 Identities=50% Similarity=0.867 Sum_probs=113.5
Q ss_pred EEEeeeccCCCcccCCCCCCCCCccccccccccccccccccCCCcCCChHHHHhhchhhccccccCCcCCCcchhhHHHH
Q psy945 201 EVVGYGFNGEPGYIDNPMFPYPSIRFKAITKELEPLKQKEKGDWKNLTLEEKKTLYRASFAQTFVEMNAPTGEWKGAFGI 280 (344)
Q Consensus 201 ~~VG~G~rt~p~y~d~~d~pLpd~rf~~~~~~l~~L~ekekgdWk~Ls~eEk~alY~~aF~~~~pe~~~~~~e~k~vfg~ 280 (344)
++||+|.+|.|.|+|++++|+|++||.++++++++|++||||||++||+|||+|+||+|||+||||+..++++|+.|||+
T Consensus 1 ~~v~~~~~s~p~y~d~~~~w~p~~~~~~~~~~~~~LkeKekg~Wk~LS~eEKkalY~isFg~~g~r~~~~~gewk~v~~~ 80 (142)
T PF02936_consen 1 EVVGYGANSNPSYVDREDRWLPMPRFQEQTEIIEALKEKEKGDWKKLSLEEKKALYRISFGQTGPRMKAPTGEWKKVFGG 80 (142)
T ss_dssp ----GGGTTS-----BTTBSS-SS---S--HHHHHHHHHTTS-GGGS-HHHHHHHHHHH-SS-HHHHT---SHHHHHHHH
T ss_pred CccccccCCCCcccchhcCCCCCcchhhhHHHHHHHHHHHhCCHhhCCHHHHHHHHHhhhcCcccccccCCcchHHHHHH
Confidence 47999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred HHHHHHHHHHHHHHHHHhccCCCCCCCCcHHHHHHHHHHHHHcCCCCCcccccccccccCccC
Q psy945 281 AFMVVTVSLLTYVLMYKVGIFPEKAETLSEEHRQEMLQKMIDLRWNPITGTASKWDYEKNDWK 343 (344)
Q Consensus 281 ~~v~ig~~~~~~~~~r~~~~~~p~P~tlseewk~a~~~~~~~~~~nPi~G~ss~wdY~~~~wk 343 (344)
+++++|+++++|.++|.+ +|+|+|+|||+|||+||+||||+|++|||+|+||+|||+||+||
T Consensus 81 ~~~~i~~s~~l~~~~r~~-~~~~~P~T~~~Ew~ea~~~~m~~~~~nPi~G~ss~wdy~~~~wk 142 (142)
T PF02936_consen 81 VFIFIGFSVLLFIWQRSY-VYPPLPHTFSKEWQEAQNERMLDMKVNPITGFSSKWDYEKNEWK 142 (142)
T ss_dssp HHHHHHHHHHHHHHHHHH-T-----GGGSHHHHHHHHHHHHHTTSSTTTTSGGGEETTTTEE-
T ss_pred HHHHHHHHHHHHHHHHHH-hCCCCCCCcCHHHHHHHHHHHHHcCCCCCCCCcCCcCCCCCcCC
Confidence 999999999999999999 89999999999999999999999999999999999999999998
No 2
>KOG4075|consensus
Probab=100.00 E-value=2e-49 Score=347.68 Aligned_cols=148 Identities=42% Similarity=0.797 Sum_probs=145.0
Q ss_pred cCCCCeEEEeeeccCCCcccCCCCCCCCCccccccccccccccccccCCCcCCChHHHHhhchhhccccccCCcCCCcch
Q psy945 195 DFFPNREVVGYGFNGEPGYIDNPMFPYPSIRFKAITKELEPLKQKEKGDWKNLTLEEKKTLYRASFAQTFVEMNAPTGEW 274 (344)
Q Consensus 195 ~~ig~r~~VG~G~rt~p~y~d~~d~pLpd~rf~~~~~~l~~L~ekekgdWk~Ls~eEk~alY~~aF~~~~pe~~~~~~e~ 274 (344)
.++..+.+||+|.++.|.|.|++++|+|.+||.|++.++++|+||||+||++||.|||||+||+|||||++|+++++++|
T Consensus 20 ~~~a~~~~vg~~~~~~p~y~Drrd~pmP~~~~~e~~~~~~aL~eKek~~Wk~LS~~EKkalYrisF~et~ae~~~~~~ew 99 (167)
T KOG4075|consen 20 DKAAKRQLVGYGDNGLPIYLDRRDYPMPAIRFRELSAEIKALREKEKAPWKQLSTEEKKALYRISFGETFAERNRGSNEW 99 (167)
T ss_pred hhhhcccccccccccCchhhccccCCCcchhhhcccHHHHHHHHHhcCChhhcCHHHHHHHHHHHhccccccccCCCCcc
Confidence 35688899999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHhccCCCCCCCCcHHHHHHHHHHHHHcCCCCCcccccccccccCccC
Q psy945 275 KGAFGIAFMVVTVSLLTYVLMYKVGIFPEKAETLSEEHRQEMLQKMIDLRWNPITGTASKWDYEKNDWK 343 (344)
Q Consensus 275 k~vfg~~~v~ig~~~~~~~~~r~~~~~~p~P~tlseewk~a~~~~~~~~~~nPi~G~ss~wdY~~~~wk 343 (344)
|.|||+.++++|++.++.+|++.+ +|+|.|+||++|||+||+|||||+++|||+|++|+||||+|+||
T Consensus 100 Ktv~g~~~~f~Gl~~~v~l~~~v~-vy~~~P~Tf~~Ewq~kq~erml~~~~NPi~Glas~wdye~k~wK 167 (167)
T KOG4075|consen 100 KTVFGVAGFFLGLTISVILFGKVR-VYGPLPKTFNKEWQEKQLERMLDLKINPIGGLASKWDYEEKKWK 167 (167)
T ss_pred cchhhHHHHHHHHHHHHHHHHhhe-ecCCCCcchhHHHHHHHHHHHHHcccCCccCcccccChhhhccC
Confidence 999999999999999999999999 99999999999999999999999999999999999999999998
No 3
>cd00922 Cyt_c_Oxidase_IV Cytochrome c oxidase subunit IV. Cytochrome c oxidase (CcO), the terminal oxidase in the respiratory chains of eukaryotes and most bacteria, is a multi-chain transmembrane protein located in the inner membrane of mitochondria and the cell membrane of prokaryotes. It catalyzes the reduction of O2 and simultaneously pumps protons across the membrane. The number of subunits varies from three to five in bacteria and up to 13 in mammalian mitochondria. Subunits I, II, and III of mammalian CcO are encoded within the mitochondrial genome and the remaining 10 subunits are encoded within the nuclear genome. Found only in eukaryotes, subunit IV is the largest of the nuclear-encoded subunits. It binds ATP at the matrix side, leading to an allosteric inhibition of enzyme activity at high intramitochondrial ATP/ADP ratios. In mammals, subunit IV has a lung-specific isoform and a ubiquitously expressed isoform.
Probab=100.00 E-value=1.1e-46 Score=325.88 Aligned_cols=135 Identities=50% Similarity=0.876 Sum_probs=133.4
Q ss_pred EEeeeccCCCcccCCCCCCCCCccccccccccccccccccCCCcCCChHHHHhhchhhccccccCCcCCCcchhhHHHHH
Q psy945 202 VVGYGFNGEPGYIDNPMFPYPSIRFKAITKELEPLKQKEKGDWKNLTLEEKKTLYRASFAQTFVEMNAPTGEWKGAFGIA 281 (344)
Q Consensus 202 ~VG~G~rt~p~y~d~~d~pLpd~rf~~~~~~l~~L~ekekgdWk~Ls~eEk~alY~~aF~~~~pe~~~~~~e~k~vfg~~ 281 (344)
+||+|.++.|.|+|++++|+|+|||.++++++++|++||||||++||++||||+||+|||+|+||++++++|||.|||++
T Consensus 2 ~~~~~~~s~p~y~d~~~~p~p~~~~~e~~~~~~~Lrekek~dW~~LT~~EKkAlY~isfg~~~~e~~~~~~ewk~v~~~~ 81 (136)
T cd00922 2 VVGYGANSLPTYADIPDRPLPMIRFRELSAEIKALREKEKGDWKQLTLEEKKALYRISFGETGPEMNAPTGEWKTVFGGV 81 (136)
T ss_pred cccccccCCccccCCCCCCCCcCccccchHHHHHHHHHhhCCHhhCCHHHHhhHhhhhhccccccccCCCccHHHHHHHH
Confidence 68999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred HHHHHHHHHHHHHHHHhccCCCCCCCCcHHHHHHHHHHHHHcCCCCCccccccccc
Q psy945 282 FMVVTVSLLTYVLMYKVGIFPEKAETLSEEHRQEMLQKMIDLRWNPITGTASKWDY 337 (344)
Q Consensus 282 ~v~ig~~~~~~~~~r~~~~~~p~P~tlseewk~a~~~~~~~~~~nPi~G~ss~wdY 337 (344)
++++|+++++|+++|.+ +|+|.|+||++|||+||+|||+++|+|||||+||+|||
T Consensus 82 ~~~i~~s~~~~~~~r~~-~~~~~P~T~t~Ewqea~~er~~~~~~NPitG~ss~~dy 136 (136)
T cd00922 82 LAFIGITGVIFGLQRAF-VYGPKPHTFTEEWQEAQLERMLDMKVNPITGYASKWDY 136 (136)
T ss_pred HHHHHHHHHHHHHHHHh-ccCCCCCCcCHHHHHHHHHHHHHhCCCCCcccccCCcC
Confidence 99999999999999999 89999999999999999999999999999999999998
No 4
>COG1834 N-Dimethylarginine dimethylaminohydrolase [Amino acid transport and metabolism]
Probab=99.95 E-value=1e-29 Score=240.64 Aligned_cols=211 Identities=18% Similarity=0.208 Sum_probs=169.7
Q ss_pred eeeeEEeeccc---ccCh--HHHHHHHHHHHHHHHHCCCeEEEeCCCCCCCCceecCCceEEECCeEEEeCCCCccCcCh
Q psy945 85 TVSLLTYVLMY---KVGI--DEARRQHESFVALLRDLGLDVIELPPDEDLPESVFIEDTAVIVNGIVLITKPGNTQRHKE 159 (344)
Q Consensus 85 ~vs~lin~~M~---~vD~--d~A~~Qh~~lv~~Lr~~Gv~V~~L~~~~~~PD~VF~~D~a~v~~~~avi~r~~~~~Rr~E 159 (344)
.+.+.+|+||+ ..++ +.|++||++|+++|+++||+|++++|.+++||+||++|+++|++.+||++||+++.|++|
T Consensus 14 ~~~~~~~~~~~~~~~~~~~~e~A~aQh~~lve~l~~~gv~V~ll~~~~~~Pd~VFt~D~~~v~~~~avl~r~~~p~R~gE 93 (267)
T COG1834 14 LVDYVINPWMKGLGEEDPQIERAVAQHEALVEALEKNGVEVHLLPPIEGLPDQVFTRDPGLVTGEGAVLARMGAPERRGE 93 (267)
T ss_pred HhhhhhhhhhcccCCCCchhHHHHHHHHHHHHHHHHCCCEEEEcCcccCCCcceEeccceeEecccEEEeccCChhhccC
Confidence 46678898888 3344 489999999999999999999999999999999999999999999999999999999999
Q ss_pred hhhhHHHHhhcc-cccccccccccccCCCCcccccccCC-CCeEEEeeeccCCCcccCCCC----------CCCCCcccc
Q psy945 160 VAVSATRALIKP-KSLAASYSAASAHSNEPVAKEHFDFF-PNREVVGYGFNGEPGYIDNPM----------FPYPSIRFK 227 (344)
Q Consensus 160 ~~~~~t~~~~~l-~~~~i~~~~~~~~~~eG~~~~~~~~i-g~r~~VG~G~rt~p~y~d~~d----------~pLpd~rf~ 227 (344)
...+. .|++. +.+.+. +..+. ..+| .||++.. |+++++|+|+||+...+.... .-.-+.+|+
T Consensus 94 ~~~~~--~~~~~lgi~i~~-~~~~~-~~eG--~GD~l~~~~~~v~iG~s~RTn~egi~~l~~~L~~~~~v~~~~~~~~~l 167 (267)
T COG1834 94 EEAIK--ETLESLGIPIYP-RVEAG-VFEG--AGDVLMDGGDTVYIGYSFRTNLEGIEQLQAWLEEGYEVSLVRLDERYL 167 (267)
T ss_pred HHHHH--HHHHHcCCcccc-cccCC-Cccc--cccEEEeCCcEEEEEeccccchHHHHHHHHHhccCcEEEEEecCCcee
Confidence 99987 67772 222222 22333 3344 4577777 777899999999999883221 225578899
Q ss_pred ccccccccccccccCCCcCCChHHHHhhchhhccccccCC--------cCCCcchhhHHHHHHHHHHHHHHH-------H
Q psy945 228 AITKELEPLKQKEKGDWKNLTLEEKKTLYRASFAQTFVEM--------NAPTGEWKGAFGIAFMVVTVSLLT-------Y 292 (344)
Q Consensus 228 ~~~~~l~~L~ekekgdWk~Ls~eEk~alY~~aF~~~~pe~--------~~~~~e~k~vfg~~~v~ig~~~~~-------~ 292 (344)
|+|++++.|++. ...+|+.+|++.++++ +..+.++...++||.+.+|...++ .
T Consensus 168 HLdt~~~~l~e~------------~al~y~~~~~~~~~~~lk~r~~~~I~Vpe~e~~~l~~n~v~~g~~~v~~~~~~~~~ 235 (267)
T COG1834 168 HLDTVFNPLAEG------------LALAYPPAFSEGANDVLKERGFELIEVPEEEAFALGCNVVSLGPNVVIALPRTPKA 235 (267)
T ss_pred ehhheeeeccCc------------ceeecchhcchhHHHHHhhCCceEEecCHhHhhhhccceeecCCceeecCcccchH
Confidence 999999999976 6889999999744432 346677788899999999877555 4
Q ss_pred HHHHHhccCCCCCCCCcHHHHH
Q psy945 293 VLMYKVGIFPEKAETLSEEHRQ 314 (344)
Q Consensus 293 ~~~r~~~~~~p~P~tlseewk~ 314 (344)
..++.. ||..+|.+++|..|.
T Consensus 236 e~L~~~-GfeVi~~~~~ei~k~ 256 (267)
T COG1834 236 EQLAAA-GFEVIEVDLTEILKG 256 (267)
T ss_pred HHHHhC-CceEEecCHHHHHHc
Confidence 677888 999999999988764
No 5
>KOG4075|consensus
Probab=99.89 E-value=6.8e-25 Score=193.08 Aligned_cols=82 Identities=32% Similarity=0.507 Sum_probs=75.5
Q ss_pred cCCCccchhhccCCccccCHHHHHHHHHHhhhhhHHHhhccCCCchhhhhhheeeeeeeeEEeeccc----ccChHHHHH
Q psy945 29 HSNEPPLKQKEKGDWKNLTLEEKKTLYRASFAQTFVEMNASTGEWKGAFGIAFMVVTVSLLTYVLMY----KVGIDEARR 104 (344)
Q Consensus 29 ~~~~~~l~~ke~~~w~~l~~~ek~~lyr~~f~~t~~e~~~~~~ewk~~~~~~~~~~~vs~lin~~M~----~vD~d~A~~ 104 (344)
.+++++||||||+||++||.|||++||||||||||+|||+++||||+|+|++++|+|+++++-.||+ +++.....+
T Consensus 55 ~~~~~aL~eKek~~Wk~LS~~EKkalYrisF~et~ae~~~~~~ewKtv~g~~~~f~Gl~~~v~l~~~v~vy~~~P~Tf~~ 134 (167)
T KOG4075|consen 55 SAEIKALREKEKAPWKQLSTEEKKALYRISFGETFAERNRGSNEWKTVFGVAGFFLGLTISVILFGKVRVYGPLPKTFNK 134 (167)
T ss_pred cHHHHHHHHHhcCChhhcCHHHHHHHHHHHhccccccccCCCCcccchhhHHHHHHHHHHHHHHHHhheecCCCCcchhH
Confidence 4578999999999999999999999999999999999999999999999999999999999999988 567778888
Q ss_pred HHHHHH
Q psy945 105 QHESFV 110 (344)
Q Consensus 105 Qh~~lv 110 (344)
||+.-.
T Consensus 135 Ewq~kq 140 (167)
T KOG4075|consen 135 EWQEKQ 140 (167)
T ss_pred HHHHHH
Confidence 887643
No 6
>PF02274 Amidinotransf: Amidinotransferase; InterPro: IPR003198 This family contains glycine and inosamine amidinotransferases, enzymes which are involved in creatine and streptomycin biosynthesis respectively. This family also includes arginine deiminases, which catalyse the reversible reaction: arginine + H2O = citrulline + NH3 The Streptococcus anti-tumour glycoprotein is also found in this family [].; GO: 0016813 hydrolase activity, acting on carbon-nitrogen (but not peptide) bonds, in linear amidines, 0005737 cytoplasm; PDB: 2CI7_A 2CI1_A 2CI4_A 2CI3_A 2CI5_A 2C6Z_A 2CI6_A 3I4A_B 3I2E_B 2JAI_A ....
Probab=99.86 E-value=1.1e-23 Score=199.63 Aligned_cols=200 Identities=22% Similarity=0.250 Sum_probs=141.3
Q ss_pred cccChHHHHHHHHHHHHHHHHCCCeEEEeCCC--CCCCCceecCCceEEECCeEEEeCCCCccCcChhhhhHHHHhhcc-
Q psy945 95 YKVGIDEARRQHESFVALLRDLGLDVIELPPD--EDLPESVFIEDTAVIVNGIVLITKPGNTQRHKEVAVSATRALIKP- 171 (344)
Q Consensus 95 ~~vD~d~A~~Qh~~lv~~Lr~~Gv~V~~L~~~--~~~PD~VF~~D~a~v~~~~avi~r~~~~~Rr~E~~~~~t~~~~~l- 171 (344)
+++|+++|++||++|+++|+++|++|+.+++. +++||+||++|+++++++++|++||+.++||+|...+. .+++.
T Consensus 23 ~~~~~~~a~~e~~~l~~~L~~~Gv~V~~~~~~~~~~~p~~vF~rD~~~~~~~~~ii~~m~~~~R~~E~~~~~--~~~~~~ 100 (281)
T PF02274_consen 23 DPPDPEKAREEHDALVEALRSNGVEVIELPPLLEEPLPDMVFTRDPGVVIGGGVIIGRMRAPSRRGEEDVYK--EIFEKH 100 (281)
T ss_dssp S---HHHHHHHHHHHHHHHHTTT-EEEEEHHHHHTT-TTTT-TTCCEEEECTEEEE-B-SSGGGHGHHHHHH--HHHHHS
T ss_pred CcchHHHHHHHHHHHHHHHHhCCcEEEEeCCccCCCCCCeEEcCCcEEEEcCCEEEeCCChHHhcCcchHHH--HHHHhh
Confidence 48999999999999999999999999999999 99999999999999999999999999999999999977 56653
Q ss_pred --cc---cccc-cccccccCCCCcccccccCCCCeEEE-eeeccCCCcccC------CCC---------CCCCCcccccc
Q psy945 172 --KS---LAAS-YSAASAHSNEPVAKEHFDFFPNREVV-GYGFNGEPGYID------NPM---------FPYPSIRFKAI 229 (344)
Q Consensus 172 --~~---~~i~-~~~~~~~~~eG~~~~~~~~ig~r~~V-G~G~rt~p~y~d------~~d---------~pLpd~rf~~~ 229 (344)
.. ..+. ....+++.+|| |++.+++..++ |+|.||++..++ ... .-..++.++|+
T Consensus 101 ~~~~~~~~~~~~~~~~~~~lEGG----Dv~~~~~~~~v~G~g~RTn~~g~~~l~~~l~~~~~~~~v~~~~~~~~~~~~HL 176 (281)
T PF02274_consen 101 PFNIPRVLDIEEENGDPGYLEGG----DVLVLGDNVLVIGVGSRTNEEGIEQLARALGEEEVVFEVVVVVVPVDPGFLHL 176 (281)
T ss_dssp CCCTCCEEEEEEC--TTS-B-GG----GEEEESTEEEEEEESSSS-HHHHHHHHHHHCCTTSESEEEEEEEEECSSSSSG
T ss_pred ccccceeeCCccccccCceecCc----EEEEECCEEEEEeecCCCCHHHHHHHHHHhcccccccccceeeccCccCcccc
Confidence 11 1121 11246778888 55555666654 999999999883 111 12446788999
Q ss_pred ccccccccccccCCCcCCChHHHHhhchhhcccc------------ccCCcCCCcchhhHHHHHHHHHHH--------HH
Q psy945 230 TKELEPLKQKEKGDWKNLTLEEKKTLYRASFAQT------------FVEMNAPTGEWKGAFGIAFMVVTV--------SL 289 (344)
Q Consensus 230 ~~~l~~L~ekekgdWk~Ls~eEk~alY~~aF~~~------------~pe~~~~~~e~k~vfg~~~v~ig~--------~~ 289 (344)
|+.++.|++. ...+|+..|++. +-+.+..+.+....+++|++.+|= ..
T Consensus 177 D~~~~~l~~~------------~~l~~~~~~~~~~~~~l~~~l~~~~~~iI~v~~~e~~~~~~N~l~l~~~~vi~~~~~~ 244 (281)
T PF02274_consen 177 DTVFNPLDPD------------LVLVYPDAFDPEEEEELEQALKERGFEIIEVPEEEQWNFACNVLSLGPGKVIAYASNP 244 (281)
T ss_dssp GGTEEEEETT------------EEEEECCHHCTHHHHHHHHHHSSSTCEEEEESSCSCSGGGGS-EEECTTEEEEETTHH
T ss_pred ceEEEEcCCC------------EEEEeCcccchHHHHHHHHHhcccCcEEEEeccchhhhccCCEEEecCCEEEECCCCH
Confidence 9999999965 456677766543 334455667777799999988872 23
Q ss_pred HHHHHHHHhccCCCCCCCCcHHHH
Q psy945 290 LTYVLMYKVGIFPEKAETLSEEHR 313 (344)
Q Consensus 290 ~~~~~~r~~~~~~p~P~tlseewk 313 (344)
-+...++.. |+..++.+++|.-|
T Consensus 245 ~~~~~L~~~-G~~v~~v~~~el~k 267 (281)
T PF02274_consen 245 RTNEQLEKA-GIEVIEVDFSELEK 267 (281)
T ss_dssp HHHHHHHHT-T-EEEEE-HHHHHT
T ss_pred HHHHHHHhc-CCeEEEEcHHHhhc
Confidence 346677888 89999999987654
No 7
>cd00922 Cyt_c_Oxidase_IV Cytochrome c oxidase subunit IV. Cytochrome c oxidase (CcO), the terminal oxidase in the respiratory chains of eukaryotes and most bacteria, is a multi-chain transmembrane protein located in the inner membrane of mitochondria and the cell membrane of prokaryotes. It catalyzes the reduction of O2 and simultaneously pumps protons across the membrane. The number of subunits varies from three to five in bacteria and up to 13 in mammalian mitochondria. Subunits I, II, and III of mammalian CcO are encoded within the mitochondrial genome and the remaining 10 subunits are encoded within the nuclear genome. Found only in eukaryotes, subunit IV is the largest of the nuclear-encoded subunits. It binds ATP at the matrix side, leading to an allosteric inhibition of enzyme activity at high intramitochondrial ATP/ADP ratios. In mammals, subunit IV has a lung-specific isoform and a ubiquitously expressed isoform.
Probab=99.83 E-value=1.6e-22 Score=175.38 Aligned_cols=79 Identities=42% Similarity=0.632 Sum_probs=69.8
Q ss_pred CCCccchhhccCCccccCHHHHHHHHHHhhhhhHHHhhccCCCchhhhhhheeeeeeeeEEeecccc----cChHHHHHH
Q psy945 30 SNEPPLKQKEKGDWKNLTLEEKKTLYRASFAQTFVEMNASTGEWKGAFGIAFMVVTVSLLTYVLMYK----VGIDEARRQ 105 (344)
Q Consensus 30 ~~~~~l~~ke~~~w~~l~~~ek~~lyr~~f~~t~~e~~~~~~ewk~~~~~~~~~~~vs~lin~~M~~----vD~d~A~~Q 105 (344)
.++.+|||||||||++||.+|||+|||+|||||++||++++||||+|+|+++++++|++++..++.. +-....-.|
T Consensus 31 ~~~~~Lrekek~dW~~LT~~EKkAlY~isfg~~~~e~~~~~~ewk~v~~~~~~~i~~s~~~~~~~r~~~~~~~P~T~t~E 110 (136)
T cd00922 31 AEIKALREKEKGDWKQLTLEEKKALYRISFGETGPEMNAPTGEWKTVFGGVLAFIGITGVIFGLQRAFVYGPKPHTFTEE 110 (136)
T ss_pred HHHHHHHHHhhCCHhhCCHHHHhhHhhhhhccccccccCCCccHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCCcCHH
Confidence 4578999999999999999999999999999999999999999999999999999999999999883 234445556
Q ss_pred HHH
Q psy945 106 HES 108 (344)
Q Consensus 106 h~~ 108 (344)
|++
T Consensus 111 wqe 113 (136)
T cd00922 111 WQE 113 (136)
T ss_pred HHH
Confidence 654
No 8
>PF02936 COX4: Cytochrome c oxidase subunit IV; InterPro: IPR004203 Cytochrome c oxidase, a 13 sub-unit complex (1.9.3.1 from EC) is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit IV. The Dictyostelium discoideum (Slime mould) member of this family is called COX VI. The Saccharomyces cerevisiae protein YGX6_YEAST appears to be the yeast COX IV subunit.; GO: 0004129 cytochrome-c oxidase activity; PDB: 3ABK_Q 3AG1_Q 3ASN_Q 1OCZ_D 2EIN_Q 2OCC_D 2YBB_O 3AG3_D 1OCO_Q 1V55_Q ....
Probab=99.83 E-value=1.7e-22 Score=176.37 Aligned_cols=82 Identities=41% Similarity=0.617 Sum_probs=63.5
Q ss_pred ccCCCccchhhccCCccccCHHHHHHHHHHhhhhhHHHhhccCCCchhhhhhheeeeeeeeEEeecccc----cChHHHH
Q psy945 28 AHSNEPPLKQKEKGDWKNLTLEEKKTLYRASFAQTFVEMNASTGEWKGAFGIAFMVVTVSLLTYVLMYK----VGIDEAR 103 (344)
Q Consensus 28 ~~~~~~~l~~ke~~~w~~l~~~ek~~lyr~~f~~t~~e~~~~~~ewk~~~~~~~~~~~vs~lin~~M~~----vD~d~A~ 103 (344)
....+++|||||||||++||+|||++|||||||||++||++++||||+|+|++|++++++.++..|++. +.....-
T Consensus 29 ~~~~~~~LkeKekg~Wk~LS~eEKkalY~isFg~~g~r~~~~~gewk~v~~~~~~~i~~s~~l~~~~r~~~~~~~P~T~~ 108 (142)
T PF02936_consen 29 QTEIIEALKEKEKGDWKKLSLEEKKALYRISFGQTGPRMKAPTGEWKKVFGGVFIFIGFSVLLFIWQRSYVYPPLPHTFS 108 (142)
T ss_dssp --HHHHHHHHHTTS-GGGS-HHHHHHHHHHH-SS-HHHHT---SHHHHHHHHHHHHHHHHHHHHHHHHHHT-----GGGS
T ss_pred hHHHHHHHHHHHhCCHhhCCHHHHHHHHHhhhcCcccccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCcC
Confidence 456789999999999999999999999999999999999999999999999999999999999999982 2244455
Q ss_pred HHHHHH
Q psy945 104 RQHESF 109 (344)
Q Consensus 104 ~Qh~~l 109 (344)
+||++-
T Consensus 109 ~Ew~ea 114 (142)
T PF02936_consen 109 KEWQEA 114 (142)
T ss_dssp HHHHHH
T ss_pred HHHHHH
Confidence 666654
No 9
>TIGR01078 arcA arginine deiminase. Arginine deiminase is the first enzyme of the arginine deiminase pathway of arginine degradation.
Probab=99.71 E-value=1.3e-17 Score=167.52 Aligned_cols=152 Identities=16% Similarity=0.197 Sum_probs=120.8
Q ss_pred ccccChHHHHHHHHHHHHHHHHCCCeEEEeCCC-----------------------------------------------
Q psy945 94 MYKVGIDEARRQHESFVALLRDLGLDVIELPPD----------------------------------------------- 126 (344)
Q Consensus 94 M~~vD~d~A~~Qh~~lv~~Lr~~Gv~V~~L~~~----------------------------------------------- 126 (344)
-+.++.++|++||+.|+++||++||+|+.|+..
T Consensus 34 d~i~~~~~a~~EHd~f~~~Lr~~GveV~~l~dll~e~l~~~~~r~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~l~~ 113 (405)
T TIGR01078 34 DDIPWVEDAQKEHDQFANTLRDNGIEVLYLEDLLAETLDLPEAKEKFIDEFLSESEILGLGLKVELRDYLKSLDTRELVE 113 (405)
T ss_pred cCCCCHHHHHHHHHHHHHHHHHCCCEEEEhHHHHHHHhCCHHHHHHHHHHHHhhccCCCHHHHHHHHHHHHhCCHHHHHH
Confidence 347899999999999999999999999988621
Q ss_pred ----------------------------------CCCCCceecCCceEEECCeEEEeCCCCccCcChhhhhHHHHhhc--
Q psy945 127 ----------------------------------EDLPESVFIEDTAVIVNGIVLITKPGNTQRHKEVAVSATRALIK-- 170 (344)
Q Consensus 127 ----------------------------------~~~PD~VF~~D~a~v~~~~avi~r~~~~~Rr~E~~~~~t~~~~~-- 170 (344)
+.+||++|+||.+++++++++|++|+.+.||+|...++ .+++
T Consensus 114 ~li~g~~~~e~~~~~~~~~~~~~~~~~~~~~~~i~PlpN~vFtRD~~~vi~~g~vi~~m~~~~Rr~E~~~~~--~i~~~h 191 (405)
T TIGR01078 114 KLMAGVAKNELPASEGSEKSLMDLGVEGDSDFVIDPMPNLYFTRDPFASIGNGVTINPMYYKARQRETLFTR--AIFKHH 191 (405)
T ss_pred HHHcCCCHHHcCcccccchhhHHHhccCCCceeccCCCCcEeccCceEEECCeEEEeCCCchhhCCcHHHHH--HHHHhC
Confidence 35899999999999999999999999999999999977 4544
Q ss_pred -----ccccccccccccccCCCCcccccccCCCC-eEEEeeeccCCCcccC--------C-----CC--CCCCCcc-ccc
Q psy945 171 -----PKSLAASYSAASAHSNEPVAKEHFDFFPN-REVVGYGFNGEPGYID--------N-----PM--FPYPSIR-FKA 228 (344)
Q Consensus 171 -----l~~~~i~~~~~~~~~~eG~~~~~~~~ig~-r~~VG~G~rt~p~y~d--------~-----~d--~pLpd~r-f~~ 228 (344)
.+.+.+.....+++.+|| |++.+++ .++||+|.||++.... . .. +++|..| |.|
T Consensus 192 p~f~~~g~~i~~~~~~~~~iEGG----Dvl~l~~~~l~IG~s~RTn~~gie~La~~L~~~~~~~~~Vi~v~ip~~r~~mH 267 (405)
T TIGR01078 192 PRFANTEFPIWYDRSETASIEGG----DVLVLNKDVLAIGISERTSAQSVEKLAKSLFANKGGFKKVLAINIPKNRALMH 267 (405)
T ss_pred chhhccCCcEEecCCCCCccccc----eEEEECCCEEEEEecccCCHHHHHHHHHHhhhccCCCCEEEEEEcCCCCCccc
Confidence 122333323356778888 5556664 4599999999998873 1 11 5577776 889
Q ss_pred cccccccccccccCCCcCCChHHHHhhchhhcccc
Q psy945 229 ITKELEPLKQKEKGDWKNLTLEEKKTLYRASFAQT 263 (344)
Q Consensus 229 ~~~~l~~L~ekekgdWk~Ls~eEk~alY~~aF~~~ 263 (344)
+|+.++.+++. +..+|+..|+..
T Consensus 268 LDtvft~vd~d------------~~l~~p~~~~~~ 290 (405)
T TIGR01078 268 LDTVFTMVDYD------------KFTVFPEVVDVL 290 (405)
T ss_pred CCccccccCCC------------eEEEchhHhccc
Confidence 99999999965 677888888654
No 10
>PRK01388 arginine deiminase; Provisional
Probab=99.68 E-value=3.8e-17 Score=164.26 Aligned_cols=140 Identities=20% Similarity=0.285 Sum_probs=111.3
Q ss_pred ccccChHHHHHHHHHHHHHHHHCCCeEEEeCCC-----------------------------------------------
Q psy945 94 MYKVGIDEARRQHESFVALLRDLGLDVIELPPD----------------------------------------------- 126 (344)
Q Consensus 94 M~~vD~d~A~~Qh~~lv~~Lr~~Gv~V~~L~~~----------------------------------------------- 126 (344)
-+.++.++|++||+.|+++||++|++|++|+..
T Consensus 41 ddi~~l~~aq~EHd~f~~~Lr~~GveV~~l~dll~e~l~~~~~r~~~l~~~~~~~~~~~~~~~~l~~~l~~~~~~~l~~~ 120 (406)
T PRK01388 41 DDVPWVERAQKEHDAFAQTLRDRGVEVLYLEDLLAETLANPEAREWFLDRQISEARVGLGLADELRAYLESLDNRELAEK 120 (406)
T ss_pred ccCCCHHHHHHHHHHHHHHHHHCCCEEEEhHHHHHHHhcCHHHHHHHHHHHhhhccCCHHHHHHHHHHHHhCCHHHHHHH
Confidence 347899999999999999999999999998732
Q ss_pred ------------------------------CCCCCceecCCceEEECCeEEEeCCCCccCcChhhhhHHHHhhc--c---
Q psy945 127 ------------------------------EDLPESVFIEDTAVIVNGIVLITKPGNTQRHKEVAVSATRALIK--P--- 171 (344)
Q Consensus 127 ------------------------------~~~PD~VF~~D~a~v~~~~avi~r~~~~~Rr~E~~~~~t~~~~~--l--- 171 (344)
+.+||++|+||.+++++++++|+||+.+.||+|...++ .+++ +
T Consensus 121 ~i~g~~~~el~~~~~~~l~~~~~~~~~f~~~PlpN~vFtRD~~~~i~~g~ii~~m~~~~Rr~E~~~~~--~i~~~hp~f~ 198 (406)
T PRK01388 121 LIGGVAKSELPESKAKSLVRLMHDPYDFVLDPLPNLLFTRDPSAWIGGGVTINPMAWPARRRETLLTE--AIYKHHPRFA 198 (406)
T ss_pred HHcCCcHHHcccccchhHHHHhhcCCccccCCCCCeEeccCcEEEECCEEEEeCCCChhhCccHHHHH--HHHHhCcccc
Confidence 35899999999999999999999999999999999876 4443 1
Q ss_pred --cccccccccccccCCCCcccccccCCCCe-EEEeeeccCCCcccCCC------------C--CCCCCcc-cccccccc
Q psy945 172 --KSLAASYSAASAHSNEPVAKEHFDFFPNR-EVVGYGFNGEPGYIDNP------------M--FPYPSIR-FKAITKEL 233 (344)
Q Consensus 172 --~~~~i~~~~~~~~~~eG~~~~~~~~ig~r-~~VG~G~rt~p~y~d~~------------d--~pLpd~r-f~~~~~~l 233 (344)
+.+.+.....+++.+|| |++.+++. ++||+|.||++...+.. . +++|..| |.|||+.+
T Consensus 199 ~~~~~i~~~~~~~~~iEGG----Dvl~l~~~~l~IG~s~RTn~~gie~La~~l~~~~~~~~Vi~v~ip~~r~~mHLDtvf 274 (406)
T PRK01388 199 GADVPVWDDRHGNATLEGG----DVLVLGKGVVAIGMSERTSPQAIEQLARSLFKKGAAKRVLAVEIPKSRAFMHLDTVF 274 (406)
T ss_pred cCCceEEecCCCCCCCccc----eEEEECCCEEEEEecccCCHHHHHHHHHHHhhcCCCcEEEEEeccCCCCceeCCccc
Confidence 12222222367788888 66666764 69999999999887311 1 4467665 88999999
Q ss_pred cccccc
Q psy945 234 EPLKQK 239 (344)
Q Consensus 234 ~~L~ek 239 (344)
+.+++.
T Consensus 275 t~vd~d 280 (406)
T PRK01388 275 TMVDYD 280 (406)
T ss_pred cccCCC
Confidence 999965
No 11
>COG4874 Uncharacterized protein conserved in bacteria containing a pentein-type domain [Function unknown]
Probab=98.56 E-value=1.1e-07 Score=89.64 Aligned_cols=64 Identities=23% Similarity=0.322 Sum_probs=59.0
Q ss_pred hHHHHHHHHHHHHHHHHCCCeEEEeCC--CCCCCCceecCCceEEEC-CeEEEeCCCCccCcChhhh
Q psy945 99 IDEARRQHESFVALLRDLGLDVIELPP--DEDLPESVFIEDTAVIVN-GIVLITKPGNTQRHKEVAV 162 (344)
Q Consensus 99 ~d~A~~Qh~~lv~~Lr~~Gv~V~~L~~--~~~~PD~VF~~D~a~v~~-~~avi~r~~~~~Rr~E~~~ 162 (344)
.+.|..|++++++-|++.||+|++++. .++.||+||++||+.++. |.+++++|..++||.|...
T Consensus 52 ~q~A~~Ef~amve~L~~~GvdV~ifddtg~~~TPDsvFPNNWFSTh~~g~v~LyPM~~~nRRlER~~ 118 (318)
T COG4874 52 LQRAMSEFNAMVEGLRQAGVDVVIFDDTGQGETPDSVFPNNWFSTHEAGEVFLYPMACANRRLERPE 118 (318)
T ss_pred HHHHHHHHHHHHHHHHhcCceEEEeecCCCCCCCcccCCCcccccCcCCeEEEeeccCccccccchH
Confidence 678999999999999999999999996 567899999999999997 6699999999999999876
No 12
>COG2235 ArcA Arginine deiminase [Amino acid transport and metabolism]
Probab=98.48 E-value=4.4e-07 Score=91.20 Aligned_cols=138 Identities=17% Similarity=0.239 Sum_probs=104.3
Q ss_pred ccChHHHHHHHHHHHHHHHHCCCeEEEeCC--------C-----------------------------------------
Q psy945 96 KVGIDEARRQHESFVALLRDLGLDVIELPP--------D----------------------------------------- 126 (344)
Q Consensus 96 ~vD~d~A~~Qh~~lv~~Lr~~Gv~V~~L~~--------~----------------------------------------- 126 (344)
.+..+.|+++|+.|+++||+.|++|++|+. .
T Consensus 42 v~~l~~AqkEH~~f~~~Lr~~gveV~~L~dll~e~l~~~~~~r~~fl~~~i~~~~~~~~~~~~~~r~~l~~l~~~~~l~~ 121 (409)
T COG2235 42 VPYLEKAQKEHDTFAQILREQGIEVLYLEDLLAETLDKANAARNQFIETYLDEAEIKGTGTADEVRAYLLSLDKNKELIS 121 (409)
T ss_pred cccHHHHHHHHHHHHHHHHHcCCeEEEeHHHHHHHhccCHHHHHHHHHHHhhhcccccccHHHHHHHHHHhcCchHHHHH
Confidence 567899999999999999999999998751 0
Q ss_pred --------------------------------CCCCCceecCCceEEECCeEEEeCCCCccCcChhhhhHHHHhhccccc
Q psy945 127 --------------------------------EDLPESVFIEDTAVIVNGIVLITKPGNTQRHKEVAVSATRALIKPKSL 174 (344)
Q Consensus 127 --------------------------------~~~PD~VF~~D~a~v~~~~avi~r~~~~~Rr~E~~~~~t~~~~~l~~~ 174 (344)
+.+|+..|+||...++++|++|++|..+.|++|....+.-.-+++.-.
T Consensus 122 ~l~~G~~~~e~p~~~~~~~~~~~~~~~~~f~~~PLpNlyFtRD~~~~i~~GV~in~M~~~aR~rEt~i~~~i~~~hp~f~ 201 (409)
T COG2235 122 VLISGMGKIDLPEIDAKNSLEDRATINSDFLIDPLPNLYFTRDPFASIGNGVSINRMKYPARRRETLIAEFILKFHPIFA 201 (409)
T ss_pred HHhhceeecccCCccccccchhhccCCCceeccCCcccccccCCceEeeCCEEecccchhhhhhHHHHHHHHHHhCcccc
Confidence 126889999999999999999999999999999988653222333333
Q ss_pred c---cccccccccCCCCcccccccCCC-CeEEEeeeccCCCcccC---------C------CCCCCCC-ccccccccccc
Q psy945 175 A---ASYSAASAHSNEPVAKEHFDFFP-NREVVGYGFNGEPGYID---------N------PMFPYPS-IRFKAITKELE 234 (344)
Q Consensus 175 ~---i~~~~~~~~~~eG~~~~~~~~ig-~r~~VG~G~rt~p~y~d---------~------~d~pLpd-~rf~~~~~~l~ 234 (344)
+ +.......+.+|| |++.++ +..++|.|-||++..+. . ..+.+|- --+-||||.+.
T Consensus 202 ~~~~~~~~~~~~~iEGG----D~~~~g~~~~lIGvseRT~~~~ie~La~~lf~~~~~~~~v~a~~~~k~ra~MHLDTvFt 277 (409)
T COG2235 202 GVPIWFGRDDKATIEGG----DFLPLGKDVVLIGVSERTDWGAVEALAKLLFKDDVGFKRVIAIEIPKNRAFMHLDTVFT 277 (409)
T ss_pred CCeeecCCCccceecCc----ceEEecCceEEEEecccCCHHHHHHHHHHhhcCCcceeEEeeeecccchhhhchheeee
Confidence 3 1112244778888 566667 77899999999998882 1 1144665 33459999998
Q ss_pred ccc
Q psy945 235 PLK 237 (344)
Q Consensus 235 ~L~ 237 (344)
-.+
T Consensus 278 ~vd 280 (409)
T COG2235 278 MVD 280 (409)
T ss_pred eec
Confidence 877
No 13
>PF06592 DUF1138: Protein of unknown function (DUF1138); InterPro: IPR009515 This family consists of several hypothetical short plant proteins from Arabidopsis thaliana and Oryza sativa. The function of this family is unknown.
Probab=77.57 E-value=3.9 Score=32.11 Aligned_cols=46 Identities=20% Similarity=0.113 Sum_probs=28.8
Q ss_pred hHHHHHHHHHHHHHHH--HHHHHHhccCCCCCCCCcH-HHHHHHHHHHHH
Q psy945 276 GAFGIAFMVVTVSLLT--YVLMYKVGIFPEKAETLSE-EHRQEMLQKMID 322 (344)
Q Consensus 276 ~vfg~~~v~ig~~~~~--~~~~r~~~~~~p~P~tlse-ewk~a~~~~~~~ 322 (344)
.|++..+.+.+++.+. +...+.- --..+|+|+++ ||.++..+|+..
T Consensus 3 yIi~al~gsfa~ay~c~~~vs~~K~-FGGt~p~Tv~~keW~~aT~~K~qa 51 (73)
T PF06592_consen 3 YIIGALAGSFAVAYVCDQYVSDKKI-FGGTTPHTVSNKEWWKATDKKFQA 51 (73)
T ss_pred EEeehHHHHHHHHHHHHHHHhcccc-cCCCCCCccccHHHHHHHHHHHhh
Confidence 4555555555555433 3333333 23468888875 999999999875
No 14
>COG3911 Predicted ATPase [General function prediction only]
Probab=64.91 E-value=9.5 Score=34.63 Aligned_cols=32 Identities=38% Similarity=0.616 Sum_probs=29.0
Q ss_pred ccChHHHHHHHHHHHHHHHHCCCeEEEeCCCC
Q psy945 96 KVGIDEARRQHESFVALLRDLGLDVIELPPDE 127 (344)
Q Consensus 96 ~vD~d~A~~Qh~~lv~~Lr~~Gv~V~~L~~~~ 127 (344)
+...|.|+..|+.|+++|.++|.+++.||+.+
T Consensus 140 k~tldeAv~~~e~lv~aYt~LGyelv~lp~a~ 171 (183)
T COG3911 140 KITLDEAVAFYEVLVAAYTELGYELVPLPPAP 171 (183)
T ss_pred ccCHHHHHHHHHHHHHHHHhcCceeeecCCcc
Confidence 46789999999999999999999999998754
No 15
>PF06945 DUF1289: Protein of unknown function (DUF1289); InterPro: IPR010710 This family consists of a number of hypothetical bacterial proteins. The aligned region spans around 56 residues and contains 4 highly conserved cysteine residues towards the N terminus. The function of this family is unknown.
Probab=52.51 E-value=6.6 Score=28.54 Aligned_cols=19 Identities=26% Similarity=0.681 Sum_probs=16.4
Q ss_pred hccCCccccCHHHHHHHHH
Q psy945 38 KEKGDWKNLTLEEKKTLYR 56 (344)
Q Consensus 38 ke~~~w~~l~~~ek~~lyr 56 (344)
.|..+|+.||.+||.++.+
T Consensus 25 dEI~~W~~~s~~er~~i~~ 43 (51)
T PF06945_consen 25 DEIRDWKSMSDDERRAILA 43 (51)
T ss_pred HHHHHHhhCCHHHHHHHHH
Confidence 4788999999999998765
No 16
>PRK12906 secA preprotein translocase subunit SecA; Reviewed
Probab=47.18 E-value=21 Score=39.79 Aligned_cols=61 Identities=23% Similarity=0.107 Sum_probs=44.9
Q ss_pred HHHhhccCCCchhhhhhheeee-eeeeEEeecccccChHHHHHHHHHHHHHHHHCCCeEEEeCCC
Q psy945 63 FVEMNASTGEWKGAFGIAFMVV-TVSLLTYVLMYKVGIDEARRQHESFVALLRDLGLDVIELPPD 126 (344)
Q Consensus 63 ~~e~~~~~~ewk~~~~~~~~~~-~vs~lin~~M~~vD~d~A~~Qh~~lv~~Lr~~Gv~V~~L~~~ 126 (344)
+|||+ |||-||+++.+.... +.++----.| .++--+|.+.++.+...|+.+|++|-.+...
T Consensus 97 iaEm~--TGEGKTLvA~l~a~l~al~G~~v~vv-T~neyLA~Rd~e~~~~~~~~LGl~vg~i~~~ 158 (796)
T PRK12906 97 IAEMK--TGEGKTLTATLPVYLNALTGKGVHVV-TVNEYLSSRDATEMGELYRWLGLTVGLNLNS 158 (796)
T ss_pred ccccc--CCCCCcHHHHHHHHHHHHcCCCeEEE-eccHHHHHhhHHHHHHHHHhcCCeEEEeCCC
Confidence 67885 899999997755444 2222222222 4566699999999999999999999998753
No 17
>PRK13103 secA preprotein translocase subunit SecA; Reviewed
Probab=44.61 E-value=14 Score=41.65 Aligned_cols=61 Identities=30% Similarity=0.212 Sum_probs=44.8
Q ss_pred HHHhhccCCCchhhhhhheeee-eeeeEEeecccccChHHHHHHHHHHHHHHHHCCCeEEEeCCC
Q psy945 63 FVEMNASTGEWKGAFGIAFMVV-TVSLLTYVLMYKVGIDEARRQHESFVALLRDLGLDVIELPPD 126 (344)
Q Consensus 63 ~~e~~~~~~ewk~~~~~~~~~~-~vs~lin~~M~~vD~d~A~~Qh~~lv~~Lr~~Gv~V~~L~~~ 126 (344)
+|||+ |||-||+++.+-.+. +.+.---.. -.++--+|.+.++.+...|+-+|+.|-.+...
T Consensus 99 iaEM~--TGEGKTLvA~l~a~l~al~G~~Vhv-vT~ndyLA~RD~e~m~~l~~~lGl~v~~i~~~ 160 (913)
T PRK13103 99 IAEMR--TGEGKTLVGTLAVYLNALSGKGVHV-VTVNDYLARRDANWMRPLYEFLGLSVGIVTPF 160 (913)
T ss_pred ccccc--CCCCChHHHHHHHHHHHHcCCCEEE-EeCCHHHHHHHHHHHHHHhcccCCEEEEECCC
Confidence 57774 899999987754443 333322222 24566699999999999999999999999754
No 18
>COG3048 DsdA D-serine dehydratase [Amino acid transport and metabolism]
Probab=41.47 E-value=39 Score=34.02 Aligned_cols=86 Identities=28% Similarity=0.353 Sum_probs=56.2
Q ss_pred ccCCccccCHHHHHHHHHHhhhhhHHHhhccCCCchhhhhhheeeeeeeeEEeecccccChHHHHHHHHHHHHHHHHCCC
Q psy945 39 EKGDWKNLTLEEKKTLYRASFAQTFVEMNASTGEWKGAFGIAFMVVTVSLLTYVLMYKVGIDEARRQHESFVALLRDLGL 118 (344)
Q Consensus 39 e~~~w~~l~~~ek~~lyr~~f~~t~~e~~~~~~ewk~~~~~~~~~~~vs~lin~~M~~vD~d~A~~Qh~~lv~~Lr~~Gv 118 (344)
+..|...|-.++.+. |-..|..---.||.--.-||++---+||..-+. |+ .=-+||.. +.||.+||
T Consensus 142 ~~DDYs~L~~~~f~~-----FFs~ysIaVGSTGNLGlSIGI~sA~lGF~vtVH--MS-----ADAr~WKK--d~LRs~gV 207 (443)
T COG3048 142 LEDDYSILLSEEFKD-----FFSRYSIAVGSTGNLGLSIGIMSAALGFKVTVH--MS-----ADARAWKK--DKLRSHGV 207 (443)
T ss_pred ccchHHHhhcHHHHH-----HHHhheEeecccCccceehhhhhhhhcceEEEE--ec-----chHHHHHH--HHHHhcCc
Confidence 344555555444442 223334445678887778888777777665544 33 12367875 78999999
Q ss_pred eEEEeCCC------------CCCCCceecCCc
Q psy945 119 DVIELPPD------------EDLPESVFIEDT 138 (344)
Q Consensus 119 ~V~~L~~~------------~~~PD~VF~~D~ 138 (344)
+|++.+.+ +..|.|-|+.|-
T Consensus 208 ~ViEYe~DY~~AVeeGRk~a~~DP~c~FiDDE 239 (443)
T COG3048 208 TVVEYEQDYGVAVEEGRKEAESDPNCFFIDDE 239 (443)
T ss_pred eEEEecchhhHHHHHhhhhhccCCceEEeccc
Confidence 99999863 247999999885
No 19
>PF11216 DUF3012: Protein of unknown function (DUF3012); InterPro: IPR021379 This family of proteins with unknown function is restricted to Gammaproteobacteria.
Probab=39.85 E-value=9.4 Score=25.52 Aligned_cols=13 Identities=46% Similarity=0.966 Sum_probs=10.7
Q ss_pred ccchhhccCCccc
Q psy945 33 PPLKQKEKGDWKN 45 (344)
Q Consensus 33 ~~l~~ke~~~w~~ 45 (344)
+.+++|.||||..
T Consensus 8 ~~m~~kpK~dWta 20 (32)
T PF11216_consen 8 EDMKEKPKGDWTA 20 (32)
T ss_pred HHHhhCCcccCcH
Confidence 3689999999964
No 20
>cd03143 A4_beta-galactosidase_middle_domain A4 beta-galactosidase middle domain: a type 1 glutamine amidotransferase (GATase1)-like domain. A4 beta-galactosidase middle domain: a type 1 glutamine amidotransferase (GATase1)-like domain. This group includes proteins similar to beta-galactosidase from Thermus thermophilus. Beta-Galactosidase hydrolyzes the beta-1,4-D-galactosidic linkage of lactose, as well as those of related chromogens, o-nitrophenyl-beta-D-galactopyranoside (ONP-Gal) and 5-bromo-4-chloro-3-indolyl-beta-D-galactoside (X-gal). This A4 beta-galactosidase middle domain lacks the catalytic triad of typical GATase1 domains. The reactive Cys residue found in the sharp turn between a beta strand and an alpha helix termed the nucleophile elbow in typical GATase1 domains is not conserved in this group.
Probab=35.69 E-value=98 Score=26.41 Aligned_cols=28 Identities=36% Similarity=0.507 Sum_probs=22.1
Q ss_pred HHHHHHHHHHHHHHCCCeEEEeCCCCCC
Q psy945 102 ARRQHESFVALLRDLGLDVIELPPDEDL 129 (344)
Q Consensus 102 A~~Qh~~lv~~Lr~~Gv~V~~L~~~~~~ 129 (344)
-..+...+.++|++.|+.|.++++...+
T Consensus 24 y~~~~~~~~~~l~~~gi~~d~v~~~~~l 51 (154)
T cd03143 24 YLDLALALYRALRELGIPVDVVPPDADL 51 (154)
T ss_pred HHHHHHHHHHHHHHCCCCEEEECCCCCc
Confidence 3456777888999999999999965544
No 21
>cd03130 GATase1_CobB Type 1 glutamine amidotransferase (GATase1) domain found in Cobyrinic Acid a,c-Diamide Synthase. Type 1 glutamine amidotransferase (GATase1) domain found in Cobyrinic Acid a,c-Diamide Synthase. CobB plays a role in cobalamin biosythesis catalyzing the conversion of cobyrinic acid to cobyrinic acid a,c-diamide. CobB belongs to the triad family of amidotransferases. Two of the three residues of the catalytic triad that are involved in glutamine binding, hydrolysis and transfer of the resulting ammonia to the acceptor substrate in other triad aminodotransferases are conserved in CobB.
Probab=35.67 E-value=70 Score=29.07 Aligned_cols=35 Identities=26% Similarity=0.504 Sum_probs=26.8
Q ss_pred HHHHHHHHHHCCCeEEEeCCC--CCCCCceecCCceEEECCe
Q psy945 106 HESFVALLRDLGLDVIELPPD--EDLPESVFIEDTAVIVNGI 145 (344)
Q Consensus 106 h~~lv~~Lr~~Gv~V~~L~~~--~~~PD~VF~~D~a~v~~~~ 145 (344)
|++-.++|++.|++|+.+.+. +.+|+ -| ++++||+
T Consensus 13 y~e~~~~l~~~G~~v~~~s~~~~~~l~~----~D-~lilPGG 49 (198)
T cd03130 13 YPENLELLEAAGAELVPFSPLKDEELPD----AD-GLYLGGG 49 (198)
T ss_pred cHHHHHHHHHCCCEEEEECCCCCCCCCC----CC-EEEECCC
Confidence 556678999999999999884 55665 14 7778876
No 22
>PF09153 DUF1938: Domain of unknown function (DUF1938); InterPro: IPR015236 This domain, which is predominantly found in the archaeal protein O6-alkylguanine-DNA alkyltransferase, adopts a secondary structure consisting of a three stranded antiparallel beta-sheet and three alpha helices. The exact function has not, as yet, been defined, though it has been postulated that this domain may confer thermostability to the protein []. ; GO: 0005737 cytoplasm; PDB: 1MGT_A.
Probab=35.33 E-value=32 Score=27.91 Aligned_cols=38 Identities=26% Similarity=0.367 Sum_probs=24.3
Q ss_pred cChHHHHH-HHHHHHHHHHHCCCeEEEeCCCCCCCCcee
Q psy945 97 VGIDEARR-QHESFVALLRDLGLDVIELPPDEDLPESVF 134 (344)
Q Consensus 97 vD~d~A~~-Qh~~lv~~Lr~~Gv~V~~L~~~~~~PD~VF 134 (344)
.|-.+-.+ |-..|+.-|++.|+.|..-....+.|+-||
T Consensus 31 lDg~efl~eri~~L~~~L~kRgv~v~L~~~~S~yp~lV~ 69 (86)
T PF09153_consen 31 LDGEEFLRERISRLIEFLKKRGVSVSLDEEPSDYPELVF 69 (86)
T ss_dssp SSHHHHHH-HHHHHHHHHHHTT------B---SHHHHHH
T ss_pred eccHHHHHHHHHHHHHHHHhcCceeEEeecCCCchHHHH
Confidence 56555555 999999999999999988888888999988
No 23
>TIGR02144 LysX_arch Lysine biosynthesis enzyme LysX. The family of proteins found in this equivalog include the characterized LysX from Thermus thermophilus which is part of a well-organized lysine biosynthesis gene cluster. LysX is believed to carry out an ATP-dependent acylation of the amino group of alpha-aminoadipate in the prokaryotic version of the fungal AAA lysine biosynthesis pathway. No species having a sequence in this equivalog contains the elements of the more common diaminopimelate lysine biosythesis pathway, and none has been shown to be a lysine auxotroph. These sequences have mainly recieved the name of the related enzyme, "ribosomal protein S6 modification protein RimK". RimK has been characterized in E. coli, and acts by ATP-dependent condensation of S6 with glutamate residues.
Probab=34.29 E-value=34 Score=31.97 Aligned_cols=27 Identities=22% Similarity=0.274 Sum_probs=22.2
Q ss_pred hHHHHHHHHHHHHHHHHCCCeEEEeCC
Q psy945 99 IDEARRQHESFVALLRDLGLDVIELPP 125 (344)
Q Consensus 99 ~d~A~~Qh~~lv~~Lr~~Gv~V~~L~~ 125 (344)
++..+.+.+.|.++|+++|++|+.++.
T Consensus 5 ~~~~~~~~~~l~~al~~~g~~~~~~~~ 31 (280)
T TIGR02144 5 YDIIRPDEKMLIEELEKLGLPYRKIYV 31 (280)
T ss_pred ecCCCHHHHHHHHHHHHcCCceEEEEh
Confidence 455667789999999999999998754
No 24
>PF03480 SBP_bac_7: Bacterial extracellular solute-binding protein, family 7; InterPro: IPR018389 This family of proteins are involved in binding extracellular solutes for transport across the bacterial cytoplasmic membrane. This family includes a C4-dicarboxylate-binding protein DctP [, ] and the sialic acid-binding protein SiaP. The structure of the SiaP receptor has revealed an overall topology similar to ATP binding cassette ESR (extracytoplasmic solute receptors) proteins []. Upon binding of sialic acid, SiaP undergoes domain closure about a hinge region and kinking of an alpha-helix hinge component [].; GO: 0006810 transport, 0030288 outer membrane-bounded periplasmic space; PDB: 2HZK_C 2HZL_B 2HPG_C 2XWI_A 2XWK_A 2WX9_A 2CEY_A 2WYP_A 3B50_A 2CEX_B ....
Probab=31.92 E-value=1.1e+02 Score=28.97 Aligned_cols=52 Identities=21% Similarity=0.271 Sum_probs=36.0
Q ss_pred CCccccCHHHHHHHHHHhhhhhHHHhhccCCCchhhhhhheeeeeeeeEEeecccccChHHHHHHHHHHHHHHHHCCCeE
Q psy945 41 GDWKNLTLEEKKTLYRASFAQTFVEMNASTGEWKGAFGIAFMVVTVSLLTYVLMYKVGIDEARRQHESFVALLRDLGLDV 120 (344)
Q Consensus 41 ~~w~~l~~~ek~~lyr~~f~~t~~e~~~~~~ewk~~~~~~~~~~~vs~lin~~M~~vD~d~A~~Qh~~lv~~Lr~~Gv~V 120 (344)
.-|++||.|.+.+|....- .+...+ .+.+.+.-++..+.+++.|++|
T Consensus 214 ~~w~~L~~e~q~~l~~~~~-~~~~~~--------------------------------~~~~~~~~~~~~~~~~~~G~~v 260 (286)
T PF03480_consen 214 DWWDSLPDEDQEALDDAAD-EAEARA--------------------------------REYYEAEDEEALKELEENGVTV 260 (286)
T ss_dssp HHHHHS-HHHHHHHHHHHH-HHHHHH--------------------------------HHHHHHHHHHHHHHHHHTT-EE
T ss_pred HHHhcCCHHHHHHHHHHHH-HHHHHH--------------------------------HHHHHHHHHHHHHHHHHCcCEE
Confidence 4599999999998876553 222222 2456667778888999999999
Q ss_pred EEeCC
Q psy945 121 IELPP 125 (344)
Q Consensus 121 ~~L~~ 125 (344)
+.+++
T Consensus 261 ~~~s~ 265 (286)
T PF03480_consen 261 VELSD 265 (286)
T ss_dssp EEGCH
T ss_pred eCCCH
Confidence 88764
No 25
>CHL00122 secA preprotein translocase subunit SecA; Validated
Probab=31.61 E-value=50 Score=37.25 Aligned_cols=60 Identities=27% Similarity=0.188 Sum_probs=42.8
Q ss_pred HHHhhccCCCchhhhhhheeee-eeeeEEeecccccChHHHHHHHHHHHHHHHHCCCeEEEeCC
Q psy945 63 FVEMNASTGEWKGAFGIAFMVV-TVSLLTYVLMYKVGIDEARRQHESFVALLRDLGLDVIELPP 125 (344)
Q Consensus 63 ~~e~~~~~~ewk~~~~~~~~~~-~vs~lin~~M~~vD~d~A~~Qh~~lv~~Lr~~Gv~V~~L~~ 125 (344)
+|||+ |||-||.+..+...+ ++++ -.+..-.++--+|.+..+.+...|+-+|++|-.+.+
T Consensus 93 IaEm~--TGEGKTL~a~l~ayl~aL~G-~~VhVvT~NdyLA~RD~e~m~pvy~~LGLsvg~i~~ 153 (870)
T CHL00122 93 IAEMK--TGEGKTLVATLPAYLNALTG-KGVHIVTVNDYLAKRDQEWMGQIYRFLGLTVGLIQE 153 (870)
T ss_pred ccccc--CCCCchHHHHHHHHHHHhcC-CceEEEeCCHHHHHHHHHHHHHHHHHcCCceeeeCC
Confidence 67775 899999876544432 2111 111222577889999999999999999999998865
No 26
>PRK09200 preprotein translocase subunit SecA; Reviewed
Probab=30.97 E-value=50 Score=36.91 Aligned_cols=77 Identities=22% Similarity=0.101 Sum_probs=50.4
Q ss_pred ccCCCchhhhhhheeee-eeeeEEeecccccChHHHHHHHHHHHHHHHHCCCeEEEeCCCCCC---CCceecCCceEEEC
Q psy945 68 ASTGEWKGAFGIAFMVV-TVSLLTYVLMYKVGIDEARRQHESFVALLRDLGLDVIELPPDEDL---PESVFIEDTAVIVN 143 (344)
Q Consensus 68 ~~~~ewk~~~~~~~~~~-~vs~lin~~M~~vD~d~A~~Qh~~lv~~Lr~~Gv~V~~L~~~~~~---PD~VF~~D~a~v~~ 143 (344)
..|||-||++..+-.+. .++.- ....-.|...+|.+.++.+...++-+|++|-.+...... ....|..|..+.++
T Consensus 98 m~TGeGKTL~a~lp~~l~al~G~-~v~VvTpt~~LA~qd~e~~~~l~~~lGl~v~~i~g~~~~~~~r~~~y~~dIvygT~ 176 (790)
T PRK09200 98 MQTGEGKTLTATMPLYLNALEGK-GVHLITVNDYLAKRDAEEMGQVYEFLGLTVGLNFSDIDDASEKKAIYEADIIYTTN 176 (790)
T ss_pred ecCCCcchHHHHHHHHHHHHcCC-CeEEEeCCHHHHHHHHHHHHHHHhhcCCeEEEEeCCCCcHHHHHHhcCCCEEEECC
Confidence 46899999977654442 22211 122235778899999999999999999999988754321 12345556555555
Q ss_pred Ce
Q psy945 144 GI 145 (344)
Q Consensus 144 ~~ 145 (344)
+.
T Consensus 177 ~~ 178 (790)
T PRK09200 177 SE 178 (790)
T ss_pred cc
Confidence 44
No 27
>PF14156 AbbA_antirepres: Antirepressor AbbA
Probab=30.83 E-value=46 Score=25.40 Aligned_cols=21 Identities=43% Similarity=0.532 Sum_probs=19.4
Q ss_pred cccCHHHHHHHHHHhhhhhHH
Q psy945 44 KNLTLEEKKTLYRASFAQTFV 64 (344)
Q Consensus 44 ~~l~~~ek~~lyr~~f~~t~~ 64 (344)
..||.|||+.|--+=|+|.||
T Consensus 7 ~rlt~EE~~LLLdiLf~q~YA 27 (63)
T PF14156_consen 7 ERLTEEEKKLLLDILFQQNYA 27 (63)
T ss_pred HHhhHHHHHHHHHHHHHHHHH
Confidence 468999999999999999997
No 28
>PRK02724 hypothetical protein; Provisional
Probab=30.06 E-value=24 Score=29.59 Aligned_cols=45 Identities=11% Similarity=0.069 Sum_probs=37.9
Q ss_pred ecccccChHHHHHHHHHHHHHHHHCCCeEEEeCCCCCCCCceecC
Q psy945 92 VLMYKVGIDEARRQHESFVALLRDLGLDVIELPPDEDLPESVFIE 136 (344)
Q Consensus 92 ~~M~~vD~d~A~~Qh~~lv~~Lr~~Gv~V~~L~~~~~~PD~VF~~ 136 (344)
||+..-|.-..+.|..++++.-.+.|-+...-+.+...||+||+.
T Consensus 58 ~WIs~~~~i~lLN~~TeiIN~WQe~~~~~~~~eaq~kFPd~~f~g 102 (104)
T PRK02724 58 HWITENERVEVLNKATEVINYWQEEGKGKPLAEAQLKFPEVTFCG 102 (104)
T ss_pred CCCCHHHHHHHHHHHHHHHHHHHHccCCCChHHHHhcCCCeEEee
Confidence 666777778888999999999999887777777888899999975
No 29
>PRK01372 ddl D-alanine--D-alanine ligase; Reviewed
Probab=28.61 E-value=63 Score=30.66 Aligned_cols=32 Identities=25% Similarity=0.386 Sum_probs=24.0
Q ss_pred cChHHHHHHHHHHHHHHHHCCCeEEEeCCCCC
Q psy945 97 VGIDEARRQHESFVALLRDLGLDVIELPPDED 128 (344)
Q Consensus 97 vD~d~A~~Qh~~lv~~Lr~~Gv~V~~L~~~~~ 128 (344)
.+.+.....=..+.++|+++|++|+.++..+.
T Consensus 16 ~~~~~~~~s~~~i~~al~~~g~~v~~i~~~~~ 47 (304)
T PRK01372 16 AEREVSLNSGAAVLAALREAGYDAHPIDPGED 47 (304)
T ss_pred CCceEeHHhHHHHHHHHHHCCCEEEEEecCcc
Confidence 33444445558899999999999999987654
No 30
>cd09013 BphC-JF8_N_like N-terminal, non-catalytic, domain of BphC_JF8, (2,3-dihydroxybiphenyl 1,2-dioxygenase) from Bacillus sp. JF8 and similar proteins. 2,3-dihydroxybiphenyl 1,2-dioxygenase (BphC) catalyzes the extradiol ring cleavage reaction of 2,3-dihydroxybiphenyl, a key step in the polychlorinated biphenyls (PCBs) degradation pathway (bph pathway). BphC belongs to the type I extradiol dioxygenase family, which requires a metal ion in the active site in its catalytic mechanism. Polychlorinated biphenyl degrading bacteria demonstrate a multiplicity of BphCs. This subfamily of BphC is represented by the enzyme purified from the thermophilic biphenyl and naphthalene degrader, Bacillus sp. JF8. The members in this family of BphC enzymes may use either Mn(II) or Fe(II) as cofactors. The enzyme purified from Bacillus sp. JF8 is Mn(II)-dependent, however, the enzyme from Rhodococcus jostii RHAI has Fe(II) bound to it. BphC_JF8 is thermostable and its optimum activity is at 85 degrees C
Probab=27.77 E-value=81 Score=25.15 Aligned_cols=35 Identities=20% Similarity=0.113 Sum_probs=23.7
Q ss_pred HHHHHHHHHHHHCCCeEEEeCCCCCCCCceecCCc
Q psy945 104 RQHESFVALLRDLGLDVIELPPDEDLPESVFIEDT 138 (344)
Q Consensus 104 ~Qh~~lv~~Lr~~Gv~V~~L~~~~~~PD~VF~~D~ 138 (344)
++.+++.+.|++.|+++...+..++....+|.+|.
T Consensus 74 ~~v~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~DP 108 (121)
T cd09013 74 EALERRVAALEASGLGIGWIEGDPGHGKAYRFRSP 108 (121)
T ss_pred HHHHHHHHHHHHcCCccccccCCCCCcceEEEECC
Confidence 56677778899999998554444444556666654
No 31
>PF00505 HMG_box: HMG (high mobility group) box; InterPro: IPR000910 High mobility group (HMG or HMGB) proteins are a family of relatively low molecular weight non-histone components in chromatin. HMG1 (also called HMG-T in fish) and HMG2 are two highly related proteins that bind single-stranded DNA preferentially and unwind double-stranded DNA. Although they have no sequence specificity, they have a high affinity for bent or distorted DNA, and bend linear DNA. HMG1 and HMG2 contain two DNA-binding HMG-box domains (A and B) that show structural and functional differences, and have a long acidic C-terminal domain rich in aspartic and glutamic acid residues. The acidic tail modulates the affinity of the tandem HMG boxes in HMG1 and 2 for a variety of DNA targets. HMG1 and 2 appear to play important architectural roles in the assembly of nucleoprotein complexes in a variety of biological processes, for example V(D)J recombination, the initiation of transcription, and DNA repair []. The profile in this entry describing the HMG-domains is much more general than the signature. In addition to the HMG1 and HMG2 proteins, HMG-domains occur in single or multiple copies in the following protein classes; the SOX family of transcription factors; SRY sex determining region Y protein and related proteins []; LEF1 lymphoid enhancer binding factor 1 []; SSRP recombination signal recognition protein; MTF1 mitochondrial transcription factor 1; UBF1/2 nucleolar transcription factors; Abf2 yeast ARS-binding factor []; and Saccharomyces cerevisiae transcription factors Ixr1, Rox1, Nhp6a, Nhp6b and Spp41.; GO: 0003677 DNA binding; PDB: 1I11_A 1J3C_A 1J3D_A 1WZ6_A 1WGF_A 2D7L_A 1GT0_D 3U2B_C 2CRJ_A 2CS1_A ....
Probab=27.47 E-value=25 Score=25.73 Aligned_cols=23 Identities=35% Similarity=0.471 Sum_probs=15.8
Q ss_pred CCccccCHHHHHHHHHHhhhhhHH
Q psy945 41 GDWKNLTLEEKKTLYRASFAQTFV 64 (344)
Q Consensus 41 ~~w~~l~~~ek~~lyr~~f~~t~~ 64 (344)
.-|++||.+||. -|+--+...-.
T Consensus 37 ~~W~~l~~~eK~-~y~~~a~~~~~ 59 (69)
T PF00505_consen 37 QMWKNLSEEEKA-PYKEEAEEEKE 59 (69)
T ss_dssp HHHHCSHHHHHH-HHHHHHHHHHH
T ss_pred HHHhcCCHHHHH-HHHHHHHHHHH
Confidence 369999999987 56555544433
No 32
>cd01388 SOX-TCF_HMG-box SOX-TCF_HMG-box, class I member of the HMG-box superfamily of DNA-binding proteins. These proteins contain a single HMG box, and bind the minor groove of DNA in a highly sequence-specific manner. Members include SRY and its homologs in insects and vertebrates, and transcription factor-like proteins, TCF-1, -3, -4, and LEF-1. They appear to bind the minor groove of the A/T C A A A G/C-motif.
Probab=26.01 E-value=69 Score=24.14 Aligned_cols=33 Identities=24% Similarity=0.280 Sum_probs=19.9
Q ss_pred CCccccCHHHHHHHHHHhhhhhHHHhhccCCCch
Q psy945 41 GDWKNLTLEEKKTLYRASFAQTFVEMNASTGEWK 74 (344)
Q Consensus 41 ~~w~~l~~~ek~~lyr~~f~~t~~e~~~~~~ewk 74 (344)
..|+.||.+||.- |.-...+--.+...-..+||
T Consensus 38 ~~Wk~ls~~eK~~-y~~~a~~~k~~y~~~~p~y~ 70 (72)
T cd01388 38 DRWKALSNEEKQP-YYEEAKKLKELHMKLYPDYK 70 (72)
T ss_pred HHHHcCCHHHHHH-HHHHHHHHHHHHHHHCcCCC
Confidence 4699999999984 44333344444444444443
No 33
>cd04501 SGNH_hydrolase_like_4 Members of the SGNH-hydrolase superfamily, a diverse family of lipases and esterases. The tertiary fold of the enzyme is substantially different from that of the alpha/beta hydrolase family and unique among all known hydrolases; its active site closely resembles the Ser-His-Asp(Glu) triad from other serine hydrolases, but may lack the carboxlic acid.
Probab=25.43 E-value=1.6e+02 Score=25.36 Aligned_cols=38 Identities=18% Similarity=0.231 Sum_probs=30.1
Q ss_pred eecccccChHHHHHHHHHHHHHHHHCCCeEEEeCCCCC
Q psy945 91 YVLMYKVGIDEARRQHESFVALLRDLGLDVIELPPDED 128 (344)
Q Consensus 91 n~~M~~vD~d~A~~Qh~~lv~~Lr~~Gv~V~~L~~~~~ 128 (344)
|-.....+.+...+.++.+++.+++.|.+|+++.+.|.
T Consensus 70 ND~~~~~~~~~~~~~~~~li~~~~~~~~~~il~~~~p~ 107 (183)
T cd04501 70 NDIIVNTSLEMIKDNIRSMVELAEANGIKVILASPLPV 107 (183)
T ss_pred CccccCCCHHHHHHHHHHHHHHHHHCCCcEEEEeCCCc
Confidence 33333457888999999999999999999888876653
No 34
>PF07517 SecA_DEAD: SecA DEAD-like domain; InterPro: IPR011115 SecA protein binds to the plasma membrane where it interacts with proOmpA to support translocation of proOmpA through the membrane. SecA protein achieves this translocation, in association with SecY protein, in an ATP-dependent manner [,]. This domain represents the N-terminal ATP-dependent helicase domain, which is related to the IPR0011545 from INTERPRO.; GO: 0005524 ATP binding, 0017038 protein import, 0016020 membrane; PDB: 1NL3_B 1NKT_B 3DIN_B 3JUX_A 2FSG_B 2VDA_A 2FSH_A 2FSF_A 2FSI_A 3BXZ_A ....
Probab=25.16 E-value=55 Score=31.72 Aligned_cols=75 Identities=23% Similarity=0.178 Sum_probs=48.5
Q ss_pred HHHhhccCCCchhhhhhheeee-eeeeEEeecccccChHHHHHHHHHHHHHHHHCCCeEEEeCCCCCCCC---ceecCCc
Q psy945 63 FVEMNASTGEWKGAFGIAFMVV-TVSLLTYVLMYKVGIDEARRQHESFVALLRDLGLDVIELPPDEDLPE---SVFIEDT 138 (344)
Q Consensus 63 ~~e~~~~~~ewk~~~~~~~~~~-~vs~lin~~M~~vD~d~A~~Qh~~lv~~Lr~~Gv~V~~L~~~~~~PD---~VF~~D~ 138 (344)
++|| .|||-||++.++.-.+ ++.+ -.+..-.++--+|.+.++.+...|.-+|++|-.+.+... ++ .+|..|.
T Consensus 94 laEm--~TGEGKTli~~l~a~~~AL~G-~~V~vvT~NdyLA~RD~~~~~~~y~~LGlsv~~~~~~~~-~~~r~~~Y~~dI 169 (266)
T PF07517_consen 94 LAEM--KTGEGKTLIAALPAALNALQG-KGVHVVTSNDYLAKRDAEEMRPFYEFLGLSVGIITSDMS-SEERREAYAADI 169 (266)
T ss_dssp EEEE--STTSHHHHHHHHHHHHHHTTS-S-EEEEESSHHHHHHHHHHHHHHHHHTT--EEEEETTTE-HHHHHHHHHSSE
T ss_pred eEEe--cCCCCcHHHHHHHHHHHHHhc-CCcEEEeccHHHhhccHHHHHHHHHHhhhccccCccccC-HHHHHHHHhCcc
Confidence 4444 5999999988755443 1111 112223567789999999999999999999988876543 22 3677774
Q ss_pred eEE
Q psy945 139 AVI 141 (344)
Q Consensus 139 a~v 141 (344)
.-.
T Consensus 170 ~Y~ 172 (266)
T PF07517_consen 170 VYG 172 (266)
T ss_dssp EEE
T ss_pred ccc
Confidence 333
No 35
>TIGR03714 secA2 accessory Sec system translocase SecA2. Members of this protein family are homologous to SecA and part of the accessory Sec system. This system, including both five core proteins for export and a variable number of proteins for glycosylation, operates in certain Gram-positive pathogens for the maturation and delivery of serine-rich glycoproteins such as the cell surface glycoprotein GspB in Streptococcus gordonii.
Probab=24.19 E-value=99 Score=34.49 Aligned_cols=57 Identities=19% Similarity=0.074 Sum_probs=39.7
Q ss_pred hccCCCchhhhhhheeee-eeeeEEeecccccChHHHHHHHHHHHHHHHHCCCeEEEeC
Q psy945 67 NASTGEWKGAFGIAFMVV-TVSLLTYVLMYKVGIDEARRQHESFVALLRDLGLDVIELP 124 (344)
Q Consensus 67 ~~~~~ewk~~~~~~~~~~-~vs~lin~~M~~vD~d~A~~Qh~~lv~~Lr~~Gv~V~~L~ 124 (344)
+.+|||-||....+-... ..+.- ....-.|...+|.++++.+...++.+|++|..+-
T Consensus 89 em~TGeGKTLta~Lpa~l~aL~g~-~V~VVTpn~yLA~Rdae~m~~l~~~LGLsv~~~~ 146 (762)
T TIGR03714 89 EMKTGEGKTLTATMPLYLNALTGK-GAMLVTTNDYLAKRDAEEMGPVYEWLGLTVSLGV 146 (762)
T ss_pred EecCCcchHHHHHHHHHHHhhcCC-ceEEeCCCHHHHHHHHHHHHHHHhhcCCcEEEEE
Confidence 357899999976655321 11110 1222257889999999999999999999987754
No 36
>cd01390 HMGB-UBF_HMG-box HMGB-UBF_HMG-box, class II and III members of the HMG-box superfamily of DNA-binding proteins. These proteins bind the minor groove of DNA in a non-sequence specific fashion and contain two or more tandem HMG boxes. Class II members include non-histone chromosomal proteins, HMG1 and HMG2, which bind to bent or distorted DNA such as four-way DNA junctions, synthetic DNA cruciforms, kinked cisplatin-modified DNA, DNA bulges, cross-overs in supercoiled DNA, and can cause looping of linear DNA. Class III members include nucleolar and mitochondrial transcription factors, UBF and mtTF1, which bind four-way DNA junctions.
Probab=23.91 E-value=79 Score=22.70 Aligned_cols=22 Identities=36% Similarity=0.585 Sum_probs=15.3
Q ss_pred cCCccccCHHHHHHHHHHhhhhh
Q psy945 40 KGDWKNLTLEEKKTLYRASFAQT 62 (344)
Q Consensus 40 ~~~w~~l~~~ek~~lyr~~f~~t 62 (344)
.-.|++||.+||. -|.-.+.+-
T Consensus 36 ~~~W~~ls~~eK~-~y~~~a~~~ 57 (66)
T cd01390 36 GEKWKELSEEEKK-KYEEKAEKD 57 (66)
T ss_pred HHHHHhCCHHHHH-HHHHHHHHH
Confidence 3479999999998 555444333
No 37
>PF14258 DUF4350: Domain of unknown function (DUF4350)
Probab=22.79 E-value=2e+02 Score=21.32 Aligned_cols=53 Identities=19% Similarity=0.273 Sum_probs=30.2
Q ss_pred HHHHHHHHHHCCCeEEEeCCC-CCCCCceecCCceEEECCeEEEeCCCCccCc-ChhhhhHHHHhhc
Q psy945 106 HESFVALLRDLGLDVIELPPD-EDLPESVFIEDTAVIVNGIVLITKPGNTQRH-KEVAVSATRALIK 170 (344)
Q Consensus 106 h~~lv~~Lr~~Gv~V~~L~~~-~~~PD~VF~~D~a~v~~~~avi~r~~~~~Rr-~E~~~~~t~~~~~ 170 (344)
..+|.+.|++.|++|...... +.++. .+++.|+..+....-. .|.+.+. .|.+
T Consensus 7 ~~a~~~~L~~~g~~v~~~~~~~~~l~~----------~~~tll~i~~~~~~~~~~~~~~l~--~~v~ 61 (70)
T PF14258_consen 7 TYALYQLLEEQGVKVERWRKPYEALEA----------DDGTLLVIGPDLRLSEPEEAEALL--EWVE 61 (70)
T ss_pred HHHHHHHHHHCCCeeEEecccHHHhCC----------CCCEEEEEeCCCCCCchHHHHHHH--HHHH
Confidence 356788999999999876642 12321 3445555555433332 4454444 4544
No 38
>cd01828 sialate_O-acetylesterase_like2 sialate_O-acetylesterase_like subfamily of the SGNH-hydrolases, a diverse family of lipases and esterases. The tertiary fold of the enzyme is substantially different from that of the alpha/beta hydrolase family and unique among all known hydrolases; its active site closely resembles the Ser-His-Asp(Glu) triad found in other serine hydrolases.
Probab=22.32 E-value=2.3e+02 Score=23.95 Aligned_cols=38 Identities=8% Similarity=0.019 Sum_probs=31.4
Q ss_pred EeecccccChHHHHHHHHHHHHHHHH--CCCeEEEeCCCC
Q psy945 90 TYVLMYKVGIDEARRQHESFVALLRD--LGLDVIELPPDE 127 (344)
Q Consensus 90 in~~M~~vD~d~A~~Qh~~lv~~Lr~--~Gv~V~~L~~~~ 127 (344)
+|-.+...+.+...+..+.+++.+++ .+.+|+++.+.+
T Consensus 58 ~ND~~~~~~~~~~~~~l~~li~~~~~~~~~~~vi~~~~~p 97 (169)
T cd01828 58 INDLAQGTSDEDIVANYRTILEKLRKHFPNIKIVVQSILP 97 (169)
T ss_pred ccCCCCCCCHHHHHHHHHHHHHHHHHHCCCCeEEEEecCC
Confidence 45555567889999999999999999 899999987654
No 39
>PRK14569 D-alanyl-alanine synthetase A; Provisional
Probab=21.85 E-value=2e+02 Score=27.63 Aligned_cols=31 Identities=10% Similarity=0.155 Sum_probs=26.8
Q ss_pred ccChHHHHHHHHHHHHHHHHCCCeEEEeCCC
Q psy945 96 KVGIDEARRQHESFVALLRDLGLDVIELPPD 126 (344)
Q Consensus 96 ~vD~d~A~~Qh~~lv~~Lr~~Gv~V~~L~~~ 126 (344)
....+-..+.-.+..++|++.|++|+.++..
T Consensus 14 s~e~~vsl~s~~~v~~aL~~~g~~~~~~~~~ 44 (296)
T PRK14569 14 SPEREVSLKSGKAVLDSLISQGYDAVGVDAS 44 (296)
T ss_pred CCchHhHHHHHHHHHHHHHHcCCEEEEEcCC
Confidence 4567888888899999999999999999864
No 40
>cd08344 MhqB_like_N N-terminal domain of MhqB, a type I extradiol dioxygenase, and similar proteins. This subfamily contains the N-terminal, non-catalytic, domain of Burkholderia sp. NF100 MhqB and similar proteins. MhqB is a type I extradiol dioxygenase involved in the catabolism of methylhydroquinone, an intermediate in the degradation of fenitrothion. The purified enzyme has shown extradiol ring cleavage activity toward 3-methylcatechol. Fe2+ was suggested as a cofactor, the same as most other enzymes in the family. Burkholderia sp. NF100 MhqB is encoded on the plasmid pNF1. The type I family of extradiol dioxygenases contains two structurally homologous barrel-shaped domains at the N- and C-terminal. The active-site metal is located in the C-terminal barrel and plays an essential role in the catalytic mechanism.
Probab=21.80 E-value=1.3e+02 Score=23.59 Aligned_cols=34 Identities=21% Similarity=0.434 Sum_probs=22.5
Q ss_pred HHHHHHHHHHHHCCCeEEEeCCCCCCCCceecCCc
Q psy945 104 RQHESFVALLRDLGLDVIELPPDEDLPESVFIEDT 138 (344)
Q Consensus 104 ~Qh~~lv~~Lr~~Gv~V~~L~~~~~~PD~VF~~D~ 138 (344)
...+++++.|+++|+++.. ++.+....++|.+|.
T Consensus 66 ~d~~~~~~~l~~~Gi~~~~-~~~~~~~~~~~~~DP 99 (112)
T cd08344 66 DDFAAFARHLEAAGVALAA-APPGADPDGVWFRDP 99 (112)
T ss_pred hhHHHHHHHHHHcCCceec-CCCcCCCCEEEEECC
Confidence 4567789999999999753 333333456666554
No 41
>PRK12326 preprotein translocase subunit SecA; Reviewed
Probab=21.73 E-value=1.3e+02 Score=33.59 Aligned_cols=61 Identities=30% Similarity=0.200 Sum_probs=44.8
Q ss_pred HHHhhccCCCchhhhhhheeeee-eeeEEeecccccChHHHHHHHHHHHHHHHHCCCeEEEeCCC
Q psy945 63 FVEMNASTGEWKGAFGIAFMVVT-VSLLTYVLMYKVGIDEARRQHESFVALLRDLGLDVIELPPD 126 (344)
Q Consensus 63 ~~e~~~~~~ewk~~~~~~~~~~~-vs~lin~~M~~vD~d~A~~Qh~~lv~~Lr~~Gv~V~~L~~~ 126 (344)
.|||+ |||-||+++.+..+.. .++-- +..-.++--+|.+-++.+...|+.+|++|-++.+.
T Consensus 95 VaEM~--TGEGKTLvA~l~a~l~AL~G~~-VhvvT~NdyLA~RDae~m~~ly~~LGLsvg~i~~~ 156 (764)
T PRK12326 95 VIEMA--TGEGKTLAGAIAAAGYALQGRR-VHVITVNDYLARRDAEWMGPLYEALGLTVGWITEE 156 (764)
T ss_pred ccccc--CCCCHHHHHHHHHHHHHHcCCC-eEEEcCCHHHHHHHHHHHHHHHHhcCCEEEEECCC
Confidence 57775 8999999887655542 22221 22225667799999999999999999999998753
No 42
>TIGR00768 rimK_fam alpha-L-glutamate ligases, RimK family. This family, related to bacterial glutathione synthetases, contains at least two different alpha-L-glutamate ligases. One is RimK, as in E. coli, which adds additional Glu residues to the native Glu-Glu C-terminus of ribosomal protein S6, but not to Lys-Glu mutants. Most species with a member of this subfamily lack an S6 homolog ending in Glu-Glu, however. Members in Methanococcus jannaschii act instead as a tetrahydromethanopterin:alpha-l-glutamate ligase (MJ0620) and a gamma-F420-2:alpha-l-glutamate ligase (MJ1001).
Probab=21.57 E-value=72 Score=29.38 Aligned_cols=22 Identities=18% Similarity=0.363 Sum_probs=19.4
Q ss_pred HHHHHHHHHHHCCCeEEEeCCC
Q psy945 105 QHESFVALLRDLGLDVIELPPD 126 (344)
Q Consensus 105 Qh~~lv~~Lr~~Gv~V~~L~~~ 126 (344)
..+.|+++++++|++|..+++.
T Consensus 12 ~~~~l~~a~~~~g~~~~~~~~~ 33 (277)
T TIGR00768 12 DEKMLKEAAEELGIDYKVVTPP 33 (277)
T ss_pred HHHHHHHHHHHcCCceEEEEhH
Confidence 5678999999999999999864
No 43
>PRK10307 putative glycosyl transferase; Provisional
Probab=21.43 E-value=1.6e+02 Score=28.98 Aligned_cols=36 Identities=11% Similarity=0.134 Sum_probs=26.8
Q ss_pred ccChHHHHHHHHHHHHHHHHCCCeEEEeCCCCCCCC
Q psy945 96 KVGIDEARRQHESFVALLRDLGLDVIELPPDEDLPE 131 (344)
Q Consensus 96 ~vD~d~A~~Qh~~lv~~Lr~~Gv~V~~L~~~~~~PD 131 (344)
.++..-+.....+|++.|.+.|++|+++.+.+..|+
T Consensus 11 ~P~~~g~~~~~~~l~~~L~~~G~~V~vit~~~~~~~ 46 (412)
T PRK10307 11 APELTGIGKYTGEMAEWLAARGHEVRVITAPPYYPQ 46 (412)
T ss_pred CCCccchhhhHHHHHHHHHHCCCeEEEEecCCCCCC
Confidence 344443444567999999999999999998765554
No 44
>PF13239 2TM: 2TM domain
Probab=21.38 E-value=2.3e+02 Score=22.01 Aligned_cols=37 Identities=11% Similarity=0.108 Sum_probs=22.3
Q ss_pred HHHHHHHHHHHHHHHHHhccCCCCCCCCcHHHHHHHHHHHHH
Q psy945 281 AFMVVTVSLLTYVLMYKVGIFPEKAETLSEEHRQEMLQKMID 322 (344)
Q Consensus 281 ~~v~ig~~~~~~~~~r~~~~~~p~P~tlseewk~a~~~~~~~ 322 (344)
.+++-|+.+++..+ +.+ +. +..++++|++++.|+.++
T Consensus 46 ~~~~Wgi~L~~h~~-~vf-~~---~~~~~~~We~rki~k~m~ 82 (83)
T PF13239_consen 46 PILGWGIGLAIHAL-KVF-GS---KYFFGKDWEERKIQKLME 82 (83)
T ss_pred HHHHHHHHHHHHHH-HHH-cc---cccccchHHHHHHHHHhc
Confidence 33444555554433 444 22 266778999999988764
No 45
>PRK01077 cobyrinic acid a,c-diamide synthase; Validated
Probab=21.22 E-value=1.6e+02 Score=30.36 Aligned_cols=36 Identities=28% Similarity=0.461 Sum_probs=25.9
Q ss_pred HHHHHHHHHHCCCeEEEeCCC--CCCCCceecCCceEEECCeE
Q psy945 106 HESFVALLRDLGLDVIELPPD--EDLPESVFIEDTAVIVNGIV 146 (344)
Q Consensus 106 h~~lv~~Lr~~Gv~V~~L~~~--~~~PD~VF~~D~a~v~~~~a 146 (344)
|++-.+.|++.|++|..++|. +.+||. | +++++|+.
T Consensus 260 y~e~~~~L~~~g~~~~~~~~~~~~~l~~~----D-~lilpGG~ 297 (451)
T PRK01077 260 YPENLELLRAAGAELVFFSPLADEALPDC----D-GLYLGGGY 297 (451)
T ss_pred HHHHHHHHHHCCCEEEEeCCcCCCCCCCC----C-EEEeCCCc
Confidence 344467888999999999983 346643 4 77778774
No 46
>PF06757 Ins_allergen_rp: Insect allergen related repeat, nitrile-specifier detoxification; InterPro: IPR010629 This entry represents several insect specific allergen repeats. These repeats are commonly found in various proteins from cockroaches, fruit flies and mosquitos. It has been suggested that the repeat sequences have evolved by duplication of an ancestral amino acid domain, which may have arisen from the mitochondrial energy transfer proteins []. This family exemplifies a case of novel gene evolution. The case in point is the arms-race between plants and their infective insective herbivores in the area of the glucosinolate-myrosinase system. Brassicas have developed the glucosinolate-myrosinase system as chemical defence mechanism against the insects, and consequently the insects have adapted to produce a detoxifying molecule, nitrile-specifier protein (NSP). NSP is present in the Pieris rapae (Cabbage white butterfly). NSP is structurally different from and has no amino acid homology to any known detoxifying enzymes, and it appears to have arisen by a process of domain and gene duplication of a sequence of unknown function that is widespread in insect species and referred to as insect-allergen-repeat protein. Thus this family is found either as a single domain or as a multiple repeat-domain [].
Probab=21.09 E-value=96 Score=27.80 Aligned_cols=64 Identities=22% Similarity=0.323 Sum_probs=41.0
Q ss_pred ccCCccc--------cCHHHHHHHHHHhhhhh--HHHhhc--cCCCchhhhhhheeeeeeeeEEeecccccChHHHHHHH
Q psy945 39 EKGDWKN--------LTLEEKKTLYRASFAQT--FVEMNA--STGEWKGAFGIAFMVVTVSLLTYVLMYKVGIDEARRQH 106 (344)
Q Consensus 39 e~~~w~~--------l~~~ek~~lyr~~f~~t--~~e~~~--~~~ewk~~~~~~~~~~~vs~lin~~M~~vD~d~A~~Qh 106 (344)
..|.|+. |..++-++||.-+-..+ |+++-. .+.||+.+ ++.-.+..++
T Consensus 99 ~~~g~~g~~~di~~~lP~~~l~aL~~~K~~~s~~F~~f~~~l~S~ef~~~--------------------~~~~~~~~~~ 158 (179)
T PF06757_consen 99 RGGGLNGFVDDILALLPRDKLRALYEEKLATSPEFAEFVEALRSPEFQQL--------------------YNALWASPEF 158 (179)
T ss_pred cCCCHHHHHHHHHHHCCHHHHHHHHHHHHHCCHHHHHHHHHHcCHHHHHH--------------------HHHHHcCHHH
Confidence 4556654 45566667776665332 444422 34455554 4455677899
Q ss_pred HHHHHHHHHCCCeEEE
Q psy945 107 ESFVALLRDLGLDVIE 122 (344)
Q Consensus 107 ~~lv~~Lr~~Gv~V~~ 122 (344)
+.+++.|+++|++|..
T Consensus 159 ~~~~~~L~~~Gvdv~~ 174 (179)
T PF06757_consen 159 QRLLNELRENGVDVDY 174 (179)
T ss_pred HHHHHHHHHcCCCHHH
Confidence 9999999999998743
No 47
>PF09011 HMG_box_2: HMG-box domain; InterPro: IPR015101 This domain is predominantly found in Maelstrom homologue proteins. It has no known function. ; GO: 0005634 nucleus; PDB: 2EQZ_A 1V64_A 2CTO_A 1H5P_A 3TQ6_A 3FGH_A 3TMM_A 1J3X_A 2YRQ_A 1AAB_A ....
Probab=20.85 E-value=32 Score=26.10 Aligned_cols=16 Identities=38% Similarity=0.615 Sum_probs=11.9
Q ss_pred CCcCCChHHHHhhchh
Q psy945 243 DWKNLTLEEKKTLYRA 258 (344)
Q Consensus 243 dWk~Ls~eEk~alY~~ 258 (344)
.|++||.+||..|..-
T Consensus 42 ~Wk~Ls~~EK~~Y~~~ 57 (73)
T PF09011_consen 42 RWKSLSEEEKEPYEER 57 (73)
T ss_dssp HHHHS-HHHHHHHHHH
T ss_pred HHHhcCHHHHHHHHHH
Confidence 4999999999887543
No 48
>cd07265 2_3_CTD_N N-terminal domain of catechol 2,3-dioxygenase. This subfamily contains the N-terminal, non-catalytic, domain of catechol 2,3-dioxygenase. Catechol 2,3-dioxygenase (2,3-CTD, catechol:oxygen 2,3-oxidoreductase) catalyzes an extradiol cleavage of catechol to form 2-hydroxymuconate semialdehyde with the insertion of two atoms of oxygen. The enzyme is a homotetramer and contains catalytically essential Fe(II) . The reaction proceeds by an ordered bi-unit mechanism. First, catechol binds to the enzyme, this is then followed by the binding of dioxygen to form a tertiary complex, and then the aromatic ring is cleaved to produce 2-hydroxymuconate semialdehyde. Catechol 2,3-dioxygenase belongs to the type I extradiol dioxygenase family. The subunit comprises the N- and C-terminal domains of similar structure fold, resulting from an ancient gene duplication. The active site is located in a funnel-shaped space of the C-terminal domain. This subfamily represents the N-terminal do
Probab=20.75 E-value=1.6e+02 Score=23.42 Aligned_cols=35 Identities=20% Similarity=0.255 Sum_probs=23.4
Q ss_pred HHHHHHHHHHHHCCCeEEEeCCC--CCCCCceecCCc
Q psy945 104 RQHESFVALLRDLGLDVIELPPD--EDLPESVFIEDT 138 (344)
Q Consensus 104 ~Qh~~lv~~Lr~~Gv~V~~L~~~--~~~PD~VF~~D~ 138 (344)
.+.+++++.|+++|++|...+.. ++...++|.+|.
T Consensus 73 ~dv~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~DP 109 (122)
T cd07265 73 ADLEKLEARLQAYGVAVERIPAGELPGVGRRVRFQLP 109 (122)
T ss_pred HHHHHHHHHHHHCCCcEEEcccCCCCCCceEEEEECC
Confidence 45677888999999998765432 223356666664
No 49
>COG3313 Predicted Fe-S protein [General function prediction only]
Probab=20.68 E-value=55 Score=25.96 Aligned_cols=24 Identities=17% Similarity=0.383 Sum_probs=20.6
Q ss_pred hccCCccccCHHHHHHHHHHhhhh
Q psy945 38 KEKGDWKNLTLEEKKTLYRASFAQ 61 (344)
Q Consensus 38 ke~~~w~~l~~~ek~~lyr~~f~~ 61 (344)
-|-+.|..||.+||.++.+..+..
T Consensus 30 ~Ei~~W~~msd~Er~aVl~~l~qr 53 (74)
T COG3313 30 DEIFNWSSMSDDERRAVLRLLPQR 53 (74)
T ss_pred HHHHHHhhCCHHHHHHHHHHhHHH
Confidence 367899999999999999988833
No 50
>PF15176 LRR19-TM: Leucine-rich repeat family 19 TM domain
Probab=20.28 E-value=1.7e+02 Score=24.53 Aligned_cols=30 Identities=17% Similarity=0.206 Sum_probs=22.7
Q ss_pred cCCcCCCcchhhHHHHHHHHHHHHHHHHHH
Q psy945 265 VEMNAPTGEWKGAFGIAFMVVTVSLLTYVL 294 (344)
Q Consensus 265 pe~~~~~~e~k~vfg~~~v~ig~~~~~~~~ 294 (344)
|..-.....|+..+|.++..+.++++++..
T Consensus 8 ~~~~~~g~sW~~LVGVv~~al~~SlLIala 37 (102)
T PF15176_consen 8 PGPGEGGRSWPFLVGVVVTALVTSLLIALA 37 (102)
T ss_pred CCCCCCCcccHhHHHHHHHHHHHHHHHHHH
Confidence 344445788999999999888888888543
No 51
>PRK02868 hypothetical protein; Provisional
Probab=20.16 E-value=85 Score=30.23 Aligned_cols=23 Identities=35% Similarity=0.538 Sum_probs=21.1
Q ss_pred cccCHHHHHHHHHHhhhhhHHHh
Q psy945 44 KNLTLEEKKTLYRASFAQTFVEM 66 (344)
Q Consensus 44 ~~l~~~ek~~lyr~~f~~t~~e~ 66 (344)
++||+|||..|-|+|+..||+-+
T Consensus 65 ~~ms~eqq~~ll~~s~~~~~s~l 87 (245)
T PRK02868 65 QNMSPEQQQILLKASAASTFSGL 87 (245)
T ss_pred HhCCHHHHHHHHHHHHHHHHHHH
Confidence 68999999999999999999854
Done!