Query psy4570
Match_columns 95
No_of_seqs 103 out of 244
Neff 5.9
Searched_HMMs 46136
Date Fri Aug 16 20:24:56 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy4570.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/4570hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1233|consensus 99.9 8.2E-24 1.8E-28 168.9 6.1 93 2-94 422-514 (613)
2 PF02913 FAD-oxidase_C: FAD li 98.6 1.5E-08 3.3E-13 72.1 1.3 86 1-86 62-154 (248)
3 TIGR00387 glcD glycolate oxida 98.1 5.7E-06 1.2E-10 65.5 4.8 81 1-85 234-319 (413)
4 PRK11230 glycolate oxidase sub 97.6 0.00011 2.4E-09 60.0 5.3 80 1-84 291-375 (499)
5 PLN02805 D-lactate dehydrogena 97.2 0.0008 1.7E-08 55.9 5.4 82 1-84 366-452 (555)
6 COG3286 Uncharacterized protei 93.5 0.37 8.1E-06 35.8 6.5 84 3-86 41-135 (204)
7 PF09840 DUF2067: Uncharacteri 79.3 11 0.00023 27.6 6.5 84 3-86 38-132 (190)
8 cd00248 Mth938-like Mth938-lik 68.8 3 6.5E-05 27.5 1.3 33 29-66 56-88 (109)
9 cd05560 Xcc1710_like Xcc1710_l 60.0 4.9 0.00011 26.6 1.1 28 39-66 61-88 (109)
10 PF14257 DUF4349: Domain of un 52.4 29 0.00062 25.8 4.2 76 10-85 59-143 (262)
11 COG3010 NanE Putative N-acetyl 49.6 30 0.00065 26.3 3.9 71 3-84 156-227 (229)
12 KOG0496|consensus 48.3 19 0.00041 31.2 3.0 64 10-73 142-210 (649)
13 KOG4179|consensus 45.6 8.4 0.00018 32.2 0.5 19 52-71 511-529 (568)
14 cd05125 Mth938_2P1-like Mth938 43.6 15 0.00033 24.6 1.4 34 29-67 58-91 (114)
15 COG3737 Uncharacterized conser 42.4 13 0.00027 25.9 0.9 47 17-68 58-107 (127)
16 COG1071 AcoA Pyruvate/2-oxoglu 39.5 1.1E+02 0.0024 24.6 5.9 73 6-87 229-323 (358)
17 cd05126 Mth938 Mth938 domain. 38.5 18 0.00038 24.3 1.1 48 15-67 47-96 (117)
18 PF04430 DUF498: Protein of un 37.4 14 0.00031 24.1 0.5 29 39-67 62-90 (110)
19 cd07947 DRE_TIM_Re_CS Clostrid 35.1 99 0.0022 23.6 4.9 72 9-81 110-197 (279)
20 TIGR00510 lipA lipoate synthas 34.9 49 0.0011 25.7 3.2 54 1-58 213-275 (302)
21 cd00886 MogA_MoaB MogA_MoaB fa 34.8 71 0.0015 21.8 3.7 38 45-84 21-58 (152)
22 TIGR00177 molyb_syn molybdenum 33.3 83 0.0018 21.2 3.8 39 44-84 27-65 (144)
23 cd05782 DNA_polB_like1_exo Unc 31.9 27 0.00058 25.3 1.3 39 15-59 81-119 (208)
24 PF01218 Coprogen_oxidas: Copr 30.7 39 0.00084 26.6 2.0 77 8-87 133-209 (296)
25 PF02662 FlpD: Methyl-viologen 30.3 54 0.0012 22.0 2.5 48 36-85 71-122 (124)
26 PF00994 MoCF_biosynth: Probab 29.5 73 0.0016 21.2 3.0 39 45-85 18-56 (144)
27 PF00487 FA_desaturase: Fatty 29.0 44 0.00094 22.9 1.9 25 65-93 228-252 (257)
28 cd07045 BMC_CcmK_like Carbon d 28.5 93 0.002 19.6 3.2 27 2-28 40-66 (84)
29 PRK09487 sdhC succinate dehydr 28.2 20 0.00044 24.5 0.1 26 49-74 89-114 (129)
30 PF12738 PTCB-BRCT: twin BRCT 28.0 23 0.0005 20.2 0.3 28 2-32 2-29 (63)
31 cd07963 Anticodon_Ia_Cys Antic 27.8 40 0.00086 22.9 1.5 27 44-70 128-155 (156)
32 PF00665 rve: Integrase core d 27.8 58 0.0013 20.1 2.2 67 11-82 48-119 (120)
33 PF13250 DUF4041: Domain of un 27.4 63 0.0014 19.0 2.1 22 64-85 15-36 (56)
34 PF04304 DUF454: Protein of un 26.8 38 0.00082 20.1 1.1 19 40-58 6-24 (71)
35 PF01766 Birna_VP2: Birnavirus 25.9 6.6 0.00014 32.4 -3.0 48 3-50 121-173 (441)
36 PLN03220 uncharacterized prote 25.7 21 0.00046 24.0 -0.2 16 39-54 50-67 (105)
37 PHA03321 tegument protein VP11 25.1 74 0.0016 27.8 2.9 56 38-94 352-415 (694)
38 PLN02873 coproporphyrinogen-II 25.1 89 0.0019 24.4 3.1 77 8-87 111-187 (274)
39 PF07045 DUF1330: Protein of u 25.0 79 0.0017 18.5 2.3 20 12-31 4-23 (65)
40 PF00676 E1_dh: Dehydrogenase 24.7 3E+02 0.0064 21.0 6.0 75 5-87 192-288 (300)
41 PF11867 DUF3387: Domain of un 23.7 80 0.0017 24.4 2.7 29 66-94 285-313 (335)
42 PRK05330 coproporphyrinogen II 23.3 93 0.002 24.5 3.0 77 8-87 138-214 (300)
43 PF07914 DUF1679: Protein of u 23.2 31 0.00067 27.8 0.3 14 63-76 9-22 (414)
44 PF10979 DUF2786: Protein of u 23.1 1.3E+02 0.0029 16.7 2.8 21 7-27 19-39 (43)
45 COG5470 Uncharacterized conser 23.0 97 0.0021 20.5 2.6 59 10-79 16-79 (96)
46 COG4047 Uncharacterized protei 22.6 1.4E+02 0.003 22.9 3.6 82 9-91 19-140 (243)
47 COG5275 BRCT domain type II [G 22.6 71 0.0015 24.6 2.1 37 1-39 160-208 (276)
48 PF13368 Toprim_C_rpt: Topoiso 22.5 18 0.0004 21.4 -0.9 16 40-56 10-25 (61)
49 COG2723 BglB Beta-glucosidase/ 22.2 61 0.0013 27.0 1.8 47 48-94 61-116 (460)
50 smart00852 MoCF_biosynth Proba 22.1 1.6E+02 0.0036 19.2 3.7 38 44-83 18-55 (135)
51 PRK12378 hypothetical protein; 21.7 2E+02 0.0044 21.6 4.4 51 10-61 101-159 (235)
52 KOG0831|consensus 21.4 46 0.00099 26.7 0.9 22 41-62 143-164 (334)
53 PF02120 Flg_hook: Flagellar h 21.3 80 0.0017 18.9 1.9 25 43-67 54-78 (85)
54 cd00758 MoCF_BD MoCF_BD: molyb 21.3 1.7E+02 0.0038 19.2 3.7 39 44-84 19-57 (133)
55 cd00027 BRCT Breast Cancer Sup 21.0 1.2E+02 0.0026 16.1 2.4 28 2-31 3-30 (72)
56 cd03522 MoeA_like MoeA_like. T 21.0 2.8E+02 0.006 21.7 5.2 67 14-83 137-216 (312)
57 PRK06063 DNA polymerase III su 20.9 1E+02 0.0022 23.9 2.7 26 1-29 236-261 (313)
58 cd05783 DNA_polB_B1_exo DEDDy 20.8 55 0.0012 23.7 1.2 21 39-59 93-113 (204)
59 PRK06195 DNA polymerase III su 20.8 86 0.0019 24.0 2.3 27 1-29 224-250 (309)
60 PF11308 GHL1-3: Glycosyl hydr 20.8 1.1E+02 0.0025 23.8 3.0 34 9-42 145-178 (307)
61 COG4571 OmpT Outer membrane pr 20.3 33 0.00071 27.1 -0.1 35 29-66 177-214 (314)
62 PF10281 Ish1: Putative stress 20.1 1.4E+02 0.003 15.7 2.5 29 47-80 7-35 (38)
No 1
>KOG1233|consensus
Probab=99.89 E-value=8.2e-24 Score=168.85 Aligned_cols=93 Identities=48% Similarity=0.931 Sum_probs=90.8
Q ss_pred EEEEecChHHHHhHHHHHHHHHHHcCCccCCCccchhhhhccchhhhhhHHHHhcccceeeeeccccccCcHHHHHHHHH
Q psy4570 2 ALLVLGDPEDVKNNEDKIYSIAKRYGGIPAGESNGRRGYMLTYIIAYIRDFACDYYFIGDSFETSVPWDKTVLLCINVKK 81 (95)
Q Consensus 2 ~~gfEG~~~~v~~~~~~~~~i~~~~GG~~~G~~~G~~W~~~rF~~PYLRd~l~~~g~~~dT~ETA~~Ws~~~~l~~~V~~ 81 (95)
++.|||+.++|+.+.+++++|+.+++|+..|+++|+.+|.++|.++||||+.+.+|++.|+||||+|||++..||++||+
T Consensus 422 TllfEGdre~V~qhE~~~y~iAekF~G~~aG~~NGqrGY~LTfvIAYiRDlgl~~gvlgESFETSvPWDrv~~LCRnVKe 501 (613)
T KOG1233|consen 422 TLLFEGDREEVDQHEERLYKIAEKFHGVVAGAENGQRGYRLTFVIAYIRDLGLNHGVLGESFETSVPWDRVLSLCRNVKE 501 (613)
T ss_pred hheecccHHHHHHHHHHHHHHHHHhCCccccccccccceEEEEeHHHHHhhcccccchhhcccccCCHHHHHHHHHHHHH
Confidence 68899999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred HHHHHHhhCCCCC
Q psy4570 82 RLTRECTGRWLPV 94 (95)
Q Consensus 82 a~~~a~~~~g~~~ 94 (95)
++.++|+++|+..
T Consensus 502 r~~rEck~~gv~~ 514 (613)
T KOG1233|consen 502 RMKRECKAQGVTH 514 (613)
T ss_pred HHHHHHHhcCCCc
Confidence 9999999999864
No 2
>PF02913 FAD-oxidase_C: FAD linked oxidases, C-terminal domain; InterPro: IPR004113 Some oxygen-dependent oxidoreductases are flavoproteins that contain a covalently bound FAD group which is attached to a histidine via an 8-alpha-(N3-histidyl)-riboflavin linkage. The region around the histidine that binds the FAD group is conserved in these enzymes (see IPR006093 from INTERPRO).; GO: 0003824 catalytic activity, 0050660 flavin adenine dinucleotide binding; PDB: 1WVE_B 1DII_B 1WVF_A 1DIQ_A 2UUU_B 2UUV_A 1W1M_A 1E8H_B 1E0Y_B 1DZN_B ....
Probab=98.59 E-value=1.5e-08 Score=72.09 Aligned_cols=86 Identities=23% Similarity=0.280 Sum_probs=69.0
Q ss_pred CEEEEecChHH-HHhHHH-HHHHHHHHcCCcc----CC-CccchhhhhccchhhhhhHHHHhcccceeeeeccccccCcH
Q psy4570 1 MALLVLGDPED-VKNNED-KIYSIAKRYGGIP----AG-ESNGRRGYMLTYIIAYIRDFACDYYFIGDSFETSVPWDKTV 73 (95)
Q Consensus 1 ~~~gfEG~~~~-v~~~~~-~~~~i~~~~GG~~----~G-~~~G~~W~~~rF~~PYLRd~l~~~g~~~dT~ETA~~Ws~~~ 73 (95)
+++.|+|+++. ++.+.+ .+.++++++|+.. .+ +...+.|..-++..||+++.....+...+++|++++|++++
T Consensus 62 llv~~~g~~~~~~~~~~~~~i~~~~~~~~~~~~~~a~~~~~~~~~W~~R~~~~~~~~~~~~~~~~~~~~~dv~vp~~~l~ 141 (248)
T PF02913_consen 62 LLVEFEGSDEEAVEEQLEAEIEEICKKYGGEDVVIADDEEEQERLWAIRRAIMPYLRDAAGRAGPVWDTEDVAVPPSRLP 141 (248)
T ss_dssp EEEECCCHHHCCHHHHHHHHHHHHHCTCTCCEEEEEHCHHCTSTHHHHHHHHCCGGGCSHCTTEEEEEEEEEESCHHHHH
T ss_pred EEEEECCCcHHHHHHHHHHHHHHHHhhcCCceeEEeCCHHHHHhhhhhhhhhcccccccccccCCceeeeeecccchhhh
Confidence 47899998854 555566 7888888888754 12 23456788777788899999999999999999999999999
Q ss_pred HHHHHHHHHHHHH
Q psy4570 74 LLCINVKKRLTRE 86 (95)
Q Consensus 74 ~l~~~V~~a~~~a 86 (95)
++++.+++.+.+.
T Consensus 142 ~~~~~~~~~~~~~ 154 (248)
T PF02913_consen 142 EFLREIRALLREY 154 (248)
T ss_dssp HHHHHHHHHHHHC
T ss_pred hHHHhhhhhhhhc
Confidence 9999998777654
No 3
>TIGR00387 glcD glycolate oxidase, subunit GlcD. This protein, the glycolate oxidase GlcD subunit, is similar in sequence to that of several D-lactate dehydrogenases, including that of E. coli. The glycolate oxidase has been found to have some D-lactate dehydrogenase activity.
Probab=98.06 E-value=5.7e-06 Score=65.52 Aligned_cols=81 Identities=16% Similarity=0.166 Sum_probs=61.9
Q ss_pred CEEEEecChHHHHhHHHHHHHHHHHcCCccC----C-CccchhhhhccchhhhhhHHHHhcccceeeeeccccccCcHHH
Q psy4570 1 MALLVLGDPEDVKNNEDKIYSIAKRYGGIPA----G-ESNGRRGYMLTYIIAYIRDFACDYYFIGDSFETSVPWDKTVLL 75 (95)
Q Consensus 1 ~~~gfEG~~~~v~~~~~~~~~i~~~~GG~~~----G-~~~G~~W~~~rF~~PYLRd~l~~~g~~~dT~ETA~~Ws~~~~l 75 (95)
+++.|+|+.++++.+.+++.++++++|+... + ++..+.|...++..|++++.. ...+ +.|+|+||++++++
T Consensus 234 l~v~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~W~~r~~~~~~~~~~~-~~~~---~~d~~vp~~~l~~~ 309 (413)
T TIGR00387 234 LLVEIDGVHEAVERDEEKIEQICRKNGAVDVQIAQDEEERALLWAGRRNAFKAASKLS-PLYL---IEDGTVPRSKLPEA 309 (413)
T ss_pred EEEEecCCcHHHHHHHHHHHHHHHhCCCceEEEeCCHHHHHHHHHHHHHhHHHHHhhC-CCcc---eeEEecCHHHHHHH
Confidence 3578999998999999999999999988533 2 234566877777888998742 2222 46799999999999
Q ss_pred HHHHHHHHHH
Q psy4570 76 CINVKKRLTR 85 (95)
Q Consensus 76 ~~~V~~a~~~ 85 (95)
++.+++.+.+
T Consensus 310 ~~~~~~~~~~ 319 (413)
T TIGR00387 310 LRGIADIARK 319 (413)
T ss_pred HHHHHHHHHH
Confidence 9999776644
No 4
>PRK11230 glycolate oxidase subunit GlcD; Provisional
Probab=97.60 E-value=0.00011 Score=59.96 Aligned_cols=80 Identities=9% Similarity=0.067 Sum_probs=57.5
Q ss_pred CEEEEecChHHHHhHHHHHHHHHHHcCCccCC--C---ccchhhhhccchhhhhhHHHHhcccceeeeeccccccCcHHH
Q psy4570 1 MALLVLGDPEDVKNNEDKIYSIAKRYGGIPAG--E---SNGRRGYMLTYIIAYIRDFACDYYFIGDSFETSVPWDKTVLL 75 (95)
Q Consensus 1 ~~~gfEG~~~~v~~~~~~~~~i~~~~GG~~~G--~---~~G~~W~~~rF~~PYLRd~l~~~g~~~dT~ETA~~Ws~~~~l 75 (95)
+++.|+|++++++.+.+++.++++++|+..+. + +..+.|..-+...|+++.. ..+++ ..+++|||++++.+
T Consensus 291 ll~e~~g~~~~v~~~~~~l~~~~~~~g~~~~~~a~~~~~~~~~W~~R~~~~~~~~~~--~~~~~--~~dv~vP~~~l~~~ 366 (499)
T PRK11230 291 LLCELDGVESDVQEDCERVNDILLKAGATDVRLAQDEAERVRFWAGRKNAFPAVGRI--SPDYY--CMDGTIPRRELPGV 366 (499)
T ss_pred EEEEecCCchHHHHHHHHHHHHHHhcCCceEEEeCCHHHHHHHHHHHHhhHHHHHhh--CCCee--EEeecCChHHHHHH
Confidence 36788999999999999999999999975332 1 2234565533456787753 23344 34999999999999
Q ss_pred HHHHHHHHH
Q psy4570 76 CINVKKRLT 84 (95)
Q Consensus 76 ~~~V~~a~~ 84 (95)
++.+++...
T Consensus 367 ~~~~~~~~~ 375 (499)
T PRK11230 367 LEGIARLSQ 375 (499)
T ss_pred HHHHHHHHH
Confidence 999977443
No 5
>PLN02805 D-lactate dehydrogenase [cytochrome]
Probab=97.16 E-value=0.0008 Score=55.88 Aligned_cols=82 Identities=15% Similarity=0.077 Sum_probs=54.2
Q ss_pred CEEEEecChHHHHhHHHHHHHHHHHcCCc--cCCCcc---chhhhhccchhhhhhHHHHhcccceeeeeccccccCcHHH
Q psy4570 1 MALLVLGDPEDVKNNEDKIYSIAKRYGGI--PAGESN---GRRGYMLTYIIAYIRDFACDYYFIGDSFETSVPWDKTVLL 75 (95)
Q Consensus 1 ~~~gfEG~~~~v~~~~~~~~~i~~~~GG~--~~G~~~---G~~W~~~rF~~PYLRd~l~~~g~~~dT~ETA~~Ws~~~~l 75 (95)
+++.|+|++++++.+.+.+.+|++++|+. .+...+ .+.|. .|-.....-- ...-+....+.+++|||++++++
T Consensus 366 Ll~e~~g~~~~~~~~~~~~~~i~~~~g~~~~~~a~~~~e~~~lW~-~R~~~~~~~~-~~~~~~~~~~~DvaVP~s~L~e~ 443 (555)
T PLN02805 366 LMFEFIGTEAYAREQTLIVQKIASKHNGSDFVFAEEPEAKKELWK-IRKEALWACF-AMEPKYEAMITDVCVPLSHLAEL 443 (555)
T ss_pred EEEEEecCcHHHHHHHHHHHHHHHhCCCceEEEeCCHHHHHHHHH-HHHHHHHHHh-hcCCCCceeEEEEEEEHHHHHHH
Confidence 46788999999999999999999999984 233333 34455 3332221110 01113333467899999999999
Q ss_pred HHHHHHHHH
Q psy4570 76 CINVKKRLT 84 (95)
Q Consensus 76 ~~~V~~a~~ 84 (95)
++.+++.+.
T Consensus 444 i~~~~~~~~ 452 (555)
T PLN02805 444 ISRSKKELD 452 (555)
T ss_pred HHHHHHHHH
Confidence 999976654
No 6
>COG3286 Uncharacterized protein conserved in archaea [Function unknown]
Probab=93.46 E-value=0.37 Score=35.81 Aligned_cols=84 Identities=14% Similarity=0.182 Sum_probs=59.1
Q ss_pred EEEecChHHHHhHHHHHHHHHHHcCCccCCCccc-----hhhhhccchhh--hhhHHHHhccccee----eeeccccccC
Q psy4570 3 LLVLGDPEDVKNNEDKIYSIAKRYGGIPAGESNG-----RRGYMLTYIIA--YIRDFACDYYFIGD----SFETSVPWDK 71 (95)
Q Consensus 3 ~gfEG~~~~v~~~~~~~~~i~~~~GG~~~G~~~G-----~~W~~~rF~~P--YLRd~l~~~g~~~d----T~ETA~~Ws~ 71 (95)
+=..|.+++++..-+.+.++.+..-+..-+..-| .--...+=+.| -|+|.|--+|+=++ -++|++||+.
T Consensus 41 V~v~G~~~eireair~irel~~~vr~r~~~~~~~ly~l~~~~~~a~p~Vp~~vl~daLk~~GyrVevr~~~l~T~ap~~e 120 (204)
T COG3286 41 VNVFGTKDEIREAIRAIRELHRRVRRRLYPDRQGLYTLYRIFEEASPNVPPDVLIDALKLLGYRVEVRGGELKTNAPWSE 120 (204)
T ss_pred EEEecchHHHHHHHHHHHHHHHHHHhhhccCccceEeeHhHHHhhcCCCCHHHHHHHHHhCCceEEeeCceeecCCCHHH
Confidence 3457888899999888888888776644333222 11112222222 89999999999775 4899999999
Q ss_pred cHHHHHHHHHHHHHH
Q psy4570 72 TVLLCINVKKRLTRE 86 (95)
Q Consensus 72 ~~~l~~~V~~a~~~a 86 (95)
+..+.+.+-+..+++
T Consensus 121 v~E~vreLse~~~E~ 135 (204)
T COG3286 121 VVELVRELSEVYREA 135 (204)
T ss_pred HHHHHHHHHHHHHHH
Confidence 999998887766654
No 7
>PF09840 DUF2067: Uncharacterized protein conserved in archaea (DUF2067); InterPro: IPR019202 This family of archaeal proteins, have no known function.
Probab=79.29 E-value=11 Score=27.61 Aligned_cols=84 Identities=12% Similarity=0.085 Sum_probs=59.4
Q ss_pred EEEecChHHHHhHHHHHHHHHHHcCCccCCCccc---hhhhh----ccchhhhhhHHHHhcccce----eeeeccccccC
Q psy4570 3 LLVLGDPEDVKNNEDKIYSIAKRYGGIPAGESNG---RRGYM----LTYIIAYIRDFACDYYFIG----DSFETSVPWDK 71 (95)
Q Consensus 3 ~gfEG~~~~v~~~~~~~~~i~~~~GG~~~G~~~G---~~W~~----~rF~~PYLRd~l~~~g~~~----dT~ETA~~Ws~ 71 (95)
+=..|.+.+++...+.+.++.+...+.+-+.+.- -.... ..|..|-|-|.|--.|+=+ +.+.|+++|+.
T Consensus 38 I~i~G~~~eike~~~~Ik~~~~~vr~k~~~~g~~~y~l~~i~r~a~~~vp~d~L~~~L~~~G~~ae~~~~~i~T~a~~ee 117 (190)
T PF09840_consen 38 IEIQGYEKEIKEAIRRIKELVRRVRSKYNKRGLYRYSLDDIFREAGYPVPPDLLVDALKLLGYKAEYREDVIKTDAPLEE 117 (190)
T ss_pred EEEecChHHHHHHHHHHHHHHHHHHHHhccCCceEEcHHHHHHHcCCCCCHHHHHHHHHhCCCeeEEeCCeEEecCCHHH
Confidence 3356888888888888887777655532211110 01111 5667779999999999866 57899999999
Q ss_pred cHHHHHHHHHHHHHH
Q psy4570 72 TVLLCINVKKRLTRE 86 (95)
Q Consensus 72 ~~~l~~~V~~a~~~a 86 (95)
+.++.+.+-++..+.
T Consensus 118 v~~l~~~Lse~~~e~ 132 (190)
T PF09840_consen 118 VVELAERLSEIYKEL 132 (190)
T ss_pred HHHHHHHHHHHHHHH
Confidence 999999987776643
No 8
>cd00248 Mth938-like Mth938-like domain. The members of this family include: Mth938, 2P1, Xcr35, Rpa2829, and several uncharacterized sequences. Mth938 is a hypothetical protein encoded by the Methanobacterium thermoautotrophicum (Mth) genome. This protein crystallizes as a dimer, although it is monomeric in solution, with one disulfide bond in each monomer. 2P1 is a partially characterized nuclear protein which is homologous to E3-3 from rat and known to be alternately spliced. Xcr35 and Rpa2829 are hypothetical proteins of unknown function from the Xanthomonas campestris and Rhodopseudomonas palustris genomes, respectively, for which the crystal structures have been determined.
Probab=68.78 E-value=3 Score=27.53 Aligned_cols=33 Identities=12% Similarity=0.094 Sum_probs=27.4
Q ss_pred ccCCCccchhhhhccchhhhhhHHHHhcccceeeeecc
Q psy4570 29 IPAGESNGRRGYMLTYIIAYIRDFACDYYFIGDSFETS 66 (95)
Q Consensus 29 ~~~G~~~G~~W~~~rF~~PYLRd~l~~~g~~~dT~ETA 66 (95)
+.+|+ +...+|-.|-+++.|.++|+-+|.+.|.
T Consensus 56 liiGT-----G~~~~~~~~~~~~~l~~~gI~vE~m~T~ 88 (109)
T cd00248 56 LLIGT-----GAEIAFLPRALRAALRAAGIGVEVMSTG 88 (109)
T ss_pred EEEcC-----CCCCCcCCHHHHHHHHHcCCeEEEeCcH
Confidence 34576 4455788899999999999999999997
No 9
>cd05560 Xcc1710_like Xcc1710_like family, specific to proteobacteria. Xcc1710 is a hypothetical protein from Xanthomonas campestris pv. campestris str. ATCC 33913, similar to Mth938, a hypothetical protein encoded by the Methanobacterium thermoautotrophicum (Mth) genome. Their three-dimensional structures have been determined, but their functions are unknown.
Probab=59.96 E-value=4.9 Score=26.55 Aligned_cols=28 Identities=7% Similarity=-0.140 Sum_probs=24.8
Q ss_pred hhhccchhhhhhHHHHhcccceeeeecc
Q psy4570 39 GYMLTYIIAYIRDFACDYYFIGDSFETS 66 (95)
Q Consensus 39 W~~~rF~~PYLRd~l~~~g~~~dT~ETA 66 (95)
+...+|-.|-+++.|.++|+.+|.+.|.
T Consensus 61 G~~~~~~~~~~~~~l~~~gi~vE~m~T~ 88 (109)
T cd05560 61 GERQRFPPPALLAPLLARGIGVEVMDTQ 88 (109)
T ss_pred CCCCCcCCHHHHHHHHHcCCeEEEECHH
Confidence 5567787899999999999999999997
No 10
>PF14257 DUF4349: Domain of unknown function (DUF4349)
Probab=52.40 E-value=29 Score=25.77 Aligned_cols=76 Identities=20% Similarity=0.318 Sum_probs=53.2
Q ss_pred HHHHhHHHHHHHHHHHcCCccCCCc-------cchhhhhccchhh--hhhHHHHhcccceeeeeccccccCcHHHHHHHH
Q psy4570 10 EDVKNNEDKIYSIAKRYGGIPAGES-------NGRRGYMLTYIIA--YIRDFACDYYFIGDSFETSVPWDKTVLLCINVK 80 (95)
Q Consensus 10 ~~v~~~~~~~~~i~~~~GG~~~G~~-------~G~~W~~~rF~~P--YLRd~l~~~g~~~dT~ETA~~Ws~~~~l~~~V~ 80 (95)
++++...+++.++++++||...-.. .+.......+++| .+.+++.+..=+.....-.+.=.++..-|..+.
T Consensus 59 ~d~~~a~~~i~~~~~~~gG~i~~~~~~~~~~~~~~~~~~ltiRVP~~~~~~~l~~l~~~g~v~~~~~~~~DvT~~y~D~~ 138 (262)
T PF14257_consen 59 KDVEKAVKKIENLVESYGGYIESSSSSSSGGSDDERSASLTIRVPADKFDSFLDELSELGKVTSRNISSEDVTEQYVDLE 138 (262)
T ss_pred CCHHHHHHHHHHHHHHcCCEEEEEeeecccCCCCcceEEEEEEECHHHHHHHHHHHhccCceeeeeccccchHHHHHHHH
Confidence 4577888999999999999655332 2355667777888 777777776665555666665577877777766
Q ss_pred HHHHH
Q psy4570 81 KRLTR 85 (95)
Q Consensus 81 ~a~~~ 85 (95)
.+++.
T Consensus 139 arl~~ 143 (262)
T PF14257_consen 139 ARLKN 143 (262)
T ss_pred HHHHH
Confidence 66554
No 11
>COG3010 NanE Putative N-acetylmannosamine-6-phosphate epimerase [Carbohydrate transport and metabolism]
Probab=49.63 E-value=30 Score=26.27 Aligned_cols=71 Identities=7% Similarity=-0.066 Sum_probs=50.3
Q ss_pred EEEec-ChHHHHhHHHHHHHHHHHcCCccCCCccchhhhhccchhhhhhHHHHhcccceeeeeccccccCcHHHHHHHHH
Q psy4570 3 LLVLG-DPEDVKNNEDKIYSIAKRYGGIPAGESNGRRGYMLTYIIAYIRDFACDYYFIGDSFETSVPWDKTVLLCINVKK 81 (95)
Q Consensus 3 ~gfEG-~~~~v~~~~~~~~~i~~~~GG~~~G~~~G~~W~~~rF~~PYLRd~l~~~g~~~dT~ETA~~Ws~~~~l~~~V~~ 81 (95)
-||++ ++.+.+.-..-+.++++ .|-..+.+ +||+.|-+-...+++|..+=|.=+|.| +.....+...+
T Consensus 156 sGYT~~~~~~~~pDf~lvk~l~~-~~~~vIAE--------Gr~~tP~~Ak~a~~~Ga~aVvVGsAIT--Rp~~It~~F~~ 224 (229)
T COG3010 156 SGYTGYTEKPTEPDFQLVKQLSD-AGCRVIAE--------GRYNTPEQAKKAIEIGADAVVVGSAIT--RPEEITQWFVD 224 (229)
T ss_pred ccccCCCCCCCCCcHHHHHHHHh-CCCeEEee--------CCCCCHHHHHHHHHhCCeEEEECcccC--CHHHHHHHHHH
Confidence 46777 44455555555556554 66555554 799999999999999999999999987 56666555555
Q ss_pred HHH
Q psy4570 82 RLT 84 (95)
Q Consensus 82 a~~ 84 (95)
+++
T Consensus 225 ~ik 227 (229)
T COG3010 225 AIK 227 (229)
T ss_pred HHh
Confidence 544
No 12
>KOG0496|consensus
Probab=48.25 E-value=19 Score=31.17 Aligned_cols=64 Identities=20% Similarity=0.210 Sum_probs=46.3
Q ss_pred HHHHhHHHHHHH----HHHHcCC-ccCCCccchhhhhccchhhhhhHHHHhcccceeeeeccccccCcH
Q psy4570 10 EDVKNNEDKIYS----IAKRYGG-IPAGESNGRRGYMLTYIIAYIRDFACDYYFIGDSFETSVPWDKTV 73 (95)
Q Consensus 10 ~~v~~~~~~~~~----i~~~~GG-~~~G~~~G~~W~~~rF~~PYLRd~l~~~g~~~dT~ETA~~Ws~~~ 73 (95)
.+++...+.+.. +..+.|| +.+.+---|-..-.|+-.++-|..+.+...++.++.|.++|=..-
T Consensus 142 ~~~~~~~~~iv~~mk~L~~~qGGPIIl~QIENEYG~~~~~~~~~~k~y~~w~a~m~~~l~~gvpw~mCk 210 (649)
T KOG0496|consen 142 AEMERWTTKIVPMMKKLFASQGGPIILVQIENEYGNYLRALGAEGKSYLKWAAVLATSLGTGVPWVMCK 210 (649)
T ss_pred HHHHHHHHHHHHHHHHHHhhcCCCEEEEEeechhhHHHHHHHHHHHHhhccceEEEEecCCCCceeEec
Confidence 446666666665 7778888 444442223336677778999999999999999999999996543
No 13
>KOG4179|consensus
Probab=45.60 E-value=8.4 Score=32.20 Aligned_cols=19 Identities=37% Similarity=0.760 Sum_probs=15.5
Q ss_pred HHHhcccceeeeeccccccC
Q psy4570 52 FACDYYFIGDSFETSVPWDK 71 (95)
Q Consensus 52 ~l~~~g~~~dT~ETA~~Ws~ 71 (95)
+--|-|++.|| |||+.|||
T Consensus 511 ytg~~gylsdt-ets~~w~~ 529 (568)
T KOG4179|consen 511 YTGDSGYLSDT-ETSQQWDN 529 (568)
T ss_pred ecCCCccccCc-cccccccc
Confidence 34577899988 99999995
No 14
>cd05125 Mth938_2P1-like Mth938_2P1-like domain. This model contains sequences that are similar to 2P1, a partially characterized nuclear protein, which is homologous to E3-3 from rat and known to be alternatively spliced. Its function is unknown. This family is part of the Mth938 family, for which structures, but no functional data are available.
Probab=43.63 E-value=15 Score=24.58 Aligned_cols=34 Identities=12% Similarity=0.093 Sum_probs=27.8
Q ss_pred ccCCCccchhhhhccchhhhhhHHHHhcccceeeeeccc
Q psy4570 29 IPAGESNGRRGYMLTYIIAYIRDFACDYYFIGDSFETSV 67 (95)
Q Consensus 29 ~~~G~~~G~~W~~~rF~~PYLRd~l~~~g~~~dT~ETA~ 67 (95)
+.+|. +...+|-.|-+++.|.++|+-+|.+.|..
T Consensus 58 liiGt-----G~~~~~~~~~~~~~l~~~gi~vevm~T~~ 91 (114)
T cd05125 58 LVIGT-----GRKSRPLSPELRKYFKKLGIAVEVVDTRN 91 (114)
T ss_pred EEEcc-----CCCCCcCCHHHHHHHHHcCCEEEEECHHH
Confidence 55677 33367888999999999999999999964
No 15
>COG3737 Uncharacterized conserved protein [Function unknown]
Probab=42.37 E-value=13 Score=25.86 Aligned_cols=47 Identities=11% Similarity=0.065 Sum_probs=35.8
Q ss_pred HHHHHHHHHcCC---ccCCCccchhhhhccchhhhhhHHHHhcccceeeeecccc
Q psy4570 17 DKIYSIAKRYGG---IPAGESNGRRGYMLTYIIAYIRDFACDYYFIGDSFETSVP 68 (95)
Q Consensus 17 ~~~~~i~~~~GG---~~~G~~~G~~W~~~rF~~PYLRd~l~~~g~~~dT~ETA~~ 68 (95)
....++++...- +.+|+ +..+||--|-||..|...|+.+|.+-|...
T Consensus 58 e~f~~vl~~a~~~EilliGT-----G~~~rf~p~~l~aal~~~gIsve~Mst~AA 107 (127)
T COG3737 58 EDFERVLAEAPDVEILLIGT-----GARLRFPPPKLRAALKAAGISVEPMSTGAA 107 (127)
T ss_pred HHHHHHHhcCCCceEEEEec-----CccccCCCHHHHHHHHHcCCccccccchhh
Confidence 344455555553 34566 678999889999999999999999999765
No 16
>COG1071 AcoA Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, alpha subunit [Energy production and conversion]
Probab=39.46 E-value=1.1e+02 Score=24.63 Aligned_cols=73 Identities=11% Similarity=0.086 Sum_probs=49.2
Q ss_pred ecCh-HHHHhHHHHHHHHHHHcCCccC--------C----Ccc---------chhhhhccchhhhhhHHHHhcccceeee
Q psy4570 6 LGDP-EDVKNNEDKIYSIAKRYGGIPA--------G----ESN---------GRRGYMLTYIIAYIRDFACDYYFIGDSF 63 (95)
Q Consensus 6 EG~~-~~v~~~~~~~~~i~~~~GG~~~--------G----~~~---------G~~W~~~rF~~PYLRd~l~~~g~~~dT~ 63 (95)
.|.+ -.|-...+.|.+.+++.+|=.| | +++ -+.|.. |=-++-||++|.+.|++.|
T Consensus 229 DG~D~~avy~~~~~A~e~AR~g~GPtLIE~~tYR~~~HS~sDd~~~YRskeE~~~~~~-~DPi~r~~~~L~~~g~~se-- 305 (358)
T COG1071 229 DGNDVLAVYEAAKEAVERARAGEGPTLIEAVTYRYGGHSTSDDPSKYRSKEEVEEWKK-RDPIVRLRKYLIEAGILSE-- 305 (358)
T ss_pred CCcCHHHHHHHHHHHHHHHHcCCCCEEEEEEEeecCCCCCCCCccccCCHHHHHHHhc-cChHHHHHHHHHHcCCCCH--
Confidence 4544 3477778999999998888322 1 122 145655 3334489999999999877
Q ss_pred eccccccCcHHHHHHHHHHHHHHH
Q psy4570 64 ETSVPWDKTVLLCINVKKRLTREC 87 (95)
Q Consensus 64 ETA~~Ws~~~~l~~~V~~a~~~a~ 87 (95)
..+.++.+.+++.|.+++
T Consensus 306 ------e~~~~i~~e~~~~V~ea~ 323 (358)
T COG1071 306 ------EELEAIEAEAKAEVDEAV 323 (358)
T ss_pred ------HHHHHHHHHHHHHHHHHH
Confidence 566777777777777643
No 17
>cd05126 Mth938 Mth938 domain. Mth938 is a hypothetical protein encoded by the Methanobacterium thermoautotrophicum (Mth) genome. This protein crystallizes as a dimer, although it is monomeric in solution, with one disulfide bond in each monomer. The function of the protein has not been determined.
Probab=38.48 E-value=18 Score=24.32 Aligned_cols=48 Identities=8% Similarity=-0.109 Sum_probs=33.7
Q ss_pred HHHHHHHHHHHcCC-ccCCCccchhhhhccc-hhhhhhHHHHhcccceeeeeccc
Q psy4570 15 NEDKIYSIAKRYGG-IPAGESNGRRGYMLTY-IIAYIRDFACDYYFIGDSFETSV 67 (95)
Q Consensus 15 ~~~~~~~i~~~~GG-~~~G~~~G~~W~~~rF-~~PYLRd~l~~~g~~~dT~ETA~ 67 (95)
......+++...-= +.+|+ +..++| --|-+++.|.+.|+-++.+.|..
T Consensus 47 ~~~~l~~ll~~~peivliGT-----G~~~~~~~~~~~~~~l~~~Gi~ve~m~T~a 96 (117)
T cd05126 47 QPEELEELLEEGVEVIVIGT-----GQSGALKVPPETVEKLEKRGVEVLVLPTEE 96 (117)
T ss_pred CHHHHHHHHhcCCCEEEEcC-----CCCccccCCHHHHHHHHhcCCEEEEcChHH
Confidence 34455566643211 45577 567774 57889999999999999999864
No 18
>PF04430 DUF498: Protein of unknown function (DUF498/DUF598); InterPro: IPR007523 NADH:ubiquinone oxidoreductase (complex I) (1.6.5.3 from EC) is a respiratory-chain enzyme that catalyses the transfer of two electrons from NADH to ubiquinone in a reaction that is associated with proton translocation across the membrane (NADH + ubiquinone = NAD+ + ubiquinol) []. Complex I is a major source of reactive oxygen species (ROS) that are predominantly formed by electron transfer from FMNH(2). Complex I is found in bacteria, cyanobacteria (as a NADH-plastoquinone oxidoreductase), archaea [], mitochondira, and in the hydrogenosome, a mitochondria-derived organelle. In general, the bacterial complex consists of 14 different subunits, while the mitochondrial complex contains homologues to these subunits in addition to approximately 31 additional proteins []. Mitochondrial complex I, which is located in the inner mitochondrial membrane, is the largest multimeric respiratory enzyme in the mitochondria, consisting of more than 40 subunits, one FMN co-factor and eight FeS clusters []. The assembly of mitochondrial complex I is an intricate process that requires the cooperation of the nuclear and mitochondrial genomes [, ]. Mitochondrial complex I can cycle between active and deactive forms that can be distinguished by the reactivity towards divalent cations and thiol-reactive agents. All redox prosthetic groups reside in the peripheral arm of the L-shaped structure. The NADH oxidation domain harbouring the FMN cofactor is connected via a chain of iron-sulphur clusters to the ubiquinone reduction site that is located in a large pocket formed by the PSST and 49kDa subunits of complex I []. This is entry represents an essential factor for the assembly of mitochondrial NADH:ubiquinone oxidoreductase complex (complex I) []. The crystal structure of this protein revealed a 3-layer beta+alpha/beta/alpha topology [].; PDB: 2K2E_A 2Q4Q_B 2AB1_A 2FVT_A 2CYJ_A 1IHN_B 2GM2_A 3CPK_A 2FI9_A.
Probab=37.38 E-value=14 Score=24.08 Aligned_cols=29 Identities=10% Similarity=0.132 Sum_probs=22.2
Q ss_pred hhhccchhhhhhHHHHhcccceeeeeccc
Q psy4570 39 GYMLTYIIAYIRDFACDYYFIGDSFETSV 67 (95)
Q Consensus 39 W~~~rF~~PYLRd~l~~~g~~~dT~ETA~ 67 (95)
+....|-.|-+++.+.++|+-+|.+-|.-
T Consensus 62 G~~~~~~~~~~~~~l~~~GI~ve~m~T~~ 90 (110)
T PF04430_consen 62 GKRQLFLPPELREYLRKKGIGVEVMDTPA 90 (110)
T ss_dssp TTS-SECTHHHHHHHHTTT-EEEEE-HHH
T ss_pred CCccccCCHHHHHHHHHcCCeEEEECHHH
Confidence 34478889999999999999999998853
No 19
>cd07947 DRE_TIM_Re_CS Clostridium kluyveri Re-citrate synthase and related proteins, catalytic TIM barrel domain. Re-citrate synthase (Re-CS) is a Clostridium kluyveri enzyme that converts acetyl-CoA and oxaloacetate to citrate. In most organisms, this reaction is catalyzed by Si-citrate synthase which is Si-face stereospecific with respect to C-2 of oxaloacetate, and phylogenetically unrelated to Re-citrate synthase. Re-citrate synthase is also found in a few other strictly anaerobic organisms. This family belongs to the DRE-TIM metallolyase superfamily. DRE-TIM metallolyases include 2-isopropylmalate synthase (IPMS), alpha-isopropylmalate synthase (LeuA), 3-hydroxy-3-methylglutaryl-CoA lyase, homocitrate synthase, citramalate synthase, 4-hydroxy-2-oxovalerate aldolase, re-citrate synthase, transcarboxylase 5S, pyruvate carboxylase, AksA, and FrbC. These members all share a conserved triose-phosphate isomerase (TIM) barrel domain consisting of a core beta(8)-alpha(8) motif with
Probab=35.06 E-value=99 Score=23.58 Aligned_cols=72 Identities=11% Similarity=0.075 Sum_probs=40.5
Q ss_pred hHHHHhHHHHHHHHHHHcCCccCCCccchhhhhc--cchhhhhhHHHHh---ccc-----ceeeeecccccc------Cc
Q psy4570 9 PEDVKNNEDKIYSIAKRYGGIPAGESNGRRGYML--TYIIAYIRDFACD---YYF-----IGDSFETSVPWD------KT 72 (95)
Q Consensus 9 ~~~v~~~~~~~~~i~~~~GG~~~G~~~G~~W~~~--rF~~PYLRd~l~~---~g~-----~~dT~ETA~~Ws------~~ 72 (95)
.+++-...+.+.+.++++|-. +--.+...++.. -+..+|+.+.+-. +|. ++||...++|++ ++
T Consensus 110 ~~e~l~~~~~~v~~a~~~g~~-v~~~~ed~~r~d~~~~v~~~~~~~~~~~~~~G~~~~i~l~DTvG~a~P~~~~~~p~~v 188 (279)
T cd07947 110 REEAMEKYLEIVEEALDHGIK-PRCHLEDITRADIYGFVLPFVNKLMKLSKESGIPVKIRLCDTLGYGVPYPGASLPRSV 188 (279)
T ss_pred HHHHHHHHHHHHHHHHHCCCe-EEEEEEcccCCCcccchHHHHHHHHHHHHHCCCCEEEEeccCCCcCCccccccchHHH
Confidence 344555566677778888642 211122233321 2334577665542 564 469999999995 45
Q ss_pred HHHHHHHHH
Q psy4570 73 VLLCINVKK 81 (95)
Q Consensus 73 ~~l~~~V~~ 81 (95)
..+.+.+++
T Consensus 189 ~~l~~~l~~ 197 (279)
T cd07947 189 PKIIYGLRK 197 (279)
T ss_pred HHHHHHHHH
Confidence 555555544
No 20
>TIGR00510 lipA lipoate synthase. The family shows strong sequence conservation.
Probab=34.89 E-value=49 Score=25.67 Aligned_cols=54 Identities=17% Similarity=0.239 Sum_probs=35.2
Q ss_pred CEEEEecChHHHHhHHHHHHHHHHHcCCc--cCCCc--c-chhhhhccchhh----hhhHHHHhccc
Q psy4570 1 MALLVLGDPEDVKNNEDKIYSIAKRYGGI--PAGES--N-GRRGYMLTYIIA----YIRDFACDYYF 58 (95)
Q Consensus 1 ~~~gfEG~~~~v~~~~~~~~~i~~~~GG~--~~G~~--~-G~~W~~~rF~~P----YLRd~l~~~g~ 58 (95)
+++|| |..+ ....+....+++.|-. .+|+. | -+++--.||.-| ++|+..+++|+
T Consensus 213 iIVGl-GETe---ee~~etl~~Lrelg~d~v~igqYl~p~~~~~~v~~~~~p~~f~~~~~~a~~~gf 275 (302)
T TIGR00510 213 IMVGL-GETN---EEIKQTLKDLRDHGVTMVTLGQYLRPSRRHLPVKRYVSPEEFDYYRSVALEMGF 275 (302)
T ss_pred EEEEC-CCCH---HHHHHHHHHHHhcCCCEEEeecccCCCCCCCccccCCCHHHHHHHHHHHHHcCC
Confidence 47899 7443 3445555666666663 34553 3 356777788777 78888888887
No 21
>cd00886 MogA_MoaB MogA_MoaB family. Members of this family are involved in biosynthesis of the molybdenum cofactor (MoCF) an essential cofactor of a diverse group of redox enzymes. MoCF biosynthesis is an evolutionarily conserved pathway present in eubacteria, archaea, and eukaryotes. MoCF contains a tricyclic pyranopterin, termed molybdopterin (MPT). MogA, together with MoeA, is responsible for the metal incorporation into MPT, the third step in MoCF biosynthesis. The plant homolog Cnx1 is a MoeA-MogA fusion protein. The mammalian homolog gephyrin is a MogA-MoeA fusion protein, that plays a critical role in postsynaptic anchoring of inhibitory glycine receptors and major GABAa receptor subtypes. In contrast, MoaB shows high similarity to MogA, but little is known about its physiological role. All well studied members of this family form highly stable trimers.
Probab=34.83 E-value=71 Score=21.76 Aligned_cols=38 Identities=3% Similarity=-0.162 Sum_probs=29.2
Q ss_pred hhhhhhHHHHhcccceeeeeccccccCcHHHHHHHHHHHH
Q psy4570 45 IIAYIRDFACDYYFIGDSFETSVPWDKTVLLCINVKKRLT 84 (95)
Q Consensus 45 ~~PYLRd~l~~~g~~~dT~ETA~~Ws~~~~l~~~V~~a~~ 84 (95)
+.|+|...|.++|+ +..+..+--|+...+.+.++++++
T Consensus 21 n~~~l~~~l~~~G~--~v~~~~~v~Dd~~~i~~~l~~~~~ 58 (152)
T cd00886 21 SGPALVELLEEAGH--EVVAYEIVPDDKDEIREALIEWAD 58 (152)
T ss_pred hHHHHHHHHHHcCC--eeeeEEEcCCCHHHHHHHHHHHHh
Confidence 57899999999995 666676677888887777766554
No 22
>TIGR00177 molyb_syn molybdenum cofactor synthesis domain. The Drosophila protein cinnamon, the Arabidopsis protein cnx1, and rat protein gephyrin each have one domain like MoeA and one like MoaB and Mog. These domains are, however, distantly related to each other, as captured by this model. Gephyrin is unusual in that it seems to be a tubulin-binding neuroprotein involved in the clustering of both blycine receptors and GABA receptors, rather than a protein of molybdenum cofactor biosynthesis.
Probab=33.25 E-value=83 Score=21.21 Aligned_cols=39 Identities=5% Similarity=-0.160 Sum_probs=29.0
Q ss_pred chhhhhhHHHHhcccceeeeeccccccCcHHHHHHHHHHHH
Q psy4570 44 YIIAYIRDFACDYYFIGDSFETSVPWDKTVLLCINVKKRLT 84 (95)
Q Consensus 44 F~~PYLRd~l~~~g~~~dT~ETA~~Ws~~~~l~~~V~~a~~ 84 (95)
-+.|+|+..|.++| ++..+..+--|+...+.+.+++++.
T Consensus 27 ~n~~~l~~~l~~~G--~~v~~~~~v~Dd~~~i~~~l~~~~~ 65 (144)
T TIGR00177 27 SNGPLLAALLEEAG--FNVSRLGIVPDDPEEIREILRKAVD 65 (144)
T ss_pred CcHHHHHHHHHHCC--CeEEEEeecCCCHHHHHHHHHHHHh
Confidence 36899999999998 4555566666888888777766543
No 23
>cd05782 DNA_polB_like1_exo Uncharacterized bacterial subgroup of the DEDDy 3'-5' exonuclease domain of family-B DNA polymerases. A subfamily of the 3'-5' exonuclease domain of family-B DNA polymerases. This subfamily is composed of uncharacterized bacterial family-B DNA polymerases. Family-B DNA polymerases contain an N-terminal DEDDy DnaQ-like exonuclease domain in the same polypeptide chain as the polymerase domain, similar to family-A DNA polymerases. This exonuclease domain contains three sequence motifs termed ExoI, ExoII and ExoIII, with a specific YX(3)D pattern at ExoIII. These motifs are involved in metal binding and catalysis. The exonuclease domain of family-B DNA polymerases has a fundamental role in proofreading activity. It contains a beta hairpin structure that plays an important role in active site switching in the event of a nucleotide misincorporation. Family-B DNA polymerases are predominantly involved in DNA replication and DNA repair.
Probab=31.88 E-value=27 Score=25.31 Aligned_cols=39 Identities=3% Similarity=-0.195 Sum_probs=26.7
Q ss_pred HHHHHHHHHHHcCCccCCCccchhhhhccchhhhhhHHHHhcccc
Q psy4570 15 NEDKIYSIAKRYGGIPAGESNGRRGYMLTYIIAYIRDFACDYYFI 59 (95)
Q Consensus 15 ~~~~~~~i~~~~GG~~~G~~~G~~W~~~rF~~PYLRd~l~~~g~~ 59 (95)
-.++..++++++.=+.+ .|=-..|+.|||+..++.+|+-
T Consensus 81 lL~~F~~~i~~~~p~lv------~yNg~~FDlP~L~~Ra~~~gi~ 119 (208)
T cd05782 81 LLEDFFQLIEKKNPRLV------SFNGRGFDLPVLHLRALIHGVS 119 (208)
T ss_pred HHHHHHHHHHHhCCEEE------ecCCCcCCHHHHHHHHHHhCCC
Confidence 34555566666633222 3555689999999999999983
No 24
>PF01218 Coprogen_oxidas: Coproporphyrinogen III oxidase; InterPro: IPR001260 Coprogen oxidase (i.e. coproporphyrin III oxidase or coproporphyrinogenase) catalyses the oxidative decarboxylation of coproporphyrinogen III to proto-porhyrinogen IX in the haem and chlorophyll biosynthetic pathways [, ]. The protein is a homodimer containing two internally bound iron atoms per molecule of native protein []. The enzyme is active in the presence of molecular oxygen that acts as an electron acceptor. The enzyme is widely distributed having been found in a variety of eukaryotic and prokaryotic sources.; GO: 0004109 coproporphyrinogen oxidase activity, 0006779 porphyrin-containing compound biosynthetic process, 0055114 oxidation-reduction process; PDB: 3EJO_B 3DWR_B 1VJU_B 2QT8_B 3DWS_A 2AEX_A 1TKL_B 1TK1_A 1TLB_Q 1TXN_A ....
Probab=30.70 E-value=39 Score=26.60 Aligned_cols=77 Identities=10% Similarity=-0.028 Sum_probs=57.8
Q ss_pred ChHHHHhHHHHHHHHHHHcCCccCCCccchhhhhccchhhhhhHHHHhcccceeeeeccccccCcHHHHHHHHHHHHHHH
Q psy4570 8 DPEDVKNNEDKIYSIAKRYGGIPAGESNGRRGYMLTYIIAYIRDFACDYYFIGDSFETSVPWDKTVLLCINVKKRLTREC 87 (95)
Q Consensus 8 ~~~~v~~~~~~~~~i~~~~GG~~~G~~~G~~W~~~rF~~PYLRd~l~~~g~~~dT~ETA~~Ws~~~~l~~~V~~a~~~a~ 87 (95)
.++++..-.+..+++|.+|+...= .-.++|-+.-|.+|-..+.-=-=|+..|.|.| -+|++.-.+.++|-+++..++
T Consensus 133 ~~eD~~~FH~~~K~~cd~~~~~~Y--p~fK~wcD~YFyipHR~E~RGiGGIFfD~l~~-~~~e~~f~f~~~vg~~fl~aY 209 (296)
T PF01218_consen 133 FEEDAKHFHQTLKEACDPHDPSYY--PRFKKWCDEYFYIPHRNEHRGIGGIFFDDLNT-GDFEKDFAFVKDVGEAFLPAY 209 (296)
T ss_dssp -HHHHHHHHHHHHHHHGGG-TTHH--HHHHHHHHHHSEEGGGTEESSSEEEEEEEE-S-S-HHHHHHHHHHHHHHHHHHH
T ss_pred CHHHHHHHHHHHHHHHHhcChHHH--HHHHhhchhhcchhhcCCCcCCceeeeccccc-cchhhHHHHHHHHHHHHHHHH
Confidence 346788888999999999975321 12589999999999887777777899999987 667777788888888777764
No 25
>PF02662 FlpD: Methyl-viologen-reducing hydrogenase, delta subunit; InterPro: IPR003813 Methyl-viologen-reducing hydrogenase (MVH) is one of the enzymes involved in methanogenesis and coded in the mth-flp-mvh-mrt cluster of methane genes in Methanothermobacter thermautotrophicus (Methanobacterium thermoformicicum) []. No specific functions have been assigned to the delta subunit.; GO: 0015948 methanogenesis, 0055114 oxidation-reduction process
Probab=30.27 E-value=54 Score=22.00 Aligned_cols=48 Identities=4% Similarity=-0.004 Sum_probs=38.4
Q ss_pred chhhhhccchhhhhhHHHHhcccceeeeec----cccccCcHHHHHHHHHHHHH
Q psy4570 36 GRRGYMLTYIIAYIRDFACDYYFIGDSFET----SVPWDKTVLLCINVKKRLTR 85 (95)
Q Consensus 36 G~~W~~~rF~~PYLRd~l~~~g~~~dT~ET----A~~Ws~~~~l~~~V~~a~~~ 85 (95)
|..|.+.|+ -.+++.|.+.|+=-|-++. +..+.++.+..+.+.+.|++
T Consensus 71 Gn~~a~~Rv--~~~k~~L~~~Gi~~eRv~~~~~~~~~~~~fa~~~~~f~~~i~~ 122 (124)
T PF02662_consen 71 GNYRAEKRV--ERLKKLLEELGIEPERVRLYWISAPEGKRFAEIVNEFTERIKE 122 (124)
T ss_pred hhHHHHHHH--HHHHHHHHHcCCChhHeEEEEeCcccHHHHHHHHHHHHHHHHH
Confidence 446777766 7899999999999999988 77888888888877666653
No 26
>PF00994 MoCF_biosynth: Probable molybdopterin binding domain; InterPro: IPR001453 Eukaryotic and prokaryotic molybdoenzymes require a molybdopterin cofactor (MoCF) for their activity. The biosynthesis of this cofactor involves a complex multistep enzymatic pathway. One of the eukaryotic proteins involved in this pathway is the Drosophila protein cinnamon [] which is highly similar to gephyrin, a rat microtubule-associated protein which was thought to anchor the glycine receptor to subsynaptic microtubules. Cinnamon and gephyrin are evolutionary related, in their N-terminal half, to the Escherichia coli MoCF biosynthesis proteins mog/chlG and moaB/chlA2 and, in their C-terminal half, to E. coli moeA/chlE.; GO: 0006777 Mo-molybdopterin cofactor biosynthetic process; PDB: 3TCR_B 1O8O_B 1O8Q_G 1EAV_D 1O8N_C 1UUX_A 1UUY_A 2G2C_A 2G4R_C 3K6A_F ....
Probab=29.52 E-value=73 Score=21.22 Aligned_cols=39 Identities=8% Similarity=0.078 Sum_probs=27.9
Q ss_pred hhhhhhHHHHhcccceeeeeccccccCcHHHHHHHHHHHHH
Q psy4570 45 IIAYIRDFACDYYFIGDSFETSVPWDKTVLLCINVKKRLTR 85 (95)
Q Consensus 45 ~~PYLRd~l~~~g~~~dT~ETA~~Ws~~~~l~~~V~~a~~~ 85 (95)
+.|+|...|.++|+ +.....+--|+...+.+.+++++.+
T Consensus 18 n~~~l~~~l~~~G~--~v~~~~~v~Dd~~~i~~~l~~~~~~ 56 (144)
T PF00994_consen 18 NGPFLAALLEELGI--EVIRYGIVPDDPDAIKEALRRALDR 56 (144)
T ss_dssp HHHHHHHHHHHTTE--EEEEEEEEESSHHHHHHHHHHHHHT
T ss_pred HHHHHHHHHHHcCC--eeeEEEEECCCHHHHHHHHHhhhcc
Confidence 47899999999888 5554445558888888777555443
No 27
>PF00487 FA_desaturase: Fatty acid desaturase This entry is only a subset of the Pfam family.; InterPro: IPR005804 Fatty acid desaturases are enzymes that catalyse the insertion of a double bond at the delta position of fatty acids. There seem to be two distinct families of fatty acid desaturases which do not seem to be evolutionary related. Family 1 is composed of: Stearoyl-CoA desaturase (SCD) (1.14.19.1 from EC) []. Family 2 is composed of: Bacterial fatty acid desaturases. Plant stearoyl-acyl-carrier-protein desaturase (1.14.19.1 from EC) [], this enzyme catalyzes the introduction of a double bond at the delta(9) position of steraoyl-ACP to produce oleoyl-ACP. This enzyme is responsible for the conversion of saturated fatty acids to unsaturated fatty acids in the synthesis of vegetable oils. Cyanobacterial DesA [], an enzyme that can introduce a second cis double bond at the delta(12) position of fatty acid bound to membranes glycerolipids. DesA is involved in chilling tolerance; the phase transition temperature of lipids of cellular membranes being dependent on the degree of unsaturation of fatty acids of the membrane lipids. This entry contains fatty acid desaturases belonging to Family 1. ; GO: 0006629 lipid metabolic process
Probab=29.03 E-value=44 Score=22.88 Aligned_cols=25 Identities=24% Similarity=0.383 Sum_probs=17.6
Q ss_pred ccccccCcHHHHHHHHHHHHHHHhhCCCC
Q psy4570 65 TSVPWDKTVLLCINVKKRLTRECTGRWLP 93 (95)
Q Consensus 65 TA~~Ws~~~~l~~~V~~a~~~a~~~~g~~ 93 (95)
-+++|.+++++++.+++ .+++++.+
T Consensus 228 P~vp~~~l~~~~~~~~~----~~~~~~~~ 252 (257)
T PF00487_consen 228 PGVPWYNLPEAHPILKE----VCPEYGVP 252 (257)
T ss_pred CCcCHHHHHHHHHHHHH----HHHHcCCc
Confidence 47889999999988854 45555443
No 28
>cd07045 BMC_CcmK_like Carbon dioxide concentrating mechanism K (CcmK)-like proteins, Bacterial Micro-Compartment (BMC) domain. Bacterial micro-compartments are primitive protein-based organelles that sequester specific metabolic pathways in bacterial cells. The prototypical bacterial microcompartment is the carboxysome shell, a bacterial polyhedral organelle which increase the efficiency of CO2 fixation by encapsulating RuBisCO and carbonic anhydrase. They can be divided into two types: alpha-type carboxysomes (alpha-cyanobacteria and proteobacteria) and beta-type carboxysomes (beta-cyanobacteria). Potential functional differences between the two types are not yet fully understood. In addition to these proteins there are several homologous shell proteins including those found in pdu organelles involved in coenzyme B12-dependent degradation of 1,2-propanediol and eut organelles involved in the cobalamin-dependent degradation of ethanolamine. Structure evidence shows that several carbox
Probab=28.53 E-value=93 Score=19.57 Aligned_cols=27 Identities=26% Similarity=0.307 Sum_probs=23.6
Q ss_pred EEEEecChHHHHhHHHHHHHHHHHcCC
Q psy4570 2 ALLVLGDPEDVKNNEDKIYSIAKRYGG 28 (95)
Q Consensus 2 ~~gfEG~~~~v~~~~~~~~~i~~~~GG 28 (95)
++.+.|+-..|+...+.+.+.++++|.
T Consensus 40 ~vii~GdvsaV~~Av~a~~~~~~~~~~ 66 (84)
T cd07045 40 TVKITGDVAAVKAAVEAGAAAAERIGE 66 (84)
T ss_pred EEEEEEcHHHHHHHHHHHHHHHhccCc
Confidence 678999999999999999998888765
No 29
>PRK09487 sdhC succinate dehydrogenase cytochrome b556 large membrane subunit; Provisional
Probab=28.24 E-value=20 Score=24.49 Aligned_cols=26 Identities=19% Similarity=0.387 Sum_probs=20.7
Q ss_pred hhHHHHhcccceeeeeccccccCcHH
Q psy4570 49 IRDFACDYYFIGDSFETSVPWDKTVL 74 (95)
Q Consensus 49 LRd~l~~~g~~~dT~ETA~~Ws~~~~ 74 (95)
+|+.+||.|+..|+++++...+.+.-
T Consensus 89 IRHL~wD~g~g~~~~~~~~~sa~~v~ 114 (129)
T PRK09487 89 IRHLLMDFGYLEETLEAGKRSAKISF 114 (129)
T ss_pred HHHHHHHccccchhHHHHHHHHHHHH
Confidence 79999999999888888876554443
No 30
>PF12738 PTCB-BRCT: twin BRCT domain; PDB: 3PA6_A 3KTF_C 2WT8_C 3EF1_A 3EF0_A.
Probab=28.02 E-value=23 Score=20.24 Aligned_cols=28 Identities=11% Similarity=0.166 Sum_probs=16.2
Q ss_pred EEEEecChHHHHhHHHHHHHHHHHcCCccCC
Q psy4570 2 ALLVLGDPEDVKNNEDKIYSIAKRYGGIPAG 32 (95)
Q Consensus 2 ~~gfEG~~~~v~~~~~~~~~i~~~~GG~~~G 32 (95)
++.|.|-...- +..+.+++..+||.+..
T Consensus 2 ~i~~sg~~~~~---~~~l~~~i~~~Gg~~~~ 29 (63)
T PF12738_consen 2 VICFSGFSGKE---RSQLRKLIEALGGKYSK 29 (63)
T ss_dssp EEEEEEB-TTT---CCHHHHHHHCTT-EEES
T ss_pred EEEECCCCHHH---HHHHHHHHHHCCCEEec
Confidence 44455533222 66777899999996554
No 31
>cd07963 Anticodon_Ia_Cys Anticodon-binding domain of cysteinyl tRNA synthetases. This domain is found in cysteinyl tRNA synthetases (CysRS), which belong to the class Ia aminoacyl tRNA synthetases. It lies C-terminal to the catalytic core domain, and recognizes and specifically binds to the tRNA anticodon. CysRS catalyzes the transfer of cysteine to the 3'-end of its tRNA.
Probab=27.81 E-value=40 Score=22.93 Aligned_cols=27 Identities=22% Similarity=0.234 Sum_probs=21.7
Q ss_pred chhh-hhhHHHHhcccceeeeecccccc
Q psy4570 44 YIIA-YIRDFACDYYFIGDSFETSVPWD 70 (95)
Q Consensus 44 F~~P-YLRd~l~~~g~~~dT~ETA~~Ws 70 (95)
|... .|||-|..+|+.++.----++|.
T Consensus 128 f~~AD~IRd~L~~~Gi~i~Dt~~Gt~w~ 155 (156)
T cd07963 128 WAEADRIRDELAAQGIILEDSPEGTTWR 155 (156)
T ss_pred HHHHHHHHHHHHHCCcEEEECCCCceee
Confidence 6555 99999999999998765566775
No 32
>PF00665 rve: Integrase core domain; InterPro: IPR001584 Integrase comprises three domains capable of folding independently and whose three-dimensional structures are known. However, the manner in which the N-terminal, catalytic, and C-terminal domains interact in the holoenzyme remains obscure. Numerous studies indicate that the enzyme functions as a multimer, minimally a dimer. The integrase proteins from Human immunodeficiency virus 1 (HIV-1) and Avian sarcoma virus (ASV) have been studied most carefully with respect to the structural basis of catalysis. Although the active site of ASV integrase does not undergo significant conformational changes on binding the required metal cofactor, that of HIV-1 does. This active site-mediated conformational change in HIV-1 reorganises the catalytic core and C-terminal domains and appears to promote an interaction that is favourable for catalysis []. Retroviral integrase is synthesised as part of the POL polyprotein that contains; an aspartyl protease, a reverse transcriptase, RNase H and integrase. POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins. The presence of retrovirus integrase-related gene sequences in eukaryotes is known. Bacterial transposases involved in the transposition of the insertion sequence also belong to this group. HIV integrase catalyses the incorporation of virally derived DNA into the human genome. This unique step in the virus life cycle provides a variety of points for intervention and hence is an attractive target for the development of new therapeutics for the treatment of AIDS []. Substrate recognition by the retroviral integrase enzyme is critical for retroviral integration. To catalyse this recombination event, integrase must recognise and act on two types of substrates, viral DNA and host DNA, yet the necessary interactions exhibit markedly different degrees of specificity [].; GO: 0015074 DNA integration; PDB: 3AO3_A 3OVN_A 3AO5_A 3AO4_A 3AO1_A 1C6V_D 3HPG_A 3HPH_A 3OYD_A 3OYF_B ....
Probab=27.80 E-value=58 Score=20.13 Aligned_cols=67 Identities=12% Similarity=0.114 Sum_probs=44.8
Q ss_pred HHHhHHHHHHHHHHHcCCcc---CCCccchhhhhccchhhhhhHHHHhcccceeeeeccccccC--cHHHHHHHHHH
Q psy4570 11 DVKNNEDKIYSIAKRYGGIP---AGESNGRRGYMLTYIIAYIRDFACDYYFIGDSFETSVPWDK--TVLLCINVKKR 82 (95)
Q Consensus 11 ~v~~~~~~~~~i~~~~GG~~---~G~~~G~~W~~~rF~~PYLRd~l~~~g~~~dT~ETA~~Ws~--~~~l~~~V~~a 82 (95)
......+...+++...|+.. +=.+.| .-|..+.++..+..+|+-.-..-...+|+| ++..+.-+++.
T Consensus 48 ~~~~~~~~l~~~~~~~~~~~p~~i~tD~g-----~~f~~~~~~~~~~~~~i~~~~~~~~~p~~ng~vEr~~~~l~~~ 119 (120)
T PF00665_consen 48 TAEAALRALKRAIEKRGGRPPRVIRTDNG-----SEFTSHAFEAWCKHLGIKHVFTPPYTPQQNGFVERFNRTLKRR 119 (120)
T ss_dssp HHHHHHHHHHHHHHHHS-SE-SEEEEESC-----HHHHSHHHHHHHHHHT-EEEESSTSSTHHHHHHHHHHHHHHHH
T ss_pred ccccccccccccccccccccceecccccc-----cccccchhhhHHHHcCceEeeCCCCChhhccHHHHHHHHHHHh
Confidence 45556666677888888842 222222 234445888999999998888888889988 77777777654
No 33
>PF13250 DUF4041: Domain of unknown function (DUF4041)
Probab=27.41 E-value=63 Score=19.04 Aligned_cols=22 Identities=9% Similarity=0.194 Sum_probs=17.3
Q ss_pred eccccccCcHHHHHHHHHHHHH
Q psy4570 64 ETSVPWDKTVLLCINVKKRLTR 85 (95)
Q Consensus 64 ETA~~Ws~~~~l~~~V~~a~~~ 85 (95)
=.-|+|+|++.+-+.++.+...
T Consensus 15 i~kv~~~Ni~~~~~rI~ksf~~ 36 (56)
T PF13250_consen 15 ISKVKYNNIDTMEKRIEKSFEQ 36 (56)
T ss_pred HHhCChhhHHHHHHHHHHHHHH
Confidence 3458999999999888777654
No 34
>PF04304 DUF454: Protein of unknown function (DUF454); InterPro: IPR007401 This is a predicted membrane protein.
Probab=26.80 E-value=38 Score=20.15 Aligned_cols=19 Identities=16% Similarity=0.132 Sum_probs=14.4
Q ss_pred hhccchhhhhhHHHHhccc
Q psy4570 40 YMLTYIIAYIRDFACDYYF 58 (95)
Q Consensus 40 ~~~rF~~PYLRd~l~~~g~ 58 (95)
.++|+-+||+||.--+.|+
T Consensus 6 ~~h~~~g~~I~~w~~~r~i 24 (71)
T PF04304_consen 6 LNHRLFGPYIRNWEEHRGI 24 (71)
T ss_pred HcCchhHHHHHHHHHCCCc
Confidence 4677889999998776554
No 35
>PF01766 Birna_VP2: Birnavirus VP2 protein; InterPro: IPR002662 Infectious pancreatic necrosis virus (IPNV), a birnavirus, is an important pathogen in fish farms. Analyses of viral proteins showed that VP2 is the major structural and immunogenic polypeptide of the virus [, ]. All neutralizing monoclonal antibodies are specific to VP2 and bind to continuous or discontinuous epitopes. The variable domain of VP2 and the 20 adjacent amino acids of the conserved C-terminal are probably the most important in inducing an immune response for the protection of animals []. The large RNA segment of the Birnaviridae codes for a polyprotein (N-VP2-VP4-VP3-C), most of which is then processed to generate the constituent polypeptides. VP4 protein is involved in generating VP2 and VP3 []. Recombinant VP3 is more immunogenic than recombinant VP2 [].; GO: 0005198 structural molecule activity; PDB: 2DF7_T 3IDE_D 3FBM_A 2GSY_J 2IMU_A 1WCD_J.
Probab=25.87 E-value=6.6 Score=32.38 Aligned_cols=48 Identities=13% Similarity=0.184 Sum_probs=26.7
Q ss_pred EEEecChHHHHhH-----HHHHHHHHHHcCCccCCCccchhhhhccchhhhhh
Q psy4570 3 LLVLGDPEDVKNN-----EDKIYSIAKRYGGIPAGESNGRRGYMLTYIIAYIR 50 (95)
Q Consensus 3 ~gfEG~~~~v~~~-----~~~~~~i~~~~GG~~~G~~~G~~W~~~rF~~PYLR 50 (95)
+.|+|+..+|... +.....+.-+-|.+.+|++.+=-=.-.+|+.||.|
T Consensus 121 vt~~G~lSEv~~~~Y~~llS~tsn~~DKvgnvlV~dGv~VLSLPt~fD~pYVR 173 (441)
T PF01766_consen 121 VTFQGSLSEVEDLSYNSLLSATSNPLDKVGNVLVGDGVAVLSLPTGFDNPYVR 173 (441)
T ss_dssp EEESS-GGG-S--STTGGGGS-S-GGGEEEEEETTT-EEEE---S-SS---EE
T ss_pred EEeccCHHHcccccccceeeeccChhhhhcCEEecCceEEEecCcccCCCcee
Confidence 5688988887643 34444667788999999987777778899999999
No 36
>PLN03220 uncharacterized protein; Provisional
Probab=25.68 E-value=21 Score=24.02 Aligned_cols=16 Identities=0% Similarity=0.144 Sum_probs=12.1
Q ss_pred hhhccchhh--hhhHHHH
Q psy4570 39 GYMLTYIIA--YIRDFAC 54 (95)
Q Consensus 39 W~~~rF~~P--YLRd~l~ 54 (95)
+...||.+| ||.+.++
T Consensus 50 ~e~kRFVVPv~yL~hP~F 67 (105)
T PLN03220 50 MEKKRFVVPISFLNHPSF 67 (105)
T ss_pred ccceEEEEEHHHcCChHH
Confidence 457899998 8877654
No 37
>PHA03321 tegument protein VP11/12; Provisional
Probab=25.08 E-value=74 Score=27.75 Aligned_cols=56 Identities=13% Similarity=0.007 Sum_probs=43.8
Q ss_pred hhhhccchhhhhhHHHHhcc-------cceeeeeccccccCcHHHHHH-HHHHHHHHHhhCCCCC
Q psy4570 38 RGYMLTYIIAYIRDFACDYY-------FIGDSFETSVPWDKTVLLCIN-VKKRLTRECTGRWLPV 94 (95)
Q Consensus 38 ~W~~~rF~~PYLRd~l~~~g-------~~~dT~ETA~~Ws~~~~l~~~-V~~a~~~a~~~~g~~~ 94 (95)
-|.++=..-||||..|-..+ -++-|+ |++.|.+++.=... .+.++.+++-.+|+|.
T Consensus 352 cWl~Ggl~N~yLr~ALrsQ~RF~h~~G~LfPTM-s~~SWa~mErst~aWF~~AlArSl~~hG~PT 415 (694)
T PHA03321 352 VWASDSLNNEYLRAAVERQERFCRTTAPLFPTM-TASSWARMERSIKAWFEAALATELFRTGVPS 415 (694)
T ss_pred EeccCCCcCHHHHHHHHHHHHHHHHhhhhcccc-cHHHHHHHHHHHHHHHHHHHHHHHHHcCCCc
Confidence 58888888899998887764 345556 89999999876655 7888888888888774
No 38
>PLN02873 coproporphyrinogen-III oxidase
Probab=25.06 E-value=89 Score=24.38 Aligned_cols=77 Identities=9% Similarity=-0.060 Sum_probs=59.9
Q ss_pred ChHHHHhHHHHHHHHHHHcCCccCCCccchhhhhccchhhhhhHHHHhcccceeeeeccccccCcHHHHHHHHHHHHHHH
Q psy4570 8 DPEDVKNNEDKIYSIAKRYGGIPAGESNGRRGYMLTYIIAYIRDFACDYYFIGDSFETSVPWDKTVLLCINVKKRLTREC 87 (95)
Q Consensus 8 ~~~~v~~~~~~~~~i~~~~GG~~~G~~~G~~W~~~rF~~PYLRd~l~~~g~~~dT~ETA~~Ws~~~~l~~~V~~a~~~a~ 87 (95)
.++++..-.+.++++|.+|+--+= .-.++|-+.-|.+|-..+.-==-|+..|.|.+ -+|++.-.+.++|-+++..++
T Consensus 111 ~~eD~~~FH~~~K~aCd~~~~~~Y--~~fK~wCD~YFyi~HR~E~RGvGGiFfDdln~-~~~e~~f~f~~~vg~afl~aY 187 (274)
T PLN02873 111 FEEDVKHFHGTYKDVCDKHDPEFY--PRFKKWCDDYFYIKHRGERRGLGGIFFDDLND-RDQEDLLAFATDVANSVVPAY 187 (274)
T ss_pred ChHHHHHHHHHHHHHHhccCchhh--HHHHHHHHHhcchhhcCCCCCCceeecCCCCC-CCHHHHHHHHHHHHHHHHHHh
Confidence 356788888999999999986421 12589999999888877777777888998876 467777788888888887774
No 39
>PF07045 DUF1330: Protein of unknown function (DUF1330); InterPro: IPR010753 This family consists of several hypothetical bacterial proteins of around 90 residues in length. The function of this family is unknown.; PDB: 2FIU_B 3HHL_A 3DCA_D 3LO3_I.
Probab=24.99 E-value=79 Score=18.48 Aligned_cols=20 Identities=20% Similarity=0.423 Sum_probs=14.4
Q ss_pred HHhHHHHHHHHHHHcCCccC
Q psy4570 12 VKNNEDKIYSIAKRYGGIPA 31 (95)
Q Consensus 12 v~~~~~~~~~i~~~~GG~~~ 31 (95)
.+.-.+.+..++++|||..+
T Consensus 4 ~~~Y~~~~~~~l~~~GG~~l 23 (65)
T PF07045_consen 4 YQEYREAVPPILEKYGGRVL 23 (65)
T ss_dssp HHHHHHHHHHHHHHTT-EEE
T ss_pred HHHHHHHHHHHHHHcCCEEE
Confidence 34456788899999999665
No 40
>PF00676 E1_dh: Dehydrogenase E1 component; InterPro: IPR001017 This entry includes a number of dehydrogenases all of which use thiamine pyrophosphate as a cofactor and are members of a multienzyme complex. Pyruvate dehydrogenase (1.2.4.1 from EC), a component of the multienzyme pyruvate dehydrogenase complex; 2-oxoglutarate dehydrogenase (1.2.4.2 from EC), a component of the multienzyme 2-oxoglutarate dehydrogenase which contains multiple copies of three enzymatic components: 2-oxoglutarate dehydrogenase (E1), dihydrolipoamide succinyltransferase (E2) and lipoamide dehydrogenase (E3); and 2-oxoisovalerate dehydrogenase (1.2.4.4 from EC), a component of the multienzyme branched-chain alpha-keto dehydrogenase complex all belong to this family.; GO: 0016624 oxidoreductase activity, acting on the aldehyde or oxo group of donors, disulfide as acceptor, 0008152 metabolic process; PDB: 1X7Y_A 1V1M_A 1X7W_A 1OLU_A 2J9F_A 2BEW_A 1V11_A 2BFE_A 1U5B_A 2BEU_A ....
Probab=24.71 E-value=3e+02 Score=21.03 Aligned_cols=75 Identities=13% Similarity=0.108 Sum_probs=43.9
Q ss_pred EecCh-HHHHhHHHHHHHHHHHcCCccC---------C---Cc-c--------chhhhhccchhhhhhHHHHhcccceee
Q psy4570 5 VLGDP-EDVKNNEDKIYSIAKRYGGIPA---------G---ES-N--------GRRGYMLTYIIAYIRDFACDYYFIGDS 62 (95)
Q Consensus 5 fEG~~-~~v~~~~~~~~~i~~~~GG~~~---------G---~~-~--------G~~W~~~rF~~PYLRd~l~~~g~~~dT 62 (95)
..|.+ ..|-...+.+.+.+++.+|-.+ | .+ + -+.|++.+=-++.+++.|.+.|++-+
T Consensus 192 VDG~D~~av~~a~~~A~~~~R~g~gP~lie~~tyR~~gHs~~Dd~~~yr~~~e~~~~~~~~DPi~~~~~~L~~~g~~t~- 270 (300)
T PF00676_consen 192 VDGNDVEAVYEAAKEAVEYARAGKGPVLIEAVTYRLRGHSESDDPTFYRSPEEYEEWWKKRDPIKRFRRYLIEEGVLTE- 270 (300)
T ss_dssp EETTSHHHHHHHHHHHHHHHHTTT--EEEEEEE--SS-SSTTSCGGGTSHHHHHHHHHHCT-HHHHHHHHHHHTTSS-H-
T ss_pred ECCEeHHHHHHHHHHHHHHHhcCCCCEEEEEeeccCCCCCCCCCccccCCHHHHHHHHhcCcHHHHHHHHHHHcCCCCH-
Confidence 45665 4577788899999999888322 1 11 1 12344444334489999999999865
Q ss_pred eeccccccCcHHHHHHHHHHHHHHH
Q psy4570 63 FETSVPWDKTVLLCINVKKRLTREC 87 (95)
Q Consensus 63 ~ETA~~Ws~~~~l~~~V~~a~~~a~ 87 (95)
+.+.++.+.+++.|.++.
T Consensus 271 -------~~~~~i~~e~~~~v~~a~ 288 (300)
T PF00676_consen 271 -------EELDAIEAEIKAEVEEAV 288 (300)
T ss_dssp -------HHHHHHHHHHHHHHHHHH
T ss_pred -------HHHHHHHHHHHHHHHHHH
Confidence 344455555555555443
No 41
>PF11867 DUF3387: Domain of unknown function (DUF3387); InterPro: IPR021810 This domain is functionally uncharacterised. This domain is found in bacteria and archaea. This presumed domain is typically between 255 to 340 amino acids in length. This domain is found associated with PF04851 from PFAM, PF04313 from PFAM.
Probab=23.75 E-value=80 Score=24.44 Aligned_cols=29 Identities=17% Similarity=0.275 Sum_probs=26.8
Q ss_pred cccccCcHHHHHHHHHHHHHHHhhCCCCC
Q psy4570 66 SVPWDKTVLLCINVKKRLTRECTGRWLPV 94 (95)
Q Consensus 66 A~~Ws~~~~l~~~V~~a~~~a~~~~g~~~ 94 (95)
.++|.+=++....|+..|+..|.+.|.|+
T Consensus 285 ~vDW~~ke~~ra~~r~~Ik~~L~k~~ypp 313 (335)
T PF11867_consen 285 TVDWTKKEDVRAKMRRAIKRLLRKYGYPP 313 (335)
T ss_pred CcCceeCccHHHHHHHHHHHHHHHcCCCh
Confidence 46999999999999999999999999886
No 42
>PRK05330 coproporphyrinogen III oxidase; Provisional
Probab=23.31 E-value=93 Score=24.54 Aligned_cols=77 Identities=6% Similarity=-0.050 Sum_probs=59.4
Q ss_pred ChHHHHhHHHHHHHHHHHcCCccCCCccchhhhhccchhhhhhHHHHhcccceeeeeccccccCcHHHHHHHHHHHHHHH
Q psy4570 8 DPEDVKNNEDKIYSIAKRYGGIPAGESNGRRGYMLTYIIAYIRDFACDYYFIGDSFETSVPWDKTVLLCINVKKRLTREC 87 (95)
Q Consensus 8 ~~~~v~~~~~~~~~i~~~~GG~~~G~~~G~~W~~~rF~~PYLRd~l~~~g~~~dT~ETA~~Ws~~~~l~~~V~~a~~~a~ 87 (95)
.++++..-.+.++++|.+|+...= .-.++|-+.-|.+|-..+.-===|+..|.|.+ -+|++.-++.++|-+++..++
T Consensus 138 ~~eD~~~FH~~~K~~Cd~~~~~~Y--~~fK~wCD~YFyi~HR~E~RGvGGiFfD~l~~-~~~e~~f~F~~~vg~~fl~aY 214 (300)
T PRK05330 138 FEEDAVHFHRTAKDACDPFGPEYY--PRFKKWCDEYFYLKHRNEPRGVGGIFFDDLNS-PGFERDFAFTQAVGDAFLDAY 214 (300)
T ss_pred ChHHHHHHHHHHHHHHhccChhhh--HHHHHHHHHhcChhhcCCCCCCCeeecCCCCC-CCHHHHHHHHHHHHHHHHHHH
Confidence 346778888999999999997541 22689999999888777766667888999987 457777778888877777664
No 43
>PF07914 DUF1679: Protein of unknown function (DUF1679); InterPro: IPR012877 This region is found in a number of Caenorhabditis elegans and Caenorhabditis briggsae proteins, in one case (Q19034 from SWISSPROT) as a repeat. In many of the family members, this region is associated with the CHK region described by SMART as being found in zinc finger-C4 and HLH domain-containing kinases. In fact, one member of this family (Q9GUC1 from SWISSPROT) is annotated as being a member of the nuclear hormone receptor family, and contains regions typical of such proteins (IPR000536 from INTERPRO, IPR008946 from INTERPRO, and IPR001628 from INTERPRO).
Probab=23.22 E-value=31 Score=27.80 Aligned_cols=14 Identities=29% Similarity=0.510 Sum_probs=10.8
Q ss_pred eeccccccCcHHHH
Q psy4570 63 FETSVPWDKTVLLC 76 (95)
Q Consensus 63 ~ETA~~Ws~~~~l~ 76 (95)
|||.|+|..|..-.
T Consensus 9 l~Thvtw~dve~~l 22 (414)
T PF07914_consen 9 LETHVTWEDVEEAL 22 (414)
T ss_pred ccCceeHHHHHHHH
Confidence 79999998765544
No 44
>PF10979 DUF2786: Protein of unknown function (DUF2786); InterPro: IPR024498 This domain is found in proteins that have no known function.
Probab=23.11 E-value=1.3e+02 Score=16.69 Aligned_cols=21 Identities=14% Similarity=0.352 Sum_probs=17.0
Q ss_pred cChHHHHhHHHHHHHHHHHcC
Q psy4570 7 GDPEDVKNNEDKIYSIAKRYG 27 (95)
Q Consensus 7 G~~~~v~~~~~~~~~i~~~~G 27 (95)
+++++.....+.+.+++.+||
T Consensus 19 ~~~~EA~~A~~kAq~Lm~ky~ 39 (43)
T PF10979_consen 19 SNEHEAEAALAKAQRLMAKYG 39 (43)
T ss_pred CCHHHHHHHHHHHHHHHHHhC
Confidence 345678888899999999987
No 45
>COG5470 Uncharacterized conserved protein [Function unknown]
Probab=22.96 E-value=97 Score=20.55 Aligned_cols=59 Identities=14% Similarity=0.008 Sum_probs=36.7
Q ss_pred HHHHhHHHHHHHHHHHcCCccC--CCc---cchhhhhccchhhhhhHHHHhcccceeeeeccccccCcHHHHHHH
Q psy4570 10 EDVKNNEDKIYSIAKRYGGIPA--GES---NGRRGYMLTYIIAYIRDFACDYYFIGDSFETSVPWDKTVLLCINV 79 (95)
Q Consensus 10 ~~v~~~~~~~~~i~~~~GG~~~--G~~---~G~~W~~~rF~~PYLRd~l~~~g~~~dT~ETA~~Ws~~~~l~~~V 79 (95)
+..+.-...+..++++|||.+| |.+ +--.|.-.|- .+ +=++|+|+|-.|=|=+.....+
T Consensus 16 e~y~~Y~~~~~~a~~~~Ggr~LvRGG~v~~lEG~w~ptr~-------vv----iEFps~~~ar~~y~SpeYq~a~ 79 (96)
T COG5470 16 EQYKDYVSKAKPAIEKFGGRYLVRGGEVETLEGEWRPTRN-------VV----IEFPSLEAARDCYNSPEYQAAA 79 (96)
T ss_pred HHHHHHHHHhHHHHHHhCCeeEeeCCCeeeccCCCCcccE-------EE----EEcCCHHHHHHHhcCHHHHHHH
Confidence 4566667888999999999777 322 1123664432 11 2246777777777766665544
No 46
>COG4047 Uncharacterized protein conserved in archaea [Function unknown]
Probab=22.64 E-value=1.4e+02 Score=22.85 Aligned_cols=82 Identities=15% Similarity=0.071 Sum_probs=47.5
Q ss_pred hHHHHhHHHHHHHHHHHcCCccC-------------CCccchhhhhccc---hhhhhhHHHHhcc---------------
Q psy4570 9 PEDVKNNEDKIYSIAKRYGGIPA-------------GESNGRRGYMLTY---IIAYIRDFACDYY--------------- 57 (95)
Q Consensus 9 ~~~v~~~~~~~~~i~~~~GG~~~-------------G~~~G~~W~~~rF---~~PYLRd~l~~~g--------------- 57 (95)
+++|+.|.+.+..+.+.+|.... =++-|+.|+.-+. +.--+||.+.++-
T Consensus 19 Ee~vD~Qykav~~l~~~~~~~~~vrlaianaLvSYqLtgkGEewW~eF~kyf~~~~vr~i~ray~~fLp~s~fnrrlieq 98 (243)
T COG4047 19 EENVDPQYKAVSYLSKGIGSTKFVRLAIANALVSYQLTGKGEEWWWEFAKYFRGREVRDIYRAYKEFLPNSRFNRRLIEQ 98 (243)
T ss_pred HhhcCHHHHHHHHHHhccCchHHHHHHHHHHHHHHhhccCcHHHHHHHHHHHhhccHHHHHHHHHHhccCCchhHHHHHH
Confidence 35699999999999999998422 1234667754432 2222333332221
Q ss_pred ------cceeeeeccccccCcHHHHHHH---HHHHHHHHhhCC
Q psy4570 58 ------FIGDSFETSVPWDKTVLLCINV---KKRLTRECTGRW 91 (95)
Q Consensus 58 ------~~~dT~ETA~~Ws~~~~l~~~V---~~a~~~a~~~~g 91 (95)
=+.+-+||- +|+|+..+|+.+ ..++..++...+
T Consensus 99 KlrRi~ra~~fl~~L-~~~~~~~yyedm~~l~~~la~~lg~d~ 140 (243)
T COG4047 99 KLRRIRRAESFLETL-TEENIEVYYEDMSLLLEALARALGADR 140 (243)
T ss_pred HHHHHHHHHHHHHHh-hhhhHHHHHhhHHHHHHHHHHHhCCCc
Confidence 122223333 889999988875 456666665543
No 47
>COG5275 BRCT domain type II [General function prediction only]
Probab=22.64 E-value=71 Score=24.61 Aligned_cols=37 Identities=24% Similarity=0.520 Sum_probs=25.1
Q ss_pred CEEEEecChHHHHhHHHHHHHHHHHcCC------------ccCCCccchhh
Q psy4570 1 MALLVLGDPEDVKNNEDKIYSIAKRYGG------------IPAGESNGRRG 39 (95)
Q Consensus 1 ~~~gfEG~~~~v~~~~~~~~~i~~~~GG------------~~~G~~~G~~W 39 (95)
+++.|+|.-... .+..+..+++.||| +.+|..+|.+-
T Consensus 160 ~~fVfTG~l~Tl--sR~~a~~lvk~yGgrvT~~pSskTtflvlGdnaGP~K 208 (276)
T COG5275 160 KVFVFTGDLKTL--SRDDAKTLVKVYGGRVTAVPSSKTTFLVLGDNAGPSK 208 (276)
T ss_pred cEEEEecccccc--cchhHHHHHHHhCCeeecccccceeEEEecCCCChHH
Confidence 367889865443 34556678999999 35677777443
No 48
>PF13368 Toprim_C_rpt: Topoisomerase C-terminal repeat
Probab=22.52 E-value=18 Score=21.38 Aligned_cols=16 Identities=6% Similarity=-0.047 Sum_probs=11.2
Q ss_pred hhccchhhhhhHHHHhc
Q psy4570 40 YMLTYIIAYIRDFACDY 56 (95)
Q Consensus 40 ~~~rF~~PYLRd~l~~~ 56 (95)
..+|| +||+.+.....
T Consensus 10 ~~GRf-GPYv~~g~~~~ 25 (61)
T PF13368_consen 10 KNGRF-GPYVKHGKKNA 25 (61)
T ss_pred eECCC-CceEEECCccc
Confidence 56788 89997664443
No 49
>COG2723 BglB Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase [Carbohydrate transport and metabolism]
Probab=22.25 E-value=61 Score=26.98 Aligned_cols=47 Identities=23% Similarity=0.222 Sum_probs=35.6
Q ss_pred hhhHHHHhcccceeeeeccccccCcHHH-HH--------HHHHHHHHHHhhCCCCC
Q psy4570 48 YIRDFACDYYFIGDSFETSVPWDKTVLL-CI--------NVKKRLTRECTGRWLPV 94 (95)
Q Consensus 48 YLRd~l~~~g~~~dT~ETA~~Ws~~~~l-~~--------~V~~a~~~a~~~~g~~~ 94 (95)
|=.|..+-.-+...+|=||+.|||+-+= -. +..+.|-++|.++|+.+
T Consensus 61 YkeDi~L~~emG~~~~R~SI~WsRIfP~g~~~e~N~~gl~fY~~l~del~~~gIep 116 (460)
T COG2723 61 YKEDIALAKEMGLNAFRTSIEWSRIFPNGDGGEVNEKGLRFYDRLFDELKARGIEP 116 (460)
T ss_pred hHHHHHHHHHcCCCEEEeeeeEEEeecCCCCCCcCHHHHHHHHHHHHHHHHcCCEE
Confidence 6678888888889999999999998651 11 13456688888888765
No 50
>smart00852 MoCF_biosynth Probable molybdopterin binding domain. This domain is found a variety of proteins involved in biosynthesis of molybdopterin cofactor. The domain is presumed to bind molybdopterin. The structure of this domain is known, and it forms an alpha/beta structure. In the known structure of Gephyrin this domain mediates trimerisation.
Probab=22.07 E-value=1.6e+02 Score=19.21 Aligned_cols=38 Identities=8% Similarity=0.047 Sum_probs=27.7
Q ss_pred chhhhhhHHHHhcccceeeeeccccccCcHHHHHHHHHHH
Q psy4570 44 YIIAYIRDFACDYYFIGDSFETSVPWDKTVLLCINVKKRL 83 (95)
Q Consensus 44 F~~PYLRd~l~~~g~~~dT~ETA~~Ws~~~~l~~~V~~a~ 83 (95)
-+.|+|+..|.++|+-+...... + |+...+.+.+++.+
T Consensus 18 ~~~~~l~~~l~~~G~~~~~~~~v-~-Dd~~~I~~~l~~~~ 55 (135)
T smart00852 18 SNGPALAELLTELGIEVTRYVIV-P-DDKEAIKEALREAL 55 (135)
T ss_pred CcHHHHHHHHHHCCCeEEEEEEe-C-CCHHHHHHHHHHHH
Confidence 36799999999999766555553 3 88887777776554
No 51
>PRK12378 hypothetical protein; Provisional
Probab=21.73 E-value=2e+02 Score=21.60 Aligned_cols=51 Identities=14% Similarity=0.130 Sum_probs=28.9
Q ss_pred HHHHhHHHHHHHHHHHcCCccCCCccchhhhhcc---chhh-----hhhHHHHhccccee
Q psy4570 10 EDVKNNEDKIYSIAKRYGGIPAGESNGRRGYMLT---YIIA-----YIRDFACDYYFIGD 61 (95)
Q Consensus 10 ~~v~~~~~~~~~i~~~~GG~~~G~~~G~~W~~~r---F~~P-----YLRd~l~~~g~~~d 61 (95)
+-..+-...++.++++|||. +|....-+|.-.| +..+ =+-+.+++.|.-.|
T Consensus 101 DN~nRt~~~vr~~f~K~gg~-l~~~gsv~~~Fe~kG~i~i~~~~~d~~~e~aieaGa~~e 159 (235)
T PRK12378 101 DNVNRTVANVRSAFNKNGGN-LGTSGSVAFMFDHKGVFVFEGDDEDELLEALIDADVDVE 159 (235)
T ss_pred CCHHHHHHHHHHHHhhcCCe-ECCCCceeeeeecceEEEeCCCCHHHHHHHHHhCCCCcc
Confidence 34445567788999999995 4443223443222 2222 45566677777333
No 52
>KOG0831|consensus
Probab=21.42 E-value=46 Score=26.67 Aligned_cols=22 Identities=9% Similarity=0.209 Sum_probs=18.2
Q ss_pred hccchhhhhhHHHHhcccceee
Q psy4570 41 MLTYIIAYIRDFACDYYFIGDS 62 (95)
Q Consensus 41 ~~rF~~PYLRd~l~~~g~~~dT 62 (95)
..-|..|.+||.++..|++.=+
T Consensus 143 ~~~F~~P~~Re~l~~~Gl~svS 164 (334)
T KOG0831|consen 143 SGQFYTPFLREYLMSLGLCSVS 164 (334)
T ss_pred ccceeccHHHHHHHHcCCcccc
Confidence 4568899999999999997644
No 53
>PF02120 Flg_hook: Flagellar hook-length control protein FliK; InterPro: IPR021136 This entry represents the C-terminal domain of the flagellar hook-length control protein FliK. This entry also includes YscP of the Yersinia type III secretion system, and equivalent proteins in other pathogenic bacterial type III secretion systems. During flagellar morphogenesis in Salmonella typhimurium and Escherichia coli, flagellar hook-length control protein (FliK) controls the length of the hook by directly measuring the hook length [, ]. It is considered unlikely that FliK functions as a molecular ruler for determining hook length, but that it is more likely to be employing a novel mechanism. The deduced amino acid sequences of FliK proteins from S. typhimurium and E. coli have molecular masses of 41,748 and 39,246 Da, respectively, and are fairly hydrophilic []. Sequence comparison reveals around 50% identity, with greatest conservation in the C-terminal region, with 71% identity in the last 154 amino acids - mutagenesis of this conserved region completely abolishes motility. The central and C-terminal regions are rich in proline and glutamine respectively; it is thought that they may constitute distinct domains [].; PDB: 2RRL_A.
Probab=21.29 E-value=80 Score=18.89 Aligned_cols=25 Identities=4% Similarity=-0.157 Sum_probs=19.2
Q ss_pred cchhhhhhHHHHhcccceeeeeccc
Q psy4570 43 TYIIAYIRDFACDYYFIGDSFETSV 67 (95)
Q Consensus 43 rF~~PYLRd~l~~~g~~~dT~ETA~ 67 (95)
+=..|.|++.|-+.|+-++.+....
T Consensus 54 ~~~~~~L~~~L~~~G~~~~~~~v~~ 78 (85)
T PF02120_consen 54 RQNLPELKERLQAQGLEVVNLSVSQ 78 (85)
T ss_dssp HHTHHHHHHHHHTTT-EEEEEEEES
T ss_pred HHHHHHHHHHHHHCCCCeEEEEEEE
Confidence 3357999999999999999877543
No 54
>cd00758 MoCF_BD MoCF_BD: molybdenum cofactor (MoCF) binding domain (BD). This domain is found a variety of proteins involved in biosynthesis of molybdopterin cofactor, like MoaB, MogA, and MoeA. The domain is presumed to bind molybdopterin.
Probab=21.28 E-value=1.7e+02 Score=19.23 Aligned_cols=39 Identities=8% Similarity=-0.171 Sum_probs=25.9
Q ss_pred chhhhhhHHHHhcccceeeeeccccccCcHHHHHHHHHHHH
Q psy4570 44 YIIAYIRDFACDYYFIGDSFETSVPWDKTVLLCINVKKRLT 84 (95)
Q Consensus 44 F~~PYLRd~l~~~g~~~dT~ETA~~Ws~~~~l~~~V~~a~~ 84 (95)
-+.|+|+..|.++|+-+....+. + |+...+.+.+++++.
T Consensus 19 ~n~~~l~~~l~~~G~~v~~~~~v-~-Dd~~~i~~~i~~~~~ 57 (133)
T cd00758 19 TNGPALEALLEDLGCEVIYAGVV-P-DDADSIRAALIEASR 57 (133)
T ss_pred chHHHHHHHHHHCCCEEEEeeec-C-CCHHHHHHHHHHHHh
Confidence 36799999999999554333332 3 777777766665544
No 55
>cd00027 BRCT Breast Cancer Suppressor Protein (BRCA1), carboxy-terminal domain. The BRCT domain is found within many DNA damage repair and cell cycle checkpoint proteins. The unique diversity of this domain superfamily allows BRCT modules to interact forming homo/hetero BRCT multimers, BRCT-non-BRCT interactions, and interactions within DNA strand breaks.
Probab=21.04 E-value=1.2e+02 Score=16.12 Aligned_cols=28 Identities=18% Similarity=0.472 Sum_probs=18.2
Q ss_pred EEEEecChHHHHhHHHHHHHHHHHcCCccC
Q psy4570 2 ALLVLGDPEDVKNNEDKIYSIAKRYGGIPA 31 (95)
Q Consensus 2 ~~gfEG~~~~v~~~~~~~~~i~~~~GG~~~ 31 (95)
.+-+.|... ...+....++++.+||...
T Consensus 3 ~~~i~g~~~--~~~~~~l~~~i~~~Gg~v~ 30 (72)
T cd00027 3 TFVITGDLP--SEERDELKELIEKLGGKVT 30 (72)
T ss_pred EEEEEecCC--CcCHHHHHHHHHHcCCEEe
Confidence 344555432 4556777789999999544
No 56
>cd03522 MoeA_like MoeA_like. This domain is similar to a domain found in a variety of proteins involved in biosynthesis of molybdopterin cofactor, like MoaB, MogA, and MoeA. There this domain is presumed to bind molybdopterin. The exact function of this subgroup is unknown.
Probab=21.01 E-value=2.8e+02 Score=21.66 Aligned_cols=67 Identities=7% Similarity=-0.065 Sum_probs=39.5
Q ss_pred hHHHHHHHHHHHcCCc-------------cCCCccchhhhhccchhhhhhHHHHhcccceeeeeccccccCcHHHHHHHH
Q psy4570 14 NNEDKIYSIAKRYGGI-------------PAGESNGRRGYMLTYIIAYIRDFACDYYFIGDSFETSVPWDKTVLLCINVK 80 (95)
Q Consensus 14 ~~~~~~~~i~~~~GG~-------------~~G~~~G~~W~~~rF~~PYLRd~l~~~g~~~dT~ETA~~Ws~~~~l~~~V~ 80 (95)
.+.+++.++++..|=. ..|.+....-.... +.|.|+..|.++|+- ..+..+--|+...+.+.++
T Consensus 137 ~~~~~a~~~~~~~gi~~V~v~r~~rv~II~TG~Ev~~G~i~D~-~~~~l~~~L~~~G~~--v~~~~iv~Dd~~~I~~ai~ 213 (312)
T cd03522 137 ALVERAEALARDGPLLRVAPFRPLRVGLIVTGSEVYGGRIEDK-FGPVLRARLAALGVE--LVEQVIVPHDEAAIAAAIA 213 (312)
T ss_pred HHHHHHHHHHHhCCCcEEEecCCCEEEEEEcCCcCCCCcEEEh-HHHHHHHHHHHCCCE--EEEEEEcCCCHHHHHHHHH
Confidence 4556777788653331 23665432222222 479999999999954 4444444577777776665
Q ss_pred HHH
Q psy4570 81 KRL 83 (95)
Q Consensus 81 ~a~ 83 (95)
+++
T Consensus 214 ~~~ 216 (312)
T cd03522 214 EAL 216 (312)
T ss_pred HHh
Confidence 554
No 57
>PRK06063 DNA polymerase III subunit epsilon; Provisional
Probab=20.92 E-value=1e+02 Score=23.87 Aligned_cols=26 Identities=8% Similarity=0.093 Sum_probs=20.9
Q ss_pred CEEEEecChHHHHhHHHHHHHHHHHcCCc
Q psy4570 1 MALLVLGDPEDVKNNEDKIYSIAKRYGGI 29 (95)
Q Consensus 1 ~~~gfEG~~~~v~~~~~~~~~i~~~~GG~ 29 (95)
+.+.|+|+=+ ..+..+.+++.++||.
T Consensus 236 ~~~v~TG~l~---~~R~e~~~~~~~~G~~ 261 (313)
T PRK06063 236 MRVALSAEVS---RTHEELVERILHAGLA 261 (313)
T ss_pred CEEEEecCCC---CCHHHHHHHHHHcCCE
Confidence 3688999754 4778899999999994
No 58
>cd05783 DNA_polB_B1_exo DEDDy 3'-5' exonuclease domain of Sulfolobus solfataricus DNA polymerase B1 and similar archaeal family-B DNA polymerases. The 3'-5' exonuclease domain of Sulfolobus solfataricus DNA polymerase B1 and similar archaeal proteins. B1 is a family-B DNA polymerase. Family-B DNA polymerases contain an N-terminal DEDDy DnaQ-like exonuclease domain in the same polypeptide chain as the polymerase domain, similar to family-A DNA polymerases. B1displays thermostable polymerase and 3'-5' exonuclease activities. This exonuclease domain contains three sequence motifs termed ExoI, ExoII and ExoIII, with a specific YX(3)D pattern at ExoIII. These motifs are clustered around the active site and are involved in metal binding and catalysis. The exonuclease domain of family-B polymerases also contains a beta hairpin structure that plays an important role in active site switching in the event of nucleotide misincorporation. Family-B DNA polymerases from thermophilic archaea are uniq
Probab=20.85 E-value=55 Score=23.66 Aligned_cols=21 Identities=10% Similarity=-0.086 Sum_probs=18.9
Q ss_pred hhhccchhhhhhHHHHhcccc
Q psy4570 39 GYMLTYIIAYIRDFACDYYFI 59 (95)
Q Consensus 39 W~~~rF~~PYLRd~l~~~g~~ 59 (95)
|=-..|+.|||.+.+..+|+-
T Consensus 93 ~N~~~FDlpyl~~R~~~~gi~ 113 (204)
T cd05783 93 FNGDNFDLPYLYNRALKLGIP 113 (204)
T ss_pred eCCCCcCHHHHHHHHHHhCCC
Confidence 777889999999999999976
No 59
>PRK06195 DNA polymerase III subunit epsilon; Validated
Probab=20.80 E-value=86 Score=23.98 Aligned_cols=27 Identities=19% Similarity=0.368 Sum_probs=20.5
Q ss_pred CEEEEecChHHHHhHHHHHHHHHHHcCCc
Q psy4570 1 MALLVLGDPEDVKNNEDKIYSIAKRYGGI 29 (95)
Q Consensus 1 ~~~gfEG~~~~v~~~~~~~~~i~~~~GG~ 29 (95)
+.+.|+|+-+ ...++.+.++++.+||.
T Consensus 224 ~~~vfTG~l~--~~~R~~~~~~~~~~Gg~ 250 (309)
T PRK06195 224 EVVVFTGGLA--SMTRDEAMILVRRLGGT 250 (309)
T ss_pred CEEEEccccC--CCCHHHHHHHHHHhCCE
Confidence 3678888642 23588899999999994
No 60
>PF11308 GHL1-3: Glycosyl hydrolases related to GH101 family, GHL1-GHL3; InterPro: IPR021459 Some members in this family of proteins with unknown function are annotated as lipoproteins however this cannot be confirmed. Currently no function is known.
Probab=20.78 E-value=1.1e+02 Score=23.84 Aligned_cols=34 Identities=9% Similarity=-0.045 Sum_probs=25.3
Q ss_pred hHHHHhHHHHHHHHHHHcCCccCCCccchhhhhc
Q psy4570 9 PEDVKNNEDKIYSIAKRYGGIPAGESNGRRGYML 42 (95)
Q Consensus 9 ~~~v~~~~~~~~~i~~~~GG~~~G~~~G~~W~~~ 42 (95)
+.+....+.+..+.+++..|+++|++.|..|...
T Consensus 145 ~~~~~~~r~~~~~~l~~~~~~v~GSE~Gnd~a~~ 178 (307)
T PF11308_consen 145 RAQDLAARLARMQYLREELGLVLGSEDGNDWAAP 178 (307)
T ss_pred HHHHHHHHHHHHHHHHhcCceeeccCchhhhhcc
Confidence 4555555666667777888888999999988764
No 61
>COG4571 OmpT Outer membrane protease [Cell envelope biogenesis, outer membrane]
Probab=20.30 E-value=33 Score=27.10 Aligned_cols=35 Identities=20% Similarity=0.274 Sum_probs=24.1
Q ss_pred ccCCCccc---hhhhhccchhhhhhHHHHhcccceeeeecc
Q psy4570 29 IPAGESNG---RRGYMLTYIIAYIRDFACDYYFIGDSFETS 66 (95)
Q Consensus 29 ~~~G~~~G---~~W~~~rF~~PYLRd~l~~~g~~~dT~ETA 66 (95)
..+|.-|+ -..|+.||.+||+- ++--+..+.||--
T Consensus 177 ~~ig~fP~g~r~IgY~Qrf~m~yiG---lag~Yr~~~~e~~ 214 (314)
T COG4571 177 FYIGNFPNGVRAIGYKQRFKMPYIG---LAGSYRYNDFEFG 214 (314)
T ss_pred cccccCCCCceeeeecccccceeee---ccceEEEeeeeee
Confidence 34555443 45799999999985 5666666777754
No 62
>PF10281 Ish1: Putative stress-responsive nuclear envelope protein; InterPro: IPR018803 This group of proteins, found primarily in fungi, consists of putative stress-responsive nuclear envelope protein Ish1 and homologues [].
Probab=20.11 E-value=1.4e+02 Score=15.73 Aligned_cols=29 Identities=7% Similarity=-0.166 Sum_probs=18.7
Q ss_pred hhhhHHHHhcccceeeeeccccccCcHHHHHHHH
Q psy4570 47 AYIRDFACDYYFIGDSFETSVPWDKTVLLCINVK 80 (95)
Q Consensus 47 PYLRd~l~~~g~~~dT~ETA~~Ws~~~~l~~~V~ 80 (95)
-.||+.|.++|+.+.--- .+-.+|.+.|+
T Consensus 7 ~~L~~wL~~~gi~~~~~~-----~~rd~Ll~~~k 35 (38)
T PF10281_consen 7 SDLKSWLKSHGIPVPKSA-----KTRDELLKLAK 35 (38)
T ss_pred HHHHHHHHHcCCCCCCCC-----CCHHHHHHHHH
Confidence 468889999998876432 34455555554
Done!