Query psy8079
Match_columns 135
No_of_seqs 103 out of 1042
Neff 5.7
Searched_HMMs 46136
Date Fri Aug 16 17:52:08 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy8079.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/8079hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF00379 Chitin_bind_4: Insect 99.7 5.5E-17 1.2E-21 102.3 7.7 52 35-91 1-52 (52)
2 PF06004 DUF903: Bacterial pro 83.7 2.1 4.6E-05 26.8 3.6 39 37-83 2-40 (50)
3 PRK15401 alpha-ketoglutarate-d 59.5 24 0.00052 28.2 5.1 49 68-116 46-105 (213)
4 PHA02698 hypothetical protein; 56.7 7.2 0.00016 26.9 1.5 21 89-109 19-39 (89)
5 PF10902 DUF2693: Protein of u 53.8 51 0.0011 22.7 5.3 57 33-93 1-63 (83)
6 COG0450 AhpC Peroxiredoxin [Po 48.5 53 0.0012 26.1 5.4 61 43-118 106-166 (194)
7 KOG1691|consensus 40.5 1.5E+02 0.0033 23.9 6.9 32 64-95 79-110 (210)
8 COG4786 FlgG Flagellar basal b 39.8 50 0.0011 27.4 4.2 56 47-102 80-138 (265)
9 PRK05461 apaG CO2+/MG2+ efflux 36.1 85 0.0018 23.0 4.5 21 62-82 93-113 (127)
10 PF09244 DUF1964: Domain of un 34.9 42 0.0009 22.2 2.4 21 68-88 4-24 (68)
11 TIGR01643 YD_repeat_2x YD repe 34.1 71 0.0015 17.9 3.2 21 71-93 9-29 (42)
12 cd07240 ED_TypeI_classII_N N-t 32.8 69 0.0015 20.9 3.4 20 69-88 98-117 (117)
13 PF12118 SprA-related: SprA-re 31.2 75 0.0016 26.7 4.0 17 69-88 175-192 (294)
14 cd07252 BphC1-RGP6_N_like N-te 30.7 70 0.0015 21.6 3.2 21 68-88 100-120 (120)
15 PF05423 Mycobact_memb: Mycoba 30.5 1E+02 0.0022 23.0 4.2 24 66-90 51-74 (140)
16 PHA02978 hypothetical protein; 29.7 46 0.00099 24.5 2.2 15 69-83 76-90 (135)
17 PF05899 Cupin_3: Protein of u 28.8 1.5E+02 0.0032 19.2 4.4 26 62-89 28-53 (74)
18 PF13511 DUF4124: Domain of un 27.3 54 0.0012 20.1 2.0 18 67-87 13-30 (60)
19 PRK14818 NADH dehydrogenase su 25.5 56 0.0012 25.5 2.1 21 97-117 133-153 (173)
20 PF10929 DUF2811: Protein of u 25.3 73 0.0016 20.6 2.3 20 103-122 5-25 (57)
21 PF03562 MltA: MltA specific i 24.4 53 0.0011 25.2 1.8 54 64-121 82-143 (158)
22 COG3422 Uncharacterized conser 23.0 1.1E+02 0.0023 19.9 2.7 20 27-46 7-26 (59)
23 TIGR03795 chp_BMA0021 conserve 22.5 1.4E+02 0.0031 21.7 3.7 23 98-120 89-112 (114)
24 PF00659 POLO_box: POLO box du 22.2 2.1E+02 0.0046 17.9 5.5 13 34-46 4-16 (68)
25 cd08344 MhqB_like_N N-terminal 22.1 1.1E+02 0.0024 20.3 2.9 17 69-85 93-109 (112)
26 PF09242 FCSD-flav_bind: Flavo 21.9 92 0.002 20.4 2.4 29 66-94 7-36 (71)
27 PF10681 Rot1: Chaperone for p 20.1 3.8E+02 0.0082 21.7 5.9 60 28-94 28-93 (212)
No 1
>PF00379 Chitin_bind_4: Insect cuticle protein; InterPro: IPR000618 Insect cuticle is composed of proteins and chitin. The cuticular proteins seem to be specific to the type of cuticle (flexible or stiff) that occur at stages of the insect development. The proteins found in the flexible cuticle of larva and pupa of different insects share a conserved C-terminal section [] such a region is also found in the soft endocuticle of adults insects [] as well as in other cuticular proteins including in arachnids []. In addition, cuticular proteins share hydrophobic regions dominated by tetrapeptide repeats (A-A-P-A/V), which are presumed to be functionally important [, ]. Many insect cuticle proteins also include a 35-36 amino acid motif known as the R and R consensus. An extended form of this motif has been shown [] to bind chitin. It has no sequence similiarity to the cysteine-containing chitin-binding domain of chitinases and some peritrophic membrane proteins, suggesting that arthropods have two distinct classes of chitin-binding proteins, those with the chitin-binding domains found in lectins, chitinases and peritrophic membranes (cysCBD), and those with the type of chitin-binding domains found in cuticular proteins (non-cysCBD) []. The cuticle protein signature has been found in locust cuticle proteins 7 (LM-7), 8 (LM-8), 19 (LM-19) and endocuticle structural glycoprotein ABD-4; Hyalophora cecropia (Cecropia moth) cuticle proteins 12 and 66; Drosophila melanogaster (Fruit fly) larval cuticles proteins I, II, III and IV (LCP1 to LCP4); drosophila pupal cuticle proteins PCP, EDG-78E and EDG-84E; Manduca sexta (Tobacco hawkmoth) cuticle protein LCP-14; Tenebrio molitor (Yellow mealworm) cuticle proteins ACP-20, A1A, A2B and A3A; and Araneus diadematus (Spider) cuticle proteins ACP 11.9, ACP 12.4, ACP 12.6, ACP 15.5 and ACP 15.7.; GO: 0042302 structural constituent of cuticle
Probab=99.71 E-value=5.5e-17 Score=102.26 Aligned_cols=52 Identities=40% Similarity=0.788 Sum_probs=46.4
Q ss_pred eeEEEecCCCceEEEeeeeccCCCCCcccceEEEEEEEECCCCCEEEEEEEeCCCCc
Q psy8079 35 YQYSFQTSNGIAAQESGFLKNAGNPEAEVQVAQGAYTYTGADGNPYSLQYTADEEGF 91 (135)
Q Consensus 35 Y~f~Ye~~dG~~~~E~G~~~~~G~~~~~~~~v~GsYsy~~pdG~~~~V~Y~ADenGf 91 (135)
|.|+|+++|| .++|++. .++ ++..++|+|+|+|++|||+.++|+|+||++||
T Consensus 1 Y~f~Y~~~dg-~~~e~~~---~~~-~~~~~~v~GsY~y~~pdG~~~~V~Y~Ad~~Gf 52 (52)
T PF00379_consen 1 YSFGYETSDG-SRQEEGR---PET-EDEGGVVRGSYSYIDPDGQTRTVTYVADENGF 52 (52)
T ss_pred CeeEeEcCCC-CEEEEEE---ccc-CCCCCEEEEEEEEECCCCCEEEEEEECCCCCC
Confidence 7899999999 7888887 222 56779999999999999999999999999998
No 2
>PF06004 DUF903: Bacterial protein of unknown function (DUF903); InterPro: IPR010305 This family consists of several small bacterial proteins several of which are classified as putative lipoproteins. The function of this family is unknown.; PDB: 2RB6_B 3BDU_F 2RA2_F 2RD1_C 3FIF_H 2JN0_A 2K57_A.
Probab=83.68 E-value=2.1 Score=26.84 Aligned_cols=39 Identities=31% Similarity=0.500 Sum_probs=28.4
Q ss_pred EEEecCCCceEEEeeeeccCCCCCcccceEEEEEEEECCCCCEEEEE
Q psy8079 37 YSFQTSNGIAAQESGFLKNAGNPEAEVQVAQGAYTYTGADGNPYSLQ 83 (135)
Q Consensus 37 f~Ye~~dG~~~~E~G~~~~~G~~~~~~~~v~GsYsy~~pdG~~~~V~ 83 (135)
|--.+.||....-.|..+.. .-.|.|+|.+.+|+...|+
T Consensus 2 yvi~t~dG~~i~t~gkP~~D--------~~tG~~~y~D~~G~~~qIn 40 (50)
T PF06004_consen 2 YVITTNDGRTIVTDGKPKYD--------KDTGMYSYTDADGKEQQIN 40 (50)
T ss_dssp EEEEETTSEEEEESSEEEEE--------TTTTEEEEEBTTS-EEEEE
T ss_pred cEEEeCCCcEEEeCCCcccc--------CCCCcEEEECCCCCEEEEc
Confidence 34567788877777776632 2579999999999998875
No 3
>PRK15401 alpha-ketoglutarate-dependent dioxygenase AlkB; Provisional
Probab=59.54 E-value=24 Score=28.22 Aligned_cols=49 Identities=20% Similarity=0.306 Sum_probs=34.0
Q ss_pred EEEEEECCCCCEEEE--------EEEeCCCCccccCCC--CCCC-CCChHHHHHHHHHHH
Q psy8079 68 GAYTYTGADGNPYSL--------QYTADEEGFKPSANY--LPTP-PPIPAEIVKALEFLA 116 (135)
Q Consensus 68 GsYsy~~pdG~~~~V--------~Y~ADenGf~~~g~~--lP~~-pp~p~~i~~al~~~~ 116 (135)
..+....|.|...+| -|++|+.||+=.+.+ -+.| ||+|+.+.+....++
T Consensus 46 p~~~~~~~gg~~msv~mt~~G~~~W~~d~~~YrYs~~~~~~~~pwp~~P~~l~~L~~~~~ 105 (213)
T PRK15401 46 PFRHMVTPGGYTMSVAMTNCGALGWVTDRRGYRYSPIDPLTGKPWPAMPASFLALAQRAA 105 (213)
T ss_pred CccceecCCCCcceeEEeccccceEecCCCCcccCCcCCCCCCCCCCchHHHHHHHHHHH
Confidence 345667788876666 799999999998754 3444 678877666555443
No 4
>PHA02698 hypothetical protein; Provisional
Probab=56.74 E-value=7.2 Score=26.87 Aligned_cols=21 Identities=29% Similarity=0.725 Sum_probs=16.9
Q ss_pred CCccccCCCCCCCCCChHHHH
Q psy8079 89 EGFKPSANYLPTPPPIPAEIV 109 (135)
Q Consensus 89 nGf~~~g~~lP~~pp~p~~i~ 109 (135)
.|-|+.|+..|+|-|+|+.|-
T Consensus 19 ddvqvnGDlmPtp~p~peeV~ 39 (89)
T PHA02698 19 DDVQVNGDLMPTPEPTPEEVP 39 (89)
T ss_pred hccccCCcccCCCCCChhhhc
Confidence 367888999999988887653
No 5
>PF10902 DUF2693: Protein of unknown function (DUF2693); InterPro: IPR024401 This family of proteins is found in bacteria and bacteriophages. Its function is unknown.
Probab=53.78 E-value=51 Score=22.71 Aligned_cols=57 Identities=11% Similarity=0.219 Sum_probs=36.2
Q ss_pred CCeeEEEecCCCceEEEeeeeccCCCCCcccceEEEEEEEECCCC------CEEEEEEEeCCCCccc
Q psy8079 33 GSYQYSFQTSNGIAAQESGFLKNAGNPEAEVQVAQGAYTYTGADG------NPYSLQYTADEEGFKP 93 (135)
Q Consensus 33 G~Y~f~Ye~~dG~~~~E~G~~~~~G~~~~~~~~v~GsYsy~~pdG------~~~~V~Y~ADenGf~~ 93 (135)
|.-.|-|+-.||+.|.-.|+++..-- .....|.+.+..+.. ....+-|--+.+|||-
T Consensus 1 G~h~~~feK~DGtiR~a~gT~d~~li----~~~~~~~~~~~~~~~~~rkes~~s~~yfDve~~~WRS 63 (83)
T PF10902_consen 1 GEHEFVFEKVDGTIRVAKGTRDPDLI----PYEEKGKEMFPPPTKPERKESTTSVRYFDVEKKGWRS 63 (83)
T ss_pred CcEEEEEEecCCCEEEEEEecCcccc----cceecCcccccCCCCccccCCcceEEEEEeccCceee
Confidence 45678999999999999999874211 124555555543332 1334456666778774
No 6
>COG0450 AhpC Peroxiredoxin [Posttranslational modification, protein turnover, chaperones]
Probab=48.49 E-value=53 Score=26.11 Aligned_cols=61 Identities=21% Similarity=0.275 Sum_probs=40.7
Q ss_pred CCceEEEeeeeccCCCCCcccceEEEEEEEECCCCCEEEEEEEeCCCCccccCCCCCCCCCChHHHHHHHHHHHcC
Q psy8079 43 NGIAAQESGFLKNAGNPEAEVQVAQGAYTYTGADGNPYSLQYTADEEGFKPSANYLPTPPPIPAEIVKALEFLASQ 118 (135)
Q Consensus 43 dG~~~~E~G~~~~~G~~~~~~~~v~GsYsy~~pdG~~~~V~Y~ADenGf~~~g~~lP~~pp~p~~i~~al~~~~~~ 118 (135)
++.....-|.+. ++.++.++|.|= |||||..+.+.+.....|=.+. -.-.+++||++.++|
T Consensus 106 ~~~vs~~ygvl~-----~~~g~a~R~~FI-IDp~g~ir~~~v~~~~iGRn~d---------EilR~idAlq~~~~h 166 (194)
T COG0450 106 KGEIARAYGVLH-----PEEGLALRGTFI-IDPDGVIRHILVNPLTIGRNVD---------EILRVIDALQFVAKH 166 (194)
T ss_pred chhHHHHcCCcc-----cCCCcceeEEEE-ECCCCeEEEEEEecCCCCcCHH---------HHHHHHHHHHHHHHh
Confidence 344444455544 345578999974 8999999998888776553332 012467788898887
No 7
>KOG1691|consensus
Probab=40.54 E-value=1.5e+02 Score=23.86 Aligned_cols=32 Identities=31% Similarity=0.649 Sum_probs=27.6
Q ss_pred ceEEEEEEEECCCCCEEEEEEEeCCCCccccC
Q psy8079 64 QVAQGAYTYTGADGNPYSLQYTADEEGFKPSA 95 (135)
Q Consensus 64 ~~v~GsYsy~~pdG~~~~V~Y~ADenGf~~~g 95 (135)
.+.+|+|+|+..+-..+.+.++||.-|.+|..
T Consensus 79 nv~~gqFaFta~e~~~y~~Cf~~~~~~~~p~~ 110 (210)
T KOG1691|consen 79 NVTKGQFAFTAEESGMYEACFTADVPGHKPET 110 (210)
T ss_pred ccccceEEEEeccCCcEEEEEecccCCCCCCc
Confidence 34599999999999999999999988888744
No 8
>COG4786 FlgG Flagellar basal body rod protein [Cell motility and secretion]
Probab=39.83 E-value=50 Score=27.43 Aligned_cols=56 Identities=27% Similarity=0.383 Sum_probs=38.1
Q ss_pred EEEeeeeccCCCCCcccceEEEEEEEECCCC-CEEEE--EEEeCCCCccccCCCCCCCC
Q psy8079 47 AQESGFLKNAGNPEAEVQVAQGAYTYTGADG-NPYSL--QYTADEEGFKPSANYLPTPP 102 (135)
Q Consensus 47 ~~E~G~~~~~G~~~~~~~~v~GsYsy~~pdG-~~~~V--~Y~ADenGf~~~g~~lP~~p 102 (135)
.-+.|.++..+.+-|-.+.-.|+|....||| ..||= .++-|++||-+..+-.|+-+
T Consensus 80 ~~tqGsl~~T~n~lD~AI~g~gfF~I~~~dG~~~YTR~G~F~~d~~G~LVT~~G~~vl~ 138 (265)
T COG4786 80 VFTQGSLQKTDNPLDLAITGDGFFQIQTPDGTIAYTRDGSFTVDEEGQLVTSNGYPVLD 138 (265)
T ss_pred eccccCceecCCccceEEcCCceEEEEcCCCCEEEeeCCceeECCCCCEEeCCCCCccC
Confidence 4456777766664455566678899999999 55552 56668888888776555544
No 9
>PRK05461 apaG CO2+/MG2+ efflux protein ApaG; Reviewed
Probab=36.05 E-value=85 Score=23.05 Aligned_cols=21 Identities=24% Similarity=0.499 Sum_probs=16.9
Q ss_pred ccceEEEEEEEECCCCCEEEE
Q psy8079 62 EVQVAQGAYTYTGADGNPYSL 82 (135)
Q Consensus 62 ~~~~v~GsYsy~~pdG~~~~V 82 (135)
....++|+|.+++++|....|
T Consensus 93 p~G~M~G~y~~~~~~G~~F~v 113 (127)
T PRK05461 93 PSGTMQGHYQMVDEDGERFEV 113 (127)
T ss_pred CCEEEEEEEEEEeCCCCEEEE
Confidence 446899999999999986543
No 10
>PF09244 DUF1964: Domain of unknown function (DUF1964); InterPro: IPR015325 This domain is C-terminal to the catalytic sucrose phosphorylase beta/alpha barrel domain. It adopts a beta-sandwich fold, with Greek-key topology and is functionally uncharacterised []. ; PDB: 1R7A_B 2GDU_A 2GDV_A.
Probab=34.87 E-value=42 Score=22.23 Aligned_cols=21 Identities=14% Similarity=0.504 Sum_probs=14.7
Q ss_pred EEEEEECCCCCEEEEEEEeCC
Q psy8079 68 GAYTYTGADGNPYSLQYTADE 88 (135)
Q Consensus 68 GsYsy~~pdG~~~~V~Y~ADe 88 (135)
|.|||--.++..++.+|++.+
T Consensus 4 G~FSy~~dgdtSitf~W~g~~ 24 (68)
T PF09244_consen 4 GEFSYEADGDTSITFTWTGAT 24 (68)
T ss_dssp SEEEEEEETTTEEEEEEE-SS
T ss_pred ceeeEecCCCcEEEEEEeccc
Confidence 667777777777777887764
No 11
>TIGR01643 YD_repeat_2x YD repeat (two copies). This model describes two tandem copies of a 21-residue extracellular repeat found in Gram-negative, Gram-positive, and animal proteins. The repeat is named for a YD dipeptide, the most strongly conserved motif of the repeat. These repeats appear in general to be involved in binding carbohydrate; the chicken teneurin-1 YD-repeat region has been shown to bind heparin.
Probab=34.14 E-value=71 Score=17.88 Aligned_cols=21 Identities=33% Similarity=0.379 Sum_probs=13.2
Q ss_pred EEECCCCCEEEEEEEeCCCCccc
Q psy8079 71 TYTGADGNPYSLQYTADEEGFKP 93 (135)
Q Consensus 71 sy~~pdG~~~~V~Y~ADenGf~~ 93 (135)
+.++|+|...+..| |..|-..
T Consensus 9 ~~~~p~G~~~~~~Y--D~~Grl~ 29 (42)
T TIGR01643 9 GSTDADGTTTRYTY--DAAGRLV 29 (42)
T ss_pred EEECCCCCEEEEEE--CCCCCEE
Confidence 35678888766666 5555443
No 12
>cd07240 ED_TypeI_classII_N N-terminal domain of type I, class II extradiol dioxygenases; non-catalytic domain. This family contains the N-terminal, non-catalytic, domain of type I, class II extradiol dioxygenases. Dioxygenases catalyze the incorporation of both atoms of molecular oxygen into substrates using a variety of reaction mechanisms, resulting in the cleavage of aromatic rings. Two major groups of dioxygenases have been identified according to the cleavage site; extradiol enzymes cleave the aromatic ring between a hydroxylated carbon and an adjacent non-hydroxylated carbon, whereas intradiol enzymes cleave the aromatic ring between two hydroxyl groups. Extradiol dioxygenases are classified into type I and type II enzymes. Type I extradiol dioxygenases include class I and class II enzymes. These two classes of enzymes show sequence similarity; the two-domain class II enzymes evolved from a class I enzyme through gene duplication. The extradiol dioxygenases represented in this fa
Probab=32.78 E-value=69 Score=20.92 Aligned_cols=20 Identities=25% Similarity=0.413 Sum_probs=15.8
Q ss_pred EEEEECCCCCEEEEEEEeCC
Q psy8079 69 AYTYTGADGNPYSLQYTADE 88 (135)
Q Consensus 69 sYsy~~pdG~~~~V~Y~ADe 88 (135)
++.+.||||..+.|.+..|+
T Consensus 98 ~~~~~DP~G~~ie~~~~~~~ 117 (117)
T cd07240 98 GLRFQDPDGHLLELFVEADK 117 (117)
T ss_pred EEEEECCCCCEEEEEEccCC
Confidence 45688999999998876553
No 13
>PF12118 SprA-related: SprA-related family; InterPro: IPR021973 Most members of this uncharacterised protein family are annotated as being SprA-related.
Probab=31.21 E-value=75 Score=26.74 Aligned_cols=17 Identities=41% Similarity=0.968 Sum_probs=13.1
Q ss_pred EEEEE-CCCCCEEEEEEEeCC
Q psy8079 69 AYTYT-GADGNPYSLQYTADE 88 (135)
Q Consensus 69 sYsy~-~pdG~~~~V~Y~ADe 88 (135)
+|+|. +|||+.|- |.+|
T Consensus 175 sy~y~~GpDg~~Ya---vgGe 192 (294)
T PF12118_consen 175 SYSYQRGPDGKRYA---VGGE 192 (294)
T ss_pred ceeeeeCCCCceee---ccce
Confidence 67775 99999888 5556
No 14
>cd07252 BphC1-RGP6_N_like N-terminal domain of 2,3-dihydroxybiphenyl 1,2-dioxygenase (BphC, EC 1.13.11.39) 1 from Rhodococcus globerulus P6 (BphC1-RGP6) and similar proteins. This subfamily contains the N-terminal, non-catalytic, domain of BphC1-RGP6 and similar proteins. BphC catalyzes the extradiol ring cleavage reaction of 2,3-dihydroxybiphenyl, the third step in the polychlorinated biphenyls (PCBs) degradation pathway (bph pathway). This subfamily of BphCs belongs to the type I extradiol dioxygenase family, which require a metal in the active site in its catalytic mechanism. Polychlorinated biphenyl degrading bacteria demonstrate a multiplicity of 2,3-dihydroxybiphenyl 1,2-dioxygenases. For example, three types of BphC enzymes have been found in Rhodococcus globerulus (BphC1-RGP6 - BphC3-RGP6), all three enzymes are type I extradiol dioxygenases. BphC1-RGP6 has an internal duplication, it is a two-domain dioxygenase which forms octamers, and has Fe(II) at the catalytic site. Its N-
Probab=30.70 E-value=70 Score=21.58 Aligned_cols=21 Identities=29% Similarity=0.502 Sum_probs=16.6
Q ss_pred EEEEEECCCCCEEEEEEEeCC
Q psy8079 68 GAYTYTGADGNPYSLQYTADE 88 (135)
Q Consensus 68 GsYsy~~pdG~~~~V~Y~ADe 88 (135)
.++.+.||||..+.+.|.-+|
T Consensus 100 ~~~~~~DPdG~~iE~~~~~~~ 120 (120)
T cd07252 100 GLIRFADPDGNRHELFWGPKE 120 (120)
T ss_pred EEEEEECCCCCEEEEEecccC
Confidence 367899999999998876543
No 15
>PF05423 Mycobact_memb: Mycobacterium membrane protein; InterPro: IPR008693 This family contains several membrane proteins from Mycobacterium species [].
Probab=30.49 E-value=1e+02 Score=23.00 Aligned_cols=24 Identities=29% Similarity=0.592 Sum_probs=18.5
Q ss_pred EEEEEEEECCCCCEEEEEEEeCCCC
Q psy8079 66 AQGAYTYTGADGNPYSLQYTADEEG 90 (135)
Q Consensus 66 v~GsYsy~~pdG~~~~V~Y~ADenG 90 (135)
.+=.|+-.++.|...+|+|. |++|
T Consensus 51 k~V~YEV~G~~G~~~~I~Y~-D~~~ 74 (140)
T PF05423_consen 51 KTVTYEVTGPPGSTATISYL-DADG 74 (140)
T ss_pred cEEEEEEEcCCCCeEEEEEE-cCCC
Confidence 34468888899999999997 5544
No 16
>PHA02978 hypothetical protein; Provisional
Probab=29.71 E-value=46 Score=24.47 Aligned_cols=15 Identities=20% Similarity=0.525 Sum_probs=12.3
Q ss_pred EEEEECCCCCEEEEE
Q psy8079 69 AYTYTGADGNPYSLQ 83 (135)
Q Consensus 69 sYsy~~pdG~~~~V~ 83 (135)
.+||.+|||..++|+
T Consensus 76 y~sy~~~~gisiqvs 90 (135)
T PHA02978 76 YFSYADPDGISIQVS 90 (135)
T ss_pred EEEecCCCceEEEEe
Confidence 567899999988874
No 17
>PF05899 Cupin_3: Protein of unknown function (DUF861); InterPro: IPR008579 The function of the proteins in this entry are unknown. They contain the conserved barrel domain of the 'cupin' superfamily and members are specific to plants and bacteria.; PDB: 1RC6_A 3MYX_A 1O5U_A 2K9Z_A 1LKN_A 3ES4_A 1SFN_B 3BCW_A.
Probab=28.83 E-value=1.5e+02 Score=19.17 Aligned_cols=26 Identities=27% Similarity=0.422 Sum_probs=20.4
Q ss_pred ccceEEEEEEEECCCCCEEEEEEEeCCC
Q psy8079 62 EVQVAQGAYTYTGADGNPYSLQYTADEE 89 (135)
Q Consensus 62 ~~~~v~GsYsy~~pdG~~~~V~Y~ADen 89 (135)
-..++.|+-..++.+|.. +++.|++-
T Consensus 28 ~~~vleG~v~it~~~G~~--~~~~aGD~ 53 (74)
T PF05899_consen 28 FFYVLEGEVTITDEDGET--VTFKAGDA 53 (74)
T ss_dssp EEEEEEEEEEEEETTTEE--EEEETTEE
T ss_pred EEEEEEeEEEEEECCCCE--EEEcCCcE
Confidence 347899999999999996 55677653
No 18
>PF13511 DUF4124: Domain of unknown function (DUF4124)
Probab=27.30 E-value=54 Score=20.06 Aligned_cols=18 Identities=17% Similarity=0.486 Sum_probs=13.6
Q ss_pred EEEEEEECCCCCEEEEEEEeC
Q psy8079 67 QGAYTYTGADGNPYSLQYTAD 87 (135)
Q Consensus 67 ~GsYsy~~pdG~~~~V~Y~AD 87 (135)
..-|+|+|.+|+. .|+-.
T Consensus 13 a~vYk~~D~~G~v---~ysd~ 30 (60)
T PF13511_consen 13 AEVYKWVDENGVV---HYSDT 30 (60)
T ss_pred ccEEEEECCCCCE---EECcc
Confidence 4679999999984 55555
No 19
>PRK14818 NADH dehydrogenase subunit B; Provisional
Probab=25.46 E-value=56 Score=25.54 Aligned_cols=21 Identities=33% Similarity=0.634 Sum_probs=15.5
Q ss_pred CCCCCCCChHHHHHHHHHHHc
Q psy8079 97 YLPTPPPIPAEIVKALEFLAS 117 (135)
Q Consensus 97 ~lP~~pp~p~~i~~al~~~~~ 117 (135)
.+|-=||.|++|+.+|..+.+
T Consensus 133 yIPGCPP~PeaIl~gil~L~~ 153 (173)
T PRK14818 133 YVPGCPPRPEALTEGLLRLQE 153 (173)
T ss_pred EccCCCCCHHHHHHHHHHHHH
Confidence 566679999999888754443
No 20
>PF10929 DUF2811: Protein of unknown function (DUF2811); InterPro: IPR021231 This is a bacterial family of uncharacterised proteins.
Probab=25.28 E-value=73 Score=20.56 Aligned_cols=20 Identities=30% Similarity=0.772 Sum_probs=16.4
Q ss_pred CChHHHHHHHH-HHHcCCCCC
Q psy8079 103 PIPAEIVKALE-FLASQPKID 122 (135)
Q Consensus 103 p~p~~i~~al~-~~~~~p~~~ 122 (135)
.||+.+.+++. ||.+||.=+
T Consensus 5 eiPe~L~~~m~~fie~hP~WD 25 (57)
T PF10929_consen 5 EIPEDLHQAMKDFIETHPNWD 25 (57)
T ss_pred cccHHHHHHHHHHHHcCCCch
Confidence 47888888987 999999754
No 21
>PF03562 MltA: MltA specific insert domain; InterPro: IPR005300 This group of proteins includes MltA; a membrane-bound, murein degrading transglycosylase enzyme which plays an important role in the controlled growth of the stress-bearing sacculus of Escherichia coli [, ].; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds; PDB: 2PI8_D 2AE0_X 2PIC_A 2GAE_A 2PJJ_A 3CZB_A 2G6G_A 2PNW_A 2G5D_A.
Probab=24.43 E-value=53 Score=25.15 Aligned_cols=54 Identities=17% Similarity=0.308 Sum_probs=33.1
Q ss_pred ceEEEEEEEECCCCCEEEEEEEeCCCC--ccccCC------CCCCCCCChHHHHHHHHHHHcCCCC
Q psy8079 64 QVAQGAYTYTGADGNPYSLQYTADEEG--FKPSAN------YLPTPPPIPAEIVKALEFLASQPKI 121 (135)
Q Consensus 64 ~~v~GsYsy~~pdG~~~~V~Y~ADenG--f~~~g~------~lP~~pp~p~~i~~al~~~~~~p~~ 121 (135)
+.|+||=.-..|||....|.|- +.|| |+..|. .|+...- ..+...+|+++||.+
T Consensus 82 lqIQGSGrl~l~DG~~~rvgYA-g~NG~pY~sIGr~Li~~G~i~~~~~---Smq~Ir~wl~~~P~~ 143 (158)
T PF03562_consen 82 LQIQGSGRLRLPDGSTVRVGYA-GQNGHPYTSIGRLLIDRGEIPREQM---SMQAIRAWLRAHPEE 143 (158)
T ss_dssp HHHHSEEEEE-TTSSEEEEEEE-EE-SS----HHHHHHHTTSS-TTS----SHHHHHHHHHHTGGG
T ss_pred eeecCcEEEEcCCCCEEEEEEc-ccCCCccccHHHHHHHcCCcChhhC---CHHHHHHHHHHCHHH
Confidence 5688999999999999999995 4566 454443 3432211 133345799999875
No 22
>COG3422 Uncharacterized conserved protein [Function unknown]
Probab=23.02 E-value=1.1e+02 Score=19.93 Aligned_cols=20 Identities=20% Similarity=0.507 Sum_probs=15.5
Q ss_pred cccCCCCCeeEEEecCCCce
Q psy8079 27 TQVNPDGSYQYSFQTSNGIA 46 (135)
Q Consensus 27 ~~~~~dG~Y~f~Ye~~dG~~ 46 (135)
-..+.+|.|.|.+..+||..
T Consensus 7 i~kdk~Ge~rfrlkA~N~ei 26 (59)
T COG3422 7 IYKDKAGEYRFRLKAANGEI 26 (59)
T ss_pred EEEcCCCcEEEEEEccCccE
Confidence 34578899999999888754
No 23
>TIGR03795 chp_BMA0021 conserved hypothetical protein, BMA_0021 family. Members of this protein family are found sparsely, mostly in members of the genus Burkholderia. Members often occur as tandem homologous genes, such as BMA_0021 and BMA_0022 in Burkholderia mallei ATCC 23344. The genes regularly are encoded near the so-called docking protein of TOMM (thiazole/oxazole-modified microcins) biosynthetic clusters, suggesting a role in bacteriocin biosynthesis. The function is unknown.
Probab=22.47 E-value=1.4e+02 Score=21.69 Aligned_cols=23 Identities=30% Similarity=0.439 Sum_probs=13.1
Q ss_pred CCCCCCChHHHHHHH-HHHHcCCC
Q psy8079 98 LPTPPPIPAEIVKAL-EFLASQPK 120 (135)
Q Consensus 98 lP~~pp~p~~i~~al-~~~~~~p~ 120 (135)
||.+|..-+...-|| +|+.+|+.
T Consensus 89 iP~aP~~~~~~a~ALAayn~~~~~ 112 (114)
T TIGR03795 89 IPPAPEHLAEEAIALAAYNDAGPA 112 (114)
T ss_pred cCCCCCccchHHHHHHHHhccCcc
Confidence 777655433232344 48888874
No 24
>PF00659 POLO_box: POLO box duplicated region; InterPro: IPR000959 A subgroup of serine/threonine protein kinases, Polo or Polo-like kinases play multiple roles during the cell cycle. Polo kinases are required at several key points through mitosis, starting from control of the G2/M transition through phosphorylation of Cdc25C and mitotic cyclins. Polo kinases are characterised by an amino terminal catalytic domain, and a carboxy terminal non-catalytic domain consisting of three blocks of conserved sequences known as polo boxes which form one single functional domain []. The domain is named after its founding member encoded by the polo gene of Drosophila melanogaster []. This domain of around 70 amino acids has been found in species ranging from yeast to mammals. Polo boxes appear to mediate interaction with multiple proteins through protein:protein interactions; some but not all of these proteins are substrates for the kinase domain of the molecule []. The crystal structure of the polo domain of the murine protein, Sak, is dimeric, consisting of two alpha-helices and two six-stranded beta-sheets []. The topology of one polypeptide subunit of the dimer consists of, from its N- to C terminus, an extended strand segment, five beta-strands, one alpha-helix (A) and a C-terminal beta-strand. Beta-strands from one subunit form a contiguous antiparallel beta-sheet with beta-strands from the second subunit. The two beta-sheets pack with a crossing angle of 110 degrees, orienting the hydrophobic surfaces inward and the hydrophilic surfaces outward. Helix A, which is colinear with beta-strand 6 of the same polypeptide, buries a large portion of the non-overlapping hydrophobic beta-sheet surfaces. Interactions involving helices A comprise a majority of the hydrophobic core structure and also the dimer interface. Point mutations in the Polo box of the budding yeast Cdc5 protein abolish the ability of overexpressed Cdc5 to interact with the spindle poles and to organise cytokinetic structures [].; GO: 0005515 protein binding; PDB: 1MBY_B 3P37_A 3MHN_A 1Q4K_A 3HIK_A 3Q1I_A 3P35_A 3MHQ_A 1UMW_B 3MQ8_B ....
Probab=22.22 E-value=2.1e+02 Score=17.89 Aligned_cols=13 Identities=38% Similarity=0.646 Sum_probs=11.0
Q ss_pred CeeEEEecCCCce
Q psy8079 34 SYQYSFQTSNGIA 46 (135)
Q Consensus 34 ~Y~f~Ye~~dG~~ 46 (135)
.|.+.|..+||+.
T Consensus 4 k~gi~~~LSng~v 16 (68)
T PF00659_consen 4 KYGIGYQLSNGTV 16 (68)
T ss_dssp CSEEEEEETTSEE
T ss_pred ceEEEEEEeCCCE
Confidence 4789999999984
No 25
>cd08344 MhqB_like_N N-terminal domain of MhqB, a type I extradiol dioxygenase, and similar proteins. This subfamily contains the N-terminal, non-catalytic, domain of Burkholderia sp. NF100 MhqB and similar proteins. MhqB is a type I extradiol dioxygenase involved in the catabolism of methylhydroquinone, an intermediate in the degradation of fenitrothion. The purified enzyme has shown extradiol ring cleavage activity toward 3-methylcatechol. Fe2+ was suggested as a cofactor, the same as most other enzymes in the family. Burkholderia sp. NF100 MhqB is encoded on the plasmid pNF1. The type I family of extradiol dioxygenases contains two structurally homologous barrel-shaped domains at the N- and C-terminal. The active-site metal is located in the C-terminal barrel and plays an essential role in the catalytic mechanism.
Probab=22.12 E-value=1.1e+02 Score=20.28 Aligned_cols=17 Identities=18% Similarity=0.335 Sum_probs=14.0
Q ss_pred EEEEECCCCCEEEEEEE
Q psy8079 69 AYTYTGADGNPYSLQYT 85 (135)
Q Consensus 69 sYsy~~pdG~~~~V~Y~ 85 (135)
++.+.||||..+.+.+-
T Consensus 93 ~~~~~DP~Gn~iel~~~ 109 (112)
T cd08344 93 GVWFRDPDGNLLQVKVA 109 (112)
T ss_pred EEEEECCCCCEEEEecC
Confidence 47799999999998753
No 26
>PF09242 FCSD-flav_bind: Flavocytochrome c sulphide dehydrogenase, flavin-binding; InterPro: IPR015323 This entry represents the flavin-binding domain of flavocytochrome c sulphide dehydrogenase (FCSD), enzymes found in sulphur-oxidising bacteria such as the purple phototrophic bacteria Chromatium vinosum [, ]. These enzymes are complexes of flavoprotein and a dihaem cytochrome that carry out hydrogen sulphide-dependent cytochrome C reduction. The dihaem cytochrome folds into two domains, each of which resembles mitochondrial cytochrome c, with the two haem groups bound to the interior of the subunit. The flavoprotein subunit has a glutathione reductase-like fold consisting of a beta(3,4)-alpha(3) core, and an alpha+beta sandwich. The active site of the flavoprotein subunit contains a catalytically important disulphide bridge located above the pyrimidine portion of the flavin ring []. Electrons are transferred from the flavin to one of the haem groups in the cytochrome. This entry represents a flavoprotein domain required for binding to flavin, and subsequent electron transfer. ; PDB: 1FCD_A.
Probab=21.91 E-value=92 Score=20.44 Aligned_cols=29 Identities=17% Similarity=0.211 Sum_probs=19.6
Q ss_pred EEEEEEEECCC-CCEEEEEEEeCCCCcccc
Q psy8079 66 AQGAYTYTGAD-GNPYSLQYTADEEGFKPS 94 (135)
Q Consensus 66 v~GsYsy~~pd-G~~~~V~Y~ADenGf~~~ 94 (135)
+.==||+++|| |..+.-.|..++.++...
T Consensus 7 ~NtCYSlv~p~~aIsVa~vy~~~~~~i~~v 36 (71)
T PF09242_consen 7 ANTCYSLVAPDYAISVAGVYRYDDGKIVEV 36 (71)
T ss_dssp EEEEEEEEETTEEEEEEEEEEE-TTTEEE-
T ss_pred ceeEEEEeCCCceEEEEEEEEECCCeEEEe
Confidence 44468888886 777777888888775443
No 27
>PF10681 Rot1: Chaperone for protein-folding within the ER, fungal; InterPro: IPR019623 This conserved fungal family is an essential molecular chaperone in the endoplasmic reticulum. Molecular chaperones transiently interact with unfolded proteins to inhibit their self-aggregation and to support their folding and/or assembly. Rot1 is a general chaperone with some substrate specificity, its substrates being the structurally unrelated Kre5 Kre6 Big1 Atg22, which are type I, type II, and polytopic membrane proteins. The dependencies of each for Rot1 do not share similarities. However, their folding does require BiP, and one of these proteins was simultaneously associated with both Rot1 and BiP. In addition, Rot1 may cooperate with BiP/Kar2 in the folding of Kre6 [].
Probab=20.06 E-value=3.8e+02 Score=21.66 Aligned_cols=60 Identities=23% Similarity=0.409 Sum_probs=31.8
Q ss_pred ccCCCCCeeEEEecCCCceEEEeeeeccCCCCC------cccceEEEEEEEECCCCCEEEEEEEeCCCCcccc
Q psy8079 28 QVNPDGSYQYSFQTSNGIAAQESGFLKNAGNPE------AEVQVAQGAYTYTGADGNPYSLQYTADEEGFKPS 94 (135)
Q Consensus 28 ~~~~dG~Y~f~Ye~~dG~~~~E~G~~~~~G~~~------~~~~~v~GsYsy~~pdG~~~~V~Y~ADenGf~~~ 94 (135)
+.+.-| .+|+| +.||-..+.. ....+++. ..-+-++|.|+.. ++|.+...=+..| |.|..
T Consensus 28 eP~~~G-ISYSF-T~DG~~EeA~--Yr~~~Np~~p~C~~a~l~wQHGtY~l~-~nGsl~L~P~~~D--GrQl~ 93 (212)
T PF10681_consen 28 EPSLPG-ISYSF-TEDGYFEEAQ--YRVTSNPTNPSCPTAVLIWQHGTYELN-SNGSLTLTPFAVD--GRQLV 93 (212)
T ss_pred CCCCCc-eeEEE-cCCCeeeEEE--EEEccCCCCCCCCceEEEEecceEEEC-CCCcEEEeecCCC--Cceec
Confidence 444555 55555 8888553333 33333321 1223568999865 6676665555444 44443
Done!