Query psy10510
Match_columns 91
No_of_seqs 84 out of 86
Neff 3.5
Searched_HMMs 46136
Date Fri Aug 16 20:04:27 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy10510.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/10510hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF04845 PurA: PurA ssDNA and 100.0 4.2E-29 9E-34 191.3 8.4 73 19-91 84-159 (218)
2 smart00712 PUR DNA/RNA-binding 99.9 6.7E-27 1.4E-31 149.3 7.7 60 20-80 4-63 (63)
3 KOG3074|consensus 99.9 4.6E-26 9.9E-31 177.9 6.9 68 22-89 107-179 (263)
4 PF04845 PurA: PurA ssDNA and 99.9 1.8E-22 3.8E-27 154.9 8.0 69 20-89 6-74 (218)
5 KOG3074|consensus 99.9 6.7E-22 1.4E-26 154.7 6.9 63 21-85 28-90 (263)
6 PF11680 DUF3276: Protein of u 97.9 7E-05 1.5E-09 53.9 7.4 63 19-81 13-83 (124)
7 PF11950 DUF3467: Protein of u 76.6 4.5 9.7E-05 26.9 3.6 29 54-82 58-86 (92)
8 PF03703 bPH_2: Bacterial PH d 58.7 36 0.00078 20.1 5.2 62 12-73 10-78 (80)
9 COG2051 RPS27A Ribosomal prote 57.5 6.6 0.00014 25.9 1.3 10 37-46 12-21 (67)
10 PF08814 XisH: XisH protein; 55.9 28 0.0006 25.5 4.5 51 22-81 28-84 (135)
11 PF10384 Scm3: Centromere prot 49.9 23 0.00051 22.2 2.9 20 63-82 9-28 (58)
12 PHA01746 hypothetical protein 47.9 38 0.00081 24.8 4.1 54 26-79 30-90 (131)
13 PF13691 Lactamase_B_4: tRNase 47.0 18 0.00038 23.0 2.0 20 19-38 13-33 (63)
14 cd04096 eEF2_snRNP_like_C eEF2 44.5 28 0.0006 21.5 2.6 40 36-77 23-64 (80)
15 smart00732 YqgFc Likely ribonu 40.2 86 0.0019 19.2 4.4 16 64-79 35-50 (99)
16 cd04097 mtEFG1_C mtEFG1_C: C-t 39.3 34 0.00073 21.1 2.4 40 36-77 23-62 (78)
17 smart00838 EFG_C Elongation fa 39.2 26 0.00056 21.9 1.9 42 36-79 25-66 (85)
18 PLN02994 1-aminocyclopropane-1 36.1 31 0.00067 24.7 2.1 19 61-79 90-108 (153)
19 PF09186 DUF1949: Domain of un 33.3 67 0.0015 18.0 2.9 21 55-75 32-52 (56)
20 PF12651 RHH_3: Ribbon-helix-h 32.9 36 0.00079 19.8 1.7 12 68-79 32-43 (44)
21 CHL00100 ilvH acetohydroxyacid 31.7 1.2E+02 0.0025 22.6 4.6 32 52-83 42-75 (174)
22 PF11065 DUF2866: Protein of u 31.3 1.3E+02 0.0027 19.8 4.2 39 36-74 13-53 (65)
23 cd03710 BipA_TypA_C BipA_TypA_ 31.2 36 0.00077 21.2 1.6 42 35-77 22-63 (79)
24 cd03709 lepA_C lepA_C: This fa 30.2 49 0.0011 20.7 2.1 41 36-77 23-64 (80)
25 PF05326 SVA: Seminal vesicle 29.6 77 0.0017 22.9 3.3 25 26-50 92-116 (124)
26 cd04870 ACT_PSP_1 CT domains f 28.6 85 0.0018 19.0 3.0 25 57-81 44-69 (75)
27 TIGR03422 mito_frataxin fratax 28.1 48 0.001 22.5 1.9 19 7-25 30-52 (97)
28 PF13601 HTH_34: Winged helix 27.8 1.5E+02 0.0032 18.9 4.1 35 42-76 43-77 (80)
29 PF07944 DUF1680: Putative gly 27.6 1.4E+02 0.0031 25.0 4.9 38 22-63 456-493 (520)
30 PHA03069 DNA-binding protein; 26.7 41 0.00089 24.4 1.4 16 29-47 18-33 (119)
31 PHA03244 large tegument protei 25.3 1.2E+02 0.0026 26.5 4.2 34 12-48 246-279 (478)
32 PF00797 Acetyltransf_2: N-ace 25.0 80 0.0017 22.9 2.7 19 18-36 88-106 (240)
33 cd04875 ACT_F4HF-DF N-terminal 24.6 1.3E+02 0.0028 18.0 3.2 24 57-80 46-71 (74)
34 cd03713 EFG_mtEFG_C EFG_mtEFG_ 24.5 67 0.0015 19.6 2.0 40 36-77 23-62 (78)
35 cd00201 WW Two conserved trypt 24.2 96 0.0021 15.5 2.3 16 22-37 7-22 (31)
36 PRK00415 rps27e 30S ribosomal 23.5 51 0.0011 21.1 1.3 11 36-46 3-13 (59)
37 PF11774 Lsr2: Lsr2 ; InterPr 23.1 1.2E+02 0.0026 20.9 3.2 26 53-78 28-53 (110)
38 PF09582 AnfO_nitrog: Iron onl 23.0 83 0.0018 23.9 2.6 22 60-81 43-64 (202)
39 smart00456 WW Domain with 2 co 22.7 1.1E+02 0.0023 15.7 2.3 16 22-37 8-23 (32)
40 PF08612 Med20: TATA-binding r 22.0 2.4E+02 0.0052 21.1 4.8 51 13-82 140-190 (225)
41 COG3533 Uncharacterized protei 21.9 1.3E+02 0.0028 27.0 3.8 46 14-64 428-478 (589)
42 PF02244 Propep_M14: Carboxype 21.8 1.5E+02 0.0033 17.7 3.2 22 53-74 33-54 (74)
43 cd03711 Tet_C Tet_C: C-terminu 21.8 84 0.0018 19.3 2.0 39 36-76 23-61 (78)
44 cd05687 S1_RPS1_repeat_ec1_hs1 21.8 71 0.0015 18.7 1.6 30 19-49 5-34 (70)
45 PRK13738 conjugal transfer pil 20.8 1.2E+02 0.0026 23.4 3.0 28 26-53 163-207 (209)
46 PF13333 rve_2: Integrase core 20.7 1.1E+02 0.0024 17.7 2.3 19 63-81 18-36 (52)
47 PLN00064 photosystem II protei 20.6 50 0.0011 25.1 0.9 28 55-82 78-111 (166)
48 PF01491 Frataxin_Cyay: Fratax 20.1 83 0.0018 21.4 1.9 20 6-25 32-55 (109)
No 1
>PF04845 PurA: PurA ssDNA and RNA-binding protein; InterPro: IPR006628 The Pur protein family consists of four known members in humans and is strongly conserved throughout evolution. Pur-alpha is a highly conserved, sequence-specific DNA- and RNA-binding protein involved in diverse cellular and viral functions including transcription, replication, and cell growth. Pur-alpha has a modular structure with alternating three basic aromatic class I and two acidic leucine-rich class II repeats in the central region of the protein []. In addition to its involved in basic cellular function, Pur-alpha, has been implicated in the development of blood cells and cells of the central nervous system; it has also been implicated in the inhibition of oncogenic transformation and along with Pur-beta in myelodysplastic syndrome progressing to acute myelogenous leukemia. Pur-alpha can influence viral interaction through functional associations, for example with the Tat protein and TAR RNA of HIV-1, and with large T-antigen and DNA regulatory regions of JC virus. JC virus causes opportunistic infections in the brains of certain HIV-1-infected individuals [].; PDB: 3K44_A.
Probab=99.96 E-value=4.2e-29 Score=191.31 Aligned_cols=73 Identities=49% Similarity=0.706 Sum_probs=57.3
Q ss_pred EeEeeeeCceEEEEEcccCCCcceEEEEEeccCC--CCcceEEEecchHHHHHHHHHHHHHHhcCCC-CCCCCCCC
Q psy10510 19 VAVVLTKPILVYCTVVLTKPILVYCTVSQTIPRG--GPRSQIAIPAQGMIEFRDALTDLLEEFGTED-GGYKGELP 91 (91)
Q Consensus 19 ~~~v~~~e~KrFY~DlkeN~RGRFLRIsE~~~rG--g~RsqIalPaqg~~EFRdaL~dlie~yg~~~-~~~~~~Lp 91 (91)
.++++..++|+||||||||+|||||||||+.++| |+|+||+||||||+||+|||++|||+||.++ ++++++||
T Consensus 84 kSe~i~~e~r~yylDLKEN~RGrfLRisq~~~~~~~~~R~qIaiPa~G~~ef~~al~~lid~~g~~~~~~~~~~Lp 159 (218)
T PF04845_consen 84 KSEKINVENRRYYLDLKENSRGRFLRISQTVSRGPRGRRSQIAIPAQGMIEFRDALTDLIDEFGEDDEEDEQGELP 159 (218)
T ss_dssp EEEEEEETTEEEEEEEEEETTEEEEEEEEEETTT----EEEEEEEGGGHHHHHHHHHHHHHHH-------------
T ss_pred eehheeeccceEEEEcccccCcceEEEEEeeccCCCCCccEEEEehHHHHHHHHHHHHHHHHhcCcccccccccCC
Confidence 5789999999999999999999999999999888 8999999999999999999999999999987 45777787
No 2
>smart00712 PUR DNA/RNA-binding repeats in PUR-alpha/beta/gamma and in hypothetical proteins from spirochetes and the Bacteroides-Cytophaga-Flexibacter bacteria.
Probab=99.94 E-value=6.7e-27 Score=149.28 Aligned_cols=60 Identities=33% Similarity=0.448 Sum_probs=55.6
Q ss_pred eEeeeeCceEEEEEcccCCCcceEEEEEeccCCCCcceEEEecchHHHHHHHHHHHHHHhc
Q psy10510 20 AVVLTKPILVYCTVVLTKPILVYCTVSQTIPRGGPRSQIAIPAQGMIEFRDALTDLLEEFG 80 (91)
Q Consensus 20 ~~v~~~e~KrFY~DlkeN~RGRFLRIsE~~~rGg~RsqIalPaqg~~EFRdaL~dlie~yg 80 (91)
+++|++++|+||||||+|.||+||||||+... ++|++|+||+++|.+|+|+|++++|+|+
T Consensus 4 Sk~l~~~~k~fyfDvk~N~rG~fLrIsE~~~~-~~r~~I~lp~~~~~~F~~~l~~~~~~~~ 63 (63)
T smart00712 4 SKKLTVENKRFYFDVKENRRGRFLRISEVKNN-GGRSSITVPEQGAAEFRDALNKLIEKYG 63 (63)
T ss_pred ccEEEeCCcEEEEEecccCCccEEEEEEecCC-CCceEEEEEHHHHHHHHHHHHHHHHhhC
Confidence 56899999999999999999999999999753 3499999999999999999999999986
No 3
>KOG3074|consensus
Probab=99.93 E-value=4.6e-26 Score=177.94 Aligned_cols=68 Identities=40% Similarity=0.513 Sum_probs=61.1
Q ss_pred eeeeCceEEEEEcccCCCcceEEEEEeccCCC-Ccce---EEEecchHHHHHHHHHHHHHHhcCCCC-CCCCC
Q psy10510 22 VLTKPILVYCTVVLTKPILVYCTVSQTIPRGG-PRSQ---IAIPAQGMIEFRDALTDLLEEFGTEDG-GYKGE 89 (91)
Q Consensus 22 v~~~e~KrFY~DlkeN~RGRFLRIsE~~~rGg-~Rsq---IalPaqg~~EFRdaL~dlie~yg~~~~-~~~~~ 89 (91)
-|..++++||||||||.|||||||+|+..+|+ +|+| |+||+|||+||||+|++|+|+||..++ .+.+|
T Consensus 107 ~L~~d~RkfY~DLKEN~RGRFLRI~qt~~rg~~~~~Q~~tIalPaqG~aEfrdaLaelle~~G~~~~gqelpe 179 (263)
T KOG3074|consen 107 ELQRDNRKFYLDLKENQRGRFLRISQTPMRGRQTRTQVQTIALPAQGIAEFRDALAELLEDFGEGDEGQELPE 179 (263)
T ss_pred HHhhcCceEEEEcccccccceEEeeecccCCCCccccccceeccchhHHHHHHHHHHHHHHhCCccccCCCCC
Confidence 45679999999999999999999999998877 6777 999999999999999999999999988 44443
No 4
>PF04845 PurA: PurA ssDNA and RNA-binding protein; InterPro: IPR006628 The Pur protein family consists of four known members in humans and is strongly conserved throughout evolution. Pur-alpha is a highly conserved, sequence-specific DNA- and RNA-binding protein involved in diverse cellular and viral functions including transcription, replication, and cell growth. Pur-alpha has a modular structure with alternating three basic aromatic class I and two acidic leucine-rich class II repeats in the central region of the protein []. In addition to its involved in basic cellular function, Pur-alpha, has been implicated in the development of blood cells and cells of the central nervous system; it has also been implicated in the inhibition of oncogenic transformation and along with Pur-beta in myelodysplastic syndrome progressing to acute myelogenous leukemia. Pur-alpha can influence viral interaction through functional associations, for example with the Tat protein and TAR RNA of HIV-1, and with large T-antigen and DNA regulatory regions of JC virus. JC virus causes opportunistic infections in the brains of certain HIV-1-infected individuals [].; PDB: 3K44_A.
Probab=99.87 E-value=1.8e-22 Score=154.91 Aligned_cols=69 Identities=19% Similarity=0.246 Sum_probs=57.4
Q ss_pred eEeeeeCceEEEEEcccCCCcceEEEEEeccCCCCcceEEEecchHHHHHHHHHHHHHHhcCCCCCCCCC
Q psy10510 20 AVVLTKPILVYCTVVLTKPILVYCTVSQTIPRGGPRSQIAIPAQGMIEFRDALTDLLEEFGTEDGGYKGE 89 (91)
Q Consensus 20 ~~v~~~e~KrFY~DlkeN~RGRFLRIsE~~~rGg~RsqIalPaqg~~EFRdaL~dlie~yg~~~~~~~~~ 89 (91)
++.+..|+||||+|||||.||||+||+|+++ ||++|+|+|.++.++||||+|++|||.|++.++..+..
T Consensus 6 sk~l~iq~KrfYlDVkqN~RGrfiKiaE~~~-~g~ksri~lsm~~A~efrd~L~~f~~~ya~l~p~n~~~ 74 (218)
T PF04845_consen 6 SKMLQIQNKRFYLDVKQNRRGRFIKIAEVGA-GGRKSRITLSMSAAAEFRDHLGDFIEFYAQLGPSNPEN 74 (218)
T ss_dssp EEEEEETTEEEEEEEEEETTEEEEEEEEE-T-T--EEEEEEEHHHHHHHHHHHHHHHHHHHCS-S--SS-
T ss_pred hhheEecceEEEEEecccCcceeEEEeeccC-CCCcceEEEechhHHHHHHHHHHHHHHHhccCCCCccc
Confidence 3578889999999999999999999999985 68999999999999999999999999999987654433
No 5
>KOG3074|consensus
Probab=99.86 E-value=6.7e-22 Score=154.66 Aligned_cols=63 Identities=22% Similarity=0.262 Sum_probs=58.0
Q ss_pred EeeeeCceEEEEEcccCCCcceEEEEEeccCCCCcceEEEecchHHHHHHHHHHHHHHhcCCCCC
Q psy10510 21 VVLTKPILVYCTVVLTKPILVYCTVSQTIPRGGPRSQIAIPAQGMIEFRDALTDLLEEFGTEDGG 85 (91)
Q Consensus 21 ~v~~~e~KrFY~DlkeN~RGRFLRIsE~~~rGg~RsqIalPaqg~~EFRdaL~dlie~yg~~~~~ 85 (91)
+-+..|+||||||||||+|||||||+|++. +.||+|+||+++++||+|.|+++++.|+..++.
T Consensus 28 k~l~iq~KrfYlDvkqN~rGRflKIaE~g~--~~~Sri~ls~s~aaef~d~l~~f~~~ya~lg~~ 90 (263)
T KOG3074|consen 28 KRLQIQHKRFYLDVKQNPRGRFLKIAEVGA--GGRSRIILSLSVAAEFRDILNDFIEHYAQLGPP 90 (263)
T ss_pred hhhhhhhheEEEEeccCCCcceEEEEEecc--CCcceEEEehhhHHHHHHHHHHHHHHHHhcCCC
Confidence 456679999999999999999999999986 567999999999999999999999999998764
No 6
>PF11680 DUF3276: Protein of unknown function (DUF3276); InterPro: IPR021694 This bacterial family of proteins has no known function. ; PDB: 3NM7_C 3N8B_A.
Probab=97.89 E-value=7e-05 Score=53.85 Aligned_cols=63 Identities=14% Similarity=0.166 Sum_probs=49.1
Q ss_pred EeEeeeeCceEEEEEcccCCCcc-eEEEEEeccC---CC----CcceEEEecchHHHHHHHHHHHHHHhcC
Q psy10510 19 VAVVLTKPILVYCTVVLTKPILV-YCTVSQTIPR---GG----PRSQIAIPAQGMIEFRDALTDLLEEFGT 81 (91)
Q Consensus 19 ~~~v~~~e~KrFY~DlkeN~RGR-FLRIsE~~~r---Gg----~RsqIalPaqg~~EFRdaL~dlie~yg~ 81 (91)
.++++++.++.||||||+|..|. ||-|+|-... +| .|..|.|=..=+..|.++|.+.|+....
T Consensus 13 fSk~v~agkRtYFFDVK~~r~gD~yLtItESKk~~~~~g~~~f~khkI~LYkEDf~~F~~~L~e~i~~i~~ 83 (124)
T PF11680_consen 13 FSKVVKAGKRTYFFDVKENRKGDYYLTITESKKFTDGDGSFSFEKHKIFLYKEDFEKFLEGLDEAIDFIKK 83 (124)
T ss_dssp EEEE--SSSEEEEEEEEE-TTS-EEEEEEEEEE-TTS-----EEEEEEEEEGGGHHHHHHHHHHHHHHHHH
T ss_pred eeeeeecCCeeEEEEeecccCCCEEEEEEeecccCCCCcceeeeeeEEEEeHHHHHHHHHHHHHHHHHHHH
Confidence 46789999999999999999996 8999996521 23 5889999999999999999998887543
No 7
>PF11950 DUF3467: Protein of unknown function (DUF3467); InterPro: IPR021857 This entry is represented by Bacteriophage 92, 0rf53. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This family of proteins are functionally uncharacterised. This protein is found in bacteria, archaea and viruses. Proteins in this family are typically between 101 to 118 amino acids in length.
Probab=76.60 E-value=4.5 Score=26.85 Aligned_cols=29 Identities=17% Similarity=0.258 Sum_probs=25.5
Q ss_pred CcceEEEecchHHHHHHHHHHHHHHhcCC
Q psy10510 54 PRSQIAIPAQGMIEFRDALTDLLEEFGTE 82 (91)
Q Consensus 54 ~RsqIalPaqg~~EFRdaL~dlie~yg~~ 82 (91)
-.++|+++.+.+.++.++|.+-|++|-..
T Consensus 58 v~sRVimsP~~AKrL~~aL~~~l~~YE~~ 86 (92)
T PF11950_consen 58 VSSRVIMSPQHAKRLLKALQQNLQKYEQR 86 (92)
T ss_pred eEEEEEeCHHHHHHHHHHHHHHHHHHHHH
Confidence 46689999999999999999999998544
No 8
>PF03703 bPH_2: Bacterial PH domain; InterPro: IPR005182 A domain that is found in uncharacterised family of membrane proteins. 1-3 copies found in each protein, with each copy flanked by transmembrane helices.
Probab=58.65 E-value=36 Score=20.09 Aligned_cols=62 Identities=15% Similarity=0.236 Sum_probs=42.1
Q ss_pred CceEEEEEeEeeee-----CceEEEEEcccCCCcceEEEEEec--cCCCCcceEEEecchHHHHHHHHH
Q psy10510 12 EGTLTIRVAVVLTK-----PILVYCTVVLTKPILVYCTVSQTI--PRGGPRSQIAIPAQGMIEFRDALT 73 (91)
Q Consensus 12 ~~~~~~~~~~v~~~-----e~KrFY~DlkeN~RGRFLRIsE~~--~rGg~RsqIalPaqg~~EFRdaL~ 73 (91)
+..|.++-|..-+. -+|.=.+.+++|+-.|.+++..+. ..|+....+.||.-...+.++..+
T Consensus 10 ~~~l~i~~G~~~~~~~~i~~~~Iq~v~~~q~~~~r~~g~~~i~i~~~~~~~~~~~i~~~~~~~a~~i~~ 78 (80)
T PF03703_consen 10 DDRLIIRSGLFSKRTTIIPLDRIQSVSIKQNPLQRLFGLGTIKIDTAGGSGEKIEIPFLSIEDAEEIYD 78 (80)
T ss_pred CCEEEEEECeEEEEEEEEEhhHeEEEEEEcCHHHHhCccEEEEEEECCCCCceeEEecCCHHHHHHHHh
Confidence 44555555443222 345567899999999999998765 444555558899888888876543
No 9
>COG2051 RPS27A Ribosomal protein S27E [Translation, ribosomal structure and biogenesis]
Probab=57.46 E-value=6.6 Score=25.89 Aligned_cols=10 Identities=20% Similarity=0.308 Sum_probs=9.1
Q ss_pred CCCcceEEEE
Q psy10510 37 KPILVYCTVS 46 (91)
Q Consensus 37 N~RGRFLRIs 46 (91)
|+||+||||.
T Consensus 12 ~p~s~Fl~Vk 21 (67)
T COG2051 12 EPRSRFLRVK 21 (67)
T ss_pred CCCceEEEEE
Confidence 9999999986
No 10
>PF08814 XisH: XisH protein; InterPro: IPR014919 The fdxN element, along with two other DNA elements, is excised from the chromosome during heterocyst differentiation in cyanobacteria. The xisH as well as the xisF and xisI genes are required []. ; PDB: 2OKF_A.
Probab=55.93 E-value=28 Score=25.50 Aligned_cols=51 Identities=14% Similarity=0.072 Sum_probs=25.1
Q ss_pred eeeeCceEEEEEcccCCCcceEEEEEeccCCCCcceEEEe------cchHHHHHHHHHHHHHHhcC
Q psy10510 22 VLTKPILVYCTVVLTKPILVYCTVSQTIPRGGPRSQIAIP------AQGMIEFRDALTDLLEEFGT 81 (91)
Q Consensus 22 v~~~e~KrFY~DlkeN~RGRFLRIsE~~~rGg~RsqIalP------aqg~~EFRdaL~dlie~yg~ 81 (91)
.++...+.+|.||+-+. +=-+|. ....|||- .|-+.+|+.||++|+-+-.-
T Consensus 28 ~l~~~~~~~~iDLgAek----~iaAek-----~~~kIAVEIKSF~~~S~i~df~~AlGQ~~~Yr~~ 84 (135)
T PF08814_consen 28 RLKYGGVDLYIDLGAEK----LIAAEK-----DGEKIAVEIKSFLGPSPISDFHTALGQFLNYRLA 84 (135)
T ss_dssp --EETTEE--------------EEEEE-----TTEEEEEEE----TS-HHHHHHHHHHHHHHHHHH
T ss_pred EEEECcEEEEeccchHH----HHHhhc-----cCceEEEEEeCCCCCcccHHHHHHHHHHHHHHHH
Confidence 34556789999999874 222332 23356543 68899999999999876543
No 11
>PF10384 Scm3: Centromere protein Scm3; InterPro: IPR018465 The centromere protein Scm3 is a non-histone component of centromeric chromatin that binds to CenH3-H4 histones, which are required for kinetochore assembly. Scm3 is required for Cse4 localisation and is required for its centromeric association [, ]. The histone H3 variant Cse4 replaces conventional histone H3 in centromeric chromatin and helps direct the assembly of the kinetochore. In addition, Scm3 has been shown in Saccharomyces cerevisiae (Baker's yeast) to be required for G2/M progression []. Scm3 is required to maintain kinetochore function throughout the cell cycle. Scm3 contains a nuclear export signal (NES). The N-terminal region of Scm3 is well conserved and functions as the CenH3-interacting domain, while the C-terminal region is variable in size and sometimes consists of DNA binding motifs [].; GO: 0005634 nucleus; PDB: 3R45_C 2YFV_C 2L5A_A.
Probab=49.87 E-value=23 Score=22.25 Aligned_cols=20 Identities=20% Similarity=0.493 Sum_probs=16.8
Q ss_pred chHHHHHHHHHHHHHHhcCC
Q psy10510 63 QGMIEFRDALTDLLEEFGTE 82 (91)
Q Consensus 63 qg~~EFRdaL~dlie~yg~~ 82 (91)
+....|...+..++++|+.-
T Consensus 9 ~s~~r~k~~~e~I~~KY~~~ 28 (58)
T PF10384_consen 9 QSDQRFKSRWESIIEKYGQP 28 (58)
T ss_dssp HHHHHHHHHHHHHHHHHCSG
T ss_pred HHHHHHHHHHHHHHHHhcCc
Confidence 34567999999999999993
No 12
>PHA01746 hypothetical protein
Probab=47.94 E-value=38 Score=24.82 Aligned_cols=54 Identities=28% Similarity=0.276 Sum_probs=37.3
Q ss_pred CceEEEEEcccCCCcceEEEEEec-cCCCCcceEEEecchH------HHHHHHHHHHHHHh
Q psy10510 26 PILVYCTVVLTKPILVYCTVSQTI-PRGGPRSQIAIPAQGM------IEFRDALTDLLEEF 79 (91)
Q Consensus 26 e~KrFY~DlkeN~RGRFLRIsE~~-~rGg~RsqIalPaqg~------~EFRdaL~dlie~y 79 (91)
.-++.-+-+.--|-|--+|-|++. ++...+.||-|.++-+ ++|...-++++++|
T Consensus 30 g~~rivlQiNAvP~dg~vr~SMTiHs~nNfkKQIGiTa~DAgdL~lIa~FLeKYS~~LnkY 90 (131)
T PHA01746 30 GIKRIVVQLNAVPQDGKIRYSMTIHSQNNFRKQIGITPQDAGDLKLIAEFLEKYSDFLNEY 90 (131)
T ss_pred CcceEEEEeeccCCCCeeEEEEEEecccchhhhcCCCccchhhHHHHHHHHHHHHHHHHHH
Confidence 345666677777888889999988 3345788999988754 45555555555555
No 13
>PF13691 Lactamase_B_4: tRNase Z endonuclease
Probab=46.95 E-value=18 Score=22.99 Aligned_cols=20 Identities=10% Similarity=-0.145 Sum_probs=16.7
Q ss_pred EeEeeeeCceEEEE-EcccCC
Q psy10510 19 VAVVLTKPILVYCT-VVLTKP 38 (91)
Q Consensus 19 ~~~v~~~e~KrFY~-DlkeN~ 38 (91)
.+.++..+++||.| ++.|+-
T Consensus 13 p~l~l~~d~~rYlFGn~gEGt 33 (63)
T PF13691_consen 13 PSLLLFFDSRRYLFGNCGEGT 33 (63)
T ss_pred CEEEEEeCCceEEeccCCcHH
Confidence 45688899999999 999863
No 14
>cd04096 eEF2_snRNP_like_C eEF2_snRNP_like_C: this family represents a C-terminal domain of eukaryotic elongation factor 2 (eEF-2) and a homologous domain of the spliceosomal human 116kD U5 small nuclear ribonucleoprotein (snRNP) protein (U5-116 kD) and, its yeast counterpart Snu114p. Yeast Snu114p is essential for cell viability and for splicing in vivo. U5-116 kD binds GTP. Experiments suggest that GTP binding and probably GTP hydrolysis is important for the function of the U5-116 kD/Snu114p. In complex with GTP, EF-2 promotes the translocation step of translation. During translocation the peptidyl-tRNA is moved from the A site to the P site, the uncharged tRNA from the P site to the E-site and, the mRNA is shifted one codon relative to the ribosome.
Probab=44.52 E-value=28 Score=21.52 Aligned_cols=40 Identities=13% Similarity=0.061 Sum_probs=28.8
Q ss_pred cCCCcceEEEEEeccCCC--CcceEEEecchHHHHHHHHHHHHH
Q psy10510 36 TKPILVYCTVSQTIPRGG--PRSQIAIPAQGMIEFRDALTDLLE 77 (91)
Q Consensus 36 eN~RGRFLRIsE~~~rGg--~RsqIalPaqg~~EFRdaL~dlie 77 (91)
.+.||.++..... .++ .+-+-.+|+..+..|.+.|..+-.
T Consensus 23 ~~rrg~i~~~~~~--~~~~~~~i~~~iP~~e~~~~~~~Lrs~T~ 64 (80)
T cd04096 23 SKRRGHVLSEEPK--EGTPLFEIKAYLPVIESFGFETDLRSATS 64 (80)
T ss_pred hhCeeEEeEEeec--CCCccEEEEEEEeHHHHhCcHHHHHhhCC
Confidence 5778888876543 122 344667999999999999988643
No 15
>smart00732 YqgFc Likely ribonuclease with RNase H fold. YqgF proteins are likely to function as an alternative to RuvC in most bacteria, and could be the principal holliday junction resolvases in low-GC Gram-positive bacteria. In Spt6p orthologues, the catalytic residues are substituted indicating that they lack enzymatic functions.
Probab=40.17 E-value=86 Score=19.24 Aligned_cols=16 Identities=13% Similarity=0.193 Sum_probs=11.8
Q ss_pred hHHHHHHHHHHHHHHh
Q psy10510 64 GMIEFRDALTDLLEEF 79 (91)
Q Consensus 64 g~~EFRdaL~dlie~y 79 (91)
...++.+.|.+++++|
T Consensus 35 ~~~~~~~~l~~~i~~~ 50 (99)
T smart00732 35 NKEADAARLKKLIKKY 50 (99)
T ss_pred CcchHHHHHHHHHHHh
Confidence 4567778888888776
No 16
>cd04097 mtEFG1_C mtEFG1_C: C-terminus of mitochondrial Elongation factor G1 (mtEFG1)-like proteins found in eukaryotes. Eukaryotic cells harbor 2 protein synthesis systems: one localized in the cytoplasm, the other in the mitochondria. Most factors regulating mitochondrial protein synthesis are encoded by nuclear genes, translated in the cytoplasm, and then transported to the mitochondria. The eukaryotic system of elongation factor (EF) components is more complex than that in prokaryotes, with both cytoplasmic and mitochondrial elongation factors and multiple isoforms being expressed in certain species. Eukaryotic EF-2 operates in the cytosolic protein synthesis machinery of eukaryotes, EF-Gs in protein synthesis in bacteria. Eukaryotic mtEFG1 proteins show significant homology to bacterial EF-Gs. Mutants in yeast mtEFG1 have impaired mitochondrial protein synthesis, respiratory defects and a tendency to lose mitochondrial DNA. There are two forms of mtEFG present in mammals (desig
Probab=39.25 E-value=34 Score=21.13 Aligned_cols=40 Identities=10% Similarity=0.115 Sum_probs=29.3
Q ss_pred cCCCcceEEEEEeccCCCCcceEEEecchHHHHHHHHHHHHH
Q psy10510 36 TKPILVYCTVSQTIPRGGPRSQIAIPAQGMIEFRDALTDLLE 77 (91)
Q Consensus 36 eN~RGRFLRIsE~~~rGg~RsqIalPaqg~~EFRdaL~dlie 77 (91)
.+.||+++-..... +..+-+-.+|+..+..|.+.|..+-.
T Consensus 23 ~~rrg~i~~~~~~~--~~~~i~~~~P~~e~~g~~~~Lr~~T~ 62 (78)
T cd04097 23 NKRKGTIVDTDTGE--DEFTLEAEVPLNDMFGYSTELRSMTQ 62 (78)
T ss_pred HHCCCEEeceEecC--CeEEEEEEECHHHhhChHHHHHhhCC
Confidence 56799998776432 23345667999999999999987543
No 17
>smart00838 EFG_C Elongation factor G C-terminus. This domain includes the carboxyl terminal regions of Elongation factor G, elongation factor 2 and some tetracycline resistance proteins and adopt a ferredoxin-like fold.
Probab=39.17 E-value=26 Score=21.94 Aligned_cols=42 Identities=12% Similarity=0.056 Sum_probs=29.9
Q ss_pred cCCCcceEEEEEeccCCCCcceEEEecchHHHHHHHHHHHHHHh
Q psy10510 36 TKPILVYCTVSQTIPRGGPRSQIAIPAQGMIEFRDALTDLLEEF 79 (91)
Q Consensus 36 eN~RGRFLRIsE~~~rGg~RsqIalPaqg~~EFRdaL~dlie~y 79 (91)
.+.||.++.+.... ....-+-.+|+..+..|++.|..+-.-+
T Consensus 25 ~~rrG~i~~~~~~~--~~~~i~~~iP~~~~~~~~~~Lrs~T~G~ 66 (85)
T smart00838 25 NSRRGKIEGMEQRG--GAQVIKAKVPLSEMFGYATDLRSATQGR 66 (85)
T ss_pred HHcCCEEECeeccC--CcEEEEEECCHHHHhchHHHHHHhcCCe
Confidence 57899887666432 1334466789999999999998765433
No 18
>PLN02994 1-aminocyclopropane-1-carboxylate synthase
Probab=36.06 E-value=31 Score=24.67 Aligned_cols=19 Identities=26% Similarity=0.599 Sum_probs=16.6
Q ss_pred ecchHHHHHHHHHHHHHHh
Q psy10510 61 PAQGMIEFRDALTDLLEEF 79 (91)
Q Consensus 61 Paqg~~EFRdaL~dlie~y 79 (91)
+..|..+||+++.+++.++
T Consensus 90 ~~~G~~~lR~AiA~~l~~~ 108 (153)
T PLN02994 90 DYHGLANFRKAIANFMAEA 108 (153)
T ss_pred CCCCcHHHHHHHHHHHHHH
Confidence 6679999999999999664
No 19
>PF09186 DUF1949: Domain of unknown function (DUF1949); InterPro: IPR015269 Members of this entry are a set of functionally uncharacterised hypothetical bacterial proteins. They adopt a ferredoxin-like fold, with a beta-alpha-beta-beta-alpha-beta arrangement []. This entry contains the protein Impact, which is a translational regulator that ensures constant high levels of translation under amino acid starvation. It acts by interacting with Gcn1/Gcn1L1, thereby preventing activation of Gcn2 protein kinases (EIF2AK1 to 4) and subsequent down-regulation of protein synthesis. It is evolutionary conserved from eukaryotes to archaea []. ; PDB: 2CVE_A 1VI7_A.
Probab=33.30 E-value=67 Score=18.00 Aligned_cols=21 Identities=38% Similarity=0.662 Sum_probs=18.6
Q ss_pred cceEEEecchHHHHHHHHHHH
Q psy10510 55 RSQIAIPAQGMIEFRDALTDL 75 (91)
Q Consensus 55 RsqIalPaqg~~EFRdaL~dl 75 (91)
.-+|.||.+=..+|.+.|.++
T Consensus 32 ~~~v~v~~~~~~~f~~~l~~~ 52 (56)
T PF09186_consen 32 TLTVAVPEEEVEEFKAQLTDL 52 (56)
T ss_dssp EEEEEEECCCHHHHHHHHHHH
T ss_pred EEEEEECHHHHHHHHHHHHHH
Confidence 458999999999999999886
No 20
>PF12651 RHH_3: Ribbon-helix-helix domain
Probab=32.94 E-value=36 Score=19.78 Aligned_cols=12 Identities=50% Similarity=0.952 Sum_probs=9.9
Q ss_pred HHHHHHHHHHHh
Q psy10510 68 FRDALTDLLEEF 79 (91)
Q Consensus 68 FRdaL~dlie~y 79 (91)
+++||.+|+++|
T Consensus 32 l~eAle~~l~ky 43 (44)
T PF12651_consen 32 LREALEDYLEKY 43 (44)
T ss_pred HHHHHHHHHHhc
Confidence 568888998887
No 21
>CHL00100 ilvH acetohydroxyacid synthase small subunit
Probab=31.65 E-value=1.2e+02 Score=22.62 Aligned_cols=32 Identities=19% Similarity=0.381 Sum_probs=26.4
Q ss_pred CCCcceEEEecc--hHHHHHHHHHHHHHHhcCCC
Q psy10510 52 GGPRSQIAIPAQ--GMIEFRDALTDLLEEFGTED 83 (91)
Q Consensus 52 Gg~RsqIalPaq--g~~EFRdaL~dlie~yg~~~ 83 (91)
|..|-+|.+|++ -+.++++.|.+|+|-....+
T Consensus 42 ~~sr~TIvv~~~~~~ieqL~kQL~KLidVl~V~~ 75 (174)
T CHL00100 42 GISRITMVVPGDDRTIEQLTKQLYKLVNILKVQD 75 (174)
T ss_pred CccEEEEEEECCHHHHHHHHHHHHHHhHhhEEEe
Confidence 446789999986 58899999999999887653
No 22
>PF11065 DUF2866: Protein of unknown function (DUF2866); InterPro: IPR021294 This bacterial family of proteins have no known function.
Probab=31.34 E-value=1.3e+02 Score=19.78 Aligned_cols=39 Identities=23% Similarity=0.339 Sum_probs=30.5
Q ss_pred cCCCcceEEEEEe-ccCCCCcceEEEecc-hHHHHHHHHHH
Q psy10510 36 TKPILVYCTVSQT-IPRGGPRSQIAIPAQ-GMIEFRDALTD 74 (91)
Q Consensus 36 eN~RGRFLRIsE~-~~rGg~RsqIalPaq-g~~EFRdaL~d 74 (91)
+-|+|+-.||-|= ....|+.+.+++|++ .-+|.++++..
T Consensus 13 ~~PWG~~cRiVEW~i~~~g~~~RrvVpa~~T~~EIa~~ir~ 53 (65)
T PF11065_consen 13 QQPWGRGCRIVEWTIDHDGRISRRVVPADSTEAEIAEAIRS 53 (65)
T ss_pred cCCCCCceEEEEEEecCCcceeEEeecccCChHHHHHHHHc
Confidence 5689999999997 444578899999985 46777777764
No 23
>cd03710 BipA_TypA_C BipA_TypA_C: a C-terminal portion of BipA or TypA having homology to the C terminal domains of the elongation factors EF-G and EF-2. A member of the ribosome binding GTPase superfamily, BipA is widely distributed in bacteria and plants. BipA is a highly conserved protein with global regulatory properties in Escherichia coli. BipA is phosphorylated on a tyrosine residue under some cellular conditions. Mutants show altered regulation of some pathways. BipA functions as a translation factor that is required specifically for the expression of the transcriptional modulator Fis. BipA binds to ribosomes at a site that coincides with that of EF-G and has a GTPase activity that is sensitive to high GDP:GTP ratios and, is stimulated by 70S ribosomes programmed with mRNA and aminoacylated tRNAs. The growth rate-dependent induction of BipA allows the efficient expression of Fis, thereby modulating a range of downstream processes, including DNA metabolism and type III secreti
Probab=31.16 E-value=36 Score=21.20 Aligned_cols=42 Identities=24% Similarity=0.305 Sum_probs=30.1
Q ss_pred ccCCCcceEEEEEeccCCCCcceEEEecchHHHHHHHHHHHHH
Q psy10510 35 LTKPILVYCTVSQTIPRGGPRSQIAIPAQGMIEFRDALTDLLE 77 (91)
Q Consensus 35 keN~RGRFLRIsE~~~rGg~RsqIalPaqg~~EFRdaL~dlie 77 (91)
-.+.||.++.+..... +...-+-.+|+..+..|.+.|..+-.
T Consensus 22 l~~rrg~i~~~~~~~~-~~~~i~~~~P~~~~~~~~~~Lrs~T~ 63 (79)
T cd03710 22 LGKRKGEMVDMEPDGN-GRTRLEFKIPSRGLIGFRSEFLTDTR 63 (79)
T ss_pred HHhCCCEEEccEECCC-CEEEEEEEECHHHHcCcHHHHHhhCC
Confidence 3568999998775421 12345678999999999999987543
No 24
>cd03709 lepA_C lepA_C: This family represents the C-terminal region of LepA, a GTP-binding protein localized in the cytoplasmic membrane. LepA is ubiquitous in Bacteria and Eukaryota (e.g. Saccharomyces cerevisiae GUF1p), but is missing from Archaea. LepA exhibits significant homology to elongation factors (EFs) Tu and G. The function(s) of the proteins in this family are unknown. The N-terminal domain of LepA is homologous to a domain of similar size found in initiation factor 2 (IF2), and in EF-Tu and EF-G (factors required for translation in Escherichia coli). Two types of phylogenetic tree, rooted by other GTP-binding proteins, suggest that eukaryotic homologs (including S. cerevisiae GUF1) originated within the bacterial LepA family. LepA has never been observed in archaea, and eukaryl LepA is organellar. LepA is therefore a true bacterial GTPase, found only in the bacterial lineage.
Probab=30.22 E-value=49 Score=20.73 Aligned_cols=41 Identities=12% Similarity=0.099 Sum_probs=29.1
Q ss_pred cCCCcceEEEEEeccCCCCcceEEEecchHH-HHHHHHHHHHH
Q psy10510 36 TKPILVYCTVSQTIPRGGPRSQIAIPAQGMI-EFRDALTDLLE 77 (91)
Q Consensus 36 eN~RGRFLRIsE~~~rGg~RsqIalPaqg~~-EFRdaL~dlie 77 (91)
...||.++.+.+... +..+-+=.+|+.+++ .|+..|..+-.
T Consensus 23 ~~rrG~i~~~~~~~~-~~~~i~~~~P~~~~~~g~~~~L~s~T~ 64 (80)
T cd03709 23 QERRGVQKDMEYLDA-NRVMLTYELPLAEIVYDFFDKLKSISK 64 (80)
T ss_pred HHhCCEEeccEecCC-CeEEEEEECCHHHHhhhHHHHhHhhcC
Confidence 357899998875421 133445579999995 99999987644
No 25
>PF05326 SVA: Seminal vesicle autoantigen (SVA); InterPro: IPR007990 This family consists of seminal vesicle autoantigen and prolactin-inducible (PIP) proteins. Seminal vesicle autoantigen (SVA) is specifically present in the seminal plasma of mice. This 19 kDa secretory glycoprotein suppresses the motility of spermatozoa by interacting with phospholipid. PIP has several known functions. In saliva, this protein plays a role in host defence by binding to microorganisms such as Streptococcus. PIP is an aspartyl proteinase and it acts as a factor capable of suppressing T-cell apoptosis through its interaction with CD4 [].; GO: 0005576 extracellular region; PDB: 3ES6_B.
Probab=29.65 E-value=77 Score=22.93 Aligned_cols=25 Identities=8% Similarity=0.018 Sum_probs=15.9
Q ss_pred CceEEEEEcccCCCcceEEEEEecc
Q psy10510 26 PILVYCTVVLTKPILVYCTVSQTIP 50 (91)
Q Consensus 26 e~KrFY~DlkeN~RGRFLRIsE~~~ 50 (91)
.-+.||||+.-|.-++.=-.+++.+
T Consensus 92 ~~r~FyWDi~~~~t~~i~~~adii~ 116 (124)
T PF05326_consen 92 YPRTFYWDIQVNRTVTIAAVADIIR 116 (124)
T ss_dssp S-EEEEEEE--SS-EEEEEEEEE--
T ss_pred CCccEEEEEEECceEEEEEEEEEEh
Confidence 3688999999999888877777653
No 26
>cd04870 ACT_PSP_1 CT domains found N-terminal of phosphoserine phosphatase (PSP, SerB). The ACT_PSP_1 CD includes the first of the two ACT domains found N-terminal of phosphoserine phosphatase (PSP, SerB). PSPs belong to the L-2-haloacid dehalogenase-like protein superfamily. PSP is involved in serine metabolism; serine is synthesized from phosphoglycerate through sequential reactions catalyzed by 3-phosphoglycerate dehydrogenase (SerA), 3-phosphoserine aminotransferase (SerC), and SerB. Members of this CD belong to the superfamily of ACT regulatory domains.
Probab=28.61 E-value=85 Score=19.03 Aligned_cols=25 Identities=20% Similarity=0.202 Sum_probs=20.2
Q ss_pred eEEEecc-hHHHHHHHHHHHHHHhcC
Q psy10510 57 QIAIPAQ-GMIEFRDALTDLLEEFGT 81 (91)
Q Consensus 57 qIalPaq-g~~EFRdaL~dlie~yg~ 81 (91)
++.+|.+ -+.+++++|..+.++.+-
T Consensus 44 ~v~~p~~~~~~~l~~~l~~l~~~l~l 69 (75)
T cd04870 44 LVQIPDSADSEALLKDLLFKAHELGL 69 (75)
T ss_pred EEEcCCCCCHHHHHHHHHHHHHHcCc
Confidence 4557776 689999999999988764
No 27
>TIGR03422 mito_frataxin frataxin. Frataxin is a mitochondrial protein, mutation of which leads to the disease Friedreich's ataxia. Its orthologs are widely distributed in the bacteria, associated with the ISC system for iron-sulfur cluster assembly, and designated CyaY. This exception-type model allows those examples of frataxin per se that score above the trusted cutoff to the CyaY equivalog-type model (TIGR03421) to be named appropriately.
Probab=28.09 E-value=48 Score=22.47 Aligned_cols=19 Identities=42% Similarity=0.651 Sum_probs=15.1
Q ss_pred eeeecCceEEEEEeE----eeee
Q psy10510 7 QVECSEGTLTIRVAV----VLTK 25 (91)
Q Consensus 7 ~~~~~~~~~~~~~~~----v~~~ 25 (91)
-+|++.|-|||.+.. |+++
T Consensus 30 D~e~~~gVLti~~~~~~~~VINk 52 (97)
T TIGR03422 30 DVEYSSGVLTLELPSVGTYVINK 52 (97)
T ss_pred ccccCCCEEEEEECCCCEEEEeC
Confidence 578899999999764 6666
No 28
>PF13601 HTH_34: Winged helix DNA-binding domain; PDB: 1UB9_A.
Probab=27.79 E-value=1.5e+02 Score=18.90 Aligned_cols=35 Identities=20% Similarity=0.352 Sum_probs=22.0
Q ss_pred eEEEEEeccCCCCcceEEEecchHHHHHHHHHHHH
Q psy10510 42 YCTVSQTIPRGGPRSQIAIPAQGMIEFRDALTDLL 76 (91)
Q Consensus 42 FLRIsE~~~rGg~RsqIalPaqg~~EFRdaL~dli 76 (91)
|+.+......+.++..+.|-..|-.+|.+.++.|-
T Consensus 43 yV~~~k~~~~~~p~t~~~lT~~Gr~~~~~~~~~L~ 77 (80)
T PF13601_consen 43 YVEVEKEFEGRRPRTWYSLTDKGREAFERYVAALR 77 (80)
T ss_dssp SEEEEEE-SSS--EEEEEE-HHHHHHHHHHHHHHH
T ss_pred CEEEEEeccCCCCeEEEEECHHHHHHHHHHHHHHH
Confidence 44554433223467889999999999999887653
No 29
>PF07944 DUF1680: Putative glycosyl hydrolase of unknown function (DUF1680); InterPro: IPR012878 The members of this family are sequences derived from hypothetical bacterial and eukaryotic proteins of unknown function. One member of this family is annotated as a possible arabinosidase, but no references were found to back this.
Probab=27.61 E-value=1.4e+02 Score=24.97 Aligned_cols=38 Identities=16% Similarity=0.078 Sum_probs=27.9
Q ss_pred eeeeCceEEEEEcccCCCcceEEEEEeccCCCCcceEEEecc
Q psy10510 22 VLTKPILVYCTVVLTKPILVYCTVSQTIPRGGPRSQIAIPAQ 63 (91)
Q Consensus 22 v~~~e~KrFY~DlkeN~RGRFLRIsE~~~rGg~RsqIalPaq 63 (91)
.++.+.+.. ...-..|.|+.|+-.+. .|.+-+|.+||+
T Consensus 456 ~i~vNG~~~---~~~~~~~gy~~i~r~W~-~gD~v~l~lpm~ 493 (520)
T PF07944_consen 456 TIRVNGEPV---VDTAVPGGYLTIEREWK-DGDVVELRLPME 493 (520)
T ss_pred EEEECCEeC---CCCcCCCCeEEEEeecc-CCcEEEEEecCe
Confidence 444555544 55667899999998874 478889999986
No 30
>PHA03069 DNA-binding protein; Provisional
Probab=26.71 E-value=41 Score=24.38 Aligned_cols=16 Identities=6% Similarity=-0.074 Sum_probs=12.5
Q ss_pred EEEEEcccCCCcceEEEEE
Q psy10510 29 VYCTVVLTKPILVYCTVSQ 47 (91)
Q Consensus 29 rFY~DlkeN~RGRFLRIsE 47 (91)
-||+|+++ ||||-+.-
T Consensus 18 PFiint~e---GryLVLKa 33 (119)
T PHA03069 18 PFILNTDD---GRYLVLKA 33 (119)
T ss_pred ceEEecCC---CeEEEEEE
Confidence 48999998 99875543
No 31
>PHA03244 large tegument protein UL36; Provisional
Probab=25.27 E-value=1.2e+02 Score=26.47 Aligned_cols=34 Identities=12% Similarity=0.081 Sum_probs=30.5
Q ss_pred CceEEEEEeEeeeeCceEEEEEcccCCCcceEEEEEe
Q psy10510 12 EGTLTIRVAVVLTKPILVYCTVVLTKPILVYCTVSQT 48 (91)
Q Consensus 12 ~~~~~~~~~~v~~~e~KrFY~DlkeN~RGRFLRIsE~ 48 (91)
+++..+|+-+++-.-.-.|+||+.+| .+.||++-
T Consensus 246 ~~~F~~~~~VL~D~~GaVYg~dv~~~---~v~RLADD 279 (478)
T PHA03244 246 RAEFHVRQYVLMDPFGAVYGYDPALD---AVYRLAED 279 (478)
T ss_pred CCeeeeEEEEEECCCceEEEEeccCC---eEEEehhh
Confidence 37889999999999999999999998 89999974
No 32
>PF00797 Acetyltransf_2: N-acetyltransferase; InterPro: IPR001447 Arylamine N-acetyltransferase (NAT) is a cytosolic enzyme of approximately 30 kDa. It facilitates the transfer of an acetyl group from acetyl coenzyme A on to a wide range of arylamine, N-hydroxyarylamines and hydrazines. Acetylation of these compounds generally results in inactivation. NAT is found in many species from Mycobacteria (Mycobacterium tuberculosis, Mycobacterium smegmatis etc) to Homo sapiens (Human). It was the first enzyme to be observed to have polymorphic activity amongst human individuals. NAT is responsible for the inactivation of Isoniazid (a drug used to treat tuberculosis) in humans. The NAT protein has also been shown to be involved in the breakdown of folic acid. NAT catalyses the reaction: Acetyl-coA + arylamine = coA + N-acetylarylamine NAT is the target of a common genetic polymorphism of clinical relevance in humans. The N-acetylation polymorphism is determined by low or high NAT activity in liver. NAT has been implicated in the action and toxicity of amine-containing drugs, and in the susceptibility to cancer and systematic lupus erythematosus. Two highly similar human genes for NAT, termed NAT1 and NAT2, encode genetically invariant and variant NAT proteins, respectively. ; GO: 0016407 acetyltransferase activity, 0008152 metabolic process; PDB: 1W6F_A 1W5R_A 1GX3_D 2PQT_A 2IJA_A 1W4T_A 2BSZ_B 3D9W_B 3LTW_A 3LNB_A ....
Probab=24.96 E-value=80 Score=22.91 Aligned_cols=19 Identities=16% Similarity=-0.001 Sum_probs=14.4
Q ss_pred EEeEeeeeCceEEEEEccc
Q psy10510 18 RVAVVLTKPILVYCTVVLT 36 (91)
Q Consensus 18 ~~~~v~~~e~KrFY~Dlke 36 (91)
-++-+++-++++|+.||+=
T Consensus 88 H~~liV~~~~~~ylvDvGf 106 (240)
T PF00797_consen 88 HLVLIVTLDGERYLVDVGF 106 (240)
T ss_dssp EEEEEEEETTEEEEE-SSS
T ss_pred eEEEEEEECCEEEEEeccC
Confidence 3566778899999999974
No 33
>cd04875 ACT_F4HF-DF N-terminal ACT domain of formyltetrahydrofolate deformylase (F4HF-DF; formyltetrahydrofolate hydrolase). This CD includes the N-terminal ACT domain of formyltetrahydrofolate deformylase (F4HF-DF; formyltetrahydrofolate hydrolase) which catalyzes the hydrolysis of 10-formyltetrahydrofolate (formyl-FH4) to FH4 and formate. Formyl-FH4 hydrolase generates the formate that is used by purT-encoded 5'-phosphoribosylglycinamide transformylase for step three of de novo purine nucleotide synthesis. Formyl-FH4 hydrolase, a hexamer which is activated by methionine and inhibited by glycine, is proposed to regulate the balance FH4 and C1-FH4 in response to changing growth conditions. Members of this CD belong to the superfamily of ACT regulatory domains.
Probab=24.65 E-value=1.3e+02 Score=17.99 Aligned_cols=24 Identities=13% Similarity=0.232 Sum_probs=19.4
Q ss_pred eEEEecc--hHHHHHHHHHHHHHHhc
Q psy10510 57 QIAIPAQ--GMIEFRDALTDLLEEFG 80 (91)
Q Consensus 57 qIalPaq--g~~EFRdaL~dlie~yg 80 (91)
++.+|+. .+.+|+++|..+.++++
T Consensus 46 ~~~~~~~~~~~~~l~~~l~~l~~~l~ 71 (74)
T cd04875 46 EFELEGFDLSREALEAAFAPVAAEFD 71 (74)
T ss_pred EEEeCCCCCCHHHHHHHHHHHHHHcC
Confidence 5567764 69999999999988875
No 34
>cd03713 EFG_mtEFG_C EFG_mtEFG_C: domains similar to the C-terminal domain of the bacterial translational elongation factor (EF) EF-G. Included in this group is the C-terminus of mitochondrial Elongation factor G1 (mtEFG1) and G2 (mtEFG2) proteins. Eukaryotic cells harbor 2 protein synthesis systems: one localized in the cytoplasm, the other in the mitochondria. Most factors regulating mitochondrial protein synthesis are encoded by nuclear genes, translated in the cytoplasm, and then transported to the mitochondria. The eukaryotic system of elongation factor (EF) components is more complex than that in prokaryotes, with both cytoplasmic and mitochondrial elongation factors and multiple isoforms being expressed in certain species. During the process of peptide synthesis and tRNA site changes, the ribosome is moved along the mRNA a distance equal to one codon with the addition of each amino acid. In bacteria this translocation step is catalyzed by EF-G_GTP, which is hydrolyzed to provide
Probab=24.52 E-value=67 Score=19.57 Aligned_cols=40 Identities=13% Similarity=0.115 Sum_probs=27.9
Q ss_pred cCCCcceEEEEEeccCCCCcceEEEecchHHHHHHHHHHHHH
Q psy10510 36 TKPILVYCTVSQTIPRGGPRSQIAIPAQGMIEFRDALTDLLE 77 (91)
Q Consensus 36 eN~RGRFLRIsE~~~rGg~RsqIalPaqg~~EFRdaL~dlie 77 (91)
.+.||.++-.... .+...-+-.+|+..+..|.+.|..+-.
T Consensus 23 ~~rrg~i~~~~~~--~~~~~i~~~iP~~e~~~~~~~Lr~~T~ 62 (78)
T cd03713 23 SSRRGQILGTESR--GGWKVIKAEVPLAEMFGYSTDLRSLTQ 62 (78)
T ss_pred HHcCCceEceecc--CCcEEEEEEcCHHHHhChHHHHHhhcC
Confidence 5678888866543 122334557899999999999887543
No 35
>cd00201 WW Two conserved tryptophans domain; also known as the WWP or rsp5 domain; around 40 amino acids; functions as an interaction module in a diverse set of signalling proteins; binds specific proline-rich sequences but at low affinities compared to other peptide recognition proteins such as antibodies and receptors; WW domains have a single groove formed by a conserved Trp and Tyr which recognizes a pair of residues of the sequence X-Pro; variable loops and neighboring domains confer specificity in this domain; there are five distinct groups based on binding: 1) PPXY motifs 2) the PPLP motif; 3) PGM motifs; 4) PSP or PTP motifs; 5) PR motifs.
Probab=24.18 E-value=96 Score=15.50 Aligned_cols=16 Identities=25% Similarity=-0.068 Sum_probs=11.1
Q ss_pred eeeeCceEEEEEcccC
Q psy10510 22 VLTKPILVYCTVVLTK 37 (91)
Q Consensus 22 v~~~e~KrFY~DlkeN 37 (91)
....+.+.||++...+
T Consensus 7 ~~~~~g~~yy~n~~t~ 22 (31)
T cd00201 7 RWDPDGRVYYYNHNTK 22 (31)
T ss_pred EECCCCCEEEEECCCC
Confidence 3445588999998654
No 36
>PRK00415 rps27e 30S ribosomal protein S27e; Reviewed
Probab=23.50 E-value=51 Score=21.10 Aligned_cols=11 Identities=18% Similarity=0.301 Sum_probs=9.9
Q ss_pred cCCCcceEEEE
Q psy10510 36 TKPILVYCTVS 46 (91)
Q Consensus 36 eN~RGRFLRIs 46 (91)
++|+++||+|.
T Consensus 3 ~~p~S~F~~Vk 13 (59)
T PRK00415 3 PQPRSRFLKVK 13 (59)
T ss_pred CCCCCeEEEEE
Confidence 68999999996
No 37
>PF11774 Lsr2: Lsr2 ; InterPro: IPR024412 This entry represents Lsr2, which is a small, basic DNA-bridging protein present in Mycobacterium and related actinomycetes. It is a functional homologue of the H-NS-like proteins []. H-NS proteins play a role in nucleoid organisation and also function as a pleiotropic regulator of gene expression [, ].; PDB: 4E1R_B 4E1P_B 2KNG_A.
Probab=23.08 E-value=1.2e+02 Score=20.92 Aligned_cols=26 Identities=27% Similarity=0.461 Sum_probs=20.9
Q ss_pred CCcceEEEecchHHHHHHHHHHHHHH
Q psy10510 53 GPRSQIAIPAQGMIEFRDALTDLLEE 78 (91)
Q Consensus 53 g~RsqIalPaqg~~EFRdaL~dlie~ 78 (91)
|.--.|=|...-..+||++|..||+.
T Consensus 28 G~~YeIDLs~~na~~lr~~l~~yi~~ 53 (110)
T PF11774_consen 28 GVDYEIDLSAENAAKLRDALAPYIAA 53 (110)
T ss_dssp TEEEEEEE-HHHHHHHHHHHHHHHHH
T ss_pred CeEEEEECCHHHHHHHHHHHHHHHHH
Confidence 34457888889999999999999986
No 38
>PF09582 AnfO_nitrog: Iron only nitrogenase protein AnfO (AnfO_nitrog); InterPro: IPR014287 Proteins in this entry include Anf1 from Rhodobacter capsulatus (Rhodopseudomonas capsulata) and AnfO from Azotobacter vinelandii. They are found exclusively in species which contain the iron-only nitrogenase, and are encoded immediately downstream of the structural genes for the nitrogenase enzyme in these species.
Probab=22.98 E-value=83 Score=23.89 Aligned_cols=22 Identities=32% Similarity=0.646 Sum_probs=19.5
Q ss_pred EecchHHHHHHHHHHHHHHhcC
Q psy10510 60 IPAQGMIEFRDALTDLLEEFGT 81 (91)
Q Consensus 60 lPaqg~~EFRdaL~dlie~yg~ 81 (91)
-+..||.++|..+.++++.+++
T Consensus 43 ~~~~~l~~iR~~~~~li~~L~d 64 (202)
T PF09582_consen 43 CDAKGLAEIRQKISELIEFLGD 64 (202)
T ss_pred CCCCCHHHHHHHHHHHHHHhCC
Confidence 3579999999999999999875
No 39
>smart00456 WW Domain with 2 conserved Trp (W) residues. Also known as the WWP or rsp5 domain. Binds proline-rich polypeptides.
Probab=22.69 E-value=1.1e+02 Score=15.70 Aligned_cols=16 Identities=19% Similarity=-0.201 Sum_probs=11.0
Q ss_pred eeeeCceEEEEEcccC
Q psy10510 22 VLTKPILVYCTVVLTK 37 (91)
Q Consensus 22 v~~~e~KrFY~DlkeN 37 (91)
..+.+++.||++.+.+
T Consensus 8 ~~~~~g~~yy~n~~t~ 23 (32)
T smart00456 8 RKDPDGRPYYYNHETK 23 (32)
T ss_pred EECCCCCEEEEECCCC
Confidence 3445678899987654
No 40
>PF08612 Med20: TATA-binding related factor (TRF) of subunit 20 of Mediator complex; InterPro: IPR013921 The Mediator complex is a coactivator involved in the regulated transcription of nearly all RNA polymerase II-dependent genes. Mediator functions as a bridge to convey information from gene-specific regulatory proteins to the basal RNA polymerase II transcription machinery. The Mediator complex, having a compact conformation in its free form, is recruited to promoters by direct interactions with regulatory proteins and serves for the assembly of a functional preinitiation complex with RNA polymerase II and the general transcription factors. On recruitment the Mediator complex unfolds to an extended conformation and partially surrounds RNA polymerase II, specifically interacting with the unphosphorylated form of the C-terminal domain (CTD) of RNA polymerase II. The Mediator complex dissociates from the RNA polymerase II holoenzyme and stays at the promoter when transcriptional elongation begins. The Mediator complex is composed of at least 31 subunits: MED1, MED4, MED6, MED7, MED8, MED9, MED10, MED11, MED12, MED13, MED13L, MED14, MED15, MED16, MED17, MED18, MED19, MED20, MED21, MED22, MED23, MED24, MED25, MED26, MED27, MED29, MED30, MED31, CCNC, CDK8 and CDC2L6/CDK11. The subunits form at least three structurally distinct submodules. The head and the middle modules interact directly with RNA polymerase II, whereas the elongated tail module interacts with gene-specific regulatory proteins. Mediator containing the CDK8 module is less active than Mediator lacking this module in supporting transcriptional activation. The head module contains: MED6, MED8, MED11, SRB4/MED17, SRB5/MED18, ROX3/MED19, SRB2/MED20 and SRB6/MED22. The middle module contains: MED1, MED4, NUT1/MED5, MED7, CSE2/MED9, NUT2/MED10, SRB7/MED21 and SOH1/MED31. CSE2/MED9 interacts directly with MED4. The tail module contains: MED2, PGD1/MED3, RGR1/MED14, GAL11/MED15 and SIN4/MED16. The CDK8 module contains: MED12, MED13, CCNC and CDK8. Individual preparations of the Mediator complex lacking one or more distinct subunits have been variously termed ARC, CRSP, DRIP, PC2, SMCC and TRAP. Proteins in this entry are subunit Med20 of the Mediator complex, and is found in the non-essential part of the head []. and related to the TATA-binding protein (TBP). TBP is a highly conserved RNA polymerase II general transcription factor that binds to the core promoter and initiates assembly of the pre-initiation complex. Human TRF has been shown to associate with an RNA polymerase II-SRB complex []. ; GO: 0001104 RNA polymerase II transcription cofactor activity, 0006357 regulation of transcription from RNA polymerase II promoter, 0016592 mediator complex; PDB: 2HZS_C 3RJ1_M 2HZM_G.
Probab=21.96 E-value=2.4e+02 Score=21.06 Aligned_cols=51 Identities=22% Similarity=0.362 Sum_probs=34.4
Q ss_pred ceEEEEEeEeeeeCceEEEEEcccCCCcceEEEEEeccCCCCcceEEEecchHHHHHHHHHHHHHHhcCC
Q psy10510 13 GTLTIRVAVVLTKPILVYCTVVLTKPILVYCTVSQTIPRGGPRSQIAIPAQGMIEFRDALTDLLEEFGTE 82 (91)
Q Consensus 13 ~~~~~~~~~v~~~e~KrFY~DlkeN~RGRFLRIsE~~~rGg~RsqIalPaqg~~EFRdaL~dlie~yg~~ 82 (91)
|.++||+|.|.-.+ +.||--+-|.=. | ..|..-+....+.+.+|++.|+..
T Consensus 140 gDf~VRvg~v~~~~----------~~rGilvEVEy~---~------~~p~~~~~~~~~li~efl~~~~~~ 190 (225)
T PF08612_consen 140 GDFTVRVGNVFQGQ----------SFRGILVEVEYR---G------SDPCVSFEQCWELIREFLQSFGIP 190 (225)
T ss_dssp TTEEEEEEEEEETT----------EEEEEEEEEEE------------SSSTTHHHHHHHHHHHHHHTTS-
T ss_pred CCEEEEEEEEEECC----------CCcEEEEEEEEc---c------CCccccHHHHHHHHHHHHHHhCCC
Confidence 56778888876555 566666655432 1 467777888888888999999754
No 41
>COG3533 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=21.89 E-value=1.3e+02 Score=27.00 Aligned_cols=46 Identities=22% Similarity=0.278 Sum_probs=35.7
Q ss_pred eEEEEEe-----EeeeeCceEEEEEcccCCCcceEEEEEeccCCCCcceEEEecch
Q psy10510 14 TLTIRVA-----VVLTKPILVYCTVVLTKPILVYCTVSQTIPRGGPRSQIAIPAQG 64 (91)
Q Consensus 14 ~~~~~~~-----~v~~~e~KrFY~DlkeN~RGRFLRIsE~~~rGg~RsqIalPaqg 64 (91)
||.+|+- ..++.+. .|+-+-.+.=|+||+..+. .|.|-++.|||..
T Consensus 428 tlaLRlP~W~a~~tl~vNG----~~~~~~~~~GYa~i~R~Wq-~GDrV~L~LpM~v 478 (589)
T COG3533 428 TLALRLPAWCAAPTLRVNG----KEVIQTRGKGYARISREWQ-AGDRVELMLPMPV 478 (589)
T ss_pred EEEEecccccCCcEEEEcC----cchhhccCCCeeeeeehhc-CCCeEEEeeccee
Confidence 4555542 2566666 6888999999999999885 4899999999864
No 42
>PF02244 Propep_M14: Carboxypeptidase activation peptide; InterPro: IPR003146 Peptide proteinase inhibitors can be found as single domain proteins or as single or multiple domains within proteins; these are referred to as either simple or compound inhibitors, respectively. In many cases they are synthesised as part of a larger precursor protein, either as a prepropeptide or as an N-terminal domain associated with an inactive peptidase or zymogen. This domain prevents access of the substrate to the active site. Removal of the N-terminal inhibitor domain either by interaction with a second peptidase or by autocatalytic cleavage activates the zymogen. Other inhibitors interact direct with proteinases using a simple noncovalent lock and key mechanism; while yet others use a conformational change-based trapping mechanism that depends on their structural and thermodynamic properties. The peptidases are synthesised as inactive molecules, zymogens, with propeptides that must be removed by proteolytic cleavage to activate the enzyme. Structural studies of carboxypeptidases A and B reveal the propeptide to exist as a globular domain, followed by an extended alpha-helix; this shields the catalytic site, without specifically binding to it, while the substrate-binding site is blocked by making specific contacts [, ]. Members of this propeptide family are found in the metallocarboxypeptidases: A1, A2 [], A3, A4, A5, A6, U, insect gut carboxypeptidase and B [], and and are associated with peptidases belonging to MEROPS peptidase family M14A. Carboxypeptidases are found in abundance in pancreatic secretions. The pro-segment moiety (activation peptide) accounts for up to a quarter of the total length of the peptidase.; GO: 0004180 carboxypeptidase activity, 0006508 proteolysis; PDB: 3D68_C 3D67_B 3D66_A 1PBA_A 3GLJ_A 1NSA_A 1KWM_A 2BOA_B 1PYT_A 1JQG_A ....
Probab=21.77 E-value=1.5e+02 Score=17.68 Aligned_cols=22 Identities=23% Similarity=0.578 Sum_probs=18.6
Q ss_pred CCcceEEEecchHHHHHHHHHH
Q psy10510 53 GPRSQIAIPAQGMIEFRDALTD 74 (91)
Q Consensus 53 g~RsqIalPaqg~~EFRdaL~d 74 (91)
|.-..|.||.+....|.+.|..
T Consensus 33 ~~~~dv~V~p~~~~~f~~~L~~ 54 (74)
T PF02244_consen 33 GRPVDVMVPPEKLEEFEELLKE 54 (74)
T ss_dssp TSEEEEEEEGGGHHHHHHHHHH
T ss_pred CCeEEEEECHHHHHHHHHHHHH
Confidence 4567999999999999987764
No 43
>cd03711 Tet_C Tet_C: C-terminus of ribosomal protection proteins Tet(M) and Tet(O). This domain has homology to the C terminal domains of the elongation factors EF-G and EF-2. Tet(M) and Tet(O) catalyze the release of tetracycline (Tc) from the ribosome in a GTP-dependent manner thereby mediating Tc resistance. Tcs are broad-spectrum antibiotics. Typical Tcs bind to the ribosome and inhibit the elongation phase of protein synthesis, by inhibiting the occupation of site A by aminoacyl-tRNA.
Probab=21.76 E-value=84 Score=19.32 Aligned_cols=39 Identities=10% Similarity=0.029 Sum_probs=27.0
Q ss_pred cCCCcceEEEEEeccCCCCcceEEEecchHHHHHHHHHHHH
Q psy10510 36 TKPILVYCTVSQTIPRGGPRSQIAIPAQGMIEFRDALTDLL 76 (91)
Q Consensus 36 eN~RGRFLRIsE~~~rGg~RsqIalPaqg~~EFRdaL~dli 76 (91)
.+.||.++-..... +...-+-.+|+..+..|.+.|..+-
T Consensus 23 ~~rrg~i~~~~~~~--~~~~i~~~~P~~~~~g~~~~Lr~~T 61 (78)
T cd03711 23 AKMGATFEDPQIKG--DEVTLEGTIPVATSQDYQSELPSYT 61 (78)
T ss_pred HHcCCEeeCcEecC--CEEEEEEEECHHHHhhHHHHhHhhc
Confidence 56789888655321 2233466789999999999887753
No 44
>cd05687 S1_RPS1_repeat_ec1_hs1 S1_RPS1_repeat_ec1_hs1: Ribosomal protein S1 (RPS1) domain. RPS1 is a component of the small ribosomal subunit thought to be involved in the recognition and binding of mRNA's during translation initiation. The bacterial RPS1 domain architecture consists of 4-6 tandem S1 domains. In some bacteria, the tandem S1 array is located C-terminal to a 4-hydroxy-3-methylbut-2-enyl diphosphate reductase (HMBPP reductase) domain. While RPS1 is found primarily in bacteria, proteins with tandem RPS1-like domains have been identified in plants and humans, however these lack the N-terminal HMBPP reductase domain. This CD includes S1 repeat 1 of the Escherichia coli and Homo sapiens RPS1 (ec1 and hs1, respectively). Autoantibodies to double-stranded DNA from patients with systemic lupus erythematosus cross-react with the human RPS1 homolog.
Probab=21.76 E-value=71 Score=18.73 Aligned_cols=30 Identities=10% Similarity=-0.191 Sum_probs=23.0
Q ss_pred EeEeeeeCceEEEEEcccCCCcceEEEEEec
Q psy10510 19 VAVVLTKPILVYCTVVLTKPILVYCTVSQTI 49 (91)
Q Consensus 19 ~~~v~~~e~KrFY~DlkeN~RGRFLRIsE~~ 49 (91)
.|+|.....+.-++|+.....| ||.++|..
T Consensus 5 ~g~V~~i~~~~~~v~l~~~~~g-~l~~~e~~ 34 (70)
T cd05687 5 KGTVVSVDDDEVLVDIGYKSEG-IIPISEFS 34 (70)
T ss_pred EEEEEEEeCCEEEEEeCCCceE-EEEHHHhC
Confidence 5788888888889999776666 77777764
No 45
>PRK13738 conjugal transfer pilus assembly protein TraW; Provisional
Probab=20.77 E-value=1.2e+02 Score=23.39 Aligned_cols=28 Identities=11% Similarity=-0.203 Sum_probs=20.3
Q ss_pred CceEEEEEccc---------------C--CCcceEEEEEeccCCC
Q psy10510 26 PILVYCTVVLT---------------K--PILVYCTVSQTIPRGG 53 (91)
Q Consensus 26 e~KrFY~Dlke---------------N--~RGRFLRIsE~~~rGg 53 (91)
=+.++|||-.- - +.|+.|+|+|+...++
T Consensus 163 l~~~vYfdQ~G~Lt~rF~I~~VPAvV~~~q~G~~l~I~E~~~~~~ 207 (209)
T PRK13738 163 LDSRIYFDQNGVLCQRFGIDQVPARVSAVPGGRFLKVEFIPAEEG 207 (209)
T ss_pred hCCceEEcCcchHHHhcCCeeeceEEEEcCCCCEEEEEEECCCCC
Confidence 45679999643 2 7899999999875443
No 46
>PF13333 rve_2: Integrase core domain
Probab=20.73 E-value=1.1e+02 Score=17.73 Aligned_cols=19 Identities=11% Similarity=0.385 Sum_probs=16.6
Q ss_pred chHHHHHHHHHHHHHHhcC
Q psy10510 63 QGMIEFRDALTDLLEEFGT 81 (91)
Q Consensus 63 qg~~EFRdaL~dlie~yg~ 81 (91)
....|++.++.++|+.|..
T Consensus 18 ~t~eel~~~I~~YI~~yN~ 36 (52)
T PF13333_consen 18 KTREELKQAIDEYIDYYNN 36 (52)
T ss_pred chHHHHHHHHHHHHHHhcc
Confidence 5789999999999999954
No 47
>PLN00064 photosystem II protein Psb27; Provisional
Probab=20.61 E-value=50 Score=25.14 Aligned_cols=28 Identities=25% Similarity=0.469 Sum_probs=23.1
Q ss_pred cceEEEe------cchHHHHHHHHHHHHHHhcCC
Q psy10510 55 RSQIAIP------AQGMIEFRDALTDLLEEFGTE 82 (91)
Q Consensus 55 RsqIalP------aqg~~EFRdaL~dlie~yg~~ 82 (91)
|++|.|| ++...|.|+.++||+.+|--.
T Consensus 78 r~tI~L~~ddp~~a~a~aeaR~~iNdyvSrYRr~ 111 (166)
T PLN00064 78 RSTINMDKTDPNVADAVAELRETSNSWVAKYRRE 111 (166)
T ss_pred HHHHcCCCCCccHHHHHHHHHHHHHHHHHHhcCC
Confidence 6789998 456789999999999999654
No 48
>PF01491 Frataxin_Cyay: Frataxin-like domain; InterPro: IPR002908 The eukaryotic proteins in this entry include frataxin, the protein that is mutated in Friedreich's ataxia [], and related sequences. Friedreich's ataxia is a progressive neurodegenerative disorder caused by loss of function mutations in the gene encoding frataxin (FRDA). Frataxin mRNA is predominantly expressed in tissues with a high metabolic rate (including liver, kidney, brown fat and heart). Mouse and yeast frataxin homologues contain a potential N-terminal mitochondrial targeting sequence, and human frataxin has been observed to co-localise with a mitochondrial protein. Furthermore, disruption of the yeast gene has been shown to result in mitochondrial dysfunction. Friedreich's ataxia is thus believed to be a mitochondrial disease caused by a mutation in the nuclear genome (specifically, expansion of an intronic GAA triplet repeat) [, , ]. The bacterial proteins in this entry are iron-sulphur cluster (FeS) metabolism CyaY proteins hmologous to eukaryotic frataxin. Partial Phylogenetic Profiling [] suggests that CyaY most likely functions as part of the ISC system for FeS cluster biosynthesis, and is supported by expermimental data in some species [, ]. ; PDB: 1EW4_A 2P1X_A 1SOY_A 2EFF_A 3T3T_B 3S4M_A 3T3K_A 3S5D_A 1LY7_A 3T3X_B ....
Probab=20.07 E-value=83 Score=21.37 Aligned_cols=20 Identities=45% Similarity=0.504 Sum_probs=15.9
Q ss_pred ceeeecCceEEEEEeE----eeee
Q psy10510 6 SQVECSEGTLTIRVAV----VLTK 25 (91)
Q Consensus 6 ~~~~~~~~~~~~~~~~----v~~~ 25 (91)
-.++++.|-|||.+.. |+++
T Consensus 32 ~d~e~~~gVLti~~~~~~~~VINk 55 (109)
T PF01491_consen 32 IDVERSGGVLTIEFPDGGQYVINK 55 (109)
T ss_dssp EEEEEETTEEEEEETTSEEEEEEE
T ss_pred eEEEccCCEEEEEECCCCEEEEeC
Confidence 4688999999999843 7775
Done!