Query 033056
Match_columns 128
No_of_seqs 102 out of 193
Neff 3.5
Searched_HMMs 46136
Date Fri Mar 29 09:16:24 2013
Command hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/033056.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/033056hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PLN00186 ribosomal protein S26 100.0 2.9E-63 6.4E-68 365.8 7.9 104 1-105 1-104 (109)
2 PTZ00172 40S ribosomal protein 100.0 3.8E-63 8.2E-68 364.8 8.0 104 1-104 1-104 (108)
3 PF01283 Ribosomal_S26e: Ribos 100.0 1.1E-63 2.4E-68 369.8 4.8 105 1-106 1-105 (113)
4 PRK09335 30S ribosomal protein 100.0 4E-57 8.7E-62 326.8 7.0 94 1-98 1-94 (95)
5 KOG1768 40s ribosomal protein 100.0 9.4E-54 2E-58 317.6 3.9 103 1-104 1-103 (115)
6 COG4830 RPS26B Ribosomal prote 100.0 4.6E-52 9.9E-57 304.8 5.8 97 1-97 1-97 (108)
7 COG1400 SEC65 Signal recogniti 68.7 1.1 2.4E-05 32.7 -0.9 38 27-66 20-58 (93)
8 PF13119 DUF3973: Domain of un 64.7 2.5 5.5E-05 27.0 0.3 11 71-81 1-12 (41)
9 PF02591 DUF164: Putative zinc 56.5 5.6 0.00012 25.2 0.9 13 18-30 44-56 (56)
10 PF07503 zf-HYPF: HypF finger; 55.0 5.6 0.00012 24.1 0.6 13 15-27 16-28 (35)
11 PF08209 Sgf11: Sgf11 (transcr 53.0 8.6 0.00019 23.1 1.2 15 18-32 2-16 (33)
12 PF09889 DUF2116: Uncharacteri 46.5 7.9 0.00017 26.0 0.4 17 21-37 4-20 (59)
13 TIGR01031 rpmF_bact ribosomal 44.4 18 0.00039 23.6 1.8 28 2-29 2-35 (55)
14 PRK04016 DNA-directed RNA poly 41.7 13 0.00027 25.5 0.8 14 19-32 3-16 (62)
15 COG4481 Uncharacterized protei 40.8 16 0.00034 25.0 1.1 12 19-30 33-44 (60)
16 PF04726 Microvir_J: Microviru 40.5 16 0.00034 21.0 0.9 15 1-15 1-15 (24)
17 COG1644 RPB10 DNA-directed RNA 39.9 13 0.00028 25.7 0.6 13 19-31 3-15 (63)
18 COG5112 UFD2 U1-like Zn-finger 39.6 9.9 0.00021 29.2 0.0 31 49-81 31-65 (126)
19 COG5134 Uncharacterized conser 39.5 16 0.00035 31.1 1.2 44 20-77 42-85 (272)
20 PF13248 zf-ribbon_3: zinc-rib 36.7 14 0.0003 20.4 0.3 13 21-33 3-15 (26)
21 PLN00032 DNA-directed RNA poly 36.1 18 0.00038 25.5 0.8 12 19-30 3-14 (71)
22 KOG2612 Predicted integral mem 34.7 14 0.0003 27.5 0.1 14 17-30 71-84 (103)
23 PF10122 Mu-like_Com: Mu-like 33.6 19 0.00041 23.9 0.6 18 19-36 3-20 (51)
24 KOG3408 U1-like Zn-finger-cont 33.2 13 0.00029 28.8 -0.1 18 62-81 50-67 (129)
25 PF01922 SRP19: SRP19 protein; 32.2 11 0.00024 26.8 -0.7 22 26-47 16-37 (95)
26 PF01194 RNA_pol_N: RNA polyme 31.3 17 0.00036 24.7 0.1 12 20-31 4-15 (60)
27 COG2888 Predicted Zn-ribbon RN 30.7 34 0.00075 23.5 1.5 35 18-53 7-42 (61)
28 PF06639 BAP: Basal layer anti 27.8 14 0.00031 26.2 -0.7 23 75-97 5-27 (75)
29 KOG3198 Signal recognition par 26.7 18 0.00038 28.8 -0.5 18 27-44 32-49 (152)
30 PF03604 DNA_RNApol_7kD: DNA d 26.3 26 0.00057 20.8 0.3 13 18-30 15-27 (32)
31 PF13240 zinc_ribbon_2: zinc-r 26.1 28 0.0006 19.1 0.4 12 22-33 1-12 (23)
32 PRK12286 rpmF 50S ribosomal pr 24.2 47 0.001 21.9 1.3 29 2-30 4-37 (57)
33 KOG3497 DNA-directed RNA polym 24.2 35 0.00076 23.9 0.7 12 19-30 3-14 (69)
34 PF12230 PRP21_like_P: Pre-mRN 24.0 26 0.00056 27.8 0.0 32 16-48 164-195 (229)
35 PF14832 Tautomerase_3: Putati 23.9 50 0.0011 25.0 1.6 25 48-74 20-44 (136)
36 PF10589 NADH_4Fe-4S: NADH-ubi 23.6 22 0.00048 22.2 -0.4 10 24-33 14-23 (46)
37 PRK14890 putative Zn-ribbon RN 23.6 41 0.00089 22.9 0.9 27 19-46 6-33 (59)
38 PF13717 zinc_ribbon_4: zinc-r 23.5 45 0.00098 19.8 1.0 14 16-29 21-34 (36)
39 TIGR02174 CXXU_selWTH selT/sel 21.8 37 0.00081 22.6 0.5 14 69-82 1-14 (72)
40 PF06107 DUF951: Bacterial pro 21.3 57 0.0012 22.0 1.2 15 20-34 31-47 (57)
41 PF07282 OrfB_Zn_ribbon: Putat 21.2 68 0.0015 20.5 1.6 19 15-33 41-59 (69)
42 KOG3286 Selenoprotein T [Gener 21.1 59 0.0013 27.3 1.6 14 68-81 71-84 (226)
43 COG1326 Uncharacterized archae 20.6 86 0.0019 26.0 2.4 24 15-38 25-54 (201)
44 PF07754 DUF1610: Domain of un 20.3 76 0.0017 17.9 1.4 16 12-27 8-23 (24)
No 1
>PLN00186 ribosomal protein S26; Provisional
Probab=100.00 E-value=2.9e-63 Score=365.83 Aligned_cols=104 Identities=85% Similarity=1.403 Sum_probs=101.9
Q ss_pred CCcccccCCCCCCCCCcccceeecCCcceeecccceeeeeecccchhhHHhhHHhhccccccccceeeeeeEEeeeecee
Q 033056 1 MTFKRRNGGRNKHGRGHVNFIRCSNCGKCCPKDKAIKRFLVRNIVEQAAVRDVQDACIYDNYVLPKLYAKMQYCVSCAIH 80 (128)
Q Consensus 1 M~kKRrNnGr~KkgrGhv~~V~C~NCgr~vPKDKAIKrf~irNiVEaaavrDi~eAsv~~~y~lPKlyvKl~YCVSCAIH 80 (128)
||+|||||||+|+|+|||++|+|+|||+|||||||||+|+|+||||+++++||+||+||++|.|||||+|+|||||||||
T Consensus 1 M~kKRrN~GR~K~~rGhv~~V~C~nCgr~vPKDKAIkrf~irniVe~aa~rDl~~a~vy~~y~lPKly~K~~YCVSCAIH 80 (109)
T PLN00186 1 MTKKRRNGGRNKHGRGHVKRIRCSNCGKCVPKDKAIKRFLVRNIVEQAALRDVQEACVYDGYTLPKLYAKVQYCISCAIH 80 (109)
T ss_pred CCcccccCCCCCCCCCCCcceeeCCCcccccccceEEEEecccCccHHHHHHHHhhhcccccccchhhhceEEEEeehhc
Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred cccccccChhhhcccCCCccccccc
Q 033056 81 SHVVRVRSRTDRRNREPPKRFMRRR 105 (128)
Q Consensus 81 skVVRvRS~e~RK~r~pp~r~~~~~ 105 (128)
++|||+||+|+||+|+||++| ++.
T Consensus 81 ~~iVRvRs~e~Rk~r~pp~r~-~~~ 104 (109)
T PLN00186 81 SRVVRVRSRENRRIREPPPRF-RRR 104 (109)
T ss_pred cceeecCChHHccccCCCccc-ccc
Confidence 999999999999999999998 553
No 2
>PTZ00172 40S ribosomal protein S26; Provisional
Probab=100.00 E-value=3.8e-63 Score=364.82 Aligned_cols=104 Identities=71% Similarity=1.197 Sum_probs=101.7
Q ss_pred CCcccccCCCCCCCCCcccceeecCCcceeecccceeeeeecccchhhHHhhHHhhccccccccceeeeeeEEeeeecee
Q 033056 1 MTFKRRNGGRNKHGRGHVNFIRCSNCGKCCPKDKAIKRFLVRNIVEQAAVRDVQDACIYDNYVLPKLYAKMQYCVSCAIH 80 (128)
Q Consensus 1 M~kKRrNnGr~KkgrGhv~~V~C~NCgr~vPKDKAIKrf~irNiVEaaavrDi~eAsv~~~y~lPKlyvKl~YCVSCAIH 80 (128)
||+|||||||+|+|+|||++|+|+|||+|||||||||+|+|+||||+|+++||+||+||++|+|||||+|+|||||||||
T Consensus 1 M~kKRrN~GR~K~~rGhv~~V~C~nCgr~vPKDKAIkrf~irniVe~aa~rDl~~a~v~~~y~lPKly~k~~YCVSCAIH 80 (108)
T PTZ00172 1 MTSKRRNNGRSKHGRGHVKPVRCSNCGRCVPKDKAIKRFVVRNIVDAASVRDIAEASVYYGYPLPKLYMKQQYCVSCAIH 80 (108)
T ss_pred CCcccccCCCCCCCCCCCccEEeCCccccccccceEEEEeccCCccHHHHHHHHHhhchhccccccceeeeEEeeehhhc
Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred cccccccChhhhcccCCCcccccc
Q 033056 81 SHVVRVRSRTDRRNREPPKRFMRR 104 (128)
Q Consensus 81 skVVRvRS~e~RK~r~pp~r~~~~ 104 (128)
++|||+||+|+||+|+||+++.++
T Consensus 81 ~~iVRvRs~e~Rk~r~pp~r~~~~ 104 (108)
T PTZ00172 81 SRVVRVRSREDRKIRTPPKRPFRP 104 (108)
T ss_pred CCeeecCChHHccccCCCCCCCCC
Confidence 999999999999999999988444
No 3
>PF01283 Ribosomal_S26e: Ribosomal protein S26e; InterPro: IPR000892 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites [, ]. About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits. Many ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome [, ]. A number of eukaryotic ribosomal proteins can be grouped on the basis of sequence similarities. One of these families, the S26E family, includes mammalian S26 []; Octopus S26 []; Drosophila S26 (DS31) []; plant cytoplasmic S26; and fungal S26 []. These proteins have 114 to 127 amino acids.; GO: 0003735 structural constituent of ribosome, 0006412 translation, 0005622 intracellular, 0005840 ribosome; PDB: 3U5G_a 3U5C_a 2XZM_5 2XZN_5.
Probab=100.00 E-value=1.1e-63 Score=369.81 Aligned_cols=105 Identities=70% Similarity=1.249 Sum_probs=73.0
Q ss_pred CCcccccCCCCCCCCCcccceeecCCcceeecccceeeeeecccchhhHHhhHHhhccccccccceeeeeeEEeeeecee
Q 033056 1 MTFKRRNGGRNKHGRGHVNFIRCSNCGKCCPKDKAIKRFLVRNIVEQAAVRDVQDACIYDNYVLPKLYAKMQYCVSCAIH 80 (128)
Q Consensus 1 M~kKRrNnGr~KkgrGhv~~V~C~NCgr~vPKDKAIKrf~irNiVEaaavrDi~eAsv~~~y~lPKlyvKl~YCVSCAIH 80 (128)
||+|||||||+|+|+|||++|+|+|||+|||||||||+|+|+||||++++|||+||+||++|+|||||+|+|||||||||
T Consensus 1 M~~KRrN~Gr~KkgrGhv~~V~C~nCgr~vPKDKAIkrf~i~niVeaaa~rdi~~a~v~~~y~lPKlyvK~~YCvSCAIH 80 (113)
T PF01283_consen 1 MTKKRRNNGRSKKGRGHVQPVRCDNCGRCVPKDKAIKRFVIRNIVEAAAVRDISEASVYDAYVLPKLYVKLYYCVSCAIH 80 (113)
T ss_dssp -----TTTTSS-SSSS---EEE-TTTB-EEECCCSEEEEEEEESS-CCCHHHHHHCB-SSS--S-EEEEEEEE-CHHHHH
T ss_pred CCcccccCCCCCCCCCCCcCEeeCcccccCcCCceEEEEEccCCccHHHHHHHhhcceeeecccccceeEEEEeeeeeee
Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred cccccccChhhhcccCCCcccccccC
Q 033056 81 SHVVRVRSRTDRRNREPPKRFMRRRD 106 (128)
Q Consensus 81 skVVRvRS~e~RK~r~pp~r~~~~~~ 106 (128)
++|||+||+|+||+|+||++| ++..
T Consensus 81 ~~IVr~Rs~e~RK~r~~p~~~-~~~~ 105 (113)
T PF01283_consen 81 SKIVRVRSREERKDRTPPPRF-RPRK 105 (113)
T ss_dssp TTSS----TCCCC--S----------
T ss_pred ccccccCChHHccccCCCCcC-Cccc
Confidence 999999999999999999999 5543
No 4
>PRK09335 30S ribosomal protein S26e; Provisional
Probab=100.00 E-value=4e-57 Score=326.76 Aligned_cols=94 Identities=30% Similarity=0.675 Sum_probs=92.2
Q ss_pred CCcccccCCCCCCCCCcccceeecCCcceeecccceeeeeecccchhhHHhhHHhhccccccccceeeeeeEEeeeecee
Q 033056 1 MTFKRRNGGRNKHGRGHVNFIRCSNCGKCCPKDKAIKRFLVRNIVEQAAVRDVQDACIYDNYVLPKLYAKMQYCVSCAIH 80 (128)
Q Consensus 1 M~kKRrNnGr~KkgrGhv~~V~C~NCgr~vPKDKAIKrf~irNiVEaaavrDi~eAsv~~~y~lPKlyvKl~YCVSCAIH 80 (128)
||+|||||||+|+|+||+++|+|+|||+|||||||||+|+|+||||+++++||+||++| |||||+|+|||||||||
T Consensus 1 M~kKRrn~GR~K~~rGhv~~V~C~nCgr~vPKDKAIkrf~i~n~Ve~a~~rdl~~a~~~----lpk~~~k~~YCvSCAiH 76 (95)
T PRK09335 1 MPKKRENRGRRKGDKGHVGYVQCDNCGRRVPRDKAVCVTKMYSPVDPQLAKELEKKGAI----IARYPVTKCYCVNCAVH 76 (95)
T ss_pred CCcccccCCCCCCCCCCCccEEeCCCCCcCcCCceEEEEEecCCCCHHHHHHHHhCcee----eeeeeeeeEEechhhhh
Confidence 99999999999999999999999999999999999999999999999999999999887 99999999999999999
Q ss_pred cccccccChhhhcccCCC
Q 033056 81 SHVVRVRSRTDRRNREPP 98 (128)
Q Consensus 81 skVVRvRS~e~RK~r~pp 98 (128)
++|||+||+|+||+|+|.
T Consensus 77 ~~IVrvRs~e~Rk~r~~~ 94 (95)
T PRK09335 77 LGIIKIRPEEERKKKAPL 94 (95)
T ss_pred ccccccCChHHcccccCC
Confidence 999999999999999864
No 5
>KOG1768 consensus 40s ribosomal protein S26 [Translation, ribosomal structure and biogenesis]
Probab=100.00 E-value=9.4e-54 Score=317.61 Aligned_cols=103 Identities=72% Similarity=1.225 Sum_probs=101.2
Q ss_pred CCcccccCCCCCCCCCcccceeecCCcceeecccceeeeeecccchhhHHhhHHhhccccccccceeeeeeEEeeeecee
Q 033056 1 MTFKRRNGGRNKHGRGHVNFIRCSNCGKCCPKDKAIKRFLVRNIVEQAAVRDVQDACIYDNYVLPKLYAKMQYCVSCAIH 80 (128)
Q Consensus 1 M~kKRrNnGr~KkgrGhv~~V~C~NCgr~vPKDKAIKrf~irNiVEaaavrDi~eAsv~~~y~lPKlyvKl~YCVSCAIH 80 (128)
|++||+|+|++|+|+||+.+|+|+||++|+|||||||+|+|+||||++++|||+|||||++|+|||||+|||||||||||
T Consensus 1 m~~kr~~~gr~k~~~g~v~~i~c~~c~~~~~kdKaIk~f~i~niVEaaavrdiseasv~d~y~~pKly~Klhycvscaih 80 (115)
T KOG1768|consen 1 MTKKRRNAGRNKKGRGHVIPIRCTNCGRCMPKDKAIKRFVIRNIVEAAAVRDISEASVFDAYVLPKLYVKLHYCVSCAIH 80 (115)
T ss_pred CCcccccCCCCCCCCcceeeeeeccccccchHHHHHHHHHHHHHHHHHHhhhhhhheeccccccccccceeeeeEeeeee
Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred cccccccChhhhcccCCCcccccc
Q 033056 81 SHVVRVRSRTDRRNREPPKRFMRR 104 (128)
Q Consensus 81 skVVRvRS~e~RK~r~pp~r~~~~ 104 (128)
++|||+||.|.||+|+||++| .+
T Consensus 81 skVvR~rS~e~rrir~pp~rf-~~ 103 (115)
T KOG1768|consen 81 SKVVRVRSREARRIRTPPPRF-SP 103 (115)
T ss_pred eeeeccchhhhhcccCCCccc-Cc
Confidence 999999999999999999988 44
No 6
>COG4830 RPS26B Ribosomal protein S26 [Translation, ribosomal structure and biogenesis]
Probab=100.00 E-value=4.6e-52 Score=304.81 Aligned_cols=97 Identities=63% Similarity=1.108 Sum_probs=96.7
Q ss_pred CCcccccCCCCCCCCCcccceeecCCcceeecccceeeeeecccchhhHHhhHHhhccccccccceeeeeeEEeeeecee
Q 033056 1 MTFKRRNGGRNKHGRGHVNFIRCSNCGKCCPKDKAIKRFLVRNIVEQAAVRDVQDACIYDNYVLPKLYAKMQYCVSCAIH 80 (128)
Q Consensus 1 M~kKRrNnGr~KkgrGhv~~V~C~NCgr~vPKDKAIKrf~irNiVEaaavrDi~eAsv~~~y~lPKlyvKl~YCVSCAIH 80 (128)
||+||+||||+|+|+||+.+|+|+|||..||||||||+|.|+|+||+++++||++|++|+.|.+||+|.|+|||||||||
T Consensus 1 mpkkR~N~GR~K~~rGhv~~v~CdnCg~~vPkdKAikr~~i~s~Ve~a~~rdL~~asIy~~y~vpk~~~k~qyCVsCAih 80 (108)
T COG4830 1 MPKKRRNRGRNKKGRGHVKYVRCDNCGKAVPKDKAIKRTAIRSPVEAAAARDLSEASIYSEYAVPKTYNKLQYCVSCAIH 80 (108)
T ss_pred CcchhhhcCCCCCCCCCccceeeccccccCCccceeeEeeccCcccHHHHHHHhhceeeeeeeccccccceeeeeeeeee
Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred cccccccChhhhcccCC
Q 033056 81 SHVVRVRSRTDRRNREP 97 (128)
Q Consensus 81 skVVRvRS~e~RK~r~p 97 (128)
++|||+||+|+||+++|
T Consensus 81 ~~IvrVRSre~RK~r~p 97 (108)
T COG4830 81 ARIVRVRSREERKIRAP 97 (108)
T ss_pred eeEEEEecchhhhhcCC
Confidence 99999999999999998
No 7
>COG1400 SEC65 Signal recognition particle 19 kDa protein [Intracellular trafficking and secretion]
Probab=68.73 E-value=1.1 Score=32.72 Aligned_cols=38 Identities=24% Similarity=0.388 Sum_probs=26.5
Q ss_pred cceeecccceeeeeecccchhhHHhhHHhhc-cccccccce
Q 033056 27 GKCCPKDKAIKRFLVRNIVEQAAVRDVQDAC-IYDNYVLPK 66 (128)
Q Consensus 27 gr~vPKDKAIKrf~irNiVEaaavrDi~eAs-v~~~y~lPK 66 (128)
||+|||+.||..+...+|+| |+++|-=.. +.....-|+
T Consensus 20 GRrvpk~laV~~P~~~ei~~--a~~~LGl~~~v~~dk~yPr 58 (93)
T COG1400 20 GRRVPKELAVENPSLEEIAE--ALRELGLKPKVERDKKYPR 58 (93)
T ss_pred ccccchhhcccCCCHHHHHH--HHHHcCCCeeechhhcCCC
Confidence 59999999999999999888 456654322 333444444
No 8
>PF13119 DUF3973: Domain of unknown function (DUF3973)
Probab=64.74 E-value=2.5 Score=26.97 Aligned_cols=11 Identities=55% Similarity=1.428 Sum_probs=8.3
Q ss_pred eEEeeee-ceec
Q 033056 71 MQYCVSC-AIHS 81 (128)
Q Consensus 71 l~YCVSC-AIHs 81 (128)
++|||+| -||.
T Consensus 1 MyYCi~Cs~~h~ 12 (41)
T PF13119_consen 1 MYYCINCSEIHH 12 (41)
T ss_pred CEEEEEhHHhHH
Confidence 4799998 5664
No 9
>PF02591 DUF164: Putative zinc ribbon domain; InterPro: IPR003743 This entry describes proteins of unknown function.
Probab=56.47 E-value=5.6 Score=25.25 Aligned_cols=13 Identities=31% Similarity=0.841 Sum_probs=10.5
Q ss_pred ccceeecCCccee
Q 033056 18 VNFIRCSNCGKCC 30 (128)
Q Consensus 18 v~~V~C~NCgr~v 30 (128)
...+.|+||||.+
T Consensus 44 ~~i~~Cp~CgRiL 56 (56)
T PF02591_consen 44 DEIVFCPNCGRIL 56 (56)
T ss_pred CCeEECcCCCccC
Confidence 4678999999863
No 10
>PF07503 zf-HYPF: HypF finger; InterPro: IPR011125 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Proteins of the HypF family are involved in the maturation and regulation of hydrogenase []. In the N terminus they appear to have two zinc finger domains that are similar to those found in the DnaJ chaperone []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding; PDB: 3TTD_A 3TSQ_A 3TTC_A 3TSP_A 3TTF_A 3TSU_A.
Probab=54.96 E-value=5.6 Score=24.07 Aligned_cols=13 Identities=62% Similarity=1.300 Sum_probs=8.8
Q ss_pred CCcccceeecCCc
Q 033056 15 RGHVNFIRCSNCG 27 (128)
Q Consensus 15 rGhv~~V~C~NCg 27 (128)
|=|-++|-|++||
T Consensus 16 R~~~~~isC~~CG 28 (35)
T PF07503_consen 16 RFHYQFISCTNCG 28 (35)
T ss_dssp TTT-TT--BTTCC
T ss_pred cccCcCccCCCCC
Confidence 4688999999999
No 11
>PF08209 Sgf11: Sgf11 (transcriptional regulation protein); InterPro: IPR013246 The Sgf11 family is a SAGA complex subunit in Saccharomyces cerevisiae (Baker's yeast). The SAGA complex is a multisubunit protein complex involved in transcriptional regulation. SAGA combines proteins involved in interactions with DNA-bound activators and TATA-binding protein (TBP), as well as enzymes for histone acetylation and deubiquitylation [].; PDB: 3M99_B 2LO2_A 3MHH_C 3MHS_C.
Probab=53.01 E-value=8.6 Score=23.15 Aligned_cols=15 Identities=27% Similarity=0.888 Sum_probs=10.3
Q ss_pred ccceeecCCcceeec
Q 033056 18 VNFIRCSNCGKCCPK 32 (128)
Q Consensus 18 v~~V~C~NCgr~vPK 32 (128)
...+.|.||+|-|.-
T Consensus 2 ~~~~~C~nC~R~v~a 16 (33)
T PF08209_consen 2 SPYVECPNCGRPVAA 16 (33)
T ss_dssp S-EEE-TTTSSEEEG
T ss_pred CCeEECCCCcCCcch
Confidence 357899999997753
No 12
>PF09889 DUF2116: Uncharacterized protein containing a Zn-ribbon (DUF2116); InterPro: IPR019216 This entry contains various hypothetical prokaryotic proteins whose functions are unknown. They contain a conserved zinc ribbon motif in the N-terminal part and a predicted transmembrane segment in the C-terminal part.
Probab=46.51 E-value=7.9 Score=26.01 Aligned_cols=17 Identities=35% Similarity=0.735 Sum_probs=14.6
Q ss_pred eeecCCcceeeccccee
Q 033056 21 IRCSNCGKCCPKDKAIK 37 (128)
Q Consensus 21 V~C~NCgr~vPKDKAIK 37 (128)
-||-+||.-+|-|++..
T Consensus 4 kHC~~CG~~Ip~~~~fC 20 (59)
T PF09889_consen 4 KHCPVCGKPIPPDESFC 20 (59)
T ss_pred CcCCcCCCcCCcchhhh
Confidence 37999999999998765
No 13
>TIGR01031 rpmF_bact ribosomal protein L32. This protein describes bacterial ribosomal protein L32. The noise cutoff is set low enough to include the equivalent protein from mitochondria and chloroplasts. No related proteins from the Archaea nor from the eukaryotic cytosol are detected by this model. This model is a fragment model; the putative L32 of some species shows similarity only toward the N-terminus.
Probab=44.35 E-value=18 Score=23.64 Aligned_cols=28 Identities=25% Similarity=0.680 Sum_probs=17.9
Q ss_pred CcccccCCCCCCCCCc------ccceeecCCcce
Q 033056 2 TFKRRNGGRNKHGRGH------VNFIRCSNCGKC 29 (128)
Q Consensus 2 ~kKRrNnGr~KkgrGh------v~~V~C~NCgr~ 29 (128)
||+|-+..|..+=|.| ...+.|.+||..
T Consensus 2 PKrk~Sksr~~~RRah~~kl~~p~l~~C~~cG~~ 35 (55)
T TIGR01031 2 PKRKTSKSRKRKRRSHDAKLTAPTLVVCPNCGEF 35 (55)
T ss_pred CCCcCCcccccchhcCcccccCCcceECCCCCCc
Confidence 4555555555555555 457889999964
No 14
>PRK04016 DNA-directed RNA polymerase subunit N; Provisional
Probab=41.67 E-value=13 Score=25.53 Aligned_cols=14 Identities=36% Similarity=0.764 Sum_probs=11.4
Q ss_pred cceeecCCcceeec
Q 033056 19 NFIRCSNCGKCCPK 32 (128)
Q Consensus 19 ~~V~C~NCgr~vPK 32 (128)
-||+|..||+.+--
T Consensus 3 iPvRCFTCGkvi~~ 16 (62)
T PRK04016 3 IPVRCFTCGKVIAE 16 (62)
T ss_pred CCeEecCCCCChHH
Confidence 48999999997743
No 15
>COG4481 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=40.79 E-value=16 Score=25.03 Aligned_cols=12 Identities=42% Similarity=0.969 Sum_probs=9.5
Q ss_pred cceeecCCccee
Q 033056 19 NFIRCSNCGKCC 30 (128)
Q Consensus 19 ~~V~C~NCgr~v 30 (128)
-.|.|+|||..|
T Consensus 33 IkikC~nC~h~v 44 (60)
T COG4481 33 IKIKCENCGHSV 44 (60)
T ss_pred EEEEecCCCcEE
Confidence 468899999954
No 16
>PF04726 Microvir_J: Microvirus J protein; InterPro: IPR006815 This small protein is involved in DNA packaging, interacting with DNA via its hydrophobic C terminus. In bacteriophage phi-X174, J is present in 60 copies, and forms an S-shaped polypeptide chain without any secondary structure. It is thought to interact with DNA through simple charge interactions [].; GO: 0003677 DNA binding, 0019073 viral DNA genome packaging, 0019028 viral capsid; PDB: 1M06_J 1GFF_3 1RB8_J 2BPA_3.
Probab=40.49 E-value=16 Score=20.96 Aligned_cols=15 Identities=53% Similarity=0.851 Sum_probs=7.7
Q ss_pred CCcccccCCCCCCCC
Q 033056 1 MTFKRRNGGRNKHGR 15 (128)
Q Consensus 1 M~kKRrNnGr~Kkgr 15 (128)
|-++||+.|++|+.|
T Consensus 1 ~k~~rrs~~~~kgar 15 (24)
T PF04726_consen 1 MKSKRRSGGKRKGAR 15 (24)
T ss_dssp --GGGS---SSSSS-
T ss_pred CcccccCCCccCceE
Confidence 567899999999865
No 17
>COG1644 RPB10 DNA-directed RNA polymerase, subunit N (RpoN/RPB10) [Transcription]
Probab=39.94 E-value=13 Score=25.71 Aligned_cols=13 Identities=38% Similarity=0.754 Sum_probs=10.6
Q ss_pred cceeecCCcceee
Q 033056 19 NFIRCSNCGKCCP 31 (128)
Q Consensus 19 ~~V~C~NCgr~vP 31 (128)
-||||-+||+.+-
T Consensus 3 iPiRCFsCGkvi~ 15 (63)
T COG1644 3 IPVRCFSCGKVIG 15 (63)
T ss_pred CceEeecCCCCHH
Confidence 4899999998653
No 18
>COG5112 UFD2 U1-like Zn-finger-containing protein [General function prediction only]
Probab=39.57 E-value=9.9 Score=29.18 Aligned_cols=31 Identities=23% Similarity=0.339 Sum_probs=19.0
Q ss_pred HHhhHHhhccccc----cccceeeeeeEEeeeeceec
Q 033056 49 AVRDVQDACIYDN----YVLPKLYAKMQYCVSCAIHS 81 (128)
Q Consensus 49 avrDi~eAsv~~~----y~lPKlyvKl~YCVSCAIHs 81 (128)
.-.||++..-++- -.||- .-.|||+.||-|.
T Consensus 31 i~nDls~~Es~~Klp~Dp~lPG--lGqhYCieCaryf 65 (126)
T COG5112 31 IKNDLSTKESQKKLPYDPELPG--LGQHYCIECARYF 65 (126)
T ss_pred HHHhcchhhhhccCCCCCCCCC--CceeeeehhHHHH
Confidence 3457765444432 23443 4689999999774
No 19
>COG5134 Uncharacterized conserved protein [Function unknown]
Probab=39.47 E-value=16 Score=31.13 Aligned_cols=44 Identities=30% Similarity=0.502 Sum_probs=28.9
Q ss_pred ceeecCCcceeecccceeeeeecccchhhHHhhHHhhccccccccceeeeeeEEeeee
Q 033056 20 FIRCSNCGKCCPKDKAIKRFLVRNIVEQAAVRDVQDACIYDNYVLPKLYAKMQYCVSC 77 (128)
Q Consensus 20 ~V~C~NCgr~vPKDKAIKrf~irNiVEaaavrDi~eAsv~~~y~lPKlyvKl~YCVSC 77 (128)
+|+|-||+-.+||.+-. | ++++|...--|.+ -|.|.-..-|--|
T Consensus 42 ~~RCL~C~~YI~K~~rf------N-----avkE~~~dK~y~~---~kiYRf~I~C~~C 85 (272)
T COG5134 42 PVRCLNCENYIQKGTRF------N-----AVKEEIGDKSYYT---TKIYRFSIKCHLC 85 (272)
T ss_pred ceeecchhhhhhcccch------h-----HHHHHhcccccce---eEEEEEEEEccCC
Confidence 79999999999998744 3 6777766444433 3455444444444
No 20
>PF13248 zf-ribbon_3: zinc-ribbon domain
Probab=36.70 E-value=14 Score=20.45 Aligned_cols=13 Identities=38% Similarity=0.820 Sum_probs=9.5
Q ss_pred eeecCCcceeecc
Q 033056 21 IRCSNCGKCCPKD 33 (128)
Q Consensus 21 V~C~NCgr~vPKD 33 (128)
+.|.|||.-++.|
T Consensus 3 ~~Cp~Cg~~~~~~ 15 (26)
T PF13248_consen 3 MFCPNCGAEIDPD 15 (26)
T ss_pred CCCcccCCcCCcc
Confidence 5688888866655
No 21
>PLN00032 DNA-directed RNA polymerase; Provisional
Probab=36.12 E-value=18 Score=25.49 Aligned_cols=12 Identities=42% Similarity=0.850 Sum_probs=10.3
Q ss_pred cceeecCCccee
Q 033056 19 NFIRCSNCGKCC 30 (128)
Q Consensus 19 ~~V~C~NCgr~v 30 (128)
-||||-.||+.+
T Consensus 3 iPVRCFTCGkvi 14 (71)
T PLN00032 3 IPVRCFTCGKVI 14 (71)
T ss_pred CceeecCCCCCc
Confidence 389999999876
No 22
>KOG2612 consensus Predicted integral membrane protein [Function unknown]
Probab=34.75 E-value=14 Score=27.54 Aligned_cols=14 Identities=21% Similarity=0.434 Sum_probs=11.4
Q ss_pred cccceeecCCccee
Q 033056 17 HVNFIRCSNCGKCC 30 (128)
Q Consensus 17 hv~~V~C~NCgr~v 30 (128)
..+.++|.||+|.|
T Consensus 71 k~~~~hCeNC~RdV 84 (103)
T KOG2612|consen 71 KPMDCHCENCDRDV 84 (103)
T ss_pred CCccccCCCCccHH
Confidence 45678999999976
No 23
>PF10122 Mu-like_Com: Mu-like prophage protein Com; InterPro: IPR019294 Members of this entry belong to the Com family of proteins that act as translational regulators of mom [, ].
Probab=33.63 E-value=19 Score=23.89 Aligned_cols=18 Identities=33% Similarity=0.667 Sum_probs=13.4
Q ss_pred cceeecCCcceeecccce
Q 033056 19 NFIRCSNCGKCCPKDKAI 36 (128)
Q Consensus 19 ~~V~C~NCgr~vPKDKAI 36 (128)
+-|||.+|++++-+-..+
T Consensus 3 ~eiRC~~CnklLa~~g~~ 20 (51)
T PF10122_consen 3 KEIRCGHCNKLLAKAGEV 20 (51)
T ss_pred cceeccchhHHHhhhcCc
Confidence 568999999988774333
No 24
>KOG3408 consensus U1-like Zn-finger-containing protein, probabl erole in RNA processing/splicing [RNA processing and modification]
Probab=33.19 E-value=13 Score=28.76 Aligned_cols=18 Identities=33% Similarity=0.545 Sum_probs=13.0
Q ss_pred cccceeeeeeEEeeeeceec
Q 033056 62 YVLPKLYAKMQYCVSCAIHS 81 (128)
Q Consensus 62 y~lPKlyvKl~YCVSCAIHs 81 (128)
+-||- .-++||+.||-|.
T Consensus 50 ~dlPG--~GqfyCi~CaRyF 67 (129)
T KOG3408|consen 50 PDLPG--GGQFYCIECARYF 67 (129)
T ss_pred CCCCC--Cceeehhhhhhhh
Confidence 34553 4589999999774
No 25
>PF01922 SRP19: SRP19 protein; InterPro: IPR002778 The signal recognition particle (SRP) is a multimeric protein, which along with its conjugate receptor (SR), is involved in targeting secretory proteins to the rough endoplasmic reticulum (RER) membrane in eukaryotes, or to the plasma membrane in prokaryotes [, ]. SRP recognises the signal sequence of the nascent polypeptide on the ribosome, retards its elongation, and docks the SRP-ribosome-polypeptide complex to the RER membrane via the SR receptor. Eukaryotic SRP consists of six polypeptides (SRP9, SRP14, SRP19, SRP54, SRP68 and SRP72) and a single 300 nucleotide 7S RNA molecule. The RNA component catalyses the interaction of SRP with its SR receptor []. In higher eukaryotes, the SRP complex consists of the Alu domain and the S domain linked by the SRP RNA. The Alu domain consists of a heterodimer of SRP9 and SRP14 bound to the 5' and 3' terminal sequences of SRP RNA. This domain is necessary for retarding the elongation of the nascent polypeptide chain, which gives SRP time to dock the ribosome-polypeptide complex to the RER membrane. In archaea, the SRP complex contains 7S RNA like its eukaryotic counterpart, yet only includes two of the six protein subunits found in the eukarytic complex: SRP19 and SRP54 []. This entry represents the SRP19 subunit. The SRP19 protein is unstructured but forms a compact core domain and two extended RNA-binding loops upon binding the signal recognition particle (SRP) RNA [].; GO: 0008312 7S RNA binding, 0006614 SRP-dependent cotranslational protein targeting to membrane, 0048500 signal recognition particle; PDB: 3DLU_A 3DLV_B 2J37_B 1MFQ_B 3KTV_D 1RY1_B 1JID_A 1KVV_A 1KVN_A 3KTW_B ....
Probab=32.22 E-value=11 Score=26.83 Aligned_cols=22 Identities=23% Similarity=0.412 Sum_probs=13.6
Q ss_pred Ccceeecccceeeeeecccchh
Q 033056 26 CGKCCPKDKAIKRFLVRNIVEQ 47 (128)
Q Consensus 26 Cgr~vPKDKAIKrf~irNiVEa 47 (128)
-||.|||+.|+..-.+..|.++
T Consensus 16 ~GRrv~k~~aV~~P~~~EI~~a 37 (95)
T PF01922_consen 16 EGRRVPKELAVENPTLEEIADA 37 (95)
T ss_dssp TT--SSTTTSBSS--HHHHHHH
T ss_pred hccccChhhcCCCCCHHHHHHH
Confidence 4799999999986666665553
No 26
>PF01194 RNA_pol_N: RNA polymerases N / 8 kDa subunit; InterPro: IPR000268 In eukaryotes, there are three different forms of DNA-dependent RNA polymerases (2.7.7.6 from EC) transcribing different sets of genes. Each class of RNA polymerase is an assemblage of ten to twelve different polypeptides. In archaebacteria, there is generally a single form of RNA polymerase which also consists of an oligomeric assemblage of 10 to 13 polypeptides. Archaebacterial subunit N (gene rpoN) [] is a small protein of about 8 kDa, it is evolutionary related [] to a 8.3 kDa component shared by all three forms of eukaryotic RNA polymerases (gene RPB10 in yeast and POLR2J in mammals) as well as to African swine fever virus (ASFV) protein CP80R []. There is a conserved region which is located at the N-terminal extremity of these polymerase subunits; this region contains two cysteines that binds a zinc ion [].; GO: 0003677 DNA binding, 0003899 DNA-directed RNA polymerase activity, 0006351 transcription, DNA-dependent; PDB: 2PMZ_N 3HKZ_N 1EF4_A 3H0G_V 2Y0S_N 2R92_J 3M4O_J 3S2D_J 1R9S_J 1Y1W_J ....
Probab=31.29 E-value=17 Score=24.71 Aligned_cols=12 Identities=42% Similarity=0.883 Sum_probs=9.5
Q ss_pred ceeecCCcceee
Q 033056 20 FIRCSNCGKCCP 31 (128)
Q Consensus 20 ~V~C~NCgr~vP 31 (128)
||||-.||+.+-
T Consensus 4 PVRCFTCGkvi~ 15 (60)
T PF01194_consen 4 PVRCFTCGKVIG 15 (60)
T ss_dssp SSS-STTTSBTC
T ss_pred ceecCCCCCChh
Confidence 899999998774
No 27
>COG2888 Predicted Zn-ribbon RNA-binding protein with a function in translation [Translation, ribosomal structure and biogenesis]
Probab=30.67 E-value=34 Score=23.49 Aligned_cols=35 Identities=29% Similarity=0.436 Sum_probs=26.3
Q ss_pred ccceeecCCccee-ecccceeeeeecccchhhHHhhH
Q 033056 18 VNFIRCSNCGKCC-PKDKAIKRFLVRNIVEQAAVRDV 53 (128)
Q Consensus 18 v~~V~C~NCgr~v-PKDKAIKrf~irNiVEaaavrDi 53 (128)
..+-.|++||+-+ |...|++ |.=-|-=|....|..
T Consensus 7 ~~~~~CtSCg~~i~p~e~~v~-F~CPnCGe~~I~Rc~ 42 (61)
T COG2888 7 KDPPVCTSCGREIAPGETAVK-FPCPNCGEVEIYRCA 42 (61)
T ss_pred cCCceeccCCCEeccCCceeE-eeCCCCCceeeehhh
Confidence 3456899999999 8888886 888887666555543
No 28
>PF06639 BAP: Basal layer antifungal peptide (BAP); InterPro: IPR009540 This family consists of several basal layer antifungal peptide (BAP) sequences specific to Zea mays (Maize). The BAP2 peptide exhibits potent broad-range activity against a range of filamentous fungi, including several plant pathogens [].
Probab=27.80 E-value=14 Score=26.22 Aligned_cols=23 Identities=26% Similarity=0.600 Sum_probs=18.2
Q ss_pred eeeceecccccccChhhhcccCC
Q 033056 75 VSCAIHSHVVRVRSRTDRRNREP 97 (128)
Q Consensus 75 VSCAIHskVVRvRS~e~RK~r~p 97 (128)
-||.+|++|++-+-+|+-.-+.+
T Consensus 5 AS~V~hA~ii~Gqtke~~nt~s~ 27 (75)
T PF06639_consen 5 ASCVIHAHIISGQTKEDSNTGSM 27 (75)
T ss_pred hhhHhhHHhhcCceeeccCCCce
Confidence 48999999999988887655433
No 29
>KOG3198 consensus Signal recognition particle, subunit Srp19 [Intracellular trafficking, secretion, and vesicular transport]
Probab=26.66 E-value=18 Score=28.79 Aligned_cols=18 Identities=39% Similarity=0.696 Sum_probs=13.4
Q ss_pred cceeecccceeeeeeccc
Q 033056 27 GKCCPKDKAIKRFLVRNI 44 (128)
Q Consensus 27 gr~vPKDKAIKrf~irNi 44 (128)
||++||||||..=.-.+|
T Consensus 32 GRripke~aVeNP~a~eI 49 (152)
T KOG3198|consen 32 GRRIPKEKAVENPLAKEI 49 (152)
T ss_pred ccccCHHHhhcCcchhHH
Confidence 699999999975444444
No 30
>PF03604 DNA_RNApol_7kD: DNA directed RNA polymerase, 7 kDa subunit; InterPro: IPR006591 DNA-dependent RNA polymerase catalyzes the transcription of DNA into RNA using the four ribonucleoside triphosphates as substrates. Each class of RNA polymerase is assembled from 9 to 15 different polypeptides. Rbp10 (RNA polymerase CX) is a domain found in RNA polymerase subunit 10; present in RNA polymerase I, II and III.; GO: 0003677 DNA binding, 0003899 DNA-directed RNA polymerase activity, 0006351 transcription, DNA-dependent; PDB: 2PMZ_Z 3HKZ_X 2NVX_L 3S1Q_L 2JA6_L 3S17_L 3HOW_L 3HOV_L 3PO2_L 3HOZ_L ....
Probab=26.31 E-value=26 Score=20.79 Aligned_cols=13 Identities=38% Similarity=0.820 Sum_probs=9.3
Q ss_pred ccceeecCCccee
Q 033056 18 VNFIRCSNCGKCC 30 (128)
Q Consensus 18 v~~V~C~NCgr~v 30 (128)
..+|+|.+||--+
T Consensus 15 ~~~irC~~CG~RI 27 (32)
T PF03604_consen 15 GDPIRCPECGHRI 27 (32)
T ss_dssp SSTSSBSSSS-SE
T ss_pred CCcEECCcCCCeE
Confidence 3579999999643
No 31
>PF13240 zinc_ribbon_2: zinc-ribbon domain
Probab=26.08 E-value=28 Score=19.06 Aligned_cols=12 Identities=42% Similarity=0.894 Sum_probs=8.8
Q ss_pred eecCCcceeecc
Q 033056 22 RCSNCGKCCPKD 33 (128)
Q Consensus 22 ~C~NCgr~vPKD 33 (128)
.|.+||.-++.|
T Consensus 1 ~Cp~CG~~~~~~ 12 (23)
T PF13240_consen 1 YCPNCGAEIEDD 12 (23)
T ss_pred CCcccCCCCCCc
Confidence 378888877754
No 32
>PRK12286 rpmF 50S ribosomal protein L32; Reviewed
Probab=24.18 E-value=47 Score=21.88 Aligned_cols=29 Identities=24% Similarity=0.626 Sum_probs=16.5
Q ss_pred CcccccCCCCCCCCCc-----ccceeecCCccee
Q 033056 2 TFKRRNGGRNKHGRGH-----VNFIRCSNCGKCC 30 (128)
Q Consensus 2 ~kKRrNnGr~KkgrGh-----v~~V~C~NCgr~v 30 (128)
||+|-+..|.-+=|.| ...+.|.+||-..
T Consensus 4 PKrk~S~srr~~RRsh~~l~~~~l~~C~~CG~~~ 37 (57)
T PRK12286 4 PKRKTSKSRKRKRRAHFKLKAPGLVECPNCGEPK 37 (57)
T ss_pred CcCcCChhhcchhcccccccCCcceECCCCCCcc
Confidence 4555444444444444 3466799998643
No 33
>KOG3497 consensus DNA-directed RNA polymerase, subunit RPB10 [Transcription]
Probab=24.15 E-value=35 Score=23.85 Aligned_cols=12 Identities=50% Similarity=0.874 Sum_probs=10.0
Q ss_pred cceeecCCccee
Q 033056 19 NFIRCSNCGKCC 30 (128)
Q Consensus 19 ~~V~C~NCgr~v 30 (128)
-||+|-.||..+
T Consensus 3 iPiRCFtCGKvi 14 (69)
T KOG3497|consen 3 IPIRCFTCGKVI 14 (69)
T ss_pred eeeEeeeccccc
Confidence 389999999865
No 34
>PF12230 PRP21_like_P: Pre-mRNA splicing factor PRP21 like protein; InterPro: IPR022030 This domain family is found in eukaryotes, and is typically between 212 and 238 amino acids in length. The family is found in association with PF01805 from PFAM. There are two completely conserved residues (W and H) that may be functionally important. PRP21 is required for assembly of the prespliceosome and it interacts with U2 snRNP and/or pre-mRNA in the prespliceosome. This family also contains proteins similar to PRP21, such as the mammalian SF3a. SF3a also interacts with U2 snRNP from the prespliceosome, converting it to its active form. ; PDB: 4DGW_B.
Probab=23.96 E-value=26 Score=27.77 Aligned_cols=32 Identities=16% Similarity=0.324 Sum_probs=0.0
Q ss_pred CcccceeecCCcceeecccceeeeeecccchhh
Q 033056 16 GHVNFIRCSNCGKCCPKDKAIKRFLVRNIVEQA 48 (128)
Q Consensus 16 Ghv~~V~C~NCgr~vPKDKAIKrf~irNiVEaa 48 (128)
..+..+.|..||..||-|+-=. -.-.+++|+.
T Consensus 164 ~~~~~~~cPitGe~IP~~e~~e-HmRi~LlDP~ 195 (229)
T PF12230_consen 164 PKEKMIICPITGEMIPADEMDE-HMRIELLDPR 195 (229)
T ss_dssp ---------------------------------
T ss_pred cccccccccccccccccccccc-cccccccccc
Confidence 4567899999999999998654 2223445443
No 35
>PF14832 Tautomerase_3: Putative oxalocrotonate tautomerase enzyme; PDB: 3C6V_C 3N4D_I 3N4G_C 3N4H_A 2FLZ_C 3MF8_A 3MF7_A 2FLT_A.
Probab=23.89 E-value=50 Score=25.02 Aligned_cols=25 Identities=28% Similarity=0.626 Sum_probs=18.1
Q ss_pred hHHhhHHhhccccccccceeeeeeEEe
Q 033056 48 AAVRDVQDACIYDNYVLPKLYAKMQYC 74 (128)
Q Consensus 48 aavrDi~eAsv~~~y~lPKlyvKl~YC 74 (128)
+++.+|.+ +|.++-||.+||-..+.
T Consensus 20 ~LA~~IT~--~y~~~glP~FyV~V~F~ 44 (136)
T PF14832_consen 20 ALAEAITD--IYTSIGLPAFYVNVRFI 44 (136)
T ss_dssp HHHHHHHH--HHHHTTTTGGG-EEEEE
T ss_pred HHHHHHHH--HHhCCCCCCEEEEEEEE
Confidence 45666665 67777899999998776
No 36
>PF10589 NADH_4Fe-4S: NADH-ubiquinone oxidoreductase-F iron-sulfur binding region; InterPro: IPR019575 NADH:ubiquinone oxidoreductase (complex I) (1.6.5.3 from EC) is a respiratory-chain enzyme that catalyses the transfer of two electrons from NADH to ubiquinone in a reaction that is associated with proton translocation across the membrane (NADH + ubiquinone = NAD+ + ubiquinol) []. Complex I is a major source of reactive oxygen species (ROS) that are predominantly formed by electron transfer from FMNH(2). Complex I is found in bacteria, cyanobacteria (as a NADH-plastoquinone oxidoreductase), archaea [], mitochondira, and in the hydrogenosome, a mitochondria-derived organelle. In general, the bacterial complex consists of 14 different subunits, while the mitochondrial complex contains homologues to these subunits in addition to approximately 31 additional proteins []. Mitochondrial complex I, which is located in the inner mitochondrial membrane, is the largest multimeric respiratory enzyme in the mitochondria, consisting of more than 40 subunits, one FMN co-factor and eight FeS clusters []. The assembly of mitochondrial complex I is an intricate process that requires the cooperation of the nuclear and mitochondrial genomes [, ]. Mitochondrial complex I can cycle between active and deactive forms that can be distinguished by the reactivity towards divalent cations and thiol-reactive agents. All redox prosthetic groups reside in the peripheral arm of the L-shaped structure. The NADH oxidation domain harbouring the FMN cofactor is connected via a chain of iron-sulphur clusters to the ubiquinone reduction site that is located in a large pocket formed by the PSST and 49kDa subunits of complex I []. This entry describes the F subunit of complexes that resemble NADH-quinone oxidoreductases. The electron acceptor is a quinone, ubiquinone, in mitochondria and most bacteria, including Escherichia coli, where the recommended gene symbol is nuoF. This family does not have any members in chloroplast or cyanobacteria, where the quinone may be plastoquinone and NADH may be replaced by NADPH, nor in Methanosarcina, where NADH is replaced by F420H2. This entry represents the iron-sulphur binding domain of the F subunit.; GO: 0055114 oxidation-reduction process; PDB: 3IAS_S 2FUG_A 3I9V_A 3M9S_1 3IAM_A 2YBB_1.
Probab=23.62 E-value=22 Score=22.16 Aligned_cols=10 Identities=50% Similarity=1.262 Sum_probs=4.2
Q ss_pred cCCcceeecc
Q 033056 24 SNCGKCCPKD 33 (128)
Q Consensus 24 ~NCgr~vPKD 33 (128)
.+||+|+|-=
T Consensus 14 ESCGkC~PCR 23 (46)
T PF10589_consen 14 ESCGKCTPCR 23 (46)
T ss_dssp H--S--HHHH
T ss_pred cCCCCCCCcH
Confidence 5799999953
No 37
>PRK14890 putative Zn-ribbon RNA-binding protein; Provisional
Probab=23.56 E-value=41 Score=22.85 Aligned_cols=27 Identities=41% Similarity=0.796 Sum_probs=18.6
Q ss_pred cceeecCCcc-eeecccceeeeeecccch
Q 033056 19 NFIRCSNCGK-CCPKDKAIKRFLVRNIVE 46 (128)
Q Consensus 19 ~~V~C~NCgr-~vPKDKAIKrf~irNiVE 46 (128)
.+..|++||+ +.|.++|.+ |.=-|==|
T Consensus 6 ~~~~CtSCg~~i~~~~~~~~-F~CPnCG~ 33 (59)
T PRK14890 6 EPPKCTSCGIEIAPREKAVK-FLCPNCGE 33 (59)
T ss_pred cCccccCCCCcccCCCccCE-eeCCCCCC
Confidence 4557999998 556888875 76665433
No 38
>PF13717 zinc_ribbon_4: zinc-ribbon domain
Probab=23.48 E-value=45 Score=19.81 Aligned_cols=14 Identities=36% Similarity=0.802 Sum_probs=10.8
Q ss_pred CcccceeecCCcce
Q 033056 16 GHVNFIRCSNCGKC 29 (128)
Q Consensus 16 Ghv~~V~C~NCgr~ 29 (128)
++...|+|++||..
T Consensus 21 ~~g~~v~C~~C~~~ 34 (36)
T PF13717_consen 21 PKGRKVRCSKCGHV 34 (36)
T ss_pred CCCcEEECCCCCCE
Confidence 45568999999864
No 39
>TIGR02174 CXXU_selWTH selT/selW/selH selenoprotein domain. This model represents a domain found in both bacteria and animals, including animal proteins SelT, SelW, and SelH, all of which are selenoproteins. In a CXXC motif near the N-terminus of the domain, selenocysteine may replace the second Cys. Proteins with this domain may include an insert of about 70 amino acids. This model is broader than the current SelW model pfam05169 in Pfam.
Probab=21.83 E-value=37 Score=22.59 Aligned_cols=14 Identities=29% Similarity=0.871 Sum_probs=9.9
Q ss_pred eeeEEeeeeceecc
Q 033056 69 AKMQYCVSCAIHSH 82 (128)
Q Consensus 69 vKl~YCVSCAIHsk 82 (128)
|...||.+|-...+
T Consensus 1 V~IeyC~~C~y~~R 14 (72)
T TIGR02174 1 VEIEYCGSCGYKPR 14 (72)
T ss_pred CEEEECCCCCChHH
Confidence 45789999974433
No 40
>PF06107 DUF951: Bacterial protein of unknown function (DUF951); InterPro: IPR009296 This family consists of several short hypothetical bacterial proteins of unknown function.
Probab=21.31 E-value=57 Score=22.00 Aligned_cols=15 Identities=40% Similarity=1.012 Sum_probs=11.1
Q ss_pred ceeecCCcc--eeeccc
Q 033056 20 FIRCSNCGK--CCPKDK 34 (128)
Q Consensus 20 ~V~C~NCgr--~vPKDK 34 (128)
.+.|++||+ .+|+-+
T Consensus 31 kikC~gCg~~imlpR~~ 47 (57)
T PF06107_consen 31 KIKCLGCGRQIMLPRSK 47 (57)
T ss_pred EEEECCCCCEEEEeHHH
Confidence 578999999 456543
No 41
>PF07282 OrfB_Zn_ribbon: Putative transposase DNA-binding domain; InterPro: IPR010095 This entry represents a region of a sequence similarity between a family of putative transposases of Thermoanaerobacter tengcongensis, smaller related proteins from Bacillus anthracis, putative transposes described by IPR001959 from INTERPRO, and other proteins. More information about these proteins can be found at Protein of the Month: Transposase [].
Probab=21.25 E-value=68 Score=20.49 Aligned_cols=19 Identities=32% Similarity=0.609 Sum_probs=13.0
Q ss_pred CCcccceeecCCcceeecc
Q 033056 15 RGHVNFIRCSNCGKCCPKD 33 (128)
Q Consensus 15 rGhv~~V~C~NCgr~vPKD 33 (128)
........|.+||..+.+|
T Consensus 41 ~~~~r~~~C~~Cg~~~~rD 59 (69)
T PF07282_consen 41 RRSGRVFTCPNCGFEMDRD 59 (69)
T ss_pred ccccceEEcCCCCCEECcH
Confidence 3455566788888777766
No 42
>KOG3286 consensus Selenoprotein T [General function prediction only]
Probab=21.14 E-value=59 Score=27.33 Aligned_cols=14 Identities=29% Similarity=0.847 Sum_probs=11.0
Q ss_pred eeeeEEeeeeceec
Q 033056 68 YAKMQYCVSCAIHS 81 (128)
Q Consensus 68 yvKl~YCVSCAIHs 81 (128)
-++..|||||-.-.
T Consensus 71 tl~i~fCvSCgYk~ 84 (226)
T KOG3286|consen 71 TLEINFCVSCGYKQ 84 (226)
T ss_pred cEEEEEEEecCcHH
Confidence 37899999997644
No 43
>COG1326 Uncharacterized archaeal Zn-finger protein [General function prediction only]
Probab=20.59 E-value=86 Score=25.95 Aligned_cols=24 Identities=33% Similarity=0.833 Sum_probs=16.9
Q ss_pred CCcccceeecCCccee------ecccceee
Q 033056 15 RGHVNFIRCSNCGKCC------PKDKAIKR 38 (128)
Q Consensus 15 rGhv~~V~C~NCgr~v------PKDKAIKr 38 (128)
+|-.-.++|.|||-.- ||-.+++.
T Consensus 25 ~g~~~lvrC~eCG~V~~~~i~~~k~~~v~v 54 (201)
T COG1326 25 RGREPLVRCEECGTVHPAIIKTPKPVRVRV 54 (201)
T ss_pred cCCceEEEccCCCcEeeceeeccccceEEE
Confidence 4555889999999865 55555553
No 44
>PF07754 DUF1610: Domain of unknown function (DUF1610); InterPro: IPR011668 This domain is found in archaeal species. It is likely to bind zinc via its four well-conserved cysteine residues.
Probab=20.33 E-value=76 Score=17.94 Aligned_cols=16 Identities=31% Similarity=0.596 Sum_probs=10.6
Q ss_pred CCCCCcccceeecCCc
Q 033056 12 KHGRGHVNFIRCSNCG 27 (128)
Q Consensus 12 KkgrGhv~~V~C~NCg 27 (128)
--++++...-.|.|||
T Consensus 8 i~~r~~~v~f~CPnCG 23 (24)
T PF07754_consen 8 IAPREQAVPFPCPNCG 23 (24)
T ss_pred ccCcccCceEeCCCCC
Confidence 3456666667788877
Done!