Query psy14233
Match_columns 122
No_of_seqs 139 out of 1113
Neff 6.5
Searched_HMMs 46136
Date Fri Aug 16 22:48:33 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy14233.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/14233hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 cd00048 DSRM Double-stranded R 99.8 5.8E-19 1.3E-23 109.5 9.6 66 6-72 1-68 (68)
2 PHA02701 ORF020 dsRNA-binding 99.8 6.7E-19 1.5E-23 131.4 10.7 71 4-75 107-178 (183)
3 PHA03103 double-strand RNA-bin 99.8 1.7E-18 3.7E-23 129.6 10.7 70 4-74 108-177 (183)
4 smart00358 DSRM Double-strande 99.8 3.4E-18 7.3E-23 106.3 9.3 65 7-72 1-66 (67)
5 PRK12371 ribonuclease III; Rev 99.7 1.2E-17 2.6E-22 129.2 10.5 70 4-74 160-231 (235)
6 PF00035 dsrm: Double-stranded 99.7 2.4E-17 5.2E-22 102.8 9.8 65 7-72 1-67 (67)
7 PRK14718 ribonuclease III; Pro 99.7 1.9E-16 4.2E-21 131.5 10.4 70 4-73 150-221 (467)
8 COG0571 Rnc dsRNA-specific rib 99.7 3.1E-16 6.8E-21 121.6 9.8 70 4-74 160-231 (235)
9 PRK12372 ribonuclease III; Rev 99.7 5.5E-16 1.2E-20 127.8 10.7 71 4-74 150-222 (413)
10 KOG3732|consensus 99.6 5.1E-15 1.1E-19 119.0 10.2 69 3-73 140-210 (339)
11 PRK00102 rnc ribonuclease III; 99.6 6.8E-15 1.5E-19 111.9 10.3 69 4-73 157-227 (229)
12 TIGR02191 RNaseIII ribonucleas 99.6 1E-14 2.2E-19 110.0 9.6 68 4-72 151-220 (220)
13 PF14709 DND1_DSRM: double str 99.6 6.7E-15 1.4E-19 97.0 7.3 68 5-72 1-79 (80)
14 KOG3732|consensus 99.4 6.7E-13 1.5E-17 106.8 9.4 70 5-76 38-108 (339)
15 KOG4334|consensus 99.2 1.6E-11 3.4E-16 103.2 2.8 107 5-120 375-482 (650)
16 KOG1817|consensus 98.8 3E-08 6.4E-13 82.8 8.4 69 5-74 427-503 (533)
17 KOG2777|consensus 98.7 3.7E-08 8.1E-13 84.0 7.5 66 4-75 89-155 (542)
18 KOG0921|consensus 98.4 9.1E-07 2E-11 79.5 7.3 85 5-89 1-85 (1282)
19 PF03368 Dicer_dimer: Dicer di 98.3 8.2E-06 1.8E-10 54.6 8.8 73 8-85 2-84 (90)
20 KOG3769|consensus 97.9 2.3E-05 4.9E-10 63.0 4.8 70 3-73 230-302 (333)
21 PF14954 LIX1: Limb expression 95.5 0.032 6.8E-07 43.4 5.2 67 4-70 20-93 (252)
22 KOG2334|consensus 93.6 0.024 5.1E-07 47.9 0.5 68 3-73 373-441 (477)
23 KOG3792|consensus 91.1 0.24 5.1E-06 44.3 3.4 62 5-72 506-569 (816)
24 KOG2777|consensus 89.1 0.24 5.3E-06 42.9 1.9 49 20-75 8-57 (542)
25 KOG4334|consensus 88.9 2.4 5.2E-05 36.9 7.6 66 4-73 489-557 (650)
26 PF02169 LPP20: LPP20 lipoprot 88.5 0.43 9.3E-06 30.8 2.4 27 47-73 13-39 (92)
27 KOG0921|consensus 86.7 0.26 5.6E-06 45.5 0.7 70 5-74 167-240 (1282)
28 KOG3792|consensus 83.1 0.71 1.5E-05 41.4 1.7 64 6-72 371-441 (816)
29 PF14657 Integrase_AP2: AP2-li 72.0 16 0.00035 20.8 5.0 22 50-72 20-41 (46)
30 TIGR01021 rpsE_bact ribosomal 61.4 14 0.00031 27.0 3.8 42 32-73 19-63 (154)
31 cd04516 TBP_eukaryotes eukaryo 58.4 49 0.0011 24.5 6.3 64 10-77 18-85 (174)
32 CHL00138 rps5 ribosomal protei 55.4 22 0.00048 25.7 3.9 42 32-73 41-85 (143)
33 PRK00550 rpsE 30S ribosomal pr 54.7 21 0.00046 26.5 3.8 42 32-73 30-74 (168)
34 COG1944 Uncharacterized conser 49.5 90 0.0019 26.4 7.1 29 48-76 60-88 (398)
35 KOG1881|consensus 45.8 14 0.0003 33.6 1.9 66 5-73 349-420 (793)
36 PRK04044 rps5p 30S ribosomal p 45.5 35 0.00076 26.4 3.8 42 32-73 67-111 (211)
37 KOG0701|consensus 45.0 30 0.00065 34.1 4.0 70 7-77 1516-1601(1606)
38 PRK00394 transcription factor; 43.9 1E+02 0.0022 22.9 6.1 42 33-77 43-84 (179)
39 cd04518 TBP_archaea archaeal T 39.7 1.3E+02 0.0028 22.2 6.1 42 33-77 44-85 (174)
40 KOG3302|consensus 39.3 1.1E+02 0.0024 23.5 5.7 42 33-77 65-106 (200)
41 PLN00062 TATA-box-binding prot 38.7 1.5E+02 0.0033 22.0 6.3 64 10-77 18-85 (179)
42 cd00652 TBP_TLF TATA box bindi 36.8 1.7E+02 0.0037 21.5 6.3 42 34-78 45-86 (174)
43 PF12909 DUF3832: Protein of u 35.1 1.3E+02 0.0028 20.0 4.9 32 40-73 20-51 (89)
44 PF12724 Flavodoxin_5: Flavodo 32.4 31 0.00068 24.0 1.7 21 48-68 3-23 (143)
45 cd04517 TLF TBP-like factors ( 32.2 2.1E+02 0.0045 21.1 6.1 63 10-77 19-85 (174)
46 PF00333 Ribosomal_S5: Ribosom 28.9 1.5E+02 0.0033 18.5 4.3 41 32-72 19-62 (67)
47 PF00352 TBP: Transcription fa 28.6 1.6E+02 0.0035 18.7 6.1 38 34-74 47-84 (86)
48 TIGR01020 rpsE_arch ribosomal 28.3 88 0.0019 24.2 3.6 49 32-80 67-122 (212)
49 PHA00280 putative NHN endonucl 28.1 2.1E+02 0.0047 20.0 5.4 31 33-64 77-107 (121)
50 PF10195 Phospho_p8: DNA-bindi 27.6 30 0.00064 21.6 0.7 14 50-63 23-36 (60)
51 PRK12359 flavodoxin FldB; Prov 27.1 41 0.00088 24.8 1.5 20 48-67 6-25 (172)
52 PRK09267 flavodoxin FldA; Vali 25.1 48 0.001 23.5 1.6 20 48-67 7-26 (169)
53 PRK05569 flavodoxin; Provision 22.7 56 0.0012 22.3 1.5 19 48-66 7-25 (141)
54 TIGR01752 flav_long flavodoxin 21.0 64 0.0014 23.1 1.6 19 48-66 5-23 (167)
55 PRK03739 2-isopropylmalate syn 20.8 5.5E+02 0.012 22.4 7.5 67 6-72 478-550 (552)
56 PF12327 FtsZ_C: FtsZ family, 20.7 1E+02 0.0022 20.3 2.4 22 48-70 4-25 (95)
57 PF05165 GGDN: GGDN family; I 20.0 1.9E+02 0.0041 22.9 4.1 25 49-74 210-234 (246)
No 1
>cd00048 DSRM Double-stranded RNA binding motif. Binding is not sequence specific but is highly specific for double stranded RNA. Found in a variety of proteins including dsRNA dependent protein kinase PKR, RNA helicases, Drosophila staufen protein, E. coli RNase III, RNases H1, and dsRNA dependent adenosine deaminases.
Probab=99.80 E-value=5.8e-19 Score=109.50 Aligned_cols=66 Identities=35% Similarity=0.607 Sum_probs=61.4
Q ss_pred ChHHHHHHHHHhc-CCCceEEE-EeeCCCCCccEEEEEEECCEEEEEeEeccCHHHHHHHHHHHHHHHH
Q psy14233 6 LVKAFLNEWCQKN-SKQPEFDV-RAAGPKHRQRFLCELRIAGIGYVGAGNSTNKKDAQTNAAKDFVQYL 72 (122)
Q Consensus 6 n~ks~L~E~~qk~-~~~p~Y~~-~~~Gp~h~~~F~~~V~v~g~~~~g~G~G~SKK~Aeq~AA~~Al~~L 72 (122)
|||+.|+||||++ +..|.|++ ...|++|.+.|+|.|.|+|. +++.|.|.|||+|++.||+.||+.|
T Consensus 1 ~p~~~L~e~~~~~~~~~~~y~~~~~~g~~~~~~f~~~v~i~~~-~~~~g~g~sKk~Ak~~AA~~al~~L 68 (68)
T cd00048 1 NPKSLLQELAQKRGKPLPEYELVEEEGPDHAPRFTVEVTVGGK-ITGEGEGSSKKEAKQNAAEAALRKL 68 (68)
T ss_pred ChHHHHHHHHHHcCCCCCeEEEeeeeCCCCCCeEEEEEEECCE-EEEEeecCCHHHHHHHHHHHHHHhC
Confidence 7999999999999 89999998 57899999999999999996 4799999999999999999999875
No 2
>PHA02701 ORF020 dsRNA-binding PKR inhibitor; Provisional
Probab=99.79 E-value=6.7e-19 Score=131.40 Aligned_cols=71 Identities=25% Similarity=0.394 Sum_probs=63.3
Q ss_pred CCChHHHHHHHHHhcCCCceEEE-EeeCCCCCccEEEEEEECCEEEEEeEeccCHHHHHHHHHHHHHHHHHHc
Q psy14233 4 DGLVKAFLNEWCQKNSKQPEFDV-RAAGPKHRQRFLCELRIAGIGYVGAGNSTNKKDAQTNAAKDFVQYLVRS 75 (122)
Q Consensus 4 ~~n~ks~L~E~~qk~~~~p~Y~~-~~~Gp~h~~~F~~~V~v~g~~~~g~G~G~SKK~Aeq~AA~~Al~~L~~~ 75 (122)
..|||+.||||||+++..+.|.+ ...||+|.+.|++.|.|+|.. ++.|.|+|||+|||+||+.||+.|...
T Consensus 107 ~~DpKS~LQE~~Q~~~~~l~Y~li~~~GpdH~~~Ftv~V~V~g~~-~g~G~G~SKKeAEQ~AAk~AL~~L~~~ 178 (183)
T PHA02701 107 TLNPVSAVNEFCMRTHRPLEFCETRSGGHDHCPLFTCTIVVSGKV-VATASGCSKKLARHAACADALTILINN 178 (183)
T ss_pred CCCccHHHHHHHHhcCCCCeEEEEEeECCCCCceEEEEEEECCEE-EEEEEeCCHHHHHHHHHHHHHHHHHhh
Confidence 46999999999999955448986 467999999999999999975 899999999999999999999999643
No 3
>PHA03103 double-strand RNA-binding protein; Provisional
Probab=99.78 E-value=1.7e-18 Score=129.57 Aligned_cols=70 Identities=29% Similarity=0.461 Sum_probs=62.4
Q ss_pred CCChHHHHHHHHHhcCCCceEEEEeeCCCCCccEEEEEEECCEEEEEeEeccCHHHHHHHHHHHHHHHHHH
Q psy14233 4 DGLVKAFLNEWCQKNSKQPEFDVRAAGPKHRQRFLCELRIAGIGYVGAGNSTNKKDAQTNAAKDFVQYLVR 74 (122)
Q Consensus 4 ~~n~ks~L~E~~qk~~~~p~Y~~~~~Gp~h~~~F~~~V~v~g~~~~g~G~G~SKK~Aeq~AA~~Al~~L~~ 74 (122)
..|||+.||||||+++....|.+...||+|.+.|++.|.|+|.. ++.|.|+|||+|||+||+.||..|..
T Consensus 108 ~kNpKS~LQE~~Qk~~~~~y~~i~~~Gp~H~p~F~v~V~I~g~~-~g~G~G~SKKeAEQ~AAk~AL~~L~~ 177 (183)
T PHA03103 108 DKNPCTVINEYCQITSRDWSINITSSGPSHSPTFTASVIISGIK-FKPAIGSTKKEAKNNAAKLAMDKILN 177 (183)
T ss_pred cCChhHHHHHHHHHhCCCeEEEEEeeCCCCCceEEEEEEECCEE-EEEeeeCCHHHHHHHHHHHHHHHHHh
Confidence 36899999999999955555566678999999999999999975 79999999999999999999999964
No 4
>smart00358 DSRM Double-stranded RNA binding motif.
Probab=99.77 E-value=3.4e-18 Score=106.26 Aligned_cols=65 Identities=34% Similarity=0.640 Sum_probs=59.7
Q ss_pred hHHHHHHHHHhcCCCceEEEE-eeCCCCCccEEEEEEECCEEEEEeEeccCHHHHHHHHHHHHHHHH
Q psy14233 7 VKAFLNEWCQKNSKQPEFDVR-AAGPKHRQRFLCELRIAGIGYVGAGNSTNKKDAQTNAAKDFVQYL 72 (122)
Q Consensus 7 ~ks~L~E~~qk~~~~p~Y~~~-~~Gp~h~~~F~~~V~v~g~~~~g~G~G~SKK~Aeq~AA~~Al~~L 72 (122)
||+.|+||||+++..|.|++. ..|++|.+.|+|.|.|+|. +++.|.|.|||+|++.||+.||+.|
T Consensus 1 p~~~L~e~~~~~~~~~~y~~~~~~g~~~~~~f~~~v~i~~~-~~~~g~g~sKk~Ak~~AA~~al~~L 66 (67)
T smart00358 1 PKSLLQELAQKRGLPPEYELVKEEGPDHAPRFTVTVKVGGE-YTGEGEGSSKKEAKQRAAEAALRSL 66 (67)
T ss_pred CchHHHHHHHHCCCCCEEEEEeeeCCCCCCcEEEEEEECCE-EEEEeccCCHHHHHHHHHHHHHHhc
Confidence 789999999999449999986 4799999999999999997 4799999999999999999999987
No 5
>PRK12371 ribonuclease III; Reviewed
Probab=99.74 E-value=1.2e-17 Score=129.17 Aligned_cols=70 Identities=26% Similarity=0.453 Sum_probs=64.5
Q ss_pred CCChHHHHHHHHHhc-CCCceEEEE-eeCCCCCccEEEEEEECCEEEEEeEeccCHHHHHHHHHHHHHHHHHH
Q psy14233 4 DGLVKAFLNEWCQKN-SKQPEFDVR-AAGPKHRQRFLCELRIAGIGYVGAGNSTNKKDAQTNAAKDFVQYLVR 74 (122)
Q Consensus 4 ~~n~ks~L~E~~qk~-~~~p~Y~~~-~~Gp~h~~~F~~~V~v~g~~~~g~G~G~SKK~Aeq~AA~~Al~~L~~ 74 (122)
..|||+.||||||+. +..|.|.+. ..||+|.+.|+|.|+|+|.. ++.|.|+|||+||+.||+.||+.|..
T Consensus 160 ~~d~Ks~LqE~~q~~~~~~p~Y~~~~~~Gp~h~~~F~v~v~v~~~~-~~~g~G~sKK~Ae~~AA~~al~~~~~ 231 (235)
T PRK12371 160 RRDAKTELQEWAHAQFGVTPVYRVDSRSGPDHDPRFTVEVEVKGFA-PETGEGRSKRAAEQVAAEKMLEREGV 231 (235)
T ss_pred cCCHHHHHHHHHHhcCCCCCeEEEEEeecCCCCCeEEEEEEECCEE-EEEeeeCCHHHHHHHHHHHHHHHhhh
Confidence 469999999999998 789999975 77999999999999999985 79999999999999999999999853
No 6
>PF00035 dsrm: Double-stranded RNA binding motif; InterPro: IPR001159 The DsRBD domain is found in a variety of RNA-binding proteins with different structures and exhibiting a diversity of functions []. It is involved in localisation of at least five different mRNAs in the early Drosophila embryo and by interferon-induced protein kinase in humans, which is part of the cellular response to dsRNA.; GO: 0003725 double-stranded RNA binding, 0005622 intracellular; PDB: 1EKZ_A 1STU_A 1QU6_A 2L2M_A 3ADJ_A 1WHN_A 3LLH_B 2B7V_A 2L3J_A 1UHZ_A ....
Probab=99.74 E-value=2.4e-17 Score=102.84 Aligned_cols=65 Identities=42% Similarity=0.705 Sum_probs=56.6
Q ss_pred hHHHHHHHHHhcCCCceEEE-EeeCCCCC-ccEEEEEEECCEEEEEeEeccCHHHHHHHHHHHHHHHH
Q psy14233 7 VKAFLNEWCQKNSKQPEFDV-RAAGPKHR-QRFLCELRIAGIGYVGAGNSTNKKDAQTNAAKDFVQYL 72 (122)
Q Consensus 7 ~ks~L~E~~qk~~~~p~Y~~-~~~Gp~h~-~~F~~~V~v~g~~~~g~G~G~SKK~Aeq~AA~~Al~~L 72 (122)
||+.|+|||++++..|.|.. ...+++|. +.|.|.|.|+|.. ++.|.|.|||+|++.||+.||+.|
T Consensus 1 ~~~~L~e~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~i~~~~-~~~g~g~sKk~Ak~~AA~~al~~L 67 (67)
T PF00035_consen 1 PKSRLNEYCQKNKFPPPYYYIEEEGPSHHRPRFICTVYIDGKE-YGEGEGSSKKEAKQQAAKKALQKL 67 (67)
T ss_dssp HHHHHHHHHHHCTSSEEEEEEEEESSSSSSEEEEEEEEETTEE-EEEEEESSHHHHHHHHHHHHHHHH
T ss_pred CHHHHHHHHHHcCCCCCEEEEEEeCCCCCCceEEEEEEECCEE-EeEeccCCHHHHHHHHHHHHHHhC
Confidence 79999999999977776664 45565555 8999999999985 789999999999999999999987
No 7
>PRK14718 ribonuclease III; Provisional
Probab=99.68 E-value=1.9e-16 Score=131.53 Aligned_cols=70 Identities=23% Similarity=0.357 Sum_probs=64.6
Q ss_pred CCChHHHHHHHHHhc-CCCceEEEE-eeCCCCCccEEEEEEECCEEEEEeEeccCHHHHHHHHHHHHHHHHH
Q psy14233 4 DGLVKAFLNEWCQKN-SKQPEFDVR-AAGPKHRQRFLCELRIAGIGYVGAGNSTNKKDAQTNAAKDFVQYLV 73 (122)
Q Consensus 4 ~~n~ks~L~E~~qk~-~~~p~Y~~~-~~Gp~h~~~F~~~V~v~g~~~~g~G~G~SKK~Aeq~AA~~Al~~L~ 73 (122)
..|||+.||||||++ +..|+|.+. +.||+|.+.|.+.|+|+|..++|.|.|.|||+|||.||+.||+.|.
T Consensus 150 ~kDyKS~LQE~~Qk~~~~~PeY~li~esGPdH~k~F~V~V~v~g~~~~G~G~G~SKKeAEQ~AAk~AL~kL~ 221 (467)
T PRK14718 150 GKDAKTLLQEYLQGHKIALPTYTVVATHGAAHNQQFEVECTVPKLDIKVSGSGASRRAAEQAAAKKALDEVT 221 (467)
T ss_pred ccCHHHHHHHHHHhcCCCCCeeEEeeeecCCCCCeEEEEEEECCeeeEEEEEcCCHHHHHHHHHHHHHHHhc
Confidence 469999999999999 788999974 6899999999999999996556999999999999999999999996
No 8
>COG0571 Rnc dsRNA-specific ribonuclease [Transcription]
Probab=99.67 E-value=3.1e-16 Score=121.55 Aligned_cols=70 Identities=29% Similarity=0.395 Sum_probs=64.8
Q ss_pred CCChHHHHHHHHHhc-CCCceEEEE-eeCCCCCccEEEEEEECCEEEEEeEeccCHHHHHHHHHHHHHHHHHH
Q psy14233 4 DGLVKAFLNEWCQKN-SKQPEFDVR-AAGPKHRQRFLCELRIAGIGYVGAGNSTNKKDAQTNAAKDFVQYLVR 74 (122)
Q Consensus 4 ~~n~ks~L~E~~qk~-~~~p~Y~~~-~~Gp~h~~~F~~~V~v~g~~~~g~G~G~SKK~Aeq~AA~~Al~~L~~ 74 (122)
..|||+.||||||.+ +..|.|.+. ..||+|++.|++.|.|+|.. +|.|.|+|||+|||.||+.||..|..
T Consensus 160 ~~D~Kt~LQe~~q~~~~~~p~Y~~v~~~g~~h~~~F~v~v~v~~~~-~g~G~G~skk~AEq~AA~~al~~l~~ 231 (235)
T COG0571 160 FKDPKTRLQELLQAQGLVLPEYRLVKEEGPAHDKEFTVEVAVGGKE-LGTGKGRSKKEAEQAAAEQALKKLGV 231 (235)
T ss_pred ccChhHHHHHHHHhcCCCCCeEEEeeccCCCCCceEEEEEEECCee-EEEecccCHHHHHHHHHHHHHHHhcc
Confidence 479999999999999 889999985 56999999999999999974 89999999999999999999999964
No 9
>PRK12372 ribonuclease III; Reviewed
Probab=99.66 E-value=5.5e-16 Score=127.77 Aligned_cols=71 Identities=23% Similarity=0.361 Sum_probs=65.0
Q ss_pred CCChHHHHHHHHHhc-CCCceEEE-EeeCCCCCccEEEEEEECCEEEEEeEeccCHHHHHHHHHHHHHHHHHH
Q psy14233 4 DGLVKAFLNEWCQKN-SKQPEFDV-RAAGPKHRQRFLCELRIAGIGYVGAGNSTNKKDAQTNAAKDFVQYLVR 74 (122)
Q Consensus 4 ~~n~ks~L~E~~qk~-~~~p~Y~~-~~~Gp~h~~~F~~~V~v~g~~~~g~G~G~SKK~Aeq~AA~~Al~~L~~ 74 (122)
..|||+.||||||++ +..|+|.+ ...||+|++.|++.|+|+|..+.+.|.|.|||+|||.||+.||+.|..
T Consensus 150 ~~D~KS~LQE~~Q~~~~~~P~Y~lv~e~Gp~h~~~F~V~V~v~g~~~~g~G~G~SKKeAEQ~AAr~AL~kL~~ 222 (413)
T PRK12372 150 GKDAKTLLQEYLQGHKIALPTYTVVATHGAAHNQQFEVECTVPKLDVKVSGSGASRRAAEQAAAKKALDEVMA 222 (413)
T ss_pred cCCHHHHHHHHHHhcCCCCCeeEEeeeecCCCCceEEEEEEECCeEEEEEEEeCCHHHHHHHHHHHHHHHHhc
Confidence 469999999999999 78899997 478999999999999999965568999999999999999999999973
No 10
>KOG3732|consensus
Probab=99.60 E-value=5.1e-15 Score=118.98 Aligned_cols=69 Identities=26% Similarity=0.348 Sum_probs=64.7
Q ss_pred CCCChHHHHHHHHHhc-CCCceEEE-EeeCCCCCccEEEEEEECCEEEEEeEeccCHHHHHHHHHHHHHHHHH
Q psy14233 3 GDGLVKAFLNEWCQKN-SKQPEFDV-RAAGPKHRQRFLCELRIAGIGYVGAGNSTNKKDAQTNAAKDFVQYLV 73 (122)
Q Consensus 3 ~~~n~ks~L~E~~qk~-~~~p~Y~~-~~~Gp~h~~~F~~~V~v~g~~~~g~G~G~SKK~Aeq~AA~~Al~~L~ 73 (122)
+..||+++|+|+||++ |..|+|++ .+.|++|.+.|++.|.|.+. ...|.|.|||.||++||..+|..|.
T Consensus 140 ~~~NPI~~L~e~~q~k~~k~P~yelv~E~G~~~~rEFv~q~sv~~~--~~~GkG~sKKiAKRnAAeamLe~l~ 210 (339)
T KOG3732|consen 140 QVLNPIGRLQELAQAKKWKLPEYELVQESGVPHRREFVIQCSVENF--TEEGKGPSKKIAKRNAAEAMLESLG 210 (339)
T ss_pred cccChHHHHHHHHHHhCCCCCceEEEeccCCCccceEEEEEEecce--eeecCCchHHHHHHHHHHHHHHHhc
Confidence 4689999999999999 99999998 58899999999999999995 6999999999999999999999995
No 11
>PRK00102 rnc ribonuclease III; Reviewed
Probab=99.60 E-value=6.8e-15 Score=111.91 Aligned_cols=69 Identities=30% Similarity=0.469 Sum_probs=63.5
Q ss_pred CCChHHHHHHHHHhc-CCCceEEE-EeeCCCCCccEEEEEEECCEEEEEeEeccCHHHHHHHHHHHHHHHHH
Q psy14233 4 DGLVKAFLNEWCQKN-SKQPEFDV-RAAGPKHRQRFLCELRIAGIGYVGAGNSTNKKDAQTNAAKDFVQYLV 73 (122)
Q Consensus 4 ~~n~ks~L~E~~qk~-~~~p~Y~~-~~~Gp~h~~~F~~~V~v~g~~~~g~G~G~SKK~Aeq~AA~~Al~~L~ 73 (122)
..|||+.|+||||+. +..|.|.+ ...|+.|.+.|+|.|+++|. ++|.|.|.|||+||+.||+.||+.|.
T Consensus 157 ~~~pk~~L~e~~~~~~~~~p~y~~~~~~g~~~~~~f~~~v~i~~~-~~~~g~g~skk~Ae~~AA~~Al~~l~ 227 (229)
T PRK00102 157 VKDYKTRLQELLQGRGLPLPEYELVKEEGPAHDKEFTVEVTVNGK-ELGEGTGSSKKEAEQAAAKQALKKLK 227 (229)
T ss_pred cCCHHHHHHHHHHHcCCCCCceEEeeccCCCCCceEEEEEEECCE-EEEEeeeCCHHHHHHHHHHHHHHHHh
Confidence 579999999999998 67899987 46799999999999999997 58999999999999999999999985
No 12
>TIGR02191 RNaseIII ribonuclease III, bacterial. This family consists of bacterial examples of ribonuclease III. This enzyme cleaves double-stranded rRNA. It is involved in processing ribosomal RNA precursors. It is found even in minimal genones such as Mycoplasma genitalium and Buchnera aphidicola, and in some cases has been shown to be an essential gene. These bacterial proteins contain a double-stranded RNA binding motif (pfam00035) and a ribonuclease III domain (pfam00636). Eukaryotic homologs tend to be much longer proteins with additional domains, localized to the nucleus, and not included in this family.
Probab=99.58 E-value=1e-14 Score=110.05 Aligned_cols=68 Identities=34% Similarity=0.536 Sum_probs=62.2
Q ss_pred CCChHHHHHHHHHhc-CCCceEEEE-eeCCCCCccEEEEEEECCEEEEEeEeccCHHHHHHHHHHHHHHHH
Q psy14233 4 DGLVKAFLNEWCQKN-SKQPEFDVR-AAGPKHRQRFLCELRIAGIGYVGAGNSTNKKDAQTNAAKDFVQYL 72 (122)
Q Consensus 4 ~~n~ks~L~E~~qk~-~~~p~Y~~~-~~Gp~h~~~F~~~V~v~g~~~~g~G~G~SKK~Aeq~AA~~Al~~L 72 (122)
..|||+.|+||||++ +..|.|++. ..|++|.+.|.|.|.++|. +++.|.|.|||+|++.||+.||+.|
T Consensus 151 ~~~pk~~L~e~~~~~~~~~p~y~~~~~~g~~~~~~f~~~v~~~~~-~~~~g~g~skk~A~~~AA~~Al~~l 220 (220)
T TIGR02191 151 LKDYKTALQEWAQARGKPLPEYRLIKEEGPDHDKEFTVEVSVNGE-PYGEGKGKSKKEAEQNAAKAALEKL 220 (220)
T ss_pred cCChHHHHHHHHHHcCCCCceEEEecccCCCCCceEEEEEEECCE-EEEEeeeCCHHHHHHHHHHHHHHhC
Confidence 579999999999998 678999985 6699999999999999997 4899999999999999999999875
No 13
>PF14709 DND1_DSRM: double strand RNA binding domain from DEAD END PROTEIN 1
Probab=99.58 E-value=6.7e-15 Score=96.98 Aligned_cols=68 Identities=37% Similarity=0.595 Sum_probs=58.5
Q ss_pred CChHHHHHHHHHhc-CCCceEEEE-eeCCCCCccEEEEEEECCEEEE---------EeEeccCHHHHHHHHHHHHHHHH
Q psy14233 5 GLVKAFLNEWCQKN-SKQPEFDVR-AAGPKHRQRFLCELRIAGIGYV---------GAGNSTNKKDAQTNAAKDFVQYL 72 (122)
Q Consensus 5 ~n~ks~L~E~~qk~-~~~p~Y~~~-~~Gp~h~~~F~~~V~v~g~~~~---------g~G~G~SKK~Aeq~AA~~Al~~L 72 (122)
+++++.|+|+|+++ |+.|.|++. ..||+|.+.|.++|.|.+..+- +.---.+||+|+..||+.||+.|
T Consensus 1 k~a~~~L~elC~k~~W~~P~y~l~~~~Gp~~~~~F~ykV~i~~~~~~~~~~~~~~~p~~~~~~~k~Ak~~AA~~~L~~L 79 (80)
T PF14709_consen 1 KSAVSLLNELCQKNKWGPPVYELVSESGPDHRKLFLYKVVIPGLEYPFEGSIECFGPTKPSSTKKEAKESAAQQALQAL 79 (80)
T ss_pred CCHHHHHHHHHHhcCCCCCeEEEEeccCCCccEEEEEEEEEcCCCCCCcceEEEccCCCcCccHHHHHHHHHHHHHHhc
Confidence 47999999999999 999999985 7899999999999999886521 12334789999999999999987
No 14
>KOG3732|consensus
Probab=99.43 E-value=6.7e-13 Score=106.82 Aligned_cols=70 Identities=23% Similarity=0.400 Sum_probs=64.9
Q ss_pred CChHHHHHHHHHhcCCCceEEE-EeeCCCCCccEEEEEEECCEEEEEeEeccCHHHHHHHHHHHHHHHHHHcC
Q psy14233 5 GLVKAFLNEWCQKNSKQPEFDV-RAAGPKHRQRFLCELRIAGIGYVGAGNSTNKKDAQTNAAKDFVQYLVRSG 76 (122)
Q Consensus 5 ~n~ks~L~E~~qk~~~~p~Y~~-~~~Gp~h~~~F~~~V~v~g~~~~g~G~G~SKK~Aeq~AA~~Al~~L~~~~ 76 (122)
++|+|.|||+|.+++..|.|++ .+.||.|.+.|+++|.|+. |.+.|.|+|||.||+.||..+|..|...-
T Consensus 38 KS~IS~l~E~~~r~~~~v~fevl~eeGp~H~~~fv~rvtvg~--~~a~GeG~sKK~AKh~AA~~~L~~lk~l~ 108 (339)
T KOG3732|consen 38 KSPISLLQEYGLRRGLTPVYEVLREEGPPHMPNFVFRVTVGE--ITATGEGKSKKLAKHRAAEALLKELKKLP 108 (339)
T ss_pred CChHHHHHHHHHHhCCCcceeeeeccCCccCCCeEEEEEEee--eEEecCCCchhHHHHHHHHHHHHHHhcCC
Confidence 7899999999999999999998 4689999999999999986 57999999999999999999999998643
No 15
>KOG4334|consensus
Probab=99.15 E-value=1.6e-11 Score=103.23 Aligned_cols=107 Identities=19% Similarity=0.224 Sum_probs=81.2
Q ss_pred CChHHHHHHHHHhc-CCCceEEEEeeCCCCCccEEEEEEECCEEEEEeEeccCHHHHHHHHHHHHHHHHHHcCCCCCCCC
Q psy14233 5 GLVKAFLNEWCQKN-SKQPEFDVRAAGPKHRQRFLCELRIAGIGYVGAGNSTNKKDAQTNAAKDFVQYLVRSGQVNERDV 83 (122)
Q Consensus 5 ~n~ks~L~E~~qk~-~~~p~Y~~~~~Gp~h~~~F~~~V~v~g~~~~g~G~G~SKK~Aeq~AA~~Al~~L~~~~~~~~~~~ 83 (122)
+..+..|+||+|+. ...|.|++.+.. .....|.+.|.+|+.. ||.|.|.|||.|+..||+.+|+.|.-.=....++.
T Consensus 375 ks~vCiLhEy~q~~lk~~pvyef~e~~-n~stpysa~v~~d~~~-yGsG~g~sKK~Ak~~AAR~tLeiLIPd~~~~~~n~ 452 (650)
T KOG4334|consen 375 KSKVCILHEYAQQCLKSLPVYEFAEND-NNSTPYSAGVLPDLFP-YGSGVGASKKTAKLVAARDTLEILIPDLRVSEDNV 452 (650)
T ss_pred ceeeehHHHHHHHHhhhcceeehhhcc-CCCCcccccccccccc-cccccccchHHHHHHHHHHHHHHhcchhhhccccc
Confidence 45677899999998 889999986533 5677899999999976 89999999999999999999999963211111111
Q ss_pred CCCCCCCCCCCCCcccCCCCceeeeecccCCCCcccc
Q psy14233 84 PVENIPMTPTGPSVTVNTSKRFSFIRRSSIGLPRSYE 120 (122)
Q Consensus 84 p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~ 120 (122)
+-.++. .-+.-+..+++|+.++|++||++|
T Consensus 453 ---~d~k~~----~~~k~q~~le~F~~I~Iedprv~e 482 (650)
T KOG4334|consen 453 ---CDGKVE----EDGKQQGFLELFKKIKIEDPRVVE 482 (650)
T ss_pred ---cccccc----ccccchhHHHHhhcccccCchHHH
Confidence 111211 223356789999999999999987
No 16
>KOG1817|consensus
Probab=98.78 E-value=3e-08 Score=82.76 Aligned_cols=69 Identities=23% Similarity=0.331 Sum_probs=61.2
Q ss_pred CChHHHHHHHHHhcC-------CCceEEE-EeeCCCCCccEEEEEEECCEEEEEeEeccCHHHHHHHHHHHHHHHHHH
Q psy14233 5 GLVKAFLNEWCQKNS-------KQPEFDV-RAAGPKHRQRFLCELRIAGIGYVGAGNSTNKKDAQTNAAKDFVQYLVR 74 (122)
Q Consensus 5 ~n~ks~L~E~~qk~~-------~~p~Y~~-~~~Gp~h~~~F~~~V~v~g~~~~g~G~G~SKK~Aeq~AA~~Al~~L~~ 74 (122)
.|||+.|+.+|...+ ..|.|.+ ...||.+++.|++.|+++|+. +|.|.|+|-|.|+..||++||+.+..
T Consensus 427 ndpkskLqq~cl~~rys~~~epdip~y~V~~~~gpa~~r~y~Vavyf~gkr-lat~~G~nik~Ae~rAA~~ALe~~~~ 503 (533)
T KOG1817|consen 427 NDPKSKLQQCCLTLRYSLGGEPDIPLYKVLGAKGPANDRNYKVAVYFKGKR-LATGVGSNIKQAEMRAAMQALENLKM 503 (533)
T ss_pred cCcHHHHHHHHHHHhcccCCCCCCceEEEecccCCCCCCceEEEEEECCEE-EeeccCchHhHHHHHHHHHHHHHHHh
Confidence 689999999998862 2577776 467999999999999999986 79999999999999999999999964
No 17
>KOG2777|consensus
Probab=98.72 E-value=3.7e-08 Score=83.98 Aligned_cols=66 Identities=26% Similarity=0.385 Sum_probs=58.3
Q ss_pred CCChHHHHHHHHHhcCCCceEEE-EeeCCCCCccEEEEEEECCEEEEEeEeccCHHHHHHHHHHHHHHHHHHc
Q psy14233 4 DGLVKAFLNEWCQKNSKQPEFDV-RAAGPKHRQRFLCELRIAGIGYVGAGNSTNKKDAQTNAAKDFVQYLVRS 75 (122)
Q Consensus 4 ~~n~ks~L~E~~qk~~~~p~Y~~-~~~Gp~h~~~F~~~V~v~g~~~~g~G~G~SKK~Aeq~AA~~Al~~L~~~ 75 (122)
..||++.|+|+++ .+.|++ ...||.|.+.|.|.|.|||.. +..| |.|||+|++.||+.||+.|...
T Consensus 89 ~~npv~ll~e~~~----~~~~~~~~~~~~~~~~~F~~~~~vdg~~-~~~~-~~sKk~ak~~aa~~al~~l~~~ 155 (542)
T KOG2777|consen 89 GKNPVSLLHELAN----GLFFDFVNESGPQHAPKFVMSVVVDGRW-FEGG-GRSKKEAKQEAAMAALQVLFKI 155 (542)
T ss_pred cCCchHHHHHHhc----ccceeeeccCCCCCCceEEEEEEECCEE-ccCC-CcchHHHHHHHHHHHHHHHHhc
Confidence 4799999999998 688887 478999999999999999985 3445 9999999999999999999753
No 18
>KOG0921|consensus
Probab=98.39 E-value=9.1e-07 Score=79.47 Aligned_cols=85 Identities=54% Similarity=0.917 Sum_probs=77.9
Q ss_pred CChHHHHHHHHHhcCCCceEEEEeeCCCCCccEEEEEEECCEEEEEeEeccCHHHHHHHHHHHHHHHHHHcCCCCCCCCC
Q psy14233 5 GLVKAFLNEWCQKNSKQPEFDVRAAGPKHRQRFLCELRIAGIGYVGAGNSTNKKDAQTNAAKDFVQYLVRSGQVNERDVP 84 (122)
Q Consensus 5 ~n~ks~L~E~~qk~~~~p~Y~~~~~Gp~h~~~F~~~V~v~g~~~~g~G~G~SKK~Aeq~AA~~Al~~L~~~~~~~~~~~p 84 (122)
+|-|..|..||.|+...|.|++...++....+|.|.|.+.+..+.+.|...+||+|+..||+..++.|..+|+.+.++.|
T Consensus 1 ~d~k~fly~~~~k~~~~p~~d~~~~~~~~rqrf~ce~~~~~~~~~~~~~stnkKda~knac~dfv~ylvr~Gk~n~~d~p 80 (1282)
T KOG0921|consen 1 RDVKEFLYAWLGKNKYGPTYDIRSEGRKGRQRFLCEVRVEGFGYTAVGNSTNKKDAATNAAQDFCQYLVREGKMQQSDIP 80 (1282)
T ss_pred CcHHHHHHHHHhhhccCcceehhhhcccchhheeeeeeccCCcceeeecccccchhhHHHHHHHHHHhhhhccccccCCc
Confidence 46789999999999877999999889999999999999998877899999999999999999999999999999999998
Q ss_pred CCCCC
Q psy14233 85 VENIP 89 (122)
Q Consensus 85 ~~~~~ 89 (122)
...+.
T Consensus 81 ~~~s~ 85 (1282)
T KOG0921|consen 81 TLTSS 85 (1282)
T ss_pred ccccc
Confidence 77643
No 19
>PF03368 Dicer_dimer: Dicer dimerisation domain; InterPro: IPR005034 This domain is found in members of the Dicer protein family of dsRNA nucleases. This entry represents a dsRNA-binding domain. RNA interference (RNAi) is an ancient gene-silencing process that plays a fundamental role in diverse eukaryotic functions including viral defence, chromatin remodelling, genome rearrangement, developmental timing, brain morphogenesis, and stem cell maintenance. All RNAi pathways require the multidomain ribonuclease Dicer, which initiates RNAi by cleaving double-stranded RNA (dsRNA) substrates into small fragments ~25 nuleotides in length. A typical eukaryotic Dicer consists of a helicase domain (PDOC51192 from PROSITEDOC), a domain of unknown function, and a PAZ domain (PDOC50821 from PROSITEDOC) at the amino (N)-terminus as well as two ribonuclease III domains (PDOC00448 from PROSITEDOC) and a dsRNA-binding domain (dsRBD) (PDOC50137 from PROSITEDOC) at the carboxy (C)-terminus. The domain of unknown function of ~100 amino acids is predicted to adopt the canonical alpha-beta-beta-beta-alpha-fold found in all dsRBDs [, , , ].; GO: 0016891 endoribonuclease activity, producing 5'-phosphomonoesters; PDB: 2KOU_A.
Probab=98.30 E-value=8.2e-06 Score=54.56 Aligned_cols=73 Identities=26% Similarity=0.497 Sum_probs=49.7
Q ss_pred HHHHHHHHHhc-C-----CCceEEEEeeCCCCCccEEEEEEECC----EEEEEeEeccCHHHHHHHHHHHHHHHHHHcCC
Q psy14233 8 KAFLNEWCQKN-S-----KQPEFDVRAAGPKHRQRFLCELRIAG----IGYVGAGNSTNKKDAQTNAAKDFVQYLVRSGQ 77 (122)
Q Consensus 8 ks~L~E~~qk~-~-----~~p~Y~~~~~Gp~h~~~F~~~V~v~g----~~~~g~G~G~SKK~Aeq~AA~~Al~~L~~~~~ 77 (122)
++.|+.||++. . ..|.|.+...+. .|.|+|.+-. +.+.|. .-.||+.||+.||-+|+..|.+.|.
T Consensus 2 i~lL~~yC~~Lp~d~~~~~~P~~~~~~~~~----~~~c~v~LP~~~pi~~i~g~-~~~sk~~AK~sAAf~Ac~~L~~~g~ 76 (90)
T PF03368_consen 2 ISLLNRYCSTLPSDSFTNLKPEFEIEKIGS----GFICTVILPINSPIRSIEGP-PMRSKKLAKRSAAFEACKKLHEAGE 76 (90)
T ss_dssp HHHHHHHHTTSSS-TT--SS-EEEEEE--G-----EEEEEE--TT-SS--EEEE---SSHHHHHHHHHHHHHHHHHHH-S
T ss_pred HHHHHHHHhcCCCCCCccCCceEEEEEcCC----cEEEEEECCCCCCCCeEEcc-ccccHHHHHHHHHHHHHHHHHHcCC
Confidence 57899999996 3 479999876443 8999999732 123455 5689999999999999999999999
Q ss_pred CCCCCCCC
Q psy14233 78 VNERDVPV 85 (122)
Q Consensus 78 ~~~~~~p~ 85 (122)
++..-.|.
T Consensus 77 ldd~L~P~ 84 (90)
T PF03368_consen 77 LDDHLLPI 84 (90)
T ss_dssp -TTTS--H
T ss_pred CccccCCC
Confidence 88876554
No 20
>KOG3769|consensus
Probab=97.85 E-value=2.3e-05 Score=63.02 Aligned_cols=70 Identities=20% Similarity=0.300 Sum_probs=61.2
Q ss_pred CCCChHHHHHHHHHhc-CCCceEEEE-eeC-CCCCccEEEEEEECCEEEEEeEeccCHHHHHHHHHHHHHHHHH
Q psy14233 3 GDGLVKAFLNEWCQKN-SKQPEFDVR-AAG-PKHRQRFLCELRIAGIGYVGAGNSTNKKDAQTNAAKDFVQYLV 73 (122)
Q Consensus 3 ~~~n~ks~L~E~~qk~-~~~p~Y~~~-~~G-p~h~~~F~~~V~v~g~~~~g~G~G~SKK~Aeq~AA~~Al~~L~ 73 (122)
...+|...|-++|+++ ...|++.+. +.| ....+.|.+.++-|.+ .+|.|.|.|-|.|++.||..||.+|.
T Consensus 230 ql~~P~~~L~~lckr~~l~epe~Rll~esGr~S~~PvyvVgiYs~kk-llGqG~Gesl~~A~e~AA~dAL~k~y 302 (333)
T KOG3769|consen 230 QLQWPRRLLSRLCKRRGLKEPESRLLAESGRNSAEPVYVVGIYSGKK-LLGQGQGESLKLAEEQAARDALIKLY 302 (333)
T ss_pred cccchHHHHHHHHHHcCCCCchhHHHHHhccCccCceEEEEeecCch-hhccCcchHHHHHHHHHHHHHHHHHH
Confidence 3458999999999999 689999974 444 4678899999998886 58999999999999999999999996
No 21
>PF14954 LIX1: Limb expression 1
Probab=95.53 E-value=0.032 Score=43.40 Aligned_cols=67 Identities=25% Similarity=0.346 Sum_probs=45.6
Q ss_pred CCChHHHHHHHHHhc-CCCceE---E--EEeeCCCCCccEEEEEEECCEEEEEe-EeccCHHHHHHHHHHHHHH
Q psy14233 4 DGLVKAFLNEWCQKN-SKQPEF---D--VRAAGPKHRQRFLCELRIAGIGYVGA-GNSTNKKDAQTNAAKDFVQ 70 (122)
Q Consensus 4 ~~n~ks~L~E~~qk~-~~~p~Y---~--~~~~Gp~h~~~F~~~V~v~g~~~~g~-G~G~SKK~Aeq~AA~~Al~ 70 (122)
..|-+..|||+=|.. .....+ . +-+..|...+-|+|-|++-|-.-+|. -.-.||-+|++.||+.||-
T Consensus 20 ~vnvV~~LqeFWq~Kq~r~a~~~~~~lv~YEs~ps~~ppyVcyVTLPGGSCFGnfq~C~tkAEARR~AAKiALm 93 (252)
T PF14954_consen 20 DVNVVEALQEFWQMKQSRGADLKSEALVVYESVPSPSPPYVCYVTLPGGSCFGNFQNCPTKAEARRSAAKIALM 93 (252)
T ss_pred cchHHHHHHHHHHHHHhccccCCCCCeeeeeccCCCCCCeEEEEeCCCCCccCccccCCcHHHHHhhhHHHHHH
Confidence 357889999987765 211111 1 12556778889999999965211222 3457999999999999984
No 22
>KOG2334|consensus
Probab=93.62 E-value=0.024 Score=47.90 Aligned_cols=68 Identities=22% Similarity=0.192 Sum_probs=56.6
Q ss_pred CCCChHHHHHHHHHhc-CCCceEEEEeeCCCCCccEEEEEEECCEEEEEeEeccCHHHHHHHHHHHHHHHHH
Q psy14233 3 GDGLVKAFLNEWCQKN-SKQPEFDVRAAGPKHRQRFLCELRIAGIGYVGAGNSTNKKDAQTNAAKDFVQYLV 73 (122)
Q Consensus 3 ~~~n~ks~L~E~~qk~-~~~p~Y~~~~~Gp~h~~~F~~~V~v~g~~~~g~G~G~SKK~Aeq~AA~~Al~~L~ 73 (122)
+...||..|..||-+. ...|.|++... -++.|.+.+.++|+.+...=+-.+||-|+|.||..+|..+.
T Consensus 373 ~~~~~k~~l~~~~~~~~~~~~~ye~~~~---~d~lf~si~~~~~~~~~ssi~~~n~k~aeq~aa~~~l~~s~ 441 (477)
T KOG2334|consen 373 KWDTPKMVLADLCVKTKANGPVYETVQR---TDKLFSSIATARGQKYNSSIWSPNKKSAEQDAAIVALRKSN 441 (477)
T ss_pred CCCCHHHHHHHhhhhhcCCCcchhhhhh---hhhhhHHHhhhhhhhhhccccCcchhhHHHHHHHHHHHhcC
Confidence 3468999999999998 78999998763 57789999999998655555668999999999999988764
No 23
>KOG3792|consensus
Probab=91.05 E-value=0.24 Score=44.33 Aligned_cols=62 Identities=23% Similarity=0.218 Sum_probs=48.1
Q ss_pred CChHHHHHHHHHhcCCCceEEE-EeeC-CCCCccEEEEEEECCEEEEEeEeccCHHHHHHHHHHHHHHHH
Q psy14233 5 GLVKAFLNEWCQKNSKQPEFDV-RAAG-PKHRQRFLCELRIAGIGYVGAGNSTNKKDAQTNAAKDFVQYL 72 (122)
Q Consensus 5 ~n~ks~L~E~~qk~~~~p~Y~~-~~~G-p~h~~~F~~~V~v~g~~~~g~G~G~SKK~Aeq~AA~~Al~~L 72 (122)
+++...|+|- ..--.|++ .+.| -.|.++|+..|.++|+ .+.|-+.+|+-|+..||.-|++.+
T Consensus 506 K~vsd~L~Ek----~rg~k~El~set~~gs~~~R~v~gV~rvG~--~akG~~~~gd~a~~~a~Lca~~pt 569 (816)
T KOG3792|consen 506 KLVSDELAEK----RRGDKYELPSETGTGSHDKRFVKGVMRVGI--LAKGLLLNGDRAVELALLCAEKPT 569 (816)
T ss_pred cchhHHHhhh----ccccceecccccCCCCCCceeeeeeeeeeh--hhccccccchHHHHHHHHhccCcc
Confidence 4555556554 23456776 3444 6899999999999996 589999999999999999888755
No 24
>KOG2777|consensus
Probab=89.12 E-value=0.24 Score=42.93 Aligned_cols=49 Identities=20% Similarity=0.416 Sum_probs=41.9
Q ss_pred CCceEEE-EeeCCCCCccEEEEEEECCEEEEEeEeccCHHHHHHHHHHHHHHHHHHc
Q psy14233 20 KQPEFDV-RAAGPKHRQRFLCELRIAGIGYVGAGNSTNKKDAQTNAAKDFVQYLVRS 75 (122)
Q Consensus 20 ~~p~Y~~-~~~Gp~h~~~F~~~V~v~g~~~~g~G~G~SKK~Aeq~AA~~Al~~L~~~ 75 (122)
..+.|.. .+.||.|.+.|.+.|.|+|..+ . ||.|+..||+.|++.+...
T Consensus 8 ~~~~~~~~~q~~p~~~p~~~~~~~v~~~~~--~-----~k~~~~~~a~~~~~~~~~~ 57 (542)
T KOG2777|consen 8 PGLQYNLVSQTGPVHAPLFPFSVEVNGQEF--P-----KKKAKQRAAEKALRVFLQF 57 (542)
T ss_pred cccccccccccCCCCCCcccceEEeccccc--c-----cccccchhhhHHHHHHhhc
Confidence 3478875 5889999999999999999752 2 9999999999999999753
No 25
>KOG4334|consensus
Probab=88.88 E-value=2.4 Score=36.86 Aligned_cols=66 Identities=14% Similarity=0.146 Sum_probs=45.6
Q ss_pred CCChHHHHHHHHHhc--CCCceEEEEeeC-CCCCccEEEEEEECCEEEEEeEeccCHHHHHHHHHHHHHHHHH
Q psy14233 4 DGLVKAFLNEWCQKN--SKQPEFDVRAAG-PKHRQRFLCELRIAGIGYVGAGNSTNKKDAQTNAAKDFVQYLV 73 (122)
Q Consensus 4 ~~n~ks~L~E~~qk~--~~~p~Y~~~~~G-p~h~~~F~~~V~v~g~~~~g~G~G~SKK~Aeq~AA~~Al~~L~ 73 (122)
.-.|-..|.+..+++ |.......+..| ......|++. + |+ ....+...+|++++|.|.+..|+.|.
T Consensus 489 ~psPy~iL~~cl~Rn~g~~d~~ik~E~i~~~nqkse~im~--~-Gk-ht~~~~cknkr~gkQlASQ~ilq~lH 557 (650)
T KOG4334|consen 489 IPSPYNILRDCLSRNLGWNDLVIKKEMIGNGNQKSEVIMI--L-GK-HTEEAECKNKRQGKQLASQRILQKLH 557 (650)
T ss_pred CCCHHHHHHHHHHhhcCCcceeeeeeccCCCCccceeEee--e-cc-ceeeeeeechhHHHHHHHHHHHHHhC
Confidence 346778899999888 444444433222 2344445543 3 54 36889999999999999999999874
No 26
>PF02169 LPP20: LPP20 lipoprotein; InterPro: IPR002217 A major antigen has been recognised in Helicobacter pylori, a protein with an apparent molecular weight of 20,000 and mass 18,283 kDa []. DNA sequence analysis revealed a 525 bp gene, encoding a 175-amino acid residue product with a typical 21-residue lipoprotein signal peptide and consensus prolipoprotein processing site []. Results of experimental work with Lpp20 are consistent with it being a nonessential lipoprotein []. Prokaryotic membrane lipoproteins are synthesised with precursor signal peptides that are cleaved by specific peptidases (signal peptidase II). The enzyme recognises a conserved sequence, cutting upstream of a cysteine residue to which a glyceride-fatty acid lipid is attached [].; GO: 0009279 cell outer membrane
Probab=88.53 E-value=0.43 Score=30.80 Aligned_cols=27 Identities=11% Similarity=0.293 Sum_probs=22.8
Q ss_pred EEEEeEeccCHHHHHHHHHHHHHHHHH
Q psy14233 47 GYVGAGNSTNKKDAQTNAAKDFVQYLV 73 (122)
Q Consensus 47 ~~~g~G~G~SKK~Aeq~AA~~Al~~L~ 73 (122)
.++|.|.|.+.+.|++.|-..+.+.|.
T Consensus 13 ~l~a~G~~~~~~~A~~~A~~~la~~i~ 39 (92)
T PF02169_consen 13 YLYAVGSGSSREQAKQDALANLAEQIS 39 (92)
T ss_pred EEEEEEcccChHHHHHHHHHHHHHhee
Confidence 368999999999999988888888774
No 27
>KOG0921|consensus
Probab=86.72 E-value=0.26 Score=45.53 Aligned_cols=70 Identities=30% Similarity=0.362 Sum_probs=59.4
Q ss_pred CChHHHHHHHHHhcCCCceEEEEeeCCCCCccEEEEEEEC----CEEEEEeEeccCHHHHHHHHHHHHHHHHHH
Q psy14233 5 GLVKAFLNEWCQKNSKQPEFDVRAAGPKHRQRFLCELRIA----GIGYVGAGNSTNKKDAQTNAAKDFVQYLVR 74 (122)
Q Consensus 5 ~n~ks~L~E~~qk~~~~p~Y~~~~~Gp~h~~~F~~~V~v~----g~~~~g~G~G~SKK~Aeq~AA~~Al~~L~~ 74 (122)
.|.|..|+++-|+......|+....|++|.+.|+.+..+- .....+.+.|++||.|...+|...+..|..
T Consensus 167 eN~K~~ln~~~q~~~~~~~y~~~~~g~~~~~s~~~e~si~v~~~~~~~~~~~~gsnkk~~~~~ca~s~vrqm~h 240 (1282)
T KOG0921|consen 167 ENAKKALNEYLQKMRIQDNYKYTIVGPEHVRSFEAEASIYVPQLNRNLVAKETGSNKKVAEASCALSLVRQLFH 240 (1282)
T ss_pred chhHHHHhHHHhhhhhccccceeecCCccccchhhhHHHhhhhhchhhhhhhccccceecCcchHHHHHHHHHH
Confidence 5889999999999966999999989999999999987752 222358899999999999999998888763
No 28
>KOG3792|consensus
Probab=83.14 E-value=0.71 Score=41.41 Aligned_cols=64 Identities=22% Similarity=0.253 Sum_probs=48.3
Q ss_pred ChHHHHHHHHHhcCCCceEE------E-EeeCCCCCccEEEEEEECCEEEEEeEeccCHHHHHHHHHHHHHHHH
Q psy14233 6 LVKAFLNEWCQKNSKQPEFD------V-RAAGPKHRQRFLCELRIAGIGYVGAGNSTNKKDAQTNAAKDFVQYL 72 (122)
Q Consensus 6 n~ks~L~E~~qk~~~~p~Y~------~-~~~Gp~h~~~F~~~V~v~g~~~~g~G~G~SKK~Aeq~AA~~Al~~L 72 (122)
|+...+.-+-+. +....|+ + ...+|.|.++|+.+|.+||.. ....|.|||.|+-.||..-|+..
T Consensus 371 d~nak~mhl~gr-RhrLQYk~kv~p~Lvv~t~P~~~~~~t~e~r~~~~~--~~a~gps~~~~~wh~~~k~lq~~ 441 (816)
T KOG3792|consen 371 DPNAKEMHLKGR-RHRLQYKQKVDPDLVVDTKPSHRPRRTMEVRVNGLP--AEAEGPSKKTAKWHAARKRLQNE 441 (816)
T ss_pred CcchHHhhhhcc-cccceeccccCCCceeccCCcccchhhhhhhhcCCc--cccCCcccccchHHHHHHHhhcc
Confidence 344444433332 3456776 4 578999999999999999975 56668999999999999888876
No 29
>PF14657 Integrase_AP2: AP2-like DNA-binding integrase domain
Probab=71.97 E-value=16 Score=20.84 Aligned_cols=22 Identities=27% Similarity=0.203 Sum_probs=15.0
Q ss_pred EeEeccCHHHHHHHHHHHHHHHH
Q psy14233 50 GAGNSTNKKDAQTNAAKDFVQYL 72 (122)
Q Consensus 50 g~G~G~SKK~Aeq~AA~~Al~~L 72 (122)
-.....+|++|+..+++ ++..+
T Consensus 20 ~k~GF~TkkeA~~~~~~-~~~~~ 41 (46)
T PF14657_consen 20 TKRGFKTKKEAEKALAK-IEAEL 41 (46)
T ss_pred EcCCCCcHHHHHHHHHH-HHHHH
Confidence 34557999999996666 44444
No 30
>TIGR01021 rpsE_bact ribosomal protein S5, bacterial/organelle type. This model finds chloroplast ribosomal protein S5 as well as bacterial ribosomal protein S5. A candidate mitochondrial form (Saccharomyces cerevisiae YBR251W and its homolog) differs substantially and is not included in this model.
Probab=61.40 E-value=14 Score=26.96 Aligned_cols=42 Identities=29% Similarity=0.350 Sum_probs=27.3
Q ss_pred CCCccEEEEEEEC-CEEEEEeEeccCH--HHHHHHHHHHHHHHHH
Q psy14233 32 KHRQRFLCELRIA-GIGYVGAGNSTNK--KDAQTNAAKDFVQYLV 73 (122)
Q Consensus 32 ~h~~~F~~~V~v~-g~~~~g~G~G~SK--K~Aeq~AA~~Al~~L~ 73 (122)
.....|.+-|.|+ |.-++|.|.|+++ ..|-+.|...|...|.
T Consensus 19 Gr~~sf~alvvvGn~~G~vG~G~GKa~ev~~Ai~kA~~~Ak~nl~ 63 (154)
T TIGR01021 19 GRRFSFSALVVVGDGKGRVGFGFGKAKEVPDAIKKAVEKARKNLI 63 (154)
T ss_pred CcEEEEEEEEEEECCCCeEeeeeeecchHHHHHHHHHHHHHhCeE
Confidence 4556677777773 2223566666666 6777778888877775
No 31
>cd04516 TBP_eukaryotes eukaryotic TATA box binding protein (TBP): Present in archaea and eukaryotes, TBPs are transcription factors that recognize promoters and initiate transcription. TBP has been shown to be an essential component of three different transcription initiation complexes: SL1, TFIID and TFIIIB, directing transcription by RNA polymerases I, II and III, respectively. TBP binds directly to the TATA box promoter element, where it nucleates polymerase assembly, thus defining the transcription start site. TBP's binding in the minor groove induces a dramatic DNA bending while its own structure barely changes. The conserved core domain of TBP, which binds to the TATA box, has a bipartite structure, with intramolecular symmetry generating a saddle-shaped structure that sits astride the DNA.
Probab=58.41 E-value=49 Score=24.52 Aligned_cols=64 Identities=14% Similarity=0.236 Sum_probs=40.0
Q ss_pred HHHHHHHhcCCCceEEEEe----eCCCCCccEEEEEEECCEEEEEeEeccCHHHHHHHHHHHHHHHHHHcCC
Q psy14233 10 FLNEWCQKNSKQPEFDVRA----AGPKHRQRFLCELRIAGIGYVGAGNSTNKKDAQTNAAKDFVQYLVRSGQ 77 (122)
Q Consensus 10 ~L~E~~qk~~~~p~Y~~~~----~Gp~h~~~F~~~V~v~g~~~~g~G~G~SKK~Aeq~AA~~Al~~L~~~~~ 77 (122)
.|.+++..... .+|+=.. .-.-+++.-++.++-.|+ ++..| ++|..+|+. |++..++.|.+-|+
T Consensus 18 dL~~la~~~~n-~eYePe~fpgli~Rl~~Pk~t~lIF~SGK-iviTG-aks~e~a~~-a~~~i~~~L~~~g~ 85 (174)
T cd04516 18 DLKKIALRARN-AEYNPKRFAAVIMRIREPKTTALIFSSGK-MVCTG-AKSEDDSKL-AARKYARIIQKLGF 85 (174)
T ss_pred cHHHHHhhCCC-CEECCccCcEEEEEeCCCcEEEEEECCCe-EEEEe-cCCHHHHHH-HHHHHHHHHHHcCC
Confidence 46777755422 4553210 001245677788888997 44555 588888888 66777788877665
No 32
>CHL00138 rps5 ribosomal protein S5; Validated
Probab=55.43 E-value=22 Score=25.75 Aligned_cols=42 Identities=24% Similarity=0.373 Sum_probs=27.1
Q ss_pred CCCccEEEEEEECC-EEEEEeEeccCH--HHHHHHHHHHHHHHHH
Q psy14233 32 KHRQRFLCELRIAG-IGYVGAGNSTNK--KDAQTNAAKDFVQYLV 73 (122)
Q Consensus 32 ~h~~~F~~~V~v~g-~~~~g~G~G~SK--K~Aeq~AA~~Al~~L~ 73 (122)
....+|.+-|.+++ .-.+|.|.|+++ .+|-+.|...|...|.
T Consensus 41 GR~~~f~alvvvGn~~G~vG~G~GKa~ev~~Ai~KA~~~Akknl~ 85 (143)
T CHL00138 41 GKKLSFRAIVVIGNENGWVGVGVGKADDVQNAVKKAVTDAKKNLI 85 (143)
T ss_pred CcEEEEEEEEEEECCCCeEEEeeEecchHHHHHHHHHHHHHhCeE
Confidence 45666777777732 223566666655 6777788888877775
No 33
>PRK00550 rpsE 30S ribosomal protein S5; Validated
Probab=54.65 E-value=21 Score=26.46 Aligned_cols=42 Identities=21% Similarity=0.316 Sum_probs=27.1
Q ss_pred CCCccEEEEEEECC-EEEEEeEeccCH--HHHHHHHHHHHHHHHH
Q psy14233 32 KHRQRFLCELRIAG-IGYVGAGNSTNK--KDAQTNAAKDFVQYLV 73 (122)
Q Consensus 32 ~h~~~F~~~V~v~g-~~~~g~G~G~SK--K~Aeq~AA~~Al~~L~ 73 (122)
.....|.+-|.|++ .-++|.|.|+++ ..|-+.|...|...|.
T Consensus 30 Gr~~~f~alvvvGn~~G~vG~G~gKa~ev~~Ai~kA~~~A~~nl~ 74 (168)
T PRK00550 30 GRRFSFSALVVVGDGKGRVGFGYGKAREVPEAIRKAVEDAKKNMI 74 (168)
T ss_pred CcEEEEEEEEEEECCCCeEeeeeeecchHHHHHHHHHHHHHhCEE
Confidence 45667777777732 223566666655 6777777777777665
No 34
>COG1944 Uncharacterized conserved protein [Function unknown]
Probab=49.50 E-value=90 Score=26.40 Aligned_cols=29 Identities=21% Similarity=0.213 Sum_probs=25.9
Q ss_pred EEEeEeccCHHHHHHHHHHHHHHHHHHcC
Q psy14233 48 YVGAGNSTNKKDAQTNAAKDFVQYLVRSG 76 (122)
Q Consensus 48 ~~g~G~G~SKK~Aeq~AA~~Al~~L~~~~ 76 (122)
.+..|.|.||+.|+..|..++++.+..+-
T Consensus 60 ~~~~GKGat~~~A~vSAimE~~Er~sAe~ 88 (398)
T COG1944 60 SVSQGKGATKAAARVSALMEALERLSAEY 88 (398)
T ss_pred eeecCCCCCHHHHHHHHHHHHHHHhhccc
Confidence 36789999999999999999999998654
No 35
>KOG1881|consensus
Probab=45.83 E-value=14 Score=33.55 Aligned_cols=66 Identities=20% Similarity=0.276 Sum_probs=51.0
Q ss_pred CChHHHHHHHHHhcCCCceEEEEeeCCCCCccEEEEEEECCEE------EEEeEeccCHHHHHHHHHHHHHHHHH
Q psy14233 5 GLVKAFLNEWCQKNSKQPEFDVRAAGPKHRQRFLCELRIAGIG------YVGAGNSTNKKDAQTNAAKDFVQYLV 73 (122)
Q Consensus 5 ~n~ks~L~E~~qk~~~~p~Y~~~~~Gp~h~~~F~~~V~v~g~~------~~g~G~G~SKK~Aeq~AA~~Al~~L~ 73 (122)
.+||-.|+.+.-+.+....|++...|+. .|.|.|..-+.. +.+...-..||+|--.||..||+.|.
T Consensus 349 qdPkk~l~~ffereg~~l~~~~deq~~~---~w~c~v~lp~~~~~~~~~v~~~~~~g~kke~~iqc~ldaCrild 420 (793)
T KOG1881|consen 349 QDPKKALLGFFEREGEDLEYEFDEQGHG---KWVCRVELPVDDSGGARAVAEAAHSGEKKEAFIQCALDACRILD 420 (793)
T ss_pred cCHHHHHHHHHHhhhhhhhhhhhhcCCc---eEEeeeeeecccccchhHHHHHhhhchhHHHHHHHHHhhhhhcc
Confidence 4899999999998888999998776644 899999974321 11222333899999999999999885
No 36
>PRK04044 rps5p 30S ribosomal protein S5P; Reviewed
Probab=45.53 E-value=35 Score=26.35 Aligned_cols=42 Identities=26% Similarity=0.400 Sum_probs=27.9
Q ss_pred CCCccEEEEEEECC-EEEEEeEeccCH--HHHHHHHHHHHHHHHH
Q psy14233 32 KHRQRFLCELRIAG-IGYVGAGNSTNK--KDAQTNAAKDFVQYLV 73 (122)
Q Consensus 32 ~h~~~F~~~V~v~g-~~~~g~G~G~SK--K~Aeq~AA~~Al~~L~ 73 (122)
....+|.+-|.|++ .-++|.|.|+++ ..|-+.|..+|...|.
T Consensus 67 Gr~~sf~alVvVGn~~G~vG~G~GKa~ev~~Ai~kA~~~Akknli 111 (211)
T PRK04044 67 GRRVRFRVVVVVGNRDGYVGVGQGKAKQVGPAIRKAIRNAKLNII 111 (211)
T ss_pred CcEEEEEEEEEEeCCCCeEeeeeeeccchHHHHHHHHHHHHhCeE
Confidence 45666777777732 223577777766 6777778888877775
No 37
>KOG0701|consensus
Probab=44.97 E-value=30 Score=34.13 Aligned_cols=70 Identities=24% Similarity=0.199 Sum_probs=50.1
Q ss_pred hHHHHHHHHHhc-CCCceEEE-EeeCCCCCccEEEE--------------EEECCEEEEEeEeccCHHHHHHHHHHHHHH
Q psy14233 7 VKAFLNEWCQKN-SKQPEFDV-RAAGPKHRQRFLCE--------------LRIAGIGYVGAGNSTNKKDAQTNAAKDFVQ 70 (122)
Q Consensus 7 ~ks~L~E~~qk~-~~~p~Y~~-~~~Gp~h~~~F~~~--------------V~v~g~~~~g~G~G~SKK~Aeq~AA~~Al~ 70 (122)
.++.+++.|-.. |..|.+.. .+.++.|..+|... |.+.+++ ...|.|...+.||..||+.|++
T Consensus 1516 dsg~~~~~~~~~~~a~p~~s~~~E~~~~h~~~~~~~~~~k~~d~~~~~~tv~~~~~~-~~~~~g~~~~~aK~s~~k~A~~ 1594 (1606)
T KOG0701|consen 1516 DSGNMMEPCIEKFWALPPRSPIRELLELHPERALFGKCEKVADAGKVRVTVDVFNKE-VFAGEGRNYRIAKASAAKAALK 1594 (1606)
T ss_pred CcccccchHhhcCcCCCCccchhhhccccceeeccchhhhhhhccceEEEEEecccc-hhhhcchhhhhhhhhHHHHHHH
Confidence 344678888888 88888876 45566666665433 2233333 4678899999999999999999
Q ss_pred HHHHcCC
Q psy14233 71 YLVRSGQ 77 (122)
Q Consensus 71 ~L~~~~~ 77 (122)
.|..++.
T Consensus 1595 ll~~~~~ 1601 (1606)
T KOG0701|consen 1595 LLKKLGL 1601 (1606)
T ss_pred HHHHhhh
Confidence 9976543
No 38
>PRK00394 transcription factor; Reviewed
Probab=43.90 E-value=1e+02 Score=22.90 Aligned_cols=42 Identities=19% Similarity=0.218 Sum_probs=30.7
Q ss_pred CCccEEEEEEECCEEEEEeEeccCHHHHHHHHHHHHHHHHHHcCC
Q psy14233 33 HRQRFLCELRIAGIGYVGAGNSTNKKDAQTNAAKDFVQYLVRSGQ 77 (122)
Q Consensus 33 h~~~F~~~V~v~g~~~~g~G~G~SKK~Aeq~AA~~Al~~L~~~~~ 77 (122)
++|.=++.|+-.|+ ++..| ++|..+|+. |++..++.|.+.|+
T Consensus 43 ~~Pk~t~lIf~sGK-iv~tG-a~S~~~a~~-a~~~~~~~l~~~g~ 84 (179)
T PRK00394 43 EDPKIAALIFRSGK-VVCTG-AKSVEDLHE-AVKIIIKKLKELGI 84 (179)
T ss_pred cCCceEEEEEcCCc-EEEEc-cCCHHHHHH-HHHHHHHHHHHcCC
Confidence 45556666777897 44555 489999988 67888888887665
No 39
>cd04518 TBP_archaea archaeal TATA box binding protein (TBP): TBPs are transcription factors present in archaea and eukaryotes, that recognize promoters and initiate transcription. TBP has been shown to be an essential component of three different transcription initiation complexes: SL1, TFIID and TFIIIB, directing transcription by RNA polymerases I, II and III, respectively. TBP binds directly to the TATA box promoter element, where it nucleates polymerase assembly, thus defining the transcription start site. TBP's binding in the minor groove induces a dramatic DNA bending while its own structure barely changes. The conserved core domain of TBP, which binds to the TATA box, has a bipartite structure, with intramolecular symmetry generating a saddle-shaped structure that sits astride the DNA.
Probab=39.68 E-value=1.3e+02 Score=22.23 Aligned_cols=42 Identities=19% Similarity=0.272 Sum_probs=30.6
Q ss_pred CCccEEEEEEECCEEEEEeEeccCHHHHHHHHHHHHHHHHHHcCC
Q psy14233 33 HRQRFLCELRIAGIGYVGAGNSTNKKDAQTNAAKDFVQYLVRSGQ 77 (122)
Q Consensus 33 h~~~F~~~V~v~g~~~~g~G~G~SKK~Aeq~AA~~Al~~L~~~~~ 77 (122)
+++.=++.|+-.|+ ++..| ++|..+|+. |++..++.|++.|+
T Consensus 44 ~~Pk~t~lIF~SGK-iv~tG-aks~~~a~~-a~~~~~~~L~~~g~ 85 (174)
T cd04518 44 EDPKIAALIFRSGK-MVCTG-AKSVEDLHR-AVKEIIKKLKDYGI 85 (174)
T ss_pred cCCcEEEEEECCCe-EEEEc-cCCHHHHHH-HHHHHHHHHHhcCC
Confidence 34556666777997 45555 478888888 67888888887765
No 40
>KOG3302|consensus
Probab=39.34 E-value=1.1e+02 Score=23.45 Aligned_cols=42 Identities=21% Similarity=0.306 Sum_probs=29.8
Q ss_pred CCccEEEEEEECCEEEEEeEeccCHHHHHHHHHHHHHHHHHHcCC
Q psy14233 33 HRQRFLCELRIAGIGYVGAGNSTNKKDAQTNAAKDFVQYLVRSGQ 77 (122)
Q Consensus 33 h~~~F~~~V~v~g~~~~g~G~G~SKK~Aeq~AA~~Al~~L~~~~~ 77 (122)
+.|+=++.++=.|+ ++..| +.|..+|+. ||+.-.+.|++.|+
T Consensus 65 r~P~~ta~I~ssGK-i~ctg-A~se~~ar~-aark~aRilqkLgf 106 (200)
T KOG3302|consen 65 RSPRTTALIFSSGK-IVCTG-AKSEDSARL-AARKYARILQKLGF 106 (200)
T ss_pred cCCceEEEEecCCc-EEEec-cCCHHHHHH-HHHHHHHHHHHcCC
Confidence 45566777777897 45665 688888887 66666677776664
No 41
>PLN00062 TATA-box-binding protein; Provisional
Probab=38.69 E-value=1.5e+02 Score=22.00 Aligned_cols=64 Identities=14% Similarity=0.235 Sum_probs=38.2
Q ss_pred HHHHHHHhcCCCceEEEEe----eCCCCCccEEEEEEECCEEEEEeEeccCHHHHHHHHHHHHHHHHHHcCC
Q psy14233 10 FLNEWCQKNSKQPEFDVRA----AGPKHRQRFLCELRIAGIGYVGAGNSTNKKDAQTNAAKDFVQYLVRSGQ 77 (122)
Q Consensus 10 ~L~E~~qk~~~~p~Y~~~~----~Gp~h~~~F~~~V~v~g~~~~g~G~G~SKK~Aeq~AA~~Al~~L~~~~~ 77 (122)
.|.+++.... ..+|+=.. .-.-+++.=++.++-.|+ ++..| ++|..+|+. |++..++.|.+.|+
T Consensus 18 dL~~la~~~~-n~eYePe~fpgli~Rl~~Pk~t~lIF~SGK-iviTG-aks~e~a~~-a~~~~~~~L~~lg~ 85 (179)
T PLN00062 18 DLKKIALQAR-NAEYNPKRFAAVIMRIREPKTTALIFASGK-MVCTG-AKSEHDSKL-AARKYARIIQKLGF 85 (179)
T ss_pred cHHHHHhhCC-CCEECCccCcEEEEEeCCCcEEEEEECCCe-EEEEe-cCCHHHHHH-HHHHHHHHHHHcCC
Confidence 4677776542 13443210 000234455677777997 44555 478888887 67777788887765
No 42
>cd00652 TBP_TLF TATA box binding protein (TBP): Present in archaea and eukaryotes, TBPs are transcription factors that recognize promoters and initiate transcription. TBP has been shown to be an essential component of three different transcription initiation complexes: SL1, TFIID and TFIIIB, directing transcription by RNA polymerases I, II and III, respectively. TBP binds directly to the TATA box promoter element, where it nucleates polymerase assembly, thus defining the transcription start site. TBP's binding in the minor groove induces a dramatic DNA bending while its own structure barely changes. The conserved core domain of TBP, which binds to the TATA box, has a bipartite structure, with intramolecular symmetry generating a saddle-shaped structure that sits astride the DNA. New members of the TBP family, called TBP-like proteins (TBLP, TLF, TLP) or TBP-related factors (TRF1, TRF2,TRP), are similar to the core domain of TBPs, with identical or chemically similar amino acids at many
Probab=36.77 E-value=1.7e+02 Score=21.50 Aligned_cols=42 Identities=21% Similarity=0.299 Sum_probs=29.9
Q ss_pred CccEEEEEEECCEEEEEeEeccCHHHHHHHHHHHHHHHHHHcCCC
Q psy14233 34 RQRFLCELRIAGIGYVGAGNSTNKKDAQTNAAKDFVQYLVRSGQV 78 (122)
Q Consensus 34 ~~~F~~~V~v~g~~~~g~G~G~SKK~Aeq~AA~~Al~~L~~~~~~ 78 (122)
++.=++.++-.|+ ++..| ++|..+|+. |++..++.|.+.|+.
T Consensus 45 ~P~~t~lIf~sGK-ivitG-aks~~~~~~-a~~~~~~~L~~~g~~ 86 (174)
T cd00652 45 EPKTTALIFSSGK-MVITG-AKSEEDAKL-AARKYARILQKLGFP 86 (174)
T ss_pred CCcEEEEEECCCE-EEEEe-cCCHHHHHH-HHHHHHHHHHHcCCC
Confidence 4556666677997 44555 488999988 678888888776643
No 43
>PF12909 DUF3832: Protein of unknown function (DUF3832); InterPro: IPR024308 This domain found in bacteria and Acanthamoeba polyphaga mimivirus (APMV) L591 protein has no known function. In Q0AZ30 from SWISSPROT and other uncharacterised proteins this domain is found C-terminal, while the N terminus shows remote homology to the bacterial toxin/antitoxin 'addiction module' (PF12910 from PFAM).; PDB: 3K6Q_C.
Probab=35.13 E-value=1.3e+02 Score=20.00 Aligned_cols=32 Identities=13% Similarity=0.241 Sum_probs=24.6
Q ss_pred EEEECCEEEEEeEeccCHHHHHHHHHHHHHHHHH
Q psy14233 40 ELRIAGIGYVGAGNSTNKKDAQTNAAKDFVQYLV 73 (122)
Q Consensus 40 ~V~v~g~~~~g~G~G~SKK~Aeq~AA~~Al~~L~ 73 (122)
.+.+.+..+ .|.|.|+.+|....+..+++.-.
T Consensus 20 ~~~l~~ldi--~~~g~T~eeA~~~lie~l~dYAe 51 (89)
T PF12909_consen 20 TVSLPELDI--YANGPTLEEAIEDLIEDLRDYAE 51 (89)
T ss_dssp EEEECCGTE--EEEESSHHHHHHHHHHHHHHHHH
T ss_pred EEEecCeeE--EecCCCHHHHHHHHHHHHHHHHH
Confidence 677888764 46669999999998888877553
No 44
>PF12724 Flavodoxin_5: Flavodoxin domain
Probab=32.42 E-value=31 Score=23.96 Aligned_cols=21 Identities=24% Similarity=-0.135 Sum_probs=17.8
Q ss_pred EEEeEeccCHHHHHHHHHHHH
Q psy14233 48 YVGAGNSTNKKDAQTNAAKDF 68 (122)
Q Consensus 48 ~~g~G~G~SKK~Aeq~AA~~A 68 (122)
+|.++.|.||+.|+..|...-
T Consensus 3 vY~S~~G~Tk~~A~~ia~~l~ 23 (143)
T PF12724_consen 3 VYFSKTGNTKKIAEWIAEKLG 23 (143)
T ss_pred EEECCCchHHHHHHHHHHHHh
Confidence 478899999999999887654
No 45
>cd04517 TLF TBP-like factors (TLF; also called TLP, TRF, TRP), which are found in most metazoans. TLFs and TBPs have well-conserved core domains; however, they only share about 60% similarity. TLFs, like TBPs, interact with TFIIA and TFIIB, which are part of the basal transcription machinery. Yet, in contrast to TBPs, TLFs seem not to interact with the TATA-box and even have a negative effect on the transcription of TATA-containing promoters. Recent results indicate that TLFs are involved in the transcription via TATA-less promoters.
Probab=32.22 E-value=2.1e+02 Score=21.11 Aligned_cols=63 Identities=19% Similarity=0.243 Sum_probs=37.7
Q ss_pred HHHHHHHhcC---CCceEE-EEeeCCCCCccEEEEEEECCEEEEEeEeccCHHHHHHHHHHHHHHHHHHcCC
Q psy14233 10 FLNEWCQKNS---KQPEFD-VRAAGPKHRQRFLCELRIAGIGYVGAGNSTNKKDAQTNAAKDFVQYLVRSGQ 77 (122)
Q Consensus 10 ~L~E~~qk~~---~~p~Y~-~~~~Gp~h~~~F~~~V~v~g~~~~g~G~G~SKK~Aeq~AA~~Al~~L~~~~~ 77 (122)
-|.+++.... ..|.|- +.-. -+++.=++.++-.|+ ++..| ++|..+|+. |++..++.|++-|+
T Consensus 19 dL~~la~~l~n~eYeP~fpgli~R--~~~Pk~t~lIF~sGK-iviTG-aks~~~~~~-a~~~~~~~l~~~g~ 85 (174)
T cd04517 19 DLRKLALAGRNVEYNPRYPKVTMR--LREPRATASVWSSGK-ITITG-ATSEEEAKQ-AARRAARLLQKLGF 85 (174)
T ss_pred cHHHHHhhCCCCEEeCCCCEEEEE--ecCCcEEEEEECCCe-EEEEc-cCCHHHHHH-HHHHHHHHHHHcCC
Confidence 4677775532 233332 1111 134555666666897 45555 478888888 57888888876654
No 46
>PF00333 Ribosomal_S5: Ribosomal protein S5, N-terminal domain; InterPro: IPR013810 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites [, ]. About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits. Many ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome [, ]. Ribosomal protein S5 is one of the proteins from the small ribosomal subunit, and is a protein of 166 to 254 amino-acid residues. In Escherichia coli, S5 is known to be important in the assembly and function of the 30S ribosomal subunit. Mutations in S5 have been shown to increase translational error frequencies. It belongs to a family of ribosomal proteins which, on the basis of sequence similarities [, ], groups bacterial, cyanelle, red algal chloroplast, archaeal and fungal mitochondrial S5; mammalian, Caenorhabditis elegans, Drosophila and plant S2; and yeast S4 (SUP44). This entry represents the N-terminal domain of ribosomal protein S5, which has an alpha-beta(3)-alpha structure that folds into two layers, alpha/beta.; GO: 0003723 RNA binding, 0003735 structural constituent of ribosome, 0006412 translation, 0005622 intracellular, 0005840 ribosome; PDB: 2ZKQ_e 2XZN_E 2XZM_E 3O30_B 1S1H_E 3IZB_E 3U5C_C 3U5G_C 3O2Z_B 3BBN_E ....
Probab=28.86 E-value=1.5e+02 Score=18.49 Aligned_cols=41 Identities=20% Similarity=0.225 Sum_probs=24.4
Q ss_pred CCCccEEEEEEEC---CEEEEEeEeccCHHHHHHHHHHHHHHHH
Q psy14233 32 KHRQRFLCELRIA---GIGYVGAGNSTNKKDAQTNAAKDFVQYL 72 (122)
Q Consensus 32 ~h~~~F~~~V~v~---g~~~~g~G~G~SKK~Aeq~AA~~Al~~L 72 (122)
.....|.+-|.++ |..=+|.|.+..-..|-+.|-++|...|
T Consensus 19 Gr~~~~~alvvvGn~~G~vG~G~gKa~~~~~Ai~kA~~~A~knl 62 (67)
T PF00333_consen 19 GRIFSFRALVVVGNGNGLVGFGVGKAKEVPDAIRKAKRKAKKNL 62 (67)
T ss_dssp EEEEEEEEEEEEECSSSEEEEEEEEESSHHHHHHHHHHHHHCSE
T ss_pred CceeEEEEEEEEecCCCcEecCcccchhHHHHHHHHHHHHHhCC
Confidence 4555677766663 4311344555555778888887776543
No 47
>PF00352 TBP: Transcription factor TFIID (or TATA-binding protein, TBP); InterPro: IPR000814 The TATA-box binding protein (TBP) is required for the initiation of transcription by RNA polymerases I, II and III, from promoters with or without a TATA box [, ]. TBP associates with a host of factors, including the general transcription factors TFIIA, -B, -D, -E, and -H, to form huge multi-subunit pre-initiation complexes on the core promoter. Through its association with different transcription factors, TBP can initiate transcription from different RNA polymerases. There are several related TBPs, including TBP-like (TBPL) proteins []. The C-terminal core of TBP (~180 residues) is highly conserved and contains two 77-amino acid repeats that produce a saddle-shaped structure that straddles the DNA; this region binds to the TATA box and interacts with transcription factors and regulatory proteins []. By contrast, the N-terminal region varies in both length and sequence.; GO: 0003677 DNA binding, 0006355 regulation of transcription, DNA-dependent, 0006367 transcription initiation from RNA polymerase II promoter; PDB: 1D3U_A 1PCZ_B 1AIS_A 1NGM_A 1TBP_A 1TBA_B 1YTB_A 1RM1_A 1YTF_A 1NH2_A ....
Probab=28.57 E-value=1.6e+02 Score=18.74 Aligned_cols=38 Identities=16% Similarity=0.237 Sum_probs=26.4
Q ss_pred CccEEEEEEECCEEEEEeEeccCHHHHHHHHHHHHHHHHHH
Q psy14233 34 RQRFLCELRIAGIGYVGAGNSTNKKDAQTNAAKDFVQYLVR 74 (122)
Q Consensus 34 ~~~F~~~V~v~g~~~~g~G~G~SKK~Aeq~AA~~Al~~L~~ 74 (122)
++.-++.|+-.|+ ++-.| ++|.++|++ |++..+..|.+
T Consensus 47 ~p~~t~~IF~sGk-i~itG-aks~~~~~~-a~~~i~~~L~~ 84 (86)
T PF00352_consen 47 NPKATVLIFSSGK-IVITG-AKSEEEAKK-AIEKILPILQK 84 (86)
T ss_dssp TTTEEEEEETTSE-EEEEE-ESSHHHHHH-HHHHHHHHHHH
T ss_pred CCcEEEEEEcCCE-EEEEe-cCCHHHHHH-HHHHHHHHHHH
Confidence 4466666666897 34445 479999988 67777777764
No 48
>TIGR01020 rpsE_arch ribosomal protein S5(archaeal type)/S2(eukaryote cytosolic type). This model finds eukaryotic ribosomal protein S2 as well as archaeal ribosomal protein S5.
Probab=28.33 E-value=88 Score=24.21 Aligned_cols=49 Identities=27% Similarity=0.327 Sum_probs=30.4
Q ss_pred CCCccEEEEEEECC-EEEEEeEeccCH--HHHHHHHHHHHHHHHH--Hc--CCCCC
Q psy14233 32 KHRQRFLCELRIAG-IGYVGAGNSTNK--KDAQTNAAKDFVQYLV--RS--GQVNE 80 (122)
Q Consensus 32 ~h~~~F~~~V~v~g-~~~~g~G~G~SK--K~Aeq~AA~~Al~~L~--~~--~~~~~ 80 (122)
....+|.+-|.|++ .-++|.|.|++| ..|-+.|...|...|. .. |+|..
T Consensus 67 Gr~~~F~alVvVGn~~G~vG~G~GKa~ev~~AI~kA~~~Ak~n~i~V~rg~g~w~~ 122 (212)
T TIGR01020 67 GRRTRFRAFVVVGNRDGYVGLGIGKAKEVAPAIRKAIINAKLNIIPVRRGCGSWEC 122 (212)
T ss_pred CcEEEEEEEEEEeCCCCeEeeceeecchHHHHHHHHHHHHHhCEEEEecccccccc
Confidence 46667777777732 223677777777 5566666666666665 23 66764
No 49
>PHA00280 putative NHN endonuclease
Probab=28.13 E-value=2.1e+02 Score=20.00 Aligned_cols=31 Identities=6% Similarity=-0.114 Sum_probs=22.5
Q ss_pred CCccEEEEEEECCEEEEEeEeccCHHHHHHHH
Q psy14233 33 HRQRFLCELRIAGIGYVGAGNSTNKKDAQTNA 64 (122)
Q Consensus 33 h~~~F~~~V~v~g~~~~g~G~G~SKK~Aeq~A 64 (122)
....|.+.|.++|+. +..|.-.+..+|..+-
T Consensus 77 ~~~kw~A~I~~~gK~-~~lG~f~~~e~A~~a~ 107 (121)
T PHA00280 77 EREMWRGTVTAEGKQ-HNFRSRDLLEVVAWIY 107 (121)
T ss_pred CCCeEEEEEEECCEE-EEcCCCCCHHHHHHHH
Confidence 456799999999987 4677766666655544
No 50
>PF10195 Phospho_p8: DNA-binding nuclear phosphoprotein p8; InterPro: IPR018792 P8 is a short 80-82 amino acid protein that is conserved from nematodes to humans. It carries at least one protein kinase C domain suggesting a possible role in signal transduction and it is thought to be a phosphoprotein, but the sites of phosphorylation and the kinases involved remain to be determined [].
Probab=27.57 E-value=30 Score=21.57 Aligned_cols=14 Identities=36% Similarity=0.672 Sum_probs=11.5
Q ss_pred EeEeccCHHHHHHH
Q psy14233 50 GAGNSTNKKDAQTN 63 (122)
Q Consensus 50 g~G~G~SKK~Aeq~ 63 (122)
|.|.|+||++|...
T Consensus 23 GsgKgRTK~E~~~h 36 (60)
T PF10195_consen 23 GSGKGRTKKEAAEH 36 (60)
T ss_pred CCCCCccHHHHHHh
Confidence 67888999998764
No 51
>PRK12359 flavodoxin FldB; Provisional
Probab=27.11 E-value=41 Score=24.80 Aligned_cols=20 Identities=10% Similarity=0.092 Sum_probs=17.5
Q ss_pred EEEeEeccCHHHHHHHHHHH
Q psy14233 48 YVGAGNSTNKKDAQTNAAKD 67 (122)
Q Consensus 48 ~~g~G~G~SKK~Aeq~AA~~ 67 (122)
+|++..|+||+.|+..|.+.
T Consensus 6 ~Y~S~TGNTe~vAe~I~~~l 25 (172)
T PRK12359 6 FYGSSTCYTEMAAEKIRDII 25 (172)
T ss_pred EEECCCCHHHHHHHHHHHHh
Confidence 68999999999999988653
No 52
>PRK09267 flavodoxin FldA; Validated
Probab=25.12 E-value=48 Score=23.52 Aligned_cols=20 Identities=15% Similarity=0.200 Sum_probs=17.2
Q ss_pred EEEeEeccCHHHHHHHHHHH
Q psy14233 48 YVGAGNSTNKKDAQTNAAKD 67 (122)
Q Consensus 48 ~~g~G~G~SKK~Aeq~AA~~ 67 (122)
+|++..|.|++.|+..|...
T Consensus 7 iY~S~tGnT~~vA~~Ia~~l 26 (169)
T PRK09267 7 FFGSDTGNTEDIAKMIQKKL 26 (169)
T ss_pred EEECCCChHHHHHHHHHHHh
Confidence 57999999999999987653
No 53
>PRK05569 flavodoxin; Provisional
Probab=22.70 E-value=56 Score=22.26 Aligned_cols=19 Identities=16% Similarity=-0.143 Sum_probs=15.8
Q ss_pred EEEeEeccCHHHHHHHHHH
Q psy14233 48 YVGAGNSTNKKDAQTNAAK 66 (122)
Q Consensus 48 ~~g~G~G~SKK~Aeq~AA~ 66 (122)
+|+++.|+|++.|+..|..
T Consensus 7 iY~S~tGnT~~iA~~i~~~ 25 (141)
T PRK05569 7 IYWSCGGNVEVLANTIADG 25 (141)
T ss_pred EEECCCCHHHHHHHHHHHH
Confidence 4788999999999987753
No 54
>TIGR01752 flav_long flavodoxin, long chain. Flavodoxins are small redox-active proteins with a flavin mononucleotide (FMN) prosthetic group. They can act in nitrogen fixation by nitrogenase, in sulfite reduction, and light-dependent NADP+ reduction in during photosynthesis, among other roles. This model describes the long chain type, typical for nitrogen fixation but associated with pyruvate formate-lyase activation and cobalamin-dependent methionine synthase activity in E. coli.
Probab=21.05 E-value=64 Score=23.09 Aligned_cols=19 Identities=11% Similarity=0.204 Sum_probs=16.6
Q ss_pred EEEeEeccCHHHHHHHHHH
Q psy14233 48 YVGAGNSTNKKDAQTNAAK 66 (122)
Q Consensus 48 ~~g~G~G~SKK~Aeq~AA~ 66 (122)
+|++..|+|++.|+..|..
T Consensus 5 iY~S~tGnTe~vA~~Ia~~ 23 (167)
T TIGR01752 5 FYGTDTGNTEGIAEKIQKE 23 (167)
T ss_pred EEECCCChHHHHHHHHHHH
Confidence 5788999999999998865
No 55
>PRK03739 2-isopropylmalate synthase; Validated
Probab=20.75 E-value=5.5e+02 Score=22.43 Aligned_cols=67 Identities=19% Similarity=0.182 Sum_probs=40.4
Q ss_pred ChHHHHHHHHHhc-C---CCceEEEEe--eCCCCCccEEEEEEECCEEEEEeEeccCHHHHHHHHHHHHHHHH
Q psy14233 6 LVKAFLNEWCQKN-S---KQPEFDVRA--AGPKHRQRFLCELRIAGIGYVGAGNSTNKKDAQTNAAKDFVQYL 72 (122)
Q Consensus 6 n~ks~L~E~~qk~-~---~~p~Y~~~~--~Gp~h~~~F~~~V~v~g~~~~g~G~G~SKK~Aeq~AA~~Al~~L 72 (122)
-|...|..-.++. + ..-.|.+.. .|.+..-.=.+++..+|..+.|.|.....-+|--.|=-.|+++|
T Consensus 478 GPVdAl~~AL~~~~~~~v~L~dY~v~al~~GtdA~~~v~I~~~~~g~~~~G~Gvs~DIi~AS~~Ali~a~n~l 550 (552)
T PRK03739 478 GPIDAFVNALSQALGVDVRVLDYEEHALGAGSDAQAAAYVELRVGGRTVFGVGIDANIVTASLKAVVSAVNRA 550 (552)
T ss_pred CHHHHHHHHHHHHhCCCcEEEEEEEEccCCCCCeEEEEEEEEEECCeEEEEEEeCCCHHHHHHHHHHHHHHhh
Confidence 4566665555543 3 345566542 23232233333444478877888888888888877777777765
No 56
>PF12327 FtsZ_C: FtsZ family, C-terminal domain; InterPro: IPR024757 The FtsZ family of proteins are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial cell division. It is part of a ring in the middle of the dividing cell that is required for constriction of cell membrane and cell envelope to yield two daughter cells. FtsZ is a GTPase, like tubulin []. FtsZ can polymerise into tubes, sheets, and rings in vitro and is ubiquitous in eubacteria and archaea []. This entry represents a domain of FtsZ. In most FtsZ proteins is found in the C terminus, except in some alphaproteobacteria proteins where there is an extension C-terminal domain TIGR03483 from TIGRFAMs.; PDB: 2RHO_B 2RHJ_A 2VXY_A 2RHL_B 2RHH_A 2VAM_A 1W5F_B 2R75_1 2R6R_1 1RQ7_A ....
Probab=20.65 E-value=1e+02 Score=20.28 Aligned_cols=22 Identities=18% Similarity=0.451 Sum_probs=13.1
Q ss_pred EEEeEeccCHHHHHHHHHHHHHH
Q psy14233 48 YVGAGNSTNKKDAQTNAAKDFVQ 70 (122)
Q Consensus 48 ~~g~G~G~SKK~Aeq~AA~~Al~ 70 (122)
++|.|.|+....|+. |+++||.
T Consensus 4 ~iGiG~a~G~~r~~~-Av~~Al~ 25 (95)
T PF12327_consen 4 MIGIGEASGENRAEE-AVEQALN 25 (95)
T ss_dssp EEEEEEEESTTHHHH-HHHHHHT
T ss_pred EEEEEEECCccHHHH-HHHHHHh
Confidence 456676666555544 6666664
No 57
>PF05165 GGDN: GGDN family; InterPro: IPR007839 GTP cyclohydrolase III catalyses the formation of 2-amino-5-formylamino-6- ribofuranosylamino-4(3H)-pyrimidinone ribonucleotide monophosphate and inorganic phosphate from GTP. The enzyme also has an independent pyrophosphate phosphohydrolase activity. The proteins are 200-270 amino acids in length.; GO: 0003933 GTP cyclohydrolase activity, 0009058 biosynthetic process; PDB: 2QV6_B.
Probab=20.03 E-value=1.9e+02 Score=22.91 Aligned_cols=25 Identities=24% Similarity=0.323 Sum_probs=21.6
Q ss_pred EEeEeccCHHHHHHHHHHHHHHHHHH
Q psy14233 49 VGAGNSTNKKDAQTNAAKDFVQYLVR 74 (122)
Q Consensus 49 ~g~G~G~SKK~Aeq~AA~~Al~~L~~ 74 (122)
+|+|.|++=.+|-. .|..||+.++.
T Consensus 210 vGIGig~ta~~A~~-~At~ALe~iR~ 234 (246)
T PF05165_consen 210 VGIGIGKTARDAGM-LATKALETIRE 234 (246)
T ss_dssp EEEEEESSHHHHHH-HHHHHHHHHHT
T ss_pred EeecCCCCHHHHHH-HHHHHHHHHHh
Confidence 69999999999987 67789999874
Done!