Query psy16701
Match_columns 272
No_of_seqs 207 out of 593
Neff 5.6
Searched_HMMs 46136
Date Fri Aug 16 19:49:02 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy16701.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/16701hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 COG2101 SPT15 TATA-box binding 100.0 1.4E-62 3E-67 420.6 20.0 177 32-208 4-184 (185)
2 PRK00394 transcription factor; 100.0 4.8E-61 1E-65 418.3 23.4 173 36-208 1-177 (179)
3 cd04518 TBP_archaea archaeal T 100.0 5.5E-61 1.2E-65 416.2 23.1 172 35-206 1-174 (174)
4 PLN00062 TATA-box-binding prot 100.0 1E-59 2.2E-64 409.8 22.9 172 36-207 2-175 (179)
5 cd04516 TBP_eukaryotes eukaryo 100.0 1.3E-59 2.8E-64 407.6 23.3 171 36-206 2-174 (174)
6 cd00652 TBP_TLF TATA box bindi 100.0 8.7E-59 1.9E-63 402.4 23.1 171 35-205 1-174 (174)
7 cd04517 TLF TBP-like factors ( 100.0 2E-57 4.3E-62 394.0 23.1 170 35-205 2-174 (174)
8 KOG3302|consensus 100.0 1.5E-57 3.2E-62 395.3 20.5 176 31-206 18-195 (200)
9 cd04516 TBP_eukaryotes eukaryo 100.0 1.1E-57 2.4E-62 395.5 19.3 148 125-272 2-149 (174)
10 COG2101 SPT15 TATA-box binding 100.0 7E-58 1.5E-62 391.6 16.3 151 122-272 5-157 (185)
11 PLN00062 TATA-box-binding prot 100.0 2.6E-57 5.7E-62 394.7 19.0 148 125-272 2-149 (179)
12 PRK00394 transcription factor; 100.0 1.8E-54 4E-59 376.9 19.5 148 125-272 1-150 (179)
13 cd04518 TBP_archaea archaeal T 100.0 2E-54 4.4E-59 375.1 19.1 148 124-272 1-149 (174)
14 cd00652 TBP_TLF TATA box bindi 100.0 2.2E-54 4.8E-59 374.9 19.3 149 124-272 1-150 (174)
15 KOG3302|consensus 100.0 1.8E-54 3.9E-59 376.0 17.2 152 121-272 19-170 (200)
16 cd04517 TLF TBP-like factors ( 100.0 5.7E-54 1.2E-58 372.3 19.4 148 124-272 2-150 (174)
17 PF00352 TBP: Transcription fa 100.0 3.8E-30 8.2E-35 199.1 11.8 86 122-207 1-86 (86)
18 PF00352 TBP: Transcription fa 100.0 3.1E-28 6.7E-33 188.4 11.2 84 35-118 3-86 (86)
19 PF11858 DUF3378: Domain of un 96.4 0.003 6.5E-08 48.7 3.3 34 158-191 27-60 (81)
20 PF11858 DUF3378: Domain of un 95.9 0.014 3E-07 45.1 4.7 47 53-100 12-58 (81)
21 COG4871 Uncharacterized protei 95.5 0.086 1.9E-06 45.9 8.2 73 131-206 37-109 (193)
22 COG4871 Uncharacterized protei 94.5 0.16 3.6E-06 44.2 7.4 63 52-117 47-109 (193)
23 TIGR00716 rnhC ribonuclease HI 93.0 0.16 3.5E-06 47.7 5.1 33 155-187 22-54 (284)
24 COG1039 RnhC Ribonuclease HIII 92.6 0.28 6.1E-06 46.3 6.0 42 158-199 28-69 (297)
25 TIGR00716 rnhC ribonuclease HI 92.1 0.23 5.1E-06 46.6 4.9 36 66-101 22-57 (284)
26 PRK00996 ribonuclease HIII; Pr 92.0 0.26 5.6E-06 46.7 5.2 32 156-187 26-57 (304)
27 COG1039 RnhC Ribonuclease HIII 90.0 0.5 1.1E-05 44.6 4.8 40 69-108 28-67 (297)
28 PRK00996 ribonuclease HIII; Pr 89.2 0.59 1.3E-05 44.3 4.8 34 67-100 26-59 (304)
29 PRK12475 thiamine/molybdopteri 88.5 1.1 2.4E-05 42.8 6.2 57 49-110 277-336 (338)
30 PRK07688 thiamine/molybdopteri 85.7 1.9 4.2E-05 41.3 6.1 59 49-110 277-337 (339)
31 PRK12475 thiamine/molybdopteri 74.2 12 0.00026 35.8 7.3 35 163-199 302-336 (338)
32 PF13575 DUF4135: Domain of un 71.5 23 0.00051 33.9 8.7 49 188-237 129-177 (370)
33 PRK15468 carboxysome structura 67.6 9.5 0.0002 31.2 4.2 35 174-210 70-105 (111)
34 PRK07688 thiamine/molybdopteri 66.2 14 0.00031 35.4 5.9 38 162-199 300-337 (339)
35 TIGR02187 GlrX_arch Glutaredox 61.6 50 0.0011 29.0 8.1 113 82-197 79-213 (215)
36 PLN00410 U5 snRNP protein, DIM 56.1 25 0.00053 29.8 5.0 81 125-205 25-125 (142)
37 PRK15468 carboxysome structura 51.0 24 0.00051 28.9 3.8 31 88-120 73-104 (111)
38 cd01554 EPT-like Enol pyruvate 46.4 16 0.00036 34.8 2.7 106 88-212 239-345 (408)
39 cd03065 PDI_b_Calsequestrin_N 42.4 38 0.00083 27.6 3.9 30 171-200 89-119 (120)
40 PLN00410 U5 snRNP protein, DIM 39.8 46 0.001 28.2 4.1 62 33-94 22-92 (142)
41 PRK09381 trxA thioredoxin; Pro 39.6 40 0.00087 25.7 3.5 28 172-199 78-107 (109)
42 PTZ00129 40S ribosomal protein 39.1 1.2E+02 0.0026 26.1 6.6 53 68-120 35-91 (149)
43 KOG0910|consensus 37.0 52 0.0011 28.4 4.0 77 124-201 62-149 (150)
44 PTZ00129 40S ribosomal protein 35.3 1E+02 0.0022 26.6 5.5 51 159-209 37-91 (149)
45 cd01644 RT_pepA17 RT_pepA17: R 34.5 47 0.001 29.6 3.6 30 183-212 145-174 (213)
46 COG4978 Transcriptional regula 33.2 1.1E+02 0.0024 26.2 5.4 101 99-213 24-125 (153)
47 PRK11509 hydrogenase-1 operon 33.2 79 0.0017 26.5 4.5 30 171-200 93-124 (132)
48 PRK13663 hypothetical protein; 32.5 85 0.0018 31.7 5.2 133 90-253 109-261 (493)
49 COG0533 QRI7 Metal-dependent p 32.2 1.2E+02 0.0027 29.5 6.2 51 69-120 125-176 (342)
50 PHA02278 thioredoxin-like prot 31.8 63 0.0014 25.4 3.5 24 171-194 74-99 (103)
51 TIGR00090 iojap_ybeB iojap-lik 31.2 91 0.002 24.5 4.3 30 180-210 32-61 (99)
52 PF06200 tify: tify domain; I 29.4 91 0.002 20.5 3.3 27 81-107 5-31 (36)
53 PRK10996 thioredoxin 2; Provis 29.1 76 0.0016 26.0 3.7 28 172-199 109-138 (139)
54 cd02963 TRX_DnaJ TRX domain, D 27.4 69 0.0015 25.0 3.0 27 172-198 82-110 (111)
55 PF04461 DUF520: Protein of un 26.4 65 0.0014 28.1 2.9 143 43-216 2-160 (160)
56 cd03065 PDI_b_Calsequestrin_N 25.7 90 0.002 25.4 3.5 29 82-110 89-118 (120)
57 PF13192 Thioredoxin_3: Thiore 25.4 78 0.0017 23.0 2.9 21 176-196 54-75 (76)
58 PRK05412 putative nucleotide-b 24.8 80 0.0017 27.6 3.1 147 44-216 3-160 (161)
59 KOG0910|consensus 24.6 97 0.0021 26.7 3.6 74 33-112 60-149 (150)
60 cd02948 TRX_NDPK TRX domain, T 24.4 1E+02 0.0022 23.5 3.5 26 172-198 74-101 (102)
61 PRK09381 trxA thioredoxin; Pro 24.0 90 0.002 23.7 3.1 28 83-110 78-107 (109)
62 cd00148 PROF Profilin binds ac 23.8 2.3E+02 0.0051 23.0 5.7 44 76-119 80-127 (127)
63 PF07960 CBP4: CBP4; InterPro 23.5 25 0.00054 29.5 -0.2 18 10-27 12-29 (128)
64 KOG4180|consensus 23.0 89 0.0019 30.5 3.4 71 174-260 256-334 (395)
65 PF00085 Thioredoxin: Thioredo 22.9 1.7E+02 0.0037 21.3 4.4 27 172-198 74-102 (103)
66 TIGR00090 iojap_ybeB iojap-lik 22.9 1.5E+02 0.0032 23.3 4.2 31 89-120 30-60 (99)
67 PF03135 CagE_TrbE_VirB: CagE, 22.4 1E+02 0.0023 26.7 3.5 30 181-211 153-182 (205)
68 smart00392 PROF Profilin. Bind 21.7 3.1E+02 0.0068 22.2 6.1 42 78-119 84-129 (129)
69 PF05005 Ocnus: Janus/Ocnus fa 21.0 1.5E+02 0.0033 24.0 3.9 56 149-209 4-60 (108)
70 COG3445 Acid-induced glycyl ra 20.8 31 0.00068 28.0 -0.1 51 135-196 68-119 (127)
71 COG0100 RpsK Ribosomal protein 20.1 4.4E+02 0.0095 22.2 6.5 52 69-120 26-80 (129)
No 1
>COG2101 SPT15 TATA-box binding protein (TBP), component of TFIID and TFIIIB [Transcription]
Probab=100.00 E-value=1.4e-62 Score=420.60 Aligned_cols=177 Identities=36% Similarity=0.542 Sum_probs=169.9
Q ss_pred cCCcceEEEEEEEEEeCCccChhhHhhhCCCceecCCCcceEEEEEcCCceEEEeecCCcEEEEecCChhHHHHHHHHHH
Q psy16701 32 KNRRGTNLNMVGTISVGCPLDLNQINSRVRYSEYNPGKFHGLIMKILNPRTTCLAFQSGKLLILGAKHEHDCKLASRKFA 111 (272)
Q Consensus 32 ~~~~~~i~NvV~s~~l~~~ldL~~la~~~~n~eY~P~~f~~l~~R~~~P~~t~lIf~sGKiv~tGakS~~~a~~a~~k~~ 111 (272)
.+..++|+|||||++|+++|||++++..++|+||||++|||+++|+++|++++|||+||||+||||||++|++.|.++++
T Consensus 4 ~~~~i~IeNIVAS~~L~~elDL~~~~~~l~~aeYnP~qFpGlv~Rl~ePk~a~LIF~SGK~VcTGaKs~ed~~~av~~~~ 83 (185)
T COG2101 4 SEPTITIENIVASVDLGQELDLEEVALDLPGAEYNPEQFPGLVYRLEEPKTAALIFRSGKVVCTGAKSVEDVHRAVKKLA 83 (185)
T ss_pred CCCccEEEEEEEEechhhhccHHHHHhhCCCCccCHhHCCeeEEEecCCcceEEEEecCcEEEeccCcHHHHHHHHHHHH
Confidence 34578999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred HHHHhcCCCc--eeeeEEEEEEEEEEeCCCCChhhHhhcCC--CeEEcCCCcceEEEEEcCCeeEEEEecCCeEEEEccc
Q psy16701 112 KILKQLGHPE--IIFFFFRNMVGTISVGCPLDLNQINSRVR--YSEYNPGKFHGLIMKILNPRTTCLAFQSGKLLILGAK 187 (272)
Q Consensus 112 ~~L~~lg~~~--~~~~~I~NVVas~~l~~~ldL~~la~~~~--n~eYePe~fpgli~R~~~p~~t~lIF~SGKivitGak 187 (272)
++|+++|.+. .+.++|||||||+|++.++||+.+|..+. |++|||||||||+||+.+|++++|||+|||+||||||
T Consensus 84 ~~L~~~g~~~~~~p~i~iQNIVaSadL~~~lnL~~iA~~lg~e~~eYEPEqFPGLVYRl~~P~VV~LiF~SGK~ViTGaK 163 (185)
T COG2101 84 KKLKDGGIDIDFEPEIKVQNIVASADLGVELNLNAIAIGLGLENIEYEPEQFPGLVYRLDEPRVVLLLFGSGKLVITGAK 163 (185)
T ss_pred HHHHhcCcCcCCCCceEEEEEEEEeccCccccHHHHHHhccccccccccccCCeeEEEcCCCCEEEEEecCCcEEEecCC
Confidence 9999998763 57999999999999999999999998765 5999999999999999999999999999999999999
Q ss_pred CHHHHHHHHHHHHHHHHhCCC
Q psy16701 188 HEHDCKLASRKFAKILKQLGH 208 (272)
Q Consensus 188 S~e~~~~A~~ki~~iL~klg~ 208 (272)
+++|++.|++++.+.|+++|.
T Consensus 164 ~~ed~~~Av~~i~~~L~elgl 184 (185)
T COG2101 164 SEEDAEQAVEKIQSRLEELGL 184 (185)
T ss_pred CHHHHHHHHHHHHHHHHHhcc
Confidence 999999999999999999974
No 2
>PRK00394 transcription factor; Reviewed
Probab=100.00 E-value=4.8e-61 Score=418.25 Aligned_cols=173 Identities=35% Similarity=0.549 Sum_probs=168.7
Q ss_pred ceEEEEEEEEEeCCccChhhHhhhCCCceecCCCcceEEEEEcCCceEEEeecCCcEEEEecCChhHHHHHHHHHHHHHH
Q psy16701 36 GTNLNMVGTISVGCPLDLNQINSRVRYSEYNPGKFHGLIMKILNPRTTCLAFQSGKLLILGAKHEHDCKLASRKFAKILK 115 (272)
Q Consensus 36 ~~i~NvV~s~~l~~~ldL~~la~~~~n~eY~P~~f~~l~~R~~~P~~t~lIf~sGKiv~tGakS~~~a~~a~~k~~~~L~ 115 (272)
++|+|||||++|+++|||++|+..++|+||||++|||++||+++|++|++||+||||+||||+|+++++.|+++++++|+
T Consensus 1 i~i~NvVas~~l~~~ldL~~la~~~~n~eYePe~fpgli~Rl~~Pk~t~lIf~sGKiv~tGa~S~~~a~~a~~~~~~~l~ 80 (179)
T PRK00394 1 IKIENIVASTDLGQELDLEKVAEDLPNAEYNPEQFPGLVYRLEDPKIAALIFRSGKVVCTGAKSVEDLHEAVKIIIKKLK 80 (179)
T ss_pred CEEEEEEEEEEcCCCcCHHHHHhhCCCceeCcccCceEEEEecCCceEEEEEcCCcEEEEccCCHHHHHHHHHHHHHHHH
Confidence 47999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred hcCCCc--eeeeEEEEEEEEEEeCCCCChhhHhhcC--CCeEEcCCCcceEEEEEcCCeeEEEEecCCeEEEEcccCHHH
Q psy16701 116 QLGHPE--IIFFFFRNMVGTISVGCPLDLNQINSRV--RYSEYNPGKFHGLIMKILNPRTTCLAFQSGKLLILGAKHEHD 191 (272)
Q Consensus 116 ~lg~~~--~~~~~I~NVVas~~l~~~ldL~~la~~~--~n~eYePe~fpgli~R~~~p~~t~lIF~SGKivitGakS~e~ 191 (272)
++|++. ..+++|+|||||++++++|||++++..+ ++++||||+||||+||+.+|+++++||+||||+||||||++|
T Consensus 81 ~~g~~~~~~~~~~i~NiVas~~l~~~i~L~~la~~~~~~~~~YePe~fPglvyR~~~pk~~~lIF~SGKvvitGaks~~~ 160 (179)
T PRK00394 81 ELGIKVIDEPEIKVQNIVASADLGVELNLNAIAIGLGLENIEYEPEQFPGLVYRLDDPKVVVLLFGSGKLVITGAKSEED 160 (179)
T ss_pred HcCCCccCCCceEEEEEEEEEEcCCeEcHHHHHHhcCcCCcEECcccCceEEEEecCCcEEEEEEcCCEEEEEecCCHHH
Confidence 999985 5799999999999999999999999876 899999999999999999999999999999999999999999
Q ss_pred HHHHHHHHHHHHHhCCC
Q psy16701 192 CKLASRKFAKILKQLGH 208 (272)
Q Consensus 192 ~~~A~~ki~~iL~klg~ 208 (272)
++.|+++++++|+++|.
T Consensus 161 ~~~a~~~i~~~l~~~g~ 177 (179)
T PRK00394 161 AEKAVEKILEKLEELGL 177 (179)
T ss_pred HHHHHHHHHHHHHHcCC
Confidence 99999999999999986
No 3
>cd04518 TBP_archaea archaeal TATA box binding protein (TBP): TBPs are transcription factors present in archaea and eukaryotes, that recognize promoters and initiate transcription. TBP has been shown to be an essential component of three different transcription initiation complexes: SL1, TFIID and TFIIIB, directing transcription by RNA polymerases I, II and III, respectively. TBP binds directly to the TATA box promoter element, where it nucleates polymerase assembly, thus defining the transcription start site. TBP's binding in the minor groove induces a dramatic DNA bending while its own structure barely changes. The conserved core domain of TBP, which binds to the TATA box, has a bipartite structure, with intramolecular symmetry generating a saddle-shaped structure that sits astride the DNA.
Probab=100.00 E-value=5.5e-61 Score=416.16 Aligned_cols=172 Identities=33% Similarity=0.549 Sum_probs=167.3
Q ss_pred cceEEEEEEEEEeCCccChhhHhhhCCCceecCCCcceEEEEEcCCceEEEeecCCcEEEEecCChhHHHHHHHHHHHHH
Q psy16701 35 RGTNLNMVGTISVGCPLDLNQINSRVRYSEYNPGKFHGLIMKILNPRTTCLAFQSGKLLILGAKHEHDCKLASRKFAKIL 114 (272)
Q Consensus 35 ~~~i~NvV~s~~l~~~ldL~~la~~~~n~eY~P~~f~~l~~R~~~P~~t~lIf~sGKiv~tGakS~~~a~~a~~k~~~~L 114 (272)
.++|+|||||++|+|+|||++|+..++|+||||++|||++||+++|+++++||+||||+||||+|+++++.|+++++++|
T Consensus 1 ~~~I~NvVas~~l~~~ldL~~la~~~~n~eY~P~~fpgli~Rl~~Pk~t~lIF~SGKiv~tGaks~~~a~~a~~~~~~~L 80 (174)
T cd04518 1 SLKIENIVASVDLGQELDLEKVAAELPNAEYNPDQFPGLVYRLEDPKIAALIFRSGKMVCTGAKSVEDLHRAVKEIIKKL 80 (174)
T ss_pred CcEEEEEEEEEEcCCeecHHHHHhhCCCcEECCCcCcEEEEEccCCcEEEEEECCCeEEEEccCCHHHHHHHHHHHHHHH
Confidence 37899999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred HhcCCCc--eeeeEEEEEEEEEEeCCCCChhhHhhcCCCeEEcCCCcceEEEEEcCCeeEEEEecCCeEEEEcccCHHHH
Q psy16701 115 KQLGHPE--IIFFFFRNMVGTISVGCPLDLNQINSRVRYSEYNPGKFHGLIMKILNPRTTCLAFQSGKLLILGAKHEHDC 192 (272)
Q Consensus 115 ~~lg~~~--~~~~~I~NVVas~~l~~~ldL~~la~~~~n~eYePe~fpgli~R~~~p~~t~lIF~SGKivitGakS~e~~ 192 (272)
+++|++. ..+++|+|||||++++++|||+.++..+++++||||+||||+||+.+|+++++||+||||+||||||+||+
T Consensus 81 ~~~g~~~~~~~~~~i~NIVas~~l~~~i~L~~la~~~~~~~YePe~fpglvyR~~~pk~~~lIF~SGKvvitGaks~~~~ 160 (174)
T cd04518 81 KDYGIKVIEKPEIKVQNIVASADLGREVNLDAIAIGLPNAEYEPEQFPGLVYRLDEPKVVLLLFSSGKMVITGAKSEEDA 160 (174)
T ss_pred HhcCCCccCCCceEEEEEEEEEEcCCccCHHHHHhhCCCCccCcccCceEEEEecCCcEEEEEeCCCEEEEEecCCHHHH
Confidence 9999874 47899999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred HHHHHHHHHHHHhC
Q psy16701 193 KLASRKFAKILKQL 206 (272)
Q Consensus 193 ~~A~~ki~~iL~kl 206 (272)
+.|+++++++|+++
T Consensus 161 ~~a~~~i~~~l~~~ 174 (174)
T cd04518 161 KRAVEKLLSRLKEL 174 (174)
T ss_pred HHHHHHHHHHHhhC
Confidence 99999999999874
No 4
>PLN00062 TATA-box-binding protein; Provisional
Probab=100.00 E-value=1e-59 Score=409.83 Aligned_cols=172 Identities=40% Similarity=0.674 Sum_probs=167.4
Q ss_pred ceEEEEEEEEEeCCccChhhHhhhCCCceecCCCcceEEEEEcCCceEEEeecCCcEEEEecCChhHHHHHHHHHHHHHH
Q psy16701 36 GTNLNMVGTISVGCPLDLNQINSRVRYSEYNPGKFHGLIMKILNPRTTCLAFQSGKLLILGAKHEHDCKLASRKFAKILK 115 (272)
Q Consensus 36 ~~i~NvV~s~~l~~~ldL~~la~~~~n~eY~P~~f~~l~~R~~~P~~t~lIf~sGKiv~tGakS~~~a~~a~~k~~~~L~ 115 (272)
++|+|||||++++|+|||++|+..++|+||||++|||++||+++|+++++||+||||+||||+|+++++.|+++++++|+
T Consensus 2 ~~I~NvVas~~l~~~idL~~la~~~~n~eYePe~fpgli~Rl~~Pk~t~lIF~SGKiviTGaks~e~a~~a~~~~~~~L~ 81 (179)
T PLN00062 2 PTLQNIVSTVNLDCKLDLKKIALQARNAEYNPKRFAAVIMRIREPKTTALIFASGKMVCTGAKSEHDSKLAARKYARIIQ 81 (179)
T ss_pred cEEEEEEEEEEcCCcccHHHHHhhCCCCEECCccCcEEEEEeCCCcEEEEEECCCeEEEEecCCHHHHHHHHHHHHHHHH
Confidence 78999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred hcCCC-ceeeeEEEEEEEEEEeCCCCChhhHhhc-CCCeEEcCCCcceEEEEEcCCeeEEEEecCCeEEEEcccCHHHHH
Q psy16701 116 QLGHP-EIIFFFFRNMVGTISVGCPLDLNQINSR-VRYSEYNPGKFHGLIMKILNPRTTCLAFQSGKLLILGAKHEHDCK 193 (272)
Q Consensus 116 ~lg~~-~~~~~~I~NVVas~~l~~~ldL~~la~~-~~n~eYePe~fpgli~R~~~p~~t~lIF~SGKivitGakS~e~~~ 193 (272)
++|++ ...+|+|||||||+|++++|||+.||.. .++++||||+||||+||+.+|+++++||+||||+||||||++|++
T Consensus 82 ~lg~~~~~~~f~v~NIvas~~l~~~i~L~~la~~~~~~~~YePE~fPgliyr~~~pk~~~liF~sGkvvitGaks~~~~~ 161 (179)
T PLN00062 82 KLGFPAKFKDFKIQNIVGSCDVKFPIRLEGLAYAHGAFSSYEPELFPGLIYRMKQPKIVLLIFVSGKIVITGAKVREEIY 161 (179)
T ss_pred HcCCCcCCCccEEEEEEEEEECCCcccHHHHHHhchhhcccCcccCceEEEEeCCCcEEEEEeCCCEEEEEecCCHHHHH
Confidence 99997 3689999999999999999999999964 679999999999999999999999999999999999999999999
Q ss_pred HHHHHHHHHHHhCC
Q psy16701 194 LASRKFAKILKQLG 207 (272)
Q Consensus 194 ~A~~ki~~iL~klg 207 (272)
+|+++++++|.+++
T Consensus 162 ~ai~~i~p~L~~~~ 175 (179)
T PLN00062 162 TAFENIYPVLTEFR 175 (179)
T ss_pred HHHHHHHHHHHHhc
Confidence 99999999999986
No 5
>cd04516 TBP_eukaryotes eukaryotic TATA box binding protein (TBP): Present in archaea and eukaryotes, TBPs are transcription factors that recognize promoters and initiate transcription. TBP has been shown to be an essential component of three different transcription initiation complexes: SL1, TFIID and TFIIIB, directing transcription by RNA polymerases I, II and III, respectively. TBP binds directly to the TATA box promoter element, where it nucleates polymerase assembly, thus defining the transcription start site. TBP's binding in the minor groove induces a dramatic DNA bending while its own structure barely changes. The conserved core domain of TBP, which binds to the TATA box, has a bipartite structure, with intramolecular symmetry generating a saddle-shaped structure that sits astride the DNA.
Probab=100.00 E-value=1.3e-59 Score=407.64 Aligned_cols=171 Identities=40% Similarity=0.706 Sum_probs=165.9
Q ss_pred ceEEEEEEEEEeCCccChhhHhhhCCCceecCCCcceEEEEEcCCceEEEeecCCcEEEEecCChhHHHHHHHHHHHHHH
Q psy16701 36 GTNLNMVGTISVGCPLDLNQINSRVRYSEYNPGKFHGLIMKILNPRTTCLAFQSGKLLILGAKHEHDCKLASRKFAKILK 115 (272)
Q Consensus 36 ~~i~NvV~s~~l~~~ldL~~la~~~~n~eY~P~~f~~l~~R~~~P~~t~lIf~sGKiv~tGakS~~~a~~a~~k~~~~L~ 115 (272)
|+|+|||||++++|+|||++|+..++|+||||++|||++||+++|+++++||+||||+||||+|+|+++.|+++++++|+
T Consensus 2 ~~I~NvVas~~l~~~idL~~la~~~~n~eYePe~fpgli~Rl~~Pk~t~lIF~SGKiviTGaks~e~a~~a~~~i~~~L~ 81 (174)
T cd04516 2 PKIQNIVATVNLGCKLDLKKIALRARNAEYNPKRFAAVIMRIREPKTTALIFSSGKMVCTGAKSEDDSKLAARKYARIIQ 81 (174)
T ss_pred CEEEEEEEEEEcCCeecHHHHHhhCCCCEECCccCcEEEEEeCCCcEEEEEECCCeEEEEecCCHHHHHHHHHHHHHHHH
Confidence 78999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred hcCCC-ceeeeEEEEEEEEEEeCCCCChhhHhhc-CCCeEEcCCCcceEEEEEcCCeeEEEEecCCeEEEEcccCHHHHH
Q psy16701 116 QLGHP-EIIFFFFRNMVGTISVGCPLDLNQINSR-VRYSEYNPGKFHGLIMKILNPRTTCLAFQSGKLLILGAKHEHDCK 193 (272)
Q Consensus 116 ~lg~~-~~~~~~I~NVVas~~l~~~ldL~~la~~-~~n~eYePe~fpgli~R~~~p~~t~lIF~SGKivitGakS~e~~~ 193 (272)
++|++ ...+|+|+|||||+|++++|||++++.. .++++||||+||||+||+.+|+++++||+||||+|||+||++|++
T Consensus 82 ~~g~~~~~~~~~v~Nivat~~l~~~i~L~~la~~~~~~~~YePE~fPgliyr~~~pk~~~liF~sGkvvitGaks~~~~~ 161 (174)
T cd04516 82 KLGFPAKFTDFKIQNIVGSCDVKFPIRLEGLAHAHKQFSSYEPELFPGLIYRMVKPKIVLLIFVSGKIVLTGAKSREEIY 161 (174)
T ss_pred HcCCCCCCCceEEEEEEEEEECCCcccHHHHHHhChhccEeCCccCceEEEEecCCcEEEEEeCCCEEEEEecCCHHHHH
Confidence 99997 4689999999999999999999999974 578999999999999999999999999999999999999999999
Q ss_pred HHHHHHHHHHHhC
Q psy16701 194 LASRKFAKILKQL 206 (272)
Q Consensus 194 ~A~~ki~~iL~kl 206 (272)
+|+++++++|.+.
T Consensus 162 ~a~~~i~p~L~~~ 174 (174)
T cd04516 162 QAFENIYPILLQF 174 (174)
T ss_pred HHHHHHHHHHhhC
Confidence 9999999999763
No 6
>cd00652 TBP_TLF TATA box binding protein (TBP): Present in archaea and eukaryotes, TBPs are transcription factors that recognize promoters and initiate transcription. TBP has been shown to be an essential component of three different transcription initiation complexes: SL1, TFIID and TFIIIB, directing transcription by RNA polymerases I, II and III, respectively. TBP binds directly to the TATA box promoter element, where it nucleates polymerase assembly, thus defining the transcription start site. TBP's binding in the minor groove induces a dramatic DNA bending while its own structure barely changes. The conserved core domain of TBP, which binds to the TATA box, has a bipartite structure, with intramolecular symmetry generating a saddle-shaped structure that sits astride the DNA. New members of the TBP family, called TBP-like proteins (TBLP, TLF, TLP) or TBP-related factors (TRF1, TRF2,TRP), are similar to the core domain of TBPs, with identical or chemically similar amino acids at many
Probab=100.00 E-value=8.7e-59 Score=402.39 Aligned_cols=171 Identities=42% Similarity=0.701 Sum_probs=166.1
Q ss_pred cceEEEEEEEEEeCCccChhhHhhhCCCceecCCCcceEEEEEcCCceEEEeecCCcEEEEecCChhHHHHHHHHHHHHH
Q psy16701 35 RGTNLNMVGTISVGCPLDLNQINSRVRYSEYNPGKFHGLIMKILNPRTTCLAFQSGKLLILGAKHEHDCKLASRKFAKIL 114 (272)
Q Consensus 35 ~~~i~NvV~s~~l~~~ldL~~la~~~~n~eY~P~~f~~l~~R~~~P~~t~lIf~sGKiv~tGakS~~~a~~a~~k~~~~L 114 (272)
.++|+||||+++++|+|||++|+..++|+||||++|||++||+++|+++++||+||||+||||+|+++++.|+++++++|
T Consensus 1 ~~~i~NvVas~~l~~~idL~~la~~~~n~~YePe~fpgli~R~~~P~~t~lIf~sGKivitGaks~~~~~~a~~~~~~~L 80 (174)
T cd00652 1 SPKIQNIVATVNLGCELDLRKIALAARNAEYNPKRFPGVIMRLREPKTTALIFSSGKMVITGAKSEEDAKLAARKYARIL 80 (174)
T ss_pred CcEEEEEEEEEEcCCccCHHHHHhhCCCcEECCCccceEEEEcCCCcEEEEEECCCEEEEEecCCHHHHHHHHHHHHHHH
Confidence 47899999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred HhcCCCc--eeeeEEEEEEEEEEeCCCCChhhHhhcCC-CeEEcCCCcceEEEEEcCCeeEEEEecCCeEEEEcccCHHH
Q psy16701 115 KQLGHPE--IIFFFFRNMVGTISVGCPLDLNQINSRVR-YSEYNPGKFHGLIMKILNPRTTCLAFQSGKLLILGAKHEHD 191 (272)
Q Consensus 115 ~~lg~~~--~~~~~I~NVVas~~l~~~ldL~~la~~~~-n~eYePe~fpgli~R~~~p~~t~lIF~SGKivitGakS~e~ 191 (272)
+++|++. ..+++|+|||||++++++|||++||...+ +++||||+||||+||+.+|++|++||+||||+||||||++|
T Consensus 81 ~~~g~~~~~~~~~~v~NIvas~~l~~~i~L~~la~~~~~~~~YePe~fpgli~r~~~pk~t~lIF~sGkvvitGaks~~~ 160 (174)
T cd00652 81 QKLGFPVEKFPEFKVQNIVASCDLGFPIRLEELALKHPENASYEPELFPGLIYRMDEPKVVLLIFVSGKIVITGAKSRED 160 (174)
T ss_pred HHcCCCccccCceEEEEEEEEEECCCcccHHHHHhhhhcccEECCccCceEEEEecCCcEEEEEEcCCEEEEEecCCHHH
Confidence 9999874 57999999999999999999999999875 99999999999999999999999999999999999999999
Q ss_pred HHHHHHHHHHHHHh
Q psy16701 192 CKLASRKFAKILKQ 205 (272)
Q Consensus 192 ~~~A~~ki~~iL~k 205 (272)
++.|+++++++|.+
T Consensus 161 ~~~a~~~i~~~L~~ 174 (174)
T cd00652 161 IYEAVEKIYPILKE 174 (174)
T ss_pred HHHHHHHHHHHHhC
Confidence 99999999999964
No 7
>cd04517 TLF TBP-like factors (TLF; also called TLP, TRF, TRP), which are found in most metazoans. TLFs and TBPs have well-conserved core domains; however, they only share about 60% similarity. TLFs, like TBPs, interact with TFIIA and TFIIB, which are part of the basal transcription machinery. Yet, in contrast to TBPs, TLFs seem not to interact with the TATA-box and even have a negative effect on the transcription of TATA-containing promoters. Recent results indicate that TLFs are involved in the transcription via TATA-less promoters.
Probab=100.00 E-value=2e-57 Score=393.95 Aligned_cols=170 Identities=29% Similarity=0.505 Sum_probs=164.8
Q ss_pred cceEEEEEEEEEeCCccChhhHhhhCCCceecCCCcceEEEEEcCCceEEEeecCCcEEEEecCChhHHHHHHHHHHHHH
Q psy16701 35 RGTNLNMVGTISVGCPLDLNQINSRVRYSEYNPGKFHGLIMKILNPRTTCLAFQSGKLLILGAKHEHDCKLASRKFAKIL 114 (272)
Q Consensus 35 ~~~i~NvV~s~~l~~~ldL~~la~~~~n~eY~P~~f~~l~~R~~~P~~t~lIf~sGKiv~tGakS~~~a~~a~~k~~~~L 114 (272)
+++|+|||||++|+|+|||++++..++|+|||| +|||++||+++|+++++||+||||+|||++|+++++.|+++++++|
T Consensus 2 ~~~i~Nvvas~~l~~~idL~~la~~l~n~eYeP-~fpgli~R~~~Pk~t~lIF~sGKiviTGaks~~~~~~a~~~~~~~l 80 (174)
T cd04517 2 DILIVNVVCQFSLRCHIDLRKLALAGRNVEYNP-RYPKVTMRLREPRATASVWSSGKITITGATSEEEAKQAARRAARLL 80 (174)
T ss_pred ccEEEEEEEEEEcCCcccHHHHHhhCCCCEEeC-CCCEEEEEecCCcEEEEEECCCeEEEEccCCHHHHHHHHHHHHHHH
Confidence 689999999999999999999999999999999 9999999999999999999999999999999999999999999999
Q ss_pred HhcCCCc--eeeeEEEEEEEEEEeCCCCChhhHhhc-CCCeEEcCCCcceEEEEEcCCeeEEEEecCCeEEEEcccCHHH
Q psy16701 115 KQLGHPE--IIFFFFRNMVGTISVGCPLDLNQINSR-VRYSEYNPGKFHGLIMKILNPRTTCLAFQSGKLLILGAKHEHD 191 (272)
Q Consensus 115 ~~lg~~~--~~~~~I~NVVas~~l~~~ldL~~la~~-~~n~eYePe~fpgli~R~~~p~~t~lIF~SGKivitGakS~e~ 191 (272)
+++|++. ..+|+|+|||||++++++|||++|+.. .++++||||+||||+||+.+|++|++||+||||+|||+||++|
T Consensus 81 ~~~g~~~~~~~~f~v~nIvat~~~~~~i~L~~la~~~~~~~~YePE~fPgliyr~~~p~~t~lIF~sGkivitGaks~~~ 160 (174)
T cd04517 81 QKLGFKVVRFSNFRVVNVLATCSMPFPIRLDELAAKNRSSASYEPELHPGVVYRITGPRATLSIFSTGSVTVTGARSMED 160 (174)
T ss_pred HHcCCCcccCCceEEEEEEEEEeCCCcccHHHHHHhchhhcEeCCccCCEEEEEECCCcEEEEEeCCCEEEEEecCCHHH
Confidence 9999874 689999999999999999999999975 5799999999999999999999999999999999999999999
Q ss_pred HHHHHHHHHHHHHh
Q psy16701 192 CKLASRKFAKILKQ 205 (272)
Q Consensus 192 ~~~A~~ki~~iL~k 205 (272)
+++|+++++++|.+
T Consensus 161 ~~~a~~~i~pil~~ 174 (174)
T cd04517 161 VREAVEKIYPIVFE 174 (174)
T ss_pred HHHHHHHHHHHHhC
Confidence 99999999999864
No 8
>KOG3302|consensus
Probab=100.00 E-value=1.5e-57 Score=395.29 Aligned_cols=176 Identities=38% Similarity=0.636 Sum_probs=170.0
Q ss_pred ccCCcceEEEEEEEEEeCCccChhhHhhhCCCceecCCCcceEEEEEcCCceEEEeecCCcEEEEecCChhHHHHHHHHH
Q psy16701 31 QKNRRGTNLNMVGTISVGCPLDLNQINSRVRYSEYNPGKFHGLIMKILNPRTTCLAFQSGKLLILGAKHEHDCKLASRKF 110 (272)
Q Consensus 31 ~~~~~~~i~NvV~s~~l~~~ldL~~la~~~~n~eY~P~~f~~l~~R~~~P~~t~lIf~sGKiv~tGakS~~~a~~a~~k~ 110 (272)
|..=+++++||||+++++|+|||.+|+..++|+||||++|++++||+++|++|++||+||||+||||+|+++|+.|++|+
T Consensus 18 ~~~i~~~l~nivc~~~~~c~ldLk~ial~~~N~ey~Pk~~~aVimrir~P~~ta~I~ssGKi~ctgA~se~~ar~aark~ 97 (200)
T KOG3302|consen 18 DSGLDPTLQNIVCTVNLNCKLDLKEIALHARNAEYNPKRFAAVIMRIRSPRTTALIFSSGKIVCTGAKSEDSARLAARKY 97 (200)
T ss_pred ccccceEEEeEEEEEeccceecHHHHhhhccccccCcccccEEEEEEcCCceEEEEecCCcEEEeccCCHHHHHHHHHHH
Confidence 44557899999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred HHHHHhcCCC-ceeeeEEEEEEEEEEeCCCCChhhHhhcC-CCeEEcCCCcceEEEEEcCCeeEEEEecCCeEEEEcccC
Q psy16701 111 AKILKQLGHP-EIIFFFFRNMVGTISVGCPLDLNQINSRV-RYSEYNPGKFHGLIMKILNPRTTCLAFQSGKLLILGAKH 188 (272)
Q Consensus 111 ~~~L~~lg~~-~~~~~~I~NVVas~~l~~~ldL~~la~~~-~n~eYePe~fpgli~R~~~p~~t~lIF~SGKivitGakS 188 (272)
+|+||+|||+ .+.+|+++||+||||++|+|+|+.++... ..+.||||+||||+|||.+|+++++||+||||++|||++
T Consensus 98 aRilqkLgf~~~f~~fki~nv~asc~vpF~IrLe~~~~~h~~~ssYepel~PgliYrm~~pkv~l~IF~tG~VvvtgA~~ 177 (200)
T KOG3302|consen 98 ARILQKLGFPVKFRDFKINNVVASCDVPFPIRLEGLALRHPVFSSYEPELFPGLIYRMVKPKVVLLIFVTGKVVVTGAKV 177 (200)
T ss_pred HHHHHHcCCCceehheeeEEEEEEEeccceeehhHhhhhCCcccccCcccCceeEEEecCCcEEEEEecCCEEEEEeccc
Confidence 9999999998 48999999999999999999999999864 689999999999999999999999999999999999999
Q ss_pred HHHHHHHHHHHHHHHHhC
Q psy16701 189 EHDCKLASRKFAKILKQL 206 (272)
Q Consensus 189 ~e~~~~A~~ki~~iL~kl 206 (272)
.+++++|+++|+|+|.++
T Consensus 178 ~~~i~~Ai~~IyPil~~f 195 (200)
T KOG3302|consen 178 REETYEAIENIYPILLEF 195 (200)
T ss_pred HHHHHHHHHHHhHHHHHh
Confidence 999999999999999987
No 9
>cd04516 TBP_eukaryotes eukaryotic TATA box binding protein (TBP): Present in archaea and eukaryotes, TBPs are transcription factors that recognize promoters and initiate transcription. TBP has been shown to be an essential component of three different transcription initiation complexes: SL1, TFIID and TFIIIB, directing transcription by RNA polymerases I, II and III, respectively. TBP binds directly to the TATA box promoter element, where it nucleates polymerase assembly, thus defining the transcription start site. TBP's binding in the minor groove induces a dramatic DNA bending while its own structure barely changes. The conserved core domain of TBP, which binds to the TATA box, has a bipartite structure, with intramolecular symmetry generating a saddle-shaped structure that sits astride the DNA.
Probab=100.00 E-value=1.1e-57 Score=395.53 Aligned_cols=148 Identities=58% Similarity=1.052 Sum_probs=145.9
Q ss_pred eEEEEEEEEEEeCCCCChhhHhhcCCCeEEcCCCcceEEEEEcCCeeEEEEecCCeEEEEcccCHHHHHHHHHHHHHHHH
Q psy16701 125 FFFRNMVGTISVGCPLDLNQINSRVRYSEYNPGKFHGLIMKILNPRTTCLAFQSGKLLILGAKHEHDCKLASRKFAKILK 204 (272)
Q Consensus 125 ~~I~NVVas~~l~~~ldL~~la~~~~n~eYePe~fpgli~R~~~p~~t~lIF~SGKivitGakS~e~~~~A~~ki~~iL~ 204 (272)
++|+|||||++++++|||++|+..++|++||||+||||+||+++|+++++||+||||+||||||+|+++.|+++++++|+
T Consensus 2 ~~I~NvVas~~l~~~idL~~la~~~~n~eYePe~fpgli~Rl~~Pk~t~lIF~SGKiviTGaks~e~a~~a~~~i~~~L~ 81 (174)
T cd04516 2 PKIQNIVATVNLGCKLDLKKIALRARNAEYNPKRFAAVIMRIREPKTTALIFSSGKMVCTGAKSEDDSKLAARKYARIIQ 81 (174)
T ss_pred CEEEEEEEEEEcCCeecHHHHHhhCCCCEECCccCcEEEEEeCCCcEEEEEECCCeEEEEecCCHHHHHHHHHHHHHHHH
Confidence 68999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred hCCCCCCCcceEEEEEEEEeecCCccChHHHHhHcCCCceeccccCceeEEEEcCCcEEEEEeecCcC
Q psy16701 205 QLGHPIKYQGFKIHNIVCTCDVRFPVKLDALHHVHSQFSSYEPELFPGLIYRMVKPRVVLLIFVNGRI 272 (272)
Q Consensus 205 klg~~~~~~~f~V~NIvas~d~~f~I~Le~la~~~~~~~~YEPE~fPGLvyR~~~pk~~~lIF~SGkv 272 (272)
++|+++++++|+|||||||+||+|+||||+||..|+++++|||||||||+|||.+|++++|||+||||
T Consensus 82 ~~g~~~~~~~~~v~Nivat~~l~~~i~L~~la~~~~~~~~YePE~fPgliyr~~~pk~~~liF~sGkv 149 (174)
T cd04516 82 KLGFPAKFTDFKIQNIVGSCDVKFPIRLEGLAHAHKQFSSYEPELFPGLIYRMVKPKIVLLIFVSGKI 149 (174)
T ss_pred HcCCCCCCCceEEEEEEEEEECCCcccHHHHHHhChhccEeCCccCceEEEEecCCcEEEEEeCCCEE
Confidence 99999999999999999999999999999999988899999999999999999999999999999996
No 10
>COG2101 SPT15 TATA-box binding protein (TBP), component of TFIID and TFIIIB [Transcription]
Probab=100.00 E-value=7e-58 Score=391.64 Aligned_cols=151 Identities=38% Similarity=0.651 Sum_probs=146.5
Q ss_pred eeeeEEEEEEEEEEeCCCCChhhHhhcCCCeEEcCCCcceEEEEEcCCeeEEEEecCCeEEEEcccCHHHHHHHHHHHHH
Q psy16701 122 IIFFFFRNMVGTISVGCPLDLNQINSRVRYSEYNPGKFHGLIMKILNPRTTCLAFQSGKLLILGAKHEHDCKLASRKFAK 201 (272)
Q Consensus 122 ~~~~~I~NVVas~~l~~~ldL~~la~~~~n~eYePe~fpgli~R~~~p~~t~lIF~SGKivitGakS~e~~~~A~~ki~~ 201 (272)
...++|+|||||.+++++|||++++..+++++|||++||||+||+++|++++|||+|||+|||||||.++++.|++++++
T Consensus 5 ~~~i~IeNIVAS~~L~~elDL~~~~~~l~~aeYnP~qFpGlv~Rl~ePk~a~LIF~SGK~VcTGaKs~ed~~~av~~~~~ 84 (185)
T COG2101 5 EPTITIENIVASVDLGQELDLEEVALDLPGAEYNPEQFPGLVYRLEEPKTAALIFRSGKVVCTGAKSVEDVHRAVKKLAK 84 (185)
T ss_pred CCccEEEEEEEEechhhhccHHHHHhhCCCCccCHhHCCeeEEEecCCcceEEEEecCcEEEeccCcHHHHHHHHHHHHH
Confidence 46789999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred HHHhCCCCCCC-cceEEEEEEEEeecCCccChHHHHhHcC-CCceeccccCceeEEEEcCCcEEEEEeecCcC
Q psy16701 202 ILKQLGHPIKY-QGFKIHNIVCTCDVRFPVKLDALHHVHS-QFSSYEPELFPGLIYRMVKPRVVLLIFVNGRI 272 (272)
Q Consensus 202 iL~klg~~~~~-~~f~V~NIvas~d~~f~I~Le~la~~~~-~~~~YEPE~fPGLvyR~~~pk~~~lIF~SGkv 272 (272)
+|+++|+++.+ ++|+|||||||+|+++++||+++|..++ ++++|||||||||||||++|++|+|||+|||+
T Consensus 85 ~L~~~g~~~~~~p~i~iQNIVaSadL~~~lnL~~iA~~lg~e~~eYEPEqFPGLVYRl~~P~VV~LiF~SGK~ 157 (185)
T COG2101 85 KLKDGGIDIDFEPEIKVQNIVASADLGVELNLNAIAIGLGLENIEYEPEQFPGLVYRLDEPRVVLLLFGSGKL 157 (185)
T ss_pred HHHhcCcCcCCCCceEEEEEEEEeccCccccHHHHHHhccccccccccccCCeeEEEcCCCCEEEEEecCCcE
Confidence 99999999887 8999999999999999999999999776 67999999999999999999999999999996
No 11
>PLN00062 TATA-box-binding protein; Provisional
Probab=100.00 E-value=2.6e-57 Score=394.68 Aligned_cols=148 Identities=55% Similarity=1.010 Sum_probs=145.9
Q ss_pred eEEEEEEEEEEeCCCCChhhHhhcCCCeEEcCCCcceEEEEEcCCeeEEEEecCCeEEEEcccCHHHHHHHHHHHHHHHH
Q psy16701 125 FFFRNMVGTISVGCPLDLNQINSRVRYSEYNPGKFHGLIMKILNPRTTCLAFQSGKLLILGAKHEHDCKLASRKFAKILK 204 (272)
Q Consensus 125 ~~I~NVVas~~l~~~ldL~~la~~~~n~eYePe~fpgli~R~~~p~~t~lIF~SGKivitGakS~e~~~~A~~ki~~iL~ 204 (272)
++|+|||||++++++|||++|+..++|++||||+|||++||+++|+++++||+||||+||||||+++++.|+++++++|+
T Consensus 2 ~~I~NvVas~~l~~~idL~~la~~~~n~eYePe~fpgli~Rl~~Pk~t~lIF~SGKiviTGaks~e~a~~a~~~~~~~L~ 81 (179)
T PLN00062 2 PTLQNIVSTVNLDCKLDLKKIALQARNAEYNPKRFAAVIMRIREPKTTALIFASGKMVCTGAKSEHDSKLAARKYARIIQ 81 (179)
T ss_pred cEEEEEEEEEEcCCcccHHHHHhhCCCCEECCccCcEEEEEeCCCcEEEEEECCCeEEEEecCCHHHHHHHHHHHHHHHH
Confidence 68999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred hCCCCCCCcceEEEEEEEEeecCCccChHHHHhHcCCCceeccccCceeEEEEcCCcEEEEEeecCcC
Q psy16701 205 QLGHPIKYQGFKIHNIVCTCDVRFPVKLDALHHVHSQFSSYEPELFPGLIYRMVKPRVVLLIFVNGRI 272 (272)
Q Consensus 205 klg~~~~~~~f~V~NIvas~d~~f~I~Le~la~~~~~~~~YEPE~fPGLvyR~~~pk~~~lIF~SGkv 272 (272)
++|+++++++|+|||||||+|++|+|||+.||..|+++++||||+||||+|||.+|++++|||+||||
T Consensus 82 ~lg~~~~~~~f~v~NIvas~~l~~~i~L~~la~~~~~~~~YePE~fPgliyr~~~pk~~~liF~sGkv 149 (179)
T PLN00062 82 KLGFPAKFKDFKIQNIVGSCDVKFPIRLEGLAYAHGAFSSYEPELFPGLIYRMKQPKIVLLIFVSGKI 149 (179)
T ss_pred HcCCCcCCCccEEEEEEEEEECCCcccHHHHHHhchhhcccCcccCceEEEEeCCCcEEEEEeCCCEE
Confidence 99999999999999999999999999999999988999999999999999999999999999999996
No 12
>PRK00394 transcription factor; Reviewed
Probab=100.00 E-value=1.8e-54 Score=376.93 Aligned_cols=148 Identities=35% Similarity=0.630 Sum_probs=143.3
Q ss_pred eEEEEEEEEEEeCCCCChhhHhhcCCCeEEcCCCcceEEEEEcCCeeEEEEecCCeEEEEcccCHHHHHHHHHHHHHHHH
Q psy16701 125 FFFRNMVGTISVGCPLDLNQINSRVRYSEYNPGKFHGLIMKILNPRTTCLAFQSGKLLILGAKHEHDCKLASRKFAKILK 204 (272)
Q Consensus 125 ~~I~NVVas~~l~~~ldL~~la~~~~n~eYePe~fpgli~R~~~p~~t~lIF~SGKivitGakS~e~~~~A~~ki~~iL~ 204 (272)
++|+|||||++++++|||++|+..++|++||||+|||++||+++|+++++||+||||+||||+|+++++.|+++++++|+
T Consensus 1 i~i~NvVas~~l~~~ldL~~la~~~~n~eYePe~fpgli~Rl~~Pk~t~lIf~sGKiv~tGa~S~~~a~~a~~~~~~~l~ 80 (179)
T PRK00394 1 IKIENIVASTDLGQELDLEKVAEDLPNAEYNPEQFPGLVYRLEDPKIAALIFRSGKVVCTGAKSVEDLHEAVKIIIKKLK 80 (179)
T ss_pred CEEEEEEEEEEcCCCcCHHHHHhhCCCceeCcccCceEEEEecCCceEEEEEcCCcEEEEccCCHHHHHHHHHHHHHHHH
Confidence 47999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred hCCCCCC-CcceEEEEEEEEeecCCccChHHHHhHc-CCCceeccccCceeEEEEcCCcEEEEEeecCcC
Q psy16701 205 QLGHPIK-YQGFKIHNIVCTCDVRFPVKLDALHHVH-SQFSSYEPELFPGLIYRMVKPRVVLLIFVNGRI 272 (272)
Q Consensus 205 klg~~~~-~~~f~V~NIvas~d~~f~I~Le~la~~~-~~~~~YEPE~fPGLvyR~~~pk~~~lIF~SGkv 272 (272)
++|+++. .++|+|||||||+|++++|||+.+|..+ .++++||||+||||+||+.+|++++|||+||||
T Consensus 81 ~~g~~~~~~~~~~i~NiVas~~l~~~i~L~~la~~~~~~~~~YePe~fPglvyR~~~pk~~~lIF~SGKv 150 (179)
T PRK00394 81 ELGIKVIDEPEIKVQNIVASADLGVELNLNAIAIGLGLENIEYEPEQFPGLVYRLDDPKVVVLLFGSGKL 150 (179)
T ss_pred HcCCCccCCCceEEEEEEEEEEcCCeEcHHHHHHhcCcCCcEECcccCceEEEEecCCcEEEEEEcCCEE
Confidence 9999986 5899999999999999999999999865 488999999999999999999999999999996
No 13
>cd04518 TBP_archaea archaeal TATA box binding protein (TBP): TBPs are transcription factors present in archaea and eukaryotes, that recognize promoters and initiate transcription. TBP has been shown to be an essential component of three different transcription initiation complexes: SL1, TFIID and TFIIIB, directing transcription by RNA polymerases I, II and III, respectively. TBP binds directly to the TATA box promoter element, where it nucleates polymerase assembly, thus defining the transcription start site. TBP's binding in the minor groove induces a dramatic DNA bending while its own structure barely changes. The conserved core domain of TBP, which binds to the TATA box, has a bipartite structure, with intramolecular symmetry generating a saddle-shaped structure that sits astride the DNA.
Probab=100.00 E-value=2e-54 Score=375.11 Aligned_cols=148 Identities=36% Similarity=0.661 Sum_probs=143.0
Q ss_pred eeEEEEEEEEEEeCCCCChhhHhhcCCCeEEcCCCcceEEEEEcCCeeEEEEecCCeEEEEcccCHHHHHHHHHHHHHHH
Q psy16701 124 FFFFRNMVGTISVGCPLDLNQINSRVRYSEYNPGKFHGLIMKILNPRTTCLAFQSGKLLILGAKHEHDCKLASRKFAKIL 203 (272)
Q Consensus 124 ~~~I~NVVas~~l~~~ldL~~la~~~~n~eYePe~fpgli~R~~~p~~t~lIF~SGKivitGakS~e~~~~A~~ki~~iL 203 (272)
.++|+|||||++++++|||++|+..++|++||||+|||++||+++|+++++||+||||+||||||+++++.|+++++++|
T Consensus 1 ~~~I~NvVas~~l~~~ldL~~la~~~~n~eY~P~~fpgli~Rl~~Pk~t~lIF~SGKiv~tGaks~~~a~~a~~~~~~~L 80 (174)
T cd04518 1 SLKIENIVASVDLGQELDLEKVAAELPNAEYNPDQFPGLVYRLEDPKIAALIFRSGKMVCTGAKSVEDLHRAVKEIIKKL 80 (174)
T ss_pred CcEEEEEEEEEEcCCeecHHHHHhhCCCcEECCCcCcEEEEEccCCcEEEEEECCCeEEEEccCCHHHHHHHHHHHHHHH
Confidence 36899999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred HhCCCCCCC-cceEEEEEEEEeecCCccChHHHHhHcCCCceeccccCceeEEEEcCCcEEEEEeecCcC
Q psy16701 204 KQLGHPIKY-QGFKIHNIVCTCDVRFPVKLDALHHVHSQFSSYEPELFPGLIYRMVKPRVVLLIFVNGRI 272 (272)
Q Consensus 204 ~klg~~~~~-~~f~V~NIvas~d~~f~I~Le~la~~~~~~~~YEPE~fPGLvyR~~~pk~~~lIF~SGkv 272 (272)
+++|+++.. .+|+|||||||+|++++|||+.+|..++ +++||||+||||+||+.+|++++|||+||||
T Consensus 81 ~~~g~~~~~~~~~~i~NIVas~~l~~~i~L~~la~~~~-~~~YePe~fpglvyR~~~pk~~~lIF~SGKv 149 (174)
T cd04518 81 KDYGIKVIEKPEIKVQNIVASADLGREVNLDAIAIGLP-NAEYEPEQFPGLVYRLDEPKVVLLLFSSGKM 149 (174)
T ss_pred HhcCCCccCCCceEEEEEEEEEEcCCccCHHHHHhhCC-CCccCcccCceEEEEecCCcEEEEEeCCCEE
Confidence 999998754 8999999999999999999999998775 8999999999999999999999999999996
No 14
>cd00652 TBP_TLF TATA box binding protein (TBP): Present in archaea and eukaryotes, TBPs are transcription factors that recognize promoters and initiate transcription. TBP has been shown to be an essential component of three different transcription initiation complexes: SL1, TFIID and TFIIIB, directing transcription by RNA polymerases I, II and III, respectively. TBP binds directly to the TATA box promoter element, where it nucleates polymerase assembly, thus defining the transcription start site. TBP's binding in the minor groove induces a dramatic DNA bending while its own structure barely changes. The conserved core domain of TBP, which binds to the TATA box, has a bipartite structure, with intramolecular symmetry generating a saddle-shaped structure that sits astride the DNA. New members of the TBP family, called TBP-like proteins (TBLP, TLF, TLP) or TBP-related factors (TRF1, TRF2,TRP), are similar to the core domain of TBPs, with identical or chemically similar amino acids at many
Probab=100.00 E-value=2.2e-54 Score=374.86 Aligned_cols=149 Identities=54% Similarity=0.982 Sum_probs=145.5
Q ss_pred eeEEEEEEEEEEeCCCCChhhHhhcCCCeEEcCCCcceEEEEEcCCeeEEEEecCCeEEEEcccCHHHHHHHHHHHHHHH
Q psy16701 124 FFFFRNMVGTISVGCPLDLNQINSRVRYSEYNPGKFHGLIMKILNPRTTCLAFQSGKLLILGAKHEHDCKLASRKFAKIL 203 (272)
Q Consensus 124 ~~~I~NVVas~~l~~~ldL~~la~~~~n~eYePe~fpgli~R~~~p~~t~lIF~SGKivitGakS~e~~~~A~~ki~~iL 203 (272)
+++|+||||+++++++|||++|+..++|++||||+|||++||+.+|+++++||+||||+||||+|+++++.|+++++++|
T Consensus 1 ~~~i~NvVas~~l~~~idL~~la~~~~n~~YePe~fpgli~R~~~P~~t~lIf~sGKivitGaks~~~~~~a~~~~~~~L 80 (174)
T cd00652 1 SPKIQNIVATVNLGCELDLRKIALAARNAEYNPKRFPGVIMRLREPKTTALIFSSGKMVITGAKSEEDAKLAARKYARIL 80 (174)
T ss_pred CcEEEEEEEEEEcCCccCHHHHHhhCCCcEECCCccceEEEEcCCCcEEEEEECCCEEEEEecCCHHHHHHHHHHHHHHH
Confidence 36899999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred HhCCCCC-CCcceEEEEEEEEeecCCccChHHHHhHcCCCceeccccCceeEEEEcCCcEEEEEeecCcC
Q psy16701 204 KQLGHPI-KYQGFKIHNIVCTCDVRFPVKLDALHHVHSQFSSYEPELFPGLIYRMVKPRVVLLIFVNGRI 272 (272)
Q Consensus 204 ~klg~~~-~~~~f~V~NIvas~d~~f~I~Le~la~~~~~~~~YEPE~fPGLvyR~~~pk~~~lIF~SGkv 272 (272)
+++|+++ ++.+|+|||||||+|++|+|||++||..++++++||||+||||+||+.+|++++|||+||||
T Consensus 81 ~~~g~~~~~~~~~~v~NIvas~~l~~~i~L~~la~~~~~~~~YePe~fpgli~r~~~pk~t~lIF~sGkv 150 (174)
T cd00652 81 QKLGFPVEKFPEFKVQNIVASCDLGFPIRLEELALKHPENASYEPELFPGLIYRMDEPKVVLLIFVSGKI 150 (174)
T ss_pred HHcCCCccccCceEEEEEEEEEECCCcccHHHHHhhhhcccEECCccCceEEEEecCCcEEEEEEcCCEE
Confidence 9999987 78999999999999999999999999988889999999999999999999999999999996
No 15
>KOG3302|consensus
Probab=100.00 E-value=1.8e-54 Score=376.00 Aligned_cols=152 Identities=53% Similarity=0.960 Sum_probs=149.7
Q ss_pred ceeeeEEEEEEEEEEeCCCCChhhHhhcCCCeEEcCCCcceEEEEEcCCeeEEEEecCCeEEEEcccCHHHHHHHHHHHH
Q psy16701 121 EIIFFFFRNMVGTISVGCPLDLNQINSRVRYSEYNPGKFHGLIMKILNPRTTCLAFQSGKLLILGAKHEHDCKLASRKFA 200 (272)
Q Consensus 121 ~~~~~~I~NVVas~~l~~~ldL~~la~~~~n~eYePe~fpgli~R~~~p~~t~lIF~SGKivitGakS~e~~~~A~~ki~ 200 (272)
...++.+|||||+++++|+|||.+||.+.+|++|||++|+++++|+++|++|++||+||||+||||+|+++++.|+++++
T Consensus 19 ~~i~~~l~nivc~~~~~c~ldLk~ial~~~N~ey~Pk~~~aVimrir~P~~ta~I~ssGKi~ctgA~se~~ar~aark~a 98 (200)
T KOG3302|consen 19 SGLDPTLQNIVCTVNLNCKLDLKEIALHARNAEYNPKRFAAVIMRIRSPRTTALIFSSGKIVCTGAKSEDSARLAARKYA 98 (200)
T ss_pred cccceEEEeEEEEEeccceecHHHHhhhccccccCcccccEEEEEEcCCceEEEEecCCcEEEeccCCHHHHHHHHHHHH
Confidence 45789999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred HHHHhCCCCCCCcceEEEEEEEEeecCCccChHHHHhHcCCCceeccccCceeEEEEcCCcEEEEEeecCcC
Q psy16701 201 KILKQLGHPIKYQGFKIHNIVCTCDVRFPVKLDALHHVHSQFSSYEPELFPGLIYRMVKPRVVLLIFVNGRI 272 (272)
Q Consensus 201 ~iL~klg~~~~~~~f~V~NIvas~d~~f~I~Le~la~~~~~~~~YEPE~fPGLvyR~~~pk~~~lIF~SGkv 272 (272)
++||++||++++.||++|||||||||||+|+||+++..|+.+++||||+||||+|||.+|+++|+||+||||
T Consensus 99 RilqkLgf~~~f~~fki~nv~asc~vpF~IrLe~~~~~h~~~ssYepel~PgliYrm~~pkv~l~IF~tG~V 170 (200)
T KOG3302|consen 99 RILQKLGFPVKFRDFKINNVVASCDVPFPIRLEGLALRHPVFSSYEPELFPGLIYRMVKPKVVLLIFVTGKV 170 (200)
T ss_pred HHHHHcCCCceehheeeEEEEEEEeccceeehhHhhhhCCcccccCcccCceeEEEecCCcEEEEEecCCEE
Confidence 999999999999999999999999999999999999999999999999999999999999999999999996
No 16
>cd04517 TLF TBP-like factors (TLF; also called TLP, TRF, TRP), which are found in most metazoans. TLFs and TBPs have well-conserved core domains; however, they only share about 60% similarity. TLFs, like TBPs, interact with TFIIA and TFIIB, which are part of the basal transcription machinery. Yet, in contrast to TBPs, TLFs seem not to interact with the TATA-box and even have a negative effect on the transcription of TATA-containing promoters. Recent results indicate that TLFs are involved in the transcription via TATA-less promoters.
Probab=100.00 E-value=5.7e-54 Score=372.34 Aligned_cols=148 Identities=36% Similarity=0.734 Sum_probs=144.9
Q ss_pred eeEEEEEEEEEEeCCCCChhhHhhcCCCeEEcCCCcceEEEEEcCCeeEEEEecCCeEEEEcccCHHHHHHHHHHHHHHH
Q psy16701 124 FFFFRNMVGTISVGCPLDLNQINSRVRYSEYNPGKFHGLIMKILNPRTTCLAFQSGKLLILGAKHEHDCKLASRKFAKIL 203 (272)
Q Consensus 124 ~~~I~NVVas~~l~~~ldL~~la~~~~n~eYePe~fpgli~R~~~p~~t~lIF~SGKivitGakS~e~~~~A~~ki~~iL 203 (272)
+++|+|||||++++++|||++++..++|++||| +||||+||+++|+++++||+|||++|||+||+++++.|+++++++|
T Consensus 2 ~~~i~Nvvas~~l~~~idL~~la~~l~n~eYeP-~fpgli~R~~~Pk~t~lIF~sGKiviTGaks~~~~~~a~~~~~~~l 80 (174)
T cd04517 2 DILIVNVVCQFSLRCHIDLRKLALAGRNVEYNP-RYPKVTMRLREPRATASVWSSGKITITGATSEEEAKQAARRAARLL 80 (174)
T ss_pred ccEEEEEEEEEEcCCcccHHHHHhhCCCCEEeC-CCCEEEEEecCCcEEEEEECCCeEEEEccCCHHHHHHHHHHHHHHH
Confidence 589999999999999999999999999999999 9999999999999999999999999999999999999999999999
Q ss_pred HhCCCCC-CCcceEEEEEEEEeecCCccChHHHHhHcCCCceeccccCceeEEEEcCCcEEEEEeecCcC
Q psy16701 204 KQLGHPI-KYQGFKIHNIVCTCDVRFPVKLDALHHVHSQFSSYEPELFPGLIYRMVKPRVVLLIFVNGRI 272 (272)
Q Consensus 204 ~klg~~~-~~~~f~V~NIvas~d~~f~I~Le~la~~~~~~~~YEPE~fPGLvyR~~~pk~~~lIF~SGkv 272 (272)
+++|++. ++++|+|||||||+|+||+|||++||..|.++++||||+||||+||+.+|+++++||+||||
T Consensus 81 ~~~g~~~~~~~~f~v~nIvat~~~~~~i~L~~la~~~~~~~~YePE~fPgliyr~~~p~~t~lIF~sGki 150 (174)
T cd04517 81 QKLGFKVVRFSNFRVVNVLATCSMPFPIRLDELAAKNRSSASYEPELHPGVVYRITGPRATLSIFSTGSV 150 (174)
T ss_pred HHcCCCcccCCceEEEEEEEEEeCCCcccHHHHHHhchhhcEeCCccCCEEEEEECCCcEEEEEeCCCEE
Confidence 9999986 78999999999999999999999999988899999999999999999999999999999996
No 17
>PF00352 TBP: Transcription factor TFIID (or TATA-binding protein, TBP); InterPro: IPR000814 The TATA-box binding protein (TBP) is required for the initiation of transcription by RNA polymerases I, II and III, from promoters with or without a TATA box [, ]. TBP associates with a host of factors, including the general transcription factors TFIIA, -B, -D, -E, and -H, to form huge multi-subunit pre-initiation complexes on the core promoter. Through its association with different transcription factors, TBP can initiate transcription from different RNA polymerases. There are several related TBPs, including TBP-like (TBPL) proteins []. The C-terminal core of TBP (~180 residues) is highly conserved and contains two 77-amino acid repeats that produce a saddle-shaped structure that straddles the DNA; this region binds to the TATA box and interacts with transcription factors and regulatory proteins []. By contrast, the N-terminal region varies in both length and sequence.; GO: 0003677 DNA binding, 0006355 regulation of transcription, DNA-dependent, 0006367 transcription initiation from RNA polymerase II promoter; PDB: 1D3U_A 1PCZ_B 1AIS_A 1NGM_A 1TBP_A 1TBA_B 1YTB_A 1RM1_A 1YTF_A 1NH2_A ....
Probab=99.97 E-value=3.8e-30 Score=199.12 Aligned_cols=86 Identities=37% Similarity=0.633 Sum_probs=81.5
Q ss_pred eeeeEEEEEEEEEEeCCCCChhhHhhcCCCeEEcCCCcceEEEEEcCCeeEEEEecCCeEEEEcccCHHHHHHHHHHHHH
Q psy16701 122 IIFFFFRNMVGTISVGCPLDLNQINSRVRYSEYNPGKFHGLIMKILNPRTTCLAFQSGKLLILGAKHEHDCKLASRKFAK 201 (272)
Q Consensus 122 ~~~~~I~NVVas~~l~~~ldL~~la~~~~n~eYePe~fpgli~R~~~p~~t~lIF~SGKivitGakS~e~~~~A~~ki~~ 201 (272)
+++++|+||||+++++++|||++|+..++|++||||+|||++||+++|+++++||+||||+||||||+++++.|++++++
T Consensus 1 ~~~~~i~NIva~~~l~~~idL~~la~~~~~~~YePe~fpgl~~r~~~p~~t~~IF~sGki~itGaks~~~~~~a~~~i~~ 80 (86)
T PF00352_consen 1 FPDFKIVNIVASFDLPFEIDLEELAEELENVEYEPERFPGLIYRLRNPKATVLIFSSGKIVITGAKSEEEAKKAIEKILP 80 (86)
T ss_dssp -EEEEEEEEEEEEE-SSEB-HHHHHHHSTTEEEETTTESSEEEEETTTTEEEEEETTSEEEEEEESSHHHHHHHHHHHHH
T ss_pred CCccEEEEEEEEEECCCccCHHHHHhhccCcEEeeccCCeEEEeecCCcEEEEEEcCCEEEEEecCCHHHHHHHHHHHHH
Confidence 36899999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred HHHhCC
Q psy16701 202 ILKQLG 207 (272)
Q Consensus 202 iL~klg 207 (272)
+|+++|
T Consensus 81 ~L~~~~ 86 (86)
T PF00352_consen 81 ILQKLG 86 (86)
T ss_dssp HHHHTT
T ss_pred HHHHcC
Confidence 999986
No 18
>PF00352 TBP: Transcription factor TFIID (or TATA-binding protein, TBP); InterPro: IPR000814 The TATA-box binding protein (TBP) is required for the initiation of transcription by RNA polymerases I, II and III, from promoters with or without a TATA box [, ]. TBP associates with a host of factors, including the general transcription factors TFIIA, -B, -D, -E, and -H, to form huge multi-subunit pre-initiation complexes on the core promoter. Through its association with different transcription factors, TBP can initiate transcription from different RNA polymerases. There are several related TBPs, including TBP-like (TBPL) proteins []. The C-terminal core of TBP (~180 residues) is highly conserved and contains two 77-amino acid repeats that produce a saddle-shaped structure that straddles the DNA; this region binds to the TATA box and interacts with transcription factors and regulatory proteins []. By contrast, the N-terminal region varies in both length and sequence.; GO: 0003677 DNA binding, 0006355 regulation of transcription, DNA-dependent, 0006367 transcription initiation from RNA polymerase II promoter; PDB: 1D3U_A 1PCZ_B 1AIS_A 1NGM_A 1TBP_A 1TBA_B 1YTB_A 1RM1_A 1YTF_A 1NH2_A ....
Probab=99.95 E-value=3.1e-28 Score=188.44 Aligned_cols=84 Identities=37% Similarity=0.636 Sum_probs=80.5
Q ss_pred cceEEEEEEEEEeCCccChhhHhhhCCCceecCCCcceEEEEEcCCceEEEeecCCcEEEEecCChhHHHHHHHHHHHHH
Q psy16701 35 RGTNLNMVGTISVGCPLDLNQINSRVRYSEYNPGKFHGLIMKILNPRTTCLAFQSGKLLILGAKHEHDCKLASRKFAKIL 114 (272)
Q Consensus 35 ~~~i~NvV~s~~l~~~ldL~~la~~~~n~eY~P~~f~~l~~R~~~P~~t~lIf~sGKiv~tGakS~~~a~~a~~k~~~~L 114 (272)
+++|+||||+++++++|||++|+..++|++|||++|||+++|+++|+++++||+||||+||||+|+++++.|++++.++|
T Consensus 3 ~~~i~NIva~~~l~~~idL~~la~~~~~~~YePe~fpgl~~r~~~p~~t~~IF~sGki~itGaks~~~~~~a~~~i~~~L 82 (86)
T PF00352_consen 3 DFKIVNIVASFDLPFEIDLEELAEELENVEYEPERFPGLIYRLRNPKATVLIFSSGKIVITGAKSEEEAKKAIEKILPIL 82 (86)
T ss_dssp EEEEEEEEEEEE-SSEB-HHHHHHHSTTEEEETTTESSEEEEETTTTEEEEEETTSEEEEEEESSHHHHHHHHHHHHHHH
T ss_pred ccEEEEEEEEEECCCccCHHHHHhhccCcEEeeccCCeEEEeecCCcEEEEEEcCCEEEEEecCCHHHHHHHHHHHHHHH
Confidence 68999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred HhcC
Q psy16701 115 KQLG 118 (272)
Q Consensus 115 ~~lg 118 (272)
+++|
T Consensus 83 ~~~~ 86 (86)
T PF00352_consen 83 QKLG 86 (86)
T ss_dssp HHTT
T ss_pred HHcC
Confidence 9986
No 19
>PF11858 DUF3378: Domain of unknown function (DUF3378); InterPro: IPR024568 Ribonuclease HIII 3.1.26 from EC cleaves RNA from DNA-RNA hybrids. It catalyses endonucleolytic cleavage to 5'-phospho-monoesters. Two types of ribonuclease H in Bacillus subtilis, RNase HII (rnhB) and RNase HIII (rnhC), are both known experimentally and are quite similar to each other []. The only RNase H homologue in the Mycoplasmas resembles rnhC. Archaeal forms resemble HII more closely than HIII. This entry represents the uncharacterised N-terminal domain of bacterial RNase HIII.; PDB: 3ASM_A 2D0C_A 2D0B_A 2D0A_A.
Probab=96.44 E-value=0.003 Score=48.75 Aligned_cols=34 Identities=24% Similarity=0.304 Sum_probs=28.5
Q ss_pred CcceEEEEEcCCeeEEEEecCCeEEEEcccCHHH
Q psy16701 158 KFHGLIMKILNPRTTCLAFQSGKLLILGAKHEHD 191 (272)
Q Consensus 158 ~fpgli~R~~~p~~t~lIF~SGKivitGakS~e~ 191 (272)
.=|+.+|+..-+.+|+++|.||||++.|...+.+
T Consensus 27 ~~p~~~f~aK~~~~tIt~Y~SGKV~FQG~~Ae~~ 60 (81)
T PF11858_consen 27 KPPYAVFQAKYNGVTITAYKSGKVVFQGKNAEQE 60 (81)
T ss_dssp --TTEEEEEEETTEEEEEETTSEEEEESTTHHHH
T ss_pred CCCCEEEEEeCCCeEEEEEeCCeEEEECCCHHHH
Confidence 3399999999999999999999999999754433
No 20
>PF11858 DUF3378: Domain of unknown function (DUF3378); InterPro: IPR024568 Ribonuclease HIII 3.1.26 from EC cleaves RNA from DNA-RNA hybrids. It catalyses endonucleolytic cleavage to 5'-phospho-monoesters. Two types of ribonuclease H in Bacillus subtilis, RNase HII (rnhB) and RNase HIII (rnhC), are both known experimentally and are quite similar to each other []. The only RNase H homologue in the Mycoplasmas resembles rnhC. Archaeal forms resemble HII more closely than HIII. This entry represents the uncharacterised N-terminal domain of bacterial RNase HIII.; PDB: 3ASM_A 2D0C_A 2D0B_A 2D0A_A.
Probab=95.94 E-value=0.014 Score=45.07 Aligned_cols=47 Identities=17% Similarity=0.209 Sum_probs=32.7
Q ss_pred hhhHhhhCCCceecCCCcceEEEEEcCCceEEEeecCCcEEEEecCCh
Q psy16701 53 LNQINSRVRYSEYNPGKFHGLIMKILNPRTTCLAFQSGKLLILGAKHE 100 (272)
Q Consensus 53 L~~la~~~~n~eY~P~~f~~l~~R~~~P~~t~lIf~sGKiv~tGakS~ 100 (272)
+++|...+..-. -+..-|+++.+.+.+.+|+++|.|||++..|...+
T Consensus 12 i~~l~~~y~~~~-~~~~~p~~~f~aK~~~~tIt~Y~SGKV~FQG~~Ae 58 (81)
T PF11858_consen 12 IEKLKKYYKPYL-TSSKPPYAVFQAKYNGVTITAYKSGKVVFQGKNAE 58 (81)
T ss_dssp HHHHHHHSTT-B--SS--TTEEEEEEETTEEEEEETTSEEEEESTTHH
T ss_pred HHHHHHHHHHhc-ccCCCCCEEEEEeCCCeEEEEEeCCeEEEECCCHH
Confidence 344444443322 13455899999999999999999999999997433
No 21
>COG4871 Uncharacterized protein conserved in archaea [Function unknown]
Probab=95.46 E-value=0.086 Score=45.94 Aligned_cols=73 Identities=16% Similarity=0.160 Sum_probs=58.3
Q ss_pred EEEEEeCCCCChhhHhhcCCCeEEcCCCcceEEEEEcCCeeEEEEecCCeEEEEcccCHHHHHHHHHHHHHHHHhC
Q psy16701 131 VGTISVGCPLDLNQINSRVRYSEYNPGKFHGLIMKILNPRTTCLAFQSGKLLILGAKHEHDCKLASRKFAKILKQL 206 (272)
Q Consensus 131 Vas~~l~~~ldL~~la~~~~n~eYePe~fpgli~R~~~p~~t~lIF~SGKivitGakS~e~~~~A~~ki~~iL~kl 206 (272)
.+..+-++.==|..|+..++++.|.-.+ ..|+++.-+ .-++||.||||.+|--|+++++++.+..+..++++.
T Consensus 37 i~~md~~lg~ilplla~l~P~anY~~kk-~~l~~~kge--rIitiy~sGkVsm~~ikdedEAkeilgel~d~ineA 109 (193)
T COG4871 37 IANMDPPLGGILPLLAPLFPRANYSDKK-NILILQKGE--RIITIYGSGKVSMTMIKDEDEAKEILGELMDIINEA 109 (193)
T ss_pred EeecCCCcchhHHHhHhhCCCccccccc-ceEEEeecc--EEEEEccCCeEEeeeecCHHHHHHHHHHHHHHHHHH
Confidence 3433433433477788888999999665 888888654 567899999999999999999999999999998765
No 22
>COG4871 Uncharacterized protein conserved in archaea [Function unknown]
Probab=94.52 E-value=0.16 Score=44.22 Aligned_cols=63 Identities=19% Similarity=0.205 Sum_probs=53.2
Q ss_pred ChhhHhhhCCCceecCCCcceEEEEEcCCceEEEeecCCcEEEEecCChhHHHHHHHHHHHHHHhc
Q psy16701 52 DLNQINSRVRYSEYNPGKFHGLIMKILNPRTTCLAFQSGKLLILGAKHEHDCKLASRKFAKILKQL 117 (272)
Q Consensus 52 dL~~la~~~~n~eY~P~~f~~l~~R~~~P~~t~lIf~sGKiv~tGakS~~~a~~a~~k~~~~L~~l 117 (272)
=|.-|+..+.++.|+.++ .+++++.-+ .-++||.|||+..|=.+++++|+..+..+.+++++.
T Consensus 47 ilplla~l~P~anY~~kk-~~l~~~kge--rIitiy~sGkVsm~~ikdedEAkeilgel~d~ineA 109 (193)
T COG4871 47 ILPLLAPLFPRANYSDKK-NILILQKGE--RIITIYGSGKVSMTMIKDEDEAKEILGELMDIINEA 109 (193)
T ss_pred hHHHhHhhCCCccccccc-ceEEEeecc--EEEEEccCCeEEeeeecCHHHHHHHHHHHHHHHHHH
Confidence 466788888999999765 677666544 567899999999999999999999999999888775
No 23
>TIGR00716 rnhC ribonuclease HIII. Two types of ribonuclease H in Bacillus subtilis, RNase HII (rnhB) and RNase HIII (rnhC), are both known experimentally and are quite similar to each other. The only RNase H homolog in the Mycoplasmas resembles rnhC. Archaeal forms resemble HII more closely than HIII. This model describes bacterial RNase III.
Probab=93.05 E-value=0.16 Score=47.66 Aligned_cols=33 Identities=27% Similarity=0.384 Sum_probs=29.1
Q ss_pred cCCCcceEEEEEcCCeeEEEEecCCeEEEEccc
Q psy16701 155 NPGKFHGLIMKILNPRTTCLAFQSGKLLILGAK 187 (272)
Q Consensus 155 ePe~fpgli~R~~~p~~t~lIF~SGKivitGak 187 (272)
.+..=|+.+|+...+.+|+++|.|||+++.|..
T Consensus 22 ~~~~~~~~~f~~k~~~~~it~Y~SgKv~fQG~~ 54 (284)
T TIGR00716 22 TKSNPPYTVFQLEGPGVKVTYYQSGKLLIQGKN 54 (284)
T ss_pred ccCCCCCeEEEEeCCCeEEEEEeCCEEEEeCCC
Confidence 445569999999999999999999999999943
No 24
>COG1039 RnhC Ribonuclease HIII [DNA replication, recombination, and repair]
Probab=92.58 E-value=0.28 Score=46.27 Aligned_cols=42 Identities=21% Similarity=0.195 Sum_probs=35.4
Q ss_pred CcceEEEEEcCCeeEEEEecCCeEEEEcccCHHHHHHHHHHH
Q psy16701 158 KFHGLIMKILNPRTTCLAFQSGKLLILGAKHEHDCKLASRKF 199 (272)
Q Consensus 158 ~fpgli~R~~~p~~t~lIF~SGKivitGakS~e~~~~A~~ki 199 (272)
..|+.+|+.+.|.+++.+|.|||+++.|-.+++.+++-...+
T Consensus 28 ~~~~~~f~ak~~gvtv~~Y~Sgk~~~QG~~ae~~~~~~l~~~ 69 (297)
T COG1039 28 NPPYTVFAAKSPGVTVTIYKSGKVVIQGKGAEAFAKEFLNPI 69 (297)
T ss_pred CCCceEEEeeCCCeEEEEEccceEEEecCCHHHHHHHHhhhh
Confidence 679999999999999999999999999976666666555533
No 25
>TIGR00716 rnhC ribonuclease HIII. Two types of ribonuclease H in Bacillus subtilis, RNase HII (rnhB) and RNase HIII (rnhC), are both known experimentally and are quite similar to each other. The only RNase H homolog in the Mycoplasmas resembles rnhC. Archaeal forms resemble HII more closely than HIII. This model describes bacterial RNase III.
Probab=92.13 E-value=0.23 Score=46.57 Aligned_cols=36 Identities=28% Similarity=0.389 Sum_probs=30.2
Q ss_pred cCCCcceEEEEEcCCceEEEeecCCcEEEEecCChh
Q psy16701 66 NPGKFHGLIMKILNPRTTCLAFQSGKLLILGAKHEH 101 (272)
Q Consensus 66 ~P~~f~~l~~R~~~P~~t~lIf~sGKiv~tGakS~~ 101 (272)
.+..-|+.+.+.+.|.+|+++|.|||++.+|...++
T Consensus 22 ~~~~~~~~~f~~k~~~~~it~Y~SgKv~fQG~~ae~ 57 (284)
T TIGR00716 22 TKSNPPYTVFQLEGPGVKVTYYQSGKLLIQGKNSEK 57 (284)
T ss_pred ccCCCCCeEEEEeCCCeEEEEEeCCEEEEeCCCHHH
Confidence 445568999999999999999999999999954433
No 26
>PRK00996 ribonuclease HIII; Provisional
Probab=92.02 E-value=0.26 Score=46.67 Aligned_cols=32 Identities=28% Similarity=0.464 Sum_probs=28.5
Q ss_pred CCCcceEEEEEcCCeeEEEEecCCeEEEEccc
Q psy16701 156 PGKFHGLIMKILNPRTTCLAFQSGKLLILGAK 187 (272)
Q Consensus 156 Pe~fpgli~R~~~p~~t~lIF~SGKivitGak 187 (272)
+..-|+..|+...+.+++.+|.|||+++.|..
T Consensus 26 ~~~~~~~~f~~k~~~~~it~Y~SGKv~~QG~~ 57 (304)
T PRK00996 26 PSLPPGAVFAAKKPGVTITAYKSGKVVFQGKG 57 (304)
T ss_pred cCCCCceEEEEcCCCeEEEEEeCCEEEEeCCC
Confidence 34568999999999999999999999999954
No 27
>COG1039 RnhC Ribonuclease HIII [DNA replication, recombination, and repair]
Probab=89.96 E-value=0.5 Score=44.64 Aligned_cols=40 Identities=23% Similarity=0.201 Sum_probs=34.6
Q ss_pred CcceEEEEEcCCceEEEeecCCcEEEEecCChhHHHHHHH
Q psy16701 69 KFHGLIMKILNPRTTCLAFQSGKLLILGAKHEHDCKLASR 108 (272)
Q Consensus 69 ~f~~l~~R~~~P~~t~lIf~sGKiv~tGakS~~~a~~a~~ 108 (272)
..|..+.+.+.|.+|+++|.|||+++.|-.+++.++.=..
T Consensus 28 ~~~~~~f~ak~~gvtv~~Y~Sgk~~~QG~~ae~~~~~~l~ 67 (297)
T COG1039 28 NPPYTVFAAKSPGVTVTIYKSGKVVIQGKGAEAFAKEFLN 67 (297)
T ss_pred CCCceEEEeeCCCeEEEEEccceEEEecCCHHHHHHHHhh
Confidence 6689999999999999999999999999877777665444
No 28
>PRK00996 ribonuclease HIII; Provisional
Probab=89.17 E-value=0.59 Score=44.27 Aligned_cols=34 Identities=29% Similarity=0.464 Sum_probs=28.9
Q ss_pred CCCcceEEEEEcCCceEEEeecCCcEEEEecCCh
Q psy16701 67 PGKFHGLIMKILNPRTTCLAFQSGKLLILGAKHE 100 (272)
Q Consensus 67 P~~f~~l~~R~~~P~~t~lIf~sGKiv~tGakS~ 100 (272)
+...|+.+.+.+.|.+|+++|.|||++.+|...+
T Consensus 26 ~~~~~~~~f~~k~~~~~it~Y~SGKv~~QG~~ae 59 (304)
T PRK00996 26 PSLPPGAVFAAKKPGVTITAYKSGKVVFQGKGAE 59 (304)
T ss_pred cCCCCceEEEEcCCCeEEEEEeCCEEEEeCCCHH
Confidence 3456789999999999999999999999995433
No 29
>PRK12475 thiamine/molybdopterin biosynthesis MoeB-like protein; Provisional
Probab=88.50 E-value=1.1 Score=42.85 Aligned_cols=57 Identities=19% Similarity=0.370 Sum_probs=44.2
Q ss_pred CccChhhHhhhCC---CceecCCCcceEEEEEcCCceEEEeecCCcEEEEecCChhHHHHHHHHH
Q psy16701 49 CPLDLNQINSRVR---YSEYNPGKFHGLIMKILNPRTTCLAFQSGKLLILGAKHEHDCKLASRKF 110 (272)
Q Consensus 49 ~~ldL~~la~~~~---n~eY~P~~f~~l~~R~~~P~~t~lIf~sGKiv~tGakS~~~a~~a~~k~ 110 (272)
.++|+++++.+++ ..+.|| ..++++-..-.+.+|++|++++.|++++.+|+.-..++
T Consensus 277 ~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~f~~gr~~~~g~~~~~~a~~~~~~~ 336 (338)
T PRK12475 277 RRLNLEEIKKRLQKIGKVDANP-----YLLSFQLDEYRFVLFTDGRAFIHGTNDIKKAKRLYARY 336 (338)
T ss_pred CccCHHHHHHHHhhcCEEEecc-----cEEEEEECCEEEEEEcCCcEEEECCCCHHHHHHHHHHh
Confidence 6899999998775 234443 33455555589999999999999999999999776665
No 30
>PRK07688 thiamine/molybdopterin biosynthesis ThiF/MoeB-like protein; Validated
Probab=85.73 E-value=1.9 Score=41.26 Aligned_cols=59 Identities=17% Similarity=0.337 Sum_probs=47.3
Q ss_pred CccChhhHhhhCCCc--eecCCCcceEEEEEcCCceEEEeecCCcEEEEecCChhHHHHHHHHH
Q psy16701 49 CPLDLNQINSRVRYS--EYNPGKFHGLIMKILNPRTTCLAFQSGKLLILGAKHEHDCKLASRKF 110 (272)
Q Consensus 49 ~~ldL~~la~~~~n~--eY~P~~f~~l~~R~~~P~~t~lIf~sGKiv~tGakS~~~a~~a~~k~ 110 (272)
.++++.+++..++.. ++.. +-..++++.+.-.+..|++|++++.|++++.+|+.-..++
T Consensus 277 ~~i~~~~~~~~l~~~~~~~~~---~~~ll~vr~~~~~~~~~~~gr~~i~g~~~~~~a~~~~~~~ 337 (339)
T PRK07688 277 EEYDLEELAELLRDRGLDVNV---NPYLLSFSLEEKRLVLFKDGRVLVHGTKDISEAKTIYHRY 337 (339)
T ss_pred CccCHHHHHHHHHhcccccCC---CcEEEEEecCCeEEEEEcCCCEEEECCCCHHHHHHHHHHh
Confidence 568999999887543 4443 3466788888899999999999999999999999776665
No 31
>PRK12475 thiamine/molybdopterin biosynthesis MoeB-like protein; Provisional
Probab=74.23 E-value=12 Score=35.83 Aligned_cols=35 Identities=14% Similarity=0.295 Sum_probs=28.4
Q ss_pred EEEEcCCeeEEEEecCCeEEEEcccCHHHHHHHHHHH
Q psy16701 163 IMKILNPRTTCLAFQSGKLLILGAKHEHDCKLASRKF 199 (272)
Q Consensus 163 i~R~~~p~~t~lIF~SGKivitGakS~e~~~~A~~ki 199 (272)
.++..+ -.+.+|+.|+++|-|.+++++|+.-..++
T Consensus 302 ~~~~~~--~~~~~f~~gr~~~~g~~~~~~a~~~~~~~ 336 (338)
T PRK12475 302 SFQLDE--YRFVLFTDGRAFIHGTNDIKKAKRLYARY 336 (338)
T ss_pred EEEECC--EEEEEEcCCcEEEECCCCHHHHHHHHHHh
Confidence 445544 68899999999999999999998776654
No 32
>PF13575 DUF4135: Domain of unknown function (DUF4135)
Probab=71.54 E-value=23 Score=33.94 Aligned_cols=49 Identities=16% Similarity=0.282 Sum_probs=35.2
Q ss_pred CHHHHHHHHHHHHHHHHhCCCCCCCcceEEEEEEEEeecCCccChHHHHh
Q psy16701 188 HEHDCKLASRKFAKILKQLGHPIKYQGFKIHNIVCTCDVRFPVKLDALHH 237 (272)
Q Consensus 188 S~e~~~~A~~ki~~iL~klg~~~~~~~f~V~NIvas~d~~f~I~Le~la~ 237 (272)
+.++++.-..++-..|.=+ +-....|+--.||+|+.+.|+.||||-|-.
T Consensus 129 ~~~ev~~yY~r~G~llal~-y~L~~~DlH~ENIIa~g~~PvlIDlETlf~ 177 (370)
T PF13575_consen 129 SEEEVERYYYRLGVLLALL-YLLNGTDLHFENIIASGEYPVLIDLETLFH 177 (370)
T ss_pred CHHHHHHHHHHHHHHHHHH-HHhCCCcccccceEEeCCCcEEEehhhhCC
Confidence 7778777776665554322 222335677899999999999999998864
No 33
>PRK15468 carboxysome structural protein EutS; Provisional
Probab=67.63 E-value=9.5 Score=31.18 Aligned_cols=35 Identities=26% Similarity=0.210 Sum_probs=29.3
Q ss_pred EEecCCeEEEEcccCHHHHHHHHHHHHHHHHh-CCCCC
Q psy16701 174 LAFQSGKLLILGAKHEHDCKLASRKFAKILKQ-LGHPI 210 (272)
Q Consensus 174 lIF~SGKivitGakS~e~~~~A~~ki~~iL~k-lg~~~ 210 (272)
+=--||.+++|| +.++++.|++.+.+.|++ |||.+
T Consensus 70 ~DRFsGslvitG--dvs~Ve~Al~~V~~~l~~~L~F~~ 105 (111)
T PRK15468 70 LDRFSGALVIYG--SVGAVEEALSQTVSGLGRLLNYTL 105 (111)
T ss_pred eeccceeEEEEc--cHHHHHHHHHHHHHHHHhhcCccc
Confidence 334589999999 588999999999999998 78864
No 34
>PRK07688 thiamine/molybdopterin biosynthesis ThiF/MoeB-like protein; Validated
Probab=66.18 E-value=14 Score=35.36 Aligned_cols=38 Identities=16% Similarity=0.391 Sum_probs=30.6
Q ss_pred EEEEEcCCeeEEEEecCCeEEEEcccCHHHHHHHHHHH
Q psy16701 162 LIMKILNPRTTCLAFQSGKLLILGAKHEHDCKLASRKF 199 (272)
Q Consensus 162 li~R~~~p~~t~lIF~SGKivitGakS~e~~~~A~~ki 199 (272)
+..+++.+.-.+..|++|+++|-|.+++.+|+.-..++
T Consensus 300 ~ll~vr~~~~~~~~~~~gr~~i~g~~~~~~a~~~~~~~ 337 (339)
T PRK07688 300 YLLSFSLEEKRLVLFKDGRVLVHGTKDISEAKTIYHRY 337 (339)
T ss_pred EEEEEecCCeEEEEEcCCCEEEECCCCHHHHHHHHHHh
Confidence 34455555689999999999999999999988776654
No 35
>TIGR02187 GlrX_arch Glutaredoxin-like domain protein. This family of archaeal proteins contains a C-terminal domain with homology to bacterial and eukaryotic glutaredoxins, including a CPYC motif. There is an N-terminal domain which has even more distant homology to glutaredoxins. The name "glutaredoxin" may be inappropriate in the sense of working in tandem with glutathione and glutathione reductase which may not be present in the archaea. The overall domain structure appears to be related to bacterial alkylhydroperoxide reductases, but the homology may be distant enough that the function of this family is wholly different.
Probab=61.56 E-value=50 Score=29.02 Aligned_cols=113 Identities=12% Similarity=0.045 Sum_probs=64.2
Q ss_pred eEEEeecCCcEE---EEecCChhHHHHHHHHHHHHHHhcC--CCc--------ee-eeEEEEEEEEEEeCCCC---Chhh
Q psy16701 82 TTCLAFQSGKLL---ILGAKHEHDCKLASRKFAKILKQLG--HPE--------II-FFFFRNMVGTISVGCPL---DLNQ 144 (272)
Q Consensus 82 ~t~lIf~sGKiv---~tGakS~~~a~~a~~k~~~~L~~lg--~~~--------~~-~~~I~NVVas~~l~~~l---dL~~ 144 (272)
.|+.+|.+|+.+ ..|..+.++...-++.+..+ ...+ .+. .. ...|.=..+..+-.|+. -+++
T Consensus 79 Pt~~~f~~g~~~~~~~~G~~~~~~l~~~i~~~~~~-~~~~~~L~~~~~~~l~~~~~pv~I~~F~a~~C~~C~~~~~~l~~ 157 (215)
T TIGR02187 79 PTTIILEEGKDGGIRYTGIPAGYEFAALIEDIVRV-SQGEPGLSEKTVELLQSLDEPVRIEVFVTPTCPYCPYAVLMAHK 157 (215)
T ss_pred CEEEEEeCCeeeEEEEeecCCHHHHHHHHHHHHHh-cCCCCCCCHHHHHHHHhcCCCcEEEEEECCCCCCcHHHHHHHHH
Confidence 488999999986 46988887766555554322 1111 111 01 11222233444444542 2445
Q ss_pred HhhcCCC---eEEcCCCcceEEEE--EcCCeeEEEEecCCeEEEEcccCHHHHHHHHH
Q psy16701 145 INSRVRY---SEYNPGKFHGLIMK--ILNPRTTCLAFQSGKLLILGAKHEHDCKLASR 197 (272)
Q Consensus 145 la~~~~n---~eYePe~fpgli~R--~~~p~~t~lIF~SGKivitGakS~e~~~~A~~ 197 (272)
++...++ ..+|++.+|.+.-+ +.. --|+.+|..|+. ..|..+.+++...+.
T Consensus 158 l~~~~~~i~~~~vD~~~~~~~~~~~~V~~-vPtl~i~~~~~~-~~G~~~~~~l~~~l~ 213 (215)
T TIGR02187 158 FALANDKILGEMIEANENPDLAEKYGVMS-VPKIVINKGVEE-FVGAYPEEQFLEYIL 213 (215)
T ss_pred HHHhcCceEEEEEeCCCCHHHHHHhCCcc-CCEEEEecCCEE-EECCCCHHHHHHHHH
Confidence 5543333 45788888776533 322 246778988875 889999888766654
No 36
>PLN00410 U5 snRNP protein, DIM1 family; Provisional
Probab=56.15 E-value=25 Score=29.83 Aligned_cols=81 Identities=10% Similarity=0.170 Sum_probs=48.4
Q ss_pred eEEEEEEEEEEeCCCC---ChhhHhhcCCC----eEEcCCCcceE--EEEEcCCeeEEEEecCCeEEE---Ec-------
Q psy16701 125 FFFRNMVGTISVGCPL---DLNQINSRVRY----SEYNPGKFHGL--IMKILNPRTTCLAFQSGKLLI---LG------- 185 (272)
Q Consensus 125 ~~I~NVVas~~l~~~l---dL~~la~~~~n----~eYePe~fpgl--i~R~~~p~~t~lIF~SGKivi---tG------- 185 (272)
+.+...-|+-+-+|.. -|++++..+++ ..=|=+..|.+ .|.++++-.++.+|.+|++.+ ||
T Consensus 25 lVVvdF~A~WCgpCk~m~p~l~~la~~~~~~~~~~kVDVDe~~dla~~y~I~~~~t~~~ffk~g~~~vd~~tG~~~k~~~ 104 (142)
T PLN00410 25 LVVIRFGHDWDETCMQMDEVLASVAETIKNFAVIYLVDITEVPDFNTMYELYDPCTVMFFFRNKHIMIDLGTGNNNKINW 104 (142)
T ss_pred EEEEEEECCCChhHHHHHHHHHHHHHHcCCceEEEEEECCCCHHHHHHcCccCCCcEEEEEECCeEEEEEeccccccccc
Confidence 3344555555555533 25666665544 12233344443 355666667778999999655 77
Q ss_pred -ccCHHHHHHHHHHHHHHHHh
Q psy16701 186 -AKHEHDCKLASRKFAKILKQ 205 (272)
Q Consensus 186 -akS~e~~~~A~~ki~~iL~k 205 (272)
..+.+++...++.+++--++
T Consensus 105 ~~~~k~~l~~~i~~~~~~a~~ 125 (142)
T PLN00410 105 ALKDKQEFIDIVETVYRGARK 125 (142)
T ss_pred ccCCHHHHHHHHHHHHHHHhc
Confidence 56778888887777776433
No 37
>PRK15468 carboxysome structural protein EutS; Provisional
Probab=50.96 E-value=24 Score=28.90 Aligned_cols=31 Identities=26% Similarity=0.202 Sum_probs=27.3
Q ss_pred cCCcEEEEecCChhHHHHHHHHHHHHHHh-cCCC
Q psy16701 88 QSGKLLILGAKHEHDCKLASRKFAKILKQ-LGHP 120 (272)
Q Consensus 88 ~sGKiv~tGakS~~~a~~a~~k~~~~L~~-lg~~ 120 (272)
=||.+++|| ++.+.+.|++.+.+.|++ |||.
T Consensus 73 FsGslvitG--dvs~Ve~Al~~V~~~l~~~L~F~ 104 (111)
T PRK15468 73 FSGALVIYG--SVGAVEEALSQTVSGLGRLLNYT 104 (111)
T ss_pred cceeEEEEc--cHHHHHHHHHHHHHHHHhhcCcc
Confidence 589999999 788889999999999988 7885
No 38
>cd01554 EPT-like Enol pyruvate transferases family includes EPSP synthases and UDP-N-acetylglucosamine enolpyruvyl transferase. Both enzymes catalyze the reaction of enolpyruvyl transfer.
Probab=46.37 E-value=16 Score=34.81 Aligned_cols=106 Identities=17% Similarity=0.125 Sum_probs=56.2
Q ss_pred cCCcEEEEecCChhHHHHHHHHHHHHHHhcCCCceeeeEEEEEEEEEEeCCCCChhhHhhcCCCeEEcCCCcceEEEEEc
Q psy16701 88 QSGKLLILGAKHEHDCKLASRKFAKILKQLGHPEIIFFFFRNMVGTISVGCPLDLNQINSRVRYSEYNPGKFHGLIMKIL 167 (272)
Q Consensus 88 ~sGKiv~tGakS~~~a~~a~~k~~~~L~~lg~~~~~~~~I~NVVas~~l~~~ldL~~la~~~~n~eYePe~fpgli~R~~ 167 (272)
..|++.+.|..... ....+.+.|+++|.+... .-.++. +... . ++..+++-+.+|... -..
T Consensus 239 ~~~~v~i~~~~~~~----~~~~~~~~L~~~G~~v~~--~~~~i~----v~~~-~-------~~~~~i~~~~~~~~~-d~~ 299 (408)
T cd01554 239 APGRLVLQNVGINE----TRTGIIDVLRAMGAKIEI--GEDTIS----VESS-D-------LKATEICGALIPRLI-DEL 299 (408)
T ss_pred cCCeEEEecCCCCc----hhhHHHHHHHHcCCEEEE--eCCeEE----EecC-C-------ceeEEeccccCCCCc-hHH
Confidence 34788888875432 667888999999864211 000111 1111 1 122222222211000 000
Q ss_pred CCeeEEEEecCCeEEEEcccCHHHHH-HHHHHHHHHHHhCCCCCCC
Q psy16701 168 NPRTTCLAFQSGKLLILGAKHEHDCK-LASRKFAKILKQLGHPIKY 212 (272)
Q Consensus 168 ~p~~t~lIF~SGKivitGakS~e~~~-~A~~ki~~iL~klg~~~~~ 212 (272)
..-+.+..|..|+.+++|......-+ .-+..+.+.|+++|.++..
T Consensus 300 p~l~~~a~~a~g~~~i~~~~~lr~ke~dr~~~~~~~L~~~G~~i~~ 345 (408)
T cd01554 300 PIIALLALQAQGTTVIKDAEELKVKETDRIFVVADELNSMGADIEP 345 (408)
T ss_pred HHHHHHHHcCCCcEEEECcccccccchhhHHHHHHHHHHcCCEEEE
Confidence 11245667899999999986632111 2245678999999987654
No 39
>cd03065 PDI_b_Calsequestrin_N PDIb family, Calsequestrin subfamily, N-terminal TRX-fold domain; Calsequestrin is the major calcium storage protein in the sarcoplasmic reticulum (SR) of skeletal and cardiac muscle. It stores calcium ions in sufficient quantities (up to 20 mM) to allow repetitive contractions and is essential to maintain movement, respiration and heart beat. A missense mutation in human cardiac calsequestrin is associated with catecholamine-induced polymorphic ventricular tachycardia (CPVT), a rare disease characterized by seizures or sudden death in response to physiologic or emotional stress. Calsequestrin is a highly acidic protein with up to 50 calcium binding sites formed simply by the clustering of two or more acidic residues. The monomer contains three redox inactive TRX-fold domains. Calsequestrin is condensed as a linear polymer in the SR lumen and is membrane-anchored through binding with intra-membrane proteins triadin, junctin and ryanodine receptor (RyR) Ca2
Probab=42.42 E-value=38 Score=27.62 Aligned_cols=30 Identities=7% Similarity=0.032 Sum_probs=23.2
Q ss_pred eEEEEecCCeEE-EEcccCHHHHHHHHHHHH
Q psy16701 171 TTCLAFQSGKLL-ILGAKHEHDCKLASRKFA 200 (272)
Q Consensus 171 ~t~lIF~SGKiv-itGakS~e~~~~A~~ki~ 200 (272)
-|+.+|.+|+++ .+|+.+.+.+...++++.
T Consensus 89 PTl~lfk~G~~v~~~G~~~~~~l~~~l~~~~ 119 (120)
T cd03065 89 DSIYVFKDDEVIEYDGEFAADTLVEFLLDLI 119 (120)
T ss_pred cEEEEEECCEEEEeeCCCCHHHHHHHHHHHh
Confidence 488999999944 579999888877766553
No 40
>PLN00410 U5 snRNP protein, DIM1 family; Provisional
Probab=39.81 E-value=46 Score=28.18 Aligned_cols=62 Identities=11% Similarity=0.225 Sum_probs=37.4
Q ss_pred CCcceEEEEEEEEEeCCc-c--ChhhHhhhCCC-c---eecCCCcceEE--EEEcCCceEEEeecCCcEEE
Q psy16701 33 NRRGTNLNMVGTISVGCP-L--DLNQINSRVRY-S---EYNPGKFHGLI--MKILNPRTTCLAFQSGKLLI 94 (272)
Q Consensus 33 ~~~~~i~NvV~s~~l~~~-l--dL~~la~~~~n-~---eY~P~~f~~l~--~R~~~P~~t~lIf~sGKiv~ 94 (272)
++.+.+...-|+-.-.|. + =|++++...++ + .-|=++.+.+. +.++.|-+++.+|++|++.+
T Consensus 22 ~~~lVVvdF~A~WCgpCk~m~p~l~~la~~~~~~~~~~kVDVDe~~dla~~y~I~~~~t~~~ffk~g~~~v 92 (142)
T PLN00410 22 EERLVVIRFGHDWDETCMQMDEVLASVAETIKNFAVIYLVDITEVPDFNTMYELYDPCTVMFFFRNKHIMI 92 (142)
T ss_pred CCCEEEEEEECCCChhHHHHHHHHHHHHHHcCCceEEEEEECCCCHHHHHHcCccCCCcEEEEEECCeEEE
Confidence 455666665555444443 2 35788887776 2 22444444443 55666767888999999544
No 41
>PRK09381 trxA thioredoxin; Provisional
Probab=39.61 E-value=40 Score=25.71 Aligned_cols=28 Identities=25% Similarity=0.233 Sum_probs=20.9
Q ss_pred EEEEecCCeEE--EEcccCHHHHHHHHHHH
Q psy16701 172 TCLAFQSGKLL--ILGAKHEHDCKLASRKF 199 (272)
Q Consensus 172 t~lIF~SGKiv--itGakS~e~~~~A~~ki 199 (272)
++.+|.+|+++ .+|..+.++++..++..
T Consensus 78 t~~~~~~G~~~~~~~G~~~~~~l~~~i~~~ 107 (109)
T PRK09381 78 TLLLFKNGEVAATKVGALSKGQLKEFLDAN 107 (109)
T ss_pred EEEEEeCCeEEEEecCCCCHHHHHHHHHHh
Confidence 56777889887 57888888877766554
No 42
>PTZ00129 40S ribosomal protein S14; Provisional
Probab=39.07 E-value=1.2e+02 Score=26.10 Aligned_cols=53 Identities=13% Similarity=0.203 Sum_probs=41.0
Q ss_pred CCcceEEEEEcCCceEEEee-cCCcEEEEecCC---hhHHHHHHHHHHHHHHhcCCC
Q psy16701 68 GKFHGLIMKILNPRTTCLAF-QSGKLLILGAKH---EHDCKLASRKFAKILKQLGHP 120 (272)
Q Consensus 68 ~~f~~l~~R~~~P~~t~lIf-~sGKiv~tGakS---~~~a~~a~~k~~~~L~~lg~~ 120 (272)
.-|+--++-+.++.-.+++| ++|.+-..|.+. .=.|..|++++++...++|+.
T Consensus 35 as~NNTiItiTD~~G~~~~w~SsG~~gfKg~r~KsTpyAAq~aa~~~a~k~~~~Gi~ 91 (149)
T PTZ00129 35 ASFNDTFIHVTDLSGRETLVRVTGGMKVKADRDESSPYAAMMAAQDVAARCKELGIN 91 (149)
T ss_pred cccCCeEEEEEcccCCEEEEEecCcceecccccCCCHHHHHHHHHHHHHHHHHcCCe
Confidence 34555667777777776777 679999999883 446888999999999999984
No 43
>KOG0910|consensus
Probab=37.01 E-value=52 Score=28.37 Aligned_cols=77 Identities=18% Similarity=0.230 Sum_probs=53.9
Q ss_pred eeEEEEEEEEEEeCCCC---ChhhHhhcCCC----eEEcCCCcceE--EEEEcCCeeEEEEecCCeEE--EEcccCHHHH
Q psy16701 124 FFFFRNMVGTISVGCPL---DLNQINSRVRY----SEYNPGKFHGL--IMKILNPRTTCLAFQSGKLL--ILGAKHEHDC 192 (272)
Q Consensus 124 ~~~I~NVVas~~l~~~l---dL~~la~~~~n----~eYePe~fpgl--i~R~~~p~~t~lIF~SGKiv--itGakS~e~~ 192 (272)
.+.+++.-|.-+=+|.. -|++++.+..+ +.-|-+.+|-+ .|.++- --|+++|.+|..+ +.|+...+.+
T Consensus 62 ~PVlVdF~A~WCgPCk~l~P~l~~~~~~~~g~~k~~kvdtD~~~ela~~Y~I~a-vPtvlvfknGe~~d~~vG~~~~~~l 140 (150)
T KOG0910|consen 62 VPVLVDFHAEWCGPCKMLGPILEELVSEYAGKFKLYKVDTDEHPELAEDYEISA-VPTVLVFKNGEKVDRFVGAVPKEQL 140 (150)
T ss_pred CCEEEEEecCcCccHhHhhHHHHHHHHhhcCeEEEEEEccccccchHhhcceee-eeEEEEEECCEEeeeecccCCHHHH
Confidence 45677888998888876 36666654322 33333444444 344442 3589999999988 7899999999
Q ss_pred HHHHHHHHH
Q psy16701 193 KLASRKFAK 201 (272)
Q Consensus 193 ~~A~~ki~~ 201 (272)
+..++|+++
T Consensus 141 ~~~i~k~l~ 149 (150)
T KOG0910|consen 141 RSLIKKFLK 149 (150)
T ss_pred HHHHHHHhc
Confidence 999998864
No 44
>PTZ00129 40S ribosomal protein S14; Provisional
Probab=35.25 E-value=1e+02 Score=26.59 Aligned_cols=51 Identities=14% Similarity=0.190 Sum_probs=37.5
Q ss_pred cceEEEEEcCCeeEEE-EecCCeEEEEccc---CHHHHHHHHHHHHHHHHhCCCC
Q psy16701 159 FHGLIMKILNPRTTCL-AFQSGKLLILGAK---HEHDCKLASRKFAKILKQLGHP 209 (272)
Q Consensus 159 fpgli~R~~~p~~t~l-IF~SGKivitGak---S~e~~~~A~~ki~~iL~klg~~ 209 (272)
|.-.++-+.|..-..+ ..++|.+-..|++ |+-.++.|++++.+...++|+.
T Consensus 37 ~NNTiItiTD~~G~~~~w~SsG~~gfKg~r~KsTpyAAq~aa~~~a~k~~~~Gi~ 91 (149)
T PTZ00129 37 FNDTFIHVTDLSGRETLVRVTGGMKVKADRDESSPYAAMMAAQDVAARCKELGIN 91 (149)
T ss_pred cCCeEEEEEcccCCEEEEEecCcceecccccCCCHHHHHHHHHHHHHHHHHcCCe
Confidence 4455566666544444 4466999999888 3557899999999999999875
No 45
>cd01644 RT_pepA17 RT_pepA17: Reverse transcriptase (RTs) in retrotransposons. This subfamily represents the RT domain of a multifunctional enzyme. C-terminal to the RT domain is a domain homologous to aspartic proteinases (corresponding to Merops family A17) encoded by retrotransposons and retroviruses. RT catalyzes DNA replication from an RNA template and is responsible for the replication of retroelements.
Probab=34.47 E-value=47 Score=29.60 Aligned_cols=30 Identities=10% Similarity=0.293 Sum_probs=26.0
Q ss_pred EEcccCHHHHHHHHHHHHHHHHhCCCCCCC
Q psy16701 183 ILGAKHEHDCKLASRKFAKILKQLGHPIKY 212 (272)
Q Consensus 183 itGakS~e~~~~A~~ki~~iL~klg~~~~~ 212 (272)
+.|+++++++...++.+..+|++.|++.+.
T Consensus 145 li~~~s~~e~~~~~~~v~~~L~~~Gf~l~k 174 (213)
T cd01644 145 LVSTDTLNEAVNVAKRLIALLKKGGFNLRK 174 (213)
T ss_pred eecCCCHHHHHHHHHHHHHHHHhCCccchh
Confidence 347789999999999999999999997653
No 46
>COG4978 Transcriptional regulator, effector-binding domain/component [Transcription / Signal transduction mechanisms]
Probab=33.18 E-value=1.1e+02 Score=26.21 Aligned_cols=101 Identities=13% Similarity=-0.005 Sum_probs=62.2
Q ss_pred ChhHHHHHHHHHHHHHHhcCCCceeeeEEEEEEEEEEeCCCCChhhHhhcCCCeEEcCCCcceEEEEEcCCeeEEEEecC
Q psy16701 99 HEHDCKLASRKFAKILKQLGHPEIIFFFFRNMVGTISVGCPLDLNQINSRVRYSEYNPGKFHGLIMKILNPRTTCLAFQS 178 (272)
Q Consensus 99 S~~~a~~a~~k~~~~L~~lg~~~~~~~~I~NVVas~~l~~~ldL~~la~~~~n~eYePe~fpgli~R~~~p~~t~lIF~S 178 (272)
.+........++...++..|... +.-+.++..... ++=+. +.| ..-+|....--.++......+++
T Consensus 24 ~~~~~~~~~~el~~~~~~~~~~~-----~~~~~~~~~~~~-~~~~~-------~~~-~~s~~i~~~~~~~~~~~~~~~P~ 89 (153)
T COG4978 24 PERLIEQVYSELCNFLKSNGIIP-----IGPYGATIFHEP-LKEED-------VDI-EVSIPISGEVEGDIDIKIKTLPK 89 (153)
T ss_pred cHHHHHHHHHHHHHHHhhcCccc-----cCCceEEEEeee-ecccc-------ccc-ceeEEEEEecCCCCcceeEEccC
Confidence 67777778888888888887321 111122221111 11111 111 11112222211467889999999
Q ss_pred CeEEEEccc-CHHHHHHHHHHHHHHHHhCCCCCCCc
Q psy16701 179 GKLLILGAK-HEHDCKLASRKFAKILKQLGHPIKYQ 213 (272)
Q Consensus 179 GKivitGak-S~e~~~~A~~ki~~iL~klg~~~~~~ 213 (272)
||+.++=-+ +.+++..|.+++...+++.|..+..+
T Consensus 90 g~~a~~~~~G~~~~~~~~y~rli~~iee~g~~i~g~ 125 (153)
T COG4978 90 GKYACIIHKGSYEEVEQAYKRLIEYIEENGLEIIGP 125 (153)
T ss_pred ceEEEEEEEcCcccHHHHHHHHHHHHHHhCCcccCc
Confidence 988876555 89999999999999999999876653
No 47
>PRK11509 hydrogenase-1 operon protein HyaE; Provisional
Probab=33.17 E-value=79 Score=26.52 Aligned_cols=30 Identities=17% Similarity=0.181 Sum_probs=24.5
Q ss_pred eEEEEecCCeEE--EEcccCHHHHHHHHHHHH
Q psy16701 171 TTCLAFQSGKLL--ILGAKHEHDCKLASRKFA 200 (272)
Q Consensus 171 ~t~lIF~SGKiv--itGakS~e~~~~A~~ki~ 200 (272)
-|+++|++|+.+ +.|..+.++...-+++++
T Consensus 93 PTLl~FkdGk~v~~i~G~~~k~~l~~~I~~~L 124 (132)
T PRK11509 93 PATLVFTGGNYRGVLNGIHPWAELINLMRGLV 124 (132)
T ss_pred CEEEEEECCEEEEEEeCcCCHHHHHHHHHHHh
Confidence 489999999997 579999988877766554
No 48
>PRK13663 hypothetical protein; Provisional
Probab=32.53 E-value=85 Score=31.65 Aligned_cols=133 Identities=24% Similarity=0.355 Sum_probs=78.2
Q ss_pred CcEEEEecCChhHHHHHHHHHHHHHHhcCCCceeeeEEEEEEEEEEeCCCCChhhHhhc--CCCeEEcCCCcceEEEEEc
Q psy16701 90 GKLLILGAKHEHDCKLASRKFAKILKQLGHPEIIFFFFRNMVGTISVGCPLDLNQINSR--VRYSEYNPGKFHGLIMKIL 167 (272)
Q Consensus 90 GKiv~tGakS~~~a~~a~~k~~~~L~~lg~~~~~~~~I~NVVas~~l~~~ldL~~la~~--~~n~eYePe~fpgli~R~~ 167 (272)
|.+|+|=- +...++..|.+.|+++|++......|. |.|-|.+.|... ..--+|=+...|=+++.-.
T Consensus 109 ~sVVITqy----~~qp~a~~F~~rLe~~GIkvy~Hy~i~--------GYP~dv~~IVSdeGyGkN~yIeTtrpLVVVTAP 176 (493)
T PRK13663 109 GSVVITQY----DGQPAADAFRNRLERLGIKVYRHYPIK--------GYPTDVDHIVSDEGYGKNDYIETTRPLVVVTAP 176 (493)
T ss_pred eeEEEEec----CCChHHHHHHHHHHHCCCceEEecCcC--------CCCCCCCceECcCCCCCCCceeccCCeEEEeCC
Confidence 55666654 223456788999999999765444332 677788877653 4445776666677766655
Q ss_pred CCeeEEEEecCCeEEEEcccCHHHHHHHHHHHHHHHHhCCCC---CCCcceEEEEE-----------EEEeecCCccChH
Q psy16701 168 NPRTTCLAFQSGKLLILGAKHEHDCKLASRKFAKILKQLGHP---IKYQGFKIHNI-----------VCTCDVRFPVKLD 233 (272)
Q Consensus 168 ~p~~t~lIF~SGKivitGakS~e~~~~A~~ki~~iL~klg~~---~~~~~f~V~NI-----------vas~d~~f~I~Le 233 (272)
.|. |||+ ...+..++-. .+.|++ .+|+.|.|=|+ -||+|+.--=-++
T Consensus 177 GPG-------SGKl-----------ATCLSQlYhe-~krGi~AGYAKFETFPiWNlPLkHPVNlAYEAATaDL~DvNmID 237 (493)
T PRK13663 177 GPG-------SGKL-----------ATCLSQLYHD-QKRGIKAGYAKFETFPVWNLPLKHPVNLAYEAATADLDDVNMID 237 (493)
T ss_pred CCC-------ccHH-----------HHHHHHHHHH-hhcCccccccccccCcccCCcCCChHHHHHHHhhcCcccccccC
Confidence 553 4543 3445555555 344654 56777888775 2555554311112
Q ss_pred HHHh-HcC-CCceecc--ccCcee
Q psy16701 234 ALHH-VHS-QFSSYEP--ELFPGL 253 (272)
Q Consensus 234 ~la~-~~~-~~~~YEP--E~fPGL 253 (272)
-++. .|+ .-+.|+. |.||=|
T Consensus 238 pfHLeAYg~~tvNYNRDvE~FPvL 261 (493)
T PRK13663 238 PFHLEAYGKTAVNYNRDIEVFPVL 261 (493)
T ss_pred chhHHhhCccccccccccHHHHHH
Confidence 2221 334 3467776 888855
No 49
>COG0533 QRI7 Metal-dependent proteases with possible chaperone activity [Posttranslational modification, protein turnover, chaperones]
Probab=32.24 E-value=1.2e+02 Score=29.47 Aligned_cols=51 Identities=25% Similarity=0.402 Sum_probs=43.2
Q ss_pred CcceEEEEEcCCceEEEeecC-CcEEEEecCChhHHHHHHHHHHHHHHhcCCC
Q psy16701 69 KFHGLIMKILNPRTTCLAFQS-GKLLILGAKHEHDCKLASRKFAKILKQLGHP 120 (272)
Q Consensus 69 ~f~~l~~R~~~P~~t~lIf~s-GKiv~tGakS~~~a~~a~~k~~~~L~~lg~~ 120 (272)
.||.+.+-..--.+.+.-+++ |++-+.|.+..+.+=.|.+|++|.|- |++|
T Consensus 125 ~~p~v~LlVSGGHTqli~~~~~g~y~ilGeTlDdA~Gea~DKvAR~lG-L~yP 176 (342)
T COG0533 125 AFPPVALLVSGGHTQLIAVRGIGRYEVLGETLDDAAGEAFDKVARLLG-LGYP 176 (342)
T ss_pred CCCcEEEEEecCceEEEEEcCCCcEEEEeeechhhhhHHHHHHHHHhC-CCCC
Confidence 899999888999999999999 99999998777777799999999752 3444
No 50
>PHA02278 thioredoxin-like protein
Probab=31.79 E-value=63 Score=25.41 Aligned_cols=24 Identities=8% Similarity=0.191 Sum_probs=19.5
Q ss_pred eEEEEecCCeEE--EEcccCHHHHHH
Q psy16701 171 TTCLAFQSGKLL--ILGAKHEHDCKL 194 (272)
Q Consensus 171 ~t~lIF~SGKiv--itGakS~e~~~~ 194 (272)
.|+++|..|+.+ +.|..+.+++++
T Consensus 74 PT~i~fk~G~~v~~~~G~~~~~~l~~ 99 (103)
T PHA02278 74 PVLIGYKDGQLVKKYEDQVTPMQLQE 99 (103)
T ss_pred cEEEEEECCEEEEEEeCCCCHHHHHh
Confidence 488999999999 889877776543
No 51
>TIGR00090 iojap_ybeB iojap-like ribosome-associated protein. This model describes a widely distributed family of bacterial proteins related to iojap from plants. It includes YbeB from E. coli. The gene iojap is a pattern-striping gene in maize, reflecting a chloroplast development defect in some cells. More recent work in bacteria suggests that the bacterial iojap-related protein physically associates with ribosomes. The function remains unknown.
Probab=31.19 E-value=91 Score=24.51 Aligned_cols=30 Identities=17% Similarity=0.187 Sum_probs=25.3
Q ss_pred eEEEEcccCHHHHHHHHHHHHHHHHhCCCCC
Q psy16701 180 KLLILGAKHEHDCKLASRKFAKILKQLGHPI 210 (272)
Q Consensus 180 KivitGakS~e~~~~A~~ki~~iL~klg~~~ 210 (272)
-|++|| +|.-+++..++.+.+.+++.|.++
T Consensus 32 ~VI~Tg-~S~rh~~aia~~v~~~~k~~~~~~ 61 (99)
T TIGR00090 32 FVIASG-TSSRHVKAIADNVEEELKEAGLKP 61 (99)
T ss_pred EEEEEe-CCHHHHHHHHHHHHHHHHHcCCCc
Confidence 467776 699999999999999999988754
No 52
>PF06200 tify: tify domain; InterPro: IPR010399 The tify domain is a 36-amino acid domain only found among Embryophyta (land plants). It has been named after the most conserved amino acid pattern (TIF[F/Y]XG) it contains, but was previously known as the Zim domain. As the use of uppercase characters (TIFY) might imply that the domain is fully conserved across proteins, a lowercase lettering has been chosen in an attempt to highlight the reality of its natural variability. Based on the domain architecture, tify domain containing proteins can be classified into two groups. Group I is formed by proteins possessing a CCT (CONSTANS, CO-like, and TOC1) domain and a GATA-type zinc finger in addition to the tify domain. Group II contains proteins characterised by the tify domain but lacking a GATA-type zinc finger. Tify domain containing proteins might be involved in developmental processes and some of them have features that are characteristic for transcription factors: a nuclear localisation and the presence of a putative DNA-binding domain []. Some proteins known to contain a tify domain include: Arabidopsis thaliana Zinc-finger protein expressed in Inflorescence Meristem (ZIM), a putative transcription factor involved in inflorescence and flower development [, ]. A. thaliana ZIM-like proteins (ZML) []. A. thaliana PEAPOD1 and PEAPOD2 (PPD1 and PPD2) [].
Probab=29.35 E-value=91 Score=20.49 Aligned_cols=27 Identities=7% Similarity=0.130 Sum_probs=20.8
Q ss_pred ceEEEeecCCcEEEEecCChhHHHHHH
Q psy16701 81 RTTCLAFQSGKLLILGAKHEHDCKLAS 107 (272)
Q Consensus 81 ~~t~lIf~sGKiv~tGakS~~~a~~a~ 107 (272)
....+||..|++.+.-.=+.+.|+.-+
T Consensus 5 ~~qLTIfY~G~V~Vfd~v~~~Ka~~im 31 (36)
T PF06200_consen 5 TAQLTIFYGGQVCVFDDVPPDKAQEIM 31 (36)
T ss_pred CCcEEEEECCEEEEeCCCCHHHHHHHH
Confidence 467899999999999766677666544
No 53
>PRK10996 thioredoxin 2; Provisional
Probab=29.12 E-value=76 Score=25.98 Aligned_cols=28 Identities=18% Similarity=0.255 Sum_probs=21.1
Q ss_pred EEEEecCCeEE--EEcccCHHHHHHHHHHH
Q psy16701 172 TCLAFQSGKLL--ILGAKHEHDCKLASRKF 199 (272)
Q Consensus 172 t~lIF~SGKiv--itGakS~e~~~~A~~ki 199 (272)
++.+|.+|+++ +.|..+.++++..+++.
T Consensus 109 tlii~~~G~~v~~~~G~~~~e~l~~~l~~~ 138 (139)
T PRK10996 109 TIMIFKNGQVVDMLNGAVPKAPFDSWLNEA 138 (139)
T ss_pred EEEEEECCEEEEEEcCCCCHHHHHHHHHHh
Confidence 46778889987 57888888777776653
No 54
>cd02963 TRX_DnaJ TRX domain, DnaJ domain containing protein family; composed of uncharacterized proteins of about 500-800 amino acids, containing an N-terminal DnaJ domain followed by one redox active TRX domain. DnaJ is a member of the 40 kDa heat-shock protein (Hsp40) family of molecular chaperones, which regulate the activity of Hsp70s. TRX is involved in the redox regulation of many protein substrates through the reduction of disulfide bonds. TRX has been implicated to catalyse the reduction of Hsp33, a chaperone holdase that binds to unfolded protein intermediates. The presence of DnaJ and TRX domains in members of this family suggests that they could be involved in a redox-regulated chaperone network.
Probab=27.39 E-value=69 Score=24.96 Aligned_cols=27 Identities=11% Similarity=0.089 Sum_probs=18.9
Q ss_pred EEEEecCCeEEE--EcccCHHHHHHHHHH
Q psy16701 172 TCLAFQSGKLLI--LGAKHEHDCKLASRK 198 (272)
Q Consensus 172 t~lIF~SGKivi--tGakS~e~~~~A~~k 198 (272)
|+.+|..|+++- .|..+.+++...+++
T Consensus 82 t~~i~~~g~~~~~~~G~~~~~~l~~~i~~ 110 (111)
T cd02963 82 AIVGIINGQVTFYHDSSFTKQHVVDFVRK 110 (111)
T ss_pred EEEEEECCEEEEEecCCCCHHHHHHHHhc
Confidence 467778888874 577777776666554
No 55
>PF04461 DUF520: Protein of unknown function (DUF520); InterPro: IPR007551 This entry represents the UPF0234 family of uncharacterised proteins.; PDB: 1IN0_A.
Probab=26.39 E-value=65 Score=28.08 Aligned_cols=143 Identities=15% Similarity=0.226 Sum_probs=64.6
Q ss_pred EEEEeCCccChhhHhhhCCCc------eecCCCcceEEEEEcCCceEEEeecCCcEEEEecCChhHHHHHHHHHHHHHHh
Q psy16701 43 GTISVGCPLDLNQINSRVRYS------EYNPGKFHGLIMKILNPRTTCLAFQSGKLLILGAKHEHDCKLASRKFAKILKQ 116 (272)
Q Consensus 43 ~s~~l~~~ldL~~la~~~~n~------eY~P~~f~~l~~R~~~P~~t~lIf~sGKiv~tGakS~~~a~~a~~k~~~~L~~ 116 (272)
.|+|.-+++|+.++...+.++ .|+=+ .-+++ |=.+++.+..=|.++...+....-+...|-+
T Consensus 2 pSFDIVSevD~qEv~NAv~qa~kEi~~RyDFK----------gs~a~--iel~~~~i~l~a~~e~kl~~v~diL~~kl~K 69 (160)
T PF04461_consen 2 PSFDIVSEVDMQEVDNAVDQANKEIATRYDFK----------GSKAS--IELNKKTITLTAEDEFKLKQVKDILRSKLIK 69 (160)
T ss_dssp -EEEEE----HHHHHHHHHHHHHHHCTBGGGT----------TS-EE--EEEETTEEEEEESSHHHHHHHHHHHHHHHHH
T ss_pred CCcceEEeccHHHHHHHHHHHHHHHhcccccC----------CCceE--EEEcCCEEEEEeCCHHHHHHHHHHHHHHHHH
Confidence 588999999999998766543 44432 22222 2223333333345666666555555555666
Q ss_pred cCCCc-eeee------EEEEEEEEEEeCCCCChh---hHhhcCCCeEEcCCCcceEEEEEcCCeeEEEEecCCeEEEEcc
Q psy16701 117 LGHPE-IIFF------FFRNMVGTISVGCPLDLN---QINSRVRYSEYNPGKFHGLIMKILNPRTTCLAFQSGKLLILGA 186 (272)
Q Consensus 117 lg~~~-~~~~------~I~NVVas~~l~~~ldL~---~la~~~~n~eYePe~fpgli~R~~~p~~t~lIF~SGKivitGa 186 (272)
-|.+. ..++ .=..+--...+..-|+-+ .|.....+. .+++.+.|= .-.+-+||
T Consensus 70 R~i~~k~ld~~k~e~asg~~vrq~i~lk~GI~~d~AKkIvK~IKd~---------------klKVqa~IQ-gd~vRVtg- 132 (160)
T PF04461_consen 70 RGIDLKALDFGKIESASGGTVRQVIKLKQGIDQDTAKKIVKLIKDS---------------KLKVQAQIQ-GDQVRVTG- 132 (160)
T ss_dssp TT--GGGEE--SS-EEETTEEEEEEEE--S--HHHHHHHHHHHHHH-----------------SEEEEEE-TTEEEEEE-
T ss_pred cCCCHHHcCCCCCccccCCEEEEEEEeecccCHHHHHHHHHHHHhc---------------CCceeEEec-CcEEEEec-
Confidence 66652 1111 111222445566666644 222222222 233333332 23577888
Q ss_pred cCHHHHHHHHHHHHHHHHhCCCCCCCcceE
Q psy16701 187 KHEHDCKLASRKFAKILKQLGHPIKYQGFK 216 (272)
Q Consensus 187 kS~e~~~~A~~ki~~iL~klg~~~~~~~f~ 216 (272)
|+.+|++.++..+-.. .+|+|..|.||+
T Consensus 133 KkrDDLQ~viallk~~--d~~~pLQF~NfR 160 (160)
T PF04461_consen 133 KKRDDLQEVIALLKEQ--DLGIPLQFNNFR 160 (160)
T ss_dssp S-HHHHHHHHHHHHHS----SS--EEEEEE
T ss_pred CCHHHHHHHHHHHHcc--cCCCCceeccCC
Confidence 6899999987655432 556777777764
No 56
>cd03065 PDI_b_Calsequestrin_N PDIb family, Calsequestrin subfamily, N-terminal TRX-fold domain; Calsequestrin is the major calcium storage protein in the sarcoplasmic reticulum (SR) of skeletal and cardiac muscle. It stores calcium ions in sufficient quantities (up to 20 mM) to allow repetitive contractions and is essential to maintain movement, respiration and heart beat. A missense mutation in human cardiac calsequestrin is associated with catecholamine-induced polymorphic ventricular tachycardia (CPVT), a rare disease characterized by seizures or sudden death in response to physiologic or emotional stress. Calsequestrin is a highly acidic protein with up to 50 calcium binding sites formed simply by the clustering of two or more acidic residues. The monomer contains three redox inactive TRX-fold domains. Calsequestrin is condensed as a linear polymer in the SR lumen and is membrane-anchored through binding with intra-membrane proteins triadin, junctin and ryanodine receptor (RyR) Ca2
Probab=25.66 E-value=90 Score=25.43 Aligned_cols=29 Identities=7% Similarity=0.043 Sum_probs=20.2
Q ss_pred eEEEeecCCcEE-EEecCChhHHHHHHHHH
Q psy16701 82 TTCLAFQSGKLL-ILGAKHEHDCKLASRKF 110 (272)
Q Consensus 82 ~t~lIf~sGKiv-~tGakS~~~a~~a~~k~ 110 (272)
.|+.+|.+|+++ .+|+.+.++....++++
T Consensus 89 PTl~lfk~G~~v~~~G~~~~~~l~~~l~~~ 118 (120)
T cd03065 89 DSIYVFKDDEVIEYDGEFAADTLVEFLLDL 118 (120)
T ss_pred cEEEEEECCEEEEeeCCCCHHHHHHHHHHH
Confidence 377899999844 55888777666555543
No 57
>PF13192 Thioredoxin_3: Thioredoxin domain; PDB: 1ZYP_B 1ZYN_A 1HYU_A 1ILO_A 1J08_F 2YWM_B 2AYT_B 2HLS_B 1A8L_A 2K8S_B ....
Probab=25.45 E-value=78 Score=23.04 Aligned_cols=21 Identities=19% Similarity=0.336 Sum_probs=15.5
Q ss_pred ecCCeEEEEc-ccCHHHHHHHH
Q psy16701 176 FQSGKLLILG-AKHEHDCKLAS 196 (272)
Q Consensus 176 F~SGKivitG-akS~e~~~~A~ 196 (272)
+-+|+++.+| ..|.++++..+
T Consensus 54 vIng~~~~~G~~p~~~el~~~l 75 (76)
T PF13192_consen 54 VINGKVVFVGRVPSKEELKELL 75 (76)
T ss_dssp EETTEEEEESS--HHHHHHHHH
T ss_pred EECCEEEEEecCCCHHHHHHHh
Confidence 4479999999 88888887765
No 58
>PRK05412 putative nucleotide-binding protein; Reviewed
Probab=24.79 E-value=80 Score=27.56 Aligned_cols=147 Identities=14% Similarity=0.230 Sum_probs=72.5
Q ss_pred EEEeCCccChhhHhhhCCCceecC-CCcceEEEEEcCCceEEEeecCCcEEEEecCChhHHHHHHHHHHHHHHhcCCCc-
Q psy16701 44 TISVGCPLDLNQINSRVRYSEYNP-GKFHGLIMKILNPRTTCLAFQSGKLLILGAKHEHDCKLASRKFAKILKQLGHPE- 121 (272)
Q Consensus 44 s~~l~~~ldL~~la~~~~n~eY~P-~~f~~l~~R~~~P~~t~lIf~sGKiv~tGakS~~~a~~a~~k~~~~L~~lg~~~- 121 (272)
|+|.-+++|+.++-..+.++.=+= .|| -++.-.+++-. +++.+..=|.|+...+....-+...|-+-|.+.
T Consensus 3 SFDIVSevD~qEv~NAv~qa~kEi~~Ry-----DFKgs~a~iel--~~~~i~l~a~~d~kl~~v~diL~~kl~KR~i~~k 75 (161)
T PRK05412 3 SFDIVSEVDLQEVDNAVDQANKEISTRF-----DFKGSKASIEL--KEEEITLTAESDFQLKQVKDILRSKLIKRGIDLK 75 (161)
T ss_pred CcceeeeCcHHHHHHHHHHHHHHHhccc-----ccCCCccEEEE--cCCEEEEEeCCHHHHHHHHHHHHHHHHHcCCCHH
Confidence 677788889888887665532111 011 12222222222 223333333455555544444444555555441
Q ss_pred eee------eEEEEEEEEEEeCCCCChh---hHhhcCCCeEEcCCCcceEEEEEcCCeeEEEEecCCeEEEEcccCHHHH
Q psy16701 122 IIF------FFFRNMVGTISVGCPLDLN---QINSRVRYSEYNPGKFHGLIMKILNPRTTCLAFQSGKLLILGAKHEHDC 192 (272)
Q Consensus 122 ~~~------~~I~NVVas~~l~~~ldL~---~la~~~~n~eYePe~fpgli~R~~~p~~t~lIF~SGKivitGakS~e~~ 192 (272)
..+ ..=.++=-...+..-|+-+ .|.....+. ..++.+.| ..-+|-+|| |+.+|+
T Consensus 76 ~ld~~~~e~~sG~~vrq~i~lk~GI~~e~AKkIvK~IKd~---------------klKVqa~I-QGd~vRVtg-KkrDDL 138 (161)
T PRK05412 76 ALDYGKVEKASGKTVKQEVKLKQGIDQELAKKIVKLIKDS---------------KLKVQAQI-QGDQVRVTG-KKRDDL 138 (161)
T ss_pred HcCCCCccccCCCEEEEEEehhhccCHHHHHHHHHHHHhc---------------CCceeEEe-cCcEEEEec-CCHhHH
Confidence 111 1112233444555555543 233323332 23333333 233577888 689999
Q ss_pred HHHHHHHHHHHHhCCCCCCCcceE
Q psy16701 193 KLASRKFAKILKQLGHPIKYQGFK 216 (272)
Q Consensus 193 ~~A~~ki~~iL~klg~~~~~~~f~ 216 (272)
+.++..+-. +.+|+|..|.||+
T Consensus 139 Q~viallk~--~d~~~pLQF~NfR 160 (161)
T PRK05412 139 QAVIALLRK--ADLGQPLQFNNFR 160 (161)
T ss_pred HHHHHHHHh--ccCCCCceeccCC
Confidence 998776543 2567788887775
No 59
>KOG0910|consensus
Probab=24.64 E-value=97 Score=26.69 Aligned_cols=74 Identities=24% Similarity=0.377 Sum_probs=46.5
Q ss_pred CCcceEEEEEEEEEeCCcc---ChhhHhhhCCCceecCCCcceEEEEEcC-C----------ceEEEeecCCcEE--EEe
Q psy16701 33 NRRGTNLNMVGTISVGCPL---DLNQINSRVRYSEYNPGKFHGLIMKILN-P----------RTTCLAFQSGKLL--ILG 96 (272)
Q Consensus 33 ~~~~~i~NvV~s~~l~~~l---dL~~la~~~~n~eY~P~~f~~l~~R~~~-P----------~~t~lIf~sGKiv--~tG 96 (272)
-+.|++++.-|.===.|+. -|++++. +| -.+|.-..+++.+ | -.|+++|.+|.-+ ..|
T Consensus 60 S~~PVlVdF~A~WCgPCk~l~P~l~~~~~-----~~-~g~~k~~kvdtD~~~ela~~Y~I~avPtvlvfknGe~~d~~vG 133 (150)
T KOG0910|consen 60 SDVPVLVDFHAEWCGPCKMLGPILEELVS-----EY-AGKFKLYKVDTDEHPELAEDYEISAVPTVLVFKNGEKVDRFVG 133 (150)
T ss_pred cCCCEEEEEecCcCccHhHhhHHHHHHHH-----hh-cCeEEEEEEccccccchHhhcceeeeeEEEEEECCEEeeeecc
Confidence 4567777776654444544 2334443 33 1233333333332 1 3589999999987 889
Q ss_pred cCChhHHHHHHHHHHH
Q psy16701 97 AKHEHDCKLASRKFAK 112 (272)
Q Consensus 97 akS~~~a~~a~~k~~~ 112 (272)
+...+..+..++|+.+
T Consensus 134 ~~~~~~l~~~i~k~l~ 149 (150)
T KOG0910|consen 134 AVPKEQLRSLIKKFLK 149 (150)
T ss_pred cCCHHHHHHHHHHHhc
Confidence 9999999999998853
No 60
>cd02948 TRX_NDPK TRX domain, TRX and NDP-kinase (NDPK) fusion protein family; most members of this group are fusion proteins which contain one redox active TRX domain containing a CXXC motif and three NDPK domains, and are characterized as intermediate chains (ICs) of axonemal outer arm dynein. Dyneins are molecular motors that generate force against microtubules to produce cellular movement, and are divided into two classes: axonemal and cytoplasmic. They are supramolecular complexes consisting of three protein groups classified according to size: dynein heavy, intermediate and light chains. Axonemal dyneins form two structures, the inner and outer arms, which are attached to doublet microtubules throughout the cilia and flagella. The human homolog is the sperm-specific Sptrx-2, presumed to be a component of the human sperm axoneme architecture. Included in this group is another human protein, TRX-like protein 2, a smaller fusion protein containing one TRX and one NDPK domain, which
Probab=24.41 E-value=1e+02 Score=23.46 Aligned_cols=26 Identities=27% Similarity=0.365 Sum_probs=19.3
Q ss_pred EEEEecCCeEE--EEcccCHHHHHHHHHH
Q psy16701 172 TCLAFQSGKLL--ILGAKHEHDCKLASRK 198 (272)
Q Consensus 172 t~lIF~SGKiv--itGakS~e~~~~A~~k 198 (272)
|+.+|.+|+.+ +.|+ +.+++++++++
T Consensus 74 t~~~~~~g~~~~~~~G~-~~~~~~~~i~~ 101 (102)
T cd02948 74 TFLFYKNGELVAVIRGA-NAPLLNKTITE 101 (102)
T ss_pred EEEEEECCEEEEEEecC-ChHHHHHHHhh
Confidence 67889999976 6776 66777766654
No 61
>PRK09381 trxA thioredoxin; Provisional
Probab=24.04 E-value=90 Score=23.68 Aligned_cols=28 Identities=25% Similarity=0.233 Sum_probs=19.9
Q ss_pred EEEeecCCcEE--EEecCChhHHHHHHHHH
Q psy16701 83 TCLAFQSGKLL--ILGAKHEHDCKLASRKF 110 (272)
Q Consensus 83 t~lIf~sGKiv--~tGakS~~~a~~a~~k~ 110 (272)
|+.+|.+|+++ ..|..+.++.+..+++.
T Consensus 78 t~~~~~~G~~~~~~~G~~~~~~l~~~i~~~ 107 (109)
T PRK09381 78 TLLLFKNGEVAATKVGALSKGQLKEFLDAN 107 (109)
T ss_pred EEEEEeCCeEEEEecCCCCHHHHHHHHHHh
Confidence 67888999988 56877776665555443
No 62
>cd00148 PROF Profilin binds actin monomers, membrane polyphosphoinositides such as PI(4,5)P2, and poly-L-proline. Profilin can inhibit actin polymerization into F-actin by binding to monomeric actin (G-actin) and terminal F-actin subunits, but - as a regulator of the cytoskeleton - it may also promote actin polymerization. It plays a role in the assembly of branched actin filament networks, by activating WASP via binding to WASP's proline rich domain. Profilin may link the cytoskeleton with major signalling pathways by interacting with components of the phosphatidylinositol cycle and Ras pathway.
Probab=23.76 E-value=2.3e+02 Score=22.98 Aligned_cols=44 Identities=16% Similarity=0.184 Sum_probs=33.9
Q ss_pred EEcCCceEEEeecCCcEEEEecCCh----hHHHHHHHHHHHHHHhcCC
Q psy16701 76 KILNPRTTCLAFQSGKLLILGAKHE----HDCKLASRKFAKILKQLGH 119 (272)
Q Consensus 76 R~~~P~~t~lIf~sGKiv~tGakS~----~~a~~a~~k~~~~L~~lg~ 119 (272)
+-+..+.-+.+.+++..++.|.-.+ ..+..++.++++.|++.|+
T Consensus 80 ~~kk~~~Gi~i~kT~~~ivi~~y~e~~~~g~~~~~v~~ladYL~~~gy 127 (127)
T cd00148 80 YGKKGAGGVVIVKTKQALVIGMYEEGVQPGQANKVVEKLADYLRSQGY 127 (127)
T ss_pred EeeeCCCeEEEEECCCEEEEEEcCCCCCHHHHHHHHHHHHHHHHHcCC
Confidence 3344456677888888888887654 4788999999999999885
No 63
>PF07960 CBP4: CBP4; InterPro: IPR012420 The CBP4 gene in Saccharomyces cerevisiae is essential for the expression and activity of ubiquinol-cytochrome c reductase [, ]. This family appears to be fungal specific.
Probab=23.50 E-value=25 Score=29.54 Aligned_cols=18 Identities=33% Similarity=0.442 Sum_probs=14.6
Q ss_pred ccccceeecCCeeeEEEE
Q psy16701 10 YGIGSACIGGGQACFCLI 27 (272)
Q Consensus 10 ~~~~~~~~~~~~~~~~~~ 27 (272)
|+.|+++||||-+++-.+
T Consensus 12 ~~~G~~ii~~G~~l~~y~ 29 (128)
T PF07960_consen 12 LVAGAVIIGGGPALVKYT 29 (128)
T ss_pred HHhcceeEeechHHheec
Confidence 567899999999987554
No 64
>KOG4180|consensus
Probab=23.03 E-value=89 Score=30.53 Aligned_cols=71 Identities=15% Similarity=0.262 Sum_probs=49.6
Q ss_pred EEecCCeEEEEcccCH-------HHHHHHHHHHHHHHHhCCCC-CCCcceEEEEEEEEeecCCccChHHHHhHcCCCcee
Q psy16701 174 LAFQSGKLLILGAKHE-------HDCKLASRKFAKILKQLGHP-IKYQGFKIHNIVCTCDVRFPVKLDALHHVHSQFSSY 245 (272)
Q Consensus 174 lIF~SGKivitGakS~-------e~~~~A~~ki~~iL~klg~~-~~~~~f~V~NIvas~d~~f~I~Le~la~~~~~~~~Y 245 (272)
.-=+||-++|||.-|. .-+.+|+..++.++...|-. +-+ . .| ..|.++..+.+...|
T Consensus 256 KqKssgl~vctgTGstsw~~~iNria~q~v~d~l~~l~~~~~~~vp~---~-Re-----------~ve~i~~~~nq~llF 320 (395)
T KOG4180|consen 256 KQKSSGLVVCTGTGSTSWTFNINRIAEQAVGDLLMILLSRDNLQVPF---M-RE-----------LVEEISTAYNQHLLF 320 (395)
T ss_pred cccCCCeeEecCCCcceEeecccHHHHHHHHHHHHHHHhcCcccchh---h-hh-----------hhHHHHHHhhhcCcc
Confidence 5568999999886553 35678888899998888643 222 1 11 236667777788999
Q ss_pred ccccCceeEEEEcCC
Q psy16701 246 EPELFPGLIYRMVKP 260 (272)
Q Consensus 246 EPE~fPGLvyR~~~p 260 (272)
+|+ .|-|-|-+.+|
T Consensus 321 ~PD-~p~l~fSiRep 334 (395)
T KOG4180|consen 321 KPD-RPQLAFSIREP 334 (395)
T ss_pred CCC-Ccchhhhhhhh
Confidence 999 67777776665
No 65
>PF00085 Thioredoxin: Thioredoxin; InterPro: IPR013766 Thioredoxins [, , , ] are small disulphide-containing redox proteins that have been found in all the kingdoms of living organisms. Thioredoxin serves as a general protein disulphide oxidoreductase. It interacts with a broad range of proteins by a redox mechanism based on reversible oxidation of two cysteine thiol groups to a disulphide, accompanied by the transfer of two electrons and two protons. The net result is the covalent interconversion of a disulphide and a dithiol. In the NADPH-dependent protein disulphide reduction, thioredoxin reductase (TR) catalyses the reduction of oxidised thioredoxin (trx) by NADPH using FAD and its redox-active disulphide; reduced thioredoxin then directly reduces the disulphide in the substrate protein []. Thioredoxin is present in prokaryotes and eukaryotes and the sequence around the redox-active disulphide bond is well conserved. All thioredoxins contain a cis-proline located in a loop preceding beta-strand 4, which makes contact with the active site cysteines, and is important for stability and function []. Thioredoxin belongs to a structural family that includes glutaredoxin, glutathione peroxidase, bacterial protein disulphide isomerase DsbA, and the N-terminal domain of glutathione transferase []. Thioredoxins have a beta-alpha unit preceding the motif common to all these proteins. A number of eukaryotic proteins contain domains evolutionary related to thioredoxin, most of them are protein disulphide isomerases (PDI). PDI (5.3.4.1 from EC) [, , ] is an endoplasmic reticulum multi-functional enzyme that catalyses the formation and rearrangement of disulphide bonds during protein folding []. All PDI contains two or three (ERp72) copies of the thioredoxin domain, each of which contributes to disulphide isomerase activity, but which are functionally non-equivalent []. Moreover, PDI exhibits chaperone-like activity towards proteins that contain no disulphide bonds, i.e. behaving independently of its disulphide isomerase activity []. The various forms of PDI which are currently known are: PDI major isozyme; a multifunctional protein that also function as the beta subunit of prolyl 4-hydroxylase (1.14.11.2 from EC), as a component of oligosaccharyl transferase (2.4.1.119 from EC), as thyroxine deiodinase (3.8.1.4 from EC), as glutathione-insulin transhydrogenase (1.8.4.2 from EC) and as a thyroid hormone-binding protein ERp60 (ER-60; 58 Kd microsomal protein). ERp60 was originally thought to be a phosphoinositide-specific phospholipase C isozyme and later to be a protease. ERp72. ERp5. Bacterial proteins that act as thiol:disulphide interchange proteins that allows disulphide bond formation in some periplasmic proteins also contain a thioredoxin domain. These proteins include: Escherichia coli DsbA (or PrfA) and its orthologs in Vibrio cholerae (TtcpG) and Haemophilus influenzae (Por). E. coli DsbC (or XpRA) and its orthologues in Erwinia chrysanthemi and H. influenzae. E. coli DsbD (or DipZ) and its H. influenzae orthologue. E. coli DsbE (or CcmG) and orthologues in H. influenzae. Rhodobacter capsulatus (Rhodopseudomonas capsulata) (HelX), Rhiziobiacae (CycY and TlpA). This entry represents the thioredoxin domain.; GO: 0045454 cell redox homeostasis; PDB: 3ED3_B 1EP7_A 1EP8_B 1TOF_A 2OE3_B 2OE1_B 2OE0_B 1V98_A 3H79_A 3CXG_A ....
Probab=22.88 E-value=1.7e+02 Score=21.27 Aligned_cols=27 Identities=22% Similarity=0.263 Sum_probs=20.6
Q ss_pred EEEEecCCeEE--EEcccCHHHHHHHHHH
Q psy16701 172 TCLAFQSGKLL--ILGAKHEHDCKLASRK 198 (272)
Q Consensus 172 t~lIF~SGKiv--itGakS~e~~~~A~~k 198 (272)
++.+|.+|+.+ +.|..+.+++...+++
T Consensus 74 t~~~~~~g~~~~~~~g~~~~~~l~~~i~~ 102 (103)
T PF00085_consen 74 TIIFFKNGKEVKRYNGPRNAESLIEFIEK 102 (103)
T ss_dssp EEEEEETTEEEEEEESSSSHHHHHHHHHH
T ss_pred EEEEEECCcEEEEEECCCCHHHHHHHHHc
Confidence 46677777776 5888899988887764
No 66
>TIGR00090 iojap_ybeB iojap-like ribosome-associated protein. This model describes a widely distributed family of bacterial proteins related to iojap from plants. It includes YbeB from E. coli. The gene iojap is a pattern-striping gene in maize, reflecting a chloroplast development defect in some cells. More recent work in bacteria suggests that the bacterial iojap-related protein physically associates with ribosomes. The function remains unknown.
Probab=22.86 E-value=1.5e+02 Score=23.30 Aligned_cols=31 Identities=16% Similarity=0.172 Sum_probs=25.4
Q ss_pred CCcEEEEecCChhHHHHHHHHHHHHHHhcCCC
Q psy16701 89 SGKLLILGAKHEHDCKLASRKFAKILKQLGHP 120 (272)
Q Consensus 89 sGKiv~tGakS~~~a~~a~~k~~~~L~~lg~~ 120 (272)
.--|+||| +|..+.+..++.+.+.+++.|.+
T Consensus 30 dy~VI~Tg-~S~rh~~aia~~v~~~~k~~~~~ 60 (99)
T TIGR00090 30 DYFVIASG-TSSRHVKAIADNVEEELKEAGLK 60 (99)
T ss_pred CEEEEEEe-CCHHHHHHHHHHHHHHHHHcCCC
Confidence 44577887 69999999999999999888764
No 67
>PF03135 CagE_TrbE_VirB: CagE, TrbE, VirB family, component of type IV transporter system; InterPro: IPR018145 This domain is found in (amongst others): the Helicobacter pylori protein CagE (see examples), which together with other proteins from the cag pathogenicity island (PAI), encodes a type IV transporter secretion system. The precise role of CagE is not known, but studies in animal models have shown that it is essential for pathogenesis in Helicobacter pylori induced gastritis and peptic ulceration []. Indeed, the expression of the cag PAI has been shown to be essential for stimulating human gastric epithelial cell apoptosis in vitro []. Similar type IV transport systems are also found in other bacteria. This domain is also found in proteins from the trb and Vir conjugal transfer systems in Agrobacterium tumefaciens and homologues of VirB proteins from other species.; GO: 0005524 ATP binding
Probab=22.42 E-value=1e+02 Score=26.71 Aligned_cols=30 Identities=13% Similarity=0.246 Sum_probs=25.1
Q ss_pred EEEEcccCHHHHHHHHHHHHHHHHhCCCCCC
Q psy16701 181 LLILGAKHEHDCKLASRKFAKILKQLGHPIK 211 (272)
Q Consensus 181 ivitGakS~e~~~~A~~ki~~iL~klg~~~~ 211 (272)
|++. ++|++++++.++++...|...|+.+.
T Consensus 153 i~v~-~~~~~~l~~~~~~v~~~l~~~G~~~~ 182 (205)
T PF03135_consen 153 IVVF-ADDPEELDDKVAEVSSALNNLGFVAV 182 (205)
T ss_pred EEEE-cCCHHHHHHHHHHHHHHHHHCCCEEE
Confidence 4444 58999999999999999999998654
No 68
>smart00392 PROF Profilin. Binds actin monomers, membrane polyphosphoinositides and poly-L-proline.
Probab=21.71 E-value=3.1e+02 Score=22.20 Aligned_cols=42 Identities=19% Similarity=0.253 Sum_probs=33.9
Q ss_pred cCCceEEEeecCCcEEEEecCCh----hHHHHHHHHHHHHHHhcCC
Q psy16701 78 LNPRTTCLAFQSGKLLILGAKHE----HDCKLASRKFAKILKQLGH 119 (272)
Q Consensus 78 ~~P~~t~lIf~sGKiv~tGakS~----~~a~~a~~k~~~~L~~lg~ 119 (272)
+.++.-+.|.++...++.|.-.+ .++..++.++++.|++.|+
T Consensus 84 kk~~~Gv~i~kT~~aivI~~y~e~~~~g~~~~~v~~ladYL~~~Gy 129 (129)
T smart00392 84 KKGAGGVVIVKTKQALIIGMYKEGVQPGQANKTVEKLADYLRSSGY 129 (129)
T ss_pred ecCCceEEEEECCCEEEEEECCCCCChHHHHHHHHHHHHHHHHcCC
Confidence 33566788888888888887655 5889999999999999885
No 69
>PF05005 Ocnus: Janus/Ocnus family (Ocnus); InterPro: IPR007702 This family is comprised of the Ocnus, Janus-A and Janus-B proteins. These proteins have been found to be testes specific in Drosophila melanogaster [].; PDB: 2OZX_A 2OZW_A 2NMM_C 2AI6_A 2HW4_A.
Probab=20.99 E-value=1.5e+02 Score=24.01 Aligned_cols=56 Identities=16% Similarity=0.212 Sum_probs=40.9
Q ss_pred CCCeEEc-CCCcceEEEEEcCCeeEEEEecCCeEEEEcccCHHHHHHHHHHHHHHHHhCCCC
Q psy16701 149 VRYSEYN-PGKFHGLIMKILNPRTTCLAFQSGKLLILGAKHEHDCKLASRKFAKILKQLGHP 209 (272)
Q Consensus 149 ~~n~eYe-Pe~fpgli~R~~~p~~t~lIF~SGKivitGakS~e~~~~A~~ki~~iL~klg~~ 209 (272)
.+.++=+ +-.|.-+.+|+..+. -...|.++-|.+..+-=.+.++++.+.|+++|+.
T Consensus 4 vP~V~id~~G~fKYvLi~v~~~~-----~~~~k~iVRG~~~~~yH~di~d~~~~el~~~gl~ 60 (108)
T PF05005_consen 4 VPDVDIDPEGVFKYVLIRVTDHG-----SGESKYIVRGYKRAEYHADIYDEVQEELEKLGLC 60 (108)
T ss_dssp S-SEEEESSSEEEEEEEEEES-G--------EEEEEEEETT-SSHHHHHHHHHHHHHHCTEE
T ss_pred CCeEEecCCceEEEEEEEEEeCC-----CCEEEEEEECCcchhhHHHHHHHHHHHHHHcCCe
Confidence 4556666 457888888888776 5678999999988777778899999999999864
No 70
>COG3445 Acid-induced glycyl radical enzyme [General function prediction only]
Probab=20.77 E-value=31 Score=27.98 Aligned_cols=51 Identities=14% Similarity=0.218 Sum_probs=35.5
Q ss_pred EeCCCCChhhHhhc-CCCeEEcCCCcceEEEEEcCCeeEEEEecCCeEEEEcccCHHHHHHHH
Q psy16701 135 SVGCPLDLNQINSR-VRYSEYNPGKFHGLIMKILNPRTTCLAFQSGKLLILGAKHEHDCKLAS 196 (272)
Q Consensus 135 ~l~~~ldL~~la~~-~~n~eYePe~fpgli~R~~~p~~t~lIF~SGKivitGakS~e~~~~A~ 196 (272)
.=|.++|..-|... ++++.-+||.||-|+.|.. |.-|=--+-++|+-+..+
T Consensus 68 eggqhlnvnvl~retledav~~pekypqltirvs-----------gyavrfnsltpeqqrdvi 119 (127)
T COG3445 68 EGGQHLNVNVLRRETLEDAVKHPEKYPQLTIRVS-----------GYAVRFNSLTPEQQRDVI 119 (127)
T ss_pred ccCceeeeeeeehhhHHHHhhCcccCCceEEEEe-----------eEEEEeccCCHHHhhhHH
Confidence 34778888877754 6888999999999998864 444444445566665554
No 71
>COG0100 RpsK Ribosomal protein S11 [Translation, ribosomal structure and biogenesis]
Probab=20.11 E-value=4.4e+02 Score=22.19 Aligned_cols=52 Identities=23% Similarity=0.322 Sum_probs=41.3
Q ss_pred CcceEEEEEcCCceEEEee-cCCcEEEEecC--ChhHHHHHHHHHHHHHHhcCCC
Q psy16701 69 KFHGLIMKILNPRTTCLAF-QSGKLLILGAK--HEHDCKLASRKFAKILKQLGHP 120 (272)
Q Consensus 69 ~f~~l~~R~~~P~~t~lIf-~sGKiv~tGak--S~~~a~~a~~k~~~~L~~lg~~ 120 (272)
-|+--++-+.++.-.+++| ++|.+..-|.+ ++-.|-.|++++++..++.|+.
T Consensus 26 sfNNTivtitD~~Gn~i~wassG~~gfk~~rk~tpyAA~~aa~~aa~~a~e~Gi~ 80 (129)
T COG0100 26 SFNNTIVTITDLTGNVIIWASSGGMGFKGSRKSTPYAAQLAAEDAAKKAKEHGIK 80 (129)
T ss_pred ccCCcEEEecCCCCCEEEEEecCCceEcCCCCCCHHHHHHHHHHHHHHHHHhCcc
Confidence 5555667888888888888 78999999987 2445667899999998888875
Done!