Query psy6071
Match_columns 185
No_of_seqs 94 out of 96
Neff 4.4
Searched_HMMs 46136
Date Fri Aug 16 16:55:22 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy6071.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/6071hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG4508|consensus 100.0 1.5E-46 3.2E-51 341.6 6.1 171 2-184 256-441 (522)
2 PF10155 DUF2363: Uncharacteri 99.8 4.4E-20 9.6E-25 145.7 3.5 60 126-185 1-60 (126)
3 PF09030 Creb_binding: Creb bi 57.3 8.4 0.00018 30.1 2.1 33 79-112 54-86 (104)
4 PF13490 zf-HC2: Putative zinc 53.7 24 0.00052 21.2 3.4 29 79-107 2-30 (36)
5 PF07637 PSD5: Protein of unkn 53.3 23 0.00051 24.5 3.7 56 77-147 3-59 (64)
6 PF14327 CSTF2_hinge: Hinge do 53.2 21 0.00046 26.1 3.6 57 81-144 15-71 (84)
7 cd08318 Death_NMPP84 Death dom 46.3 24 0.00052 25.7 3.0 46 91-136 3-49 (86)
8 PF05819 NolX: NolX protein; 43.4 77 0.0017 31.4 6.6 84 52-136 246-339 (624)
9 PF10303 DUF2408: Protein of u 39.1 1.7E+02 0.0036 23.4 7.0 89 80-168 19-129 (134)
10 PF11212 DUF2999: Protein of u 39.0 52 0.0011 24.6 3.7 48 91-138 11-65 (82)
11 PF04391 DUF533: Protein of un 38.9 1.3E+02 0.0029 25.4 6.7 37 92-137 112-148 (188)
12 COG2979 Uncharacterized protei 37.4 1.5E+02 0.0032 26.2 6.8 82 79-182 130-213 (225)
13 COG4691 StbC Plasmid stability 36.7 26 0.00056 26.2 1.8 44 76-126 27-70 (80)
14 PF08278 DnaG_DnaB_bind: DNA p 35.0 58 0.0013 24.5 3.6 29 77-105 99-127 (127)
15 cd01670 Death Death Domain: a 34.3 55 0.0012 22.3 3.1 60 99-158 3-70 (79)
16 cd08780 Death_TRADD Death Doma 30.6 1E+02 0.0022 23.6 4.3 61 94-154 1-72 (90)
17 PF13833 EF-hand_8: EF-hand do 28.7 1.3E+02 0.0029 18.8 4.1 32 78-110 7-39 (54)
18 PF04296 DUF448: Protein of un 27.9 85 0.0018 22.5 3.3 28 78-105 51-78 (78)
19 PF13919 ASXH: Asx homology do 27.6 1.2E+02 0.0026 24.4 4.4 63 79-146 33-103 (138)
20 TIGR02501 type_III_yscE type I 27.4 1.3E+02 0.0027 21.5 4.1 29 78-106 22-50 (67)
21 smart00005 DEATH DEATH domain, 26.9 85 0.0018 21.9 3.1 51 94-144 4-60 (88)
22 PF04218 CENP-B_N: CENP-B N-te 26.2 96 0.0021 20.6 3.1 39 91-129 5-46 (53)
23 PF13475 DUF4116: Domain of un 26.0 1.2E+02 0.0027 18.7 3.5 10 126-135 23-32 (49)
24 PF08484 Methyltransf_14: C-me 24.6 50 0.0011 26.8 1.8 35 97-131 56-103 (160)
25 PF13767 DUF4168: Domain of un 23.7 1.6E+02 0.0034 20.9 4.1 31 94-124 32-63 (78)
26 PF03564 DUF1759: Protein of u 23.5 3.3E+02 0.0072 20.6 7.9 49 79-127 9-61 (145)
27 cd08319 Death_RAIDD Death doma 23.5 1.4E+02 0.0031 21.9 3.8 49 94-142 1-54 (83)
28 PF13022 HTH_Tnp_1_2: Helix-tu 23.2 1.4E+02 0.0031 24.5 4.1 45 86-130 4-59 (142)
29 PRK10455 periplasmic protein; 22.9 2.3E+02 0.0049 23.3 5.3 53 77-136 61-115 (161)
30 cd02434 Nodulin-21_like_3 Nodu 22.0 2.5E+02 0.0055 24.0 5.7 54 77-134 79-138 (225)
31 PRK10715 flk flagella biosynth 21.6 84 0.0018 29.2 2.8 46 81-126 229-274 (335)
32 PF10925 DUF2680: Protein of u 21.5 1.6E+02 0.0035 20.5 3.6 38 82-119 21-59 (59)
33 PF11198 DUF2857: Protein of u 21.1 63 0.0014 26.8 1.8 33 93-125 2-35 (180)
34 PF11239 DUF3040: Protein of u 20.9 1.2E+02 0.0027 21.7 3.1 25 92-117 2-31 (82)
35 PRK13266 Thf1-like protein; Re 20.9 2.3E+02 0.0049 24.9 5.2 86 38-129 13-117 (225)
36 PF11103 DUF2887: Protein of u 20.6 1.2E+02 0.0027 25.9 3.4 32 76-107 151-182 (200)
37 KOG2054|consensus 20.6 2.3E+02 0.005 30.3 5.9 109 33-153 237-358 (1121)
38 PF01191 RNA_pol_Rpb5_C: RNA p 20.4 63 0.0014 23.6 1.4 32 92-133 14-45 (74)
39 TIGR03060 PS_II_psb29 photosys 20.2 91 0.002 27.2 2.6 67 38-105 13-98 (214)
No 1
>KOG4508|consensus
Probab=100.00 E-value=1.5e-46 Score=341.63 Aligned_cols=171 Identities=35% Similarity=0.559 Sum_probs=163.6
Q ss_pred CCCccchhhHHHHHHHHHhcCCCCCCCCCCCCccccCCC---------CCccccCCchh------hhhhcCCCCCCcccc
Q psy6071 2 SHSNFNEQTAYKVAEELLTKTSGPGQLNHSSPLVNQYYK---------PEFLRLVPPLH------LIWLNLVDPSEHKIA 66 (185)
Q Consensus 2 ~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~f~---------P~flRp~Ppl~------l~Wlnp~~~~~h~~~ 66 (185)
++|.|| ++.+|++++.+| +.+...++|. |+||||.||.+ +.|++|+.+ .|++.
T Consensus 256 ~~pefd--vat~V~~~m~SG---------d~~e~~s~F~~~~t~~~~~p~~IrP~pp~~~~c~del~w~~ps~p-~hev~ 323 (522)
T KOG4508|consen 256 VDPEFD--VATPVIYSMCSG---------DQEELMSAFDELETIAEYQPDLIRPEPPELNDCDDELSWSSPSAP-PHEVS 323 (522)
T ss_pred CCCccc--cccccchhhcCC---------CchHHHHHHhhhhhhhhcCccccCCCCcccCchHhHhhccCCCCC-ccccc
Confidence 357788 678999999999 8999998888 99999999999 899999887 49999
Q ss_pred ccCCccccCCchHHHHHHHHHHHhCcCCHHHHHHHHHHHhcCcchhhhcCCCCCchhHHhhhChHHHHHHHHHHhcCCCh
Q psy6071 67 YDSTMCVSNSAGVEAKKLMTRAFKSALTLQQQQHLLAELQHDSKLVYHIGLTPAKLPDLVENNPLIAIEVLLKLIQSNHI 146 (185)
Q Consensus 67 wD~sm~~~~~~~~e~r~Lm~kA~~~~L~~~qqq~ll~~L~~Dpklv~~~gltp~kLP~LVEnNP~IAievLlkL~~s~~i 146 (185)
||..||++.+.|.++|+++.|++|++|+..+|.+++-+|.+|||+|+|||+||+|||+|||+||.+|+|+|+|+++|.+|
T Consensus 324 wD~~mc~d~s~Gv~lkd~~ikglk~tlstt~qsdmm~el~~dPklv~h~gvtp~klp~lVe~Np~~A~e~L~kl~~S~qi 403 (522)
T KOG4508|consen 324 WDLEMCLDISDGVDLKDEVIKGLKGTLSTTSQSDMMIELVTDPKLVLHAGVTPEKLPSLVETNPKFAGEFLVKLALSEQI 403 (522)
T ss_pred cchhhcccccCcccHHHHHHHHccCCCCCccHHHHHHHHhcCcceeEecccChhhhHHHHHcChHHHHHHHHHHhhHHHH
Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred hHHHHHHhhhhhhhhHHHHHHHHHHhccccccchhccc
Q psy6071 147 TEYFSVLVNMEMSLHSMEVVNSVLVNMEMSLHSMEVVN 184 (185)
Q Consensus 147 ~eY~~~Lv~md~SLhSmEVVNrLtt~v~LP~eFi~v~~ 184 (185)
++||.+|.+||||||||||||||+|.|++|.+|||+.+
T Consensus 404 ~~Y~~~l~nmdmslHSmEvVnrLttav~lp~~fi~~yi 441 (522)
T KOG4508|consen 404 KRYMRLLENMDMSLHSMEVVNRLTTAVILPDKFIQEYI 441 (522)
T ss_pred HHHHHHHhcccchhHHHHHHHHHhhhhcCCHHHHHHHH
Confidence 99999999999999999999999999999999999754
No 2
>PF10155 DUF2363: Uncharacterized conserved protein (DUF2363); InterPro: IPR019312 This entry represents a region of 120 amino acids in proteins conserved from plants to humans. Their function is not known.
Probab=99.79 E-value=4.4e-20 Score=145.72 Aligned_cols=60 Identities=48% Similarity=0.722 Sum_probs=58.2
Q ss_pred hhhChHHHHHHHHHHhcCCChhHHHHHHhhhhhhhhHHHHHHHHHHhccccccchhcccC
Q psy6071 126 VENNPLIAIEVLLKLIQSNHITEYFSVLVNMEMSLHSMEVVNSVLVNMEMSLHSMEVVNR 185 (185)
Q Consensus 126 VEnNP~IAievLlkL~~s~~i~eY~~~Lv~md~SLhSmEVVNrLtt~v~LP~eFi~v~~~ 185 (185)
|||||.||+|+|.++++++++++|||+|++||+|+|||||||||++++++|+||+|.+++
T Consensus 1 ienNp~IA~~~l~~l~~s~~~~~yld~lv~~~~sl~s~EvVn~L~~~~~~p~efl~~yI~ 60 (126)
T PF10155_consen 1 IENNPNIAIEILVKLINSPNFKEYLDVLVSMDMSLHSMEVVNRLTTSFSLPQEFLHMYIS 60 (126)
T ss_pred CCCcHHHHHHHHHHHcCCchHHHHHHHHHcCCCchhHHHHHHHHHcCCCCcHHHHHHHHH
Confidence 799999999999999999999999999999999999999999999999999999999863
No 3
>PF09030 Creb_binding: Creb binding; InterPro: IPR014744 This entry represents the interlocking domain of the eukaryotic nuclear receptor coactivators CREBP and p300. The interlocking domain forms a 3-helical non-globular array that forms interlocked heterodimers with its target. Nuclear receptors are ligand-activated transcription factors involved in the regulation of many processes, including development, reproduction and homeostasis. Nuclear receptor coactivators act to modulate the function of nuclear receptors. Coactivators associate with promoters and enhancers primarily through protein-protein contacts to facilitate the interaction between DNA-bound transcription factors and the transcription machinery. Many of these coactivators are structurally related, including CBP (CREB-binding protein) and p300 []. CBP and p300 both have histone acetyltransferase activity (2.3.1.48 from EC). CBP/p300 proteins function synergistically to activate transcription, acting to remodel chromatin and to recruit RNA polymerase II and the basal transcription machinery. CBP is required for proper cell cycle control, differentiation and apoptosis. The interaction of CBP/p300 with transcription factors involves several small domains. The IBiD domain in the C-terminal of CBP is responsible for CBP interaction with IRF-3, as well as with the adenoviral oncoprotein E1A, TIF-2 coactivator, and the IRF homologue KSHV IRF-1 []. ; GO: 0003713 transcription coactivator activity, 0004402 histone acetyltransferase activity, 0006355 regulation of transcription, DNA-dependent, 0016573 histone acetylation, 0000123 histone acetyltransferase complex, 0005634 nucleus; PDB: 2KKJ_A 2C52_A 1JJS_A 2L14_A 1KBH_B 1ZOQ_C.
Probab=57.31 E-value=8.4 Score=30.14 Aligned_cols=33 Identities=33% Similarity=0.476 Sum_probs=27.1
Q ss_pred HHHHHHHHHHHhCcCCHHHHHHHHHHHhcCcchh
Q psy6071 79 VEAKKLMTRAFKSALTLQQQQHLLAELQHDSKLV 112 (185)
Q Consensus 79 ~e~r~Lm~kA~~~~L~~~qqq~ll~~L~~Dpklv 112 (185)
..+.+|| +.+|++=|+.+||++++-|+++|.|.
T Consensus 54 ~ALQ~LL-rTLkSPsSP~QQQQVLnILkSNPqLM 86 (104)
T PF09030_consen 54 QALQQLL-RTLKSPSSPQQQQQVLNILKSNPQLM 86 (104)
T ss_dssp CHHHHHH-HCHCSSSSCHHHHHHHHHHHTSHHHH
T ss_pred HHHHHHH-HHhcCCCCHHHHHHHHHHHhhCHHHH
Confidence 3455554 67999999999999999999988754
No 4
>PF13490 zf-HC2: Putative zinc-finger; PDB: 2Z2S_F 2Q1Z_B 3HUG_T.
Probab=53.71 E-value=24 Score=21.18 Aligned_cols=29 Identities=21% Similarity=0.349 Sum_probs=22.2
Q ss_pred HHHHHHHHHHHhCcCCHHHHHHHHHHHhc
Q psy6071 79 VEAKKLMTRAFKSALTLQQQQHLLAELQH 107 (185)
Q Consensus 79 ~e~r~Lm~kA~~~~L~~~qqq~ll~~L~~ 107 (185)
.+++++|..-+.+.|++.++..|-..|+.
T Consensus 2 ~~~~~~l~~y~dg~L~~~~~~~~~~HL~~ 30 (36)
T PF13490_consen 2 EEVRELLSAYLDGELSPEERARLEAHLAS 30 (36)
T ss_dssp ---HHHHHHHHCT-S-HHHHHHHHHHHCT
T ss_pred hHHHHHHHHHHcCCCCHHHHHHHHHHHHc
Confidence 36899999999999999999999988864
No 5
>PF07637 PSD5: Protein of unknown function (DUF1595); InterPro: IPR013043 A region of similarity shared by several Rhodopirellula baltica cytochrome-like proteins that are predicted to be secreted. These proteins also contain IPR011478 from INTERPRO, IPR013036 from INTERPRO, IPR013039 from INTERPRO and IPR013042 from INTERPRO.
Probab=53.32 E-value=23 Score=24.49 Aligned_cols=56 Identities=23% Similarity=0.250 Sum_probs=39.3
Q ss_pred chHHHHHHHHHHHhCcCCHHHHHHHHHHHhcCcchhhhcCCCCCchhHHhhhChHHHH-HHHHHHhcCCChh
Q psy6071 77 AGVEAKKLMTRAFKSALTLQQQQHLLAELQHDSKLVYHIGLTPAKLPDLVENNPLIAI-EVLLKLIQSNHIT 147 (185)
Q Consensus 77 ~~~e~r~Lm~kA~~~~L~~~qqq~ll~~L~~Dpklv~~~gltp~kLP~LVEnNP~IAi-evLlkL~~s~~i~ 147 (185)
...-+++...+|++.|+++.|.+.+++..++.-+ .| .+..-|+ +.|..++.||++-
T Consensus 3 a~~~l~~Fa~rAfRRp~~~~e~~~~~~~~~~~~~----~g-----------~~~~~a~~~~l~aiL~SP~FL 59 (64)
T PF07637_consen 3 AREILRRFARRAFRRPLTDEEVDRYLALYDSARA----QG-----------EDFEEALKEALQAILCSPSFL 59 (64)
T ss_pred HHHHHHHHHHHHhCCCCCHHHHHHHHHHHHHHHH----cC-----------CCHHHHHHHHHHHHHcCcchh
Confidence 3466889999999999999999998887775332 12 2233344 4677777788753
No 6
>PF14327 CSTF2_hinge: Hinge domain of cleavage stimulation factor subunit 2; PDB: 4EBA_G.
Probab=53.20 E-value=21 Score=26.11 Aligned_cols=57 Identities=21% Similarity=0.346 Sum_probs=36.6
Q ss_pred HHHHHHHHHhCcCCHHHHHHHHHHHhcCcchhhhcCCCCCchhHHhhhChHHHHHHHHHHhcCC
Q psy6071 81 AKKLMTRAFKSALTLQQQQHLLAELQHDSKLVYHIGLTPAKLPDLVENNPLIAIEVLLKLIQSN 144 (185)
Q Consensus 81 ~r~Lm~kA~~~~L~~~qqq~ll~~L~~Dpklv~~~gltp~kLP~LVEnNP~IAievLlkL~~s~ 144 (185)
..+-|+++ -+.+++.|.-.++.++|. -+.=.|++.-+|...||.+|+.++..++--+
T Consensus 15 ~~~~Is~~-l~~l~~~ql~ell~~mK~------l~~~~p~~ar~lL~~nPqLa~Al~qa~l~lg 71 (84)
T PF14327_consen 15 APDAISQT-LSSLPPEQLYELLSQMKQ------LAQQNPEQARQLLQQNPQLAYALFQALLLLG 71 (84)
T ss_dssp HHHHHHTT-SSTSHHHHHHHHHHHHHH------HHC----HHHHHHHS-THHHHHHHHHHHHTS
T ss_pred cHHHHHHH-HHhCCHHHHHHHHHHHHH------HHHhCHHHHHHHHHHCcHHHHHHHHHHHHhC
Confidence 33444433 356788888888888775 3446678888999999999998876665543
No 7
>cd08318 Death_NMPP84 Death domain of Nuclear Matrix Protein P84. Death domain (DD) found in the Nuclear Matrix Protein P84 (also known as HPR1 or THOC1). HPR1/p84 resides in the nuclear matrix and is part of the THO complex, also called TREX (transcription/export) complex, which functions in mRNP biogenesis at the interface between transcription and export of mRNA from the nucleus. Mice lacking THOC1 have abnormal testis development and are sterile. In general, DDs are protein-protein interaction domains found in a variety of domain architectures. Their common feature is that they form homodimers by self-association or heterodimers by associating with other members of the DD superfamily including CARD (Caspase activation and recruitment domain), DED (Death Effector Domain), and PYRIN. They serve as adaptors in signaling pathways and can recruit other proteins into signaling complexes.
Probab=46.33 E-value=24 Score=25.75 Aligned_cols=46 Identities=17% Similarity=0.231 Sum_probs=39.2
Q ss_pred CcCCHHHHHHHHHHHhcCcc-hhhhcCCCCCchhHHhhhChHHHHHH
Q psy6071 91 SALTLQQQQHLLAELQHDSK-LVYHIGLTPAKLPDLVENNPLIAIEV 136 (185)
Q Consensus 91 ~~L~~~qqq~ll~~L~~Dpk-lv~~~gltp~kLP~LVEnNP~IAiev 136 (185)
.+++..|.+.|.+.+-.|.+ |+.+.||+...+-.+-.+|+.+...+
T Consensus 3 ~~~t~~~l~~ia~~iG~~Wk~Lar~LGls~~dI~~i~~~~~~~~eq~ 49 (86)
T cd08318 3 KPVTGEQITVFANKLGEDWKTLAPHLEMKDKEIRAIESDSEDIKMQA 49 (86)
T ss_pred CCCCHHHHHHHHHHHhhhHHHHHHHcCCCHHHHHHHHhcCCCHHHHH
Confidence 47899999999999999998 99999999999988777777754333
No 8
>PF05819 NolX: NolX protein; InterPro: IPR008718 This family consists of Rhizobium NolX and Xanthomonas HrpF proteins. The interaction between the plant pathogen Xanthomonas campestris pv. vesicatoria (strain 85-10) and its host plants is controlled by hrp genes (hypersensitive reaction and pathogenicity), which encode a type III protein secretion system. Among type III-secreted proteins are avirulence proteins, effectors involved in the induction of plant defence reactions. HrpF is dispensable for protein secretion but required for AvrBs3 recognition in planta, is thought to function as a translocator of effector proteins into the host cell []. NolX, a Glycine max (Soybean) cultivar specificity protein, is secreted by a type III secretion system (TTSS) and shows homology to HrpF. It is not known whether NolX functions at the bacterium-plant interface or acts inside the host cell. NolX is expressed in planta only during the early stages of nodule development [].; GO: 0009877 nodulation
Probab=43.41 E-value=77 Score=31.39 Aligned_cols=84 Identities=17% Similarity=0.231 Sum_probs=57.6
Q ss_pred hhhhcCCCCCCccccccCCccccCCchHHHHHHHHHHHhCcCCHHHHHHHHHHHhcCcchhhhcC----------CCCCc
Q psy6071 52 LIWLNLVDPSEHKIAYDSTMCVSNSAGVEAKKLMTRAFKSALTLQQQQHLLAELQHDSKLVYHIG----------LTPAK 121 (185)
Q Consensus 52 l~Wlnp~~~~~h~~~wD~sm~~~~~~~~e~r~Lm~kA~~~~L~~~qqq~ll~~L~~Dpklv~~~g----------ltp~k 121 (185)
..|---... +.+++-=..+....+.-.--++=+.+-..-|=+|.+.+.-++.|.+||.+.|-+| |+..-
T Consensus 246 itw~gg~lt-~~eleiVavL~RHkDk~pl~w~~l~~k~nDPsTPpDLk~Al~~L~~DP~lf~AigsqgdGkcgGKI~AkD 324 (624)
T PF05819_consen 246 ITWNGGTLT-QNELEIVAVLNRHKDKCPLDWKSLDDKINDPSTPPDLKAALQALQQDPALFYAIGSQGDGKCGGKIKAKD 324 (624)
T ss_pred ceecCCccc-ccchhhhhhhhhccccCCcchhhHHhhccCCCCChhHHHHHHHHhhChHHHHHhccCCCCCcCCeechhh
Confidence 777655443 2333221111111122233355566677889999999999999999999998654 78888
Q ss_pred hhHHhhhChHHHHHH
Q psy6071 122 LPDLVENNPLIAIEV 136 (185)
Q Consensus 122 LP~LVEnNP~IAiev 136 (185)
|-.+.+|+|+||..-
T Consensus 325 ls~fs~~H~qva~y~ 339 (624)
T PF05819_consen 325 LSKFSSHHPQVAEYA 339 (624)
T ss_pred HHHHHhcChHHHHHH
Confidence 999999999998764
No 9
>PF10303 DUF2408: Protein of unknown function (DUF2408); InterPro: IPR018810 This entry represents a family of proteins conserved in fungi whose function is unknown.
Probab=39.09 E-value=1.7e+02 Score=23.38 Aligned_cols=89 Identities=16% Similarity=0.251 Sum_probs=63.9
Q ss_pred HHHHHHHHHHhCcCC------HHHHHHHHHHHh-------cCcchhhhc-CCCCCc----hhHHhhhChHHHHHHHHHHh
Q psy6071 80 EAKKLMTRAFKSALT------LQQQQHLLAELQ-------HDSKLVYHI-GLTPAK----LPDLVENNPLIAIEVLLKLI 141 (185)
Q Consensus 80 e~r~Lm~kA~~~~L~------~~qqq~ll~~L~-------~Dpklv~~~-gltp~k----LP~LVEnNP~IAievLlkL~ 141 (185)
--|.|+.-+-++.-+ ..+-+.+.++|+ .|.|++-.- |=.|.. +-.|++..=.++.+++.+--
T Consensus 19 IrR~L~~~~t~~k~~~~~~~~~~el~~lq~qL~eIe~~R~~DGKF~~~~~g~~~~~gQ~~l~~LLd~C~~li~dl~~~~~ 98 (134)
T PF10303_consen 19 IRRSLLSLNTRSKFSDSSEESSSELKPLQEQLKEIESMRDVDGKFVSPDTGEVPPGGQAVLNGLLDDCFDLIEDLLERKG 98 (134)
T ss_pred HHHHHHHHHhccCCCccccccHHHHHHHHHHHHHHHHhccCCCCeeCCCCCCCCcchHHHHHHHHHHHHHHHHHHHHhcc
Confidence 345666677777777 778777766664 677766644 433433 56688888888888877655
Q ss_pred ---c-CCChhHHHHHHhhhhhhhhHHHHHHH
Q psy6071 142 ---Q-SNHITEYFSVLVNMEMSLHSMEVVNS 168 (185)
Q Consensus 142 ---~-s~~i~eY~~~Lv~md~SLhSmEVVNr 168 (185)
. ++.+.+.++.|+.|.-+|-.|.+.-|
T Consensus 99 ~~~~~~~~l~~iY~~L~~ik~~LE~L~lT~r 129 (134)
T PF10303_consen 99 EEIEVDPSLQPIYDQLIDIKNTLENLLLTRR 129 (134)
T ss_pred ccccccHHHHHHHHHHHHHHHHHHhhhhhcc
Confidence 2 45788899999999999988876543
No 10
>PF11212 DUF2999: Protein of unknown function (DUF2999); InterPro: IPR021376 This family of proteins with unknown function appears to be restricted to Gammaproteobacteria.
Probab=39.03 E-value=52 Score=24.58 Aligned_cols=48 Identities=23% Similarity=0.377 Sum_probs=36.3
Q ss_pred CcCCHHHHHHHHHHHhcCcc----hhhhcCCCCCchhHH---hhhChHHHHHHHH
Q psy6071 91 SALTLQQQQHLLAELQHDSK----LVYHIGLTPAKLPDL---VENNPLIAIEVLL 138 (185)
Q Consensus 91 ~~L~~~qqq~ll~~L~~Dpk----lv~~~gltp~kLP~L---VEnNP~IAievLl 138 (185)
.-.|+++...+++.|-.+|- .+-..|++|+||..| |-.||.+..|...
T Consensus 11 hnvsd~qi~elFq~lT~NPl~AMa~i~qLGip~eKLQ~lm~~VMqnP~LikeAv~ 65 (82)
T PF11212_consen 11 HNVSDEQINELFQALTQNPLAAMATIQQLGIPQEKLQQLMAQVMQNPALIKEAVE 65 (82)
T ss_pred cCCCHHHHHHHHHHHhhCHHHHHHHHHHcCCCHHHHHHHHHHHhcChHHHHHHHH
Confidence 45678888888888888885 445889999998865 6778887766543
No 11
>PF04391 DUF533: Protein of unknown function (DUF533); InterPro: IPR007486 Some family members may be secreted or integral membrane proteins.
Probab=38.91 E-value=1.3e+02 Score=25.35 Aligned_cols=37 Identities=35% Similarity=0.436 Sum_probs=21.8
Q ss_pred cCCHHHHHHHHHHHhcCcchhhhcCCCCCchhHHhhhChHHHHHHH
Q psy6071 92 ALTLQQQQHLLAELQHDSKLVYHIGLTPAKLPDLVENNPLIAIEVL 137 (185)
Q Consensus 92 ~L~~~qqq~ll~~L~~Dpklv~~~gltp~kLP~LVEnNP~IAievL 137 (185)
.++.++++.|..+|.+ =++|+.|-.-| .||..|.|+-
T Consensus 112 g~d~e~~~~l~~eL~~--------P~d~~~la~~v-~~~e~A~evY 148 (188)
T PF04391_consen 112 GLDAEERAWLQAELAA--------PLDPDALAAAV-TDPEQAAEVY 148 (188)
T ss_pred CCCHHHHHHHHHHHhC--------CCCHHHHHHhC-CCHHHHHHHH
Confidence 3444444444444442 25566666666 7888888863
No 12
>COG2979 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=37.38 E-value=1.5e+02 Score=26.21 Aligned_cols=82 Identities=20% Similarity=0.274 Sum_probs=49.6
Q ss_pred HHHHHHHHHHHhCcCCHHHHHHHHHHHhcCcchhhhcCCCCCchhHHhhhChHHHHHHHHHHhcCCChhHHHHHHhhhhh
Q psy6071 79 VEAKKLMTRAFKSALTLQQQQHLLAELQHDSKLVYHIGLTPAKLPDLVENNPLIAIEVLLKLIQSNHITEYFSVLVNMEM 158 (185)
Q Consensus 79 ~e~r~Lm~kA~~~~L~~~qqq~ll~~L~~Dpklv~~~gltp~kLP~LVEnNP~IAievLlkL~~s~~i~eY~~~Lv~md~ 158 (185)
.|-+.||.|--.+-+.++-++-|-.||++ | ++|+.+-.-+.| +..|.|+ |+-...-+|.
T Consensus 130 ~ERa~I~~~l~esG~d~e~~~~le~El~~-P-------lD~~~ia~~a~~-ee~a~El------------Y~ASrl~id~ 188 (225)
T COG2979 130 KERARIMQKLQESGVDPEAQAFLEQELEQ-P-------LDPDEIAAAARN-EEQALEL------------YLASRLAIDD 188 (225)
T ss_pred HHHHHHHHHHHHcCCCHHHHHHHHHHHhC-C-------CCHHHHHHHhcC-HHHHHHH------------HHHHHHhcCc
Confidence 44455555555555555555555555543 2 566666666655 8888886 5544444444
Q ss_pred hhhHHHHH--HHHHHhccccccchhc
Q psy6071 159 SLHSMEVV--NSVLVNMEMSLHSMEV 182 (185)
Q Consensus 159 SLhSmEVV--NrLtt~v~LP~eFi~v 182 (185)
.|-||-+ |+|.....||+.++.=
T Consensus 189 -d~r~Er~YL~~La~~L~L~dalvd~ 213 (225)
T COG2979 189 -DSRMERSYLNALAGALGLPDALVDH 213 (225)
T ss_pred -hhHHHHHHHHHHHHHhCCCHHHHHH
Confidence 3455544 7888888888887653
No 13
>COG4691 StbC Plasmid stability protein [General function prediction only]
Probab=36.74 E-value=26 Score=26.20 Aligned_cols=44 Identities=25% Similarity=0.332 Sum_probs=32.9
Q ss_pred CchHHHHHHHHHHHhCcCCHHHHHHHHHHHhcCcchhhhcCCCCCchhHHh
Q psy6071 76 SAGVEAKKLMTRAFKSALTLQQQQHLLAELQHDSKLVYHIGLTPAKLPDLV 126 (185)
Q Consensus 76 ~~~~e~r~Lm~kA~~~~L~~~qqq~ll~~L~~Dpklv~~~gltp~kLP~LV 126 (185)
|..+|+|+++..|+.+ |--..-.+|+.| .-.++++.|++++..+
T Consensus 27 S~EaE~ReIL~~al~~-----E~~p~~~alAad--~grei~l~~er~~~~~ 70 (80)
T COG4691 27 SVEAEAREILTAALLG-----EEVPMPVALAAD--SGREIDLEPERLGLIA 70 (80)
T ss_pred chHHHHHHHHHHHhcc-----cCCCcHHHHHhc--cCcccccCccccchhh
Confidence 3568999999999887 444556677776 4457889999988754
No 14
>PF08278 DnaG_DnaB_bind: DNA primase DnaG DnaB-binding ; InterPro: IPR013173 Eubacterial DnaG primases interact with several factors to form the replisome. One of these factors is DnaB, a helicase. This domain has been demonstrated to be responsible for the interaction between DnaG and DnaB []. This domain has a multi-helical structure that forms an orthogonal bundle [].; GO: 0003896 DNA primase activity, 0006269 DNA replication, synthesis of RNA primer; PDB: 2HAJ_A 1T3W_B.
Probab=35.01 E-value=58 Score=24.48 Aligned_cols=29 Identities=21% Similarity=0.203 Sum_probs=20.6
Q ss_pred chHHHHHHHHHHHhCcCCHHHHHHHHHHH
Q psy6071 77 AGVEAKKLMTRAFKSALTLQQQQHLLAEL 105 (185)
Q Consensus 77 ~~~e~r~Lm~kA~~~~L~~~qqq~ll~~L 105 (185)
...++.+|.+|..+++|+.+|++.+...|
T Consensus 99 ~~~~i~~L~~k~~~~~Lt~eEk~el~~Ll 127 (127)
T PF08278_consen 99 LERRIEELKAKPRRGGLTDEEKQELRRLL 127 (127)
T ss_dssp HHHHHHHHHHHHTTT---HHHHHHHHHHH
T ss_pred HHHHHHHHHHhhccCCcCHHHHHHHHHhC
Confidence 45778889999998999999988876543
No 15
>cd01670 Death Death Domain: a protein-protein interaction domain. Death Domains (DDs) are protein-protein interaction domains found in a variety of domain architectures. Their common feature is that they form homodimers by self-association or heterodimers by associating with other members of the DD superfamily including CARD (Caspase activation and recruitment domain), DED (Death Effector Domain), and PYRIN. Structural analysis of DD-DD complexes show that the domains interact with each other in many different ways. DD-containing proteins serve as adaptors in signaling pathways and they can recruit other proteins into signaling complexes. In mammals, they are prominent components of the programmed cell death (apoptosis) pathway and are found in a number of other signaling pathways. In invertebrates, they are involved in transcriptional regulation of zygotic patterning genes in insect embryogenesis, and are components of the ToII/NF-kappaB pathway, a conserved innate immune pathway in a
Probab=34.26 E-value=55 Score=22.32 Aligned_cols=60 Identities=25% Similarity=0.394 Sum_probs=43.3
Q ss_pred HHHHHHHhcCcc-hhhhcCCCCCchhHHhhhCh---HHHHHHHHHHhcCC----ChhHHHHHHhhhhh
Q psy6071 99 QHLLAELQHDSK-LVYHIGLTPAKLPDLVENNP---LIAIEVLLKLIQSN----HITEYFSVLVNMEM 158 (185)
Q Consensus 99 q~ll~~L~~Dpk-lv~~~gltp~kLP~LVEnNP---~IAievLlkL~~s~----~i~eY~~~Lv~md~ 158 (185)
+.|.+.|-+|.+ ++...|++...+-.+=++|| .-+.++|..-.+.. .+....++|-.|+.
T Consensus 3 ~~ia~~lg~~W~~la~~Lgl~~~~I~~i~~~~~~~~~~~~~mL~~W~~~~~~~at~~~L~~aL~~~~~ 70 (79)
T cd01670 3 DKLAKKLGKDWKKLARKLGLSDGEIDQIEEDNPRVREQAYQLLLKWEEREGDNATVGNLIEALREIGR 70 (79)
T ss_pred HHHHHHHhhHHHHHHHHhCCCHHHHHHHHHhCCCHHHHHHHHHHHHHhccCcCcHHHHHHHHHHHcCH
Confidence 456677778887 88899999999999999997 55666665555543 34556666666665
No 16
>cd08780 Death_TRADD Death Domain of Tumor Necrosis Factor Receptor 1-Associated Death Domain protein. Death domain (DD) of TRADD (TNF Receptor 1-Associated Death Domain or TNFRSF1A-associated via death domain) protein. TRADD is a central signaling adaptor for TNF-receptor 1 (TNFR1), mediating activation of Nuclear Factor -kappaB (NF-kB) and c-Jun N-terminal kinase (JNK), as well as caspase-dependent apoptosis. It also carries important immunological roles including germinal center formation, DR3-mediated T-cell stimulation, and TNFalpha-mediated inflammatory responses. In general, DDs are protein-protein interaction domains found in a variety of domain architectures. Their common feature is that they form homodimers by self-association or heterodimers by associating with other members of the DD superfamily including CARD (Caspase activation and recruitment domain), DED (Death Effector Domain), and PYRIN. They serve as adaptors in signaling pathways and can recruit other proteins into s
Probab=30.56 E-value=1e+02 Score=23.56 Aligned_cols=61 Identities=23% Similarity=0.218 Sum_probs=45.6
Q ss_pred CHHHHHHHHHHHhcCcc-hhhhcC-----CCCCchhHHhhhCh-----HHHHHHHHHHhcCCChhHHHHHHh
Q psy6071 94 TLQQQQHLLAELQHDSK-LVYHIG-----LTPAKLPDLVENNP-----LIAIEVLLKLIQSNHITEYFSVLV 154 (185)
Q Consensus 94 ~~~qqq~ll~~L~~Dpk-lv~~~g-----ltp~kLP~LVEnNP-----~IAievLlkL~~s~~i~eY~~~Lv 154 (185)
+++++|.+-+.+=++.| +....| ++-.++..+-+..| .-|.+.|-+-.++..-..-++.|+
T Consensus 1 ~~~~~q~~~~nvGr~WK~laR~Lg~~cral~d~~ID~I~~~y~r~gL~EqvyQ~L~~W~~~eg~~Atv~~Lv 72 (90)
T cd08780 1 TPADQQHFAKSVGKKWKPVGRSLQKNCRALRDPAIDNLAYEYDREGLYEQAYQLLRRFIQSEGKKATLQRLV 72 (90)
T ss_pred CHHHHHHHHHHHhHHHHHHHHHHcccccccchhHHHHHHhhcccccHHHHHHHHHHHHHHhccccchHHHHH
Confidence 57899999999999999 777999 88888888775555 468888877777654444444443
No 17
>PF13833 EF-hand_8: EF-hand domain pair; PDB: 3KF9_A 1TTX_A 1WLZ_A 1ALV_A 1NX3_A 1ALW_A 1NX2_A 1NX1_A 1NX0_A 1DF0_A ....
Probab=28.73 E-value=1.3e+02 Score=18.84 Aligned_cols=32 Identities=16% Similarity=0.281 Sum_probs=26.6
Q ss_pred hHHHHHHHHHHHhCc-CCHHHHHHHHHHHhcCcc
Q psy6071 78 GVEAKKLMTRAFKSA-LTLQQQQHLLAELQHDSK 110 (185)
Q Consensus 78 ~~e~r~Lm~kA~~~~-L~~~qqq~ll~~L~~Dpk 110 (185)
..+++..| +.+.-. +++++.+.++..+..|.+
T Consensus 7 ~~~~~~~l-~~~g~~~~s~~e~~~l~~~~D~~~~ 39 (54)
T PF13833_consen 7 REEFRRAL-SKLGIKDLSEEEVDRLFREFDTDGD 39 (54)
T ss_dssp HHHHHHHH-HHTTSSSSCHHHHHHHHHHHTTSSS
T ss_pred HHHHHHHH-HHhCCCCCCHHHHHHHHHhcccCCC
Confidence 46788888 666667 999999999999998775
No 18
>PF04296 DUF448: Protein of unknown function (DUF448); InterPro: IPR007393 This entry represents a group of uncharacterised proteins. Some member sequences retain zinc-binding residues. The structure of the hypothetical cytosolic protein SP0554 from Streptococcus pneumoniae revealed an alpha+beta fold that could have evolved from a glucocorticoid receptor-like zinc finger domain [].; PDB: 1G2R_A.
Probab=27.93 E-value=85 Score=22.49 Aligned_cols=28 Identities=32% Similarity=0.430 Sum_probs=19.5
Q ss_pred hHHHHHHHHHHHhCcCCHHHHHHHHHHH
Q psy6071 78 GVEAKKLMTRAFKSALTLQQQQHLLAEL 105 (185)
Q Consensus 78 ~~e~r~Lm~kA~~~~L~~~qqq~ll~~L 105 (185)
.+.-+..+.+|++.+.++.-++.|.++|
T Consensus 51 ~a~kkk~l~ra~k~~v~~~~~~~L~~~l 78 (78)
T PF04296_consen 51 KAKKKKALNRAFKTKVPEEVYEQLEEEL 78 (78)
T ss_dssp HHHHHTHHHHHHTS---HHHHHHHHHH-
T ss_pred HHHHHCHHHHHHCCCCCHHHHHHHHhhC
Confidence 3556679999999999999888887765
No 19
>PF13919 ASXH: Asx homology domain
Probab=27.65 E-value=1.2e+02 Score=24.37 Aligned_cols=63 Identities=19% Similarity=0.179 Sum_probs=39.0
Q ss_pred HHHHHHHHHHHhCcCCHHHHHHHHHHHhcCcchhhhcCC---CCCchh---HHhhhChHHHHHHH--HHHhcCCCh
Q psy6071 79 VEAKKLMTRAFKSALTLQQQQHLLAELQHDSKLVYHIGL---TPAKLP---DLVENNPLIAIEVL--LKLIQSNHI 146 (185)
Q Consensus 79 ~e~r~Lm~kA~~~~L~~~qqq~ll~~L~~Dpklv~~~gl---tp~kLP---~LVEnNP~IAievL--lkL~~s~~i 146 (185)
..+|.|+...-=+-|++.+||.|+..|=. +....- ++..+| .+.. |+.....+- ...+.+..+
T Consensus 33 ~dLr~L~N~~tw~~L~~eeq~eLl~LLP~----~D~~~~~~~~~~~~~l~~S~ln-n~~F~~a~~~fqe~L~~G~~ 103 (138)
T PF13919_consen 33 ADLRALLNPETWSCLPEEEQQELLKLLPE----VDRQVGPDPPDDSLPLSESALN-NEFFRDACQEFQERLAEGEF 103 (138)
T ss_pred cCHHHHhCHHHHhcCCHHHHHHHHHhCCC----CCcccccCCCcccccCCHHHhc-CHHHHHHHHHHHHHHHcCCC
Confidence 55789996544499999999999999965 212111 455555 5554 666655542 344444443
No 20
>TIGR02501 type_III_yscE type III secretion system protein, YseE family. Members of this family are found exclusively in type III secretion appparatus gene clusters in bacteria. Those bacteria with a protein from this family tend to target animal cells, as does Yersinia pestis. This protein is small (about 70 amino acids) and not well characterized.
Probab=27.41 E-value=1.3e+02 Score=21.54 Aligned_cols=29 Identities=14% Similarity=-0.114 Sum_probs=23.5
Q ss_pred hHHHHHHHHHHHhCcCCHHHHHHHHHHHh
Q psy6071 78 GVEAKKLMTRAFKSALTLQQQQHLLAELQ 106 (185)
Q Consensus 78 ~~e~r~Lm~kA~~~~L~~~qqq~ll~~L~ 106 (185)
..+++..+.+++..+-+|+|.|.+....+
T Consensus 22 L~~a~~~lk~qL~~~~tp~qYq~l~~~~~ 50 (67)
T TIGR02501 22 LEAALAELKEQLSRGGDPQQYQEWQLLAD 50 (67)
T ss_pred HHHHHHHHHHHHhCCCCHHHHHHHHHHHH
Confidence 45677788899999999999988776654
No 21
>smart00005 DEATH DEATH domain, found in proteins involved in cell death (apoptosis). Alpha-helical domain present in a variety of proteins with apoptotic functions. Some (but not all) of these domains form homotypic and heterotypic dimers.
Probab=26.88 E-value=85 Score=21.87 Aligned_cols=51 Identities=24% Similarity=0.331 Sum_probs=36.6
Q ss_pred CHHHHHHHHHH-HhcCcc-hhhhcCCCCCchhHHhhhCh----HHHHHHHHHHhcCC
Q psy6071 94 TLQQQQHLLAE-LQHDSK-LVYHIGLTPAKLPDLVENNP----LIAIEVLLKLIQSN 144 (185)
Q Consensus 94 ~~~qqq~ll~~-L~~Dpk-lv~~~gltp~kLP~LVEnNP----~IAievLlkL~~s~ 144 (185)
+.++...+.+. +-+|.+ ++.+.|++...+..+-.+|| .-+.++|..-....
T Consensus 4 ~~~~~~~l~~~~~g~~W~~la~~Lg~~~~~i~~i~~~~~~~~~~~~~~lL~~W~~~~ 60 (88)
T smart00005 4 TREKLAKLLDHPLGLDWRELARKLGLSEADIDQIRTEAPRDLAEQSVQLLRLWEQRE 60 (88)
T ss_pred HHHHHHHHHcCccchHHHHHHHHcCCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHcc
Confidence 34555566666 666777 77899999999999998988 46666665555543
No 22
>PF04218 CENP-B_N: CENP-B N-terminal DNA-binding domain; InterPro: IPR006695 Centromere Protein B (CENP-B) is a DNA-binding protein localized to the centromere. Within the N-terminal 125 residues, there is a DNA-binding region, which binds to a corresponding 17bp CENP-B box sequence. CENP-B dimers either bind two separate DNA molecules or alternatively, they may bind two CENP-B boxes on one DNA molecule, with the intervening stretch of DNA forming a loop structure. The CENP-B DNA-binding domain consists of two repeating domains, RP1 and RP2. This family corresponds to RP1 has been shown to consist of four helices in a helix-turn-helix structure [].; GO: 0003677 DNA binding, 0000775 chromosome, centromeric region; PDB: 1BW6_A 1HLV_A 2ELH_A.
Probab=26.21 E-value=96 Score=20.62 Aligned_cols=39 Identities=15% Similarity=0.452 Sum_probs=28.8
Q ss_pred CcCCHHHHHHHHHHHhcCcc---hhhhcCCCCCchhHHhhhC
Q psy6071 91 SALTLQQQQHLLAELQHDSK---LVYHIGLTPAKLPDLVENN 129 (185)
Q Consensus 91 ~~L~~~qqq~ll~~L~~Dpk---lv~~~gltp~kLP~LVEnN 129 (185)
..|+..|.-.++..++..++ ++...|+....+-+++.|-
T Consensus 5 ~~LTl~eK~~iI~~~e~g~s~~~ia~~fgv~~sTv~~I~K~k 46 (53)
T PF04218_consen 5 KSLTLEEKLEIIKRLEEGESKRDIAREFGVSRSTVSTILKNK 46 (53)
T ss_dssp SS--HHHHHHHHHHHHCTT-HHHHHHHHT--CCHHHHHHHCH
T ss_pred ccCCHHHHHHHHHHHHcCCCHHHHHHHhCCCHHHHHHHHHhH
Confidence 46889999999999999987 7779999999888877664
No 23
>PF13475 DUF4116: Domain of unknown function (DUF4116)
Probab=26.00 E-value=1.2e+02 Score=18.69 Aligned_cols=10 Identities=10% Similarity=0.358 Sum_probs=4.1
Q ss_pred hhhChHHHHH
Q psy6071 126 VENNPLIAIE 135 (185)
Q Consensus 126 VEnNP~IAie 135 (185)
+.+|..++.+
T Consensus 23 lk~D~e~vl~ 32 (49)
T PF13475_consen 23 LKNDKEFVLK 32 (49)
T ss_pred HhCCHHHHHH
Confidence 3444444433
No 24
>PF08484 Methyltransf_14: C-methyltransferase C-terminal domain; InterPro: IPR013691 This domain is found in bacterial C-methyltransferase proteins, often together with other methyltransferase domains such as IPR013216 from INTERPRO or IPR013217 from INTERPRO. ; PDB: 4E2X_A 3NDJ_A 3NDI_A 4E32_A 4E33_A 4E31_A 4E2Y_A 4E2W_A 4E2Z_A 4E30_A.
Probab=24.61 E-value=50 Score=26.84 Aligned_cols=35 Identities=29% Similarity=0.529 Sum_probs=22.5
Q ss_pred HHHHHHHHHhcCcc-------------hhhhcCCCCCchhHHhhhChH
Q psy6071 97 QQQHLLAELQHDSK-------------LVYHIGLTPAKLPDLVENNPL 131 (185)
Q Consensus 97 qqq~ll~~L~~Dpk-------------lv~~~gltp~kLP~LVEnNP~ 131 (185)
+...+++.++.+.+ ++..||++++.++-+|+.||.
T Consensus 56 ~l~~~L~~~~~~gk~I~~yGA~~kg~tlln~~g~~~~~I~~vvD~np~ 103 (160)
T PF08484_consen 56 ELREFLEKLKAEGKRIAGYGAGAKGNTLLNYFGLDNDLIDYVVDDNPL 103 (160)
T ss_dssp HHHHHHHHHHHTT--EEEE---SHHHHHHHHHT--TTTS--EEES-GG
T ss_pred HHHHHHHHHHHcCCEEEEECcchHHHHHHHHhCCCcceeEEEEeCChh
Confidence 34456667777777 678999999999999999985
No 25
>PF13767 DUF4168: Domain of unknown function (DUF4168)
Probab=23.71 E-value=1.6e+02 Score=20.85 Aligned_cols=31 Identities=23% Similarity=0.264 Sum_probs=13.6
Q ss_pred CHHHHHHHHHHHhcCcc-hhhhcCCCCCchhH
Q psy6071 94 TLQQQQHLLAELQHDSK-LVYHIGLTPAKLPD 124 (185)
Q Consensus 94 ~~~qqq~ll~~L~~Dpk-lv~~~gltp~kLP~ 124 (185)
++.+.+.|..+....-. .+...|||+.++-.
T Consensus 32 ~~~~~~~l~~~a~~~~~~~I~~~GLtv~~fN~ 63 (78)
T PF13767_consen 32 DPEEIQELQEEAQEEMVEAIEENGLTVERFNE 63 (78)
T ss_pred CHHHHHHHHHHHHHHHHHHHHHcCCCHHHHHH
Confidence 33444444433333222 44555555555443
No 26
>PF03564 DUF1759: Protein of unknown function (DUF1759); InterPro: IPR005312 This is a small family of proteins of unknown function.
Probab=23.50 E-value=3.3e+02 Score=20.61 Aligned_cols=49 Identities=16% Similarity=0.163 Sum_probs=33.8
Q ss_pred HHHHHHHHHHHh--CcCCHHHHHH-HHHHHhcCcc-hhhhcCCCCCchhHHhh
Q psy6071 79 VEAKKLMTRAFK--SALTLQQQQH-LLAELQHDSK-LVYHIGLTPAKLPDLVE 127 (185)
Q Consensus 79 ~e~r~Lm~kA~~--~~L~~~qqq~-ll~~L~~Dpk-lv~~~gltp~kLP~LVE 127 (185)
...++.....+- ..+++.+.-. |...|+.+.+ ++.++.+++......++
T Consensus 9 ~~F~~~F~~~v~~n~~~~d~~K~~~L~~~L~G~A~~~i~~~~~~~~~Y~~a~~ 61 (145)
T PF03564_consen 9 PEFIDQFDSLVHENPDLSDIEKLNYLRSCLKGEAKELIRGLPLSEENYEEAWE 61 (145)
T ss_pred HHHHHHHHHHHhcccCCCHHHHHHHHHHHhcchHHHHHHcccccchhhHHHHH
Confidence 456666666654 5788877655 7777888877 77888887776665444
No 27
>cd08319 Death_RAIDD Death domain of RIP-associated ICH-1 homologous protein with a death domain. Death domain (DD) of RAIDD (RIP-associated ICH-1 homologous protein with a death domain), also known as CRADD (Caspase and RIP adaptor). RAIDD is an adaptor protein that together with the p53-inducible protein PIDD and caspase-2, forms the PIDDosome complex, which is required for caspase-2 activation and plays a role in mediating stress-induced apoptosis. RAIDD contains an N-terminal Caspase Activation and Recruitment Domain (CARD), which interacts with the caspase-2 CARD, and a C-terminal DD, which interacts with the DD of PIDD. In general, DDs are protein-protein interaction domains found in a variety of domain architectures. Their common feature is that they form homodimers by self-association or heterodimers by associating with other members of the DD superfamily including CARD, DED (Death Effector Domain), and PYRIN. They serve as adaptors in signaling pathways and can recruit other pr
Probab=23.45 E-value=1.4e+02 Score=21.93 Aligned_cols=49 Identities=24% Similarity=0.345 Sum_probs=38.6
Q ss_pred CHHHHHHHHHHHhcCcc-hhhhcCCCCCchhHHhhhChH----HHHHHHHHHhc
Q psy6071 94 TLQQQQHLLAELQHDSK-LVYHIGLTPAKLPDLVENNPL----IAIEVLLKLIQ 142 (185)
Q Consensus 94 ~~~qqq~ll~~L~~Dpk-lv~~~gltp~kLP~LVEnNP~----IAievLlkL~~ 142 (185)
++.|.+.|-+.|-+|.+ ++.+.|+|-..+-.+-++||. =+.+.|.+=-+
T Consensus 1 t~~~L~~la~~LG~~W~~Lar~Lgls~~~I~~i~~~~p~~l~eQv~~mL~~W~~ 54 (83)
T cd08319 1 TDRELNQLAQRLGPEWEQVLLDLGLSQTDIYRCKENHPHNVQSQIVEALVKWRQ 54 (83)
T ss_pred CHHHHHHHHHHHhhhHHHHHHHcCCCHHHHHHHHHhCCCCHHHHHHHHHHHHHH
Confidence 46788889999999998 888999999999999999884 34445544444
No 28
>PF13022 HTH_Tnp_1_2: Helix-turn-helix of insertion element transposase; PDB: 2AO9_I.
Probab=23.17 E-value=1.4e+02 Score=24.52 Aligned_cols=45 Identities=22% Similarity=0.378 Sum_probs=25.0
Q ss_pred HHHHhCcCCHHHHHH----HHHHHhcCc-------chhhhcCCCCCchhHHhhhCh
Q psy6071 86 TRAFKSALTLQQQQH----LLAELQHDS-------KLVYHIGLTPAKLPDLVENNP 130 (185)
Q Consensus 86 ~kA~~~~L~~~qqq~----ll~~L~~Dp-------klv~~~gltp~kLP~LVEnNP 130 (185)
.|.+..+|+++|++- +.+++-.+. .++.++|+++..|=.-=-+|.
T Consensus 4 ~~~le~~L~~~Q~kAa~ll~~ne~~~~~~~r~T~~eiAee~Gis~~tLYrWr~~~~ 59 (142)
T PF13022_consen 4 LKELEAKLTLQQRKAAQLLVENELMPENGERRTQAEIAEEVGISRSTLYRWRQQNK 59 (142)
T ss_dssp HHHHHTTS-HHHHHHHHHHHHHHHS------S-HHHHHHHHTS-HHHHHHHHHH-H
T ss_pred HHHHHHHcCHHHHHHHHHHHHHHHhhhccccchHHHHHHHhCCCHHHHHHHHhcCH
Confidence 367789999999883 233443332 367788888776655444443
No 29
>PRK10455 periplasmic protein; Reviewed
Probab=22.85 E-value=2.3e+02 Score=23.26 Aligned_cols=53 Identities=9% Similarity=0.098 Sum_probs=28.6
Q ss_pred chHHHHHHHHHHHhC--cCCHHHHHHHHHHHhcCcchhhhcCCCCCchhHHhhhChHHHHHH
Q psy6071 77 AGVEAKKLMTRAFKS--ALTLQQQQHLLAELQHDSKLVYHIGLTPAKLPDLVENNPLIAIEV 136 (185)
Q Consensus 77 ~~~e~r~Lm~kA~~~--~L~~~qqq~ll~~L~~Dpklv~~~gltp~kLP~LVEnNP~IAiev 136 (185)
-..++|+||.++-.. ..+..+.+.+.+.+..|+ |+..++..+++.--....+-
T Consensus 61 Qrqqir~im~~~r~~~~~~~~~~r~~l~~li~ad~-------FDeaavra~~~k~~~~~~~~ 115 (161)
T PRK10455 61 QKQQIRDIMKAQRDQMKRPPLEERRAMHDIIASDT-------FDKAKAEAQITKMEAQRKAR 115 (161)
T ss_pred HHHHHHHHHHHHHHhhccccHHHHHHHHHHHccCc-------cCHHHHHHHHHHHHHHHHHH
Confidence 446667776666554 344455555555555544 55555666666544444443
No 30
>cd02434 Nodulin-21_like_3 Nodulin-21 and CCC1-related protein family. Nodulin-21_like_3: This is a family of proteins closely related to nodulin-21, a plant nodule-specific protein that may be involved in symbiotic nitrogen fixation. This family is also related to CCC1, a yeast vacuole transmembrane protein that functions as an iron and manganese transporter.
Probab=21.97 E-value=2.5e+02 Score=23.95 Aligned_cols=54 Identities=19% Similarity=0.204 Sum_probs=37.7
Q ss_pred chHHHHHHHHHHHhCcCCHHHHHHHHHHHhcCcch------hhhcCCCCCchhHHhhhChHHHH
Q psy6071 77 AGVEAKKLMTRAFKSALTLQQQQHLLAELQHDSKL------VYHIGLTPAKLPDLVENNPLIAI 134 (185)
Q Consensus 77 ~~~e~r~Lm~kA~~~~L~~~qqq~ll~~L~~Dpkl------v~~~gltp~kLP~LVEnNP~IAi 134 (185)
+..|.+|+..-=...-+++++.+++.+.+.+++++ ..+.|+.+... +.||..+.
T Consensus 79 pe~E~~el~~iy~~kG~~~~~a~~v~~~l~~~~~~~~~~m~~~elg~~~~~~----~~~P~~aA 138 (225)
T cd02434 79 PEGEKSEMVEIYSLKGLSEEVADQVVELLSKYRKLFVDIMMTEELGLIPDDE----LPSPLKTA 138 (225)
T ss_pred cHHHHHHHHHHHHHcCCCHHHHHHHHHHHHhCchhhHHHHHHhhccCCCCcc----cCCHHHHH
Confidence 44555555553344569999999999999999976 24678877652 35777654
No 31
>PRK10715 flk flagella biosynthesis regulator; Provisional
Probab=21.59 E-value=84 Score=29.20 Aligned_cols=46 Identities=30% Similarity=0.309 Sum_probs=38.0
Q ss_pred HHHHHHHHHhCcCCHHHHHHHHHHHhcCcchhhhcCCCCCchhHHh
Q psy6071 81 AKKLMTRAFKSALTLQQQQHLLAELQHDSKLVYHIGLTPAKLPDLV 126 (185)
Q Consensus 81 ~r~Lm~kA~~~~L~~~qqq~ll~~L~~Dpklv~~~gltp~kLP~LV 126 (185)
....+..|.|.|+..+++|.+.++....-...-..-||+.++-+++
T Consensus 229 tL~~L~~lLkQp~~~~erQal~Dy~rq~Fg~t~~t~LT~~QlqdvL 274 (335)
T PRK10715 229 TLESLQAALKQPLDAQEQQLLSDYAQQRFQASPQTPLTPAQVQDLL 274 (335)
T ss_pred HHHHHHHHhCCCCChhHHHHHHHHHHHHhCCCCCCcCCHHHHHHHH
Confidence 7788899999999999999999999986665556667887777754
No 32
>PF10925 DUF2680: Protein of unknown function (DUF2680); InterPro: IPR024485 Members in this family of proteins are annotated as YckD however currently no function is known.
Probab=21.52 E-value=1.6e+02 Score=20.45 Aligned_cols=38 Identities=13% Similarity=0.365 Sum_probs=28.7
Q ss_pred HHHHHHHHh-CcCCHHHHHHHHHHHhcCcchhhhcCCCC
Q psy6071 82 KKLMTRAFK-SALTLQQQQHLLAELQHDSKLVYHIGLTP 119 (185)
Q Consensus 82 r~Lm~kA~~-~~L~~~qqq~ll~~L~~Dpklv~~~gltp 119 (185)
+.++.|+.+ |.|+..|.+.+.+.++..-.-....|+-|
T Consensus 21 K~~idk~Ve~G~iTqeqAd~ik~~id~~~~~~~qnGf~p 59 (59)
T PF10925_consen 21 KQIIDKYVEAGVITQEQADAIKKHIDQRQEYMQQNGFVP 59 (59)
T ss_pred HHHHHHHHHcCCCCHHHHHHHHHHHHHHHHHHHHcCCCC
Confidence 455666654 78999999999999998777666667654
No 33
>PF11198 DUF2857: Protein of unknown function (DUF2857); InterPro: IPR021364 This is a bacterial family of uncharacterised proteins.
Probab=21.06 E-value=63 Score=26.78 Aligned_cols=33 Identities=21% Similarity=0.386 Sum_probs=23.8
Q ss_pred CCHHHHHHHHHHHhcCc-chhhhcCCCCCchhHH
Q psy6071 93 LTLQQQQHLLAELQHDS-KLVYHIGLTPAKLPDL 125 (185)
Q Consensus 93 L~~~qqq~ll~~L~~Dp-klv~~~gltp~kLP~L 125 (185)
|+..=..+++..|+... +-....||+++.+-.|
T Consensus 2 ln~avL~~~l~~l~~G~~~~c~~lG~~~eel~~L 35 (180)
T PF11198_consen 2 LNQAVLTQALHDLKEGNIRRCEALGFSPEELDAL 35 (180)
T ss_pred HHHHHHHHHHHHHHcccHHHHHHcCCCHHHHHHH
Confidence 34445567888888887 5778999999855443
No 34
>PF11239 DUF3040: Protein of unknown function (DUF3040); InterPro: IPR021401 Some members in this family of proteins with unknown function are annotated as membrane proteins however this cannot be confirmed.
Probab=20.94 E-value=1.2e+02 Score=21.73 Aligned_cols=25 Identities=20% Similarity=0.355 Sum_probs=16.8
Q ss_pred cCCHHHHHHHHHHH-----hcCcchhhhcCC
Q psy6071 92 ALTLQQQQHLLAEL-----QHDSKLVYHIGL 117 (185)
Q Consensus 92 ~L~~~qqq~ll~~L-----~~Dpklv~~~gl 117 (185)
+||+.||+ .++++ +.||++.....-
T Consensus 2 ~LSe~E~r-~L~eiEr~L~~~DP~fa~~l~~ 31 (82)
T PF11239_consen 2 PLSEHEQR-RLEEIERQLRADDPRFAARLRS 31 (82)
T ss_pred CCCHHHHH-HHHHHHHHHHhcCcHHHHHhcc
Confidence 68999987 45555 457887775444
No 35
>PRK13266 Thf1-like protein; Reviewed
Probab=20.88 E-value=2.3e+02 Score=24.89 Aligned_cols=86 Identities=16% Similarity=0.183 Sum_probs=44.8
Q ss_pred CCCCCccccCCchh-------hhhhcCCCCCCccccccCCccc----cCC------chHHHHHHHHHHHhCcC--CHHHH
Q psy6071 38 YYKPEFLRLVPPLH-------LIWLNLVDPSEHKIAYDSTMCV----SNS------AGVEAKKLMTRAFKSAL--TLQQQ 98 (185)
Q Consensus 38 ~f~P~flRp~Ppl~-------l~Wlnp~~~~~h~~~wD~sm~~----~~~------~~~e~r~Lm~kA~~~~L--~~~qq 98 (185)
.|--.|-||+|+++ ++-++.-.. +-++.||+=.|. .-+ +-.+-|+.|=.|+++.+ .++|.
T Consensus 13 ~F~~~~p~pI~siYrrvv~ELLVElHLl~~-n~~F~yDplfAlGlvt~fd~fm~GY~Pee~~~~IF~Alc~a~~~dp~~~ 91 (225)
T PRK13266 13 AFYAAFPRPINSIYRRVVDELLVELHLLSV-NSDFKYDPLFALGLVTVFDRFMQGYRPEEHKDSIFNALCQAVGFDPEQL 91 (225)
T ss_pred HHHHhCCCCCchHHHHHHHHHHHHHHHHHh-ccCceeCchHHhhHHHHHHHHHcCCCChHHHHHHHHHHHHHcCCCHHHH
Confidence 45567899999999 555554333 245555553321 111 12455666667777664 44444
Q ss_pred HHHHHHHhcCcchhhhcCCCCCchhHHhhhC
Q psy6071 99 QHLLAELQHDSKLVYHIGLTPAKLPDLVENN 129 (185)
Q Consensus 99 q~ll~~L~~Dpklv~~~gltp~kLP~LVEnN 129 (185)
.+--+.+... --|.+.+.+-+.+...
T Consensus 92 r~dA~~l~~~-----a~~~s~~~i~~~l~~~ 117 (225)
T PRK13266 92 RQDAERLLEL-----AKGKSLKEILSWLTQK 117 (225)
T ss_pred HHHHHHHHHH-----HhcCCHHHHHHHHhcc
Confidence 4443333321 1245555555544443
No 36
>PF11103 DUF2887: Protein of unknown function (DUF2887); InterPro: IPR022573 This bacterial group of proteins has no known function.
Probab=20.62 E-value=1.2e+02 Score=25.93 Aligned_cols=32 Identities=25% Similarity=0.297 Sum_probs=28.7
Q ss_pred CchHHHHHHHHHHHhCcCCHHHHHHHHHHHhc
Q psy6071 76 SAGVEAKKLMTRAFKSALTLQQQQHLLAELQH 107 (185)
Q Consensus 76 ~~~~e~r~Lm~kA~~~~L~~~qqq~ll~~L~~ 107 (185)
....++|.|+++|-.......+++.+++.++.
T Consensus 151 ~a~~~A~~Li~~~~~e~~~~~~~~~lleLIEt 182 (200)
T PF11103_consen 151 EAPEQARQLIEQAQQEITDEAQQRELLELIET 182 (200)
T ss_pred HHHHHHHHHHHHHhhhcCCcHHHHHHHHHHHH
Confidence 35688999999999999999999999999985
No 37
>KOG2054|consensus
Probab=20.58 E-value=2.3e+02 Score=30.31 Aligned_cols=109 Identities=18% Similarity=0.226 Sum_probs=70.2
Q ss_pred CccccCCCCCccccCCchh------hhhhcCCCCC------CccccccCCccccCCchHHHHHHHHHHHhCcCCHHHHHH
Q psy6071 33 PLVNQYYKPEFLRLVPPLH------LIWLNLVDPS------EHKIAYDSTMCVSNSAGVEAKKLMTRAFKSALTLQQQQH 100 (185)
Q Consensus 33 ~~~~~~f~P~flRp~Ppl~------l~Wlnp~~~~------~h~~~wD~sm~~~~~~~~e~r~Lm~kA~~~~L~~~qqq~ 100 (185)
.-......|+|+||.|-.. +-|-|-.... ...+.|+.++-.+ .-..+.++.|.|++++.=+-.+.=-
T Consensus 237 ~~~~~~~~~~~li~~~~~~f~~~kllp~~~~ir~~~e~~e~ppTP~yN~svL~~-~~le~~~q~L~K~~s~~~~f~da~~ 315 (1121)
T KOG2054|consen 237 ERLVTVRPPDFLIPCRLLPFKNNKLLPWYNGIRPAGEGSEEPPTPRYNTSVLED-QVLEEYLQLLSKTLSSAKGFKDALA 315 (1121)
T ss_pred cccccccCccccccccccccccccccchhcccCccccCCCCCCCCccchhHHHH-HHHHHHHHHHHHHHhhhhhHHHHHH
Confidence 3344556899999998776 6777652211 2567787776433 4567888899999988766555544
Q ss_pred H-HHHHhcCcchhhhcCCCCCchhHHhhhChHHHHHHHHHHhcCCChhHHHHHH
Q psy6071 101 L-LAELQHDSKLVYHIGLTPAKLPDLVENNPLIAIEVLLKLIQSNHITEYFSVL 153 (185)
Q Consensus 101 l-l~~L~~Dpklv~~~gltp~kLP~LVEnNP~IAievLlkL~~s~~i~eY~~~L 153 (185)
| ..+|+....-..+-||++. +..-++.-|+++..|+.|++.+
T Consensus 316 Llk~WlrqRs~~~~~~gfg~f-----------~~s~lvv~L~s~~ki~~~~S~y 358 (1121)
T KOG2054|consen 316 LLKVWLRQRSLDIGQGGFGGF-----------LLSALVVYLVSTRKIHTTLSAY 358 (1121)
T ss_pred HHHHHHHhhhhhcccCcchHH-----------HHHHHHHHHHhcCchhhcchHH
Confidence 3 3456666444456777765 4444556777777777766543
No 38
>PF01191 RNA_pol_Rpb5_C: RNA polymerase Rpb5, C-terminal domain; InterPro: IPR000783 Prokaryotes contain a single DNA-dependent RNA polymerase (RNAP; 2.7.7.6 from EC) that is responsible for the transcription of all genes, while eukaryotes have three classes of RNAPs (I-III) that transcribe different sets of genes. Each class of RNA polymerase is an assemblage of ten to twelve different polypeptides. Certain subunits of RNAPs, including RPB5 (POLR2E in mammals), are common to all three eukaryotic polymerases. RPB5 plays a role in the transcription activation process. Eukaryotic RPB5 has a bipartite structure consisting of a unique N-terminal region (IPR005571 from INTERPRO), plus a C-terminal region that is structurally homologous to the prokaryotic RPB5 homologue, subunit H (gene rpoH) [, , , ]. This entry represents prokaryotic subunit H and the C-terminal domain of eukaryotic RPB5, which share a two-layer alpha/beta fold, with a core structure of beta/alpha/beta/alpha/beta(2). ; GO: 0003677 DNA binding, 0003899 DNA-directed RNA polymerase activity, 0006351 transcription, DNA-dependent; PDB: 1EIK_A 2Y0S_Z 1DZF_A 3GTG_E 2VUM_E 3GTP_E 3GTO_E 3S17_E 3S1R_E 1I3Q_E ....
Probab=20.38 E-value=63 Score=23.61 Aligned_cols=32 Identities=22% Similarity=0.596 Sum_probs=20.9
Q ss_pred cCCHHHHHHHHHHHhcCcchhhhcCCCCCchhHHhhhChHHH
Q psy6071 92 ALTLQQQQHLLAELQHDSKLVYHIGLTPAKLPDLVENNPLIA 133 (185)
Q Consensus 92 ~L~~~qqq~ll~~L~~Dpklv~~~gltp~kLP~LVEnNP~IA 133 (185)
.|++.|.+.+++. .++.+.+||.+....|.+-
T Consensus 14 ils~eE~~~lL~~----------y~i~~~qLP~I~~~DPv~r 45 (74)
T PF01191_consen 14 ILSEEEKKELLKK----------YNIKPEQLPKILSSDPVAR 45 (74)
T ss_dssp EE-HHHHHHHHHH----------TT--TTCSSEEETTSHHHH
T ss_pred EcCHHHHHHHHHH----------hCCChhhCCcccccChhhh
Confidence 4677777776554 4578889999888888653
No 39
>TIGR03060 PS_II_psb29 photosystem II biogenesis protein Psp29. Psp29, originally designated sll1414 in Synechocystis 6803, is found universally in Cyanobacteria and in Arabidopsis. It was isolated and partially sequenced from purified photosystem II (PS II) in Synechocystis. While its function is unknown, mutant studies show an impairment in photosystem II biogenesis and/or stability, rather than in PS II core function.
Probab=20.18 E-value=91 Score=27.15 Aligned_cols=67 Identities=21% Similarity=0.299 Sum_probs=37.1
Q ss_pred CCCCCccccCCchh-------hhhhcCCCCCCccccccCCccc----cC------CchHHHHHHHHHHHhCcC--CHHHH
Q psy6071 38 YYKPEFLRLVPPLH-------LIWLNLVDPSEHKIAYDSTMCV----SN------SAGVEAKKLMTRAFKSAL--TLQQQ 98 (185)
Q Consensus 38 ~f~P~flRp~Ppl~-------l~Wlnp~~~~~h~~~wD~sm~~----~~------~~~~e~r~Lm~kA~~~~L--~~~qq 98 (185)
.|--.|-||+|+++ ++-++.-.. +-++.||+=.|. .- =+-.+-|+.|=.|+++.+ .++|.
T Consensus 13 ~F~~~~p~pI~siYrrvv~ELLVElHLl~~-n~~F~yDplfAlGlvt~fd~fm~GY~Pee~~~~IF~Alc~a~~~dp~~~ 91 (214)
T TIGR03060 13 AFHAAFPRVIPPLYRRVVDELLVELHLLSH-QSDFKYDPLFALGLVTVFDRFMEGYRPEEHLDALFDALCNSNGFDPEQL 91 (214)
T ss_pred HHHHhCCCCCchHHHHHHHHHHHHHHHHHh-ccCceeCchHHhhHHHHHHHHHcCCCChHHHHHHHHHHHHhcCCCHHHH
Confidence 45567899999999 555554332 235555553221 11 122556777777777765 44444
Q ss_pred HHHHHHH
Q psy6071 99 QHLLAEL 105 (185)
Q Consensus 99 q~ll~~L 105 (185)
.+--+.+
T Consensus 92 r~dA~~l 98 (214)
T TIGR03060 92 REDAKQL 98 (214)
T ss_pred HHHHHHH
Confidence 4433333
Done!