Query 019902
Match_columns 334
No_of_seqs 217 out of 456
Neff 4.8
Searched_HMMs 46136
Date Fri Mar 29 05:26:51 2013
Command hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/019902.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/019902hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 smart00719 Plus3 Short conserv 100.0 6.7E-33 1.5E-37 230.9 11.1 104 164-269 1-109 (109)
2 PF03126 Plus-3: Plus-3 domain 100.0 4.9E-31 1.1E-35 218.5 6.0 102 168-270 1-107 (108)
3 KOG2402 Paf1/RNA polymerase II 100.0 4.6E-30 9.9E-35 259.9 12.4 160 163-331 214-378 (525)
4 KOG1946 RNA polymerase I trans 100.0 3E-30 6.4E-35 241.4 5.8 135 39-175 96-240 (240)
5 COG5296 Transcription factor i 99.9 7.2E-24 1.6E-28 208.2 13.0 140 160-304 205-349 (521)
6 PF02201 SWIB: SWIB/MDM2 domai 99.9 5.7E-23 1.2E-27 161.2 2.5 70 42-117 7-76 (76)
7 smart00151 SWIB SWI complex, B 99.9 7.4E-22 1.6E-26 155.2 7.2 71 42-118 7-77 (77)
8 COG5531 SWIB-domain-containing 99.7 1.6E-18 3.4E-23 162.1 5.8 71 42-118 127-197 (237)
9 PRK14724 DNA topoisomerase III 99.7 5.7E-18 1.2E-22 184.7 7.1 70 42-117 918-987 (987)
10 PRK06319 DNA topoisomerase I/S 99.3 2E-12 4.3E-17 140.0 4.5 71 42-118 790-860 (860)
11 KOG2570 SWI/SNF transcription 99.1 1.2E-10 2.7E-15 116.0 5.2 145 42-196 211-371 (420)
12 KOG2522 Filamentous baseplate 80.1 2.4 5.2E-05 44.2 4.6 53 46-98 367-419 (560)
13 PF10281 Ish1: Putative stress 73.0 6.7 0.00015 26.8 3.8 37 38-74 1-38 (38)
14 PRK05350 acyl carrier protein; 62.8 8.1 0.00017 30.2 2.9 58 61-121 2-59 (82)
15 PHA02696 hypothetical protein; 62.0 5.2 0.00011 31.7 1.7 23 247-269 40-62 (79)
16 PRK05828 acyl carrier protein; 53.5 24 0.00051 28.4 4.2 58 61-121 1-58 (84)
17 PRK12449 acyl carrier protein; 47.6 32 0.0007 26.3 4.1 58 61-121 1-58 (80)
18 CHL00124 acpP acyl carrier pro 42.1 27 0.00058 26.9 2.9 58 61-121 1-58 (82)
19 KOG4814 Uncharacterized conser 37.3 23 0.0005 38.9 2.3 55 272-330 524-585 (872)
20 TIGR03759 conj_TIGR03759 integ 36.7 47 0.001 31.3 4.0 68 256-323 14-89 (200)
21 PTZ00171 acyl carrier protein; 31.8 55 0.0012 29.1 3.5 61 58-121 63-123 (148)
22 cd00454 Trunc_globin Truncated 31.3 82 0.0018 25.5 4.2 41 254-294 71-115 (116)
23 PF06627 DUF1153: Protein of u 27.5 35 0.00076 28.3 1.4 27 250-276 55-81 (90)
24 PTZ00444 hypothetical protein; 26.2 1.3E+02 0.0028 28.0 4.9 30 261-290 39-68 (184)
25 cd00238 ERp29c ERp29 and ERp38 23.3 77 0.0017 26.0 2.6 39 252-290 54-92 (93)
26 PF08349 DUF1722: Protein of u 23.2 2.5E+02 0.0053 23.5 5.8 71 257-327 27-109 (117)
27 PF02037 SAP: SAP domain; Int 23.1 1.3E+02 0.0028 20.1 3.3 32 41-74 4-35 (35)
28 PRK14524 rpsP 30S ribosomal pr 22.6 1.8E+02 0.0038 24.3 4.6 79 199-289 2-80 (94)
29 PF01152 Bac_globin: Bacterial 22.3 1.3E+02 0.0028 24.6 3.9 42 253-294 74-119 (120)
30 smart00513 SAP Putative DNA-bi 20.9 1.8E+02 0.0039 19.2 3.7 33 40-74 3-35 (35)
31 PRK02542 photosystem I assembl 20.7 1.7E+02 0.0037 27.4 4.6 59 213-293 123-182 (188)
32 TIGR00002 S16 ribosomal protei 20.6 2.4E+02 0.0052 22.6 4.9 73 199-286 1-76 (78)
33 CHL00036 ycf4 photosystem I as 20.4 1.6E+02 0.0034 27.5 4.3 59 213-293 119-178 (184)
No 1
>smart00719 Plus3 Short conserved domain in transcriptional regulators. Plus3 domains occur in the Saccharomyces cerevisiae Rtf1p protein, which interacts with Spt6p, and in parsley CIP, which interacts with the bZIP protein CPRF1.
Probab=100.00 E-value=6.7e-33 Score=230.87 Aligned_cols=104 Identities=48% Similarity=0.669 Sum_probs=96.0
Q ss_pred ccCccccceeeeeHHHHHHHhcCcccccccccccEEEEecCCCCCCCccceeEEEEeeeeecCCC---CCcccceeEEEe
Q 019902 164 AIVPDNIKLVYLKRSLVQDLLKDSETFECKVVGSFVRVKSDPNDYLQKNSHQLLQVIGTKKVPGT---DDKSTEILLKVS 240 (334)
Q Consensus 164 ai~~~nI~lIyL~Rsl~~~l~~~P~~Fe~kVvGcFVRI~ig~~~~~~~~~YRLvqV~Gv~k~~~~---g~~~Td~~L~v~ 240 (334)
|++.+||++|||+|+.|++||+.| +|+++|+||||||+||++++++ ++||||||+||.++.+. |++.||++|.+.
T Consensus 1 ~~~l~~l~~i~L~R~~l~~~~~~p-~Fe~~v~GcFVRv~ig~~~~~~-~~Yrl~qI~gv~~~~k~Y~~~~~~t~~~L~v~ 78 (109)
T smart00719 1 AIVFDNLNLLRLRRSLVEELLKPP-TFESKVVGCFVRVKIGPNDQKQ-PIYRLVQVTGVKEADKPYELGGKTTNVLLEVL 78 (109)
T ss_pred CCChHHhheEEEEHHHHHHHhcCC-chhHhceEeEEEEEECCCCCCC-CeEEEEEEeeEEecCcceecCCceeeEEEEEe
Confidence 468999999999999999999999 8999999999999999999987 89999999999997553 788999999994
Q ss_pred C--CceeEEeeeccCCCCCHHHHHHHHHHHH
Q 019902 241 N--FVKDIRIATLSDDNFSEEECEDLRQRVK 269 (334)
Q Consensus 241 ~--~~k~~~i~~iSd~~FTEeE~~r~~q~~~ 269 (334)
+ ..+.|+|++|||+||||+||++|++.|+
T Consensus 79 ~g~~~~~~~i~~iSn~~fte~E~~~w~~~~~ 109 (109)
T smart00719 79 NGDSEKVVQINFISNQDFTEEEFQRWKQAIK 109 (109)
T ss_pred cCCceEEEEEEEecCCCCCHHHHHHHHHHhC
Confidence 3 5579999999999999999999999884
No 2
>PF03126 Plus-3: Plus-3 domain; InterPro: IPR004343 The yeast Paf1 complex consists of Pfa1, Rtf1, Cdc73, Ctr9, and Leo1. The complex regulates histone H2B ubiquitination, histone H3 methylation, RNA polymerase II carboxy-terminal domain (CTD) Ser2 phosphorylation, and RNA 3' end processing. The conservation of Paf1 complex function in higher eukaryotes has been confirmed in human cells, Drosophila and Arabidopsis. The Plus3 domain spans the most conserved regions of the Rtf1 protein and is surrounded by regions of low complexity and coiled-coil propensity []. It contains only a limited number of highly conserved amino acids, among which are three positively charged residues that gave the Plus3 domain its name. The capacity to bind single-stranded DNA is at least one function of the Plus3 domain []. The plus-3 domain is about 90 residues in length and is often found associated with the GYF domain (IPR003169 from INTERPRO). The Plus3 domain structure consists of six alpha helices intervened by a sequence of six beta strands in a mixed alpha/beta topology. Beta strands 1, 2, 5, and 6 compose a four-stranded antiparallel beta sheet with a beta-hairpin insertion formed by strands 3 and 4. The N-terminal helices alpha1-alpha3 and C-terminal helix alpha6 pack together to form an alpha subdomain, while the beta strands and the small 3(10) helix alpha 4 form a beta subdomain. The two subdomains pack together to form a compact, globular protein [].; GO: 0003677 DNA binding, 0006352 transcription initiation, DNA-dependent, 0016570 histone modification, 0005634 nucleus; PDB: 2BZE_A 3U1U_B 2DB9_A.
Probab=99.97 E-value=4.9e-31 Score=218.50 Aligned_cols=102 Identities=35% Similarity=0.494 Sum_probs=85.9
Q ss_pred cccceeeeeHHHHHHHhcCcccccccccccEEEEecCCCCCCCccceeEEEEeeeeecCC---CCCcccceeEEEeC--C
Q 019902 168 DNIKLVYLKRSLVQDLLKDSETFECKVVGSFVRVKSDPNDYLQKNSHQLLQVIGTKKVPG---TDDKSTEILLKVSN--F 242 (334)
Q Consensus 168 ~nI~lIyL~Rsl~~~l~~~P~~Fe~kVvGcFVRI~ig~~~~~~~~~YRLvqV~Gv~k~~~---~g~~~Td~~L~v~~--~ 242 (334)
+||++|||+||+|++||++| +|+++|+||||||+||++++++.++||||||+||.+++. .+++.||++|.+.+ .
T Consensus 1 e~i~~i~l~Rs~l~~~~~~P-~Fe~~v~G~fVRv~ig~~~~~~~~~Yrl~qI~~v~~~~k~Y~~~~~~t~~~L~l~~g~~ 79 (108)
T PF03126_consen 1 EDIKRIRLSRSQLEKWLFKP-FFEDAVVGCFVRVNIGPNDYTQNPVYRLCQIVGVKEGKKPYKLGSKKTNKYLVLRHGNS 79 (108)
T ss_dssp HHHHHTBE-HHHHHHHTTST-THHHHHTT-EEEEEEETTSSS-E--EEEEEEEEEEEEEEEEEETTEEEEEEEEEEETTE
T ss_pred CchHeEEEEHHHHHHHHcCC-ChhHheeEEEEEEEecCCCCCCceeEEEEEEEEEecccccEecCCeEEEEEEEEEECCc
Confidence 58999999999999999999 899999999999999999998877899999999999984 38999999999954 4
Q ss_pred ceeEEeeeccCCCCCHHHHHHHHHHHHh
Q 019902 243 VKDIRIATLSDDNFSEEECEDLRQRVKD 270 (334)
Q Consensus 243 ~k~~~i~~iSd~~FTEeE~~r~~q~~~~ 270 (334)
.+.|+|++|||++|||+||++|++.|++
T Consensus 80 ~r~~~i~~vSn~~~te~E~~~w~~~~~~ 107 (108)
T PF03126_consen 80 ERDFPIDMVSNSPFTEEEFERWKQSCEK 107 (108)
T ss_dssp EEEEEGGGBBSS---HHHHHHHHHHH--
T ss_pred eeEEEeEeeECCCCCHHHHHHHHHHhcc
Confidence 5799999999999999999999999986
No 3
>KOG2402 consensus Paf1/RNA polymerase II complex, RTF1 component (involved in regulation of TATA box-binding protein) [Transcription]
Probab=99.96 E-value=4.6e-30 Score=259.89 Aligned_cols=160 Identities=20% Similarity=0.206 Sum_probs=146.5
Q ss_pred cccCccccceeeeeHHHHHHHhcCcccccccccccEEEEecCCCCCCCccceeEEEEeeeeecCCC---CCcccceeEEE
Q 019902 163 AAIVPDNIKLVYLKRSLVQDLLKDSETFECKVVGSFVRVKSDPNDYLQKNSHQLLQVIGTKKVPGT---DDKSTEILLKV 239 (334)
Q Consensus 163 aai~~~nI~lIyL~Rsl~~~l~~~P~~Fe~kVvGcFVRI~ig~~~~~~~~~YRLvqV~Gv~k~~~~---g~~~Td~~L~v 239 (334)
...+.++++.++|+|+.|+.||+.| +|+++|+||||||+||..+. +++||||+|+||.++..+ |.+.||.+|.|
T Consensus 214 ~v~~~~dln~~rlsR~~vak~~~~P-~Fe~~V~gCfvRv~ig~~~~--~p~YRv~~I~gV~es~k~Y~l~~~~Tnk~l~~ 290 (525)
T KOG2402|consen 214 RVPSISDLNALRLSRHKVAKWLFSP-FFESTVTGCFVRVGIGGEGS--KPVYRVAEIVGVLESDKPYKLEGVKTNKYLRV 290 (525)
T ss_pred CCCCHHHHHHHHhhHHHHHHHHhhh-HHHHHhhhheEEeeecCcCC--CCceeEEEEeeecccCccccccceeecceeee
Confidence 3448889999999999999999999 99999999999999999954 689999999999997764 89999999999
Q ss_pred -eC-CceeEEeeeccCCCCCHHHHHHHHHHHHhCCCCCCcHHHHHHHHHHHHHHHhhhhhHHHHHHHHhHHHHHHhcccc
Q 019902 240 -SN-FVKDIRIATLSDDNFSEEECEDLRQRVKDGLLKRPTVVELQEKTQILHEDITKHWLVRELALLQKLIDRANVKGMA 317 (334)
Q Consensus 240 -~~-~~k~~~i~~iSd~~FTEeE~~r~~q~~~~~~~krPTv~dl~eKa~~i~~~i~~~w~e~El~~L~~lid~A~EkG~~ 317 (334)
+| ..++|.|.+|||++|||+||++|+..|.+..+.+||+..|.+|..+|..++++.+.++++.. ++++|-..
T Consensus 291 ~~G~s~r~f~m~~iSn~~f~e~Efq~w~~~~~~s~~~~PT~~~i~~K~~~i~~a~~~~~sd~~v~~------~v~~k~~~ 364 (525)
T KOG2402|consen 291 RHGRSERVFRMNFISNGEFTEEEFQDWLRACKNSHGIMPTVDLISRKKLDIVKALNYRLSDKEVDQ------MVAEKFEA 364 (525)
T ss_pred ecCcchhhcchhhhcCCcccHHHHHHHHHHHHhhcCCCchHHHHHHHHHHHHHHhcCccCcccHHH------HHHhhhhc
Confidence 55 33699999999999999999999999999999999999999999999999999999999988 88888888
Q ss_pred cccCcchhhhcccc
Q 019902 318 QRISSITSVQCGRM 331 (334)
Q Consensus 318 ~~~~~~~~~~~~~~ 331 (334)
+.-|.||||.=..+
T Consensus 365 ~~~p~N~ameK~~l 378 (525)
T KOG2402|consen 365 SPRPRNVAMEKTGL 378 (525)
T ss_pred CcCcchHHHHHHhH
Confidence 99999999964443
No 4
>KOG1946 consensus RNA polymerase I transcription factor UAF [Transcription]
Probab=99.96 E-value=3e-30 Score=241.43 Aligned_cols=135 Identities=35% Similarity=0.531 Sum_probs=102.1
Q ss_pred cCCCCHHHHHHHhhcCCCC-CCccchHHHHHHHHHHhhhcCCCCCCCCCeeecchhhhhhhCCCccchHhHHHHHHhccc
Q 019902 39 DGWGSIPLIEFLQSLGKDT-TQQISRYDVTDIINKYVNENKLVHSVKKKRVVSDERLLSLFGKKSFLRIKIYDLLEAHYA 117 (334)
Q Consensus 39 ~~w~S~eL~~fl~~~g~d~-~~~lsr~~v~k~lw~YIK~nnLqDP~~Kr~IiCDekLk~lFgk~~v~~f~m~klL~~H~~ 117 (334)
.+|||..|..|+.+|++++ +++|||++|+++||+|||+||||||.||+.|+||++|++|||+++|+||+|+|||.+||+
T Consensus 96 ~~~g~~kl~~ls~~L~~~~G~~~lsR~~vvk~iw~YIke~nLqDP~nkr~IlCDekL~~iF~~k~v~~fem~KLL~~H~~ 175 (240)
T KOG1946|consen 96 ASWGSTKLIPLSPSLARFVGTSELSRTDVVKKIWAYIKEHNLQDPKNKREILCDEKLKSIFGKKRVGMFEMLKLLTKHFL 175 (240)
T ss_pred cCcCcccccccCHHHHhhcccccccHHHHHHHHHHHHHHhccCCccccCeeeeHHHHHHHhccCccceeeHHHHHHHhcc
Confidence 4499999999999998888 999999999999999999999999999999999999999999999999999999999999
Q ss_pred cCCCCCC----C----Cc-cccchhhccccChhhhhccccccccccccCCcccccccCccccceeee
Q 019902 118 ENQFESD----D----DF-LFSSDEENNLFSSEKQKSRKVPLKKLCLETPKSCWAAIVPDNIKLVYL 175 (334)
Q Consensus 118 ~~~~~~~----~----~~-~~~se~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~aai~~~nI~lIyL 175 (334)
++++... . +. .+.++++.+...+...+..+...++ .-.++..||+++.+|++++|+
T Consensus 176 ~~~d~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~e~~~k~~~~~--~~~~~~~~a~~~~~~~~l~~~ 240 (240)
T KOG1946|consen 176 KNQDMVGTDLKQGEETETKEFKEEEDKSQDPCLSEKEAKETKKV--VKKPRGTFAAIVSDNIKLLYL 240 (240)
T ss_pred CccccccchhhcCccccccccccccccccccchhhhhhhhhhhc--ccccccchhhhhccccccccC
Confidence 9886431 1 11 1112222222221112222222111 122334999999999999986
No 5
>COG5296 Transcription factor involved in TATA site selection and in elongation by RNA polymerase II [Transcription]
Probab=99.90 E-value=7.2e-24 Score=208.20 Aligned_cols=140 Identities=21% Similarity=0.285 Sum_probs=125.9
Q ss_pred ccccccCccccceeeeeHHHHHHHhcCcccccccccccEEEEecCCCCCCCccceeEEEEeeeeecCCC---CCccccee
Q 019902 160 SCWAAIVPDNIKLVYLKRSLVQDLLKDSETFECKVVGSFVRVKSDPNDYLQKNSHQLLQVIGTKKVPGT---DDKSTEIL 236 (334)
Q Consensus 160 ~~~aai~~~nI~lIyL~Rsl~~~l~~~P~~Fe~kVvGcFVRI~ig~~~~~~~~~YRLvqV~Gv~k~~~~---g~~~Td~~ 236 (334)
+.||. .-|+|.++|+|+.|+++|++| +|++.|+|||+||+||-.+ |+-+||||+|.++...... |.+.|+.|
T Consensus 205 d~~a~--l~Dfn~~rvgR~~VA~~~f~P-~Fed~v~Gcf~Rv~Ig~R~--g~~vyRiv~V~~~~~~~kpy~~~~v~Tn~y 279 (521)
T COG5296 205 DDFAE--LYDFNQCRVGRDMVARNVFKP-IFEDEVIGCFTRVRIGERG--GYLVYRIVGVGKGSTYSKPYGRKEVKTNRY 279 (521)
T ss_pred chhhh--hhhhhhhhhhHHHHHHHhccc-chhhhhcceeeEEEecccC--CeeEEEEEEeccceeccccccccceeeeee
Confidence 56665 457999999999999999999 7999999999999999443 4579999999999996653 79999999
Q ss_pred EEEe-CCc-eeEEeeeccCCCCCHHHHHHHHHHHHhCCCCCCcHHHHHHHHHHHHHHHhhhhhHHHHHHH
Q 019902 237 LKVS-NFV-KDIRIATLSDDNFSEEECEDLRQRVKDGLLKRPTVVELQEKTQILHEDITKHWLVRELALL 304 (334)
Q Consensus 237 L~v~-~~~-k~~~i~~iSd~~FTEeE~~r~~q~~~~~~~krPTv~dl~eKa~~i~~~i~~~w~e~El~~L 304 (334)
|.|+ |-. |+|+|..|||+||.++||+||...++.|.+.+|++.-|.+|...|-..+.+...++|+..+
T Consensus 280 l~v~~Gr~~kvF~in~~Sn~pf~~~eyQr~~r~~~~~kl~~PS~~~v~~k~~~l~d~~~~~LSdkeis~~ 349 (521)
T COG5296 280 LDVSTGRTYKVFRINNISNSPFLREEYQRVWRSFKVGKLSMPSIAKVKEKYDKLVDTMGRRLSDKEISKM 349 (521)
T ss_pred EeeecCcceeeeEeecccCCcccHHHHHHHHHHHhccccccchHHHHHHHHHHHHHHhCCcCchhHHHHH
Confidence 9995 533 5899999999999999999999999999999999999999999999999999999998544
No 6
>PF02201 SWIB: SWIB/MDM2 domain; InterPro: IPR003121 The SWI/SNF family of complexes, which are conserved from yeast to humans, are ATP-dependent chromatin-remodelling proteins that facilitate transcription activation []. The mammalian complexes are made up of 9-12 proteins called BAFs (BRG1-associated factors). The BAF60 family have at least three members: BAF60a, which is ubiquitous, BAF60b and BAF60c, which are expressed in muscle and pancreatic tissues, respectively. BAF60b is present in alternative forms of the SWI/SNF complex, including complex B (SWIB), which lacks BAF60a. The SWIB domain is a conserved region found within the BAF60b proteins [], and can be found fused to the C terminus of DNA topoisomerase in Chlamydia. MDM2 is an oncoprotein that acts as a cellular inhibitor of the p53 tumour suppressor by binding to the transactivation domain of p53 and suppressing its ability to activate transcription []. p53 acts in response to DNA damage, inducing cell cycle arrest and apoptosis. Inactivation of p53 is a common occurrence in neoplastic transformations. The core of MDM2 folds into an open bundle of four helices, which is capped by two small 3-stranded beta-sheets. It consists of a duplication of two structural repeats. MDM2 has a deep hydrophobic cleft on which the p53 alpha-helix binds; p53 residues involved in transactivation are buried deep within the cleft of MDM2, thereby concealing the p53 transactivation domain. The SWIB and MDM2 domains are homologous and share a common fold.; GO: 0005515 protein binding; PDB: 1V31_A 3FE7_A 3JZQ_B 3EQY_B 2VYR_A 3JZO_A 3DAB_E 3LBJ_E 3FEA_A 3FDO_A ....
Probab=99.86 E-value=5.7e-23 Score=161.18 Aligned_cols=70 Identities=36% Similarity=0.511 Sum_probs=65.2
Q ss_pred CCHHHHHHHhhcCCCCCCccchHHHHHHHHHHhhhcCCCCCCCCCeeecchhhhhhhCCCccchHhHHHHHHhccc
Q 019902 42 GSIPLIEFLQSLGKDTTQQISRYDVTDIINKYVNENKLVHSVKKKRVVSDERLLSLFGKKSFLRIKIYDLLEAHYA 117 (334)
Q Consensus 42 ~S~eL~~fl~~~g~d~~~~lsr~~v~k~lw~YIK~nnLqDP~~Kr~IiCDekLk~lFgk~~v~~f~m~klL~~H~~ 117 (334)
.|++|++|+| ..++||++|++.||+|||+||||||+||+.|+||++|+.|||+++|++|+|+++|.+|+.
T Consensus 7 ls~~L~~~lg------~~~~sr~~v~~~lw~YIk~~~L~dp~~k~~I~cD~~L~~lf~~~~v~~~~i~~~l~~hl~ 76 (76)
T PF02201_consen 7 LSPELAEFLG------EDELSRSEVVKRLWQYIKENNLQDPKDKRIIICDEKLKKLFGKDSVNFFEIPKLLKPHLI 76 (76)
T ss_dssp HHHHHHHHTT-------SCEEHHHHHHHHHHHHHHTTSBESSSTTEEE-TTSHHHHHHTSECSEEETTHHHHHHHE
T ss_pred CCHHHHHHhC------CCCCCHHHHHHHHHHHHHHhcCCCcccCceEecCHHHHHHhCCCeecHhhHHHHHHHhcC
Confidence 5899999998 466999999999999999999999999999999999999999999999999999999984
No 7
>smart00151 SWIB SWI complex, BAF60b domains.
Probab=99.86 E-value=7.4e-22 Score=155.23 Aligned_cols=71 Identities=23% Similarity=0.420 Sum_probs=68.2
Q ss_pred CCHHHHHHHhhcCCCCCCccchHHHHHHHHHHhhhcCCCCCCCCCeeecchhhhhhhCCCccchHhHHHHHHhcccc
Q 019902 42 GSIPLIEFLQSLGKDTTQQISRYDVTDIINKYVNENKLVHSVKKKRVVSDERLLSLFGKKSFLRIKIYDLLEAHYAE 118 (334)
Q Consensus 42 ~S~eL~~fl~~~g~d~~~~lsr~~v~k~lw~YIK~nnLqDP~~Kr~IiCDekLk~lFgk~~v~~f~m~klL~~H~~~ 118 (334)
.|++|++|+| .+++||++|++.||+|||+||||||.||+.|+||+.|+.+||+++|.+++|+++|.+|+.+
T Consensus 7 ls~~L~~~lg------~~~~tr~ev~~~lw~YIk~n~L~d~~~k~~i~~D~~L~~l~~~~~v~~~~~~~ll~~Hl~~ 77 (77)
T smart00151 7 LSPELAKVLG------APEMTRTEIIKRLWEYIKEHNLQDPQNKREILCDSKLEQIFGKDRMDMFEMNKLLTPHLIK 77 (77)
T ss_pred CCHHHHHHhC------CCcCcHHHHHHHHHHHHHHhcccCCccCCEEecCHHHHHHHCcCeecHHHHHHHHHHHcCC
Confidence 6999999998 4899999999999999999999999999999999999999999999999999999999864
No 8
>COG5531 SWIB-domain-containing proteins implicated in chromatin remodeling [Chromatin structure and dynamics]
Probab=99.74 E-value=1.6e-18 Score=162.10 Aligned_cols=71 Identities=18% Similarity=0.275 Sum_probs=69.0
Q ss_pred CCHHHHHHHhhcCCCCCCccchHHHHHHHHHHhhhcCCCCCCCCCeeecchhhhhhhCCCccchHhHHHHHHhcccc
Q 019902 42 GSIPLIEFLQSLGKDTTQQISRYDVTDIINKYVNENKLVHSVKKKRVVSDERLLSLFGKKSFLRIKIYDLLEAHYAE 118 (334)
Q Consensus 42 ~S~eL~~fl~~~g~d~~~~lsr~~v~k~lw~YIK~nnLqDP~~Kr~IiCDekLk~lFgk~~v~~f~m~klL~~H~~~ 118 (334)
.|+.|+.||| ..++||+|||+.||+|||+||||||.||+.|+||++|+.|||.+.+.||+|+++|.+|+.+
T Consensus 127 lS~~La~ilG------~~~~tr~~~v~~lw~YIk~h~lq~~~nkr~I~~D~~L~~v~g~~p~~mf~~~k~l~~hl~~ 197 (237)
T COG5531 127 LSPKLAAILG------LEPGTRPEAVKKLWKYIKKHNLQDPNNKRLILCDSKLKKVLGSDPIDMFELTKPLSPHLIK 197 (237)
T ss_pred cCHHHHHHhC------CCCCCccHHHHHHHHHHHHhcCCCccccceecccHHHHHHhCCCchhhhhhhcccccceec
Confidence 6999999998 5899999999999999999999999999999999999999998899999999999999987
No 9
>PRK14724 DNA topoisomerase III; Provisional
Probab=99.72 E-value=5.7e-18 Score=184.70 Aligned_cols=70 Identities=20% Similarity=0.399 Sum_probs=67.7
Q ss_pred CCHHHHHHHhhcCCCCCCccchHHHHHHHHHHhhhcCCCCCCCCCeeecchhhhhhhCCCccchHhHHHHHHhccc
Q 019902 42 GSIPLIEFLQSLGKDTTQQISRYDVTDIINKYVNENKLVHSVKKKRVVSDERLLSLFGKKSFLRIKIYDLLEAHYA 117 (334)
Q Consensus 42 ~S~eL~~fl~~~g~d~~~~lsr~~v~k~lw~YIK~nnLqDP~~Kr~IiCDekLk~lFgk~~v~~f~m~klL~~H~~ 117 (334)
.|++|++||| ..++||++|++.||+|||+||||||.||+.|+||++|+.|||+++|.||+|+++|.+|+.
T Consensus 918 ls~~La~~lg------~~~~~r~~v~~~lW~YIK~~~Lqdp~~k~~i~cD~~L~~vfg~~~~~~~~~~~~l~~hl~ 987 (987)
T PRK14724 918 PSAALAAVIG------AEPVARPEVIKKLWDYIKANNLQDPADKRAINADAKLRPVFGKDQVTMFELAGIVGKHLS 987 (987)
T ss_pred CCHHHHHHhC------CCcCCHHHHHHHHHHHHHHccCCCcccCCeeccchHHHHHhCCCcccHHHHHHHHHHhcC
Confidence 6999999998 578999999999999999999999999999999999999999999999999999999984
No 10
>PRK06319 DNA topoisomerase I/SWI domain fusion protein; Validated
Probab=99.28 E-value=2e-12 Score=140.04 Aligned_cols=71 Identities=20% Similarity=0.332 Sum_probs=67.4
Q ss_pred CCHHHHHHHhhcCCCCCCccchHHHHHHHHHHhhhcCCCCCCCCCeeecchhhhhhhCCCccchHhHHHHHHhcccc
Q 019902 42 GSIPLIEFLQSLGKDTTQQISRYDVTDIINKYVNENKLVHSVKKKRVVSDERLLSLFGKKSFLRIKIYDLLEAHYAE 118 (334)
Q Consensus 42 ~S~eL~~fl~~~g~d~~~~lsr~~v~k~lw~YIK~nnLqDP~~Kr~IiCDekLk~lFgk~~v~~f~m~klL~~H~~~ 118 (334)
.|++|+.|++ +.+++|.++++.||+|||+|+||||.+|+.|+||++|+.+||++++.||.|+++|+.|+.+
T Consensus 790 ~S~~La~~~g------~~~~sr~~~~~~lw~yIk~~~lqdp~~Kr~i~~d~kl~kvf~~~~~~~~~~~k~l~~hl~~ 860 (860)
T PRK06319 790 PSPALAAMIG------AEPVGRGEATKKVWDYIKEHGLQSPENKKLIIPDSKLQGVIGPDPIDMFQLSKKLSQHLIK 860 (860)
T ss_pred cccccccccC------cCccCchHHHHHHHHHHHHhcccCccccccCCCchhhhhhhCcCccchhhhHHHHHhhhcC
Confidence 5899999987 5899999999999999999999999999999999999999999999999999999999863
No 11
>KOG2570 consensus SWI/SNF transcription activation complex subunit [Chromatin structure and dynamics; Transcription]
Probab=99.06 E-value=1.2e-10 Score=116.01 Aligned_cols=145 Identities=17% Similarity=0.165 Sum_probs=103.2
Q ss_pred CCHHHHHHHhhcCCCCCCccchHHHHHHHHHHhhhcCCCCCCCCCeeecchhhhhhhCCCccchHhHHHHHHhccccCCC
Q 019902 42 GSIPLIEFLQSLGKDTTQQISRYDVTDIINKYVNENKLVHSVKKKRVVSDERLLSLFGKKSFLRIKIYDLLEAHYAENQF 121 (334)
Q Consensus 42 ~S~eL~~fl~~~g~d~~~~lsr~~v~k~lw~YIK~nnLqDP~~Kr~IiCDekLk~lFgk~~v~~f~m~klL~~H~~~~~~ 121 (334)
.||.|+..|| -..=||++++..||.|||-|+||||.++..|.||.-|+.+||++++.+.+++.+|.+|+.+.++
T Consensus 211 lsp~La~lLG------i~t~Trp~iI~alWqYIk~n~Lqd~~e~~~incD~~l~qif~~~rl~F~elp~~l~~lL~P~dP 284 (420)
T KOG2570|consen 211 LSPRLANLLG------IHTGTRPDIVTALWQYIKTNKLQDPEDSDFINCDKALEQIFGVDRLKFPELPQLLNPLLSPPDP 284 (420)
T ss_pred cCHHHHHHhh------hccCcchHHHHHHHHHHHHhccCCcccchhhcchHHHHHhhcccccccccchhhhhhccCCCCC
Confidence 7999999998 3666899999999999999999999999999999999999999999999999999999998876
Q ss_pred CC-CCCccccc--------hhhccccC-hhhhhccccccccccccCCcccccc------cCccccceeeeeHHHHHHHhc
Q 019902 122 ES-DDDFLFSS--------DEENNLFS-SEKQKSRKVPLKKLCLETPKSCWAA------IVPDNIKLVYLKRSLVQDLLK 185 (334)
Q Consensus 122 ~~-~~~~~~~s--------e~d~~~~~-~~~~~~~~~~~~~~~~~~~~~~~aa------i~~~nI~lIyL~Rsl~~~l~~ 185 (334)
.- ++.|..|. +|.++-+. +.+..-+.- .........-+| -+.++|+...-+|.+|.++..
T Consensus 285 Ivi~h~I~v~~~~~~~tacyDIdV~v~~p~~~q~~nf----l~~~~~~~eI~alD~kI~~li~~ine~~~rr~Ff~~Fs~ 360 (420)
T KOG2570|consen 285 IVIDHTISVDGNDTKVTACYDIDVEVEDPRKSQMSNF----LASTESQKEIAALDRKITELIQQINESKERRDFFLEFSK 360 (420)
T ss_pred eeecceeccCCCcccceeEEeeeeccCCccchhhhhh----hcchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh
Confidence 43 34443332 12111111 111100000 000001111111 134678888999999999999
Q ss_pred Ccccccccccc
Q 019902 186 DSETFECKVVG 196 (334)
Q Consensus 186 ~P~~Fe~kVvG 196 (334)
+|-.|-...+-
T Consensus 361 dPvefin~wi~ 371 (420)
T KOG2570|consen 361 DPVEFINEWIE 371 (420)
T ss_pred CHHHHHHHHHH
Confidence 99778766553
No 12
>KOG2522 consensus Filamentous baseplate protein Ligatin, contains PUA domain [Translation, ribosomal structure and biogenesis]
Probab=80.09 E-value=2.4 Score=44.18 Aligned_cols=53 Identities=17% Similarity=0.364 Sum_probs=44.8
Q ss_pred HHHHHhhcCCCCCCccchHHHHHHHHHHhhhcCCCCCCCCCeeecchhhhhhh
Q 019902 46 LIEFLQSLGKDTTQQISRYDVTDIINKYVNENKLVHSVKKKRVVSDERLLSLF 98 (334)
Q Consensus 46 L~~fl~~~g~d~~~~lsr~~v~k~lw~YIK~nnLqDP~~Kr~IiCDekLk~lF 98 (334)
..+.+.+.|-+-..-++-.||...+-.||++|||-|+.||..|+-|+=|-...
T Consensus 367 ~~~Lf~evg~~kg~lyt~seir~~V~kYi~knnLad~~nKg~VrLDpILfd~~ 419 (560)
T KOG2522|consen 367 AKDLFKEVGLAKGTLYTSSEIRSAVSKYISKNNLADTKNKGKVRLDPILFDMV 419 (560)
T ss_pred HHHHHHhcCccccceeeHHHHHHHHHHHhhhhhccccccCCcEEeccHHHHHH
Confidence 34566666777778899999999999999999999999999998888777643
No 13
>PF10281 Ish1: Putative stress-responsive nuclear envelope protein; InterPro: IPR018803 This group of proteins, found primarily in fungi, consists of putative stress-responsive nuclear envelope protein Ish1 and homologues [].
Probab=72.96 E-value=6.7 Score=26.77 Aligned_cols=37 Identities=24% Similarity=0.452 Sum_probs=31.7
Q ss_pred ccCCCCHHHHHHHhhcCCCCCCcc-chHHHHHHHHHHh
Q 019902 38 FDGWGSIPLIEFLQSLGKDTTQQI-SRYDVTDIINKYV 74 (334)
Q Consensus 38 ~~~w~S~eL~~fl~~~g~d~~~~l-sr~~v~k~lw~YI 74 (334)
|..|-..+|.+||.+-|-...+.. +|.++++.+-+|+
T Consensus 1 fdtWs~~~L~~wL~~~gi~~~~~~~~rd~Ll~~~k~~y 38 (38)
T PF10281_consen 1 FDTWSDSDLKSWLKSHGIPVPKSAKTRDELLKLAKKNY 38 (38)
T ss_pred CCCCCHHHHHHHHHHcCCCCCCCCCCHHHHHHHHHHhC
Confidence 567999999999999888777777 9999999987764
No 14
>PRK05350 acyl carrier protein; Provisional
Probab=62.75 E-value=8.1 Score=30.19 Aligned_cols=58 Identities=9% Similarity=0.092 Sum_probs=49.0
Q ss_pred cchHHHHHHHHHHhhhcCCCCCCCCCeeecchhhhhhhCCCccchHhHHHHHHhccccCCC
Q 019902 61 ISRYDVTDIINKYVNENKLVHSVKKKRVVSDERLLSLFGKKSFLRIKIYDLLEAHYAENQF 121 (334)
Q Consensus 61 lsr~~v~k~lw~YIK~nnLqDP~~Kr~IiCDekLk~lFgk~~v~~f~m~klL~~H~~~~~~ 121 (334)
|+|.++...|.++|.+. +.-|. -.|-.|..|..-+|-+++.+.++.-.|+.||--..+
T Consensus 2 m~~~~i~~~v~~ii~~~-~~~~~--~~i~~d~~l~~dlg~DSld~veli~~lE~~fgI~i~ 59 (82)
T PRK05350 2 MTREEILERLRAILVEL-FEIDP--EDITPEANLYEDLDLDSIDAVDLVVHLQKLTGKKIK 59 (82)
T ss_pred CCHHHHHHHHHHHHHHH-hCCCH--HHCCCCccchhhcCCCHHHHHHHHHHHHHHHCCccC
Confidence 78999999999999987 54233 369999999888899999999999999999975544
No 15
>PHA02696 hypothetical protein; Provisional
Probab=61.98 E-value=5.2 Score=31.73 Aligned_cols=23 Identities=48% Similarity=0.569 Sum_probs=19.4
Q ss_pred EeeeccCCCCCHHHHHHHHHHHH
Q 019902 247 RIATLSDDNFSEEECEDLRQRVK 269 (334)
Q Consensus 247 ~i~~iSd~~FTEeE~~r~~q~~~ 269 (334)
-|..|.|.-|||+||++=.|.+.
T Consensus 40 ViQtCdDDYFTEeEFdDgkQvVa 62 (79)
T PHA02696 40 VIQTCDDDYFTEEEFDDGKQVVA 62 (79)
T ss_pred eeeecccccccHhhcccHHHHHH
Confidence 46788899999999999888753
No 16
>PRK05828 acyl carrier protein; Validated
Probab=53.54 E-value=24 Score=28.35 Aligned_cols=58 Identities=9% Similarity=0.083 Sum_probs=47.0
Q ss_pred cchHHHHHHHHHHhhhcCCCCCCCCCeeecchhhhhhhCCCccchHhHHHHHHhccccCCC
Q 019902 61 ISRYDVTDIINKYVNENKLVHSVKKKRVVSDERLLSLFGKKSFLRIKIYDLLEAHYAENQF 121 (334)
Q Consensus 61 lsr~~v~k~lw~YIK~nnLqDP~~Kr~IiCDekLk~lFgk~~v~~f~m~klL~~H~~~~~~ 121 (334)
|+|.++..+|-+-|.+.++.=+.+ .|.+|..|.. +|-+++.+.++--.|+.+|--..+
T Consensus 1 m~~~eI~~~i~~ii~e~~~~~~~d--~i~~~~~~~d-Lg~DSLd~velv~~lE~~f~I~i~ 58 (84)
T PRK05828 1 MQEMEILLKIKEIAKKKNFAVTLD--ESNINKPYRE-LKIDSLDMFSIIVSLESEFNIEFS 58 (84)
T ss_pred CCHHHHHHHHHHHHHHhccCCCcc--cccCCCCHHh-cCCCHHHHHHHHHHHHHHHCCCcC
Confidence 689999999999998865533322 3677888976 999999999999999999975544
No 17
>PRK12449 acyl carrier protein; Provisional
Probab=47.65 E-value=32 Score=26.34 Aligned_cols=58 Identities=9% Similarity=0.111 Sum_probs=47.2
Q ss_pred cchHHHHHHHHHHhhhcCCCCCCCCCeeecchhhhhhhCCCccchHhHHHHHHhccccCCC
Q 019902 61 ISRYDVTDIINKYVNENKLVHSVKKKRVVSDERLLSLFGKKSFLRIKIYDLLEAHYAENQF 121 (334)
Q Consensus 61 lsr~~v~k~lw~YIK~nnLqDP~~Kr~IiCDekLk~lFgk~~v~~f~m~klL~~H~~~~~~ 121 (334)
|+|.++...|-+++.+.-=.+|. .|..|..|..-+|-+++.+.++.-.|+..|--..+
T Consensus 1 m~~~~i~~~l~~il~~~~~~~~~---~i~~~~~l~~dlg~DSl~~~~li~~lE~~f~i~i~ 58 (80)
T PRK12449 1 MTREEIFERLINLIQKQRSYLSL---AITEQTHLKDDLAVDSIELVEFIINVEDEFHIAIP 58 (80)
T ss_pred CCHHHHHHHHHHHHHHHhCCCcc---ccCCCCcHHHHcCCcHHHHHHHHHHHHHHhCCCCC
Confidence 56788888888888875433443 49999999999999999999999999998865543
No 18
>CHL00124 acpP acyl carrier protein; Validated
Probab=42.09 E-value=27 Score=26.91 Aligned_cols=58 Identities=14% Similarity=0.200 Sum_probs=48.3
Q ss_pred cchHHHHHHHHHHhhhcCCCCCCCCCeeecchhhhhhhCCCccchHhHHHHHHhccccCCC
Q 019902 61 ISRYDVTDIINKYVNENKLVHSVKKKRVVSDERLLSLFGKKSFLRIKIYDLLEAHYAENQF 121 (334)
Q Consensus 61 lsr~~v~k~lw~YIK~nnLqDP~~Kr~IiCDekLk~lFgk~~v~~f~m~klL~~H~~~~~~ 121 (334)
|+|.+|...|-++|.+.-=.+|. .|..|..|..-+|-+++.+.++.-.|+.+|--..+
T Consensus 1 M~~~~i~~~l~~ii~~~~~~~~~---~i~~d~~l~~dlg~DSl~~~eli~~le~~f~i~i~ 58 (82)
T CHL00124 1 MTKNDIFEKVQSIVAEQLGIEKS---EVTLDANFTRDLGADSLDVVELVMAIEEKFDIEIP 58 (82)
T ss_pred CCHHHHHHHHHHHHHHHHCCCHH---HCCCCcchhhhcCCcHHHHHHHHHHHHHHHCCccC
Confidence 57888999999999887433443 49999999999999999999999999999876554
No 19
>KOG4814 consensus Uncharacterized conserved protein [Function unknown]
Probab=37.25 E-value=23 Score=38.92 Aligned_cols=55 Identities=18% Similarity=0.042 Sum_probs=40.5
Q ss_pred CCCCCcHHHHHHHHHHHHHHHhh-------hhhHHHHHHHHhHHHHHHhcccccccCcchhhhccc
Q 019902 272 LLKRPTVVELQEKTQILHEDITK-------HWLVRELALLQKLIDRANVKGMAQRISSITSVQCGR 330 (334)
Q Consensus 272 ~~krPTv~dl~eKa~~i~~~i~~-------~w~e~El~~L~~lid~A~EkG~~~~~~~~~~~~~~~ 330 (334)
+-+.||+.+.+.||..+-+.++= .|=++ +-.+ =-+|||.-||+.+..|-|++||.
T Consensus 524 L~~~~~~~~~~~ka~e~~~~~~Cln~~~~~~FGe~-~~~~---E~~~nE~~WFa~T~WNLav~C~K 585 (872)
T KOG4814|consen 524 LNYLPTVQKLLQKAVEFLKTVKCLNQLPPDVFGEA-IYQQ---EVAVNEIEWFASTSWNLAVKCLK 585 (872)
T ss_pred HhhccchhHHHHHHHHHHHHHHHHhhcCCchhhhH-HHHH---HhhhhHHHHHHHHHHHHHHHhcc
Confidence 45789999999999998765432 13333 2222 23788999999999999999984
No 20
>TIGR03759 conj_TIGR03759 integrating conjugative element protein, PFL_4693 family. Members of this protein family, such as model protein PFL_4693 from Pseudomonas fluorescens Pf-5, belong to extended genomic regions that appear to be spread by conjugative transfer. Most members have a predicted N-terminal signal sequence. The function is unknown.
Probab=36.68 E-value=47 Score=31.27 Aligned_cols=68 Identities=15% Similarity=0.247 Sum_probs=42.6
Q ss_pred CCHHHHHHHHHHHHh------CCCCCCcHHHHHHHHHHHHHHHhhhhhHHHHHHHHhHH--HHHHhcccccccCcc
Q 019902 256 FSEEECEDLRQRVKD------GLLKRPTVVELQEKTQILHEDITKHWLVRELALLQKLI--DRANVKGMAQRISSI 323 (334)
Q Consensus 256 FTEeE~~r~~q~~~~------~~~krPTv~dl~eKa~~i~~~i~~~w~e~El~~L~~li--d~A~EkG~~~~~~~~ 323 (334)
+|++|+.||++.|+. -++...|.--|+.+-.+=..-...-|-..|-.+.++.+ ++|--+-|.|-||--
T Consensus 14 Lt~~Ew~RY~~LmqG~rG~~SPgLdPLtaLGIeArsd~ERrryAEl~vk~E~~rvekeLA~qrayd~A~~RL~p~~ 89 (200)
T TIGR03759 14 LTEDEWQRYQQLMQGPRGVYSPGLDPLTALGIEARSDEERRRYAELWVKQEAQRVEKELAFQRAYDAAWQRLYPNV 89 (200)
T ss_pred CCHHHHHHHHHHhcCCccCcCCCCChhhhhccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCC
Confidence 799999999999983 23556666666655444444455555566666665543 344444467776643
No 21
>PTZ00171 acyl carrier protein; Provisional
Probab=31.76 E-value=55 Score=29.10 Aligned_cols=61 Identities=15% Similarity=0.214 Sum_probs=51.4
Q ss_pred CCccchHHHHHHHHHHhhhcCCCCCCCCCeeecchhhhhhhCCCccchHhHHHHHHhccccCCC
Q 019902 58 TQQISRYDVTDIINKYVNENKLVHSVKKKRVVSDERLLSLFGKKSFLRIKIYDLLEAHYAENQF 121 (334)
Q Consensus 58 ~~~lsr~~v~k~lw~YIK~nnLqDP~~Kr~IiCDekLk~lFgk~~v~~f~m~klL~~H~~~~~~ 121 (334)
..-|++.+|...|.++|.+.-=-+|. .|..|..+..-+|-+++.+.++--.|+.+|--..+
T Consensus 63 ~~~~~~~~v~~~l~eiiae~l~vd~~---~I~~ds~~~~dLg~DSLd~veLv~~LEdeFgI~Ip 123 (148)
T PTZ00171 63 QYLLSKEDVLTRVKKVVKNFEKVDAS---KITPESNFVKDLGADSLDVVELLIAIEQEFNLTIP 123 (148)
T ss_pred ccccCHHHHHHHHHHHHHHHhCCCHh---hCCCCcchhhhcCCCHHHHHHHHHHHHHHHCCccC
Confidence 45688999999999999988533443 48899999999999999999999999999865554
No 22
>cd00454 Trunc_globin Truncated hemoglobins (trHbs) are a family of oxygen-binding heme proteins found in cyanobacteria, eubacteria, unicellular eukaryotes, and plants. The truncated hemoglobins have a characteristic two-over-two alpha helical folding pattern that is distinct from the three-over-three pattern found in other globins. A subset of these have been demonstrated to form homodimers.
Probab=31.29 E-value=82 Score=25.48 Aligned_cols=41 Identities=10% Similarity=0.110 Sum_probs=32.6
Q ss_pred CCCCHHHHHHHHH----HHHhCCCCCCcHHHHHHHHHHHHHHHhh
Q 019902 254 DNFSEEECEDLRQ----RVKDGLLKRPTVVELQEKTQILHEDITK 294 (334)
Q Consensus 254 ~~FTEeE~~r~~q----~~~~~~~krPTv~dl~eKa~~i~~~i~~ 294 (334)
-++|+++|++|.. ++..-+.+-+...++...+..+...+++
T Consensus 71 ~~I~~~~f~~~l~~l~~al~~~~~~~~~~~~~~~~~~~~~~~~v~ 115 (116)
T cd00454 71 FPITEEEFDAWLELLRDALDELGVPAELADALLARAERIADHMVN 115 (116)
T ss_pred CCCCHHHHHHHHHHHHHHHHHhCCCHHHHHHHHHHHHHHHHHHhc
Confidence 4899999999986 5555666778888888888888777765
No 23
>PF06627 DUF1153: Protein of unknown function (DUF1153); InterPro: IPR009534 This family consists of several short, hypothetical bacterial proteins of unknown function.; PDB: 2OA4_A 2JRT_A.
Probab=27.48 E-value=35 Score=28.29 Aligned_cols=27 Identities=22% Similarity=0.115 Sum_probs=19.9
Q ss_pred eccCCCCCHHHHHHHHHHHHhCCCCCC
Q 019902 250 TLSDDNFSEEECEDLRQRVKDGLLKRP 276 (334)
Q Consensus 250 ~iSd~~FTEeE~~r~~q~~~~~~~krP 276 (334)
.+.-=.+|++||..|.+.+..++..-+
T Consensus 55 A~~rY~Ls~eEf~~W~~av~rhge~aL 81 (90)
T PF06627_consen 55 ACRRYGLSEEEFESWQRAVDRHGENAL 81 (90)
T ss_dssp HHHCTTSSHHHHHHHHHHCCT--TTSS
T ss_pred HHHHhCCCHHHHHHHHHHHHHHhHHHH
Confidence 344457899999999999999887654
No 24
>PTZ00444 hypothetical protein; Provisional
Probab=26.21 E-value=1.3e+02 Score=28.01 Aligned_cols=30 Identities=17% Similarity=0.216 Sum_probs=24.3
Q ss_pred HHHHHHHHHhCCCCCCcHHHHHHHHHHHHH
Q 019902 261 CEDLRQRVKDGLLKRPTVVELQEKTQILHE 290 (334)
Q Consensus 261 ~~r~~q~~~~~~~krPTv~dl~eKa~~i~~ 290 (334)
+++--+.++.....+||-.+|.+|...|.+
T Consensus 39 ikDia~e~k~~~e~lPt~~~v~~~~~~id~ 68 (184)
T PTZ00444 39 IKDIAKELKLASETLPTPEQVSAKIHRIDK 68 (184)
T ss_pred HHHHHHHHHHHHhcCCCHHHHHHHHHhcCH
Confidence 566677888889999999999999665554
No 25
>cd00238 ERp29c ERp29 and ERp38, C-terminal domain; composed of the protein disulfide isomerase (PDI)-like proteins ERp29 and ERp38. ERp29 (also called ERp28) is a ubiquitous endoplasmic reticulum (ER)-resident protein expressed in high levels in secretory cells. It contains a redox inactive TRX-like domain at the N-terminus. The expression profile of ERp29 suggests a role in secretory protein production, distinct from that of PDI. It has also been identified as a member of the thyroglobulin folding complex and is essential in regulating the secretion of thyroglobulin. The Drosophila homolog, Wind, is the product of windbeutel, an essential gene in the development of dorsal-ventral patterning. Wind is required for correct targeting of Pipe, a Golgi-resident type II transmembrane protein with homology to 2-O-sulfotransferase. ERp38 is a P5-like protein, first isolated from alfalfa (the cDNA clone was named G1), which contains two redox active TRX domains at the N-terminus, like human P5.
Probab=23.28 E-value=77 Score=25.95 Aligned_cols=39 Identities=21% Similarity=0.308 Sum_probs=33.9
Q ss_pred cCCCCCHHHHHHHHHHHHhCCCCCCcHHHHHHHHHHHHH
Q 019902 252 SDDNFSEEECEDLRQRVKDGLLKRPTVVELQEKTQILHE 290 (334)
Q Consensus 252 Sd~~FTEeE~~r~~q~~~~~~~krPTv~dl~eKa~~i~~ 290 (334)
-..+|-+.|..|+..-+..+.+..-+.+++..|...|..
T Consensus 54 kg~~yv~~E~~RL~~iL~~~~ls~~K~del~~R~NIL~~ 92 (93)
T cd00238 54 KGEDYVEKELARLERLLEKKGLAPEKADELTRRLNILRS 92 (93)
T ss_pred cchhHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHhc
Confidence 478999999999999999888899999999988887754
No 26
>PF08349 DUF1722: Protein of unknown function (DUF1722); InterPro: IPR013560 This domain of unknown function is found in bacteria and archaea and is homologous to the hypothetical protein ybgA from Escherichia coli.
Probab=23.18 E-value=2.5e+02 Score=23.49 Aligned_cols=71 Identities=20% Similarity=0.140 Sum_probs=59.3
Q ss_pred CHHHHHHHHHHHHhCCCCCCcHHHHHHHHHHHHHHHhhhhhHHHHHHHHhHHHHHHhc------------ccccccCcch
Q 019902 257 SEEECEDLRQRVKDGLLKRPTVVELQEKTQILHEDITKHWLVRELALLQKLIDRANVK------------GMAQRISSIT 324 (334)
Q Consensus 257 TEeE~~r~~q~~~~~~~krPTv~dl~eKa~~i~~~i~~~w~e~El~~L~~lid~A~Ek------------G~~~~~~~~~ 324 (334)
.++-++.|.+.+.......||.+..-+=+..|-..+.++.+++|...+..+|+.=.+. .|-+++|..|
T Consensus 27 ~~~~~~~Y~~~l~~al~~~~~~~~~~Nvl~Hi~Gyfk~~ls~~EK~~~~~~i~~yr~g~i~l~~~l~~L~~~~~ry~~~Y 106 (117)
T PF08349_consen 27 LEEVFEEYEELLMEALSKPPTRGSHINVLQHIFGYFKKKLSSEEKQHFLDLIEDYREGKIPLSVPLTLLKHLARRYPDEY 106 (117)
T ss_pred hHHHHHHHHHHHHHHHhcCCCchhHHHHHHHHHHHHHHhCCHHHHHHHHHHHHHHHcCCccHHHHHHHHHHHHHHCCCHH
Confidence 6788999999999999999999999999999999999999999999999988865443 3445566555
Q ss_pred hhh
Q 019902 325 SVQ 327 (334)
Q Consensus 325 ~~~ 327 (334)
-+.
T Consensus 107 L~~ 109 (117)
T PF08349_consen 107 LLE 109 (117)
T ss_pred Hhh
Confidence 443
No 27
>PF02037 SAP: SAP domain; InterPro: IPR003034 The SAP (after SAF-A/B, Acinus and PIAS) motif is a putative DNA binding domain found in diverse nuclear proteins involved in chromosomal organisation [], including in apoptosis []. In yeast, SAP is found in the most distal N-terminal region of E3 SUMO-protein ligase SIZ1, where it is involved in nuclear localization [].; GO: 0003676 nucleic acid binding; PDB: 2RNN_A 1JEQ_A 2KW9_A 2KVU_A 2DO1_A 1ZBU_B 1ZBH_A 2DO5_A 2RNO_A 1H1J_S ....
Probab=23.15 E-value=1.3e+02 Score=20.08 Aligned_cols=32 Identities=16% Similarity=0.191 Sum_probs=24.8
Q ss_pred CCCHHHHHHHhhcCCCCCCccchHHHHHHHHHHh
Q 019902 41 WGSIPLIEFLQSLGKDTTQQISRYDVTDIINKYV 74 (334)
Q Consensus 41 w~S~eL~~fl~~~g~d~~~~lsr~~v~k~lw~YI 74 (334)
|--++|.+++...|-.++. ++.+++.+|-+||
T Consensus 4 l~v~eLk~~l~~~gL~~~G--~K~~Li~Rl~~~l 35 (35)
T PF02037_consen 4 LTVAELKEELKERGLSTSG--KKAELIERLKEHL 35 (35)
T ss_dssp SHHHHHHHHHHHTTS-STS--SHHHHHHHHHHHH
T ss_pred CcHHHHHHHHHHCCCCCCC--CHHHHHHHHHHhC
Confidence 4457899999988877655 4899999998886
No 28
>PRK14524 rpsP 30S ribosomal protein S16; Provisional
Probab=22.57 E-value=1.8e+02 Score=24.26 Aligned_cols=79 Identities=11% Similarity=0.123 Sum_probs=45.9
Q ss_pred EEEecCCCCCCCccceeEEEEeeeeecCCCCCcccceeEEEeCCceeEEeeeccCCCCCHHHHHHHHHHHHhCCCCCCcH
Q 019902 199 VRVKSDPNDYLQKNSHQLLQVIGTKKVPGTDDKSTEILLKVSNFVKDIRIATLSDDNFSEEECEDLRQRVKDGLLKRPTV 278 (334)
Q Consensus 199 VRI~ig~~~~~~~~~YRLvqV~Gv~k~~~~g~~~Td~~L~v~~~~k~~~i~~iSd~~FTEeE~~r~~q~~~~~~~krPTv 278 (334)
|||+.-..+...++.||+|-+-.=. +..|.+ ++..|.-. |+ .+...-.=..+|...-+..|..+-.||
T Consensus 2 VkIRL~R~G~KkrPfYrIVvadsr~--~RdGk~-----iE~lG~Yn--P~---~~~~~i~l~~eri~~Wl~~GAqpT~tV 69 (94)
T PRK14524 2 VRIRLTRMGKRKQPFYRIVVVDSRK--RRDGAY-----IESLGYYN--PL---KEPYEIKVDVERAVEWILKGAQPSDTV 69 (94)
T ss_pred cEEhhccCCCCCCCeEEEEEEecCC--CCCCCc-----eeEeeecC--CC---CCCceEEEcHHHHHHHHHcCCccCHHH
Confidence 6777777777778999999764322 223443 33333111 11 111111112345555567799999999
Q ss_pred HHHHHHHHHHH
Q 019902 279 VELQEKTQILH 289 (334)
Q Consensus 279 ~dl~eKa~~i~ 289 (334)
..|..|+..+.
T Consensus 70 ~~Llkk~g~~~ 80 (94)
T PRK14524 70 RDILRKFGVMK 80 (94)
T ss_pred HHHHHHccchh
Confidence 99999977543
No 29
>PF01152 Bac_globin: Bacterial-like globin; InterPro: IPR001486 Globins are haem-containing proteins involved in binding and/or transporting oxygen. They belong to a very large and well studied family that is widely distributed in many organisms []. Globins have evolved from a common ancestor and can be divided into three groups: single-domain globins, and two types of chimeric globins, flavohaemoglobins and globin-coupled sensors. Bacteria have all three types of globins, while archaea lack flavohaemoglobins, and eukaryotes lack globin-coupled sensors []. Several functionally different haemoglobins can coexist in the same species. The major types of globins include: Haemoglobin (Hb): trimer of two alpha and two beta chains, although embryonic and foetal forms can substitute the alpha or beta chain for ones with higher oxygen affinity, such as gamma, delta, epsilon or zeta chains. Hb transports oxygen from lungs to other tissues in vertebrates []. Hb proteins are also present in unicellular organisms where they act as enzymes or sensors []. Myoglobin (Mb): monomeric protein responsible for oxygen storage in vertebrate muscle []. Neuroglobin: a myoglobin-like haemprotein expressed in vertebrate brain and retina, where it is involved in neuroprotection from damage due to hypoxia or ischemia []. Neuroglobin belongs to a branch of the globin family that diverged early in evolution. Cytoglobin: an oxygen sensor expressed in multiple tissues. Related to neuroglobin []. Erythrocruorin: highly cooperative extracellular respiratory proteins found in annelids and arthropods that are assembled from as many as 180 subunit into hexagonal bilayers []. Leghaemoglobin (legHb or symbiotic Hb): occurs in the root nodules of leguminous plants, where it facilitates the diffusion of oxygen to symbiotic bacteriods in order to promote nitrogen fixation. Non-symbiotic haemoglobin (NsHb): occurs in non-leguminous plants, and can be over-expressed in stressed plants []. Flavohaemoglobins (FHb): chimeric, with an N-terminal globin domain and a C-terminal ferredoxin reductase-like NAD/FAD-binding domain. FHb provides protection against nitric oxide via its C-terminal domain, which transfers electrons to haem in the globin []. Globin-coupled sensors: chimeric, with an N-terminal myoglobin-like domain and a C-terminal domain that resembles the cytoplasmic signalling domain of bacterial chemoreceptors. They bind oxygen, and act to initiate an aerotactic response or regulate gene expression [, ]. Protoglobin: a single domain globin found in archaea that is related to the N-terminal domain of globin-coupled sensors []. Truncated 2/2 globin: lack the first helix, giving them a 2-over-2 instead of the canonical 3-over-3 alpha-helical sandwich fold. Can be divided into three main groups (I, II and II) based on structural features []. This entry represents a group of haemoglobin-like proteins found in eubacteria, cyanobacteria, protozoa, algae and plants, but not in animals or yeast. These proteins have a truncated 2-over-2 rather than the canonical 3-over-3 alpha-helical sandwich fold []. This entry includes: HbN (or GlbN): a truncated haemoglobin-like protein that binds oxygen cooperatively with a very high affinity and a slow dissociation rate, which may exclude it from oxygen transport. It appears to be involved in bacterial nitric oxide detoxification and in nitrosative stress []. Cyanoglobin (or GlbN): a truncated haemoprotein found in cyanobacteria that has high oxygen affinity, and which appears to serve as part of a terminal oxidase, rather than as a respiratory pigment []. HbO (or GlbO): a truncated haemoglobin-like protein with a lower oxygen affinity than HbN. HbO associates with the bacterial cell membrane, where it significantly increases oxygen uptake over membranes lacking this protein. HbO appears to interact with a terminal oxidase, and could participate in an oxygen/electron-transfer process that facilitates oxygen transfer during aerobic metabolism []. Glb3: a nuclear-encoded truncated haemoglobin from plants that appears more closely related to HbO than HbN. Glb3 from Arabidopsis thaliana (Mouse-ear cress) exhibits an unusual concentration-independent binding of oxygen and carbon dioxide []. ; GO: 0019825 oxygen binding, 0015671 oxygen transport; PDB: 2BKM_B 1UVY_A 1DLW_A 2XYK_B 2IG3_A 2GKM_B 1S61_A 1S56_B 1RTE_B 2GLN_A ....
Probab=22.34 E-value=1.3e+02 Score=24.57 Aligned_cols=42 Identities=17% Similarity=0.216 Sum_probs=31.3
Q ss_pred CCCCCHHHHHHHHH----HHHhCCCCCCcHHHHHHHHHHHHHHHhh
Q 019902 253 DDNFSEEECEDLRQ----RVKDGLLKRPTVVELQEKTQILHEDITK 294 (334)
Q Consensus 253 d~~FTEeE~~r~~q----~~~~~~~krPTv~dl~eKa~~i~~~i~~ 294 (334)
+-++|+++|++|.+ ++..-+++-+...++.+.+..+...|++
T Consensus 74 ~l~it~~~f~~~~~~~~~al~~~~v~~~~~~~~~~~~~~~~~~i~n 119 (120)
T PF01152_consen 74 HLGITEEHFDRWLELLKQALDELGVPEELIDELLARLESLRDDIVN 119 (120)
T ss_dssp TS-BBHHHHHHHHHHHHHHHHHTTCTHHHHHHHHHHHHHHHHHHHH
T ss_pred CCCCCHHHHHHHHHHHHHHHHHhCCCHHHHHHHHHHHHHHHHHhcC
Confidence 34899999999875 5556677777778888888888777764
No 30
>smart00513 SAP Putative DNA-binding (bihelical) motif predicted to be involved in chromosomal organisation.
Probab=20.90 E-value=1.8e+02 Score=19.16 Aligned_cols=33 Identities=15% Similarity=0.157 Sum_probs=25.9
Q ss_pred CCCCHHHHHHHhhcCCCCCCccchHHHHHHHHHHh
Q 019902 40 GWGSIPLIEFLQSLGKDTTQQISRYDVTDIINKYV 74 (334)
Q Consensus 40 ~w~S~eL~~fl~~~g~d~~~~lsr~~v~k~lw~YI 74 (334)
.+.-.+|.+++...|-.++. .+.+++.+|-+|+
T Consensus 3 ~l~~~~Lk~~l~~~gl~~~G--~K~~Lv~Rl~~~~ 35 (35)
T smart00513 3 KLKVSELKDELKKRGLSTSG--TKAELVDRLLEAL 35 (35)
T ss_pred cCcHHHHHHHHHHcCCCCCC--CHHHHHHHHHHhC
Confidence 34567899999988887654 6889999988874
No 31
>PRK02542 photosystem I assembly protein Ycf4; Provisional
Probab=20.67 E-value=1.7e+02 Score=27.37 Aligned_cols=59 Identities=14% Similarity=0.189 Sum_probs=38.5
Q ss_pred ceeEEEEeeeeecCCCC-CcccceeEEEeCCceeEEeeeccCCCCCHHHHHHHHHHHHhCCCCCCcHHHHHHHHHHHHHH
Q 019902 213 SHQLLQVIGTKKVPGTD-DKSTEILLKVSNFVKDIRIATLSDDNFSEEECEDLRQRVKDGLLKRPTVVELQEKTQILHED 291 (334)
Q Consensus 213 ~YRLvqV~Gv~k~~~~g-~~~Td~~L~v~~~~k~~~i~~iSd~~FTEeE~~r~~q~~~~~~~krPTv~dl~eKa~~i~~~ 291 (334)
.|-|-.|..|+---..| ...+.+||++.+ .+++|+.-+. +| +|..++|+||.+|-++
T Consensus 123 ~~pl~dIqsIrveikeGlnprr~iyL~~kg-~~~IPLTrig-~p--------------------l~l~eiE~qAaeLA~F 180 (188)
T PRK02542 123 EYPLEDIQAVKVEIREGLNPRRRLYLRLKG-RRDIPLTRVG-QP--------------------LPLAELENQGAELARF 180 (188)
T ss_pred EeEhHHeEEEEEEEecCcCCccEEEEEEcC-CCcCCcccCC-CC--------------------CCHHHHHHHHHHHHHH
Confidence 35555555554433334 334559999988 6788888773 34 4556699999999887
Q ss_pred Hh
Q 019902 292 IT 293 (334)
Q Consensus 292 i~ 293 (334)
+.
T Consensus 181 L~ 182 (188)
T PRK02542 181 LG 182 (188)
T ss_pred hC
Confidence 64
No 32
>TIGR00002 S16 ribosomal protein S16. This model describes ribosomal S16 of bacteria and organelles.
Probab=20.63 E-value=2.4e+02 Score=22.63 Aligned_cols=73 Identities=11% Similarity=0.193 Sum_probs=43.7
Q ss_pred EEEecCCCCCCCccceeEEEEeeeeecCCCCCcccceeEEEeCCceeEEeeeccCC---CCCHHHHHHHHHHHHhCCCCC
Q 019902 199 VRVKSDPNDYLQKNSHQLLQVIGTKKVPGTDDKSTEILLKVSNFVKDIRIATLSDD---NFSEEECEDLRQRVKDGLLKR 275 (334)
Q Consensus 199 VRI~ig~~~~~~~~~YRLvqV~Gv~k~~~~g~~~Td~~L~v~~~~k~~~i~~iSd~---~FTEeE~~r~~q~~~~~~~kr 275 (334)
|||+.-..+.+.++.||+|-+-.-.. ..|.+ ++..|.-. |+ .+. .+..+ |...-+..|..+-
T Consensus 1 vkIRL~R~G~k~~PfYrIVv~d~r~~--RdGk~-----iE~lG~Yn--P~---~~~~~i~l~~~---ri~~Wl~~GAqps 65 (78)
T TIGR00002 1 VKIRLKRGGRKKRPFYRIVVADSRSR--RDGRY-----IEELGFYN--PL---TKESRVKLNVE---RIKYWLSKGAQPT 65 (78)
T ss_pred CEEecccCCCCCCCeEEEEEEecCCC--CCCCc-----eeEeeecc--CC---CCCcEEEEcHH---HHHHHHHCCCccC
Confidence 67777777777789999997755322 23443 44433110 11 121 23334 4555567799999
Q ss_pred CcHHHHHHHHH
Q 019902 276 PTVVELQEKTQ 286 (334)
Q Consensus 276 PTv~dl~eKa~ 286 (334)
.||..|..|+.
T Consensus 66 ~tV~~Ll~~~g 76 (78)
T TIGR00002 66 DTVRNLLKKAG 76 (78)
T ss_pred HHHHHHHHHcc
Confidence 99998887753
No 33
>CHL00036 ycf4 photosystem I assembly protein Ycf4
Probab=20.35 E-value=1.6e+02 Score=27.47 Aligned_cols=59 Identities=12% Similarity=0.170 Sum_probs=38.6
Q ss_pred ceeEEEEeeeeecCCCC-CcccceeEEEeCCceeEEeeeccCCCCCHHHHHHHHHHHHhCCCCCCcHHHHHHHHHHHHHH
Q 019902 213 SHQLLQVIGTKKVPGTD-DKSTEILLKVSNFVKDIRIATLSDDNFSEEECEDLRQRVKDGLLKRPTVVELQEKTQILHED 291 (334)
Q Consensus 213 ~YRLvqV~Gv~k~~~~g-~~~Td~~L~v~~~~k~~~i~~iSd~~FTEeE~~r~~q~~~~~~~krPTv~dl~eKa~~i~~~ 291 (334)
.|-|-.|..|+-.-..| ...+.+||++.+ .+++|+.-+.+ | +|..++|+||.+|-++
T Consensus 119 ~~pl~dI~sIrieikeGlnprr~iyL~~kg-~~~IPLTrig~-p--------------------l~l~eiE~qAaeLA~F 176 (184)
T CHL00036 119 RFLIKDIQSIRIEVKEGLNPRRVLYLEIKG-QRDIPLTRTGE-P--------------------LTLREIEQKAAELAYF 176 (184)
T ss_pred EeEhHHeEEEEEEEecCcCcccEEEEEEcC-CCcCCcccCCC-C--------------------CCHHHHHHHHHHHHHH
Confidence 34555555554433334 344569999987 67888876643 3 4556699999999887
Q ss_pred Hh
Q 019902 292 IT 293 (334)
Q Consensus 292 i~ 293 (334)
+.
T Consensus 177 L~ 178 (184)
T CHL00036 177 LR 178 (184)
T ss_pred cC
Confidence 54
Done!