Query psy7578
Match_columns 280
No_of_seqs 295 out of 1206
Neff 5.4
Searched_HMMs 46136
Date Fri Aug 16 22:01:16 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy7578.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/7578hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PTZ00199 high mobility group p 99.8 6E-21 1.3E-25 151.1 10.6 77 64-140 14-92 (94)
2 cd01389 MATA_HMG-box MATA_HMG- 99.8 5.7E-21 1.2E-25 145.0 9.8 76 72-147 1-76 (77)
3 KOG0527|consensus 99.8 1.7E-20 3.7E-25 178.1 8.3 85 67-151 57-141 (331)
4 cd01388 SOX-TCF_HMG-box SOX-TC 99.8 2.6E-19 5.6E-24 134.4 9.4 70 72-141 1-70 (72)
5 cd01390 HMGB-UBF_HMG-box HMGB- 99.8 1.8E-18 4E-23 125.8 9.1 66 73-138 1-66 (66)
6 PF00505 HMG_box: HMG (high mo 99.8 3.4E-18 7.3E-23 125.6 8.1 68 73-140 1-68 (69)
7 smart00398 HMG high mobility g 99.7 1.7E-17 3.8E-22 121.2 9.6 69 72-140 1-69 (70)
8 cd00084 HMG-box High Mobility 99.7 9E-17 1.9E-21 116.1 9.1 65 73-137 1-65 (66)
9 PF09011 HMG_box_2: HMG-box do 99.7 6E-17 1.3E-21 121.8 8.3 71 70-140 1-72 (73)
10 COG5648 NHP6B Chromatin-associ 99.7 9.6E-17 2.1E-21 142.9 8.9 77 64-140 62-138 (211)
11 KOG0381|consensus 99.7 5.7E-16 1.2E-20 121.3 9.6 70 71-140 21-91 (96)
12 KOG0526|consensus 99.5 1.2E-14 2.6E-19 143.6 8.0 75 63-141 526-600 (615)
13 KOG0528|consensus 99.4 8.8E-14 1.9E-18 136.2 2.8 81 65-145 318-398 (511)
14 KOG3248|consensus 99.1 9.4E-11 2E-15 111.0 6.6 72 69-140 188-259 (421)
15 KOG4715|consensus 98.9 2.4E-09 5.3E-14 100.8 7.5 75 68-142 60-134 (410)
16 KOG2746|consensus 98.7 9.8E-09 2.1E-13 104.4 4.6 70 67-136 176-247 (683)
17 PF14887 HMG_box_5: HMG (high 98.1 2.1E-05 4.5E-10 60.7 8.7 68 73-141 4-71 (85)
18 COG5648 NHP6B Chromatin-associ 97.2 0.00016 3.5E-09 65.2 1.5 113 14-139 96-210 (211)
19 PF06382 DUF1074: Protein of u 97.0 0.0024 5.3E-08 56.4 7.3 49 77-129 83-131 (183)
20 PF04690 YABBY: YABBY protein; 96.8 0.0028 6E-08 55.8 5.6 48 69-116 118-165 (170)
21 PF08073 CHDNT: CHDNT (NUC034) 94.5 0.052 1.1E-06 39.4 3.6 40 77-116 13-52 (55)
22 PF04769 MAT_Alpha1: Mating-ty 94.1 0.13 2.7E-06 46.5 6.1 55 68-128 39-93 (201)
23 PF06244 DUF1014: Protein of u 89.8 0.59 1.3E-05 39.1 4.7 48 70-117 69-117 (122)
24 PRK15117 ABC transporter perip 84.4 4.5 9.7E-05 36.5 7.6 47 95-141 65-113 (211)
25 TIGR03481 HpnM hopanoid biosyn 84.4 2.1 4.5E-05 38.3 5.3 46 96-141 62-109 (198)
26 PF05494 Tol_Tol_Ttg2: Toluene 69.1 12 0.00027 31.9 5.6 46 96-141 36-83 (170)
27 PF12881 NUT_N: NUT protein N 61.9 19 0.00041 34.8 5.7 55 77-131 229-283 (328)
28 KOG3223|consensus 51.0 21 0.00046 32.4 3.9 55 69-126 160-215 (221)
29 PF01352 KRAB: KRAB box; Inte 37.8 27 0.00059 23.6 2.0 29 99-127 2-31 (41)
30 PF06945 DUF1289: Protein of u 35.6 50 0.0011 23.2 3.1 25 99-128 23-47 (51)
31 PF12650 DUF3784: Domain of un 34.1 27 0.00058 27.3 1.7 15 111-125 26-40 (97)
32 COG2854 Ttg2D ABC-type transpo 33.3 66 0.0014 29.3 4.3 51 96-146 68-120 (202)
33 PF13875 DUF4202: Domain of un 32.2 95 0.0021 27.9 5.0 40 78-120 130-169 (185)
34 PRK09706 transcriptional repre 31.6 82 0.0018 25.8 4.3 38 103-140 88-125 (135)
35 PRK12751 cpxP periplasmic stre 29.6 1.1E+02 0.0024 26.8 4.9 33 103-135 119-151 (162)
36 PRK10236 hypothetical protein; 28.9 54 0.0012 30.6 3.0 22 103-124 118-139 (237)
37 PF11304 DUF3106: Protein of u 28.1 2E+02 0.0043 23.2 5.9 37 90-126 33-71 (107)
38 PF11304 DUF3106: Protein of u 28.0 94 0.002 25.1 4.0 26 102-127 11-36 (107)
39 cd07081 ALDH_F20_ACDH_EutE-lik 27.1 1.2E+02 0.0026 30.3 5.3 39 102-140 6-44 (439)
40 KOG1827|consensus 26.9 5.1 0.00011 42.0 -4.4 44 76-119 552-595 (629)
41 smart00500 SFM Splicing Factor 26.0 27 0.00059 24.2 0.4 23 199-221 2-24 (44)
42 PF12356 DUF3643: Protein of u 25.7 35 0.00076 30.1 1.1 14 233-246 76-89 (165)
43 PF08367 M16C_assoc: Peptidase 23.6 1.4E+02 0.003 27.1 4.7 32 101-132 13-44 (248)
44 cd07749 NT_Pol-beta-like_1 Nuc 22.4 61 0.0013 28.3 2.0 29 206-238 2-30 (156)
45 PF05388 Carbpep_Y_N: Carboxyp 22.4 1.4E+02 0.0029 24.7 4.0 29 100-128 45-73 (113)
46 KOG1610|consensus 22.3 2.2E+02 0.0047 27.8 5.9 49 82-130 187-248 (322)
47 PRK12750 cpxP periplasmic repr 20.8 2.3E+02 0.0049 24.8 5.3 35 104-138 127-161 (170)
48 cd07133 ALDH_CALDH_CalB Conife 20.6 1.8E+02 0.0039 28.7 5.1 39 102-140 5-43 (434)
49 cd07132 ALDH_F3AB Aldehyde deh 20.4 1.8E+02 0.004 28.8 5.2 39 102-140 5-43 (443)
50 cd07661 BAR_ICA69 The Bin/Amph 20.3 54 0.0012 29.9 1.3 65 208-275 40-106 (204)
51 PRK10363 cpxP periplasmic repr 20.3 2.2E+02 0.0047 25.2 5.0 37 102-139 112-148 (166)
52 PRK14830 undecaprenyl pyrophos 20.2 3.8E+02 0.0082 25.0 6.9 58 200-257 84-141 (251)
No 1
>PTZ00199 high mobility group protein; Provisional
Probab=99.85 E-value=6e-21 Score=151.09 Aligned_cols=77 Identities=26% Similarity=0.502 Sum_probs=73.3
Q ss_pred cccccCCCCCCCCCChHHHHHHHHHHHHHHhCCCCC--HHHHHHHHHHHHcCCChhhHHHHHHHHHHHHHHHHHHHhhC
Q psy7578 64 EDDLSQESIEKARFTAYMLWAKQIRQKLIKSNPEMD--FSQVSKKLGELWHTVPFNEKYGWKRQADRLAAKYTQKMSKA 140 (280)
Q Consensus 64 ~~~~~d~~~PKRP~sAY~LF~ke~R~kik~e~P~ls--~~eIsK~Lge~Wk~LseeEK~~Y~~kA~~~kekY~~ema~~ 140 (280)
++..+|+++||||+||||+|++++|.+|..+||+++ +.+|+++||++|++|+++||++|+++|+.++++|.++|.+|
T Consensus 14 ~k~~kdp~~PKrP~sAY~~F~~~~R~~i~~~~P~~~~~~~evsk~ige~Wk~ls~eeK~~y~~~A~~dk~rY~~e~~~Y 92 (94)
T PTZ00199 14 KRKKKDPNAPKRALSAYMFFAKEKRAEIIAENPELAKDVAAVGKMVGEAWNKLSEEEKAPYEKKAQEDKVRYEKEKAEY 92 (94)
T ss_pred CCCCCCCCCCCCCCcHHHHHHHHHHHHHHHHCcCCcccHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 345679999999999999999999999999999986 89999999999999999999999999999999999999987
No 2
>cd01389 MATA_HMG-box MATA_HMG-box, class I member of the HMG-box superfamily of DNA-binding proteins. These proteins contain a single HMG box, and bind the minor groove of DNA in a highly sequence-specific manner. Members include the fungal mating type gene products MC, MATA1 and Ste11.
Probab=99.85 E-value=5.7e-21 Score=145.03 Aligned_cols=76 Identities=17% Similarity=0.279 Sum_probs=73.4
Q ss_pred CCCCCCChHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHHcCCChhhHHHHHHHHHHHHHHHHHHHhhCCCCCCCc
Q psy7578 72 IEKARFTAYMLWAKQIRQKLIKSNPEMDFSQVSKKLGELWHTVPFNEKYGWKRQADRLAAKYTQKMSKAPAQKTKS 147 (280)
Q Consensus 72 ~PKRP~sAY~LF~ke~R~kik~e~P~ls~~eIsK~Lge~Wk~LseeEK~~Y~~kA~~~kekY~~ema~~~~~p~Kk 147 (280)
+||||+||||||+++.|.+++.+||++++.||+++||++|++|+++||++|.++|++++++|.+++.+|...|+++
T Consensus 1 ~~kRP~naf~lf~~~~r~~~~~~~p~~~~~eisk~~g~~Wk~ls~eeK~~y~~~A~~~k~~~~~~~p~Yky~p~~~ 76 (77)
T cd01389 1 KIPRPRNAFILYRQDKHAQLKTENPGLTNNEISRIIGRMWRSESPEVKAYYKELAEEEKERHAREYPDYKYTPRKS 76 (77)
T ss_pred CCCCCCcHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHhhCCHHHHHHHHHHHHHHHHHHHHHCCCCcccCCCC
Confidence 6899999999999999999999999999999999999999999999999999999999999999999998888774
No 3
>KOG0527|consensus
Probab=99.82 E-value=1.7e-20 Score=178.12 Aligned_cols=85 Identities=27% Similarity=0.441 Sum_probs=80.2
Q ss_pred ccCCCCCCCCCChHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHHcCCChhhHHHHHHHHHHHHHHHHHHHhhCCCCCCC
Q psy7578 67 LSQESIEKARFTAYMLWAKQIRQKLIKSNPEMDFSQVSKKLGELWHTVPFNEKYGWKRQADRLAAKYTQKMSKAPAQKTK 146 (280)
Q Consensus 67 ~~d~~~PKRP~sAY~LF~ke~R~kik~e~P~ls~~eIsK~Lge~Wk~LseeEK~~Y~~kA~~~kekY~~ema~~~~~p~K 146 (280)
+...+++||||||||+|++.+|++|.++||++++.||||+||.+||.|+++||.+|.++|+++|++|.++..+|.+.|++
T Consensus 57 k~~~~hIKRPMNAFMVWSq~~RRkma~qnP~mHNSEISK~LG~~WK~Lse~EKrPFi~EAeRLR~~HmkehPdYKYRPRR 136 (331)
T KOG0527|consen 57 KTSTDRIKRPMNAFMVWSQGQRRKLAKQNPKMHNSEISKRLGAEWKLLSEEEKRPFVDEAERLRAQHMKEYPDYKYRPRR 136 (331)
T ss_pred CCCccccCCCcchhhhhhHHHHHHHHHhCcchhhHHHHHHHHHHHhhcCHhhhccHHHHHHHHHHHHHHhCCCccccccc
Confidence 35678999999999999999999999999999999999999999999999999999999999999999999999999998
Q ss_pred ccCCC
Q psy7578 147 STYTP 151 (280)
Q Consensus 147 k~~~~ 151 (280)
|.+..
T Consensus 137 Kkk~~ 141 (331)
T KOG0527|consen 137 KKKKR 141 (331)
T ss_pred ccccc
Confidence 75443
No 4
>cd01388 SOX-TCF_HMG-box SOX-TCF_HMG-box, class I member of the HMG-box superfamily of DNA-binding proteins. These proteins contain a single HMG box, and bind the minor groove of DNA in a highly sequence-specific manner. Members include SRY and its homologs in insects and vertebrates, and transcription factor-like proteins, TCF-1, -3, -4, and LEF-1. They appear to bind the minor groove of the A/T C A A A G/C-motif.
Probab=99.80 E-value=2.6e-19 Score=134.43 Aligned_cols=70 Identities=24% Similarity=0.386 Sum_probs=67.8
Q ss_pred CCCCCCChHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHHcCCChhhHHHHHHHHHHHHHHHHHHHhhCC
Q psy7578 72 IEKARFTAYMLWAKQIRQKLIKSNPEMDFSQVSKKLGELWHTVPFNEKYGWKRQADRLAAKYTQKMSKAP 141 (280)
Q Consensus 72 ~PKRP~sAY~LF~ke~R~kik~e~P~ls~~eIsK~Lge~Wk~LseeEK~~Y~~kA~~~kekY~~ema~~~ 141 (280)
++|||+||||+|++++|.+++.+||++++.||+++||++|+.|+++||++|.++|++++++|.+++..|.
T Consensus 1 ~iKrP~naf~~F~~~~r~~~~~~~p~~~~~eisk~l~~~Wk~ls~~eK~~y~~~a~~~k~~y~~~~p~y~ 70 (72)
T cd01388 1 HIKRPMNAFMLFSKRHRRKVLQEYPLKENRAISKILGDRWKALSNEEKQPYYEEAKKLKELHMKLYPDYK 70 (72)
T ss_pred CCCCCCcHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHHHHHHCcCCC
Confidence 5799999999999999999999999999999999999999999999999999999999999999998873
No 5
>cd01390 HMGB-UBF_HMG-box HMGB-UBF_HMG-box, class II and III members of the HMG-box superfamily of DNA-binding proteins. These proteins bind the minor groove of DNA in a non-sequence specific fashion and contain two or more tandem HMG boxes. Class II members include non-histone chromosomal proteins, HMG1 and HMG2, which bind to bent or distorted DNA such as four-way DNA junctions, synthetic DNA cruciforms, kinked cisplatin-modified DNA, DNA bulges, cross-overs in supercoiled DNA, and can cause looping of linear DNA. Class III members include nucleolar and mitochondrial transcription factors, UBF and mtTF1, which bind four-way DNA junctions.
Probab=99.77 E-value=1.8e-18 Score=125.79 Aligned_cols=66 Identities=32% Similarity=0.609 Sum_probs=64.3
Q ss_pred CCCCCChHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHHcCCChhhHHHHHHHHHHHHHHHHHHHh
Q psy7578 73 EKARFTAYMLWAKQIRQKLIKSNPEMDFSQVSKKLGELWHTVPFNEKYGWKRQADRLAAKYTQKMS 138 (280)
Q Consensus 73 PKRP~sAY~LF~ke~R~kik~e~P~ls~~eIsK~Lge~Wk~LseeEK~~Y~~kA~~~kekY~~ema 138 (280)
||||+|||++|++++|.+++.+||++++.+|++.||++|++|+++||++|.++|++++++|+++|.
T Consensus 1 Pkrp~saf~~f~~~~r~~~~~~~p~~~~~~i~~~~~~~W~~ls~~eK~~y~~~a~~~~~~y~~e~~ 66 (66)
T cd01390 1 PKRPLSAYFLFSQEQRPKLKKENPDASVTEVTKILGEKWKELSEEEKKKYEEKAEKDKERYEKEMK 66 (66)
T ss_pred CCCCCcHHHHHHHHHHHHHHHHCcCCCHHHHHHHHHHHHHhCCHHHHHHHHHHHHHHHHHHHHhhC
Confidence 799999999999999999999999999999999999999999999999999999999999999873
No 6
>PF00505 HMG_box: HMG (high mobility group) box; InterPro: IPR000910 High mobility group (HMG or HMGB) proteins are a family of relatively low molecular weight non-histone components in chromatin. HMG1 (also called HMG-T in fish) and HMG2 are two highly related proteins that bind single-stranded DNA preferentially and unwind double-stranded DNA. Although they have no sequence specificity, they have a high affinity for bent or distorted DNA, and bend linear DNA. HMG1 and HMG2 contain two DNA-binding HMG-box domains (A and B) that show structural and functional differences, and have a long acidic C-terminal domain rich in aspartic and glutamic acid residues. The acidic tail modulates the affinity of the tandem HMG boxes in HMG1 and 2 for a variety of DNA targets. HMG1 and 2 appear to play important architectural roles in the assembly of nucleoprotein complexes in a variety of biological processes, for example V(D)J recombination, the initiation of transcription, and DNA repair []. The profile in this entry describing the HMG-domains is much more general than the signature. In addition to the HMG1 and HMG2 proteins, HMG-domains occur in single or multiple copies in the following protein classes; the SOX family of transcription factors; SRY sex determining region Y protein and related proteins []; LEF1 lymphoid enhancer binding factor 1 []; SSRP recombination signal recognition protein; MTF1 mitochondrial transcription factor 1; UBF1/2 nucleolar transcription factors; Abf2 yeast ARS-binding factor []; and Saccharomyces cerevisiae transcription factors Ixr1, Rox1, Nhp6a, Nhp6b and Spp41.; GO: 0003677 DNA binding; PDB: 1I11_A 1J3C_A 1J3D_A 1WZ6_A 1WGF_A 2D7L_A 1GT0_D 3U2B_C 2CRJ_A 2CS1_A ....
Probab=99.75 E-value=3.4e-18 Score=125.61 Aligned_cols=68 Identities=29% Similarity=0.536 Sum_probs=64.3
Q ss_pred CCCCCChHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHHcCCChhhHHHHHHHHHHHHHHHHHHHhhC
Q psy7578 73 EKARFTAYMLWAKQIRQKLIKSNPEMDFSQVSKKLGELWHTVPFNEKYGWKRQADRLAAKYTQKMSKA 140 (280)
Q Consensus 73 PKRP~sAY~LF~ke~R~kik~e~P~ls~~eIsK~Lge~Wk~LseeEK~~Y~~kA~~~kekY~~ema~~ 140 (280)
||||+|||++|++++|..++.+||++++.+|++++|++|++|+++||++|.++|++++++|++++..|
T Consensus 1 PkrP~~af~lf~~~~~~~~k~~~p~~~~~~i~~~~~~~W~~l~~~eK~~y~~~a~~~~~~y~~~~~~y 68 (69)
T PF00505_consen 1 PKRPPNAFMLFCKEKRAKLKEENPDLSNKEISKILAQMWKNLSEEEKAPYKEEAEEEKERYEKEMPEY 68 (69)
T ss_dssp SSSS--HHHHHHHHHHHHHHHHSTTSTHHHHHHHHHHHHHCSHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred CcCCCCHHHHHHHHHHHHHHHHhcccccccchhhHHHHHhcCCHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 79999999999999999999999999999999999999999999999999999999999999999765
No 7
>smart00398 HMG high mobility group.
Probab=99.73 E-value=1.7e-17 Score=121.16 Aligned_cols=69 Identities=29% Similarity=0.548 Sum_probs=67.0
Q ss_pred CCCCCCChHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHHcCCChhhHHHHHHHHHHHHHHHHHHHhhC
Q psy7578 72 IEKARFTAYMLWAKQIRQKLIKSNPEMDFSQVSKKLGELWHTVPFNEKYGWKRQADRLAAKYTQKMSKA 140 (280)
Q Consensus 72 ~PKRP~sAY~LF~ke~R~kik~e~P~ls~~eIsK~Lge~Wk~LseeEK~~Y~~kA~~~kekY~~ema~~ 140 (280)
+||||+|||++|++++|..++.+||++++.+|++.+|++|+.|++++|++|.++|++++++|.+++..|
T Consensus 1 ~pkrp~~~y~~f~~~~r~~~~~~~~~~~~~~i~~~~~~~W~~l~~~ek~~y~~~a~~~~~~y~~~~~~y 69 (70)
T smart00398 1 KPKRPMSAFMLFSQENRAKIKAENPDLSNAEISKKLGERWKLLSEEEKAPYEEKAKKDKERYEEEMPEY 69 (70)
T ss_pred CcCCCCcHHHHHHHHHHHHHHHHCcCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 489999999999999999999999999999999999999999999999999999999999999999876
No 8
>cd00084 HMG-box High Mobility Group (HMG)-box is found in a variety of eukaryotic chromosomal proteins and transcription factors. HMGs bind to the minor groove of DNA and have been classified by DNA binding preferences. Two phylogenically distinct groups of Class I proteins bind DNA in a sequence specific fashion and contain a single HMG box. One group (SOX-TCF) includes transcription factors, TCF-1, -3, -4; and also SRY and LEF-1, which bind four-way DNA junctions and duplex DNA targets. The second group (MATA) includes fungal mating type gene products MC, MATA1 and Ste11. Class II and III proteins (HMGB-UBF) bind DNA in a non-sequence specific fashion and contain two or more tandem HMG boxes. Class II members include non-histone chromosomal proteins, HMG1 and HMG2, which bind to bent or distorted DNA such as four-way DNA junctions, synthetic DNA cruciforms, kinked cisplatin-modified DNA, DNA bulges, cross-overs in supercoiled DNA, and can cause looping of linear DNA. Class III member
Probab=99.70 E-value=9e-17 Score=116.10 Aligned_cols=65 Identities=28% Similarity=0.606 Sum_probs=63.4
Q ss_pred CCCCCChHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHHcCCChhhHHHHHHHHHHHHHHHHHHH
Q psy7578 73 EKARFTAYMLWAKQIRQKLIKSNPEMDFSQVSKKLGELWHTVPFNEKYGWKRQADRLAAKYTQKM 137 (280)
Q Consensus 73 PKRP~sAY~LF~ke~R~kik~e~P~ls~~eIsK~Lge~Wk~LseeEK~~Y~~kA~~~kekY~~em 137 (280)
||||+|||++|+++.|..++.+||++++.+|++.+|++|+.|++++|++|.++|+.++++|.+++
T Consensus 1 pkrp~~af~~f~~~~~~~~~~~~~~~~~~~i~~~~~~~W~~l~~~~k~~y~~~a~~~~~~y~~~~ 65 (66)
T cd00084 1 PKRPLSAYFLFSQEHRAEVKAENPGLSVGEISKILGEMWKSLSEEEKKKYEEKAEKDKERYEKEM 65 (66)
T ss_pred CCCCCcHHHHHHHHHHHHHHHHCcCCCHHHHHHHHHHHHHhCCHHHHHHHHHHHHHHHHHHHHhh
Confidence 79999999999999999999999999999999999999999999999999999999999999875
No 9
>PF09011 HMG_box_2: HMG-box domain; InterPro: IPR015101 This domain is predominantly found in Maelstrom homologue proteins. It has no known function. ; GO: 0005634 nucleus; PDB: 2EQZ_A 1V64_A 2CTO_A 1H5P_A 3TQ6_A 3FGH_A 3TMM_A 1J3X_A 2YRQ_A 1AAB_A ....
Probab=99.70 E-value=6e-17 Score=121.85 Aligned_cols=71 Identities=23% Similarity=0.409 Sum_probs=63.1
Q ss_pred CCCCCCCCChHHHHHHHHHHHHHHh-CCCCCHHHHHHHHHHHHcCCChhhHHHHHHHHHHHHHHHHHHHhhC
Q psy7578 70 ESIEKARFTAYMLWAKQIRQKLIKS-NPEMDFSQVSKKLGELWHTVPFNEKYGWKRQADRLAAKYTQKMSKA 140 (280)
Q Consensus 70 ~~~PKRP~sAY~LF~ke~R~kik~e-~P~ls~~eIsK~Lge~Wk~LseeEK~~Y~~kA~~~kekY~~ema~~ 140 (280)
|++||||+|||++|+++++.+++.+ .+...+.|+++.++++|++||++||.+|+++|+.++++|+++|..|
T Consensus 1 p~kpK~~~say~lF~~~~~~~~k~~G~~~~~~~e~~k~~~~~Wk~Ls~~EK~~Y~~~A~~~k~~y~~e~~~~ 72 (73)
T PF09011_consen 1 PKKPKRPPSAYNLFMKEMRKEVKEEGGQKQSFREVMKEISERWKSLSEEEKEPYEERAKEDKERYEREMKEW 72 (73)
T ss_dssp SSS--SSSSHHHHHHHHHHHHHHHHT-T-SSHHHHHHHHHHHHHHS-HHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred CcCCCCCCCHHHHHHHHHHHHHHHhcccCCCHHHHHHHHHHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 5789999999999999999999988 7888999999999999999999999999999999999999999865
No 10
>COG5648 NHP6B Chromatin-associated proteins containing the HMG domain [Chromatin structure and dynamics]
Probab=99.68 E-value=9.6e-17 Score=142.90 Aligned_cols=77 Identities=23% Similarity=0.416 Sum_probs=74.3
Q ss_pred cccccCCCCCCCCCChHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHHcCCChhhHHHHHHHHHHHHHHHHHHHhhC
Q psy7578 64 EDDLSQESIEKARFTAYMLWAKQIRQKLIKSNPEMDFSQVSKKLGELWHTVPFNEKYGWKRQADRLAAKYTQKMSKA 140 (280)
Q Consensus 64 ~~~~~d~~~PKRP~sAY~LF~ke~R~kik~e~P~ls~~eIsK~Lge~Wk~LseeEK~~Y~~kA~~~kekY~~ema~~ 140 (280)
.++..|+|.||||+||||+|+.++|.+|++++|++.|.+|++.+|++|++|+++||++|..+|..++++|+.++..|
T Consensus 62 ~r~k~dpN~PKRp~sayf~y~~~~R~ei~~~~p~l~~~e~~k~~~e~WK~Ltd~eke~y~k~~~~~~erYq~ek~~y 138 (211)
T COG5648 62 VRKKKDPNGPKRPLSAYFLYSAENRDEIRKENPKLTFGEVGKLLSEKWKELTDEEKEPYYKEANSDRERYQREKEEY 138 (211)
T ss_pred HHHhcCCCCCCCchhHHHHHHHHHHHHHHHhCCCCChHHHHHHHHHHHHhccHhhhhhHHHHHhhHHHHHHHHHHhh
Confidence 44567999999999999999999999999999999999999999999999999999999999999999999999998
No 11
>KOG0381|consensus
Probab=99.66 E-value=5.7e-16 Score=121.32 Aligned_cols=70 Identities=27% Similarity=0.580 Sum_probs=68.3
Q ss_pred CCCCCCCChHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHHcCCChhhHHHHHHHHHHHHHHHHHHHh-hC
Q psy7578 71 SIEKARFTAYMLWAKQIRQKLIKSNPEMDFSQVSKKLGELWHTVPFNEKYGWKRQADRLAAKYTQKMS-KA 140 (280)
Q Consensus 71 ~~PKRP~sAY~LF~ke~R~kik~e~P~ls~~eIsK~Lge~Wk~LseeEK~~Y~~kA~~~kekY~~ema-~~ 140 (280)
+.||||++||++|+++.|..++.+||++++.||++++|++|++|++++|++|..+|..++++|.++|. +|
T Consensus 21 ~~pkrp~sa~~~f~~~~~~~~k~~~p~~~~~~v~k~~g~~W~~l~~~~k~~y~~ka~~~k~~Y~~~~~~~~ 91 (96)
T KOG0381|consen 21 QAPKRPLSAFFLFSSEQRSKIKAENPGLSVGEVAKALGEMWKNLAEEEKQPYEEKASKLKEKYEKELAGEY 91 (96)
T ss_pred CCCCCCCcHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 59999999999999999999999999999999999999999999999999999999999999999998 66
No 12
>KOG0526|consensus
Probab=99.54 E-value=1.2e-14 Score=143.60 Aligned_cols=75 Identities=24% Similarity=0.455 Sum_probs=70.7
Q ss_pred ccccccCCCCCCCCCChHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHHcCCChhhHHHHHHHHHHHHHHHHHHHhhCC
Q psy7578 63 DEDDLSQESIEKARFTAYMLWAKQIRQKLIKSNPEMDFSQVSKKLGELWHTVPFNEKYGWKRQADRLAAKYTQKMSKAP 141 (280)
Q Consensus 63 k~~~~~d~~~PKRP~sAY~LF~ke~R~kik~e~P~ls~~eIsK~Lge~Wk~LseeEK~~Y~~kA~~~kekY~~ema~~~ 141 (280)
+-++.+|||+|||+++|||+|.+..|..|+.+ +++++||+|++|++|+.|+. |.+|+++|+.+|.+|+.+|++|.
T Consensus 526 ~~kk~kdpnapkra~sa~m~w~~~~r~~ik~d--gi~~~dv~kk~g~~wk~ms~--k~~we~ka~~dk~ry~~em~~yk 600 (615)
T KOG0526|consen 526 KGKKKKDPNAPKRATSAYMLWLNASRESIKED--GISVGDVAKKAGEKWKQMSA--KEEWEDKAAVDKQRYEDEMKEYK 600 (615)
T ss_pred CcccCCCCCCCccchhHHHHHHHhhhhhHhhc--CchHHHHHHHHhHHHhhhcc--cchhhHHHHHHHHHHHHHHHhhc
Confidence 34567899999999999999999999999988 99999999999999999999 88999999999999999999994
No 13
>KOG0528|consensus
Probab=99.40 E-value=8.8e-14 Score=136.17 Aligned_cols=81 Identities=28% Similarity=0.439 Sum_probs=75.5
Q ss_pred ccccCCCCCCCCCChHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHHcCCChhhHHHHHHHHHHHHHHHHHHHhhCCCCC
Q psy7578 65 DDLSQESIEKARFTAYMLWAKQIRQKLIKSNPEMDFSQVSKKLGELWHTVPFNEKYGWKRQADRLAAKYTQKMSKAPAQK 144 (280)
Q Consensus 65 ~~~~d~~~PKRP~sAY~LF~ke~R~kik~e~P~ls~~eIsK~Lge~Wk~LseeEK~~Y~~kA~~~kekY~~ema~~~~~p 144 (280)
.+...++|+||||||||+|.++.|.+|.+.+||+.+.+|+|+||.+||.|+..||++|+++-.++-..|.+....|.+.|
T Consensus 318 g~~ss~PHIKRPMNAFMVWAkDERRKILqA~PDMHNSnISKILGSRWKaMSN~eKQPYYEEQaRLSk~HlEk~PdYrYkP 397 (511)
T KOG0528|consen 318 GRASSEPHIKRPMNAFMVWAKDERRKILQAFPDMHNSNISKILGSRWKAMSNTEKQPYYEEQARLSKLHLEKYPDYRYKP 397 (511)
T ss_pred CcCCCCccccCCcchhhcccchhhhhhhhcCccccccchhHHhcccccccccccccchHHHHHHHHHhhhccCcccccCC
Confidence 35567899999999999999999999999999999999999999999999999999999999999999999888887766
Q ss_pred C
Q psy7578 145 T 145 (280)
Q Consensus 145 ~ 145 (280)
+
T Consensus 398 R 398 (511)
T KOG0528|consen 398 R 398 (511)
T ss_pred C
Confidence 6
No 14
>KOG3248|consensus
Probab=99.12 E-value=9.4e-11 Score=111.02 Aligned_cols=72 Identities=19% Similarity=0.300 Sum_probs=66.0
Q ss_pred CCCCCCCCCChHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHHcCCChhhHHHHHHHHHHHHHHHHHHHhhC
Q psy7578 69 QESIEKARFTAYMLWAKQIRQKLIKSNPEMDFSQVSKKLGELWHTVPFNEKYGWKRQADRLAAKYTQKMSKA 140 (280)
Q Consensus 69 d~~~PKRP~sAY~LF~ke~R~kik~e~P~ls~~eIsK~Lge~Wk~LseeEK~~Y~~kA~~~kekY~~ema~~ 140 (280)
.+.++|+|+||||+|++|+|.++..|+---.-.+|.++||++|.+||.||..+|+++|+++++-|.+....+
T Consensus 188 KkphiKKPLNAFmlyMKEmRa~vvaEctlKeSAaiNqiLGrRWH~LSrEEQAKYyElArKerqlH~qlYP~W 259 (421)
T KOG3248|consen 188 KKPHIKKPLNAFMLYMKEMRAKVVAECTLKESAAINQILGRRWHALSREEQAKYYELARKERQLHMQLYPGW 259 (421)
T ss_pred cCccccccHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHHhHHHhhhhHHHHHHHHHHHHHHHHHHHHhcCCc
Confidence 456899999999999999999999999766688999999999999999999999999999999998876544
No 15
>KOG4715|consensus
Probab=98.92 E-value=2.4e-09 Score=100.82 Aligned_cols=75 Identities=19% Similarity=0.407 Sum_probs=70.2
Q ss_pred cCCCCCCCCCChHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHHcCCChhhHHHHHHHHHHHHHHHHHHHhhCCC
Q psy7578 68 SQESIEKARFTAYMLWAKQIRQKLIKSNPEMDFSQVSKKLGELWHTVPFNEKYGWKRQADRLAAKYTQKMSKAPA 142 (280)
Q Consensus 68 ~d~~~PKRP~sAY~LF~ke~R~kik~e~P~ls~~eIsK~Lge~Wk~LseeEK~~Y~~kA~~~kekY~~ema~~~~ 142 (280)
+.+.-|-+|+-.||.|++..|++++..||++...||.|+||.||..|+++||+.|..+++..|..|++.|+.|-.
T Consensus 60 kpPkppekpl~pymrySrkvWd~VkA~nPe~kLWeiGK~Ig~mW~dLpd~EK~ey~~EYeaEKieY~~smkayh~ 134 (410)
T KOG4715|consen 60 KPPKPPEKPLMPYMRYSRKVWDQVKASNPELKLWEIGKIIGGMWLDLPDEEKQEYLNEYEAEKIEYNESMKAYHN 134 (410)
T ss_pred CCCCCCCcccchhhHHhhhhhhhhhccCcchHHHHHHHHHHHHHhhCcchHHHHHHHHHHHHHHHHHHHHHHhhC
Confidence 346677889999999999999999999999999999999999999999999999999999999999999998743
No 16
>KOG2746|consensus
Probab=98.72 E-value=9.8e-09 Score=104.36 Aligned_cols=70 Identities=23% Similarity=0.339 Sum_probs=66.4
Q ss_pred ccCCCCCCCCCChHHHHHHHHH--HHHHHhCCCCCHHHHHHHHHHHHcCCChhhHHHHHHHHHHHHHHHHHH
Q psy7578 67 LSQESIEKARFTAYMLWAKQIR--QKLIKSNPEMDFSQVSKKLGELWHTVPFNEKYGWKRQADRLAAKYTQK 136 (280)
Q Consensus 67 ~~d~~~PKRP~sAY~LF~ke~R--~kik~e~P~ls~~eIsK~Lge~Wk~LseeEK~~Y~~kA~~~kekY~~e 136 (280)
+.+++|++||||||++|++.+| ..+.+.||+.+++-|+++|||+|-.|-+.||+.|.++|.+.++.|-+.
T Consensus 176 kr~k~HirrPMnaf~ifskrhr~~g~vhq~~pn~DNrtIskiLgewWytL~~~Ekq~yhdLa~Qvk~Ahfka 247 (683)
T KOG2746|consen 176 KRDKDHIRRPMNAFHIFSKRHRGEGRVHQRHPNQDNRTISKILGEWWYTLGPNEKQKYHDLAFQVKEAHFKA 247 (683)
T ss_pred cCcchhhhhhhHHHHHHHhhcCCccchhccCccccchhHHHHHhhhHhhhCchhhhhHHHHHHHHHHHHhhh
Confidence 4678899999999999999999 889999999999999999999999999999999999999999998874
No 17
>PF14887 HMG_box_5: HMG (high mobility group) box 5; PDB: 1L8Y_A 1L8Z_A 2HDZ_A.
Probab=98.12 E-value=2.1e-05 Score=60.72 Aligned_cols=68 Identities=15% Similarity=0.289 Sum_probs=58.9
Q ss_pred CCCCCChHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHHcCCChhhHHHHHHHHHHHHHHHHHHHhhCC
Q psy7578 73 EKARFTAYMLWAKQIRQKLIKSNPEMDFSQVSKKLGELWHTVPFNEKYGWKRQADRLAAKYTQKMSKAP 141 (280)
Q Consensus 73 PKRP~sAY~LF~ke~R~kik~e~P~ls~~eIsK~Lge~Wk~LseeEK~~Y~~kA~~~kekY~~ema~~~ 141 (280)
|..|-+|--+|.+.....+...+++-...+ -+.+...|++|++.+|-+|..+|.++..+|+.+|.++.
T Consensus 4 PE~PKt~qe~Wqq~vi~dYla~~~~dr~K~-~kam~~~W~~me~Kekl~WIkKA~EdqKrYE~el~e~r 71 (85)
T PF14887_consen 4 PETPKTAQEIWQQSVIGDYLAKFRNDRKKA-LKAMEAQWSQMEKKEKLKWIKKAAEDQKRYERELREMR 71 (85)
T ss_dssp S----THHHHHHHHHHHHHHHHTTSTHHHH-HHHHHHHHHTTGGGHHHHHHHHHHHHHHHHHHHHHCCS
T ss_pred CCCCCCHHHHHHHHHHHHHHHHhhHhHHHH-HHHHHHHHHHhhhhhhhHHHHHHHHHHHHHHHHHHHHh
Confidence 667789999999999999999998877776 56999999999999999999999999999999999983
No 18
>COG5648 NHP6B Chromatin-associated proteins containing the HMG domain [Chromatin structure and dynamics]
Probab=97.17 E-value=0.00016 Score=65.18 Aligned_cols=113 Identities=13% Similarity=0.234 Sum_probs=83.4
Q ss_pred ccccccch--hhhhhhhccCCCCCCccccccCCcccccccCCCCcccccccccccccCCCCCCCCCChHHHHHHHHHHHH
Q psy7578 14 VNISDISD--SELYNSAQAVGSGADEEFGEADSSLDAESIGSGDLADNLLIDEDDLSQESIEKARFTAYMLWAKQIRQKL 91 (280)
Q Consensus 14 v~~se~sk--se~~~t~s~~~~g~~e~~~~a~s~~~~~s~g~~k~~kk~~~k~~~~~d~~~PKRP~sAY~LF~ke~R~ki 91 (280)
+.|.++++ +|.|+.-.+.+.......+..+.. -|...+ .. -.....+++|.-.|.-+..+.|.++
T Consensus 96 l~~~e~~k~~~e~WK~Ltd~eke~y~k~~~~~~e----rYq~ek-------~~--y~~k~~~~~~~~~~~e~~~~~r~~~ 162 (211)
T COG5648 96 LTFGEVGKLLSEKWKELTDEEKEPYYKEANSDRE----RYQREK-------EE--YNKKLPNKAPIGPFIENEPKIRPKV 162 (211)
T ss_pred CChHHHHHHHHHHHHhccHhhhhhHHHHHhhHHH----HHHHHH-------Hh--hhcccCCCCCCchhhhccHHhcccc
Confidence 46788888 999998766555544433322211 010000 00 1112346778888888999999999
Q ss_pred HHhCCCCCHHHHHHHHHHHHcCCChhhHHHHHHHHHHHHHHHHHHHhh
Q psy7578 92 IKSNPEMDFSQVSKKLGELWHTVPFNEKYGWKRQADRLAAKYTQKMSK 139 (280)
Q Consensus 92 k~e~P~ls~~eIsK~Lge~Wk~LseeEK~~Y~~kA~~~kekY~~ema~ 139 (280)
...+|+....+..++++..|++|+++-|.+|.+.++.++.+|...+.+
T Consensus 163 ~~~~~~~~~~e~~k~~~~~w~el~~skK~~~~~~~Kk~k~~~~~~~~~ 210 (211)
T COG5648 163 EGPSPDKALVEETKIISKAWSELDESKKKKYIDKYKKLKEEYDSFYPE 210 (211)
T ss_pred CCCCcchhhhHHhhhhhhhhhhhChhhhhHHHHHHHHHHHHHhhhccc
Confidence 999999999999999999999999999999999999999999887653
No 19
>PF06382 DUF1074: Protein of unknown function (DUF1074); InterPro: IPR024460 This family consists of several proteins which appear to be specific to Insecta. The function of this family is unknown.
Probab=97.00 E-value=0.0024 Score=56.37 Aligned_cols=49 Identities=18% Similarity=0.369 Sum_probs=43.9
Q ss_pred CChHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHHcCCChhhHHHHHHHHHHH
Q psy7578 77 FTAYMLWAKQIRQKLIKSNPEMDFSQVSKKLGELWHTVPFNEKYGWKRQADRL 129 (280)
Q Consensus 77 ~sAY~LF~ke~R~kik~e~P~ls~~eIsK~Lge~Wk~LseeEK~~Y~~kA~~~ 129 (280)
-+||+-|.+++|.+ |.+++..|+....+.+|..|+++||..|+.++...
T Consensus 83 nnaYLNFLReFRrk----h~~L~p~dlI~~AAraW~rLSe~eK~rYrr~~~~~ 131 (183)
T PF06382_consen 83 NNAYLNFLREFRRK----HCGLSPQDLIQRAARAWCRLSEAEKNRYRRMAPSV 131 (183)
T ss_pred chHHHHHHHHHHHH----ccCCCHHHHHHHHHHHHHhCCHHHHHHHHhhcchh
Confidence 47899999998874 78999999999999999999999999999976643
No 20
>PF04690 YABBY: YABBY protein; InterPro: IPR006780 YABBY proteins are a group of plant-specific transcription factors involved in the specification of abaxial polarity in lateral organs such as leaves and floral organs [, ].
Probab=96.75 E-value=0.0028 Score=55.79 Aligned_cols=48 Identities=17% Similarity=0.299 Sum_probs=42.9
Q ss_pred CCCCCCCCCChHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHHcCCCh
Q psy7578 69 QESIEKARFTAYMLWAKQIRQKLIKSNPEMDFSQVSKKLGELWHTVPF 116 (280)
Q Consensus 69 d~~~PKRP~sAY~LF~ke~R~kik~e~P~ls~~eIsK~Lge~Wk~Lse 116 (280)
.+.+..|-++||..|+++.-++||.+||+++..|.-+..+..|+..+.
T Consensus 118 PPEKRqR~psaYn~f~k~ei~rik~~~p~ishkeaFs~aAknW~h~ph 165 (170)
T PF04690_consen 118 PPEKRQRVPSAYNRFMKEEIQRIKAENPDISHKEAFSAAAKNWAHFPH 165 (170)
T ss_pred CccccCCCchhHHHHHHHHHHHHHhcCCCCCHHHHHHHHHHhhhhCcc
Confidence 345566889999999999999999999999999999999999987653
No 21
>PF08073 CHDNT: CHDNT (NUC034) domain; InterPro: IPR012958 The CHD N-terminal domain is found in PHD/RING fingers and chromo domain-associated helicases [].; GO: 0003677 DNA binding, 0005524 ATP binding, 0008270 zinc ion binding, 0016818 hydrolase activity, acting on acid anhydrides, in phosphorus-containing anhydrides, 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus
Probab=94.46 E-value=0.052 Score=39.44 Aligned_cols=40 Identities=25% Similarity=0.458 Sum_probs=35.8
Q ss_pred CChHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHHcCCCh
Q psy7578 77 FTAYMLWAKQIRQKLIKSNPEMDFSQVSKKLGELWHTVPF 116 (280)
Q Consensus 77 ~sAY~LF~ke~R~kik~e~P~ls~~eIsK~Lge~Wk~Lse 116 (280)
.+-|-+|.+..|+.|.+.||++..+.+...++.+|+.-++
T Consensus 13 lt~yK~Fsq~vRP~l~~~NPk~~~sKl~~l~~AKwrEF~~ 52 (55)
T PF08073_consen 13 LTNYKAFSQHVRPLLAKANPKAPMSKLMMLLQAKWREFQE 52 (55)
T ss_pred HHHHHHHHHHHHHHHHHHCCCCcHHHHHHHHHHHHHHHHh
Confidence 3568899999999999999999999999999999986543
No 22
>PF04769 MAT_Alpha1: Mating-type protein MAT alpha 1; InterPro: IPR006856 This family includes Saccharomyces cerevisiae (Baker's yeast) mating type protein alpha 1 (P01365 from SWISSPROT). MAT alpha 1 is a transcription activator that activates mating-type alpha-specific genes with the help of the MADS-box containing MCM1 transcription factor, which together bind cooperatively to PQ elements upstream of alpha-specific genes. The MCM1-MATalpha1 complex is required for the proper DNA-bending that is needed for transcriptional activation []. Alpha 1 interacts in vivo with STE12, linking expression of alpha-specific genes to the alpha-pheromone (IPR006742 from INTERPRO) response pathway [].; GO: 0000772 mating pheromone activity, 0003677 DNA binding, 0045895 positive regulation of transcription, mating-type specific, 0005634 nucleus
Probab=94.12 E-value=0.13 Score=46.52 Aligned_cols=55 Identities=18% Similarity=0.268 Sum_probs=42.7
Q ss_pred cCCCCCCCCCChHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHHcCCChhhHHHHHHHHHH
Q psy7578 68 SQESIEKARFTAYMLWAKQIRQKLIKSNPEMDFSQVSKKLGELWHTVPFNEKYGWKRQADR 128 (280)
Q Consensus 68 ~d~~~PKRP~sAY~LF~ke~R~kik~e~P~ls~~eIsK~Lge~Wk~LseeEK~~Y~~kA~~ 128 (280)
....++|||.|+||.|+.-+- ...|+....++|..|+..|+.= --|..|.-.|+.
T Consensus 39 ~~~~~~kr~lN~Fm~FRsyy~----~~~~~~~Qk~~S~~l~~lW~~d--p~k~~W~l~ak~ 93 (201)
T PF04769_consen 39 RSPEKAKRPLNGFMAFRSYYS----PIFPPLPQKELSGILTKLWEKD--PFKNKWSLMAKA 93 (201)
T ss_pred ccccccccchhHHHHHHHHHH----hhcCCcCHHHHHHHHHHHHhCC--ccHhHHHHHhhh
Confidence 445678999999999987665 3568889999999999999773 335667766653
No 23
>PF06244 DUF1014: Protein of unknown function (DUF1014); InterPro: IPR010422 This family consists of several hypothetical eukaryotic proteins of unknown function.
Probab=89.77 E-value=0.59 Score=39.14 Aligned_cols=48 Identities=25% Similarity=0.421 Sum_probs=41.9
Q ss_pred CCCCCCCC-ChHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHHcCCChh
Q psy7578 70 ESIEKARF-TAYMLWAKQIRQKLIKSNPEMDFSQVSKKLGELWHTVPFN 117 (280)
Q Consensus 70 ~~~PKRP~-sAY~LF~ke~R~kik~e~P~ls~~eIsK~Lge~Wk~Lsee 117 (280)
..||-|.+ -||.-|...+-+.++.+||++..+++-.+|-.+|..-++.
T Consensus 69 drHPErR~KAAy~afeE~~Lp~lK~E~PgLrlsQ~kq~l~K~w~KSPeN 117 (122)
T PF06244_consen 69 DRHPERRMKAAYKAFEERRLPELKEENPGLRLSQYKQMLWKEWQKSPEN 117 (122)
T ss_pred CCCcchhHHHHHHHHHHHHhHHHHhhCCCchHHHHHHHHHHHHhcCCCC
Confidence 45776666 6799999999999999999999999999999999876653
No 24
>PRK15117 ABC transporter periplasmic binding protein MlaC; Provisional
Probab=84.44 E-value=4.5 Score=36.48 Aligned_cols=47 Identities=13% Similarity=0.174 Sum_probs=41.1
Q ss_pred CCCCCHHHHHH-HHHHHHcCCChhhHHHHHHHHHH-HHHHHHHHHhhCC
Q psy7578 95 NPEMDFSQVSK-KLGELWHTVPFNEKYGWKRQADR-LAAKYTQKMSKAP 141 (280)
Q Consensus 95 ~P~ls~~eIsK-~Lge~Wk~LseeEK~~Y~~kA~~-~kekY~~ema~~~ 141 (280)
.|..+|..+++ .||.-|++.++++|+.|.+.-.+ +..-|-..+..|.
T Consensus 65 ~p~~Df~~~s~~vLG~~wr~as~eQr~~F~~~F~~~Lv~tYa~~l~~y~ 113 (211)
T PRK15117 65 LPYVQVKYAGALVLGRYYKDATPAQREAYFAAFREYLKQAYGQALAMYH 113 (211)
T ss_pred cccCCHHHHHHHHhhhhhhhCCHHHHHHHHHHHHHHHHHHHHHHHHHhC
Confidence 38899999987 78999999999999999998887 5668889998884
No 25
>TIGR03481 HpnM hopanoid biosynthesis associated membrane protein HpnM. The genomes containing members of this family share the machinery for the biosynthesis of hopanoid lipids. Furthermore, the genes of this family are usually located proximal to other components of this biological process. The proteins are members of the pfam05494 family of putative transporters known as "toluene tolerance protein Ttg2D", although it is unlikely that the members included here have anything to do with toluene per-se.
Probab=84.38 E-value=2.1 Score=38.26 Aligned_cols=46 Identities=13% Similarity=0.336 Sum_probs=40.6
Q ss_pred CCCCHHHHHH-HHHHHHcCCChhhHHHHHHHHHH-HHHHHHHHHhhCC
Q psy7578 96 PEMDFSQVSK-KLGELWHTVPFNEKYGWKRQADR-LAAKYTQKMSKAP 141 (280)
Q Consensus 96 P~ls~~eIsK-~Lge~Wk~LseeEK~~Y~~kA~~-~kekY~~ema~~~ 141 (280)
|..++..|++ .||.-|+.+++++|+.|.+...+ +...|-..+..|.
T Consensus 62 ~~~Df~~mar~vLG~~W~~~s~~Qr~~F~~~F~~~l~~tY~~~l~~y~ 109 (198)
T TIGR03481 62 EAFDLPAMARLTLGSSWTSLSPEQRRRFIGAFRELSIATYASQFKSYA 109 (198)
T ss_pred HhCCHHHHHHHHhhhhhhhCCHHHHHHHHHHHHHHHHHHHHHHHHhhc
Confidence 4789999987 78999999999999999999988 6788988888873
No 26
>PF05494 Tol_Tol_Ttg2: Toluene tolerance, Ttg2 ; InterPro: IPR008869 Toluene tolerance is mediated by increased cell membrane rigidity resulting from changes in fatty acid and phospholipid compositions, exclusion of toluene from the cell membrane, and removal of intracellular toluene by degradation []. Many proteins are involved in these processes. This family is a transporter which shows similarity to ABC transporters [].; PDB: 2QGU_A.
Probab=69.09 E-value=12 Score=31.89 Aligned_cols=46 Identities=17% Similarity=0.496 Sum_probs=36.4
Q ss_pred CCCCHHHHHH-HHHHHHcCCChhhHHHHHHHHHH-HHHHHHHHHhhCC
Q psy7578 96 PEMDFSQVSK-KLGELWHTVPFNEKYGWKRQADR-LAAKYTQKMSKAP 141 (280)
Q Consensus 96 P~ls~~eIsK-~Lge~Wk~LseeEK~~Y~~kA~~-~kekY~~ema~~~ 141 (280)
|.+++..+++ .||.-|+.+++++|+.|.+...+ +...|-..+..|.
T Consensus 36 ~~~D~~~~ar~~LG~~w~~~s~~q~~~F~~~f~~~l~~~Y~~~l~~y~ 83 (170)
T PF05494_consen 36 PYFDFERMARRVLGRYWRKASPAQRQRFVEAFKQLLVRTYAKRLDEYS 83 (170)
T ss_dssp GGB-HHHHHHHHHGGGTTTS-HHHHHHHHHHHHHHHHHHHHHHHHT-S
T ss_pred HhCCHHHHHHHHHHHhHhhCCHHHHHHHHHHHHHHHHHHHHHHHHhhC
Confidence 7888998886 46889999999999999998887 5667888888874
No 27
>PF12881 NUT_N: NUT protein N terminus; InterPro: IPR024309 This domain is found in the N-terminal region of Nuclear Testis (NUT) proteins. It is also found in FAM22, which are a family of uncharacterised mammalian proteins.
Probab=61.87 E-value=19 Score=34.85 Aligned_cols=55 Identities=11% Similarity=0.173 Sum_probs=43.3
Q ss_pred CChHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHHcCCChhhHHHHHHHHHHHHH
Q psy7578 77 FTAYMLWAKQIRQKLIKSNPEMDFSQVSKKLGELWHTVPFNEKYGWKRQADRLAA 131 (280)
Q Consensus 77 ~sAY~LF~ke~R~kik~e~P~ls~~eIsK~Lge~Wk~LseeEK~~Y~~kA~~~ke 131 (280)
..||..|.--.-..+....|.++..|--.+.-+.|...|.=||..|+++|++..|
T Consensus 229 ~EAlSCFLIpvLrsLar~kPtMtlEeGl~ra~qEW~~~SnfdRmifyemaekFmE 283 (328)
T PF12881_consen 229 AEALSCFLIPVLRSLARLKPTMTLEEGLWRAVQEWQHTSNFDRMIFYEMAEKFME 283 (328)
T ss_pred hhhhhhhHHHHHHHHHhcCCCccHHHHHHHHHHHhhccccccHHHHHHHHHHHcc
Confidence 3455555555555556667778888888888899999999999999999998755
No 28
>KOG3223|consensus
Probab=51.05 E-value=21 Score=32.37 Aligned_cols=55 Identities=24% Similarity=0.418 Sum_probs=44.1
Q ss_pred CCCCCCCCC-ChHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHHcCCChhhHHHHHHHH
Q psy7578 69 QESIEKARF-TAYMLWAKQIRQKLIKSNPEMDFSQVSKKLGELWHTVPFNEKYGWKRQA 126 (280)
Q Consensus 69 d~~~PKRP~-sAY~LF~ke~R~kik~e~P~ls~~eIsK~Lge~Wk~LseeEK~~Y~~kA 126 (280)
|..||-|.| -||.-|-...-+.++.+||++..++.-.+|-.+|+.-++. ||.+.+
T Consensus 160 ddrHPEkRmrAA~~afEe~~LPrLK~e~P~lrlsQ~Kqll~Kew~KsPDN---P~Nq~~ 215 (221)
T KOG3223|consen 160 DDRHPEKRMRAAFKAFEEARLPRLKKENPGLRLSQYKQLLKKEWQKSPDN---PFNQAA 215 (221)
T ss_pred cccChHHHHHHHHHHHHHhhchhhhhcCCCccHHHHHHHHHHHHhhCCCC---hhhHHh
Confidence 346775555 5588888888999999999999999999999999887765 555443
No 29
>PF01352 KRAB: KRAB box; InterPro: IPR001909 The Krueppel-associated box (KRAB) is a domain of around 75 amino acids that is found in the N-terminal part of about one third of eukaryotic Krueppel-type C2H2 zinc finger proteins (ZFPs) []. It is enriched in charged amino acids and can be divided into subregions A and B, which are predicted to fold into two amphipathic alpha-helices. The KRAB A and B boxes can be separated by variable spacer segments and many KRAB proteins contain only the A box []. The functions currently known for members of the KRAB-containing protein family include transcriptional repression of RNA polymerase I, II, and III promoters, binding and splicing of RNA, and control of nucleolus function. The KRAB domain functions as a transcriptional repressor when tethered to the template DNA by a DNA-binding domain. A sequence of 45 amino acids in the KRAB A subdomain has been shown to be necessary and sufficient for transcriptional repression. The B box does not repress by itself but does potentiate the repression exerted by the KRAB A subdomain [, ]. Gene silencing requires the binding of the KRAB domain to the RING-B box-coiled coil (RBCC) domain of the KAP-1/TIF1-beta corepressor. As KAP-1 binds to the heterochromatin proteins HP1, it has been proposed that the KRAB-ZFP-bound target gene could be silenced following recruitment to heterochromatin [, ]. KRAB-ZFPs probably constitute the single largest class of transcription factors within the human genome []. Although the function of KRAB-ZFPs is largely unknown, they appear to play important roles during cell differentiation and development. The KRAB domain is generally encoded by two exons. The regions coded by the two exons are known as KRAB-A and KRAB-B.; GO: 0003676 nucleic acid binding, 0006355 regulation of transcription, DNA-dependent, 0005622 intracellular; PDB: 1V65_A.
Probab=37.80 E-value=27 Score=23.61 Aligned_cols=29 Identities=17% Similarity=0.251 Sum_probs=15.9
Q ss_pred CHHHHHHHHH-HHHcCCChhhHHHHHHHHH
Q psy7578 99 DFSQVSKKLG-ELWHTVPFNEKYGWKRQAD 127 (280)
Q Consensus 99 s~~eIsK~Lg-e~Wk~LseeEK~~Y~~kA~ 127 (280)
.|.||+--.+ +.|..|.+.+|.-|.+--.
T Consensus 2 tf~Dvav~fs~eEW~~L~~~Qk~ly~dvm~ 31 (41)
T PF01352_consen 2 TFEDVAVYFSQEEWELLDPAQKNLYRDVML 31 (41)
T ss_dssp -----TT---HHHHHTS-HHHHHHHHHHHH
T ss_pred eEEEEEEEcChhhcccccceecccchhHHH
Confidence 3555655555 5699999999999887543
No 30
>PF06945 DUF1289: Protein of unknown function (DUF1289); InterPro: IPR010710 This family consists of a number of hypothetical bacterial proteins. The aligned region spans around 56 residues and contains 4 highly conserved cysteine residues towards the N terminus. The function of this family is unknown.
Probab=35.64 E-value=50 Score=23.19 Aligned_cols=25 Identities=8% Similarity=0.315 Sum_probs=18.2
Q ss_pred CHHHHHHHHHHHHcCCChhhHHHHHHHHHH
Q psy7578 99 DFSQVSKKLGELWHTVPFNEKYGWKRQADR 128 (280)
Q Consensus 99 s~~eIsK~Lge~Wk~LseeEK~~Y~~kA~~ 128 (280)
+..||.. |+.|+++||....+....
T Consensus 23 T~dEI~~-----W~~~s~~er~~i~~~l~~ 47 (51)
T PF06945_consen 23 TLDEIRD-----WKSMSDDERRAILARLRA 47 (51)
T ss_pred cHHHHHH-----HhhCCHHHHHHHHHHHHH
Confidence 4566665 999999999876665443
No 31
>PF12650 DUF3784: Domain of unknown function (DUF3784); InterPro: IPR017259 This group represents an uncharacterised conserved protein.
Probab=34.09 E-value=27 Score=27.27 Aligned_cols=15 Identities=20% Similarity=0.428 Sum_probs=12.8
Q ss_pred HcCCChhhHHHHHHH
Q psy7578 111 WHTVPFNEKYGWKRQ 125 (280)
Q Consensus 111 Wk~LseeEK~~Y~~k 125 (280)
|++||+|||+.|.++
T Consensus 26 yntms~eEk~~~D~~ 40 (97)
T PF12650_consen 26 YNTMSKEEKEKYDKK 40 (97)
T ss_pred cccCCHHHHHHhhHH
Confidence 899999999988653
No 32
>COG2854 Ttg2D ABC-type transport system involved in resistance to organic solvents, auxiliary component [Secondary metabolites biosynthesis, transport, and catabolism]
Probab=33.27 E-value=66 Score=29.26 Aligned_cols=51 Identities=22% Similarity=0.330 Sum_probs=41.2
Q ss_pred CCCCHHHHH-HHHHHHHcCCChhhHHHHHHHHHH-HHHHHHHHHhhCCCCCCC
Q psy7578 96 PEMDFSQVS-KKLGELWHTVPFNEKYGWKRQADR-LAAKYTQKMSKAPAQKTK 146 (280)
Q Consensus 96 P~ls~~eIs-K~Lge~Wk~LseeEK~~Y~~kA~~-~kekY~~ema~~~~~p~K 146 (280)
|-+++.-++ ..||.-|++.++++++.|.+.... +...|-..+..|..++-+
T Consensus 68 p~vd~~~~a~~vLGk~~k~aspeQ~~~F~~aF~~yl~q~Y~~aL~~Y~~q~~~ 120 (202)
T COG2854 68 PYVDFKYAAKLVLGKYYKTASPEQRQAFFKAFRTYLEQTYGQALLDYKGQTLK 120 (202)
T ss_pred hhhcHHHHHHHHhccccccCCHHHHHHHHHHHHHHHHHHHHHHHHHccCCCce
Confidence 777776654 568999999999999999988877 567899999999655544
No 33
>PF13875 DUF4202: Domain of unknown function (DUF4202)
Probab=32.16 E-value=95 Score=27.91 Aligned_cols=40 Identities=10% Similarity=0.211 Sum_probs=34.0
Q ss_pred ChHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHHcCCChhhHH
Q psy7578 78 TAYMLWAKQIRQKLIKSNPEMDFSQVSKKLGELWHTVPFNEKY 120 (280)
Q Consensus 78 sAY~LF~ke~R~kik~e~P~ls~~eIsK~Lge~Wk~LseeEK~ 120 (280)
-+-++|...+-..+...| +-..+-.++..-|+.||+.-++
T Consensus 130 vacLVFL~~~f~~F~~~~---deeK~v~Il~KTw~KMS~~g~~ 169 (185)
T PF13875_consen 130 VACLVFLEYYFEDFAAKH---DEEKIVDILRKTWRKMSERGHE 169 (185)
T ss_pred hHHHHhHHHHHHHHHhcC---CHHHHHHHHHHHHHHCCHHHHH
Confidence 458899999999998887 5667888999999999998764
No 34
>PRK09706 transcriptional repressor DicA; Reviewed
Probab=31.64 E-value=82 Score=25.81 Aligned_cols=38 Identities=16% Similarity=0.206 Sum_probs=33.9
Q ss_pred HHHHHHHHHcCCChhhHHHHHHHHHHHHHHHHHHHhhC
Q psy7578 103 VSKKLGELWHTVPFNEKYGWKRQADRLAAKYTQKMSKA 140 (280)
Q Consensus 103 IsK~Lge~Wk~LseeEK~~Y~~kA~~~kekY~~ema~~ 140 (280)
-.+.+-+.|+.|++++++...+..+...+.|++.++++
T Consensus 88 ~~~~ll~~~~~L~~~~~~~~l~~l~~~~~~~~~~~~~~ 125 (135)
T PRK09706 88 DQKELLELFDALPESEQDAQLSEMRARVENFNKLFEEL 125 (135)
T ss_pred HHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 35788899999999999999999999989998888876
No 35
>PRK12751 cpxP periplasmic stress adaptor protein CpxP; Reviewed
Probab=29.60 E-value=1.1e+02 Score=26.78 Aligned_cols=33 Identities=12% Similarity=0.207 Sum_probs=26.1
Q ss_pred HHHHHHHHHcCCChhhHHHHHHHHHHHHHHHHH
Q psy7578 103 VSKKLGELWHTVPFNEKYGWKRQADRLAAKYTQ 135 (280)
Q Consensus 103 IsK~Lge~Wk~LseeEK~~Y~~kA~~~kekY~~ 135 (280)
..+..-++++-|++|+|..|.+..++..++..+
T Consensus 119 ~~~~~~qmy~lLTPEQra~l~~~~e~r~~~~~~ 151 (162)
T PRK12751 119 MAKVRNQMYNLLTPEQKEALNKKHQERIEKLQQ 151 (162)
T ss_pred HHHHHHHHHHcCCHHHHHHHHHHHHHHHHHHHh
Confidence 346667889999999999999988876666544
No 36
>PRK10236 hypothetical protein; Provisional
Probab=28.92 E-value=54 Score=30.56 Aligned_cols=22 Identities=14% Similarity=0.286 Sum_probs=18.6
Q ss_pred HHHHHHHHHcCCChhhHHHHHH
Q psy7578 103 VSKKLGELWHTVPFNEKYGWKR 124 (280)
Q Consensus 103 IsK~Lge~Wk~LseeEK~~Y~~ 124 (280)
+.|.+...|+.||++|++.+.+
T Consensus 118 l~kll~~a~~kms~eE~~~L~~ 139 (237)
T PRK10236 118 LEQFLRNTWKKMDEEHKQEFLH 139 (237)
T ss_pred HHHHHHHHHHHCCHHHHHHHHH
Confidence 4789999999999999976554
No 37
>PF11304 DUF3106: Protein of unknown function (DUF3106); InterPro: IPR021455 Some members in this family of proteins are annotated as transmembrane proteins however this cannot be confirmed. Currently no function is known.
Probab=28.07 E-value=2e+02 Score=23.23 Aligned_cols=37 Identities=11% Similarity=0.183 Sum_probs=19.5
Q ss_pred HHHHhCCCCCHHHHHHHHH--HHHcCCChhhHHHHHHHH
Q psy7578 90 KLIKSNPEMDFSQVSKKLG--ELWHTVPFNEKYGWKRQA 126 (280)
Q Consensus 90 kik~e~P~ls~~eIsK~Lg--e~Wk~LseeEK~~Y~~kA 126 (280)
.+...++.++..+-.+... ..|.+|++++|..-.+.+
T Consensus 33 ~~a~r~~~mspeqq~r~~~rm~~W~~LspeqR~~~R~~~ 71 (107)
T PF11304_consen 33 QIAERWPSMSPEQQQRLRERMRRWAALSPEQRQQARENY 71 (107)
T ss_pred HHHHHHhcCCHHHHHHHHHHHHHHHhCCHHHHHHHHHHH
Confidence 3444556666554433322 367777777665444433
No 38
>PF11304 DUF3106: Protein of unknown function (DUF3106); InterPro: IPR021455 Some members in this family of proteins are annotated as transmembrane proteins however this cannot be confirmed. Currently no function is known.
Probab=27.99 E-value=94 Score=25.10 Aligned_cols=26 Identities=27% Similarity=0.607 Sum_probs=16.8
Q ss_pred HHHHHHHHHHcCCChhhHHHHHHHHH
Q psy7578 102 QVSKKLGELWHTVPFNEKYGWKRQAD 127 (280)
Q Consensus 102 eIsK~Lge~Wk~LseeEK~~Y~~kA~ 127 (280)
++-.-+.+.|+.|+++.|..|...|.
T Consensus 11 ~~L~pl~~~W~~l~~~qr~k~l~~a~ 36 (107)
T PF11304_consen 11 QALAPLAERWNSLPPEQRRKWLQIAE 36 (107)
T ss_pred HHHHHHHHHHhcCCHHHHHHHHHHHH
Confidence 34556666777777777776666654
No 39
>cd07081 ALDH_F20_ACDH_EutE-like Coenzyme A acylating aldehyde dehydrogenase (ACDH), Ethanolamine utilization protein EutE, and related proteins. Coenzyme A acylating aldehyde dehydrogenase (ACDH), an NAD+ and CoA-dependent acetaldehyde dehydrogenase, acetylating (EC=1.2.1.10), functions as a single enzyme (such as the Ethanolamine utilization protein, EutE, in Salmonella typhimurium) or as part of a multifunctional enzyme to convert acetaldehyde into acetyl-CoA. The E. coli aldehyde-alcohol dehydrogenase includes the functional domains, alcohol dehydrogenase (ADH), ACDH, and pyruvate-formate-lyase deactivase; and the Entamoeba histolytica aldehyde-alcohol dehydrogenase 2 (ALDH20A1) includes the functional domains ADH and ACDH, and may be critical enzymes in the fermentative pathway.
Probab=27.11 E-value=1.2e+02 Score=30.30 Aligned_cols=39 Identities=13% Similarity=-0.100 Sum_probs=33.0
Q ss_pred HHHHHHHHHHcCCChhhHHHHHHHHHHHHHHHHHHHhhC
Q psy7578 102 QVSKKLGELWHTVPFNEKYGWKRQADRLAAKYTQKMSKA 140 (280)
Q Consensus 102 eIsK~Lge~Wk~LseeEK~~Y~~kA~~~kekY~~ema~~ 140 (280)
+.++..-+.|+.++.++|..+..++.+..+++..++++.
T Consensus 6 ~~A~~A~~~W~~~~~~~R~~iL~~~a~~l~~~~~ela~~ 44 (439)
T cd07081 6 AAAKVAQQGLSCKSQEMVDLIFRAAAEAAEDARIDLAKL 44 (439)
T ss_pred HHHHHHHHHHhhCCHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 345666678999999999999999999999999888765
No 40
>KOG1827|consensus
Probab=26.86 E-value=5.1 Score=41.97 Aligned_cols=44 Identities=27% Similarity=0.416 Sum_probs=40.7
Q ss_pred CCChHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHHcCCChhhH
Q psy7578 76 RFTAYMLWAKQIRQKLIKSNPEMDFSQVSKKLGELWHTVPFNEK 119 (280)
Q Consensus 76 P~sAY~LF~ke~R~kik~e~P~ls~~eIsK~Lge~Wk~LseeEK 119 (280)
-+++|+.|..+.|..+-.++|+..+++++.+.|..|+.++..-|
T Consensus 552 ~~~~~~~~s~~~~~~~~~~np~v~~~~~~~~vg~~~~~lp~~~k 595 (629)
T KOG1827|consen 552 SPEPYILDSIENRTIIWFENPTVGFGEVSIIVGNDWDKLPNINK 595 (629)
T ss_pred CCccccccccccCceeeeeCCCcccceeEEeecCCcccCccccc
Confidence 57889999999999999999999999999999999999995554
No 41
>smart00500 SFM Splicing Factor Motif, present in Prp18 and Pr04.
Probab=25.96 E-value=27 Score=24.21 Aligned_cols=23 Identities=30% Similarity=0.512 Sum_probs=19.5
Q ss_pred ChhhHHHHHHHhhhhHHHhhhhh
Q psy7578 199 QPLDIAAHLRLLGDNLTIIGERL 221 (280)
Q Consensus 199 ~~id~aahl~ll~~~l~~~g~~~ 221 (280)
.-.||-+-|+.|||..+++||.-
T Consensus 2 ~d~eV~~~LR~lgePi~lFGE~~ 24 (44)
T smart00500 2 PDSEVIRRLRELGEPITLFGEDD 24 (44)
T ss_pred CHHHHHHHHHHcCCCeeecCCCh
Confidence 34688899999999999999853
No 42
>PF12356 DUF3643: Protein of unknown function (DUF3643) ; InterPro: IPR022103 This family of proteins is found in eukaryotes. Proteins in this family are typically between 217 and 4852 amino acids in length. There is a conserved TLA sequence motif.
Probab=25.70 E-value=35 Score=30.10 Aligned_cols=14 Identities=29% Similarity=0.646 Sum_probs=11.1
Q ss_pred chhhhhHHHHHhhh
Q psy7578 233 GMSLLLDSFLCALG 246 (280)
Q Consensus 233 ~~s~lld~~~c~~~ 246 (280)
.|=-=+||+|||||
T Consensus 76 ~LKkaiDSlLCS~C 89 (165)
T PF12356_consen 76 PLKKAIDSLLCSMC 89 (165)
T ss_pred HHHHHHHHHHHHHH
Confidence 44555899999997
No 43
>PF08367 M16C_assoc: Peptidase M16C associated; InterPro: IPR013578 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This domain appears in eukaryotes as well as bacteria and tends to be found near the C terminus of metalloproteases and related sequences belonging to MEROPS peptidase family M16 (subfamily M16C, clan ME). These include: eupitrilysin, falcilysin, PreP peptidase, CYM1 peptidase and subfamily M16C non-peptidase homologues.; GO: 0008237 metallopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis; PDB: 2FGE_B 3S5I_A 3S5H_A 3S5M_A 3S5K_A.
Probab=23.59 E-value=1.4e+02 Score=27.08 Aligned_cols=32 Identities=19% Similarity=0.163 Sum_probs=26.8
Q ss_pred HHHHHHHHHHHcCCChhhHHHHHHHHHHHHHH
Q psy7578 101 SQVSKKLGELWHTVPFNEKYGWKRQADRLAAK 132 (280)
Q Consensus 101 ~eIsK~Lge~Wk~LseeEK~~Y~~kA~~~kek 132 (280)
.+..+.|.+.+.+|+++|++...+.++++++.
T Consensus 13 ~~e~~~L~~~k~~Ls~~e~~~i~~~~~~L~~~ 44 (248)
T PF08367_consen 13 EEEKEKLAAYKASLSEEEKEKIIEQTKELKER 44 (248)
T ss_dssp HHHHHHHHHHHHCS-HHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHhhCCHHHHHHHHHHHHHHHHH
Confidence 45778999999999999999999999987554
No 44
>cd07749 NT_Pol-beta-like_1 Nucleotidyltransferase (NT) domain of an uncharacterized subgroup of the Pol beta-like NT superfamily. The Pol beta-like NT superfamily includes DNA polymerase beta and other family X DNA Polymerases, as well as Class I and Class II CCA-adding enzymes, RelA- and SpoT-like ppGpp synthetases and hydrolases, 2'5'-oligoadenylate (2-5A)synthetases, Escherichia coli adenylyltransferase (GlnE), Escherichia coli uridylyl transferase (GlnD), poly(A) polymerases, terminal uridylyl transferases, Staphylococcus aureus kanamycin nucleotidyltransferase, and similar proteins. Proteins belonging to this subgroup are uncharacterized. In the majority of the Pol beta-like superfamily NTs, two carboxylates, Dx[D/E], together with a third more distal carboxylate, coordinate two divalent metal cations essential for catalysis. These divalent metal ions are involved in a two-metal ion mechanism of nucleotide addition. These carboxylate residues are conserved in this subgroup.
Probab=22.44 E-value=61 Score=28.33 Aligned_cols=29 Identities=38% Similarity=0.476 Sum_probs=24.1
Q ss_pred HHHHhhhhHHHhhhhhcccCCeeeeccchhhhh
Q psy7578 206 HLRLLGDNLTIIGERLKDTQGRMAISGGMSLLL 238 (280)
Q Consensus 206 hl~ll~~~l~~~g~~~~~~~~~~~~~~~~s~ll 238 (280)
|++. |+-|+++|++.+..-+|.||+++-|
T Consensus 2 ~~~~----L~~i~~~ln~~~i~W~lgGS~~L~l 30 (156)
T cd07749 2 YLRA----LRKFYERLKNINVNWALTGSLSFAL 30 (156)
T ss_pred hHHH----HHHHHHHhccCCceEEehhhHHHHH
Confidence 5555 5789999999999999999987544
No 45
>PF05388 Carbpep_Y_N: Carboxypeptidase Y pro-peptide; InterPro: IPR008442 This signature is found at the N terminus of carboxypeptidase Y, which belong to MEROPS peptidase family S10. This region contains the signal peptide and pro-peptide regions [,].; GO: 0004185 serine-type carboxypeptidase activity, 0005773 vacuole
Probab=22.44 E-value=1.4e+02 Score=24.66 Aligned_cols=29 Identities=21% Similarity=0.243 Sum_probs=25.4
Q ss_pred HHHHHHHHHHHHcCCChhhHHHHHHHHHH
Q psy7578 100 FSQVSKKLGELWHTVPFNEKYGWKRQADR 128 (280)
Q Consensus 100 ~~eIsK~Lge~Wk~LseeEK~~Y~~kA~~ 128 (280)
+..+++.++|.++.|+.|-|+.|.+....
T Consensus 45 ~~~~~~~l~e~l~~Lt~e~k~~W~E~~~~ 73 (113)
T PF05388_consen 45 LEKISKYLNEPLKSLTSEAKALWDEMMLL 73 (113)
T ss_pred HHHHHHHHHHHHhhccHHHHHHHHHHHHH
Confidence 55688889999999999999999997664
No 46
>KOG1610|consensus
Probab=22.32 E-value=2.2e+02 Score=27.81 Aligned_cols=49 Identities=16% Similarity=0.332 Sum_probs=37.2
Q ss_pred HHHHHHHHHHHHhC-------CC------CCHHHHHHHHHHHHcCCChhhHHHHHHHHHHHH
Q psy7578 82 LWAKQIRQKLIKSN-------PE------MDFSQVSKKLGELWHTVPFNEKYGWKRQADRLA 130 (280)
Q Consensus 82 LF~ke~R~kik~e~-------P~------ls~~eIsK~Lge~Wk~LseeEK~~Y~~kA~~~k 130 (280)
.|+...|.++..-. |+ .....+.+.+.+.|..++++.|+.|-+.+....
T Consensus 187 af~D~lR~EL~~fGV~VsiiePG~f~T~l~~~~~~~~~~~~~w~~l~~e~k~~YGedy~~~~ 248 (322)
T KOG1610|consen 187 AFSDSLRRELRPFGVKVSIIEPGFFKTNLANPEKLEKRMKEIWERLPQETKDEYGEDYFEDY 248 (322)
T ss_pred HHHHHHHHHHHhcCcEEEEeccCccccccCChHHHHHHHHHHHhcCCHHHHHHHHHHHHHHH
Confidence 47788888876421 33 224778999999999999999999988776553
No 47
>PRK12750 cpxP periplasmic repressor CpxP; Reviewed
Probab=20.78 E-value=2.3e+02 Score=24.84 Aligned_cols=35 Identities=11% Similarity=0.127 Sum_probs=28.7
Q ss_pred HHHHHHHHcCCChhhHHHHHHHHHHHHHHHHHHHh
Q psy7578 104 SKKLGELWHTVPFNEKYGWKRQADRLAAKYTQKMS 138 (280)
Q Consensus 104 sK~Lge~Wk~LseeEK~~Y~~kA~~~kekY~~ema 138 (280)
.+..-+.+.-|++++|..|.+.-++..++|.+.+.
T Consensus 127 ~~~~~~~~~vLTpEQRak~~e~~~~r~~~~~~~~~ 161 (170)
T PRK12750 127 LEKRHQMLSILTPEQKAKFQELQQERMQECQDKMH 161 (170)
T ss_pred HHHHHHHHHhCCHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34455689999999999999998888888887774
No 48
>cd07133 ALDH_CALDH_CalB Coniferyl aldehyde dehydrogenase-like. Coniferyl aldehyde dehydrogenase (CALDH, EC=1.2.1.68) of Pseudomonas sp. strain HR199 (CalB) which catalyzes the NAD+-dependent oxidation of coniferyl aldehyde to ferulic acid, and similar sequences, are present in this CD.
Probab=20.57 E-value=1.8e+02 Score=28.71 Aligned_cols=39 Identities=10% Similarity=-0.008 Sum_probs=31.8
Q ss_pred HHHHHHHHHHcCCChhhHHHHHHHHHHHHHHHHHHHhhC
Q psy7578 102 QVSKKLGELWHTVPFNEKYGWKRQADRLAAKYTQKMSKA 140 (280)
Q Consensus 102 eIsK~Lge~Wk~LseeEK~~Y~~kA~~~kekY~~ema~~ 140 (280)
+.++..-..|+.++..+|..+.+++.+..+++..++.+.
T Consensus 5 ~~a~~a~~~w~~~~~~~R~~~L~~~a~~l~~~~~el~~~ 43 (434)
T cd07133 5 ERQKAAFLANPPPSLEERRDRLDRLKALLLDNQDALAEA 43 (434)
T ss_pred HHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 345566677999999999999999888888888888764
No 49
>cd07132 ALDH_F3AB Aldehyde dehydrogenase family 3 members A1, A2, and B1 and related proteins. NAD(P)+-dependent, aldehyde dehydrogenase, family 3 members A1 and B1 (ALDH3A1, ALDH3B1, EC=1.2.1.5) and fatty aldehyde dehydrogenase, family 3 member A2 (ALDH3A2, EC=1.2.1.3), and similar sequences are included in this CD. Human ALDH3A1 is a homodimer with a critical role in cellular defense against oxidative stress; it catalyzes the oxidation of various cellular membrane lipid-derived aldehydes. Corneal crystalline ALDH3A1 protects the cornea and underlying lens against UV-induced oxidative stress. Human ALDH3A2, a microsomal homodimer, catalyzes the oxidation of long-chain aliphatic aldehydes to fatty acids. Human ALDH3B1 is highly expressed in the kidney and liver and catalyzes the oxidation of various medium- and long-chain saturated and unsaturated aliphatic aldehydes.
Probab=20.42 E-value=1.8e+02 Score=28.78 Aligned_cols=39 Identities=10% Similarity=0.039 Sum_probs=32.2
Q ss_pred HHHHHHHHHHcCCChhhHHHHHHHHHHHHHHHHHHHhhC
Q psy7578 102 QVSKKLGELWHTVPFNEKYGWKRQADRLAAKYTQKMSKA 140 (280)
Q Consensus 102 eIsK~Lge~Wk~LseeEK~~Y~~kA~~~kekY~~ema~~ 140 (280)
+.++..-..|+.++..+|..+.+++....+++.+++.+.
T Consensus 5 ~~A~~A~~~w~~~~~~~R~~~L~~~a~~l~~~~~~l~~~ 43 (443)
T cd07132 5 RRAREAFSSGKTRPLEFRIQQLEALLRMLEENEDEIVEA 43 (443)
T ss_pred HHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHhHHHHHHH
Confidence 345666678999999999999999988888888887764
No 50
>cd07661 BAR_ICA69 The Bin/Amphiphysin/Rvs (BAR) domain of Islet Cell Autoantigen 69-kDa. The BAR domain of Arfaptin-like proteins, also called the Arfaptin domain, is a dimerization and lipid binding module that can detect and drive membrane curvature. Islet cell autoantigen 69-kDa (ICA69) is a diabetes-associated autoantigen that is highly expressed in brain and beta cells. It is involved in membrane trafficking at the Golgi complex in neurosecretory cells. It is coexpressed with Protein Interacting with C Kinase 1 (PICK1), also a the BAR domain containing protein, in many tissues at different developmental stages. In neurons, ICA69 colocalizes with PICK1 in cell bodies and dendrites but is absent in synapses where PICK1 is enriched. ICA69 contains an N-terminal BAR domain and a conserved C-terminal domain of unknown function. BAR domains form dimers that bind to membranes, induce membrane bending and curvature, and may also be involved in protein-protein interactions. ICA69 associate
Probab=20.35 E-value=54 Score=29.87 Aligned_cols=65 Identities=28% Similarity=0.315 Sum_probs=48.2
Q ss_pred HHhhhhHHHhhhhhcccCCee--eeccchhhhhHHHHHhhhhHHHhhhcCCCccCCChhHHHHhhhhhhh
Q psy7578 208 RLLGDNLTIIGERLKDTQGRM--AISGGMSLLLDSFLCALGPLLCLTQQIPEENGCSPETLSHVLDNIAY 275 (280)
Q Consensus 208 ~ll~~~l~~~g~~~~~~~~~~--~~~~~~s~lld~~~c~~~pl~~~~~~~~~~~~~~~~~~~~~l~n~~~ 275 (280)
.=||+.|.-.|.+=++..|.+ ++.-+|+-==--+|++-+||..+.+.|--|. -.+++|||.+|.-
T Consensus 40 ~~LG~fl~e~~~~d~t~ag~~m~~t~KaL~~sg~qrl~~r~pl~~f~~~v~Tf~---~rai~Dtl~Ti~~ 106 (204)
T cd07661 40 NVLGKFLKEQGKIDKTTAGKMMAATGKALSFSSQQRLALRVPLLRLYQEVETFR---ERAIADTLQTIQR 106 (204)
T ss_pred HHHHHHHHHHhccChhhhccHHHHHHHHHHHhHHHHHHHHhhHHHHHHHHHHHH---HHHHHHHHHHHHH
Confidence 457888888887777666533 4555666666778999999999998876664 4678899988853
No 51
>PRK10363 cpxP periplasmic repressor CpxP; Reviewed
Probab=20.31 E-value=2.2e+02 Score=25.22 Aligned_cols=37 Identities=3% Similarity=0.148 Sum_probs=30.0
Q ss_pred HHHHHHHHHHcCCChhhHHHHHHHHHHHHHHHHHHHhh
Q psy7578 102 QVSKKLGELWHTVPFNEKYGWKRQADRLAAKYTQKMSK 139 (280)
Q Consensus 102 eIsK~Lge~Wk~LseeEK~~Y~~kA~~~kekY~~ema~ 139 (280)
+..|+--++++-|++|+|..|++..++...++++ ++.
T Consensus 112 em~k~~nqmy~lLTPEQKaq~~~~~~~rm~~~~~-~~~ 148 (166)
T PRK10363 112 EMAKVRNQMYRLLTPEQQAVLNEKHQQRMEQLRD-VTQ 148 (166)
T ss_pred HHHHHHHHHHHhCCHHHHHHHHHHHHHHHHHHHH-HHh
Confidence 4456677899999999999999998888888765 544
No 52
>PRK14830 undecaprenyl pyrophosphate synthase; Provisional
Probab=20.21 E-value=3.8e+02 Score=24.98 Aligned_cols=58 Identities=16% Similarity=0.094 Sum_probs=50.6
Q ss_pred hhhHHHHHHHhhhhHHHhhhhhcccCCeeeeccchhhhhHHHHHhhhhHHHhhhcCCC
Q psy7578 200 PLDIAAHLRLLGDNLTIIGERLKDTQGRMAISGGMSLLLDSFLCALGPLLCLTQQIPE 257 (280)
Q Consensus 200 ~id~aahl~ll~~~l~~~g~~~~~~~~~~~~~~~~s~lld~~~c~~~pl~~~~~~~~~ 257 (280)
+-+|.+++.|+-+.|.-...++.+.+-.|.|.|-++.|=|.+..++--+--.|..=..
T Consensus 84 ~~Ev~~Lm~l~~~~l~~~~~~~~~~~iri~viG~~~~Lp~~~~~~~~~~e~~T~~~~~ 141 (251)
T PRK14830 84 KDEVKFLMNLPVEFLDKFVPELIENNVKVNVIGDTDRLPEHTLRALEKAIEKTKNNTG 141 (251)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHcCCEEEEEcChhhCCHHHHHHHHHHHHHccCCCc
Confidence 5688999999999998888889999999999999999999999998887777766444
Done!