Query 037321
Match_columns 75
No_of_seqs 106 out of 621
Neff 7.3
Searched_HMMs 46136
Date Fri Mar 29 11:02:34 2013
Command hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/037321.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/037321hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PLN02150 terpene synthase/cycl 100.0 4.5E-33 9.8E-38 166.3 8.5 73 1-75 22-95 (96)
2 cd00684 Terpene_cyclase_plant_ 99.9 5E-22 1.1E-26 146.0 8.6 71 1-73 471-542 (542)
3 PLN02279 ent-kaur-16-ene synth 99.8 4.2E-21 9.2E-26 146.0 7.7 71 1-75 705-777 (784)
4 PLN02592 ent-copalyl diphospha 99.1 5.7E-10 1.2E-14 85.9 7.5 67 1-75 732-800 (800)
5 cd00868 Terpene_cyclase_C1 Ter 96.8 0.0052 1.1E-07 41.2 5.7 47 1-48 237-284 (284)
6 cd00385 Isoprenoid_Biosyn_C1 I 89.9 0.33 7.2E-06 30.6 2.6 41 1-42 203-243 (243)
7 PF14805 THDPS_N_2: Tetrahydro 73.2 14 0.0003 20.7 4.6 35 5-42 1-35 (70)
8 PF10540 Membr_traf_MHD: Munc1 63.4 33 0.00072 21.2 5.4 51 5-55 25-106 (137)
9 PF09504 RE_Bsp6I: Bsp6I restr 61.5 23 0.0005 23.3 4.3 41 13-56 24-68 (180)
10 PF06086 Pox_A30L_A26L: Orthop 59.3 21 0.00045 24.4 3.9 46 4-53 23-68 (220)
11 COG4046 Uncharacterized protei 58.5 34 0.00074 24.9 5.1 53 4-57 101-154 (368)
12 PF07431 DUF1512: Protein of u 55.7 40 0.00087 24.6 5.1 53 4-57 88-141 (355)
13 PF08285 DPM3: Dolichol-phosph 54.4 16 0.00035 21.4 2.5 20 1-20 67-86 (91)
14 PF07637 PSD5: Protein of unkn 52.8 36 0.00078 18.3 5.3 56 1-73 1-56 (64)
15 PF01484 Col_cuticle_N: Nemato 47.7 15 0.00033 18.5 1.5 14 5-18 40-53 (53)
16 cd05026 S-100Z S-100Z: S-100Z 46.4 49 0.0011 18.8 3.7 37 32-69 4-41 (93)
17 PF11707 Npa1: Ribosome 60S bi 42.9 82 0.0018 22.1 5.1 58 5-71 173-233 (330)
18 PF11989 Dsl1_C: Retrograde tr 40.7 25 0.00054 24.8 2.2 31 40-72 241-271 (291)
19 cd04470 S1_EF-P_repeat_1 S1_EF 40.6 3.4 7.3E-05 22.3 -1.8 15 42-56 3-17 (61)
20 PF11613 UCN2: Agonist of cort 40.2 18 0.00038 18.0 1.0 17 29-45 7-23 (38)
21 PF02269 TFIID-18kDa: Transcri 38.6 26 0.00057 20.4 1.8 26 44-71 8-33 (93)
22 PRK10626 hypothetical protein; 38.1 25 0.00054 24.3 1.8 57 4-60 88-148 (239)
23 PF04659 Arch_fla_DE: Archaeal 38.0 46 0.001 19.8 2.8 42 29-71 50-91 (99)
24 PF11802 CENP-K: Centromere-as 37.9 46 0.001 23.4 3.1 20 2-21 16-35 (268)
25 PHA02676 A-type inclusion prot 37.3 35 0.00076 26.1 2.6 48 2-53 6-53 (520)
26 cd07018 S49_SppA_67K_type Sign 36.3 1.1E+02 0.0023 20.2 4.6 38 1-39 155-192 (222)
27 PF13798 PCYCGC: Protein of un 36.0 29 0.00064 22.5 1.8 23 4-30 134-156 (158)
28 PF10835 DUF2573: Protein of u 35.5 66 0.0014 18.6 3.0 21 2-23 57-77 (82)
29 cd01389 MATA_HMG-box MATA_HMG- 35.5 27 0.00058 19.1 1.4 17 5-21 30-46 (77)
30 PF12357 PLD_C: Phospholipase 34.4 52 0.0011 18.7 2.4 22 2-23 12-33 (74)
31 PF05341 DUF708: Protein of un 34.2 55 0.0012 19.9 2.7 35 12-49 20-54 (105)
32 PF12910 RelB_N: Antitoxin of 33.9 37 0.0008 17.1 1.7 16 2-17 8-23 (46)
33 PF01132 EFP: Elongation facto 32.6 2.7 5.9E-05 22.2 -2.9 16 42-57 2-17 (55)
34 PF10305 Fmp27_SW: RNA pol II 30.9 57 0.0012 19.3 2.4 19 2-20 1-19 (103)
35 PF14394 DUF4423: Domain of un 30.8 63 0.0014 20.8 2.7 24 1-24 131-154 (171)
36 PF01023 S_100: S-100/ICaBP ty 30.5 76 0.0016 15.9 2.5 18 50-68 19-36 (44)
37 PF13864 Enkurin: Calmodulin-b 30.4 83 0.0018 18.2 3.0 23 2-24 40-62 (98)
38 TIGR00965 dapD 2,3,4,5-tetrahy 30.1 1.3E+02 0.0028 21.1 4.3 31 6-39 1-31 (269)
39 TIGR00706 SppA_dom signal pept 30.1 1.5E+02 0.0033 19.2 4.5 37 1-38 137-173 (207)
40 cd07978 TAF13 The TATA Binding 30.0 61 0.0013 18.9 2.3 26 44-71 9-34 (92)
41 PF14799 FAM195: FAM195 family 29.6 86 0.0019 18.6 3.0 21 3-23 33-53 (99)
42 PF12807 eIF3_p135: Translatio 29.4 1.4E+02 0.003 18.9 4.1 19 7-25 46-64 (169)
43 cd01388 SOX-TCF_HMG-box SOX-TC 29.3 40 0.00087 18.1 1.4 17 5-21 30-46 (72)
44 cd07019 S49_SppA_1 Signal pept 29.0 1.2E+02 0.0025 19.8 3.8 35 2-37 146-180 (211)
45 PF07553 Lipoprotein_Ltp: Host 28.8 32 0.0007 17.8 0.9 12 45-56 27-38 (48)
46 PF08494 DEAD_assoc: DEAD/H as 28.8 1.7E+02 0.0036 18.9 5.7 42 12-57 80-121 (187)
47 PF09011 HMG_box_2: HMG-box do 27.8 68 0.0015 17.2 2.2 17 5-21 33-49 (73)
48 cd01390 HMGB-UBF_HMG-box HMGB- 27.8 52 0.0011 16.8 1.7 18 4-21 28-45 (66)
49 PF01713 Smr: Smr domain; Int 27.6 59 0.0013 17.9 1.9 22 1-22 9-30 (83)
50 PRK11830 dapD 2,3,4,5-tetrahyd 26.9 2E+02 0.0044 20.1 4.8 31 6-39 3-33 (272)
51 PF13904 DUF4207: Domain of un 26.9 17 0.00038 25.0 -0.4 14 62-75 250-263 (264)
52 cd00687 Terpene_cyclase_nonpla 26.8 75 0.0016 21.5 2.7 22 1-22 244-265 (303)
53 PHA02718 hypothetical protein; 26.7 21 0.00045 19.4 -0.0 28 41-72 28-55 (69)
54 KOG0027 Calmodulin and related 26.7 1.2E+02 0.0026 18.5 3.4 35 37-73 7-41 (151)
55 smart00463 SMR Small MutS-rela 26.5 66 0.0014 17.5 2.0 19 1-19 12-30 (80)
56 PF14555 UBA_4: UBA-like domai 26.3 44 0.00095 16.4 1.1 18 36-53 25-42 (43)
57 COG3351 FlaD Putative archaeal 26.1 63 0.0014 22.0 2.1 41 29-71 140-180 (214)
58 PHA02896 A-type inclusion like 25.8 81 0.0018 24.5 2.9 39 2-45 10-49 (616)
59 PF15581 Imm35: Immunity prote 25.2 37 0.00081 20.1 0.8 17 5-21 33-49 (93)
60 PRK01371 sec-independent trans 24.9 1.2E+02 0.0027 19.1 3.2 20 3-22 35-54 (137)
61 KOG1766 Enhancer of rudimentar 24.8 1.7E+02 0.0036 17.6 4.1 54 2-57 27-81 (104)
62 smart00398 HMG high mobility g 24.6 56 0.0012 16.7 1.4 17 5-21 30-46 (70)
63 smart00872 Alpha-mann_mid Alph 24.5 1.1E+02 0.0024 16.7 2.7 23 29-55 42-64 (79)
64 PF09184 PPP4R2: PPP4R2; Inte 24.5 1.4E+02 0.0031 20.9 3.8 19 5-23 42-60 (288)
65 KOG2204 Mannosyl-oligosacchari 24.4 69 0.0015 25.1 2.3 22 2-23 174-195 (625)
66 PF05002 SGS: SGS domain ; In 23.9 1.2E+02 0.0025 17.5 2.7 25 40-71 26-50 (82)
67 PF02512 UK: Virulence determi 23.7 44 0.00096 19.5 0.9 21 9-34 50-70 (96)
68 PF03564 DUF1759: Protein of u 23.5 1.2E+02 0.0025 18.3 2.9 14 12-25 57-70 (145)
69 PHA02935 Hypothetical protein; 23.5 1.1E+02 0.0024 21.2 3.0 49 8-57 256-307 (349)
70 cd00084 HMG-box High Mobility 23.4 72 0.0016 16.1 1.7 16 5-20 29-44 (66)
71 PF14684 Tricorn_C1: Tricorn p 23.4 1.2E+02 0.0026 16.4 2.7 20 5-24 2-21 (70)
72 PF00505 HMG_box: HMG (high mo 23.4 68 0.0015 16.6 1.6 18 4-21 28-45 (69)
73 PTZ00195 60S ribosomal protein 21.8 82 0.0018 21.2 2.0 37 14-57 44-80 (198)
74 PF08153 NGP1NT: NGP1NT (NUC09 21.7 38 0.00083 21.3 0.4 35 2-37 47-81 (130)
75 cd07023 S49_Sppa_N_C Signal pe 21.6 2E+02 0.0044 18.5 3.9 38 1-39 142-179 (208)
76 PF12763 EF-hand_4: Cytoskelet 21.5 89 0.0019 18.5 2.0 20 21-41 33-52 (104)
77 cd05027 S-100B S-100B: S-100B 21.1 1.7E+02 0.0038 16.5 3.4 32 36-68 6-38 (88)
78 PF10859 DUF2660: Protein of u 20.7 99 0.0021 17.9 2.0 13 7-19 40-52 (86)
79 PRK14857 tatA twin arginine tr 20.5 1.9E+02 0.0042 16.9 3.2 22 3-24 37-58 (90)
80 PF00036 EF-hand_1: EF hand; 20.4 1.1E+02 0.0023 13.7 1.9 26 42-69 4-29 (29)
81 cd07022 S49_Sppa_36K_type Sign 20.4 2.6E+02 0.0056 18.2 4.3 31 1-32 149-179 (214)
82 PHA03369 capsid maturational p 20.3 2E+02 0.0043 22.8 4.0 23 31-53 299-321 (663)
83 COG2900 SlyX Uncharacterized p 20.3 1.6E+02 0.0034 16.7 2.7 21 1-21 21-41 (72)
No 1
>PLN02150 terpene synthase/cyclase family protein
Probab=100.00 E-value=4.5e-33 Score=166.29 Aligned_cols=73 Identities=21% Similarity=0.305 Sum_probs=70.3
Q ss_pred ChHHHHHHHHHHHHHHHHHHHhhcCCCCCcHHHHHHHHHHHHHHHHh-hccCCCCCCCchhhHHHHHHhhccccCC
Q 037321 1 REAREHIHDLIAQTWMKMNRDRFVNPHFVSDVFVGIAMNLARMSQCM-YQFGEGHGHGVQEITKARVLSLIVDPIA 75 (75)
Q Consensus 1 EeA~~~i~~~i~~~wK~ln~e~l~~~~~~p~~~~~~~~NlaR~~~~~-Y~~~Dg~t~~~~~~~k~~I~sll~~Pi~ 75 (75)
|||+++|++||+++||+||+|++++++ +|++++++++|+||+++++ |++|||||.+++. +|++|.+||++||+
T Consensus 22 eeA~~~i~~li~~~WK~iN~e~l~~~~-~p~~~~~~~~NlaR~~~~~~Y~~~Dg~t~~~~~-~K~~I~sLlv~pi~ 95 (96)
T PLN02150 22 EEAVSELKKMIRDNYKIVMEEFLTIKD-VPRPVLVRCLNLARLIDVYCYNEGDGFTYPHGK-LKDLITSLFFHPLP 95 (96)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHcCCCC-CCHHHHHHHHHHHHHHHhheecCCCCCCCCcHH-HHHHHHHHhccCCC
Confidence 689999999999999999999999888 9999999999999999999 9999999988777 99999999999986
No 2
>cd00684 Terpene_cyclase_plant_C1 Plant Terpene Cyclases, Class 1. This CD includes a diverse group of monomeric plant terpene cyclases (Tspa-Tspf) that convert the acyclic isoprenoid diphosphates, geranyl diphosphate (GPP), farnesyl diphosphate (FPP), or geranylgeranyl diphosphate (GGPP) into cyclic monoterpenes, diterpenes, or sesquiterpenes, respectively; a few form acyclic species. Terpnoid cyclases are soluble enzymes localized to the cytosol (sesquiterpene synthases) or plastids (mono- and diterpene synthases). All monoterpene and diterpene synthases have restrict substrate specificity, however, some sesquiterpene synthases can accept both FPP and GPP. The catalytic site consists of a large central cavity formed by mostly antiparallel alpha helices with two aspartate-rich regions located on opposite walls. These residues mediate binding of prenyl diphosphates, via bridging Mg2+ ions (K+ preferred by gymnosperm cyclases), inducing conformational changes such that an N-terminal regi
Probab=99.87 E-value=5e-22 Score=146.02 Aligned_cols=71 Identities=31% Similarity=0.535 Sum_probs=68.1
Q ss_pred ChHHHHHHHHHHHHHHHHHHHhhcC-CCCCcHHHHHHHHHHHHHHHHhhccCCCCCCCchhhHHHHHHhhcccc
Q 037321 1 REAREHIHDLIAQTWMKMNRDRFVN-PHFVSDVFVGIAMNLARMSQCMYQFGEGHGHGVQEITKARVLSLIVDP 73 (75)
Q Consensus 1 EeA~~~i~~~i~~~wK~ln~e~l~~-~~~~p~~~~~~~~NlaR~~~~~Y~~~Dg~t~~~~~~~k~~I~sll~~P 73 (75)
|||+++++++|+++||+||++++++ ++ +|++|+++++|+||+++++|+++||||.|++. +|++|++||++|
T Consensus 471 eeA~~~i~~~ie~~wk~ln~e~l~~~~~-~p~~~~~~~~n~~r~~~~~Y~~~D~~t~~~~~-~~~~i~~ll~~p 542 (542)
T cd00684 471 EEAREEIKKMIEDAWKELNEEFLKPSSD-VPRPIKQRFLNLARVIDVFYKEGDGFTHPEGE-IKDHITSLLFEP 542 (542)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCCC-CCHHHHHHHHHHHHHHHHHhcCCCCCCCccHH-HHHHHHHHhcCC
Confidence 6899999999999999999999987 77 99999999999999999999999999999777 999999999998
No 3
>PLN02279 ent-kaur-16-ene synthase
Probab=99.84 E-value=4.2e-21 Score=146.01 Aligned_cols=71 Identities=18% Similarity=0.292 Sum_probs=66.5
Q ss_pred ChHHHHHHHHHHHHHHHHHHHhhcC--CCCCcHHHHHHHHHHHHHHHHhhccCCCCCCCchhhHHHHHHhhccccCC
Q 037321 1 REAREHIHDLIAQTWMKMNRDRFVN--PHFVSDVFVGIAMNLARMSQCMYQFGEGHGHGVQEITKARVLSLIVDPIA 75 (75)
Q Consensus 1 EeA~~~i~~~i~~~wK~ln~e~l~~--~~~~p~~~~~~~~NlaR~~~~~Y~~~Dg~t~~~~~~~k~~I~sll~~Pi~ 75 (75)
|||+++++++|+++||+||++++++ +. +|++|+++++|++|++++||++|||||.+ . +|++|++||++||+
T Consensus 705 EEAi~~i~~~Ie~~wKeLn~~~l~~~~~~-vp~~~~~~~ln~aR~~~~~Y~~~Dgyt~~--~-~k~~i~~ll~ePi~ 777 (784)
T PLN02279 705 EEAIESMKGLIESQRRELLRLVLQEKGSN-VPRECKDLFWKMSKVLHLFYRKDDGFTSN--D-MMSLVKSVIYEPVS 777 (784)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHhccCCCC-CCHHHHHHHHHHHHhhhhheeCCCCCChH--H-HHHHHHHHhccCCc
Confidence 6899999999999999999999963 46 99999999999999999999999999963 6 99999999999986
No 4
>PLN02592 ent-copalyl diphosphate synthase
Probab=99.06 E-value=5.7e-10 Score=85.86 Aligned_cols=67 Identities=9% Similarity=0.124 Sum_probs=59.5
Q ss_pred ChHHHHHHHHHHHHHHHHHHHhhc--CCCCCcHHHHHHHHHHHHHHHHhhccCCCCCCCchhhHHHHHHhhccccCC
Q 037321 1 REAREHIHDLIAQTWMKMNRDRFV--NPHFVSDVFVGIAMNLARMSQCMYQFGEGHGHGVQEITKARVLSLIVDPIA 75 (75)
Q Consensus 1 EeA~~~i~~~i~~~wK~ln~e~l~--~~~~~p~~~~~~~~NlaR~~~~~Y~~~Dg~t~~~~~~~k~~I~sll~~Pi~ 75 (75)
+||.+++++.|+..+++|.+.+++ ++. +|++|++.+++++| +||.. ||+.| ++ ++++|..++++|++
T Consensus 732 ~~a~~~~~~~ie~~~~eL~~lvl~~~~~~-vp~~cK~~f~~~~k---~fy~~--~~~~~-~~-~~~~i~~vl~epv~ 800 (800)
T PLN02592 732 PEEKSKTTPSIESDMQELVQLVLQNSSDD-IDPVIKQTFLMVAK---SFYYA--AYCDP-GT-INYHIAKVLFERVA 800 (800)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhcCCCC-CCHHHHHHHHHHHH---HHHHh--hcCCH-HH-HHHHHHHHhCCCCC
Confidence 379999999999999999999996 245 99999999999999 55655 99998 56 99999999999985
No 5
>cd00868 Terpene_cyclase_C1 Terpene cyclases, Class 1. Terpene cyclases, Class 1 (C1) of the class 1 family of isoprenoid biosynthesis enzymes, which share the 'isoprenoid synthase fold' and convert linear, all-trans, isoprenoids, geranyl (C10)-, farnesyl (C15)-, or geranylgeranyl (C20)-diphosphate into numerous cyclic forms of monoterpenes, diterpenes, and sesquiterpenes. Also included in this CD are the cis-trans terpene cyclases such as trichodiene synthase. The class I terpene cyclization reactions proceed via electrophilic alkylations in which a new carbon-carbon single bond is generated through interaction between a highly reactive electron-deficient allylic carbocation and an electron-rich carbon-carbon double bond. The catalytic site consists of a large central cavity formed by mostly antiparallel alpha helices with two aspartate-rich regions located on opposite walls. These residues mediate binding of prenyl phosphates via bridging Mg2+ ions, inducing proposed conformational ch
Probab=96.76 E-value=0.0052 Score=41.15 Aligned_cols=47 Identities=26% Similarity=0.437 Sum_probs=41.3
Q ss_pred ChHHHHHHHHHHHHHHHHHHHhhcC-CCCCcHHHHHHHHHHHHHHHHhh
Q 037321 1 REAREHIHDLIAQTWMKMNRDRFVN-PHFVSDVFVGIAMNLARMSQCMY 48 (75)
Q Consensus 1 EeA~~~i~~~i~~~wK~ln~e~l~~-~~~~p~~~~~~~~NlaR~~~~~Y 48 (75)
+||++++.+++++.|+++++..... ++ .++.+++.+.+..|+....|
T Consensus 237 ~eA~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~l~~~~~g~~~w~ 284 (284)
T cd00868 237 EEALEELRKMIEEAWKELNEEVLKLSSD-VPRAVLETLLNLARGIYVWY 284 (284)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCCC-CCHHHHHHHHHHHHhhhhcC
Confidence 5899999999999999999998853 35 78999999999999987765
No 6
>cd00385 Isoprenoid_Biosyn_C1 Isoprenoid Biosynthesis enzymes, Class 1. Superfamily of trans-isoprenyl diphosphate synthases (IPPS) and class I terpene cyclases which either synthesis geranyl/farnesyl diphosphates (GPP/FPP) or longer chained products from isoprene precursors, isopentenyl diphosphate (IPP) and dimethylallyl diphosphate (DMAPP), or use geranyl (C10)-, farnesyl (C15)-, or geranylgeranyl (C20)-diphosphate as substrate. These enzymes produce a myriad of precursors for such end products as steroids, cholesterol, sesquiterpenes, heme, carotenoids, retinoids, and diterpenes; and are widely distributed among archaea, bacteria, and eukaryota.The enzymes in this superfamily share the same 'isoprenoid synthase fold' and include several subgroups. The head-to-tail (HT) IPPS catalyze the successive 1'-4 condensation of the 5-carbon IPP to the growing isoprene chain to form linear, all-trans, C10-, C15-, C20- C25-, C30-, C35-, C40-, C45-, or C50-isoprenoid diphosphates. Cyclic monoter
Probab=89.93 E-value=0.33 Score=30.55 Aligned_cols=41 Identities=27% Similarity=0.392 Sum_probs=31.3
Q ss_pred ChHHHHHHHHHHHHHHHHHHHhhcCCCCCcHHHHHHHHHHHH
Q 037321 1 REAREHIHDLIAQTWMKMNRDRFVNPHFVSDVFVGIAMNLAR 42 (75)
Q Consensus 1 EeA~~~i~~~i~~~wK~ln~e~l~~~~~~p~~~~~~~~NlaR 42 (75)
++|.+++..++++.|+++++.....+. .++.+++.+.++.|
T Consensus 203 ~~a~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~ 243 (243)
T cd00385 203 EEALEELAKLAEEALKELNELILSLPD-VPRALLALALNLYR 243 (243)
T ss_pred HHHHHHHHHHHHHHHHHHhcCCCCcHH-HHHHHHHHHHHHhC
Confidence 478999999999999999987664323 56777777777653
No 7
>PF14805 THDPS_N_2: Tetrahydrodipicolinate N-succinyltransferase N-terminal; PDB: 3EG4_A 3TDT_A 2TDT_A 1KGT_A 1TDT_A 1KGQ_A 3BXY_A 3GOS_A 3TK8_A.
Probab=73.24 E-value=14 Score=20.70 Aligned_cols=35 Identities=11% Similarity=0.347 Sum_probs=22.9
Q ss_pred HHHHHHHHHHHHHHHHHhhcCCCCCcHHHHHHHHHHHH
Q 037321 5 EHIHDLIAQTWMKMNRDRFVNPHFVSDVFVGIAMNLAR 42 (75)
Q Consensus 5 ~~i~~~i~~~wK~ln~e~l~~~~~~p~~~~~~~~NlaR 42 (75)
+.++..|+++|.. ++-++++. .+...+..+..+-.
T Consensus 1 ~~l~~~Ie~aw~~--r~~l~~~~-~~~~~~~av~~~i~ 35 (70)
T PF14805_consen 1 SQLQKIIEAAWEN--RDELTPSN-ADPELRDAVEEVIE 35 (70)
T ss_dssp HHHHHHHHHHHHG--GGG-BTTT---HHHHHHHHHHHH
T ss_pred ChHHHHHHHHHHh--HhhCCCcc-CCHHHHHHHHHHHH
Confidence 4688999999987 77776655 56666666655543
No 8
>PF10540 Membr_traf_MHD: Munc13 (mammalian uncoordinated) homology domain; InterPro: IPR019558 Mammalian uncoordinated homology 13 (Munc13) proteins constitute a family of three highly homologous molecules (Munc13-1, Munc13-2 and Munc13-3) with homology to Caenorhabditis elegans unc-13p. Munc13 proteins contain a phorbol ester-binding C1 domain and two C2 domains, which are Ca2+/phospholipid binding domains. Sequence analyses have uncovered two regions called Munc13 homology domains 1 (MHD1) and 2 (MHD2) that are arranged between two flanking C2 domains. MHD1 and MHD2 domains are present in a wide variety of proteins from Arabidopsis thaliana (Mouse-ear cress), C. elegans, Drosophila melanogaster (Fruit fly), Mus musculus (Mouse), Rattus norvegicus (Rat) and Homo sapiens (Human), some of which may function in a Munc13-like manner to regulate membrane trafficking. The MHD1 and MHD2 domains are predicted to be alpha-helical. ; PDB: 3SWH_A.
Probab=63.38 E-value=33 Score=21.24 Aligned_cols=51 Identities=10% Similarity=0.160 Sum_probs=28.7
Q ss_pred HHHHHHHHHHHHHHHHHh---h-cC---------------------------CCCCcHHHHHHHHHHHHHHHHhhccCCC
Q 037321 5 EHIHDLIAQTWMKMNRDR---F-VN---------------------------PHFVSDVFVGIAMNLARMSQCMYQFGEG 53 (75)
Q Consensus 5 ~~i~~~i~~~wK~ln~e~---l-~~---------------------------~~~~p~~~~~~~~NlaR~~~~~Y~~~Dg 53 (75)
+-+++++.+.|+.+-... + -| .+.-|+.|.....=+--+.+|||.+|+|
T Consensus 25 ~~f~~vl~~lW~~vl~~l~~llvlP~ls~~~~~~~~~~~~~~~~~~~~~~~~~~Lt~~q~~~l~~~L~~L~~FFhA~G~G 104 (137)
T PF10540_consen 25 ENFKRVLKELWKVVLETLEELLVLPPLSDKPMLGLLQSAVSSLSSHGIGGSQRPLTPKQCDRLFKWLDTLKDFFHAEGNG 104 (137)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHTTS-G------------GG-TTS-------------TCHHHHHHHHHHHHHHHHCCCTS
T ss_pred HHHHHHHHHHHHHHHHHHHHHHcCCCcchhHHHHHHHHHHHHHHhhcccccCCCCCHHHHHHHHHHHHHHHHHHhCCCCC
Confidence 346788889999887553 4 11 1101333444444444457999998888
Q ss_pred CC
Q 037321 54 HG 55 (75)
Q Consensus 54 ~t 55 (75)
-.
T Consensus 105 l~ 106 (137)
T PF10540_consen 105 LP 106 (137)
T ss_dssp --
T ss_pred CC
Confidence 64
No 9
>PF09504 RE_Bsp6I: Bsp6I restriction endonuclease; InterPro: IPR019037 There are four classes of restriction endonucleases: types I, II,III and IV. All types of enzymes recognise specific short DNA sequences and carry out the endonucleolytic cleavage of DNA to give specific double-stranded fragments with terminal 5'-phosphates. They differ in their recognition sequence, subunit composition, cleavage position, and cofactor requirements [, ], as summarised below: Type I enzymes (3.1.21.3 from EC) cleave at sites remote from recognition site; require both ATP and S-adenosyl-L-methionine to function; multifunctional protein with both restriction and methylase (2.1.1.72 from EC) activities. Type II enzymes (3.1.21.4 from EC) cleave within or at short specific distances from recognition site; most require magnesium; single function (restriction) enzymes independent of methylase. Type III enzymes (3.1.21.5 from EC) cleave at sites a short distance from recognition site; require ATP (but doesn't hydrolyse it); S-adenosyl-L-methionine stimulates reaction but is not required; exists as part of a complex with a modification methylase methylase (2.1.1.72 from EC). Type IV enzymes target methylated DNA. Type II restriction endonucleases (3.1.21.4 from EC) are components of prokaryotic DNA restriction-modification mechanisms that protect the organism against invading foreign DNA. These site-specific deoxyribonucleases catalyse the endonucleolytic cleavage of DNA to give specific double-stranded fragments with terminal 5'-phosphates. Of the 3000 restriction endonucleases that have been characterised, most are homodimeric or tetrameric enzymes that cleave target DNA at sequence-specific sites close to the recognition site. For homodimeric enzymes, the recognition site is usually a palindromic sequence 4-8 bp in length. Most enzymes require magnesium ions as a cofactor for catalysis. Although they can vary in their mode of recognition, many restriction endonucleases share a similar structural core comprising four beta-strands and one alpha-helix, as well as a similar mechanism of cleavage, suggesting a common ancestral origin []. However, there is still considerable diversity amongst restriction endonucleases [, ]. The target site recognition process triggers large conformational changes of the enzyme and the target DNA, leading to the activation of the catalytic centres. Like other DNA binding proteins, restriction enzymes are capable of non-specific DNA binding as well, which is the prerequisite for efficient target site location by facilitated diffusion. Non-specific binding usually does not involve interactions with the bases but only with the DNA backbone []. This entry includes the restriction endonuclease Bsp6I, which recognises and cleaves the double-stranded sequence GC^NGC.
Probab=61.54 E-value=23 Score=23.33 Aligned_cols=41 Identities=20% Similarity=0.229 Sum_probs=26.3
Q ss_pred HHHHHHHHHhhcC----CCCCcHHHHHHHHHHHHHHHHhhccCCCCCC
Q 037321 13 QTWMKMNRDRFVN----PHFVSDVFVGIAMNLARMSQCMYQFGEGHGH 56 (75)
Q Consensus 13 ~~wK~ln~e~l~~----~~~~p~~~~~~~~NlaR~~~~~Y~~~Dg~t~ 56 (75)
-.||+||...... -. +|.++-+..+=.+ ..+.-..|+|++.
T Consensus 24 ~~WK~LN~~IKn~~~RGiN-~pd~fSE~l~Cy~--~~y~~~~G~~~~D 68 (180)
T PF09504_consen 24 FKWKDLNNYIKNIYSRGIN-FPDVFSEPLVCYC--NGYYRNRGTGSGD 68 (180)
T ss_pred HHHHHHHHHhhhccCCCcc-chHHHHHHHHHHH--HhhhhhcCCcccc
Confidence 4699999986642 24 8888777766554 4444456666543
No 10
>PF06086 Pox_A30L_A26L: Orthopoxvirus A26L/A30L protein; InterPro: IPR009285 This family consists of several Orthopoxvirus A26L and A30L proteins. The Vaccinia A30L gene is regulated by a late promoter and encodes a protein of approximately 9 kDa. It is thought that the A30L protein is needed for vaccinia virus morphogenesis, specifically the association of the dense viroplasm with viral membranes [].
Probab=59.33 E-value=21 Score=24.42 Aligned_cols=46 Identities=17% Similarity=0.125 Sum_probs=35.2
Q ss_pred HHHHHHHHHHHHHHHHHHhhcCCCCCcHHHHHHHHHHHHHHHHhhccCCC
Q 037321 4 REHIHDLIAQTWMKMNRDRFVNPHFVSDVFVGIAMNLARMSQCMYQFGEG 53 (75)
Q Consensus 4 ~~~i~~~i~~~wK~ln~e~l~~~~~~p~~~~~~~~NlaR~~~~~Y~~~Dg 53 (75)
+..++.||++.|..-..+ .+. +|+.-.+++=|+-|--..-+..-||
T Consensus 23 I~~F~slV~~~W~~~L~~---~sC-I~R~~RKiIRnViR~ym~s~p~~d~ 68 (220)
T PF06086_consen 23 IREFKSLVSETWNKPLNE---NSC-IPREDRKIIRNVIREYMKSAPKMDE 68 (220)
T ss_pred HHHHHHHHHHhcCCcccc---ccc-cCHHHHHHHHHHHHHHHHhCCCCCc
Confidence 678999999999753332 567 9999999999999976665654444
No 11
>COG4046 Uncharacterized protein conserved in archaea [Function unknown]
Probab=58.52 E-value=34 Score=24.90 Aligned_cols=53 Identities=13% Similarity=0.124 Sum_probs=40.7
Q ss_pred HHHHHHHHHHHHHHHHHHhhc-CCCCCcHHHHHHHHHHHHHHHHhhccCCCCCCC
Q 037321 4 REHIHDLIAQTWMKMNRDRFV-NPHFVSDVFVGIAMNLARMSQCMYQFGEGHGHG 57 (75)
Q Consensus 4 ~~~i~~~i~~~wK~ln~e~l~-~~~~~p~~~~~~~~NlaR~~~~~Y~~~Dg~t~~ 57 (75)
.+|+-++=++..+++-+-+.. -+. +.++=+..++..++.++++||+-+-|-..
T Consensus 101 ~~~Ll~~~~dr~~~~v~r~~p~~D~-v~r~~l~~a~ev~~~m~~I~K~vrH~l~~ 154 (368)
T COG4046 101 LRHLLEMGEDRFRKLVRRLVPAADG-VRRSNLEMALEVAASMNFIYKYVRHYLLL 154 (368)
T ss_pred HHHHHHhhhHHHHHHHHHcCCcccc-eehhhHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 356666667777777776663 255 88999999999999999999988776543
No 12
>PF07431 DUF1512: Protein of unknown function (DUF1512); InterPro: IPR009995 This family consists of several archaeal proteins of around 370 residues in length. The function of this family is unknown.
Probab=55.74 E-value=40 Score=24.64 Aligned_cols=53 Identities=13% Similarity=0.038 Sum_probs=40.8
Q ss_pred HHHHHHHHHHHHHHHHHHhhc-CCCCCcHHHHHHHHHHHHHHHHhhccCCCCCCC
Q 037321 4 REHIHDLIAQTWMKMNRDRFV-NPHFVSDVFVGIAMNLARMSQCMYQFGEGHGHG 57 (75)
Q Consensus 4 ~~~i~~~i~~~wK~ln~e~l~-~~~~~p~~~~~~~~NlaR~~~~~Y~~~Dg~t~~ 57 (75)
.+|+-+.-++.+|++-+..+. .+. +-++-...++..+|.++++||--+-|...
T Consensus 88 ~~hll~t~e~~~~~~v~~~~p~~d~-~~~s~~~~~lei~~~ln~IyKvvrHy~~l 141 (355)
T PF07431_consen 88 LKHLLRTYEDRFEKEVKRLAPNADE-VEVSNIETALEIARSLNMIYKVVRHYLLL 141 (355)
T ss_pred HHHHHHhhHHHHHHHHHHhCCCCCH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 356667777888888887663 244 67788899999999999999887776644
No 13
>PF08285 DPM3: Dolichol-phosphate mannosyltransferase subunit 3 (DPM3); InterPro: IPR013174 This family corresponds to subunit 3 of dolichol-phosphate mannosyltransferase, an enzyme which generates mannosyl donors for glycosylphosphatidylinositols, N-glycan and protein O- and C-mannosylation. DPM3 is an integral membrane protein and plays a role in stabilising the dolichol-phosphate mannosyl transferase complex [].
Probab=54.35 E-value=16 Score=21.41 Aligned_cols=20 Identities=15% Similarity=0.265 Sum_probs=17.6
Q ss_pred ChHHHHHHHHHHHHHHHHHH
Q 037321 1 REAREHIHDLIAQTWMKMNR 20 (75)
Q Consensus 1 EeA~~~i~~~i~~~wK~ln~ 20 (75)
+||.+++++.|++|-++|-+
T Consensus 67 peA~~eL~~eI~eAK~dLr~ 86 (91)
T PF08285_consen 67 PEAAKELQKEIKEAKADLRK 86 (91)
T ss_pred HHHHHHHHHHHHHHHHHHHH
Confidence 58999999999999988765
No 14
>PF07637 PSD5: Protein of unknown function (DUF1595); InterPro: IPR013043 A region of similarity shared by several Rhodopirellula baltica cytochrome-like proteins that are predicted to be secreted. These proteins also contain IPR011478 from INTERPRO, IPR013036 from INTERPRO, IPR013039 from INTERPRO and IPR013042 from INTERPRO.
Probab=52.82 E-value=36 Score=18.30 Aligned_cols=56 Identities=18% Similarity=0.240 Sum_probs=37.2
Q ss_pred ChHHHHHHHHHHHHHHHHHHHhhcCCCCCcHHHHHHHHHHHHHHHHhhccCCCCCCCchhhHHHHHHhhcccc
Q 037321 1 REAREHIHDLIAQTWMKMNRDRFVNPHFVSDVFVGIAMNLARMSQCMYQFGEGHGHGVQEITKARVLSLIVDP 73 (75)
Q Consensus 1 EeA~~~i~~~i~~~wK~ln~e~l~~~~~~p~~~~~~~~NlaR~~~~~Y~~~Dg~t~~~~~~~k~~I~sll~~P 73 (75)
+.|++-|+.....+|+. + ++..=++..+.+..- -+..|++|. .. ++.-+..+|..|
T Consensus 1 ~~a~~~l~~Fa~rAfRR---------p-~~~~e~~~~~~~~~~---~~~~g~~~~---~a-~~~~l~aiL~SP 56 (64)
T PF07637_consen 1 ACAREILRRFARRAFRR---------P-LTDEEVDRYLALYDS---ARAQGEDFE---EA-LKEALQAILCSP 56 (64)
T ss_pred ChHHHHHHHHHHHHhCC---------C-CCHHHHHHHHHHHHH---HHHcCCCHH---HH-HHHHHHHHHcCc
Confidence 35778888888888854 3 555546666665432 355566654 34 899999998877
No 15
>PF01484 Col_cuticle_N: Nematode cuticle collagen N-terminal domain; InterPro: IPR002486 The function of this domain is unknown. It is found in the N-terminal region of nematode cuticle collagens (see IPR008160 from INTERPRO). Cuticle is a tough elastic structure secreted by hypodermal cells and is primarily composed of collagen proteins [, ].; GO: 0042302 structural constituent of cuticle
Probab=47.68 E-value=15 Score=18.52 Aligned_cols=14 Identities=21% Similarity=0.667 Sum_probs=10.1
Q ss_pred HHHHHHHHHHHHHH
Q 037321 5 EHIHDLIAQTWMKM 18 (75)
Q Consensus 5 ~~i~~~i~~~wK~l 18 (75)
+..|...+++|++|
T Consensus 40 ~~fk~~s~d~W~~m 53 (53)
T PF01484_consen 40 EEFKEISDDAWNEM 53 (53)
T ss_pred HHHHHHHHHHHhhC
Confidence 46677778888764
No 16
>cd05026 S-100Z S-100Z: S-100Z domain found in proteins similar to S100Z. S100Z is a member of the S100 domain family within the EF-hand Ca2+-binding proteins superfamily. Note that the S-100 hierarchy, to which this S-100Z group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately.S100 proteins exhibit unique patterns of tissue- and cell type-specific expression and have been implicated in the Ca2+-dependent regulation of diverse physiological processes, including cell cycle regulation, differentiation, growth, and metabolic control. S100Z is normally expressed in various tissues, with its highest level of expression being in spleen and leukocytes. The function of S100Z remains unclear. Preliminary structural data suggests that S100Z is homodimer, however a heterodimer with S100P has been reported. S100Z is capable of binding calcium ions. When calcium binds to S110Z, the protein experiences a conformational change, which exposes hydrophobic surfac
Probab=46.38 E-value=49 Score=18.83 Aligned_cols=37 Identities=16% Similarity=0.047 Sum_probs=21.6
Q ss_pred HHHHHHHHHHHHHHHhh-ccCCCCCCCchhhHHHHHHhh
Q 037321 32 VFVGIAMNLARMSQCMY-QFGEGHGHGVQEITKARVLSL 69 (75)
Q Consensus 32 ~~~~~~~NlaR~~~~~Y-~~~Dg~t~~~~~~~k~~I~sl 69 (75)
.+=..+-.+.++.+.+= ++|||++.+..+ ++..+.+.
T Consensus 4 ~le~a~~~~~~~F~~~dd~dgdg~~Is~~E-L~~ll~~~ 41 (93)
T cd05026 4 QLEGAMDTLIRIFHNYSGKEGDRYKLSKGE-LKELLQRE 41 (93)
T ss_pred HHHHHHHHHHHHHHHHHccCCCCCEECHHH-HHHHHHHH
Confidence 34344455555555532 357777666556 88888664
No 17
>PF11707 Npa1: Ribosome 60S biogenesis N-terminal; InterPro: IPR021714 Npa1p is required for ribosome biogenesis and operates in the same functional environment as Rsa3p and Dbp6p during early maturation of 60S ribosomal subunits []. The protein partners of Npa1p include eight putative helicases as well as the novel Npa2p factor. Npa1p can also associate with a subset of H/ACA and C/D small nucleolar RNPs (snoRNPs) involved in the chemical modification of residues in the vicinity of the peptidyl transferase centre []. The protein has also been referred to as Urb1, and this domain at the N-terminal is one of several conserved regions along the length.
Probab=42.88 E-value=82 Score=22.14 Aligned_cols=58 Identities=14% Similarity=0.154 Sum_probs=35.0
Q ss_pred HHHHHHHHHHHHHHHHHhhcCCCCCcHHHHHHHHH---HHHHHHHhhccCCCCCCCchhhHHHHHHhhcc
Q 037321 5 EHIHDLIAQTWMKMNRDRFVNPHFVSDVFVGIAMN---LARMSQCMYQFGEGHGHGVQEITKARVLSLIV 71 (75)
Q Consensus 5 ~~i~~~i~~~wK~ln~e~l~~~~~~p~~~~~~~~N---laR~~~~~Y~~~Dg~t~~~~~~~k~~I~sll~ 71 (75)
+-|..+++--|+.+ +.+.. +|+..+-.+|| +.+++. +|...++ ..... +.+.+..+|.
T Consensus 173 ~~v~~iL~~l~~~V----l~~~~-v~r~~K~~~fn~~~L~~l~~-Ly~~~~~--~~~~~-~~~~vh~fL~ 233 (330)
T PF11707_consen 173 ETVILILETLKDKV----LKDSS-VSRSTKCKLFNEWTLSQLAS-LYSRDGE--DEKSS-VADLVHEFLL 233 (330)
T ss_pred HHHHHHHHHHHHHh----ccCCC-CChhhhhhhcCHHHHHHHHH-HhcccCC--cccch-HHHHHHHHHH
Confidence 34445555555333 34556 99999999998 677777 5665555 22234 6666665543
No 18
>PF11989 Dsl1_C: Retrograde transport protein Dsl1 C terminal; InterPro: IPR021876 Dsl1 is a peripheral membrane protein required for transport between the Golgi and the endoplasmic reticulum []. It is localised to the ER membrane, and in vitro it specifically binds to coatomer, the major component of the protein coat of COPI vesicles []. Binding sites for coatomer are found on a disorganised region between the C and N termini of Dsl1 []. The C-terminal domain is involved in binding to the Sec39 subunit of the Dsl1p complex []. The N-terminal complexes with another subunit of the Dsl1p complex called Tip20 which forms heterodimers by pairing the N termini of each protein []. ; PDB: 3K8P_C.
Probab=40.65 E-value=25 Score=24.84 Aligned_cols=31 Identities=29% Similarity=0.356 Sum_probs=20.8
Q ss_pred HHHHHHHhhccCCCCCCCchhhHHHHHHhhccc
Q 037321 40 LARMSQCMYQFGEGHGHGVQEITKARVLSLIVD 72 (75)
Q Consensus 40 laR~~~~~Y~~~Dg~t~~~~~~~k~~I~sll~~ 72 (75)
++=++..||+ ||=|-.+..+ +-+.|++||.+
T Consensus 241 LkDIm~~Fy~-Gel~~fsTdE-lI~lIkslFad 271 (291)
T PF11989_consen 241 LKDIMEMFYQ-GELYDFSTDE-LIQLIKSLFAD 271 (291)
T ss_dssp HHHHHHHHHT-TGGGGS-HHH-HHHHHHHHS--
T ss_pred HHHHHHHHhc-CchhcccHHH-HHHHHHHHhcC
Confidence 6677888887 4544444446 99999999986
No 19
>cd04470 S1_EF-P_repeat_1 S1_EF-P_repeat_1: Translation elongation factor P (EF-P), S1-like RNA-binding domain, repeat 1. EF-P stimulates the peptidyltransferase activity in the prokaryotic 70S ribosome. EF-P enhances the synthesis of certain dipeptides with N-formylmethionyl-tRNA and puromycine in vitro. EF-P binds to both the 30S and 50S ribosomal subunits. EF-P binds near the streptomycine binding site of the 16S rRNA in the 30S subunit. EF-P interacts with domains 2 and 5 of the 23S rRNA. The L16 ribosomal protein of the 50S or its N-terminal fragment are required for EF-P mediated peptide bond synthesis, whereas L11, L15, and L7/L12 are not required in this reaction, suggesting that EF-P may function at a different ribosomal site than most other translation factors. EF-P is essential for cell viability and is required for protein synthesis. EF-P is mainly present in bacteria. The EF-P homologs in archaea and eukaryotes are the initiation factors aIF5A and eIF5A, respectively. EF-P
Probab=40.56 E-value=3.4 Score=22.31 Aligned_cols=15 Identities=27% Similarity=0.339 Sum_probs=11.3
Q ss_pred HHHHHhhccCCCCCC
Q 037321 42 RMSQCMYQFGEGHGH 56 (75)
Q Consensus 42 R~~~~~Y~~~Dg~t~ 56 (75)
|-++++|++||+|..
T Consensus 3 ~~~qylY~dg~~~~F 17 (61)
T cd04470 3 REMQYLYKDGDNYVF 17 (61)
T ss_pred ceEEEEEeCCCEEEE
Confidence 456888999988743
No 20
>PF11613 UCN2: Agonist of corticotropin releasing factor R2, Urocortin-2; InterPro: IPR024270 This entry represents urocortin II and III, which are members of the corticotropin-releasing factor (CRF) family of neuropeptides. Urocortin II and III are secreted peptides and bind with high affinity to CRF receptors 2-alpha and 2-beta. They are involved in the suppression of food intake, the delay of gastric emptying, and may represent endogenous ligands for maintaining homeostasis after stress. Their backbone is mainly alpha-helical, but contains a helix-loop-helix motif [].; GO: 0051431 corticotropin-releasing hormone receptor 2 binding, 0006950 response to stress, 0007586 digestion, 0005615 extracellular space; PDB: 2RMH_A 3N93_C 3N95_F 2RMG_A.
Probab=40.18 E-value=18 Score=17.97 Aligned_cols=17 Identities=24% Similarity=0.409 Sum_probs=14.8
Q ss_pred CcHHHHHHHHHHHHHHH
Q 037321 29 VSDVFVGIAMNLARMSQ 45 (75)
Q Consensus 29 ~p~~~~~~~~NlaR~~~ 45 (75)
+|..+.+..+|.||.=.
T Consensus 7 VPt~im~iL~~~Akak~ 23 (38)
T PF11613_consen 7 VPTNIMNILFNIAKAKN 23 (38)
T ss_dssp TTHHHHHHHHHCCHHHH
T ss_pred CchhHHHHHHHHHHHHH
Confidence 99999999999998643
No 21
>PF02269 TFIID-18kDa: Transcription initiation factor IID, 18kD subunit; InterPro: IPR003195 This family includes the Spt3 yeast transcription factors and the 18 kDa subunit from human transcription initiation factor IID (TFIID-18). Determination of the crystal structure reveals an atypical histone fold [].; GO: 0006366 transcription from RNA polymerase II promoter; PDB: 1BH9_A 1BH8_A.
Probab=38.61 E-value=26 Score=20.37 Aligned_cols=26 Identities=31% Similarity=0.414 Sum_probs=14.1
Q ss_pred HHHhhccCCCCCCCchhhHHHHHHhhcc
Q 037321 44 SQCMYQFGEGHGHGVQEITKARVLSLIV 71 (75)
Q Consensus 44 ~~~~Y~~~Dg~t~~~~~~~k~~I~sll~ 71 (75)
.++||..||. ..|..+ +-+.|..++.
T Consensus 8 ~~mMy~fGD~-~~P~~e-Tv~lvE~iv~ 33 (93)
T PF02269_consen 8 RQMMYGFGDV-EEPLPE-TVDLVEDIVR 33 (93)
T ss_dssp HHHHHCTTS--SS--HH-HHHHHHHHHH
T ss_pred HHHHHHcCCC-CCCcHH-HHHHHHHHHH
Confidence 4678999986 334444 5555555543
No 22
>PRK10626 hypothetical protein; Provisional
Probab=38.08 E-value=25 Score=24.28 Aligned_cols=57 Identities=11% Similarity=0.119 Sum_probs=31.0
Q ss_pred HHHHHHHHHHHHHHHHHHhhc-CCC-C-CcHHHHHHHHHH-HHHHHHhhccCCCCCCCchh
Q 037321 4 REHIHDLIAQTWMKMNRDRFV-NPH-F-VSDVFVGIAMNL-ARMSQCMYQFGEGHGHGVQE 60 (75)
Q Consensus 4 ~~~i~~~i~~~wK~ln~e~l~-~~~-~-~p~~~~~~~~Nl-aR~~~~~Y~~~Dg~t~~~~~ 60 (75)
.+...+=++.++.-|.+-... ... + +-..+-...-++ .|+-.++|..+|||+.++..
T Consensus 88 ~~~a~~~l~~A~~alD~Vi~~~~G~~snvr~rl~~l~~~l~~q~~~i~e~r~d~~~f~~~~ 148 (239)
T PRK10626 88 DEGAKSRLEKARVALDKVIVQELGESSNVRNRLTKLDAQLKQQMNRIIEHRSDGLTFHHQA 148 (239)
T ss_pred HHHHHHHHHHHHHHHHHHHHhccCccchHHHHHHHHHHHHHHHHHHHHHccCCceeehHHH
Confidence 334444555666666655542 111 0 211222332222 46777889999999988653
No 23
>PF04659 Arch_fla_DE: Archaeal flagella protein ; InterPro: IPR006752 Archaeal flagella are unique motility structures, and the absence of bacterial structural motility genes in the complete genome sequences of flagellated archaeal species has always suggested that archaeal flagellar biogenesis is likely mediated by novel components. FlaD and FlaE, are present in the cell as membrane-associated proteins but are not major components of isolated flagellar filaments. Interestingly, flaD was found to encode two proteins, each translated from a separate ribosome binding site. This group of sequences contain the archaeal flaD and flaE proteins. The conserved region that defines these sequences is found in the N-teminal region of flaE but towards the C-terminal region of flaD [].; GO: 0001539 ciliary or flagellar motility
Probab=37.96 E-value=46 Score=19.81 Aligned_cols=42 Identities=10% Similarity=-0.107 Sum_probs=23.1
Q ss_pred CcHHHHHHHHHHHHHHHHhhccCCCCCCCchhhHHHHHHhhcc
Q 037321 29 VSDVFVGIAMNLARMSQCMYQFGEGHGHGVQEITKARVLSLIV 71 (75)
Q Consensus 29 ~p~~~~~~~~NlaR~~~~~Y~~~Dg~t~~~~~~~k~~I~sll~ 71 (75)
++..+..-....+|.+...-.+.+.-+...-. +.||+.||++
T Consensus 50 ISe~V~~~L~~~~~G~~~~~~~~~~~~~~~Lt-~~DH~~SL~y 91 (99)
T PF04659_consen 50 ISEEVREQLLKYLKGFKYDHDDFEWKPSWKLT-IDDHIKSLLY 91 (99)
T ss_pred cCHHHHHHHHHHHhCCCCCccccccCCCCCCC-HHHHHHHHHH
Confidence 55566666667776665322111111222224 7899999975
No 24
>PF11802 CENP-K: Centromere-associated protein K; InterPro: IPR020993 Cenp-K is one of seven new Cenp-A-nucleosome distal (CAD) centromere components (the others being Cenp-L, Cenp-O, Cenp-P, Cenp-Q, Cenp-R and Cenp-S) that are identified as assembling on the Cenp-A nucleosome associated complex, NAC []. The Cenp-A NAC is essential, as disruption of the complex causes errors of chromosome alignment and segregation that preclude cell survival despite continued centromere-derived mitotic checkpoint signalling. Cenp-K is centromere-associated through its interaction with one or more components of the Cenp-A NAC.; GO: 0005634 nucleus
Probab=37.86 E-value=46 Score=23.39 Aligned_cols=20 Identities=20% Similarity=0.420 Sum_probs=17.6
Q ss_pred hHHHHHHHHHHHHHHHHHHH
Q 037321 2 EAREHIHDLIAQTWMKMNRD 21 (75)
Q Consensus 2 eA~~~i~~~i~~~wK~ln~e 21 (75)
+|.+.+.+-.++.||+|++.
T Consensus 16 ~~~~~l~~eCEe~wk~me~~ 35 (268)
T PF11802_consen 16 DAKEELIKECEELWKDMEEC 35 (268)
T ss_pred hHHHHHHHHHHHHHHHHHHH
Confidence 57888999999999999864
No 25
>PHA02676 A-type inclusion protein; Provisional
Probab=37.34 E-value=35 Score=26.08 Aligned_cols=48 Identities=8% Similarity=0.067 Sum_probs=34.3
Q ss_pred hHHHHHHHHHHHHHHHHHHHhhcCCCCCcHHHHHHHHHHHHHHHHhhccCCC
Q 037321 2 EAREHIHDLIAQTWMKMNRDRFVNPHFVSDVFVGIAMNLARMSQCMYQFGEG 53 (75)
Q Consensus 2 eA~~~i~~~i~~~wK~ln~e~l~~~~~~p~~~~~~~~NlaR~~~~~Y~~~Dg 53 (75)
.-++.++.||++.|..-..+ .+. +|+.-.+++=|+-|--..-+---||
T Consensus 6 klV~~FkslV~~tW~~~Ln~---~sC-IpR~~RKiIRnVlR~YI~~aP~i~g 53 (520)
T PHA02676 6 SLVAEFKSLVQRHWHHPLNS---VSC-IPREDRKLIRDILREYIRLAPPIDG 53 (520)
T ss_pred HHHHHHHHHHHHhcCCcccc---ccc-ccHHHHHHHHHHHHHHHHhCCCccc
Confidence 45678999999999653332 566 9999999999999965444433344
No 26
>cd07018 S49_SppA_67K_type Signal peptide peptidase A (SppA) 67K type, a serine protease, has catalytic Ser-Lys dyad. Signal peptide peptidase A (SppA; Peptidase S49; Protease IV) 67K type: SppA is found in all three domains of life and is involved in the cleavage of signal peptides after their removal from the precursor proteins by signal peptidases. Members in this subfamily contain an amino-terminal domain in addition to the carboxyl-terminal protease domain that is conserved in all the S49 family members (sometimes referred to as 67K type), similar to E. coli and Arabidopsis thaliana SppA peptidases. Unlike the eukaryotic functional homologs that are proposed to be aspartic proteases, site-directed mutagenesis and sequence analysis have shown that members in this subfamily, mostly bacterial, are serine proteases. The predicted active site serine for members in this family occurs in a transmembrane domain. Mutagenesis studies also suggest that the catalytic center comprises a Ser-Lys
Probab=36.35 E-value=1.1e+02 Score=20.22 Aligned_cols=38 Identities=21% Similarity=0.266 Sum_probs=28.9
Q ss_pred ChHHHHHHHHHHHHHHHHHHHhhcCCCCCcHHHHHHHHH
Q 037321 1 REAREHIHDLIAQTWMKMNRDRFVNPHFVSDVFVGIAMN 39 (75)
Q Consensus 1 EeA~~~i~~~i~~~wK~ln~e~l~~~~~~p~~~~~~~~N 39 (75)
+++++.++.++++.|...-+..-+... ++...++...+
T Consensus 155 ~~~r~~~~~~l~~~~~~f~~~Va~~R~-~~~~~~~~~~~ 192 (222)
T cd07018 155 PEAREQTQALLDSLWDQYLADVAASRG-LSPDALEALID 192 (222)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHcC-CCHHHHHHHHH
Confidence 478899999999999998888876555 66665554443
No 27
>PF13798 PCYCGC: Protein of unknown function with PCYCGC motif
Probab=36.02 E-value=29 Score=22.52 Aligned_cols=23 Identities=9% Similarity=0.124 Sum_probs=16.8
Q ss_pred HHHHHHHHHHHHHHHHHHhhcCCCCCc
Q 037321 4 REHIHDLIAQTWMKMNRDRFVNPHFVS 30 (75)
Q Consensus 4 ~~~i~~~i~~~wK~ln~e~l~~~~~~p 30 (75)
.++||..|++.||+ .+.+|++ -|
T Consensus 134 ~~eIR~~ID~kYk~---g~~~pTp-Tp 156 (158)
T PF13798_consen 134 PKEIRQYIDEKYKE---GYAKPTP-TP 156 (158)
T ss_pred HHHHHHHHHHHHHh---CCCCCCC-CC
Confidence 46899999999985 3666665 44
No 28
>PF10835 DUF2573: Protein of unknown function (DUF2573); InterPro: IPR020393 This entry contains proteins with no known function.
Probab=35.53 E-value=66 Score=18.64 Aligned_cols=21 Identities=33% Similarity=0.529 Sum_probs=17.4
Q ss_pred hHHHHHHHHHHHHHHHHHHHhh
Q 037321 2 EAREHIHDLIAQTWMKMNRDRF 23 (75)
Q Consensus 2 eA~~~i~~~i~~~wK~ln~e~l 23 (75)
||.+.|+.+|++- |+||++.-
T Consensus 57 eaK~~ik~li~~I-k~lNe~~r 77 (82)
T PF10835_consen 57 EAKEEIKELIEEI-KQLNEAHR 77 (82)
T ss_pred hHHHHHHHHHHHH-HHHHHHHH
Confidence 7899999999875 69998754
No 29
>cd01389 MATA_HMG-box MATA_HMG-box, class I member of the HMG-box superfamily of DNA-binding proteins. These proteins contain a single HMG box, and bind the minor groove of DNA in a highly sequence-specific manner. Members include the fungal mating type gene products MC, MATA1 and Ste11.
Probab=35.46 E-value=27 Score=19.08 Aligned_cols=17 Identities=18% Similarity=0.517 Sum_probs=12.6
Q ss_pred HHHHHHHHHHHHHHHHH
Q 037321 5 EHIHDLIAQTWMKMNRD 21 (75)
Q Consensus 5 ~~i~~~i~~~wK~ln~e 21 (75)
.+|..+|.+.|+.|..+
T Consensus 30 ~eisk~~g~~Wk~ls~e 46 (77)
T cd01389 30 NEISRIIGRMWRSESPE 46 (77)
T ss_pred HHHHHHHHHHHhhCCHH
Confidence 46788888999876543
No 30
>PF12357 PLD_C: Phospholipase D C terminal ; InterPro: IPR024632 Phospholipase D (PLD) catalyses the hydrolysis of the phosphodiester bond of glycerophospholipids to generate phosphatidic acid and a free head group. Phospholipase D activities have been detected in simple to complex organisms from viruses and bacteria to yeast, plants, and mammals []. In higher organisms, PLD specifically catalyzes the hydrolysis of phosphatidylcholine (PC) to phosphatidic acid (PA) and choline and is activated in response to stimulators of vesicle transport, endocytosis, exocytosis, cell migration, and mitosis. This entry represents the C-terminal domain of eukaryotic phospholipase D. The domain is approximately 70 amino acids in length and contains a conserved FPD sequence motif.
Probab=34.37 E-value=52 Score=18.73 Aligned_cols=22 Identities=9% Similarity=0.472 Sum_probs=17.5
Q ss_pred hHHHHHHHHHHHHHHHHHHHhh
Q 037321 2 EAREHIHDLIAQTWMKMNRDRF 23 (75)
Q Consensus 2 eA~~~i~~~i~~~wK~ln~e~l 23 (75)
|.++.++++.++.|+..-.+-+
T Consensus 12 eCVr~Vn~iae~nW~~y~~ee~ 33 (74)
T PF12357_consen 12 ECVRRVNEIAEENWKQYASEEV 33 (74)
T ss_pred HHHHHHHHHHHHHHHHhhcccc
Confidence 5678999999999987665544
No 31
>PF05341 DUF708: Protein of unknown function (DUF708); InterPro: IPR008005 This entry is represented by Autographa californica nuclear polyhedrosis virus (AcMNPV), Orf68; it is a family of uncharacterised viral proteins.
Probab=34.25 E-value=55 Score=19.87 Aligned_cols=35 Identities=11% Similarity=0.136 Sum_probs=23.2
Q ss_pred HHHHHHHHHHhhcCCCCCcHHHHHHHHHHHHHHHHhhc
Q 037321 12 AQTWMKMNRDRFVNPHFVSDVFVGIAMNLARMSQCMYQ 49 (75)
Q Consensus 12 ~~~wK~ln~e~l~~~~~~p~~~~~~~~NlaR~~~~~Y~ 49 (75)
|.|||+|-=..+..++ ..--+..++-|-.-+|=|+
T Consensus 20 e~AWkdLii~~l~~tP---~~tyRt~l~ka~~EnFDYk 54 (105)
T PF05341_consen 20 EHAWKDLIILLLRNTP---RSTYRTMLRKANFENFDYK 54 (105)
T ss_pred HHHHHHHHHHHHhcCc---HHHHHhhhhhhhhhccCCC
Confidence 6899999998887654 4333555666655555554
No 32
>PF12910 RelB_N: Antitoxin of toxin-antitoxin stability system N-terminal; PDB: 3K6Q_C.
Probab=33.87 E-value=37 Score=17.10 Aligned_cols=16 Identities=31% Similarity=0.401 Sum_probs=13.2
Q ss_pred hHHHHHHHHHHHHHHH
Q 037321 2 EAREHIHDLIAQTWMK 17 (75)
Q Consensus 2 eA~~~i~~~i~~~wK~ 17 (75)
||+.++.++++.+..+
T Consensus 8 eAR~~ls~l~d~v~~~ 23 (46)
T PF12910_consen 8 EARNNLSKLIDRVVNG 23 (46)
T ss_dssp HHHHTHHHHHHHHHHT
T ss_pred HHHHhHHHHHHHHHcC
Confidence 7888999998888754
No 33
>PF01132 EFP: Elongation factor P (EF-P) OB domain; InterPro: IPR001059 Elongation factor P (EF-P) is a prokaryotic protein translation factor required for efficient peptide bond synthesis on 70S ribosomes from fMet-tRNAfMet []. Probably functions indirectly by altering the affinity of the ribosome for aminoacyl-tRNA, thus increasing their reactivity as acceptors for peptidyl transferase. This entry reresents the central domain of elongation factor P and its homologues. It forms an oligonucleotide-binding (OB) fold, though it is not clear if this region is involved in binding nucleic acids [].; GO: 0003746 translation elongation factor activity, 0006414 translational elongation; PDB: 1YBY_A 3A5Z_H 3TRE_A 1UEB_B 3HUW_V 3HUY_V 3OYY_B.
Probab=32.57 E-value=2.7 Score=22.18 Aligned_cols=16 Identities=25% Similarity=0.245 Sum_probs=11.4
Q ss_pred HHHHHhhccCCCCCCC
Q 037321 42 RMSQCMYQFGEGHGHG 57 (75)
Q Consensus 42 R~~~~~Y~~~Dg~t~~ 57 (75)
|-++++|++||+|..-
T Consensus 2 r~~qylY~dgd~~~FM 17 (55)
T PF01132_consen 2 REMQYLYKDGDNYVFM 17 (55)
T ss_dssp EEEEEEEEESSEEEEE
T ss_pred ceEEEEEeCCCEEEEe
Confidence 3457889998887654
No 34
>PF10305 Fmp27_SW: RNA pol II promoter Fmp27 protein domain; InterPro: IPR019415 The function of the FMP27 protein is not known. FMP27 is the product of a nuclear encoded gene but it is detected in highly purified mitochondria in high-throughput studies []. This entry represents a conserved region within FMP27 that contains characteristic SW and GKG sequence motifs.
Probab=30.94 E-value=57 Score=19.29 Aligned_cols=19 Identities=32% Similarity=0.667 Sum_probs=14.7
Q ss_pred hHHHHHHHHHHHHHHHHHH
Q 037321 2 EAREHIHDLIAQTWMKMNR 20 (75)
Q Consensus 2 eA~~~i~~~i~~~wK~ln~ 20 (75)
||+..+.+-...+|+.=-+
T Consensus 1 eA~~rL~~~~S~SWi~ri~ 19 (103)
T PF10305_consen 1 EARERLQENNSTSWIRRIR 19 (103)
T ss_pred CHHHHHHHHHHHHHHHHHH
Confidence 6888888888888876433
No 35
>PF14394 DUF4423: Domain of unknown function (DUF4423)
Probab=30.80 E-value=63 Score=20.79 Aligned_cols=24 Identities=25% Similarity=0.068 Sum_probs=20.0
Q ss_pred ChHHHHHHHHHHHHHHHHHHHhhc
Q 037321 1 REAREHIHDLIAQTWMKMNRDRFV 24 (75)
Q Consensus 1 EeA~~~i~~~i~~~wK~ln~e~l~ 24 (75)
+++.++|+++|++.+|+++...=+
T Consensus 131 ~~~~~ki~~~i~~fRk~i~~i~~~ 154 (171)
T PF14394_consen 131 REDYEKIKKEIREFRKKIIAIAEE 154 (171)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHhc
Confidence 368899999999999999976543
No 36
>PF01023 S_100: S-100/ICaBP type calcium binding domain; InterPro: IPR013787 The calcium-binding domain found in S100 and CaBP-9k proteins is a subfamily of the EF-hand calcium-binding domain []. S100s are small dimeric acidic calcium and zinc-binding proteins abundant in the brain, with S100B playing an important role in modulating the proliferation and differentiation of neurons and glia cells []. S100 proteins have two different types of calcium-binding sites: a low affinity one with a special structure, and a 'normal' EF-hand type high-affinity site. Calbindin-D9k (CaBP-9k) also belong to this family of proteins, but it does not form dimers. CaBP-9k is a cytosolic protein expressed in a variety of tissues. Although its precise function is unknown, it appears to be under the control of the steroid hormones oestrogen and progesterone in the female reproductive system []. In the intestine, CaBP-9k may be involved in calcium absorption by mediating intracellular diffusion []. This entry represents a subdomain of the calcium-binding domain found in S100, CaBP-9k, and related proteins.; PDB: 2RGI_A 4DUQ_B 2KAY_B 2KAX_A 2CNP_A 1CNP_A 1A03_A 1JWD_B 2JTT_A 1XK4_B ....
Probab=30.45 E-value=76 Score=15.86 Aligned_cols=18 Identities=17% Similarity=0.082 Sum_probs=10.5
Q ss_pred cCCCCCCCchhhHHHHHHh
Q 037321 50 FGEGHGHGVQEITKARVLS 68 (75)
Q Consensus 50 ~~Dg~t~~~~~~~k~~I~s 68 (75)
+||..+.+..+ +|..+..
T Consensus 19 ~Gd~~~Lsk~E-lk~Ll~~ 36 (44)
T PF01023_consen 19 EGDKDTLSKKE-LKELLEK 36 (44)
T ss_dssp SSSTTSEEHHH-HHHHHHH
T ss_pred CCCCCeEcHHH-HHHHHHH
Confidence 45666665545 7766653
No 37
>PF13864 Enkurin: Calmodulin-binding
Probab=30.36 E-value=83 Score=18.21 Aligned_cols=23 Identities=17% Similarity=0.469 Sum_probs=18.1
Q ss_pred hHHHHHHHHHHHHHHHHHHHhhc
Q 037321 2 EAREHIHDLIAQTWMKMNRDRFV 24 (75)
Q Consensus 2 eA~~~i~~~i~~~wK~ln~e~l~ 24 (75)
+-+..+-.-+...|.++|.++-.
T Consensus 40 eER~~lL~~Lk~~~~el~~ey~~ 62 (98)
T PF13864_consen 40 EERQELLEGLKKNWDELNKEYQK 62 (98)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHh
Confidence 34566777788999999999874
No 38
>TIGR00965 dapD 2,3,4,5-tetrahydropyridine-2,6-dicarboxylate N-succinyltransferase. The closely related TabB protein of Pseudomonas syringae (pv. tabaci) appears to act in the biosynthesis of tabtoxin rather than lysine. The trusted cutoff is set high enough to exclude this gene. Sequences below trusted also include a version of this enzyme which apparently utilize acetate rather than succinate (EC: 2.3.1.89).
Probab=30.12 E-value=1.3e+02 Score=21.09 Aligned_cols=31 Identities=10% Similarity=0.247 Sum_probs=19.4
Q ss_pred HHHHHHHHHHHHHHHHhhcCCCCCcHHHHHHHHH
Q 037321 6 HIHDLIAQTWMKMNRDRFVNPHFVSDVFVGIAMN 39 (75)
Q Consensus 6 ~i~~~i~~~wK~ln~e~l~~~~~~p~~~~~~~~N 39 (75)
+++..|+++|.+ ++-+.+.. .+...++.+..
T Consensus 1 ~~~~~i~~~~~~--~~~~~~~~-~~~~~~~~~~~ 31 (269)
T TIGR00965 1 QLQNIIETAFER--RAEITPAN-ADTVTKEAVNE 31 (269)
T ss_pred CHHHHHHHHHHH--HHhCCCcC-CCHHHHHHHHH
Confidence 368899999987 44465444 44445444433
No 39
>TIGR00706 SppA_dom signal peptide peptidase SppA, 36K type. The member of this family from Bacillus subtilis was shown to have properties consistent with a role in degrading signal peptides after cleavage from precursor proteins, although it was not demonstrated conclusively.
Probab=30.12 E-value=1.5e+02 Score=19.23 Aligned_cols=37 Identities=11% Similarity=0.097 Sum_probs=27.3
Q ss_pred ChHHHHHHHHHHHHHHHHHHHhhcCCCCCcHHHHHHHH
Q 037321 1 REAREHIHDLIAQTWMKMNRDRFVNPHFVSDVFVGIAM 38 (75)
Q Consensus 1 EeA~~~i~~~i~~~wK~ln~e~l~~~~~~p~~~~~~~~ 38 (75)
++.++.++.++++.|...-...-+... ++..-++.++
T Consensus 137 ~~~~e~~~~~l~~~~~~f~~~va~~R~-~~~~~~~~~~ 173 (207)
T TIGR00706 137 PEERDILQNLVNESYEQFVQVVAKGRN-LPVEDVKKFA 173 (207)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCC-CCHHHHHHHh
Confidence 367899999999999998888776555 6655444433
No 40
>cd07978 TAF13 The TATA Binding Protein (TBP) Associated Factor 13 (TAF13) is one of several TAFs that bind TBP and is involved in forming Transcription Factor IID (TFIID) complex. The TATA Binding Protein (TBP) Associated Factor 13 (TAF13) is one of several TAFs that bind TBP and is involved in forming the Transcription Factor IID (TFIID) complex. TFIID is one of seven General Transcription Factors (GTF) (TFIIA, TFIIB, TFIID, TFIIE, TFIIF, and TFIID) that are involved in accurate initiation of transcription by RNA polymerase II in eukaryotes. TFIID plays an important role in the recognition of promoter DNA and assembly of the pre-initiation complex. TFIID complex is composed of the TBP and at least 13 TAFs. TAFs from various species were originally named by their predicted molecular weight or their electrophoretic mobility in polyacrylamide gels. A new, unified nomenclature for the pol II TAFs has been suggested to show the relationship between TAFs orthologs and paralogs. Several hy
Probab=29.96 E-value=61 Score=18.90 Aligned_cols=26 Identities=31% Similarity=0.498 Sum_probs=14.7
Q ss_pred HHHhhccCCCCCCCchhhHHHHHHhhcc
Q 037321 44 SQCMYQFGEGHGHGVQEITKARVLSLIV 71 (75)
Q Consensus 44 ~~~~Y~~~Dg~t~~~~~~~k~~I~sll~ 71 (75)
.+++|..||.- .|..+ +-+.|..++.
T Consensus 9 ~~mmy~~GD~~-~P~~e-Tv~llE~iv~ 34 (92)
T cd07978 9 RQMMYGFGDVQ-NPLPE-TVDLLEDIVV 34 (92)
T ss_pred HHHHHHcCCCC-CCCHH-HHHHHHHHHH
Confidence 46778888862 24334 5555554443
No 41
>PF14799 FAM195: FAM195 family
Probab=29.64 E-value=86 Score=18.59 Aligned_cols=21 Identities=24% Similarity=0.491 Sum_probs=17.3
Q ss_pred HHHHHHHHHHHHHHHHHHHhh
Q 037321 3 AREHIHDLIAQTWMKMNRDRF 23 (75)
Q Consensus 3 A~~~i~~~i~~~wK~ln~e~l 23 (75)
|-+++-+.|-++|..+.++.=
T Consensus 33 qHEElVrfI~esW~~V~~~~~ 53 (99)
T PF14799_consen 33 QHEELVRFIYESWNKVERELE 53 (99)
T ss_pred hHHHHHHHHHHHHHHHHHHcc
Confidence 457788899999999998853
No 42
>PF12807 eIF3_p135: Translation initiation factor eIF3 subunit 135
Probab=29.41 E-value=1.4e+02 Score=18.85 Aligned_cols=19 Identities=16% Similarity=0.345 Sum_probs=16.8
Q ss_pred HHHHHHHHHHHHHHHhhcC
Q 037321 7 IHDLIAQTWMKMNRDRFVN 25 (75)
Q Consensus 7 i~~~i~~~wK~ln~e~l~~ 25 (75)
+.+||..+.|.+-++.+..
T Consensus 46 ~~EmiaR~~K~i~r~~l~~ 64 (169)
T PF12807_consen 46 LQEMIARAAKHILREYLRS 64 (169)
T ss_pred HHHHHHHHHHHHHHHHHHc
Confidence 4789999999999999974
No 43
>cd01388 SOX-TCF_HMG-box SOX-TCF_HMG-box, class I member of the HMG-box superfamily of DNA-binding proteins. These proteins contain a single HMG box, and bind the minor groove of DNA in a highly sequence-specific manner. Members include SRY and its homologs in insects and vertebrates, and transcription factor-like proteins, TCF-1, -3, -4, and LEF-1. They appear to bind the minor groove of the A/T C A A A G/C-motif.
Probab=29.26 E-value=40 Score=18.14 Aligned_cols=17 Identities=12% Similarity=0.585 Sum_probs=12.3
Q ss_pred HHHHHHHHHHHHHHHHH
Q 037321 5 EHIHDLIAQTWMKMNRD 21 (75)
Q Consensus 5 ~~i~~~i~~~wK~ln~e 21 (75)
..|.+++.+.|+.|..+
T Consensus 30 ~eisk~l~~~Wk~ls~~ 46 (72)
T cd01388 30 RAISKILGDRWKALSNE 46 (72)
T ss_pred HHHHHHHHHHHHcCCHH
Confidence 46777888889876543
No 44
>cd07019 S49_SppA_1 Signal peptide peptidase A (SppA), a serine protease, has catalytic Ser-Lys dyad. Signal peptide peptidase A (SppA; Peptidase S49; Protease IV): SppAs in this subfamily are found in all three domains of life and are involved in the cleavage of signal peptides after their removal from the precursor proteins by signal peptidases. Site-directed mutagenesis and sequence analysis have shown these bacterial, archaeal and thylakoid SppAs to be serine proteases. The predicted active site serine for members in this family occurs in a transmembrane domain. Mutagenesis studies also suggest that the catalytic center comprises a Ser-Lys dyad (both residues absolutely conserved within bacteria, chloroplast and mitochondrial signal peptidase family members) and not the usual Ser-His-Asp catalytic triad found in the majority of serine proteases. In addition to the carboxyl-terminal protease domain that is conserved in all the S49 family members, the E. coli SppA contains an amino-te
Probab=28.97 E-value=1.2e+02 Score=19.84 Aligned_cols=35 Identities=11% Similarity=-0.016 Sum_probs=26.1
Q ss_pred hHHHHHHHHHHHHHHHHHHHhhcCCCCCcHHHHHHH
Q 037321 2 EAREHIHDLIAQTWMKMNRDRFVNPHFVSDVFVGIA 37 (75)
Q Consensus 2 eA~~~i~~~i~~~wK~ln~e~l~~~~~~p~~~~~~~ 37 (75)
|+++.++.++++.|....+..-+... +|..-++..
T Consensus 146 e~r~~~~~~ld~~~~~f~~~Va~~R~-~~~~~l~~~ 180 (211)
T cd07019 146 EAQLGLQLSIENGYKRFITLVADARH-STPEQIDKI 180 (211)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhCC-CCHHHHHHh
Confidence 67889999999999999887776555 665543333
No 45
>PF07553 Lipoprotein_Ltp: Host cell surface-exposed lipoprotein; InterPro: IPR011434 This domain is found as 1-3 copies in a small family of proteins of unknown function.
Probab=28.78 E-value=32 Score=17.80 Aligned_cols=12 Identities=33% Similarity=0.598 Sum_probs=8.8
Q ss_pred HHhhccCCCCCC
Q 037321 45 QCMYQFGEGHGH 56 (75)
Q Consensus 45 ~~~Y~~~Dg~t~ 56 (75)
++.+.+|||||.
T Consensus 27 QL~se~ge~Ft~ 38 (48)
T PF07553_consen 27 QLTSEYGEGFTE 38 (48)
T ss_pred HHHhhcccCCCH
Confidence 445678999985
No 46
>PF08494 DEAD_assoc: DEAD/H associated; InterPro: IPR013701 This domain is found in ATP-dependent helicases as well as a number of hypothetical proteins together with the helicase conserved C-terminal domain (IPR011545 from INTERPRO) and the IPR001650 from INTERPRO domain. ; GO: 0005524 ATP binding, 0016818 hydrolase activity, acting on acid anhydrides, in phosphorus-containing anhydrides
Probab=28.75 E-value=1.7e+02 Score=18.92 Aligned_cols=42 Identities=12% Similarity=0.048 Sum_probs=31.9
Q ss_pred HHHHHHHHHHhhcCCCCCcHHHHHHHHHHHHHHHHhhccCCCCCCC
Q 037321 12 AQTWMKMNRDRFVNPHFVSDVFVGIAMNLARMSQCMYQFGEGHGHG 57 (75)
Q Consensus 12 ~~~wK~ln~e~l~~~~~~p~~~~~~~~NlaR~~~~~Y~~~Dg~t~~ 57 (75)
.+...++-++.+..++ -++..+.+.|+.+-.+-++.-|...|
T Consensus 80 ~~~l~~~l~~al~~s~----lfk~rfr~vA~r~gli~r~~~g~~~~ 121 (187)
T PF08494_consen 80 PDNLEEDLREALKRSE----LFKRRFRHVARRFGLIPRNYPGKRKS 121 (187)
T ss_pred HHHHHHHHHHHHhcCH----HHHHHHHHHHHHhCCCcCCCCCCcCc
Confidence 5777788888886554 78899999999998887765555544
No 47
>PF09011 HMG_box_2: HMG-box domain; InterPro: IPR015101 This domain is predominantly found in Maelstrom homologue proteins. It has no known function. ; GO: 0005634 nucleus; PDB: 2EQZ_A 1V64_A 2CTO_A 1H5P_A 3TQ6_A 3FGH_A 3TMM_A 1J3X_A 2YRQ_A 1AAB_A ....
Probab=27.84 E-value=68 Score=17.22 Aligned_cols=17 Identities=12% Similarity=0.522 Sum_probs=11.8
Q ss_pred HHHHHHHHHHHHHHHHH
Q 037321 5 EHIHDLIAQTWMKMNRD 21 (75)
Q Consensus 5 ~~i~~~i~~~wK~ln~e 21 (75)
..+...|.+.|+.|..+
T Consensus 33 ~e~~k~~~~~Wk~Ls~~ 49 (73)
T PF09011_consen 33 REVMKEISERWKSLSEE 49 (73)
T ss_dssp HHHHHHHHHHHHHS-HH
T ss_pred HHHHHHHHHHHHhcCHH
Confidence 45667888889887754
No 48
>cd01390 HMGB-UBF_HMG-box HMGB-UBF_HMG-box, class II and III members of the HMG-box superfamily of DNA-binding proteins. These proteins bind the minor groove of DNA in a non-sequence specific fashion and contain two or more tandem HMG boxes. Class II members include non-histone chromosomal proteins, HMG1 and HMG2, which bind to bent or distorted DNA such as four-way DNA junctions, synthetic DNA cruciforms, kinked cisplatin-modified DNA, DNA bulges, cross-overs in supercoiled DNA, and can cause looping of linear DNA. Class III members include nucleolar and mitochondrial transcription factors, UBF and mtTF1, which bind four-way DNA junctions.
Probab=27.76 E-value=52 Score=16.83 Aligned_cols=18 Identities=6% Similarity=0.562 Sum_probs=12.2
Q ss_pred HHHHHHHHHHHHHHHHHH
Q 037321 4 REHIHDLIAQTWMKMNRD 21 (75)
Q Consensus 4 ~~~i~~~i~~~wK~ln~e 21 (75)
...|...+.+.|+.|..+
T Consensus 28 ~~~i~~~~~~~W~~ls~~ 45 (66)
T cd01390 28 VTEVTKILGEKWKELSEE 45 (66)
T ss_pred HHHHHHHHHHHHHhCCHH
Confidence 346777788888776543
No 49
>PF01713 Smr: Smr domain; InterPro: IPR002625 This family includes the Smr (Small MutS Related) proteins, and the C-terminal region of the MutS2 protein. It has been suggested that this domain interacts with the MutS1 (P23909 from SWISSPROT) protein in the case of Smr proteins and with the N-terminal MutS related region of MutS2, P94545 from SWISSPROT [].; PDB: 3QD7_X 2D9I_A 3FAU_A 2VKC_A 2ZQE_A.
Probab=27.57 E-value=59 Score=17.86 Aligned_cols=22 Identities=14% Similarity=0.237 Sum_probs=17.3
Q ss_pred ChHHHHHHHHHHHHHHHHHHHh
Q 037321 1 REAREHIHDLIAQTWMKMNRDR 22 (75)
Q Consensus 1 EeA~~~i~~~i~~~wK~ln~e~ 22 (75)
+||+..+...++++|+.-.+.+
T Consensus 9 ~eA~~~l~~~l~~~~~~~~~~~ 30 (83)
T PF01713_consen 9 EEALRALEEFLDEARQRGIREL 30 (83)
T ss_dssp HHHHHHHHHHHHHHHHTTHSEE
T ss_pred HHHHHHHHHHHHHHHHcCCCEE
Confidence 4799999999999987655554
No 50
>PRK11830 dapD 2,3,4,5-tetrahydropyridine-2,6-carboxylate N-succinyltransferase; Provisional
Probab=26.95 E-value=2e+02 Score=20.06 Aligned_cols=31 Identities=10% Similarity=0.322 Sum_probs=19.9
Q ss_pred HHHHHHHHHHHHHHHHhhcCCCCCcHHHHHHHHH
Q 037321 6 HIHDLIAQTWMKMNRDRFVNPHFVSDVFVGIAMN 39 (75)
Q Consensus 6 ~i~~~i~~~wK~ln~e~l~~~~~~p~~~~~~~~N 39 (75)
.++..||++|.. ++-+.+.. .+...++.+..
T Consensus 3 ~~~~~i~~~~~~--~~~~~~~~-~~~~~~~~~~~ 33 (272)
T PRK11830 3 QLQKIIEEAWER--RAELTPAT-ADTEVREAVEE 33 (272)
T ss_pred HHHHHHHHHHHh--HhhCCccC-CCHHHHHHHHH
Confidence 588999999987 44465544 45555444443
No 51
>PF13904 DUF4207: Domain of unknown function (DUF4207)
Probab=26.87 E-value=17 Score=24.98 Aligned_cols=14 Identities=14% Similarity=0.434 Sum_probs=11.0
Q ss_pred HHHHHHhhccccCC
Q 037321 62 TKARVLSLIVDPIA 75 (75)
Q Consensus 62 ~k~~I~sll~~Pi~ 75 (75)
+.++|.-+|++|+|
T Consensus 250 Lrgt~s~~y~NP~p 263 (264)
T PF13904_consen 250 LRGTYSPIYINPIP 263 (264)
T ss_pred ccCcCCccccCCCC
Confidence 66667788999986
No 52
>cd00687 Terpene_cyclase_nonplant_C1 Non-plant Terpene Cyclases, Class 1. This CD includes terpenoid cyclases such as pentalenene synthase and aristolochene synthase which, using an all-trans pathway, catalyze the ionization of farnesyl diphosphate, followed by the formation of a macrocyclic intermediate by bond formation between C1 with either C10 (aristolochene synthase) or C11 (pentalenene synthase), resulting in production of tricyclic hydrocarbon pentalenene or bicyclic hydrocarbon aristolochene. As with other enzymes with the 'terpenoid synthase fold', they have two conserved metal binding motifs, proposed to coordinate Mg2+ ion-bridged binding of the diphosphate moiety of FPP to the enzymes. Metal-triggered substrate ionization initiates catalysis, and the alpha-barrel active site serves as a template to channel and stabilize the conformations of reactive carbocation intermediates through a complex cyclization cascade. These enzymes function in the monomeric form and are found in
Probab=26.85 E-value=75 Score=21.47 Aligned_cols=22 Identities=14% Similarity=0.166 Sum_probs=18.8
Q ss_pred ChHHHHHHHHHHHHHHHHHHHh
Q 037321 1 REAREHIHDLIAQTWMKMNRDR 22 (75)
Q Consensus 1 EeA~~~i~~~i~~~wK~ln~e~ 22 (75)
|+|++++.+++.+.+++..+..
T Consensus 244 ~eA~~~~~~~~~~~~~~f~~~~ 265 (303)
T cd00687 244 EEAISVVRDMHNERITQFEELE 265 (303)
T ss_pred HHHHHHHHHHHHHHHHHHHHHH
Confidence 5899999999999998877553
No 53
>PHA02718 hypothetical protein; Provisional
Probab=26.70 E-value=21 Score=19.43 Aligned_cols=28 Identities=29% Similarity=0.266 Sum_probs=19.9
Q ss_pred HHHHHHhhccCCCCCCCchhhHHHHHHhhccc
Q 037321 41 ARMSQCMYQFGEGHGHGVQEITKARVLSLIVD 72 (75)
Q Consensus 41 aR~~~~~Y~~~Dg~t~~~~~~~k~~I~sll~~ 72 (75)
-|..+|+|-.+..||.- +|+.+.|-+++
T Consensus 28 hrssdflylsdrdytsi----tketlvseive 55 (69)
T PHA02718 28 HRSSDFLYLSDRDYTSI----TKETLVSEIVE 55 (69)
T ss_pred eeccceEEEcccccchh----hHHHHHHHHHH
Confidence 48889999988888743 56666655554
No 54
>KOG0027 consensus Calmodulin and related proteins (EF-Hand superfamily) [Signal transduction mechanisms]
Probab=26.69 E-value=1.2e+02 Score=18.50 Aligned_cols=35 Identities=26% Similarity=0.209 Sum_probs=26.4
Q ss_pred HHHHHHHHHHhhccCCCCCCCchhhHHHHHHhhcccc
Q 037321 37 AMNLARMSQCMYQFGEGHGHGVQEITKARVLSLIVDP 73 (75)
Q Consensus 37 ~~NlaR~~~~~Y~~~Dg~t~~~~~~~k~~I~sll~~P 73 (75)
...+.++.+.+-+++||+-.. .+ +...+++|-..|
T Consensus 7 ~~el~~~F~~fD~d~~G~i~~-~e-l~~~lr~lg~~~ 41 (151)
T KOG0027|consen 7 ILELKEAFQLFDKDGDGKISV-EE-LGAVLRSLGQNP 41 (151)
T ss_pred HHHHHHHHHHHCCCCCCcccH-HH-HHHHHHHcCCCC
Confidence 455677777878889987765 46 888888887765
No 55
>smart00463 SMR Small MutS-related domain.
Probab=26.51 E-value=66 Score=17.50 Aligned_cols=19 Identities=16% Similarity=0.190 Sum_probs=15.6
Q ss_pred ChHHHHHHHHHHHHHHHHH
Q 037321 1 REAREHIHDLIAQTWMKMN 19 (75)
Q Consensus 1 EeA~~~i~~~i~~~wK~ln 19 (75)
+||+..+...+++.|+.-.
T Consensus 12 ~eA~~~l~~~l~~~~~~~~ 30 (80)
T smart00463 12 EEALTALDKFLNNARLKGL 30 (80)
T ss_pred HHHHHHHHHHHHHHHHcCC
Confidence 5899999999998887643
No 56
>PF14555 UBA_4: UBA-like domain; PDB: 2DAL_A 3BQ3_A 2L4E_A 2L4F_A 2DZL_A 2L2D_A 2DAM_A 1V92_A 3E21_A.
Probab=26.28 E-value=44 Score=16.38 Aligned_cols=18 Identities=17% Similarity=0.242 Sum_probs=12.6
Q ss_pred HHHHHHHHHHHhhccCCC
Q 037321 36 IAMNLARMSQCMYQFGEG 53 (75)
Q Consensus 36 ~~~NlaR~~~~~Y~~~Dg 53 (75)
.-+|+-+.++.+|..+++
T Consensus 25 ~~wdle~Av~~y~~~~~~ 42 (43)
T PF14555_consen 25 NNWDLEAAVNAYFDDGEA 42 (43)
T ss_dssp TTT-HHHHHHHHHHSS-S
T ss_pred cCCCHHHHHHHHHhCCCC
Confidence 346899999999987765
No 57
>COG3351 FlaD Putative archaeal flagellar protein D/E [Cell motility and secretion]
Probab=26.09 E-value=63 Score=21.96 Aligned_cols=41 Identities=12% Similarity=0.219 Sum_probs=24.7
Q ss_pred CcHHHHHHHHHHHHHHHHhhccCCCCCCCchhhHHHHHHhhcc
Q 037321 29 VSDVFVGIAMNLARMSQCMYQFGEGHGHGVQEITKARVLSLIV 71 (75)
Q Consensus 29 ~p~~~~~~~~NlaR~~~~~Y~~~Dg~t~~~~~~~k~~I~sll~ 71 (75)
++..++...+.++|.+-.... .+.+..-... +.|||-||+|
T Consensus 140 Is~~v~~qLl~f~kg~~~~~~-~~~~~s~kLt-~~DHiiSLly 180 (214)
T COG3351 140 ISEKVLEQLLKFAKGTKVMND-DQEEKSIKLT-MQDHIISLLF 180 (214)
T ss_pred ccHHHHHHHHHHHhcCCCCCh-hhhhhccCCC-HHHHHHHHHH
Confidence 455667777777777655321 1233322234 8999999976
No 58
>PHA02896 A-type inclusion like protein; Provisional
Probab=25.78 E-value=81 Score=24.53 Aligned_cols=39 Identities=21% Similarity=0.265 Sum_probs=30.5
Q ss_pred hHHHHHHHHHHHHHHH-HHHHhhcCCCCCcHHHHHHHHHHHHHHH
Q 037321 2 EAREHIHDLIAQTWMK-MNRDRFVNPHFVSDVFVGIAMNLARMSQ 45 (75)
Q Consensus 2 eA~~~i~~~i~~~wK~-ln~e~l~~~~~~p~~~~~~~~NlaR~~~ 45 (75)
.-+..++.+|++.|.+ ||+ .+. +|+.-.+++=|+-|-.-
T Consensus 10 KcV~eFkSlVertWnk~Lns----~SC-IpRk~RKiIRNILR~YI 49 (616)
T PHA02896 10 KCIRKFESLIIRTWDHDLNE----RSF-LNRKDRKIIRNIFRCFI 49 (616)
T ss_pred HHHHHHHHHHHHhhCCcccc----ccC-cCHHHHHHHHHHHHHHH
Confidence 4567899999999953 333 567 99999999999999543
No 59
>PF15581 Imm35: Immunity protein 35
Probab=25.25 E-value=37 Score=20.07 Aligned_cols=17 Identities=18% Similarity=0.604 Sum_probs=11.9
Q ss_pred HHHHHHHHHHHHHHHHH
Q 037321 5 EHIHDLIAQTWMKMNRD 21 (75)
Q Consensus 5 ~~i~~~i~~~wK~ln~e 21 (75)
.-+..+|++.||-|-++
T Consensus 33 ~~l~~lIe~eWRGl~~~ 49 (93)
T PF15581_consen 33 RNLESLIEHEWRGLPEE 49 (93)
T ss_pred HHHHHHHHHHHcCCCHH
Confidence 34567899999875543
No 60
>PRK01371 sec-independent translocase; Provisional
Probab=24.94 E-value=1.2e+02 Score=19.14 Aligned_cols=20 Identities=0% Similarity=0.156 Sum_probs=17.7
Q ss_pred HHHHHHHHHHHHHHHHHHHh
Q 037321 3 AREHIHDLIAQTWMKMNRDR 22 (75)
Q Consensus 3 A~~~i~~~i~~~wK~ln~e~ 22 (75)
++..+|+++.+++.++.+|+
T Consensus 35 ~ir~~R~~~~~ak~~i~~El 54 (137)
T PRK01371 35 TLRQLREMANNARNDLRSEL 54 (137)
T ss_pred HHHHHHHHHHHHHHHHHHHh
Confidence 67889999999999999887
No 61
>KOG1766 consensus Enhancer of rudimentary [General function prediction only]
Probab=24.77 E-value=1.7e+02 Score=17.64 Aligned_cols=54 Identities=20% Similarity=0.148 Sum_probs=36.0
Q ss_pred hHHHHHHHHHHHHHHHHHHHhhc-CCCCCcHHHHHHHHHHHHHHHHhhccCCCCCCC
Q 037321 2 EAREHIHDLIAQTWMKMNRDRFV-NPHFVSDVFVGIAMNLARMSQCMYQFGEGHGHG 57 (75)
Q Consensus 2 eA~~~i~~~i~~~wK~ln~e~l~-~~~~~p~~~~~~~~NlaR~~~~~Y~~~Dg~t~~ 57 (75)
||.+-|.+|-|+.-|+.|-..-. ... ++ .+-+++=.++-++..+|+..-|-..|
T Consensus 27 e~megiCk~yEe~Lkk~nPs~~~ITYD-Is-qlfeFiD~L~DlS~lVy~~~t~tY~p 81 (104)
T KOG1766|consen 27 ECMEGICKMYEEHLKKKNPSAPPITYD-IS-QLFEFIDDLADLSMLVYNRETGTYIP 81 (104)
T ss_pred HHHHHHHHHHHHHHHhcCCCCCCccee-HH-HHHHHHHHHhhhhhhheecccccccC
Confidence 67788888888888887643221 112 33 34466667888888899887766666
No 62
>smart00398 HMG high mobility group.
Probab=24.58 E-value=56 Score=16.74 Aligned_cols=17 Identities=12% Similarity=0.489 Sum_probs=11.7
Q ss_pred HHHHHHHHHHHHHHHHH
Q 037321 5 EHIHDLIAQTWMKMNRD 21 (75)
Q Consensus 5 ~~i~~~i~~~wK~ln~e 21 (75)
..+..++...|+.|..+
T Consensus 30 ~~i~~~~~~~W~~l~~~ 46 (70)
T smart00398 30 AEISKKLGERWKLLSEE 46 (70)
T ss_pred HHHHHHHHHHHHcCCHH
Confidence 45677788888776543
No 63
>smart00872 Alpha-mann_mid Alpha mannosidase, middle domain. Members of this entry belong to the glycosyl hydrolase family 38, This domain, which is found in the central region adopts a structure consisting of three alpha helices, in an immunoglobulin/albumin-binding domain-like fold. The domain is predominantly found in the enzyme alpha-mannosidase PUBMED:12634058.
Probab=24.51 E-value=1.1e+02 Score=16.72 Aligned_cols=23 Identities=9% Similarity=-0.028 Sum_probs=10.3
Q ss_pred CcHHHHHHHHHHHHHHHHhhccCCCCC
Q 037321 29 VSDVFVGIAMNLARMSQCMYQFGEGHG 55 (75)
Q Consensus 29 ~p~~~~~~~~NlaR~~~~~Y~~~Dg~t 55 (75)
.|...++.++... ++-+.-|..+
T Consensus 42 ~~~~~l~~~wk~l----l~~q~HD~i~ 64 (79)
T smart00872 42 YPSEQLEELWKAL----LLNQFHDAIT 64 (79)
T ss_pred CcHHHHHHHHHHH----HHhcCcccCC
Confidence 4555444444322 3344456554
No 64
>PF09184 PPP4R2: PPP4R2; InterPro: IPR015267 PPP4R2 (protein phosphatase 4 core regulatory subunit R2) is the regulatory subunit of the histone H2A phosphatase complex. It has been shown to confer resistance to the anticancer drug cisplatin in yeast [], and may confer resistance in higher eukaryotes.
Probab=24.48 E-value=1.4e+02 Score=20.92 Aligned_cols=19 Identities=16% Similarity=0.237 Sum_probs=14.2
Q ss_pred HHHHHHHHHHHHHHHHHhh
Q 037321 5 EHIHDLIAQTWMKMNRDRF 23 (75)
Q Consensus 5 ~~i~~~i~~~wK~ln~e~l 23 (75)
..++.++...+..+-.+++
T Consensus 42 ~~lk~l~~~kl~~v~~e~~ 60 (288)
T PF09184_consen 42 SLLKSLFRHKLEKVIDEFF 60 (288)
T ss_pred HHHHHHHHHHHHHHHHHHH
Confidence 4677777777777777777
No 65
>KOG2204 consensus Mannosyl-oligosaccharide alpha-1,2-mannosidase and related glycosyl hydrolases [Carbohydrate transport and metabolism]
Probab=24.44 E-value=69 Score=25.07 Aligned_cols=22 Identities=18% Similarity=0.626 Sum_probs=19.0
Q ss_pred hHHHHHHHHHHHHHHHHHHHhh
Q 037321 2 EAREHIHDLIAQTWMKMNRDRF 23 (75)
Q Consensus 2 eA~~~i~~~i~~~wK~ln~e~l 23 (75)
++|++|++||..+|-.--...+
T Consensus 174 ~rrekVkemm~hAw~~Yk~yaw 195 (625)
T KOG2204|consen 174 ERREKVKEMMKHAWNNYKKYAW 195 (625)
T ss_pred HHHHHHHHHHHHHHHhhhhhcc
Confidence 6899999999999988776665
No 66
>PF05002 SGS: SGS domain ; InterPro: IPR007699 This domain was thought to be unique to the SGT1-like proteins, but is also found in calcyclin binding proteins. Sgt1p is a highly conserved eukaryotic protein that is required for both SCF (Skp1p/Cdc53p-Cullin-F-box)-mediated ubiquitination and kinetochore function in yeast and also plays a role in the cAMP pathway. Calcyclin (S100A6) is a member of the S100A family of calcium binding proteins and appears to play a role in cell proliferation [].; PDB: 1X5M_A 2JTT_D.
Probab=23.94 E-value=1.2e+02 Score=17.54 Aligned_cols=25 Identities=20% Similarity=0.377 Sum_probs=14.9
Q ss_pred HHHHHHHhhccCCCCCCCchhhHHHHHHhhcc
Q 037321 40 LARMSQCMYQFGEGHGHGVQEITKARVLSLIV 71 (75)
Q Consensus 40 laR~~~~~Y~~~Dg~t~~~~~~~k~~I~sll~ 71 (75)
+-.+++-||.+||. + +|..|..-|.
T Consensus 26 lm~lfkkiY~~gDD------e-~KRam~KSf~ 50 (82)
T PF05002_consen 26 LMNLFKKIYDNGDD------E-MKRAMMKSFT 50 (82)
T ss_dssp HHHHHHHHHTTS-S------C-HHHHHHHHHH
T ss_pred HHHHHHHHHccCCH------H-HHHHHHHHHH
Confidence 44567788998876 3 5555554443
No 67
>PF02512 UK: Virulence determinant
Probab=23.71 E-value=44 Score=19.52 Aligned_cols=21 Identities=24% Similarity=0.411 Sum_probs=13.8
Q ss_pred HHHHHHHHHHHHHhhcCCCCCcHHHH
Q 037321 9 DLIAQTWMKMNRDRFVNPHFVSDVFV 34 (75)
Q Consensus 9 ~~i~~~wK~ln~e~l~~~~~~p~~~~ 34 (75)
+-|.+-||.+|+ ... +|-.|-
T Consensus 50 ewiaeywkgi~r----gnd-vpcccp 70 (96)
T PF02512_consen 50 EWIAEYWKGIKR----GND-VPCCCP 70 (96)
T ss_pred HHHHHHHHhccc----CCC-CCccCC
Confidence 457778888887 444 665553
No 68
>PF03564 DUF1759: Protein of unknown function (DUF1759); InterPro: IPR005312 This is a small family of proteins of unknown function.
Probab=23.54 E-value=1.2e+02 Score=18.26 Aligned_cols=14 Identities=7% Similarity=0.147 Sum_probs=8.6
Q ss_pred HHHHHHHHHHhhcC
Q 037321 12 AQTWMKMNRDRFVN 25 (75)
Q Consensus 12 ~~~wK~ln~e~l~~ 25 (75)
+.+|+.|.+.|=++
T Consensus 57 ~~a~~~L~~~yg~~ 70 (145)
T PF03564_consen 57 EEAWELLEERYGNP 70 (145)
T ss_pred HHHHHHHHHHhCCc
Confidence 56677766665443
No 69
>PHA02935 Hypothetical protein; Provisional
Probab=23.52 E-value=1.1e+02 Score=21.16 Aligned_cols=49 Identities=10% Similarity=0.167 Sum_probs=34.5
Q ss_pred HHHHHHHHHHHHHHhhcCCCCCcHHHHHHHHHHHHHHHHhhccCC---CCCCC
Q 037321 8 HDLIAQTWMKMNRDRFVNPHFVSDVFVGIAMNLARMSQCMYQFGE---GHGHG 57 (75)
Q Consensus 8 ~~~i~~~wK~ln~e~l~~~~~~p~~~~~~~~NlaR~~~~~Y~~~D---g~t~~ 57 (75)
+.+.|..|.-||..-=..+. .|.+--..+-+++-|+.-+-.+.| +|+.|
T Consensus 256 rhicesewealnnnndnsss-mpashnnlandlssmmsqlqndnddnndysap 307 (349)
T PHA02935 256 RHICESEWEALNNNNDNSSS-MPASHNNLANDLSSMMSQLQNDNDDNNDYSAP 307 (349)
T ss_pred hhhhHHHHHHhhcCCCcccc-CcchhcchhhHHHHHHHHhhccCCccccccCC
Confidence 45778999999876554555 888877888888877766554443 56655
No 70
>cd00084 HMG-box High Mobility Group (HMG)-box is found in a variety of eukaryotic chromosomal proteins and transcription factors. HMGs bind to the minor groove of DNA and have been classified by DNA binding preferences. Two phylogenically distinct groups of Class I proteins bind DNA in a sequence specific fashion and contain a single HMG box. One group (SOX-TCF) includes transcription factors, TCF-1, -3, -4; and also SRY and LEF-1, which bind four-way DNA junctions and duplex DNA targets. The second group (MATA) includes fungal mating type gene products MC, MATA1 and Ste11. Class II and III proteins (HMGB-UBF) bind DNA in a non-sequence specific fashion and contain two or more tandem HMG boxes. Class II members include non-histone chromosomal proteins, HMG1 and HMG2, which bind to bent or distorted DNA such as four-way DNA junctions, synthetic DNA cruciforms, kinked cisplatin-modified DNA, DNA bulges, cross-overs in supercoiled DNA, and can cause looping of linear DNA. Class III member
Probab=23.45 E-value=72 Score=16.09 Aligned_cols=16 Identities=13% Similarity=0.586 Sum_probs=10.7
Q ss_pred HHHHHHHHHHHHHHHH
Q 037321 5 EHIHDLIAQTWMKMNR 20 (75)
Q Consensus 5 ~~i~~~i~~~wK~ln~ 20 (75)
..|..++...|+.|..
T Consensus 29 ~~i~~~~~~~W~~l~~ 44 (66)
T cd00084 29 GEISKILGEMWKSLSE 44 (66)
T ss_pred HHHHHHHHHHHHhCCH
Confidence 4567777778876543
No 71
>PF14684 Tricorn_C1: Tricorn protease C1 domain; PDB: 1N6F_D 1N6D_C 1N6E_C 1K32_A.
Probab=23.42 E-value=1.2e+02 Score=16.36 Aligned_cols=20 Identities=5% Similarity=0.332 Sum_probs=14.3
Q ss_pred HHHHHHHHHHHHHHHHHhhc
Q 037321 5 EHIHDLIAQTWMKMNRDRFV 24 (75)
Q Consensus 5 ~~i~~~i~~~wK~ln~e~l~ 24 (75)
++=+.+.++.|+.+++.++.
T Consensus 2 ~E~~~~F~~~W~~~~~~f~d 21 (70)
T PF14684_consen 2 AEWRQMFDEAWRLVRENFYD 21 (70)
T ss_dssp HHHHHHHHHHHHHHHHHSS-
T ss_pred HHHHHHHHHHHHHHHHhcCC
Confidence 34567888899888887663
No 72
>PF00505 HMG_box: HMG (high mobility group) box; InterPro: IPR000910 High mobility group (HMG or HMGB) proteins are a family of relatively low molecular weight non-histone components in chromatin. HMG1 (also called HMG-T in fish) and HMG2 are two highly related proteins that bind single-stranded DNA preferentially and unwind double-stranded DNA. Although they have no sequence specificity, they have a high affinity for bent or distorted DNA, and bend linear DNA. HMG1 and HMG2 contain two DNA-binding HMG-box domains (A and B) that show structural and functional differences, and have a long acidic C-terminal domain rich in aspartic and glutamic acid residues. The acidic tail modulates the affinity of the tandem HMG boxes in HMG1 and 2 for a variety of DNA targets. HMG1 and 2 appear to play important architectural roles in the assembly of nucleoprotein complexes in a variety of biological processes, for example V(D)J recombination, the initiation of transcription, and DNA repair []. The profile in this entry describing the HMG-domains is much more general than the signature. In addition to the HMG1 and HMG2 proteins, HMG-domains occur in single or multiple copies in the following protein classes; the SOX family of transcription factors; SRY sex determining region Y protein and related proteins []; LEF1 lymphoid enhancer binding factor 1 []; SSRP recombination signal recognition protein; MTF1 mitochondrial transcription factor 1; UBF1/2 nucleolar transcription factors; Abf2 yeast ARS-binding factor []; and Saccharomyces cerevisiae transcription factors Ixr1, Rox1, Nhp6a, Nhp6b and Spp41.; GO: 0003677 DNA binding; PDB: 1I11_A 1J3C_A 1J3D_A 1WZ6_A 1WGF_A 2D7L_A 1GT0_D 3U2B_C 2CRJ_A 2CS1_A ....
Probab=23.35 E-value=68 Score=16.58 Aligned_cols=18 Identities=22% Similarity=0.680 Sum_probs=12.9
Q ss_pred HHHHHHHHHHHHHHHHHH
Q 037321 4 REHIHDLIAQTWMKMNRD 21 (75)
Q Consensus 4 ~~~i~~~i~~~wK~ln~e 21 (75)
...|..++...|+.|-.+
T Consensus 28 ~~~i~~~~~~~W~~l~~~ 45 (69)
T PF00505_consen 28 NKEISKILAQMWKNLSEE 45 (69)
T ss_dssp HHHHHHHHHHHHHCSHHH
T ss_pred cccchhhHHHHHhcCCHH
Confidence 356778888889876543
No 73
>PTZ00195 60S ribosomal protein L18; Provisional
Probab=21.82 E-value=82 Score=21.22 Aligned_cols=37 Identities=8% Similarity=-0.032 Sum_probs=24.2
Q ss_pred HHHHHHHHhhcCCCCCcHHHHHHHHHHHHHHHHhhccCCCCCCC
Q 037321 14 TWMKMNRDRFVNPHFVSDVFVGIAMNLARMSQCMYQFGEGHGHG 57 (75)
Q Consensus 14 ~wK~ln~e~l~~~~~~p~~~~~~~~NlaR~~~~~Y~~~Dg~t~~ 57 (75)
-||.+-+..+.+.. ....+|++|+..++ +.+||.+.+
T Consensus 44 fnk~VakRL~~sr~------nRppVsLSrI~r~~-k~~d~~~~~ 80 (198)
T PTZ00195 44 FNKVVYQRLIKSRS------NRAPISLSRIAVVM-KRKAVFTAK 80 (198)
T ss_pred HHHHHHHHHhcccc------cCCceeHHHHHHHH-hcccccccc
Confidence 38888887775432 13457899998875 666665544
No 74
>PF08153 NGP1NT: NGP1NT (NUC091) domain; InterPro: IPR012971 This N-terminal domain is found in a subfamily of hypothetical nucleolar GTP-binding proteins similar to human NGP1 [].
Probab=21.69 E-value=38 Score=21.28 Aligned_cols=35 Identities=9% Similarity=0.018 Sum_probs=26.6
Q ss_pred hHHHHHHHHHHHHHHHHHHHhhcCCCCCcHHHHHHH
Q 037321 2 EAREHIHDLIAQTWMKMNRDRFVNPHFVSDVFVGIA 37 (75)
Q Consensus 2 eA~~~i~~~i~~~wK~ln~e~l~~~~~~p~~~~~~~ 37 (75)
+|.+++++-+.+.-++=-.-.|+.+. +|.++++=.
T Consensus 47 ~~L~~FReem~~~~~DPy~VlLk~~K-LPmsLL~d~ 81 (130)
T PF08153_consen 47 EALEKFREEMGEKVKDPYSVLLKRSK-LPMSLLQDS 81 (130)
T ss_pred HHHHHHHHHHHHhhcCCceEeeeCCc-CCHHHhccc
Confidence 67788888888777776666677777 999987655
No 75
>cd07023 S49_Sppa_N_C Signal peptide peptidase A (SppA), a serine protease, has catalytic Ser-Lys dyad. Signal peptide peptidase A (SppA; Peptidase S49; Protease IV): SppA is found in all three domains of life and is involved in the cleavage of signal peptides after their removal from the precursor proteins by signal peptidases. This subfamily contains members with either a single domain (sometimes referred to as 36K type), such as sohB peptidase, protein C and archaeal signal peptide peptidase, or an amino-terminal domain in addition to the carboxyl-terminal protease domain that is conserved in all the S49 family members (sometimes referred to as 67K type), similar to E. coli and Arabidopsis thaliana SppA peptidases. Site-directed mutagenesis and sequence analysis have shown these SppAs to be serine proteases. The predicted active site serine for members in this family occurs in a transmembrane domain. Mutagenesis studies also suggest that the catalytic center comprises a Ser-Lys dyad
Probab=21.57 E-value=2e+02 Score=18.49 Aligned_cols=38 Identities=11% Similarity=0.063 Sum_probs=27.1
Q ss_pred ChHHHHHHHHHHHHHHHHHHHhhcCCCCCcHHHHHHHHH
Q 037321 1 REAREHIHDLIAQTWMKMNRDRFVNPHFVSDVFVGIAMN 39 (75)
Q Consensus 1 EeA~~~i~~~i~~~wK~ln~e~l~~~~~~p~~~~~~~~N 39 (75)
++.++.++.++++.|..+-+..-+... ++..-++..++
T Consensus 142 ~~~~e~~~~~l~~~~~~f~~~Va~~R~-~~~~~~~~~~~ 179 (208)
T cd07023 142 EEERAILQALVDDIYDQFVDVVAEGRG-MSGERLDKLAD 179 (208)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCC-CCHHHHHHhcC
Confidence 357889999999999888887776555 65554444333
No 76
>PF12763 EF-hand_4: Cytoskeletal-regulatory complex EF hand; PDB: 2QPT_A 2KSP_A 2KFG_A 2JQ6_A 2KFH_A 2KFF_A 1IQ3_A 3FIA_A 2KHN_A 2KGR_A ....
Probab=21.49 E-value=89 Score=18.53 Aligned_cols=20 Identities=20% Similarity=0.265 Sum_probs=15.5
Q ss_pred HhhcCCCCCcHHHHHHHHHHH
Q 037321 21 DRFVNPHFVSDVFVGIAMNLA 41 (75)
Q Consensus 21 e~l~~~~~~p~~~~~~~~Nla 41 (75)
++|..+. +|...+..+|++|
T Consensus 33 ~~f~~S~-L~~~~L~~IW~La 52 (104)
T PF12763_consen 33 EFFMKSG-LPRDVLAQIWNLA 52 (104)
T ss_dssp HHHHHTT-SSHHHHHHHHHHH
T ss_pred HHHHHcC-CCHHHHHHHHhhh
Confidence 3454567 9999999999986
No 77
>cd05027 S-100B S-100B: S-100B domain found in proteins similar to S100B. S100B is a calcium-binding protein belonging to a large S100 vertebrate-specific protein family within the EF-hand superfamily of calcium-binding proteins. Note that the S-100 hierarchy, to which this S-100B group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately. S100B is most abundant in glial cells of the central nervous system, predominately in astrocytes. S100B is involved in signal transduction via the inhibition of protein phoshorylation, regulation of enzyme activity and by affecting the calcium homeostasis. Upon calcium binding the S100B homodimer changes conformation to expose a hydrophobic cleft, which represents the interaction site of S100B with its more than 20 known target proteins. These target proteins include several cellular architecture proteins such as tubulin and GFAP; S100B can inhibit polymerization of these oligomeric molecules. Furthermore, S100B i
Probab=21.10 E-value=1.7e+02 Score=16.46 Aligned_cols=32 Identities=16% Similarity=0.093 Sum_probs=18.2
Q ss_pred HHHHHHHHHHHhh-ccCCCCCCCchhhHHHHHHh
Q 037321 36 IAMNLARMSQCMY-QFGEGHGHGVQEITKARVLS 68 (75)
Q Consensus 36 ~~~NlaR~~~~~Y-~~~Dg~t~~~~~~~k~~I~s 68 (75)
.+..+.++.+.+= ++|||+..+..+ ++..+.+
T Consensus 6 ~~~~l~~aF~~fD~~dgdG~~I~~~e-L~~ll~~ 38 (88)
T cd05027 6 AMVALIDVFHQYSGREGDKHKLKKSE-LKELINN 38 (88)
T ss_pred HHHHHHHHHHHhcccCCCcCEECHHH-HHHHHHH
Confidence 3444555554432 467886444445 8887776
No 78
>PF10859 DUF2660: Protein of unknown function (DUF2660); InterPro: IPR022589 This is a bacterial group of proteins with unknown function found in Rickettsiales.
Probab=20.74 E-value=99 Score=17.87 Aligned_cols=13 Identities=15% Similarity=0.496 Sum_probs=10.4
Q ss_pred HHHHHHHHHHHHH
Q 037321 7 IHDLIAQTWMKMN 19 (75)
Q Consensus 7 i~~~i~~~wK~ln 19 (75)
+++.||-+||-|=
T Consensus 40 Lqe~iElSWkFLY 52 (86)
T PF10859_consen 40 LQERIELSWKFLY 52 (86)
T ss_pred HHHHHHHHHHHHH
Confidence 6788999998654
No 79
>PRK14857 tatA twin arginine translocase protein A; Provisional
Probab=20.50 E-value=1.9e+02 Score=16.91 Aligned_cols=22 Identities=5% Similarity=-0.017 Sum_probs=18.7
Q ss_pred HHHHHHHHHHHHHHHHHHHhhc
Q 037321 3 AREHIHDLIAQTWMKMNRDRFV 24 (75)
Q Consensus 3 A~~~i~~~i~~~wK~ln~e~l~ 24 (75)
++..+++.+++.+.+++++.-+
T Consensus 37 ~i~~fkk~~~~~~~e~~~~~~~ 58 (90)
T PRK14857 37 TLKGFQEASKEFENEIKREMAE 58 (90)
T ss_pred HHHHHHHHHHHHHHHHHHHHHh
Confidence 6788999999999999988753
No 80
>PF00036 EF-hand_1: EF hand; InterPro: IPR018248 Many calcium-binding proteins belong to the same evolutionary family and share a type of calcium-binding domain known as the EF-hand. This type of domain consists of a twelve residue loop flanked on both sides by a twelve residue alpha-helical domain. In an EF-hand loop the calcium ion is coordinated in a pentagonal bipyramidal configuration. The six residues involved in the binding are in positions 1, 3, 5, 7, 9 and 12; these residues are denoted by X, Y, Z, -Y, -X and -Z. The invariant Glu or Asp at position 12 provides two oxygens for liganding Ca (bidentate ligand).; PDB: 1BJF_A 1XFW_R 1XFV_O 2K0J_A 2F3Z_A 3BYA_A 1XFU_Q 2R28_B 1ZOT_B 3G43_D ....
Probab=20.41 E-value=1.1e+02 Score=13.74 Aligned_cols=26 Identities=19% Similarity=0.087 Sum_probs=16.5
Q ss_pred HHHHHhhccCCCCCCCchhhHHHHHHhh
Q 037321 42 RMSQCMYQFGEGHGHGVQEITKARVLSL 69 (75)
Q Consensus 42 R~~~~~Y~~~Dg~t~~~~~~~k~~I~sl 69 (75)
++...+=+++||+=.. .+ ++..+++|
T Consensus 4 ~~F~~~D~d~dG~I~~-~E-f~~~~~~L 29 (29)
T PF00036_consen 4 EAFREFDKDGDGKIDF-EE-FKEMMKKL 29 (29)
T ss_dssp HHHHHHSTTSSSEEEH-HH-HHHHHHHT
T ss_pred HHHHHHCCCCCCcCCH-HH-HHHHHHhC
Confidence 4455555788887655 35 77776654
No 81
>cd07022 S49_Sppa_36K_type Signal peptide peptidase A (SppA) 36K type, a serine protease, has catalytic Ser-Lys dyad. Signal peptide peptidase A (SppA; Peptidase S49; Protease IV) 36K type: SppA is found in all three domains of life and is involved in the cleavage of signal peptides after their removal from the precursor proteins by signal peptidases. Members in this subfamily are all bacterial and include sohB peptidase and protein C. These are sometimes referred to as 36K type since they contain only one domain, unlike E. coli SppA that also contains an amino-terminal domain. Site-directed mutagenesis and sequence analysis have shown these SppAs to be serine proteases. The predicted active site serine for members in this family occurs in a transmembrane domain. Mutagenesis studies also suggest that the catalytic center comprises a Ser-Lys dyad and not the usual Ser-His-Asp catalytic triad found in the majority of serine proteases.
Probab=20.37 E-value=2.6e+02 Score=18.19 Aligned_cols=31 Identities=16% Similarity=0.139 Sum_probs=24.8
Q ss_pred ChHHHHHHHHHHHHHHHHHHHhhcCCCCCcHH
Q 037321 1 REAREHIHDLIAQTWMKMNRDRFVNPHFVSDV 32 (75)
Q Consensus 1 EeA~~~i~~~i~~~wK~ln~e~l~~~~~~p~~ 32 (75)
++.++.++.++++.|...-+..-+... .+..
T Consensus 149 ~~~re~~~~~l~~~~~~f~~~V~~~R~-~~~~ 179 (214)
T cd07022 149 DEARARLQAEVDALYAMFVAAVARNRG-LSAA 179 (214)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhCC-CCHH
Confidence 468899999999999999888876555 5544
No 82
>PHA03369 capsid maturational protease; Provisional
Probab=20.28 E-value=2e+02 Score=22.85 Aligned_cols=23 Identities=13% Similarity=0.323 Sum_probs=18.5
Q ss_pred HHHHHHHHHHHHHHHHhhccCCC
Q 037321 31 DVFVGIAMNLARMSQCMYQFGEG 53 (75)
Q Consensus 31 ~~~~~~~~NlaR~~~~~Y~~~Dg 53 (75)
..+.+.+.|+-|++..+|++.|+
T Consensus 299 e~~~~ql~~~~k~l~~~~~~kde 321 (663)
T PHA03369 299 EALADQLNNLYKLLRTIYKHKDE 321 (663)
T ss_pred HHHHHHHHHHHHHHHHhccCccc
Confidence 34566778888999999999887
No 83
>COG2900 SlyX Uncharacterized protein conserved in bacteria [Function unknown]
Probab=20.27 E-value=1.6e+02 Score=16.69 Aligned_cols=21 Identities=14% Similarity=0.327 Sum_probs=16.4
Q ss_pred ChHHHHHHHHHHHHHHHHHHH
Q 037321 1 REAREHIHDLIAQTWMKMNRD 21 (75)
Q Consensus 1 EeA~~~i~~~i~~~wK~ln~e 21 (75)
|.+++++...+.+.|+.+-+-
T Consensus 21 E~tieeLn~~laEq~~~i~k~ 41 (72)
T COG2900 21 EQTIEELNDALAEQQLVIDKL 41 (72)
T ss_pred HHHHHHHHHHHHHHHHHHHHH
Confidence 457788889999999887653
Done!