Query 038921
Match_columns 197
No_of_seqs 128 out of 224
Neff 3.9
Searched_HMMs 46136
Date Fri Mar 29 04:35:17 2013
Command hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/038921.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/038921hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF03791 KNOX2: KNOX2 domain ; 100.0 1E-28 2.2E-33 171.5 6.9 51 138-188 2-52 (52)
2 PF03790 KNOX1: KNOX1 domain ; 99.9 4.5E-24 9.7E-29 144.5 4.0 44 87-130 1-44 (45)
3 KOG0773 Transcription factor M 98.3 1.9E-07 4.1E-12 83.1 1.6 105 87-191 50-164 (342)
4 PF11348 DUF3150: Protein of u 65.2 9.7 0.00021 33.8 4.2 46 105-164 61-118 (257)
5 PF03792 PBC: PBC domain; Int 63.0 47 0.001 29.0 7.8 41 87-127 27-78 (191)
6 PF06295 DUF1043: Protein of u 61.1 15 0.00032 29.3 4.2 35 152-186 35-69 (128)
7 KOG0774 Transcription factor P 60.6 65 0.0014 30.0 8.6 79 85-163 27-125 (334)
8 PF11288 DUF3089: Protein of u 60.5 3.1 6.7E-05 36.2 0.2 30 85-118 109-138 (207)
9 COG3105 Uncharacterized protei 54.8 33 0.00073 28.5 5.3 42 146-187 38-79 (138)
10 KOG4420 Uncharacterized conser 46.4 27 0.00059 32.4 3.9 49 148-196 177-241 (325)
11 PF05190 MutS_IV: MutS family 44.8 23 0.00051 24.9 2.7 25 142-166 1-25 (92)
12 PF13097 CENP-U: CENP-A nucleo 44.1 69 0.0015 27.6 5.8 42 145-186 104-147 (175)
13 PRK11677 hypothetical protein; 41.8 87 0.0019 25.7 5.8 42 145-186 32-73 (134)
14 PF07425 Pardaxin: Pardaxin; 37.8 22 0.00048 22.7 1.4 19 89-107 5-23 (33)
15 PF11014 DUF2852: Protein of u 36.2 12 0.00026 30.2 0.0 35 117-153 78-112 (115)
16 PF05016 Plasmid_stabil: Plasm 35.5 1E+02 0.0022 21.1 4.7 22 171-192 24-45 (90)
17 KOG4460 Nuclear pore complex, 32.8 79 0.0017 32.2 5.0 37 148-187 569-605 (741)
18 PF15225 IL32: Interleukin 32 32.2 2.6E+02 0.0056 22.2 7.3 50 118-169 2-62 (104)
19 PF02290 SRP14: Signal recogni 30.4 41 0.00088 25.6 2.1 18 143-160 71-88 (93)
20 PF03401 TctC: Tripartite tric 29.4 1.1E+02 0.0023 26.8 4.8 18 111-128 211-228 (274)
21 cd03204 GST_C_GDAP1 GST_C fami 28.2 96 0.0021 24.0 3.9 33 152-184 5-46 (111)
22 KOG1070 rRNA processing protei 28.1 1.7E+02 0.0038 32.9 6.8 91 86-185 1444-1538(1710)
23 PF12057 DUF3538: Domain of un 26.4 1.2E+02 0.0025 24.8 4.1 41 147-187 15-69 (120)
24 PF13713 BRX_N: Transcription 26.0 75 0.0016 21.1 2.5 20 170-189 8-27 (39)
25 KOG4445 Uncharacterized conser 24.2 1.6E+02 0.0034 28.0 5.1 43 146-188 135-179 (368)
26 KOG2828 Acetyl-CoA hydrolase [ 24.1 42 0.00091 32.6 1.4 23 92-114 429-451 (454)
27 TIGR00270 conserved hypothetic 23.7 97 0.0021 25.6 3.3 43 145-187 61-103 (154)
28 PF01563 Alpha_E3_glycop: Alph 22.5 45 0.00098 23.9 1.0 23 89-111 28-50 (56)
29 COG5661 Predicted secreted Zn- 21.8 2.3E+02 0.005 25.1 5.3 42 142-186 109-150 (210)
30 PF10112 Halogen_Hydrol: 5-bro 21.3 2.2E+02 0.0048 23.6 5.0 43 145-187 125-173 (199)
31 PHA03397 vlf-1 very late expre 21.3 67 0.0014 30.3 2.1 29 86-114 12-40 (363)
32 cd03209 GST_C_Mu GST_C family, 20.9 2.8E+02 0.0061 20.4 5.1 14 171-184 39-52 (121)
33 PF13865 FoP_duplication: C-te 20.6 65 0.0014 23.4 1.6 8 144-151 46-53 (74)
34 PF10152 DUF2360: Predicted co 20.6 2.9E+02 0.0062 22.5 5.5 42 147-188 1-49 (148)
35 PF14689 SPOB_a: Sensor_kinase 20.6 3E+02 0.0065 19.0 5.5 37 151-187 9-57 (62)
No 1
>PF03791 KNOX2: KNOX2 domain ; InterPro: IPR005541 The MEINOX region is comprised of two domains, KNOX1 and KNOX2. KNOX1 plays a role in suppressing target gene expression. KNOX2, essential for function, is thought to be necessary for homo-dimerization [].; GO: 0003677 DNA binding, 0005634 nucleus
Probab=99.95 E-value=1e-28 Score=171.48 Aligned_cols=51 Identities=55% Similarity=0.935 Sum_probs=49.3
Q ss_pred CCCCCCCChhhhHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHHHHHHhhc
Q 038921 138 PTCFGADPELDEFMETYSDMLAKYKSDLARPFDEATTFLNKVELQLRNLCT 188 (197)
Q Consensus 138 ~~~~g~DPELDqFMeaYc~mL~kykEEL~rPf~EA~~F~~~iE~QL~~L~~ 188 (197)
++++|+|||||||||+||.||+||||||+|||+||++|||+||+||++|||
T Consensus 2 ~~~~~~dpELDqFMeaYc~~L~kykeeL~~p~~EA~~f~~~ie~qL~~Lt~ 52 (52)
T PF03791_consen 2 SSSIGADPELDQFMEAYCDMLVKYKEELQRPFQEAMEFCREIEQQLSSLTG 52 (52)
T ss_pred CCCCCCCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC
Confidence 567899999999999999999999999999999999999999999999996
No 2
>PF03790 KNOX1: KNOX1 domain ; InterPro: IPR005540 The MEINOX region is comprised of two domains, KNOX1 and KNOX2. KNOX1 plays a role in suppressing target gene expression. KNOX2, essential for function, is thought to be necessary for homo-dimerization [].; GO: 0003677 DNA binding, 0005634 nucleus
Probab=99.89 E-value=4.5e-24 Score=144.49 Aligned_cols=44 Identities=70% Similarity=1.090 Sum_probs=41.5
Q ss_pred HHHHHHHhhCCChHHHHHHHHhccccCCchhhHhHHHHHHhhhh
Q 038921 87 TVIKAKIASHPCYPRLLEAYIDCQKVGAPPEIAYLLDAIRSEKD 130 (197)
Q Consensus 87 ~~iKAkI~sHPlYp~LL~AyidC~KVGAPpev~~~Ldei~~~~~ 130 (197)
+.|||||++||+||+||+|||+|||||||||++++||++.++++
T Consensus 1 e~iKA~I~~HP~Y~~Ll~Ayi~C~KVGAP~e~~~~L~e~~~~~~ 44 (45)
T PF03790_consen 1 EAIKAKIASHPLYPRLLAAYIDCQKVGAPPEVVARLDEILAESQ 44 (45)
T ss_pred ChHHHHHHcCCCcHHHHHHHHHHHhcCCCHHHHHHHHHHHHHhc
Confidence 36999999999999999999999999999999999999998764
No 3
>KOG0773 consensus Transcription factor MEIS1 and related HOX domain proteins [Transcription]
Probab=98.32 E-value=1.9e-07 Score=83.08 Aligned_cols=105 Identities=15% Similarity=0.099 Sum_probs=86.9
Q ss_pred HHHHHHHhhCCChHHHHHHHHhccccCCchhhHhHHHHHHhhhhhh--cc-CCC-----CCCCCCCChhhhHHHHHHHHH
Q 038921 87 TVIKAKIASHPCYPRLLEAYIDCQKVGAPPEIAYLLDAIRSEKDLY--RR-DSV-----PTCFGADPELDEFMETYSDML 158 (197)
Q Consensus 87 ~~iKAkI~sHPlYp~LL~AyidC~KVGAPpev~~~Ldei~~~~~~~--~~-~~~-----~~~~g~DPELDqFMeaYc~mL 158 (197)
..+|+.+.+||+|..++.||++|++++.+.+.+.+.++..+..... ++ +++ ....+.+++|+.||..|+.+|
T Consensus 50 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~s~~~~~~~~~~~~~~~~~k~~~~l 129 (342)
T KOG0773|consen 50 ASSKYLTAAQELLDEFCSAGLDCLKGKMPYDPVPRSPASLSPPEDKGARRGNATRESATLKAWLEEHRLNPYPSKLEKIL 129 (342)
T ss_pred ccccccccchhHHhHHhhccccccccccCcCccccccccccCccccccccccccccccccccchhhhhhccCchHHHHHH
Confidence 3589999999999999999999999999999998877665544321 11 111 124568999999999999999
Q ss_pred HHHHHHhhchH--HHHHHHHHHHHHHHHHhhcCCC
Q 038921 159 AKYKSDLARPF--DEATTFLNKVELQLRNLCTTTA 191 (197)
Q Consensus 159 ~kykEEL~rPf--~EA~~F~~~iE~QL~~L~~~~~ 191 (197)
..++..|+..+ ++++.+.++++..+...++.+.
T Consensus 130 l~~~~~~~~~~~~~~~~~a~r~~~~~~~~~~~~~~ 164 (342)
T KOG0773|consen 130 LAVITKLTLTQVSTWFANARRRLKKELKMTWGPTP 164 (342)
T ss_pred HHHHHHhhhhhHHHHHHHHHHHHHhccCCCCCCcc
Confidence 99999999998 7999999999999999887653
No 4
>PF11348 DUF3150: Protein of unknown function (DUF3150); InterPro: IPR021496 This bacterial family of proteins with unknown function appears to be restricted to Proteobacteria.
Probab=65.24 E-value=9.7 Score=33.82 Aligned_cols=46 Identities=28% Similarity=0.498 Sum_probs=30.5
Q ss_pred HHHhccccCC--------chhh----HhHHHHHHhhhhhhccCCCCCCCCCCChhhhHHHHHHHHHHHHHHH
Q 038921 105 AYIDCQKVGA--------PPEI----AYLLDAIRSEKDLYRRDSVPTCFGADPELDEFMETYSDMLAKYKSD 164 (197)
Q Consensus 105 AyidC~KVGA--------Ppev----~~~Ldei~~~~~~~~~~~~~~~~g~DPELDqFMeaYc~mL~kykEE 164 (197)
|.=-|.|||. |.+. ..+|++++.++ +.+.++|+..|=..+..+..+
T Consensus 61 A~r~~~~~G~rFlgG~aVP~~~~~~l~~~L~~i~~eF--------------~~~k~~Fl~~Yd~~i~~w~~~ 118 (257)
T PF11348_consen 61 AERLCLKVGVRFLGGYAVPEDKAEELAEELEDIKTEF--------------EQEKQDFLANYDQAIEEWIDR 118 (257)
T ss_pred HHHHHHHcCCcccceeEcCHHHHHHHHHHHHHHHHHH--------------HHHHHHHHHHHHHHHHHHHHH
Confidence 4456899998 4443 44444555544 345678888888888887777
No 5
>PF03792 PBC: PBC domain; InterPro: IPR005542 Pbx proteins are members of the TALE (three-amino-acid loop extension) family of atypical homeodomain proteins, whose members are characterised by a three-residue insertion in the first helix of the homeodomain involved in their interaction with Hox proteins. Examination of Pbx1 has shown that, in addition to the homeodomain, a short 16-residue C-terminal tail is essential for maximal cooperative interactions with Hox partners as well as for maximal monomeric binding of Pbx1 to DNA. The PBX domain is a bipartite acidic domain [].; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0005634 nucleus
Probab=62.99 E-value=47 Score=29.02 Aligned_cols=41 Identities=22% Similarity=0.306 Sum_probs=31.2
Q ss_pred HHHHHHHhhCCChHHHHHHHHhcc-ccCC---------chh-hHhHHHHHHh
Q 038921 87 TVIKAKIASHPCYPRLLEAYIDCQ-KVGA---------PPE-IAYLLDAIRS 127 (197)
Q Consensus 87 ~~iKAkI~sHPlYp~LL~AyidC~-KVGA---------Ppe-v~~~Ldei~~ 127 (197)
...|-.|-+||+||.|-+..++-. |++- ||| ++.|||.+-.
T Consensus 27 qa~K~~l~~hr~k~ALfsVLcE~KEkt~LSir~~qee~p~dpQl~RLDNML~ 78 (191)
T PF03792_consen 27 QARKHALNCHRMKPALFSVLCEIKEKTVLSIRNIQEEDPPDPQLMRLDNMLL 78 (191)
T ss_pred HHhchhhcCCCCchhhHHHHHHHHhhcCccccccCCcCCCchhhhhhhcchh
Confidence 789999999999999999987765 2221 444 7788887644
No 6
>PF06295 DUF1043: Protein of unknown function (DUF1043); InterPro: IPR009386 This entry consists of several hypothetical bacterial proteins of unknown function.
Probab=61.09 E-value=15 Score=29.29 Aligned_cols=35 Identities=20% Similarity=0.436 Sum_probs=29.1
Q ss_pred HHHHHHHHHHHHHhhchHHHHHHHHHHHHHHHHHh
Q 038921 152 ETYSDMLAKYKSDLARPFDEATTFLNKVELQLRNL 186 (197)
Q Consensus 152 eaYc~mL~kykEEL~rPf~EA~~F~~~iE~QL~~L 186 (197)
+.=-.-|..||.++..+|..+...+++|...-+.|
T Consensus 35 ~~~k~el~~yk~~V~~HF~~ta~Ll~~l~~~Y~~l 69 (128)
T PF06295_consen 35 EQAKQELEQYKQEVNDHFAQTAELLDNLTQDYQKL 69 (128)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34455689999999999999999999998877665
No 7
>KOG0774 consensus Transcription factor PBX and related HOX domain proteins [Transcription]
Probab=60.60 E-value=65 Score=30.01 Aligned_cols=79 Identities=20% Similarity=0.329 Sum_probs=46.7
Q ss_pred hhHHHHHHHhhCCChHHHHHHHHhcc-ccC---------Cchh-hHhHHHHHHhhhhhhccC----CCCCCCCCC-----
Q 038921 85 VSTVIKAKIASHPCYPRLLEAYIDCQ-KVG---------APPE-IAYLLDAIRSEKDLYRRD----SVPTCFGAD----- 144 (197)
Q Consensus 85 ~s~~iKAkI~sHPlYp~LL~AyidC~-KVG---------APpe-v~~~Ldei~~~~~~~~~~----~~~~~~g~D----- 144 (197)
+..+.|-.|-+||+||.|.+--++-. |++ .||| ++-|||.+.-.-.+..+. ......|.|
T Consensus 27 eaqa~K~~lnch~mk~AlfsVLcE~KeKt~lsir~~qdeep~dpqlmRLDnML~AEGVagPekgga~~~~Asgg~hsdYR 106 (334)
T KOG0774|consen 27 EAQARKHALNCHRMKPALFSVLCEIKEKTVLSIRGMQDEEPPDPQLMRLDNMLLAEGVAGPEKGGARAAAASGGDHSDYR 106 (334)
T ss_pred hHHhhhhccccccchHHHHHHHHHhhhhheeeeccccccCCCChHHHHHHHHHHHhcccCccccchhhhhccCCChHHHH
Confidence 34589999999999999998776543 222 3777 899999864432222211 001111222
Q ss_pred ChhhhHHHHHHHHHHHHHH
Q 038921 145 PELDEFMETYSDMLAKYKS 163 (197)
Q Consensus 145 PELDqFMeaYc~mL~kykE 163 (197)
..|-|--.-|-..|.||.+
T Consensus 107 ~kL~qiR~iy~~Elekyeq 125 (334)
T KOG0774|consen 107 AKLLQIRQIYHNELEKYEQ 125 (334)
T ss_pred HHHHHHHHHHHHHHHHHHH
Confidence 3455555666666666654
No 8
>PF11288 DUF3089: Protein of unknown function (DUF3089); InterPro: IPR021440 This family of proteins has no known function.
Probab=60.46 E-value=3.1 Score=36.18 Aligned_cols=30 Identities=33% Similarity=0.658 Sum_probs=27.1
Q ss_pred hhHHHHHHHhhCCChHHHHHHHHhccccCCchhh
Q 038921 85 VSTVIKAKIASHPCYPRLLEAYIDCQKVGAPPEI 118 (197)
Q Consensus 85 ~s~~iKAkI~sHPlYp~LL~AyidC~KVGAPpev 118 (197)
....||.+|..+|+..+||+||+ ||.|.-+
T Consensus 109 l~~LL~e~~~~~pl~~rLVAAYl----iG~~v~~ 138 (207)
T PF11288_consen 109 LLRLLKEEIAGDPLRKRLVAAYL----IGYPVTV 138 (207)
T ss_pred HHHHHHHHhcCchHHhhhheeee----cCccccH
Confidence 36789999999999999999999 9998765
No 9
>COG3105 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=54.78 E-value=33 Score=28.54 Aligned_cols=42 Identities=19% Similarity=0.400 Sum_probs=34.4
Q ss_pred hhhhHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHHHHHHhh
Q 038921 146 ELDEFMETYSDMLAKYKSDLARPFDEATTFLNKVELQLRNLC 187 (197)
Q Consensus 146 ELDqFMeaYc~mL~kykEEL~rPf~EA~~F~~~iE~QL~~L~ 187 (197)
-|-.=|+.--.-|..||+||.+||.+..+.+++|-+.-+.|-
T Consensus 38 ~~q~ELe~~K~~ld~~rqel~~HFa~sAeLlktl~~dYqkly 79 (138)
T COG3105 38 KLQYELEKVKAQLDEYRQELVKHFARSAELLKTLAQDYQKLY 79 (138)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344446777788999999999999999999999988777663
No 10
>KOG4420 consensus Uncharacterized conserved protein (Ganglioside-induced differentiation associated protein 1, GDAP1) [Function unknown]
Probab=46.36 E-value=27 Score=32.44 Aligned_cols=49 Identities=27% Similarity=0.411 Sum_probs=38.8
Q ss_pred hhHHHHHHHHHHHHHHHhhc---------hHHHHHHHHHHHHHHHHH-------hhcCCCccccC
Q 038921 148 DEFMETYSDMLAKYKSDLAR---------PFDEATTFLNKVELQLRN-------LCTTTASVRSL 196 (197)
Q Consensus 148 DqFMeaYc~mL~kykEEL~r---------Pf~EA~~F~~~iE~QL~~-------L~~~~~s~~~~ 196 (197)
++--++|...+.|....|-+ -.+|-.+|+..+|+.|.. |||...|+..|
T Consensus 177 pdla~ay~akqkkl~~kl~~hdd~s~lkkild~l~~~Ld~VEteLe~r~~~~~wL~G~efslADV 241 (325)
T KOG4420|consen 177 PDLAEAYLAKQKKLMAKLLEHDDVSYLKKILDELAMVLDQVETELEKRKLCELWLCGCEFSLADV 241 (325)
T ss_pred chhhHHHHHHHHHHHHHHHhcccHHHHHHHHHHHHHHHHHHHHHHhhccccceeeccccchHHHH
Confidence 44458999888888766554 468999999999999998 78888776554
No 11
>PF05190 MutS_IV: MutS family domain IV C-terminus.; InterPro: IPR007861 Mismatch repair contributes to the overall fidelity of DNA replication and is essential for combating the adverse effects of damage to the genome. It involves the correction of mismatched base pairs that have been missed by the proofreading element of the DNA polymerase complex. The post-replicative Mismatch Repair System (MMRS) of Escherichia coli involves MutS (Mutator S), MutL and MutH proteins, and acts to correct point mutations or small insertion/deletion loops produced during DNA replication []. MutS and MutL are involved in preventing recombination between partially homologous DNA sequences. The assembly of MMRS is initiated by MutS, which recognises and binds to mispaired nucleotides and allows further action of MutL and MutH to eliminate a portion of newly synthesized DNA strand containing the mispaired base []. MutS can also collaborate with methyltransferases in the repair of O(6)-methylguanine damage, which would otherwise pair with thymine during replication to create an O(6)mG:T mismatch []. MutS exists as a dimer, where the two monomers have different conformations and form a heterodimer at the structural level []. Only one monomer recognises the mismatch specifically and has ADP bound. Non-specific major groove DNA-binding domains from both monomers embrace the DNA in a clamp-like structure. Mismatch binding induces ATP uptake and a conformational change in the MutS protein, resulting in a clamp that translocates on DNA. MutS is a modular protein with a complex structure [], and is composed of: N-terminal mismatch-recognition domain, which is similar in structure to tRNA endonuclease. Connector domain, which is similar in structure to Holliday junction resolvase ruvC. Core domain, which is composed of two separate subdomains that join together to form a helical bundle; from within the core domain, two helices act as levers that extend towards (but do not touch) the DNA. Clamp domain, which is inserted between the two subdomains of the core domain at the top of the lever helices; the clamp domain has a beta-sheet structure. ATPase domain (connected to the core domain), which has a classical Walker A motif. HTH (helix-turn-helix) domain, which is involved in dimer contacts. The MutS family of proteins is named after the Salmonella typhimurium MutS protein involved in mismatch repair. Homologues of MutS have been found in many species including eukaryotes (MSH 1, 2, 3, 4, 5, and 6 proteins), archaea and bacteria, and together these proteins have been grouped into the MutS family. Although many of these proteins have similar activities to the E. coli MutS, there is significant diversity of function among the MutS family members. Human MSH has been implicated in non-polyposis colorectal carcinoma (HNPCC) and is a mismatch binding protein [].This diversity is even seen within species, where many species encode multiple MutS homologues with distinct functions []. Inter-species homologues may have arisen through frequent ancient horizontal gene transfer of MutS (and MutL) from bacteria to archaea and eukaryotes via endosymbiotic ancestors of mitochondria and chloroplasts []. This entry represents the clamp domain (domain 4) found in proteins of the MutS family. The clamp domain is inserted within the core domain at the top of the lever helices. It has a beta-sheet structure [].; GO: 0005524 ATP binding, 0030983 mismatched DNA binding, 0006298 mismatch repair; PDB: 2WTU_A 1OH7_A 1OH5_B 1W7A_B 1NG9_A 1OH8_B 1WBD_A 1WB9_A 3K0S_A 1OH6_A ....
Probab=44.83 E-value=23 Score=24.87 Aligned_cols=25 Identities=36% Similarity=0.550 Sum_probs=18.5
Q ss_pred CCCChhhhHHHHHHHHHHHHHHHhh
Q 038921 142 GADPELDEFMETYSDMLAKYKSDLA 166 (197)
Q Consensus 142 g~DPELDqFMeaYc~mL~kykEEL~ 166 (197)
|-|++||+-.+.|.++.....+.+.
T Consensus 1 g~d~~Ld~~~~~~~~~~~~l~~~~~ 25 (92)
T PF05190_consen 1 GFDEELDELREEYEEIEEELEELLE 25 (92)
T ss_dssp TSSHHHHHHHHHHHHHHHHHHHHHH
T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHH
Confidence 4689999999998887666554443
No 12
>PF13097 CENP-U: CENP-A nucleosome associated complex (NAC) subunit
Probab=44.12 E-value=69 Score=27.59 Aligned_cols=42 Identities=21% Similarity=0.405 Sum_probs=36.2
Q ss_pred ChhhhHHHHHHHHHHHHHHHhhchH-HHHH-HHHHHHHHHHHHh
Q 038921 145 PELDEFMETYSDMLAKYKSDLARPF-DEAT-TFLNKVELQLRNL 186 (197)
Q Consensus 145 PELDqFMeaYc~mL~kykEEL~rPf-~EA~-~F~~~iE~QL~~L 186 (197)
-|||=...++-.++.+||+.+.-.+ .+|+ .|+..+..||-.+
T Consensus 104 tELDVvL~~FEk~~~eYkq~ieS~~cr~AI~~F~~~~keqL~~~ 147 (175)
T PF13097_consen 104 TELDVVLSAFEKTALEYKQSIESKICRKAINKFYSNFKEQLIEM 147 (175)
T ss_pred hHHHHHHHHHHHHHHHHHHhhccHHHHHHHHHHHHHHHHHHHHH
Confidence 4999999999999999999998876 6664 6999999888654
No 13
>PRK11677 hypothetical protein; Provisional
Probab=41.79 E-value=87 Score=25.67 Aligned_cols=42 Identities=21% Similarity=0.392 Sum_probs=32.4
Q ss_pred ChhhhHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHHHHHHh
Q 038921 145 PELDEFMETYSDMLAKYKSDLARPFDEATTFLNKVELQLRNL 186 (197)
Q Consensus 145 PELDqFMeaYc~mL~kykEEL~rPf~EA~~F~~~iE~QL~~L 186 (197)
.+|.+=.+.=-.-|..||+|+..||.++..-+++|-.+-+.|
T Consensus 32 ~~le~eLe~~k~ele~YkqeV~~HFa~TA~Ll~~L~~~Y~~L 73 (134)
T PRK11677 32 QALQYELEKNKAELEEYRQELVSHFARSAELLDTMAKDYRQL 73 (134)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344444455556799999999999999999999888776655
No 14
>PF07425 Pardaxin: Pardaxin; InterPro: IPR009990 This family consists of several Pardaxin proteins. Pardaxin, a 33-amino-acid pore-forming polypeptide toxin isolated from the Red Sea Moses sole Pardachirus marmoratus, has a helix-hinge-helix structure. This is a common structural motif found both in antibacterial peptides that can act selectively on bacterial membranes (e.g., cecropin), and in cytotoxic peptides that can lyse both mammalian and bacterial cells (e.g., melittin). Pardaxin possesses a high antibacterial activity with a significantly reduced haemolytic activity towards human red blood cells compared with melittin []. Pardaxin has also been found to have a shark repellent action [].; GO: 0005576 extracellular region; PDB: 1XC0_A 2KNS_A.
Probab=37.78 E-value=22 Score=22.67 Aligned_cols=19 Identities=37% Similarity=0.436 Sum_probs=15.7
Q ss_pred HHHHHhhCCChHHHHHHHH
Q 038921 89 IKAKIASHPCYPRLLEAYI 107 (197)
Q Consensus 89 iKAkI~sHPlYp~LL~Ayi 107 (197)
+--||++-|++..||+|--
T Consensus 5 lipkiissplfktllsavg 23 (33)
T PF07425_consen 5 LIPKIISSPLFKTLLSAVG 23 (33)
T ss_dssp CHHHHCCTTTCHHHHHHHH
T ss_pred hhhHHHccHHHHHHHHHHH
Confidence 3468999999999998843
No 15
>PF11014 DUF2852: Protein of unknown function (DUF2852); InterPro: IPR021273 This bacterial family of proteins has no known function.
Probab=36.20 E-value=12 Score=30.19 Aligned_cols=35 Identities=17% Similarity=0.185 Sum_probs=21.2
Q ss_pred hhHhHHHHHHhhhhhhccCCCCCCCCCCChhhhHHHH
Q 038921 117 EIAYLLDAIRSEKDLYRRDSVPTCFGADPELDEFMET 153 (197)
Q Consensus 117 ev~~~Ldei~~~~~~~~~~~~~~~~g~DPELDqFMea 153 (197)
+.+.+||+.+++++.+-..+ ..-.+..|.|+||..
T Consensus 78 ~tL~RLEeEq~eF~~Fl~rL--R~AKDk~EFD~FM~~ 112 (115)
T PF11014_consen 78 DTLRRLEEEQREFEDFLERL--RRAKDKEEFDQFMAE 112 (115)
T ss_pred HHHHHHHHHHHHHHHHHHHH--HHhhhHHHHHHHHHh
Confidence 46778888888775431110 001245799999974
No 16
>PF05016 Plasmid_stabil: Plasmid stabilisation system protein; InterPro: IPR007712 Members of this family are involved in plasmid stabilisation. The exact molecular function of this protein is not known.; PDB: 3KXE_A 1WMI_A 3KIQ_y 3KIX_y 2KC9_A 3KHA_B 3KIS_y 3KIU_y 2KC8_A 1Z8M_A ....
Probab=35.49 E-value=1e+02 Score=21.07 Aligned_cols=22 Identities=18% Similarity=0.377 Sum_probs=18.9
Q ss_pred HHHHHHHHHHHHHHHhhcCCCc
Q 038921 171 EATTFLNKVELQLRNLCTTTAS 192 (197)
Q Consensus 171 EA~~F~~~iE~QL~~L~~~~~s 192 (197)
-|..|...|+.++..|...+..
T Consensus 24 ~a~~~~~~i~~~i~~l~~~P~~ 45 (90)
T PF05016_consen 24 AAERFIDAIEKAIERLSENPEM 45 (90)
T ss_dssp HHHHHHHHHHHHHHHHHCCCCT
T ss_pred HHHHHHHHHHHHHHhccccccc
Confidence 4999999999999999876643
No 17
>KOG4460 consensus Nuclear pore complex, Nup88/rNup84 component [Nuclear structure; Intracellular trafficking, secretion, and vesicular transport]
Probab=32.80 E-value=79 Score=32.24 Aligned_cols=37 Identities=22% Similarity=0.278 Sum_probs=31.0
Q ss_pred hhHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHHHHHHhh
Q 038921 148 DEFMETYSDMLAKYKSDLARPFDEATTFLNKVELQLRNLC 187 (197)
Q Consensus 148 DqFMeaYc~mL~kykEEL~rPf~EA~~F~~~iE~QL~~L~ 187 (197)
.-|-|.|..-=.+-|+|++|++ ...|+.+++||+.|.
T Consensus 569 ~vfrEqYi~~~dlV~~e~qrH~---~~l~~~k~~QlQ~l~ 605 (741)
T KOG4460|consen 569 QVFREQYILKQDLVKEEIQRHV---KLLCDQKKKQLQDLS 605 (741)
T ss_pred HHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHH
Confidence 4588888888888899999986 467899999999885
No 18
>PF15225 IL32: Interleukin 32
Probab=32.22 E-value=2.6e+02 Score=22.19 Aligned_cols=50 Identities=22% Similarity=0.248 Sum_probs=31.8
Q ss_pred hHhHHHHHHhhhhhh-ccCCCCCCCCCCChh----------hhHHHHHHHHHHHHHHHhhchH
Q 038921 118 IAYLLDAIRSEKDLY-RRDSVPTCFGADPEL----------DEFMETYSDMLAKYKSDLARPF 169 (197)
Q Consensus 118 v~~~Ldei~~~~~~~-~~~~~~~~~g~DPEL----------DqFMeaYc~mL~kykEEL~rPf 169 (197)
..++|++.+++...+ +|+. .--.+||++ |.-|+-+|.||+....-+++-+
T Consensus 2 l~PLLeeeRd~LR~RgnRsp--vp~~EDp~te~~eP~Esf~dkvmR~FqamlqrLQ~ww~~vl 62 (104)
T PF15225_consen 2 LTPLLEEERDELRCRGNRSP--VPDVEDPATEVWEPGESFCDKVMRWFQAMLQRLQTWWQAVL 62 (104)
T ss_pred CccccHHHHHHHHhccCCCC--CCCCCCcccccCCcchhHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 456788888775432 2221 111245544 6789999999999887777664
No 19
>PF02290 SRP14: Signal recognition particle 14kD protein; InterPro: IPR003210 The signal recognition particle (SRP) is a multimeric protein, which along with its conjugate receptor (SR), is involved in targeting secretory proteins to the rough endoplasmic reticulum (RER) membrane in eukaryotes, or to the plasma membrane in prokaryotes [, ]. SRP recognises the signal sequence of the nascent polypeptide on the ribosome, retards its elongation, and docks the SRP-ribosome-polypeptide complex to the RER membrane via the SR receptor. Eukaryotic SRP consists of six polypeptides (SRP9, SRP14, SRP19, SRP54, SRP68 and SRP72) and a single 300 nucleotide 7S RNA molecule. The RNA component catalyses the interaction of SRP with its SR receptor []. In higher eukaryotes, the SRP complex consists of the Alu domain and the S domain linked by the SRP RNA. The Alu domain consists of a heterodimer of SRP9 and SRP14 bound to the 5' and 3' terminal sequences of SRP RNA. This domain is necessary for retarding the elongation of the nascent polypeptide chain, which gives SRP time to dock the ribosome-polypeptide complex to the RER membrane. In archaea, the SRP complex contains 7S RNA like its eukaryotic counterpart, yet only includes two of the six protein subunits found in the eukarytic complex: SRP19 and SRP54 []. This entry represents the 14 kDa SRP14 component. Both SRP9 and SRP14 have the same (beta)-alpha-beta(3)-alpha fold. The heterodimer has pseudo two-fold symmetry and is saddle-like, consisting of a curved six-stranded beta-sheet that has four helices packed on the convex side and an exposed concave surface lined with positively charged residues. The SRP9/SRP14 heterodimer is essential for SRP RNA binding, mediating the pausing of synthesis of ribosome associated nascent polypeptides that have been engaged by the targeting domain of SRP [].; GO: 0008312 7S RNA binding, 0030942 endoplasmic reticulum signal peptide binding, 0006614 SRP-dependent cotranslational protein targeting to membrane, 0005786 signal recognition particle, endoplasmic reticulum targeting; PDB: 1914_A 1RY1_D 1E8O_B 2W9J_B.
Probab=30.45 E-value=41 Score=25.57 Aligned_cols=18 Identities=28% Similarity=0.667 Sum_probs=15.1
Q ss_pred CCChhhhHHHHHHHHHHH
Q 038921 143 ADPELDEFMETYSDMLAK 160 (197)
Q Consensus 143 ~DPELDqFMeaYc~mL~k 160 (197)
.-.+|++|..+|+.+|..
T Consensus 71 ~~~~l~~F~~~Y~~v~K~ 88 (93)
T PF02290_consen 71 DPDDLDKFWQSYANVLKA 88 (93)
T ss_dssp ETTCHHHHHHHHHHHHHH
T ss_pred CHHHHHHHHHHHHHHHHh
Confidence 346999999999999863
No 20
>PF03401 TctC: Tripartite tricarboxylate transporter family receptor; InterPro: IPR005064 Bordetella pertussis, the causative agent of human whooping cough (pertussis), is an obligate human pathogen with diverse high-affinity transport systems for the assimilation of iron, a biometal that is essential for growth []. Periplasmic binding proteins of a new family, particularly well represented in this organism (and more generally in beta-proteobacteria), have been called Bug receptors []. They adopt a characteristic Venus flytrap fold with two globular domains bisected by a ligand-binding cleft. The family is specific for carboxylated solutes, with a characteristic mode of binding involving two highly conserved beta strand-beta turn-alpha helix motifs originating from each domain. These two motifs form hydrogen bonds with a carboxylate group of the ligand, both directly and via conserved water molecules, and have thus been termed the carboxylate pincers. Domain 1 recognises the ligand and the carboxylate group serves as an initial anchoring point. Domain 2 discriminates between productively and non-productively bound ligands as proper interactions with this domain is needed for the of the closed conformation []. BugE has a glutamate bound ligand. No charged residues are involved in glutamate binding by BugE, unlike what has been described for all glutamate receptors reported so far. The Bug architecture is highly conserved despite limited sequence identity [].; GO: 0030288 outer membrane-bounded periplasmic space; PDB: 2QPQ_C 2DVZ_A 2F5X_A.
Probab=29.40 E-value=1.1e+02 Score=26.80 Aligned_cols=18 Identities=28% Similarity=0.300 Sum_probs=13.1
Q ss_pred ccCCchhhHhHHHHHHhh
Q 038921 111 KVGAPPEIAYLLDAIRSE 128 (197)
Q Consensus 111 KVGAPpev~~~Ldei~~~ 128 (197)
+-|+|+|++.+|.++-++
T Consensus 211 p~gtp~~~~~~l~~a~~~ 228 (274)
T PF03401_consen 211 PKGTPDEIVDKLADAIKK 228 (274)
T ss_dssp ETTS-HHHHHHHHHHHHH
T ss_pred CCCCCHHHHHHHHHHHHH
Confidence 348999999999876553
No 21
>cd03204 GST_C_GDAP1 GST_C family, Ganglioside-induced differentiation-associated protein 1 (GDAP1) subfamily; GDAP1 was originally identified as a highly expressed gene at the differentiated stage of GD3 synthase-transfected cells. More recently, mutations in GDAP1 have been reported to cause both axonal and demyelinating autosomal-recessive Charcot-Marie-Tooth (CMT) type 4A neuropathy. CMT is characterized by slow and progressive weakness and atrophy of muscles. Sequence analysis of GDAP1 shows similarities and differences with GSTs; it appears to contain both N-terminal thioredoxin-fold and C-terminal alpha helical domains of GSTs, however, it also contains additional C-terminal transmembrane domains unlike GSTs. GDAP1 is mainly expressed in neuronal cells and is localized in the mitochondria through its transmembrane domains. It does not exhibit GST activity using standard substrates.
Probab=28.21 E-value=96 Score=24.03 Aligned_cols=33 Identities=33% Similarity=0.426 Sum_probs=23.7
Q ss_pred HHHHHHHHHHHHHhhch---------HHHHHHHHHHHHHHHH
Q 038921 152 ETYSDMLAKYKSDLARP---------FDEATTFLNKVELQLR 184 (197)
Q Consensus 152 eaYc~mL~kykEEL~rP---------f~EA~~F~~~iE~QL~ 184 (197)
++|..-..|.|.+|..| +.+....+..||++|.
T Consensus 5 ~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~l~~LE~~L~ 46 (111)
T cd03204 5 TAYIAKQKKLKSKLLDHDNVEYLKKILDELEMVLDQVEQELQ 46 (111)
T ss_pred HHHHHHHHHHHHHHHhcccHHHHHHHHHHHHHHHHHHHHHHH
Confidence 56666666777666555 4677778888899886
No 22
>KOG1070 consensus rRNA processing protein Rrp5 [RNA processing and modification]
Probab=28.07 E-value=1.7e+02 Score=32.88 Aligned_cols=91 Identities=18% Similarity=0.227 Sum_probs=64.8
Q ss_pred hHHHHHHHhhCCChHHHHHHHHhccccCCchhhHhHHHHHHhhhhhhccCCCCCCCCCCChhhhHHHHHHHHHHHHH--H
Q 038921 86 STVIKAKIASHPCYPRLLEAYIDCQKVGAPPEIAYLLDAIRSEKDLYRRDSVPTCFGADPELDEFMETYSDMLAKYK--S 163 (197)
Q Consensus 86 s~~iKAkI~sHPlYp~LL~AyidC~KVGAPpev~~~Ldei~~~~~~~~~~~~~~~~g~DPELDqFMeaYc~mL~kyk--E 163 (197)
.+.+-..|.+||.=.-+.--||.-+= .+..|+++|.-. .|.....-+.+..|++.---||.++...|. +
T Consensus 1444 aeDferlvrssPNSSi~WI~YMaf~L------elsEiekAR~ia---erAL~tIN~REeeEKLNiWiA~lNlEn~yG~ee 1514 (1710)
T KOG1070|consen 1444 AEDFERLVRSSPNSSILWIRYMAFHL------ELSEIEKARKIA---ERALKTINFREEEEKLNIWIAYLNLENAYGTEE 1514 (1710)
T ss_pred HHHHHHHHhcCCCcchHHHHHHHHHh------hhhhhHHHHHHH---HHHhhhCCcchhHHHHHHHHHHHhHHHhhCcHH
Confidence 44566789999999999988986543 233444433332 222223345678899999999999999997 8
Q ss_pred HhhchHHHHHHHHH--HHHHHHHH
Q 038921 164 DLARPFDEATTFLN--KVELQLRN 185 (197)
Q Consensus 164 EL~rPf~EA~~F~~--~iE~QL~~ 185 (197)
.|++-|..|-.+|+ .+...|..
T Consensus 1515 sl~kVFeRAcqycd~~~V~~~L~~ 1538 (1710)
T KOG1070|consen 1515 SLKKVFERACQYCDAYTVHLKLLG 1538 (1710)
T ss_pred HHHHHHHHHHHhcchHHHHHHHHH
Confidence 89999999999998 44444443
No 23
>PF12057 DUF3538: Domain of unknown function (DUF3538); InterPro: IPR021925 This presumed domain is functionally uncharacterised. This domain is found in eukaryotes. This domain is about 120 amino acids in length. This domain is found associated with PF00240 from PFAM. This domain has a conserved SDL sequence motif.
Probab=26.36 E-value=1.2e+02 Score=24.76 Aligned_cols=41 Identities=32% Similarity=0.599 Sum_probs=30.1
Q ss_pred hhhHHHHHHHHHH-----HH------HHHhhchH---HHHHHHHHHHHHHHHHhh
Q 038921 147 LDEFMETYSDMLA-----KY------KSDLARPF---DEATTFLNKVELQLRNLC 187 (197)
Q Consensus 147 LDqFMeaYc~mL~-----ky------kEEL~rPf---~EA~~F~~~iE~QL~~L~ 187 (197)
|-=||+.|+++|. .| +|+=+|-| -||..++..-..-|++|-
T Consensus 15 L~Pflery~~iL~~~~~~e~~~~~~~re~~QRi~d~VsEalh~lsHA~hAlSDL~ 69 (120)
T PF12057_consen 15 LQPFLERYHEILQEDPSFEYENNTQEREEDQRIFDLVSEALHYLSHAYHALSDLM 69 (120)
T ss_pred HhHHHHHHHHHHhcCCCCCCCCCccchhhHhHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 3448888998886 34 55556665 588888888888888874
No 24
>PF13713 BRX_N: Transcription factor BRX N-terminal domain
Probab=25.99 E-value=75 Score=21.08 Aligned_cols=20 Identities=15% Similarity=0.370 Sum_probs=17.0
Q ss_pred HHHHHHHHHHHHHHHHhhcC
Q 038921 170 DEATTFLNKVELQLRNLCTT 189 (197)
Q Consensus 170 ~EA~~F~~~iE~QL~~L~~~ 189 (197)
.-|.+|++.|..||++++..
T Consensus 8 kaaKe~IKsLt~QlK~maek 27 (39)
T PF13713_consen 8 KAAKEVIKSLTAQLKDMAEK 27 (39)
T ss_pred HHHHHHHHHHHHHHHHHHHh
Confidence 46889999999999999753
No 25
>KOG4445 consensus Uncharacterized conserved protein, contains RWD domain [Function unknown]
Probab=24.24 E-value=1.6e+02 Score=28.03 Aligned_cols=43 Identities=16% Similarity=0.268 Sum_probs=35.2
Q ss_pred hhhhHHHHHHH--HHHHHHHHhhchHHHHHHHHHHHHHHHHHhhc
Q 038921 146 ELDEFMETYSD--MLAKYKSDLARPFDEATTFLNKVELQLRNLCT 188 (197)
Q Consensus 146 ELDqFMeaYc~--mL~kykEEL~rPf~EA~~F~~~iE~QL~~L~~ 188 (197)
+-|.||--||. -|+.....|.+-|++|-.-...+-.|.+.+|.
T Consensus 135 ~C~Hy~H~~ClaRyl~~~~~~lrqe~q~~~~~~qh~~~~~eavcp 179 (368)
T KOG4445|consen 135 ACDHYMHFACLARYLTECLTGLRQEIQDAQKERQHMKEQVEAVCP 179 (368)
T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhHhhhhh
Confidence 67999999995 36666666777788899989999999999974
No 26
>KOG2828 consensus Acetyl-CoA hydrolase [Energy production and conversion]
Probab=24.12 E-value=42 Score=32.60 Aligned_cols=23 Identities=35% Similarity=0.343 Sum_probs=20.4
Q ss_pred HHhhCCChHHHHHHHHhccccCC
Q 038921 92 KIASHPCYPRLLEAYIDCQKVGA 114 (197)
Q Consensus 92 kI~sHPlYp~LL~AyidC~KVGA 114 (197)
+|..||.|..|+.+|.|-.||=.
T Consensus 429 ~i~~p~dre~L~k~afdr~kvmp 451 (454)
T KOG2828|consen 429 QICAPPDREALLKAAFDRAKVMP 451 (454)
T ss_pred HhhCCchHHHHHHHHHHHHhccc
Confidence 68999999999999999888743
No 27
>TIGR00270 conserved hypothetical protein TIGR00270.
Probab=23.69 E-value=97 Score=25.60 Aligned_cols=43 Identities=9% Similarity=0.219 Sum_probs=30.9
Q ss_pred ChhhhHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHHHHHHhh
Q 038921 145 PELDEFMETYSDMLAKYKSDLARPFDEATTFLNKVELQLRNLC 187 (197)
Q Consensus 145 PELDqFMeaYc~mL~kykEEL~rPf~EA~~F~~~iE~QL~~L~ 187 (197)
++++++++.|-..|.++++++.-..+|-..-+.-=.+.++.+-
T Consensus 61 d~~~~l~~~~g~~Ir~~Re~~glSqeeLA~~lgvs~s~IsriE 103 (154)
T TIGR00270 61 DTTEELVEDYGIIIRREREKRGWSQEQLAKKIQEKESLIKKIE 103 (154)
T ss_pred chHHHHHHHHHHHHHHHHHHcCCCHHHHHHHhCCCHHHHHHHH
Confidence 4788999999999999999887776665554444444444443
No 28
>PF01563 Alpha_E3_glycop: Alphavirus E3 glycoprotein; InterPro: IPR002533 Alphaviruses are enveloped RNA viruses that use arthropods such as mosquitoes for transmission to their vertebrate hosts, and include Semliki Forest and Sindbis viruses []. Alphaviruses consist of three structural proteins: the core nucleocapsid protein C, and the envelope proteins P62 and E1 (IPR002548 from INTERPRO) that associate as a heterodimer. The viral membrane-anchored surface glycoproteins are responsible for receptor recognition and entry into target cells through membrane fusion. The proteolytic maturation of P62 into E2 (IPR000936 from INTERPRO) and E3 causes a change in the viral surface. Together the E1, E2, and sometimes E3 glycoprotein "spikes" form an E1/E2 dimer or an E1/E2/E3 trimer, where E2 extends from the centre to the vertices, E1 fills the space between the vertices, and E3, if present, is at the distal end of the spike [, ]. Upon exposure of the virus to the acidity of the endosome, E1 dissociates from E2 to form an E1 homotrimer, which is necessary for the fusion step to drive the cellular and viral membranes together []. This entry represents the alphaviral E3 glycoprotein. Most alphaviruses lose the peripheral protein E3, but in Semliki viruses it remains associated with the viral surface.; GO: 0004252 serine-type endopeptidase activity, 0019028 viral capsid, 0055036 virion membrane; PDB: 3J0C_H 3J0G_N 3N40_P 3N41_A 3N44_A 3N42_A 3N43_A.
Probab=22.53 E-value=45 Score=23.95 Aligned_cols=23 Identities=30% Similarity=0.567 Sum_probs=17.0
Q ss_pred HHHHHhhCCChHHHHHHHHhccc
Q 038921 89 IKAKIASHPCYPRLLEAYIDCQK 111 (197)
Q Consensus 89 iKAkI~sHPlYp~LL~AyidC~K 111 (197)
|=..=+.||-|-.||+|-+.|.-
T Consensus 28 mLe~Nvd~p~Y~~LL~a~l~C~~ 50 (56)
T PF01563_consen 28 MLEDNVDNPGYDDLLEAVLRCPS 50 (56)
T ss_dssp HHHCTSSSTTHHHHHHHHCC--S
T ss_pred HHHHhCCCccHHHHHHHHhhCCC
Confidence 44445789999999999999964
No 29
>COG5661 Predicted secreted Zn-dependent protease [Posttranslational modification, protein turnover, chaperones]
Probab=21.77 E-value=2.3e+02 Score=25.14 Aligned_cols=42 Identities=26% Similarity=0.260 Sum_probs=35.8
Q ss_pred CCCChhhhHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHHHHHHh
Q 038921 142 GADPELDEFMETYSDMLAKYKSDLARPFDEATTFLNKVELQLRNL 186 (197)
Q Consensus 142 g~DPELDqFMeaYc~mL~kykEEL~rPf~EA~~F~~~iE~QL~~L 186 (197)
++||||-.|-.+|-.-+.+-+ +++.+=|..--++||.+|.+|
T Consensus 109 ~a~pElalvwdtlladikrHE---ErH~eiar~har~mEkaL~sl 150 (210)
T COG5661 109 KADPELALVWDTLLADIKRHE---ERHAEIARYHAREMEKALLSL 150 (210)
T ss_pred CCCHhHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHhc
Confidence 589999999999877666554 478888999999999999998
No 30
>PF10112 Halogen_Hydrol: 5-bromo-4-chloroindolyl phosphate hydrolysis protein; InterPro: IPR018770 This entry consists of prokaryotic proteins that mediate the hydrolysis of 5-bromo-4-chloroindolyl phosphate bonds.
Probab=21.32 E-value=2.2e+02 Score=23.60 Aligned_cols=43 Identities=26% Similarity=0.414 Sum_probs=29.4
Q ss_pred ChhhhHHHHHH----HHHHHHHHHhhchH--HHHHHHHHHHHHHHHHhh
Q 038921 145 PELDEFMETYS----DMLAKYKSDLARPF--DEATTFLNKVELQLRNLC 187 (197)
Q Consensus 145 PELDqFMeaYc----~mL~kykEEL~rPf--~EA~~F~~~iE~QL~~L~ 187 (197)
...+.||..|- +++.||.+==.+|. .|+...+.++|.-|..|+
T Consensus 125 ~~a~~Fl~~yLp~~~~l~~kY~~l~~~~~~~~~~~~~l~e~~~~L~~l~ 173 (199)
T PF10112_consen 125 TQARKFLYYYLPTAVKLLEKYAELESQPVKSEEIKQSLEEIEETLDTLN 173 (199)
T ss_pred HHHHHHHHHHhhHHHHHHHHHHHHHhccCCChhHHHHHHHHHHHHHHHH
Confidence 57899999995 56777877555665 555566666666666554
No 31
>PHA03397 vlf-1 very late expression factor 1; Provisional
Probab=21.32 E-value=67 Score=30.27 Aligned_cols=29 Identities=28% Similarity=0.415 Sum_probs=25.6
Q ss_pred hHHHHHHHhhCCChHHHHHHHHhccccCC
Q 038921 86 STVIKAKIASHPCYPRLLEAYIDCQKVGA 114 (197)
Q Consensus 86 s~~iKAkI~sHPlYp~LL~AyidC~KVGA 114 (197)
-..-|.+|-+||+|+...+.=++=||-+-
T Consensus 12 ~~~w~~~i~~~~~f~~~~~~~~~rq~~~~ 40 (363)
T PHA03397 12 YNIWKLVIQNHPRFEEVFDLAIDRQKCTP 40 (363)
T ss_pred HHHHHHHHhccchHHHHHHHHHHhhcCCC
Confidence 45789999999999999999999999543
No 32
>cd03209 GST_C_Mu GST_C family, Class Mu subfamily; GSTs are cytosolic dimeric proteins involved in cellular detoxification by catalyzing the conjugation of glutathione (GSH) with a wide range of endogenous and xenobiotic alkylating agents, including carcinogens, therapeutic drugs, environmental toxins, and products of oxidative stress. The GST fold contains an N-terminal thioredoxin-fold domain and a C-terminal alpha helical domain, with an active site located in a cleft between the two domains. GSH binds to the N-terminal domain while the hydrophobic substrate occupies a pocket in the C-terminal domain. The class Mu subfamily is composed of eukaryotic GSTs. In rats, at least six distinct class Mu subunits have been identified, with homologous genes in humans for five of these subunits. Class Mu GSTs can form homodimers and heterodimers, giving a large number of possible isoenzymes that can be formed, all with overlapping activities but different substrate specificities. They are the m
Probab=20.95 E-value=2.8e+02 Score=20.42 Aligned_cols=14 Identities=14% Similarity=-0.088 Sum_probs=8.0
Q ss_pred HHHHHHHHHHHHHH
Q 038921 171 EATTFLNKVELQLR 184 (197)
Q Consensus 171 EA~~F~~~iE~QL~ 184 (197)
+....+..+|.+|.
T Consensus 39 ~~~~~l~~le~~L~ 52 (121)
T cd03209 39 KLPDKLKLFSDFLG 52 (121)
T ss_pred HHHHHHHHHHHHhC
Confidence 33445666666664
No 33
>PF13865 FoP_duplication: C-terminal duplication domain of Friend of PRMT1
Probab=20.65 E-value=65 Score=23.44 Aligned_cols=8 Identities=63% Similarity=1.111 Sum_probs=4.2
Q ss_pred CChhhhHH
Q 038921 144 DPELDEFM 151 (197)
Q Consensus 144 DPELDqFM 151 (197)
|.|||+||
T Consensus 46 DaELD~Ym 53 (74)
T PF13865_consen 46 DAELDAYM 53 (74)
T ss_pred HHHHHHHH
Confidence 44555555
No 34
>PF10152 DUF2360: Predicted coiled-coil domain-containing protein (DUF2360); InterPro: IPR019309 This entry represents a component of the WASH complex. The WASH complex is present at the surface of endosomes and recruits and activates the Arp2/3 complex to induce actin polymerisation. The WASH complex plays a key role in the fission of tubules that serve as transport intermediates during endosome sorting []. The WASH complex's subunit structure: F-actin-capping protein subunit alpha (CAPZA1, CAPZA2 or CAPZA3), F-actin-capping protein subunit beta (CAPZB), WASH (WASH1, WASH2P, WASH3P, WASH4P, WASH5P or WASH6P), FAM21 (FAM21A, FAM21B or FAM21C), KIAA1033, KIAA0196 (strumpellin) and CCDC53.
Probab=20.61 E-value=2.9e+02 Score=22.51 Aligned_cols=42 Identities=17% Similarity=0.320 Sum_probs=28.1
Q ss_pred hhhHHHHHHHHHHHHHHH-------hhchHHHHHHHHHHHHHHHHHhhc
Q 038921 147 LDEFMETYSDMLAKYKSD-------LARPFDEATTFLNKVELQLRNLCT 188 (197)
Q Consensus 147 LDqFMeaYc~mL~kykEE-------L~rPf~EA~~F~~~iE~QL~~L~~ 188 (197)
+.+|+..=|..|.+|--. +++-++.+-.-+.=+|..|.+|-+
T Consensus 1 vN~Fi~~tv~fLN~F~~~cE~kL~~~e~~Lq~~E~~l~iLEaKL~SIpg 49 (148)
T PF10152_consen 1 VNHFIVHTVQFLNRFASVCEEKLSDMEQRLQRLEATLNILEAKLSSIPG 49 (148)
T ss_pred CcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccc
Confidence 357888888888888533 344456666666667777777643
No 35
>PF14689 SPOB_a: Sensor_kinase_SpoOB-type, alpha-helical domain; PDB: 1F51_C 2FTK_B 1IXM_B.
Probab=20.57 E-value=3e+02 Score=19.03 Aligned_cols=37 Identities=19% Similarity=0.372 Sum_probs=28.1
Q ss_pred HHHHHHHHHHHHHHhhchH------------HHHHHHHHHHHHHHHHhh
Q 038921 151 METYSDMLAKYKSDLARPF------------DEATTFLNKVELQLRNLC 187 (197)
Q Consensus 151 MeaYc~mL~kykEEL~rPf------------~EA~~F~~~iE~QL~~L~ 187 (197)
++.+.+.|++++=+.-+++ ++|..+++++-.+++.+.
T Consensus 9 ~~~~~~~lR~~RHD~~NhLqvI~gllqlg~~~~a~eYi~~~~~~~~~~s 57 (62)
T PF14689_consen 9 LEELIDSLRAQRHDFLNHLQVIYGLLQLGKYEEAKEYIKELSKDLQQES 57 (62)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHTT-HHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHhHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHH
Confidence 3556677777777777764 799999999988887763
Done!