Query psy9431
Match_columns 119
No_of_seqs 119 out of 478
Neff 6.2
Searched_HMMs 46136
Date Fri Aug 16 19:11:48 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy9431.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/9431hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 COG5279 CYK3 Uncharacterized p 99.9 2.1E-26 4.5E-31 192.8 6.5 99 1-115 221-319 (521)
2 smart00460 TGc Transglutaminas 99.5 5.1E-14 1.1E-18 88.0 6.0 50 2-53 19-68 (68)
3 KOG4575|consensus 99.4 2.1E-14 4.6E-19 123.6 0.6 101 2-115 527-627 (874)
4 PF01841 Transglut_core: Trans 99.1 1.9E-10 4.1E-15 77.5 4.2 50 1-51 63-113 (113)
5 COG1305 Transglutaminase-like 98.4 2.2E-07 4.7E-12 71.4 3.9 57 1-57 206-264 (319)
6 KOG4575|consensus 98.0 1.1E-06 2.3E-11 76.8 -1.2 105 8-113 407-532 (874)
7 PF04473 DUF553: Transglutamin 96.9 0.0022 4.7E-08 47.2 5.3 42 2-54 88-129 (153)
8 PF14381 EDR1: Ethylene-respon 96.3 0.0072 1.6E-07 46.4 4.7 46 2-57 155-201 (204)
9 PF00797 Acetyltransf_2: N-ace 94.2 0.14 3E-06 38.9 5.6 51 2-56 59-109 (240)
10 COG3012 Uncharacterized protei 92.4 0.088 1.9E-06 38.7 1.9 16 39-54 115-130 (151)
11 PF14402 7TM_transglut: 7 tran 91.9 0.17 3.7E-06 41.3 3.2 48 2-56 6-53 (313)
12 KOG0909|consensus 90.6 0.075 1.6E-06 45.3 0.0 39 1-55 232-272 (500)
13 PF13471 Transglut_core3: Tran 86.5 0.92 2E-05 31.0 3.2 34 2-46 68-101 (117)
14 PF01473 CW_binding_1: Putativ 85.6 0.79 1.7E-05 21.9 1.8 12 40-51 4-15 (19)
15 PRK15047 N-hydroxyarylamine O- 85.5 2.9 6.3E-05 33.4 6.0 50 2-56 79-128 (281)
16 PF15644 Tox-PL: Papain fold t 70.6 4.6 0.0001 27.5 2.6 19 34-52 88-106 (111)
17 PRK04233 hypothetical protein; 67.7 4.1 8.8E-05 29.2 1.9 14 38-51 115-128 (129)
18 PF09027 GTPase_binding: GTPas 67.0 2.2 4.8E-05 27.3 0.4 29 78-106 34-62 (66)
19 cd06890 PX_Bem1p The phosphoin 55.5 3.8 8.2E-05 28.1 -0.1 38 45-100 15-52 (112)
20 PF12893 Lumazine_bd_2: Putati 53.6 16 0.00035 24.4 2.8 19 35-53 97-115 (116)
21 PRK00183 hypothetical protein; 51.6 9.9 0.00021 28.1 1.6 16 38-53 115-130 (157)
22 PRK01842 hypothetical protein; 48.4 11 0.00024 27.6 1.5 13 39-51 132-145 (149)
23 PRK01617 hypothetical protein; 46.9 11 0.00025 27.6 1.3 15 39-53 117-131 (154)
24 PF12419 DUF3670: SNF2 Helicas 46.5 15 0.00032 26.2 1.8 20 36-55 96-115 (141)
25 PF03835 Rad4: Rad4 transgluta 45.1 37 0.0008 24.0 3.7 23 32-54 33-57 (145)
26 PRK01752 hypothetical protein; 42.5 15 0.00032 27.1 1.3 15 39-53 117-131 (156)
27 PRK02250 hypothetical protein; 41.7 16 0.00035 27.0 1.5 15 40-54 114-128 (166)
28 PF12870 Lumazine_bd: Lumazine 39.6 35 0.00077 21.7 2.7 15 35-49 97-111 (111)
29 COG3549 HigB Plasmid maintenan 38.0 34 0.00074 23.4 2.4 17 34-50 61-77 (94)
30 PF02099 Josephin: Josephin; 37.8 54 0.0012 24.0 3.7 27 33-59 107-133 (157)
31 COG1489 SfsA DNA-binding prote 37.3 1.1E+02 0.0023 24.3 5.4 49 10-58 27-87 (235)
32 PF05015 Plasmid_killer: Plasm 36.1 30 0.00065 22.8 2.0 14 36-49 63-76 (93)
33 PF03749 SfsA: Sugar fermentat 35.0 99 0.0021 23.8 4.9 27 32-58 50-77 (215)
34 TIGR00230 sfsA sugar fermentat 32.4 1.6E+02 0.0034 23.0 5.7 27 32-58 63-89 (232)
35 PRK09216 rplM 50S ribosomal pr 30.0 33 0.00072 25.0 1.5 13 42-54 11-23 (144)
36 PF08272 Topo_Zn_Ribbon: Topoi 29.1 7.1 0.00015 22.8 -1.8 16 33-48 24-39 (42)
37 CHL00159 rpl13 ribosomal prote 28.9 36 0.00077 24.8 1.5 13 42-54 12-24 (143)
38 COG0764 FabA 3-hydroxymyristoy 27.8 28 0.00061 25.3 0.8 19 84-102 38-57 (147)
39 PF02982 Scytalone_dh: Scytalo 27.1 69 0.0015 23.9 2.7 17 33-49 117-136 (160)
40 PF13610 DDE_Tnp_IS240: DDE do 26.8 52 0.0011 23.0 2.0 16 40-55 11-26 (140)
41 PF03543 Peptidase_C58: Yersin 24.8 77 0.0017 24.0 2.7 27 33-59 142-168 (204)
42 PF00391 PEP-utilizers: PEP-ut 24.6 24 0.00052 22.5 -0.0 13 1-13 47-59 (80)
43 PF11447 DUF3201: Protein of u 23.7 61 0.0013 23.8 1.9 11 40-50 37-47 (150)
44 PF12883 DUF3828: Protein of u 23.1 1.1E+02 0.0024 20.9 3.1 18 35-52 100-117 (120)
45 PF09724 DUF2036: Uncharacteri 22.1 42 0.00091 26.8 0.9 23 36-58 148-170 (325)
46 PF15232 DUF4585: Domain of un 21.8 77 0.0017 20.8 1.9 15 39-53 9-26 (75)
47 PLN00205 ribisomal protein L13 21.7 53 0.0011 25.1 1.3 11 44-54 15-25 (191)
48 PF09173 eIF2_C: Initiation fa 21.7 64 0.0014 21.6 1.6 11 40-50 74-84 (88)
49 PF04276 DUF443: Protein of un 21.6 97 0.0021 23.3 2.7 14 38-51 5-18 (199)
50 smart00785 AARP2CN AARP2CN (NU 21.2 1.1E+02 0.0023 20.0 2.5 17 10-26 45-61 (83)
51 PF03736 EPTP: EPTP domain; I 21.1 1.5E+02 0.0033 16.5 2.9 22 32-53 12-33 (44)
52 smart00849 Lactamase_B Metallo 20.3 1.5E+02 0.0033 20.1 3.4 24 33-56 4-27 (183)
53 cd01287 FabA FabA, beta-hydrox 20.1 48 0.001 23.8 0.8 16 87-102 35-51 (150)
No 1
>COG5279 CYK3 Uncharacterized protein involved in cytokinesis, contains TGc (transglutaminase/protease-like) domain [Cell division and chromosome partitioning]
Probab=99.93 E-value=2.1e-26 Score=192.81 Aligned_cols=99 Identities=22% Similarity=0.456 Sum_probs=83.3
Q ss_pred CcccccCCcEEEEEEEeCCCCCCCCCcCCCCCCCceEEEEEECCeEEEEeccCCCccccCCccccccccCCCCCCccccc
Q psy9431 1 MFESYAGLHCVVIKGYSKSAGYQPGVKFEDNRFRNSWNAVYVAGAWRFVQCNWGARHLVNAKEVPKVGAKGKSDSLRYEY 80 (119)
Q Consensus 1 ~lc~~aGi~~~~V~G~~k~~~~~pg~~~~~~~~~HaWN~V~ldg~W~liD~Twgag~~~~~~~~p~~~~~~~~~~f~~~~ 80 (119)
+||.++||+|++|.||+|+.-+ .+.++ ..|||||+|+||++||+||+|||++...+.+ ..+ ...
T Consensus 221 ~lcn~lgIp~~iIegf~k~~~~-~~~~~---~iNHaWN~VkiD~~yy~VDtTwgdPI~~d~~----------~~~--~~~ 284 (521)
T COG5279 221 ELCNALGIPCEIIEGFLKSPIY-YTRDI---NINHAWNIVKIDNEYYLVDTTWGDPIHPDQN----------SKS--SKI 284 (521)
T ss_pred HHHHhcCCceEEEeeccccccc-ccCCc---cccceeeEEEECCeEEEEeeecCCCCccccc----------ccc--ccc
Confidence 5999999999999999999832 23333 4799999999999999999999999765421 111 367
Q ss_pred CccccccCchhchhhccCCCCCcccccCCCccccC
Q psy9431 81 DDHYFLTDPREFIYEFFPLQPEWRTHVYQASVLTT 115 (119)
Q Consensus 81 ~~~YFl~~P~~fi~tH~P~d~~wQll~~pis~~~~ 115 (119)
+.+||++.|.++|.||+|..++||++.+|++.+++
T Consensus 285 n~~YF~lap~qMi~tH~p~k~~wq~~~~~~d~~~~ 319 (521)
T COG5279 285 NHSYFLLAPNQMIATHVPEKDDWQFIKPDLDMPIV 319 (521)
T ss_pred CchhccCChHHHhhhcCCCcchhhccCCchhhhhh
Confidence 89999999999999999999999999999997654
No 2
>smart00460 TGc Transglutaminase/protease-like homologues. Transglutaminases are enzymes that establish covalent links between proteins. A subset of transglutaminase homologues appear to catalyse the reverse reaction, the hydrolysis of peptide bonds. Proteins with this domain are both extracellular and intracellular, and it is likely that the eukaryotic intracellular proteins are involved in signalling events.
Probab=99.50 E-value=5.1e-14 Score=87.97 Aligned_cols=50 Identities=24% Similarity=0.434 Sum_probs=41.9
Q ss_pred cccccCCcEEEEEEEeCCCCCCCCCcCCCCCCCceEEEEEECCeEEEEeccC
Q psy9431 2 FESYAGLHCVVIKGYSKSAGYQPGVKFEDNRFRNSWNAVYVAGAWRFVQCNW 53 (119)
Q Consensus 2 lc~~aGi~~~~V~G~~k~~~~~pg~~~~~~~~~HaWN~V~ldg~W~liD~Tw 53 (119)
|||++||+|++|.|+.++.+...+.. ....+|+|++|+++|+|+.+|+||
T Consensus 19 llr~~GIpar~v~g~~~~~~~~~~~~--~~~~~H~W~ev~~~~~W~~~D~~~ 68 (68)
T smart00460 19 LLRSLGIPARVVSGYLKAPDTIGGLR--SIWEAHAWAEVYLEGGWVPVDPTP 68 (68)
T ss_pred HHHHCCCCeEEEeeeecCCCCCcccc--cCCCcEEEEEEEECCCeEEEeCCC
Confidence 79999999999999999875433221 124799999999999999999998
No 3
>KOG4575|consensus
Probab=99.43 E-value=2.1e-14 Score=123.64 Aligned_cols=101 Identities=22% Similarity=0.187 Sum_probs=77.1
Q ss_pred cccccCCcEEEEEEEeCCCCCCCCCcCCCCCCCceEEEEEECCeEEEEeccCCCccccCCccccccccCCCCCCcccccC
Q psy9431 2 FESYAGLHCVVIKGYSKSAGYQPGVKFEDNRFRNSWNAVYVAGAWRFVQCNWGARHLVNAKEVPKVGAKGKSDSLRYEYD 81 (119)
Q Consensus 2 lc~~aGi~~~~V~G~~k~~~~~pg~~~~~~~~~HaWN~V~ldg~W~liD~Twgag~~~~~~~~p~~~~~~~~~~f~~~~~ 81 (119)
|...+||.|+||.||.|.+.... -. .+.||+|+.|++|.+|++||+..++.. + .+++. ...+.. ..+
T Consensus 527 ~~~sLdlwCEvi~gflK~P~~i~-~~---fkyNh~w~~~~~d~e~r~iD~s~~~s~---p-ih~~~----~~~s~~-~~~ 593 (874)
T KOG4575|consen 527 RPLSLDLWCEVILGFLKIPFVIS-LF---FKYNHSWIDNKLDEETRYIDKSGAASI---P-IHLPP----KGDSLI-FHY 593 (874)
T ss_pred hhhhcchhHHHHHhhhcCcccee-ee---eecchhhhhhhccceeEEEeecccccc---c-cccCc----CCcchh-ccc
Confidence 56789999999999999874321 11 246999999999999999999874431 1 23321 222322 335
Q ss_pred ccccccCchhchhhccCCCCCcccccCCCccccC
Q psy9431 82 DHYFLTDPREFIYEFFPLQPEWRTHVYQASVLTT 115 (119)
Q Consensus 82 ~~YFl~~P~~fi~tH~P~d~~wQll~~pis~~~~ 115 (119)
++|||+.|+|.|+||+|++|..|++.+-|+.+..
T Consensus 594 n~yFlm~P~E~i~Th~penp~~qfImpsv~~~~~ 627 (874)
T KOG4575|consen 594 NLYFLMSPEENIITHMPENPLKQFIMPSVTEDVC 627 (874)
T ss_pred cceeecChhhheecCCCCchhhhhccccccchhe
Confidence 6899999999999999999999999999987653
No 4
>PF01841 Transglut_core: Transglutaminase-like superfamily; InterPro: IPR002931 This domain is found in many proteins known to have transglutaminase activity, i.e. which cross-link proteins through an acyl-transfer reaction between the gamma-carboxamide group of peptide-bound glutamine and the epsilon-amino group of peptide-bound lysine, resulting in a epsilon-(gamma-glutamyl)lysine isopeptide bond. Tranglutaminases have been found in a diverse range of species, from bacteria through to mammals. The enzymes require calcium binding and their activity leads to post-translational modification of proteins through acyl-transfer reactions, involving peptidyl glutamine residues as acyl donors and a variety of primary amines as acyl acceptors, with the generation of proteinase resistant isopeptide bonds []. Sequence conservation in this superfamily primarily involves three motifs that centre around conserved cysteine, histidine, and aspartate residues that form the catalytic triad in the structurally characterised transglutaminase, the human blood clotting factor XIIIa' []. On the basis of the experimentally demonstrated activity of the Methanobacterium phage psiM2 pseudomurein endoisopeptidase [], it is proposed that many, if not all, microbial homologs of the transglutaminases are proteases and that the eukaryotic transglutaminases have evolved from an ancestral protease []. A subunit of plasma Factor XIII revealed that each Factor XIIIA subunit is composed of four domains (termed N-terminal beta-sandwich, core domain (containing the catalytic and the regulatory sites), and C-terminal beta-barrels 1 and 2) and that two monomers assemble into the native dimer through the surfaces in domains 1 and 2, in opposite orientation. This organisation in four domains is highly conserved during evolution among transglutaminase isoforms [].; PDB: 2F4M_A 2F4O_A 1GGY_B 1FIE_B 1GGU_B 1GGT_B 1F13_A 1QRK_B 1EVU_A 1EX0_B ....
Probab=99.05 E-value=1.9e-10 Score=77.51 Aligned_cols=50 Identities=20% Similarity=0.420 Sum_probs=36.3
Q ss_pred CcccccCCcEEEEEEEeCCCCCCCCCcCCCCCCCceEEEEEE-CCeEEEEec
Q psy9431 1 MFESYAGLHCVVIKGYSKSAGYQPGVKFEDNRFRNSWNAVYV-AGAWRFVQC 51 (119)
Q Consensus 1 ~lc~~aGi~~~~V~G~~k~~~~~pg~~~~~~~~~HaWN~V~l-dg~W~liD~ 51 (119)
.|||++||+|++|.|+.++.+........ ....|+||+|++ +|+|+.+|+
T Consensus 63 allr~~Gipar~v~g~~~~~~~~~~~~~~-~~~~H~w~ev~~~~~~W~~~Dp 113 (113)
T PF01841_consen 63 ALLRALGIPARVVSGYVKGPDPDGDYSVD-GNDNHAWVEVYLPGGGWIPLDP 113 (113)
T ss_dssp HHHHHHT--EEEEEEEEEECSSTTCTSTS-SEEEEEEEEEEETTTEEEEEET
T ss_pred HHHhhCCCceEEEEEEcCCccccccccCC-CCCCEEEEEEEEcCCcEEEcCC
Confidence 38999999999999999975321111111 236899999999 899999996
No 5
>COG1305 Transglutaminase-like enzymes, putative cysteine proteases [Amino acid transport and metabolism]
Probab=98.43 E-value=2.2e-07 Score=71.42 Aligned_cols=57 Identities=19% Similarity=0.312 Sum_probs=43.3
Q ss_pred CcccccCCcEEEEEEEeCCCCCCCCCc-CCCCCCCceEEEEEECC-eEEEEeccCCCcc
Q psy9431 1 MFESYAGLHCVVIKGYSKSAGYQPGVK-FEDNRFRNSWNAVYVAG-AWRFVQCNWGARH 57 (119)
Q Consensus 1 ~lc~~aGi~~~~V~G~~k~~~~~pg~~-~~~~~~~HaWN~V~ldg-~W~liD~Twgag~ 57 (119)
.|||++||+|++|+||.-+.....+.. .......|||.+|+++| .|+-+|+|-+...
T Consensus 206 al~Ra~GIpAR~V~Gy~~~~~~~~~~~~~~~~~~~Haw~ev~~~~~gW~~~Dpt~~~~~ 264 (319)
T COG1305 206 ALLRAAGIPARYVSGYLGAEVEPLSGRPLVRNDDAHAWAEVYLPGRGWVPLDPTNGLLA 264 (319)
T ss_pred HHHHHcCCcceeeeccccCCCCcccccccccCcccceeeeeecCCCccEeecCCCCCcc
Confidence 389999999999999987763222111 12234689999999999 5999999988753
No 6
>KOG4575|consensus
Probab=97.95 E-value=1.1e-06 Score=76.75 Aligned_cols=105 Identities=57% Similarity=1.030 Sum_probs=73.7
Q ss_pred Cc--EEEEEEEe--CCCC-CCCCCc-CCCCC------CCceEEEEEECC----eEEEEeccCCCccccCCccccccccCC
Q psy9431 8 LH--CVVIKGYS--KSAG-YQPGVK-FEDNR------FRNSWNAVYVAG----AWRFVQCNWGARHLVNAKEVPKVGAKG 71 (119)
Q Consensus 8 i~--~~~V~G~~--k~~~-~~pg~~-~~~~~------~~HaWN~V~ldg----~W~liD~Twgag~~~~~~~~p~~~~~~ 71 (119)
++ |.++.|+. +..| +.||+- +...+ ..-.||+|.+|| .|+++-++|+|+.+....+ .-|+.+.
T Consensus 407 lhvy~~v~~g~~ci~v~gd~~Pgq~sv~~~~~Nmnfsa~~tw~r~i~dgrvqvSw~~~q~n~~Ar~~~tal~-~~r~~~i 485 (874)
T KOG4575|consen 407 LHVYCVVIKGFSCIKVAGDYQPGQYSVDDHRFNMNFSARNTWNRVILDGRVQVSWRFVQCNWGARHLVTALD-GSREAKI 485 (874)
T ss_pred ccchhhhhcccchhhhccCCCceeEeccccccccchhhhhchheeeecCcccccHHHHhhccCcchhhhHhh-hccceee
Confidence 55 77888877 5554 667622 32222 245899999999 8999999999995533211 0111111
Q ss_pred CCCCcccccCccccccCc---hhchh--hccCCCCCcccccCCCccc
Q psy9431 72 KSDSLRYEYDDHYFLTDP---REFIY--EFFPLQPEWRTHVYQASVL 113 (119)
Q Consensus 72 ~~~~f~~~~~~~YFl~~P---~~fi~--tH~P~d~~wQll~~pis~~ 113 (119)
.-...+...|+.||.+++ ++||+ .-+|.+-.|++.+.|.|+.
T Consensus 486 ~~~E~~~~tdd~~~~~e~rtse~~iyeg~~tP~e~awl~k~~~~sLd 532 (874)
T KOG4575|consen 486 DGNELRYETDDHYFMTESRTSEEFIYEGEFTPSEHAWLLKPRPLSLD 532 (874)
T ss_pred eeeeeeEeeccccccccccchhhheeccccChhHHHHHhhhhhhhcc
Confidence 122234678999999999 89999 7889999999999999883
No 7
>PF04473 DUF553: Transglutaminase-like domain; InterPro: IPR007562 This entry represents a transglutaminase-like domain found in a family of uncharacterised archaeal proteins that had previously been called DUF553 and UPF0252.
Probab=96.92 E-value=0.0022 Score=47.24 Aligned_cols=42 Identities=14% Similarity=0.071 Sum_probs=30.6
Q ss_pred cccccCCcEEEEEEEeCCCCCCCCCcCCCCCCCceEEEEEECCeEEEEeccCC
Q psy9431 2 FESYAGLHCVVIKGYSKSAGYQPGVKFEDNRFRNSWNAVYVAGAWRFVQCNWG 54 (119)
Q Consensus 2 lc~~aGi~~~~V~G~~k~~~~~pg~~~~~~~~~HaWN~V~ldg~W~liD~Twg 54 (119)
|+-.+|+....|.+..-.. .. .|||++|+|||+||++|.+.-
T Consensus 88 lLl~~g~~~~yi~~~~~~~----------~~-~Haa~aV~ing~~yvlDq~~p 129 (153)
T PF04473_consen 88 LLLNMGYSPVYILHIEFDN----------DP-GHAAVAVKINGKYYVLDQHLP 129 (153)
T ss_pred HHHHCCCCceEEEEEecCC----------CC-CeEEEEEEECCEEEEEeCCCC
Confidence 4556788777666655221 12 599999999999999998753
No 8
>PF14381 EDR1: Ethylene-responsive protein kinase Le-CTR1
Probab=96.30 E-value=0.0072 Score=46.40 Aligned_cols=46 Identities=17% Similarity=0.164 Sum_probs=36.4
Q ss_pred cccccCCcEEEEEEEeCCCCCCCCCcCCCCCCCceEEEEEEC-CeEEEEeccCCCcc
Q psy9431 2 FESYAGLHCVVIKGYSKSAGYQPGVKFEDNRFRNSWNAVYVA-GAWRFVQCNWGARH 57 (119)
Q Consensus 2 lc~~aGi~~~~V~G~~k~~~~~pg~~~~~~~~~HaWN~V~ld-g~W~liD~Twgag~ 57 (119)
||+..||+|++|.|.-.... ...||||.|+++ ++=|+||..-.=|.
T Consensus 155 LAD~iglpCrLvrG~~y~g~----------~~~~a~~~V~~~~~~eyiVDLm~~PG~ 201 (204)
T PF14381_consen 155 LADRIGLPCRLVRGCYYCGW----------DDDDASNLVKFDDGREYIVDLMGAPGQ 201 (204)
T ss_pred HHHhcCCCceEEeeccCCcc----------CCCCceEEEEcCCCcEEEEEcCCCCCC
Confidence 79999999999999754321 147999999997 67799998766554
No 9
>PF00797 Acetyltransf_2: N-acetyltransferase; InterPro: IPR001447 Arylamine N-acetyltransferase (NAT) is a cytosolic enzyme of approximately 30 kDa. It facilitates the transfer of an acetyl group from acetyl coenzyme A on to a wide range of arylamine, N-hydroxyarylamines and hydrazines. Acetylation of these compounds generally results in inactivation. NAT is found in many species from Mycobacteria (Mycobacterium tuberculosis, Mycobacterium smegmatis etc) to Homo sapiens (Human). It was the first enzyme to be observed to have polymorphic activity amongst human individuals. NAT is responsible for the inactivation of Isoniazid (a drug used to treat tuberculosis) in humans. The NAT protein has also been shown to be involved in the breakdown of folic acid. NAT catalyses the reaction: Acetyl-coA + arylamine = coA + N-acetylarylamine NAT is the target of a common genetic polymorphism of clinical relevance in humans. The N-acetylation polymorphism is determined by low or high NAT activity in liver. NAT has been implicated in the action and toxicity of amine-containing drugs, and in the susceptibility to cancer and systematic lupus erythematosus. Two highly similar human genes for NAT, termed NAT1 and NAT2, encode genetically invariant and variant NAT proteins, respectively. ; GO: 0016407 acetyltransferase activity, 0008152 metabolic process; PDB: 1W6F_A 1W5R_A 1GX3_D 2PQT_A 2IJA_A 1W4T_A 2BSZ_B 3D9W_B 3LTW_A 3LNB_A ....
Probab=94.18 E-value=0.14 Score=38.92 Aligned_cols=51 Identities=14% Similarity=0.124 Sum_probs=38.8
Q ss_pred cccccCCcEEEEEEEeCCCCCCCCCcCCCCCCCceEEEEEECCeEEEEeccCCCc
Q psy9431 2 FESYAGLHCVVIKGYSKSAGYQPGVKFEDNRFRNSWNAVYVAGAWRFVQCNWGAR 56 (119)
Q Consensus 2 lc~~aGi~~~~V~G~~k~~~~~pg~~~~~~~~~HaWN~V~ldg~W~liD~Twgag 56 (119)
+.+.+|.++..+.|-....+.... . ....|.=+.|.+||+.|+||+.+|++
T Consensus 59 lL~~lGf~v~~~~arv~~~~~~~~--~--~~~~H~~liV~~~~~~ylvDvGfG~~ 109 (240)
T PF00797_consen 59 LLRELGFDVTLVSARVYSPGGPDY--W--PPRTHLVLIVTLDGERYLVDVGFGGP 109 (240)
T ss_dssp HHHHCT-EEEEEEEEEETTTTTCC--S--SSEEEEEEEEEETTEEEEE-SSSTTC
T ss_pred HHHHCCCeEEEEEEEEEeCCCCCC--C--CCCceEEEEEEECCEEEEEeccCCCc
Confidence 457899999999998877632210 1 23589999999999999999999998
No 10
>COG3012 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=92.41 E-value=0.088 Score=38.67 Aligned_cols=16 Identities=19% Similarity=0.493 Sum_probs=14.4
Q ss_pred EEEECCeEEEEeccCC
Q psy9431 39 AVYVAGAWRFVQCNWG 54 (119)
Q Consensus 39 ~V~ldg~W~liD~Twg 54 (119)
-|+|||+||+||.|-.
T Consensus 115 Fvk~ngrWyyiDgtv~ 130 (151)
T COG3012 115 FVKINGRWYYIDGTVP 130 (151)
T ss_pred heEECCEEEEECCCCC
Confidence 4899999999999876
No 11
>PF14402 7TM_transglut: 7 transmembrane helices usually fused to an inactive transglutaminase
Probab=91.90 E-value=0.17 Score=41.26 Aligned_cols=48 Identities=21% Similarity=0.470 Sum_probs=37.9
Q ss_pred cccccCCcEEEEEEEeCCCCCCCCCcCCCCCCCceEEEEEECCeEEEEeccCCCc
Q psy9431 2 FESYAGLHCVVIKGYSKSAGYQPGVKFEDNRFRNSWNAVYVAGAWRFVQCNWGAR 56 (119)
Q Consensus 2 lc~~aGi~~~~V~G~~k~~~~~pg~~~~~~~~~HaWN~V~ldg~W~liD~Twgag 56 (119)
|++.+|||+++|.|.-=..+- . ...-+-|.+|+-+++|.++|+.=|..
T Consensus 6 lL~~a~Ipar~v~gl~Led~r----r---~q~l~~~lev~~~~~W~~f~p~tg~~ 53 (313)
T PF14402_consen 6 LLAMAGIPARVVHGLKLEDGR----R---RQSLEPWLEVFNGGKWVLFNPRTGEQ 53 (313)
T ss_pred HHHhCCCCccEeeeEEecCCc----c---ccCcHhHHheeeCCeEEEECCCCCCc
Confidence 678999999999997654321 1 13568999999999999999976664
No 12
>KOG0909|consensus
Probab=90.65 E-value=0.075 Score=45.31 Aligned_cols=39 Identities=23% Similarity=0.346 Sum_probs=32.3
Q ss_pred CcccccCCcEEEEEEEeCCCCCCCCCcCCCCCCCceEEEEEEC--CeEEEEeccCCC
Q psy9431 1 MFESYAGLHCVVIKGYSKSAGYQPGVKFEDNRFRNSWNAVYVA--GAWRFVQCNWGA 55 (119)
Q Consensus 1 ~lc~~aGi~~~~V~G~~k~~~~~pg~~~~~~~~~HaWN~V~ld--g~W~liD~Twga 55 (119)
.+|+++|+.+++|=- ...|-|++|+.+ .+|..||++=+.
T Consensus 232 llcralg~daR~i~d----------------~tDHVWtEvYS~~qqRW~HvDpcE~v 272 (500)
T KOG0909|consen 232 LLCRALGLDARYIWD----------------RTDHVWTEVYSNAQQRWVHVDPCENV 272 (500)
T ss_pred HHHHHcCCcceEEee----------------cCcchhHHhhhhhhheeEeecccccc
Confidence 379999999999832 247999999998 689999997554
No 13
>PF13471 Transglut_core3: Transglutaminase-like superfamily
Probab=86.47 E-value=0.92 Score=31.03 Aligned_cols=34 Identities=18% Similarity=0.381 Sum_probs=27.0
Q ss_pred cccccCCcEEEEEEEeCCCCCCCCCcCCCCCCCceEEEEEECCeE
Q psy9431 2 FESYAGLHCVVIKGYSKSAGYQPGVKFEDNRFRNSWNAVYVAGAW 46 (119)
Q Consensus 2 lc~~aGi~~~~V~G~~k~~~~~pg~~~~~~~~~HaWN~V~ldg~W 46 (119)
|++..||++.++.|..|..+ ++ ..|||-+ .||+=
T Consensus 68 ~L~~~gi~~~l~iGv~~~~~-----~~----~aHAWve--~~g~~ 101 (117)
T PF13471_consen 68 LLRRRGIPATLVIGVRKDDD-----PF----AAHAWVE--CGGRV 101 (117)
T ss_pred HHHhcCCCcEEEEEEeeCCC-----Cc----eEEEEEE--ECCEE
Confidence 67888999999999999863 33 4899987 56553
No 14
>PF01473 CW_binding_1: Putative cell wall binding repeat; InterPro: IPR018337 The cell wall-binding repeat (CW) is an about 20 amino acid residue module, essentially found in two bacterial Gram-positive protein families; the choline binding proteins and glucosyltransferases (2.4.1.5 from EC). In choline-binding proteins cell wall binding repeats bind to choline moieties of both teichoic and lipoteichoic acids, two components peculiar to the cell surface of Gram-positive bacteria [, ]. In glucosyltransferases the region spanning the CW repeats is a glucan binding domain []. Several crystal structures of CW have been solved [, ]. In the choline binding protein LytA, the repeats adopt a solenoid fold consisting exclusively of beta-hairpins that stack to form a left-handed superhelix with a boomerang-like shape. The choline groups bind between beta-hairpin 'steps' of the superhelix []. In Cpl-1 CW repeats assemble in two sub-domains: an N-terminal superhelical moiety similar to the LytA one and a C-terminal beta-sheet involved in interactions with the lysozyme domain. Choline is bound between repeats 1 and 2, and, 2 and 3 of the superhelical sub-domain []. Some proteins known to contain cell-wall binding repeats include: Pneumococcal N-acetylmuramoyl-L-alanine amidase (autolysin, lytA) (3.5.1.28 from EC). It is a surface-exposed enzyme that rules the self-destruction of pneumococcal cells through degradation of their peptidoglycan backbone. It mediates the release of toxic substances that damage the host tissues. Pneumococcal endo-beta-N-acetylglucosaminidase (lytB) (3.2.1.96 from EC). It plays an important role in cell wall degradation and cell separation. Pneumococcal teichoic acid phosphorylcholine esterase (pce or cbpE), a cell wall hydrolase important for cellular adhesion and colonisation. Lactobacillales glucosyltransferase. It catalyses the transfer of glucosyl units from the cleavage of sucrose to a growing chain of glucan. Clostridium difficile toxin A (tcdA) and toxin B (tcdb). They are the causative agents of the antibiotic-associated pseudomembranous colitis. They are intracellular acting toxins that reach their targets after receptor-mediated endocytosis. Clostridium acetobutylicum cspA protein. Siphoviridae bacteriophages N-acetylmuramoyl-L-alanine amidase. It lyses the bacterial host cell wall. Podoviridae lysozyme protein (cpl-1). It is capable of digesting the pneumococcal cell wall. The cell wall binding repeats are also known as the choline-binding repeats (ChBr) or the choline-binding domain (ChBD). ; PDB: 1GVM_C 2BML_B 1HCX_A 1OBA_A 1H09_A 2J8F_A 2IXU_A 2J8G_A 2IXV_A 2X8O_A ....
Probab=85.57 E-value=0.79 Score=21.87 Aligned_cols=12 Identities=25% Similarity=0.650 Sum_probs=9.8
Q ss_pred EEECCeEEEEec
Q psy9431 40 VYVAGAWRFVQC 51 (119)
Q Consensus 40 V~ldg~W~liD~ 51 (119)
|+++|+||.+|.
T Consensus 4 ~~~~~~wYy~~~ 15 (19)
T PF01473_consen 4 VQDNGNWYYFDS 15 (19)
T ss_dssp EEETTEEEEETT
T ss_pred EEECCEEEEeCC
Confidence 567899999975
No 15
>PRK15047 N-hydroxyarylamine O-acetyltransferase; Provisional
Probab=85.47 E-value=2.9 Score=33.39 Aligned_cols=50 Identities=12% Similarity=0.127 Sum_probs=37.9
Q ss_pred cccccCCcEEEEEEEeCCCCCCCCCcCCCCCCCceEEEEEECCeEEEEeccCCCc
Q psy9431 2 FESYAGLHCVVIKGYSKSAGYQPGVKFEDNRFRNSWNAVYVAGAWRFVQCNWGAR 56 (119)
Q Consensus 2 lc~~aGi~~~~V~G~~k~~~~~pg~~~~~~~~~HaWN~V~ldg~W~liD~Twgag 56 (119)
+.+++|..+..+.|-....+. ++ . ....|.=+.|.|||+-||+|+-.|+.
T Consensus 79 ~L~~LGF~v~~~~arV~~~~~-~~--~--~~~tH~~l~V~i~~~~yLvDVGFG~~ 128 (281)
T PRK15047 79 VLRELGFNVRSLLGRVVLSNP-PA--L--PPRTHRLLLVELEGEKWIADVGFGGQ 128 (281)
T ss_pred HHHHcCCcEEEEEEEEEecCC-CC--C--CCcCcEEEEEEECCeeEEEEecCCCC
Confidence 457889999998888764311 11 1 12489999999999999999999964
No 16
>PF15644 Tox-PL: Papain fold toxin 1; PDB: 3B21_A.
Probab=70.60 E-value=4.6 Score=27.46 Aligned_cols=19 Identities=21% Similarity=0.330 Sum_probs=16.0
Q ss_pred CceEEEEEECCeEEEEecc
Q psy9431 34 RNSWNAVYVAGAWRFVQCN 52 (119)
Q Consensus 34 ~HaWN~V~ldg~W~liD~T 52 (119)
.|+||+|.-+|+=.++|+=
T Consensus 88 gHa~nvv~~~G~i~~~D~Q 106 (111)
T PF15644_consen 88 GHAFNVVNQNGKIVFLDPQ 106 (111)
T ss_dssp TTEEEEEEE-SSEEEEBTT
T ss_pred ceEEEEEeCCCeEEEEeCC
Confidence 7999999999998888863
No 17
>PRK04233 hypothetical protein; Provisional
Probab=67.69 E-value=4.1 Score=29.17 Aligned_cols=14 Identities=21% Similarity=0.574 Sum_probs=11.4
Q ss_pred EEEEECCeEEEEec
Q psy9431 38 NAVYVAGAWRFVQC 51 (119)
Q Consensus 38 N~V~ldg~W~liD~ 51 (119)
.=|+.||+|+++|.
T Consensus 115 ~F~r~~g~W~YvDg 128 (129)
T PRK04233 115 RFVREDGRWYYLDA 128 (129)
T ss_pred eEEEECCEEEEecC
Confidence 34778999999984
No 18
>PF09027 GTPase_binding: GTPase binding; InterPro: IPR015116 The GTPase binding domain binds to the G protein Cdc42, inhibiting both its intrinsic and stimulated GTPase activity. The domain is largely unstructured in the absence of Cdc42 []. ; PDB: 1CF4_B.
Probab=66.99 E-value=2.2 Score=27.33 Aligned_cols=29 Identities=21% Similarity=0.351 Sum_probs=2.7
Q ss_pred cccCccccccCchhchhhccCCCCCcccc
Q psy9431 78 YEYDDHYFLTDPREFIYEFFPLQPEWRTH 106 (119)
Q Consensus 78 ~~~~~~YFl~~P~~fi~tH~P~d~~wQll 106 (119)
.++++.||.++|-.+++.......-.|++
T Consensus 34 ~~idn~yl~mdp~~~~~~~~s~a~p~~~~ 62 (66)
T PF09027_consen 34 SEIDNNYLNMDPIDMAYNLNSTARPQQHL 62 (66)
T ss_dssp ----TTT----------------------
T ss_pred hhhhhhhhcccccchhhcccccccccccc
Confidence 36688899999999998865544334544
No 19
>cd06890 PX_Bem1p The phosphoinositide binding Phox Homology domain of Bem1p. The PX domain is a phosphoinositide (PI) binding module present in many proteins with diverse functions such as cell signaling, vesicular trafficking, protein sorting, and lipid modification, among others. Members of this subfamily bear similarity to Saccharomyces cerevisiae Bem1p, containing two Src Homology 3 (SH3) domains at the N-terminus, a central PX domain, and a C-terminal PB1 domain. Bem1p is a scaffolding protein that is critical for proper Cdc42p activation during bud formation in yeast. During budding and mating, Bem1p migrates to the plasma membrane where it can serve as an adaptor for Cdc42p and some other proteins. Bem1p also functions as an effector of the G1 cyclin Cln3p and the cyclin-dependent kinase Cdc28p in promoting vacuolar fusion. The PX domain is involved in targeting of proteins to PI-enriched membranes, and may also be involved in protein-protein interaction. The PX domain of Bem1p
Probab=55.49 E-value=3.8 Score=28.12 Aligned_cols=38 Identities=24% Similarity=0.384 Sum_probs=23.0
Q ss_pred eEEEEeccCCCccccCCccccccccCCCCCCcccccCccccccCchhchhhccCCC
Q psy9431 45 AWRFVQCNWGARHLVNAKEVPKVGAKGKSDSLRYEYDDHYFLTDPREFIYEFFPLQ 100 (119)
Q Consensus 45 ~W~liD~Twgag~~~~~~~~p~~~~~~~~~~f~~~~~~~YFl~~P~~fi~tH~P~d 100 (119)
-||.|+++|.+|.. -...+.|.+||=|-. =+...||.+
T Consensus 15 y~Y~i~v~~s~~~~---------------~~v~RrY~dFy~Lh~---~L~~~fp~e 52 (112)
T cd06890 15 YWYRVRATLSDGKT---------------RYLCRYYQDFYKLHI---ALLDLFPAE 52 (112)
T ss_pred EEEEEEEEEcCCcE---------------EEEEEEHHHHHHHHH---HHHHhCcHh
Confidence 39999999988631 122356777765522 244555543
No 20
>PF12893 Lumazine_bd_2: Putative lumazine-binding; PDB: 3BLZ_C 3DUK_F 3FKA_C.
Probab=53.56 E-value=16 Score=24.43 Aligned_cols=19 Identities=21% Similarity=0.560 Sum_probs=15.6
Q ss_pred ceEEEEEECCeEEEEeccC
Q psy9431 35 NSWNAVYVAGAWRFVQCNW 53 (119)
Q Consensus 35 HaWN~V~ldg~W~liD~Tw 53 (119)
-....+++||+|.+|.-.|
T Consensus 97 d~~~L~K~dg~WkIv~k~~ 115 (116)
T PF12893_consen 97 DYFTLVKTDGGWKIVSKVY 115 (116)
T ss_dssp EEEEEEEETTEEEEEEEEE
T ss_pred EEEEEEEECCEEEEEEEec
Confidence 4577899999999998765
No 21
>PRK00183 hypothetical protein; Provisional
Probab=51.59 E-value=9.9 Score=28.09 Aligned_cols=16 Identities=25% Similarity=0.440 Sum_probs=12.5
Q ss_pred EEEEECCeEEEEeccC
Q psy9431 38 NAVYVAGAWRFVQCNW 53 (119)
Q Consensus 38 N~V~ldg~W~liD~Tw 53 (119)
+-|+.||+||.||..-
T Consensus 115 ~F~r~~g~W~YvDG~~ 130 (157)
T PRK00183 115 AFVQHQGRWYFIDPTV 130 (157)
T ss_pred eeeEeCCEEEeccccC
Confidence 3477899999999533
No 22
>PRK01842 hypothetical protein; Provisional
Probab=48.42 E-value=11 Score=27.62 Aligned_cols=13 Identities=38% Similarity=0.731 Sum_probs=10.4
Q ss_pred EEEE-CCeEEEEec
Q psy9431 39 AVYV-AGAWRFVQC 51 (119)
Q Consensus 39 ~V~l-dg~W~liD~ 51 (119)
=|+. ||+||++|-
T Consensus 132 F~r~~~G~W~YvDG 145 (149)
T PRK01842 132 FVRDEQGRWRYVDG 145 (149)
T ss_pred eEECCCCeEEEeCC
Confidence 3676 899999994
No 23
>PRK01617 hypothetical protein; Provisional
Probab=46.95 E-value=11 Score=27.60 Aligned_cols=15 Identities=13% Similarity=0.446 Sum_probs=12.1
Q ss_pred EEEECCeEEEEeccC
Q psy9431 39 AVYVAGAWRFVQCNW 53 (119)
Q Consensus 39 ~V~ldg~W~liD~Tw 53 (119)
=|+.||+|+++|..-
T Consensus 117 F~r~~g~W~Yvdg~~ 131 (154)
T PRK01617 117 FLKENGQWYYIDGTR 131 (154)
T ss_pred eEEeCCCEEecCCCC
Confidence 378899999999643
No 24
>PF12419 DUF3670: SNF2 Helicase protein ; InterPro: IPR022138 This domain family is found in bacteria, archaea and eukaryotes, and is approximately 140 amino acids in length. The family is found in association with PF00271 from PFAM, PF00176 from PFAM. Most of the proteins in this family are annotated as SNF2 helicases but there is little accompanying literature to confirm this.
Probab=46.46 E-value=15 Score=26.18 Aligned_cols=20 Identities=15% Similarity=0.053 Sum_probs=15.4
Q ss_pred eEEEEEECCeEEEEeccCCC
Q psy9431 36 SWNAVYVAGAWRFVQCNWGA 55 (119)
Q Consensus 36 aWN~V~ldg~W~liD~Twga 55 (119)
.--.|++.|+|..+|+---.
T Consensus 96 ~~~LV~~rg~WV~ld~~~l~ 115 (141)
T PF12419_consen 96 KRPLVRFRGRWVELDPEELR 115 (141)
T ss_pred CCCeEEECCEEEEECHHHHH
Confidence 34469999999999985533
No 25
>PF03835 Rad4: Rad4 transglutaminase-like domain; InterPro: IPR018325 RAD4/Xp-C proteins contain an ancient transglutaminase fold that is also found in peptide-N-glycanases (PNGases), which remove glycans from glycoproteins during their degradation. The PNGases retain the catalytic triad that is typical of this fold and are predicted to have a reaction mechanism similar to that involved in transglutamination. In contrast, the RAD4/Xp-C proteins are predicted to be inactive and are likely to only possess the interaction function in DNA repair []. ; GO: 0003684 damaged DNA binding, 0006289 nucleotide-excision repair, 0005634 nucleus; PDB: 2QSG_A 2QSF_A 2QSH_A 1X3W_A 1X3Z_A 3ESW_A.
Probab=45.07 E-value=37 Score=24.03 Aligned_cols=23 Identities=30% Similarity=0.577 Sum_probs=20.4
Q ss_pred CCCceEEEEEEC--CeEEEEeccCC
Q psy9431 32 RFRNSWNAVYVA--GAWRFVQCNWG 54 (119)
Q Consensus 32 ~~~HaWN~V~ld--g~W~liD~Twg 54 (119)
...|-|.+|+.. .+|.-||+.=+
T Consensus 33 ~~~~~W~EV~~~~~~rWI~VDp~~~ 57 (145)
T PF03835_consen 33 PYPNFWVEVYSPEEKRWIHVDPVVG 57 (145)
T ss_dssp TTTCEEEEEEETTTTEEEEEETTTS
T ss_pred CCCeEEEEEEecCCCeEEEeeeecc
Confidence 368999999996 89999999987
No 26
>PRK01752 hypothetical protein; Provisional
Probab=42.50 E-value=15 Score=27.14 Aligned_cols=15 Identities=27% Similarity=0.448 Sum_probs=12.3
Q ss_pred EEEECCeEEEEeccC
Q psy9431 39 AVYVAGAWRFVQCNW 53 (119)
Q Consensus 39 ~V~ldg~W~liD~Tw 53 (119)
-++.||+|+.+|-+-
T Consensus 117 F~r~~g~W~YvDG~~ 131 (156)
T PRK01752 117 FVKIDNRWYFVDPTV 131 (156)
T ss_pred heeccCCEEEecCCC
Confidence 467899999999655
No 27
>PRK02250 hypothetical protein; Provisional
Probab=41.73 E-value=16 Score=27.01 Aligned_cols=15 Identities=13% Similarity=0.554 Sum_probs=12.1
Q ss_pred EEECCeEEEEeccCC
Q psy9431 40 VYVAGAWRFVQCNWG 54 (119)
Q Consensus 40 V~ldg~W~liD~Twg 54 (119)
++.||+|+++|-...
T Consensus 114 ~r~~g~W~Yvdg~~~ 128 (166)
T PRK02250 114 LKENGLWYYIDGTFP 128 (166)
T ss_pred EeeCCEEEecCCCcC
Confidence 677999999996543
No 28
>PF12870 Lumazine_bd: Lumazine-binding domain; InterPro: IPR024267 This entry represents a lumazine-binding domain found in a family of putative lipoproteins from bacteria. Lumazine is a fluorescent accessory protein having 6,7-dimethyl-8-(1'-D-ribityl) lumazine (DMRL) as its authentic chromophore; it modulates the emission of bacterial luciferase to shorter wavelengths with increasing luminous strength.; PDB: 3K7C_C.
Probab=39.62 E-value=35 Score=21.67 Aligned_cols=15 Identities=20% Similarity=0.357 Sum_probs=12.3
Q ss_pred ceEEEEEECCeEEEE
Q psy9431 35 NSWNAVYVAGAWRFV 49 (119)
Q Consensus 35 HaWN~V~ldg~W~li 49 (119)
-....|+.||+|++.
T Consensus 97 ~~~~lvk~dg~Wkv~ 111 (111)
T PF12870_consen 97 FTVPLVKEDGKWKVC 111 (111)
T ss_dssp EEEEEEEETTEEEE-
T ss_pred EEEEEEEECCEEEeC
Confidence 478899999999974
No 29
>COG3549 HigB Plasmid maintenance system killer protein [General function prediction only]
Probab=38.00 E-value=34 Score=23.35 Aligned_cols=17 Identities=18% Similarity=0.444 Sum_probs=13.9
Q ss_pred CceEEEEEECCeEEEEe
Q psy9431 34 RNSWNAVYVAGAWRFVQ 50 (119)
Q Consensus 34 ~HaWN~V~ldg~W~liD 50 (119)
.--|-.+.||++|+||=
T Consensus 61 ~~g~ySIrvN~QwrLiF 77 (94)
T COG3549 61 LKGLYSIRVNQQWRLIF 77 (94)
T ss_pred ccCcEEEEECCEEEEEE
Confidence 34578899999999983
No 30
>PF02099 Josephin: Josephin; InterPro: IPR006155 Human genes containing triplet repeats can markedly expand in length, leading to neuropsychiatric disease. Expansion of triplet repeats explains the phenomenon of anticipation, i.e. the increasing severity or earlier age of onset in successive generations in a pedigree []. A novel gene containing CAG repeats has been identified and mapped to chromosome 14q32.1, the genetic locus for Machado-Joseph disease (MJD). Normally, the gene contains 13-36 CAG repeats, but most clinically diagnosed patients and all affected members of a family with the clinical and pathological diagnosis of MJD show expansion of the repeat number, from 68-79 []. Similar abnormalities in related genes may give rise to diseases similar to MJD. MJD is a neurodegenerative disorder characterised by cerebellar ataxia, pyramidal and extra-pyramidal signs, peripheral nerve palsy, external ophtalmoplegia, facial and lingual fasciculation and bulging. The disease is autosomal dominant, with late onset of symptoms, generally after the fourth decade.; GO: 0008242 omega peptidase activity; PDB: 3O65_G 1YZB_A 2JRI_A 2DOS_A 2AGA_A.
Probab=37.81 E-value=54 Score=24.01 Aligned_cols=27 Identities=15% Similarity=0.321 Sum_probs=22.0
Q ss_pred CCceEEEEEECCeEEEEeccCCCcccc
Q psy9431 33 FRNSWNAVYVAGAWRFVQCNWGARHLV 59 (119)
Q Consensus 33 ~~HaWN~V~ldg~W~liD~Twgag~~~ 59 (119)
..|....=+|+|.||-+|....++.+.
T Consensus 107 ~~HWf~iRki~~~wyNLDS~l~~P~~i 133 (157)
T PF02099_consen 107 SRHWFAIRKIGGQWYNLDSKLKEPELI 133 (157)
T ss_dssp TTEEEEEEEETTEEEEECTTTSS-EEE
T ss_pred CcceEEEEeeCCeeEeccCCCCCCccc
Confidence 578888888999999999999887543
No 31
>COG1489 SfsA DNA-binding protein, stimulates sugar fermentation [General function prediction only]
Probab=37.29 E-value=1.1e+02 Score=24.27 Aligned_cols=49 Identities=10% Similarity=0.069 Sum_probs=34.2
Q ss_pred EEEEEEEeCCCCCCCCCcCCC------------CCCCceEEEEEECCeEEEEeccCCCccc
Q psy9431 10 CVVIKGYSKSAGYQPGVKFED------------NRFRNSWNAVYVAGAWRFVQCNWGARHL 58 (119)
Q Consensus 10 ~~~V~G~~k~~~~~pg~~~~~------------~~~~HaWN~V~ldg~W~liD~Twgag~~ 58 (119)
-++++.+....|-..|-..++ .+..|.|-.|+.++.|..||..+.+..+
T Consensus 27 G~~~~~H~~ntGrm~~l~~pG~~v~l~~sd~~~rK~~~~~~~~~~~~~~v~VnT~l~N~l~ 87 (235)
T COG1489 27 GEEVTAHCPNTGRMTELLTPGNTVWLSRSDNPKRKYRYTLELVEADNTLVCVNTGLPNRLA 87 (235)
T ss_pred CeEEEEEcCCCCccccccCCCCEEEEEEecCCCccceEEEEEEEcCCeEEEEeCCccHHHH
Confidence 345566665555444322222 2478999999999999999999988744
No 32
>PF05015 Plasmid_killer: Plasmid maintenance system killer protein; InterPro: IPR007711 Several plasmids with proteic killer gene systems have been reported. All of them encode a stable toxin and an unstable antidote. Upon loss of the plasmid, the less stable inhibitor is inactivated more rapidly than the toxin, allowing the toxin to be activated. The activation of those systems result in cell filamentation and cessation of viable cell production. It has been verified that both the stable killer and the unstable inhibitor of the systems are short polypeptides. This family corresponds to the toxin.
Probab=36.06 E-value=30 Score=22.82 Aligned_cols=14 Identities=21% Similarity=0.705 Sum_probs=11.3
Q ss_pred eEEEEEECCeEEEE
Q psy9431 36 SWNAVYVAGAWRFV 49 (119)
Q Consensus 36 aWN~V~ldg~W~li 49 (119)
---+|.|+|.|++|
T Consensus 63 g~~Si~i~~~~Rli 76 (93)
T PF05015_consen 63 GQWSIRINGNWRLI 76 (93)
T ss_pred CcEEEEeCCCEEEE
Confidence 34579999999987
No 33
>PF03749 SfsA: Sugar fermentation stimulation protein; InterPro: IPR005224 The sugar fermentation stimulation protein is a probable regulatory factor involved in maltose metabolism. It contains a putative DNA-binding domain, and was isolated as a gene which enabled Escherichia coli W3110 (strain MK2001) to use maltose [].
Probab=34.98 E-value=99 Score=23.79 Aligned_cols=27 Identities=19% Similarity=0.302 Sum_probs=24.1
Q ss_pred CCCceEEEEEEC-CeEEEEeccCCCccc
Q psy9431 32 RFRNSWNAVYVA-GAWRFVQCNWGARHL 58 (119)
Q Consensus 32 ~~~HaWN~V~ld-g~W~liD~Twgag~~ 58 (119)
+..+.|-+|+.+ |.|..||+...+..+
T Consensus 50 Kt~y~l~av~~~~~~~V~int~~~N~lv 77 (215)
T PF03749_consen 50 KTKYTLEAVEKDNGVWVGINTQLPNRLV 77 (215)
T ss_pred CCcEEEEEEEcCCCeEEEEccchHHHHH
Confidence 578999999999 999999999887655
No 34
>TIGR00230 sfsA sugar fermentation stimulation protein. probable regulatory factor involved in maltose metabolism contains a putative DNA binding domain. Isolated as a gene which enabled E.coli strain MK2001 to use maltose.
Probab=32.35 E-value=1.6e+02 Score=23.00 Aligned_cols=27 Identities=26% Similarity=0.641 Sum_probs=23.8
Q ss_pred CCCceEEEEEECCeEEEEeccCCCccc
Q psy9431 32 RFRNSWNAVYVAGAWRFVQCNWGARHL 58 (119)
Q Consensus 32 ~~~HaWN~V~ldg~W~liD~Twgag~~ 58 (119)
+..+.|-+|+.+|.|..||+...+..+
T Consensus 63 K~~y~l~~v~~~g~~V~int~~~N~l~ 89 (232)
T TIGR00230 63 KLSYTWEAVQCDGGWVLVNTQLQNRLF 89 (232)
T ss_pred CCCEEEEEEEECCeEEEEcCccHHHHH
Confidence 467999999999999999999987755
No 35
>PRK09216 rplM 50S ribosomal protein L13; Reviewed
Probab=29.99 E-value=33 Score=24.96 Aligned_cols=13 Identities=15% Similarity=0.381 Sum_probs=10.2
Q ss_pred ECCeEEEEeccCC
Q psy9431 42 VAGAWRFVQCNWG 54 (119)
Q Consensus 42 ldg~W~liD~Twg 54 (119)
+..+|++|||+.-
T Consensus 11 ~~~~W~viDA~~~ 23 (144)
T PRK09216 11 VERKWYVIDAEGK 23 (144)
T ss_pred cCCCEEEEeCCCC
Confidence 3578999999763
No 36
>PF08272 Topo_Zn_Ribbon: Topoisomerase I zinc-ribbon-like ; InterPro: IPR013263 DNA topoisomerases regulate the number of topological links between two DNA strands (i.e. change the number of superhelical turns) by catalysing transient single- or double-strand breaks, crossing the strands through one another, then resealing the breaks []. These enzymes have several functions: to remove DNA supercoils during transcription and DNA replication; for strand breakage during recombination; for chromosome condensation; and to disentangle intertwined DNA during mitosis [, ]. DNA topoisomerases are divided into two classes: type I enzymes (5.99.1.2 from EC; topoisomerases I, III and V) break single-strand DNA, and type II enzymes (5.99.1.3 from EC; topoisomerases II, IV and VI) break double-strand DNA []. Type I topoisomerases are ATP-independent enzymes (except for reverse gyrase), and can be subdivided according to their structure and reaction mechanisms: type IA (bacterial and archaeal topoisomerase I, topoisomerase III and reverse gyrase) and type IB (eukaryotic topoisomerase I and topoisomerase V). These enzymes are primarily responsible for relaxing positively and/or negatively supercoiled DNA, except for reverse gyrase, which can introduce positive supercoils into DNA. This entry represents the C-terminal zinc-ribbon-like domain found in bacterial topoisomerase I (type IA) enzymes. Escherichia coli topoisomerase I proteins contain five copies of a zinc-ribbon-like domain at their C terminus, two of which have lost their cysteine residues and are therefore probably not able to bind zinc []. This domain is still considered to be a member of the zinc-ribbon superfamily despite not being able to bind zinc. More information about this protein can be found at Protein of the Month: DNA Topoisomerase [].; GO: 0003677 DNA binding, 0003918 DNA topoisomerase (ATP-hydrolyzing) activity, 0006265 DNA topological change, 0005694 chromosome; PDB: 1YUA_A.
Probab=29.15 E-value=7.1 Score=22.79 Aligned_cols=16 Identities=38% Similarity=0.895 Sum_probs=11.9
Q ss_pred CCceEEEEEECCeEEE
Q psy9431 33 FRNSWNAVYVAGAWRF 48 (119)
Q Consensus 33 ~~HaWN~V~ldg~W~l 48 (119)
-.-+|++.+.||+|+-
T Consensus 24 k~tgw~af~~~gkw~~ 39 (42)
T PF08272_consen 24 KFTGWSAFYKDGKWRV 39 (42)
T ss_dssp SEEEETCCCSSS-B-S
T ss_pred cEeeeeEeccCCeEec
Confidence 3579999999999974
No 37
>CHL00159 rpl13 ribosomal protein L13; Validated
Probab=28.86 E-value=36 Score=24.81 Aligned_cols=13 Identities=8% Similarity=0.235 Sum_probs=10.1
Q ss_pred ECCeEEEEeccCC
Q psy9431 42 VAGAWRFVQCNWG 54 (119)
Q Consensus 42 ldg~W~liD~Twg 54 (119)
+.-+|++|||+.-
T Consensus 12 ~~r~W~viDA~~~ 24 (143)
T CHL00159 12 KNRKWYIIDAKDQ 24 (143)
T ss_pred cCCCEEEEeCCCC
Confidence 3567999999764
No 38
>COG0764 FabA 3-hydroxymyristoyl/3-hydroxydecanoyl-(acyl carrier protein) dehydratases [Lipid metabolism]
Probab=27.83 E-value=28 Score=25.27 Aligned_cols=19 Identities=21% Similarity=0.219 Sum_probs=14.0
Q ss_pred ccccCchhchhh-ccCCCCC
Q psy9431 84 YFLTDPREFIYE-FFPLQPE 102 (119)
Q Consensus 84 YFl~~P~~fi~t-H~P~d~~ 102 (119)
|+++++++..++ |||.+|-
T Consensus 38 ~k~Vt~nepfF~gHFP~~Pi 57 (147)
T COG0764 38 IKNVTINEPFFTGHFPGDPI 57 (147)
T ss_pred EEccCCCCCeeCCcCCCCCC
Confidence 677777765555 9999884
No 39
>PF02982 Scytalone_dh: Scytalone dehydratase; InterPro: IPR004235 Scytalone dehydratase is a member of the group of enzymes involved in fungal melanin biosynthesis. It was first identified in a phytopathogenic fungus, Magnaporthe grisea (Rice blast fungus), which causes rice blast disease. Scytalone dehydratase is a molecular target of inhibitor design efforts aimed at protecting rice plants from fungal disease [, ].; GO: 0030411 scytalone dehydratase activity, 0006582 melanin metabolic process; PDB: 4STD_A 3STD_A 6STD_A 7STD_C 1STD_A 5STD_C 1IDP_B 2STD_A.
Probab=27.09 E-value=69 Score=23.91 Aligned_cols=17 Identities=29% Similarity=0.610 Sum_probs=13.8
Q ss_pred CCceEEE---EEECCeEEEE
Q psy9431 33 FRNSWNA---VYVAGAWRFV 49 (119)
Q Consensus 33 ~~HaWN~---V~ldg~W~li 49 (119)
..|++|. ++|||+|.+-
T Consensus 117 h~h~~~~h~Y~KvdG~WK~a 136 (160)
T PF02982_consen 117 HGHGTNTHWYRKVDGVWKFA 136 (160)
T ss_dssp EEEEEEEEEEEEETTEEEEE
T ss_pred ccceeEEEEEEEeCCEEEEe
Confidence 4689987 5799999984
No 40
>PF13610 DDE_Tnp_IS240: DDE domain
Probab=26.84 E-value=52 Score=22.95 Aligned_cols=16 Identities=19% Similarity=0.565 Sum_probs=11.4
Q ss_pred EEECCeEEEEeccCCC
Q psy9431 40 VYVAGAWRFVQCNWGA 55 (119)
Q Consensus 40 V~ldg~W~liD~Twga 55 (119)
|+|+|+|+.+-..-++
T Consensus 11 iki~G~~~yl~~aiD~ 26 (140)
T PF13610_consen 11 IKIKGKWHYLWRAIDA 26 (140)
T ss_pred EEECCEEEEEEEeecc
Confidence 8889998887444444
No 41
>PF03543 Peptidase_C58: Yersinia/Haemophilus virulence surface antigen; InterPro: IPR022767 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad []. This group of cysteine peptidases correspond to MEROPS peptidase family C58. Family C58 contains endopeptidases that also act as transamidases, attaching a lipid moiety to the newly exposed N terminus of the substrate. This entry includes protein YopT from Yersinia pestis involved in bacterial pathogenesis and Rhizobium sp. (strain NGR234) nodulation outer protein T (NopT).; GO: 0004197 cysteine-type endopeptidase activity; PDB: 1UKF_A.
Probab=24.77 E-value=77 Score=24.00 Aligned_cols=27 Identities=19% Similarity=0.154 Sum_probs=23.4
Q ss_pred CCceEEEEEECCeEEEEeccCCCcccc
Q psy9431 33 FRNSWNAVYVAGAWRFVQCNWGARHLV 59 (119)
Q Consensus 33 ~~HaWN~V~ldg~W~liD~Twgag~~~ 59 (119)
.+||.-+..-+++|.++|+..|.-.+.
T Consensus 142 ~~Ha~a~~~~~~~~~fFDPN~Ge~~f~ 168 (204)
T PF03543_consen 142 GGHAMAASVDGGKVTFFDPNYGEFSFP 168 (204)
T ss_dssp EEEEEEEEEETTEEEEEETTTEEEEEE
T ss_pred CCceEEEEEeCCEEEEECCCceEEEec
Confidence 589999998899999999999886553
No 42
>PF00391 PEP-utilizers: PEP-utilising enzyme, mobile domain; InterPro: IPR008279 A number of enzymes that catalyze the transfer of a phosphoryl group from phosphoenolpyruvate (PEP) via a phospho-histidine intermediate have been shown to be structurally related [, , , ]. All these enzymes share the same catalytic mechanism: they bind PEP and transfer the phosphoryl group from it to a histidine residue. This domain is a "swivelling" beta/beta/alpha domain which is thought to be mobile in all proteins known to contain it []. It is often found associated with the pyruvate phosphate dikinase, PEP/pyruvate-binding domain (IPR002192 from INTERPRO) at its N terminus.; GO: 0016772 transferase activity, transferring phosphorus-containing groups, 0016310 phosphorylation; PDB: 2X0S_A 2OLS_A 2HRO_A 2E28_A 2WQD_A 3T05_D 3T0T_D 3T07_B 2DIK_A 2FM4_A ....
Probab=24.64 E-value=24 Score=22.46 Aligned_cols=13 Identities=15% Similarity=0.289 Sum_probs=9.6
Q ss_pred CcccccCCcEEEE
Q psy9431 1 MFESYAGLHCVVI 13 (119)
Q Consensus 1 ~lc~~aGi~~~~V 13 (119)
+|||..||||.+=
T Consensus 47 IlAr~~giP~ivg 59 (80)
T PF00391_consen 47 ILARELGIPAIVG 59 (80)
T ss_dssp HHHHHTT-EEEES
T ss_pred HHHHHcCCCEEEe
Confidence 4799999998763
No 43
>PF11447 DUF3201: Protein of unknown function (DUF3201); InterPro: IPR024505 This archaeal family of proteins has no known function.; PDB: 1YB3_A.
Probab=23.69 E-value=61 Score=23.78 Aligned_cols=11 Identities=36% Similarity=1.123 Sum_probs=8.5
Q ss_pred EEECCeEEEEe
Q psy9431 40 VYVAGAWRFVQ 50 (119)
Q Consensus 40 V~ldg~W~liD 50 (119)
++|||+|+.+-
T Consensus 37 i~lDgeW~em~ 47 (150)
T PF11447_consen 37 IYLDGEWREMK 47 (150)
T ss_dssp EEETTEEEE--
T ss_pred EEecCeeeeec
Confidence 89999999864
No 44
>PF12883 DUF3828: Protein of unknown function (DUF3828); InterPro: IPR024289 This domain currently has no known function.; PDB: 3KZT_A.
Probab=23.10 E-value=1.1e+02 Score=20.88 Aligned_cols=18 Identities=17% Similarity=0.219 Sum_probs=15.7
Q ss_pred ceEEEEEECCeEEEEecc
Q psy9431 35 NSWNAVYVAGAWRFVQCN 52 (119)
Q Consensus 35 HaWN~V~ldg~W~liD~T 52 (119)
=.+..|+.+|+|.+-|++
T Consensus 100 ~~~~l~ke~g~WkI~~V~ 117 (120)
T PF12883_consen 100 VIVCLVKENGRWKIDDVR 117 (120)
T ss_dssp EEEEEEEETTEEEEEEES
T ss_pred EEEEEEEECCEEEEEEee
Confidence 378999999999998875
No 45
>PF09724 DUF2036: Uncharacterized conserved protein (DUF2036); InterPro: IPR019128 Sister chromatid cohesion protein DCC1 is a component of the RFC-like complex CTF18-RFC. This complex is required for the efficient establishment of chromosome cohesion during S-phase and may load or unload POL30/PCNA. During a clamp loading circle, the RFC:clamp complex binds to DNA and the recognition of the double-stranded/single-stranded junction stimulates ATP hydrolysis by RFC. The complex presumably provides bipartite ATP sites in which one subunit supplies a catalytic site for hydrolysis of ATP bound to the neighbouring subunit. Dissociation of RFC from the clamp leaves the clamp encircling DNA [, ].
Probab=22.08 E-value=42 Score=26.82 Aligned_cols=23 Identities=22% Similarity=0.394 Sum_probs=18.6
Q ss_pred eEEEEEECCeEEEEeccCCCccc
Q psy9431 36 SWNAVYVAGAWRFVQCNWGARHL 58 (119)
Q Consensus 36 aWN~V~ldg~W~liD~Twgag~~ 58 (119)
.-+++.|||.||++|...-...+
T Consensus 148 ~l~~~ei~G~~r~Ls~~~~~~~L 170 (325)
T PF09724_consen 148 ELGACEIDGYWRLLSPSYLFEIL 170 (325)
T ss_pred HCCEEEECCEEEECCHHHHHHHH
Confidence 45789999999999998766543
No 46
>PF15232 DUF4585: Domain of unknown function (DUF4585)
Probab=21.76 E-value=77 Score=20.77 Aligned_cols=15 Identities=20% Similarity=0.253 Sum_probs=11.0
Q ss_pred EEEEC---CeEEEEeccC
Q psy9431 39 AVYVA---GAWRFVQCNW 53 (119)
Q Consensus 39 ~V~ld---g~W~liD~Tw 53 (119)
.|.+| |++|+||+--
T Consensus 9 KvL~DP~SG~Yy~vd~P~ 26 (75)
T PF15232_consen 9 KVLQDPESGQYYVVDAPV 26 (75)
T ss_pred cEeecCCCCCEEEEecCC
Confidence 35666 8899998755
No 47
>PLN00205 ribisomal protein L13 family protein; Provisional
Probab=21.73 E-value=53 Score=25.14 Aligned_cols=11 Identities=18% Similarity=0.344 Sum_probs=8.9
Q ss_pred CeEEEEeccCC
Q psy9431 44 GAWRFVQCNWG 54 (119)
Q Consensus 44 g~W~liD~Twg 54 (119)
.+|++|||++-
T Consensus 15 r~W~VIDA~~~ 25 (191)
T PLN00205 15 LRWRVFDAKGQ 25 (191)
T ss_pred CcEEEEeCCCC
Confidence 47999998763
No 48
>PF09173 eIF2_C: Initiation factor eIF2 gamma, C terminal; InterPro: IPR015256 This entry represents a domain which is found in the initiation factors eIF2 and EF-Tu, adopting a beta barrel structure with Greek key topology. It is required for formation of the ternary complex with GTP and initiator tRNA []. ; PDB: 1S0U_A 1KK1_A 1KK0_A 1KK2_A 1KJZ_A 1KK3_A 2D74_A 2DCU_A 3P3M_A 3V11_A ....
Probab=21.72 E-value=64 Score=21.55 Aligned_cols=11 Identities=18% Similarity=0.685 Sum_probs=8.8
Q ss_pred EEECCeEEEEe
Q psy9431 40 VYVAGAWRFVQ 50 (119)
Q Consensus 40 V~ldg~W~liD 50 (119)
=+|+++|+||=
T Consensus 74 Rri~~rWRLIG 84 (88)
T PF09173_consen 74 RRIGNRWRLIG 84 (88)
T ss_dssp EEETTSEEEEE
T ss_pred hhccCeEEEEE
Confidence 36899999983
No 49
>PF04276 DUF443: Protein of unknown function (DUF443) ; InterPro: IPR005915 The members of this family share 50 % or greater sequence identity. They are found as eleven tandem genes, arranged head-to-tail, in Staphylococcus aureus (strain COL). Distant full-length homologs are found in a Staphylococcus haemolyticus plasmid and in Bacillus halodurans. The function of these proteins is unknown.
Probab=21.59 E-value=97 Score=23.30 Aligned_cols=14 Identities=0% Similarity=0.154 Sum_probs=12.6
Q ss_pred EEEEECCeEEEEec
Q psy9431 38 NAVYVAGAWRFVQC 51 (119)
Q Consensus 38 N~V~ldg~W~liD~ 51 (119)
..++.||++|++|.
T Consensus 5 ~ii~~~~~yy~iD~ 18 (199)
T PF04276_consen 5 KIIKYNNEYYMIDL 18 (199)
T ss_pred EEEEECCEEEEEEc
Confidence 46899999999998
No 50
>smart00785 AARP2CN AARP2CN (NUC121) domain. This domain is the central domain of AARP2. It is weakly similar to the GTP-binding domain of elongation factor TU PUBMED:15112237.
Probab=21.20 E-value=1.1e+02 Score=20.02 Aligned_cols=17 Identities=24% Similarity=0.575 Sum_probs=12.9
Q ss_pred EEEEEEEeCCCCCCCCC
Q psy9431 10 CVVIKGYSKSAGYQPGV 26 (119)
Q Consensus 10 ~~~V~G~~k~~~~~pg~ 26 (119)
...|.||.+|.+..+++
T Consensus 45 ~l~v~GyvRG~~l~~n~ 61 (83)
T smart00785 45 TLVVYGYVRGTGLNANQ 61 (83)
T ss_pred EEEEEEEEcCCCCCCCC
Confidence 47899999998655443
No 51
>PF03736 EPTP: EPTP domain; InterPro: IPR005492 Mutations in the LGI/EPT gene can result in a special form of epilepsy, autosomal dominant lateral temporal epilepsy. The Epitempin protein (also known as Leucine-rich glioma-inactivated protein) was seen to contain a 130 amino acid repeat in its C-terminal section, although a sub-domain of 50 amino acids has now been further defined within this. The architecture and structural features of this repeat make it a likely member 7-bladed beta-propeller fold []. This protein has now been found in a number of proteins associated with neurological disorders suggesting that it may play a role in the development of epilepsy and other related conditions [].
Probab=21.13 E-value=1.5e+02 Score=16.54 Aligned_cols=22 Identities=9% Similarity=0.095 Sum_probs=17.8
Q ss_pred CCCceEEEEEECCeEEEEeccC
Q psy9431 32 RFRNSWNAVYVAGAWRFVQCNW 53 (119)
Q Consensus 32 ~~~HaWN~V~ldg~W~liD~Tw 53 (119)
...-+|-.+.|+|+.||+=+.-
T Consensus 12 ~~~~~~e~F~i~~~~fl~~a~~ 33 (44)
T PF03736_consen 12 RGARDVEPFSIGGDQFLAVASF 33 (44)
T ss_pred cceeEEEEEEECCEEEEEEEeC
Confidence 3567999999999999985543
No 52
>smart00849 Lactamase_B Metallo-beta-lactamase superfamily. Apart from the beta-lactamases a number of other proteins contain this domain PUBMED:7588620. These proteins include thiolesterases, members of the glyoxalase II family, that catalyse the hydrolysis of S-D-lactoyl-glutathione to form glutathione and D-lactic acid and a competence protein that is essential for natural transformation in Neisseria gonorrhoeae and could be a transporter involved in DNA uptake. Except for the competence protein these proteins bind two zinc ions per molecule as cofactor.
Probab=20.31 E-value=1.5e+02 Score=20.08 Aligned_cols=24 Identities=13% Similarity=-0.039 Sum_probs=18.5
Q ss_pred CCceEEEEEECCeEEEEeccCCCc
Q psy9431 33 FRNSWNAVYVAGAWRFVQCNWGAR 56 (119)
Q Consensus 33 ~~HaWN~V~ldg~W~liD~Twgag 56 (119)
..+.-..|..+|++.|||+-++..
T Consensus 4 ~~~~~~li~~~~~~iliD~g~~~~ 27 (183)
T smart00849 4 VGVNSYLVEGDGGAILIDTGPGEA 27 (183)
T ss_pred cceeEEEEEeCCceEEEeCCCChh
Confidence 356667788889999999886653
No 53
>cd01287 FabA FabA, beta-hydroxydecanoyl-acyl carrier protein (ACP)-dehydratase: Bacterial protein of the type II, fatty acid synthase system that binds ACP and catalyzes both dehydration and isomerization reactions, apparently in the same active site. The FabA structure is a homodimer with two independent active sites located at the dimer interface. Each active site is tunnel-shaped and completely inaccessible to solvent. No metal ions or cofactors are required for ligand binding or catalysis.
Probab=20.06 E-value=48 Score=23.85 Aligned_cols=16 Identities=31% Similarity=0.501 Sum_probs=10.7
Q ss_pred cCchh-chhhccCCCCC
Q psy9431 87 TDPRE-FIYEFFPLQPE 102 (119)
Q Consensus 87 ~~P~~-fi~tH~P~d~~ 102 (119)
+++++ |+.-|||.+|.
T Consensus 35 v~~~e~ff~gHFp~~pv 51 (150)
T cd01287 35 IDPDDWFFPCHFHGDPV 51 (150)
T ss_pred cCCCCceEcCCCCCCCc
Confidence 33443 66779999873
Done!