Query psy13169
Match_columns 129
No_of_seqs 131 out of 1251
Neff 7.2
Searched_HMMs 46136
Date Fri Aug 16 21:56:08 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy13169.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/13169hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 TIGR02411 leuko_A4_hydro leuko 100.0 5.5E-28 1.2E-32 205.8 13.1 115 7-128 79-194 (601)
2 TIGR02412 pepN_strep_liv amino 99.9 2.3E-26 5E-31 201.6 14.0 114 6-128 75-191 (831)
3 PF01433 Peptidase_M1: Peptida 99.9 7.5E-27 1.6E-31 187.3 9.6 120 7-128 76-198 (390)
4 KOG1932|consensus 99.9 1.4E-23 3.1E-28 184.0 11.4 121 6-128 122-249 (1180)
5 KOG1047|consensus 99.9 1.6E-23 3.5E-28 174.0 7.6 118 8-128 84-203 (613)
6 PRK14015 pepN aminopeptidase N 99.9 8.2E-22 1.8E-26 173.6 13.0 106 7-128 84-195 (875)
7 TIGR02414 pepN_proteo aminopep 99.8 5.4E-20 1.2E-24 161.9 12.3 105 8-128 72-182 (863)
8 KOG1046|consensus 99.8 5.8E-20 1.3E-24 162.2 11.3 118 8-128 107-227 (882)
9 COG0308 PepN Aminopeptidase N 99.8 1.3E-19 2.8E-24 159.7 10.5 98 23-128 106-208 (859)
10 PF12518 DUF3721: Protein of u 62.6 2.8 6.1E-05 22.8 0.2 10 65-74 24-33 (34)
11 PF10633 NPCBM_assoc: NPCBM-as 61.9 13 0.00028 23.1 3.2 39 80-120 24-62 (78)
12 PF05964 FYRN: F/Y-rich N-term 52.5 22 0.00048 20.9 3.0 23 63-85 31-53 (54)
13 COG3449 DNA gyrase inhibitor [ 46.3 47 0.001 24.1 4.4 45 70-116 52-100 (154)
14 PF12969 DUF3857: Domain of Un 38.0 50 0.0011 23.1 3.5 46 72-118 122-168 (177)
15 KOG3341|consensus 30.9 24 0.00052 27.3 0.9 17 55-71 220-237 (249)
16 PF10989 DUF2808: Protein of u 28.7 1.5E+02 0.0031 20.9 4.6 34 6-40 88-124 (146)
17 PF00703 Glyco_hydro_2: Glycos 26.0 1.7E+02 0.0037 18.2 6.5 34 58-102 70-103 (110)
18 PHA03370 virion protein US2; P 25.7 32 0.00069 27.0 0.8 24 58-81 174-197 (269)
19 PF07703 A2M_N_2: Alpha-2-macr 24.9 1.7E+02 0.0038 19.4 4.4 49 74-122 12-64 (136)
20 KOG3285|consensus 23.0 86 0.0019 23.6 2.6 26 61-89 135-160 (203)
21 PRK10016 DNA gyrase inhibitor; 21.8 2.8E+02 0.006 19.1 5.3 27 65-91 46-76 (156)
No 1
>TIGR02411 leuko_A4_hydro leukotriene A-4 hydrolase/aminopeptidase. Members of this family represent a distinctive subset within the zinc metallopeptidase family M1 (pfam01433). The majority of the members of pfam01433 are aminopeptidases, but the sequences in this family for which the function is known are leukotriene A-4 hydrolase. A dual epoxide hydrolase and aminopeptidase activity at the same active site is indicated. The physiological substrate for aminopeptidase activity is not known.
Probab=99.95 E-value=5.5e-28 Score=205.78 Aligned_cols=115 Identities=17% Similarity=0.234 Sum_probs=100.1
Q ss_pred ccEEEEcCCCCCCcce-EEEEEEEeCCCCceeEEEccCCCCccCCCccEEEeccCCCCcCeEEeeCCCCCCceeEEEEEE
Q psy13169 7 CGVHFVVPEGDGTLVD-RNAHLFTCPHENSGVHFVVPEGDGTLVDRNAHLFTCPHENSARLWFPCIDTYSEVCTWRLEFT 85 (129)
Q Consensus 7 ~~L~I~lp~~~~~~~~-~i~I~y~~~~p~~Gl~f~~~~~~~~~~~~~~~~~T~~q~~~Ar~wfPc~D~P~~Kat~~l~v~ 85 (129)
+.|.|.||..+..+.. +|+|.|++.....|++|..++++ .+++.+|+||||||++||+||||+|+|++|++|+++|+
T Consensus 79 ~~L~I~l~~~l~~g~~~~l~I~Y~~~~~~~gl~~~~~~~t--~g~~~py~~Tq~qp~~AR~~fPC~D~P~~Katf~~~I~ 156 (601)
T TIGR02411 79 SPLTISLPIATSKNKELVLNISFSTTPKCTALQWLTPEQT--SGKKHPYLFSQCQAIHARSVIPCQDTPSVKSTYTAEVE 156 (601)
T ss_pred CeEEEEeCCccCCCceEEEEEEEeecCCCceeEEeccccc--CCCCCCEEEECCcccchheeeeecCCcccceEEEEEEe
Confidence 4699999998876654 99999997667789999998764 34567999999999999999999999999999999999
Q ss_pred EcCCCeEeecCeeeeeEEecCCCcEEEEEEcCCCCCcceEeee
Q psy13169 86 VDENLTAVSCGDLIEVVYTPDMRRKTFFYSLNIPAPAPSLEIY 128 (129)
Q Consensus 86 vp~~~~avsnG~l~~~~~~~~~~~~t~~f~~~~P~s~yli~fa 128 (129)
+| +.|++||.++++. . ++..+|+|++++|||+||||||
T Consensus 157 ~P--~~av~sg~~~~~~--~-~~~~~~~F~~t~pmptYLia~a 194 (601)
T TIGR02411 157 SP--LPVLMSGIPDGET--S-NDPGKYLFKQKVPIPAYLIALA 194 (601)
T ss_pred eC--cceeccCCccccc--c-CCCceEEEEeCCCcchhhheee
Confidence 99 9999999876544 2 3456899999999999999997
No 2
>TIGR02412 pepN_strep_liv aminopeptidase N, Streptomyces lividans type. This family is a subset of the members of the zinc metallopeptidase family M1 (pfam01433), with a single member characterized in Streptomyces lividans 66 and designated aminopeptidase N. The spectrum of activity may differ somewhat from the aminopeptidase N clade of E. coli and most other Proteobacteria, well separated phylogenetically within the M1 family. The M1 family also includes leukotriene A-4 hydrolase/aminopeptidase (with a bifunctional active site).
Probab=99.94 E-value=2.3e-26 Score=201.60 Aligned_cols=114 Identities=18% Similarity=0.190 Sum_probs=93.9
Q ss_pred cccEEEEcCCCCCCcceEEEEEEE--eCCCCceeE-EEccCCCCccCCCccEEEeccCCCCcCeEEeeCCCCCCceeEEE
Q psy13169 6 FCGVHFVVPEGDGTLVDRNAHLFT--CPHENSGVH-FVVPEGDGTLVDRNAHLFTCPHENSARLWFPCIDTYSEVCTWRL 82 (129)
Q Consensus 6 ~~~L~I~lp~~~~~~~~~i~I~y~--~~~p~~Gl~-f~~~~~~~~~~~~~~~~~T~~q~~~Ar~wfPc~D~P~~Kat~~l 82 (129)
..+.+|.||+ +..+..+|+|+|. +.++..|++ |.++++ ++.|++||+||.+||+||||||+|++||+|++
T Consensus 75 ~~~~~i~l~~-l~~g~~~l~i~~~~~~~~~~~Gl~~~~~~~~------g~~~~~Tq~ep~~Ar~~fPcfDeP~~KAtf~l 147 (831)
T TIGR02412 75 YDGSRIPLPG-LLTGENTLRVEATRAYTNTGEGLHRFVDPVD------GEVYLYTQFEPADARRVFAVFDQPDLKANFKF 147 (831)
T ss_pred cCCCEEEccC-CCCCceEEEEEEEEEecCCCceEEEEEeCCC------CeEEEEECCCCcCceeeEecCCCCCCceeEEE
Confidence 3457889987 5444335666654 567788998 554432 35799999999999999999999999999999
Q ss_pred EEEEcCCCeEeecCeeeeeEEecCCCcEEEEEEcCCCCCcceEeee
Q psy13169 83 EFTVDENLTAVSCGDLIEVVYTPDMRRKTFFYSLNIPAPAPSLEIY 128 (129)
Q Consensus 83 ~v~vp~~~~avsnG~l~~~~~~~~~~~~t~~f~~~~P~s~yli~fa 128 (129)
+|++|++|+|+|||.+.+.. ..+++++|+|++++|||+|+++|+
T Consensus 148 tit~p~~~~v~sNg~~~~~~--~~~~~~~~~F~~t~pmstYL~a~~ 191 (831)
T TIGR02412 148 SVKAPEDWTVISNSRETDVT--PEPADRRWEFPETPKLSTYLTAVA 191 (831)
T ss_pred EEEECCCceEECCCcccccc--ccCCCeEEEecCCCCcccceEEEE
Confidence 99999999999999987665 445678899999999999999997
No 3
>PF01433 Peptidase_M1: Peptidase family M1 This is family M1 in the peptidase classification.; InterPro: IPR014782 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M1 (clan MA(E)), the type example being aminopeptidase N from Homo sapiens (Human). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Membrane alanine aminopeptidase (3.4.11.2 from EC) is part of the HEXXH+E group; it consists entirely of aminopeptidases, spread across a wide variety of species []. Functional studies show that CD13/APN catalyzes the removal of single amino acids from the amino terminus of small peptides and probably plays a role in their final digestion; one family member (leukotriene-A4 hydrolase) is known to hydrolyse the epoxide leukotriene-A4 to form an inflammatory mediator []. This hydrolase has been shown to have aminopeptidase activity [], and the zinc ligands of the M1 family were identified by site-directed mutagenesis on this enzyme [] CD13 participates in trimming peptides bound to MHC class II molecules [] and cleaves MIP-1 chemokine, which alters target cell specificity from basophils to eosinophils []. CD13 acts as a receptor for specific strains of RNA viruses (coronaviruses) which cause a relatively large percentage of upper respiratory trace infections. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0008237 metallopeptidase activity, 0008270 zinc ion binding; PDB: 2XQ0_A 2XPY_A 2XPZ_A 3SE6_B 3EBH_A 3EBG_A 3T8V_A 3Q44_A 3Q43_A 3EBI_A ....
Probab=99.94 E-value=7.5e-27 Score=187.29 Aligned_cols=120 Identities=23% Similarity=0.224 Sum_probs=96.7
Q ss_pred ccEEEEcCCCCCCcce-EEEEEEEe--CCCCceeEEEccCCCCccCCCccEEEeccCCCCcCeEEeeCCCCCCceeEEEE
Q psy13169 7 CGVHFVVPEGDGTLVD-RNAHLFTC--PHENSGVHFVVPEGDGTLVDRNAHLFTCPHENSARLWFPCIDTYSEVCTWRLE 83 (129)
Q Consensus 7 ~~L~I~lp~~~~~~~~-~i~I~y~~--~~p~~Gl~f~~~~~~~~~~~~~~~~~T~~q~~~Ar~wfPc~D~P~~Kat~~l~ 83 (129)
+.|.|.|+..+..+.. +|+|.|++ .+...|++...+... +.++..++++|++||.+||+||||+|+|++||+|+++
T Consensus 76 ~~l~I~l~~~l~~g~~~~L~I~y~g~~~~~~~G~~~~~y~~~-~~~~~~~~~~t~~~p~~ar~~fPc~D~p~~ka~f~~~ 154 (390)
T PF01433_consen 76 EKLTITLPKPLPPGSNYTLRIEYSGKISDDSSGLYRSSYTDQ-TNGNTRWYIYTQFEPNGARRWFPCFDEPSFKATFDLT 154 (390)
T ss_dssp TEEEEEEEEECSTTEEEEEEEEEEEECBSSSSEEEEEEEE-G-TSSSETCEEEEE-TTTTGGGTSSB--STTSEEEEEEE
T ss_pred ceeehhhhhhcccCcEEEEEEEEeecccccccccccceeecc-cccccCCceeecccccccceeeeeeccCCccceEEEe
Confidence 3599999998888864 99999995 455789985444320 1234578999999999999999999999999999999
Q ss_pred EEEcCCCeEeecCeeeeeEEecCCCcEEEEEEcCCCCCcceEeee
Q psy13169 84 FTVDENLTAVSCGDLIEVVYTPDMRRKTFFYSLNIPAPAPSLEIY 128 (129)
Q Consensus 84 v~vp~~~~avsnG~l~~~~~~~~~~~~t~~f~~~~P~s~yli~fa 128 (129)
|++|++++|+|||.++++.... +++++++|+.+.|||+|++||+
T Consensus 155 i~~p~~~~~~sng~~~~~~~~~-~~~~~~~f~~t~p~~~yl~a~~ 198 (390)
T PF01433_consen 155 ITHPKDYTALSNGPLEEEESND-DGWKTTTFETTPPMPTYLFAFA 198 (390)
T ss_dssp EEEETTTEEEESSEEEEEEEET-TTEEEEEEEEEEEEEGGG--EE
T ss_pred eeccccceeecccccccccccc-ccceeEeeecccccCchhhhhh
Confidence 9999999999999998887444 5699999999999999999997
No 4
>KOG1932|consensus
Probab=99.90 E-value=1.4e-23 Score=183.96 Aligned_cols=121 Identities=40% Similarity=0.736 Sum_probs=106.2
Q ss_pred cccEEEEcCCCCCCc---ce--EEEEEEEeCCCCceeEEEccCCCCccCCCccEEEeccC--CCCcCeEEeeCCCCCCce
Q psy13169 6 FCGVHFVVPEGDGTL---VD--RNAHLFTCPHENSGVHFVVPEGDGTLVDRNAHLFTCPH--ENSARLWFPCIDTYSEVC 78 (129)
Q Consensus 6 ~~~L~I~lp~~~~~~---~~--~i~I~y~~~~p~~Gl~f~~~~~~~~~~~~~~~~~T~~q--~~~Ar~wfPc~D~P~~Ka 78 (129)
.|+|.|.+|++.... .. +|+|+|++.+|..|++|+.+.-. ...+.+||||.+- ..+||+||||+|.++++|
T Consensus 122 ~g~L~I~ipk~~~~~~ee~~~lr~~I~~s~~~pk~gi~Fv~~~~~--~~~~~~hvft~~~~~~s~ar~WfPCvD~~~e~~ 199 (1180)
T KOG1932|consen 122 NGELLIKIPKESKKVGEELKALRLRIDFSVREPKDGIKFVRPNYI--VSPRDKHVFTNNTQISSSARSWFPCVDSSYERC 199 (1180)
T ss_pred CCeEEEEcCchhhhhhhhccceEEEEEEEccCCCCCeEEeccCcc--cCcccCceEeecCccccccceEEeecCCccccc
Confidence 589999999974322 22 78899999999999999998621 1247789999883 567999999999999999
Q ss_pred eEEEEEEEcCCCeEeecCeeeeeEEecCCCcEEEEEEcCCCCCcceEeee
Q psy13169 79 TWRLEFTVDENLTAVSCGDLIEVVYTPDMRRKTFFYSLNIPAPAPSLEIY 128 (129)
Q Consensus 79 t~~l~v~vp~~~~avsnG~l~~~~~~~~~~~~t~~f~~~~P~s~yli~fa 128 (129)
+|+|+++||+++++|++|+|.++...+|.+++||+|+.++|+++..||||
T Consensus 200 tWeLeftvp~~~~av~~geLl~~v~~~D~~Kkt~~ys~tvPvA~~~I~~A 249 (1180)
T KOG1932|consen 200 TWELEFTVPKNLVAVSCGELLEQVETPDLRKKTYHYSLTVPVAPSNIGFA 249 (1180)
T ss_pred eEEEEEEecccceeccchhhhheeecccccccEEEEEEeccCCcccccee
Confidence 99999999999999999999999877888899999999999999999998
No 5
>KOG1047|consensus
Probab=99.89 E-value=1.6e-23 Score=174.04 Aligned_cols=118 Identities=21% Similarity=0.252 Sum_probs=100.3
Q ss_pred cEEEEcCCCCCC-cc-eEEEEEEEeCCCCceeEEEccCCCCccCCCccEEEeccCCCCcCeEEeeCCCCCCceeEEEEEE
Q psy13169 8 GVHFVVPEGDGT-LV-DRNAHLFTCPHENSGVHFVVPEGDGTLVDRNAHLFTCPHENSARLWFPCIDTYSEVCTWRLEFT 85 (129)
Q Consensus 8 ~L~I~lp~~~~~-~~-~~i~I~y~~~~p~~Gl~f~~~~~~~~~~~~~~~~~T~~q~~~Ar~wfPc~D~P~~Kat~~l~v~ 85 (129)
++.+.||....+ +. ..|.|.|...+...||+|+.|+++ .+++.||+|+|||+++||+.|||+|+|+.|.||++.|.
T Consensus 84 ~~~~~l~~~~~~a~~~~~l~i~y~Ts~~atalqwL~peQT--~gk~~PylfsQCQAIhaRsi~PC~DTPavK~ty~a~v~ 161 (613)
T KOG1047|consen 84 GQKLVLPAPSSKAGERLQLLIWYETSPSATALQWLNPEQT--SGKKHPYLFSQCQAIHARSIFPCQDTPAVKSTYTAEVE 161 (613)
T ss_pred ceEEEeccccccccCceEEEEEEeccCCcceeEEeccccc--cCCCCCchHHHHHHhHHheeccccCCCcceeEEEEEEE
Confidence 455666654332 23 399999998899999999999997 56677999999999999999999999999999999999
Q ss_pred EcCCCeEeecCeeeeeEEecCCCcEEEEEEcCCCCCcceEeee
Q psy13169 86 VDENLTAVSCGDLIEVVYTPDMRRKTFFYSLNIPAPAPSLEIY 128 (129)
Q Consensus 86 vp~~~~avsnG~l~~~~~~~~~~~~t~~f~~~~P~s~yli~fa 128 (129)
+|.++.+++++...+++. ...++.+|+|++.+||++|||||+
T Consensus 162 vp~~l~a~mSai~~~~~~-~~~~~~~f~f~q~~pIP~YLiai~ 203 (613)
T KOG1047|consen 162 VPMGLTALMSAIPAGEKP-GSNGRAIFRFKQEVPIPSYLIAIA 203 (613)
T ss_pred cCCcceeeeeccccccCC-CCCCcceEEEEeccCchhhhHHHh
Confidence 999999999997766551 123477899999999999999997
No 6
>PRK14015 pepN aminopeptidase N; Provisional
Probab=99.87 E-value=8.2e-22 Score=173.61 Aligned_cols=106 Identities=12% Similarity=0.015 Sum_probs=86.8
Q ss_pred ccEEEEcCCCCCCcceEEEEEEEeCC----CCceeEEEccCCCCccCCCccEEEeccCCCCcCeEEeeCCCCCCceeEEE
Q psy13169 7 CGVHFVVPEGDGTLVDRNAHLFTCPH----ENSGVHFVVPEGDGTLVDRNAHLFTCPHENSARLWFPCIDTYSEVCTWRL 82 (129)
Q Consensus 7 ~~L~I~lp~~~~~~~~~i~I~y~~~~----p~~Gl~f~~~~~~~~~~~~~~~~~T~~q~~~Ar~wfPc~D~P~~Kat~~l 82 (129)
+.|.|.+++. ..+|+|+|++.. ...|+|+.. .+++|||||++||+||||+|+|++|++|++
T Consensus 84 ~~L~I~~l~~----~~~l~I~y~~~P~~n~~l~Gly~s~-----------~~~~TQ~Ep~gAR~~fPc~D~P~~KAtf~i 148 (875)
T PRK14015 84 EGLTIENLPD----RFTLEIETEIDPEANTALEGLYRSG-----------GMFCTQCEAEGFRRITYFLDRPDVLARYTV 148 (875)
T ss_pred CEEEEecCCc----cEEEEEEEEEecCCCCCceeeEEEC-----------CEEEEeccccCcCCcccCCCCCCCCeeEEE
Confidence 4688875431 248899998532 246998752 178999999999999999999999999999
Q ss_pred EEEEcC-CC-eEeecCeeeeeEEecCCCcEEEEEEcCCCCCcceEeee
Q psy13169 83 EFTVDE-NL-TAVSCGDLIEVVYTPDMRRKTFFYSLNIPAPAPSLEIY 128 (129)
Q Consensus 83 ~v~vp~-~~-~avsnG~l~~~~~~~~~~~~t~~f~~~~P~s~yli~fa 128 (129)
+|++|+ .| +++|||.++++.. ..+++++++|+.+.|||+||++|+
T Consensus 149 tI~~p~~~~~~~lSNG~l~~~~~-~~~g~~~~~w~~~~PmpsYL~Al~ 195 (875)
T PRK14015 149 RIEADKAKYPVLLSNGNLVESGE-LPDGRHWATWEDPFPKPSYLFALV 195 (875)
T ss_pred EEEEccccCeEEecCCcccccee-ccCCeEEEEEEeCCCcccceEEEE
Confidence 999999 48 6899999987752 246688899999999999999997
No 7
>TIGR02414 pepN_proteo aminopeptidase N, Escherichia coli type. The M1 family of zinc metallopeptidases contains a number of distinct, well-separated clades of proteins with aminopeptidase activity. Several are designated aminopeptidase N, EC 3.4.11.2, after the Escherichia coli enzyme, suggesting a similar activity profile. This family consists of all aminopeptidases closely related to E. coli PepN and presumed to have similar (not identical) function. Nearly all are found in Proteobacteria, but members are found also in Cyanobacteria, plants, and apicomplexan parasites. This family differs greatly in sequence from the family of aminopeptidases typified by Streptomyces lividans PepN (TIGR02412), from the membrane bound aminopeptidase N family in animals, etc.
Probab=99.83 E-value=5.4e-20 Score=161.93 Aligned_cols=105 Identities=11% Similarity=-0.030 Sum_probs=84.8
Q ss_pred cEEEEcCCCCCCcceEEEEEEEeC----CCCceeEEEccCCCCccCCCccEEEeccCCCCcCeEEeeCCCCCCceeEEEE
Q psy13169 8 GVHFVVPEGDGTLVDRNAHLFTCP----HENSGVHFVVPEGDGTLVDRNAHLFTCPHENSARLWFPCIDTYSEVCTWRLE 83 (129)
Q Consensus 8 ~L~I~lp~~~~~~~~~i~I~y~~~----~p~~Gl~f~~~~~~~~~~~~~~~~~T~~q~~~Ar~wfPc~D~P~~Kat~~l~ 83 (129)
.|.|.+.+ +.-+|+|.|.+. ....|+|+.. .+++||||+++||+||||+|+|++||+|+++
T Consensus 72 ~L~I~~~~----~~~~l~i~~~~~p~~n~~l~GlY~s~-----------~~~~TQ~Ep~gaR~ifpc~DeP~~kAtf~vt 136 (863)
T TIGR02414 72 TLTIASVP----ESFTLEIETEIHPEENTSLEGLYKSG-----------GNFCTQCEAEGFRRITYFPDRPDVMSRYTVT 136 (863)
T ss_pred EEEEeeCC----ccEEEEEEEEeecccCCCCeEEEEeC-----------CeEEEEecCCCCCcCCCCCCCCCCceEEEEE
Confidence 46666432 223788888752 3457998753 2689999999999999999999999999999
Q ss_pred EEEcCC-C-eEeecCeeeeeEEecCCCcEEEEEEcCCCCCcceEeee
Q psy13169 84 FTVDEN-L-TAVSCGDLIEVVYTPDMRRKTFFYSLNIPAPAPSLEIY 128 (129)
Q Consensus 84 v~vp~~-~-~avsnG~l~~~~~~~~~~~~t~~f~~~~P~s~yli~fa 128 (129)
|++|++ | +++|||.++++.. ..+++++++|+.+.|||+||++|+
T Consensus 137 I~~p~~~y~v~lSNg~~~~~~~-~~~g~~~~~f~~t~pmptYLfA~v 182 (863)
T TIGR02414 137 ITADKKKYPVLLSNGNKIASGE-LPDGRHWAEWEDPFPKPSYLFALV 182 (863)
T ss_pred EEECCCcceEEEeCCcccccee-cCCCeEEEEEeCCCCcChhHheEE
Confidence 999986 6 6689998886652 246678899999999999999997
No 8
>KOG1046|consensus
Probab=99.82 E-value=5.8e-20 Score=162.22 Aligned_cols=118 Identities=17% Similarity=0.050 Sum_probs=95.1
Q ss_pred cEEEEcCCCCCCcce-EEEEEEEe--CCCCceeEEEccCCCCccCCCccEEEeccCCCCcCeEEeeCCCCCCceeEEEEE
Q psy13169 8 GVHFVVPEGDGTLVD-RNAHLFTC--PHENSGVHFVVPEGDGTLVDRNAHLFTCPHENSARLWFPCIDTYSEVCTWRLEF 84 (129)
Q Consensus 8 ~L~I~lp~~~~~~~~-~i~I~y~~--~~p~~Gl~f~~~~~~~~~~~~~~~~~T~~q~~~Ar~wfPc~D~P~~Kat~~l~v 84 (129)
.|.+.+++.+..+.. .|+|.|.. .+...|+|...+...+ +..+...-||+||.+||..|||||+|+.||+|.++|
T Consensus 107 ~l~~~~~~~l~~~~~y~L~i~f~g~l~~~~~G~y~s~y~~~~--~~~~~~~~Tqfept~AR~~FPCfDeP~~KAtF~Itl 184 (882)
T KOG1046|consen 107 TLVFPLNETLLAGSSYTLTIEFTGKLNDSSEGFYRSSYTDSE--GSEKSIAATQFEPTDARRAFPCFDEPAFKATFTITL 184 (882)
T ss_pred EEEEEcccccccCCeEEEEEEEeEeecCCcceeeeecccCCC--CceEEEEEeccCccchhhcCCCCCcccccCceEEEE
Confidence 356666666666664 99999984 3455799988765421 112334459999999999999999999999999999
Q ss_pred EEcCCCeEeecCeeeeeEEecCCCcEEEEEEcCCCCCcceEeee
Q psy13169 85 TVDENLTAVSCGDLIEVVYTPDMRRKTFFYSLNIPAPAPSLEIY 128 (129)
Q Consensus 85 ~vp~~~~avsnG~l~~~~~~~~~~~~t~~f~~~~P~s~yli~fa 128 (129)
..|+++.|+|||..+++...+++ +++-.|..++|||+|++||+
T Consensus 185 ~hp~~~~aLSNm~v~~~~~~~~~-~~~~~F~~Tp~MstYLvAf~ 227 (882)
T KOG1046|consen 185 VHPKGYTALSNMPVIKEEPVDDG-WKTTTFEKTPKMSTYLVAFA 227 (882)
T ss_pred EecCCceEeecCcccccccccCC-eeEEEEEecCCCchhhheee
Confidence 99999999999988877644444 99999999999999999997
No 9
>COG0308 PepN Aminopeptidase N [Amino acid transport and metabolism]
Probab=99.81 E-value=1.3e-19 Score=159.73 Aligned_cols=98 Identities=23% Similarity=0.248 Sum_probs=80.5
Q ss_pred EEEEEEEeC-----CCCceeEEEccCCCCccCCCccEEEeccCCCCcCeEEeeCCCCCCceeEEEEEEEcCCCeEeecCe
Q psy13169 23 RNAHLFTCP-----HENSGVHFVVPEGDGTLVDRNAHLFTCPHENSARLWFPCIDTYSEVCTWRLEFTVDENLTAVSCGD 97 (129)
Q Consensus 23 ~i~I~y~~~-----~p~~Gl~f~~~~~~~~~~~~~~~~~T~~q~~~Ar~wfPc~D~P~~Kat~~l~v~vp~~~~avsnG~ 97 (129)
.++|.|... ....|+|.... .+.++++||||+.+||+||||+|+|+.|++|+++|++++++++||||.
T Consensus 106 ~l~i~~~~~~~~s~~~~~Gly~~~~-------~~~~~~~TQ~Ea~~aR~~fpc~D~P~~katf~~~i~~~k~~~~iSN~~ 178 (859)
T COG0308 106 TLAITYEFTGPVSNDTLEGLYRSGY-------GGKPYLITQCEAEGARRIFPCIDEPDVKATFTLTIRADKGPKLISNGN 178 (859)
T ss_pred cEEEEEEecccccCccccceeecCC-------CCCeeEEeecccCCCceeeecCCCCCCcceeEEEEEecCcceeeecCC
Confidence 555555543 44567765531 126799999999999999999999999999999999999999999999
Q ss_pred eeeeEEecCCCcEEEEEEcCCCCCcceEeee
Q psy13169 98 LIEVVYTPDMRRKTFFYSLNIPAPAPSLEIY 128 (129)
Q Consensus 98 l~~~~~~~~~~~~t~~f~~~~P~s~yli~fa 128 (129)
++......+ ++++++|.+++|||+||++|.
T Consensus 179 ~~~~~~~~~-g~~~~~f~~~~~mptYL~al~ 208 (859)
T COG0308 179 LIDGGTLVD-GRKIVKFEDTPPMPTYLFALV 208 (859)
T ss_pred ccccccccC-CcEEEEEcCCCCcchHhhhee
Confidence 887763233 488999999999999999986
No 10
>PF12518 DUF3721: Protein of unknown function; InterPro: IPR022196 This domain family is found in bacteria and eukaryotes, and is approximately 30 amino acids in length. There is a conserved WMPC sequence motif. There are two completely conserved residues (A and C) that may be functionally important.
Probab=62.57 E-value=2.8 Score=22.83 Aligned_cols=10 Identities=30% Similarity=1.016 Sum_probs=8.4
Q ss_pred CeEEeeCCCC
Q psy13169 65 RLWFPCIDTY 74 (129)
Q Consensus 65 r~wfPc~D~P 74 (129)
..|+||-+++
T Consensus 24 ~~WMPC~~h~ 33 (34)
T PF12518_consen 24 DKWMPCSNHK 33 (34)
T ss_pred CccccCcccC
Confidence 3799999886
No 11
>PF10633 NPCBM_assoc: NPCBM-associated, NEW3 domain of alpha-galactosidase; InterPro: IPR018905 This domain has been named NEW3, but its function is not known. It is found on proteins which are bacterial galactosidases [].; PDB: 1EUT_A 2BZD_A 1WCQ_C 2BER_A 1W8O_A 1EUU_A 1W8N_A.
Probab=61.88 E-value=13 Score=23.11 Aligned_cols=39 Identities=18% Similarity=0.281 Sum_probs=19.6
Q ss_pred EEEEEEEcCCCeEeecCeeeeeEEecCCCcEEEEEEcCCCC
Q psy13169 80 WRLEFTVDENLTAVSCGDLIEVVYTPDMRRKTFFYSLNIPA 120 (129)
Q Consensus 80 ~~l~v~vp~~~~avsnG~l~~~~~~~~~~~~t~~f~~~~P~ 120 (129)
..+++.+|++|.+-..+.-+.. .+.|...++.|..++|-
T Consensus 24 v~~~l~~P~GW~~~~~~~~~~~--l~pG~s~~~~~~V~vp~ 62 (78)
T PF10633_consen 24 VSLSLSLPEGWTVSASPASVPS--LPPGESVTVTFTVTVPA 62 (78)
T ss_dssp -EEEEE--TTSE---EEEEE----B-TTSEEEEEEEEEE-T
T ss_pred EEEEEeCCCCccccCCcccccc--CCCCCEEEEEEEEECCC
Confidence 6677778999994333333332 25566777888888774
No 12
>PF05964 FYRN: F/Y-rich N-terminus; InterPro: IPR003888 The "FY-rich" domain N-terminal region is sometimes closely juxtaposed with the C-terminal region (IPR003889 from INTERPRO), but sometimes is far distant. It is of unknown function, but occurs frequently in chromatin-associated proteins like trithorax and its homologues.; GO: 0005634 nucleus; PDB: 2WZO_A.
Probab=52.46 E-value=22 Score=20.90 Aligned_cols=23 Identities=9% Similarity=0.102 Sum_probs=18.7
Q ss_pred CcCeEEeeCCCCCCceeEEEEEE
Q psy13169 63 SARLWFPCIDTYSEVCTWRLEFT 85 (129)
Q Consensus 63 ~Ar~wfPc~D~P~~Kat~~l~v~ 85 (129)
.++++|...-+|..|+.|..+|.
T Consensus 31 ~s~R~y~S~~~p~~~~~Y~~~Il 53 (54)
T PF05964_consen 31 KSSRLYWSTVDPRRRCRYTCEIL 53 (54)
T ss_dssp EEEEEEE-SS-TTSEEEEEEEEE
T ss_pred EEEEEEccccCCCCEEEEEEEEe
Confidence 57899999999999999998874
No 13
>COG3449 DNA gyrase inhibitor [DNA replication, recombination, and repair]
Probab=46.28 E-value=47 Score=24.13 Aligned_cols=45 Identities=18% Similarity=0.135 Sum_probs=30.4
Q ss_pred eCCCCC----CceeEEEEEEEcCCCeEeecCeeeeeEEecCCCcEEEEEEc
Q psy13169 70 CIDTYS----EVCTWRLEFTVDENLTAVSCGDLIEVVYTPDMRRKTFFYSL 116 (129)
Q Consensus 70 c~D~P~----~Kat~~l~v~vp~~~~avsnG~l~~~~~~~~~~~~t~~f~~ 116 (129)
|.|+|+ +++.|++-+.++.....-+.|....+. ++|...++++.-
T Consensus 52 ~~ddP~~Tp~e~~R~D~cv~v~~~~~~n~~~v~~~~i--~GG~YAV~r~~~ 100 (154)
T COG3449 52 YQDDPDTTPAEKCRYDACVVVPEPIPENSEGVQLGEI--PGGLYAVARFRG 100 (154)
T ss_pred ecCCCCCCCHHHceeeEEEEcCCccCCCCCceeEeee--cCCceEEEEEec
Confidence 446665 788899999999766666666544444 666666666654
No 14
>PF12969 DUF3857: Domain of Unknown Function with PDB structure (DUF3857); InterPro: IPR024618 This domain is based on the first domain of the PDB structure 3KD4 (residues 1-228). It is structurally similar to domains in other hydrolases, eg. M1 family aminopeptidase, despite lack of any significant sequence similarity. The domain is N-terminal to a transglutaminase domain, which is found in many proteins known to have transglutaminase activity. The function of this domain is unknown. ; PDB: 3KD4_A.
Probab=38.01 E-value=50 Score=23.10 Aligned_cols=46 Identities=11% Similarity=0.023 Sum_probs=21.9
Q ss_pred CCCCCceeEEEEEEEcCCCeEeec-CeeeeeEEecCCCcEEEEEEcCC
Q psy13169 72 DTYSEVCTWRLEFTVDENLTAVSC-GDLIEVVYTPDMRRKTFFYSLNI 118 (129)
Q Consensus 72 D~P~~Kat~~l~v~vp~~~~avsn-G~l~~~~~~~~~~~~t~~f~~~~ 118 (129)
+.|-.++.|++.+-....+..-.+ |.+..+. ...++.++|+|+...
T Consensus 122 ~~pv~~~~~~v~~P~~~~l~~~~~~~~~~~~~-~~~~~~~~~~w~~~~ 168 (177)
T PF12969_consen 122 SIPVRRSRFTVIVPKDLPLNYKEYNGDPAPEE-KEEGGYTTYTWEAKN 168 (177)
T ss_dssp S--EEEEEEEEEEETTS--EEEEET-----EE-EEETTEEEEEEEEEE
T ss_pred CCceeeEEEEEEeCCCCeEEEEeeCCCCCcce-EEcCCcEEEEEEEec
Confidence 446666666555444444555444 5443332 245678889998765
No 15
>KOG3341|consensus
Probab=30.87 E-value=24 Score=27.32 Aligned_cols=17 Identities=24% Similarity=0.600 Sum_probs=11.5
Q ss_pred EEecc-CCCCcCeEEeeC
Q psy13169 55 LFTCP-HENSARLWFPCI 71 (129)
Q Consensus 55 ~~T~~-q~~~Ar~wfPc~ 71 (129)
+|.+. .+.-|++||||+
T Consensus 220 ~WiD~q~g~e~~YW~ps~ 237 (249)
T KOG3341|consen 220 AWIDLQAGDEAAYWFPSL 237 (249)
T ss_pred eeeeccCCcceeeechhh
Confidence 44442 345789999997
No 16
>PF10989 DUF2808: Protein of unknown function (DUF2808); InterPro: IPR021256 This family of proteins with unknown function appears to be restricted to Cyanobacteria.
Probab=28.67 E-value=1.5e+02 Score=20.85 Aligned_cols=34 Identities=12% Similarity=0.011 Sum_probs=24.3
Q ss_pred cccEEEEcCCCCCCcceEEEEEEE-eCCCCc-eeE-EE
Q psy13169 6 FCGVHFVVPEGDGTLVDRNAHLFT-CPHENS-GVH-FV 40 (129)
Q Consensus 6 ~~~L~I~lp~~~~~~~~~i~I~y~-~~~p~~-Gl~-f~ 40 (129)
...|.|.+++.+.++. +|+|... +.||.. |+| |.
T Consensus 88 ~~~i~I~f~~PV~pG~-tv~V~l~~v~NP~~~G~Y~f~ 124 (146)
T PF10989_consen 88 GRTITITFDEPVPPGT-TVTVVLSPVRNPRSGGTYQFN 124 (146)
T ss_pred CCEEEEEeCCCCCCCC-EEEEEEEeeeCCCCCCeEEEE
Confidence 3468888888877775 7888874 557765 887 44
No 17
>PF00703 Glyco_hydro_2: Glycosyl hydrolases family 2; InterPro: IPR006102 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 2 GH2 from CAZY comprises enzymes with several known activities: beta-galactosidase (3.2.1.23 from EC); beta-mannosidase (3.2.1.25 from EC); beta-glucuronidase (3.2.1.31 from EC). These enzymes contain a conserved glutamic acid residue which has been shown [], in Escherichia coli lacZ (P00722 from SWISSPROT), to be the general acid/base catalyst in the active site of the enzyme. This entry describes the immunoglobulin-like beta-sandwich domain [].; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds, 0005975 carbohydrate metabolic process; PDB: 3FN9_C 3DEC_A 3OB8_A 3OBA_A 3CMG_A 3GM8_A 3HN3_E 1BHG_A 2VZU_A 2X09_A ....
Probab=25.97 E-value=1.7e+02 Score=18.15 Aligned_cols=34 Identities=18% Similarity=0.453 Sum_probs=21.0
Q ss_pred ccCCCCcCeEEeeCCCCCCceeEEEEEEEcCCCeEeecCeeeeeE
Q psy13169 58 CPHENSARLWFPCIDTYSEVCTWRLEFTVDENLTAVSCGDLIEVV 102 (129)
Q Consensus 58 ~~q~~~Ar~wfPc~D~P~~Kat~~l~v~vp~~~~avsnG~l~~~~ 102 (129)
..+-..++.|.| ++ -..|++.+++ ..+|..++..
T Consensus 70 ~~~i~~~~lW~p--~~---P~LY~l~v~l------~~~g~~~d~~ 103 (110)
T PF00703_consen 70 TIEIPNPKLWSP--ED---PYLYTLEVEL------DDDGEVLDSI 103 (110)
T ss_dssp EEEEESS-BBES--SS---BSEEEEEEEE------EETTEEEEEE
T ss_pred EEEcCCCCCcCC--CC---ceEEEEEEEE------EeCCEEEEEE
Confidence 344457799999 33 3468988887 2556555443
No 18
>PHA03370 virion protein US2; Provisional
Probab=25.73 E-value=32 Score=26.98 Aligned_cols=24 Identities=17% Similarity=0.242 Sum_probs=20.4
Q ss_pred ccCCCCcCeEEeeCCCCCCceeEE
Q psy13169 58 CPHENSARLWFPCIDTYSEVCTWR 81 (129)
Q Consensus 58 ~~q~~~Ar~wfPc~D~P~~Kat~~ 81 (129)
-.++.-+..|=||||++-+|+-|+
T Consensus 174 ~~~~~~ts~W~P~~~~~~~k~~~~ 197 (269)
T PHA03370 174 GIQTLLTSTWEPKFDTTEDKAHFE 197 (269)
T ss_pred CCCcceeccCCCCCCchhhhhhhh
Confidence 346778899999999999998876
No 19
>PF07703 A2M_N_2: Alpha-2-macroglobulin family N-terminal region; InterPro: IPR011625 This is a domain of the alpha-2-macroglobulin family. The alpha-macroglobulin (aM) family of proteins includes protease inhibitors [], typified by the human tetrameric a2-macroglobulin (a2M); they belong to the MEROPS proteinase inhibitor family I39, clan IL. These protease inhibitors share several defining properties, which include (i) the ability to inhibit proteases from all catalytic classes, (ii) the presence of a 'bait region' and a thiol ester, (iii) a similar protease inhibitory mechanism and (iv) the inactivation of the inhibitory capacity by reaction of the thiol ester with small primary amines. aM protease inhibitors inhibit by steric hindrance []. The mechanism involves protease cleavage of the bait region, a segment of the aM that is particularly susceptible to proteolytic cleavage, which initiates a conformational change such that the aM collapses about the protease. In the resulting aM-protease complex, the active site of the protease is sterically shielded, thus substantially decreasing access to protein substrates. Two additional events occur as a consequence of bait region cleavage, namely (i) the h-cysteinyl-g-glutamyl thiol ester becomes highly reactive and (ii) a major conformational change exposes a conserved COOH-terminal receptor binding domain [] (RBD). RBD exposure allows the aM protease complex to bind to clearance receptors and be removed from circulation []. Tetrameric, dimeric, and, more recently, monomeric aM protease inhibitors have been identified [, ].; PDB: 2QKI_D 3L3O_D 3NMS_A 2ICF_A 2A73_A 2ICE_D 2HR0_A 2A74_A 2XWJ_G 3OHX_A ....
Probab=24.90 E-value=1.7e+02 Score=19.41 Aligned_cols=49 Identities=10% Similarity=0.027 Sum_probs=28.3
Q ss_pred CCCceeEEEEEEEc---CCCeEeecCeeeeeEEec-CCCcEEEEEEcCCCCCc
Q psy13169 74 YSEVCTWRLEFTVD---ENLTAVSCGDLIEVVYTP-DMRRKTFFYSLNIPAPA 122 (129)
Q Consensus 74 P~~Kat~~l~v~vp---~~~~avsnG~l~~~~~~~-~~~~~t~~f~~~~P~s~ 122 (129)
|.+++++.+.-..+ --|.+++.|.+....... .++..++.|..+.-|.+
T Consensus 12 ~Ge~~~v~v~~~~~~~~~~~~v~s~g~I~~~~~~~~~~~~~~~~~~v~~~~~P 64 (136)
T PF07703_consen 12 PGETAKVTVQSPFPNGTFLYLVESRGKIVSTGSVELKNGSTTFEFPVTPDMAP 64 (136)
T ss_dssp TTSEEEEEEEEESCESEEEEEEEETTEEEEEEEEECTTTSSEEEEEE-GGGTS
T ss_pred CCCEEEEEEEcCCCccEEEEEEEECCeEEEEEEEEecCCcEEEEEecchhcCC
Confidence 44555555555554 125577788777655333 34455788888776664
No 20
>KOG3285|consensus
Probab=22.95 E-value=86 Score=23.59 Aligned_cols=26 Identities=15% Similarity=0.470 Sum_probs=20.0
Q ss_pred CCCcCeEEeeCCCCCCceeEEEEEEEcCC
Q psy13169 61 ENSARLWFPCIDTYSEVCTWRLEFTVDEN 89 (129)
Q Consensus 61 ~~~Ar~wfPc~D~P~~Kat~~l~v~vp~~ 89 (129)
-...=.++||.|++ |+|++-+-.+++
T Consensus 135 ItasVtfLP~Le~~---ctFdvLiyTdkD 160 (203)
T KOG3285|consen 135 ITASVTFLPLLEEI---CTFDVLIYTDKD 160 (203)
T ss_pred Hhhheeecccccce---eEEEEEEEeCCC
Confidence 34456799999998 999998865544
No 21
>PRK10016 DNA gyrase inhibitor; Provisional
Probab=21.81 E-value=2.8e+02 Score=19.09 Aligned_cols=27 Identities=15% Similarity=0.031 Sum_probs=19.8
Q ss_pred CeEEeeCCCCCC----ceeEEEEEEEcCCCe
Q psy13169 65 RLWFPCIDTYSE----VCTWRLEFTVDENLT 91 (129)
Q Consensus 65 r~wfPc~D~P~~----Kat~~l~v~vp~~~~ 91 (129)
+.+.=+.|+|.. ++.+++.|.+|....
T Consensus 46 ~~~~i~~D~p~~~~~~~~R~d~~i~v~~~~~ 76 (156)
T PRK10016 46 EWVAVYYDNPDEVPAEKLRCDTVVTVPDDFV 76 (156)
T ss_pred cEEEEECCCCCCCChHHceeeEEEEeCCCcc
Confidence 356667788854 667999999998643
Done!