Query gi|254780899|ref|YP_003065312.1| methionine aminopeptidase [Candidatus Liberibacter asiaticus str. psy62] Match_columns 273 No_of_seqs 139 out of 5592 Neff 7.7 Searched_HMMs 39220 Date Mon May 30 03:42:35 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780899.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 PRK12318 methionine aminopepti 100.0 0 0 500.7 22.4 252 8-262 37-291 (291) 2 TIGR00500 met_pdase_I methioni 100.0 0 0 501.0 21.5 249 12-260 1-265 (265) 3 PRK05716 methionine aminopepti 100.0 0 0 495.1 23.7 251 11-263 2-252 (252) 4 PRK07281 methionine aminopepti 100.0 0 0 493.5 21.5 251 11-263 1-283 (286) 5 PRK12897 methionine aminopepti 100.0 0 0 490.7 22.4 248 11-260 1-248 (248) 6 PRK12896 methionine aminopepti 100.0 0 0 475.7 22.4 252 8-260 3-254 (254) 7 cd01086 MetAP1 Methionine Amin 100.0 0 0 449.8 20.1 238 20-259 1-238 (238) 8 COG0024 Map Methionine aminope 100.0 0 0 443.7 23.1 250 11-262 2-254 (255) 9 KOG2738 consensus 100.0 0 0 434.0 20.4 252 10-263 112-364 (369) 10 PRK09795 aminopeptidase; Provi 100.0 0 0 420.8 20.2 233 8-268 121-359 (361) 11 PRK10879 proline aminopeptidas 100.0 0 0 406.5 17.0 242 6-266 165-427 (436) 12 COG0006 PepP Xaa-Pro aminopept 100.0 0 0 403.6 17.8 237 2-266 142-382 (384) 13 cd01087 Prolidase Prolidase. E 100.0 0 0 375.0 18.2 223 20-259 1-243 (243) 14 cd01090 Creatinase Creatine am 100.0 0 0 367.6 19.0 226 20-259 1-228 (228) 15 cd01092 APP-like Similar to Pr 100.0 0 0 359.4 17.9 207 20-254 1-208 (208) 16 PRK08671 methionine aminopepti 100.0 0 0 344.9 21.6 226 20-259 2-293 (293) 17 PRK13607 proline dipeptidase; 100.0 0 0 347.5 17.1 246 8-260 155-438 (442) 18 cd01088 MetAP2 Methionine Amin 100.0 0 0 340.5 20.9 226 20-259 1-291 (291) 19 cd01089 PA2G4-like Related to 100.0 0 0 337.5 19.0 216 20-259 1-228 (228) 20 pfam00557 Peptidase_M24 Metall 100.0 1.6E-43 0 310.8 17.7 206 21-251 1-207 (207) 21 cd01066 APP_MetAP A family inc 100.0 4.2E-43 0 308.0 19.2 206 20-254 1-207 (207) 22 cd01085 APP X-Prolyl Aminopept 100.0 4.5E-42 0 301.1 15.2 206 22-254 6-219 (224) 23 KOG2414 consensus 100.0 3.1E-42 0 302.2 12.8 241 5-266 219-479 (488) 24 KOG2737 consensus 100.0 3.3E-41 1.4E-45 295.4 12.5 255 7-266 178-473 (492) 25 TIGR00501 met_pdase_II methion 100.0 4.7E-38 1.2E-42 274.3 14.8 236 18-259 6-327 (327) 26 cd01091 CDC68-like Related to 100.0 2.7E-37 6.9E-42 269.3 15.3 225 20-259 1-243 (243) 27 PTZ00053 methionine aminopepti 100.0 2.2E-34 5.6E-39 249.8 20.4 239 16-261 118-433 (435) 28 TIGR00495 crvDNA_42K DNA-bindi 100.0 5.4E-29 1.4E-33 213.9 14.2 265 1-265 1-351 (407) 29 TIGR02993 ectoine_eutD ectoine 100.0 1.1E-28 2.8E-33 211.8 11.5 239 2-266 146-389 (391) 30 KOG2413 consensus 100.0 9.4E-28 2.4E-32 205.6 13.0 233 6-263 299-555 (606) 31 KOG2776 consensus 99.9 4.2E-22 1.1E-26 167.8 17.8 263 1-263 1-343 (398) 32 KOG1189 consensus 99.9 3.3E-22 8.4E-27 168.5 13.3 241 7-267 130-385 (960) 33 KOG2775 consensus 99.9 1E-20 2.6E-25 158.6 16.0 233 15-260 80-394 (397) 34 COG5406 Nucleosome binding fac 99.6 1.4E-14 3.5E-19 117.7 10.9 245 2-264 158-422 (1001) 35 cd01088 MetAP2 Methionine Amin 97.0 0.0027 7E-08 42.2 6.6 96 25-122 104-203 (291) 36 cd01089 PA2G4-like Related to 96.9 0.0031 7.9E-08 41.9 6.4 90 128-220 3-99 (228) 37 PRK08671 methionine aminopepti 96.8 0.0039 1E-07 41.2 6.3 91 25-118 105-200 (293) 38 COG0024 Map Methionine aminope 96.8 0.0039 9.8E-08 41.2 6.0 86 129-219 14-102 (255) 39 PRK09795 aminopeptidase; Provi 96.5 0.04 1E-06 34.5 9.6 80 130-219 137-218 (361) 40 PRK12318 methionine aminopepti 96.4 0.011 2.9E-07 38.1 6.3 86 129-220 52-142 (291) 41 cd01086 MetAP1 Methionine Amin 96.3 0.052 1.3E-06 33.7 9.3 87 22-111 109-199 (238) 42 PRK07281 methionine aminopepti 96.2 0.019 4.8E-07 36.6 6.6 35 134-168 18-52 (286) 43 KOG2738 consensus 96.1 0.014 3.6E-07 37.5 5.5 89 129-223 125-216 (369) 44 TIGR00500 met_pdase_I methioni 96.1 0.027 6.8E-07 35.6 6.7 88 129-221 12-112 (265) 45 cd01092 APP-like Similar to Pr 96.0 0.083 2.1E-06 32.3 9.1 41 23-63 105-145 (208) 46 PRK05716 methionine aminopepti 96.0 0.074 1.9E-06 32.7 8.8 82 23-107 120-205 (252) 47 cd01090 Creatinase Creatine am 96.0 0.086 2.2E-06 32.2 9.1 38 25-62 113-150 (228) 48 pfam00557 Peptidase_M24 Metall 96.0 0.087 2.2E-06 32.2 9.0 82 24-108 106-189 (207) 49 cd01066 APP_MetAP A family inc 95.9 0.061 1.5E-06 33.3 8.0 99 22-123 103-203 (207) 50 PRK12896 methionine aminopepti 95.5 0.17 4.3E-06 30.3 8.8 39 24-62 125-163 (254) 51 COG0006 PepP Xaa-Pro aminopept 95.3 0.17 4.4E-06 30.2 8.5 81 130-220 164-246 (384) 52 cd01087 Prolidase Prolidase. E 95.3 0.17 4.3E-06 30.3 8.3 87 24-110 106-210 (243) 53 PRK12897 methionine aminopepti 94.5 0.096 2.4E-06 31.9 5.4 84 130-219 14-100 (248) 54 cd01091 CDC68-like Related to 93.9 0.41 1.1E-05 27.7 7.6 98 22-124 120-234 (243) 55 PRK10879 proline aminopeptidas 93.9 0.59 1.5E-05 26.7 9.1 80 129-218 182-263 (436) 56 PTZ00053 methionine aminopepti 92.7 0.47 1.2E-05 27.3 6.3 73 141-219 137-213 (435) 57 TIGR00501 met_pdase_II methion 92.1 0.43 1.1E-05 27.6 5.5 132 10-142 112-257 (327) 58 TIGR01857 FGAM-synthase phosph 81.7 4 0.0001 21.1 5.2 60 108-168 218-291 (1279) 59 TIGR02176 pyruv_ox_red pyruvat 69.5 4.6 0.00012 20.7 2.8 49 204-266 373-425 (1194) 60 TIGR01959 nuoF_fam NADH-quinon 65.5 3.4 8.6E-05 21.6 1.5 130 15-153 10-161 (420) 61 TIGR02939 RpoE_Sigma70 RNA pol 62.7 6.2 0.00016 19.8 2.4 110 34-158 35-156 (192) 62 KOG2611 consensus 62.3 5.8 0.00015 20.0 2.2 69 97-165 583-661 (698) 63 TIGR00838 argH argininosuccina 52.0 16 0.0004 17.1 3.0 12 219-230 261-272 (469) 64 TIGR01828 pyru_phos_dikin pyru 49.4 20 0.00051 16.4 4.2 49 92-143 512-566 (920) 65 COG4786 FlgG Flagellar basal b 43.9 22 0.00055 16.2 2.7 32 226-258 107-138 (265) 66 pfam11149 DUF2924 Protein of u 43.0 25 0.00064 15.8 4.0 39 200-264 77-115 (136) 67 TIGR02823 oxido_YhdH putative 41.2 27 0.00068 15.6 5.0 82 91-172 74-177 (330) 68 PRK01490 tig trigger factor; P 39.0 29 0.00073 15.4 5.1 21 90-110 153-173 (435) 69 PRK09854 cmtB putative mannito 38.8 29 0.00074 15.4 3.1 82 78-160 58-144 (147) 70 TIGR00438 rrmJ ribosomal RNA l 37.1 31 0.00078 15.2 2.6 42 97-151 102-144 (192) 71 pfam02287 Dehydratase_SU Dehyd 36.5 31 0.0008 15.1 5.3 43 21-63 69-115 (136) 72 COG1894 NuoF NADH:ubiquinone o 33.9 7.7 0.0002 19.2 -0.9 109 29-144 29-153 (424) 73 pfam08007 Cupin_4 Cupin superf 33.7 28 0.00071 15.5 1.9 49 92-140 178-226 (320) 74 TIGR02312 HpaH 2-oxo-hepta-3-e 32.9 20 0.0005 16.5 1.0 23 41-63 30-55 (271) 75 pfam03881 Fructosamin_kin Fruc 31.9 37 0.00095 14.6 5.4 32 188-220 187-218 (287) 76 pfam09657 Cas_Csx8 CRISPR-asso 30.4 39 0.001 14.5 6.0 42 13-55 81-123 (441) 77 KOG1511 consensus 30.2 40 0.001 14.5 2.2 23 125-147 257-279 (397) 78 PRK04750 ubiB putative ubiquin 29.4 41 0.001 14.4 2.9 63 98-167 301-367 (540) 79 pfam07305 DUF1454 Protein of u 29.2 41 0.001 14.3 4.1 74 126-215 114-187 (200) 80 KOG1527 consensus 28.8 13 0.00034 17.6 -0.4 75 92-170 276-363 (506) 81 PRK08087 L-fuculose phosphate 28.5 26 0.00067 15.6 1.1 17 92-108 47-63 (215) 82 TIGR03290 CoB_CoM_SS_C CoB--Co 28.4 31 0.00078 15.2 1.4 39 145-183 56-102 (144) 83 COG3589 Uncharacterized conser 27.9 43 0.0011 14.2 3.5 62 91-155 162-227 (360) 84 TIGR02669 SpoIID_LytB SpoIID/L 27.7 19 0.00049 16.6 0.2 22 13-35 19-40 (323) 85 cd01821 Rhamnogalacturan_acety 27.2 31 0.00078 15.2 1.2 76 93-172 61-150 (198) 86 PRK10372 L-ascorbate-specific 26.9 45 0.0011 14.1 7.3 85 78-163 59-150 (154) 87 TIGR01980 sufB FeS assembly pr 26.7 34 0.00087 14.9 1.4 16 241-256 216-231 (469) 88 COG2850 Uncharacterized conser 26.2 26 0.00067 15.6 0.7 46 91-136 179-224 (383) 89 TIGR02870 spore_II_D stage II 24.8 23 0.00059 16.0 0.3 25 18-44 100-124 (358) 90 PRK07226 fructose-bisphosphate 24.8 49 0.0013 13.8 4.7 31 116-146 110-142 (266) 91 TIGR00337 PyrG CTP synthase; I 24.2 50 0.0013 13.8 4.1 21 148-168 149-174 (571) 92 PRK10144 formate-dependent nit 22.9 53 0.0014 13.6 4.1 30 27-56 56-85 (126) 93 TIGR03541 reg_near_HchA LuxR f 22.2 55 0.0014 13.5 4.4 30 34-63 4-33 (232) 94 PRK10188 DNA-binding transcrip 22.2 55 0.0014 13.5 4.4 27 36-62 12-38 (240) 95 PRK08333 L-fuculose phosphate 21.2 36 0.00092 14.7 0.7 15 93-107 46-60 (189) 96 PRK09220 hypothetical protein; 21.0 39 0.00099 14.5 0.8 37 92-139 50-87 (205) 97 TIGR03147 cyt_nit_nrfF cytochr 20.6 59 0.0015 13.3 4.0 30 27-56 56-85 (126) 98 PRK06754 mtnB methylthioribulo 20.3 52 0.0013 13.6 1.3 56 47-108 8-67 (208) No 1 >PRK12318 methionine aminopeptidase; Provisional Probab=100.00 E-value=0 Score=500.74 Aligned_cols=252 Identities=43% Similarity=0.704 Sum_probs=238.4 Q ss_pred HCCCEEECCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCC--CCCCCCCCCCCCC Q ss_conf 13978853999999999999999999999999757998899999999999998598303443324--4455323446421 Q gi|254780899|r 8 ESGSINIYTPEELENIRSACNVVARCLDSLTPIIKPGTTTEEIDDFVLKFGMENNAIPATLNYRG--YKKSCCTSINHVI 85 (273) Q Consensus 8 ~~~~~~IKs~~EI~~~R~A~~i~~~~~~~~~~~i~~G~se~el~~~~~~~~~~~g~~~~~~~~~~--f~~~~~~s~N~~~ 85 (273) ....|.||||+||+.||+||+|++++++.+.+.++||+||.||+..+++++.++|++|++++|.+ ||.++|+|+|+++ T Consensus 37 ~~~~i~IKs~eEIe~mR~A~~I~a~~~~~~~~~ikpGvTe~Eld~~~~~~~~~~ga~pa~l~y~~~~Fp~~vc~s~N~~v 116 (291) T PRK12318 37 SQYDIIIKTPEQIEKIRKACQITARILDALCEAAKEGVTTNELDQLSRELHKKYDAIPAPLNYGHPPFPKTICTSLNEVI 116 (291) T ss_pred CCCCCEECCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCEECCCCCCCCCCCCCEEEECCCEE T ss_conf 76885768999999999999999999999983687999899999999999998789867667788788874487238873 Q ss_pred CCCCCCCCCCCCCEEEEEEEEEEEEEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHH Q ss_conf 11212222001340579978766530002311123334444321036778776543221100112122104899999998 Q gi|254780899|r 86 CHGIPSNKQLREGDIVNVDVTYVVNGWHGDSSRMYPVGKIKRAAERILQVTYESLYKGIAAVKLNANIEDIGKAIQRYAH 165 (273) Q Consensus 86 ~H~~p~d~~l~~GD~v~iD~g~~~~GY~~D~~RT~~~G~~~~~~~~l~~~~~~a~~~~i~~ikpG~~~~ei~~a~~~~~~ 165 (273) |||+|++++|++||+|++|+|+.|+||++|++|||++|+++++++++++++++|++++++++|||++++||+.+++++++ T Consensus 117 ~HgiP~~~~Lk~GDlV~iD~G~~y~GY~sD~tRT~~vG~~s~~~~~l~~~v~~A~~~aI~~ikpG~~~~dI~~ai~~~~~ 196 (291) T PRK12318 117 CHGIPNDIPLKNGDIMNIDVSCIVDGYYGDCSRMVMIGEVSEIKKKVCQASLECLNAAIAILKPNLPLYEIGEVIENCAD 196 (291) T ss_pred ECCCCCCCCCCCCCEEEEECCEEECCEECCCCCEEECCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHH T ss_conf 16899986167999899956786898971213145348898999999999999999999980799647799999999999 Q ss_pred HCCCEECCCCCCCCCCCCCCCCCEEEECCCCCCCCCCEECCCCEEEECCEEECCCCCC-EECCCCCEEEECCCCEEEEEE Q ss_conf 6296010475212332211354036531564545664511673788311144377762-564688668833787167844 Q gi|254780899|r 166 SERYSVVEVFCGHGIGKSFHEKPEILHFYDPLYPSVGTFQEGMVFTIEPMLNVGGSSA-KVLSDGWTAVTRDRSLSAQYE 244 (273) Q Consensus 166 ~~g~~~~~~~~GHgiGl~~hE~P~i~~~~~~~~~~~~~le~GMV~aiEP~i~~~~~~~-~~~~d~w~~~~~~g~~~~~~E 244 (273) ++||+++++|+|||||+++||.|.|+|+.. ....+|+||||||||||||+++... ..++|+||++|.||++++||| T Consensus 197 ~~G~~~~~~~~GHGIG~~vHE~P~I~~~~~---~~~~~L~pGMV~TIEPgI~~g~~~~~~~~~d~Wt~~t~d~~~~a~~E 273 (291) T PRK12318 197 TYGFSVVDQFVGHGVGIKFHENPYVPHHRN---RSKIPLAPGMIFTIEPMINVGKKEGVIDPINHWEARTCDNQPSAQWE 273 (291) T ss_pred HCCCCCCCCCCCCCCCCCCCCCCCCCCCCC---CCCCEECCCCEEEECCCCCCCCCCCEECCCCCCEEEECCCCEEEEEC T ss_conf 849911768002872731034886564569---99878379998987992124788747737898579914998475615 Q ss_pred EEEEECCCCCEECCCCCC Q ss_conf 218984798576526798 Q gi|254780899|r 245 HTIGITKAGCEIFTLSPN 262 (273) Q Consensus 245 d~vlVT~~G~E~LT~~p~ 262 (273) |||+||++|+|+||...+ T Consensus 274 htvlIT~~G~eiLT~~d~ 291 (291) T PRK12318 274 HTILITETGYEILTLLDK 291 (291) T ss_pred EEEEEECCCCEECCCCCC T ss_conf 589991992678146899 No 2 >TIGR00500 met_pdase_I methionine aminopeptidase, type I; InterPro: IPR002467 Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site . The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases . Peptidases are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry. Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. This group of metallopeptidases belong to MEROPS peptidase family M24 (clan MG), subfamily M24A. Methionine aminopeptidase (3.4.11.18 from EC) (MAP) is responsible for the removal of the amino-terminal (initiator) methionine from nascent eukaryotic cytosolic and cytoplasmic prokaryotic proteins if the penultimate amino acid is small and uncharged. All MAP studied to date are monomeric proteins that require cobalt ions for activity. Two subfamilies of MAP enzymes are known to exist , . While being evolutionary related, they only share a limited amount of sequence similarity mostly clustered around the residues shown, in the Escherichia coli MAP , to be involved in cobalt-binding. The first family consists of enzymes from prokaryotes as well as eukaryotic MAP-1, while the second group (IPR002468 from INTERPRO) is made up of archaeal MAP and eukaryotic MAP-2. ; GO: 0004239 methionyl aminopeptidase activity, 0006508 proteolysis. Probab=100.00 E-value=0 Score=500.99 Aligned_cols=249 Identities=45% Similarity=0.772 Sum_probs=243.0 Q ss_pred EEECCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCC-CCCCCCCCC------CCC---CCCCCCC Q ss_conf 885399999999999999999999999975799889999999999999859-830344332------444---5532344 Q gi|254780899|r 12 INIYTPEELENIRSACNVVARCLDSLTPIIKPGTTTEEIDDFVLKFGMENN-AIPATLNYR------GYK---KSCCTSI 81 (273) Q Consensus 12 ~~IKs~~EI~~~R~A~~i~~~~~~~~~~~i~~G~se~el~~~~~~~~~~~g-~~~~~~~~~------~f~---~~~~~s~ 81 (273) ++|||++||+.||+||+|+.+++..+...++||+|+.||+..+.+++.+++ |+|||++|+ +|| .++|+|+ T Consensus 1 i~l~s~~Ei~kirkag~l~a~vl~~~~~~v~PGvST~eLD~i~~~~i~~~~hA~pAf~GY~PPfsdvgfPPft~~~C~S~ 80 (265) T TIGR00500 1 ISLKSKDEIEKIRKAGRLVAEVLEILEREVKPGVSTKELDRIAKDFIEKRGHAKPAFLGYYPPFSDVGFPPFTGSVCISV 80 (265) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEE T ss_conf 95578989999999999999999999863689887899999999999847972353567877866687885556156321 Q ss_pred CCCCCCCCCC-CCCCCCCEEEEEEEEEEEEEECCCCCCCCCCCC-CCCCCHHHHHHHHHHHHHHHH-CCCCCCCCCHHHH Q ss_conf 6421112122-220013405799787665300023111233344-443210367787765432211-0011212210489 Q gi|254780899|r 82 NHVICHGIPS-NKQLREGDIVNVDVTYVVNGWHGDSSRMYPVGK-IKRAAERILQVTYESLYKGIA-AVKLNANIEDIGK 158 (273) Q Consensus 82 N~~~~H~~p~-d~~l~~GD~v~iD~g~~~~GY~~D~~RT~~~G~-~~~~~~~l~~~~~~a~~~~i~-~ikpG~~~~ei~~ 158 (273) |+.|+||+|+ .++|++||+|.||+|+.|+||++|.++||++|+ ++++.++|.+++++|+..||+ .+|||++..||.+ T Consensus 81 Ne~V~HGiP~~~~vLKdGDivniD~g~~~~Gy~gD~a~Tf~~G~n~~~~~~KLl~~t~esl~~aI~P~~kpG~~~~~ig~ 160 (265) T TIGR00500 81 NEVVIHGIPDKKKVLKDGDIVNIDVGVIYDGYYGDTAITFLVGKNVSPEAEKLLEATEESLYKAIEPEIKPGKRIGEIGK 160 (265) T ss_pred CCEEEECCCCCCCEEECCCEEEEEEEEEECCEEECCCCEEEECCCCCHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHH T ss_conf 67888513884726513677875577886030211323134138778767889999999998522310147988778999 Q ss_pred HHHHHHHHCCCEECCCCCCCCCCCCCCCCC-EEEECCCCCCC--CCCEECCCCEEEECCEEECCCCCCEECCCCCEEEEC Q ss_conf 999999862960104752123322113540-36531564545--664511673788311144377762564688668833 Q gi|254780899|r 159 AIQRYAHSERYSVVEVFCGHGIGKSFHEKP-EILHFYDPLYP--SVGTFQEGMVFTIEPMLNVGGSSAKVLSDGWTAVTR 235 (273) Q Consensus 159 a~~~~~~~~g~~~~~~~~GHgiGl~~hE~P-~i~~~~~~~~~--~~~~le~GMV~aiEP~i~~~~~~~~~~~d~w~~~~~ 235 (273) |+++++++.||.+++.|||||||..+||+| .|+||..+... ...+|++|||||||||++.+...+.++.||||+.|. T Consensus 161 AI~~~~~~~GF~vvr~y~GHGiG~~~HeePn~ipnY~~~~~K~nsn~~lk~GMV~~iEPMv~~~~~e~~~~~DgWTV~T~ 240 (265) T TIGR00500 161 AIEKYAEEKGFSVVREYCGHGIGRKLHEEPNQIPNYYKKFTKKNSNVRLKEGMVFAIEPMVNTGTEEIVIAADGWTVKTK 240 (265) T ss_pred HHHHHHHHCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCCEEEECCEEECCCCEEEECCCCCEEEEE T ss_conf 99999876299786145276437744557353666667777667887106875788626121788225671688768700 Q ss_pred CCCEEEEEEEEEEECCCCCEECCCC Q ss_conf 7871678442189847985765267 Q gi|254780899|r 236 DRSLSAQYEHTIGITKAGCEIFTLS 260 (273) Q Consensus 236 ~g~~~~~~Ed~vlVT~~G~E~LT~~ 260 (273) |+++++|+||||+||++|+|+||.. T Consensus 241 D~~~~AhfEHT~~iT~~g~~ILT~~ 265 (265) T TIGR00500 241 DGSLSAHFEHTIVITENGPEILTER 265 (265) T ss_pred CCCEEEEEEEEEEEECCEEEEECCC T ss_conf 7742101578999988915780269 No 3 >PRK05716 methionine aminopeptidase; Validated Probab=100.00 E-value=0 Score=495.07 Aligned_cols=251 Identities=53% Similarity=0.924 Sum_probs=243.3 Q ss_pred CEEECCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 78853999999999999999999999999757998899999999999998598303443324445532344642111212 Q gi|254780899|r 11 SINIYTPEELENIRSACNVVARCLDSLTPIIKPGTTTEEIDDFVLKFGMENNAIPATLNYRGYKKSCCTSINHVICHGIP 90 (273) Q Consensus 11 ~~~IKs~~EI~~~R~A~~i~~~~~~~~~~~i~~G~se~el~~~~~~~~~~~g~~~~~~~~~~f~~~~~~s~N~~~~H~~p 90 (273) +|||||++||++||+|++++++++.++.+.++||+||.||+..++..+.+.|+.+++.++.+||.++|+|.|+.++|+.| T Consensus 2 ~~sIKs~~EI~~mR~Aa~i~~~~~~~~~~~i~pG~te~El~~~~~~~~~~~Ga~~~~~~~~~~p~~i~~s~n~~~~H~~p 81 (252) T PRK05716 2 AITIKTPEEIEKMRVAGRIAAEVLDEIEPHVKPGVTTKELDRIAEDYIREQGAIPAPLGYHGFPKSICTSVNEVVCHGIP 81 (252) T ss_pred CEEECCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCEEECCCCCEECCCC T ss_conf 15756899999999999999999999998775999799999999999998779315556678887459517855327999 Q ss_pred CCCCCCCCEEEEEEEEEEEEEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHCCCE Q ss_conf 22200134057997876653000231112333444432103677877654322110011212210489999999862960 Q gi|254780899|r 91 SNKQLREGDIVNVDVTYVVNGWHGDSSRMYPVGKIKRAAERILQVTYESLYKGIAAVKLNANIEDIGKAIQRYAHSERYS 170 (273) Q Consensus 91 ~d~~l~~GD~v~iD~g~~~~GY~~D~~RT~~~G~~~~~~~~l~~~~~~a~~~~i~~ikpG~~~~ei~~a~~~~~~~~g~~ 170 (273) ++++|++||+|++|+|+.|+||++|++|||++|+|+++++++|+++++|++++++++|||++++||++++++++++.||+ T Consensus 82 ~~r~l~~Gd~v~iD~g~~~~GY~sD~tRT~~vG~~s~~~~~~~~~~~~a~~~~i~~ikPG~~~~dv~~a~~~~~~~~g~~ 161 (252) T PRK05716 82 SDKVLKEGDIVNIDVTVIKDGYHGDTSRTFIVGEVSPEDKRLCEVTKEALYLGIKAVKPGARLGDIGHAIQKYAEAEGFS 161 (252) T ss_pred CCCCCCCCCEEEEEEEEEECCCCCCCCEEEECCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHCCCC T ss_conf 88351689989998635677774664503542861036799998999999999997567980899999999999975887 Q ss_pred ECCCCCCCCCCCCCCCCCEEEECCCCCCCCCCEECCCCEEEECCEEECCCCCCEECCCCCEEEECCCCEEEEEEEEEEEC Q ss_conf 10475212332211354036531564545664511673788311144377762564688668833787167844218984 Q gi|254780899|r 171 VVEVFCGHGIGKSFHEKPEILHFYDPLYPSVGTFQEGMVFTIEPMLNVGGSSAKVLSDGWTAVTRDRSLSAQYEHTIGIT 250 (273) Q Consensus 171 ~~~~~~GHgiGl~~hE~P~i~~~~~~~~~~~~~le~GMV~aiEP~i~~~~~~~~~~~d~w~~~~~~g~~~~~~Ed~vlVT 250 (273) .+++|+|||||+.+||+|.+++++.+ +++++|||||||||||+||.+..++..++|+|++++.||.+++||||||||| T Consensus 162 ~~~~~~GHgiGl~~hE~P~i~~~~~~--~~~~~Le~GMV~tiEP~i~~~~~~~~~~~D~w~~~~~~g~~~~riEd~VlVT 239 (252) T PRK05716 162 VVREYCGHGIGRVFHEEPQVLHYGAP--GDGPVLKEGMVFTIEPMINAGKREVKTLKDGWTVVTKDGSLSAQYEHTVAVT 239 (252) T ss_pred CEECCCCCCCCCCCCCCCEECCCCCC--CCCCCCCCCCEEEEECEEECCCCCEEECCCCCEEEECCCCEEEEEEEEEEEC T ss_conf 20021245567766769703565569--9985356982899816150588743886898768822797078863099990 Q ss_pred CCCCEECCCCCCC Q ss_conf 7985765267988 Q gi|254780899|r 251 KAGCEIFTLSPNN 263 (273) Q Consensus 251 ~~G~E~LT~~p~~ 263 (273) +||||+||++|++ T Consensus 240 e~G~E~LT~~p~e 252 (252) T PRK05716 240 EDGPEILTLRPEE 252 (252) T ss_pred CCCCEECCCCCCC T ss_conf 9937789889998 No 4 >PRK07281 methionine aminopeptidase; Reviewed Probab=100.00 E-value=0 Score=493.51 Aligned_cols=251 Identities=34% Similarity=0.575 Sum_probs=239.8 Q ss_pred CEEECCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCC----CCCCCCCCCCCCCC Q ss_conf 78853999999999999999999999999757998899999999999998598303443324----44553234464211 Q gi|254780899|r 11 SINIYTPEELENIRSACNVVARCLDSLTPIIKPGTTTEEIDDFVLKFGMENNAIPATLNYRG----YKKSCCTSINHVIC 86 (273) Q Consensus 11 ~~~IKs~~EI~~~R~A~~i~~~~~~~~~~~i~~G~se~el~~~~~~~~~~~g~~~~~~~~~~----f~~~~~~s~N~~~~ 86 (273) .|+|||++||++||+|++|++++++++.+.+|||+|+.||++.+++.+.++|+.|+++++.+ ||.++|+|+|+++| T Consensus 1 mi~IKs~~EIe~mr~A~~i~a~~~~~~~~~ikpGit~~eld~~~~~~~~e~ga~p~~~g~~g~~~~fp~~vc~svN~~v~ 80 (286) T PRK07281 1 MITLKSAREIEAMDRAGDFLASIHIGLRDLIKPGVDMWEVEEYVRRRCKEDNVLPLQIGVDGHMMDYPYATCCGLNDEVA 80 (286) T ss_pred CEEECCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCEECCCCCCCCCCCCCCCEEEECCCCCC T ss_conf 90706999999999999999999998884676999899999999999997597243347877655788513642276004 Q ss_pred CCCCCCCCCCCCEEEEEEEEE---------------------------EEEEECCCCCCCCCCCCCCCCCHHHHHHHHHH Q ss_conf 121222200134057997876---------------------------65300023111233344443210367787765 Q gi|254780899|r 87 HGIPSNKQLREGDIVNVDVTY---------------------------VVNGWHGDSSRMYPVGKIKRAAERILQVTYES 139 (273) Q Consensus 87 H~~p~d~~l~~GD~v~iD~g~---------------------------~~~GY~~D~~RT~~~G~~~~~~~~l~~~~~~a 139 (273) ||+|++++|++||+|+||+++ .|+||++|.+|||.+|++++++++|++++++| T Consensus 81 HgiP~~~~l~~GDiV~iD~~~g~~~d~~~~D~~~~~~~~~~~~~~~~~~y~G~~~D~~~t~~vG~~~~~~~~L~~v~~eA 160 (286) T PRK07281 81 HAFPRHYILKEGDLLKVDMVLSEPLDKSIVDVSKLDFDNVEQMKKYTESYRGGLADSCWAYAVGTPSDEVKNLMDVTKEA 160 (286) T ss_pred CCCCCCCCCCCCCEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEEEEECCCCCHHHHHHHHHHHHH T ss_conf 89987774689998999635542345433442111233322111322233753501589998879899999999999999 Q ss_pred HHHHHHCCCCCCCCCHHHHHHHHHHHHCCCEECCCCCCCCCCCCCCCCCEEEECCCCCCCCCCEECCCCEEEECCEEECC Q ss_conf 43221100112122104899999998629601047521233221135403653156454566451167378831114437 Q gi|254780899|r 140 LYKGIAAVKLNANIEDIGKAIQRYAHSERYSVVEVFCGHGIGKSFHEKPEILHFYDPLYPSVGTFQEGMVFTIEPMLNVG 219 (273) Q Consensus 140 ~~~~i~~ikpG~~~~ei~~a~~~~~~~~g~~~~~~~~GHgiGl~~hE~P~i~~~~~~~~~~~~~le~GMV~aiEP~i~~~ 219 (273) +.++|+++|||++++||+.+++++++++||+++++|+|||||+.+||+|.|+|++.+. ...+|+|||||||||||+.+ T Consensus 161 l~~gI~~~~pG~~~~dI~~ai~~~~~~~Gygvv~~~~GHGIG~~~HE~P~V~~y~~~~--~g~~L~~GMv~tIEPmi~~g 238 (286) T PRK07281 161 MYRGIEKAVVGNRIGDIGAAIQEYAESRGYGVVRDLVGHGVGPTMHEEPMVPNYGTAG--RGLRLREGMVLTIEPMINTG 238 (286) T ss_pred HHHHHHHHCCCCEEEHHHHHHHHHHHHCCCCCCCCCEECCCCCCCCCCCCCCCCCCCC--CCCCCCCCCEEEEECEECCC T ss_conf 9999997349978856889999999984996777854676575354698367767489--99686469889980617169 Q ss_pred CCCCEE-CCCCCEEEECCCCEEEEEEEEEEECCCCCEECCCCCCC Q ss_conf 776256-46886688337871678442189847985765267988 Q gi|254780899|r 220 GSSAKV-LSDGWTAVTRDRSLSAQYEHTIGITKAGCEIFTLSPNN 263 (273) Q Consensus 220 ~~~~~~-~~d~w~~~~~~g~~~~~~Ed~vlVT~~G~E~LT~~p~~ 263 (273) +..+.. ++|+||++|.||++++||||||+||++|+|+||+.+++ T Consensus 239 ~~~~~~d~~dgWt~~t~Dg~~sa~~EhtvlIT~~G~eVLT~~~~e 283 (286) T PRK07281 239 TWEIDTDMKTGWAHKTLDGGLSCQYEHQFVITKDGPVILTSQGEE 283 (286) T ss_pred CCCEEEECCCCEEEEECCCCEEEEECEEEEEECCCCEEECCCCCC T ss_conf 861564579974899489987773055999979912894088667 No 5 >PRK12897 methionine aminopeptidase; Reviewed Probab=100.00 E-value=0 Score=490.69 Aligned_cols=248 Identities=43% Similarity=0.689 Sum_probs=240.0 Q ss_pred CEEECCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 78853999999999999999999999999757998899999999999998598303443324445532344642111212 Q gi|254780899|r 11 SINIYTPEELENIRSACNVVARCLDSLTPIIKPGTTTEEIDDFVLKFGMENNAIPATLNYRGYKKSCCTSINHVICHGIP 90 (273) Q Consensus 11 ~~~IKs~~EI~~~R~A~~i~~~~~~~~~~~i~~G~se~el~~~~~~~~~~~g~~~~~~~~~~f~~~~~~s~N~~~~H~~p 90 (273) .|+|||++||++||+|++|++++++.+.+.++||+||.||++.++.++.++|+.+.+.+|.+||.++|+|.|+++||+.| T Consensus 1 m~~IKs~~EIe~mr~A~~I~~~~~~~~~~~ikpG~te~el~~~~e~~~~~~Ga~~~~~g~~~fp~~v~~s~n~~~~Hg~p 80 (248) T PRK12897 1 MITIKTKNEIDLMHESGKLLASCHREIAKIMKPGITTKEINTFVEAYLEKHGATSEQKGYNGYPYAICASVNDEMCHAFP 80 (248) T ss_pred CCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCEEEEEECCCCCCCCCC T ss_conf 92107999999999999999999999997776999899999999999998599666457788880689506860667899 Q ss_pred CCCCCCCCEEEEEEEEEEEEEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHCCCE Q ss_conf 22200134057997876653000231112333444432103677877654322110011212210489999999862960 Q gi|254780899|r 91 SNKQLREGDIVNVDVTYVVNGWHGDSSRMYPVGKIKRAAERILQVTYESLYKGIAAVKLNANIEDIGKAIQRYAHSERYS 170 (273) Q Consensus 91 ~d~~l~~GD~v~iD~g~~~~GY~~D~~RT~~~G~~~~~~~~l~~~~~~a~~~~i~~ikpG~~~~ei~~a~~~~~~~~g~~ 170 (273) ++++|++||+|++|+|+.|+||++|++|||++|+++++++++++++++|+.++++++|||++++||+.++++++.++||+ T Consensus 81 ~~~~l~~GD~V~iD~G~~~~GY~sD~tRT~~vG~~~~~~~~l~~~~~~A~~~~i~~~~~G~~~~di~~a~~~~~~~~G~~ 160 (248) T PRK12897 81 ADVPLTEGDIVTIDMVVNLNGGLSDSAWTYRVGKVSDEAEKLLLVAENALYKGIDQAVIGNRVGDIGYAIESYVANEGFS 160 (248) T ss_pred CCCCCCCCCEEEEECCEEECCEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCE T ss_conf 98514589999995748899996276335567877889999999999999999986258978424489999999876971 Q ss_pred ECCCCCCCCCCCCCCCCCEEEECCCCCCCCCCEECCCCEEEECCEEECCCCCCEECCCCCEEEECCCCEEEEEEEEEEEC Q ss_conf 10475212332211354036531564545664511673788311144377762564688668833787167844218984 Q gi|254780899|r 171 VVEVFCGHGIGKSFHEKPEILHFYDPLYPSVGTFQEGMVFTIEPMLNVGGSSAKVLSDGWTAVTRDRSLSAQYEHTIGIT 250 (273) Q Consensus 171 ~~~~~~GHgiGl~~hE~P~i~~~~~~~~~~~~~le~GMV~aiEP~i~~~~~~~~~~~d~w~~~~~~g~~~~~~Ed~vlVT 250 (273) +.++|+|||||+++||.|.|+|++.+ ++..+|++|||||||||||++......+.|+||++|.||++|+||||||+|| T Consensus 161 ~~~~~~GHGiG~~vHE~P~i~~~~~~--~~~~~L~~GMv~tIEPgI~~g~~~~~~~~d~W~~~~~~g~~g~r~E~~vlVT 238 (248) T PRK12897 161 VARDFTGHGIGKEIHEEPAIFHFGKQ--GQGPELQEGMVITIEPIVNVGMRYSKVDLNGWTARTMDGKLSAQYEHTIAIT 238 (248) T ss_pred EECCCCCCHHCCCCCCCCCCCCCCCC--CCCCCCCCCCEEEEECEEECCCCCEEECCCCCEEECCCCCEEEEEEEEEEEE T ss_conf 60781502001486668862667759--9996746897899825460586514655999879835998799960599992 Q ss_pred CCCCEECCCC Q ss_conf 7985765267 Q gi|254780899|r 251 KAGCEIFTLS 260 (273) Q Consensus 251 ~~G~E~LT~~ 260 (273) +||||+||+. T Consensus 239 ~dG~eiLT~~ 248 (248) T PRK12897 239 KDGPIILTKL 248 (248) T ss_pred CCCCEECCCC T ss_conf 9915782359 No 6 >PRK12896 methionine aminopeptidase; Reviewed Probab=100.00 E-value=0 Score=475.65 Aligned_cols=252 Identities=40% Similarity=0.723 Sum_probs=241.0 Q ss_pred HCCCEEECCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 13978853999999999999999999999999757998899999999999998598303443324445532344642111 Q gi|254780899|r 8 ESGSINIYTPEELENIRSACNVVARCLDSLTPIIKPGTTTEEIDDFVLKFGMENNAIPATLNYRGYKKSCCTSINHVICH 87 (273) Q Consensus 8 ~~~~~~IKs~~EI~~~R~A~~i~~~~~~~~~~~i~~G~se~el~~~~~~~~~~~g~~~~~~~~~~f~~~~~~s~N~~~~H 87 (273) +.++|+||||+||++||+|++|++++++++.+.++||+||.||+..+++.+.+.|+.+++.++.+|+.++|+|.|+.++| T Consensus 3 ~~~~~~IKs~~EI~~mR~Aa~i~~~~~~~~~~~i~pG~te~el~~~~~~~~~~~Ga~~~~~~~~~f~~~~~~s~n~~~~H 82 (254) T PRK12896 3 SPRGMEIKSPRELEKMRKIGRIVATALKEMGKAVEPGMTTAELDRIAEKRLEEHGAIPAPEGYYGFPGSTCISVNEEVAH 82 (254) T ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCEEECCCCCCCC T ss_conf 85774537899999999999999999999998776999799999999999998699115555557887417368876447 Q ss_pred CCCCCCCCCCCEEEEEEEEEEEEEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHC Q ss_conf 21222200134057997876653000231112333444432103677877654322110011212210489999999862 Q gi|254780899|r 88 GIPSNKQLREGDIVNVDVTYVVNGWHGDSSRMYPVGKIKRAAERILQVTYESLYKGIAAVKLNANIEDIGKAIQRYAHSE 167 (273) Q Consensus 88 ~~p~d~~l~~GD~v~iD~g~~~~GY~~D~~RT~~~G~~~~~~~~l~~~~~~a~~~~i~~ikpG~~~~ei~~a~~~~~~~~ 167 (273) +.|++++|++||+|++|+|+.|+||++|++|||++|+++++++++|++++++++++++++|||++++||+.+++++++++ T Consensus 83 ~~p~~r~l~~Gd~v~iD~g~~~~GY~sD~tRT~~vG~~~~~~~~l~~~~~ea~~~~i~~ikPG~~~~di~~a~~~~~~~~ 162 (254) T PRK12896 83 GIPGPRVIKDGDLVNIDVSAYKDGYHGDTGATFAVGPVSEEAEKLCRVAEEALWAGIKQVKAGRPLNDIGRAIEDFAKKN 162 (254) T ss_pred CCCCCCCCCCCCEEEEEEEEEECCEECCCCCEEECCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHC T ss_conf 89988545699899985339999892556651541752167999999999999999998359980899999999999986 Q ss_pred CCEECCCCCCCCCCCCCCCCCEEEECCCCCCCCCCEECCCCEEEECCEEECCCCCCEECCCCCEEEECCCCEEEEEEEEE Q ss_conf 96010475212332211354036531564545664511673788311144377762564688668833787167844218 Q gi|254780899|r 168 RYSVVEVFCGHGIGKSFHEKPEILHFYDPLYPSVGTFQEGMVFTIEPMLNVGGSSAKVLSDGWTAVTRDRSLSAQYEHTI 247 (273) Q Consensus 168 g~~~~~~~~GHgiGl~~hE~P~i~~~~~~~~~~~~~le~GMV~aiEP~i~~~~~~~~~~~d~w~~~~~~g~~~~~~Ed~v 247 (273) ||..+++|+|||||+++||.|.++++.. ...++.+|+||||||||||+|++...+..++|+|++.+.|+++++|||||| T Consensus 163 G~~~~~~~~GHGiGl~~HE~P~~~~~~~-~~~~~~~Le~GMV~tiEP~i~~~~~~~~~~~d~w~~~~~~~~~~vr~Ed~v 241 (254) T PRK12896 163 GYSVVRNLTGHGVGRSLHEEPSVINYRT-DPSPNRLLRPGMVLAIEPFLNLGAKDTEELDDGWTVVTPDKSLSAQFEHTV 241 (254) T ss_pred CCCEEECCCEEECCCCCCCCCCCCCCCC-CCCCCCCCCCCCEEEEECEEECCCCCEEECCCCCEEECCCCCEEEEEEEEE T ss_conf 9856742342555021267875477676-898870702796899805040487640665888658714896289970499 Q ss_pred EECCCCCEECCCC Q ss_conf 9847985765267 Q gi|254780899|r 248 GITKAGCEIFTLS 260 (273) Q Consensus 248 lVT~~G~E~LT~~ 260 (273) |||+||||+||.. T Consensus 242 lVTe~G~e~LT~~ 254 (254) T PRK12896 242 VVTRDGPEILTLR 254 (254) T ss_pred EECCCCCEECCCC T ss_conf 9929933764278 No 7 >cd01086 MetAP1 Methionine Aminopeptidase 1. E.C. 3.4.11.18. Also known as methionyl aminopeptidase and Peptidase M. Catalyzes release of N-terminal amino acids, preferentially methionine, from peptides and arylamides. Probab=100.00 E-value=0 Score=449.78 Aligned_cols=238 Identities=51% Similarity=0.872 Sum_probs=230.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCE Q ss_conf 99999999999999999999757998899999999999998598303443324445532344642111212222001340 Q gi|254780899|r 20 LENIRSACNVVARCLDSLTPIIKPGTTTEEIDDFVLKFGMENNAIPATLNYRGYKKSCCTSINHVICHGIPSNKQLREGD 99 (273) Q Consensus 20 I~~~R~A~~i~~~~~~~~~~~i~~G~se~el~~~~~~~~~~~g~~~~~~~~~~f~~~~~~s~N~~~~H~~p~d~~l~~GD 99 (273) |++||+|++|+++++.++++.++||+||.||++.++..+.++|+++++.++.+||.++|+|.|++++|+.|++++|++|| T Consensus 1 Ie~mR~A~~i~~~a~~~~~~~irpG~tE~ei~~~~~~~~~~~Ga~~~~~~~~~f~~~~~~s~N~~~~H~~p~~~~l~~Gd 80 (238) T cd01086 1 IEGMREAGRIVAEVLDELAKAIKPGVTTKELDQIAHEFIEEHGAYPAPLGYYGFPKSICTSVNEVVCHGIPDDRVLKDGD 80 (238) T ss_pred CHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCEECCCCCCCCCCCEEECCCCCCCCCCCCCCCCCCCC T ss_conf 98999999999999999998776899599999999999998739715555678886247325745037999884546999 Q ss_pred EEEEEEEEEEEEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHCCCEECCCCCCCC Q ss_conf 57997876653000231112333444432103677877654322110011212210489999999862960104752123 Q gi|254780899|r 100 IVNVDVTYVVNGWHGDSSRMYPVGKIKRAAERILQVTYESLYKGIAAVKLNANIEDIGKAIQRYAHSERYSVVEVFCGHG 179 (273) Q Consensus 100 ~v~iD~g~~~~GY~~D~~RT~~~G~~~~~~~~l~~~~~~a~~~~i~~ikpG~~~~ei~~a~~~~~~~~g~~~~~~~~GHg 179 (273) +|++|+|+.|+||++|++|||++|+++++++++|++++++++++++++|||++++||+++++++++++||+.+++++||| T Consensus 81 ~v~~D~g~~~~gY~sD~tRT~~vG~~~~~~~~~~~~v~ea~~~~i~~~kpG~~~~ev~~a~~~~~~~~g~~~~~~~~GHg 160 (238) T cd01086 81 IVNIDVGVELDGYHGDSARTFIVGEVSEEAKKLVEVTEEALYKGIEAVKPGNRIGDIGHAIEKYAEKNGYSVVREFGGHG 160 (238) T ss_pred EEEEECCEEECCCCCCCEEEEECCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHCCCCCCCCEEECC T ss_conf 89996206688784551576625936899999999999999999987169989999999999999986998354633262 Q ss_pred CCCCCCCCCEEEECCCCCCCCCCEECCCCEEEECCEEECCCCCCEECCCCCEEEECCCCEEEEEEEEEEECCCCCEECCC Q ss_conf 32211354036531564545664511673788311144377762564688668833787167844218984798576526 Q gi|254780899|r 180 IGKSFHEKPEILHFYDPLYPSVGTFQEGMVFTIEPMLNVGGSSAKVLSDGWTAVTRDRSLSAQYEHTIGITKAGCEIFTL 259 (273) Q Consensus 180 iGl~~hE~P~i~~~~~~~~~~~~~le~GMV~aiEP~i~~~~~~~~~~~d~w~~~~~~g~~~~~~Ed~vlVT~~G~E~LT~ 259 (273) ||+.+||+|.+++++.+ +++.+|++||||||||++|++..+...++|+|++.+.||.+|+|+||||+||+||||+||. T Consensus 161 vGl~~he~P~i~~~~~~--~~~~~L~~GmV~tiEP~i~~~~~~~~~~~d~w~~~~~~~~~g~riEd~ilITe~G~e~LT~ 238 (238) T cd01086 161 IGRKFHEEPQIPNYGRP--GTGPKLKPGMVFTIEPMINLGTYEVVTLPDGWTVVTKDGSLSAQFEHTVLITEDGPEILTL 238 (238) T ss_pred CCCCCCCCCCCCCCCCC--CCCCCCCCCCEEEEECEEECCCCCEEECCCCCEEECCCCCCEEEEEEEEEECCCCCEECCC T ss_conf 46776669862644548--9984056984899926484388743887898768702897178873199993997766779 No 8 >COG0024 Map Methionine aminopeptidase [Translation, ribosomal structure and biogenesis] Probab=100.00 E-value=0 Score=443.70 Aligned_cols=250 Identities=46% Similarity=0.778 Sum_probs=238.1 Q ss_pred CEEECCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 78853999999999999999999999999757998899999999999998598303443324445532344642111212 Q gi|254780899|r 11 SINIYTPEELENIRSACNVVARCLDSLTPIIKPGTTTEEIDDFVLKFGMENNAIPATLNYRGYKKSCCTSINHVICHGIP 90 (273) Q Consensus 11 ~~~IKs~~EI~~~R~A~~i~~~~~~~~~~~i~~G~se~el~~~~~~~~~~~g~~~~~~~~~~f~~~~~~s~N~~~~H~~p 90 (273) .+.|||++||+.||+|++|+.++++++.+.++||+|+.||+.++++++.+.|++|++++|.+||.++|+|+|++++||+| T Consensus 2 ~i~ikt~~eiek~r~Ag~i~a~~l~~~~~~v~pGvtt~Eld~~~~~~i~~~ga~pa~~gy~g~~~~~ciSvNe~v~HgiP 81 (255) T COG0024 2 MISIKTPEEIEKMREAGKIAAKALKEVASLVKPGVTTLELDEIAEEFIREKGAYPAFLGYKGFPFPTCISVNEVVAHGIP 81 (255) T ss_pred CCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCEEHHCCCCCCCCCEEEEHHHEEEECCC T ss_conf 75327999999999999999999999998717999799999999999997486361106669885638403150351478 Q ss_pred C-CCCCCCCEEEEEEEEEEEEEECCCCCCCCCCCCCCC-CCHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHCC Q ss_conf 2-220013405799787665300023111233344443-21036778776543221100112122104899999998629 Q gi|254780899|r 91 S-NKQLREGDIVNVDVTYVVNGWHGDSSRMYPVGKIKR-AAERILQVTYESLYKGIAAVKLNANIEDIGKAIQRYAHSER 168 (273) Q Consensus 91 ~-d~~l~~GD~v~iD~g~~~~GY~~D~~RT~~~G~~~~-~~~~l~~~~~~a~~~~i~~ikpG~~~~ei~~a~~~~~~~~g 168 (273) + +++|++||+|+||+|++++||++|.++||.+|+.++ ..++|.+++++|+.++|+++|||+++++|.++++++++++| T Consensus 82 ~d~~vlk~GDiv~IDvg~~~dG~~~Dsa~T~~vg~~~~~~~~~L~~~t~eal~~~I~~vkpG~~l~~Ig~aIq~~~~~~G 161 (255) T COG0024 82 GDKKVLKEGDIVKIDVGAHIDGYIGDTAITFVVGEVSDEDAKRLLEATKEALYAGIEAVKPGARLGDIGRAIQEYAESRG 161 (255) T ss_pred CCCCCCCCCCEEEEEEEEEECCEEEEEEEEEECCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCC T ss_conf 98734589998999889997772465789998788773789999999999999998750699888999999999999769 Q ss_pred CEECCCCCCCCCCCCCCCCCEEEECCCCCCCCCCEECCCCEEEECCEEECCCCCCEECC-CCCEEEECCCCEEEEEEEEE Q ss_conf 60104752123322113540365315645456645116737883111443777625646-88668833787167844218 Q gi|254780899|r 169 YSVVEVFCGHGIGKSFHEKPEILHFYDPLYPSVGTFQEGMVFTIEPMLNVGGSSAKVLS-DGWTAVTRDRSLSAQYEHTI 247 (273) Q Consensus 169 ~~~~~~~~GHgiGl~~hE~P~i~~~~~~~~~~~~~le~GMV~aiEP~i~~~~~~~~~~~-d~w~~~~~~g~~~~~~Ed~v 247 (273) |++++.|+|||+|..+||.|+++|++....+ .+|++|||||||||++.+...+..++ |+|+++|.||++++|||||| T Consensus 162 ~~vVr~~~GHgig~~~He~p~ip~y~~~~~~--~~l~~Gmv~aIEPmi~~G~~~~~~~~~d~Wt~~t~d~~~~aq~EHTv 239 (255) T COG0024 162 FSVVRNLTGHGIGRELHEEPSIPNYGKDGTG--VRLKEGMVFAIEPMINTGSGEVVEGPSDRWTLVTKDGSLSAQFEHTV 239 (255) T ss_pred CEEEECCCCCCCCCCCCCCCEECCCCCCCCC--CCCCCCCEEEEEEEEECCCCCEEECCCCCEEEEECCCCEEEEEEEEE T ss_conf 8771466477567545799702564688887--24369979997027875987327448997699947998775788899 Q ss_pred EECCCCCEECCCCCC Q ss_conf 984798576526798 Q gi|254780899|r 248 GITKAGCEIFTLSPN 262 (273) Q Consensus 248 lVT~~G~E~LT~~p~ 262 (273) +||++|+|+||..+. T Consensus 240 ~Vt~~g~eilT~~~~ 254 (255) T COG0024 240 IVTEDGCEILTLRPE 254 (255) T ss_pred EEECCCCEEEECCCC T ss_conf 992798489407889 No 9 >KOG2738 consensus Probab=100.00 E-value=0 Score=434.02 Aligned_cols=252 Identities=45% Similarity=0.796 Sum_probs=243.4 Q ss_pred CCEEECCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 97885399999999999999999999999975799889999999999999859830344332444553234464211121 Q gi|254780899|r 10 GSINIYTPEELENIRSACNVVARCLDSLTPIIKPGTTTEEIDDFVLKFGMENNAIPATLNYRGYKKSCCTSINHVICHGI 89 (273) Q Consensus 10 ~~~~IKs~~EI~~~R~A~~i~~~~~~~~~~~i~~G~se~el~~~~~~~~~~~g~~~~~~~~~~f~~~~~~s~N~~~~H~~ 89 (273) ..+.|+++++|+.||+||+++.++++++...++||+|+.||+..++++++++|++|++++|++||.++|+|+|.++|||+ T Consensus 112 ~~i~i~~~e~ie~mR~ac~LarevLd~Aa~~v~PgvTTdEiD~~VH~a~Ierg~YPSPLnYy~FPKS~CTSVNEviCHGI 191 (369) T KOG2738 112 NEIKILDPEGIEGMRKACRLAREVLDYAATLVRPGVTTDEIDRAVHNAIIERGAYPSPLNYYGFPKSVCTSVNEVICHGI 191 (369) T ss_pred CCEECCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHCCHHHEEECCC T ss_conf 62351588999999999999999999875025797638899999999998648858875457886666242221465478 Q ss_pred CCCCCCCCCEEEEEEEEEEEEEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHCCC Q ss_conf 22220013405799787665300023111233344443210367787765432211001121221048999999986296 Q gi|254780899|r 90 PSNKQLREGDIVNVDVTYVVNGWHGDSSRMYPVGKIKRAAERILQVTYESLYKGIAAVKLNANIEDIGKAIQRYAHSERY 169 (273) Q Consensus 90 p~d~~l~~GD~v~iD~g~~~~GY~~D~~RT~~~G~~~~~~~~l~~~~~~a~~~~i~~ikpG~~~~ei~~a~~~~~~~~g~ 169 (273) |..|+|++||+|+||+...++|||+|..+||++|++++..++|.+..+++++.+|+++|||+++.+|...++++..++|| T Consensus 192 PD~RpLedGDIvNiDVtvY~~GyHGDlneTffvG~Vde~~k~LVkvT~EcL~kaI~~~kpGv~freiG~iI~kha~~~g~ 271 (369) T KOG2738 192 PDSRPLEDGDIVNIDVTVYLNGYHGDLNETFFVGNVDEKAKKLVKVTRECLEKAIAIVKPGVSFREIGNIIQKHATKNGY 271 (369) T ss_pred CCCCCCCCCCEEEEEEEEEECCCCCCCCCCEEEECCCHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHCCC T ss_conf 88672777987868899994252576544268403698898899999999999887708960499999999987655176 Q ss_pred EECCCCCCCCCCCCCCCCCEEEECCCCCCCCCCEECCCCEEEECCEEECCCCCCEECCCCCEEEECCCCEEEEEEEEEEE Q ss_conf 01047521233221135403653156454566451167378831114437776256468866883378716784421898 Q gi|254780899|r 170 SVVEVFCGHGIGKSFHEKPEILHFYDPLYPSVGTFQEGMVFTIEPMLNVGGSSAKVLSDGWTAVTRDRSLSAQYEHTIGI 249 (273) Q Consensus 170 ~~~~~~~GHgiGl~~hE~P~i~~~~~~~~~~~~~le~GMV~aiEP~i~~~~~~~~~~~d~w~~~~~~g~~~~~~Ed~vlV 249 (273) ++++.|+|||||.-+|-.|.|+||.+. ....+|.+||+|||||||++|...-..|+|+||++|+||++++|||||+|| T Consensus 272 sVVr~ycGHGig~~FH~~PnipHya~n--~a~GvM~~G~tFTIEPmit~G~~~d~tWPD~WT~vTaDG~~sAQFEhTlLV 349 (369) T KOG2738 272 SVVRSYCGHGIGRVFHCAPNIPHYAKN--KAPGVMKPGQTFTIEPMITIGTWEDITWPDDWTAVTADGKRSAQFEHTLLV 349 (369) T ss_pred EEEHHHHCCCCCCCCCCCCCCHHHCCC--CCCCEEECCCEEEEEEEECCCCCCCCCCCCCCEEEECCCCEECCEEEEEEE T ss_conf 001132056655301468886001046--776344268558863024036655554899863781388511113568998 Q ss_pred CCCCCEECCCC-CCC Q ss_conf 47985765267-988 Q gi|254780899|r 250 TKAGCEIFTLS-PNN 263 (273) Q Consensus 250 T~~G~E~LT~~-p~~ 263 (273) |++|+|+||+. |.. T Consensus 350 T~tG~EILT~r~~~~ 364 (369) T KOG2738 350 TETGCEILTKRLPNS 364 (369) T ss_pred ECCCCEEHHCCCCCC T ss_conf 236600011026899 No 10 >PRK09795 aminopeptidase; Provisional Probab=100.00 E-value=0 Score=420.76 Aligned_cols=233 Identities=24% Similarity=0.366 Sum_probs=212.1 Q ss_pred HCCCEEECCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 13978853999999999999999999999999757998899999999999998598303443324445532344642111 Q gi|254780899|r 8 ESGSINIYTPEELENIRSACNVVARCLDSLTPIIKPGTTTEEIDDFVLKFGMENNAIPATLNYRGYKKSCCTSINHVICH 87 (273) Q Consensus 8 ~~~~~~IKs~~EI~~~R~A~~i~~~~~~~~~~~i~~G~se~el~~~~~~~~~~~g~~~~~~~~~~f~~~~~~s~N~~~~H 87 (273) .+..|.||||+||++||+|++|+++++.++.+.++||+||.||++.+...+.+.|+.. .+|++++++|.|++.|| T Consensus 121 ~~~lR~IKs~~EI~~ir~A~~iad~~~~~~~~~i~~G~tE~eiaa~~~~~~~~~Ga~~-----~sf~~iv~sG~~~a~pH 195 (361) T PRK09795 121 PDVLRQIKTPEEVEKIRLACGIADRGAEHIRRFIQAGMSEREIAAELEWFMRQQGAEK-----ASFDTIVASGWRGALPH 195 (361) T ss_pred HHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCCC-----CCCCEEECCCCCCCCCC T ss_conf 8887567999999999999999999999999876599889999999999999829986-----66660303667774677 Q ss_pred CCCCCCCCCCCEEEEEEEEEEEEEECCCCCCCCCCCCCCCCC-----HHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHH Q ss_conf 212222001340579978766530002311123334444321-----036778776543221100112122104899999 Q gi|254780899|r 88 GIPSNKQLREGDIVNVDVTYVVNGWHGDSSRMYPVGKIKRAA-----ERILQVTYESLYKGIAAVKLNANIEDIGKAIQR 162 (273) Q Consensus 88 ~~p~d~~l~~GD~v~iD~g~~~~GY~~D~~RT~~~G~~~~~~-----~~l~~~~~~a~~~~i~~ikpG~~~~ei~~a~~~ 162 (273) +.|+|++|++||+|++|+|+.|+||++|++|||++|.+...+ .++|+++++|++++++++|||+++++|++++++ T Consensus 196 ~~~s~r~i~~Gd~v~iD~G~~~~GY~sD~tRT~~ig~~~~~~~~~~l~~~y~~v~ea~~~~~~~~kpG~~~~evd~~~r~ 275 (361) T PRK09795 196 GKASDKIVAAGEFVTLDFGALYQGYCSDMTRTLLVNGEGVSAESHPLFNVYQIVLQAQLAAISAIRPGVRCQQVDDAARR 275 (361) T ss_pred CCCCCCCCCCCCEEEEEEEEEECCCCCCCEEEEEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHH T ss_conf 88777351689989998758989785674699998897805766478999999999999999983699821068999999 Q ss_pred HHHHCCCE-ECCCCCCCCCCCCCCCCCEEEECCCCCCCCCCEECCCCEEEECCEEECCCCCCEECCCCCEEEECCCCEEE Q ss_conf 99862960-10475212332211354036531564545664511673788311144377762564688668833787167 Q gi|254780899|r 163 YAHSERYS-VVEVFCGHGIGKSFHEKPEILHFYDPLYPSVGTFQEGMVFTIEPMLNVGGSSAKVLSDGWTAVTRDRSLSA 241 (273) Q Consensus 163 ~~~~~g~~-~~~~~~GHgiGl~~hE~P~i~~~~~~~~~~~~~le~GMV~aiEP~i~~~~~~~~~~~d~w~~~~~~g~~~~ 241 (273) +++++||+ .+.|++|||+|+++||.|.+. .++..+|+|||||||||++|+++ ..|+ T Consensus 276 ~i~~~G~~~~f~h~~GHgvGl~vhE~P~~~------~~~~~~L~~GMv~tvEPgiy~~g-----------------~gGv 332 (361) T PRK09795 276 VITEAGYGDYFGHNTGHAIGIEVHEDPRFS------PRDTTTLQPGMLLTVEPGIYLPG-----------------QGGV 332 (361) T ss_pred HHHHHCCCCCCCCCCCCCCCCCCCCCCCCC------CCCCCEECCCCEEEECCEEEECC-----------------CCEE T ss_conf 999807886678984054888878898325------89998778997999899789769-----------------8489 Q ss_pred EEEEEEEECCCCCEECCCCCCCCCCCC Q ss_conf 844218984798576526798867688 Q gi|254780899|r 242 QYEHTIGITKAGCEIFTLSPNNLGQPG 268 (273) Q Consensus 242 ~~Ed~vlVT~~G~E~LT~~p~~l~~~~ 268 (273) |+||+|+||+||+|+||++||+|.-.| T Consensus 333 riEd~vlVT~~G~E~Lt~~pk~l~i~~ 359 (361) T PRK09795 333 RIEDVVLVTPQGAEVLYAMPKTVLLTG 359 (361) T ss_pred EEEEEEEECCCCCEECCCCCCEEEEEC T ss_conf 988799992995468888994689944 No 11 >PRK10879 proline aminopeptidase P II; Provisional Probab=100.00 E-value=0 Score=406.48 Aligned_cols=242 Identities=22% Similarity=0.296 Sum_probs=210.2 Q ss_pred HHHCCCEEECCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 33139788539999999999999999999999997579988999999999999985983034433244455323446421 Q gi|254780899|r 6 SRESGSINIYTPEELENIRSACNVVARCLDSLTPIIKPGTTTEEIDDFVLKFGMENNAIPATLNYRGYKKSCCTSINHVI 85 (273) Q Consensus 6 ~~~~~~~~IKs~~EI~~~R~A~~i~~~~~~~~~~~i~~G~se~el~~~~~~~~~~~g~~~~~~~~~~f~~~~~~s~N~~~ 85 (273) +.....|.|||++||++||+|++|+++++.++++.++||++|.||++.++..+.++|+. +.+|+.++++|.|+++ T Consensus 165 ~li~elR~IKs~~EI~~mr~A~~It~~a~~~~m~~~~pG~~E~ei~a~~~~~~~~~G~~-----~~ay~~Iva~G~na~~ 239 (436) T PRK10879 165 PWVHEMRLFKSPEEIAVLRRAGEISALAHTRAMEKCRPGMFEYQLEGEIHHEFNRHGAR-----YPSYNTIVGSGENGCI 239 (436) T ss_pred HHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCCC-----CCCCCCEECCCCCCCC T ss_conf 89999866799999999999999999999999853688873899999999999976898-----5774544237787756 Q ss_pred CCCCCCCCCCCCCEEEEEEEEEEEEEECCCCCCCCCC-CCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHH Q ss_conf 1121222200134057997876653000231112333-444432103677877654322110011212210489999999 Q gi|254780899|r 86 CHGIPSNKQLREGDIVNVDVTYVVNGWHGDSSRMYPV-GKIKRAAERILQVTYESLYKGIAAVKLNANIEDIGKAIQRYA 164 (273) Q Consensus 86 ~H~~p~d~~l~~GD~v~iD~g~~~~GY~~D~~RT~~~-G~~~~~~~~l~~~~~~a~~~~i~~ikpG~~~~ei~~a~~~~~ 164 (273) +||.++|++|++||+|++|+|+.|+||++|+||||.+ |+++++|+++|++++++|+++++++|||+++.||+.++.+++ T Consensus 240 lHy~~~~~~l~~gdlvliD~G~~~~gY~sDitRT~pv~Gkfs~~q~~iY~~vl~~q~~~i~~~kpGv~~~di~~~~~~~~ 319 (436) T PRK10879 240 LHYTENESEMRDGDLVLIDAGCEYKGYAGDITRTFPVNGKFTPAQREIYDIVLESLETSLRLYRPGTSIQEVTGEVVRIM 319 (436) T ss_pred CCCCCCCCCCCCCCEEEEECCCEECCCCCCCEEEEEECCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHH T ss_conf 67655544257777799823711687735753898858968899999999999999999998369996999999999999 Q ss_pred HH------------------CCCE-ECCCCCCCCCCCCCCCCCEEEECCCCCCCCCCEECCCCEEEECCEEECCCCCCEE Q ss_conf 86------------------2960-1047521233221135403653156454566451167378831114437776256 Q gi|254780899|r 165 HS------------------ERYS-VVEVFCGHGIGKSFHEKPEILHFYDPLYPSVGTFQEGMVFTIEPMLNVGGSSAKV 225 (273) Q Consensus 165 ~~------------------~g~~-~~~~~~GHgiGl~~hE~P~i~~~~~~~~~~~~~le~GMV~aiEP~i~~~~~~~~~ 225 (273) .+ ..+. .+.|.+||++||++||.|.+. .+...+|+||||||||||||+....- T Consensus 320 ~~~l~~lg~~~~~~~~~~~~~~~~~~~~Hg~GH~lGldvHD~~~~~------~~~~~~L~~GmV~tiEPGiY~~~~~~-- 391 (436) T PRK10879 320 VSGLVKLGILKGDVDELIAENAHRPFFMHGLSHWLGLDVHDVGVYG------QDRSRILEPGMVLTVEPGLYIAPDAD-- 391 (436) T ss_pred HHHHHHCCCCCCCHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCC------CCCCCCCCCCCEEEECCEEEECCCCC-- T ss_conf 9999866876787899976234557778887766896777886557------88887479985899899776578778-- Q ss_pred CCCCCEEEECCCCEEEEEEEEEEECCCCCEECCC-CCCCCCC Q ss_conf 4688668833787167844218984798576526-7988676 Q gi|254780899|r 226 LSDGWTAVTRDRSLSAQYEHTIGITKAGCEIFTL-SPNNLGQ 266 (273) Q Consensus 226 ~~d~w~~~~~~g~~~~~~Ed~vlVT~~G~E~LT~-~p~~l~~ 266 (273) .++ .-.++|+||||+|+||++|+|+||. .||++++ T Consensus 392 ~~~------~~~giGiRIEDdvliT~~G~e~Lt~~~pk~i~e 427 (436) T PRK10879 392 VPE------QYRGIGIRIEDDIVITETGNENLTASVVKKPDE 427 (436) T ss_pred CCC------CCCCEEEEECCEEEECCCCCEECCCCCCCCHHH T ss_conf 874------127348994207999899783236558999999 No 12 >COG0006 PepP Xaa-Pro aminopeptidase [Amino acid transport and metabolism] Probab=100.00 E-value=0 Score=403.60 Aligned_cols=237 Identities=27% Similarity=0.394 Sum_probs=219.6 Q ss_pred CCCHHHHCCCEEECCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCC Q ss_conf 77203313978853999999999999999999999999757998899999999999998598303443324445532344 Q gi|254780899|r 2 LSSSSRESGSINIYTPEELENIRSACNVVARCLDSLTPIIKPGTTTEEIDDFVLKFGMENNAIPATLNYRGYKKSCCTSI 81 (273) Q Consensus 2 ~~~~~~~~~~~~IKs~~EI~~~R~A~~i~~~~~~~~~~~i~~G~se~el~~~~~~~~~~~g~~~~~~~~~~f~~~~~~s~ 81 (273) ++........|.|||+.||++||+|+++++.++..+.+.+++|+||.|+++.+...+.+.|+.. .+|+.++++|. T Consensus 142 ~~~~~~i~~lR~iKs~~EI~~ir~A~~i~~~a~~~~~~~~~~G~tE~ev~a~l~~~~~~~G~~~-----~sf~~iva~G~ 216 (384) T COG0006 142 VDASDLVDRLRLIKSPAEIAKIRKAAEIADAALEAALEAIRPGMTEAEIAAELEYALRKGGAEG-----PSFDTIVASGE 216 (384) T ss_pred ECCHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCC-----CCCCCEEEECC T ss_conf 5027789998855999999999999999999999999861079979999999999999808875-----77475787654 Q ss_pred CCCCCCCCCCCCCCCCCEEEEEEEEEEEEEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHH Q ss_conf 64211121222200134057997876653000231112333444432103677877654322110011212210489999 Q gi|254780899|r 82 NHVICHGIPSNKQLREGDIVNVDVTYVVNGWHGDSSRMYPVGKIKRAAERILQVTYESLYKGIAAVKLNANIEDIGKAIQ 161 (273) Q Consensus 82 N~~~~H~~p~d~~l~~GD~v~iD~g~~~~GY~~D~~RT~~~G~~~~~~~~l~~~~~~a~~~~i~~ikpG~~~~ei~~a~~ 161 (273) |+++|||.|+++++++||+|++|+|+.|+||++|+||||++|+|+++|+++|+++++||+++++++|||+++++|+.+++ T Consensus 217 n~a~pH~~~~~~~~~~gd~vliD~G~~~~gY~sDiTRT~~~G~~s~~~~~iy~~V~~aq~aa~~~~~pG~~~~~vd~~ar 296 (384) T COG0006 217 NAALPHYTPSDRKLRDGDLVLIDLGGVYNGYCSDITRTFPIGKPSDEQREIYEAVLEAQEAAIAAIRPGVTGGEVDAAAR 296 (384) T ss_pred CCCCCCCCCCCCCCCCCCEEEEEECEEECCCCCCCEEEEECCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHH T ss_conf 33565678888766799989999460689811563568976999999999999999999999987279995999999999 Q ss_pred HHHHHCCCE-ECCCCCCCCCC--CCCCCCCE-EEECCCCCCCCCCEECCCCEEEECCEEECCCCCCEECCCCCEEEECCC Q ss_conf 999862960-10475212332--21135403-653156454566451167378831114437776256468866883378 Q gi|254780899|r 162 RYAHSERYS-VVEVFCGHGIG--KSFHEKPE-ILHFYDPLYPSVGTFQEGMVFTIEPMLNVGGSSAKVLSDGWTAVTRDR 237 (273) Q Consensus 162 ~~~~~~g~~-~~~~~~GHgiG--l~~hE~P~-i~~~~~~~~~~~~~le~GMV~aiEP~i~~~~~~~~~~~d~w~~~~~~g 237 (273) +++++.||+ .+.|++|||+| +++||.|. + ..+...+|+|||||++||++|.+ | T Consensus 297 ~~i~~~g~~~~~~h~~GHgvG~~l~vhE~p~~~------~~~~~~~L~~GMv~t~Epg~y~~-----------------g 353 (384) T COG0006 297 QVLEKAGYGLYFLHGTGHGVGFVLDVHEHPQYL------SPGSDTTLEPGMVFSIEPGIYIP-----------------G 353 (384) T ss_pred HHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCC------CCCCCCEECCCEEEEECCCCCCC-----------------C T ss_conf 999971467667787477578877555475435------78987330498299726556568-----------------8 Q ss_pred CEEEEEEEEEEECCCCCEECCCCCCCCCC Q ss_conf 71678442189847985765267988676 Q gi|254780899|r 238 SLSAQYEHTIGITKAGCEIFTLSPNNLGQ 266 (273) Q Consensus 238 ~~~~~~Ed~vlVT~~G~E~LT~~p~~l~~ 266 (273) .+|+|+||+|+||++|+|+||.+|+++.. T Consensus 354 ~~GiRIEd~vlVte~G~e~LT~~~~~~~~ 382 (384) T COG0006 354 GGGVRIEDTVLVTEDGFEVLTRVPKELLV 382 (384) T ss_pred CCEEEEEEEEEECCCCCEECCCCHHHHHC T ss_conf 63798777899858985677776266534 No 13 >cd01087 Prolidase Prolidase. E.C. 3.4.13.9. Also known as Xaa-Pro dipeptidase, X-Pro dipeptidase, proline dipeptidase., imidodipeptidase, peptidase D, gamma-peptidase. Catalyses hydrolysis of Xaa-Pro dipeptides; also acts on aminoacyl-hydroxyproline analogs. No action on Pro-Pro. Probab=100.00 E-value=0 Score=375.04 Aligned_cols=223 Identities=24% Similarity=0.294 Sum_probs=194.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCE Q ss_conf 99999999999999999999757998899999999999998598303443324445532344642111212222001340 Q gi|254780899|r 20 LENIRSACNVVARCLDSLTPIIKPGTTTEEIDDFVLKFGMENNAIPATLNYRGYKKSCCTSINHVICHGIPSNKQLREGD 99 (273) Q Consensus 20 I~~~R~A~~i~~~~~~~~~~~i~~G~se~el~~~~~~~~~~~g~~~~~~~~~~f~~~~~~s~N~~~~H~~p~d~~l~~GD 99 (273) |++||+|++|++++++++.+.++||+||.||++.+++.+.+.|+.++ |+.++++|.|++.+|+.|+|++|++|| T Consensus 1 Ie~mR~A~~i~~~~~~~~~~~i~pG~tE~el~~~~~~~~~~~G~~~~------~~~~~~~g~~~~~~H~~~~~r~l~~Gd 74 (243) T cd01087 1 IELMRKACDISAEAHRAAMKASRPGMSEYELEAEFEYEFRSRGARLA------YSYIVAAGSNAAILHYVHNDQPLKDGD 74 (243) T ss_pred CHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCCCC------CCCEEEECCCCCCCCCCCCCCCCCCCC T ss_conf 98999999999999999998764898799999999999998699658------898699817865358888984167999 Q ss_pred EEEEEEEEEEEEECCCCCCCCCC-CCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHCCC--------- Q ss_conf 57997876653000231112333-44443210367787765432211001121221048999999986296--------- Q gi|254780899|r 100 IVNVDVTYVVNGWHGDSSRMYPV-GKIKRAAERILQVTYESLYKGIAAVKLNANIEDIGKAIQRYAHSERY--------- 169 (273) Q Consensus 100 ~v~iD~g~~~~GY~~D~~RT~~~-G~~~~~~~~l~~~~~~a~~~~i~~ikpG~~~~ei~~a~~~~~~~~g~--------- 169 (273) +|++|+|+.|+||++|++|||.+ |+|+++++++|++++++++++++++|||++++||++++++++.+.++ T Consensus 75 ~v~vD~g~~~~GY~sD~~Rt~~v~G~~~~~~~~~~~~~~~a~~~~~~~~kPG~~~~ei~~~~~~~~~~~~~~~g~~~~~~ 154 (243) T cd01087 75 LVLIDAGAEYGGYASDITRTFPVNGKFTDEQRELYEAVLAAQKAAIAACKPGVSYEDIHLLAHRVLAEGLKELGILKGDV 154 (243) T ss_pred EEEEEECEECCCCCCCCCCEEEECCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCH T ss_conf 99997240129785443017761896799999999999999999999837998599999999999999988659754658 Q ss_pred ----------EECCCCCCCCCCCCCCCCCEEEECCCCCCCCCCEECCCCEEEECCEEECCCCCCEECCCCCEEEECCCCE Q ss_conf ----------0104752123322113540365315645456645116737883111443777625646886688337871 Q gi|254780899|r 170 ----------SVVEVFCGHGIGKSFHEKPEILHFYDPLYPSVGTFQEGMVFTIEPMLNVGGSSAKVLSDGWTAVTRDRSL 239 (273) Q Consensus 170 ----------~~~~~~~GHgiGl~~hE~P~i~~~~~~~~~~~~~le~GMV~aiEP~i~~~~~~~~~~~d~w~~~~~~g~~ 239 (273) ..+.|++|||||+++||.|..... .+.+.+|+|||||||||++|++...... .+..+.. T Consensus 155 ~~~~~~~~~~~~~~h~~GHgiGl~~He~~~~~~~----~~~~~~L~~GMv~aiEPgiy~~~~~~~~-------~~~~~~~ 223 (243) T cd01087 155 DEIVESGAYAKFFPHGLGHYLGLDVHDVGGYLRY----LRRARPLEPGMVITIEPGIYFIPDLLDV-------PEYFRGG 223 (243) T ss_pred HHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCC----CCCCCCCCCCCEEEECCEEEECCCCCCC-------CCCCCCC T ss_conf 8887527864442577357667645778764566----8889785799889987979865755765-------5333500 Q ss_pred EEEEEEEEEECCCCCEECCC Q ss_conf 67844218984798576526 Q gi|254780899|r 240 SAQYEHTIGITKAGCEIFTL 259 (273) Q Consensus 240 ~~~~Ed~vlVT~~G~E~LT~ 259 (273) |+|+||||+||+||||+||+ T Consensus 224 gvr~EdtvlVTedG~e~LTk 243 (243) T cd01087 224 GIRIEDDVLVTEDGPENLTR 243 (243) T ss_pred EEEEEEEEEECCCCCEECCC T ss_conf 89973499998995833889 No 14 >cd01090 Creatinase Creatine amidinohydrolase. E.C.3.5.3.3. Hydrolyzes creatine to sarcosine and urea. Probab=100.00 E-value=0 Score=367.64 Aligned_cols=226 Identities=16% Similarity=0.162 Sum_probs=195.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCCC-CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 9999999999999999999975799889999999999999859830-344332444553234464211121222200134 Q gi|254780899|r 20 LENIRSACNVVARCLDSLTPIIKPGTTTEEIDDFVLKFGMENNAIP-ATLNYRGYKKSCCTSINHVICHGIPSNKQLREG 98 (273) Q Consensus 20 I~~~R~A~~i~~~~~~~~~~~i~~G~se~el~~~~~~~~~~~g~~~-~~~~~~~f~~~~~~s~N~~~~H~~p~d~~l~~G 98 (273) |++||+|++|++++|..+.+.++||+||.||+..+.+.+.+.++.. .+..+.+....+++|.|++.||+.|+||+|++| T Consensus 1 I~liR~Aa~iad~~~~~~~~~i~~G~tE~eia~~~~~~~~~~~a~~~~~~~~~~~~~~~~sG~~~~~~H~~~s~r~l~~G 80 (228) T cd01090 1 IALIRHGARIADIGGAAVVEAIREGVPEYEVALAGTQAMVREIAKTFPEVELMDTWTWFQSGINTDGAHNPVTNRKVQRG 80 (228) T ss_pred CHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC T ss_conf 98999999999999999998734999699999999999998500116863113566630221477678899777674899 Q ss_pred EEEEEEEEEEEEEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHCCCE-ECCCCCC Q ss_conf 057997876653000231112333444432103677877654322110011212210489999999862960-1047521 Q gi|254780899|r 99 DIVNVDVTYVVNGWHGDSSRMYPVGKIKRAAERILQVTYESLYKGIAAVKLNANIEDIGKAIQRYAHSERYS-VVEVFCG 177 (273) Q Consensus 99 D~v~iD~g~~~~GY~~D~~RT~~~G~~~~~~~~l~~~~~~a~~~~i~~ikpG~~~~ei~~a~~~~~~~~g~~-~~~~~~G 177 (273) |+|++|+|+.|+||++|++|||++|+|+++++++|+++++||+++++++|||++++||+++++++++++||. ..++.+| T Consensus 81 d~v~iD~g~~~~GY~sd~~RT~~vG~~s~~~~~~~~~~~ea~~~a~~~ikPG~~~~dv~~~~~~~~~~~g~~~~~~~g~G 160 (228) T cd01090 81 DILSLNCFPMIAGYYTALERTLFLDEVSDAHLKIWEANVAVHERGLELIKPGARCKDIAAELNEMYREHDLLRYRTFGYG 160 (228) T ss_pred CEEEEEEEEEECCEECCCEEEEECCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCC T ss_conf 98999889999999847635897268999999999999999999999816998499999999999997498620465577 Q ss_pred CCCCCCCCCCCEEEECCCCCCCCCCEECCCCEEEECCEEECCCCCCEECCCCCEEEECCCCEEEEEEEEEEECCCCCEEC Q ss_conf 23322113540365315645456645116737883111443777625646886688337871678442189847985765 Q gi|254780899|r 178 HGIGKSFHEKPEILHFYDPLYPSVGTFQEGMVFTIEPMLNVGGSSAKVLSDGWTAVTRDRSLSAQYEHTIGITKAGCEIF 257 (273) Q Consensus 178 HgiGl~~hE~P~i~~~~~~~~~~~~~le~GMV~aiEP~i~~~~~~~~~~~d~w~~~~~~g~~~~~~Ed~vlVT~~G~E~L 257 (273) ||+|+..|+.|.-... ....+++.+|+|||||||||++|++.+ .+|..|+|+||+|+||+||||+| T Consensus 161 hgiG~~~h~~g~~~~~-~~~~~~~~vL~pGMV~tvEPgiy~~~~-------------~~G~gG~rieD~vvVTe~G~E~L 226 (228) T cd01090 161 HSFGVLSHYYGREAGL-ELREDIDTVLEPGMVVSMEPMIMLPEG-------------QPGAGGYREHDILVINENGAENI 226 (228) T ss_pred CCCCCCCCCCCCCCCC-EECCCCCCEECCCCEEEECCEEEECCC-------------CCCCCEEEEEEEEEECCCCCCCC T ss_conf 7068647788777775-313899978479998999998665776-------------88655899761799999942148 Q ss_pred CC Q ss_conf 26 Q gi|254780899|r 258 TL 259 (273) Q Consensus 258 T~ 259 (273) |- T Consensus 227 Tg 228 (228) T cd01090 227 TG 228 (228) T ss_pred CC T ss_conf 79 No 15 >cd01092 APP-like Similar to Prolidase and Aminopeptidase P. The members of this subfamily presumably catalyse hydrolysis of Xaa-Pro dipeptides and/or release of any N-terminal amino acid, including proline, that is linked with proline. Probab=100.00 E-value=0 Score=359.37 Aligned_cols=207 Identities=28% Similarity=0.455 Sum_probs=193.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCE Q ss_conf 99999999999999999999757998899999999999998598303443324445532344642111212222001340 Q gi|254780899|r 20 LENIRSACNVVARCLDSLTPIIKPGTTTEEIDDFVLKFGMENNAIPATLNYRGYKKSCCTSINHVICHGIPSNKQLREGD 99 (273) Q Consensus 20 I~~~R~A~~i~~~~~~~~~~~i~~G~se~el~~~~~~~~~~~g~~~~~~~~~~f~~~~~~s~N~~~~H~~p~d~~l~~GD 99 (273) |++||+|++|++++++++.+.++||+||.||++.+.+.+.+.|+.. .+|+.++++|.|.+.+|+.|++++|++|| T Consensus 1 Ie~mR~A~~i~~~~~~~~~~~i~pG~te~el~~~~~~~~~~~G~~~-----~~~~~~v~sG~~~~~~h~~~~~~~l~~Gd 75 (208) T cd01092 1 IELLRKAARIADKAFEELLEFIKPGMTEREVAAELEYFMRKLGAEG-----PSFDTIVASGPNSALPHGVPSDRKIEEGD 75 (208) T ss_pred CHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCCC-----CCCCCEEEECCCCCCCCCCCCCCCCCCCC T ss_conf 9899999999999999999977489989999999999999869986-----89786899862214567899985356999 Q ss_pred EEEEEEEEEEEEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHCCCE-ECCCCCCC Q ss_conf 57997876653000231112333444432103677877654322110011212210489999999862960-10475212 Q gi|254780899|r 100 IVNVDVTYVVNGWHGDSSRMYPVGKIKRAAERILQVTYESLYKGIAAVKLNANIEDIGKAIQRYAHSERYS-VVEVFCGH 178 (273) Q Consensus 100 ~v~iD~g~~~~GY~~D~~RT~~~G~~~~~~~~l~~~~~~a~~~~i~~ikpG~~~~ei~~a~~~~~~~~g~~-~~~~~~GH 178 (273) +|++|+|+.|+||++|++|||++|+|++++++++++++++++++++++|||++++||++++++++++.||. .+.|++|| T Consensus 76 ~v~iD~g~~~~gY~aD~~Rt~~vG~~~~~~~~~~~~~~~~~~~~i~~~kpG~~~~dv~~~~~~~~~~~g~~~~~~~~~GH 155 (208) T cd01092 76 LVLIDFGAIYDGYCSDITRTVAVGEPSDELKEIYEIVLEAQQAAIKAVKPGVTAKEVDKAARDVIEEAGYGEYFIHRTGH 155 (208) T ss_pred EEEEEEEECCCCEEECCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCCEEECCCCCC T ss_conf 89998300549986303033348988999999999999999999972379990999999999999975987011037755 Q ss_pred CCCCCCCCCCEEEECCCCCCCCCCEECCCCEEEECCEEECCCCCCEECCCCCEEEECCCCEEEEEEEEEEECCCCC Q ss_conf 3322113540365315645456645116737883111443777625646886688337871678442189847985 Q gi|254780899|r 179 GIGKSFHEKPEILHFYDPLYPSVGTFQEGMVFTIEPMLNVGGSSAKVLSDGWTAVTRDRSLSAQYEHTIGITKAGC 254 (273) Q Consensus 179 giGl~~hE~P~i~~~~~~~~~~~~~le~GMV~aiEP~i~~~~~~~~~~~d~w~~~~~~g~~~~~~Ed~vlVT~~G~ 254 (273) |+|+.+||.|.+. ++++.+|+|||||+|||++|++ |..|+|+||||+||+||| T Consensus 156 giGl~~he~P~i~------~~~~~~l~~GMv~~iEP~i~~~-----------------g~~g~~iedtv~VT~~Gc 208 (208) T cd01092 156 GVGLEVHEAPYIS------PGSDDVLEEGMVFTIEPGIYIP-----------------GKGGVRIEDDVLVTEDGC 208 (208) T ss_pred CCCCCCCCCCCCC------CCCCCCCCCCCEEEECCEEEEC-----------------CCCEEEEEEEEEECCCCC T ss_conf 4466677687026------8989723899999988910658-----------------954799731699858999 No 16 >PRK08671 methionine aminopeptidase; Provisional Probab=100.00 E-value=0 Score=344.94 Aligned_cols=226 Identities=30% Similarity=0.506 Sum_probs=204.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC---CCCCC Q ss_conf 999999999999999999997579988999999999999985983034433244455323446421112122---22001 Q gi|254780899|r 20 LENIRSACNVVARCLDSLTPIIKPGTTTEEIDDFVLKFGMENNAIPATLNYRGYKKSCCTSINHVICHGIPS---NKQLR 96 (273) Q Consensus 20 I~~~R~A~~i~~~~~~~~~~~i~~G~se~el~~~~~~~~~~~g~~~~~~~~~~f~~~~~~s~N~~~~H~~p~---d~~l~ 96 (273) |+.+|+|++|+.++++.+.+.++||+|+.||+..+++++.+.|+.|+ || +|+|+|++++||+|. +++|+ T Consensus 2 ~~kyr~Ag~I~~~vl~~~~~~~kpG~t~~el~~~~e~~i~~~ga~pA------FP--~~iSvN~~v~H~~P~~~d~~~lk 73 (293) T PRK08671 2 LEKYREAGKIASKVREEAAKLIKPGASLLEVAEFVENRIRELGAKPA------FP--CNISINEVAAHYTPSPGDETVFP 73 (293) T ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCCCC------CC--CEECCCCEEECCCCCCCCCCCCC T ss_conf 58999999999999999997676999099999999999997599477------77--62433468777899988661625 Q ss_pred CCEEEEEEEEEEEEEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHCCCEECCCCC Q ss_conf 34057997876653000231112333444432103677877654322110011212210489999999862960104752 Q gi|254780899|r 97 EGDIVNVDVTYVVNGWHGDSSRMYPVGKIKRAAERILQVTYESLYKGIAAVKLNANIEDIGKAIQRYAHSERYSVVEVFC 176 (273) Q Consensus 97 ~GD~v~iD~g~~~~GY~~D~~RT~~~G~~~~~~~~l~~~~~~a~~~~i~~ikpG~~~~ei~~a~~~~~~~~g~~~~~~~~ 176 (273) +||+|+||+|++++||++|.++|+.+| ++.++|.+++++|++++|+++|||++.+||..+++++++++||.+++.++ T Consensus 74 ~GDiVkiD~g~~idG~~~d~a~T~~vg---~~~~~Li~aa~~al~~ai~~~kpG~~~~dIg~~I~~~~~~~g~~~v~~~~ 150 (293) T PRK08671 74 EGDVVKLDLGAHVDGYIADTAVTVDLG---GKYSDLVEASREALEAAIEIVKPGVSVGEIGRAIEETIKSYGFKPIRNLT 150 (293) T ss_pred CCCEEEEEEEEEECCEEEEEEEEEEEC---CCHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCEEECCCC T ss_conf 899899978889899998888999979---61689999999999999987069979789999999999986988951444 Q ss_pred CCCCCC-CCCCCCEEEECCCCCCCCCCEECCCCEEEECCEEECCCCCCEECC---------------------------- Q ss_conf 123322-113540365315645456645116737883111443777625646---------------------------- Q gi|254780899|r 177 GHGIGK-SFHEKPEILHFYDPLYPSVGTFQEGMVFTIEPMLNVGGSSAKVLS---------------------------- 227 (273) Q Consensus 177 GHgiGl-~~hE~P~i~~~~~~~~~~~~~le~GMV~aiEP~i~~~~~~~~~~~---------------------------- 227 (273) |||+|. .+|+.|.|+|++. .....|++||||+|||+++.+...+.-.+ T Consensus 151 gHgigry~iH~~~~Ipn~~~---~~~~~l~~g~v~aIEpf~ttG~g~v~e~~~~tiy~~~~~~~~k~~~ar~~l~~I~~~ 227 (293) T PRK08671 151 GHGIERYDLHAGPSIPNYDE---GGGAVLKEGDVYAIEPFATDGEGQVVEGPEVEIYSLLRKRPVRLPAARKLLDKIEEE 227 (293) T ss_pred CCCCCCCCCCCCCEECCCCC---CCCCEECCCCEEEEEEEEECCCCEEEECCCEEEEEECCCCCCCCHHHHHHHHHHHHH T ss_conf 46766515427987052069---987155078589994476278875876796148874276543467899999999865 Q ss_pred -------CCC---------------------------EEEECCCCEEEEEEEEEEECCCCCEECCC Q ss_conf -------886---------------------------68833787167844218984798576526 Q gi|254780899|r 228 -------DGW---------------------------TAVTRDRSLSAQYEHTIGITKAGCEIFTL 259 (273) Q Consensus 228 -------d~w---------------------------~~~~~~g~~~~~~Ed~vlVT~~G~E~LT~ 259 (273) +.| .++++||++.+||||||+||++|+|+||+ T Consensus 228 f~tlPFs~Rwl~~~~~~~~~~~~l~elv~~g~l~~Ypvl~ek~g~~vAQfehTvlVt~~G~evlTk 293 (293) T PRK08671 228 YKTLPFAERWLAKLLDVDRLELALRRLLRAGALYGYPVLIEVSGGLVSQFEHTVIVTEDGCEVTTK 293 (293) T ss_pred CCCCCCCCHHHHHCCCHHHHHHHHHHHHHCCCCCCCCCEECCCCCEEEEEEEEEEECCCCCEECCC T ss_conf 589876224555113426789999999876984368822824998888988999999995678189 No 17 >PRK13607 proline dipeptidase; Provisional Probab=100.00 E-value=0 Score=347.49 Aligned_cols=246 Identities=17% Similarity=0.152 Sum_probs=190.7 Q ss_pred HCCCEEECCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 13978853999999999999999999999999757998899999999999998598303443324445532344642111 Q gi|254780899|r 8 ESGSINIYTPEELENIRSACNVVARCLDSLTPIIKPGTTTEEIDDFVLKFGMENNAIPATLNYRGYKKSCCTSINHVICH 87 (273) Q Consensus 8 ~~~~~~IKs~~EI~~~R~A~~i~~~~~~~~~~~i~~G~se~el~~~~~~~~~~~g~~~~~~~~~~f~~~~~~s~N~~~~H 87 (273) ....|.|||++||++||+|++|+++++.++++.++||++|.||...+...+. .|+.. .+|+.++++|.|++++| T Consensus 155 l~e~R~iKs~~EI~~mr~A~~is~~ah~~~m~~~kpG~~E~ei~~~~~~~~~-~~~~~-----~~Y~~IVasG~na~~LH 228 (442) T PRK13607 155 LHYHRAYKTDYELACMREAQKIAVAGHRAAKEAFRSGMSEFDINLAYLTATG-QRDND-----VPYGNIVALNEHAAVLH 228 (442) T ss_pred HHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHH-CCCCC-----CCCCCEEEECCCCCCCC T ss_conf 9887668999999999999999999999999977477709999999999997-68867-----89873798378764467 Q ss_pred CCCCC-CCCCCCEEEEEEEEEEEEEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHH Q ss_conf 21222-20013405799787665300023111233344443210367787765432211001121221048999999986 Q gi|254780899|r 88 GIPSN-KQLREGDIVNVDVTYVVNGWHGDSSRMYPVGKIKRAAERILQVTYESLYKGIAAVKLNANIEDIGKAIQRYAHS 166 (273) Q Consensus 88 ~~p~d-~~l~~GD~v~iD~g~~~~GY~~D~~RT~~~G~~~~~~~~l~~~~~~a~~~~i~~ikpG~~~~ei~~a~~~~~~~ 166 (273) |..+| +.++.++.|++|.||.|+||+||+||||.++. +.++.+++++++++|.++++++|||+++.+++.++.+++.+ T Consensus 229 Y~~~d~~~~~~~~~vLiDaGae~~gYaSDITRTfp~~~-~~~~~~l~~~v~~aq~~~i~~~kpG~~~~d~h~~a~~~i~~ 307 (442) T PRK13607 229 YTKLDHQAPAEMRSFLIDAGAEYNGYASDITRTYAAKE-DNDFAALIKAVNKEQLALIATMKPGVSYVDLHIQTHQRIAK 307 (442) T ss_pred CCCCCCCCCCCCCEEEEECCCCCCCEEECCEEEEECCC-CHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHH T ss_conf 65456655677867999726231747714425630798-87899999999999999999817999756899999999999 Q ss_pred ----CCC--------------E-ECCCCCCCCCCCCCCCCCEEEEC--CC--------CCCCCCCEECCCCEEEECCEEE Q ss_conf ----296--------------0-10475212332211354036531--56--------4545664511673788311144 Q gi|254780899|r 167 ----ERY--------------S-VVEVFCGHGIGKSFHEKPEILHF--YD--------PLYPSVGTFQEGMVFTIEPMLN 217 (273) Q Consensus 167 ----~g~--------------~-~~~~~~GHgiGl~~hE~P~i~~~--~~--------~~~~~~~~le~GMV~aiEP~i~ 217 (273) .|. . .+.|++||++||++|+-...... +. +.-...++|+|||||||||||| T Consensus 308 ~L~~lgl~~~~~~~~~~~~~~~~f~~HglGH~lGLdVHDvg~~~~~~~g~~~~~~~~~~~lR~~r~L~~GmV~TVEPGiY 387 (442) T PRK13607 308 LLRDFGIVTGSEEAMVEQGITSAFFPHGLGHPLGLQVHDVAGFMQDDRGTHLAAPEKHPYLRCTRVLEPGMVLTIEPGLY 387 (442) T ss_pred HHHHCCCCCCCHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEECCEEC T ss_conf 99976966788899986665657668888756775312255424566677667754465433356489998898899205 Q ss_pred CCCCCCEEC-------CCCCEEEECCCCE-EEEEEEEEEECCCCCEECCCC Q ss_conf 377762564-------6886688337871-678442189847985765267 Q gi|254780899|r 218 VGGSSAKVL-------SDGWTAVTRDRSL-SAQYEHTIGITKAGCEIFTLS 260 (273) Q Consensus 218 ~~~~~~~~~-------~d~w~~~~~~g~~-~~~~Ed~vlVT~~G~E~LT~~ 260 (273) +-..-...+ .-.|.++.+-..+ |+||||+||||++|+|+||+- T Consensus 388 f~~~ll~~~~~~~~~~~~~~~~i~~~~~~GGIRIEDdvlVT~~G~enLT~d 438 (442) T PRK13607 388 FIDSLLAPLREGPFSKHINWQKIDALKPFGGIRIEDNVVVHEDRVENMTRD 438 (442) T ss_pred CCCCCCCCCCCCCCCCCCCHHHHHHCCCCCEEEECCEEEECCCCCEECCCC T ss_conf 272012200046211111134565417888797362799979978757713 No 18 >cd01088 MetAP2 Methionine Aminopeptidase 2. E.C. 3.4.11.18. Also known as methionyl aminopeptidase and peptidase M. Catalyzes release of N-terminal amino acids, preferentially methionine, from peptides and arylamides. Probab=100.00 E-value=0 Score=340.49 Aligned_cols=226 Identities=28% Similarity=0.396 Sum_probs=203.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC---CCCCC Q ss_conf 999999999999999999997579988999999999999985983034433244455323446421112122---22001 Q gi|254780899|r 20 LENIRSACNVVARCLDSLTPIIKPGTTTEEIDDFVLKFGMENNAIPATLNYRGYKKSCCTSINHVICHGIPS---NKQLR 96 (273) Q Consensus 20 I~~~R~A~~i~~~~~~~~~~~i~~G~se~el~~~~~~~~~~~g~~~~~~~~~~f~~~~~~s~N~~~~H~~p~---d~~l~ 96 (273) |+.+|+|++|+.++++.+.+.++||+|..||+..+++++.+.|+.++ || +|.|+|++++||+|+ +++|+ T Consensus 1 l~kyr~Ag~i~~~vl~~~~~~~kpG~s~~el~~~~e~~i~~~~a~~a------FP--~~iSvN~~v~H~~P~~~d~~~lk 72 (291) T cd01088 1 LEKYREAGEIHRQVRKYAQSLIKPGMTLLEIAEFVENRIRELGAGPA------FP--VNLSINECAAHYTPNAGDDTVLK 72 (291) T ss_pred CHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCCCC------CC--CEECCCCEEECCCCCCCCCEEEC T ss_conf 96899999999999999996676999599999999999998598067------78--62245467756889988754633 Q ss_pred CCEEEEEEEEEEEEEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHCCCEECCCCC Q ss_conf 34057997876653000231112333444432103677877654322110011212210489999999862960104752 Q gi|254780899|r 97 EGDIVNVDVTYVVNGWHGDSSRMYPVGKIKRAAERILQVTYESLYKGIAAVKLNANIEDIGKAIQRYAHSERYSVVEVFC 176 (273) Q Consensus 97 ~GD~v~iD~g~~~~GY~~D~~RT~~~G~~~~~~~~l~~~~~~a~~~~i~~ikpG~~~~ei~~a~~~~~~~~g~~~~~~~~ 176 (273) +||+|+||+|++++||++|.++|+.+|+ ...+|.+++++|++++|+.+|||++.+||..+++++++++||+++++++ T Consensus 73 ~GDiVkiD~G~~idG~~~D~A~Tv~v~~---~~~~Li~aa~~al~~ai~~~kpG~~~~dIg~~Ie~~~~~~g~~~v~~l~ 149 (291) T cd01088 73 EGDVVKLDFGAHVDGYIADSAFTVDFDP---KYDDLLEAAKEALNAAIKEAGPDVRLGEIGEAIEEVIESYGFKPIRNLT 149 (291) T ss_pred CCCEEEEEEEEEECCEEEEEEEEEECCC---CHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHCCCEEECCCC T ss_conf 7998998665787788868889996383---1778999999999999998538989899999999999986977003652 Q ss_pred CCCCCC-CCCCCCEEEECCCCCCCCCCEECCCCEEEECCEEECCCCCCEE------------------------------ Q ss_conf 123322-1135403653156454566451167378831114437776256------------------------------ Q gi|254780899|r 177 GHGIGK-SFHEKPEILHFYDPLYPSVGTFQEGMVFTIEPMLNVGGSSAKV------------------------------ 225 (273) Q Consensus 177 GHgiGl-~~hE~P~i~~~~~~~~~~~~~le~GMV~aiEP~i~~~~~~~~~------------------------------ 225 (273) |||+|. .+|+.|.|+|+... ...+|++|||||||||++.|...+.- T Consensus 150 GH~i~ry~ih~~~~Ipn~~~~---~~~~l~~g~v~aIEp~~t~G~g~v~~~~~~~iy~~~~~~~lk~~~~r~~l~~I~~~ 226 (291) T cd01088 150 GHSIERYRLHAGKSIPNVKGG---EGTRLEEGDVYAIEPFATTGKGYVHDGPECSIYMLNRDKPLRLPRARKLLDVIYEN 226 (291) T ss_pred CCCCCCCCCCCCCEECEECCC---CCCEECCCCEEEEEEEEECCCCEEECCCCEEEEEECCCCCCCCHHHHHHHHHHHHH T ss_conf 557777221699753425289---98366068689985574058752633896158886255543207789999999862 Q ss_pred -----CCCCC--------------------------EEEECCCCEEEEEEEEEEECCCCCEECCC Q ss_conf -----46886--------------------------68833787167844218984798576526 Q gi|254780899|r 226 -----LSDGW--------------------------TAVTRDRSLSAQYEHTIGITKAGCEIFTL 259 (273) Q Consensus 226 -----~~d~w--------------------------~~~~~~g~~~~~~Ed~vlVT~~G~E~LT~ 259 (273) +...| .+++++|++.+||||||+||++|+|+||+ T Consensus 227 f~tlPFs~Rwl~~~~~~~~~~~l~el~~~g~l~~Ypvl~e~~g~~vAQfEhTvlvt~~G~~vlTk 291 (291) T cd01088 227 FGTLPFARRWLDRLGETKLLMALKNLCKAGIVYPYPVLKEISGGYVAQFEHTIIVREDGKEVTTR 291 (291) T ss_pred CCCCEEEHHHHHHCCHHHHHHHHHHHHHCCCCCCCCEEEECCCCEEEEEEEEEEECCCCCEECCC T ss_conf 49713100112211235789999999865974368714813998898888899998994577289 No 19 >cd01089 PA2G4-like Related to aminopepdidase M, this family contains proliferation-associated protein 2G4. Family members have been implicated in cell cycle control. Probab=100.00 E-value=0 Score=337.53 Aligned_cols=216 Identities=18% Similarity=0.297 Sum_probs=193.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCC---CCCCCCCCCCCCCCCCCCCCCCCCCC----CC Q ss_conf 999999999999999999997579988999999999999985983---03443324445532344642111212----22 Q gi|254780899|r 20 LENIRSACNVVARCLDSLTPIIKPGTTTEEIDDFVLKFGMENNAI---PATLNYRGYKKSCCTSINHVICHGIP----SN 92 (273) Q Consensus 20 I~~~R~A~~i~~~~~~~~~~~i~~G~se~el~~~~~~~~~~~g~~---~~~~~~~~f~~~~~~s~N~~~~H~~p----~d 92 (273) |++||+|++|++++++++.+.++||+|+.||+..+++++.++++. +....+.+|+..+|+|+|+.++||+| ++ T Consensus 1 IeKmR~Ag~i~a~~l~~~~~~i~pG~tt~eld~~~~~~i~~~~~~~~~~~~~~~~G~~~p~CiSvNe~v~HGiP~~~~~~ 80 (228) T cd01089 1 VTKYKTAGQIANKVLKQVISLCVPGAKVVDLCEKGDKLILEELGKVYKKEKKLEKGIAFPTCISVNNCVCHFSPLKSDAT 80 (228) T ss_pred CHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCEECCCCEEECCCCCCCCCC T ss_conf 97899999999999999995786999699999999999998304325676666677786727502544456899988887 Q ss_pred CCCCCCEEEEEEEEEEEEEECCCCCCCCCCCCCCCC-----CHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHC Q ss_conf 200134057997876653000231112333444432-----103677877654322110011212210489999999862 Q gi|254780899|r 93 KQLREGDIVNVDVTYVVNGWHGDSSRMYPVGKIKRA-----AERILQVTYESLYKGIAAVKLNANIEDIGKAIQRYAHSE 167 (273) Q Consensus 93 ~~l~~GD~v~iD~g~~~~GY~~D~~RT~~~G~~~~~-----~~~l~~~~~~a~~~~i~~ikpG~~~~ei~~a~~~~~~~~ 167 (273) ++|++||+|+||+|+.|+||++|.+|||.+|++++. .++|++++++|++++++++|||++.+||..+++++++++ T Consensus 81 ~~L~~GDiV~iD~g~~~dGy~~D~a~T~~VG~~~~~~~~~~~~~L~~~t~~al~~gI~~~~pG~~~~dI~~ai~~~~~~~ 160 (228) T cd01089 81 YTLKDGDVVKIDLGCHIDGYIAVVAHTIVVGAEAETPVTGKKADVIAAAHYALEAALRLLRPGNQNSDITEAIQKVIVDY 160 (228) T ss_pred CCCCCCCEEEEEEEEEECCEEEEEEEEEEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHC T ss_conf 35468998999776886877987569999489671023379999999999999999997079989689999999999984 Q ss_pred CCEECCCCCCCCCCCCCCCCCEEEECCCCCCCCCCEECCCCEEEECCEEECCCCCCEECCCCCEEEECCCCEEEEEEEEE Q ss_conf 96010475212332211354036531564545664511673788311144377762564688668833787167844218 Q gi|254780899|r 168 RYSVVEVFCGHGIGKSFHEKPEILHFYDPLYPSVGTFQEGMVFTIEPMLNVGGSSAKVLSDGWTAVTRDRSLSAQYEHTI 247 (273) Q Consensus 168 g~~~~~~~~GHgiGl~~hE~P~i~~~~~~~~~~~~~le~GMV~aiEP~i~~~~~~~~~~~d~w~~~~~~g~~~~~~Ed~v 247 (273) ||+.++.++||++|..+|+.|..+. ....++.||++.++|+ .++||++++|||||| T Consensus 161 g~~~v~~~~gH~~~~~~~~~~~~~~-------~~e~~~~~~~~~~~~~-----------------~~~dg~~~A~fEhTv 216 (228) T cd01089 161 GCTPVEGVLSHQLKRVVSSGEGKAK-------LVECVKHGLLFPYPVL-----------------YEKEGEVVAQFKLTV 216 (228) T ss_pred CCEEEECCCCCCCCCCCCCCCCCCC-------CCCCCCCCEEEEEEEE-----------------ECCCCCEEEEEEEEE T ss_conf 9967745205699981777999888-------5413546718888789-----------------857998887833799 Q ss_pred EECCCCCEECCC Q ss_conf 984798576526 Q gi|254780899|r 248 GITKAGCEIFTL 259 (273) Q Consensus 248 lVT~~G~E~LT~ 259 (273) +||++|+|+||- T Consensus 217 lVT~~G~eILTg 228 (228) T cd01089 217 LLTPNGVTVLTG 228 (228) T ss_pred EECCCCCEECCC T ss_conf 997998877389 No 20 >pfam00557 Peptidase_M24 Metallopeptidase family M24. This family contains metallopeptidases. It also contains non-peptidase homologues such as the N terminal domain of Spt16 which is a histone H3-H4 binding module. Probab=100.00 E-value=1.6e-43 Score=310.78 Aligned_cols=206 Identities=28% Similarity=0.388 Sum_probs=183.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEE Q ss_conf 99999999999999999997579988999999999999985983034433244455323446421112122220013405 Q gi|254780899|r 21 ENIRSACNVVARCLDSLTPIIKPGTTTEEIDDFVLKFGMENNAIPATLNYRGYKKSCCTSINHVICHGIPSNKQLREGDI 100 (273) Q Consensus 21 ~~~R~A~~i~~~~~~~~~~~i~~G~se~el~~~~~~~~~~~g~~~~~~~~~~f~~~~~~s~N~~~~H~~p~d~~l~~GD~ 100 (273) ++||+|+++++++++++.+.++||+||.||++.+.+.+.+.++. .+.+|+.++++|.|...+|+.|++++|++||+ T Consensus 1 ~~~r~A~~i~~~~~~~~~~~i~~G~te~ei~~~~~~~~~~~~g~----~~~~~~~~v~~g~~~~~~h~~~~~~~l~~Gd~ 76 (207) T pfam00557 1 ELMRKAARIAAAALEAALAAIRPGVTERELAAELEEAFLARGGA----RGPAFPPIVASGPNAAVPHYIPSDRVLKDGDL 76 (207) T ss_pred CHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHCCC----CCCCCCCEEEECCCCCCCCCCCCCCCCCCCCE T ss_conf 98999999999999999997769998999999999999995799----88894876888733144268999750669998 Q ss_pred EEEEEEEEEEEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHCCCEECCCCCCCCC Q ss_conf 79978766530002311123334444321036778776543221100112122104899999998629601047521233 Q gi|254780899|r 101 VNVDVTYVVNGWHGDSSRMYPVGKIKRAAERILQVTYESLYKGIAAVKLNANIEDIGKAIQRYAHSERYSVVEVFCGHGI 180 (273) Q Consensus 101 v~iD~g~~~~GY~~D~~RT~~~G~~~~~~~~l~~~~~~a~~~~i~~ikpG~~~~ei~~a~~~~~~~~g~~~~~~~~GHgi 180 (273) |++|+|+.|+||++|++|||++|+|+++++++|++++++++++++++|||++++||++++++.+++.||..+.+++|||+ T Consensus 77 v~~d~g~~~~gY~~d~~Rt~~~G~~~~~~~~~~~~~~~~~~~~~~~~rpG~~~~ev~~~~~~~~~~~g~~~~~~~~GHgi 156 (207) T pfam00557 77 VLIDVGAEYDGYHSDITRTFVVGKPTPEQRELYEAVLEAQEAAIAAVKPGVTGGDVDAAAREVLEEGGYGVFPHGLGHGI 156 (207) T ss_pred EEEECCCEECCEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCCCCCCEECCCC T ss_conf 99962520798985304333289989999999999999999999863799889999999999999859998766102512 Q ss_pred CCCCCCC-CEEEECCCCCCCCCCEECCCCEEEECCEEECCCCCCEECCCCCEEEECCCCEEEEEEEEEEECC Q ss_conf 2211354-0365315645456645116737883111443777625646886688337871678442189847 Q gi|254780899|r 181 GKSFHEK-PEILHFYDPLYPSVGTFQEGMVFTIEPMLNVGGSSAKVLSDGWTAVTRDRSLSAQYEHTIGITK 251 (273) Q Consensus 181 Gl~~hE~-P~i~~~~~~~~~~~~~le~GMV~aiEP~i~~~~~~~~~~~d~w~~~~~~g~~~~~~Ed~vlVT~ 251 (273) |+.+||. |.+.+ .+++.+|++||||++||++|.. +|..++|+||||+||+ T Consensus 157 Gl~~~~~~P~~~~-----~~~~~~l~~GMv~~iep~~~~~----------------~g~~g~~iedtv~VT~ 207 (207) T pfam00557 157 GLDVHDEGPYIIS-----GGSDRVLEPGMVFTIEPGIYFI----------------PGWGGVRIEDTVLVTE 207 (207) T ss_pred CCCCCCCCCCCCC-----CCCCCEECCCCEEEECCEEEEC----------------CCCEEEEEEEEEEECC T ss_conf 6677789864057-----9999897999899998961453----------------9966999866899769 No 21 >cd01066 APP_MetAP A family including aminopeptidase P, aminopeptidase M, and prolidase. Also known as metallopeptidase family M24. This family of enzymes is able to cleave amido-, imido- and amidino-containing bonds. Members exibit relatively narrow substrate specificity compared to other metallo-aminopeptidases, suggesting they play roles in regulation of biological processes rather than general protein degradation. Probab=100.00 E-value=4.2e-43 Score=308.03 Aligned_cols=206 Identities=23% Similarity=0.369 Sum_probs=189.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCE Q ss_conf 99999999999999999999757998899999999999998598303443324445532344642111212222001340 Q gi|254780899|r 20 LENIRSACNVVARCLDSLTPIIKPGTTTEEIDDFVLKFGMENNAIPATLNYRGYKKSCCTSINHVICHGIPSNKQLREGD 99 (273) Q Consensus 20 I~~~R~A~~i~~~~~~~~~~~i~~G~se~el~~~~~~~~~~~g~~~~~~~~~~f~~~~~~s~N~~~~H~~p~d~~l~~GD 99 (273) |++||+|+++++++++++.+.++||+||.|+++.+.+.+.+.|+.. +|+.+++++.|..++|+.|++++|++|| T Consensus 1 i~~~r~a~~i~~~~~~~~~~~~~~G~te~ei~~~~~~~~~~~g~~~------~~~~~~~~g~~~~~~h~~~~~~~l~~gd 74 (207) T cd01066 1 IARLRKAAEIAEAAMAAAAEAIRPGVTEAEVAAAIEQALRAAGGYP------AGPTIVGSGARTALPHYRPDDRRLQEGD 74 (207) T ss_pred CHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCCC------CCCEEEEECCCCCCCCCCCCCCCCCCCC T ss_conf 9899999999999999999977589989999999999999879988------8872888547445545799987868999 Q ss_pred EEEEEEEEEEEEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHCCCE-ECCCCCCC Q ss_conf 57997876653000231112333444432103677877654322110011212210489999999862960-10475212 Q gi|254780899|r 100 IVNVDVTYVVNGWHGDSSRMYPVGKIKRAAERILQVTYESLYKGIAAVKLNANIEDIGKAIQRYAHSERYS-VVEVFCGH 178 (273) Q Consensus 100 ~v~iD~g~~~~GY~~D~~RT~~~G~~~~~~~~l~~~~~~a~~~~i~~ikpG~~~~ei~~a~~~~~~~~g~~-~~~~~~GH 178 (273) +|++|+++.|+||++|++|||++|++++++++++++++++++++++.+|||++++||+.++++++++.||. ...+++|| T Consensus 75 ~v~~D~g~~~~gY~~d~~Rt~~~G~~~~~~~~~~~~~~~~~~~~~~~~~pG~~~~~i~~~~~~~~~~~g~~~~~~~~~GH 154 (207) T cd01066 75 LVLVDLGGVYDGYHADLTRTFVIGEPSDEQRELYEAVREAQEAALAALRPGVTAEEVDAAAREVLEEHGLGPNFGHRTGH 154 (207) T ss_pred EEEEEEEEEECCEEECCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEC T ss_conf 89999768899995203545437986610021789999999999985079997999999999999982888512352243 Q ss_pred CCCCCCCCCCEEEECCCCCCCCCCEECCCCEEEECCEEECCCCCCEECCCCCEEEECCCCEEEEEEEEEEECCCCC Q ss_conf 3322113540365315645456645116737883111443777625646886688337871678442189847985 Q gi|254780899|r 179 GIGKSFHEKPEILHFYDPLYPSVGTFQEGMVFTIEPMLNVGGSSAKVLSDGWTAVTRDRSLSAQYEHTIGITKAGC 254 (273) Q Consensus 179 giGl~~hE~P~i~~~~~~~~~~~~~le~GMV~aiEP~i~~~~~~~~~~~d~w~~~~~~g~~~~~~Ed~vlVT~~G~ 254 (273) ++|+.+||.|.+ ..+++.+|++||||+|||+++.+ +..++|+||+|+||++|+ T Consensus 155 ~vG~~~~e~p~~------~~~~~~~l~~gmv~~iep~~~~~-----------------~~~g~~~ed~~~Vt~~G~ 207 (207) T cd01066 155 GIGLEIHEPPVL------KAGDDTVLEPGMVFAVEPGLYLP-----------------GGGGVRIEDTVLVTEDGP 207 (207) T ss_pred CCCCCCCCCCCC------CCCCCCEECCCCEEEECCEEEEC-----------------CCEEEEEEEEEEECCCCC T ss_conf 278647789855------79999887899999999960768-----------------934899821799849999 No 22 >cd01085 APP X-Prolyl Aminopeptidase 2. E.C. 3.4.11.9. Also known as X-Pro aminopeptidase, proline aminopeptidase, aminopeptidase P, and aminoacylproline aminopeptidase. Catalyses release of any N-terminal amino acid, including proline, that is linked with proline, even from a dipeptide or tripeptide. Probab=100.00 E-value=4.5e-42 Score=301.15 Aligned_cols=206 Identities=17% Similarity=0.169 Sum_probs=171.6 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCC--CCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC---CCCCCC Q ss_conf 99999999999999999975799--8899999999999998598303443324445532344642111212---222001 Q gi|254780899|r 22 NIRSACNVVARCLDSLTPIIKPG--TTTEEIDDFVLKFGMENNAIPATLNYRGYKKSCCTSINHVICHGIP---SNKQLR 96 (273) Q Consensus 22 ~~R~A~~i~~~~~~~~~~~i~~G--~se~el~~~~~~~~~~~g~~~~~~~~~~f~~~~~~s~N~~~~H~~p---~d~~l~ 96 (273) ++|....++. .+..+.+.++.| +||.|+++.++++..+..+.. ..+|++++++|.|+++|||.| +|++|+ T Consensus 6 hi~D~~A~~~-~l~wl~~~~~~~~~~TE~~~a~~le~~r~~~~g~~----~~SF~tIva~G~naA~pHy~~~~~~~~~i~ 80 (224) T cd01085 6 HIRDGVALVE-FLAWLEQEVPKGETITELSAADKLEEFRRQQKGYV----GLSFDTISGFGPNGAIVHYSPTEESNRKIS 80 (224) T ss_pred HHHHHHHHHH-HHHHHHHHCCCCCCCCHHHHHHHHHHHHHHCCCCC----CCCHHHHHHCCCCCCCCCCCCCCCCCCCCC T ss_conf 9999999999-99999871125897379999999999998779987----878377984886520467788856673336 Q ss_pred CCEEEEEEEEEEEEEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCC-CCCCCHHHHHHHHHHHHCCCEECCCC Q ss_conf 34057997876653000231112333444432103677877654322110011-21221048999999986296010475 Q gi|254780899|r 97 EGDIVNVDVTYVVNGWHGDSSRMYPVGKIKRAAERILQVTYESLYKGIAAVKL-NANIEDIGKAIQRYAHSERYSVVEVF 175 (273) Q Consensus 97 ~GD~v~iD~g~~~~GY~~D~~RT~~~G~~~~~~~~l~~~~~~a~~~~i~~ikp-G~~~~ei~~a~~~~~~~~g~~~~~~~ 175 (273) +||+|++|+|+.|+||++|+||||++|+|+++|+++|+++++++.++.++++| |+++.+++.++|+++++.|+. +.|. T Consensus 81 ~gdlvLiD~G~~Y~gy~SDiTRT~~~G~~s~~~~~~Y~~Vl~a~~~~~~~~~p~g~~~~~lD~~aR~~l~~~G~~-y~Hg 159 (224) T cd01085 81 PDGLYLIDSGGQYLDGTTDITRTVHLGEPTAEQKRDYTLVLKGHIALARAKFPKGTTGSQLDALARQPLWKAGLD-YGHG 159 (224) T ss_pred CCCEEEECCCCCCCCCCCCCEEEEECCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCC-CCCC T ss_conf 898367727922516446655342069999999999999999999999998727985899999999999974988-8879 Q ss_pred CCCCCC--CCCCCCCEEEECCCCCCCCCCEECCCCEEEECCEEECCCCCCEECCCCCEEEECCCCEEEEEEEEEEECCCC Q ss_conf 212332--211354036531564545664511673788311144377762564688668833787167844218984798 Q gi|254780899|r 176 CGHGIG--KSFHEKPEILHFYDPLYPSVGTFQEGMVFTIEPMLNVGGSSAKVLSDGWTAVTRDRSLSAQYEHTIGITKAG 253 (273) Q Consensus 176 ~GHgiG--l~~hE~P~i~~~~~~~~~~~~~le~GMV~aiEP~i~~~~~~~~~~~d~w~~~~~~g~~~~~~Ed~vlVT~~G 253 (273) +||||| |++||.|.... ...++.+|++|||||||||+|++ |.+|+|+||+|+||+.+ T Consensus 160 tGHGiG~~L~vHE~P~~~~----~~~~~~~L~~Gmv~TiEPGiY~~-----------------g~~GiRIEd~v~Vt~~~ 218 (224) T cd01085 160 TGHGVGSFLNVHEGPQSIS----PAPNNVPLKAGMILSNEPGYYKE-----------------GKYGIRIENLVLVVEAE 218 (224) T ss_pred CCCCCCCCCCCCCCCCCCC----CCCCCCCCCCCCEEEECCEEEEC-----------------CCEEEECCCEEEEEECC T ss_conf 9998899877779986667----78999662899799658984888-----------------97379770479999788 Q ss_pred C Q ss_conf 5 Q gi|254780899|r 254 C 254 (273) Q Consensus 254 ~ 254 (273) - T Consensus 219 ~ 219 (224) T cd01085 219 T 219 (224) T ss_pred C T ss_conf 7 No 23 >KOG2414 consensus Probab=100.00 E-value=3.1e-42 Score=302.22 Aligned_cols=241 Identities=20% Similarity=0.271 Sum_probs=209.9 Q ss_pred HHHHCCCEEECCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 03313978853999999999999999999999999757998899999999999998598303443324445532344642 Q gi|254780899|r 5 SSRESGSINIYTPEELENIRSACNVVARCLDSLTPIIKPGTTTEEIDDFVLKFGMENNAIPATLNYRGYKKSCCTSINHV 84 (273) Q Consensus 5 ~~~~~~~~~IKs~~EI~~~R~A~~i~~~~~~~~~~~i~~G~se~el~~~~~~~~~~~g~~~~~~~~~~f~~~~~~s~N~~ 84 (273) ....++.|.||||.|+++||+||.|+.+++...+-.-|+...|..|.+.++..++.+|+. +..||+.++.|.|+. T Consensus 219 ~~li~~lRlIKSpaEl~~Mr~a~~I~sq~~~~~m~~sr~~~~E~~l~a~~eye~r~rGad-----~~AYpPVVAgG~na~ 293 (488) T KOG2414 219 SNLIERLRLIKSPAELELMREACNIASQTFSETMFGSRDFHNEAALSALLEYECRRRGAD-----RLAYPPVVAGGKNAN 293 (488) T ss_pred HHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHEEECCCC-----CCCCCCEEECCCCCC T ss_conf 999998770288899999999850425889999861468752566756410001335753-----103687441476652 Q ss_pred CCCCCCCCCCCCCCEEEEEEEEEEEEEECCCCCCCCCC-CCCCCCCHHHHHHHHHHHHHHHHCCCC--CCCCCHHHHHHH Q ss_conf 11121222200134057997876653000231112333-444432103677877654322110011--212210489999 Q gi|254780899|r 85 ICHGIPSNKQLREGDIVNVDVTYVVNGWHGDSSRMYPV-GKIKRAAERILQVTYESLYKGIAAVKL--NANIEDIGKAIQ 161 (273) Q Consensus 85 ~~H~~p~d~~l~~GD~v~iD~g~~~~GY~~D~~RT~~~-G~~~~~~~~l~~~~~~a~~~~i~~ikp--G~~~~ei~~a~~ 161 (273) ..||..+|..|+++|+|++|.||.++||++|++|||.+ |+.++.|++||++++..|+..|+.++| |.+.++|+...- T Consensus 294 tIHY~~Nnq~l~d~emVLvDaGcelgGYvSDITRTWP~sGkFs~~Qr~LYeavL~vq~ecik~c~~~~g~sL~~l~~~s~ 373 (488) T KOG2414 294 TIHYVRNNQLLKDDEMVLVDAGCELGGYVSDITRTWPISGKFSDAQRDLYEAVLQVQEECIKYCKPSNGTSLSQLFERSN 373 (488) T ss_pred EEEEEECCCCCCCCCEEEEECCCCCCCEECCCEECCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHH T ss_conf 68886245546788479994574458667144001678783270789999999999999998615889953999999999 Q ss_pred HHH----HHCC------------CEECCCCCCCCCCCCCCCCCEEEECCCCCCCCCCEECCCCEEEECCEEECCCCCCEE Q ss_conf 999----8629------------601047521233221135403653156454566451167378831114437776256 Q gi|254780899|r 162 RYA----HSER------------YSVVEVFCGHGIGKSFHEKPEILHFYDPLYPSVGTFQEGMVFTIEPMLNVGGSSAKV 225 (273) Q Consensus 162 ~~~----~~~g------------~~~~~~~~GHgiGl~~hE~P~i~~~~~~~~~~~~~le~GMV~aiEP~i~~~~~~~~~ 225 (273) +.+ .+-| ....+|++||-+|+++|+-|.+ +.+..|+||||||||||+|++... T Consensus 374 ~Ll~~~Lk~lGI~kt~~ee~~~~~klcPHhVgHyLGmDVHD~p~v--------~r~~pL~pg~ViTIEPGvYIP~d~--- 442 (488) T KOG2414 374 ELLGQELKELGIRKTDREEMIQAEKLCPHHVGHYLGMDVHDCPTV--------SRDIPLQPGMVITIEPGVYIPEDD--- 442 (488) T ss_pred HHHHHHHHHHCCCCCHHHHHHHHHHCCCCCCCHHCCCCCCCCCCC--------CCCCCCCCCCEEEECCCEECCCCC--- T ss_conf 999999998386541078887665308742210027643548888--------877658999659856850237657--- Q ss_pred CCCCCEEEECCCCEEEEEEEEEEECCCCCEECCCC-CCCCCC Q ss_conf 46886688337871678442189847985765267-988676 Q gi|254780899|r 226 LSDGWTAVTRDRSLSAQYEHTIGITKAGCEIFTLS-PNNLGQ 266 (273) Q Consensus 226 ~~d~w~~~~~~g~~~~~~Ed~vlVT~~G~E~LT~~-p~~l~~ 266 (273) .....-.+.|+||||+|+|+|||+|+||.. ||++.+ T Consensus 443 -----d~P~~FrGIGiRIEDDV~i~edg~evLT~a~pKei~~ 479 (488) T KOG2414 443 -----DPPEEFRGIGIRIEDDVAIGEDGPEVLTAACPKEIIE 479 (488) T ss_pred -----CCCHHHCCCEEEEECCEEECCCCCEEEHHCCCCCHHH T ss_conf -----8766763744786422675268751540005678899 No 24 >KOG2737 consensus Probab=100.00 E-value=3.3e-41 Score=295.38 Aligned_cols=255 Identities=18% Similarity=0.223 Sum_probs=206.4 Q ss_pred HHCCCEEECCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCC-CCCCCCCCCCCCCCCCCCCCCC Q ss_conf 313978853999999999999999999999999757998899999999999998598-3034433244455323446421 Q gi|254780899|r 7 RESGSINIYTPEELENIRSACNVVARCLDSLTPIIKPGTTTEEIDDFVLKFGMENNA-IPATLNYRGYKKSCCTSINHVI 85 (273) Q Consensus 7 ~~~~~~~IKs~~EI~~~R~A~~i~~~~~~~~~~~i~~G~se~el~~~~~~~~~~~g~-~~~~~~~~~f~~~~~~s~N~~~ 85 (273) .+..+|-|||+.||+.||.|++|.++++.++++.++||+.|.++.+.++.....+|+ +. .+|..++|+|.|+++ T Consensus 178 ~m~E~RviKs~~EieviRya~kISseaH~~vM~~~~pg~~Eyq~eslF~hh~y~~GGcRh-----~sYtcIc~sG~ns~v 252 (492) T KOG2737 178 ILAECRVIKSSLEIEVIRYANKISSEAHIEVMRAVRPGMKEYQLESLFLHHSYSYGGCRH-----LSYTCICASGDNSAV 252 (492) T ss_pred HHHHHEEECCHHHHHHHHHHHHHCCHHHHHHHHHCCCHHHHHHHHHHHHHHHHCCCCCCC-----CCCCEEEECCCCCCE T ss_conf 776662017799999999988606098999997479307677678888876541477011-----566405523887544 Q ss_pred CCC----CCCCCCCCCCEEEEEEEEEEEEEECCCCCCCCCC-CCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHH Q ss_conf 112----1222200134057997876653000231112333-44443210367787765432211001121221048999 Q gi|254780899|r 86 CHG----IPSNKQLREGDIVNVDVTYVVNGWHGDSSRMYPV-GKIKRAAERILQVTYESLYKGIAAVKLNANIEDIGKAI 160 (273) Q Consensus 86 ~H~----~p~d~~l~~GD~v~iD~g~~~~GY~~D~~RT~~~-G~~~~~~~~l~~~~~~a~~~~i~~ikpG~~~~ei~~a~ 160 (273) .|| .|+|+.||+||++++|.|+.|++|.+|+|++|.. |+.+++|+.+|++++++++++++++|||+.+-|++... T Consensus 253 LHYgha~apNd~~iqdgd~cLfDmGaey~~yaSDITcsFP~nGKFTadqk~VYnaVLda~navm~a~KpGv~W~Dmh~La 332 (492) T KOG2737 253 LHYGHAGAPNDRTIQDGDLCLFDMGAEYHFYASDITCSFPVNGKFTADQKLVYNAVLDASNAVMEAMKPGVWWVDMHKLA 332 (492) T ss_pred EECCCCCCCCCCCCCCCCEEEEECCCCEEEEECCCCEECCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHH T ss_conf 43145689987636789789984684103330354152468986435689999999998899998528998654278887 Q ss_pred HHHHHH----CC-------------CE--ECCCCCCCCCCCCCCCCCE-EEECCCCCC------CCCCEECCCCEEEECC Q ss_conf 999986----29-------------60--1047521233221135403-653156454------5664511673788311 Q gi|254780899|r 161 QRYAHS----ER-------------YS--VVEVFCGHGIGKSFHEKPE-ILHFYDPLY------PSVGTFQEGMVFTIEP 214 (273) Q Consensus 161 ~~~~~~----~g-------------~~--~~~~~~GHgiGl~~hE~P~-i~~~~~~~~------~~~~~le~GMV~aiEP 214 (273) .+++-+ .| .+ -.+|..||=+|+++|+-.- ..+...|.. ...+.|++|||+|||| T Consensus 333 ~kvlle~laq~gIl~gdvd~m~~ar~~~vF~PHGLGH~lGlDvHDvGGyp~~~~rp~~P~l~~LR~aR~L~e~MviTvEP 412 (492) T KOG2737 333 EKVLLEHLAQMGILKGDVDEMVEARLGAVFMPHGLGHFLGLDVHDVGGYPEGVERPDEPGLRSLRTARHLKEGMVITVEP 412 (492) T ss_pred HHHHHHHHHHCCCEECCHHHHHHHCCCEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCEEEECC T ss_conf 87999999864834234899998514706435663311254310257998889998860466654444530583798348 Q ss_pred EEECCCCCCE-ECCCCCE-------EEE-CCCCEEEEEEEEEEECCCCCEECCCCCCCCCC Q ss_conf 1443777625-6468866-------883-37871678442189847985765267988676 Q gi|254780899|r 215 MLNVGGSSAK-VLSDGWT-------AVT-RDRSLSAQYEHTIGITKAGCEIFTLSPNNLGQ 266 (273) Q Consensus 215 ~i~~~~~~~~-~~~d~w~-------~~~-~~g~~~~~~Ed~vlVT~~G~E~LT~~p~~l~~ 266 (273) |+|+-.+-.. .+.|.-+ ++. --+.-||||||+|+||++|+|+||..|++.++ T Consensus 413 GcYFi~~Ll~ealadp~~~~f~n~e~~~rfr~~GGVRIEdDv~vt~~G~enlt~vprtvee 473 (492) T KOG2737 413 GCYFIDFLLDEALADPARAEFLNREVLQRFRGFGGVRIEDDVVVTKSGIENLTCVPRTVEE 473 (492) T ss_pred CHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHCCCCEEEECCEEEECCCCCCCCCCCCCHHH T ss_conf 7309999988885481765563299999864468667532479942651000379999999 No 25 >TIGR00501 met_pdase_II methionine aminopeptidase, type II; InterPro: IPR002468 Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site . The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases . Peptidases are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry. Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. This group of metallopeptidases belong to MEROPS peptidase family M24 (clan MG), subfamily M24A. Methionine aminopeptidase (3.4.11.18 from EC) (MAP) is responsible for the removal of the amino-terminal (initiator) methionine from nascent eukaryotic cytosolic and cytoplasmic prokaryotic proteins if the penultimate amino acid is small and uncharged. All MAP studied to date are monomeric proteins that require cobalt ions for activity. Two subfamilies of MAP enzymes are known to exist , . While being evolutionary related, they only share a limited amount of sequence similarity mostly clustered around the residues shown, in the Escherichia coli MAP , to be involved in cobalt-binding. The first family consists of enzymes from prokaryotes as well as eukaryotic MAP-1, while the second group (IPR002467 from INTERPRO) is made up of archaeal MAP and eukaryotic MAP-2 and includes proteins which do not seem to be MAP, but that are clearly evolutionary related such as mouse proliferation-associated protein 1 and fission yeast curved DNA-binding protein. ; GO: 0004239 methionyl aminopeptidase activity, 0006508 proteolysis. Probab=100.00 E-value=4.7e-38 Score=274.30 Aligned_cols=236 Identities=24% Similarity=0.383 Sum_probs=201.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHC-CCC-------CCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 99999999999999999999997579988999999999999985-983-------0344332444553234464211121 Q gi|254780899|r 18 EELENIRSACNVVARCLDSLTPIIKPGTTTEEIDDFVLKFGMEN-NAI-------PATLNYRGYKKSCCTSINHVICHGI 89 (273) Q Consensus 18 ~EI~~~R~A~~i~~~~~~~~~~~i~~G~se~el~~~~~~~~~~~-g~~-------~~~~~~~~f~~~~~~s~N~~~~H~~ 89 (273) +.+.-+|+.++|..++-+++.+.|+||+.-.||+.+++..+++. |+. |...| .+||. ..|+|++.+||. T Consensus 6 E~~nD~~aGakI~~~vr~ea~~~I~PG~klleva~fvEnri~e~tGa~nllameDPksqG-iAFP~--n~S~N~~AAHfT 82 (327) T TIGR00501 6 EKLNDIEAGAKIVSKVRREAADRIVPGVKLLEVAEFVENRIRELTGAENLLAMEDPKSQG-IAFPC--NISINEVAAHFT 82 (327) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHCCCCCHHHCCCCCCCC-CCCCC--CCCHHCCCCCCC T ss_conf 446777744799999999998526788612122121210121220641001103863255-22556--751213010168 Q ss_pred C---CCCCCCCCEEEEEEEEEEEEEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHH Q ss_conf 2---2220013405799787665300023111233344443210367787765432211001121221048999999986 Q gi|254780899|r 90 P---SNKQLREGDIVNVDVTYVVNGWHGDSSRMYPVGKIKRAAERILQVTYESLYKGIAAVKLNANIEDIGKAIQRYAHS 166 (273) Q Consensus 90 p---~d~~l~~GD~v~iD~g~~~~GY~~D~~RT~~~G~~~~~~~~l~~~~~~a~~~~i~~ikpG~~~~ei~~a~~~~~~~ 166 (273) | .++.|++||+|++|+|++++||.+|.+-|+-+|+.+++..+|.+|+++|+.++|+.+++|+..+||..++++++++ T Consensus 83 p~~gD~~~~k~~DvvKlD~GahvdGyIaD~A~TV~l~~~ddky~~L~~A~~dAl~~~i~~~~~~v~v~eiG~~I~e~i~s 162 (327) T TIGR00501 83 PKAGDETVFKDGDVVKLDLGAHVDGYIADTAITVDLGDQDDKYKELVKAAKDALYTAIKEIRAGVEVGEIGKAIEEVIES 162 (327) T ss_pred CCCCCCCCCCCCCEEEEEECCEECCEEEEEEEEEECCCCCHHHHHHHHHHHHHHHHHHHHCCCCCEEECCCHHHHHHHCC T ss_conf 68888631277876887522066557870478850688701589999999999887775006981260020776653221 Q ss_pred ---------CCCEECCCCCCCCCCCC-CCCCCEEEECCCCCCCCCCEECCCCEEEECCEEECCCCCCEECC--------- Q ss_conf ---------29601047521233221-13540365315645456645116737883111443777625646--------- Q gi|254780899|r 167 ---------ERYSVVEVFCGHGIGKS-FHEKPEILHFYDPLYPSVGTFQEGMVFTIEPMLNVGGSSAKVLS--------- 227 (273) Q Consensus 167 ---------~g~~~~~~~~GHgiGl~-~hE~P~i~~~~~~~~~~~~~le~GMV~aiEP~i~~~~~~~~~~~--------- 227 (273) +||+++.++|||+|-.. +|-++.|++... ++...|++||+||||||-+.|.+.+.--. T Consensus 163 yeveinGetYg~kP~~NL~GHs~~rY~~H~G~SiPn~~~---~~~~~leeG~~~AIEpFAt~G~G~V~~~~~~~iY~~l~ 239 (327) T TIGR00501 163 YEVEINGETYGVKPISNLTGHSMARYLLHAGLSIPNVKE---RDTTKLEEGDVVAIEPFATDGVGLVTDGGEVEIYKFLA 239 (327) T ss_pred EEEEECCCCCCCCCCCCCCCCCCCHHHHCCCCCCEEEEC---CCCCEEECCCEEEECCCCCCCEEEEECCCCCHHHHHHC T ss_conf 357765720143342227356400154358950003427---98745102578884366768745651288610122105 Q ss_pred -------------------------------------------------------CCCEEEE-CCCCEEEEEEEEEEECC Q ss_conf -------------------------------------------------------8866883-37871678442189847 Q gi|254780899|r 228 -------------------------------------------------------DGWTAVT-RDRSLSAQYEHTIGITK 251 (273) Q Consensus 228 -------------------------------------------------------d~w~~~~-~~g~~~~~~Ed~vlVT~ 251 (273) .++.++. ..|++.+||||||+|++ T Consensus 240 ~DrpvrPtL~~Ar~lL~~i~~nY~~LPFA~RwL~~~~~~Kr~~~~l~~L~r~~~~~~YPvL~e~~G~~vaQ~EHTilv~~ 319 (327) T TIGR00501 240 EDRPVRPTLKSARKLLKKIEENYKTLPFAKRWLEKLEDEKRLRLALNKLIREGVIYDYPVLKEKSGGLVAQWEHTILVEE 319 (327) T ss_pred CCCCCCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCEEEEEEEEEEEEC T ss_conf 68896733589999876786640788247889752333026899999988733334788752478874523245889964 Q ss_pred CCCEECCC Q ss_conf 98576526 Q gi|254780899|r 252 AGCEIFTL 259 (273) Q Consensus 252 ~G~E~LT~ 259 (273) +|+|+.|+ T Consensus 320 ~g~~v~tk 327 (327) T TIGR00501 320 DGVEVTTK 327 (327) T ss_pred CCEEEEEC T ss_conf 83468509 No 26 >cd01091 CDC68-like Related to aminopeptidase P and aminopeptidase M, a member of this domain family is present in cell division control protein 68, a transcription factor. Probab=100.00 E-value=2.7e-37 Score=269.25 Aligned_cols=225 Identities=17% Similarity=0.207 Sum_probs=181.9 Q ss_pred HHHHHHHHHHHHHHHHH-----HHHHCCCC--CCHHHHHHHHHHHHHHCC-----CCCCCCCCCCCCCCCCCCCC-CCCC Q ss_conf 99999999999999999-----99975799--889999999999999859-----83034433244455323446-4211 Q gi|254780899|r 20 LENIRSACNVVARCLDS-----LTPIIKPG--TTTEEIDDFVLKFGMENN-----AIPATLNYRGYKKSCCTSIN-HVIC 86 (273) Q Consensus 20 I~~~R~A~~i~~~~~~~-----~~~~i~~G--~se~el~~~~~~~~~~~g-----~~~~~~~~~~f~~~~~~s~N-~~~~ 86 (273) |..||+|++++..+|.. +...+-.+ +|-..|...++..+.... ..+... -..|++++.+|-| +..+ T Consensus 1 l~~ik~as~~s~~~~~~~~~~~~~~iiD~e~k~~h~~l~~~~e~~i~d~kk~~~~~d~~~v-d~~Y~PIVqSGg~y~~~~ 79 (243) T cd01091 1 LNNIKKASDATVDVLKKFFVDEVEEIIDQEKKVTHSKLSDKVEKAIEDKKKYKAKLDPEQL-DWCYPPIIQSGGNYDLLK 79 (243) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEHHHHHHHHHHHHHCCHHCCCCCCHHHC-CEECCCCEEECCCCCCCC T ss_conf 9378999999999999999999998840121213999999999986270220378997887-532456414887623575 Q ss_pred CCCCCCCCCCCCEEEEEEEEEEEEEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHH Q ss_conf 12122220013405799787665300023111233344443210367787765432211001121221048999999986 Q gi|254780899|r 87 HGIPSNKQLREGDIVNVDVTYVVNGWHGDSSRMYPVGKIKRAAERILQVTYESLYKGIAAVKLNANIEDIGKAIQRYAHS 166 (273) Q Consensus 87 H~~p~d~~l~~GD~v~iD~g~~~~GY~~D~~RT~~~G~~~~~~~~l~~~~~~a~~~~i~~ikpG~~~~ei~~a~~~~~~~ 166 (273) |..++|+.+..+++|++|+|+.|+|||||++|||++ +|+++|+++|+.++++|+++++++|||+++++|+.++++++++ T Consensus 80 ~~~~~d~~~~~~g~Il~d~G~rYk~YcSditRT~~v-~pt~~q~~~Y~~vl~aq~~~i~~~kpG~~~~~v~~~a~~~I~~ 158 (243) T cd01091 80 SSSSSDKLLYHFGVIICSLGARYKSYCSNIARTFLI-DPTSEQQKNYNFLLALQEEILKELKPGAKLSDVYQKTLDYIKK 158 (243) T ss_pred CCCCCCCCCCCCCEEEEECCCCCCCEECCCEEEEEE-CCCHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHH T ss_conf 778875423779989998691357780265658985-7999999999999999999999972699689999999999998 Q ss_pred CC--CE-ECCCCCCCCCCCCCCCCCEEEECCCCCCCCCCEECCCCEEEECCEEE-CCCCCCEECCCCCEEEECCCCEEEE Q ss_conf 29--60-10475212332211354036531564545664511673788311144-3777625646886688337871678 Q gi|254780899|r 167 ER--YS-VVEVFCGHGIGKSFHEKPEILHFYDPLYPSVGTFQEGMVFTIEPMLN-VGGSSAKVLSDGWTAVTRDRSLSAQ 242 (273) Q Consensus 167 ~g--~~-~~~~~~GHgiGl~~hE~P~i~~~~~~~~~~~~~le~GMV~aiEP~i~-~~~~~~~~~~d~w~~~~~~g~~~~~ 242 (273) .+ |. .+.|.+|||+|+++||.|.+.+ ..++.+|++||||+|+||++ +..... .....+.++++ T Consensus 159 ~~~~l~~~f~h~~GhgiGlE~hE~~~~l~-----~kn~~~lk~GMvftV~~Gf~nl~~~~~--------~~~~~k~yal~ 225 (243) T cd01091 159 KKPELEPNFTKNLGFGIGLEFRESSLIIN-----AKNDRKLKKGMVFNLSIGFSNLQNPEP--------KDKESKTYALL 225 (243) T ss_pred HHHHHHHHCCCCCCCEECEEECCCCCCCC-----CCCCEECCCCCEEEEECCEEECCCCCC--------CCCCCCEEEEE T ss_conf 77888725766762322557524665448-----997404179978999655055568766--------67667736999 Q ss_pred EEEEEEECCCCC-EECCC Q ss_conf 442189847985-76526 Q gi|254780899|r 243 YEHTIGITKAGC-EIFTL 259 (273) Q Consensus 243 ~Ed~vlVT~~G~-E~LT~ 259 (273) ++|||+||++|+ .+||. T Consensus 226 I~DtV~Vte~gp~~vlT~ 243 (243) T cd01091 226 LSDTILVTEDEPAIVLTN 243 (243) T ss_pred EEEEEEECCCCCCEECCC T ss_conf 964899858998064379 No 27 >PTZ00053 methionine aminopeptidase II; Provisional Probab=100.00 E-value=2.2e-34 Score=249.84 Aligned_cols=239 Identities=23% Similarity=0.293 Sum_probs=192.9 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC---CC Q ss_conf 999999999999999999999999757998899999999999998598303443324445532344642111212---22 Q gi|254780899|r 16 TPEELENIRSACNVVARCLDSLTPIIKPGTTTEEIDDFVLKFGMENNAIPATLNYRGYKKSCCTSINHVICHGIP---SN 92 (273) Q Consensus 16 s~~EI~~~R~A~~i~~~~~~~~~~~i~~G~se~el~~~~~~~~~~~g~~~~~~~~~~f~~~~~~s~N~~~~H~~p---~d 92 (273) ..+-+.-+|+|+++..++-+.+...|||||+..||+..++...+..-.......-.+|| +|.|+|++++||.| .+ T Consensus 118 ~~~~~~~~r~aae~hr~vr~~~q~~ikpg~~~~~i~~~ie~~~r~l~~~~~l~~GiAFP--t~iSiN~~AAHYTPn~gD~ 195 (435) T PTZ00053 118 SEQQYQDLRKAAEVHRQVRRYAQSFIKPGISLIDMTDRIEKKTKELIEKDGLKRGWAFP--TGCSLNHCAAHYTPNTGDK 195 (435) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCCCC--CCCCCCCEEEECCCCCCCC T ss_conf 48999999999999999999999865799829999999999999985245612450046--6454101013158898986 Q ss_pred CCCCCCEEEEEEEEEEEEEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHC----- Q ss_conf 200134057997876653000231112333444432103677877654322110011212210489999999862----- Q gi|254780899|r 93 KQLREGDIVNVDVTYVVNGWHGDSSRMYPVGKIKRAAERILQVTYESLYKGIAAVKLNANIEDIGKAIQRYAHSE----- 167 (273) Q Consensus 93 ~~l~~GD~v~iD~g~~~~GY~~D~~RT~~~G~~~~~~~~l~~~~~~a~~~~i~~ikpG~~~~ei~~a~~~~~~~~----- 167 (273) ++|+.||+|+||+|++++||.+|.+.|+.++ +....|.+++++|.+++++.++||++.+||.++++++++++ T Consensus 196 ~vL~~gDVVKID~GvHVDGyIaDsAfTV~f~---~kyd~Ll~A~keAt~taIk~agidvrl~eIG~aIqEvieSyevei~ 272 (435) T PTZ00053 196 TVLTYDDVMKVDFGTHVNGRIIDCAFTVAFN---EKYDPLLEATKEATNEGIKQAGIDARLCDIGEAIQEVIESYEVELN 272 (435) T ss_pred EEECCCCEEEEEEEEEECCEEEEEEEEEECC---CCHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHEEEEC T ss_conf 1742799899974111363799789998758---7537899999999999999851686188899999999975234433 Q ss_pred ----CCEECCCCCCCCCCCC-CCCCCEEEECCCCCCCCCCEECCCCEEEECCEEECCCCCCEECC---------C----- Q ss_conf ----9601047521233221-13540365315645456645116737883111443777625646---------8----- Q gi|254780899|r 168 ----RYSVVEVFCGHGIGKS-FHEKPEILHFYDPLYPSVGTFQEGMVFTIEPMLNVGGSSAKVLS---------D----- 228 (273) Q Consensus 168 ----g~~~~~~~~GHgiGl~-~hE~P~i~~~~~~~~~~~~~le~GMV~aiEP~i~~~~~~~~~~~---------d----- 228 (273) ++..+++.+||+|+.. +|..+.|++.... +....||+|+||||||+.+.|.+.+.-.. + T Consensus 273 gk~y~vKPI~NL~GH~I~~Y~IHggksIP~v~~~--~~~k~mEEGEVyAIEtF~STG~G~V~e~~~~s~Y~~~~~~~~~~ 350 (435) T PTZ00053 273 GKVYPVKSIRNLTGHNIGPYIIHSGKSVPIVRGG--EATKIMEEGELFAIETFASTGRGFVNEDMECSHYMMNPGAEYVP 350 (435) T ss_pred CCCCCCCCCCCCCCCCCCCEEECCCCEECCCCCC--CCCCCCCCCCEEEEEEEECCCCCEEECCCCEEEEEECCCCCCCC T ss_conf 6331674046777665302351168553211688--65622368966999666227864574388515999777877565 Q ss_pred ------------------------CC-------------------------E-EEECCCCEEEEEEEEEEECCCCCEECC Q ss_conf ------------------------86-------------------------6-883378716784421898479857652 Q gi|254780899|r 229 ------------------------GW-------------------------T-AVTRDRSLSAQYEHTIGITKAGCEIFT 258 (273) Q Consensus 229 ------------------------~w-------------------------~-~~~~~g~~~~~~Ed~vlVT~~G~E~LT 258 (273) .| . ++...|++.+|+||||++++++.|+|| T Consensus 351 lrl~~aR~lL~~I~~nF~TLPF~~RwLd~~~~~r~~l~L~~Lv~~GiV~~YPpL~d~~G~yVAQfEHTIlL~p~~kEVis 430 (435) T PTZ00053 351 LRLESAQELLKHINKTFSTLAFCRRWLDDDGFDRHLMNLNQLVDAGAVNPYPPLCDVKGSYTSQMEHTILLRPTCKEVLS 430 (435) T ss_pred CCCHHHHHHHHHHHHHCCCCCCCHHHHHHHCCHHHHHHHHHHHHCCCCCCCCCCCCCCCCEEEEEEEEEEECCCCCEEEC T ss_conf 66789999999999877988764777754231268999999987798523896072699988775559998799607607 Q ss_pred CCC Q ss_conf 679 Q gi|254780899|r 259 LSP 261 (273) Q Consensus 259 ~~p 261 (273) +-. T Consensus 431 rgd 433 (435) T PTZ00053 431 RGD 433 (435) T ss_pred CCC T ss_conf 889 No 28 >TIGR00495 crvDNA_42K DNA-binding protein, 42 kDa; InterPro: IPR004545 Proteins of this family have been identified in a number of species as a nuclear (but not nucleolar) protein with a cell cycle dependence. Various names given to members of this family have included cell cycle protein p38-2G4, DNA-binding protein GBP16, and proliferation-associated protein 1. The proteins are closely related to methionine aminopeptidase, a cobalt-binding protein. This group sequences are classified as non-peptidase homologues in MEROPS peptidase family M24 (clan MG). . Probab=99.96 E-value=5.4e-29 Score=213.86 Aligned_cols=265 Identities=18% Similarity=0.264 Sum_probs=219.2 Q ss_pred CCCCHHHHCCCE--EECCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCC---CCCC-CCCCCC Q ss_conf 977203313978--8539999999999999999999999997579988999999999999985983---0344-332444 Q gi|254780899|r 1 MLSSSSRESGSI--NIYTPEELENIRSACNVVARCLDSLTPIIKPGTTTEEIDDFVLKFGMENNAI---PATL-NYRGYK 74 (273) Q Consensus 1 ~~~~~~~~~~~~--~IKs~~EI~~~R~A~~i~~~~~~~~~~~i~~G~se~el~~~~~~~~~~~g~~---~~~~-~~~~f~ 74 (273) |...+++.+... ++-+|.-+..++.|+.|+..+++.+.+.+.||+.-.+|+....+++.+.-+. .+-. -++|-- T Consensus 1 Ms~~ee~~e~~~dy~Ls~P~vV~kYk~AG~i~n~vlk~vv~~c~~ga~v~diC~~GD~~i~e~~~k~y~~s~K~~~kGiA 80 (407) T TIGR00495 1 MSTKEEASEKAVDYSLSNPEVVTKYKIAGEIANNVLKKVVEKCKPGAKVVDICEKGDEFIEEETAKVYKKSKKELEKGIA 80 (407) T ss_pred CCCCCCCCHHHHHCCCCCCCEECHHHHHHHHHHHHHHHHHHHCCCCCEEEECCHHHHHHHHHHHHHHHCCCCCCCCCCCC T ss_conf 98520022133210157883121022346899999999998607984899602200589999987541023000115732 Q ss_pred CCCCCCCCCCCCCCCC----C---CCCCCCCEEEEEEEEEEEEEECCCCCCCCCCCC-----CCCCCHHHHHHHHHHHHH Q ss_conf 5532344642111212----2---220013405799787665300023111233344-----443210367787765432 Q gi|254780899|r 75 KSCCTSINHVICHGIP----S---NKQLREGDIVNVDVTYVVNGWHGDSSRMYPVGK-----IKRAAERILQVTYESLYK 142 (273) Q Consensus 75 ~~~~~s~N~~~~H~~p----~---d~~l~~GD~v~iD~g~~~~GY~~D~~RT~~~G~-----~~~~~~~l~~~~~~a~~~ 142 (273) +++|+|+|++++|..| . |+.|+.||+|+||+|||+|||.+-.+.|+++|. .+.+..+...|+..|+++ T Consensus 81 FPT~vSvNn~vgHfSPLkSd~eand~~Lk~GD~VKIdLG~HiDGFiA~vAhT~vig~~~~~~~tG~~ADvI~AA~~Al~a 160 (407) T TIGR00495 81 FPTCVSVNNCVGHFSPLKSDEEANDLVLKEGDVVKIDLGCHIDGFIALVAHTIVIGVASEEPVTGRKADVIAAAHLALEA 160 (407) T ss_pred CCCCCCCCCCHHHCCCCCCCCCHHHHHHHCCCEEEEECCCCEEEEEECCEEEEEEEECCCCCCCCHHHHHHHHHHHHHHH T ss_conf 47641703603326887888313344533387479813664403564150789985145876557067899999999998 Q ss_pred HHHCCCCCCCCCHHHHHHHHHHHHCCCEECCCCCCCCCCCCCCCCC-EEEECCCCC---CCCCCEECCCCEEEECCEEEC Q ss_conf 2110011212210489999999862960104752123322113540-365315645---456645116737883111443 Q gi|254780899|r 143 GIAAVKLNANIEDIGKAIQRYAHSERYSVVEVFCGHGIGKSFHEKP-EILHFYDPL---YPSVGTFQEGMVFTIEPMLNV 218 (273) Q Consensus 143 ~i~~ikpG~~~~ei~~a~~~~~~~~g~~~~~~~~GHgiGl~~hE~P-~i~~~~~~~---~~~~~~le~GMV~aiEP~i~~ 218 (273) |+..+|||++..+|.++++++...+|+..+..-+.|.+-..+.++- .|...+++. .-+...||+|.||+|+..+.. T Consensus 161 AlRlvkPG~~n~~VT~~~~KvA~~Y~c~~v~G~LSHqlkr~vIDG~K~v~~~~~d~Q~~~~~~~eFee~EVYavDil~St 240 (407) T TIGR00495 161 ALRLVKPGKTNTQVTEAIEKVADSYGCKPVEGMLSHQLKRHVIDGEKVVISNPSDSQKKEVDTVEFEENEVYAVDILVST 240 (407) T ss_pred HHHHCCCCCCCCHHHHHHHHHHHHCCCEEEEEECCCCCEEEEECCCCEEEECCCCCCCCCCCEEEEEECCEEEEEEEEEC T ss_conf 87404887987147889999973249846532231022224866722797368820001122266420118899999973 Q ss_pred C-CCCCEECCCC--------------------------------------------------------------CEEEE- Q ss_conf 7-7762564688--------------------------------------------------------------66883- Q gi|254780899|r 219 G-GSSAKVLSDG--------------------------------------------------------------WTAVT- 234 (273) Q Consensus 219 ~-~~~~~~~~d~--------------------------------------------------------------w~~~~- 234 (273) + ++.....++. +.|++ T Consensus 241 gP~GK~k~~~~~qttIYkk~~d~~Y~LK~kASR~~~sei~~~F~~~PF~~R~~~~ek~a~lGlvEc~~H~~L~pY~VLyd 320 (407) T TIGR00495 241 GPEGKAKEEDERQTTIYKKDEDSTYKLKMKASRKFYSEIKKRFSAMPFTLRNFEFEKRARLGLVECVEHKLLQPYPVLYD 320 (407) T ss_pred CCCCCCCCCCCCCCCCEECCCCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCEEEECCCCCCCCCCCCC T ss_conf 88975442014688543128886635355520356556752277868761456600000266345503771266243122 Q ss_pred CCCCEEEEEEEEEEECCCCCEECCCCCCCCC Q ss_conf 3787167844218984798576526798867 Q gi|254780899|r 235 RDRSLSAQYEHTIGITKAGCEIFTLSPNNLG 265 (273) Q Consensus 235 ~~g~~~~~~Ed~vlVT~~G~E~LT~~p~~l~ 265 (273) ++|.+.+||--|||++++|...||..+-.++ T Consensus 321 k~G~~Vaqfk~Tvll~pnG~~~l~~~~~~~~ 351 (407) T TIGR00495 321 KDGEFVAQFKFTVLLMPNGSIRLTSSEFKED 351 (407) T ss_pred CCCCEEEEEEEEEEECCCCCEEECCCCCCCC T ss_conf 7998899888788853798654348888855 No 29 >TIGR02993 ectoine_eutD ectoine utilization protein EutD; InterPro: IPR014335 Members of this entry are putative peptidases, or non-peptidase homologues that belong to the MEROPS peptidase family M24 (clan MG) or hydrolases similar to Xaa-Pro aminopeptidase. They belong to ectoine utilisation operons, as found in Mesorhizobium loti, Silicibacter pomeroyi, Agrobacterium tumefaciens, Pseudomonas putida and Sinorhizobium meliloti 1021, where it is known to be induced by ectoine. The exact function is unknown.. Probab=99.96 E-value=1.1e-28 Score=211.75 Aligned_cols=239 Identities=19% Similarity=0.198 Sum_probs=205.1 Q ss_pred CCCHHHHCCCEEECCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCCCCCCCC-CCCCCCC--- Q ss_conf 772033139788539999999999999999999999997579988999999999999985983034433-2444553--- Q gi|254780899|r 2 LSSSSRESGSINIYTPEELENIRSACNVVARCLDSLTPIIKPGTTTEEIDDFVLKFGMENNAIPATLNY-RGYKKSC--- 77 (273) Q Consensus 2 ~~~~~~~~~~~~IKs~~EI~~~R~A~~i~~~~~~~~~~~i~~G~se~el~~~~~~~~~~~g~~~~~~~~-~~f~~~~--- 77 (273) .++....+=.|.|||+.||++||+|++|++.+++.+.+.+.+||...+|.+.+......... ++ ..||.++ T Consensus 146 ~~a~~lvnWqravks~~e~~ymr~a~~ive~mh~ri~~~~e~Gmrk~~lva~iy~~G~~~~~-----~~GGdyPaivPll 220 (391) T TIGR02993 146 KDATALVNWQRAVKSETELKYMRKAAKIVEKMHQRILERVEPGMRKCDLVADIYDAGIRGVD-----GFGGDYPAIVPLL 220 (391) T ss_pred HHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCCC-----CCCCCCCCCEECC T ss_conf 34567676676513378899999889999999988887630252046789998751011012-----3578664100025 Q ss_pred CCCCCCCCCCCCCCCCCCCCCEEEEEEEEEEEEEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCHHH Q ss_conf 23446421112122220013405799787665300023111233344443210367787765432211001121221048 Q gi|254780899|r 78 CTSINHVICHGIPSNKQLREGDIVNVDVTYVVNGWHGDSSRMYPVGKIKRAAERILQVTYESLYKGIAAVKLNANIEDIG 157 (273) Q Consensus 78 ~~s~N~~~~H~~p~d~~l~~GD~v~iD~g~~~~GY~~D~~RT~~~G~~~~~~~~l~~~~~~a~~~~i~~ikpG~~~~ei~ 157 (273) .+|.++..||..+.|++++.|+--++++.+.|+.||+.++||+++|+|+....+.-+++.+.+++++++.|||+++.||. T Consensus 221 P~G~~a~aPhltWdd~P~~~G~GtffeiaG~~~ryh~PlsrtvflGkP~q~fld~eka~~eG~~aGl~~a~~Gnt~~dia 300 (391) T TIGR02993 221 PSGADASAPHLTWDDKPLKVGEGTFFEIAGVYKRYHCPLSRTVFLGKPTQAFLDAEKAVLEGMEAGLEAAKPGNTCEDIA 300 (391) T ss_pred CCCCCCCCCCCCCCCCCCCCCCCCEEEHHHHHHHCCCCCCCEEEECCCCHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHH T ss_conf 67743146533226762204785201010112221465200466158603444123677887899898512787478999 Q ss_pred HHHHHHHHHCCCEECCCCCCCCCCCCCCCCCEEEEC-CCCCCCCCCEECCCCEEEECCEEECCCCCCEECCCCCEEEECC Q ss_conf 999999986296010475212332211354036531-5645456645116737883111443777625646886688337 Q gi|254780899|r 158 KAIQRYAHSERYSVVEVFCGHGIGKSFHEKPEILHF-YDPLYPSVGTFQEGMVFTIEPMLNVGGSSAKVLSDGWTAVTRD 236 (273) Q Consensus 158 ~a~~~~~~~~g~~~~~~~~GHgiGl~~hE~P~i~~~-~~~~~~~~~~le~GMV~aiEP~i~~~~~~~~~~~d~w~~~~~~ 236 (273) .+.-.++++++.. -..++|+.||++. +|.|... -+..+++...|++||+|.+-+++++...|.. T Consensus 301 ~~ff~vl~~y~i~-kd~r~GyPiG~sy--PPdWGertmslrP~d~~~l~~~mtfhfm~GlW~~d~G~e------------ 365 (391) T TIGR02993 301 KAFFAVLKKYGIE-KDSRTGYPIGLSY--PPDWGERTMSLRPGDKTVLKPGMTFHFMTGLWLEDMGLE------------ 365 (391) T ss_pred HHHHHHHHHHCCC-CCCCCCCCEECCC--CCCCCCCEEEECCCCCCHHCCCCEEEEECCCCHHHCCCC------------ T ss_conf 9999999860640-0365665111136--887465203325775210024650442134201003642------------ Q ss_pred CCEEEEEEEEEEECCCCCEECCCCCCCCCC Q ss_conf 871678442189847985765267988676 Q gi|254780899|r 237 RSLSAQYEHTIGITKAGCEIFTLSPNNLGQ 266 (273) Q Consensus 237 g~~~~~~Ed~vlVT~~G~E~LT~~p~~l~~ 266 (273) +-+.|.||++|.|.|+..||.|+- T Consensus 366 ------~tesi~i~~~G~e~l~~~Pr~l~v 389 (391) T TIGR02993 366 ------ITESILITETGVECLSSVPRKLFV 389 (391) T ss_pred ------EEEEEEECCCCCHHHHCCCCEEEE T ss_conf ------000134303340445407720131 No 30 >KOG2413 consensus Probab=99.95 E-value=9.4e-28 Score=205.55 Aligned_cols=233 Identities=21% Similarity=0.252 Sum_probs=182.3 Q ss_pred HHHCCCEEECCHHHHHHHHHHHHHHHH-HHHH----HHHHCCCC--CCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCC Q ss_conf 331397885399999999999999999-9999----99975799--8899999999999998598303443324445532 Q gi|254780899|r 6 SRESGSINIYTPEELENIRSACNVVAR-CLDS----LTPIIKPG--TTTEEIDDFVLKFGMENNAIPATLNYRGYKKSCC 78 (273) Q Consensus 6 ~~~~~~~~IKs~~EI~~~R~A~~i~~~-~~~~----~~~~i~~G--~se~el~~~~~~~~~~~g~~~~~~~~~~f~~~~~ 78 (273) +....++.||+..|++.||.+- |-|. ++.+ +-+.+.-| ++|.+++..++++-.++... .+ .+|+++.. T Consensus 299 Spi~~~kAiKN~~E~~gmr~sh-irD~~Alve~~~wle~~~~~g~~itE~~~A~kle~fR~~~~~f---mg-lSFeTIS~ 373 (606) T KOG2413 299 SPISRAKAIKNDDELKGMRNSH-IRDGAALVEYFAWLEKELHKGYTITEYDAADKLEEFRSRQDHF---MG-LSFETISS 373 (606) T ss_pred CHHHHHHHHCCHHHHHHHHHCC-HHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHCCC---CC-CCCCEEEC T ss_conf 7889999855768763556531-0358999999998766530376204344888999998764424---57-67100224 Q ss_pred C-CCCCCCCCCCC---CCCCCCCCEEEEEEEEEEEEEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCC-CCCCCC Q ss_conf 3-44642111212---222001340579978766530002311123334444321036778776543221100-112122 Q gi|254780899|r 79 T-SINHVICHGIP---SNKQLREGDIVNVDVTYVVNGWHGDSSRMYPVGKIKRAAERILQVTYESLYKGIAAV-KLNANI 153 (273) Q Consensus 79 ~-s~N~~~~H~~p---~d~~l~~GD~v~iD~g~~~~GY~~D~~RT~~~G~~~~~~~~l~~~~~~a~~~~i~~i-kpG~~~ 153 (273) + |.|++++||.| +++++.+..+.++|.|+.|.--.+|+|||+.+|+|+.++++.|..++...-+...++ ..|... T Consensus 374 s~G~NgAviHYsP~~e~n~~i~~~kiyL~DSGaQY~DGTTDvTRT~HfgePs~eek~~yT~VLkGhi~la~~vFP~~t~g 453 (606) T KOG2413 374 SVGPNGAVIHYSPPAETNRIVSPDKIYLCDSGAQYLDGTTDVTRTVHFGEPTAEEKEAYTLVLKGHIALARAVFPKGTKG 453 (606) T ss_pred CCCCCCEEEECCCCCCCCCEECCCEEEEECCCCCCCCCCCCEEEEEECCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCC T ss_conf 67888503520798545632067637998068642468654247786699988999999999874367652116899985 Q ss_pred CHHHHHHHHHHHHCCCEECCCCCCCCCC--CCCCCCCEEEECCCCCCCCCCEECCCCEEEECCEEECCCCCCEECCCCCE Q ss_conf 1048999999986296010475212332--21135403653156454566451167378831114437776256468866 Q gi|254780899|r 154 EDIGKAIQRYAHSERYSVVEVFCGHGIG--KSFHEKPEILHFYDPLYPSVGTFQEGMVFTIEPMLNVGGSSAKVLSDGWT 231 (273) Q Consensus 154 ~ei~~a~~~~~~~~g~~~~~~~~GHgiG--l~~hE~P~i~~~~~~~~~~~~~le~GMV~aiEP~i~~~~~~~~~~~d~w~ 231 (273) +.++..+|..+++.|.. +.|.+|||+| |++||+|....+.. ..+...|++|||+++|||.| T Consensus 454 ~~lD~laR~~LW~~gLD-y~HgTGHGVG~fLnVhE~P~~is~r~--~~~~~~l~ag~~~s~EPGYY-------------- 516 (606) T KOG2413 454 SVLDALARSALWKAGLD-YGHGTGHGVGSFLNVHEGPIGIGYRP--YSSNFPLQAGMVFSIEPGYY-------------- 516 (606) T ss_pred CHHHHHHHHHHHHHCCC-CCCCCCCCCCCCEEECCCCCEEEEEE--CCCCCHHCCCEEECCCCCCC-------------- T ss_conf 00699999999860666-68888766565067415886100124--58872003853862688611-------------- Q ss_pred EEECCCCEEEEEEEEEEECCCCC----------EECCCCCCC Q ss_conf 88337871678442189847985----------765267988 Q gi|254780899|r 232 AVTRDRSLSAQYEHTIGITKAGC----------EIFTLSPNN 263 (273) Q Consensus 232 ~~~~~g~~~~~~Ed~vlVT~~G~----------E~LT~~p~~ 263 (273) +||.||+|+|+-++|.+.+- |.||..|.. T Consensus 517 ---~dg~fGIRienv~~vvd~~~~~~~~~~L~fe~lT~vP~q 555 (606) T KOG2413 517 ---KDGEFGIRIENVVEVVDAGTKHNFRGFLTFEPLTLVPYQ 555 (606) T ss_pred ---CCCCCEEEEEEEEEEEECCCCCCCCCEEEECCCEECCEE T ss_conf ---068642888648999865424344340223110255326 No 31 >KOG2776 consensus Probab=99.90 E-value=4.2e-22 Score=167.85 Aligned_cols=263 Identities=19% Similarity=0.251 Sum_probs=200.2 Q ss_pred CCCCHHH-HCCCEEECCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCC--CC-CCCCCCCCCC Q ss_conf 9772033-139788539999999999999999999999997579988999999999999985983--03-4433244455 Q gi|254780899|r 1 MLSSSSR-ESGSINIYTPEELENIRSACNVVARCLDSLTPIIKPGTTTEEIDDFVLKFGMENNAI--PA-TLNYRGYKKS 76 (273) Q Consensus 1 ~~~~~~~-~~~~~~IKs~~EI~~~R~A~~i~~~~~~~~~~~i~~G~se~el~~~~~~~~~~~g~~--~~-~~~~~~f~~~ 76 (273) |++.++. +...-+|-++..+..+|-|++|+..+++.+.+.+.||.+..||+.....++.+.-+. -. -....|-... T Consensus 1 mss~d~~~~~~d~tia~~~vvtKYk~AgeI~n~~lk~V~~~~~~gasv~eiC~~GD~~i~E~t~kiYK~eK~~~KGIAfP 80 (398) T KOG2776 1 MSSHDETITEKDKTIANDSVVTKYKMAGEIVNKVLKSVVELCQPGASVREICEKGDSLILEETGKIYKKEKDFEKGIAFP 80 (398) T ss_pred CCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCC T ss_conf 98543442220001034777776657899999999999998458960888887356899999988875344452265556 Q ss_pred CCCCCCCCCCCCCC----CCCCCCCCEEEEEEEEEEEEEECCCCCCCCCCCCCC-----CCCHHHHHHHHHHHHHHHHCC Q ss_conf 32344642111212----222001340579978766530002311123334444-----321036778776543221100 Q gi|254780899|r 77 CCTSINHVICHGIP----SNKQLREGDIVNVDVTYVVNGWHGDSSRMYPVGKIK-----RAAERILQVTYESLYKGIAAV 147 (273) Q Consensus 77 ~~~s~N~~~~H~~p----~d~~l~~GD~v~iD~g~~~~GY~~D~~RT~~~G~~~-----~~~~~l~~~~~~a~~~~i~~i 147 (273) +|+|+|+++||+.| .+..|+.||+|+||+|++++||.+-++.|++++.++ ....++..++..|.++++..+ T Consensus 81 T~Isvnncv~h~sPlksd~~~~Lk~GDvVKIdLG~HiDGfiA~vaHT~VV~~~~~~~vtG~kADvI~AAh~A~eaa~rll 160 (398) T KOG2776 81 TSISVNNCVCHFSPLKSDADYTLKEGDVVKIDLGVHIDGFIALVAHTIVVGPAPDTPVTGRKADVIAAAHLAAEAALRLL 160 (398) T ss_pred CEECCCCCEECCCCCCCCCCCCCCCCCEEEEEECCEECCCEEEEEEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHH T ss_conf 05630010014677877876322479879997411130011001445895378888564713579999899999999984 Q ss_pred CCCCCCCHHHHHHHHHHHHCCCEECCCCCCCCCCCCCCCC-CEEEECCCC---CCCCCCEECCCCEEEECCEEECCCCCC Q ss_conf 1121221048999999986296010475212332211354-036531564---545664511673788311144377762 Q gi|254780899|r 148 KLNANIEDIGKAIQRYAHSERYSVVEVFCGHGIGKSFHEK-PEILHFYDP---LYPSVGTFQEGMVFTIEPMLNVGGSSA 223 (273) Q Consensus 148 kpG~~~~ei~~a~~~~~~~~g~~~~~~~~GHgiGl~~hE~-P~i~~~~~~---~~~~~~~le~GMV~aiEP~i~~~~~~~ 223 (273) |||++-..|.+++++.+.++++..+.....|..-..+.++ +.|....+. ..-....++++.|++++.-...+.... T Consensus 161 kpgn~n~~vT~~i~k~aas~~c~pVegmlshql~~~~idGeKtIi~n~sdqq~~~~e~~~fe~~Evya~Di~~stg~~~~ 240 (398) T KOG2776 161 KPGNTNTQVTRAIVKTAASYGCKPVEGMLSHQLKQHVIDGEKTIIQNPSDQQKKEHEKTEFEEHEVYAIDILVSTGEGSP 240 (398) T ss_pred CCCCCCCHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHCCCCEEECCCCHHHHCHHCCCCCCCCEEEEEEEEEECCCCCC T ss_conf 78999953778999999970886666611477776521577247627653431200035555543788888985488864 Q ss_pred EECCCC--------------------------------------------------------------CEE-EECCCCEE Q ss_conf 564688--------------------------------------------------------------668-83378716 Q gi|254780899|r 224 KVLSDG--------------------------------------------------------------WTA-VTRDRSLS 240 (273) Q Consensus 224 ~~~~d~--------------------------------------------------------------w~~-~~~~g~~~ 240 (273) ...++. +.+ +.++|.+. T Consensus 241 K~~~~~~~t~y~kd~~~~y~LKlKaSR~~~seI~k~~g~~PF~~rs~~~e~r~rmGl~Ec~~~~ll~p~pVl~~kp~~~v 320 (398) T KOG2776 241 KEGDDRAPTIYYKDESVSYMLKLKASRALLSEIKKKFGVMPFTLRSLEEEFRARLGLVECTNHGLLVPYPVLYEKPGEFV 320 (398) T ss_pred CCCCCCCCEEEEECCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCEEEEEEECCCCCHH T ss_conf 43444465369833216777888999999999875348755344664367776521687604763100025653886310 Q ss_pred EEEEEEEEECCCCCEECCCCCCC Q ss_conf 78442189847985765267988 Q gi|254780899|r 241 AQYEHTIGITKAGCEIFTLSPNN 263 (273) Q Consensus 241 ~~~Ed~vlVT~~G~E~LT~~p~~ 263 (273) +|++.||+...+|.-.||..|-+ T Consensus 321 aqfk~TvllmPng~~~l~~~p~~ 343 (398) T KOG2776 321 AQFKFTVLLMPNGSLRLTGSPFK 343 (398) T ss_pred EEEEEEEEECCCCCCCCCCCCCC T ss_conf 10246789635886536678988 No 32 >KOG1189 consensus Probab=99.89 E-value=3.3e-22 Score=168.53 Aligned_cols=241 Identities=16% Similarity=0.200 Sum_probs=178.6 Q ss_pred HHCCCEEECCHHHHHHHHHHHHHHHHHHHH-----HHHHCCCC--CCHHHHHHHHHHHHHHCCCCCCCC---CCCCCCCC Q ss_conf 313978853999999999999999999999-----99975799--889999999999999859830344---33244455 Q gi|254780899|r 7 RESGSINIYTPEELENIRSACNVVARCLDS-----LTPIIKPG--TTTEEIDDFVLKFGMENNAIPATL---NYRGYKKS 76 (273) Q Consensus 7 ~~~~~~~IKs~~EI~~~R~A~~i~~~~~~~-----~~~~i~~G--~se~el~~~~~~~~~~~g~~~~~~---~~~~f~~~ 76 (273) .......||++.||+.+|+|+.++...|.. +.+.+-.+ ++-..|...+...+...-..+... --+.||++ T Consensus 130 ~ls~l~avKDd~Ei~~irksa~~s~~vm~k~~~~~~~~aiD~ekkvthskLsD~~e~~I~~~k~s~~l~~~~~d~cY~PI 209 (960) T KOG1189 130 GLSKLFAVKDDEEIANIRKSAAASSAVMNKYLVDELVEAIDEEKKVTHSKLSDLMESAIEDKKYSPGLDPDLLDMCYPPI 209 (960) T ss_pred HHHHHEEECCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCHH T ss_conf 54565033778999888887788999999999999999865124010678888898764223247666843266336736 Q ss_pred CCCCCCCCC-CCCCCCCCCCCCCEEEEEEEEEEEEEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCH Q ss_conf 323446421-1121222200134057997876653000231112333444432103677877654322110011212210 Q gi|254780899|r 77 CCTSINHVI-CHGIPSNKQLREGDIVNVDVTYVVNGWHGDSSRMYPVGKIKRAAERILQVTYESLYKGIAAVKLNANIED 155 (273) Q Consensus 77 ~~~s~N~~~-~H~~p~d~~l~~GD~v~iD~g~~~~GY~~D~~RT~~~G~~~~~~~~l~~~~~~a~~~~i~~ikpG~~~~e 155 (273) +.+|-|-.+ +....++..| + +++.-+|++|++||+.++|||.+ +|+.++++.|+..+.+|.+.++.+|||.+.++ T Consensus 210 iqSGg~ydlk~sa~s~~~~L--~-~I~cs~G~RynsYCSNv~RT~Li-dpssemq~nY~fLl~aqe~il~~lrpG~ki~d 285 (960) T KOG1189 210 IQSGGKYDLKPSAVSDDNHL--H-VILCSLGIRYNSYCSNVSRTYLI-DPSSEMQENYEFLLAAQEEILKLLRPGTKIGD 285 (960) T ss_pred HHCCCCCCCCCCCCCCCCCC--E-EEEEEECCHHHHHHHCCEEEEEE-CCHHHHHHHHHHHHHHHHHHHHHHCCCCCHHH T ss_conf 63377645652313476653--0-58855020465541022003774-67089888899999999999997458981788 Q ss_pred HHHHHHHHHHHCCCEECCCC---CCCCCCCCCCCCCEEEECCCCCCCCCCEECCCCEEEECCEEECCCCCCEECCCCCEE Q ss_conf 48999999986296010475---212332211354036531564545664511673788311144377762564688668 Q gi|254780899|r 156 IGKAIQRYAHSERYSVVEVF---CGHGIGKSFHEKPEILHFYDPLYPSVGTFQEGMVFTIEPMLNVGGSSAKVLSDGWTA 232 (273) Q Consensus 156 i~~a~~~~~~~~g~~~~~~~---~GHgiGl~~hE~P~i~~~~~~~~~~~~~le~GMV~aiEP~i~~~~~~~~~~~d~w~~ 232 (273) |+.++-+++++.+......| .|-||||.+-|.-.+. ...++.+|++||||.|--|+. .. .+ T Consensus 286 VY~~~l~~v~k~~Pel~~~~~k~lG~~iGlEFREssl~i-----naKnd~~lk~gmvFni~lGf~----nl---~n---- 349 (960) T KOG1189 286 VYEKALDYVEKNKPELVPNFTKNLGFGIGLEFRESSLVI-----NAKNDRVLKKGMVFNISLGFS----NL---TN---- 349 (960) T ss_pred HHHHHHHHHHHCCCCHHHHHHHHHCCCEEEEEECCCCCC-----CCCCHHHHCCCCEEEEEECCC----CC---CC---- T ss_conf 999999999853932456665430421323761154201-----666623415784899986666----56---78---- Q ss_pred EECCCCEEEEEEEEEEECCCCC-EECCCCCCCCCCC Q ss_conf 8337871678442189847985-7652679886768 Q gi|254780899|r 233 VTRDRSLSAQYEHTIGITKAGC-EIFTLSPNNLGQP 267 (273) Q Consensus 233 ~~~~g~~~~~~Ed~vlVT~~G~-E~LT~~p~~l~~~ 267 (273) -...+.+...+.|||+|+++++ ++||..++..-.+ T Consensus 350 ~~~~~~yaL~l~DTvlv~e~~p~~vLT~~~K~~~dv 385 (960) T KOG1189 350 PESKNSYALLLSDTVLVGEDPPAEVLTDSAKAVKDV 385 (960) T ss_pred CCCCCCHHHHCCCEEEECCCCCCHHHCCCCHHHCCC T ss_conf 655650423102237845899642311243333032 No 33 >KOG2775 consensus Probab=99.87 E-value=1e-20 Score=158.62 Aligned_cols=233 Identities=24% Similarity=0.348 Sum_probs=182.2 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHH---HCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 399999999999999999999999975799889999999999999---85983034433244455323446421112122 Q gi|254780899|r 15 YTPEELENIRSACNVVARCLDSLTPIIKPGTTTEEIDDFVLKFGM---ENNAIPATLNYRGYKKSCCTSINHVICHGIPS 91 (273) Q Consensus 15 Ks~~EI~~~R~A~~i~~~~~~~~~~~i~~G~se~el~~~~~~~~~---~~g~~~~~~~~~~f~~~~~~s~N~~~~H~~p~ 91 (273) -..+...-+|+|+++..++-+++...|+|||+-.||+..++...+ ..++..+. .+||+- .|.|+|..||.|+ T Consensus 80 ~~~~i~~d~rraAE~HRqvR~yv~s~ikPGmtm~ei~e~iEnttR~li~e~gl~aG---i~FPtG--~SlN~cAAHyTpN 154 (397) T KOG2775 80 TESDIYQDLRRAAEAHRQVRKYVQSIIKPGMTMIEICETIENTTRKLILENGLNAG---IGFPTG--CSLNHCAAHYTPN 154 (397) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHCCCCCC---CCCCCC--CCCCCHHHHCCCC T ss_conf 30487888999999999999999986167642999999998889999874551027---667776--6621034306899 Q ss_pred ---CCCCCCCEEEEEEEEEEEEEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHCC Q ss_conf ---22001340579978766530002311123334444321036778776543221100112122104899999998629 Q gi|254780899|r 92 ---NKQLREGDIVNVDVTYVVNGWHGDSSRMYPVGKIKRAAERILQVTYESLYKGIAAVKLNANIEDIGKAIQRYAHSER 168 (273) Q Consensus 92 ---d~~l~~GD~v~iD~g~~~~GY~~D~~RT~~~G~~~~~~~~l~~~~~~a~~~~i~~ikpG~~~~ei~~a~~~~~~~~g 168 (273) ..+|+.+|+++||+|.+++|-..|++.|+.+ ++....|..++++|.+.+|+-..-.++..||.++++++++++. T Consensus 155 aGd~tVLqydDV~KiDfGthi~GrIiDsAFTv~F---~p~~d~Ll~AvreaT~tGIkeaGiDvRlcdiG~aiqEVmeSyE 231 (397) T KOG2775 155 AGDKTVLKYDDVMKIDFGTHIDGRIIDSAFTVAF---NPKYDPLLAAVREATNTGIKEAGIDVRLCDIGEAIQEVMESYE 231 (397) T ss_pred CCCCEEEEECCEEEEECCCCCCCEEEEEEEEEEE---CCCCCHHHHHHHHHHHHHHHHCCCEEEEHHHHHHHHHHHHHEE T ss_conf 9983464206568874021106727534568860---7552089999998875026552840232103688999766307 Q ss_pred C---------EECCCCCCCCCCCC-CCCCCEEEECCCCCCCCCCEECCCCEEEECCEEECCCCCCEECCC---------- Q ss_conf 6---------01047521233221-135403653156454566451167378831114437776256468---------- Q gi|254780899|r 169 Y---------SVVEVFCGHGIGKS-FHEKPEILHFYDPLYPSVGTFQEGMVFTIEPMLNVGGSSAKVLSD---------- 228 (273) Q Consensus 169 ~---------~~~~~~~GHgiGl~-~hE~P~i~~~~~~~~~~~~~le~GMV~aiEP~i~~~~~~~~~~~d---------- 228 (273) . ..++++.||+|+.. +|-.-.++.. .-++.+.+|+|..|+||.+-++|. +++.+| T Consensus 232 vEi~Gk~~~VKpIrnLnGHSI~~yrIH~gksVPiV---kgge~trmee~e~yAIETFgSTGk--G~v~ddmecSHymkn~ 306 (397) T KOG2775 232 VEINGKTYQVKPIRNLNGHSIAQYRIHGGKSVPIV---KGGEQTRMEEGEIYAIETFGSTGK--GYVHDDMECSHYMKNF 306 (397) T ss_pred EEECCCEEEEEECCCCCCCCCCCEEEECCCCCCEE---CCCCCEEECCCEEEEEEEECCCCC--CEECCCCCCCHHHHCC T ss_conf 87678162000131357874332476258332224---477311320770699984046776--2455886410465403 Q ss_pred ---------------------C---------C-------------------EEE-------ECCCCEEEEEEEEEEECCC Q ss_conf ---------------------8---------6-------------------688-------3378716784421898479 Q gi|254780899|r 229 ---------------------G---------W-------------------TAV-------TRDRSLSAQYEHTIGITKA 252 (273) Q Consensus 229 ---------------------~---------w-------------------~~~-------~~~g~~~~~~Ed~vlVT~~ 252 (273) + | +++ --+|++.+||||||+..+. T Consensus 307 ~~~~vplrl~~~K~ll~~I~knfgTLaFcrR~lDrlGetKyLmAlk~Lc~~Giv~pyPPLcDi~G~ytAQfEHTIll~pt 386 (397) T KOG2775 307 ELGHVPLRLQRSKGLLNTIDKNFGTLAFCRRWLDRLGETKYLMALKNLCDMGIVQPYPPLCDIKGSYTAQFEHTILLSPT 386 (397) T ss_pred CCCCCCCCCHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCEEEEECEEEEECCH T ss_conf 43446646177899999986104542112788976404789999876654154667886315676413200206784411 Q ss_pred CCEECCCC Q ss_conf 85765267 Q gi|254780899|r 253 GCEIFTLS 260 (273) Q Consensus 253 G~E~LT~~ 260 (273) +-|++|+- T Consensus 387 ~KEVvsrG 394 (397) T KOG2775 387 GKEVVSRG 394 (397) T ss_pred HCCCCCCC T ss_conf 13010156 No 34 >COG5406 Nucleosome binding factor SPN, SPT16 subunit [Transcription / DNA replication, recombination, and repair / Chromatin structure and dynamics] Probab=99.59 E-value=1.4e-14 Score=117.66 Aligned_cols=245 Identities=16% Similarity=0.118 Sum_probs=161.9 Q ss_pred CCCHHHHCCCEEECCHHHHHHHHHHHHHHHHHHHHHHHH---CCCC---CCHHHHHHHHHHHHH----------HCCCCC Q ss_conf 772033139788539999999999999999999999997---5799---889999999999999----------859830 Q gi|254780899|r 2 LSSSSRESGSINIYTPEELENIRSACNVVARCLDSLTPI---IKPG---TTTEEIDDFVLKFGM----------ENNAIP 65 (273) Q Consensus 2 ~~~~~~~~~~~~IKs~~EI~~~R~A~~i~~~~~~~~~~~---i~~G---~se~el~~~~~~~~~----------~~g~~~ 65 (273) +|..-....+-.+|+.+||+.+|.+++..+..|....+. +--| +|-..+...+...+. +.|-.. T Consensus 158 ~DvslgLsk~~~~KD~~E~an~~~ss~~s~~~M~~~~~em~~~~D~~~kit~~KlsD~mes~iddv~f~q~~s~~l~~~~ 237 (1001) T COG5406 158 SDVSLGLSKMFLTKDAEEIANCRASSAASSVLMRYFVKEMEMLWDGAFKITHGKLSDLMESLIDDVEFFQTKSLKLGDID 237 (1001) T ss_pred HHHHHHHHHHHCCCCHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHCCHHHHHHHCCCCCCCCC T ss_conf 23323266776006688875310232377999999887799987400121221578876520013666643375424454 Q ss_pred CCCCCCCCCCCCCCCCCC-CCCCCCCCCCCCCCCEEEEEEEEEEEEEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHH Q ss_conf 344332444553234464-2111212222001340579978766530002311123334444321036778776543221 Q gi|254780899|r 66 ATLNYRGYKKSCCTSINH-VICHGIPSNKQLREGDIVNVDVTYVVNGWHGDSSRMYPVGKIKRAAERILQVTYESLYKGI 144 (273) Q Consensus 66 ~~~~~~~f~~~~~~s~N~-~~~H~~p~d~~l~~GD~v~iD~g~~~~GY~~D~~RT~~~G~~~~~~~~l~~~~~~a~~~~i 144 (273) --.-.+.|.+++.+|..- .-|.....++. -.||+|++-+|..|+|||+.++|||.. +|+.+|++-|+-++.+|...+ T Consensus 238 ~d~lew~ytpiiqsg~~~Dl~psa~s~~~~-l~gd~vl~s~GiRYn~YCSn~~RT~l~-dp~~e~~~Ny~fl~~lQk~i~ 315 (1001) T COG5406 238 LDQLEWCYTPIIQSGGSIDLTPSAFSFPME-LTGDVVLLSIGIRYNGYCSNMSRTILT-DPDSEQQKNYEFLYMLQKYIL 315 (1001) T ss_pred HHHHHHHCCHHHCCCCEEECCCCCCCCCHH-HCCCEEEEEEEEEECCCCCCCCCEEEE-CCCHHHHHHHHHHHHHHHHHH T ss_conf 034421035022158613056202158544-337457999513433500234415870-773676535899999999998 Q ss_pred HCCCCCCCCCHHHHHHHHHHHHCCCEECCCC---CCCCCCCCCCCCCEEEECCCCCCCCCCEECCCCEEEECCEEECCCC Q ss_conf 1001121221048999999986296010475---2123322113540365315645456645116737883111443777 Q gi|254780899|r 145 AAVKLNANIEDIGKAIQRYAHSERYSVVEVF---CGHGIGKSFHEKPEILHFYDPLYPSVGTFQEGMVFTIEPMLNVGGS 221 (273) Q Consensus 145 ~~ikpG~~~~ei~~a~~~~~~~~g~~~~~~~---~GHgiGl~~hE~P~i~~~~~~~~~~~~~le~GMV~aiEP~i~~~~~ 221 (273) ..+|||...++|+..+.+++++.|.....+| +|-++|+.+.+...+.+. .+.++|++||+|.|--|+. T Consensus 316 ~~~rpG~~~g~iY~~~~~yi~~~~pel~pnF~~nvG~~igiefR~s~~~~nv-----kn~r~lq~g~~fnis~gf~---- 386 (1001) T COG5406 316 GLVRPGTDSGIIYSEAEKYISSNGPELGPNFIYNVGLMIGIEFRSSQKPFNV-----KNGRVLQAGCIFNISLGFG---- 386 (1001) T ss_pred HHCCCCCCCHHHHHHHHHHHHHCCCCCCCHHHHHHHHHCCCCCCCCCCCEEC-----CCCCEECCCCEEEEEECCC---- T ss_conf 5207998705689999999872398647407643201113010014662542-----4784311350799961356---- Q ss_pred CCEECCCCCEEEECCCCEEEEEEEEEEECCCCCEECCCCCCCC Q ss_conf 6256468866883378716784421898479857652679886 Q gi|254780899|r 222 SAKVLSDGWTAVTRDRSLSAQYEHTIGITKAGCEIFTLSPNNL 264 (273) Q Consensus 222 ~~~~~~d~w~~~~~~g~~~~~~Ed~vlVT~~G~E~LT~~p~~l 264 (273) -.+.+ -....+..++-||+-|+-+-|-+||.+|+-- T Consensus 387 -nl~~~------~~~Nnyal~l~dt~qi~ls~p~~~t~~~kaq 422 (1001) T COG5406 387 -NLINP------HPKNNYALLLIDTEQISLSNPIVFTDSPKAQ 422 (1001) T ss_pred -CCCCC------CCCCCHHHHHCCCEEEECCCCEECCCCCCCC T ss_conf -55788------7665254441242586337860014674445 No 35 >cd01088 MetAP2 Methionine Aminopeptidase 2. E.C. 3.4.11.18. Also known as methionyl aminopeptidase and peptidase M. Catalyzes release of N-terminal amino acids, preferentially methionine, from peptides and arylamides. Probab=97.01 E-value=0.0027 Score=42.23 Aligned_cols=96 Identities=15% Similarity=0.136 Sum_probs=60.9 Q ss_pred HHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCC--CCCCCCCCCCCCCCCC-CCCCCCCCCEEE Q ss_conf 9999999999999997579988999999999999985983034433244--4553234464211121-222200134057 Q gi|254780899|r 25 SACNVVARCLDSLTPIIKPGTTTEEIDDFVLKFGMENNAIPATLNYRGY--KKSCCTSINHVICHGI-PSNKQLREGDIV 101 (273) Q Consensus 25 ~A~~i~~~~~~~~~~~i~~G~se~el~~~~~~~~~~~g~~~~~~~~~~f--~~~~~~s~N~~~~H~~-p~d~~l~~GD~v 101 (273) +..+.+.+++..+++.++||.+..||...+++.+.+.|..+- .++.|- .... .--.-.+||+. +++.+|++|++. T Consensus 104 ~Li~aa~~al~~ai~~~kpG~~~~dIg~~Ie~~~~~~g~~~v-~~l~GH~i~ry~-ih~~~~Ipn~~~~~~~~l~~g~v~ 181 (291) T cd01088 104 DLLEAAKEALNAAIKEAGPDVRLGEIGEAIEEVIESYGFKPI-RNLTGHSIERYR-LHAGKSIPNVKGGEGTRLEEGDVY 181 (291) T ss_pred HHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHCCCEEE-CCCCCCCCCCCC-CCCCCEECEECCCCCCEECCCCEE T ss_conf 899999999999999853898989999999999998697700-365255777722-169975342528998366068689 Q ss_pred EEEEEEEE-EEECCCCCCCCCC Q ss_conf 99787665-3000231112333 Q gi|254780899|r 102 NVDVTYVV-NGWHGDSSRMYPV 122 (273) Q Consensus 102 ~iD~g~~~-~GY~~D~~RT~~~ 122 (273) .||..+.- .|+..+-.+|.++ T Consensus 182 aIEp~~t~G~g~v~~~~~~~iy 203 (291) T cd01088 182 AIEPFATTGKGYVHDGPECSIY 203 (291) T ss_pred EEEEEEECCCCEEECCCCEEEE T ss_conf 9855740587526338961588 No 36 >cd01089 PA2G4-like Related to aminopepdidase M, this family contains proliferation-associated protein 2G4. Family members have been implicated in cell cycle control. Probab=96.94 E-value=0.0031 Score=41.89 Aligned_cols=90 Identities=19% Similarity=0.178 Sum_probs=59.5 Q ss_pred CCHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHCCCEECCC--CCCCCC----CCCCCCCCEEEECCCCCC-CC Q ss_conf 21036778776543221100112122104899999998629601047--521233----221135403653156454-56 Q gi|254780899|r 128 AAERILQVTYESLYKGIAAVKLNANIEDIGKAIQRYAHSERYSVVEV--FCGHGI----GKSFHEKPEILHFYDPLY-PS 200 (273) Q Consensus 128 ~~~~l~~~~~~a~~~~i~~ikpG~~~~ei~~a~~~~~~~~g~~~~~~--~~GHgi----Gl~~hE~P~i~~~~~~~~-~~ 200 (273) +.|+.-+++.++++...+.+|||++..||++.+++++.+.+-..+.. ...+|+ -.++-| .+. .+.|.. .+ T Consensus 3 KmR~Ag~i~a~~l~~~~~~i~pG~tt~eld~~~~~~i~~~~~~~~~~~~~~~~G~~~p~CiSvNe--~v~-HGiP~~~~~ 79 (228) T cd01089 3 KYKTAGQIANKVLKQVISLCVPGAKVVDLCEKGDKLILEELGKVYKKEKKLEKGIAFPTCISVNN--CVC-HFSPLKSDA 79 (228) T ss_pred HHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCEECCCC--EEE-CCCCCCCCC T ss_conf 89999999999999999578699969999999999999830432567666667778672750254--445-689998888 Q ss_pred CCEECCCCEEEECCEEECCC Q ss_conf 64511673788311144377 Q gi|254780899|r 201 VGTFQEGMVFTIEPMLNVGG 220 (273) Q Consensus 201 ~~~le~GMV~aiEP~i~~~~ 220 (273) +++|++|.+++|+-+++..+ T Consensus 80 ~~~L~~GDiV~iD~g~~~dG 99 (228) T cd01089 80 TYTLKDGDVVKIDLGCHIDG 99 (228) T ss_pred CCCCCCCCEEEEEEEEEECC T ss_conf 73546899899977688687 No 37 >PRK08671 methionine aminopeptidase; Provisional Probab=96.85 E-value=0.0039 Score=41.19 Aligned_cols=91 Identities=18% Similarity=0.196 Sum_probs=57.6 Q ss_pred HHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCC--CCC-CCCCCCCCCCCCCCC-CCCCCCCCEE Q ss_conf 999999999999999757998899999999999998598303443324--445-532344642111212-2220013405 Q gi|254780899|r 25 SACNVVARCLDSLTPIIKPGTTTEEIDDFVLKFGMENNAIPATLNYRG--YKK-SCCTSINHVICHGIP-SNKQLREGDI 100 (273) Q Consensus 25 ~A~~i~~~~~~~~~~~i~~G~se~el~~~~~~~~~~~g~~~~~~~~~~--f~~-~~~~s~N~~~~H~~p-~d~~l~~GD~ 100 (273) +..+.+..+++.+++.++||++..||...+++.+.+.|..+- .++.| ... .+- .+-.+||+.. +...+++|++ T Consensus 105 ~Li~aa~~al~~ai~~~kpG~~~~dIg~~I~~~~~~~g~~~v-~~~~gHgigry~iH--~~~~Ipn~~~~~~~~l~~g~v 181 (293) T PRK08671 105 DLVEASREALEAAIEIVKPGVSVGEIGRAIEETIKSYGFKPI-RNLTGHGIERYDLH--AGPSIPNYDEGGGAVLKEGDV 181 (293) T ss_pred HHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCEEE-CCCCCCCCCCCCCC--CCCEECCCCCCCCCEECCCCE T ss_conf 999999999999998706997978999999999998698895-14444676651542--798705206998715507858 Q ss_pred EEEEEEEEE-EEECCCCCC Q ss_conf 799787665-300023111 Q gi|254780899|r 101 VNVDVTYVV-NGWHGDSSR 118 (273) Q Consensus 101 v~iD~g~~~-~GY~~D~~R 118 (273) ..||..+.- .|+..+-.+ T Consensus 182 ~aIEpf~ttG~g~v~e~~~ 200 (293) T PRK08671 182 YAIEPFATDGEGQVVEGPE 200 (293) T ss_pred EEEEEEEECCCCEEEECCC T ss_conf 9994476278875876796 No 38 >COG0024 Map Methionine aminopeptidase [Translation, ribosomal structure and biogenesis] Probab=96.80 E-value=0.0039 Score=41.24 Aligned_cols=86 Identities=12% Similarity=0.114 Sum_probs=48.3 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHCCCE-ECCCCCC--CCCCCCCCCCCEEEECCCCCCCCCCEEC Q ss_conf 103677877654322110011212210489999999862960-1047521--2332211354036531564545664511 Q gi|254780899|r 129 AERILQVTYESLYKGIAAVKLNANIEDIGKAIQRYAHSERYS-VVEVFCG--HGIGKSFHEKPEILHFYDPLYPSVGTFQ 205 (273) Q Consensus 129 ~~~l~~~~~~a~~~~i~~ikpG~~~~ei~~a~~~~~~~~g~~-~~~~~~G--HgiGl~~hE~P~i~~~~~~~~~~~~~le 205 (273) .++.-+++.++++.+.+.+|||++..||+..+++++.++|.- .+.++-| -.+.+++-| ....+.|. +..+|+ T Consensus 14 ~r~Ag~i~a~~l~~~~~~v~pGvtt~Eld~~~~~~i~~~ga~pa~~gy~g~~~~~ciSvNe---~v~HgiP~--d~~vlk 88 (255) T COG0024 14 MREAGKIAAKALKEVASLVKPGVTTLELDEIAEEFIREKGAYPAFLGYKGFPFPTCISVNE---VVAHGIPG--DKKVLK 88 (255) T ss_pred HHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCEEHHCCCCCCCCCEEEEHHH---EEEECCCC--CCCCCC T ss_conf 9999999999999999871799979999999999999748636110666988563840315---03514789--873458 Q ss_pred CCCEEEECCEEECC Q ss_conf 67378831114437 Q gi|254780899|r 206 EGMVFTIEPMLNVG 219 (273) Q Consensus 206 ~GMV~aiEP~i~~~ 219 (273) +|-++.|+-++.+. T Consensus 89 ~GDiv~IDvg~~~d 102 (255) T COG0024 89 EGDIVKIDVGAHID 102 (255) T ss_pred CCCEEEEEEEEEEC T ss_conf 99989998899977 No 39 >PRK09795 aminopeptidase; Provisional Probab=96.52 E-value=0.04 Score=34.48 Aligned_cols=80 Identities=8% Similarity=0.001 Sum_probs=51.6 Q ss_pred HHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHCCCEE--CCCCCCCCCCCCCCCCCEEEECCCCCCCCCCEECCC Q ss_conf 036778776543221100112122104899999998629601--047521233221135403653156454566451167 Q gi|254780899|r 130 ERILQVTYESLYKGIAAVKLNANIEDIGKAIQRYAHSERYSV--VEVFCGHGIGKSFHEKPEILHFYDPLYPSVGTFQEG 207 (273) Q Consensus 130 ~~l~~~~~~a~~~~i~~ikpG~~~~ei~~a~~~~~~~~g~~~--~~~~~GHgiGl~~hE~P~i~~~~~~~~~~~~~le~G 207 (273) ++..+++..++.++++.+|||++=.||.+.+...+.+.|-.. +...++.|-- ..++|. ...+.++++| T Consensus 137 r~A~~iad~~~~~~~~~i~~G~tE~eiaa~~~~~~~~~Ga~~~sf~~iv~sG~~------~a~pH~----~~s~r~i~~G 206 (361) T PRK09795 137 RLACGIADRGAEHIRRFIQAGMSEREIAAELEWFMRQQGAEKASFDTIVASGWR------GALPHG----KASDKIVAAG 206 (361) T ss_pred HHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCCCCCCCEEECCCCC------CCCCCC----CCCCCCCCCC T ss_conf 999999999999999876599889999999999999829986666603036677------746778----8777351689 Q ss_pred CEEEECCEEECC Q ss_conf 378831114437 Q gi|254780899|r 208 MVFTIEPMLNVG 219 (273) Q Consensus 208 MV~aiEP~i~~~ 219 (273) -.+.|+-+.... T Consensus 207 d~v~iD~G~~~~ 218 (361) T PRK09795 207 EFVTLDFGALYQ 218 (361) T ss_pred CEEEEEEEEEEC T ss_conf 989998758989 No 40 >PRK12318 methionine aminopeptidase; Provisional Probab=96.42 E-value=0.011 Score=38.13 Aligned_cols=86 Identities=17% Similarity=0.143 Sum_probs=56.6 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHCCCEECC-CC----CCCCCCCCCCCCCEEEECCCCCCCCCCE Q ss_conf 103677877654322110011212210489999999862960104-75----2123322113540365315645456645 Q gi|254780899|r 129 AERILQVTYESLYKGIAAVKLNANIEDIGKAIQRYAHSERYSVVE-VF----CGHGIGKSFHEKPEILHFYDPLYPSVGT 203 (273) Q Consensus 129 ~~~l~~~~~~a~~~~i~~ikpG~~~~ei~~a~~~~~~~~g~~~~~-~~----~GHgiGl~~hE~P~i~~~~~~~~~~~~~ 203 (273) .++.-+++.++++.+.+.+|||++..||++.+++++.++|...-. .+ +-..+-.+..+ .++|. -+++.+ T Consensus 52 mR~A~~I~a~~~~~~~~~ikpGvTe~Eld~~~~~~~~~~ga~pa~l~y~~~~Fp~~vc~s~N~--~v~Hg----iP~~~~ 125 (291) T PRK12318 52 IRKACQITARILDALCEAAKEGVTTNELDQLSRELHKKYDAIPAPLNYGHPPFPKTICTSLNE--VICHG----IPNDIP 125 (291) T ss_pred HHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCEECCCCCCCCCCCCCEEEECCC--EEECC----CCCCCC T ss_conf 999999999999999836879998999999999999987898676677887888744872388--73168----999861 Q ss_pred ECCCCEEEECCEEECCC Q ss_conf 11673788311144377 Q gi|254780899|r 204 FQEGMVFTIEPMLNVGG 220 (273) Q Consensus 204 le~GMV~aiEP~i~~~~ 220 (273) |++|.++.|+-+++..+ T Consensus 126 Lk~GDlV~iD~G~~y~G 142 (291) T PRK12318 126 LKNGDIMNIDVSCIVDG 142 (291) T ss_pred CCCCCEEEEECCEEECC T ss_conf 67999899956786898 No 41 >cd01086 MetAP1 Methionine Aminopeptidase 1. E.C. 3.4.11.18. Also known as methionyl aminopeptidase and Peptidase M. Catalyzes release of N-terminal amino acids, preferentially methionine, from peptides and arylamides. Probab=96.33 E-value=0.052 Score=33.68 Aligned_cols=87 Identities=15% Similarity=0.258 Sum_probs=56.1 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCC--CCC-CCC-CCCCCCC Q ss_conf 999999999999999999757998899999999999998598303443324445532344642--111-212-2220013 Q gi|254780899|r 22 NIRSACNVVARCLDSLTPIIKPGTTTEEIDDFVLKFGMENNAIPATLNYRGYKKSCCTSINHV--ICH-GIP-SNKQLRE 97 (273) Q Consensus 22 ~~R~A~~i~~~~~~~~~~~i~~G~se~el~~~~~~~~~~~g~~~~~~~~~~f~~~~~~s~N~~--~~H-~~p-~d~~l~~ 97 (273) ..|++.+++.++++++.+.+|||++..||...+++.+.+.|-. ....+. ++.+....+.. +++ +.+ ++++|++ T Consensus 109 ~~~~~~~~v~ea~~~~i~~~kpG~~~~ev~~a~~~~~~~~g~~-~~~~~~--GHgvGl~~he~P~i~~~~~~~~~~~L~~ 185 (238) T cd01086 109 EAKKLVEVTEEALYKGIEAVKPGNRIGDIGHAIEKYAEKNGYS-VVREFG--GHGIGRKFHEEPQIPNYGRPGTGPKLKP 185 (238) T ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHCCCC-CCCCEE--ECCCCCCCCCCCCCCCCCCCCCCCCCCC T ss_conf 9999999999999999987169989999999999999986998-354633--2624677666986264454899840569 Q ss_pred CEEEEEEEEEEEEE Q ss_conf 40579978766530 Q gi|254780899|r 98 GDIVNVDVTYVVNG 111 (273) Q Consensus 98 GD~v~iD~g~~~~G 111 (273) |-++.++-+....+ T Consensus 186 GmV~tiEP~i~~~~ 199 (238) T cd01086 186 GMVFTIEPMINLGT 199 (238) T ss_pred CCEEEEECEEECCC T ss_conf 84899926484388 No 42 >PRK07281 methionine aminopeptidase; Reviewed Probab=96.24 E-value=0.019 Score=36.63 Aligned_cols=35 Identities=6% Similarity=0.046 Sum_probs=15.5 Q ss_pred HHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHCC Q ss_conf 78776543221100112122104899999998629 Q gi|254780899|r 134 QVTYESLYKGIAAVKLNANIEDIGKAIQRYAHSER 168 (273) Q Consensus 134 ~~~~~a~~~~i~~ikpG~~~~ei~~a~~~~~~~~g 168 (273) +++.++++.+.+.+|||++..||++.+++.+.++| T Consensus 18 ~i~a~~~~~~~~~ikpGit~~eld~~~~~~~~e~g 52 (286) T PRK07281 18 DFLASIHIGLRDLIKPGVDMWEVEEYVRRRCKEDN 52 (286) T ss_pred HHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCC T ss_conf 99999999888467699989999999999999759 No 43 >KOG2738 consensus Probab=96.14 E-value=0.014 Score=37.50 Aligned_cols=89 Identities=13% Similarity=0.246 Sum_probs=58.9 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHCC-CEECCCCC--CCCCCCCCCCCCEEEECCCCCCCCCCEEC Q ss_conf 1036778776543221100112122104899999998629-60104752--12332211354036531564545664511 Q gi|254780899|r 129 AERILQVTYESLYKGIAAVKLNANIEDIGKAIQRYAHSER-YSVVEVFC--GHGIGKSFHEKPEILHFYDPLYPSVGTFQ 205 (273) Q Consensus 129 ~~~l~~~~~~a~~~~i~~ikpG~~~~ei~~a~~~~~~~~g-~~~~~~~~--GHgiGl~~hE~P~i~~~~~~~~~~~~~le 205 (273) ++.....+++.++.+-.++|||++..||+.++...+-+.| |.-.-++. .-++=.++.| +...+ -++.+.|| T Consensus 125 mR~ac~LarevLd~Aa~~v~PgvTTdEiD~~VH~a~Ierg~YPSPLnYy~FPKS~CTSVNE---viCHG---IPD~RpLe 198 (369) T KOG2738 125 MRKACRLAREVLDYAATLVRPGVTTDEIDRAVHNAIIERGAYPSPLNYYGFPKSVCTSVNE---VICHG---IPDSRPLE 198 (369) T ss_pred HHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHCCHHH---EEECC---CCCCCCCC T ss_conf 9999999999999875025797638899999999998648858875457886666242221---46547---88867277 Q ss_pred CCCEEEECCEEECCCCCC Q ss_conf 673788311144377762 Q gi|254780899|r 206 EGMVFTIEPMLNVGGSSA 223 (273) Q Consensus 206 ~GMV~aiEP~i~~~~~~~ 223 (273) .|..+.|+.-+|..++.+ T Consensus 199 dGDIvNiDVtvY~~GyHG 216 (369) T KOG2738 199 DGDIVNIDVTVYLNGYHG 216 (369) T ss_pred CCCEEEEEEEEEECCCCC T ss_conf 798786889999425257 No 44 >TIGR00500 met_pdase_I methionine aminopeptidase, type I; InterPro: IPR002467 Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site . The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases . Peptidases are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry. Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. This group of metallopeptidases belong to MEROPS peptidase family M24 (clan MG), subfamily M24A. Methionine aminopeptidase (3.4.11.18 from EC) (MAP) is responsible for the removal of the amino-terminal (initiator) methionine from nascent eukaryotic cytosolic and cytoplasmic prokaryotic proteins if the penultimate amino acid is small and uncharged. All MAP studied to date are monomeric proteins that require cobalt ions for activity. Two subfamilies of MAP enzymes are known to exist , . While being evolutionary related, they only share a limited amount of sequence similarity mostly clustered around the residues shown, in the Escherichia coli MAP , to be involved in cobalt-binding. The first family consists of enzymes from prokaryotes as well as eukaryotic MAP-1, while the second group (IPR002468 from INTERPRO) is made up of archaeal MAP and eukaryotic MAP-2. ; GO: 0004239 methionyl aminopeptidase activity, 0006508 proteolysis. Probab=96.07 E-value=0.027 Score=35.62 Aligned_cols=88 Identities=14% Similarity=0.166 Sum_probs=56.2 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHCC-C-EECCCCC------CCC--CC---CCCCCCCEEEECCC Q ss_conf 1036778776543221100112122104899999998629-6-0104752------123--32---21135403653156 Q gi|254780899|r 129 AERILQVTYESLYKGIAAVKLNANIEDIGKAIQRYAHSER-Y-SVVEVFC------GHG--IG---KSFHEKPEILHFYD 195 (273) Q Consensus 129 ~~~l~~~~~~a~~~~i~~ikpG~~~~ei~~a~~~~~~~~g-~-~~~~~~~------GHg--iG---l~~hE~P~i~~~~~ 195 (273) .|++-+.+.++++..-..++||+++.||+..+++++++.| = +.+..+. |+- .| +++-| .+-| + T Consensus 12 irkag~l~a~vl~~~~~~v~PGvST~eLD~i~~~~i~~~~hA~pAf~GY~PPfsdvgfPPft~~~C~S~Ne--~V~H-G- 87 (265) T TIGR00500 12 IRKAGRLVAEVLEILEREVKPGVSTKELDRIAKDFIEKRGHAKPAFLGYYPPFSDVGFPPFTGSVCISVNE--VVIH-G- 87 (265) T ss_pred HHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEECC--EEEE-C- T ss_conf 99999999999999986368988789999999999984797235356787786668788555615632167--8885-1- Q ss_pred CCCCCCCEECCCCEEEECCEEECCCC Q ss_conf 45456645116737883111443777 Q gi|254780899|r 196 PLYPSVGTFQEGMVFTIEPMLNVGGS 221 (273) Q Consensus 196 ~~~~~~~~le~GMV~aiEP~i~~~~~ 221 (273) .++...+|++|-+++|+-++.+.++ T Consensus 88 -iP~~~~vLKdGDivniD~g~~~~Gy 112 (265) T TIGR00500 88 -IPDKKKVLKDGDIVNIDVGVIYDGY 112 (265) T ss_pred -CCCCCCEEECCCEEEEEEEEEECCE T ss_conf -3884726513677875577886030 No 45 >cd01092 APP-like Similar to Prolidase and Aminopeptidase P. The members of this subfamily presumably catalyse hydrolysis of Xaa-Pro dipeptides and/or release of any N-terminal amino acid, including proline, that is linked with proline. Probab=96.03 E-value=0.083 Score=32.33 Aligned_cols=41 Identities=20% Similarity=0.299 Sum_probs=22.8 Q ss_pred HHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCC Q ss_conf 99999999999999999757998899999999999998598 Q gi|254780899|r 23 IRSACNVVARCLDSLTPIIKPGTTTEEIDDFVLKFGMENNA 63 (273) Q Consensus 23 ~R~A~~i~~~~~~~~~~~i~~G~se~el~~~~~~~~~~~g~ 63 (273) .++..+++.++++++.+.+|||++..||...+++.+.+.|- T Consensus 105 ~~~~~~~~~~~~~~~i~~~kpG~~~~dv~~~~~~~~~~~g~ 145 (208) T cd01092 105 LKEIYEIVLEAQQAAIKAVKPGVTAKEVDKAARDVIEEAGY 145 (208) T ss_pred HHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCC T ss_conf 99999999999999997237999099999999999997598 No 46 >PRK05716 methionine aminopeptidase; Validated Probab=96.03 E-value=0.074 Score=32.68 Aligned_cols=82 Identities=17% Similarity=0.276 Sum_probs=45.6 Q ss_pred HHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCC--CCCC-CC-CCCCCCCC Q ss_conf 99999999999999999757998899999999999998598303443324445532344642--1112-12-22200134 Q gi|254780899|r 23 IRSACNVVARCLDSLTPIIKPGTTTEEIDDFVLKFGMENNAIPATLNYRGYKKSCCTSINHV--ICHG-IP-SNKQLREG 98 (273) Q Consensus 23 ~R~A~~i~~~~~~~~~~~i~~G~se~el~~~~~~~~~~~g~~~~~~~~~~f~~~~~~s~N~~--~~H~-~p-~d~~l~~G 98 (273) .|+.-+++.++++++.+.+|||++..|+...+++.+.+.|-. ...++ ..+.+....... ++|+ .+ .+.+|++| T Consensus 120 ~~~~~~~~~~a~~~~i~~ikPG~~~~dv~~a~~~~~~~~g~~-~~~~~--~GHgiGl~~hE~P~i~~~~~~~~~~~Le~G 196 (252) T PRK05716 120 DKRLCEVTKEALYLGIKAVKPGARLGDIGHAIQKYAEAEGFS-VVREY--CGHGIGRVFHEEPQVLHYGAPGDGPVLKEG 196 (252) T ss_pred HHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHCCCC-CEECC--CCCCCCCCCCCCCEECCCCCCCCCCCCCCC T ss_conf 799998999999999997567980899999999999975887-20021--245567766769703565569998535698 Q ss_pred EEEEEEEEE Q ss_conf 057997876 Q gi|254780899|r 99 DIVNVDVTY 107 (273) Q Consensus 99 D~v~iD~g~ 107 (273) -++.++-+. T Consensus 197 MV~tiEP~i 205 (252) T PRK05716 197 MVFTIEPMI 205 (252) T ss_pred CEEEEECEE T ss_conf 289981615 No 47 >cd01090 Creatinase Creatine amidinohydrolase. E.C.3.5.3.3. Hydrolyzes creatine to sarcosine and urea. Probab=96.02 E-value=0.086 Score=32.23 Aligned_cols=38 Identities=16% Similarity=0.190 Sum_probs=14.4 Q ss_pred HHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCC Q ss_conf 99999999999999975799889999999999999859 Q gi|254780899|r 25 SACNVVARCLDSLTPIIKPGTTTEEIDDFVLKFGMENN 62 (273) Q Consensus 25 ~A~~i~~~~~~~~~~~i~~G~se~el~~~~~~~~~~~g 62 (273) ++-+++.++++++.+.+|||++-.|++..+.+.+.+.| T Consensus 113 ~~~~~~~ea~~~a~~~ikPG~~~~dv~~~~~~~~~~~g 150 (228) T cd01090 113 KIWEANVAVHERGLELIKPGARCKDIAAELNEMYREHD 150 (228) T ss_pred HHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHCC T ss_conf 99999999999999981699849999999999999749 No 48 >pfam00557 Peptidase_M24 Metallopeptidase family M24. This family contains metallopeptidases. It also contains non-peptidase homologues such as the N terminal domain of Spt16 which is a histone H3-H4 binding module. Probab=96.00 E-value=0.087 Score=32.19 Aligned_cols=82 Identities=16% Similarity=0.144 Sum_probs=44.9 Q ss_pred HHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC--CCCCCCCCEEE Q ss_conf 9999999999999999757998899999999999998598303443324445532344642111212--22200134057 Q gi|254780899|r 24 RSACNVVARCLDSLTPIIKPGTTTEEIDDFVLKFGMENNAIPATLNYRGYKKSCCTSINHVICHGIP--SNKQLREGDIV 101 (273) Q Consensus 24 R~A~~i~~~~~~~~~~~i~~G~se~el~~~~~~~~~~~g~~~~~~~~~~f~~~~~~s~N~~~~H~~p--~d~~l~~GD~v 101 (273) +++.+.+.++++.+.+.+|||++-.||...+++.+.+.|-.. +.. .+.+.+........|.... ++.+|++|-++ T Consensus 106 ~~~~~~~~~~~~~~~~~~rpG~~~~ev~~~~~~~~~~~g~~~-~~~--~~GHgiGl~~~~~~P~~~~~~~~~~l~~GMv~ 182 (207) T pfam00557 106 RELYEAVLEAQEAAIAAVKPGVTGGDVDAAAREVLEEGGYGV-FPH--GLGHGIGLDVHDEGPYIISGGSDRVLEPGMVF 182 (207) T ss_pred HHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCCC-CCC--EECCCCCCCCCCCCCCCCCCCCCCEECCCCEE T ss_conf 999999999999999863799889999999999999859998-766--10251266777898640579999897999899 Q ss_pred EEEEEEE Q ss_conf 9978766 Q gi|254780899|r 102 NVDVTYV 108 (273) Q Consensus 102 ~iD~g~~ 108 (273) .++.+.. T Consensus 183 ~iep~~~ 189 (207) T pfam00557 183 TIEPGIY 189 (207) T ss_pred EECCEEE T ss_conf 9989614 No 49 >cd01066 APP_MetAP A family including aminopeptidase P, aminopeptidase M, and prolidase. Also known as metallopeptidase family M24. This family of enzymes is able to cleave amido-, imido- and amidino-containing bonds. Members exibit relatively narrow substrate specificity compared to other metallo-aminopeptidases, suggesting they play roles in regulation of biological processes rather than general protein degradation. Probab=95.94 E-value=0.061 Score=33.25 Aligned_cols=99 Identities=16% Similarity=0.097 Sum_probs=54.6 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCC-CCCCCCCCCCEE Q ss_conf 9999999999999999997579988999999999999985983034433244455323446421112-122220013405 Q gi|254780899|r 22 NIRSACNVVARCLDSLTPIIKPGTTTEEIDDFVLKFGMENNAIPATLNYRGYKKSCCTSINHVICHG-IPSNKQLREGDI 100 (273) Q Consensus 22 ~~R~A~~i~~~~~~~~~~~i~~G~se~el~~~~~~~~~~~g~~~~~~~~~~f~~~~~~s~N~~~~H~-~p~d~~l~~GD~ 100 (273) .+|++.+.+.++++++.+.++||.+..|+...+.+.+.+.|........ ..+.+..+.... |-. .-.+.+|++|-+ T Consensus 103 ~~~~~~~~~~~~~~~~~~~~~pG~~~~~i~~~~~~~~~~~g~~~~~~~~--~GH~vG~~~~e~-p~~~~~~~~~l~~gmv 179 (207) T cd01066 103 EQRELYEAVREAQEAALAALRPGVTAEEVDAAAREVLEEHGLGPNFGHR--TGHGIGLEIHEP-PVLKAGDDTVLEPGMV 179 (207) T ss_pred CCCHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHCCCCCCCCCC--EECCCCCCCCCC-CCCCCCCCCEECCCCE T ss_conf 0021789999999999985079997999999999999982888512352--243278647789-8557999988789999 Q ss_pred EEEEEEEEEEE-ECCCCCCCCCCC Q ss_conf 79978766530-002311123334 Q gi|254780899|r 101 VNVDVTYVVNG-WHGDSSRMYPVG 123 (273) Q Consensus 101 v~iD~g~~~~G-Y~~D~~RT~~~G 123 (273) +.++.+....+ +-..+.-|+++. T Consensus 180 ~~iep~~~~~~~~g~~~ed~~~Vt 203 (207) T cd01066 180 FAVEPGLYLPGGGGVRIEDTVLVT 203 (207) T ss_pred EEECCEEEECCCEEEEEEEEEEEC T ss_conf 999996076893489982179984 No 50 >PRK12896 methionine aminopeptidase; Reviewed Probab=95.47 E-value=0.17 Score=30.27 Aligned_cols=39 Identities=21% Similarity=0.319 Sum_probs=23.6 Q ss_pred HHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCC Q ss_conf 999999999999999975799889999999999999859 Q gi|254780899|r 24 RSACNVVARCLDSLTPIIKPGTTTEEIDDFVLKFGMENN 62 (273) Q Consensus 24 R~A~~i~~~~~~~~~~~i~~G~se~el~~~~~~~~~~~g 62 (273) ++..+++.++++++.+.+|||++..|+...+++.+.+.| T Consensus 125 ~~l~~~~~ea~~~~i~~ikPG~~~~di~~a~~~~~~~~G 163 (254) T PRK12896 125 EKLCRVAEEALWAGIKQVKAGRPLNDIGRAIEDFAKKNG 163 (254) T ss_pred HHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHCC T ss_conf 999999999999999983599808999999999999869 No 51 >COG0006 PepP Xaa-Pro aminopeptidase [Amino acid transport and metabolism] Probab=95.34 E-value=0.17 Score=30.23 Aligned_cols=81 Identities=12% Similarity=0.207 Sum_probs=52.6 Q ss_pred HHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHCCCEE--CCCCCCCCCCCCCCCCCEEEECCCCCCCCCCEECCC Q ss_conf 036778776543221100112122104899999998629601--047521233221135403653156454566451167 Q gi|254780899|r 130 ERILQVTYESLYKGIAAVKLNANIEDIGKAIQRYAHSERYSV--VEVFCGHGIGKSFHEKPEILHFYDPLYPSVGTFQEG 207 (273) Q Consensus 130 ~~l~~~~~~a~~~~i~~ikpG~~~~ei~~a~~~~~~~~g~~~--~~~~~GHgiGl~~hE~P~i~~~~~~~~~~~~~le~G 207 (273) ++..+++..++.++++.++||++-.||.+.+...+.+.|... +...++ .|-+ ..++|+ ...+..+++| T Consensus 164 r~A~~i~~~a~~~~~~~~~~G~tE~ev~a~l~~~~~~~G~~~~sf~~iva--~G~n----~a~pH~----~~~~~~~~~g 233 (384) T COG0006 164 RKAAEIADAALEAALEAIRPGMTEAEIAAELEYALRKGGAEGPSFDTIVA--SGEN----AALPHY----TPSDRKLRDG 233 (384) T ss_pred HHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEE--ECCC----CCCCCC----CCCCCCCCCC T ss_conf 99999999999999986107997999999999999980887577475787--6543----356567----8888766799 Q ss_pred CEEEECCEEECCC Q ss_conf 3788311144377 Q gi|254780899|r 208 MVFTIEPMLNVGG 220 (273) Q Consensus 208 MV~aiEP~i~~~~ 220 (273) -.+.|+-+....+ T Consensus 234 d~vliD~G~~~~g 246 (384) T COG0006 234 DLVLIDLGGVYNG 246 (384) T ss_pred CEEEEEECEEECC T ss_conf 9899994606898 No 52 >cd01087 Prolidase Prolidase. E.C. 3.4.13.9. Also known as Xaa-Pro dipeptidase, X-Pro dipeptidase, proline dipeptidase., imidodipeptidase, peptidase D, gamma-peptidase. Catalyses hydrolysis of Xaa-Pro dipeptides; also acts on aminoacyl-hydroxyproline analogs. No action on Pro-Pro. Probab=95.30 E-value=0.17 Score=30.26 Aligned_cols=87 Identities=13% Similarity=0.138 Sum_probs=49.5 Q ss_pred HHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCC----CC-------CCCCCC-CCCCCCCCCCC---CCCC- Q ss_conf 99999999999999997579988999999999999985983----03-------443324-44553234464---2111- Q gi|254780899|r 24 RSACNVVARCLDSLTPIIKPGTTTEEIDDFVLKFGMENNAI----PA-------TLNYRG-YKKSCCTSINH---VICH- 87 (273) Q Consensus 24 R~A~~i~~~~~~~~~~~i~~G~se~el~~~~~~~~~~~g~~----~~-------~~~~~~-f~~~~~~s~N~---~~~H- 87 (273) +++-+.+.++++++.+.+|||++-.||...+.+.+.+.+.. .. ...+.. |++.++-|+-- ..+. T Consensus 106 ~~~~~~~~~a~~~~~~~~kPG~~~~ei~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~h~~GHgiGl~~He~~~~ 185 (243) T cd01087 106 RELYEAVLAAQKAAIAACKPGVSYEDIHLLAHRVLAEGLKELGILKGDVDEIVESGAYAKFFPHGLGHYLGLDVHDVGGY 185 (243) T ss_pred HHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCC T ss_conf 99999999999999998379985999999999999999886597546588887527864442577357667645778764 Q ss_pred --CCCCCCCCCCCEEEEEEEEEEEE Q ss_conf --21222200134057997876653 Q gi|254780899|r 88 --GIPSNKQLREGDIVNVDVTYVVN 110 (273) Q Consensus 88 --~~p~d~~l~~GD~v~iD~g~~~~ 110 (273) ....+++|++|-++.|+-|..+. T Consensus 186 ~~~~~~~~~L~~GMv~aiEPgiy~~ 210 (243) T cd01087 186 LRYLRRARPLEPGMVITIEPGIYFI 210 (243) T ss_pred CCCCCCCCCCCCCCEEEECCEEEEC T ss_conf 5668889785799889987979865 No 53 >PRK12897 methionine aminopeptidase; Reviewed Probab=94.53 E-value=0.096 Score=31.92 Aligned_cols=84 Identities=15% Similarity=0.230 Sum_probs=41.8 Q ss_pred HHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHCCCEE-CCCCCC--CCCCCCCCCCCEEEECCCCCCCCCCEECC Q ss_conf 036778776543221100112122104899999998629601-047521--23322113540365315645456645116 Q gi|254780899|r 130 ERILQVTYESLYKGIAAVKLNANIEDIGKAIQRYAHSERYSV-VEVFCG--HGIGKSFHEKPEILHFYDPLYPSVGTFQE 206 (273) Q Consensus 130 ~~l~~~~~~a~~~~i~~ikpG~~~~ei~~a~~~~~~~~g~~~-~~~~~G--HgiGl~~hE~P~i~~~~~~~~~~~~~le~ 206 (273) ++.-+++.++++.+.+.+|||.+-.||++.+...+.++|-.. ...+.| .-+.....+. ++|. .+.+.+|++ T Consensus 14 r~A~~I~~~~~~~~~~~ikpG~te~el~~~~e~~~~~~Ga~~~~~g~~~fp~~v~~s~n~~--~~Hg----~p~~~~l~~ 87 (248) T PRK12897 14 HESGKLLASCHREIAKIMKPGITTKEINTFVEAYLEKHGATSEQKGYNGYPYAICASVNDE--MCHA----FPADVPLTE 87 (248) T ss_pred HHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCEEEEEECCCC--CCCC----CCCCCCCCC T ss_conf 9999999999999997776999899999999999998599666457788880689506860--6678----999851458 Q ss_pred CCEEEECCEEECC Q ss_conf 7378831114437 Q gi|254780899|r 207 GMVFTIEPMLNVG 219 (273) Q Consensus 207 GMV~aiEP~i~~~ 219 (273) |.++.|+-++... T Consensus 88 GD~V~iD~G~~~~ 100 (248) T PRK12897 88 GDIVTIDMVVNLN 100 (248) T ss_pred CCEEEEECCEEEC T ss_conf 9999995748899 No 54 >cd01091 CDC68-like Related to aminopeptidase P and aminopeptidase M, a member of this domain family is present in cell division control protein 68, a transcription factor. Probab=93.95 E-value=0.41 Score=27.69 Aligned_cols=98 Identities=11% Similarity=0.185 Sum_probs=58.2 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCC----CCCCCCC--CCCCCCCC Q ss_conf 99999999999999999975799889999999999999859830344332444553234----4642111--21222200 Q gi|254780899|r 22 NIRSACNVVARCLDSLTPIIKPGTTTEEIDDFVLKFGMENNAIPATLNYRGYKKSCCTS----INHVICH--GIPSNKQL 95 (273) Q Consensus 22 ~~R~A~~i~~~~~~~~~~~i~~G~se~el~~~~~~~~~~~g~~~~~~~~~~f~~~~~~s----~N~~~~H--~~p~d~~l 95 (273) .+++.-.+.-++.+++.+.+|||++-.++...+.+.+.+.+.. .. ..|++.+..+ ..+. +. ...++++| T Consensus 120 ~q~~~Y~~vl~aq~~~i~~~kpG~~~~~v~~~a~~~I~~~~~~--l~--~~f~h~~GhgiGlE~hE~-~~~l~~kn~~~l 194 (243) T cd01091 120 EQQKNYNFLLALQEEILKELKPGAKLSDVYQKTLDYIKKKKPE--LE--PNFTKNLGFGIGLEFRES-SLIINAKNDRKL 194 (243) T ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHH--HH--HHCCCCCCCEECEEECCC-CCCCCCCCCEEC T ss_conf 9999999999999999999726996899999999999987788--87--257667623225575246-654489974041 Q ss_pred CCCEEEEEEEEEE-E----------EEECCCCCCCCCCCC Q ss_conf 1340579978766-5----------300023111233344 Q gi|254780899|r 96 REGDIVNVDVTYV-V----------NGWHGDSSRMYPVGK 124 (273) Q Consensus 96 ~~GD~v~iD~g~~-~----------~GY~~D~~RT~~~G~ 124 (273) ++|-++.|.+|-. . +.|.--++=|+.+++ T Consensus 195 k~GMvftV~~Gf~nl~~~~~~~~~~k~yal~I~DtV~Vte 234 (243) T cd01091 195 KKGMVFNLSIGFSNLQNPEPKDKESKTYALLLSDTILVTE 234 (243) T ss_pred CCCCEEEEECCEEECCCCCCCCCCCCEEEEEEEEEEEECC T ss_conf 7997899965505556876667667736999964899858 No 55 >PRK10879 proline aminopeptidase P II; Provisional Probab=93.90 E-value=0.59 Score=26.67 Aligned_cols=80 Identities=11% Similarity=0.139 Sum_probs=51.6 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHCCCEE--CCCCCCCCCCCCCCCCCEEEECCCCCCCCCCEECC Q ss_conf 1036778776543221100112122104899999998629601--04752123322113540365315645456645116 Q gi|254780899|r 129 AERILQVTYESLYKGIAAVKLNANIEDIGKAIQRYAHSERYSV--VEVFCGHGIGKSFHEKPEILHFYDPLYPSVGTFQE 206 (273) Q Consensus 129 ~~~l~~~~~~a~~~~i~~ikpG~~~~ei~~a~~~~~~~~g~~~--~~~~~GHgiGl~~hE~P~i~~~~~~~~~~~~~le~ 206 (273) .++..+++.+|+.++++.+|||.+=.+|.+.++..+.+.|... +...++- |.+ -.++|+. .++..|++ T Consensus 182 mr~A~~It~~a~~~~m~~~~pG~~E~ei~a~~~~~~~~~G~~~~ay~~Iva~--G~n----a~~lHy~----~~~~~l~~ 251 (436) T PRK10879 182 LRRAGEISALAHTRAMEKCRPGMFEYQLEGEIHHEFNRHGARYPSYNTIVGS--GEN----GCILHYT----ENESEMRD 251 (436) T ss_pred HHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEECC--CCC----CCCCCCC----CCCCCCCC T ss_conf 9999999999999998536888738999999999999768985774544237--787----7566765----55442577 Q ss_pred CCEEEECCEEEC Q ss_conf 737883111443 Q gi|254780899|r 207 GMVFTIEPMLNV 218 (273) Q Consensus 207 GMV~aiEP~i~~ 218 (273) |..+-|+-|... T Consensus 252 gdlvliD~G~~~ 263 (436) T PRK10879 252 GDLVLIDAGCEY 263 (436) T ss_pred CCEEEEECCCEE T ss_conf 777998237116 No 56 >PTZ00053 methionine aminopeptidase II; Provisional Probab=92.66 E-value=0.47 Score=27.30 Aligned_cols=73 Identities=12% Similarity=0.097 Sum_probs=38.4 Q ss_pred HHHHHCCCCCCCCCHHHHHHHH----HHHHCCCEECCCCCCCCCCCCCCCCCEEEECCCCCCCCCCEECCCCEEEECCEE Q ss_conf 3221100112122104899999----998629601047521233221135403653156454566451167378831114 Q gi|254780899|r 141 YKGIAAVKLNANIEDIGKAIQR----YAHSERYSVVEVFCGHGIGKSFHEKPEILHFYDPLYPSVGTFQEGMVFTIEPML 216 (273) Q Consensus 141 ~~~i~~ikpG~~~~ei~~a~~~----~~~~~g~~~~~~~~GHgiGl~~hE~P~i~~~~~~~~~~~~~le~GMV~aiEP~i 216 (273) +.+-+.+|||++..||...++. .+...|+. ...+--+|+++- .+...+.|..++..+|+.|-|+-|+-|+ T Consensus 137 ~~~q~~ikpg~~~~~i~~~ie~~~r~l~~~~~l~---~GiAFPt~iSiN---~~AAHYTPn~gD~~vL~~gDVVKID~Gv 210 (435) T PTZ00053 137 RYAQSFIKPGISLIDMTDRIEKKTKELIEKDGLK---RGWAFPTGCSLN---HCAAHYTPNTGDKTVLTYDDVMKVDFGT 210 (435) T ss_pred HHHHHHCCCCCCHHHHHHHHHHHHHHHHCCCCCC---CCCCCCCCCCCC---CEEEECCCCCCCCEEECCCCEEEEEEEE T ss_conf 9999865799829999999999999985245612---450046645410---1013158898986174279989997411 Q ss_pred ECC Q ss_conf 437 Q gi|254780899|r 217 NVG 219 (273) Q Consensus 217 ~~~ 219 (273) .+. T Consensus 211 HVD 213 (435) T PTZ00053 211 HVN 213 (435) T ss_pred EEC T ss_conf 136 No 57 >TIGR00501 met_pdase_II methionine aminopeptidase, type II; InterPro: IPR002468 Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site . The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases . Peptidases are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry. Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. This group of metallopeptidases belong to MEROPS peptidase family M24 (clan MG), subfamily M24A. Methionine aminopeptidase (3.4.11.18 from EC) (MAP) is responsible for the removal of the amino-terminal (initiator) methionine from nascent eukaryotic cytosolic and cytoplasmic prokaryotic proteins if the penultimate amino acid is small and uncharged. All MAP studied to date are monomeric proteins that require cobalt ions for activity. Two subfamilies of MAP enzymes are known to exist , . While being evolutionary related, they only share a limited amount of sequence similarity mostly clustered around the residues shown, in the Escherichia coli MAP , to be involved in cobalt-binding. The first family consists of enzymes from prokaryotes as well as eukaryotic MAP-1, while the second group (IPR002467 from INTERPRO) is made up of archaeal MAP and eukaryotic MAP-2 and includes proteins which do not seem to be MAP, but that are clearly evolutionary related such as mouse proliferation-associated protein 1 and fission yeast curved DNA-binding protein. ; GO: 0004239 methionyl aminopeptidase activity, 0006508 proteolysis. Probab=92.09 E-value=0.43 Score=27.56 Aligned_cols=132 Identities=17% Similarity=0.165 Sum_probs=78.2 Q ss_pred CCEEECCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCC---CC Q ss_conf 978853999999999999999999999999757998899999999999998598303443324445532344642---11 Q gi|254780899|r 10 GSINIYTPEELENIRSACNVVARCLDSLTPIIKPGTTTEEIDDFVLKFGMENNAIPATLNYRGYKKSCCTSINHV---IC 86 (273) Q Consensus 10 ~~~~IKs~~EI~~~R~A~~i~~~~~~~~~~~i~~G~se~el~~~~~~~~~~~g~~~~~~~~~~f~~~~~~s~N~~---~~ 86 (273) .|++|-=.+.=+++.+--+-+.+|+..+++.+++|++-.||-..+++.+......-.... ++|.++.-.+-.+. .. T Consensus 112 ~A~TV~l~~~ddky~~L~~A~~dAl~~~i~~~~~~v~v~eiG~~I~e~i~syeveinGet-Yg~kP~~NL~GHs~~rY~~ 190 (327) T TIGR00501 112 TAITVDLGDQDDKYKELVKAAKDALYTAIKEIRAGVEVGEIGKAIEEVIESYEVEINGET-YGVKPISNLTGHSMARYLL 190 (327) T ss_pred EEEEEECCCCCHHHHHHHHHHHHHHHHHHHHCCCCCEEECCCHHHHHHHCCEEEEECCCC-CCCCCCCCCCCCCCCHHHH T ss_conf 478850688701589999999999887775006981260020776653221357765720-1433422273564001543 Q ss_pred CC---CC-----CCCCCCCCEEEEEEEEEEE-EEECCCCCCCCCCCCCCC-CCHH-HHHHHHHHHHH Q ss_conf 12---12-----2220013405799787665-300023111233344443-2103-67787765432 Q gi|254780899|r 87 HG---IP-----SNKQLREGDIVNVDVTYVV-NGWHGDSSRMYPVGKIKR-AAER-ILQVTYESLYK 142 (273) Q Consensus 87 H~---~p-----~d~~l~~GD~v~iD~g~~~-~GY~~D~~RT~~~G~~~~-~~~~-l~~~~~~a~~~ 142 (273) |. +| +...|..||+|-|+.++.= -||..|...+.++--..+ ++.+ .+..+++.+.. T Consensus 191 H~G~SiPn~~~~~~~~leeG~~~AIEpFAt~G~G~V~~~~~~~iY~~l~~DrpvrPtL~~Ar~lL~~ 257 (327) T TIGR00501 191 HAGLSIPNVKERDTTKLEEGDVVAIEPFATDGVGLVTDGGEVEIYKFLAEDRPVRPTLKSARKLLKK 257 (327) T ss_pred CCCCCCEEEECCCCCEEECCCEEEECCCCCCCEEEEECCCCCHHHHHHCCCCCCCCCHHHHHHHHHH T ss_conf 5895000342798745102578884366768745651288610122105688967335899998767 No 58 >TIGR01857 FGAM-synthase phosphoribosylformylglycinamidine synthase; InterPro: IPR010141 This entry represents a single-molecule form of phosphoribosylformylglycinamidine synthase, also called FGAM synthase, an enzyme of purine de novo biosynthesis, which represent a second clade of the enzymes found in Clostridia, Bifidobacteria and Streptococcus species. This enzyme performs the fourth step in IMP biosynthesis (the precursor of all purines) from PRPP.. Probab=81.73 E-value=4 Score=21.09 Aligned_cols=60 Identities=12% Similarity=0.198 Sum_probs=39.6 Q ss_pred EEEEECCCCCC--CCCC---------CCCCCCCHHHHHHHHHHHHHHHHC---CCCCCCCCHHHHHHHHHHHHCC Q ss_conf 65300023111--2333---------444432103677877654322110---0112122104899999998629 Q gi|254780899|r 108 VVNGWHGDSSR--MYPV---------GKIKRAAERILQVTYESLYKGIAA---VKLNANIEDIGKAIQRYAHSER 168 (273) Q Consensus 108 ~~~GY~~D~~R--T~~~---------G~~~~~~~~l~~~~~~a~~~~i~~---ikpG~~~~ei~~a~~~~~~~~g 168 (273) .+|-||||.|| ||=- ++.....+..|+-.+...+..-.. -|| +++-|+.-.+.+++++.| T Consensus 218 VlDTYWSDHCRHTTFET~l~~v~~~~skf~~~~~~~y~~Y~~~r~~l~~~nln~k~-v~LMDmATI~~kylkk~G 291 (1279) T TIGR01857 218 VLDTYWSDHCRHTTFETELKEVTIEDSKFSKQLKKAYEDYLAMREELGRENLNEKP-VSLMDMATIVAKYLKKNG 291 (1279) T ss_pred EECCCCCCCCCCCCCCCEEEEEECCCCCCCHHHHHHHHHHHHHHHHHCHHHHCCCC-CCCHHHHHHHHHHHHHCC T ss_conf 43266578851575443440476486422137899898999988762221110478-620035778878765268 No 59 >TIGR02176 pyruv_ox_red pyruvate:ferredoxin (flavodoxin) oxidoreductase; InterPro: IPR011895 The oxidative decarboxylation of pyruvate to acetyl-CoA, a central step in energy metabolism, can occur by two different mechanisms . In mitochondria and aerobic bacteria this reaction is catalysed by the multienzyme complex pyruvate dehydrogenase using NAD as electron acceptor. In anaerobic organisms, however, this reaction is reversibly catalysed by a single enzyme using either ferrodoxin or flavodoxin as the electron acceptor. Pyruvate:ferrodoxin/flavodoxin reductases (PFORs) in this entry occur in both obligately and facultatively anaerobic bacteria and also some eukaryotic microorganisms. These proteins are single-chain enzymes containing a thiamin pyrophosphate cofactor for the cleavage of carbon-carbon bonds next to a carbonyl group, and iron-sulphur clusters for electron transfer. The Desulfovibrio africanus enzyme is currently the only PFOR whose three dimensional structure is known , . It is a homodimer where each subunit contains one thiamin pyrophosphate cofactor and two ferrodoxin-like 4Fe-S clusters and an atypical 4Fe-S cluster. Each monomer is composed of seven domains - domains I, II and VI make intersubunit contacts, while domains III, IV and V are located at the suface of the dimer, and domain VII forms a long arm extending over the other subunit. The cofactor is bound at the interface of domains I and VI and is proximal to the atypical 4Fe-S bound by domain VI, while the ferrodoxin-like 4Fe-S clusters are bound by domain V. Comparison of this enzyme with the multi-chain PFORs shows a correspondance between the domains in this enzyme and the subunits of the multi-chain enzymes.; GO: 0005506 iron ion binding, 0016903 oxidoreductase activity acting on the aldehyde or oxo group of donors, 0006118 electron transport. Probab=69.49 E-value=4.6 Score=20.71 Aligned_cols=49 Identities=20% Similarity=0.289 Sum_probs=33.3 Q ss_pred ECCCCEEEECCEEECCCCCCEECCCCCEEE--ECCCCEEEEEEEEEEEC--CCCCEECCCCCCCCCC Q ss_conf 116737883111443777625646886688--33787167844218984--7985765267988676 Q gi|254780899|r 204 FQEGMVFTIEPMLNVGGSSAKVLSDGWTAV--TRDRSLSAQYEHTIGIT--KAGCEIFTLSPNNLGQ 266 (273) Q Consensus 204 le~GMV~aiEP~i~~~~~~~~~~~d~w~~~--~~~g~~~~~~Ed~vlVT--~~G~E~LT~~p~~l~~ 266 (273) +.|.||.+| + |+-..- .....|+|-|||+|=-| +-|.|.|.-.|+++.+ T Consensus 373 ~~PamvkaV-----F---------DNL~~ers~~K~~FTvGI~DDVT~TSL~~dn~~~~~~~~~~~~ 425 (1194) T TIGR02176 373 FTPAMVKAV-----F---------DNLEAERSAPKNHFTVGIDDDVTKTSLPVDNETLDLTPKSTIQ 425 (1194) T ss_pred CCCCCEEEE-----E---------ECCCCCCCCCCCCCCEEEECCCCCCCHHCCCCCCCCCCCCCEE T ss_conf 573301322-----1---------0110223678776304444177765312178454348786168 No 60 >TIGR01959 nuoF_fam NADH-quinone oxidoreductase, F subunit; InterPro: IPR011537 This entry describes the F chain of complexes that resemble NADH-quinone oxidoreductases. The electron acceptor is a quinone, ubiquinone, in mitochondria and most bacteria, including Escherichia coli, where the recommended gene symbol is nuoF. This family does not have any members in chloroplast or cyanobacteria, where the quinone may be plastoquinone and NADH may be replaced by NADPH, nor in Methanosarcina, where NADH is replaced by F420H2. ; GO: 0010181 FMN binding, 0016651 oxidoreductase activity acting on NADH or NADPH, 0051287 NAD binding, 0051539 4 iron 4 sulfur cluster binding, 0006118 electron transport. Probab=65.52 E-value=3.4 Score=21.61 Aligned_cols=130 Identities=15% Similarity=0.172 Sum_probs=81.1 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCC---CCCC--CCCCCC Q ss_conf 39999999999999999999999997579988999999999999985983034433244455323---4464--211121 Q gi|254780899|r 15 YTPEELENIRSACNVVARCLDSLTPIIKPGTTTEEIDDFVLKFGMENNAIPATLNYRGYKKSCCT---SINH--VICHGI 89 (273) Q Consensus 15 Ks~~EI~~~R~A~~i~~~~~~~~~~~i~~G~se~el~~~~~~~~~~~g~~~~~~~~~~f~~~~~~---s~N~--~~~H~~ 89 (273) +.++.|+-+.+-.+ -+.++++++.+..-++-.||.+.+.+...+. .|--|||+-.-. ..++ .-+||. T Consensus 10 ~~~~~L~~Y~~~gr--yrgY~~l~KaL~~~~~~ddii~~VK~SGLRG------RGGAGFpTGlKWSFmPK~~sg~~pkYL 81 (420) T TIGR01959 10 PESWTLEEYEKRGR--YRGYDALRKALEEKMSPDDIIEEVKDSGLRG------RGGAGFPTGLKWSFMPKDDSGPKPKYL 81 (420) T ss_pred CCCCCHHHHHHCCC--HHHHHHHHHHHHCCCCCHHHHHHHHHCCCCC------CCCCCCCCCCCCCCCCCCCCCCCCEEE T ss_conf 88510788963487--0247999999852678125889987437987------888876434422335678788885368 Q ss_pred --------C---CCCCCCCCEE-EEEEEEEEEEEECCCCCCCCCC--CCCCCCCHHHHHHHHHHHHHHH---HCCCCCCC Q ss_conf --------2---2220013405-7997876653000231112333--4444321036778776543221---10011212 Q gi|254780899|r 90 --------P---SNKQLREGDI-VNVDVTYVVNGWHGDSSRMYPV--GKIKRAAERILQVTYESLYKGI---AAVKLNAN 152 (273) Q Consensus 90 --------p---~d~~l~~GD~-v~iD~g~~~~GY~~D~~RT~~~--G~~~~~~~~l~~~~~~a~~~~i---~~ikpG~~ 152 (273) | .||.|-..++ .+|+ |..+-.|..-..+-|+. |+.-.+.++|-.|+-||..+++ ..+.-|.. T Consensus 82 vCNADE~EPGTcKDR~lm~~~PH~LIE-Gm~IaayA~ga~~gYiYiRGEf~~~~~~l~~Ai~EA~~~G~LGkNIlGsGFD 160 (420) T TIGR01959 82 VCNADEGEPGTCKDRDLMEFDPHQLIE-GMIIAAYAIGAKRGYIYIRGEFIKEAENLEAAIAEAYAAGLLGKNILGSGFD 160 (420) T ss_pred EEECCCCCCCCHHHHHHHHCCCCCEEH-HHHHHHHHHCCCCCEEEEECHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCC T ss_conf 872777977713148998628870305-7888778752531047861301678899999999999759820033788620 Q ss_pred C Q ss_conf 2 Q gi|254780899|r 153 I 153 (273) Q Consensus 153 ~ 153 (273) + T Consensus 161 f 161 (420) T TIGR01959 161 F 161 (420) T ss_pred E T ss_conf 5 No 61 >TIGR02939 RpoE_Sigma70 RNA polymerase sigma factor RpoE; InterPro: IPR014286 The bacterial core RNA polymerase complex, which consists of five subunits, is sufficient for transcription elongation and termination but is unable to initiate transcription. Transcription initiation from promoter elements requires a sixth, dissociable subunit called a sigma factor, which reversibly associates with the core RNA polymerase complex to form a holoenzyme . RNA polymerase recruits alternative sigma factors as a means of switching on specific regulons. Most bacteria express a multiplicity of sigma factors. Two of these factors, sigma-70 (gene rpoD), generally known as the major or primary sigma factor, and sigma-54 (gene rpoN or ntrA) direct the transcription of a wide variety of genes. The other sigma factors, known as alternative sigma factors, are required for the transcription of specific subsets of genes. With regard to sequence similarity, sigma factors can be grouped into two classes, the sigma-54 and sigma-70 families. Sequence alignments of the sigma70 family members reveal four conserved regions that can be further divided into subregions eg. sub-region 2.2, which may be involved in the binding of the sigma factor to the core RNA polymerase; and sub-region 4.2, which seems to harbor a DNA-binding 'helix-turn-helix' motif involved in binding the conserved -35 region of promoters recognized by the major sigma factors , . This entry represents the clade of sigma factors called RpoE. These proteins are variously annotated as sigma-24, sigma-E factor, sigma-H factor, fecI-like sigma factor or alternative sigma factor AlgU.; GO: 0003677 DNA binding, 0003700 transcription factor activity, 0016987 sigma factor activity, 0006352 transcription initiation, 0006355 regulation of transcription DNA-dependent. Probab=62.74 E-value=6.2 Score=19.85 Aligned_cols=110 Identities=12% Similarity=0.183 Sum_probs=55.0 Q ss_pred HHHHHHHCCCCCCHHHHHHHHHHHHHH-CCCCCCCCCCCCCCC-----CCCCCCCCCCCCC--CCCCCCCCCCEEEEEEE Q ss_conf 999999757998899999999999998-598303443324445-----5323446421112--12222001340579978 Q gi|254780899|r 34 LDSLTPIIKPGTTTEEIDDFVLKFGME-NNAIPATLNYRGYKK-----SCCTSINHVICHG--IPSNKQLREGDIVNVDV 105 (273) Q Consensus 34 ~~~~~~~i~~G~se~el~~~~~~~~~~-~g~~~~~~~~~~f~~-----~~~~s~N~~~~H~--~p~d~~l~~GD~v~iD~ 105 (273) +.-+...| .+..|+.+.+++.+.+ .=|.++|.|--.|-+ .|-+-+|+-|.|+ .|.+.-+..-|-=.+|. T Consensus 35 ~~Lv~Ryv---~~~~e~~DVAQe~FvkAYRAl~~FRGdsaFYTWLyRIavNTAKNyLVaq~Rr~p~~dg~~~edAE~F~~ 111 (192) T TIGR02939 35 VSLVGRYV---RDSEEVEDVAQEAFVKAYRALASFRGDSAFYTWLYRIAVNTAKNYLVAQGRRPPTSDGVEAEDAEQFEE 111 (192) T ss_pred HHHHHHHH---CCHHCCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHCCCC T ss_conf 99998761---481003401477899999751235551367899999999887776642479887445676043312002 Q ss_pred EEEEEEECCCCCCCCCCCCCCCC----CHHHHHHHHHHHHHHHHCCCCCCCCCHHHH Q ss_conf 76653000231112333444432----103677877654322110011212210489 Q gi|254780899|r 106 TYVVNGWHGDSSRMYPVGKIKRA----AERILQVTYESLYKGIAAVKLNANIEDIGK 158 (273) Q Consensus 106 g~~~~GY~~D~~RT~~~G~~~~~----~~~l~~~~~~a~~~~i~~ikpG~~~~ei~~ 158 (273) | |--|-+ + +|+ .++|-+.+.+|+++.=+-+|--++..|+.- T Consensus 112 a--------~~Lr~~---~-tPE~~ll~~el~~~v~~a~~aLPE~Lr~Ai~LREleG 156 (192) T TIGR02939 112 A--------DALRDI---D-TPERLLLSRELEQTVNKALEALPEDLRTAITLRELEG 156 (192) T ss_pred C--------CCCCCC---C-CHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHCCC T ss_conf 6--------766556---6-6167999999999999998617267874544651268 No 62 >KOG2611 consensus Probab=62.29 E-value=5.8 Score=20.02 Aligned_cols=69 Identities=17% Similarity=0.236 Sum_probs=38.6 Q ss_pred CCEEEEEEEEEEEEEECCCCCCCCCCCCCC-----CCCHHHHHHHHHHH-----HHHHHCCCCCCCCCHHHHHHHHHHH Q ss_conf 340579978766530002311123334444-----32103677877654-----3221100112122104899999998 Q gi|254780899|r 97 EGDIVNVDVTYVVNGWHGDSSRMYPVGKIK-----RAAERILQVTYESL-----YKGIAAVKLNANIEDIGKAIQRYAH 165 (273) Q Consensus 97 ~GD~v~iD~g~~~~GY~~D~~RT~~~G~~~-----~~~~~l~~~~~~a~-----~~~i~~ikpG~~~~ei~~a~~~~~~ 165 (273) .+|+-.+-+...|.|||+|...-|++|--+ +-.-=+.+.++++- -..++.+|||.--.++-.|.+.++. T Consensus 583 snD~taLvvS~aYkG~WsDLsELWFLGMQt~~G~lPLvPWLs~~AL~S~W~e~ivk~L~kVk~~tl~~nv~sAYe~~L~ 661 (698) T KOG2611 583 SNDPTALVVSIAYKGYWSDLSELWFLGMQTMCGVLPLVPWLSEFALESGWAEGIVKTLKKVKIGTLPANVKSAYEDFLS 661 (698) T ss_pred CCCCCEEEEEHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHCHHHHHCCCHHHHHHHHHCCCCCCCCHHHHHHHHHHHH T ss_conf 5897127951544213651899999768987071325354048877514578999997327888748789999999999 No 63 >TIGR00838 argH argininosuccinate lyase; InterPro: IPR009049 This entry represents argininosuccinate lyase, but may include examples of avian delta crystallins, in which argininosuccinate lyase activity may or may not be present and the biological role is to provide the optically clear cellular protein of the eye lens. ; GO: 0004056 argininosuccinate lyase activity, 0042450 arginine biosynthetic process via ornithine. Probab=52.02 E-value=16 Score=17.14 Aligned_cols=12 Identities=17% Similarity=0.041 Sum_probs=5.1 Q ss_pred CCCCCEECCCCC Q ss_conf 777625646886 Q gi|254780899|r 219 GGSSAKVLSDGW 230 (273) Q Consensus 219 ~~~~~~~~~d~w 230 (273) .+++...++|.| T Consensus 261 ~EFgfieL~D~~ 272 (469) T TIGR00838 261 QEFGFIELPDEF 272 (469) T ss_pred CCCCEEEECCCE T ss_conf 687606745732 No 64 >TIGR01828 pyru_phos_dikin pyruvate, phosphate dikinase; InterPro: IPR010121 Pyruvate phosphate dikinase (PPDK, or pyruvate orthophosphate dikinase) is found in plants, bacteria and archaea. The amino acid sequence identity between bacterial and plant enzymes is high, and they are similar in sequence to other PEP-utilizing enzymes. PPDK catalyses the reversible conversion of ATP and pyruvate to AMP and PEP (phosphoenolpyruvate). In bacteria such as Clostridium symbiosum, PPDK uses Mg2+ and NH4+ ions as cofactors . The enzyme has three domains: the N- and C-terminal domains each have an active site centre that catalyses a different step in the reaction, and the middle domain has a carrier histidine residue that moves between the two active centres. In plants, PPDK is localised predominantly in chloroplast stroma where it catalyses the rate-limiting step in the C4 photosynthetic pathway, namely the synthesis of PEP, which acts as the primary CO2 acceptor in C4 photosynthesis . PPDK activity in C4 plants is strictly regulated by light, its activity decreasing in darkness. This response is regulated by phosphorylation and dephosphorylation of the enzyme using ADP; such regulation is not seen in the bacterial form of the enzyme. PPDK is also found in C3 plants, but it is not known to have a photosynthetic role .; GO: 0050242 pyruvate phosphate dikinase activity. Probab=49.44 E-value=20 Score=16.43 Aligned_cols=49 Identities=16% Similarity=0.247 Sum_probs=28.4 Q ss_pred CCCCCCCEEEEEE--EEEEEEEEC----CCCCCCCCCCCCCCCCHHHHHHHHHHHHHH Q ss_conf 2200134057997--876653000----231112333444432103677877654322 Q gi|254780899|r 92 NKQLREGDIVNVD--VTYVVNGWH----GDSSRMYPVGKIKRAAERILQVTYESLYKG 143 (273) Q Consensus 92 d~~l~~GD~v~iD--~g~~~~GY~----~D~~RT~~~G~~~~~~~~l~~~~~~a~~~~ 143 (273) ++.|++||+|.|| .|..|.|=. .+++.-|--+ |...+|..++-..+..+ T Consensus 512 g~~~~EGD~isiDGsTG~iYlGe~~~~~p~~~~~f~G~---Paedelvravd~im~WA 566 (920) T TIGR01828 512 GRVIKEGDIISIDGSTGEIYLGEVPLIEPEVSGDFEGE---PAEDELVRAVDTIMSWA 566 (920) T ss_pred CCEEECCEEEEEECCCCCEECCCCCCCCCCCCCCCCCC---CCHHHHHHHHHHHHHHH T ss_conf 94774343799708606235375442166778766787---50578999999999986 No 65 >COG4786 FlgG Flagellar basal body rod protein [Cell motility and secretion] Probab=43.85 E-value=22 Score=16.22 Aligned_cols=32 Identities=31% Similarity=0.396 Sum_probs=17.9 Q ss_pred CCCCCEEEECCCCEEEEEEEEEEECCCCCEECC Q ss_conf 468866883378716784421898479857652 Q gi|254780899|r 226 LSDGWTAVTRDRSLSAQYEHTIGITKAGCEIFT 258 (273) Q Consensus 226 ~~d~w~~~~~~g~~~~~~Ed~vlVT~~G~E~LT 258 (273) .+||-..+|+||.|.+- ++-.|||.+|+-+|. T Consensus 107 ~~dG~~~YTR~G~F~~d-~~G~LVT~~G~~vl~ 138 (265) T COG4786 107 TPDGTIAYTRDGSFTVD-EEGQLVTSNGYPVLD 138 (265) T ss_pred CCCCCEEEEECCCEEEC-CCCCEEECCCCCCCC T ss_conf 49987888527744688-998788079977457 No 66 >pfam11149 DUF2924 Protein of unknown function (DUF2924). This bacterial family of proteins has no known function. Probab=43.04 E-value=25 Score=15.79 Aligned_cols=39 Identities=18% Similarity=0.136 Sum_probs=26.8 Q ss_pred CCCEECCCCEEEECCEEECCCCCCEECCCCCEEEECCCCEEEEEEEEEEECCCCCEECCCCCCCC Q ss_conf 66451167378831114437776256468866883378716784421898479857652679886 Q gi|254780899|r 200 SVGTFQEGMVFTIEPMLNVGGSSAKVLSDGWTAVTRDRSLSAQYEHTIGITKAGCEIFTLSPNNL 264 (273) Q Consensus 200 ~~~~le~GMV~aiEP~i~~~~~~~~~~~d~w~~~~~~g~~~~~~Ed~vlVT~~G~E~LT~~p~~l 264 (273) +...+.||.+|.=| | + | .+|.|.|++||+|.=-+.=++| T Consensus 77 ~~~~~~pGT~LvRe---w----~-----------------G--~~h~V~V~~dGf~~~G~~y~SL 115 (136) T pfam11149 77 RDRLLVPGTRLVRE---W----K-----------------G--VEHRVTVLADGFEYEGRTYKSL 115 (136) T ss_pred CCCCCCCCCEEEEE---E----C-----------------C--EEEEEEEECCCEEECCCCCCCH T ss_conf 65678998779998---8----9-----------------9--7899999579667899234669 No 67 >TIGR02823 oxido_YhdH putative quinone oxidoreductase, YhdH/YhfP family; InterPro: IPR014188 This entry represents a subfamily of the alcohol dehydrogenase, a superfamily in which some members are zinc-binding medium-chain alcohol dehydrogenases while others are quinone oxidoreductases with no bound zinc. This entry includes YhdH from Escherichia coli and YhfP from Bacillus subtilis both of which bind NADPH or NAD, but not zinc. Both proteins have been studied crystallographically for insight into function. . Probab=41.18 E-value=27 Score=15.60 Aligned_cols=82 Identities=17% Similarity=0.157 Sum_probs=50.7 Q ss_pred CCCCCCCCEEEEE---EEEEEEEEECCCCCCC---CCCCCC---CCCCHHHHH-HHHHHHHHHHHCCCCC-CCC------ Q ss_conf 2220013405799---7876653000231112---333444---432103677-8776543221100112-122------ Q gi|254780899|r 91 SNKQLREGDIVNV---DVTYVVNGWHGDSSRM---YPVGKI---KRAAERILQ-VTYESLYKGIAAVKLN-ANI------ 153 (273) Q Consensus 91 ~d~~l~~GD~v~i---D~g~~~~GY~~D~~RT---~~~G~~---~~~~~~l~~-~~~~a~~~~i~~ikpG-~~~------ 153 (273) +|-.|++||-|++ |+|-.+.|=+|+.+|- |++--| |.++-.++- |-..|.-+..+.-+-| .+= T Consensus 74 ~dp~F~~GD~VivTGyglG~~H~GGysqyaRVp~dWlVpLP~glsl~eAM~~GTAGfTAaL~V~aLe~~Gl~~P~~GpVl 153 (330) T TIGR02823 74 EDPRFRPGDEVIVTGYGLGVSHDGGYSQYARVPADWLVPLPEGLSLREAMALGTAGFTAALSVMALERNGLLTPEDGPVL 153 (330) T ss_pred CCCCCCCCCEEEEEEECCCCCCCCCCEEEEEECHHHEECCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCEE T ss_conf 88775788718997402456515762235673323101588988988999860077899999999997268888888788 Q ss_pred -----CHHHHHHHHHHHHCCCEEC Q ss_conf -----1048999999986296010 Q gi|254780899|r 154 -----EDIGKAIQRYAHSERYSVV 172 (273) Q Consensus 154 -----~ei~~a~~~~~~~~g~~~~ 172 (273) +-|...+=+.+.+.||.++ T Consensus 154 VTGAtGGVGS~Av~~L~~lGY~V~ 177 (330) T TIGR02823 154 VTGATGGVGSLAVAILSKLGYEVV 177 (330) T ss_pred EECCCCCHHHHHHHHHHHCCCEEE T ss_conf 706778778999999983797699 No 68 >PRK01490 tig trigger factor; Provisional Probab=39.00 E-value=29 Score=15.38 Aligned_cols=21 Identities=24% Similarity=0.495 Sum_probs=9.0 Q ss_pred CCCCCCCCCEEEEEEEEEEEE Q ss_conf 222200134057997876653 Q gi|254780899|r 90 PSNKQLREGDIVNVDVTYVVN 110 (273) Q Consensus 90 p~d~~l~~GD~v~iD~g~~~~ 110 (273) |.++..+.||.|.+|+-+..+ T Consensus 153 ~~e~~~~~gD~v~id~~~~~d 173 (435) T PRK01490 153 PVERPAENGDRVTIDFVGSID 173 (435) T ss_pred CCCHHHCCCCEEEEEEEEEEC T ss_conf 654021269879999898766 No 69 >PRK09854 cmtB putative mannitol phosphotransferase subunit EIIA; Provisional Probab=38.76 E-value=29 Score=15.36 Aligned_cols=82 Identities=9% Similarity=0.050 Sum_probs=41.4 Q ss_pred CCCCCCCCCCCCCCCCCCCCCE-EEEEEEEEEEEEECCCCCCCCCC-CCCC-CCCH-HHHHHHHHHHH-HHHHCCCCCCC Q ss_conf 2344642111212222001340-57997876653000231112333-4444-3210-36778776543-22110011212 Q gi|254780899|r 78 CTSINHVICHGIPSNKQLREGD-IVNVDVTYVVNGWHGDSSRMYPV-GKIK-RAAE-RILQVTYESLY-KGIAAVKLNAN 152 (273) Q Consensus 78 ~~s~N~~~~H~~p~d~~l~~GD-~v~iD~g~~~~GY~~D~~RT~~~-G~~~-~~~~-~l~~~~~~a~~-~~i~~ikpG~~ 152 (273) -.+.+=++||..|.+.+++.|= ++.+.-+..+.+- -|-.+-+++ .-.+ +.+. -+.+.+.=..+ ..++.++--.+ T Consensus 58 vi~~giAlpHa~p~~gv~k~g~sl~~lk~pV~F~~~-~~~V~~i~~laa~d~~~Hl~~L~~L~~l~~~~~~~~~l~~a~~ 136 (147) T PRK09854 58 ILAPGVAMPHARPECGALKTGMSLTLLEQGVYFPGN-DEPIKLLIGLSAADADSHIGAIQALSELLCEEEILEQLLTASS 136 (147) T ss_pred EEECCEEECCCCCCCCCCCCEEEEEEECCCEECCCC-CCCEEEEEEEEECCHHHHHHHHHHHHHHHCCHHHHHHHHHCCC T ss_conf 983885603368667960011699995787361799-9646999999968879999999999999809899999997899 Q ss_pred CCHHHHHH Q ss_conf 21048999 Q gi|254780899|r 153 IEDIGKAI 160 (273) Q Consensus 153 ~~ei~~a~ 160 (273) -.||.+.+ T Consensus 137 ~~ei~~ii 144 (147) T PRK09854 137 EKQLADII 144 (147) T ss_pred HHHHHHHH T ss_conf 99999998 No 70 >TIGR00438 rrmJ ribosomal RNA large subunit methyltransferase J; InterPro: IPR004512 The ribosomal RNA large subunit methyltransferase J 2.1.1 from EC methylates the 23S rRNA. It specifically methylates the uridine in position 2552 of 23s rRNA in the 50S particle using S-adenosyl-L-methionine as a substrate. It was previously known as cell division protein ftsJ.; GO: 0016436 rRNA (uridine) methyltransferase activity, 0006364 rRNA processing. Probab=37.10 E-value=31 Score=15.19 Aligned_cols=42 Identities=19% Similarity=0.140 Sum_probs=26.9 Q ss_pred CCEEEEEEEEEEEEEECC-CCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCC Q ss_conf 340579978766530002-3111233344443210367787765432211001121 Q gi|254780899|r 97 EGDIVNVDVTYVVNGWHG-DSSRMYPVGKIKRAAERILQVTYESLYKGIAAVKLNA 151 (273) Q Consensus 97 ~GD~v~iD~g~~~~GY~~-D~~RT~~~G~~~~~~~~l~~~~~~a~~~~i~~ikpG~ 151 (273) +=|+|.-|..+.+.|+++ |+.|.+- .+..|++-+-++++++- T Consensus 102 k~DVV~SDaaP~~SG~~~iDh~Rs~d-------------Lv~~aL~ia~~vL~~~G 144 (192) T TIGR00438 102 KVDVVMSDAAPNISGIWDIDHLRSID-------------LVELALDIAKEVLKPKG 144 (192) T ss_pred EEEEEEECCCCCCCCCCCCCHHHHHH-------------HHHHHHHHHHHHHHCCC T ss_conf 37789852688878987543443799-------------99999999999861589 No 71 >pfam02287 Dehydratase_SU Dehydratase small subunit. This family contains the small subunit of the trimeric diol dehydratases and glycerol dehydratases. These enzymes are produced by some enterobacteria in response to growth substances. Probab=36.55 E-value=31 Score=15.13 Aligned_cols=43 Identities=16% Similarity=0.220 Sum_probs=27.1 Q ss_pred HHHHHHHHHH---HHHHHHHHHHCCCCCCHH-HHHHHHHHHHHHCCC Q ss_conf 9999999999---999999999757998899-999999999998598 Q gi|254780899|r 21 ENIRSACNVV---ARCLDSLTPIIKPGTTTE-EIDDFVLKFGMENNA 63 (273) Q Consensus 21 ~~~R~A~~i~---~~~~~~~~~~i~~G~se~-el~~~~~~~~~~~g~ 63 (273) ..+|+|++++ +.-+-++.+.+||+.|+. ||.+.+.+...+.++ T Consensus 69 ~Nf~RAAELt~vPD~rvLeiYnaLRP~RStk~ELl~iA~eLe~~Y~A 115 (136) T pfam02287 69 RNFRRAAELTAVPDDRVLEIYNALRPYRSTKQELLAIADELETKYHA 115 (136) T ss_pred HHHHHHHHCCCCCHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHCC T ss_conf 98999886106986999999996288834099999999999998696 No 72 >COG1894 NuoF NADH:ubiquinone oxidoreductase, NADH-binding (51 kD) subunit [Energy production and conversion] Probab=33.86 E-value=7.7 Score=19.22 Aligned_cols=109 Identities=16% Similarity=0.149 Sum_probs=64.0 Q ss_pred HHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCC---CCCCCCCC-----------CCCCCC Q ss_conf 9999999999975799889999999999999859830344332444553234---46421112-----------122220 Q gi|254780899|r 29 VVARCLDSLTPIIKPGTTTEEIDDFVLKFGMENNAIPATLNYRGYKKSCCTS---INHVICHG-----------IPSNKQ 94 (273) Q Consensus 29 i~~~~~~~~~~~i~~G~se~el~~~~~~~~~~~g~~~~~~~~~~f~~~~~~s---~N~~~~H~-----------~p~d~~ 94 (273) ++...++.+.+.++ .|+-.+|.+.+.+...+. .+-.|||+-.-.| .+..--|| .-.||. T Consensus 29 ~a~ggy~~l~kal~-~m~p~~II~~Vk~SGLRG------RGGAGFpTGlKWsfm~k~~~~~kylvcNADE~ePGTfkDR~ 101 (424) T COG1894 29 LARGGYEGLRKALT-EMGPDEIIEEVKESGLRG------RGGAGFPTGLKWSFMPKATSDQKYLVCNADEGEPGTFKDRL 101 (424) T ss_pred HHCCCHHHHHHHHH-HCCHHHHHHHHHHCCCCC------CCCCCCCCCCEEEEEECCCCCCEEEEEECCCCCCCCCCCHH T ss_conf 85474999999998-659899999998717777------78777877752110124788964899848989975221378 Q ss_pred CCCCEEEEEEEEEEEEEECCCCCCCCCC--CCCCCCCHHHHHHHHHHHHHHH Q ss_conf 0134057997876653000231112333--4444321036778776543221 Q gi|254780899|r 95 LREGDIVNVDVTYVVNGWHGDSSRMYPV--GKIKRAAERILQVTYESLYKGI 144 (273) Q Consensus 95 l~~GD~v~iD~g~~~~GY~~D~~RT~~~--G~~~~~~~~l~~~~~~a~~~~i 144 (273) |-++|+-.+==|..+.+|..-.++.|+. |+...+.+++.++..+|..+++ T Consensus 102 ime~dPH~LIEGm~IA~yA~gA~~~YiYiRgEy~~a~~~l~~AI~eA~~~Gl 153 (424) T COG1894 102 IMEGDPHLLIEGMIIAAYAVGATKGYIYIRGEYPEAIERLQKAIEEAYAAGL 153 (424) T ss_pred HHCCCCHHHHHHHHHHHHHHCCCEEEEEEECCCHHHHHHHHHHHHHHHHHCC T ss_conf 7705947888889999998445336899733309999999999999998085 No 73 >pfam08007 Cupin_4 Cupin superfamily protein. This family contains many hypothetical proteins that belong to the cupin superfamily. Probab=33.70 E-value=28 Score=15.46 Aligned_cols=49 Identities=24% Similarity=0.316 Sum_probs=37.2 Q ss_pred CCCCCCCEEEEEEEEEEEEEECCCCCCCCCCCCCCCCCHHHHHHHHHHH Q ss_conf 2200134057997876653000231112333444432103677877654 Q gi|254780899|r 92 NKQLREGDIVNVDVTYVVNGWHGDSSRMYPVGKIKRAAERILQVTYESL 140 (273) Q Consensus 92 d~~l~~GD~v~iD~g~~~~GY~~D~~RT~~~G~~~~~~~~l~~~~~~a~ 140 (273) +-+|++||++.|=-|.-.+|..-+-+-|+.+|=-.+...++.....+.+ T Consensus 178 d~vLepGD~LYiPpG~~H~g~a~~~s~~~SvGfrap~~~el~~~~~d~l 226 (320) T pfam08007 178 DFVLEPGDMLYLPRGFIHQGVALDDSLHYSVGFRAPTAAELLSSFLDYL 226 (320) T ss_pred EEEECCCCEEEECCCCCCCCCCCCCCEEEEECEECCCHHHHHHHHHHHH T ss_conf 8997688879858998644302888358881212798899999999999 No 74 >TIGR02312 HpaH 2-oxo-hepta-3-ene-1,7-dioic acid hydratase; InterPro: IPR012690 This entry represents the enzyme which hydrates the double bond of 2-oxo-hepta-3-ene-1,7-dioic acid to form 4-hydroxy-2-oxo-heptane-1,7-dioic acid in the catabolism of 4-hydroxyphenylacetic acid. The gene for this enzyme is generally found adjacent to other genes of this pathway.. Probab=32.90 E-value=20 Score=16.50 Aligned_cols=23 Identities=17% Similarity=0.192 Sum_probs=10.9 Q ss_pred CCCCCCHH---HHHHHHHHHHHHCCC Q ss_conf 57998899---999999999998598 Q gi|254780899|r 41 IKPGTTTE---EIDDFVLKFGMENNA 63 (273) Q Consensus 41 i~~G~se~---el~~~~~~~~~~~g~ 63 (273) =.|.||-. +|....-..-...|. T Consensus 30 ~yP~~tieDAY~iq~~WV~~K~A~Gr 55 (271) T TIGR02312 30 RYPEITIEDAYRIQRAWVALKLAEGR 55 (271) T ss_pred HCCCCCHHHHHHHHHHHHHHHHHCCC T ss_conf 35404223578999999998872488 No 75 >pfam03881 Fructosamin_kin Fructosamine kinase. This family includes eukaryotic fructosamine-3-kinase enzymes. The family also includes bacterial members that have not been characterized but probably have a similar or identical function. Probab=31.92 E-value=37 Score=14.64 Aligned_cols=32 Identities=22% Similarity=0.301 Sum_probs=16.9 Q ss_pred CEEEECCCCCCCCCCEECCCCEEEECCEEECCC Q ss_conf 036531564545664511673788311144377 Q gi|254780899|r 188 PEILHFYDPLYPSVGTFQEGMVFTIEPMLNVGG 220 (273) Q Consensus 188 P~i~~~~~~~~~~~~~le~GMV~aiEP~i~~~~ 220 (273) |.+.|. +...++...-..|...-|+|.+|.+. T Consensus 187 PsLlHG-DLW~GN~~~~~~G~p~liDPA~yyG~ 218 (287) T pfam03881 187 PSLLHG-DLWSGNAAFTADGEPVIFDPACYYGD 218 (287) T ss_pred CCEEEC-CCCCCCEEECCCCCEEEECCHHCCCC T ss_conf 300225-53565255847998789754311577 No 76 >pfam09657 Cas_Csx8 CRISPR-associated protein Csx8 (Cas_Csx8). Clusters of short DNA repeats with nonhomologous spacers, which are found at regular intervals in the genomes of phylogenetically distinct prokaryotic species, comprise a family with recognisable features. This family is known as CRISPR (short for Clustered, Regularly Interspaced Short Palindromic Repeats). A number of protein families appear only in association with these repeats and are designated Cas (CRISPR-Associated) proteins. This entry describes proteins of unknown function which are encoded in the midst of a cas gene operon. Probab=30.40 E-value=39 Score=14.48 Aligned_cols=42 Identities=24% Similarity=0.332 Sum_probs=30.5 Q ss_pred EECCHHHHHHHHHHHHHHHHHHHHHHHHCC-CCCCHHHHHHHHH Q ss_conf 853999999999999999999999999757-9988999999999 Q gi|254780899|r 13 NIYTPEELENIRSACNVVARCLDSLTPIIK-PGTTTEEIDDFVL 55 (273) Q Consensus 13 ~IKs~~EI~~~R~A~~i~~~~~~~~~~~i~-~G~se~el~~~~~ 55 (273) .=.|+++|..+.+-.. +..+|+.+...++ -|-...|+-..+. T Consensus 81 ~efs~E~ik~I~ekm~-aNtilKKvF~KikfdgeNkeEiL~lI~ 123 (441) T pfam09657 81 KEFTEEIIKSINEKLS-ANTVMKKVFGKIKFDGTNKDEILDLID 123 (441) T ss_pred CCCCHHHHHHHHHHHH-HCHHHHHHHHHCCCCCCCHHHHHHHHH T ss_conf 4258999999998730-146999997404667889999999998 No 77 >KOG1511 consensus Probab=30.21 E-value=40 Score=14.46 Aligned_cols=23 Identities=9% Similarity=0.092 Sum_probs=9.2 Q ss_pred CCCCCHHHHHHHHHHHHHHHHCC Q ss_conf 44321036778776543221100 Q gi|254780899|r 125 IKRAAERILQVTYESLYKGIAAV 147 (273) Q Consensus 125 ~~~~~~~l~~~~~~a~~~~i~~i 147 (273) .++-.+.+|+++-+.-..+...+ T Consensus 257 fPevi~~i~~aid~is~ea~~il 279 (397) T KOG1511 257 FPEVIKAIFDAIDEISLEAVWIL 279 (397) T ss_pred HHHHHHHHHHHHHHHHHHHHHHH T ss_conf 35889999999999999999997 No 78 >PRK04750 ubiB putative ubiquinone biosynthesis protein UbiB; Reviewed Probab=29.44 E-value=41 Score=14.37 Aligned_cols=63 Identities=16% Similarity=0.213 Sum_probs=26.6 Q ss_pred CEEEEEEEEEEEEEECCCCCCCCCCCC----CCCCCHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHC Q ss_conf 405799787665300023111233344----4432103677877654322110011212210489999999862 Q gi|254780899|r 98 GDIVNVDVTYVVNGWHGDSSRMYPVGK----IKRAAERILQVTYESLYKGIAAVKLNANIEDIGKAIQRYAHSE 167 (273) Q Consensus 98 GD~v~iD~g~~~~GY~~D~~RT~~~G~----~~~~~~~l~~~~~~a~~~~i~~ikpG~~~~ei~~a~~~~~~~~ 167 (273) |.++.+|+|-. |..++-.|-....- .+...+++.++.. + ...+.+.....++..+++.+++.+ T Consensus 301 g~~~~lDfGmv--G~L~~~~r~~l~~l~~a~~~~D~~~va~~~~---~--~g~v~~~~~~~~l~~~i~~~~~~~ 367 (540) T PRK04750 301 PRYIALDFGIV--GSLNKEDKRYLAENFLAFFNRDYRRVAELHV---E--SGWVPPDTRVEELEFAIRAVCEPI 367 (540) T ss_pred CCEEEEEEEEE--CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH---H--HCCCCCCCCHHHHHHHHHHHHHHH T ss_conf 74788740132--1389999999999999999405999999999---9--589998888799999999999986 No 79 >pfam07305 DUF1454 Protein of unknown function (DUF1454). This family consists of several Enterobacterial sequences of around 200 residues in length which are often known as YiiQ proteins. The function of this family is unknown. Probab=29.17 E-value=41 Score=14.34 Aligned_cols=74 Identities=15% Similarity=0.165 Sum_probs=47.4 Q ss_pred CCCCHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHCCCEECCCCCCCCCCCCCCCCCEEEECCCCCCCCCCEEC Q ss_conf 43210367787765432211001121221048999999986296010475212332211354036531564545664511 Q gi|254780899|r 126 KRAAERILQVTYESLYKGIAAVKLNANIEDIGKAIQRYAHSERYSVVEVFCGHGIGKSFHEKPEILHFYDPLYPSVGTFQ 205 (273) Q Consensus 126 ~~~~~~l~~~~~~a~~~~i~~ikpG~~~~ei~~a~~~~~~~~g~~~~~~~~GHgiGl~~hE~P~i~~~~~~~~~~~~~le 205 (273) .++++..-+.+.+-+.+.+...-|-.+.....+.+++.+.+..- ..|..|-+|--- +|.. + .=+ T Consensus 114 ~~e~k~~r~~a~~Ym~Alir~F~P~ls~eQs~~~l~~lL~~gk~---~~~y~~~~GAiR----Yvva------d---~ge 177 (200) T pfam07305 114 GPEQKAAKAKALEYMAALIRHFTPTLSQAQSQQKLQNLLTAGKG---KRYYSHTEGAIR----YIVA------D---SGE 177 (200) T ss_pred CHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCC---CCCEECCCCCEE----EEEE------C---CCC T ss_conf 85789999999999999999708757989999999999984899---865011467169----9996------1---898 Q ss_pred CCCEEEECCE Q ss_conf 6737883111 Q gi|254780899|r 206 EGMVFTIEPM 215 (273) Q Consensus 206 ~GMV~aiEP~ 215 (273) .|++|+|||. T Consensus 178 kglTFAVEPI 187 (200) T pfam07305 178 KGLTFAIEPI 187 (200) T ss_pred CEEEEEEEEE T ss_conf 5158885126 No 80 >KOG1527 consensus Probab=28.81 E-value=13 Score=17.65 Aligned_cols=75 Identities=17% Similarity=0.269 Sum_probs=49.0 Q ss_pred CCCCCCCEEEEEEEEE--EEEEECCCCCCCCCCCCC----CCCCHHHHHHHHHHHHHHHHCCC--CCCCC-----CHHHH Q ss_conf 2200134057997876--653000231112333444----43210367787765432211001--12122-----10489 Q gi|254780899|r 92 NKQLREGDIVNVDVTY--VVNGWHGDSSRMYPVGKI----KRAAERILQVTYESLYKGIAAVK--LNANI-----EDIGK 158 (273) Q Consensus 92 d~~l~~GD~v~iD~g~--~~~GY~~D~~RT~~~G~~----~~~~~~l~~~~~~a~~~~i~~ik--pG~~~-----~ei~~ 158 (273) -+.|+.-|+++-|--. .+--....-+|+|+.|+. +..|+++++...++++++...+| -|-+. +|= T Consensus 276 ~~~I~sAD~~LaDkLVp~avL~Lipp~t~lfia~KfpGna~raQ~Elh~~~l~~l~~G~~VVRLKqGDPyifGRGGEE-- 353 (506) T KOG1527 276 VRVIQSADLLLADKLVPNAVLELIPPDTRLFIAGKFPGNASRAQEELHELLLNFLEAGATVVRLKQGDPYIFGRGGEE-- 353 (506) T ss_pred HHHHHHCCEEHHHHCCCHHHHHHCCCCCCEEEEECCCCCHHHHHHHHHHHHHHHHHCCCEEEEECCCCCEEECCCCHH-- T ss_conf 998864230014210628788524899833884037883467899999999988757887999438895466678343-- Q ss_pred HHHHHHHHCCCE Q ss_conf 999999862960 Q gi|254780899|r 159 AIQRYAHSERYS 170 (273) Q Consensus 159 a~~~~~~~~g~~ 170 (273) -.++..+||- T Consensus 354 --~~Ff~qhGy~ 363 (506) T KOG1527 354 --MDFFQQHGYR 363 (506) T ss_pred --HHHHHHCCCE T ss_conf --2148767812 No 81 >PRK08087 L-fuculose phosphate aldolase; Provisional Probab=28.49 E-value=26 Score=15.64 Aligned_cols=17 Identities=29% Similarity=0.249 Sum_probs=12.3 Q ss_pred CCCCCCCEEEEEEEEEE Q ss_conf 22001340579978766 Q gi|254780899|r 92 NKQLREGDIVNVDVTYV 108 (273) Q Consensus 92 d~~l~~GD~v~iD~g~~ 108 (273) -..+++.|+|.+|.-+. T Consensus 47 ~~~l~~~dlv~vd~~G~ 63 (215) T PRK08087 47 YEKLTESHIVFIDGDGK 63 (215) T ss_pred HHHCCHHHEEEECCCCC T ss_conf 35588879264567787 No 82 >TIGR03290 CoB_CoM_SS_C CoB--CoM heterodisulfide reductase, subunit C. The last step in methanogenesis leaves two coenzymes of methanogenesis, CoM and CoB, linked by a disulfide bond. Members of this protein family are the C subunit of the enzyme that reduces the heterodisulfide to CoB-SH and CoM-SH. Similar enzyme complex subunits are found in various other species, but likely act on a different substrate. Probab=28.43 E-value=31 Score=15.21 Aligned_cols=39 Identities=21% Similarity=0.255 Sum_probs=25.9 Q ss_pred HCCCCCCCCCHHHHHHHHHHHHCCCEECCC--------CCCCCCCCC Q ss_conf 100112122104899999998629601047--------521233221 Q gi|254780899|r 145 AAVKLNANIEDIGKAIQRYAHSERYSVVEV--------FCGHGIGKS 183 (273) Q Consensus 145 ~~ikpG~~~~ei~~a~~~~~~~~g~~~~~~--------~~GHgiGl~ 183 (273) +..=-|+...++-.+++..+.+.|+-...| .+||+|+++ T Consensus 56 ~~CP~gV~~~~vI~~lR~~A~~~G~~~~~~~~~~~~i~~~GhaVp~n 102 (144) T TIGR03290 56 ERCPRDVKITDIIKALRNLAAKKGFMAKAHRKTASFVLKTGHAVPIN 102 (144) T ss_pred HHCCCCCCHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCCCCC T ss_conf 13489998899999999999985998799999999999968778865 No 83 >COG3589 Uncharacterized conserved protein [Function unknown] Probab=27.90 E-value=43 Score=14.20 Aligned_cols=62 Identities=11% Similarity=0.087 Sum_probs=31.3 Q ss_pred CCCCCCCCEEEEEEEEEEEEEECCCCCCCCCC-CCCCCC---CHHHHHHHHHHHHHHHHCCCCCCCCCH Q ss_conf 22200134057997876653000231112333-444432---103677877654322110011212210 Q gi|254780899|r 91 SNKQLREGDIVNVDVTYVVNGWHGDSSRMYPV-GKIKRA---AERILQVTYESLYKGIAAVKLNANIED 155 (273) Q Consensus 91 ~d~~l~~GD~v~iD~g~~~~GY~~D~~RT~~~-G~~~~~---~~~l~~~~~~a~~~~i~~ikpG~~~~e 155 (273) .++.++.-+ +.+++.+.+=-+..-|--.- |-|+-+ ....+..+..-...+++.+--|...-+ T Consensus 162 kn~~fk~~~---i~t~AFis~~~~~g~r~~~~~GlpTlE~hR~~~p~~qak~l~~~giD~VlIgd~~~s 227 (360) T COG3589 162 KNEIFKEYN---IKTAAFISSDGAEGPRGPLYEGLPTLEAHRYVEPFVQAKDLFKTGIDDVLIGDQFPS 227 (360) T ss_pred HHHHHHHCC---CCEEEEEECCCCCCCCCCCCCCCCCHHHHCCCCHHHHHHHHHHHCCCEEEECCCCCC T ss_conf 889988559---745899845876776662004866168743788777899999838875996388899 No 84 >TIGR02669 SpoIID_LytB SpoIID/LytB domain; InterPro: IPR013486 This entry describes a region which is found, typically in two or three proteins per genome, in Cyanobacteria and Firmicutes, and sporadically in other genomes. One example is SpoIID from Bacillus subtilis. Another, also from B. subtilis, is LytB which contains this region at the C-terminus. LytB is encoded immediately upstream of an amidase, the autolysin LytC, and both these proteins show considerable homology in their N-terminal regions. Genes encoding proteins in this entry do not occur in conserved neighbourhoods, and many, such as SpoIID are monocistronic. One modeling study has suggested that SpoIID may bind DNA, but the function of these proteins is so far unknown.; GO: 0030435 sporulation. Probab=27.70 E-value=19 Score=16.58 Aligned_cols=22 Identities=23% Similarity=0.268 Sum_probs=14.9 Q ss_pred EECCHHHHHHHHHHHHHHHHHHH Q ss_conf 85399999999999999999999 Q gi|254780899|r 13 NIYTPEELENIRSACNVVARCLD 35 (273) Q Consensus 13 ~IKs~~EI~~~R~A~~i~~~~~~ 35 (273) +|...+++|.|+ |..|+++.+. T Consensus 19 Empa~wP~EALK-AQAvaARTYa 40 (323) T TIGR02669 19 EMPASWPMEALK-AQAVAARTYA 40 (323) T ss_pred CCCCCHHHHHHH-HHHHHHHHHH T ss_conf 179851899999-9899999999 No 85 >cd01821 Rhamnogalacturan_acetylesterase_like Rhamnogalacturan_acetylesterase_like subgroup of SGNH-hydrolases. Rhamnogalacturan acetylesterase removes acetyl esters from rhamnogalacturonan substrates, and renders them susceptible to degradation by rhamnogalacturonases. Rhamnogalacturonans are highly branched regions in pectic polysaccharides, consisting of repeating -(1,2)-L-Rha-(1,4)-D-GalUA disaccharide units, with many rhamnose residues substituted by neutral oligosaccharides such as arabinans, galactans and arabinogalactans. Extracellular enzymes participating in the degradation of plant cell wall polymers, such as Rhamnogalacturonan acetylesterase, would typically be found in saprophytic and plant pathogenic fungi and bacteria. Probab=27.17 E-value=31 Score=15.20 Aligned_cols=76 Identities=18% Similarity=0.207 Sum_probs=36.5 Q ss_pred CCCCCCEEEEEEEEEEEEEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCC----------CCCC----CCCHHHH Q ss_conf 2001340579978766530002311123334444321036778776543221100----------1121----2210489 Q gi|254780899|r 93 KQLREGDIVNVDVTYVVNGWHGDSSRMYPVGKIKRAAERILQVTYESLYKGIAAV----------KLNA----NIEDIGK 158 (273) Q Consensus 93 ~~l~~GD~v~iD~g~~~~GY~~D~~RT~~~G~~~~~~~~l~~~~~~a~~~~i~~i----------kpG~----~~~ei~~ 158 (273) +.+++||.|+|.+|++=.++..+ .++..+.+...-.+++.+ ++..++...+ ..|. +..+-.. T Consensus 61 ~~~~~GD~v~IqFGhND~~~~~~-~~~~~~~ty~~nL~~~i~---~~r~~ga~pIL~Tp~~Rr~~~~~~~~~~~~~~Y~~ 136 (198) T cd01821 61 KLIKPGDYVLIQFGHNDQKPKDP-EYTEPYTTYKEYLRRYIA---EARAKGATPILVTPVTRRTFDEGGKVEDTLGDYPA 136 (198) T ss_pred HHCCCCCEEEEEECCCCCCCCCC-CCCCCHHHHHHHHHHHHH---HHHHCCCEEEEECCCCCCCCCCCCEECCCCHHHHH T ss_conf 85488988999835666788898-876868899999999999---99975998999758877644899822377267999 Q ss_pred HHHHHHHHCCCEEC Q ss_conf 99999986296010 Q gi|254780899|r 159 AIQRYAHSERYSVV 172 (273) Q Consensus 159 a~~~~~~~~g~~~~ 172 (273) +++++.++.+-..+ T Consensus 137 a~r~vA~e~~Vp~I 150 (198) T cd01821 137 AMRELAAEEGVPLI 150 (198) T ss_pred HHHHHHHHCCCCEE T ss_conf 99999998299898 No 86 >PRK10372 L-ascorbate-specific enzyme IIA component of PTS; Provisional Probab=26.88 E-value=45 Score=14.08 Aligned_cols=85 Identities=13% Similarity=0.082 Sum_probs=47.0 Q ss_pred CCCCCCCCCCCCCCCCCCCCCE-EEEEEEEEEEEE---ECCCCCCCCCCCCCCCCCHH--HHHHHHHHHH-HHHHCCCCC Q ss_conf 2344642111212222001340-579978766530---00231112333444432103--6778776543-221100112 Q gi|254780899|r 78 CTSINHVICHGIPSNKQLREGD-IVNVDVTYVVNG---WHGDSSRMYPVGKIKRAAER--ILQVTYESLY-KGIAAVKLN 150 (273) Q Consensus 78 ~~s~N~~~~H~~p~d~~l~~GD-~v~iD~g~~~~G---Y~~D~~RT~~~G~~~~~~~~--l~~~~~~a~~-~~i~~ikpG 150 (273) ..+.+=++||+.|.+.+++.|= ++.+.-+..+++ --.++--++..-+. ..+.+ +.+.+.=.++ ..++.++-- T Consensus 59 vi~~giAiPHa~~~~gV~k~g~sll~Lk~pV~F~~~~~~pV~li~~laa~d~-~~Hl~~al~~L~~ll~d~~~~~~L~~a 137 (154) T PRK10372 59 VIAPGLAMPHGRPEEGVKKTGFALVTLKKPLEFNHEDNDPVDILITMAAVDA-NTHQEVGIMQIVNLFEDEENFDRLRAC 137 (154) T ss_pred EEECCEEECCCCCCCCCCCCEEEEEEECCCEEECCCCCCCEEEEEEEEECCC-CCHHHHHHHHHHHHHCCHHHHHHHHHC T ss_conf 9717877034773359741358999958987728989981389999996387-134999999999997499999999978 Q ss_pred CCCCHHHHHHHHH Q ss_conf 1221048999999 Q gi|254780899|r 151 ANIEDIGKAIQRY 163 (273) Q Consensus 151 ~~~~ei~~a~~~~ 163 (273) .+..||...+.++ T Consensus 138 ~s~~Ei~~ii~k~ 150 (154) T PRK10372 138 RTEQEVLDLIDRT 150 (154) T ss_pred CCHHHHHHHHHHH T ss_conf 8999999999986 No 87 >TIGR01980 sufB FeS assembly protein SufB; InterPro: IPR010231 Iron-sulphur (FeS) clusters are important cofactors for numerous proteins involved in electron transfer, in redox and non-redox catalysis, in gene regulation, and as sensors of oxygen and iron. These functions depend on the various FeS cluster prosthetic groups, the most common being [2Fe-2S] and [4Fe-4S] . FeS cluster assembly is a complex process involving the mobilisation of Fe and S atoms from storage sources, their assembly into [Fe-S] form, their transport to specific cellular locations, and their transfer to recipient apoproteins. So far, three FeS assembly machineries have been identified, which are capable of synthesising all types of [Fe-S] clusters: ISC (iron-sulphur cluster), SUF (sulphur assimilation), and NIF (nitrogen fixation) systems. The ISC system is conserved in eubacteria and eukaryotes (mitochondria), and has broad specificity, targeting general FeS proteins , . It is encoded by the isc operon (iscRSUA-hscBA-fdx-iscX). IscS is a cysteine desulphurase, which obtains S from cysteine (converting it to alanine) and serves as a S donor for FeS cluster assembly. IscU and IscA act as scaffolds to accept S and Fe atoms, assembling clusters and transfering them to recipient apoproteins. HscA is a molecular chaperone and HscB is a co-chaperone. Fdx is a [2Fe-2S]-type ferredoxin. IscR is a transcription factor that regulates expression of the isc operon. IscX (also known as YfhJ) appears to interact with IscS and may function as an Fe donor during cluster assembly . The SUF system is an alternative pathway to the ISC system that operates under iron starvation and oxidative stress. It is found in eubacteria, archaea and eukaryotes (plastids). The SUF system is encoded by the suf operon (sufABCDSE), and the six encoded proteins are arranged into two complexes (SufSE and SufBCD) and one protein (SufA). SufS is a pyridoxal-phosphate (PLP) protein displaying cysteine desulphurase activity. SufE acts as a scaffold protein that accepts S from SufS and donates it to SufA . SufC is an ATPase with an unorthodox ATP-binding cassette (ABC)-like component. No specific functions have been assigned to SufB and SufD. SufA is homologous to IscA , acting as a scaffold protein in which Fe and S atoms are assembled into [FeS] cluster forms, which can then easily be transferred to apoproteins targets. In the NIF system, NifS and NifU are required for the formation of metalloclusters of nitrogenase in Azotobacter vinelandii, and other organisms, as well as in the maturation of other FeS proteins. Nitrogenase catalyses the fixation of nitrogen. It contains a complex cluster, the FeMo cofactor, which contains molybdenum, Fe and S. NifS is a cysteine desulphurase. NifU binds one Fe atom at its N-terminal, assembling an FeS cluster that is transferred to nitrogenase apoproteins . Nif proteins involved in the formation of FeS clusters can also be found in organisms that do not fix nitrogen . This entry represents SufB, which is part of the SUF system and forms a complex with SufBCD.; GO: 0005515 protein binding, 0016226 iron-sulfur cluster assembly. Probab=26.70 E-value=34 Score=14.88 Aligned_cols=16 Identities=31% Similarity=0.601 Sum_probs=14.0 Q ss_pred EEEEEEEEECCCCCEE Q ss_conf 7844218984798576 Q gi|254780899|r 241 AQYEHTIGITKAGCEI 256 (273) Q Consensus 241 ~~~Ed~vlVT~~G~E~ 256 (273) -|||||++|-+.|.++ T Consensus 216 GQFErTLiIadeGa~V 231 (469) T TIGR01980 216 GQFERTLIIADEGASV 231 (469) T ss_pred CCCEEEEEEEEECCEE T ss_conf 8632588999517748 No 88 >COG2850 Uncharacterized conserved protein [Function unknown] Probab=26.17 E-value=26 Score=15.63 Aligned_cols=46 Identities=22% Similarity=0.279 Sum_probs=36.5 Q ss_pred CCCCCCCCEEEEEEEEEEEEEECCCCCCCCCCCCCCCCCHHHHHHH Q ss_conf 2220013405799787665300023111233344443210367787 Q gi|254780899|r 91 SNKQLREGDIVNVDVTYVVNGWHGDSSRMYPVGKIKRAAERILQVT 136 (273) Q Consensus 91 ~d~~l~~GD~v~iD~g~~~~GY~~D~~RT~~~G~~~~~~~~l~~~~ 136 (273) -+-++.+||++.|=-+--..||.-|-|-||.+|--++-..++..-- T Consensus 179 ~d~vlepGDiLYiPp~~~H~gvae~dc~tySvG~r~Pn~~ell~~~ 224 (383) T COG2850 179 IDEVLEPGDILYIPPGFPHYGVAEDDCMTYSVGFRAPNTRELLSGF 224 (383) T ss_pred HHHHHCCCCEEECCCCCCCCCCCCCCCCCEEEECCCCCHHHHHHHH T ss_conf 4654078865423898776785544556346613487689998888 No 89 >TIGR02870 spore_II_D stage II sporulation protein D; InterPro: IPR014225 This entry contains the stage II sporulation protein D (SpoIID), which is a protein involved in the endospore formation program. SpoIID is one of the three genes (spoIID, spoIIM and spoIIP, , , ), under the control of sigma E, that have been shown to be essential for the engulfment of the forespore by the mother cell. Their products are involved in degradation of the septal peptidoglycan - mutations in spoIID, spoIIM or spoIIP block sporulation at morphological stage II, prior to the engulfment stage. These three genes are absolutely conserved (sometimes even duplicated) in all endospore formers . Probab=24.83 E-value=23 Score=16.00 Aligned_cols=25 Identities=28% Similarity=0.454 Sum_probs=13.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCCC Q ss_conf 999999999999999999999975799 Q gi|254780899|r 18 EELENIRSACNVVARCLDSLTPIIKPG 44 (273) Q Consensus 18 ~EI~~~R~A~~i~~~~~~~~~~~i~~G 44 (273) -|+|.|+ |..|+++.+. +.+++.+| T Consensus 100 F~~EALK-AQa~aARTf~-v~~~~~~~ 124 (358) T TIGR02870 100 FEIEALK-AQAVAARTFA-VKKMVQFG 124 (358) T ss_pred CCHHHHH-HHHHHHHHHH-HHHHHCCC T ss_conf 7355678-9999999999-99974378 No 90 >PRK07226 fructose-bisphosphate aldolase; Provisional Probab=24.83 E-value=49 Score=13.83 Aligned_cols=31 Identities=19% Similarity=0.109 Sum_probs=14.8 Q ss_pred CCCCCCCCCCCC--CCHHHHHHHHHHHHHHHHC Q ss_conf 111233344443--2103677877654322110 Q gi|254780899|r 116 SSRMYPVGKIKR--AAERILQVTYESLYKGIAA 146 (273) Q Consensus 116 ~~RT~~~G~~~~--~~~~l~~~~~~a~~~~i~~ 146 (273) +.-|+.+|.+.+ ..+++-+++.+|.+.++-. T Consensus 110 V~~~v~~Gs~~E~~~l~~l~~l~~ea~~~GlPl 142 (266) T PRK07226 110 VSVHVNVGSETEAEMLEDLGRIARECEEWGMPL 142 (266) T ss_pred EEEEEECCCCCHHHHHHHHHHHHHHHHHHCCCE T ss_conf 999854799837999999999999999859956 No 91 >TIGR00337 PyrG CTP synthase; InterPro: IPR004468 CTP synthase is involved in pyrimidine ribonucleotide/ribonucleoside metabolism. The enzyme catalyzes the reaction L-glutamine + H2O + UTP + ATP = CTP + phosphate + ADP + L-glutamate. The enzyme exists as a dimer of identical chains that aggregates as a tetramer. This gene has been found circa 500 bp 5 upstream of enolase in both beta (Nitrosomonas europaea) and gamma (Escherichia coli) subdivisions of Proteobacterium .; GO: 0003883 CTP synthase activity, 0006221 pyrimidine nucleotide biosynthetic process. Probab=24.17 E-value=50 Score=13.75 Aligned_cols=21 Identities=19% Similarity=0.378 Sum_probs=10.3 Q ss_pred CCCCCCCHHH-----HHHHHHHHHCC Q ss_conf 1121221048-----99999998629 Q gi|254780899|r 148 KLNANIEDIG-----KAIQRYAHSER 168 (273) Q Consensus 148 kpG~~~~ei~-----~a~~~~~~~~g 168 (273) =-|-|.+||- .|+|++-.+-| T Consensus 149 EiGGTVGDIEs~PFLEAiRQ~~~e~G 174 (571) T TIGR00337 149 EIGGTVGDIESLPFLEAIRQLKKEVG 174 (571) T ss_pred EECCCCCCCCCHHHHHHHHHHHHHHC T ss_conf 83770000036258999999998738 No 92 >PRK10144 formate-dependent nitrite reductase complex subunit NrfF; Provisional Probab=22.94 E-value=53 Score=13.60 Aligned_cols=30 Identities=13% Similarity=0.232 Sum_probs=22.9 Q ss_pred HHHHHHHHHHHHHHCCCCCCHHHHHHHHHH Q ss_conf 999999999999975799889999999999 Q gi|254780899|r 27 CNVVARCLDSLTPIIKPGTTTEEIDDFVLK 56 (273) Q Consensus 27 ~~i~~~~~~~~~~~i~~G~se~el~~~~~~ 56 (273) +.++.++-..+.+.+..|.|..||-+++.+ T Consensus 56 s~iA~dmR~~Ir~~i~~G~sd~eI~~~l~~ 85 (126) T PRK10144 56 APVAVSMRHQVYSMVAEGKSEVEIIGWMTE 85 (126) T ss_pred HHHHHHHHHHHHHHHHCCCCHHHHHHHHHH T ss_conf 799999999999999869999999999998 No 93 >TIGR03541 reg_near_HchA LuxR family transcriptional regulatory, chaperone HchA-associated. Members of this protein family belong to the LuxR transcriptional regulator family, and contain both autoinducer binding (pfam03472) and transcriptional regulator (pfam00196) domains. Members, however, occur only in a few members of the Gammaproteobacteria that have the chaperone/aminopeptidase HchA, and are always encoded by the adjacent gene. Probab=22.19 E-value=55 Score=13.50 Aligned_cols=30 Identities=27% Similarity=0.222 Sum_probs=20.0 Q ss_pred HHHHHHHCCCCCCHHHHHHHHHHHHHHCCC Q ss_conf 999999757998899999999999998598 Q gi|254780899|r 34 LDSLTPIIKPGTTTEEIDDFVLKFGMENNA 63 (273) Q Consensus 34 ~~~~~~~i~~G~se~el~~~~~~~~~~~g~ 63 (273) +-.+.+.+.---|..+|...+.+.....|- T Consensus 4 lf~~l~~l~~a~t~~~l~~~l~~~~~~~Gf 33 (232) T TIGR03541 4 LFQAVRQIEQASTLEAIQDAVREFAQNLGY 33 (232) T ss_pred HHHHHHHHHHCCCHHHHHHHHHHHHHHCCC T ss_conf 999999998489999999999999998499 No 94 >PRK10188 DNA-binding transcriptional activator SdiA; Provisional Probab=22.17 E-value=55 Score=13.50 Aligned_cols=27 Identities=7% Similarity=-0.052 Sum_probs=17.2 Q ss_pred HHHHHCCCCCCHHHHHHHHHHHHHHCC Q ss_conf 999975799889999999999999859 Q gi|254780899|r 36 SLTPIIKPGTTTEEIDDFVLKFGMENN 62 (273) Q Consensus 36 ~~~~~i~~G~se~el~~~~~~~~~~~g 62 (273) .+...+..-.++.++-..+.+.....| T Consensus 12 ~~l~~~~~a~~~~~~~~~l~~~~~~lG 38 (240) T PRK10188 12 TMLLRFQEMAAAEEVYHELQLQTQQLE 38 (240) T ss_pred HHHHHHHHCCCHHHHHHHHHHHHHHCC T ss_conf 999999847999999999999999759 No 95 >PRK08333 L-fuculose phosphate aldolase; Provisional Probab=21.24 E-value=36 Score=14.72 Aligned_cols=15 Identities=7% Similarity=0.217 Sum_probs=11.4 Q ss_pred CCCCCCEEEEEEEEE Q ss_conf 200134057997876 Q gi|254780899|r 93 KQLREGDIVNVDVTY 107 (273) Q Consensus 93 ~~l~~GD~v~iD~g~ 107 (273) ..+++.|++.+|+-+ T Consensus 46 ~~l~~~div~vd~dG 60 (189) T PRK08333 46 DELTPSQVAVIKLNG 60 (189) T ss_pred HHCCHHHEEEEECCC T ss_conf 559768968993868 No 96 >PRK09220 hypothetical protein; Provisional Probab=20.99 E-value=39 Score=14.51 Aligned_cols=37 Identities=24% Similarity=0.252 Sum_probs=20.9 Q ss_pred CCCCCCCEEEEEEEEEE-EEEECCCCCCCCCCCCCCCCCHHHHHHHHHH Q ss_conf 22001340579978766-5300023111233344443210367787765 Q gi|254780899|r 92 NKQLREGDIVNVDVTYV-VNGWHGDSSRMYPVGKIKRAAERILQVTYES 139 (273) Q Consensus 92 d~~l~~GD~v~iD~g~~-~~GY~~D~~RT~~~G~~~~~~~~l~~~~~~a 139 (273) -..|++.|+|.+|+-+. ++| . .+|+.+. .++..+++. T Consensus 50 ~~~l~~~Div~vd~~G~~~~~-----~-----~kPSsE~-~lH~~IYr~ 87 (205) T PRK09220 50 KGRLTEEDILLVDLAGNAVGS-----G-----RKPSAET-LLHTQLYRR 87 (205) T ss_pred CCCCCHHHEEEECCCCCCCCC-----C-----CCCCCCH-HHHHHHHHH T ss_conf 100988898998178878589-----9-----8888644-999999983 No 97 >TIGR03147 cyt_nit_nrfF cytochrome c nitrite reductase, accessory protein NrfF. Probab=20.60 E-value=59 Score=13.29 Aligned_cols=30 Identities=13% Similarity=0.300 Sum_probs=22.5 Q ss_pred HHHHHHHHHHHHHHCCCCCCHHHHHHHHHH Q ss_conf 999999999999975799889999999999 Q gi|254780899|r 27 CNVVARCLDSLTPIIKPGTTTEEIDDFVLK 56 (273) Q Consensus 27 ~~i~~~~~~~~~~~i~~G~se~el~~~~~~ 56 (273) +.++.++-..+.+.+..|.|..||-+++.+ T Consensus 56 a~~A~dmR~~I~~~i~~G~sd~eI~~~l~~ 85 (126) T TIGR03147 56 SPIAYDLRHEVYSMVNEGKSNQQIIDFMTA 85 (126) T ss_pred HHHHHHHHHHHHHHHHCCCCHHHHHHHHHH T ss_conf 199999999999999859998999999999 No 98 >PRK06754 mtnB methylthioribulose-1-phosphate dehydratase; Reviewed Probab=20.33 E-value=52 Score=13.64 Aligned_cols=56 Identities=20% Similarity=0.152 Sum_probs=27.1 Q ss_pred HHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCC--CC--CCCCCCCCCCCCCEEEEEEEEEE Q ss_conf 9999999999999859830344332444553234464--21--11212222001340579978766 Q gi|254780899|r 47 TEEIDDFVLKFGMENNAIPATLNYRGYKKSCCTSINH--VI--CHGIPSNKQLREGDIVNVDVTYV 108 (273) Q Consensus 47 e~el~~~~~~~~~~~g~~~~~~~~~~f~~~~~~s~N~--~~--~H~~p~d~~l~~GD~v~iD~g~~ 108 (273) +.||+...+. +.++|..++..|-.+.. .+.+. .+ |-|. .-..|++.|+|.+|.-+. T Consensus 8 ~eel~~v~r~-l~~rG~~~gt~GNiS~R----~~~~~~~~lITPSG~-~k~~l~~eD~v~Vd~~G~ 67 (208) T PRK06754 8 WYELSEIKKE-LATRDWFPATSGNLSIK----VSDEPLTFLVTASGK-DKRKTTPEDFLLVDHDGV 67 (208) T ss_pred HHHHHHHHHH-HHHCCCCCCCCCEEEEE----ECCCCEEEEEECCCC-CCCCCCHHHEEEECCCCC T ss_conf 9999999999-99887933488888999----379970899917988-503288889888889987 Done!