Query gi|254780392|ref|YP_003064805.1| leucyl aminopeptidase [Candidatus Liberibacter asiaticus str. psy62] Match_columns 494 No_of_seqs 253 out of 1989 Neff 7.4 Searched_HMMs 39220 Date Sun May 29 15:56:11 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780392.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 PRK00913 leucyl aminopeptidase 100.0 0 0 1186.0 41.9 478 11-494 6-491 (491) 2 cd00433 Peptidase_M17 Cytosol 100.0 0 0 1136.7 40.4 461 15-488 1-468 (468) 3 COG0260 PepB Leucyl aminopepti 100.0 0 0 1137.2 37.7 447 37-493 36-485 (485) 4 pfam00883 Peptidase_M17 Cytoso 100.0 0 0 940.8 20.0 307 178-486 1-312 (312) 5 PRK05015 aminopeptidase B; Pro 100.0 0 0 921.2 30.4 379 85-490 35-423 (424) 6 KOG2597 consensus 100.0 0 0 865.4 28.6 468 12-493 22-512 (513) 7 pfam02789 Peptidase_M17_N Cyto 99.7 3.2E-16 8.1E-21 134.5 11.5 118 23-140 7-126 (126) 8 PRK06837 acetylornithine deace 95.3 0.28 7.1E-06 29.1 10.5 135 175-316 20-174 (427) 9 PRK06915 acetylornithine deace 94.8 0.38 9.8E-06 28.1 9.0 135 175-316 17-170 (422) 10 PRK07522 acetylornithine deace 91.0 1.4 3.5E-05 24.2 9.3 40 176-215 6-47 (387) 11 PRK13013 succinyl-diaminopimel 89.2 1.4 3.5E-05 24.2 5.9 125 175-314 14-157 (427) 12 cd00986 PDZ_LON_protease PDZ d 88.5 0.16 4.2E-06 30.7 0.8 51 307-368 10-60 (79) 13 PRK13004 peptidase; Reviewed 85.3 3.1 7.8E-05 21.7 7.0 115 175-310 15-138 (397) 14 PRK04443 acetyl-lysine deacety 83.3 3.7 9.5E-05 21.1 10.2 36 177-212 8-44 (352) 15 PRK07906 hypothetical protein; 78.3 5.4 0.00014 20.0 9.9 14 39-52 40-53 (437) 16 PRK05111 acetylornithine deace 75.4 6.4 0.00016 19.4 9.4 12 480-491 369-380 (383) 17 TIGR01910 DapE-ArgE acetylorni 75.1 6.6 0.00017 19.4 6.9 90 209-305 144-241 (427) 18 cd00987 PDZ_serine_protease PD 72.0 1.7 4.4E-05 23.4 1.1 59 299-368 17-77 (90) 19 PRK13983 diaminopimelate amino 71.6 7.9 0.0002 18.8 10.0 33 284-316 120-154 (399) 20 PRK00466 acetyl-lysine deacety 71.1 8.1 0.00021 18.7 6.3 12 39-50 34-45 (345) 21 PRK13009 succinyl-diaminopimel 70.8 8.2 0.00021 18.7 7.3 16 390-405 264-279 (375) 22 cd00991 PDZ_archaeal_metallopr 69.1 2.5 6.3E-05 22.4 1.4 59 304-373 9-68 (79) 23 PRK07338 hypothetical protein; 67.9 9.4 0.00024 18.3 8.3 132 175-315 17-164 (407) 24 pfam00122 E1-E2_ATPase E1-E2 A 65.2 6.2 0.00016 19.6 2.8 64 294-374 24-87 (222) 25 TIGR02037 degP_htrA_DO proteas 64.5 1.7 4.4E-05 23.5 -0.2 31 307-337 418-449 (484) 26 PRK13034 serine hydroxymethylt 63.8 11 0.00028 17.8 5.2 58 342-400 196-258 (422) 27 PRK10942 serine endoprotease; 61.3 2.4 6.2E-05 22.4 0.1 28 310-337 317-345 (474) 28 TIGR00093 TIGR00093 conserved 60.7 2.1 5.3E-05 22.9 -0.3 30 102-132 67-96 (185) 29 cd05017 SIS_PGI_PMI_1 The memb 60.1 5.1 0.00013 20.2 1.6 62 282-358 5-68 (119) 30 TIGR01043 ATP_syn_A_arch ATP s 59.8 5.9 0.00015 19.7 1.9 136 67-231 250-388 (584) 31 KOG0884 consensus 59.2 3.9 9.9E-05 21.0 0.9 57 402-465 60-119 (161) 32 PRK06031 phosphoribosyltransfe 57.4 7.3 0.00019 19.1 2.0 26 166-191 162-187 (233) 33 KOG1349 consensus 56.4 15 0.00038 16.9 4.1 16 288-303 47-62 (309) 34 PRK06931 diaminobutyrate--2-ox 56.3 8.8 0.00023 18.5 2.3 97 189-305 234-343 (454) 35 TIGR01515 branching_enzym 1,4- 55.4 6 0.00015 19.7 1.3 23 85-107 149-171 (608) 36 PRK08652 acetylornithine deace 54.5 16 0.0004 16.7 9.7 16 37-52 24-39 (349) 37 PRK08554 peptidase; Reviewed 54.0 16 0.00041 16.6 5.3 14 39-52 31-44 (438) 38 PRK04155 chaperone protein Hch 53.9 6 0.00015 19.6 1.1 26 47-77 34-59 (288) 39 PRK08596 acetylornithine deace 53.7 16 0.00041 16.6 10.4 40 176-215 14-57 (421) 40 PRK06133 glutamate carboxypept 51.5 18 0.00045 16.4 13.0 119 176-316 46-180 (418) 41 PRK07473 carboxypeptidase; Pro 51.4 18 0.00045 16.4 10.0 117 179-315 15-147 (376) 42 KOG0780 consensus 51.3 18 0.00045 16.3 7.8 171 62-263 95-278 (483) 43 PRK08674 bifunctional phosphog 50.5 14 0.00037 16.9 2.6 15 178-192 181-195 (328) 44 PRK13007 dipeptidase; Reviewed 49.6 19 0.00048 16.2 6.6 27 176-202 9-35 (354) 45 pfam10447 EXOSC1 Exosome compo 49.2 3.5 9E-05 21.3 -0.7 12 319-330 56-67 (72) 46 COG1155 NtpA Archaeal/vacuolar 48.9 19 0.00049 16.1 4.3 151 55-234 236-390 (588) 47 cd00989 PDZ_metalloprotease PD 48.9 3.9 9.9E-05 21.0 -0.5 29 309-337 16-45 (79) 48 COG1921 SelA Selenocysteine sy 48.0 20 0.0005 16.0 4.5 109 287-402 93-248 (395) 49 COG4386 Mu-like prophage tail 47.9 20 0.00051 16.0 5.2 15 235-249 276-290 (487) 50 PRK10898 serine endoprotease; 47.7 5 0.00013 20.2 -0.1 54 307-371 282-336 (355) 51 cd05791 S1_CSL4 S1_CSL4: CSL4, 47.5 4 0.0001 20.9 -0.6 13 319-331 58-70 (92) 52 COG3896 Chloramphenicol 3-O-ph 47.0 12 0.0003 17.6 1.7 33 189-221 121-160 (205) 53 TIGR02149 glgA_Coryne glycogen 46.9 9.2 0.00024 18.3 1.2 40 208-248 147-195 (416) 54 PRK10139 serine endoprotease; 46.4 6.1 0.00015 19.6 0.2 24 313-336 398-422 (455) 55 KOG3846 consensus 43.2 9.1 0.00023 18.4 0.7 39 380-418 401-446 (465) 56 COG2222 AgaS Predicted phospho 41.9 10 0.00025 18.1 0.7 64 209-278 194-263 (340) 57 TIGR01139 cysK cysteine syntha 41.6 24 0.00062 15.3 3.3 29 284-312 272-301 (312) 58 cd03450 NodN NodN (nodulation 38.8 26 0.00067 15.1 2.5 21 112-134 22-42 (149) 59 PRK08087 L-fuculose phosphate 38.4 27 0.0007 15.0 5.9 59 153-212 156-214 (215) 60 KOG2367 consensus 37.8 23 0.00059 15.5 2.0 63 185-251 221-290 (560) 61 pfam09264 Sial-lect-inser Vibr 37.1 16 0.00041 16.7 1.1 31 323-353 112-149 (197) 62 COG0381 WecB UDP-N-acetylgluco 36.9 25 0.00064 15.2 2.1 22 292-315 226-247 (383) 63 PRK00911 dihydroxy-acid dehydr 36.0 30 0.00076 14.7 5.6 114 270-400 306-422 (557) 64 cd00988 PDZ_CTP_protease PDZ d 35.4 10 0.00026 18.0 -0.0 32 307-338 15-47 (85) 65 cd07144 ALDH_ALD2-YMR170C Sacc 34.4 31 0.0008 14.6 4.7 93 357-459 387-484 (484) 66 TIGR00587 nfo apurinic endonuc 33.2 6.9 0.00018 19.2 -1.2 45 231-280 179-229 (318) 67 PRK07334 threonine dehydratase 33.1 33 0.00084 14.4 2.8 54 285-354 177-230 (399) 68 COG0248 GppA Exopolyphosphatas 32.9 33 0.00085 14.4 5.3 81 177-272 80-165 (492) 69 PRK05639 4-aminobutyrate amino 32.3 34 0.00086 14.3 3.1 94 190-303 237-343 (457) 70 TIGR02198 rfaE_dom_I rfaE bifu 32.1 32 0.00082 14.5 2.0 153 39-228 77-249 (321) 71 TIGR00195 exoDNase_III exodeox 31.8 20 0.0005 16.0 0.9 11 129-139 222-232 (281) 72 cd00136 PDZ PDZ domain, also c 31.5 12 0.0003 17.6 -0.3 43 306-356 14-57 (70) 73 TIGR01752 flav_long flavodoxin 31.3 35 0.00089 14.2 4.4 52 41-108 76-127 (176) 74 COG4942 Membrane-bound metallo 30.5 36 0.00092 14.1 2.4 85 289-394 290-374 (420) 75 PRK08526 threonine dehydratase 30.5 26 0.00067 15.1 1.3 42 285-328 177-219 (403) 76 PRK12890 allantoate amidohydro 30.0 37 0.00094 14.1 6.8 136 274-427 230-369 (412) 77 COG0036 Rpe Pentose-5-phosphat 29.9 21 0.00053 15.8 0.8 27 176-202 100-126 (220) 78 pfam00926 DHBP_synthase 3,4-di 29.9 35 0.00089 14.2 1.9 44 183-229 27-70 (193) 79 COG4043 Preprotein translocase 29.9 20 0.00051 16.0 0.7 14 93-106 60-73 (111) 80 cd04461 S1_Rrp5_repeat_hs8_sc7 29.7 37 0.00095 14.0 3.1 48 302-350 34-82 (83) 81 pfam02112 PDEase_II cAMP phosp 29.2 38 0.00097 14.0 2.4 30 331-367 231-260 (323) 82 PRK10717 cysteine synthase A; 28.9 38 0.00098 14.0 3.6 42 284-327 184-225 (334) 83 TIGR01163 rpe ribulose-phospha 28.7 33 0.00084 14.4 1.6 48 84-134 151-202 (216) 84 pfam05666 Fels1 Fels-1 Prophag 28.4 21 0.00053 15.8 0.5 19 255-273 19-37 (45) 85 PRK09374 rplB 50S ribosomal pr 28.3 39 0.001 13.9 2.8 25 237-267 75-99 (276) 86 smart00228 PDZ Domain present 28.2 19 0.00048 16.1 0.3 31 307-337 28-59 (85) 87 PRK08588 succinyl-diaminopimel 27.9 40 0.001 13.8 9.8 15 38-52 25-39 (378) 88 cd07143 ALDH_AldA_AN0554 Asper 27.4 41 0.001 13.8 4.6 93 357-459 384-481 (481) 89 pfam04767 Pox_F17 DNA-binding 27.4 28 0.00071 15.0 1.0 13 335-348 5-17 (98) 90 PRK08198 threonine dehydratase 27.1 41 0.001 13.7 4.9 57 284-354 180-236 (406) 91 PRK07495 4-aminobutyrate amino 26.9 42 0.0011 13.7 3.2 61 189-269 210-271 (422) 92 PRK08091 ribulose-phosphate 3- 26.6 32 0.00082 14.5 1.2 16 94-109 31-46 (235) 93 PRK00326 cell division protein 26.5 11 0.00029 17.7 -1.1 28 325-353 70-97 (147) 94 PRK13590 putative bifunctional 26.3 43 0.0011 13.6 8.3 136 274-428 407-546 (590) 95 PRK09521 exosome complex RNA-b 26.3 14 0.00035 17.1 -0.7 13 319-331 117-129 (187) 96 TIGR01477 RIFIN variant surfac 25.6 27 0.00069 15.0 0.7 20 185-204 241-260 (395) 97 PRK06382 threonine dehydratase 25.6 42 0.0011 13.7 1.7 43 285-329 176-219 (400) 98 PRK00910 ribB 3,4-dihydroxy-2- 25.5 34 0.00086 14.3 1.2 60 183-245 43-102 (218) 99 PRK12394 putative metallo-depe 25.4 27 0.00068 15.1 0.7 47 320-373 204-251 (387) 100 PRK05581 ribulose-phosphate 3- 25.2 41 0.001 13.8 1.6 14 96-109 24-37 (220) 101 cd00990 PDZ_glycyl_aminopeptid 25.2 23 0.00059 15.5 0.3 31 307-337 14-45 (80) 102 TIGR03240 arg_catab_astD succi 25.2 45 0.0011 13.5 3.1 97 357-463 372-473 (484) 103 KOG3332 consensus 25.1 28 0.00071 14.9 0.7 22 181-202 108-129 (247) 104 PRK06833 L-fuculose phosphate 24.9 45 0.0011 13.5 6.0 54 153-208 158-211 (214) 105 pfam03123 CAT_RBD CAT RNA bind 24.1 46 0.0012 13.4 1.6 15 252-266 19-33 (55) 106 KOG1352 consensus 23.9 47 0.0012 13.3 2.9 121 66-211 269-396 (618) 107 PRK06446 hypothetical protein; 23.9 47 0.0012 13.3 8.0 16 478-493 417-432 (433) 108 PRK09290 allantoate amidohydro 23.7 47 0.0012 13.3 6.9 136 274-427 229-368 (412) 109 TIGR01664 DNA-3'-Pase DNA 3'-p 23.5 48 0.0012 13.3 2.2 31 33-65 51-81 (239) 110 cd00378 SHMT Serine-glycine hy 23.3 48 0.0012 13.3 6.3 80 342-423 186-286 (402) 111 pfam00595 PDZ PDZ domain (Also 23.2 32 0.00082 14.5 0.7 31 309-339 28-59 (80) 112 PRK06082 4-aminobutyrate amino 23.1 49 0.0012 13.2 3.1 94 192-305 246-354 (460) 113 cd02903 Macro_BAL_like Macro d 22.6 50 0.0013 13.2 9.6 108 1-109 1-113 (137) 114 TIGR00918 2A060602 transmembra 22.6 50 0.0013 13.2 1.8 181 172-354 105-331 (1215) 115 TIGR03320 ygeY M20/DapE family 22.2 50 0.0013 13.1 10.7 34 176-209 14-48 (395) 116 pfam11330 DUF3132 Protein of u 22.2 50 0.0013 13.1 3.2 60 300-359 53-113 (124) 117 COG1949 Orn Oligoribonuclease 22.2 51 0.0013 13.1 4.4 61 346-407 7-68 (184) 118 pfam11161 DUF2944 Protein of u 22.1 51 0.0013 13.1 1.9 24 38-61 49-72 (178) 119 PRK08088 4-aminobutyrate amino 22.1 51 0.0013 13.1 2.7 62 189-270 214-276 (426) 120 TIGR01736 FGAM_synth_II phosph 22.0 37 0.00095 14.0 0.9 18 90-107 488-505 (763) 121 TIGR00936 ahcY adenosylhomocys 21.9 51 0.0013 13.1 1.5 20 326-348 324-343 (422) 122 cd06325 PBP1_ABC_uncharacteriz 21.9 51 0.0013 13.1 3.2 14 410-423 211-224 (281) 123 PRK08005 ribulose-phosphate 3- 21.7 47 0.0012 13.3 1.3 15 95-109 20-34 (210) 124 KOG3857 consensus 21.1 16 0.00041 16.6 -1.1 116 242-380 266-392 (465) 125 TIGR00470 sepS O-phosphoseryl- 20.8 54 0.0014 12.9 2.0 50 94-146 235-284 (558) 126 pfam04393 DUF535 Protein of un 20.7 37 0.00095 14.0 0.6 69 352-423 190-263 (287) 127 COG1692 Calcineurin-like phosp 20.6 50 0.0013 13.1 1.3 45 185-229 119-166 (266) 128 PTZ00125 ornithine aminotransf 20.5 55 0.0014 12.9 3.8 96 189-304 211-321 (415) 129 PRK01792 ribB 3,4-dihydroxy-2- 20.4 55 0.0014 12.9 2.1 59 183-244 42-100 (214) 130 COG3480 SdrC Predicted secrete 20.3 55 0.0014 12.9 2.7 165 284-478 108-279 (342) 131 KOG3425 consensus 20.2 55 0.0014 12.8 1.9 55 433-487 42-102 (128) 132 pfam09182 PuR_N Bacterial puri 20.2 53 0.0013 13.0 1.3 53 178-231 7-60 (70) No 1 >PRK00913 leucyl aminopeptidase; Provisional Probab=100.00 E-value=0 Score=1186.01 Aligned_cols=478 Identities=41% Similarity=0.667 Sum_probs=441.0 Q ss_pred CCCCCCEEEEEECCCCCCC--CCCCCCCCHHHHHHHHHCCCCCCCCCCEEEEECCCCCCCCEEEEEECCCHHHCCHH-HH Q ss_conf 8776771699962686632--22212388889999998089983678789984378887445999944881558999-99 Q gi|254780392|r 11 PSIKQEGLAILLKTSFSDV--AGLSLIGSPSIVMRAVSVKNFIGESKSHLNILAPVDCVWDRLVVAGIGDPRDANFS-WL 87 (494) Q Consensus 11 ~~~~~d~lvV~~~~~~~~~--~~~~~~~~~~~L~~~i~~~~F~GK~Ge~l~l~~~~g~~~~rViLVGLG~~~~~~~~-~~ 87 (494) ...++|+++|++.++.... ....+....+.|+++++..+|+||.||++.++.+.+..++||+|||||++++++.+ +| T Consensus 6 ~~~~aD~LVvgV~e~~~l~~~~~~LD~~~~g~I~~lik~~~F~GK~Ge~~~l~~~~g~~a~rVlLVGLGk~~~ls~~~~R 85 (491) T PRK00913 6 EKQKADCLVVGVFEPRRLSGAAEQLDKASDGYISALLKRGDFKGKAGQTLLLHHVPGVLAERVLLVGLGKEEELDEDQLR 85 (491) T ss_pred HHHCCCEEEEEECCCCCCCHHHHHHHHHCCCHHHHHHHCCCCCCCCCCEEEEECCCCCCCCEEEEEECCCCCCCCHHHHH T ss_conf 55207989999678986698999999972519999996599998879989985799987788999926872318999999 Q ss_pred HHHHHHHHHHHHCCCEEEEECCCCCCCC---HHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHH Q ss_conf 9999999985316974899417789999---8999999999971477754444364333232100013332245773789 Q gi|254780392|r 88 KAGGSIASYIEEDKNIEIFVDVPEYPIT---KAEIRDLVLGFMLKTYTFDQYKTKKRESSSPRSKENISVTIVTEMIQQS 164 (494) Q Consensus 88 ra~a~~~~~l~~~~~~~v~i~l~~~~~~---~~~~~~~~eG~~L~~Y~Fd~yKskk~~~~~~~~~~~~~i~~~~~~~~~~ 164 (494) ++++++++.++..+.+++.+.++..... ...+++++||+.|++|+|++|||++++..... .+.+........ T Consensus 86 ~a~a~aar~l~~~~~~~i~v~l~~~~~~~~~~~~a~a~aEG~~Lg~Y~Fd~yKs~k~~~~~~~-----~~~v~~~~~~~a 160 (491) T PRK00913 86 KAAGKAARALKKTKSKEAVLFLTELHVKGRDTKVAAAVAEGALLGAYRFDRYKSKKKEPRRPL-----VFNVPTRELTEA 160 (491) T ss_pred HHHHHHHHHHHHCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCE-----EEEEECCCHHHH T ss_conf 999999999976798089997765454566599999999999966844564567887776642-----699605874889 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCEEECHHHHHHHHHHH-HCCCCCHHHHCCCCCCHHHHHHHHHHHCCCCCCCEEEEE Q ss_conf 9999999999999998642021254030299999999986-113321000011100013458988742025778506886 Q gi|254780392|r 165 SQVVADIQSVVKGVNLARDIINEPANVLGTDEFCKQVRKL-ESLGVAVEILDKEAMHKLGMNALLAVAQGSSRPPYLAVM 243 (494) Q Consensus 165 ~~~i~~~~aiaegv~lARDLvN~PaN~ltP~~lA~~a~~l-~~~gl~v~Vld~~~L~klGMg~llaVgqGS~~~Prli~l 243 (494) ++++++++++++|+|+||||+|+|||+|||++||++++++ ++++++|+|+++++|+++|||+||+|||||.+|||||+| T Consensus 161 ~~a~~~g~aiae~~~~aRdL~n~P~n~~tP~~~a~~a~~~~~~~~~~v~v~~~~~l~~~gmg~~laVg~GS~~~p~li~l 240 (491) T PRK00913 161 EKAIAHGEAIAEGVNLARDLVNEPPNILTPAYLAERAKELAKEYGLEVEVLDEKEMEKLGMGALLAVGQGSANPPRLIVL 240 (491) T ss_pred HHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCCEEEEECHHHHHHCCCCCEEEECCCCCCCCCEEEE T ss_conf 99999999999999999998647870189999999999987650978999639999866997332562576899847999 Q ss_pred ECCCCCCCCCCEEEEECCEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEEEECCCCCCCCCCC Q ss_conf 20578865651266302112006631003333200232211235689999999987087872899864102668999747 Q gi|254780392|r 244 KWEGGDSEEQPLAFIGKGVVFDTGGISIKPSHGMEEMKGDLGGAAAVTGLLHVLAERKAKINAIGVLALVENMPGSSAQR 323 (494) Q Consensus 244 ~Y~g~~~~~~piaLVGKGvTFDtGGislKp~~~M~~Mk~DM~GaAav~g~~~aia~l~~~v~v~~~~~~~EN~~~~~a~~ 323 (494) +|+|++++++||+|||||||||||||||||+.+|++||+||||||+|+|+|+++++|++|+||++++|+|||||||+||| T Consensus 241 ~y~g~~~~~~~i~lVGKGvtFDsGG~slKp~~~m~~Mk~Dm~Gaa~v~g~~~a~a~l~~~~nv~~~~~~~ENm~~g~a~~ 320 (491) T PRK00913 241 EYKGGPKDAKPIALVGKGLTFDSGGISLKPAAGMDEMKYDMGGAAAVLGTMRALAELKLPVNVVGVVAACENMPSGNAYR 320 (491) T ss_pred EECCCCCCCCEEEEECCEEEEECCCCCCCCCCCHHHHHCCCCCHHHHHHHHHHHHHHCCCCEEEEEEEHHHCCCCCCCCC T ss_conf 96789987622898435047506874568766878775564106899999999998499956999986121388888999 Q ss_pred HHHHHCCCCCCEEEEECCCCCCEEEEEHHHHHHHHHCCCCEEEEECCCCHHHHHHCCCCCEEEEECCHHHHHHHHHHHHH Q ss_conf 07763024895888634786640365236788752079613552111200235431776057870898999999998998 Q gi|254780392|r 324 PGDIVRSMSGQTIEVINTDAEGRLILADALWYCRTHYNPHLMIDLATLTGAMVVSLGNIYAGLFANNDVLAEQLLSSGLS 403 (494) Q Consensus 324 pgdvi~~~~G~tvei~nTDAEGRlvLaD~l~ya~~~~~p~~iid~ATLTGa~~~alG~~~~g~~~n~~~~~~~~~~a~~~ 403 (494) ||||||+||||||||.||||||||||||+|+|| ++|+|+.|||+|||||||++|||++++|+|||||+|++++++||+. T Consensus 321 pgDvi~~~~GktvEi~nTDAEGRLvLaD~l~ya-~~~~p~~iiD~ATLTGa~~vaLG~~~~g~~~n~~~l~~~~~~a~~~ 399 (491) T PRK00913 321 PGDVLTSMSGKTIEVLNTDAEGRLVLADALTYA-ERFKPDAIIDVATLTGACVVALGHHTAGLMSNNDELADELLKAGEE 399 (491) T ss_pred CCCEEEECCCCEEEECCCCCCCEEEEHHHHHHH-HHCCCCEEEEEHHHHHHHHHHHCCCCEEEECCCHHHHHHHHHHHHH T ss_conf 855578069947983068852036640188888-6539978987121220115331787369971999999999999998 Q ss_pred HCCCCCCCCCCHHHHHHHCCCCCCEECCCCCCHHHHHHHHHHHHHCCC-CCEEEEEEEECCCCCCCCCCCCCCCCCCCHH Q ss_conf 569717677686788861798641701888746799999999872388-6769994000011678888889986543589 Q gi|254780392|r 404 TGELLWRMPMNEEYNKLIESKFADMKNIGGRGAGSIVAAQFLEKFVQD-ASWAHIDIAGVATGGYPKEINQSWASGFGVR 482 (494) Q Consensus 404 ~ge~~w~lPl~~~y~~~~~s~~AD~~n~~~~~~g~~~aa~fl~~f~~~-~~w~HlDiag~a~~~~~~~~~~~g~tg~~vr 482 (494) +||++||||||++|+++|||++|||+|+|+|+||+||||+||+|||++ +||+|||||||||++++++|.|+|+|||||| T Consensus 400 ~ge~~W~lPl~~~y~~~l~S~~AD~~N~g~~~~g~itAa~FL~~Fv~~~~~W~HlDiAg~a~~~~~~~~~~~gatG~~vr 479 (491) T PRK00913 400 SGERAWRLPLGDEYQEQLKSPIADMANIGGRPGGSITAACFLSRFVEKGYPWAHLDIAGTAWNSKKGGGKPKGATGRGVR 479 (491) T ss_pred HCCCEECCCCCHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCEEEEECCCCCCCCCCCCCCCCCCCCHHHH T ss_conf 19977608899889967799720230789995578999999999728998869996777662688888999986533299 Q ss_pred HHHHHHHHHCCC Q ss_conf 999999976689 Q gi|254780392|r 483 LLDEFVRAFYEK 494 (494) Q Consensus 483 ~l~~~~~~~~~~ 494 (494) ||+|||+++.|+ T Consensus 480 ~l~~~~~~~~~~ 491 (491) T PRK00913 480 LLVQFLLNRAEE 491 (491) T ss_pred HHHHHHHHHHCC T ss_conf 999999986319 No 2 >cd00433 Peptidase_M17 Cytosol aminopeptidase family, N-terminal and catalytic domains. Family M17 contains zinc- and manganese-dependent exopeptidases ( EC 3.4.11.1), including leucine aminopeptidase. They catalyze removal of amino acids from the N-terminus of a protein and play a key role in protein degradation and in the metabolism of biologically active peptides. They do not contain HEXXH motif (which is used as one of the signature patterns to group the peptidase families) in the metal-binding site. The two associated zinc ions and the active site are entirely enclosed within the C-terminal catalytic domain in leucine aminopeptidase. The enzyme is a hexamer, with the catalytic domains clustered around the three-fold axis, and the two trimers related to one another by a two-fold rotation. The N-terminal domain is structurally similar to the ADP-ribose binding Macro domain. This family includes proteins from bacteria, archaea, animals and plants. Probab=100.00 E-value=0 Score=1136.74 Aligned_cols=461 Identities=43% Similarity=0.701 Sum_probs=420.8 Q ss_pred CCEEEEEECCC---CCCCCCCCCCCCHHHHHHHHHCCCCCCCCCCEEEEECCCCCCCCEEEEEECCCHHHCCHH-HHHHH Q ss_conf 77169996268---663222212388889999998089983678789984378887445999944881558999-99999 Q gi|254780392|r 15 QEGLAILLKTS---FSDVAGLSLIGSPSIVMRAVSVKNFIGESKSHLNILAPVDCVWDRLVVAGIGDPRDANFS-WLKAG 90 (494) Q Consensus 15 ~d~lvV~~~~~---~~~~~~~~~~~~~~~L~~~i~~~~F~GK~Ge~l~l~~~~g~~~~rViLVGLG~~~~~~~~-~~ra~ 90 (494) .|++++++... +.......+....+.+.++++..+|+||.||++.++.+.+ ...||++||||+.++.+.+ +++++ T Consensus 1 ~d~LVvgv~~~~~~l~~~~~~ld~~~~g~l~~ll~~~~~~Gk~Ge~~~l~~~~~-~~~rvllvGLG~~~~~~~~~~r~a~ 79 (468) T cd00433 1 ADGLVLGVFEGEGGLPPAAEKLDAASSGALAALLKASGFKGKAGETLLLPALGG-GAKRVALVGLGKEEDLDVENLRKAA 79 (468) T ss_pred CCEEEEEEECCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCC-CCCEEEEEECCCCCCCCHHHHHHHH T ss_conf 978999998899976889999989734699999974999988688899807999-7477999914874549999999999 Q ss_pred HHHHHHHHHCCCEEEEECCCCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHH Q ss_conf 99999853169748994177899998999999999971477754444364333232100013332245773789999999 Q gi|254780392|r 91 GSIASYIEEDKNIEIFVDVPEYPITKAEIRDLVLGFMLKTYTFDQYKTKKRESSSPRSKENISVTIVTEMIQQSSQVVAD 170 (494) Q Consensus 91 a~~~~~l~~~~~~~v~i~l~~~~~~~~~~~~~~eG~~L~~Y~Fd~yKskk~~~~~~~~~~~~~i~~~~~~~~~~~~~i~~ 170 (494) ++++++++..+..++.+..+. ..+.++++++|+.|++|+|++||++++...... ..........++.+++ T Consensus 80 a~aa~~l~~~~~~~i~v~~~~---~~~~~~~~aeGa~Lg~Y~f~~yKs~~~~~~~~~-------~~~~~~~~~~~~~~~~ 149 (468) T cd00433 80 GAAARALKKLGSKSVAVDLPT---LAEDAEAAAEGALLGAYRFDRYKSKKKKTPLLV-------VLELGNDKAAEAALER 149 (468) T ss_pred HHHHHHHHHCCCCEEEEECCC---CHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCE-------EEECCCCHHHHHHHHH T ss_conf 999999854797089992899---589999999999965676101457898887503-------5632675789999999 Q ss_pred HHHHHHHHHHHHHHCCCCCEEECHHHHHHHHHHH-HCCCCCHHHHCCCCCCHHHHHHHHHHHCCCCCCCEEEEEECCCCC Q ss_conf 9999999998642021254030299999999986-113321000011100013458988742025778506886205788 Q gi|254780392|r 171 IQSVVKGVNLARDIINEPANVLGTDEFCKQVRKL-ESLGVAVEILDKEAMHKLGMNALLAVAQGSSRPPYLAVMKWEGGD 249 (494) Q Consensus 171 ~~aiaegv~lARDLvN~PaN~ltP~~lA~~a~~l-~~~gl~v~Vld~~~L~klGMg~llaVgqGS~~~Prli~l~Y~g~~ 249 (494) ++++++|+++||||+|+|||+|||++||++++++ +++|++|+|+|+++|+++|||+||+|||||.+|||||+|+|+|.+ T Consensus 150 ~~aia~gv~laRdLvn~P~N~ltP~~lA~~a~~la~~~g~~v~Vld~~~l~~~gmg~llaVg~GS~~~p~li~l~y~~~~ 229 (468) T cd00433 150 GEAIAEGVNLARDLVNTPANDLTPTYLAEEAKELAKELGVKVEVLDEKELEELGMGALLAVGKGSEEPPRLIVLEYKGKG 229 (468) T ss_pred HHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHCCCCEEEEEEHHHHHHCCCCHHHEECCCCCCCCEEEEEEECCCC T ss_conf 99999999999997448960169999999999986407987999709999875995021010256899717999856888 Q ss_pred CCCCCEEEEECCEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEEEECCCCCCCCCCCHHHHHC Q ss_conf 65651266302112006631003333200232211235689999999987087872899864102668999747077630 Q gi|254780392|r 250 SEEQPLAFIGKGVVFDTGGISIKPSHGMEEMKGDLGGAAAVTGLLHVLAERKAKINAIGVLALVENMPGSSAQRPGDIVR 329 (494) Q Consensus 250 ~~~~piaLVGKGvTFDtGGislKp~~~M~~Mk~DM~GaAav~g~~~aia~l~~~v~v~~~~~~~EN~~~~~a~~pgdvi~ 329 (494) ++++||+|||||||||||||||||+.+|++||+||||||+|+|+|+++++|++|+||++++|+|||||||+||||||||| T Consensus 230 ~~~~~~~lVGKGitfDsGG~~lKp~~~m~~Mk~Dm~Gaa~v~g~~~a~a~l~~~~~v~~~~~~~EN~~~~~a~rpgDvi~ 309 (468) T cd00433 230 ASKKPIALVGKGITFDTGGLSLKPAAGMDGMKYDMGGAAAVLGAMKAIAELKLPVNVVGVLPLAENMISGNAYRPGDVIT 309 (468) T ss_pred CCCCCEEEECCEEEECCCCCCCCCCCCHHHHHCCCHHHHHHHHHHHHHHHHCCCCEEEEEEEHHHCCCCCCCCCCCCEEE T ss_conf 88775588611378516874557734288864232236899999999998589927999986231487888989846588 Q ss_pred CCCCCEEEEECCCCCCEEEEEHHHHHHHHHCCCCEEEEECCCCHHHHHHCCCCCEEEEECCHHHHHHHHHHHHHHCCCCC Q ss_conf 24895888634786640365236788752079613552111200235431776057870898999999998998569717 Q gi|254780392|r 330 SMSGQTIEVINTDAEGRLILADALWYCRTHYNPHLMIDLATLTGAMVVSLGNIYAGLFANNDVLAEQLLSSGLSTGELLW 409 (494) Q Consensus 330 ~~~G~tvei~nTDAEGRlvLaD~l~ya~~~~~p~~iid~ATLTGa~~~alG~~~~g~~~n~~~~~~~~~~a~~~~ge~~w 409 (494) +||||||||.||||||||||||+|+||+ +|+|+.|||+|||||||++|||++++|+|+|||+|++++++||+.+||++| T Consensus 310 ~~~GktvEi~nTDAEGRLvLaD~l~ya~-~~~p~~iiD~ATLTGa~~vaLG~~~~g~~~n~~~l~~~l~~a~~~~ge~~w 388 (468) T cd00433 310 SRSGKTVEILNTDAEGRLVLADALTYAQ-EFKPDLIIDIATLTGAAVVALGHDYAGLFTNDDELAKQLLAAGEASGERVW 388 (468) T ss_pred CCCCCEEEEECCCCCHHEEEHHHHHHHH-HCCCCEEEEECCCCCCEEEECCCCCEEEECCCHHHHHHHHHHHHHHCCCEE T ss_conf 2799689996688403111145888886-429977776144202110220787679954999999999999998099787 Q ss_pred CCCCCHHHHHHHCCCCCCEECCCCC-CHHHHHHHHHHHHHCCC-CCEEEEEEEECCCCCCCCCCCCCCCCCCCHHHHHHH Q ss_conf 6776867888617986417018887-46799999999872388-676999400001167888888998654358999999 Q gi|254780392|r 410 RMPMNEEYNKLIESKFADMKNIGGR-GAGSIVAAQFLEKFVQD-ASWAHIDIAGVATGGYPKEINQSWASGFGVRLLDEF 487 (494) Q Consensus 410 ~lPl~~~y~~~~~s~~AD~~n~~~~-~~g~~~aa~fl~~f~~~-~~w~HlDiag~a~~~~~~~~~~~g~tg~~vr~l~~~ 487 (494) |||||++|+++|||++|||+|+++| +||+||||+||+|||++ +||+|||||||+|.+++. |.|+|+||||||||+|| T Consensus 389 ~lPl~~~y~~~l~s~~AD~~N~~~~~~~g~~~aa~FL~~Fv~~~~~W~HlDiAg~a~~~~~~-~~~~g~TG~gvr~l~~~ 467 (468) T cd00433 389 RLPLWEEYREQLKSDIADLKNIGGRGPAGSITAALFLKEFVGDGIPWAHLDIAGTAWKSKPG-YLPKGATGFGVRLLVEF 467 (468) T ss_pred CCCCCHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCEEEEECCCCCCCCCCC-CCCCCCCCHHHHHHHHH T ss_conf 08899888937799836767789899534899999999970989987899767867667899-89998353769999976 Q ss_pred H Q ss_conf 9 Q gi|254780392|r 488 V 488 (494) Q Consensus 488 ~ 488 (494) | T Consensus 468 l 468 (468) T cd00433 468 L 468 (468) T ss_pred C T ss_conf 9 No 3 >COG0260 PepB Leucyl aminopeptidase [Amino acid transport and metabolism] Probab=100.00 E-value=0 Score=1137.18 Aligned_cols=447 Identities=45% Similarity=0.745 Sum_probs=411.8 Q ss_pred CHHHHHHHHHCCCCCCCCCCEEEEECCCCCCCCEEEEEECCCHHHCCH-HHHHHHHHHHHHHHHCCCEEEEECCCCCCCC Q ss_conf 888999999808998367878998437888744599994488155899-9999999999985316974899417789999 Q gi|254780392|r 37 SPSIVMRAVSVKNFIGESKSHLNILAPVDCVWDRLVVAGIGDPRDANF-SWLKAGGSIASYIEEDKNIEIFVDVPEYPIT 115 (494) Q Consensus 37 ~~~~L~~~i~~~~F~GK~Ge~l~l~~~~g~~~~rViLVGLG~~~~~~~-~~~ra~a~~~~~l~~~~~~~v~i~l~~~~~~ 115 (494) ....++...+..+|+||.|+++.++.+. ....|+++||||+.+ .+. .++++.+.+.+.++..+..++.+.++.++ . T Consensus 36 ~~~~~~~~~~~~~~~g~~g~~~~~~~~~-~~~~~~~~vglg~~~-~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~-~ 112 (485) T COG0260 36 SEGILSWLLKAGEFSGKAGQVLLLPGSV-GSVKRVLLVGLGKRK-LSEDTLRRAAGAAARALKKGKAWEVTLLLPELP-A 112 (485) T ss_pred HHHHHHHHHHHHCCCCCCCCEEEEECCC-CCCCEEEEEEECCCC-CCHHHHHHHHHHHHHHHHCCCCEEEEEECCCCC-H T ss_conf 1456899987414435677269840567-760168999716876-777789999999999742267336898346632-6 Q ss_pred HHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEECHH Q ss_conf 89999999999714777544443643332321000133322457737899999999999999998642021254030299 Q gi|254780392|r 116 KAEIRDLVLGFMLKTYTFDQYKTKKRESSSPRSKENISVTIVTEMIQQSSQVVADIQSVVKGVNLARDIINEPANVLGTD 195 (494) Q Consensus 116 ~~~~~~~~eG~~L~~Y~Fd~yKskk~~~~~~~~~~~~~i~~~~~~~~~~~~~i~~~~aiaegv~lARDLvN~PaN~ltP~ 195 (494) .+.+..+++|+.+++|+|++||+.+++...+. ..+.+.... ...++.+++++++++||++||||+|+|||+|||+ T Consensus 113 ~~~~~~~~~g~~~~~Y~f~~yk~~k~~~~~~~----~~v~~~~~~-~~~~~~~~~~~ai~~~v~~aRdLvN~P~n~l~P~ 187 (485) T COG0260 113 AEIAAAAVEGFLLGSYRFDRYKSEKKEPDKEL----PKVTLLVPA-KSAEKAIREGEAIAEGVNLARDLVNTPANILTPE 187 (485) T ss_pred HHHHHHHHHHHHHCCCCHHHCCCCCCCCCCCC----CEEEEECCC-HHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHH T ss_conf 89999999998751451433045544544454----237883265-0467789999999999999998763896337999 Q ss_pred HHHHHHHHHHC-CCCCHHHHCCCCCCHHHHHHHHHHHCCCCCCCEEEEEECCCCCCCCCCEEEEECCEEECCCCCCCCCC Q ss_conf 99999998611-33210000111000134589887420257785068862057886565126630211200663100333 Q gi|254780392|r 196 EFCKQVRKLES-LGVAVEILDKEAMHKLGMNALLAVAQGSSRPPYLAVMKWEGGDSEEQPLAFIGKGVVFDTGGISIKPS 274 (494) Q Consensus 196 ~lA~~a~~l~~-~gl~v~Vld~~~L~klGMg~llaVgqGS~~~Prli~l~Y~g~~~~~~piaLVGKGvTFDtGGislKp~ 274 (494) +||++++.|++ .+++|+|+++++|+++|||+||+|||||.+|||||+|+|+|.+++++|++|||||||||||||||||+ T Consensus 188 ~la~~a~~la~~~~v~veVl~~~~l~~~gmg~llaVg~GS~~~Prlivl~y~g~~~~~~~iaLVGKGitFDsGGisiKp~ 267 (485) T COG0260 188 ELAERAELLAKLGGVKVEVLDEKDLEKLGMGALLAVGKGSARPPRLIVLEYNGKGKAKKPIALVGKGITFDSGGISIKPA 267 (485) T ss_pred HHHHHHHHHHHCCCCEEEEECHHHHHHCCCCEEEEECCCCCCCCEEEEEECCCCCCCCCEEEEECCCEEECCCCCCCCCC T ss_conf 99999999865378169993599998759951664356778898489997379988886499975703465898567775 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEEEECCCCCCCCCCCHHHHHCCCCCCEEEEECCCCCCEEEEEHHHH Q ss_conf 32002322112356899999999870878728998641026689997470776302489588863478664036523678 Q gi|254780392|r 275 HGMEEMKGDLGGAAAVTGLLHVLAERKAKINAIGVLALVENMPGSSAQRPGDIVRSMSGQTIEVINTDAEGRLILADALW 354 (494) Q Consensus 275 ~~M~~Mk~DM~GaAav~g~~~aia~l~~~v~v~~~~~~~EN~~~~~a~~pgdvi~~~~G~tvei~nTDAEGRlvLaD~l~ 354 (494) .+|++||+||||||+|+|+|.++|++++|+||++++|+|||||||+|||||||||+||||||||+||||||||||||||+ T Consensus 268 ~~M~~MK~DMgGAAaV~g~~~a~a~l~l~vnv~~vl~~~ENm~~g~A~rPGDVits~~GkTVEV~NTDAEGRLVLADaLt 347 (485) T COG0260 268 AGMDTMKYDMGGAAAVLGAMRALAELKLPVNVVGVLPAVENMPSGNAYRPGDVITSMNGKTVEVLNTDAEGRLVLADALT 347 (485) T ss_pred CCHHHHHCCCCHHHHHHHHHHHHHHCCCCCEEEEEEEEECCCCCCCCCCCCCEEEECCCEEEEECCCCCCHHHHHHHHHH T ss_conf 54333122644499999999999971999349999761315878789999876780597089974567027787788899 Q ss_pred HHHHHCCCCEEEEECCCCHHHHHHCCCCCEEEEECCHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHCCCCCCEECCCC- Q ss_conf 8752079613552111200235431776057870898999999998998569717677686788861798641701888- Q gi|254780392|r 355 YCRTHYNPHLMIDLATLTGAMVVSLGNIYAGLFANNDVLAEQLLSSGLSTGELLWRMPMNEEYNKLIESKFADMKNIGG- 433 (494) Q Consensus 355 ya~~~~~p~~iid~ATLTGa~~~alG~~~~g~~~n~~~~~~~~~~a~~~~ge~~w~lPl~~~y~~~~~s~~AD~~n~~~- 433 (494) ||++ +||+.|||+|||||||+||||++++|+|||||+|+++|.+|++.+||++|||||+++|+++|+|+||||+|+++ T Consensus 348 YA~~-~kp~~iID~ATLTGA~~vALG~~~~g~fs~dd~La~~l~~As~~~gE~~WrLPl~~~y~~~l~S~~ADl~N~~~~ 426 (485) T COG0260 348 YAEE-LKPDLIIDVATLTGAAVVALGNDYTGLFSNDDELANQLLAASEETGEPLWRLPLHEEYRKQLKSNFADLANIGGG 426 (485) T ss_pred HHHH-CCCCEEEEEHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHCCCCCHHHHCCCCC T ss_conf 9987-199879981205767998707875233328999999999999871983205879878896468962012147899 Q ss_pred CCHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCCCCCCCCCCCCCHHHHHHHHHHHCC Q ss_conf 746799999999872388676999400001167888888998654358999999997668 Q gi|254780392|r 434 RGAGSIVAAQFLEKFVQDASWAHIDIAGVATGGYPKEINQSWASGFGVRLLDEFVRAFYE 493 (494) Q Consensus 434 ~~~g~~~aa~fl~~f~~~~~w~HlDiag~a~~~~~~~~~~~g~tg~~vr~l~~~~~~~~~ 493 (494) |+|||||||+||+|||+++||+|||||||+|.+... +.|+||||+|||||++|++++++ T Consensus 427 ~~gGaitAA~FL~~Fv~~~~WaHlDIAGta~~~~~~-~~~~GAtG~~VrtL~~~l~~~~~ 485 (485) T COG0260 427 RFGGAITAALFLSEFVENVPWAHLDIAGTAWKSGSP-YWPKGATGVGVRTLAQFLLNRAE 485 (485) T ss_pred CCCCHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCC-CCCCCCEECCHHHHHHHHHHHCC T ss_conf 987479999999987189986999843412347887-68998740129999999998629 No 4 >pfam00883 Peptidase_M17 Cytosol aminopeptidase family, catalytic domain. The two associated zinc ions and the active site are entirely enclosed within the C-terminal catalytic domain in leucine aminopeptidase. Probab=100.00 E-value=0 Score=940.82 Aligned_cols=307 Identities=52% Similarity=0.872 Sum_probs=295.2 Q ss_pred HHHHHHHCCCCCEEECHHHHHHHHHHH-HCC-CCCHHHHCCCCCCHHHHHHHHHHHCCCCCCCEEEEEECCCCC-CCCCC Q ss_conf 998642021254030299999999986-113-321000011100013458988742025778506886205788-65651 Q gi|254780392|r 178 VNLARDIINEPANVLGTDEFCKQVRKL-ESL-GVAVEILDKEAMHKLGMNALLAVAQGSSRPPYLAVMKWEGGD-SEEQP 254 (494) Q Consensus 178 v~lARDLvN~PaN~ltP~~lA~~a~~l-~~~-gl~v~Vld~~~L~klGMg~llaVgqGS~~~Prli~l~Y~g~~-~~~~p 254 (494) ||+||||+|+|||+|||++|+++++++ ++. +++|+|+++++|+++|||+||+|||||.+|||||+|+|+|.+ ++++| T Consensus 1 VnlaRdLvn~P~N~ltP~~~a~~~~~~~~~~~~~~v~v~~~~~l~~~gmg~llaVg~GS~~~P~lv~l~y~~~~~~~~~~ 80 (312) T pfam00883 1 VNLARDLVNTPANILTPAVLAEEAEKLASNYSDVKIEVLDEEELEELGMGGLLAVNKGSENPPRFIVLSYKGNKKEDKKK 80 (312) T ss_pred CHHHHHHHCCCHHHCCHHHHHHHHHHHHHHCCCCEEEEEEHHHHHHCCCCCEEEEECCCCCCCCEEEEEECCCCCCCCCE T ss_conf 92677400379455499999999999986569928999619999858997477662356889832799988988877762 Q ss_pred EEEEECCEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEEEECCCCCCCCCCCHHHHHCCCCCC Q ss_conf 26630211200663100333320023221123568999999998708787289986410266899974707763024895 Q gi|254780392|r 255 LAFIGKGVVFDTGGISIKPSHGMEEMKGDLGGAAAVTGLLHVLAERKAKINAIGVLALVENMPGSSAQRPGDIVRSMSGQ 334 (494) Q Consensus 255 iaLVGKGvTFDtGGislKp~~~M~~Mk~DM~GaAav~g~~~aia~l~~~v~v~~~~~~~EN~~~~~a~~pgdvi~~~~G~ 334 (494) |+|||||||||||||||||+.+|++||+||||||+|+|+|++++++++|+||++++|+||||||++|||||||||+|||| T Consensus 81 i~lVGKGvTFDtGGl~lKp~~~M~~Mk~DM~GAA~v~g~~~aia~l~l~v~v~~i~~l~EN~is~~A~rPgDVi~s~~Gk 160 (312) T pfam00883 81 LALVGKGITFDSGGISIKPGAGMDLMKYDMGGAAAVLGAIKALAQLKPPVNVVAVLALTENMISGTAMRPGDIITAMNGK 160 (312) T ss_pred EEEECCCEEECCCCCCCCCCHHHHHHHCCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCCCCCCCEEEECCCC T ss_conf 89953760633665788066226565455315899999999999849986479998701037899889999778917997 Q ss_pred EEEEECCCCCCEEEEEHHHHHHHHHCCCCEEEEECCCCHHHHHHCCCCCEEEEECCHHHHHHHHHHHHHHCCCCCCCCCC Q ss_conf 88863478664036523678875207961355211120023543177605787089899999999899856971767768 Q gi|254780392|r 335 TIEVINTDAEGRLILADALWYCRTHYNPHLMIDLATLTGAMVVSLGNIYAGLFANNDVLAEQLLSSGLSTGELLWRMPMN 414 (494) Q Consensus 335 tvei~nTDAEGRlvLaD~l~ya~~~~~p~~iid~ATLTGa~~~alG~~~~g~~~n~~~~~~~~~~a~~~~ge~~w~lPl~ 414 (494) ||||.||||||||||||+|+||+ +|+|+.|||+|||||||++|||.+++|+|+|||+++++++++|+.+||++|||||| T Consensus 161 TVEI~NTDAEGRLvLADaL~ya~-~~~p~~iiD~ATLTGa~~~ALG~~~ag~~sn~~~l~~~~~~a~~~~ge~~W~lPl~ 239 (312) T pfam00883 161 TVEVNNTDAEGRLVLADALTYAC-ELNPEVIIDVATLTGACVVALGTTYTGIFSNDDELANELIKAAEESGEKVWRMPLH 239 (312) T ss_pred EEEEECCCCCCEEEEHHHHHHHH-CCCCCEEEEECCHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHCCCCEEECCCC T ss_conf 89862698650145455777875-14986576500225467825087640121599999999999999717828988788 Q ss_pred HHHHHHHCCCCCCEECCCCC-CHHHHHHHHHHHHHCCC-CCEEEEEEEECCCCCCCCCCCCCCCCCCCHHHHHH Q ss_conf 67888617986417018887-46799999999872388-67699940000116788888899865435899999 Q gi|254780392|r 415 EEYNKLIESKFADMKNIGGR-GAGSIVAAQFLEKFVQD-ASWAHIDIAGVATGGYPKEINQSWASGFGVRLLDE 486 (494) Q Consensus 415 ~~y~~~~~s~~AD~~n~~~~-~~g~~~aa~fl~~f~~~-~~w~HlDiag~a~~~~~~~~~~~g~tg~~vr~l~~ 486 (494) ++|+++++|++||++|++++ .+|+|+||+||+|||++ +||+|||||||+|++. .++.|+|+||||||||+| T Consensus 240 ~~y~~~l~S~~AD~~N~~~~~~~g~i~AA~FL~~Fv~~~~~W~HlDiAg~a~~~~-~~~~~~G~TG~gvr~Lve 312 (312) T pfam00883 240 EDYKKQLKSKVADLANIGGRRGAGAITAALFLKEFVEEGVPWAHLDIAGTAWNKK-KNQWPKGATGYGVRTLVE 312 (312) T ss_pred HHHHHHCCCCHHHHHCCCCCCCCCHHHHHHHHHHHCCCCCCEEEEECCCCCCCCC-CCCCCCCCCCCCCCCCCC T ss_conf 8999754897201105898898327899999999659998879897578654589-999999965557040049 No 5 >PRK05015 aminopeptidase B; Provisional Probab=100.00 E-value=0 Score=921.20 Aligned_cols=379 Identities=34% Similarity=0.521 Sum_probs=331.8 Q ss_pred HHHHHHHHHHHHHHHCCCEEEEECCCCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHH Q ss_conf 99999999999853169748994177899998999999999971477754444364333232100013332245773789 Q gi|254780392|r 85 SWLKAGGSIASYIEEDKNIEIFVDVPEYPITKAEIRDLVLGFMLKTYTFDQYKTKKRESSSPRSKENISVTIVTEMIQQS 164 (494) Q Consensus 85 ~~~ra~a~~~~~l~~~~~~~v~i~l~~~~~~~~~~~~~~eG~~L~~Y~Fd~yKskk~~~~~~~~~~~~~i~~~~~~~~~~ 164 (494) +.++....++|++...+++.+.+.-. ...-+..-+|..|+ | +-|... .+....- .... T Consensus 35 ~~~~~iq~aarkl~~qgi~~v~l~g~--~w~~e~~waf~qgf----~------~~k~~~---------~~~~~~l-~~~~ 92 (424) T PRK05015 35 DDLGLIQRAARKLDGQGIKQVALAGE--GWDLERCWAFWQGF----R------TPKGTR---------KVEWPDL-DDAQ 92 (424) T ss_pred CCHHHHHHHHHHHHHCCCCEEEECCC--CCCHHHHHHHHHCC----C------CCCCCC---------EEECCCC-CHHH T ss_conf 73799999998765268441686167--66688878988514----4------678884---------3436878-9889 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCEEECHHHHHHHHHHH-HC---CCCCHHHHCCCCCCHHHHHHHHHHHCCCCCCCEE Q ss_conf 9999999999999998642021254030299999999986-11---3321000011100013458988742025778506 Q gi|254780392|r 165 SQVVADIQSVVKGVNLARDIINEPANVLGTDEFCKQVRKL-ES---LGVAVEILDKEAMHKLGMNALLAVAQGSSRPPYL 240 (494) Q Consensus 165 ~~~i~~~~aiaegv~lARDLvN~PaN~ltP~~lA~~a~~l-~~---~gl~v~Vld~~~L~klGMg~llaVgqGS~~~Prl 240 (494) + ++..+-.+++.++|||+|+|||+++|+.||++|.++ ++ ..++++|+++++|+++|||+|++|||||.+|||| T Consensus 93 ~---~~l~~~~~~~~~~R~liN~p~~~l~P~~lA~~a~~~~~~~~~~~v~~~ii~ge~L~e~G~gGi~aVG~GS~~pPrl 169 (424) T PRK05015 93 Q---AELDARLKIIDWVRDTINAPAEELGPEQLAQRAADLISSVAGDAVSYRIIKGEDLREQGYMGIHTVGRGSERPPVL 169 (424) T ss_pred H---HHHHHHHHHHHHHHHHHCCCHHHCCHHHHHHHHHHHHHHHCCCCEEEEEECHHHHHHCCCCCEEEECCCCCCCCEE T ss_conf 9---9999999999999987459946649999999999999873646547999617999867984136664778999868 Q ss_pred EEEECCCCCCCCCCE--EEEECCEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEEEECCCCCC Q ss_conf 886205788656512--663021120066310033332002322112356899999999870878728998641026689 Q gi|254780392|r 241 AVMKWEGGDSEEQPL--AFIGKGVVFDTGGISIKPSHGMEEMKGDLGGAAAVTGLLHVLAERKAKINAIGVLALVENMPG 318 (494) Q Consensus 241 i~l~Y~g~~~~~~pi--aLVGKGvTFDtGGislKp~~~M~~Mk~DM~GaAav~g~~~aia~l~~~v~v~~~~~~~EN~~~ 318 (494) |+|+|+|.++.++|+ ||||||||||||||||||+.+|++||+||||||+|+|+|.+++++++|+||++++|++||||| T Consensus 170 v~L~Y~P~g~~~a~v~~aLVGKGITFDSGGlsLKps~~M~~MK~DMgGAAaV~ga~~~a~~~~l~~~V~~il~~aENm~s 249 (424) T PRK05015 170 LALDYNPTGDPDAPVSACLVGKGITFDSGGYSIKPSAGMDSMKSDMGGAATVTGALALAITRGLNKRVKLFLCCAENLIS 249 (424) T ss_pred EEEEECCCCCCCCCCEEEEECCCEEEECCCCCCCCCCCHHHHHCCCHHHHHHHHHHHHHHHCCCCCEEEEEEEEECCCCC T ss_conf 99996689987775218997662107089867376231545146552799999999999972999608999885505778 Q ss_pred CCCCCHHHHHCCCCCCEEEEECCCCCCEEEEEHHHHHHHHHCCCCEEEEECCCCHHHHHHCCCCCEEEEECCHHHHHHHH Q ss_conf 99747077630248958886347866403652367887520796135521112002354317760578708989999999 Q gi|254780392|r 319 SSAQRPGDIVRSMSGQTIEVINTDAEGRLILADALWYCRTHYNPHLMIDLATLTGAMVVSLGNIYAGLFANNDVLAEQLL 398 (494) Q Consensus 319 ~~a~~pgdvi~~~~G~tvei~nTDAEGRlvLaD~l~ya~~~~~p~~iid~ATLTGa~~~alG~~~~g~~~n~~~~~~~~~ 398 (494) |+|||||||||+||||||||.||||||||||||+|+||+ +++|+.|||+|||||||++|||.+++|+|||||+++++++ T Consensus 250 g~A~rPGDVit~~nGkTVEI~NTDAEGRLVLADaL~yA~-~~~p~~IID~ATLTGA~~vALG~~~aglfsndd~l~~~l~ 328 (424) T PRK05015 250 GNAFKLGDIITYKNGKTVEVMNTDAEGRLVLADGLIDAS-EQGAPLIIDAATLTGAAKMALGNDYHALFSFDDELAQRLL 328 (424) T ss_pred CCCCCCHHHHHHCCCCEEEEECCCCCHHHHHHHHHHHHH-HHCCCEEEEECCCHHHHHHHCCCCCEEEECCCHHHHHHHH T ss_conf 778883888976399779984488502320288999987-4099789872220788998718654145439999999999 Q ss_pred HHHHHHCCCCCCCCCCHHHHHHHCCCCCCEECCC--CCCHHHHHHHHHHHHHCCC--CCEEEEEEEECCCCCCCCCCCCC Q ss_conf 9899856971767768678886179864170188--8746799999999872388--67699940000116788888899 Q gi|254780392|r 399 SSGLSTGELLWRMPMNEEYNKLIESKFADMKNIG--GRGAGSIVAAQFLEKFVQD--ASWAHIDIAGVATGGYPKEINQS 474 (494) Q Consensus 399 ~a~~~~ge~~w~lPl~~~y~~~~~s~~AD~~n~~--~~~~g~~~aa~fl~~f~~~--~~w~HlDiag~a~~~~~~~~~~~ 474 (494) ++++.+||++|||||+++|+++++|++|||+|++ ++.||+||||+||+|||++ .||+||||||+ |.++++++.|+ T Consensus 329 ~a~~~~gE~~W~lPL~~~y~~~l~S~vADl~N~~~g~~~~Ga~tAA~FL~~FV~d~~~~W~HlDIAGt-~~~~~~~~~~~ 407 (424) T PRK05015 329 ASAAQENEPFWRLPLAEFHRSQLPSNFADLANSGSGAYAAGASTAAGFLSHFVENYQQGWLHIDCSAT-YRKSAVDLWAA 407 (424) T ss_pred HHHHHCCCCEECCCCCHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCEEEEECCCC-CCCCCCCCCCC T ss_conf 99997089567288998899677998210106889999534899999999980899998899985545-76689988999 Q ss_pred CCCCCCHHHHHHHHHH Q ss_conf 8654358999999997 Q gi|254780392|r 475 WASGFGVRLLDEFVRA 490 (494) Q Consensus 475 g~tg~~vr~l~~~~~~ 490 (494) |+||||||||++||.+ T Consensus 408 GaTG~GVRtL~~~L~~ 423 (424) T PRK05015 408 GATGLGVRTIANLLLA 423 (424) T ss_pred CCEEHHHHHHHHHHHH T ss_conf 9760759999999840 No 6 >KOG2597 consensus Probab=100.00 E-value=0 Score=865.35 Aligned_cols=468 Identities=32% Similarity=0.480 Sum_probs=397.2 Q ss_pred CCCCCEEEEEECCCCCCC--------CCCCCCCCHHHHHHHHHCCCCCCCCCCEEEEECCCCCCCCEEEEEECCCHHHC- Q ss_conf 776771699962686632--------22212388889999998089983678789984378887445999944881558- Q gi|254780392|r 12 SIKQEGLAILLKTSFSDV--------AGLSLIGSPSIVMRAVSVKNFIGESKSHLNILAPVDCVWDRLVVAGIGDPRDA- 82 (494) Q Consensus 12 ~~~~d~lvV~~~~~~~~~--------~~~~~~~~~~~L~~~i~~~~F~GK~Ge~l~l~~~~g~~~~rViLVGLG~~~~~- 82 (494) ....+.++|.+.+...+. ....+....+.+...++..+++++.|+...+..+. ....++.++|+|++... T Consensus 22 ~~~~~~lvVG~~~~~~~~~~~~~~~~~~~~d~~~~g~~~~~~~~~~~~~~~g~~~~~~~~~-~~~~~~a~vg~gk~~~~~ 100 (513) T KOG2597 22 EAKADGLVVGVYSKEGDKKSKKKTPAGEKFDDKLGGKLIELIRESGVDGKLGYGRVLELVD-PEYDVVAVVGLGKEGAGP 100 (513) T ss_pred CCCCCEEEEEEECCCCCCCCCCCCCCHHHCCCCCCCCHHEEEECCCCCCCCCEEEEEEECC-CCCCHHHHHHHHHHHCCC T ss_conf 4448558998740001232000154112215534663011442578888643036898137-775510124666643276 Q ss_pred ------C--HH-HHHHHHHHHHHHHHCCCEEEEECCCCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf ------9--99-99999999998531697489941778999989999999999714777544443643332321000133 Q gi|254780392|r 83 ------N--FS-WLKAGGSIASYIEEDKNIEIFVDVPEYPITKAEIRDLVLGFMLKTYTFDQYKTKKRESSSPRSKENIS 153 (494) Q Consensus 83 ------~--~~-~~ra~a~~~~~l~~~~~~~v~i~l~~~~~~~~~~~~~~eG~~L~~Y~Fd~yKskk~~~~~~~~~~~~~ 153 (494) . .+ .+.+.+..++.+.......+.++.+ ....++.++|..|+.|+|+.+|++++....++. T Consensus 101 ~~~~~~~e~~~~v~~a~~~~~~al~~~~~~~~~v~s~-----~~~~e~aa~g~~l~~~~~~~~~~~~~~~~~~~i----- 170 (513) T KOG2597 101 NAKEGLDEGMEFVRVAARFGARALVLALGRLVPVYSP-----MDYAEQAAEGAALALWRYNRNKSKKNRPSLPKI----- 170 (513) T ss_pred CHHHHCCCHHHHHHHHCCCCHHHHHHHHCCEEECCCC-----CCHHHHHHCCCCCCCCCCCCCCCCCCCCCCCEE----- T ss_conf 4032114124554553042068999733433111342-----235888642453221223332221345454226----- Q ss_pred CCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEECHHHHHHHHHHH-HCCCCCHHHHCCCCCCHHHHHHHHHHHC Q ss_conf 322457737899999999999999998642021254030299999999986-1133210000111000134589887420 Q gi|254780392|r 154 VTIVTEMIQQSSQVVADIQSVVKGVNLARDIINEPANVLGTDEFCKQVRKL-ESLGVAVEILDKEAMHKLGMNALLAVAQ 232 (494) Q Consensus 154 i~~~~~~~~~~~~~i~~~~aiaegv~lARDLvN~PaN~ltP~~lA~~a~~l-~~~gl~v~Vld~~~L~klGMg~llaVgq 232 (494) .+.... ...+.++++...++++|+||+|+++|+|.|||..|+++++.+ ++.+++++|.+++|+++++||+|++|++ T Consensus 171 -~~~~~~--~~~~~~k~~~~~~~~~~lar~l~d~PaN~Mtp~~fae~a~~~~~~~~v~v~V~~~~~i~~~~~~~~l~V~k 247 (513) T KOG2597 171 -DLYGSG--DGEDAWKRGLFKAAAQNLARRLGDTPANRMTPTQFAEEAVDVLCPLGVTVEVRDEEWIEEQGMNSFLAVAK 247 (513) T ss_pred -EEECCC--CCHHHHHHHHHHHHHHHHHHHHCCCCHHHCCHHHHHHHHHHHHCCCCCEEEEECHHHHHHCCCCCEEEECC T ss_conf -641687--61178888888877888887752488100688899999998613247428994247776544650023214 Q ss_pred CCCCCCEEEEEECCCCCCCCCCEEEEECCEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEEEE Q ss_conf 25778506886205788656512663021120066310033332002322112356899999999870878728998641 Q gi|254780392|r 233 GSSRPPYLAVMKWEGGDSEEQPLAFIGKGVVFDTGGISIKPSHGMEEMKGDLGGAAAVTGLLHVLAERKAKINAIGVLAL 312 (494) Q Consensus 233 GS~~~Prli~l~Y~g~~~~~~piaLVGKGvTFDtGGislKp~~~M~~Mk~DM~GaAav~g~~~aia~l~~~v~v~~~~~~ 312 (494) ||++||+|++++|+|.++.++||+||||||||||||+||||+.+|++||.||+|||+|+|+|+++++|++|+|+++++|+ T Consensus 248 ~s~~pP~ll~lsY~g~~~~~~~i~lvGKGvtfDsGGl~iK~~~~M~~mr~dm~GAA~v~~~~~a~~~l~~~in~~~v~pl 327 (513) T KOG2597 248 ASCEPPRLLELSYKGTSGADKTILLVGKGVTFDSGGLSIKPKTGMDGMRRDMGGAAVVLGAFRAAAQLSLPINVHAVLPL 327 (513) T ss_pred CCCCCCEEEEEEECCCCCCCCEEEEEECCEEEECCCCCCCCCCCHHHHHHHCCCCHHHHHHHHHHHHCCCCCCEEEEEEE T ss_conf 56889879999962898775527998421587358720257777566532124627999999999861899752799750 Q ss_pred CCCCCCCCCCCHHHHHCCCCCCEEEEECCCCCCEEEEEHHHHHHHHHCCCCEEEEECCCCHHHHHHCCCCCEEEEECCHH Q ss_conf 02668999747077630248958886347866403652367887520796135521112002354317760578708989 Q gi|254780392|r 313 VENMPGSSAQRPGDIVRSMSGQTIEVINTDAEGRLILADALWYCRTHYNPHLMIDLATLTGAMVVSLGNIYAGLFANNDV 392 (494) Q Consensus 313 ~EN~~~~~a~~pgdvi~~~~G~tvei~nTDAEGRlvLaD~l~ya~~~~~p~~iid~ATLTGa~~~alG~~~~g~~~n~~~ 392 (494) |||||||+|||||||||++|||||||.||||||||||||+|+||+++|+|++|+|+||||||+.+|||..++|+|+|++. T Consensus 328 cENm~sg~A~kpgDVit~~nGKtveI~NTDAEGRLiLADaL~Ya~~~~~~~~I~d~aTLTGa~~ialG~~~~gi~Tns~~ 407 (513) T KOG2597 328 CENMPSGNATKPGDVITLRNGKTVEINNTDAEGRLVLADALLYAQETLKPKLIVDIATLTGAMRIALGEGAAGIFTNSEE 407 (513) T ss_pred ECCCCCCCCCCCCCEEEECCCCEEEECCCCCCCEEEECCHHHHHHHHCCCCEEEEECHHHHHHHHHCCCCEEEEECCHHH T ss_conf 00588755789874788127867875144765538950277886651399889982000225676527640688728399 Q ss_pred HHHHHHHHHHHHCCCCCCCCCCHHHHHHH-CCCCCCEECCCC-CCHHHHHHHHHHHHHCCCCCEEEEEEEECCCCC-CCC Q ss_conf 99999998998569717677686788861-798641701888-746799999999872388676999400001167-888 Q gi|254780392|r 393 LAEQLLSSGLSTGELLWRMPMNEEYNKLI-ESKFADMKNIGG-RGAGSIVAAQFLEKFVQDASWAHIDIAGVATGG-YPK 469 (494) Q Consensus 393 ~~~~~~~a~~~~ge~~w~lPl~~~y~~~~-~s~~AD~~n~~~-~~~g~~~aa~fl~~f~~~~~w~HlDiag~a~~~-~~~ 469 (494) ||+++++||..+|||+||||||.+|..++ ++..+|++|++. +.+++||||+||++||++..|+|||||||...+ ... T Consensus 408 l~k~~~~Ag~~tGD~vwrmplf~~~~~~v~~~~~~D~~n~g~~~~s~~~~aAAfL~~~v~~~~W~HlDiag~~~~~~~~~ 487 (513) T KOG2597 408 LWKRLQKAGIETGDRVWRMPLFSYYYKAVADSGLADLNNTGSGRASRSCTAAAFLKEFVPCLDWAHLDIAGPGHTDIGGT 487 (513) T ss_pred HHHHHHHHHHHHCCCEEECHHHHHHHHHHHCCCHHHHCCCCCCCCCCHHHHHHHHHHHCCCCCEEEEECCCCEECCCCCC T ss_conf 99999997776198233216899999987414123321478888762147999997505764146530357400178888 Q ss_pred CCCCCC-CCCCCHHHHHHHHHHHCC Q ss_conf 888998-654358999999997668 Q gi|254780392|r 470 EINQSW-ASGFGVRLLDEFVRAFYE 493 (494) Q Consensus 470 ~~~~~g-~tg~~vr~l~~~~~~~~~ 493 (494) +|..++ +||||+|||+||+.++-. T Consensus 488 ~y~~~~~mTg~p~~Tlve~l~q~s~ 512 (513) T KOG2597 488 PYLHKGLMTGRPTRTLVEFLYQMSC 512 (513) T ss_pred CCCCCCCCCCCCCEEHHHHHHHHCC T ss_conf 6644456468974136999997538 No 7 >pfam02789 Peptidase_M17_N Cytosol aminopeptidase family, N-terminal domain. Probab=99.69 E-value=3.2e-16 Score=134.49 Aligned_cols=118 Identities=23% Similarity=0.314 Sum_probs=100.0 Q ss_pred CCCCCCCCCCCCCCCHHHHHHHHHCCCCCCCCCCEEEEECCCCCCCCEEEEEECCCHHHCCHH-HHHHHHHHHHHHHHCC Q ss_conf 268663222212388889999998089983678789984378887445999944881558999-9999999999853169 Q gi|254780392|r 23 KTSFSDVAGLSLIGSPSIVMRAVSVKNFIGESKSHLNILAPVDCVWDRLVVAGIGDPRDANFS-WLKAGGSIASYIEEDK 101 (494) Q Consensus 23 ~~~~~~~~~~~~~~~~~~L~~~i~~~~F~GK~Ge~l~l~~~~g~~~~rViLVGLG~~~~~~~~-~~ra~a~~~~~l~~~~ 101 (494) ...+.+.....+....+.|+++++..+|+||.||++.++.+++...+||++||||+.++++.+ .+++++++++.+++.+ T Consensus 7 ~~~l~~~~~~ld~~~~g~i~~~~~~~~f~Gk~G~~~~l~~~~~~~~~rillvGlG~~~~~~~~~~r~a~~~~~~~l~~~~ 86 (126) T pfam02789 7 DGKLSPAAEKLDEALDGLLSELLKEGDFSGKAGETLLLPSLPGVKAKRVLLVGLGKEEELSAEALRKAAGAAARALKGLK 86 (126) T ss_pred CCCCCHHHHHHHHHHCCHHHHHHHCCCCCCCCCCEEEEECCCCCCCCEEEEEEECCCCCCCHHHHHHHHHHHHHHHHHCC T ss_conf 98668799998698551999999838998777728998578998744899997156553167999999999999997489 Q ss_pred CEEEEECCCCCCCC-HHHHHHHHHHHHHCCCCCCCCCCCC Q ss_conf 74899417789999-8999999999971477754444364 Q gi|254780392|r 102 NIEIFVDVPEYPIT-KAEIRDLVLGFMLKTYTFDQYKTKK 140 (494) Q Consensus 102 ~~~v~i~l~~~~~~-~~~~~~~~eG~~L~~Y~Fd~yKskk 140 (494) .+++.+.+++.... .+.+++++||+.|++|+||+|||+| T Consensus 87 ~~~i~~~l~~~~~~~~~~~~~~aeg~~L~~Y~f~~ykskk 126 (126) T pfam02789 87 VKTVAVDLPTLNIDAAEDARAAAEGALLGAYRFDRYKSKK 126 (126) T ss_pred CCEEEEECCCCCCCCHHHHHHHHHHHHHCCCCCCCCCCCC T ss_conf 8779998787787849999999999997447467546798 No 8 >PRK06837 acetylornithine deacetylase; Provisional Probab=95.33 E-value=0.28 Score=29.05 Aligned_cols=135 Identities=16% Similarity=0.208 Sum_probs=73.8 Q ss_pred HHHHHHHHHHCCCCCEEECHHHHHHHHHH-HHCCCCCHHHHCCC--CCCHH-HHHHHHHHHCCCCCCCEEEEEECCCCCC Q ss_conf 99999864202125403029999999998-61133210000111--00013-4589887420257785068862057886 Q gi|254780392|r 175 VKGVNLARDIINEPANVLGTDEFCKQVRK-LESLGVAVEILDKE--AMHKL-GMNALLAVAQGSSRPPYLAVMKWEGGDS 250 (494) Q Consensus 175 aegv~lARDLvN~PaN~ltP~~lA~~a~~-l~~~gl~v~Vld~~--~L~kl-GMg~llaVgqGS~~~Prli~l~Y~g~~~ 250 (494) .+-+.|+++|+..|+.-=.-...++.+.+ |++.|++++.+..+ .+... |++- +.-+.+..| -|+-.|+++.. T Consensus 20 de~v~ll~~Lv~ipS~~g~e~~~~~~l~~~l~~~G~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~-nvv~~~~~~~~ 95 (427) T PRK06837 20 DAQVAFTQDLVRFPSTRGAEAPCQDFLADAFRARGYEVDRFHIDVDDLADHPGAGP---IEIDYSGAP-IVVGTYRPSTG 95 (427) T ss_pred HHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHCCCCEEEEECCHHHHHHCCCCCC---CCCCCCCCC-EEEEEECCCCC T ss_conf 88999999996899969764999999999999779936997516355542467675---332346775-08998347899 Q ss_pred CCCCEEEEECCEEECCCCCC----------CCC----CCCCHHHHHHHHHHHHHHHHHHHHHHCCC--CCCEEEEEEECC Q ss_conf 56512663021120066310----------033----33200232211235689999999987087--872899864102 Q gi|254780392|r 251 EEQPLAFIGKGVVFDTGGIS----------IKP----SHGMEEMKGDLGGAAAVTGLLHVLAERKA--KINAIGVLALVE 314 (494) Q Consensus 251 ~~~piaLVGKGvTFDtGGis----------lKp----~~~M~~Mk~DM~GaAav~g~~~aia~l~~--~v~v~~~~~~~E 314 (494) ..+++.|-|-==|...|-.+ ++- +.|--+|| ||-|+.+-+++++.+++. +-+|+.....=| T Consensus 96 ~G~~l~l~gH~DvVP~g~~~~W~~dPF~~~i~dg~lyGRGa~DmK---gg~aa~l~A~~~l~~~~~~~~~~i~l~~v~dE 172 (427) T PRK06837 96 TGRSLILQGHIDVVPEGPLEMWSTPPFEPVIRDGWMYGRGAQDMK---AGLAAMLFALDAIRAAGLAPAARVHFQSVVEE 172 (427) T ss_pred CCCEEEEECCCCCCCCCCCCCCCCCCCCEEEECCEEECCCHHHCC---HHHHHHHHHHHHHHHHCCCCCCCEEEEEEECC T ss_conf 998899926887069998766746997509989999857757502---10899999999999728898997789985323 Q ss_pred CC Q ss_conf 66 Q gi|254780392|r 315 NM 316 (494) Q Consensus 315 N~ 316 (494) -. T Consensus 173 E~ 174 (427) T PRK06837 173 ES 174 (427) T ss_pred CC T ss_conf 33 No 9 >PRK06915 acetylornithine deacetylase; Validated Probab=94.82 E-value=0.38 Score=28.08 Aligned_cols=135 Identities=19% Similarity=0.236 Sum_probs=73.2 Q ss_pred HHHHHHHHHHCCCCCEEECHHHHHHHHHH-HHCCCCCHHHHCCC--CCCHHHHHHHHHHHCCCCCCCEEEEEECCCCCCC Q ss_conf 99999864202125403029999999998-61133210000111--0001345898874202577850688620578865 Q gi|254780392|r 175 VKGVNLARDIINEPANVLGTDEFCKQVRK-LESLGVAVEILDKE--AMHKLGMNALLAVAQGSSRPPYLAVMKWEGGDSE 251 (494) Q Consensus 175 aegv~lARDLvN~PaN~ltP~~lA~~a~~-l~~~gl~v~Vld~~--~L~klGMg~llaVgqGS~~~Prli~l~Y~g~~~~ 251 (494) .+-+.+.++||+.|+---.....++.+.+ |++.|++++++.-. ++.. .+.+.+.-..-..-| =|+-+++|.++ T Consensus 17 ~e~i~ll~~LV~i~S~sg~e~~~~~~i~~~l~~~G~~~e~~~~~~~~~~~--~~~~~~~~~~~~~~~-nvia~~~g~~~- 92 (422) T PRK06915 17 EESVKLLKRLIQEKSVSGDESGAQAIVIEKLRELGLDLDIWEPSFSKLKD--HPYFVSPRTSFSDSP-NIVATLKGSGD- 92 (422) T ss_pred HHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHCCCCEEEEECCCHHHCC--CCCCCCCCCCCCCCC-EEEEEECCCCC- T ss_conf 98999999995799989787889999999999779958863124101024--764334444568887-49999668899- Q ss_pred CCCEEEEECCEEECCC---CCCCCC-----------CCCCHHHHHHHHHHHHHHHHHHHHHHCCCC--CCEEEEEEECCC Q ss_conf 6512663021120066---310033-----------332002322112356899999999870878--728998641026 Q gi|254780392|r 252 EQPLAFIGKGVVFDTG---GISIKP-----------SHGMEEMKGDLGGAAAVTGLLHVLAERKAK--INAIGVLALVEN 315 (494) Q Consensus 252 ~~piaLVGKGvTFDtG---GislKp-----------~~~M~~Mk~DM~GaAav~g~~~aia~l~~~--v~v~~~~~~~EN 315 (494) .+.+.|.|-==|-..| +-+-.| +.|-.+|| ||-|+.+-+++++.+.+.+ .+|+.+.-.-|- T Consensus 93 g~~lll~gH~DvVP~gd~~~W~~~PF~~~i~dg~lyGRGa~DmK---gg~aa~l~a~~~l~~~g~~~~~~v~~~~~~dEE 169 (422) T PRK06915 93 GKSMILNGHIDVVPEGDVKQWDHHPYSGERIGGRIYGRGTTDMK---GGNVALMLAMEALIESGIELKGDIYFQSVIEEE 169 (422) T ss_pred CCEEEEECCCCCCCCCCCCCCCCCCCCCEEECCEEECCCCCCCC---CCHHHHHHHHHHHHHCCCCCCCCEEEEEEEEEC T ss_conf 98899968778679998665768998758879999767865555---419999999999997099999982799998542 Q ss_pred C Q ss_conf 6 Q gi|254780392|r 316 M 316 (494) Q Consensus 316 ~ 316 (494) . T Consensus 170 ~ 170 (422) T PRK06915 170 S 170 (422) T ss_pred C T ss_conf 3 No 10 >PRK07522 acetylornithine deacetylase; Provisional Probab=91.03 E-value=1.4 Score=24.20 Aligned_cols=40 Identities=13% Similarity=0.299 Sum_probs=25.4 Q ss_pred HHHHHHHHHCCCCCEEECHH-HHHHHHHH-HHCCCCCHHHHC Q ss_conf 99998642021254030299-99999998-611332100001 Q gi|254780392|r 176 KGVNLARDIINEPANVLGTD-EFCKQVRK-LESLGVAVEILD 215 (494) Q Consensus 176 egv~lARDLvN~PaN~ltP~-~lA~~a~~-l~~~gl~v~Vld 215 (494) +.+.+.|+||.-|+---.++ ..++..++ |++.|++++++. T Consensus 6 ~~v~ll~~LV~i~Svs~~~~~~~~~~l~~~l~~~G~~~~~~~ 47 (387) T PRK07522 6 SSLDILERLVAFDTVSRDSNLALIEWVRDYLAAHGIESELIH 47 (387) T ss_pred CHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHCCCCEEEEE T ss_conf 899999998099895998459999999999997799189997 No 11 >PRK13013 succinyl-diaminopimelate desuccinylase; Reviewed Probab=89.17 E-value=1.4 Score=24.17 Aligned_cols=125 Identities=22% Similarity=0.241 Sum_probs=67.2 Q ss_pred HHHHHHHHHHCCCCCEE---ECHHHHHHHHHH-HHCCCCCHHHHCCCCCCHHHHHHHHHHHCCCCCCCEE-EEEECCCCC Q ss_conf 99999864202125403---029999999998-6113321000011100013458988742025778506-886205788 Q gi|254780392|r 175 VKGVNLARDIINEPANV---LGTDEFCKQVRK-LESLGVAVEILDKEAMHKLGMNALLAVAQGSSRPPYL-AVMKWEGGD 249 (494) Q Consensus 175 aegv~lARDLvN~PaN~---ltP~~lA~~a~~-l~~~gl~v~Vld~~~L~klGMg~llaVgqGS~~~Prl-i~l~Y~g~~ 249 (494) .+=+.+.|+||..|+=- ......++...+ |++.|+++++++..-. .|.. ..-||. ++.+++|+. T Consensus 14 de~v~ll~~LV~ipSv~p~g~~~~~~a~~l~~~l~~~G~~~~~~~~~g~----------~g~~-~~~~r~Nlva~~~g~~ 82 (427) T PRK13013 14 DDLVALTQDLIRIPTLNPPGRNYREICEMLAARLAPRGFEVELIRAHGA----------PGDS-ETYPRWNLVARRQGAR 82 (427) T ss_pred HHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHCCCEEEEEECCCC----------CCCC-CCCCCEEEEEEECCCC T ss_conf 8799999999589998989959899999999999977994899965788----------7754-5565237999967999 Q ss_pred CCCCCEEE--------EECCEEECCCCCCCCC----CCCCHHHHHHHHHHHHHHHHHHHHHHCCCC--CCEEEEEEECC Q ss_conf 65651266--------3021120066310033----332002322112356899999999870878--72899864102 Q gi|254780392|r 250 SEEQPLAF--------IGKGVVFDTGGISIKP----SHGMEEMKGDLGGAAAVTGLLHVLAERKAK--INAIGVLALVE 314 (494) Q Consensus 250 ~~~~piaL--------VGKGvTFDtGGislKp----~~~M~~Mk~DM~GaAav~g~~~aia~l~~~--v~v~~~~~~~E 314 (494) . .+.|.| +|.|=|+|-=+-.+|- +.|-.+|| ||-|+.+-+++++.+++++ -+|+.+...-| T Consensus 83 ~-~~~l~l~gH~DvVp~gd~W~~dPf~~~i~dg~lYGRGa~DmK---gg~aa~l~A~~~l~~~~~~~~g~v~~~~~~dE 157 (427) T PRK13013 83 P-GDCVHFNSHHDVVEVGHGWTRDPFGGEVEDGRIYGRGACDMK---GGLAASIIAAEAFIAVYPDFAGAIEISATADE 157 (427) T ss_pred C-CCEEEEECCCCCCCCCCCCCCCCCCCEEECCEEECCCCCCCC---CCHHHHHHHHHHHHHHCCCCCCEEEEEEEECC T ss_conf 9-877999688586389998877999767889999876620036---86799999999999746566861899999633 No 12 >cd00986 PDZ_LON_protease PDZ domain of ATP-dependent LON serine proteases. Most PDZ domains bind C-terminal polypeptides, though binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this bacterial subfamily of protease-associated PDZ domains a C-terminal beta-strand is thought to form the peptide-binding groove base, a circular permutation with respect to PDZ domains found in Eumetazoan signaling proteins. Probab=88.53 E-value=0.16 Score=30.70 Aligned_cols=51 Identities=20% Similarity=0.342 Sum_probs=36.4 Q ss_pred EEEEEECCCCCCCCCCCHHHHHCCCCCCEEEEECCCCCCEEEEEHHHHHHHHHCCCCEEEEE Q ss_conf 99864102668999747077630248958886347866403652367887520796135521 Q gi|254780392|r 307 IGVLALVENMPGSSAQRPGDIVRSMSGQTIEVINTDAEGRLILADALWYCRTHYNPHLMIDL 368 (494) Q Consensus 307 ~~~~~~~EN~~~~~a~~pgdvi~~~~G~tvei~nTDAEGRlvLaD~l~ya~~~~~p~~iid~ 368 (494) +-+....+|+|...+.++||||++-+|+.|.= ..|-..|. ...+|---|.+ T Consensus 10 v~V~~V~~gsPA~~~Lk~GDvI~~vdGk~v~~----------~~~l~~~i-~~~~~Gd~V~l 60 (79) T cd00986 10 VYVTSVVEGMPAAGKLKAGDHIIAVDGKPFKE----------AEELIDYI-QSKKEGDTVKL 60 (79) T ss_pred EEEEEECCCCCHHHCCCCCCEEEEECCEECCC----------HHHHHHHH-HCCCCCCEEEE T ss_conf 89999679997377077899999999989579----------99999999-65999998999 No 13 >PRK13004 peptidase; Reviewed Probab=85.33 E-value=3.1 Score=21.70 Aligned_cols=115 Identities=16% Similarity=0.224 Sum_probs=60.5 Q ss_pred HHHHHHHHHHCCCCCEEECHHHHHHHHHH-HHCCCCCHHHHCCCCCCHHHHHHHHHHHCCCCCCCEEEEEEC---CCCCC Q ss_conf 99999864202125403029999999998-611332100001110001345898874202577850688620---57886 Q gi|254780392|r 175 VKGVNLARDIINEPANVLGTDEFCKQVRK-LESLGVAVEILDKEAMHKLGMNALLAVAQGSSRPPYLAVMKW---EGGDS 250 (494) Q Consensus 175 aegv~lARDLvN~PaN~ltP~~lA~~a~~-l~~~gl~v~Vld~~~L~klGMg~llaVgqGS~~~Prli~l~Y---~g~~~ 250 (494) .+-+.+.++||..|+--..-...++..++ |++.|++....| ++|-+++.-.+ .+|+++..-+ -|.+. T Consensus 15 ~e~v~ll~~LV~ipS~sg~E~~~a~~l~~~l~~~G~~~~~~d-------~~~nv~a~~~~--g~~~ill~gH~DvVP~g~ 85 (397) T PRK13004 15 ADMTRFLRDLIAIPSESGDEKRVVHRIKEEMEKVGFDKVEID-------PMGNVLGYIGH--GKRLIAMDAHIDTVGVGD 85 (397) T ss_pred HHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHCCCCEEEEC-------CCCEEEEEECC--CCCEEEEECCCCCCCCCC T ss_conf 999999999818999498999999999999987799779988-------99709999589--997799987517889998 Q ss_pred -C---CCCEEE-EECCEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEE Q ss_conf -5---651266-30211200663100333320023221123568999999998708787289986 Q gi|254780392|r 251 -E---EQPLAF-IGKGVVFDTGGISIKPSHGMEEMKGDLGGAAAVTGLLHVLAERKAKINAIGVL 310 (494) Q Consensus 251 -~---~~piaL-VGKGvTFDtGGislKp~~~M~~Mk~DM~GaAav~g~~~aia~l~~~v~v~~~~ 310 (494) . ..|... +--|.-| +.|-.+|| ||.|+.+.+++++.+++++.+..-++ T Consensus 86 ~~~W~~dPf~~~~~dg~ly---------GRGa~DmK---g~~aa~l~A~~~l~~~~~~~~~~l~~ 138 (397) T PRK13004 86 IKNWDFDPFEGMETDGRIY---------GRGTSDQK---GGMASMVYAGKIIKDLGLEDEYTLYV 138 (397) T ss_pred CCCCCCCCCCCEEECCEEE---------ECCCCCCC---CCHHHHHHHHHHHHHCCCCCCCEEEE T ss_conf 6668569966589899999---------46865454---64899999999999629999826999 No 14 >PRK04443 acetyl-lysine deacetylase; Provisional Probab=83.32 E-value=3.7 Score=21.12 Aligned_cols=36 Identities=22% Similarity=0.294 Sum_probs=15.3 Q ss_pred HHHHHHHHCCCCCEEECHHHHHHHHHH-HHCCCCCHH Q ss_conf 999864202125403029999999998-611332100 Q gi|254780392|r 177 GVNLARDIINEPANVLGTDEFCKQVRK-LESLGVAVE 212 (494) Q Consensus 177 gv~lARDLvN~PaN~ltP~~lA~~a~~-l~~~gl~v~ 212 (494) .+.|.++|+..|+---.-...|+...+ |++.|+++. T Consensus 8 ~v~ll~~Li~i~S~sg~E~~~a~~l~~~l~~~G~~~~ 44 (352) T PRK04443 8 AVELLKRLVEIPSPSGKEAEAAEFLVEFMESLGFDAF 44 (352) T ss_pred HHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHCCCCEE T ss_conf 9999999829999194889999999999986899269 No 15 >PRK07906 hypothetical protein; Provisional Probab=78.31 E-value=5.4 Score=19.97 Aligned_cols=14 Identities=7% Similarity=-0.002 Sum_probs=7.4 Q ss_pred HHHHHHHHCCCCCC Q ss_conf 89999998089983 Q gi|254780392|r 39 SIVMRAVSVKNFIG 52 (494) Q Consensus 39 ~~L~~~i~~~~F~G 52 (494) ..|.++++..+|+- T Consensus 40 ~~l~~~l~~~G~~~ 53 (437) T PRK07906 40 EYVAEQLAEVGLEP 53 (437) T ss_pred HHHHHHHHHCCCCE T ss_conf 99999999779957 No 16 >PRK05111 acetylornithine deacetylase; Provisional Probab=75.43 E-value=6.4 Score=19.44 Aligned_cols=12 Identities=25% Similarity=0.517 Sum_probs=6.7 Q ss_pred CHHHHHHHHHHH Q ss_conf 589999999976 Q gi|254780392|r 480 GVRLLDEFVRAF 491 (494) Q Consensus 480 ~vr~l~~~~~~~ 491 (494) .++.+.++|++| T Consensus 369 ~~~i~~~~I~~f 380 (383) T PRK05111 369 TRELLRQLIHHF 380 (383) T ss_pred HHHHHHHHHHHH T ss_conf 999999999987 No 17 >TIGR01910 DapE-ArgE acetylornithine deacetylase or succinyl-diaminopimelate desuccinylase; InterPro: IPR010182 This group of sequences contains annotations for both acetylornithine deacetylase and succinyl-diaminopimelate desuccinylase, but does not contain any members with experimental characterization. Bacillus, Staphylococcus and Sulfolobus species contain multiple hits to this subfamily and each may have a separate activity. Determining which is which must await further laboratory research.; GO: 0009014 succinyl-diaminopimelate desuccinylase activity, 0046872 metal ion binding, 0009085 lysine biosynthetic process. Probab=75.06 E-value=6.6 Score=19.37 Aligned_cols=90 Identities=13% Similarity=0.164 Sum_probs=48.1 Q ss_pred CCH-HHHCCCCCCHHHHHHHHHHHCCCCC------CCEEEEEECCCCCCCCCCEEEEECCEEECCCCCCCCC-CCCCHHH Q ss_conf 210-0001110001345898874202577------8506886205788656512663021120066310033-3320023 Q gi|254780392|r 209 VAV-EILDKEAMHKLGMNALLAVAQGSSR------PPYLAVMKWEGGDSEEQPLAFIGKGVVFDTGGISIKP-SHGMEEM 280 (494) Q Consensus 209 l~v-~Vld~~~L~klGMg~llaVgqGS~~------~Prli~l~Y~g~~~~~~piaLVGKGvTFDtGGislKp-~~~M~~M 280 (494) +.+ .|.||++- .-|.|+...+-||-.. ++-.+++.=-++.+ ...++.==||+.-=.=...=|. .++|. T Consensus 144 ~~l~~V~dEE~~-~g~~G~~~~~~~G~~~~faivaD~d~~li~EPsgek-~~~~~~~~kG~~~~~~~v~Gk~~H~s~p-- 219 (427) T TIGR01910 144 IILQAVVDEEET-SGEAGTLYLLQKGYFKDFAIVADADGVLIPEPSGEK-GDKIVIGQKGSIWFKVKVKGKQAHASTP-- 219 (427) T ss_pred EEEEEECCHHCC-CCCHHHHHHHHHHHHHCCEEECCHHHEEECCCCCCC-CCEEEEEEEEEEEEEEEEEEEEECCCCC-- T ss_conf 799885370105-522568999997654134233472000743888876-5525665313689999962023024576-- Q ss_pred HHHHHHHHHHHHHHHHHHHCCCCCC Q ss_conf 2211235689999999987087872 Q gi|254780392|r 281 KGDLGGAAAVTGLLHVLAERKAKIN 305 (494) Q Consensus 281 k~DM~GaAav~g~~~aia~l~~~v~ 305 (494) --|.=|+.-+++.+.+|+==.. T Consensus 220 ---~~G~nAi~~~~k~~~~l~~~~~ 241 (427) T TIGR01910 220 ---EEGVNAIEKLAKLITELEELEE 241 (427) T ss_pred ---CCHHHHHHHHHHHHHHHHHHHH T ss_conf ---5238899999999999988876 No 18 >cd00987 PDZ_serine_protease PDZ domain of tryspin-like serine proteases, such as DegP/HtrA, which are oligomeric proteins involved in heat-shock response, chaperone function, and apoptosis. May be responsible for substrate recognition and/or binding, as most PDZ domains bind C-terminal polypeptides, though binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of protease-associated PDZ domains a C-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in Eumetazoan signaling proteins. Probab=72.02 E-value=1.7 Score=23.44 Aligned_cols=59 Identities=20% Similarity=0.346 Sum_probs=39.2 Q ss_pred HCCCC-CCEEEEEEECCCCCCCCC-CCHHHHHCCCCCCEEEEECCCCCCEEEEEHHHHHHHHHCCCCEEEEE Q ss_conf 70878-728998641026689997-47077630248958886347866403652367887520796135521 Q gi|254780392|r 299 ERKAK-INAIGVLALVENMPGSSA-QRPGDIVRSMSGQTIEVINTDAEGRLILADALWYCRTHYNPHLMIDL 368 (494) Q Consensus 299 ~l~~~-v~v~~~~~~~EN~~~~~a-~~pgdvi~~~~G~tvei~nTDAEGRlvLaD~l~ya~~~~~p~~iid~ 368 (494) +++++ .+=+.+....+|.|..+| .+|||||+..||+.|+=.+ |-..+ ..+++|...+.+ T Consensus 17 ~~~~~~~~Gv~V~~V~~~spA~~aGl~~GDiI~~ing~~i~~~~----------~~~~~-l~~~~~g~~v~~ 77 (90) T cd00987 17 ELGLKDTKGVLVASVDPGSPAAKAGLKPGDVILAVNGKPVKSVA----------DLRRA-LAELKPGDKVTL 77 (90) T ss_pred HCCCCCCCEEEEEEECCCCHHHHCCCCCCCEEEEECCEECCCHH----------HHHHH-HHHCCCCCEEEE T ss_conf 84989997799999899995998299999899999999938999----------99999-982699987999 No 19 >PRK13983 diaminopimelate aminotransferase; Provisional Probab=71.62 E-value=7.9 Score=18.82 Aligned_cols=33 Identities=18% Similarity=-0.065 Sum_probs=19.6 Q ss_pred HHHHHHHHHHHHHHHHCCC--CCCEEEEEEECCCC Q ss_conf 1235689999999987087--87289986410266 Q gi|254780392|r 284 LGGAAAVTGLLHVLAERKA--KINAIGVLALVENM 316 (494) Q Consensus 284 M~GaAav~g~~~aia~l~~--~v~v~~~~~~~EN~ 316 (494) ++|-|+.+-+++++.+.+. +.+|..+.-..|-. T Consensus 120 Kgg~aa~l~A~~~l~~~~~~~~~~i~~~~~~dEE~ 154 (399) T PRK13983 120 GQGIVSSLLAAKALMELGIRPKYNLGLAFVSDEET 154 (399) T ss_pred CCHHHHHHHHHHHHHHHCCCCCCCEEEEEECCCCC T ss_conf 40389999999999984899997189998624035 No 20 >PRK00466 acetyl-lysine deacetylase; Validated Probab=71.06 E-value=8.1 Score=18.73 Aligned_cols=12 Identities=17% Similarity=0.158 Sum_probs=5.2 Q ss_pred HHHHHHHHCCCC Q ss_conf 899999980899 Q gi|254780392|r 39 SIVMRAVSVKNF 50 (494) Q Consensus 39 ~~L~~~i~~~~F 50 (494) ..|.++++..+| T Consensus 34 ~~l~~~l~~~g~ 45 (345) T PRK00466 34 DFFEKIASELNL 45 (345) T ss_pred HHHHHHHHHCCC T ss_conf 999999998799 No 21 >PRK13009 succinyl-diaminopimelate desuccinylase; Reviewed Probab=70.80 E-value=8.2 Score=18.70 Aligned_cols=16 Identities=13% Similarity=0.061 Sum_probs=7.7 Q ss_pred CHHHHHHHHHHHHHHC Q ss_conf 9899999999899856 Q gi|254780392|r 390 NDVLAEQLLSSGLSTG 405 (494) Q Consensus 390 ~~~~~~~~~~a~~~~g 405 (494) .+++.+++.+.-+..+ T Consensus 264 ~e~v~~~i~~~~~~~~ 279 (375) T PRK13009 264 AESLKARVEAILDKHG 279 (375) T ss_pred HHHHHHHHHHHHHHCC T ss_conf 9999999999998749 No 22 >cd00991 PDZ_archaeal_metalloprotease PDZ domain of archaeal zinc metalloprotases, presumably membrane-associated or integral membrane proteases, which may be involved in signalling and regulatory mechanisms. May be responsible for substrate recognition and/or binding, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of protease-associated PDZ domains a C-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in Eumetazoan signaling proteins. Probab=69.11 E-value=2.5 Score=22.38 Aligned_cols=59 Identities=19% Similarity=0.371 Sum_probs=43.8 Q ss_pred CCEEEEEEECCCCCCC-CCCCHHHHHCCCCCCEEEEECCCCCCEEEEEHHHHHHHHHCCCCEEEEECCCCH Q ss_conf 7289986410266899-974707763024895888634786640365236788752079613552111200 Q gi|254780392|r 304 INAIGVLALVENMPGS-SAQRPGDIVRSMSGQTIEVINTDAEGRLILADALWYCRTHYNPHLMIDLATLTG 373 (494) Q Consensus 304 v~v~~~~~~~EN~~~~-~a~~pgdvi~~~~G~tvei~nTDAEGRlvLaD~l~ya~~~~~p~~iid~ATLTG 373 (494) ++=+-+....++.|.. .-.+|||||++.||+.|+ -+.|-..+. .+++|.--|.+-.|-+ T Consensus 9 ~~Gv~V~~V~~gsPA~~AGL~~GDVI~~Ing~~I~----------~~~d~~~~l-~~~~pG~~v~v~v~R~ 68 (79) T cd00991 9 VAGVVIVGVIVGSPAENAVLHTGDVIYSINGTPIT----------TLEDFMEAL-KPTKPGEVITVTVLPS 68 (79) T ss_pred CCCEEEEEECCCCHHHHCCCCCCCEEEEECCEECC----------CHHHHHHHH-HCCCCCCEEEEEEEEC T ss_conf 59779999678996998699988899998999987----------999999999-6189999899999989 No 23 >PRK07338 hypothetical protein; Provisional Probab=67.86 E-value=9.4 Score=18.28 Aligned_cols=132 Identities=10% Similarity=0.072 Sum_probs=72.8 Q ss_pred HHHHHHHHHHCCCCCEEECHHHHHHHHHH----HHCCCCCHHHHCCCCCCHHHHHHHHHHHCCCCCCCEEEEEECCCCCC Q ss_conf 99999864202125403029999999998----61133210000111000134589887420257785068862057886 Q gi|254780392|r 175 VKGVNLARDIINEPANVLGTDEFCKQVRK----LESLGVAVEILDKEAMHKLGMNALLAVAQGSSRPPYLAVMKWEGGDS 250 (494) Q Consensus 175 aegv~lARDLvN~PaN~ltP~~lA~~a~~----l~~~gl~v~Vld~~~L~klGMg~llaVgqGS~~~Prli~l~Y~g~~~ 250 (494) .+-+.+-++|||..+---+++-..+.+.- |+..|++++.++.......+..+. -+...-...++.+++|++ T Consensus 17 ~~~l~~l~~lV~i~S~S~~~~gv~~v~~~l~~~l~~lg~~v~~~~~~~~~~~~~~g~----~~~~~~g~~l~~~~~~~~- 91 (407) T PRK07338 17 AAMVDQLHAWCAINSGSGNLDGLARMAETLADAFAALPGEIELITLPPVEVIDADGR----VVEQSHGPALHVRVRPEA- 91 (407) T ss_pred HHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCCCC----CCCCCCCCEEEEEECCCC- T ss_conf 999999999947889999999999999999999986799589971687432255675----233566866998626999- Q ss_pred CCCCEEEEECCEE-ECCCCCCCCC---------CCCCHHHHHHHHHHHHHHHHHHHHHHCCCC--CCEEEEEEECCC Q ss_conf 5651266302112-0066310033---------332002322112356899999999870878--728998641026 Q gi|254780392|r 251 EEQPLAFIGKGVV-FDTGGISIKP---------SHGMEEMKGDLGGAAAVTGLLHVLAERKAK--INAIGVLALVEN 315 (494) Q Consensus 251 ~~~piaLVGKGvT-FDtGGislKp---------~~~M~~Mk~DM~GaAav~g~~~aia~l~~~--v~v~~~~~~~EN 315 (494) .++|.|.|-==| |-.|-.-=+| +.|-.+|| +|-|+.+.+++++.+.+.. .+|..++-.-|- T Consensus 92 -~~~lll~gH~DTV~p~g~~~~~~~~~~~grlyG~G~~DmK---gGla~~l~Al~al~~~~~~~~~~i~v~~t~DEE 164 (407) T PRK07338 92 -PRQVLLTGHMDTVYPADHPFQTLSLLDDGTLNGPGVADMK---GGIAVMLAALAAFENSPLADRLGYDVLINSDEE 164 (407) T ss_pred -CCEEEEECCCCEECCCCCCCCCCEEECCCEEECCCCCCCC---CCHHHHHHHHHHHHHCCCCCCCCEEEEEEEEEC T ss_conf -8228998115666799986768547249989836702254---209999999999985687788748999973034 No 24 >pfam00122 E1-E2_ATPase E1-E2 ATPase. Probab=65.21 E-value=6.2 Score=19.56 Aligned_cols=64 Identities=27% Similarity=0.296 Sum_probs=33.6 Q ss_pred HHHHHHCCCCCCEEEEEEECCCCCCCCCCCHHHHHCCCCCCEEEEECCCCCCEEEEEHHHHHHHHHCCCCEEEEECCCCH Q ss_conf 99998708787289986410266899974707763024895888634786640365236788752079613552111200 Q gi|254780392|r 294 LHVLAERKAKINAIGVLALVENMPGSSAQRPGDIVRSMSGQTIEVINTDAEGRLILADALWYCRTHYNPHLMIDLATLTG 373 (494) Q Consensus 294 ~~aia~l~~~v~v~~~~~~~EN~~~~~a~~pgdvi~~~~G~tvei~nTDAEGRlvLaD~l~ya~~~~~p~~iid~ATLTG 373 (494) ++.+..+.++.-+..+=-=-+-.++.+-.+|||+|.-..|-+|- |.||++ ..+..+|-+.||| T Consensus 24 ~~~L~~l~p~~~~~V~rdG~~~~v~~~~l~~GDiv~v~~G~~IP-----aDg~vi------------~g~~~vdes~lTG 86 (222) T pfam00122 24 LKALKKLLPPTAVTVIRDGKEEEIPADELVVGDIVLLKPGDRVP-----ADGRII------------EGSLEVDESALTG 86 (222) T ss_pred HHHHHHCCCCCEEEEEECCEEEEEEHHHCCCCCEEEECCCCEEE-----CCCEEE------------CCCEEEEEHHHCC T ss_conf 99986249580189998999999888997889999989999962-----174575------------1838997222068 Q ss_pred H Q ss_conf 2 Q gi|254780392|r 374 A 374 (494) Q Consensus 374 a 374 (494) - T Consensus 87 E 87 (222) T pfam00122 87 E 87 (222) T ss_pred C T ss_conf 6 No 25 >TIGR02037 degP_htrA_DO protease Do; InterPro: IPR011782 Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes . They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence . Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases . Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base . The geometric orientations of the catalytic residues are similar between families, despite different protein folds . The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) , . Peptidases are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry. Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. This family consists serine peptidases belonging to MEROPS peptidase family S1, subfamily S1C (protease Do, clan PA(S)). They are variously designated DegP, DegQ, heat shock protein HtrA, MucD and protease DO. The ortholog in Pseudomonas aeruginosa is designated MucD and is found in an operon that controls mucoid phenotype. This family also includes the DegQ (HhoA) paralog in Escherichia coli which can rescue a DegP mutant, but not the smaller DegS paralog, which cannot. Members of this family are located in the periplasm and have separable functions as both protease and chaperone. Members have a trypsin domain and two copies of a PDZ domain. This protein protects bacteria from thermal and other stresses and may be important for the survival of bacterial pathogens . The chaperone function is dominant at low temperatures, whereas the proteolytic activity is turned on at elevated temperatures .; GO: 0004252 serine-type endopeptidase activity, 0006508 proteolysis. Probab=64.54 E-value=1.7 Score=23.50 Aligned_cols=31 Identities=26% Similarity=0.531 Sum_probs=14.7 Q ss_pred EEEEEECCCCCCC-CCCCHHHHHCCCCCCEEE Q ss_conf 9986410266899-974707763024895888 Q gi|254780392|r 307 IGVLALVENMPGS-SAQRPGDIVRSMSGQTIE 337 (494) Q Consensus 307 ~~~~~~~EN~~~~-~a~~pgdvi~~~~G~tve 337 (494) |.+.-...+.|.. .-.||||||.+-|++-|. T Consensus 418 v~V~~v~~~s~Aa~~Gl~~GDvI~~vN~~~V~ 449 (484) T TIGR02037 418 VVVTKVVSGSPAARAGLQPGDVILSVNQQPVS 449 (484) T ss_pred EEEEEECCCCHHHHCCCCCCCEEEEECCCCCC T ss_conf 89997338888997178766189950880146 No 26 >PRK13034 serine hydroxymethyltransferase; Reviewed Probab=63.83 E-value=11 Score=17.76 Aligned_cols=58 Identities=21% Similarity=0.256 Sum_probs=27.3 Q ss_pred CCCCEEEEEHHHHHH----HH-HCCCCEEEEECCCCHHHHHHCCCCCEEEEECCHHHHHHHHHH Q ss_conf 866403652367887----52-079613552111200235431776057870898999999998 Q gi|254780392|r 342 DAEGRLILADALWYC----RT-HYNPHLMIDLATLTGAMVVSLGNIYAGLFANNDVLAEQLLSS 400 (494) Q Consensus 342 DAEGRlvLaD~l~ya----~~-~~~p~~iid~ATLTGa~~~alG~~~~g~~~n~~~~~~~~~~a 400 (494) |.-|-++|+|.=..+ .. .-+|=---|+-|-|-+-. --|++-.-+|+||+++++++..| T Consensus 196 d~vGA~l~~D~AHiaGLIA~g~~psP~~~ADvvT~TTHKT-LrGPrGGiIl~~~~el~~kId~a 258 (422) T PRK13034 196 DSVGAILMVDMAHIAGLVAAGRHPNPVPHAHVVTTTTHKT-LRGPRGGMILTNDEEIAKKINSA 258 (422) T ss_pred HHHCCEEEECHHHHHHHHCCCCCCCCCCCCEEEECCCCCC-CCCCCCCEEECCHHHHHHHHHCC T ss_conf 8729779811565545410333699877752886775147-78988435752669999986433 No 27 >PRK10942 serine endoprotease; Provisional Probab=61.25 E-value=2.4 Score=22.42 Aligned_cols=28 Identities=25% Similarity=0.529 Sum_probs=17.1 Q ss_pred EEECCCCCCCCC-CCHHHHHCCCCCCEEE Q ss_conf 641026689997-4707763024895888 Q gi|254780392|r 310 LALVENMPGSSA-QRPGDIVRSMSGQTIE 337 (494) Q Consensus 310 ~~~~EN~~~~~a-~~pgdvi~~~~G~tve 337 (494) .-..+|.|..+| .|+||||++.||+.|+ T Consensus 317 ~~V~~~sPA~kAGL~~GDVI~~vdG~~I~ 345 (474) T PRK10942 317 SQVLPNSSAAKAGIKAGDVITSLNGKPIS 345 (474) T ss_pred EECCCCCCHHHCCCCCCCEEEEECCEECC T ss_conf 20177993677699989999998998968 No 28 >TIGR00093 TIGR00093 conserved hypothetical protein; InterPro: IPR000748 Pseudouridine synthases are responsible for the synthesis of pseudouridine from uracil in ribosomal RNA. Several bacterial proteins from the Rsu family of pseudouridine synthases have been shown to share regions of similarities, including Escherichia coli and Haemophilus influenzae 16S pseudouridylate 516 synthase (gene rsuA); E. coli hypothetical proteins yciL and ymfC and the corresponding H. influenzae proteins HI1199 and HI0694 respectively; Aquifex aeolicus hypothetical proteins AQ_554 and AQ_1464; Bacillus subtilis hypothetical proteins ypuL and ytzF; Borrelia burgdorferi hypothetical protein BB0129; Helicobacter pylori hypothetical protein HP1459; and Synechocystis sp. (strain PCC 6803) hypothetical proteins slr0361 and slr0612. These are proteins of from 25 to 40 kD which contain a number of conserved regions in their central section.; GO: 0003723 RNA binding, 0004730 pseudouridylate synthase activity, 0001522 pseudouridine synthesis. Probab=60.67 E-value=2.1 Score=22.91 Aligned_cols=30 Identities=23% Similarity=0.354 Sum_probs=14.1 Q ss_pred CEEEEECCCCCCCCHHHHHHHHHHHHHCCCC Q ss_conf 7489941778999989999999999714777 Q gi|254780392|r 102 NIEIFVDVPEYPITKAEIRDLVLGFMLKTYT 132 (494) Q Consensus 102 ~~~v~i~l~~~~~~~~~~~~~~eG~~L~~Y~ 132 (494) .+++.|.+.+ ....+.++++.+|+.|..+. T Consensus 67 eK~Y~V~v~g-~v~~~~~~~~~~Gv~L~~~~ 96 (185) T TIGR00093 67 EKTYLVTVEG-PVTDEDLEALRKGVQLEDGK 96 (185) T ss_pred CCEEEEEECC-CCCHHHHHHHHCCCEECCCC T ss_conf 9669999559-85888998987176005873 No 29 >cd05017 SIS_PGI_PMI_1 The members of this protein family contain the SIS (Sugar ISomerase) domain and have both the phosphoglucose isomerase (PGI) and the phosphomannose isomerase (PMI) functions. These functions catalyze the reversible reactions of glucose 6-phosphate to fructose 6-phosphate, and mannose 6-phosphate to fructose 6-phosphate, respectively at an equal rate. This protein contains two SIS domains. This alignment is based on the first SIS domain. Probab=60.13 E-value=5.1 Score=20.17 Aligned_cols=62 Identities=27% Similarity=0.254 Sum_probs=39.9 Q ss_pred HHHHHHHHHHHHHHHHHHCCCCCCEEEEEEECCCCCCCCCCCHHH--HHCCCCCCEEEEECCCCCCEEEEEHHHHHHHH Q ss_conf 211235689999999987087872899864102668999747077--63024895888634786640365236788752 Q gi|254780392|r 282 GDLGGAAAVTGLLHVLAERKAKINAIGVLALVENMPGSSAQRPGD--IVRSMSGQTIEVINTDAEGRLILADALWYCRT 358 (494) Q Consensus 282 ~DM~GaAav~g~~~aia~l~~~v~v~~~~~~~EN~~~~~a~~pgd--vi~~~~G~tvei~nTDAEGRlvLaD~l~ya~~ 358 (494) .-|||+|.+--.++++..-..++.|...-. -+.|. -..+.| |..||||-|-|.++ ++-.|.+ T Consensus 5 ~GmGGS~i~gd~l~~~~~~~~~vPv~v~rd--y~lP~--~v~~~~lvi~~S~SGnTeEtl~-----------~~~~a~~ 68 (119) T cd05017 5 LGMGGSGIGGDLLESLLLDEAKIPVYVVKD--YTLPA--FVDRKTLVIAVSYSGNTEETLS-----------AVEQAKE 68 (119) T ss_pred EECCHHHHHHHHHHHHHHCCCCCCEEEECC--CCCCC--CCCCCCEEEEECCCCCCHHHHH-----------HHHHHHH T ss_conf 956388899999999970579988798489--98987--5587878999928979689999-----------9999998 No 30 >TIGR01043 ATP_syn_A_arch ATP synthase archaeal, A subunit; InterPro: IPR005726 ATPases (or ATP synthases) are membrane-bound enzyme complexes/ion transporters that combine ATP synthesis and/or hydrolysis with the transport of protons across a membrane. ATPases can harness the energy from a proton gradient, using the flux of ions across the membrane via the ATPase proton channel to drive the synthesis of ATP. Some ATPases work in reverse, using the energy from the hydrolysis of ATP to create a proton gradient. There are different types of ATPases, which can differ in function (ATP synthesis and/or hydrolysis), structure (F-, V- and A-ATPases contain rotary motors) and in the type of ions they transport , . F-ATPases (F1F0-ATPases) in mitochondria, chloroplasts and bacterial plasma membranes are the prime producers of ATP, using the proton gradient generated by oxidative phosphorylation (mitochondria) or photosynthesis (chloroplasts). V-ATPases (V1V0-ATPases) are primarily found in eukaryotic vacuoles, catalysing ATP hydrolysis to transport solutes and lower pH in organelles. A-ATPases (A1A0-ATPases) are found in Archaea and function like F-ATPases. P-ATPases (E1E2-ATPases) are found in bacteria and in eukaryotic plasma membranes and organelles, and function to transport a variety of different ions across membranes. E-ATPases are cell-surface enzymes that hydrolyse a range of NTPs, including extracellular ATP. A-ATPases (or A1A0-ATPase) (3.6.3.14 from EC) are found exclusively in Archaea and display a close resemblance in structure and subunit composition with V-ATPases, although their function in both ATP synthesis and ATP hydrolysis is closer to that of F-ATPases . A-ATPases are composed of two linked complexes: the A1 complex consisting of seven subunits contains the catalytic core that synthesizes/hydrolyses ATP, while the A0 complex consisting of at least two subunits forms the membrane-spanning pore . The rotary motor in A-ATPases is composed of only two subunits, the stator subunit I and the rotor subunit C . A-ATPases may have arisen as an adaptation to the different cellular needs and the more extreme environmental conditions faced by Archaeal species. This entry represents the alpha subunit from the A1 complex of A-ATPases. The A1 complex contains three copies each of subunits alpha (or A) and beta (or B) (IPR005724 from INTERPRO), both of which form the headpiece that participates in nucleotide binding. However, only the alpha subunit is catalytic, the beta subunit being regulatory in function . More information about this protein can be found at Protein of the Month: ATP Synthases .; GO: 0016820 hydrolase activity acting on acid anhydrides catalyzing transmembrane movement of substances, 0015986 ATP synthesis coupled proton transport, 0016021 integral to membrane, 0016469 proton-transporting two-sector ATPase complex. Probab=59.83 E-value=5.9 Score=19.70 Aligned_cols=136 Identities=13% Similarity=0.132 Sum_probs=69.9 Q ss_pred CCCEEEEEECCCHHHCCHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCCHHHHH--HHHHHHHHCCCCCCCCCCCCCCCC Q ss_conf 744599994488155899999999999998531697489941778999989999--999999714777544443643332 Q gi|254780392|r 67 VWDRLVVAGIGDPRDANFSWLKAGGSIASYIEEDKNIEIFVDVPEYPITKAEIR--DLVLGFMLKTYTFDQYKTKKRESS 144 (494) Q Consensus 67 ~~~rViLVGLG~~~~~~~~~~ra~a~~~~~l~~~~~~~v~i~l~~~~~~~~~~~--~~~eG~~L~~Y~Fd~yKskk~~~~ 144 (494) .++-|+.||||+.-.--.+.++-.-.+-.---+.....-++.+.+.++-...++ .+-.|+.++-|==|. T Consensus 250 ~a~ivvYiGCGERGNEMt~vl~~FP~l~dp~TG~pLm~RT~LiANTSNMPVAAREas~Y~GiTiAEYfRD~--------- 320 (584) T TIGR01043 250 DADIVVYIGCGERGNEMTEVLEEFPELEDPKTGKPLMERTVLIANTSNMPVAAREASIYTGITIAEYFRDM--------- 320 (584) T ss_pred CCCEEEEECCCCCCCHHHHHHHHCCCCCCCCCCHHHHHCCEEEECCCCCCHHHHHHHHHHCCHHHHHHHCC--------- T ss_conf 47379996168875258999984878688635315654205335267875344011232110253344116--------- Q ss_pred CCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEECHHHHHHHHHHH-HCCCCCHHHHCCCCCCHHH Q ss_conf 321000133322457737899999999999999998642021254030299999999986-1133210000111000134 Q gi|254780392|r 145 SPRSKENISVTIVTEMIQQSSQVVADIQSVVKGVNLARDIINEPANVLGTDEFCKQVRKL-ESLGVAVEILDKEAMHKLG 223 (494) Q Consensus 145 ~~~~~~~~~i~~~~~~~~~~~~~i~~~~aiaegv~lARDLvN~PaN~ltP~~lA~~a~~l-~~~gl~v~Vld~~~L~klG 223 (494) -..+.+..+....-.+++++ +.-+|=.+|+..=+|.|||.+..++ .+.|.-.++-.+++ T Consensus 321 ------GYdv~L~ADSTSRWAEA~Re---------isgRlEEMPgEEGyPaYLasrLaefYERAG~~~~l~~~~~----- 380 (584) T TIGR01043 321 ------GYDVALMADSTSRWAEALRE---------ISGRLEEMPGEEGYPAYLASRLAEFYERAGRVKTLGGEER----- 380 (584) T ss_pred ------CCCEEEEECCCCHHHHHHHH---------HHHHHCCCCCCCCCHHHHHHHHHHHHHHCCEEEECCCCCC----- T ss_conf ------85168872475217899987---------5202102778788624688888776651460223168874----- Q ss_pred HHHHHHHH Q ss_conf 58988742 Q gi|254780392|r 224 MNALLAVA 231 (494) Q Consensus 224 Mg~llaVg 231 (494) .|++.-|| T Consensus 381 ~gsv~v~g 388 (584) T TIGR01043 381 EGSVTVVG 388 (584) T ss_pred EEEEEEEE T ss_conf 47899996 No 31 >KOG0884 consensus Probab=59.19 E-value=3.9 Score=20.97 Aligned_cols=57 Identities=23% Similarity=0.416 Sum_probs=41.5 Q ss_pred HHHCCCCCCCCCCHHHHHHHCCCCC---CEECCCCCCHHHHHHHHHHHHHCCCCCEEEEEEEECCCC Q ss_conf 9856971767768678886179864---170188874679999999987238867699940000116 Q gi|254780392|r 402 LSTGELLWRMPMNEEYNKLIESKFA---DMKNIGGRGAGSIVAAQFLEKFVQDASWAHIDIAGVATG 465 (494) Q Consensus 402 ~~~ge~~w~lPl~~~y~~~~~s~~A---D~~n~~~~~~g~~~aa~fl~~f~~~~~w~HlDiag~a~~ 465 (494) .+-|+.+|--|+.+||.+-||.++- -|.|.|...-|+. -|+--..-.|||+--+.|. T Consensus 60 grgg~siwg~~fede~~~~lkh~~rg~vsmanngp~tn~sq-------ffity~kq~hldmkytvfg 119 (161) T KOG0884 60 GRGGNSIWGKKFEDEYSEYLKHNVRGVVSMANNGPNTNGSQ-------FFITYGKQPHLDMKYTVFG 119 (161) T ss_pred CCCCCCCCCCCCHHHHHHHHHCCCCEEEECCCCCCCCCCCE-------EEEEECCCCCCCEEEEEEE T ss_conf 88875424874328899987506310687132799988756-------8999357766430566541 No 32 >PRK06031 phosphoribosyltransferase; Provisional Probab=57.38 E-value=7.3 Score=19.06 Aligned_cols=26 Identities=27% Similarity=0.336 Sum_probs=12.0 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCCCEE Q ss_conf 99999999999999864202125403 Q gi|254780392|r 166 QVVADIQSVVKGVNLARDIINEPANV 191 (494) Q Consensus 166 ~~i~~~~aiaegv~lARDLvN~PaN~ 191 (494) +.+..|..|..+..|-+.-+-+|.-+ T Consensus 162 DVISSG~Si~a~l~LL~~~G~e~v~i 187 (233) T PRK06031 162 DVISSGASIVAALRLLATCGIEPAGI 187 (233) T ss_pred CHHCCCHHHHHHHHHHHHCCCCEEEE T ss_conf 12215565999999999759973788 No 33 >KOG1349 consensus Probab=56.37 E-value=15 Score=16.89 Aligned_cols=16 Identities=13% Similarity=0.067 Sum_probs=5.8 Q ss_pred HHHHHHHHHHHHCCCC Q ss_conf 6899999999870878 Q gi|254780392|r 288 AAVTGLLHVLAERKAK 303 (494) Q Consensus 288 Aav~g~~~aia~l~~~ 303 (494) |.|++.-+.+-+|+.| T Consensus 47 aNvl~~YrsvKrlGip 62 (309) T KOG1349 47 ANVLSVYRSVKRLGIP 62 (309) T ss_pred HHHHHHHHHHHHCCCC T ss_conf 8899999999980898 No 34 >PRK06931 diaminobutyrate--2-oxoglutarate aminotransferase; Provisional Probab=56.31 E-value=8.8 Score=18.46 Aligned_cols=97 Identities=18% Similarity=0.313 Sum_probs=52.8 Q ss_pred CEEECHHHHHHHHHHHH-CCCCCHHHHCCCCCCHHHHHHHHHHHCCCCCCCEEEEEECCCCCCCCCCEEEEECCEEECCC Q ss_conf 40302999999999861-13321000011100013458988742025778506886205788656512663021120066 Q gi|254780392|r 189 ANVLGTDEFCKQVRKLE-SLGVAVEILDKEAMHKLGMNALLAVAQGSSRPPYLAVMKWEGGDSEEQPLAFIGKGVVFDTG 267 (494) Q Consensus 189 aN~ltP~~lA~~a~~l~-~~gl~v~Vld~~~L~klGMg~llaVgqGS~~~Prli~l~Y~g~~~~~~piaLVGKGvTFDtG 267 (494) +-+..|..|-+.+++++ ++|+ +=|+||-+.==-.-|.+.+ .++-|-. +.|.-+|||+ +| T Consensus 234 G~~~~p~~yl~~lr~lc~~~gi-llI~DEV~tGfGRTG~~fa-------------~e~~gv~---PDIi~~gKgl---gg 293 (454) T PRK06931 234 GVVPAPVSWLQKIREVTQKHGI-LLIVDEVQAGFARSGKMFA-------------FEHAGIE---PDIVVMSKAV---GG 293 (454) T ss_pred CCCCCCHHHHHHHHHHHHHCCE-EEEECCCCCCCCCCCCCCH-------------HCCCCCC---CCEEEEHHHH---CC T ss_conf 9761888998899987511681-8995034437786778412-------------1025889---8960021532---48 Q ss_pred CCCC------------CCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCC Q ss_conf 3100------------3333200232211235689999999987087872 Q gi|254780392|r 268 GISI------------KPSHGMEEMKGDLGGAAAVTGLLHVLAERKAKIN 305 (494) Q Consensus 268 Gisl------------Kp~~~M~~Mk~DM~GaAav~g~~~aia~l~~~v~ 305 (494) |+-+ .|+....++-..--++|+.+.++..+-+.++.-| T Consensus 294 G~P~a~~~~~~~~~~~~~~~h~~T~~Gnpla~Aaala~L~~i~~e~l~~~ 343 (454) T PRK06931 294 GLPLAVLGIKKEFDAWQPAGHTGTFRGNQLAMATGLATLKILRDENLAQN 343 (454) T ss_pred CCCEEEEEEHHHHCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCHHHH T ss_conf 88548998755552467888664568984788999999999986249999 No 35 >TIGR01515 branching_enzym 1,4-alpha-glucan branching enzyme; InterPro: IPR006407 This entry represents the core region of the glycogen branching enzymes, which are responsible for the transfer of chains of approximately seven alpha(1,4)-linked glucosyl residues to other similar chains (in new alpha-(1,6) linkages) in the biosynthesis of glycogen . The branching enzyme is responsible for the degree of alpha(1,6) branch linkages found in polysaccharides .; GO: 0003844 14-alpha-glucan branching enzyme activity, 0005978 glycogen biosynthetic process. Probab=55.35 E-value=6 Score=19.66 Aligned_cols=23 Identities=17% Similarity=0.241 Sum_probs=13.6 Q ss_pred HHHHHHHHHHHHHHHCCCEEEEE Q ss_conf 99999999999853169748994 Q gi|254780392|r 85 SWLKAGGSIASYIEEDKNIEIFV 107 (494) Q Consensus 85 ~~~ra~a~~~~~l~~~~~~~v~i 107 (494) .+++++-++.-.+|..+-+-+-+ T Consensus 149 SY~eL~~~L~pYvK~~GFTHiEL 171 (608) T TIGR01515 149 SYRELADQLIPYVKELGFTHIEL 171 (608) T ss_pred HHHHHHHHHHHHHHHCCCEEEEE T ss_conf 27888631015787508712541 No 36 >PRK08652 acetylornithine deacetylase; Provisional Probab=54.50 E-value=16 Score=16.68 Aligned_cols=16 Identities=6% Similarity=-0.037 Sum_probs=10.1 Q ss_pred CHHHHHHHHHCCCCCC Q ss_conf 8889999998089983 Q gi|254780392|r 37 SPSIVMRAVSVKNFIG 52 (494) Q Consensus 37 ~~~~L~~~i~~~~F~G 52 (494) ....|.++++..+|+- T Consensus 24 ~a~~l~~~l~~~G~~~ 39 (349) T PRK08652 24 IALYIMEFLHSLGYDV 39 (349) T ss_pred HHHHHHHHHHHCCCEE T ss_conf 9999999999779948 No 37 >PRK08554 peptidase; Reviewed Probab=53.96 E-value=16 Score=16.63 Aligned_cols=14 Identities=0% Similarity=0.019 Sum_probs=8.8 Q ss_pred HHHHHHHHCCCCCC Q ss_conf 89999998089983 Q gi|254780392|r 39 SIVMRAVSVKNFIG 52 (494) Q Consensus 39 ~~L~~~i~~~~F~G 52 (494) ..|..+++..+|+- T Consensus 31 ~~l~~~l~~~G~~~ 44 (438) T PRK08554 31 AFIKDTLESWGIES 44 (438) T ss_pred HHHHHHHHHCCCCE T ss_conf 99999999779906 No 38 >PRK04155 chaperone protein HchA; Provisional Probab=53.92 E-value=6 Score=19.65 Aligned_cols=26 Identities=8% Similarity=0.028 Sum_probs=12.5 Q ss_pred CCCCCCCCCCEEEEECCCCCCCCEEEEEECC Q ss_conf 0899836787899843788874459999448 Q gi|254780392|r 47 VKNFIGESKSHLNILAPVDCVWDRLVVAGIG 77 (494) Q Consensus 47 ~~~F~GK~Ge~l~l~~~~g~~~~rViLVGLG 77 (494) ..||+|-... .+....-.+|++|..- T Consensus 34 ktd~~g~~y~-----~~y~g~~~kiL~i~te 59 (288) T PRK04155 34 KTDLDGVDYP-----KPYRGGKWKILMIAAD 59 (288) T ss_pred CCCCCCCCCC-----CCCCCCCCEEEEEEEC T ss_conf 6477898787-----8667887169999843 No 39 >PRK08596 acetylornithine deacetylase; Validated Probab=53.72 E-value=16 Score=16.60 Aligned_cols=40 Identities=18% Similarity=0.214 Sum_probs=22.1 Q ss_pred HHHHHHHHHCCCCCEEE---CHHHHHHHHHH-HHCCCCCHHHHC Q ss_conf 99998642021254030---29999999998-611332100001 Q gi|254780392|r 176 KGVNLARDIINEPANVL---GTDEFCKQVRK-LESLGVAVEILD 215 (494) Q Consensus 176 egv~lARDLvN~PaN~l---tP~~lA~~a~~-l~~~gl~v~Vld 215 (494) +-+.+.++||.-|+--- .-...++...+ |++.|+++++++ T Consensus 14 e~i~ll~~LV~i~S~sp~g~~~~~~~~~l~~~l~~~G~~~~~~~ 57 (421) T PRK08596 14 ELLELTKTLIRFETPAPPARNTNEAQQFIAEFLRKRNFSVDKWD 57 (421) T ss_pred HHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHCCCEEEEEE T ss_conf 99999999958999898986878999999999997799289998 No 40 >PRK06133 glutamate carboxypeptidase; Reviewed Probab=51.50 E-value=18 Score=16.36 Aligned_cols=119 Identities=18% Similarity=0.320 Sum_probs=75.8 Q ss_pred HHHHHHHHHCCCCCEEECHHHHHHHH----HHHHCCCCCHHHHCCCCCCHHHHHHHHHHHCCCCCCCEEEEEECCCCCCC Q ss_conf 99998642021254030299999999----98611332100001110001345898874202577850688620578865 Q gi|254780392|r 176 KGVNLARDIINEPANVLGTDEFCKQV----RKLESLGVAVEILDKEAMHKLGMNALLAVAQGSSRPPYLAVMKWEGGDSE 251 (494) Q Consensus 176 egv~lARDLvN~PaN~ltP~~lA~~a----~~l~~~gl~v~Vld~~~L~klGMg~llaVgqGS~~~Prli~l~Y~g~~~~ 251 (494) +...+-|+|||.-+.-.+.+-..+.+ ++|+..|++++++.... ..--.++.+|+|.+ T Consensus 46 ~~l~~l~~lVni~Sgt~~~~gv~~v~~~~~~~l~~~G~~v~~~~~~~-----------------~~g~~l~a~~~g~g-- 106 (418) T PRK06133 46 AYLDTLRELVSIESGSGDAEGLKQVAALLAERLKALGATVELVPATP-----------------SAGDMVVATFKGTG-- 106 (418) T ss_pred HHHHHHHHHEECCCCCCCHHHHHHHHHHHHHHHHHCCCEEEEECCCC-----------------CCCCEEEEEECCCC-- T ss_conf 99999998735078999768999999999999997799589964788-----------------86887999978999-- Q ss_pred CCCEEEEECC-EEECCCCCCCCC---------CCCCHHHHHHHHHHHHHHHHHHHHHHCCCC--CCEEEEEEECCCC Q ss_conf 6512663021-120066310033---------332002322112356899999999870878--7289986410266 Q gi|254780392|r 252 EQPLAFIGKG-VVFDTGGISIKP---------SHGMEEMKGDLGGAAAVTGLLHVLAERKAK--INAIGVLALVENM 316 (494) Q Consensus 252 ~~piaLVGKG-vTFDtGGislKp---------~~~M~~Mk~DM~GaAav~g~~~aia~l~~~--v~v~~~~~~~EN~ 316 (494) +++|.|+|-= ..|.-|...-.| +.|--+|| +|-|+.+.+++++.+.+.+ .+|+.+.-.-|-. T Consensus 107 ~~~ilL~gH~DTVfp~g~~~~~Pf~~eggr~YGrGv~DMK---gGla~~L~Al~aL~~~g~~~~~~i~v~~t~DEE~ 180 (418) T PRK06133 107 KRKIMLIAHMDTVYLPGSLATQPFRIDGERAYGPGIADDK---GGVAVILHALAILQQQGFKDYGTLTVLFNPDEET 180 (418) T ss_pred CCCEEEEEECCCCCCCCCCCCCCEEEECCEEECCCHHHCC---HHHHHHHHHHHHHHHCCCCCCCEEEEEEECCCCC T ss_conf 8878999436887999965679869889999807656510---7699999999999963998897089999878887 No 41 >PRK07473 carboxypeptidase; Provisional Probab=51.40 E-value=18 Score=16.35 Aligned_cols=117 Identities=15% Similarity=0.198 Sum_probs=60.8 Q ss_pred HHHHHHCCCCCEEECHHHHHHHHH----HHHCCCCCHHHHCCCCCCHHHHHHHHHHHCCCCCCCEEEEEECCCCCCCCCC Q ss_conf 986420212540302999999999----8611332100001110001345898874202577850688620578865651 Q gi|254780392|r 179 NLARDIINEPANVLGTDEFCKQVR----KLESLGVAVEILDKEAMHKLGMNALLAVAQGSSRPPYLAVMKWEGGDSEEQP 254 (494) Q Consensus 179 ~lARDLvN~PaN~ltP~~lA~~a~----~l~~~gl~v~Vld~~~L~klGMg~llaVgqGS~~~Prli~l~Y~g~~~~~~p 254 (494) ..-++|||.-+.-.+++-..+.+. +++..|++++.+...+ +.+ -.++.+|.+.++..+. T Consensus 15 ~~L~~lV~i~S~s~~~~gv~~~~~~~~~~l~~lG~~ve~~~~~~----~~~-------------~~~~~~~~~~~~g~p~ 77 (376) T PRK07473 15 QGLRPWVECESPTWDAAAVNRMLDLAARDMAIMGATIERIAGRQ----GFA-------------DCVRARFPHPRQGEPG 77 (376) T ss_pred HHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHCCCEEEEECCCC----CCC-------------CEEEEEECCCCCCCCC T ss_conf 99999964879999999999999999999986498489956777----778-------------6089981689999987 Q ss_pred EEEEECC-EEECCCCCCCCC---------CCCCHHHHHHHHHHHHHHHHHHHHHHCCC--CCCEEEEEEECCC Q ss_conf 2663021-120066310033---------33200232211235689999999987087--8728998641026 Q gi|254780392|r 255 LAFIGKG-VVFDTGGISIKP---------SHGMEEMKGDLGGAAAVTGLLHVLAERKA--KINAIGVLALVEN 315 (494) Q Consensus 255 iaLVGKG-vTFDtGGislKp---------~~~M~~Mk~DM~GaAav~g~~~aia~l~~--~v~v~~~~~~~EN 315 (494) |.|.|-= ..|..|...-.| +.|--+|| +|-++.+.+++++.+.+. +.+|..++-.-|- T Consensus 78 ill~gH~DTV~p~g~~~~~p~r~eg~r~yGpGv~DMK---gGla~~l~Al~aL~~~g~~~~~~i~~~~t~DEE 147 (376) T PRK07473 78 ILIAGHLDTVHPVGTLEKLPFRREGNKCYGPGILDMK---GGNYLALEAIRQLARASFTTPLPITVLFTPDEE 147 (376) T ss_pred EEEEECCCCCCCCCCCCCCCEEEECCEEECCCCCCCC---CCHHHHHHHHHHHHHCCCCCCCCEEEEEECCCC T ss_conf 8999256888899966678779879999888721254---459999999999997278889758999973787 No 42 >KOG0780 consensus Probab=51.27 E-value=18 Score=16.34 Aligned_cols=171 Identities=21% Similarity=0.262 Sum_probs=81.0 Q ss_pred CCCCCCCCEEEEEECCCHHHCCHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCCHHH-HHHHHHHHHH---CCCCCCCCC Q ss_conf 378887445999944881558999999999999985316974899417789999899-9999999971---477754444 Q gi|254780392|r 62 APVDCVWDRLVVAGIGDPRDANFSWLKAGGSIASYIEEDKNIEIFVDVPEYPITKAE-IRDLVLGFML---KTYTFDQYK 137 (494) Q Consensus 62 ~~~g~~~~rViLVGLG~~~~~~~~~~ra~a~~~~~l~~~~~~~v~i~l~~~~~~~~~-~~~~~eG~~L---~~Y~Fd~yK 137 (494) .|......-|++|||--.-+.+ ..++.+..+++.+.+...+.-+.+-..+.+ +.+-++-... ++|.--++- T Consensus 95 ~~~K~kpsVimfVGLqG~GKTT-----tc~KlA~y~kkkG~K~~LvcaDTFRagAfDQLkqnA~k~~iP~ygsyte~dpv 169 (483) T KOG0780 95 QPKKGKPSVIMFVGLQGSGKTT-----TCTKLAYYYKKKGYKVALVCADTFRAGAFDQLKQNATKARVPFYGSYTEADPV 169 (483) T ss_pred CCCCCCCCEEEEEECCCCCCCE-----EHHHHHHHHHHCCCCEEEEEECCCCCCHHHHHHHHHHHHCCEEEECCCCCCHH T ss_conf 6156897089998305788630-----08999999984687245776022453068999987674077068403665558 Q ss_pred CCCCCCCCCCCCCCCCCCCCCC--CHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEE------CHHHHHHHHHHHHC-CC Q ss_conf 3643332321000133322457--737899999999999999998642021254030------29999999998611-33 Q gi|254780392|r 138 TKKRESSSPRSKENISVTIVTE--MIQQSSQVVADIQSVVKGVNLARDIINEPANVL------GTDEFCKQVRKLES-LG 208 (494) Q Consensus 138 skk~~~~~~~~~~~~~i~~~~~--~~~~~~~~i~~~~aiaegv~lARDLvN~PaN~l------tP~~lA~~a~~l~~-~g 208 (494) .-..+....-+++.+.+.+... -.+.....+++-..+.++++ |-|++ +-..-+.+++++++ .+ T Consensus 170 ~ia~egv~~fKke~fdvIIvDTSGRh~qe~sLfeEM~~v~~ai~--------Pd~vi~VmDasiGQaae~Qa~aFk~~vd 241 (483) T KOG0780 170 KIASEGVDRFKKENFDVIIVDTSGRHKQEASLFEEMKQVSKAIK--------PDEIIFVMDASIGQAAEAQARAFKETVD 241 (483) T ss_pred HHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHHHHHHHCC--------CCEEEEEEECCCCHHHHHHHHHHHHHHC T ss_conf 99999999888639728998278730124899999999985159--------8738999856200767999998877615 Q ss_pred CCHHHHCCCCCCHHHHHHHHHHHCCCCCCCEEEEEECCCCCCCCCCEEEEECCEE Q ss_conf 2100001110001345898874202577850688620578865651266302112 Q gi|254780392|r 209 VAVEILDKEAMHKLGMNALLAVAQGSSRPPYLAVMKWEGGDSEEQPLAFIGKGVV 263 (494) Q Consensus 209 l~v~Vld~~~L~klGMg~llaVgqGS~~~Prli~l~Y~g~~~~~~piaLVGKGvT 263 (494) +---|+.+=+=...|=|+|-+|. ..+.||.|||-|=- T Consensus 242 vg~vIlTKlDGhakGGgAlSaVa------------------aTksPIiFIGtGEh 278 (483) T KOG0780 242 VGAVILTKLDGHAKGGGALSAVA------------------ATKSPIIFIGTGEH 278 (483) T ss_pred CCEEEEEECCCCCCCCCEEEEHH------------------HHCCCEEEEECCCC T ss_conf 40379972256777773453035------------------40798799816755 No 43 >PRK08674 bifunctional phosphoglucose/phosphomannose isomerase; Validated Probab=50.52 E-value=14 Score=16.95 Aligned_cols=15 Identities=20% Similarity=0.180 Sum_probs=6.3 Q ss_pred HHHHHHHCCCCCEEE Q ss_conf 998642021254030 Q gi|254780392|r 178 VNLARDIINEPANVL 192 (494) Q Consensus 178 v~lARDLvN~PaN~l 192 (494) -.+|+.|.+..|=+. T Consensus 181 k~LA~~l~~~~piiy 195 (328) T PRK08674 181 KRLASKIYGRIPVIY 195 (328) T ss_pred HHHHHHHCCCCCEEE T ss_conf 999999638986899 No 44 >PRK13007 dipeptidase; Reviewed Probab=49.62 E-value=19 Score=16.17 Aligned_cols=27 Identities=11% Similarity=0.161 Sum_probs=11.8 Q ss_pred HHHHHHHHHCCCCCEEECHHHHHHHHH Q ss_conf 999986420212540302999999999 Q gi|254780392|r 176 KGVNLARDIINEPANVLGTDEFCKQVR 202 (494) Q Consensus 176 egv~lARDLvN~PaN~ltP~~lA~~a~ 202 (494) +-+.|.|+|+..|+---.-...|+... T Consensus 9 d~i~Ll~~LV~i~S~sg~E~~~a~~l~ 35 (354) T PRK13007 9 DLAELTAALVDIPSVSGHEKALADAVE 35 (354) T ss_pred HHHHHHHHHCCCCCCCCCHHHHHHHHH T ss_conf 899999997299992978899999999 No 45 >pfam10447 EXOSC1 Exosome component EXOSC1/CSL4. This family of proteins are components of the exosome 3'-5' exoribonuclease complex. The exosome mediates degradation of unstable mRNAs that contain AU-rich elements (AREs) within their 3' untranslated regions. Probab=49.20 E-value=3.5 Score=21.26 Aligned_cols=12 Identities=58% Similarity=0.916 Sum_probs=6.1 Q ss_pred CCCCCHHHHHCC Q ss_conf 997470776302 Q gi|254780392|r 319 SSAQRPGDIVRS 330 (494) Q Consensus 319 ~~a~~pgdvi~~ 330 (494) ..+|||||||.+ T Consensus 56 ~~~FrPGDiVrA 67 (72) T pfam10447 56 YECFRPGDIVRA 67 (72) T ss_pred HHCCCCCCEEEE T ss_conf 468399989999 No 46 >COG1155 NtpA Archaeal/vacuolar-type H+-ATPase subunit A [Energy production and conversion] Probab=48.93 E-value=19 Score=16.09 Aligned_cols=151 Identities=9% Similarity=0.082 Sum_probs=69.6 Q ss_pred CCEEEEECCCC-CCCCEEEEEECCCHHHCCHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCCHHHH--HHHHHHHHHCCC Q ss_conf 87899843788-874459999448815589999999999999853169748994177899998999--999999971477 Q gi|254780392|r 55 KSHLNILAPVD-CVWDRLVVAGIGDPRDANFSWLKAGGSIASYIEEDKNIEIFVDVPEYPITKAEI--RDLVLGFMLKTY 131 (494) Q Consensus 55 Ge~l~l~~~~g-~~~~rViLVGLG~~~~~~~~~~ra~a~~~~~l~~~~~~~v~i~l~~~~~~~~~~--~~~~eG~~L~~Y 131 (494) |+++..+.... ..++-|+.+|||+...--.+.+.-.-.+-.--........++...+..+-...+ ..+-.|+.++-| T Consensus 236 GKTV~qh~laK~sdadiVVyigCGERGNEmtevL~eFPeL~Dp~tg~~lm~RT~liaNTSnMPVAAREasIYtGiTiaEY 315 (588) T COG1155 236 GKTVSQHTLSKLADGDIVIYVGCGERGNEMTEVLQEFPELKDPNTGQPLMDRTVLIANTSNMPVAAREASIYTGITIAEY 315 (588) T ss_pred CCEEHHHHHHHHCCCCEEEEEECCCCCCHHHHHHHHCCCCCCCCCCCCCCCEEEEEECCCCCHHHHHHHHHHHHHHHHHH T ss_conf 83752435656406888999815776523899998686666899997120003676257666188866435322319998 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEECHHHHHHHHHHH-HCCCCC Q ss_conf 7544443643332321000133322457737899999999999999998642021254030299999999986-113321 Q gi|254780392|r 132 TFDQYKTKKRESSSPRSKENISVTIVTEMIQQSSQVVADIQSVVKGVNLARDIINEPANVLGTDEFCKQVRKL-ESLGVA 210 (494) Q Consensus 132 ~Fd~yKskk~~~~~~~~~~~~~i~~~~~~~~~~~~~i~~~~aiaegv~lARDLvN~PaN~ltP~~lA~~a~~l-~~~gl~ 210 (494) -.|.=. .+.+..+....-++++.+ +.-+|..+|+..-+|.||+.+..++ .+.|.. T Consensus 316 ~RDmGy---------------~v~lmADSTSRWAEAlRE---------isgRleEmPgeegyPaYL~srlA~fYERaG~v 371 (588) T COG1155 316 YRDMGY---------------DVALMADSTSRWAEALRE---------ISGRLEEMPGEEGYPAYLGSRLAEFYERAGRV 371 (588) T ss_pred HHHHHH---------------HHHHHHCHHHHHHHHHHH---------HHCCCCCCCCCCCCCHHHHHHHHHHHHHCCEE T ss_conf 876302---------------567750138999999998---------73350028864444167899999999854723 Q ss_pred HHHHCCCCCCHHHHHHHHHHHCCC Q ss_conf 000011100013458988742025 Q gi|254780392|r 211 VEILDKEAMHKLGMNALLAVAQGS 234 (494) Q Consensus 211 v~Vld~~~L~klGMg~llaVgqGS 234 (494) .. +...+ ..|++--+|-.| T Consensus 372 ~~-~~~~~----r~GsvtV~gaVS 390 (588) T COG1155 372 RL-VSPEE----RFGSITVIGAVS 390 (588) T ss_pred EE-CCCCC----CEEEEEEECCCC T ss_conf 42-27886----258999825778 No 47 >cd00989 PDZ_metalloprotease PDZ domain of bacterial and plant zinc metalloprotases, presumably membrane-associated or integral membrane proteases, which may be involved in signalling and regulatory mechanisms. May be responsible for substrate recognition and/or binding, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of protease-associated PDZ domains a C-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in Eumetazoan signaling proteins. Probab=48.89 E-value=3.9 Score=20.99 Aligned_cols=29 Identities=28% Similarity=0.511 Sum_probs=23.6 Q ss_pred EEEECCCCCCCCC-CCHHHHHCCCCCCEEE Q ss_conf 8641026689997-4707763024895888 Q gi|254780392|r 309 VLALVENMPGSSA-QRPGDIVRSMSGQTIE 337 (494) Q Consensus 309 ~~~~~EN~~~~~a-~~pgdvi~~~~G~tve 337 (494) +-...+|+|...| .+|||+|.+-||+.|. T Consensus 16 V~~V~~~spA~~AGl~~GD~I~~ing~~v~ 45 (79) T cd00989 16 IGEVVPGSPAAKAGLKAGDRILAINGQKIK 45 (79) T ss_pred EEEECCCCHHHHCCCCCCCEEEEECCEECC T ss_conf 999899998998599999999999999958 No 48 >COG1921 SelA Selenocysteine synthase [seryl-tRNASer selenium transferase] [Amino acid transport and metabolism] Probab=47.97 E-value=20 Score=15.99 Aligned_cols=109 Identities=31% Similarity=0.320 Sum_probs=69.6 Q ss_pred HHHHHHHHHHHHHCCCCCCEEEEEEECCCCCCCCCCCHHHHHCCCCCCEEEEECCCC----------------------C Q ss_conf 568999999998708787289986410266899974707763024895888634786----------------------6 Q gi|254780392|r 287 AAAVTGLLHVLAERKAKINAIGVLALVENMPGSSAQRPGDIVRSMSGQTIEVINTDA----------------------E 344 (494) Q Consensus 287 aAav~g~~~aia~l~~~v~v~~~~~~~EN~~~~~a~~pgdvi~~~~G~tvei~nTDA----------------------E 344 (494) +++|+-+..+.++ +..| ++..-|+-+ |-+++-.|++..-+++-+||.+|-- + T Consensus 93 ~aAVll~~~al~~---~~Ev--Vis~g~lV~-gg~~~v~d~~~~aG~~l~EvG~tn~t~~~d~~~AIne~ta~llkV~s~ 166 (395) T COG1921 93 AAAVLLTLNALAE---GKEV--VVSRGELVE-GGAFRVPDIIRLAGAKLVEVGTTNRTHLKDYELAINENTALLLKVHSS 166 (395) T ss_pred HHHHHHHHHHHCC---CCEE--EEECCCCCC-CCCCCHHHHHHHCCCEEEEECCCCCCCHHHHHHHHCCCCEEEEEEEEC T ss_conf 8999998763205---7717--997131156-889986679997197799815657677888998854587058999503 Q ss_pred -----CEEEEEHHHHHHHHHCCCCEEEEECC---------CCHHHH-----------HHCCCCCEEEEECCHHHHHHHHH Q ss_conf -----40365236788752079613552111---------200235-----------43177605787089899999999 Q gi|254780392|r 345 -----GRLILADALWYCRTHYNPHLMIDLAT---------LTGAMV-----------VSLGNIYAGLFANNDVLAEQLLS 399 (494) Q Consensus 345 -----GRlvLaD~l~ya~~~~~p~~iid~AT---------LTGa~~-----------~alG~~~~g~~~n~~~~~~~~~~ 399 (494) |-|-+.|...-|. .++--.++|.|- |..++- -.||.-.+|++.-..+|..+++. T Consensus 167 ~~~f~~~l~~~~l~~ia~-~~~lpvivD~aSg~~v~~e~~l~~~la~GaDLV~~SgdKllgGPqaGii~GkKelI~~lq~ 245 (395) T COG1921 167 NYGFTGMLSEEELVEIAH-EKGLPVIVDLASGALVDKEPDLREALALGADLVSFSGDKLLGGPQAGIIVGKKELIEKLQS 245 (395) T ss_pred CCCCCCCCCHHHHHHHHH-HCCCCEEEECCCCCCCCCCCCHHHHHHCCCCEEEEECCHHCCCCCCCEEECHHHHHHHHHH T ss_conf 666146455999999998-7699779966776300355445688860797899754011279854337424999999976 Q ss_pred HHH Q ss_conf 899 Q gi|254780392|r 400 SGL 402 (494) Q Consensus 400 a~~ 402 (494) .+- T Consensus 246 ~~l 248 (395) T COG1921 246 HPL 248 (395) T ss_pred HHH T ss_conf 213 No 49 >COG4386 Mu-like prophage tail sheath protein gpL [General function prediction only] Probab=47.89 E-value=20 Score=15.98 Aligned_cols=15 Identities=7% Similarity=-0.090 Sum_probs=6.9 Q ss_pred CCCCEEEEEECCCCC Q ss_conf 778506886205788 Q gi|254780392|r 235 SRPPYLAVMKWEGGD 249 (494) Q Consensus 235 ~~~Prli~l~Y~g~~ 249 (494) .+.-++..+-|++.+ T Consensus 276 rNd~~~t~~~~~~ap 290 (487) T COG4386 276 RNDGHATCAWIHGAP 290 (487) T ss_pred CCCCEEEEEECCCCC T ss_conf 257437775316999 No 50 >PRK10898 serine endoprotease; Provisional Probab=47.66 E-value=5 Score=20.20 Aligned_cols=54 Identities=15% Similarity=0.191 Sum_probs=35.9 Q ss_pred EEEEEECCCCCCCCC-CCHHHHHCCCCCCEEEEECCCCCCEEEEEHHHHHHHHHCCCCEEEEECCC Q ss_conf 998641026689997-47077630248958886347866403652367887520796135521112 Q gi|254780392|r 307 IGVLALVENMPGSSA-QRPGDIVRSMSGQTIEVINTDAEGRLILADALWYCRTHYNPHLMIDLATL 371 (494) Q Consensus 307 ~~~~~~~EN~~~~~a-~~pgdvi~~~~G~tvei~nTDAEGRlvLaD~l~ya~~~~~p~~iid~ATL 371 (494) +-+....++.|..+| .||||||++.+|+.|+=. .|.+.+ ..+++|.--+.+--+ T Consensus 282 v~V~~V~~~sPA~~AGL~~GDvI~~idg~~v~~~----------~~l~~~-l~~~~pGd~v~l~v~ 336 (355) T PRK10898 282 IVVNEVSPDGPAANAGIQVNDLIISVNNKPAISA----------LETMDQ-VAEIRPGSVIPVVVM 336 (355) T ss_pred CEEEEECCCCHHHHCCCCCCCEEEEECCEECCCH----------HHHHHH-HHHCCCCCEEEEEEE T ss_conf 2898879999589859998999999899893899----------999999-971899798999999 No 51 >cd05791 S1_CSL4 S1_CSL4: CSL4, S1-like RNA-binding domain. S1-like RNA-binding domains are found in a wide variety of RNA-associated proteins. ScCSL4 protein is a subunit of the exosome complex. The exosome plays a central role in 3' to 5' RNA processing and degradation in eukarytes and archaea. Its functions include the removal of incorrectly processed RNA and the maintenance of proper levels of mRNA, rRNA and a number of small RNA species. In S. cerevisiae, the exosome includes nine core components, six of which are homologous to bacterial RNase PH. These form a hexameric ring structure. The other three subunits (RrP4, Rrp40, and Csl4) contain an S1 RNA binding domain and are part of the "S1 pore structure". Probab=47.54 E-value=4 Score=20.87 Aligned_cols=13 Identities=54% Similarity=0.807 Sum_probs=7.1 Q ss_pred CCCCCHHHHHCCC Q ss_conf 9974707763024 Q gi|254780392|r 319 SSAQRPGDIVRSM 331 (494) Q Consensus 319 ~~a~~pgdvi~~~ 331 (494) .++|||||||++. T Consensus 58 ~~~FrpGDIVrA~ 70 (92) T cd05791 58 YKCFRPGDIVRAK 70 (92) T ss_pred HHCCCCCCEEEEE T ss_conf 1354999899999 No 52 >COG3896 Chloramphenicol 3-O-phosphotransferase [Defense mechanisms] Probab=47.03 E-value=12 Score=17.57 Aligned_cols=33 Identities=21% Similarity=0.296 Sum_probs=19.3 Q ss_pred CEEECHHHHHHHHHHHH--C---CCCCH--HHHCCCCCCH Q ss_conf 40302999999999861--1---33210--0001110001 Q gi|254780392|r 189 ANVLGTDEFCKQVRKLE--S---LGVAV--EILDKEAMHK 221 (494) Q Consensus 189 aN~ltP~~lA~~a~~l~--~---~gl~v--~Vld~~~L~k 221 (494) .-++|-..|.+.++-|. + .|+.| |++.+.+++. T Consensus 121 dv~~~r~~L~Dc~r~l~g~~v~~VGV~~p~E~~~~Re~rr 160 (205) T COG3896 121 DVIWTREWLVDCLRVLEGCRVWMVGVHVPDEEGARRELRR 160 (205) T ss_pred HCCCCHHHHHHHHHHHHCCCEEEEEEECCHHHHHHHHHHC T ss_conf 1056466677899998189658998416488999987525 No 53 >TIGR02149 glgA_Coryne glycogen synthase, Corynebacterium family; InterPro: IPR011875 This entry describes Corynebacterium glutamicum GlgA and closely related proteins in other species. This enzyme is required for glycogen biosynthesis and appears to replace the distantly related IPR011835 from INTERPRO, family of ADP-glucose type glycogen synthase in Corynebacterium glutamicum, Mycobacterium tuberculosis, Bifidobacterium longum, and Streptomyces coelicolor .. Probab=46.95 E-value=9.2 Score=18.32 Aligned_cols=40 Identities=13% Similarity=0.166 Sum_probs=23.2 Q ss_pred CCCHHHHCCCCCCHHHHHHHHHHHCCCC---------CCCEEEEEECCCC Q ss_conf 3210000111000134589887420257---------7850688620578 Q gi|254780392|r 208 GVAVEILDKEAMHKLGMNALLAVAQGSS---------RPPYLAVMKWEGG 248 (494) Q Consensus 208 gl~v~Vld~~~L~klGMg~llaVgqGS~---------~~Prli~l~Y~g~ 248 (494) |.++.-|-|+.--+- -..+.||++||. -+|-.|+.-|||- T Consensus 147 GY~lSsW~EktA~~a-Ad~vIAVS~amr~DiL~~YP~lD~~kv~Vv~NGI 195 (416) T TIGR02149 147 GYKLSSWAEKTAIEA-ADRVIAVSGAMREDILKVYPDLDPEKVHVVYNGI 195 (416) T ss_pred CCCCCHHHHHHHHHH-CCCEEEHHHHCHHHHHCCCCCCCCCCEEEEECCC T ss_conf 742024788889985-0406531110335583158688846468886476 No 54 >PRK10139 serine endoprotease; Provisional Probab=46.44 E-value=6.1 Score=19.61 Aligned_cols=24 Identities=17% Similarity=0.404 Sum_probs=11.4 Q ss_pred CCCCCCCCC-CCHHHHHCCCCCCEE Q ss_conf 026689997-470776302489588 Q gi|254780392|r 313 VENMPGSSA-QRPGDIVRSMSGQTI 336 (494) Q Consensus 313 ~EN~~~~~a-~~pgdvi~~~~G~tv 336 (494) ..+.|..++ .||||||++-|++.| T Consensus 398 ~~gS~Aa~aGLr~GDVI~~VN~~~V 422 (455) T PRK10139 398 VKGSPAAQAGLQKDDVIIGVNRDRV 422 (455) T ss_pred CCCCHHHHCCCCCCCEEEEECCEEC T ss_conf 7899899869999999997799873 No 55 >KOG3846 consensus Probab=43.22 E-value=9.1 Score=18.38 Aligned_cols=39 Identities=13% Similarity=0.254 Sum_probs=15.2 Q ss_pred CCCCEEEEECC-HHHHHHHHHHHHHHCC------CCCCCCCCHHHH Q ss_conf 77605787089-8999999998998569------717677686788 Q gi|254780392|r 380 GNIYAGLFANN-DVLAEQLLSSGLSTGE------LLWRMPMNEEYN 418 (494) Q Consensus 380 G~~~~g~~~n~-~~~~~~~~~a~~~~ge------~~w~lPl~~~y~ 418 (494) |-..+--||.. +...+.+.+-|-.... |+-+.||+..|. T Consensus 401 GcQLsl~Fssp~~~vfqeL~krGv~~DkR~PNvIRvAPvpLYN~f~ 446 (465) T KOG3846 401 GCQLSLYFSSPTPKVFQELHKRGVIGDKRRPNVIRVAPVPLYNTFS 446 (465) T ss_pred CCEEEEEECCCCHHHHHHHHHCCEECCCCCCCCEEEECCHHHCCHH T ss_conf 6469999679951889999873701325588832861420000189 No 56 >COG2222 AgaS Predicted phosphosugar isomerases [Cell envelope biogenesis, outer membrane] Probab=41.87 E-value=10 Score=18.09 Aligned_cols=64 Identities=19% Similarity=0.324 Sum_probs=35.5 Q ss_pred CCHHHHCCCCCCHHHHHHHHHHHCCCCCCCEEEEEECCCCCC------CCCCEEEEECCEEECCCCCCCCCCCCCH Q ss_conf 210000111000134589887420257785068862057886------5651266302112006631003333200 Q gi|254780392|r 209 VAVEILDKEAMHKLGMNALLAVAQGSSRPPYLAVMKWEGGDS------EEQPLAFIGKGVVFDTGGISIKPSHGME 278 (494) Q Consensus 209 l~v~Vld~~~L~klGMg~llaVgqGS~~~Prli~l~Y~g~~~------~~~piaLVGKGvTFDtGGislKp~~~M~ 278 (494) +-.+..+.+-+-=+|.|.+.++++-+.-. |..|.|-.... -.-|..||++|+-| +-+++.+... T Consensus 194 ~~~~~~~~~~i~~lGsG~~~g~A~e~aLk--l~E~~~~~s~a~~s~E~~HGp~elv~~~~pv----i~~~~~d~tr 263 (340) T COG2222 194 FAEEYADEDRIYTLGSGPLYGAAYEAALK--LKEMQWIHSEAISSGEFRHGPKELVEEGTPV----LLFVSEDETR 263 (340) T ss_pred HHHHHCCCCEEEEECCCCCHHHHHHHHHH--HHHHCCCCCEEEECCCCCCCCHHHCCCCCEE----EEEECCCCCH T ss_conf 99973589779997785217999999999--9987040015511341014759773799529----9994487404 No 57 >TIGR01139 cysK cysteine synthase A; InterPro: IPR005859 This model discriminates cysteine synthase A (CysK) and cysteine synthase B (CysM) from cystathionine beta-synthase, a protein found primarily in eukaryotes and carrying a C-terminal CBS domain lacking from this protein. Bacterial proteins lacking the CBS domain but otherwise showing resemblance to cystathionine beta-synthases and considerable phylogenetic distance from known cysteine synthases were excluded. Cysteine synthase (O-acetylserine (thiol)-lyase, 2.5.1.47 from EC) is the enzyme responsible for the formation of cysteine from O-acetyl-serine and hydrogen sulphide with the concomitant release of acetic acid. In bacteria such two forms of the enzyme are known (genes cysK and cysM). CysK differs from CysM in that it can also use sulphide instead of thiosulphate, to produce cysteine instead of cysteine thiosulphonate. ; GO: 0004124 cysteine synthase activity, 0006535 cysteine biosynthetic process from serine. Probab=41.61 E-value=24 Score=15.33 Aligned_cols=29 Identities=17% Similarity=0.190 Sum_probs=24.1 Q ss_pred HHHHHHHHHHHHHHHHCC-CCCCEEEEEEE Q ss_conf 123568999999998708-78728998641 Q gi|254780392|r 284 LGGAAAVTGLLHVLAERK-AKINAIGVLAL 312 (494) Q Consensus 284 M~GaAav~g~~~aia~l~-~~v~v~~~~~~ 312 (494) ++..|+|++|.+..-+++ +..+||+++|- T Consensus 272 ISSGA~vaAAl~~Ak~~~n~dK~iV~i~Ps 301 (312) T TIGR01139 272 ISSGAAVAAALKLAKRLENPDKLIVVILPS 301 (312) T ss_pred CCHHHHHHHHHHHHHHHCCCCCEEEEEECC T ss_conf 557899999999987324889889998278 No 58 >cd03450 NodN NodN (nodulation factor N) contains a single hot dog fold similar to those of the peroxisomal Hydratase-Dehydrogenase-Epimerase (HDE) protein, and the fatty acid synthase beta subunit. Rhizobium and related species form nodules on the roots of their legume hosts, a symbiotic process that requires production of Nod factors, which are signal molecules involved in root hair deformation and meristematic cell division. The nodulation gene products, including NodN, are involved in producing the Nod factors, however the role played by NodN is unclear. Probab=38.80 E-value=26 Score=15.13 Aligned_cols=21 Identities=5% Similarity=0.026 Sum_probs=10.5 Q ss_pred CCCCHHHHHHHHHHHHHCCCCCC Q ss_conf 99998999999999971477754 Q gi|254780392|r 112 YPITKAEIRDLVLGFMLKTYTFD 134 (494) Q Consensus 112 ~~~~~~~~~~~~eG~~L~~Y~Fd 134 (494) .....+.+.+|++-. +.++|= T Consensus 22 ~~i~Q~~I~~FA~~T--~D~qwI 42 (149) T cd03450 22 VTVDQERIDQFADAT--GDHQWI 42 (149) T ss_pred EEECHHHHHHHHHHH--CCCCCC T ss_conf 898999999999974--898764 No 59 >PRK08087 L-fuculose phosphate aldolase; Provisional Probab=38.36 E-value=27 Score=14.99 Aligned_cols=59 Identities=14% Similarity=0.275 Sum_probs=43.1 Q ss_pred CCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEECHHHHHHHHHHHHCCCCCHH Q ss_conf 332245773789999999999999999864202125403029999999998611332100 Q gi|254780392|r 153 SVTIVTEMIQQSSQVVADIQSVVKGVNLARDIINEPANVLGTDEFCKQVRKLESLGVAVE 212 (494) Q Consensus 153 ~i~~~~~~~~~~~~~i~~~~aiaegv~lARDLvN~PaN~ltP~~lA~~a~~l~~~gl~v~ 212 (494) .+...+.+...+-......+..++-...|+-+ ..|.+.|+++.+++.++.++.+|.+.+ T Consensus 156 G~~~~G~~l~eA~~~~~~lE~~A~i~l~a~~~-G~p~~~Lsdeei~~~~~~~~~yg~~~~ 214 (215) T PRK08087 156 GLIACEVNLEKALWLAHEVEVLAQLYLKTLAI-TDPVPVLSDEEIAVVLEKFKTYGLRIE 214 (215) T ss_pred CCEEECCCHHHHHHHHHHHHHHHHHHHHHHHC-CCCCCCCCHHHHHHHHHHHHHHCCCCC T ss_conf 75663799999999999999999999999971-999999999999999999998486656 No 60 >KOG2367 consensus Probab=37.80 E-value=23 Score=15.52 Aligned_cols=63 Identities=13% Similarity=0.144 Sum_probs=35.8 Q ss_pred CCCCCE--EECHHHHHHHHHHHHC--C---CCCHHHHCCCCCCHHHHHHHHHHHCCCCCCCEEEEEECCCCCCC Q ss_conf 212540--3029999999998611--3---32100001110001345898874202577850688620578865 Q gi|254780392|r 185 INEPAN--VLGTDEFCKQVRKLES--L---GVAVEILDKEAMHKLGMNALLAVAQGSSRPPYLAVMKWEGGDSE 251 (494) Q Consensus 185 vN~PaN--~ltP~~lA~~a~~l~~--~---gl~v~Vld~~~L~klGMg~llaVgqGS~~~Prli~l~Y~g~~~~ 251 (494) +|.|.- +-+|..|++.++-++. . .+-+++.-..++-=-=-+++++-- -..|.|.-.-.|.++. T Consensus 221 vnipdTVgia~P~~y~dLI~y~~tn~~~~e~v~Is~HcHND~G~a~Ant~~g~~----AGA~~VE~~i~GiGER 290 (560) T KOG2367 221 VNIPDTVGIATPNEYGDLIEYLKTNTPGREKVCISTHCHNDLGCATANTELGLL----AGARQVEVTINGIGER 290 (560) T ss_pred CCCCCEECCCCHHHHHHHHHHHHCCCCCCEEEEEEEEECCCCCHHHHHHHHHHH----CCCCEEEEEEECCCCC T ss_conf 067640203683889999999970688730689987514874278888888775----2865688986224444 No 61 >pfam09264 Sial-lect-inser Vibrio cholerae sialidase, lectin insertion. Members of this family are predominantly found in Vibrio cholerae sialidase, and adopt a beta sandwich structure consisting of 12-14 strands arranged in two beta-sheets. They bind to lectins with high affinity helping to target the protein to sialic acid-rich environments, thereby enhancing the catalytic efficiency of the enzyme. Probab=37.11 E-value=16 Score=16.66 Aligned_cols=31 Identities=29% Similarity=0.505 Sum_probs=21.6 Q ss_pred CHHHHHCCCCC-----CEEEEECCCC--CCEEEEEHHH Q ss_conf 70776302489-----5888634786--6403652367 Q gi|254780392|r 323 RPGDIVRSMSG-----QTIEVINTDA--EGRLILADAL 353 (494) Q Consensus 323 ~pgdvi~~~~G-----~tvei~nTDA--EGRlvLaD~l 353 (494) .-|-+|++.+| ++|+-.|||| .|||-++|-. T Consensus 112 VDg~~vaTwSGe~sssk~I~fGnada~~dgrlh~~~i~ 149 (197) T pfam09264 112 VDGQQITTWAGEVSQENNIQFGNADAQIDGRLHVQKIV 149 (197) T ss_pred ECCEEEEEECCCCCCCCEEEECCCCCCCCCEEEEEEEE T ss_conf 87818988607526765368626565535447777665 No 62 >COG0381 WecB UDP-N-acetylglucosamine 2-epimerase [Cell envelope biogenesis, outer membrane] Probab=36.91 E-value=25 Score=15.25 Aligned_cols=22 Identities=18% Similarity=0.172 Sum_probs=12.2 Q ss_pred HHHHHHHHCCCCCCEEEEEEECCC Q ss_conf 999999870878728998641026 Q gi|254780392|r 292 GLLHVLAERKAKINAIGVLALVEN 315 (494) Q Consensus 292 g~~~aia~l~~~v~v~~~~~~~EN 315 (494) -++.-+++.. -++..+.|.--| T Consensus 226 ~al~~i~~~~--~~~~viyp~H~~ 247 (383) T COG0381 226 EALREIAEEY--PDVIVIYPVHPR 247 (383) T ss_pred HHHHHHHHHC--CCCEEEEECCCC T ss_conf 9999999867--895699747997 No 63 >PRK00911 dihydroxy-acid dehydratase; Provisional Probab=35.96 E-value=30 Score=14.74 Aligned_cols=114 Identities=20% Similarity=0.283 Sum_probs=52.2 Q ss_pred CCCCCCC--CHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEEEECCCCCCCCCCCHHHHHCCCCCCEEEEECCCCCCEE Q ss_conf 0033332--00232211235689999999987087872899864102668999747077630248958886347866403 Q gi|254780392|r 270 SIKPSHG--MEEMKGDLGGAAAVTGLLHVLAERKAKINAIGVLALVENMPGSSAQRPGDIVRSMSGQTIEVINTDAEGRL 347 (494) Q Consensus 270 slKp~~~--M~~Mk~DM~GaAav~g~~~aia~l~~~v~v~~~~~~~EN~~~~~a~~pgdvi~~~~G~tvei~nTDAEGRl 347 (494) +|||++. |++.. .-||-.+|+--+.-...|...+--+-=-.+.||.-+-......|||++.+-- -..+|-| T Consensus 306 ~l~P~G~~~~~d~~-~aGGv~~v~k~L~~~glL~~d~~TVtG~tl~e~l~~~~~~~d~~VIr~~~~P------~~~~Ggl 378 (557) T PRK00911 306 DLKPSGRYVMEDLH-EAGGIPAVMKELLDAGLLHGDCLTVTGKTLAENLADVPPIPDQDVIRPLDNP------ISPTGGL 378 (557) T ss_pred EECCCCCHHHHHHH-HCCCHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHCCCCCCCCCEEECCCCC------CCCCCCE T ss_conf 62468643599988-6479999999999769967888757788899998439888994778046546------4787887 Q ss_pred EEEHHHHHHHHHCCCC-EEEEECCCCHHHHHHCCCCCEEEEECCHHHHHHHHHH Q ss_conf 6523678875207961-3552111200235431776057870898999999998 Q gi|254780392|r 348 ILADALWYCRTHYNPH-LMIDLATLTGAMVVSLGNIYAGLFANNDVLAEQLLSS 400 (494) Q Consensus 348 vLaD~l~ya~~~~~p~-~iid~ATLTGa~~~alG~~~~g~~~n~~~~~~~~~~a 400 (494) ..- +. .+-|+ .||-++-...-. =...|-||.+.+++.+.+..- T Consensus 379 ~vL----~G--NLap~GaViK~sav~~~~----~~GpA~VF~see~~~~ai~~g 422 (557) T PRK00911 379 AIL----KG--NLAPEGAVVKIAGVKPPM----FTGPARVFDSEEEAMAAILAG 422 (557) T ss_pred EEE----CC--CCCCCCEEEEEECCCCCE----EEEEEEEECCHHHHHHHHHCC T ss_conf 862----37--756776068740477645----776468858999999998689 No 64 >cd00988 PDZ_CTP_protease PDZ domain of C-terminal processing-, tail-specific-, and tricorn proteases, which function in posttranslational protein processing, maturation, and disassembly or degradation, in Bacteria, Archaea, and plant chloroplasts. May be responsible for substrate recognition and/or binding, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of protease-associated PDZ domains a C-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in Eumetazoan signaling proteins. Probab=35.39 E-value=10 Score=17.97 Aligned_cols=32 Identities=19% Similarity=0.508 Sum_probs=21.9 Q ss_pred EEEEEECCCCCCCCC-CCHHHHHCCCCCCEEEE Q ss_conf 998641026689997-47077630248958886 Q gi|254780392|r 307 IGVLALVENMPGSSA-QRPGDIVRSMSGQTIEV 338 (494) Q Consensus 307 ~~~~~~~EN~~~~~a-~~pgdvi~~~~G~tvei 338 (494) +-+.-..+|.|...| .+|||+|.+-||+.|.= T Consensus 15 ~~V~~v~~gsPA~~aGl~~GD~I~~Vng~~v~~ 47 (85) T cd00988 15 LVITSVLPGSPAAKAGIKAGDIIVAIDGEPVDG 47 (85) T ss_pred EEEEEECCCCHHHHHCCCCCCEEEEECCEECCC T ss_conf 999996899958980899999999999999789 No 65 >cd07144 ALDH_ALD2-YMR170C Saccharomyces cerevisiae aldehyde dehydrogenase 2 (YMR170c)-like. NAD(P)+-dependent Saccharomyces cerevisiae aldehyde dehydrogenase 2 (YMR170c, ALD5, EC=1.2.1.5) and other similar sequences, are present in this CD. Probab=34.43 E-value=31 Score=14.58 Aligned_cols=93 Identities=18% Similarity=0.258 Sum_probs=60.4 Q ss_pred HHHCCCCE-EEEECCCCHHHHHHCCCC---CEEEEECCHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHCCCCCCEECCC Q ss_conf 52079613-552111200235431776---05787089899999999899856971767768678886179864170188 Q gi|254780392|r 357 RTHYNPHL-MIDLATLTGAMVVSLGNI---YAGLFANNDVLAEQLLSSGLSTGELLWRMPMNEEYNKLIESKFADMKNIG 432 (494) Q Consensus 357 ~~~~~p~~-iid~ATLTGa~~~alG~~---~~g~~~n~~~~~~~~~~a~~~~ge~~w~lPl~~~y~~~~~s~~AD~~n~~ 432 (494) .+-|.|-+ |+-+.++--|+-.|=... .+++||+|...+..+.+.= ++|--.+--+. .....-++.-+|++| T Consensus 387 eE~FGPVl~v~~~~d~eEAi~~aN~s~yGL~a~V~t~d~~~a~~~a~~l-~~G~V~iN~~~----~~~~~~PfGG~K~SG 461 (484) T cd07144 387 EEIFGPVVVISKFKTYEEAIKKANDTTYGLAAAVFTKDIRRAHRVAREL-EAGMVWINSSN----DSDVGVPFGGFKMSG 461 (484) T ss_pred CCCCCCEEEEEECCCHHHHHHHHHCCCCCCEEEEECCCHHHHHHHHHHC-CCCEEEECCCC----CCCCCCCCCCCCCCC T ss_conf 5344627999960999999999858999866999848999999999858-84369986999----877898828745061 Q ss_pred -CCCHHHHHHHHHHHHHCCCCCEEEEEE Q ss_conf -874679999999987238867699940 Q gi|254780392|r 433 -GRGAGSIVAAQFLEKFVQDASWAHIDI 459 (494) Q Consensus 433 -~~~~g~~~aa~fl~~f~~~~~w~HlDi 459 (494) ||.+|. -=|++|+ ..+++|+++ T Consensus 462 ~Gre~G~----~gl~~ft-~~K~v~i~l 484 (484) T cd07144 462 IGRELGE----YGLETYT-QTKAVHINL 484 (484) T ss_pred CCCHHHH----HHHHHHC-CEEEEEEEC T ss_conf 7814379----9999834-722999929 No 66 >TIGR00587 nfo apurinic endonuclease (APN1); InterPro: IPR001719 DNA damaging agents such as the anti-tumour drugs bleomycin and neocarzinostatin or those that generate oxygen radicals produce a variety of lesions in DNA. Amongst these is base-loss which forms apurinic/apyrimidinic (AP) sites or strand breaks with atypical 3' termini. DNA repair at the AP sites is initiated by specific endonuclease cleavage of the phosphodiester backbone. Such endonucleases are also generally capable of removing blocking groups from the 3' terminus of DNA strand breaks. AP endonucleases can be classified into two families based on sequence similarity. Family 2 groups the enzymes listed below , . Bacterial endonuclease IV (3.1.21.2 from EC) (gene nfo). Mycobacterium leprae probable endonuclease (gene end) . Yeast apurinic endonuclease APN1 (4.2.99.18 from EC). Caenorhabditis elegans hypothetical protein APN-1 or T05H10.2. Escherichia coli endonuclease IV and its Saccharomyces cerevisiae homologue Apn1 have been shown to be transition metalloproteins that require zinc and manganese for activity .; GO: 0003677 DNA binding, 0004519 endonuclease activity, 0006281 DNA repair, 0005622 intracellular. Probab=33.23 E-value=6.9 Score=19.21 Aligned_cols=45 Identities=22% Similarity=0.400 Sum_probs=34.4 Q ss_pred HCCCCCCCEEEEEECCCCC-CCCCCEEEEECCEEECC-----CCCCCCCCCCCHHH Q ss_conf 2025778506886205788-65651266302112006-----63100333320023 Q gi|254780392|r 231 AQGSSRPPYLAVMKWEGGD-SEEQPLAFIGKGVVFDT-----GGISIKPSHGMEEM 280 (494) Q Consensus 231 gqGS~~~Prli~l~Y~g~~-~~~~piaLVGKGvTFDt-----GGislKp~~~M~~M 280 (494) ||||.-...|-.|.|==.+ +.+.+| |||+|| .||+|......+.- T Consensus 179 G~Gn~~G~~f~eLk~iI~~i~~~~r~-----GVClDTCH~FAaGYDI~T~~~~~~v 229 (318) T TIGR00587 179 GQGNEIGRSFEELKYIIDGIKDKSRI-----GVCLDTCHTFAAGYDIRTVEDFEKV 229 (318) T ss_pred CCCCCCCCCHHHHHHHHHHHEECCCE-----EEECCHHHHHHCCCCCCCHHHHHHH T ss_conf 75667788889999998532004730-----4522222041214677777779999 No 67 >PRK07334 threonine dehydratase; Provisional Probab=33.10 E-value=33 Score=14.43 Aligned_cols=54 Identities=22% Similarity=0.409 Sum_probs=34.3 Q ss_pred HHHHHHHHHHHHHHHCCCCCCEEEEEEECCCCCCCCCCCHHHHHCCCCCCEEEEECCCCCCEEEEEHHHH Q ss_conf 2356899999999870878728998641026689997470776302489588863478664036523678 Q gi|254780392|r 285 GGAAAVTGLLHVLAERKAKINAIGVLALVENMPGSSAQRPGDIVRSMSGQTIEVINTDAEGRLILADALW 354 (494) Q Consensus 285 ~GaAav~g~~~aia~l~~~v~v~~~~~~~EN~~~~~a~~pgdvi~~~~G~tvei~nTDAEGRlvLaD~l~ 354 (494) ||..-.-|+-.++-.+++.++|+|+=| |+.| +...+..|+.++.. .| .+||++. T Consensus 177 GGGGLisGia~a~K~~~P~ikViGVEp--e~a~--------~m~~sl~~~~~~~~-~~-----TiADGla 230 (399) T PRK07334 177 GGGGLISGIATAAKALKPDIEIIGVQT--ELYP--------SMYAALKGVALPCG-GS-----TIAEGIA 230 (399) T ss_pred CCCHHHHHHHHHHHCCCCCCEEEEECC--CCCH--------HHHHHHCCCCCCCC-CC-----CCCCCCC T ss_conf 860479999999960499978999765--8776--------99998469998899-87-----3201224 No 68 >COG0248 GppA Exopolyphosphatase [Nucleotide transport and metabolism / Inorganic ion transport and metabolism] Probab=32.88 E-value=33 Score=14.41 Aligned_cols=81 Identities=27% Similarity=0.376 Sum_probs=54.0 Q ss_pred HHHHHHHHCCCCCEEECHHHHHHHHHHHHCCCCCHHHHCCCCCCHHHHHHHHHHHCCCCCCCEEEEEECCCCCCCCCCEE Q ss_conf 99986420212540302999999999861133210000111000134589887420257785068862057886565126 Q gi|254780392|r 177 GVNLARDIINEPANVLGTDEFCKQVRKLESLGVAVEILDKEAMHKLGMNALLAVAQGSSRPPYLAVMKWEGGDSEEQPLA 256 (494) Q Consensus 177 gv~lARDLvN~PaN~ltP~~lA~~a~~l~~~gl~v~Vld~~~L~klGMg~llaVgqGS~~~Prli~l~Y~g~~~~~~pia 256 (494) ++.-+|+-.|. +.|-++++ +..|++++|+++++=.. =.+++|..+-...++.+++.-=|+.- + ++ T Consensus 80 ATsA~R~A~N~-------~eFl~rv~--~~~G~~ievIsGeeEAr---l~~lGv~~~~~~~~~~lv~DIGGGSt-E--l~ 144 (492) T COG0248 80 ATSALRDAPNG-------DEFLARVE--KELGLPIEVISGEEEAR---LIYLGVASTLPRKGDGLVIDIGGGST-E--LV 144 (492) T ss_pred HHHHHHCCCCH-------HHHHHHHH--HHHCCCEEEECCHHHHH---HHHHHHHHCCCCCCCEEEEEECCCEE-E--EE T ss_conf 26887718998-------99999999--98699638716077889---99988872478778879998068737-9--99 Q ss_pred EE-----ECCEEECCCCCCCC Q ss_conf 63-----02112006631003 Q gi|254780392|r 257 FI-----GKGVVFDTGGISIK 272 (494) Q Consensus 257 LV-----GKGvTFDtGGislK 272 (494) ++ ++++.++=|.++|. T Consensus 145 ~g~~~~~~~~~Sl~~G~v~lt 165 (492) T COG0248 145 LGDNFEIGLLISLPLGCVRLT 165 (492) T ss_pred EECCCCCCEEEEEECCEEEEE T ss_conf 713774224678633608724 No 69 >PRK05639 4-aminobutyrate aminotransferase; Provisional Probab=32.28 E-value=34 Score=14.34 Aligned_cols=94 Identities=20% Similarity=0.332 Sum_probs=49.6 Q ss_pred EEECHHHHHHHHHHHH-CCCCCHHHHCCCCCCHHHHHHHHHHHCCCCCCCEEEEEECCCCCCCCCCEEEEECCEEECCCC Q ss_conf 0302999999999861-133210000111000134589887420257785068862057886565126630211200663 Q gi|254780392|r 190 NVLGTDEFCKQVRKLE-SLGVAVEILDKEAMHKLGMNALLAVAQGSSRPPYLAVMKWEGGDSEEQPLAFIGKGVVFDTGG 268 (494) Q Consensus 190 N~ltP~~lA~~a~~l~-~~gl~v~Vld~~~L~klGMg~llaVgqGS~~~Prli~l~Y~g~~~~~~piaLVGKGvTFDtGG 268 (494) -+..|..|-+.+++++ ++|+ +=|+||-+.==-.-|.+.+ .+|-|- ++.|.-.|||+ +|| T Consensus 237 ~~~~~~~yl~~lr~lc~~~gi-llI~DEV~tGfGRtG~~fa-------------~e~~gv---~PDii~~gKgl---~~G 296 (457) T PRK05639 237 IVVPPENFFKELKKLLNELGI-LLVMDEVQTGIGRTGKWFA-------------SEWFSV---KPDMIIFGKGV---ASG 296 (457) T ss_pred CCCCCHHHHHHHHHHHHHCCC-EEEEECCCCCCCCCCCHHH-------------HHCCCC---CCCEEEECCHH---CCC T ss_conf 862898998899999976398-6873103227762565456-------------643799---99999977221---079 Q ss_pred CCC------------CCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCC Q ss_conf 100------------33332002322112356899999999870878 Q gi|254780392|r 269 ISI------------KPSHGMEEMKGDLGGAAAVTGLLHVLAERKAK 303 (494) Q Consensus 269 isl------------Kp~~~M~~Mk~DM~GaAav~g~~~aia~l~~~ 303 (494) +.| .++..+.++-.+--++|+.+.++..+-+-++- T Consensus 297 ~plsav~~~~~i~~~~~g~~~~T~~gnpla~Aaa~a~L~~i~~~~l~ 343 (457) T PRK05639 297 MGLSGVIGRKEIMDITSGSALLTPAANPVISAAAEATLEIIEEENLL 343 (457) T ss_pred CCCEEEEECHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHCCHH T ss_conf 87079998488775405765557886529999999999999965599 No 70 >TIGR02198 rfaE_dom_I rfaE bifunctional protein, domain I; InterPro: IPR011913 RfaE is a protein involved in the biosynthesis of ADP-L-glycero-D-manno-heptose, a precursor for LPS inner core biosynthesis. RfaE is a bifunctional protein in Escherichia coli, and separate proteins in some other genome. The longer, N-terminal domain I (this family) is suggested to act in D-glycero-D-manno-heptose 1-phosphate biosynthesis, while domain II (IPR011914 from INTERPRO) adds ADP to yield ADP-D-glycero-D-manno-heptose .; GO: 0016773 phosphotransferase activity alcohol group as acceptor, 0016779 nucleotidyltransferase activity, 0005975 carbohydrate metabolic process. Probab=32.11 E-value=32 Score=14.50 Aligned_cols=153 Identities=10% Similarity=0.111 Sum_probs=69.8 Q ss_pred HHHHHHHHCCCCCCCCCCEEEEECCCCCCCCEE------EEEECCCHHH--CCHHH-HHHHHHHHHHHHHCCCEEEEECC Q ss_conf 899999980899836787899843788874459------9994488155--89999-99999999985316974899417 Q gi|254780392|r 39 SIVMRAVSVKNFIGESKSHLNILAPVDCVWDRL------VVAGIGDPRD--ANFSW-LKAGGSIASYIEEDKNIEIFVDV 109 (494) Q Consensus 39 ~~L~~~i~~~~F~GK~Ge~l~l~~~~g~~~~rV------iLVGLG~~~~--~~~~~-~ra~a~~~~~l~~~~~~~v~i~l 109 (494) ..|...++..+.. .....+....-+-..--|| =|+=+..++. ++.+. .++...+..++...+..=++=+- T Consensus 77 ~~L~~ll~~~g~~-~~~~l~~d~~rpT~~K~Rv~a~~~QQllR~D~E~~~~~~~~~~~~L~~~~~~~l~~~d~VvLSDYa 155 (321) T TIGR02198 77 KALEALLKEEGID-DTSGLIRDKSRPTTTKTRVLARANQQLLRVDFEERKPINAEEEARLLAAIKEQLASADAVVLSDYA 155 (321) T ss_pred HHHHHHHHHCCCC-CCCCEEEECCCCCEEEEEEEECCCEEEEEEEEECCCCCCHHHHHHHHHHHHHHHCCCCEEEEEECC T ss_conf 9999999745864-330027768989448888860465058997410277689778999999999972328789998668 Q ss_pred CCCCCCHHHHHHHHHHHHHC---------CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHH Q ss_conf 78999989999999999714---------777544443643332321000133322457737899999999999999998 Q gi|254780392|r 110 PEYPITKAEIRDLVLGFMLK---------TYTFDQYKTKKRESSSPRSKENISVTIVTEMIQQSSQVVADIQSVVKGVNL 180 (494) Q Consensus 110 ~~~~~~~~~~~~~~eG~~L~---------~Y~Fd~yKskk~~~~~~~~~~~~~i~~~~~~~~~~~~~i~~~~aiaegv~l 180 (494) .......-++.+..-+.-. .=.|.+|+.. ++...+.++.+ T Consensus 156 -KGvLt~~v~~~~I~~Ar~~~~pVlVDPKg~df~~Y~GA---------------tl~TPN~~E~~--------------- 204 (321) T TIGR02198 156 -KGVLTPSVVQEVIAAAREAGKPVLVDPKGKDFSRYRGA---------------TLITPNRKEAE--------------- 204 (321) T ss_pred -CCCCCHHHHHHHHHHHHHCCCCEEEECCCCCHHHHCCC---------------CCCCCCHHHHH--------------- T ss_conf -76358578999999999668919980787623451466---------------42366879999--------------- Q ss_pred HHHHCCCCC-EEECHHHHHHHHHHH-HCCCCCHHHHCCCCCCHHHHHHHH Q ss_conf 642021254-030299999999986-113321000011100013458988 Q gi|254780392|r 181 ARDIINEPA-NVLGTDEFCKQVRKL-ESLGVAVEILDKEAMHKLGMNALL 228 (494) Q Consensus 181 ARDLvN~Pa-N~ltP~~lA~~a~~l-~~~gl~v~Vld~~~L~klGMg~ll 228 (494) ..+-.|. ..-++..+.+.+++| .++.++-=++.+.| .||=.|. T Consensus 205 --~avG~~~~~~~~~~~~~~aa~~L~~~~~l~alLvTRsE---~GMtL~~ 249 (321) T TIGR02198 205 --AAVGIKFIACENEAELVKAAEKLLEELDLEALLVTRSE---KGMTLFT 249 (321) T ss_pred --HHHCCCCHHCCCCHHHHHHHHHHHHHCCCCEEEEEECC---CCCEEEC T ss_conf --98588701105817899999999997099679995077---6206676 No 71 >TIGR00195 exoDNase_III exodeoxyribonuclease III; InterPro: IPR000097 DNA damaging agents such as the antitumor drugs bleomycin and neocarzinostatin or those that generate oxygen radicals produce a variety of lesions in DNA. Amongst these is base-loss which forms apurinic/apyrimidinic (AP) sites or strand breaks with atypical 3' termini. DNA repair at the AP sites is initiated by specific endonuclease cleavage of the phosphodiester backbone. Such endonucleases are also generally capable of removing blocking groups from the 3' terminus of DNA strand breaks. AP endonucleases can be classified into two families based on sequence similarity. This family contains members of AP endonuclease family 1. Except for Rrp1 and arp, these enzymes are proteins of about 300 amino-acid residues. Rrp1 and arp both contain additional and unrelated sequences in their N-terminal section (about 400 residues for Rrp1 and 270 for arp). The proteins contain glutamate which has been shown , in the Escherichia coli enzyme to bind a divalent metal ion such as magnesium or manganese.; GO: 0003677 DNA binding, 0004519 endonuclease activity, 0006281 DNA repair, 0005622 intracellular. Probab=31.78 E-value=20 Score=16.00 Aligned_cols=11 Identities=36% Similarity=0.752 Sum_probs=7.3 Q ss_pred CCCCCCCCCCC Q ss_conf 47775444436 Q gi|254780392|r 129 KTYTFDQYKTK 139 (494) Q Consensus 129 ~~Y~Fd~yKsk 139 (494) +.|+|-.|+++ T Consensus 222 ~~YtWwdYR~~ 232 (281) T TIGR00195 222 GAYTWWDYRTR 232 (281) T ss_pred CCCCEEECCCC T ss_conf 94003311336 No 72 >cd00136 PDZ PDZ domain, also called DHR (Dlg homologous region) or GLGF (after a conserved sequence motif). Many PDZ domains bind C-terminal polypeptides, though binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. Heterodimerization through PDZ-PDZ domain interactions adds to the domain's versatility, and PDZ domain-mediated interactions may be modulated dynamically through target phosphorylation. Some PDZ domains play a role in scaffolding supramolecular complexes. PDZ domains are found in diverse signaling proteins in bacteria, archebacteria, and eurkayotes. This CD contains two distinct structural subgroups with either a N- or C-terminal beta-strand forming the peptide-binding groove base. The circular permutation placing the strand on the N-terminus appears to be found in Eumetazoa only, while the C-terminal variant is found in all three kingdoms of life, and seems to co-occur with protease domains. PDZ domains have been named after PSD95(pos Probab=31.55 E-value=12 Score=17.58 Aligned_cols=43 Identities=21% Similarity=0.394 Sum_probs=28.9 Q ss_pred EEEEEEECCCCCCCCC-CCHHHHHCCCCCCEEEEECCCCCCEEEEEHHHHHH Q ss_conf 8998641026689997-47077630248958886347866403652367887 Q gi|254780392|r 306 AIGVLALVENMPGSSA-QRPGDIVRSMSGQTIEVINTDAEGRLILADALWYC 356 (494) Q Consensus 306 v~~~~~~~EN~~~~~a-~~pgdvi~~~~G~tvei~nTDAEGRlvLaD~l~ya 356 (494) .+-+....+|.|...| .+|||.|.+-||+.|+=.+ +.++..+- T Consensus 14 ~i~V~~v~~~spA~~aGL~~GD~I~~ing~~v~~~~--------~~~~~~~l 57 (70) T cd00136 14 GVVVLSVEPGSPAERAGLQAGDVILAVNGTDVKNLT--------LEDVAELL 57 (70) T ss_pred CEEEEECCCCCHHHHCCCCCCCEEEEECCEECCCCC--------HHHHHHHH T ss_conf 899998099897998799989999999999968998--------99999996 No 73 >TIGR01752 flav_long flavodoxin; InterPro: IPR010086 Flavodoxins are small redox-active proteins with a flavin mononucleotide (FMN) prosthetic group. They function as electron transfer agents in a variety of microbial metabolic processes, including nitrogen fixation by nitrogenase , sulphite reduction , and light-dependent NADP+ reduction during photosynthesis . This entry describes the long chain type, typical for nitrogen fixation but associated with pyruvate formate-lyase activation and cobalamin-dependent methionine synthase activity in Escherichia coli .; GO: 0010181 FMN binding, 0006118 electron transport. Probab=31.34 E-value=35 Score=14.23 Aligned_cols=52 Identities=12% Similarity=0.166 Sum_probs=43.1 Q ss_pred HHHHHHCCCCCCCCCCEEEEECCCCCCCCEEEEEECCCHHHCCHHHHHHHHHHHHHHHHCCCEEEEEC Q ss_conf 99999808998367878998437888744599994488155899999999999998531697489941 Q gi|254780392|r 41 VMRAVSVKNFIGESKSHLNILAPVDCVWDRLVVAGIGDPRDANFSWLKAGGSIASYIEEDKNIEIFVD 108 (494) Q Consensus 41 L~~~i~~~~F~GK~Ge~l~l~~~~g~~~~rViLVGLG~~~~~~~~~~ra~a~~~~~l~~~~~~~v~i~ 108 (494) +-..++.-+|+||. |-|+|||+....+..+.-+.+.+...++..+.+-+-.. T Consensus 76 ~~~~L~~~dl~GK~----------------VAlfGlGDQ~~Ysd~F~dgmg~L~d~~~~~Ga~~VG~w 127 (176) T TIGR01752 76 FLSDLEELDLKGKT----------------VALFGLGDQEGYSDTFCDGMGILYDKIKARGAKVVGFW 127 (176) T ss_pred HHHHHHHCCCCCCE----------------EEEEECCCCCCCHHHHHHHHHHHHHHHHHCCCEEEECC T ss_conf 76544202358876----------------89972588461106678778899999851588077514 No 74 >COG4942 Membrane-bound metallopeptidase [Cell division and chromosome partitioning] Probab=30.51 E-value=36 Score=14.14 Aligned_cols=85 Identities=21% Similarity=0.309 Sum_probs=62.6 Q ss_pred HHHHHHHHHHHCCCCCCEEEEEEECCCCCCCCCCCHHHHHCCCCCCEEEEECCCCCCEEEEEHHHHHHHHHCCCCEEEEE Q ss_conf 89999999987087872899864102668999747077630248958886347866403652367887520796135521 Q gi|254780392|r 289 AVTGLLHVLAERKAKINAIGVLALVENMPGSSAQRPGDIVRSMSGQTIEVINTDAEGRLILADALWYCRTHYNPHLMIDL 368 (494) Q Consensus 289 av~g~~~aia~l~~~v~v~~~~~~~EN~~~~~a~~pgdvi~~~~G~tvei~nTDAEGRlvLaD~l~ya~~~~~p~~iid~ 368 (494) .++|.+ ..+|.+||+=.-+--.=++++++ ..-+|.||.+-.|-||.= =|-||.|+||-|- -|.-=+||| T Consensus 290 ~~~~~~--~G~l~~PV~G~il~rFG~~~~gg-~~wkG~vi~a~~Ga~V~A---~AdG~VvyA~~l~----GYG~vvIld- 358 (420) T COG4942 290 GGFGAL--RGQLAWPVTGRILRRFGQADGGG-LRWKGMVIGASAGATVKA---IADGRVVYADWLR----GYGLVVILD- 358 (420) T ss_pred CCCCCC--CCCCCCCCCCHHHHHHCCCCCCC-CCCCCEEEECCCCCEEEE---CCCCEEEECHHHC----CCCEEEEEE- T ss_conf 532002--57768887870787734567887-610675873599982562---0695699543325----675699997- Q ss_pred CCCCHHHHHHCCCCCEEEEECCHHHH Q ss_conf 11200235431776057870898999 Q gi|254780392|r 369 ATLTGAMVVSLGNIYAGLFANNDVLA 394 (494) Q Consensus 369 ATLTGa~~~alG~~~~g~~~n~~~~~ 394 (494) .|..|-.||++++.+. T Consensus 359 ----------hG~gy~slyg~~~~i~ 374 (420) T COG4942 359 ----------HGGGYHSLYGGNQSIL 374 (420) T ss_pred ----------CCCCCEEEECCCCEEE T ss_conf ----------4885378861664221 No 75 >PRK08526 threonine dehydratase; Provisional Probab=30.50 E-value=26 Score=15.14 Aligned_cols=42 Identities=24% Similarity=0.412 Sum_probs=29.2 Q ss_pred HHHHHHHHHHHHHHHCCCCCCEEEEEEECCCCCC-CCCCCHHHHH Q ss_conf 2356899999999870878728998641026689-9974707763 Q gi|254780392|r 285 GGAAAVTGLLHVLAERKAKINAIGVLALVENMPG-SSAQRPGDIV 328 (494) Q Consensus 285 ~GaAav~g~~~aia~l~~~v~v~~~~~~~EN~~~-~~a~~pgdvi 328 (494) ||..-.-|+..++-.+++.++|+|+=| |+.|+ -.|.+.|.++ T Consensus 177 GGGGLisGia~a~K~~~P~ikViGVEp--e~a~~m~~Sl~~g~~v 219 (403) T PRK08526 177 GGGGLISGIASAAKQINPNIKIIGVGA--KGAPAMKESFHAKKIK 219 (403) T ss_pred CCCCHHHHHHHHHHHHCCCCCEEEECC--CCCHHHHHHHHCCCCE T ss_conf 886168999999986299883899664--7876899998769945 No 76 >PRK12890 allantoate amidohydrolase; Reviewed Probab=30.03 E-value=37 Score=14.09 Aligned_cols=136 Identities=16% Similarity=0.135 Sum_probs=93.3 Q ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEEEECCCCCCCCCCCHHHHHCCCCCCEEEEECCCCCCEEEEEHHH Q ss_conf 33200232211235689999999987087872899864102668999747077630248958886347866403652367 Q gi|254780392|r 274 SHGMEEMKGDLGGAAAVTGLLHVLAERKAKINAIGVLALVENMPGSSAQRPGDIVRSMSGQTIEVINTDAEGRLILADAL 353 (494) Q Consensus 274 ~~~M~~Mk~DM~GaAav~g~~~aia~l~~~v~v~~~~~~~EN~~~~~a~~pgdvi~~~~G~tvei~nTDAEGRlvLaD~l 353 (494) +-.|..=|-=|.+||-.+-.+..++.... -..++.++-.+.-|+....-||.|. -|++|.+.|.+-+--+...+ T Consensus 230 tTPM~~R~DAl~aAa~~i~~i~~~a~~~~-~~~v~TVG~i~v~P~~~NvIPg~v~-----f~lDiRs~d~~~l~~~~~~i 303 (412) T PRK12890 230 TTPMDLRRDALVAAAEVVTAVERRARALP-GELVATVGRLDVEPNAINVIPGRAV-----FTVDLRSPDDAVLEAAEAAL 303 (412) T ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHHHCC-CCEEEEEEEEEECCCCCCEECCEEE-----EEEEEECCCHHHHHHHHHHH T ss_conf 88832134899999999999999998539-8718998679842775542087589-----99991079999999999999 Q ss_pred HHHH----HHCCCCEEEEECCCCHHHHHHCCCCCEEEEECCHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHCCCCCC Q ss_conf 8875----2079613552111200235431776057870898999999998998569717677686788861798641 Q gi|254780392|r 354 WYCR----THYNPHLMIDLATLTGAMVVSLGNIYAGLFANNDVLAEQLLSSGLSTGELLWRMPMNEEYNKLIESKFAD 427 (494) Q Consensus 354 ~ya~----~~~~p~~iid~ATLTGa~~~alG~~~~g~~~n~~~~~~~~~~a~~~~ge~~w~lPl~~~y~~~~~s~~AD 427 (494) --.. .+.+-. +++-.+.-.- -..-|+.+.+.+.++.+..|-+..+||-.--++-+.=+.+.+ T Consensus 304 ~~~~~~ia~~~gv~--~~~~~~~~~~----------pv~~d~~l~~~~~~aa~~~g~~~~~m~SGAGHDA~~~a~~~P 369 (412) T PRK12890 304 LAELEAIAARRGVR--IDLERTSRSR----------PVPCDPALVDAVEAAAARLGYASRRLPSGAGHDAAAIARIAP 369 (412) T ss_pred HHHHHHHHHHCCCE--EEEEEEEECC----------CCCCCHHHHHHHHHHHHHCCCCCCEECCCHHHHHHHHHHHCC T ss_conf 99999999974984--9999988478----------757799999999999997599972527505699999970588 No 77 >COG0036 Rpe Pentose-5-phosphate-3-epimerase [Carbohydrate transport and metabolism] Probab=29.89 E-value=21 Score=15.83 Aligned_cols=27 Identities=11% Similarity=0.181 Sum_probs=13.0 Q ss_pred HHHHHHHHHCCCCCEEECHHHHHHHHH Q ss_conf 999986420212540302999999999 Q gi|254780392|r 176 KGVNLARDIINEPANVLGTDEFCKQVR 202 (494) Q Consensus 176 egv~lARDLvN~PaN~ltP~~lA~~a~ 202 (494) +-+.+-|+++=.++=.++|.+=.+..+ T Consensus 100 r~i~~Ik~~G~kaGv~lnP~Tp~~~i~ 126 (220) T COG0036 100 RTIQLIKELGVKAGLVLNPATPLEALE 126 (220) T ss_pred HHHHHHHHCCCEEEEEECCCCCHHHHH T ss_conf 999999975985779978999778999 No 78 >pfam00926 DHBP_synthase 3,4-dihydroxy-2-butanone 4-phosphate synthase. 3,4-Dihydroxy-2-butanone 4-phosphate is biosynthesized from ribulose 5-phosphate and serves as the biosynthetic precursor for the xylene ring of riboflavin. Sometimes found as a bifunctional enzyme with pfam00925. Probab=29.88 E-value=35 Score=14.24 Aligned_cols=44 Identities=18% Similarity=0.360 Sum_probs=19.5 Q ss_pred HHCCCCCEEECHHHHHHHHHHHHCCCCCHHHHCCCCCCHHHHHHHHH Q ss_conf 20212540302999999999861133210000111000134589887 Q gi|254780392|r 183 DIINEPANVLGTDEFCKQVRKLESLGVAVEILDKEAMHKLGMNALLA 229 (494) Q Consensus 183 DLvN~PaN~ltP~~lA~~a~~l~~~gl~v~Vld~~~L~klGMg~lla 229 (494) ||+ .|+..+||+.++-..+. -.|+=|-.+.++.++++|.+.+.. T Consensus 27 dlv-~~Ae~~t~e~i~fm~~~--~~Glic~a~~~~~~~~LgLp~m~~ 70 (193) T pfam00926 27 DLV-IAAEKVTPESVNFMIRH--ARGLICVALTEERADRLGLPPMVD 70 (193) T ss_pred CEE-EEHHHCCHHHHHHHHHH--CCCCEEEECCHHHHHHCCCCCCCC T ss_conf 489-78444999999999995--897779716999997669956555 No 79 >COG4043 Preprotein translocase subunit Sec61beta [Intracellular trafficking, secretion, and vesicular transport] Probab=29.88 E-value=20 Score=15.96 Aligned_cols=14 Identities=7% Similarity=0.249 Sum_probs=6.3 Q ss_pred HHHHHHHCCCEEEE Q ss_conf 99985316974899 Q gi|254780392|r 93 IASYIEEDKNIEIF 106 (494) Q Consensus 93 ~~~~l~~~~~~~v~ 106 (494) +-..+....+.++. T Consensus 60 F~~mlreepiE~v~ 73 (111) T COG4043 60 FEEMLREEPIENVL 73 (111) T ss_pred HHHHHHHCCHHHHC T ss_conf 99999854866518 No 80 >cd04461 S1_Rrp5_repeat_hs8_sc7 S1_Rrp5_repeat_hs8_sc7: Rrp5 Homo sapiens S1 repeat 8 (hs8) and Saccharomyces cerevisiae S1 repeat 7 (sc7)-like domains. Rrp5 is a trans-acting factor important for biogenesis of both the 40S and 60S eukaryotic ribosomal subunits. Rrp5 has two distinct regions, an N-terminal region containing tandemly repeated S1 RNA-binding domains (12 S1 repeats in S. cerevisiae Rrp5 and 14 S1 repeats in H. sapiens Rrp5) and a C-terminal region containing tetratricopeptide repeat (TPR) motifs thought to be involved in protein-protein interactions. Mutational studies have shown that each region represents a specific functional domain. Deletions within the S1-containing region inhibit pre-rRNA processing at either site A3 or A2, whereas deletions within the TPR region confer an inability to support cleavage of A0-A2. This CD includes H. sapiens S1 repeat 8 and S. cerevisiae S1 repeat 7. Rrp5 is found in eukaryotes but not in prokaryotes or archaea. Probab=29.67 E-value=37 Score=14.04 Aligned_cols=48 Identities=17% Similarity=0.075 Sum_probs=31.3 Q ss_pred CCCCEEEEEEECCCCCCCCCCCHHHHHCCCCCCEEEEECCCCC-CEEEEE Q ss_conf 7872899864102668999747077630248958886347866-403652 Q gi|254780392|r 302 AKINAIGVLALVENMPGSSAQRPGDIVRSMSGQTIEVINTDAE-GRLILA 350 (494) Q Consensus 302 ~~v~v~~~~~~~EN~~~~~a~~pgdvi~~~~G~tvei~nTDAE-GRlvLa 350 (494) +.-++.|++|..|=. ...-..|.+.++--.=.++.|.+.|.| +|+.|. T Consensus 34 l~~~v~Glv~~s~ls-~~~~~~~~~~~~vGq~V~~~V~~iD~e~~ri~LS 82 (83) T cd04461 34 FLGGLTGLAPKSYIS-DEFVTDPSFGFKKGQSVTAKVTSVDEEKQRFLLS 82 (83) T ss_pred ECCCCEEEEEHHHCC-CCCCCCHHHHCCCCCEEEEEEEEEECCCCEEEEE T ss_conf 569948998999968-6422698994559999999999995898878875 No 81 >pfam02112 PDEase_II cAMP phosphodiesterases class-II. Probab=29.15 E-value=38 Score=13.98 Aligned_cols=30 Identities=23% Similarity=0.446 Sum_probs=16.1 Q ss_pred CCCCEEEEECCCCCCEEEEEHHHHHHHHHCCCCEEEE Q ss_conf 4895888634786640365236788752079613552 Q gi|254780392|r 331 MSGQTIEVINTDAEGRLILADALWYCRTHYNPHLMID 367 (494) Q Consensus 331 ~~G~tvei~nTDAEGRlvLaD~l~ya~~~~~p~~iid 367 (494) ++|.-||.-.+|. -.|...|. ++.|+.+|+ T Consensus 231 Lk~I~IEcS~~n~-----~~d~~LyG--HLtP~~Li~ 260 (323) T pfam02112 231 LKAILIECSFPNE-----IPDNELFG--HLTPRLLIN 260 (323) T ss_pred CCEEEEEECCCCC-----CCCCCCCC--CCCHHHHHH T ss_conf 4789999679999-----98440245--798499999 No 82 >PRK10717 cysteine synthase A; Provisional Probab=28.89 E-value=38 Score=13.95 Aligned_cols=42 Identities=19% Similarity=0.264 Sum_probs=29.2 Q ss_pred HHHHHHHHHHHHHHHHCCCCCCEEEEEEECCCCCCCCCCCHHHH Q ss_conf 12356899999999870878728998641026689997470776 Q gi|254780392|r 284 LGGAAAVTGLLHVLAERKAKINAIGVLALVENMPGSSAQRPGDI 327 (494) Q Consensus 284 M~GaAav~g~~~aia~l~~~v~v~~~~~~~EN~~~~~a~~pgdv 327 (494) +||+....|+.+++-+.++.++|+++-| ++++--.+++.|+. T Consensus 184 vGtGG~i~Gi~~~lk~~~p~~~vi~vep--~gS~~~~~~~~g~~ 225 (334) T PRK10717 184 VGTGGTLAGVSRYLKETNPKIRIVLADP--TGSALYSYYKTGEL 225 (334) T ss_pred CCCCHHHHHHHHHHHHHCCCCEEEEEEC--CCCCCHHHHHCCCC T ss_conf 1586238999999997599988999801--78631023205864 No 83 >TIGR01163 rpe ribulose-phosphate 3-epimerase; InterPro: IPR000056 Ribulose-phosphate 3-epimerase (5.1.3.1 from EC) (also known as pentose-5-phosphate 3-epimerase or PPE) is the enzyme that converts D-ribulose 5-phosphate into D-xylulose 5-phosphate in Calvin's reductive pentose phosphate cycle. In Alcaligenes eutrophus two copies of the gene coding for PPE are known , one is chromosomally encoded P40117 from SWISSPROT, the other one is on a plasmid Q04539 from SWISSPROT. PPE has been found in a wide range of bacteria, archaebacteria, fungi and plants. All the proteins have from 209 to 241 amino acid residues. The enzyme has a TIM barrel structure.; GO: 0004750 ribulose-phosphate 3-epimerase activity, 0005975 carbohydrate metabolic process. Probab=28.75 E-value=33 Score=14.41 Aligned_cols=48 Identities=15% Similarity=0.192 Sum_probs=22.8 Q ss_pred HHHHHHHHHHHHHHHHCCCEEEEECCCCCCCCHHHHHHHHH-H---HHHCCCCCC Q ss_conf 99999999999985316974899417789999899999999-9---971477754 Q gi|254780392|r 84 FSWLKAGGSIASYIEEDKNIEIFVDVPEYPITKAEIRDLVL-G---FMLKTYTFD 134 (494) Q Consensus 84 ~~~~ra~a~~~~~l~~~~~~~v~i~l~~~~~~~~~~~~~~e-G---~~L~~Y~Fd 134 (494) .+..|...+...++.......+.+ ++ ..+..+++.+++ | +..|||-|. T Consensus 151 ~~Kir~~R~~id~~~~~~~~~ieV--DG-Gv~~~ni~~~~~AGAD~~VaGSaiF~ 202 (216) T TIGR01163 151 LEKIRELRKMIDKLELGLSILIEV--DG-GVNEDNIAEVAEAGADILVAGSAIFG 202 (216) T ss_pred HHHHHHHHHHHHHHCCCCCEEEEE--CC-CCCHHHHHHHHHCCCCEEEEEEEEEC T ss_conf 999999999998602799558997--17-98976799999758989998310208 No 84 >pfam05666 Fels1 Fels-1 Prophage Protein-like. Probab=28.42 E-value=21 Score=15.83 Aligned_cols=19 Identities=32% Similarity=0.294 Sum_probs=13.6 Q ss_pred EEEEECCEEECCCCCCCCC Q ss_conf 2663021120066310033 Q gi|254780392|r 255 LAFIGKGVVFDTGGISIKP 273 (494) Q Consensus 255 iaLVGKGvTFDtGGislKp 273 (494) ++=+--.+|.||+|||+.- T Consensus 19 ~CD~~~~~C~ds~giS~~l 37 (45) T pfam05666 19 LCDRKERLCADSGGISRAL 37 (45) T ss_pred EECCCCCEEECCCCCCHHH T ss_conf 5647567563688767889 No 85 >PRK09374 rplB 50S ribosomal protein L2; Validated Probab=28.30 E-value=39 Score=13.89 Aligned_cols=25 Identities=12% Similarity=0.442 Sum_probs=17.3 Q ss_pred CCEEEEEECCCCCCCCCCEEEEECCEEECCC Q ss_conf 8506886205788656512663021120066 Q gi|254780392|r 237 PPYLAVMKWEGGDSEEQPLAFIGKGVVFDTG 267 (494) Q Consensus 237 ~Prli~l~Y~g~~~~~~piaLVGKGvTFDtG 267 (494) +...+.+||-|.. ...|||| .|..| T Consensus 75 ~~~V~~IeYDPnR--sa~IALv----~y~dG 99 (276) T PRK09374 75 PAKVERIEYDPNR--SANIALL----HYADG 99 (276) T ss_pred CEEEEEEEECCCC--CCEEEEE----ECCCC T ss_conf 6699999867998--8338999----85899 No 86 >smart00228 PDZ Domain present in PSD-95, Dlg, and ZO-1/2. Also called DHR (Dlg homologous region) or GLGF (relatively well conserved tetrapeptide in these domains). Some PDZs have been shown to bind C-terminal polypeptides; others appear to bind internal (non-C-terminal) polypeptides. Different PDZs possess different binding specificities. Probab=28.23 E-value=19 Score=16.13 Aligned_cols=31 Identities=26% Similarity=0.535 Sum_probs=21.4 Q ss_pred EEEEEECCCCCCCCC-CCHHHHHCCCCCCEEE Q ss_conf 998641026689997-4707763024895888 Q gi|254780392|r 307 IGVLALVENMPGSSA-QRPGDIVRSMSGQTIE 337 (494) Q Consensus 307 ~~~~~~~EN~~~~~a-~~pgdvi~~~~G~tve 337 (494) +-+.-..++.|...+ .+|||+|.+-||..++ T Consensus 28 v~I~~v~~~s~A~~~Gl~~GD~I~~vng~~v~ 59 (85) T smart00228 28 VVVSSVVPGSPAAKAGLKVGDVILEVNGTSVE 59 (85) T ss_pred EEEEEECCCCHHHHCCCCCCCEEEEECCEECC T ss_conf 99999879994787689899999999999989 No 87 >PRK08588 succinyl-diaminopimelate desuccinylase; Reviewed Probab=27.93 E-value=40 Score=13.84 Aligned_cols=15 Identities=0% Similarity=-0.020 Sum_probs=9.3 Q ss_pred HHHHHHHHHCCCCCC Q ss_conf 889999998089983 Q gi|254780392|r 38 PSIVMRAVSVKNFIG 52 (494) Q Consensus 38 ~~~L~~~i~~~~F~G 52 (494) ...|..+++..+|.- T Consensus 25 ~~~l~~~l~~~G~~~ 39 (378) T PRK08588 25 ANYLQDLFAKHGIES 39 (378) T ss_pred HHHHHHHHHHCCCCE T ss_conf 999999999889967 No 88 >cd07143 ALDH_AldA_AN0554 Aspergillus nidulans aldehyde dehydrogenase, AldA (AN0554)-like. NAD(P)+-dependent aldehyde dehydrogenase (AldA) of Aspergillus nidulans (locus AN0554), and other similar sequences, are present in this CD. Probab=27.45 E-value=41 Score=13.78 Aligned_cols=93 Identities=22% Similarity=0.262 Sum_probs=57.3 Q ss_pred HHHCCCCE-EEEECCCCHHHHHHCCCC---CEEEEECCHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHCCCCCCEECCC Q ss_conf 52079613-552111200235431776---05787089899999999899856971767768678886179864170188 Q gi|254780392|r 357 RTHYNPHL-MIDLATLTGAMVVSLGNI---YAGLFANNDVLAEQLLSSGLSTGELLWRMPMNEEYNKLIESKFADMKNIG 432 (494) Q Consensus 357 ~~~~~p~~-iid~ATLTGa~~~alG~~---~~g~~~n~~~~~~~~~~a~~~~ge~~w~lPl~~~y~~~~~s~~AD~~n~~ 432 (494) .+-|.|-+ |+-+.++-=|+-.|=... .+++||+|...+.++.+.= ++|--.+--... + ...-++.-+|++| T Consensus 384 eEiFGPVl~i~~~~~~eEAi~~aN~t~yGL~a~VfT~d~~~a~~~a~~l-~~G~v~IN~~~~--~--~~~~PfGG~K~SG 458 (481) T cd07143 384 EEIFGPVVAVIKFKTEEEAIKRANDSTYGLAAAVFTNNINNAIRVANAL-KAGTVWVNCYNL--L--HHQVPFGGYKQSG 458 (481) T ss_pred CCCCCCEEEEEEECCHHHHHHHHHCCCCCCEEEEECCCHHHHHHHHHHC-CEEEEEECCCCC--C--CCCCCCCCCCCCC T ss_conf 5566757999996999999999868999876999858999999999868-871799879988--7--8898837635271 Q ss_pred -CCCHHHHHHHHHHHHHCCCCCEEEEEE Q ss_conf -874679999999987238867699940 Q gi|254780392|r 433 -GRGAGSIVAAQFLEKFVQDASWAHIDI 459 (494) Q Consensus 433 -~~~~g~~~aa~fl~~f~~~~~w~HlDi 459 (494) ||.+|.- -|++|+ .++|+|+++ T Consensus 459 ~Gre~G~~----gl~~ft-~~K~v~i~l 481 (481) T cd07143 459 IGRELGEY----ALENYT-QIKAVHINL 481 (481) T ss_pred CCCHHHHH----HHHHHC-CCEEEEEEC T ss_conf 77242899----999833-601999949 No 89 >pfam04767 Pox_F17 DNA-binding 11 kDa phosphoprotein. Family of poxvirus proteins required for virus morphogenesis. Protein function necessary for proteolytic processing of the major viral structural proteins, P4a and P4b. Probab=27.41 E-value=28 Score=14.96 Aligned_cols=13 Identities=54% Similarity=0.708 Sum_probs=8.3 Q ss_pred EEEEECCCCCCEEE Q ss_conf 88863478664036 Q gi|254780392|r 335 TIEVINTDAEGRLI 348 (494) Q Consensus 335 tvei~nTDAEGRlv 348 (494) |==|+||| |||.. T Consensus 5 ~PFiint~-eGRyL 17 (98) T pfam04767 5 TPFIINTK-EGRYL 17 (98) T ss_pred CCEEEECC-CCEEE T ss_conf 87588725-76088 No 90 >PRK08198 threonine dehydratase; Provisional Probab=27.10 E-value=41 Score=13.74 Aligned_cols=57 Identities=25% Similarity=0.486 Sum_probs=37.0 Q ss_pred HHHHHHHHHHHHHHHHCCCCCCEEEEEEECCCCCCCCCCCHHHHHCCCCCCEEEEECCCCCCEEEEEHHHH Q ss_conf 12356899999999870878728998641026689997470776302489588863478664036523678 Q gi|254780392|r 284 LGGAAAVTGLLHVLAERKAKINAIGVLALVENMPGSSAQRPGDIVRSMSGQTIEVINTDAEGRLILADALW 354 (494) Q Consensus 284 M~GaAav~g~~~aia~l~~~v~v~~~~~~~EN~~~~~a~~pgdvi~~~~G~tvei~nTDAEGRlvLaD~l~ 354 (494) .||..-.-|+..++..+++.++|+|+=| |+.|+ =..--.+|+-|++...| .+||++. T Consensus 180 vGGGGLiaGia~a~K~~~P~ikViGVEp--e~a~~-------~~~Sl~ag~~v~~~~~~-----tiADGia 236 (406) T PRK08198 180 IGGGGLISGVATAVKALRPNVRVIGVQA--EGAPA-------MPLSLAAGRPVELESVD-----TIADGIA 236 (406) T ss_pred CCCHHHHHHHHHHHHHHCCCCCEEEEEE--CCCHH-------HHHHHHCCCCEECCCCC-----EEECCCC T ss_conf 7862688999999998689970899965--78768-------99999849956559999-----7971144 No 91 >PRK07495 4-aminobutyrate aminotransferase; Provisional Probab=26.89 E-value=42 Score=13.72 Aligned_cols=61 Identities=23% Similarity=0.281 Sum_probs=35.3 Q ss_pred CEEECHHHHHHHHHHHH-CCCCCHHHHCCCCCCHHHHHHHHHHHCCCCCCCEEEEEECCCCCCCCCCEEEEECCEEECCC Q ss_conf 40302999999999861-13321000011100013458988742025778506886205788656512663021120066 Q gi|254780392|r 189 ANVLGTDEFCKQVRKLE-SLGVAVEILDKEAMHKLGMNALLAVAQGSSRPPYLAVMKWEGGDSEEQPLAFIGKGVVFDTG 267 (494) Q Consensus 189 aN~ltP~~lA~~a~~l~-~~gl~v~Vld~~~L~klGMg~llaVgqGS~~~Prli~l~Y~g~~~~~~piaLVGKGvTFDtG 267 (494) +.+..|..|-+.+++++ ++|+ +=|+||-+.==-..|-+.+ .+|-+- ++.|+-+|||++ | T Consensus 210 G~~~~~~~yl~~lr~lc~~~gi-llI~DEV~TGfGRtG~~fa-------------~e~~gv---~PDiv~~gKgl~---g 269 (422) T PRK07495 210 GFYPAPAAFMKALRELCDQHGI-LLIADEVQTGFARTGKLFA-------------MEHHEV---AADLTTMAKSLA---G 269 (422) T ss_pred CCEECCHHHHHHHHHHHHHCCC-EEECCCCCCCCCCCCCCHH-------------HHCCCC---CCCEEEECHHHH---C T ss_conf 8663888999999999987396-6877630108876655026-------------544899---980999641542---8 Q ss_pred CC Q ss_conf 31 Q gi|254780392|r 268 GI 269 (494) Q Consensus 268 Gi 269 (494) |+ T Consensus 270 G~ 271 (422) T PRK07495 270 GF 271 (422) T ss_pred CC T ss_conf 87 No 92 >PRK08091 ribulose-phosphate 3-epimerase; Validated Probab=26.59 E-value=32 Score=14.50 Aligned_cols=16 Identities=13% Similarity=0.135 Sum_probs=6.5 Q ss_pred HHHHHHCCCEEEEECC Q ss_conf 9985316974899417 Q gi|254780392|r 94 ASYIEEDKNIEIFVDV 109 (494) Q Consensus 94 ~~~l~~~~~~~v~i~l 109 (494) .+.+...+...+.++. T Consensus 31 i~~l~~~g~d~lHiDI 46 (235) T PRK08091 31 LQQLEALNQPLLHFDI 46 (235) T ss_pred HHHHHHCCCCEEEECC T ss_conf 9999977999999818 No 93 >PRK00326 cell division protein MraZ; Reviewed Probab=26.50 E-value=11 Score=17.68 Aligned_cols=28 Identities=29% Similarity=0.553 Sum_probs=15.1 Q ss_pred HHHHCCCCCCEEEEECCCCCCEEEEEHHH Q ss_conf 77630248958886347866403652367 Q gi|254780392|r 325 GDIVRSMSGQTIEVINTDAEGRLILADAL 353 (494) Q Consensus 325 gdvi~~~~G~tvei~nTDAEGRlvLaD~l 353 (494) .+..+.+.+.+.|+ ..|..||++|-+.| T Consensus 70 ~~~~r~~~~~a~~~-~~D~~GRi~lP~~l 97 (147) T PRK00326 70 RAFQRLLLGGAVEV-ELDKQGRILIPPNL 97 (147) T ss_pred HHHHHHHHCCCCEE-EECCCCCEECCHHH T ss_conf 99999984377050-06798718689999 No 94 >PRK13590 putative bifunctional OHCU decarboxylase/allantoate amidohydrolase; Provisional Probab=26.29 E-value=43 Score=13.64 Aligned_cols=136 Identities=14% Similarity=0.135 Sum_probs=94.0 Q ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEEEECCCCCCCCCCCHHHHHCCCCCCEEEEECCCCCCEEEEEHHH Q ss_conf 33200232211235689999999987087872899864102668999747077630248958886347866403652367 Q gi|254780392|r 274 SHGMEEMKGDLGGAAAVTGLLHVLAERKAKINAIGVLALVENMPGSSAQRPGDIVRSMSGQTIEVINTDAEGRLILADAL 353 (494) Q Consensus 274 ~~~M~~Mk~DM~GaAav~g~~~aia~l~~~v~v~~~~~~~EN~~~~~a~~pgdvi~~~~G~tvei~nTDAEGRlvLaD~l 353 (494) +--|..=|-=|.+||-++-.+.-++.... +.++.++-.+--|+..-.-||.|. -||+|.+.|.+-|--+.+.+ T Consensus 407 TTPM~~RrDAl~aAAelil~ve~~a~~~~--~~VaTVG~i~v~Pns~NVIPG~v~-----ftlDiR~~~d~~~~~~~~~i 479 (590) T PRK13590 407 TTPMDRRRDAATAVAELALYVEQRAAADG--DSVGTVGMLEVPGGSINVVPGRCR-----FSLDLRAPTDAQRDALAADV 479 (590) T ss_pred CCCHHHHCCHHHHHHHHHHHHHHHHHHCC--CEEEEEEEEEECCCCCCEECCEEE-----EEEECCCCCHHHHHHHHHHH T ss_conf 88745522399999999999999986379--869999999606997541687489-----99982489989999999999 Q ss_pred HHHH----HHCCCCEEEEECCCCHHHHHHCCCCCEEEEECCHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHCCCCCCE Q ss_conf 8875----20796135521112002354317760578708989999999989985697176776867888617986417 Q gi|254780392|r 354 WYCR----THYNPHLMIDLATLTGAMVVSLGNIYAGLFANNDVLAEQLLSSGLSTGELLWRMPMNEEYNKLIESKFADM 428 (494) Q Consensus 354 ~ya~----~~~~p~~iid~ATLTGa~~~alG~~~~g~~~n~~~~~~~~~~a~~~~ge~~w~lPl~~~y~~~~~s~~AD~ 428 (494) .-.. .+.+-. +++-.+.-.. . ..-|..+.+.+.++.+..|-+.++||-.-.++-+.=+.+.+. T Consensus 480 ~~~~~~Ia~~rgl~--~~~e~~~~~~---------p-~~~d~~l~~~le~aa~~lG~~~~~m~SGAGHDA~~ma~i~Pt 546 (590) T PRK13590 480 LAQLEAIAERRGLH--YTLEETMRAA---------A-APSAPAWQRRWEAAVAALGLPVHRMPSGAGHDAMKLHEIMPQ 546 (590) T ss_pred HHHHHHHHHHCCCC--EEEEEEECCC---------C-CCCCHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHCCCCE T ss_conf 99999999984993--7999985179---------9-779999999999999976999740187726999999754887 No 95 >PRK09521 exosome complex RNA-binding protein Csl4; Provisional Probab=26.28 E-value=14 Score=17.08 Aligned_cols=13 Identities=46% Similarity=0.641 Sum_probs=7.9 Q ss_pred CCCCCHHHHHCCC Q ss_conf 9974707763024 Q gi|254780392|r 319 SSAQRPGDIVRSM 331 (494) Q Consensus 319 ~~a~~pgdvi~~~ 331 (494) .++|||||||+++ T Consensus 117 ~d~fr~gDIVrAk 129 (187) T PRK09521 117 TDAFKIGDIVRAK 129 (187) T ss_pred HHHCCCCCEEEEE T ss_conf 7524878689999 No 96 >TIGR01477 RIFIN variant surface antigen, rifin family; InterPro: IPR006373 Malaria is still a major cause of mortality in many areas of the world. Plasmodium falciparum causes the most severe human form of the disease and is responsible for most fatalities. Severe cases of malaria can occur when the parasite invades and then proliferates within red blood cell erythrocytes. The parasite produces many variant antigenic proteins, encoded by multigene families, which are present on the surface of the infected erythrocyte and play important roles in virulence. A crucial survival mechanism for the malaria parasite is its ability to evade the immune response by switching these variant surface antigens. The high virulence of P. falciparum relative to other malarial parasites is in large part due to the fact that in this organism many of these surface antigens mediate the binding of infected erythrocytes to the vascular endothelium (cytoadherence) and non-infected erythrocytes (rosetting). This can lead to the accumulation of infected cells in the vasculature of a variety of organs, blocking the blood flow and reducing the oxygen supply. Clinical symptoms of severe infection can include fever, progressive anaemia, multi-organ dysfunction and coma. For more information see . This entry represents the rifin family of variant surface antigens, encoded by the rif (short for "repetetive interspersed family") genes. This is the largest variant surface antigen family in P. falciparum, with roughly 150 genes present in the genome of this parasite . The function of these proteins is not yet known, but rifins are strongly immunogenic and high levels of anti-rifin antibodies are correlated with rapid parasite clearance and asymptomatic infections . . Probab=25.59 E-value=27 Score=15.01 Aligned_cols=20 Identities=15% Similarity=0.122 Sum_probs=13.1 Q ss_pred CCCCCEEECHHHHHHHHHHH Q ss_conf 21254030299999999986 Q gi|254780392|r 185 INEPANVLGTDEFCKQVRKL 204 (494) Q Consensus 185 vN~PaN~ltP~~lA~~a~~l 204 (494) +=.|.|.-+|..|.+.+.+. T Consensus 241 i~~~~nY~~~~~l~~~i~~~ 260 (395) T TIGR01477 241 IINASNYTNPSLLSDAIYKE 260 (395) T ss_pred CCCCCCCCCHHHHHHHHHHH T ss_conf 04765654167899999998 No 97 >PRK06382 threonine dehydratase; Provisional Probab=25.57 E-value=42 Score=13.66 Aligned_cols=43 Identities=23% Similarity=0.417 Sum_probs=28.6 Q ss_pred HHHHHHHHHHHHHHHCCCCCCEEEEEEECCCCCCC-CCCCHHHHHC Q ss_conf 23568999999998708787289986410266899-9747077630 Q gi|254780392|r 285 GGAAAVTGLLHVLAERKAKINAIGVLALVENMPGS-SAQRPGDIVR 329 (494) Q Consensus 285 ~GaAav~g~~~aia~l~~~v~v~~~~~~~EN~~~~-~a~~pgdvi~ 329 (494) ||..-+-|+-.++..+++.++|+|+=| |+.|+- .|.+.|-+++ T Consensus 176 GGGGLisGia~a~K~~~P~ikViGVEp--e~a~~~~~sl~~g~~v~ 219 (400) T PRK06382 176 GGGGLISGIALAAKHINPNVKIIGIES--ELSDSMKASLREGKIVA 219 (400) T ss_pred CCCHHHHHHHHHHHHHCCCCEEEEECC--CCCHHHHHHHHCCCCCC T ss_conf 871588999999998599975999796--89878999987799620 No 98 >PRK00910 ribB 3,4-dihydroxy-2-butanone 4-phosphate synthase; Provisional Probab=25.50 E-value=34 Score=14.34 Aligned_cols=60 Identities=17% Similarity=0.298 Sum_probs=32.8 Q ss_pred HHCCCCCEEECHHHHHHHHHHHHCCCCCHHHHCCCCCCHHHHHHHHHHHCCCCCCCEEEEEEC Q ss_conf 202125403029999999998611332100001110001345898874202577850688620 Q gi|254780392|r 183 DIINEPANVLGTDEFCKQVRKLESLGVAVEILDKEAMHKLGMNALLAVAQGSSRPPYLAVMKW 245 (494) Q Consensus 183 DLvN~PaN~ltP~~lA~~a~~l~~~gl~v~Vld~~~L~klGMg~llaVgqGS~~~Prli~l~Y 245 (494) ||+ .|+..+||+.++-.++. ..|+-|-.+.++.++++|.+.+..-+..+...+..|-+.+ T Consensus 43 Dlv-~aAe~~t~e~i~fm~~~--~~GliC~~~~~~~~~~L~Lp~mv~~~~~~~~taftvsvda 102 (218) T PRK00910 43 DII-YSVEHLTNAQMALMIRE--CSGIVCLCLTDEQANKLELPPMVVNNNSANQTAFTVSIEA 102 (218) T ss_pred CEE-EEHHHCCHHHHHHHHHH--CCCCEEEECCHHHHHHCCCCCCCCCCCCCCCCCCEEEEEE T ss_conf 168-67664999999999995--9971996479999977799865678888788762688760 No 99 >PRK12394 putative metallo-dependent hydrolase; Provisional Probab=25.43 E-value=27 Score=15.07 Aligned_cols=47 Identities=19% Similarity=0.356 Sum_probs=28.7 Q ss_pred CCCCHHHHHCC-CCCCEEEEECCCCCCEEEEEHHHHHHHHHCCCCEEEEECCCCH Q ss_conf 97470776302-4895888634786640365236788752079613552111200 Q gi|254780392|r 320 SAQRPGDIVRS-MSGQTIEVINTDAEGRLILADALWYCRTHYNPHLMIDLATLTG 373 (494) Q Consensus 320 ~a~~pgdvi~~-~~G~tvei~nTDAEGRlvLaD~l~ya~~~~~p~~iid~ATLTG 373 (494) .-+|||||+|- ++|+.=.|.+-| |+++ ..+.-| .+.-.++|+..=++ T Consensus 204 ~~lr~GDI~tH~f~~~~~~i~~~~--g~v~--~~v~~A---~~rGv~fDvghG~~ 251 (387) T PRK12394 204 SLLRRGDIIAHAFHGKGSTILTDE--GAVL--AEVRQA---RERGVIFDAANGRS 251 (387) T ss_pred HHHCCCCEEEEECCCCCCCEECCC--CCCH--HHHHHH---HCCCEEEEECCCCC T ss_conf 874569889862158888646888--7482--999987---50876899358888 No 100 >PRK05581 ribulose-phosphate 3-epimerase; Validated Probab=25.20 E-value=41 Score=13.76 Aligned_cols=14 Identities=36% Similarity=0.242 Sum_probs=5.5 Q ss_pred HHHHCCCEEEEECC Q ss_conf 85316974899417 Q gi|254780392|r 96 YIEEDKNIEIFVDV 109 (494) Q Consensus 96 ~l~~~~~~~v~i~l 109 (494) .+...+...+.+++ T Consensus 24 ~l~~~g~~~lHiDI 37 (220) T PRK05581 24 AVEAAGADWIHVDV 37 (220) T ss_pred HHHHCCCCEEEEEC T ss_conf 99976999899957 No 101 >cd00990 PDZ_glycyl_aminopeptidase PDZ domain associated with archaeal and bacterial M61 glycyl-aminopeptidases. May be responsible for substrate recognition and/or binding, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of protease-associated PDZ domains a C-terminal beta-strand is presumed to form the peptide-binding groove base, a circular permutation with respect to PDZ domains found in Eumetazoan signaling proteins. Probab=25.20 E-value=23 Score=15.53 Aligned_cols=31 Identities=19% Similarity=0.290 Sum_probs=24.3 Q ss_pred EEEEEECCCCCCCCC-CCHHHHHCCCCCCEEE Q ss_conf 998641026689997-4707763024895888 Q gi|254780392|r 307 IGVLALVENMPGSSA-QRPGDIVRSMSGQTIE 337 (494) Q Consensus 307 ~~~~~~~EN~~~~~a-~~pgdvi~~~~G~tve 337 (494) +-|.-..+|.|...| .+|||+|.+.||..|. T Consensus 14 ~~V~~V~~~sPA~~AGl~~GD~IvaidG~~v~ 45 (80) T cd00990 14 GKVTFVRDDSPADKAGLVAGDELVAVNGWRVD 45 (80) T ss_pred EEEEEECCCCHHHHCCCCCCCEEEEECCEEEH T ss_conf 99999888996998599989999999999923 No 102 >TIGR03240 arg_catab_astD succinylglutamic semialdehyde dehydrogenase. Members of this protein family are succinylglutamic semialdehyde dehydrogenase (EC 1.2.1.71), the fourth enzyme in the arginine succinyltransferase (AST) pathway for arginine catabolism. Probab=25.15 E-value=45 Score=13.50 Aligned_cols=97 Identities=18% Similarity=0.356 Sum_probs=62.1 Q ss_pred HHHCCCCE-EEEECCCCHHHHHHCCCC---CEEEEECCHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHCCCCCCEECCC Q ss_conf 52079613-552111200235431776---05787089899999999899856971767768678886179864170188 Q gi|254780392|r 357 RTHYNPHL-MIDLATLTGAMVVSLGNI---YAGLFANNDVLAEQLLSSGLSTGELLWRMPMNEEYNKLIESKFADMKNIG 432 (494) Q Consensus 357 ~~~~~p~~-iid~ATLTGa~~~alG~~---~~g~~~n~~~~~~~~~~a~~~~ge~~w~lPl~~~y~~~~~s~~AD~~n~~ 432 (494) ++-|.|-. |+-+-++--|+-.|=... .+++||+|..-+..+.+. -++|--.|--|.... ...-++.=+|++| T Consensus 372 eEiFGPVl~V~~~~~~dEAI~~aN~t~yGL~a~V~t~d~~~a~~~a~~-l~aG~V~iN~~~~~~---~~~~PFGG~K~SG 447 (484) T TIGR03240 372 EEHFGPLLQVIRYDDFDEAIAIANNTRFGLSAGLLSDDRELYDRFLLE-IRAGIVNWNKPLTGA---SSAAPFGGIGASG 447 (484) T ss_pred CCCCCCEEEEEEECCHHHHHHHHHCCCCCCEEEEECCCHHHHHHHHHH-CCEEEEEEECCCCCC---CCCCCCCCCCCCC T ss_conf 887471489998399999999984899888599984899999999986-882079998998888---8799878765285 Q ss_pred -CCCHHHHHHHHHHHHHCCCCCEEEEEEEECC Q ss_conf -8746799999999872388676999400001 Q gi|254780392|r 433 -GRGAGSIVAAQFLEKFVQDASWAHIDIAGVA 463 (494) Q Consensus 433 -~~~~g~~~aa~fl~~f~~~~~w~HlDiag~a 463 (494) ||.+|.-. +..|+. ||+||.+--.+ T Consensus 448 ~GRe~g~~~----~d~~~~--~~~~~~~~~~~ 473 (484) T TIGR03240 448 NHRPSAYYA----ADYCAY--PVASLEADSLT 473 (484) T ss_pred CCCCCCHHH----HHHHHH--HHHEECCCCCC T ss_conf 898876688----886331--15202056445 No 103 >KOG3332 consensus Probab=25.12 E-value=28 Score=14.94 Aligned_cols=22 Identities=9% Similarity=0.069 Sum_probs=13.1 Q ss_pred HHHHCCCCCEEECHHHHHHHHH Q ss_conf 6420212540302999999999 Q gi|254780392|r 181 ARDIINEPANVLGTDEFCKQVR 202 (494) Q Consensus 181 ARDLvN~PaN~ltP~~lA~~a~ 202 (494) ++++-+-|.....|+.+++... T Consensus 108 ~~~f~Dg~~~~Wd~~~v~~~l~ 129 (247) T KOG3332 108 TPFFQDGPGEDWDPDAVASILL 129 (247) T ss_pred CCCCCCCCCCCCCHHHHHHHHH T ss_conf 7767778544468889999999 No 104 >PRK06833 L-fuculose phosphate aldolase; Provisional Probab=24.85 E-value=45 Score=13.47 Aligned_cols=54 Identities=13% Similarity=0.203 Sum_probs=42.5 Q ss_pred CCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEECHHHHHHHHHHHHCCC Q ss_conf 33224577378999999999999999986420212540302999999999861133 Q gi|254780392|r 153 SVTIVTEMIQQSSQVVADIQSVVKGVNLARDIINEPANVLGTDEFCKQVRKLESLG 208 (494) Q Consensus 153 ~i~~~~~~~~~~~~~i~~~~aiaegv~lARDLvN~PaN~ltP~~lA~~a~~l~~~g 208 (494) .+...+.+..++-......+-.++-...||.|+ .++.+.++.+.+..++++.+| T Consensus 158 Gvi~~G~~l~eA~~~~~~lE~~a~~~~~a~~~G--~~~~l~~e~i~~~~~~~~~yg 211 (214) T PRK06833 158 GLLAGANNLKNAFNIAEEIEFCAEIYYQTKSIG--EPKLLPEDEMENMAKKFKGYG 211 (214) T ss_pred CEEEECCCHHHHHHHHHHHHHHHHHHHHHHHCC--CCCCCCHHHHHHHHHHHHCCC T ss_conf 527868999999999999999999999998459--997499999999999985659 No 105 >pfam03123 CAT_RBD CAT RNA binding domain. This RNA binding domain is found at the amino terminus of transcriptional antitermination proteins such as BglG, SacY and LicT. These proteins control the expression of sugar metabolising operons in Gram+ and Gram- bacteria. This domain has been called the CAT (Co-AntiTerminator) domain. It binds as a dimer to short Ribonucleotidic Anti-Terminator (RAT) hairpin, each monomer interacting symmetrically with both strands of the RAT hairpin. In the full-length protein, CAT is followed by two phosphorylatable PTS regulation domains (pfam00874) that modulate the RNA binding activity of CAT. Upon activation, the dimeric proteins bind to RAT targets in the nascent mRNA, thereby preventing abortive dissociation of the RNA polymerase from the DNA template. Probab=24.09 E-value=46 Score=13.42 Aligned_cols=15 Identities=27% Similarity=0.709 Sum_probs=10.6 Q ss_pred CCCEEEEECCEEECC Q ss_conf 651266302112006 Q gi|254780392|r 252 EQPLAFIGKGVVFDT 266 (494) Q Consensus 252 ~~piaLVGKGvTFDt 266 (494) .+-+.+.||||-|-. T Consensus 19 ~~E~Iv~GkGIgF~k 33 (55) T pfam03123 19 GNEVIVMGKGIAFNK 33 (55) T ss_pred CCEEEEEECCEEECC T ss_conf 999999976602288 No 106 >KOG1352 consensus Probab=23.93 E-value=47 Score=13.35 Aligned_cols=121 Identities=12% Similarity=0.145 Sum_probs=56.0 Q ss_pred CCCCEEEEEECCCHHHCCHHHHHHHHHHHHHHHHCC--C-EEEEECCCCCCCCHHHH--HHHHHHHHHCCCCCCCCCCCC Q ss_conf 874459999448815589999999999999853169--7-48994177899998999--999999971477754444364 Q gi|254780392|r 66 CVWDRLVVAGIGDPRDANFSWLKAGGSIASYIEEDK--N-IEIFVDVPEYPITKAEI--RDLVLGFMLKTYTFDQYKTKK 140 (494) Q Consensus 66 ~~~~rViLVGLG~~~~~~~~~~ra~a~~~~~l~~~~--~-~~v~i~l~~~~~~~~~~--~~~~eG~~L~~Y~Fd~yKskk 140 (494) ++.+.|+.||||+...--.+.++-.-.+.-...... + +...+ +.+..+-...+ +.+-.|+.|+-|--|.-+ T Consensus 269 SNSD~iiYVGCGERGNEMsEVL~dFPeLt~ev~G~~esiMKRT~L-VANTSNMPVAAREASIYTGITlsEYfRDmG~--- 344 (618) T KOG1352 269 SNSDAIIYVGCGERGNEMSEVLMDFPELTMEVDGKTESIMKRTAL-VANTSNMPVAAREASIYTGITLSEYFRDMGY--- 344 (618) T ss_pred CCCCEEEEECCCCCCHHHHHHHHHCHHHEEECCCCCHHHHHHHHH-HHCCCCCCHHHHHHHHHHCEEHHHHHHHCCC--- T ss_conf 578769997366652158999873712286348842145544122-1047888504432023315069999986276--- Q ss_pred CCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEECHHHHHHHHHHH-HCCC-CCH Q ss_conf 3332321000133322457737899999999999999998642021254030299999999986-1133-210 Q gi|254780392|r 141 RESSSPRSKENISVTIVTEMIQQSSQVVADIQSVVKGVNLARDIINEPANVLGTDEFCKQVRKL-ESLG-VAV 211 (494) Q Consensus 141 ~~~~~~~~~~~~~i~~~~~~~~~~~~~i~~~~aiaegv~lARDLvN~PaN~ltP~~lA~~a~~l-~~~g-l~v 211 (494) ++....+....-.+++++ ..-+|..+|+.--+|.||..+...+ .+.| ++| T Consensus 345 ------------nVsMmADStSRWAEALRE---------ISGRLaEMPADsGyPAYLgArLAsFYERAG~vkc 396 (618) T KOG1352 345 ------------NVSMMADSTSRWAEALRE---------ISGRLAEMPADSGYPAYLGARLASFYERAGRVKC 396 (618) T ss_pred ------------CEEEEECCHHHHHHHHHH---------HHHHHHCCCCCCCCCHHHHHHHHHHHHHCCCEEE T ss_conf ------------445541544689999998---------6305641867679727888899999986470254 No 107 >PRK06446 hypothetical protein; Provisional Probab=23.93 E-value=47 Score=13.35 Aligned_cols=16 Identities=25% Similarity=0.509 Sum_probs=12.7 Q ss_pred CCCHHHHHHHHHHHCC Q ss_conf 4358999999997668 Q gi|254780392|r 478 GFGVRLLDEFVRAFYE 493 (494) Q Consensus 478 g~~vr~l~~~~~~~~~ 493 (494) -.+++.+.+||++||- T Consensus 417 ~k~i~~~~~~i~~y~~ 432 (433) T PRK06446 417 YKAIEHTKEFLREYYH 432 (433) T ss_pred HHHHHHHHHHHHHHCC T ss_conf 9999999999998748 No 108 >PRK09290 allantoate amidohydrolase; Reviewed Probab=23.70 E-value=47 Score=13.32 Aligned_cols=136 Identities=15% Similarity=0.179 Sum_probs=94.9 Q ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEEEECCCCCCCCCCCHHHHHCCCCCCEEEEECCCCCCEEEEEHHH Q ss_conf 33200232211235689999999987087872899864102668999747077630248958886347866403652367 Q gi|254780392|r 274 SHGMEEMKGDLGGAAAVTGLLHVLAERKAKINAIGVLALVENMPGSSAQRPGDIVRSMSGQTIEVINTDAEGRLILADAL 353 (494) Q Consensus 274 ~~~M~~Mk~DM~GaAav~g~~~aia~l~~~v~v~~~~~~~EN~~~~~a~~pgdvi~~~~G~tvei~nTDAEGRlvLaD~l 353 (494) .-.|+.=|-=|.+||-++-.+..++... .-..++.++-.+.-|+....-||-|. -|+||.+.|.|-+--+.+.+ T Consensus 229 ttPM~~R~DAl~aaa~~i~~v~~~a~~~-~~~~vaTvG~i~~~P~a~NvIPg~v~-----ftvDiRs~~~~~l~~~~~~i 302 (412) T PRK09290 229 TTPMALRRDALLAAAEIILAVERIAAAH-GPDLVATVGRLEVYPNSRNVIPGEVT-----FTLDIRHPDDAVLDAMVAEL 302 (412) T ss_pred CCCHHHCCCHHHHHHHHHHHHHHHHHHH-CCCCEEEEEEEEECCCCCCCCCCEEE-----EEEEECCCCHHHHHHHHHHH T ss_conf 8963534079999999999999999972-99838987899835874642188689-----99982278999999999999 Q ss_pred HHHHH----HCCCCEEEEECCCCHHHHHHCCCCCEEEEECCHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHCCCCCC Q ss_conf 88752----079613552111200235431776057870898999999998998569717677686788861798641 Q gi|254780392|r 354 WYCRT----HYNPHLMIDLATLTGAMVVSLGNIYAGLFANNDVLAEQLLSSGLSTGELLWRMPMNEEYNKLIESKFAD 427 (494) Q Consensus 354 ~ya~~----~~~p~~iid~ATLTGa~~~alG~~~~g~~~n~~~~~~~~~~a~~~~ge~~w~lPl~~~y~~~~~s~~AD 427 (494) .-..+ +.+-. +++-.+. ...+..| |+.+.+.+.++.+..|-+.-+||-.--++-+.=+.+.+ T Consensus 303 ~~~~~~ia~~~g~~--~e~~~~~--------~~~pv~~--d~~l~~~~~~aa~~~g~~~~~m~SGAGHDA~~~a~~~P 368 (412) T PRK09290 303 RAAAEAIAARRGVG--IEIELIS--------DFAPVPF--DPALVAALERAAEALGLSYRRLPSGAGHDAQILAAVVP 368 (412) T ss_pred HHHHHHHHHHHCCE--EEEEEEE--------ECCCCCC--CHHHHHHHHHHHHHCCCCCEECCCCHHHHHHHHHCCCC T ss_conf 99999999972976--9999998--------3598567--99999999999997599951405506789999865188 No 109 >TIGR01664 DNA-3'-Pase DNA 3'-phosphatase; InterPro: IPR006551 These proteins catalyse the dephosphorylation of DNA 3'-phosphates. It is believed that this activity is important for the repair of single-strand breaks in DNA caused by radiation or oxidative damage. This region is often , , but not always linked to a DNA 5'-kinase domain , . As is common in this superfamily, DNA 3-phosphatase is magnesium dependent. A difference between this enzyme and other HAD-superfamily phosphatases is in the third conserved catalytic motif which usually contains two conserved aspartate residues believed to be involved in binding the magnesium ion. Here, the second aspartate is usually replaced by an arginine residue which may indicate an interaction with the phosphate backbone of the substrate. Alternatively, there is an additional conserved aspartate downstream of the usual site which may indicate a slightly different fold in this region.. Probab=23.54 E-value=48 Score=13.30 Aligned_cols=31 Identities=6% Similarity=0.050 Sum_probs=13.7 Q ss_pred CCCCCHHHHHHHHHCCCCCCCCCCEEEEECCCC Q ss_conf 123888899999980899836787899843788 Q gi|254780392|r 33 SLIGSPSIVMRAVSVKNFIGESKSHLNILAPVD 65 (494) Q Consensus 33 ~~~~~~~~L~~~i~~~~F~GK~Ge~l~l~~~~g 65 (494) .....+..|.+++..- .-+.|-.+.|.+.+. T Consensus 51 ~~~~~p~kL~~LCnii--d~~~gY~iVIftNQ~ 81 (239) T TIGR01664 51 LFKEVPEKLKELCNII--DEDEGYKIVIFTNQS 81 (239) T ss_pred CCCCCHHHHHHHHHHH--CCCCCEEEEEEECCC T ss_conf 2554147999885441--035870799987146 No 110 >cd00378 SHMT Serine-glycine hydroxymethyltransferase (SHMT). This family belongs to pyridoxal phosphate (PLP)-dependent aspartate aminotransferase superfamily (fold I). SHMT carries out interconversion of serine and glycine; it catalyzes the transfer of hydroxymethyl group of N5, N10-methylene tetrahydrofolate to glycine resulting in the formation of serine and tetrahydrofolate. Both eukaryotic and prokaryotic SHMT enzymes form tight obligate homodimers; the mammalian enzyme forms a homotetramer comprising four pyridoxal phosphate-bound active sites. Probab=23.33 E-value=48 Score=13.27 Aligned_cols=80 Identities=21% Similarity=0.145 Sum_probs=44.0 Q ss_pred CCCCEEEEEHHHHHHH----H-HCCCCEEEEECCCCHHHHHHC-CCCCEEEEECCHHHHHHHHHHH-------------- Q ss_conf 8664036523678875----2-079613552111200235431-7760578708989999999989-------------- Q gi|254780392|r 342 DAEGRLILADALWYCR----T-HYNPHLMIDLATLTGAMVVSL-GNIYAGLFANNDVLAEQLLSSG-------------- 401 (494) Q Consensus 342 DAEGRlvLaD~l~ya~----~-~~~p~~iid~ATLTGa~~~al-G~~~~g~~~n~~~~~~~~~~a~-------------- 401 (494) |.-|-++++|+=.++- . .-+|=.--|+-|=|-+- .| |+.-.-+|+|+++++++|.++- T Consensus 186 d~vgA~l~~D~aHi~GLIA~g~~p~Pl~~ADvvt~tTHK--TlrGPrGGiIl~~~~~~~~kId~avfPg~qggph~h~iA 263 (402) T cd00378 186 DEVGAYLLVDMAHVAGLVAGGVFPNPLPGADVVTTTTHK--TLRGPRGGLILTRKGELAKKINSAVFPGLQGGPHLHVIA 263 (402) T ss_pred HHCCCEEEEEHHHHHHHEECCCCCCCCCCCEEEECCCCC--CCCCCCCEEEEECCHHHHHHHHHHHCCCCCCCHHHHHHH T ss_conf 870988985501234430356579976876577446434--578998408995427899885465098644674566789 Q ss_pred -HHHCCCCCCCCCCHHHHHHHCC Q ss_conf -9856971767768678886179 Q gi|254780392|r 402 -LSTGELLWRMPMNEEYNKLIES 423 (494) Q Consensus 402 -~~~ge~~w~lPl~~~y~~~~~s 423 (494) ...-=.-+.-|-+++|.+|+-. T Consensus 264 a~Aval~Ea~~~~f~~Ya~qVv~ 286 (402) T cd00378 264 AKAVALKEALEPEFKAYAKQVVE 286 (402) T ss_pred HHHHHHHHHHCHHHHHHHHHHHH T ss_conf 99999999868668999999999 No 111 >pfam00595 PDZ PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins. Probab=23.24 E-value=32 Score=14.48 Aligned_cols=31 Identities=23% Similarity=0.360 Sum_probs=20.4 Q ss_pred EEEECCCCCCCCC-CCHHHHHCCCCCCEEEEE Q ss_conf 8641026689997-470776302489588863 Q gi|254780392|r 309 VLALVENMPGSSA-QRPGDIVRSMSGQTIEVI 339 (494) Q Consensus 309 ~~~~~EN~~~~~a-~~pgdvi~~~~G~tvei~ 339 (494) +.-...|.|...+ .+|||+|..-||..|+=. T Consensus 28 V~~V~~~~~A~~~gL~~GD~Il~VNg~~v~~~ 59 (80) T pfam00595 28 VSEVLPGGAAEAGGLQVGDRILSINGQDLENM 59 (80) T ss_pred EEEECCCCCHHHCCCCCCCEEEEECCEECCCC T ss_conf 99977898055487999999999999998999 No 112 >PRK06082 4-aminobutyrate aminotransferase; Provisional Probab=23.10 E-value=49 Score=13.24 Aligned_cols=94 Identities=18% Similarity=0.212 Sum_probs=44.4 Q ss_pred ECHHHHHHHHHHHH-CCCCCHHHHCCCCCCHHHHHHHHHHHCCCCCCCEEEEEECCCCCCCCCCEEEEECCEEECCCCC- Q ss_conf 02999999999861-1332100001110001345898874202577850688620578865651266302112006631- Q gi|254780392|r 192 LGTDEFCKQVRKLE-SLGVAVEILDKEAMHKLGMNALLAVAQGSSRPPYLAVMKWEGGDSEEQPLAFIGKGVVFDTGGI- 269 (494) Q Consensus 192 ltP~~lA~~a~~l~-~~gl~v~Vld~~~L~klGMg~llaVgqGS~~~Prli~l~Y~g~~~~~~piaLVGKGvTFDtGGi- 269 (494) ..|..|-+.+++++ ++|+ .-|+||-+.-=-..|-+.+ .++-+- .+.|...|||++ ||+ T Consensus 246 ~P~~~yl~~lr~lc~~~gi-LlI~DEVqtGfGRTG~~fa-------------~e~~gv---~PDIv~~gKglg---gG~~ 305 (460) T PRK06082 246 VPSKAYWKRIREICDKHNV-LLIIDDIPNGMGRSGEWFT-------------HQAFDI---EPDILCIGKGLG---GGLV 305 (460) T ss_pred CCCHHHHHHHHHHHHHCCC-EEECCCCCCCCCCCCCHHH-------------HHCCCC---CCCEEEECHHHH---CCCC T ss_conf 6898999999999997298-5760332028875443446-------------631799---982899881643---8977 Q ss_pred CC-----C----CCC--CC--HHHHHHHHHHHHHHHHHHHHHHCCCCCC Q ss_conf 00-----3----333--20--0232211235689999999987087872 Q gi|254780392|r 270 SI-----K----PSH--GM--EEMKGDLGGAAAVTGLLHVLAERKAKIN 305 (494) Q Consensus 270 sl-----K----p~~--~M--~~Mk~DM~GaAav~g~~~aia~l~~~v~ 305 (494) -+ + ... .+ .+.-..--++|+.+.++..+-+-++.-| T Consensus 306 Pisav~~~~~~~~~~~~~~g~~T~~gnPl~cAaAlA~L~iie~e~l~~~ 354 (460) T PRK06082 306 PIAAMITKDKYNTAAQISLGHYTHEKSPIGCAAALATMEAIEQEGLLDK 354 (460) T ss_pred CEEEEEEHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCHHHH T ss_conf 6389987488552015776543678987999999999999972578899 No 113 >cd02903 Macro_BAL_like Macro domain, BAL_like family. The macro domain is a high-affinity ADP-ribose binding module found in a variety of proteins as a stand-alone domain or in combination with other domains like in histone macroH2A and some PARPs (poly ADP-ribose polymerases). Some macro domains recognize poly ADP-ribose as a ligand. Previously identified as displaying an Appr-1"-p (ADP-ribose-1"-monophosphate) processing activity, the macro domain may play roles in distinct ADP-ribose pathways, such as the ADP-ribosylation of proteins, an important post-translational modification which occurs in DNA repair, transcription, chromatin biology, and long-term memory formation, among other processes. Members of this family show similarity to BAL (B-aggressive lymphoma) proteins, which contain one to three macro domains. Most BAL family macro domains belong to this family except for the most N-terminal domain in multiple-domain containing proteins. Most BAL proteins also contain a C-termin Probab=22.61 E-value=50 Score=13.17 Aligned_cols=108 Identities=6% Similarity=-0.027 Sum_probs=54.3 Q ss_pred CEEEEEEECCCCCCCCEEEEEECCCCCCCCC-CC---CCCCHHHHHHHHHCCCCCCCCCCEEEEECCCCCCCCEEEEEEC Q ss_conf 9069995077877677169996268663222-21---2388889999998089983678789984378887445999944 Q gi|254780392|r 1 MDIKFSFAKNPSIKQEGLAILLKTSFSDVAG-LS---LIGSPSIVMRAVSVKNFIGESKSHLNILAPVDCVWDRLVVAGI 76 (494) Q Consensus 1 M~mkis~~~~~~~~~d~lvV~~~~~~~~~~~-~~---~~~~~~~L~~~i~~~~F~GK~Ge~l~l~~~~g~~~~rViLVGL 76 (494) |++++--.+....+.|++|-..+..+..... .. ...-...+.+.+....-.-..|++. +....+.+.++|+.+-. T Consensus 1 i~i~v~~GDIt~~~~DaIVNsan~~l~~~~ggv~~aI~~aaG~~l~~e~~~~~~~~~~g~v~-~T~gg~L~~k~ViH~v~ 79 (137) T cd02903 1 LTLQVAKGDIEDETTDVIVNSVNPDLFLLKGGVSKAILRKAGPELQKELDKAKLGQTVGSVI-VTKGGNLPCKYVYHVVL 79 (137) T ss_pred CEEEEEECCCCCCCCCEEEECCCCCCCCCCCHHHHHHHHHHCHHHHHHHHHHCCCCCCCCEE-ECCCCCCCCCEEEEECC T ss_conf 98999947566176899992987554788887999999946889999999964899899489-86356888889999727 Q ss_pred CCHHHCCHH-HHHHHHHHHHHHHHCCCEEEEECC Q ss_conf 881558999-999999999985316974899417 Q gi|254780392|r 77 GDPRDANFS-WLKAGGSIASYIEEDKNIEIFVDV 109 (494) Q Consensus 77 G~~~~~~~~-~~ra~a~~~~~l~~~~~~~v~i~l 109 (494) ..-.....+ .+++..+..+.....+.+++.+.. T Consensus 80 p~~~~~~~~~l~~~~~~~L~~A~~~~~~SIAfPa 113 (137) T cd02903 80 PNWSNGALKILKDIVSECLEKCEELSYTSISFPA 113 (137) T ss_pred CCCCCCHHHHHHHHHHHHHHHHHHCCCCEEEEEC T ss_conf 8788756999999999999999986997899905 No 114 >TIGR00918 2A060602 transmembrane receptor Patched; InterPro: IPR004766 Patched (Ptc) is a Drosophila melanogaster (Fruit fly) membrane protein that plays a critical role in patterning embryonic and imaginal tissues. It constitutively inactivates the transcription of genes wingless, decapentaplegic, and patched itself. The secreted protein, hedgehog (Hh) induces transcription of genes by opposing the patched signaling pathway . In humans, mutations affecting patched (Ptc) or smoothened (Smo) homologues that trigger ligand-independent activity of the Hedgehog (Hh) signalling pathway are associated with tumours such as basal cell carcinoma (BCC) and medulloblastoma . Patched is associated with a complex belonging to the hedgehog pathway. Members of the complex include cubitus interruptus, sex-lethal, patched and smoothened. The complex is present at the plasma membrane and the association of patched changes depending on the activation state of the pathway . ; GO: 0008158 hedgehog receptor activity, 0016021 integral to membrane. Probab=22.56 E-value=50 Score=13.17 Aligned_cols=181 Identities=19% Similarity=0.211 Sum_probs=108.1 Q ss_pred HHHHHHHHHHHHH-C----CCCCEEECHHHHHHHHHHH-HCCCCCHHHHCCCCCCHH--HHHHHHHHHCC-C-----C-C Q ss_conf 9999999986420-2----1254030299999999986-113321000011100013--45898874202-5-----7-7 Q gi|254780392|r 172 QSVVKGVNLARDI-I----NEPANVLGTDEFCKQVRKL-ESLGVAVEILDKEAMHKL--GMNALLAVAQG-S-----S-R 236 (494) Q Consensus 172 ~aiaegv~lARDL-v----N~PaN~ltP~~lA~~a~~l-~~~gl~v~Vld~~~L~kl--GMg~llaVgqG-S-----~-~ 236 (494) +.|+|--.++..| + ++-.|+|||+.|-.+...+ +-..++|..+|++|-.+. .|.+-+..=.+ - + - T Consensus 105 ~kiGEea~~~~QllIQT~~~~g~nvLt~eAL~~HL~~a~~As~V~V~lY~~~W~L~~~Cyksg~p~~E~~y~ieqile~L 184 (1215) T TIGR00918 105 KKIGEEAMFTPQLLIQTPHQEGANVLTPEALLQHLDSALKASRVKVYLYNREWDLEDLCYKSGEPLTEGGYYIEQILEKL 184 (1215) T ss_pred HHCCHHHHCCCCCEEECCCCCCCCCCCHHHHHHHHHHHHHCCEEEEEEECCCCCHHHHCCCCCCCCCCHHHHHHHHHHHH T ss_conf 52052342173201026788878535189999999998743744689746854657650488864202036899999852 Q ss_pred CCEEEEE----ECCCCCC--CCCCEEEEECCEEECCCCCCCCCCCCCHHHHHHHHH---HHHHHHHHHH--HHHCCCC-C Q ss_conf 8506886----2057886--565126630211200663100333320023221123---5689999999--9870878-7 Q gi|254780392|r 237 PPYLAVM----KWEGGDS--EEQPLAFIGKGVVFDTGGISIKPSHGMEEMKGDLGG---AAAVTGLLHV--LAERKAK-I 304 (494) Q Consensus 237 ~Prli~l----~Y~g~~~--~~~piaLVGKGvTFDtGGislKp~~~M~~Mk~DM~G---aAav~g~~~a--ia~l~~~-v 304 (494) =|++|+= =|.|.+= .+-.+.|-||- =|===+|+-|..-|++||.=|.- +=-+=+.=+. =|+-+-- + T Consensus 185 ~PC~iiTPLDCFWEGakL~gPeg~~ylpG~n--p~l~WtnldP~~~l~elk~~~s~ekisfd~e~~ee~l~kA~vG~gYm 262 (1215) T TIGR00918 185 FPCLIITPLDCFWEGAKLQGPEGTAYLPGKN--PPLQWTNLDPASLLEELKQLASQEKISFDVESWEELLKKAAVGQGYM 262 (1215) T ss_pred CCCEEECCCCCCCCCCCCCCCCCCEEECCCC--CCCCEEECCHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCCC T ss_conf 8852645775310012033886216706888--86412107989999996333222000001143899987741054213 Q ss_pred CEEEEEEECC----CCCCCCCCCHHHHHCCCCCC---------------EEEEECCCCCCEEEEEHHHH Q ss_conf 2899864102----66899974707763024895---------------88863478664036523678 Q gi|254780392|r 305 NAIGVLALVE----NMPGSSAQRPGDIVRSMSGQ---------------TIEVINTDAEGRLILADALW 354 (494) Q Consensus 305 ~v~~~~~~~E----N~~~~~a~~pgdvi~~~~G~---------------tvei~nTDAEGRlvLaD~l~ 354 (494) +==|+=|.== -.|.-++++|=||=.-+||= -|==..-|-.|.|+=|.||= T Consensus 263 ~rPCLnP~dp~CP~~APNknst~p~dVa~~LsGGCyG~s~K~MHWqEELivGG~~rn~sG~L~~AqALQ 331 (1215) T TIGR00918 263 ERPCLNPADPDCPDTAPNKNSTQPPDVAAVLSGGCYGLSAKYMHWQEELIVGGTKRNRSGKLLKAQALQ 331 (1215) T ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHCCCHHHHCCCCCCCHHHHHCCCCEECCCCCCCHHHHHH T ss_conf 878886775643478878667886441001046513331335777045441773057877510368887 No 115 >TIGR03320 ygeY M20/DapE family protein YgeY. Members of this protein family, including the YgeY protein of Escherichia coli, typically are found in extended genomic regions associated with purine catabolism. Homologs include peptidases and deacylases of the M20/M25 /M40 and DapE/ArgE families. The function is unknown. Probab=22.20 E-value=50 Score=13.12 Aligned_cols=34 Identities=18% Similarity=0.453 Sum_probs=16.1 Q ss_pred HHHHHHHHHCCCCCEEECHHHHHHHHHH-HHCCCC Q ss_conf 9999864202125403029999999998-611332 Q gi|254780392|r 176 KGVNLARDIINEPANVLGTDEFCKQVRK-LESLGV 209 (494) Q Consensus 176 egv~lARDLvN~PaN~ltP~~lA~~a~~-l~~~gl 209 (494) +-+.+.++||..|+---.-...++..++ |++.|+ T Consensus 14 e~v~ll~~LV~ipS~sg~e~~~a~~l~~~l~~~G~ 48 (395) T TIGR03320 14 DMIRFLRDLVAIPSESGDEKRVAERIKEEMEKLGF 48 (395) T ss_pred HHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHCCC T ss_conf 89999999828999397889999999999987899 No 116 >pfam11330 DUF3132 Protein of unknown function (DUF3132). This viral family of proteins are 55kDa. No function is currently known. Probab=22.19 E-value=50 Score=13.12 Aligned_cols=60 Identities=18% Similarity=0.314 Sum_probs=44.5 Q ss_pred CCCCCCEEEEEEE-CCCCCCCCCCCHHHHHCCCCCCEEEEECCCCCCEEEEEHHHHHHHHH Q ss_conf 0878728998641-02668999747077630248958886347866403652367887520 Q gi|254780392|r 300 RKAKINAIGVLAL-VENMPGSSAQRPGDIVRSMSGQTIEVINTDAEGRLILADALWYCRTH 359 (494) Q Consensus 300 l~~~v~v~~~~~~-~EN~~~~~a~~pgdvi~~~~G~tvei~nTDAEGRlvLaD~l~ya~~~ 359 (494) ..+.-|+..+.-+ -|..++|.-+--+||+--..|.-|-|++.--|||+-.-|-+|--..+ T Consensus 53 qslgdnilsytllkeeghidgmrtagddvllekdgevvmildsrdegrmwikddvwaevte 113 (124) T pfam11330 53 QSLGDNILSYTLLKEEGHIDGMRTAGDDVLLEKDGEVVMILDSRDEGRMWIKDDVWAEVTE 113 (124) T ss_pred CCCCCHHHHHHHHHHCCCCCCCCCCCCCEEEECCCCEEEEEECCCCCCEEEECHHHHHHHH T ss_conf 4334001321022341563440026870688038829999824655745662016888876 No 117 >COG1949 Orn Oligoribonuclease (3'-5' exoribonuclease) [RNA processing and modification] Probab=22.16 E-value=51 Score=13.11 Aligned_cols=61 Identities=18% Similarity=0.029 Sum_probs=40.8 Q ss_pred EEEEEHHHHHHHHHCCCCEEEEECCC-CHHHHHHCCCCCEEEEECCHHHHHHHHHHHHHHCCC Q ss_conf 03652367887520796135521112-002354317760578708989999999989985697 Q gi|254780392|r 346 RLILADALWYCRTHYNPHLMIDLATL-TGAMVVSLGNIYAGLFANNDVLAEQLLSSGLSTGEL 407 (494) Q Consensus 346 RlvLaD~l~ya~~~~~p~~iid~ATL-TGa~~~alG~~~~g~~~n~~~~~~~~~~a~~~~ge~ 407 (494) +||--|.=--.. ..+-+.||.+||+ |-+-..-|+...--++.-+++....+-+--.+++.+ T Consensus 7 nLiWIDlEMTGL-d~~~drIIEiA~iVTD~~Lnilaegp~~~Ihq~~e~L~~Mdew~~~~H~~ 68 (184) T COG1949 7 NLIWIDLEMTGL-DPERDRIIEIATIVTDANLNILAEGPVIAIHQSDEQLAKMDEWNTETHGR 68 (184) T ss_pred CEEEEEEEECCC-CCCCCEEEEEEEEEECCCCCCCCCCCEEEEECCHHHHHHHHHHHHHCCCC T ss_conf 628996320268-86764278888877547632103585578867999999889999971454 No 118 >pfam11161 DUF2944 Protein of unknown function (DUF2946). This family of proteins with unknown function appear to be restricted to Proteobacteria. Probab=22.10 E-value=51 Score=13.10 Aligned_cols=24 Identities=0% Similarity=-0.140 Sum_probs=12.2 Q ss_pred HHHHHHHHHCCCCCCCCCCEEEEE Q ss_conf 889999998089983678789984 Q gi|254780392|r 38 PSIVMRAVSVKNFIGESKSHLNIL 61 (494) Q Consensus 38 ~~~L~~~i~~~~F~GK~Ge~l~l~ 61 (494) ...|..+|..+--.-..|.-.+-. T Consensus 49 H~~L~~FI~RNY~~D~~G~WfFQN 72 (178) T pfam11161 49 HPALIEFIARNYECDAQGRWFFQN 72 (178) T ss_pred CHHHHHHHHHCCCCCCCCCEEEEC T ss_conf 489999998446767777678635 No 119 >PRK08088 4-aminobutyrate aminotransferase; Validated Probab=22.05 E-value=51 Score=13.10 Aligned_cols=62 Identities=21% Similarity=0.289 Sum_probs=35.3 Q ss_pred CEEECHHHHHHHHHHHH-CCCCCHHHHCCCCCCHHHHHHHHHHHCCCCCCCEEEEEECCCCCCCCCCEEEEECCEEECCC Q ss_conf 40302999999999861-13321000011100013458988742025778506886205788656512663021120066 Q gi|254780392|r 189 ANVLGTDEFCKQVRKLE-SLGVAVEILDKEAMHKLGMNALLAVAQGSSRPPYLAVMKWEGGDSEEQPLAFIGKGVVFDTG 267 (494) Q Consensus 189 aN~ltP~~lA~~a~~l~-~~gl~v~Vld~~~L~klGMg~llaVgqGS~~~Prli~l~Y~g~~~~~~piaLVGKGvTFDtG 267 (494) +-+..|..|-+.+++++ ++|+ .=|+||-+.-=-..|.+.+ .++-|- .+.|.-+|||++ | T Consensus 214 G~~~~~~~yl~~lr~lc~~~g~-llI~DEV~tGfGRtG~~fa-------------~~~~gv---~PDii~~gK~l~---g 273 (426) T PRK08088 214 GFYAASPAFMQRLRALCDQHGI-MLIADEVQTGAGRTGTFFA-------------MEQMGV---AADLTTFAKSIA---G 273 (426) T ss_pred CCEECCHHHHHHHHHHHHHCCC-EEECCCCCCCCCCCCCHHH-------------HHHCCC---CCCEEEHHHHHC---C T ss_conf 8663887999999999987690-5876330228776674135-------------664198---976201214403---8 Q ss_pred CCC Q ss_conf 310 Q gi|254780392|r 268 GIS 270 (494) Q Consensus 268 Gis 270 (494) |+. T Consensus 274 G~P 276 (426) T PRK08088 274 GFP 276 (426) T ss_pred CCC T ss_conf 887 No 120 >TIGR01736 FGAM_synth_II phosphoribosylformylglycinamidine synthase II; InterPro: IPR010074 Phosphoribosylformylglycinamidine synthase is a single, long polypeptide in most bacteria. Three proteins are required in Bacillus subtilis and many other species. This is the longest of the three and is designated PurL, phosphoribosylformylglycinamidine synthase II, or FGAM synthase II.; GO: 0004642 phosphoribosylformylglycinamidine synthase activity, 0006189 'de novo' IMP biosynthetic process, 0005737 cytoplasm. Probab=22.02 E-value=37 Score=14.04 Aligned_cols=18 Identities=0% Similarity=-0.151 Sum_probs=7.8 Q ss_pred HHHHHHHHHHCCCEEEEE Q ss_conf 999999853169748994 Q gi|254780392|r 90 GGSIASYIEEDKNIEIFV 107 (494) Q Consensus 90 ~a~~~~~l~~~~~~~v~i 107 (494) .+.++|-|...+..-+++ T Consensus 488 vaEa~RNLa~vGA~PLa~ 505 (763) T TIGR01736 488 VAEAYRNLAAVGAEPLAI 505 (763) T ss_pred HHHHHHCEEEECCEEEEE T ss_conf 774311256408754888 No 121 >TIGR00936 ahcY adenosylhomocysteinase; InterPro: IPR000043 S-adenosyl-L-homocysteine hydrolase (3.3.1.1 from EC) (AdoHcyase) is an enzyme of the activated methyl cycle, responsible for the reversible hydration of S-adenosyl-L-homocysteine into adenosine and homocysteine. AdoHcyase is an ubiquitous enzyme which binds and requires NAD^+ as a cofactor. AdoHcyase is a highly conserved protein of about 430 to 470 amino acids. The family contains a glycine-rich region in the central part of AdoHcyase; a region thought to be involved in NAD-binding.; GO: 0004013 adenosylhomocysteinase activity, 0006730 one-carbon compound metabolic process. Probab=21.87 E-value=51 Score=13.11 Aligned_cols=20 Identities=40% Similarity=0.647 Sum_probs=8.8 Q ss_pred HHHCCCCCCEEEEECCCCCCEEE Q ss_conf 76302489588863478664036 Q gi|254780392|r 326 DIVRSMSGQTIEVINTDAEGRLI 348 (494) Q Consensus 326 dvi~~~~G~tvei~nTDAEGRlv 348 (494) |-....+|+.|-++ |||||| T Consensus 324 ~~Y~L~dGr~i~lL---aeGRLV 343 (422) T TIGR00936 324 DEYKLKDGRRIYLL---AEGRLV 343 (422) T ss_pred CEEECCCCCEEEEE---ECCCEE T ss_conf 34463889789998---277332 No 122 >cd06325 PBP1_ABC_uncharacterized_transporter Type I periplasmic ligand-binding domain of uncharacterized ABC-type transport systems that are predicted to be involved in the uptake of amino acids, peptides, or inorganic ions. This group includes the type I periplasmic ligand-binding domain of uncharacterized ABC (ATPase Binding Cassette)-type transport systems that are predicted to be involved in the uptake of amino acids, peptides, or inorganic ions. This subgroup has high sequence similarity to members of the family of hydrophobic amino acid transporters (HAAT), such as leucine/isoleucine/valine binding protein (LIVBP); its ligand specificity has not been determined experimentally. Probab=21.86 E-value=51 Score=13.07 Aligned_cols=14 Identities=7% Similarity=0.444 Sum_probs=8.1 Q ss_pred CCCCCHHHHHHHCC Q ss_conf 67768678886179 Q gi|254780392|r 410 RMPMNEEYNKLIES 423 (494) Q Consensus 410 ~lPl~~~y~~~~~s 423 (494) ++|.+-.+...++. T Consensus 211 ~iPv~~~~~~~v~~ 224 (281) T cd06325 211 KIPVIASDDDMVKR 224 (281) T ss_pred CCCEEECCHHHHHC T ss_conf 99889367788616 No 123 >PRK08005 ribulose-phosphate 3-epimerase; Validated Probab=21.69 E-value=47 Score=13.31 Aligned_cols=15 Identities=7% Similarity=0.208 Sum_probs=6.8 Q ss_pred HHHHHCCCEEEEECC Q ss_conf 985316974899417 Q gi|254780392|r 95 SYIEEDKNIEIFVDV 109 (494) Q Consensus 95 ~~l~~~~~~~v~i~l 109 (494) +.+...+...+.++. T Consensus 20 ~~l~~~g~d~lHiDI 34 (210) T PRK08005 20 TALHDAPLGSLHLDI 34 (210) T ss_pred HHHHHCCCCEEEEEC T ss_conf 999977999899828 No 124 >KOG3857 consensus Probab=21.13 E-value=16 Score=16.60 Aligned_cols=116 Identities=20% Similarity=0.192 Sum_probs=47.9 Q ss_pred EEECCCCCCCC-----CCEEEEECCEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEEEECCCC Q ss_conf 86205788656-----5126630211200663100333320023221123568999999998708787289986410266 Q gi|254780392|r 242 VMKWEGGDSEE-----QPLAFIGKGVVFDTGGISIKPSHGMEEMKGDLGGAAAVTGLLHVLAERKAKINAIGVLALVENM 316 (494) Q Consensus 242 ~l~Y~g~~~~~-----~piaLVGKGvTFDtGGislKp~~~M~~Mk~DM~GaAav~g~~~aia~l~~~v~v~~~~~~~EN~ 316 (494) .=.|+|..+.. +-+-||+|=.. -+++-+ -=++-+.||+=|-...|.---=|.+ +-.-++. -- T Consensus 266 rp~yqgsNPIsD~wA~~al~li~kyl~-----rAv~~p-~d~eARt~M~~As~~aG~gFgNAgv------hlcHgls-yp 332 (465) T KOG3857 266 RPLYQGSNPISDAWALKALELINKYLV-----RAVKDP-KDEEARTDMHYASYLAGMGFGNAGV------HLCHGLS-YP 332 (465) T ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHHH-----HHHCCC-CCHHHHHHHHHHHHHHCCCCCCCCC------CCEEECC-CC T ss_conf 642136883388999999999999999-----874187-5477887778998861134576540------0000025-55 Q ss_pred CCC--CCCCHHHHHC----CCCCCEEEEECCCCCCEEEEEHHHHHHHHHCCCCEEEEECCCCHHHHHHCC Q ss_conf 899--9747077630----248958886347866403652367887520796135521112002354317 Q gi|254780392|r 317 PGS--SAQRPGDIVR----SMSGQTIEVINTDAEGRLILADALWYCRTHYNPHLMIDLATLTGAMVVSLG 380 (494) Q Consensus 317 ~~~--~a~~pgdvi~----~~~G~tvei~nTDAEGRlvLaD~l~ya~~~~~p~~iid~ATLTGa~~~alG 380 (494) +|| ++|||.|--. ---|..|-| ++-+..-....--|+.-...|-+-||...-+| T Consensus 333 isg~vk~~kakdy~~dh~liPHGlsv~v----------~~pavfeft~~~cP~rhl~aaq~LGa~~~h~~ 392 (465) T KOG3857 333 ISGQVKSYKAKDYYHDHNLIPHGLSVAV----------LLPAVFEFTAAACPDRHLEAAQRLGAIARHFG 392 (465) T ss_pred CCCCCCCCCCCCCCCCCCCCCCCHHHHH----------HHHHHHHHCCCCCCHHHHHHHHHHHHHHHCCC T ss_conf 5675232444455665566775133331----------20666631400185357899987505764144 No 125 >TIGR00470 sepS O-phosphoseryl-tRNA(Cys) synthetase; InterPro: IPR005246 The aminoacyl-tRNA synthetases (6.1.1. from EC) catalyse the attachment of an amino acid to its cognate transfer RNA molecule in a highly specific two-step reaction. These proteins differ widely in size and oligomeric state, and have limited sequence homology . The 20 aminoacyl-tRNA synthetases are divided into two classes, I and II. Class I aminoacyl-tRNA synthetases contain a characteristic Rossman fold catalytic domain and are mostly monomeric . Class II aminoacyl-tRNA synthetases share an anti-parallel beta-sheet fold flanked by alpha-helices , and are mostly dimeric or multimeric, containing at least three conserved regions , , . However, tRNA binding involves an alpha-helical structure that is conserved between class I and class II synthetases. In reactions catalysed by the class I aminoacyl-tRNA synthetases, the aminoacyl group is coupled to the 2'-hydroxyl of the tRNA, while, in class II reactions, the 3'-hydroxyl site is preferred. The synthetases specific for arginine, cysteine, glutamic acid, glutamine, isoleucine, leucine, methionine, tyrosine, tryptophan and valine belong to class I synthetases; these synthetases are further divided into three subclasses, a, b and c, according to sequence homology. The synthetases specific for alanine, asparagine, aspartic acid, glycine, histidine, lysine, phenylalanine, proline, serine, and threonine belong to class-II synthetases . All known phenylalanyl-tRNA synthetases, excepting a monomeric mitochondrial form, have an alpha 2 beta 2 heterotetrameric structure. This family of archaeal proteins resembles known phenylalanyl-tRNA synthetase alpha chains but is longer and of unknown function; more generally, it resembles class 2 aminoacyl tRNA ligases. . Probab=20.83 E-value=54 Score=12.93 Aligned_cols=50 Identities=16% Similarity=0.297 Sum_probs=28.5 Q ss_pred HHHHHHCCCEEEEECCCCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCC Q ss_conf 99853169748994177899998999999999971477754444364333232 Q gi|254780392|r 94 ASYIEEDKNIEIFVDVPEYPITKAEIRDLVLGFMLKTYTFDQYKTKKRESSSP 146 (494) Q Consensus 94 ~~~l~~~~~~~v~i~l~~~~~~~~~~~~~~eG~~L~~Y~Fd~yKskk~~~~~~ 146 (494) .++|-...+.+-.+. +...+-.+-.+++||+ |...-|.+||=++++++++ T Consensus 235 ~shLmtYhSASCVvV--deevsvDdGKaVAEgL-L~qfGF~~FrFrpDEKkSK 284 (558) T TIGR00470 235 ASHLMTYHSASCVVV--DEEVSVDDGKAVAEGL-LEQFGFEKFRFRPDEKKSK 284 (558) T ss_pred CCCCCCEEEEEEEEE--CCCCCCCCCHHHHHHH-HHHHCCCCCCCCCCCCCCC T ss_conf 310011345466785--7712436426889999-9870854475057646454 No 126 >pfam04393 DUF535 Protein of unknown function (DUF535). Family member Shigella flexneri VirK is a virulence protein required for the expression, or correct membrane localisation of IcsA (VirG) on the bacterial cell surface. This family also includes Pasteurella haemolytica lapB, which is thought to be membrane-associated. Probab=20.72 E-value=37 Score=14.05 Aligned_cols=69 Identities=17% Similarity=0.190 Sum_probs=43.6 Q ss_pred HHHHHHHHCCCCEEEEECCCCHHHHHHCCCC---CEEEEECCHHHHHHHHHHHHHHCCCCCCCCCCHHHH--HHHCC Q ss_conf 6788752079613552111200235431776---057870898999999998998569717677686788--86179 Q gi|254780392|r 352 ALWYCRTHYNPHLMIDLATLTGAMVVSLGNI---YAGLFANNDVLAEQLLSSGLSTGELLWRMPMNEEYN--KLIES 423 (494) Q Consensus 352 ~l~ya~~~~~p~~iid~ATLTGa~~~alG~~---~~g~~~n~~~~~~~~~~a~~~~ge~~w~lPl~~~y~--~~~~s 423 (494) ++..-...++-+.|+-|.. .|++..++-.. -.-+..|=|++|+++- |+...+.+|+||+..+=+ +.|+| T Consensus 190 ~l~~lA~~l~i~~i~aVsn-~~Hiy~~~ry~~~k~~~~~adYD~fW~e~g--G~~~~~~~~~LPl~~~RK~leeI~S 263 (287) T pfam04393 190 ALKLLAAALGVEQILAVSN-KGHIYRRLRYRKKKRKRILADYDEFWQELG--GELINRGFYALPLAIERKDLEDIAS 263 (287) T ss_pred HHHHHHHHCCCCEEEEECC-CCCCHHHHHHHHHHCCEEECCHHHHHHHCC--CEECCCCEEECCCCCCCCCHHHCCC T ss_conf 9999999839865999567-750012565544205718743799999768--9988899486888735599556674 No 127 >COG1692 Calcineurin-like phosphoesterase [General function prediction only] Probab=20.56 E-value=50 Score=13.13 Aligned_cols=45 Identities=13% Similarity=0.127 Sum_probs=17.1 Q ss_pred CCCCCEEECHHHHHHHHHHHHCCC---CCHHHHCCCCCCHHHHHHHHH Q ss_conf 212540302999999999861133---210000111000134589887 Q gi|254780392|r 185 INEPANVLGTDEFCKQVRKLESLG---VAVEILDKEAMHKLGMNALLA 229 (494) Q Consensus 185 vN~PaN~ltP~~lA~~a~~l~~~g---l~v~Vld~~~L~klGMg~lla 229 (494) +++|+..=+|=.=+++.-.-.+.+ +=|.-.-|.--||+-||-++- T Consensus 119 v~m~~~~d~PF~~~d~l~~~~~~~~~~iiVDFHAEtTSEK~a~g~yld 166 (266) T COG1692 119 VFMPPALDNPFKAADKLLDEIKLGTDLIIVDFHAETTSEKNAFGWYLD 166 (266) T ss_pred CCCCCCCCCHHHHHHHHHHHCCCCCCEEEEECCCCCHHHHHHHHEEEC T ss_conf 358600179789999999747468964999812652155520106886 No 128 >PTZ00125 ornithine aminotransferase; Provisional Probab=20.49 E-value=55 Score=12.88 Aligned_cols=96 Identities=26% Similarity=0.371 Sum_probs=51.9 Q ss_pred CEEECHHHHHHHHHHHH-CCCCCHHHHCCCCCCHHHHHHHHHHHCCCCCCCEEEEEECCCCCCCCCCEEEEECCEEECCC Q ss_conf 40302999999999861-13321000011100013458988742025778506886205788656512663021120066 Q gi|254780392|r 189 ANVLGTDEFCKQVRKLE-SLGVAVEILDKEAMHKLGMNALLAVAQGSSRPPYLAVMKWEGGDSEEQPLAFIGKGVVFDTG 267 (494) Q Consensus 189 aN~ltP~~lA~~a~~l~-~~gl~v~Vld~~~L~klGMg~llaVgqGS~~~Prli~l~Y~g~~~~~~piaLVGKGvTFDtG 267 (494) +-+..|+.|-+.+++++ ++|+ +=|+||-+. || ||-. ++--.+|-|-. +.|..+|||+ +| T Consensus 211 G~~~p~~~yl~~lr~lc~~~gi-llI~DEV~t---Gf------GRtG----~~fa~e~~gv~---PDiv~~gKgl---~g 270 (415) T PTZ00125 211 GVIVPDEGYLKQVYSLCKKYNV-LLVADEVQT---GL------GRTG----KLLAVDHEGVK---PDIVLLGKAL---SG 270 (415) T ss_pred CCCCCCHHHHHHHHHHHHHHCC-EEECCCCEE---CC------CCCC----CCCCCCCCCCC---CCEEEECCCC---CC T ss_conf 8426999999999999998487-676452241---67------7567----62120178979---9821003144---68 Q ss_pred CC-C-------------CCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCC Q ss_conf 31-0-------------0333320023221123568999999998708787 Q gi|254780392|r 268 GI-S-------------IKPSHGMEEMKGDLGGAAAVTGLLHVLAERKAKI 304 (494) Q Consensus 268 Gi-s-------------lKp~~~M~~Mk~DM~GaAav~g~~~aia~l~~~v 304 (494) |+ - ++|+..-.++-.+--++|+.+.++..+-+-++.- T Consensus 271 G~~Plsav~~~~~i~~~~~~~~h~~T~~gnpla~Aaa~a~L~~i~~~~l~~ 321 (415) T PTZ00125 271 GHLPISAVLANDDVMLTIKPGEHGSTYGGNPLAAAVCKEALEVLSEEKLAE 321 (415) T ss_pred CCCCCEEEECCHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCHHH T ss_conf 877620353349999632689988899818999999999999998646999 No 129 >PRK01792 ribB 3,4-dihydroxy-2-butanone 4-phosphate synthase; Provisional Probab=20.42 E-value=55 Score=12.87 Aligned_cols=59 Identities=15% Similarity=0.370 Sum_probs=30.9 Q ss_pred HHCCCCCEEECHHHHHHHHHHHHCCCCCHHHHCCCCCCHHHHHHHHHHHCCCCCCCEEEEEE Q ss_conf 20212540302999999999861133210000111000134589887420257785068862 Q gi|254780392|r 183 DIINEPANVLGTDEFCKQVRKLESLGVAVEILDKEAMHKLGMNALLAVAQGSSRPPYLAVMK 244 (494) Q Consensus 183 DLvN~PaN~ltP~~lA~~a~~l~~~gl~v~Vld~~~L~klGMg~llaVgqGS~~~Prli~l~ 244 (494) ||+ .|+..+||+.++-..+. -.|+=|-.++++.++++|.+-+...+....+.+..+-+. T Consensus 42 Dlv-~aAe~~T~e~i~fm~~~--~~GliCva~~~~~~~~L~Lp~mv~~n~~~~~taFtvsvd 100 (214) T PRK01792 42 DLI-FPAETITPEQMAKLIRY--GSGIVCLCITDELCQQLDLPPMVQHNTSVNKTAFTVTIE 100 (214) T ss_pred CEE-EEHHHCCHHHHHHHHHH--CCCCEEEECCHHHHHHCCCCCCCCCCCCCCCCCEEEEEE T ss_conf 679-68554999999999995--897789736999997779902146677888887799996 No 130 >COG3480 SdrC Predicted secreted protein containing a PDZ domain [Signal transduction mechanisms] Probab=20.29 E-value=55 Score=12.85 Aligned_cols=165 Identities=18% Similarity=0.244 Sum_probs=94.6 Q ss_pred HHHHHHHHHHHHHHHHCCCCCCEEE----EEEECCCCCCCCCCCHHHHHCCCCCCEEEEECCCCCCEEEEEHHHHHHHHH Q ss_conf 1235689999999987087872899----864102668999747077630248958886347866403652367887520 Q gi|254780392|r 284 LGGAAAVTGLLHVLAERKAKINAIG----VLALVENMPGSSAQRPGDIVRSMSGQTIEVINTDAEGRLILADALWYCRTH 359 (494) Q Consensus 284 M~GaAav~g~~~aia~l~~~v~v~~----~~~~~EN~~~~~a~~pgdvi~~~~G~tvei~nTDAEGRlvLaD~l~ya~~~ 359 (494) -+=.+|+..+ .-.++-|+++.- ++-..||||.-.-..+||-|++-+|+.++- ..|.+.|..+ T Consensus 108 ~Sq~~A~y~A---~~~a~~pv~~~y~gvyv~~v~~~~~~~gkl~~gD~i~avdg~~f~s----------~~e~i~~v~~- 173 (342) T COG3480 108 TSQNAAIYAA---YKYAGKPVEVTYAGVYVLSVIDNSPFKGKLEAGDTIIAVDGEPFTS----------SDELIDYVSS- 173 (342) T ss_pred HHHHHHHHHH---HHHCCCCEEEEEEEEEEEECCCCCCHHCEECCCCEEEEECCEECCC----------HHHHHHHHHC- T ss_conf 5156799999---9974995589983279997147863102232687688558944578----------8999999854- Q ss_pred CCCC--EEEEECCCCHHHHHHCCCCCEEEEECCHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHCCCCCCEECCCCCCHH Q ss_conf 7961--35521112002354317760578708989999999989985697176776867888617986417018887467 Q gi|254780392|r 360 YNPH--LMIDLATLTGAMVVSLGNIYAGLFANNDVLAEQLLSSGLSTGELLWRMPMNEEYNKLIESKFADMKNIGGRGAG 437 (494) Q Consensus 360 ~~p~--~iid~ATLTGa~~~alG~~~~g~~~n~~~~~~~~~~a~~~~ge~~w~lPl~~~y~~~~~s~~AD~~n~~~~~~g 437 (494) -+|- .-|++.---+.--.+-+. +..+|+ --...+-.+-....++|--|- +.. +..|+||..+| T Consensus 174 ~k~Gd~VtI~~~r~~~~~~~~~~t----l~~~~~-~g~~giGIsl~d~~~v~~~~~-------V~~---~~~~IGGPSAG 238 (342) T COG3480 174 KKPGDEVTIDYERHNETPEIVTIT----LIKNDD-NGKAGIGISLVDAPEVWAPPD-------VDF---NTENIGGPSAG 238 (342) T ss_pred CCCCCEEEEEEEECCCCCCEEEEE----EEEECC-CCCCEEEEEEECCCCCCCCCC-------EEE---ECCCCCCCCHH T ss_conf 688976999999516987268999----996046-886412158634765456872-------675---12447997543 Q ss_pred HHHHHHHHHHHC-CCCCEEEEEEEECCCCCCCCCCCCCCCCC Q ss_conf 999999998723-88676999400001167888888998654 Q gi|254780392|r 438 SIVAAQFLEKFV-QDASWAHIDIAGVATGGYPKEINQSWASG 478 (494) Q Consensus 438 ~~~aa~fl~~f~-~~~~w~HlDiag~a~~~~~~~~~~~g~tg 478 (494) -...=....++. +|..| -..||||..-+.+-...|=|+-. T Consensus 239 LMFSL~Iy~qlt~~DL~~-g~~IAGTGTI~~DG~VG~IGGI~ 279 (342) T COG3480 239 LMFSLAIYDQLTKGDLTG-GRFIAGTGTIEVDGKVGPIGGID 279 (342) T ss_pred HEEEHHHHHHCCCCCCCC-CEEEECCEEECCCCCCCCCCCHH T ss_conf 335298886405311358-66984111334688335745476 No 131 >KOG3425 consensus Probab=20.22 E-value=55 Score=12.84 Aligned_cols=55 Identities=15% Similarity=0.039 Sum_probs=22.1 Q ss_pred CCCHHHHHHHHH-----HHHHCCCCCEEEEEEEEC-CCCCCCCCCCCCCCCCCCHHHHHHH Q ss_conf 874679999999-----987238867699940000-1167888888998654358999999 Q gi|254780392|r 433 GRGAGSIVAAQF-----LEKFVQDASWAHIDIAGV-ATGGYPKEINQSWASGFGVRLLDEF 487 (494) Q Consensus 433 ~~~~g~~~aa~f-----l~~f~~~~~w~HlDiag~-a~~~~~~~~~~~g~tg~~vr~l~~~ 487 (494) ..|.--|.+|.= |+++-++...+|.|.--- -|-+.+.++...-.+=-+|.||++| T Consensus 42 qSWCPdCV~AEPvi~~alk~ap~~~~~v~v~VG~rp~Wk~p~n~FR~d~~~lt~vPTLlrw 102 (128) T KOG3425 42 QSWCPDCVAAEPVINEALKHAPEDVHFVHVYVGNRPYWKDPANPFRKDPGILTAVPTLLRW 102 (128) T ss_pred CCCCCHHHHHHHHHHHHHHHCCCCEEEEEEEECCCCCCCCCCCCCCCCCCCEEECCEEEEE T ss_conf 8378218776389999997387761799997068774148998543498741403343687 No 132 >pfam09182 PuR_N Bacterial purine repressor, N-terminal. The N-terminal domain of the bacterial purine repressor PuR is a winged-helix domain, a subdivision of the HTH structural family. It consists of a canonical arrangement of secondary structures: a1-b1-a2-T-a3-b2-W-b3, where a2-T-a3 is the HTH motif, a3 is the recognition helix, and W is the wing. The domain allows for recognition of a conserved CGAA sequence in the centre of a DNA PurBox, resulting in binding to the major groove of DNA. Probab=20.16 E-value=53 Score=12.99 Aligned_cols=53 Identities=17% Similarity=0.358 Sum_probs=35.4 Q ss_pred HHHHHHHCCCCCEEECHHHHHHHHHHHHCC-CCCHHHHCCCCCCHHHHHHHHHHH Q ss_conf 998642021254030299999999986113-321000011100013458988742 Q gi|254780392|r 178 VNLARDIINEPANVLGTDEFCKQVRKLESL-GVAVEILDKEAMHKLGMNALLAVA 231 (494) Q Consensus 178 v~lARDLvN~PaN~ltP~~lA~~a~~l~~~-gl~v~Vld~~~L~klGMg~llaVg 231 (494) +.+++-|.+-|--..+=.+|+++-...|.. .=++. +-++-+++.|||-+--|. T Consensus 7 v~mt~~Ll~~P~~li~L~~Fae~~~~AKSsISEDl~-iik~~~~~~g~G~leTi~ 60 (70) T pfam09182 7 VAMTKYLLENPNKLIPLTYFAERFGSAKSSISEDLV-IIKETFEKFGLGKLETIP 60 (70) T ss_pred HHHHHHHHHCCCCEECHHHHHHHHCCCCCHHHHHHH-HHHHHHHHCCCCEEEEEC T ss_conf 999999982998557499999986601020375799-999999873986299845 Done!