Query gi|254780425|ref|YP_003064838.1| apolipoprotein N-acyltransferase [Candidatus Liberibacter asiaticus str. psy62] Match_columns 518 No_of_seqs 165 out of 2706 Neff 8.6 Searched_HMMs 39220 Date Sun May 29 16:22:48 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780425.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 PRK00302 lnt apolipoprotein N- 100.0 0 0 853.0 41.1 495 6-517 2-502 (509) 2 COG0815 Lnt Apolipoprotein N-a 100.0 0 0 761.9 37.2 500 1-517 2-508 (518) 3 TIGR00546 lnt apolipoprotein N 100.0 0 0 630.5 20.1 393 77-471 1-441 (441) 4 PRK12291 apolipoprotein N-acyl 100.0 0 0 600.2 30.4 396 14-469 16-415 (420) 5 cd07571 ALP_N-acyl_transferase 100.0 0 0 542.2 20.5 265 243-509 1-269 (270) 6 cd07565 aliphatic_amidase alip 100.0 2.6E-29 6.6E-34 209.5 16.2 226 243-492 1-250 (291) 7 cd07197 nitrilase Nitrilase su 100.0 8E-29 2E-33 206.3 14.9 213 245-490 1-237 (253) 8 cd07581 nitrilase_3 Uncharacte 100.0 2.2E-28 5.7E-33 203.4 16.1 218 245-490 1-239 (255) 9 cd07583 nitrilase_5 Uncharacte 100.0 2.8E-28 7.1E-33 202.8 15.6 216 244-491 1-238 (253) 10 cd07572 nit Nit1, Nit 2, and r 100.0 7.3E-28 1.9E-32 200.0 15.2 225 244-491 1-250 (265) 11 cd07573 CPA N-carbamoylputresc 100.0 1.7E-27 4.4E-32 197.6 15.4 226 243-493 1-262 (284) 12 PRK10438 hypothetical protein; 100.0 6.9E-27 1.7E-31 193.7 17.0 217 241-491 2-236 (256) 13 cd07582 nitrilase_4 Uncharacte 100.0 4.4E-27 1.1E-31 195.0 15.6 238 248-498 6-282 (294) 14 cd07575 Xc-1258_like Xanthomon 100.0 5.3E-27 1.3E-31 194.4 15.1 216 243-493 1-236 (252) 15 cd07568 ML_beta-AS_like mammal 100.0 8.1E-27 2.1E-31 193.2 15.9 229 242-492 3-263 (287) 16 cd07564 nitrilases_CHs Nitrila 99.9 1.9E-26 4.9E-31 190.8 16.3 219 243-490 1-270 (297) 17 cd07576 R-amidase_like Pseudom 99.9 1.9E-26 4.9E-31 190.8 15.4 215 244-491 1-237 (254) 18 cd07569 DCase N-carbamyl-D-ami 99.9 1.9E-26 4.7E-31 190.9 14.8 238 241-489 2-275 (302) 19 cd07586 nitrilase_8 Uncharacte 99.9 5.6E-26 1.4E-30 187.7 17.2 221 244-491 1-246 (269) 20 cd07584 nitrilase_6 Uncharacte 99.9 3.5E-26 9E-31 189.0 16.0 219 244-492 1-243 (258) 21 TIGR03381 agmatine_aguB N-carb 99.9 5E-26 1.3E-30 188.1 15.6 225 243-492 1-258 (279) 22 cd07578 nitrilase_1_R1 First n 99.9 5.5E-26 1.4E-30 187.8 15.3 214 243-490 1-238 (258) 23 cd07574 nitrilase_Rim1_like Un 99.9 1.4E-25 3.5E-30 185.2 14.7 221 243-492 1-261 (280) 24 cd07585 nitrilase_7 Uncharacte 99.9 2.1E-25 5.4E-30 184.0 15.3 218 244-491 1-240 (261) 25 cd07580 nitrilase_2 Uncharacte 99.9 5.5E-25 1.4E-29 181.3 16.3 216 244-490 1-246 (268) 26 cd07570 GAT_Gln-NAD-synth Glut 99.9 6.4E-25 1.6E-29 180.9 15.8 217 244-488 1-241 (261) 27 cd07577 Ph0642_like Pyrococcus 99.9 7.2E-25 1.8E-29 180.5 15.7 209 244-490 1-237 (259) 28 cd07579 nitrilase_1_R2 Second 99.9 6.6E-24 1.7E-28 174.2 14.2 215 244-489 1-250 (279) 29 COG0388 Predicted amidohydrola 99.9 2.8E-23 7.1E-28 170.2 15.0 221 242-490 2-247 (274) 30 PRK13825 conjugal transfer pro 99.9 2.4E-18 6.1E-23 138.0 26.0 183 261-459 203-389 (389) 31 pfam00795 CN_hydrolase Carbon- 99.9 1.6E-21 4E-26 158.7 7.6 159 244-426 1-172 (172) 32 KOG0807 consensus 99.8 2.9E-20 7.3E-25 150.5 9.3 234 243-497 16-275 (295) 33 PRK13287 amiF formamidase; Pro 99.8 5.1E-19 1.3E-23 142.4 15.6 229 237-490 8-260 (333) 34 PRK13981 NAD synthetase; Provi 99.8 4.4E-18 1.1E-22 136.3 13.7 219 243-489 1-242 (543) 35 pfam07186 consensus 99.8 1.6E-15 4.1E-20 119.6 24.9 184 260-459 209-396 (396) 36 PRK13286 amiE acylamide amidoh 99.8 4.7E-17 1.2E-21 129.6 14.8 230 237-490 7-261 (343) 37 cd07587 ML_beta-AS mammalian-l 99.7 2.8E-16 7.2E-21 124.5 12.4 239 240-500 61-347 (363) 38 KOG0806 consensus 99.6 1.4E-15 3.7E-20 119.9 9.6 228 240-488 11-269 (298) 39 PRK02628 nadE NAD synthetase; 99.6 1.2E-13 3.1E-18 107.3 15.3 234 240-487 10-276 (678) 40 cd07567 biotinidase_like bioti 99.5 3.4E-13 8.6E-18 104.5 13.9 207 262-494 24-280 (299) 41 cd07566 ScNTA1_like Saccharomy 99.5 1.1E-13 2.8E-18 107.6 11.0 216 244-470 1-260 (295) 42 KOG0805 consensus 99.1 3.9E-09 9.9E-14 78.0 11.6 214 241-489 16-290 (337) 43 KOG0808 consensus 98.3 3.4E-06 8.6E-11 58.8 7.9 239 240-500 71-357 (387) 44 pfam09142 TruB_C tRNA Pseudour 84.2 1.8 4.6E-05 21.5 4.3 40 439-478 6-45 (56) 45 KOG2303 consensus 79.6 4 0.0001 19.3 5.9 37 333-370 97-133 (706) 46 TIGR02225 recomb_XerD tyrosine 66.5 2.1 5.5E-05 21.0 0.6 53 298-361 134-187 (305) 47 COG3859 Predicted membrane pro 59.6 11 0.00028 16.4 8.5 61 19-90 13-77 (185) 48 TIGR02075 pyrH_bact uridylate 56.8 12 0.00031 16.1 5.0 77 396-478 102-180 (236) 49 KOG1195 consensus 49.1 14 0.00036 15.7 2.4 23 430-452 441-464 (567) 50 KOG2882 consensus 44.6 19 0.00048 14.9 2.8 85 346-431 95-194 (306) 51 TIGR02918 TIGR02918 conserved 43.3 9.2 0.00023 16.9 0.7 105 334-457 186-311 (511) 52 TIGR02224 recomb_XerC tyrosine 43.0 9.2 0.00023 16.9 0.7 22 334-362 175-196 (313) 53 cd00058 FGF Acidic and basic f 40.8 17 0.00044 15.1 1.8 28 447-474 37-64 (123) 54 COG4974 XerD Site-specific rec 39.6 12 0.00032 16.1 0.9 14 349-362 175-188 (300) 55 PRK03619 phosphoribosylformylg 39.5 20 0.00052 14.6 2.0 26 457-482 175-200 (223) 56 TIGR03663 conserved hypothetic 37.6 24 0.00061 14.2 18.1 11 139-149 299-309 (439) 57 pfam01039 Carboxyl_trans Carbo 37.4 24 0.00061 14.2 4.6 19 450-468 445-463 (487) 58 TIGR02743 TraW type-F conjugat 37.2 17 0.00045 15.1 1.4 29 313-345 162-190 (217) 59 KOG1897 consensus 35.5 24 0.0006 14.2 1.8 30 441-470 675-705 (1096) 60 TIGR01478 STEVOR variant surfa 35.3 26 0.00066 14.0 2.2 15 239-253 27-41 (315) 61 pfam03419 Peptidase_U4 Sporula 34.1 27 0.00069 13.8 10.9 10 348-357 226-235 (291) 62 cd01740 GATase1_FGAR_AT Type 1 32.8 27 0.00069 13.8 1.8 28 457-484 191-218 (238) 63 KOG4812 consensus 32.5 29 0.00073 13.7 5.8 17 500-516 222-238 (262) 64 smart00442 FGF Acidic and basi 31.8 30 0.00075 13.6 3.0 24 450-473 44-67 (126) 65 pfam08981 consensus 31.5 30 0.00076 13.6 2.7 15 416-430 153-169 (181) 66 PRK01175 phosphoribosylformylg 31.0 30 0.00077 13.5 5.4 44 459-505 201-244 (255) 67 TIGR03074 PQQ_membr_DH membran 30.9 30 0.00078 13.5 8.2 18 460-477 641-659 (764) 68 PRK00029 hypothetical protein; 30.4 19 0.00048 14.9 0.6 13 438-450 433-445 (487) 69 TIGR02168 SMC_prok_B chromosom 29.8 27 0.00068 13.9 1.3 17 408-424 1144-1161(1191) 70 TIGR00027 mthyl_TIGR00027 meth 28.7 33 0.00085 13.3 1.7 12 416-427 199-210 (281) 71 PRK00784 cobyric acid synthase 28.0 34 0.00087 13.2 2.0 77 241-320 252-333 (492) 72 pfam10321 7TM_GPCR_Srt Serpent 27.9 34 0.00088 13.2 5.7 68 133-203 64-131 (313) 73 pfam01994 Trm56 tRNA ribose 2' 27.5 35 0.00089 13.1 2.5 30 399-428 84-115 (121) 74 cd03232 ABC_PDR_domain2 The pl 27.2 35 0.0009 13.1 3.0 66 279-346 124-189 (192) 75 TIGR02903 spore_lon_C ATP-depe 26.5 31 0.0008 13.4 1.2 77 397-480 485-571 (616) 76 COG0528 PyrH Uridylate kinase 26.2 37 0.00094 13.0 4.1 65 401-471 108-174 (238) 77 COG0138 PurH AICAR transformyl 25.9 37 0.00095 12.9 1.6 73 402-474 399-492 (515) 78 COG0021 TktA Transketolase [Ca 25.2 38 0.00098 12.9 1.6 55 415-470 450-522 (663) 79 TIGR00232 tktlase_bact transke 24.9 39 0.00099 12.8 1.8 49 408-457 444-501 (675) 80 PRK13551 agmatine deiminase; P 23.8 41 0.001 12.7 1.7 22 435-456 316-337 (360) 81 PRK08622 galactose-6-phosphate 22.8 43 0.0011 12.6 3.2 54 398-451 81-148 (171) 82 KOG2792 consensus 22.7 35 0.00088 13.1 0.8 18 336-353 245-262 (280) 83 pfam01716 MSP Manganese-stabil 21.9 40 0.001 12.7 1.0 34 335-370 92-125 (245) 84 cd03009 TryX_like_TryX_NRX Try 21.8 44 0.0011 12.4 1.2 13 462-474 103-115 (131) 85 KOG2698 consensus 21.3 44 0.0011 12.5 1.1 68 282-365 85-157 (247) 86 TIGR00441 gmhA phosphoheptose 20.4 48 0.0012 12.2 5.0 47 402-454 91-138 (186) No 1 >PRK00302 lnt apolipoprotein N-acyltransferase; Reviewed Probab=100.00 E-value=0 Score=852.97 Aligned_cols=495 Identities=27% Similarity=0.475 Sum_probs=429.4 Q ss_pred HHHHHHCCCHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHH Q ss_conf 77888459758999999999999752611417899999999999741555433210000014579999999999999987 Q gi|254780425|r 6 GKVMLLSGFRRCFIAILAGVIGSFSMPSSDLFLSSFVSFTLLIWLLDGISSNLGRMSAISRVGSSFFVGWLFGVGYFLAG 85 (518) Q Consensus 6 ~~~~~~~~~~~~lla~l~G~l~~ls~pp~~~~~l~~ia~~~l~~~l~~~~~~~~~~~~~~~~~~~f~~g~~~G~~~~~~~ 85 (518) .+....+||||+++++++|+++++||||+++|+++|++++|+++++++.+ .+++|++||+||++||+.+ T Consensus 2 ~~~~~~~~~~~~lla~~sG~l~~lsfpP~~~~~l~~i~l~~l~~~~~~~~-----------~~~~f~~g~~~G~~~f~~~ 70 (509) T PRK00302 2 MASLLARGWKRLLLALLSGALGTLAFAPFDLWPLALLSLAGLLLLLRGAS-----------PKQAALLGFLWGFGYFGSG 70 (509) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHCCC-----------HHHHHHHHHHHHHHHHHHH T ss_conf 67899886999999999999999735442427999999999999993887-----------6899999999999999974 Q ss_pred HHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHCCCCCCCCC-CCCCC Q ss_conf 68755566640330248999999999999999999999999865411110344334444465331244552446-30001 Q gi|254780425|r 86 LWWVREGIVDQIGSRFPFWGVVLFFVMISFLLAIFYGVATSLASLLWSTGMGRICVFACAVGFCEWLRSVLGIG-TWNAI 164 (518) Q Consensus 86 l~Wi~~sl~~~~~~~~~~~~~~~~~~~l~~~lal~~~~~~~l~~~~~~~~~~~~l~~~~~w~~~E~lR~~~~~g-PW~~l 164 (518) +|||.++++.+++ .++++.+++.+++++++++++++..++.+..+++....++.+|++|++.||+|+++++| ||+.+ T Consensus 71 l~Wi~~~~~~~g~--~~~~~~~~~~~~l~~~lsl~~~l~~~l~~~~~~~~~~~~l~~~~~w~~~E~lR~~~f~GFPW~~l 148 (509) T PRK00302 71 LYWIYVSILVFGG--MPWWLALFLVLLLAAYLALYPALFAALWRRLWPRGLRRALALPALWVLTEWLRGWVLTGFPWLAL 148 (509) T ss_pred HHHHHHHHHHHCC--CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCHHHH T ss_conf 1787434577177--25999999999999999999999999999980243689999999999999999850768826667 Q ss_pred CCCCCHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHCCCCH-HHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCC Q ss_conf 21110147888633320133210466787778543311000001-44445568999887543025545554566555553 Q gi|254780425|r 165 GYAAMPIPVMMQSVHWIGLFGMNALSVFCFASPALFGTRRDVNI-GIILSSTLLILHIAYGWWILTEDSKSSLKFEKISP 243 (518) Q Consensus 165 g~~~~~~~~~~q~a~~~Gv~Glsflvv~i~~~~~~~~~~~~~~~-~~~~~~~ll~~~~~~g~~~~~~~~~~~~~~~~~~~ 243 (518) |++|+++++++|+++++|++|+||++++++++++.+..+++.+. .......++++...+|.+++...+.. +.+++++ T Consensus 149 g~s~~~~~~l~q~a~~~G~~glsflvv~~~~~~~~~~~~~~~~~~~~~~~~~ll~~~~~~g~~~l~~~~~~--~~~~~~l 226 (509) T PRK00302 149 GYSQIPDGPLAQLAPIFGVYGLSFLLVLVNALLALALLKRRWRLLPLLLAVLLLALPAGYGAYRLRQIQWT--TPAPEPL 226 (509) T ss_pred HHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCC--CCCCCCE T ss_conf 77851775799999986198999999999999999998623105899999999999999999970356546--6778854 Q ss_pred CCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEECCCCCCCCHHHHHHHHHHHHHHHC-CCCEEEECCCC Q ss_conf 21001244203555433335789987543101334668618983121011000112346677656410-79824201100 Q gi|254780425|r 244 VIRIVQPGINPAIKEDREKILERYLSLTALPVSAGELEPVIIVWAYLPFPFSIVDQPSILKKIASVLK-INQLLIVGSMR 322 (518) Q Consensus 244 ~V~iVQpni~~~~k~~~~~~~~~~~~l~~~~~~~~~~~~dLiVWPEsa~p~~~~~~~~~~~~l~~~~~-~~~~li~G~~~ 322 (518) +|++||||+||++||++++..+...++.+++++. .+++|+|||||+++|++..++++..+.+.+.++ .|.++++|+.+ T Consensus 227 ~V~iVQpNi~q~~k~~~~~~~~~~~~~~~lt~~~-~~~~dlIvWPEta~p~~~~~~~~~~~~i~~~~~~~~~~ll~G~~~ 305 (509) T PRK00302 227 KVALVQGNIPQDLKWDPAGREATLEKYLDLSEPG-LGPADLIIWPETAIPFLEENPPPFLAAIDALLREKGSPLITGAVG 305 (509) T ss_pred EEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHCC-CCCCCEEEEECCCCCCCHHHCHHHHHHHHHHHHHCCCEEEEEEEE T ss_conf 9999927988100247345999999999976403-688868996376676641115899999999987569849998511 Q ss_pred CCCCCCCCCEEEEEECCCCCCCCCCCCCCEEEEEHHHHHCCCCHHHHHHHHCCCCCCCCCCCCCCCCEEECCCCEEECCC Q ss_conf 12345575121000036732122111256334221331001000012111001222210135877620001220010222 Q gi|254780425|r 323 KELVDGRSHFYKSIYIINSKGEILASSNAKHLVPFAEYLPYRNILKKLNLDFYMFPLDYSSSDISPLLGLSEKLRLYPLL 402 (518) Q Consensus 323 ~~~~~~~~~~yNs~~~~~~~g~~~~~Y~K~~LVPFGEyiP~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~I 402 (518) .++ +++.++|||+++++|||++.++|||+||||||||+||+++++++...+.....|+++|++.+.+...++.+++++| T Consensus 306 ~~~-~~~~~~yNs~~~~~~~g~~~~~YdK~~LVPFGEyiP~~~~~~~~~~~~~~~~~~~~~G~~~~~~~~~~~~~~~~~I 384 (509) T PRK00302 306 ADK-GGQGRYYNSIYVLGPNGEILGRYDKHHLVPFGEYVPLESLLRPLVPFFALPMGDFSRGPGVQPPLLAKGFKLAPLI 384 (509) T ss_pred CCC-CCCCCEEEEEEEECCCCCCCCEECCEEEECCCCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCEEEEEE T ss_conf 004-8876678899998799971016631110266322675988888999845664667789887765357883477788 Q ss_pred CHHHHHHHHHHCC--CCCCEEEEECCCCCCCCC-HHHHHHHHHHHHHHHCCCEEEEEECCCCCEEECCCCEEEEECCCCC Q ss_conf 1666558888512--699779996277557676-2899999999999972991999855970469989980877738998 Q gi|254780425|r 403 FSDALFHQDINNL--ENVSAILNIIDDSGFMGS-GTDHSFRYAQIQAVEIGLPLIRATNNGVSAFLDERGQIISSVYADR 479 (518) Q Consensus 403 CyE~~f~~~~r~~--~~a~~lvn~tNd~Wf~~s-~~~qH~~~a~~RAiE~~r~~vra~ntGiS~~id~~G~i~~~~~~~~ 479 (518) |||++||+.+|+. +||++|+|+|||+|||+| +|+||++|||+||||||||+|||||||+|++|||+|||++++|.++ T Consensus 385 CyE~~f~~~~r~~~~~ga~~lvn~TND~Wfg~s~~~~QHl~~ar~RAiE~~r~~vraantGiSa~Id~~G~i~~~~~~~~ 464 (509) T PRK00302 385 CYEIIFPELVRRYKAQGADLLLNITNDAWFGDSIGPYQHFQMARMRALELGRPLIRATNTGITAVIDPLGRIIAQLPLFT 464 (509) T ss_pred EEECCCHHHHHHHHHCCCCEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHCCCEEEECCCEEEEEECCCCCEEEECCCCC T ss_conf 86320679999764158858999157654579855899999999999854788799818646899999988887638997 Q ss_pred CEEEEEEECCCCCCCHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 27999996067687687865568999999999999972 Q gi|254780425|r 480 GASIDMHFQPKVRDSFRSSIQMRVFWIIEFILLILAVI 517 (518) Q Consensus 480 ~~~l~~~v~~~~~~T~y~~~g~~~~~~~~~~~~~l~~i 517 (518) ++++++++|.++++|+|+|||||+.++|+++++++.++ T Consensus 465 ~~~l~~~vp~~~~~T~y~r~Gd~~~~~~~~~~~~~~~~ 502 (509) T PRK00302 465 EGVLDGTVPPTTGLTPYARWGDWPLLLLLLLLLLLALL 502 (509) T ss_pred CEEEEEEECCCCCCCHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 36999993178998799998789999999999999999 No 2 >COG0815 Lnt Apolipoprotein N-acyltransferase [Cell envelope biogenesis, outer membrane] Probab=100.00 E-value=0 Score=761.86 Aligned_cols=500 Identities=25% Similarity=0.433 Sum_probs=417.1 Q ss_pred CHHHHHHHHHHCCCHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHH Q ss_conf 90467778884597589999999999997526114178999999999997415554332100000145799999999999 Q gi|254780425|r 1 MGNIAGKVMLLSGFRRCFIAILAGVIGSFSMPSSDLFLSSFVSFTLLIWLLDGISSNLGRMSAISRVGSSFFVGWLFGVG 80 (518) Q Consensus 1 ~~~~~~~~~~~~~~~~~lla~l~G~l~~ls~pp~~~~~l~~ia~~~l~~~l~~~~~~~~~~~~~~~~~~~f~~g~~~G~~ 80 (518) |-+++.+....++++|.++++++|++.+||+||+++|+++++++++|.+++.+...... + +++.||+||++ T Consensus 2 ~~~~~~~~~~~~~~~~~~~a~~~g~l~~La~~p~~~~~~~~~~l~~L~~l~~~~~~~~~--------~-~~~~g~~~gf~ 72 (518) T COG0815 2 RAGLARQWVLGRPARRLLLALVFGALFALAFPPPDWWWLAWFSLAPLLWLVRGAPTSWE--------G-LAKSGFLFGFG 72 (518) T ss_pred CCHHHHHHHHHCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHCCCCCCC--------H-HHHHHHHHHHH T ss_conf 53355679875341367999999999997289465799999999999999944776520--------1-46789999999 Q ss_pred HHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHCCCCCCCCC- Q ss_conf 9998768755566640330248999999999999999999999999865411110344334444465331244552446- Q gi|254780425|r 81 YFLAGLWWVREGIVDQIGSRFPFWGVVLFFVMISFLLAIFYGVATSLASLLWSTGMGRICVFACAVGFCEWLRSVLGIG- 159 (518) Q Consensus 81 ~~~~~l~Wi~~sl~~~~~~~~~~~~~~~~~~~l~~~lal~~~~~~~l~~~~~~~~~~~~l~~~~~w~~~E~lR~~~~~g- 159 (518) +++.++|||..+++++.+ .+.+...++.+++++++++++++...+.+..+..... .+++|++.||+|++.++| T Consensus 73 ~~~~~~~Wi~~~~~~~~~--~~~~~~~~~~~ll~~~lal~~~l~~~~~~~~~~~~~~----~~~~w~~~E~lR~~~~tGF 146 (518) T COG0815 73 FFLAGFYWLGTSLGVGLG--LLAVALPLLVLLLAAWLALFLLLVAVLTCRLWFALLV----VPSAWVAAEWLRGWSLTGF 146 (518) T ss_pred HHHHHHHHHHHHHHHCCH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHCCCCCC T ss_conf 999878898677764011--5789999999999999999999999999998754566----7789999999986778588 Q ss_pred CCCCCCCCCCHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCC Q ss_conf 30001211101478886333201332104667877785433110000014444556899988754302554555456655 Q gi|254780425|r 160 TWNAIGYAAMPIPVMMQSVHWIGLFGMNALSVFCFASPALFGTRRDVNIGIILSSTLLILHIAYGWWILTEDSKSSLKFE 239 (518) Q Consensus 160 PW~~lg~~~~~~~~~~q~a~~~Gv~Glsflvv~i~~~~~~~~~~~~~~~~~~~~~~ll~~~~~~g~~~~~~~~~~~~~~~ 239 (518) ||+.+||+|++..+++|.++++|++|+||++++++++++....++..+.......++.+..+.+|+.... .......+ T Consensus 147 pW~~~Gy~q~~~~~l~q~a~i~Gv~~lsflvv~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~--~~~~~~~~ 224 (518) T COG0815 147 PWLLLGYSQWSPSPLLQLASLGGVWLLSFLVVAVNALLASVLLKRATGGRLLLLGLLLVVLLAYGCALLG--LAGSVPVG 224 (518) T ss_pred CCCCCCHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHH--CCCCCCCC T ss_conf 7432253443686422403424799999999999999999997431121137999999999999987641--25557899 Q ss_pred CCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCC--CCCCCCEEEECCCCCCCCHHHHHHHHHHHHHH-HCCCCEE Q ss_conf 55532100124420355543333578998754310133--46686189831210110001123466776564-1079824 Q gi|254780425|r 240 KISPVIRIVQPGINPAIKEDREKILERYLSLTALPVSA--GELEPVIIVWAYLPFPFSIVDQPSILKKIASV-LKINQLL 316 (518) Q Consensus 240 ~~~~~V~iVQpni~~~~k~~~~~~~~~~~~l~~~~~~~--~~~~~dLiVWPEsa~p~~~~~~~~~~~~l~~~-~~~~~~l 316 (518) +...+|++||||+||+.|||.++..+....+.++...+ ..+++|+|||||+|+|+...+.+....++.+. .+.|+++ T Consensus 225 ~~~~~V~lvQ~nI~q~lk~~~~~~~~~~~~~~~~~~~~~~~~~~~dlVIwPEtA~p~~~~~~~~~~~~~~~~~~~~~~~~ 304 (518) T COG0815 225 EPTLTVALVQGNIPQDLKWDADALARLIAGYLEEEFLAAVDKQKPDLVVWPETALPFDLTRHPDALARLAEALQRVGAPL 304 (518) T ss_pred CCCEEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCEEECCCCCCCCCHHHCCHHHHHHHHHHHCCCCCE T ss_conf 87547999647885002678888988887654333100123789888983675555541113127889999986049968 Q ss_pred EECCCCCCCCCCCCCEEEEEECCCCCCCCCCCCCCEEEEEHHHHHCCCCHHHHHHHHCCCCCCCCCCCCCCCCEEECCCC Q ss_conf 20110012345575121000036732122111256334221331001000012111001222210135877620001220 Q gi|254780425|r 317 IVGSMRKELVDGRSHFYKSIYIINSKGEILASSNAKHLVPFAEYLPYRNILKKLNLDFYMFPLDYSSSDISPLLGLSEKL 396 (518) Q Consensus 317 i~G~~~~~~~~~~~~~yNs~~~~~~~g~~~~~Y~K~~LVPFGEyiP~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~ 396 (518) |+|.....+.+++..||||++.++++++..++|||+||||||||+||+++++++...+.....++.+|+..+++.+.++. T Consensus 305 iiG~~~~~~~~~~~~yyNSv~~~~~~~~~~~~ydK~~LVPFGEYiP~~~~l~~~~~~~~~~~~~f~~G~~~~v~~~~~~~ 384 (518) T COG0815 305 LIGTDVDGPAPGGGIYYNSVLVLDPGGEGVYRYDKVHLVPFGEYIPFPELLRPLYFFLNLPMSDFSRGPGPQVLLLAGGP 384 (518) T ss_pred EEEECCCCCCCCCCCEEEEEEEECCCCCCCCEECCEEEECCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCC T ss_conf 99603445688875312479996689875550043443077057560777667777634552345579988761138884 Q ss_pred EEECCCCHHHHHHHHHHCC--CCCCEEEEECCCCCCCCC-HHHHHHHHHHHHHHHCCCEEEEEECCCCCEEECCCCEEEE Q ss_conf 0102221666558888512--699779996277557676-2899999999999972991999855970469989980877 Q gi|254780425|r 397 RLYPLLFSDALFHQDINNL--ENVSAILNIIDDSGFMGS-GTDHSFRYAQIQAVEIGLPLIRATNNGVSAFLDERGQIIS 473 (518) Q Consensus 397 ~~~~~ICyE~~f~~~~r~~--~~a~~lvn~tNd~Wf~~s-~~~qH~~~a~~RAiE~~r~~vra~ntGiS~~id~~G~i~~ 473 (518) ++++.||||++||+.+|.. +++++|+|+|||+||++| +|+||++|+|+||||+|||++||||||+|++|||+||+++ T Consensus 385 ~~~~~ICYE~~F~~~~r~~~~qga~~Lin~SNDAWf~~s~~p~QH~~~a~~RAiE~grp~iRAtNtGiSavIdp~Gri~~ 464 (518) T COG0815 385 KIAPLICYEAIFPELVRASARQGAELLLNLSNDAWFGGSWGPYQHFQQARVRAVELGRPLVRATNTGISAVIDPRGRILA 464 (518) T ss_pred EEEEEEEEEHHCHHHHHHHHCCCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHCCCEEEEECCCCEEEEECCCCCEEE T ss_conf 45025664223339888754177628997556534599933889999999998860872899717840589999997664 Q ss_pred ECCCCCCEEEEEEECCCCCCCHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 73899827999996067687687865568999999999999972 Q gi|254780425|r 474 SVYADRGASIDMHFQPKVRDSFRSSIQMRVFWIIEFILLILAVI 517 (518) Q Consensus 474 ~~~~~~~~~l~~~v~~~~~~T~y~~~g~~~~~~~~~~~~~l~~i 517 (518) +++.+++++++.+++.+.+.|+|.|+|||+.+++..++++..+. T Consensus 465 ~l~~~~~~~l~~~v~~~~~~t~y~r~g~~~~~ll~~l~~~~~~~ 508 (518) T COG0815 465 QLPYFTRGVLDATVPLKTGLTPYARWGDWPLWLLVGLALAAALL 508 (518) T ss_pred ECCCCCCCEEEEECCCCCCCCEEEEECCHHHHHHHHHHHHHHHH T ss_conf 25787741023311345787706872670999999999999999 No 3 >TIGR00546 lnt apolipoprotein N-acyltransferase; InterPro: IPR004563 Apolipoprotein N-acyltransferase 2.3.1 from EC transfers the acyl group to lipoproteins and is involved in lipoprotein biosynthesis. It is an integral membrane protein.; GO: 0016410 N-acyltransferase activity, 0042158 lipoprotein biosynthetic process, 0016021 integral to membrane. Probab=100.00 E-value=0 Score=630.51 Aligned_cols=393 Identities=28% Similarity=0.472 Sum_probs=321.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHC---CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHCCC Q ss_conf 999999987687555666403---30248999999999999999999999999865411110344334444465331244 Q gi|254780425|r 77 FGVGYFLAGLWWVREGIVDQI---GSRFPFWGVVLFFVMISFLLAIFYGVATSLASLLWSTGMGRICVFACAVGFCEWLR 153 (518) Q Consensus 77 ~G~~~~~~~l~Wi~~sl~~~~---~~~~~~~~~~~~~~~l~~~lal~~~~~~~l~~~~~~~~~~~~l~~~~~w~~~E~lR 153 (518) ||+++|+.+++|+..++.+++ ....+++...+++++++.+++.++++..++.++++++. +.+.+|.+|++.||+| T Consensus 1 f~~~~~~~~~~W~~~~l~~~~~~L~~~~~~~~~~~~v~~l~~~~~~~~~L~~~~~~~~~~~~--~~~~~~~~W~~~E~~r 78 (441) T TIGR00546 1 FGFGFFLAGLFWLGIALSVNGVWLVGGFIAFVAGLLVVGLPALLALFPGLAAYLLRRLAPRR--ALLALALLWTLAEWLR 78 (441) T ss_pred CEEEHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC--CEEEHHHHHHHHHHHH T ss_conf 90001248999998645201035554667789999999999999999999987877631787--3375246999999998 Q ss_pred CCC-CC-CCCCCCCCCCCH-HHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHCCC---------CHHHHHHHHHHHHHH Q ss_conf 552-44-630001211101-478886333201332104667877785433110000---------014444556899988 Q gi|254780425|r 154 SVL-GI-GTWNAIGYAAMP-IPVMMQSVHWIGLFGMNALSVFCFASPALFGTRRDV---------NIGIILSSTLLILHI 221 (518) Q Consensus 154 ~~~-~~-gPW~~lg~~~~~-~~~~~q~a~~~Gv~Glsflvv~i~~~~~~~~~~~~~---------~~~~~~~~~ll~~~~ 221 (518) ++. ++ +||+.+||+|++ .++++|+||++|++++||+++++|++++.+..++.. +....+...+++... T Consensus 79 ~~~~~gGfpW~~~G~~~~~~~~~l~Q~a~~~Gv~~~Sfl~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~~~~~~ 158 (441) T TIGR00546 79 SFGFLGGFPWGLIGYAQSPSLLPLIQYASIFGVWGLSFLVVFLNALLALVLKKKLSKKEVNQAVFKKLLAIAVLVLLLAL 158 (441) T ss_pred HHHHCCCCCCHHHHHHHCCCCCCEEEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHH T ss_conf 75402457702211323035553045410232899999999999999987544453223023455568999999999961 Q ss_pred HHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCC-------CCCCEEEECCCCCCC Q ss_conf 7543025545554566555553210012442035554333357899875431013346-------686189831210110 Q gi|254780425|r 222 AYGWWILTEDSKSSLKFEKISPVIRIVQPGINPAIKEDREKILERYLSLTALPVSAGE-------LEPVIIVWAYLPFPF 294 (518) Q Consensus 222 ~~g~~~~~~~~~~~~~~~~~~~~V~iVQpni~~~~k~~~~~~~~~~~~l~~~~~~~~~-------~~~dLiVWPEsa~p~ 294 (518) .+...+..............+++|++||||+||++|++.+...+.+.+++++..+..+ +++|||||||||+|+ T Consensus 159 g~~~~~~~~~~~~~~~~~~~~~~ValvQ~~~p~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~dlvvwPEta~~~ 238 (441) T TIGR00546 159 GFLLYALKAYLKLATPAPEPTLKVALVQPNIPQDLKFDKEGLEAILELLTSLTKQAVEKVHAGLYEKPDLVVWPETALPF 238 (441) T ss_pred CCCEEEEEEEEECCCCCCCCEEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCEEEECCHHHCC T ss_conf 33000358885067887773268999868888001341101799999999998852256676777873278846153323 Q ss_pred CHHHHHHH----------HHHHHHHH-CCCCEEEECCCCCCCCC---CCCCEEEEEECCCCCC----CCCCCCCCEEEEE Q ss_conf 00112346----------67765641-07982420110012345---5751210000367321----2211125633422 Q gi|254780425|r 295 SIVDQPSI----------LKKIASVL-KINQLLIVGSMRKELVD---GRSHFYKSIYIINSKG----EILASSNAKHLVP 356 (518) Q Consensus 295 ~~~~~~~~----------~~~l~~~~-~~~~~li~G~~~~~~~~---~~~~~yNs~~~~~~~g----~~~~~Y~K~~LVP 356 (518) +..++++. .+.++... +.+.++++|..+.+..+ ++.++|||+++++++| ++.++|||+|||| T Consensus 239 ~l~~~~~~~~~~Q~~f~~~~~~~~~~~~~~~p~~~G~~~~~~~~~~~~~~~~yNs~~~~~~~~~a~~~~~~~Y~K~~LVP 318 (441) T TIGR00546 239 DLENSPDKLHESQSAFDLLKALKALVLSKGIPILIGAPDADPKGQKGEEYEYYNSVYLLDPGGKAVLEVVQRYDKVKLVP 318 (441) T ss_pred CCCCCCHHCCCCCCHHHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCCCCCCEEEEECCCCCCCCCHHHHCCCEEECC T ss_conf 66643211033673789999999998740984899630103788888743111258998488620035053538745435 Q ss_pred HHHHHC--CCCHHHHHHHHCCCC--CCCCCCCCCCCCEEECCC-CEEECCCCHHHHHHHHHHCC--CCCCEEEEECCCCC Q ss_conf 133100--100001211100122--221013587762000122-00102221666558888512--69977999627755 Q gi|254780425|r 357 FAEYLP--YRNILKKLNLDFYMF--PLDYSSSDISPLLGLSEK-LRLYPLLFSDALFHQDINNL--ENVSAILNIIDDSG 429 (518) Q Consensus 357 FGEyiP--~~~~~~~l~~~~~~~--~~~~~~g~~~~~~~~~~~-~~~~~~ICyE~~f~~~~r~~--~~a~~lvn~tNd~W 429 (518) ||||+| ++.+++++...+... ..++.+|...+++...++ ++++++||||++||+.+|+. +||++|||+||||| T Consensus 319 FGEy~P~~~~~~f~~~~~~~~~~~~~~~~~~G~~~~v~~~~~~~~~~~~~ICYE~~Fp~~~r~~~~~Ga~~L~~~tNDaW 398 (441) T TIGR00546 319 FGEYIPFGLKFVFKWLSKLFFLASEQESFARGNGPQVLKLPGGAVKLAPLICYESIFPELVRALARQGAELLVNLTNDAW 398 (441) T ss_pred CCCCCCCCCHHHHHHHHHHHHHCCCCCCHHCCCCCCCCCCCCCCEEEEEEEEHHHHHHHHHHHHCCCCCEEEEEEECCCC T ss_conf 62326776333444776543200222331127887653447760277753115541168898742588328997400564 Q ss_pred CCCC-HHHHHHHHHHHHHHHCCCEEEEEECCCCCEEECCCCEE Q ss_conf 7676-28999999999999729919998559704699899808 Q gi|254780425|r 430 FMGS-GTDHSFRYAQIQAVEIGLPLIRATNNGVSAFLDERGQI 471 (518) Q Consensus 430 f~~s-~~~qH~~~a~~RAiE~~r~~vra~ntGiS~~id~~G~i 471 (518) |++| +++||++|+|+||||+||++|||||||+|++|||+||+ T Consensus 399 f~~~~~~~QH~~~~~~RA~E~~~~~~Ra~NtG~sa~i~p~G~~ 441 (441) T TIGR00546 399 FGDSSGPWQHFALARFRAIENGRPLVRAANTGISAVIDPRGRI 441 (441) T ss_pred CCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCCEEEEECCCCC T ss_conf 3666167889999888876448548999606625898067889 No 4 >PRK12291 apolipoprotein N-acyltransferase; Reviewed Probab=100.00 E-value=0 Score=600.23 Aligned_cols=396 Identities=18% Similarity=0.231 Sum_probs=292.8 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCHHHH--HHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 75899999999999975261141789--9999999999741555433210000014579999999999999987687555 Q gi|254780425|r 14 FRRCFIAILAGVIGSFSMPSSDLFLS--SFVSFTLLIWLLDGISSNLGRMSAISRVGSSFFVGWLFGVGYFLAGLWWVRE 91 (518) Q Consensus 14 ~~~~lla~l~G~l~~ls~pp~~~~~l--~~ia~~~l~~~l~~~~~~~~~~~~~~~~~~~f~~g~~~G~~~~~~~l~Wi~~ 91 (518) .|.+++|+++|++..|+|++..+++. .++++..++.++.. ..+++|++||++|++|| |||+. T Consensus 16 ~~~~~~a~l~~~~~~l~~~~~~~~~~~~~~~~~~~l~~~l~~------------~kk~aF~~Gf~fG~~~F----yWI~~ 79 (420) T PRK12291 16 IKAFLIAILFSLFIYLSFFLNIISIFLSSLLALLGLYLLLKS------------PRNSAFWAGFFIGILWF----YWIGL 79 (420) T ss_pred HHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHC------------CHHHHHHHHHHHHHHHH----HHHHH T ss_conf 999999999988998762353138999999999999999516------------44589999999999999----99988 Q ss_pred HHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHH Q ss_conf 66640330248999999999999999999999999865411110344334444465331244552446300012111014 Q gi|254780425|r 92 GIVDQIGSRFPFWGVVLFFVMISFLLAIFYGVATSLASLLWSTGMGRICVFACAVGFCEWLRSVLGIGTWNAIGYAAMPI 171 (518) Q Consensus 92 sl~~~~~~~~~~~~~~~~~~~l~~~lal~~~~~~~l~~~~~~~~~~~~l~~~~~w~~~E~lR~~~~~gPW~~lg~~~~~~ 171 (518) |++.++ .. +++++..++++++++++ +.+..+... ....+..+...||+|.+ ||||+.+|++|++ T Consensus 80 Sl~~~g---~~-~lipl~ii~l~l~ygl~----f~l~~~~~~-----~~~~~~~~~~l~~i~~~--GFPWl~~g~~~i~- 143 (420) T PRK12291 80 SFRYYD---LT-YLIPLIIIFIGLVYGLL----FYLLLYLKN-----PILRALLLLGLSFIHPF--GFDWLNPESFFVY- 143 (420) T ss_pred HHHHCC---HH-HHHHHHHHHHHHHHHHH----HHHHHHHHH-----HHHHHHHHHHHHHHHCC--CCCCHHHHHHHHH- T ss_conf 897347---67-99999999999999999----999999973-----89999999999999757--9883417899876- Q ss_pred HHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 78886333201332104667877785433110000014444556899988754302554555456655555321001244 Q gi|254780425|r 172 PVMMQSVHWIGLFGMNALSVFCFASPALFGTRRDVNIGIILSSTLLILHIAYGWWILTEDSKSSLKFEKISPVIRIVQPG 251 (518) Q Consensus 172 ~~~~q~a~~~Gv~Glsflvv~i~~~~~~~~~~~~~~~~~~~~~~ll~~~~~~g~~~~~~~~~~~~~~~~~~~~V~iVQpn 251 (518) +++|++|+|+.++++.++......+++.+ .....+++..+.+. .. .....+.+.+|++|||| T Consensus 144 -------si~Gv~glsf~~i~~~~~~~~~~~~k~~~---~~~i~lll~~~~~~---~~-----~~~~~~~~~~V~lVQ~N 205 (420) T PRK12291 144 -------SYFGPDKLSFALIFLAIIFLSKKPLKKYK---IIAILLLLFALDFQ---YN-----EKSALNLPLNIELVNTN 205 (420) T ss_pred -------HHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHH---HC-----CCCCCCCCEEEEEECCC T ss_conf -------67778999999999999999998767899---99999999998763---14-----54467887179998189 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEECCCCCCCCHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCCCCCC Q ss_conf 20355543333578998754310133466861898312101100011234667765641079824201100123455751 Q gi|254780425|r 252 INPAIKEDREKILERYLSLTALPVSAGELEPVIIVWAYLPFPFSIVDQPSILKKIASVLKINQLLIVGSMRKELVDGRSH 331 (518) Q Consensus 252 i~~~~k~~~~~~~~~~~~l~~~~~~~~~~~~dLiVWPEsa~p~~~~~~~~~~~~l~~~~~~~~~li~G~~~~~~~~~~~~ 331 (518) +||++|||++...+...++.++..++.++++|+|||||+|+|...++++...+.+.+. ..+..+++|+.+.| +++ T Consensus 206 I~q~~Kw~~~~~~~~~~~~~~l~~~~~~~~~dLiIwPEta~P~~~~~~~~~~~~l~~~-~~~~~ii~G~~~~e----~~~ 280 (420) T PRK12291 206 ISQDLKWDKEYLKSIINENLKEIDKAIDEKKTLIVLPESAFPLFLNNSPFLLDELKEL-SYKIAIIAGALRYE----NEH 280 (420) T ss_pred CCHHHHHCHHHHHHHHHHHHHHHHHHHCCCCCEEEECCCCCHHHHHHCHHHHHHHHHH-CCCCEEEECCEECC----CCC T ss_conf 8967751954599999999998753214799889965744466674089999999852-67982997624424----784 Q ss_pred EEEEEECCCCCCCCCCCCCCEEEEEHHHHHCCCCHHHHHH-HHCCCCCCCCCCCCCCCCEEECCCCEEECCCCHHHHHHH Q ss_conf 2100003673212211125633422133100100001211-100122221013587762000122001022216665588 Q gi|254780425|r 332 FYKSIYIINSKGEILASSNAKHLVPFAEYLPYRNILKKLN-LDFYMFPLDYSSSDISPLLGLSEKLRLYPLLFSDALFHQ 410 (518) Q Consensus 332 ~yNs~~~~~~~g~~~~~Y~K~~LVPFGEyiP~~~~~~~l~-~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~ICyE~~f~~ 410 (518) +|||+++++. ++ .++|||+||||||||+||+++++.+. ..+.....|+++|++++.+.. ++.+++|+||||++||+ T Consensus 281 ~yNS~~~~~~-~~-~~~YdK~~LVPFGEyiPf~~~l~~~~~~~~~~~~~~f~~G~~~~~~~~-~~~~~~~~ICYE~~~~~ 357 (420) T PRK12291 281 IYNSTYLFSK-GN-VQIADKVILVPFGEEIPLPEFLKDPINKLFFGGASDFSKASNFSDFTL-DGVKFRNAICYEATSEE 357 (420) T ss_pred EEEEEEEECC-CC-CCEEEEEECCCCCCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCCCEEE-CCEEEEEEEEEEECCHH T ss_conf 5668899668-87-124744301265357657999999999971677666889999985547-89179989847766734 Q ss_pred HHHCCCCCCEEEEECCCCCCCCC-HHHHHHHHHHHHHHHCCCEEEEEECCCCCEEECCCC Q ss_conf 88512699779996277557676-289999999999997299199985597046998998 Q gi|254780425|r 411 DINNLENVSAILNIIDDSGFMGS-GTDHSFRYAQIQAVEIGLPLIRATNNGVSAFLDERG 469 (518) Q Consensus 411 ~~r~~~~a~~lvn~tNd~Wf~~s-~~~qH~~~a~~RAiE~~r~~vra~ntGiS~~id~~G 469 (518) ..++ ++++++|+|||||||+| ||+||++|||+||+|+|||++||||||+||+|||+- T Consensus 358 ~~~~--~~~~lvniSNDaWFg~S~gP~Qh~~~a~~RAiE~g~~i~rsaN~GiSaiI~Pk~ 415 (420) T PRK12291 358 LYEG--PSKIIIAISNNAWFVPSIEPTLQKLLLKYYARKYGKTIYHAANGSPSEIITPKK 415 (420) T ss_pred HHCC--CCCEEEEEEECCCCCCCHHHHHHHHHHHHHHHHHCCCEEEECCCEEEEEECHHH T ss_conf 4337--898899951043258962699999999999997099889983880349998244 No 5 >cd07571 ALP_N-acyl_transferase Apolipoprotein N-acyl transferase (class 9 nitrilases). ALP N-acyl transferase (Lnt), is an essential membrane-bound enzyme in gram-negative bacteria, which catalyzes the N-acylation of apolipoproteins, the final step in lipoprotein maturation. This is a reverse amidase (i.e. condensation) reaction. This subgroup belongs to a larger nitrilase superfamily comprised of nitrile- or amide-hydrolyzing enzymes and amide-condensing enzymes, which depend on a Glu-Lys-Cys catalytic triad. This superfamily has been classified in the literature based on global and structure based sequence analysis into thirteen different enzyme classes (referred to as 1-13), this subgroup corresponds to class 9. Probab=100.00 E-value=0 Score=542.21 Aligned_cols=265 Identities=25% Similarity=0.406 Sum_probs=235.3 Q ss_pred CCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEECCCCCCCCHHHHHHHHHHHHHHHC-CCCEEEECCC Q ss_conf 321001244203555433335789987543101334668618983121011000112346677656410-7982420110 Q gi|254780425|r 243 PVIRIVQPGINPAIKEDREKILERYLSLTALPVSAGELEPVIIVWAYLPFPFSIVDQPSILKKIASVLK-INQLLIVGSM 321 (518) Q Consensus 243 ~~V~iVQpni~~~~k~~~~~~~~~~~~l~~~~~~~~~~~~dLiVWPEsa~p~~~~~~~~~~~~l~~~~~-~~~~li~G~~ 321 (518) ++|++||||+||++||+.+...+.+.++.+++.++..+++|||||||+++|.+..+++...+.+.+.++ .+.++|+|+. T Consensus 1 ~~V~lVQ~Ni~~~~k~~~~~~~~~~~~~~~ls~~~~~~~~dlIVwPEta~p~~~~~~~~~~~~~~~~~~~~~~~li~G~~ 80 (270) T cd07571 1 LRVALVQGNIPQDEKWDPEQRQATLDRYLDLTRELADEKPDLVVWPETALPFDLQRDPDALARLARAARAVGAPLLTGAP 80 (270) T ss_pred CEEEEEECCCCHHHHCCHHHHHHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCHHHCHHHHHHHHHHHHHCCCEEEEEEE T ss_conf 98999907998145128546999999999999877617999999489466765111889999999998755997999554 Q ss_pred CCCCCCCCCCEEEEEECCCCCCCCCCCCCCEEEEEHHHHHCCCCHHHHHHHHCCCCCCCCCCCCCCCCEEECCCCEEECC Q ss_conf 01234557512100003673212211125633422133100100001211100122221013587762000122001022 Q gi|254780425|r 322 RKELVDGRSHFYKSIYIINSKGEILASSNAKHLVPFAEYLPYRNILKKLNLDFYMFPLDYSSSDISPLLGLSEKLRLYPL 401 (518) Q Consensus 322 ~~~~~~~~~~~yNs~~~~~~~g~~~~~Y~K~~LVPFGEyiP~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~ 401 (518) ..+.. +.++|||+++++++|++.++|||+||||||||+||+++++++...+.....++.+|+...++...+..+++++ T Consensus 81 ~~~~~--~~~~yNs~~~~~~~g~~~~~Y~K~hLVPFGEyiP~~~~~~~~~~~~~~~~~~~~~G~~~~~~~~~~~~~~~~~ 158 (270) T cd07571 81 RREPG--GGRYYNSALLLDPGGGILGRYDKHHLVPFGEYVPLRDLLRFLGLLFDLPMGDFSPGTGPQPLLLGGGVRVGPL 158 (270) T ss_pred EEECC--CCCEEEEEEEECCCCCEEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCCEECCCCCEEEEE T ss_conf 56569--8867889999989998420784332206541366177777788752466586314999864652798258889 Q ss_pred CCHHHHHHHHHHCC--CCCCEEEEECCCCCCCCC-HHHHHHHHHHHHHHHCCCEEEEEECCCCCEEECCCCEEEEECCCC Q ss_conf 21666558888512--699779996277557676-289999999999997299199985597046998998087773899 Q gi|254780425|r 402 LFSDALFHQDINNL--ENVSAILNIIDDSGFMGS-GTDHSFRYAQIQAVEIGLPLIRATNNGVSAFLDERGQIISSVYAD 478 (518) Q Consensus 402 ICyE~~f~~~~r~~--~~a~~lvn~tNd~Wf~~s-~~~qH~~~a~~RAiE~~r~~vra~ntGiS~~id~~G~i~~~~~~~ 478 (518) ||||++||+.+|+. +||++|+|+|||||||+| +|+||+++||+||||||||++||||||+|++|||+|||+++++.+ T Consensus 159 ICYE~~f~~~~r~~~~~ga~~lv~~tNdaWfg~s~~~~Qhl~~ar~RAiE~~r~~vr~~ntGiSa~Id~~G~i~~~~~~~ 238 (270) T cd07571 159 ICYESIFPELVRDAVRQGADLLVNITNDAWFGDSAGPYQHLAMARLRAIETGRPLVRAANTGISAVIDPDGRIVARLPLF 238 (270) T ss_pred EEEHHHCHHHHHHHHHCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCCEEEECCCEEEEEECCCCCEEEECCCC T ss_conf 97534364999998644870999973664567884179999998988998299639866962689999998788874799 Q ss_pred CCEEEEEEECCCCCCCHHHHHHHHHHHHHHH Q ss_conf 8279999960676876878655689999999 Q gi|254780425|r 479 RGASIDMHFQPKVRDSFRSSIQMRVFWIIEF 509 (518) Q Consensus 479 ~~~~l~~~v~~~~~~T~y~~~g~~~~~~~~~ 509 (518) +++++++++++.+++|+|+|||||+.++|++ T Consensus 239 ~~~~l~~~v~~~~~~T~y~~~Gd~~~~~~~~ 269 (270) T cd07571 239 EAGVLVAEVPLRTGLTPYVRWGDWPLLLLLL 269 (270) T ss_pred CCEEEEEEEECCCCCCHHHHHHHHHHHHHHH T ss_conf 8279999950588987999987899999862 No 6 >cd07565 aliphatic_amidase aliphatic amidases (class 2 nitrilases). Aliphatic amidases catalyze the hydrolysis of short-chain aliphatic amides to form ammonia and the corresponding organic acid. This group includes Pseudomonas aeruginosa (Pa) AmiE, the amidase from Geobacillus pallidus RAPc8 (RAPc8 amidase), and Helicobacter pylori (Hp) AmiE and AmiF. PaAimE and HpAmiE hydrolyze various very short aliphatic amides, including propionamide, acetamide and acrylamide. HpAmiF is a formamidase which specifically hydrolyzes formamide. These proteins belong to a larger nitrilase superfamily comprised of nitrile- or amide-hydrolyzing enzymes and amide-condensing enzymes, which depend on a Glu-Lys-Cys catalytic triad. This superfamily has been classified in the literature based on global and structure based sequence analysis into thirteen different enzyme classes (referred to as 1-13), this subgroup corresponds to class 2. Members of this superfamily generally form homomeric complexes, the basic Probab=99.97 E-value=2.6e-29 Score=209.50 Aligned_cols=226 Identities=15% Similarity=0.105 Sum_probs=159.2 Q ss_pred CCCCCCCCCCC-HHHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEECCCCCCCC-H--HHH--------HHHHHHHHHHH Q ss_conf 32100124420-355543333578998754310133466861898312101100-0--112--------34667765641 Q gi|254780425|r 243 PVIRIVQPGIN-PAIKEDREKILERYLSLTALPVSAGELEPVIIVWAYLPFPFS-I--VDQ--------PSILKKIASVL 310 (518) Q Consensus 243 ~~V~iVQpni~-~~~k~~~~~~~~~~~~l~~~~~~~~~~~~dLiVWPEsa~p~~-~--~~~--------~~~~~~l~~~~ 310 (518) ++|++||=..+ .+.+.|.++..++..++.+.+. ++.+++||||+||.+...+ . .+. ....+.+.+++ T Consensus 1 v~va~vQ~~~~~~~~~~dv~~n~~~i~~~i~~a~-a~~~GadLIVlPE~~~tG~~~~~~~~~~~a~~~~Gp~~~~l~~~A 79 (291) T cd07565 1 VGVAVVQYKVPVLHTKEEVLENAERIADMVEGTK-RGLPGMDLIVFPEYSTQGLMYDKWTMDETACTVPGPETDIFAEAC 79 (291) T ss_pred CEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHH-HCCCCCEEEECCCCCCCCCCCCHHHHHHHHHCCCCHHHHHHHHHH T ss_conf 9699996348888998999999999999999997-349998699888865578997767888754229987999999999 Q ss_pred CCCCEEEECCCCCCCCCCCCCEEEEEECCCCCCCCCCCCCCEEEEEHHHHHCCCCHHHHHHHHCCCCCCCCCCCCCCC-C Q ss_conf 079824201100123455751210000367321221112563342213310010000121110012222101358776-2 Q gi|254780425|r 311 KINQLLIVGSMRKELVDGRSHFYKSIYIINSKGEILASSNAKHLVPFAEYLPYRNILKKLNLDFYMFPLDYSSSDISP-L 389 (518) Q Consensus 311 ~~~~~li~G~~~~~~~~~~~~~yNs~~~~~~~g~~~~~Y~K~~LVPFGEyiP~~~~~~~l~~~~~~~~~~~~~g~~~~-~ 389 (518) +++...++++......+.+.++|||+++++++|++..+|+|.|| |. |.+ .+.+|+... + T Consensus 80 ~~~~v~iv~g~~E~~~~~~~~~yNtavvid~~G~ii~~YRKiHl--~~---~~e---------------~~~pGd~~~~v 139 (291) T cd07565 80 KEAKVWGVFSIMERNPDHGKNPYNTAIIIDDQGEIVLKYRKLHP--WV---PIE---------------PWYPGDLGTPV 139 (291) T ss_pred HHCCCEEEEEEEEECCCCCCCEEEEEEEECCCCCEEEEECCCCC--CC---CCC---------------CCCCCCCCCEE T ss_conf 86692899986876478789768899999899839989864168--88---777---------------84468887547 Q ss_pred EEECCCCEEECCCCHHHHHHHHHHCC--CCCCEEEEECCCCCCCCCHHHHHHHHHHHHHHHCCCEEEEEECC-------- Q ss_conf 00012200102221666558888512--69977999627755767628999999999999729919998559-------- Q gi|254780425|r 390 LGLSEKLRLYPLLFSDALFHQDINNL--ENVSAILNIIDDSGFMGSGTDHSFRYAQIQAVEIGLPLIRATNN-------- 459 (518) Q Consensus 390 ~~~~~~~~~~~~ICyE~~f~~~~r~~--~~a~~lvn~tNd~Wf~~s~~~qH~~~a~~RAiE~~r~~vra~nt-------- 459 (518) .....+.++|.+||||.-||+..|.+ +||++|+++|+ | ......|...+.|.||+||+.+++-+... T Consensus 140 ~~t~~G~kiGl~ICyD~~FPE~~R~lal~GAeli~~Psa--~-~~~~~~~W~~l~rARAieN~~yvv~~Nr~G~e~~~~f 216 (291) T cd07565 140 CEGPKGSKIALIICHDGMYPEIARECAYKGAELIIRIQG--Y-MYPAKDQWIITNKANAWCNLMYTASVNLAGFDGVFSY 216 (291) T ss_pred EECCCCCEEEEEEECCCCCHHHHHHHHHCCCEEEEECCC--C-CCCCHHHHHHHHHHHHHHHCCEEEEECCCCCCCCEEE T ss_conf 876898689999863424289999999789949998887--8-9885178999999999970978999856646898477 Q ss_pred -CCCEEECCCCEEEEECCCCCCEEEEEEECCCCC Q ss_conf -704699899808777389982799999606768 Q gi|254780425|r 460 -GVSAFLDERGQIISSVYADRGASIDMHFQPKVR 492 (518) Q Consensus 460 -GiS~~id~~G~i~~~~~~~~~~~l~~~v~~~~~ 492 (518) |-|.+|||+|+|+++.+..+++++.+++-+..- T Consensus 217 ~G~S~IvdP~G~vla~a~~~~eei~~AeiDl~~~ 250 (291) T cd07565 217 FGESMIVNFDGRTLGEGGREPDEIVTAELSPSLV 250 (291) T ss_pred EEEEEEECCCCCEEEECCCCCCEEEEEEECHHHH T ss_conf 4466999999888777579998799999878999 No 7 >cd07197 nitrilase Nitrilase superfamily, including nitrile- or amide-hydrolyzing enzymes and amide-condensing enzymes. This superfamily (also known as the C-N hydrolase superfamily) contains hydrolases that break carbon-nitrogen bonds; it includes nitrilases, cyanide dihydratases, aliphatic amidases, N-terminal amidases, beta-ureidopropionases, biotinidases, pantotheinase, N-carbamyl-D-amino acid amidohydrolases, the glutaminase domain of glutamine-dependent NAD+ synthetase, apolipoprotein N-acyltransferases, and N-carbamoylputrescine amidohydrolases, among others. These enzymes depend on a Glu-Lys-Cys catalytic triad, and work through a thiol acylenzyme intermediate. Members of this superfamily generally form homomeric complexes, the basic building block of which is a homodimer. These oligomers include dimers, tetramers, hexamers, octamers, tetradecamers, octadecamers, as well as variable length helical arrangements and homo-oligomeric spirals. These proteins have roles in vitamin and Probab=99.96 E-value=8e-29 Score=206.30 Aligned_cols=213 Identities=21% Similarity=0.241 Sum_probs=158.9 Q ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEECCCCCCCCHHH------------HHHHHHHHHHHHCC Q ss_conf 100124420355543333578998754310133466861898312101100011------------23466776564107 Q gi|254780425|r 245 IRIVQPGINPAIKEDREKILERYLSLTALPVSAGELEPVIIVWAYLPFPFSIVD------------QPSILKKIASVLKI 312 (518) Q Consensus 245 V~iVQpni~~~~k~~~~~~~~~~~~l~~~~~~~~~~~~dLiVWPEsa~p~~~~~------------~~~~~~~l~~~~~~ 312 (518) |+++|.|... .|.+..+++..++.++ +.+.++||||+||.+++-+..+ .....+.+.+.+++ T Consensus 1 IA~~Q~~~~~---~d~~~Nl~~i~~~i~~---A~~~gadlvv~PE~~l~Gy~~~~~~~~~~~~~~~~~~~~~~l~~~a~~ 74 (253) T cd07197 1 IAAVQLAPKI---GDVEANLAKALRLIKE---AAEQGADLIVLPELFLTGYSFESAKEDLDLAEELDGPTLEALAELAKE 74 (253) T ss_pred EEEEECCCCC---CCHHHHHHHHHHHHHH---HHHCCCCEEECCCCCCCCCCHHHHHHCHHHHHHCCCHHHHHHHHHHHH T ss_conf 0999999876---8899999999999999---998859099958873035983565557766543772999999999988 Q ss_pred CCE-EEECCCCCCCCCCCCCEEEEEECCCCCCCCCCCCCCEEEEEHHHHHCCCCHHHHHHHHCCCCCCCCCCCCCCCCEE Q ss_conf 982-4201100123455751210000367321221112563342213310010000121110012222101358776200 Q gi|254780425|r 313 NQL-LIVGSMRKELVDGRSHFYKSIYIINSKGEILASSNAKHLVPFAEYLPYRNILKKLNLDFYMFPLDYSSSDISPLLG 391 (518) Q Consensus 313 ~~~-li~G~~~~~~~~~~~~~yNs~~~~~~~g~~~~~Y~K~~LVPFGEyiP~~~~~~~l~~~~~~~~~~~~~g~~~~~~~ 391 (518) ++. +++|.... +++++||++++++++|+++.+|+|+||.+|+|... |++|+..+++. T Consensus 75 ~~i~i~~G~~e~----~~~~~yNs~~~i~~~G~~i~~y~K~~l~~~~E~~~------------------~~~G~~~~v~~ 132 (253) T cd07197 75 LGIYIVAGIAEK----DGDKLYNTAVVIDPDGEIIGKYRKIHLFDFGERRY------------------FSPGDEFPVFD 132 (253) T ss_pred CCCEEEEEEEEE----ECCEEEEEEEEECCCCCEEEEEEECCCCCCCHHHC------------------CCCCCCCEEEE T ss_conf 097899736894----39988898999879986898887569999414203------------------37999754776 Q ss_pred ECCCCEEECCCCHHHHHHHHHHCC--CCCCEEEEECCCCCCCCCHHHHHHHHHHHHHHHCCCEEEEEECCCC-------- Q ss_conf 012200102221666558888512--6997799962775576762899999999999972991999855970-------- Q gi|254780425|r 392 LSEKLRLYPLLFSDALFHQDINNL--ENVSAILNIIDDSGFMGSGTDHSFRYAQIQAVEIGLPLIRATNNGV-------- 461 (518) Q Consensus 392 ~~~~~~~~~~ICyE~~f~~~~r~~--~~a~~lvn~tNd~Wf~~s~~~qH~~~a~~RAiE~~r~~vra~ntGi-------- 461 (518) .++.++|.+||||..||+..|.+ +|+++++++|+ |.. +...++..+.+.||+||+.+++.+...|. T Consensus 133 -~~~~~ig~~IC~D~~~pe~~r~~~~~Ga~lil~psa--~~~-~~~~~~~~~~~arA~en~~~vv~~n~vG~~~~~~~~G 208 (253) T cd07197 133 -TPGGKIGLLICYDLRFPELARELALKGADIILVPAA--WPT-ARREHWELLLRARAIENGVYVVAANRVGEEGGLEFAG 208 (253) T ss_pred -ECCEEEEEEEECCCCCHHHHHHHHHCCCCEEEEECC--CCC-CCCHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCC T ss_conf -299689866720535779999999759999999267--899-8721456778888885497799976633689960446 Q ss_pred -CEEECCCCEEEEECCCCCCEEEEEEECCC Q ss_conf -46998998087773899827999996067 Q gi|254780425|r 462 -SAFLDERGQIISSVYADRGASIDMHFQPK 490 (518) Q Consensus 462 -S~~id~~G~i~~~~~~~~~~~l~~~v~~~ 490 (518) |+++||+|+++.+.+. +++.+.+++.+. T Consensus 209 ~S~Ii~P~G~~l~~~~~-~e~~i~a~idl~ 237 (253) T cd07197 209 GSMIVDPDGEVLAEASE-EEGILVAELDLD 237 (253) T ss_pred EEEEECCCCCEEEECCC-CCEEEEEEECHH T ss_conf 57999599829863599-982999997099 No 8 >cd07581 nitrilase_3 Uncharacterized subgroup of the nitrilase superfamily (putative class 13 nitrilases). The nitrilase superfamily is comprised of nitrile- or amide-hydrolyzing enzymes and amide-condensing enzymes, which depend on a Glu-Lys-Cys catalytic triad. This superfamily has been classified in the literature based on global and structure based sequence analysis into thirteen different enzyme classes (referred to as 1-13). Class 13 represents proteins that at the time were difficult to place in a distinct similarity group; this subgroup represents either a new class or one that was included previously in class 13. Members of this superfamily generally form homomeric complexes, the basic building block of which is a homodimer. Probab=99.96 E-value=2.2e-28 Score=203.40 Aligned_cols=218 Identities=17% Similarity=0.211 Sum_probs=155.8 Q ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEECCCCCCCCHHHH-----------HHHHHHHHHHHCCC Q ss_conf 1001244203555433335789987543101334668618983121011000112-----------34667765641079 Q gi|254780425|r 245 IRIVQPGINPAIKEDREKILERYLSLTALPVSAGELEPVIIVWAYLPFPFSIVDQ-----------PSILKKIASVLKIN 313 (518) Q Consensus 245 V~iVQpni~~~~k~~~~~~~~~~~~l~~~~~~~~~~~~dLiVWPEsa~p~~~~~~-----------~~~~~~l~~~~~~~ 313 (518) |+++|=+.. -|.+..+++..++.+. +.+.++||||+||.++..+..+. ....+.+.++++++ T Consensus 1 IA~~Q~~~~----gd~~~Nl~ki~~~i~~---A~~~gadlvvfPE~~l~~~~~~~~~~~~~a~~~~~~~~~~l~~~A~~~ 73 (255) T cd07581 1 VALAQFASS----GDKEENLEKVRRLLAE---AAAAGADLVVFPEYTMARFGDGLDDYARVAEPLDGPFVSALARLAREL 73 (255) T ss_pred CEEEECCCC----CCHHHHHHHHHHHHHH---HHHCCCEEEECCCCCCCCCCCCHHHHHHHHHHCCCHHHHHHHHHHHHC T ss_conf 489994799----9999999999999999---998839899989882005994377888764246869999999999877 Q ss_pred CEEEECCCCCCCCCCCCCEEEEEECCCCCCCCCCCCCCEEEEE-HHHHHCCCCHHHHHHHHCCCCCCCCCCCCCCC-CEE Q ss_conf 8242011001234557512100003673212211125633422-13310010000121110012222101358776-200 Q gi|254780425|r 314 QLLIVGSMRKELVDGRSHFYKSIYIINSKGEILASSNAKHLVP-FAEYLPYRNILKKLNLDFYMFPLDYSSSDISP-LLG 391 (518) Q Consensus 314 ~~li~G~~~~~~~~~~~~~yNs~~~~~~~g~~~~~Y~K~~LVP-FGEyiP~~~~~~~l~~~~~~~~~~~~~g~~~~-~~~ 391 (518) ...|+++.... +++.++||++++++++|+++..|+|+||.+ || +....-|.+|+..+ ... T Consensus 74 ~i~iv~G~~e~--~~~~~~yNta~~i~~~G~i~~~y~K~hL~~~~~----------------~~E~~~f~~G~~~~~~~~ 135 (255) T cd07581 74 GITVVAGMFEP--AGDGRVYNTLVVVGPDGEIIAVYRKIHLYDAFG----------------FRESDTVAPGDELPPVVF 135 (255) T ss_pred CCEEEEEEEEE--CCCCCEEEEEEEEECCCCEEEEEEEEECCCCCC----------------CCEEEEEECCCCCCEEEE T ss_conf 99799800378--899948999999968996588995342577788----------------762345735998751899 Q ss_pred ECCCCEEECCCCHHHHHHHHHHCC--CCCCEEEEECCCCCCCCCHHHHH-HHHHHHHHHHCCCEEEEEE-----CCCCCE Q ss_conf 012200102221666558888512--69977999627755767628999-9999999997299199985-----597046 Q gi|254780425|r 392 LSEKLRLYPLLFSDALFHQDINNL--ENVSAILNIIDDSGFMGSGTDHS-FRYAQIQAVEIGLPLIRAT-----NNGVSA 463 (518) Q Consensus 392 ~~~~~~~~~~ICyE~~f~~~~r~~--~~a~~lvn~tNd~Wf~~s~~~qH-~~~a~~RAiE~~r~~vra~-----ntGiS~ 463 (518) ..++.++|.+||||.-||+..|.+ +|+++++++|+ |....+..+| ..+.|.||+||+.+++.+. ..|-|+ T Consensus 136 ~~~g~~iG~~IC~D~~fPe~~r~la~~Ga~li~~psa--~~~~~~~~~~w~~~~~arA~en~~~vv~~n~~G~~~~G~S~ 213 (255) T cd07581 136 VVGGVKVGLATCYDLRFPELARALALAGADVIVVPAA--WVAGPGKEEHWETLLRARALENTVYVAAAGQAGPRGIGRSM 213 (255) T ss_pred EECCEEEEEEEEEECCCCHHHHHHHHCCCCEEEECCC--CCCCCCHHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCEEE T ss_conf 8299589978877067819999999879979998778--88988759999999999999659879999471799741599 Q ss_pred EECCCCEEEEECCCCCCEEEEEEECCC Q ss_conf 998998087773899827999996067 Q gi|254780425|r 464 FLDERGQIISSVYADRGASIDMHFQPK 490 (518) Q Consensus 464 ~id~~G~i~~~~~~~~~~~l~~~v~~~ 490 (518) ++||+|+++++++. +++.+.+++-+. T Consensus 214 Iv~P~G~ila~~~~-~E~~i~a~iDl~ 239 (255) T cd07581 214 VVDPLGVVLADLGE-REGLLVADIDPE 239 (255) T ss_pred EECCCCCEEEECCC-CCCEEEEEECHH T ss_conf 99899749775599-982899995499 No 9 >cd07583 nitrilase_5 Uncharacterized subgroup of the nitrilase superfamily (putative class 13 nitrilases). The nitrilase superfamily is comprised of nitrile- or amide-hydrolyzing enzymes and amide-condensing enzymes, which depend on a Glu-Lys-Cys catalytic triad. This superfamily has been classified in the literature based on global and structure based sequence analysis into thirteen different enzyme classes (referred to as 1-13). Class 13 represents proteins that at the time were difficult to place in a distinct similarity group; this subgroup represents either a new class or one that was included previously in class 13. Members of this superfamily generally form homomeric complexes, the basic building block of which is a homodimer. Probab=99.96 E-value=2.8e-28 Score=202.78 Aligned_cols=216 Identities=17% Similarity=0.204 Sum_probs=155.8 Q ss_pred CCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEECCCCCCCCHHH---------HHHHHHHHHHHHCCCC Q ss_conf 2100124420355543333578998754310133466861898312101100011---------2346677656410798 Q gi|254780425|r 244 VIRIVQPGINPAIKEDREKILERYLSLTALPVSAGELEPVIIVWAYLPFPFSIVD---------QPSILKKIASVLKINQ 314 (518) Q Consensus 244 ~V~iVQpni~~~~k~~~~~~~~~~~~l~~~~~~~~~~~~dLiVWPEsa~p~~~~~---------~~~~~~~l~~~~~~~~ 314 (518) +|+++|.|.-. .|.+..+++..++.+. +.+.++||||+||.+++-+..+ +....+.+.+.++++. T Consensus 1 kIA~~Q~~~~~---gd~~~Nl~~i~~~i~~---A~~~ga~lvvfPE~~ltGY~~~~~~~~a~~~~~~~~~~l~~~a~~~~ 74 (253) T cd07583 1 KIALIQLDIVW---GDPEANIERVESLIEE---AAAAGADLIVLPEMWNTGYFLDDLYELADEDGGETVSFLSELAKKHG 74 (253) T ss_pred CEEEEECCCCC---CCHHHHHHHHHHHHHH---HHHCCCEEEEECCCCCCCCCHHHHHHHCCCCCCHHHHHHHHHHHHHC T ss_conf 99999898987---9999999999999999---99883989991898635899789986333478299999999988749 Q ss_pred E-EEECCCCCCCCCCCCCEEEEEECCCCCCCCCCCCCCEEEEEH-HHHHCCCCHHHHHHHHCCCCCCCCCCCCCCCCEEE Q ss_conf 2-420110012345575121000036732122111256334221-33100100001211100122221013587762000 Q gi|254780425|r 315 L-LIVGSMRKELVDGRSHFYKSIYIINSKGEILASSNAKHLVPF-AEYLPYRNILKKLNLDFYMFPLDYSSSDISPLLGL 392 (518) Q Consensus 315 ~-li~G~~~~~~~~~~~~~yNs~~~~~~~g~~~~~Y~K~~LVPF-GEyiP~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~ 392 (518) . +++|..... .++++|||+++++|+|+++.+|+|+||.|+ +| ..-|++|+...++. T Consensus 75 i~iv~G~~~e~---~~~~~yNsa~~i~~~G~i~~~yrK~hl~~~~~E------------------~~~f~~G~~~~v~~- 132 (253) T cd07583 75 VNIVAGSVAEK---EGGKLYNTAYVIDPDGELIATYRKIHLFGLMGE------------------DKYLTAGDELEVFE- 132 (253) T ss_pred CEEEEEEEEEE---ECCEEEEEEEEECCCCEEEEEEECCCCCCCCCC------------------CEEEECCCCCCEEE- T ss_conf 79999006980---099499999999089859999923026997661------------------53786488641376- Q ss_pred CCCCEEECCCCHHHHHHHHHHCC--CCCCEEEEECCCCCCCCCHHHHHHHHHHHHHHHCCCEEEEEECC---------CC Q ss_conf 12200102221666558888512--69977999627755767628999999999999729919998559---------70 Q gi|254780425|r 393 SEKLRLYPLLFSDALFHQDINNL--ENVSAILNIIDDSGFMGSGTDHSFRYAQIQAVEIGLPLIRATNN---------GV 461 (518) Q Consensus 393 ~~~~~~~~~ICyE~~f~~~~r~~--~~a~~lvn~tNd~Wf~~s~~~qH~~~a~~RAiE~~r~~vra~nt---------Gi 461 (518) .++.++|.+||||.-||+..|.+ +||++++++++ |-... ..|...+.+.||+||+.+++-+... |- T Consensus 133 ~~~~~iG~~IC~D~~fPe~~r~~a~~Gadii~~psa--~p~~~-~~~w~~~~~arA~en~~~vv~aN~~G~~~~~~~~G~ 209 (253) T cd07583 133 LDGGKVGLFICYDLRFPELFRKLALEGAEILFVPAE--WPAAR-IEHWRTLLRARAIENQAFVVACNRVGTDGGNEFGGH 209 (253) T ss_pred ECCEEEEEEEECCCCCHHHHHHHHHCCCCEEEECCC--CCCCC-HHHHHHHHHHHHHHCCCCEEEECCCCCCCCCEECCE T ss_conf 445278857860333789999999879979999069--99742-266766899999983994999766215899787344 Q ss_pred CEEECCCCEEEEECCCCCCEEEEEEECCCC Q ss_conf 469989980877738998279999960676 Q gi|254780425|r 462 SAFLDERGQIISSVYADRGASIDMHFQPKV 491 (518) Q Consensus 462 S~~id~~G~i~~~~~~~~~~~l~~~v~~~~ 491 (518) |++|||+|+++++.+. +++.+.+++-+.. T Consensus 210 S~Iv~P~G~ila~~~~-~e~~i~adiDl~~ 238 (253) T cd07583 210 SMVIDPWGEVLAEAGE-EEEILTAEIDLEE 238 (253) T ss_pred EEEECCCCCEEEECCC-CCEEEEEEECHHH T ss_conf 8999999869888799-9849999973999 No 10 >cd07572 nit Nit1, Nit 2, and related proteins, and the Nit1-like domain of NitFhit (class 10 nitrilases). This subgroup includes mammalian Nit1 and Nit2, the Nit1-like domain of the invertebrate NitFhit, and various uncharacterized bacterial and archaeal Nit-like proteins. Nit1 and Nit2 are candidate tumor suppressor proteins. In NitFhit, the Nit1-like domain is encoded as a fusion protein with the non-homologous tumor suppressor, fragile histidine triad (Fhit). Mammalian Nit1 and Fhit may affect distinct signal pathways, and both may participate in DNA damage-induced apoptosis. Nit1 is a negative regulator in T cells. Overexpression of Nit2 in HeLa cells leads to a suppression of cell growth through cell cycle arrest in G2. These Nit proteins and the Nit1-like domain of NitFhit belong to a larger nitrilase superfamily comprised of nitrile- or amide-hydrolyzing enzymes and amide-condensing enzymes, which depend on a Glu-Lys-Cys catalytic triad. This superfamily has been classified in t Probab=99.96 E-value=7.3e-28 Score=200.05 Aligned_cols=225 Identities=16% Similarity=0.137 Sum_probs=157.4 Q ss_pred CCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEECCCCCCCCHH-----------HHHHHHHHHHHHHCC Q ss_conf 210012442035554333357899875431013346686189831210110001-----------123466776564107 Q gi|254780425|r 244 VIRIVQPGINPAIKEDREKILERYLSLTALPVSAGELEPVIIVWAYLPFPFSIV-----------DQPSILKKIASVLKI 312 (518) Q Consensus 244 ~V~iVQpni~~~~k~~~~~~~~~~~~l~~~~~~~~~~~~dLiVWPEsa~p~~~~-----------~~~~~~~~l~~~~~~ 312 (518) +|+++|-+.. -|.+..+++..++.+. +.+.++||||+||.....+.. ......+.+.+.+++ T Consensus 1 rIA~~Q~~~~----~d~~~Nl~k~~~~i~~---Aa~~gadlivfPE~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~A~~ 73 (265) T cd07572 1 RVALIQMTST----ADKEANLARAKELIEE---AAAQGAKLVVLPECFNYPGGTDAFKLALAEEEGDGPTLQALSELAKE 73 (265) T ss_pred CEEEEECCCC----CCHHHHHHHHHHHHHH---HHHCCCEEEECCCCCCCCCCCCHHHHHHHCCCCCCHHHHHHHHHHHH T ss_conf 9999968898----8999999999999999---99885979998865247788603556652454799899999999998 Q ss_pred CCEE-EECCCCCCCCCCCCCEEEEEECCCCCCCCCCCCCCEEEEEHHHHHCCCCHHHHHHHHCCCCCCCCCCCCCCCCEE Q ss_conf 9824-201100123455751210000367321221112563342213310010000121110012222101358776200 Q gi|254780425|r 313 NQLL-IVGSMRKELVDGRSHFYKSIYIINSKGEILASSNAKHLVPFAEYLPYRNILKKLNLDFYMFPLDYSSSDISPLLG 391 (518) Q Consensus 313 ~~~l-i~G~~~~~~~~~~~~~yNs~~~~~~~g~~~~~Y~K~~LVPFGEyiP~~~~~~~l~~~~~~~~~~~~~g~~~~~~~ 391 (518) ++.. ++|+..... ++++++|||+++++++|+++.+|+|.|| |+--+|=.. .+....-|++|+..+++. T Consensus 74 ~~i~iv~G~~~e~~-~~~g~~yNsa~vi~~~G~i~~~yrK~HL--f~~~~p~~~--------~~~E~~~f~~G~~~~vf~ 142 (265) T cd07572 74 HGIWLVGGSIPERD-DDDGKVYNTSLVFDPDGELVARYRKIHL--FDVDVPGGI--------SYRESDTLTPGDEVVVVD 142 (265) T ss_pred HCCEEEEEEEEEEE-CCCCEEEEEEEEECCCCCEEEEEEEEEE--ECCCCCCCC--------CEEEEEEEECCCEEEEEC T ss_conf 09889952489985-7899489999998799868878613786--034689976--------230133687798524502 Q ss_pred ECCCCEEECCCCHHHHHHHHHHCC--CCCCEEEEECCCCCCCCCHHHHH-HHHHHHHHHHCCCEEEEEECC--------- Q ss_conf 012200102221666558888512--69977999627755767628999-999999999729919998559--------- Q gi|254780425|r 392 LSEKLRLYPLLFSDALFHQDINNL--ENVSAILNIIDDSGFMGSGTDHS-FRYAQIQAVEIGLPLIRATNN--------- 459 (518) Q Consensus 392 ~~~~~~~~~~ICyE~~f~~~~r~~--~~a~~lvn~tNd~Wf~~s~~~qH-~~~a~~RAiE~~r~~vra~nt--------- 459 (518) .++.++|.+||||.-||+..|.+ +|+++++++|. |-..+++. | ..+.|.||+||+-+++-+... T Consensus 143 -~~~~riG~~ICyD~~fPE~~r~la~~Ga~ii~~Psa--~~~~~g~~-~w~~~~~aRA~en~~~v~~~N~~G~~~~~~~~ 218 (265) T cd07572 143 -TPFGKIGLGICYDLRFPELARALARQGADILTVPAA--FTMTTGPA-HWELLLRARAIENQCYVVAAAQAGDHEAGRET 218 (265) T ss_pred -CCCCCCCEEEEECCCCCHHHHHHHHCCCCEEEECCC--CCCCCCHH-HHHHHHHHHHHHCCCEEEEECCCCCCCCCCEE T ss_conf -278434126760445849899877649969997453--57775288-99999999998779469841464458998587 Q ss_pred -CCCEEECCCCEEEEECCCCCCEEEEEEECCCC Q ss_conf -70469989980877738998279999960676 Q gi|254780425|r 460 -GVSAFLDERGQIISSVYADRGASIDMHFQPKV 491 (518) Q Consensus 460 -GiS~~id~~G~i~~~~~~~~~~~l~~~v~~~~ 491 (518) |-|+++||+|+++++.+. +++.+.+++-+.. T Consensus 219 ~G~S~I~~P~G~ila~~~~-~E~ll~adiDl~~ 250 (265) T cd07572 219 YGHSMIVDPWGEVLAEAGE-GEGVVVAEIDLDR 250 (265) T ss_pred CCCEEEECCCCCEEEECCC-CCEEEEEEECHHH T ss_conf 0335999599988876599-9879999964999 No 11 >cd07573 CPA N-carbamoylputrescine amidohydrolase (CPA) (class 11 nitrilases). CPA (EC 3.5.1.53, also known as N-carbamoylputrescine amidase and carbamoylputrescine hydrolase) converts N-carbamoylputrescine to putrescine, a step in polyamine biosynthesis in plants and bacteria. This subgroup includes Arabidopsis thaliana CPA, also known as nitrilase-like 1 (NLP1), and Pseudomonas aeruginosa AguB. This subgroup belongs to a larger nitrilase superfamily comprised of nitrile- or amide-hydrolyzing enzymes and amide-condensing enzymes, which depend on a Glu-Lys-Cys catalytic triad. This superfamily has been classified in the literature based on global and structure based sequence analysis into thirteen different enzyme classes (referred to as 1-13), this subgroup corresponds to class 11. Members of this superfamily generally form homomeric complexes, the basic building block of which is a homodimer; P. aeruginosa AugB is a homohexamer, Arabidopsis thaliana NLP1 is a homooctomer. Probab=99.96 E-value=1.7e-27 Score=197.61 Aligned_cols=226 Identities=17% Similarity=0.200 Sum_probs=161.0 Q ss_pred CCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEECCCCCCCCH-H-------------HHHHHHHHHHH Q ss_conf 321001244203555433335789987543101334668618983121011000-1-------------12346677656 Q gi|254780425|r 243 PVIRIVQPGINPAIKEDREKILERYLSLTALPVSAGELEPVIIVWAYLPFPFSI-V-------------DQPSILKKIAS 308 (518) Q Consensus 243 ~~V~iVQpni~~~~k~~~~~~~~~~~~l~~~~~~~~~~~~dLiVWPEsa~p~~~-~-------------~~~~~~~~l~~ 308 (518) ++|+++|-+.. +|.++.+++..++.+. +.+.++||||+||....-+. . .+....+.+.+ T Consensus 1 mkiA~~Q~~~~----~d~~~Nl~~~~~~i~~---A~~~ga~lvvfPE~~~tgy~~~~~~~~~~~~a~~~~~~~~~~~l~~ 73 (284) T cd07573 1 VTVALVQMACS----EDPEANLAKAEELVRE---AAAQGAQIVCLQELFETPYFCQEEDEDYFDLAEPPIPGPTTARFQA 73 (284) T ss_pred CEEEEEECCCC----CCHHHHHHHHHHHHHH---HHHCCCEEEECCCCCCCCCCCCCCCHHHHHHHHHCCCCHHHHHHHH T ss_conf 98999967189----9999999999999999---9988597999788833489812146478987423179989999999 Q ss_pred HHCCCCEEEECCCCCCCCCCCCCEEEEEECCCCCCCCCCCCCCEEEEEHHHHHCCCCHHHHHHHHCCCCCCCCCCCCCC- Q ss_conf 4107982420110012345575121000036732122111256334221331001000012111001222210135877- Q gi|254780425|r 309 VLKINQLLIVGSMRKELVDGRSHFYKSIYIINSKGEILASSNAKHLVPFAEYLPYRNILKKLNLDFYMFPLDYSSSDIS- 387 (518) Q Consensus 309 ~~~~~~~li~G~~~~~~~~~~~~~yNs~~~~~~~g~~~~~Y~K~~LVPFGEyiP~~~~~~~l~~~~~~~~~~~~~g~~~- 387 (518) .+++++..|+++.... ++++++|||+++++++|++..+|+|+|| |.++ .+....-|++|+.. T Consensus 74 ~A~~~~i~i~~g~~e~--~~~~~~~Nta~~i~~~G~i~~~yrK~hL-p~~~--------------~~~E~~~f~~G~~~~ 136 (284) T cd07573 74 LAKELGVVIPVSLFEK--RGNGLYYNSAVVIDADGSLLGVYRKMHI-PDDP--------------GYYEKFYFTPGDTGF 136 (284) T ss_pred HHHHHCCEEEECCEEE--CCCCCEEEEEEEECCCCCEEEEEEEECC-CCCC--------------CEEEEEECCCCCCCC T ss_conf 9998291899631577--6998179889998689944644872010-6899--------------563664012688652 Q ss_pred CCEEECCCCEEECCCCHHHHHHHHHHCC--CCCCEEEEECCCCCCCC---CH--HHHHH-HHHHHHHHHCCCEEEEEECC Q ss_conf 6200012200102221666558888512--69977999627755767---62--89999-99999999729919998559 Q gi|254780425|r 388 PLLGLSEKLRLYPLLFSDALFHQDINNL--ENVSAILNIIDDSGFMG---SG--TDHSF-RYAQIQAVEIGLPLIRATNN 459 (518) Q Consensus 388 ~~~~~~~~~~~~~~ICyE~~f~~~~r~~--~~a~~lvn~tNd~Wf~~---s~--~~qH~-~~a~~RAiE~~r~~vra~nt 459 (518) +++. .++.++|.+||||.-||+..|.+ +|+++++++|.=+|... .. ...|. ...|.||+||+.+++-+... T Consensus 137 ~v~~-~~~g~iG~~ICyD~~fPE~~r~la~~Ga~iil~PsA~g~~~~~~~~~~~~~~~w~~l~~arA~eN~~yvv~~N~v 215 (284) T cd07573 137 KVFD-TRYGRIGVLICWDQWFPEAARLMALQGAEILFYPTAIGSEPQEPPEGLDQRDAWQRVQRGHAIANGVPVAAVNRV 215 (284) T ss_pred EEEE-ECCCEEEEEEEECCCCCHHHHHHHHCCCCEEEECCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCEEEEECCC T ss_conf 0585-088558754750004739999999869909997841467766655553369999999999887628827730233 Q ss_pred -------------CCCEEECCCCEEEEECCCCCCEEEEEEECCCCCC Q ss_conf -------------7046998998087773899827999996067687 Q gi|254780425|r 460 -------------GVSAFLDERGQIISSVYADRGASIDMHFQPKVRD 493 (518) Q Consensus 460 -------------GiS~~id~~G~i~~~~~~~~~~~l~~~v~~~~~~ 493 (518) |-|.++||+|+++++.+.++++.+.+++-+..-+ T Consensus 216 G~~~~~~~~~~f~G~S~Iv~P~G~vla~~~~~~e~~i~adiDl~~l~ 262 (284) T cd07573 216 GVEGDPGSGITFYGSSFIADPFGEILAQASRDEEEILVAEFDLDEIE 262 (284) T ss_pred CCCCCCCCCCEECCCEEEEECCCCEEEECCCCCCEEEEEEEEHHHHH T ss_conf 35788999840748728993899898734899878999997099999 No 12 >PRK10438 hypothetical protein; Provisional Probab=99.95 E-value=6.9e-27 Score=193.70 Aligned_cols=217 Identities=17% Similarity=0.184 Sum_probs=154.5 Q ss_pred CCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEECCCCCC-CCHH------HHHHHHHHHHHHHCCC Q ss_conf 55321001244203555433335789987543101334668618983121011-0001------1234667765641079 Q gi|254780425|r 241 ISPVIRIVQPGINPAIKEDREKILERYLSLTALPVSAGELEPVIIVWAYLPFP-FSIV------DQPSILKKIASVLKIN 313 (518) Q Consensus 241 ~~~~V~iVQpni~~~~k~~~~~~~~~~~~l~~~~~~~~~~~~dLiVWPEsa~p-~~~~------~~~~~~~~l~~~~~~~ 313 (518) ++++|+++|.++-.. |.+..+++..++.+.. .+.||||+||.... +... ..+...+.+.+.++++ T Consensus 2 ~~mkial~Q~~~~~~---D~~~Nl~~~~~~i~~a-----~gadlivlPElf~tGy~~~~~~~~~~~~~~~~~l~~~A~~~ 73 (256) T PRK10438 2 SGLKITLLQQPLVWM---DGPANLRHFDRQLEGI-----TGRDVIVLPEMFTTGFAMEAAASSLPQDDVVNWMTAKAQQT 73 (256) T ss_pred CCCEEEEEECCCCCC---CHHHHHHHHHHHHHHC-----CCCCEEECCCCCCCCCCCHHHHHHHCCCHHHHHHHHHHHHC T ss_conf 977899995667637---9999999999999745-----49999990875567887215566421758999999999878 Q ss_pred CEEEECCCCCCCCCCCCCEEEEEECCCCCCCCCCCCCCEEEEEHHHHHCCCCHHHHHHHHCCCCCCCCCCCCCCCCEEEC Q ss_conf 82420110012345575121000036732122111256334221331001000012111001222210135877620001 Q gi|254780425|r 314 QLLIVGSMRKELVDGRSHFYKSIYIINSKGEILASSNAKHLVPFAEYLPYRNILKKLNLDFYMFPLDYSSSDISPLLGLS 393 (518) Q Consensus 314 ~~li~G~~~~~~~~~~~~~yNs~~~~~~~g~~~~~Y~K~~LVPFGEyiP~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~ 393 (518) ..+|.|+...+. +++.||++++++|+|+ +..|+|+||-.++ .....|.+|+...++. . T Consensus 74 ~~~i~g~~~~~~---~~~~~N~~~~i~~~G~-v~~YrK~HLf~~~-----------------~E~~~f~~G~~~~v~~-~ 131 (256) T PRK10438 74 NALIAGSVALQT---ESGAVNRFLLVEPGGT-VHFYDKRHLFRMA-----------------DEHLHYKAGNARVIVE-W 131 (256) T ss_pred CEEEEEEEEEEC---CCCCEEEEEEEECCCE-EEEECCEECCCCC-----------------CCEEEECCCCCCEEEE-E T ss_conf 909999998972---9972899999928966-9885224058988-----------------8201587999707999-7 Q ss_pred CCCEEECCCCHHHHHHHHHHCCCCCCEEEEECCCCCCCCCHHHHHH-HHHHHHHHHCCCEEEEEECC----------CCC Q ss_conf 2200102221666558888512699779996277557676289999-99999999729919998559----------704 Q gi|254780425|r 394 EKLRLYPLLFSDALFHQDINNLENVSAILNIIDDSGFMGSGTDHSF-RYAQIQAVEIGLPLIRATNN----------GVS 462 (518) Q Consensus 394 ~~~~~~~~ICyE~~f~~~~r~~~~a~~lvn~tNd~Wf~~s~~~qH~-~~a~~RAiE~~r~~vra~nt----------GiS 462 (518) ++.++|.+||||.-||+..|.++++++++.++| |- .. ...|. .+.|.||+||+-+++-+... |.| T Consensus 132 ~g~~iG~~ICYDlrFPe~~r~l~~~~~~~~~a~--~p-~~-~~~hW~~llrARAiENq~fv~~~N~~G~~~~g~~~~G~S 207 (256) T PRK10438 132 RGWRILPLVCYDLRFPVWSRNLNDYDLALYVAN--WP-AP-RSLHWQTLLTARAIENQAYVAGCNRVGSDGNGHHYRGDS 207 (256) T ss_pred CCCEEEEEEEECCCCHHHHHHHCCCCEEEEECC--CC-CC-CCHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCEEEECE T ss_conf 891379988623468499996475979999667--87-64-117899999999986582899971444589998884130 Q ss_pred EEECCCCEEEEECCCCCCEEEEEEECCCC Q ss_conf 69989980877738998279999960676 Q gi|254780425|r 463 AFLDERGQIISSVYADRGASIDMHFQPKV 491 (518) Q Consensus 463 ~~id~~G~i~~~~~~~~~~~l~~~v~~~~ 491 (518) .+|||+|+|+++.+.++++++.+++-+.. T Consensus 208 ~ivdP~G~vla~a~~~e~~~i~a~id~~~ 236 (256) T PRK10438 208 RIINPQGEIIATADPHQATRIDAELSLVA 236 (256) T ss_pred EEECCCCCEEEECCCCCCEEEEEEECHHH T ss_conf 99949970536478999789999951999 No 13 >cd07582 nitrilase_4 Uncharacterized subgroup of the nitrilase superfamily (putative class 13 nitrilases). The nitrilase superfamily is comprised of nitrile- or amide-hydrolyzing enzymes and amide-condensing enzymes, which depend on a Glu-Lys-Cys catalytic triad. This superfamily has been classified in the literature based on global and structure based sequence analysis into thirteen different enzyme classes (referred to as 1-13). Class 13 represents proteins that at the time were difficult to place in a distinct similarity group; this subgroup represents either a new class or one that was included previously in class 13. Members of this superfamily generally form homomeric complexes, the basic building block of which is a homodimer. Probab=99.95 E-value=4.4e-27 Score=194.96 Aligned_cols=238 Identities=15% Similarity=0.113 Sum_probs=155.1 Q ss_pred CCCCCCH-HHHHHHHHHHHHHHHHHHHCCCCCC--CCCCEEEECCCCCCCCHHH---------------HHHHHHHHHHH Q ss_conf 1244203-5554333357899875431013346--6861898312101100011---------------23466776564 Q gi|254780425|r 248 VQPGINP-AIKEDREKILERYLSLTALPVSAGE--LEPVIIVWAYLPFPFSIVD---------------QPSILKKIASV 309 (518) Q Consensus 248 VQpni~~-~~k~~~~~~~~~~~~l~~~~~~~~~--~~~dLiVWPEsa~p~~~~~---------------~~~~~~~l~~~ 309 (518) +|.-..| +.+-|.+..+++..++.+.+...+. .+.||||+||.+++-+..+ +....+.+.++ T Consensus 6 ~~~~~~~~~~~~D~~aNl~~~~~~I~~Aa~~ga~~~~~~LvvfPE~~~tGy~~~~~~~~~~~~~~a~~~~Gp~~~~l~~~ 85 (294) T cd07582 6 LQPTCEAAEDRADILANIDRINEQIDAAVGFSGPGLPVRLVVLPEYALQGFPMGEPREVWQFDKAAIDIPGPETEALGEK 85 (294) T ss_pred EEEEEEEECCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCEEECCCCCCCCCCCCCCHHHHHHHHHCCCCCCHHHHHHHHH T ss_conf 98666520371009999999999999999768754699689998985547987762244778863557999899999999 Q ss_pred HCCCCEEEECCCCCCCCCCCCCEEEEEECCCCCCCCCCCCCCEEEEEHHHHHCCCCHHHHHHHHCCCCCC-CCCCCCC-- Q ss_conf 1079824201100123455751210000367321221112563342213310010000121110012222-1013587-- Q gi|254780425|r 310 LKINQLLIVGSMRKELVDGRSHFYKSIYIINSKGEILASSNAKHLVPFAEYLPYRNILKKLNLDFYMFPL-DYSSSDI-- 386 (518) Q Consensus 310 ~~~~~~li~G~~~~~~~~~~~~~yNs~~~~~~~g~~~~~Y~K~~LVPFGEyiP~~~~~~~l~~~~~~~~~-~~~~g~~-- 386 (518) +++++..|+++....+.+..+++||++++++|+|+++.+|+|.|| ++.|-.|-.. +.+.... .+..+.. T Consensus 86 Ak~~~i~iv~G~~Er~~~~~g~~yNta~li~~~G~i~~~yrKihl-~~~~~~~~~~-------~~~~~~~~~~g~~~~~~ 157 (294) T cd07582 86 AKELNVYIAANAYERDPDFPGLYFNTAFIIDPSGEIILRYRKMNS-LAAEGSPSPH-------DVWDEYIEVYGYGLDAL 157 (294) T ss_pred HHHCCEEEEEEEEEECCCCCCCEEEEEEEECCCCCEEEEECCCCC-CCCCCCCCCC-------CCCEEEEEEECCCCCCC T ss_conf 998795999756885488899589999999999879999836677-8545678766-------44327899844789865 Q ss_pred CCCEEECCCCEEECCCCHHHHHHHHHHCC--CCCCEEEEECCCCCCCCCHHHHHH-HHHHHHHHHCCCEEEEEECC---- Q ss_conf 76200012200102221666558888512--699779996277557676289999-99999999729919998559---- Q gi|254780425|r 387 SPLLGLSEKLRLYPLLFSDALFHQDINNL--ENVSAILNIIDDSGFMGSGTDHSF-RYAQIQAVEIGLPLIRATNN---- 459 (518) Q Consensus 387 ~~~~~~~~~~~~~~~ICyE~~f~~~~r~~--~~a~~lvn~tNd~Wf~~s~~~qH~-~~a~~RAiE~~r~~vra~nt---- 459 (518) -++. ..+..++|.+||||.-||+..|.+ +|+++++++++ |.+.++ .+|. ...|.||+||+-++|-+ |. T Consensus 158 f~V~-dt~~GriG~~ICyD~~fPE~~R~la~~GA~vi~~paa--~~~~~~-~~~w~~~~raRAieN~~yVv~~-n~~g~~ 232 (294) T cd07582 158 FPVA-DTEIGNLGCLACEEGLYPEVARGLAMNGAEVLLRSSS--EVPSVE-LDPWEIANRARALENLAYVVSA-NSGGIY 232 (294) T ss_pred CEEE-ECCCCEEEEEEECCCCCHHHHHHHHHCCCCEEEECCC--CCCCCC-HHHHHHHHHHHHHHCCCEEEEE-ECCCCC T ss_conf 5567-5366608899704302489999999889979998975--899864-7799999999999739779998-053457 Q ss_pred ----------CCCEEECCCCEEEEECCCC-CCEEEEEEECCCCCCCHHHH Q ss_conf ----------7046998998087773899-82799999606768768786 Q gi|254780425|r 460 ----------GVSAFLDERGQIISSVYAD-RGASIDMHFQPKVRDSFRSS 498 (518) Q Consensus 460 ----------GiS~~id~~G~i~~~~~~~-~~~~l~~~v~~~~~~T~y~~ 498 (518) |-|++|||+|+++++.+.. +++++.+++-+..-.-.-.+ T Consensus 233 ~~~~~~~~~~G~S~IvdP~G~vla~a~~~~~e~iv~AeiDl~~~~~~R~~ 282 (294) T cd07582 233 GSPYPADSFGGGSMIVDYKGRVLAEAGYGPGSMVAGAEIDIEALRRARAR 282 (294) T ss_pred CCCCCCCEEEEEEEEECCCCCEEEECCCCCCCEEEEEEECHHHHHHHHHC T ss_conf 87787746600369987999888888899877899999728999999862 No 14 >cd07575 Xc-1258_like Xanthomonas campestris XC1258 and related proteins, members of the nitrilase superfamily (putative class 13 nitrilases). Uncharacterized subgroup belonging to a larger nitrilase superfamily comprised of nitrile- or amide-hydrolyzing enzymes and amide-condensing enzymes, which depend on a Glu-Lys-Cys catalytic triad. This superfamily has been classified in the literature based on global and structure based sequence analysis into thirteen different enzyme classes (referred to as 1-13), class 13 represents proteins that at the time were difficult to place in a distinct similarity group; this subgroup either represents a new class or one that was included previously in class 13. Members of this superfamily generally form homomeric complexes, the basic building block of which is a homodimer. XC1258 is a homotetramer. Probab=99.95 E-value=5.3e-27 Score=194.43 Aligned_cols=216 Identities=14% Similarity=0.153 Sum_probs=154.7 Q ss_pred CCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEECCCCCCCC-H-------HHHHHHHHHHHHHHCCCC Q ss_conf 32100124420355543333578998754310133466861898312101100-0-------112346677656410798 Q gi|254780425|r 243 PVIRIVQPGINPAIKEDREKILERYLSLTALPVSAGELEPVIIVWAYLPFPFS-I-------VDQPSILKKIASVLKINQ 314 (518) Q Consensus 243 ~~V~iVQpni~~~~k~~~~~~~~~~~~l~~~~~~~~~~~~dLiVWPEsa~p~~-~-------~~~~~~~~~l~~~~~~~~ 314 (518) ++|+++|-+....+ .++.+++..++.+. + ..++||||+||-+..-+ . ..+....+.+.+.+++++ T Consensus 1 mkial~Q~~~~~~d---~e~Nl~~~~~~i~~---a-~~~adlivfPEl~~tGy~~~~~~~ae~~~~~~~~~l~~~A~~~~ 73 (252) T cd07575 1 LKIALIQTDLVWED---PEANLAHFEEKIEQ---L-KEKTDLIVLPEMFTTGFSMNAEALAEPMNGPTLQWMKAQAKKKG 73 (252) T ss_pred CEEEEEECCCCCCC---HHHHHHHHHHHHHH---H-HCCCCEEECCCCCCCCCCCCHHHHCCCCCCHHHHHHHHHHHHCC T ss_conf 98999966788799---99999999999999---8-66999999988655689989688463578889999999997569 Q ss_pred EEEECCCCCCCCCCCCCEEEEEECCCCCCCCCCCCCCEEEEE-HHHHHCCCCHHHHHHHHCCCCCCCCCCCCCCCCEEEC Q ss_conf 242011001234557512100003673212211125633422-1331001000012111001222210135877620001 Q gi|254780425|r 315 LLIVGSMRKELVDGRSHFYKSIYIINSKGEILASSNAKHLVP-FAEYLPYRNILKKLNLDFYMFPLDYSSSDISPLLGLS 393 (518) Q Consensus 315 ~li~G~~~~~~~~~~~~~yNs~~~~~~~g~~~~~Y~K~~LVP-FGEyiP~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~ 393 (518) ..|+|+....+ ++++|||+++++|+|++ ..|+|+||-+ ++| ..-|++|+...++. . T Consensus 74 i~i~g~~~~~~---~~~~yNta~~i~~~G~i-~~YrK~hL~~~~~E------------------~~~f~~G~~~~v~~-~ 130 (252) T cd07575 74 AAITGSLIIKE---GGKYYNRLYFVTPDGEV-YHYDKRHLFRMAGE------------------HKVYTAGNERVIVE-Y 130 (252) T ss_pred CEEEEEEEEEE---CCEEEEEEEEEECCCEE-EEEEEEECCCCCCC------------------CCCCCCCCCCEEEE-E T ss_conf 88999999987---99589999999199669-99802766998886------------------60003898657998-5 Q ss_pred CCCEEECCCCHHHHHHHHHHCCCCCCEEEEECCCCCCCCCHHHHHH-HHHHHHHHHCCCEEEEEECC----------CCC Q ss_conf 2200102221666558888512699779996277557676289999-99999999729919998559----------704 Q gi|254780425|r 394 EKLRLYPLLFSDALFHQDINNLENVSAILNIIDDSGFMGSGTDHSF-RYAQIQAVEIGLPLIRATNN----------GVS 462 (518) Q Consensus 394 ~~~~~~~~ICyE~~f~~~~r~~~~a~~lvn~tNd~Wf~~s~~~qH~-~~a~~RAiE~~r~~vra~nt----------GiS 462 (518) ++.++|.+||||.-||+..|+..++++++.++| | ..+ ..+|. .+.|.||+||+.+++-+... |-| T Consensus 131 ~g~riGl~ICyDl~FPe~~R~~~~~dil~~~a~--~-P~~-~~~~w~~l~~aRA~eN~~~vv~~N~~G~~~~~~~~~G~S 206 (252) T cd07575 131 KGWKILLQVCYDLRFPVWSRNTNDYDLLLYVAN--W-PAP-RRAAWDTLLKARAIENQAYVIGVNRVGTDGNGLEYSGDS 206 (252) T ss_pred CCEEEEEEEECCCCCHHHHHHHCCCCEEEEEEC--C-CCC-CHHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCEEEEEE T ss_conf 896677788646576799875347874798714--8-763-168999999999997497699965666689998884247 Q ss_pred EEECCCCEEEEECCCCCCEEEEEEECCCCCC Q ss_conf 6998998087773899827999996067687 Q gi|254780425|r 463 AFLDERGQIISSVYADRGASIDMHFQPKVRD 493 (518) Q Consensus 463 ~~id~~G~i~~~~~~~~~~~l~~~v~~~~~~ 493 (518) +++||+|+|+.+.+. +++++.+++.+..-. T Consensus 207 ~Iv~P~G~ila~a~~-~e~~~~a~iD~~~i~ 236 (252) T cd07575 207 AVIDPLGEPLAEAEE-DEGVLTATLDKEALQ 236 (252) T ss_pred EEECCCCCEEEECCC-CCEEEEEEECHHHHH T ss_conf 999999778797799-985999996589999 No 15 >cd07568 ML_beta-AS_like mammalian-like beta-alanine synthase (beta-AS) and similar proteins (class 5 nitrilases). This family includes mammalian-like beta-AS (EC 3.5.1.6, also known as beta-ureidopropionase or N-carbamoyl-beta-alanine amidohydrolase). This enzyme catalyzes the third and final step in the catabolic pyrimidine catabolic pathway responsible for the degradation of uracil and thymine, the hydrolysis of N-carbamyl-beta-alanine and N-carbamyl-beta-aminoisobutyrate to the beta-amino acids, beta-alanine and beta-aminoisobutyrate respectively. This family belongs to a larger nitrilase superfamily comprised of nitrile- or amide-hydrolyzing enzymes and amide-condensing enzymes, which depend on a Glu-Lys-Cys catalytic triad. This superfamily has been classified in the literature based on global and structure based sequence analysis into thirteen different enzyme classes (referred to as 1-13), this subgroup corresponds to class 5. Members of this superfamily generally form homomeric Probab=99.95 E-value=8.1e-27 Score=193.24 Aligned_cols=229 Identities=14% Similarity=0.184 Sum_probs=160.5 Q ss_pred CCCCCCCCCC-C-CH--HHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEECCCCCCCC-HHH-------------HHHHH Q ss_conf 5321001244-2-03--55543333578998754310133466861898312101100-011-------------23466 Q gi|254780425|r 242 SPVIRIVQPG-I-NP--AIKEDREKILERYLSLTALPVSAGELEPVIIVWAYLPFPFS-IVD-------------QPSIL 303 (518) Q Consensus 242 ~~~V~iVQpn-i-~~--~~k~~~~~~~~~~~~l~~~~~~~~~~~~dLiVWPEsa~p~~-~~~-------------~~~~~ 303 (518) .++|++||-. + |. -...|.+..+++..++.+ ++.+.++||||+||..++-+ ... +.... T Consensus 3 ~~~~~~~q~~~~~~~~aPv~~d~eanl~k~~~li~---eAa~~Ga~LivfPE~~~tgy~~~~~~~~~~~~a~~~~~gp~~ 79 (287) T cd07568 3 IVRVGLIQASNVIPTDAPIEKQKEAMIQKHVTMIR---EAAEAGAQIVCLQEIFYGPYFCAEQDTKWYEFAEEIPNGPTT 79 (287) T ss_pred EEEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHH---HHHHCCCEEEECCCCCCCCCCCCCCCHHHHHHHCCCCCCHHH T ss_conf 69999991437889888712339999999999999---999885919987888656877654444788861705898899 Q ss_pred HHHHHHHCCCCEEEECCCCCCCCCCCCCEEEEEECCCCCCCCCCCCCCEEEEEHHHHHCCCCHHHHHHHHCCCCCCCCCC Q ss_conf 77656410798242011001234557512100003673212211125633422133100100001211100122221013 Q gi|254780425|r 304 KKIASVLKINQLLIVGSMRKELVDGRSHFYKSIYIINSKGEILASSNAKHLVPFAEYLPYRNILKKLNLDFYMFPLDYSS 383 (518) Q Consensus 304 ~~l~~~~~~~~~li~G~~~~~~~~~~~~~yNs~~~~~~~g~~~~~Y~K~~LVPFGEyiP~~~~~~~l~~~~~~~~~~~~~ 383 (518) +.+.+.++++...++++....+ .++++||++++++++|++++.|+|.||-..+. +....-|.+ T Consensus 80 ~~l~~~Ar~~~i~iv~g~~e~~--~~g~lyNt~~~i~~~G~ii~~yrK~hlp~~~~---------------~~E~~~f~~ 142 (287) T cd07568 80 KRFAALAKEYNMVLILPIYEKE--QGGTLYNTAAVIDADGTYLGKYRKNHIPHVGG---------------FWEKFYFRP 142 (287) T ss_pred HHHHHHHHHCCCEEEEEEEEEC--CCCCEEEEEEEECCCCCEEEEECCCCCCCCCC---------------CCEEEEECC T ss_conf 9999999985928996357876--99968999999999997966880134979998---------------470035768 Q ss_pred CCCCCCEEECCCCEEECCCCHHHHHHHHHHCC--CCCCEEEEECCCCCCCCCHHHHHHHHHHHHHHHCCCEEEEEEC--- Q ss_conf 58776200012200102221666558888512--6997799962775576762899999999999972991999855--- Q gi|254780425|r 384 SDISPLLGLSEKLRLYPLLFSDALFHQDINNL--ENVSAILNIIDDSGFMGSGTDHSFRYAQIQAVEIGLPLIRATN--- 458 (518) Q Consensus 384 g~~~~~~~~~~~~~~~~~ICyE~~f~~~~r~~--~~a~~lvn~tNd~Wf~~s~~~qH~~~a~~RAiE~~r~~vra~n--- 458 (518) |+....+...+..++|.+||||.-||+..|.+ +|+++++++++ |..+.+.++.-...|.||+||+-+++-+.. T Consensus 143 G~~~~~Vf~t~~g~iG~lICyD~~fPE~~r~la~~Ga~ii~~Psa--~~~~~~~~~w~~~~~arA~en~~yv~~~n~~G~ 220 (287) T cd07568 143 GNLGYPVFDTAFGKIGVYICYDRHFPEGWRALGLNGAEIVFNPSA--TVAGLSEYLWKLEQPAAAVANGYFVGAINRVGT 220 (287) T ss_pred CCCCCEEEEECCEEEEEEECCCCCCHHHHHHHHHCCCCEEEECCC--CCCCCCHHHHHHHHHHHHHHCCCEEEEECCCCC T ss_conf 898845999799069886512325729999999879979983777--777755045778988888875936999545566 Q ss_pred ---------CCCCEEECCCCEEEEECCCCCCEEEEEEECCCCC Q ss_conf ---------9704699899808777389982799999606768 Q gi|254780425|r 459 ---------NGVSAFLDERGQIISSVYADRGASIDMHFQPKVR 492 (518) Q Consensus 459 ---------tGiS~~id~~G~i~~~~~~~~~~~l~~~v~~~~~ 492 (518) .|-|.+|||+|+++++.+.++++++.+++-+..- T Consensus 221 e~~~~~~~~~G~S~Ii~P~G~via~~~~~~e~ii~adiDl~~~ 263 (287) T cd07568 221 EAPWNIGEFYGSSYFVDPRGQFVASASRDKDELLVAELDLDLI 263 (287) T ss_pred CCCCCCCCCCCCEEEECCCCCEEEECCCCCCEEEEEEECHHHH T ss_conf 6788778563750898899988865379988899999658999 No 16 >cd07564 nitrilases_CHs Nitrilases, cyanide hydratase (CH)s, and similar proteins (class 1 nitrilases). Nitrilases (nitrile aminohydrolases, EC:3.5.5.1) hydrolyze nitriles (RCN) to ammonia and the corresponding carboxylic acid. Most nitrilases prefer aromatic nitriles, some prefer arylacetonitriles and others aliphatic nitriles. This group includes the nitrilase cyanide dihydratase (CDH), which hydrolyzes inorganic cyanide (HCN) to produce formate. It also includes cyanide hydratase (CH), which hydrolyzes HCN to formamide. This group includes four Arabidopsis thaliana nitrilases (Ath)NIT1-4. AthNIT1-3 have a strong substrate preference for phenylpropionitrile (PPN) and other nitriles which may originate from the breakdown of glucosinolates. The product of PPN hydrolysis, phenylacetic acid has auxin activity. AthNIT1-3 can also convert indoacetonitrile to indole-3-acetic acid (IAA, auxin), but with a lower affinity and velocity. From their expression patterns, it has been speculated that Probab=99.95 E-value=1.9e-26 Score=190.77 Aligned_cols=219 Identities=16% Similarity=0.131 Sum_probs=151.4 Q ss_pred CCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEECCCCCCCCH--------HH---------------H Q ss_conf 321001244203555433335789987543101334668618983121011000--------11---------------2 Q gi|254780425|r 243 PVIRIVQPGINPAIKEDREKILERYLSLTALPVSAGELEPVIIVWAYLPFPFSI--------VD---------------Q 299 (518) Q Consensus 243 ~~V~iVQpni~~~~k~~~~~~~~~~~~l~~~~~~~~~~~~dLiVWPEsa~p~~~--------~~---------------~ 299 (518) ++|+++|-+.-. .|.+..+++..++.++ +.+.++||||+||..++-+. .. + T Consensus 1 mkVA~vQ~~~~~---~d~~~Nl~~~~~~i~~---Aa~~ga~lvvfPE~~~tGYp~~~~~~~~~~~~~~~~~~~~~a~~~~ 74 (297) T cd07564 1 VKVAAVQAAPVF---LDLAATVEKACRLIEE---AAANGAQLVVFPEAFIPGYPYWIWFGAPAEGRELFARYYENSVEVD 74 (297) T ss_pred CEEEEEECCCCC---CCHHHHHHHHHHHHHH---HHHCCCEEEECCCCCCCCCCCHHHCCCCCCCHHHHHHHHHHCCCCC T ss_conf 989999577877---9999999999999999---9988995999589986578724431684003799999997464699 Q ss_pred HHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCEEEEEECCCCCCCCCCCCCCEEEEEHHHHHCCCCHHHHHHHHCCCCCC Q ss_conf 34667765641079824201100123455751210000367321221112563342213310010000121110012222 Q gi|254780425|r 300 PSILKKIASVLKINQLLIVGSMRKELVDGRSHFYKSIYIINSKGEILASSNAKHLVPFAEYLPYRNILKKLNLDFYMFPL 379 (518) Q Consensus 300 ~~~~~~l~~~~~~~~~li~G~~~~~~~~~~~~~yNs~~~~~~~g~~~~~Y~K~~LVPFGEyiP~~~~~~~l~~~~~~~~~ 379 (518) ....+.+.+.++++...|+++.... ++.++|||+++++++|+++.+|+|.|+- +.| .. T Consensus 75 g~~~~~l~~~A~~~~i~iv~G~~E~---~~~~~yNtav~i~~~G~i~~~yrKi~p~-~~E------------------~~ 132 (297) T cd07564 75 GPELERLAEAARENGIYVVLGVSER---DGGTLYNTQLLIDPDGELLGKHRKLKPT-HAE------------------RL 132 (297) T ss_pred CHHHHHHHHHHHHCCCEEEEEEEEC---CCCCEEEEEEEECCCCCEEEEECCCCCC-CCC------------------EE T ss_conf 9899999999998295999841663---6995686799987999589896576778-662------------------24 Q ss_pred CCCCCCC--CCCEEECCCCEEECCCCHHHHHHHHHHCC--CCCCEEEEECCCCCCCCCHHHHHHHHHHHHHHHCCCEEEE Q ss_conf 1013587--76200012200102221666558888512--6997799962775576762899999999999972991999 Q gi|254780425|r 380 DYSSSDI--SPLLGLSEKLRLYPLLFSDALFHQDINNL--ENVSAILNIIDDSGFMGSGTDHSFRYAQIQAVEIGLPLIR 455 (518) Q Consensus 380 ~~~~g~~--~~~~~~~~~~~~~~~ICyE~~f~~~~r~~--~~a~~lvn~tNd~Wf~~s~~~qH~~~a~~RAiE~~r~~vr 455 (518) -|++|+. .+++. .+..++|.+||||.-||+..|.+ +|+++++..+..+|-...+..+....+|.||+||+-+++- T Consensus 133 ~f~~G~g~~~~v~~-t~~griG~~ICyD~~fP~~~~~l~~~ga~i~va~~P~~~~~~~~~~~w~~l~rarAiEnq~fVva 211 (297) T cd07564 133 VWGQGDGSGLRVVD-TPIGRLGALICWENYMPLARYALYAQGEQIHVAPWPDFSPYYLSREAWLAASRHYALEGRCFVLS 211 (297) T ss_pred EECCCCCCCCEEEC-CCCEEEEEEECCCCHHHHHHHHHHHCCCEEEEECCCCCCCCCCCHHHHHHHHHHHHHHCCCEEEE T ss_conf 66357887745752-58716887862541069999999858980999737864467787899999999999855958999 Q ss_pred EECC------------------------CCCEEECCCCEEEEECCCCCCEEEEEEECCC Q ss_conf 8559------------------------7046998998087773899827999996067 Q gi|254780425|r 456 ATNN------------------------GVSAFLDERGQIISSVYADRGASIDMHFQPK 490 (518) Q Consensus 456 a~nt------------------------GiS~~id~~G~i~~~~~~~~~~~l~~~v~~~ 490 (518) +... |-|++|||+|+|+++...++++.+.+++-+. T Consensus 212 ~n~~g~~~~~~~~~~~~~~~~~~~~~~~G~S~Ii~P~G~vla~~~~~~e~ii~adiDl~ 270 (297) T cd07564 212 ACQVVTEEDIPADCEDDEEADPLEVLGGGGSAIVGPDGEVLAGPLPDEEGILYADIDLD 270 (297) T ss_pred ECCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEECCCCCEEECCCCCCCEEEEEEECHH T ss_conf 65313633332003565445567764375589998999786564799974999995699 No 17 >cd07576 R-amidase_like Pseudomonas sp. MCI3434 R-amidase and related proteins (putative class 13 nitrilases). Pseudomonas sp. MCI3434 R-amidase hydrolyzes (R,S)-piperazine-2-tert-butylcarboxamide to form (R)-piperazine-2-carboxylic acid. It does so with strict R-stereoselectively. Its preferred substrates are carboxamide compounds which have the amino or imino group connected to their beta- or gamma-carbon. This subgroup belongs to a larger nitrilase superfamily comprised of nitrile- or amide-hydrolyzing enzymes and amide-condensing enzymes, which depend on a Glu-Lys-Cys catalytic triad. This superfamily has been classified in the literature based on global and structure based sequence analysis into thirteen different enzyme classes (referred to as 1-13), class 13 represents proteins that at the time were difficult to place in a distinct similarity group. It has been suggested that this subgroup represents a new class. Members of the nitrilase superfamily generally form homomeric compl Probab=99.95 E-value=1.9e-26 Score=190.77 Aligned_cols=215 Identities=17% Similarity=0.203 Sum_probs=153.4 Q ss_pred CCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEECCCCCCCCHH----------HHHHHHHHHHHHHCCC Q ss_conf 210012442035554333357899875431013346686189831210110001----------1234667765641079 Q gi|254780425|r 244 VIRIVQPGINPAIKEDREKILERYLSLTALPVSAGELEPVIIVWAYLPFPFSIV----------DQPSILKKIASVLKIN 313 (518) Q Consensus 244 ~V~iVQpni~~~~k~~~~~~~~~~~~l~~~~~~~~~~~~dLiVWPEsa~p~~~~----------~~~~~~~~l~~~~~~~ 313 (518) +|+++|-|.-.. |.+..+++..++.+. +.+.++||||+||.+++-+.. .+....+.+.++++++ T Consensus 1 kiA~~Q~~~~~g---D~~~N~~~~~~~i~~---A~~~ga~lvvfPE~~l~Gy~~~~~~~~~a~~~~~~~~~~l~~~A~~~ 74 (254) T cd07576 1 RLALYQGPARDG---DVAANLARLDEAAAR---AAAAGADLLVFPELFLTGYNIGDAVARLAEPADGPALQALRAIARRH 74 (254) T ss_pred CEEEEECCCCCC---CHHHHHHHHHHHHHH---HHHCCCEEEECCCCCCCCCCCCCHHHHHHCCCCCHHHHHHHHHHHHC T ss_conf 999998999889---999999999999999---99886979994888635898661234431147869999999999972 Q ss_pred CE-EEECCCCCCCCCCCCCEEEEEECCCCCCCCCCCCCCEEEEEHHHHHCCCCHHHHHHHHCCCCCCCCCCCCCCCCEEE Q ss_conf 82-42011001234557512100003673212211125633422133100100001211100122221013587762000 Q gi|254780425|r 314 QL-LIVGSMRKELVDGRSHFYKSIYIINSKGEILASSNAKHLVPFAEYLPYRNILKKLNLDFYMFPLDYSSSDISPLLGL 392 (518) Q Consensus 314 ~~-li~G~~~~~~~~~~~~~yNs~~~~~~~g~~~~~Y~K~~LVPFGEyiP~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~ 392 (518) .. +++|.... +++++|||+++++++|++..+|+|+||-+..| ..-|.+|+..+++. T Consensus 75 ~i~iv~G~~e~----~~~~~yNs~~~i~~~G~i~~~y~K~hL~~~~E------------------~~~f~~G~~~~vf~- 131 (254) T cd07576 75 GIAIVVGYPER----AGGAVYNAAVLIDEDGTVLANYRKTHLFGDSE------------------RAAFTPGDRFPVVE- 131 (254) T ss_pred CCEEEEEEEEE----ECCEEEEEEEEECCCCEEEEEECCCCCCCCCC------------------CEEECCCCCCEEEE- T ss_conf 98699989997----59978899999928983988571444688663------------------05883898634668- Q ss_pred CCCCEEECCCCHHHHHHHHHHCC--CCCCEEEEECCCCCCCCCHHHHHHHHHHHHHHHCCCEEEEEEC---------CCC Q ss_conf 12200102221666558888512--6997799962775576762899999999999972991999855---------970 Q gi|254780425|r 393 SEKLRLYPLLFSDALFHQDINNL--ENVSAILNIIDDSGFMGSGTDHSFRYAQIQAVEIGLPLIRATN---------NGV 461 (518) Q Consensus 393 ~~~~~~~~~ICyE~~f~~~~r~~--~~a~~lvn~tNd~Wf~~s~~~qH~~~a~~RAiE~~r~~vra~n---------tGi 461 (518) .++.++|.+||||.-||+..|.+ +|+++++++|+ |-...+ ..|-.+.+.||+||+.+++.+.. .|- T Consensus 132 ~~~~~iG~~IC~D~~fpe~~r~la~~Ga~ii~~psa--~~~~~~-~~~~~~~~arA~en~~~vv~~n~~G~~~~~~~~G~ 208 (254) T cd07576 132 LRGLRVGLLICYDVEFPELVRALALAGADLVLVPTA--LMEPYG-FVARTLVPARAFENQIFVAYANRCGAEDGLTYVGL 208 (254) T ss_pred ECCEEEEEEEECCCCCCHHHHHHHHCCCCEEEECCC--CCCCHH-HHHHHHHHHHHHHCCCEEEEEECCCCCCCEEECCC T ss_conf 898799988861543879999999869979998878--888520-69999999999982994999966415898077287 Q ss_pred CEEECCCCEEEEECCCCCCEEEEEEECCCC Q ss_conf 469989980877738998279999960676 Q gi|254780425|r 462 SAFLDERGQIISSVYADRGASIDMHFQPKV 491 (518) Q Consensus 462 S~~id~~G~i~~~~~~~~~~~l~~~v~~~~ 491 (518) |+++||+|+++++.+. +++++.+++-+.. T Consensus 209 S~ii~P~G~ila~~~~-~e~~~~adiD~~~ 237 (254) T cd07576 209 SSIAGPDGTVLARAGR-GEALLVADLDPAA 237 (254) T ss_pred EEEEECCCEEEEECCC-CCEEEEEEECHHH T ss_conf 6999189829876699-8869999971999 No 18 >cd07569 DCase N-carbamyl-D-amino acid amidohydrolase (DCase, class 6 nitrilases). DCase hydrolyses N-carbamyl-D-amino acids to produce D-amino acids. It is an important biocatalyst in the pharmaceutical industry, producing useful D-amino acids for example in the preparation of beta-lactam antibiotics. This subgroup belongs to a larger nitrilase superfamily comprised of nitrile- or amide-hydrolyzing enzymes and amide-condensing enzymes, which depend on a Glu-Lys-Cys catalytic triad. This superfamily has been classified in the literature based on global and structure based sequence analysis into thirteen different enzyme classes (referred to as 1-13), this subgroup corresponds to class 6. Members of this superfamily generally form homomeric complexes, the basic building block of which is a homodimer. Agrobacterium radiobacter DCase forms a tetramer (dimer of dimers). Some DCases may form trimers. Probab=99.95 E-value=1.9e-26 Score=190.87 Aligned_cols=238 Identities=15% Similarity=0.113 Sum_probs=160.7 Q ss_pred CCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEECCCCCCC----CH-HHH------------HHHH Q ss_conf 553210012442035554333357899875431013346686189831210110----00-112------------3466 Q gi|254780425|r 241 ISPVIRIVQPGINPAIKEDREKILERYLSLTALPVSAGELEPVIIVWAYLPFPF----SI-VDQ------------PSIL 303 (518) Q Consensus 241 ~~~~V~iVQpni~~~~k~~~~~~~~~~~~l~~~~~~~~~~~~dLiVWPEsa~p~----~~-~~~------------~~~~ 303 (518) .+++|+++|=+--.. ..+++..+++..++.+ ++.+.++||||+||.++.- +. .+. .... T Consensus 2 ~~~rvA~~Q~~p~~~-~~~~e~nl~k~~~~i~---~Aa~~GA~LvvfPE~~~tgy~p~~~~~~~~~~~~~~e~~~~gp~~ 77 (302) T cd07569 2 RQVILAAAQMGPIAR-AETRESVVARLIALLE---EAASRGAQLVVFPELALTTFFPRWYFPDEAELDSFFETEMPNPET 77 (302) T ss_pred CEEEEEEEEEECCCC-CCCHHHHHHHHHHHHH---HHHHCCCEEEECCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCHHH T ss_conf 278999990026778-6459999999999999---999880999987899766887443458999999766431899899 Q ss_pred HHHHHHHCCCCEEEE-CCCCCCCCCCCCCEEEEEECCCCCCCCCCCCCCEEEEEHHHHHCCCCHHHHHHHHCCCCCCCCC Q ss_conf 776564107982420-1100123455751210000367321221112563342213310010000121110012222101 Q gi|254780425|r 304 KKIASVLKINQLLIV-GSMRKELVDGRSHFYKSIYIINSKGEILASSNAKHLVPFAEYLPYRNILKKLNLDFYMFPLDYS 382 (518) Q Consensus 304 ~~l~~~~~~~~~li~-G~~~~~~~~~~~~~yNs~~~~~~~g~~~~~Y~K~~LVPFGEyiP~~~~~~~l~~~~~~~~~~~~ 382 (518) +.+.+.++++...++ |.......++..++||++++++++|+++.+|+|.||--++|+.++..+- .....-|. T Consensus 78 ~~l~~~A~~~~i~v~~G~~e~~~~~~~~~~yNt~~~i~~~G~il~~yrK~HLp~~~~~~~~~~~~-------~~E~~~f~ 150 (302) T cd07569 78 QPLFDRAKELGIGFYLGYAELTEDGGVKRRFNTSILVDKSGKIVGKYRKVHLPGHKEPEPYRPFQ-------HLEKRYFE 150 (302) T ss_pred HHHHHHHHHCCCEEEECCCCCCCCCCCCCEEEEEEEECCCCCEEEEEECCCCCCCCCCCCCCCCC-------CCEEEEEC T ss_conf 99999999849899965641345567864277899998998499888701178865545787644-------33146765 Q ss_pred CCCCCCCEEECCCCEEECCCCHHHHHHHHHHCC--CCCCEEEEECCC-----CCCCCC-HHHHH-HHHHHHHHHHCCCEE Q ss_conf 358776200012200102221666558888512--699779996277-----557676-28999-999999999729919 Q gi|254780425|r 383 SSDISPLLGLSEKLRLYPLLFSDALFHQDINNL--ENVSAILNIIDD-----SGFMGS-GTDHS-FRYAQIQAVEIGLPL 453 (518) Q Consensus 383 ~g~~~~~~~~~~~~~~~~~ICyE~~f~~~~r~~--~~a~~lvn~tNd-----~Wf~~s-~~~qH-~~~a~~RAiE~~r~~ 453 (518) +|+..-.+...++.++|.+||||.-||+..|.+ +||++++...|. .|.... ...+| ....+.||+||+.++ T Consensus 151 ~Gd~~~~V~~t~~griG~~ICyD~~fPE~~r~la~~GA~li~~~~~~p~~~~~~~~~~~~~~~~~~l~~~arA~en~~~v 230 (302) T cd07569 151 PGDLGFPVFRVPGGIMGMCICNDRRWPETWRVMGLQGVELVLLGYNTPTHNPPAPEHDHLRLFHNLLSMQAGAYQNGTWV 230 (302) T ss_pred CCCCCCEEEEECCEEEEEEEECCCCCHHHHHHHHHCCCCEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCEE T ss_conf 89876404763883799986046567499999997799599962677666888664356789999999863255347717 Q ss_pred EEEECC---------CCCEEECCCCEEEEECCCCCCEEEEEEECC Q ss_conf 998559---------704699899808777389982799999606 Q gi|254780425|r 454 IRATNN---------GVSAFLDERGQIISSVYADRGASIDMHFQP 489 (518) Q Consensus 454 vra~nt---------GiS~~id~~G~i~~~~~~~~~~~l~~~v~~ 489 (518) +-+... |-|.+|||+|+++++.+..+++++.+++-+ T Consensus 231 v~~n~~G~~~~~~~~G~S~Ii~P~G~vlaea~~~~e~ll~AdlDl 275 (302) T cd07569 231 VAAAKAGMEDGCDLIGGSCIVAPTGEIVAQATTLEDEVIVADCDL 275 (302) T ss_pred EECCCCCCCCCCEEECCCEEECCCCCEEEECCCCCCEEEEEEECH T ss_conf 850456677897376341899899998760488998699999569 No 19 >cd07586 nitrilase_8 Uncharacterized subgroup of the nitrilase superfamily (putative class 13 nitrilases). The nitrilase superfamily is comprised of nitrile- or amide-hydrolyzing enzymes and amide-condensing enzymes, which depend on a Glu-Lys-Cys catalytic triad. This superfamily has been classified in the literature based on global and structure based sequence analysis into thirteen different enzyme classes (referred to as 1-13). Class 13 represents proteins that at the time were difficult to place in a distinct similarity group; this subgroup represents either a new class or one that was included previously in class 13. Members of this superfamily generally form homomeric complexes, the basic building block of which is a homodimer. Probab=99.95 E-value=5.6e-26 Score=187.73 Aligned_cols=221 Identities=14% Similarity=0.118 Sum_probs=155.7 Q ss_pred CCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEECCCCCCCCHHHH----------HHHHHHHHHHHCCC Q ss_conf 21001244203555433335789987543101334668618983121011000112----------34667765641079 Q gi|254780425|r 244 VIRIVQPGINPAIKEDREKILERYLSLTALPVSAGELEPVIIVWAYLPFPFSIVDQ----------PSILKKIASVLKIN 313 (518) Q Consensus 244 ~V~iVQpni~~~~k~~~~~~~~~~~~l~~~~~~~~~~~~dLiVWPEsa~p~~~~~~----------~~~~~~l~~~~~~~ 313 (518) +|+++|-|.-.. |.+..+++..++.+. +.+.++||||+||.+++-+...+ ....+.+.+.++ + T Consensus 1 kiA~~Q~~~~~g---D~~~Nl~~~~~~i~~---A~~~ga~livfPE~~~tGy~~~d~~~~~a~~~~~~~~~~l~~~a~-~ 73 (269) T cd07586 1 RVAIAQIDPVLG---DVEENLEKHLEIIET---ARERGADLVVFPELSLTGYNLGDLVYEVAMHADDPRLQALAEASG-G 73 (269) T ss_pred CEEEEECCCCCC---CHHHHHHHHHHHHHH---HHHCCCEEEECCCCCCCCCCHHHHHHHHHCCCCHHHHHHHHHHCC-C T ss_conf 999998999768---999999999999999---998819699988894017985676555430476799999999879-9 Q ss_pred CEEEECCCCCCCCCCCCCEEEEEECCCCCCCCCCCCCCEEEEEHHHHHCCCCHHHHHHHHCCCCCCCCCCCCCCCCEEEC Q ss_conf 82420110012345575121000036732122111256334221331001000012111001222210135877620001 Q gi|254780425|r 314 QLLIVGSMRKELVDGRSHFYKSIYIINSKGEILASSNAKHLVPFAEYLPYRNILKKLNLDFYMFPLDYSSSDISPLLGLS 393 (518) Q Consensus 314 ~~li~G~~~~~~~~~~~~~yNs~~~~~~~g~~~~~Y~K~~LVPFGEyiP~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~ 393 (518) ..+++|..... ++.++||+++++ ++|+++.+|+|.||-.+|. +....-|++|+..+++. . T Consensus 74 ~~iv~G~~~~~---~~~~~~Ns~~~~-~~G~i~~~yrK~hLp~~~~---------------~~E~~~f~~G~~~~v~~-~ 133 (269) T cd07586 74 ICVVFGFVEEG---RDGRFYNSAAYL-EDGRVVHVHRKVYLPTYGL---------------FEEGRYFAPGSHLRAFD-T 133 (269) T ss_pred CEEEEEEEEEC---CCCCEEEEEEEE-ECCEEEEEEEEEECCCCCC---------------CCCCCEECCCCCCEEEE-E T ss_conf 83999968987---999199999999-7992999971796599997---------------16221113999506899-7 Q ss_pred CCCEEECCCCHHHHHHHHHHCC--CCCCEEEEECCCCCCCC--C-H-HHHHHHHHHHHHHHCCCEEEEEECC-------- Q ss_conf 2200102221666558888512--69977999627755767--6-2-8999999999999729919998559-------- Q gi|254780425|r 394 EKLRLYPLLFSDALFHQDINNL--ENVSAILNIIDDSGFMG--S-G-TDHSFRYAQIQAVEIGLPLIRATNN-------- 459 (518) Q Consensus 394 ~~~~~~~~ICyE~~f~~~~r~~--~~a~~lvn~tNd~Wf~~--s-~-~~qH~~~a~~RAiE~~r~~vra~nt-------- 459 (518) ++.++|.+||||.-||+..|.+ +||+++++++|-.+-.. . . +.++..+.|.||+||+.+++-+... T Consensus 134 ~~~~iG~~ICyD~~fPe~~r~la~~Ga~ii~~paa~p~~~~~~~~~~~~~w~~l~~arA~en~~~vv~~N~~G~~~~~~~ 213 (269) T cd07586 134 RFGRAGVLICEDAWHPSLPYLLALDGADVIFIPANSPARGVGGDFDNEENWETLLKFYAMMNGVYVVFANRVGVEDGVYF 213 (269) T ss_pred CCEEEEEEEECCCCCHHHHHHHHHCCCCEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCEEEEECCCCCCCCEEE T ss_conf 89089987750104769999999879989999368865366664467999999999999970978999866227899079 Q ss_pred -CCCEEECCCCEEEEECCCCCCEEEEEEECCCC Q ss_conf -70469989980877738998279999960676 Q gi|254780425|r 460 -GVSAFLDERGQIISSVYADRGASIDMHFQPKV 491 (518) Q Consensus 460 -GiS~~id~~G~i~~~~~~~~~~~l~~~v~~~~ 491 (518) |-|++|||+|+++++.+.++++++.+++-+.. T Consensus 214 ~G~S~ii~P~G~ila~~~~~~e~vi~adiDl~~ 246 (269) T cd07586 214 WGGSRVVDPDGEVVAEAPLFEEDLLVAELDRSA 246 (269) T ss_pred ECCCEEECCCCCEEEECCCCCCEEEEEEEEHHH T ss_conf 564489979983988758999879999975599 No 20 >cd07584 nitrilase_6 Uncharacterized subgroup of the nitrilase superfamily (putative class 13 nitrilases). The nitrilase superfamily is comprised of nitrile- or amide-hydrolyzing enzymes and amide-condensing enzymes, which depend on a Glu-Lys-Cys catalytic triad. This superfamily has been classified in the literature based on global and structure based sequence analysis into thirteen different enzyme classes (referred to as 1-13). Class 13 represents proteins that at the time were difficult to place in a distinct similarity group; this subgroup represents either a new class or one that was included previously in class 13. Members of this superfamily generally form homomeric complexes, the basic building block of which is a homodimer. Probab=99.95 E-value=3.5e-26 Score=189.04 Aligned_cols=219 Identities=16% Similarity=0.119 Sum_probs=156.0 Q ss_pred CCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEECCCCCCCCHHH-------------HHHHHHHHHHHH Q ss_conf 2100124420355543333578998754310133466861898312101100011-------------234667765641 Q gi|254780425|r 244 VIRIVQPGINPAIKEDREKILERYLSLTALPVSAGELEPVIIVWAYLPFPFSIVD-------------QPSILKKIASVL 310 (518) Q Consensus 244 ~V~iVQpni~~~~k~~~~~~~~~~~~l~~~~~~~~~~~~dLiVWPEsa~p~~~~~-------------~~~~~~~l~~~~ 310 (518) +|+++|-|.-.. |.+..+++..++.+ ++.+.++||||+||.+++-+..+ .....+.+.+.+ T Consensus 1 riA~~Q~~~~~g---D~~~N~~~~~~~i~---~A~~~gadlvvfPEl~ltGy~~~~~~~~~~~~a~~~~~~~~~~l~~~a 74 (258) T cd07584 1 KVALIQMDSVLG---DVKANLKKAAELCK---EAAAEGADLICFPELATTGYRPDLLGPKLWELSEPIDGPTVRLFSELA 74 (258) T ss_pred CEEEEECCCCCC---CHHHHHHHHHHHHH---HHHHCCCEEEECCCCCCCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHH T ss_conf 999998989878---99999999999999---999887949995877424799578512489876407849999999998 Q ss_pred CCCCEEEECCCCCCCCCCCCCEEEEEECCCCCCCCCCCCCCEEEEEHHHHHCCCCHHHHHHHHCCCCCCCCCCCCCCCCE Q ss_conf 07982420110012345575121000036732122111256334221331001000012111001222210135877620 Q gi|254780425|r 311 KINQLLIVGSMRKELVDGRSHFYKSIYIINSKGEILASSNAKHLVPFAEYLPYRNILKKLNLDFYMFPLDYSSSDISPLL 390 (518) Q Consensus 311 ~~~~~li~G~~~~~~~~~~~~~yNs~~~~~~~g~~~~~Y~K~~LVPFGEyiP~~~~~~~l~~~~~~~~~~~~~g~~~~~~ 390 (518) +++...++++...+ .+..+++|||+++++++|++..+|+|+||.. .| ..-|.+|+..+++ T Consensus 75 ~~~~i~iv~g~~e~-~~~~~~~yNs~~~i~~~G~i~~~yrK~hl~~-~E------------------~~~f~~G~~~~v~ 134 (258) T cd07584 75 KELGVYIVCGFVEK-GGVPGKVYNSAVVIDPEGESLGVYRKIHLWG-LE------------------KQYFREGEQYPVF 134 (258) T ss_pred HHCCCEEEEECCEE-CCCCCCEEEEEEEECCCCCEEEEEECCCCCC-CC------------------CCCCCCCCCCEEE T ss_conf 87093899940314-4899958888999948983898797576676-52------------------1235789731057 Q ss_pred EECCCCEEECCCCHHHHHHHHHHCC--CCCCEEEEECCCCCCCCCHHHHHHHHHHHHHHHCCCEEEEEEC---------C Q ss_conf 0012200102221666558888512--6997799962775576762899999999999972991999855---------9 Q gi|254780425|r 391 GLSEKLRLYPLLFSDALFHQDINNL--ENVSAILNIIDDSGFMGSGTDHSFRYAQIQAVEIGLPLIRATN---------N 459 (518) Q Consensus 391 ~~~~~~~~~~~ICyE~~f~~~~r~~--~~a~~lvn~tNd~Wf~~s~~~qH~~~a~~RAiE~~r~~vra~n---------t 459 (518) . .++.++|.+||||.-||+..|.+ +|+++++++|+ |- .....++..+.+.||+||+.+++-+.. . T Consensus 135 ~-~~~gkiG~~IC~D~~fPe~~r~la~~Gadii~~psa--~~-~~~~~~~~~~~~arA~en~~~vv~~N~vG~~~~~~~~ 210 (258) T cd07584 135 D-TPFGKIGVMICYDMGFPEVARILTLKGAEVIFCPSA--WR-EQDADIWDINLPARALENTVFVAAVNRVGNEGDLVLF 210 (258) T ss_pred E-ECCCEEEEEEECCCCCCHHHHHHHHCCCCEEEECCC--CC-CCCCHHHHHHHHHHHHHCCCEEEEEEEECCCCCEEEE T ss_conf 6-068467778871678717999999869989998788--88-8643799999999999869948998101158990783 Q ss_pred CCCEEECCCCEEEEECCCCCCEEEEEEECCCCC Q ss_conf 704699899808777389982799999606768 Q gi|254780425|r 460 GVSAFLDERGQIISSVYADRGASIDMHFQPKVR 492 (518) Q Consensus 460 GiS~~id~~G~i~~~~~~~~~~~l~~~v~~~~~ 492 (518) |-|+++||+|+++.+.+.++++.+.+++-+..- T Consensus 211 G~S~iv~p~G~~la~~~~~~e~l~~adiDl~~~ 243 (258) T cd07584 211 GKSKILNPRGQVLAEASEEAEEILYAEIDLDAI 243 (258) T ss_pred EEEEEECCCCEEEEECCCCCCEEEEEEECHHHH T ss_conf 114899899909675389998899999639999 No 21 >TIGR03381 agmatine_aguB N-carbamoylputrescine amidase. Members of this family are N-carbamoylputrescine amidase (3.5.1.53). Bacterial genes are designated AguB. The AguAB pathway replaces SpeB for conversion of agmatine to putrescine in two steps rather than one. Probab=99.94 E-value=5e-26 Score=188.06 Aligned_cols=225 Identities=16% Similarity=0.158 Sum_probs=157.3 Q ss_pred CCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEECCCCCCCCH--H------------HHHHHHHHHHH Q ss_conf 321001244203555433335789987543101334668618983121011000--1------------12346677656 Q gi|254780425|r 243 PVIRIVQPGINPAIKEDREKILERYLSLTALPVSAGELEPVIIVWAYLPFPFSI--V------------DQPSILKKIAS 308 (518) Q Consensus 243 ~~V~iVQpni~~~~k~~~~~~~~~~~~l~~~~~~~~~~~~dLiVWPEsa~p~~~--~------------~~~~~~~~l~~ 308 (518) ++|+++|-+.. +|.+..+++..++.+ ++.+.++||||.||-....+. . +.....+.+.+ T Consensus 1 mkvA~vQ~~~~----~d~~~Nl~~~~~~i~---~A~~~ga~livfPEl~~tgy~~~~~~~~~~~~a~~~~~~~~~~~l~~ 73 (279) T TIGR03381 1 VTVAALQMACS----DDVETNIARAERLVR---EAAARGAQIILLPELFEGPYFCKDQDEDYFALAQPVEGHPAIKRFQA 73 (279) T ss_pred CEEEEEECCCC----CCHHHHHHHHHHHHH---HHHHCCCEEEECCCCCCCCCCCCCCCHHHHHHHCCCCCCHHHHHHHH T ss_conf 98999945489----999999999999999---99988498999388766777544554447874032579899999999 Q ss_pred HHCCCCEEEECCCCCCCCCCCCCEEEEEECCCCCCCCCCCCCCEEEEEHHHHHCCCCHHHHHHHHCCCCCCCCCCCCCCC Q ss_conf 41079824201100123455751210000367321221112563342213310010000121110012222101358776 Q gi|254780425|r 309 VLKINQLLIVGSMRKELVDGRSHFYKSIYIINSKGEILASSNAKHLVPFAEYLPYRNILKKLNLDFYMFPLDYSSSDISP 388 (518) Q Consensus 309 ~~~~~~~li~G~~~~~~~~~~~~~yNs~~~~~~~g~~~~~Y~K~~LVPFGEyiP~~~~~~~l~~~~~~~~~~~~~g~~~~ 388 (518) ++++++..|.++.... +++++|||+++++++|+++++|+|+|| |.+. .+....-|.+|+... T Consensus 74 ~A~~~~i~i~~g~~e~---~~~~~yNt~~vi~~~G~i~~~yrK~hl-p~~~--------------~~~E~~~f~~G~~~~ 135 (279) T TIGR03381 74 LAKELGVVIPVSFFEK---AGNAYYNSLAMIDADGSVLGVYRKSHI-PDGP--------------GYQEKFYFRPGDTGF 135 (279) T ss_pred HHHHCCEEEEEEEEEE---ECCEEEEEEEEECCCCCEEEEEEEECC-CCCC--------------CEEEEEEECCCCCCC T ss_conf 9987395999944786---088237889999899809999988538-9999--------------634787514888766 Q ss_pred CEEECCCCEEECCCCHHHHHHHHHHCC--CCCCEEEEECC-CCCCCCC--HHHHHH-HHHHHHHHHCCCEEEEEECC--- Q ss_conf 200012200102221666558888512--69977999627-7557676--289999-99999999729919998559--- Q gi|254780425|r 389 LLGLSEKLRLYPLLFSDALFHQDINNL--ENVSAILNIID-DSGFMGS--GTDHSF-RYAQIQAVEIGLPLIRATNN--- 459 (518) Q Consensus 389 ~~~~~~~~~~~~~ICyE~~f~~~~r~~--~~a~~lvn~tN-d~Wf~~s--~~~qH~-~~a~~RAiE~~r~~vra~nt--- 459 (518) .+...++.++|.+||||.-||+..|.+ +||++++++|. .+|.... ....|. ...+.||+||+.+++.+... T Consensus 136 ~v~~~~~~kiG~~ICyD~~fPe~~r~la~~Ga~ii~~Psa~~~~p~~~~~~~~~~w~~~~~arA~en~~~vv~an~~G~~ 215 (279) T TIGR03381 136 KVWDTRYGRIGVGICWDQWFPETARAMALMGAEVLFYPTAIGSEPHDPDLDSRDHWQRVMQGHAAANLVPVVAANRIGTE 215 (279) T ss_pred CEEEECCCEEEEEEEECCCCCHHHHHHHHCCCCEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCEEEECCCCCCC T ss_conf 15985893698766604357489999998699799988414687776556628999999999999829536510310035 Q ss_pred ----------CCCEEECCCCEEEEECCCCCCEEEEEEECCCCC Q ss_conf ----------704699899808777389982799999606768 Q gi|254780425|r 460 ----------GVSAFLDERGQIISSVYADRGASIDMHFQPKVR 492 (518) Q Consensus 460 ----------GiS~~id~~G~i~~~~~~~~~~~l~~~v~~~~~ 492 (518) |-|.+|||+|+++.+.+.++++++.+++-+..- T Consensus 216 ~~~~~~~~f~G~S~Iv~P~G~ila~~~~~~e~~i~a~iDl~~i 258 (279) T TIGR03381 216 IGDGGEQTFYGSSFIADHTGELVAEAGRSEEAVLVATFDLDEI 258 (279) T ss_pred CCCCCCCCCCCCEEEECCCCCEEEECCCCCCEEEEEEEEHHHH T ss_conf 7888876584983899999999876689987899999759999 No 22 >cd07578 nitrilase_1_R1 First nitrilase domain of an uncharacterized subgroup of the nitrilase superfamily (putative class 13 nitrilases). Members of this subgroup have two nitrilase domains. This is the first of those two domains. The nitrilase superfamily is comprised of nitrile- or amide-hydrolyzing enzymes and amide-condensing enzymes, which depend on a Glu-Lys-Cys catalytic triad. This superfamily has been classified in the literature based on global and structure based sequence analysis into thirteen different enzyme classes (referred to as 1-13). Class 13 represents proteins that at the time were difficult to place in a distinct similarity group; this subgroup represents either a new class or one that was included previously in class 13. Members of this superfamily generally form homomeric complexes, the basic building block of which is a homodimer. Probab=99.94 E-value=5.5e-26 Score=187.81 Aligned_cols=214 Identities=19% Similarity=0.189 Sum_probs=148.9 Q ss_pred CCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEECCCCCCCCH-HH-----------HHHHHHHHHHHH Q ss_conf 321001244203555433335789987543101334668618983121011000-11-----------234667765641 Q gi|254780425|r 243 PVIRIVQPGINPAIKEDREKILERYLSLTALPVSAGELEPVIIVWAYLPFPFSI-VD-----------QPSILKKIASVL 310 (518) Q Consensus 243 ~~V~iVQpni~~~~k~~~~~~~~~~~~l~~~~~~~~~~~~dLiVWPEsa~p~~~-~~-----------~~~~~~~l~~~~ 310 (518) .+|+++|-+.... |.+..+++..++.+. +.+.++||||.||-.++-+. .+ .....+.+.+++ T Consensus 1 fkvA~vQ~~~~~g---d~~~Nl~~~~~~i~~---A~~~ga~lvvfPEl~~tGy~~~~~~~~~~~~e~~~g~~~~~l~~~A 74 (258) T cd07578 1 YKAAAIQFEPEMG---EKERNIERLLALCEE---AARAGARLIVTPEMATTGYCWYDRAEIAPFVEPIPGPTTARFAELA 74 (258) T ss_pred CEEEEEECCCCCC---CHHHHHHHHHHHHHH---HHHCCCEEEECCCCCCCCCCCCCHHHHHHHHHHCCCHHHHHHHHHH T ss_conf 9899996888788---999999999999999---9988697999898963479866988989877648988999999999 Q ss_pred CCCCEEEE-CCCCCCCCCCCCCEEEEEECCCCCCCCCCCCCCEEEEEHHHHHCCCCHHHHHHHHCCCCCCCCCCCCCCCC Q ss_conf 07982420-11001234557512100003673212211125633422133100100001211100122221013587762 Q gi|254780425|r 311 KINQLLIV-GSMRKELVDGRSHFYKSIYIINSKGEILASSNAKHLVPFAEYLPYRNILKKLNLDFYMFPLDYSSSDISPL 389 (518) Q Consensus 311 ~~~~~li~-G~~~~~~~~~~~~~yNs~~~~~~~g~~~~~Y~K~~LVPFGEyiP~~~~~~~l~~~~~~~~~~~~~g~~~~~ 389 (518) +++...++ |....+ +++.++|||+++++++| ++.+|+|.||. +.| ..-+.+|+.... T Consensus 75 ~~~~i~iv~G~~e~~--~~~g~~yNt~~~i~~~G-~i~~yrK~h~~-~~E------------------~~~~~~G~~~~~ 132 (258) T cd07578 75 REHDCYIVVGLPEVD--SRSGIYYNSAVLIGPSG-VIGRHRKTHPY-ISE------------------PKWAADGDLGHQ 132 (258) T ss_pred HHCCCCEEEEEEEEE--CCCCCEEEEEEEECCCC-EEEEEECCCCC-CCC------------------CCCCCCCCCCCE T ss_conf 873992797654688--79995999999983995-89998345776-342------------------311048997623 Q ss_pred EEECCCCEEECCCCHHHHHHHHHHCC--CCCCEEEEECCCCCCCCCHHHHHHHHHHHHHHHCCCEEEEEEC--------- Q ss_conf 00012200102221666558888512--6997799962775576762899999999999972991999855--------- Q gi|254780425|r 390 LGLSEKLRLYPLLFSDALFHQDINNL--ENVSAILNIIDDSGFMGSGTDHSFRYAQIQAVEIGLPLIRATN--------- 458 (518) Q Consensus 390 ~~~~~~~~~~~~ICyE~~f~~~~r~~--~~a~~lvn~tNd~Wf~~s~~~qH~~~a~~RAiE~~r~~vra~n--------- 458 (518) +...+..++|.+||||.-||+..|.+ +|+++|++++| |..+..+..|. +-||+||+-+++-+.. T Consensus 133 vf~t~~gkiG~~ICyD~~fpE~~R~la~~Ga~ii~~psa--~~~~~~~~~~~---~arA~eN~~~vv~~n~~G~~~~~~~ 207 (258) T cd07578 133 VFDTEIGRIALLICMDIHFFETARLLALGGADVICHISN--WLAERTPAPYW---INRAFENGCYLIESNRWGLERGVQF 207 (258) T ss_pred EEEECCCCEEEEEECCCCCHHHHHHHHHCCCCEEEECCC--CCCCCCHHHHH---HHHHHHCCCEEEEECCCCCCCCEEE T ss_conf 899589637767874653679999999879989998667--88878727999---9999976987999774326788078 Q ss_pred CCCCEEECCCCEEEEECCCCCCEEEEEEECCC Q ss_conf 97046998998087773899827999996067 Q gi|254780425|r 459 NGVSAFLDERGQIISSVYADRGASIDMHFQPK 490 (518) Q Consensus 459 tGiS~~id~~G~i~~~~~~~~~~~l~~~v~~~ 490 (518) .|-|++|||+|+|+++.+. +++++.+++-+. T Consensus 208 ~G~S~Iv~P~G~vla~~~~-~e~i~~adiDl~ 238 (258) T cd07578 208 SGGSCIIEPDGTIQASIDS-GDGVALGEIDLD 238 (258) T ss_pred ECCEEEECCCCCEEECCCC-CCEEEEEEEEHH T ss_conf 3323999899878852799-986999996079 No 23 >cd07574 nitrilase_Rim1_like Uncharacterized subgroup of the nitrilase superfamily; some members of this subgroup have an N-terminal RimI domain (class 12 nitrilases). Some members of this subgroup are implicated in post-translational modification, as they contain an N-terminal GCN5-related N-acetyltransferase (GNAT) protein RimI family domain. The nitrilase superfamily is comprised of nitrile- or amide-hydrolyzing enzymes and amide-condensing enzymes, which depend on a Glu-Lys-Cys catalytic triad. This superfamily has been classified in the literature based on global and structure based sequence analysis into thirteen different enzyme classes (referred to as 1-13), this subgroup corresponds to class 12. Members of this superfamily generally form homomeric complexes, the basic building block of which is a homodimer. Probab=99.94 E-value=1.4e-25 Score=185.19 Aligned_cols=221 Identities=15% Similarity=0.143 Sum_probs=153.0 Q ss_pred CCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEECCCCC-------CCCHHH-----------HHHHHH Q ss_conf 32100124420355543333578998754310133466861898312101-------100011-----------234667 Q gi|254780425|r 243 PVIRIVQPGINPAIKEDREKILERYLSLTALPVSAGELEPVIIVWAYLPF-------PFSIVD-----------QPSILK 304 (518) Q Consensus 243 ~~V~iVQpni~~~~k~~~~~~~~~~~~l~~~~~~~~~~~~dLiVWPEsa~-------p~~~~~-----------~~~~~~ 304 (518) ++|+++|=+++.. .|.++.+++..++.+ ++.+.++||||+||... |..... .+...+ T Consensus 1 ~rvA~vQ~~~~~~--~d~e~nl~~~~~~i~---~Aa~~GA~lvvfPE~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 75 (280) T cd07574 1 VRVAAAQYPLRRY--ASFEEFAAKVEYWVA---EAAGYGADLLVFPEYFTMELLSLLPEAIDGLDEAIRALAALTPDYVA 75 (280) T ss_pred CEEEEEECCCCCC--CCHHHHHHHHHHHHH---HHHHCCCEEEECCCCCCCCCCCCCCCCHHHHHHHHHHCCCCCCHHHH T ss_conf 9899995789899--999999999999999---99988596999588754634014762100289997642424869999 Q ss_pred HHHHHHCCCCEEEE-CCCCCCCCCCCCCEEEEEECCCCCCCCCCCCCCEEEEEHHHHHCCCCHHHHHHHHCCCCCCCCCC Q ss_conf 76564107982420-11001234557512100003673212211125633422133100100001211100122221013 Q gi|254780425|r 305 KIASVLKINQLLIV-GSMRKELVDGRSHFYKSIYIINSKGEILASSNAKHLVPFAEYLPYRNILKKLNLDFYMFPLDYSS 383 (518) Q Consensus 305 ~l~~~~~~~~~li~-G~~~~~~~~~~~~~yNs~~~~~~~g~~~~~Y~K~~LVPFGEyiP~~~~~~~l~~~~~~~~~~~~~ 383 (518) .+.+++++++..|+ |+...+ +++++||++++++++|++ ..|+|.||.|+.|. ...+++ T Consensus 76 ~l~~lA~~~~i~iv~G~~~~~---~~~~~yNta~vi~~~G~i-~~~rK~hl~~~e~~-----------------~~~~~~ 134 (280) T cd07574 76 LFSELARKYGINIIAGSMPVR---EDGRLYNRAYLFGPDGTI-GHQDKLHMTPFERE-----------------EWGISG 134 (280) T ss_pred HHHHHHHHHCCEEEEEEEEEE---ECCEEEEEEEEECCCCCE-EEEEEEECCCCCCE-----------------EEEECC T ss_conf 999999982969999369997---699899999999589819-99986405998631-----------------552269 Q ss_pred CCCCCCEEECCCCEEECCCCHHHHHHHHHHCC--CCCCEEEEECCCCCCCCCHHHHHHHHHHHHHHHCCCEEEEEEC--- Q ss_conf 58776200012200102221666558888512--6997799962775576762899999999999972991999855--- Q gi|254780425|r 384 SDISPLLGLSEKLRLYPLLFSDALFHQDINNL--ENVSAILNIIDDSGFMGSGTDHSFRYAQIQAVEIGLPLIRATN--- 458 (518) Q Consensus 384 g~~~~~~~~~~~~~~~~~ICyE~~f~~~~r~~--~~a~~lvn~tNd~Wf~~s~~~qH~~~a~~RAiE~~r~~vra~n--- 458 (518) |+..+++.. ++.++|.+||||.-||+..|.+ +|+++++++|. |-...+.++....++.||+||+-+++-|.. T Consensus 135 G~~~~v~~~-~~griG~~ICyD~~fPe~~r~la~~Ga~ii~~Psa--~~~~~~~~~~~~~~~ARAieN~~yvv~~n~vG~ 211 (280) T cd07574 135 GDKLKVFDT-DLGKIGILICYDSEFPELARALAEAGADLLLVPSC--TDTRAGYWRVRIGAQARALENQCYVVQSGTVGN 211 (280) T ss_pred CCCCEEEEE-CCEEEEEEEECCCCCHHHHHHHHHCCCCEEEECCC--CCCCCHHHHHHHHHHHHHHHCEEEEEEECCCCC T ss_conf 995579997-99279877861002569999999879969998998--887555899999999998858079999256247 Q ss_pred ----------CCCCEEECCC------CEEEEECCCCCCEEEEEEECCCCC Q ss_conf ----------9704699899------808777389982799999606768 Q gi|254780425|r 459 ----------NGVSAFLDER------GQIISSVYADRGASIDMHFQPKVR 492 (518) Q Consensus 459 ----------tGiS~~id~~------G~i~~~~~~~~~~~l~~~v~~~~~ 492 (518) .|-|+++||. |.++.+.+.++++++.+++-+..- T Consensus 212 ~~~~~~~~~~~G~S~I~~P~~~~~~~~gvla~~~~~~E~il~AdiDl~~v 261 (280) T cd07574 212 APWSPAVDVNYGQAAVYTPCDFGFPEDGILAEGEPNTEGWLIADLDLEAL 261 (280) T ss_pred CCCCCCCCEECCEEEEECCCCCCCCCCCEEEECCCCCCEEEEEEEEHHHH T ss_conf 88878765441223998886676789998971279998899999746999 No 24 >cd07585 nitrilase_7 Uncharacterized subgroup of the nitrilase superfamily (putative class 13 nitrilases). The nitrilase superfamily is comprised of nitrile- or amide-hydrolyzing enzymes and amide-condensing enzymes, which depend on a Glu-Lys-Cys catalytic triad. This superfamily has been classified in the literature based on global and structure based sequence analysis into thirteen different enzyme classes (referred to as 1-13). Class 13 represents proteins that at the time were difficult to place in a distinct similarity group; this subgroup represents either a new class or one that was included previously in class 13. Members of this superfamily generally form homomeric complexes, the basic building block of which is a homodimer. Probab=99.94 E-value=2.1e-25 Score=183.96 Aligned_cols=218 Identities=13% Similarity=0.166 Sum_probs=154.3 Q ss_pred CCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEECCCCCCCCHHH----------HHHHHHHHHHHHCCC Q ss_conf 2100124420355543333578998754310133466861898312101100011----------234667765641079 Q gi|254780425|r 244 VIRIVQPGINPAIKEDREKILERYLSLTALPVSAGELEPVIIVWAYLPFPFSIVD----------QPSILKKIASVLKIN 313 (518) Q Consensus 244 ~V~iVQpni~~~~k~~~~~~~~~~~~l~~~~~~~~~~~~dLiVWPEsa~p~~~~~----------~~~~~~~l~~~~~~~ 313 (518) +|+++|-|.-.. |.+..+++..++.+ ++.+.++||||+||.+++-+... +....+.+.++++++ T Consensus 1 kial~Q~~~~~g---d~~~Nl~~~~~~i~---~A~~~ga~lvvfPE~~ltGy~~~~~~~~~~~~~~~~~~~~l~~~a~~~ 74 (261) T cd07585 1 RIALVQFEARVG---DKARNLAVIARWTR---KAAAQGAELVCFPEMCITGYTHVRALSREAEVPDGPSTQALSDLARRY 74 (261) T ss_pred CEEEEECCCCCC---CHHHHHHHHHHHHH---HHHHCCCEEEECCCCCCCCCCHHHHHHHHHCCCCCHHHHHHHHHHHHH T ss_conf 999998989878---89999999999999---999886929998988302687024566651468989999999999875 Q ss_pred CEEEECCCCCCCCCCCCCEEEEEECCCCCCCCCCCCCCEEEEEHHHHHCCCCHHHHHHHHCCCCCCCCCCCCCCCCEEEC Q ss_conf 82420110012345575121000036732122111256334221331001000012111001222210135877620001 Q gi|254780425|r 314 QLLIVGSMRKELVDGRSHFYKSIYIINSKGEILASSNAKHLVPFAEYLPYRNILKKLNLDFYMFPLDYSSSDISPLLGLS 393 (518) Q Consensus 314 ~~li~G~~~~~~~~~~~~~yNs~~~~~~~g~~~~~Y~K~~LVPFGEyiP~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~ 393 (518) +..++++.... +++++||++++++++|.+ .+|+|.||-+ -| ..-|++|+..+++. . T Consensus 75 ~i~iv~g~~e~---~~~~~yNs~~~i~~~g~i-~~yrK~hl~~-~E------------------~~~f~~G~~~~vf~-~ 130 (261) T cd07585 75 GLTILAGLIEK---AGDRPYNTYLVCLPDGLV-HRYRKLHLFR-RE------------------HPYIAAGDEYPVFA-T 130 (261) T ss_pred CCEEEEEEEEE---CCCCEEEEEEEEECCCEE-EEEECCCCCC-CC------------------CCCCCCCCCCEEEE-E T ss_conf 96999967990---699489999999699616-8885377886-54------------------34347998424897-6 Q ss_pred CCCEEECCCCHHHHHHHHHHCC--CCCCEEEEECCCCCCCCCHHHHHH-HHHHHHHHHCCCEEEEEECC---------CC Q ss_conf 2200102221666558888512--699779996277557676289999-99999999729919998559---------70 Q gi|254780425|r 394 EKLRLYPLLFSDALFHQDINNL--ENVSAILNIIDDSGFMGSGTDHSF-RYAQIQAVEIGLPLIRATNN---------GV 461 (518) Q Consensus 394 ~~~~~~~~ICyE~~f~~~~r~~--~~a~~lvn~tNd~Wf~~s~~~qH~-~~a~~RAiE~~r~~vra~nt---------Gi 461 (518) ++.++|.+||||.-||+..|.+ +|++++++++...+........|. .+.+.||+||+.+++-+... |- T Consensus 131 ~~~~iG~~IC~D~~fPe~~r~la~~Ga~ii~~psa~~~~~~~~~~~~~~~~~~aRA~en~~~v~~~N~vG~~~~~~~~G~ 210 (261) T cd07585 131 PGVRFGILICYDNHFPENVRATALLGAEILFAPHATPGTTSPKGREWWMRWLPARAYDNGVFVAACNGVGRDGGEVFPGG 210 (261) T ss_pred CCEEEEEEECCCCCCCHHHHHHHHCCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCEEEECE T ss_conf 89279867613555879999999879979998865678777421889999999999972994898456656898157142 Q ss_pred CEEECCCCEEEEECCCCCCEEEEEEECCCC Q ss_conf 469989980877738998279999960676 Q gi|254780425|r 462 SAFLDERGQIISSVYADRGASIDMHFQPKV 491 (518) Q Consensus 462 S~~id~~G~i~~~~~~~~~~~l~~~v~~~~ 491 (518) |.++||+|+++.+.+.++++.+.+++-+.. T Consensus 211 S~Iv~P~G~il~~~~~~~e~~~~adiDl~~ 240 (261) T cd07585 211 AMILDPYGRVLAETTSGGDGMVVADLDLDL 240 (261) T ss_pred EEEECCCCCEEEECCCCCCEEEEEEEEHHH T ss_conf 799989983987338999889999988599 No 25 >cd07580 nitrilase_2 Uncharacterized subgroup of the nitrilase superfamily (putative class 13 nitrilases). The nitrilase superfamily is comprised of nitrile- or amide-hydrolyzing enzymes and amide-condensing enzymes, which depend on a Glu-Lys-Cys catalytic triad. This superfamily has been classified in the literature based on global and structure based sequence analysis into thirteen different enzyme classes (referred to as 1-13). Class 13 represents proteins that at the time were difficult to place in a distinct similarity group; this subgroup represents either a new class or one that was included previously in class 13. Members of this superfamily generally form homomeric complexes, the basic building block of which is a homodimer. Probab=99.94 E-value=5.5e-25 Score=181.28 Aligned_cols=216 Identities=16% Similarity=0.120 Sum_probs=151.6 Q ss_pred CCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEECCCCCCCCHHHH-------------HHHHHHHHHHH Q ss_conf 21001244203555433335789987543101334668618983121011000112-------------34667765641 Q gi|254780425|r 244 VIRIVQPGINPAIKEDREKILERYLSLTALPVSAGELEPVIIVWAYLPFPFSIVDQ-------------PSILKKIASVL 310 (518) Q Consensus 244 ~V~iVQpni~~~~k~~~~~~~~~~~~l~~~~~~~~~~~~dLiVWPEsa~p~~~~~~-------------~~~~~~l~~~~ 310 (518) +|+++|-|... .|.+..+++..++.+. +.+.++||||+||..++-+..++ ....+.+.+.+ T Consensus 1 riA~~Q~~~~~---gd~~~Nl~k~~~~i~~---A~~~ga~lvvfPE~~ltGy~~~~~~~~~~~~~~~~~~~~~~~l~~~a 74 (268) T cd07580 1 RVACVQFDPRV---GDLDANLARSIELIRE---AADAGANLVVLPELANTGYVFESRDEAFALAEEVPDGASTRAWAELA 74 (268) T ss_pred CEEEEECCCCC---CCHHHHHHHHHHHHHH---HHHCCCCEEECCCCCCCCCCCCCHHHHHHHHHHCCCCHHHHHHHHHH T ss_conf 98999898987---9999999999999999---99888919998888411698468899999865427988999999999 Q ss_pred CCCCEEEECCCCCCCCCCCCCEEEEEECCCCCCCCCCCCCCEEEEEHHHHHCCCCHHHHHHHHCCCCCCCCCCCCCCCCE Q ss_conf 07982420110012345575121000036732122111256334221331001000012111001222210135877620 Q gi|254780425|r 311 KINQLLIVGSMRKELVDGRSHFYKSIYIINSKGEILASSNAKHLVPFAEYLPYRNILKKLNLDFYMFPLDYSSSDISPLL 390 (518) Q Consensus 311 ~~~~~li~G~~~~~~~~~~~~~yNs~~~~~~~g~~~~~Y~K~~LVPFGEyiP~~~~~~~l~~~~~~~~~~~~~g~~~~~~ 390 (518) +++...|+++.... +++++|||+++++++| ++.+|+|.||-. .| ..-|++|+....+ T Consensus 75 ~~~~i~iv~g~~e~---~~~~~yNs~~~i~~~g-~i~~yrK~hl~~-~E------------------~~~f~~G~~~~~v 131 (268) T cd07580 75 AELGLYIVAGFAER---DGDRLYNSAVLVGPDG-VIGTYRKAHLWN-EE------------------KLLFEPGDLGLPV 131 (268) T ss_pred HHCCEEEEEEEEEC---CCCEEEEEEEEECCCC-EEEEEECCCCCC-CH------------------HHEEECCCCCCEE T ss_conf 97294999835773---2996999899963995-899995356897-07------------------8548579987038 Q ss_pred EECCCCEEECCCCHHHHHHHHHHCC--CCCCEEEEECCCCCCCCC--H---HHHHHHHHHHHHHHCCCEEEEEECC---- Q ss_conf 0012200102221666558888512--699779996277557676--2---8999999999999729919998559---- Q gi|254780425|r 391 GLSEKLRLYPLLFSDALFHQDINNL--ENVSAILNIIDDSGFMGS--G---TDHSFRYAQIQAVEIGLPLIRATNN---- 459 (518) Q Consensus 391 ~~~~~~~~~~~ICyE~~f~~~~r~~--~~a~~lvn~tNd~Wf~~s--~---~~qH~~~a~~RAiE~~r~~vra~nt---- 459 (518) ...+..++|.+||||.-||+..|.+ +|++++++++| |.... . ..+...+.+.||+||+-+++-+... T Consensus 132 ~~~~~g~iG~~IC~D~~fPe~~r~la~~Ga~ii~~p~~--~~~~~~~~~~~~~~~~~l~~arA~en~~~v~~~n~~G~~~ 209 (268) T cd07580 132 FDTPFGRIGVAICYDGWFPETFRLLALQGADIVCVPTN--WVPMPRPPEGGPPMANILAMAAAHSNGLFIACADRVGTER 209 (268) T ss_pred EEECCCEEEEEEECCCCCHHHHHHHHHCCCCEEEEECC--CCCCCCCCCCHHHHHHHHHHHHHHHCCCEEEEECCCCCCC T ss_conf 98078579998840113709999999869979999266--8886775310368999999999987798899967767768 Q ss_pred -----CCCEEECCCCEEEEE-CCCCCCEEEEEEECCC Q ss_conf -----704699899808777-3899827999996067 Q gi|254780425|r 460 -----GVSAFLDERGQIISS-VYADRGASIDMHFQPK 490 (518) Q Consensus 460 -----GiS~~id~~G~i~~~-~~~~~~~~l~~~v~~~ 490 (518) |-|++|||+|+++++ .+.++++++.+++-+. T Consensus 210 ~~~~~G~S~IvdP~G~vla~~~~~~~e~ll~adiDl~ 246 (268) T cd07580 210 GQPFIGQSLIVGPDGWPLAGPASGDEEEILLADIDLT 246 (268) T ss_pred CCEECCCCEEECCCCCEEEECCCCCCCEEEEEEECHH T ss_conf 9656488879999998884106899878999997169 No 26 >cd07570 GAT_Gln-NAD-synth Glutamine aminotransferase (GAT, glutaminase) domain of glutamine-dependent NAD synthetases (class 7 and 8 nitrilases). Glutamine-dependent NAD synthetases are bifunctional enzymes, which have an N-terminal GAT domain and a C-terminal NAD+ synthetase domain. The GAT domain is a glutaminase (EC 3.5.1.2) which hydrolyses L-glutamine to L-glutamate and ammonia. The ammonia is used by the NAD+ synthetase domain in the ATP-dependent amidation of nicotinic acid adenine dinucleotide. Glutamine aminotransferases are categorized depending on their active site residues into different unrelated classes. This class of GAT domain belongs to a larger nitrilase superfamily comprised of nitrile- or amide-hydrolyzing enzymes and amide-condensing enzymes, which depend on a Glu-Lys-Cys catalytic triad. This superfamily has been classified in the literature based on global and structure based sequence analysis into thirteen different enzyme classes (referred to as 1-13), this sub Probab=99.93 E-value=6.4e-25 Score=180.86 Aligned_cols=217 Identities=17% Similarity=0.175 Sum_probs=150.0 Q ss_pred CCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEECCCCCCCCHH-----------HHHHHHHHHHHHHCC Q ss_conf 210012442035554333357899875431013346686189831210110001-----------123466776564107 Q gi|254780425|r 244 VIRIVQPGINPAIKEDREKILERYLSLTALPVSAGELEPVIIVWAYLPFPFSIV-----------DQPSILKKIASVLKI 312 (518) Q Consensus 244 ~V~iVQpni~~~~k~~~~~~~~~~~~l~~~~~~~~~~~~dLiVWPEsa~p~~~~-----------~~~~~~~~l~~~~~~ 312 (518) +|+++|.|.... |.+..+++..++.++ +.+.++||||+||.+++-+.- ......+.+.+.+++ T Consensus 1 kIa~~Q~~~~~g---d~~~Nl~~~~~~i~~---A~~~ga~livfPEl~ltGY~~~d~~~~~~~~~~~~~~l~~l~~~~~~ 74 (261) T cd07570 1 RIALAQLNPTVG---DLEGNAEKILEAIRE---AKAQGADLVVFPELSLTGYPPEDLLLRPDFLEAAEEALEELAAATAD 74 (261) T ss_pred CEEEEECCCCCC---CHHHHHHHHHHHHHH---HHHCCCCEEECCCCCCCCCCHHHHHHCHHHHHHHHHHHHHHHHHHHH T ss_conf 999998999878---999999999999999---99889979998889544799788763656789999999999998865 Q ss_pred C-CEEEECCCCCCCCCCCCCEEEEEECCCCCCCCCCCCCCEEEEEHHHHHCCCCHHHHHHHHCCCCCCCCCCCCCCCCEE Q ss_conf 9-824201100123455751210000367321221112563342213310010000121110012222101358776200 Q gi|254780425|r 313 N-QLLIVGSMRKELVDGRSHFYKSIYIINSKGEILASSNAKHLVPFAEYLPYRNILKKLNLDFYMFPLDYSSSDISPLLG 391 (518) Q Consensus 313 ~-~~li~G~~~~~~~~~~~~~yNs~~~~~~~g~~~~~Y~K~~LVPFGEyiP~~~~~~~l~~~~~~~~~~~~~g~~~~~~~ 391 (518) + ..+++|.+..+ ++++|||+++++ +|+++.+|+|.||-.+|+| ....-|++|+...++. T Consensus 75 ~~i~iv~G~~~~~----~~~~yNsa~~i~-~G~i~~~y~K~hLp~~~~f---------------~E~~~f~~G~~~~~~~ 134 (261) T cd07570 75 LDIAVVVGLPLRH----DGKLYNAAAVLQ-NGKILGVVPKQLLPNYGVF---------------DEKRYFTPGDKPDVLF 134 (261) T ss_pred CCEEEEEEEEEEE----CCEEEEEEEEEE-CCEEEEEEEEEECCCCCCC---------------CCHHHCCCCCCCCEEE T ss_conf 7969999678988----997888889964-9928889988916999863---------------6023123688773378 Q ss_pred ECCCCEEECCCCHHHHHHHH-HHCC--CCCCEEEEECCCCCCCCCHHHHHHHHHHHHHHHCCCEEEEEECC--------- Q ss_conf 01220010222166655888-8512--69977999627755767628999999999999729919998559--------- Q gi|254780425|r 392 LSEKLRLYPLLFSDALFHQD-INNL--ENVSAILNIIDDSGFMGSGTDHSFRYAQIQAVEIGLPLIRATNN--------- 459 (518) Q Consensus 392 ~~~~~~~~~~ICyE~~f~~~-~r~~--~~a~~lvn~tNd~Wf~~s~~~qH~~~a~~RAiE~~r~~vra~nt--------- 459 (518) . ++.++|.+||||.-||+. .|.+ +||+++++++.--|. ...+.++..+.+.||+||+.+++-+... T Consensus 135 ~-~~~kiG~~ICyD~~fpe~~~r~la~~Gadii~~psA~p~~-~~k~~~~~~l~~arA~en~~~vv~~N~vG~~~~~~f~ 212 (261) T cd07570 135 F-KGLRIGVEICEDLWVPDPPSAELALAGADLILNLSASPFH-LGKQDYRRELVSSRSARTGLPYVYVNQVGGQDDLVFD 212 (261) T ss_pred E-CCEEEEEEEECCCCCCCHHHHHHHHCCCCEEEECCCCCCC-CCCHHHHHHHHHHHHHHHCCEEEEEECCCCCCCEEEC T ss_conf 6-8968887655146788779999997599799964788655-6828999999999999829859999355789966983 Q ss_pred CCCEEECCCCEEEEECCCCCCEEEEEEEC Q ss_conf 70469989980877738998279999960 Q gi|254780425|r 460 GVSAFLDERGQIISSVYADRGASIDMHFQ 488 (518) Q Consensus 460 GiS~~id~~G~i~~~~~~~~~~~l~~~v~ 488 (518) |-|+++||+|+++++.+.+++.+.+.++. T Consensus 213 G~S~I~~P~G~vla~~~~~ee~~a~iDld 241 (261) T cd07570 213 GGSFIADNDGELLAEAPRFEEDLADVDLD 241 (261) T ss_pred CEEEEECCCCCEEEECCCCCEEEEEEEHH T ss_conf 60899999973899859994899998729 No 27 >cd07577 Ph0642_like Pyrococcus horikoshii Ph0642 and related proteins, members of the nitrilase superfamily (putative class 13 nitrilases). Uncharacterized subgroup of the nitrilase superfamily. This superfamily is comprised of nitrile- or amide-hydrolyzing enzymes and amide-condensing enzymes, which depend on a Glu-Lys-Cys catalytic triad. Pyrococcus horikoshii Ph0642 is a hypothetical protein belonging to this subgroup. This superfamily has been classified in the literature based on global and structure based sequence analysis into thirteen different enzyme classes (referred to as 1-13). This subgroup was classified as belonging to class 13, which represents proteins that at the time were difficult to place in a distinct similarity group. Members of this superfamily generally form homomeric complexes, the basic building block of which is a homodimer. Probab=99.93 E-value=7.2e-25 Score=180.53 Aligned_cols=209 Identities=18% Similarity=0.193 Sum_probs=148.1 Q ss_pred CCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEECCCCCCCCH-H---H---------HHHHHHHHHHHH Q ss_conf 21001244203555433335789987543101334668618983121011000-1---1---------234667765641 Q gi|254780425|r 244 VIRIVQPGINPAIKEDREKILERYLSLTALPVSAGELEPVIIVWAYLPFPFSI-V---D---------QPSILKKIASVL 310 (518) Q Consensus 244 ~V~iVQpni~~~~k~~~~~~~~~~~~l~~~~~~~~~~~~dLiVWPEsa~p~~~-~---~---------~~~~~~~l~~~~ 310 (518) +|+++|-|.-. .|.+..+++..++.+. .++||||.||.+++-+. . + +....+.+.+.+ T Consensus 1 kIA~~Q~~~~~---gd~~~Nl~k~~~~i~~------a~adlvvfPEl~ltGY~~~~~~~~~~~~~~~~~~~~~~~l~~~a 71 (259) T cd07577 1 KVGYVQFNPKF---GEVEKNLKKVESLIKG------VEADLIVLPELFNTGYAFTSKEEVASLAESIPDGPTTRFLQELA 71 (259) T ss_pred CEEEEECCCCC---CCHHHHHHHHHHHHHH------CCCCEEECCCCCCCCCCCCCHHHHHHHHHHCCCCHHHHHHHHHH T ss_conf 99999898988---9999999999999997------59989997886102688899899998742536988999999999 Q ss_pred CCCCEEEECCCCCCCCCCCCCEEEEEECCCCCCCCCCCCCCEEEEEHHHHHCCCCHHHHHHHHCCCCCCCCCCCCCCCCE Q ss_conf 07982420110012345575121000036732122111256334221331001000012111001222210135877620 Q gi|254780425|r 311 KINQLLIVGSMRKELVDGRSHFYKSIYIINSKGEILASSNAKHLVPFAEYLPYRNILKKLNLDFYMFPLDYSSSDISPLL 390 (518) Q Consensus 311 ~~~~~li~G~~~~~~~~~~~~~yNs~~~~~~~g~~~~~Y~K~~LVPFGEyiP~~~~~~~l~~~~~~~~~~~~~g~~~~~~ 390 (518) +++...++++.... +++++|||+++++++|. +.+|+|.||. |.| ..-|.+|+....+ T Consensus 72 ~~~~i~iv~G~~e~---~~~~~yNs~~~i~~~g~-i~~y~K~hl~-~~E------------------~~~f~~G~~~~~v 128 (259) T cd07577 72 RETGAYIVAGLPER---DGDKFYNSAVVVGPEGY-IGIYRKTHLF-YEE------------------KLFFEPGDTGFRV 128 (259) T ss_pred HHCCCEEEEEEEEE---ECCEEEEEEEEECCCCE-EEEEECCCCC-CCC------------------CEEEECCCCCCEE T ss_conf 97295899979997---79989999999819933-8999654799-752------------------1288279975049 Q ss_pred EECCCCEEECCCCHHHHHHHHHHCC--CCCCEEEEECCCCCCCCCHHHHHHHHHHHHHHHCCCEEEEEEC---------- Q ss_conf 0012200102221666558888512--6997799962775576762899999999999972991999855---------- Q gi|254780425|r 391 GLSEKLRLYPLLFSDALFHQDINNL--ENVSAILNIIDDSGFMGSGTDHSFRYAQIQAVEIGLPLIRATN---------- 458 (518) Q Consensus 391 ~~~~~~~~~~~ICyE~~f~~~~r~~--~~a~~lvn~tNd~Wf~~s~~~qH~~~a~~RAiE~~r~~vra~n---------- 458 (518) ...++.++|.+||||.-||+..|.+ +|++++++++| |.. + +.-...+.||+||+.+++-|.. T Consensus 129 ~~~~~~kiG~~IC~D~~fPe~~r~la~~Ga~ii~~ps~--~~~---~-~~~~~~~arA~en~~~vv~~n~vG~~~~~~~~ 202 (259) T cd07577 129 FDIGDIRIGVMICFDWYFPEAARTLALKGADIIAHPAN--LVL---P-YCPKAMPIRALENRVFTITANRIGTEERGGET 202 (259) T ss_pred EEECCEEEEEEEEHHCCCHHHHHHHHHCCCCEEEEEEC--CCC---H-HHHHHHHHHHHHCCCEEEEEEEEECCCCCCCC T ss_conf 97799898645782117719999999879989999004--687---2-15999999999749869999755515789887 Q ss_pred ---CCCCEEECCCCEEEEECCCCCCEEEEEEECCC Q ss_conf ---97046998998087773899827999996067 Q gi|254780425|r 459 ---NGVSAFLDERGQIISSVYADRGASIDMHFQPK 490 (518) Q Consensus 459 ---tGiS~~id~~G~i~~~~~~~~~~~l~~~v~~~ 490 (518) .|-|.++||+|+++++.+.++++++.+++-+. T Consensus 203 ~~f~G~S~i~~P~G~ila~~~~~~E~~l~adiDl~ 237 (259) T cd07577 203 LRFIGKSQITSPKGEVLARAPEDGEEVLVAEIDPR 237 (259) T ss_pred CEECCCEEEECCCCCEEEECCCCCCEEEEEEEEHH T ss_conf 34758949998999898877999888999997069 No 28 >cd07579 nitrilase_1_R2 Second nitrilase domain of an uncharacterized subgroup of the nitrilase superfamily (putative class 13 nitrilases). Members of this subgroup have two nitrilase domains. This is the second of those two domains. The nitrilase superfamily is comprised of nitrile- or amide-hydrolyzing enzymes and amide-condensing enzymes, which depend on a Glu-Lys-Cys catalytic triad. This superfamily has been classified in the literature based on global and structure based sequence analysis into thirteen different enzyme classes (referred to as 1-13). Class 13 represents proteins that at the time were difficult to place in a distinct similarity group; this subgroup represents either a new class or one that was included previously in class 13. Members of this superfamily generally form homomeric complexes, the basic building block of which is a homodimer. Probab=99.92 E-value=6.6e-24 Score=174.23 Aligned_cols=215 Identities=17% Similarity=0.152 Sum_probs=148.9 Q ss_pred CCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEECCCCCCCCHH-------HHHHHHHHHHHHHCCCCEE Q ss_conf 210012442035554333357899875431013346686189831210110001-------1234667765641079824 Q gi|254780425|r 244 VIRIVQPGINPAIKEDREKILERYLSLTALPVSAGELEPVIIVWAYLPFPFSIV-------DQPSILKKIASVLKINQLL 316 (518) Q Consensus 244 ~V~iVQpni~~~~k~~~~~~~~~~~~l~~~~~~~~~~~~dLiVWPEsa~p~~~~-------~~~~~~~~l~~~~~~~~~l 316 (518) +|+++|-+... |.+..+++..++.+ ++.+.++||||+||.++..+.. .+....+.+.++++++... T Consensus 1 rvA~~Q~~~~~----d~~~Nl~~~~~~i~---~Aa~~ga~lvvlPE~~~tg~~~~~~~a~~~~g~~~~~l~~lA~~~~i~ 73 (279) T cd07579 1 RIAVAQFAPTP----DIAGNLATIDRLAA---EAKATGAELVVFPELALTGLDDPASEAESDTGPAVSALRRLARRLRLY 73 (279) T ss_pred CEEEEECCCCC----CHHHHHHHHHHHHH---HHHHCCCEEEECCCCCCCCCCCHHHHCCCCCCHHHHHHHHHHHHCCEE T ss_conf 99999465899----99999999999999---999879959997999411599547743559988999999999975919 Q ss_pred EECCCCCCCCCCCCCEEEEEECCCCCCCCCCCCCCEEEEEHHHHHCCCCHHHHHHHHCCCCCCCCCCCCCCCCEEECCCC Q ss_conf 20110012345575121000036732122111256334221331001000012111001222210135877620001220 Q gi|254780425|r 317 IVGSMRKELVDGRSHFYKSIYIINSKGEILASSNAKHLVPFAEYLPYRNILKKLNLDFYMFPLDYSSSDISPLLGLSEKL 396 (518) Q Consensus 317 i~G~~~~~~~~~~~~~yNs~~~~~~~g~~~~~Y~K~~LVPFGEyiP~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~ 396 (518) |+++.... +++++|||+++++++| .+.+|+|.|| |+. ...-+++|+..+++.. +.. T Consensus 74 iv~G~~e~---~~~~~yNs~~~i~~~G-~i~~YrK~HL--~~~-----------------E~~~~~~G~~~~v~~t-~~g 129 (279) T cd07579 74 LVAGFAEA---DGDGLYNSAVLVGPEG-LVGTYRKTHL--IEP-----------------ERSWATPGDTWPVYDL-PLG 129 (279) T ss_pred EEEEEEEC---CCCEEEEEEEEECCCC-CEEEEEECCC--CCC-----------------CCEEECCCCCCEEEEE-CCE T ss_conf 99989994---5993788899985997-6789970258--975-----------------4411338980559998-998 Q ss_pred EEECCCCHHHHHHHHHHCC--CCCCEEEEECCCCC--------------CC-CCHH-HHHHHHHHHHHHHCCCEEEEEE- Q ss_conf 0102221666558888512--69977999627755--------------76-7628-9999999999997299199985- Q gi|254780425|r 397 RLYPLLFSDALFHQDINNL--ENVSAILNIIDDSG--------------FM-GSGT-DHSFRYAQIQAVEIGLPLIRAT- 457 (518) Q Consensus 397 ~~~~~ICyE~~f~~~~r~~--~~a~~lvn~tNd~W--------------f~-~s~~-~qH~~~a~~RAiE~~r~~vra~- 457 (518) ++|.+||||.-||++.|.+ +||++|+.++.-+. |. .++. ..|..+.|.||+||+.+++-+. T Consensus 130 riGl~ICyD~rFPE~~R~la~~GA~il~~PaA~~~~~~~~~~~~~~~~~~~~~~g~~~~hw~llraRAieN~~~v~~an~ 209 (279) T cd07579 130 RVGLLIGHDALFPEAGRVLALRGCDLLACPAAIAIPFVGAHAGTSVPQPYPIPTGADPTHWHLARVRAGENNVYFAFANV 209 (279) T ss_pred EEEEEEECCCCHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCEEEEEECC T ss_conf 89988880201289999999879939997663466755333453233455544465347999999888646759999435 Q ss_pred ------CCCCCEEECCCCEEEE---ECCCCCCEEEEEEECC Q ss_conf ------5970469989980877---7389982799999606 Q gi|254780425|r 458 ------NNGVSAFLDERGQIIS---SVYADRGASIDMHFQP 489 (518) Q Consensus 458 ------ntGiS~~id~~G~i~~---~~~~~~~~~l~~~v~~ 489 (518) .+|.|++++|.|.+.. ...-.+++++.+++-+ T Consensus 210 ~~~~~~~~G~s~i~gp~~~~~p~~~~~~~~~egi~~A~iDl 250 (279) T cd07579 210 PDPARGYTGWSGVFGPDTFAFPRQEAAIGDEEGIAWALIDT 250 (279) T ss_pred CCCCCCCCCEEEEEECCCCCCCCEEEECCCCCCEEEEEECH T ss_conf 46767866346999657554353135038877279999758 No 29 >COG0388 Predicted amidohydrolase [General function prediction only] Probab=99.91 E-value=2.8e-23 Score=170.16 Aligned_cols=221 Identities=19% Similarity=0.169 Sum_probs=157.9 Q ss_pred CCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEECCCCCCCCHHH------------HHHHHHHHHHH Q ss_conf 532100124420355543333578998754310133466861898312101100011------------23466776564 Q gi|254780425|r 242 SPVIRIVQPGINPAIKEDREKILERYLSLTALPVSAGELEPVIIVWAYLPFPFSIVD------------QPSILKKIASV 309 (518) Q Consensus 242 ~~~V~iVQpni~~~~k~~~~~~~~~~~~l~~~~~~~~~~~~dLiVWPEsa~p~~~~~------------~~~~~~~l~~~ 309 (518) .++|+++|.+....+ .+...++..++.+. +.+.++||||.||-...-+..+ .....+.+.+. T Consensus 2 ~~riA~~Q~~~~~~d---~~~N~~~~~~~i~~---A~~~ga~LvvfPEl~~tgy~~~~~~~~~~~~~~~~~~~~~~l~~~ 75 (274) T COG0388 2 MMRVAAAQMAPKAGD---PAENLARILRLIRE---AAARGADLVVFPELFLTGYPCEDDLFLEEAAAEAGEETLEFLAAL 75 (274) T ss_pred CEEEEEEECCCCCCC---HHHHHHHHHHHHHH---HHHCCCCEEECCCCCCCCCCCCHHHHHHHHHCCCCHHHHHHHHHH T ss_conf 479999803666689---79999999999999---987699899937843557984125676654303562899999998 Q ss_pred HCCCCEEEECCCCCCCCCCCCCEEEEEECCCCCCCCCCCCCCEEEEEHHHHHCCCCHHHHHHHHCCCCCCCCCCCCCCCC Q ss_conf 10798242011001234557512100003673212211125633422133100100001211100122221013587762 Q gi|254780425|r 310 LKINQLLIVGSMRKELVDGRSHFYKSIYIINSKGEILASSNAKHLVPFAEYLPYRNILKKLNLDFYMFPLDYSSSDISPL 389 (518) Q Consensus 310 ~~~~~~li~G~~~~~~~~~~~~~yNs~~~~~~~g~~~~~Y~K~~LVPFGEyiP~~~~~~~l~~~~~~~~~~~~~g~~~~~ 389 (518) ++++..+++|+...+.. +.||++.+++++|++.++|+|.||-+. ++.....+.+|+.... T Consensus 76 a~~~~~~ivg~~~~~~~----~~~~~~~~i~~~G~ii~~y~K~hl~~~----------------~~~e~~~~~~G~~~~~ 135 (274) T COG0388 76 AEEGGVIIVGGPLPERE----KLYNNAALIDPDGEILGKYRKLHLFDA----------------FYEERRFFTPGDEGVV 135 (274) T ss_pred HHHCCEEEEEEEEEECC----CCEEEEEEECCCCEEEEEEEEECCCCC----------------CCCHHHCCCCCCCCCE T ss_conf 87698899985066134----416789998689869999865827987----------------5224313147876650 Q ss_pred EEECCCCEEECCCCHHHHHHHHHHCC---CCCCEEEEECCCCCCCCCHHHHHHHHHHHHHHHCCCEEEEEE--------- Q ss_conf 00012200102221666558888512---699779996277557676289999999999997299199985--------- Q gi|254780425|r 390 LGLSEKLRLYPLLFSDALFHQDINNL---ENVSAILNIIDDSGFMGSGTDHSFRYAQIQAVEIGLPLIRAT--------- 457 (518) Q Consensus 390 ~~~~~~~~~~~~ICyE~~f~~~~r~~---~~a~~lvn~tNd~Wf~~s~~~qH~~~a~~RAiE~~r~~vra~--------- 457 (518) ....++.++|..||||.-||+..|.. .|+++++++++ |....+..+...+.+.||+||+.+++-+. T Consensus 136 v~~~~~~kiG~~IC~D~~fPe~~~~~~~~~Gaeii~~p~a--~~~~~~~~~w~~l~~arA~en~~~vi~~n~~g~~~~~~ 213 (274) T COG0388 136 VFETDGGKIGLLICYDLRFPELARRLLALGGAELLLVPAA--WPAERGLDHWEVLLRARAIENQVYVLAANRAGFDGAGL 213 (274) T ss_pred EEEECCEEEEEEECCCCCCCHHHHHHHHHCCCEEEEECCC--CCCCCCHHHHHHHHHHHHHHHCCEEEEECCCCCCCCCC T ss_conf 6966885789998145478378999886269659998887--75543367899999752785362699966767767632 Q ss_pred -CCCCCEEECCCCEEEEECCCCCCEEEEEEECCC Q ss_conf -597046998998087773899827999996067 Q gi|254780425|r 458 -NNGVSAFLDERGQIISSVYADRGASIDMHFQPK 490 (518) Q Consensus 458 -ntGiS~~id~~G~i~~~~~~~~~~~l~~~v~~~ 490 (518) ..|-|.++||+|+++++++.++++++..++.+. T Consensus 214 ~~~G~S~i~~p~G~v~~~~~~~~e~~~~~~id~~ 247 (274) T COG0388 214 EFCGHSAIIDPDGEVLAEAGEEEEGVLLADIDLA 247 (274) T ss_pred EECCEEEEECCCCCEEEECCCCCCCEEEEEECHH T ss_conf 5511049984899778743788886899997466 No 30 >PRK13825 conjugal transfer protein TraB; Provisional Probab=99.86 E-value=2.4e-18 Score=138.04 Aligned_cols=183 Identities=17% Similarity=0.179 Sum_probs=124.0 Q ss_pred HHHHHHHHHHHHHCCCCCCCCCCEEEECCCCCCCCHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCEEEEEECCC Q ss_conf 33578998754310133466861898312101100011234667765641079824201100123455751210000367 Q gi|254780425|r 261 EKILERYLSLTALPVSAGELEPVIIVWAYLPFPFSIVDQPSILKKIASVLKINQLLIVGSMRKELVDGRSHFYKSIYIIN 340 (518) Q Consensus 261 ~~~~~~~~~l~~~~~~~~~~~~dLiVWPEsa~p~~~~~~~~~~~~l~~~~~~~~~li~G~~~~~~~~~~~~~yNs~~~~~ 340 (518) +...+++.++.........++.++||+||+++++.....+...++ .....+..+|.|+...+ .+.|.|+++.++ T Consensus 203 ~~~~~~~r~li~~~~~~~~~g~~vvvLPEs~lg~Wtp~~~~lw~~--~l~~~~~tvi~Ga~v~~----~~Gydn~lv~i~ 276 (389) T PRK13825 203 DGSLLRHRELIATVRAAARAGARVVVLPESALGFWTPTTERLWRE--SLRGADATVIAGAAVVD----PGGYDNVLVAIS 276 (389) T ss_pred CHHHHHHHHHHHHHHHHCCCCCEEEEECCHHCCCCCCHHHHHHHH--HHCCCCCEEEEEEEECC----CCCCCCEEEEEC T ss_conf 336899999999988734799739994632115687138999999--74689928998236758----998744479971 Q ss_pred CCCCCCCCCCCEEEEEHHHHHCCCCHHHHHHHHCCCCCCCCCCCCCCCCEEECCCCEEECCCCHHHHHH--HHHHCCCCC Q ss_conf 321221112563342213310010000121110012222101358776200012200102221666558--888512699 Q gi|254780425|r 341 SKGEILASSNAKHLVPFAEYLPYRNILKKLNLDFYMFPLDYSSSDISPLLGLSEKLRLYPLLFSDALFH--QDINNLENV 418 (518) Q Consensus 341 ~~g~~~~~Y~K~~LVPFGEyiP~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~ICyE~~f~--~~~r~~~~a 418 (518) ++|.. -.|+-+..||++++-|++.++..- .....++.. +++. ..++.|++++||||..-. .+..-..++ T Consensus 277 ~~~~r-ily~~RmPVP~sMWqPW~~~~g~~----g~~~a~~f~---npv~-~i~G~r~a~lICYEqll~wp~l~sml~~P 347 (389) T PRK13825 277 AGGGR-ILYRERMPVPVSMWQPWLPWTGES----GGARAHFFA---NPVV-TVDGRRVAPLICYEQLIVWPVLQSMLHAP 347 (389) T ss_pred CCCCE-EEECCCCCCCCCCCCCCCCCCCCC----CCCCCCCCC---CCEE-EECCEEEEEEEEHHHHHHHHHHHHHHCCC T ss_conf 89726-850367766510366775224899----972001245---8636-45680687754187878889999874299 Q ss_pred CEEEEECCCCCCC-C-CHHHHHHHHHHHHHHHCCCEEEEEECC Q ss_conf 7799962775576-7-628999999999999729919998559 Q gi|254780425|r 419 SAILNIIDDSGFM-G-SGTDHSFRYAQIQAVEIGLPLIRATNN 459 (518) Q Consensus 419 ~~lvn~tNd~Wf~-~-s~~~qH~~~a~~RAiE~~r~~vra~nt 459 (518) |++|...|| ||. + |-+.=|-+.++.=|-=.++|+|||.|+ T Consensus 348 dviva~~N~-Wwt~gTsi~aIQras~~awarLfg~Plv~A~N~ 389 (389) T PRK13825 348 DVIVAVGNG-WWTKGTSIVAIQRASAEAWARLFGVPLVRAFNT 389 (389) T ss_pred CEEEECCCC-CCCCCCCHHHHHHHHHHHHHHHHCCCEEEEECC T ss_conf 889975886-616899709999999999999837844876049 No 31 >pfam00795 CN_hydrolase Carbon-nitrogen hydrolase. This family contains hydrolases that break carbon-nitrogen bonds. The family includes: Nitrilase EC:3.5.5.1, Aliphatic amidase EC:3.5.1.4, Biotidinase EC:3.5.1.12, Beta-ureidopropionase EC:3.5.1.6. Nitrilase-related proteins generally have a conserved E-K-C catalytic triad, and are multimeric alpha-beta-beta-alpha sandwich proteins. Probab=99.85 E-value=1.6e-21 Score=158.73 Aligned_cols=159 Identities=19% Similarity=0.255 Sum_probs=112.7 Q ss_pred CCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEECCCCCCCCHHH-----------HHHHHHHHHHHHCC Q ss_conf 2100124420355543333578998754310133466861898312101100011-----------23466776564107 Q gi|254780425|r 244 VIRIVQPGINPAIKEDREKILERYLSLTALPVSAGELEPVIIVWAYLPFPFSIVD-----------QPSILKKIASVLKI 312 (518) Q Consensus 244 ~V~iVQpni~~~~k~~~~~~~~~~~~l~~~~~~~~~~~~dLiVWPEsa~p~~~~~-----------~~~~~~~l~~~~~~ 312 (518) +|+++|-|.. ..|.+..+++..++.+. +.+.++|+||+||.+++-+..+ +....+.+.+.+++ T Consensus 1 kiA~~Q~~~~---~~d~~~Nl~~~~~~i~~---A~~~gadlvv~PE~~l~gy~~~~~~~~~~~~~~~~~~~~~l~~~a~~ 74 (172) T pfam00795 1 RVALVQLPPS---AFDLEANLQKLLELIEE---AARQGADLIVFPELFIPGYAHGAAEYLELAEAIPGEVLQALSALARK 74 (172) T ss_pred CEEEEECCCC---CCCHHHHHHHHHHHHHH---HHHCCCCEEECCCCCCCCCCCHHHHHHHHHHHCCCHHHHHHHHHHHH T ss_conf 9999989987---68999999999999999---99879989995641246787206777776454683999999999988 Q ss_pred CCEEEECCCCCCCCCCCCCEEEEEECCCCCCCCCCCCCCEEEEEHHHHHCCCCHHHHHHHHCCCCCCCCCCCCCCCCEEE Q ss_conf 98242011001234557512100003673212211125633422133100100001211100122221013587762000 Q gi|254780425|r 313 NQLLIVGSMRKELVDGRSHFYKSIYIINSKGEILASSNAKHLVPFAEYLPYRNILKKLNLDFYMFPLDYSSSDISPLLGL 392 (518) Q Consensus 313 ~~~li~G~~~~~~~~~~~~~yNs~~~~~~~g~~~~~Y~K~~LVPFGEyiP~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~ 392 (518) +...++++.... +++++|||+++++++|++..+|+|+||+|+|||..- .-+.+|+....+.. T Consensus 75 ~~i~i~~G~~~~---~~~~~yNs~~~i~~~G~i~~~Y~K~hL~~~~~~~E~---------------~~~~~G~~~~~v~~ 136 (172) T pfam00795 75 NGITVVAGIPER---DGGGLYNTLVLIDPDGELLGKYRKRHLVPFGEWVER---------------PLFGPGGATFPVFD 136 (172) T ss_pred CCCEEEEEEEEE---ECCEEEEEEEEECCCCEEEEEEEEECCCCCCCEEEE---------------EECCCCCCCCCEEC T ss_conf 596999946895---288888888999699869999933758997641278---------------75157887440104 Q ss_pred CCCCEEECCCCHHHHHHHHHHCC--CCCCEEEEECC Q ss_conf 12200102221666558888512--69977999627 Q gi|254780425|r 393 SEKLRLYPLLFSDALFHQDINNL--ENVSAILNIID 426 (518) Q Consensus 393 ~~~~~~~~~ICyE~~f~~~~r~~--~~a~~lvn~tN 426 (518) .++.++|.+||||..||+..|.+ +|+++++++|. T Consensus 137 ~~~~riG~~ICyD~~fpe~~r~la~~Ga~ii~~PsA 172 (172) T pfam00795 137 TPVGKLGLLICYEIRFPELARMLALKGAEILANPSA 172 (172) T ss_pred CCCEEEEEEEECCCCCHHHHHHHHHCCCCEEEECCC T ss_conf 698899999860006579999999889969996999 No 32 >KOG0807 consensus Probab=99.83 E-value=2.9e-20 Score=150.53 Aligned_cols=234 Identities=16% Similarity=0.119 Sum_probs=160.1 Q ss_pred CCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEECCCCCC----------CCHHHHHHHHHHHHHHHCC Q ss_conf 321001244203555433335789987543101334668618983121011----------0001123466776564107 Q gi|254780425|r 243 PVIRIVQPGINPAIKEDREKILERYLSLTALPVSAGELEPVIIVWAYLPFP----------FSIVDQPSILKKIASVLKI 312 (518) Q Consensus 243 ~~V~iVQpni~~~~k~~~~~~~~~~~~l~~~~~~~~~~~~dLiVWPEsa~p----------~~~~~~~~~~~~l~~~~~~ 312 (518) .+|++-|-+-+.+. .+.++...+|.+ +++..++++|.+||..=- .....+..++++..++++. T Consensus 16 ~~vAv~Qm~S~~Dl----~kNl~~~keLi~---eA~~k~A~~iflPE~~dFi~~n~~esi~Lae~l~~k~m~~y~elar~ 88 (295) T KOG0807 16 KRVAVAQMTSSNDL----TKNLATCKELIS---EAAQKGAKLIFLPEAFDFIGQNPLESIELAEPLDGKFMEQYRELARS 88 (295) T ss_pred CEEEEEEECCCHHH----HHHHHHHHHHHH---HHHHCCCCEEECCHHHHHHCCCCCCCEECCCCCCHHHHHHHHHHHHH T ss_conf 10477752165678----888999999999---99973997997321332332796221322256684899999998774 Q ss_pred CCEEEE-CCCCCCCCCCCCCEEEEEECCCCCCCCCCCCCCEEEEEHHHHHCCCCHHHHHHHHCCCCCCCC-CCCCCCCCE Q ss_conf 982420-110012345575121000036732122111256334221331001000012111001222210-135877620 Q gi|254780425|r 313 NQLLIV-GSMRKELVDGRSHFYKSIYIINSKGEILASSNAKHLVPFAEYLPYRNILKKLNLDFYMFPLDY-SSSDISPLL 390 (518) Q Consensus 313 ~~~li~-G~~~~~~~~~~~~~yNs~~~~~~~g~~~~~Y~K~~LVPFGEyiP~~~~~~~l~~~~~~~~~~~-~~g~~~~~~ 390 (518) ++..+. |+......+++.+.+|+.+++|.+|++.++|.|.||- ---+|-+..+ ...++ .||+.-... T Consensus 89 ~nIwlSlgg~~~r~~~~~~k~~N~hl~id~~G~i~a~Y~KlHLF--DVeipg~~~l---------kES~~t~pG~~i~~p 157 (295) T KOG0807 89 HNIWLSLGGHHERSDDGNQKLRNTHLLIDSKGEIRAEYQKLHLF--DVEIPGGPRL---------KESNTTQPGTAIESP 157 (295) T ss_pred CCEEEEECCCCCCCCCCCCEEEEEEEEECCCCHHHHHHHHHCEE--EEECCCCCCC---------CCCCCCCCCCCCCCC T ss_conf 48258730435777300130343579986776689997652056--7635999610---------134676788655885 Q ss_pred EECCCCEEECCCCHHHHHHHHHHCC--CCCCEEEEECCCCCCCC-CHHHHHHHHHHHHHHHCCCEEEEEECC-------- Q ss_conf 0012200102221666558888512--69977999627755767-628999999999999729919998559-------- Q gi|254780425|r 391 GLSEKLRLYPLLFSDALFHQDINNL--ENVSAILNIIDDSGFMG-SGTDHSFRYAQIQAVEIGLPLIRATNN-------- 459 (518) Q Consensus 391 ~~~~~~~~~~~ICyE~~f~~~~r~~--~~a~~lvn~tNd~Wf~~-s~~~qH~~~a~~RAiE~~r~~vra~nt-------- 459 (518) ....-.++|..||||+-||+..-.+ .||++|..+| -|+. +|+..-..+-|.||||+.-++|-++.. T Consensus 158 v~tP~GklGlaICYDiRFpE~sl~LR~~gA~iLtyPS---AFT~~TG~AHWEiLlRARAietQCYVvaaaQ~G~HneKR~ 234 (295) T KOG0807 158 VDTPLGKLGLAICYDIRFPELSLKLRKMGAQILTYPS---AFTIKTGEAHWEILLRARAIETQCYVVAAAQVGKHNEKRE 234 (295) T ss_pred CCCCCCCCCEEEEEECCCCHHHHHHHHCCCCEEECCC---HHHHCCCHHHHHHHHHHHHHHCCEEEEEHHHCCCCCCHHH T ss_conf 6786220110235422573688999872982796351---5431344889999999887640218985654266560223 Q ss_pred --CCCEEECCCCEEEEECCCCC-CEEEEEEECCCCCCCHHH Q ss_conf --70469989980877738998-279999960676876878 Q gi|254780425|r 460 --GVSAFLDERGQIISSVYADR-GASIDMHFQPKVRDSFRS 497 (518) Q Consensus 460 --GiS~~id~~G~i~~~~~~~~-~~~l~~~v~~~~~~T~y~ 497 (518) |.|++|||+|.|+++.+..+ .+...+++-+.--+++++ T Consensus 235 SyGhSMiVDPWGtVva~~se~~~~~l~~AdiDlslld~lr~ 275 (295) T KOG0807 235 SYGHSMIVDPWGTVVARCSERTGPGLILADIDLSLLDSLRT 275 (295) T ss_pred CCCCCEEECCHHHHHEECCCCCCCCEEEEECCHHHHHHHHH T ss_conf 04864687654555302687789965899756899999987 No 33 >PRK13287 amiF formamidase; Provisional Probab=99.83 E-value=5.1e-19 Score=142.41 Aligned_cols=229 Identities=14% Similarity=0.145 Sum_probs=159.3 Q ss_pred CCCCCCCCCCCCCCCCCH-HHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEECCCCCCCCHHH---HH--------HHHH Q ss_conf 655555321001244203-55543333578998754310133466861898312101100011---23--------4667 Q gi|254780425|r 237 KFEKISPVIRIVQPGINP-AIKEDREKILERYLSLTALPVSAGELEPVIIVWAYLPFPFSIVD---QP--------SILK 304 (518) Q Consensus 237 ~~~~~~~~V~iVQpni~~-~~k~~~~~~~~~~~~l~~~~~~~~~~~~dLiVWPEsa~p~~~~~---~~--------~~~~ 304 (518) .+.+.++.|++||=..|- +.|.+-.+..++..++.... +.+-+..||||+||.|..-...+ .. ...+ T Consensus 8 ~~s~d~v~v~~vqy~~P~~~t~~ev~~n~~~I~~~v~~~-k~G~PgmDLvVFPEYSt~G~~~d~~~~~e~~~~ipGpet~ 86 (333) T PRK13287 8 NKPIEGVLVGLIQYPVPVVNSREDIDKQIEQIIKTVHKT-KAGYPGMDLIVFPEYSTQGLNTKKWTTEEFLCTVDGPEVD 86 (333) T ss_pred CCCCCCEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHH-HCCCCCCCEEEECCCCCCCCCCCHHHHHHHHHCCCCHHHH T ss_conf 789985699999527886689999999999999999997-5689997489802413466678734557776149977899 Q ss_pred HHHHHHCCCCEEEECCCCCCCCCCCCCEEEEEECCCCCCCCCCCCCCEEEEEHHHHHCCCCHHHHHHHHCCCCCCCCCCC Q ss_conf 76564107982420110012345575121000036732122111256334221331001000012111001222210135 Q gi|254780425|r 305 KIASVLKINQLLIVGSMRKELVDGRSHFYKSIYIINSKGEILASSNAKHLVPFAEYLPYRNILKKLNLDFYMFPLDYSSS 384 (518) Q Consensus 305 ~l~~~~~~~~~li~G~~~~~~~~~~~~~yNs~~~~~~~g~~~~~Y~K~~LVPFGEyiP~~~~~~~l~~~~~~~~~~~~~g 384 (518) .+.+.++++....+-+...+ .+++++.||+++++|++|+++.+|.|.|+ .+|.+. ..|| T Consensus 87 ~f~~acr~~~vwgvfSi~E~-np~~~~pYNT~iLIn~~Geiv~kYRKi~P-----WvPiE~---------------W~PG 145 (333) T PRK13287 87 AFAQACKENKVWGVFSIMER-NPDGNEPYNTAIIIDDQGEIILKYRKLHP-----WVPVEP---------------WEPG 145 (333) T ss_pred HHHHHHHHCCEEEEEEEEEE-CCCCCCCCEEEEEECCCCCEEEEEEECCC-----CCCCCC---------------CCCC T ss_conf 99999997794899998887-68999863028998899818889982256-----556777---------------3589 Q ss_pred CC-CCCEEECCCCEEECCCCHHHHHHHHHHCC--CCCCEEEEECCCCCCCCCHHHHHHHHHHHHHHHCCCEEEEEECC-- Q ss_conf 87-76200012200102221666558888512--69977999627755767628999999999999729919998559-- Q gi|254780425|r 385 DI-SPLLGLSEKLRLYPLLFSDALFHQDINNL--ENVSAILNIIDDSGFMGSGTDHSFRYAQIQAVEIGLPLIRATNN-- 459 (518) Q Consensus 385 ~~-~~~~~~~~~~~~~~~ICyE~~f~~~~r~~--~~a~~lvn~tNd~Wf~~s~~~qH~~~a~~RAiE~~r~~vra~nt-- 459 (518) +. .++...+.+.|++.+||+|-.||++.|+. +||++++-++. |.....+|+-.+++.+|-+|..+++-+.-+ T Consensus 146 d~~~pV~dgPkG~KI~liIC~DG~fPEiaRe~A~kGAEviiR~~G---Ym~p~~~qw~~t~k~~A~~N~~yva~vn~~G~ 222 (333) T PRK13287 146 DLGIPVCDGPGGSKLAVCICHDGMFPEMAREAAYKGANVMIRISG---YSTQVNDQWILTNRSNAWQNLMYTASVNLAGY 222 (333) T ss_pred CCCCEEEECCCCCEEEEEEECCCCCHHHHHHHHHCCCEEEEEECC---CCCCHHHHHHHHHHHHHHHHCCEEEEEEECCC T ss_conf 865156546887578999975899818999987468879998357---65613567999999988862557999750256 Q ss_pred -------CCCEEECCCCEEEEECCCCCCEEEEEEECCC Q ss_conf -------7046998998087773899827999996067 Q gi|254780425|r 460 -------GVSAFLDERGQIISSVYADRGASIDMHFQPK 490 (518) Q Consensus 460 -------GiS~~id~~G~i~~~~~~~~~~~l~~~v~~~ 490 (518) |.|+++|-+|+++.+-...+-+...+++.+. T Consensus 223 Dgv~~yfG~s~i~~fDG~~L~~~~~~~~~i~~Ael~~~ 260 (333) T PRK13287 223 DGVFYYFGEGQVCNFDGTTLVQGHRNPWEIVTAEVRPD 260 (333) T ss_pred CCCEEEECCCEEECCCCCEEEECCCCCCCEEEEEECHH T ss_conf 76414404616966788682104788882499996699 No 34 >PRK13981 NAD synthetase; Provisional Probab=99.79 E-value=4.4e-18 Score=136.28 Aligned_cols=219 Identities=16% Similarity=0.189 Sum_probs=147.2 Q ss_pred CCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEECCCCCCCCH----HHHHHH-------HHHHHHHHC Q ss_conf 321001244203555433335789987543101334668618983121011000----112346-------677656410 Q gi|254780425|r 243 PVIRIVQPGINPAIKEDREKILERYLSLTALPVSAGELEPVIIVWAYLPFPFSI----VDQPSI-------LKKIASVLK 311 (518) Q Consensus 243 ~~V~iVQpni~~~~k~~~~~~~~~~~~l~~~~~~~~~~~~dLiVWPEsa~p~~~----~~~~~~-------~~~l~~~~~ 311 (518) ++|++.|.|.-.. |.+...++..+..+ ++.+.++|++|.||-++.-+. -.++.+ .+.+.+..+ T Consensus 1 mkIalaQ~n~~vG---D~~~N~~~i~~~i~---~A~~~gadlvvfPEL~ltGY~~~DLl~~~~f~~~~~~al~~la~~~~ 74 (543) T PRK13981 1 LRIALAQLNPTVG---DIAGNAAKIRAAAA---EAADAGADLLLFPELFLSGYPPEDLLLRPAFLAACEAALERLAAATA 74 (543) T ss_pred CEEEEECCCCCCC---CHHHHHHHHHHHHH---HHHHCCCCEEEECCCHHCCCCHHHHHCCHHHHHHHHHHHHHHHHHCC T ss_conf 9899982899777---99999999999999---99987998999274021079957760388999999999999998538 Q ss_pred CCCEEEECCCCCCCCCCCCCEEEEEECCCCCCCCCCCCCCEEEEEHHHHHCCCCHHHHHHHHCCCCCCCCCCCCCCCCEE Q ss_conf 79824201100123455751210000367321221112563342213310010000121110012222101358776200 Q gi|254780425|r 312 INQLLIVGSMRKELVDGRSHFYKSIYIINSKGEILASSNAKHLVPFAEYLPYRNILKKLNLDFYMFPLDYSSSDISPLLG 391 (518) Q Consensus 312 ~~~~li~G~~~~~~~~~~~~~yNs~~~~~~~g~~~~~Y~K~~LVPFGEyiP~~~~~~~l~~~~~~~~~~~~~g~~~~~~~ 391 (518) .+..+++|.+..+ +++.||+++++. +|++...|+|++|=-.|| +....-|.+|+...++. T Consensus 75 ~~i~vivG~p~~~----~~~lyN~a~vi~-~G~i~~~~~K~~LPnY~v---------------FdE~RyF~~g~~~~~f~ 134 (543) T PRK13981 75 DGPAVLVGHPWRE----GGKLYNAAALLD-GGEILATYFKQDLPNYGV---------------FDEKRYFAPGPEPGPFE 134 (543) T ss_pred CCCEEEEEEEEEC----CCCEEEEEEEEE-CCEEEEEEECCCCCCCCC---------------HHHHHHCCCCCCCCEEE T ss_conf 9988999647824----990479999999-998999983211799850---------------20344214689776499 Q ss_pred ECCCCEEECCCCHHHHHHH-HHHC--CCCCCEEEEECCCCCCCCCHHHHHHHHHHHHHHHCCCEEEEEEC---------C Q ss_conf 0122001022216665588-8851--26997799962775576762899999999999972991999855---------9 Q gi|254780425|r 392 LSEKLRLYPLLFSDALFHQ-DINN--LENVSAILNIIDDSGFMGSGTDHSFRYAQIQAVEIGLPLIRATN---------N 459 (518) Q Consensus 392 ~~~~~~~~~~ICyE~~f~~-~~r~--~~~a~~lvn~tNd~Wf~~s~~~qH~~~a~~RAiE~~r~~vra~n---------t 459 (518) .++.++|..||.|.-+|+ ..+. .+|+++++|++-- -|...-..+-..+.+-||.|++.|++-+.. . T Consensus 135 -~~g~~iGv~ICED~W~~~~p~~~l~~~GadliinlsAS-p~~~gK~~~R~~l~~~~a~~~~~~~iY~N~vGgqdelVFD 212 (543) T PRK13981 135 -LKGVRIGVPICEDIWWPEGPAETLAEAGAELLLVPNAS-PYHRGKPDLRESVLRARVRETGLPLVYLNQVGGQDELVFD 212 (543) T ss_pred -ECCEEEEEEEECCCCCCCCHHHHHHHCCCCEEEECCCC-HHHCCCHHHHHHHHHHHHHHCCCCEEEECCCCCCCCEEEC T ss_conf -89999999720013778768999986599699981799-3463887999999999998669738986268998745761 Q ss_pred CCCEEECCCCEEEEECCCCCCEEEEEEECC Q ss_conf 704699899808777389982799999606 Q gi|254780425|r 460 GVSAFLDERGQIISSVYADRGASIDMHFQP 489 (518) Q Consensus 460 GiS~~id~~G~i~~~~~~~~~~~l~~~v~~ 489 (518) |-|.++|++|+++.+.+.|+++....++.. T Consensus 213 G~S~v~~~~G~l~~~~~~F~e~~~~vd~~~ 242 (543) T PRK13981 213 GASFVLNADGELAAQLPAFEEQIAVVDFDR 242 (543) T ss_pred CCCEEECCCCCHHHHCCCCCCCEEEEEEEC T ss_conf 864698799867564267652569998754 No 35 >pfam07186 consensus Probab=99.77 E-value=1.6e-15 Score=119.59 Aligned_cols=184 Identities=16% Similarity=0.124 Sum_probs=126.1 Q ss_pred HHHHHHHHHHHHHHCCCCCCCCCCEEEECCCCCCCCHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCEEEEEECC Q ss_conf 33357899875431013346686189831210110001123466776564107982420110012345575121000036 Q gi|254780425|r 260 REKILERYLSLTALPVSAGELEPVIIVWAYLPFPFSIVDQPSILKKIASVLKINQLLIVGSMRKELVDGRSHFYKSIYII 339 (518) Q Consensus 260 ~~~~~~~~~~l~~~~~~~~~~~~dLiVWPEsa~p~~~~~~~~~~~~l~~~~~~~~~li~G~~~~~~~~~~~~~yNs~~~~ 339 (518) +....+++.++....+....++.+++|.||++..+-....+...+. .+...+..+|.|+...+ ...|.|+++.+ T Consensus 209 ~~~~~~r~r~li~tv~~~~~~g~rvvvLPEs~lg~Wtp~~~~lW~~--~l~~~~~Tvi~Ga~v~~----~~GydnvlV~i 282 (396) T pfam07186 209 RDGSLQRQRDLIATVRRAAGEEARVVVLPESALGFWTPTVERLWRD--GLRGADVTVIAGAAVID----AGGYDNVMVTI 282 (396) T ss_pred CCCHHHHHHHHHHHHHHHCCCCCEEEECCCHHHCCCCCCHHHHHHH--HHCCCCCEEEEEEEECC----CCCCEEEEEEE T ss_conf 6305799999999999751699869990523422677406889898--73579937998544359----98703589997 Q ss_pred CCCCCCCCCCCCEEEEEHHHHHCCCCHHHHHHHHCCCCCCCCCCCCCCCCEEECCCCEEECCCCHHHHH--HHHHHCCCC Q ss_conf 732122111256334221331001000012111001222210135877620001220010222166655--888851269 Q gi|254780425|r 340 NSKGEILASSNAKHLVPFAEYLPYRNILKKLNLDFYMFPLDYSSSDISPLLGLSEKLRLYPLLFSDALF--HQDINNLEN 417 (518) Q Consensus 340 ~~~g~~~~~Y~K~~LVPFGEyiP~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~ICyE~~f--~~~~r~~~~ 417 (518) +.++..+ .|+-+..||++-.-|++.++..- .....++.. ++... -++.+++++||||..- |.+-.-+.+ T Consensus 283 ~~~~~ri-ly~~RmPVPvsMWqPW~~~~g~~----g~a~a~~f~---npvv~-i~g~rva~lICYEqLl~wP~Lqsml~~ 353 (396) T pfam07186 283 SADEARI-LYRERMPVPVSMWQPWLSWTGQG----GGARAHFFA---NPFAE-VAGSRNAPLICYEQLIVWPVLQSMLYS 353 (396) T ss_pred CCCCCEE-EEECCCCCCHHHCCCCCCCCCCC----CCCCCCCCC---CCEEE-ECCEEEEEEEEHHHHHHHHHHHHHHCC T ss_conf 1898337-62047876422035664335898----874245356---97489-768068887658888878999987519 Q ss_pred CCEEEEECCCCCCC--CCHHHHHHHHHHHHHHHCCCEEEEEECC Q ss_conf 97799962775576--7628999999999999729919998559 Q gi|254780425|r 418 VSAILNIIDDSGFM--GSGTDHSFRYAQIQAVEIGLPLIRATNN 459 (518) Q Consensus 418 a~~lvn~tNd~Wf~--~s~~~qH~~~a~~RAiE~~r~~vra~nt 459 (518) +|++|.+.|+ ||. +|-+.=|-+.++.=|-=.++|+|||.|| T Consensus 354 PDvIVa~~N~-Wwt~gTsi~aIQrasv~AwaRLfg~Pvv~A~N~ 396 (396) T pfam07186 354 PDAIVAVGNG-WWTEGTSIVAIQRASVTAWAKLFGKPVVIAFNT 396 (396) T ss_pred CCEEEECCCC-CCCCCCCHHHHHHHHHHHHHHHHCCCEEEEECC T ss_conf 9889983787-745898619999999999999837844887159 No 36 >PRK13286 amiE acylamide amidohydrolase; Provisional Probab=99.76 E-value=4.7e-17 Score=129.57 Aligned_cols=230 Identities=12% Similarity=0.088 Sum_probs=155.5 Q ss_pred CCCCCCCCCCCCCCCCCH-HHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEECCCCCCC---CHHHH--------HHHHH Q ss_conf 655555321001244203-5554333357899875431013346686189831210110---00112--------34667 Q gi|254780425|r 237 KFEKISPVIRIVQPGINP-AIKEDREKILERYLSLTALPVSAGELEPVIIVWAYLPFPF---SIVDQ--------PSILK 304 (518) Q Consensus 237 ~~~~~~~~V~iVQpni~~-~~k~~~~~~~~~~~~l~~~~~~~~~~~~dLiVWPEsa~p~---~~~~~--------~~~~~ 304 (518) .+.+.++.|++||=..|- +.|.+-.+..++..++.... +.+-+..||||+||-|.-- +...- ....+ T Consensus 7 sss~d~v~vavvqy~~P~~~t~~ev~~n~~~I~~mv~~~-k~G~PgmDLiVFPEYSt~Gi~yd~~~~~e~a~tipGpEt~ 85 (343) T PRK13286 7 SSSNDTVGVAVVNYKMPRLHTKAEVLENARKIADMIVGM-KQGLPGMDLVIFPEYSTHGIMYDRQEMYDTASTIPGEETA 85 (343) T ss_pred CCCCCCEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHH-HCCCCCCCEEECCCCCCCCCCCCHHHHHHHHHCCCCHHHH T ss_conf 789885599999527876689999999999999999997-5699997479743300366667857888656359977899 Q ss_pred HHHHHHCCCCEEEECCCC--CCCCCCCCCEEEEEECCCCCCCCCCCCCCEEEEEHHHHHCCCCHHHHHHHHCCCCCCCCC Q ss_conf 765641079824201100--123455751210000367321221112563342213310010000121110012222101 Q gi|254780425|r 305 KIASVLKINQLLIVGSMR--KELVDGRSHFYKSIYIINSKGEILASSNAKHLVPFAEYLPYRNILKKLNLDFYMFPLDYS 382 (518) Q Consensus 305 ~l~~~~~~~~~li~G~~~--~~~~~~~~~~yNs~~~~~~~g~~~~~Y~K~~LVPFGEyiP~~~~~~~l~~~~~~~~~~~~ 382 (518) .+.+..+++...-+=+.. .++.+.+...||+++++|++|+++..|.|.|. .+|-+ ... T Consensus 86 ~f~~ACr~~~vWGvFSi~ge~he~~P~k~PyNt~ilInd~GEIv~kYRKi~P-----WvPiE---------------~W~ 145 (343) T PRK13286 86 IFAAACRKAKVWGVFSLTGERHEEHPRKAPYNTLILINDKGEIVQKYRKIMP-----WCPIE---------------GWY 145 (343) T ss_pred HHHHHHHHCCEEEEEEEECCCCCCCCCCCCCCEEEEECCCCHHHHHHHHCCC-----CCCCC---------------CCC T ss_conf 9999998679589999622332338888985238998798612121010567-----66556---------------865 Q ss_pred CCCCCCCEEECCCCEEECCCCHHHHHHHHHHCC--CCCCEEEEECCCCCCCCCHHHHHHHHHHHHHHHCCCEEEEEECC- Q ss_conf 358776200012200102221666558888512--69977999627755767628999999999999729919998559- Q gi|254780425|r 383 SSDISPLLGLSEKLRLYPLLFSDALFHQDINNL--ENVSAILNIIDDSGFMGSGTDHSFRYAQIQAVEIGLPLIRATNN- 459 (518) Q Consensus 383 ~g~~~~~~~~~~~~~~~~~ICyE~~f~~~~r~~--~~a~~lvn~tNd~Wf~~s~~~qH~~~a~~RAiE~~r~~vra~nt- 459 (518) ||+...+..-+.+.|++.+||.|-.||++.|+. +||+++|-.+. |.....+|+-.+++.+|=+|..+++-+.-+ T Consensus 146 PGd~~~V~eGPkG~KisliIC~DG~yPEi~RecA~KGAeliiR~~G---YmyP~~~qw~~tnk~~A~~N~~Yva~vn~aG 222 (343) T PRK13286 146 PGDCTYVSEGPKGLKISLIICDDGNYPEIWRDCAMKGAELIVRCQG---YMYPAKEQQVLIAKAMAWANNCYVAVANAAG 222 (343) T ss_pred CCCCCEEEECCCCCEEEEEEECCCCCHHHHHHHHHCCCEEEEEECC---CCCHHHHHHHHHHHHHHHHHCCEEEEEEECC T ss_conf 7862078317887578899966998868999987468879997335---6661777899988888875131577774035 Q ss_pred --------CCCEEECCCCEEEEECCCCCCEEEEEEECCC Q ss_conf --------7046998998087773899827999996067 Q gi|254780425|r 460 --------GVSAFLDERGQIISSVYADRGASIDMHFQPK 490 (518) Q Consensus 460 --------GiS~~id~~G~i~~~~~~~~~~~l~~~v~~~ 490 (518) |.|+++|-+|+++.+....+-+...+++.+. T Consensus 223 ~Dgvy~yfG~s~i~~fDG~tL~e~g~~~~~i~~Ael~~~ 261 (343) T PRK13286 223 FDGVYSYFGHSAIIGFDGRTLGECGEEEMGIQYAQLSKS 261 (343) T ss_pred CCCCCCCCCCEEEECCCCCEEEECCCCCCCEEEEEECHH T ss_conf 787511358706966898774204788772389996599 No 37 >cd07587 ML_beta-AS mammalian-like beta-alanine synthase (beta-AS) and similar proteins (class 5 nitrilases). This subgroup includes mammalian-like beta-AS (EC 3.5.1.6, also known as beta-ureidopropionase or N-carbamoyl-beta-alanine amidohydrolase). This enzyme catalyzes the third and final step in the catabolic pyrimidine catabolic pathway responsible for the degradation of uracil and thymine, the hydrolysis of N-carbamyl-beta-alanine and N-carbamyl-beta-aminoisobutyrate to the beta-amino acids, beta-alanine and beta-aminoisobutyrate respectively. This subgroup belongs to a larger nitrilase superfamily comprised of nitrile- or amide-hydrolyzing enzymes and amide-condensing enzymes, which depend on a Glu-Lys-Cys catalytic triad. This superfamily has been classified in the literature based on global and structure based sequence analysis into thirteen different enzyme classes (referred to as 1-13), this subgroup corresponds to class 5. Members of this superfamily generally form homomeric Probab=99.70 E-value=2.8e-16 Score=124.49 Aligned_cols=239 Identities=14% Similarity=0.114 Sum_probs=163.9 Q ss_pred CCCCCCCCCCCCCCH--HH--HHHHHHHHHHHHHHHHHCCCCCCCCCCEEEECCCC-CCCCHH--------------HHH Q ss_conf 555321001244203--55--54333357899875431013346686189831210-110001--------------123 Q gi|254780425|r 240 KISPVIRIVQPGINP--AI--KEDREKILERYLSLTALPVSAGELEPVIIVWAYLP-FPFSIV--------------DQP 300 (518) Q Consensus 240 ~~~~~V~iVQpni~~--~~--k~~~~~~~~~~~~l~~~~~~~~~~~~dLiVWPEsa-~p~~~~--------------~~~ 300 (518) ..-++|++||-.|.. .. ..+++.+.++..++.+ .++..++.++-.-|.- .|+.+- .+. T Consensus 61 pr~vrvg~iqn~i~~pt~~p~~~q~~ai~~~i~~~i~---~Aa~~gvni~c~qe~w~mpfafctrek~pw~efae~a~~G 137 (363) T cd07587 61 PRIVRVGLIQNKIVLPTTAPIAEQREAIHDRIKKIIE---AAAMAGVNIICFQEAWTMPFAFCTREKLPWCEFAESAEDG 137 (363) T ss_pred CCEEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHH---HHHHCCCCEEEEEHHHCCCCEECCCCCCCHHHHCCCCCCC T ss_conf 8478997874045599986689999999999999999---9997598799840010587211135777602212667779 Q ss_pred HHHHHHHHHHCCCCEEEECCCCCCCCCCCCCEEEEEECCCCCCCCCCCCCCEEEEEHHHHHCCCCHHHHHHHHCCCCCCC Q ss_conf 46677656410798242011001234557512100003673212211125633422133100100001211100122221 Q gi|254780425|r 301 SILKKIASVLKINQLLIVGSMRKELVDGRSHFYKSIYIINSKGEILASSNAKHLVPFAEYLPYRNILKKLNLDFYMFPLD 380 (518) Q Consensus 301 ~~~~~l~~~~~~~~~li~G~~~~~~~~~~~~~yNs~~~~~~~g~~~~~Y~K~~LVPFGEyiP~~~~~~~l~~~~~~~~~~ 380 (518) ...+.+++++++.+-+|+.-...++...++-+||++++++.||++.+.|.|+| .|=+ | . +....- T Consensus 138 ptt~~~~~~A~k~~mViv~pi~Erd~~~~~~~~NTavVId~dG~~lGkyRK~H-IP~~---~----------~-f~EkfY 202 (363) T cd07587 138 PTTKFCQELAKKYNMVIVSPILERDEEHGDTIWNTAVVISNSGNVLGKSRKNH-IPRV---G----------D-FNESTY 202 (363) T ss_pred CHHHHHHHHHHHCCEEEEEECCCCCCCCCCEEEEEEEEECCCCCEEEEEECCC-CCCC---C----------C-CCEEEC T ss_conf 36999999999849699984200334458757866899978986642221133-8999---9----------9-553121 Q ss_pred CCCCCCCCCEEECCCCEEECCCCHHHHHHHHHHC--CCCCCEEEEECCCCCCCC-CHH-HHHHHHHHHHHHHCCCEEEEE Q ss_conf 0135877620001220010222166655888851--269977999627755767-628-999999999999729919998 Q gi|254780425|r 381 YSSSDISPLLGLSEKLRLYPLLFSDALFHQDINN--LENVSAILNIIDDSGFMG-SGT-DHSFRYAQIQAVEIGLPLIRA 456 (518) Q Consensus 381 ~~~g~~~~~~~~~~~~~~~~~ICyE~~f~~~~r~--~~~a~~lvn~tNd~Wf~~-s~~-~qH~~~a~~RAiE~~r~~vra 456 (518) |.||+..-......-.++|+.||||--||+-.|. ++||+++.|+|. ..+. |.+ |+ ..+|-.||-|+.+++-+ T Consensus 203 f~pGn~G~~Vf~T~~GkIGV~IC~DqwfPe~aR~~aL~GAEIvf~PtA--t~g~~se~~W~--~eqr~hAiAN~~~v~ai 278 (363) T cd07587 203 YMEGNTGHPVFETQFGKIAVNICYGRHHPLNWLMYGLNGAEIVFNPSA--TVGALSEPMWP--IEARNAAIANSYFTVGI 278 (363) T ss_pred CCCCCCCCCCEECCCCEEEEEEECCCCCHHHHHHHHHCCCEEEECCCC--CCCCCCHHHHH--HHHHHHHHHCCCCEEEE T ss_conf 368878987554364116645631543748899864168649975455--45652367899--98899999708837982 Q ss_pred ECC-------------------------CCCEEECCCCEEEEECCCCCCEEEEEEECCCCCCCHHHHHH Q ss_conf 559-------------------------70469989980877738998279999960676876878655 Q gi|254780425|r 457 TNN-------------------------GVSAFLDERGQIISSVYADRGASIDMHFQPKVRDSFRSSIQ 500 (518) Q Consensus 457 ~nt-------------------------GiS~~id~~G~i~~~~~~~~~~~l~~~v~~~~~~T~y~~~g 500 (518) ..- |-|-+.||+|+++.+++..++++++.++-+...+..-..|| T Consensus 279 NRVG~E~~pne~tsGdg~~a~~d~g~FYGSSfia~P~G~~~~~lsr~~eglLV~eiDL~~~rqvR~~W~ 347 (363) T cd07587 279 NRVGTEVFPNEFTSGDGKPAHKDFGHFYGSSYVAAPDGSRTPGLSRTRDGLLVAELDLNLCRQVKDKWG 347 (363) T ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCCCCCEEEEEECHHHHHHHHHHCC T ss_conf 355625676544567775343557642252147889987645778887636999933689999987478 No 38 >KOG0806 consensus Probab=99.64 E-value=1.4e-15 Score=119.92 Aligned_cols=228 Identities=14% Similarity=0.035 Sum_probs=147.9 Q ss_pred CCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEECCCCCCC---------CHHHH------HHHHH Q ss_conf 5553210012442035554333357899875431013346686189831210110---------00112------34667 Q gi|254780425|r 240 KISPVIRIVQPGINPAIKEDREKILERYLSLTALPVSAGELEPVIIVWAYLPFPF---------SIVDQ------PSILK 304 (518) Q Consensus 240 ~~~~~V~iVQpni~~~~k~~~~~~~~~~~~l~~~~~~~~~~~~dLiVWPEsa~p~---------~~~~~------~~~~~ 304 (518) ..+.+++.||--..-. +..+.+++..+..+. +.+.++++||.||-...- +..+. ....+ T Consensus 11 ~~~~~~a~vq~~~~l~---~~~~Ni~~~~~~i~~---aa~~g~~iIv~PE~~~~gy~~~~sf~py~E~i~~~~~~~ps~~ 84 (298) T KOG0806 11 LPNATEALVSLEEALL---LMNENIDILEKAVKE---AAKQGAKIIVFPEDGLYGYNFTESFYPYLEDIPDPGCRDPSRQ 84 (298) T ss_pred CCCCCEEEEECCCCHH---HHHHHHHHHHHHHHH---HHHCCCCEEECHHHCCCCCCCCCCCCCHHHHCCCCCCCCHHHH T ss_conf 5345332210122112---334658999999999---9864980998731325652456531214430899666883688 Q ss_pred HHHHHHCCCCEEEECCCCCCCCCCCCCEEEEEECCCCCCCCCCCCCCEEEEEHHHHHCCCCHHHHHHHHCCCCCCCCCCC Q ss_conf 76564107982420110012345575121000036732122111256334221331001000012111001222210135 Q gi|254780425|r 305 KIASVLKINQLLIVGSMRKELVDGRSHFYKSIYIINSKGEILASSNAKHLVPFAEYLPYRNILKKLNLDFYMFPLDYSSS 384 (518) Q Consensus 305 ~l~~~~~~~~~li~G~~~~~~~~~~~~~yNs~~~~~~~g~~~~~Y~K~~LVPFGEyiP~~~~~~~l~~~~~~~~~~~~~g 384 (518) .+.+.+++...+++|+.-.+.. ..++.|||...++++|+-...|+|+||- ---.|... . +....-+++| T Consensus 85 ~ls~va~~~~~~~i~g~i~~~~-~~~k~yns~~~~~~~g~l~~~yrk~hlF--D~d~~~~~-------r-y~e~~~~~~g 153 (298) T KOG0806 85 GLSEVAERLSCYIIGGSIEEEA-LGDKLYNSCADSSCPGDGLAKYRKNHLF--DTDGPGVI-------R-YRESHLLSPG 153 (298) T ss_pred HHHHHHHHCEEEEECCCCHHHC-CCCCCCCCCCCCCCCCCHHHEEEEEEEE--CCCCCCCE-------E-EEEEECCCCC T ss_conf 7688877445899657614311-4664457512258880511245234773--36786620-------2-2321013776 Q ss_pred CCCCCEEECCCCEEECCCCHHHHHHHHHHCC--CCCCEEEEECCCCCC---CCCHHHHHHHHHHHHHHHCCCEEEEEEC- Q ss_conf 8776200012200102221666558888512--699779996277557---6762899999999999972991999855- Q gi|254780425|r 385 DISPLLGLSEKLRLYPLLFSDALFHQDINNL--ENVSAILNIIDDSGF---MGSGTDHSFRYAQIQAVEIGLPLIRATN- 458 (518) Q Consensus 385 ~~~~~~~~~~~~~~~~~ICyE~~f~~~~r~~--~~a~~lvn~tNd~Wf---~~s~~~qH~~~a~~RAiE~~r~~vra~n- 458 (518) ........ ...++|..||||+.|+|..+.+ ++++.||-+| +|. ...++.|-....|+||.+|..+++-++- T Consensus 154 ~~f~~~~~-~~gkfGi~IC~Di~F~d~A~~~~~~g~~~ivyPt--aw~~~~l~~~~~hw~~~~~~~a~~n~~~v~~~s~~ 230 (298) T KOG0806 154 DQFTVVDT-SYGKFGIFICFDIRFYDPAMILVKDGADLIVYPT--AWNNELLSAVPLHWALLMRARANDNAANVHAPSPA 230 (298) T ss_pred CCCCCCCC-CCCCEEEEEEECCCCCCHHHHHHHCCCCEEEECC--HHHHHCCCCCCHHHHHHHHCCCCCCEEEEECCCCC T ss_conf 66774457-8871579998314552367999971884899360--67400146651579999747755220332115767 Q ss_pred ---------CC-CCEEECCCCEEEEECCCCCCEEEEEEEC Q ss_conf ---------97-0469989980877738998279999960 Q gi|254780425|r 459 ---------NG-VSAFLDERGQIISSVYADRGASIDMHFQ 488 (518) Q Consensus 459 ---------tG-iS~~id~~G~i~~~~~~~~~~~l~~~v~ 488 (518) -| .|-+++|.|+++.+....++ +....+- T Consensus 231 ~~~s~~y~~~gshs~~~~p~gkvl~a~~~~~e-~~~a~~d 269 (298) T KOG0806 231 RTGSGIYAPRGSHSIMVNPTGKVLAAAVEKEE-IIYADVD 269 (298) T ss_pred CCCCEEEECCCCCEEECCCCCEEEEECCCCCC-CCCCCCC T ss_conf 67860241587632202885247630257776-3124479 No 39 >PRK02628 nadE NAD synthetase; Reviewed Probab=99.59 E-value=1.2e-13 Score=107.33 Aligned_cols=234 Identities=15% Similarity=0.114 Sum_probs=136.5 Q ss_pred CCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEECCCCCCCCH----HHHHHHH----HHHHHHHC Q ss_conf 555321001244203555433335789987543101334668618983121011000----1123466----77656410 Q gi|254780425|r 240 KISPVIRIVQPGINPAIKEDREKILERYLSLTALPVSAGELEPVIIVWAYLPFPFSI----VDQPSIL----KKIASVLK 311 (518) Q Consensus 240 ~~~~~V~iVQpni~~~~k~~~~~~~~~~~~l~~~~~~~~~~~~dLiVWPEsa~p~~~----~~~~~~~----~~l~~~~~ 311 (518) ..-++|++.|||+... |.+...++..+..+ ++.+.++||+|.||-++.-+. -.++.+. +.+.++++ T Consensus 10 ~gfirvA~a~~~~~Vg---D~~~N~~~I~~~i~---~A~~~ga~lvvfPEL~ltGY~~eDL~l~~~fl~~~~~aL~~l~~ 83 (678) T PRK02628 10 HGFVRVAAATPKVRVA---DPAFNAERILALAR---EAHDDGVALAVFPELSLSGYSCGDLFLQDALLDAVEDALAALVE 83 (678) T ss_pred CCEEEEEEECCCCCCC---CHHHHHHHHHHHHH---HHHHCCCCEEECCCCCCCCCCHHHHHCCHHHHHHHHHHHHHHHH T ss_conf 7712799947898753---88999999999999---99988998998255203389889985799999999999999998 Q ss_pred ---C-CCEEEECCCCCCCCCCCCCEEEEEECCCCCCCCCCCCCCEEEEEHHHHHCCCCHHHHHHHH--CCCCCCCCCCCC Q ss_conf ---7-9824201100123455751210000367321221112563342213310010000121110--012222101358 Q gi|254780425|r 312 ---I-NQLLIVGSMRKELVDGRSHFYKSIYIINSKGEILASSNAKHLVPFAEYLPYRNILKKLNLD--FYMFPLDYSSSD 385 (518) Q Consensus 312 ---~-~~~li~G~~~~~~~~~~~~~yNs~~~~~~~g~~~~~Y~K~~LVPFGEyiP~~~~~~~l~~~--~~~~~~~~~~g~ 385 (518) . +..+++|.+... +++.||++.++ .+|++.+.|.|++|==.||+-=-|.|-++-... .......-.|-+ T Consensus 84 ~t~~~~~~viVG~P~~~----~~~LyN~A~vi-~~G~Ilg~~~K~~LPNY~vFdE~R~F~~g~~~~~~~i~~~g~~vPfG 158 (678) T PRK02628 84 ASADLDPVLVVGAPLRV----RHRLYNCAVVI-HRGRILGVVPKSYLPNYREFYEKRWFAPGDGIRGETIRLCGQEVPFG 158 (678) T ss_pred HCCCCCEEEEEECEEEC----CCCEEEEEEEE-ECCEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCEEEECCCCCCCC T ss_conf 57898859999330620----89548899999-99999999704338799765746367777778861683478220456 Q ss_pred CCCCEE--ECCCCEEECCCCHHHHHHH---HHHCCCCCCEEEEECCCCC-CCCCHHHHHHHHHHHHHHHCCCEEEEEEC- Q ss_conf 776200--0122001022216665588---8851269977999627755-76762899999999999972991999855- Q gi|254780425|r 386 ISPLLG--LSEKLRLYPLLFSDALFHQ---DINNLENVSAILNIIDDSG-FMGSGTDHSFRYAQIQAVEIGLPLIRATN- 458 (518) Q Consensus 386 ~~~~~~--~~~~~~~~~~ICyE~~f~~---~~r~~~~a~~lvn~tNd~W-f~~s~~~qH~~~a~~RAiE~~r~~vra~n- 458 (518) ..-.+. ..++.++|..||.|.-.|. .....+||++|+|+|---. .|+.. +-..+.+-+|.+++-+.+-+.- T Consensus 159 ~dllF~~~~~~g~~~GveICEDlW~~~~P~~~la~~GA~lI~NlSASp~~~gK~~--~R~~lv~~~s~~~~~~yvY~n~G 236 (678) T PRK02628 159 TDLLFEAEDLPGFVFGVEICEDLWVPIPPSSYAALAGATVLANLSASNITVGKAD--YRRLLVASQSARCLAAYVYAAAG 236 (678) T ss_pred CCEEEEECCCCCCEEEEEEEECCCCCCCHHHHHHHCCCCEEEECCCCHHHCCCHH--HHHHHHHHHHHHHCCCEEEEECC T ss_conf 6525654166883899997211488998589998749887971688803348699--99999999988628856998235 Q ss_pred C----------CCCEEECCCCEEEEECCCCCCE--EEEEEE Q ss_conf 9----------7046998998087773899827--999996 Q gi|254780425|r 459 N----------GVSAFLDERGQIISSVYADRGA--SIDMHF 487 (518) Q Consensus 459 t----------GiS~~id~~G~i~~~~~~~~~~--~l~~~v 487 (518) - |-| +|..+|+++++.+.|+++ .+.+++ T Consensus 237 ~gEsttDlVFdG~s-~I~~nG~lla~~~~F~~~~~l~~adi 276 (678) T PRK02628 237 VGESTTDLAWDGQT-LIYENGELLAESERFPREEQLIIADI 276 (678) T ss_pred CCCCCCCEEEECCE-EEECCCCEEEECCCCCCCCCEEEEEE T ss_conf 78767606996886-78728816753356666763689983 No 40 >cd07567 biotinidase_like biotinidase and vanins (class 4 nitrilases). These secondary amidases participate in vitamin recycling. Biotinidase (EC 3.5.1.12) has both a hydrolase and a transferase activity. It hydrolyzes free biocytin or small biotinyl-peptides produced during the proteolytic degradation of biotin-dependent carboxylases, to release free biotin (vitamin H), and it can transfer biotin to acceptor molecules such as histones. Biotinidase deficiency in humans is an autosomal recessive disorder characterized by neurological and cutaneous symptoms. This subgroup includes the three human vanins, vanin1-3. Vanins are ectoenzymes, Vanin-1, and -2 are membrane associated, vanin-3 is secreted. They are pantotheinases (EC 3.5.1.92, pantetheine hydrolase), which convert pantetheine, to pantothenic acid (vitamin B5) and cysteamine (2-aminoethanethiol, a potent anti-oxidant). They are potential targets for therapeutic intervention in inflammatory disorders. Vanin-1 deficient mice lacking Probab=99.54 E-value=3.4e-13 Score=104.46 Aligned_cols=207 Identities=18% Similarity=0.184 Sum_probs=132.5 Q ss_pred HHHHHHHHHHHHCCCCCCCCCCEEEECCCCCCC---------CH-------------------HHHHHHHHHHHHHHCCC Q ss_conf 357899875431013346686189831210110---------00-------------------11234667765641079 Q gi|254780425|r 262 KILERYLSLTALPVSAGELEPVIIVWAYLPFPF---------SI-------------------VDQPSILKKIASVLKIN 313 (518) Q Consensus 262 ~~~~~~~~l~~~~~~~~~~~~dLiVWPEsa~p~---------~~-------------------~~~~~~~~~l~~~~~~~ 313 (518) +.++.|.++.+ +++..++|+||.||..+-. .. .+....++++.=+++++ T Consensus 24 ~Nl~~y~~~i~---~a~~~~adIIVFPE~gl~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~l~~LSC~Ar~~ 100 (299) T cd07567 24 KNLDIYEEIIK---SAAKQGADIIVFPEDGLTGFIFTRFVIYPFLEDVPDPEVNWNPCLDPDRFDYTEVLQRLSCAAREN 100 (299) T ss_pred HHHHHHHHHHH---HHHHCCCCEEECCCCCCCCCCCCCCEECCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHC T ss_conf 99999999999---776559989988674446776577423521023799655666656844455189999877776417 Q ss_pred CEEEECCC-CCCC--------CCCCCCEEEEEECCCCCCCCCCCCCCEEEEEHHHHHCCCCHHHHHHHHCCCCCCCCCCC Q ss_conf 82420110-0123--------45575121000036732122111256334221331001000012111001222210135 Q gi|254780425|r 314 QLLIVGSM-RKEL--------VDGRSHFYKSIYIINSKGEILASSNAKHLVPFAEYLPYRNILKKLNLDFYMFPLDYSSS 384 (518) Q Consensus 314 ~~li~G~~-~~~~--------~~~~~~~yNs~~~~~~~g~~~~~Y~K~~LVPFGEyiP~~~~~~~l~~~~~~~~~~~~~g 384 (518) ...++-.. ..+. .+++..+||+.+++|.+|.++.||+|.||- +| | ..+..+. T Consensus 101 ~~YvV~Nl~ek~~C~~~~~~Cp~dg~~~yNTnVVFdr~G~vVaRYrK~nLf--~E--~---------------~~~~~~~ 161 (299) T cd07567 101 SIYVVANLGEKQPCDSSDPHCPPDGRYQYNTNVVFDRDGTLIARYRKYNLF--GE--P---------------GFDVPPE 161 (299) T ss_pred CEEEEEECCCCCCCCCCCCCCCCCCCEEEECEEEECCCCCEEEEEECCCCC--CC--C---------------CCCCCCC T ss_conf 869999476454477888899755527860307875888378653022214--66--4---------------5578898 Q ss_pred CCCCCEEECCCCEEECCCCHHHHHHHHHHCC---CCCCEEEEECCCCCCCCCHHHHH-HHHHHHHHHHCCCEEEEEECC- Q ss_conf 8776200012200102221666558888512---69977999627755767628999-999999999729919998559- Q gi|254780425|r 385 DISPLLGLSEKLRLYPLLFSDALFHQDINNL---ENVSAILNIIDDSGFMGSGTDHS-FRYAQIQAVEIGLPLIRATNN- 459 (518) Q Consensus 385 ~~~~~~~~~~~~~~~~~ICyE~~f~~~~r~~---~~a~~lvn~tNd~Wf~~s~~~qH-~~~a~~RAiE~~r~~vra~nt- 459 (518) .+...+...-|+++|..||||+.|.+-+.++ .+-+-+|-+| +||.. -|..+ .+.=+.=|..++.-++ |||. T Consensus 162 pe~~tF~TdFGV~FG~FiCFDiLF~~PA~~Lv~~~~V~~iv~pt--~W~s~-lPfLtAvq~q~aWA~~~~VNlL-AA~~~ 237 (299) T cd07567 162 PEIVTFDTDFGVTFGIFTCFDILFKEPALELVKKLGVDDIVFPT--AWFSE-LPFLTAVQIQQAWAYANGVNLL-AANYN 237 (299) T ss_pred CCCEEEECCCCCEEEEEEEEEECCCCHHHHHHHHCCCCEEEECH--HHHCC-CCCHHHHHHHHHHHHHCCCCEE-ECCCC T ss_conf 75137866876345269974114556179999863988599466--77445-7106788899999987597089-23688 Q ss_pred ------CCCEEECCC-CEEEEECCCC-CCEEEEEEECCCCCCC Q ss_conf ------704699899-8087773899-8279999960676876 Q gi|254780425|r 460 ------GVSAFLDER-GQIISSVYAD-RGASIDMHFQPKVRDS 494 (518) Q Consensus 460 ------GiS~~id~~-G~i~~~~~~~-~~~~l~~~v~~~~~~T 494 (518) +=|++-.++ |++....... ++.-++++||.....+ T Consensus 238 ~p~~~~tGSGIY~g~~g~~~~~~~~~~~~kLlva~vpk~p~~~ 280 (299) T cd07567 238 NPSAGMTGSGIYAGRSGALVYHYDNEPGGKLLVAEVPKLPSRR 280 (299) T ss_pred CCCCCCCCCCEECCCCCCEEEEECCCCCCEEEEEECCCCCCCC T ss_conf 9877886465657998537998369997459999778888878 No 41 >cd07566 ScNTA1_like Saccharomyces cerevisiae N-terminal amidase NTA1, and related proteins (class 3 nitrilases). Saccharomyces cerevisiae NTA1 functions in the N-end rule protein degradation pathway. It specifically deaminates the N-terminal asparagine and glutamine residues of substrates of this pathway, to aspartate and glutamate respectively, these latter are the destabilizing residues. This subgroup belongs to a larger nitrilase superfamily comprised of nitrile- or amide-hydrolyzing enzymes and amide-condensing enzymes, which depend on a Glu-Lys-Cys catalytic triad. This superfamily has been classified in the literature based on global and structure based sequence analysis into thirteen different enzyme classes (referred to as 1-13), this subgroup corresponds to class 3. Probab=99.53 E-value=1.1e-13 Score=107.65 Aligned_cols=216 Identities=17% Similarity=0.099 Sum_probs=117.4 Q ss_pred CCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCC-CCCCCCCCEEEECCCCCCCCH----HHHH---------HHHHHHHHH Q ss_conf 21001244203555433335789987543101-334668618983121011000----1123---------466776564 Q gi|254780425|r 244 VIRIVQPGINPAIKEDREKILERYLSLTALPV-SAGELEPVIIVWAYLPFPFSI----VDQP---------SILKKIASV 309 (518) Q Consensus 244 ~V~iVQpni~~~~k~~~~~~~~~~~~l~~~~~-~~~~~~~dLiVWPEsa~p~~~----~~~~---------~~~~~l~~~ 309 (518) +|+++|-|--.. +.++.+++..++.+... .....++||||.||-++.-+. .+.. ...+...+. T Consensus 1 RIA~lQ~~P~~G---dV~~Nl~ra~~ll~~~~~~~~~~~~dLlVlPEl~~TGY~f~~~~ei~~~~e~~~~G~s~~~~~~~ 77 (295) T cd07566 1 RIACLQLNPQIG---QVEENLSRAWELLDKTKKRAKLKKPDILVLPELALTGYNFHSLEHIKPYLEPTTSGPSFEWAREV 77 (295) T ss_pred CEEEEECCCCCC---CHHHHHHHHHHHHHHHHHHHHCCCCCEEECCCCCCCCCCCCCHHHHHHHHCCCCCCCHHHHHHHH T ss_conf 979997288478---77888999999999998653125999898687355688989978988765025887189999999 Q ss_pred HCC-CCEEEECCCCCCCCCCCCCEEEEEECCCCCCCCCCCCCCEEEEEHHHHHCCCCHHHHHH-HHCCCCCCCCCCCCCC Q ss_conf 107-98242011001234557512100003673212211125633422133100100001211-1001222210135877 Q gi|254780425|r 310 LKI-NQLLIVGSMRKELVDGRSHFYKSIYIINSKGEILASSNAKHLVPFAEYLPYRNILKKLN-LDFYMFPLDYSSSDIS 387 (518) Q Consensus 310 ~~~-~~~li~G~~~~~~~~~~~~~yNs~~~~~~~g~~~~~Y~K~~LVPFGEyiP~~~~~~~l~-~~~~~~~~~~~~g~~~ 387 (518) +++ +..+++|.+..+. +...++|||+++++|+|+++..|+|.||=---| .+-..-+ ..+......+.++... T Consensus 78 A~k~~~~vvvGypE~~~-~~~~~~YNSa~vv~p~G~~l~~YRK~hL~~tde-----~Wa~~g~~~~~~~~~~~~~~~~~~ 151 (295) T cd07566 78 AKKFNCHVVIGYPEKVD-ESSPKLYNSALVVDPEGEVVFNYRKSFLYYTDE-----EWGCEENPGGFQTFPLPFAKDDDF 151 (295) T ss_pred HHHCCEEEEEEEEEECC-CCCCCEEEEEEEECCCCCEEEEEECCCCCCCCH-----HHHCCCCCCCCCEECCCCCCCCCC T ss_conf 99779189997303258-877623434799878986611561113655550-----232357888751010355666666 Q ss_pred CCEEECCCCEEECCCCHHH---------HHHHHHHC--CCCCCEEEEECCCCCCCCCH----------HH---HHHHHHH Q ss_conf 6200012200102221666---------55888851--26997799962775576762----------89---9999999 Q gi|254780425|r 388 PLLGLSEKLRLYPLLFSDA---------LFHQDINN--LENVSAILNIIDDSGFMGSG----------TD---HSFRYAQ 443 (518) Q Consensus 388 ~~~~~~~~~~~~~~ICyE~---------~f~~~~r~--~~~a~~lvn~tNd~Wf~~s~----------~~---qH~~~a~ 443 (518) .........+++..||+|+ ..-+..+. .+++++++.++ +|..... |. +..-+.| T Consensus 152 ~~~~~~~~~~~~~gICmDiNPYkFeaP~~~~Efa~~~~~~~a~lii~~m--aWl~~~s~~~~~~~~~~p~~~t~~yW~~R 229 (295) T cd07566 152 DGGSVDVTLKTSIGICMDLNPYKFEAPFTDFEFATHVLDNGTELIICPM--AWLHSLSPTELTVLPQEPDTETVSYWLQR 229 (295) T ss_pred CCCCCCCCEEEEEEEEECCCCCCCCCCCCHHHHHHHHHHCCCCEEEEEC--CCCCCCCCCCCCCCCCCCCHHHHHHHHHH T ss_conf 7643357402699998348852356896288999999866998999977--55677883101367788611389999997 Q ss_pred HH----HHHCCCEEEEEECCCCCEEECCCCE Q ss_conf 99----9972991999855970469989980 Q gi|254780425|r 444 IQ----AVEIGLPLIRATNNGVSAFLDERGQ 470 (518) Q Consensus 444 ~R----AiE~~r~~vra~ntGiS~~id~~G~ 470 (518) +. +-|++..+|-|...|...-+...|. T Consensus 230 ~~pl~~~~~~~~~vv~aNR~G~E~~~~f~G~ 260 (295) T cd07566 230 FEPLRAEPLEGTQVVFCNRIGTENDTLYAGS 260 (295) T ss_pred HCHHHCCCCCCCEEEEEECCCCCCCEEECCC T ss_conf 3433227666618999701225488345155 No 42 >KOG0805 consensus Probab=99.06 E-value=3.9e-09 Score=77.99 Aligned_cols=214 Identities=14% Similarity=0.135 Sum_probs=133.6 Q ss_pred CCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEECCCCCCCCHH--------------HHHH----- Q ss_conf 553210012442035554333357899875431013346686189831210110001--------------1234----- Q gi|254780425|r 241 ISPVIRIVQPGINPAIKEDREKILERYLSLTALPVSAGELEPVIIVWAYLPFPFSIV--------------DQPS----- 301 (518) Q Consensus 241 ~~~~V~iVQpni~~~~k~~~~~~~~~~~~l~~~~~~~~~~~~dLiVWPEsa~p~~~~--------------~~~~----- 301 (518) ...+|++||......++ ...+++..++.. ++++.++.||+.||.-+.-+.. ..++ T Consensus 16 s~~~v~ivQ~~t~~~dt---paTL~K~~~~~~---Eaa~~Ga~LV~fPEAfiGGYPrg~~Fg~~~G~r~~eGR~ef~kY~ 89 (337) T KOG0805 16 SIVRVTIVQASTVYNDT---PATLDKAEKYIV---EAASKGAELVLFPEAFIGGYPRGFRFGLAVGVRNEEGRDEFRKYH 89 (337) T ss_pred CCEEEEEEECCCCCCCC---HHHHHHHHHHHH---HHHHCCCEEEEEEHHHHCCCCCCCEEEEEEEECCHHHHHHHHHHH T ss_conf 52479999726678898---778999999999---986078549981168606887775035888503623179999998 Q ss_pred ---------HHHHHHHHHCCC-CEEEECCCCCCCCCCCCCEEEEEECCCCCCCCCCCCCCEEEEEH-HHHHCCCCHHHHH Q ss_conf ---------667765641079-82420110012345575121000036732122111256334221-3310010000121 Q gi|254780425|r 302 ---------ILKKIASVLKIN-QLLIVGSMRKELVDGRSHFYKSIYIINSKGEILASSNAKHLVPF-AEYLPYRNILKKL 370 (518) Q Consensus 302 ---------~~~~l~~~~~~~-~~li~G~~~~~~~~~~~~~yNs~~~~~~~g~~~~~Y~K~~LVPF-GEyiP~~~~~~~l 370 (518) ..+++..++++| ..+.+|...++ +...|-++++++|+|..++... +|.|= +|..= T Consensus 90 a~AIev~gpEv~~l~~la~~~~v~lv~G~iEre----g~TLYCt~~f~~p~g~~lGKHR--KlmPTalERci-------- 155 (337) T KOG0805 90 ASAIEVPGPEVERLAELAKKNNVYLVMGAIERE----GYTLYCTVLFFSPQGQFLGKHR--KLMPTALERCI-------- 155 (337) T ss_pred HHHHCCCCHHHHHHHHHHHCCCEEEEEEEEECC----CCEEEEEEEEECCCCCCCCCCC--CCCCCHHHHEE-------- T ss_conf 877507984799999986147758999898405----6179999999878776110000--00641445505-------- Q ss_pred HHHCCCCCCCCCCCCCCCCEEECCCCEEECCCCHHHHHHHHHHCC--CCCCEEEEECCCCCCCCCHHHHHHHHHHHHHHH Q ss_conf 110012222101358776200012200102221666558888512--699779996277557676289999999999997 Q gi|254780425|r 371 NLDFYMFPLDYSSSDISPLLGLSEKLRLYPLLFSDALFHQDINNL--ENVSAILNIIDDSGFMGSGTDHSFRYAQIQAVE 448 (518) Q Consensus 371 ~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~ICyE~~f~~~~r~~--~~a~~lvn~tNd~Wf~~s~~~qH~~~a~~RAiE 448 (518) -+..-|+.-|++.. .-.++|..||+|-.-|-+-..+ ++-+.-..+|-|+- ..||. --+-=|.| T Consensus 156 --------WGqGDGSTiPV~dT-~iGKIG~AICWEN~MPl~R~alY~KgieIycAPT~D~r----~~w~~--sM~~IAlE 220 (337) T KOG0805 156 --------WGQGDGSTIPVYDT-PIGKIGAAICWENRMPLYRTALYAKGIEIYCAPTADGR----KEWQS--SMLHIALE 220 (337) T ss_pred --------ECCCCCCCCCEEEC-CCCCCCEEEECCCCCHHHHHHHHHCCCEEEECCCCCCC----HHHHH--HHHHHEEC T ss_conf --------50588842202304-55400101211133679999987558579962677861----88998--64660102 Q ss_pred CCCEEEEEEC-----------------------------CCCCEEECCCCEEEEECCCCCCEEEEEEECC Q ss_conf 2991999855-----------------------------9704699899808777389982799999606 Q gi|254780425|r 449 IGLPLIRATN-----------------------------NGVSAFLDERGQIISSVYADRGASIDMHFQP 489 (518) Q Consensus 449 ~~r~~vra~n-----------------------------tGiS~~id~~G~i~~~~~~~~~~~l~~~v~~ 489 (518) -|-.++-|.. .|=|.+|||.|+|++.-...+++.+.+++.+ T Consensus 221 G~cFvlSA~QF~k~~d~p~~peyl~~~~~~~k~pD~vv~~GGSviI~PlG~VlagP~~~~EgL~tadldl 290 (337) T KOG0805 221 GGCFVLSACQFCKRKDFPDHPDYLFTDWYDDKEPDSVVSQGGSVIISPLGQVLAGPNFESEGLITADLDL 290 (337) T ss_pred CCEEEEEHHHHCCCCCCCCCCHHHCCCCHHHCCCCCCEECCCCEEECCCCCEECCCCCCCCCEEEEECCC T ss_conf 7469998212043125899704422620120288862346973887554233226886766038873060 No 43 >KOG0808 consensus Probab=98.31 E-value=3.4e-06 Score=58.83 Aligned_cols=239 Identities=14% Similarity=0.122 Sum_probs=139.7 Q ss_pred CCCCCCCCCCCCCCHHH----HHHHHHHHHHHHHHHHHCCCCCCCCCCEEEECCCC-CCCCHHHH--------------H Q ss_conf 55532100124420355----54333357899875431013346686189831210-11000112--------------3 Q gi|254780425|r 240 KISPVIRIVQPGINPAI----KEDREKILERYLSLTALPVSAGELEPVIIVWAYLP-FPFSIVDQ--------------P 300 (518) Q Consensus 240 ~~~~~V~iVQpni~~~~----k~~~~~~~~~~~~l~~~~~~~~~~~~dLiVWPEsa-~p~~~~~~--------------~ 300 (518) ..-++|++||-.|+.-. ..+...+.++...+.+. ++-.++.+|-+-|.- +|+.+-.. . T Consensus 71 pr~vrvgliqn~i~lpttapv~eq~~aih~r~kaiiea---aa~agvniiclqeawtmpfafctrerlpwtefaesv~~g 147 (387) T KOG0808 71 PRVVRVGLIQNSIALPTTAPVSEQTRAIHDRLKAIIEA---AAVAGVNIICLQEAWTMPFAFCTRERLPWTEFAESVDTG 147 (387) T ss_pred CCEEEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHH---HHHCCCCEEEEEHHHCCCHHHHCCCCCCHHHHCCCCCCC T ss_conf 74799865204456888873778889999999999999---875486579740122474133303447605430424448 Q ss_pred HHHHHHHHHHCCCCEEEECCCCCCCCCCCCCEEEEEECCCCCCCCCCCCCCEEEEEHHHHHCCCCHHHHHHHHCCCCCCC Q ss_conf 46677656410798242011001234557512100003673212211125633422133100100001211100122221 Q gi|254780425|r 301 SILKKIASVLKINQLLIVGSMRKELVDGRSHFYKSIYIINSKGEILASSNAKHLVPFAEYLPYRNILKKLNLDFYMFPLD 380 (518) Q Consensus 301 ~~~~~l~~~~~~~~~li~G~~~~~~~~~~~~~yNs~~~~~~~g~~~~~Y~K~~LVPFGEyiP~~~~~~~l~~~~~~~~~~ 380 (518) ..-+.++++++++..+|+.-.-..+.+.++-.+|++++++.+|.++++..|.|.---| +|. ...- T Consensus 148 ptt~flqklakkhdmvivspilerd~ehgdvlwntavvisn~g~vigk~rknhiprvg--------------dfn-esty 212 (387) T KOG0808 148 PTTKFLQKLAKKHDMVIVSPILERDIEHGDVLWNTAVVISNNGNVIGKHRKNHIPRVG--------------DFN-ESTY 212 (387) T ss_pred CHHHHHHHHHHHCCEEEEEHHHHCCCCCCCEEEEEEEEECCCCCEECCCCCCCCCCCC--------------CCC-CCEE T ss_conf 3579999988657869971013145434724554469982687533321235698535--------------667-5226 Q ss_pred CCCCCCCCCEEECCCCEEECCCCHHHHHH--HHHHCCCCCCEEEEECCCCCCCC-CHH-HHHHHHHHHHHHHCCCEEE-- Q ss_conf 01358776200012200102221666558--88851269977999627755767-628-9999999999997299199-- Q gi|254780425|r 381 YSSSDISPLLGLSEKLRLYPLLFSDALFH--QDINNLENVSAILNIIDDSGFMG-SGT-DHSFRYAQIQAVEIGLPLI-- 454 (518) Q Consensus 381 ~~~g~~~~~~~~~~~~~~~~~ICyE~~f~--~~~r~~~~a~~lvn~tNd~Wf~~-s~~-~qH~~~a~~RAiE~~r~~v-- 454 (518) +-.|+-.-++....-.+++..|||---.| .++-.++||+.+.|+|.. -|. |.| |- --||=-||.|.-... T Consensus 213 ymeg~lghpvfet~fgriavnicygrhhplnwlmy~lngaeiifnpsat--vgalseplwp--iearnaaianh~ft~~i 288 (387) T KOG0808 213 YMEGDLGHPVFETVFGRIAVNICYGRHHPLNWLMYGLNGAEIIFNPSAT--VGALSEPLWP--IEARNAAIANHYFTGSI 288 (387) T ss_pred EEECCCCCCEEEEECCEEEEEEECCCCCCHHHHHHHCCCCEEEECCCCC--CCCCCCCCCC--HHHHHHHHHHCEEEEEE T ss_conf 7505779830022103378886226777356565414673599777532--0000476676--02312565422478763 Q ss_pred -EEECC----------------------CCCEEECCCCEEEEECCCCCCEEEEEEECCCCCCCHHHHHH Q ss_conf -98559----------------------70469989980877738998279999960676876878655 Q gi|254780425|r 455 -RATNN----------------------GVSAFLDERGQIISSVYADRGASIDMHFQPKVRDSFRSSIQ 500 (518) Q Consensus 455 -ra~nt----------------------GiS~~id~~G~i~~~~~~~~~~~l~~~v~~~~~~T~y~~~g 500 (518) |+.+. |-|.+-.|++.-+-.+...+.|.+.+++-++..+-.-..|| T Consensus 289 nrvgtevfpneftsgdgkpah~dfghfygssy~aapd~srtp~lsr~rdgllia~ldlnlcrq~kd~wg 357 (387) T KOG0808 289 NRVGTEVFPNEFTSGDGKPAHNDFGHFYGSSYFAAPDASRTPSLSRYRDGLLIADLDLNLCRQYKDKWG 357 (387) T ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEECCCCCCCCCCCCCCCCEEEEECCHHHHHHHHHHHC T ss_conf 134341078866689997343435401143013268877786500003654776144688997566545 No 44 >pfam09142 TruB_C tRNA Pseudouridine synthase II, C terminal. The C terminal domain of tRNA Pseudouridine synthase II adopts a PUA (pfam01472) fold, with a four-stranded mixed beta-sheet flanked by one alpha-helix on each side. It allows for binding of the enzyme to RNA, as well as stabilisation of the RNA molecule. Probab=84.22 E-value=1.8 Score=21.51 Aligned_cols=40 Identities=20% Similarity=0.186 Sum_probs=33.6 Q ss_pred HHHHHHHHHHCCCEEEEEECCCCCEEECCCCEEEEECCCC Q ss_conf 9999999997299199985597046998998087773899 Q gi|254780425|r 439 FRYAQIQAVEIGLPLIRATNNGVSAFLDERGQIISSVYAD 478 (518) Q Consensus 439 ~~~a~~RAiE~~r~~vra~ntGiS~~id~~G~i~~~~~~~ 478 (518) +.-...+++-+||++-.+...|+.++++|+|+.++-++.. T Consensus 6 lt~~ea~~l~~Gr~L~~~g~~G~~aa~~PdG~liAL~~~~ 45 (56) T pfam09142 6 LTAAEARDLRHGRRLPPAGIPGPYAAVAPDGRLIALLEER 45 (56) T ss_pred CCHHHHHHHHCCCCCCCCCCCCCEEEECCCCCEEEEEECC T ss_conf 7899999976888758888987789989999689998447 No 45 >KOG2303 consensus Probab=79.57 E-value=4 Score=19.26 Aligned_cols=37 Identities=16% Similarity=0.073 Sum_probs=18.4 Q ss_pred EEEEECCCCCCCCCCCCCCEEEEEHHHHHCCCCHHHHH Q ss_conf 10000367321221112563342213310010000121 Q gi|254780425|r 333 YKSIYIINSKGEILASSNAKHLVPFAEYLPYRNILKKL 370 (518) Q Consensus 333 yNs~~~~~~~g~~~~~Y~K~~LVPFGEyiP~~~~~~~l 370 (518) ||..+++ -||+++-+--|.-|--=|-|--.|.+.+|. T Consensus 97 yNCrv~~-~n~kil~IRpKm~lanDgnyRE~RwFt~W~ 133 (706) T KOG2303 97 YNCRVLF-LNRKILLIRPKMWLANDGNYRESRWFTPWT 133 (706) T ss_pred HCCCEEE-CCCEEEEECCCCEECCCCCCHHHCCCCCCC T ss_conf 0402120-277189970420333688702100233346 No 46 >TIGR02225 recomb_XerD tyrosine recombinase XerD; InterPro: IPR011932 Proteins in this entry are part of the wider so-called "phage" integrase fmaily which describes a number of recombinases with tyrosine active sites that transiently bind covalently to DNA. Many are associated with mobile DNA elements, including phage, transposons, and phase variation loci. This entry represents the chromosomally-encoded polypeptide XerD, which is closely related to, and interacts with, another chromosomally-encoded polypeptide, XerC (IPR011931 from INTERPRO). These proteins are site-specific tyrosine recombinases, which act by catalyzing the cutting and rejoining of the recombining DNA molecules . They form a cyclic heterotetrameric complex composed of two molecules of XerC and two molecules of XerD. The XerC-XerD complex is essential to convert dimers of the bacterial chromosome into monomers to permit their segregation at cell division. It also contributes to the segregational stability of plasmids. XerD is composed of two domains, an N-terminal alpha-helical domain, and a C-terminal domain composed mostly of alpha helices, with a three-stranded antiprallel beta sheet along one edge . The active site is found at the extreme C-terminal end of the polypeptide. Most of the C-terminal domain is structurally homologous to that of other "phage" integrases, but the active site regions of these enzymes are stucturally distinct.; GO: 0003677 DNA binding, 0006310 DNA recombination, 0007049 cell cycle, 0007059 chromosome segregation, 0015074 DNA integration, 0051301 cell division. Probab=66.51 E-value=2.1 Score=21.02 Aligned_cols=53 Identities=17% Similarity=0.258 Sum_probs=26.0 Q ss_pred HHHHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCEEEE-EECCCCCCCCCCCCCCEEEEEHHHHH Q ss_conf 12346677656410798242011001234557512100-00367321221112563342213310 Q gi|254780425|r 298 DQPSILKKIASVLKINQLLIVGSMRKELVDGRSHFYKS-IYIINSKGEILASSNAKHLVPFAEYL 361 (518) Q Consensus 298 ~~~~~~~~l~~~~~~~~~li~G~~~~~~~~~~~~~yNs-~~~~~~~g~~~~~Y~K~~LVPFGEyi 361 (518) ++...++-|..- .--..-++|-...+-.-+. .. +..+-.+| +|.+|||+||+- T Consensus 134 RDrAmLElLYAT-GlRVSELv~L~~~d~~l~~----g~~~~~v~GKG------~KERlVP~G~~A 187 (305) T TIGR02225 134 RDRAMLELLYAT-GLRVSELVGLRLEDVNLDE----GVKVRKVRGKG------SKERLVPLGEEA 187 (305) T ss_pred HHHHHHHHHHHC-CCCEEEEECCCHHHHHCCC----CEEEEEEEECC------CCEEECCCHHHH T ss_conf 547999999864-8835533056166520027----73799986168------841202202899 No 47 >COG3859 Predicted membrane protein [Function unknown] Probab=59.63 E-value=11 Score=16.43 Aligned_cols=61 Identities=20% Similarity=0.275 Sum_probs=36.7 Q ss_pred HHHHHHHHHHHHCCCC-CH--HHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHH-HHHH Q ss_conf 9999999999752611-41--78999999999997415554332100000145799999999999999876-8755 Q gi|254780425|r 19 IAILAGVIGSFSMPSS-DL--FLSSFVSFTLLIWLLDGISSNLGRMSAISRVGSSFFVGWLFGVGYFLAGL-WWVR 90 (518) Q Consensus 19 la~l~G~l~~ls~pp~-~~--~~l~~ia~~~l~~~l~~~~~~~~~~~~~~~~~~~f~~g~~~G~~~~~~~l-~Wi~ 90 (518) +|+.+++.+.||+-|. ++ |.-.=++-.|.+.. +..+..+..+..|++||+..+..+- |=++ T Consensus 13 iAimaAlA~vLSfi~~~~~~~ggSvslgmIPi~li-----------afRrG~kaG~~tGLl~Gll~~i~G~~Y~lh 77 (185) T COG3859 13 IAIMAALAMVLSFIPIYDLPQGGSVSLGMIPILLI-----------AFRRGLKAGLLTGLLWGLLHLILGKAYILH 77 (185) T ss_pred HHHHHHHHHHHHHCCCCCCCCCCCEEHHHHHHHHH-----------HHHHHHHHHHHHHHHHHHHHHHHCCHHHCC T ss_conf 99999999999770155456777202378779999-----------998634887898999999999717355335 No 48 >TIGR02075 pyrH_bact uridylate kinase; InterPro: IPR015963 Uridylate kinases (also known as UMP kinases) are key enzymes in the synthesis of nucleoside triphosphates. They catalyse the reversible transfer of the gamma-phosphoryl group from an ATP donor to UMP, yielding UDP, which is the starting point for the synthesis of all other pyrimidine nucleotides. The eukaryotic enzyme has a dual specificity, phosphorylating both UMP and CMP, while the bacterial enzyme is specific to UMP. The bacterial enzyme shows no sequence similarity to the eukaryotic enzyme or other nucleoside monophosphate kinases, but rather appears to be part of the amino acid kinase family. It is dependent on magnesium for activity and is activated by GTP and repressed by UTP , . In many bacterial genomes, the gene tends to be located immediately downstream of elongation factor T and upstream of ribosome recycling factor. A related protein family, believed to be equivalent in function is found in the archaea and in spirochetes. Structurally, the bacterial and archaeal proteins are homohexamers centred around a hollow nucleus and organised as a trimer of dimers , . Each monomer within the protein forms the amino acid kinase fold and can be divided into an N-terminal region which binds UMP and mediates intersubunit interactions within the dimer, and a C-terminal region which binds ATP and contains a mobile loop covering the active site. Inhibition of enzyme activity by UTP appears to be due to competition for the binding site for UMP, not allosteric inhibition as was previously suspected. This entry represents the bacterial proteins.. Probab=56.81 E-value=12 Score=16.12 Aligned_cols=77 Identities=16% Similarity=0.176 Sum_probs=52.5 Q ss_pred CEEECCCCHHHHHHHHHHCCCCCCEEEEE--CCCCCCCCCHHHHHHHHHHHHHHHCCCEEEEEECCCCCEEECCCCEEEE Q ss_conf 00102221666558888512699779996--2775576762899999999999972991999855970469989980877 Q gi|254780425|r 396 LRLYPLLFSDALFHQDINNLENVSAILNI--IDDSGFMGSGTDHSFRYAQIQAVEIGLPLIRATNNGVSAFLDERGQIIS 473 (518) Q Consensus 396 ~~~~~~ICyE~~f~~~~r~~~~a~~lvn~--tNd~Wf~~s~~~qH~~~a~~RAiE~~r~~vra~ntGiS~~id~~G~i~~ 473 (518) .++-+.||.--......+-+++.++++-- |-+--|++- .-|.+||+|.+-.++=-++.|+=+|.|.+=+.-+ T Consensus 102 i~~~~~i~E~Y~~~~a~~~le~g~vVIF~gGtGnPfFTTD------taA~LRAiEi~aD~~Lkgt~GVDGVY~~DPkknk 175 (236) T TIGR02075 102 ISMPKQICESYIRRKAIKHLEKGKVVIFSGGTGNPFFTTD------TAAALRAIEINADVILKGTNGVDGVYTADPKKNK 175 (236) T ss_pred EECCCCCCCCCCHHHHHHHHHCCCEEEEECCCCCCCCHHH------HHHHHHHHHHCCEEEEEECCCCCCCCCCCCCCCC T ss_conf 5358641323227899998535978999558986963211------5888766643134799810488802358788598 Q ss_pred ECCCC Q ss_conf 73899 Q gi|254780425|r 474 SVYAD 478 (518) Q Consensus 474 ~~~~~ 478 (518) ..-.+ T Consensus 176 ~A~~y 180 (236) T TIGR02075 176 DAKKY 180 (236) T ss_pred CCEEC T ss_conf 75231 No 49 >KOG1195 consensus Probab=49.12 E-value=14 Score=15.67 Aligned_cols=23 Identities=13% Similarity=0.325 Sum_probs=19.4 Q ss_pred C-CCCHHHHHHHHHHHHHHHCCCE Q ss_conf 7-6762899999999999972991 Q gi|254780425|r 430 F-MGSGTDHSFRYAQIQAVEIGLP 452 (518) Q Consensus 430 f-~~s~~~qH~~~a~~RAiE~~r~ 452 (518) + |++||+-+..++|+|++|..-. T Consensus 441 ~~gdtg~yLQY~HsRL~Slerk~~ 464 (567) T KOG1195 441 FEGDTGPYLQYTHSRLRSLERKFS 464 (567) T ss_pred ECCCCHHHHHHHHHHHHHHHHHCC T ss_conf 168844788887878999998625 No 50 >KOG2882 consensus Probab=44.58 E-value=19 Score=14.89 Aligned_cols=85 Identities=16% Similarity=0.160 Sum_probs=40.2 Q ss_pred CCCCCCEEEEEHHHHHCC------CCHHHHHHHHCCCCCCCCCCC--CCCCCEEECCCCEEE-CCCCHHHHH--HHHHHC Q ss_conf 111256334221331001------000012111001222210135--877620001220010-222166655--888851 Q gi|254780425|r 346 LASSNAKHLVPFAEYLPY------RNILKKLNLDFYMFPLDYSSS--DISPLLGLSEKLRLY-PLLFSDALF--HQDINN 414 (518) Q Consensus 346 ~~~Y~K~~LVPFGEyiP~------~~~~~~l~~~~~~~~~~~~~g--~~~~~~~~~~~~~~~-~~ICyE~~f--~~~~r~ 414 (518) +..|-|++. |++++++. ++-++..+......+.+-... .........-...++ +++-||.-| +.+.+. T Consensus 95 ~a~ylk~~~-~~~k~Vyvig~~gi~~eL~~aG~~~~g~~~~~~~~~~~~~~~~~~~~d~~VgAVvvg~D~hfsy~KL~kA 173 (306) T KOG2882 95 IADYLKKRK-PFGKKVYVIGEEGIREELDEAGFEYFGGGPDGKDTDGAKSFVLSIGLDPDVGAVVVGYDEHFSYPKLMKA 173 (306) T ss_pred HHHHHHHHC-CCCCEEEEECCHHHHHHHHHCCCEEECCCCCCCCCCCCCCCHHHCCCCCCCCEEEEECCCCCCHHHHHHH T ss_conf 999999707-5777588855301657999728456347887665433343165527888889799944665679999999 Q ss_pred ---CCCCCEEEEECC-CCCCC Q ss_conf ---269977999627-75576 Q gi|254780425|r 415 ---LENVSAILNIID-DSGFM 431 (518) Q Consensus 415 ---~~~a~~lvn~tN-d~Wf~ 431 (518) +++++.+.+.|| |+|+. T Consensus 174 ~~yLqnP~clflatn~D~~~p 194 (306) T KOG2882 174 LNYLQNPGCLFLATNRDATTP 194 (306) T ss_pred HHHHCCCCCEEEECCCCCCCC T ss_conf 997479980898516765447 No 51 >TIGR02918 TIGR02918 conserved hypothetical protein TIGR02918; InterPro: IPR014267 Members of this entry are found only in Gram-positive bacteria of the Firmicutes lineage, including several species of Staphylococcus, Streptococcus, and Lactobacillus.. Probab=43.26 E-value=9.2 Score=16.91 Aligned_cols=105 Identities=20% Similarity=0.265 Sum_probs=68.4 Q ss_pred EEEECCCCCCCCCCCCCCEEEEEHHHHHCCCCHHHHHHH-HCCCCCCCCCCC-----CCCCCEEECCCCEEECCCCHHHH Q ss_conf 000036732122111256334221331001000012111-001222210135-----87762000122001022216665 Q gi|254780425|r 334 KSIYIINSKGEILASSNAKHLVPFAEYLPYRNILKKLNL-DFYMFPLDYSSS-----DISPLLGLSEKLRLYPLLFSDAL 407 (518) Q Consensus 334 Ns~~~~~~~g~~~~~Y~K~~LVPFGEyiP~~~~~~~l~~-~~~~~~~~~~~g-----~~~~~~~~~~~~~~~~~ICyE~~ 407 (518) ||++.| +||+++ |.|+-|| .-+++.|.. +-...-.|...| -.|.++...+.-|+|+.|==|=- T Consensus 186 ~s~F~F-~dG~iL--YsK~eli--------ayFl~~L~LT~~Di~IlDR~~~dknliiGQ~Vl~Nkg~AklgVVvHAEHf 254 (511) T TIGR02918 186 ESVFVF-KDGKIL--YSKQELI--------AYFLKQLNLTKKDIIILDRSTGDKNLIIGQAVLENKGPAKLGVVVHAEHF 254 (511) T ss_pred CEEEEC-CCCCEE--CCHHHHH--------HHHHHHCCCCCCCEEEEECCCCCCEEEEEEEEEEECCCEEEEEEEEEEEC T ss_conf 436783-797454--1869999--------99999606985668999736899305871388875087028999974636 Q ss_pred HHHHHH------------CCCCC---CEEEEECCCCCCCCCHHHHHHHHHHHHHHHCCCEEEEEE Q ss_conf 588885------------12699---779996277557676289999999999997299199985 Q gi|254780425|r 408 FHQDIN------------NLENV---SAILNIIDDSGFMGSGTDHSFRYAQIQAVEIGLPLIRAT 457 (518) Q Consensus 408 f~~~~r------------~~~~a---~~lvn~tNd~Wf~~s~~~qH~~~a~~RAiE~~r~~vra~ 457 (518) -.+.+. .+.+| |+.++.|+ .+.|...-+|.==.+.+|-|++= T Consensus 255 ~~~~t~e~~ILWNNyYEYqF~nA~~iDFFItATd--------~Q~~~l~~QF~kY~~~~P~i~TI 311 (511) T TIGR02918 255 SESATNETYILWNNYYEYQFSNADYIDFFITATD--------IQNQILLEQFKKYKNIEPKIYTI 311 (511) T ss_pred CCCCCCCCCCCCCCCHHHHCCCCCEEEEEEECHH--------HHHHHHHHHHHHHCCCCCEEEEE T ss_conf 8886168602214652553156866511340057--------88999999888747998778885 No 52 >TIGR02224 recomb_XerC tyrosine recombinase XerC; InterPro: IPR011931 The phage integrase family describes a number of recombinases with tyrosine active sites that transiently bind covalently to DNA. Many are associated with mobile DNA elements, including phage, transposons, and phase variation loci. This entry represents XerC, which is closely related to the other chromosomal protein XerD (IPR011932 from INTERPRO). XerC and XerD are site-specific recombinases which help resolve chromosome dimers to monomers for cell division after DNA replication. XerC and XerD interact with cell division protein FtsK .; GO: 0003677 DNA binding, 0006310 DNA recombination, 0007049 cell cycle, 0007059 chromosome segregation, 0015074 DNA integration, 0051301 cell division. Probab=43.01 E-value=9.2 Score=16.89 Aligned_cols=22 Identities=32% Similarity=0.668 Sum_probs=14.9 Q ss_pred EEEECCCCCCCCCCCCCCEEEEEHHHHHC Q ss_conf 00003673212211125633422133100 Q gi|254780425|r 334 KSIYIINSKGEILASSNAKHLVPFAEYLP 362 (518) Q Consensus 334 Ns~~~~~~~g~~~~~Y~K~~LVPFGEyiP 362 (518) +.+. +-.+| +|.+.||||++.= T Consensus 175 ~~vr-V~GKG------~KeR~VP~G~~A~ 196 (313) T TIGR02224 175 GLVR-VRGKG------NKERIVPFGSYAR 196 (313) T ss_pred CEEE-EEECC------CCEEECCCHHHHH T ss_conf 7677-74148------8135303148999 No 53 >cd00058 FGF Acidic and basic fibroblast growth factor family; FGFs are mitogens, which stimulate growth or differentiation of cells of mesodermal or neuroectodermal origin. The family plays essential roles in patterning and differentiation during vertebrate embryogenesis, and has neurotrophic activities. FGFs have a high affinity for heparan sulfate proteoglycans and require heparan sulfate to activate one of four cell surface FGF receptors. Upon binding to FGF, the receptors dimerize and their intracellular tyrosine kinase domains become active. FGFs have internal pseudo-threefold symmetry (beta-trefoil topology). Probab=40.81 E-value=17 Score=15.13 Aligned_cols=28 Identities=18% Similarity=0.331 Sum_probs=10.0 Q ss_pred HHCCCEEEEEECCCCCEEECCCCEEEEE Q ss_conf 9729919998559704699899808777 Q gi|254780425|r 447 VEIGLPLIRATNNGVSAFLDERGQIISS 474 (518) Q Consensus 447 iE~~r~~vra~ntGiS~~id~~G~i~~~ 474 (518) ++.|+=.+|++-||-=-.+|..|++..+ T Consensus 37 v~~g~V~Ikgv~s~~YLcmn~~G~ly~~ 64 (123) T cd00058 37 VAVGVVSIKGVASCRYLCMNKCGKLYGS 64 (123) T ss_pred CCCCEEEEEEECCCCEEEECCCCCEEEC T ss_conf 2699799996021328999578979977 No 54 >COG4974 XerD Site-specific recombinase XerD [DNA replication, recombination, and repair] Probab=39.64 E-value=12 Score=16.06 Aligned_cols=14 Identities=43% Similarity=0.619 Sum_probs=11.4 Q ss_pred CCCEEEEEHHHHHC Q ss_conf 25633422133100 Q gi|254780425|r 349 SNAKHLVPFAEYLP 362 (518) Q Consensus 349 Y~K~~LVPFGEyiP 362 (518) =+|.++|||||+-= T Consensus 175 G~KeR~VP~g~~A~ 188 (300) T COG4974 175 GNKERLVPFGEEAV 188 (300) T ss_pred CCCEEEEECHHHHH T ss_conf 88255521339999 No 55 >PRK03619 phosphoribosylformylglycinamidine synthase subunit I; Provisional Probab=39.47 E-value=20 Score=14.64 Aligned_cols=26 Identities=15% Similarity=0.294 Sum_probs=22.8 Q ss_pred ECCCCCEEECCCCEEEEECCCCCCEE Q ss_conf 55970469989980877738998279 Q gi|254780425|r 457 TNNGVSAFLDERGQIISSVYADRGAS 482 (518) Q Consensus 457 ~ntGiS~~id~~G~i~~~~~~~~~~~ 482 (518) +..+|.++.|++|||+..+|.-|+.+ T Consensus 175 S~~~IAGI~~~~GrvlgmMPHPERa~ 200 (223) T PRK03619 175 SVNDIAGICNENGNVLGLMPHPERAV 200 (223) T ss_pred CHHHHHEEECCCCCEEEECCCCCCCC T ss_conf 96465285879999988889964463 No 56 >TIGR03663 conserved hypothetical protein TIGR03663. Members of this protein family, uncommon and rather sporadically distributed, are found almost always in the same genomes as members of family TIGR03662, and frequently as a nearby gene. Members show some N-terminal sequence similarity with pfam02366, dolichyl-phosphate-mannose-protein mannosyltransferase. The few invariant residues in this family, found toward the N-terminus, include a dipeptide DE, a tripeptide HGP, and two different Arg residues. Up to three members may be found in a genome. The function is unknown. Probab=37.64 E-value=24 Score=14.20 Aligned_cols=11 Identities=27% Similarity=0.591 Sum_probs=4.2 Q ss_pred HHHHHHHHHHH Q ss_conf 33444446533 Q gi|254780425|r 139 ICVFACAVGFC 149 (518) Q Consensus 139 ~l~~~~~w~~~ 149 (518) +..|.+.|.+. T Consensus 299 ~~~f~~yW~l~ 309 (439) T TIGR03663 299 FFLFACYWTLA 309 (439) T ss_pred HHHHHHHHHHH T ss_conf 99999999999 No 57 >pfam01039 Carboxyl_trans Carboxyl transferase domain. All of the members in this family are biotin dependent carboxylases. The carboxyl transferase domain carries out the following reaction; transcarboxylation from biotin to an acceptor molecule. There are two recognized types of carboxyl transferase. One of them uses acyl-CoA and the other uses 2-oxoacid as the acceptor molecule of carbon dioxide. All of the members in this family utilize acyl-CoA as the acceptor molecule. Probab=37.44 E-value=24 Score=14.18 Aligned_cols=19 Identities=16% Similarity=0.216 Sum_probs=9.2 Q ss_pred CCEEEEEECCCCCEEECCC Q ss_conf 9919998559704699899 Q gi|254780425|r 450 GLPLIRATNNGVSAFLDER 468 (518) Q Consensus 450 ~r~~vra~ntGiS~~id~~ 468 (518) ..|...+++-.+-.+|||. T Consensus 445 ~~~~~~a~~~~vD~vIdP~ 463 (487) T pfam01039 445 SPPYVAAARGFADLVIDPG 463 (487) T ss_pred CCHHHHHHHCCCCCCCCHH T ss_conf 9999998706758718889 No 58 >TIGR02743 TraW type-F conjugative transfer system protein TraW; InterPro: IPR014114 This entry represents TraW, an essential component of the F-type conjugative transfer system for plasmid DNA transfer that has been shown to be localized to the periplasm , .. Probab=37.23 E-value=17 Score=15.07 Aligned_cols=29 Identities=21% Similarity=0.320 Sum_probs=11.0 Q ss_pred CCEEEECCCCCCCCCCCCCEEEEEECCCCCCCC Q ss_conf 982420110012345575121000036732122 Q gi|254780425|r 313 NQLLIVGSMRKELVDGRSHFYKSIYIINSKGEI 345 (518) Q Consensus 313 ~~~li~G~~~~~~~~~~~~~yNs~~~~~~~g~~ 345 (518) .+.+|+||-..+-.+ --+|.+.||-+|.. T Consensus 162 ~K~ILt~Gs~~el~~----~L~~~IYFDQ~G~L 190 (217) T TIGR02743 162 VKWILTGGSVKELEK----RLDSRIYFDQNGKL 190 (217) T ss_pred CEEEECCCCHHHHHH----HCCCCCEECCCCHH T ss_conf 308871789799998----52896046367634 No 59 >KOG1897 consensus Probab=35.55 E-value=24 Score=14.22 Aligned_cols=30 Identities=13% Similarity=0.137 Sum_probs=11.7 Q ss_pred HHHHHHHHCCCEEEEEECCCCC-EEECCCCE Q ss_conf 9999999729919998559704-69989980 Q gi|254780425|r 441 YAQIQAVEIGLPLIRATNNGVS-AFLDERGQ 470 (518) Q Consensus 441 ~a~~RAiE~~r~~vra~ntGiS-~~id~~G~ 470 (518) |+.+++--.+-.++-|++++++ +-||+-++ T Consensus 675 ~c~f~s~a~~d~l~~~~~~~l~i~tid~iqk 705 (1096) T KOG1897 675 MCPFNSDAYPDSLASANGGALTIGTIDEIQK 705 (1096) T ss_pred HCCCCCCCCCCEEEEECCCCEEEEEECCHHH T ss_conf 2245665577338996499059999642322 No 60 >TIGR01478 STEVOR variant surface antigen, stevor family; InterPro: IPR006374 Malaria is still a major cause of mortality in many areas of the world. Plasmodium falciparum causes the most severe human form of the disease and is responsible for most fatalities. Severe cases of malaria can occur when the parasite invades and then proliferates within red blood cell erythrocytes. The parasite produces many variant antigenic proteins, encoded by multigene families, which are present on the surface of the infected erythrocyte and play important roles in virulence. A crucial survival mechanism for the malaria parasite is its ability to evade the immune response by switching these variant surface antigens. The high virulence of P. falciparum relative to other malarial parasites is in large part due to the fact that in this organism many of these surface antigens mediate the binding of infected erythrocytes to the vascular endothelium (cytoadherence) and non-infected erythrocytes (rosetting). This can lead to the accumulation of infected cells in the vasculature of a variety of organs, blocking the blood flow and reducing the oxygen supply. Clinical symptoms of severe infection can include fever, progressive anaemia, multi-organ dysfunction and coma. For more information see . This entry represents the stevor (short for "subtelomeric variable open reading frame") family of predicted variant surface antigens. This is the third largest variant surface antigen family in P. falciparum, with 33 genes identified in the genome of this parasite . The function of these proteins is not known, but they are expressed during several stages of the parasite life-cycle and may play a variety of roles in parasite survival and infection .. Probab=35.27 E-value=26 Score=13.96 Aligned_cols=15 Identities=7% Similarity=-0.035 Sum_probs=9.0 Q ss_pred CCCCCCCCCCCCCCC Q ss_conf 555532100124420 Q gi|254780425|r 239 EKISPVIRIVQPGIN 253 (518) Q Consensus 239 ~~~~~~V~iVQpni~ 253 (518) .+....|+.+|-|.. T Consensus 27 ~N~~YNv~~iqNNtq 41 (315) T TIGR01478 27 LNKKYNVSYIQNNTQ 41 (315) T ss_pred CCCCCCEEEEECCCC T ss_conf 056645200014641 No 61 >pfam03419 Peptidase_U4 Sporulation factor SpoIIGA. Probab=34.08 E-value=27 Score=13.83 Aligned_cols=10 Identities=20% Similarity=0.504 Sum_probs=5.9 Q ss_pred CCCCEEEEEH Q ss_conf 1256334221 Q gi|254780425|r 348 SSNAKHLVPF 357 (518) Q Consensus 348 ~Y~K~~LVPF 357 (518) .-++.|++|| T Consensus 226 ~~~r~r~IPy 235 (291) T pfam03419 226 WAKRFRLIPY 235 (291) T ss_pred CCCCEEEEEE T ss_conf 3476599953 No 62 >cd01740 GATase1_FGAR_AT Type 1 glutamine amidotransferase (GATase1)-like domain found in Formylglycinamide ribonucleotide amidotransferase. Type 1 glutamine amidotransferase (GATase1)-like domain found in Formylglycinamide ribonucleotide amidotransferase (FGAR-AT). FGAR-AT catalyzes the ATP-dependent conversion of formylglycinamide ribonucleotide (FGAR) and glutamine to formylglycinamidine ribonucleotide (FGAM), ADP, Pi, and glutamate in the fourth step of the purine biosynthetic pathway. FGAR-AT is a glutamine amidotransferase. Glutamine amidotransferase activity catalyses the transfer of ammonia from the amide side chain of glutamine to an acceptor substrate. FGAR-AT belongs to the triad family of amidotransferases having a conserved Cys-His-Glu catalytic triad in the glutaminase active site Probab=32.83 E-value=27 Score=13.83 Aligned_cols=28 Identities=14% Similarity=0.292 Sum_probs=23.9 Q ss_pred ECCCCCEEECCCCEEEEECCCCCCEEEE Q ss_conf 5597046998998087773899827999 Q gi|254780425|r 457 TNNGVSAFLDERGQIISSVYADRGASID 484 (518) Q Consensus 457 ~ntGiS~~id~~G~i~~~~~~~~~~~l~ 484 (518) +..+|.++.|++|||+..+|.-|+.+.. T Consensus 191 S~~~IAGI~s~~GrvlgmMPHPER~~~~ 218 (238) T cd01740 191 SLDGIAGICNEDGRVLGMMPHPERAVEP 218 (238) T ss_pred CHHCEEEEECCCCCEEEECCCHHHHCCC T ss_conf 8202127887999999974785773065 No 63 >KOG4812 consensus Probab=32.48 E-value=29 Score=13.67 Aligned_cols=17 Identities=29% Similarity=0.618 Sum_probs=11.4 Q ss_pred HHHHHHHHHHHHHHHHH Q ss_conf 56899999999999997 Q gi|254780425|r 500 QMRVFWIIEFILLILAV 516 (518) Q Consensus 500 g~~~~~~~~~~~~~l~~ 516 (518) ..|+.|+++++.+++.+ T Consensus 222 q~wLwwi~~vlG~ll~l 238 (262) T KOG4812 222 QYWLWWIFLVLGLLLFL 238 (262) T ss_pred CHHHHHHHHHHHHHHHH T ss_conf 11999999999999999 No 64 >smart00442 FGF Acidic and basic fibroblast growth factor family. Mitogens that stimulate growth or differentiation of cells of mesodermal or neuroectodermal origin. The family play essential roles in patterning and differentiation during vertebrate embryogenesis, and have neurotrophic activities. Probab=31.78 E-value=30 Score=13.59 Aligned_cols=24 Identities=13% Similarity=0.278 Sum_probs=8.0 Q ss_pred CCEEEEEECCCCCEEECCCCEEEE Q ss_conf 991999855970469989980877 Q gi|254780425|r 450 GLPLIRATNNGVSAFLDERGQIIS 473 (518) Q Consensus 450 ~r~~vra~ntGiS~~id~~G~i~~ 473 (518) |+=.+|++-||-=-.+|..|++.+ T Consensus 44 g~V~I~gv~s~~YLcmn~~G~Ly~ 67 (126) T smart00442 44 GVVAIKGVASCRYLCMNKCGKLYG 67 (126) T ss_pred CEEEEEECCCCCEEEECCCCCEEE T ss_conf 989998613354999978895887 No 65 >pfam08981 consensus Probab=31.51 E-value=30 Score=13.56 Aligned_cols=15 Identities=7% Similarity=0.273 Sum_probs=5.8 Q ss_pred CCCC--EEEEECCCCCC Q ss_conf 6997--79996277557 Q gi|254780425|r 416 ENVS--AILNIIDDSGF 430 (518) Q Consensus 416 ~~a~--~lvn~tNd~Wf 430 (518) .|+| +++-+.|-.-| T Consensus 153 ~GaDTA~Vi~PAhs~~f 169 (181) T pfam08981 153 RGADTALVIRPAHSNNF 169 (181) T ss_pred CCCCEEEEEEECCCCCC T ss_conf 77636999834556760 No 66 >PRK01175 phosphoribosylformylglycinamidine synthase subunit I; Provisional Probab=31.02 E-value=30 Score=13.51 Aligned_cols=44 Identities=9% Similarity=0.127 Sum_probs=31.7 Q ss_pred CCCCEEECCCCEEEEECCCCCCEEEEEEECCCCCCCHHHHHHHHHHH Q ss_conf 97046998998087773899827999996067687687865568999 Q gi|254780425|r 459 NGVSAFLDERGQIISSVYADRGASIDMHFQPKVRDSFRSSIQMRVFW 505 (518) Q Consensus 459 tGiS~~id~~G~i~~~~~~~~~~~l~~~v~~~~~~T~y~~~g~~~~~ 505 (518) .+|.++.|++|||...+|.-|+..+....+..... ...||+..+ T Consensus 201 ~~IAGI~n~~GrvlgmMPHPER~~~~~q~~~~~~~---~~~g~g~~~ 244 (255) T PRK01175 201 DNIAGITNETGNVIGLMPHPERIYYGYQAMYNERR---KDYGTGKIF 244 (255) T ss_pred HHHCEEECCCCCEEEECCCCCCCCCCCCCCCCCCC---CCCCCHHHH T ss_conf 67208678999998878996325646558878666---777869999 No 67 >TIGR03074 PQQ_membr_DH membrane-bound PQQ-dependent dehydrogenase, glucose/quinate/shikimate family. This protein family has a phylogenetic distribution very similar to that coenzyme PQQ biosynthesis enzymes, as shown by partial phylogenetic profiling. Members of this family have several predicted transmembrane helices in the N-terminal region, and include the quinoprotein glucose dehydrogenase (EC 1.1.5.2) of Escherichia coli and the quinate/shikimate dehydrogenase of Acinetobacter sp. ADP1 (EC 1.1.99.25). Sequences closely related except for the absense of the N-terminal hydrophobic region, scoring in the gray zone between the trusted and noise cutoffs, include PQQ-dependent glycerol (EC 1.1.99.22) and and other polyol (sugar alcohol) dehydrogenases. Probab=30.95 E-value=30 Score=13.50 Aligned_cols=18 Identities=22% Similarity=0.323 Sum_probs=8.1 Q ss_pred CCCEEECCC-CEEEEECCC Q ss_conf 704699899-808777389 Q gi|254780425|r 460 GVSAFLDER-GQIISSVYA 477 (518) Q Consensus 460 GiS~~id~~-G~i~~~~~~ 477 (518) |.-..||.+ |++..+.+. T Consensus 641 G~l~AiDl~TGk~~W~~p~ 659 (764) T TIGR03074 641 GYMAAIDLKTGKVVWQHPN 659 (764) T ss_pred EEEEEEECCCCCEEEECCC T ss_conf 5799998898958887037 No 68 >PRK00029 hypothetical protein; Validated Probab=30.42 E-value=19 Score=14.86 Aligned_cols=13 Identities=31% Similarity=0.355 Sum_probs=4.8 Q ss_pred HHHHHHHHHHHCC Q ss_conf 9999999999729 Q gi|254780425|r 438 SFRYAQIQAVEIG 450 (518) Q Consensus 438 H~~~a~~RAiE~~ 450 (518) |+....+.++|+| T Consensus 433 hlvE~AI~aAe~G 445 (487) T PRK00029 433 HLAEQAIEAAEEG 445 (487) T ss_pred HHHHHHHHHHHCC T ss_conf 9999999999817 No 69 >TIGR02168 SMC_prok_B chromosome segregation protein SMC; InterPro: IPR011890 The SMC (structural maintenance of chromosomes) family of proteins, exist in virtually all organisms including both bacteria and archaea. The SMC proteins are essential for successful chromosome transmission during replication and segregation of the genome in all organisms and form three types of heterodimer (SMC1SMC3, SMC2SMC4, SMC5SMC6), which are core components of large multiprotein complexes. The best known complexes are cohesin, which is responsible for sister-chromatid cohesion, and condensin, which is required for full chromosome condensation in mitosis. SMCs are generally present as single proteins in bacteria, and as at least six distinct proteins in eukaryotes. The proteins range in size from approximately 110 to 170 kDa, and share a five-domain structure, with globular N- and C-terminal (IPR003395 from INTERPRO) domains separated by a long (circa 100 nm or 900 residues) coiled coil segment in the centre of which is a globular ''hinge'' domain, characterised by a set of four highly conserved glycine residues that are typical of flexible regions in a protein. The amino-terminal domain contains a 'Walker A' nucleotide-binding domain (GxxGxGKS/T), which by mutational studies has been shown to be essential in several proteins. The carboxy-terminal domain contains a sequence (the DA-box) that resembles a 'Walker B' motif (XXXXD, where X is any hydrophobic residue), and a LSGG motif with homology to the signature sequence of the ATP-binding cassette (ABC) family of ATPases . All SMC proteins appear to form dimers, either forming homodimers with themselves, as in the case of prokaryotic SMC proteins, or heterodimers between different but related SMC proteins. The dimers are arranged in an antiparallel alignment. This orientation brings the N- and C-terminal globular domains (from either different or identical protamers) together, which unites an ATP binding site (Walker A motif) within the N-terminal domain with a Walker B motif (DA box) within the C-terminal domain, to form a potentially functional ATPase. Protein interaction and microscopy data suggest that SMC dimers form a ring-like structure which might embrace DNA molecules. Non-SMC subunits associate with the SMC amino- and carboxy-terminal domains. The sequence homology within the carboxy-terminal domain is relatively high within the SMC1-SMC4 group, whereas SMC5 and SMC6 show some divergence in both of these sequences. SMCs share not only sequence similarity but also structural similarity with ABC proteins. SMC proteins function together with other proteins in a range of chromosomal transactions, including chromosome condensation, sister-chromatid cohesion, recombination, DNA repair and epigenetic silencing of gene expression . The smc gene is often associated with scpB (IPR005234 from INTERPRO) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle , , .; GO: 0005515 protein binding, 0005524 ATP binding, 0005694 chromosome. Probab=29.75 E-value=27 Score=13.88 Aligned_cols=17 Identities=12% Similarity=0.247 Sum_probs=11.1 Q ss_pred HHHHHHCC-CCCCEEEEE Q ss_conf 58888512-699779996 Q gi|254780425|r 408 FHQDINNL-ENVSAILNI 424 (518) Q Consensus 408 f~~~~r~~-~~a~~lvn~ 424 (518) |-+++|++ ...+|||+- T Consensus 1144 F~~ll~~~~~~tQFI~iT 1161 (1191) T TIGR02168 1144 FANLLKEMSKKTQFIVIT 1161 (1191) T ss_pred HHHHHHHHHHCCCEEEEE T ss_conf 999999985189278886 No 70 >TIGR00027 mthyl_TIGR00027 methyltransferase, putative, TIGR00027 family; InterPro: IPR011610 This entry represents a paralogous family with ~15 members in Mycobacterium tuberculosis and two members in Mycobacteriam leprae. No other homologs are found. . Probab=28.69 E-value=33 Score=13.25 Aligned_cols=12 Identities=0% Similarity=0.099 Sum_probs=6.2 Q ss_pred CCCCEEEEECCC Q ss_conf 699779996277 Q gi|254780425|r 416 ENVSAILNIIDD 427 (518) Q Consensus 416 ~~a~~lvn~tNd 427 (518) .|+.+.....++ T Consensus 199 ~GS~~~~d~~~~ 210 (281) T TIGR00027 199 PGSRLAFDYVRK 210 (281) T ss_pred CCCEEEEECCCH T ss_conf 876888753775 No 71 >PRK00784 cobyric acid synthase; Provisional Probab=27.97 E-value=34 Score=13.17 Aligned_cols=77 Identities=6% Similarity=-0.039 Sum_probs=37.2 Q ss_pred CCCCCCCCC-CCCCHHHHHHHHHHH-HHHHHHHHHCCCCCCCCCCEEEECCCCCCC---CHHHHHHHHHHHHHHHCCCCE Q ss_conf 553210012-442035554333357-899875431013346686189831210110---001123466776564107982 Q gi|254780425|r 241 ISPVIRIVQ-PGINPAIKEDREKIL-ERYLSLTALPVSAGELEPVIIVWAYLPFPF---SIVDQPSILKKIASVLKINQL 315 (518) Q Consensus 241 ~~~~V~iVQ-pni~~~~k~~~~~~~-~~~~~l~~~~~~~~~~~~dLiVWPEsa~p~---~~~~~~~~~~~l~~~~~~~~~ 315 (518) ..++|++|+ |-++.....++-... +...+..+... .-.++|+||.|=|--.. ...++..+.+.+.+.++++.+ T Consensus 252 ~~l~IaVi~lP~iSNftDfdpL~~e~~V~l~~v~~~~--~l~~~D~vIlPGsK~Ti~DL~~Lr~~Gl~~~i~~~~~~G~~ 329 (492) T PRK00784 252 GRLRVAVPRLPRISNFTDFDPLRAEPGVDVRYVRPGE--PLPDADLVILPGSKNTIADLAWLRESGWDEAIRAHARRGGP 329 (492) T ss_pred CCEEEEEEECCCCCCCCCCHHHHHCCCEEEEEECCCC--CCCCCCEEEECCCCCCHHHHHHHHHCCHHHHHHHHHHCCCC T ss_conf 7358999806755686544356407872899978976--56899999979986648999999986949999999975997 Q ss_pred EEECC Q ss_conf 42011 Q gi|254780425|r 316 LIVGS 320 (518) Q Consensus 316 li~G~ 320 (518) |+|. T Consensus 330 -V~GI 333 (492) T PRK00784 330 -VLGI 333 (492) T ss_pred -EEEE T ss_conf -8998 No 72 >pfam10321 7TM_GPCR_Srt Serpentine type 7TM GPCR chemoreceptor Srt. Chemoreception is mediated in Caenorhabditis elegans by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs) of proteins which are of the serpentine type. Srt is a member of the Srg superfamily of chemoreceptors. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf'. Probab=27.88 E-value=34 Score=13.16 Aligned_cols=68 Identities=16% Similarity=0.112 Sum_probs=40.6 Q ss_pred CCHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHH Q ss_conf 11034433444446533124455244630001211101478886333201332104667877785433110 Q gi|254780425|r 133 STGMGRICVFACAVGFCEWLRSVLGIGTWNAIGYAAMPIPVMMQSVHWIGLFGMNALSVFCFASPALFGTR 203 (518) Q Consensus 133 ~~~~~~~l~~~~~w~~~E~lR~~~~~gPW~~lg~~~~~~~~~~q~a~~~Gv~Glsflvv~i~~~~~~~~~~ 203 (518) +.+-.++..+-+..-..+..-+-+.+|.-+-.|-++=+.|.++.. .|..|++.|+..+..+..+...| T Consensus 64 k~~~YKiM~~Lgi~Di~al~~nSi~TG~l~i~GavfC~yP~~iyi---~G~i~~g~W~~~C~~~i~L~~nR 131 (313) T pfam10321 64 KHPCYKIMLFLGIFDILALSINSIITGFLAIQGAVFCSYPTFIYI---AGAIGLGLWMGSCATSILLAINR 131 (313) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCEECCCCCHHHHH---HHHHHHHHHHHHHHHHHHHHHHH T ss_conf 585999999999999999988777765676524010478389999---88887605888889999999999 No 73 >pfam01994 Trm56 tRNA ribose 2'-O-methyltransferase, aTrm56. This family is an aTrm56 that catalyses the 2'-O-methylation of the cytidine residue in archaeal tRNA, using S-adenosyl-L-methionine. Biochemical assays showed that aTrm56 forms a dimer and prefers the L-shaped tRNA to the lambda form as its substrate. aTrm56 consists of the SPOUT domain, which contains the characteristic deep trefoil knot for AdoMet binding, and a unique C-terminal beta-hairpin. Probab=27.52 E-value=35 Score=13.12 Aligned_cols=30 Identities=17% Similarity=0.166 Sum_probs=23.3 Q ss_pred ECCCCHHHHHH--HHHHCCCCCCEEEEECCCC Q ss_conf 02221666558--8885126997799962775 Q gi|254780425|r 399 YPLLFSDALFH--QDINNLENVSAILNIIDDS 428 (518) Q Consensus 399 ~~~ICyE~~f~--~~~r~~~~a~~lvn~tNd~ 428 (518) +-+|+-|..|. ++-+++.+|.+-|.+++.+ T Consensus 84 ALAiFLdrl~~G~el~~~F~~akl~ViPs~~G 115 (121) T pfam01994 84 ALAVFLDRLFEGKELEREFEDAKLKVIPSERG 115 (121) T ss_pred HHHHHHHHHCCCCHHHHCCCCCEEEEEECCCC T ss_conf 99999998617714541059973999666678 No 74 >cd03232 ABC_PDR_domain2 The pleiotropic drug resistance-like (PDR) family of ATP-binding cassette (ABC) transporters. PDR is a well-described phenomenon occurring in fungi and shares several similarities with processes in bacteria and higher eukaryotes. This PDR subfamily represents domain I of its (ABC-IM)2 organization. ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds including sugars, ions, peptides, and more complex organic molecules. The nucleotide binding domain shows the highest similarity between all members of the family. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=27.17 E-value=35 Score=13.08 Aligned_cols=66 Identities=8% Similarity=0.124 Sum_probs=38.8 Q ss_pred CCCCCEEEECCCCCCCCHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCEEEEEECCCCCCCCC Q ss_conf 66861898312101100011234667765641079824201100123455751210000367321221 Q gi|254780425|r 279 ELEPVIIVWAYLPFPFSIVDQPSILKKIASVLKINQLLIVGSMRKELVDGRSHFYKSIYIINSKGEIL 346 (518) Q Consensus 279 ~~~~dLiVWPEsa~p~~~~~~~~~~~~l~~~~~~~~~li~G~~~~~~~~~~~~~yNs~~~~~~~g~~~ 346 (518) -.+|+++++=|-.--.+........+.+.++.+++..+++-.-.. .+.-.......++++.+|+++ T Consensus 124 ~~~P~lllLDEPt~gLD~~~~~~i~~~l~~l~~~g~tiii~th~~--~~~i~~~~Driivl~~GG~iv 189 (192) T cd03232 124 AAKPSILFLDEPTSGLDSQAAYNIVRFLKKLADSGQAILCTIHQP--SASIFEKFDRLLLLKRGGKTV 189 (192) T ss_pred HCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCC--HHHHHHHCCEEEEECCCCEEE T ss_conf 449988998488768898999999999999996999999998363--799998799999997999999 No 75 >TIGR02903 spore_lon_C ATP-dependent protease, Lon family; InterPro: IPR014252 This entry shows some relation to the widely distributed ATP-dependent protease La, also called Lon or LonA (IPR004815 from INTERPRO), but is more closely related to LonB (IPR014251 from INTERPRO), a LonA paralog found only in endospore-forming bacteria. Proteins in this entry are unassigned peptidases belonging to the MEROPS peptidase family S16 (lon protease family, clan SJ). They are restricted to a subset of endospore-forming species, and probably participate in the program of endospore formation. We propose the designation LonC.. Probab=26.50 E-value=31 Score=13.42 Aligned_cols=77 Identities=16% Similarity=0.195 Sum_probs=45.0 Q ss_pred EEECCCCHHHHHH--HHHHCCCC-----CCEEEEECCCCCC-CCC-HHHHHHH-HHHHHHHHCCCEEEEEECCCCCEEEC Q ss_conf 0102221666558--88851269-----9779996277557-676-2899999-99999997299199985597046998 Q gi|254780425|r 397 RLYPLLFSDALFH--QDINNLEN-----VSAILNIIDDSGF-MGS-GTDHSFR-YAQIQAVEIGLPLIRATNNGVSAFLD 466 (518) Q Consensus 397 ~~~~~ICyE~~f~--~~~r~~~~-----a~~lvn~tNd~Wf-~~s-~~~qH~~-~a~~RAiE~~r~~vra~ntGiS~~id 466 (518) +.|+ -.-|+.|. ..+|+..+ =++=||+---|== |-| |..-.+. +|.+ .++|+=+= --+|+=|. T Consensus 485 TAGS-MaKDSvFNAasviRk~T~~D~~~yD~HVNViGGG~IDGPSAG~Ai~~~~~SA~----~~~p~rQD--vAiTGEiS 557 (616) T TIGR02903 485 TAGS-MAKDSVFNAASVIRKITGKDLKDYDIHVNVIGGGRIDGPSAGAAITLCIISAI----LDKPIRQD--VAITGEIS 557 (616) T ss_pred CCCC-CHHHHHHHHHHHHHHHCCCCCCCCCEEEEEECCCCCCCCCHHHHHHHHHHHHH----CCCCCEEE--EEEEEEEE T ss_conf 6553-03577898899986530468341651788852770175325799999999987----08983022--56510388 Q ss_pred CCCEEEEECCCCCC Q ss_conf 99808777389982 Q gi|254780425|r 467 ERGQIISSVYADRG 480 (518) Q Consensus 467 ~~G~i~~~~~~~~~ 480 (518) =+|+|..--+.+|. T Consensus 558 ~~G~ikpVGGi~EK 571 (616) T TIGR02903 558 LRGKIKPVGGIFEK 571 (616) T ss_pred EECEEEECCCCCCC T ss_conf 60216512663333 No 76 >COG0528 PyrH Uridylate kinase [Nucleotide transport and metabolism] Probab=26.24 E-value=37 Score=12.97 Aligned_cols=65 Identities=17% Similarity=0.124 Sum_probs=38.0 Q ss_pred CCCHHHHHHHHHHCCCCCCEEEEECCCCCCCCC-HHHHH-HHHHHHHHHHCCCEEEEEECCCCCEEECCCCEE Q ss_conf 221666558888512699779996277557676-28999-999999999729919998559704699899808 Q gi|254780425|r 401 LLFSDALFHQDINNLENVSAILNIIDDSGFMGS-GTDHS-FRYAQIQAVEIGLPLIRATNNGVSAFLDERGQI 471 (518) Q Consensus 401 ~ICyE~~f~~~~r~~~~a~~lvn~tNd~Wf~~s-~~~qH-~~~a~~RAiE~~r~~vra~ntGiS~~id~~G~i 471 (518) .+|-...+.+..|-+.+..+.|--. ++ .|+|- -..|.+||.|.+-.++--+++++-+|.|.+=+. T Consensus 108 ~~~e~~~~~~A~~~l~~grVvIf~g------Gtg~P~fTTDt~AALrA~ei~ad~ll~atn~VDGVY~~DPkk 174 (238) T COG0528 108 QVAEPYSRREAIRHLEKGRVVIFGG------GTGNPGFTTDTAAALRAEEIEADVLLKATNKVDGVYDADPKK 174 (238) T ss_pred CCCCCCCHHHHHHHHHCCCEEEEEC------CCCCCCCCHHHHHHHHHHHHCCCEEEEECCCCCCEECCCCCC T ss_conf 6668667999999997499899918------889998734799999999838868998406778036899988 No 77 >COG0138 PurH AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) [Nucleotide transport and metabolism] Probab=25.87 E-value=37 Score=12.93 Aligned_cols=73 Identities=25% Similarity=0.324 Sum_probs=49.4 Q ss_pred CCHHHHHHHHHHCCCCCCEEEEECCCCCCC-CCH--HHHH-HHHHHHHHHHCCCE---EE--------------EEECCC Q ss_conf 216665588885126997799962775576-762--8999-99999999972991---99--------------985597 Q gi|254780425|r 402 LFSDALFHQDINNLENVSAILNIIDDSGFM-GSG--TDHS-FRYAQIQAVEIGLP---LI--------------RATNNG 460 (518) Q Consensus 402 ICyE~~f~~~~r~~~~a~~lvn~tNd~Wf~-~s~--~~qH-~~~a~~RAiE~~r~---~v--------------ra~ntG 460 (518) =--|..|...+-+..+++.+|...|...-| +.| .+=+ ..+|.-||-|.+.- .| +++-.| T Consensus 399 e~~dl~Fawkv~k~vKSNaIv~akd~~TvGiGaGQ~sRV~sariA~~kA~~~~~~~~Gav~ASDAFFPF~D~i~~aA~aG 478 (515) T COG0138 399 ELEDLLFAWKVVKHVKSNAIVYAKDGQTVGIGAGQMSRVDSARIAAEKAGDAGLRAHGAVLASDAFFPFPDGIDAAAKAG 478 (515) T ss_pred HHHHHHHHHHHHHHHCCCEEEEECCCEEEEECCCCCCHHHHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCHHHHHHHCC T ss_conf 99999999999867034539997098388756885115889999999877765312681775235578861399999749 Q ss_pred CCEEECCCCEEEEE Q ss_conf 04699899808777 Q gi|254780425|r 461 VSAFLDERGQIISS 474 (518) Q Consensus 461 iS~~id~~G~i~~~ 474 (518) ++++|-|-|.|+.+ T Consensus 479 i~aIIqPGGSirD~ 492 (515) T COG0138 479 IKAIIQPGGSIRDQ 492 (515) T ss_pred CEEEECCCCCCCCH T ss_conf 86997799866649 No 78 >COG0021 TktA Transketolase [Carbohydrate transport and metabolism] Probab=25.24 E-value=38 Score=12.86 Aligned_cols=55 Identities=24% Similarity=0.297 Sum_probs=38.7 Q ss_pred CCCCCEEEEECCCCCC-CCCHH-HHH-HHHHHHHHHHCCCEEEEEE---------------CCCCCEEECCCCE Q ss_conf 2699779996277557-67628-999-9999999997299199985---------------5970469989980 Q gi|254780425|r 415 LENVSAILNIIDDSGF-MGSGT-DHS-FRYAQIQAVEIGLPLIRAT---------------NNGVSAFLDERGQ 470 (518) Q Consensus 415 ~~~a~~lvn~tNd~Wf-~~s~~-~qH-~~~a~~RAiE~~r~~vra~---------------ntGiS~~id~~G~ 470 (518) +.+...+.+.|-|+=. |..|| +|- -++|-+|||.| ..++|-+ ..|.|++|=.+=. T Consensus 450 Lm~l~~~~V~THDSIgvGEDGPTHqPiEqLa~LRaiPN-~~V~RPaD~~Et~~aw~~Al~~~~gPt~LiltRQn 522 (663) T COG0021 450 LMGLPVIYVFTHDSIGVGEDGPTHQPVEQLASLRAIPN-LSVIRPADANETAAAWKYALERKDGPTALILTRQN 522 (663) T ss_pred HHCCCEEEEEECCCEECCCCCCCCCCHHHHHHHHCCCC-CEEEECCCHHHHHHHHHHHHHCCCCCEEEEEECCC T ss_conf 60898189994671220787887776888888652688-60473288378999999998567898699996677 No 79 >TIGR00232 tktlase_bact transketolase; InterPro: IPR005478 Transketolase (EC 2.2.1.1) (TK) catalyzes the reversible transfer of a two-carbon ketol unit from xylulose 5-phosphate to an aldose receptor, such as ribose 5-phosphate, to form sedoheptulose 7-phosphate and glyceraldehyde 3- phosphate. This enzyme, together with transaldolase, provides a link between the glycolytic and pentose-phosphate pathways. TK requires thiamine pyrophosphate as a cofactor. This group includes two proteins from the yeast Saccharomyces cerevisiae but excludes dihydroxyactetone synthases (formaldehyde transketolases) from various yeasts and the even more distant mammalian transketolases. Among the family of thiamine diphosphate-dependent enzymes that includes transketolases, dihydroxyacetone synthases, pyruvate dehydrogenase E1-beta subunits, and deoxyxylulose-5-phosphate synthases, mammalian and bacterial transketolases seem not to be orthologous. ; GO: 0004802 transketolase activity. Probab=24.85 E-value=39 Score=12.81 Aligned_cols=49 Identities=20% Similarity=0.182 Sum_probs=33.1 Q ss_pred HHHHHHC------CCCCCEEEEECCCCCC-CCCHH-HHH-HHHHHHHHHHCCCEEEEEE Q ss_conf 5888851------2699779996277557-67628-999-9999999997299199985 Q gi|254780425|r 408 FHQDINN------LENVSAILNIIDDSGF-MGSGT-DHS-FRYAQIQAVEIGLPLIRAT 457 (518) Q Consensus 408 f~~~~r~------~~~a~~lvn~tNd~Wf-~~s~~-~qH-~~~a~~RAiE~~r~~vra~ 457 (518) |-|+.|. +.+...+-+-|=||-= |.-|| +|= -++|-+||+|| ..+.|-| T Consensus 444 F~dY~~~AiRLaALm~~~~~~V~THDSIgvGEDGPTHQP~EqLa~LRa~PN-~~vwRP~ 501 (675) T TIGR00232 444 FVDYARPAIRLAALMKLPVIYVYTHDSIGVGEDGPTHQPIEQLASLRAIPN-LSVWRPA 501 (675) T ss_pred HHHHHHHHHHHHHHHCCCCEEEEECCEECCCCCCCCCCCHHHHHHHHHCCC-CEEECCC T ss_conf 998655899999971789089997440113878884770223023322035-3162378 No 80 >PRK13551 agmatine deiminase; Provisional Probab=23.75 E-value=41 Score=12.68 Aligned_cols=22 Identities=5% Similarity=0.012 Sum_probs=13.5 Q ss_pred HHHHHHHHHHHHHHCCCEEEEE Q ss_conf 8999999999999729919998 Q gi|254780425|r 435 TDHSFRYAQIQAVEIGLPLIRA 456 (518) Q Consensus 435 ~~qH~~~a~~RAiE~~r~~vra 456 (518) +....++..+|..==+|-+|-. T Consensus 316 ~~D~~A~~~l~~~fP~R~VV~i 337 (360) T PRK13551 316 PTDDLARDILQEMFPDHKVVGV 337 (360) T ss_pred HHHHHHHHHHHHHCCCCEEEEE T ss_conf 6799999999998889869997 No 81 >PRK08622 galactose-6-phosphate isomerase subunit LacB; Reviewed Probab=22.81 E-value=43 Score=12.56 Aligned_cols=54 Identities=15% Similarity=-0.049 Sum_probs=38.1 Q ss_pred EECCCCHHHHHHHHHHCCCCCCEEEEECC-----------CCCCCCC--H-HHHHHHHHHHHHHHCCC Q ss_conf 10222166655888851269977999627-----------7557676--2-89999999999997299 Q gi|254780425|r 398 LYPLLFSDALFHQDINNLENVSAILNIID-----------DSGFMGS--G-TDHSFRYAQIQAVEIGL 451 (518) Q Consensus 398 ~~~~ICyE~~f~~~~r~~~~a~~lvn~tN-----------d~Wf~~s--~-~~qH~~~a~~RAiE~~r 451 (518) +-...|+|..-....|+.++|++|.--.- |.|..+. + .+|.-.+.++.++|.+- T Consensus 81 IRAAl~~d~~~A~~aR~HNnANVL~LGar~i~~~~A~~iv~~fL~T~Feggrh~~RRv~KI~~iE~~~ 148 (171) T PRK08622 81 IRSALVRDMTSALYAKEELNANVIGFGGKIIGELLMCDIIDAFINAEYKPTEENKKLIAKIDHLETHN 148 (171) T ss_pred EEEEEECCHHHHHHHHHHCCCEEEEECCCCCCHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHC T ss_conf 48998589899999999659718998611059999999999997799899607878999999997418 No 82 >KOG2792 consensus Probab=22.69 E-value=35 Score=13.14 Aligned_cols=18 Identities=22% Similarity=0.523 Sum_probs=13.7 Q ss_pred EECCCCCCCCCCCCCCEE Q ss_conf 003673212211125633 Q gi|254780425|r 336 IYIINSKGEILASSNAKH 353 (518) Q Consensus 336 ~~~~~~~g~~~~~Y~K~~ 353 (518) ++++||+|+.++.|.+.+ T Consensus 245 mYLidPeg~Fvd~~GrN~ 262 (280) T KOG2792 245 MYLIDPEGEFVDYYGRNY 262 (280) T ss_pred EEEECCCCCEEHHHCCCC T ss_conf 999889863006534668 No 83 >pfam01716 MSP Manganese-stabilising protein / photosystem II polypeptide. This family consists of the 33 KDa photosystem II polypeptide from the oxygen evolving complex (OEC) of plants and cyanobacteria. The protein is also known as the manganese-stabilising protein as it is associated with the manganese complex of the OEC and may provide the ligands for the complex. Probab=21.89 E-value=40 Score=12.75 Aligned_cols=34 Identities=21% Similarity=0.368 Sum_probs=21.4 Q ss_pred EEECCCCCCCCCCCCCCEEEEEHHHHHCCCCHHHHH Q ss_conf 000367321221112563342213310010000121 Q gi|254780425|r 335 SIYIINSKGEILASSNAKHLVPFAEYLPYRNILKKL 370 (518) Q Consensus 335 s~~~~~~~g~~~~~Y~K~~LVPFGEyiP~~~~~~~l 370 (518) +.-+-..+| .+.+-=..++|-||.+||---...| T Consensus 92 ~l~F~E~dG--iDfq~iTVqlPGGErVPFlFTvK~L 125 (245) T pfam01716 92 SLTFVEKDG--IDFQAVTVQLPGGERVPFLFTIKNL 125 (245) T ss_pred CEEEEEECC--EECCEEEEECCCCCCCCEEEEEEEE T ss_conf 678898478--7401279884898324347774101 No 84 >cd03009 TryX_like_TryX_NRX Tryparedoxin (TryX)-like family, TryX and nucleoredoxin (NRX) subfamily; TryX and NRX are thioredoxin (TRX)-like protein disulfide oxidoreductases that alter the redox state of target proteins via the reversible oxidation of an active center CXXC motif. TryX is involved in the regulation of oxidative stress in parasitic trypanosomatids by reducing TryX peroxidase, which in turn catalyzes the reduction of hydrogen peroxide and organic hydroperoxides. TryX derives reducing equivalents from reduced trypanothione, a polyamine peptide conjugate unique to trypanosomatids, which is regenerated by the NADPH-dependent flavoprotein trypanothione reductase. Vertebrate NRX is a 400-amino acid nuclear protein with one redox active TRX domain containing a CPPC active site motif followed by one redox inactive TRX-like domain. Mouse NRX transcripts are expressed in all adult tissues but is restricted to the nervous system and limb buds in embryos. Plant NRX, longer than the Probab=21.85 E-value=44 Score=12.45 Aligned_cols=13 Identities=23% Similarity=0.616 Sum_probs=7.7 Q ss_pred CEEECCCCEEEEE Q ss_conf 4699899808777 Q gi|254780425|r 462 SAFLDERGQIISS 474 (518) Q Consensus 462 S~~id~~G~i~~~ 474 (518) +.+||++|+|+++ T Consensus 103 ~~lId~~G~IV~~ 115 (131) T cd03009 103 LIILDADGEVVTT 115 (131) T ss_pred EEEECCCCCEEHH T ss_conf 8999999979817 No 85 >KOG2698 consensus Probab=21.33 E-value=44 Score=12.49 Aligned_cols=68 Identities=21% Similarity=0.232 Sum_probs=31.0 Q ss_pred CCEEEECCCCCCCCHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCEEEEEECCCCCCCCCCCCCCEEEEEHH--- Q ss_conf 61898312101100011234667765641079824201100123455751210000367321221112563342213--- Q gi|254780425|r 282 PVIIVWAYLPFPFSIVDQPSILKKIASVLKINQLLIVGSMRKELVDGRSHFYKSIYIINSKGEILASSNAKHLVPFA--- 358 (518) Q Consensus 282 ~dLiVWPEsa~p~~~~~~~~~~~~l~~~~~~~~~li~G~~~~~~~~~~~~~yNs~~~~~~~g~~~~~Y~K~~LVPFG--- 358 (518) -.|+--||-+......-...+.+.+.+..+ ++...++-|+ ..++ .|=+. ..--.||||||= T Consensus 85 qgLl~TP~R~aKaMl~Ftkgy~~~l~dvlN-------~AvfdedhdE-------mviv-kdiem-fSmCEHHLvPF~GkV 148 (247) T KOG2698 85 QGLLKTPERAAKAMLYFTKGYEQNLNDVLN-------EAVFDEDHDE-------MVIV-KDIEM-FSMCEHHLVPFYGKV 148 (247) T ss_pred HHHHCCHHHHHHHHHHHHHHHHHHHHHHHH-------HHCCCCCCCE-------EEEE-ECCHH-HHHHHHCCCCCEEEE T ss_conf 022218799999999998628878999852-------1203666652-------5888-50068-777664356501257 Q ss_pred --HHHCCCC Q ss_conf --3100100 Q gi|254780425|r 359 --EYLPYRN 365 (518) Q Consensus 359 --EyiP~~~ 365 (518) -|+|-.+ T Consensus 149 hIGY~pnkk 157 (247) T KOG2698 149 HIGYIPNKK 157 (247) T ss_pred EEEECCCCC T ss_conf 774226874 No 86 >TIGR00441 gmhA phosphoheptose isomerase; InterPro: IPR004515 Phosphoheptose isomerase is involved in lipopolysaccharide biosynthesis, and more specifically in the synthesis of glyceromannoheptose 7-phosphate. It may also have a role in virulence in Haemophilus ducreyi.; GO: 0008968 phosphoheptose isomerase activity, 0009244 lipopolysaccharide core region biosynthetic process, 0005737 cytoplasm. Probab=20.41 E-value=48 Score=12.24 Aligned_cols=47 Identities=21% Similarity=0.212 Sum_probs=35.5 Q ss_pred CCHHHHHHHHHHCC-CCCCEEEEECCCCCCCCCHHHHHHHHHHHHHHHCCCEEE Q ss_conf 21666558888512-699779996277557676289999999999997299199 Q gi|254780425|r 402 LFSDALFHQDINNL-ENVSAILNIIDDSGFMGSGTDHSFRYAQIQAVEIGLPLI 454 (518) Q Consensus 402 ICyE~~f~~~~r~~-~~a~~lvn~tNd~Wf~~s~~~qH~~~a~~RAiE~~r~~v 454 (518) =-||.+|..+++.+ +..|+|.-+|- =|+| .---.|.--|.|-+..+| T Consensus 91 ygyd~vF~RqVEAlG~~GDVL~GiST---SGNS---~NvlkA~~~Ak~~gm~~i 138 (186) T TIGR00441 91 YGYDEVFSRQVEALGQEGDVLLGIST---SGNS---KNVLKAIEAAKDKGMKTI 138 (186) T ss_pred HHHHHHHHHHHHHCCCCCCEEEEEEC---CCCC---HHHHHHHHHHHHCCCEEE T ss_conf 02567877776632889868887424---7670---889999998845796699 Done!