RPS-BLAST 2.2.22 [Sep-27-2009] Database: CddB 21,608 sequences; 5,994,473 total letters Searching..................................................done Query= gi|254780970|ref|YP_003065383.1| phosphoribosylformylglycinamidine synthase II [Candidatus Liberibacter asiaticus str. psy62] (737 letters) >gnl|CDD|179249 PRK01213, PRK01213, phosphoribosylformylglycinamidine synthase II; Provisional. Length = 724 Score = 1181 bits (3059), Expect = 0.0 Identities = 384/726 (52%), Positives = 477/726 (65%), Gaps = 7/726 (0%) Query: 9 EILTAHGLTKDECDQIIHILKRQPTLTEIGIISAMWNEHCSYKSSKKWLRTLPTTGKHVI 68 E+ GLT DE ++I IL R+P TE+G+ S MW+EHCSYKSSK LR PT G V+ Sbjct: 6 ELYAEMGLTDDEYERIREILGREPNFTELGMFSVMWSEHCSYKSSKPLLRKFPTKGPRVL 65 Query: 69 QGPGENAGVVDIGGGDCVVFKMESHNHPSYIEPYQGAATGVGGILRDIFTMGARPVAAMN 128 QGPGENAGVVDIG G VVFK+ESHNHPS +EPYQGAATGVGGILRDIF+MGARP+A ++ Sbjct: 66 QGPGENAGVVDIGDGQAVVFKIESHNHPSAVEPYQGAATGVGGILRDIFSMGARPIALLD 125 Query: 129 SLRFGAIHHPKTKHLLSGVVAGIAGYSNSFGVPTVGGEVEFLPCYNNNIIVNTFAAGIAK 188 SLRFG + HPKT++LL GVVAGI GY N GVPTVGGEV F YN N +VN G+ + Sbjct: 126 SLRFGELDHPKTRYLLEGVVAGIGGYGNCIGVPTVGGEVYFDESYNGNPLVNAMCVGLVR 185 Query: 189 TNAIFSSKARGIGLPLVYLGAKTGRDGIGGASMASEEFGENIAKKRPTVQVGDPFTGKCL 248 + I +KA G+G P+VY+GAKTGRDGIGGAS AS E E +KRP VQVGDPF K L Sbjct: 186 HDDIVLAKASGVGNPVVYVGAKTGRDGIGGASFASAELSEESEEKRPAVQVGDPFMEKLL 245 Query: 249 LEACLELMNTDAVIAIQDMGAAGLTCSAIEMGNQGNLGITLDLDKVPTCEEGMTAYEMML 308 +EACLEL+ T V+ IQDMGAAGLTCS+ EM +G LGI LDLDKVP EEGMT YE+ML Sbjct: 246 IEACLELIKTGLVVGIQDMGAAGLTCSSSEMAAKGGLGIELDLDKVPLREEGMTPYEIML 305 Query: 309 SESQERMLMILNPEKQHKAQEILNKWGLHFSIIGITTNDKLFRVIHRGEEVANLPIKALS 368 SESQERML+++ P K+ + I KW L ++IG T+D RV H GE VA++P +AL+ Sbjct: 306 SESQERMLLVVKPGKEEEVLAIFEKWDLDAAVIGEVTDDGRLRVYHHGEVVADVPAEALA 365 Query: 369 DEAPEYDRAWCEPITIPPSNPQLLCSDEDCTKSLLKLLSSPNLSSRRWVYEQYDTMIQSN 428 DEAP YDR + EP + ED ++LLKLLSSPN++S+ WVYEQYD +Q+N Sbjct: 366 DEAPVYDRPYKEPAYLDELQAD----PEDLKEALLKLLSSPNIASKEWVYEQYDHEVQTN 421 Query: 429 SIQLPGGDAGVIRVEGHETKALAFSSDVTPRYVKADPFEGTKQAVAECWRNIIATGAKPL 488 ++ PGGDA V+R+ G K LA ++D PRYV DP+EG K AVAE RN+ A GA PL Sbjct: 422 TVVKPGGDAAVLRIRG-GGKGLALTTDCNPRYVYLDPYEGAKLAVAEAARNLAAVGATPL 480 Query: 489 AITDNLNFGNPEKEEIMGQFVHSVKGIREACQILDFPIVSGNVSFYNETNGQSIFPTPTI 548 AITD LNFGNPEK E+M QFV +V+G+ +AC+ L P+V GNVS YNET G +I+PTP I Sbjct: 481 AITDCLNFGNPEKPEVMWQFVEAVRGLADACRALGTPVVGGNVSLYNETGGTAIYPTPVI 540 Query: 549 AGVGILPDYSLMTRIDSAHEGDLILMIGNDGCHLDCSMYSLECASSNIGPPPKVDCHLEK 608 VG++ D S T EGDLI ++G L S Y G PPKVD EK Sbjct: 541 GMVGLIDDVSKRTTSGFKKEGDLIYLLGETKDELGGSEYLKVIHGHVGGRPPKVDLEAEK 600 Query: 609 NHGFFVLSMINAKKITACHDISTGGLIITLAEMTISSAKGMDIILPIEIEKDPKPFLFGE 668 V I +T+ HD+S GGL + LAEM I+ G ++ L + P LF E Sbjct: 601 RLQELVREAIREGLVTSAHDVSEGGLAVALAEMAIAGGLGAEVDLSDGL--RPDALLFSE 658 Query: 669 DQGRYVVCISPENQDLVMSEANNKNIPLRYLGKVSGSTLSIHNILNIPVATLQTKYESWF 728 QGRYVV + PEN++ + A +P +G V G L + + L+ +E Sbjct: 659 SQGRYVVSVPPENEEAFEALAEAAGVPATRIGVVGGDALKVKGNDTESLEELREAWEGAL 718 Query: 729 PEFISE 734 P + Sbjct: 719 PRLLGG 724 >gnl|CDD|162510 TIGR01736, FGAM_synth_II, phosphoribosylformylglycinamidine synthase II. Phosphoribosylformylglycinamidine synthase is a single, long polypeptide in most Proteobacteria and eukarotes. Three proteins are required in Bacillus subtilis and many other species. This is the longest of the three and is designated PurL, phosphoribosylformylglycinamidine synthase II, or FGAM synthase II. Length = 715 Score = 962 bits (2488), Expect = 0.0 Identities = 346/721 (47%), Positives = 452/721 (62%), Gaps = 10/721 (1%) Query: 16 LTKDECDQIIHILKRQPTLTEIGIISAMWNEHCSYKSSKKWLRTLPTTGKHVIQGPGENA 75 L+ +E + I IL R+P TE+ + SAMW+EHCSYKSSKK L+ PT G +VIQGPGE+A Sbjct: 1 LSDEEMELIREILGREPNDTELAMFSAMWSEHCSYKSSKKLLKQFPTKGPNVIQGPGEDA 60 Query: 76 GVVDIGGGDCVVFKMESHNHPSYIEPYQGAATGVGGILRDIFTMGARPVAAMNSLRFGAI 135 GVVDIG G VVFKMESHNHPS IEPY GAATGVGGILRDI +MGARP+A ++SLRFG + Sbjct: 61 GVVDIGDGYAVVFKMESHNHPSAIEPYNGAATGVGGILRDILSMGARPIALLDSLRFGPL 120 Query: 136 HHPKTKHLLSGVVAGIAGYSNSFGVPTVGGEVEFLPCYNNNIIVNTFAAGIAKTNAIFSS 195 PK ++L GVVAGI+ Y N GVPTVGGEVEF YN N +VN G+ + + I + Sbjct: 121 DDPKNRYLFEGVVAGISDYGNRIGVPTVGGEVEFDESYNGNPLVNVMCVGLVRKDDIVTG 180 Query: 196 KARGIGLPLVYLGAKTGRDGIGGASMASEEFGENIAK-KRPTVQVGDPFTGKCLLEACLE 254 KA+G G LV +G KTGRDGIGGA+ ASEE E + RP VQVGDPFT K L+EA LE Sbjct: 181 KAKGPGNKLVLVGGKTGRDGIGGATFASEELSEEAEEEDRPAVQVGDPFTEKLLIEATLE 240 Query: 255 LMNTDAVIAIQDMGAAGLTCSAIEMGNQGNLGITLDLDKVPTCEEGMTAYEMMLSESQER 314 ++T V I+D+GAAGLT ++ EM +G LG + LDKVP E GMT YE+MLSESQER Sbjct: 241 AVDTGLVKGIKDLGAAGLTSASSEMAAKGGLGAEIYLDKVPLREPGMTPYEIMLSESQER 300 Query: 315 MLMILNPEKQHKAQEILNKWGLHFSIIGITTNDKLFRVIHRGEEVANLPIKALSDEAPEY 374 ML+++ PE + EI K+ L S+IG T++ R+ ++GE VA+LPI+ L+D APEY Sbjct: 301 MLLVVAPEDVEEVLEIFEKYELPASVIGEVTDEGRIRLYYKGEVVADLPIELLAD-APEY 359 Query: 375 DRAWCEPITIPPSNPQLLCSDEDCTKSLLKLLSSPNLSSRRWVYEQYDTMIQSNSIQLPG 434 +R P P + D + LK+LSSPN++S+ WVY QYD +Q+ ++ PG Sbjct: 360 ERPSEPP--KYPEEEKEPEPPADLEDAFLKVLSSPNIASKEWVYRQYDHEVQTRTVVKPG 417 Query: 435 GDAGVIRVEGHETKALAFSSDVTPRYVKADPFEGTKQAVAECWRNIIATGAKPLAITDNL 494 DA V+R++ LA ++D PRYV DP+ G AVAE +RN+ A GA+PLA D L Sbjct: 418 EDAAVLRIKETGKLGLALTADCNPRYVYLDPYAGAAGAVAEAYRNLAAVGAEPLAAVDCL 477 Query: 495 NFGNPEKEEIMGQFVHSVKGIREACQILDFPIVSGNVSFYNETNGQSIFPTPTIAGVGIL 554 NFGNPE+ E+ QFV +VKG+ +AC+ L P+V GNVS YNETNG I PTPTI VG++ Sbjct: 478 NFGNPERPEVYWQFVEAVKGLGDACRALGTPVVGGNVSLYNETNGVPIAPTPTIGMVGLV 537 Query: 555 PDYSLMTRIDSAHEGDLILMIGNDGCHLDCSMYSLECASSNIGPPPKVDCHLEKNHGFFV 614 D + + EGD I +IG L S Y G P VD EK V Sbjct: 538 EDVEKLLTSNFKKEGDAIYLIGETKDELGGSEYLRVIHGIVSGQVPAVDLEEEKELADAV 597 Query: 615 LSMINAKKITACHDISTGGLIITLAEMTISSAKGMDIILPIEIEKDPKPFLFGEDQGRYV 674 I A ++A HD+S GGL + LAEM +S G ++ + P LF E GR + Sbjct: 598 REAIRAGLVSAAHDVSRGGLAVALAEMAAASGIGAEVDIDEIASARPDELLFSESNGRAI 657 Query: 675 VCISPENQDLVMSEANNKNIPLRYLGKVSGSTLSI---HNILNIPVATLQTKYESWFPEF 731 V + E +K +P + +GK G L+I + +++ V L+ +E PE+ Sbjct: 658 VAVPEEKA---EEAVKSKGVPAKVIGKTGGDRLTIKTGDDTISVSVKELRDAWEEALPEY 714 Query: 732 I 732 + Sbjct: 715 M 715 >gnl|CDD|184499 PRK14090, PRK14090, phosphoribosylformylglycinamidine synthase II; Provisional. Length = 601 Score = 391 bits (1006), Expect = e-109 Identities = 204/534 (38%), Positives = 299/534 (55%), Gaps = 45/534 (8%) Query: 25 IHILK----RQPTLTEIGIISAMWNEHCSYKSSKKWLRTLPTTGKHVIQGPGENAGVVDI 80 ++IL+ R+PT E+ S MW+EHC Y +KK++R LP TG NAGVV++ Sbjct: 4 LNILEEKLGREPTFVELQAFSVMWSEHCGYSHTKKYIRRLPKTGF------EGNAGVVNL 57 Query: 81 GGGDCVVFKMESHNHPSYIEPYQGAATGVGGILRDIFTMGARPVAAMNSLRFGAIHHPKT 140 + FK+ESHNHPS IEPY GAATGVGGI+RD+ MGARP A +SL I Sbjct: 58 DDYYSIAFKIESHNHPSAIEPYNGAATGVGGIIRDVLAMGARPTAIFDSLHMSRI----- 112 Query: 141 KHLLSGVVAGIAGYSNSFGVPTVGGEVEFLPCYNNNIIVNTFAAGIAKTNAIFSSKARGI 200 + G++ GIA Y NS GVPTVGGE+ Y +N +VN AAG+ + + + SKA Sbjct: 113 ---IDGIIEGIADYGNSIGVPTVGGELRISSLYAHNPLVNVLAAGVVRNDMLVDSKASRP 169 Query: 201 GLPLVYLGAKTGRDGIGGASMASEEFGENIAKKRPTVQVGDPFTGKCLLEACLELMNTDA 260 G +V G TGRDGI GAS ASE+ A K ++QVGDPF K L+EA LE++ Sbjct: 170 GQVIVIFGGATGRDGIHGASFASEDLTGEKATKL-SIQVGDPFAEKMLIEAFLEMVEEGL 228 Query: 261 VIAIQDMGAAGLTCSAIEMGNQGNLGITLDLDKVPTCEEGMTAYEMMLSESQERMLMILN 320 V QD+GA G+ + E+ +G LG + LD+VP E M +E+++SESQERM ++ + Sbjct: 229 VEGAQDLGAGGVLSATSELVAKGGLGAIVHLDRVPLREPDMEPWEILISESQERMAVVTS 288 Query: 321 PEKQHKAQEILNKWGLHFSIIGITTNDKLFRVIHRGEEVANLPIKALSDEAPEYDRAWCE 380 PEK + EI K L I+ +D ++RV++R + V +P++ L+ APE + Sbjct: 289 PEKASRILEIAKKHLLFGDIVAEVIDDPIYRVMYRDDLVMEVPVQLLA-NAPEEEIVEYT 347 Query: 381 PITIPPSNPQLLCSDEDCTKSLLKLLSSPNLSSRRWVYEQYDTMIQSNSIQLPGGDAGVI 440 P I P ++ + + R V+EQYD M+ ++++ PG A V+ Sbjct: 348 PGEI-PEFKRVEFEEVNA----------------REVFEQYDHMVGTDTVLPPGFGAAVM 390 Query: 441 RVEGHETKALAFSS--DVTPRYVKADPFEGTKQAVAECWRNIIATGAKPLAITDNLNFGN 498 R++ +L S D+ D + GT AV E R ++ GA+PLAIT+ +N+G+ Sbjct: 391 RIKRDGGYSLVTHSRADLA----LQDTYWGTFIAVLESVRKTLSVGAEPLAITNCVNYGD 446 Query: 499 PEKEEIMGQFVHSVKGIREACQILDFPIVSGNVSFYNETNGQSIFPTPTIAGVG 552 P+ + +G + +++AC+ P+ SGN S YN G+ I PT + +G Sbjct: 447 PD-VDPVG-LSAMMTALKDACEFSGVPVASGNASLYNTYQGKPIPPTLVVGMLG 498 Score = 46.8 bits (111), Expect = 2e-05 Identities = 71/332 (21%), Positives = 128/332 (38%), Gaps = 50/332 (15%) Query: 431 QLP----GGDAGVIRVEGHETKALAFSSDVTPRYVKADPFEGTKQAVAECWRNIIATGAK 486 +LP G+AGV+ ++ + + A S P + +P+ G V R+++A GA+ Sbjct: 42 RLPKTGFEGNAGVVNLDDYYSIAFKIESHNHPSAI--EPYNGAATGVGGIIRDVLAMGAR 99 Query: 487 PLAITDNLNFGNPEKEEIMGQFVHS-VKGIREACQILDFPIVSGN--VSFYNETNGQSIF 543 P AI D+L+ M + + ++GI + + P V G +S N Sbjct: 100 PTAIFDSLH---------MSRIIDGIIEGIADYGNSIGVPTVGGELRISSLYAHN----- 145 Query: 544 PTPTIAGVGILPDYSLMTRIDSAHEGDLILMIGN----DGCHLDCSMYSLECA------- 592 P + G++ + L+ ++ G +I++ G DG H S S + Sbjct: 146 PLVNVLAAGVVRNDMLVDS-KASRPGQVIVIFGGATGRDGIH-GASFASEDLTGEKATKL 203 Query: 593 SSNIGPPPKVDCHLEKNHGFFVLSMINAKKITACHDISTGGLIITLAEMTISSAKGMDII 652 S +G P EK L M+ + D+ GG++ +E+ G + Sbjct: 204 SIQVGDP-----FAEKMLIEAFLEMVEEGLVEGAQDLGAGGVLSATSELVAKGGLGAIVH 258 Query: 653 L---PI-EIEKDPKPFLFGEDQGRYVVCISPENQDLVMSEANNKNIPLRYLGKV--SGST 706 L P+ E + +P L E Q R V SPE ++ A + + +V Sbjct: 259 LDRVPLREPDMEPWEILISESQERMAVVTSPEKASRILEIAKKHLLFGDIVAEVIDDPIY 318 Query: 707 LSIHN---ILNIPVATLQTKYESWFPEFISES 735 ++ ++ +PV L E E+ Sbjct: 319 RVMYRDDLVMEVPVQLLANAPEEEIVEYTPGE 350 >gnl|CDD|130796 TIGR01735, FGAM_synt, phosphoribosylformylglycinamidine synthase, single chain form. This model represents a single-molecule form of phosphoribosylformylglycinamidine synthase, also called FGAM synthase, an enzyme of purine de novo biosynthesis. This form is found mostly in eukaryotes and Proteobacteria. In Bacillus subtilis PurL (FGAM synthase II) and PurQ (FGAM synthase I), homologous to different parts of this model, perform the equivalent function; the unrelated small protein PurS is also required and may be a third subunit. Length = 1310 Score = 149 bits (378), Expect = 2e-36 Identities = 165/685 (24%), Positives = 282/685 (41%), Gaps = 103/685 (15%) Query: 86 VVFKMESHNHPSYIEPYQGAATGVGGILRDIFTM--GARPVAAMNSLRFGAIHHPKTKH- 142 ++ K+E+HNHP+ I P+ GA+TG GG +RD GA+P A + ++ P + Sbjct: 291 ILMKVETHNHPTAIAPFPGASTGAGGEIRDEGATGRGAKPKAGLTGFCVSNLNIPGLEQP 350 Query: 143 -----------------LLSGVVAGIAGYSNSFGVPTVGG------EVEFLPC-----YN 174 ++ + G A ++N FG P + G LP Y+ Sbjct: 351 WEDPFQKPERIASPLDIMIEAPL-GAAAFNNEFGRPNLLGYFRTFELKASLPGGQVRGYH 409 Query: 175 NNIIVNTFAAGIAKTNAIFSSKAR-GIGLPLVYLGAKTGRDGIGGASMASEEFGENIAK- 232 I+ A GI +A K G L+ LG G+GG + +S G N A Sbjct: 410 KPIM---LAGGIGSIDAEHIQKGEIEPGALLIVLGGPAMLIGLGGGAASSMVSGTNTADL 466 Query: 233 KRPTVQVGDP-FTGKC--LLEACLELMNTDAVIAIQDMGAAGLTCSAIEMGNQGNLGITL 289 +VQ G+P +C +++ C +L + +I+I D+GA GL+ + E+ + G G + Sbjct: 467 DFASVQRGNPEMERRCQEVIDRCWQLGEKNPIISIHDVGAGGLSNALPELIHDGGRGAVI 526 Query: 290 DLDKVPTCEEGMTAYEMMLSESQERMLMILNPEKQHKAQEILNKWGLHFSIIGITTNDKL 349 DL VP + G++ E+ +ESQER ++++ E I + F+++G T D Sbjct: 527 DLRAVPLDDPGLSPLEIWCNESQERYVLLVRAENLEIFTAICERERCPFAVVGTATGDGR 586 Query: 350 FRVI---------------HRGEEVANLPIKALSDEAPEYDRAWCEPITIPPSN-PQLLC 393 ++ H +LP++ L + P+ R P P + Sbjct: 587 LTLVDDTPVRRNGQGDAPSHFPNNPVDLPLEVLLGKMPKMTR---FVQRKAPMLQPLDIP 643 Query: 394 SDEDCTKSLLKLLSSPNLSSRRWVYEQYDTMI-----QSNSI---QLPGGDAGVIRVEGH 445 D ++L ++L P ++S+R++ D + + + Q P D V Sbjct: 644 PGLDLHEALERVLRLPAVASKRFLITIGDRSVGGLVARDQMVGPWQTPLADVAVT-AASF 702 Query: 446 ET---KALAFSSDVTPRYVKA--DPFEGTKQAVAECWRNIIATGAKPLA-ITDNLNF--- 496 +T +A+A R KA DP + AV E N+ A L+ + + N+ Sbjct: 703 DTYTGEAMAIGE----RPPKALLDPKASARLAVGEAITNLAAALVGDLSDVKLSANWMAA 758 Query: 497 -GNPEKEEIMGQFVHSVKGIREACQILDFPIVSGNVSFYNETNGQSIFPTPT-------- 547 G+P ++ + +VK + E C L I G S +T Q T + Sbjct: 759 AGHPGEDAALYD---AVKAVSELCPALGIAIPVGKDSLSMKTRWQDNGETKSVTAPGSLV 815 Query: 548 IAGVGILPDYSLMTRIDSAH---EGDLILMIGNDGCH-LDCSMYSLECASSNIGPPPKVD 603 I+ +PD D H + L+L+ G + L S + + G P +D Sbjct: 816 ISAFAPVPDVRKTVTPDLKHDKGDSHLLLVDLGPGKNRLGGSALA-QVFGQLGGDCPDLD 874 Query: 604 CHLEKNHGFF--VLSMINAKKITACHDISTGGLIITLAEMTISSAKGMDIILPIEIEKDP 661 E+ FF + ++ + A HD S GGL+ TL EM + G+D+ L + Sbjct: 875 -DPERLKAFFAVMQGLVAEGLLLAYHDRSDGGLVTTLLEMAFAGHCGLDVDLDALGDSL- 932 Query: 662 KPFLFGEDQGRYVVCISPENQDLVM 686 LF E+ G V+ ++ + V+ Sbjct: 933 FAVLFNEELG-AVIQVAKPDLAAVL 956 Score = 39.8 bits (93), Expect = 0.003 Identities = 39/170 (22%), Positives = 73/170 (42%), Gaps = 16/170 (9%) Query: 204 LVYLGAKTGRDGIGGASMASEEFGENIAKKRPTVQVGDPFTGKCLLEACLELMNTDAVIA 263 L+ + G++ +GG+++A + FG + P + DP K L+ ++A Sbjct: 842 LLLVDLGPGKNRLGGSALA-QVFG-QLGGDCPDLD--DPERLKAFFAVMQGLVAEGLLLA 897 Query: 264 IQDMGAAGLTCSAIEMGNQGNLGITLDLDKVPTCEEGMTAYEMMLSESQERMLMILNPEK 323 D GL + +EM G+ G+ +DLD + G + + ++ + E + ++ K Sbjct: 898 YHDRSDGGLVTTLLEMAFAGHCGLDVDLDAL-----GDSLFAVLFN---EELGAVIQVAK 949 Query: 324 QHKAQ--EILNKWGLHFSIIGIT--TNDKLFRVIHRGEEVANLPIKALSD 369 A E+L GL I+GI T + R+ G + + L D Sbjct: 950 PDLAAVLELLRAAGLTALILGIGTPTGHPMIRISVNGATLLSEKRSELRD 999 >gnl|CDD|130916 TIGR01857, FGAM-synthase, phosphoribosylformylglycinamidine synthase, clade II. This model represents a single-molecule form of phosphoribosylformylglycinamidine synthase, also called FGAM synthase, an enzyme of purine de novo biosynthesis. This model represents a second clade of these enzymes found in Clostridia, Bifidobacteria and Streptococcus species. This enzyme performs the fourth step in IMP biosynthesis (the precursor of all purines) from PRPP. Length = 1239 Score = 142 bits (359), Expect = 3e-34 Identities = 169/690 (24%), Positives = 287/690 (41%), Gaps = 83/690 (12%) Query: 78 VDIGGGD---CVVFKMESHNHPSYIEPYQGAATGVGGILRDIF------------TMGAR 122 VD+ G ++FK E+HNHP+ IEP+ GAAT +GG +RD T Sbjct: 303 VDVDGVKEPWLLMFKNETHNHPTEIEPFGGAATCIGGAIRDPLSGRSYVYQAMRVTGAGD 362 Query: 123 PVAAMNSLRFGAIHHPKTKHLLSGVVAGIAGYSNSFGVPTVGGEVE--FLPCY-NNNIIV 179 P ++ G + P+ K + + G + Y N G+ T G+V + P Y + V Sbjct: 363 PTVPISETLKGKL--PQRK-ITTTAAHGYSSYGNQIGLAT--GQVSEIYHPGYVAKRMEV 417 Query: 180 NTFAAGIAKTNAIFSSKARGIGLPLVYLGAKTGRDGIGGASMASEEFG-ENIAKKRPTVQ 238 A K N + G + L LG KTGRDGIGGA+ +S+E E++ VQ Sbjct: 418 GAVVAATPKENVVREKPEPGDVIIL--LGGKTGRDGIGGATGSSKEHTVESLELCGAEVQ 475 Query: 239 VGDPFTGKCLLEACLELMNTDAVIAIQ---DMGAAGLTCSAIEMGNQGNLGITLDLDKVP 295 G+ + + N + I+ D GA G++ + E+ + G+ +DL+KVP Sbjct: 476 KGNAPEERKIQRL---FRNGNVTRLIKKCNDFGAGGVSVAIGELAD----GLEIDLNKVP 528 Query: 296 TCEEGMTAYEMMLSESQERMLMILNPEKQHKAQEILNKWGLHFSIIGITTNDKLFRVIHR 355 EG+ E+ +SESQERM ++++PE N+ L +++ T + Sbjct: 529 KKYEGLNGTELAISESQERMAVVVSPEDVDAFLAYCNEENLEATVVATVTEKPRLVMNWN 588 Query: 356 GEEVANLPIKALSDEA--PEYDRAWCEPITIPPSNPQLLCSDEDCTKSLLKLLSSPNLSS 413 G+ + +L + L D + P Q S E + LK+LS N++S Sbjct: 589 GKTIVDLSRRFLDTNGVRQVIDAKVVDKDVKLPEERQKT-SAETLEEDWLKVLSDLNVAS 647 Query: 414 RRWVYEQYDTMIQSNSIQLPGG--------DAGVIR--VEGHETK-ALAFSSDVTPRYVK 462 ++ + E++D+ + + ++ +P G +A V + V G ET A A + P + Sbjct: 648 QKGLQERFDSSVGAGTVLMPLGGKYQLTPTEASVAKLPVLGGETHTASAIAWGFNPYIAE 707 Query: 463 ADPFEGTKQAVAECWRNIIATGA--KPLAITDNLNFGNPEKE-EIMGQFVHSVKGIREAC 519 P+ G AV E ++A GA K ++ F +K+ E G+ ++ G +A Sbjct: 708 WSPYHGAAYAVIESLAKLVAAGADYKKARLSFQEYFEKLDKDAERWGKPFAALLGAIKAQ 767 Query: 520 QILDFPIVSGNVSFYNETNGQSIFPTPTIAGVGILPDYSLMTRIDSAHEGDLILMIGNDG 579 L P + G S ++ PT V R+ S G Sbjct: 768 IDLGLPAIGGKDSMSGTFEELTVPPTLISFAVTTAN----SRRVISPEFKA----AGE-- 817 Query: 580 CHLDCSMYSLECASSNIGPPPKVDCHLEKNHGFFVLSMINAKKITACHDISTGGLIITLA 639 ++Y + + G D L K + + +I K+ + + GG+ +LA Sbjct: 818 -----NIYLIPGQALEDGTI---DFDLLKENFAQIEELIADHKVVSASAVKYGGVAESLA 869 Query: 640 EMTISSAKGMDIILPIEIEKDPKPFLFGEDQGRYVVCISPENQDLVMSEANNKNIPLRYL 699 +MT + G ++ P + LF G ++ SPE + E + ++ Sbjct: 870 KMTFGNRIGAELNNPELED------LFTAQYGSFIF-ESPEELSIANVEKIGQTTA-DFV 921 Query: 700 GKVSGSTLSIHNILNIPVATLQTKYESWFP 729 KV+G L + + + + K E FP Sbjct: 922 LKVNGEKLDLEEL----ESAWEGKLEEVFP 947 Score = 31.3 bits (71), Expect = 0.98 Identities = 9/19 (47%), Positives = 13/19 (68%) Query: 30 RQPTLTEIGIISAMWNEHC 48 R PT TEI ++ W++HC Sbjct: 204 RNPTETEIKVLDTYWSDHC 222 >gnl|CDD|179999 PRK05297, PRK05297, phosphoribosylformylglycinamidine synthase; Provisional. Length = 1290 Score = 93.3 bits (233), Expect = 2e-19 Identities = 107/375 (28%), Positives = 176/375 (46%), Gaps = 63/375 (16%) Query: 89 KMESHNHPSYIEPYQGAATGVGGILRDIFTM----GARPVAA----MNS-LRF-GAI--- 135 K+E+HNHP+ I P+ GAATG GG +RD G++P A S LR G Sbjct: 288 KVETHNHPTAISPFPGAATGSGGEIRD--EGATGRGSKPKAGLTGFSVSNLRIPGFEQPW 345 Query: 136 --HHPKTKHLLSGVVA---------GIAGYSNSFGVPTVGG-----EVEFLPCYNNNIIV 179 + K + + S A G A ++N FG P + G E + ++N V Sbjct: 346 EEDYGKPERIAS---ALDIMIEGPLGGAAFNNEFGRPNLLGYFRTFEQKV---NSHNEEV 399 Query: 180 NTF------AAGIAKTNAIFSSKAR-GIGLPLVYLGAKTGRDGIGGA---SMASEEFGEN 229 + A GI A K +G L+ LG R G+GG SMAS + E+ Sbjct: 400 RGYHKPIMLAGGIGNIRADHVQKGEIPVGAKLIVLGGPAMRIGLGGGAASSMASGQSSED 459 Query: 230 I--AKKRPTVQVGDP-FTGKC--LLEACLELMNTDAVIAIQDMGAAGLTCSAI-EMGNQG 283 + A +VQ G+P +C +++ C +L + + +++I D+GA GL +A E+ N G Sbjct: 460 LDFA----SVQRGNPEMERRCQEVIDRCWQLGDDNPILSIHDVGAGGL-SNAFPELVNDG 514 Query: 284 NLGITLDLDKVPTCEEGMTAYEMMLSESQERMLMILNPEKQHKAQEILNKWGLHFSIIGI 343 G DL K+P E GM+ E+ +ESQER ++ + PE + I + F+++G Sbjct: 515 GRGGRFDLRKIPNDEPGMSPLEIWCNESQERYVLAIAPEDLELFEAICERERCPFAVVGE 574 Query: 344 TTNDKLFRVI--HRGEEVANLPIKALSDEAPEYDRAWCEPITIPPSNPQLLCSDEDCTKS 401 T ++ + H + +LP+ L + P+ R + T+ P L S D ++ Sbjct: 575 ATEERHLTLEDSHFDNKPVDLPLDVLLGKPPKMHR---DVKTVKAKGPALDYSGIDLAEA 631 Query: 402 LLKLLSSPNLSSRRW 416 + ++L P ++S+ + Sbjct: 632 VERVLRLPTVASKSF 646 Score = 56.0 bits (136), Expect = 3e-08 Identities = 32/121 (26%), Positives = 56/121 (46%), Gaps = 11/121 (9%) Query: 611 GFF--VLSMINAKKITACHDISTGGLIITLAEMTISSAKGMDIILPIEIEKDPKPFLFGE 668 GFF + +++ + A HD S GGL+ TLAEM + G+DI L + D LF E Sbjct: 861 GFFNAIQALVAEGLLLAYHDRSDGGLLTTLAEMAFAGHCGLDIDLD-ALGDDALAALFNE 919 Query: 669 DQGRYVVCISPENQDLVMS--EANNKNIPLRYLGKV-SGSTLSIH----NILNIPVATLQ 721 + G V+ + ++D V + + + + +GK +G + I + + L+ Sbjct: 920 ELG-AVIQVRAADRDAVEAILAEHGLSDCVHVIGKPNAGDRIVITRNGKTVFSESRTELR 978 Query: 722 T 722 Sbjct: 979 R 979 Score = 39.8 bits (94), Expect = 0.002 Identities = 32/138 (23%), Positives = 49/138 (35%), Gaps = 12/138 (8%) Query: 235 PTVQVGDPFTGKCLLEACLELMNTDAVIAIQDMGAAGLTCSAIEMGNQGNLGITLDLDKV 294 P V D K A L+ ++A D GL + EM G+ G+ +DLD + Sbjct: 851 PDVD--DAEDLKGFFNAIQALVAEGLLLAYHDRSDGGLLTTLAEMAFAGHCGLDIDLDAL 908 Query: 295 PTCEEGMTAYEMMLSESQERMLMILNPEKQHKA-QEILNKWGL--HFSIIGITTNDKLFR 351 G A + +E E +I A + IL + GL +IG Sbjct: 909 -----GDDALAALFNE--ELGAVIQVRAADRDAVEAILAEHGLSDCVHVIGKPNAGDRIV 961 Query: 352 VIHRGEEVANLPIKALSD 369 + G+ V + L Sbjct: 962 ITRNGKTVFSESRTELRR 979 >gnl|CDD|177603 PHA03366, PHA03366, FGAM-synthase; Provisional. Length = 1304 Score = 82.8 bits (205), Expect = 3e-16 Identities = 64/264 (24%), Positives = 104/264 (39%), Gaps = 30/264 (11%) Query: 461 VKADPFEGTKQAVAEC-----------WRNIIATGAKPLAITDNLNFGNPEKEEIMGQFV 509 V+ DP G K A+ E +I T L++T + E+ Sbjct: 702 VQLDPILGAKYAIVEALTNLMLAPVANLEDITIT----LSVT--WPPTDQAASELY---- 751 Query: 510 HSVKGIREACQIL--DFPIVSGNVSFYNETNGQSIFPTPTIAGVGILPDYSLMTRI--DS 565 ++ +E C+ L +F S + S + Q TI P S R+ D Sbjct: 752 RALAACKEFCRELGVNFTFTSASSSPRQDQPPQPGPLFNTIVFTASAPVPSSTPRLTPDL 811 Query: 566 AHEGDLILMIG-NDGCHLDCSMYSLECASSNIGPPPKVDCHLEKNHGFFVLSMINAKKIT 624 G ++ + + L S++ G P + KN V +I+ + Sbjct: 812 KKPGSALVHLSISPEYTLAGSVFEQIFGLK-SGTLPDISPSYLKNLFRAVQHLISEGLVV 870 Query: 625 ACHDISTGGLIITLAEMTISSAKGMDIILPIEIEKDPKPFLFGEDQGRYVVCISPENQDL 684 + HD+S GGLI LAEM ++ +G+ I +P DP FLF E G V+ + P + Sbjct: 871 SGHDVSDGGLIACLAEMALAGGRGVTITVPAGE--DPLQFLFSETPG-VVIEVPPSHLSA 927 Query: 685 VMSEANNKNIPLRYLGKVSGSTLS 708 V++ ++NI +G V S S Sbjct: 928 VLTRLRSRNIICYPIGTVGPSGPS 951 Score = 50.0 bits (120), Expect = 2e-06 Identities = 25/115 (21%), Positives = 40/115 (34%), Gaps = 10/115 (8%) Query: 248 LLEACLELMNTDAVIAIQDMGAAGLTCSAIEMGNQGNLGITLDLDKVPTCEEGMTAYEMM 307 L A L++ V++ D+ GL EM G G+T+ VP G + + Sbjct: 856 LFRAVQHLISEGLVVSGHDVSDGGLIACLAEMALAGGRGVTIT---VP---AGEDPLQFL 909 Query: 308 LSESQERMLMILNPEKQHKAQEILNKWGLHFSIIGITTN---DKLFRVIHRGEEV 359 SE+ ++ + P L + IG F V H G + Sbjct: 910 FSETPGVVIEV-PPSHLSAVLTRLRSRNIICYPIGTVGPSGPSNTFSVSHNGTVL 963 >gnl|CDD|178745 PLN03206, PLN03206, phosphoribosylformylglycinamidine synthase; Provisional. Length = 1307 Score = 69.0 bits (169), Expect = 4e-12 Identities = 99/423 (23%), Positives = 169/423 (39%), Gaps = 86/423 (20%) Query: 85 CVVFKMESHNHPSYIEPYQGAATGVGGILRDIFT--MGARPVAA-----MNSLRFGAIHH 137 ++ E+HN P + PY GA TG GG +RD G+ VA + +LR + Sbjct: 287 DILLTAETHNFPCAVAPYPGAETGAGGRIRDTHATGRGSFVVAGTAGYCVGNLRIEGSYA 346 Query: 138 PKTKH-------------LLSGVVAGIAGYSNSFGVPTVGGEVE-F---LPC-----YNN 175 P +L G + Y N FG P + G F LP + Sbjct: 347 PWEDSSFVYPSNLASPLQILIDASNGASDYGNKFGEPLIQGYTRTFGMRLPNGERREWLK 406 Query: 176 NIIVNTFAAGIAKTNAIFSSKARG-IGLPLVYLGAKTGRDGIGGASMASEEFGENIAK-K 233 I+ F+ GI + + +K IG+ +V +G R G+GG + +S G+N A+ Sbjct: 407 PIM---FSGGIGQIDHTHLTKGEPDIGMLVVKIGGPAYRIGMGGGAASSMVSGQNDAELD 463 Query: 234 RPTVQVGDPFTGKCL---LEACLELMNTDAVIAIQDMGAAGLTCSA----IEMGNQGNLG 286 VQ GD + L + AC+E+ + +++I D GA G C+ I G Sbjct: 464 FNAVQRGDAEMSQKLYRVVRACVEMGEDNPIVSIHDQGAGG-NCNVVKEIIY-----PKG 517 Query: 287 ITLDLDKVPTCEEGMTAYEMMLSESQERMLMILNPEKQHKAQEILNKWGLHFSIIG-ITT 345 +D+ V + ++ E+ +E QE+ +++ PE + Q I ++ ++IG I Sbjct: 518 AEIDIRAVVVGDHTLSVLEIWGAEYQEQDALLIKPESRDLLQSICDRERCSMAVIGTIDG 577 Query: 346 NDKLFRVIHRGEEVANLPIKALSDEAP------------------EYDRA--WCEPITIP 385 + ++ V+ L P E+ R EP+ IP Sbjct: 578 SGRV--VLVDSAAPEKCEANGLPPPPPAVDLDLEKVLGDMPQKTFEFKRVANKLEPLDIP 635 Query: 386 PSNPQLLCSDEDCTKSLLKLLSSPNLSSRRWVYEQYDTMI-----QSNSI---QLPGGDA 437 P +L ++L P++ S+R++ + D + Q ++ Q+P D Sbjct: 636 PG--------ITVMDALKRVLRLPSVCSKRFLTTKVDRCVTGLVAQQQTVGPLQIPLADV 687 Query: 438 GVI 440 VI Sbjct: 688 AVI 690 Score = 63.2 bits (154), Expect = 2e-10 Identities = 31/94 (32%), Positives = 49/94 (52%), Gaps = 6/94 (6%) Query: 614 VLSMINAKKITACHDISTGGLIITLAEMTISSAKGMDIILPIEIEKDPKPF--LFGEDQG 671 +I + I+A HDIS GGL++TL EM + G+++ LP F LF E+ G Sbjct: 873 TQDLIAKRLISAGHDISDGGLVVTLLEMAFAGNCGINVDLP---SSGHSAFETLFAEELG 929 Query: 672 RYVVCISPENQDLVMSEANNKNIPLRYLGKVSGS 705 V+ +S +N D VM + + +G+V+ S Sbjct: 930 -LVLEVSRKNLDAVMEKLAAAGVTAEVIGQVTAS 962 Score = 40.5 bits (95), Expect = 0.001 Identities = 39/166 (23%), Positives = 73/166 (43%), Gaps = 10/166 (6%) Query: 204 LVYLGAKTGRDGIGGASMASEEFGENIAKKRPTVQVGDPFTGKCLLEACLELMNTDAVIA 263 L+++ G+ +GG+++A + + + I P + D K EA +L+ + A Sbjct: 829 LLHVDLGKGKRRLGGSALA-QAYDQ-IGDDCPDLD--DVAYLKKAFEATQDLIAKRLISA 884 Query: 264 IQDMGAAGLTCSAIEMGNQGNLGITLDLDKVPTCEEGMTAYEMMLSESQERMLMILNPEK 323 D+ GL + +EM GN GI +DL G +A+E + +E + +++ ++ + Sbjct: 885 GHDISDGGLVVTLLEMAFAGNCGINVDLP-----SSGHSAFETLFAE-ELGLVLEVSRKN 938 Query: 324 QHKAQEILNKWGLHFSIIGITTNDKLFRVIHRGEEVANLPIKALSD 369 E L G+ +IG T L V G + +L D Sbjct: 939 LDAVMEKLAAAGVTAEVIGQVTASPLIEVKVDGATCLSEKTASLRD 984 >gnl|CDD|162512 TIGR01739, tegu_FGAM_synt, herpesvirus tegument protein/v-FGAM-synthase. This model describes a family of large proteins of herpesvirues. The protein is described variably as tegument protein or phosphoribosylformylglycinamidine synthase (FGAM-synthase). Most of the length of the protein shows homology to eukaryotic FGAM-synthase. Functional characterizations were not verified during construction of this model. Length = 1202 Score = 65.9 bits (161), Expect = 4e-11 Identities = 60/298 (20%), Positives = 115/298 (38%), Gaps = 37/298 (12%) Query: 458 PRYVKADPFEGTKQAVAEC-----------WRNIIATGAKPLAITDNLNFGNPEKEEIMG 506 V+ADP G A+ E ++I T + + TD+ + Sbjct: 603 TYKVQADPKRGATYAITEALLNLSLSPWNTLEDVIITLSVTWSPTDH------VYSLLK- 655 Query: 507 QFVHSVKGIREACQILDFPIVSGNVSFYNETNGQSIFPTPTIAGVGILPDYSLMTRI--D 564 +++ ++ C+ L + + + T P +I P +I D Sbjct: 656 ---DALRACKDFCEELGVSFTVTSAA-SSPTQDSGSAPFMSIVFSASCPVLLSAKKITPD 711 Query: 565 SAHEGDLILMIG-NDGCHLDCSMYSLECASSNIGPPPKVDCHLEKNHGFFVLSMINAKKI 623 G ++ + + L S++ + + P + K + +++ I Sbjct: 712 LKSHGSHLIWLSLHPSYTLAGSIFE-QILGLSFIRLPALSPVSLKKLLSALQTLVKEGVI 770 Query: 624 TACHDISTGGLIITLAEMTISSAKGMDIILPIEIEKDPKPFLFGEDQGRYVVCISPENQD 683 + HD+S GGL+ +AEM +S KG+ I LP DP FL E G V+ + P + Sbjct: 771 VSGHDVSDGGLVACVAEMALSGGKGVRITLP-HGT-DPLEFLCSETPG-VVIEVDPSSMY 827 Query: 684 LVMSEANNKNIPLRYLGKVSG----STLSI-HN---ILNIPVATLQTKYESWFPEFIS 733 V+ ++ + + +G+V T S+ HN + P++ LQ + S+ E + Sbjct: 828 AVLQFLRSEGLVFQVIGRVGESGPSPTFSVVHNSTVLFQEPLSLLQGTWRSFSDEENT 885 Score = 42.8 bits (101), Expect = 4e-04 Identities = 30/145 (20%), Positives = 54/145 (37%), Gaps = 10/145 (6%) Query: 226 FGENIAKKRPTVQVGDPFTGKCLLEACLELMNTDAVIAIQDMGAAGLTCSAIEMGNQGNL 285 F + + + P + K LL A L+ +++ D+ GL EM G Sbjct: 735 FEQILGLSFIRLPALSPVSLKKLLSALQTLVKEGVIVSGHDVSDGGLVACVAEMALSGGK 794 Query: 286 GITLDLDKVPTCEEGMTAYEMMLSESQERMLMILNPEKQHKAQEILNKWGLHFSIIGITT 345 G+ + L E + SE+ +++ ++P + + L GL F +IG Sbjct: 795 GVRITLPHGT------DPLEFLCSETPG-VVIEVDPSSMYAVLQFLRSEGLVFQVIGRVG 847 Query: 346 N---DKLFRVIHRGEEVANLPIKAL 367 F V+H + P+ L Sbjct: 848 ESGPSPTFSVVHNSTVLFQEPLSLL 872 >gnl|CDD|132311 TIGR03267, methan_mark_2, putative methanogenesis marker protein 2. A single member of this protein family is found in each of the first ten complete genome sequences of archaeal methanogens, and nowhere else. Sequence similarity to various bacterial proteins is reflected in Pfam models pfam00586 and pfam02769, AIR synthase related protein N-terminal and C-terminal domains, respectively. The functions of proteins in this family are unknown, but their role is likely one essential to methanogenesis. Length = 323 Score = 58.3 bits (141), Expect = 8e-09 Identities = 68/302 (22%), Positives = 115/302 (38%), Gaps = 49/302 (16%) Query: 61 PTTGKHVIQGPGENAGVVDIGGGDCVVFK--------MESHNHPSYIEPYQGAATGVGGI 112 T +VI G++A + IGG D ++ +++ +P+ V Sbjct: 33 VTYEGNVIVDFGDDAAAIKIGGDDILLLAADGIWGKLLDA-------DPWWAGYCAVLVN 85 Query: 113 LRDIFTMGARPVAAMNSLRFGAIHHPKT-KHLLSGVVAGIAGYSNSFGVPTVGGEVEFLP 171 + DI MG +PV +N L +I+ + +L G+ G FGVP VGG Sbjct: 86 VNDIAAMGGKPVGMVNVL---SINDVDVCREVLEGMREGA----WKFGVPVVGGHTHPDT 138 Query: 172 CYNNNIIVNTFAAGIAKTNAIFSSKARGIGLPLVYLGAKTGRDGIGGASMASEEFGENIA 231 YN ++ GIAK + I S G +++ GR Sbjct: 139 PYN---ALDVAIVGIAKEDCIIRSDTAKPGDLIIFAIDLDGR----------------PY 179 Query: 232 KKRP----TVQVGDPFTGKCLLEACLELMNTDAVIAIQDMGAAGLTCSAIEMGNQGNLGI 287 P T + P + ++A +E+ V A +D+ GL + + +G Sbjct: 180 PSFPLNWDTTTMKSPDYLRAQMDAVVEIAERKLVKAGKDISNPGLIGTLGMLLEASRVGA 239 Query: 288 TLDLDKVPTCEE-GMTAYEMMLSESQERMLMILNPEKQHKAQEILNKWGLHFSIIGITTN 346 +DL+ +P E+ M + M S ++ +PE + +L GL S+IG Sbjct: 240 EVDLESIPKPEDVDMVTWLKMYPGSG--FVLTADPENVREIVRVLEDAGLTASVIGEVIE 297 Query: 347 DK 348 D Sbjct: 298 DG 299 Score = 57.5 bits (139), Expect = 1e-08 Identities = 61/253 (24%), Positives = 104/253 (41%), Gaps = 36/253 (14%) Query: 417 VYEQYDTMIQSNSIQLPGGDAGVIRVEGHETKALAFSSD-VTPRYVKADPFEGTKQAVAE 475 + E D + N I G DA I++ G L ++D + + + ADP+ AV Sbjct: 27 ILEPLDVTYEGNVIVDFGDDAAAIKIGG--DDILLLAADGIWGKLLDADPWWAGYCAVLV 84 Query: 476 CWRNIIATGAKPLAITDNLNFGNPEK-EEIMGQFVHSVKGIREACQILDFPIVSGNVSFY 534 +I A G KP+ + + L+ + + E++ +G+RE P+V G+ Sbjct: 85 NVNDIAAMGGKPVGMVNVLSINDVDVCREVL-------EGMREGAWKFGVPVVGGHTHPD 137 Query: 535 NETNGQSIFPTPTIAGVGILPDYSLMTRIDSAHEGDLILMIGNDGCHLDCSMYS---LEC 591 N + I G+ + R D+A GDLI+ LD Y L Sbjct: 138 TPYNALDV----AIVGIA---KEDCIIRSDTAKPGDLIIF----AIDLDGRPYPSFPLNW 186 Query: 592 ASSNIGPPPKVDCHLEKNHGFFVLSMINAKKI-TACHDISTGGLIITLAEMTISSAKGMD 650 ++ + P + ++ + I +K+ A DIS GLI TL + +S G + Sbjct: 187 DTTTMKSPDYLRAQMD------AVVEIAERKLVKAGKDISNPGLIGTLGMLLEASRVGAE 240 Query: 651 IILPIEIEKDPKP 663 +++E PKP Sbjct: 241 ----VDLESIPKP 249 >gnl|CDD|162715 TIGR02124, hypE, hydrogenase expression/formation protein HypE. This family contains HypE (or HupE), a protein required for expression of catalytically active hydrogenase in many systems. It appears to be an accessory protein involved in maturation rather than a regulatory protein involved in expression. HypE shows considerable homology to the thiamine-monophosphate kinase ThiL (TIGR01379) and other enzymes. Length = 320 Score = 49.2 bits (118), Expect = 4e-06 Identities = 69/274 (25%), Positives = 110/274 (40%), Gaps = 49/274 (17%) Query: 436 DAGVIRVEGHETKALAFSSD---VTPRYVKADPFEGT---KQAVAECWRNIIATGAKPLA 489 DA V+ + G LAFS+D V P + F G K AV ++ +GAKPL Sbjct: 23 DAAVLELSG---GRLAFSTDSFVVDPLF-----FPGGDIGKLAVCGTVNDVAVSGAKPLY 74 Query: 490 ITDNL----NFGNPEKEEIMGQFVHSVKGIREACQILDFPIVSGNVSFYNETNGQSIFPT 545 ++ F + E I VK + EA + IV+G+ + IF Sbjct: 75 LSCGFILEEGFPIEDLERI-------VKSMAEAARKAGVKIVTGDTKVVEKGKADGIFIN 127 Query: 546 PTIAGVGILPDYSLMTRIDSAHEGDLILM---IGNDGCHLDCSMYSLECASSNIGPPPKV 602 T G+G++P + + GD I++ IG+ G + L + Sbjct: 128 TT--GIGVIP-SGIPISAHNLQPGDKIIVSGTIGDHGAAILAVREGLG-----FETNLES 179 Query: 603 DCHLEKNHGFFVLSMINA-KKITACHDISTGGLIITLAEMTISSAKGMDI---ILPIEIE 658 DC V +++NA + A D + GGL L E +S G+ I +P++ E Sbjct: 180 DCAPLNG---LVETLLNAGPAVHAMRDATRGGLAAVLNEWAQASGVGIVIEEEKIPVKEE 236 Query: 659 KDPKPFLFGED------QGRYVVCISPENQDLVM 686 L G D +G+ V+ + PE + V+ Sbjct: 237 VKGACELLGLDPLYLANEGKLVLAVPPEAAEKVL 270 Score = 36.1 bits (84), Expect = 0.030 Identities = 72/307 (23%), Positives = 119/307 (38%), Gaps = 41/307 (13%) Query: 65 KHVIQGPGENAGVVDIGGGDCVVFKMESHNHPSYIEPY--QGAATG---VGGILRDIFTM 119 + I E+A V+++ GG + F +S ++P G G V G + D+ Sbjct: 14 GNEILAAMEDAAVLELSGGR-LAFSTDSF----VVDPLFFPGGDIGKLAVCGTVNDVAVS 68 Query: 120 GARPVAAMNSLRFGAIHHPKTKHLLSGVVAGIAGYSNSFGVPTVGGEVEFLPCYN-NNII 178 GA+P+ + L +V +A + GV V G+ + + + I Sbjct: 69 GAKPLYLSCGF---ILEEGFPIEDLERIVKSMAEAARKAGVKIVTGDTKVVEKGKADGIF 125 Query: 179 VNTFAAGIAKTNAIFSSKARGIGLPLVYLGAKTGRDGIGGASMASE-EFGENIAKKRPTV 237 +NT G+ + S+ G ++ G G G ++ F N+ Sbjct: 126 INTTGIGVIPSGIPISAHNLQPGDKIIVSGT-IGDHGAAILAVREGLGFETNL------- 177 Query: 238 QVGD--PFTG--KCLLEACLELMNTDAVIAIQDMGAAGLTCSAIEMGNQGNLGITLDLDK 293 D P G + LL A AV A++D GL E +GI ++ +K Sbjct: 178 -ESDCAPLNGLVETLLNAGP------AVHAMRDATRGGLAAVLNEWAQASGVGIVIEEEK 230 Query: 294 VPTCEEGMTAYEMM----LSESQERML-MILNPEKQHKAQEIL--NKWGLHFSIIGITTN 346 +P EE A E++ L + E L + + PE K EIL + +G +IIG Sbjct: 231 IPVKEEVKGACELLGLDPLYLANEGKLVLAVPPEAAEKVLEILKSHPYGKDAAIIGEVVE 290 Query: 347 DKLFRVI 353 K RV+ Sbjct: 291 RKEGRVV 297 >gnl|CDD|162083 TIGR00878, purM, phosphoribosylaminoimidazole synthetase. This enzyme is found as a homodimeric monofunctional protein in prokaryotes and as part of a larger, multifunctional protein, sometimes with two copies of this enzyme in tandem, in eukaryotes. Length = 332 Score = 41.2 bits (97), Expect = 0.001 Identities = 30/116 (25%), Positives = 50/116 (43%), Gaps = 18/116 (15%) Query: 472 AVAECWRNIIATGAKPLAITDNLNFGNPEKEEIMGQFVHSVKGIREACQILDFPIVSGNV 531 VA +++ GA+PL D L G + E+ Q VKGI E C+ +V G Sbjct: 81 LVAMNVNDLLVQGAEPLFFLDYLAVGKLD-PEVASQI---VKGIAEGCKQAGCALVGGET 136 Query: 532 S----FYNETNGQSIFPTPTIAG--VGILPDYSLMTRIDSAHEGDLILMIGNDGCH 581 + Y + +AG VG++ ++T GD+++ +G+ G H Sbjct: 137 AEMPGMYRGGHYD-------LAGTAVGVVEKDEIITGEKVK-PGDVLIGLGSSGIH 184 Score = 30.0 bits (68), Expect = 2.4 Identities = 37/155 (23%), Positives = 53/155 (34%), Gaps = 42/155 (27%) Query: 62 TTGKHVIQGPGENAGVVDIGGGDCVVFKMESHNHPSYIEPYQGAAT-GVGGILR------ 114 T V+ G G AG+ D+G Y EP + T GVG L Sbjct: 27 TRRPEVMGGLGGFAGLFDLGDK--------------YKEPVLVSGTDGVGTKLLVAEAMN 72 Query: 115 ---------------DIFTMGARPVAAMNSLRFGAIHHPKTKHLLSGVVAGIAGYSNSFG 159 D+ GA P+ ++ L G + + S +V GIA G Sbjct: 73 KHDTIGIDLVAMNVNDLLVQGAEPLFFLDYLAVGKL----DPEVASQIVKGIAEGCKQAG 128 Query: 160 VPTVGGEVEFLP-CYNNNII-VNTFAAGIAKTNAI 192 VGGE +P Y + A G+ + + I Sbjct: 129 CALVGGETAEMPGMYRGGHYDLAGTAVGVVEKDEI 163 >gnl|CDD|162329 TIGR01379, thiL, thiamine-monophosphate kinase. Proteins scoring between the trusted and noise cutoff for this model include short forms from the Thermoplasmas (which lack the N-terminal region) and a highly derived form from Campylobacter jejuni. Eukaryotes lack this enzyme, and add pyrophosphate from ATP to unphosphorylated thiamine in a single step. Length = 317 Score = 38.5 bits (90), Expect = 0.006 Identities = 35/146 (23%), Positives = 56/146 (38%), Gaps = 21/146 (14%) Query: 433 PGGDAGVIRVEGHETKALAFSSD--VTPRYVKAD--PFEGTKQAVAECWRNIIATGAKPL 488 G DA + V E + L ++D V + D P + +AVA ++ A GA P Sbjct: 23 IGDDAAL--VSAPEGRDLVLTTDTLVEGVHFPPDTTPEDLGWKAVAVNLSDLAAMGATPK 80 Query: 489 AITDNLNFGNPEK--EEIMGQFVHSVKGIREACQILDFPIVSGNVSFYNETNGQSIFPTP 546 L+ G P E + F G+ E + P+V G+ + + Sbjct: 81 WFL--LSLGLPSDLDEAWLEAFY---DGLFECAKQYGVPLVGGDT---VSSPELVV---- 128 Query: 547 TIAGVGILPDYSLMTRIDSAHEGDLI 572 T+ +G P + R A GDL+ Sbjct: 129 TVTAIGEAPKGRALLR-SGAKPGDLV 153 Score = 35.4 bits (82), Expect = 0.057 Identities = 62/251 (24%), Positives = 92/251 (36%), Gaps = 41/251 (16%) Query: 61 PTTGKHVIQGPGENAGVVDIGGGDCVVFK---MESHNH-PSYIEPYQGAATGVGGILRDI 116 V G G++A +V G +V + H P P V L D+ Sbjct: 13 LVQDPDVALGIGDDAALVSAPEGRDLVLTTDTLVEGVHFPPDTTPEDLGWKAVAVNLSDL 72 Query: 117 FTMGARPVAAMNSLRFGAIHHPKTKHLLSGVVAGIAGYSNSFGVPTVGGEVEFLPCYNNN 176 MGA P + SL + + L G+ + +GVP VGG+ P Sbjct: 73 AAMGATPKWFLLSL---GLPSDLDEAWLEAFYDGLFECAKQYGVPLVGGDTVSSP----E 125 Query: 177 IIVNTFAAGIAKTN-AIFSSKARGIGLPLVYLGAKTGRDGIGGASMA---------SEEF 226 ++V A G A A+ S A+ G LV++ TG G A +A EE Sbjct: 126 LVVTVTAIGEAPKGRALLRSGAK-PG-DLVFV---TGTLGDSAAGLALLLKGKKEPDEED 180 Query: 227 GENIAKK--RPTVQVGDPFTGKCLLEACLELMNTDAVIAIQDMGAAGLTCSAIEMGNQGN 284 E + ++ RP +V A N A I + D GL + Sbjct: 181 DEELLQRHLRPEPRVE-------EGLALAGYAN--AAIDVSD----GLASDLGHIAEASG 227 Query: 285 LGITLDLDKVP 295 +GI +DLD++P Sbjct: 228 VGIVIDLDRLP 238 >gnl|CDD|180049 PRK05385, PRK05385, phosphoribosylaminoimidazole synthetase; Provisional. Length = 327 Score = 35.5 bits (83), Expect = 0.049 Identities = 18/57 (31%), Positives = 28/57 (49%), Gaps = 4/57 (7%) Query: 473 VAECWRNIIATGAKPLAITDNLNFGNPEKEEIMGQFVHSVKGIREACQILDFPIVSG 529 VA C +++ GA+PL D + G + E+ Q VKGI E C+ ++ G Sbjct: 82 VAMCVNDLLVQGAEPLFFLDYIATGKLD-PEVAAQV---VKGIAEGCEQAGCALIGG 134 >gnl|CDD|161898 TIGR00476, selD, selenium donor protein. In prokaryotes, the incorporation of selenocysteine as the 21st amino acid, encoded by TGA, requires several elements: SelC is the tRNA itself, SelD acts as a donor of reduced selenium, SelA modifies a serine residue on SelC into selenocysteine, and SelB is a selenocysteine-specific translation elongation factor. 3-prime or 5-prime non-coding elements of mRNA have been found as probable structures for directing selenocysteine incorporation. This model describes SelD, known as selenophosphate synthetase, selenium donor protein, and selenide,water dikinase. SelD provides reduced selenium for the selenium transferase SelA. This protein itself contains selenocysteine in many species; any sequence not aligning to the beginning of the model is likely have selenocysteine residue incorrectly interpreted as a stop codon upstream of the given sequence. The SelD protein also provides selenophosphate for the enzyme tRNA 2-selenouridine synthase, which catalyzes a tRNA base modification. All genomes with SelD should make selenocysteine, selenouridine, or both. Length = 347 Score = 35.6 bits (82), Expect = 0.051 Identities = 43/238 (18%), Positives = 77/238 (32%), Gaps = 17/238 (7%) Query: 72 GENAGVVDIGGGDCVVFKMESHNHPSYIEPYQGAATGVGGILRDIFTMGARPVAAMNSLR 131 G++A + G +V + + P +PY L DI+ MG P+ AM L Sbjct: 50 GDDAVIYLRHNGLSLV-QTTDYITPIVDDPYMMGRIAAANALSDIYAMGGTPIDAM--LI 106 Query: 132 FGAIHHPKTKHLLSGVVAGIAGYSNSFGVPTVGGEVEFLPCYNNNIIVNTFAAGIAKTNA 191 + + T ++ V+ G G GG N + G+ Sbjct: 107 LLGVSNKLTIEVMREVIQGFKDACREAGTSLTGGHT----ILNPWPVFGGAVTGVCPEEE 162 Query: 192 IFSSKARGIGLPLVYLGAKTGRDGIGGASMASEEFGE----NIAKKRPTVQVGDPFTGKC 247 + + +G L+ T G A + +I K+ + + + Sbjct: 163 VITPSGAQVGDVLIL----TKPLGTQVAVAVHKWLDIPKDLSITKEEVELAINEAIENMA 218 Query: 248 LL-EACLELMNTDAVIAIQDMGAAGLTCSAIEMGNQGNLGITLDLDKVPTCEEGMTAY 304 L LM+T A D+ G+ A M N+ + ++ +P M Sbjct: 219 RLNRKAAGLMHTFNAHAATDITGFGILGHAQNMAKNSNVEARIVIENLP-VLAKMEEL 275 >gnl|CDD|185502 PTZ00182, PTZ00182, 3-methyl-2-oxobutanate dehydrogenase; Provisional. Length = 355 Score = 35.0 bits (81), Expect = 0.076 Identities = 12/36 (33%), Positives = 19/36 (52%), Gaps = 6/36 (16%) Query: 640 EMTISSA--KGMDIILPIEIEKDPKPFLFGEDQGRY 673 +M + A +D E+ +DPK F+ GED +Y Sbjct: 34 KMNVREAINSALDE----ELARDPKVFVLGEDVAQY 65 >gnl|CDD|180226 PRK05731, PRK05731, thiamine monophosphate kinase; Provisional. Length = 318 Score = 32.1 bits (74), Expect = 0.53 Identities = 67/301 (22%), Positives = 109/301 (36%), Gaps = 56/301 (18%) Query: 433 PGGDAGVIRVEGHETKALAFSSD--VTPRYVKAD---PFE-GTKQAVAECWRN---IIAT 483 G DA ++ + L S+D V + + D P + G K A+A N + A Sbjct: 24 IGDDAALLGPPPGQR--LVVSTDMLVEGVHFRPDWSSPEDLGYK-ALA---VNLSDLAAM 77 Query: 484 GAKPLAITDNLNFGNPEK--EEIMGQFVHSVKGIREACQILDFPIVSGNVSFYNETNGQS 541 GA+P A L P+ E + G+ E ++ G+ T G Sbjct: 78 GARPAAFL--LALALPKDLDEAWLEAL---ADGLFELADRYGAELIGGDT-----TRGPD 127 Query: 542 IFPTPTIAGVGILPDYSLMTRIDSAHEGDLILMIGNDGCHLDCSMYSLECASSNIGPPPK 601 + + T +G +P + R A GDL+ + G G S L + + P Sbjct: 128 LSISVTA--IGDVPGGRALRR-SGAKPGDLVAVTGTLG----DSAAGLALLLNGLRVPDA 180 Query: 602 VDCHLEKNH-------GFFVLSMINAKKITACHDISTGGLIITLAEMTISSAKGMDIIL- 653 L H G A +A DIS GL L + +S G DI L Sbjct: 181 DAAALISRHLRPQPRVGLG---QALAGLASAAIDISD-GLAADLGHIAEASGVGADIDLD 236 Query: 654 PIEIEKDPKPFLFGEDQGRYVV----------CISPENQDLVMSEANNKNIPLRYLGKVS 703 + I + GED R+ + PEN+ +++ A + + + +G+V+ Sbjct: 237 ALPISPALREAAEGEDALRWALSGGEDYELLFTFPPENRGALLAAAGHLGVGVTIIGRVT 296 Query: 704 G 704 Sbjct: 297 E 297 Score = 28.3 bits (64), Expect = 7.2 Identities = 28/110 (25%), Positives = 46/110 (41%), Gaps = 18/110 (16%) Query: 66 HVIQGPGENAGVVDIGGGDCVVFKME-----SHNHPSYIEPYQ-G--AATGVGGILRDIF 117 G G++A ++ G +V + H P + P G A V L D+ Sbjct: 19 SRELGIGDDAALLGPPPGQRLVVSTDMLVEGVHFRPDWSSPEDLGYKALA-VN--LSDLA 75 Query: 118 TMGARPVAAMNSLRFGAIHHPK--TKHLLSGVVAGIAGYSNSFGVPTVGG 165 MGARP A + +L A+ PK + L + G+ ++ +G +GG Sbjct: 76 AMGARPAAFLLAL---AL--PKDLDEAWLEALADGLFELADRYGAELIGG 120 >gnl|CDD|172310 PRK13772, PRK13772, formimidoylglutamase; Provisional. Length = 314 Score = 31.3 bits (71), Expect = 0.91 Identities = 21/73 (28%), Positives = 28/73 (38%), Gaps = 23/73 (31%) Query: 436 DAGVIRVEG--------------------HETKALAFSSDVTPRYVKADPFEGTKQAVAE 475 DAGV R +G H ALA + DV E + A+AE Sbjct: 51 DAGVRRNQGRAGAAHGPREIRRALAGVPAHGLPALADAGDVV---CDDGDLESAQAALAE 107 Query: 476 CWRNIIATGAKPL 488 ++A GA+PL Sbjct: 108 VVAEVLAAGARPL 120 >gnl|CDD|181525 PRK08655, PRK08655, prephenate dehydrogenase; Provisional. Length = 437 Score = 31.1 bits (71), Expect = 1.2 Identities = 19/50 (38%), Positives = 24/50 (48%), Gaps = 4/50 (8%) Query: 642 TISSAKGMDIIL-PIEIEKDP---KPFLFGEDQGRYVVCISPENQDLVMS 687 S KG +IL P E +P K F E +G V+ SPE D +MS Sbjct: 122 RTPSLKGQVVILTPTEKRSNPWFDKVKNFLEKEGARVIVTSPEEHDRIMS 171 >gnl|CDD|151104 pfam10567, Nab6_mRNP_bdg, RNA-recognition motif. This conserved domain is found in fungal proteins and appears to be involved in RNA-processing. It binds to poly-adenylated RNA, interacts genetically with mRNA 3'-end processing factors, copurifies with the nuclear cap-binding protein Cbp20p, and is found in complexes containing other translation factors, such as EIF4G. Length = 309 Score = 30.6 bits (69), Expect = 1.4 Identities = 12/36 (33%), Positives = 19/36 (52%), Gaps = 4/36 (11%) Query: 663 PFLFGEDQGRYVVCISPENQDLVMSEANNKNIPLRY 698 PFL RY++ E+ D++ ++ KN PL Y Sbjct: 162 PFLDNSGNKRYIL----ESVDIINADERRKNFPLNY 193 >gnl|CDD|177653 PLN00014, PLN00014, light-harvesting-like protein 3; Provisional. Length = 250 Score = 30.6 bits (69), Expect = 1.5 Identities = 15/48 (31%), Positives = 23/48 (47%), Gaps = 9/48 (18%) Query: 331 LNKWGLHFSIIGITTNDKLFRVIHRGEEVANLPIKALSDEAPEYDRAW 378 K L ++ G+ L R + E++ L K L+DEA YD+ W Sbjct: 196 FCKTLLFVTVAGVL----LIR---KNEDLDTL--KKLADEATFYDKQW 234 >gnl|CDD|177996 PLN02368, PLN02368, alanine transaminase. Length = 407 Score = 29.8 bits (67), Expect = 2.7 Identities = 15/33 (45%), Positives = 20/33 (60%), Gaps = 1/33 (3%) Query: 139 KTKHLLSGVVAGIAGYSNSFGVPTVGGEV-EFL 170 + KH LS G+ YS+S G+P V EV EF+ Sbjct: 88 RAKHYLSLTSGGLGAYSDSRGLPGVRKEVAEFI 120 >gnl|CDD|183274 PRK11678, PRK11678, putative chaperone; Provisional. Length = 450 Score = 29.8 bits (68), Expect = 2.8 Identities = 10/16 (62%), Positives = 11/16 (68%), Gaps = 2/16 (12%) Query: 77 VVDIGGG--DCVVFKM 90 VVDIGGG DC + M Sbjct: 213 VVDIGGGTTDCSMLLM 228 >gnl|CDD|179492 PRK02888, PRK02888, nitrous-oxide reductase; Validated. Length = 635 Score = 29.6 bits (67), Expect = 2.8 Identities = 19/60 (31%), Positives = 26/60 (43%), Gaps = 24/60 (40%) Query: 55 KWL---------RTLPTTGKHVIQGP--GENAGVVDIGGGDCVVFKME-SHNHPSYIEPY 102 KWL R LP GP EN ++DI G KM+ H+ P++ EP+ Sbjct: 444 KWLVSLNKFSKDRFLPV-------GPLHPENDQLIDISGD-----KMKLVHDGPTFAEPH 491 >gnl|CDD|178287 PLN02683, PLN02683, pyruvate dehydrogenase E1 component subunit beta. Length = 356 Score = 28.6 bits (64), Expect = 6.1 Identities = 14/34 (41%), Positives = 20/34 (58%), Gaps = 2/34 (5%) Query: 640 EMTISSAKGMDIILPIEIEKDPKPFLFGEDQGRY 673 EMT+ A ++ L E+ DPK F+ GE+ G Y Sbjct: 26 EMTVRDA--LNSALDEEMSADPKVFIMGEEVGEY 57 >gnl|CDD|151053 pfam10493, Rod_C, Rough deal protein C-terminal region. Rod, the Rough deal protein, displays a dynamic intracellular staining pattern, localising first to kinetochores in pro-metaphase, but moving to kinetochore microtubules at metaphase. Early in anaphase the protein is once again restricted to the kinetochores, where it persists until the end of telophase. This behaviour is in all respects similar to that described for ZW10, and indeed the two proteins function together, localisation of each depending upon the other. These two proteins are found at the kinetochore in complex with a third, Zwilch, in both flies and humans. The C-terminus is the most conserved part of the protein. During pro-metaphase, the ZW10-Rod complex, dynein/dynactin, and Mad2 all accumulate on unattached kinetochores; microtubule capture leads to Mad2 depletion as it is carried off by dynein/dynactin; ZW10-Rod complex accumulation continues, replenishing kinetochore dynein. The continuing recruitment of the ZW10-Rod complex during metaphase may serve to maintain adequate dynein/dynactin complex on kinetochores for assisting chromatid movement during anaphase. The ZW10-Rod complex acts as a bridge whose association with Zwint-1 links Mad1 and Mad2, components that are directly responsible for generating the diffusible 'wait anaphase' signal, to a structural, inner kinetochore complex containing Mis12 and KNL-1AF15q14, the last of which has been proved to be essential for kinetochore assembly in C. elegans. Removal of ZW10 or Rod inactivates the mitotic checkpoint. Length = 555 Score = 28.2 bits (63), Expect = 7.1 Identities = 14/47 (29%), Positives = 17/47 (36%), Gaps = 3/47 (6%) Query: 370 EAPEYDRAWCEPITIP---PSNPQLLCSDEDCTKSLLKLLSSPNLSS 413 E + RAW I P S P+ D KSL+ L P Sbjct: 436 ELEGFSRAWRSVILAPFRSASRPRSPDQDATLYKSLVLLQKCPVSHD 482 >gnl|CDD|180758 PRK06931, PRK06931, diaminobutyrate--2-oxoglutarate aminotransferase; Provisional. Length = 459 Score = 28.2 bits (63), Expect = 7.9 Identities = 19/51 (37%), Positives = 24/51 (47%), Gaps = 13/51 (25%) Query: 177 IIVNTFAAGIAKTNAIFS-------------SKARGIGLPLVYLGAKTGRD 214 +IV+ AG A+T +F+ SKA G GLPL LG K D Sbjct: 262 LIVDEVQAGFARTGKMFAFEHAGIEPDIIVMSKAVGGGLPLAVLGIKKEFD 312 >gnl|CDD|168300 PRK05934, PRK05934, type III secretion system protein; Validated. Length = 341 Score = 28.3 bits (63), Expect = 8.0 Identities = 13/29 (44%), Positives = 20/29 (68%) Query: 305 EMMLSESQERMLMILNPEKQHKAQEILNK 333 E+M ES E++ +IL+ KA+E+LNK Sbjct: 299 EIMQKESPEKIALILSYLDPKKAEELLNK 327 >gnl|CDD|163233 TIGR03361, VI_Rhs_Vgr, type VI secretion system Vgr family protein. Members of this protein family belong to the Rhs element Vgr protein family (see TIGR01646), but furthermore all are found in genomes with type VI secretion loci. However, members of this protein family, although recognizably correlated to type VI secretion according the partial phylogenetic profiling algorithm, are often found far the type VI secretion locus. Length = 513 Score = 27.9 bits (63), Expect = 9.0 Identities = 15/47 (31%), Positives = 20/47 (42%), Gaps = 2/47 (4%) Query: 116 IFTMGARPVAAMNSLRF--GAIHHPKTKHLLSGVVAGIAGYSNSFGV 160 +FT+ P AA+N HH + +L AGY NSF Sbjct: 294 LFTLSGHPRAALNREYLVVSVHHHGRQPQVLEESGGSGAGYRNSFQC 340 >gnl|CDD|178105 PLN02487, PLN02487, zeta-carotene desaturase. Length = 569 Score = 27.8 bits (62), Expect = 9.9 Identities = 8/22 (36%), Positives = 14/22 (63%) Query: 709 IHNILNIPVATLQTKYESWFPE 730 I+ ++ +PV T+Q +Y W E Sbjct: 378 IYKLVGVPVVTVQLRYNGWVTE 399 Database: CddB Posted date: Feb 4, 2011 9:54 PM Number of letters in database: 5,994,473 Number of sequences in database: 21,608 Lambda K H 0.318 0.136 0.407 Gapped Lambda K H 0.267 0.0787 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 21608 Number of Hits to DB: 12,074,724 Number of extensions: 794944 Number of successful extensions: 1502 Number of sequences better than 10.0: 1 Number of HSP's gapped: 1453 Number of HSP's successfully gapped: 56 Length of query: 737 Length of database: 5,994,473 Length adjustment: 101 Effective length of query: 636 Effective length of database: 3,812,065 Effective search space: 2424473340 Effective search space used: 2424473340 Neighboring words threshold: 11 Window for multiple hits: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.7 bits) S2: 61 (27.2 bits)