RPS-BLAST 2.2.22 [Sep-27-2009]
Database: CddB
21,608 sequences; 5,994,473 total letters
Searching..................................................done
Query= gi|254780970|ref|YP_003065383.1|
phosphoribosylformylglycinamidine synthase II [Candidatus Liberibacter
asiaticus str. psy62]
(737 letters)
>gnl|CDD|179249 PRK01213, PRK01213, phosphoribosylformylglycinamidine synthase II;
Provisional.
Length = 724
Score = 1181 bits (3059), Expect = 0.0
Identities = 384/726 (52%), Positives = 477/726 (65%), Gaps = 7/726 (0%)
Query: 9 EILTAHGLTKDECDQIIHILKRQPTLTEIGIISAMWNEHCSYKSSKKWLRTLPTTGKHVI 68
E+ GLT DE ++I IL R+P TE+G+ S MW+EHCSYKSSK LR PT G V+
Sbjct: 6 ELYAEMGLTDDEYERIREILGREPNFTELGMFSVMWSEHCSYKSSKPLLRKFPTKGPRVL 65
Query: 69 QGPGENAGVVDIGGGDCVVFKMESHNHPSYIEPYQGAATGVGGILRDIFTMGARPVAAMN 128
QGPGENAGVVDIG G VVFK+ESHNHPS +EPYQGAATGVGGILRDIF+MGARP+A ++
Sbjct: 66 QGPGENAGVVDIGDGQAVVFKIESHNHPSAVEPYQGAATGVGGILRDIFSMGARPIALLD 125
Query: 129 SLRFGAIHHPKTKHLLSGVVAGIAGYSNSFGVPTVGGEVEFLPCYNNNIIVNTFAAGIAK 188
SLRFG + HPKT++LL GVVAGI GY N GVPTVGGEV F YN N +VN G+ +
Sbjct: 126 SLRFGELDHPKTRYLLEGVVAGIGGYGNCIGVPTVGGEVYFDESYNGNPLVNAMCVGLVR 185
Query: 189 TNAIFSSKARGIGLPLVYLGAKTGRDGIGGASMASEEFGENIAKKRPTVQVGDPFTGKCL 248
+ I +KA G+G P+VY+GAKTGRDGIGGAS AS E E +KRP VQVGDPF K L
Sbjct: 186 HDDIVLAKASGVGNPVVYVGAKTGRDGIGGASFASAELSEESEEKRPAVQVGDPFMEKLL 245
Query: 249 LEACLELMNTDAVIAIQDMGAAGLTCSAIEMGNQGNLGITLDLDKVPTCEEGMTAYEMML 308
+EACLEL+ T V+ IQDMGAAGLTCS+ EM +G LGI LDLDKVP EEGMT YE+ML
Sbjct: 246 IEACLELIKTGLVVGIQDMGAAGLTCSSSEMAAKGGLGIELDLDKVPLREEGMTPYEIML 305
Query: 309 SESQERMLMILNPEKQHKAQEILNKWGLHFSIIGITTNDKLFRVIHRGEEVANLPIKALS 368
SESQERML+++ P K+ + I KW L ++IG T+D RV H GE VA++P +AL+
Sbjct: 306 SESQERMLLVVKPGKEEEVLAIFEKWDLDAAVIGEVTDDGRLRVYHHGEVVADVPAEALA 365
Query: 369 DEAPEYDRAWCEPITIPPSNPQLLCSDEDCTKSLLKLLSSPNLSSRRWVYEQYDTMIQSN 428
DEAP YDR + EP + ED ++LLKLLSSPN++S+ WVYEQYD +Q+N
Sbjct: 366 DEAPVYDRPYKEPAYLDELQAD----PEDLKEALLKLLSSPNIASKEWVYEQYDHEVQTN 421
Query: 429 SIQLPGGDAGVIRVEGHETKALAFSSDVTPRYVKADPFEGTKQAVAECWRNIIATGAKPL 488
++ PGGDA V+R+ G K LA ++D PRYV DP+EG K AVAE RN+ A GA PL
Sbjct: 422 TVVKPGGDAAVLRIRG-GGKGLALTTDCNPRYVYLDPYEGAKLAVAEAARNLAAVGATPL 480
Query: 489 AITDNLNFGNPEKEEIMGQFVHSVKGIREACQILDFPIVSGNVSFYNETNGQSIFPTPTI 548
AITD LNFGNPEK E+M QFV +V+G+ +AC+ L P+V GNVS YNET G +I+PTP I
Sbjct: 481 AITDCLNFGNPEKPEVMWQFVEAVRGLADACRALGTPVVGGNVSLYNETGGTAIYPTPVI 540
Query: 549 AGVGILPDYSLMTRIDSAHEGDLILMIGNDGCHLDCSMYSLECASSNIGPPPKVDCHLEK 608
VG++ D S T EGDLI ++G L S Y G PPKVD EK
Sbjct: 541 GMVGLIDDVSKRTTSGFKKEGDLIYLLGETKDELGGSEYLKVIHGHVGGRPPKVDLEAEK 600
Query: 609 NHGFFVLSMINAKKITACHDISTGGLIITLAEMTISSAKGMDIILPIEIEKDPKPFLFGE 668
V I +T+ HD+S GGL + LAEM I+ G ++ L + P LF E
Sbjct: 601 RLQELVREAIREGLVTSAHDVSEGGLAVALAEMAIAGGLGAEVDLSDGL--RPDALLFSE 658
Query: 669 DQGRYVVCISPENQDLVMSEANNKNIPLRYLGKVSGSTLSIHNILNIPVATLQTKYESWF 728
QGRYVV + PEN++ + A +P +G V G L + + L+ +E
Sbjct: 659 SQGRYVVSVPPENEEAFEALAEAAGVPATRIGVVGGDALKVKGNDTESLEELREAWEGAL 718
Query: 729 PEFISE 734
P +
Sbjct: 719 PRLLGG 724
>gnl|CDD|162510 TIGR01736, FGAM_synth_II, phosphoribosylformylglycinamidine
synthase II. Phosphoribosylformylglycinamidine synthase
is a single, long polypeptide in most Proteobacteria and
eukarotes. Three proteins are required in Bacillus
subtilis and many other species. This is the longest of
the three and is designated PurL,
phosphoribosylformylglycinamidine synthase II, or FGAM
synthase II.
Length = 715
Score = 962 bits (2488), Expect = 0.0
Identities = 346/721 (47%), Positives = 452/721 (62%), Gaps = 10/721 (1%)
Query: 16 LTKDECDQIIHILKRQPTLTEIGIISAMWNEHCSYKSSKKWLRTLPTTGKHVIQGPGENA 75
L+ +E + I IL R+P TE+ + SAMW+EHCSYKSSKK L+ PT G +VIQGPGE+A
Sbjct: 1 LSDEEMELIREILGREPNDTELAMFSAMWSEHCSYKSSKKLLKQFPTKGPNVIQGPGEDA 60
Query: 76 GVVDIGGGDCVVFKMESHNHPSYIEPYQGAATGVGGILRDIFTMGARPVAAMNSLRFGAI 135
GVVDIG G VVFKMESHNHPS IEPY GAATGVGGILRDI +MGARP+A ++SLRFG +
Sbjct: 61 GVVDIGDGYAVVFKMESHNHPSAIEPYNGAATGVGGILRDILSMGARPIALLDSLRFGPL 120
Query: 136 HHPKTKHLLSGVVAGIAGYSNSFGVPTVGGEVEFLPCYNNNIIVNTFAAGIAKTNAIFSS 195
PK ++L GVVAGI+ Y N GVPTVGGEVEF YN N +VN G+ + + I +
Sbjct: 121 DDPKNRYLFEGVVAGISDYGNRIGVPTVGGEVEFDESYNGNPLVNVMCVGLVRKDDIVTG 180
Query: 196 KARGIGLPLVYLGAKTGRDGIGGASMASEEFGENIAK-KRPTVQVGDPFTGKCLLEACLE 254
KA+G G LV +G KTGRDGIGGA+ ASEE E + RP VQVGDPFT K L+EA LE
Sbjct: 181 KAKGPGNKLVLVGGKTGRDGIGGATFASEELSEEAEEEDRPAVQVGDPFTEKLLIEATLE 240
Query: 255 LMNTDAVIAIQDMGAAGLTCSAIEMGNQGNLGITLDLDKVPTCEEGMTAYEMMLSESQER 314
++T V I+D+GAAGLT ++ EM +G LG + LDKVP E GMT YE+MLSESQER
Sbjct: 241 AVDTGLVKGIKDLGAAGLTSASSEMAAKGGLGAEIYLDKVPLREPGMTPYEIMLSESQER 300
Query: 315 MLMILNPEKQHKAQEILNKWGLHFSIIGITTNDKLFRVIHRGEEVANLPIKALSDEAPEY 374
ML+++ PE + EI K+ L S+IG T++ R+ ++GE VA+LPI+ L+D APEY
Sbjct: 301 MLLVVAPEDVEEVLEIFEKYELPASVIGEVTDEGRIRLYYKGEVVADLPIELLAD-APEY 359
Query: 375 DRAWCEPITIPPSNPQLLCSDEDCTKSLLKLLSSPNLSSRRWVYEQYDTMIQSNSIQLPG 434
+R P P + D + LK+LSSPN++S+ WVY QYD +Q+ ++ PG
Sbjct: 360 ERPSEPP--KYPEEEKEPEPPADLEDAFLKVLSSPNIASKEWVYRQYDHEVQTRTVVKPG 417
Query: 435 GDAGVIRVEGHETKALAFSSDVTPRYVKADPFEGTKQAVAECWRNIIATGAKPLAITDNL 494
DA V+R++ LA ++D PRYV DP+ G AVAE +RN+ A GA+PLA D L
Sbjct: 418 EDAAVLRIKETGKLGLALTADCNPRYVYLDPYAGAAGAVAEAYRNLAAVGAEPLAAVDCL 477
Query: 495 NFGNPEKEEIMGQFVHSVKGIREACQILDFPIVSGNVSFYNETNGQSIFPTPTIAGVGIL 554
NFGNPE+ E+ QFV +VKG+ +AC+ L P+V GNVS YNETNG I PTPTI VG++
Sbjct: 478 NFGNPERPEVYWQFVEAVKGLGDACRALGTPVVGGNVSLYNETNGVPIAPTPTIGMVGLV 537
Query: 555 PDYSLMTRIDSAHEGDLILMIGNDGCHLDCSMYSLECASSNIGPPPKVDCHLEKNHGFFV 614
D + + EGD I +IG L S Y G P VD EK V
Sbjct: 538 EDVEKLLTSNFKKEGDAIYLIGETKDELGGSEYLRVIHGIVSGQVPAVDLEEEKELADAV 597
Query: 615 LSMINAKKITACHDISTGGLIITLAEMTISSAKGMDIILPIEIEKDPKPFLFGEDQGRYV 674
I A ++A HD+S GGL + LAEM +S G ++ + P LF E GR +
Sbjct: 598 REAIRAGLVSAAHDVSRGGLAVALAEMAAASGIGAEVDIDEIASARPDELLFSESNGRAI 657
Query: 675 VCISPENQDLVMSEANNKNIPLRYLGKVSGSTLSI---HNILNIPVATLQTKYESWFPEF 731
V + E +K +P + +GK G L+I + +++ V L+ +E PE+
Sbjct: 658 VAVPEEKA---EEAVKSKGVPAKVIGKTGGDRLTIKTGDDTISVSVKELRDAWEEALPEY 714
Query: 732 I 732
+
Sbjct: 715 M 715
>gnl|CDD|184499 PRK14090, PRK14090, phosphoribosylformylglycinamidine synthase II;
Provisional.
Length = 601
Score = 391 bits (1006), Expect = e-109
Identities = 204/534 (38%), Positives = 299/534 (55%), Gaps = 45/534 (8%)
Query: 25 IHILK----RQPTLTEIGIISAMWNEHCSYKSSKKWLRTLPTTGKHVIQGPGENAGVVDI 80
++IL+ R+PT E+ S MW+EHC Y +KK++R LP TG NAGVV++
Sbjct: 4 LNILEEKLGREPTFVELQAFSVMWSEHCGYSHTKKYIRRLPKTGF------EGNAGVVNL 57
Query: 81 GGGDCVVFKMESHNHPSYIEPYQGAATGVGGILRDIFTMGARPVAAMNSLRFGAIHHPKT 140
+ FK+ESHNHPS IEPY GAATGVGGI+RD+ MGARP A +SL I
Sbjct: 58 DDYYSIAFKIESHNHPSAIEPYNGAATGVGGIIRDVLAMGARPTAIFDSLHMSRI----- 112
Query: 141 KHLLSGVVAGIAGYSNSFGVPTVGGEVEFLPCYNNNIIVNTFAAGIAKTNAIFSSKARGI 200
+ G++ GIA Y NS GVPTVGGE+ Y +N +VN AAG+ + + + SKA
Sbjct: 113 ---IDGIIEGIADYGNSIGVPTVGGELRISSLYAHNPLVNVLAAGVVRNDMLVDSKASRP 169
Query: 201 GLPLVYLGAKTGRDGIGGASMASEEFGENIAKKRPTVQVGDPFTGKCLLEACLELMNTDA 260
G +V G TGRDGI GAS ASE+ A K ++QVGDPF K L+EA LE++
Sbjct: 170 GQVIVIFGGATGRDGIHGASFASEDLTGEKATKL-SIQVGDPFAEKMLIEAFLEMVEEGL 228
Query: 261 VIAIQDMGAAGLTCSAIEMGNQGNLGITLDLDKVPTCEEGMTAYEMMLSESQERMLMILN 320
V QD+GA G+ + E+ +G LG + LD+VP E M +E+++SESQERM ++ +
Sbjct: 229 VEGAQDLGAGGVLSATSELVAKGGLGAIVHLDRVPLREPDMEPWEILISESQERMAVVTS 288
Query: 321 PEKQHKAQEILNKWGLHFSIIGITTNDKLFRVIHRGEEVANLPIKALSDEAPEYDRAWCE 380
PEK + EI K L I+ +D ++RV++R + V +P++ L+ APE +
Sbjct: 289 PEKASRILEIAKKHLLFGDIVAEVIDDPIYRVMYRDDLVMEVPVQLLA-NAPEEEIVEYT 347
Query: 381 PITIPPSNPQLLCSDEDCTKSLLKLLSSPNLSSRRWVYEQYDTMIQSNSIQLPGGDAGVI 440
P I P ++ + + R V+EQYD M+ ++++ PG A V+
Sbjct: 348 PGEI-PEFKRVEFEEVNA----------------REVFEQYDHMVGTDTVLPPGFGAAVM 390
Query: 441 RVEGHETKALAFSS--DVTPRYVKADPFEGTKQAVAECWRNIIATGAKPLAITDNLNFGN 498
R++ +L S D+ D + GT AV E R ++ GA+PLAIT+ +N+G+
Sbjct: 391 RIKRDGGYSLVTHSRADLA----LQDTYWGTFIAVLESVRKTLSVGAEPLAITNCVNYGD 446
Query: 499 PEKEEIMGQFVHSVKGIREACQILDFPIVSGNVSFYNETNGQSIFPTPTIAGVG 552
P+ + +G + +++AC+ P+ SGN S YN G+ I PT + +G
Sbjct: 447 PD-VDPVG-LSAMMTALKDACEFSGVPVASGNASLYNTYQGKPIPPTLVVGMLG 498
Score = 46.8 bits (111), Expect = 2e-05
Identities = 71/332 (21%), Positives = 128/332 (38%), Gaps = 50/332 (15%)
Query: 431 QLP----GGDAGVIRVEGHETKALAFSSDVTPRYVKADPFEGTKQAVAECWRNIIATGAK 486
+LP G+AGV+ ++ + + A S P + +P+ G V R+++A GA+
Sbjct: 42 RLPKTGFEGNAGVVNLDDYYSIAFKIESHNHPSAI--EPYNGAATGVGGIIRDVLAMGAR 99
Query: 487 PLAITDNLNFGNPEKEEIMGQFVHS-VKGIREACQILDFPIVSGN--VSFYNETNGQSIF 543
P AI D+L+ M + + ++GI + + P V G +S N
Sbjct: 100 PTAIFDSLH---------MSRIIDGIIEGIADYGNSIGVPTVGGELRISSLYAHN----- 145
Query: 544 PTPTIAGVGILPDYSLMTRIDSAHEGDLILMIGN----DGCHLDCSMYSLECA------- 592
P + G++ + L+ ++ G +I++ G DG H S S +
Sbjct: 146 PLVNVLAAGVVRNDMLVDS-KASRPGQVIVIFGGATGRDGIH-GASFASEDLTGEKATKL 203
Query: 593 SSNIGPPPKVDCHLEKNHGFFVLSMINAKKITACHDISTGGLIITLAEMTISSAKGMDII 652
S +G P EK L M+ + D+ GG++ +E+ G +
Sbjct: 204 SIQVGDP-----FAEKMLIEAFLEMVEEGLVEGAQDLGAGGVLSATSELVAKGGLGAIVH 258
Query: 653 L---PI-EIEKDPKPFLFGEDQGRYVVCISPENQDLVMSEANNKNIPLRYLGKV--SGST 706
L P+ E + +P L E Q R V SPE ++ A + + +V
Sbjct: 259 LDRVPLREPDMEPWEILISESQERMAVVTSPEKASRILEIAKKHLLFGDIVAEVIDDPIY 318
Query: 707 LSIHN---ILNIPVATLQTKYESWFPEFISES 735
++ ++ +PV L E E+
Sbjct: 319 RVMYRDDLVMEVPVQLLANAPEEEIVEYTPGE 350
>gnl|CDD|130796 TIGR01735, FGAM_synt, phosphoribosylformylglycinamidine synthase,
single chain form. This model represents a
single-molecule form of
phosphoribosylformylglycinamidine synthase, also called
FGAM synthase, an enzyme of purine de novo biosynthesis.
This form is found mostly in eukaryotes and
Proteobacteria. In Bacillus subtilis PurL (FGAM synthase
II) and PurQ (FGAM synthase I), homologous to different
parts of this model, perform the equivalent function;
the unrelated small protein PurS is also required and
may be a third subunit.
Length = 1310
Score = 149 bits (378), Expect = 2e-36
Identities = 165/685 (24%), Positives = 282/685 (41%), Gaps = 103/685 (15%)
Query: 86 VVFKMESHNHPSYIEPYQGAATGVGGILRDIFTM--GARPVAAMNSLRFGAIHHPKTKH- 142
++ K+E+HNHP+ I P+ GA+TG GG +RD GA+P A + ++ P +
Sbjct: 291 ILMKVETHNHPTAIAPFPGASTGAGGEIRDEGATGRGAKPKAGLTGFCVSNLNIPGLEQP 350
Query: 143 -----------------LLSGVVAGIAGYSNSFGVPTVGG------EVEFLPC-----YN 174
++ + G A ++N FG P + G LP Y+
Sbjct: 351 WEDPFQKPERIASPLDIMIEAPL-GAAAFNNEFGRPNLLGYFRTFELKASLPGGQVRGYH 409
Query: 175 NNIIVNTFAAGIAKTNAIFSSKAR-GIGLPLVYLGAKTGRDGIGGASMASEEFGENIAK- 232
I+ A GI +A K G L+ LG G+GG + +S G N A
Sbjct: 410 KPIM---LAGGIGSIDAEHIQKGEIEPGALLIVLGGPAMLIGLGGGAASSMVSGTNTADL 466
Query: 233 KRPTVQVGDP-FTGKC--LLEACLELMNTDAVIAIQDMGAAGLTCSAIEMGNQGNLGITL 289
+VQ G+P +C +++ C +L + +I+I D+GA GL+ + E+ + G G +
Sbjct: 467 DFASVQRGNPEMERRCQEVIDRCWQLGEKNPIISIHDVGAGGLSNALPELIHDGGRGAVI 526
Query: 290 DLDKVPTCEEGMTAYEMMLSESQERMLMILNPEKQHKAQEILNKWGLHFSIIGITTNDKL 349
DL VP + G++ E+ +ESQER ++++ E I + F+++G T D
Sbjct: 527 DLRAVPLDDPGLSPLEIWCNESQERYVLLVRAENLEIFTAICERERCPFAVVGTATGDGR 586
Query: 350 FRVI---------------HRGEEVANLPIKALSDEAPEYDRAWCEPITIPPSN-PQLLC 393
++ H +LP++ L + P+ R P P +
Sbjct: 587 LTLVDDTPVRRNGQGDAPSHFPNNPVDLPLEVLLGKMPKMTR---FVQRKAPMLQPLDIP 643
Query: 394 SDEDCTKSLLKLLSSPNLSSRRWVYEQYDTMI-----QSNSI---QLPGGDAGVIRVEGH 445
D ++L ++L P ++S+R++ D + + + Q P D V
Sbjct: 644 PGLDLHEALERVLRLPAVASKRFLITIGDRSVGGLVARDQMVGPWQTPLADVAVT-AASF 702
Query: 446 ET---KALAFSSDVTPRYVKA--DPFEGTKQAVAECWRNIIATGAKPLA-ITDNLNF--- 496
+T +A+A R KA DP + AV E N+ A L+ + + N+
Sbjct: 703 DTYTGEAMAIGE----RPPKALLDPKASARLAVGEAITNLAAALVGDLSDVKLSANWMAA 758
Query: 497 -GNPEKEEIMGQFVHSVKGIREACQILDFPIVSGNVSFYNETNGQSIFPTPT-------- 547
G+P ++ + +VK + E C L I G S +T Q T +
Sbjct: 759 AGHPGEDAALYD---AVKAVSELCPALGIAIPVGKDSLSMKTRWQDNGETKSVTAPGSLV 815
Query: 548 IAGVGILPDYSLMTRIDSAH---EGDLILMIGNDGCH-LDCSMYSLECASSNIGPPPKVD 603
I+ +PD D H + L+L+ G + L S + + G P +D
Sbjct: 816 ISAFAPVPDVRKTVTPDLKHDKGDSHLLLVDLGPGKNRLGGSALA-QVFGQLGGDCPDLD 874
Query: 604 CHLEKNHGFF--VLSMINAKKITACHDISTGGLIITLAEMTISSAKGMDIILPIEIEKDP 661
E+ FF + ++ + A HD S GGL+ TL EM + G+D+ L +
Sbjct: 875 -DPERLKAFFAVMQGLVAEGLLLAYHDRSDGGLVTTLLEMAFAGHCGLDVDLDALGDSL- 932
Query: 662 KPFLFGEDQGRYVVCISPENQDLVM 686
LF E+ G V+ ++ + V+
Sbjct: 933 FAVLFNEELG-AVIQVAKPDLAAVL 956
Score = 39.8 bits (93), Expect = 0.003
Identities = 39/170 (22%), Positives = 73/170 (42%), Gaps = 16/170 (9%)
Query: 204 LVYLGAKTGRDGIGGASMASEEFGENIAKKRPTVQVGDPFTGKCLLEACLELMNTDAVIA 263
L+ + G++ +GG+++A + FG + P + DP K L+ ++A
Sbjct: 842 LLLVDLGPGKNRLGGSALA-QVFG-QLGGDCPDLD--DPERLKAFFAVMQGLVAEGLLLA 897
Query: 264 IQDMGAAGLTCSAIEMGNQGNLGITLDLDKVPTCEEGMTAYEMMLSESQERMLMILNPEK 323
D GL + +EM G+ G+ +DLD + G + + ++ + E + ++ K
Sbjct: 898 YHDRSDGGLVTTLLEMAFAGHCGLDVDLDAL-----GDSLFAVLFN---EELGAVIQVAK 949
Query: 324 QHKAQ--EILNKWGLHFSIIGIT--TNDKLFRVIHRGEEVANLPIKALSD 369
A E+L GL I+GI T + R+ G + + L D
Sbjct: 950 PDLAAVLELLRAAGLTALILGIGTPTGHPMIRISVNGATLLSEKRSELRD 999
>gnl|CDD|130916 TIGR01857, FGAM-synthase, phosphoribosylformylglycinamidine
synthase, clade II. This model represents a
single-molecule form of
phosphoribosylformylglycinamidine synthase, also called
FGAM synthase, an enzyme of purine de novo biosynthesis.
This model represents a second clade of these enzymes
found in Clostridia, Bifidobacteria and Streptococcus
species. This enzyme performs the fourth step in IMP
biosynthesis (the precursor of all purines) from PRPP.
Length = 1239
Score = 142 bits (359), Expect = 3e-34
Identities = 169/690 (24%), Positives = 287/690 (41%), Gaps = 83/690 (12%)
Query: 78 VDIGGGD---CVVFKMESHNHPSYIEPYQGAATGVGGILRDIF------------TMGAR 122
VD+ G ++FK E+HNHP+ IEP+ GAAT +GG +RD T
Sbjct: 303 VDVDGVKEPWLLMFKNETHNHPTEIEPFGGAATCIGGAIRDPLSGRSYVYQAMRVTGAGD 362
Query: 123 PVAAMNSLRFGAIHHPKTKHLLSGVVAGIAGYSNSFGVPTVGGEVE--FLPCY-NNNIIV 179
P ++ G + P+ K + + G + Y N G+ T G+V + P Y + V
Sbjct: 363 PTVPISETLKGKL--PQRK-ITTTAAHGYSSYGNQIGLAT--GQVSEIYHPGYVAKRMEV 417
Query: 180 NTFAAGIAKTNAIFSSKARGIGLPLVYLGAKTGRDGIGGASMASEEFG-ENIAKKRPTVQ 238
A K N + G + L LG KTGRDGIGGA+ +S+E E++ VQ
Sbjct: 418 GAVVAATPKENVVREKPEPGDVIIL--LGGKTGRDGIGGATGSSKEHTVESLELCGAEVQ 475
Query: 239 VGDPFTGKCLLEACLELMNTDAVIAIQ---DMGAAGLTCSAIEMGNQGNLGITLDLDKVP 295
G+ + + N + I+ D GA G++ + E+ + G+ +DL+KVP
Sbjct: 476 KGNAPEERKIQRL---FRNGNVTRLIKKCNDFGAGGVSVAIGELAD----GLEIDLNKVP 528
Query: 296 TCEEGMTAYEMMLSESQERMLMILNPEKQHKAQEILNKWGLHFSIIGITTNDKLFRVIHR 355
EG+ E+ +SESQERM ++++PE N+ L +++ T +
Sbjct: 529 KKYEGLNGTELAISESQERMAVVVSPEDVDAFLAYCNEENLEATVVATVTEKPRLVMNWN 588
Query: 356 GEEVANLPIKALSDEA--PEYDRAWCEPITIPPSNPQLLCSDEDCTKSLLKLLSSPNLSS 413
G+ + +L + L D + P Q S E + LK+LS N++S
Sbjct: 589 GKTIVDLSRRFLDTNGVRQVIDAKVVDKDVKLPEERQKT-SAETLEEDWLKVLSDLNVAS 647
Query: 414 RRWVYEQYDTMIQSNSIQLPGG--------DAGVIR--VEGHETK-ALAFSSDVTPRYVK 462
++ + E++D+ + + ++ +P G +A V + V G ET A A + P +
Sbjct: 648 QKGLQERFDSSVGAGTVLMPLGGKYQLTPTEASVAKLPVLGGETHTASAIAWGFNPYIAE 707
Query: 463 ADPFEGTKQAVAECWRNIIATGA--KPLAITDNLNFGNPEKE-EIMGQFVHSVKGIREAC 519
P+ G AV E ++A GA K ++ F +K+ E G+ ++ G +A
Sbjct: 708 WSPYHGAAYAVIESLAKLVAAGADYKKARLSFQEYFEKLDKDAERWGKPFAALLGAIKAQ 767
Query: 520 QILDFPIVSGNVSFYNETNGQSIFPTPTIAGVGILPDYSLMTRIDSAHEGDLILMIGNDG 579
L P + G S ++ PT V R+ S G
Sbjct: 768 IDLGLPAIGGKDSMSGTFEELTVPPTLISFAVTTAN----SRRVISPEFKA----AGE-- 817
Query: 580 CHLDCSMYSLECASSNIGPPPKVDCHLEKNHGFFVLSMINAKKITACHDISTGGLIITLA 639
++Y + + G D L K + + +I K+ + + GG+ +LA
Sbjct: 818 -----NIYLIPGQALEDGTI---DFDLLKENFAQIEELIADHKVVSASAVKYGGVAESLA 869
Query: 640 EMTISSAKGMDIILPIEIEKDPKPFLFGEDQGRYVVCISPENQDLVMSEANNKNIPLRYL 699
+MT + G ++ P + LF G ++ SPE + E + ++
Sbjct: 870 KMTFGNRIGAELNNPELED------LFTAQYGSFIF-ESPEELSIANVEKIGQTTA-DFV 921
Query: 700 GKVSGSTLSIHNILNIPVATLQTKYESWFP 729
KV+G L + + + + K E FP
Sbjct: 922 LKVNGEKLDLEEL----ESAWEGKLEEVFP 947
Score = 31.3 bits (71), Expect = 0.98
Identities = 9/19 (47%), Positives = 13/19 (68%)
Query: 30 RQPTLTEIGIISAMWNEHC 48
R PT TEI ++ W++HC
Sbjct: 204 RNPTETEIKVLDTYWSDHC 222
>gnl|CDD|179999 PRK05297, PRK05297, phosphoribosylformylglycinamidine synthase;
Provisional.
Length = 1290
Score = 93.3 bits (233), Expect = 2e-19
Identities = 107/375 (28%), Positives = 176/375 (46%), Gaps = 63/375 (16%)
Query: 89 KMESHNHPSYIEPYQGAATGVGGILRDIFTM----GARPVAA----MNS-LRF-GAI--- 135
K+E+HNHP+ I P+ GAATG GG +RD G++P A S LR G
Sbjct: 288 KVETHNHPTAISPFPGAATGSGGEIRD--EGATGRGSKPKAGLTGFSVSNLRIPGFEQPW 345
Query: 136 --HHPKTKHLLSGVVA---------GIAGYSNSFGVPTVGG-----EVEFLPCYNNNIIV 179
+ K + + S A G A ++N FG P + G E + ++N V
Sbjct: 346 EEDYGKPERIAS---ALDIMIEGPLGGAAFNNEFGRPNLLGYFRTFEQKV---NSHNEEV 399
Query: 180 NTF------AAGIAKTNAIFSSKAR-GIGLPLVYLGAKTGRDGIGGA---SMASEEFGEN 229
+ A GI A K +G L+ LG R G+GG SMAS + E+
Sbjct: 400 RGYHKPIMLAGGIGNIRADHVQKGEIPVGAKLIVLGGPAMRIGLGGGAASSMASGQSSED 459
Query: 230 I--AKKRPTVQVGDP-FTGKC--LLEACLELMNTDAVIAIQDMGAAGLTCSAI-EMGNQG 283
+ A +VQ G+P +C +++ C +L + + +++I D+GA GL +A E+ N G
Sbjct: 460 LDFA----SVQRGNPEMERRCQEVIDRCWQLGDDNPILSIHDVGAGGL-SNAFPELVNDG 514
Query: 284 NLGITLDLDKVPTCEEGMTAYEMMLSESQERMLMILNPEKQHKAQEILNKWGLHFSIIGI 343
G DL K+P E GM+ E+ +ESQER ++ + PE + I + F+++G
Sbjct: 515 GRGGRFDLRKIPNDEPGMSPLEIWCNESQERYVLAIAPEDLELFEAICERERCPFAVVGE 574
Query: 344 TTNDKLFRVI--HRGEEVANLPIKALSDEAPEYDRAWCEPITIPPSNPQLLCSDEDCTKS 401
T ++ + H + +LP+ L + P+ R + T+ P L S D ++
Sbjct: 575 ATEERHLTLEDSHFDNKPVDLPLDVLLGKPPKMHR---DVKTVKAKGPALDYSGIDLAEA 631
Query: 402 LLKLLSSPNLSSRRW 416
+ ++L P ++S+ +
Sbjct: 632 VERVLRLPTVASKSF 646
Score = 56.0 bits (136), Expect = 3e-08
Identities = 32/121 (26%), Positives = 56/121 (46%), Gaps = 11/121 (9%)
Query: 611 GFF--VLSMINAKKITACHDISTGGLIITLAEMTISSAKGMDIILPIEIEKDPKPFLFGE 668
GFF + +++ + A HD S GGL+ TLAEM + G+DI L + D LF E
Sbjct: 861 GFFNAIQALVAEGLLLAYHDRSDGGLLTTLAEMAFAGHCGLDIDLD-ALGDDALAALFNE 919
Query: 669 DQGRYVVCISPENQDLVMS--EANNKNIPLRYLGKV-SGSTLSIH----NILNIPVATLQ 721
+ G V+ + ++D V + + + + +GK +G + I + + L+
Sbjct: 920 ELG-AVIQVRAADRDAVEAILAEHGLSDCVHVIGKPNAGDRIVITRNGKTVFSESRTELR 978
Query: 722 T 722
Sbjct: 979 R 979
Score = 39.8 bits (94), Expect = 0.002
Identities = 32/138 (23%), Positives = 49/138 (35%), Gaps = 12/138 (8%)
Query: 235 PTVQVGDPFTGKCLLEACLELMNTDAVIAIQDMGAAGLTCSAIEMGNQGNLGITLDLDKV 294
P V D K A L+ ++A D GL + EM G+ G+ +DLD +
Sbjct: 851 PDVD--DAEDLKGFFNAIQALVAEGLLLAYHDRSDGGLLTTLAEMAFAGHCGLDIDLDAL 908
Query: 295 PTCEEGMTAYEMMLSESQERMLMILNPEKQHKA-QEILNKWGL--HFSIIGITTNDKLFR 351
G A + +E E +I A + IL + GL +IG
Sbjct: 909 -----GDDALAALFNE--ELGAVIQVRAADRDAVEAILAEHGLSDCVHVIGKPNAGDRIV 961
Query: 352 VIHRGEEVANLPIKALSD 369
+ G+ V + L
Sbjct: 962 ITRNGKTVFSESRTELRR 979
>gnl|CDD|177603 PHA03366, PHA03366, FGAM-synthase; Provisional.
Length = 1304
Score = 82.8 bits (205), Expect = 3e-16
Identities = 64/264 (24%), Positives = 104/264 (39%), Gaps = 30/264 (11%)
Query: 461 VKADPFEGTKQAVAEC-----------WRNIIATGAKPLAITDNLNFGNPEKEEIMGQFV 509
V+ DP G K A+ E +I T L++T + E+
Sbjct: 702 VQLDPILGAKYAIVEALTNLMLAPVANLEDITIT----LSVT--WPPTDQAASELY---- 751
Query: 510 HSVKGIREACQIL--DFPIVSGNVSFYNETNGQSIFPTPTIAGVGILPDYSLMTRI--DS 565
++ +E C+ L +F S + S + Q TI P S R+ D
Sbjct: 752 RALAACKEFCRELGVNFTFTSASSSPRQDQPPQPGPLFNTIVFTASAPVPSSTPRLTPDL 811
Query: 566 AHEGDLILMIG-NDGCHLDCSMYSLECASSNIGPPPKVDCHLEKNHGFFVLSMINAKKIT 624
G ++ + + L S++ G P + KN V +I+ +
Sbjct: 812 KKPGSALVHLSISPEYTLAGSVFEQIFGLK-SGTLPDISPSYLKNLFRAVQHLISEGLVV 870
Query: 625 ACHDISTGGLIITLAEMTISSAKGMDIILPIEIEKDPKPFLFGEDQGRYVVCISPENQDL 684
+ HD+S GGLI LAEM ++ +G+ I +P DP FLF E G V+ + P +
Sbjct: 871 SGHDVSDGGLIACLAEMALAGGRGVTITVPAGE--DPLQFLFSETPG-VVIEVPPSHLSA 927
Query: 685 VMSEANNKNIPLRYLGKVSGSTLS 708
V++ ++NI +G V S S
Sbjct: 928 VLTRLRSRNIICYPIGTVGPSGPS 951
Score = 50.0 bits (120), Expect = 2e-06
Identities = 25/115 (21%), Positives = 40/115 (34%), Gaps = 10/115 (8%)
Query: 248 LLEACLELMNTDAVIAIQDMGAAGLTCSAIEMGNQGNLGITLDLDKVPTCEEGMTAYEMM 307
L A L++ V++ D+ GL EM G G+T+ VP G + +
Sbjct: 856 LFRAVQHLISEGLVVSGHDVSDGGLIACLAEMALAGGRGVTIT---VP---AGEDPLQFL 909
Query: 308 LSESQERMLMILNPEKQHKAQEILNKWGLHFSIIGITTN---DKLFRVIHRGEEV 359
SE+ ++ + P L + IG F V H G +
Sbjct: 910 FSETPGVVIEV-PPSHLSAVLTRLRSRNIICYPIGTVGPSGPSNTFSVSHNGTVL 963
>gnl|CDD|178745 PLN03206, PLN03206, phosphoribosylformylglycinamidine synthase;
Provisional.
Length = 1307
Score = 69.0 bits (169), Expect = 4e-12
Identities = 99/423 (23%), Positives = 169/423 (39%), Gaps = 86/423 (20%)
Query: 85 CVVFKMESHNHPSYIEPYQGAATGVGGILRDIFT--MGARPVAA-----MNSLRFGAIHH 137
++ E+HN P + PY GA TG GG +RD G+ VA + +LR +
Sbjct: 287 DILLTAETHNFPCAVAPYPGAETGAGGRIRDTHATGRGSFVVAGTAGYCVGNLRIEGSYA 346
Query: 138 PKTKH-------------LLSGVVAGIAGYSNSFGVPTVGGEVE-F---LPC-----YNN 175
P +L G + Y N FG P + G F LP +
Sbjct: 347 PWEDSSFVYPSNLASPLQILIDASNGASDYGNKFGEPLIQGYTRTFGMRLPNGERREWLK 406
Query: 176 NIIVNTFAAGIAKTNAIFSSKARG-IGLPLVYLGAKTGRDGIGGASMASEEFGENIAK-K 233
I+ F+ GI + + +K IG+ +V +G R G+GG + +S G+N A+
Sbjct: 407 PIM---FSGGIGQIDHTHLTKGEPDIGMLVVKIGGPAYRIGMGGGAASSMVSGQNDAELD 463
Query: 234 RPTVQVGDPFTGKCL---LEACLELMNTDAVIAIQDMGAAGLTCSA----IEMGNQGNLG 286
VQ GD + L + AC+E+ + +++I D GA G C+ I G
Sbjct: 464 FNAVQRGDAEMSQKLYRVVRACVEMGEDNPIVSIHDQGAGG-NCNVVKEIIY-----PKG 517
Query: 287 ITLDLDKVPTCEEGMTAYEMMLSESQERMLMILNPEKQHKAQEILNKWGLHFSIIG-ITT 345
+D+ V + ++ E+ +E QE+ +++ PE + Q I ++ ++IG I
Sbjct: 518 AEIDIRAVVVGDHTLSVLEIWGAEYQEQDALLIKPESRDLLQSICDRERCSMAVIGTIDG 577
Query: 346 NDKLFRVIHRGEEVANLPIKALSDEAP------------------EYDRA--WCEPITIP 385
+ ++ V+ L P E+ R EP+ IP
Sbjct: 578 SGRV--VLVDSAAPEKCEANGLPPPPPAVDLDLEKVLGDMPQKTFEFKRVANKLEPLDIP 635
Query: 386 PSNPQLLCSDEDCTKSLLKLLSSPNLSSRRWVYEQYDTMI-----QSNSI---QLPGGDA 437
P +L ++L P++ S+R++ + D + Q ++ Q+P D
Sbjct: 636 PG--------ITVMDALKRVLRLPSVCSKRFLTTKVDRCVTGLVAQQQTVGPLQIPLADV 687
Query: 438 GVI 440
VI
Sbjct: 688 AVI 690
Score = 63.2 bits (154), Expect = 2e-10
Identities = 31/94 (32%), Positives = 49/94 (52%), Gaps = 6/94 (6%)
Query: 614 VLSMINAKKITACHDISTGGLIITLAEMTISSAKGMDIILPIEIEKDPKPF--LFGEDQG 671
+I + I+A HDIS GGL++TL EM + G+++ LP F LF E+ G
Sbjct: 873 TQDLIAKRLISAGHDISDGGLVVTLLEMAFAGNCGINVDLP---SSGHSAFETLFAEELG 929
Query: 672 RYVVCISPENQDLVMSEANNKNIPLRYLGKVSGS 705
V+ +S +N D VM + + +G+V+ S
Sbjct: 930 -LVLEVSRKNLDAVMEKLAAAGVTAEVIGQVTAS 962
Score = 40.5 bits (95), Expect = 0.001
Identities = 39/166 (23%), Positives = 73/166 (43%), Gaps = 10/166 (6%)
Query: 204 LVYLGAKTGRDGIGGASMASEEFGENIAKKRPTVQVGDPFTGKCLLEACLELMNTDAVIA 263
L+++ G+ +GG+++A + + + I P + D K EA +L+ + A
Sbjct: 829 LLHVDLGKGKRRLGGSALA-QAYDQ-IGDDCPDLD--DVAYLKKAFEATQDLIAKRLISA 884
Query: 264 IQDMGAAGLTCSAIEMGNQGNLGITLDLDKVPTCEEGMTAYEMMLSESQERMLMILNPEK 323
D+ GL + +EM GN GI +DL G +A+E + +E + +++ ++ +
Sbjct: 885 GHDISDGGLVVTLLEMAFAGNCGINVDLP-----SSGHSAFETLFAE-ELGLVLEVSRKN 938
Query: 324 QHKAQEILNKWGLHFSIIGITTNDKLFRVIHRGEEVANLPIKALSD 369
E L G+ +IG T L V G + +L D
Sbjct: 939 LDAVMEKLAAAGVTAEVIGQVTASPLIEVKVDGATCLSEKTASLRD 984
>gnl|CDD|162512 TIGR01739, tegu_FGAM_synt, herpesvirus tegument
protein/v-FGAM-synthase. This model describes a family
of large proteins of herpesvirues. The protein is
described variably as tegument protein or
phosphoribosylformylglycinamidine synthase
(FGAM-synthase). Most of the length of the protein shows
homology to eukaryotic FGAM-synthase. Functional
characterizations were not verified during construction
of this model.
Length = 1202
Score = 65.9 bits (161), Expect = 4e-11
Identities = 60/298 (20%), Positives = 115/298 (38%), Gaps = 37/298 (12%)
Query: 458 PRYVKADPFEGTKQAVAEC-----------WRNIIATGAKPLAITDNLNFGNPEKEEIMG 506
V+ADP G A+ E ++I T + + TD+ +
Sbjct: 603 TYKVQADPKRGATYAITEALLNLSLSPWNTLEDVIITLSVTWSPTDH------VYSLLK- 655
Query: 507 QFVHSVKGIREACQILDFPIVSGNVSFYNETNGQSIFPTPTIAGVGILPDYSLMTRI--D 564
+++ ++ C+ L + + + T P +I P +I D
Sbjct: 656 ---DALRACKDFCEELGVSFTVTSAA-SSPTQDSGSAPFMSIVFSASCPVLLSAKKITPD 711
Query: 565 SAHEGDLILMIG-NDGCHLDCSMYSLECASSNIGPPPKVDCHLEKNHGFFVLSMINAKKI 623
G ++ + + L S++ + + P + K + +++ I
Sbjct: 712 LKSHGSHLIWLSLHPSYTLAGSIFE-QILGLSFIRLPALSPVSLKKLLSALQTLVKEGVI 770
Query: 624 TACHDISTGGLIITLAEMTISSAKGMDIILPIEIEKDPKPFLFGEDQGRYVVCISPENQD 683
+ HD+S GGL+ +AEM +S KG+ I LP DP FL E G V+ + P +
Sbjct: 771 VSGHDVSDGGLVACVAEMALSGGKGVRITLP-HGT-DPLEFLCSETPG-VVIEVDPSSMY 827
Query: 684 LVMSEANNKNIPLRYLGKVSG----STLSI-HN---ILNIPVATLQTKYESWFPEFIS 733
V+ ++ + + +G+V T S+ HN + P++ LQ + S+ E +
Sbjct: 828 AVLQFLRSEGLVFQVIGRVGESGPSPTFSVVHNSTVLFQEPLSLLQGTWRSFSDEENT 885
Score = 42.8 bits (101), Expect = 4e-04
Identities = 30/145 (20%), Positives = 54/145 (37%), Gaps = 10/145 (6%)
Query: 226 FGENIAKKRPTVQVGDPFTGKCLLEACLELMNTDAVIAIQDMGAAGLTCSAIEMGNQGNL 285
F + + + P + K LL A L+ +++ D+ GL EM G
Sbjct: 735 FEQILGLSFIRLPALSPVSLKKLLSALQTLVKEGVIVSGHDVSDGGLVACVAEMALSGGK 794
Query: 286 GITLDLDKVPTCEEGMTAYEMMLSESQERMLMILNPEKQHKAQEILNKWGLHFSIIGITT 345
G+ + L E + SE+ +++ ++P + + L GL F +IG
Sbjct: 795 GVRITLPHGT------DPLEFLCSETPG-VVIEVDPSSMYAVLQFLRSEGLVFQVIGRVG 847
Query: 346 N---DKLFRVIHRGEEVANLPIKAL 367
F V+H + P+ L
Sbjct: 848 ESGPSPTFSVVHNSTVLFQEPLSLL 872
>gnl|CDD|132311 TIGR03267, methan_mark_2, putative methanogenesis marker protein 2.
A single member of this protein family is found in each
of the first ten complete genome sequences of archaeal
methanogens, and nowhere else. Sequence similarity to
various bacterial proteins is reflected in Pfam models
pfam00586 and pfam02769, AIR synthase related protein
N-terminal and C-terminal domains, respectively. The
functions of proteins in this family are unknown, but
their role is likely one essential to methanogenesis.
Length = 323
Score = 58.3 bits (141), Expect = 8e-09
Identities = 68/302 (22%), Positives = 115/302 (38%), Gaps = 49/302 (16%)
Query: 61 PTTGKHVIQGPGENAGVVDIGGGDCVVFK--------MESHNHPSYIEPYQGAATGVGGI 112
T +VI G++A + IGG D ++ +++ +P+ V
Sbjct: 33 VTYEGNVIVDFGDDAAAIKIGGDDILLLAADGIWGKLLDA-------DPWWAGYCAVLVN 85
Query: 113 LRDIFTMGARPVAAMNSLRFGAIHHPKT-KHLLSGVVAGIAGYSNSFGVPTVGGEVEFLP 171
+ DI MG +PV +N L +I+ + +L G+ G FGVP VGG
Sbjct: 86 VNDIAAMGGKPVGMVNVL---SINDVDVCREVLEGMREGA----WKFGVPVVGGHTHPDT 138
Query: 172 CYNNNIIVNTFAAGIAKTNAIFSSKARGIGLPLVYLGAKTGRDGIGGASMASEEFGENIA 231
YN ++ GIAK + I S G +++ GR
Sbjct: 139 PYN---ALDVAIVGIAKEDCIIRSDTAKPGDLIIFAIDLDGR----------------PY 179
Query: 232 KKRP----TVQVGDPFTGKCLLEACLELMNTDAVIAIQDMGAAGLTCSAIEMGNQGNLGI 287
P T + P + ++A +E+ V A +D+ GL + + +G
Sbjct: 180 PSFPLNWDTTTMKSPDYLRAQMDAVVEIAERKLVKAGKDISNPGLIGTLGMLLEASRVGA 239
Query: 288 TLDLDKVPTCEE-GMTAYEMMLSESQERMLMILNPEKQHKAQEILNKWGLHFSIIGITTN 346
+DL+ +P E+ M + M S ++ +PE + +L GL S+IG
Sbjct: 240 EVDLESIPKPEDVDMVTWLKMYPGSG--FVLTADPENVREIVRVLEDAGLTASVIGEVIE 297
Query: 347 DK 348
D
Sbjct: 298 DG 299
Score = 57.5 bits (139), Expect = 1e-08
Identities = 61/253 (24%), Positives = 104/253 (41%), Gaps = 36/253 (14%)
Query: 417 VYEQYDTMIQSNSIQLPGGDAGVIRVEGHETKALAFSSD-VTPRYVKADPFEGTKQAVAE 475
+ E D + N I G DA I++ G L ++D + + + ADP+ AV
Sbjct: 27 ILEPLDVTYEGNVIVDFGDDAAAIKIGG--DDILLLAADGIWGKLLDADPWWAGYCAVLV 84
Query: 476 CWRNIIATGAKPLAITDNLNFGNPEK-EEIMGQFVHSVKGIREACQILDFPIVSGNVSFY 534
+I A G KP+ + + L+ + + E++ +G+RE P+V G+
Sbjct: 85 NVNDIAAMGGKPVGMVNVLSINDVDVCREVL-------EGMREGAWKFGVPVVGGHTHPD 137
Query: 535 NETNGQSIFPTPTIAGVGILPDYSLMTRIDSAHEGDLILMIGNDGCHLDCSMYS---LEC 591
N + I G+ + R D+A GDLI+ LD Y L
Sbjct: 138 TPYNALDV----AIVGIA---KEDCIIRSDTAKPGDLIIF----AIDLDGRPYPSFPLNW 186
Query: 592 ASSNIGPPPKVDCHLEKNHGFFVLSMINAKKI-TACHDISTGGLIITLAEMTISSAKGMD 650
++ + P + ++ + I +K+ A DIS GLI TL + +S G +
Sbjct: 187 DTTTMKSPDYLRAQMD------AVVEIAERKLVKAGKDISNPGLIGTLGMLLEASRVGAE 240
Query: 651 IILPIEIEKDPKP 663
+++E PKP
Sbjct: 241 ----VDLESIPKP 249
>gnl|CDD|162715 TIGR02124, hypE, hydrogenase expression/formation protein HypE.
This family contains HypE (or HupE), a protein required
for expression of catalytically active hydrogenase in
many systems. It appears to be an accessory protein
involved in maturation rather than a regulatory protein
involved in expression. HypE shows considerable homology
to the thiamine-monophosphate kinase ThiL (TIGR01379)
and other enzymes.
Length = 320
Score = 49.2 bits (118), Expect = 4e-06
Identities = 69/274 (25%), Positives = 110/274 (40%), Gaps = 49/274 (17%)
Query: 436 DAGVIRVEGHETKALAFSSD---VTPRYVKADPFEGT---KQAVAECWRNIIATGAKPLA 489
DA V+ + G LAFS+D V P + F G K AV ++ +GAKPL
Sbjct: 23 DAAVLELSG---GRLAFSTDSFVVDPLF-----FPGGDIGKLAVCGTVNDVAVSGAKPLY 74
Query: 490 ITDNL----NFGNPEKEEIMGQFVHSVKGIREACQILDFPIVSGNVSFYNETNGQSIFPT 545
++ F + E I VK + EA + IV+G+ + IF
Sbjct: 75 LSCGFILEEGFPIEDLERI-------VKSMAEAARKAGVKIVTGDTKVVEKGKADGIFIN 127
Query: 546 PTIAGVGILPDYSLMTRIDSAHEGDLILM---IGNDGCHLDCSMYSLECASSNIGPPPKV 602
T G+G++P + + GD I++ IG+ G + L +
Sbjct: 128 TT--GIGVIP-SGIPISAHNLQPGDKIIVSGTIGDHGAAILAVREGLG-----FETNLES 179
Query: 603 DCHLEKNHGFFVLSMINA-KKITACHDISTGGLIITLAEMTISSAKGMDI---ILPIEIE 658
DC V +++NA + A D + GGL L E +S G+ I +P++ E
Sbjct: 180 DCAPLNG---LVETLLNAGPAVHAMRDATRGGLAAVLNEWAQASGVGIVIEEEKIPVKEE 236
Query: 659 KDPKPFLFGED------QGRYVVCISPENQDLVM 686
L G D +G+ V+ + PE + V+
Sbjct: 237 VKGACELLGLDPLYLANEGKLVLAVPPEAAEKVL 270
Score = 36.1 bits (84), Expect = 0.030
Identities = 72/307 (23%), Positives = 119/307 (38%), Gaps = 41/307 (13%)
Query: 65 KHVIQGPGENAGVVDIGGGDCVVFKMESHNHPSYIEPY--QGAATG---VGGILRDIFTM 119
+ I E+A V+++ GG + F +S ++P G G V G + D+
Sbjct: 14 GNEILAAMEDAAVLELSGGR-LAFSTDSF----VVDPLFFPGGDIGKLAVCGTVNDVAVS 68
Query: 120 GARPVAAMNSLRFGAIHHPKTKHLLSGVVAGIAGYSNSFGVPTVGGEVEFLPCYN-NNII 178
GA+P+ + L +V +A + GV V G+ + + + I
Sbjct: 69 GAKPLYLSCGF---ILEEGFPIEDLERIVKSMAEAARKAGVKIVTGDTKVVEKGKADGIF 125
Query: 179 VNTFAAGIAKTNAIFSSKARGIGLPLVYLGAKTGRDGIGGASMASE-EFGENIAKKRPTV 237
+NT G+ + S+ G ++ G G G ++ F N+
Sbjct: 126 INTTGIGVIPSGIPISAHNLQPGDKIIVSGT-IGDHGAAILAVREGLGFETNL------- 177
Query: 238 QVGD--PFTG--KCLLEACLELMNTDAVIAIQDMGAAGLTCSAIEMGNQGNLGITLDLDK 293
D P G + LL A AV A++D GL E +GI ++ +K
Sbjct: 178 -ESDCAPLNGLVETLLNAGP------AVHAMRDATRGGLAAVLNEWAQASGVGIVIEEEK 230
Query: 294 VPTCEEGMTAYEMM----LSESQERML-MILNPEKQHKAQEIL--NKWGLHFSIIGITTN 346
+P EE A E++ L + E L + + PE K EIL + +G +IIG
Sbjct: 231 IPVKEEVKGACELLGLDPLYLANEGKLVLAVPPEAAEKVLEILKSHPYGKDAAIIGEVVE 290
Query: 347 DKLFRVI 353
K RV+
Sbjct: 291 RKEGRVV 297
>gnl|CDD|162083 TIGR00878, purM, phosphoribosylaminoimidazole synthetase. This
enzyme is found as a homodimeric monofunctional protein
in prokaryotes and as part of a larger, multifunctional
protein, sometimes with two copies of this enzyme in
tandem, in eukaryotes.
Length = 332
Score = 41.2 bits (97), Expect = 0.001
Identities = 30/116 (25%), Positives = 50/116 (43%), Gaps = 18/116 (15%)
Query: 472 AVAECWRNIIATGAKPLAITDNLNFGNPEKEEIMGQFVHSVKGIREACQILDFPIVSGNV 531
VA +++ GA+PL D L G + E+ Q VKGI E C+ +V G
Sbjct: 81 LVAMNVNDLLVQGAEPLFFLDYLAVGKLD-PEVASQI---VKGIAEGCKQAGCALVGGET 136
Query: 532 S----FYNETNGQSIFPTPTIAG--VGILPDYSLMTRIDSAHEGDLILMIGNDGCH 581
+ Y + +AG VG++ ++T GD+++ +G+ G H
Sbjct: 137 AEMPGMYRGGHYD-------LAGTAVGVVEKDEIITGEKVK-PGDVLIGLGSSGIH 184
Score = 30.0 bits (68), Expect = 2.4
Identities = 37/155 (23%), Positives = 53/155 (34%), Gaps = 42/155 (27%)
Query: 62 TTGKHVIQGPGENAGVVDIGGGDCVVFKMESHNHPSYIEPYQGAAT-GVGGILR------ 114
T V+ G G AG+ D+G Y EP + T GVG L
Sbjct: 27 TRRPEVMGGLGGFAGLFDLGDK--------------YKEPVLVSGTDGVGTKLLVAEAMN 72
Query: 115 ---------------DIFTMGARPVAAMNSLRFGAIHHPKTKHLLSGVVAGIAGYSNSFG 159
D+ GA P+ ++ L G + + S +V GIA G
Sbjct: 73 KHDTIGIDLVAMNVNDLLVQGAEPLFFLDYLAVGKL----DPEVASQIVKGIAEGCKQAG 128
Query: 160 VPTVGGEVEFLP-CYNNNII-VNTFAAGIAKTNAI 192
VGGE +P Y + A G+ + + I
Sbjct: 129 CALVGGETAEMPGMYRGGHYDLAGTAVGVVEKDEI 163
>gnl|CDD|162329 TIGR01379, thiL, thiamine-monophosphate kinase. Proteins scoring
between the trusted and noise cutoff for this model
include short forms from the Thermoplasmas (which lack
the N-terminal region) and a highly derived form from
Campylobacter jejuni. Eukaryotes lack this enzyme, and
add pyrophosphate from ATP to unphosphorylated thiamine
in a single step.
Length = 317
Score = 38.5 bits (90), Expect = 0.006
Identities = 35/146 (23%), Positives = 56/146 (38%), Gaps = 21/146 (14%)
Query: 433 PGGDAGVIRVEGHETKALAFSSD--VTPRYVKAD--PFEGTKQAVAECWRNIIATGAKPL 488
G DA + V E + L ++D V + D P + +AVA ++ A GA P
Sbjct: 23 IGDDAAL--VSAPEGRDLVLTTDTLVEGVHFPPDTTPEDLGWKAVAVNLSDLAAMGATPK 80
Query: 489 AITDNLNFGNPEK--EEIMGQFVHSVKGIREACQILDFPIVSGNVSFYNETNGQSIFPTP 546
L+ G P E + F G+ E + P+V G+ + +
Sbjct: 81 WFL--LSLGLPSDLDEAWLEAFY---DGLFECAKQYGVPLVGGDT---VSSPELVV---- 128
Query: 547 TIAGVGILPDYSLMTRIDSAHEGDLI 572
T+ +G P + R A GDL+
Sbjct: 129 TVTAIGEAPKGRALLR-SGAKPGDLV 153
Score = 35.4 bits (82), Expect = 0.057
Identities = 62/251 (24%), Positives = 92/251 (36%), Gaps = 41/251 (16%)
Query: 61 PTTGKHVIQGPGENAGVVDIGGGDCVVFK---MESHNH-PSYIEPYQGAATGVGGILRDI 116
V G G++A +V G +V + H P P V L D+
Sbjct: 13 LVQDPDVALGIGDDAALVSAPEGRDLVLTTDTLVEGVHFPPDTTPEDLGWKAVAVNLSDL 72
Query: 117 FTMGARPVAAMNSLRFGAIHHPKTKHLLSGVVAGIAGYSNSFGVPTVGGEVEFLPCYNNN 176
MGA P + SL + + L G+ + +GVP VGG+ P
Sbjct: 73 AAMGATPKWFLLSL---GLPSDLDEAWLEAFYDGLFECAKQYGVPLVGGDTVSSP----E 125
Query: 177 IIVNTFAAGIAKTN-AIFSSKARGIGLPLVYLGAKTGRDGIGGASMA---------SEEF 226
++V A G A A+ S A+ G LV++ TG G A +A EE
Sbjct: 126 LVVTVTAIGEAPKGRALLRSGAK-PG-DLVFV---TGTLGDSAAGLALLLKGKKEPDEED 180
Query: 227 GENIAKK--RPTVQVGDPFTGKCLLEACLELMNTDAVIAIQDMGAAGLTCSAIEMGNQGN 284
E + ++ RP +V A N A I + D GL +
Sbjct: 181 DEELLQRHLRPEPRVE-------EGLALAGYAN--AAIDVSD----GLASDLGHIAEASG 227
Query: 285 LGITLDLDKVP 295
+GI +DLD++P
Sbjct: 228 VGIVIDLDRLP 238
>gnl|CDD|180049 PRK05385, PRK05385, phosphoribosylaminoimidazole synthetase;
Provisional.
Length = 327
Score = 35.5 bits (83), Expect = 0.049
Identities = 18/57 (31%), Positives = 28/57 (49%), Gaps = 4/57 (7%)
Query: 473 VAECWRNIIATGAKPLAITDNLNFGNPEKEEIMGQFVHSVKGIREACQILDFPIVSG 529
VA C +++ GA+PL D + G + E+ Q VKGI E C+ ++ G
Sbjct: 82 VAMCVNDLLVQGAEPLFFLDYIATGKLD-PEVAAQV---VKGIAEGCEQAGCALIGG 134
>gnl|CDD|161898 TIGR00476, selD, selenium donor protein. In prokaryotes, the
incorporation of selenocysteine as the 21st amino acid,
encoded by TGA, requires several elements: SelC is the
tRNA itself, SelD acts as a donor of reduced selenium,
SelA modifies a serine residue on SelC into
selenocysteine, and SelB is a selenocysteine-specific
translation elongation factor. 3-prime or 5-prime
non-coding elements of mRNA have been found as probable
structures for directing selenocysteine incorporation.
This model describes SelD, known as selenophosphate
synthetase, selenium donor protein, and selenide,water
dikinase. SelD provides reduced selenium for the
selenium transferase SelA. This protein itself contains
selenocysteine in many species; any sequence not
aligning to the beginning of the model is likely have
selenocysteine residue incorrectly interpreted as a stop
codon upstream of the given sequence. The SelD protein
also provides selenophosphate for the enzyme tRNA
2-selenouridine synthase, which catalyzes a tRNA base
modification. All genomes with SelD should make
selenocysteine, selenouridine, or both.
Length = 347
Score = 35.6 bits (82), Expect = 0.051
Identities = 43/238 (18%), Positives = 77/238 (32%), Gaps = 17/238 (7%)
Query: 72 GENAGVVDIGGGDCVVFKMESHNHPSYIEPYQGAATGVGGILRDIFTMGARPVAAMNSLR 131
G++A + G +V + + P +PY L DI+ MG P+ AM L
Sbjct: 50 GDDAVIYLRHNGLSLV-QTTDYITPIVDDPYMMGRIAAANALSDIYAMGGTPIDAM--LI 106
Query: 132 FGAIHHPKTKHLLSGVVAGIAGYSNSFGVPTVGGEVEFLPCYNNNIIVNTFAAGIAKTNA 191
+ + T ++ V+ G G GG N + G+
Sbjct: 107 LLGVSNKLTIEVMREVIQGFKDACREAGTSLTGGHT----ILNPWPVFGGAVTGVCPEEE 162
Query: 192 IFSSKARGIGLPLVYLGAKTGRDGIGGASMASEEFGE----NIAKKRPTVQVGDPFTGKC 247
+ + +G L+ T G A + +I K+ + + +
Sbjct: 163 VITPSGAQVGDVLIL----TKPLGTQVAVAVHKWLDIPKDLSITKEEVELAINEAIENMA 218
Query: 248 LL-EACLELMNTDAVIAIQDMGAAGLTCSAIEMGNQGNLGITLDLDKVPTCEEGMTAY 304
L LM+T A D+ G+ A M N+ + ++ +P M
Sbjct: 219 RLNRKAAGLMHTFNAHAATDITGFGILGHAQNMAKNSNVEARIVIENLP-VLAKMEEL 275
>gnl|CDD|185502 PTZ00182, PTZ00182, 3-methyl-2-oxobutanate dehydrogenase;
Provisional.
Length = 355
Score = 35.0 bits (81), Expect = 0.076
Identities = 12/36 (33%), Positives = 19/36 (52%), Gaps = 6/36 (16%)
Query: 640 EMTISSA--KGMDIILPIEIEKDPKPFLFGEDQGRY 673
+M + A +D E+ +DPK F+ GED +Y
Sbjct: 34 KMNVREAINSALDE----ELARDPKVFVLGEDVAQY 65
>gnl|CDD|180226 PRK05731, PRK05731, thiamine monophosphate kinase; Provisional.
Length = 318
Score = 32.1 bits (74), Expect = 0.53
Identities = 67/301 (22%), Positives = 109/301 (36%), Gaps = 56/301 (18%)
Query: 433 PGGDAGVIRVEGHETKALAFSSD--VTPRYVKAD---PFE-GTKQAVAECWRN---IIAT 483
G DA ++ + L S+D V + + D P + G K A+A N + A
Sbjct: 24 IGDDAALLGPPPGQR--LVVSTDMLVEGVHFRPDWSSPEDLGYK-ALA---VNLSDLAAM 77
Query: 484 GAKPLAITDNLNFGNPEK--EEIMGQFVHSVKGIREACQILDFPIVSGNVSFYNETNGQS 541
GA+P A L P+ E + G+ E ++ G+ T G
Sbjct: 78 GARPAAFL--LALALPKDLDEAWLEAL---ADGLFELADRYGAELIGGDT-----TRGPD 127
Query: 542 IFPTPTIAGVGILPDYSLMTRIDSAHEGDLILMIGNDGCHLDCSMYSLECASSNIGPPPK 601
+ + T +G +P + R A GDL+ + G G S L + + P
Sbjct: 128 LSISVTA--IGDVPGGRALRR-SGAKPGDLVAVTGTLG----DSAAGLALLLNGLRVPDA 180
Query: 602 VDCHLEKNH-------GFFVLSMINAKKITACHDISTGGLIITLAEMTISSAKGMDIIL- 653
L H G A +A DIS GL L + +S G DI L
Sbjct: 181 DAAALISRHLRPQPRVGLG---QALAGLASAAIDISD-GLAADLGHIAEASGVGADIDLD 236
Query: 654 PIEIEKDPKPFLFGEDQGRYVV----------CISPENQDLVMSEANNKNIPLRYLGKVS 703
+ I + GED R+ + PEN+ +++ A + + + +G+V+
Sbjct: 237 ALPISPALREAAEGEDALRWALSGGEDYELLFTFPPENRGALLAAAGHLGVGVTIIGRVT 296
Query: 704 G 704
Sbjct: 297 E 297
Score = 28.3 bits (64), Expect = 7.2
Identities = 28/110 (25%), Positives = 46/110 (41%), Gaps = 18/110 (16%)
Query: 66 HVIQGPGENAGVVDIGGGDCVVFKME-----SHNHPSYIEPYQ-G--AATGVGGILRDIF 117
G G++A ++ G +V + H P + P G A V L D+
Sbjct: 19 SRELGIGDDAALLGPPPGQRLVVSTDMLVEGVHFRPDWSSPEDLGYKALA-VN--LSDLA 75
Query: 118 TMGARPVAAMNSLRFGAIHHPK--TKHLLSGVVAGIAGYSNSFGVPTVGG 165
MGARP A + +L A+ PK + L + G+ ++ +G +GG
Sbjct: 76 AMGARPAAFLLAL---AL--PKDLDEAWLEALADGLFELADRYGAELIGG 120
>gnl|CDD|172310 PRK13772, PRK13772, formimidoylglutamase; Provisional.
Length = 314
Score = 31.3 bits (71), Expect = 0.91
Identities = 21/73 (28%), Positives = 28/73 (38%), Gaps = 23/73 (31%)
Query: 436 DAGVIRVEG--------------------HETKALAFSSDVTPRYVKADPFEGTKQAVAE 475
DAGV R +G H ALA + DV E + A+AE
Sbjct: 51 DAGVRRNQGRAGAAHGPREIRRALAGVPAHGLPALADAGDVV---CDDGDLESAQAALAE 107
Query: 476 CWRNIIATGAKPL 488
++A GA+PL
Sbjct: 108 VVAEVLAAGARPL 120
>gnl|CDD|181525 PRK08655, PRK08655, prephenate dehydrogenase; Provisional.
Length = 437
Score = 31.1 bits (71), Expect = 1.2
Identities = 19/50 (38%), Positives = 24/50 (48%), Gaps = 4/50 (8%)
Query: 642 TISSAKGMDIIL-PIEIEKDP---KPFLFGEDQGRYVVCISPENQDLVMS 687
S KG +IL P E +P K F E +G V+ SPE D +MS
Sbjct: 122 RTPSLKGQVVILTPTEKRSNPWFDKVKNFLEKEGARVIVTSPEEHDRIMS 171
>gnl|CDD|151104 pfam10567, Nab6_mRNP_bdg, RNA-recognition motif. This conserved
domain is found in fungal proteins and appears to be
involved in RNA-processing. It binds to poly-adenylated
RNA, interacts genetically with mRNA 3'-end processing
factors, copurifies with the nuclear cap-binding protein
Cbp20p, and is found in complexes containing other
translation factors, such as EIF4G.
Length = 309
Score = 30.6 bits (69), Expect = 1.4
Identities = 12/36 (33%), Positives = 19/36 (52%), Gaps = 4/36 (11%)
Query: 663 PFLFGEDQGRYVVCISPENQDLVMSEANNKNIPLRY 698
PFL RY++ E+ D++ ++ KN PL Y
Sbjct: 162 PFLDNSGNKRYIL----ESVDIINADERRKNFPLNY 193
>gnl|CDD|177653 PLN00014, PLN00014, light-harvesting-like protein 3; Provisional.
Length = 250
Score = 30.6 bits (69), Expect = 1.5
Identities = 15/48 (31%), Positives = 23/48 (47%), Gaps = 9/48 (18%)
Query: 331 LNKWGLHFSIIGITTNDKLFRVIHRGEEVANLPIKALSDEAPEYDRAW 378
K L ++ G+ L R + E++ L K L+DEA YD+ W
Sbjct: 196 FCKTLLFVTVAGVL----LIR---KNEDLDTL--KKLADEATFYDKQW 234
>gnl|CDD|177996 PLN02368, PLN02368, alanine transaminase.
Length = 407
Score = 29.8 bits (67), Expect = 2.7
Identities = 15/33 (45%), Positives = 20/33 (60%), Gaps = 1/33 (3%)
Query: 139 KTKHLLSGVVAGIAGYSNSFGVPTVGGEV-EFL 170
+ KH LS G+ YS+S G+P V EV EF+
Sbjct: 88 RAKHYLSLTSGGLGAYSDSRGLPGVRKEVAEFI 120
>gnl|CDD|183274 PRK11678, PRK11678, putative chaperone; Provisional.
Length = 450
Score = 29.8 bits (68), Expect = 2.8
Identities = 10/16 (62%), Positives = 11/16 (68%), Gaps = 2/16 (12%)
Query: 77 VVDIGGG--DCVVFKM 90
VVDIGGG DC + M
Sbjct: 213 VVDIGGGTTDCSMLLM 228
>gnl|CDD|179492 PRK02888, PRK02888, nitrous-oxide reductase; Validated.
Length = 635
Score = 29.6 bits (67), Expect = 2.8
Identities = 19/60 (31%), Positives = 26/60 (43%), Gaps = 24/60 (40%)
Query: 55 KWL---------RTLPTTGKHVIQGP--GENAGVVDIGGGDCVVFKME-SHNHPSYIEPY 102
KWL R LP GP EN ++DI G KM+ H+ P++ EP+
Sbjct: 444 KWLVSLNKFSKDRFLPV-------GPLHPENDQLIDISGD-----KMKLVHDGPTFAEPH 491
>gnl|CDD|178287 PLN02683, PLN02683, pyruvate dehydrogenase E1 component subunit
beta.
Length = 356
Score = 28.6 bits (64), Expect = 6.1
Identities = 14/34 (41%), Positives = 20/34 (58%), Gaps = 2/34 (5%)
Query: 640 EMTISSAKGMDIILPIEIEKDPKPFLFGEDQGRY 673
EMT+ A ++ L E+ DPK F+ GE+ G Y
Sbjct: 26 EMTVRDA--LNSALDEEMSADPKVFIMGEEVGEY 57
>gnl|CDD|151053 pfam10493, Rod_C, Rough deal protein C-terminal region. Rod, the
Rough deal protein, displays a dynamic intracellular
staining pattern, localising first to kinetochores in
pro-metaphase, but moving to kinetochore microtubules at
metaphase. Early in anaphase the protein is once again
restricted to the kinetochores, where it persists until
the end of telophase. This behaviour is in all respects
similar to that described for ZW10, and indeed the two
proteins function together, localisation of each
depending upon the other. These two proteins are found
at the kinetochore in complex with a third, Zwilch, in
both flies and humans. The C-terminus is the most
conserved part of the protein. During pro-metaphase, the
ZW10-Rod complex, dynein/dynactin, and Mad2 all
accumulate on unattached kinetochores; microtubule
capture leads to Mad2 depletion as it is carried off by
dynein/dynactin; ZW10-Rod complex accumulation
continues, replenishing kinetochore dynein. The
continuing recruitment of the ZW10-Rod complex during
metaphase may serve to maintain adequate dynein/dynactin
complex on kinetochores for assisting chromatid movement
during anaphase. The ZW10-Rod complex acts as a bridge
whose association with Zwint-1 links Mad1 and Mad2,
components that are directly responsible for generating
the diffusible 'wait anaphase' signal, to a structural,
inner kinetochore complex containing Mis12 and
KNL-1AF15q14, the last of which has been proved to be
essential for kinetochore assembly in C. elegans.
Removal of ZW10 or Rod inactivates the mitotic
checkpoint.
Length = 555
Score = 28.2 bits (63), Expect = 7.1
Identities = 14/47 (29%), Positives = 17/47 (36%), Gaps = 3/47 (6%)
Query: 370 EAPEYDRAWCEPITIP---PSNPQLLCSDEDCTKSLLKLLSSPNLSS 413
E + RAW I P S P+ D KSL+ L P
Sbjct: 436 ELEGFSRAWRSVILAPFRSASRPRSPDQDATLYKSLVLLQKCPVSHD 482
>gnl|CDD|180758 PRK06931, PRK06931, diaminobutyrate--2-oxoglutarate
aminotransferase; Provisional.
Length = 459
Score = 28.2 bits (63), Expect = 7.9
Identities = 19/51 (37%), Positives = 24/51 (47%), Gaps = 13/51 (25%)
Query: 177 IIVNTFAAGIAKTNAIFS-------------SKARGIGLPLVYLGAKTGRD 214
+IV+ AG A+T +F+ SKA G GLPL LG K D
Sbjct: 262 LIVDEVQAGFARTGKMFAFEHAGIEPDIIVMSKAVGGGLPLAVLGIKKEFD 312
>gnl|CDD|168300 PRK05934, PRK05934, type III secretion system protein; Validated.
Length = 341
Score = 28.3 bits (63), Expect = 8.0
Identities = 13/29 (44%), Positives = 20/29 (68%)
Query: 305 EMMLSESQERMLMILNPEKQHKAQEILNK 333
E+M ES E++ +IL+ KA+E+LNK
Sbjct: 299 EIMQKESPEKIALILSYLDPKKAEELLNK 327
>gnl|CDD|163233 TIGR03361, VI_Rhs_Vgr, type VI secretion system Vgr family protein.
Members of this protein family belong to the Rhs
element Vgr protein family (see TIGR01646), but
furthermore all are found in genomes with type VI
secretion loci. However, members of this protein family,
although recognizably correlated to type VI secretion
according the partial phylogenetic profiling algorithm,
are often found far the type VI secretion locus.
Length = 513
Score = 27.9 bits (63), Expect = 9.0
Identities = 15/47 (31%), Positives = 20/47 (42%), Gaps = 2/47 (4%)
Query: 116 IFTMGARPVAAMNSLRF--GAIHHPKTKHLLSGVVAGIAGYSNSFGV 160
+FT+ P AA+N HH + +L AGY NSF
Sbjct: 294 LFTLSGHPRAALNREYLVVSVHHHGRQPQVLEESGGSGAGYRNSFQC 340
>gnl|CDD|178105 PLN02487, PLN02487, zeta-carotene desaturase.
Length = 569
Score = 27.8 bits (62), Expect = 9.9
Identities = 8/22 (36%), Positives = 14/22 (63%)
Query: 709 IHNILNIPVATLQTKYESWFPE 730
I+ ++ +PV T+Q +Y W E
Sbjct: 378 IYKLVGVPVVTVQLRYNGWVTE 399
Database: CddB
Posted date: Feb 4, 2011 9:54 PM
Number of letters in database: 5,994,473
Number of sequences in database: 21,608
Lambda K H
0.318 0.136 0.407
Gapped
Lambda K H
0.267 0.0787 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 21608
Number of Hits to DB: 12,074,724
Number of extensions: 794944
Number of successful extensions: 1502
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1453
Number of HSP's successfully gapped: 56
Length of query: 737
Length of database: 5,994,473
Length adjustment: 101
Effective length of query: 636
Effective length of database: 3,812,065
Effective search space: 2424473340
Effective search space used: 2424473340
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 61 (27.2 bits)