RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy14408
(720 letters)
>gnl|CDD|235000 PRK02106, PRK02106, choline dehydrogenase; Validated.
Length = 560
Score = 210 bits (536), Expect = 1e-59
Identities = 115/293 (39%), Positives = 154/293 (52%), Gaps = 20/293 (6%)
Query: 434 FDIIIVGASAAGCVLANRLSEVSSLKVLLIEAGG-DTPIHSRIPGMSSVLSL----SEFD 488
+D II+GA +AGCVLANRLSE + VLL+EAGG D I M + L+ ++
Sbjct: 6 YDYIIIGAGSAGCVLANRLSEDPDVSVLLLEAGGPDYRWDFFIQ-MPAALAFPLQGKRYN 64
Query: 489 HAYLAEPSQFAGLGVRNARIKITAGKGLGGSSAVQNILYQRGTSYDYENFAKL-GYNGWG 547
AY EP N R++ GK LGGSS++ ++Y RG + DY+N+A+L G GW
Sbjct: 65 WAYETEPEPHMN----NRRMECPRGKVLGGSSSINGMVYIRGNAMDYDNWAELPGLEGWS 120
Query: 548 YDETLKYFVKSEDYRSVIYNESKAVHGTQGYLPVGLFKNKENNIIREIFETSAQELGYPC 607
Y + L YF K+E G G L V K N + + E + + GYP
Sbjct: 121 YADCLPYFKKAETR----DGGEDDYRGGDGPLSVTRGKPGTNPLFQAFVE-AGVQAGYPR 175
Query: 608 PKDMNDRYVDVGFAELPGMTRYGLRFSAADAYLTPIAGKRTNLYVLKRSKVTKVIINDQN 667
D+N Y GF + G R+SAA AYL P A KR NL ++ + +++ +
Sbjct: 176 TDDLNG-YQQEGFGPMDRTVTNGRRWSAARAYLDP-ALKRPNLTIVTHALTDRILF-EGK 232
Query: 668 VATGVEYVNSKGETVRVTANKEVILTAGAIANAQLLLLSGIGPKAHLDEVKIP 720
A GVEY G A +EVIL+AGAI + QLL LSGIGP HL E+ IP
Sbjct: 233 RAVGVEYER-GGGRETARARREVILSAGAINSPQLLQLSGIGPAEHLKELGIP 284
>gnl|CDD|225186 COG2303, BetA, Choline dehydrogenase and related flavoproteins
[Amino acid transport and metabolism].
Length = 542
Score = 196 bits (501), Expect = 7e-55
Identities = 103/290 (35%), Positives = 152/290 (52%), Gaps = 12/290 (4%)
Query: 434 FDIIIVGASAAGCVLANRLSEVSSLKVLLIEAGG-DTPIHSRIPGMSSVLSLSEFDHAYL 492
+D +IVG+ +AG VLA RLS+ + L VL++EAGG D ++P + L Y
Sbjct: 8 YDYVIVGSGSAGSVLAARLSD-AGLSVLVLEAGGPDRRPLIQMPAAYAFL---MNGPRYD 63
Query: 493 AEPSQFAGLGVRNARIKITAGKGLGGSSAVQNILYQRGTSYDYENFAKL-GYNGWGYDET 551
+R + GK LGGSS++ ++Y RG D++ +A+ G GW YD+
Sbjct: 64 WGFRTEPEPHLRGRELAWPRGKVLGGSSSINGMVYVRGHPEDFDAWAQESGAPGWPYDDV 123
Query: 552 LKYFVKSEDYRSVIYNESKAVHGTQGYLPVGLFKNKENNIIREIFETSAQELGYPCPKDM 611
L YF ++ED V + + HG G P+ + + N I F + ++LG+P D
Sbjct: 124 LPYFKRAEDLLGVGGQDLRTWHGGGG--PLPVSPPRSPNPIARAFIEAGEQLGFPTTPDP 181
Query: 612 NDRYVDVGFAELPGMTRYGLRFSAADAYLTPIAGKRTNLYVLKRSKVTKVIINDQNVATG 671
N + GF G R+SAA AYL P KR NL +L ++V ++++ + + A G
Sbjct: 182 NGADQE-GFGPYCVTICNGRRWSAARAYLKPAL-KRPNLTLLTGARVRRILL-EGDRAVG 238
Query: 672 VEYVNSKGETVRVT-ANKEVILTAGAIANAQLLLLSGIGPKAHLDEVKIP 720
VE G T+ A +EV+L AGAI + +LLLLSGIGP HL E I
Sbjct: 239 VEVEIGDGGTIETAVAAREVVLAAGAINSPKLLLLSGIGPADHLLEHGID 288
>gnl|CDD|233581 TIGR01810, betA, choline dehydrogenase. Choline dehydrogenase
catalyzes the conversion of exogenously supplied choline
into the intermediate glycine betaine aldehyde, as part
of a two-step oxidative reaction leading to the
formation of osmoprotectant betaine. This enzymatic
system can be found in both gram-positive and
gram-negative bacteria. As in Escherichia coli ,
Staphylococcus xylosus , and Sinorhizobium meliloti,
this enzyme is found associated in a transciptionally
co-induced gene cluster with betaine aldehyde
dehydrogenase, the second catalytic enzyme in this
reaction. Other gram-positive organisms have been shown
to employ a different enzymatic system, utlizing a
soluable choline oxidase or type III alcohol
dehydrogenase instead of choline dehydrogenase. This
enzyme is a member of the GMC oxidoreductase family
(pfam00732 and pfam05199), sharing a common
evoluntionary origin and enzymatic reaction with alcohol
dehydrogenase. Outgrouping from this model, Caulobacter
crescentus shares sequence homology with choline
dehydrogenase, yet other genes participating in this
enzymatic reaction have not currently been identified
[Cellular processes, Adaptations to atypical
conditions].
Length = 532
Score = 177 bits (451), Expect = 4e-48
Identities = 112/291 (38%), Positives = 148/291 (50%), Gaps = 19/291 (6%)
Query: 435 DIIIVGASAAGCVLANRLSEVSSLKVLLIEAGG-DTPIHSRI---PGMSSVLSLSEFDHA 490
D II+G +AG VLA RLSE S VL++EAGG D P I ++ ++
Sbjct: 1 DYIIIGGGSAGSVLAGRLSEDVSNSVLVLEAGGSDYPWDLLIQMPAALAYPAGNKRYNWI 60
Query: 491 YLAEPSQFAGLGVRNARIKITAGKGLGGSSAVQNILYQRGTSYDYENFAK-LGYNGWGYD 549
Y EP + N R+ GK LGGSS++ ++YQRG DYE +AK G W Y
Sbjct: 61 YETEPEPH----MNNRRVGHARGKVLGGSSSINGMIYQRGNPMDYEKWAKPEGMESWDYA 116
Query: 550 ETLKYFVKSEDYRSVIYNESKAVHGTQGYLPVGLFKNKENNIIREIFETSAQELGYPCPK 609
+ L Y+ + E + K G G P+ + + +N + + F + E GY
Sbjct: 117 DCLPYYKRLE----TTFGGEKPYRGHDG--PIKVRRGPADNPLFQAFIEAGVEAGYNKTP 170
Query: 610 DMNDRYVDVGFAELPGMTRYGLRFSAADAYLTPIAGKRTNLYVLKRSKVTKVIINDQNVA 669
D+N + GF + G R SAA AYL P A KR NL V R+ VTK+ + N A
Sbjct: 171 DVNG-FRQEGFGPMDSTVHNGRRVSAARAYLHP-AMKRPNLEVQTRAFVTKINF-EGNRA 227
Query: 670 TGVEYVNSKGETVRVTANKEVILTAGAIANAQLLLLSGIGPKAHLDEVKIP 720
TGVE+ G ANKEVIL+AGAI + QLL LSGIG HL E+ I
Sbjct: 228 TGVEFKKG-GRKEHTEANKEVILSAGAINSPQLLQLSGIGDAEHLRELGIE 277
>gnl|CDD|234421 TIGR03970, Rv0697, dehydrogenase, Rv0697 family. This model
describes a set of dehydrogenases belonging to the
glucose-methanol-choline oxidoreductase (GMC
oxidoreductase) family. Members of the present family
are restricted to Actinobacterial genome contexts
containing also members of families TIGR03962 and
TIGR03969 (the mycofactocin system), and are proposed to
be uniform in function.
Length = 487
Score = 142 bits (359), Expect = 5e-36
Identities = 90/294 (30%), Positives = 129/294 (43%), Gaps = 26/294 (8%)
Query: 435 DIIIVGASAAGCVLANRLSEVSSLKVLLIEAG------GDTPIHSRIPGMSSVLSLSEFD 488
D+++VG +AGCV+A RLSE S V ++EAG P + S
Sbjct: 2 DVLVVGGGSAGCVVAARLSEDPSCTVTVLEAGPGYRDPSRLPAQLTDGLRLPIGPASPVV 61
Query: 489 HAYLAEPSQFAGLGVRNARIKITAGKGLGGSSAVQNILYQRGTSYDYENFAKLGYNGWGY 548
Y + R + I G+ LGGS AV + R D++ + GW +
Sbjct: 62 WRY---GVELTDGPRRAS--AIVRGRVLGGSGAVNGGYFCRALPADFDAWPI---PGWSW 113
Query: 549 DETLKYFVKSEDYRSVIYNESKAVHGTQGYLPVGLFKNKENNIIREIFETSAQELGYPCP 608
D+ L +F E +HGT G +PV + E + I F +A G+
Sbjct: 114 DDVLPHFRAIETDLDF----DGPLHGTAGPIPVR--RTAELDGISAAFVAAALGAGFGWI 167
Query: 609 KDMNDRYVDVGF--AELPGMTRYGLRFSAADAYLTPIAGKRTNLYVLKRSKVTKVIINDQ 666
D+N D +P G R S A AYL P A KR NL V ++V +++ +
Sbjct: 168 ADLNGSGPDAPTGVGAVPLNVDGGRRVSTAVAYLLP-ALKRPNLTVEADTRVVRILFS-G 225
Query: 667 NVATGVEYVNSKGETVRVTANKEVILTAGAIANAQLLLLSGIGPKAHLDEVKIP 720
A GVE + G + A++ V+L AGA+ +A LLLLSGIGP L I
Sbjct: 226 TRAVGVEVL-GDGGPRTLRADR-VVLCAGAVESAHLLLLSGIGPAEQLRAAGIA 277
>gnl|CDD|216089 pfam00732, GMC_oxred_N, GMC oxidoreductase. This family of
proteins bind FAD as a cofactor.
Length = 218
Score = 118 bits (297), Expect = 4e-30
Identities = 68/217 (31%), Positives = 99/217 (45%), Gaps = 29/217 (13%)
Query: 503 VRNARIKITAGKGLGGSSAVQNILYQRGTSYDYENFAKL-GYNGWGYDETLKYFVKSEDY 561
V R+ + AG +GG S+V R + + +A G GWGYD+ L Y K E
Sbjct: 14 VNGRRMILPAGSTVGGGSSVNWSACIRTPAAVLDEWASEFGLEGWGYDDYLPYMDKVE-- 71
Query: 562 RSVIYNESKAVHGTQGYLPVGLFKNKENNIIREIFETSAQELGYP---CPKDMNDRYVDV 618
G G G+ ++ N + + +A+ELGYP P++ N +
Sbjct: 72 ------------GPLGVTTKGIEESPLNQALLK----AAEELGYPVEAVPRNSNGCH-YC 114
Query: 619 GFAELPGMTRYGLRFSAADAYLTPIAGKRTNLYVLKRSKVTKVIINDQN-VATGVEYVNS 677
GF L G + S A +L P + NL +L +K K+II + A GVE +
Sbjct: 115 GFCGLG--CPTGAKQSTARTWLRPALER--NLRILTGAKAEKIIILGRGGRAVGVEARDG 170
Query: 678 KGETVR-VTANKEVILTAGAIANAQLLLLSGIGPKAH 713
G R +TA KEV++ AGA+ LLL SG+G H
Sbjct: 171 GGGIKRLITAAKEVVVAAGALNTPPLLLRSGLGKNPH 207
>gnl|CDD|215420 PLN02785, PLN02785, Protein HOTHEAD.
Length = 587
Score = 85.2 bits (211), Expect = 3e-17
Identities = 88/316 (27%), Positives = 129/316 (40%), Gaps = 88/316 (27%)
Query: 434 FDIIIVGASAAGCVLANRLSEVSSLKVLLIEAGGDTPIHSRIPGMSSVLSLSEFDHAYLA 493
+D I+VG AGC LA LS+ + VLL+E GG +P ++ +S E H LA
Sbjct: 56 YDYIVVGGGTAGCPLAATLSQ--NFSVLLLERGG-------VPFGNANVSFLENFHIGLA 106
Query: 494 E-----PSQF--AGLGVRNARIKITAGKGLGGSSAVQNILYQRGTSYDYENFAKLGYNGW 546
+ SQ + GV NAR ++ LGG + + Y R ++ F +
Sbjct: 107 DTSPTSASQAFISTDGVINARARV-----LGGGTCINAGFYSRAST----RFIQKA---- 153
Query: 547 GYDETLKYFVKSEDYRSVIYNESKAVHGTQGYLPVGLFKNKENNIIREIFETSAQELGYP 606
G+D L +E Y V E + VH P
Sbjct: 154 GWDAKLV----NESYPWV---ERQIVH---------------------------WPKVAP 179
Query: 607 CPKDMNDRYVDVGFAELPGMT---------------RYGLRFSAADAYLTPIAGKRTNLY 651
+ D ++VG + G T +G R +AA+ AG L
Sbjct: 180 WQAALRDSLLEVGVSPFNGFTYDHVYGTKVGGTIFDEFGRRHTAAELL---AAGNPNKLR 236
Query: 652 VLKRSKVTKVIINDQNV---ATGVEYVNSKGETVRVTANK----EVILTAGAIANAQLLL 704
VL + V K++ + ATGV + + G + + E+IL+AGAI + Q+LL
Sbjct: 237 VLLHATVQKIVFDTSGKRPRATGVIFKDENGNQHQAFLSNNKGSEIILSAGAIGSPQMLL 296
Query: 705 LSGIGPKAHLDEVKIP 720
LSGIGPK L + KIP
Sbjct: 297 LSGIGPKKELKKHKIP 312
>gnl|CDD|185628 PTZ00449, PTZ00449, 104 kDa microneme/rhoptry antigen; Provisional.
Length = 943
Score = 50.8 bits (121), Expect = 2e-06
Identities = 51/247 (20%), Positives = 82/247 (33%), Gaps = 38/247 (15%)
Query: 136 PFNPYNPSSVFNPKNIDSVFNPRNPDSPYYNQYQFLYSPLSPYNPRSLFSSQNPLSPLSP 195
P P +P +P+ PR+ P P SP P L ++P P SP
Sbjct: 579 PEFPKDPKHPKDPEEPKKPKRPRSAQRPT--------RPKSPKLPELLDIPKSPKRPESP 630
Query: 196 LNANSPYNPRNQISYYNIENPKSPYNIHNPESPLYSDSPYYYRNLPLYNPVETELIDQYS 255
+ P P+ S E PK + P+SP P + + D Y
Sbjct: 631 KSPKRPPPPQRPSSPERPEGPKIIKSPKPPKSP----------KPPFDPKFKEKFYDDYL 680
Query: 256 TSVYNNPDSPLSI---NSPHSVLNPYNLDSILNPFNVKSPLSPENPASPFNPYNP--EPS 310
+ + ++ ++ S S+L ++ PF PL P+ P P+ P +P
Sbjct: 681 DAAAKSKETKTTVVLDESFESILKETLPETPGTPFTTPRPLPPKLPRDEEFPFEPIGDPD 740
Query: 311 IFDSHNIYSFFSPHNPYSPFNPYNIDSPF---------------NSDNPDSPFRNYPQYS 355
+I F P + F+ D+P + PD + S
Sbjct: 741 AEQPDDIEFFTPPEEERTFFHETPADTPLPDILAEEFKEEDIHAETGEPDEAMKRPDSPS 800
Query: 356 RYNPKNP 362
+ K P
Sbjct: 801 EHEDKPP 807
Score = 47.8 bits (113), Expect = 2e-05
Identities = 46/194 (23%), Positives = 72/194 (37%), Gaps = 11/194 (5%)
Query: 214 ENPKSPYNIHNPESPLYSDSPYY-YRNLPLYNPVETELIDQYSTSVYNNPDSPLSINSPH 272
E PK P + +PE P P R +P EL+D + +P P S SP
Sbjct: 580 EFPKDPKHPKDPEEPKKPKRPRSAQRPTRPKSPKLPELLD-----IPKSPKRPESPKSPK 634
Query: 273 SVLNPYNLDSILNPFNVKSPLSPENPASPFNPYNP--EPSIFDSHNIYSFFSPHNPYSPF 330
P S P K SP+ P SP P++P + +D + + S +
Sbjct: 635 RPPPPQRPSSPERPEGPKIIKSPKPPKSPKPPFDPKFKEKFYDDYLDAAAKSKETKTTVV 694
Query: 331 NPYNIDSPFNSDNPDSPFRNYPQYSRYNPK---NPESPFSPQNEDSIYNPENPNSPYNPF 387
+ +S P++P + PK + E PF P + P++ P
Sbjct: 695 LDESFESILKETLPETPGTPFTTPRPLPPKLPRDEEFPFEPIGDPDAEQPDDIEFFTPPE 754
Query: 388 NENSLYHPGNPDSP 401
E + +H D+P
Sbjct: 755 EERTFFHETPADTP 768
Score = 45.1 bits (106), Expect = 1e-04
Identities = 34/124 (27%), Positives = 47/124 (37%), Gaps = 22/124 (17%)
Query: 286 PFNVKSPLSPENPASPFNPYNPEPSIFDSHNIYSFFSPHNPYSPFNPYNIDSPFNSDNPD 345
P K P P++P P +P P+ P + P P + P D P
Sbjct: 573 PTLSKKPEFPKDPKHPKDPEEPKKPK----------RPRSAQRPTRPKSPKLPELLDIPK 622
Query: 346 SPFRNYPQYSRYNPKNPESPFSPQNEDSIYNPENPNSPYNPFNENSLYHPGNPDSPYDWQ 405
SP R +PK+P+ P PQ S PE P +P P +P P+D +
Sbjct: 623 SPKRPE------SPKSPKRPPPPQRPSSPERPEGPKIIKSP------KPPKSPKPPFDPK 670
Query: 406 RLEK 409
EK
Sbjct: 671 FKEK 674
Score = 38.9 bits (90), Expect = 0.011
Identities = 31/146 (21%), Positives = 50/146 (34%), Gaps = 12/146 (8%)
Query: 71 NPFSPINQYSPYSPYNTDSLFDNTHPESLLNPYNPESLLSPLNPRSVFYPSNPEVPSF-- 128
+P P P S PE L P +P+ SP +P+ P P P
Sbjct: 590 DPEEPKKPKRPRSAQRPTRPKSPKLPELLDIPKSPKRPESPKSPKRPPPPQRPSSPERPE 649
Query: 129 DYNNPNSPFNPYNPSSVFNPKNIDSVFNPRNPDS----------PYYNQYQFLYSPLSPY 178
SP P +P F+PK + ++ + ++ + P
Sbjct: 650 GPKIIKSPKPPKSPKPPFDPKFKEKFYDDYLDAAAKSKETKTTVVLDESFESILKETLPE 709
Query: 179 NPRSLFSSQNPLSPLSPLNANSPYNP 204
P + F++ PL P P + P+ P
Sbjct: 710 TPGTPFTTPRPLPPKLPRDEEFPFEP 735
>gnl|CDD|219686 pfam07992, Pyr_redox_2, Pyridine nucleotide-disulphide
oxidoreductase. This family includes both class I and
class II oxidoreductases and also NADH oxidases and
peroxidases. This domain is actually a small NADH
binding domain within a larger FAD binding domain.
Length = 283
Score = 46.2 bits (110), Expect = 3e-05
Identities = 21/80 (26%), Positives = 32/80 (40%), Gaps = 3/80 (3%)
Query: 435 DIIIVGASAAGCVLANRLSEVSSLKVLLIEAGGDTPIHSRIPGMSSVLSLSEFDHAYLAE 494
D++I+G AG A RL+ LKV LIE G T + + L L + LA
Sbjct: 1 DVVIIGGGPAGLAAAIRLAR-LGLKVALIEREGGTCYNR--GCLPKKLLLEVAEGLELAI 57
Query: 495 PSQFAGLGVRNARIKITAGK 514
+ +++ G
Sbjct: 58 GLALPEEVYKEFGVEVLLGT 77
>gnl|CDD|235787 PRK06370, PRK06370, mercuric reductase; Validated.
Length = 463
Score = 44.4 bits (106), Expect = 1e-04
Identities = 16/40 (40%), Positives = 21/40 (52%), Gaps = 4/40 (10%)
Query: 432 DCFDIIIVGASAAGCVLANRLSEVSSLKVLLIEA---GGD 468
+D I++GA AG LA R + +KV LIE GG
Sbjct: 4 QRYDAIVIGAGQAGPPLAARAAG-LGMKVALIERGLLGGT 42
>gnl|CDD|223717 COG0644, FixC, Dehydrogenases (flavoproteins) [Energy production
and conversion].
Length = 396
Score = 44.0 bits (104), Expect = 2e-04
Identities = 19/57 (33%), Positives = 29/57 (50%), Gaps = 5/57 (8%)
Query: 434 FDIIIVGASAAGCVLANRLSEVSSLKVLLIEAGGD--TPIHSRIPGMSSVLSLSEFD 488
+D++IVGA AG A RL++ + L VL++E G + G S +L E
Sbjct: 4 YDVVIVGAGPAGSSAARRLAK-AGLDVLVLEKGSEPGAKPCC--GGGLSPRALEELI 57
>gnl|CDD|218770 pfam05834, Lycopene_cycl, Lycopene cyclase protein. This family
consists of lycopene beta and epsilon cyclase proteins.
Carotenoids with cyclic end groups are essential
components of the photosynthetic membranes in all
plants, algae, and cyanobacteria. These lipid-soluble
compounds protect against photo-oxidation, harvest light
for photosynthesis, and dissipate excess light energy
absorbed by the antenna pigments. The cyclisation of
lycopene (psi, psi-carotene) is a key branch point in
the pathway of carotenoid biosynthesis. Two types of
cyclic end groups are found in higher plant carotenoids:
the beta and epsilon rings. Carotenoids with two beta
rings are ubiquitous, and those with one beta and one
epsilon ring are common; however, carotenoids with two
epsilon rings are rare.
Length = 374
Score = 40.4 bits (95), Expect = 0.003
Identities = 19/37 (51%), Positives = 25/37 (67%), Gaps = 1/37 (2%)
Query: 435 DIIIVGASAAGCVLANRLSE-VSSLKVLLIEAGGDTP 470
D++IVGA AG +LA RL + L+VLLI+AG P
Sbjct: 1 DLVIVGAGLAGLLLALRLRQARPGLRVLLIDAGPGPP 37
>gnl|CDD|240420 PTZ00441, PTZ00441, sporozoite surface protein 2 (SSP2);
Provisional.
Length = 576
Score = 40.3 bits (94), Expect = 0.003
Identities = 29/118 (24%), Positives = 45/118 (38%), Gaps = 13/118 (11%)
Query: 295 PENPASPFNPYNPEPSIFDSHNIYSFFSPHNPYSPFNPYNIDSPFNSDN-PDSPFRNYPQ 353
P+NP P P P + N + F P + P +P N +S F + +
Sbjct: 323 PDNPQDPVPP--PNEGKDGNPNEENLFPPGDDEVPDESNVPPNPPNVPGGSNSEFSSDVE 380
Query: 354 YSRYNPKNPESPFSPQNEDSIYNPENPNSPYN-PFN-----ENSLYHPGNPDSPYDWQ 405
P NP +P P+ E +I N P + P +N+ P P++ D Q
Sbjct: 381 N----PPNPPNPDIPEQEPNIPEDSNKEVPEDVPMEPEDDRDNNFNEPKKPENKGDGQ 434
Score = 35.3 bits (81), Expect = 0.14
Identities = 34/175 (19%), Positives = 47/175 (26%), Gaps = 35/175 (20%)
Query: 244 NPVETELIDQYSTSVYNNPDSPLSINSPHSVLNPYNLDSILNPFNVKSPLSPENPASPFN 303
NP E L V + + P + + N + NP N +P PE +
Sbjct: 340 NPNEENLFPPGDDEVPDESNVPPNPPNVPGGSNSEFSSDVENPPNPPNPDIPEQEPNIPE 399
Query: 304 PYNPE-PSIFDSHNIYSFFSPHNPYSPFNPYNIDSPFNSDNPDSPFRNYPQYSRYNPKNP 362
N E P P P + N + P +N P
Sbjct: 400 DSNKEVPEDV----------PMEPEDDRDN-NFNEPKKPEN-----------KGDGQNEP 437
Query: 363 ESPFSPQNEDSIYNPENPNSPYNPFNENSLYHPGNPDSPYDWQRLEKERYNIYRS 417
P NE N N +PGN + D R N R+
Sbjct: 438 VIPKPLDNERDQSNK------------NKQVNPGNRHNSEDRYTRPHGRNNENRN 480
Score = 29.2 bits (65), Expect = 8.3
Identities = 32/175 (18%), Positives = 60/175 (34%), Gaps = 22/175 (12%)
Query: 58 FDIIPYSNTDDIYNPFSPINQYSPYSPYNTDSLFDNTHPESLLNPYNPESLLSPLNPRSV 117
++ P + + P N + N++ D +P + NP PE +
Sbjct: 344 ENLFPPGDDEVPDESNVPPNPPNVPGGSNSEFSSDVENPPNPPNPDIPEQEPNIPED--- 400
Query: 118 FYPSNPEVPSFDYNNPNSPFNPYNPSSVFNPKNIDSVFNPRNPDSPYYNQYQFLYSPLSP 177
SN EVP + P P + N + P+ P++ Q + + P
Sbjct: 401 ---SNKEVPE------DVPMEPEDDRD----NNFNE---PKKPENKGDGQNEPV-IPKPL 443
Query: 178 YNPRSLFSSQNPLSPLSPLNANSPYNPRNQISYYNIENPKSPYNIHNPESPLYSD 232
N R + ++P + N+ Y + + N +I P+ P S+
Sbjct: 444 DNERDQSNKNKQVNPGNRHNSEDRYTRPHGRNNENRNYNNKNSDI--PKHPERSE 496
>gnl|CDD|233671 TIGR01988, Ubi-OHases, Ubiquinone biosynthesis hydroxylase,
UbiH/UbiF/VisC/COQ6 family. This model represents a
family of FAD-dependent hydroxylases (monooxygenases)
which are all believed to act in the aerobic ubiquinone
biosynthesis pathway. A separate set of hydroxylases, as
yet undiscovered, are believed to be active under
anaerobic conditions. In E. coli three enzyme activities
have been described, UbiB (which acts first at position
6, see TIGR01982), UbiH (which acts at position 4,) and
UbiF (which acts at position 5,). UbiH and UbiF are
similar to one another and form the basis of this
subfamily. Interestingly, E. coli contains another
hydroxylase gene, called visC, that is highly similar to
UbiF, adjacent to UbiH and, when mutated, results in a
phenotype similar to that of UbiH (which has also been
named visB). Several other species appear to have three
homologs in this family, although they assort themselves
differently on phylogenetic trees (e.g. Xylella and
Mesorhizobium) making it difficult to ascribe a specific
activity to each one. Eukaryotes appear to have only a
single homolog in this subfamily (COQ6,) which
complements UbiH, but also possess a non-orthologous
gene, COQ7 which complements UbiF [Biosynthesis of
cofactors, prosthetic groups, and carriers, Menaquinone
and ubiquinone].
Length = 387
Score = 39.9 bits (94), Expect = 0.003
Identities = 16/32 (50%), Positives = 18/32 (56%)
Query: 435 DIIIVGASAAGCVLANRLSEVSSLKVLLIEAG 466
DI+IVG G LA L+ LKV LIEA
Sbjct: 1 DIVIVGGGPVGLALALALARSGGLKVALIEAT 32
>gnl|CDD|223727 COG0654, UbiH, 2-polyprenyl-6-methoxyphenol hydroxylase and related
FAD-dependent oxidoreductases [Coenzyme metabolism /
Energy production and conversion].
Length = 387
Score = 39.7 bits (93), Expect = 0.004
Identities = 14/32 (43%), Positives = 18/32 (56%), Gaps = 1/32 (3%)
Query: 434 FDIIIVGASAAGCVLANRLSEVSSLKVLLIEA 465
D+ IVGA AG LA L+ + L V L+E
Sbjct: 3 LDVAIVGAGPAGLALALALAR-AGLDVTLLER 33
>gnl|CDD|205628 pfam13450, NAD_binding_8, NAD(P)-binding Rossmann-like domain.
Length = 66
Score = 35.5 bits (83), Expect = 0.006
Identities = 17/46 (36%), Positives = 24/46 (52%), Gaps = 8/46 (17%)
Query: 438 IVGASAAGCVLANRLSEVSSLKVLLIEA----GGDTPIHS-RIPGM 478
IVGA +G V A L++ VL++E GG+ +S R PG
Sbjct: 1 IVGAGLSGLVAAYLLAK-RGKDVLVLEKRDRIGGN--AYSERDPGY 43
>gnl|CDD|224169 COG1249, Lpd, Pyruvate/2-oxoglutarate dehydrogenase complex,
dihydrolipoamide dehydrogenase (E3) component, and
related enzymes [Energy production and conversion].
Length = 454
Score = 39.1 bits (92), Expect = 0.007
Identities = 14/34 (41%), Positives = 22/34 (64%), Gaps = 1/34 (2%)
Query: 434 FDIIIVGASAAGCVLANRLSEVSSLKVLLIEAGG 467
+D++++GA AG V A R ++ LKV L+E G
Sbjct: 5 YDVVVIGAGPAGYVAAIRAAQ-LGLKVALVEKGE 37
>gnl|CDD|240384 PTZ00367, PTZ00367, squalene epoxidase; Provisional.
Length = 567
Score = 39.1 bits (91), Expect = 0.009
Identities = 17/31 (54%), Positives = 22/31 (70%), Gaps = 1/31 (3%)
Query: 434 FDIIIVGASAAGCVLANRLSEVSSLKVLLIE 464
+D+IIVG S AG VLA LS+ KVL++E
Sbjct: 34 YDVIIVGGSIAGPVLAKALSKQGR-KVLMLE 63
>gnl|CDD|233693 TIGR02032, GG-red-SF, geranylgeranyl reductase family. This model
represents a subfamily which includes geranylgeranyl
reductases involved in chlorophyll and
bacteriochlorophyll biosynthesis as well as other
related enzymes which may also act on geranylgeranyl
groups or related substrates [Biosynthesis of cofactors,
prosthetic groups, and carriers, Chlorophyll and
bacteriochlorphyll].
Length = 295
Score = 37.7 bits (88), Expect = 0.015
Identities = 14/31 (45%), Positives = 22/31 (70%), Gaps = 1/31 (3%)
Query: 434 FDIIIVGASAAGCVLANRLSEVSSLKVLLIE 464
+D+++VGA AG A RL++ L+VLL+E
Sbjct: 1 YDVVVVGAGPAGASAAYRLAD-KGLRVLLLE 30
>gnl|CDD|233669 TIGR01984, UbiH, 2-polyprenyl-6-methoxyphenol 4-hydroxylase. This
model represents the FAD-dependent monoxygenase
responsible for the second hydroxylation step in the
aerobic ubiquinone bioynthetic pathway. The scope of
this model is limited to the proteobacteria. This family
is closely related to the UbiF hydroxylase which
catalyzes the final hydroxylation step. The enzyme has
also been named VisB due to a mutant VISible light
sensitive phenotype [Biosynthesis of cofactors,
prosthetic groups, and carriers, Menaquinone and
ubiquinone].
Length = 382
Score = 36.9 bits (86), Expect = 0.029
Identities = 15/31 (48%), Positives = 19/31 (61%)
Query: 435 DIIIVGASAAGCVLANRLSEVSSLKVLLIEA 465
D+IIVG G LA LS + +K+ LIEA
Sbjct: 1 DVIIVGGGLVGLSLALALSRLGKIKIALIEA 31
>gnl|CDD|233583 TIGR01813, flavo_cyto_c, flavocytochrome c. This model describes a
family of redox proteins related to the succinate
dehydrogenases and fumarate reductases of E. coli,
mitochondria, and other well-characterized systems. A
member of this family from Shewanella frigidimarina
NCIMB400 is characterized as a water-soluble periplasmic
protein with four heme groups, a non-covalently bound
FAD, and essentially unidirectional fumarate reductase
activity. At least seven distinct members of this family
are found in Shewanella oneidensis, a species able to
use a wide variety of pathways for respiraton [Energy
metabolism, Electron transport].
Length = 439
Score = 36.9 bits (86), Expect = 0.036
Identities = 17/50 (34%), Positives = 26/50 (52%), Gaps = 2/50 (4%)
Query: 655 RSKVTKVIINDQNVATGVEYVNSKGETVRVTANKEVILTAGAIA-NAQLL 703
SKV +I +DQ GV V KG+ + + A K V+L G N +++
Sbjct: 150 NSKVEDLIQDDQGTVVGVV-VKGKGKGIYIKAAKAVVLATGGFGSNKEMI 198
Score = 30.0 bits (68), Expect = 4.8
Identities = 12/43 (27%), Positives = 23/43 (53%), Gaps = 4/43 (9%)
Query: 435 DIIIVGASAAGCVLANRLSEVSSLKVLLIE----AGGDTPIHS 473
D+++VG+ AG A + + V+L+E GG++ I +
Sbjct: 1 DVVVVGSGFAGLSAALSAKKAGAANVVLLEKMPVIGGNSAIAA 43
>gnl|CDD|216400 pfam01266, DAO, FAD dependent oxidoreductase. This family includes
various FAD dependent oxidoreductases:
Glycerol-3-phosphate dehydrogenase EC:1.1.99.5,
Sarcosine oxidase beta subunit EC:1.5.3.1, D-alanine
oxidase EC:1.4.99.1, D-aspartate oxidase EC:1.4.3.1.
Length = 234
Score = 36.1 bits (84), Expect = 0.039
Identities = 10/33 (30%), Positives = 16/33 (48%), Gaps = 1/33 (3%)
Query: 435 DIIIVGASAAGCVLANRLSEVSSLKVLLIEAGG 467
D++++G G A L+ L V L+E G
Sbjct: 1 DVVVIGGGIVGLSTAYELAR-RGLSVTLLERGD 32
>gnl|CDD|216176 pfam00890, FAD_binding_2, FAD binding domain. This family includes
members that bind FAD. This family includes the
flavoprotein subunits from succinate and fumarate
dehydrogenase, aspartate oxidase and the alpha subunit
of adenylylsulphate reductase.
Length = 401
Score = 36.5 bits (85), Expect = 0.048
Identities = 11/40 (27%), Positives = 21/40 (52%), Gaps = 1/40 (2%)
Query: 435 DIIIVGASAAGCVLANRLSEVSSLKVLLIEAGGDTPIHSR 474
D++++G+ AG A +E + LKV ++E G +
Sbjct: 1 DVVVIGSGLAGLAAALEAAE-AGLKVAVVEKGQPFGGATA 39
Score = 34.2 bits (79), Expect = 0.21
Identities = 19/57 (33%), Positives = 25/57 (43%), Gaps = 3/57 (5%)
Query: 655 RSKVTKVIINDQNVATGVEYVN-SKGETVRVTANKEVILTAGAI-ANAQLLLLSGIG 709
R +I+ D TGV N G VR+ A V+L G NA+LLL +
Sbjct: 160 RHPADDLIVEDG-RVTGVVVENRRNGREVRIRAIAGVLLATGGFGRNAELLLPAAGY 215
>gnl|CDD|236001 PRK07364, PRK07364, 2-octaprenyl-6-methoxyphenyl hydroxylase;
Validated.
Length = 415
Score = 36.2 bits (84), Expect = 0.054
Identities = 14/37 (37%), Positives = 19/37 (51%), Gaps = 1/37 (2%)
Query: 429 KSGDCFDIIIVGASAAGCVLANRLSEVSSLKVLLIEA 465
+D+ IVG G LA L + S L++ LIEA
Sbjct: 14 TRSLTYDVAIVGGGIVGLTLAAALKD-SGLRIALIEA 49
>gnl|CDD|235774 PRK06292, PRK06292, dihydrolipoamide dehydrogenase; Validated.
Length = 460
Score = 35.9 bits (84), Expect = 0.067
Identities = 15/33 (45%), Positives = 21/33 (63%), Gaps = 1/33 (3%)
Query: 434 FDIIIVGASAAGCVLANRLSEVSSLKVLLIEAG 466
+D+I++GA AG V A R ++ KV LIE G
Sbjct: 4 YDVIVIGAGPAGYVAARRAAK-LGKKVALIEKG 35
>gnl|CDD|178267 PLN02661, PLN02661, Putative thiazole synthesis.
Length = 357
Score = 36.0 bits (83), Expect = 0.069
Identities = 23/74 (31%), Positives = 35/74 (47%), Gaps = 13/74 (17%)
Query: 398 PDSPYDWQRLEKERYNIYRSVI-------YPPDMTPYVKSGDCFDIIIVGASAAGCVLAN 450
+PYD + I S++ Y DM Y + D++IVGA +AG A
Sbjct: 56 STAPYDLNNFKFA--PIKESIVSREMTRRYMTDMITYADT----DVVIVGAGSAGLSCAY 109
Query: 451 RLSEVSSLKVLLIE 464
LS+ ++KV +IE
Sbjct: 110 ELSKNPNVKVAIIE 123
>gnl|CDD|216530 pfam01494, FAD_binding_3, FAD binding domain. This domain is
involved in FAD binding in a number of enzymes.
Length = 349
Score = 35.0 bits (81), Expect = 0.12
Identities = 18/81 (22%), Positives = 37/81 (45%), Gaps = 4/81 (4%)
Query: 435 DIIIVGASAAGCVLANRLSEVSSLKVLLIEAGGDTPIHSRIPGMSSVLSLSEFDHAYLAE 494
D++IVG AG +LA L+ + ++V+L+E T + R G++ ++ A L +
Sbjct: 3 DVLIVGGGPAGLMLALLLAR-AGVRVVLVERHATTSVLPRAGGLNQR-TMELLRQAGLED 60
Query: 495 PSQFAGLGVRNARIKITAGKG 515
+ G + + +
Sbjct: 61 --RILAEGAPHEGMGLAFYNT 79
>gnl|CDD|236385 PRK09126, PRK09126, hypothetical protein; Provisional.
Length = 392
Score = 34.9 bits (81), Expect = 0.16
Identities = 16/33 (48%), Positives = 19/33 (57%), Gaps = 1/33 (3%)
Query: 432 DCFDIIIVGASAAGCVLANRLSEVSSLKVLLIE 464
DI++VGA AG A L+ S LKV LIE
Sbjct: 2 MHSDIVVVGAGPAGLSFARSLAG-SGLKVTLIE 33
>gnl|CDD|224550 COG1635, THI4, Ribulose 1,5-bisphosphate synthetase, converts PRPP
to RuBP, flavoprotein [Carbohydrate transport and
metabolism].
Length = 262
Score = 34.2 bits (79), Expect = 0.18
Identities = 15/37 (40%), Positives = 21/37 (56%), Gaps = 5/37 (13%)
Query: 435 DIIIVGASAAGCVLANRLSEVSSLKVLLIEA----GG 467
D+IIVGA +G A L++ + LKV + E GG
Sbjct: 32 DVIIVGAGPSGLTAAYYLAK-AGLKVAIFERKLSFGG 67
>gnl|CDD|223981 COG1053, SdhA, Succinate dehydrogenase/fumarate reductase,
flavoprotein subunit [Energy production and conversion].
Length = 562
Score = 34.6 bits (80), Expect = 0.19
Identities = 13/40 (32%), Positives = 20/40 (50%), Gaps = 5/40 (12%)
Query: 434 FDIIIVGASAAGCVLANRLSEVSSLKVLLIE----AGGDT 469
FD++++G AG A +E + LKV L+ G T
Sbjct: 7 FDVVVIGGGGAGLRAAIEAAE-AGLKVALLSKAPPKRGHT 45
Score = 33.1 bits (76), Expect = 0.53
Identities = 15/81 (18%), Positives = 26/81 (32%), Gaps = 14/81 (17%)
Query: 618 VGFAELPGMTRYGLRFSAADAYLTPIAGKRTNLYVLKRSKVTKVIINDQNVATGVEYVNS 677
G L + L+FS + + V ++++D GV +
Sbjct: 136 TGHELLHTLYEQLLKFS--------------GIEIFDEYFVLDLLVDDGGGVAGVVARDL 181
Query: 678 KGETVRVTANKEVILTAGAIA 698
+ + V K VIL G
Sbjct: 182 RTGELYVFRAKAVILATGGAG 202
>gnl|CDD|182194 PRK10015, PRK10015, oxidoreductase; Provisional.
Length = 429
Score = 34.6 bits (79), Expect = 0.21
Identities = 18/37 (48%), Positives = 23/37 (62%), Gaps = 1/37 (2%)
Query: 430 SGDCFDIIIVGASAAGCVLANRLSEVSSLKVLLIEAG 466
S D FD I+VGA AG V A ++ + L VL+IE G
Sbjct: 2 SDDKFDAIVVGAGVAGSVAALVMAR-AGLDVLVIERG 37
>gnl|CDD|182273 PRK10157, PRK10157, putative oxidoreductase FixC; Provisional.
Length = 428
Score = 34.1 bits (78), Expect = 0.29
Identities = 19/37 (51%), Positives = 23/37 (62%), Gaps = 1/37 (2%)
Query: 430 SGDCFDIIIVGASAAGCVLANRLSEVSSLKVLLIEAG 466
S D FD IIVGA AG V A L+ + +VL+IE G
Sbjct: 2 SEDIFDAIIVGAGLAGSVAALVLAREGA-QVLVIERG 37
>gnl|CDD|223108 COG0029, NadB, Aspartate oxidase [Coenzyme metabolism].
Length = 518
Score = 34.2 bits (79), Expect = 0.29
Identities = 17/56 (30%), Positives = 25/56 (44%), Gaps = 1/56 (1%)
Query: 644 AGKRTNLYVLKRSKVTKVIINDQNVATGVEYVNSKGETVRVTANKEVILTAGAIAN 699
R N+ VL+ ++ +II D GV +N GE A K V+L G +
Sbjct: 143 VRNRPNITVLEGAEALDLIIEDGIGVAGVLVLNRNGELGTFRA-KAVVLATGGLGG 197
Score = 33.4 bits (77), Expect = 0.40
Identities = 9/35 (25%), Positives = 18/35 (51%), Gaps = 2/35 (5%)
Query: 429 KSGDCFDIIIVGASAAGCVLANRLSEVSSLKVLLI 463
+ D++I+G+ AG A L+ S +V ++
Sbjct: 3 TDFEHPDVLIIGSGLAGLTAALSLAP--SFRVTVL 35
>gnl|CDD|223652 COG0579, COG0579, Predicted dehydrogenase [General function
prediction only].
Length = 429
Score = 33.8 bits (78), Expect = 0.31
Identities = 12/32 (37%), Positives = 17/32 (53%), Gaps = 1/32 (3%)
Query: 434 FDIIIVGASAAGCVLANRLSEVS-SLKVLLIE 464
+D++I+G G A LSE L V L+E
Sbjct: 4 YDVVIIGGGIMGAATAYELSEYEPDLSVALLE 35
>gnl|CDD|130849 TIGR01789, lycopene_cycl, lycopene cyclase. This model represents
a family of bacterial lycopene cyclases catalyzing the
transformation of lycopene to carotene. These enzymes
are found in a limited spectrum of alpha and gamma
proteobacteria as well as Flavobacterium.
Length = 370
Score = 33.7 bits (77), Expect = 0.33
Identities = 27/84 (32%), Positives = 38/84 (45%), Gaps = 7/84 (8%)
Query: 435 DIIIVGASAAGCVLANRLSEVS-SLKVLLIEAGGDTPIHSRIPGMSSVLSLSEFDHAYLA 493
D IIVG AG ++A RL ++ +IEAG + S LS+ HA+LA
Sbjct: 1 DCIIVGGGLAGGLIALRLQRARPDFRIRVIEAGRTIGGNHTWSFFDS--DLSDAQHAWLA 58
Query: 494 EPSQ--FAGLGVRNA--RIKITAG 513
+ Q + G VR R K+
Sbjct: 59 DLVQTDWPGYEVRFPKYRRKLKTA 82
>gnl|CDD|224992 COG2081, COG2081, Predicted flavoproteins [General function
prediction only].
Length = 408
Score = 33.7 bits (78), Expect = 0.33
Identities = 13/33 (39%), Positives = 20/33 (60%), Gaps = 1/33 (3%)
Query: 434 FDIIIVGASAAGCVLANRLSEVSSLKVLLIEAG 466
FD+II+G AG + A ++ +VLLI+ G
Sbjct: 4 FDVIIIGGGPAGLMAAISAAKAGR-RVLLIDKG 35
>gnl|CDD|145231 pfam01946, Thi4, Thi4 family. This family includes a putative
thiamine biosynthetic enzyme.
Length = 229
Score = 33.2 bits (76), Expect = 0.33
Identities = 17/46 (36%), Positives = 25/46 (54%), Gaps = 5/46 (10%)
Query: 420 YPPDMTPYVKSGDCFDIIIVGASAAGCVLANRLSEVSSLKVLLIEA 465
Y D+ Y +S D++IVGA +G A L++ LKV +IE
Sbjct: 8 YFEDLLDYAES----DVVIVGAGPSGLTAAYYLAK-KGLKVAIIER 48
>gnl|CDD|232791 TIGR00031, UDP-GALP_mutase, UDP-galactopyranose mutase. This
enzyme is involved in the conversion of UDP-GALP into
UDP-GALF through a 2-keto intermediate. It contains FAD
as a cofactor. The gene is known as glf, ceoA, and rfbD.
It is known experimentally in E. coli, Mycobacterium
tuberculosis, and Klebsiella pneumoniae [Cell envelope,
Biosynthesis and degradation of surface polysaccharides
and lipopolysaccharides].
Length = 377
Score = 33.6 bits (77), Expect = 0.35
Identities = 16/31 (51%), Positives = 24/31 (77%), Gaps = 1/31 (3%)
Query: 434 FDIIIVGASAAGCVLANRLSEVSSLKVLLIE 464
FD IIVGA +G VLAN L++++ +VL++E
Sbjct: 2 FDYIIVGAGLSGIVLANILAQLNK-RVLVVE 31
>gnl|CDD|235584 PRK05732, PRK05732, 2-octaprenyl-6-methoxyphenyl hydroxylase;
Validated.
Length = 395
Score = 33.7 bits (78), Expect = 0.37
Identities = 21/46 (45%), Positives = 23/46 (50%), Gaps = 4/46 (8%)
Query: 434 FDIIIVGASAAGCVLANRLSEVS--SLKVLLIEAGGDTPIHSRIPG 477
D+IIVG AG LA LS +S L V LIEA P PG
Sbjct: 4 MDVIIVGGGMAGATLALALSRLSHGGLPVALIEA--FAPESDAHPG 47
>gnl|CDD|168201 PRK05714, PRK05714, 2-octaprenyl-3-methyl-6-methoxy-1,4-benzoquinol
hydroxylase; Provisional.
Length = 405
Score = 33.3 bits (76), Expect = 0.42
Identities = 18/54 (33%), Positives = 27/54 (50%), Gaps = 9/54 (16%)
Query: 435 DIIIVGASAAGCVLANRLSEVSSLKVLLIEAG--------GDTPIHSRIPGMSS 480
D++IVGA G LA L + S L+VLL++ G P R+ +S+
Sbjct: 4 DLLIVGAGMVGSALALAL-QGSGLEVLLLDGGPLSVKPFDPQAPFEPRVSALSA 56
>gnl|CDD|217590 pfam03486, HI0933_like, HI0933-like protein.
Length = 405
Score = 33.3 bits (77), Expect = 0.44
Identities = 12/34 (35%), Positives = 21/34 (61%), Gaps = 1/34 (2%)
Query: 434 FDIIIVGASAAGCVLANRLSEVSSLKVLLIEAGG 467
+D+I++G AAG + A ++ +VLLI+ G
Sbjct: 1 YDVIVIGGGAAGLMAAISAAK-RGRRVLLIDKGK 33
>gnl|CDD|130850 TIGR01790, carotene-cycl, lycopene cyclase family protein. This
family includes lycopene beta and epsilion cyclases
(which form beta and delta carotene, respectively) from
bacteria and plants as well as the plant
capsanthin/capsorubin and neoxanthin cyclases which
appear to have evolved from the plant lycopene cyclases.
The plant lycopene epsilon cyclases also transform
neurosporene to alpha zeacarotene.
Length = 388
Score = 33.2 bits (76), Expect = 0.45
Identities = 12/36 (33%), Positives = 18/36 (50%), Gaps = 1/36 (2%)
Query: 435 DIIIVGASAAGCVLANRLSEVSSLKVLLIEAGGDTP 470
D+ ++G AG +A L+ L+V LIE P
Sbjct: 1 DLAVIGGGPAGLAIALELAR-PGLRVQLIEPHPPIP 35
>gnl|CDD|218191 pfam04652, DUF605, Vta1 like. Vta1 (VPS20-associated protein 1) is
a positive regulator of Vps4. Vps4 is an ATPase that is
required in the multivesicular body (MVB) sorting
pathway to dissociate the endosomal sorting complex
required for transport (ESCRT). Vta1 promotes correct
assembly of Vps4 and stimulates its ATPase activity
through its conserved Vta1/SBP1/LIP5 region.
Length = 315
Score = 33.1 bits (76), Expect = 0.45
Identities = 20/117 (17%), Positives = 36/117 (30%), Gaps = 5/117 (4%)
Query: 290 KSPLSPENPASPFNPYNPEPSIF--DSHNIYSFFSPHNPYSPFNPYNIDSPFNSDNPDSP 347
+ ++ N + F + +P+ S P P P +P + P S
Sbjct: 160 DADVATTNSDNSFPGEDADPASASPSDPPSSSPGVPSFPSPPEDPSSPSDSSLPPAPSSF 219
Query: 348 FRNYPQYSRYNPKNPESPFSPQNEDSIY---NPENPNSPYNPFNENSLYHPGNPDSP 401
+ P S +P NP P P P + P + ++ P +
Sbjct: 220 QSDTPPPSPESPTNPSPPPGPAAPPPPPVQQVPPLSTAKPTPPSASATPAPIGGITL 276
>gnl|CDD|235241 PRK04176, PRK04176, ribulose-1,5-biphosphate synthetase;
Provisional.
Length = 257
Score = 32.9 bits (76), Expect = 0.51
Identities = 12/30 (40%), Positives = 18/30 (60%), Gaps = 1/30 (3%)
Query: 435 DIIIVGASAAGCVLANRLSEVSSLKVLLIE 464
D+ IVGA +G A L++ + LKV + E
Sbjct: 27 DVAIVGAGPSGLTAAYYLAK-AGLKVAVFE 55
>gnl|CDD|181057 PRK07608, PRK07608, ubiquinone biosynthesis hydroxylase family
protein; Provisional.
Length = 388
Score = 33.0 bits (76), Expect = 0.57
Identities = 16/52 (30%), Positives = 25/52 (48%), Gaps = 6/52 (11%)
Query: 434 FDIIIVGASAAGCVLANRLSEVSSLKVLLIEAG-----GDTPIHSRIPGMSS 480
FD+++VG G LA L++ S L+V L+ D SR+ +S
Sbjct: 6 FDVVVVGGGLVGASLALALAQ-SGLRVALLAPRAPPRPADDAWDSRVYAISP 56
>gnl|CDD|223636 COG0562, Glf, UDP-galactopyranose mutase [Cell envelope biogenesis,
outer membrane].
Length = 374
Score = 32.7 bits (75), Expect = 0.69
Identities = 12/32 (37%), Positives = 21/32 (65%), Gaps = 1/32 (3%)
Query: 434 FDIIIVGASAAGCVLANRLSEVSSLKVLLIEA 465
FD +IVGA +G V+A +++ +VL++E
Sbjct: 2 FDYLIVGAGLSGAVIAEVAAQLGK-RVLIVEK 32
>gnl|CDD|185594 PTZ00395, PTZ00395, Sec24-related protein; Provisional.
Length = 1560
Score = 33.1 bits (75), Expect = 0.71
Identities = 37/126 (29%), Positives = 53/126 (42%), Gaps = 12/126 (9%)
Query: 194 SPLNANSPYN--PRNQISYYNIENPKSPYNIHNPESPLYSDSPYY---YRNLPLYNPVET 248
+P N+N YN P + Y N N +PY+ +P YS+ PY Y N PL N +
Sbjct: 416 NPGNSNPGYNNAPNSNTPYNNPPNSNTPYSNPPNSNPPYSNLPYSNTPYSNAPLSNAPPS 475
Query: 249 ELIDQYSTSVYNNPDSPLSINSPHSVLNPYNLDSILNPFNVKSPLSPENP--ASPFN--P 304
D + S Y+ + N P + L N N F+ + S NP + PF P
Sbjct: 476 SAKDHH--SAYHAAYQHRAANQPAANLPTAN-QPAANNFHGAAGNSVGNPFASRPFGSAP 532
Query: 305 YNPEPS 310
Y +
Sbjct: 533 YGGNAA 538
Score = 31.2 bits (70), Expect = 2.8
Identities = 29/111 (26%), Positives = 44/111 (39%), Gaps = 15/111 (13%)
Query: 198 ANSPYN--PRNQISYYNIENPKSPYNIHNPESPLYSDSPYYYRNLPLYNPVETELIDQYS 255
+N+ Y+ + Y N N +PYN +P S++P Y N P NP + L YS
Sbjct: 410 SNAGYSNPGNSNPGYNNAPNSNTPYN-----NPPNSNTP--YSNPPNSNPPYSNL--PYS 460
Query: 256 TSVYNNPD----SPLSINSPHSVLNPYNLDSILNPFNVKSPLSPENPASPF 302
+ Y+N P S HS + N P + + A+ F
Sbjct: 461 NTPYSNAPLSNAPPSSAKDHHSAYHAAYQHRAANQPAANLPTANQPAANNF 511
>gnl|CDD|223523 COG0446, HcaD, Uncharacterized NAD(FAD)-dependent dehydrogenases
[General function prediction only].
Length = 415
Score = 32.6 bits (74), Expect = 0.75
Identities = 13/42 (30%), Positives = 19/42 (45%), Gaps = 2/42 (4%)
Query: 436 IIIVGASAAGCVLANRL-SEVSSLKVLLIEAGGDTPIHSRIP 476
I+IVG AAG A L + + ++ LI + R P
Sbjct: 1 IVIVGGGAAGLSAATTLRRLLLAAEITLIGR-EPKYSYYRCP 41
>gnl|CDD|233368 TIGR01350, lipoamide_DH, dihydrolipoamide dehydrogenase. This
model describes dihydrolipoamide dehydrogenase, a
flavoprotein that acts in a number of ways. It is the E3
component of dehydrogenase complexes for pyruvate,
2-oxoglutarate, 2-oxoisovalerate, and acetoin. It can
also serve as the L protein of the glycine cleavage
system. This family includes a few members known to have
distinct functions (ferric leghemoglobin reductase and
NADH:ferredoxin oxidoreductase) but that may be
predicted by homology to act as dihydrolipoamide
dehydrogenase as well. The motif GGXCXXXGCXP near the
N-terminus contains a redox-active disulfide.
Length = 460
Score = 32.6 bits (75), Expect = 0.84
Identities = 22/64 (34%), Positives = 31/64 (48%), Gaps = 5/64 (7%)
Query: 646 KRTNLYVLKRSKVTKVIINDQNVATGVEYVNSKGETVRVTANKEVILTAGAIANAQLLLL 705
K+ + +L +KVT V ND V Y N GET +T K V++ G N + L L
Sbjct: 222 KKKGVKILTNTKVTAVEKNDD----QVTYENKGGETETLTGEK-VLVAVGRKPNTEGLGL 276
Query: 706 SGIG 709
+G
Sbjct: 277 EKLG 280
Score = 31.5 bits (72), Expect = 2.0
Identities = 14/37 (37%), Positives = 22/37 (59%), Gaps = 4/37 (10%)
Query: 434 FDIIIVGASAAGCVLANRLSEVSSLKVLLIEA---GG 467
+D+I++G G V A R +++ LKV L+E GG
Sbjct: 2 YDVIVIGGGPGGYVAAIRAAQL-GLKVALVEKEYLGG 37
>gnl|CDD|180854 PRK07121, PRK07121, hypothetical protein; Validated.
Length = 492
Score = 32.6 bits (75), Expect = 0.90
Identities = 16/40 (40%), Positives = 24/40 (60%), Gaps = 1/40 (2%)
Query: 656 SKVTKVIINDQNVATGVEYVNSKGETVRVTANKEVILTAG 695
++ T++I++D GVE GETV + A K V+L AG
Sbjct: 198 TRATRLIVDDDGRVVGVE-ARRYGETVAIRARKGVVLAAG 236
>gnl|CDD|235905 PRK06996, PRK06996, hypothetical protein; Provisional.
Length = 398
Score = 32.4 bits (74), Expect = 0.95
Identities = 16/35 (45%), Positives = 19/35 (54%), Gaps = 3/35 (8%)
Query: 434 FDIIIVGASAAGCVLAN---RLSEVSSLKVLLIEA 465
FDI IVGA G LA R S +L + LI+A
Sbjct: 12 FDIAIVGAGPVGLALAGWLARRSATRALSIALIDA 46
>gnl|CDD|100027 cd00396, PurM-like, AIR (aminoimidazole ribonucleotide) synthase
related protein. This family includes Hydrogen
expression/formation protein HypE, AIR synthases, FGAM
(formylglycinamidine ribonucleotide) synthase and
Selenophosphate synthetase (SelD). The N-terminal domain
of AIR synthase forms the dimer interface of the
protein, and is suggested as a putative ATP binding
domain.
Length = 222
Score = 31.6 bits (72), Expect = 1.0
Identities = 16/67 (23%), Positives = 24/67 (35%), Gaps = 7/67 (10%)
Query: 654 KRSKVTKVIINDQNVATGVEYVNSKGETVRVTANKEVILTAGAIANAQLLLLSGIGPKAH 713
++ V+I TGV+ V V A ++ T G + L G A
Sbjct: 110 SGARPGDVLI-----LTGVDAVLELVAAGDVHAMHDI--TDGGLLGTLPELAQASGVGAE 162
Query: 714 LDEVKIP 720
+D IP
Sbjct: 163 IDLEAIP 169
>gnl|CDD|221798 pfam12831, FAD_oxidored, FAD dependent oxidoreductase. This family
of proteins contains FAD dependent oxidoreductases and
related proteins.
Length = 415
Score = 32.2 bits (74), Expect = 1.0
Identities = 16/65 (24%), Positives = 27/65 (41%), Gaps = 15/65 (23%)
Query: 435 DIIIVGASAAGCVLANRLSEVSS----LKVLLIEAGGDTPIHSRIPGMSSVLSLSEFDHA 490
D+++VG AG A +++ KVLL+E G + GM++ + D
Sbjct: 1 DVVVVGGGPAGVAAA-----IAAARLGAKVLLVERRG------WLGGMATSGLVGTDDGN 49
Query: 491 YLAEP 495
Y
Sbjct: 50 YDRGQ 54
>gnl|CDD|235798 PRK06416, PRK06416, dihydrolipoamide dehydrogenase; Reviewed.
Length = 462
Score = 32.0 bits (74), Expect = 1.0
Identities = 14/37 (37%), Positives = 23/37 (62%), Gaps = 4/37 (10%)
Query: 434 FDIIIVGASAAGCVLANRLSEVSSLKVLLIEA---GG 467
+D+I++GA G V A R +++ LKV ++E GG
Sbjct: 5 YDVIVIGAGPGGYVAAIRAAQL-GLKVAIVEKEKLGG 40
>gnl|CDD|219358 pfam07271, Cytadhesin_P30, Cytadhesin P30/P32. This family
consists of several Mycoplasma species specific
Cytadhesin P32 and P30 proteins. P30 has been found to
be membrane associated and localised on the tip
organelle. It is thought that it is important in
cytadherence and virulence.
Length = 279
Score = 32.0 bits (72), Expect = 1.1
Identities = 24/115 (20%), Positives = 33/115 (28%), Gaps = 17/115 (14%)
Query: 288 NVKSPLSPENPASPFNP-YNPEPSIFDSHNIYSFFSPHNPYSPFNPYNIDSPFNSDNPDS 346
N + ++P+ P P P ++ PH P P + FN
Sbjct: 164 NPQQRINPQRFGFPMQPNMGMRPG-------FNQMPPHMP--GMPPNQMRPGFNPMPGMP 214
Query: 347 PFRNYPQYSRYNPKNPESPFSPQNEDSIYNPENPNSPYNPFNENSLYHPGNPDSP 401
P + Q P F PQ P N P P N PG +
Sbjct: 215 PRPGFNQNPNMMPNMNRPGFRPQ-------PGGFNHPGTPMGPNMQQRPGFNPNQ 262
>gnl|CDD|223651 COG0578, GlpA, Glycerol-3-phosphate dehydrogenase [Energy
production and conversion].
Length = 532
Score = 31.9 bits (73), Expect = 1.2
Identities = 12/33 (36%), Positives = 18/33 (54%), Gaps = 1/33 (3%)
Query: 434 FDIIIVGASAAGCVLANRLSEVSSLKVLLIEAG 466
FD+I++G G +A + LKV L+E G
Sbjct: 13 FDVIVIGGGITGAGIARDAA-GRGLKVALVEKG 44
>gnl|CDD|236223 PRK08294, PRK08294, phenol 2-monooxygenase; Provisional.
Length = 634
Score = 32.3 bits (74), Expect = 1.2
Identities = 20/72 (27%), Positives = 28/72 (38%), Gaps = 16/72 (22%)
Query: 393 YHPGNPDSPYDWQRLEKERYNIYRSVIYPPDMTPYVKSGDCFDIIIVGASAAGCVLANRL 452
Y PG+P I R P D+ D D++IVG AG LA +L
Sbjct: 8 YVPGDPRI------QPAAGRGINR----PADL------PDEVDVLIVGCGPAGLTLAAQL 51
Query: 453 SEVSSLKVLLIE 464
S + ++E
Sbjct: 52 SAFPDITTRIVE 63
>gnl|CDD|223737 COG0665, DadA, Glycine/D-amino acid oxidases (deaminating) [Amino
acid transport and metabolism].
Length = 387
Score = 31.8 bits (72), Expect = 1.3
Identities = 11/34 (32%), Positives = 17/34 (50%), Gaps = 1/34 (2%)
Query: 434 FDIIIVGASAAGCVLANRLSEVSSLKVLLIEAGG 467
D++I+G G A L+E V ++EAG
Sbjct: 5 MDVVIIGGGIVGLSAAYYLAE-RGADVTVLEAGE 37
>gnl|CDD|217301 pfam02956, TT_ORF1, TT viral orf 1. TT virus (TTV), isolated
initially from a Japanese patient with hepatitis of
unknown aetiology, has since been found to infect both
healthy and diseased individuals and numerous prevalence
studies have raised questions about its role in
unexplained hepatitis. ORF1 is a large 750 residue
protein. The N-terminal half of this protein corresponds
to the capsid protein.
Length = 525
Score = 31.9 bits (73), Expect = 1.3
Identities = 20/125 (16%), Positives = 32/125 (25%), Gaps = 33/125 (26%)
Query: 127 SFDY-------NNPNSPFNPYNPSSVFNPKNIDSVFNPRNPDSPYYNQYQFLYSPLSPYN 179
Y +NP F NP +N ++++ S LY ++ YN
Sbjct: 241 DLRYPFLSPQTDNPCITFQVLNPG-FYN----NALWISTTTTSKGKTLLITLYKTVTYYN 295
Query: 180 PRSLFSSQNPLSPLSPLNANSPYNPRNQISYYNIENPKSPYNIHNPESPLYSDSPYYYRN 239
+ Q+ K+ N + Y DSPY
Sbjct: 296 T---------------------FETPAQLKPKTKNKNKTTNKTPNDSTSKYWDSPYGDNT 334
Query: 240 LPLYN 244
Sbjct: 335 YNSLE 339
>gnl|CDD|225325 COG2605, COG2605, Predicted kinase related to galactokinase and
mevalonate kinase [General function prediction only].
Length = 333
Score = 31.6 bits (72), Expect = 1.4
Identities = 29/113 (25%), Positives = 36/113 (31%), Gaps = 16/113 (14%)
Query: 445 GCVLANRLSEVSSLKVLLIEAGGDTPIHSRIPGMSSVLSLSEFDHAYLAEPS---QFAGL 501
G VL + + + IE G D I R V S E +H L S F
Sbjct: 28 GAVLNATIDKYI---YVTIEKGFDDEIRVRYDRTEFVKSYLENEHKPLVVESLKRDFLEF 84
Query: 502 -GVRNARI----KITAGKGLGGSSAVQNILYQ-----RGTSYDYENFAKLGYN 544
G + G GLG SSA L +G S A+ Y
Sbjct: 85 NGGTPIELHTQSDAPPGSGLGSSSAFVVALLNALHAWKGESLGPYELAREAYE 137
>gnl|CDD|224983 COG2072, TrkA, Predicted flavoprotein involved in K+ transport
[Inorganic ion transport and metabolism].
Length = 443
Score = 31.7 bits (72), Expect = 1.7
Identities = 20/105 (19%), Positives = 31/105 (29%), Gaps = 9/105 (8%)
Query: 432 DCFDIIIVGASAAGCVLANRLSEVSSLKVLLIEA---GGDTPIHSRIPGMSSVLSLSEFD 488
D+ I+GA +G A L + ++ E G T ++R PG+
Sbjct: 7 THTDVAIIGAGQSGLAAAYALKQAGVPDFVIFEKRDDVGGTWRYNRYPGLRLDSPKWLLG 66
Query: 489 HAYL--AEPSQFAGLGVRNARIKITAGKGLGGSSAVQNILYQRGT 531
+L FA IK K I +
Sbjct: 67 FPFLPFRWDEAFAPFAEIKDYIKDYLEK----YGLRFQIRFNTRV 107
>gnl|CDD|131515 TIGR02462, pyranose_ox, pyranose oxidase. Pyranose oxidase (also
called glucose 2-oxidase) converts D-glucose and
molecular oxygen to 2-dehydro-D-glucose and hydrogen
peroxide. Peroxide production is believed to be
important to the wood rot fungi in which this enzyme is
found for lignin degradation.
Length = 544
Score = 31.4 bits (71), Expect = 1.9
Identities = 22/88 (25%), Positives = 35/88 (39%), Gaps = 8/88 (9%)
Query: 637 DAYLTPIAGKRTNLYVLKRSKVTKVIINDQN----VATGVEYVNSKGETVRVTANKEVIL 692
+R L L + T+++ N+ N A V + S G+ + A +L
Sbjct: 218 LQPNDDAPSERFTL--LTNHRCTRLVRNETNESEIEAALVRDLLS-GDRFEIKA-DVYVL 273
Query: 693 TAGAIANAQLLLLSGIGPKAHLDEVKIP 720
GA+ N Q+L+ SG G D P
Sbjct: 274 ACGAVHNPQILVNSGFGQLGRPDPTNPP 301
>gnl|CDD|223566 COG0492, TrxB, Thioredoxin reductase [Posttranslational
modification, protein turnover, chaperones].
Length = 305
Score = 31.1 bits (71), Expect = 2.1
Identities = 14/40 (35%), Positives = 18/40 (45%), Gaps = 13/40 (32%)
Query: 434 FDIIIVGASAAG------CVLANRLSEVSSLKVLLIEAGG 467
+D+II+G AG A LKV+LI GG
Sbjct: 4 YDVIIIGGGPAGLTAAIYAARAG-------LKVVLILEGG 36
>gnl|CDD|236010 PRK07395, PRK07395, L-aspartate oxidase; Provisional.
Length = 553
Score = 31.2 bits (71), Expect = 2.2
Identities = 14/30 (46%), Positives = 19/30 (63%), Gaps = 2/30 (6%)
Query: 434 FDIIIVGASAAGCVLANRLSEVSSLKVLLI 463
FD+++VG+ AAG L L S L+V LI
Sbjct: 10 FDVLVVGSGAAG--LYAALCLPSHLRVGLI 37
>gnl|CDD|240393 PTZ00383, PTZ00383, malate:quinone oxidoreductase; Provisional.
Length = 497
Score = 30.9 bits (70), Expect = 2.5
Identities = 14/38 (36%), Positives = 21/38 (55%), Gaps = 1/38 (2%)
Query: 428 VKSGDCFDIIIVGASAAGCVLANRLSEVSSLK-VLLIE 464
D +D++IVG G L LS+ ++LK + LIE
Sbjct: 40 RLGSDVYDVVIVGGGVTGTALFYTLSKFTNLKKIALIE 77
>gnl|CDD|178082 PLN02463, PLN02463, lycopene beta cyclase.
Length = 447
Score = 30.8 bits (70), Expect = 2.7
Identities = 11/35 (31%), Positives = 19/35 (54%), Gaps = 1/35 (2%)
Query: 430 SGDCFDIIIVGASAAGCVLANRLSEVSSLKVLLIE 464
D+++VG AG +A ++SE + L V I+
Sbjct: 25 KSRVVDLVVVGGGPAGLAVAQQVSE-AGLSVCCID 58
>gnl|CDD|219404 pfam07415, Herpes_LMP2, Gammaherpesvirus latent membrane protein
(LMP2) protein. This family consists of several
Gammaherpesvirus latent membrane protein (LMP2)
proteins. Epstein-Barr virus is a human Gammaherpesvirus
that infects and establishes latency in B lymphocytes in
vivo. The latent membrane protein 2 (LMP2) gene is
expressed in latently infected B cells and encodes two
protein isoforms, LMP2A and LMP2B, that are identical
except for an additional N-terminal 119 aa cytoplasmic
domain which is present in the LMP2A isoform. LMP2A is
thought to play a key role in either the establishment
or the maintenance of latency and/or the reactivation of
productive infection from the latent state. The
significance of LMP2B and its role in pathogenesis
remain unclear.
Length = 489
Score = 31.0 bits (70), Expect = 2.8
Identities = 23/111 (20%), Positives = 40/111 (36%), Gaps = 20/111 (18%)
Query: 292 PLSPENPASPFNPYNPEPSIFDSHNIYSFFSPHNPYSPFNPYNIDSPFNSDNPDSPFRNY 351
PL P S P E +P+ P S + ++ P ++ D+P
Sbjct: 8 PLGAGGPRSHGGPDGDEGD----------SNPYYPSSFGSSWDRPGPPVPEDYDAPSHRP 57
Query: 352 PQYSRYNPKNPESPFSP--QNEDSIY-------NPENPNSPYNPFNENSLY 393
P Y + + + P Q + S+Y P PY+P ++ S +
Sbjct: 58 PPYGG-SNGDRHGGYQPLGQQDPSLYAGLGQNGGGGLPPPPYSPRDQGSEH 107
Score = 29.1 bits (65), Expect = 9.6
Identities = 28/91 (30%), Positives = 39/91 (42%), Gaps = 16/91 (17%)
Query: 138 NPYNPSSVFNPKNIDSVFNPRNPDS------PYYNQYQFL---YSPLSPYNPRSLFS--S 186
NPY PSS + + P + D+ PY Y PL +P SL++
Sbjct: 28 NPYYPSSFGSSWDRPGPPVPEDYDAPSHRPPPYGGSNGDRHGGYQPLGQQDP-SLYAGLG 86
Query: 187 QNPLSPLSPLNANSPYNPRNQISYYNIENPK 217
QN L P PY+PR+Q S + E P+
Sbjct: 87 QNGGGGLPP----PPYSPRDQGSEHVYEEPR 113
>gnl|CDD|237225 PRK12843, PRK12843, putative FAD-binding dehydrogenase; Reviewed.
Length = 578
Score = 30.9 bits (70), Expect = 2.9
Identities = 21/56 (37%), Positives = 30/56 (53%), Gaps = 11/56 (19%)
Query: 424 MTPYVKS------GDCFDIIIVGASAAGCVLANRLSEVSSLKVLLIEA----GGDT 469
M+P V FD+I++GA AAG + A + ++ LKVLL+E GG T
Sbjct: 1 MSPVVSELSPERWDAEFDVIVIGAGAAG-MSAALFAAIAGLKVLLVERTEYVGGTT 55
>gnl|CDD|227430 COG5099, COG5099, RNA-binding protein of the Puf family,
translational repressor [Translation, ribosomal
structure and biogenesis].
Length = 777
Score = 30.9 bits (70), Expect = 2.9
Identities = 49/301 (16%), Positives = 77/301 (25%), Gaps = 45/301 (14%)
Query: 104 NPESLLSPLNPRSVFYPSNPEVPSFDYNNPNSPFNPYNPSSVFNPKNIDSVFNPRNPDSP 163
N LS S+F + +P+ +++N + N S + P + S S
Sbjct: 130 NSSVTLSSSTASSMFNSNKLPLPNPNHSNSATT-NQSGSSFINTPASSSSQPLTNLVVSS 188
Query: 164 YYNQYQFLYSPLSPYNPRSLFSSQNPLSPLSPLNANSPYNPRNQISYYNIENPKSPYNIH 223
F+ S S + N + N S ++
Sbjct: 189 IKRFPYLTSLS-------PFFNYLIDPSSDSATASADTSPSFNPPPNLSPNNLFSTSDLS 241
Query: 224 NPESPLYSDSPYYYRNLPLYNPVETELIDQYSTSVYNNPDSPLSINSPHSVLNPYNLDSI 283
LP VE +I S+S IN S+
Sbjct: 242 P---------------LPDTQSVENNIILNSSSS----------INELTSIYGSVPSIRN 276
Query: 284 LNPFNV--KSPLSPE-NPASPFNPYNPEPSIFDSHNIYSFFSPHNPYSPFNPYNIDSPFN 340
L N S L+ + + E S S + SF SP N +
Sbjct: 277 LRGLNSALVSFLNVSSSSLAFSALNGKEVSPTGSPSTRSFARVLPKSSP-NNLLTEILTT 335
Query: 341 SDNPDSP------FRNYPQYSRYNPKNPESPFSP-QNEDSIYNPENPNSPYNPFNENSLY 393
NP + ++ N +P +N N Y N S
Sbjct: 336 GVNPPQSLPSLLNPVFLSTSTGFSLTNLSGYLNPNKNLKKNTLSSLSNLGY-SSNVPSPS 394
Query: 394 H 394
Sbjct: 395 S 395
>gnl|CDD|183782 PRK12834, PRK12834, putative FAD-binding dehydrogenase; Reviewed.
Length = 549
Score = 30.6 bits (70), Expect = 3.1
Identities = 13/30 (43%), Positives = 21/30 (70%), Gaps = 1/30 (3%)
Query: 435 DIIIVGASAAGCVLANRLSEVSSLKVLLIE 464
D+I+VGA AG V A L++ + +VLL++
Sbjct: 6 DVIVVGAGLAGLVAAAELAD-AGKRVLLLD 34
>gnl|CDD|222447 pfam13904, DUF4207, Domain of unknown function (DUF4207). This
family is found in eukaryotes; it has several conserved
tryptophan residues. The function is not known.
Length = 261
Score = 30.1 bits (68), Expect = 3.3
Identities = 18/95 (18%), Positives = 29/95 (30%), Gaps = 7/95 (7%)
Query: 74 SPINQYSPYSPYNTDSLFDNTHPESLLNPYNPESLLSPLNPRSVFYPSNPEVPSFDYNNP 133
S + SP + S D T S + + + L S+ VPS ++
Sbjct: 6 STRSLLSPLGNELSPSSSDETEDCSEESTDSWSDMYEGLKDSE---SSSNSVPSLSLSST 62
Query: 134 NSPFNPYNPSSVFNPKNIDSVFNPRNPDSPYYNQY 168
S S +++ V R Y N
Sbjct: 63 ASS----LSDSSTYSRSLKEVKLERQAQEAYENWL 93
>gnl|CDD|224154 COG1233, COG1233, Phytoene dehydrogenase and related proteins
[Secondary metabolites biosynthesis, transport, and
catabolism].
Length = 487
Score = 30.5 bits (69), Expect = 3.5
Identities = 14/54 (25%), Positives = 26/54 (48%), Gaps = 7/54 (12%)
Query: 434 FDIIIVGASAAGCVLANRLSEVSSLKVLLIEAGGDTPIHSRIPGMSSVLSLSEF 487
+D++++GA G A L+ + LKV ++E + R+ G + L F
Sbjct: 4 YDVVVIGAGLNGLAAAALLAR-AGLKVTVLEK------NDRVGGRARTFELDGF 50
>gnl|CDD|233672 TIGR01989, COQ6, ubiquinone biosynthesis monooxygenase COQ6. This
model represents the monooxygenase responsible for the
4-hydroxylateion of the phenol ring in the aerobic
biosynthesis of ubiquinone.
Length = 437
Score = 30.5 bits (69), Expect = 3.8
Identities = 17/40 (42%), Positives = 21/40 (52%), Gaps = 4/40 (10%)
Query: 434 FDIIIVGASAAGCVLANRLSEVS---SLKVLLIEAGGDTP 470
FD++IVG G LA L LKVLL++A D P
Sbjct: 1 FDVVIVGGGPVGLALAAALGNNPLTKDLKVLLLDA-VDNP 39
>gnl|CDD|225307 COG2509, COG2509, Uncharacterized FAD-dependent dehydrogenases
[General function prediction only].
Length = 486
Score = 30.4 bits (69), Expect = 3.9
Identities = 40/186 (21%), Positives = 62/186 (33%), Gaps = 45/186 (24%)
Query: 434 FDIIIVGASAAGCVLANRLSE----VSSLKVLLIEAGGDTPIHSRIPGMSSVLSLSEFDH 489
D++IVGA AG A LS V LK+ +++ G D I R+ E
Sbjct: 19 LDVVIVGAGPAGLFAAYELSGDARKVPILKIYVVDVGLD--IEQRL------CPKDEKKL 70
Query: 490 AYLAEPSQFAGLGVRNARIKITAGKGLGG--SSAVQNILYQRGTSY-----DYENFAKLG 542
+ I G G G S + N+ RG D + F +L
Sbjct: 71 EKCPK------CDPCP----IVIGFGGAGLFSDGILNLRPIRGGDVHERTKDTDEFWEL- 119
Query: 543 YNGWGYDETLKYF------VKSEDYRSVIYNESKAVHGTQGYLP-----VGLFKNKENNI 591
DE+ F S+ + E +A + LP +G + +
Sbjct: 120 --VNLVDESNVQFGAPGAGTFSDLTEQIKDIEFRAAGAGEEILPIYQRHIGTDILPK--V 175
Query: 592 IREIFE 597
++ I E
Sbjct: 176 VKNIRE 181
>gnl|CDD|236374 PRK09077, PRK09077, L-aspartate oxidase; Provisional.
Length = 536
Score = 29.9 bits (68), Expect = 4.9
Identities = 13/33 (39%), Positives = 21/33 (63%), Gaps = 2/33 (6%)
Query: 434 FDIIIVGASAAGCVLANRLSEVSSLKVLLIEAG 466
D++I+G+ AAG LA RL+E +V ++ G
Sbjct: 9 CDVLIIGSGAAGLSLALRLAE--HRRVAVLSKG 39
>gnl|CDD|234511 TIGR04226, RrgB_K2N_iso_D2, fimbrial isopeptide formation D2
domain. The Streptococcus Pneumoniae pilus backbone
protein, RrgB, has three tandem domains with Lys-to-Asn
isopeptide bonds, but these three regions are extremely
divergent in sequence. This model represents the
homology domain family of the D2 domain. It occurs just
once in many surface proteins but up to twenty times in
some pilin subunit proteins. Three of every four members
have the typical Gram-positive C-terminal motif, LPXTG,
although in many cases this motif may be involved in
pilin subunit cross-linking rather than cell wall
attachment. Proteins with this domain include fimbrial
proteins with lectin-like adhesion functions, and the
majority of characterized members are involved in
surface adhesion to host structures.
Length = 124
Score = 28.4 bits (64), Expect = 5.7
Identities = 19/73 (26%), Positives = 31/73 (42%), Gaps = 9/73 (12%)
Query: 652 VLKRSKVTKVIINDQNVATGVEYVNSKGETVRVT---------ANKEVILTAGAIANAQL 702
L+ KV ++ + + +Y + G+TV VT A K++ +T A N
Sbjct: 50 GLEYDGSVKVTVDGKTLDVTTDYTVTDGQTVTVTFTDDGLKKLAGKKITVTYTAKVNEGA 109
Query: 703 LLLSGIGPKAHLD 715
+L GI A L
Sbjct: 110 VLGKGIPNTATLT 122
>gnl|CDD|226103 COG3573, COG3573, Predicted oxidoreductase [General function
prediction only].
Length = 552
Score = 29.8 bits (67), Expect = 5.9
Identities = 13/34 (38%), Positives = 23/34 (67%), Gaps = 1/34 (2%)
Query: 435 DIIIVGASAAGCVLANRLSEVSSLKVLLIEAGGD 468
D+I+VGA AG V A L++ + +VL+++ G+
Sbjct: 7 DVIVVGAGLAGLVAAAELAD-AGKRVLILDQEGE 39
>gnl|CDD|199833 cd00216, PQQ_DH_like, PQQ-dependent dehydrogenases and related
proteins. This family is composed of dehydrogenases
with pyrroloquinoline quinone (PQQ) as a cofactor, such
as ethanol, methanol, and membrane-bound glucose
dehydrogenases. The alignment model contains an 8-bladed
beta-propeller, and the family also includes distantly
related proteins which are not enzymatically active and
do not bind PQQ.
Length = 434
Score = 29.5 bits (66), Expect = 7.5
Identities = 13/83 (15%), Positives = 25/83 (30%)
Query: 343 NPDSPFRNYPQYSRYNPKNPESPFSPQNEDSIYNPENPNSPYNPFNENSLYHPGNPDSPY 402
P N + +P ++ + +S N +Y G +P+
Sbjct: 159 YLVMPDPNLLPGKDSTVTDRNTPTGDEHTWTSGGGTGWSSAAYDAELNLIYVGGGNPTPW 218
Query: 403 DWQRLEKERYNIYRSVIYPPDMT 425
+W N+Y S I +
Sbjct: 219 NWGGNRTPGDNLYTSSIVAVNAD 241
>gnl|CDD|227544 COG5219, COG5219, Uncharacterized conserved protein, contains RING
Zn-finger [General function prediction only].
Length = 1525
Score = 29.6 bits (66), Expect = 7.7
Identities = 24/111 (21%), Positives = 41/111 (36%), Gaps = 11/111 (9%)
Query: 222 IHNPESPLYSDSP--YYYRNLPLYNPVETELIDQYSTSVYNNPDSPLSINSPHSVLNP-- 277
I P L S Y R+ +Y P+ + ++ +VYN S +S +S+ +
Sbjct: 1213 IERPGDHLLSVKKEEYRERHTIVYLPLWRLIYGHFADTVYNMYSSIISQLDMNSLDSEVY 1272
Query: 278 ------YNLDSILNPFNV-KSPLSPENPASPFNPYNPEPSIFDSHNIYSFF 321
+ L ++L V S L N + ++ H Y FF
Sbjct: 1273 VLLRGCFLLSTVLFWDLVDNSLLIVYNISINYSSATDRIRCLAIHIYYQFF 1323
>gnl|CDD|235879 PRK06854, PRK06854, adenylylsulfate reductase subunit alpha;
Validated.
Length = 608
Score = 29.5 bits (67), Expect = 7.9
Identities = 14/31 (45%), Positives = 19/31 (61%), Gaps = 1/31 (3%)
Query: 435 DIIIVGASAAGCVLANRLSEVSS-LKVLLIE 464
DI+I+G AGC A E + LKVL++E
Sbjct: 13 DILIIGGGMAGCGAAFEAKEWAPDLKVLIVE 43
>gnl|CDD|187566 cd05256, UDP_AE_SDR_e, UDP-N-acetylglucosamine 4-epimerase,
extended (e) SDRs. This subgroup contains
UDP-N-acetylglucosamine 4-epimerase of Pseudomonas
aeruginosa, WbpP, an extended SDR, that catalyzes the
NAD+ dependent conversion of UDP-GlcNAc and UDPGalNA to
UDP-Glc and UDP-Gal. This subgroup has the
characteristic active site tetrad and NAD-binding motif
of the extended SDRs. Extended SDRs are distinct from
classical SDRs. In addition to the Rossmann fold
(alpha/beta folding pattern with a central beta-sheet)
core region typical of all SDRs, extended SDRs have a
less conserved C-terminal extension of approximately 100
amino acids. Extended SDRs are a diverse collection of
proteins, and include isomerases, epimerases,
oxidoreductases, and lyases; they typically have a
TGXXGXXG cofactor binding motif. SDRs are a functionally
diverse family of oxidoreductases that have a single
domain with a structurally conserved Rossmann fold, an
NAD(P)(H)-binding region, and a structurally diverse
C-terminal region. Sequence identity between different
SDR enzymes is typically in the 15-30% range; they
catalyze a wide range of activities including the
metabolism of steroids, cofactors, carbohydrates,
lipids, aromatic compounds, and amino acids, and act in
redox sensing. Classical SDRs have an TGXXX[AG]XG
cofactor binding motif and a YXXXK active site motif,
with the Tyr residue of the active site motif serving as
a critical catalytic residue (Tyr-151, human
15-hydroxyprostaglandin dehydrogenase numbering). In
addition to the Tyr and Lys, there is often an upstream
Ser and/or an Asn, contributing to the active site;
while substrate binding is in the C-terminal region,
which determines specificity. The standard reaction
mechanism is a 4-pro-S hydride transfer and proton relay
involving the conserved Tyr and Lys, a water molecule
stabilized by Asn, and nicotinamide. Atypical SDRs
generally lack the catalytic residues characteristic of
the SDRs, and their glycine-rich NAD(P)-binding motif is
often different from the forms normally seen in
classical or extended SDRs. Complex (multidomain) SDRs
such as ketoreductase domains of fatty acid synthase
have a GGXGXXG NAD(P)-binding motif and an altered
active site motif (YXXXN). Fungal type ketoacyl
reductases have a TGXXXGX(1-2)G NAD(P)-binding motif.
Length = 304
Score = 29.1 bits (66), Expect = 8.0
Identities = 13/32 (40%), Positives = 17/32 (53%), Gaps = 3/32 (9%)
Query: 381 NSPYNPFNENSLYHPGNPDSPYDWQRLEKERY 412
+ PY P +E+ HP NP SPY + E Y
Sbjct: 123 DPPYLPKDED---HPPNPLSPYAVSKYAGELY 151
>gnl|CDD|236831 PRK11052, malQ, 4-alpha-glucanotransferase; Provisional.
Length = 695
Score = 29.5 bits (67), Expect = 8.2
Identities = 14/33 (42%), Positives = 16/33 (48%), Gaps = 11/33 (33%)
Query: 112 LNPRSVFYPSNPEVPSFDYNNPNSPFNPYNPSS 144
LNP YP+NPE S PY+PSS
Sbjct: 184 LNPIHALYPANPESAS-----------PYSPSS 205
>gnl|CDD|238990 cd02033, BchX, Chlorophyllide reductase converts chlorophylls into
bacteriochlorophylls by reducing the chlorin B-ring.
This family contains the X subunit of this three-subunit
enzyme. Sequence and structure similarity between bchX,
protochlorophyllide reductase L subunit (bchL and chlL)
and nitrogenase Fe protein (nifH gene) suggest their
functional similarity. Members of the BchX family serve
as the unique electron donors to their respective
catalytic subunits (bchN-bchB, bchY-bchZ and nitrogenase
component 1). Mechanistically, they hydrolyze ATP and
transfer electrons through a Fe4-S4 cluster.
Length = 329
Score = 29.2 bits (65), Expect = 8.3
Identities = 14/44 (31%), Positives = 21/44 (47%), Gaps = 10/44 (22%)
Query: 513 GKGLGGSSAVQNILYQRGTSYDYENFAKLGYNGWGYDETLKYFV 556
G+G GG RG + +E KLG++ W +D L F+
Sbjct: 123 GRGCGG----------RGIIHGFELLEKLGFHDWDFDYVLLDFL 156
>gnl|CDD|183787 PRK12844, PRK12844, 3-ketosteroid-delta-1-dehydrogenase; Reviewed.
Length = 557
Score = 29.3 bits (66), Expect = 8.6
Identities = 15/46 (32%), Positives = 25/46 (54%), Gaps = 3/46 (6%)
Query: 658 VTKVIINDQNVATGVEYVNSKGETVRVTANKEVILTAGAIA-NAQL 702
+T++I+ D V GV V G V + A + V+L +G NA++
Sbjct: 231 LTELIVEDGRV-VGVVVVRD-GREVLIRARRGVLLASGGFGHNAEM 274
>gnl|CDD|220065 pfam08902, DUF1848, Domain of unknown function (DUF1848). This
family of proteins are functionally uncharacterized. The
C-terminus contains a cluster of cysteines that are
similar to the iron-sulfur cluster found at the
N-terminus of pfam04055.
Length = 264
Score = 28.7 bits (65), Expect = 8.8
Identities = 19/56 (33%), Positives = 28/56 (50%), Gaps = 17/56 (30%)
Query: 138 NPYNPSSV----FNPKNIDS-VFNPRNP----------DSPYYNQYQFLYSPLSPY 178
NPYNP V P+++D+ VF +NP D+ Y Y F ++ L+PY
Sbjct: 31 NPYNPKQVSRVSLTPEDVDAIVFWTKNPAPFLPRLDELDARGYPYY-FQFT-LTPY 84
>gnl|CDD|232901 TIGR00275, TIGR00275, flavoprotein, HI0933 family. The model when
searched with a partial length search brings in proteins
with a dinucleotide-binding motif (Rossman fold) over
the initial 40 residues of the model, including
oxidoreductases and dehydrogenases. Partially
characterized members include an FAD-binding protein
from Bacillus cereus and flavoprotein HI0933 from
Haemophilus influenzae [Unknown function, Enzymes of
unknown specificity].
Length = 400
Score = 29.1 bits (66), Expect = 8.9
Identities = 12/31 (38%), Positives = 16/31 (51%), Gaps = 1/31 (3%)
Query: 437 IIVGASAAGCVLANRLSEVSSLKVLLIEAGG 467
II+G AAG + A + L VLL+E
Sbjct: 1 IIIGGGAAGLMAAITAAR-EGLSVLLLEKNK 30
>gnl|CDD|234978 PRK01747, mnmC, bifunctional tRNA
(mnm(5)s(2)U34)-methyltransferase/FAD-dependent
cmnm(5)s(2)U34 oxidoreductase; Reviewed.
Length = 662
Score = 29.4 bits (67), Expect = 8.9
Identities = 12/31 (38%), Positives = 15/31 (48%), Gaps = 1/31 (3%)
Query: 435 DIIIVGASAAGCVLANRLSEVSSLKVLLIEA 465
D I+G AG LA L+ +V L EA
Sbjct: 262 DAAIIGGGIAGAALALALAR-RGWQVTLYEA 291
>gnl|CDD|180419 PRK06134, PRK06134, putative FAD-binding dehydrogenase; Reviewed.
Length = 581
Score = 29.3 bits (66), Expect = 9.3
Identities = 16/60 (26%), Positives = 25/60 (41%), Gaps = 4/60 (6%)
Query: 656 SKVTKVIINDQNVATGVEYVNSKGETVRVTANKEVILTAGAIAN--AQLLLLSGIGPKAH 713
+ +++ D VA V V + G + A K V+L AG + A+ L P H
Sbjct: 238 APARELLREDGRVAGAV--VETPGGLQEIRARKGVVLAAGGFPHDPARRAALFPRAPTGH 295
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.313 0.134 0.402
Gapped
Lambda K H
0.267 0.0647 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 37,905,742
Number of extensions: 3777310
Number of successful extensions: 3107
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2906
Number of HSP's successfully gapped: 173
Length of query: 720
Length of database: 10,937,602
Length adjustment: 104
Effective length of query: 616
Effective length of database: 6,324,786
Effective search space: 3896068176
Effective search space used: 3896068176
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 63 (28.2 bits)