RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy14408
         (720 letters)



>gnl|CDD|235000 PRK02106, PRK02106, choline dehydrogenase; Validated.
          Length = 560

 Score =  210 bits (536), Expect = 1e-59
 Identities = 115/293 (39%), Positives = 154/293 (52%), Gaps = 20/293 (6%)

Query: 434 FDIIIVGASAAGCVLANRLSEVSSLKVLLIEAGG-DTPIHSRIPGMSSVLSL----SEFD 488
           +D II+GA +AGCVLANRLSE   + VLL+EAGG D      I  M + L+       ++
Sbjct: 6   YDYIIIGAGSAGCVLANRLSEDPDVSVLLLEAGGPDYRWDFFIQ-MPAALAFPLQGKRYN 64

Query: 489 HAYLAEPSQFAGLGVRNARIKITAGKGLGGSSAVQNILYQRGTSYDYENFAKL-GYNGWG 547
            AY  EP         N R++   GK LGGSS++  ++Y RG + DY+N+A+L G  GW 
Sbjct: 65  WAYETEPEPHMN----NRRMECPRGKVLGGSSSINGMVYIRGNAMDYDNWAELPGLEGWS 120

Query: 548 YDETLKYFVKSEDYRSVIYNESKAVHGTQGYLPVGLFKNKENNIIREIFETSAQELGYPC 607
           Y + L YF K+E              G  G L V   K   N + +   E +  + GYP 
Sbjct: 121 YADCLPYFKKAETR----DGGEDDYRGGDGPLSVTRGKPGTNPLFQAFVE-AGVQAGYPR 175

Query: 608 PKDMNDRYVDVGFAELPGMTRYGLRFSAADAYLTPIAGKRTNLYVLKRSKVTKVIINDQN 667
             D+N  Y   GF  +      G R+SAA AYL P A KR NL ++  +   +++  +  
Sbjct: 176 TDDLNG-YQQEGFGPMDRTVTNGRRWSAARAYLDP-ALKRPNLTIVTHALTDRILF-EGK 232

Query: 668 VATGVEYVNSKGETVRVTANKEVILTAGAIANAQLLLLSGIGPKAHLDEVKIP 720
            A GVEY    G      A +EVIL+AGAI + QLL LSGIGP  HL E+ IP
Sbjct: 233 RAVGVEYER-GGGRETARARREVILSAGAINSPQLLQLSGIGPAEHLKELGIP 284


>gnl|CDD|225186 COG2303, BetA, Choline dehydrogenase and related flavoproteins
           [Amino acid transport and metabolism].
          Length = 542

 Score =  196 bits (501), Expect = 7e-55
 Identities = 103/290 (35%), Positives = 152/290 (52%), Gaps = 12/290 (4%)

Query: 434 FDIIIVGASAAGCVLANRLSEVSSLKVLLIEAGG-DTPIHSRIPGMSSVLSLSEFDHAYL 492
           +D +IVG+ +AG VLA RLS+ + L VL++EAGG D     ++P   + L        Y 
Sbjct: 8   YDYVIVGSGSAGSVLAARLSD-AGLSVLVLEAGGPDRRPLIQMPAAYAFL---MNGPRYD 63

Query: 493 AEPSQFAGLGVRNARIKITAGKGLGGSSAVQNILYQRGTSYDYENFAKL-GYNGWGYDET 551
                     +R   +    GK LGGSS++  ++Y RG   D++ +A+  G  GW YD+ 
Sbjct: 64  WGFRTEPEPHLRGRELAWPRGKVLGGSSSINGMVYVRGHPEDFDAWAQESGAPGWPYDDV 123

Query: 552 LKYFVKSEDYRSVIYNESKAVHGTQGYLPVGLFKNKENNIIREIFETSAQELGYPCPKDM 611
           L YF ++ED   V   + +  HG  G  P+ +   +  N I   F  + ++LG+P   D 
Sbjct: 124 LPYFKRAEDLLGVGGQDLRTWHGGGG--PLPVSPPRSPNPIARAFIEAGEQLGFPTTPDP 181

Query: 612 NDRYVDVGFAELPGMTRYGLRFSAADAYLTPIAGKRTNLYVLKRSKVTKVIINDQNVATG 671
           N    + GF         G R+SAA AYL P   KR NL +L  ++V ++++ + + A G
Sbjct: 182 NGADQE-GFGPYCVTICNGRRWSAARAYLKPAL-KRPNLTLLTGARVRRILL-EGDRAVG 238

Query: 672 VEYVNSKGETVRVT-ANKEVILTAGAIANAQLLLLSGIGPKAHLDEVKIP 720
           VE     G T+    A +EV+L AGAI + +LLLLSGIGP  HL E  I 
Sbjct: 239 VEVEIGDGGTIETAVAAREVVLAAGAINSPKLLLLSGIGPADHLLEHGID 288


>gnl|CDD|233581 TIGR01810, betA, choline dehydrogenase.  Choline dehydrogenase
           catalyzes the conversion of exogenously supplied choline
           into the intermediate glycine betaine aldehyde, as part
           of a two-step oxidative reaction leading to the
           formation of osmoprotectant betaine. This enzymatic
           system can be found in both gram-positive and
           gram-negative bacteria. As in Escherichia coli ,
           Staphylococcus xylosus , and Sinorhizobium meliloti,
           this enzyme is found associated in a transciptionally
           co-induced gene cluster with betaine aldehyde
           dehydrogenase, the second catalytic enzyme in this
           reaction. Other gram-positive organisms have been shown
           to employ a different enzymatic system, utlizing a
           soluable choline oxidase or type III alcohol
           dehydrogenase instead of choline dehydrogenase. This
           enzyme is a member of the GMC oxidoreductase family
           (pfam00732 and pfam05199), sharing a common
           evoluntionary origin and enzymatic reaction with alcohol
           dehydrogenase. Outgrouping from this model, Caulobacter
           crescentus shares sequence homology with choline
           dehydrogenase, yet other genes participating in this
           enzymatic reaction have not currently been identified
           [Cellular processes, Adaptations to atypical
           conditions].
          Length = 532

 Score =  177 bits (451), Expect = 4e-48
 Identities = 112/291 (38%), Positives = 148/291 (50%), Gaps = 19/291 (6%)

Query: 435 DIIIVGASAAGCVLANRLSEVSSLKVLLIEAGG-DTPIHSRI---PGMSSVLSLSEFDHA 490
           D II+G  +AG VLA RLSE  S  VL++EAGG D P    I     ++       ++  
Sbjct: 1   DYIIIGGGSAGSVLAGRLSEDVSNSVLVLEAGGSDYPWDLLIQMPAALAYPAGNKRYNWI 60

Query: 491 YLAEPSQFAGLGVRNARIKITAGKGLGGSSAVQNILYQRGTSYDYENFAK-LGYNGWGYD 549
           Y  EP       + N R+    GK LGGSS++  ++YQRG   DYE +AK  G   W Y 
Sbjct: 61  YETEPEPH----MNNRRVGHARGKVLGGSSSINGMIYQRGNPMDYEKWAKPEGMESWDYA 116

Query: 550 ETLKYFVKSEDYRSVIYNESKAVHGTQGYLPVGLFKNKENNIIREIFETSAQELGYPCPK 609
           + L Y+ + E      +   K   G  G  P+ + +   +N + + F  +  E GY    
Sbjct: 117 DCLPYYKRLE----TTFGGEKPYRGHDG--PIKVRRGPADNPLFQAFIEAGVEAGYNKTP 170

Query: 610 DMNDRYVDVGFAELPGMTRYGLRFSAADAYLTPIAGKRTNLYVLKRSKVTKVIINDQNVA 669
           D+N  +   GF  +      G R SAA AYL P A KR NL V  R+ VTK+   + N A
Sbjct: 171 DVNG-FRQEGFGPMDSTVHNGRRVSAARAYLHP-AMKRPNLEVQTRAFVTKINF-EGNRA 227

Query: 670 TGVEYVNSKGETVRVTANKEVILTAGAIANAQLLLLSGIGPKAHLDEVKIP 720
           TGVE+    G      ANKEVIL+AGAI + QLL LSGIG   HL E+ I 
Sbjct: 228 TGVEFKKG-GRKEHTEANKEVILSAGAINSPQLLQLSGIGDAEHLRELGIE 277


>gnl|CDD|234421 TIGR03970, Rv0697, dehydrogenase, Rv0697 family.  This model
           describes a set of dehydrogenases belonging to the
           glucose-methanol-choline oxidoreductase (GMC
           oxidoreductase) family. Members of the present family
           are restricted to Actinobacterial genome contexts
           containing also members of families TIGR03962 and
           TIGR03969 (the mycofactocin system), and are proposed to
           be uniform in function.
          Length = 487

 Score =  142 bits (359), Expect = 5e-36
 Identities = 90/294 (30%), Positives = 129/294 (43%), Gaps = 26/294 (8%)

Query: 435 DIIIVGASAAGCVLANRLSEVSSLKVLLIEAG------GDTPIHSRIPGMSSVLSLSEFD 488
           D+++VG  +AGCV+A RLSE  S  V ++EAG         P          +   S   
Sbjct: 2   DVLVVGGGSAGCVVAARLSEDPSCTVTVLEAGPGYRDPSRLPAQLTDGLRLPIGPASPVV 61

Query: 489 HAYLAEPSQFAGLGVRNARIKITAGKGLGGSSAVQNILYQRGTSYDYENFAKLGYNGWGY 548
             Y     +      R +   I  G+ LGGS AV    + R    D++ +      GW +
Sbjct: 62  WRY---GVELTDGPRRAS--AIVRGRVLGGSGAVNGGYFCRALPADFDAWPI---PGWSW 113

Query: 549 DETLKYFVKSEDYRSVIYNESKAVHGTQGYLPVGLFKNKENNIIREIFETSAQELGYPCP 608
           D+ L +F   E            +HGT G +PV   +  E + I   F  +A   G+   
Sbjct: 114 DDVLPHFRAIETDLDF----DGPLHGTAGPIPVR--RTAELDGISAAFVAAALGAGFGWI 167

Query: 609 KDMNDRYVDVGF--AELPGMTRYGLRFSAADAYLTPIAGKRTNLYVLKRSKVTKVIINDQ 666
            D+N    D       +P     G R S A AYL P A KR NL V   ++V +++ +  
Sbjct: 168 ADLNGSGPDAPTGVGAVPLNVDGGRRVSTAVAYLLP-ALKRPNLTVEADTRVVRILFS-G 225

Query: 667 NVATGVEYVNSKGETVRVTANKEVILTAGAIANAQLLLLSGIGPKAHLDEVKIP 720
             A GVE +   G    + A++ V+L AGA+ +A LLLLSGIGP   L    I 
Sbjct: 226 TRAVGVEVL-GDGGPRTLRADR-VVLCAGAVESAHLLLLSGIGPAEQLRAAGIA 277


>gnl|CDD|216089 pfam00732, GMC_oxred_N, GMC oxidoreductase.  This family of
           proteins bind FAD as a cofactor.
          Length = 218

 Score =  118 bits (297), Expect = 4e-30
 Identities = 68/217 (31%), Positives = 99/217 (45%), Gaps = 29/217 (13%)

Query: 503 VRNARIKITAGKGLGGSSAVQNILYQRGTSYDYENFAKL-GYNGWGYDETLKYFVKSEDY 561
           V   R+ + AG  +GG S+V      R  +   + +A   G  GWGYD+ L Y  K E  
Sbjct: 14  VNGRRMILPAGSTVGGGSSVNWSACIRTPAAVLDEWASEFGLEGWGYDDYLPYMDKVE-- 71

Query: 562 RSVIYNESKAVHGTQGYLPVGLFKNKENNIIREIFETSAQELGYP---CPKDMNDRYVDV 618
                       G  G    G+ ++  N  + +    +A+ELGYP    P++ N  +   
Sbjct: 72  ------------GPLGVTTKGIEESPLNQALLK----AAEELGYPVEAVPRNSNGCH-YC 114

Query: 619 GFAELPGMTRYGLRFSAADAYLTPIAGKRTNLYVLKRSKVTKVIINDQN-VATGVEYVNS 677
           GF  L      G + S A  +L P   +  NL +L  +K  K+II  +   A GVE  + 
Sbjct: 115 GFCGLG--CPTGAKQSTARTWLRPALER--NLRILTGAKAEKIIILGRGGRAVGVEARDG 170

Query: 678 KGETVR-VTANKEVILTAGAIANAQLLLLSGIGPKAH 713
            G   R +TA KEV++ AGA+    LLL SG+G   H
Sbjct: 171 GGGIKRLITAAKEVVVAAGALNTPPLLLRSGLGKNPH 207


>gnl|CDD|215420 PLN02785, PLN02785, Protein HOTHEAD.
          Length = 587

 Score = 85.2 bits (211), Expect = 3e-17
 Identities = 88/316 (27%), Positives = 129/316 (40%), Gaps = 88/316 (27%)

Query: 434 FDIIIVGASAAGCVLANRLSEVSSLKVLLIEAGGDTPIHSRIPGMSSVLSLSEFDHAYLA 493
           +D I+VG   AGC LA  LS+  +  VLL+E GG       +P  ++ +S  E  H  LA
Sbjct: 56  YDYIVVGGGTAGCPLAATLSQ--NFSVLLLERGG-------VPFGNANVSFLENFHIGLA 106

Query: 494 E-----PSQF--AGLGVRNARIKITAGKGLGGSSAVQNILYQRGTSYDYENFAKLGYNGW 546
           +      SQ   +  GV NAR ++     LGG + +    Y R ++     F +      
Sbjct: 107 DTSPTSASQAFISTDGVINARARV-----LGGGTCINAGFYSRAST----RFIQKA---- 153

Query: 547 GYDETLKYFVKSEDYRSVIYNESKAVHGTQGYLPVGLFKNKENNIIREIFETSAQELGYP 606
           G+D  L     +E Y  V   E + VH                                P
Sbjct: 154 GWDAKLV----NESYPWV---ERQIVH---------------------------WPKVAP 179

Query: 607 CPKDMNDRYVDVGFAELPGMT---------------RYGLRFSAADAYLTPIAGKRTNLY 651
               + D  ++VG +   G T                +G R +AA+      AG    L 
Sbjct: 180 WQAALRDSLLEVGVSPFNGFTYDHVYGTKVGGTIFDEFGRRHTAAELL---AAGNPNKLR 236

Query: 652 VLKRSKVTKVIINDQNV---ATGVEYVNSKGETVRVTANK----EVILTAGAIANAQLLL 704
           VL  + V K++ +       ATGV + +  G   +   +     E+IL+AGAI + Q+LL
Sbjct: 237 VLLHATVQKIVFDTSGKRPRATGVIFKDENGNQHQAFLSNNKGSEIILSAGAIGSPQMLL 296

Query: 705 LSGIGPKAHLDEVKIP 720
           LSGIGPK  L + KIP
Sbjct: 297 LSGIGPKKELKKHKIP 312


>gnl|CDD|185628 PTZ00449, PTZ00449, 104 kDa microneme/rhoptry antigen; Provisional.
          Length = 943

 Score = 50.8 bits (121), Expect = 2e-06
 Identities = 51/247 (20%), Positives = 82/247 (33%), Gaps = 38/247 (15%)

Query: 136 PFNPYNPSSVFNPKNIDSVFNPRNPDSPYYNQYQFLYSPLSPYNPRSLFSSQNPLSPLSP 195
           P  P +P    +P+       PR+   P          P SP  P  L   ++P  P SP
Sbjct: 579 PEFPKDPKHPKDPEEPKKPKRPRSAQRPT--------RPKSPKLPELLDIPKSPKRPESP 630

Query: 196 LNANSPYNPRNQISYYNIENPKSPYNIHNPESPLYSDSPYYYRNLPLYNPVETELIDQYS 255
            +   P  P+   S    E PK   +   P+SP            P     + +  D Y 
Sbjct: 631 KSPKRPPPPQRPSSPERPEGPKIIKSPKPPKSP----------KPPFDPKFKEKFYDDYL 680

Query: 256 TSVYNNPDSPLSI---NSPHSVLNPYNLDSILNPFNVKSPLSPENPASPFNPYNP--EPS 310
            +   + ++  ++    S  S+L     ++   PF    PL P+ P     P+ P  +P 
Sbjct: 681 DAAAKSKETKTTVVLDESFESILKETLPETPGTPFTTPRPLPPKLPRDEEFPFEPIGDPD 740

Query: 311 IFDSHNIYSFFSPHNPYSPFNPYNIDSPF---------------NSDNPDSPFRNYPQYS 355
                +I  F  P    + F+    D+P                 +  PD   +     S
Sbjct: 741 AEQPDDIEFFTPPEEERTFFHETPADTPLPDILAEEFKEEDIHAETGEPDEAMKRPDSPS 800

Query: 356 RYNPKNP 362
            +  K P
Sbjct: 801 EHEDKPP 807



 Score = 47.8 bits (113), Expect = 2e-05
 Identities = 46/194 (23%), Positives = 72/194 (37%), Gaps = 11/194 (5%)

Query: 214 ENPKSPYNIHNPESPLYSDSPYY-YRNLPLYNPVETELIDQYSTSVYNNPDSPLSINSPH 272
           E PK P +  +PE P     P    R     +P   EL+D     +  +P  P S  SP 
Sbjct: 580 EFPKDPKHPKDPEEPKKPKRPRSAQRPTRPKSPKLPELLD-----IPKSPKRPESPKSPK 634

Query: 273 SVLNPYNLDSILNPFNVKSPLSPENPASPFNPYNP--EPSIFDSHNIYSFFSPHNPYSPF 330
               P    S   P   K   SP+ P SP  P++P  +   +D +   +  S     +  
Sbjct: 635 RPPPPQRPSSPERPEGPKIIKSPKPPKSPKPPFDPKFKEKFYDDYLDAAAKSKETKTTVV 694

Query: 331 NPYNIDSPFNSDNPDSPFRNYPQYSRYNPK---NPESPFSPQNEDSIYNPENPNSPYNPF 387
              + +S      P++P   +       PK   + E PF P  +     P++      P 
Sbjct: 695 LDESFESILKETLPETPGTPFTTPRPLPPKLPRDEEFPFEPIGDPDAEQPDDIEFFTPPE 754

Query: 388 NENSLYHPGNPDSP 401
            E + +H    D+P
Sbjct: 755 EERTFFHETPADTP 768



 Score = 45.1 bits (106), Expect = 1e-04
 Identities = 34/124 (27%), Positives = 47/124 (37%), Gaps = 22/124 (17%)

Query: 286 PFNVKSPLSPENPASPFNPYNPEPSIFDSHNIYSFFSPHNPYSPFNPYNIDSPFNSDNPD 345
           P   K P  P++P  P +P  P+              P +   P  P +   P   D P 
Sbjct: 573 PTLSKKPEFPKDPKHPKDPEEPKKPK----------RPRSAQRPTRPKSPKLPELLDIPK 622

Query: 346 SPFRNYPQYSRYNPKNPESPFSPQNEDSIYNPENPNSPYNPFNENSLYHPGNPDSPYDWQ 405
           SP R        +PK+P+ P  PQ   S   PE P    +P        P +P  P+D +
Sbjct: 623 SPKRPE------SPKSPKRPPPPQRPSSPERPEGPKIIKSP------KPPKSPKPPFDPK 670

Query: 406 RLEK 409
             EK
Sbjct: 671 FKEK 674



 Score = 38.9 bits (90), Expect = 0.011
 Identities = 31/146 (21%), Positives = 50/146 (34%), Gaps = 12/146 (8%)

Query: 71  NPFSPINQYSPYSPYNTDSLFDNTHPESLLNPYNPESLLSPLNPRSVFYPSNPEVPSF-- 128
           +P  P     P S            PE L  P +P+   SP +P+    P  P  P    
Sbjct: 590 DPEEPKKPKRPRSAQRPTRPKSPKLPELLDIPKSPKRPESPKSPKRPPPPQRPSSPERPE 649

Query: 129 DYNNPNSPFNPYNPSSVFNPKNIDSVFNPRNPDS----------PYYNQYQFLYSPLSPY 178
                 SP  P +P   F+PK  +  ++     +               ++ +     P 
Sbjct: 650 GPKIIKSPKPPKSPKPPFDPKFKEKFYDDYLDAAAKSKETKTTVVLDESFESILKETLPE 709

Query: 179 NPRSLFSSQNPLSPLSPLNANSPYNP 204
            P + F++  PL P  P +   P+ P
Sbjct: 710 TPGTPFTTPRPLPPKLPRDEEFPFEP 735


>gnl|CDD|219686 pfam07992, Pyr_redox_2, Pyridine nucleotide-disulphide
           oxidoreductase.  This family includes both class I and
           class II oxidoreductases and also NADH oxidases and
           peroxidases. This domain is actually a small NADH
           binding domain within a larger FAD binding domain.
          Length = 283

 Score = 46.2 bits (110), Expect = 3e-05
 Identities = 21/80 (26%), Positives = 32/80 (40%), Gaps = 3/80 (3%)

Query: 435 DIIIVGASAAGCVLANRLSEVSSLKVLLIEAGGDTPIHSRIPGMSSVLSLSEFDHAYLAE 494
           D++I+G   AG   A RL+    LKV LIE  G T  +     +   L L   +   LA 
Sbjct: 1   DVVIIGGGPAGLAAAIRLAR-LGLKVALIEREGGTCYNR--GCLPKKLLLEVAEGLELAI 57

Query: 495 PSQFAGLGVRNARIKITAGK 514
                    +   +++  G 
Sbjct: 58  GLALPEEVYKEFGVEVLLGT 77


>gnl|CDD|235787 PRK06370, PRK06370, mercuric reductase; Validated.
          Length = 463

 Score = 44.4 bits (106), Expect = 1e-04
 Identities = 16/40 (40%), Positives = 21/40 (52%), Gaps = 4/40 (10%)

Query: 432 DCFDIIIVGASAAGCVLANRLSEVSSLKVLLIEA---GGD 468
             +D I++GA  AG  LA R +    +KV LIE    GG 
Sbjct: 4   QRYDAIVIGAGQAGPPLAARAAG-LGMKVALIERGLLGGT 42


>gnl|CDD|223717 COG0644, FixC, Dehydrogenases (flavoproteins) [Energy production
           and conversion].
          Length = 396

 Score = 44.0 bits (104), Expect = 2e-04
 Identities = 19/57 (33%), Positives = 29/57 (50%), Gaps = 5/57 (8%)

Query: 434 FDIIIVGASAAGCVLANRLSEVSSLKVLLIEAGGD--TPIHSRIPGMSSVLSLSEFD 488
           +D++IVGA  AG   A RL++ + L VL++E G +          G  S  +L E  
Sbjct: 4   YDVVIVGAGPAGSSAARRLAK-AGLDVLVLEKGSEPGAKPCC--GGGLSPRALEELI 57


>gnl|CDD|218770 pfam05834, Lycopene_cycl, Lycopene cyclase protein.  This family
           consists of lycopene beta and epsilon cyclase proteins.
           Carotenoids with cyclic end groups are essential
           components of the photosynthetic membranes in all
           plants, algae, and cyanobacteria. These lipid-soluble
           compounds protect against photo-oxidation, harvest light
           for photosynthesis, and dissipate excess light energy
           absorbed by the antenna pigments. The cyclisation of
           lycopene (psi, psi-carotene) is a key branch point in
           the pathway of carotenoid biosynthesis. Two types of
           cyclic end groups are found in higher plant carotenoids:
           the beta and epsilon rings. Carotenoids with two beta
           rings are ubiquitous, and those with one beta and one
           epsilon ring are common; however, carotenoids with two
           epsilon rings are rare.
          Length = 374

 Score = 40.4 bits (95), Expect = 0.003
 Identities = 19/37 (51%), Positives = 25/37 (67%), Gaps = 1/37 (2%)

Query: 435 DIIIVGASAAGCVLANRLSE-VSSLKVLLIEAGGDTP 470
           D++IVGA  AG +LA RL +    L+VLLI+AG   P
Sbjct: 1   DLVIVGAGLAGLLLALRLRQARPGLRVLLIDAGPGPP 37


>gnl|CDD|240420 PTZ00441, PTZ00441, sporozoite surface protein 2 (SSP2);
           Provisional.
          Length = 576

 Score = 40.3 bits (94), Expect = 0.003
 Identities = 29/118 (24%), Positives = 45/118 (38%), Gaps = 13/118 (11%)

Query: 295 PENPASPFNPYNPEPSIFDSHNIYSFFSPHNPYSPFNPYNIDSPFNSDN-PDSPFRNYPQ 353
           P+NP  P  P  P      + N  + F P +   P       +P N     +S F +  +
Sbjct: 323 PDNPQDPVPP--PNEGKDGNPNEENLFPPGDDEVPDESNVPPNPPNVPGGSNSEFSSDVE 380

Query: 354 YSRYNPKNPESPFSPQNEDSIYNPENPNSPYN-PFN-----ENSLYHPGNPDSPYDWQ 405
                P NP +P  P+ E +I    N   P + P       +N+   P  P++  D Q
Sbjct: 381 N----PPNPPNPDIPEQEPNIPEDSNKEVPEDVPMEPEDDRDNNFNEPKKPENKGDGQ 434



 Score = 35.3 bits (81), Expect = 0.14
 Identities = 34/175 (19%), Positives = 47/175 (26%), Gaps = 35/175 (20%)

Query: 244 NPVETELIDQYSTSVYNNPDSPLSINSPHSVLNPYNLDSILNPFNVKSPLSPENPASPFN 303
           NP E  L       V +  + P +  +     N      + NP N  +P  PE   +   
Sbjct: 340 NPNEENLFPPGDDEVPDESNVPPNPPNVPGGSNSEFSSDVENPPNPPNPDIPEQEPNIPE 399

Query: 304 PYNPE-PSIFDSHNIYSFFSPHNPYSPFNPYNIDSPFNSDNPDSPFRNYPQYSRYNPKNP 362
             N E P             P  P    +  N + P   +N                  P
Sbjct: 400 DSNKEVPEDV----------PMEPEDDRDN-NFNEPKKPEN-----------KGDGQNEP 437

Query: 363 ESPFSPQNEDSIYNPENPNSPYNPFNENSLYHPGNPDSPYDWQRLEKERYNIYRS 417
             P    NE    N             N   +PGN  +  D       R N  R+
Sbjct: 438 VIPKPLDNERDQSNK------------NKQVNPGNRHNSEDRYTRPHGRNNENRN 480



 Score = 29.2 bits (65), Expect = 8.3
 Identities = 32/175 (18%), Positives = 60/175 (34%), Gaps = 22/175 (12%)

Query: 58  FDIIPYSNTDDIYNPFSPINQYSPYSPYNTDSLFDNTHPESLLNPYNPESLLSPLNPRSV 117
            ++ P  + +       P N  +     N++   D  +P +  NP  PE   +       
Sbjct: 344 ENLFPPGDDEVPDESNVPPNPPNVPGGSNSEFSSDVENPPNPPNPDIPEQEPNIPED--- 400

Query: 118 FYPSNPEVPSFDYNNPNSPFNPYNPSSVFNPKNIDSVFNPRNPDSPYYNQYQFLYSPLSP 177
              SN EVP       + P  P +        N +    P+ P++    Q + +  P   
Sbjct: 401 ---SNKEVPE------DVPMEPEDDRD----NNFNE---PKKPENKGDGQNEPV-IPKPL 443

Query: 178 YNPRSLFSSQNPLSPLSPLNANSPYNPRNQISYYNIENPKSPYNIHNPESPLYSD 232
            N R   +    ++P +  N+   Y   +  +  N        +I  P+ P  S+
Sbjct: 444 DNERDQSNKNKQVNPGNRHNSEDRYTRPHGRNNENRNYNNKNSDI--PKHPERSE 496


>gnl|CDD|233671 TIGR01988, Ubi-OHases, Ubiquinone biosynthesis hydroxylase,
           UbiH/UbiF/VisC/COQ6 family.  This model represents a
           family of FAD-dependent hydroxylases (monooxygenases)
           which are all believed to act in the aerobic ubiquinone
           biosynthesis pathway. A separate set of hydroxylases, as
           yet undiscovered, are believed to be active under
           anaerobic conditions. In E. coli three enzyme activities
           have been described, UbiB (which acts first at position
           6, see TIGR01982), UbiH (which acts at position 4,) and
           UbiF (which acts at position 5,). UbiH and UbiF are
           similar to one another and form the basis of this
           subfamily. Interestingly, E. coli contains another
           hydroxylase gene, called visC, that is highly similar to
           UbiF, adjacent to UbiH and, when mutated, results in a
           phenotype similar to that of UbiH (which has also been
           named visB). Several other species appear to have three
           homologs in this family, although they assort themselves
           differently on phylogenetic trees (e.g. Xylella and
           Mesorhizobium) making it difficult to ascribe a specific
           activity to each one. Eukaryotes appear to have only a
           single homolog in this subfamily (COQ6,) which
           complements UbiH, but also possess a non-orthologous
           gene, COQ7 which complements UbiF [Biosynthesis of
           cofactors, prosthetic groups, and carriers, Menaquinone
           and ubiquinone].
          Length = 387

 Score = 39.9 bits (94), Expect = 0.003
 Identities = 16/32 (50%), Positives = 18/32 (56%)

Query: 435 DIIIVGASAAGCVLANRLSEVSSLKVLLIEAG 466
           DI+IVG    G  LA  L+    LKV LIEA 
Sbjct: 1   DIVIVGGGPVGLALALALARSGGLKVALIEAT 32


>gnl|CDD|223727 COG0654, UbiH, 2-polyprenyl-6-methoxyphenol hydroxylase and related
           FAD-dependent oxidoreductases [Coenzyme metabolism /
           Energy production and conversion].
          Length = 387

 Score = 39.7 bits (93), Expect = 0.004
 Identities = 14/32 (43%), Positives = 18/32 (56%), Gaps = 1/32 (3%)

Query: 434 FDIIIVGASAAGCVLANRLSEVSSLKVLLIEA 465
            D+ IVGA  AG  LA  L+  + L V L+E 
Sbjct: 3   LDVAIVGAGPAGLALALALAR-AGLDVTLLER 33


>gnl|CDD|205628 pfam13450, NAD_binding_8, NAD(P)-binding Rossmann-like domain. 
          Length = 66

 Score = 35.5 bits (83), Expect = 0.006
 Identities = 17/46 (36%), Positives = 24/46 (52%), Gaps = 8/46 (17%)

Query: 438 IVGASAAGCVLANRLSEVSSLKVLLIEA----GGDTPIHS-RIPGM 478
           IVGA  +G V A  L++     VL++E     GG+   +S R PG 
Sbjct: 1   IVGAGLSGLVAAYLLAK-RGKDVLVLEKRDRIGGN--AYSERDPGY 43


>gnl|CDD|224169 COG1249, Lpd, Pyruvate/2-oxoglutarate dehydrogenase complex,
           dihydrolipoamide dehydrogenase (E3) component, and
           related enzymes [Energy production and conversion].
          Length = 454

 Score = 39.1 bits (92), Expect = 0.007
 Identities = 14/34 (41%), Positives = 22/34 (64%), Gaps = 1/34 (2%)

Query: 434 FDIIIVGASAAGCVLANRLSEVSSLKVLLIEAGG 467
           +D++++GA  AG V A R ++   LKV L+E G 
Sbjct: 5   YDVVVIGAGPAGYVAAIRAAQ-LGLKVALVEKGE 37


>gnl|CDD|240384 PTZ00367, PTZ00367, squalene epoxidase; Provisional.
          Length = 567

 Score = 39.1 bits (91), Expect = 0.009
 Identities = 17/31 (54%), Positives = 22/31 (70%), Gaps = 1/31 (3%)

Query: 434 FDIIIVGASAAGCVLANRLSEVSSLKVLLIE 464
           +D+IIVG S AG VLA  LS+    KVL++E
Sbjct: 34  YDVIIVGGSIAGPVLAKALSKQGR-KVLMLE 63


>gnl|CDD|233693 TIGR02032, GG-red-SF, geranylgeranyl reductase family.  This model
           represents a subfamily which includes geranylgeranyl
           reductases involved in chlorophyll and
           bacteriochlorophyll biosynthesis as well as other
           related enzymes which may also act on geranylgeranyl
           groups or related substrates [Biosynthesis of cofactors,
           prosthetic groups, and carriers, Chlorophyll and
           bacteriochlorphyll].
          Length = 295

 Score = 37.7 bits (88), Expect = 0.015
 Identities = 14/31 (45%), Positives = 22/31 (70%), Gaps = 1/31 (3%)

Query: 434 FDIIIVGASAAGCVLANRLSEVSSLKVLLIE 464
           +D+++VGA  AG   A RL++   L+VLL+E
Sbjct: 1   YDVVVVGAGPAGASAAYRLAD-KGLRVLLLE 30


>gnl|CDD|233669 TIGR01984, UbiH, 2-polyprenyl-6-methoxyphenol 4-hydroxylase.  This
           model represents the FAD-dependent monoxygenase
           responsible for the second hydroxylation step in the
           aerobic ubiquinone bioynthetic pathway. The scope of
           this model is limited to the proteobacteria. This family
           is closely related to the UbiF hydroxylase which
           catalyzes the final hydroxylation step. The enzyme has
           also been named VisB due to a mutant VISible light
           sensitive phenotype [Biosynthesis of cofactors,
           prosthetic groups, and carriers, Menaquinone and
           ubiquinone].
          Length = 382

 Score = 36.9 bits (86), Expect = 0.029
 Identities = 15/31 (48%), Positives = 19/31 (61%)

Query: 435 DIIIVGASAAGCVLANRLSEVSSLKVLLIEA 465
           D+IIVG    G  LA  LS +  +K+ LIEA
Sbjct: 1   DVIIVGGGLVGLSLALALSRLGKIKIALIEA 31


>gnl|CDD|233583 TIGR01813, flavo_cyto_c, flavocytochrome c.  This model describes a
           family of redox proteins related to the succinate
           dehydrogenases and fumarate reductases of E. coli,
           mitochondria, and other well-characterized systems. A
           member of this family from Shewanella frigidimarina
           NCIMB400 is characterized as a water-soluble periplasmic
           protein with four heme groups, a non-covalently bound
           FAD, and essentially unidirectional fumarate reductase
           activity. At least seven distinct members of this family
           are found in Shewanella oneidensis, a species able to
           use a wide variety of pathways for respiraton [Energy
           metabolism, Electron transport].
          Length = 439

 Score = 36.9 bits (86), Expect = 0.036
 Identities = 17/50 (34%), Positives = 26/50 (52%), Gaps = 2/50 (4%)

Query: 655 RSKVTKVIINDQNVATGVEYVNSKGETVRVTANKEVILTAGAIA-NAQLL 703
            SKV  +I +DQ    GV  V  KG+ + + A K V+L  G    N +++
Sbjct: 150 NSKVEDLIQDDQGTVVGVV-VKGKGKGIYIKAAKAVVLATGGFGSNKEMI 198



 Score = 30.0 bits (68), Expect = 4.8
 Identities = 12/43 (27%), Positives = 23/43 (53%), Gaps = 4/43 (9%)

Query: 435 DIIIVGASAAGCVLANRLSEVSSLKVLLIE----AGGDTPIHS 473
           D+++VG+  AG   A    +  +  V+L+E     GG++ I +
Sbjct: 1   DVVVVGSGFAGLSAALSAKKAGAANVVLLEKMPVIGGNSAIAA 43


>gnl|CDD|216400 pfam01266, DAO, FAD dependent oxidoreductase.  This family includes
           various FAD dependent oxidoreductases:
           Glycerol-3-phosphate dehydrogenase EC:1.1.99.5,
           Sarcosine oxidase beta subunit EC:1.5.3.1, D-alanine
           oxidase EC:1.4.99.1, D-aspartate oxidase EC:1.4.3.1.
          Length = 234

 Score = 36.1 bits (84), Expect = 0.039
 Identities = 10/33 (30%), Positives = 16/33 (48%), Gaps = 1/33 (3%)

Query: 435 DIIIVGASAAGCVLANRLSEVSSLKVLLIEAGG 467
           D++++G    G   A  L+    L V L+E G 
Sbjct: 1   DVVVIGGGIVGLSTAYELAR-RGLSVTLLERGD 32


>gnl|CDD|216176 pfam00890, FAD_binding_2, FAD binding domain.  This family includes
           members that bind FAD. This family includes the
           flavoprotein subunits from succinate and fumarate
           dehydrogenase, aspartate oxidase and the alpha subunit
           of adenylylsulphate reductase.
          Length = 401

 Score = 36.5 bits (85), Expect = 0.048
 Identities = 11/40 (27%), Positives = 21/40 (52%), Gaps = 1/40 (2%)

Query: 435 DIIIVGASAAGCVLANRLSEVSSLKVLLIEAGGDTPIHSR 474
           D++++G+  AG   A   +E + LKV ++E G      + 
Sbjct: 1   DVVVIGSGLAGLAAALEAAE-AGLKVAVVEKGQPFGGATA 39



 Score = 34.2 bits (79), Expect = 0.21
 Identities = 19/57 (33%), Positives = 25/57 (43%), Gaps = 3/57 (5%)

Query: 655 RSKVTKVIINDQNVATGVEYVN-SKGETVRVTANKEVILTAGAI-ANAQLLLLSGIG 709
           R     +I+ D    TGV   N   G  VR+ A   V+L  G    NA+LLL +   
Sbjct: 160 RHPADDLIVEDG-RVTGVVVENRRNGREVRIRAIAGVLLATGGFGRNAELLLPAAGY 215


>gnl|CDD|236001 PRK07364, PRK07364, 2-octaprenyl-6-methoxyphenyl hydroxylase;
           Validated.
          Length = 415

 Score = 36.2 bits (84), Expect = 0.054
 Identities = 14/37 (37%), Positives = 19/37 (51%), Gaps = 1/37 (2%)

Query: 429 KSGDCFDIIIVGASAAGCVLANRLSEVSSLKVLLIEA 465
                +D+ IVG    G  LA  L + S L++ LIEA
Sbjct: 14  TRSLTYDVAIVGGGIVGLTLAAALKD-SGLRIALIEA 49


>gnl|CDD|235774 PRK06292, PRK06292, dihydrolipoamide dehydrogenase; Validated.
          Length = 460

 Score = 35.9 bits (84), Expect = 0.067
 Identities = 15/33 (45%), Positives = 21/33 (63%), Gaps = 1/33 (3%)

Query: 434 FDIIIVGASAAGCVLANRLSEVSSLKVLLIEAG 466
           +D+I++GA  AG V A R ++    KV LIE G
Sbjct: 4   YDVIVIGAGPAGYVAARRAAK-LGKKVALIEKG 35


>gnl|CDD|178267 PLN02661, PLN02661, Putative thiazole synthesis.
          Length = 357

 Score = 36.0 bits (83), Expect = 0.069
 Identities = 23/74 (31%), Positives = 35/74 (47%), Gaps = 13/74 (17%)

Query: 398 PDSPYDWQRLEKERYNIYRSVI-------YPPDMTPYVKSGDCFDIIIVGASAAGCVLAN 450
             +PYD    +     I  S++       Y  DM  Y  +    D++IVGA +AG   A 
Sbjct: 56  STAPYDLNNFKFA--PIKESIVSREMTRRYMTDMITYADT----DVVIVGAGSAGLSCAY 109

Query: 451 RLSEVSSLKVLLIE 464
            LS+  ++KV +IE
Sbjct: 110 ELSKNPNVKVAIIE 123


>gnl|CDD|216530 pfam01494, FAD_binding_3, FAD binding domain.  This domain is
           involved in FAD binding in a number of enzymes.
          Length = 349

 Score = 35.0 bits (81), Expect = 0.12
 Identities = 18/81 (22%), Positives = 37/81 (45%), Gaps = 4/81 (4%)

Query: 435 DIIIVGASAAGCVLANRLSEVSSLKVLLIEAGGDTPIHSRIPGMSSVLSLSEFDHAYLAE 494
           D++IVG   AG +LA  L+  + ++V+L+E    T +  R  G++   ++     A L +
Sbjct: 3   DVLIVGGGPAGLMLALLLAR-AGVRVVLVERHATTSVLPRAGGLNQR-TMELLRQAGLED 60

Query: 495 PSQFAGLGVRNARIKITAGKG 515
             +    G  +  + +     
Sbjct: 61  --RILAEGAPHEGMGLAFYNT 79


>gnl|CDD|236385 PRK09126, PRK09126, hypothetical protein; Provisional.
          Length = 392

 Score = 34.9 bits (81), Expect = 0.16
 Identities = 16/33 (48%), Positives = 19/33 (57%), Gaps = 1/33 (3%)

Query: 432 DCFDIIIVGASAAGCVLANRLSEVSSLKVLLIE 464
              DI++VGA  AG   A  L+  S LKV LIE
Sbjct: 2   MHSDIVVVGAGPAGLSFARSLAG-SGLKVTLIE 33


>gnl|CDD|224550 COG1635, THI4, Ribulose 1,5-bisphosphate synthetase, converts PRPP
           to RuBP, flavoprotein [Carbohydrate    transport and
           metabolism].
          Length = 262

 Score = 34.2 bits (79), Expect = 0.18
 Identities = 15/37 (40%), Positives = 21/37 (56%), Gaps = 5/37 (13%)

Query: 435 DIIIVGASAAGCVLANRLSEVSSLKVLLIEA----GG 467
           D+IIVGA  +G   A  L++ + LKV + E     GG
Sbjct: 32  DVIIVGAGPSGLTAAYYLAK-AGLKVAIFERKLSFGG 67


>gnl|CDD|223981 COG1053, SdhA, Succinate dehydrogenase/fumarate reductase,
           flavoprotein subunit [Energy production and conversion].
          Length = 562

 Score = 34.6 bits (80), Expect = 0.19
 Identities = 13/40 (32%), Positives = 20/40 (50%), Gaps = 5/40 (12%)

Query: 434 FDIIIVGASAAGCVLANRLSEVSSLKVLLIE----AGGDT 469
           FD++++G   AG   A   +E + LKV L+       G T
Sbjct: 7   FDVVVIGGGGAGLRAAIEAAE-AGLKVALLSKAPPKRGHT 45



 Score = 33.1 bits (76), Expect = 0.53
 Identities = 15/81 (18%), Positives = 26/81 (32%), Gaps = 14/81 (17%)

Query: 618 VGFAELPGMTRYGLRFSAADAYLTPIAGKRTNLYVLKRSKVTKVIINDQNVATGVEYVNS 677
            G   L  +    L+FS               + +     V  ++++D     GV   + 
Sbjct: 136 TGHELLHTLYEQLLKFS--------------GIEIFDEYFVLDLLVDDGGGVAGVVARDL 181

Query: 678 KGETVRVTANKEVILTAGAIA 698
           +   + V   K VIL  G   
Sbjct: 182 RTGELYVFRAKAVILATGGAG 202


>gnl|CDD|182194 PRK10015, PRK10015, oxidoreductase; Provisional.
          Length = 429

 Score = 34.6 bits (79), Expect = 0.21
 Identities = 18/37 (48%), Positives = 23/37 (62%), Gaps = 1/37 (2%)

Query: 430 SGDCFDIIIVGASAAGCVLANRLSEVSSLKVLLIEAG 466
           S D FD I+VGA  AG V A  ++  + L VL+IE G
Sbjct: 2   SDDKFDAIVVGAGVAGSVAALVMAR-AGLDVLVIERG 37


>gnl|CDD|182273 PRK10157, PRK10157, putative oxidoreductase FixC; Provisional.
          Length = 428

 Score = 34.1 bits (78), Expect = 0.29
 Identities = 19/37 (51%), Positives = 23/37 (62%), Gaps = 1/37 (2%)

Query: 430 SGDCFDIIIVGASAAGCVLANRLSEVSSLKVLLIEAG 466
           S D FD IIVGA  AG V A  L+   + +VL+IE G
Sbjct: 2   SEDIFDAIIVGAGLAGSVAALVLAREGA-QVLVIERG 37


>gnl|CDD|223108 COG0029, NadB, Aspartate oxidase [Coenzyme metabolism].
          Length = 518

 Score = 34.2 bits (79), Expect = 0.29
 Identities = 17/56 (30%), Positives = 25/56 (44%), Gaps = 1/56 (1%)

Query: 644 AGKRTNLYVLKRSKVTKVIINDQNVATGVEYVNSKGETVRVTANKEVILTAGAIAN 699
              R N+ VL+ ++   +II D     GV  +N  GE     A K V+L  G +  
Sbjct: 143 VRNRPNITVLEGAEALDLIIEDGIGVAGVLVLNRNGELGTFRA-KAVVLATGGLGG 197



 Score = 33.4 bits (77), Expect = 0.40
 Identities = 9/35 (25%), Positives = 18/35 (51%), Gaps = 2/35 (5%)

Query: 429 KSGDCFDIIIVGASAAGCVLANRLSEVSSLKVLLI 463
              +  D++I+G+  AG   A  L+   S +V ++
Sbjct: 3   TDFEHPDVLIIGSGLAGLTAALSLAP--SFRVTVL 35


>gnl|CDD|223652 COG0579, COG0579, Predicted dehydrogenase [General function
           prediction only].
          Length = 429

 Score = 33.8 bits (78), Expect = 0.31
 Identities = 12/32 (37%), Positives = 17/32 (53%), Gaps = 1/32 (3%)

Query: 434 FDIIIVGASAAGCVLANRLSEVS-SLKVLLIE 464
           +D++I+G    G   A  LSE    L V L+E
Sbjct: 4   YDVVIIGGGIMGAATAYELSEYEPDLSVALLE 35


>gnl|CDD|130849 TIGR01789, lycopene_cycl, lycopene cyclase.  This model represents
           a family of bacterial lycopene cyclases catalyzing the
           transformation of lycopene to carotene. These enzymes
           are found in a limited spectrum of alpha and gamma
           proteobacteria as well as Flavobacterium.
          Length = 370

 Score = 33.7 bits (77), Expect = 0.33
 Identities = 27/84 (32%), Positives = 38/84 (45%), Gaps = 7/84 (8%)

Query: 435 DIIIVGASAAGCVLANRLSEVS-SLKVLLIEAGGDTPIHSRIPGMSSVLSLSEFDHAYLA 493
           D IIVG   AG ++A RL       ++ +IEAG     +       S   LS+  HA+LA
Sbjct: 1   DCIIVGGGLAGGLIALRLQRARPDFRIRVIEAGRTIGGNHTWSFFDS--DLSDAQHAWLA 58

Query: 494 EPSQ--FAGLGVRNA--RIKITAG 513
           +  Q  + G  VR    R K+   
Sbjct: 59  DLVQTDWPGYEVRFPKYRRKLKTA 82


>gnl|CDD|224992 COG2081, COG2081, Predicted flavoproteins [General function
           prediction only].
          Length = 408

 Score = 33.7 bits (78), Expect = 0.33
 Identities = 13/33 (39%), Positives = 20/33 (60%), Gaps = 1/33 (3%)

Query: 434 FDIIIVGASAAGCVLANRLSEVSSLKVLLIEAG 466
           FD+II+G   AG + A   ++    +VLLI+ G
Sbjct: 4   FDVIIIGGGPAGLMAAISAAKAGR-RVLLIDKG 35


>gnl|CDD|145231 pfam01946, Thi4, Thi4 family.  This family includes a putative
           thiamine biosynthetic enzyme.
          Length = 229

 Score = 33.2 bits (76), Expect = 0.33
 Identities = 17/46 (36%), Positives = 25/46 (54%), Gaps = 5/46 (10%)

Query: 420 YPPDMTPYVKSGDCFDIIIVGASAAGCVLANRLSEVSSLKVLLIEA 465
           Y  D+  Y +S    D++IVGA  +G   A  L++   LKV +IE 
Sbjct: 8   YFEDLLDYAES----DVVIVGAGPSGLTAAYYLAK-KGLKVAIIER 48


>gnl|CDD|232791 TIGR00031, UDP-GALP_mutase, UDP-galactopyranose mutase.  This
           enzyme is involved in the conversion of UDP-GALP into
           UDP-GALF through a 2-keto intermediate. It contains FAD
           as a cofactor. The gene is known as glf, ceoA, and rfbD.
           It is known experimentally in E. coli, Mycobacterium
           tuberculosis, and Klebsiella pneumoniae [Cell envelope,
           Biosynthesis and degradation of surface polysaccharides
           and lipopolysaccharides].
          Length = 377

 Score = 33.6 bits (77), Expect = 0.35
 Identities = 16/31 (51%), Positives = 24/31 (77%), Gaps = 1/31 (3%)

Query: 434 FDIIIVGASAAGCVLANRLSEVSSLKVLLIE 464
           FD IIVGA  +G VLAN L++++  +VL++E
Sbjct: 2   FDYIIVGAGLSGIVLANILAQLNK-RVLVVE 31


>gnl|CDD|235584 PRK05732, PRK05732, 2-octaprenyl-6-methoxyphenyl hydroxylase;
           Validated.
          Length = 395

 Score = 33.7 bits (78), Expect = 0.37
 Identities = 21/46 (45%), Positives = 23/46 (50%), Gaps = 4/46 (8%)

Query: 434 FDIIIVGASAAGCVLANRLSEVS--SLKVLLIEAGGDTPIHSRIPG 477
            D+IIVG   AG  LA  LS +S   L V LIEA    P     PG
Sbjct: 4   MDVIIVGGGMAGATLALALSRLSHGGLPVALIEA--FAPESDAHPG 47


>gnl|CDD|168201 PRK05714, PRK05714, 2-octaprenyl-3-methyl-6-methoxy-1,4-benzoquinol
           hydroxylase; Provisional.
          Length = 405

 Score = 33.3 bits (76), Expect = 0.42
 Identities = 18/54 (33%), Positives = 27/54 (50%), Gaps = 9/54 (16%)

Query: 435 DIIIVGASAAGCVLANRLSEVSSLKVLLIEAG--------GDTPIHSRIPGMSS 480
           D++IVGA   G  LA  L + S L+VLL++ G           P   R+  +S+
Sbjct: 4   DLLIVGAGMVGSALALAL-QGSGLEVLLLDGGPLSVKPFDPQAPFEPRVSALSA 56


>gnl|CDD|217590 pfam03486, HI0933_like, HI0933-like protein. 
          Length = 405

 Score = 33.3 bits (77), Expect = 0.44
 Identities = 12/34 (35%), Positives = 21/34 (61%), Gaps = 1/34 (2%)

Query: 434 FDIIIVGASAAGCVLANRLSEVSSLKVLLIEAGG 467
           +D+I++G  AAG + A   ++    +VLLI+ G 
Sbjct: 1   YDVIVIGGGAAGLMAAISAAK-RGRRVLLIDKGK 33


>gnl|CDD|130850 TIGR01790, carotene-cycl, lycopene cyclase family protein.  This
           family includes lycopene beta and epsilion cyclases
           (which form beta and delta carotene, respectively) from
           bacteria and plants as well as the plant
           capsanthin/capsorubin and neoxanthin cyclases which
           appear to have evolved from the plant lycopene cyclases.
           The plant lycopene epsilon cyclases also transform
           neurosporene to alpha zeacarotene.
          Length = 388

 Score = 33.2 bits (76), Expect = 0.45
 Identities = 12/36 (33%), Positives = 18/36 (50%), Gaps = 1/36 (2%)

Query: 435 DIIIVGASAAGCVLANRLSEVSSLKVLLIEAGGDTP 470
           D+ ++G   AG  +A  L+    L+V LIE     P
Sbjct: 1   DLAVIGGGPAGLAIALELAR-PGLRVQLIEPHPPIP 35


>gnl|CDD|218191 pfam04652, DUF605, Vta1 like.  Vta1 (VPS20-associated protein 1) is
           a positive regulator of Vps4. Vps4 is an ATPase that is
           required in the multivesicular body (MVB) sorting
           pathway to dissociate the endosomal sorting complex
           required for transport (ESCRT). Vta1 promotes correct
           assembly of Vps4 and stimulates its ATPase activity
           through its conserved Vta1/SBP1/LIP5 region.
          Length = 315

 Score = 33.1 bits (76), Expect = 0.45
 Identities = 20/117 (17%), Positives = 36/117 (30%), Gaps = 5/117 (4%)

Query: 290 KSPLSPENPASPFNPYNPEPSIF--DSHNIYSFFSPHNPYSPFNPYNIDSPFNSDNPDSP 347
            + ++  N  + F   + +P+          S   P  P  P +P +         P S 
Sbjct: 160 DADVATTNSDNSFPGEDADPASASPSDPPSSSPGVPSFPSPPEDPSSPSDSSLPPAPSSF 219

Query: 348 FRNYPQYSRYNPKNPESPFSPQNEDSIY---NPENPNSPYNPFNENSLYHPGNPDSP 401
             + P  S  +P NP  P  P           P    +   P + ++   P    + 
Sbjct: 220 QSDTPPPSPESPTNPSPPPGPAAPPPPPVQQVPPLSTAKPTPPSASATPAPIGGITL 276


>gnl|CDD|235241 PRK04176, PRK04176, ribulose-1,5-biphosphate synthetase;
           Provisional.
          Length = 257

 Score = 32.9 bits (76), Expect = 0.51
 Identities = 12/30 (40%), Positives = 18/30 (60%), Gaps = 1/30 (3%)

Query: 435 DIIIVGASAAGCVLANRLSEVSSLKVLLIE 464
           D+ IVGA  +G   A  L++ + LKV + E
Sbjct: 27  DVAIVGAGPSGLTAAYYLAK-AGLKVAVFE 55


>gnl|CDD|181057 PRK07608, PRK07608, ubiquinone biosynthesis hydroxylase family
           protein; Provisional.
          Length = 388

 Score = 33.0 bits (76), Expect = 0.57
 Identities = 16/52 (30%), Positives = 25/52 (48%), Gaps = 6/52 (11%)

Query: 434 FDIIIVGASAAGCVLANRLSEVSSLKVLLIEAG-----GDTPIHSRIPGMSS 480
           FD+++VG    G  LA  L++ S L+V L+         D    SR+  +S 
Sbjct: 6   FDVVVVGGGLVGASLALALAQ-SGLRVALLAPRAPPRPADDAWDSRVYAISP 56


>gnl|CDD|223636 COG0562, Glf, UDP-galactopyranose mutase [Cell envelope biogenesis,
           outer membrane].
          Length = 374

 Score = 32.7 bits (75), Expect = 0.69
 Identities = 12/32 (37%), Positives = 21/32 (65%), Gaps = 1/32 (3%)

Query: 434 FDIIIVGASAAGCVLANRLSEVSSLKVLLIEA 465
           FD +IVGA  +G V+A   +++   +VL++E 
Sbjct: 2   FDYLIVGAGLSGAVIAEVAAQLGK-RVLIVEK 32


>gnl|CDD|185594 PTZ00395, PTZ00395, Sec24-related protein; Provisional.
          Length = 1560

 Score = 33.1 bits (75), Expect = 0.71
 Identities = 37/126 (29%), Positives = 53/126 (42%), Gaps = 12/126 (9%)

Query: 194 SPLNANSPYN--PRNQISYYNIENPKSPYNIHNPESPLYSDSPYY---YRNLPLYNPVET 248
           +P N+N  YN  P +   Y N  N  +PY+     +P YS+ PY    Y N PL N   +
Sbjct: 416 NPGNSNPGYNNAPNSNTPYNNPPNSNTPYSNPPNSNPPYSNLPYSNTPYSNAPLSNAPPS 475

Query: 249 ELIDQYSTSVYNNPDSPLSINSPHSVLNPYNLDSILNPFNVKSPLSPENP--ASPFN--P 304
              D +  S Y+      + N P + L   N     N F+  +  S  NP  + PF   P
Sbjct: 476 SAKDHH--SAYHAAYQHRAANQPAANLPTAN-QPAANNFHGAAGNSVGNPFASRPFGSAP 532

Query: 305 YNPEPS 310
           Y    +
Sbjct: 533 YGGNAA 538



 Score = 31.2 bits (70), Expect = 2.8
 Identities = 29/111 (26%), Positives = 44/111 (39%), Gaps = 15/111 (13%)

Query: 198 ANSPYN--PRNQISYYNIENPKSPYNIHNPESPLYSDSPYYYRNLPLYNPVETELIDQYS 255
           +N+ Y+    +   Y N  N  +PYN     +P  S++P  Y N P  NP  + L   YS
Sbjct: 410 SNAGYSNPGNSNPGYNNAPNSNTPYN-----NPPNSNTP--YSNPPNSNPPYSNL--PYS 460

Query: 256 TSVYNNPD----SPLSINSPHSVLNPYNLDSILNPFNVKSPLSPENPASPF 302
            + Y+N       P S    HS  +        N      P + +  A+ F
Sbjct: 461 NTPYSNAPLSNAPPSSAKDHHSAYHAAYQHRAANQPAANLPTANQPAANNF 511


>gnl|CDD|223523 COG0446, HcaD, Uncharacterized NAD(FAD)-dependent dehydrogenases
           [General function prediction only].
          Length = 415

 Score = 32.6 bits (74), Expect = 0.75
 Identities = 13/42 (30%), Positives = 19/42 (45%), Gaps = 2/42 (4%)

Query: 436 IIIVGASAAGCVLANRL-SEVSSLKVLLIEAGGDTPIHSRIP 476
           I+IVG  AAG   A  L   + + ++ LI        + R P
Sbjct: 1   IVIVGGGAAGLSAATTLRRLLLAAEITLIGR-EPKYSYYRCP 41


>gnl|CDD|233368 TIGR01350, lipoamide_DH, dihydrolipoamide dehydrogenase.  This
           model describes dihydrolipoamide dehydrogenase, a
           flavoprotein that acts in a number of ways. It is the E3
           component of dehydrogenase complexes for pyruvate,
           2-oxoglutarate, 2-oxoisovalerate, and acetoin. It can
           also serve as the L protein of the glycine cleavage
           system. This family includes a few members known to have
           distinct functions (ferric leghemoglobin reductase and
           NADH:ferredoxin oxidoreductase) but that may be
           predicted by homology to act as dihydrolipoamide
           dehydrogenase as well. The motif GGXCXXXGCXP near the
           N-terminus contains a redox-active disulfide.
          Length = 460

 Score = 32.6 bits (75), Expect = 0.84
 Identities = 22/64 (34%), Positives = 31/64 (48%), Gaps = 5/64 (7%)

Query: 646 KRTNLYVLKRSKVTKVIINDQNVATGVEYVNSKGETVRVTANKEVILTAGAIANAQLLLL 705
           K+  + +L  +KVT V  ND      V Y N  GET  +T  K V++  G   N + L L
Sbjct: 222 KKKGVKILTNTKVTAVEKNDD----QVTYENKGGETETLTGEK-VLVAVGRKPNTEGLGL 276

Query: 706 SGIG 709
             +G
Sbjct: 277 EKLG 280



 Score = 31.5 bits (72), Expect = 2.0
 Identities = 14/37 (37%), Positives = 22/37 (59%), Gaps = 4/37 (10%)

Query: 434 FDIIIVGASAAGCVLANRLSEVSSLKVLLIEA---GG 467
           +D+I++G    G V A R +++  LKV L+E    GG
Sbjct: 2   YDVIVIGGGPGGYVAAIRAAQL-GLKVALVEKEYLGG 37


>gnl|CDD|180854 PRK07121, PRK07121, hypothetical protein; Validated.
          Length = 492

 Score = 32.6 bits (75), Expect = 0.90
 Identities = 16/40 (40%), Positives = 24/40 (60%), Gaps = 1/40 (2%)

Query: 656 SKVTKVIINDQNVATGVEYVNSKGETVRVTANKEVILTAG 695
           ++ T++I++D     GVE     GETV + A K V+L AG
Sbjct: 198 TRATRLIVDDDGRVVGVE-ARRYGETVAIRARKGVVLAAG 236


>gnl|CDD|235905 PRK06996, PRK06996, hypothetical protein; Provisional.
          Length = 398

 Score = 32.4 bits (74), Expect = 0.95
 Identities = 16/35 (45%), Positives = 19/35 (54%), Gaps = 3/35 (8%)

Query: 434 FDIIIVGASAAGCVLAN---RLSEVSSLKVLLIEA 465
           FDI IVGA   G  LA    R S   +L + LI+A
Sbjct: 12  FDIAIVGAGPVGLALAGWLARRSATRALSIALIDA 46


>gnl|CDD|100027 cd00396, PurM-like, AIR (aminoimidazole ribonucleotide) synthase
           related protein. This family includes Hydrogen
           expression/formation protein HypE, AIR synthases, FGAM
           (formylglycinamidine ribonucleotide) synthase and
           Selenophosphate synthetase (SelD). The N-terminal domain
           of AIR synthase forms the dimer interface of the
           protein, and is suggested as a putative ATP binding
           domain.
          Length = 222

 Score = 31.6 bits (72), Expect = 1.0
 Identities = 16/67 (23%), Positives = 24/67 (35%), Gaps = 7/67 (10%)

Query: 654 KRSKVTKVIINDQNVATGVEYVNSKGETVRVTANKEVILTAGAIANAQLLLLSGIGPKAH 713
             ++   V+I      TGV+ V        V A  ++  T G +      L    G  A 
Sbjct: 110 SGARPGDVLI-----LTGVDAVLELVAAGDVHAMHDI--TDGGLLGTLPELAQASGVGAE 162

Query: 714 LDEVKIP 720
           +D   IP
Sbjct: 163 IDLEAIP 169


>gnl|CDD|221798 pfam12831, FAD_oxidored, FAD dependent oxidoreductase.  This family
           of proteins contains FAD dependent oxidoreductases and
           related proteins.
          Length = 415

 Score = 32.2 bits (74), Expect = 1.0
 Identities = 16/65 (24%), Positives = 27/65 (41%), Gaps = 15/65 (23%)

Query: 435 DIIIVGASAAGCVLANRLSEVSS----LKVLLIEAGGDTPIHSRIPGMSSVLSLSEFDHA 490
           D+++VG   AG   A     +++     KVLL+E  G       + GM++   +   D  
Sbjct: 1   DVVVVGGGPAGVAAA-----IAAARLGAKVLLVERRG------WLGGMATSGLVGTDDGN 49

Query: 491 YLAEP 495
           Y    
Sbjct: 50  YDRGQ 54


>gnl|CDD|235798 PRK06416, PRK06416, dihydrolipoamide dehydrogenase; Reviewed.
          Length = 462

 Score = 32.0 bits (74), Expect = 1.0
 Identities = 14/37 (37%), Positives = 23/37 (62%), Gaps = 4/37 (10%)

Query: 434 FDIIIVGASAAGCVLANRLSEVSSLKVLLIEA---GG 467
           +D+I++GA   G V A R +++  LKV ++E    GG
Sbjct: 5   YDVIVIGAGPGGYVAAIRAAQL-GLKVAIVEKEKLGG 40


>gnl|CDD|219358 pfam07271, Cytadhesin_P30, Cytadhesin P30/P32.  This family
           consists of several Mycoplasma species specific
           Cytadhesin P32 and P30 proteins. P30 has been found to
           be membrane associated and localised on the tip
           organelle. It is thought that it is important in
           cytadherence and virulence.
          Length = 279

 Score = 32.0 bits (72), Expect = 1.1
 Identities = 24/115 (20%), Positives = 33/115 (28%), Gaps = 17/115 (14%)

Query: 288 NVKSPLSPENPASPFNP-YNPEPSIFDSHNIYSFFSPHNPYSPFNPYNIDSPFNSDNPDS 346
           N +  ++P+    P  P     P        ++   PH P     P  +   FN      
Sbjct: 164 NPQQRINPQRFGFPMQPNMGMRPG-------FNQMPPHMP--GMPPNQMRPGFNPMPGMP 214

Query: 347 PFRNYPQYSRYNPKNPESPFSPQNEDSIYNPENPNSPYNPFNENSLYHPGNPDSP 401
           P   + Q     P      F PQ       P   N P  P   N    PG   + 
Sbjct: 215 PRPGFNQNPNMMPNMNRPGFRPQ-------PGGFNHPGTPMGPNMQQRPGFNPNQ 262


>gnl|CDD|223651 COG0578, GlpA, Glycerol-3-phosphate dehydrogenase [Energy
           production and conversion].
          Length = 532

 Score = 31.9 bits (73), Expect = 1.2
 Identities = 12/33 (36%), Positives = 18/33 (54%), Gaps = 1/33 (3%)

Query: 434 FDIIIVGASAAGCVLANRLSEVSSLKVLLIEAG 466
           FD+I++G    G  +A   +    LKV L+E G
Sbjct: 13  FDVIVIGGGITGAGIARDAA-GRGLKVALVEKG 44


>gnl|CDD|236223 PRK08294, PRK08294, phenol 2-monooxygenase; Provisional.
          Length = 634

 Score = 32.3 bits (74), Expect = 1.2
 Identities = 20/72 (27%), Positives = 28/72 (38%), Gaps = 16/72 (22%)

Query: 393 YHPGNPDSPYDWQRLEKERYNIYRSVIYPPDMTPYVKSGDCFDIIIVGASAAGCVLANRL 452
           Y PG+P               I R    P D+       D  D++IVG   AG  LA +L
Sbjct: 8   YVPGDPRI------QPAAGRGINR----PADL------PDEVDVLIVGCGPAGLTLAAQL 51

Query: 453 SEVSSLKVLLIE 464
           S    +   ++E
Sbjct: 52  SAFPDITTRIVE 63


>gnl|CDD|223737 COG0665, DadA, Glycine/D-amino acid oxidases (deaminating) [Amino
           acid transport and metabolism].
          Length = 387

 Score = 31.8 bits (72), Expect = 1.3
 Identities = 11/34 (32%), Positives = 17/34 (50%), Gaps = 1/34 (2%)

Query: 434 FDIIIVGASAAGCVLANRLSEVSSLKVLLIEAGG 467
            D++I+G    G   A  L+E     V ++EAG 
Sbjct: 5   MDVVIIGGGIVGLSAAYYLAE-RGADVTVLEAGE 37


>gnl|CDD|217301 pfam02956, TT_ORF1, TT viral orf 1.  TT virus (TTV), isolated
           initially from a Japanese patient with hepatitis of
           unknown aetiology, has since been found to infect both
           healthy and diseased individuals and numerous prevalence
           studies have raised questions about its role in
           unexplained hepatitis. ORF1 is a large 750 residue
           protein. The N-terminal half of this protein corresponds
           to the capsid protein.
          Length = 525

 Score = 31.9 bits (73), Expect = 1.3
 Identities = 20/125 (16%), Positives = 32/125 (25%), Gaps = 33/125 (26%)

Query: 127 SFDY-------NNPNSPFNPYNPSSVFNPKNIDSVFNPRNPDSPYYNQYQFLYSPLSPYN 179
              Y       +NP   F   NP   +N    ++++      S        LY  ++ YN
Sbjct: 241 DLRYPFLSPQTDNPCITFQVLNPG-FYN----NALWISTTTTSKGKTLLITLYKTVTYYN 295

Query: 180 PRSLFSSQNPLSPLSPLNANSPYNPRNQISYYNIENPKSPYNIHNPESPLYSDSPYYYRN 239
                                 +    Q+        K+     N  +  Y DSPY    
Sbjct: 296 T---------------------FETPAQLKPKTKNKNKTTNKTPNDSTSKYWDSPYGDNT 334

Query: 240 LPLYN 244
                
Sbjct: 335 YNSLE 339


>gnl|CDD|225325 COG2605, COG2605, Predicted kinase related to galactokinase and
           mevalonate kinase [General function prediction only].
          Length = 333

 Score = 31.6 bits (72), Expect = 1.4
 Identities = 29/113 (25%), Positives = 36/113 (31%), Gaps = 16/113 (14%)

Query: 445 GCVLANRLSEVSSLKVLLIEAGGDTPIHSRIPGMSSVLSLSEFDHAYLAEPS---QFAGL 501
           G VL   + +      + IE G D  I  R      V S  E +H  L   S    F   
Sbjct: 28  GAVLNATIDKYI---YVTIEKGFDDEIRVRYDRTEFVKSYLENEHKPLVVESLKRDFLEF 84

Query: 502 -GVRNARI----KITAGKGLGGSSAVQNILYQ-----RGTSYDYENFAKLGYN 544
            G     +        G GLG SSA    L       +G S      A+  Y 
Sbjct: 85  NGGTPIELHTQSDAPPGSGLGSSSAFVVALLNALHAWKGESLGPYELAREAYE 137


>gnl|CDD|224983 COG2072, TrkA, Predicted flavoprotein involved in K+ transport
           [Inorganic ion transport and metabolism].
          Length = 443

 Score = 31.7 bits (72), Expect = 1.7
 Identities = 20/105 (19%), Positives = 31/105 (29%), Gaps = 9/105 (8%)

Query: 432 DCFDIIIVGASAAGCVLANRLSEVSSLKVLLIEA---GGDTPIHSRIPGMSSVLSLSEFD 488
              D+ I+GA  +G   A  L +      ++ E     G T  ++R PG+          
Sbjct: 7   THTDVAIIGAGQSGLAAAYALKQAGVPDFVIFEKRDDVGGTWRYNRYPGLRLDSPKWLLG 66

Query: 489 HAYL--AEPSQFAGLGVRNARIKITAGKGLGGSSAVQNILYQRGT 531
             +L       FA        IK    K          I +    
Sbjct: 67  FPFLPFRWDEAFAPFAEIKDYIKDYLEK----YGLRFQIRFNTRV 107


>gnl|CDD|131515 TIGR02462, pyranose_ox, pyranose oxidase.  Pyranose oxidase (also
           called glucose 2-oxidase) converts D-glucose and
           molecular oxygen to 2-dehydro-D-glucose and hydrogen
           peroxide. Peroxide production is believed to be
           important to the wood rot fungi in which this enzyme is
           found for lignin degradation.
          Length = 544

 Score = 31.4 bits (71), Expect = 1.9
 Identities = 22/88 (25%), Positives = 35/88 (39%), Gaps = 8/88 (9%)

Query: 637 DAYLTPIAGKRTNLYVLKRSKVTKVIINDQN----VATGVEYVNSKGETVRVTANKEVIL 692
                    +R  L  L   + T+++ N+ N     A  V  + S G+   + A    +L
Sbjct: 218 LQPNDDAPSERFTL--LTNHRCTRLVRNETNESEIEAALVRDLLS-GDRFEIKA-DVYVL 273

Query: 693 TAGAIANAQLLLLSGIGPKAHLDEVKIP 720
             GA+ N Q+L+ SG G     D    P
Sbjct: 274 ACGAVHNPQILVNSGFGQLGRPDPTNPP 301


>gnl|CDD|223566 COG0492, TrxB, Thioredoxin reductase [Posttranslational
           modification, protein turnover, chaperones].
          Length = 305

 Score = 31.1 bits (71), Expect = 2.1
 Identities = 14/40 (35%), Positives = 18/40 (45%), Gaps = 13/40 (32%)

Query: 434 FDIIIVGASAAG------CVLANRLSEVSSLKVLLIEAGG 467
           +D+II+G   AG         A        LKV+LI  GG
Sbjct: 4   YDVIIIGGGPAGLTAAIYAARAG-------LKVVLILEGG 36


>gnl|CDD|236010 PRK07395, PRK07395, L-aspartate oxidase; Provisional.
          Length = 553

 Score = 31.2 bits (71), Expect = 2.2
 Identities = 14/30 (46%), Positives = 19/30 (63%), Gaps = 2/30 (6%)

Query: 434 FDIIIVGASAAGCVLANRLSEVSSLKVLLI 463
           FD+++VG+ AAG  L   L   S L+V LI
Sbjct: 10  FDVLVVGSGAAG--LYAALCLPSHLRVGLI 37


>gnl|CDD|240393 PTZ00383, PTZ00383, malate:quinone oxidoreductase; Provisional.
          Length = 497

 Score = 30.9 bits (70), Expect = 2.5
 Identities = 14/38 (36%), Positives = 21/38 (55%), Gaps = 1/38 (2%)

Query: 428 VKSGDCFDIIIVGASAAGCVLANRLSEVSSLK-VLLIE 464
               D +D++IVG    G  L   LS+ ++LK + LIE
Sbjct: 40  RLGSDVYDVVIVGGGVTGTALFYTLSKFTNLKKIALIE 77


>gnl|CDD|178082 PLN02463, PLN02463, lycopene beta cyclase.
          Length = 447

 Score = 30.8 bits (70), Expect = 2.7
 Identities = 11/35 (31%), Positives = 19/35 (54%), Gaps = 1/35 (2%)

Query: 430 SGDCFDIIIVGASAAGCVLANRLSEVSSLKVLLIE 464
                D+++VG   AG  +A ++SE + L V  I+
Sbjct: 25  KSRVVDLVVVGGGPAGLAVAQQVSE-AGLSVCCID 58


>gnl|CDD|219404 pfam07415, Herpes_LMP2, Gammaherpesvirus latent membrane protein
           (LMP2) protein.  This family consists of several
           Gammaherpesvirus latent membrane protein (LMP2)
           proteins. Epstein-Barr virus is a human Gammaherpesvirus
           that infects and establishes latency in B lymphocytes in
           vivo. The latent membrane protein 2 (LMP2) gene is
           expressed in latently infected B cells and encodes two
           protein isoforms, LMP2A and LMP2B, that are identical
           except for an additional N-terminal 119 aa cytoplasmic
           domain which is present in the LMP2A isoform. LMP2A is
           thought to play a key role in either the establishment
           or the maintenance of latency and/or the reactivation of
           productive infection from the latent state. The
           significance of LMP2B and its role in pathogenesis
           remain unclear.
          Length = 489

 Score = 31.0 bits (70), Expect = 2.8
 Identities = 23/111 (20%), Positives = 40/111 (36%), Gaps = 20/111 (18%)

Query: 292 PLSPENPASPFNPYNPEPSIFDSHNIYSFFSPHNPYSPFNPYNIDSPFNSDNPDSPFRNY 351
           PL    P S   P   E             +P+ P S  + ++   P   ++ D+P    
Sbjct: 8   PLGAGGPRSHGGPDGDEGD----------SNPYYPSSFGSSWDRPGPPVPEDYDAPSHRP 57

Query: 352 PQYSRYNPKNPESPFSP--QNEDSIY-------NPENPNSPYNPFNENSLY 393
           P Y   +  +    + P  Q + S+Y           P  PY+P ++ S +
Sbjct: 58  PPYGG-SNGDRHGGYQPLGQQDPSLYAGLGQNGGGGLPPPPYSPRDQGSEH 107



 Score = 29.1 bits (65), Expect = 9.6
 Identities = 28/91 (30%), Positives = 39/91 (42%), Gaps = 16/91 (17%)

Query: 138 NPYNPSSVFNPKNIDSVFNPRNPDS------PYYNQYQFL---YSPLSPYNPRSLFS--S 186
           NPY PSS  +  +      P + D+      PY          Y PL   +P SL++   
Sbjct: 28  NPYYPSSFGSSWDRPGPPVPEDYDAPSHRPPPYGGSNGDRHGGYQPLGQQDP-SLYAGLG 86

Query: 187 QNPLSPLSPLNANSPYNPRNQISYYNIENPK 217
           QN    L P     PY+PR+Q S +  E P+
Sbjct: 87  QNGGGGLPP----PPYSPRDQGSEHVYEEPR 113


>gnl|CDD|237225 PRK12843, PRK12843, putative FAD-binding dehydrogenase; Reviewed.
          Length = 578

 Score = 30.9 bits (70), Expect = 2.9
 Identities = 21/56 (37%), Positives = 30/56 (53%), Gaps = 11/56 (19%)

Query: 424 MTPYVKS------GDCFDIIIVGASAAGCVLANRLSEVSSLKVLLIEA----GGDT 469
           M+P V           FD+I++GA AAG + A   + ++ LKVLL+E     GG T
Sbjct: 1   MSPVVSELSPERWDAEFDVIVIGAGAAG-MSAALFAAIAGLKVLLVERTEYVGGTT 55


>gnl|CDD|227430 COG5099, COG5099, RNA-binding protein of the Puf family,
           translational repressor [Translation, ribosomal
           structure and biogenesis].
          Length = 777

 Score = 30.9 bits (70), Expect = 2.9
 Identities = 49/301 (16%), Positives = 77/301 (25%), Gaps = 45/301 (14%)

Query: 104 NPESLLSPLNPRSVFYPSNPEVPSFDYNNPNSPFNPYNPSSVFNPKNIDSVFNPRNPDSP 163
           N    LS     S+F  +   +P+ +++N  +  N    S +  P +  S        S 
Sbjct: 130 NSSVTLSSSTASSMFNSNKLPLPNPNHSNSATT-NQSGSSFINTPASSSSQPLTNLVVSS 188

Query: 164 YYNQYQFLYSPLSPYNPRSLFSSQNPLSPLSPLNANSPYNPRNQISYYNIENPKSPYNIH 223
                               F+     S  S   +       N     +  N  S  ++ 
Sbjct: 189 IKRFPYLTSLS-------PFFNYLIDPSSDSATASADTSPSFNPPPNLSPNNLFSTSDLS 241

Query: 224 NPESPLYSDSPYYYRNLPLYNPVETELIDQYSTSVYNNPDSPLSINSPHSVLNPYNLDSI 283
                           LP    VE  +I   S+S          IN   S+         
Sbjct: 242 P---------------LPDTQSVENNIILNSSSS----------INELTSIYGSVPSIRN 276

Query: 284 LNPFNV--KSPLSPE-NPASPFNPYNPEPSIFDSHNIYSFFSPHNPYSPFNPYNIDSPFN 340
           L   N    S L+   +  +       E S   S +  SF       SP N    +    
Sbjct: 277 LRGLNSALVSFLNVSSSSLAFSALNGKEVSPTGSPSTRSFARVLPKSSP-NNLLTEILTT 335

Query: 341 SDNPDSP------FRNYPQYSRYNPKNPESPFSP-QNEDSIYNPENPNSPYNPFNENSLY 393
             NP                + ++  N     +P +N          N  Y   N  S  
Sbjct: 336 GVNPPQSLPSLLNPVFLSTSTGFSLTNLSGYLNPNKNLKKNTLSSLSNLGY-SSNVPSPS 394

Query: 394 H 394
            
Sbjct: 395 S 395


>gnl|CDD|183782 PRK12834, PRK12834, putative FAD-binding dehydrogenase; Reviewed.
          Length = 549

 Score = 30.6 bits (70), Expect = 3.1
 Identities = 13/30 (43%), Positives = 21/30 (70%), Gaps = 1/30 (3%)

Query: 435 DIIIVGASAAGCVLANRLSEVSSLKVLLIE 464
           D+I+VGA  AG V A  L++ +  +VLL++
Sbjct: 6   DVIVVGAGLAGLVAAAELAD-AGKRVLLLD 34


>gnl|CDD|222447 pfam13904, DUF4207, Domain of unknown function (DUF4207).  This
           family is found in eukaryotes; it has several conserved
           tryptophan residues. The function is not known.
          Length = 261

 Score = 30.1 bits (68), Expect = 3.3
 Identities = 18/95 (18%), Positives = 29/95 (30%), Gaps = 7/95 (7%)

Query: 74  SPINQYSPYSPYNTDSLFDNTHPESLLNPYNPESLLSPLNPRSVFYPSNPEVPSFDYNNP 133
           S  +  SP     + S  D T   S  +  +   +   L        S+  VPS   ++ 
Sbjct: 6   STRSLLSPLGNELSPSSSDETEDCSEESTDSWSDMYEGLKDSE---SSSNSVPSLSLSST 62

Query: 134 NSPFNPYNPSSVFNPKNIDSVFNPRNPDSPYYNQY 168
            S        S    +++  V   R     Y N  
Sbjct: 63  ASS----LSDSSTYSRSLKEVKLERQAQEAYENWL 93


>gnl|CDD|224154 COG1233, COG1233, Phytoene dehydrogenase and related proteins
           [Secondary metabolites biosynthesis, transport, and
           catabolism].
          Length = 487

 Score = 30.5 bits (69), Expect = 3.5
 Identities = 14/54 (25%), Positives = 26/54 (48%), Gaps = 7/54 (12%)

Query: 434 FDIIIVGASAAGCVLANRLSEVSSLKVLLIEAGGDTPIHSRIPGMSSVLSLSEF 487
           +D++++GA   G   A  L+  + LKV ++E       + R+ G +    L  F
Sbjct: 4   YDVVVIGAGLNGLAAAALLAR-AGLKVTVLEK------NDRVGGRARTFELDGF 50


>gnl|CDD|233672 TIGR01989, COQ6, ubiquinone biosynthesis monooxygenase COQ6.  This
           model represents the monooxygenase responsible for the
           4-hydroxylateion of the phenol ring in the aerobic
           biosynthesis of ubiquinone.
          Length = 437

 Score = 30.5 bits (69), Expect = 3.8
 Identities = 17/40 (42%), Positives = 21/40 (52%), Gaps = 4/40 (10%)

Query: 434 FDIIIVGASAAGCVLANRLSEVS---SLKVLLIEAGGDTP 470
           FD++IVG    G  LA  L        LKVLL++A  D P
Sbjct: 1   FDVVIVGGGPVGLALAAALGNNPLTKDLKVLLLDA-VDNP 39


>gnl|CDD|225307 COG2509, COG2509, Uncharacterized FAD-dependent dehydrogenases
           [General function prediction only].
          Length = 486

 Score = 30.4 bits (69), Expect = 3.9
 Identities = 40/186 (21%), Positives = 62/186 (33%), Gaps = 45/186 (24%)

Query: 434 FDIIIVGASAAGCVLANRLSE----VSSLKVLLIEAGGDTPIHSRIPGMSSVLSLSEFDH 489
            D++IVGA  AG   A  LS     V  LK+ +++ G D  I  R+          E   
Sbjct: 19  LDVVIVGAGPAGLFAAYELSGDARKVPILKIYVVDVGLD--IEQRL------CPKDEKKL 70

Query: 490 AYLAEPSQFAGLGVRNARIKITAGKGLGG--SSAVQNILYQRGTSY-----DYENFAKLG 542
               +               I  G G  G  S  + N+   RG        D + F +L 
Sbjct: 71  EKCPK------CDPCP----IVIGFGGAGLFSDGILNLRPIRGGDVHERTKDTDEFWEL- 119

Query: 543 YNGWGYDETLKYF------VKSEDYRSVIYNESKAVHGTQGYLP-----VGLFKNKENNI 591
                 DE+   F        S+    +   E +A    +  LP     +G     +  +
Sbjct: 120 --VNLVDESNVQFGAPGAGTFSDLTEQIKDIEFRAAGAGEEILPIYQRHIGTDILPK--V 175

Query: 592 IREIFE 597
           ++ I E
Sbjct: 176 VKNIRE 181


>gnl|CDD|236374 PRK09077, PRK09077, L-aspartate oxidase; Provisional.
          Length = 536

 Score = 29.9 bits (68), Expect = 4.9
 Identities = 13/33 (39%), Positives = 21/33 (63%), Gaps = 2/33 (6%)

Query: 434 FDIIIVGASAAGCVLANRLSEVSSLKVLLIEAG 466
            D++I+G+ AAG  LA RL+E    +V ++  G
Sbjct: 9   CDVLIIGSGAAGLSLALRLAE--HRRVAVLSKG 39


>gnl|CDD|234511 TIGR04226, RrgB_K2N_iso_D2, fimbrial isopeptide formation D2
           domain.  The Streptococcus Pneumoniae pilus backbone
           protein, RrgB, has three tandem domains with Lys-to-Asn
           isopeptide bonds, but these three regions are extremely
           divergent in sequence. This model represents the
           homology domain family of the D2 domain. It occurs just
           once in many surface proteins but up to twenty times in
           some pilin subunit proteins. Three of every four members
           have the typical Gram-positive C-terminal motif, LPXTG,
           although in many cases this motif may be involved in
           pilin subunit cross-linking rather than cell wall
           attachment. Proteins with this domain include fimbrial
           proteins with lectin-like adhesion functions, and the
           majority of characterized members are involved in
           surface adhesion to host structures.
          Length = 124

 Score = 28.4 bits (64), Expect = 5.7
 Identities = 19/73 (26%), Positives = 31/73 (42%), Gaps = 9/73 (12%)

Query: 652 VLKRSKVTKVIINDQNVATGVEYVNSKGETVRVT---------ANKEVILTAGAIANAQL 702
            L+     KV ++ + +    +Y  + G+TV VT         A K++ +T  A  N   
Sbjct: 50  GLEYDGSVKVTVDGKTLDVTTDYTVTDGQTVTVTFTDDGLKKLAGKKITVTYTAKVNEGA 109

Query: 703 LLLSGIGPKAHLD 715
           +L  GI   A L 
Sbjct: 110 VLGKGIPNTATLT 122


>gnl|CDD|226103 COG3573, COG3573, Predicted oxidoreductase [General function
           prediction only].
          Length = 552

 Score = 29.8 bits (67), Expect = 5.9
 Identities = 13/34 (38%), Positives = 23/34 (67%), Gaps = 1/34 (2%)

Query: 435 DIIIVGASAAGCVLANRLSEVSSLKVLLIEAGGD 468
           D+I+VGA  AG V A  L++ +  +VL+++  G+
Sbjct: 7   DVIVVGAGLAGLVAAAELAD-AGKRVLILDQEGE 39


>gnl|CDD|199833 cd00216, PQQ_DH_like, PQQ-dependent dehydrogenases and related
           proteins.  This family is composed of dehydrogenases
           with pyrroloquinoline quinone (PQQ) as a cofactor, such
           as ethanol, methanol, and membrane-bound glucose
           dehydrogenases. The alignment model contains an 8-bladed
           beta-propeller, and the family also includes distantly
           related proteins which are not enzymatically active and
           do not bind PQQ.
          Length = 434

 Score = 29.5 bits (66), Expect = 7.5
 Identities = 13/83 (15%), Positives = 25/83 (30%)

Query: 343 NPDSPFRNYPQYSRYNPKNPESPFSPQNEDSIYNPENPNSPYNPFNENSLYHPGNPDSPY 402
               P  N          +  +P   ++  +       +S       N +Y  G   +P+
Sbjct: 159 YLVMPDPNLLPGKDSTVTDRNTPTGDEHTWTSGGGTGWSSAAYDAELNLIYVGGGNPTPW 218

Query: 403 DWQRLEKERYNIYRSVIYPPDMT 425
           +W        N+Y S I   +  
Sbjct: 219 NWGGNRTPGDNLYTSSIVAVNAD 241


>gnl|CDD|227544 COG5219, COG5219, Uncharacterized conserved protein, contains RING
            Zn-finger [General function prediction only].
          Length = 1525

 Score = 29.6 bits (66), Expect = 7.7
 Identities = 24/111 (21%), Positives = 41/111 (36%), Gaps = 11/111 (9%)

Query: 222  IHNPESPLYSDSP--YYYRNLPLYNPVETELIDQYSTSVYNNPDSPLSINSPHSVLNP-- 277
            I  P   L S     Y  R+  +Y P+   +   ++ +VYN   S +S    +S+ +   
Sbjct: 1213 IERPGDHLLSVKKEEYRERHTIVYLPLWRLIYGHFADTVYNMYSSIISQLDMNSLDSEVY 1272

Query: 278  ------YNLDSILNPFNV-KSPLSPENPASPFNPYNPEPSIFDSHNIYSFF 321
                  + L ++L    V  S L   N +  ++           H  Y FF
Sbjct: 1273 VLLRGCFLLSTVLFWDLVDNSLLIVYNISINYSSATDRIRCLAIHIYYQFF 1323


>gnl|CDD|235879 PRK06854, PRK06854, adenylylsulfate reductase subunit alpha;
           Validated.
          Length = 608

 Score = 29.5 bits (67), Expect = 7.9
 Identities = 14/31 (45%), Positives = 19/31 (61%), Gaps = 1/31 (3%)

Query: 435 DIIIVGASAAGCVLANRLSEVSS-LKVLLIE 464
           DI+I+G   AGC  A    E +  LKVL++E
Sbjct: 13  DILIIGGGMAGCGAAFEAKEWAPDLKVLIVE 43


>gnl|CDD|187566 cd05256, UDP_AE_SDR_e, UDP-N-acetylglucosamine 4-epimerase,
           extended (e) SDRs.  This subgroup contains
           UDP-N-acetylglucosamine 4-epimerase of Pseudomonas
           aeruginosa, WbpP,  an extended SDR, that catalyzes the
           NAD+ dependent conversion of UDP-GlcNAc and UDPGalNA to
           UDP-Glc and UDP-Gal.  This subgroup has the
           characteristic active site tetrad and NAD-binding motif
           of the extended SDRs. Extended SDRs are distinct from
           classical SDRs. In addition to the Rossmann fold
           (alpha/beta folding pattern with a central beta-sheet)
           core region typical of all SDRs, extended SDRs have a
           less conserved C-terminal extension of approximately 100
           amino acids. Extended SDRs are a diverse collection of
           proteins, and include isomerases, epimerases,
           oxidoreductases, and lyases; they typically have a
           TGXXGXXG cofactor binding motif. SDRs are a functionally
           diverse family of oxidoreductases that have a single
           domain with a structurally conserved Rossmann fold, an
           NAD(P)(H)-binding region, and a structurally diverse
           C-terminal region. Sequence identity between different
           SDR enzymes is typically in the 15-30% range; they
           catalyze a wide range of activities including the
           metabolism of steroids, cofactors, carbohydrates,
           lipids, aromatic compounds, and amino acids, and act in
           redox sensing. Classical SDRs have an TGXXX[AG]XG
           cofactor binding motif and a YXXXK active site motif,
           with the Tyr residue of the active site motif serving as
           a critical catalytic residue (Tyr-151, human
           15-hydroxyprostaglandin dehydrogenase numbering). In
           addition to the Tyr and Lys, there is often an upstream
           Ser and/or an Asn, contributing to the active site;
           while substrate binding is in the C-terminal region,
           which determines specificity. The standard reaction
           mechanism is a 4-pro-S hydride transfer and proton relay
           involving the conserved Tyr and Lys, a water molecule
           stabilized by Asn, and nicotinamide. Atypical SDRs
           generally lack the catalytic residues characteristic of
           the SDRs, and their glycine-rich NAD(P)-binding motif is
           often different from the forms normally seen in
           classical or extended SDRs. Complex (multidomain) SDRs
           such as ketoreductase domains of fatty acid synthase
           have a GGXGXXG NAD(P)-binding motif and an altered
           active site motif (YXXXN). Fungal type ketoacyl
           reductases have a TGXXXGX(1-2)G NAD(P)-binding motif.
          Length = 304

 Score = 29.1 bits (66), Expect = 8.0
 Identities = 13/32 (40%), Positives = 17/32 (53%), Gaps = 3/32 (9%)

Query: 381 NSPYNPFNENSLYHPGNPDSPYDWQRLEKERY 412
           + PY P +E+   HP NP SPY   +   E Y
Sbjct: 123 DPPYLPKDED---HPPNPLSPYAVSKYAGELY 151


>gnl|CDD|236831 PRK11052, malQ, 4-alpha-glucanotransferase; Provisional.
          Length = 695

 Score = 29.5 bits (67), Expect = 8.2
 Identities = 14/33 (42%), Positives = 16/33 (48%), Gaps = 11/33 (33%)

Query: 112 LNPRSVFYPSNPEVPSFDYNNPNSPFNPYNPSS 144
           LNP    YP+NPE  S           PY+PSS
Sbjct: 184 LNPIHALYPANPESAS-----------PYSPSS 205


>gnl|CDD|238990 cd02033, BchX, Chlorophyllide reductase converts chlorophylls into
           bacteriochlorophylls by reducing the chlorin B-ring.
           This family contains the X subunit of this three-subunit
           enzyme. Sequence and structure similarity between bchX,
           protochlorophyllide reductase L subunit (bchL and chlL)
           and nitrogenase Fe protein (nifH gene) suggest their
           functional similarity. Members of the BchX family serve
           as the unique electron donors to their respective
           catalytic subunits (bchN-bchB, bchY-bchZ and nitrogenase
           component 1). Mechanistically, they hydrolyze ATP and
           transfer electrons through a Fe4-S4 cluster.
          Length = 329

 Score = 29.2 bits (65), Expect = 8.3
 Identities = 14/44 (31%), Positives = 21/44 (47%), Gaps = 10/44 (22%)

Query: 513 GKGLGGSSAVQNILYQRGTSYDYENFAKLGYNGWGYDETLKYFV 556
           G+G GG          RG  + +E   KLG++ W +D  L  F+
Sbjct: 123 GRGCGG----------RGIIHGFELLEKLGFHDWDFDYVLLDFL 156


>gnl|CDD|183787 PRK12844, PRK12844, 3-ketosteroid-delta-1-dehydrogenase; Reviewed.
          Length = 557

 Score = 29.3 bits (66), Expect = 8.6
 Identities = 15/46 (32%), Positives = 25/46 (54%), Gaps = 3/46 (6%)

Query: 658 VTKVIINDQNVATGVEYVNSKGETVRVTANKEVILTAGAIA-NAQL 702
           +T++I+ D  V  GV  V   G  V + A + V+L +G    NA++
Sbjct: 231 LTELIVEDGRV-VGVVVVRD-GREVLIRARRGVLLASGGFGHNAEM 274


>gnl|CDD|220065 pfam08902, DUF1848, Domain of unknown function (DUF1848).  This
           family of proteins are functionally uncharacterized. The
           C-terminus contains a cluster of cysteines that are
           similar to the iron-sulfur cluster found at the
           N-terminus of pfam04055.
          Length = 264

 Score = 28.7 bits (65), Expect = 8.8
 Identities = 19/56 (33%), Positives = 28/56 (50%), Gaps = 17/56 (30%)

Query: 138 NPYNPSSV----FNPKNIDS-VFNPRNP----------DSPYYNQYQFLYSPLSPY 178
           NPYNP  V      P+++D+ VF  +NP          D+  Y  Y F ++ L+PY
Sbjct: 31  NPYNPKQVSRVSLTPEDVDAIVFWTKNPAPFLPRLDELDARGYPYY-FQFT-LTPY 84


>gnl|CDD|232901 TIGR00275, TIGR00275, flavoprotein, HI0933 family.  The model when
           searched with a partial length search brings in proteins
           with a dinucleotide-binding motif (Rossman fold) over
           the initial 40 residues of the model, including
           oxidoreductases and dehydrogenases. Partially
           characterized members include an FAD-binding protein
           from Bacillus cereus and flavoprotein HI0933 from
           Haemophilus influenzae [Unknown function, Enzymes of
           unknown specificity].
          Length = 400

 Score = 29.1 bits (66), Expect = 8.9
 Identities = 12/31 (38%), Positives = 16/31 (51%), Gaps = 1/31 (3%)

Query: 437 IIVGASAAGCVLANRLSEVSSLKVLLIEAGG 467
           II+G  AAG + A   +    L VLL+E   
Sbjct: 1   IIIGGGAAGLMAAITAAR-EGLSVLLLEKNK 30


>gnl|CDD|234978 PRK01747, mnmC, bifunctional tRNA
           (mnm(5)s(2)U34)-methyltransferase/FAD-dependent
           cmnm(5)s(2)U34 oxidoreductase; Reviewed.
          Length = 662

 Score = 29.4 bits (67), Expect = 8.9
 Identities = 12/31 (38%), Positives = 15/31 (48%), Gaps = 1/31 (3%)

Query: 435 DIIIVGASAAGCVLANRLSEVSSLKVLLIEA 465
           D  I+G   AG  LA  L+     +V L EA
Sbjct: 262 DAAIIGGGIAGAALALALAR-RGWQVTLYEA 291


>gnl|CDD|180419 PRK06134, PRK06134, putative FAD-binding dehydrogenase; Reviewed.
          Length = 581

 Score = 29.3 bits (66), Expect = 9.3
 Identities = 16/60 (26%), Positives = 25/60 (41%), Gaps = 4/60 (6%)

Query: 656 SKVTKVIINDQNVATGVEYVNSKGETVRVTANKEVILTAGAIAN--AQLLLLSGIGPKAH 713
           +   +++  D  VA  V  V + G    + A K V+L AG   +  A+   L    P  H
Sbjct: 238 APARELLREDGRVAGAV--VETPGGLQEIRARKGVVLAAGGFPHDPARRAALFPRAPTGH 295


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.313    0.134    0.402 

Gapped
Lambda     K      H
   0.267   0.0647    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 37,905,742
Number of extensions: 3777310
Number of successful extensions: 3107
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2906
Number of HSP's successfully gapped: 173
Length of query: 720
Length of database: 10,937,602
Length adjustment: 104
Effective length of query: 616
Effective length of database: 6,324,786
Effective search space: 3896068176
Effective search space used: 3896068176
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 63 (28.2 bits)